Query psy4019
Match_columns 109
No_of_seqs 203 out of 1902
Neff 9.4
Searched_HMMs 46136
Date Fri Aug 16 23:19:11 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy4019.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/4019hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG2462|consensus 99.7 3.9E-19 8.5E-24 118.7 1.4 73 2-90 176-248 (279)
2 KOG2462|consensus 99.7 1.2E-18 2.6E-23 116.4 -0.1 65 1-79 201-265 (279)
3 KOG3623|consensus 99.4 5.2E-14 1.1E-18 104.5 0.8 64 1-78 908-971 (1007)
4 KOG3576|consensus 99.2 6E-13 1.3E-17 86.2 -2.0 66 3-82 133-198 (267)
5 PHA02768 hypothetical protein; 99.1 1.2E-10 2.5E-15 60.8 2.7 44 15-74 5-48 (55)
6 KOG3623|consensus 99.0 4.3E-11 9.4E-16 89.3 -0.2 66 11-90 890-955 (1007)
7 KOG1074|consensus 98.9 5E-10 1.1E-14 84.6 2.4 70 2-82 620-693 (958)
8 KOG1074|consensus 98.8 1.7E-09 3.7E-14 81.8 2.3 58 15-86 353-410 (958)
9 KOG3576|consensus 98.8 4.5E-10 9.8E-15 73.0 -0.6 66 13-92 115-180 (267)
10 PF13465 zf-H2C2_2: Zinc-finge 98.8 2.9E-09 6.2E-14 47.7 1.7 25 32-68 1-25 (26)
11 PHA00733 hypothetical protein 98.7 4.1E-09 8.8E-14 64.6 1.6 62 3-81 62-123 (128)
12 PF13465 zf-H2C2_2: Zinc-finge 98.7 2.4E-09 5.3E-14 48.0 -0.3 25 2-28 1-25 (26)
13 PHA00616 hypothetical protein 98.4 1.3E-07 2.8E-12 47.2 1.3 33 15-49 1-33 (44)
14 PHA00732 hypothetical protein 98.2 2E-06 4.4E-11 48.4 3.1 46 15-80 1-47 (79)
15 KOG3608|consensus 98.1 8.5E-07 1.9E-11 62.1 1.1 69 3-88 253-322 (467)
16 KOG3608|consensus 98.0 8.5E-07 1.8E-11 62.1 -1.0 46 2-50 278-324 (467)
17 COG5189 SFP1 Putative transcri 98.0 4.8E-06 1E-10 57.7 2.4 55 12-78 346-419 (423)
18 PLN03086 PRLI-interacting fact 97.6 3.3E-05 7.1E-10 57.6 2.3 42 2-49 467-508 (567)
19 PF00096 zf-C2H2: Zinc finger, 97.5 4.9E-05 1.1E-09 32.5 1.2 23 16-40 1-23 (23)
20 KOG3993|consensus 97.5 9.6E-06 2.1E-10 58.0 -1.8 66 3-82 283-381 (500)
21 PLN03086 PRLI-interacting fact 97.5 5.6E-05 1.2E-09 56.4 2.1 59 13-90 451-509 (567)
22 PHA00616 hypothetical protein 97.5 5.2E-06 1.1E-10 41.4 -2.5 32 45-88 1-32 (44)
23 PHA02768 hypothetical protein; 97.5 1.7E-05 3.7E-10 41.5 -0.7 31 45-89 5-35 (55)
24 PF13912 zf-C2H2_6: C2H2-type 97.2 0.00018 3.9E-09 31.9 1.0 25 15-41 1-25 (27)
25 PHA00733 hypothetical protein 97.2 0.00016 3.5E-09 44.4 0.9 62 13-90 38-104 (128)
26 PF13894 zf-C2H2_4: C2H2-type 96.9 0.00076 1.6E-08 28.6 1.7 23 16-40 1-23 (24)
27 PF05605 zf-Di19: Drought indu 96.8 0.00085 1.8E-08 34.9 1.6 50 15-81 2-53 (54)
28 PF09237 GAGA: GAGA factor; I 96.7 0.0017 3.6E-08 33.3 2.4 34 10-45 19-52 (54)
29 PF00096 zf-C2H2: Zinc finger, 96.7 3.7E-05 8E-10 32.9 -3.5 18 62-79 5-22 (23)
30 smart00355 ZnF_C2H2 zinc finge 96.6 0.0025 5.4E-08 27.2 2.4 24 16-41 1-24 (26)
31 PRK04860 hypothetical protein; 96.6 0.0019 4.2E-08 41.1 2.4 38 15-70 119-156 (160)
32 PF13912 zf-C2H2_6: C2H2-type 96.2 0.00025 5.3E-09 31.5 -2.5 25 45-81 1-25 (27)
33 PF12874 zf-met: Zinc-finger o 94.8 0.015 3.3E-07 25.0 0.9 23 16-40 1-23 (25)
34 COG5048 FOG: Zn-finger [Genera 94.5 0.031 6.6E-07 39.6 2.3 63 14-88 288-354 (467)
35 PF13909 zf-H2C2_5: C2H2-type 93.7 0.053 1.2E-06 23.0 1.5 23 16-41 1-23 (24)
36 PF09237 GAGA: GAGA factor; I 93.6 0.0038 8.2E-08 32.0 -2.6 34 41-86 20-53 (54)
37 KOG3993|consensus 93.4 0.014 2.9E-07 42.4 -1.1 55 15-83 267-321 (500)
38 PHA00732 hypothetical protein 93.2 0.021 4.5E-07 32.1 -0.3 25 45-81 1-26 (79)
39 PF12171 zf-C2H2_jaz: Zinc-fin 93.2 0.026 5.5E-07 24.9 0.0 22 16-39 2-23 (27)
40 COG5048 FOG: Zn-finger [Genera 92.7 0.12 2.6E-06 36.5 2.8 46 2-49 304-355 (467)
41 PF12756 zf-C2H2_2: C2H2 type 91.0 0.018 3.8E-07 32.9 -2.5 19 62-80 55-73 (100)
42 PF13913 zf-C2HC_2: zinc-finge 90.7 0.25 5.5E-06 21.4 1.8 21 16-39 3-23 (25)
43 smart00451 ZnF_U1 U1-like zinc 90.1 0.3 6.5E-06 22.5 1.9 23 15-39 3-25 (35)
44 KOG2893|consensus 83.7 0.76 1.6E-05 31.3 1.6 29 14-49 10-38 (341)
45 COG5189 SFP1 Putative transcri 83.2 0.68 1.5E-05 32.8 1.3 30 43-82 347-376 (423)
46 PF12756 zf-C2H2_2: C2H2 type 82.7 1.1 2.3E-05 25.4 1.8 24 15-40 50-73 (100)
47 KOG2186|consensus 76.6 1.4 3.1E-05 30.2 1.1 51 16-83 4-54 (276)
48 COG4049 Uncharacterized protei 72.6 1.7 3.8E-05 22.7 0.6 30 8-39 10-39 (65)
49 KOG1146|consensus 64.3 1.6 3.6E-05 36.3 -0.7 55 10-78 460-539 (1406)
50 cd00350 rubredoxin_like Rubred 64.0 5.6 0.00012 18.2 1.4 20 16-49 2-21 (33)
51 KOG4124|consensus 56.4 4.2 9.1E-05 29.3 0.3 53 13-77 347-418 (442)
52 COG2888 Predicted Zn-ribbon RN 55.7 13 0.00027 19.8 1.9 29 14-49 26-54 (61)
53 PF04959 ARS2: Arsenite-resist 52.4 5 0.00011 26.9 0.1 28 12-41 74-101 (214)
54 PF13719 zinc_ribbon_5: zinc-r 50.1 13 0.00028 17.5 1.3 32 17-67 4-35 (37)
55 PRK00464 nrdR transcriptional 48.3 4.6 0.0001 25.6 -0.5 17 45-73 28-44 (154)
56 PF12013 DUF3505: Protein of u 46.2 19 0.00041 21.1 1.9 24 16-41 81-108 (109)
57 COG4957 Predicted transcriptio 44.8 4.8 0.0001 25.0 -0.7 22 62-86 81-102 (148)
58 COG3357 Predicted transcriptio 44.8 11 0.00023 21.9 0.7 26 14-51 57-82 (97)
59 TIGR00373 conserved hypothetic 44.7 16 0.00035 23.1 1.6 28 11-49 105-132 (158)
60 KOG2231|consensus 44.7 10 0.00022 29.7 0.7 21 17-39 184-204 (669)
61 KOG0978|consensus 42.7 9.3 0.0002 30.1 0.3 23 43-77 676-698 (698)
62 KOG3408|consensus 41.5 15 0.00032 22.5 1.0 24 14-39 56-79 (129)
63 smart00614 ZnF_BED BED zinc fi 41.3 25 0.00055 17.5 1.7 21 16-38 19-44 (50)
64 PF13717 zinc_ribbon_4: zinc-r 40.7 22 0.00048 16.6 1.3 32 17-67 4-35 (36)
65 KOG2593|consensus 40.1 23 0.00049 26.3 1.9 41 10-55 123-163 (436)
66 PF09538 FYDLN_acid: Protein o 40.0 23 0.0005 21.1 1.7 30 16-70 10-39 (108)
67 TIGR02098 MJ0042_CXXC MJ0042 f 39.6 18 0.00039 16.8 1.0 33 16-67 3-35 (38)
68 PF02892 zf-BED: BED zinc fing 38.8 26 0.00056 16.8 1.5 25 12-38 13-41 (45)
69 KOG4167|consensus 38.6 1.9 4.1E-05 33.9 -3.8 22 62-83 797-818 (907)
70 PF15269 zf-C2H2_7: Zinc-finge 37.8 28 0.00061 17.4 1.5 22 16-39 21-42 (54)
71 smart00531 TFIIE Transcription 37.6 27 0.00058 21.7 1.8 32 12-49 96-127 (147)
72 PRK06266 transcription initiat 36.0 26 0.00057 22.7 1.6 26 13-49 115-140 (178)
73 KOG4167|consensus 35.9 5.8 0.00012 31.4 -1.7 25 15-41 792-816 (907)
74 PRK09678 DNA-binding transcrip 35.6 21 0.00045 19.7 0.9 40 16-69 2-41 (72)
75 smart00734 ZnF_Rad18 Rad18-lik 33.9 45 0.00098 14.3 1.7 19 17-38 3-21 (26)
76 PRK14890 putative Zn-ribbon RN 33.2 38 0.00083 17.9 1.6 29 14-49 24-52 (59)
77 KOG4173|consensus 32.5 5.2 0.00011 26.8 -2.1 48 17-76 108-165 (253)
78 COG1592 Rubrerythrin [Energy p 32.2 38 0.00081 21.9 1.8 20 15-49 134-153 (166)
79 KOG2461|consensus 31.4 69 0.0015 23.6 3.2 68 6-87 322-389 (396)
80 COG1997 RPL43A Ribosomal prote 30.9 24 0.00052 20.3 0.7 33 14-70 34-66 (89)
81 smart00834 CxxC_CXXC_SSSS Puta 30.8 22 0.00048 16.5 0.5 7 62-68 10-16 (41)
82 PF05443 ROS_MUCR: ROS/MUCR tr 30.6 22 0.00048 22.0 0.6 25 16-45 73-97 (132)
83 cd00729 rubredoxin_SM Rubredox 29.1 42 0.00091 15.4 1.2 11 15-27 2-12 (34)
84 PF09723 Zn-ribbon_8: Zinc rib 27.8 25 0.00054 16.9 0.3 7 62-68 10-16 (42)
85 TIGR02605 CxxC_CxxC_SSSS putat 27.1 18 0.00038 18.1 -0.3 14 15-30 5-18 (52)
86 PF10571 UPF0547: Uncharacteri 25.4 44 0.00095 14.4 0.9 11 16-28 15-25 (26)
87 smart00154 ZnF_AN1 AN1-like Zi 25.2 41 0.00088 16.0 0.8 13 45-69 12-24 (39)
88 smart00659 RPOLCX RNA polymera 25.2 58 0.0013 16.0 1.4 12 16-29 3-14 (44)
89 KOG1146|consensus 24.5 18 0.00038 30.7 -0.8 56 13-83 1282-1354(1406)
90 PF07754 DUF1610: Domain of un 23.4 43 0.00092 14.3 0.6 6 44-49 15-20 (24)
91 PF09845 DUF2072: Zn-ribbon co 22.6 45 0.00097 20.7 0.8 15 15-31 1-15 (131)
92 PF08790 zf-LYAR: LYAR-type C2 22.0 12 0.00027 16.7 -1.2 18 16-36 1-18 (28)
93 KOG2482|consensus 21.7 61 0.0013 23.6 1.4 23 15-39 195-217 (423)
94 TIGR02300 FYDLN_acid conserved 21.7 77 0.0017 19.6 1.7 30 16-70 10-39 (129)
95 PF07975 C1_4: TFIIH C1-like d 21.2 17 0.00037 18.6 -1.0 25 14-40 20-44 (51)
96 KOG2636|consensus 20.6 78 0.0017 23.9 1.8 28 9-38 395-423 (497)
No 1
>KOG2462|consensus
Probab=99.74 E-value=3.9e-19 Score=118.73 Aligned_cols=73 Identities=22% Similarity=0.328 Sum_probs=58.8
Q ss_pred chhhhccccCCCCCcccccCCCCCcCCChhHHHHHhhhhcCCCCcccCCCCCCCCCCCCCCCCCcCCCCCCccccccccc
Q psy4019 2 KLNLAYPLLSSQKPYVCGEENCTKRYTDPSSLRKHIKTVHGEHCYANKKHKGMPSKDKDGLGGASPSRSDDMPLSAKTAS 81 (109)
Q Consensus 2 ~l~~H~~~h~~~~p~~C~~~~C~k~f~~~~~l~~H~~~h~~~kp~~C~~~~~~~~~~~~~~C~~~f~~~~~l~~H~~~~~ 81 (109)
.|+.|+|+|+ -++.|.+ |||.|.+..-|+-|+|+|+|||||.|. .|+|+|..+++|+.|++||+
T Consensus 176 ALkMHirTH~--l~c~C~i--CGKaFSRPWLLQGHiRTHTGEKPF~C~------------hC~kAFADRSNLRAHmQTHS 239 (279)
T KOG2462|consen 176 ALKMHIRTHT--LPCECGI--CGKAFSRPWLLQGHIRTHTGEKPFSCP------------HCGKAFADRSNLRAHMQTHS 239 (279)
T ss_pred HHhhHhhccC--CCccccc--ccccccchHHhhcccccccCCCCccCC------------cccchhcchHHHHHHHHhhc
Confidence 4677888776 5778888 888888888888888888888888888 88888888888888888888
Q ss_pred cCCCCCCCC
Q psy4019 82 VSSPSIKSE 90 (109)
Q Consensus 82 ~~~p~~~~~ 90 (109)
..+.|.+..
T Consensus 240 ~~K~~qC~~ 248 (279)
T KOG2462|consen 240 DVKKHQCPR 248 (279)
T ss_pred CCccccCcc
Confidence 887776643
No 2
>KOG2462|consensus
Probab=99.70 E-value=1.2e-18 Score=116.40 Aligned_cols=65 Identities=22% Similarity=0.290 Sum_probs=61.7
Q ss_pred CchhhhccccCCCCCcccccCCCCCcCCChhHHHHHhhhhcCCCCcccCCCCCCCCCCCCCCCCCcCCCCCCccccccc
Q psy4019 1 MKLNLAYPLLSSQKPYVCGEENCTKRYTDPSSLRKHIKTVHGEHCYANKKHKGMPSKDKDGLGGASPSRSDDMPLSAKT 79 (109)
Q Consensus 1 ~~l~~H~~~h~~~~p~~C~~~~C~k~f~~~~~l~~H~~~h~~~kp~~C~~~~~~~~~~~~~~C~~~f~~~~~l~~H~~~ 79 (109)
|-|+.|+|+|+||+||.|.. |+|+|...++|+.|+++|.+.|+|+|. .|+|.|++.+.|.+|...
T Consensus 201 WLLQGHiRTHTGEKPF~C~h--C~kAFADRSNLRAHmQTHS~~K~~qC~------------~C~KsFsl~SyLnKH~ES 265 (279)
T KOG2462|consen 201 WLLQGHIRTHTGEKPFSCPH--CGKAFADRSNLRAHMQTHSDVKKHQCP------------RCGKSFALKSYLNKHSES 265 (279)
T ss_pred HHhhcccccccCCCCccCCc--ccchhcchHHHHHHHHhhcCCccccCc------------chhhHHHHHHHHHHhhhh
Confidence 35889999999999999999 999999999999999999999999999 999999999999999863
No 3
>KOG3623|consensus
Probab=99.40 E-value=5.2e-14 Score=104.45 Aligned_cols=64 Identities=22% Similarity=0.237 Sum_probs=60.3
Q ss_pred CchhhhccccCCCCCcccccCCCCCcCCChhHHHHHhhhhcCCCCcccCCCCCCCCCCCCCCCCCcCCCCCCcccccc
Q psy4019 1 MKLNLAYPLLSSQKPYVCGEENCTKRYTDPSSLRKHIKTVHGEHCYANKKHKGMPSKDKDGLGGASPSRSDDMPLSAK 78 (109)
Q Consensus 1 ~~l~~H~~~h~~~~p~~C~~~~C~k~f~~~~~l~~H~~~h~~~kp~~C~~~~~~~~~~~~~~C~~~f~~~~~l~~H~~ 78 (109)
+.|.+|.--|+|.+||.|.+ |.|+|..+..|..|+|+|.|+|||+|. -|+|.|+++..+..||.
T Consensus 908 SSLaRHKYEHsGqRPyqC~i--CkKAFKHKHHLtEHkRLHSGEKPfQCd------------KClKRFSHSGSYSQHMN 971 (1007)
T KOG3623|consen 908 SSLARHKYEHSGQRPYQCII--CKKAFKHKHHLTEHKRLHSGEKPFQCD------------KCLKRFSHSGSYSQHMN 971 (1007)
T ss_pred HHHHHhhhhhcCCCCcccch--hhHhhhhhhhhhhhhhhccCCCcchhh------------hhhhhcccccchHhhhc
Confidence 46888999999999999999 999999999999999999999999999 89999999999988885
No 4
>KOG3576|consensus
Probab=99.22 E-value=6e-13 Score=86.20 Aligned_cols=66 Identities=23% Similarity=0.272 Sum_probs=56.5
Q ss_pred hhhhccccCCCCCcccccCCCCCcCCChhHHHHHhhhhcCCCCcccCCCCCCCCCCCCCCCCCcCCCCCCcccccccccc
Q psy4019 3 LNLAYPLLSSQKPYVCGEENCTKRYTDPSSLRKHIKTVHGEHCYANKKHKGMPSKDKDGLGGASPSRSDDMPLSAKTASV 82 (109)
Q Consensus 3 l~~H~~~h~~~~p~~C~~~~C~k~f~~~~~l~~H~~~h~~~kp~~C~~~~~~~~~~~~~~C~~~f~~~~~l~~H~~~~~~ 82 (109)
|.+|++-|...+.|.|.. ||++|...-+|++|.|+|+|.+||+|. .|+++|.++-.|..|.+..++
T Consensus 133 lnrh~kch~~vkr~lct~--cgkgfndtfdlkrh~rthtgvrpykc~------------~c~kaftqrcsleshl~kvhg 198 (267)
T KOG3576|consen 133 LNRHLKCHSDVKRHLCTF--CGKGFNDTFDLKRHTRTHTGVRPYKCS------------LCEKAFTQRCSLESHLKKVHG 198 (267)
T ss_pred HHHHhhhccHHHHHHHhh--ccCcccchhhhhhhhccccCccccchh------------hhhHHHHhhccHHHHHHHHcC
Confidence 577888888888899998 999999999999999999999999999 889999888888877765443
No 5
>PHA02768 hypothetical protein; Provisional
Probab=99.07 E-value=1.2e-10 Score=60.80 Aligned_cols=44 Identities=16% Similarity=0.242 Sum_probs=40.5
Q ss_pred CcccccCCCCCcCCChhHHHHHhhhhcCCCCcccCCCCCCCCCCCCCCCCCcCCCCCCcc
Q psy4019 15 PYVCGEENCTKRYTDPSSLRKHIKTVHGEHCYANKKHKGMPSKDKDGLGGASPSRSDDMP 74 (109)
Q Consensus 15 p~~C~~~~C~k~f~~~~~l~~H~~~h~~~kp~~C~~~~~~~~~~~~~~C~~~f~~~~~l~ 74 (109)
-|.|+. ||+.|....+|..|+++|+ ++|+|. .|++.|.+.+.|.
T Consensus 5 ~y~C~~--CGK~Fs~~~~L~~H~r~H~--k~~kc~------------~C~k~f~~~s~l~ 48 (55)
T PHA02768 5 GYECPI--CGEIYIKRKSMITHLRKHN--TNLKLS------------NCKRISLRTGEYI 48 (55)
T ss_pred ccCcch--hCCeeccHHHHHHHHHhcC--CcccCC------------cccceecccceeE
Confidence 378999 9999999999999999999 799999 9999999888775
No 6
>KOG3623|consensus
Probab=99.02 E-value=4.3e-11 Score=89.27 Aligned_cols=66 Identities=18% Similarity=0.297 Sum_probs=61.6
Q ss_pred CCCCCcccccCCCCCcCCChhHHHHHhhhhcCCCCcccCCCCCCCCCCCCCCCCCcCCCCCCccccccccccCCCCCCCC
Q psy4019 11 SSQKPYVCGEENCTKRYTDPSSLRKHIKTVHGEHCYANKKHKGMPSKDKDGLGGASPSRSDDMPLSAKTASVSSPSIKSE 90 (109)
Q Consensus 11 ~~~~p~~C~~~~C~k~f~~~~~l~~H~~~h~~~kp~~C~~~~~~~~~~~~~~C~~~f~~~~~l~~H~~~~~~~~p~~~~~ 90 (109)
+.+..|.|+. |+|+|...+.|.+|+--|+|.+||+|. +|.|+|.+..+|..|++.|++.+||.|..
T Consensus 890 te~gmyaCDq--CDK~FqKqSSLaRHKYEHsGqRPyqC~------------iCkKAFKHKHHLtEHkRLHSGEKPfQCdK 955 (1007)
T KOG3623|consen 890 TEDGMYACDQ--CDKAFQKQSSLARHKYEHSGQRPYQCI------------ICKKAFKHKHHLTEHKRLHSGEKPFQCDK 955 (1007)
T ss_pred CccccchHHH--HHHHHHhhHHHHHhhhhhcCCCCcccc------------hhhHhhhhhhhhhhhhhhccCCCcchhhh
Confidence 3456899999 999999999999999999999999999 99999999999999999999999998754
No 7
>KOG1074|consensus
Probab=98.92 E-value=5e-10 Score=84.59 Aligned_cols=70 Identities=13% Similarity=0.311 Sum_probs=48.7
Q ss_pred chhhhccccCCCCCcccccCCCCCcCCChhHHHHHhhhhcCCCC----cccCCCCCCCCCCCCCCCCCcCCCCCCccccc
Q psy4019 2 KLNLAYPLLSSQKPYVCGEENCTKRYTDPSSLRKHIKTVHGEHC----YANKKHKGMPSKDKDGLGGASPSRSDDMPLSA 77 (109)
Q Consensus 2 ~l~~H~~~h~~~~p~~C~~~~C~k~f~~~~~l~~H~~~h~~~kp----~~C~~~~~~~~~~~~~~C~~~f~~~~~l~~H~ 77 (109)
.|+.|.|+|+||+||+|.+ ||++|..+.+|+.|+-+|....+ |.|+. .-.|-+.|...-.|..|+
T Consensus 620 aLqmHyrtHtGERPFkCKi--CgRAFtTkGNLkaH~~vHka~p~~R~q~ScP~---------~~ic~~kftn~V~lpQhI 688 (958)
T KOG1074|consen 620 ALQMHYRTHTGERPFKCKI--CGRAFTTKGNLKAHMSVHKAKPPARVQFSCPS---------TFICQKKFTNAVTLPQHI 688 (958)
T ss_pred hhhhhhhcccCcCcccccc--ccchhccccchhhcccccccCccccccccCCc---------hhhhcccccccccccceE
Confidence 5777788888888888877 88888888888887777765433 44441 014667777777777777
Q ss_pred ccccc
Q psy4019 78 KTASV 82 (109)
Q Consensus 78 ~~~~~ 82 (109)
++|.+
T Consensus 689 riH~~ 693 (958)
T KOG1074|consen 689 RIHLG 693 (958)
T ss_pred EeecC
Confidence 77764
No 8
>KOG1074|consensus
Probab=98.83 E-value=1.7e-09 Score=81.81 Aligned_cols=58 Identities=19% Similarity=0.347 Sum_probs=53.6
Q ss_pred CcccccCCCCCcCCChhHHHHHhhhhcCCCCcccCCCCCCCCCCCCCCCCCcCCCCCCccccccccccCCCC
Q psy4019 15 PYVCGEENCTKRYTDPSSLRKHIKTVHGEHCYANKKHKGMPSKDKDGLGGASPSRSDDMPLSAKTASVSSPS 86 (109)
Q Consensus 15 p~~C~~~~C~k~f~~~~~l~~H~~~h~~~kp~~C~~~~~~~~~~~~~~C~~~f~~~~~l~~H~~~~~~~~p~ 86 (109)
+++|.. |.|.|...+.|+.|.|.|+|++||+|+ +||..|+.+.+|++|...|....|.
T Consensus 353 khkCr~--CakvfgS~SaLqiHlRSHTGERPfqCn------------vCG~~FSTkGNLKvH~~rH~e~~p~ 410 (958)
T KOG1074|consen 353 KHKCRF--CAKVFGSDSALQIHLRSHTGERPFQCN------------VCGNRFSTKGNLKVHFQRHREKYPH 410 (958)
T ss_pred cchhhh--hHhhcCchhhhhhhhhccCCCCCeeec------------ccccccccccceeeeeeeccccCCc
Confidence 467999 999999999999999999999999999 9999999999999999988776644
No 9
>KOG3576|consensus
Probab=98.83 E-value=4.5e-10 Score=72.97 Aligned_cols=66 Identities=18% Similarity=0.239 Sum_probs=60.6
Q ss_pred CCCcccccCCCCCcCCChhHHHHHhhhhcCCCCcccCCCCCCCCCCCCCCCCCcCCCCCCccccccccccCCCCCCCCCC
Q psy4019 13 QKPYVCGEENCTKRYTDPSSLRKHIKTVHGEHCYANKKHKGMPSKDKDGLGGASPSRSDDMPLSAKTASVSSPSIKSEVS 92 (109)
Q Consensus 13 ~~p~~C~~~~C~k~f~~~~~l~~H~~~h~~~kp~~C~~~~~~~~~~~~~~C~~~f~~~~~l~~H~~~~~~~~p~~~~~~~ 92 (109)
...|.|.+ |+|+|....-|.+|++.|...|.|-|. .||+.|...-+|++|+++|++..||-++.-.
T Consensus 115 ~d~ftCrv--CgK~F~lQRmlnrh~kch~~vkr~lct------------~cgkgfndtfdlkrh~rthtgvrpykc~~c~ 180 (267)
T KOG3576|consen 115 QDSFTCRV--CGKKFGLQRMLNRHLKCHSDVKRHLCT------------FCGKGFNDTFDLKRHTRTHTGVRPYKCSLCE 180 (267)
T ss_pred CCeeeeeh--hhhhhhHHHHHHHHhhhccHHHHHHHh------------hccCcccchhhhhhhhccccCccccchhhhh
Confidence 34589999 999999999999999999999999999 9999999999999999999999999876543
No 10
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.80 E-value=2.9e-09 Score=47.74 Aligned_cols=25 Identities=40% Similarity=0.550 Sum_probs=22.9
Q ss_pred HHHHHhhhhcCCCCcccCCCCCCCCCCCCCCCCCcCC
Q psy4019 32 SLRKHIKTVHGEHCYANKKHKGMPSKDKDGLGGASPS 68 (109)
Q Consensus 32 ~l~~H~~~h~~~kp~~C~~~~~~~~~~~~~~C~~~f~ 68 (109)
+|.+|+++|++++||.|+ .|++.|.
T Consensus 1 ~l~~H~~~H~~~k~~~C~------------~C~k~F~ 25 (26)
T PF13465_consen 1 NLRRHMRTHTGEKPYKCP------------YCGKSFS 25 (26)
T ss_dssp HHHHHHHHHSSSSSEEES------------SSSEEES
T ss_pred CHHHHhhhcCCCCCCCCC------------CCcCeeC
Confidence 588999999999999999 9999885
No 11
>PHA00733 hypothetical protein
Probab=98.74 E-value=4.1e-09 Score=64.64 Aligned_cols=62 Identities=23% Similarity=0.327 Sum_probs=51.5
Q ss_pred hhhhccccCCCCCcccccCCCCCcCCChhHHHHHhhhhcCCCCcccCCCCCCCCCCCCCCCCCcCCCCCCccccccccc
Q psy4019 3 LNLAYPLLSSQKPYVCGEENCTKRYTDPSSLRKHIKTVHGEHCYANKKHKGMPSKDKDGLGGASPSRSDDMPLSAKTAS 81 (109)
Q Consensus 3 l~~H~~~h~~~~p~~C~~~~C~k~f~~~~~l~~H~~~h~~~kp~~C~~~~~~~~~~~~~~C~~~f~~~~~l~~H~~~~~ 81 (109)
|..|+..+ +.+||.|.. |++.|.....|..|++.+ +.+|.|. .|++.|.....|..|+...+
T Consensus 62 l~~~~~~~-~~kPy~C~~--Cgk~Fss~s~L~~H~r~h--~~~~~C~------------~CgK~F~~~~sL~~H~~~~h 123 (128)
T PHA00733 62 LYKLLTSK-AVSPYVCPL--CLMPFSSSVSLKQHIRYT--EHSKVCP------------VCGKEFRNTDSTLDHVCKKH 123 (128)
T ss_pred HHhhcccC-CCCCccCCC--CCCcCCCHHHHHHHHhcC--CcCccCC------------CCCCccCCHHHHHHHHHHhc
Confidence 44555444 478999999 999999999999999876 4579999 99999999999988876544
No 12
>PF13465 zf-H2C2_2: Zinc-finger double domain; PDB: 2EN7_A 1TF6_A 1TF3_A 2ELT_A 2EOS_A 2EN2_A 2DMD_A 2WBS_A 2WBU_A 2EM5_A ....
Probab=98.69 E-value=2.4e-09 Score=47.96 Aligned_cols=25 Identities=28% Similarity=0.607 Sum_probs=23.2
Q ss_pred chhhhccccCCCCCcccccCCCCCcCC
Q psy4019 2 KLNLAYPLLSSQKPYVCGEENCTKRYT 28 (109)
Q Consensus 2 ~l~~H~~~h~~~~p~~C~~~~C~k~f~ 28 (109)
+|..|+++|++++||.|+. |++.|.
T Consensus 1 ~l~~H~~~H~~~k~~~C~~--C~k~F~ 25 (26)
T PF13465_consen 1 NLRRHMRTHTGEKPYKCPY--CGKSFS 25 (26)
T ss_dssp HHHHHHHHHSSSSSEEESS--SSEEES
T ss_pred CHHHHhhhcCCCCCCCCCC--CcCeeC
Confidence 5889999999999999999 999885
No 13
>PHA00616 hypothetical protein
Probab=98.39 E-value=1.3e-07 Score=47.15 Aligned_cols=33 Identities=18% Similarity=0.399 Sum_probs=31.4
Q ss_pred CcccccCCCCCcCCChhHHHHHhhhhcCCCCcccC
Q psy4019 15 PYVCGEENCTKRYTDPSSLRKHIKTVHGEHCYANK 49 (109)
Q Consensus 15 p~~C~~~~C~k~f~~~~~l~~H~~~h~~~kp~~C~ 49 (109)
||.|.. ||+.|.....|.+|++.|++++++.|+
T Consensus 1 pYqC~~--CG~~F~~~s~l~~H~r~~hg~~~~~~~ 33 (44)
T PHA00616 1 MYQCLR--CGGIFRKKKEVIEHLLSVHKQNKLTLE 33 (44)
T ss_pred CCccch--hhHHHhhHHHHHHHHHHhcCCCcccee
Confidence 689999 999999999999999999999999986
No 14
>PHA00732 hypothetical protein
Probab=98.17 E-value=2e-06 Score=48.45 Aligned_cols=46 Identities=11% Similarity=0.068 Sum_probs=37.7
Q ss_pred CcccccCCCCCcCCChhHHHHHhhh-hcCCCCcccCCCCCCCCCCCCCCCCCcCCCCCCcccccccc
Q psy4019 15 PYVCGEENCTKRYTDPSSLRKHIKT-VHGEHCYANKKHKGMPSKDKDGLGGASPSRSDDMPLSAKTA 80 (109)
Q Consensus 15 p~~C~~~~C~k~f~~~~~l~~H~~~-h~~~kp~~C~~~~~~~~~~~~~~C~~~f~~~~~l~~H~~~~ 80 (109)
||.|.. |++.|.....|..|++. |.+ +.|+ .|++.|. .+..|.++.
T Consensus 1 py~C~~--Cgk~F~s~s~Lk~H~r~~H~~---~~C~------------~CgKsF~---~l~~H~~~~ 47 (79)
T PHA00732 1 MFKCPI--CGFTTVTLFALKQHARRNHTL---TKCP------------VCNKSYR---RLNQHFYSQ 47 (79)
T ss_pred CccCCC--CCCccCCHHHHHHHhhcccCC---CccC------------CCCCEeC---Chhhhhccc
Confidence 588999 99999999999999984 653 6899 9999996 566676543
No 15
>KOG3608|consensus
Probab=98.13 E-value=8.5e-07 Score=62.12 Aligned_cols=69 Identities=17% Similarity=0.169 Sum_probs=46.5
Q ss_pred hhhhccccCCCCCcccccCCCCCcCCChhHHHHHhhh-hcCCCCcccCCCCCCCCCCCCCCCCCcCCCCCCccccccccc
Q psy4019 3 LNLAYPLLSSQKPYVCGEENCTKRYTDPSSLRKHIKT-VHGEHCYANKKHKGMPSKDKDGLGGASPSRSDDMPLSAKTAS 81 (109)
Q Consensus 3 l~~H~~~h~~~~p~~C~~~~C~k~f~~~~~l~~H~~~-h~~~kp~~C~~~~~~~~~~~~~~C~~~f~~~~~l~~H~~~~~ 81 (109)
|..|++.|. .-|+|+. |.-+....+.|..|++. |...|||+|. .|++.|.+.++|.+|..+|+
T Consensus 253 L~~Hv~rHv--n~ykCpl--Cdmtc~~~ssL~~H~r~rHs~dkpfKCd------------~Cd~~c~~esdL~kH~~~HS 316 (467)
T KOG3608|consen 253 LKSHVVRHV--NCYKCPL--CDMTCSSASSLTTHIRYRHSKDKPFKCD------------ECDTRCVRESDLAKHVQVHS 316 (467)
T ss_pred HHHHHHHhh--hcccccc--cccCCCChHHHHHHHHhhhccCCCcccc------------chhhhhccHHHHHHHHHhcc
Confidence 445555554 3567777 77777777777777774 6667777777 77777777777777777666
Q ss_pred cCCCCCC
Q psy4019 82 VSSPSIK 88 (109)
Q Consensus 82 ~~~p~~~ 88 (109)
..-|.+
T Consensus 317 -~~~y~C 322 (467)
T KOG3608|consen 317 -KTVYQC 322 (467)
T ss_pred -ccceec
Confidence 334443
No 16
>KOG3608|consensus
Probab=97.99 E-value=8.5e-07 Score=62.12 Aligned_cols=46 Identities=28% Similarity=0.528 Sum_probs=37.4
Q ss_pred chhhhccc-cCCCCCcccccCCCCCcCCChhHHHHHhhhhcCCCCcccCC
Q psy4019 2 KLNLAYPL-LSSQKPYVCGEENCTKRYTDPSSLRKHIKTVHGEHCYANKK 50 (109)
Q Consensus 2 ~l~~H~~~-h~~~~p~~C~~~~C~k~f~~~~~l~~H~~~h~~~kp~~C~~ 50 (109)
.|.+|++. |..++||+|.. |.+.+.....|.+|...|. +..|.|+.
T Consensus 278 sL~~H~r~rHs~dkpfKCd~--Cd~~c~~esdL~kH~~~HS-~~~y~C~h 324 (467)
T KOG3608|consen 278 SLTTHIRYRHSKDKPFKCDE--CDTRCVRESDLAKHVQVHS-KTVYQCEH 324 (467)
T ss_pred HHHHHHHhhhccCCCccccc--hhhhhccHHHHHHHHHhcc-ccceecCC
Confidence 57788875 88889999999 9999999999999987776 45577764
No 17
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=97.97 E-value=4.8e-06 Score=57.72 Aligned_cols=55 Identities=22% Similarity=0.440 Sum_probs=46.2
Q ss_pred CCCCcccccCCCCCcCCChhHHHHHhhh-h------------------cCCCCcccCCCCCCCCCCCCCCCCCcCCCCCC
Q psy4019 12 SQKPYVCGEENCTKRYTDPSSLRKHIKT-V------------------HGEHCYANKKHKGMPSKDKDGLGGASPSRSDD 72 (109)
Q Consensus 12 ~~~p~~C~~~~C~k~f~~~~~l~~H~~~-h------------------~~~kp~~C~~~~~~~~~~~~~~C~~~f~~~~~ 72 (109)
+++||+|++++|.|++...-.|+-|+.- | ...|||.|+ .|+|.+.....
T Consensus 346 d~KpykCpV~gC~K~YknqnGLKYH~lhGH~~~~~~~~p~p~~~~~F~~~~KPYrCe------------vC~KRYKNlNG 413 (423)
T COG5189 346 DGKPYKCPVEGCNKKYKNQNGLKYHMLHGHQNQKLHENPSPEKMNIFSAKDKPYRCE------------VCDKRYKNLNG 413 (423)
T ss_pred cCceecCCCCCchhhhccccchhhhhhccccCcccCCCCCccccccccccCCceecc------------ccchhhccCcc
Confidence 4599999999999999999999988642 2 124899999 99999998888
Q ss_pred cccccc
Q psy4019 73 MPLSAK 78 (109)
Q Consensus 73 l~~H~~ 78 (109)
|+.|++
T Consensus 414 LKYHr~ 419 (423)
T COG5189 414 LKYHRK 419 (423)
T ss_pred ceeccc
Confidence 988875
No 18
>PLN03086 PRLI-interacting factor K; Provisional
Probab=97.62 E-value=3.3e-05 Score=57.62 Aligned_cols=42 Identities=10% Similarity=0.263 Sum_probs=27.9
Q ss_pred chhhhccccCCCCCcccccCCCCCcCCChhHHHHHhhhhcCCCCcccC
Q psy4019 2 KLNLAYPLLSSQKPYVCGEENCTKRYTDPSSLRKHIKTVHGEHCYANK 49 (109)
Q Consensus 2 ~l~~H~~~h~~~~p~~C~~~~C~k~f~~~~~l~~H~~~h~~~kp~~C~ 49 (109)
.|..|+.+|+ +++.|. |++.+ ....|..|+..|...+++.|.
T Consensus 467 ~LekH~~~~H--kpv~Cp---Cg~~~-~R~~L~~H~~thCp~Kpi~C~ 508 (567)
T PLN03086 467 EMEKHMKVFH--EPLQCP---CGVVL-EKEQMVQHQASTCPLRLITCR 508 (567)
T ss_pred HHHHHHHhcC--CCccCC---CCCCc-chhHHHhhhhccCCCCceeCC
Confidence 3556766653 567774 77544 456777777777777777777
No 19
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=97.52 E-value=4.9e-05 Score=32.52 Aligned_cols=23 Identities=30% Similarity=0.792 Sum_probs=20.8
Q ss_pred cccccCCCCCcCCChhHHHHHhhhh
Q psy4019 16 YVCGEENCTKRYTDPSSLRKHIKTV 40 (109)
Q Consensus 16 ~~C~~~~C~k~f~~~~~l~~H~~~h 40 (109)
|.|.. |++.|.....|..|++.|
T Consensus 1 y~C~~--C~~~f~~~~~l~~H~~~H 23 (23)
T PF00096_consen 1 YKCPI--CGKSFSSKSNLKRHMRRH 23 (23)
T ss_dssp EEETT--TTEEESSHHHHHHHHHHH
T ss_pred CCCCC--CCCccCCHHHHHHHHhHC
Confidence 67988 999999999999999864
No 20
>KOG3993|consensus
Probab=97.51 E-value=9.6e-06 Score=58.01 Aligned_cols=66 Identities=20% Similarity=0.232 Sum_probs=48.5
Q ss_pred hhhhccccCCCCCcccccCCCCCcCCChhHHHHHhhhhc--------CCC-------------------------CcccC
Q psy4019 3 LNLAYPLLSSQKPYVCGEENCTKRYTDPSSLRKHIKTVH--------GEH-------------------------CYANK 49 (109)
Q Consensus 3 l~~H~~~h~~~~p~~C~~~~C~k~f~~~~~l~~H~~~h~--------~~k-------------------------p~~C~ 49 (109)
|..|.-...-...|+|.. |+|.|....+|..|.|.|. +.. -|.|.
T Consensus 283 LAQHrC~RIV~vEYrCPE--C~KVFsCPANLASHRRWHKPR~eaa~a~~~P~k~~~~~rae~~ea~rsg~dss~gi~~C~ 360 (500)
T KOG3993|consen 283 LAQHRCPRIVHVEYRCPE--CDKVFSCPANLASHRRWHKPRPEAAKAGSPPPKQAVETRAEVQEAERSGDDSSSGIFSCH 360 (500)
T ss_pred HhhccCCeeEEeeecCCc--ccccccCchhhhhhhcccCCchhhhhcCCCChhhhhhhhhhhhhccccCCcccCceeecH
Confidence 334433222334699999 9999999999999988763 212 25666
Q ss_pred CCCCCCCCCCCCCCCCcCCCCCCcccccccccc
Q psy4019 50 KHKGMPSKDKDGLGGASPSRSDDMPLSAKTASV 82 (109)
Q Consensus 50 ~~~~~~~~~~~~~C~~~f~~~~~l~~H~~~~~~ 82 (109)
.|++.|.+.-.|++|.-+|+.
T Consensus 361 ------------~C~KkFrRqAYLrKHqlthq~ 381 (500)
T KOG3993|consen 361 ------------TCGKKFRRQAYLRKHQLTHQR 381 (500)
T ss_pred ------------HhhhhhHHHHHHHHhHHhhhc
Confidence 899999999999999776664
No 21
>PLN03086 PRLI-interacting factor K; Provisional
Probab=97.50 E-value=5.6e-05 Score=56.42 Aligned_cols=59 Identities=14% Similarity=0.215 Sum_probs=30.0
Q ss_pred CCCcccccCCCCCcCCChhHHHHHhhhhcCCCCcccCCCCCCCCCCCCCCCCCcCCCCCCccccccccccCCCCCCCC
Q psy4019 13 QKPYVCGEENCTKRYTDPSSLRKHIKTVHGEHCYANKKHKGMPSKDKDGLGGASPSRSDDMPLSAKTASVSSPSIKSE 90 (109)
Q Consensus 13 ~~p~~C~~~~C~k~f~~~~~l~~H~~~h~~~kp~~C~~~~~~~~~~~~~~C~~~f~~~~~l~~H~~~~~~~~p~~~~~ 90 (109)
+..+.|.. |++.|. ...|..|++.++ +++.|+ |++.+ .+..|..|+.++....++.|..
T Consensus 451 ~~H~~C~~--Cgk~f~-~s~LekH~~~~H--kpv~Cp-------------Cg~~~-~R~~L~~H~~thCp~Kpi~C~f 509 (567)
T PLN03086 451 KNHVHCEK--CGQAFQ-QGEMEKHMKVFH--EPLQCP-------------CGVVL-EKEQMVQHQASTCPLRLITCRF 509 (567)
T ss_pred ccCccCCC--CCCccc-hHHHHHHHHhcC--CCccCC-------------CCCCc-chhHHHhhhhccCCCCceeCCC
Confidence 34455666 666654 445666666543 455664 33333 3345555555555555554443
No 22
>PHA00616 hypothetical protein
Probab=97.48 E-value=5.2e-06 Score=41.42 Aligned_cols=32 Identities=6% Similarity=-0.060 Sum_probs=29.1
Q ss_pred CcccCCCCCCCCCCCCCCCCCcCCCCCCccccccccccCCCCCC
Q psy4019 45 CYANKKHKGMPSKDKDGLGGASPSRSDDMPLSAKTASVSSPSIK 88 (109)
Q Consensus 45 p~~C~~~~~~~~~~~~~~C~~~f~~~~~l~~H~~~~~~~~p~~~ 88 (109)
||+|. .||+.|....+|..|++.+++..+..+
T Consensus 1 pYqC~------------~CG~~F~~~s~l~~H~r~~hg~~~~~~ 32 (44)
T PHA00616 1 MYQCL------------RCGGIFRKKKEVIEHLLSVHKQNKLTL 32 (44)
T ss_pred CCccc------------hhhHHHhhHHHHHHHHHHhcCCCccce
Confidence 68999 999999999999999999999877654
No 23
>PHA02768 hypothetical protein; Provisional
Probab=97.47 E-value=1.7e-05 Score=41.45 Aligned_cols=31 Identities=13% Similarity=0.067 Sum_probs=26.1
Q ss_pred CcccCCCCCCCCCCCCCCCCCcCCCCCCccccccccccCCCCCCC
Q psy4019 45 CYANKKHKGMPSKDKDGLGGASPSRSDDMPLSAKTASVSSPSIKS 89 (109)
Q Consensus 45 p~~C~~~~~~~~~~~~~~C~~~f~~~~~l~~H~~~~~~~~p~~~~ 89 (109)
-|+|+ .||+.|.+.++|..|+++|+ +++.++
T Consensus 5 ~y~C~------------~CGK~Fs~~~~L~~H~r~H~--k~~kc~ 35 (55)
T PHA02768 5 GYECP------------ICGEIYIKRKSMITHLRKHN--TNLKLS 35 (55)
T ss_pred ccCcc------------hhCCeeccHHHHHHHHHhcC--CcccCC
Confidence 37899 99999999999999999999 455333
No 24
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=97.18 E-value=0.00018 Score=31.95 Aligned_cols=25 Identities=36% Similarity=0.829 Sum_probs=22.6
Q ss_pred CcccccCCCCCcCCChhHHHHHhhhhc
Q psy4019 15 PYVCGEENCTKRYTDPSSLRKHIKTVH 41 (109)
Q Consensus 15 p~~C~~~~C~k~f~~~~~l~~H~~~h~ 41 (109)
+|.|.. |++.|.....|..|++.|.
T Consensus 1 ~~~C~~--C~~~F~~~~~l~~H~~~h~ 25 (27)
T PF13912_consen 1 PFECDE--CGKTFSSLSALREHKRSHC 25 (27)
T ss_dssp SEEETT--TTEEESSHHHHHHHHCTTT
T ss_pred CCCCCc--cCCccCChhHHHHHhHHhc
Confidence 588999 9999999999999998764
No 25
>PHA00733 hypothetical protein
Probab=97.15 E-value=0.00016 Score=44.37 Aligned_cols=62 Identities=15% Similarity=0.065 Sum_probs=45.5
Q ss_pred CCCcccccCCCCCcCCChhHHHHH--hh---hhcCCCCcccCCCCCCCCCCCCCCCCCcCCCCCCccccccccccCCCCC
Q psy4019 13 QKPYVCGEENCTKRYTDPSSLRKH--IK---TVHGEHCYANKKHKGMPSKDKDGLGGASPSRSDDMPLSAKTASVSSPSI 87 (109)
Q Consensus 13 ~~p~~C~~~~C~k~f~~~~~l~~H--~~---~h~~~kp~~C~~~~~~~~~~~~~~C~~~f~~~~~l~~H~~~~~~~~p~~ 87 (109)
.+++.|.. |.+.|.....|..+ .+ .+.+.+||.|. .|++.|.....|..|++.+. .++.
T Consensus 38 ~~~~~~~~--~~~~~~~~~~l~~~~~l~~~~~~~~~kPy~C~------------~Cgk~Fss~s~L~~H~r~h~--~~~~ 101 (128)
T PHA00733 38 QKRLIRAV--VKTLIYNPQLLDESSYLYKLLTSKAVSPYVCP------------LCLMPFSSSVSLKQHIRYTE--HSKV 101 (128)
T ss_pred hhhHHHHH--HhhhccChhhhcchHHHHhhcccCCCCCccCC------------CCCCcCCCHHHHHHHHhcCC--cCcc
Confidence 46788999 99887776666554 11 23457899999 99999999999999998652 3454
Q ss_pred CCC
Q psy4019 88 KSE 90 (109)
Q Consensus 88 ~~~ 90 (109)
+..
T Consensus 102 C~~ 104 (128)
T PHA00733 102 CPV 104 (128)
T ss_pred CCC
Confidence 444
No 26
>PF13894 zf-C2H2_4: C2H2-type zinc finger; PDB: 2ELX_A 2EPP_A 2DLK_A 1X6H_A 2EOU_A 2EMB_A 2GQJ_A 2CSH_A 2WBT_B 2ELM_A ....
Probab=96.89 E-value=0.00076 Score=28.56 Aligned_cols=23 Identities=35% Similarity=0.772 Sum_probs=19.0
Q ss_pred cccccCCCCCcCCChhHHHHHhhhh
Q psy4019 16 YVCGEENCTKRYTDPSSLRKHIKTV 40 (109)
Q Consensus 16 ~~C~~~~C~k~f~~~~~l~~H~~~h 40 (109)
|.|.. |++.|.....|..|++.+
T Consensus 1 ~~C~~--C~~~~~~~~~l~~H~~~~ 23 (24)
T PF13894_consen 1 FQCPI--CGKSFRSKSELRQHMRTH 23 (24)
T ss_dssp EE-SS--TS-EESSHHHHHHHHHHH
T ss_pred CCCcC--CCCcCCcHHHHHHHHHhh
Confidence 67999 999999999999999875
No 27
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=96.78 E-value=0.00085 Score=34.93 Aligned_cols=50 Identities=18% Similarity=0.262 Sum_probs=36.6
Q ss_pred CcccccCCCCCcCCChhHHHHHhhh-hcCC-CCcccCCCCCCCCCCCCCCCCCcCCCCCCccccccccc
Q psy4019 15 PYVCGEENCTKRYTDPSSLRKHIKT-VHGE-HCYANKKHKGMPSKDKDGLGGASPSRSDDMPLSAKTAS 81 (109)
Q Consensus 15 p~~C~~~~C~k~f~~~~~l~~H~~~-h~~~-kp~~C~~~~~~~~~~~~~~C~~~f~~~~~l~~H~~~~~ 81 (109)
-|.|+. |++ ......|..|... |..+ +.+.|+ .|...+. .+|..|+...+
T Consensus 2 ~f~CP~--C~~-~~~~~~L~~H~~~~H~~~~~~v~CP------------iC~~~~~--~~l~~Hl~~~H 53 (54)
T PF05605_consen 2 SFTCPY--CGK-GFSESSLVEHCEDEHRSESKNVVCP------------ICSSRVT--DNLIRHLNSQH 53 (54)
T ss_pred CcCCCC--CCC-ccCHHHHHHHHHhHCcCCCCCccCC------------Cchhhhh--hHHHHHHHHhc
Confidence 488999 999 4567789999764 5543 579999 8877553 47777876543
No 28
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=96.73 E-value=0.0017 Score=33.32 Aligned_cols=34 Identities=24% Similarity=0.427 Sum_probs=24.2
Q ss_pred cCCCCCcccccCCCCCcCCChhHHHHHhhhhcCCCC
Q psy4019 10 LSSQKPYVCGEENCTKRYTDPSSLRKHIKTVHGEHC 45 (109)
Q Consensus 10 h~~~~p~~C~~~~C~k~f~~~~~l~~H~~~h~~~kp 45 (109)
+..+.|..|+. |+..+.+..+|.+|+.+..+.||
T Consensus 19 ~~S~~PatCP~--C~a~~~~srnLrRHle~~H~~k~ 52 (54)
T PF09237_consen 19 SQSEQPATCPI--CGAVIRQSRNLRRHLEIRHFKKP 52 (54)
T ss_dssp CTTS--EE-TT--T--EESSHHHHHHHHHHHTTTS-
T ss_pred hccCCCCCCCc--chhhccchhhHHHHHHHHhcccC
Confidence 34567899999 99999999999999998877765
No 29
>PF00096 zf-C2H2: Zinc finger, C2H2 type; InterPro: IPR007087 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The C2H2 zinc finger is the classical zinc finger domain. The two conserved cysteines and histidines co-ordinate a zinc ion. The following pattern describes the zinc finger: #-X-C-X(1-5)-C-X3-#-X5-#-X2-H-X(3-6)-[H/C], where X can be any amino acid, and numbers in brackets indicate the number of residues. The positions marked # are those that are important for the stable fold of the zinc finger. The final position can be either his or cys. The C2H2 zinc finger is composed of two short beta strands followed by an alpha helix. The amino terminal part of the helix binds the major groove in DNA binding zinc fingers. The accepted consensus binding sequence for Sp1 is usually defined by the asymmetric hexanucleotide core GGGCGG but this sequence does not include, among others, the GAG (=CTC) repeat that constitutes a high-affinity site for Sp1 binding to the wt1 promoter []. This entry represents the classical C2H2 zinc finger domain. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9H_A 2EPC_A 1SP1_A 1VA3_A 2WBT_B 2ELR_A 2YTP_A 2YTT_A 1VA1_A 2ELO_A ....
Probab=96.66 E-value=3.7e-05 Score=32.92 Aligned_cols=18 Identities=17% Similarity=0.220 Sum_probs=16.5
Q ss_pred CCCCcCCCCCCccccccc
Q psy4019 62 LGGASPSRSDDMPLSAKT 79 (109)
Q Consensus 62 ~C~~~f~~~~~l~~H~~~ 79 (109)
.|++.|.....|..|++.
T Consensus 5 ~C~~~f~~~~~l~~H~~~ 22 (23)
T PF00096_consen 5 ICGKSFSSKSNLKRHMRR 22 (23)
T ss_dssp TTTEEESSHHHHHHHHHH
T ss_pred CCCCccCCHHHHHHHHhH
Confidence 999999999999999875
No 30
>smart00355 ZnF_C2H2 zinc finger.
Probab=96.64 E-value=0.0025 Score=27.24 Aligned_cols=24 Identities=38% Similarity=0.807 Sum_probs=21.1
Q ss_pred cccccCCCCCcCCChhHHHHHhhhhc
Q psy4019 16 YVCGEENCTKRYTDPSSLRKHIKTVH 41 (109)
Q Consensus 16 ~~C~~~~C~k~f~~~~~l~~H~~~h~ 41 (109)
|.|.. |++.|.....|..|++.|.
T Consensus 1 ~~C~~--C~~~f~~~~~l~~H~~~H~ 24 (26)
T smart00355 1 YRCPE--CGKVFKSKSALKEHMRTHX 24 (26)
T ss_pred CCCCC--CcchhCCHHHHHHHHHHhc
Confidence 57888 9999999999999998664
No 31
>PRK04860 hypothetical protein; Provisional
Probab=96.57 E-value=0.0019 Score=41.08 Aligned_cols=38 Identities=29% Similarity=0.616 Sum_probs=31.7
Q ss_pred CcccccCCCCCcCCChhHHHHHhhhhcCCCCcccCCCCCCCCCCCCCCCCCcCCCC
Q psy4019 15 PYVCGEENCTKRYTDPSSLRKHIKTVHGEHCYANKKHKGMPSKDKDGLGGASPSRS 70 (109)
Q Consensus 15 p~~C~~~~C~k~f~~~~~l~~H~~~h~~~kp~~C~~~~~~~~~~~~~~C~~~f~~~ 70 (109)
+|.|. |+. ....+.+|.++++++++|.|. .|+..|...
T Consensus 119 ~Y~C~---C~~---~~~~~rrH~ri~~g~~~YrC~------------~C~~~l~~~ 156 (160)
T PRK04860 119 PYRCK---CQE---HQLTVRRHNRVVRGEAVYRCR------------RCGETLVFK 156 (160)
T ss_pred EEEcC---CCC---eeCHHHHHHHHhcCCccEECC------------CCCceeEEe
Confidence 58895 887 567789999999999999999 888877543
No 32
>PF13912 zf-C2H2_6: C2H2-type zinc finger; PDB: 1JN7_A 1FU9_A 2L1O_A 1NJQ_A 2EN8_A 2EMM_A 1FV5_A 1Y0J_B 2L6Z_B.
Probab=96.15 E-value=0.00025 Score=31.48 Aligned_cols=25 Identities=8% Similarity=0.028 Sum_probs=22.2
Q ss_pred CcccCCCCCCCCCCCCCCCCCcCCCCCCccccccccc
Q psy4019 45 CYANKKHKGMPSKDKDGLGGASPSRSDDMPLSAKTAS 81 (109)
Q Consensus 45 p~~C~~~~~~~~~~~~~~C~~~f~~~~~l~~H~~~~~ 81 (109)
+|+|. .|++.|.....|..|++.+.
T Consensus 1 ~~~C~------------~C~~~F~~~~~l~~H~~~h~ 25 (27)
T PF13912_consen 1 PFECD------------ECGKTFSSLSALREHKRSHC 25 (27)
T ss_dssp SEEET------------TTTEEESSHHHHHHHHCTTT
T ss_pred CCCCC------------ccCCccCChhHHHHHhHHhc
Confidence 58899 99999999999999998764
No 33
>PF12874 zf-met: Zinc-finger of C2H2 type; PDB: 1ZU1_A 2KVG_A.
Probab=94.78 E-value=0.015 Score=24.96 Aligned_cols=23 Identities=30% Similarity=0.856 Sum_probs=20.1
Q ss_pred cccccCCCCCcCCChhHHHHHhhhh
Q psy4019 16 YVCGEENCTKRYTDPSSLRKHIKTV 40 (109)
Q Consensus 16 ~~C~~~~C~k~f~~~~~l~~H~~~h 40 (109)
|.|.. |++.|.....+..|++.+
T Consensus 1 ~~C~~--C~~~f~s~~~~~~H~~s~ 23 (25)
T PF12874_consen 1 FYCDI--CNKSFSSENSLRQHLRSK 23 (25)
T ss_dssp EEETT--TTEEESSHHHHHHHHTTH
T ss_pred CCCCC--CCCCcCCHHHHHHHHCcC
Confidence 57888 999999999999998753
No 34
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=94.46 E-value=0.031 Score=39.56 Aligned_cols=63 Identities=22% Similarity=0.352 Sum_probs=52.5
Q ss_pred CCcccccCCCCCcCCChhHHHHHhh--hhcCC--CCcccCCCCCCCCCCCCCCCCCcCCCCCCccccccccccCCCCCC
Q psy4019 14 KPYVCGEENCTKRYTDPSSLRKHIK--TVHGE--HCYANKKHKGMPSKDKDGLGGASPSRSDDMPLSAKTASVSSPSIK 88 (109)
Q Consensus 14 ~p~~C~~~~C~k~f~~~~~l~~H~~--~h~~~--kp~~C~~~~~~~~~~~~~~C~~~f~~~~~l~~H~~~~~~~~p~~~ 88 (109)
.++.|.. |...|.....+..|.+ .|.++ +++.|..- .|++.|.+.+.+..|...++...+...
T Consensus 288 ~~~~~~~--~~~~~s~~~~l~~~~~~~~h~~~~~~~~~~p~~----------~~~~~~~~~~~~~~~~~~~~~~~~~~~ 354 (467)
T COG5048 288 LPIKSKQ--CNISFSRSSPLTRHLRSVNHSGESLKPFSCPYS----------LCGKLFSRNDALKRHILLHTSISPAKE 354 (467)
T ss_pred cCCCCcc--ccCCccccccccccccccccccccCCceeeecc----------CCCccccccccccCCcccccCCCcccc
Confidence 3577887 9999999999999999 79999 99999832 589999999999999998888766533
No 35
>PF13909 zf-H2C2_5: C2H2-type zinc-finger domain; PDB: 1X5W_A.
Probab=93.68 E-value=0.053 Score=23.02 Aligned_cols=23 Identities=35% Similarity=0.788 Sum_probs=17.4
Q ss_pred cccccCCCCCcCCChhHHHHHhhhhc
Q psy4019 16 YVCGEENCTKRYTDPSSLRKHIKTVH 41 (109)
Q Consensus 16 ~~C~~~~C~k~f~~~~~l~~H~~~h~ 41 (109)
|.|.. |..... ...|.+|++.+.
T Consensus 1 y~C~~--C~y~t~-~~~l~~H~~~~H 23 (24)
T PF13909_consen 1 YKCPH--CSYSTS-KSNLKRHLKRHH 23 (24)
T ss_dssp EE-SS--SS-EES-HHHHHHHHHHHH
T ss_pred CCCCC--CCCcCC-HHHHHHHHHhhC
Confidence 67988 998887 889999988753
No 36
>PF09237 GAGA: GAGA factor; InterPro: IPR015318 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. Members of this entry bind to a 5'-GAGAG-3' DNA consensus binding site, and contain a Cys2-His2 zinc finger core as well as an N-terminal extension containing two highly basic regions. The zinc finger core binds in the DNA major groove and recognises the first three GAG bases of the consensus in a manner similar to that seen in other classical zinc finger-DNA complexes. The second basic region forms a helix that interacts in the major groove recognising the last G of the consensus, while the first basic region wraps around the DNA in the minor groove and recognises the A in the fourth position of the consensus sequence []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1YUI_A 1YUJ_A.
Probab=93.60 E-value=0.0038 Score=32.04 Aligned_cols=34 Identities=15% Similarity=0.172 Sum_probs=23.9
Q ss_pred cCCCCcccCCCCCCCCCCCCCCCCCcCCCCCCccccccccccCCCC
Q psy4019 41 HGEHCYANKKHKGMPSKDKDGLGGASPSRSDDMPLSAKTASVSSPS 86 (109)
Q Consensus 41 ~~~kp~~C~~~~~~~~~~~~~~C~~~f~~~~~l~~H~~~~~~~~p~ 86 (109)
..+.|-.|+ .|+..+.+..+|.+|+...+..+|-
T Consensus 20 ~S~~PatCP------------~C~a~~~~srnLrRHle~~H~~k~~ 53 (54)
T PF09237_consen 20 QSEQPATCP------------ICGAVIRQSRNLRRHLEIRHFKKPG 53 (54)
T ss_dssp TTS--EE-T------------TT--EESSHHHHHHHHHHHTTTS--
T ss_pred ccCCCCCCC------------cchhhccchhhHHHHHHHHhcccCC
Confidence 457889999 9999999999999999988887663
No 37
>KOG3993|consensus
Probab=93.37 E-value=0.014 Score=42.41 Aligned_cols=55 Identities=16% Similarity=0.196 Sum_probs=44.3
Q ss_pred CcccccCCCCCcCCChhHHHHHhhhhcCCCCcccCCCCCCCCCCCCCCCCCcCCCCCCccccccccccC
Q psy4019 15 PYVCGEENCTKRYTDPSSLRKHIKTVHGEHCYANKKHKGMPSKDKDGLGGASPSRSDDMPLSAKTASVS 83 (109)
Q Consensus 15 p~~C~~~~C~k~f~~~~~l~~H~~~h~~~kp~~C~~~~~~~~~~~~~~C~~~f~~~~~l~~H~~~~~~~ 83 (109)
-|.|.. |...|...=.|.+|+-.-.-..-|+|+ +|+|.|+..-+|.-|.+.|...
T Consensus 267 dyiCqL--CK~kYeD~F~LAQHrC~RIV~vEYrCP------------EC~KVFsCPANLASHRRWHKPR 321 (500)
T KOG3993|consen 267 DYICQL--CKEKYEDAFALAQHRCPRIVHVEYRCP------------ECDKVFSCPANLASHRRWHKPR 321 (500)
T ss_pred HHHHHH--HHHhhhhHHHHhhccCCeeEEeeecCC------------cccccccCchhhhhhhcccCCc
Confidence 388999 998888888888886433323349999 9999999999999999998754
No 38
>PHA00732 hypothetical protein
Probab=93.22 E-value=0.021 Score=32.14 Aligned_cols=25 Identities=8% Similarity=0.082 Sum_probs=21.9
Q ss_pred CcccCCCCCCCCCCCCCCCCCcCCCCCCccccccc-cc
Q psy4019 45 CYANKKHKGMPSKDKDGLGGASPSRSDDMPLSAKT-AS 81 (109)
Q Consensus 45 p~~C~~~~~~~~~~~~~~C~~~f~~~~~l~~H~~~-~~ 81 (109)
||.|. .|++.|.+...|..|++. |.
T Consensus 1 py~C~------------~Cgk~F~s~s~Lk~H~r~~H~ 26 (79)
T PHA00732 1 MFKCP------------ICGFTTVTLFALKQHARRNHT 26 (79)
T ss_pred CccCC------------CCCCccCCHHHHHHHhhcccC
Confidence 58899 999999999999999984 54
No 39
>PF12171 zf-C2H2_jaz: Zinc-finger double-stranded RNA-binding; InterPro: IPR022755 This zinc finger is found in archaea and eukaryotes, and is approximately 30 amino acids in length. The mammalian members of this group occur multiple times along the protein, joined by flexible linkers, and are referred to as JAZ - dsRNA-binding ZF protein - zinc-fingers. The JAZ proteins are expressed in all tissues tested and localise in the nucleus, particularly the nucleolus []. JAZ preferentially binds to double-stranded (ds) RNA or RNA/DNA hybrids rather than DNA. In addition to binding double-stranded RNA, these zinc-fingers are required for nucleolar localisation. This entry represents the multiple-adjacent-C2H2 zinc finger, JAZ. ; PDB: 4DGW_A 1ZR9_A.
Probab=93.20 E-value=0.026 Score=24.88 Aligned_cols=22 Identities=27% Similarity=0.872 Sum_probs=19.2
Q ss_pred cccccCCCCCcCCChhHHHHHhhh
Q psy4019 16 YVCGEENCTKRYTDPSSLRKHIKT 39 (109)
Q Consensus 16 ~~C~~~~C~k~f~~~~~l~~H~~~ 39 (109)
|.|.. |++.|.....+..|++.
T Consensus 2 ~~C~~--C~k~f~~~~~~~~H~~s 23 (27)
T PF12171_consen 2 FYCDA--CDKYFSSENQLKQHMKS 23 (27)
T ss_dssp CBBTT--TTBBBSSHHHHHCCTTS
T ss_pred CCccc--CCCCcCCHHHHHHHHcc
Confidence 67988 99999999999988764
No 40
>COG5048 FOG: Zn-finger [General function prediction only]
Probab=92.68 E-value=0.12 Score=36.55 Aligned_cols=46 Identities=24% Similarity=0.359 Sum_probs=40.3
Q ss_pred chhhhcc--ccCCC--CCcccc--cCCCCCcCCChhHHHHHhhhhcCCCCcccC
Q psy4019 2 KLNLAYP--LLSSQ--KPYVCG--EENCTKRYTDPSSLRKHIKTVHGEHCYANK 49 (109)
Q Consensus 2 ~l~~H~~--~h~~~--~p~~C~--~~~C~k~f~~~~~l~~H~~~h~~~kp~~C~ 49 (109)
.|..|.+ .|+++ +++.|. . |++.|.+...+..|...|.+.+++.|.
T Consensus 304 ~l~~~~~~~~h~~~~~~~~~~p~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~ 355 (467)
T COG5048 304 PLTRHLRSVNHSGESLKPFSCPYSL--CGKLFSRNDALKRHILLHTSISPAKEK 355 (467)
T ss_pred cccccccccccccccCCceeeeccC--CCccccccccccCCcccccCCCccccc
Confidence 4677888 89999 999999 6 999999999999999999988777765
No 41
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=90.96 E-value=0.018 Score=32.95 Aligned_cols=19 Identities=0% Similarity=0.006 Sum_probs=16.1
Q ss_pred CCCCcCCCCCCcccccccc
Q psy4019 62 LGGASPSRSDDMPLSAKTA 80 (109)
Q Consensus 62 ~C~~~f~~~~~l~~H~~~~ 80 (109)
.|++.|.....|..|++..
T Consensus 55 ~C~~~f~s~~~l~~Hm~~~ 73 (100)
T PF12756_consen 55 YCNKTFRSREALQEHMRSK 73 (100)
T ss_dssp SSS-EESSHHHHHHHHHHT
T ss_pred ccCCCCcCHHHHHHHHcCc
Confidence 9999999999999999864
No 42
>PF13913 zf-C2HC_2: zinc-finger of a C2HC-type
Probab=90.72 E-value=0.25 Score=21.39 Aligned_cols=21 Identities=33% Similarity=0.774 Sum_probs=16.3
Q ss_pred cccccCCCCCcCCChhHHHHHhhh
Q psy4019 16 YVCGEENCTKRYTDPSSLRKHIKT 39 (109)
Q Consensus 16 ~~C~~~~C~k~f~~~~~l~~H~~~ 39 (109)
..|.. ||+.| ....|.+|+..
T Consensus 3 ~~C~~--CgR~F-~~~~l~~H~~~ 23 (25)
T PF13913_consen 3 VPCPI--CGRKF-NPDRLEKHEKI 23 (25)
T ss_pred CcCCC--CCCEE-CHHHHHHHHHh
Confidence 46888 99999 56778888753
No 43
>smart00451 ZnF_U1 U1-like zinc finger. Family of C2H2-type zinc fingers, present in matrin, U1 small nuclear ribonucleoprotein C and other RNA-binding proteins.
Probab=90.14 E-value=0.3 Score=22.46 Aligned_cols=23 Identities=30% Similarity=0.652 Sum_probs=19.9
Q ss_pred CcccccCCCCCcCCChhHHHHHhhh
Q psy4019 15 PYVCGEENCTKRYTDPSSLRKHIKT 39 (109)
Q Consensus 15 p~~C~~~~C~k~f~~~~~l~~H~~~ 39 (109)
+|.|.. |+..|.....+..|++.
T Consensus 3 ~~~C~~--C~~~~~~~~~~~~H~~g 25 (35)
T smart00451 3 GFYCKL--CNVTFTDEISVEAHLKG 25 (35)
T ss_pred CeEccc--cCCccCCHHHHHHHHCh
Confidence 588999 99999999999888753
No 44
>KOG2893|consensus
Probab=83.68 E-value=0.76 Score=31.33 Aligned_cols=29 Identities=28% Similarity=0.646 Sum_probs=22.5
Q ss_pred CCcccccCCCCCcCCChhHHHHHhhhhcCCCCcccC
Q psy4019 14 KPYVCGEENCTKRYTDPSSLRKHIKTVHGEHCYANK 49 (109)
Q Consensus 14 ~p~~C~~~~C~k~f~~~~~l~~H~~~h~~~kp~~C~ 49 (109)
+++ |=. |.+.|-..--|..|++. |-|+|.
T Consensus 10 kpw-cwy--cnrefddekiliqhqka----khfkch 38 (341)
T KOG2893|consen 10 KPW-CWY--CNREFDDEKILIQHQKA----KHFKCH 38 (341)
T ss_pred Cce-eee--cccccchhhhhhhhhhh----ccceee
Confidence 454 445 99999999888888764 679999
No 45
>COG5189 SFP1 Putative transcriptional repressor regulating G2/M transition [Transcription / Cell division and chromosome partitioning]
Probab=83.17 E-value=0.68 Score=32.84 Aligned_cols=30 Identities=3% Similarity=-0.091 Sum_probs=25.5
Q ss_pred CCCcccCCCCCCCCCCCCCCCCCcCCCCCCcccccccccc
Q psy4019 43 EHCYANKKHKGMPSKDKDGLGGASPSRSDDMPLSAKTASV 82 (109)
Q Consensus 43 ~kp~~C~~~~~~~~~~~~~~C~~~f~~~~~l~~H~~~~~~ 82 (109)
+|||+|++. .|+|.+.....|+.|+.-.+.
T Consensus 347 ~KpykCpV~----------gC~K~YknqnGLKYH~lhGH~ 376 (423)
T COG5189 347 GKPYKCPVE----------GCNKKYKNQNGLKYHMLHGHQ 376 (423)
T ss_pred CceecCCCC----------Cchhhhccccchhhhhhcccc
Confidence 599999976 799999999999999875553
No 46
>PF12756 zf-C2H2_2: C2H2 type zinc-finger (2 copies); PDB: 2DMI_A.
Probab=82.66 E-value=1.1 Score=25.37 Aligned_cols=24 Identities=21% Similarity=0.684 Sum_probs=21.0
Q ss_pred CcccccCCCCCcCCChhHHHHHhhhh
Q psy4019 15 PYVCGEENCTKRYTDPSSLRKHIKTV 40 (109)
Q Consensus 15 p~~C~~~~C~k~f~~~~~l~~H~~~h 40 (109)
.+.|.. |++.|.....|..|++.+
T Consensus 50 ~~~C~~--C~~~f~s~~~l~~Hm~~~ 73 (100)
T PF12756_consen 50 SFRCPY--CNKTFRSREALQEHMRSK 73 (100)
T ss_dssp SEEBSS--SS-EESSHHHHHHHHHHT
T ss_pred CCCCCc--cCCCCcCHHHHHHHHcCc
Confidence 589999 999999999999999864
No 47
>KOG2186|consensus
Probab=76.61 E-value=1.4 Score=30.18 Aligned_cols=51 Identities=14% Similarity=0.269 Sum_probs=36.4
Q ss_pred cccccCCCCCcCCChhHHHHHhhhhcCCCCcccCCCCCCCCCCCCCCCCCcCCCCCCccccccccccC
Q psy4019 16 YVCGEENCTKRYTDPSSLRKHIKTVHGEHCYANKKHKGMPSKDKDGLGGASPSRSDDMPLSAKTASVS 83 (109)
Q Consensus 16 ~~C~~~~C~k~f~~~~~l~~H~~~h~~~kp~~C~~~~~~~~~~~~~~C~~~f~~~~~l~~H~~~~~~~ 83 (109)
|.|.. ||.... ...+.+|+-...+ ..|.|- .|++.|.+ .+...|.+..+..
T Consensus 4 FtCnv--CgEsvK-Kp~vekH~srCrn-~~fSCI------------DC~k~F~~-~sYknH~kCITEa 54 (276)
T KOG2186|consen 4 FTCNV--CGESVK-KPQVEKHMSRCRN-AYFSCI------------DCGKTFER-VSYKNHTKCITEA 54 (276)
T ss_pred Eehhh--hhhhcc-ccchHHHHHhccC-CeeEEe------------eccccccc-chhhhhhhhcchH
Confidence 67888 887644 3446678766665 578998 89999966 6677788766643
No 48
>COG4049 Uncharacterized protein containing archaeal-type C2H2 Zn-finger [General function prediction only]
Probab=72.56 E-value=1.7 Score=22.74 Aligned_cols=30 Identities=10% Similarity=0.255 Sum_probs=25.2
Q ss_pred cccCCCCCcccccCCCCCcCCChhHHHHHhhh
Q psy4019 8 PLLSSQKPYVCGEENCTKRYTDPSSLRKHIKT 39 (109)
Q Consensus 8 ~~h~~~~p~~C~~~~C~k~f~~~~~l~~H~~~ 39 (109)
+...|+.-+.|+. |+..|...-+..+|...
T Consensus 10 ~~RDGE~~lrCPR--C~~~FR~~K~Y~RHVNK 39 (65)
T COG4049 10 RDRDGEEFLRCPR--CGMVFRRRKDYIRHVNK 39 (65)
T ss_pred eccCCceeeeCCc--hhHHHHHhHHHHHHhhH
Confidence 3456888899999 99999999999999763
No 49
>KOG1146|consensus
Probab=64.32 E-value=1.6 Score=36.30 Aligned_cols=55 Identities=16% Similarity=0.277 Sum_probs=41.4
Q ss_pred cCCCCCcccccCCCCCcCCChhHHHHHhhh-hc------------------------CCCCcccCCCCCCCCCCCCCCCC
Q psy4019 10 LSSQKPYVCGEENCTKRYTDPSSLRKHIKT-VH------------------------GEHCYANKKHKGMPSKDKDGLGG 64 (109)
Q Consensus 10 h~~~~p~~C~~~~C~k~f~~~~~l~~H~~~-h~------------------------~~kp~~C~~~~~~~~~~~~~~C~ 64 (109)
|.-.+-+.|+. |+..|.....|..|+|. |. +.++|.|. .|.
T Consensus 460 ~S~~kt~~cpk--c~~~yk~a~~L~vhmRskhp~~~~~~c~~gq~~~~~arg~~~~~~~~p~~C~------------~C~ 525 (1406)
T KOG1146|consen 460 HSFFKTLKCPK--CNWHYKLAQTLGVHMRSKHPESQSAYCKAGQNHPRLARGEVYRCPGKPYPCR------------ACN 525 (1406)
T ss_pred ecccccccCCc--cchhhhhHHHhhhcccccccccchhHhHhccccccccccccccCCCCcccce------------eee
Confidence 44457789999 99999999999999986 10 12466676 788
Q ss_pred CcCCCCCCcccccc
Q psy4019 65 ASPSRSDDMPLSAK 78 (109)
Q Consensus 65 ~~f~~~~~l~~H~~ 78 (109)
.++.....|.+|++
T Consensus 526 ~stttng~Lsihlq 539 (1406)
T KOG1146|consen 526 YSTTTNGNLSIHLQ 539 (1406)
T ss_pred eeeecchHHHHHHH
Confidence 88877777777765
No 50
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=63.96 E-value=5.6 Score=18.18 Aligned_cols=20 Identities=25% Similarity=0.511 Sum_probs=13.2
Q ss_pred cccccCCCCCcCCChhHHHHHhhhhcCCCCcccC
Q psy4019 16 YVCGEENCTKRYTDPSSLRKHIKTVHGEHCYANK 49 (109)
Q Consensus 16 ~~C~~~~C~k~f~~~~~l~~H~~~h~~~kp~~C~ 49 (109)
|.|.. ||..+.... .++.|+
T Consensus 2 ~~C~~--CGy~y~~~~------------~~~~CP 21 (33)
T cd00350 2 YVCPV--CGYIYDGEE------------APWVCP 21 (33)
T ss_pred EECCC--CCCEECCCc------------CCCcCc
Confidence 57877 887644322 578898
No 51
>KOG4124|consensus
Probab=56.37 E-value=4.2 Score=29.34 Aligned_cols=53 Identities=21% Similarity=0.396 Sum_probs=34.9
Q ss_pred CCCcccccCCCCCcCCChhHHHHHhh---------------hhc----CCCCcccCCCCCCCCCCCCCCCCCcCCCCCCc
Q psy4019 13 QKPYVCGEENCTKRYTDPSSLRKHIK---------------TVH----GEHCYANKKHKGMPSKDKDGLGGASPSRSDDM 73 (109)
Q Consensus 13 ~~p~~C~~~~C~k~f~~~~~l~~H~~---------------~h~----~~kp~~C~~~~~~~~~~~~~~C~~~f~~~~~l 73 (109)
.++|.|.++.|.+.+...-.|..|.. .|+ ..|+|+|+ .|.+.+.--..|
T Consensus 347 ~~~~~~~vp~~~~~~~n~ng~~~~~~~~h~s~i~~~s~~~~ph~~~~~~nk~~r~~------------i~~~~~k~~~~l 414 (442)
T KOG4124|consen 347 DKPYKCPVPNCDKAYKNQNGLKYHKLHGHCSPITTPTPAPIPHQGFVVENKPYRCE------------VCSKRYKNLNGL 414 (442)
T ss_pred cCCCCCCCCcchhhcccCcceeeccccCcCCCCCCCCCCCCCcceeeeccCcccCh------------hhhhhhccCCCC
Confidence 46889998889988776666655532 122 24788998 888777655555
Q ss_pred cccc
Q psy4019 74 PLSA 77 (109)
Q Consensus 74 ~~H~ 77 (109)
..|.
T Consensus 415 ~~~~ 418 (442)
T KOG4124|consen 415 KYHR 418 (442)
T ss_pred Ccee
Confidence 5443
No 52
>COG2888 Predicted Zn-ribbon RNA-binding protein with a function in translation [Translation, ribosomal structure and biogenesis]
Probab=55.75 E-value=13 Score=19.79 Aligned_cols=29 Identities=10% Similarity=0.090 Sum_probs=17.6
Q ss_pred CCcccccCCCCCcCCChhHHHHHhhhhcCCCCcccC
Q psy4019 14 KPYVCGEENCTKRYTDPSSLRKHIKTVHGEHCYANK 49 (109)
Q Consensus 14 ~p~~C~~~~C~k~f~~~~~l~~H~~~h~~~kp~~C~ 49 (109)
..|.|+. ||........-- -.--.+|.|+
T Consensus 26 v~F~CPn--CGe~~I~Rc~~C-----Rk~g~~Y~Cp 54 (61)
T COG2888 26 VKFPCPN--CGEVEIYRCAKC-----RKLGNPYRCP 54 (61)
T ss_pred eEeeCCC--CCceeeehhhhH-----HHcCCceECC
Confidence 3488988 997655444322 1123589999
No 53
>PF04959 ARS2: Arsenite-resistance protein 2; InterPro: IPR007042 This entry represents Arsenite-resistance protein 2 (also known as Serrate RNA effector molecule homolog) which is thought to play a role in arsenite resistance [], although does not directly confer arsenite resistance but rather modulates arsenic sensitivity []. Arsenite is a carcinogenic compound which can act as a comutagen by inhibiting DNA repair. It is also involved in cell cycle progression at S phase. ; PDB: 3AX1_A.
Probab=52.41 E-value=5 Score=26.91 Aligned_cols=28 Identities=32% Similarity=0.646 Sum_probs=21.0
Q ss_pred CCCCcccccCCCCCcCCChhHHHHHhhhhc
Q psy4019 12 SQKPYVCGEENCTKRYTDPSSLRKHIKTVH 41 (109)
Q Consensus 12 ~~~p~~C~~~~C~k~f~~~~~l~~H~~~h~ 41 (109)
++..|.|.. |+|.|.....+.+|+..-.
T Consensus 74 ~~~K~~C~l--c~KlFkg~eFV~KHI~nKH 101 (214)
T PF04959_consen 74 DEDKWRCPL--CGKLFKGPEFVRKHIFNKH 101 (214)
T ss_dssp SSEEEEE-S--SS-EESSHHHHHHHHHHH-
T ss_pred cCCEECCCC--CCcccCChHHHHHHHhhcC
Confidence 445699999 9999999999999987533
No 54
>PF13719 zinc_ribbon_5: zinc-ribbon domain
Probab=50.08 E-value=13 Score=17.48 Aligned_cols=32 Identities=22% Similarity=0.349 Sum_probs=20.4
Q ss_pred ccccCCCCCcCCChhHHHHHhhhhcCCCCcccCCCCCCCCCCCCCCCCCcC
Q psy4019 17 VCGEENCTKRYTDPSSLRKHIKTVHGEHCYANKKHKGMPSKDKDGLGGASP 67 (109)
Q Consensus 17 ~C~~~~C~k~f~~~~~l~~H~~~h~~~kp~~C~~~~~~~~~~~~~~C~~~f 67 (109)
.|+. |+..|.-... ++..+....+|. .|+..|
T Consensus 4 ~CP~--C~~~f~v~~~-----~l~~~~~~vrC~------------~C~~~f 35 (37)
T PF13719_consen 4 TCPN--CQTRFRVPDD-----KLPAGGRKVRCP------------KCGHVF 35 (37)
T ss_pred ECCC--CCceEEcCHH-----HcccCCcEEECC------------CCCcEe
Confidence 5777 8888776554 222333467898 777766
No 55
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=48.28 E-value=4.6 Score=25.63 Aligned_cols=17 Identities=6% Similarity=-0.122 Sum_probs=12.6
Q ss_pred CcccCCCCCCCCCCCCCCCCCcCCCCCCc
Q psy4019 45 CYANKKHKGMPSKDKDGLGGASPSRSDDM 73 (109)
Q Consensus 45 p~~C~~~~~~~~~~~~~~C~~~f~~~~~l 73 (109)
.++|+ .||+.|.....+
T Consensus 28 ~~~c~------------~c~~~f~~~e~~ 44 (154)
T PRK00464 28 RRECL------------ACGKRFTTFERV 44 (154)
T ss_pred eeecc------------ccCCcceEeEec
Confidence 38999 999999765433
No 56
>PF12013 DUF3505: Protein of unknown function (DUF3505); InterPro: IPR022698 This family of proteins is functionally uncharacterised. This protein is found in eukaryotes. Proteins in this family are typically between 247 to 1018 amino acids in length. This region contains two segments that are likely to be C2H2 zinc binding domains.
Probab=46.23 E-value=19 Score=21.08 Aligned_cols=24 Identities=29% Similarity=0.600 Sum_probs=20.1
Q ss_pred ccc----ccCCCCCcCCChhHHHHHhhhhc
Q psy4019 16 YVC----GEENCTKRYTDPSSLRKHIKTVH 41 (109)
Q Consensus 16 ~~C----~~~~C~k~f~~~~~l~~H~~~h~ 41 (109)
|.| .. |+........+.+|.+.+.
T Consensus 81 ~~C~~~~~~--C~y~~~~~~~m~~H~~~~H 108 (109)
T PF12013_consen 81 YRCQCDPPH--CGYITRSKKTMRKHWRKEH 108 (109)
T ss_pred eeeecCCCC--CCcEeccHHHHHHHHHHhc
Confidence 788 77 9999999999999987654
No 57
>COG4957 Predicted transcriptional regulator [Transcription]
Probab=44.83 E-value=4.8 Score=25.01 Aligned_cols=22 Identities=14% Similarity=0.055 Sum_probs=18.8
Q ss_pred CCCCcCCCCCCccccccccccCCCC
Q psy4019 62 LGGASPSRSDDMPLSAKTASVSSPS 86 (109)
Q Consensus 62 ~C~~~f~~~~~l~~H~~~~~~~~p~ 86 (109)
++|+.| ..|++|+.+|.+.+|.
T Consensus 81 EDGkkf---KSLKRHL~t~~gmTPd 102 (148)
T COG4957 81 EDGKKF---KSLKRHLTTHYGLTPD 102 (148)
T ss_pred ccCcch---HHHHHHHhcccCCCHH
Confidence 789999 5899999999988764
No 58
>COG3357 Predicted transcriptional regulator containing an HTH domain fused to a Zn-ribbon [Transcription]
Probab=44.76 E-value=11 Score=21.86 Aligned_cols=26 Identities=19% Similarity=0.327 Sum_probs=17.4
Q ss_pred CCcccccCCCCCcCCChhHHHHHhhhhcCCCCcccCCC
Q psy4019 14 KPYVCGEENCTKRYTDPSSLRKHIKTVHGEHCYANKKH 51 (109)
Q Consensus 14 ~p~~C~~~~C~k~f~~~~~l~~H~~~h~~~kp~~C~~~ 51 (109)
+|-.|.. ||..|.... -.+|-.|+.|
T Consensus 57 ~Pa~Ckk--CGfef~~~~----------ik~pSRCP~C 82 (97)
T COG3357 57 RPARCKK--CGFEFRDDK----------IKKPSRCPKC 82 (97)
T ss_pred cChhhcc--cCccccccc----------cCCcccCCcc
Confidence 5778988 998886511 1357788844
No 59
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=44.69 E-value=16 Score=23.14 Aligned_cols=28 Identities=11% Similarity=0.297 Sum_probs=21.5
Q ss_pred CCCCCcccccCCCCCcCCChhHHHHHhhhhcCCCCcccC
Q psy4019 11 SSQKPYVCGEENCTKRYTDPSSLRKHIKTVHGEHCYANK 49 (109)
Q Consensus 11 ~~~~p~~C~~~~C~k~f~~~~~l~~H~~~h~~~kp~~C~ 49 (109)
.+..-|.|+. |+..|....++. .-|.|+
T Consensus 105 ~~~~~Y~Cp~--c~~r~tf~eA~~---------~~F~Cp 132 (158)
T TIGR00373 105 TNNMFFICPN--MCVRFTFNEAME---------LNFTCP 132 (158)
T ss_pred cCCCeEECCC--CCcEeeHHHHHH---------cCCcCC
Confidence 3445689988 999888877774 259999
No 60
>KOG2231|consensus
Probab=44.67 E-value=10 Score=29.73 Aligned_cols=21 Identities=29% Similarity=0.763 Sum_probs=15.4
Q ss_pred ccccCCCCCcCCChhHHHHHhhh
Q psy4019 17 VCGEENCTKRYTDPSSLRKHIKT 39 (109)
Q Consensus 17 ~C~~~~C~k~f~~~~~l~~H~~~ 39 (109)
.|.. |...|.....|.+|++.
T Consensus 184 ~C~~--C~~~fld~~el~rH~~~ 204 (669)
T KOG2231|consen 184 LCKF--CHERFLDDDELYRHLRF 204 (669)
T ss_pred cchh--hhhhhccHHHHHHhhcc
Confidence 4666 77778888888777764
No 61
>KOG0978|consensus
Probab=42.71 E-value=9.3 Score=30.09 Aligned_cols=23 Identities=9% Similarity=-0.125 Sum_probs=18.1
Q ss_pred CCCcccCCCCCCCCCCCCCCCCCcCCCCCCccccc
Q psy4019 43 EHCYANKKHKGMPSKDKDGLGGASPSRSDDMPLSA 77 (109)
Q Consensus 43 ~kp~~C~~~~~~~~~~~~~~C~~~f~~~~~l~~H~ 77 (109)
.+.-+|+ .|+..|...+.+.+|+
T Consensus 676 tRqRKCP------------~Cn~aFganDv~~I~l 698 (698)
T KOG0978|consen 676 TRQRKCP------------KCNAAFGANDVHRIHL 698 (698)
T ss_pred HhcCCCC------------CCCCCCCcccccccCC
Confidence 3456899 9999998888877664
No 62
>KOG3408|consensus
Probab=41.50 E-value=15 Score=22.48 Aligned_cols=24 Identities=33% Similarity=0.785 Sum_probs=21.1
Q ss_pred CCcccccCCCCCcCCChhHHHHHhhh
Q psy4019 14 KPYVCGEENCTKRYTDPSSLRKHIKT 39 (109)
Q Consensus 14 ~p~~C~~~~C~k~f~~~~~l~~H~~~ 39 (109)
..|.|.. |.+-|.....|..|.+.
T Consensus 56 GqfyCi~--CaRyFi~~~~l~~H~kt 79 (129)
T KOG3408|consen 56 GQFYCIE--CARYFIDAKALKTHFKT 79 (129)
T ss_pred ceeehhh--hhhhhcchHHHHHHHhc
Confidence 4588999 99999999999999874
No 63
>smart00614 ZnF_BED BED zinc finger. DNA-binding domain in chromatin-boundary-element-binding proteins and transposases
Probab=41.32 E-value=25 Score=17.47 Aligned_cols=21 Identities=33% Similarity=0.752 Sum_probs=15.9
Q ss_pred cccccCCCCCcCCC-----hhHHHHHhh
Q psy4019 16 YVCGEENCTKRYTD-----PSSLRKHIK 38 (109)
Q Consensus 16 ~~C~~~~C~k~f~~-----~~~l~~H~~ 38 (109)
-.|.. |++.+.. .++|.+|++
T Consensus 19 a~C~~--C~~~l~~~~~~gTs~L~rHl~ 44 (50)
T smart00614 19 AKCKY--CGKKLSRSSKGGTSNLRRHLR 44 (50)
T ss_pred EEecC--CCCEeeeCCCCCcHHHHHHHH
Confidence 46888 9987654 468888887
No 64
>PF13717 zinc_ribbon_4: zinc-ribbon domain
Probab=40.70 E-value=22 Score=16.59 Aligned_cols=32 Identities=13% Similarity=0.130 Sum_probs=20.1
Q ss_pred ccccCCCCCcCCChhHHHHHhhhhcCCCCcccCCCCCCCCCCCCCCCCCcC
Q psy4019 17 VCGEENCTKRYTDPSSLRKHIKTVHGEHCYANKKHKGMPSKDKDGLGGASP 67 (109)
Q Consensus 17 ~C~~~~C~k~f~~~~~l~~H~~~h~~~kp~~C~~~~~~~~~~~~~~C~~~f 67 (109)
.|+. |+..|.-.... +-......+|. .|+..|
T Consensus 4 ~Cp~--C~~~y~i~d~~-----ip~~g~~v~C~------------~C~~~f 35 (36)
T PF13717_consen 4 TCPN--CQAKYEIDDEK-----IPPKGRKVRCS------------KCGHVF 35 (36)
T ss_pred ECCC--CCCEEeCCHHH-----CCCCCcEEECC------------CCCCEe
Confidence 5777 88887766542 22233457888 777766
No 65
>KOG2593|consensus
Probab=40.09 E-value=23 Score=26.34 Aligned_cols=41 Identities=22% Similarity=0.356 Sum_probs=27.2
Q ss_pred cCCCCCcccccCCCCCcCCChhHHHHHhhhhcCCCCcccCCCCCCC
Q psy4019 10 LSSQKPYVCGEENCTKRYTDPSSLRKHIKTVHGEHCYANKKHKGMP 55 (109)
Q Consensus 10 h~~~~p~~C~~~~C~k~f~~~~~l~~H~~~h~~~kp~~C~~~~~~~ 55 (109)
-+...-|.|+. |.+.|.....++ .+-...--|.|..|.+-.
T Consensus 123 ~t~~~~Y~Cp~--C~kkyt~Lea~~---L~~~~~~~F~C~~C~gel 163 (436)
T KOG2593|consen 123 DTNVAGYVCPN--CQKKYTSLEALQ---LLDNETGEFHCENCGGEL 163 (436)
T ss_pred ccccccccCCc--cccchhhhHHHH---hhcccCceEEEecCCCch
Confidence 34556799999 999998777654 233333469999554433
No 66
>PF09538 FYDLN_acid: Protein of unknown function (FYDLN_acid); InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=40.00 E-value=23 Score=21.07 Aligned_cols=30 Identities=13% Similarity=0.248 Sum_probs=21.7
Q ss_pred cccccCCCCCcCCChhHHHHHhhhhcCCCCcccCCCCCCCCCCCCCCCCCcCCCC
Q psy4019 16 YVCGEENCTKRYTDPSSLRKHIKTVHGEHCYANKKHKGMPSKDKDGLGGASPSRS 70 (109)
Q Consensus 16 ~~C~~~~C~k~f~~~~~l~~H~~~h~~~kp~~C~~~~~~~~~~~~~~C~~~f~~~ 70 (109)
..|+. ||+.|.-.. ..|-.|+ .||..|.-.
T Consensus 10 R~Cp~--CG~kFYDLn-----------k~PivCP------------~CG~~~~~~ 39 (108)
T PF09538_consen 10 RTCPS--CGAKFYDLN-----------KDPIVCP------------KCGTEFPPE 39 (108)
T ss_pred ccCCC--CcchhccCC-----------CCCccCC------------CCCCccCcc
Confidence 47888 998876422 3577899 899988544
No 67
>TIGR02098 MJ0042_CXXC MJ0042 family finger-like domain. This domain contains a CXXCX(19)CXXC motif suggestive of both zinc fingers and thioredoxin, usually found at the N-terminus of prokaryotic proteins. One partially characterized gene, agmX, is among a large set in Myxococcus whose interruption affects adventurous gliding motility.
Probab=39.55 E-value=18 Score=16.75 Aligned_cols=33 Identities=12% Similarity=0.019 Sum_probs=19.8
Q ss_pred cccccCCCCCcCCChhHHHHHhhhhcCCCCcccCCCCCCCCCCCCCCCCCcC
Q psy4019 16 YVCGEENCTKRYTDPSSLRKHIKTVHGEHCYANKKHKGMPSKDKDGLGGASP 67 (109)
Q Consensus 16 ~~C~~~~C~k~f~~~~~l~~H~~~h~~~kp~~C~~~~~~~~~~~~~~C~~~f 67 (109)
+.|+. |+..|.-..... ........|+ .|+..|
T Consensus 3 ~~CP~--C~~~~~v~~~~~-----~~~~~~v~C~------------~C~~~~ 35 (38)
T TIGR02098 3 IQCPN--CKTSFRVVDSQL-----GANGGKVRCG------------KCGHVW 35 (38)
T ss_pred EECCC--CCCEEEeCHHHc-----CCCCCEEECC------------CCCCEE
Confidence 46877 888776554421 1112246888 787766
No 68
>PF02892 zf-BED: BED zinc finger; InterPro: IPR003656 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents predicted BED-type zinc finger domains. The BED finger which was named after the Drosophila proteins BEAF and DREF, is found in one or more copies in cellular regulatory factors and transposases from plants, animals and fungi. The BED finger is an about 50 to 60 amino acid residues domain that contains a characteristic motif with two highly conserved aromatic positions, as well as a shared pattern of cysteines and histidines that is predicted to form a zinc finger. As diverse BED fingers are able to bind DNA, it has been suggested that DNA-binding is the general function of this domain []. Some proteins known to contain a BED domain include animal, plant and fungi AC1 and Hobo-like transposases; Caenorhabditis elegans Dpy-20 protein, a predicted cuticular gene transcriptional regulator; Drosophila BEAF (boundary element-associated factor), thought to be involved in chromatin insulation; Drosophila DREF, a transcriptional regulator for S-phase genes; and tobacco 3AF1 and tomato E4/E8-BP1, light- and ethylene-regulated DNA binding proteins that contain two BED fingers. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003677 DNA binding; PDB: 2DJR_A 2CT5_A.
Probab=38.78 E-value=26 Score=16.76 Aligned_cols=25 Identities=32% Similarity=0.550 Sum_probs=15.8
Q ss_pred CCCCcccccCCCCCcCCC----hhHHHHHhh
Q psy4019 12 SQKPYVCGEENCTKRYTD----PSSLRKHIK 38 (109)
Q Consensus 12 ~~~p~~C~~~~C~k~f~~----~~~l~~H~~ 38 (109)
+..-..|.. |++.+.. .++|.+|++
T Consensus 13 ~~~~a~C~~--C~~~~~~~~~~ts~l~~HL~ 41 (45)
T PF02892_consen 13 DKKKAKCKY--CGKVIKYSSGGTSNLKRHLK 41 (45)
T ss_dssp CSS-EEETT--TTEE-----SSTHHHHHHHH
T ss_pred CcCeEEeCC--CCeEEeeCCCcHHHHHHhhh
Confidence 345567999 9987765 578888873
No 69
>KOG4167|consensus
Probab=38.64 E-value=1.9 Score=33.93 Aligned_cols=22 Identities=14% Similarity=-0.082 Sum_probs=19.1
Q ss_pred CCCCcCCCCCCccccccccccC
Q psy4019 62 LGGASPSRSDDMPLSAKTASVS 83 (109)
Q Consensus 62 ~C~~~f~~~~~l~~H~~~~~~~ 83 (109)
+|+|.|-.-..+..|||+|...
T Consensus 797 eC~kvF~KiKSrNAHMK~Hr~q 818 (907)
T KOG4167|consen 797 ECGKVFFKIKSRNAHMKTHRQQ 818 (907)
T ss_pred HHHHHHHHHhhhhHHHHHHHHH
Confidence 9999998888899999988653
No 70
>PF15269 zf-C2H2_7: Zinc-finger
Probab=37.76 E-value=28 Score=17.39 Aligned_cols=22 Identities=32% Similarity=0.572 Sum_probs=18.0
Q ss_pred cccccCCCCCcCCChhHHHHHhhh
Q psy4019 16 YVCGEENCTKRYTDPSSLRKHIKT 39 (109)
Q Consensus 16 ~~C~~~~C~k~f~~~~~l~~H~~~ 39 (109)
|.|-. |..+....+.|-.|++.
T Consensus 21 ykcfq--cpftc~~kshl~nhmky 42 (54)
T PF15269_consen 21 YKCFQ--CPFTCNEKSHLFNHMKY 42 (54)
T ss_pred ceeec--CCcccchHHHHHHHHHH
Confidence 56877 88888888999988864
No 71
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=37.60 E-value=27 Score=21.74 Aligned_cols=32 Identities=16% Similarity=0.304 Sum_probs=20.1
Q ss_pred CCCCcccccCCCCCcCCChhHHHHHhhhhcCCCCcccC
Q psy4019 12 SQKPYVCGEENCTKRYTDPSSLRKHIKTVHGEHCYANK 49 (109)
Q Consensus 12 ~~~p~~C~~~~C~k~f~~~~~l~~H~~~h~~~kp~~C~ 49 (109)
+..-|.|+. |+..|.....+.. ... ..-|.|+
T Consensus 96 ~~~~Y~Cp~--C~~~y~~~ea~~~---~d~-~~~f~Cp 127 (147)
T smart00531 96 NNAYYKCPN--CQSKYTFLEANQL---LDM-DGTFTCP 127 (147)
T ss_pred CCcEEECcC--CCCEeeHHHHHHh---cCC-CCcEECC
Confidence 345689998 9998886544332 011 2339999
No 72
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=35.95 E-value=26 Score=22.68 Aligned_cols=26 Identities=15% Similarity=0.387 Sum_probs=19.9
Q ss_pred CCCcccccCCCCCcCCChhHHHHHhhhhcCCCCcccC
Q psy4019 13 QKPYVCGEENCTKRYTDPSSLRKHIKTVHGEHCYANK 49 (109)
Q Consensus 13 ~~p~~C~~~~C~k~f~~~~~l~~H~~~h~~~kp~~C~ 49 (109)
..-|.|+. |+..|....++. .-|.|+
T Consensus 115 ~~~Y~Cp~--C~~rytf~eA~~---------~~F~Cp 140 (178)
T PRK06266 115 NMFFFCPN--CHIRFTFDEAME---------YGFRCP 140 (178)
T ss_pred CCEEECCC--CCcEEeHHHHhh---------cCCcCC
Confidence 35688988 999888777653 359999
No 73
>KOG4167|consensus
Probab=35.89 E-value=5.8 Score=31.41 Aligned_cols=25 Identities=32% Similarity=0.534 Sum_probs=22.4
Q ss_pred CcccccCCCCCcCCChhHHHHHhhhhc
Q psy4019 15 PYVCGEENCTKRYTDPSSLRKHIKTVH 41 (109)
Q Consensus 15 p~~C~~~~C~k~f~~~~~l~~H~~~h~ 41 (109)
-|.|.. |+|.|...-.++.||+.|.
T Consensus 792 iFpCre--C~kvF~KiKSrNAHMK~Hr 816 (907)
T KOG4167|consen 792 IFPCRE--CGKVFFKIKSRNAHMKTHR 816 (907)
T ss_pred eeehHH--HHHHHHHHhhhhHHHHHHH
Confidence 388999 9999999999999999886
No 74
>PRK09678 DNA-binding transcriptional regulator; Provisional
Probab=35.63 E-value=21 Score=19.70 Aligned_cols=40 Identities=13% Similarity=0.062 Sum_probs=21.9
Q ss_pred cccccCCCCCcCCChhHHHHHhhhhcCCCCcccCCCCCCCCCCCCCCCCCcCCC
Q psy4019 16 YVCGEENCTKRYTDPSSLRKHIKTVHGEHCYANKKHKGMPSKDKDGLGGASPSR 69 (109)
Q Consensus 16 ~~C~~~~C~k~f~~~~~l~~H~~~h~~~kp~~C~~~~~~~~~~~~~~C~~~f~~ 69 (109)
+.|+. ||....-...-... ....+..++|.-- .||..|..
T Consensus 2 m~CP~--Cg~~a~irtSr~~s--~~~~~~Y~qC~N~----------eCg~tF~t 41 (72)
T PRK09678 2 FHCPL--CQHAAHARTSRYIT--DTTKERYHQCQNV----------NCSATFIT 41 (72)
T ss_pred ccCCC--CCCccEEEEChhcC--hhhheeeeecCCC----------CCCCEEEE
Confidence 46888 88654322221111 1244567788622 69999943
No 75
>smart00734 ZnF_Rad18 Rad18-like CCHC zinc finger. Yeast Rad18p functions with Rad5p in error-free post-replicative DNA repair. This zinc finger is likely to bind nucleic-acids.
Probab=33.93 E-value=45 Score=14.29 Aligned_cols=19 Identities=16% Similarity=0.490 Sum_probs=13.7
Q ss_pred ccccCCCCCcCCChhHHHHHhh
Q psy4019 17 VCGEENCTKRYTDPSSLRKHIK 38 (109)
Q Consensus 17 ~C~~~~C~k~f~~~~~l~~H~~ 38 (109)
.|+. |++.+ ....+..|..
T Consensus 3 ~CPi--C~~~v-~~~~in~HLD 21 (26)
T smart00734 3 QCPV--CFREV-PENLINSHLD 21 (26)
T ss_pred cCCC--CcCcc-cHHHHHHHHH
Confidence 5888 99887 5566777764
No 76
>PRK14890 putative Zn-ribbon RNA-binding protein; Provisional
Probab=33.25 E-value=38 Score=17.92 Aligned_cols=29 Identities=10% Similarity=0.207 Sum_probs=16.1
Q ss_pred CCcccccCCCCCcCCChhHHHHHhhhhcCCCCcccC
Q psy4019 14 KPYVCGEENCTKRYTDPSSLRKHIKTVHGEHCYANK 49 (109)
Q Consensus 14 ~p~~C~~~~C~k~f~~~~~l~~H~~~h~~~kp~~C~ 49 (109)
..|.|+. ||........-- -....+|.|+
T Consensus 24 ~~F~CPn--CG~~~I~RC~~C-----Rk~~~~Y~CP 52 (59)
T PRK14890 24 VKFLCPN--CGEVIIYRCEKC-----RKQSNPYTCP 52 (59)
T ss_pred CEeeCCC--CCCeeEeechhH-----HhcCCceECC
Confidence 3578887 887633332211 1123578998
No 77
>KOG4173|consensus
Probab=32.51 E-value=5.2 Score=26.77 Aligned_cols=48 Identities=19% Similarity=0.191 Sum_probs=32.0
Q ss_pred ccccCCCCCcCCChhHHHHHhh-hh---------cCCCCcccCCCCCCCCCCCCCCCCCcCCCCCCcccc
Q psy4019 17 VCGEENCTKRYTDPSSLRKHIK-TV---------HGEHCYANKKHKGMPSKDKDGLGGASPSRSDDMPLS 76 (109)
Q Consensus 17 ~C~~~~C~k~f~~~~~l~~H~~-~h---------~~~kp~~C~~~~~~~~~~~~~~C~~~f~~~~~l~~H 76 (109)
.|.. |.+.|....-|..|+. .| .|.-.|+|-+. .|+..|.....-..|
T Consensus 108 sCs~--C~r~~Pt~hLLd~HI~E~HDs~Fqa~veRG~dMy~ClvE----------gCt~KFkT~r~RkdH 165 (253)
T KOG4173|consen 108 SCSF--CKRAFPTGHLLDAHILEWHDSLFQALVERGQDMYQCLVE----------GCTEKFKTSRDRKDH 165 (253)
T ss_pred hhHH--HHHhCCchhhhhHHHHHHHHHHHHHHHHcCccHHHHHHH----------hhhhhhhhhhhhhhH
Confidence 6888 9999988888877764 12 34456888543 788878655444333
No 78
>COG1592 Rubrerythrin [Energy production and conversion]
Probab=32.24 E-value=38 Score=21.87 Aligned_cols=20 Identities=25% Similarity=0.360 Sum_probs=15.0
Q ss_pred CcccccCCCCCcCCChhHHHHHhhhhcCCCCcccC
Q psy4019 15 PYVCGEENCTKRYTDPSSLRKHIKTVHGEHCYANK 49 (109)
Q Consensus 15 p~~C~~~~C~k~f~~~~~l~~H~~~h~~~kp~~C~ 49 (109)
-|.|.+ ||.. +-|+-|-.|+
T Consensus 134 ~~vC~v--CGy~-------------~~ge~P~~CP 153 (166)
T COG1592 134 VWVCPV--CGYT-------------HEGEAPEVCP 153 (166)
T ss_pred EEEcCC--CCCc-------------ccCCCCCcCC
Confidence 588998 9864 3456788999
No 79
>KOG2461|consensus
Probab=31.45 E-value=69 Score=23.63 Aligned_cols=68 Identities=4% Similarity=-0.287 Sum_probs=37.3
Q ss_pred hccccCCCCCcccccCCCCCcCCChhHHHHHhhhhcCCCCcccCCCCCCCCCCCCCCCCCcCCCCCCccccccccccCCC
Q psy4019 6 AYPLLSSQKPYVCGEENCTKRYTDPSSLRKHIKTVHGEHCYANKKHKGMPSKDKDGLGGASPSRSDDMPLSAKTASVSSP 85 (109)
Q Consensus 6 H~~~h~~~~p~~C~~~~C~k~f~~~~~l~~H~~~h~~~kp~~C~~~~~~~~~~~~~~C~~~f~~~~~l~~H~~~~~~~~p 85 (109)
+...+++..+..+.. +.+.+.....+..+...+.++.++.+. .+...+.....+..+.-++.....
T Consensus 322 ~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~ 387 (396)
T KOG2461|consen 322 SEVPATVSVWTGETI--PVRTPAGQLIYTQSHSMEVAEPTDMAP------------NQIWKIYHTGVLGFLIITTDESEC 387 (396)
T ss_pred ccccccccccCcCcc--cccccccccchhhhhhcccCCCCcccc------------cccccceeccccceeeeecccccc
Confidence 334455555555555 555666666667777777777777666 333333344444444555555444
Q ss_pred CC
Q psy4019 86 SI 87 (109)
Q Consensus 86 ~~ 87 (109)
+.
T Consensus 388 ~~ 389 (396)
T KOG2461|consen 388 NN 389 (396)
T ss_pred cc
Confidence 43
No 80
>COG1997 RPL43A Ribosomal protein L37AE/L43A [Translation, ribosomal structure and biogenesis]
Probab=30.88 E-value=24 Score=20.25 Aligned_cols=33 Identities=18% Similarity=0.292 Sum_probs=20.4
Q ss_pred CCcccccCCCCCcCCChhHHHHHhhhhcCCCCcccCCCCCCCCCCCCCCCCCcCCCC
Q psy4019 14 KPYVCGEENCTKRYTDPSSLRKHIKTVHGEHCYANKKHKGMPSKDKDGLGGASPSRS 70 (109)
Q Consensus 14 ~p~~C~~~~C~k~f~~~~~l~~H~~~h~~~kp~~C~~~~~~~~~~~~~~C~~~f~~~ 70 (109)
..|.|+. |++. .+.+.-+| -+.|. -|+..|.-.
T Consensus 34 ~~~~Cp~--C~~~--------~VkR~a~G--IW~C~------------kCg~~fAGg 66 (89)
T COG1997 34 AKHVCPF--CGRT--------TVKRIATG--IWKCR------------KCGAKFAGG 66 (89)
T ss_pred cCCcCCC--CCCc--------ceeeeccC--eEEcC------------CCCCeeccc
Confidence 3577887 8764 22334444 46788 788888543
No 81
>smart00834 CxxC_CXXC_SSSS Putative regulatory protein. CxxC_CXXC_SSSS represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=30.81 E-value=22 Score=16.55 Aligned_cols=7 Identities=14% Similarity=-0.106 Sum_probs=5.4
Q ss_pred CCCCcCC
Q psy4019 62 LGGASPS 68 (109)
Q Consensus 62 ~C~~~f~ 68 (109)
.||..|.
T Consensus 10 ~Cg~~fe 16 (41)
T smart00834 10 DCGHTFE 16 (41)
T ss_pred CCCCEEE
Confidence 8888773
No 82
>PF05443 ROS_MUCR: ROS/MUCR transcriptional regulator protein; InterPro: IPR008807 This family consists of several ROS/MUCR transcriptional regulator proteins. The ros chromosomal gene is present in octopine and nopaline strains of Agrobacterium tumefaciens as well as in Rhizobium meliloti (Sinorhizobium meliloti). This gene encodes a 15.5 kDa protein that specifically represses the virC and virD operons in the virulence region of the Ti plasmid [] and is necessary for succinoglycan production []. S. meliloti can produce two types of acidic exopolysaccharides, succinoglycan and galactoglucan, that are interchangeable for infection of Medicago sativa (Alfalfa) nodules. MucR from S. meliloti acts as a transcriptional repressor that blocks the expression of the exp genes responsible for galactoglucan production therefore allowing the exclusive production of succinoglycan [].; GO: 0003677 DNA binding, 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 2JSP_A.
Probab=30.60 E-value=22 Score=21.97 Aligned_cols=25 Identities=36% Similarity=0.832 Sum_probs=15.3
Q ss_pred cccccCCCCCcCCChhHHHHHhhhhcCCCC
Q psy4019 16 YVCGEENCTKRYTDPSSLRKHIKTVHGEHC 45 (109)
Q Consensus 16 ~~C~~~~C~k~f~~~~~l~~H~~~h~~~kp 45 (109)
..|-. ||+.|.. |.+|++.|.|-.|
T Consensus 73 i~cle--cGk~~k~---LkrHL~~~~gltp 97 (132)
T PF05443_consen 73 IICLE--CGKKFKT---LKRHLRTHHGLTP 97 (132)
T ss_dssp EE-TB--T--EESB---HHHHHHHTT-S-H
T ss_pred eEEcc--CCcccch---HHHHHHHccCCCH
Confidence 56888 9999875 5899998866433
No 83
>cd00729 rubredoxin_SM Rubredoxin, Small Modular nonheme iron binding domain containing a [Fe(SCys)4] center, present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), and believed to be involved in electron transfer. Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain. Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=29.06 E-value=42 Score=15.40 Aligned_cols=11 Identities=27% Similarity=0.528 Sum_probs=7.6
Q ss_pred CcccccCCCCCcC
Q psy4019 15 PYVCGEENCTKRY 27 (109)
Q Consensus 15 p~~C~~~~C~k~f 27 (109)
.|.|.. ||..+
T Consensus 2 ~~~C~~--CG~i~ 12 (34)
T cd00729 2 VWVCPV--CGYIH 12 (34)
T ss_pred eEECCC--CCCEe
Confidence 367887 88654
No 84
>PF09723 Zn-ribbon_8: Zinc ribbon domain; InterPro: IPR013429 This entry represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB []. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=27.78 E-value=25 Score=16.94 Aligned_cols=7 Identities=14% Similarity=-0.111 Sum_probs=5.5
Q ss_pred CCCCcCC
Q psy4019 62 LGGASPS 68 (109)
Q Consensus 62 ~C~~~f~ 68 (109)
.||..|.
T Consensus 10 ~Cg~~fe 16 (42)
T PF09723_consen 10 ECGHEFE 16 (42)
T ss_pred CCCCEEE
Confidence 8888884
No 85
>TIGR02605 CxxC_CxxC_SSSS putative regulatory protein, FmdB family. This model represents a region of about 50 amino acids found in a number of small proteins in a wide range of bacteria. The region begins usually with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One member of this family is has been noted as a putative regulatory protein, designated FmdB (PubMed:8841393). Most members of this family have a C-terminal region containing highly degenerate sequence, such as SSTSESTKSSGSSGSSGSSESKASGSTEKSTSSTTAAAAV in Mycobacterium tuberculosis and VAVGGSAPAPSPAPRAGGGGGGCCGGGCCG in Streptomyces avermitilis. These low complexity regions, which are not included in the model, resemble low-complexity C-terminal regions of some heterocycle-containing bacteriocin precursors.
Probab=27.14 E-value=18 Score=18.06 Aligned_cols=14 Identities=29% Similarity=0.743 Sum_probs=10.6
Q ss_pred CcccccCCCCCcCCCh
Q psy4019 15 PYVCGEENCTKRYTDP 30 (109)
Q Consensus 15 p~~C~~~~C~k~f~~~ 30 (109)
.|.|.. |+..|...
T Consensus 5 ey~C~~--Cg~~fe~~ 18 (52)
T TIGR02605 5 EYRCTA--CGHRFEVL 18 (52)
T ss_pred EEEeCC--CCCEeEEE
Confidence 378988 99887654
No 86
>PF10571 UPF0547: Uncharacterised protein family UPF0547; InterPro: IPR018886 This domain may well be a type of zinc-finger as it carries two pairs of highly conserved cysteine residues though with no accompanying histidines. Several members are annotated as putative helicases.
Probab=25.41 E-value=44 Score=14.44 Aligned_cols=11 Identities=18% Similarity=0.646 Sum_probs=8.0
Q ss_pred cccccCCCCCcCC
Q psy4019 16 YVCGEENCTKRYT 28 (109)
Q Consensus 16 ~~C~~~~C~k~f~ 28 (109)
-.|+. ||..|.
T Consensus 15 ~~Cp~--CG~~F~ 25 (26)
T PF10571_consen 15 KFCPH--CGYDFE 25 (26)
T ss_pred CcCCC--CCCCCc
Confidence 35877 888774
No 87
>smart00154 ZnF_AN1 AN1-like Zinc finger. Zinc finger at the C-terminus of An1, a ubiquitin-like protein in Xenopus laevis.
Probab=25.20 E-value=41 Score=15.99 Aligned_cols=13 Identities=8% Similarity=-0.018 Sum_probs=10.1
Q ss_pred CcccCCCCCCCCCCCCCCCCCcCCC
Q psy4019 45 CYANKKHKGMPSKDKDGLGGASPSR 69 (109)
Q Consensus 45 p~~C~~~~~~~~~~~~~~C~~~f~~ 69 (109)
||+|. .|++.|-.
T Consensus 12 ~f~C~------------~C~~~FC~ 24 (39)
T smart00154 12 GFKCR------------HCGNLFCG 24 (39)
T ss_pred CeECC------------ccCCcccc
Confidence 78898 78888843
No 88
>smart00659 RPOLCX RNA polymerase subunit CX. present in RNA polymerase I, II and III
Probab=25.16 E-value=58 Score=15.97 Aligned_cols=12 Identities=42% Similarity=1.132 Sum_probs=9.0
Q ss_pred cccccCCCCCcCCC
Q psy4019 16 YVCGEENCTKRYTD 29 (109)
Q Consensus 16 ~~C~~~~C~k~f~~ 29 (109)
|.|.. ||..|..
T Consensus 3 Y~C~~--Cg~~~~~ 14 (44)
T smart00659 3 YICGE--CGRENEI 14 (44)
T ss_pred EECCC--CCCEeec
Confidence 77888 9887653
No 89
>KOG1146|consensus
Probab=24.46 E-value=18 Score=30.72 Aligned_cols=56 Identities=11% Similarity=0.242 Sum_probs=42.5
Q ss_pred CCCcccccCCCCCcCCChhHHHHHhhhhc-----------------CCCCcccCCCCCCCCCCCCCCCCCcCCCCCCccc
Q psy4019 13 QKPYVCGEENCTKRYTDPSSLRKHIKTVH-----------------GEHCYANKKHKGMPSKDKDGLGGASPSRSDDMPL 75 (109)
Q Consensus 13 ~~p~~C~~~~C~k~f~~~~~l~~H~~~h~-----------------~~kp~~C~~~~~~~~~~~~~~C~~~f~~~~~l~~ 75 (109)
.++|.|.. |.-.|.....|..|.+... ..++| |. .|...|+....|.+
T Consensus 1282 ~~~~~~~~--~~~~~~~~~~l~~~~~k~~~~~~~~~~~~~~~l~~~d~~~~-c~------------~c~~~~~~~~alqi 1346 (1406)
T KOG1146|consen 1282 THRYLCRQ--CKMAFDGEAPLTAHQRKFCFAGRGSGGSMPPPLRVPDCTYH-CL------------ACEVLLSGREALQI 1346 (1406)
T ss_pred chhHHHHH--HHhhhcchhHHHHHHHHHHhccCccccCCCCcccCcccccc-ch------------HHHhhcchhHHHHH
Confidence 46789999 9999999999998875321 01234 77 89999999999999
Q ss_pred cccccccC
Q psy4019 76 SAKTASVS 83 (109)
Q Consensus 76 H~~~~~~~ 83 (109)
|++.....
T Consensus 1347 hm~~~~~~ 1354 (1406)
T KOG1146|consen 1347 HMRSSAHR 1354 (1406)
T ss_pred HHHHhhhc
Confidence 99865543
No 90
>PF07754 DUF1610: Domain of unknown function (DUF1610); InterPro: IPR011668 This domain is found in archaeal species. It is likely to bind zinc via its four well-conserved cysteine residues.
Probab=23.35 E-value=43 Score=14.32 Aligned_cols=6 Identities=0% Similarity=-0.114 Sum_probs=4.6
Q ss_pred CCcccC
Q psy4019 44 HCYANK 49 (109)
Q Consensus 44 kp~~C~ 49 (109)
..|.|+
T Consensus 15 v~f~CP 20 (24)
T PF07754_consen 15 VPFPCP 20 (24)
T ss_pred ceEeCC
Confidence 468888
No 91
>PF09845 DUF2072: Zn-ribbon containing protein (DUF2072); InterPro: IPR018645 This archaeal Zinc-ribbon containing proteins have no known function.
Probab=22.60 E-value=45 Score=20.66 Aligned_cols=15 Identities=33% Similarity=0.820 Sum_probs=12.5
Q ss_pred CcccccCCCCCcCCChh
Q psy4019 15 PYVCGEENCTKRYTDPS 31 (109)
Q Consensus 15 p~~C~~~~C~k~f~~~~ 31 (109)
|++|.. ||+.|...+
T Consensus 1 PH~Ct~--Cg~~f~dgs 15 (131)
T PF09845_consen 1 PHQCTK--CGRVFEDGS 15 (131)
T ss_pred CcccCc--CCCCcCCCc
Confidence 688999 999998655
No 92
>PF08790 zf-LYAR: LYAR-type C2HC zinc finger ; InterPro: IPR014898 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This C2HC zinc finger domain is found in LYAR proteins such as Q08288 from SWISSPROT, which are involved in cell growth regulation. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 1WJV_A.
Probab=22.04 E-value=12 Score=16.66 Aligned_cols=18 Identities=28% Similarity=0.827 Sum_probs=9.6
Q ss_pred cccccCCCCCcCCChhHHHHH
Q psy4019 16 YVCGEENCTKRYTDPSSLRKH 36 (109)
Q Consensus 16 ~~C~~~~C~k~f~~~~~l~~H 36 (109)
|.|.. |++.|. ....+.|
T Consensus 1 ~sCiD--C~~~F~-~~~y~~H 18 (28)
T PF08790_consen 1 FSCID--CSKDFD-GDSYKSH 18 (28)
T ss_dssp EEETT--TTEEEE-GGGTTT-
T ss_pred Ceeec--CCCCcC-cCCcCCC
Confidence 35666 887773 3334444
No 93
>KOG2482|consensus
Probab=21.69 E-value=61 Score=23.63 Aligned_cols=23 Identities=26% Similarity=0.642 Sum_probs=20.8
Q ss_pred CcccccCCCCCcCCChhHHHHHhhh
Q psy4019 15 PYVCGEENCTKRYTDPSSLRKHIKT 39 (109)
Q Consensus 15 p~~C~~~~C~k~f~~~~~l~~H~~~ 39 (109)
.+.|.. |.+.|..+..|..||+.
T Consensus 195 r~~CLy--CekifrdkntLkeHMrk 217 (423)
T KOG2482|consen 195 RLRCLY--CEKIFRDKNTLKEHMRK 217 (423)
T ss_pred hheeee--eccccCCcHHHHHHHHh
Confidence 478999 99999999999999985
No 94
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=21.68 E-value=77 Score=19.56 Aligned_cols=30 Identities=10% Similarity=0.191 Sum_probs=21.5
Q ss_pred cccccCCCCCcCCChhHHHHHhhhhcCCCCcccCCCCCCCCCCCCCCCCCcCCCC
Q psy4019 16 YVCGEENCTKRYTDPSSLRKHIKTVHGEHCYANKKHKGMPSKDKDGLGGASPSRS 70 (109)
Q Consensus 16 ~~C~~~~C~k~f~~~~~l~~H~~~h~~~kp~~C~~~~~~~~~~~~~~C~~~f~~~ 70 (109)
..|+. ||+.|.-. ...|-.|+ .||..|...
T Consensus 10 r~Cp~--cg~kFYDL-----------nk~p~vcP------------~cg~~~~~~ 39 (129)
T TIGR02300 10 RICPN--TGSKFYDL-----------NRRPAVSP------------YTGEQFPPE 39 (129)
T ss_pred ccCCC--cCcccccc-----------CCCCccCC------------CcCCccCcc
Confidence 57888 99887642 23578899 888887544
No 95
>PF07975 C1_4: TFIIH C1-like domain; InterPro: IPR004595 All proteins in this domain for which functions are known are components of the TFIIH complex which is involved in the initiation of transcription and nucleotide excision repair. It includes the yeast transcription factor Ssl1 (Suppressor of stem-loop protein 1) that is essential for translation initiation and affects UV resistance. The C-terminal region is essential for transcription activity. This regions binds three zinc atoms through two independent domain. The first contains a C4 zinc finger motif, whereas the second is characterised by a CX(2)CX(2-4)FCADCD motif. The solution structure of the second C-terminal domain revealed homology with the regulatory domain of protein kinase C [].; GO: 0006281 DNA repair, 0005634 nucleus; PDB: 1Z60_A.
Probab=21.25 E-value=17 Score=18.62 Aligned_cols=25 Identities=16% Similarity=0.353 Sum_probs=13.9
Q ss_pred CCcccccCCCCCcCCChhHHHHHhhhh
Q psy4019 14 KPYVCGEENCTKRYTDPSSLRKHIKTV 40 (109)
Q Consensus 14 ~p~~C~~~~C~k~f~~~~~l~~H~~~h 40 (109)
..|.|+. |...|-..-++-.|..+|
T Consensus 20 ~~y~C~~--C~~~FC~dCD~fiHE~LH 44 (51)
T PF07975_consen 20 SRYRCPK--CKNHFCIDCDVFIHETLH 44 (51)
T ss_dssp EEE--TT--TT--B-HHHHHTTTTTS-
T ss_pred CeEECCC--CCCccccCcChhhhcccc
Confidence 4699999 999998777776665444
No 96
>KOG2636|consensus
Probab=20.62 E-value=78 Score=23.88 Aligned_cols=28 Identities=21% Similarity=0.466 Sum_probs=22.8
Q ss_pred ccCCCCCcccccCCCC-CcCCChhHHHHHhh
Q psy4019 9 LLSSQKPYVCGEENCT-KRYTDPSSLRKHIK 38 (109)
Q Consensus 9 ~h~~~~p~~C~~~~C~-k~f~~~~~l~~H~~ 38 (109)
.|--.+.|.|.. || +++.....+.+|..
T Consensus 395 LHGL~~ey~CEI--CGNy~Y~GrkaF~RHF~ 423 (497)
T KOG2636|consen 395 LHGLDIEYNCEI--CGNYVYKGRKAFDRHFN 423 (497)
T ss_pred hcCCCcccceee--ccCccccCcHHHHHHhH
Confidence 355567899999 99 78889999999864
Done!