Query psy7758
Match_columns 235
No_of_seqs 262 out of 1483
Neff 7.6
Searched_HMMs 46136
Date Fri Aug 16 18:03:58 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy7758.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/7758hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG4442|consensus 100.0 2.2E-48 4.7E-53 361.3 14.5 175 2-177 93-268 (729)
2 KOG1082|consensus 100.0 5.7E-32 1.2E-36 244.7 11.1 171 1-172 150-357 (364)
3 KOG1080|consensus 100.0 1.7E-31 3.8E-36 261.8 11.9 139 28-168 866-1004(1005)
4 smart00317 SET SET (Su(var)3-9 99.9 6.8E-27 1.5E-31 176.5 15.0 116 29-144 1-116 (116)
5 KOG1079|consensus 99.9 9.1E-28 2E-32 222.9 9.1 146 4-151 560-715 (739)
6 KOG1083|consensus 99.9 1.5E-24 3.2E-29 208.7 1.8 137 15-152 1164-1301(1306)
7 KOG1085|consensus 99.8 1.4E-20 3E-25 160.5 9.6 124 25-148 253-380 (392)
8 KOG1141|consensus 99.8 4.8E-19 1E-23 167.6 7.9 165 3-168 981-1261(1262)
9 PF00856 SET: SET domain; Int 99.8 1.9E-18 4E-23 136.1 7.1 107 39-145 1-162 (162)
10 COG2940 Proteins containing SE 99.7 5.3E-19 1.1E-23 165.6 2.0 145 24-168 328-479 (480)
11 KOG2589|consensus 99.6 8.5E-16 1.8E-20 135.1 7.1 123 36-168 135-257 (453)
12 KOG1081|consensus 99.5 3.1E-15 6.8E-20 138.8 1.7 151 3-171 289-439 (463)
13 KOG2461|consensus 98.7 2.1E-08 4.6E-13 91.6 5.5 115 26-150 26-148 (396)
14 KOG1141|consensus 97.8 5.7E-06 1.2E-10 80.1 0.0 67 3-70 774-841 (1262)
15 smart00570 AWS associated with 97.5 5.1E-05 1.1E-09 49.3 1.4 25 2-27 27-51 (51)
16 smart00508 PostSET Cysteine-ri 96.4 0.0017 3.6E-08 36.2 1.4 18 154-171 2-19 (26)
17 KOG2084|consensus 95.3 0.023 5.1E-07 52.5 4.6 44 104-151 208-252 (482)
18 KOG1337|consensus 92.9 0.085 1.8E-06 49.7 3.1 41 103-146 238-278 (472)
19 smart00317 SET SET (Su(var)3-9 76.3 4.3 9.4E-05 29.3 3.9 34 22-55 77-113 (116)
20 PF08666 SAF: SAF domain; Int 75.5 1.6 3.5E-05 28.7 1.3 15 127-141 3-17 (63)
21 KOG1338|consensus 67.5 4.6 0.0001 37.2 2.7 45 101-149 218-263 (466)
22 smart00858 SAF This domain fam 51.3 10 0.00022 24.7 1.5 16 127-142 3-18 (64)
23 TIGR02059 swm_rep_I cyanobacte 49.2 40 0.00087 25.0 4.5 24 124-147 75-98 (101)
24 PF14100 PmoA: Methane oxygena 47.8 29 0.00062 30.3 4.2 104 29-147 144-253 (271)
25 KOG1338|consensus 43.8 16 0.00034 33.8 2.0 34 27-60 25-62 (466)
26 KOG1081|consensus 40.2 16 0.00035 34.6 1.5 20 213-233 86-105 (463)
27 PF00856 SET: SET domain; Int 38.8 49 0.0011 24.7 3.9 34 23-56 123-159 (162)
28 KOG2155|consensus 38.3 17 0.00038 34.1 1.4 48 101-148 204-253 (631)
29 COG1188 Ribosome-associated he 34.0 47 0.001 24.7 2.8 23 128-150 43-65 (100)
30 PF00098 zf-CCHC: Zinc knuckle 32.1 24 0.00052 17.6 0.7 10 218-228 2-11 (18)
31 TIGR01164 rplP_bact ribosomal 30.3 80 0.0017 24.3 3.8 41 29-71 77-117 (126)
32 smart00400 ZnF_CHCC zinc finge 29.1 15 0.00033 23.6 -0.4 15 215-230 22-36 (55)
33 TIGR03569 NeuB_NnaB N-acetylne 25.8 34 0.00073 30.9 1.1 20 125-144 277-296 (329)
34 PF05077 DUF678: Protein of un 25.7 34 0.00073 23.8 0.8 15 209-223 15-29 (74)
35 PF00628 PHD: PHD-finger; Int 24.1 50 0.0011 20.5 1.4 15 219-234 2-19 (51)
36 KOG2084|consensus 22.9 69 0.0015 29.3 2.6 25 36-60 22-46 (482)
37 PRK09203 rplP 50S ribosomal pr 22.1 1.3E+02 0.0029 23.4 3.7 42 29-72 78-119 (138)
38 TIGR03170 flgA_cterm flagella 21.1 43 0.00093 25.0 0.7 16 128-143 2-17 (122)
39 PF06718 DUF1203: Protein of u 21.1 75 0.0016 24.2 2.0 18 129-146 6-24 (117)
40 smart00249 PHD PHD zinc finger 21.1 58 0.0013 19.1 1.2 16 219-234 2-19 (47)
41 CHL00044 rpl16 ribosomal prote 21.0 1.8E+02 0.0039 22.7 4.2 39 29-69 78-116 (135)
No 1
>KOG4442|consensus
Probab=100.00 E-value=2.2e-48 Score=361.32 Aligned_cols=175 Identities=42% Similarity=0.913 Sum_probs=169.0
Q ss_pred ceEEcCCCCCC-CCCCCCCccCCCCCCCCEEEEEcCCCceEEEEceeCCCCCEEEEeccEEeCHHHHHHHHHhhccCCCC
Q psy7758 2 LYVECNPDSCP-ARTKCQNRDFETRNYPPLEVFNTGSRGWGLKALTDLKRGQFVVEYVGEMIDQKELNRRRRDMDRNNDH 80 (235)
Q Consensus 2 ~~~eC~~~~C~-~~~~C~N~~~~~~~~~~lev~~s~~~G~GlfA~~~I~~Ge~I~ey~G~ii~~~e~~~~~~~~~~~~~~ 80 (235)
+..||+++.|+ ||.+|+|+.||+.++.+++||.+.++||||+|..+|++|+||+||.|++|+..++..|...|..++..
T Consensus 93 t~iECs~~~C~~cg~~C~NQRFQkkqyA~vevF~Te~KG~GLRA~~dI~~g~FI~EY~GEVI~~~Ef~kR~~~Y~~d~~k 172 (729)
T KOG4442|consen 93 TSIECSDRECPRCGVYCKNQRFQKKQYAKVEVFLTEKKGCGLRAEEDIPKGQFILEYIGEVIEEKEFEKRVKRYAKDGIK 172 (729)
T ss_pred hhcccCCccCCCccccccchhhhhhccCceeEEEecCcccceeeccccCCCcEEeeeccccccHHHHHHHHHHHHhcCCc
Confidence 35799999999 99999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred ceeeeecCCCeEEeccccCCccccccCCCCCCceeEEEEECCceEEEEEEccCCCCCCeEEEecCCCccCCCCCeEEecC
Q psy7758 81 NYYFLSLDNSRYIDAGKKGNLARFMNHSCEPNCTAEKWTVSGDTRVGLFALRDVPAGTELVFNYELQKADNDGMRRCMCG 160 (235)
Q Consensus 81 ~~~~~~~~~~~~iDa~~~gn~aRfiNHSC~PN~~~~~~~~~~~~~~~i~A~rdI~~GEElt~~Y~~~~~~~~~~~~C~Cg 160 (235)
++|++.+.++.+|||+.+||+||||||||+|||.++.|.|++..+|.|||.|+|++||||||||..+.++.. ...|.||
T Consensus 173 h~Yfm~L~~~e~IDAT~KGnlaRFiNHSC~PNa~~~KWtV~~~lRvGiFakk~I~~GEEITFDYqf~rYGr~-AQ~CyCg 251 (729)
T KOG4442|consen 173 HYYFMALQGGEYIDATKKGNLARFINHSCDPNAEVQKWTVPDELRVGIFAKKVIKPGEEITFDYQFDRYGRD-AQPCYCG 251 (729)
T ss_pred eEEEEEecCCceecccccCcHHHhhcCCCCCCceeeeeeeCCeeEEEEeEecccCCCceeeEeccccccccc-ccccccC
Confidence 999999999999999999999999999999999999999999999999999999999999999999999988 8999999
Q ss_pred CCCCceeccCCCCCCCC
Q psy7758 161 AASCSGFIGAKKAVPPT 177 (235)
Q Consensus 161 ~~~C~g~~~~~~~~~~~ 177 (235)
+++|+|||++.++....
T Consensus 252 eanC~G~IGgk~q~da~ 268 (729)
T KOG4442|consen 252 EANCRGWIGGKPQTDAS 268 (729)
T ss_pred CcccccccCCCCccccc
Confidence 99999999998765433
No 2
>KOG1082|consensus
Probab=99.97 E-value=5.7e-32 Score=244.71 Aligned_cols=171 Identities=34% Similarity=0.632 Sum_probs=141.1
Q ss_pred CceEEcCCCCCCCCCCCCCccCCCCCCCCEEEEEcCCCceEEEEceeCCCCCEEEEeccEEeCHHHHHHHHHhhccCCC-
Q psy7758 1 DLYVECNPDSCPARTKCQNRDFETRNYPPLEVFNTGSRGWGLKALTDLKRGQFVVEYVGEMIDQKELNRRRRDMDRNND- 79 (235)
Q Consensus 1 ~~~~eC~~~~C~~~~~C~N~~~~~~~~~~lev~~s~~~G~GlfA~~~I~~Ge~I~ey~G~ii~~~e~~~~~~~~~~~~~- 79 (235)
+++|||++ .|+|+..|.||+.|++...+|+|++++.+||||++...|++|++|+||.|++++.++++.+.........
T Consensus 150 ~~i~EC~~-~C~C~~~C~nRv~q~g~~~~leIfrt~~kGwgvRs~~~I~~G~fvcEyaGe~~t~~e~~~~~~~~~~~~~~ 228 (364)
T KOG1082|consen 150 EPVFECSV-ACGCHPDCANRVVQKGLQFHLEVFRTPEKGWGVRTLDPIPAGEFVCEYAGEVLTSEEAQRRTHLREYLDDD 228 (364)
T ss_pred cccccccc-CCCCCCcCcchhhccccccceEEEecCCceeeecccccccCCCeeEEEeeEecChHHhhhccccccccccc
Confidence 46899998 8999999999999999999999999999999999999999999999999999999998876332222111
Q ss_pred -Cce------e---------------eeecCCCeEEeccccCCccccccCCCCCCceeEEEEECCc----eEEEEEEccC
Q psy7758 80 -HNY------Y---------------FLSLDNSRYIDAGKKGNLARFMNHSCEPNCTAEKWTVSGD----TRVGLFALRD 133 (235)
Q Consensus 80 -~~~------~---------------~~~~~~~~~iDa~~~gn~aRfiNHSC~PN~~~~~~~~~~~----~~~~i~A~rd 133 (235)
..+ | .......+.|||...||++|||||||.||..++.+..+.. .++.++|+++
T Consensus 229 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ida~~~GNv~RfinHSC~PN~~~~~v~~~~~~~~~~~i~ffa~~~ 308 (364)
T KOG1082|consen 229 CDAYSIADREWVDESPVGNTFVAPSLPGGPGRELLIDAKPHGNVARFINHSCSPNLLYQAVFQDEFVLLYLRIGFFALRD 308 (364)
T ss_pred cccchhhhccccccccccccccccccccCCCcceEEchhhcccccccccCCCCccceeeeeeecCCccchheeeeeeccc
Confidence 000 0 1112456789999999999999999999999887766532 7899999999
Q ss_pred CCCCCeEEEecCCCcc-----C-----CCCCeEEecCCCCCceeccCCC
Q psy7758 134 VPAGTELVFNYELQKA-----D-----NDGMRRCMCGAASCSGFIGAKK 172 (235)
Q Consensus 134 I~~GEElt~~Y~~~~~-----~-----~~~~~~C~Cg~~~C~g~~~~~~ 172 (235)
|.+|||||+||+..+. + ......|.|++..|++.+....
T Consensus 309 I~p~~ELT~dYg~~~~~~~~~~~~~~~~~~~~~c~c~~~~cr~~~~~~~ 357 (364)
T KOG1082|consen 309 ISPGEELTLDYGKAYKLLVQDGANIYTPVMKKNCNCGLEKCRGLLGSAP 357 (364)
T ss_pred cCCCcccchhhcccccccccccccccccccchhhcCCCHHhCcccCCCc
Confidence 9999999999997753 1 2247789999999999987643
No 3
>KOG1080|consensus
Probab=99.97 E-value=1.7e-31 Score=261.79 Aligned_cols=139 Identities=38% Similarity=0.796 Sum_probs=131.5
Q ss_pred CCEEEEEcCCCceEEEEceeCCCCCEEEEeccEEeCHHHHHHHHHhhccCCCCceeeeecCCCeEEeccccCCccccccC
Q psy7758 28 PPLEVFNTGSRGWGLKALTDLKRGQFVVEYVGEMIDQKELNRRRRDMDRNNDHNYYFLSLDNSRYIDAGKKGNLARFMNH 107 (235)
Q Consensus 28 ~~lev~~s~~~G~GlfA~~~I~~Ge~I~ey~G~ii~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~iDa~~~gn~aRfiNH 107 (235)
..|...++.++||||||+++|.+|++|+||+|+++...-++.|+..|...+....|+|.++..++|||+..||+||||||
T Consensus 866 k~~~F~~s~iH~wglfa~~~i~~~dmViEY~Ge~vR~~iad~RE~~Y~~~gi~~sYlfrid~~~ViDAtk~gniAr~InH 945 (1005)
T KOG1080|consen 866 KYVKFGRSGIHGWGLFAMENIAAGDMVIEYRGELVRSSIADLREARYERMGIGDSYLFRIDDEVVVDATKKGNIARFINH 945 (1005)
T ss_pred hhhccccccccccceeeccCccccceEEEeeceehhhhHHHHHHHHHhccCcccceeeecccceEEeccccCchhheeec
Confidence 34788889999999999999999999999999999999999999999999989999999999999999999999999999
Q ss_pred CCCCCceeEEEEECCceEEEEEEccCCCCCCeEEEecCCCccCCCCCeEEecCCCCCceec
Q psy7758 108 SCEPNCTAEKWTVSGDTRVGLFALRDVPAGTELVFNYELQKADNDGMRRCMCGAASCSGFI 168 (235)
Q Consensus 108 SC~PN~~~~~~~~~~~~~~~i~A~rdI~~GEElt~~Y~~~~~~~~~~~~C~Cg~~~C~g~~ 168 (235)
||.|||....+.++|+.+|+|+|.|+|.+|||||+||-....+. ...|+|||++|||++
T Consensus 946 sC~PNCyakvi~V~g~~~IvIyakr~I~~~EElTYDYkF~~e~~--kipClCgap~Crg~~ 1004 (1005)
T KOG1080|consen 946 SCNPNCYAKVITVEGDKRIVIYSKRDIAAGEELTYDYKFPTEDD--KIPCLCGAPNCRGFL 1004 (1005)
T ss_pred ccCCCceeeEEEecCeeEEEEEEecccccCceeeeecccccccc--ccccccCCCcccccc
Confidence 99999999999999999999999999999999999998766554 899999999999987
No 4
>smart00317 SET SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain. Putative methyl transferase, based on outlier plant homologues
Probab=99.95 E-value=6.8e-27 Score=176.47 Aligned_cols=116 Identities=45% Similarity=0.903 Sum_probs=100.4
Q ss_pred CEEEEEcCCCceEEEEceeCCCCCEEEEeccEEeCHHHHHHHHHhhccCCCCceeeeecCCCeEEeccccCCccccccCC
Q psy7758 29 PLEVFNTGSRGWGLKALTDLKRGQFVVEYVGEMIDQKELNRRRRDMDRNNDHNYYFLSLDNSRYIDAGKKGNLARFMNHS 108 (235)
Q Consensus 29 ~lev~~s~~~G~GlfA~~~I~~Ge~I~ey~G~ii~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~iDa~~~gn~aRfiNHS 108 (235)
+++++.++++|+||||+++|++|++|++|.|.++...+.......+........+++.......||+...++++||||||
T Consensus 1 ~~~~~~~~~~G~gl~a~~~i~~g~~i~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~id~~~~~~~~~~iNHs 80 (116)
T smart00317 1 KLEVFKSPGKGWGVRATEDIPKGEFIGEYVGEIITSEEAEERSKAYDTDGADSFYLFEIDSDLCIDARRKGNIARFINHS 80 (116)
T ss_pred CcEEEecCCCcEEEEECCccCCCCEEEEEEeEEECHHHHHHHHHHHHhcCCCCEEEEECCCCEEEeCCccCcHHHeeCCC
Confidence 36788899999999999999999999999999999988877654333333324567777777899999999999999999
Q ss_pred CCCCceeEEEEECCceEEEEEEccCCCCCCeEEEec
Q psy7758 109 CEPNCTAEKWTVSGDTRVGLFALRDVPAGTELVFNY 144 (235)
Q Consensus 109 C~PN~~~~~~~~~~~~~~~i~A~rdI~~GEElt~~Y 144 (235)
|.||+.+..+..++..++.++|+|||++|||||++|
T Consensus 81 c~pN~~~~~~~~~~~~~~~~~a~r~I~~GeEi~i~Y 116 (116)
T smart00317 81 CEPNCELLFVEVNGDSRIVIFALRDIKPGEELTIDY 116 (116)
T ss_pred CCCCEEEEEEEECCCcEEEEEECCCcCCCCEEeecC
Confidence 999999987777666689999999999999999999
No 5
>KOG1079|consensus
Probab=99.94 E-value=9.1e-28 Score=222.91 Aligned_cols=146 Identities=36% Similarity=0.687 Sum_probs=136.4
Q ss_pred EEcCCCCCCC-C---------CCCCCccCCCCCCCCEEEEEcCCCceEEEEceeCCCCCEEEEeccEEeCHHHHHHHHHh
Q psy7758 4 VECNPDSCPA-R---------TKCQNRDFETRNYPPLEVFNTGSRGWGLKALTDLKRGQFVVEYVGEMIDQKELNRRRRD 73 (235)
Q Consensus 4 ~eC~~~~C~~-~---------~~C~N~~~~~~~~~~lev~~s~~~G~GlfA~~~I~~Ge~I~ey~G~ii~~~e~~~~~~~ 73 (235)
-||+|++|.+ + -+|.|-.++.+.+.++.+.+|...|||||+++.+.++++|.||+|++|+.+|+++|...
T Consensus 560 rECdPd~Cl~cg~~~~~d~~~~~C~N~~l~~~~qkr~llapSdVaGwGlFlKe~v~KnefisEY~GE~IS~dEADrRGki 639 (739)
T KOG1079|consen 560 RECDPDVCLMCGNVDHFDSSKISCKNTNLQRGEQKRVLLAPSDVAGWGLFLKESVSKNEFISEYTGEIISHDEADRRGKI 639 (739)
T ss_pred cccCchHHhccCcccccccCccccccchhhhhhhcceeechhhccccceeeccccCCCceeeeecceeccchhhhhcccc
Confidence 5899999986 3 39999999999999999999999999999999999999999999999999999999988
Q ss_pred hccCCCCceeeeecCCCeEEeccccCCccccccCCCCCCceeEEEEECCceEEEEEEccCCCCCCeEEEecCCCccCC
Q psy7758 74 MDRNNDHNYYFLSLDNSRYIDAGKKGNLARFMNHSCEPNCTAEKWTVSGDTRVGLFALRDVPAGTELVFNYELQKADN 151 (235)
Q Consensus 74 ~~~~~~~~~~~~~~~~~~~iDa~~~gn~aRfiNHSC~PN~~~~~~~~~~~~~~~i~A~rdI~~GEElt~~Y~~~~~~~ 151 (235)
++.. ...|+|.+..+++||++..||.+||+|||-.|||....+.++|..+|.|+|.|+|.+|||||+||.++-...
T Consensus 640 YDr~--~cSflFnln~dyviDs~rkGnk~rFANHS~nPNCYAkvm~V~GdhRIGifAkRaIeagEELffDYrYs~~~~ 715 (739)
T KOG1079|consen 640 YDRY--MCSFLFNLNNDYVIDSTRKGNKIRFANHSFNPNCYAKVMMVAGDHRIGIFAKRAIEAGEELFFDYRYSPEHA 715 (739)
T ss_pred cccc--cceeeeeccccceEeeeeecchhhhccCCCCCCcEEEEEEecCCcceeeeehhhcccCceeeeeeccCcccc
Confidence 8876 556888999999999999999999999999999999999999999999999999999999999998876553
No 6
>KOG1083|consensus
Probab=99.89 E-value=1.5e-24 Score=208.73 Aligned_cols=137 Identities=43% Similarity=0.896 Sum_probs=121.7
Q ss_pred CCCCCccCCCCC-CCCEEEEEcCCCceEEEEceeCCCCCEEEEeccEEeCHHHHHHHHHhhccCCCCceeeeecCCCeEE
Q psy7758 15 TKCQNRDFETRN-YPPLEVFNTGSRGWGLKALTDLKRGQFVVEYVGEMIDQKELNRRRRDMDRNNDHNYYFLSLDNSRYI 93 (235)
Q Consensus 15 ~~C~N~~~~~~~-~~~lev~~s~~~G~GlfA~~~I~~Ge~I~ey~G~ii~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~i 93 (235)
+.|.|+.|++.. -++|+|++.+..||||.|.++|++||+|+||+|++++..+++.++.....+. ...|.+.+..+.++
T Consensus 1164 d~c~nqrm~r~e~cp~L~v~~gp~~G~~v~tk~PikagtfI~EYvGeVit~ke~e~~mmtl~~~d-~~~~cL~I~p~l~i 1242 (1306)
T KOG1083|consen 1164 DSCSNQRMQRHEECPPLEVFRGPKKGWGVRTKEPIKAGTFIMEYVGEVITEKEFEPRMMTLYHND-DDHYCLVIDPGLFI 1242 (1306)
T ss_pred hhhhhHHhhhhccCCCcceeccCCCCccccccccccccchHHHHHHHHHHHHhhcccccccCCCC-CcccccccCccccC
Confidence 468899888654 5899999999999999999999999999999999999999888854444433 34578899999999
Q ss_pred eccccCCccccccCCCCCCceeEEEEECCceEEEEEEccCCCCCCeEEEecCCCccCCC
Q psy7758 94 DAGKKGNLARFMNHSCEPNCTAEKWTVSGDTRVGLFALRDVPAGTELVFNYELQKADND 152 (235)
Q Consensus 94 Da~~~gn~aRfiNHSC~PN~~~~~~~~~~~~~~~i~A~rdI~~GEElt~~Y~~~~~~~~ 152 (235)
|+..++|.+||+||+|.|||..+.|.++|..||.|||+|||.+|||||+||+...+...
T Consensus 1243 d~~R~~n~~RfinhscKPNc~~qkwSVNG~~Rv~L~A~rDi~kGEELtYDYN~ks~~~p 1301 (1306)
T KOG1083|consen 1243 DIPRMGNGARFINHSCKPNCEMQKWSVNGEYRVGLFALRDLPKGEELTYDYNFKSFNYP 1301 (1306)
T ss_pred ChhhccccccccccccCCCCccccccccceeeeeeeecCCCCCCceEEEeccccccCCc
Confidence 99999999999999999999999999999999999999999999999999987655433
No 7
>KOG1085|consensus
Probab=99.83 E-value=1.4e-20 Score=160.52 Aligned_cols=124 Identities=30% Similarity=0.485 Sum_probs=109.3
Q ss_pred CCCCCEEEEEcCCCceEEEEceeCCCCCEEEEeccEEeCHHHHHHHHHhhccCC---CCceeeeecCCCeEEeccccCC-
Q psy7758 25 RNYPPLEVFNTGSRGWGLKALTDLKRGQFVVEYVGEMIDQKELNRRRRDMDRNN---DHNYYFLSLDNSRYIDAGKKGN- 100 (235)
Q Consensus 25 ~~~~~lev~~s~~~G~GlfA~~~I~~Ge~I~ey~G~ii~~~e~~~~~~~~~~~~---~~~~~~~~~~~~~~iDa~~~gn- 100 (235)
+....+.+....++|+||+|++++.+|+||.||.|.+|...++..++..|..+. .+.|||-+....++|||+...+
T Consensus 253 g~~egl~~~~~dgKGRGv~a~~~F~rgdFVVEY~Gdliei~eAk~rE~~Ya~De~~GcYMYyF~h~sk~yCiDAT~et~~ 332 (392)
T KOG1085|consen 253 GTNEGLLEVYKDGKGRGVRAKVNFERGDFVVEYRGDLIEISEAKVREEQYANDEEIGCYMYYFEHNSKKYCIDATKETPW 332 (392)
T ss_pred ccccceeEEeeccccceeEeecccccCceEEEEecceeeechHHHHHHHhccCcccceEEEeeeccCeeeeeeccccccc
Confidence 455567777788899999999999999999999999999999999988887654 3566666778899999996555
Q ss_pred ccccccCCCCCCceeEEEEECCceEEEEEEccCCCCCCeEEEecCCCc
Q psy7758 101 LARFMNHSCEPNCTAEKWTVSGDTRVGLFALRDVPAGTELVFNYELQK 148 (235)
Q Consensus 101 ~aRfiNHSC~PN~~~~~~~~~~~~~~~i~A~rdI~~GEElt~~Y~~~~ 148 (235)
++|.||||-.+|+....+.+++.++++++|.|||.+||||++||+.-.
T Consensus 333 lGRLINHS~~gNl~TKvv~Idg~pHLiLvA~rdIa~GEELlYDYGDRS 380 (392)
T KOG1085|consen 333 LGRLINHSVRGNLKTKVVEIDGSPHLILVARRDIAQGEELLYDYGDRS 380 (392)
T ss_pred chhhhcccccCcceeeEEEecCCceEEEEeccccccchhhhhhccccc
Confidence 789999999999999999999999999999999999999999998754
No 8
>KOG1141|consensus
Probab=99.77 E-value=4.8e-19 Score=167.60 Aligned_cols=165 Identities=30% Similarity=0.549 Sum_probs=123.1
Q ss_pred eEEcCCCCCCCCCCCCCccCCCCCCC--------CEEEEEcCCCceEEEEceeCCCCCEEEEeccEEeCHHHHHHHHHhh
Q psy7758 3 YVECNPDSCPARTKCQNRDFETRNYP--------PLEVFNTGSRGWGLKALTDLKRGQFVVEYVGEMIDQKELNRRRRDM 74 (235)
Q Consensus 3 ~~eC~~~~C~~~~~C~N~~~~~~~~~--------~lev~~s~~~G~GlfA~~~I~~Ge~I~ey~G~ii~~~e~~~~~~~~ 74 (235)
++||+. -|.|...|.|++++..... .|.|+.+...|||+.+..||+.-++|.+|.|...+..-+.+.-...
T Consensus 981 f~e~~~-hss~~~~e~~~~v~~~~~~~me~~s~~~l~i~~~~~~~~~~~edtD~~~~~~~~~~~~~ppt~~l~~~~r~aq 1059 (1262)
T KOG1141|consen 981 FFECND-HSSCHRKEYNRVVQNNIKYPMEVSSFNDLQIFKTAQSGWGVREDTDIPQSTFICTYVGAPPTDDLADELRNAQ 1059 (1262)
T ss_pred ceeccc-cchhcccccchhhhcCCccceeeeecccccccccccccccccccccCCCCcccccccCCCCchhhHHHHhhhh
Confidence 578985 7999999999998766443 4556666778999999999999999999998875543322111100
Q ss_pred ccCC-------------------CCc------------------------------------------------------
Q psy7758 75 DRNN-------------------DHN------------------------------------------------------ 81 (235)
Q Consensus 75 ~~~~-------------------~~~------------------------------------------------------ 81 (235)
.... .+.
T Consensus 1060 ad~~sn~~D~~~~~~l~es~~~~~T~~r~~t~~~~~~~~~d~dd~q~I~k~ve~qd~~~~~~~T~~~~RQ~~~~s~k~~~ 1139 (1262)
T KOG1141|consen 1060 ADQYSNDLDLKDTVELEESREDHETDFRGDTSDYDDEEGSDGDDGQDIMKMVERQDSSESGEETKRLTRQKRKQSKKSGK 1139 (1262)
T ss_pred hccccCccchhhhhhhhhcccccccccCCCCCCCcccccccCccHHHHHHHhhcccccccccccchhhhhhhhhhhhccc
Confidence 0000 000
Q ss_pred -----------e-------------------eeeecCCCeEEeccccCCccccccCCCCCCceeEEEEECCc----eEEE
Q psy7758 82 -----------Y-------------------YFLSLDNSRYIDAGKKGNLARFMNHSCEPNCTAEKWTVSGD----TRVG 127 (235)
Q Consensus 82 -----------~-------------------~~~~~~~~~~iDa~~~gn~aRfiNHSC~PN~~~~~~~~~~~----~~~~ 127 (235)
. +++....-++|||...||++||+||||.||..++.++++.+ +.|+
T Consensus 1140 ~~s~~~~~~ts~~~~~~dkges~~~~~~~~~~y~~~~~~yvIDAk~eGNlGRfLNHSC~PNl~VQnVfvdTHdlrfPwVA 1219 (1262)
T KOG1141|consen 1140 GGSVEKDDTTSRDSMEKDKGESKDEPVFNWDKYFEPFPLYVIDAKQEGNLGRFLNHSCDPNLHVQNVFVDTHDLRFPWVA 1219 (1262)
T ss_pred CccccccccCccchhhhccCccCcccccchhhccCCCceEEEecccccchhhhhccCCCccceeeeeeeeccccCCchhh
Confidence 0 00001123579999999999999999999999999998765 7799
Q ss_pred EEEccCCCCCCeEEEecCCCccCCC-CCeEEecCCCCCceec
Q psy7758 128 LFALRDVPAGTELVFNYELQKADND-GMRRCMCGAASCSGFI 168 (235)
Q Consensus 128 i~A~rdI~~GEElt~~Y~~~~~~~~-~~~~C~Cg~~~C~g~~ 168 (235)
+||.+-|++|.||||||++.-.... ....|+||+.+|||.|
T Consensus 1220 FFt~kyVkAgtELTWDY~Ye~g~v~~keL~C~CGa~~CrgrL 1261 (1262)
T KOG1141|consen 1220 FFTRKYVKAGTELTWDYQYEQGQVATKELTCHCGAENCRGRL 1261 (1262)
T ss_pred hhhhhhhccCceeeeeccccccccccceEEEecChhhhhccc
Confidence 9999999999999999988765433 3678999999999976
No 9
>PF00856 SET: SET domain; InterPro: IPR001214 The SET domain appears generally as one part of a larger multidomain protein, and recently there were described three structures of very different proteins with distinct domain compositions: Neurospora crassa DIM-5, a member of the Su(var) family of HKMTs which methylate histone H3 on lysine 9,human SET7 (also called SET9), which methylates H3 on lysine 4 and garden pea Rubisco LSMT, an enzyme that does not modify histones, but instead methylates lysine 14 in the flexible tail of the large subunit of the enzyme Rubisco. The SET domain itself turned out to be an uncommon structure. Although in all three studies, electron density maps revealed the location of the AdoMet or AdoHcy cofactor, the SET domain bears no similarity at all to the canonical/AdoMet-dependent methyltransferase fold. Strictly conserved in the C-terminal motif of the SET domain tyrosine could be involved in abstracting a proton from the protonated amino group of the substrate lysine, promoting its nucleophilic attack on the sulphonium methyl group of the AdoMet cofactor. In contrast to the AdoMet-dependent protein methyltranferases of the classical type, which tend to bind their polypeptide substrates on top of the cofactor, it is noted from the Rubisco LSMT structure that the AdoMet seems to bind in a separate cleft, suggesting how a polypeptide substrate could be subjected to multiple rounds of methylation without having to be released from the enzyme. In contrast, SET7/9 is able to add only a single methyl group to its substrate. It has been demonstrated that association of SET domain and myotubularin-related proteins modulates growth control []. The SET domain-containing Drosophila melanogaster (Fruit fly) protein, enhancer of zeste, has a function in segment determination and the mammalian homologue may be involved in the regulation of gene transcription and chromatin structure. Histone lysine methylation is part of the histone code that regulated chromatin function and epigenetic control of gene function. Histone lysine methyltransferases (HMTase) differ both in their substrate specificity for the various acceptor lysines as well as in their product specificity for the number of methyl groups (one, two, or three) they transfer. With just one exception [], the HMTases belong to SET family that can be classified according to the sequences surrounding the SET domain [, ]. Structural studies on the human SET7/9, a mono-methylase, have revealed the molecular basis for the specificity of the enzyme for the histone-target and the roles of the invariant residues in the SET domain in determining the methylation specificities []. The pre-SET domain, as found in the SUV39 SET family, contains nine invariant cysteine residues that are grouped into two segments separated by a region of variable length. These 9 cysteines coordinate 3 zinc ions to form to form a triangular cluster, where each of the zinc ions is coordinated by 4 four cysteines to give a tetrahedral configuration. The function of this domain is structural, holding together 2 long segments of random coils. The C-terminal region including the post-SET domain is disordered when not interacting with a histone tail and in the absence of zinc. The three conserved cysteines in the post-SET domain form a zinc-binding site when coupled to a fourth conserved cysteine in the knot-like structure close to the SET domain active site []. The structured post-SET region brings in the C-terminal residues that participate in S-adenosylmethine-binding and histone tail interactions. The three conserved cysteine residues are essential for HMTase activity, as replacement with serine abolishes HMTase activity [], []. ; GO: 0005515 protein binding; PDB: 3TG5_A 3S7F_A 3RIB_B 3TG4_A 3S7J_A 3S7D_A 3S7B_A 3H6L_A 3SMT_A 3K5K_A ....
Probab=99.75 E-value=1.9e-18 Score=136.07 Aligned_cols=107 Identities=21% Similarity=0.270 Sum_probs=72.8
Q ss_pred ceEEEEceeCCCCCEEEEeccEEeCHHHHHHHHH---hhc------------------------------------cCCC
Q psy7758 39 GWGLKALTDLKRGQFVVEYVGEMIDQKELNRRRR---DMD------------------------------------RNND 79 (235)
Q Consensus 39 G~GlfA~~~I~~Ge~I~ey~G~ii~~~e~~~~~~---~~~------------------------------------~~~~ 79 (235)
|+||||+++|++|++|+...+.+++......... ... ....
T Consensus 1 GrGl~At~dI~~Ge~I~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (162)
T PF00856_consen 1 GRGLFATRDIKAGEVILIPRPAILTPDEVSPQPELLRLQLSKALEEQSRSDFSIQKKQKAEKSERSPQLESLHSISLRSE 80 (162)
T ss_dssp SEEEEESS-B-TTEEEEEESEEEEEHHHHHCHHHHSHHTTCSSSCSHHTTHHHHHHHHHHHHHHHHHHHHHHHHHCHTTT
T ss_pred CEEEEECccCCCCCEEEEECcceEEehhhhhcccchhhhhhhhhcccccccccccccccccccccccccccccccccccc
Confidence 8999999999999999988888888776643100 000 0000
Q ss_pred ----------------CceeeeecCCCeEEeccccCCccccccCCCCCCceeEEEEECCceEEEEEEccCCCCCCeEEEe
Q psy7758 80 ----------------HNYYFLSLDNSRYIDAGKKGNLARFMNHSCEPNCTAEKWTVSGDTRVGLFALRDVPAGTELVFN 143 (235)
Q Consensus 80 ----------------~~~~~~~~~~~~~iDa~~~gn~aRfiNHSC~PN~~~~~~~~~~~~~~~i~A~rdI~~GEElt~~ 143 (235)
..............++....+++.|+||||.|||.+..........+.|+|+|||++|||||++
T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~p~~d~~NHsc~pn~~~~~~~~~~~~~~~~~a~r~I~~GeEi~is 160 (162)
T PF00856_consen 81 LQFSQAFQWSWFISWTRSDFSSRSFSEDDRDGIALYPFADMLNHSCDPNCEVSFDFDGDGGCLVVRATRDIKKGEEIFIS 160 (162)
T ss_dssp CCTCCHHHHHHHHHHHHHEEEEEEETTEEEEEEEEETGGGGSEEESSTSEEEEEEEETTTTEEEEEESS-B-TTSBEEEE
T ss_pred ccccccccchhhccccceeeeccccccccccccccCcHhHheccccccccceeeEeecccceEEEEECCccCCCCEEEEE
Confidence 0001111112334555667778999999999999997765567799999999999999999999
Q ss_pred cC
Q psy7758 144 YE 145 (235)
Q Consensus 144 Y~ 145 (235)
||
T Consensus 161 YG 162 (162)
T PF00856_consen 161 YG 162 (162)
T ss_dssp ST
T ss_pred EC
Confidence 97
No 10
>COG2940 Proteins containing SET domain [General function prediction only]
Probab=99.74 E-value=5.3e-19 Score=165.56 Aligned_cols=145 Identities=35% Similarity=0.667 Sum_probs=118.4
Q ss_pred CCCCCCEEEEEcCCCceEEEEceeCCCCCEEEEeccEEeCHHHHHHHHHhhccCCCCceeeeecCCCeEEeccccCCccc
Q psy7758 24 TRNYPPLEVFNTGSRGWGLKALTDLKRGQFVVEYVGEMIDQKELNRRRRDMDRNNDHNYYFLSLDNSRYIDAGKKGNLAR 103 (235)
Q Consensus 24 ~~~~~~lev~~s~~~G~GlfA~~~I~~Ge~I~ey~G~ii~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~iDa~~~gn~aR 103 (235)
.....+..+..+..+|+|+||.+.|++|++|.+|.|+++...++..+............+........++|+...|+.+|
T Consensus 328 ~~~~~~~~~~~~~~~~~g~fa~~~i~~~e~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~g~~~r 407 (480)
T COG2940 328 KKRREPNVVQESEIKGYGVFALESIKKGEFIIEYHGEIIRRKEAREREENYDLLGNEFSFGLLEDKDKVRDSQKAGDVAR 407 (480)
T ss_pred ccccchhhhhhhcccccceeehhhccchHHHHHhcCcccchHHHHhhhccccccccccchhhccccchhhhhhhcccccc
Confidence 33445667777789999999999999999999999999999888888776644333222222222377899999999999
Q ss_pred cccCCCCCCceeEEEEECCceEEEEEEccCCCCCCeEEEecCCCccCCC-------CCeEEecCCCCCceec
Q psy7758 104 FMNHSCEPNCTAEKWTVSGDTRVGLFALRDVPAGTELVFNYELQKADND-------GMRRCMCGAASCSGFI 168 (235)
Q Consensus 104 fiNHSC~PN~~~~~~~~~~~~~~~i~A~rdI~~GEElt~~Y~~~~~~~~-------~~~~C~Cg~~~C~g~~ 168 (235)
|+||||.||+.......++...+.++|+|||.+||||+++|+...+... ....|.|++..|++++
T Consensus 408 ~~nHS~~pN~~~~~~~~~g~~~~~~~~~rDI~~geEl~~dy~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 479 (480)
T COG2940 408 FINHSCTPNCEASPIEVNGIFKISIYAIRDIKAGEELTYDYGPSLEDNRELKKLLEKRWGCACGEDRCSHTM 479 (480)
T ss_pred eeecCCCCCcceecccccccceeeecccccchhhhhhccccccccccchhhhhhhhhhhccccCCCccCCCC
Confidence 9999999999987665555568999999999999999999999888755 4778999999999876
No 11
>KOG2589|consensus
Probab=99.61 E-value=8.5e-16 Score=135.10 Aligned_cols=123 Identities=23% Similarity=0.335 Sum_probs=94.1
Q ss_pred CCCceEEEEceeCCCCCEEEEeccEEeCHHHHHHHHHhhccCCCCceeeeecCCCeEEeccccCCccccccCCCCCCcee
Q psy7758 36 GSRGWGLKALTDLKRGQFVVEYVGEMIDQKELNRRRRDMDRNNDHNYYFLSLDNSRYIDAGKKGNLARFMNHSCEPNCTA 115 (235)
Q Consensus 36 ~~~G~GlfA~~~I~~Ge~I~ey~G~ii~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~iDa~~~gn~aRfiNHSC~PN~~~ 115 (235)
...|--|+|++.+.+|+-|...+|.|....+.+++........+++..+ ....-.+..+-.+++||||.|.|||.|
T Consensus 135 ~~~gAkivst~~w~~ndkIe~LvGcIaeLse~eE~~ll~~g~nDFSvmy----StRk~caqLwLGPaafINHDCrpnCkF 210 (453)
T KOG2589|consen 135 SQNGAKIVSTKSWSRNDKIELLVGCIAELSEAEERSLLRGGGNDFSVMY----STRKRCAQLWLGPAAFINHDCRPNCKF 210 (453)
T ss_pred cCCCceEEeeccccCCccHHHhhhhhhhcChhhhHHHHhccCCceeeee----ecccchhhheeccHHhhcCCCCCCcee
Confidence 3579999999999999999999999987777777744444433332222 112233556778899999999999998
Q ss_pred EEEEECCceEEEEEEccCCCCCCeEEEecCCCccCCCCCeEEecCCCCCceec
Q psy7758 116 EKWTVSGDTRVGLFALRDVPAGTELVFNYELQKADNDGMRRCMCGAASCSGFI 168 (235)
Q Consensus 116 ~~~~~~~~~~~~i~A~rdI~~GEElt~~Y~~~~~~~~~~~~C~Cg~~~C~g~~ 168 (235)
+ ..|..++.|.++|||+||||||.-|+..+++.. +..|.| ..|....
T Consensus 211 v---s~g~~tacvkvlRDIePGeEITcFYgs~fFG~~-N~~CeC--~TCER~g 257 (453)
T KOG2589|consen 211 V---STGRDTACVKVLRDIEPGEEITCFYGSGFFGEN-NEECEC--VTCERRG 257 (453)
T ss_pred e---cCCCceeeeehhhcCCCCceeEEeecccccCCC-CceeEE--eeccccc
Confidence 4 334589999999999999999999999999988 777777 5665443
No 12
>KOG1081|consensus
Probab=99.51 E-value=3.1e-15 Score=138.83 Aligned_cols=151 Identities=44% Similarity=0.858 Sum_probs=129.2
Q ss_pred eEEcCCCCCCCCCCCCCccCCCCCCCCEEEEEcCCCceEEEEceeCCCCCEEEEeccEEeCHHHHHHHHHhhccCCCCce
Q psy7758 3 YVECNPDSCPARTKCQNRDFETRNYPPLEVFNTGSRGWGLKALTDLKRGQFVVEYVGEMIDQKELNRRRRDMDRNNDHNY 82 (235)
Q Consensus 3 ~~eC~~~~C~~~~~C~N~~~~~~~~~~lev~~s~~~G~GlfA~~~I~~Ge~I~ey~G~ii~~~e~~~~~~~~~~~~~~~~ 82 (235)
+++|.|..|+++..|.|+.+.+...+. +.. +|.++|.+| .|++++..+...+...........+
T Consensus 289 ~~~~~p~~~~~~~~~~~~~~sk~~~~e------~~~----~~~~~~~k~------vg~~i~~~e~~~~~~~~~~~~~~~~ 352 (463)
T KOG1081|consen 289 AYEVHPKVCSAEERCHNQQFSKESYPE------PQK----TAKADIRKG------VGEVIDDKECKARLQRVKESDLVDF 352 (463)
T ss_pred hhhhcccccccccccccchhhhhcccc------cch----hhHHhhhcc------cCcccchhhheeehhhhhccchhhh
Confidence 567888899999999999988887755 222 899999998 8889999998888888887777888
Q ss_pred eeeecCCCeEEeccccCCccccccCCCCCCceeEEEEECCceEEEEEEccCCCCCCeEEEecCCCccCCCCCeEEecCCC
Q psy7758 83 YFLSLDNSRYIDAGKKGNLARFMNHSCEPNCTAEKWTVSGDTRVGLFALRDVPAGTELVFNYELQKADNDGMRRCMCGAA 162 (235)
Q Consensus 83 ~~~~~~~~~~iDa~~~gn~aRfiNHSC~PN~~~~~~~~~~~~~~~i~A~rdI~~GEElt~~Y~~~~~~~~~~~~C~Cg~~ 162 (235)
|+.....+..||+..++|.+||+||||.||+....|.+.+..++.++|.+.|++|||||++|...-... .+.|.|++.
T Consensus 353 ~~~~~e~~~~id~~~~~n~sr~~nh~~~~~v~~~k~~~~~~t~~~~~a~~~i~~g~e~t~~~n~~~~~~--~~~~~~~~e 430 (463)
T KOG1081|consen 353 YMVFIQKDRIIDAGPKGNYSRFLNHSCQPNVETEKWQVIGDTRVGLFAPRQIEAGEELTFNYNGNCEGN--EKRCCCGSE 430 (463)
T ss_pred hhhhhhcccccccccccchhhhhcccCCCceeechhheecccccccccccccccchhhhheeeccccCC--cceEeeccc
Confidence 877777666999999999999999999999999999999999999999999999999999998765543 578999888
Q ss_pred CCceeccCC
Q psy7758 163 SCSGFIGAK 171 (235)
Q Consensus 163 ~C~g~~~~~ 171 (235)
+|.++++..
T Consensus 431 ~~~~~~~k~ 439 (463)
T KOG1081|consen 431 NCTETKGKK 439 (463)
T ss_pred ccccCCccc
Confidence 887776654
No 13
>KOG2461|consensus
Probab=98.69 E-value=2.1e-08 Score=91.58 Aligned_cols=115 Identities=23% Similarity=0.299 Sum_probs=82.9
Q ss_pred CCCCEEEEEc--CCCceEEEEceeCCCCCEEEEeccEEeCHHHHHHHHHhhccCCCCceeeeecC-CCeEEecc--ccCC
Q psy7758 26 NYPPLEVFNT--GSRGWGLKALTDLKRGQFVVEYVGEMIDQKELNRRRRDMDRNNDHNYYFLSLD-NSRYIDAG--KKGN 100 (235)
Q Consensus 26 ~~~~lev~~s--~~~G~GlfA~~~I~~Ge~I~ey~G~ii~~~e~~~~~~~~~~~~~~~~~~~~~~-~~~~iDa~--~~gn 100 (235)
..+.|.|..+ +..|.||++...|.+|+.-+.|.|+++.... ....+......++..+ ...+||++ ...|
T Consensus 26 LP~~l~i~~Ssv~~~~lgV~s~~~i~~G~~FGP~~G~~~~~~~------~~~~n~~y~W~I~~~d~~~~~iDg~d~~~sN 99 (396)
T KOG2461|consen 26 LPPELRIKPSSVPVTGLGVWSNASILPGTSFGPFEGEIIASID------SKSANNRYMWEIFSSDNGYEYIDGTDEEHSN 99 (396)
T ss_pred CCCceEeeccccCCccccccccccccCcccccCccCccccccc------cccccCcceEEEEeCCCceEEeccCChhhcc
Confidence 5678888888 5688999999999999999999999711110 0011111222222332 44789987 4789
Q ss_pred ccccccCCCCC---CceeEEEEECCceEEEEEEccCCCCCCeEEEecCCCccC
Q psy7758 101 LARFMNHSCEP---NCTAEKWTVSGDTRVGLFALRDVPAGTELVFNYELQKAD 150 (235)
Q Consensus 101 ~aRfiNHSC~P---N~~~~~~~~~~~~~~~i~A~rdI~~GEElt~~Y~~~~~~ 150 (235)
+.||||-.+.. |... ......|+++|+|+|.+||||.|+|+.++..
T Consensus 100 WmRYV~~Ar~~eeQNL~A----~Q~~~~Ifyrt~r~I~p~eELlVWY~~e~~~ 148 (396)
T KOG2461|consen 100 WMRYVNSARSEEEQNLLA----FQIGENIFYRTIRDIRPNEELLVWYGSEYAE 148 (396)
T ss_pred eeeeecccCChhhhhHHH----HhccCceEEEecccCCCCCeEEEEeccchHh
Confidence 99999988864 6543 2335789999999999999999999988743
No 14
>KOG1141|consensus
Probab=97.76 E-value=5.7e-06 Score=80.10 Aligned_cols=67 Identities=28% Similarity=0.538 Sum_probs=60.2
Q ss_pred eEEcCCCCCCC-CCCCCCccCCCCCCCCEEEEEcCCCceEEEEceeCCCCCEEEEeccEEeCHHHHHHH
Q psy7758 3 YVECNPDSCPA-RTKCQNRDFETRNYPPLEVFNTGSRGWGLKALTDLKRGQFVVEYVGEMIDQKELNRR 70 (235)
Q Consensus 3 ~~eC~~~~C~~-~~~C~N~~~~~~~~~~lev~~s~~~G~GlfA~~~I~~Ge~I~ey~G~ii~~~e~~~~ 70 (235)
+|||+. .|.| +.-|.||+++.+.+.+|.++.+..+|||++...+|..|.+|+.|.|.+++..-.+..
T Consensus 774 ~yEc~k-~ckc~~~~C~nrmvqhg~qvRlq~fkt~~kGWg~rclddi~~g~fVciy~g~~l~~~~sdks 841 (1262)
T KOG1141|consen 774 PYECLK-ACKCCGPDCLNRMVQHGYQVRLQRFKTIHKGWGRRCLDDITGGNFVCIYPGGALLHQISDKS 841 (1262)
T ss_pred HHHHHH-hhccCcHHHHHHHhhcCceeEeeeccccccccceEeeeecCCceEEEEecchhhhhhhchhh
Confidence 589986 8986 579999999999999999999999999999999999999999999999887654443
No 15
>smart00570 AWS associated with SET domains. subdomain of PRESET
Probab=97.48 E-value=5.1e-05 Score=49.34 Aligned_cols=25 Identities=40% Similarity=1.069 Sum_probs=22.2
Q ss_pred ceEEcCCCCCCCCCCCCCccCCCCCC
Q psy7758 2 LYVECNPDSCPARTKCQNRDFETRNY 27 (235)
Q Consensus 2 ~~~eC~~~~C~~~~~C~N~~~~~~~~ 27 (235)
|++|| |..|+||+.|+||.|+++++
T Consensus 27 l~~EC-~~~C~~G~~C~NqrFqk~~y 51 (51)
T smart00570 27 LLIEC-SSDCPCGSYCSNQRFQKRQY 51 (51)
T ss_pred Hhhhc-CCCCCCCcCccCcccccCcC
Confidence 57899 57999999999999998864
No 16
>smart00508 PostSET Cysteine-rich motif following a subset of SET domains.
Probab=96.44 E-value=0.0017 Score=36.19 Aligned_cols=18 Identities=44% Similarity=1.261 Sum_probs=15.6
Q ss_pred CeEEecCCCCCceeccCC
Q psy7758 154 MRRCMCGAASCSGFIGAK 171 (235)
Q Consensus 154 ~~~C~Cg~~~C~g~~~~~ 171 (235)
.+.|.||+++|||++...
T Consensus 2 ~~~C~CGs~~CRG~l~~~ 19 (26)
T smart00508 2 KQPCLCGAPNCRGFLGXX 19 (26)
T ss_pred CeeeeCCCccccceeccc
Confidence 579999999999999654
No 17
>KOG2084|consensus
Probab=95.28 E-value=0.023 Score=52.51 Aligned_cols=44 Identities=27% Similarity=0.468 Sum_probs=32.3
Q ss_pred cccCCCCCCceeEEEEECCceEEEEEEccCCCCCC-eEEEecCCCccCC
Q psy7758 104 FMNHSCEPNCTAEKWTVSGDTRVGLFALRDVPAGT-ELVFNYELQKADN 151 (235)
Q Consensus 104 fiNHSC~PN~~~~~~~~~~~~~~~i~A~rdI~~GE-Elt~~Y~~~~~~~ 151 (235)
++||||.||+.. ..+ .....+.+..++.+++ ||+++|....++.
T Consensus 208 ~~~hsC~pn~~~---~~~-~~~~~~~~~~~~~~~~~~l~~~y~~~~~~~ 252 (482)
T KOG2084|consen 208 LFNHSCFPNISV---IFD-GRGLALLVPAGIDAGEEELTISYTDPLLST 252 (482)
T ss_pred hcccCCCCCeEE---EEC-CceeEEEeecccCCCCCEEEEeecccccCH
Confidence 789999999983 233 3445566777777776 9999998877653
No 18
>KOG1337|consensus
Probab=92.90 E-value=0.085 Score=49.69 Aligned_cols=41 Identities=39% Similarity=0.399 Sum_probs=31.5
Q ss_pred ccccCCCCCCceeEEEEECCceEEEEEEccCCCCCCeEEEecCC
Q psy7758 103 RFMNHSCEPNCTAEKWTVSGDTRVGLFALRDVPAGTELVFNYEL 146 (235)
Q Consensus 103 RfiNHSC~PN~~~~~~~~~~~~~~~i~A~rdI~~GEElt~~Y~~ 146 (235)
-+.||++.+... .+......+.+++.++|.+||||+++||.
T Consensus 238 D~~NH~~~~~~~---~~~~~d~~~~l~~~~~v~~geevfi~YG~ 278 (472)
T KOG1337|consen 238 DLLNHSPEVIKA---GYNQEDEAVELVAERDVSAGEEVFINYGP 278 (472)
T ss_pred HhhccCchhccc---cccCCCCcEEEEEeeeecCCCeEEEecCC
Confidence 467999999222 11222348999999999999999999987
No 19
>smart00317 SET SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain. Putative methyl transferase, based on outlier plant homologues
Probab=76.25 E-value=4.3 Score=29.30 Aligned_cols=34 Identities=24% Similarity=0.277 Sum_probs=25.5
Q ss_pred CCCCCCCCEEEEEcCCCc---eEEEEceeCCCCCEEE
Q psy7758 22 FETRNYPPLEVFNTGSRG---WGLKALTDLKRGQFVV 55 (235)
Q Consensus 22 ~~~~~~~~lev~~s~~~G---~GlfA~~~I~~Ge~I~ 55 (235)
+.+...|...+......| ..++|+++|++||-|.
T Consensus 77 iNHsc~pN~~~~~~~~~~~~~~~~~a~r~I~~GeEi~ 113 (116)
T smart00317 77 INHSCEPNCELLFVEVNGDSRIVIFALRDIKPGEELT 113 (116)
T ss_pred eCCCCCCCEEEEEEEECCCcEEEEEECCCcCCCCEEe
Confidence 445556777776665544 8999999999999875
No 20
>PF08666 SAF: SAF domain; InterPro: IPR013974 This entry includes a range of different proteins, such as antifreeze proteins, flagellar FlgA proteins, and CpaB pilus proteins. ; PDB: 1C89_A 3NLA_A 3RDN_A 1C8A_A 3FRN_A 1WVO_A 3K3S_H 3G8R_B 1XUU_A 1XUZ_A ....
Probab=75.55 E-value=1.6 Score=28.70 Aligned_cols=15 Identities=47% Similarity=0.718 Sum_probs=11.3
Q ss_pred EEEEccCCCCCCeEE
Q psy7758 127 GLFALRDVPAGTELV 141 (235)
Q Consensus 127 ~i~A~rdI~~GEElt 141 (235)
+++|.+||++|+.|+
T Consensus 3 vvVA~~di~~G~~i~ 17 (63)
T PF08666_consen 3 VVVAARDIPAGTVIT 17 (63)
T ss_dssp EEEESSTB-TT-BEC
T ss_pred EEEEeCccCCCCEEc
Confidence 478999999999993
No 21
>KOG1338|consensus
Probab=67.50 E-value=4.6 Score=37.18 Aligned_cols=45 Identities=24% Similarity=0.324 Sum_probs=31.8
Q ss_pred ccccccCCCC-CCceeEEEEECCceEEEEEEccCCCCCCeEEEecCCCcc
Q psy7758 101 LARFMNHSCE-PNCTAEKWTVSGDTRVGLFALRDVPAGTELVFNYELQKA 149 (235)
Q Consensus 101 ~aRfiNHSC~-PN~~~~~~~~~~~~~~~i~A~rdI~~GEElt~~Y~~~~~ 149 (235)
.+-|+||.-. -|+...+ +..-+.++|.|+|.+|+|+.-.|+....
T Consensus 218 ~ad~lNhd~~k~nanl~y----~~NcL~mva~r~iekgdev~n~dg~~p~ 263 (466)
T KOG1338|consen 218 IADFLNHDGLKANANLRY----EDNCLEMVADRNIEKGDEVDNSDGLKPM 263 (466)
T ss_pred hhhhhccchhhcccceec----cCcceeeeecCCCCCccccccccccCcc
Confidence 4678999743 3443321 2355678999999999999999975443
No 22
>smart00858 SAF This domain family includes a range of different proteins. Such as antifreeze proteins and flagellar FlgA proteins, and CpaB pilus proteins.
Probab=51.33 E-value=10 Score=24.69 Aligned_cols=16 Identities=38% Similarity=0.544 Sum_probs=14.0
Q ss_pred EEEEccCCCCCCeEEE
Q psy7758 127 GLFALRDVPAGTELVF 142 (235)
Q Consensus 127 ~i~A~rdI~~GEElt~ 142 (235)
+++|.++|.+|+.|+-
T Consensus 3 v~va~~~i~~G~~i~~ 18 (64)
T smart00858 3 VVVAARDLPAGEVITA 18 (64)
T ss_pred EEEEeCccCCCCCcch
Confidence 4789999999999985
No 23
>TIGR02059 swm_rep_I cyanobacterial long protein repeat. This domain appears in 29 copies in a large (10000 amino protein in Synechococcus sp. WH8102 associated with a novel flagellar system, as one of three different repeats. Similar domains are found in two different large (<3500) proteins of Synechocystis PCC6803.
Probab=49.22 E-value=40 Score=25.02 Aligned_cols=24 Identities=25% Similarity=0.364 Sum_probs=20.9
Q ss_pred eEEEEEEccCCCCCCeEEEecCCC
Q psy7758 124 TRVGLFALRDVPAGTELVFNYELQ 147 (235)
Q Consensus 124 ~~~~i~A~rdI~~GEElt~~Y~~~ 147 (235)
..+.|.-.+.|..||++|++|...
T Consensus 75 ktVTLTL~~~V~~Gq~VTVsYt~p 98 (101)
T TIGR02059 75 TTITLTLAQVVEDGDEVTLSYTKN 98 (101)
T ss_pred cEEEEEecccccCCCEEEEEeeCC
Confidence 478888899999999999999654
No 24
>PF14100 PmoA: Methane oxygenase PmoA
Probab=47.80 E-value=29 Score=30.31 Aligned_cols=104 Identities=18% Similarity=0.312 Sum_probs=52.5
Q ss_pred CEEEEEcCCCceEEEEceeCCCCCEEEEeccEEeCHHHHHHHHHhhccCCCCceeeeecCCCeEEeccccCCccccccCC
Q psy7758 29 PLEVFNTGSRGWGLKALTDLKRGQFVVEYVGEMIDQKELNRRRRDMDRNNDHNYYFLSLDNSRYIDAGKKGNLARFMNHS 108 (235)
Q Consensus 29 ~lev~~s~~~G~GlfA~~~I~~Ge~I~ey~G~ii~~~e~~~~~~~~~~~~~~~~~~~~~~~~~~iDa~~~gn~aRfiNHS 108 (235)
+|.+....-.|+++++.+.+..|+++..- | ................|.-..++. ..... .-|++|.
T Consensus 144 ~v~l~~~~yGGl~~R~~~~~~~g~v~~s~-G-------~~g~~~~~g~~a~Wv~~~g~~~~~-----~~~~~-i~~~dhP 209 (271)
T PF14100_consen 144 PVTLGDPGYGGLFWRAARSWDGGTVLTSE-G-------KTGEEAAWGKRAPWVDYSGPIDGE-----DGTSG-IAILDHP 209 (271)
T ss_pred ceEecCCCcceEEEEccCcccCCeEECCC-C-------CcCcccccCCccCceEEEeeeCCC-----cceEE-EEEEeCC
Confidence 56666555678999999988665555331 1 000000001111111111111111 00112 2377888
Q ss_pred CCCCceeEEEEECCceEEEE------EEccCCCCCCeEEEecCCC
Q psy7758 109 CEPNCTAEKWTVSGDTRVGL------FALRDVPAGTELVFNYELQ 147 (235)
Q Consensus 109 C~PN~~~~~~~~~~~~~~~i------~A~rdI~~GEElt~~Y~~~ 147 (235)
=+||-- ..|.+.+...+.+ ..--.|++||.|++.|..-
T Consensus 210 ~N~~~P-~~W~vR~~g~~~~~p~~~~~~~~~l~~G~~l~~rYr~~ 253 (271)
T PF14100_consen 210 SNPNYP-TPWHVRGYGLFGANPAPAFDGPLTLPPGETLTLRYRVV 253 (271)
T ss_pred CCCCCC-cceEEeccCcceecccccccCceecCCCCeEEEEEEEE
Confidence 887644 3355554444433 3445699999999999543
No 25
>KOG1338|consensus
Probab=43.80 E-value=16 Score=33.80 Aligned_cols=34 Identities=21% Similarity=0.376 Sum_probs=25.8
Q ss_pred CCCEEEEEcC----CCceEEEEceeCCCCCEEEEeccE
Q psy7758 27 YPPLEVFNTG----SRGWGLKALTDLKRGQFVVEYVGE 60 (235)
Q Consensus 27 ~~~lev~~s~----~~G~GlfA~~~I~~Ge~I~ey~G~ 60 (235)
+|++.+..-+ ..|.|++|+++|++|+.++.|.+.
T Consensus 25 SpKi~~ndl~~v~~~~G~g~vAtesIkkgE~Lf~~prd 62 (466)
T KOG1338|consen 25 SPKIDNNDLPWVERIAGAGIVATESIKKGESLFAYPRD 62 (466)
T ss_pred cccccccccchhhhhcccceeeehhhcCCceEEEecCc
Confidence 3445554442 359999999999999999988665
No 26
>KOG1081|consensus
Probab=40.22 E-value=16 Score=34.56 Aligned_cols=20 Identities=25% Similarity=0.587 Sum_probs=17.8
Q ss_pred ccccccccccCCCCCeEEEee
Q psy7758 213 PAVVEVCERCGAEGGEMLRCA 233 (235)
Q Consensus 213 ~~~~d~~~~c~~~~~~~~~~~ 233 (235)
+..+++||.|. +||.+|.||
T Consensus 86 ~~~~~~c~vc~-~ggs~v~~~ 105 (463)
T KOG1081|consen 86 KIEPSECFVCF-KGGSLVTCK 105 (463)
T ss_pred CCCcchhcccc-CCCccceec
Confidence 44579999999 999999998
No 27
>PF00856 SET: SET domain; InterPro: IPR001214 The SET domain appears generally as one part of a larger multidomain protein, and recently there were described three structures of very different proteins with distinct domain compositions: Neurospora crassa DIM-5, a member of the Su(var) family of HKMTs which methylate histone H3 on lysine 9,human SET7 (also called SET9), which methylates H3 on lysine 4 and garden pea Rubisco LSMT, an enzyme that does not modify histones, but instead methylates lysine 14 in the flexible tail of the large subunit of the enzyme Rubisco. The SET domain itself turned out to be an uncommon structure. Although in all three studies, electron density maps revealed the location of the AdoMet or AdoHcy cofactor, the SET domain bears no similarity at all to the canonical/AdoMet-dependent methyltransferase fold. Strictly conserved in the C-terminal motif of the SET domain tyrosine could be involved in abstracting a proton from the protonated amino group of the substrate lysine, promoting its nucleophilic attack on the sulphonium methyl group of the AdoMet cofactor. In contrast to the AdoMet-dependent protein methyltranferases of the classical type, which tend to bind their polypeptide substrates on top of the cofactor, it is noted from the Rubisco LSMT structure that the AdoMet seems to bind in a separate cleft, suggesting how a polypeptide substrate could be subjected to multiple rounds of methylation without having to be released from the enzyme. In contrast, SET7/9 is able to add only a single methyl group to its substrate. It has been demonstrated that association of SET domain and myotubularin-related proteins modulates growth control []. The SET domain-containing Drosophila melanogaster (Fruit fly) protein, enhancer of zeste, has a function in segment determination and the mammalian homologue may be involved in the regulation of gene transcription and chromatin structure. Histone lysine methylation is part of the histone code that regulated chromatin function and epigenetic control of gene function. Histone lysine methyltransferases (HMTase) differ both in their substrate specificity for the various acceptor lysines as well as in their product specificity for the number of methyl groups (one, two, or three) they transfer. With just one exception [], the HMTases belong to SET family that can be classified according to the sequences surrounding the SET domain [, ]. Structural studies on the human SET7/9, a mono-methylase, have revealed the molecular basis for the specificity of the enzyme for the histone-target and the roles of the invariant residues in the SET domain in determining the methylation specificities []. The pre-SET domain, as found in the SUV39 SET family, contains nine invariant cysteine residues that are grouped into two segments separated by a region of variable length. These 9 cysteines coordinate 3 zinc ions to form to form a triangular cluster, where each of the zinc ions is coordinated by 4 four cysteines to give a tetrahedral configuration. The function of this domain is structural, holding together 2 long segments of random coils. The C-terminal region including the post-SET domain is disordered when not interacting with a histone tail and in the absence of zinc. The three conserved cysteines in the post-SET domain form a zinc-binding site when coupled to a fourth conserved cysteine in the knot-like structure close to the SET domain active site []. The structured post-SET region brings in the C-terminal residues that participate in S-adenosylmethine-binding and histone tail interactions. The three conserved cysteine residues are essential for HMTase activity, as replacement with serine abolishes HMTase activity [], []. ; GO: 0005515 protein binding; PDB: 3TG5_A 3S7F_A 3RIB_B 3TG4_A 3S7J_A 3S7D_A 3S7B_A 3H6L_A 3SMT_A 3K5K_A ....
Probab=38.77 E-value=49 Score=24.71 Aligned_cols=34 Identities=21% Similarity=0.191 Sum_probs=24.4
Q ss_pred CCCCCCCEEEEEc---CCCceEEEEceeCCCCCEEEE
Q psy7758 23 ETRNYPPLEVFNT---GSRGWGLKALTDLKRGQFVVE 56 (235)
Q Consensus 23 ~~~~~~~lev~~s---~~~G~GlfA~~~I~~Ge~I~e 56 (235)
++...|...+... ....+-++|+++|++|+-|.-
T Consensus 123 NHsc~pn~~~~~~~~~~~~~~~~~a~r~I~~GeEi~i 159 (162)
T PF00856_consen 123 NHSCDPNCEVSFDFDGDGGCLVVRATRDIKKGEEIFI 159 (162)
T ss_dssp EEESSTSEEEEEEEETTTTEEEEEESS-B-TTSBEEE
T ss_pred ccccccccceeeEeecccceEEEEECCccCCCCEEEE
Confidence 3344556777766 688999999999999998864
No 28
>KOG2155|consensus
Probab=38.33 E-value=17 Score=34.10 Aligned_cols=48 Identities=17% Similarity=0.249 Sum_probs=34.2
Q ss_pred ccccccCCCCCCceeEEEEECCc--eEEEEEEccCCCCCCeEEEecCCCc
Q psy7758 101 LARFMNHSCEPNCTAEKWTVSGD--TRVGLFALRDVPAGTELVFNYELQK 148 (235)
Q Consensus 101 ~aRfiNHSC~PN~~~~~~~~~~~--~~~~i~A~rdI~~GEElt~~Y~~~~ 148 (235)
++.-+.||=.||..+.....-.. .--++.-+|++..|||||-|+....
T Consensus 204 fGsrvrHsdePnf~~aPf~fmPq~vaYsimwp~k~~~tgeE~trDfasg~ 253 (631)
T KOG2155|consen 204 FGSRVRHSDEPNFRIAPFMFMPQNVAYSIMWPTKPVNTGEEITRDFASGV 253 (631)
T ss_pred hhhhhccCCCCcceeeeheecchhcceeEEeeccCCCCchHHHHHHhhcC
Confidence 34568999999987654333222 2334688999999999999986544
No 29
>COG1188 Ribosome-associated heat shock protein implicated in the recycling of the 50S subunit (S4 paralog) [Translation, ribosomal structure and biogenesis]
Probab=34.03 E-value=47 Score=24.65 Aligned_cols=23 Identities=13% Similarity=0.189 Sum_probs=20.1
Q ss_pred EEEccCCCCCCeEEEecCCCccC
Q psy7758 128 LFALRDVPAGTELVFNYELQKAD 150 (235)
Q Consensus 128 i~A~rdI~~GEElt~~Y~~~~~~ 150 (235)
..+..+++.|++|+|.|+...+.
T Consensus 43 aKpS~~VK~GD~l~i~~~~~~~~ 65 (100)
T COG1188 43 AKPSKEVKVGDILTIRFGNKEFT 65 (100)
T ss_pred cccccccCCCCEEEEEeCCcEEE
Confidence 38999999999999999887764
No 30
>PF00098 zf-CCHC: Zinc knuckle; InterPro: IPR001878 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents the CysCysHisCys (CCHC) type zinc finger domains, and have the sequence: C-X2-C-X4-H-X4-C where X can be any amino acid, and number indicates the number of residues. These 18 residues CCHC zinc finger domains are mainly found in the nucleocapsid protein of retroviruses. It is required for viral genome packaging and for early infection process [, , ]. It is also found in eukaryotic proteins involved in RNA binding or single-stranded DNA binding []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0003676 nucleic acid binding, 0008270 zinc ion binding; PDB: 2L44_A 1A1T_A 1WWG_A 1U6P_A 1WWD_A 1WWE_A 1A6B_B 1F6U_A 1MFS_A 1NCP_C ....
Probab=32.11 E-value=24 Score=17.55 Aligned_cols=10 Identities=50% Similarity=1.109 Sum_probs=7.8
Q ss_pred cccccCCCCCe
Q psy7758 218 VCERCGAEGGE 228 (235)
Q Consensus 218 ~~~~c~~~~~~ 228 (235)
.||.|| +-|-
T Consensus 2 ~C~~C~-~~GH 11 (18)
T PF00098_consen 2 KCFNCG-EPGH 11 (18)
T ss_dssp BCTTTS-CSSS
T ss_pred cCcCCC-CcCc
Confidence 599999 7764
No 31
>TIGR01164 rplP_bact ribosomal protein L16, bacterial/organelle. This model describes bacterial and organellar ribosomal protein L16. The homologous protein of the eukaryotic cytosol is designated L10
Probab=30.34 E-value=80 Score=24.31 Aligned_cols=41 Identities=20% Similarity=0.221 Sum_probs=29.2
Q ss_pred CEEEEEcCCCceEEEEceeCCCCCEEEEeccEEeCHHHHHHHH
Q psy7758 29 PLEVFNTGSRGWGLKALTDLKRGQFVVEYVGEMIDQKELNRRR 71 (235)
Q Consensus 29 ~lev~~s~~~G~GlfA~~~I~~Ge~I~ey~G~ii~~~e~~~~~ 71 (235)
+++++...++|-=..-...|++|++|+|..| ++.+.+..-.
T Consensus 77 ~~~~RMGkGKG~~~~~varV~~G~ilfEi~~--~~~~~a~~al 117 (126)
T TIGR01164 77 PLETRMGKGKGNPEYWVAVVKPGKILFEIAG--VPEEVAREAF 117 (126)
T ss_pred chhccccCCCCCCCEEEEEECCCCEEEEEeC--CCHHHHHHHH
Confidence 4555666666666666778999999999999 5665554433
No 32
>smart00400 ZnF_CHCC zinc finger.
Probab=29.09 E-value=15 Score=23.64 Aligned_cols=15 Identities=33% Similarity=0.798 Sum_probs=12.0
Q ss_pred ccccccccCCCCCeEE
Q psy7758 215 VVEVCERCGAEGGEML 230 (235)
Q Consensus 215 ~~d~~~~c~~~~~~~~ 230 (235)
.-+.||.|| .||..+
T Consensus 22 n~~~Cf~cg-~gGd~i 36 (55)
T smart00400 22 QFFHCFGCG-AGGNVI 36 (55)
T ss_pred CEEEEeCCC-CCCCHH
Confidence 347999999 898754
No 33
>TIGR03569 NeuB_NnaB N-acetylneuraminate synthase. This family is a subset of the Pfam model pfam03102 and is believed to include only authentic NeuB N-acetylneuraminate (sialic acid) synthase enzymes. The majority of the genes identified by this model are observed adjacent to both the NeuA and NeuC genes which together effect the biosynthesis of CMP-N-acetylneuraminate from UDP-N-acetylglucosamine.
Probab=25.82 E-value=34 Score=30.87 Aligned_cols=20 Identities=25% Similarity=0.350 Sum_probs=17.0
Q ss_pred EEEEEEccCCCCCCeEEEec
Q psy7758 125 RVGLFALRDVPAGTELVFNY 144 (235)
Q Consensus 125 ~~~i~A~rdI~~GEElt~~Y 144 (235)
+-.|+|.+||++||.||.+-
T Consensus 277 rrsl~a~~di~~G~~lt~~~ 296 (329)
T TIGR03569 277 RKSLVAAKDIKKGEIFTEDN 296 (329)
T ss_pred ceEEEEccCcCCCCEecHHh
Confidence 55689999999999998754
No 34
>PF05077 DUF678: Protein of unknown function (DUF678); InterPro: IPR007769 This family contains poxvirus proteins belonging to the A19 family. The proteins are of unknown function.
Probab=25.66 E-value=34 Score=23.82 Aligned_cols=15 Identities=27% Similarity=0.523 Sum_probs=8.5
Q ss_pred CCCcccccccccccC
Q psy7758 209 KGEVPAVVEVCERCG 223 (235)
Q Consensus 209 ~~~~~~~~d~~~~c~ 223 (235)
+..+.+.||.|.-|.
T Consensus 15 ~t~v~~~~d~c~tCS 29 (74)
T PF05077_consen 15 KTTVDDEEDDCTTCS 29 (74)
T ss_pred CceeecCCCCccchh
Confidence 335554467766664
No 35
>PF00628 PHD: PHD-finger; InterPro: IPR019787 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents the PHD (homeodomain) zinc finger domain [,], which is a C4HC3 zinc-finger-like motif found in nuclear proteins thought to be involved in chromatin-mediated transcriptional regulation. The PHD finger motif is reminiscent of, but distinct from the C3HC4 type RING finger. The function of this domain is not yet known but in analogy with the LIM domain it could be involved in protein-protein interaction and be important for the assembly or activity of multicomponent complexes involved in transcriptional activation or repression. Alternatively, the interactions could be intra-molecular and be important in maintaining the structural integrity of the protein. In similarity to the RING finger and the LIM domain, the PHD finger is thought to bind two zinc ions. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0005515 protein binding; PDB: 3ZVY_A 2LGG_A 3SOW_A 3SOU_B 3ASL_A 3ASK_A 3ZVZ_B 3T6R_A 2LGK_A 3SOX_B ....
Probab=24.14 E-value=50 Score=20.48 Aligned_cols=15 Identities=40% Similarity=1.241 Sum_probs=10.8
Q ss_pred ccccCCC---CCeEEEeec
Q psy7758 219 CERCGAE---GGEMLRCAL 234 (235)
Q Consensus 219 ~~~c~~~---~~~~~~~~~ 234 (235)
|..|+ . .+++|.||.
T Consensus 2 C~vC~-~~~~~~~~i~C~~ 19 (51)
T PF00628_consen 2 CPVCG-QSDDDGDMIQCDS 19 (51)
T ss_dssp BTTTT-SSCTTSSEEEBST
T ss_pred CcCCC-CcCCCCCeEEcCC
Confidence 56666 4 688898874
No 36
>KOG2084|consensus
Probab=22.88 E-value=69 Score=29.35 Aligned_cols=25 Identities=24% Similarity=0.300 Sum_probs=20.9
Q ss_pred CCCceEEEEceeCCCCCEEEEeccE
Q psy7758 36 GSRGWGLKALTDLKRGQFVVEYVGE 60 (235)
Q Consensus 36 ~~~G~GlfA~~~I~~Ge~I~ey~G~ 60 (235)
+..|.|++|+++|.+|++|++-...
T Consensus 22 ~~~Gr~~~a~~~i~~g~~i~~e~p~ 46 (482)
T KOG2084|consen 22 PELGRGLVATQAIEAGEVILEEEPL 46 (482)
T ss_pred cccCcceeeecccCCCceEEecCcc
Confidence 6889999999999999988764333
No 37
>PRK09203 rplP 50S ribosomal protein L16; Reviewed
Probab=22.12 E-value=1.3e+02 Score=23.44 Aligned_cols=42 Identities=21% Similarity=0.251 Sum_probs=28.7
Q ss_pred CEEEEEcCCCceEEEEceeCCCCCEEEEeccEEeCHHHHHHHHH
Q psy7758 29 PLEVFNTGSRGWGLKALTDLKRGQFVVEYVGEMIDQKELNRRRR 72 (235)
Q Consensus 29 ~lev~~s~~~G~GlfA~~~I~~Ge~I~ey~G~ii~~~e~~~~~~ 72 (235)
+++++...++|-=-.-...|++|++|+|..| ++.+.+.+-..
T Consensus 78 ~~~~RMGkGKG~~~~~varVk~G~iifEi~~--~~~~~a~~al~ 119 (138)
T PRK09203 78 PAEVRMGKGKGSPEYWVAVVKPGRILFEIAG--VSEELAREALR 119 (138)
T ss_pred hhhccccCCCCCCcEEEEEECCCCEEEEEeC--CCHHHHHHHHH
Confidence 4455555666655566668999999999999 56655544443
No 38
>TIGR03170 flgA_cterm flagella basal body P-ring formation protein FlgA. This model describes a conserved C-terminal region of the flagellar basal body P-ring formation protein FlgA. This sequence region contains a SAF domain, now described by Pfam model pfam08666.
Probab=21.09 E-value=43 Score=25.04 Aligned_cols=16 Identities=19% Similarity=0.216 Sum_probs=13.8
Q ss_pred EEEccCCCCCCeEEEe
Q psy7758 128 LFALRDVPAGTELVFN 143 (235)
Q Consensus 128 i~A~rdI~~GEElt~~ 143 (235)
++|.++|++||-|+-+
T Consensus 2 ~Va~r~I~~G~~i~~~ 17 (122)
T TIGR03170 2 VVAKRPLKRGEVISPE 17 (122)
T ss_pred EEECcccCCCCCcCHH
Confidence 6899999999998764
No 39
>PF06718 DUF1203: Protein of unknown function (DUF1203); InterPro: IPR009593 This family consists of several hypothetical bacterial proteins of around 155 residues in length. Family members are present in Rhizobium, Agrobacterium and Streptomyces species.
Probab=21.09 E-value=75 Score=24.23 Aligned_cols=18 Identities=39% Similarity=0.711 Sum_probs=14.1
Q ss_pred EEccCCCCCCeE-EEecCC
Q psy7758 129 FALRDVPAGTEL-VFNYEL 146 (235)
Q Consensus 129 ~A~rdI~~GEEl-t~~Y~~ 146 (235)
+.++||++||++ .++|..
T Consensus 6 ~cL~~~~~Ge~~lLlsy~p 24 (117)
T PF06718_consen 6 HCLRDAEPGEELLLLSYRP 24 (117)
T ss_pred EecccCCCCCeEEEEecCC
Confidence 479999999985 468854
No 40
>smart00249 PHD PHD zinc finger. The plant homeodomain (PHD) finger is a C4HC3 zinc-finger-like motif found in nuclear proteins thought to be involved in epigenetics and chromatin-mediated transcriptional regulation. The PHD finger binds two zinc ions using the so-called 'cross-brace' motif and is thus structurally related to the CHL00044 rpl16 ribosomal protein L16
Probab=21.05 E-value=1.8e+02 Score=22.70 Aligned_cols=39 Identities=18% Similarity=0.186 Sum_probs=27.6
Q ss_pred CEEEEEcCCCceEEEEceeCCCCCEEEEeccEEeCHHHHHH
Q psy7758 29 PLEVFNTGSRGWGLKALTDLKRGQFVVEYVGEMIDQKELNR 69 (235)
Q Consensus 29 ~lev~~s~~~G~GlfA~~~I~~Ge~I~ey~G~ii~~~e~~~ 69 (235)
+.+++...++|-=..-...|++|++|+|..| ++.+.+.+
T Consensus 78 p~e~RMGkGKG~~~~~va~V~~G~ilfEi~g--~~~~~ak~ 116 (135)
T CHL00044 78 PAETRMGSGKGSPEYWVAVVKPGRILYEMGG--VSETIARA 116 (135)
T ss_pred cccccccCCCCCccEEEEEECCCcEEEEEeC--CCHHHHHH
Confidence 3566666667766677778999999999998 34444333
Done!