Query psy13861
Match_columns 1048
No_of_seqs 563 out of 2193
Neff 4.3
Searched_HMMs 46136
Date Fri Aug 16 22:18:15 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy13861.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/13861hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1079|consensus 100.0 1.1E-93 2.4E-98 817.3 24.2 509 136-1048 80-647 (739)
2 KOG1079|consensus 100.0 1.9E-57 4.1E-62 520.3 13.1 429 1-587 268-719 (739)
3 KOG4442|consensus 100.0 7.4E-31 1.6E-35 304.0 10.9 130 470-599 128-259 (729)
4 KOG1080|consensus 99.9 2.7E-27 5.7E-32 288.6 10.1 129 470-599 874-1004(1005)
5 smart00317 SET SET (Su(var)3-9 99.9 1E-21 2.2E-26 180.3 12.1 107 470-576 8-116 (116)
6 KOG1082|consensus 99.8 2.9E-20 6.4E-25 209.0 8.4 128 472-599 186-353 (364)
7 KOG1083|consensus 99.7 5.2E-19 1.1E-23 211.3 2.3 113 468-580 1184-1297(1306)
8 KOG1085|consensus 99.7 2.1E-17 4.6E-22 177.2 7.7 114 469-582 263-382 (392)
9 KOG4442|consensus 99.6 6E-16 1.3E-20 181.1 7.0 84 954-1045 87-170 (729)
10 PF00856 SET: SET domain; Int 99.5 4.7E-14 1E-18 134.3 8.1 105 473-577 1-162 (162)
11 COG2940 Proteins containing SE 99.5 1.1E-14 2.4E-19 169.4 2.0 131 469-599 339-479 (480)
12 KOG1141|consensus 99.5 2.5E-14 5.3E-19 168.2 4.1 77 523-599 1179-1261(1262)
13 KOG1082|consensus 99.3 9.4E-13 2E-17 148.8 6.7 72 960-1041 151-222 (364)
14 KOG2589|consensus 98.8 5.5E-09 1.2E-13 116.4 6.6 125 471-603 136-261 (453)
15 KOG1081|consensus 98.4 5.4E-08 1.2E-12 113.7 0.9 115 477-598 319-435 (463)
16 KOG1080|consensus 98.4 1.2E-07 2.7E-12 118.2 4.0 55 994-1048 865-921 (1005)
17 KOG1141|consensus 98.2 3.6E-07 7.7E-12 109.4 1.4 58 982-1039 786-843 (1262)
18 KOG1083|consensus 97.7 5.7E-06 1.2E-10 101.7 -1.3 55 983-1037 1165-1220(1306)
19 KOG2461|consensus 97.6 3.3E-05 7.3E-10 89.2 3.6 112 466-585 34-151 (396)
20 smart00317 SET SET (Su(var)3-9 97.4 0.00034 7.4E-09 64.4 6.2 49 997-1045 2-50 (116)
21 KOG1085|consensus 97.3 0.00019 4.1E-09 79.0 4.3 55 988-1042 249-303 (392)
22 COG2940 Proteins containing SE 95.2 0.0047 1E-07 73.1 -0.2 64 983-1046 320-383 (480)
23 smart00570 AWS associated with 92.9 0.042 9.1E-07 47.1 0.9 35 948-993 16-50 (51)
24 cd00167 SANT 'SWI3, ADA2, N-Co 92.6 0.3 6.4E-06 38.0 5.4 42 814-855 1-43 (45)
25 smart00717 SANT SANT SWI3, AD 92.4 0.35 7.5E-06 38.1 5.5 43 813-855 2-45 (49)
26 PF03638 TCR: Tesmin/TSO1-like 79.6 1.2 2.6E-05 37.1 1.8 28 940-967 2-30 (42)
27 PF00249 Myb_DNA-binding: Myb- 77.2 5.7 0.00012 32.8 5.2 43 813-855 2-46 (48)
28 PF13921 Myb_DNA-bind_6: Myb-l 75.2 6.3 0.00014 33.6 5.1 41 815-855 1-41 (60)
29 KOG2084|consensus 72.8 4.9 0.00011 46.4 5.1 42 536-581 208-250 (482)
30 KOG1337|consensus 71.7 2.8 6E-05 50.1 2.8 41 536-579 239-279 (472)
31 TIGR01557 myb_SHAQKYF myb-like 62.0 20 0.00044 31.5 5.5 45 813-857 4-54 (57)
32 PF05033 Pre-SET: Pre-SET moti 59.9 6.2 0.00013 37.2 2.2 20 958-987 84-103 (103)
33 KOG1171|consensus 58.6 3.6 7.7E-05 48.6 0.4 52 913-965 140-242 (406)
34 smart00570 AWS associated with 49.3 6.3 0.00014 34.1 0.3 29 912-940 16-44 (51)
35 KOG3813|consensus 44.5 9.9 0.00021 46.0 1.0 23 905-929 309-332 (640)
36 PF05033 Pre-SET: Pre-SET moti 35.6 22 0.00047 33.6 1.6 36 903-940 46-103 (103)
37 smart00508 PostSET Cysteine-ri 34.0 19 0.00042 27.4 0.8 15 585-599 2-16 (26)
38 PF03638 TCR: Tesmin/TSO1-like 26.9 36 0.00078 28.6 1.3 36 904-941 4-40 (42)
39 PF08666 SAF: SAF domain; Int 24.8 40 0.00087 28.7 1.3 15 559-573 3-17 (63)
40 smart00468 PreSET N-terminal t 24.1 50 0.0011 31.2 1.9 23 939-961 47-70 (98)
41 PF11403 Yeast_MT: Yeast metal 23.1 67 0.0015 26.0 2.1 10 925-934 18-27 (40)
42 KOG4167|consensus 22.7 1.1E+02 0.0025 38.9 5.0 39 811-849 618-656 (907)
No 1
>KOG1079|consensus
Probab=100.00 E-value=1.1e-93 Score=817.30 Aligned_cols=509 Identities=46% Similarity=0.814 Sum_probs=436.9
Q ss_pred CCcccccccccccccccccccCCCCccccccccccccCCCCCCCCCcccccccCCCcccccccccCCcccCCCccccccc
Q psy13861 136 NNIEVEPVSTTTSFSLLGLMGHEGGRYCGCFSTEWKHGLSGSSPMRDTSIHSLRSPYTSIRTCVTGASYRTEPAVYLPVK 215 (1048)
Q Consensus 136 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~ 215 (1048)
++++..++.+++.++.+++|+ .|.+++++++..+.|++|++ ||| |+++.+.+++|++++
T Consensus 80 ~~~~~~~i~~~n~~~~v~~~~------------~~~~~q~nfmv~~~~~~~~i--p~~-------~~~v~~~k~~~ieel 138 (739)
T KOG1079|consen 80 FPSQKSPINELNAVAQVPIMY------------SWPPLQQNFMVEDETVLHNI--PYM-------GDEVLDIKGPFIEEL 138 (739)
T ss_pred Ccccccchhhhcccccccccc------------cCChhhhcceecccceeccc--ccc-------cccccccccchhhhc
Confidence 889999999999999999999 99999999999999999999 999 999999999999999
Q ss_pred cccCCCccccCCCCcCcchhHHHHHHhhhhhhh------------------------------------------cccch
Q psy13861 216 LKNYDGKVHGDTGSAGFLDNQIFIELVNDLIKY------------------------------------------QVKDS 253 (1048)
Q Consensus 216 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------------------------------------~~~~~ 253 (1048)
++ |||+||||+ ..+++.+++||+|++.+-+| .||..
T Consensus 139 ~~-y~~~v~~dr-~~~~~~d~v~ve~~~a~~Q~~~e~dg~D~~~e~~~~~ekr~~~e~~~~~~~~~~~~~~~~~~~if~~ 216 (739)
T KOG1079|consen 139 IK-YDGKVHGDR-NQRFMEDQVFVELVVALYQYGGEHDGSDDEEEEVLEEEKRDFLEGEDDDIIESINKLSFPADKIFQA 216 (739)
T ss_pred cc-ccceeeccc-cccchhhhhHHHHHHHHHhcCCccccCCCccccchhhhcccccCcccchhhHhhhhhccchHHHHHH
Confidence 99 999999999 59999999999999999988 22333
Q ss_pred hhhhcCCCCchHHHhhhccCCCCCCCCCCCCCCCCCCCCCCCCCCCCcccccccccccccchhhhccCCCCCCCCCCCCc
Q psy13861 254 EEESNSNKGSAEELRDKYIELPEQTDPNASPPECTPNVDGPTAESVPREQTMHSFHTLICPNLMRRKRPDLKPFSDPCSP 333 (1048)
Q Consensus 254 ~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 333 (1048)
...++++++.+.+|+++|.+||++++|.+.+++||||+||+.|++|+|+|++||||||
T Consensus 217 ~~~~f~~k~~~~~lke~~~~l~~~~~p~~~e~~~~~~id~~~ae~~~r~~~l~sF~tl---------------------- 274 (739)
T KOG1079|consen 217 ISSMFPDKLTASELKERYGELTSKSLPVAEEPECTPNIDGSSAEPVQREQALHSFHTL---------------------- 274 (739)
T ss_pred HhhhcccccchhhhhHHHhhhhhccccccCCcccccCCCccccChHHHHhhhcccccc----------------------
Confidence 4456699999999999999999999999999999999999999988888877777643
Q ss_pred cchhccchhHHHHHhhhhhHHHHHHHhhhccchhcccccccccccccchhhhcccccccccccccccCCCCCCCCCCCCC
Q psy13861 334 DCYMLLDGMKEKIEAKIKDEEEQEMKKKTKLDLEEDDKMQVDDQNAVQATEVKTTKGKLSIEKQVSLDSGSGNDASSEDS 413 (1048)
Q Consensus 334 ~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 413 (1048)
T Consensus 275 -------------------------------------------------------------------------------- 274 (739)
T KOG1079|consen 275 -------------------------------------------------------------------------------- 274 (739)
T ss_pred --------------------------------------------------------------------------------
Confidence
Q ss_pred CCccccccccccCCCCcccccccccccccCCCCCccccccCccccceeeEEeeccccceeeEEeeccccCCCceEEEEEe
Q psy13861 414 NDSRDLKNNIEVEPVSTTTSFSLLGLMEHEGNNEWTLDRLRPIHFRAIHKVLYNNYCAIAQVMMTKTCQQKNEFISEYCG 493 (1048)
Q Consensus 414 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~W~e~r~spvh~r~L~k~~~~g~~~~G~GLFAtrdI~KGEfI~EY~G 493 (1048)
T Consensus 275 -------------------------------------------------------------------------------- 274 (739)
T KOG1079|consen 275 -------------------------------------------------------------------------------- 274 (739)
T ss_pred --------------------------------------------------------------------------------
Confidence
Q ss_pred EEEeHHHHhhhhhccccccceeeeecCCCcccccccCCCccccccCCCCCCeeEEEEEEcCeeEEEEEEccCCCCCCeEE
Q psy13861 494 EIISQDEADRRGKVYDKYMCSFLFNLNNDFVVDATRKGNKIRFANHSINPNCYAKVMMVNGDHRIGIFAKRAILPGEELY 573 (1048)
Q Consensus 494 EVIt~~Eae~R~~~yd~~~~sYlf~ld~~~vIDAt~~GN~ARFINHSC~PN~~v~~v~v~g~~rI~ifA~RDI~aGEELT 573 (1048)
T Consensus 275 -------------------------------------------------------------------------------- 274 (739)
T KOG1079|consen 275 -------------------------------------------------------------------------------- 274 (739)
T ss_pred --------------------------------------------------------------------------------
Confidence
Q ss_pred EecCCCCCCCcceeeccCCccccchhhhccccCCCCcchhhhccchhhHHHHhhhhhhhhcccccCCCcccccccccccc
Q psy13861 574 FDYRYGPTEQLKFVVTLDSNVANKYIYEWDFNLRSPVSATILFGNMRAMEIKNYQSSKVVLGKNKTGGILMPLELLREAN 653 (1048)
Q Consensus 574 ~DYg~~~~~~~k~~C~Cg~~~CRk~I~ewd~n~~~p~~a~~l~g~~~a~~ik~~~~rrk~~~K~~~~g~~i~lei~~e~~ 653 (1048)
T Consensus 275 -------------------------------------------------------------------------------- 274 (739)
T KOG1079|consen 275 -------------------------------------------------------------------------------- 274 (739)
T ss_pred --------------------------------------------------------------------------------
Confidence
Q ss_pred cCcccccCCccccccccccccccCCCCcccccccCCCCCCCCCCChhhHhhhhcchhhHHHhhcchHHHHHh---hhccc
Q psy13861 654 TSCQYDTAGRCYKYDCFLHRLKDHHSGPNLMRRKRPDLKPFSDPCSPDCYMLLDGMKEKIEAEIKDEEEQEM---KKKTK 730 (1048)
Q Consensus 654 ~E~~~~~crRC~KYDCflH~~~~~~~tp~~~krk~~e~~~~~~PCg~~Cfl~l~g~~E~~~~~~~~~~~k~~---~~~r~ 730 (1048)
+|||||+||||||+ .++|++||+++|++.+.+.+.+|||+.||++|+||+|+.+ .... ++.. |.+|+
T Consensus 275 ------fCrrCl~ydC~lHg-~~~~~~pn~~~r~e~~~a~~~~pc~p~~~~~l~~~~~~~m-~~~~--~~~~p~~g~~~q 344 (739)
T KOG1079|consen 275 ------FCRRCLKYDCFLHG-SQFHAFPNTKKRKEDEPALENEPCGPGCYGLLEGAKEKTM-SAVV--SKCPPIRGDIRQ 344 (739)
T ss_pred ------eeeeeeeeeccccC-ccccccccccccCCCCccccccCCCCchhhhhhccchhhh-hccc--ccCCCCcchhhh
Confidence 46789999999998 4679999999999999999999999999999999999933 3333 2222 22222
Q ss_pred cccchhhhhhccchhhcccccccccCCcccccccccCCC----CCC-------CCCCCCCCCCcccccCCCcccCCcccc
Q psy13861 731 LDLEEDDKMQVDDQNAVQATEVKTTKGKLSIEKQVSLDS----GSG-------NDASSEDSNDSKDLKNNTEVEPVSTTT 799 (1048)
Q Consensus 731 ~~~~~~~k~~~~~~~~~~~~~~s~~~~~~s~~k~~~~d~----~~~-------~~~~~~~s~~~s~~~~~~~~~~~~~~~ 799 (1048)
+ +. +.+...|...-.....+..++|. +++ .++.+..|+.++ +||+|+.+
T Consensus 345 k-~~------------~~~~~~s~~~~~~~e~~g~~~d~~v~~~~~~~~~~v~~~~~~~~s~~~~-------~c~~~~~~ 404 (739)
T KOG1079|consen 345 K-LV------------KASSMDSDDEHVEEEDKGHDDDDGVPRGFGGSVNFVGEDDTSTHSSTNS-------ICQNPVHG 404 (739)
T ss_pred h-hc------------ccccCCcchhhccccccCcccccccccccccccccccCCcccccccccc-------cccCcccc
Confidence 2 22 12211122222222233344444 233 133344455566 99998754
Q ss_pred cccccccCCCCCCCCCCHHHHHHHHHHHHhcCCCchHHHhhcCCCChHHHHHHHHHhhcccccCCCC-CCCCCcchhhcc
Q psy13861 800 SFSLLGLMGHEGNNEWTGSDQSLFRAIHKVLYNNYCAIAQVMMTKTCQQVYQFAQKEAADITTEDSA-NDTTPPRKKKKK 878 (1048)
Q Consensus 800 ~~~~~~~~~~~~~~~Wt~~E~sL~r~l~~~~~~N~C~IA~~lg~KTC~EV~~~~~~~~~~~~~~p~~-~~~~~prKKkrK 878 (1048)
... .+.+|+++|..||++|+.+|++|+|+||+++++|||++||+|++.+....+..+.. ...+++++++++
T Consensus 405 ~~~--------~~~ew~~~ek~~fr~~~~~~~~n~c~Iar~l~~ktC~~v~~~~~~e~~~~~~~~~~~~~~~~~~~r~~~ 476 (739)
T KOG1079|consen 405 KKD--------TNVEWNGAEKVLFRVGSTLYGTNRCSIARNLLTKTCRQVYEYEQKEVLQGLYFDGRFRVELPGPKRARK 476 (739)
T ss_pred cCC--------cccccchhhhHHHHhccccccchhhHHHHHhcchHHHHHHHHhhcchhhceecccccccccCcchhhHH
Confidence 333 46799999999999999999999999999999999999999999876444334432 224677888999
Q ss_pred ccchhhhHhHhhhccCCCCCceeccccCCCCCCCC-CCCCCceecCCCCccccccCCccccccCCCCCcCCCCCCCCccc
Q psy13861 879 HRLWSVHCRKIQLKKDSSSNHVHNFTPCRHPPTQP-CDASCPCVSAQNFCEKFCKCSFDCQNRFPGCRCKAQCNTKQCPC 957 (1048)
Q Consensus 879 ~R~w~~h~rki~~kkd~~~~~~~~y~PC~H~~ggp-C~~~C~C~~~~~~Cek~C~C~~~C~nRFpGC~Ck~~C~tk~CpC 957 (1048)
+|+|..|+|+++.+++...++++.||||+|+ |++ |+.+|+|+.++++|||||+|+++|+|||+||+|+++|++++|||
T Consensus 477 ~r~~g~~r~k~q~kk~~~~~~v~~~qpC~hp-~~c~c~~~C~C~~n~~~CEk~C~C~~dC~nrF~GC~Ck~QC~tkqCpC 555 (739)
T KOG1079|consen 477 LRLWGRHRRKIQNKKDSRHTVVWNYQPCDHP-GPCNCGVGCPCIDNETFCEKFCYCSPDCRNRFPGCRCKAQCNTKQCPC 555 (739)
T ss_pred HHhhhhHHHhhhcccccCCceeeecCcccCC-CCCCCCCCCcccccCcchhhcccCCHHHHhcCCCCCcccccccCcCch
Confidence 9999999999999999999999999999999 555 67899999999999999999999999999999999999999999
Q ss_pred cccccccCCCCCccCC-CCcCcCCcccCcchhhhcceeeeeEeeecCCcceEEEeCCccCCCCeEEEeeceecCHHHHHH
Q psy13861 958 YLAVRECDPDLCQTCG-ADQFDVSKISCKNVSVQRGLHKHLLMAPSDVAGWGIFLKDSAQKNEFISEYCGEIISQDEADR 1036 (1048)
Q Consensus 958 ~~a~rECdPdlC~~Cg-~~~~d~~~~~C~Nr~lQrG~~k~L~V~kS~~kGwGlfa~e~I~kGeFI~EYvGEvIS~~EAdR 1036 (1048)
++|+|||||++|..|| ++++++...+|+|..+|+|++++|+|+.|.+.|||||+++++.|++||+||+||+||++||||
T Consensus 556 ~~A~rECdPd~Cl~cg~~~~~d~~~~~C~N~~l~~~~qkr~llapSdVaGwGlFlKe~v~KnefisEY~GE~IS~dEADr 635 (739)
T KOG1079|consen 556 YLAVRECDPDVCLMCGNVDHFDSSKISCKNTNLQRGEQKRVLLAPSDVAGWGLFLKESVSKNEFISEYTGEIISHDEADR 635 (739)
T ss_pred hhhccccCchHHhccCcccccccCccccccchhhhhhhcceeechhhccccceeeccccCCCceeeeecceeccchhhhh
Confidence 9999999999999999 568999989999999999999999999999999999999999999999999999999999999
Q ss_pred HHhhhhhcCCCC
Q psy13861 1037 RGKVYDKYMCSF 1048 (1048)
Q Consensus 1037 RGkvYDk~~~Sy 1048 (1048)
||++||++||||
T Consensus 636 RGkiYDr~~cSf 647 (739)
T KOG1079|consen 636 RGKIYDRYMCSF 647 (739)
T ss_pred ccccccccccee
Confidence 999999999998
No 2
>KOG1079|consensus
Probab=100.00 E-value=1.9e-57 Score=520.28 Aligned_cols=429 Identities=37% Similarity=0.484 Sum_probs=307.8
Q ss_pred CCccchhhcccccccccccccccCCCCCCccccccCCCCCCCCCCCchhhhhhhhhhhHHHhHhhhhHHHHHHhhhcccc
Q psy13861 1 MHSFHTLFCRRCYKYDCFLHRLKDHHSGPNLMRRKRPDLKPFSDPCSPDCYMLLDGMKEKIEAEIKDEEEQEMKKKTKLD 80 (1048)
Q Consensus 1 ~~s~~~l~c~r~~kydc~~~~~~~~~~~~~~~~r~~~~~~~~~~~c~~~c~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (1048)
||||||||||||++||||||+. ..|.+||+++|+..+.+++.+||||.||++|+|++++. |....
T Consensus 268 l~sF~tlfCrrCl~ydC~lHg~-~~~~~pn~~~r~e~~~a~~~~pc~p~~~~~l~~~~~~~-----------m~~~~--- 332 (739)
T KOG1079|consen 268 LHSFHTLFCRRCLKYDCFLHGS-QFHAFPNTKKRKEDEPALENEPCGPGCYGLLEGAKEKT-----------MSAVV--- 332 (739)
T ss_pred hcccccceeeeeeeeeccccCc-cccccccccccCCCCccccccCCCCchhhhhhccchhh-----------hhccc---
Confidence 7999999999999999999984 36999999999999999999999999999999999991 11110
Q ss_pred hhhhhhhhcchhhhhhhhhhhcccCccc-cccccccCCCCCCCCCCCCCcccccccCCcccccccccccccccccccCCC
Q psy13861 81 LEEDDKMQVDDQNAVQATEVKTTKGKLS-IEKQVSLDSGSGNDASSEDSNDSRDLKNNIEVEPVSTTTSFSLLGLMGHEG 159 (1048)
Q Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 159 (1048)
...+....+++. +.+. ++++|..+...+.+.++. ... .+.++.
T Consensus 333 ----------------~~~~p~~g~~~qk~~~~--~~~~s~~~~~~~e~~g~~---------------~d~---~v~~~~ 376 (739)
T KOG1079|consen 333 ----------------SKCPPIRGDIRQKLVKA--SSMDSDDEHVEEEDKGHD---------------DDD---GVPRGF 376 (739)
T ss_pred ----------------ccCCCCcchhhhhhccc--ccCCcchhhccccccCcc---------------ccc---cccccc
Confidence 011111111111 1111 122222222111111110 011 112222
Q ss_pred CccccccccccccCCCCCCCCCcccccccCCCcccccccccCCcccCCCccccccccccCCCccccCCCCcCcchhHHHH
Q psy13861 160 GRYCGCFSTEWKHGLSGSSPMRDTSIHSLRSPYTSIRTCVTGASYRTEPAVYLPVKLKNYDGKVHGDTGSAGFLDNQIFI 239 (1048)
Q Consensus 160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 239 (1048)
+..| ++--++|++.|+-..++. . .-+. +. ++ -.-=|+|.+.+||+.+..
T Consensus 377 ~~~~------------~~v~~~~~~~~s~~~~~c-----~--~~~~------~~---~~---~~~ew~~~ek~~fr~~~~ 425 (739)
T KOG1079|consen 377 GGSV------------NFVGEDDTSTHSSTNSIC-----Q--NPVH------GK---KD---TNVEWNGAEKVLFRVGST 425 (739)
T ss_pred cccc------------ccccCCcccccccccccc-----c--Cccc------cc---CC---cccccchhhhHHHHhccc
Confidence 2111 111125555555533333 0 0000 00 00 011599999999999999
Q ss_pred HHhhhhhhh----------cccchhhhhcCCCCchHH-HhhhccCCCCCCCCCCCCCCC----------CCCCCCCCCCC
Q psy13861 240 ELVNDLIKY----------QVKDSEEESNSNKGSAEE-LRDKYIELPEQTDPNASPPEC----------TPNVDGPTAES 298 (1048)
Q Consensus 240 ~~~~~~~~~----------~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~q~~~~~~~~~----------~~~~~~~~~~~ 298 (1048)
-+..|.|.| +||++++++..-.--.-- -+...+.+++.++.+.|.-+| .+.|...+||.
T Consensus 426 ~~~~n~c~Iar~l~~ktC~~v~~~~~~e~~~~~~~~~~~~~~~~~~~r~~~~r~~g~~r~k~q~kk~~~~~~v~~~qpC~ 505 (739)
T KOG1079|consen 426 LYGTNRCSIARNLLTKTCRQVYEYEQKEVLQGLYFDGRFRVELPGPKRARKLRLWGRHRRKIQNKKDSRHTVVWNYQPCD 505 (739)
T ss_pred cccchhhHHHHHhcchHHHHHHHHhhcchhhceecccccccccCcchhhHHHHhhhhHHHhhhcccccCCceeeecCccc
Confidence 999898887 899999865421110000 234678888899999999999 67788899999
Q ss_pred CCccc-ccccccccccchhhhccCCCCCCCCCCCCccchhccchhHHHHHhhhhhHHHHHHHhhhccchhcccccccccc
Q psy13861 299 VPREQ-TMHSFHTLICPNLMRRKRPDLKPFSDPCSPDCYMLLDGMKEKIEAKIKDEEEQEMKKKTKLDLEEDDKMQVDDQ 377 (1048)
Q Consensus 299 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~~~~~~~~ 377 (1048)
||+.- |...+.+|.=.+++++++ .|+++|.++|+|++++.|+.++ ||||+
T Consensus 506 hp~~c~c~~~C~C~~n~~~CEk~C--------~C~~dC~nrF~GC~Ck~QC~tk---------------------qCpC~ 556 (739)
T KOG1079|consen 506 HPGPCNCGVGCPCIDNETFCEKFC--------YCSPDCRNRFPGCRCKAQCNTK---------------------QCPCY 556 (739)
T ss_pred CCCCCCCCCCCcccccCcchhhcc--------cCCHHHHhcCCCCCcccccccC---------------------cCchh
Confidence 99999 999999999999999999 9999999999999999999766 99999
Q ss_pred cccchhhhcccccccccccccccCCCCCCCCCCCCCCCccccccccccCCCCcccccccccccccCCCCCccccccCccc
Q psy13861 378 NAVQATEVKTTKGKLSIEKQVSLDSGSGNDASSEDSNDSRDLKNNIEVEPVSTTTSFSLLGLMEHEGNNEWTLDRLRPIH 457 (1048)
Q Consensus 378 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~W~e~r~spvh 457 (1048)
.|+++++ .+++..++..+--.....++.+..-++.....+.+..|
T Consensus 557 ~A~rECd--------------------------------Pd~Cl~cg~~~~~d~~~~~C~N~~l~~~~qkr~llapS--- 601 (739)
T KOG1079|consen 557 LAVRECD--------------------------------PDVCLMCGNVDHFDSSKISCKNTNLQRGEQKRVLLAPS--- 601 (739)
T ss_pred hhccccC--------------------------------chHHhccCcccccccCccccccchhhhhhhcceeechh---
Confidence 9999996 12222222211111111122222111222222222222
Q ss_pred cceeeEEeeccccceeeEEeeccccCCCceEEEEEeEEEeHHHHhhhhhccccccceeeeecCCCcccccccCCCccccc
Q psy13861 458 FRAIHKVLYNNYCAIAQVMMTKTCQQKNEFISEYCGEIISQDEADRRGKVYDKYMCSFLFNLNNDFVVDATRKGNKIRFA 537 (1048)
Q Consensus 458 ~r~L~k~~~~g~~~~G~GLFAtrdI~KGEfI~EY~GEVIt~~Eae~R~~~yd~~~~sYlf~ld~~~vIDAt~~GN~ARFI 537 (1048)
..-|||||+++.+.|++||+||+||+|+++|+++|+..|+.++.+|+|+|+.+++|||++.||.+||+
T Consensus 602 ------------dVaGwGlFlKe~v~KnefisEY~GE~IS~dEADrRGkiYDr~~cSflFnln~dyviDs~rkGnk~rFA 669 (739)
T KOG1079|consen 602 ------------DVAGWGLFLKESVSKNEFISEYTGEIISHDEADRRGKIYDRYMCSFLFNLNNDYVIDSTRKGNKIRFA 669 (739)
T ss_pred ------------hccccceeeccccCCCceeeeecceeccchhhhhcccccccccceeeeeccccceEeeeeecchhhhc
Confidence 33789999999999999999999999999999999999999999999999999999999999999999
Q ss_pred cCCCCCCeeEEEEEEcCeeEEEEEEccCCCCCCeEEEecCCCCCCCccee
Q psy13861 538 NHSINPNCYAKVMMVNGDHRIGIFAKRAILPGEELYFDYRYGPTEQLKFV 587 (1048)
Q Consensus 538 NHSC~PN~~v~~v~v~g~~rI~ifA~RDI~aGEELT~DYg~~~~~~~k~~ 587 (1048)
|||-+|||++.+++|.|.+||+|||.|+|.+||||||||+|+.+.+.+|.
T Consensus 670 NHS~nPNCYAkvm~V~GdhRIGifAkRaIeagEELffDYrYs~~~~~k~~ 719 (739)
T KOG1079|consen 670 NHSFNPNCYAKVMMVAGDHRIGIFAKRAIEAGEELFFDYRYSPEHALKFV 719 (739)
T ss_pred cCCCCCCcEEEEEEecCCcceeeeehhhcccCceeeeeeccCcccccccc
Confidence 99999999999999999999999999999999999999999988877654
No 3
>KOG4442|consensus
Probab=99.97 E-value=7.4e-31 Score=304.02 Aligned_cols=130 Identities=33% Similarity=0.485 Sum_probs=125.1
Q ss_pred cceeeEEeeccccCCCceEEEEEeEEEeHHHHhhhhhccccc--cceeeeecCCCcccccccCCCccccccCCCCCCeeE
Q psy13861 470 CAIAQVMMTKTCQQKNEFISEYCGEIISQDEADRRGKVYDKY--MCSFLFNLNNDFVVDATRKGNKIRFANHSINPNCYA 547 (1048)
Q Consensus 470 ~~~G~GLFAtrdI~KGEfI~EY~GEVIt~~Eae~R~~~yd~~--~~sYlf~ld~~~vIDAt~~GN~ARFINHSC~PN~~v 547 (1048)
..+||||+|..+|++|+||+||.||||...+++.|...|+.. .++|+|-+..+.+|||+.+||+||||||||+|||++
T Consensus 128 e~KG~GLRA~~dI~~g~FI~EY~GEVI~~~Ef~kR~~~Y~~d~~kh~Yfm~L~~~e~IDAT~KGnlaRFiNHSC~PNa~~ 207 (729)
T KOG4442|consen 128 EKKGCGLRAEEDIPKGQFILEYIGEVIEEKEFEKRVKRYAKDGIKHYYFMALQGGEYIDATKKGNLARFINHSCDPNAEV 207 (729)
T ss_pred cCcccceeeccccCCCcEEeeeccccccHHHHHHHHHHHHhcCCceEEEEEecCCceecccccCcHHHhhcCCCCCCcee
Confidence 379999999999999999999999999999999999999865 588999999999999999999999999999999999
Q ss_pred EEEEEcCeeEEEEEEccCCCCCCeEEEecCCCCCCCcceeeccCCccccchh
Q psy13861 548 KVMMVNGDHRIGIFAKRAILPGEELYFDYRYGPTEQLKFVVTLDSNVANKYI 599 (1048)
Q Consensus 548 ~~v~v~g~~rI~ifA~RDI~aGEELT~DYg~~~~~~~k~~C~Cg~~~CRk~I 599 (1048)
+.|+|.+..||+|||.|.|.+||||||||+++.+......|.||...|++||
T Consensus 208 ~KWtV~~~lRvGiFakk~I~~GEEITFDYqf~rYGr~AQ~CyCgeanC~G~I 259 (729)
T KOG4442|consen 208 QKWTVPDELRVGIFAKKVIKPGEEITFDYQFDRYGRDAQPCYCGEANCRGWI 259 (729)
T ss_pred eeeeeCCeeEEEEeEecccCCCceeeEecccccccccccccccCCccccccc
Confidence 9999999999999999999999999999999988888889999999999999
No 4
>KOG1080|consensus
Probab=99.94 E-value=2.7e-27 Score=288.58 Aligned_cols=129 Identities=37% Similarity=0.639 Sum_probs=122.2
Q ss_pred cceeeEEeeccccCCCceEEEEEeEEEeHHHHhhhhhccccc--cceeeeecCCCcccccccCCCccccccCCCCCCeeE
Q psy13861 470 CAIAQVMMTKTCQQKNEFISEYCGEIISQDEADRRGKVYDKY--MCSFLFNLNNDFVVDATRKGNKIRFANHSINPNCYA 547 (1048)
Q Consensus 470 ~~~G~GLFAtrdI~KGEfI~EY~GEVIt~~Eae~R~~~yd~~--~~sYlf~ld~~~vIDAt~~GN~ARFINHSC~PN~~v 547 (1048)
..-||||||.+.|.+|++|+||+||+|....++.|+..|... +.+|+|.++.+.||||+..||+||||||||.|||++
T Consensus 874 ~iH~wglfa~~~i~~~dmViEY~Ge~vR~~iad~RE~~Y~~~gi~~sYlfrid~~~ViDAtk~gniAr~InHsC~PNCya 953 (1005)
T KOG1080|consen 874 GIHGWGLFAMENIAAGDMVIEYRGELVRSSIADLREARYERMGIGDSYLFRIDDEVVVDATKKGNIARFINHSCNPNCYA 953 (1005)
T ss_pred cccccceeeccCccccceEEEeeceehhhhHHHHHHHHHhccCcccceeeecccceEEeccccCchhheeecccCCCcee
Confidence 446899999999999999999999999999999999999876 578999999999999999999999999999999999
Q ss_pred EEEEEcCeeEEEEEEccCCCCCCeEEEecCCCCCCCcceeeccCCccccchh
Q psy13861 548 KVMMVNGDHRIGIFAKRAILPGEELYFDYRYGPTEQLKFVVTLDSNVANKYI 599 (1048)
Q Consensus 548 ~~v~v~g~~rI~ifA~RDI~aGEELT~DYg~~~~~~~k~~C~Cg~~~CRk~I 599 (1048)
+++.|+|..+|+|||.|+|.+||||||||.+..++. +.+|.||+.+||+.+
T Consensus 954 kvi~V~g~~~IvIyakr~I~~~EElTYDYkF~~e~~-kipClCgap~Crg~~ 1004 (1005)
T KOG1080|consen 954 KVITVEGDKRIVIYSKRDIAAGEELTYDYKFPTEDD-KIPCLCGAPNCRGFL 1004 (1005)
T ss_pred eEEEecCeeEEEEEEecccccCceeeeecccccccc-ccccccCCCcccccc
Confidence 999999999999999999999999999999976665 899999999999976
No 5
>smart00317 SET SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain. Putative methyl transferase, based on outlier plant homologues
Probab=99.87 E-value=1e-21 Score=180.30 Aligned_cols=107 Identities=48% Similarity=0.721 Sum_probs=95.7
Q ss_pred cceeeEEeeccccCCCceEEEEEeEEEeHHHHhhhhhcccccc--ceeeeecCCCcccccccCCCccccccCCCCCCeeE
Q psy13861 470 CAIAQVMMTKTCQQKNEFISEYCGEIISQDEADRRGKVYDKYM--CSFLFNLNNDFVVDATRKGNKIRFANHSINPNCYA 547 (1048)
Q Consensus 470 ~~~G~GLFAtrdI~KGEfI~EY~GEVIt~~Eae~R~~~yd~~~--~sYlf~ld~~~vIDAt~~GN~ARFINHSC~PN~~v 547 (1048)
+..|+||||+++|++|++|++|.|.++...+...+...|.... ..|+|.+...++||+...||++|||||||.||+.+
T Consensus 8 ~~~G~gl~a~~~i~~g~~i~~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~id~~~~~~~~~~iNHsc~pN~~~ 87 (116)
T smart00317 8 PGKGWGVRATEDIPKGEFIGEYVGEIITSEEAEERSKAYDTDGADSFYLFEIDSDLCIDARRKGNIARFINHSCEPNCEL 87 (116)
T ss_pred CCCcEEEEECCccCCCCEEEEEEeEEECHHHHHHHHHHHHhcCCCCEEEEECCCCEEEeCCccCcHHHeeCCCCCCCEEE
Confidence 4799999999999999999999999999888877654444433 47889988889999999999999999999999999
Q ss_pred EEEEEcCeeEEEEEEccCCCCCCeEEEec
Q psy13861 548 KVMMVNGDHRIGIFAKRAILPGEELYFDY 576 (1048)
Q Consensus 548 ~~v~v~g~~rI~ifA~RDI~aGEELT~DY 576 (1048)
..+..++..+|.|+|+|||++|||||++|
T Consensus 88 ~~~~~~~~~~~~~~a~r~I~~GeEi~i~Y 116 (116)
T smart00317 88 LFVEVNGDSRIVIFALRDIKPGEELTIDY 116 (116)
T ss_pred EEEEECCCcEEEEEECCCcCCCCEEeecC
Confidence 88888777799999999999999999999
No 6
>KOG1082|consensus
Probab=99.81 E-value=2.9e-20 Score=209.02 Aligned_cols=128 Identities=29% Similarity=0.409 Sum_probs=104.0
Q ss_pred eeeEEeeccccCCCceEEEEEeEEEeHHHHhhhhhccccc----cceeee---------------------ecCCCcccc
Q psy13861 472 IAQVMMTKTCQQKNEFISEYCGEIISQDEADRRGKVYDKY----MCSFLF---------------------NLNNDFVVD 526 (1048)
Q Consensus 472 ~G~GLFAtrdI~KGEfI~EY~GEVIt~~Eae~R~~~yd~~----~~sYlf---------------------~ld~~~vID 526 (1048)
+||||+|.+.|++|+||+||.||+++..+++.+...+... ...+.+ .....+.||
T Consensus 186 kGwgvRs~~~I~~G~fvcEyaGe~~t~~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~id 265 (364)
T KOG1082|consen 186 KGWGVRTLDPIPAGEFVCEYAGEVLTSEEAQRRTHLREYLDDDCDAYSIADREWVDESPVGNTFVAPSLPGGPGRELLID 265 (364)
T ss_pred ceeeecccccccCCCeeEEEeeEecChHHhhhccccccccccccccchhhhccccccccccccccccccccCCCcceEEc
Confidence 9999999999999999999999999999998774322211 111111 113468999
Q ss_pred cccCCCccccccCCCCCCeeEEEEEEcC----eeEEEEEEccCCCCCCeEEEecCCCCC-----------CCcceeeccC
Q psy13861 527 ATRKGNKIRFANHSINPNCYAKVMMVNG----DHRIGIFAKRAILPGEELYFDYRYGPT-----------EQLKFVVTLD 591 (1048)
Q Consensus 527 At~~GN~ARFINHSC~PN~~v~~v~v~g----~~rI~ifA~RDI~aGEELT~DYg~~~~-----------~~~k~~C~Cg 591 (1048)
|...||++|||||||.||+.+..+..+. ..+|+|||+++|.+|+|||+|||..+. ...+..|.|+
T Consensus 266 a~~~GNv~RfinHSC~PN~~~~~v~~~~~~~~~~~i~ffa~~~I~p~~ELT~dYg~~~~~~~~~~~~~~~~~~~~~c~c~ 345 (364)
T KOG1082|consen 266 AKPHGNVARFINHSCSPNLLYQAVFQDEFVLLYLRIGFFALRDISPGEELTLDYGKAYKLLVQDGANIYTPVMKKNCNCG 345 (364)
T ss_pred hhhcccccccccCCCCccceeeeeeecCCccchheeeeeeccccCCCcccchhhcccccccccccccccccccchhhcCC
Confidence 9999999999999999999998887774 369999999999999999999997642 2356679999
Q ss_pred Cccccchh
Q psy13861 592 SNVANKYI 599 (1048)
Q Consensus 592 ~~~CRk~I 599 (1048)
...|++++
T Consensus 346 ~~~cr~~~ 353 (364)
T KOG1082|consen 346 LEKCRGLL 353 (364)
T ss_pred CHHhCccc
Confidence 99998876
No 7
>KOG1083|consensus
Probab=99.73 E-value=5.2e-19 Score=211.32 Aligned_cols=113 Identities=34% Similarity=0.545 Sum_probs=104.8
Q ss_pred cccceeeEEeeccccCCCceEEEEEeEEEeHHHHhhh-hhccccccceeeeecCCCcccccccCCCccccccCCCCCCee
Q psy13861 468 NYCAIAQVMMTKTCQQKNEFISEYCGEIISQDEADRR-GKVYDKYMCSFLFNLNNDFVVDATRKGNKIRFANHSINPNCY 546 (1048)
Q Consensus 468 g~~~~G~GLFAtrdI~KGEfI~EY~GEVIt~~Eae~R-~~~yd~~~~sYlf~ld~~~vIDAt~~GN~ARFINHSC~PN~~ 546 (1048)
..+..||||.|.++|++|+||+||+|+||+..+++.+ ...|......|+..+..+.+||+.++||.+|||||+|.|||+
T Consensus 1184 ~gp~~G~~v~tk~PikagtfI~EYvGeVit~ke~e~~mmtl~~~d~~~~cL~I~p~l~id~~R~~n~~RfinhscKPNc~ 1263 (1306)
T KOG1083|consen 1184 RGPKKGWGVRTKEPIKAGTFIMEYVGEVITEKEFEPRMMTLYHNDDDHYCLVIDPGLFIDIPRMGNGARFINHSCKPNCE 1263 (1306)
T ss_pred ccCCCCccccccccccccchHHHHHHHHHHHHhhcccccccCCCCCcccccccCccccCChhhccccccccccccCCCCc
Confidence 3467899999999999999999999999999998877 455666678899999999999999999999999999999999
Q ss_pred EEEEEEcCeeEEEEEEccCCCCCCeEEEecCCCC
Q psy13861 547 AKVMMVNGDHRIGIFAKRAILPGEELYFDYRYGP 580 (1048)
Q Consensus 547 v~~v~v~g~~rI~ifA~RDI~aGEELT~DYg~~~ 580 (1048)
++.|.++|..||++||+|||.+||||||||++..
T Consensus 1264 ~qkwSVNG~~Rv~L~A~rDi~kGEELtYDYN~ks 1297 (1306)
T KOG1083|consen 1264 MQKWSVNGEYRVGLFALRDLPKGEELTYDYNFKS 1297 (1306)
T ss_pred cccccccceeeeeeeecCCCCCCceEEEeccccc
Confidence 9999999999999999999999999999998743
No 8
>KOG1085|consensus
Probab=99.69 E-value=2.1e-17 Score=177.18 Aligned_cols=114 Identities=28% Similarity=0.454 Sum_probs=99.3
Q ss_pred ccceeeEEeeccccCCCceEEEEEeEEEeHHHHhhhhhccccc--cce--eee-ecCCCcccccccC-CCccccccCCCC
Q psy13861 469 YCAIAQVMMTKTCQQKNEFISEYCGEIISQDEADRRGKVYDKY--MCS--FLF-NLNNDFVVDATRK-GNKIRFANHSIN 542 (1048)
Q Consensus 469 ~~~~G~GLFAtrdI~KGEfI~EY~GEVIt~~Eae~R~~~yd~~--~~s--Ylf-~ld~~~vIDAt~~-GN~ARFINHSC~ 542 (1048)
+-++|+||+|+..+.+|+||.||.|.+|...++..|+..|... ... |+| ..+..++|||+.. +-++|.||||-.
T Consensus 263 ~dgKGRGv~a~~~F~rgdFVVEY~Gdliei~eAk~rE~~Ya~De~~GcYMYyF~h~sk~yCiDAT~et~~lGRLINHS~~ 342 (392)
T KOG1085|consen 263 KDGKGRGVRAKVNFERGDFVVEYRGDLIEISEAKVREEQYANDEEIGCYMYYFEHNSKKYCIDATKETPWLGRLINHSVR 342 (392)
T ss_pred eccccceeEeecccccCceEEEEecceeeechHHHHHHHhccCcccceEEEeeeccCeeeeeecccccccchhhhccccc
Confidence 3459999999999999999999999999999999999888653 222 334 3456899999986 567899999999
Q ss_pred CCeeEEEEEEcCeeEEEEEEccCCCCCCeEEEecCCCCCC
Q psy13861 543 PNCYAKVMMVNGDHRIGIFAKRAILPGEELYFDYRYGPTE 582 (1048)
Q Consensus 543 PN~~v~~v~v~g~~rI~ifA~RDI~aGEELT~DYg~~~~~ 582 (1048)
+||..+++.+++.+||.++|.|||.+||||+||||.....
T Consensus 343 gNl~TKvv~Idg~pHLiLvA~rdIa~GEELlYDYGDRSke 382 (392)
T KOG1085|consen 343 GNLKTKVVEIDGSPHLILVARRDIAQGEELLYDYGDRSKE 382 (392)
T ss_pred CcceeeEEEecCCceEEEEeccccccchhhhhhccccchh
Confidence 9999999999999999999999999999999999986443
No 9
>KOG4442|consensus
Probab=99.60 E-value=6e-16 Score=181.14 Aligned_cols=84 Identities=30% Similarity=0.529 Sum_probs=79.0
Q ss_pred CccccccccccCCCCCccCCCCcCcCCcccCcchhhhcceeeeeEeeecCCcceEEEeCCccCCCCeEEEeeceecCHHH
Q psy13861 954 QCPCYLAVRECDPDLCQTCGADQFDVSKISCKNVSVQRGLHKHLLMAPSDVAGWGIFLKDSAQKNEFISEYCGEIISQDE 1033 (1048)
Q Consensus 954 ~CpC~~a~rECdPdlC~~Cg~~~~d~~~~~C~Nr~lQrG~~k~L~V~kS~~kGwGlfa~e~I~kGeFI~EYvGEvIS~~E 1033 (1048)
.|.|+..+.||.+++|..||. .|+|+++|+.+..++.||.|+.+||||+|.++|++|+||+||+||||+.+|
T Consensus 87 ~CiNr~t~iECs~~~C~~cg~--------~C~NQRFQkkqyA~vevF~Te~KG~GLRA~~dI~~g~FI~EY~GEVI~~~E 158 (729)
T KOG4442|consen 87 DCINRMTSIECSDRECPRCGV--------YCKNQRFQKKQYAKVEVFLTEKKGCGLRAEEDIPKGQFILEYIGEVIEEKE 158 (729)
T ss_pred cccchhhhcccCCccCCCccc--------cccchhhhhhccCceeEEEecCcccceeeccccCCCcEEeeeccccccHHH
Confidence 367777899999999999987 699999999999999999999999999999999999999999999999999
Q ss_pred HHHHHhhhhhcC
Q psy13861 1034 ADRRGKVYDKYM 1045 (1048)
Q Consensus 1034 AdRRGkvYDk~~ 1045 (1048)
+++|.+.|++.+
T Consensus 159 f~kR~~~Y~~d~ 170 (729)
T KOG4442|consen 159 FEKRVKRYAKDG 170 (729)
T ss_pred HHHHHHHHHhcC
Confidence 999999999744
No 10
>PF00856 SET: SET domain; InterPro: IPR001214 The SET domain appears generally as one part of a larger multidomain protein, and recently there were described three structures of very different proteins with distinct domain compositions: Neurospora crassa DIM-5, a member of the Su(var) family of HKMTs which methylate histone H3 on lysine 9,human SET7 (also called SET9), which methylates H3 on lysine 4 and garden pea Rubisco LSMT, an enzyme that does not modify histones, but instead methylates lysine 14 in the flexible tail of the large subunit of the enzyme Rubisco. The SET domain itself turned out to be an uncommon structure. Although in all three studies, electron density maps revealed the location of the AdoMet or AdoHcy cofactor, the SET domain bears no similarity at all to the canonical/AdoMet-dependent methyltransferase fold. Strictly conserved in the C-terminal motif of the SET domain tyrosine could be involved in abstracting a proton from the protonated amino group of the substrate lysine, promoting its nucleophilic attack on the sulphonium methyl group of the AdoMet cofactor. In contrast to the AdoMet-dependent protein methyltranferases of the classical type, which tend to bind their polypeptide substrates on top of the cofactor, it is noted from the Rubisco LSMT structure that the AdoMet seems to bind in a separate cleft, suggesting how a polypeptide substrate could be subjected to multiple rounds of methylation without having to be released from the enzyme. In contrast, SET7/9 is able to add only a single methyl group to its substrate. It has been demonstrated that association of SET domain and myotubularin-related proteins modulates growth control []. The SET domain-containing Drosophila melanogaster (Fruit fly) protein, enhancer of zeste, has a function in segment determination and the mammalian homologue may be involved in the regulation of gene transcription and chromatin structure. Histone lysine methylation is part of the histone code that regulated chromatin function and epigenetic control of gene function. Histone lysine methyltransferases (HMTase) differ both in their substrate specificity for the various acceptor lysines as well as in their product specificity for the number of methyl groups (one, two, or three) they transfer. With just one exception [], the HMTases belong to SET family that can be classified according to the sequences surrounding the SET domain [, ]. Structural studies on the human SET7/9, a mono-methylase, have revealed the molecular basis for the specificity of the enzyme for the histone-target and the roles of the invariant residues in the SET domain in determining the methylation specificities []. The pre-SET domain, as found in the SUV39 SET family, contains nine invariant cysteine residues that are grouped into two segments separated by a region of variable length. These 9 cysteines coordinate 3 zinc ions to form to form a triangular cluster, where each of the zinc ions is coordinated by 4 four cysteines to give a tetrahedral configuration. The function of this domain is structural, holding together 2 long segments of random coils. The C-terminal region including the post-SET domain is disordered when not interacting with a histone tail and in the absence of zinc. The three conserved cysteines in the post-SET domain form a zinc-binding site when coupled to a fourth conserved cysteine in the knot-like structure close to the SET domain active site []. The structured post-SET region brings in the C-terminal residues that participate in S-adenosylmethine-binding and histone tail interactions. The three conserved cysteine residues are essential for HMTase activity, as replacement with serine abolishes HMTase activity [], []. ; GO: 0005515 protein binding; PDB: 3TG5_A 3S7F_A 3RIB_B 3TG4_A 3S7J_A 3S7D_A 3S7B_A 3H6L_A 3SMT_A 3K5K_A ....
Probab=99.49 E-value=4.7e-14 Score=134.26 Aligned_cols=105 Identities=19% Similarity=0.159 Sum_probs=73.9
Q ss_pred eeEEeeccccCCCceEEEEEeEEEeHHHHhhh-------------------h--------------------------hc
Q psy13861 473 AQVMMTKTCQQKNEFISEYCGEIISQDEADRR-------------------G--------------------------KV 507 (1048)
Q Consensus 473 G~GLFAtrdI~KGEfI~EY~GEVIt~~Eae~R-------------------~--------------------------~~ 507 (1048)
|+||||+++|++|++|++..+.+++....... . ..
T Consensus 1 GrGl~At~dI~~Ge~I~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 80 (162)
T PF00856_consen 1 GRGLFATRDIKAGEVILIPRPAILTPDEVSPQPELLRLQLSKALEEQSRSDFSIQKKQKAEKSERSPQLESLHSISLRSE 80 (162)
T ss_dssp SEEEEESS-B-TTEEEEEESEEEEEHHHHHCHHHHSHHTTCSSSCSHHTTHHHHHHHHHHHHHHHHHHHHHHHHHCHTTT
T ss_pred CEEEEECccCCCCCEEEEECcceEEehhhhhcccchhhhhhhhhcccccccccccccccccccccccccccccccccccc
Confidence 89999999999999999999999987766431 0 00
Q ss_pred cc------------cccceeeeecCCCcccccccCCCccccccCCCCCCeeEEEEEEcCeeEEEEEEccCCCCCCeEEEe
Q psy13861 508 YD------------KYMCSFLFNLNNDFVVDATRKGNKIRFANHSINPNCYAKVMMVNGDHRIGIFAKRAILPGEELYFD 575 (1048)
Q Consensus 508 yd------------~~~~sYlf~ld~~~vIDAt~~GN~ARFINHSC~PN~~v~~v~v~g~~rI~ifA~RDI~aGEELT~D 575 (1048)
.. ................++.....++.|+||||.|||.+..........+.|+|.|||++|||||++
T Consensus 81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~p~~d~~NHsc~pn~~~~~~~~~~~~~~~~~a~r~I~~GeEi~is 160 (162)
T PF00856_consen 81 LQFSQAFQWSWFISWTRSDFSSRSFSEDDRDGIALYPFADMLNHSCDPNCEVSFDFDGDGGCLVVRATRDIKKGEEIFIS 160 (162)
T ss_dssp CCTCCHHHHHHHHHHHHHEEEEEEETTEEEEEEEEETGGGGSEEESSTSEEEEEEEETTTTEEEEEESS-B-TTSBEEEE
T ss_pred ccccccccchhhccccceeeeccccccccccccccCcHhHheccccccccceeeEeecccceEEEEECCccCCCCEEEEE
Confidence 00 000011111122344566666788999999999999988766667889999999999999999999
Q ss_pred cC
Q psy13861 576 YR 577 (1048)
Q Consensus 576 Yg 577 (1048)
||
T Consensus 161 YG 162 (162)
T PF00856_consen 161 YG 162 (162)
T ss_dssp ST
T ss_pred EC
Confidence 97
No 11
>COG2940 Proteins containing SET domain [General function prediction only]
Probab=99.47 E-value=1.1e-14 Score=169.38 Aligned_cols=131 Identities=34% Similarity=0.502 Sum_probs=108.3
Q ss_pred ccceeeEEeeccccCCCceEEEEEeEEEeHHHHhhhhhccccccceeeee-cCC-CcccccccCCCccccccCCCCCCee
Q psy13861 469 YCAIAQVMMTKTCQQKNEFISEYCGEIISQDEADRRGKVYDKYMCSFLFN-LNN-DFVVDATRKGNKIRFANHSINPNCY 546 (1048)
Q Consensus 469 ~~~~G~GLFAtrdI~KGEfI~EY~GEVIt~~Eae~R~~~yd~~~~sYlf~-ld~-~~vIDAt~~GN~ARFINHSC~PN~~ 546 (1048)
....|+||||.+.|++|++|++|.|+++...++..+...+......+.|. +.. ..++|+...|+.+|||||||.||+.
T Consensus 339 ~~~~~~g~fa~~~i~~~e~i~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~g~~~r~~nHS~~pN~~ 418 (480)
T COG2940 339 SEIKGYGVFALESIKKGEFIIEYHGEIIRRKEAREREENYDLLGNEFSFGLLEDKDKVRDSQKAGDVARFINHSCTPNCE 418 (480)
T ss_pred hcccccceeehhhccchHHHHHhcCcccchHHHHhhhccccccccccchhhccccchhhhhhhcccccceeecCCCCCcc
Confidence 34589999999999999999999999999999988877775443333332 222 7899999999999999999999999
Q ss_pred EEEEEEcCeeEEEEEEccCCCCCCeEEEecCCCCCCC--------cceeeccCCccccchh
Q psy13861 547 AKVMMVNGDHRIGIFAKRAILPGEELYFDYRYGPTEQ--------LKFVVTLDSNVANKYI 599 (1048)
Q Consensus 547 v~~v~v~g~~rI~ifA~RDI~aGEELT~DYg~~~~~~--------~k~~C~Cg~~~CRk~I 599 (1048)
+....+.|..++.++|+|||.+||||++||+...+.. ..+.|.|++..|+..+
T Consensus 419 ~~~~~~~g~~~~~~~~~rDI~~geEl~~dy~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 479 (480)
T COG2940 419 ASPIEVNGIFKISIYAIRDIKAGEELTYDYGPSLEDNRELKKLLEKRWGCACGEDRCSHTM 479 (480)
T ss_pred eecccccccceeeecccccchhhhhhccccccccccchhhhhhhhhhhccccCCCccCCCC
Confidence 8877666667899999999999999999998766553 3567899888887654
No 12
>KOG1141|consensus
Probab=99.46 E-value=2.5e-14 Score=168.25 Aligned_cols=77 Identities=26% Similarity=0.500 Sum_probs=69.6
Q ss_pred cccccccCCCccccccCCCCCCeeEEEEEEcCe----eEEEEEEccCCCCCCeEEEecCCCCCC--CcceeeccCCcccc
Q psy13861 523 FVVDATRKGNKIRFANHSINPNCYAKVMMVNGD----HRIGIFAKRAILPGEELYFDYRYGPTE--QLKFVVTLDSNVAN 596 (1048)
Q Consensus 523 ~vIDAt~~GN~ARFINHSC~PN~~v~~v~v~g~----~rI~ifA~RDI~aGEELT~DYg~~~~~--~~k~~C~Cg~~~CR 596 (1048)
++|||...||++||+||||.||+.++.|+++.+ +-|+|||.+=|++|.||||||+|.... .....|.||..+||
T Consensus 1179 yvIDAk~eGNlGRfLNHSC~PNl~VQnVfvdTHdlrfPwVAFFt~kyVkAgtELTWDY~Ye~g~v~~keL~C~CGa~~Cr 1258 (1262)
T KOG1141|consen 1179 YVIDAKQEGNLGRFLNHSCDPNLHVQNVFVDTHDLRFPWVAFFTRKYVKAGTELTWDYQYEQGQVATKELTCHCGAENCR 1258 (1262)
T ss_pred EEEecccccchhhhhccCCCccceeeeeeeeccccCCchhhhhhhhhhccCceeeeeccccccccccceEEEecChhhhh
Confidence 789999999999999999999999999999864 568999999999999999999986443 45678999999999
Q ss_pred chh
Q psy13861 597 KYI 599 (1048)
Q Consensus 597 k~I 599 (1048)
+.|
T Consensus 1259 grL 1261 (1262)
T KOG1141|consen 1259 GRL 1261 (1262)
T ss_pred ccc
Confidence 976
No 13
>KOG1082|consensus
Probab=99.34 E-value=9.4e-13 Score=148.75 Aligned_cols=72 Identities=38% Similarity=0.718 Sum_probs=63.5
Q ss_pred cccccCCCCCccCCCCcCcCCcccCcchhhhcceeeeeEeeecCCcceEEEeCCccCCCCeEEEeeceecCHHHHHHHHh
Q psy13861 960 AVRECDPDLCQTCGADQFDVSKISCKNVSVQRGLHKHLLMAPSDVAGWGIFLKDSAQKNEFISEYCGEIISQDEADRRGK 1039 (1048)
Q Consensus 960 a~rECdPdlC~~Cg~~~~d~~~~~C~Nr~lQrG~~k~L~V~kS~~kGwGlfa~e~I~kGeFI~EYvGEvIS~~EAdRRGk 1039 (1048)
...||.+ .|+|. ..|.||++|+|.+.+|+|+++..+||||++++.|++|+||+||+||+++.+|+++|-.
T Consensus 151 ~i~EC~~----~C~C~------~~C~nRv~q~g~~~~leIfrt~~kGwgvRs~~~I~~G~fvcEyaGe~~t~~e~~~~~~ 220 (364)
T KOG1082|consen 151 PVFECSV----ACGCH------PDCANRVVQKGLQFHLEVFRTPEKGWGVRTLDPIPAGEFVCEYAGEVLTSEEAQRRTH 220 (364)
T ss_pred ccccccc----CCCCC------CcCcchhhccccccceEEEecCCceeeecccccccCCCeeEEEeeEecChHHhhhccc
Confidence 5677863 35553 5899999999999999999999999999999999999999999999999999998854
Q ss_pred hh
Q psy13861 1040 VY 1041 (1048)
Q Consensus 1040 vY 1041 (1048)
.|
T Consensus 221 ~~ 222 (364)
T KOG1082|consen 221 LR 222 (364)
T ss_pred cc
Confidence 43
No 14
>KOG2589|consensus
Probab=98.80 E-value=5.5e-09 Score=116.37 Aligned_cols=125 Identities=19% Similarity=0.146 Sum_probs=87.6
Q ss_pred ceeeEEeeccccCCCceEEEEEeEEEeHHHHhhhhhcc-ccccceeeeecCCCcccccccCCCccccccCCCCCCeeEEE
Q psy13861 471 AIAQVMMTKTCQQKNEFISEYCGEIISQDEADRRGKVY-DKYMCSFLFNLNNDFVVDATRKGNKIRFANHSINPNCYAKV 549 (1048)
Q Consensus 471 ~~G~GLFAtrdI~KGEfI~EY~GEVIt~~Eae~R~~~y-d~~~~sYlf~ld~~~vIDAt~~GN~ARFINHSC~PN~~v~~ 549 (1048)
-.|--|+|++.+.+|+-|--.+|-|+.-.+.+++...- .....+.+|.-... .|...-+.|+||||-|.|||+|..
T Consensus 136 ~~gAkivst~~w~~ndkIe~LvGcIaeLse~eE~~ll~~g~nDFSvmyStRk~---caqLwLGPaafINHDCrpnCkFvs 212 (453)
T KOG2589|consen 136 QNGAKIVSTKSWSRNDKIELLVGCIAELSEAEERSLLRGGGNDFSVMYSTRKR---CAQLWLGPAAFINHDCRPNCKFVS 212 (453)
T ss_pred CCCceEEeeccccCCccHHHhhhhhhhcChhhhHHHHhccCCceeeeeecccc---hhhheeccHHhhcCCCCCCceeec
Confidence 35666899999999999999999997766666652211 11222333322221 122333789999999999999853
Q ss_pred EEEcCeeEEEEEEccCCCCCCeEEEecCCCCCCCcceeeccCCccccchhhhcc
Q psy13861 550 MMVNGDHRIGIFAKRAILPGEELYFDYRYGPTEQLKFVVTLDSNVANKYIYEWD 603 (1048)
Q Consensus 550 v~v~g~~rI~ifA~RDI~aGEELT~DYg~~~~~~~k~~C~Cg~~~CRk~I~ewd 603 (1048)
.|...+.|-++|||.+|||||.-||.+++...+..|.| ..|-..-...+
T Consensus 213 ---~g~~tacvkvlRDIePGeEITcFYgs~fFG~~N~~CeC--~TCER~g~gaF 261 (453)
T KOG2589|consen 213 ---TGRDTACVKVLRDIEPGEEITCFYGSGFFGENNEECEC--VTCERRGTGAF 261 (453)
T ss_pred ---CCCceeeeehhhcCCCCceeEEeecccccCCCCceeEE--eecccccccch
Confidence 35578999999999999999999999888776655544 56655444433
No 15
>KOG1081|consensus
Probab=98.43 E-value=5.4e-08 Score=113.74 Aligned_cols=115 Identities=31% Similarity=0.421 Sum_probs=88.2
Q ss_pred eeccccCCCceEEEEEeEEEeHHHHhhhhhcccc--ccceeeeecCCCcccccccCCCccccccCCCCCCeeEEEEEEcC
Q psy13861 477 MTKTCQQKNEFISEYCGEIISQDEADRRGKVYDK--YMCSFLFNLNNDFVVDATRKGNKIRFANHSINPNCYAKVMMVNG 554 (1048)
Q Consensus 477 FAtrdI~KGEfI~EY~GEVIt~~Eae~R~~~yd~--~~~sYlf~ld~~~vIDAt~~GN~ARFINHSC~PN~~v~~v~v~g 554 (1048)
+|.++|.+| +|+++...+...+...-.. ....|+..+..+..||+...||..||+||||.||+.-..+.+.+
T Consensus 319 ~~~~~~~k~------vg~~i~~~e~~~~~~~~~~~~~~~~~~~~~e~~~~id~~~~~n~sr~~nh~~~~~v~~~k~~~~~ 392 (463)
T KOG1081|consen 319 TAKADIRKG------VGEVIDDKECKARLQRVKESDLVDFYMVFIQKDRIIDAGPKGNYSRFLNHSCQPNVETEKWQVIG 392 (463)
T ss_pred hhHHhhhcc------cCcccchhhheeehhhhhccchhhhhhhhhhcccccccccccchhhhhcccCCCceeechhheec
Confidence 677788888 8999988887665432221 22334334444459999999999999999999999999999999
Q ss_pred eeEEEEEEccCCCCCCeEEEecCCCCCCCcceeeccCCccccch
Q psy13861 555 DHRIGIFAKRAILPGEELYFDYRYGPTEQLKFVVTLDSNVANKY 598 (1048)
Q Consensus 555 ~~rI~ifA~RDI~aGEELT~DYg~~~~~~~k~~C~Cg~~~CRk~ 598 (1048)
..++.++|.+.|.+|||||++|........ ..|.|++..|...
T Consensus 393 ~t~~~~~a~~~i~~g~e~t~~~n~~~~~~~-~~~~~~~e~~~~~ 435 (463)
T KOG1081|consen 393 DTRVGLFAPRQIEAGEELTFNYNGNCEGNE-KRCCCGSENCTET 435 (463)
T ss_pred ccccccccccccccchhhhheeeccccCCc-ceEeecccccccC
Confidence 999999999999999999999988644433 3455665566543
No 16
>KOG1080|consensus
Probab=98.43 E-value=1.2e-07 Score=118.18 Aligned_cols=55 Identities=27% Similarity=0.564 Sum_probs=50.9
Q ss_pred eeeeEeeecCCcceEEEeCCccCCCCeEEEeeceecCHHHHHHHHhhhhhcC--CCC
Q psy13861 994 HKHLLMAPSDVAGWGIFLKDSAQKNEFISEYCGEIISQDEADRRGKVYDKYM--CSF 1048 (1048)
Q Consensus 994 ~k~L~V~kS~~kGwGlfa~e~I~kGeFI~EYvGEvIS~~EAdRRGkvYDk~~--~Sy 1048 (1048)
.++|.+++|.+|||||||.++|.+||||+||+||+|.+-.||.|++.|.+.| .||
T Consensus 865 kk~~~F~~s~iH~wglfa~~~i~~~dmViEY~Ge~vR~~iad~RE~~Y~~~gi~~sY 921 (1005)
T KOG1080|consen 865 KKYVKFGRSGIHGWGLFAMENIAAGDMVIEYRGELVRSSIADLREARYERMGIGDSY 921 (1005)
T ss_pred hhhhccccccccccceeeccCccccceEEEeeceehhhhHHHHHHHHHhccCcccce
Confidence 5558999999999999999999999999999999999999999999999855 565
No 17
>KOG1141|consensus
Probab=98.22 E-value=3.6e-07 Score=109.38 Aligned_cols=58 Identities=24% Similarity=0.344 Sum_probs=54.8
Q ss_pred ccCcchhhhcceeeeeEeeecCCcceEEEeCCccCCCCeEEEeeceecCHHHHHHHHh
Q psy13861 982 ISCKNVSVQRGLHKHLLMAPSDVAGWGIFLKDSAQKNEFISEYCGEIISQDEADRRGK 1039 (1048)
Q Consensus 982 ~~C~Nr~lQrG~~k~L~V~kS~~kGwGlfa~e~I~kGeFI~EYvGEvIS~~EAdRRGk 1039 (1048)
.-|.||.+|.|.+.+|.++++..+|||++...+|.+|.||+-|.|-+++++-+|+-|-
T Consensus 786 ~~C~nrmvqhg~qvRlq~fkt~~kGWg~rclddi~~g~fVciy~g~~l~~~~sdks~~ 843 (1262)
T KOG1141|consen 786 PDCLNRMVQHGYQVRLQRFKTIHKGWGRRCLDDITGGNFVCIYPGGALLHQISDKSEY 843 (1262)
T ss_pred HHHHHHHhhcCceeEeeeccccccccceEeeeecCCceEEEEecchhhhhhhchhhhh
Confidence 4799999999999999999999999999999999999999999999999998887653
No 18
>KOG1083|consensus
Probab=97.70 E-value=5.7e-06 Score=101.75 Aligned_cols=55 Identities=35% Similarity=0.596 Sum_probs=51.3
Q ss_pred cCcchhhhc-ceeeeeEeeecCCcceEEEeCCccCCCCeEEEeeceecCHHHHHHH
Q psy13861 983 SCKNVSVQR-GLHKHLLMAPSDVAGWGIFLKDSAQKNEFISEYCGEIISQDEADRR 1037 (1048)
Q Consensus 983 ~C~Nr~lQr-G~~k~L~V~kS~~kGwGlfa~e~I~kGeFI~EYvGEvIS~~EAdRR 1037 (1048)
.|.|+.+|+ +....|.|+.....||||.++++|+.|+||+||+||||+.+|++.|
T Consensus 1165 ~c~nqrm~r~e~cp~L~v~~gp~~G~~v~tk~PikagtfI~EYvGeVit~ke~e~~ 1220 (1306)
T KOG1083|consen 1165 SCSNQRMQRHEECPPLEVFRGPKKGWGVRTKEPIKAGTFIMEYVGEVITEKEFEPR 1220 (1306)
T ss_pred hhhhHHhhhhccCCCcceeccCCCCccccccccccccchHHHHHHHHHHHHhhccc
Confidence 488999988 6788899999999999999999999999999999999999998877
No 19
>KOG2461|consensus
Probab=97.63 E-value=3.3e-05 Score=89.16 Aligned_cols=112 Identities=21% Similarity=0.166 Sum_probs=78.9
Q ss_pred eccccceeeEEeeccccCCCceEEEEEeEEEeHHHHhhhhhccccccceeeeecC-CCcccccccC--CCccccccCCCC
Q psy13861 466 YNNYCAIAQVMMTKTCQQKNEFISEYCGEIISQDEADRRGKVYDKYMCSFLFNLN-NDFVVDATRK--GNKIRFANHSIN 542 (1048)
Q Consensus 466 ~~g~~~~G~GLFAtrdI~KGEfI~EY~GEVIt~~Eae~R~~~yd~~~~sYlf~ld-~~~vIDAt~~--GN~ARFINHSC~ 542 (1048)
.......|.||++...|.+|+-.+=|.|+++... .....+....-++|.-+ .-.+||++.. .|+.||||=+++
T Consensus 34 ~Ssv~~~~lgV~s~~~i~~G~~FGP~~G~~~~~~----~~~~~n~~y~W~I~~~d~~~~~iDg~d~~~sNWmRYV~~Ar~ 109 (396)
T KOG2461|consen 34 PSSVPVTGLGVWSNASILPGTSFGPFEGEIIASI----DSKSANNRYMWEIFSSDNGYEYIDGTDEEHSNWMRYVNSARS 109 (396)
T ss_pred ccccCCccccccccccccCcccccCccCcccccc----ccccccCcceEEEEeCCCceEEeccCChhhcceeeeecccCC
Confidence 3445667789999999999999999999982111 11111111222344433 3478999865 699999998875
Q ss_pred C---CeeEEEEEEcCeeEEEEEEccCCCCCCeEEEecCCCCCCCcc
Q psy13861 543 P---NCYAKVMMVNGDHRIGIFAKRAILPGEELYFDYRYGPTEQLK 585 (1048)
Q Consensus 543 P---N~~v~~v~v~g~~rI~ifA~RDI~aGEELT~DYg~~~~~~~k 585 (1048)
. |+.+ +.....|++.|+|+|.+||||.|.|+.++...+.
T Consensus 110 ~eeQNL~A----~Q~~~~Ifyrt~r~I~p~eELlVWY~~e~~~~L~ 151 (396)
T KOG2461|consen 110 EEEQNLLA----FQIGENIFYRTIRDIRPNEELLVWYGSEYAEELA 151 (396)
T ss_pred hhhhhHHH----HhccCceEEEecccCCCCCeEEEEeccchHhHhc
Confidence 4 6543 2345689999999999999999999987655443
No 20
>smart00317 SET SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain. Putative methyl transferase, based on outlier plant homologues
Probab=97.38 E-value=0.00034 Score=64.40 Aligned_cols=49 Identities=43% Similarity=0.697 Sum_probs=42.4
Q ss_pred eEeeecCCcceEEEeCCccCCCCeEEEeeceecCHHHHHHHHhhhhhcC
Q psy13861 997 LLMAPSDVAGWGIFLKDSAQKNEFISEYCGEIISQDEADRRGKVYDKYM 1045 (1048)
Q Consensus 997 L~V~kS~~kGwGlfa~e~I~kGeFI~EYvGEvIS~~EAdRRGkvYDk~~ 1045 (1048)
+.+..+..+|+||||..+|++|++|+||+|.++...++..+...|+..+
T Consensus 2 ~~~~~~~~~G~gl~a~~~i~~g~~i~~~~g~~~~~~~~~~~~~~~~~~~ 50 (116)
T smart00317 2 LEVFKSPGKGWGVRATEDIPKGEFIGEYVGEIITSEEAEERSKAYDTDG 50 (116)
T ss_pred cEEEecCCCcEEEEECCccCCCCEEEEEEeEEECHHHHHHHHHHHHhcC
Confidence 4567777899999999999999999999999999999998876565443
No 21
>KOG1085|consensus
Probab=97.32 E-value=0.00019 Score=79.01 Aligned_cols=55 Identities=31% Similarity=0.497 Sum_probs=49.2
Q ss_pred hhhcceeeeeEeeecCCcceEEEeCCccCCCCeEEEeeceecCHHHHHHHHhhhh
Q psy13861 988 SVQRGLHKHLLMAPSDVAGWGIFLKDSAQKNEFISEYCGEIISQDEADRRGKVYD 1042 (1048)
Q Consensus 988 ~lQrG~~k~L~V~kS~~kGwGlfa~e~I~kGeFI~EYvGEvIS~~EAdRRGkvYD 1042 (1048)
.|..|....|.+..-.++|.||.|...+..|+||.||.|.+|+..||..|++.|-
T Consensus 249 ~vl~g~~egl~~~~~dgKGRGv~a~~~F~rgdFVVEY~Gdliei~eAk~rE~~Ya 303 (392)
T KOG1085|consen 249 TVLKGTNEGLLEVYKDGKGRGVRAKVNFERGDFVVEYRGDLIEISEAKVREEQYA 303 (392)
T ss_pred HHHhccccceeEEeeccccceeEeecccccCceEEEEecceeeechHHHHHHHhc
Confidence 3455667778887788899999999999999999999999999999999999995
No 22
>COG2940 Proteins containing SET domain [General function prediction only]
Probab=95.24 E-value=0.0047 Score=73.10 Aligned_cols=64 Identities=33% Similarity=0.482 Sum_probs=54.7
Q ss_pred cCcchhhhcceeeeeEeeecCCcceEEEeCCccCCCCeEEEeeceecCHHHHHHHHhhhhhcCC
Q psy13861 983 SCKNVSVQRGLHKHLLMAPSDVAGWGIFLKDSAQKNEFISEYCGEIISQDEADRRGKVYDKYMC 1046 (1048)
Q Consensus 983 ~C~Nr~lQrG~~k~L~V~kS~~kGwGlfa~e~I~kGeFI~EYvGEvIS~~EAdRRGkvYDk~~~ 1046 (1048)
.+.|...+........+..+...|||+||.+.|++|+||.||.|++|.+.++..|...|+..+.
T Consensus 320 ~~~~~~~~~~~~~~~~~~~~~~~~~g~fa~~~i~~~e~i~~~~~~~~~~~~~~~~~~~~~~~~~ 383 (480)
T COG2940 320 ELLNSNGCKKRREPNVVQESEIKGYGVFALESIKKGEFIIEYHGEIIRRKEAREREENYDLLGN 383 (480)
T ss_pred chhhhcccccccchhhhhhhcccccceeehhhccchHHHHHhcCcccchHHHHhhhcccccccc
Confidence 4555555666677778888999999999999999999999999999999999999998866554
No 23
>smart00570 AWS associated with SET domains. subdomain of PRESET
Probab=92.88 E-value=0.042 Score=47.14 Aligned_cols=35 Identities=29% Similarity=0.706 Sum_probs=27.8
Q ss_pred CCCCCCCccccccccccCCCCCccCCCCcCcCCcccCcchhhhcce
Q psy13861 948 AQCNTKQCPCYLAVRECDPDLCQTCGADQFDVSKISCKNVSVQRGL 993 (1048)
Q Consensus 948 ~~C~tk~CpC~~a~rECdPdlC~~Cg~~~~d~~~~~C~Nr~lQrG~ 993 (1048)
..|+ ..|.++++..|| |..|. || ..|+|+.+|+..
T Consensus 16 ~~Cg-sdClNR~l~~EC-~~~C~-~G--------~~C~NqrFqk~~ 50 (51)
T smart00570 16 GACG-SDCLNRMLLIEC-SSDCP-CG--------SYCSNQRFQKRQ 50 (51)
T ss_pred CCcc-hHHHHHHHhhhc-CCCCC-CC--------cCccCcccccCc
Confidence 3687 569999999999 67665 54 479999999864
No 24
>cd00167 SANT 'SWI3, ADA2, N-CoR and TFIIIB' DNA-binding domains. Tandem copies of the domain bind telomeric DNA tandem repeatsas part of the capping complex. Binding is sequence dependent for repeats which contain the G/C rich motif [C2-3 A (CA)1-6]. The domain is also found in regulatory transcriptional repressor complexes where it also binds DNA.
Probab=92.65 E-value=0.3 Score=38.00 Aligned_cols=42 Identities=24% Similarity=0.265 Sum_probs=38.3
Q ss_pred CCCHHHHHHHHHHHHhcC-CCchHHHhhcCCCChHHHHHHHHH
Q psy13861 814 EWTGSDQSLFRAIHKVLY-NNYCAIAQVMMTKTCQQVYQFAQK 855 (1048)
Q Consensus 814 ~Wt~~E~sL~r~l~~~~~-~N~C~IA~~lg~KTC~EV~~~~~~ 855 (1048)
.||..|..+|..++..++ .+...||+.++.||-.+|-.+...
T Consensus 1 ~Wt~eE~~~l~~~~~~~g~~~w~~Ia~~~~~rs~~~~~~~~~~ 43 (45)
T cd00167 1 PWTEEEDELLLEAVKKYGKNNWEKIAKELPGRTPKQCRERWRN 43 (45)
T ss_pred CCCHHHHHHHHHHHHHHCcCCHHHHHhHcCCCCHHHHHHHHHH
Confidence 599999999999999999 899999999999999999877653
No 25
>smart00717 SANT SANT SWI3, ADA2, N-CoR and TFIIIB'' DNA-binding domains.
Probab=92.38 E-value=0.35 Score=38.12 Aligned_cols=43 Identities=26% Similarity=0.343 Sum_probs=39.4
Q ss_pred CCCCHHHHHHHHHHHHhcC-CCchHHHhhcCCCChHHHHHHHHH
Q psy13861 813 NEWTGSDQSLFRAIHKVLY-NNYCAIAQVMMTKTCQQVYQFAQK 855 (1048)
Q Consensus 813 ~~Wt~~E~sL~r~l~~~~~-~N~C~IA~~lg~KTC~EV~~~~~~ 855 (1048)
..||+.|..+|..++..++ .++-.||..++.+|-.++..+...
T Consensus 2 ~~Wt~~E~~~l~~~~~~~g~~~w~~Ia~~~~~rt~~~~~~~~~~ 45 (49)
T smart00717 2 GEWTEEEDELLIELVKKYGKNNWEKIAKELPGRTAEQCRERWNN 45 (49)
T ss_pred CCCCHHHHHHHHHHHHHHCcCCHHHHHHHcCCCCHHHHHHHHHH
Confidence 4799999999999999999 999999999999999999887654
No 26
>PF03638 TCR: Tesmin/TSO1-like CXC domain, cysteine-rich domain; InterPro: IPR005172 This entry includes proteins that have two copies of a cysteine rich motif as follows: C-X-C-X4-C-X3-YC-X-C-X6-C-X3-C-X-C-X2-C. The family includes Tesmin Q9Y4I5 from SWISSPROT [] and TSO1 Q9LE32 from SWISSPROT []. This group of proteins is called a CXC domain in [].
Probab=79.61 E-value=1.2 Score=37.09 Aligned_cols=28 Identities=39% Similarity=1.029 Sum_probs=24.7
Q ss_pred cCCCCCcC-CCCCCCCccccccccccCCC
Q psy13861 940 RFPGCRCK-AQCNTKQCPCYLAVRECDPD 967 (1048)
Q Consensus 940 RFpGC~Ck-~~C~tk~CpC~~a~rECdPd 967 (1048)
...||.|+ +.|...-|.||++++.|++.
T Consensus 2 ~~~gC~Ckks~Clk~YC~Cf~~g~~C~~~ 30 (42)
T PF03638_consen 2 KKKGCNCKKSKCLKLYCECFQAGRFCTPN 30 (42)
T ss_pred CCCCCcccCcChhhhhCHHHHCcCcCCCC
Confidence 46799995 89999999999999999864
No 27
>PF00249 Myb_DNA-binding: Myb-like DNA-binding domain; InterPro: IPR014778 The retroviral oncogene v-myb, and its cellular counterpart c-myb, encode nuclear DNA-binding proteins. These belong to the SANT domain family that specifically recognise the sequence YAAC(G/T)G [, ]. In myb, one of the most conserved regions consisting of three tandem repeats has been shown to be involved in DNA-binding [].; PDB: 1X41_A 2XAF_B 2XAG_B 2XAH_B 2UXN_B 2Y48_B 2XAQ_B 2X0L_B 2IW5_B 2XAJ_B ....
Probab=77.20 E-value=5.7 Score=32.77 Aligned_cols=43 Identities=21% Similarity=0.228 Sum_probs=36.4
Q ss_pred CCCCHHHHHHHHHHHHhcCCC-chHHHhhcC-CCChHHHHHHHHH
Q psy13861 813 NEWTGSDQSLFRAIHKVLYNN-YCAIAQVMM-TKTCQQVYQFAQK 855 (1048)
Q Consensus 813 ~~Wt~~E~sL~r~l~~~~~~N-~C~IA~~lg-~KTC~EV~~~~~~ 855 (1048)
..||..|..+|..++..++.+ .=.||..++ ++|=.++-.+..+
T Consensus 2 ~~Wt~eE~~~l~~~v~~~g~~~W~~Ia~~~~~~Rt~~qc~~~~~~ 46 (48)
T PF00249_consen 2 GPWTEEEDEKLLEAVKKYGKDNWKKIAKRMPGGRTAKQCRSRYQN 46 (48)
T ss_dssp -SS-HHHHHHHHHHHHHSTTTHHHHHHHHHSSSSTHHHHHHHHHH
T ss_pred CCCCHHHHHHHHHHHHHhCCcHHHHHHHHcCCCCCHHHHHHHHHh
Confidence 379999999999999999988 999999998 8998888766554
No 28
>PF13921 Myb_DNA-bind_6: Myb-like DNA-binding domain; PDB: 1A5J_A 1MBH_A 1GV5_A 1H89_C 1IDY_A 1MBK_A 1IDZ_A 1H88_C 1GVD_A 1MBG_A ....
Probab=75.21 E-value=6.3 Score=33.63 Aligned_cols=41 Identities=22% Similarity=0.288 Sum_probs=33.9
Q ss_pred CCHHHHHHHHHHHHhcCCCchHHHhhcCCCChHHHHHHHHH
Q psy13861 815 WTGSDQSLFRAIHKVLYNNYCAIAQVMMTKTCQQVYQFAQK 855 (1048)
Q Consensus 815 Wt~~E~sL~r~l~~~~~~N~C~IA~~lg~KTC~EV~~~~~~ 855 (1048)
||..|..+|..++..|+.+...||..++.+|=.+|......
T Consensus 1 WT~eEd~~L~~~~~~~g~~W~~Ia~~l~~Rt~~~~~~r~~~ 41 (60)
T PF13921_consen 1 WTKEEDELLLELVKKYGNDWKKIAEHLGNRTPKQCRNRWRN 41 (60)
T ss_dssp S-HHHHHHHHHHHHHHTS-HHHHHHHSTTS-HHHHHHHHHH
T ss_pred CCHHHHHHHHHHHHHHCcCHHHHHHHHCcCCHHHHHHHHHH
Confidence 99999999999999999999999999987888888776654
No 29
>KOG2084|consensus
Probab=72.79 E-value=4.9 Score=46.36 Aligned_cols=42 Identities=29% Similarity=0.424 Sum_probs=30.2
Q ss_pred cccCCCCCCeeEEEEEEcCeeEEEEEEccCCCCCC-eEEEecCCCCC
Q psy13861 536 FANHSINPNCYAKVMMVNGDHRIGIFAKRAILPGE-ELYFDYRYGPT 581 (1048)
Q Consensus 536 FINHSC~PN~~v~~v~v~g~~rI~ifA~RDI~aGE-ELT~DYg~~~~ 581 (1048)
++||||.||+. +..++ ..+.+++..++.+++ ||++.|....+
T Consensus 208 ~~~hsC~pn~~---~~~~~-~~~~~~~~~~~~~~~~~l~~~y~~~~~ 250 (482)
T KOG2084|consen 208 LFNHSCFPNIS---VIFDG-RGLALLVPAGIDAGEEELTISYTDPLL 250 (482)
T ss_pred hcccCCCCCeE---EEECC-ceeEEEeecccCCCCCEEEEeeccccc
Confidence 78999999987 23333 345566677777776 99999966543
No 30
>KOG1337|consensus
Probab=71.67 E-value=2.8 Score=50.08 Aligned_cols=41 Identities=27% Similarity=0.402 Sum_probs=31.4
Q ss_pred cccCCCCCCeeEEEEEEcCeeEEEEEEccCCCCCCeEEEecCCC
Q psy13861 536 FANHSINPNCYAKVMMVNGDHRIGIFAKRAILPGEELYFDYRYG 579 (1048)
Q Consensus 536 FINHSC~PN~~v~~v~v~g~~rI~ifA~RDI~aGEELT~DYg~~ 579 (1048)
+.||++.+. ...+......+.+++.++|.+||||++.||..
T Consensus 239 ~~NH~~~~~---~~~~~~~d~~~~l~~~~~v~~geevfi~YG~~ 279 (472)
T KOG1337|consen 239 LLNHSPEVI---KAGYNQEDEAVELVAERDVSAGEEVFINYGPK 279 (472)
T ss_pred hhccCchhc---cccccCCCCcEEEEEeeeecCCCeEEEecCCC
Confidence 569999992 12222233489999999999999999999873
No 31
>TIGR01557 myb_SHAQKYF myb-like DNA-binding domain, SHAQKYF class. This model describes a DNA-binding domain restricted to (but common in) plant proteins, many of which also contain a response regulator domain. The domain appears related to the Myb-like DNA-binding domain described by Pfam model pfam00249. It is distinguished in part by a well-conserved motif SH[AL]QKY[RF] at the C-terminal end of the motif.
Probab=61.96 E-value=20 Score=31.50 Aligned_cols=45 Identities=24% Similarity=0.227 Sum_probs=38.1
Q ss_pred CCCCHHHHHHHHHHHHhcCC-Cc---hHHHhhcC-CC-ChHHHHHHHHHhh
Q psy13861 813 NEWTGSDQSLFRAIHKVLYN-NY---CAIAQVMM-TK-TCQQVYQFAQKEA 857 (1048)
Q Consensus 813 ~~Wt~~E~sL~r~l~~~~~~-N~---C~IA~~lg-~K-TC~EV~~~~~~~~ 857 (1048)
..||+.|...|...++.+|. +. =.|+.+++ ++ |-.+|-+++++.-
T Consensus 4 ~~WT~eeh~~Fl~ai~~~G~g~~a~pk~I~~~~~~~~lT~~qV~SH~QKy~ 54 (57)
T TIGR01557 4 VVWTEDLHDRFLQAVQKLGGPDWATPKRILELMVVDGLTRDQVASHLQKYR 54 (57)
T ss_pred CCCCHHHHHHHHHHHHHhCCCcccchHHHHHHcCCCCCCHHHHHHHHHHHH
Confidence 47999999999999999997 66 67888765 56 9999999998753
No 32
>PF05033 Pre-SET: Pre-SET motif; InterPro: IPR007728 This region is found in a number of histone lysine methyltransferases (HMTase), N-terminal to the SET domain; it is generally described as the pre-SET domain. Histone lysine methylation is part of the histone code that regulated chromatin function and epigenetic control of gene function. Histone lysine methyltransferases (HMTase) differ both in their substrate specificity for the various acceptor lysines as well as in their product specificity for the number of methyl groups (one, two, or three) they transfer. With just one exception [], the HMTases belong to SET family that can be classified according to the sequences surrounding the SET domain [, ]. Structural studies on the human SET7/9, a mono-methylase, have revealed the molecular basis for the specificity of the enzyme for the histone-target and the roles of the invariant residues in the SET domain in determining the methylation specificities []. The pre-SET domain, as found in the SUV39 SET family, contains nine invariant cysteine residues that are grouped into two segments separated by a region of variable length. These 9 cysteines coordinate 3 zinc ions to form a triangular cluster, where each of the zinc ions is coordinated by 4 four cysteines to give a tetrahedral configuration. The function of this domain is structural, holding together 2 long segments of random coils and stabilising the SET domain. The C-terminal region including the post-SET domain is disordered when not interacting with a histone tail and in the absence of zinc. The three conserved cysteines in the post-SET domain form a zinc-binding site [] when coupled to a fourth conserved cysteine in the knot-like structure close to the SET domain active site []. The structured post-SET region brings in the C-terminal residues that participate in S-adenosylmethine-binding and histone tail interactions. The three conserved cysteine residues are essential for HMTase activity, as replacement with serine abolishes HMTase activity []. ; GO: 0008270 zinc ion binding, 0018024 histone-lysine N-methyltransferase activity, 0034968 histone lysine methylation, 0005634 nucleus; PDB: 3K5K_A 2O8J_D 3RJW_B 1ML9_A 1PEG_B 1MVH_A 1MVX_A 3BO5_A 2RFI_B 3MO5_B ....
Probab=59.91 E-value=6.2 Score=37.25 Aligned_cols=20 Identities=35% Similarity=0.793 Sum_probs=8.8
Q ss_pred cccccccCCCCCccCCCCcCcCCcccCcch
Q psy13861 958 YLAVRECDPDLCQTCGADQFDVSKISCKNV 987 (1048)
Q Consensus 958 ~~a~rECdPdlC~~Cg~~~~d~~~~~C~Nr 987 (1048)
....+||.+ .|+++ ..|.||
T Consensus 84 ~~~i~EC~~----~C~C~------~~C~NR 103 (103)
T PF05033_consen 84 KPPIFECND----NCGCS------PSCRNR 103 (103)
T ss_dssp TSEEE---T----TSSS-------TTSTT-
T ss_pred CCeEEeCCC----CCCCC------CCCCCC
Confidence 345677863 25553 479886
No 33
>KOG1171|consensus
Probab=58.58 E-value=3.6 Score=48.60 Aligned_cols=52 Identities=37% Similarity=0.962 Sum_probs=44.3
Q ss_pred CCCC-CCceecCCCCccccccCCcccccc-------------------------------------------------CC
Q psy13861 913 PCDA-SCPCVSAQNFCEKFCKCSFDCQNR-------------------------------------------------FP 942 (1048)
Q Consensus 913 pC~~-~C~C~~~~~~Cek~C~C~~~C~nR-------------------------------------------------Fp 942 (1048)
.|-. -|-|+..|.||-.+|.|- +|.|. -.
T Consensus 140 kclklYCeCFAsG~yC~~~CnCv-nC~N~~~~e~~r~~a~k~~l~RNP~AFkPKia~s~~~~~da~~~~~~~~~sa~hkk 218 (406)
T KOG1171|consen 140 KCLKLYCECFASGVYCTGPCNCV-NCFNNPEHESVRLKARKQILERNPNAFKPKIAASSSGIADASEEASKTPASARHKK 218 (406)
T ss_pred HHHHHhHHHHhhcccccCCccee-eccCCCcchHHHHHHHHHHhhcCccccccccccCCcccchhhhhhhccchhhhhcC
Confidence 4554 799999999999999998 56654 35
Q ss_pred CCCcC-CCCCCCCccccccccccC
Q psy13861 943 GCRCK-AQCNTKQCPCYLAVRECD 965 (1048)
Q Consensus 943 GC~Ck-~~C~tk~CpC~~a~rECd 965 (1048)
||+|+ ..|..+-|.||+++.-|.
T Consensus 219 GC~CkkSgClKkYCECyQa~vlCS 242 (406)
T KOG1171|consen 219 GCNCKKSGCLKKYCECYQAGVLCS 242 (406)
T ss_pred CCCCccccchHHHHHHHhcCCCcc
Confidence 99996 689999999999999996
No 34
>smart00570 AWS associated with SET domains. subdomain of PRESET
Probab=49.30 E-value=6.3 Score=34.15 Aligned_cols=29 Identities=24% Similarity=0.558 Sum_probs=25.5
Q ss_pred CCCCCCCceecCCCCccccccCCcccccc
Q psy13861 912 QPCDASCPCVSAQNFCEKFCKCSFDCQNR 940 (1048)
Q Consensus 912 gpC~~~C~C~~~~~~Cek~C~C~~~C~nR 940 (1048)
.+|+++|.....-+-|...|.|...|+|+
T Consensus 16 ~~CgsdClNR~l~~EC~~~C~~G~~C~Nq 44 (51)
T smart00570 16 GACGSDCLNRMLLIECSSDCPCGSYCSNQ 44 (51)
T ss_pred CCcchHHHHHHHhhhcCCCCCCCcCccCc
Confidence 57999999999889999999999999865
No 35
>KOG3813|consensus
Probab=44.50 E-value=9.9 Score=45.99 Aligned_cols=23 Identities=30% Similarity=0.754 Sum_probs=13.8
Q ss_pred cCCCCCCCCCCC-CCceecCCCCccc
Q psy13861 905 PCRHPPTQPCDA-SCPCVSAQNFCEK 929 (1048)
Q Consensus 905 PC~H~~ggpC~~-~C~C~~~~~~Cek 929 (1048)
-|+|. +-|++ .|.|.+.|+-|..
T Consensus 309 GCsCr--~~CdPETCaCSqaGIkCQv 332 (640)
T KOG3813|consen 309 GCSCR--GVCDPETCACSQAGIKCQV 332 (640)
T ss_pred CCccc--ceeChhhcchhccCceEee
Confidence 55555 56665 5777666665543
No 36
>PF05033 Pre-SET: Pre-SET motif; InterPro: IPR007728 This region is found in a number of histone lysine methyltransferases (HMTase), N-terminal to the SET domain; it is generally described as the pre-SET domain. Histone lysine methylation is part of the histone code that regulated chromatin function and epigenetic control of gene function. Histone lysine methyltransferases (HMTase) differ both in their substrate specificity for the various acceptor lysines as well as in their product specificity for the number of methyl groups (one, two, or three) they transfer. With just one exception [], the HMTases belong to SET family that can be classified according to the sequences surrounding the SET domain [, ]. Structural studies on the human SET7/9, a mono-methylase, have revealed the molecular basis for the specificity of the enzyme for the histone-target and the roles of the invariant residues in the SET domain in determining the methylation specificities []. The pre-SET domain, as found in the SUV39 SET family, contains nine invariant cysteine residues that are grouped into two segments separated by a region of variable length. These 9 cysteines coordinate 3 zinc ions to form a triangular cluster, where each of the zinc ions is coordinated by 4 four cysteines to give a tetrahedral configuration. The function of this domain is structural, holding together 2 long segments of random coils and stabilising the SET domain. The C-terminal region including the post-SET domain is disordered when not interacting with a histone tail and in the absence of zinc. The three conserved cysteines in the post-SET domain form a zinc-binding site [] when coupled to a fourth conserved cysteine in the knot-like structure close to the SET domain active site []. The structured post-SET region brings in the C-terminal residues that participate in S-adenosylmethine-binding and histone tail interactions. The three conserved cysteine residues are essential for HMTase activity, as replacement with serine abolishes HMTase activity []. ; GO: 0008270 zinc ion binding, 0018024 histone-lysine N-methyltransferase activity, 0034968 histone lysine methylation, 0005634 nucleus; PDB: 3K5K_A 2O8J_D 3RJW_B 1ML9_A 1PEG_B 1MVH_A 1MVX_A 3BO5_A 2RFI_B 3MO5_B ....
Probab=35.59 E-value=22 Score=33.60 Aligned_cols=36 Identities=31% Similarity=0.879 Sum_probs=20.1
Q ss_pred cccCCCCCCCCC--CCCCceecCCC--------------------CccccccCCcccccc
Q psy13861 903 FTPCRHPPTQPC--DASCPCVSAQN--------------------FCEKFCKCSFDCQNR 940 (1048)
Q Consensus 903 y~PC~H~~ggpC--~~~C~C~~~~~--------------------~Cek~C~C~~~C~nR 940 (1048)
..-|++. +.| ...|.|..... -|...|.|+.+|.||
T Consensus 46 ~~~C~C~--~~C~~~~~C~C~~~~~~~~~Y~~~g~l~~~~~~~i~EC~~~C~C~~~C~NR 103 (103)
T PF05033_consen 46 LQGCDCS--GDCSNPSNCECLQRNGGIFAYDSNGRLRIPDKPPIFECNDNCGCSPSCRNR 103 (103)
T ss_dssp TS----S--SSSTCTTTSHHHCCTSSS-SB-TTSSBSSSSTSEEE---TTSSS-TTSTT-
T ss_pred CccCccC--CCCCCCCCCcCccccCccccccCCCcCccCCCCeEEeCCCCCCCCCCCCCC
Confidence 4578888 568 35899987653 277888888888886
No 37
>smart00508 PostSET Cysteine-rich motif following a subset of SET domains.
Probab=33.97 E-value=19 Score=27.35 Aligned_cols=15 Identities=7% Similarity=0.040 Sum_probs=13.6
Q ss_pred ceeeccCCccccchh
Q psy13861 585 KFVVTLDSNVANKYI 599 (1048)
Q Consensus 585 k~~C~Cg~~~CRk~I 599 (1048)
.+.|.||+..||++|
T Consensus 2 ~~~C~CGs~~CRG~l 16 (26)
T smart00508 2 KQPCLCGAPNCRGFL 16 (26)
T ss_pred CeeeeCCCcccccee
Confidence 478999999999988
No 38
>PF03638 TCR: Tesmin/TSO1-like CXC domain, cysteine-rich domain; InterPro: IPR005172 This entry includes proteins that have two copies of a cysteine rich motif as follows: C-X-C-X4-C-X3-YC-X-C-X6-C-X3-C-X-C-X2-C. The family includes Tesmin Q9Y4I5 from SWISSPROT [] and TSO1 Q9LE32 from SWISSPROT []. This group of proteins is called a CXC domain in [].
Probab=26.88 E-value=36 Score=28.59 Aligned_cols=36 Identities=33% Similarity=0.868 Sum_probs=28.3
Q ss_pred ccCCCCCCCCCCC-CCceecCCCCccccccCCccccccC
Q psy13861 904 TPCRHPPTQPCDA-SCPCVSAQNFCEKFCKCSFDCQNRF 941 (1048)
Q Consensus 904 ~PC~H~~ggpC~~-~C~C~~~~~~Cek~C~C~~~C~nRF 941 (1048)
.+|.+. -..|-. -|.|+..+.+|...|.|. +|.|..
T Consensus 4 ~gC~Ck-ks~Clk~YC~Cf~~g~~C~~~C~C~-~C~N~~ 40 (42)
T PF03638_consen 4 KGCNCK-KSKCLKLYCECFQAGRFCTPNCKCQ-NCKNTE 40 (42)
T ss_pred CCCccc-CcChhhhhCHHHHCcCcCCCCcccC-CCCCcC
Confidence 467665 456765 799999999999999995 687763
No 39
>PF08666 SAF: SAF domain; InterPro: IPR013974 This entry includes a range of different proteins, such as antifreeze proteins, flagellar FlgA proteins, and CpaB pilus proteins. ; PDB: 1C89_A 3NLA_A 3RDN_A 1C8A_A 3FRN_A 1WVO_A 3K3S_H 3G8R_B 1XUU_A 1XUZ_A ....
Probab=24.79 E-value=40 Score=28.67 Aligned_cols=15 Identities=27% Similarity=0.308 Sum_probs=11.2
Q ss_pred EEEEccCCCCCCeEE
Q psy13861 559 GIFAKRAILPGEELY 573 (1048)
Q Consensus 559 ~ifA~RDI~aGEELT 573 (1048)
.++|.|||++|+.|+
T Consensus 3 vvVA~~di~~G~~i~ 17 (63)
T PF08666_consen 3 VVVAARDIPAGTVIT 17 (63)
T ss_dssp EEEESSTB-TT-BEC
T ss_pred EEEEeCccCCCCEEc
Confidence 478999999999984
No 40
>smart00468 PreSET N-terminal to some SET domains. A Cys-rich putative Zn2+-binding domain that occurs N-terminal to some SET domains. Function is unknown. Unpublished.
Probab=24.08 E-value=50 Score=31.17 Aligned_cols=23 Identities=26% Similarity=0.888 Sum_probs=16.3
Q ss_pred ccCCCCCcCCCCCCCC-ccccccc
Q psy13861 939 NRFPGCRCKAQCNTKQ-CPCYLAV 961 (1048)
Q Consensus 939 nRFpGC~Ck~~C~tk~-CpC~~a~ 961 (1048)
..+.||.|.+.|.... |.|.+.+
T Consensus 47 ~~~~gC~C~~~C~~~~~C~C~~~~ 70 (98)
T smart00468 47 SPLVGCSCSGDCSSSNKCECARKN 70 (98)
T ss_pred CCCCCCcCCCCCCCCCcCCcHhhc
Confidence 4566888888887665 8886544
No 41
>PF11403 Yeast_MT: Yeast metallothionein; InterPro: IPR022710 Metallothioneins are characterised by an abundance of cysteine residues and a lack of generic secondary structure motifs. This protein functions in primary metal storage, transport and detoxification []. For the first 40 residues in the protein the polypeptide wraps around the metal by forming two large parallel loops separated by a deep cleft containing the metal cluster []. ; PDB: 1AQS_A 1AQR_A 1RJU_V 1FMY_A 1AOO_A 1AQQ_A.
Probab=23.11 E-value=67 Score=25.99 Aligned_cols=10 Identities=40% Similarity=1.179 Sum_probs=3.9
Q ss_pred CCccccccCC
Q psy13861 925 NFCEKFCKCS 934 (1048)
Q Consensus 925 ~~Cek~C~C~ 934 (1048)
.-|.|.|.|+
T Consensus 18 eqcqkscscp 27 (40)
T PF11403_consen 18 EQCQKSCSCP 27 (40)
T ss_dssp TTSTTS-SS-
T ss_pred HHHhhcCCCC
Confidence 3455555554
No 42
>KOG4167|consensus
Probab=22.65 E-value=1.1e+02 Score=38.95 Aligned_cols=39 Identities=21% Similarity=0.454 Sum_probs=34.2
Q ss_pred CCCCCCHHHHHHHHHHHHhcCCCchHHHhhcCCCChHHH
Q psy13861 811 GNNEWTGSDQSLFRAIHKVLYNNYCAIAQVMMTKTCQQV 849 (1048)
Q Consensus 811 ~~~~Wt~~E~sL~r~l~~~~~~N~C~IA~~lg~KTC~EV 849 (1048)
...-||+.|+-||.+.+-++-.+|=+|+..+.+||=.|-
T Consensus 618 gSd~WTp~E~~lF~kA~y~~~KDF~~v~km~~~KtVaqC 656 (907)
T KOG4167|consen 618 GSDKWTPLERKLFNKALYTYSKDFIFVQKMVKSKTVAQC 656 (907)
T ss_pred CcccccHHHHHHHHHHHHHhcccHHHHHHHhccccHHHH
Confidence 445799999999999999999999999999988885544
Done!