Query         psy15819
Match_columns 203
No_of_seqs    156 out of 371
Neff          6.3 
Searched_HMMs 46136
Date          Fri Aug 16 22:08:37 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy15819.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/15819hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PF05712 MRG:  MRG;  InterPro:  100.0 1.3E-44 2.7E-49  301.1  13.0  136   67-202    27-178 (194)
  2 KOG3001|consensus              100.0 1.1E-34 2.4E-39  262.9   8.7  136   67-202   223-361 (391)
  3 PF11717 Tudor-knot:  RNA bindi  99.1 2.2E-11 4.7E-16   82.1   0.7   21    1-21     33-54  (55)
  4 PLN00104 MYST -like histone ac  98.1 6.7E-07 1.5E-11   82.9   0.3   21    1-21     91-112 (450)
  5 cd00024 CHROMO Chromatin organ  95.3    0.01 2.3E-07   38.6   1.8   23    1-23     22-45  (55)
  6 smart00298 CHROMO Chromatin or  94.6   0.011 2.3E-07   38.4   0.2   22    1-22     20-42  (55)
  7 PTZ00064 histone acetyltransfe  94.0   0.016 3.6E-07   54.7   0.2   22    1-22    151-173 (552)
  8 smart00561 MBT Present in Dros  91.3     0.1 2.2E-06   39.0   1.3   17    2-18     59-76  (96)
  9 PLN03239 histone acetyltransfe  89.2    0.15 3.3E-06   46.4   0.7   20    1-20      1-21  (351)
 10 KOG2748|consensus               78.3     1.7 3.7E-05   39.6   2.5   22    1-22     28-50  (369)
 11 PF02820 MBT:  mbt repeat;  Int  74.3     1.3 2.9E-05   30.9   0.6   20    2-21     28-50  (73)
 12 PF00385 Chromo:  Chromo (CHRro  69.2       2 4.3E-05   27.9   0.5   24    1-24     21-45  (55)
 13 smart00340 HALZ homeobox assoc  65.3     4.6 9.9E-05   25.9   1.6   18   22-39     16-33  (44)
 14 cd05162 PWWP The PWWP domain,   52.7      12 0.00027   26.5   2.3   27    1-29     40-67  (87)
 15 cd05837 MSH6_like The PWWP dom  45.1     8.6 0.00019   29.1   0.5   26    1-27     45-71  (110)
 16 PF15136 UPF0449:  Uncharacteri  44.5      18 0.00039   27.2   2.1   22   86-107     6-27  (97)
 17 PF13495 Phage_int_SAM_4:  Phag  41.8      37 0.00081   23.2   3.3   44   91-138    38-81  (85)
 18 COG5027 SAS2 Histone acetyltra  41.8       9 0.00019   35.2   0.1   22    1-22     37-59  (395)
 19 PF06543 Lac_bphage_repr:  Lact  40.9     8.5 0.00018   25.2  -0.1    9   10-18     19-27  (49)
 20 PF08822 DUF1804:  Protein of u  40.1 1.2E+02  0.0027   24.9   6.5  108   74-201    46-157 (165)
 21 PF11390 FdsD:  NADH-dependant   39.4      45 0.00099   22.8   3.3   33  169-201     1-33  (61)
 22 TIGR03650 violacein_E violacei  26.8     8.4 0.00018   31.2  -2.3   24  156-179   135-161 (184)
 23 PF00855 PWWP:  PWWP domain;  I  25.4      43 0.00093   23.2   1.4   26    1-29     37-63  (86)
 24 KOG3248|consensus               25.1 1.2E+02  0.0026   28.0   4.4   70   91-166   187-266 (421)
 25 KOG2573|consensus               24.4 1.6E+02  0.0034   27.9   5.1   75   67-148    70-146 (498)
 26 PF13045 DUF3905:  Protein of u  20.9      48   0.001   24.0   0.9   19   12-30     64-83  (84)
 27 KOG3035|consensus               20.7      29 0.00063   30.0  -0.3   14    4-17     44-58  (245)
 28 PF08006 DUF1700:  Protein of u  20.5      49  0.0011   26.7   1.0   24   85-109    12-35  (181)

No 1  
>PF05712 MRG:  MRG;  InterPro: IPR008676 This family consists of three different eukaryotic proteins (mortality factor 4 (MORF4/MRG15), male-specific lethal 3(MSL-3) and ESA1-associated factor 3(EAF3)). It is thought that the MRG family is involved in transcriptional regulation via histone acetylation [, ].; GO: 0005634 nucleus; PDB: 2AQL_A 2F5J_A 2LKM_B 2Y0N_D.
Probab=100.00  E-value=1.3e-44  Score=301.08  Aligned_cols=136  Identities=43%  Similarity=0.732  Sum_probs=115.7

Q ss_pred             ccc--hhhhHHHHHHHhhcCCceeeCCCCCCHHHHHHHHHHhhhhccCCccch----hhhHHHHHHHHHHHHhhhhcccC
Q psy15819         67 KIC--SRTWLVDDWDTINNKNKLHQIPAKTSVDEIFESYLKHKANSKQMSGAK----GKTVTEMVAGLKEYFNVMLGSQL  140 (203)
Q Consensus        67 ~iP--Lk~~LvdD~e~i~k~~~l~~LP~~~~V~~Il~~y~~~~~~~~~~s~~~----~~~~~e~~~Gl~~yFn~~L~~~L  140 (203)
                      +||  ||.+|||||++|+++++|++|||++||++||++|.++...........    ...++|+++||++|||++||++|
T Consensus        27 ~lP~~Lk~~LvdD~~~I~~~~~l~~LP~~~~V~~IL~~y~~~~~~~~~~~~~~~~~~~~~~~e~~~Gl~~yFn~~L~~~L  106 (194)
T PF05712_consen   27 ELPEELKKILVDDWELITKEKKLVKLPAKPSVDDILEDYVESFADSDDSEEESAEQERDLLKEVADGLRDYFNKALGSQL  106 (194)
T ss_dssp             ---HHHHHHHHHHHHHHHTS-EEE-SS-SSBHHHHHHHHHHHHHHCHCSS---THH--HHHHHHHHHHHHHHHHHCCCCT
T ss_pred             ECCHHHHHHHHHHHHHHHcCCceeeCCCCCCHHHHHHHHHHHHhhcccCcchhHHHHHHHHHHHHHHHHHHHHHHhcccc
Confidence            899  999999999999999999999999999999999999997532222211    15699999999999999999999


Q ss_pred             CChHHHHHHHHHHHhc----------CCCCCccccChHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHhhhc
Q psy15819        141 LYQAERRQYSQVMQKY----------QGAPLSSLYGASHLLRLFVRIGSVLAYTGLTERNIQLLQNAFQDFL  202 (203)
Q Consensus       141 LY~~Er~Qy~~~~~~~----------p~~~~s~~YG~~HLLRL~v~LP~ll~~t~~d~~s~~~l~~~~~dfl  202 (203)
                      ||++||+||.++++.+          ++++||++||++|||||||+||+||+.++||+.+++.|+.++++||
T Consensus       107 LY~~Er~Qy~~~~~~~~~~~~~~~~~~~~~ps~~YG~~HLLRL~vkLPell~~~~~~~~~~~~l~~~l~~fl  178 (194)
T PF05712_consen  107 LYKFERPQYDELLKKHATRDDSPPDEPGFRPSDIYGAIHLLRLFVKLPELLSSTNMDEESINILLEHLQDFL  178 (194)
T ss_dssp             S-GGGHHHHHHHHHHS---------STTS-HHHC-BHHHHHHHHHHHHHHHCCCGGCHHHHHHHHHHHHHHH
T ss_pred             CcHHHHHHHHHHHHhcccchhccccCCCCCHHhhccHHHHHHHHHHHHHHHhhCCCCHHHHHHHHHHHHHHH
Confidence            9999999999999887          7899999999999999999999999999999999999999999997


No 2  
>KOG3001|consensus
Probab=100.00  E-value=1.1e-34  Score=262.88  Aligned_cols=136  Identities=40%  Similarity=0.715  Sum_probs=121.6

Q ss_pred             ccc--hhhhHHHHHHHhhcCCceeeCCCCCCHHHHHHHHHHhhhhccCCccchhhhHHHHHHHHHHHHhhhhcccCCChH
Q psy15819         67 KIC--SRTWLVDDWDTINNKNKLHQIPAKTSVDEIFESYLKHKANSKQMSGAKGKTVTEMVAGLKEYFNVMLGSQLLYQA  144 (203)
Q Consensus        67 ~iP--Lk~~LvdD~e~i~k~~~l~~LP~~~~V~~Il~~y~~~~~~~~~~s~~~~~~~~e~~~Gl~~yFn~~L~~~LLY~~  144 (203)
                      .||  ++..|+|||+.+++..+++.+|+..+|+.|...|........+.+......+.+...|++.|||.+||.+|||++
T Consensus       223 ~l~~~~~~~l~dd~~~vt~~~~~~~~~~~~~~e~~~~~~~~~~k~~~~~~~~~~~~~~~~~~g~~~yfn~~lG~~llyk~  302 (391)
T KOG3001|consen  223 SLPQELKRSLVDDWDSVTEVDSLAELPQDVTVEQILKKYGFSEKKASGLSNSKEPEVLEVAAGLKRYFNGQLGVMLLYKF  302 (391)
T ss_pred             cCchhhcccccchhhhhhhhhhhhcccCCchhhhhhhhhhHhhhhccccccccccccccccccceeeecccchhhhhhhh
Confidence            777  999999999999999999999999999999999988776433332223334445559999999999999999999


Q ss_pred             HHHHHHHHHHhcC-CCCCccccChHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHhhhc
Q psy15819        145 ERRQYSQVMQKYQ-GAPLSSLYGASHLLRLFVRIGSVLAYTGLTERNIQLLQNAFQDFL  202 (203)
Q Consensus       145 Er~Qy~~~~~~~p-~~~~s~~YG~~HLLRL~v~LP~ll~~t~~d~~s~~~l~~~~~dfl  202 (203)
                      ||.||.++++.+| +.+||++||++|||||||+||+||.++.|++++++.|+.++++|+
T Consensus       303 Er~qy~~~~~~~~~Ds~~s~vyGa~HLlRLfvKLpe~l~~~~~~~~~l~~Ll~~~~~fl  361 (391)
T KOG3001|consen  303 ERLQYAEVVAKYPKDSPPSNVYGAEHLLRLFVKLPEILKYTPMDEKSLALLLRHRKDFL  361 (391)
T ss_pred             hhHHHHHHHhcCCCCCCcccchhHHHHHHHHhHHHHHHHhccccHHHHHHHHHHHHHHH
Confidence            9999999999998 888999999999999999999999999999999999999999997


No 3  
>PF11717 Tudor-knot:  RNA binding activity-knot of a chromodomain ; PDB: 2EKO_A 2RO0_A 2RNZ_A 1WGS_A 3E9G_A 3E9F_A 2K3X_A 2K3Y_A 2EFI_A 2F5K_F ....
Probab=99.09  E-value=2.2e-11  Score=82.13  Aligned_cols=21  Identities=48%  Similarity=1.126  Sum_probs=18.6

Q ss_pred             Ceeec-cCCCCccccccccccc
Q psy15819          1 MIGHY-KWKRKWDEWVPECRVL   21 (203)
Q Consensus         1 ~~vHy-GW~~~WDeWV~~~ril   21 (203)
                      |+||| |||++|||||++++|.
T Consensus        33 YyVHY~g~nkR~DeWV~~~~i~   54 (55)
T PF11717_consen   33 YYVHYQGWNKRLDEWVPESRIR   54 (55)
T ss_dssp             EEEEETTSTGCC-EEEETTTEE
T ss_pred             EEEEcCCCCCCceeeecHHHcc
Confidence            89999 9999999999999873


No 4  
>PLN00104 MYST -like histone acetyltransferase; Provisional
Probab=98.10  E-value=6.7e-07  Score=82.91  Aligned_cols=21  Identities=33%  Similarity=0.646  Sum_probs=20.0

Q ss_pred             Ceeec-cCCCCccccccccccc
Q psy15819          1 MIGHY-KWKRKWDEWVPECRVL   21 (203)
Q Consensus         1 ~~vHy-GW~~~WDeWV~~~ril   21 (203)
                      |+||| |||++|||||+.+||-
T Consensus        91 YYVHY~g~nrRlDEWV~~~rLd  112 (450)
T PLN00104         91 YYVHYTEFNRRLDEWVKLEQLD  112 (450)
T ss_pred             EEEEEecCCccHhhccCHhhcc
Confidence            89999 9999999999999994


No 5  
>cd00024 CHROMO Chromatin organization modifier (chromo) domain is a conserved region of around 50 amino acids found in a variety of chromosomal proteins, which appear to play a role in the functional organization of the eukaryotic nucleus. Experimental evidence implicates the chromo domain in the binding activity of these proteins to methylated histone tails and maybe RNA. May occur as single instance, in a tandem arrangement or followd by a related "chromo shadow" domain.
Probab=95.34  E-value=0.01  Score=38.56  Aligned_cols=23  Identities=17%  Similarity=0.367  Sum_probs=20.9

Q ss_pred             Ceeec-cCCCCccccccccccccc
Q psy15819          1 MIGHY-KWKRKWDEWVPECRVLKF   23 (203)
Q Consensus         1 ~~vHy-GW~~~WDeWV~~~rilk~   23 (203)
                      |+||+ ||+...|.|++++.|...
T Consensus        22 y~VkW~g~~~~~~tWe~~~~l~~~   45 (55)
T cd00024          22 YLVKWKGYSYSEDTWEPEENLEDC   45 (55)
T ss_pred             EEEEECCCCCccCccccHHHhCch
Confidence            79999 999999999999987754


No 6  
>smart00298 CHROMO Chromatin organization modifier domain.
Probab=94.58  E-value=0.011  Score=38.36  Aligned_cols=22  Identities=23%  Similarity=0.502  Sum_probs=19.6

Q ss_pred             Ceeec-cCCCCcccccccccccc
Q psy15819          1 MIGHY-KWKRKWDEWVPECRVLK   22 (203)
Q Consensus         1 ~~vHy-GW~~~WDeWV~~~rilk   22 (203)
                      |+||| ||+.+.+.|++.+.+..
T Consensus        20 ylVkW~g~~~~~~tW~~~~~l~~   42 (55)
T smart00298       20 YLVKWKGYSYSEDTWEPEENLLN   42 (55)
T ss_pred             EEEEECCCCCccCceeeHHHHHH
Confidence            79999 99999999999986654


No 7  
>PTZ00064 histone acetyltransferase; Provisional
Probab=93.99  E-value=0.016  Score=54.72  Aligned_cols=22  Identities=23%  Similarity=0.478  Sum_probs=20.2

Q ss_pred             Ceeec-cCCCCcccccccccccc
Q psy15819          1 MIGHY-KWKRKWDEWVPECRVLK   22 (203)
Q Consensus         1 ~~vHy-GW~~~WDeWV~~~rilk   22 (203)
                      |+||| |.|.+=||||+-+||--
T Consensus       151 yYVHy~g~nrRlD~WV~~~ri~~  173 (552)
T PTZ00064        151 FYVHFRGLNRRLDRWVKGKDIKL  173 (552)
T ss_pred             EEEEecCcCchHhhhcChhhccc
Confidence            89999 99999999999998853


No 8  
>smart00561 MBT Present in Drosophila Scm, l(3)mbt, and vertebrate SCML2. Present in Drosophila Scm, l(3)mbt, and vertebrate SCML2. These proteins are involved in transcriptional regulation.
Probab=91.33  E-value=0.1  Score=38.95  Aligned_cols=17  Identities=29%  Similarity=0.808  Sum_probs=15.8

Q ss_pred             eeec-cCCCCcccccccc
Q psy15819          2 IGHY-KWKRKWDEWVPEC   18 (203)
Q Consensus         2 ~vHy-GW~~~WDeWV~~~   18 (203)
                      +||| ||..+.|.|+..+
T Consensus        59 ~v~~dg~~~~~D~W~~~~   76 (96)
T smart00561       59 LLHFDGWDDKYDFWCDAD   76 (96)
T ss_pred             EEEEccCCCcCCEEEECC
Confidence            6999 9999999999876


No 9  
>PLN03239 histone acetyltransferase; Provisional
Probab=89.17  E-value=0.15  Score=46.37  Aligned_cols=20  Identities=30%  Similarity=0.777  Sum_probs=18.9

Q ss_pred             Ceeec-cCCCCcccccccccc
Q psy15819          1 MIGHY-KWKRKWDEWVPECRV   20 (203)
Q Consensus         1 ~~vHy-GW~~~WDeWV~~~ri   20 (203)
                      |+||| |-|+|-|+||+.+.+
T Consensus         1 yYVh~~~~nkRlD~Wv~~~~l   21 (351)
T PLN03239          1 YYVHYKDFNRRMDEWISKDKS   21 (351)
T ss_pred             CeEEeccccchHhhhcChhhc
Confidence            89999 999999999999876


No 10 
>KOG2748|consensus
Probab=78.27  E-value=1.7  Score=39.59  Aligned_cols=22  Identities=27%  Similarity=0.856  Sum_probs=20.3

Q ss_pred             Ceeec-cCCCCcccccccccccc
Q psy15819          1 MIGHY-KWKRKWDEWVPECRVLK   22 (203)
Q Consensus         1 ~~vHy-GW~~~WDeWV~~~rilk   22 (203)
                      |+|-+ ||+.++.-|=|++-||.
T Consensus        28 YlVKWkGWs~kyNTWEPEENILD   50 (369)
T KOG2748|consen   28 YLVKWKGWSQKYNTWEPEENILD   50 (369)
T ss_pred             EEEEecccccccCccCccccccC
Confidence            89999 99999999999988874


No 11 
>PF02820 MBT:  mbt repeat;  InterPro: IPR004092 The function of the malignant brain tumor (MBT) repeat is unknown, but is found in a number of nuclear proteins involved in transcriptional repression. The repeat contains a completely conserved glutamate at its amino terminus that may be important for function.  The crystal structure of the two MBT repeats of human SCM-like 2 protein has been reported. Each repeat consists of an extended "arm" and a globular core. The arm of the first repeat packs against the core of the second repeat and vice versa. The structure of the core-interacting part of each arm consists of an N-terminal alpha-helix and a turn of 310 helix connected by a short beta-strand. The core consists of an Src homology 3-like five-stranded beta-barrel followed by a C-terminal alpha-helix and another short beta-strand. Each arm interacts with its partner core in a similar way, with the orientation of the N-terminal helix relative to the barrel varying slightly. There are also extensive interactions between the two barrels [].; GO: 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus; PDB: 2P0K_A 3F70_B 3CEY_A 2VYT_A 2BIV_A 1OI1_A 3OQ5_A 2RJE_B 2RJD_A 2RI3_A ....
Probab=74.35  E-value=1.3  Score=30.89  Aligned_cols=20  Identities=25%  Similarity=0.727  Sum_probs=17.0

Q ss_pred             eeec-cCCCCcccccccc--ccc
Q psy15819          2 IGHY-KWKRKWDEWVPEC--RVL   21 (203)
Q Consensus         2 ~vHy-GW~~~WDeWV~~~--ril   21 (203)
                      +||| ||.+..|.|+..+  +|.
T Consensus        28 ~v~~dg~~~~~d~w~~~~S~~i~   50 (73)
T PF02820_consen   28 LVRYDGWDDDYDFWCHIDSPRIF   50 (73)
T ss_dssp             EEEETTSTGGGEEEEETTSTTEE
T ss_pred             EEEEcCCCCCccEEEECCCCCee
Confidence            6999 9999999999865  554


No 12 
>PF00385 Chromo:  Chromo (CHRromatin Organisation MOdifier) domain;  InterPro: IPR023780 The CHROMO (CHRromatin Organization MOdifier) domain [, , , ] is a conserved region of around 60 amino acids, originally identified in Drosophila modifiers of variegation. These are proteins that alter the structure of chromatin to the condensed morphology of heterochromatin, a cytologically visible condition where gene expression is repressed. In one of these proteins, Polycomb, the chromo domain has been shown to be important for chromatin targeting.  Proteins that contain a chromo domain appear to fall into 3 classes. The first class includes proteins having an N-terminal chromo domain followed by a region termed the chromo shadow domain, with weak but significant sequence similarity to the N-terminal chromo domain,[], eg. Drosophila and human heterochromatin protein Su(var)205 (HP1). The second class includes proteins with a single chromo domain, eg. Drosophila protein Polycomb (Pc); mammalian modifier 3; human Mi-2 autoantigen and several yeast and Caenorhabditis elegans hypothetical proteins. In the third class paired tandem chromo domains are found, eg. in mammalian DNA-binding/helicase proteins CHD-1 to CHD-4 and yeast protein CHD1. Functional dissections of chromo domain proteins suggests a mechanistic role for chromo domains in targeting chromo domain proteins to specific regions of the nucleus. The mechanism of targeting may involve protein-protein and/or protein/nucleic acid interactions. Hence, several line of evidence show that the HP1 chromo domain is a methyl-specific histone binding module, whereas the chromo domain of two protein components of the drosophila dosage compensation complex, MSL3 and MOF, contain chromo domains that bind to RNA in vitro []. The high resolution structures of HP1-family protein chromo and chromo shadow domain reveal a conserved chromo domain fold motif consisting of three beta strands packed against an alpha helix. The chromo domain fold belongs to the OB (oligonucleotide/oligosaccharide binding)-fold class found in a variety of prokaryotic and eukaryotic nucleic acid binding protein [].; PDB: 2H1E_B 3MWY_W 2DY8_A 1KNE_A 1KNA_A 1Q3L_A 2EE1_A 1AP0_A 1GUW_A 1X3P_A ....
Probab=69.22  E-value=2  Score=27.90  Aligned_cols=24  Identities=13%  Similarity=0.278  Sum_probs=21.0

Q ss_pred             Ceeec-cCCCCcccccccccccccC
Q psy15819          1 MIGHY-KWKRKWDEWVPECRVLKFN   24 (203)
Q Consensus         1 ~~vHy-GW~~~WDeWV~~~rilk~~   24 (203)
                      |+|++ |+...-+.|.|++.|....
T Consensus        21 ylVkW~g~~~~~~tWe~~~~l~~~~   45 (55)
T PF00385_consen   21 YLVKWKGYPYSENTWEPEENLKNCF   45 (55)
T ss_dssp             EEEEETTSSGGGEEEEEGGGCSSHC
T ss_pred             EEEEECCCCCCCCeEeeHHHHhHhh
Confidence            79999 9999999999999777654


No 13 
>smart00340 HALZ homeobox associated leucin zipper.
Probab=65.29  E-value=4.6  Score=25.85  Aligned_cols=18  Identities=22%  Similarity=0.368  Sum_probs=14.6

Q ss_pred             ccChhhHHHHHHHHHHhH
Q psy15819         22 KFNESNVQRQKDLKKSQQ   39 (203)
Q Consensus        22 k~~~eN~~~q~~l~~~~~   39 (203)
                      ..++||+.+|+++++-..
T Consensus        16 ~LteeNrRL~ke~~eLra   33 (44)
T smart00340       16 SLTEENRRLQKEVQELRA   33 (44)
T ss_pred             HHHHHHHHHHHHHHHHHh
Confidence            368999999999977654


No 14 
>cd05162 PWWP The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids.  The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation.  Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes.  The function of the PWWP domain is still not known precisely; however, based on the fact that other regions of PWWP-domain proteins are responsible for nuclear localization and DNA-binding, is likely that the PWWP domain acts as a site for protein-protein binding interactions, influencing chromatin remodeling and thereby regulating transcriptional processes.  Some PWWP-domain proteins have been linked to cancer or other diseases; some are known to function as growth factors.
Probab=52.71  E-value=12  Score=26.51  Aligned_cols=27  Identities=19%  Similarity=0.457  Sum_probs=20.9

Q ss_pred             Ceeec-cCCCCcccccccccccccChhhHH
Q psy15819          1 MIGHY-KWKRKWDEWVPECRVLKFNESNVQ   29 (203)
Q Consensus         1 ~~vHy-GW~~~WDeWV~~~rilk~~~eN~~   29 (203)
                      |+|+| | ...+ -||+.++|..+++...+
T Consensus        40 ~~V~Ffg-~~~~-~wv~~~~l~pf~~~~~~   67 (87)
T cd05162          40 VLVLFFG-DKTF-AWVGAERLKPFTEHKES   67 (87)
T ss_pred             EEEEEeC-CCcE-EEeCccceeeccchHHh
Confidence            57999 9 3333 89999999999877654


No 15 
>cd05837 MSH6_like The PWWP domain is present in MSH6, a mismatch repair protein homologous to bacterial MutS.   The PWWP domain of histone-lysine N-methyltransferase, also known as Nuclear SET domain-containing protein 3, is also included. Mutations in MSH6 have been linked to increased cancer susceptibility, particularly in hereditary nonpolyposis colorectal cancer in humans.  The role of the PWWP domain in MSH6 is not clear; MSH6 orthologs found in S. cerevisiae, Caenorhabditis elegans and Arabidopsis thaliana lack the PWWP domain.   Histone methyltransferases (HMTases) induce the posttranslational methylation of lysine residues in histones and play a role in apoptosis.  In the HMTase Whistle, the PWWP domain is necessary for HMTase activity. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain pro
Probab=45.13  E-value=8.6  Score=29.10  Aligned_cols=26  Identities=15%  Similarity=0.274  Sum_probs=22.9

Q ss_pred             Ceeec-cCCCCcccccccccccccChhh
Q psy15819          1 MIGHY-KWKRKWDEWVPECRVLKFNESN   27 (203)
Q Consensus         1 ~~vHy-GW~~~WDeWV~~~rilk~~~eN   27 (203)
                      |+|.| |-+.+| -||+++.|..+++.+
T Consensus        45 ~~V~FFG~~~~~-aWv~~~~l~pf~~~~   71 (110)
T cd05837          45 YHVQFFGDNPER-AWISEKSLKPFKGSK   71 (110)
T ss_pred             EEEEEcCCCCCE-EEecHHHccccCCch
Confidence            68999 999778 599999999998777


No 16 
>PF15136 UPF0449:  Uncharacterised protein family UPF0449
Probab=44.47  E-value=18  Score=27.19  Aligned_cols=22  Identities=23%  Similarity=0.297  Sum_probs=16.5

Q ss_pred             ceeeCCCCCCHHHHHHHHHHhh
Q psy15819         86 KLHQIPAKTSVDEIFESYLKHK  107 (203)
Q Consensus        86 ~l~~LP~~~~V~~Il~~y~~~~  107 (203)
                      .|..=|+.|||++||+|-....
T Consensus         6 vLPtRP~PPTvEqILEDv~~A~   27 (97)
T PF15136_consen    6 VLPTRPEPPTVEQILEDVRGAP   27 (97)
T ss_pred             CCCCCCCCCCHHHHHHHHhcCC
Confidence            3445577899999999976653


No 17 
>PF13495 Phage_int_SAM_4:  Phage integrase, N-terminal SAM-like domain; PDB: 2A3V_A.
Probab=41.85  E-value=37  Score=23.21  Aligned_cols=44  Identities=25%  Similarity=0.543  Sum_probs=29.0

Q ss_pred             CCCCCHHHHHHHHHHhhhhccCCccchhhhHHHHHHHHHHHHhhhhcc
Q psy15819         91 PAKTSVDEIFESYLKHKANSKQMSGAKGKTVTEMVAGLKEYFNVMLGS  138 (203)
Q Consensus        91 P~~~~V~~Il~~y~~~~~~~~~~s~~~~~~~~e~~~Gl~~yFn~~L~~  138 (203)
                      |...|.++| ++|+.+....++.+   ...+.-.+.+|+.+|+.+++.
T Consensus        38 ~~~it~~~i-~~y~~~l~~~~~~s---~~T~~~~~~~l~~ff~~~~~~   81 (85)
T PF13495_consen   38 PDEITPEDI-EQYLNYLQNERGLS---PSTINQYLSALRSFFRWLLER   81 (85)
T ss_dssp             GGG--HHHH-HHHHHHHHTTT------HHHHHHHHHHHHHHHHCTSS-
T ss_pred             cchhHHHHH-HHHHHHHHHhcCCC---HHHHHHHHHHHHHHHHHHHcC
Confidence            567777777 68999887423332   245777889999999998764


No 18 
>COG5027 SAS2 Histone acetyltransferase (MYST family) [Chromatin structure and dynamics]
Probab=41.77  E-value=9  Score=35.18  Aligned_cols=22  Identities=27%  Similarity=0.572  Sum_probs=19.6

Q ss_pred             Ceeec-cCCCCcccccccccccc
Q psy15819          1 MIGHY-KWKRKWDEWVPECRVLK   22 (203)
Q Consensus         1 ~~vHy-GW~~~WDeWV~~~rilk   22 (203)
                      |+||| ..|.+=||||..+.|=+
T Consensus        37 fyvh~~~~nrrl~e~i~~~~i~~   59 (395)
T COG5027          37 FYVHYVELNRRLDEWITADLINL   59 (395)
T ss_pred             EEEeehhhhhhhhhheecccccc
Confidence            57999 99999999999998754


No 19 
>PF06543 Lac_bphage_repr:  Lactococcus bacteriophage repressor;  InterPro: IPR009498 This entry represents the C terminus of various Lactococcus bacteriophage repressor proteins.
Probab=40.87  E-value=8.5  Score=25.24  Aligned_cols=9  Identities=44%  Similarity=1.165  Sum_probs=7.6

Q ss_pred             Ccccccccc
Q psy15819         10 KWDEWVPEC   18 (203)
Q Consensus        10 ~WDeWV~~~   18 (203)
                      -||+||+.+
T Consensus        19 dWd~wvSf~   27 (49)
T PF06543_consen   19 DWDKWVSFD   27 (49)
T ss_pred             chHHheeeC
Confidence            499999976


No 20 
>PF08822 DUF1804:  Protein of unknown function (DUF1804);  InterPro: IPR014926 This entry is represented by Bacteriophage D3112, Orf24. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches.
Probab=40.08  E-value=1.2e+02  Score=24.86  Aligned_cols=108  Identities=13%  Similarity=0.234  Sum_probs=58.5

Q ss_pred             HHHHHHHhhcCCceeeCCCCCCHHHHHHHHHHhhhhc----cCCccchhhhHHHHHHHHHHHHhhhhcccCCChHHHHHH
Q psy15819         74 LVDDWDTINNKNKLHQIPAKTSVDEIFESYLKHKANS----KQMSGAKGKTVTEMVAGLKEYFNVMLGSQLLYQAERRQY  149 (203)
Q Consensus        74 LvdD~e~i~k~~~l~~LP~~~~V~~Il~~y~~~~~~~----~~~s~~~~~~~~e~~~Gl~~yFn~~L~~~LLY~~Er~Qy  149 (203)
                      -.|||+.+..-+.+..=--.-...++|.+|+......    +...+-....-.+++..|-+-|+++....          
T Consensus        46 ~GDDWDk~RaA~~laggg~e~v~~~~l~~f~~Q~~~tmeel~~~~~~~~~~k~~~LasLaDsf~K~vaas----------  115 (165)
T PF08822_consen   46 KGDDWDKARAAHTLAGGGIEDVARQMLEDFVVQYQATMEELKENEDMPPQEKVELLASLADSFSKMVAAS----------  115 (165)
T ss_pred             cCCcHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHHHhcccCCCHHHHHHHHHHHHHHHHHHHHHH----------
Confidence            3589998753222221112234556677777554421    11111122345678888999999877642          


Q ss_pred             HHHHHhcCCCCCccccChHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHhhh
Q psy15819        150 SQVMQKYQGAPLSSLYGASHLLRLFVRIGSVLAYTGLTERNIQLLQNAFQDF  201 (203)
Q Consensus       150 ~~~~~~~p~~~~s~~YG~~HLLRL~v~LP~ll~~t~~d~~s~~~l~~~~~df  201 (203)
                              ..-.-+.+...=-++.+-.|-+++..-  .++.+..+++.+..|
T Consensus       116 --------kr~lPets~LavA~~vl~~l~~fv~e~--~P~h~~af~eiLepF  157 (165)
T PF08822_consen  116 --------KRVLPETSELAVAMEVLELLAAFVQER--YPQHLAAFLEILEPF  157 (165)
T ss_pred             --------hhcCchHHHHHHHHHHHHHHHHHHHhc--CHHHHHHHHHHHHHH
Confidence                    122334555555566666666666543  455666666655554


No 21 
>PF11390 FdsD:  NADH-dependant formate dehydrogenase delta subunit FdsD;  InterPro: IPR021074  FdsD is the delta subunit of the enzyme formate dehydrogenase. This subunit may play a role in maintaining the quaternary structure by means of electrostatic interactions with the other subunits []. The delta subunit is not involved in the active centre of the enzyme []. 
Probab=39.38  E-value=45  Score=22.82  Aligned_cols=33  Identities=15%  Similarity=0.289  Sum_probs=27.3

Q ss_pred             HHHHHHHHHHHHHhcCCCCHHHHHHHHHHHhhh
Q psy15819        169 HLLRLFVRIGSVLAYTGLTERNIQLLQNAFQDF  201 (203)
Q Consensus       169 HLLRL~v~LP~ll~~t~~d~~s~~~l~~~~~df  201 (203)
                      ||.||--.+-..++..+-.++.+.-+-.|++.|
T Consensus         1 ~LI~MANQIa~ff~~~p~~~~a~~~va~Hi~kF   33 (61)
T PF11390_consen    1 KLIKMANQIAAFFESYPPEEEAVEGVANHIKKF   33 (61)
T ss_pred             CHHHHHHHHHHHHccCCCHHHHHHHHHHHHHHh
Confidence            688888888888888866778888888888877


No 22 
>TIGR03650 violacein_E violacein biosynthesis enzyme VioE. This enzyme catalyzes the third step in violacein biosynthesis from a pair of Trp residues, as in Chromobacterium violaceum, but the first step that distinguishes that pathway from staurosporine (an indolocarbazole antibiotic) biosynthesis.
Probab=26.82  E-value=8.4  Score=31.24  Aligned_cols=24  Identities=33%  Similarity=0.490  Sum_probs=19.8

Q ss_pred             cCCCCCcccc---ChHHHHHHHHHHHH
Q psy15819        156 YQGAPLSSLY---GASHLLRLFVRIGS  179 (203)
Q Consensus       156 ~p~~~~s~~Y---G~~HLLRL~v~LP~  179 (203)
                      .|++.||-.|   |.-|||||++-=|.
T Consensus       135 RpgK~ps~~Yl~aG~N~LlRMvTG~~~  161 (184)
T TIGR03650       135 RPGKGPSTLYLDAGSNTLLRMVTGDEA  161 (184)
T ss_pred             cCCCCCeEEEEecCCceEEEEecCCch
Confidence            3889999988   99999999875444


No 23 
>PF00855 PWWP:  PWWP domain;  InterPro: IPR000313 Upon characterisation of WHSC1, a gene mapping to the Wolf-Hirschhornsyndrome critical region and at its C terminus similar to the Drosophila melanogaster ASH1/trithorax group proteins, a novel protein domain designated PWWP domain was identified []. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. It is present in proteins of nuclear origin and plays a role in cell growth and differentiation. Due to its position, the composition of amino acids close to the PWWP motif and the pattern of other domains present it has been suggested that the domain is involved in protein-protein interactions [].; PDB: 3LYI_B 2L89_A 2NLU_A 1RI0_A 1KHC_A 3QKJ_C 2DAQ_A 1N27_A 3PFS_B 3QJ6_A ....
Probab=25.41  E-value=43  Score=23.22  Aligned_cols=26  Identities=12%  Similarity=0.363  Sum_probs=21.0

Q ss_pred             Ceeec-cCCCCcccccccccccccChhhHH
Q psy15819          1 MIGHY-KWKRKWDEWVPECRVLKFNESNVQ   29 (203)
Q Consensus         1 ~~vHy-GW~~~WDeWV~~~rilk~~~eN~~   29 (203)
                      |+|.| |=+ ++ -||+.+.|..++ ++.+
T Consensus        37 ~~V~Ffg~~-~~-~wv~~~~i~~f~-~~~~   63 (86)
T PF00855_consen   37 VLVRFFGDN-DY-AWVKPSNIKPFS-EFKE   63 (86)
T ss_dssp             EEEEETTTT-EE-EEEEGGGEEECC-HHHH
T ss_pred             EEEEecCCC-CE-EEECHHHhhChh-hhHH
Confidence            57889 977 45 799999999999 6654


No 24 
>KOG3248|consensus
Probab=25.06  E-value=1.2e+02  Score=27.96  Aligned_cols=70  Identities=21%  Similarity=0.365  Sum_probs=48.0

Q ss_pred             CCCCCHHHHHHHHHHhhhhccCCccchhhhHHHHHHHHHHHHhhhhcc--cCCChHHHHHHHHHHHh--------cCCCC
Q psy15819         91 PAKTSVDEIFESYLKHKANSKQMSGAKGKTVTEMVAGLKEYFNVMLGS--QLLYQAERRQYSQVMQK--------YQGAP  160 (203)
Q Consensus        91 P~~~~V~~Il~~y~~~~~~~~~~s~~~~~~~~e~~~Gl~~yFn~~L~~--~LLY~~Er~Qy~~~~~~--------~p~~~  160 (203)
                      +.++.|..=|+.|+=+....+.      ..+.|+..-=-.-.|+.||.  +=|=+.|...|.|+-++        ||+-.
T Consensus       187 ~KkphiKKPLNAFmlyMKEmRa------~vvaEctlKeSAaiNqiLGrRWH~LSrEEQAKYyElArKerqlH~qlYP~WS  260 (421)
T KOG3248|consen  187 AKKPHIKKPLNAFMLYMKEMRA------KVVAECTLKESAAINQILGRRWHALSREEQAKYYELARKERQLHMQLYPGWS  260 (421)
T ss_pred             ccCccccccHHHHHHHHHHHHH------HHHHHhhhhhHHHHHHHHhHHHhhhhHHHHHHHHHHHHHHHHHHHHhcCCcc
Confidence            5667888888888877654221      23444432222456777875  46889999999998654        58888


Q ss_pred             CccccC
Q psy15819        161 LSSLYG  166 (203)
Q Consensus       161 ~s~~YG  166 (203)
                      .-+.||
T Consensus       261 ARdNYg  266 (421)
T KOG3248|consen  261 ARDNYG  266 (421)
T ss_pred             hhhhhh
Confidence            888888


No 25 
>KOG2573|consensus
Probab=24.42  E-value=1.6e+02  Score=27.94  Aligned_cols=75  Identities=17%  Similarity=0.174  Sum_probs=46.6

Q ss_pred             ccc--hhhhHHHHHHHhhcCCceeeCCCCCCHHHHHHHHHHhhhhccCCccchhhhHHHHHHHHHHHHhhhhcccCCChH
Q psy15819         67 KIC--SRTWLVDDWDTINNKNKLHQIPAKTSVDEIFESYLKHKANSKQMSGAKGKTVTEMVAGLKEYFNVMLGSQLLYQA  144 (203)
Q Consensus        67 ~iP--Lk~~LvdD~e~i~k~~~l~~LP~~~~V~~Il~~y~~~~~~~~~~s~~~~~~~~e~~~Gl~~yFn~~L~~~LLY~~  144 (203)
                      .++  |+..|..-+-.+.+.+....+--..=-..|-+.+-. .   ...   ....+.|++.|++..|++.+...==|..
T Consensus        70 ~~~edLr~fLe~nlpK~kkkk~sLgi~d~kLg~~i~E~~~~-i---~c~---~~~~~~ellRGvR~hf~kl~K~L~~~d~  142 (498)
T KOG2573|consen   70 VVHEDLRSFLELNLPKVKKKKVSLGIGDSKLGISIKEAFPK-I---PCQ---SNEVVQELLRGVRKHFDKLMKGLDPGDL  142 (498)
T ss_pred             cccHHHHHHHHhhChhhhcCceeeccCcchhhhHHHhhccC-c---ccc---cchhHHHHHHHHHHHHHHHHccCCCccH
Confidence            666  999998877776443333222221111223333210 0   011   1256889999999999999998888999


Q ss_pred             HHHH
Q psy15819        145 ERRQ  148 (203)
Q Consensus       145 Er~Q  148 (203)
                      |+.|
T Consensus       143 ~kaq  146 (498)
T KOG2573|consen  143 EKAQ  146 (498)
T ss_pred             HHHH
Confidence            9998


No 26 
>PF13045 DUF3905:  Protein of unknown function (DUF3905)
Probab=20.91  E-value=48  Score=23.98  Aligned_cols=19  Identities=26%  Similarity=0.288  Sum_probs=13.6

Q ss_pred             cccccc-ccccccChhhHHH
Q psy15819         12 DEWVPE-CRVLKFNESNVQR   30 (203)
Q Consensus        12 DeWV~~-~rilk~~~eN~~~   30 (203)
                      ||||.. ..|==.|+||+++
T Consensus        64 deWvhPtnDIGw~t~ENrel   83 (84)
T PF13045_consen   64 DEWVHPTNDIGWNTAENREL   83 (84)
T ss_pred             cccccCCcCccccchhhhhc
Confidence            788854 4666678888875


No 27 
>KOG3035|consensus
Probab=20.73  E-value=29  Score=30.03  Aligned_cols=14  Identities=21%  Similarity=0.499  Sum_probs=9.7

Q ss_pred             ec-cCCCCccccccc
Q psy15819          4 HY-KWKRKWDEWVPE   17 (203)
Q Consensus         4 Hy-GW~~~WDeWV~~   17 (203)
                      -| |||+||---|-.
T Consensus        44 GysGynSRwALkvL~   58 (245)
T KOG3035|consen   44 GYSGYNSRWALKVLP   58 (245)
T ss_pred             ccccchhHHHHHHhh
Confidence            38 999988654433


No 28 
>PF08006 DUF1700:  Protein of unknown function (DUF1700);  InterPro: IPR012963 This family contains many hypothetical bacterial proteins and two putative membrane proteins (Q6GFD0 from SWISSPROT and Q6G806 from SWISSPROT).
Probab=20.49  E-value=49  Score=26.72  Aligned_cols=24  Identities=25%  Similarity=0.434  Sum_probs=19.1

Q ss_pred             CceeeCCCCCCHHHHHHHHHHhhhh
Q psy15819         85 NKLHQIPAKTSVDEIFESYLKHKAN  109 (203)
Q Consensus        85 ~~l~~LP~~~~V~~Il~~y~~~~~~  109 (203)
                      +.|-.||.. -.++|+++|.++...
T Consensus        12 ~~L~~lp~~-e~~e~l~~Y~e~f~d   35 (181)
T PF08006_consen   12 KYLKKLPEE-EREEILEYYEEYFDD   35 (181)
T ss_pred             HHHHcCCHH-HHHHHHHHHHHHHHH
Confidence            356667755 789999999999876


Done!