Query psy15819
Match_columns 203
No_of_seqs 156 out of 371
Neff 6.3
Searched_HMMs 46136
Date Fri Aug 16 22:08:37 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy15819.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/15819hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF05712 MRG: MRG; InterPro: 100.0 1.3E-44 2.7E-49 301.1 13.0 136 67-202 27-178 (194)
2 KOG3001|consensus 100.0 1.1E-34 2.4E-39 262.9 8.7 136 67-202 223-361 (391)
3 PF11717 Tudor-knot: RNA bindi 99.1 2.2E-11 4.7E-16 82.1 0.7 21 1-21 33-54 (55)
4 PLN00104 MYST -like histone ac 98.1 6.7E-07 1.5E-11 82.9 0.3 21 1-21 91-112 (450)
5 cd00024 CHROMO Chromatin organ 95.3 0.01 2.3E-07 38.6 1.8 23 1-23 22-45 (55)
6 smart00298 CHROMO Chromatin or 94.6 0.011 2.3E-07 38.4 0.2 22 1-22 20-42 (55)
7 PTZ00064 histone acetyltransfe 94.0 0.016 3.6E-07 54.7 0.2 22 1-22 151-173 (552)
8 smart00561 MBT Present in Dros 91.3 0.1 2.2E-06 39.0 1.3 17 2-18 59-76 (96)
9 PLN03239 histone acetyltransfe 89.2 0.15 3.3E-06 46.4 0.7 20 1-20 1-21 (351)
10 KOG2748|consensus 78.3 1.7 3.7E-05 39.6 2.5 22 1-22 28-50 (369)
11 PF02820 MBT: mbt repeat; Int 74.3 1.3 2.9E-05 30.9 0.6 20 2-21 28-50 (73)
12 PF00385 Chromo: Chromo (CHRro 69.2 2 4.3E-05 27.9 0.5 24 1-24 21-45 (55)
13 smart00340 HALZ homeobox assoc 65.3 4.6 9.9E-05 25.9 1.6 18 22-39 16-33 (44)
14 cd05162 PWWP The PWWP domain, 52.7 12 0.00027 26.5 2.3 27 1-29 40-67 (87)
15 cd05837 MSH6_like The PWWP dom 45.1 8.6 0.00019 29.1 0.5 26 1-27 45-71 (110)
16 PF15136 UPF0449: Uncharacteri 44.5 18 0.00039 27.2 2.1 22 86-107 6-27 (97)
17 PF13495 Phage_int_SAM_4: Phag 41.8 37 0.00081 23.2 3.3 44 91-138 38-81 (85)
18 COG5027 SAS2 Histone acetyltra 41.8 9 0.00019 35.2 0.1 22 1-22 37-59 (395)
19 PF06543 Lac_bphage_repr: Lact 40.9 8.5 0.00018 25.2 -0.1 9 10-18 19-27 (49)
20 PF08822 DUF1804: Protein of u 40.1 1.2E+02 0.0027 24.9 6.5 108 74-201 46-157 (165)
21 PF11390 FdsD: NADH-dependant 39.4 45 0.00099 22.8 3.3 33 169-201 1-33 (61)
22 TIGR03650 violacein_E violacei 26.8 8.4 0.00018 31.2 -2.3 24 156-179 135-161 (184)
23 PF00855 PWWP: PWWP domain; I 25.4 43 0.00093 23.2 1.4 26 1-29 37-63 (86)
24 KOG3248|consensus 25.1 1.2E+02 0.0026 28.0 4.4 70 91-166 187-266 (421)
25 KOG2573|consensus 24.4 1.6E+02 0.0034 27.9 5.1 75 67-148 70-146 (498)
26 PF13045 DUF3905: Protein of u 20.9 48 0.001 24.0 0.9 19 12-30 64-83 (84)
27 KOG3035|consensus 20.7 29 0.00063 30.0 -0.3 14 4-17 44-58 (245)
28 PF08006 DUF1700: Protein of u 20.5 49 0.0011 26.7 1.0 24 85-109 12-35 (181)
No 1
>PF05712 MRG: MRG; InterPro: IPR008676 This family consists of three different eukaryotic proteins (mortality factor 4 (MORF4/MRG15), male-specific lethal 3(MSL-3) and ESA1-associated factor 3(EAF3)). It is thought that the MRG family is involved in transcriptional regulation via histone acetylation [, ].; GO: 0005634 nucleus; PDB: 2AQL_A 2F5J_A 2LKM_B 2Y0N_D.
Probab=100.00 E-value=1.3e-44 Score=301.08 Aligned_cols=136 Identities=43% Similarity=0.732 Sum_probs=115.7
Q ss_pred ccc--hhhhHHHHHHHhhcCCceeeCCCCCCHHHHHHHHHHhhhhccCCccch----hhhHHHHHHHHHHHHhhhhcccC
Q psy15819 67 KIC--SRTWLVDDWDTINNKNKLHQIPAKTSVDEIFESYLKHKANSKQMSGAK----GKTVTEMVAGLKEYFNVMLGSQL 140 (203)
Q Consensus 67 ~iP--Lk~~LvdD~e~i~k~~~l~~LP~~~~V~~Il~~y~~~~~~~~~~s~~~----~~~~~e~~~Gl~~yFn~~L~~~L 140 (203)
+|| ||.+|||||++|+++++|++|||++||++||++|.++........... ...++|+++||++|||++||++|
T Consensus 27 ~lP~~Lk~~LvdD~~~I~~~~~l~~LP~~~~V~~IL~~y~~~~~~~~~~~~~~~~~~~~~~~e~~~Gl~~yFn~~L~~~L 106 (194)
T PF05712_consen 27 ELPEELKKILVDDWELITKEKKLVKLPAKPSVDDILEDYVESFADSDDSEEESAEQERDLLKEVADGLRDYFNKALGSQL 106 (194)
T ss_dssp ---HHHHHHHHHHHHHHHTS-EEE-SS-SSBHHHHHHHHHHHHHHCHCSS---THH--HHHHHHHHHHHHHHHHHCCCCT
T ss_pred ECCHHHHHHHHHHHHHHHcCCceeeCCCCCCHHHHHHHHHHHHhhcccCcchhHHHHHHHHHHHHHHHHHHHHHHhcccc
Confidence 899 999999999999999999999999999999999999997532222211 15699999999999999999999
Q ss_pred CChHHHHHHHHHHHhc----------CCCCCccccChHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHhhhc
Q psy15819 141 LYQAERRQYSQVMQKY----------QGAPLSSLYGASHLLRLFVRIGSVLAYTGLTERNIQLLQNAFQDFL 202 (203)
Q Consensus 141 LY~~Er~Qy~~~~~~~----------p~~~~s~~YG~~HLLRL~v~LP~ll~~t~~d~~s~~~l~~~~~dfl 202 (203)
||++||+||.++++.+ ++++||++||++|||||||+||+||+.++||+.+++.|+.++++||
T Consensus 107 LY~~Er~Qy~~~~~~~~~~~~~~~~~~~~~ps~~YG~~HLLRL~vkLPell~~~~~~~~~~~~l~~~l~~fl 178 (194)
T PF05712_consen 107 LYKFERPQYDELLKKHATRDDSPPDEPGFRPSDIYGAIHLLRLFVKLPELLSSTNMDEESINILLEHLQDFL 178 (194)
T ss_dssp S-GGGHHHHHHHHHHS---------STTS-HHHC-BHHHHHHHHHHHHHHHCCCGGCHHHHHHHHHHHHHHH
T ss_pred CcHHHHHHHHHHHHhcccchhccccCCCCCHHhhccHHHHHHHHHHHHHHHhhCCCCHHHHHHHHHHHHHHH
Confidence 9999999999999887 7899999999999999999999999999999999999999999997
No 2
>KOG3001|consensus
Probab=100.00 E-value=1.1e-34 Score=262.88 Aligned_cols=136 Identities=40% Similarity=0.715 Sum_probs=121.6
Q ss_pred ccc--hhhhHHHHHHHhhcCCceeeCCCCCCHHHHHHHHHHhhhhccCCccchhhhHHHHHHHHHHHHhhhhcccCCChH
Q psy15819 67 KIC--SRTWLVDDWDTINNKNKLHQIPAKTSVDEIFESYLKHKANSKQMSGAKGKTVTEMVAGLKEYFNVMLGSQLLYQA 144 (203)
Q Consensus 67 ~iP--Lk~~LvdD~e~i~k~~~l~~LP~~~~V~~Il~~y~~~~~~~~~~s~~~~~~~~e~~~Gl~~yFn~~L~~~LLY~~ 144 (203)
.|| ++..|+|||+.+++..+++.+|+..+|+.|...|........+.+......+.+...|++.|||.+||.+|||++
T Consensus 223 ~l~~~~~~~l~dd~~~vt~~~~~~~~~~~~~~e~~~~~~~~~~k~~~~~~~~~~~~~~~~~~g~~~yfn~~lG~~llyk~ 302 (391)
T KOG3001|consen 223 SLPQELKRSLVDDWDSVTEVDSLAELPQDVTVEQILKKYGFSEKKASGLSNSKEPEVLEVAAGLKRYFNGQLGVMLLYKF 302 (391)
T ss_pred cCchhhcccccchhhhhhhhhhhhcccCCchhhhhhhhhhHhhhhccccccccccccccccccceeeecccchhhhhhhh
Confidence 777 999999999999999999999999999999999988776433332223334445559999999999999999999
Q ss_pred HHHHHHHHHHhcC-CCCCccccChHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHhhhc
Q psy15819 145 ERRQYSQVMQKYQ-GAPLSSLYGASHLLRLFVRIGSVLAYTGLTERNIQLLQNAFQDFL 202 (203)
Q Consensus 145 Er~Qy~~~~~~~p-~~~~s~~YG~~HLLRL~v~LP~ll~~t~~d~~s~~~l~~~~~dfl 202 (203)
||.||.++++.+| +.+||++||++|||||||+||+||.++.|++++++.|+.++++|+
T Consensus 303 Er~qy~~~~~~~~~Ds~~s~vyGa~HLlRLfvKLpe~l~~~~~~~~~l~~Ll~~~~~fl 361 (391)
T KOG3001|consen 303 ERLQYAEVVAKYPKDSPPSNVYGAEHLLRLFVKLPEILKYTPMDEKSLALLLRHRKDFL 361 (391)
T ss_pred hhHHHHHHHhcCCCCCCcccchhHHHHHHHHhHHHHHHHhccccHHHHHHHHHHHHHHH
Confidence 9999999999998 888999999999999999999999999999999999999999997
No 3
>PF11717 Tudor-knot: RNA binding activity-knot of a chromodomain ; PDB: 2EKO_A 2RO0_A 2RNZ_A 1WGS_A 3E9G_A 3E9F_A 2K3X_A 2K3Y_A 2EFI_A 2F5K_F ....
Probab=99.09 E-value=2.2e-11 Score=82.13 Aligned_cols=21 Identities=48% Similarity=1.126 Sum_probs=18.6
Q ss_pred Ceeec-cCCCCccccccccccc
Q psy15819 1 MIGHY-KWKRKWDEWVPECRVL 21 (203)
Q Consensus 1 ~~vHy-GW~~~WDeWV~~~ril 21 (203)
|+||| |||++|||||++++|.
T Consensus 33 YyVHY~g~nkR~DeWV~~~~i~ 54 (55)
T PF11717_consen 33 YYVHYQGWNKRLDEWVPESRIR 54 (55)
T ss_dssp EEEEETTSTGCC-EEEETTTEE
T ss_pred EEEEcCCCCCCceeeecHHHcc
Confidence 89999 9999999999999873
No 4
>PLN00104 MYST -like histone acetyltransferase; Provisional
Probab=98.10 E-value=6.7e-07 Score=82.91 Aligned_cols=21 Identities=33% Similarity=0.646 Sum_probs=20.0
Q ss_pred Ceeec-cCCCCccccccccccc
Q psy15819 1 MIGHY-KWKRKWDEWVPECRVL 21 (203)
Q Consensus 1 ~~vHy-GW~~~WDeWV~~~ril 21 (203)
|+||| |||++|||||+.+||-
T Consensus 91 YYVHY~g~nrRlDEWV~~~rLd 112 (450)
T PLN00104 91 YYVHYTEFNRRLDEWVKLEQLD 112 (450)
T ss_pred EEEEEecCCccHhhccCHhhcc
Confidence 89999 9999999999999994
No 5
>cd00024 CHROMO Chromatin organization modifier (chromo) domain is a conserved region of around 50 amino acids found in a variety of chromosomal proteins, which appear to play a role in the functional organization of the eukaryotic nucleus. Experimental evidence implicates the chromo domain in the binding activity of these proteins to methylated histone tails and maybe RNA. May occur as single instance, in a tandem arrangement or followd by a related "chromo shadow" domain.
Probab=95.34 E-value=0.01 Score=38.56 Aligned_cols=23 Identities=17% Similarity=0.367 Sum_probs=20.9
Q ss_pred Ceeec-cCCCCccccccccccccc
Q psy15819 1 MIGHY-KWKRKWDEWVPECRVLKF 23 (203)
Q Consensus 1 ~~vHy-GW~~~WDeWV~~~rilk~ 23 (203)
|+||+ ||+...|.|++++.|...
T Consensus 22 y~VkW~g~~~~~~tWe~~~~l~~~ 45 (55)
T cd00024 22 YLVKWKGYSYSEDTWEPEENLEDC 45 (55)
T ss_pred EEEEECCCCCccCccccHHHhCch
Confidence 79999 999999999999987754
No 6
>smart00298 CHROMO Chromatin organization modifier domain.
Probab=94.58 E-value=0.011 Score=38.36 Aligned_cols=22 Identities=23% Similarity=0.502 Sum_probs=19.6
Q ss_pred Ceeec-cCCCCcccccccccccc
Q psy15819 1 MIGHY-KWKRKWDEWVPECRVLK 22 (203)
Q Consensus 1 ~~vHy-GW~~~WDeWV~~~rilk 22 (203)
|+||| ||+.+.+.|++.+.+..
T Consensus 20 ylVkW~g~~~~~~tW~~~~~l~~ 42 (55)
T smart00298 20 YLVKWKGYSYSEDTWEPEENLLN 42 (55)
T ss_pred EEEEECCCCCccCceeeHHHHHH
Confidence 79999 99999999999986654
No 7
>PTZ00064 histone acetyltransferase; Provisional
Probab=93.99 E-value=0.016 Score=54.72 Aligned_cols=22 Identities=23% Similarity=0.478 Sum_probs=20.2
Q ss_pred Ceeec-cCCCCcccccccccccc
Q psy15819 1 MIGHY-KWKRKWDEWVPECRVLK 22 (203)
Q Consensus 1 ~~vHy-GW~~~WDeWV~~~rilk 22 (203)
|+||| |.|.+=||||+-+||--
T Consensus 151 yYVHy~g~nrRlD~WV~~~ri~~ 173 (552)
T PTZ00064 151 FYVHFRGLNRRLDRWVKGKDIKL 173 (552)
T ss_pred EEEEecCcCchHhhhcChhhccc
Confidence 89999 99999999999998853
No 8
>smart00561 MBT Present in Drosophila Scm, l(3)mbt, and vertebrate SCML2. Present in Drosophila Scm, l(3)mbt, and vertebrate SCML2. These proteins are involved in transcriptional regulation.
Probab=91.33 E-value=0.1 Score=38.95 Aligned_cols=17 Identities=29% Similarity=0.808 Sum_probs=15.8
Q ss_pred eeec-cCCCCcccccccc
Q psy15819 2 IGHY-KWKRKWDEWVPEC 18 (203)
Q Consensus 2 ~vHy-GW~~~WDeWV~~~ 18 (203)
+||| ||..+.|.|+..+
T Consensus 59 ~v~~dg~~~~~D~W~~~~ 76 (96)
T smart00561 59 LLHFDGWDDKYDFWCDAD 76 (96)
T ss_pred EEEEccCCCcCCEEEECC
Confidence 6999 9999999999876
No 9
>PLN03239 histone acetyltransferase; Provisional
Probab=89.17 E-value=0.15 Score=46.37 Aligned_cols=20 Identities=30% Similarity=0.777 Sum_probs=18.9
Q ss_pred Ceeec-cCCCCcccccccccc
Q psy15819 1 MIGHY-KWKRKWDEWVPECRV 20 (203)
Q Consensus 1 ~~vHy-GW~~~WDeWV~~~ri 20 (203)
|+||| |-|+|-|+||+.+.+
T Consensus 1 yYVh~~~~nkRlD~Wv~~~~l 21 (351)
T PLN03239 1 YYVHYKDFNRRMDEWISKDKS 21 (351)
T ss_pred CeEEeccccchHhhhcChhhc
Confidence 89999 999999999999876
No 10
>KOG2748|consensus
Probab=78.27 E-value=1.7 Score=39.59 Aligned_cols=22 Identities=27% Similarity=0.856 Sum_probs=20.3
Q ss_pred Ceeec-cCCCCcccccccccccc
Q psy15819 1 MIGHY-KWKRKWDEWVPECRVLK 22 (203)
Q Consensus 1 ~~vHy-GW~~~WDeWV~~~rilk 22 (203)
|+|-+ ||+.++.-|=|++-||.
T Consensus 28 YlVKWkGWs~kyNTWEPEENILD 50 (369)
T KOG2748|consen 28 YLVKWKGWSQKYNTWEPEENILD 50 (369)
T ss_pred EEEEecccccccCccCccccccC
Confidence 89999 99999999999988874
No 11
>PF02820 MBT: mbt repeat; InterPro: IPR004092 The function of the malignant brain tumor (MBT) repeat is unknown, but is found in a number of nuclear proteins involved in transcriptional repression. The repeat contains a completely conserved glutamate at its amino terminus that may be important for function. The crystal structure of the two MBT repeats of human SCM-like 2 protein has been reported. Each repeat consists of an extended "arm" and a globular core. The arm of the first repeat packs against the core of the second repeat and vice versa. The structure of the core-interacting part of each arm consists of an N-terminal alpha-helix and a turn of 310 helix connected by a short beta-strand. The core consists of an Src homology 3-like five-stranded beta-barrel followed by a C-terminal alpha-helix and another short beta-strand. Each arm interacts with its partner core in a similar way, with the orientation of the N-terminal helix relative to the barrel varying slightly. There are also extensive interactions between the two barrels [].; GO: 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus; PDB: 2P0K_A 3F70_B 3CEY_A 2VYT_A 2BIV_A 1OI1_A 3OQ5_A 2RJE_B 2RJD_A 2RI3_A ....
Probab=74.35 E-value=1.3 Score=30.89 Aligned_cols=20 Identities=25% Similarity=0.727 Sum_probs=17.0
Q ss_pred eeec-cCCCCcccccccc--ccc
Q psy15819 2 IGHY-KWKRKWDEWVPEC--RVL 21 (203)
Q Consensus 2 ~vHy-GW~~~WDeWV~~~--ril 21 (203)
+||| ||.+..|.|+..+ +|.
T Consensus 28 ~v~~dg~~~~~d~w~~~~S~~i~ 50 (73)
T PF02820_consen 28 LVRYDGWDDDYDFWCHIDSPRIF 50 (73)
T ss_dssp EEEETTSTGGGEEEEETTSTTEE
T ss_pred EEEEcCCCCCccEEEECCCCCee
Confidence 6999 9999999999865 554
No 12
>PF00385 Chromo: Chromo (CHRromatin Organisation MOdifier) domain; InterPro: IPR023780 The CHROMO (CHRromatin Organization MOdifier) domain [, , , ] is a conserved region of around 60 amino acids, originally identified in Drosophila modifiers of variegation. These are proteins that alter the structure of chromatin to the condensed morphology of heterochromatin, a cytologically visible condition where gene expression is repressed. In one of these proteins, Polycomb, the chromo domain has been shown to be important for chromatin targeting. Proteins that contain a chromo domain appear to fall into 3 classes. The first class includes proteins having an N-terminal chromo domain followed by a region termed the chromo shadow domain, with weak but significant sequence similarity to the N-terminal chromo domain,[], eg. Drosophila and human heterochromatin protein Su(var)205 (HP1). The second class includes proteins with a single chromo domain, eg. Drosophila protein Polycomb (Pc); mammalian modifier 3; human Mi-2 autoantigen and several yeast and Caenorhabditis elegans hypothetical proteins. In the third class paired tandem chromo domains are found, eg. in mammalian DNA-binding/helicase proteins CHD-1 to CHD-4 and yeast protein CHD1. Functional dissections of chromo domain proteins suggests a mechanistic role for chromo domains in targeting chromo domain proteins to specific regions of the nucleus. The mechanism of targeting may involve protein-protein and/or protein/nucleic acid interactions. Hence, several line of evidence show that the HP1 chromo domain is a methyl-specific histone binding module, whereas the chromo domain of two protein components of the drosophila dosage compensation complex, MSL3 and MOF, contain chromo domains that bind to RNA in vitro []. The high resolution structures of HP1-family protein chromo and chromo shadow domain reveal a conserved chromo domain fold motif consisting of three beta strands packed against an alpha helix. The chromo domain fold belongs to the OB (oligonucleotide/oligosaccharide binding)-fold class found in a variety of prokaryotic and eukaryotic nucleic acid binding protein [].; PDB: 2H1E_B 3MWY_W 2DY8_A 1KNE_A 1KNA_A 1Q3L_A 2EE1_A 1AP0_A 1GUW_A 1X3P_A ....
Probab=69.22 E-value=2 Score=27.90 Aligned_cols=24 Identities=13% Similarity=0.278 Sum_probs=21.0
Q ss_pred Ceeec-cCCCCcccccccccccccC
Q psy15819 1 MIGHY-KWKRKWDEWVPECRVLKFN 24 (203)
Q Consensus 1 ~~vHy-GW~~~WDeWV~~~rilk~~ 24 (203)
|+|++ |+...-+.|.|++.|....
T Consensus 21 ylVkW~g~~~~~~tWe~~~~l~~~~ 45 (55)
T PF00385_consen 21 YLVKWKGYPYSENTWEPEENLKNCF 45 (55)
T ss_dssp EEEEETTSSGGGEEEEEGGGCSSHC
T ss_pred EEEEECCCCCCCCeEeeHHHHhHhh
Confidence 79999 9999999999999777654
No 13
>smart00340 HALZ homeobox associated leucin zipper.
Probab=65.29 E-value=4.6 Score=25.85 Aligned_cols=18 Identities=22% Similarity=0.368 Sum_probs=14.6
Q ss_pred ccChhhHHHHHHHHHHhH
Q psy15819 22 KFNESNVQRQKDLKKSQQ 39 (203)
Q Consensus 22 k~~~eN~~~q~~l~~~~~ 39 (203)
..++||+.+|+++++-..
T Consensus 16 ~LteeNrRL~ke~~eLra 33 (44)
T smart00340 16 SLTEENRRLQKEVQELRA 33 (44)
T ss_pred HHHHHHHHHHHHHHHHHh
Confidence 368999999999977654
No 14
>cd05162 PWWP The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes. The function of the PWWP domain is still not known precisely; however, based on the fact that other regions of PWWP-domain proteins are responsible for nuclear localization and DNA-binding, is likely that the PWWP domain acts as a site for protein-protein binding interactions, influencing chromatin remodeling and thereby regulating transcriptional processes. Some PWWP-domain proteins have been linked to cancer or other diseases; some are known to function as growth factors.
Probab=52.71 E-value=12 Score=26.51 Aligned_cols=27 Identities=19% Similarity=0.457 Sum_probs=20.9
Q ss_pred Ceeec-cCCCCcccccccccccccChhhHH
Q psy15819 1 MIGHY-KWKRKWDEWVPECRVLKFNESNVQ 29 (203)
Q Consensus 1 ~~vHy-GW~~~WDeWV~~~rilk~~~eN~~ 29 (203)
|+|+| | ...+ -||+.++|..+++...+
T Consensus 40 ~~V~Ffg-~~~~-~wv~~~~l~pf~~~~~~ 67 (87)
T cd05162 40 VLVLFFG-DKTF-AWVGAERLKPFTEHKES 67 (87)
T ss_pred EEEEEeC-CCcE-EEeCccceeeccchHHh
Confidence 57999 9 3333 89999999999877654
No 15
>cd05837 MSH6_like The PWWP domain is present in MSH6, a mismatch repair protein homologous to bacterial MutS. The PWWP domain of histone-lysine N-methyltransferase, also known as Nuclear SET domain-containing protein 3, is also included. Mutations in MSH6 have been linked to increased cancer susceptibility, particularly in hereditary nonpolyposis colorectal cancer in humans. The role of the PWWP domain in MSH6 is not clear; MSH6 orthologs found in S. cerevisiae, Caenorhabditis elegans and Arabidopsis thaliana lack the PWWP domain. Histone methyltransferases (HMTases) induce the posttranslational methylation of lysine residues in histones and play a role in apoptosis. In the HMTase Whistle, the PWWP domain is necessary for HMTase activity. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain pro
Probab=45.13 E-value=8.6 Score=29.10 Aligned_cols=26 Identities=15% Similarity=0.274 Sum_probs=22.9
Q ss_pred Ceeec-cCCCCcccccccccccccChhh
Q psy15819 1 MIGHY-KWKRKWDEWVPECRVLKFNESN 27 (203)
Q Consensus 1 ~~vHy-GW~~~WDeWV~~~rilk~~~eN 27 (203)
|+|.| |-+.+| -||+++.|..+++.+
T Consensus 45 ~~V~FFG~~~~~-aWv~~~~l~pf~~~~ 71 (110)
T cd05837 45 YHVQFFGDNPER-AWISEKSLKPFKGSK 71 (110)
T ss_pred EEEEEcCCCCCE-EEecHHHccccCCch
Confidence 68999 999778 599999999998777
No 16
>PF15136 UPF0449: Uncharacterised protein family UPF0449
Probab=44.47 E-value=18 Score=27.19 Aligned_cols=22 Identities=23% Similarity=0.297 Sum_probs=16.5
Q ss_pred ceeeCCCCCCHHHHHHHHHHhh
Q psy15819 86 KLHQIPAKTSVDEIFESYLKHK 107 (203)
Q Consensus 86 ~l~~LP~~~~V~~Il~~y~~~~ 107 (203)
.|..=|+.|||++||+|-....
T Consensus 6 vLPtRP~PPTvEqILEDv~~A~ 27 (97)
T PF15136_consen 6 VLPTRPEPPTVEQILEDVRGAP 27 (97)
T ss_pred CCCCCCCCCCHHHHHHHHhcCC
Confidence 3445577899999999976653
No 17
>PF13495 Phage_int_SAM_4: Phage integrase, N-terminal SAM-like domain; PDB: 2A3V_A.
Probab=41.85 E-value=37 Score=23.21 Aligned_cols=44 Identities=25% Similarity=0.543 Sum_probs=29.0
Q ss_pred CCCCCHHHHHHHHHHhhhhccCCccchhhhHHHHHHHHHHHHhhhhcc
Q psy15819 91 PAKTSVDEIFESYLKHKANSKQMSGAKGKTVTEMVAGLKEYFNVMLGS 138 (203)
Q Consensus 91 P~~~~V~~Il~~y~~~~~~~~~~s~~~~~~~~e~~~Gl~~yFn~~L~~ 138 (203)
|...|.++| ++|+.+....++.+ ...+.-.+.+|+.+|+.+++.
T Consensus 38 ~~~it~~~i-~~y~~~l~~~~~~s---~~T~~~~~~~l~~ff~~~~~~ 81 (85)
T PF13495_consen 38 PDEITPEDI-EQYLNYLQNERGLS---PSTINQYLSALRSFFRWLLER 81 (85)
T ss_dssp GGG--HHHH-HHHHHHHHTTT------HHHHHHHHHHHHHHHHCTSS-
T ss_pred cchhHHHHH-HHHHHHHHHhcCCC---HHHHHHHHHHHHHHHHHHHcC
Confidence 567777777 68999887423332 245777889999999998764
No 18
>COG5027 SAS2 Histone acetyltransferase (MYST family) [Chromatin structure and dynamics]
Probab=41.77 E-value=9 Score=35.18 Aligned_cols=22 Identities=27% Similarity=0.572 Sum_probs=19.6
Q ss_pred Ceeec-cCCCCcccccccccccc
Q psy15819 1 MIGHY-KWKRKWDEWVPECRVLK 22 (203)
Q Consensus 1 ~~vHy-GW~~~WDeWV~~~rilk 22 (203)
|+||| ..|.+=||||..+.|=+
T Consensus 37 fyvh~~~~nrrl~e~i~~~~i~~ 59 (395)
T COG5027 37 FYVHYVELNRRLDEWITADLINL 59 (395)
T ss_pred EEEeehhhhhhhhhheecccccc
Confidence 57999 99999999999998754
No 19
>PF06543 Lac_bphage_repr: Lactococcus bacteriophage repressor; InterPro: IPR009498 This entry represents the C terminus of various Lactococcus bacteriophage repressor proteins.
Probab=40.87 E-value=8.5 Score=25.24 Aligned_cols=9 Identities=44% Similarity=1.165 Sum_probs=7.6
Q ss_pred Ccccccccc
Q psy15819 10 KWDEWVPEC 18 (203)
Q Consensus 10 ~WDeWV~~~ 18 (203)
-||+||+.+
T Consensus 19 dWd~wvSf~ 27 (49)
T PF06543_consen 19 DWDKWVSFD 27 (49)
T ss_pred chHHheeeC
Confidence 499999976
No 20
>PF08822 DUF1804: Protein of unknown function (DUF1804); InterPro: IPR014926 This entry is represented by Bacteriophage D3112, Orf24. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches.
Probab=40.08 E-value=1.2e+02 Score=24.86 Aligned_cols=108 Identities=13% Similarity=0.234 Sum_probs=58.5
Q ss_pred HHHHHHHhhcCCceeeCCCCCCHHHHHHHHHHhhhhc----cCCccchhhhHHHHHHHHHHHHhhhhcccCCChHHHHHH
Q psy15819 74 LVDDWDTINNKNKLHQIPAKTSVDEIFESYLKHKANS----KQMSGAKGKTVTEMVAGLKEYFNVMLGSQLLYQAERRQY 149 (203)
Q Consensus 74 LvdD~e~i~k~~~l~~LP~~~~V~~Il~~y~~~~~~~----~~~s~~~~~~~~e~~~Gl~~yFn~~L~~~LLY~~Er~Qy 149 (203)
-.|||+.+..-+.+..=--.-...++|.+|+...... +...+-....-.+++..|-+-|+++....
T Consensus 46 ~GDDWDk~RaA~~laggg~e~v~~~~l~~f~~Q~~~tmeel~~~~~~~~~~k~~~LasLaDsf~K~vaas---------- 115 (165)
T PF08822_consen 46 KGDDWDKARAAHTLAGGGIEDVARQMLEDFVVQYQATMEELKENEDMPPQEKVELLASLADSFSKMVAAS---------- 115 (165)
T ss_pred cCCcHHHHHHHHHhccCcHHHHHHHHHHHHHHHHHHHHHHHhcccCCCHHHHHHHHHHHHHHHHHHHHHH----------
Confidence 3589998753222221112234556677777554421 11111122345678888999999877642
Q ss_pred HHHHHhcCCCCCccccChHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHhhh
Q psy15819 150 SQVMQKYQGAPLSSLYGASHLLRLFVRIGSVLAYTGLTERNIQLLQNAFQDF 201 (203)
Q Consensus 150 ~~~~~~~p~~~~s~~YG~~HLLRL~v~LP~ll~~t~~d~~s~~~l~~~~~df 201 (203)
..-.-+.+...=-++.+-.|-+++..- .++.+..+++.+..|
T Consensus 116 --------kr~lPets~LavA~~vl~~l~~fv~e~--~P~h~~af~eiLepF 157 (165)
T PF08822_consen 116 --------KRVLPETSELAVAMEVLELLAAFVQER--YPQHLAAFLEILEPF 157 (165)
T ss_pred --------hhcCchHHHHHHHHHHHHHHHHHHHhc--CHHHHHHHHHHHHHH
Confidence 122334555555566666666666543 455666666655554
No 21
>PF11390 FdsD: NADH-dependant formate dehydrogenase delta subunit FdsD; InterPro: IPR021074 FdsD is the delta subunit of the enzyme formate dehydrogenase. This subunit may play a role in maintaining the quaternary structure by means of electrostatic interactions with the other subunits []. The delta subunit is not involved in the active centre of the enzyme [].
Probab=39.38 E-value=45 Score=22.82 Aligned_cols=33 Identities=15% Similarity=0.289 Sum_probs=27.3
Q ss_pred HHHHHHHHHHHHHhcCCCCHHHHHHHHHHHhhh
Q psy15819 169 HLLRLFVRIGSVLAYTGLTERNIQLLQNAFQDF 201 (203)
Q Consensus 169 HLLRL~v~LP~ll~~t~~d~~s~~~l~~~~~df 201 (203)
||.||--.+-..++..+-.++.+.-+-.|++.|
T Consensus 1 ~LI~MANQIa~ff~~~p~~~~a~~~va~Hi~kF 33 (61)
T PF11390_consen 1 KLIKMANQIAAFFESYPPEEEAVEGVANHIKKF 33 (61)
T ss_pred CHHHHHHHHHHHHccCCCHHHHHHHHHHHHHHh
Confidence 688888888888888866778888888888877
No 22
>TIGR03650 violacein_E violacein biosynthesis enzyme VioE. This enzyme catalyzes the third step in violacein biosynthesis from a pair of Trp residues, as in Chromobacterium violaceum, but the first step that distinguishes that pathway from staurosporine (an indolocarbazole antibiotic) biosynthesis.
Probab=26.82 E-value=8.4 Score=31.24 Aligned_cols=24 Identities=33% Similarity=0.490 Sum_probs=19.8
Q ss_pred cCCCCCcccc---ChHHHHHHHHHHHH
Q psy15819 156 YQGAPLSSLY---GASHLLRLFVRIGS 179 (203)
Q Consensus 156 ~p~~~~s~~Y---G~~HLLRL~v~LP~ 179 (203)
.|++.||-.| |.-|||||++-=|.
T Consensus 135 RpgK~ps~~Yl~aG~N~LlRMvTG~~~ 161 (184)
T TIGR03650 135 RPGKGPSTLYLDAGSNTLLRMVTGDEA 161 (184)
T ss_pred cCCCCCeEEEEecCCceEEEEecCCch
Confidence 3889999988 99999999875444
No 23
>PF00855 PWWP: PWWP domain; InterPro: IPR000313 Upon characterisation of WHSC1, a gene mapping to the Wolf-Hirschhornsyndrome critical region and at its C terminus similar to the Drosophila melanogaster ASH1/trithorax group proteins, a novel protein domain designated PWWP domain was identified []. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. It is present in proteins of nuclear origin and plays a role in cell growth and differentiation. Due to its position, the composition of amino acids close to the PWWP motif and the pattern of other domains present it has been suggested that the domain is involved in protein-protein interactions [].; PDB: 3LYI_B 2L89_A 2NLU_A 1RI0_A 1KHC_A 3QKJ_C 2DAQ_A 1N27_A 3PFS_B 3QJ6_A ....
Probab=25.41 E-value=43 Score=23.22 Aligned_cols=26 Identities=12% Similarity=0.363 Sum_probs=21.0
Q ss_pred Ceeec-cCCCCcccccccccccccChhhHH
Q psy15819 1 MIGHY-KWKRKWDEWVPECRVLKFNESNVQ 29 (203)
Q Consensus 1 ~~vHy-GW~~~WDeWV~~~rilk~~~eN~~ 29 (203)
|+|.| |=+ ++ -||+.+.|..++ ++.+
T Consensus 37 ~~V~Ffg~~-~~-~wv~~~~i~~f~-~~~~ 63 (86)
T PF00855_consen 37 VLVRFFGDN-DY-AWVKPSNIKPFS-EFKE 63 (86)
T ss_dssp EEEEETTTT-EE-EEEEGGGEEECC-HHHH
T ss_pred EEEEecCCC-CE-EEECHHHhhChh-hhHH
Confidence 57889 977 45 799999999999 6654
No 24
>KOG3248|consensus
Probab=25.06 E-value=1.2e+02 Score=27.96 Aligned_cols=70 Identities=21% Similarity=0.365 Sum_probs=48.0
Q ss_pred CCCCCHHHHHHHHHHhhhhccCCccchhhhHHHHHHHHHHHHhhhhcc--cCCChHHHHHHHHHHHh--------cCCCC
Q psy15819 91 PAKTSVDEIFESYLKHKANSKQMSGAKGKTVTEMVAGLKEYFNVMLGS--QLLYQAERRQYSQVMQK--------YQGAP 160 (203)
Q Consensus 91 P~~~~V~~Il~~y~~~~~~~~~~s~~~~~~~~e~~~Gl~~yFn~~L~~--~LLY~~Er~Qy~~~~~~--------~p~~~ 160 (203)
+.++.|..=|+.|+=+....+. ..+.|+..-=-.-.|+.||. +=|=+.|...|.|+-++ ||+-.
T Consensus 187 ~KkphiKKPLNAFmlyMKEmRa------~vvaEctlKeSAaiNqiLGrRWH~LSrEEQAKYyElArKerqlH~qlYP~WS 260 (421)
T KOG3248|consen 187 AKKPHIKKPLNAFMLYMKEMRA------KVVAECTLKESAAINQILGRRWHALSREEQAKYYELARKERQLHMQLYPGWS 260 (421)
T ss_pred ccCccccccHHHHHHHHHHHHH------HHHHHhhhhhHHHHHHHHhHHHhhhhHHHHHHHHHHHHHHHHHHHHhcCCcc
Confidence 5667888888888877654221 23444432222456777875 46889999999998654 58888
Q ss_pred CccccC
Q psy15819 161 LSSLYG 166 (203)
Q Consensus 161 ~s~~YG 166 (203)
.-+.||
T Consensus 261 ARdNYg 266 (421)
T KOG3248|consen 261 ARDNYG 266 (421)
T ss_pred hhhhhh
Confidence 888888
No 25
>KOG2573|consensus
Probab=24.42 E-value=1.6e+02 Score=27.94 Aligned_cols=75 Identities=17% Similarity=0.174 Sum_probs=46.6
Q ss_pred ccc--hhhhHHHHHHHhhcCCceeeCCCCCCHHHHHHHHHHhhhhccCCccchhhhHHHHHHHHHHHHhhhhcccCCChH
Q psy15819 67 KIC--SRTWLVDDWDTINNKNKLHQIPAKTSVDEIFESYLKHKANSKQMSGAKGKTVTEMVAGLKEYFNVMLGSQLLYQA 144 (203)
Q Consensus 67 ~iP--Lk~~LvdD~e~i~k~~~l~~LP~~~~V~~Il~~y~~~~~~~~~~s~~~~~~~~e~~~Gl~~yFn~~L~~~LLY~~ 144 (203)
.++ |+..|..-+-.+.+.+....+--..=-..|-+.+-. . ... ....+.|++.|++..|++.+...==|..
T Consensus 70 ~~~edLr~fLe~nlpK~kkkk~sLgi~d~kLg~~i~E~~~~-i---~c~---~~~~~~ellRGvR~hf~kl~K~L~~~d~ 142 (498)
T KOG2573|consen 70 VVHEDLRSFLELNLPKVKKKKVSLGIGDSKLGISIKEAFPK-I---PCQ---SNEVVQELLRGVRKHFDKLMKGLDPGDL 142 (498)
T ss_pred cccHHHHHHHHhhChhhhcCceeeccCcchhhhHHHhhccC-c---ccc---cchhHHHHHHHHHHHHHHHHccCCCccH
Confidence 666 999998877776443333222221111223333210 0 011 1256889999999999999998888999
Q ss_pred HHHH
Q psy15819 145 ERRQ 148 (203)
Q Consensus 145 Er~Q 148 (203)
|+.|
T Consensus 143 ~kaq 146 (498)
T KOG2573|consen 143 EKAQ 146 (498)
T ss_pred HHHH
Confidence 9998
No 26
>PF13045 DUF3905: Protein of unknown function (DUF3905)
Probab=20.91 E-value=48 Score=23.98 Aligned_cols=19 Identities=26% Similarity=0.288 Sum_probs=13.6
Q ss_pred cccccc-ccccccChhhHHH
Q psy15819 12 DEWVPE-CRVLKFNESNVQR 30 (203)
Q Consensus 12 DeWV~~-~rilk~~~eN~~~ 30 (203)
||||.. ..|==.|+||+++
T Consensus 64 deWvhPtnDIGw~t~ENrel 83 (84)
T PF13045_consen 64 DEWVHPTNDIGWNTAENREL 83 (84)
T ss_pred cccccCCcCccccchhhhhc
Confidence 788854 4666678888875
No 27
>KOG3035|consensus
Probab=20.73 E-value=29 Score=30.03 Aligned_cols=14 Identities=21% Similarity=0.499 Sum_probs=9.7
Q ss_pred ec-cCCCCccccccc
Q psy15819 4 HY-KWKRKWDEWVPE 17 (203)
Q Consensus 4 Hy-GW~~~WDeWV~~ 17 (203)
-| |||+||---|-.
T Consensus 44 GysGynSRwALkvL~ 58 (245)
T KOG3035|consen 44 GYSGYNSRWALKVLP 58 (245)
T ss_pred ccccchhHHHHHHhh
Confidence 38 999988654433
No 28
>PF08006 DUF1700: Protein of unknown function (DUF1700); InterPro: IPR012963 This family contains many hypothetical bacterial proteins and two putative membrane proteins (Q6GFD0 from SWISSPROT and Q6G806 from SWISSPROT).
Probab=20.49 E-value=49 Score=26.72 Aligned_cols=24 Identities=25% Similarity=0.434 Sum_probs=19.1
Q ss_pred CceeeCCCCCCHHHHHHHHHHhhhh
Q psy15819 85 NKLHQIPAKTSVDEIFESYLKHKAN 109 (203)
Q Consensus 85 ~~l~~LP~~~~V~~Il~~y~~~~~~ 109 (203)
+.|-.||.. -.++|+++|.++...
T Consensus 12 ~~L~~lp~~-e~~e~l~~Y~e~f~d 35 (181)
T PF08006_consen 12 KYLKKLPEE-EREEILEYYEEYFDD 35 (181)
T ss_pred HHHHcCCHH-HHHHHHHHHHHHHHH
Confidence 356667755 789999999999876
Done!