Query psy6849
Match_columns 439
No_of_seqs 187 out of 603
Neff 4.8
Searched_HMMs 46136
Date Fri Aug 16 19:57:08 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy6849.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/6849hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF11467 LEDGF: Lens epitheliu 100.0 9.8E-38 2.1E-42 268.8 7.4 104 299-420 1-106 (106)
2 KOG1904|consensus 99.9 1.2E-28 2.7E-33 258.1 7.6 405 5-431 10-475 (496)
3 cd05834 HDGF_related The PWWP 99.9 1.4E-27 3E-32 197.5 7.9 82 6-87 1-82 (83)
4 cd05836 N_Pac_NP60 The PWWP do 99.9 6E-27 1.3E-31 194.9 7.8 82 8-89 1-86 (86)
5 cd05841 BS69_related The PWWP 99.9 6.8E-25 1.5E-29 181.7 6.9 75 8-87 7-82 (83)
6 cd05835 Dnmt3b_related The PWW 99.9 1E-24 2.2E-29 181.6 6.9 81 8-88 1-86 (87)
7 cd05162 PWWP The PWWP domain, 99.9 3.8E-23 8.3E-28 170.5 8.6 80 8-87 1-86 (87)
8 cd05838 WHSC1_related The PWWP 99.9 4.1E-23 8.8E-28 174.7 7.8 83 8-90 1-93 (95)
9 PF00855 PWWP: PWWP domain; I 99.9 2.2E-23 4.7E-28 169.9 2.6 81 8-88 1-86 (86)
10 cd05840 SPBC215_ISWI_like The 99.8 2.8E-21 6.2E-26 163.1 7.9 79 8-86 1-91 (93)
11 cd05837 MSH6_like The PWWP dom 99.8 1.1E-19 2.4E-24 157.5 7.7 81 7-87 2-101 (110)
12 smart00293 PWWP domain with co 99.8 8.5E-20 1.8E-24 143.3 6.2 56 8-63 1-63 (63)
13 cd06080 MUM1_like Mutated mela 99.8 1.9E-19 4.2E-24 148.4 5.6 76 8-86 1-77 (80)
14 cd05839 BR140_related The PWWP 99.6 1.6E-15 3.5E-20 132.2 6.5 79 8-86 1-109 (111)
15 cd00183 TFIIS_I N-terminal dom 97.2 0.0023 4.9E-08 52.2 8.1 59 317-395 17-75 (76)
16 smart00509 TFS2N Domain in the 97.1 0.0027 5.8E-08 51.9 7.8 59 317-395 15-73 (75)
17 KOG1081|consensus 96.9 0.00046 9.9E-09 73.8 2.0 61 6-68 134-194 (463)
18 PF08711 Med26: TFIIS helical 96.5 0.0066 1.4E-07 45.9 5.3 52 325-395 1-52 (53)
19 smart00743 Agenet Tudor-like d 95.2 0.019 4.2E-07 44.1 3.1 52 7-62 2-56 (61)
20 TIGR01385 TFSII transcription 95.0 0.08 1.7E-06 53.9 7.8 71 305-397 7-77 (299)
21 smart00333 TUDOR Tudor domain. 94.6 0.038 8.3E-07 41.5 3.3 53 7-63 2-54 (57)
22 KOG1904|consensus 92.6 0.04 8.6E-07 59.6 0.4 65 296-361 431-495 (496)
23 cd04508 TUDOR Tudor domains ar 91.9 0.15 3.2E-06 37.0 2.7 47 11-61 1-48 (48)
24 PF11717 Tudor-knot: RNA bindi 90.0 0.25 5.3E-06 37.9 2.4 52 8-60 1-54 (55)
25 PF15057 DUF4537: Domain of un 85.7 1.4 3.1E-05 39.3 4.9 60 4-64 52-114 (124)
26 KOG1105|consensus 81.9 5.4 0.00012 40.9 7.7 62 317-397 19-80 (296)
27 cd03572 ENTH_epsin_related ENT 74.5 9.9 0.00021 34.2 6.3 79 300-389 34-115 (122)
28 COG5475 Uncharacterized small 71.6 8.5 0.00018 30.5 4.5 53 6-65 3-57 (60)
29 KOG1080|consensus 67.5 3.7 8.1E-05 48.3 2.6 87 6-92 190-295 (1005)
30 cd04370 BAH BAH, or Bromo Adja 61.9 8.8 0.00019 32.2 3.3 41 7-47 3-47 (123)
31 PF08169 RBB1NT: RBB1NT (NUC16 54.2 13 0.00027 32.4 2.9 79 10-92 8-91 (96)
32 PF01417 ENTH: ENTH domain; I 54.0 54 0.0012 28.6 7.0 81 300-390 35-118 (125)
33 PF09465 LBR_tudor: Lamin-B re 52.4 19 0.00041 28.3 3.3 50 6-59 4-54 (55)
34 PRK05260 condesin subunit F; P 46.7 32 0.00068 37.0 5.0 125 290-428 170-298 (440)
35 smart00439 BAH Bromo adjacent 45.7 22 0.00048 29.9 3.1 41 7-47 1-44 (120)
36 cd04721 BAH_plant_1 BAH, or Br 44.2 21 0.00045 32.1 2.9 43 7-49 7-49 (130)
37 PF05641 Agenet: Agenet domain 43.2 17 0.00038 28.8 2.0 51 8-62 1-63 (68)
38 PLN00104 MYST -like histone ac 42.3 27 0.00058 37.9 3.8 55 6-60 52-112 (450)
39 smart00561 MBT Present in Dros 41.6 39 0.00085 28.9 4.0 53 6-63 26-84 (96)
40 cd05503 Bromo_BAZ2A_B_like Bro 40.8 85 0.0018 26.4 5.9 55 319-384 1-72 (97)
41 PF14476 Chloroplast_duf: Peta 39.2 31 0.00068 35.4 3.4 92 299-392 92-200 (313)
42 PF14866 Toxin_38: Potassium c 37.3 41 0.0009 26.6 3.1 33 378-410 1-34 (56)
43 PF06003 SMN: Survival motor n 36.2 32 0.00069 34.6 3.0 58 5-65 66-124 (264)
44 PF08846 DUF1816: Domain of un 34.4 23 0.0005 28.9 1.4 23 18-48 5-27 (68)
45 PF01426 BAH: BAH domain; Int 33.3 23 0.00049 29.8 1.3 41 7-47 2-46 (119)
46 cd05505 Bromo_WSTF_like Bromod 31.6 1.1E+02 0.0023 26.1 5.1 59 319-389 1-76 (97)
47 cd04714 BAH_BAHCC1 BAH, or Bro 31.0 44 0.00096 29.4 2.8 42 7-48 3-46 (121)
48 KOG0412|consensus 29.4 4.7E+02 0.01 30.4 10.8 110 290-413 104-242 (773)
49 KOG3612|consensus 29.0 31 0.00067 38.3 1.7 51 8-61 331-382 (588)
50 smart00804 TAP_C C-terminal do 28.7 45 0.00098 26.7 2.2 23 310-332 31-53 (63)
51 PHA03165 hypothetical protein; 28.0 24 0.00053 27.0 0.5 20 392-411 6-26 (57)
52 cd05502 Bromo_tif1_like Bromod 27.8 1.1E+02 0.0023 26.3 4.5 37 318-354 4-54 (109)
53 KOG0955|consensus 27.8 51 0.0011 39.4 3.2 56 8-63 943-1025(1051)
54 cd05791 S1_CSL4 S1_CSL4: CSL4, 25.5 31 0.00068 29.0 0.8 20 3-22 56-75 (92)
55 PF03882 KicB: KicB killing fa 24.5 73 0.0016 34.3 3.4 123 290-428 170-298 (440)
56 cd04712 BAH_DCM_I BAH, or Brom 24.4 71 0.0015 28.7 2.9 51 6-56 4-66 (130)
57 cd04717 BAH_polybromo BAH, or 23.8 77 0.0017 27.5 3.0 42 7-48 3-46 (121)
58 cd04715 BAH_Orc1p_like BAH, or 22.7 77 0.0017 29.6 2.9 43 6-48 28-72 (159)
59 PF09926 DUF2158: Uncharacteri 21.8 2E+02 0.0044 22.2 4.6 38 8-49 1-40 (53)
60 PF07443 HARP: HepA-related pr 21.1 73 0.0016 25.1 2.0 29 343-371 8-38 (55)
61 KOG0749|consensus 20.6 39 0.00085 34.6 0.5 59 342-409 52-117 (298)
62 KOG3409|consensus 20.5 55 0.0012 31.6 1.4 18 3-20 118-135 (193)
63 KOG0981|consensus 20.4 90 0.002 35.2 3.2 49 342-397 434-482 (759)
No 1
>PF11467 LEDGF: Lens epithelium-derived growth factor (LEDGF) ; InterPro: IPR021567 LEDGF is a chromatin-associated protein that protects cells from stress-induced apoptosis. It is the binding partner of HIV-1 integrase in human cells. The integrase binding domain (IBD) of LEDGF is a compact right-handed bundle composed of five alpha-helices. The residues essential for the interaction with the integrase are present in the inter-helical loop regions of the bundle structure. ; PDB: 3F9K_K 3HPG_G 3U88_C 3HPH_H 2B4J_D 1Z9E_A.
Probab=100.00 E-value=9.8e-38 Score=268.81 Aligned_cols=104 Identities=42% Similarity=0.814 Sum_probs=73.8
Q ss_pred hhHHHHHHHHHHHhhCCCCCCHHHHHHHHHhhcCCCCChHHhhhCchHHHHHHHhhhhccCccccccCHHHHHHhhhhHH
Q psy6849 299 EVCILDIDLRIKDSIGLEHADCDECLKALDDLINLPITPLVLKKHPEIVDTCRRLQRYTGNIAEWTMTEEEAEVFTQKAS 378 (439)
Q Consensus 299 E~~l~~l~~~ik~sL~~~~~dv~~cl~aLdel~~L~vt~~mL~k~~e~V~tirklrry~gn~~~w~~~~~~~~~f~~~a~ 378 (439)
|++|++||++|++||+++++||++||+|||+|.+|+||++||+|||+||+||||||||+|| +
T Consensus 1 E~~L~~l~~~Ik~~L~~~~~Dv~kcL~~LdeL~~l~vT~~mL~kn~e~V~TlkklRrY~gn------------------~ 62 (106)
T PF11467_consen 1 ERRLQELHSEIKSSLKVDNPDVKKCLKALDELKSLQVTSLMLQKNPECVETLKKLRRYKGN------------------Q 62 (106)
T ss_dssp HHHHHHHHHHHHHTCETTEE-HHHHHHHHHHHHTS---HHHHTTTHHHHHHHHHHTT-TT-------------------H
T ss_pred ChhHHHHHHHHHHHHcCCCCCHHHHHHHHHHHhccCCCHHHHHhCHHHHHHHHHHHHhhcc------------------H
Confidence 7899999999999999999999999999999999999999999999999999999999996 8
Q ss_pred HHHHHHHHHHHHhhhhcc-cCCC-cchHHHHHHHHHHHHHHhcC
Q psy6849 379 QVRAKADHVYNKFKNLFT-VPAG-ETFWDTFSKQLVTFNDVTRN 420 (439)
Q Consensus 379 ~Ir~kA~~ly~kfK~~F~-~~eg-~~Fw~~F~~~v~~fk~~t~~ 420 (439)
.||+||++|||+||+||+ .+|+ .+||.+||++|++|++.|+|
T Consensus 63 ~Ir~KA~~lYnkfK~~f~~~~e~~~~~~~~~~~~~~~~ke~~~~ 106 (106)
T PF11467_consen 63 QIRKKATELYNKFKSLFLPEPESEQNFWEEFCEEVKKFKEETKN 106 (106)
T ss_dssp HHHHHHHHHHHHHHHHHH--S-SH--------------------
T ss_pred HHHHHHHHHHHHHHHHhCCCchHHHHHHHHHHHHHHHHHHHhcC
Confidence 999999999999999996 4443 79999999999999999986
No 2
>KOG1904|consensus
Probab=99.95 E-value=1.2e-28 Score=258.07 Aligned_cols=405 Identities=18% Similarity=0.173 Sum_probs=248.2
Q ss_pred cCCCccCcEEEEeccCCCCCCceecCCCC--CCCCCceEEEEEecCCceeeecCCCcccchhhhhhhCCCCCh--HHHHH
Q psy6849 5 ERKFKTNDKVFAKVRGYPPWPARIEGLAD--ETPNRLKYHIFFYGTRETGICRQDELFPYTEFKDKYGKGVKR--KFFTE 80 (439)
Q Consensus 5 ~~~fk~GDLVWAKvkGYP~WPArV~~~pe--vk~~~~ky~V~FFGt~e~Awv~~k~L~Py~e~kekf~K~~Kr--k~F~e 80 (439)
...|++||||||||+|||+|||+|.+.++ +++.+++|.||||||+++|||.+++|+||..+++.|++++++ ++|++
T Consensus 10 ~~~~~~GDLV~AKlkgyp~WParI~~~~~~~~kp~pkky~V~FfGT~e~Afl~p~dlqpy~~~k~~~g~~~k~~~k~F~~ 89 (496)
T KOG1904|consen 10 AGNFKCGDLVFAKLKGYPPWPARIRNGPDGAVKPPPKKYTVFFFGTKETAFLKPKDLQPYMLNKEKLGKPNKRVWKGFIE 89 (496)
T ss_pred cCCCCCCceeeecccCCCCCcccccCcccccccCCCceeEEEEeccCcccccchhhccchhhhhhhcccchhhhhHHHHH
Confidence 47899999999999999999999999986 578999999999999999999999999999999999999999 99999
Q ss_pred HHHHHHHHhCCCCCcC--CCCCCC--CCCCCCCCcCCCCCC---CCCCc---------ccCcccccc-----cCC-----
Q psy6849 81 ALQEIECDFGTPESRA--NLLPSE--STPKPEATESQAGSD---NEGNL---------VIDEGSEKK-----AAP----- 134 (439)
Q Consensus 81 Al~Eaee~l~n~~~~~--~~~~~~--~~~~~~~~~~~~~s~---~~~~~---------v~de~~~~k-----~~~----- 134 (439)
|+|+|++++ ++|... ++..+. .++.-.+.+....++ .+|.. +..+.+-.+ ...
T Consensus 90 av~eI~~a~-~np~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~as~e~~~~~~~~~~~~~ 168 (496)
T KOG1904|consen 90 AVEEIREAF-NNPKSESDGIFFSATATGRLLGPENLFVASSVLNDTGNSEGEPGADENNASEIATPESDNNRPEGNGEDS 168 (496)
T ss_pred HHHHHHHHh-cCCCccccCccccccccccccccccccccccccccccccccccccccccccccccccccccCCccCcccc
Confidence 999999999 555542 222111 111110000000000 01110 000000000 000
Q ss_pred CCCCC--CCccccccCCCCCCC---CCCCCCCccccCCCCCCCCCCCCCC-----CCCCCcc---CCCCCCCCCC-----
Q psy6849 135 KPKTS--TPRVSEAMDVDTPNY---HHKPSPVLKVTTSGRKIKPKKTFDP-----DDNDSTF---SSHSGGFKEQ----- 196 (439)
Q Consensus 135 ~~~~~--~~~~~~~~~~d~~~~---~~~~~~~~~~sr~gRk~kp~~~~~~-----~~~~~~~---~s~~~~~~~~----- 196 (439)
..+.+ +...|....+|.... ...........|+|++.++--.+.. +.+.... +..+.+....
T Consensus 169 ~~~~d~~~~~d~~~~~~~el~~~~~~~~~~~~~r~~~kg~rkr~ts~~~~~~~~~~a~~~k~t~~~~~~~~s~krP~~~q 248 (496)
T KOG1904|consen 169 SSKVDYSDSTDHFELGGDELNKGKRKPIDTMVKRKKRKGTRKRKTSDAEPKVQRSQASHEKLTERPCESNGSEKRPASTQ 248 (496)
T ss_pred ccccccccccccccccchhhccccccccccccccccCccccccccccccccccccccchhhccCCCcccccccccccccc
Confidence 00000 111111111111100 0011122334456666542211100 0000000 0011111111
Q ss_pred --CCCCCCcccccccceeeeeccCCcccc--ccccCCCCCCCccchHHHHHhhhhhhhHHHHHHHHhhcCCCChHHHhhh
Q psy6849 197 --SPIPGTENALIKASVCRIKTVDGKLVL--LDINKFTPPENCKTEKSINLWKMNKINEFKQIREKVEEGEPVKEEYAKI 272 (439)
Q Consensus 197 --~~~~~~~~~~~~~t~~~~~~~~~k~~g--~~~~~~~~p~~~~~~~~r~~w~~~~~~~a~~lk~~~esg~~~~~~~~~~ 272 (439)
..+++++++ ... ..+. |+..+.+ +++.. +-.|+-.++..-|+-.....+.+++..+.++++..+++...
T Consensus 249 ~~k~~~~~~~~-~~~-~~~~--~s~~s~~~s~~~e~---~~~~e~~~~~~~~~~~~~~~~~~~e~~r~~~~~~~~~~~~~ 321 (496)
T KOG1904|consen 249 AEKRSPKDTGR-SAQ-AQTS--PSIKSEHDSIAVEE---KSSFEVKAKSSPDMVNVVPSAQEKEEDRMSPSEACVQTVRD 321 (496)
T ss_pred ccccccccccc-chh-hccC--CCCcccccchhhhc---ccchhhhhcccccccccccccccchhhhcccccchhhhhhh
Confidence 111111111 101 1112 2233443 66667 88899899999999999999999999888888888777543
Q ss_pred hhhc-------cChHHHHHHHHHHHHHhhhhhhhhHHHHHHHHHHHhhCCCCCCHHHHHHHHHhhcCCCCChHHhhhCch
Q psy6849 273 IEEQ-------CQPEVQAELEANEKALKTQLKLEVCILDIDLRIKDSIGLEHADCDECLKALDDLINLPITPLVLKKHPE 345 (439)
Q Consensus 273 i~~~-------~~~~~e~~~~~~E~~k~~~L~~E~~l~~l~~~ik~sL~~~~~dv~~cl~aLdel~~L~vt~~mL~k~~e 345 (439)
.... ++..+-..++ +.. ..+...+++...- ++....+..|+.+...+..++-+..+++++|.
T Consensus 322 ~~~~~~~~~~~~~~~~~~sl~--~~~--~~~~~~~~~~~~~-------~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~p~ 390 (496)
T KOG1904|consen 322 RETAVHGEAATGQTNQSSSLL--EGE--ARSVEVRRFKLRE-------GSESDSEGNETPMRPNIVDSSANKVKQLCSPP 390 (496)
T ss_pred hcccccccccccccccccccc--ccc--chhhhhccccccC-------CCcccccccccccccccccCccccchhhcCcc
Confidence 3221 1000000000 000 1112222222222 44455555999999999999999999999999
Q ss_pred HHHHHHHhhhhccCccccccCHHHHHHhhhhHHHHHHHHHHHHHHhhhhc-ccCC-CcchHHHHHHHHHHHHHHhcCCCc
Q psy6849 346 IVDTCRRLQRYTGNIAEWTMTEEEAEVFTQKASQVRAKADHVYNKFKNLF-TVPA-GETFWDTFSKQLVTFNDVTRNMPA 423 (439)
Q Consensus 346 ~V~tirklrry~gn~~~w~~~~~~~~~f~~~a~~Ir~kA~~ly~kfK~~F-~~~e-g~~Fw~~F~~~v~~fk~~t~~~~~ 423 (439)
++....++++|.=+...|.|+-.....+.-.+|.|+..+...|. +...| .-.+ ..++|..+| .+.+..+++..+.
T Consensus 391 q~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~kv~k~~~~~~~-~~~~v~~~~~~~~~~~~~~~--~e~~~s~~k~~~~ 467 (496)
T KOG1904|consen 391 QKSPQLRSREYDLAKQGKVLKGSTEKSDACMAQKVQSDASEEVP-GSCSVVNKQEQENNQDTKNA--KENQGSLGKEAAR 467 (496)
T ss_pred ccChhhhhhHHHhhhhhhhhcccccccccccceeecccccccCc-cchhccccccccccccchhh--hhhcccHHHHHhH
Confidence 99999999999999999999999999999999999999999999 66555 2233 489999999 9999999998887
Q ss_pred ccceeccc
Q psy6849 424 HDLFSLCE 431 (439)
Q Consensus 424 ~~~~~l~~ 431 (439)
.....+..
T Consensus 468 ~~~~~~~q 475 (496)
T KOG1904|consen 468 VNQAQMRQ 475 (496)
T ss_pred hhhhhhhc
Confidence 76655544
No 3
>cd05834 HDGF_related The PWWP domain is an essential part of the Hepatoma Derived Growth Factor (HDGF) family of proteins, and is necessary for DNA binding by HDGF. This family of endogenous nuclear-targeted mitogens includes HRP (HDGF-related proteins 1, 2, 3, 4, or HPR1, HPR2, HPR3, HPR4, respectively) and lens epithelium-derived growth factor, LEDGF. Members of the HDGF family have been linked to human diseases, and HDGF is a prognostic factor in several types of cancer. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes.
Probab=99.94 E-value=1.4e-27 Score=197.53 Aligned_cols=82 Identities=52% Similarity=0.921 Sum_probs=77.5
Q ss_pred CCCccCcEEEEeccCCCCCCceecCCCCCCCCCceEEEEEecCCceeeecCCCcccchhhhhhhCCCCChHHHHHHHHHH
Q psy6849 6 RKFKTNDKVFAKVRGYPPWPARIEGLADETPNRLKYHIFFYGTRETGICRQDELFPYTEFKDKYGKGVKRKFFTEALQEI 85 (439)
Q Consensus 6 ~~fk~GDLVWAKvkGYP~WPArV~~~pevk~~~~ky~V~FFGt~e~Awv~~k~L~Py~e~kekf~K~~Krk~F~eAl~Ea 85 (439)
..|++||||||||+|||||||+|+++++.++..+.|+|+|||++++|||.+++|+||.+++++|.+++++++|++||+||
T Consensus 1 ~~f~~GdlVwaK~kGyp~WPa~I~~~~~~~~~~~~~~V~FfGt~~~a~v~~~~l~pf~~~~~~~~~~~k~k~F~~Av~ei 80 (83)
T cd05834 1 KQFKAGDLVFAKVKGYPAWPARVDEPEDWKPPGKKYPVYFFGTHETAFLKPEDLFPYTENKKKFGKPKKRKGFNEAVWEI 80 (83)
T ss_pred CCCCCCCEEEEecCCCCCCCEEEecccccCCCCCEEEEEEeCCCCEeEECHHHceecccchhhhccccchHHHHHHHHHH
Confidence 37999999999999999999999999876667899999999999999999999999999999999999999999999999
Q ss_pred HH
Q psy6849 86 EC 87 (439)
Q Consensus 86 ee 87 (439)
|.
T Consensus 81 e~ 82 (83)
T cd05834 81 EK 82 (83)
T ss_pred hh
Confidence 83
No 4
>cd05836 N_Pac_NP60 The PWWP domain is an essential part of the cytokine-like nuclear factor n-pac protein, or NP60, which enhances the activity of MAP2K4 and MAP2K6 kinases to phosphorylate p38-alpha. In a variety of cell lines, NP60 has been shown to localize to the nucleus. In addition to the PWWP domain, NP60 also contains an AT-hook and a C-terminal NAD-binding domain. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding proteins, that function as transcription factors regulating a variety of developmental processes.
Probab=99.94 E-value=6e-27 Score=194.86 Aligned_cols=82 Identities=30% Similarity=0.578 Sum_probs=75.1
Q ss_pred CccCcEEEEeccCCCCCCceecCCCC-C---CCCCceEEEEEecCCceeeecCCCcccchhhhhhhCCCCChHHHHHHHH
Q psy6849 8 FKTNDKVFAKVRGYPPWPARIEGLAD-E---TPNRLKYHIFFYGTRETGICRQDELFPYTEFKDKYGKGVKRKFFTEALQ 83 (439)
Q Consensus 8 fk~GDLVWAKvkGYP~WPArV~~~pe-v---k~~~~ky~V~FFGt~e~Awv~~k~L~Py~e~kekf~K~~Krk~F~eAl~ 83 (439)
|++||||||||+|||||||+|++++. + +...+.|+|+|||++++|||.+++|+||.+++++|.+.+|.++|++||+
T Consensus 1 f~~GDlVwaK~~g~P~WPa~V~~~~~~~~~~~~~~~~~~V~FFG~~~~~wv~~~~l~pF~~~~~~~~~~~k~~~F~~Av~ 80 (86)
T cd05836 1 LKLGDLVWAKMKGFPPWPGRIVKPPKDLKKPRGKAKCFFVFFFGSENHAWIKEENIKPYHEHKEEMIKLNKGARFQQAVD 80 (86)
T ss_pred CCCCCEEEEeCCCCCCCCEEEechhhhcccccCCCCeEEEEEeCCCCEEEECHHhCeechhhHHHHhcccchHHHHHHHH
Confidence 78999999999999999999999864 2 2235789999999999999999999999999999999999999999999
Q ss_pred HHHHHh
Q psy6849 84 EIECDF 89 (439)
Q Consensus 84 Eaee~l 89 (439)
|||.|+
T Consensus 81 ~ie~~~ 86 (86)
T cd05836 81 AIEEYI 86 (86)
T ss_pred HHHHhC
Confidence 999774
No 5
>cd05841 BS69_related The PWWP domain is part of BS69 protein, a nuclear protein that specifically binds adenoviral E1A and Epstein-Barr viral EBNA2 proteins, suppressing their transactivation functions. BS69 is a multi-domain protein, containing bromo, PHD, PWWP, and MYND domains. The specific role of the PWWP domain within BS69 is not clearly identified, but BS69 functions in chromatin remodeling, consistent with other PWWP-containing proteins. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes.
Probab=99.91 E-value=6.8e-25 Score=181.72 Aligned_cols=75 Identities=20% Similarity=0.429 Sum_probs=71.1
Q ss_pred CccCcEEEEeccCCCCCCceecCCCCCCCCCceEEEEEec-CCceeeecCCCcccchhhhhhhCCCCChHHHHHHHHHHH
Q psy6849 8 FKTNDKVFAKVRGYPPWPARIEGLADETPNRLKYHIFFYG-TRETGICRQDELFPYTEFKDKYGKGVKRKFFTEALQEIE 86 (439)
Q Consensus 8 fk~GDLVWAKvkGYP~WPArV~~~pevk~~~~ky~V~FFG-t~e~Awv~~k~L~Py~e~kekf~K~~Krk~F~eAl~Eae 86 (439)
+++||||||||+|||||||+|+.. ..+.|+|+||| ||++|||+.++|+||.++++.+.+++|+++|++||+||+
T Consensus 7 ~~p~dLVwAK~kGyp~WPAkV~~~-----~~~~~~V~FFG~t~~~a~v~~~~i~~~~~~~~~~~~~~k~~~f~~A~~Eie 81 (83)
T cd05841 7 RPPHELVWAKLKGFPYWPAKVMRV-----EDNQVDVRFFGGQHDRAWIPSNNIQPISTEIPQQLVKKRSRGFNKAMDELE 81 (83)
T ss_pred CCCCCEEEEeCCCCCCCCEEEeec-----CCCeEEEEEcCCCCCeEEEehHHeeehhhhhhhhccccccHHHHHHHHHHH
Confidence 689999999999999999999987 45899999999 999999999999999999988999999999999999998
Q ss_pred H
Q psy6849 87 C 87 (439)
Q Consensus 87 e 87 (439)
.
T Consensus 82 ~ 82 (83)
T cd05841 82 L 82 (83)
T ss_pred h
Confidence 5
No 6
>cd05835 Dnmt3b_related The PWWP domain is an essential component of DNA methyltransferase 3 B (Dnmt3b) which is responsible for establishing DNA methylation patterns during embryogenesis and gametogenesis. In tumorigenesis, DNA methylation by Dnmt3b is known to play a role in the inactivation of tumor suppressor genes. In addition, a point mutation in the PWWP domain of Dnmt3b has been identified in patients with ICF syndrome (immunodeficiency, centromeric instability, and facial anomalies), a rare autosomal recessive disorder characterized by hypomethylation of classical satellite DNA. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes.
Probab=99.91 E-value=1e-24 Score=181.57 Aligned_cols=81 Identities=23% Similarity=0.499 Sum_probs=73.2
Q ss_pred CccCcEEEEeccCCCCCCceecCCCCC---CCCCceEEEEEecCCceeeecCCCcccchhhhhhhCC--CCChHHHHHHH
Q psy6849 8 FKTNDKVFAKVRGYPPWPARIEGLADE---TPNRLKYHIFFYGTRETGICRQDELFPYTEFKDKYGK--GVKRKFFTEAL 82 (439)
Q Consensus 8 fk~GDLVWAKvkGYP~WPArV~~~pev---k~~~~ky~V~FFGt~e~Awv~~k~L~Py~e~kekf~K--~~Krk~F~eAl 82 (439)
|.+||||||||+|||||||+|++++.. ....++|+|+|||+++++||.+++|+||.++.+.|+. .+|+..|++||
T Consensus 1 f~vGDlVWaK~kg~pwWP~~V~~~~~~~~~~~~~~~~~V~fFGs~~~a~v~~~~l~pf~e~~~~f~~~~~~k~~~f~~Ai 80 (87)
T cd05835 1 FNVGDLVWGKIKGFPWWPGRVVSITVTSKRPPVVGMRWVTWFGSGTFSEVSVDKLSPFSEFFKAFSRYNRKKKGLYKKAI 80 (87)
T ss_pred CCCCCEEEEecCCCCCCCeEEechhhcccccCCCCeEEEEEeCCCCEeEECHHHCcChhHhHHHHhhhhhhhhHHHHHHH
Confidence 789999999999999999999999763 3467899999999999999999999999999999874 46789999999
Q ss_pred HHHHHH
Q psy6849 83 QEIECD 88 (439)
Q Consensus 83 ~Eaee~ 88 (439)
.||.+.
T Consensus 81 ~eA~e~ 86 (87)
T cd05835 81 YEALEV 86 (87)
T ss_pred HHHHHc
Confidence 999864
No 7
>cd05162 PWWP The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes. The function of the PWWP domain is still not known precisely; however, based on the fact that other regions of PWWP-domain proteins are responsible for nuclear localization and DNA-binding, is likely that the PWWP domain acts as a site for protein-protein binding interactions, influencing chromatin remodeling and thereby regulating transcriptional processes. Some PWWP-domain proteins have been linked to cancer or other diseases; some are known to function as growth factors.
Probab=99.89 E-value=3.8e-23 Score=170.53 Aligned_cols=80 Identities=33% Similarity=0.552 Sum_probs=73.0
Q ss_pred CccCcEEEEeccCCCCCCceecCCCCC------CCCCceEEEEEecCCceeeecCCCcccchhhhhhhCCCCChHHHHHH
Q psy6849 8 FKTNDKVFAKVRGYPPWPARIEGLADE------TPNRLKYHIFFYGTRETGICRQDELFPYTEFKDKYGKGVKRKFFTEA 81 (439)
Q Consensus 8 fk~GDLVWAKvkGYP~WPArV~~~pev------k~~~~ky~V~FFGt~e~Awv~~k~L~Py~e~kekf~K~~Krk~F~eA 81 (439)
|++|||||||++|||||||+|+++... +...+.|+|+|||+++++||..++|.||.++.....+.+++++|++|
T Consensus 1 f~~GdlVwaK~~g~pwWPa~V~~~~~~~~~~~~~~~~~~~~V~Ffg~~~~~wv~~~~l~pf~~~~~~~~~~~k~~~f~~A 80 (87)
T cd05162 1 FRPGDLVWAKMKGYPWWPALVVDPPKDSKKAKKKAKEGKVLVLFFGDKTFAWVGAERLKPFTEHKESEAKQSKRKGFKKA 80 (87)
T ss_pred CCCCCEEEEeCCCCCCCCEEEccccccchhhhccCCCCEEEEEEeCCCcEEEeCccceeeccchHHhhccCCccHHHHHH
Confidence 789999999999999999999999763 34568999999999999999999999999988777788899999999
Q ss_pred HHHHHH
Q psy6849 82 LQEIEC 87 (439)
Q Consensus 82 l~Eaee 87 (439)
|.+|++
T Consensus 81 ~~eA~~ 86 (87)
T cd05162 81 YDEALE 86 (87)
T ss_pred HHHHHh
Confidence 999984
No 8
>cd05838 WHSC1_related The PWWP domain was first identified in the WHSC1 (Wolf-Hirschhorn syndrome candidate 1) protein, a protein implicated in Wolf-Hirschhorn syndrome (WHS). When translocated, WHSC1 plays a role in lymphoid multiple myeloma (MM) disease, also known as plasmacytoma. WHCS1 proteins typically contain two copies of the PWWP domain. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes.
Probab=99.88 E-value=4.1e-23 Score=174.73 Aligned_cols=83 Identities=25% Similarity=0.386 Sum_probs=71.7
Q ss_pred CccCcEEEEeccCCCCCCceecCCCCC-------CCCCceEEEEEecCCceeeecCCCcccchhhhhhhCC---CCChHH
Q psy6849 8 FKTNDKVFAKVRGYPPWPARIEGLADE-------TPNRLKYHIFFYGTRETGICRQDELFPYTEFKDKYGK---GVKRKF 77 (439)
Q Consensus 8 fk~GDLVWAKvkGYP~WPArV~~~pev-------k~~~~ky~V~FFGt~e~Awv~~k~L~Py~e~kekf~K---~~Krk~ 77 (439)
+.+||||||||+|||||||+|+++.++ +...+.|+|+|||+++|+||.+++|+||.++...+.. .++.+.
T Consensus 1 ~~~GdlVWaK~~g~pwWPa~V~~~~~~p~~~~~~~~~~~~~~V~Ffgs~~y~Wv~~~~l~pf~e~~~~~~~~~~~~~~~~ 80 (95)
T cd05838 1 PLYGDIVWAKLGNFRWWPAIICDPREVPPNIQVLRHCIGEFCVMFFGTHDYYWVHRGRVFPYQEGDKGFKEQTKSYLAKR 80 (95)
T ss_pred CCcCCEEEEECCCCCCCCeEEcChhhcChhHhhccCCCCeEEEEEeCCCCEEEeccccccchhhhhhhhhhhhhhhhHHH
Confidence 468999999999999999999987542 3456889999999999999999999999988776552 256899
Q ss_pred HHHHHHHHHHHhC
Q psy6849 78 FTEALQEIECDFG 90 (439)
Q Consensus 78 F~eAl~Eaee~l~ 90 (439)
|++||.||.+++.
T Consensus 81 f~~AleEA~~~~~ 93 (95)
T cd05838 81 FRKALEEASLAFK 93 (95)
T ss_pred HHHHHHHHHHHhh
Confidence 9999999998875
No 9
>PF00855 PWWP: PWWP domain; InterPro: IPR000313 Upon characterisation of WHSC1, a gene mapping to the Wolf-Hirschhornsyndrome critical region and at its C terminus similar to the Drosophila melanogaster ASH1/trithorax group proteins, a novel protein domain designated PWWP domain was identified []. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. It is present in proteins of nuclear origin and plays a role in cell growth and differentiation. Due to its position, the composition of amino acids close to the PWWP motif and the pattern of other domains present it has been suggested that the domain is involved in protein-protein interactions [].; PDB: 3LYI_B 2L89_A 2NLU_A 1RI0_A 1KHC_A 3QKJ_C 2DAQ_A 1N27_A 3PFS_B 3QJ6_A ....
Probab=99.87 E-value=2.2e-23 Score=169.88 Aligned_cols=81 Identities=33% Similarity=0.624 Sum_probs=72.3
Q ss_pred CccCcEEEEeccCCCCCCceecCCCCC---CCCCceEEEEEecCCceeeecCCCcccchhhhhhhCCC--CChHHHHHHH
Q psy6849 8 FKTNDKVFAKVRGYPPWPARIEGLADE---TPNRLKYHIFFYGTRETGICRQDELFPYTEFKDKYGKG--VKRKFFTEAL 82 (439)
Q Consensus 8 fk~GDLVWAKvkGYP~WPArV~~~pev---k~~~~ky~V~FFGt~e~Awv~~k~L~Py~e~kekf~K~--~Krk~F~eAl 82 (439)
|++||||||||+|||||||+|+++++. .+..+.|+|+|||+++++||..++|.||.++.+.+... ++++.|++||
T Consensus 1 f~~GdlVWaK~~g~pwWPa~V~~~~~~~~~~~~~~~~~V~Ffg~~~~~wv~~~~i~~f~~~~~~~~~~~~~k~~~~~~Ai 80 (86)
T PF00855_consen 1 FRPGDLVWAKLKGYPWWPARVCDPDEKSKKKRKDGHVLVRFFGDNDYAWVKPSNIKPFSEFKEKLKKKKKKKRKSFRKAI 80 (86)
T ss_dssp -STTEEEEEEETTSEEEEEEEEECCHCTSCSSSSTEEEEEETTTTEEEEEEGGGEEECCHHHHHHHHHHHHHSHHHHHHH
T ss_pred CCCCCEEEEEeCCCCCCceEEeecccccccCCCCCEEEEEecCCCCEEEECHHHhhChhhhHHHHHHhhccchHHHHHHH
Confidence 789999999999999999999999753 45678999999999999999999999999988888754 5789999999
Q ss_pred HHHHHH
Q psy6849 83 QEIECD 88 (439)
Q Consensus 83 ~Eaee~ 88 (439)
.+|+++
T Consensus 81 ~eA~~~ 86 (86)
T PF00855_consen 81 EEAEEA 86 (86)
T ss_dssp HHHHHH
T ss_pred HHHHhC
Confidence 999865
No 10
>cd05840 SPBC215_ISWI_like The PWWP domain is a component of the S. pombe hypothetical protein SPBC215, as well as ISWI complex protein 4. The ISWI (imitation switch) proteins are ATPases responsible for chromatin remodeling in eukaryotes, and SPBC215 is proposed to also bind chromatin. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes.
Probab=99.84 E-value=2.8e-21 Score=163.12 Aligned_cols=79 Identities=29% Similarity=0.467 Sum_probs=67.9
Q ss_pred CccCcEEEEeccCCCCCCceecCCCCC---------CCCCceEEEEEecCCceeeecCCCcccch-hhhhhhCCC--CCh
Q psy6849 8 FKTNDKVFAKVRGYPPWPARIEGLADE---------TPNRLKYHIFFYGTRETGICRQDELFPYT-EFKDKYGKG--VKR 75 (439)
Q Consensus 8 fk~GDLVWAKvkGYP~WPArV~~~pev---------k~~~~ky~V~FFGt~e~Awv~~k~L~Py~-e~kekf~K~--~Kr 75 (439)
|++||||||||+|||||||+|+++..+ +...+.|+|+|||+++|+||.+++|+||+ +.++.|... +|.
T Consensus 1 f~~GDlVwaK~~GyPwWPA~V~~~~~~p~~~l~~~~~~~~~~~~V~FFg~~~~~Wv~~~~l~pl~~~~~~~~l~~~~~k~ 80 (93)
T cd05840 1 FQPGDRVLAKVKGFPAWPAIVVPEEMLPDSVLKGKKKKNKRTYPVMFFPDGDYYWVPNKDLKPLTEEKIAKFLKKPKRKD 80 (93)
T ss_pred CCCCCEEEEeCCCCCCCCEEECChHHCCHHHHhcccCCCCCeEEEEEeCCCcEEEEChhhcccCCHHHHHHHhhcCCCCC
Confidence 789999999999999999999997532 23568899999999999999999999999 677787743 467
Q ss_pred HHHHHHHHHHH
Q psy6849 76 KFFTEALQEIE 86 (439)
Q Consensus 76 k~F~eAl~Eae 86 (439)
+.+..|.+.|.
T Consensus 81 k~l~~ay~~A~ 91 (93)
T cd05840 81 KELIKAYKAAK 91 (93)
T ss_pred HHHHHHHHHhc
Confidence 88999988775
No 11
>cd05837 MSH6_like The PWWP domain is present in MSH6, a mismatch repair protein homologous to bacterial MutS. The PWWP domain of histone-lysine N-methyltransferase, also known as Nuclear SET domain-containing protein 3, is also included. Mutations in MSH6 have been linked to increased cancer susceptibility, particularly in hereditary nonpolyposis colorectal cancer in humans. The role of the PWWP domain in MSH6 is not clear; MSH6 orthologs found in S. cerevisiae, Caenorhabditis elegans and Arabidopsis thaliana lack the PWWP domain. Histone methyltransferases (HMTases) induce the posttranslational methylation of lysine residues in histones and play a role in apoptosis. In the HMTase Whistle, the PWWP domain is necessary for HMTase activity. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain pro
Probab=99.80 E-value=1.1e-19 Score=157.49 Aligned_cols=81 Identities=28% Similarity=0.379 Sum_probs=65.8
Q ss_pred CCccCcEEEEeccCCCCCCceecCCCCC---------CCCCceEEEEEecC-CceeeecCCCcccchhhhhhh-------
Q psy6849 7 KFKTNDKVFAKVRGYPPWPARIEGLADE---------TPNRLKYHIFFYGT-RETGICRQDELFPYTEFKDKY------- 69 (439)
Q Consensus 7 ~fk~GDLVWAKvkGYP~WPArV~~~pev---------k~~~~ky~V~FFGt-~e~Awv~~k~L~Py~e~kekf------- 69 (439)
.|.+||||||||+|||||||+|++.+.. ++..+.|+|+|||+ +++|||+.++|.||.+..+..
T Consensus 2 ~~~~GdlVWaK~~g~PwWPa~V~~~~~~~~~~~~~~~~~~~~~~~V~FFG~~~~~aWv~~~~l~pf~~~~~~~~~~~~~~ 81 (110)
T cd05837 2 KYQVGDLVWAKVSGYPWWPCMVCSDPLLGTYTKTKRNKRKPRQYHVQFFGDNPERAWISEKSLKPFKGSKQFESEKGEKF 81 (110)
T ss_pred CCCCCCEEEEeCCCCCCCCEEEecccccchhhhhhhccCCCCeEEEEEcCCCCCEEEecHHHccccCCchhhhhhhhhhh
Confidence 6999999999999999999999987652 24578999999996 699999999999999766543
Q ss_pred --CCCCChHHHHHHHHHHHH
Q psy6849 70 --GKGVKRKFFTEALQEIEC 87 (439)
Q Consensus 70 --~K~~Krk~F~eAl~Eaee 87 (439)
.|++.++.++.|..+|..
T Consensus 82 ~~~K~~~~~~~~~a~~~~~~ 101 (110)
T cd05837 82 KVRKPNIKKARQKADIAIMQ 101 (110)
T ss_pred hccCCcchhHHHHHHHHHHH
Confidence 245556777777666643
No 12
>smart00293 PWWP domain with conserved PWWP motif. conservation of Pro-Trp-Trp-Pro residues
Probab=99.79 E-value=8.5e-20 Score=143.26 Aligned_cols=56 Identities=34% Similarity=0.704 Sum_probs=51.0
Q ss_pred CccCcEEEEeccCCCCCCceecCCCCC-------CCCCceEEEEEecCCceeeecCCCcccch
Q psy6849 8 FKTNDKVFAKVRGYPPWPARIEGLADE-------TPNRLKYHIFFYGTRETGICRQDELFPYT 63 (439)
Q Consensus 8 fk~GDLVWAKvkGYP~WPArV~~~pev-------k~~~~ky~V~FFGt~e~Awv~~k~L~Py~ 63 (439)
|++||||||||+|||||||+|++.+.. +...+.|+|+|||++++|||.+++|+||.
T Consensus 1 f~~GdlVwaK~~G~p~WPa~V~~~~~~~~~~~~~~~~~~~~~V~Ffg~~~~awv~~~~l~p~~ 63 (63)
T smart00293 1 FKPGDLVWAKMKGFPWWPALVVSPKETPDNIRKRKRFENLYPVLFFGDKDTAWISSSKLFPLT 63 (63)
T ss_pred CCCCCEEEEECCCCCCCCeEEcCcccCChhHhhccCCCCEEEEEEeCCCCEEEECccceeeCC
Confidence 789999999999999999999999753 34578999999999999999999999984
No 13
>cd06080 MUM1_like Mutated melanoma-associated antigen 1 (MUM-1) is a melanoma-associated antigen (MAA). MUM-1 belongs to the mutated or aberrantly expressed type of MAAs, along with antigens such as CDK4, beta-catenin, gp100-in4, p15, and N-acetylglucosaminyltransferase V. It is highly expressed in several types of human cancers. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes.
Probab=99.78 E-value=1.9e-19 Score=148.37 Aligned_cols=76 Identities=25% Similarity=0.374 Sum_probs=66.3
Q ss_pred CccCcEEEEeccCCCCCCceecCCCCCCCCCceEEEEEecCC-ceeeecCCCcccchhhhhhhCCCCChHHHHHHHHHHH
Q psy6849 8 FKTNDKVFAKVRGYPPWPARIEGLADETPNRLKYHIFFYGTR-ETGICRQDELFPYTEFKDKYGKGVKRKFFTEALQEIE 86 (439)
Q Consensus 8 fk~GDLVWAKvkGYP~WPArV~~~pevk~~~~ky~V~FFGt~-e~Awv~~k~L~Py~e~kekf~K~~Krk~F~eAl~Eae 86 (439)
|.+|||||||++|||||||+|.+... ..++|+|+|||++ +++|+..++|+||.++.+.+.|....+..++++.+|.
T Consensus 1 f~~gdlVWaK~~g~P~WPa~I~~~~~---~~~k~~V~FfG~~~~~a~~~~~~l~p~~~~~~~~ek~~~~~k~ke~~~~ai 77 (80)
T cd06080 1 FEKNDLVWAKIQGYPWWPAVIKSISR---KKQKARVNFIGDNMQSEKKGIRVVKRWLKHFDCTEKQKLTNKAKESYEQAI 77 (80)
T ss_pred CCCCCEEEEeCCCCCCCCEEEeeecC---CCCEEEEEEeCCCCceeccchhhcccccccHHHHHHHHHHHHHHHHHHHHh
Confidence 78999999999999999999998754 4689999999999 9999999999999999998887666666677776664
No 14
>cd05839 BR140_related The PWWP domain is found in the BR140 family, which includes peregrin and BR140-like proteins 1 and 2. BR140 is the only family to contain the PWWP domain at the C terminus, with PHD and bromo domains in the N-terminal region. In myeloid leukemias, BR140 is disrupted by chromosomal translocations, similar to translocations of WHSC1 in lymphoid multiple myeloma. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding proteins, that function as transcription factors regulating a variety of developmental processes.
Probab=99.59 E-value=1.6e-15 Score=132.16 Aligned_cols=79 Identities=27% Similarity=0.338 Sum_probs=60.7
Q ss_pred CccCcEEEEeccCCCCCCceecCCCCC----------------------CCCCceEEEEEecC-CceeeecCCCcccchh
Q psy6849 8 FKTNDKVFAKVRGYPPWPARIEGLADE----------------------TPNRLKYHIFFYGT-RETGICRQDELFPYTE 64 (439)
Q Consensus 8 fk~GDLVWAKvkGYP~WPArV~~~pev----------------------k~~~~ky~V~FFGt-~e~Awv~~k~L~Py~e 64 (439)
+.+|||||||++|||||||+|++++.. ......|+|.||++ ++++||+.++|.||.+
T Consensus 1 ~~pg~lVwaK~~g~P~wPa~iidp~~~~~~~~~~~~p~~~l~~~~~~~~~~~~~~~lV~FFd~~~s~~Wv~~~~l~pl~~ 80 (111)
T cd05839 1 LEPLTLVWAKCRGYPSYPALIIDPKMPRDGVFHNGVPPDVLTLGEARAQNADERLYLVLFFDNKRTWQWLPGDKLEPLGV 80 (111)
T ss_pred CCCcCEeeeeecCCCCCCeEeeCCCCCCcccccCCCCchhhhHHHHHhccCCCcEEEEEEecCCCcceecCHHHCccccc
Confidence 478999999999999999999998621 12456799999996 7999999999999996
Q ss_pred hhh----hhC---CCCChHHHHHHHHHHH
Q psy6849 65 FKD----KYG---KGVKRKFFTEALQEIE 86 (439)
Q Consensus 65 ~ke----kf~---K~~Krk~F~eAl~Eae 86 (439)
... ++. ++..++..+.|...|.
T Consensus 81 ~~~~D~~kl~~~rk~~~rk~~~~Ay~~Al 109 (111)
T cd05839 81 DETLDKLKLKEGRKPSIRKAVQKAYDDAL 109 (111)
T ss_pred chhhhhhhhhhccCHHHHHHHHHHHHHHh
Confidence 322 222 3445677777776665
No 15
>cd00183 TFIIS_I N-terminal domain (domain I) of transcription elongation factor S-II (TFIIS); similar to a domain found in elongin A and CRSP70; likely to be involved in transcription; domain I from TFIIS interacts with RNA polymerase II holoenzyme
Probab=97.16 E-value=0.0023 Score=52.22 Aligned_cols=59 Identities=24% Similarity=0.371 Sum_probs=52.6
Q ss_pred CCCHHHHHHHHHhhcCCCCChHHhhhCchHHHHHHHhhhhccCccccccCHHHHHHhhhhHHHHHHHHHHHHHHhhhhc
Q psy6849 317 HADCDECLKALDDLINLPITPLVLKKHPEIVDTCRRLQRYTGNIAEWTMTEEEAEVFTQKASQVRAKADHVYNKFKNLF 395 (439)
Q Consensus 317 ~~dv~~cl~aLdel~~L~vt~~mL~k~~e~V~tirklrry~gn~~~w~~~~~~~~~f~~~a~~Ir~kA~~ly~kfK~~F 395 (439)
..+...|+++|..|..+++|..+|+. ..|=.++.+||++.. ..|+..|..|+.+||.++
T Consensus 17 ~~~~~~~~~~L~~L~~~~it~~~L~~-T~iG~~V~~Lrkh~~-------------------~~i~~~A~~Lv~~Wk~~v 75 (76)
T cd00183 17 NEEVSRLLDLLRLLKKLPLTVEILKE-TRIGKKVNSLRKHSN-------------------EKIRKLAKALIKSWKKLV 75 (76)
T ss_pred CCCHHHHHHHHHHHhcCCCCHHHHHH-CCHHHHHHHHHcCCc-------------------HHHHHHHHHHHHHHHHhc
Confidence 36999999999999999999998877 788888999999943 569999999999999875
No 16
>smart00509 TFS2N Domain in the N-terminus of transcription elongation factor S-II (and elsewhere).
Probab=97.09 E-value=0.0027 Score=51.88 Aligned_cols=59 Identities=27% Similarity=0.456 Sum_probs=52.6
Q ss_pred CCCHHHHHHHHHhhcCCCCChHHhhhCchHHHHHHHhhhhccCccccccCHHHHHHhhhhHHHHHHHHHHHHHHhhhhc
Q psy6849 317 HADCDECLKALDDLINLPITPLVLKKHPEIVDTCRRLQRYTGNIAEWTMTEEEAEVFTQKASQVRAKADHVYNKFKNLF 395 (439)
Q Consensus 317 ~~dv~~cl~aLdel~~L~vt~~mL~k~~e~V~tirklrry~gn~~~w~~~~~~~~~f~~~a~~Ir~kA~~ly~kfK~~F 395 (439)
..+++.||++|..|..+++|..+|+. ..|=.++.+||+.. | ..|+..|..|+.+||+++
T Consensus 15 ~~~~~~~l~~L~~L~~~~~t~~~L~~-T~iG~~v~~Lrkh~-~------------------~~I~~~A~~Li~~WK~~v 73 (75)
T smart00509 15 GKEVSRCLDILKKLKKLPITVDLLEE-TRIGKKVNGLRKHK-N------------------EEIRKLAKKLIKSWKKLV 73 (75)
T ss_pred CCCHHHHHHHHHHHhcCCCCHHHHHH-CcHHHHHHHHHcCC-c------------------HHHHHHHHHHHHHHHHHh
Confidence 36899999999999999999999988 57888899999984 3 579999999999999986
No 17
>KOG1081|consensus
Probab=96.88 E-value=0.00046 Score=73.78 Aligned_cols=61 Identities=26% Similarity=0.444 Sum_probs=48.6
Q ss_pred CCCccCcEEEEeccCCCCCCceecCCCCCCCCCceEEEEEecCCceeeecCCCcccchhhhhh
Q psy6849 6 RKFKTNDKVFAKVRGYPPWPARIEGLADETPNRLKYHIFFYGTRETGICRQDELFPYTEFKDK 68 (439)
Q Consensus 6 ~~fk~GDLVWAKvkGYP~WPArV~~~pevk~~~~ky~V~FFGt~e~Awv~~k~L~Py~e~kek 68 (439)
..+..||+||-++..|+||||.|+..+-+.-....++|+|||. ++|+....+++|....+.
T Consensus 134 ~~~~~~~~vw~~vg~~~~~~c~vc~~~~~~~~~~~~~~~f~~~--~~~~~~~~~~~~~g~~~~ 194 (463)
T KOG1081|consen 134 KKREVGDLVWSKVGEYPWWPCMVCHDPLLPKGMKHDHVNFFGC--YAWTHEKRVFPYEGQSSK 194 (463)
T ss_pred ccccceeEEeEEcCcccccccceecCcccchhhccccceeccc--hhhHHHhhhhhccchHHH
Confidence 4688999999999999999999999876531112228999999 999999999999433333
No 18
>PF08711 Med26: TFIIS helical bundle-like domain; InterPro: IPR017923 Transcription factor IIS (TFIIS) is a transcription elongation factor that increases the overall transcription rate of RNA polymerase II by reactivating transcription elongation complexes that have arrested transcription. The three structural domains of TFIIS are conserved from yeast to human. The 80 or so N-terminal residues form a protein interaction domain containing a conserved motif, which has been called the LW motif because of the invariant leucine and tryptophan residues it contains. Although the N-terminal domain is not needed for transcriptional activity, a similar sequence has been identified in other transcription factors and proteins that are predominantly nuclear localized [, ]: MED26 (also known as CRSP70 and ARC70), a subunit of the Mediator complex, which is required for the activity of the enhancer-binding protein Sp1. Elongin A, a subunit of a transcription elongation factor previously known as SIII. It increases the rate of transcription by suppressing transient pausing of the elongation complex. PPP1R10, a nuclear regulatory subunit of protein phosphatase 1 that was previously known as p99, FB19 or PNUTS. PIBP, a small hypothetical protein that could be a phosphoinositide binding protein. IWS1, which is thought to function in both transcription initiation and elongation. The TFIIS N-terminal domain is a compact four-helix bundle. The hydrophobic core residues of helices 2, 3, and 4 are well conserved among TFIIS domains, although helix 1 is less conserved []. ; GO: 0003677 DNA binding, 0006351 transcription, DNA-dependent, 0005634 nucleus; PDB: 1EO0_A 3OAK_A 3NFQ_B 3O8Z_A 1WJT_A 2XPL_A 2XPO_A 2XPP_A 2XPN_A.
Probab=96.46 E-value=0.0066 Score=45.88 Aligned_cols=52 Identities=27% Similarity=0.405 Sum_probs=44.2
Q ss_pred HHHHhhcCCCCChHHhhhCchHHHHHHHhhhhccCccccccCHHHHHHhhhhHHHHHHHHHHHHHHhhhhc
Q psy6849 325 KALDDLINLPITPLVLKKHPEIVDTCRRLQRYTGNIAEWTMTEEEAEVFTQKASQVRAKADHVYNKFKNLF 395 (439)
Q Consensus 325 ~aLdel~~L~vt~~mL~k~~e~V~tirklrry~gn~~~w~~~~~~~~~f~~~a~~Ir~kA~~ly~kfK~~F 395 (439)
++|+.|..||||.-+|+. +.|-.++.+|+++.. ...|++.|..|+.+|+.+.
T Consensus 1 ~iL~~L~~l~it~~~L~~-T~IGk~V~~l~k~~~------------------~~~i~~~A~~Li~~Wk~~v 52 (53)
T PF08711_consen 1 EILKVLEKLPITVELLKS-TGIGKAVNKLRKHSE------------------NPEIRKLAKELIKKWKRIV 52 (53)
T ss_dssp HHHHHHHCSS-SHHHHHH-HSHHHHHHHHHHCTS-------------------HHHHHHHHHHHHHHHHHH
T ss_pred CHHHHhhcCCCCHHHHHh-CChhHHHHHHHcCCC------------------CHHHHHHHHHHHHHHhHhc
Confidence 478899999999988885 999999999999944 3679999999999999874
No 19
>smart00743 Agenet Tudor-like domain present in plant sequences. Domain in plant sequences with possible chromatin-associated functions.
Probab=95.17 E-value=0.019 Score=44.12 Aligned_cols=52 Identities=19% Similarity=0.211 Sum_probs=41.1
Q ss_pred CCccCcEEEEec-cCCCCCCceecCCCCCCCCCceEEEEEec--CCceeeecCCCcccc
Q psy6849 7 KFKTNDKVFAKV-RGYPPWPARIEGLADETPNRLKYHIFFYG--TRETGICRQDELFPY 62 (439)
Q Consensus 7 ~fk~GDLVWAKv-kGYP~WPArV~~~pevk~~~~ky~V~FFG--t~e~Awv~~k~L~Py 62 (439)
.|++||+|-|.. ..-.||+|+|.... ..++|.|+|.+ ....--+...+|.|-
T Consensus 2 ~~~~G~~Ve~~~~~~~~W~~a~V~~~~----~~~~~~V~~~~~~~~~~e~v~~~~LRp~ 56 (61)
T smart00743 2 DFKKGDRVEVFSKEEDSWWEAVVTKVL----GDGKYLVRYLTESEPLKETVDWSDLRPH 56 (61)
T ss_pred CcCCCCEEEEEECCCCEEEEEEEEEEC----CCCEEEEEECCCCcccEEEEeHHHcccC
Confidence 689999999998 46789999999874 25789999999 666666666666653
No 20
>TIGR01385 TFSII transcription elongation factor S-II. This model represents eukaryotic transcription elongation factor S-II. This protein allows stalled RNA transcription complexes to perform a cleavage of the nascent RNA and restart at the newly generated 3-prime end.
Probab=95.01 E-value=0.08 Score=53.95 Aligned_cols=71 Identities=14% Similarity=0.306 Sum_probs=57.0
Q ss_pred HHHHHHHhhCCCCCCHHHHHHHHHhhcCCCCChHHhhhCchHHHHHHHhhhhccCccccccCHHHHHHhhhhHHHHHHHH
Q psy6849 305 IDLRIKDSIGLEHADCDECLKALDDLINLPITPLVLKKHPEIVDTCRRLQRYTGNIAEWTMTEEEAEVFTQKASQVRAKA 384 (439)
Q Consensus 305 l~~~ik~sL~~~~~dv~~cl~aLdel~~L~vt~~mL~k~~e~V~tirklrry~gn~~~w~~~~~~~~~f~~~a~~Ir~kA 384 (439)
+...|..... ..+...||++|.+|..+++|.-+|+ ...|=.+|.+||++.. ..|+..|
T Consensus 7 ~~k~L~k~~~--~~~~~~~l~~L~~L~~~~~t~~lL~-~T~IG~~Vn~lrkh~~-------------------~~I~~lA 64 (299)
T TIGR01385 7 HAKALDKNKS--SKNVEQCLDILHQLKEFPPTEELLQ-ETKVGVKVNKLRKHPN-------------------EDISKLA 64 (299)
T ss_pred HHHHhhhhcc--CCCHHHHHHHHHHHhcCCCcHHHHh-hCchhHHHHHHHcCCc-------------------HHHHHHH
Confidence 3334444333 5788999999999999999997554 5889999999999832 4599999
Q ss_pred HHHHHHhhhhccc
Q psy6849 385 DHVYNKFKNLFTV 397 (439)
Q Consensus 385 ~~ly~kfK~~F~~ 397 (439)
..|+.+||.++.-
T Consensus 65 k~li~~WK~~v~~ 77 (299)
T TIGR01385 65 KKIIKSWKKVVDK 77 (299)
T ss_pred HHHHHHHHHHHhh
Confidence 9999999999964
No 21
>smart00333 TUDOR Tudor domain. Domain of unknown function present in several RNA-binding proteins. 10 copies in the Drosophila Tudor protein. Initial proposal that the survival motor neuron gene product contain a Tudor domain are corroborated by more recent database search techniques such as PSI-BLAST (unpublished).
Probab=94.57 E-value=0.038 Score=41.51 Aligned_cols=53 Identities=26% Similarity=0.363 Sum_probs=46.2
Q ss_pred CCccCcEEEEeccCCCCCCceecCCCCCCCCCceEEEEEecCCceeeecCCCcccch
Q psy6849 7 KFKTNDKVFAKVRGYPPWPARIEGLADETPNRLKYHIFFYGTRETGICRQDELFPYT 63 (439)
Q Consensus 7 ~fk~GDLVWAKvkGYP~WPArV~~~pevk~~~~ky~V~FFGt~e~Awv~~k~L~Py~ 63 (439)
.|++|++|-|+...=-|.+|+|..... .+.|.|+|...+...||+..+|.+..
T Consensus 2 ~~~~G~~~~a~~~d~~wyra~I~~~~~----~~~~~V~f~D~G~~~~v~~~~l~~l~ 54 (57)
T smart00333 2 TFKVGDKVAARWEDGEWYRARIIKVDG----EQLYEVFFIDYGNEEVVPPSDLRPLP 54 (57)
T ss_pred CCCCCCEEEEEeCCCCEEEEEEEEECC----CCEEEEEEECCCccEEEeHHHeecCC
Confidence 588999999999666799999999852 28899999999999999999998765
No 22
>KOG1904|consensus
Probab=92.58 E-value=0.04 Score=59.58 Aligned_cols=65 Identities=5% Similarity=-0.021 Sum_probs=61.2
Q ss_pred hhhhhHHHHHHHHHHHhhCCCCCCHHHHHHHHHhhcCCCCChHHhhhCchHHHHHHHhhhhccCcc
Q psy6849 296 LKLEVCILDIDLRIKDSIGLEHADCDECLKALDDLINLPITPLVLKKHPEIVDTCRRLQRYTGNIA 361 (439)
Q Consensus 296 L~~E~~l~~l~~~ik~sL~~~~~dv~~cl~aLdel~~L~vt~~mL~k~~e~V~tirklrry~gn~~ 361 (439)
-.++ .+..|++++...+..+..|+.+|+..|+++..+.++..|++++.+.+.|++.+++|.||+.
T Consensus 431 ~~~~-~~~~v~~~~~~~~~~~~~~~~e~~~s~~k~~~~~~~~~~~q~~~~~~a~~~~i~~~~~~~~ 495 (496)
T KOG1904|consen 431 EEVP-GSCSVVNKQEQENNQDTKNAKENQGSLGKEAARVNQAQMRQNPVDSAAESKVIVQEAGPAP 495 (496)
T ss_pred ccCc-cchhccccccccccccchhhhhhcccHHHHHhHhhhhhhhcchhhHHHHHHHHHHhhccCC
Confidence 3456 8999999999999999999999999999999999999999999999999999999999853
No 23
>cd04508 TUDOR Tudor domains are found in many eukaryotic organisms and have been implicated in protein-protein interactions in which methylated protein substrates bind to these domains. For example, the Tudor domain of Survival of Motor Neuron (SMN) binds to symmetrically dimethylated arginines of arginine-glycine (RG) rich sequences found in the C-terminal tails of Sm proteins. The SMN protein is linked to spinal muscular atrophy. Another example is the tandem tudor domains of 53BP1, which bind to histone H4 specifically dimethylated at Lys20 (H4-K20me2). 53BP1 is a key transducer of the DNA damage checkpoint signal.
Probab=91.93 E-value=0.15 Score=37.03 Aligned_cols=47 Identities=23% Similarity=0.290 Sum_probs=39.8
Q ss_pred CcEEEEeccC-CCCCCceecCCCCCCCCCceEEEEEecCCceeeecCCCccc
Q psy6849 11 NDKVFAKVRG-YPPWPARIEGLADETPNRLKYHIFFYGTRETGICRQDELFP 61 (439)
Q Consensus 11 GDLVWAKvkG-YP~WPArV~~~pevk~~~~ky~V~FFGt~e~Awv~~k~L~P 61 (439)
|+++.|+... --|.+|+|.... ..+.|.|+|..-+....|+.++|.|
T Consensus 1 G~~c~a~~~~d~~wyra~V~~~~----~~~~~~V~f~DyG~~~~v~~~~l~~ 48 (48)
T cd04508 1 GDLCLAKYSDDGKWYRAKITSIL----SDGKVEVFFVDYGNTEVVPLSDLRP 48 (48)
T ss_pred CCEEEEEECCCCeEEEEEEEEEC----CCCcEEEEEEcCCCcEEEeHHHcCC
Confidence 7899999986 788999999874 2678999999988888888877754
No 24
>PF11717 Tudor-knot: RNA binding activity-knot of a chromodomain ; PDB: 2EKO_A 2RO0_A 2RNZ_A 1WGS_A 3E9G_A 3E9F_A 2K3X_A 2K3Y_A 2EFI_A 2F5K_F ....
Probab=90.01 E-value=0.25 Score=37.91 Aligned_cols=52 Identities=15% Similarity=0.290 Sum_probs=39.6
Q ss_pred CccCcEEEEeccCCCCCCceecCCCCCCCCCceEEEEEecCCce--eeecCCCcc
Q psy6849 8 FKTNDKVFAKVRGYPPWPARIEGLADETPNRLKYHIFFYGTRET--GICRQDELF 60 (439)
Q Consensus 8 fk~GDLVWAKvkGYP~WPArV~~~pevk~~~~ky~V~FFGt~e~--Awv~~k~L~ 60 (439)
|.+|+.|+++...--+-+|+|..... .+....|.|.|.|.+.+ -||+.++|.
T Consensus 1 ~~vG~~v~~~~~~~~~y~A~I~~~r~-~~~~~~YyVHY~g~nkR~DeWV~~~~i~ 54 (55)
T PF11717_consen 1 FEVGEKVLCKYKDGQWYEAKILDIRE-KNGEPEYYVHYQGWNKRLDEWVPESRIR 54 (55)
T ss_dssp --TTEEEEEEETTTEEEEEEEEEEEE-CTTCEEEEEEETTSTGCC-EEEETTTEE
T ss_pred CCcCCEEEEEECCCcEEEEEEEEEEe-cCCCEEEEEEcCCCCCCceeeecHHHcc
Confidence 67999999999666777999998743 23446899999996655 699988774
No 25
>PF15057 DUF4537: Domain of unknown function (DUF4537)
Probab=85.69 E-value=1.4 Score=39.29 Aligned_cols=60 Identities=17% Similarity=0.255 Sum_probs=47.3
Q ss_pred ccCCCccCcEEEEecc--CCCCCCceecCCCCC-CCCCceEEEEEecCCceeeecCCCcccchh
Q psy6849 4 VERKFKTNDKVFAKVR--GYPPWPARIEGLADE-TPNRLKYHIFFYGTRETGICRQDELFPYTE 64 (439)
Q Consensus 4 ~~~~fk~GDLVWAKvk--GYP~WPArV~~~pev-k~~~~ky~V~FFGt~e~Awv~~k~L~Py~e 64 (439)
+....++||-|+|+.. ++.+=||+|..-++. ......|.|.||.. ..+.|+...++....
T Consensus 52 ~~~~L~~GD~VLA~~~~~~~~Y~Pg~V~~~~~~~~~~~~~~~V~f~ng-~~~~vp~~~~~~I~~ 114 (124)
T PF15057_consen 52 MRHSLQVGDKVLAPWEPDDCRYGPGTVIAGPERRASEDKEYTVRFYNG-KTAKVPRGEVIWISP 114 (124)
T ss_pred ccCcCCCCCEEEEecCcCCCEEeCEEEEECccccccCCceEEEEEECC-CCCccchhhEEECCH
Confidence 4678999999999963 788889999976653 34678899999975 377888888876553
No 26
>KOG1105|consensus
Probab=81.92 E-value=5.4 Score=40.95 Aligned_cols=62 Identities=19% Similarity=0.278 Sum_probs=49.2
Q ss_pred CCCHHHHHHHHHhhcCCCCChHHhhhCchHHHHHHHhhhhccCccccccCHHHHHHhhhhHHHHHHHHHHHHHHhhhhcc
Q psy6849 317 HADCDECLKALDDLINLPITPLVLKKHPEIVDTCRRLQRYTGNIAEWTMTEEEAEVFTQKASQVRAKADHVYNKFKNLFT 396 (439)
Q Consensus 317 ~~dv~~cl~aLdel~~L~vt~~mL~k~~e~V~tirklrry~gn~~~w~~~~~~~~~f~~~a~~Ir~kA~~ly~kfK~~F~ 396 (439)
+.|+..||++|.+|..+|||. -|++-..+.-.+..+|+-.-| ..|+..|..|...||.++.
T Consensus 19 ~~n~~~~ld~l~~L~~~pvt~-ell~~Tr~g~~vn~~~Kk~~n------------------~ev~~~ak~Lik~Wkk~~~ 79 (296)
T KOG1105|consen 19 SKNVEAALDLLKRLKKIPVTL-ELLQETRTGMGVNEVLKKHKN------------------EEVRSLAKKLIKSWKKLVD 79 (296)
T ss_pred cccHHHHHHHHHHHHhcccHH-HHHHHhhHHHHHHHHHHhCCC------------------HHHHHHHHHHHHHHHHHhh
Confidence 479999999999999999955 455556666676655555443 4599999999999999996
Q ss_pred c
Q psy6849 397 V 397 (439)
Q Consensus 397 ~ 397 (439)
-
T Consensus 80 ~ 80 (296)
T KOG1105|consen 80 K 80 (296)
T ss_pred c
Confidence 4
No 27
>cd03572 ENTH_epsin_related ENTH domain, Epsin Related family; composed of hypothetical proteins containing an ENTH-like domain. The epsin N-terminal homology (ENTH) domain is an evolutionarily conserved protein module found primarily in proteins that participate in clathrin-mediated endocytosis. A set of proteins previously designated as harboring an ENTH domain in fact contains a highly similar, yet unique module referred to as an AP180 N-terminal homology (ANTH) domain. ENTH and ANTH (E/ANTH) domains are structurally similar to the VHS domain and are composed of a superhelix of eight alpha helices. E/ANTH domains bind both inositol phospholipids and proteins and contribute to the nucleation and formation of clathrin coats on membranes. ENTH domains also function in the development of membrane curvature through lipid remodeling during the formation of clathrin-coated vesicles. E/ANTH-bearing proteins have recently been shown to function with adaptor protein-1 and GGA adaptors at the t
Probab=74.53 E-value=9.9 Score=34.17 Aligned_cols=79 Identities=11% Similarity=0.074 Sum_probs=53.9
Q ss_pred hHHHHHHHHHHHhhCCCCCCH-HHHHHHHHhhcCCC--CChHHhhhCchHHHHHHHhhhhccCccccccCHHHHHHhhhh
Q psy6849 300 VCILDIDLRIKDSIGLEHADC-DECLKALDDLINLP--ITPLVLKKHPEIVDTCRRLQRYTGNIAEWTMTEEEAEVFTQK 376 (439)
Q Consensus 300 ~~l~~l~~~ik~sL~~~~~dv-~~cl~aLdel~~L~--vt~~mL~k~~e~V~tirklrry~gn~~~w~~~~~~~~~f~~~ 376 (439)
..+.++-.-|-..|.-..+.| -|||.+|+-|-.=- -=..-|++|..+|.+ ++.|.|..++=. =.+.
T Consensus 34 ~~~~ei~d~L~kRL~~~~~hVK~K~Lrilk~l~~~G~~~f~~~~~~~~~~Ik~---~~~f~g~~Dp~~--------Gd~~ 102 (122)
T cd03572 34 GSCQELLEYLLKRLKRSSPHVKLKVLKIIKHLCEKGNSDFKRELQRNSAQIRE---CANYKGPPDPLK--------GDSL 102 (122)
T ss_pred HHHHHHHHHHHHHhcCCCCcchHHHHHHHHHHHhhCCHHHHHHHHHhHHHHHH---HHHcCCCCCccc--------Ccch
Confidence 346677777777788788999 99999999875442 123456777765555 888888433201 1123
Q ss_pred HHHHHHHHHHHHH
Q psy6849 377 ASQVRAKADHVYN 389 (439)
Q Consensus 377 a~~Ir~kA~~ly~ 389 (439)
-+.||..|+.+.+
T Consensus 103 ~~~VR~~A~El~~ 115 (122)
T cd03572 103 NEKVREEAQELIK 115 (122)
T ss_pred hHHHHHHHHHHHH
Confidence 6899999998864
No 28
>COG5475 Uncharacterized small protein [Function unknown]
Probab=71.61 E-value=8.5 Score=30.49 Aligned_cols=53 Identities=23% Similarity=0.220 Sum_probs=43.1
Q ss_pred CCCccCcEEEEeccCCCCCCceecCCCCCCCCCceEEEEEec--CCceeeecCCCcccchhh
Q psy6849 6 RKFKTNDKVFAKVRGYPPWPARIEGLADETPNRLKYHIFFYG--TRETGICRQDELFPYTEF 65 (439)
Q Consensus 6 ~~fk~GDLVWAKvkGYP~WPArV~~~pevk~~~~ky~V~FFG--t~e~Awv~~k~L~Py~e~ 65 (439)
-.|..||.|--|-.| |.+++..-. ..+.|..+||. ...++-..++.|.||...
T Consensus 3 ~~FstgdvV~lKsGG----P~Mtvs~~s---s~Gmy~C~Wf~g~g~~~~~F~ed~Lvp~~a~ 57 (60)
T COG5475 3 MSFSTGDVVTLKSGG----PRMTVSGYS---SDGMYECRWFDGYGVKREAFHEDELVPGEAS 57 (60)
T ss_pred ceeecCcEEEeecCC----ceEEEeccc---cCCeEEEEEecCCCcccccccccceeccccC
Confidence 479999999999888 778776521 45899999998 678888899999998754
No 29
>KOG1080|consensus
Probab=67.54 E-value=3.7 Score=48.29 Aligned_cols=87 Identities=24% Similarity=0.379 Sum_probs=65.4
Q ss_pred CCCccCcEEEEec-cCCCCCCceecCCCC-----C--CCCCceEEEEEec------CCceeeecCCCcccchhhhhhhCC
Q psy6849 6 RKFKTNDKVFAKV-RGYPPWPARIEGLAD-----E--TPNRLKYHIFFYG------TRETGICRQDELFPYTEFKDKYGK 71 (439)
Q Consensus 6 ~~fk~GDLVWAKv-kGYP~WPArV~~~pe-----v--k~~~~ky~V~FFG------t~e~Awv~~k~L~Py~e~kekf~K 71 (439)
..+..|++||++. ++.+.|||.+..... + .......+|.||| ...++|+....+.+|......+.+
T Consensus 190 ~~~~~e~~~~~~~~~~~~~~~a~~~d~~~~~~~~v~as~~~~~~~~~~~~~s~~~~~~~~~~~r~~m~~~~~~~~~~~~~ 269 (1005)
T KOG1080|consen 190 EEFTVGDLVWAKSGRNEPPWPAIVIDPIRQAPRGVLASCLPVAACVMFFGNSGVPTERDYAWVRRGMERPFSRPVRPFQD 269 (1005)
T ss_pred cccccchhhhcccccCCcccccceeehhhcchhhhhccCcchhhhheeeeccCCccccchhhhhhccccccchhhhhccc
Confidence 5789999999997 789999998887743 1 1234567788888 347799999999999976655553
Q ss_pred -----CCChHHHHHHHHHHHHHhCCC
Q psy6849 72 -----GVKRKFFTEALQEIECDFGTP 92 (439)
Q Consensus 72 -----~~Krk~F~eAl~Eaee~l~n~ 92 (439)
..+...|.+++.++.++...+
T Consensus 270 ~~~~~~~~~~~~e~~~~~~~~~e~~~ 295 (1005)
T KOG1080|consen 270 QTELKREKARSFEQALEEAGLAEQGN 295 (1005)
T ss_pred cccccccCccchhHHHHHhhcccccc
Confidence 346788999999887664433
No 30
>cd04370 BAH BAH, or Bromo Adjacent Homology domain (also called ELM1 and BAM for Bromo Adjacent Motif). BAH domains have first been described as domains found in the polybromo protein and Yeast Rsc1/Rsc2 (Remodeling of the Structure of Chromatin). They also occur in mammalian DNA methyltransferases and the MTA1 subunits of histone deacetylase complexes. A BAH domain is also found in Yeast Sir3p and in the origin receptor complex protein 1 (Orc1p), where it was found to interact with the N-terminal lobe of the silence information regulator 1 protein (Sir1p), confirming the initial hypothesis that BAH plays a role in protein-protein interactions.
Probab=61.86 E-value=8.8 Score=32.16 Aligned_cols=41 Identities=24% Similarity=0.475 Sum_probs=32.6
Q ss_pred CCccCcEEEEeccC----CCCCCceecCCCCCCCCCceEEEEEec
Q psy6849 7 KFKTNDKVFAKVRG----YPPWPARIEGLADETPNRLKYHIFFYG 47 (439)
Q Consensus 7 ~fk~GDLVWAKvkG----YP~WPArV~~~pevk~~~~ky~V~FFG 47 (439)
.|++||.|..+-.. -|.|.|+|............+.|+||=
T Consensus 3 ~y~vgd~V~v~~~~~~~~~~~~i~~I~~i~~~~~~~~~~~v~wf~ 47 (123)
T cd04370 3 TYEVGDSVYVEPDDSIKSDPPYIARIEELWEDTNGSKQVKVRWFY 47 (123)
T ss_pred EEecCCEEEEecCCcCCCCCCEEEEEeeeeECCCCCEEEEEEEEE
Confidence 58899999999866 689999998886533344678898887
No 31
>PF08169 RBB1NT: RBB1NT (NUC162) domain; InterPro: IPR012603 This domain is found N-terminal to the ARID/BRIGHT domain in DNA-binding proteins of the Retinoblastoma-binding protein 1 family [].; PDB: 2YRV_A.
Probab=54.20 E-value=13 Score=32.38 Aligned_cols=79 Identities=10% Similarity=0.003 Sum_probs=44.0
Q ss_pred cCcEEEEec--cCCCCCCceecCCC---CCCCCCceEEEEEecCCceeeecCCCcccchhhhhhhCCCCChHHHHHHHHH
Q psy6849 10 TNDKVFAKV--RGYPPWPARIEGLA---DETPNRLKYHIFFYGTRETGICRQDELFPYTEFKDKYGKGVKRKFFTEALQE 84 (439)
Q Consensus 10 ~GDLVWAKv--kGYP~WPArV~~~p---evk~~~~ky~V~FFGt~e~Awv~~k~L~Py~e~kekf~K~~Krk~F~eAl~E 84 (439)
+|-+|-... +.-.|.||.|+.+. ++.-...+|.|+=|-++-|+-|..+++..|..... ++.-...+.|++.
T Consensus 8 lGkVV~V~~~~~k~~W~PALVVsPsc~ddv~VkKD~~lVRSFkD~KfysV~rkd~~e~~~~~~----~k~e~s~k~al~~ 83 (96)
T PF08169_consen 8 LGKVVCVESTKKKTSWFPALVVSPSCNDDVTVKKDQCLVRSFKDGKFYSVARKDVREFDIDSL----PKSESSLKPALDK 83 (96)
T ss_dssp TTSEEEEE-SS-SS-EEEEEEE--SS-SS----TT-EEEEESSS--EEEE-TTTEE---STTS-----HHHHHH-HHHHH
T ss_pred cCcEEEEEcCCCCCceeeEEEEcCCccceeeeccceEEEEEeccCceEEEEhhhhhhcccccC----CcccchhhHHHHH
Confidence 566666654 44579999999974 23334688999999999999999999988773211 1112356788888
Q ss_pred HHHHhCCC
Q psy6849 85 IECDFGTP 92 (439)
Q Consensus 85 aee~l~n~ 92 (439)
|..++...
T Consensus 84 A~~Fl~~~ 91 (96)
T PF08169_consen 84 ASTFLKTG 91 (96)
T ss_dssp HHHHHHS-
T ss_pred HHHHHhcC
Confidence 88887653
No 32
>PF01417 ENTH: ENTH domain; InterPro: IPR001026 The ENTH (Epsin N-terminal homology) domain is approximately 150 amino acids in length and is always found located at the N-termini of proteins. The domain forms a compact globular structure, composed of 9 alpha-helices connected by loops of varying length. The general topology is determined by three helical hairpins that are stacked consecutively with a right hand twist []. An N-terminal helix folds back, forming a deep basic groove that forms the binding pocket for the Ins(1,4,5)P3 ligand []. The ligand is coordinated by residues from surrounding alpha-helices and all three phosphates are multiply coordinated. The coordination of Ins(1,4,5)P3 suggests that ENTH is specific for particular head groups. Proteins containing this domain have been found to bind PtdIns(4,5)P2 and PtdIns(1,4,5)P3 suggesting that the domain may be a membrane interacting module. The main function of proteins containing this domain appears to be to act as accessory clathrin adaptors in endocytosis, Epsin is able to recruit and promote clathrin polymerisation on a lipid monolayer, but may have additional roles in signalling and actin regulation []. Epsin causes a strong degree of membrane curvature and tubulation, even fragmentation of membranes with a high PtdIns(4,5)P2 content. Epsin binding to membranes facilitates their deformation by insertion of the N-terminal helix into the outer leaflet of the bilayer, pushing the head groups apart. This would reduce the energy needed to curve the membrane into a vesicle, making it easier for the clathrin cage to fix and stabilise the curved membrane. This points to a pioneering role for epsin in vesicle budding as it provides both a driving force and a link between membrane invagination and clathrin polymerisation. ; PDB: 1H0A_A 1EYH_A 1EDU_A 2QY7_B 1XGW_A 2V8S_E 1VDY_A 2DCP_A 1INZ_A 3ONL_B ....
Probab=54.02 E-value=54 Score=28.62 Aligned_cols=81 Identities=17% Similarity=0.141 Sum_probs=45.1
Q ss_pred hHHHHHHHHHHHhh-CCCCCCHHHHHHHHHhhcCC--CCChHHhhhCchHHHHHHHhhhhccCccccccCHHHHHHhhhh
Q psy6849 300 VCILDIDLRIKDSI-GLEHADCDECLKALDDLINL--PITPLVLKKHPEIVDTCRRLQRYTGNIAEWTMTEEEAEVFTQK 376 (439)
Q Consensus 300 ~~l~~l~~~ik~sL-~~~~~dv~~cl~aLdel~~L--~vt~~mL~k~~e~V~tirklrry~gn~~~w~~~~~~~~~f~~~ 376 (439)
..+..+-.-|...| ..++.+....+++|.=|.-| .-+...+.--.+.++.|+.|+.|.+ ... +- ...
T Consensus 35 ~~~~~I~~~l~kRL~~~~~k~wr~~~KaL~ll~yLl~nG~~~~~~~~~~~~~~I~~l~~f~~-~d~-~g--------~d~ 104 (125)
T PF01417_consen 35 KDCQEIMDVLWKRLSKSDGKNWRHVYKALTLLEYLLKNGSERFVDELRDHIDIIRELQDFQY-VDP-KG--------KDQ 104 (125)
T ss_dssp HHHHHHHHHHHHHHHSSTSSGHHHHHHHHHHHHHHHHHS-HHHHHHHHHTHHHHHGGGG----BBT-TS--------TBH
T ss_pred ccHHHHHHHHHHHHHhcCCcchhHHHHHHHHHHHHHHHCCHHHHHHHHHHHHHHhhcceeec-cCC-CC--------ccH
Confidence 78888888899998 55566665544444433221 1111111111234567777888877 322 11 123
Q ss_pred HHHHHHHHHHHHHH
Q psy6849 377 ASQVRAKADHVYNK 390 (439)
Q Consensus 377 a~~Ir~kA~~ly~k 390 (439)
.+.||.+|..|++=
T Consensus 105 ~~~VR~~A~~i~~l 118 (125)
T PF01417_consen 105 GQNVREKAKEILEL 118 (125)
T ss_dssp HHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHHH
Confidence 67899999999863
No 33
>PF09465 LBR_tudor: Lamin-B receptor of TUDOR domain; InterPro: IPR019023 The Lamin-B receptor is a chromatin and lamin binding protein in the inner nuclear membrane. It is one of the integral inner nuclear envelope membrane proteins responsible for targeting nuclear membranes to chromatin, being a downstream effector of Ran, a small Ras-like nuclear GTPase which regulates NE assembly. Lamin-B receptor interacts with importin beta, a Ran-binding protein, thereby directly contributing to the fusion of membrane vesicles and the formation of the nuclear envelope []. ; PDB: 2L8D_A 2DIG_A.
Probab=52.44 E-value=19 Score=28.34 Aligned_cols=50 Identities=12% Similarity=0.313 Sum_probs=32.5
Q ss_pred CCCccCcEEEEeccCCCCC-CceecCCCCCCCCCceEEEEEecCCceeeecCCCc
Q psy6849 6 RKFKTNDKVFAKVRGYPPW-PARIEGLADETPNRLKYHIFFYGTRETGICRQDEL 59 (439)
Q Consensus 6 ~~fk~GDLVWAKvkGYP~W-PArV~~~pevk~~~~ky~V~FFGt~e~Awv~~k~L 59 (439)
..|..|+.|-++..|-..| +|+|..-.. ....|.|.|=. ++.--|+..+|
T Consensus 4 ~k~~~Ge~V~~rWP~s~lYYe~kV~~~d~---~~~~y~V~Y~D-Gtel~lke~di 54 (55)
T PF09465_consen 4 RKFAIGEVVMVRWPGSSLYYEGKVLSYDS---KSDRYTVLYED-GTELELKENDI 54 (55)
T ss_dssp SSS-SS-EEEEE-TTTS-EEEEEEEEEET---TTTEEEEEETT-S-EEEEECCCE
T ss_pred ccccCCCEEEEECCCCCcEEEEEEEEecc---cCceEEEEEcC-CCEEEeccccc
Confidence 6899999999999998888 999988532 46788888754 33344444443
No 34
>PRK05260 condesin subunit F; Provisional
Probab=46.69 E-value=32 Score=37.05 Aligned_cols=125 Identities=13% Similarity=0.131 Sum_probs=78.7
Q ss_pred HHHhhhhhhhhHHHHHHHHHHHhhCCC-CCCHHHHHHHHHhhcCC--CCChHHhhhCchHHHHHHHhhhhccCccccccC
Q psy6849 290 KALKTQLKLEVCILDIDLRIKDSIGLE-HADCDECLKALDDLINL--PITPLVLKKHPEIVDTCRRLQRYTGNIAEWTMT 366 (439)
Q Consensus 290 ~~k~~~L~~E~~l~~l~~~ik~sL~~~-~~dv~~cl~aLdel~~L--~vt~~mL~k~~e~V~tirklrry~gn~~~w~~~ 366 (439)
...++.=.|+.+.+....+|-.-|+.+ +..|..|..+|++...= .+.-.++-.--.++.-|-+|+.-+-+..
T Consensus 170 ~Idl~QR~mDeqQ~~vk~eIA~LL~qdW~~AI~~Ce~LLdEtsgtLRELqdtL~aagD~lqaqLl~IQ~~~~~~~----- 244 (440)
T PRK05260 170 SIDLTQRLMDEQQQQVKDDIAQLLNKDWRAAISSCELLLSETSGTLRELQDTLEAAGDKLQANLLRIQDATMGRD----- 244 (440)
T ss_pred hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHhHHHHHHHHHHHHHHHhcCC-----
Confidence 334445557889999999999999998 88999999999986431 2222222222233333334443332211
Q ss_pred HHHHHHhhhhHHHHHHHHHHHHHHhhhhcccCC-CcchHHHHHHHHHHHHHHhcCCCccccee
Q psy6849 367 EEEAEVFTQKASQVRAKADHVYNKFKNLFTVPA-GETFWDTFSKQLVTFNDVTRNMPAHDLFS 428 (439)
Q Consensus 367 ~~~~~~f~~~a~~Ir~kA~~ly~kfK~~F~~~e-g~~Fw~~F~~~v~~fk~~t~~~~~~~~~~ 428 (439)
....|-+....|.+|.=.+-.-+- --.-|..|-..|-.|.+.+-+|..|..|+
T Consensus 245 ---------~l~~vd~~~~~Lq~kLDRI~sWGqqaidlWigYdrhVHkfIRtaIdmDkNR~fs 298 (440)
T PRK05260 245 ---------DLDFVDRLVFDLQSKLDRIISWGQQAIDLWIGYDRHVHKFIRTAIDMDKNRVFS 298 (440)
T ss_pred ---------cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcCcchhHH
Confidence 033344444445444433322121 15789999999999999999999998886
No 35
>smart00439 BAH Bromo adjacent homology domain.
Probab=45.72 E-value=22 Score=29.88 Aligned_cols=41 Identities=17% Similarity=0.265 Sum_probs=31.7
Q ss_pred CCccCcEEEEeccC--CCCCCceecCCCCCCCCC-ceEEEEEec
Q psy6849 7 KFKTNDKVFAKVRG--YPPWPARIEGLADETPNR-LKYHIFFYG 47 (439)
Q Consensus 7 ~fk~GDLVWAKvkG--YP~WPArV~~~pevk~~~-~ky~V~FFG 47 (439)
.|++||.|..+-.. =|+|-|+|.......... ..+.|+||=
T Consensus 1 ~~~vgd~V~v~~~~~~~~~~i~~I~~i~~~~~~~~~~~~v~Wf~ 44 (120)
T smart00439 1 TIRVGDFVLVEPDDADEPYYIGRIEEIFETKKNSEKMVRVRWFY 44 (120)
T ss_pred CcccCCEEEEeCCCCCCCCEEEEEEEEEECCCCCEEEEEEEEEE
Confidence 37899999999764 689999999886543334 488999995
No 36
>cd04721 BAH_plant_1 BAH, or Bromo Adjacent Homology domain, plant-specific sub-family with unknown function. BAH domains are found in a variety of proteins playing roles in transcriptional silencing and the remodeling of chromatin. It is assumed that in most or all of these instances the BAH domain mediates protein-protein interactions.
Probab=44.24 E-value=21 Score=32.12 Aligned_cols=43 Identities=16% Similarity=0.216 Sum_probs=34.1
Q ss_pred CCccCcEEEEeccCCCCCCceecCCCCCCCCCceEEEEEecCC
Q psy6849 7 KFKTNDKVFAKVRGYPPWPARIEGLADETPNRLKYHIFFYGTR 49 (439)
Q Consensus 7 ~fk~GDLVWAKvkGYP~WPArV~~~pevk~~~~ky~V~FFGt~ 49 (439)
.+++||.|..+-.+=..|=|+|....+.......+.|+||-..
T Consensus 7 ~i~vGD~V~v~~~~~~~~va~Ie~i~ed~~g~~~v~v~WF~~p 49 (130)
T cd04721 7 TISVHDFVYVLSEEEDRYVAYIEDLYEDKKGSKMVKVRWFHTT 49 (130)
T ss_pred EEECCCEEEEeCCCCCcEEEEEEEEEEcCCCCEEEEEEEecCH
Confidence 4899999999987777789999988654334568999999843
No 37
>PF05641 Agenet: Agenet domain; InterPro: IPR008395 This domain is related to the TUDOR domain IPR008191 from INTERPRO []. The function of the agenet domain is unknown. This signature matches one of the two Agenet domains in the FMR proteins [].; GO: 0003723 RNA binding; PDB: 2BKD_N 3O8V_A 3KUF_A 3H8Z_A.
Probab=43.25 E-value=17 Score=28.75 Aligned_cols=51 Identities=20% Similarity=0.259 Sum_probs=31.4
Q ss_pred CccCcEEEEec--cCC--CCCCceecCCCCCCCCCceEEEEEec--CC------ceeeecCCCcccc
Q psy6849 8 FKTNDKVFAKV--RGY--PPWPARIEGLADETPNRLKYHIFFYG--TR------ETGICRQDELFPY 62 (439)
Q Consensus 8 fk~GDLVWAKv--kGY--P~WPArV~~~pevk~~~~ky~V~FFG--t~------e~Awv~~k~L~Py 62 (439)
|+.|+.|=..- .|| .|+||.|..... .++|.|.|-. .. -.-||....|.|-
T Consensus 1 F~~G~~VEV~s~e~g~~gaWf~a~V~~~~~----~~~~~V~Y~~~~~~~~~~~~l~e~V~~~~iRP~ 63 (68)
T PF05641_consen 1 FKKGDEVEVSSDEDGFRGAWFPATVLKENG----DDKYLVEYDDLPDEDGESPPLKEWVDARRIRPC 63 (68)
T ss_dssp --TT-EEEEEE-SBTT--EEEEEEEEEEET----T-EEEEEETT-SS--------EEEEEGGGEEE-
T ss_pred CCCCCEEEEEEcCCCCCcEEEEEEEEEeCC----CcEEEEEECCcccccccccccEEEechheEECc
Confidence 67888886664 555 699999998842 2299999953 11 2556666666653
No 38
>PLN00104 MYST -like histone acetyltransferase; Provisional
Probab=42.29 E-value=27 Score=37.92 Aligned_cols=55 Identities=13% Similarity=0.224 Sum_probs=44.3
Q ss_pred CCCccCcEEEEeccCCCCC-CceecCCCCC---CCCCceEEEEEecCCce--eeecCCCcc
Q psy6849 6 RKFKTNDKVFAKVRGYPPW-PARIEGLADE---TPNRLKYHIFFYGTRET--GICRQDELF 60 (439)
Q Consensus 6 ~~fk~GDLVWAKvkGYP~W-PArV~~~pev---k~~~~ky~V~FFGt~e~--Awv~~k~L~ 60 (439)
..|.+|+.|.|...+-+-| +|.|+..... ......|.|.|-|.+.+ -||..+.|-
T Consensus 52 ~~~~VGekVla~~~~Dg~~~~A~VI~~R~~~~~~~~~~~YYVHY~g~nrRlDEWV~~~rLd 112 (450)
T PLN00104 52 LPLEVGTRVMCRWRFDGKYHPVKVIERRRGGSGGPNDYEYYVHYTEFNRRLDEWVKLEQLD 112 (450)
T ss_pred ceeccCCEEEEEECCCCCEEEEEEEEEeccCCCCCCCceEEEEEecCCccHhhccCHhhcc
Confidence 4699999999999988888 8999887531 12235799999998888 899999884
No 39
>smart00561 MBT Present in Drosophila Scm, l(3)mbt, and vertebrate SCML2. Present in Drosophila Scm, l(3)mbt, and vertebrate SCML2. These proteins are involved in transcriptional regulation.
Probab=41.58 E-value=39 Score=28.89 Aligned_cols=53 Identities=19% Similarity=0.201 Sum_probs=40.9
Q ss_pred CCCccCcEEEEeccCC--CCCCceecCCCCCCCCCceEEEEEecCCce--eeecCC--Ccccch
Q psy6849 6 RKFKTNDKVFAKVRGY--PPWPARIEGLADETPNRLKYHIFFYGTRET--GICRQD--ELFPYT 63 (439)
Q Consensus 6 ~~fk~GDLVWAKvkGY--P~WPArV~~~pevk~~~~ky~V~FFGt~e~--Awv~~k--~L~Py~ 63 (439)
..|++|-.+=|--... -.|||.|... .+..+.|.|-|..+. -|+... +|+|--
T Consensus 26 ~~F~vGmkLEavD~~~~~~i~vAtV~~v-----~g~~l~v~~dg~~~~~D~W~~~~S~~I~PvG 84 (96)
T smart00561 26 NGFKVGMKLEAVDPRNPSLICVATVVEV-----KGYRLLLHFDGWDDKYDFWCDADSPDIHPVG 84 (96)
T ss_pred CcccCCCEEEEECCCCCceEEEEEEEEE-----ECCEEEEEEccCCCcCCEEEECCCCCcccCc
Confidence 5799999999987664 5789999987 457899999996555 688664 666643
No 40
>cd05503 Bromo_BAZ2A_B_like Bromodomain, BAZ2A/BAZ2B_like subfamily. Bromo adjacent to zinc finger 2A (BAZ2A) and 2B (BAZ2B) were identified as a novel human bromodomain gene by cDNA library screening. BAZ2A is also known as Tip5 (Transcription termination factor I-interacting protein 5) and hWALp3. The proteins may play roles in transcriptional regulation. Human Tip5 is part of a complex termed NoRC (nucleolar remodeling complex), which induces nucleosome sliding and may play a role in the regulation of the rDNA locus. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
Probab=40.81 E-value=85 Score=26.44 Aligned_cols=55 Identities=24% Similarity=0.262 Sum_probs=37.0
Q ss_pred CHHHHHHHHHhhcCCCCChH--------------HhhhCchHHHHHHH-hh--hhccCccccccCHHHHHHhhhhHHHHH
Q psy6849 319 DCDECLKALDDLINLPITPL--------------VLKKHPEIVDTCRR-LQ--RYTGNIAEWTMTEEEAEVFTQKASQVR 381 (439)
Q Consensus 319 dv~~cl~aLdel~~L~vt~~--------------mL~k~~e~V~tirk-lr--ry~gn~~~w~~~~~~~~~f~~~a~~Ir 381 (439)
|...|..+|++|...+.+.. -+.+||-.+.||++ |+ .|. -..+|..+..+|-
T Consensus 1 ~~~~c~~il~~l~~~~~~~~F~~pv~~~~~p~Y~~iIk~PmdL~tI~~kl~~~~Y~-----------s~~ef~~D~~li~ 69 (97)
T cd05503 1 DLALCETILDEMEAHEDAWPFLEPVNTKLVPGYRKIIKKPMDFSTIREKLESGQYK-----------TLEEFAEDVRLVF 69 (97)
T ss_pred CHHHHHHHHHHHHcCCCchhhcCCCCccccCCHHHHhCCCCCHHHHHHHHccCCCC-----------CHHHHHHHHHHHH
Confidence 56789999999998776553 34567777788765 43 332 2456777777776
Q ss_pred HHH
Q psy6849 382 AKA 384 (439)
Q Consensus 382 ~kA 384 (439)
..|
T Consensus 70 ~Na 72 (97)
T cd05503 70 DNC 72 (97)
T ss_pred HHH
Confidence 655
No 41
>PF14476 Chloroplast_duf: Petal formation-expressed
Probab=39.18 E-value=31 Score=35.39 Aligned_cols=92 Identities=22% Similarity=0.344 Sum_probs=61.9
Q ss_pred hhHHHHHHHHHHHhhCCCC---CCHHHHH---HHHHhhcCCCCChHHhhhCchHHHHHHHhhh-----------hccCcc
Q psy6849 299 EVCILDIDLRIKDSIGLEH---ADCDECL---KALDDLINLPITPLVLKKHPEIVDTCRRLQR-----------YTGNIA 361 (439)
Q Consensus 299 E~~l~~l~~~ik~sL~~~~---~dv~~cl---~aLdel~~L~vt~~mL~k~~e~V~tirklrr-----------y~gn~~ 361 (439)
=|.|.+|+.+|+.-|.+++ .||..+. -+||.-.=||+..-||-|-|+-|+.-+---+ -.++-.
T Consensus 92 tRLfkqL~~~I~t~la~~~~t~~dv~~amekVLALDkAYPLPLlg~MLeKFP~~~ePa~WWp~~~~~~~~~~~~~~~~~N 171 (313)
T PF14476_consen 92 TRLFKQLHSQIETTLALGNPTEEDVKDAMEKVLALDKAYPLPLLGAMLEKFPEKVEPAVWWPRKKQSQQKKSKKETNSGN 171 (313)
T ss_pred HHHHHHHHHHHHHHHhcCCccHhHHHHHHHHHHHhhccCCcccchhHHHhCccccCCccCCCCcCccccccchhcccCCC
Confidence 3689999999999999984 5666664 4788888999999999999998876321111 122334
Q ss_pred ccccCHHHHHHhhhhHHHHHHHHHHHHHHhh
Q psy6849 362 EWTMTEEEAEVFTQKASQVRAKADHVYNKFK 392 (439)
Q Consensus 362 ~w~~~~~~~~~f~~~a~~Ir~kA~~ly~kfK 392 (439)
-|+- +=++|-.+=...|+.|=..=|.+.=
T Consensus 172 GWs~--eLE~EmReVv~VlK~KD~edY~rlg 200 (313)
T PF14476_consen 172 GWSE--ELEEEMREVVEVLKRKDEEDYLRLG 200 (313)
T ss_pred CCCH--HHHHHHHHHHHHHHhccHHHHHHHH
Confidence 5654 3344555556666655555555543
No 42
>PF14866 Toxin_38: Potassium channel toxin
Probab=37.35 E-value=41 Score=26.58 Aligned_cols=33 Identities=24% Similarity=0.311 Sum_probs=26.9
Q ss_pred HHHHHHHHHHHHHhhhhcccCC-CcchHHHHHHH
Q psy6849 378 SQVRAKADHVYNKFKNLFTVPA-GETFWDTFSKQ 410 (439)
Q Consensus 378 ~~Ir~kA~~ly~kfK~~F~~~e-g~~Fw~~F~~~ 410 (439)
+.||.+|..+|+|.|.+-...+ |=-|.+.||+.
T Consensus 1 ~~vk~~~k~~~~K~k~l~~kse~~Cp~i~~~Ce~ 34 (56)
T PF14866_consen 1 QKVKNKAKAVVHKVKALTAKSEYGCPAIEGWCED 34 (56)
T ss_pred ChHHHHHHHHHHHHHhhhcccccCchhHHhHHHH
Confidence 3689999999999999776665 66788888875
No 43
>PF06003 SMN: Survival motor neuron protein (SMN); InterPro: IPR010304 This family consists of several eukaryotic survival motor neuron (SMN) proteins. The Survival of Motor Neurons (SMN) protein, the product of the spinal muscular atrophy-determining gene, is part of a large macromolecular complex (SMN complex) that functions in the assembly of spliceosomal small nuclear ribonucleoproteins (snRNPs). The SMN complex functions as a specificity factor essential for the efficient assembly of Sm proteins on U snRNAs and likely protects cells from illicit, and potentially deleterious, non-specific binding of Sm proteins to RNAs.; GO: 0003723 RNA binding, 0006397 mRNA processing, 0005634 nucleus, 0005737 cytoplasm; PDB: 1MHN_A 4A4G_A 3S6N_M 4A4E_A 1G5V_A 4A4H_A 4A4F_A 2D9T_A.
Probab=36.17 E-value=32 Score=34.58 Aligned_cols=58 Identities=22% Similarity=0.277 Sum_probs=41.0
Q ss_pred cCCCccCcEEEEeccC-CCCCCceecCCCCCCCCCceEEEEEecCCceeeecCCCcccchhh
Q psy6849 5 ERKFKTNDKVFAKVRG-YPPWPARIEGLADETPNRLKYHIFFYGTRETGICRQDELFPYTEF 65 (439)
Q Consensus 5 ~~~fk~GDLVWAKvkG-YP~WPArV~~~pevk~~~~ky~V~FFGt~e~Awv~~k~L~Py~e~ 65 (439)
...|++||..-|...+ --+-||.|..... ..+.+.|.|-|=+..-.|...+|.|-...
T Consensus 66 ~~~WkvGd~C~A~~s~Dg~~Y~A~I~~i~~---~~~~~~V~f~gYgn~e~v~l~dL~~~~~~ 124 (264)
T PF06003_consen 66 NKKWKVGDKCMAVYSEDGQYYPATIESIDE---EDGTCVVVFTGYGNEEEVNLSDLKPSEGD 124 (264)
T ss_dssp TT---TT-EEEEE-TTTSSEEEEEEEEEET---TTTEEEEEETTTTEEEEEEGGGEEETT--
T ss_pred ccCCCCCCEEEEEECCCCCEEEEEEEEEcC---CCCEEEEEEcccCCeEeeehhhhcccccc
Confidence 3589999999999753 2355999998743 34688899999888899999999997743
No 44
>PF08846 DUF1816: Domain of unknown function (DUF1816); InterPro: IPR014945 Q4C9H3 from SWISSPROT is associated with the IPR008213 from INTERPRO domain suggesting this protein could have a role in phycobilisomes.
Probab=34.43 E-value=23 Score=28.93 Aligned_cols=23 Identities=22% Similarity=0.452 Sum_probs=16.2
Q ss_pred ccCCCCCCceecCCCCCCCCCceEEEEEecC
Q psy6849 18 VRGYPPWPARIEGLADETPNRLKYHIFFYGT 48 (439)
Q Consensus 18 vkGYP~WPArV~~~pevk~~~~ky~V~FFGt 48 (439)
.-|.+|| ++|... +--|.+|||.
T Consensus 5 ~~glaWW-veI~T~-------~P~ctYyFGP 27 (68)
T PF08846_consen 5 GLGLAWW-VEIETQ-------NPNCTYYFGP 27 (68)
T ss_pred ccCCcEE-EEEEcC-------CCCEEEEeCC
Confidence 4588999 566543 3448999994
No 45
>PF01426 BAH: BAH domain; InterPro: IPR001025 The BAH (bromo-adjacent homology) family contains proteins such as eukaryotic DNA (cytosine-5) methyltransferases IPR001525 from INTERPRO, the origin recognition complex 1 (Orc1) proteins, as well as several proteins involved in transcriptional regulation. The BAH domain appears to act as a protein-protein interaction module specialised in gene silencing, as suggested for example by its interaction within yeast Orc1p with the silent information regulator Sir1p. The BAH module might therefore play an important role by linking DNA methylation, replication and transcriptional regulation [].; GO: 0003677 DNA binding; PDB: 4DA4_A 3PT6_B 3AV6_A 3AV5_A 3AV4_A 3PT9_A 3SWR_A 3PTA_A 1M4Z_A 1ZBX_A ....
Probab=33.30 E-value=23 Score=29.82 Aligned_cols=41 Identities=24% Similarity=0.440 Sum_probs=31.9
Q ss_pred CCccCcEEEEeccC--CCCCCceecCCCCCCCCC--ceEEEEEec
Q psy6849 7 KFKTNDKVFAKVRG--YPPWPARIEGLADETPNR--LKYHIFFYG 47 (439)
Q Consensus 7 ~fk~GDLVWAKvkG--YP~WPArV~~~pevk~~~--~ky~V~FFG 47 (439)
.|++||.|..+-.. -|+|-|+|.......... ..+.|.||-
T Consensus 2 ~~~vGD~V~v~~~~~~~~~~v~~I~~i~~~~~~~~~~~~~v~Wf~ 46 (119)
T PF01426_consen 2 TYKVGDFVYVKPDDPPEPPYVARIEEIWEDKDGNKEKMVKVRWFY 46 (119)
T ss_dssp EEETTSEEEEECTSTTSEEEEEEEEEEEEETTTSEEEEEEEEEEE
T ss_pred EEeCCCEEEEeCCCCCCCCEEEEEEEEEcCCCCCEEEEEEEEEeE
Confidence 47899999999977 789999998875432222 578899996
No 46
>cd05505 Bromo_WSTF_like Bromodomain; Williams syndrome transcription factor-like subfamily (WSTF-like). The Williams-Beuren syndrome deletion transcript 9 is a putative transcriptional regulator. WSTF was found to play a role in vitamin D-mediated transcription as part of two chromatin remodeling complexes, WINAC and WICH. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
Probab=31.62 E-value=1.1e+02 Score=26.09 Aligned_cols=59 Identities=15% Similarity=0.269 Sum_probs=38.9
Q ss_pred CHHHHHHHHHhhcCCCCCh--------------HHhhhCchHHHHHHH-hh--hhccCccccccCHHHHHHhhhhHHHHH
Q psy6849 319 DCDECLKALDDLINLPITP--------------LVLKKHPEIVDTCRR-LQ--RYTGNIAEWTMTEEEAEVFTQKASQVR 381 (439)
Q Consensus 319 dv~~cl~aLdel~~L~vt~--------------~mL~k~~e~V~tirk-lr--ry~gn~~~w~~~~~~~~~f~~~a~~Ir 381 (439)
+..+|.++|++|.+.+.+. .-+.+||-...||++ |+ .|. -..+|..+.++|-
T Consensus 1 ~~~~c~~il~~l~~~~~s~~F~~pv~~~~~pdY~~iIk~PmDL~tI~~kl~~~~Y~-----------s~~ef~~D~~li~ 69 (97)
T cd05505 1 ELQKCEEILSKILKYRFSWPFREPVTADEAEDYKKVITNPMDLQTMQTKCSCGSYS-----------SVQEFLDDMKLVF 69 (97)
T ss_pred CHHHHHHHHHHHHhCCCcccccCCCChhhcccHHHHcCCcCCHHHHHHHHcCCCCC-----------CHHHHHHHHHHHH
Confidence 3568999999998865543 334567778888864 42 332 2456877778877
Q ss_pred HHHHHHHH
Q psy6849 382 AKADHVYN 389 (439)
Q Consensus 382 ~kA~~ly~ 389 (439)
..| ++||
T Consensus 70 ~Na-~~yN 76 (97)
T cd05505 70 SNA-EKYY 76 (97)
T ss_pred HHH-HHHC
Confidence 666 5555
No 47
>cd04714 BAH_BAHCC1 BAH, or Bromo Adjacent Homology domain, as present in mammalian BAHCC1 and similar proteins. BAHCC1 stands for BAH domain and coiled-coil containing 1. BAH domains are found in a variety of proteins playing roles in transcriptional silencing and the remodeling of chromatin. It is assumed that in most or all of these instances the BAH domain mediates protein-protein interactions.
Probab=31.05 E-value=44 Score=29.40 Aligned_cols=42 Identities=26% Similarity=0.394 Sum_probs=31.7
Q ss_pred CCccCcEEEEeccCC--CCCCceecCCCCCCCCCceEEEEEecC
Q psy6849 7 KFKTNDKVFAKVRGY--PPWPARIEGLADETPNRLKYHIFFYGT 48 (439)
Q Consensus 7 ~fk~GDLVWAKvkGY--P~WPArV~~~pevk~~~~ky~V~FFGt 48 (439)
-|++||-|.-+-.+- ++|=|+|....+.+.....+.|+||-.
T Consensus 3 ~~~vGD~V~v~~~~~~~~pyIgrI~~i~e~~~g~~~~~v~Wfyr 46 (121)
T cd04714 3 IIRVGDCVLFKSPGRPSLPYVARIESLWEDPEGNMVVRVKWYYR 46 (121)
T ss_pred EEEcCCEEEEeCCCCCCCCEEEEEEEEEEcCCCCEEEEEEEEEc
Confidence 578999999998664 678899988754333445788999983
No 48
>KOG0412|consensus
Probab=29.41 E-value=4.7e+02 Score=30.43 Aligned_cols=110 Identities=15% Similarity=0.269 Sum_probs=64.6
Q ss_pred HHHhhhhhhhh-------HHHHHHHHHHHhhC-----CCCCCHHHHHHHHHhhcCCCCChHHhhhC-----------chH
Q psy6849 290 KALKTQLKLEV-------CILDIDLRIKDSIG-----LEHADCDECLKALDDLINLPITPLVLKKH-----------PEI 346 (439)
Q Consensus 290 ~~k~~~L~~E~-------~l~~l~~~ik~sL~-----~~~~dv~~cl~aLdel~~L~vt~~mL~k~-----------~e~ 346 (439)
.+|+++|..++ ++++=..+++.|+. +++-|+++|-+..-.+..| ++.++..- .+-
T Consensus 104 s~kVr~lDla~~Rv~~clq~v~dvrdlk~C~~gv~~Al~seDyE~AA~~IhRflsl--D~~~i~~~~~~~~~~ts~i~~~ 181 (773)
T KOG0412|consen 104 SGKVRALDLAQNRVNECLQRVDDVRDLKNCIEGVDTALESEDYEKAATHIHRFLSL--DQALIESRFAKQVVPTSEISDP 181 (773)
T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHhc--CHHHHhhhhhhccCCchhhhhH
Confidence 33455555553 23333445566654 3488999999999998887 77776655 444
Q ss_pred HHHHHHhhhhccCccccccCHHHHHHhhhh-HHHHHHHHHHHHHHhhhhc-ccC---CC-cchHHHHHHHHHH
Q psy6849 347 VDTCRRLQRYTGNIAEWTMTEEEAEVFTQK-ASQVRAKADHVYNKFKNLF-TVP---AG-ETFWDTFSKQLVT 413 (439)
Q Consensus 347 V~tirklrry~gn~~~w~~~~~~~~~f~~~-a~~Ir~kA~~ly~kfK~~F-~~~---eg-~~Fw~~F~~~v~~ 413 (439)
++||+-...= =...|..+ ...+|+.=-.=..+|-.|| ++| || +-||--||+.+-.
T Consensus 182 ~~~L~~a~e~------------L~~l~~~~f~eA~r~~D~~ei~RffKmFPliG~~~eGL~~ys~ylc~iIA~ 242 (773)
T KOG0412|consen 182 YETLKEAKER------------LSKLFKERFTEAVRKQDLKEITRFFKMFPLIGEEDEGLQLYSVYLCQIIAS 242 (773)
T ss_pred HHHHHHHHHH------------HHHHHHHHHHHHHhcccHHHHHHHHHHccccCCchhhHHHHHHHHHHHHHH
Confidence 4444322211 01122222 2334444344457889999 555 57 8999999998543
No 49
>KOG3612|consensus
Probab=28.97 E-value=31 Score=38.28 Aligned_cols=51 Identities=22% Similarity=0.486 Sum_probs=38.0
Q ss_pred CccCcEEEEeccCCCCCCceecCCCCCCCCCceEEEEEec-CCceeeecCCCccc
Q psy6849 8 FKTNDKVFAKVRGYPPWPARIEGLADETPNRLKYHIFFYG-TRETGICRQDELFP 61 (439)
Q Consensus 8 fk~GDLVWAKvkGYP~WPArV~~~pevk~~~~ky~V~FFG-t~e~Awv~~k~L~P 61 (439)
.....++.|.+.|-++||+.|.-.+. ....|.|+||| ++..+-....++.+
T Consensus 331 ~~~~e~apap~ag~Sp~p~kvp~spr---e~i~~~~r~Vg~~~s~~~~e~~dv~s 382 (588)
T KOG3612|consen 331 LQRHELAPAPQAGSSPWPAKVPVSPR---EPIKYDVRFVGGTHSRALIEERDVTS 382 (588)
T ss_pred ccccccccCccccCCCcccccccCcc---cccccccccccccccccccccccCCc
Confidence 45667899999999999999985543 35679999999 77755555555543
No 50
>smart00804 TAP_C C-terminal domain of vertebrate Tap protein. The vertebrate Tap protein is a member of the NXF family of shuttling transport receptors for the nuclear export of mRNA. Its most C-terminal domain is important for binding to FG repeat-containing nuclear pore proteins (FG-nucleoporins) and is sufficient to mediate shuttling. This domain forms a compact four-helix fold related to that of a UBA domain.
Probab=28.70 E-value=45 Score=26.66 Aligned_cols=23 Identities=17% Similarity=0.252 Sum_probs=20.2
Q ss_pred HHhhCCCCCCHHHHHHHHHhhcC
Q psy6849 310 KDSIGLEHADCDECLKALDDLIN 332 (439)
Q Consensus 310 k~sL~~~~~dv~~cl~aLdel~~ 332 (439)
..||.-+++|+++|+.+|.+|.+
T Consensus 31 ~~cLe~~~Wd~~~Al~~F~~lk~ 53 (63)
T smart00804 31 QMCLEDNNWDYERALKNFTELKS 53 (63)
T ss_pred HHHHHHcCCCHHHHHHHHHHHHh
Confidence 45888889999999999999983
No 51
>PHA03165 hypothetical protein; Provisional
Probab=28.02 E-value=24 Score=27.03 Aligned_cols=20 Identities=35% Similarity=0.848 Sum_probs=15.4
Q ss_pred hhhcccCCC-cchHHHHHHHH
Q psy6849 392 KNLFTVPAG-ETFWDTFSKQL 411 (439)
Q Consensus 392 K~~F~~~eg-~~Fw~~F~~~v 411 (439)
|++|.+|-| +.||+.-.-.|
T Consensus 6 ktlfpiprgadefwegtyilv 26 (57)
T PHA03165 6 KTLFPIPRGADEFWEGTYILV 26 (57)
T ss_pred cccccCCCCchhhhcceehhH
Confidence 679999998 89998644443
No 52
>cd05502 Bromo_tif1_like Bromodomain; tif1_like subfamily. Tif1 (transcription intermediary factor 1) is a member of the tripartite motif (TRIM) protein family, which is characterized by a particular domain architecture. It functions by recruiting coactivators and/or corepressors to modulate transcription. Vertebrate Tif1-gamma, also labeled E3 ubiquitin-protein ligase TRIM33, plays a role in the control of hematopoiesis. Its homologue in Xenopus laevis, Ectodermin, has been shown to function in germ-layer specification and control of cell growth during embryogenesis. Bromodomains are 110 amino acid long domains, that are found in many chromatin associated proteins. Bromodomains can interact specifically with acetylated lysine.
Probab=27.84 E-value=1.1e+02 Score=26.34 Aligned_cols=37 Identities=24% Similarity=0.309 Sum_probs=24.7
Q ss_pred CCHHHHHHHHHhhcCCCCCh-------------HHhhhCchHHHHHHH-hh
Q psy6849 318 ADCDECLKALDDLINLPITP-------------LVLKKHPEIVDTCRR-LQ 354 (439)
Q Consensus 318 ~dv~~cl~aLdel~~L~vt~-------------~mL~k~~e~V~tirk-lr 354 (439)
.+..+|.++|++|.+.+.+. .-..+||-...||++ |+
T Consensus 4 ~~~~~c~~il~~l~~~~~s~~F~~pv~~~~p~Y~~iI~~PmdL~tI~~kL~ 54 (109)
T cd05502 4 IDQRKCERLLLELYCHELSLPFHEPVSPSVPNYYKIIKTPMDLSLIRKKLQ 54 (109)
T ss_pred HHHHHHHHHHHHHHhCCCChhhcCCCCCCCCCHHHHCCCCccHHHHHHHHh
Confidence 35678888888888854433 234567778787764 55
No 53
>KOG0955|consensus
Probab=27.77 E-value=51 Score=39.44 Aligned_cols=56 Identities=21% Similarity=0.361 Sum_probs=39.3
Q ss_pred CccCcEEEEeccCCCCCCceecCCCCCC--------------------------CCCceEEEEEec-CCceeeecCCCcc
Q psy6849 8 FKTNDKVFAKVRGYPPWPARIEGLADET--------------------------PNRLKYHIFFYG-TRETGICRQDELF 60 (439)
Q Consensus 8 fk~GDLVWAKvkGYP~WPArV~~~pevk--------------------------~~~~ky~V~FFG-t~e~Awv~~k~L~ 60 (439)
+..=++||||.+|||+-|+.|..+...+ ....-+.|.||. ....-|+....+.
T Consensus 943 l~~~~~~~akc~g~~s~~~l~l~p~~~~~~~~~~g~~~~p~~dv~~l~eq~~~~~~~~l~~~L~~~n~~~~~~~~~s~~~ 1022 (1051)
T KOG0955|consen 943 LEELKLVWAKCRGYPSYPALILDPKMPREGNFHNGPDPAPPTDVLALPEQRTNKAPETLFLVLFFDNKRCWQWLPRSKVL 1022 (1051)
T ss_pred eeehhceeehhcCCccchhhhcccccccccCccCCCCCCCCcccccchHHHhcccChhheEEEeecccccccccCCCCcc
Confidence 3445699999999999999999884310 122346677777 4566788888766
Q ss_pred cch
Q psy6849 61 PYT 63 (439)
Q Consensus 61 Py~ 63 (439)
+.-
T Consensus 1023 ~l~ 1025 (1051)
T KOG0955|consen 1023 ELG 1025 (1051)
T ss_pred ccc
Confidence 544
No 54
>cd05791 S1_CSL4 S1_CSL4: CSL4, S1-like RNA-binding domain. S1-like RNA-binding domains are found in a wide variety of RNA-associated proteins. ScCSL4 protein is a subunit of the exosome complex. The exosome plays a central role in 3' to 5' RNA processing and degradation in eukarytes and archaea. Its functions include the removal of incorrectly processed RNA and the maintenance of proper levels of mRNA, rRNA and a number of small RNA species. In S. cerevisiae, the exosome includes nine core components, six of which are homologous to bacterial RNase PH. These form a hexameric ring structure. The other three subunits (RrP4, Rrp40, and Csl4) contain an S1 RNA binding domain and are part of the "S1 pore structure".
Probab=25.54 E-value=31 Score=29.01 Aligned_cols=20 Identities=30% Similarity=0.288 Sum_probs=16.1
Q ss_pred cccCCCccCcEEEEeccCCC
Q psy6849 3 AVERKFKTNDKVFAKVRGYP 22 (439)
Q Consensus 3 ~~~~~fk~GDLVWAKvkGYP 22 (439)
++..-|++||+|-||+-.+-
T Consensus 56 ~~~~~f~~GDiV~AkVis~~ 75 (92)
T cd05791 56 EMYKCFRPGDIVRAKVISLG 75 (92)
T ss_pred HHHhhcCCCCEEEEEEEEcC
Confidence 34567999999999997764
No 55
>PF03882 KicB: KicB killing factor; InterPro: IPR005582 This family contains MukF, which are proteins involved in chromosome condensation, segregation and cell cycle progression. MukE (IPR007385 from INTERPRO) along with MukF interact with MukB (IPR007406 from INTERPRO) in vivo forming a complex, which is required for chromosome condensation and segregation in Escherichia coli []. The Muk complex appears to be similar to the SMC-ScpA-ScpB complex in other prokaryotes where MukB is the homologue of SMC []. ScpA (IPR003768 from INTERPRO) and ScpB (IPR005234 from INTERPRO) have little sequence similarity to MukE or MukF, though they are predicted to be structurally similar, being predominantly alpha-helical with coiled coil regions. ; GO: 0005509 calcium ion binding, 0006260 DNA replication, 0007059 chromosome segregation, 0005737 cytoplasm; PDB: 1T98_B 3RPU_X 3EUH_B 3EUK_J 3EUJ_B.
Probab=24.49 E-value=73 Score=34.31 Aligned_cols=123 Identities=14% Similarity=0.152 Sum_probs=75.3
Q ss_pred HHHhhhhhhhhHHHHHHHHHHHhhCCC-CCCHHHHHHHHHhhc----CCCCChHHhhhCchHHHHHHHhhhhccCccccc
Q psy6849 290 KALKTQLKLEVCILDIDLRIKDSIGLE-HADCDECLKALDDLI----NLPITPLVLKKHPEIVDTCRRLQRYTGNIAEWT 364 (439)
Q Consensus 290 ~~k~~~L~~E~~l~~l~~~ik~sL~~~-~~dv~~cl~aLdel~----~L~vt~~mL~k~~e~V~tirklrry~gn~~~w~ 364 (439)
...++.=.|+.+.+....+|-.-|+.+ +.-|..|..+|++.. +|+-| ++-.--.+..-|-+++.-+-+..
T Consensus 170 ~Idl~QR~MDeqQ~~vk~eIA~LL~qdW~~AI~~Ce~LL~EtsgtLRELqdt--L~aagd~lqa~Ll~IQe~~~~~~--- 244 (440)
T PF03882_consen 170 SIDLNQRAMDEQQQSVKEEIAALLNQDWRAAIQSCEQLLDETSGTLRELQDT--LEAAGDKLQAQLLRIQEAVMGRD--- 244 (440)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHHHHHHHGGGGHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHCSS---
T ss_pred hhhHHHHHHHHHHHHHHHHHHHHHchhHHHHHHHHHHHHHHHHhhHHHHHHH--HHHhHHHHHHHHHHHHHHHhcCc---
Confidence 444455668889999999999999998 888999999999853 22221 11111112222222222211110
Q ss_pred cCHHHHHHhhhhHHHHHHHHHHHHHHhhhhcccCC-CcchHHHHHHHHHHHHHHhcCCCccccee
Q psy6849 365 MTEEEAEVFTQKASQVRAKADHVYNKFKNLFTVPA-GETFWDTFSKQLVTFNDVTRNMPAHDLFS 428 (439)
Q Consensus 365 ~~~~~~~~f~~~a~~Ir~kA~~ly~kfK~~F~~~e-g~~Fw~~F~~~v~~fk~~t~~~~~~~~~~ 428 (439)
.+..|-+....|.+|.=.+-.-+- --.-|..|-..|-.|.+.+.+|..|..|+
T Consensus 245 -----------~l~~v~~l~~~Lq~kLDrI~sWGqq~idlWigYdrhVHkfIRtaIdmDkNR~fs 298 (440)
T PF03882_consen 245 -----------ELEFVDNLIFDLQMKLDRIISWGQQAIDLWIGYDRHVHKFIRTAIDMDKNRVFS 298 (440)
T ss_dssp -----------S-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHTTTCTTSHHH
T ss_pred -----------cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcchHHH
Confidence 133444444444444433321111 14789999999999999999999999886
No 56
>cd04712 BAH_DCM_I BAH, or Bromo Adjacent Homology domain, as present in DNA (Cytosine-5)-methyltransferases (DCM) 1. DNA methylation, or the covalent addition of a methyl group to cytosine within the context of the CpG dinucleotide, has profound effects on the genome. These effects include transcriptional repression via inhibition of transcription factor binding, the recruitment of methyl-binding proteins and their associated chromatin remodeling factors, X chromosome inactivation, imprinting, and the suppression of parasitic DNA sequences. DNA methylation is also essential for proper embryonic development and is an important player in both DNA repair and genome stability. BAH domains are found in a variety of proteins playing roles in transcriptional silencing and the remodeling of chromatin. It is assumed that in most or all of these instances the BAH domain mediates protein-protein interactions.
Probab=24.41 E-value=71 Score=28.73 Aligned_cols=51 Identities=8% Similarity=0.037 Sum_probs=34.3
Q ss_pred CCCccCcEEEEeccCCCC---------CCceecCCCCC---CCCCceEEEEEecCCceeeecC
Q psy6849 6 RKFKTNDKVFAKVRGYPP---------WPARIEGLADE---TPNRLKYHIFFYGTRETGICRQ 56 (439)
Q Consensus 6 ~~fk~GDLVWAKvkGYP~---------WPArV~~~pev---k~~~~ky~V~FFGt~e~Awv~~ 56 (439)
..+++||.|...-.+-.- ||+.|+..... .....+++|+||--+.-.-++.
T Consensus 4 ~~i~vGD~V~v~~d~~~~~~~~~~~~~~~~~i~~V~~~~e~~~g~~~~h~~W~yrp~eTv~g~ 66 (130)
T cd04712 4 LTIRVGDVVSVERDDADSTTKWNDDHRWLPLVQFVEYMKKGSDGSKMFHGRWLYRGCDTVLGN 66 (130)
T ss_pred CEEeCCCEEEEcCCCCCccccccccccccceEEEEEEeeecCCCceEEEEEEEEcchhccccc
Confidence 458999999999876553 67777765432 2235689999998555554443
No 57
>cd04717 BAH_polybromo BAH, or Bromo Adjacent Homology domain, as present in polybromo and yeast RSC1/2. The human polybromo protein (BAF180) is a component of the SWI/SNF chromatin-remodeling complex PBAF. It is thought that polybromo participates in transcriptional regulation. Saccharomyces cerevisiae RSC1 and RSC2 are part of the 15-subunit nucleosome remodeling RSC complex. BAH domains are found in a variety of proteins playing roles in transcriptional silencing and the remodeling of chromatin. It is assumed that in most or all of these instances the BAH domain mediates protein-protein interactions.
Probab=23.78 E-value=77 Score=27.54 Aligned_cols=42 Identities=17% Similarity=0.224 Sum_probs=31.3
Q ss_pred CCccCcEEEEeccC--CCCCCceecCCCCCCCCCceEEEEEecC
Q psy6849 7 KFKTNDKVFAKVRG--YPPWPARIEGLADETPNRLKYHIFFYGT 48 (439)
Q Consensus 7 ~fk~GDLVWAKvkG--YP~WPArV~~~pevk~~~~ky~V~FFGt 48 (439)
.|++||-|..+-.+ -++|-|+|............+.|+||-.
T Consensus 3 ~~~vGD~V~v~~~~~~~~~~i~~I~~i~~~~~g~~~~~~~Wf~r 46 (121)
T cd04717 3 QYRVGDCVYVANPEDPSKPIIFRIERLWKDEDGEKFFFGCWFYR 46 (121)
T ss_pred EEECCCEEEEeCCCCCCCCEEEEEeEEEECCCCCEEEEEEEEeC
Confidence 47899999999766 6678899988754332345678998883
No 58
>cd04715 BAH_Orc1p_like BAH, or Bromo Adjacent Homology domain, as present in the Schizosaccharomyces pombe homolog of Saccharomyces cerevisiae Orc1p and similar proteins. Orc1 is part of the Yeast Sir1-origin recognition complex, the Orc1p BAH doman functions in epigenetic silencing. BAH domains are found in a variety of proteins playing roles in transcriptional silencing and the remodeling of chromatin. It is assumed that in most or all of these instances the BAH domain mediates protein-protein interactions.
Probab=22.70 E-value=77 Score=29.64 Aligned_cols=43 Identities=12% Similarity=0.319 Sum_probs=34.1
Q ss_pred CCCccCcEEEEeccCCCCCCceecCCCCCC--CCCceEEEEEecC
Q psy6849 6 RKFKTNDKVFAKVRGYPPWPARIEGLADET--PNRLKYHIFFYGT 48 (439)
Q Consensus 6 ~~fk~GDLVWAKvkGYP~WPArV~~~pevk--~~~~ky~V~FFGt 48 (439)
..|++||-|.-+-..-++|-|+|....+.. .....+.|+||-.
T Consensus 28 ~~y~lGD~Vlv~s~~~~~yIgkI~~iwe~~~~~g~~~~~v~WfyR 72 (159)
T cd04715 28 VEYRLYDDVYVHNGDSEPYIGKIIKIYETAIDSGKKKVKVIWFFR 72 (159)
T ss_pred EEEeCCCEEEEeCCCCCCEEEEEEEEEEcCCcCCceEEEEEeeeC
Confidence 348999999999878899999999886532 1445788999973
No 59
>PF09926 DUF2158: Uncharacterized small protein (DUF2158); InterPro: IPR019226 This entry represents a family of predominantly prokaryotic proteins with no known function.
Probab=21.81 E-value=2e+02 Score=22.24 Aligned_cols=38 Identities=18% Similarity=0.127 Sum_probs=25.5
Q ss_pred CccCcEEEEeccCCCCCCceecCC-CC-CCCCCceEEEEEecCC
Q psy6849 8 FKTNDKVFAKVRGYPPWPARIEGL-AD-ETPNRLKYHIFFYGTR 49 (439)
Q Consensus 8 fk~GDLVWAKvkGYP~WPArV~~~-pe-vk~~~~ky~V~FFGt~ 49 (439)
|++||+|-=|-.| |.+++.. .. .....+.|...||..+
T Consensus 1 f~~GDvV~LKSGG----p~MTV~~v~~~~~~~~~~v~C~WFd~~ 40 (53)
T PF09926_consen 1 FKIGDVVQLKSGG----PRMTVTEVGPNAGASGGWVECQWFDGH 40 (53)
T ss_pred CCCCCEEEEccCC----CCeEEEEccccccCCCCeEEEEeCCCC
Confidence 7899999888877 4555542 11 1224578888888844
No 60
>PF07443 HARP: HepA-related protein (HARP); InterPro: IPR010003 This entry represents a conserved region approximately 60 residues long within eukaryotic HepA-related protein (HARP). This exhibits single-stranded DNA-dependent ATPase activity, and is ubiquitously expressed in human and mouse tissues []. Family members may contain more than one copy of this region.; GO: 0004386 helicase activity, 0005524 ATP binding, 0016818 hydrolase activity, acting on acid anhydrides, in phosphorus-containing anhydrides, 0016568 chromatin modification, 0005634 nucleus
Probab=21.11 E-value=73 Score=25.07 Aligned_cols=29 Identities=17% Similarity=0.564 Sum_probs=26.2
Q ss_pred CchHHHHHHHh--hhhccCccccccCHHHHH
Q psy6849 343 HPEIVDTCRRL--QRYTGNIAEWTMTEEEAE 371 (439)
Q Consensus 343 ~~e~V~tirkl--rry~gn~~~w~~~~~~~~ 371 (439)
|+++|+..|.| |.|-.++..|+.+-+|-.
T Consensus 8 ~~~lI~vFK~~pSr~YD~~Tr~W~F~L~Dy~ 38 (55)
T PF07443_consen 8 HEELIAVFKQMPSRNYDPKTRKWNFSLEDYS 38 (55)
T ss_pred CHHHHHHHHcCcccccCccceeeeeeHHHHH
Confidence 78899999998 999999999999998853
No 61
>KOG0749|consensus
Probab=20.58 E-value=39 Score=34.57 Aligned_cols=59 Identities=24% Similarity=0.423 Sum_probs=43.6
Q ss_pred hCchHHHHHHHhhhhccCccccccCHHHHHHhhhhHHHHHHHHHHHH-----HHhhhhcccCC--CcchHHHHHH
Q psy6849 342 KHPEIVDTCRRLQRYTGNIAEWTMTEEEAEVFTQKASQVRAKADHVY-----NKFKNLFTVPA--GETFWDTFSK 409 (439)
Q Consensus 342 k~~e~V~tirklrry~gn~~~w~~~~~~~~~f~~~a~~Ir~kA~~ly-----~kfK~~F~~~e--g~~Fw~~F~~ 409 (439)
|.--|||++.|+.+=.|-+.-|.=|- |..||.--++-. .+||++|.-+- .++||..|.-
T Consensus 52 ~YkGi~Dc~~r~~~eqG~~sfWRGN~---------anViRyfPtqAlNFAFKd~yk~~~~~~~dk~~~~~k~fag 117 (298)
T KOG0749|consen 52 RYKGIVDCFVRIPKEQGFLSFWRGNL---------ANVIRYFPTQALNFAFKDKYKQIFLGGVDKKTQFWKWFAG 117 (298)
T ss_pred Cccchhheeeechhhhhhhheecccc---------cchhhcCchhhhchhHHHHHHHHHhcCcccccchHHHHHh
Confidence 66678999999988877777777776 677887544433 48899996543 3799999853
No 62
>KOG3409|consensus
Probab=20.47 E-value=55 Score=31.59 Aligned_cols=18 Identities=39% Similarity=0.525 Sum_probs=14.4
Q ss_pred cccCCCccCcEEEEeccC
Q psy6849 3 AVERKFKTNDKVFAKVRG 20 (439)
Q Consensus 3 ~~~~~fk~GDLVWAKvkG 20 (439)
+.-+.|+|||+|.||+-.
T Consensus 118 ~v~ksFrPgDiVlAkVis 135 (193)
T KOG3409|consen 118 KVYKSFRPGDIVLAKVIS 135 (193)
T ss_pred hhhhccCCCcEEEEEEee
Confidence 445789999999998744
No 63
>KOG0981|consensus
Probab=20.37 E-value=90 Score=35.22 Aligned_cols=49 Identities=29% Similarity=0.478 Sum_probs=39.0
Q ss_pred hCchHHHHHHHhhhhccCccccccCHHHHHHhhhhHHHHHHHHHHHHHHhhhhccc
Q psy6849 342 KHPEIVDTCRRLQRYTGNIAEWTMTEEEAEVFTQKASQVRAKADHVYNKFKNLFTV 397 (439)
Q Consensus 342 k~~e~V~tirklrry~gn~~~w~~~~~~~~~f~~~a~~Ir~kA~~ly~kfK~~F~~ 397 (439)
..-+=-+|-|+|+.|+++.. ......|+.+-+.+|+.|..|| |...|.+
T Consensus 434 sD~~KyE~ARrLk~~Id~IR-----~~Y~~d~ksk~m~~RQravAlY--~IDklAL 482 (759)
T KOG0981|consen 434 SDKEKYETARRLKKYIDKIR-----ATYTKDFKSKEMKVRQRAVALY--FIDKLAL 482 (759)
T ss_pred hhHHHHHHHHHHHHHHHHHH-----HHHHhhhhhHHHHHHHHHHHHH--HHHHHHH
Confidence 34455789999999999754 5778889999999999999999 5555544
Done!