Query         002995
Match_columns 860
No_of_seqs    699 out of 2771
Neff          7.7 
Searched_HMMs 46136
Date          Thu Mar 28 14:56:32 2013
Command       hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/002995.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/002995hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG1080 Histone H3 (Lys4) meth 100.0 2.2E-64 4.9E-69  599.3  25.9  506  301-836   480-1005(1005)
  2 KOG0954 PHD finger protein [Ge 100.0 9.7E-40 2.1E-44  362.9   8.1  313  221-580   103-440 (893)
  3 KOG4442 Clathrin coat binding  100.0 2.5E-37 5.3E-42  345.5  14.3  160  681-841   105-265 (729)
  4 KOG0956 PHD finger protein AF1 100.0 3.7E-37   8E-42  339.3   5.5  167  398-578     7-182 (900)
  5 COG5141 PHD zinc finger-contai 100.0 3.8E-37 8.2E-42  327.9   4.3  192  381-583   177-369 (669)
  6 KOG0955 PHD finger protein BR1 100.0 5.1E-35 1.1E-39  348.5   9.0  178  395-579   218-396 (1051)
  7 KOG1079 Transcriptional repres 100.0 1.8E-32 3.8E-37  304.4   9.9  179  637-819   523-714 (739)
  8 KOG0957 PHD finger protein [Ge 100.0 5.9E-30 1.3E-34  273.5   3.8  171  395-580   118-302 (707)
  9 PF13832 zf-HC5HC2H_2:  PHD-zin  99.9 9.4E-25   2E-29  202.4   7.3  109  454-576     2-110 (110)
 10 cd05834 HDGF_related The PWWP   99.9 9.2E-24   2E-28  183.3   8.1   81   81-170     2-82  (83)
 11 cd05836 N_Pac_NP60 The PWWP do  99.9 8.5E-24 1.8E-28  185.0   7.4   86   82-172     1-86  (86)
 12 KOG1082 Histone H3 (Lys9) meth  99.9 1.5E-23 3.2E-28  233.9   8.5  159  681-839   161-357 (364)
 13 cd05838 WHSC1_related The PWWP  99.9 9.8E-23 2.1E-27  182.0   7.8   92   82-174     1-94  (95)
 14 cd05840 SPBC215_ISWI_like The   99.9 6.5E-23 1.4E-27  181.8   6.5   88   82-170     1-92  (93)
 15 smart00317 SET SET (Su(var)3-9  99.9 9.7E-22 2.1E-26  183.2  13.7  115  698-813     2-116 (116)
 16 cd05835 Dnmt3b_related The PWW  99.9 2.5E-22 5.5E-27  176.6   6.9   84   82-171     1-86  (87)
 17 cd05162 PWWP The PWWP domain,   99.9 5.5E-22 1.2E-26  175.7   8.1   86   82-170     1-86  (87)
 18 cd05841 BS69_related The PWWP   99.9 9.7E-22 2.1E-26  168.2   7.0   76   81-170     6-82  (83)
 19 KOG1083 Putative transcription  99.8 3.7E-22   8E-27  230.2   2.9  133  685-819  1166-1299(1306)
 20 PF00855 PWWP:  PWWP domain;  I  99.8 2.7E-21 5.8E-26  171.2   4.1   84   82-171     1-86  (86)
 21 smart00293 PWWP domain with co  99.8 2.2E-20 4.8E-25  154.2   6.4   63   82-145     1-63  (63)
 22 KOG1085 Predicted methyltransf  99.8 1.3E-19 2.8E-24  183.1  10.0  124  693-817   253-380 (392)
 23 cd05837 MSH6_like The PWWP dom  99.8 3.1E-19 6.8E-24  164.1   8.0   90   81-170     2-101 (110)
 24 cd06080 MUM1_like Mutated mela  99.7 1.7E-17 3.8E-22  141.3   5.9   75   82-168     1-76  (80)
 25 cd05839 BR140_related The PWWP  99.7 1.9E-17 4.1E-22  149.6   6.3   66   82-147     1-81  (111)
 26 PF05964 FYRN:  F/Y-rich N-term  99.7 5.4E-17 1.2E-21  128.9   5.3   54  230-283     1-54  (54)
 27 PF13771 zf-HC5HC2H:  PHD-like   99.6   3E-16 6.5E-21  140.0   4.9   88  475-577     1-90  (90)
 28 COG2940 Proteins containing SE  99.6 1.3E-16 2.8E-21  185.1   1.2  141  694-836   330-480 (480)
 29 PF05965 FYRC:  F/Y rich C-term  99.6 5.8E-16 1.3E-20  136.6   4.5   79  290-373     3-83  (86)
 30 smart00542 FYRC "FY-rich" doma  99.6 8.6E-16 1.9E-20  134.7   5.5   79  292-375     1-81  (86)
 31 KOG1141 Predicted histone meth  99.6 1.4E-15   3E-20  171.9   5.6   76  760-835  1179-1261(1262)
 32 KOG1904 Transcription coactiva  99.5 6.5E-14 1.4E-18  156.8   6.8   88   79-173    10-99  (496)
 33 PF00856 SET:  SET domain;  Int  99.4 2.4E-13 5.3E-18  133.3   7.4  108  707-814     1-162 (162)
 34 smart00541 FYRN "FY-rich" doma  99.3   2E-12 4.3E-17   97.5   4.5   41  241-281     2-42  (44)
 35 PF13831 PHD_2:  PHD-finger; PD  99.1 2.5E-11 5.4E-16   87.6   0.6   36  410-445     1-36  (36)
 36 KOG1081 Transcription factor N  98.8 1.1E-09 2.4E-14  125.0   2.1  127  711-844   319-445 (463)
 37 KOG2589 Histone tail methylase  98.8 5.8E-09 1.3E-13  110.2   5.4  115  705-827   136-251 (453)
 38 PF00628 PHD:  PHD-finger;  Int  97.6 1.6E-05 3.6E-10   62.7   0.4   45  398-445     1-49  (51)
 39 KOG4323 Polycomb-like PHD Zn-f  97.6 6.3E-05 1.4E-09   84.3   4.4  117  396-541    83-203 (464)
 40 smart00249 PHD PHD zinc finger  97.5   9E-05 1.9E-09   56.8   3.2   45  398-444     1-47  (47)
 41 PF09465 LBR_tudor:  Lamin-B re  97.5  0.0001 2.3E-09   57.6   3.3   32    2-33     20-51  (55)
 42 KOG2461 Transcription factor B  97.4 0.00017 3.6E-09   81.0   4.9  110  694-817    26-146 (396)
 43 KOG1244 Predicted transcriptio  97.1  0.0002 4.3E-09   73.5   1.6   49  395-446   280-330 (336)
 44 KOG4323 Polycomb-like PHD Zn-f  96.9 0.00045 9.8E-09   77.6   2.8   53  396-449   168-226 (464)
 45 KOG1512 PHD Zn-finger protein   96.8 0.00041   9E-09   71.5   1.0   49  395-446   313-363 (381)
 46 KOG1081 Transcription factor N  96.8 0.00062 1.4E-08   78.3   2.3   55   80-144   134-188 (463)
 47 KOG1084 Transcription factor T  96.5 0.00087 1.9E-08   75.0   0.9   85  452-543   221-306 (375)
 48 smart00508 PostSET Cysteine-ri  96.0  0.0036 7.7E-08   41.5   1.5   19  821-839     2-20  (26)
 49 PF15057 DUF4537:  Domain of un  95.6   0.034 7.4E-07   52.5   7.1  100    4-147    14-115 (124)
 50 PF15446 zf-PHD-like:  PHD/FYVE  95.4   0.015 3.2E-07   56.4   3.7   70  398-467     1-86  (175)
 51 KOG0825 PHD Zn-finger protein   95.1  0.0095 2.1E-07   69.5   1.7   49  395-446   214-265 (1134)
 52 KOG4299 PHD Zn-finger protein   93.8   0.021 4.5E-07   66.0   0.6   48  397-447   254-305 (613)
 53 PF08169 RBB1NT:  RBB1NT (NUC16  92.3    0.24 5.2E-06   44.0   4.9   85   84-178     8-94  (96)
 54 KOG1141 Predicted histone meth  92.0    0.15 3.2E-06   60.3   4.0   59  681-739   784-842 (1262)
 55 KOG1973 Chromatin remodeling p  91.7   0.079 1.7E-06   57.2   1.4   36  411-447   230-268 (274)
 56 PF14446 Prok-RING_1:  Prokaryo  91.5    0.12 2.6E-06   40.8   1.9   33  395-428     4-36  (54)
 57 COG5034 TNG2 Chromatin remodel  91.1    0.13 2.9E-06   53.1   2.3   54  388-446   213-269 (271)
 58 PF11717 Tudor-knot:  RNA bindi  90.9    0.28 6.1E-06   39.2   3.5   52   82-142     1-54  (55)
 59 smart00743 Agenet Tudor-like d  90.5    0.41 8.8E-06   39.0   4.3   53   81-145     2-57  (61)
 60 smart00249 PHD PHD zinc finger  89.9    0.17 3.7E-06   38.2   1.5   30  510-541     1-32  (47)
 61 smart00333 TUDOR Tudor domain.  89.3    0.29 6.3E-06   39.1   2.5   33    2-35     16-49  (57)
 62 smart00333 TUDOR Tudor domain.  88.4    0.69 1.5E-05   36.9   4.1   54   81-146     2-55  (57)
 63 KOG0383 Predicted helicase [Ge  87.7    0.22 4.7E-06   59.8   1.0   50  391-446    42-93  (696)
 64 KOG0957 PHD finger protein [Ge  85.0    0.69 1.5E-05   52.0   3.1   80  357-444   510-595 (707)
 65 PF00628 PHD:  PHD-finger;  Int  84.6    0.51 1.1E-05   36.9   1.4   29  511-541     2-32  (51)
 66 KOG1245 Chromatin remodeling c  84.1    0.25 5.3E-06   64.2  -1.0   49  396-447  1108-1158(1404)
 67 smart00743 Agenet Tudor-like d  84.1    0.78 1.7E-05   37.3   2.3   23    2-25     17-39  (61)
 68 KOG1244 Predicted transcriptio  83.8    0.49 1.1E-05   49.3   1.2   75  397-479   225-311 (336)
 69 PF07649 C1_3:  C1-like domain;  83.0    0.54 1.2E-05   32.5   0.8   28  398-427     2-29  (30)
 70 cd04508 TUDOR Tudor domains ar  81.4     1.3 2.7E-05   34.0   2.4   32    2-34     12-44  (48)
 71 KOG1512 PHD Zn-finger protein   81.2    0.55 1.2E-05   49.2   0.4   49  397-445   259-315 (381)
 72 PF00130 C1_1:  Phorbol esters/  80.2     1.9 4.2E-05   33.9   3.2   35  396-430    11-45  (53)
 73 KOG2041 WD40 repeat protein [G  80.0       1 2.2E-05   52.9   2.1  110  343-463  1069-1185(1189)
 74 PF03107 C1_2:  C1 domain;  Int  79.7     1.5 3.3E-05   30.3   2.1   28  398-427     2-29  (30)
 75 KOG2084 Predicted histone tail  79.2     2.8   6E-05   48.8   5.5   40  773-816   208-248 (482)
 76 KOG1973 Chromatin remodeling p  78.5    0.93   2E-05   49.0   1.2   29  510-538   220-250 (274)
 77 cd04508 TUDOR Tudor domains ar  77.9       3 6.4E-05   31.9   3.5   47   85-143     1-48  (48)
 78 PF11793 FANCL_C:  FANCL C-term  77.7     1.6 3.4E-05   36.9   2.0   50  397-446     3-63  (70)
 79 cd00029 C1 Protein kinase C co  75.2     1.4 3.1E-05   34.0   1.1   34  397-430    12-45  (50)
 80 KOG1337 N-methyltransferase [G  73.6     2.3   5E-05   49.9   2.7   40  773-815   239-278 (472)
 81 PF07039 DUF1325:  SGF29 tudor-  72.8      16 0.00035   34.8   7.8  107    2-143    17-127 (130)
 82 PF02318 FYVE_2:  FYVE-type zin  70.9     3.5 7.7E-05   38.6   2.8   49  396-446    54-102 (118)
 83 KOG4443 Putative transcription  69.8     2.1 4.5E-05   50.3   1.2   81  339-446    34-117 (694)
 84 KOG1473 Nucleosome remodeling   68.5     5.4 0.00012   49.6   4.3  110  396-538   344-457 (1414)
 85 cd04718 BAH_plant_2 BAH, or Br  68.3     3.2   7E-05   40.1   2.0   26  422-448     1-28  (148)
 86 KOG1044 Actin-binding LIM Zn-f  68.0     2.9 6.3E-05   48.4   1.8  102  437-544   112-225 (670)
 87 smart00109 C1 Protein kinase C  65.7     2.4 5.2E-05   32.3   0.5   33  396-429    11-43  (49)
 88 PF13901 DUF4206:  Domain of un  64.4     4.5 9.8E-05   41.7   2.3   42  397-446   153-197 (202)
 89 PF14569 zf-UDP:  Zinc-binding   60.7     3.9 8.5E-05   34.7   0.9   50  395-446     8-59  (80)
 90 PF11764 N-SET:  COMPASS (Compl  57.6     2.5 5.5E-05   42.1  -0.9   16  603-618    78-93  (167)
 91 KOG4443 Putative transcription  56.3     4.1 8.8E-05   48.0   0.3   48  397-445    19-69  (694)
 92 PF05502 Dynactin_p62:  Dynacti  55.8     8.5 0.00018   45.1   2.9   53  412-468     4-65  (483)
 93 KOG3799 Rab3 effector RIM1 and  55.0     5.5 0.00012   37.3   0.9   52  394-446    63-115 (169)
 94 PF11781 RRN7:  RNA polymerase   51.9      11 0.00023   27.5   1.8   26  397-422     9-34  (36)
 95 COG5475 Uncharacterized small   51.1      35 0.00077   27.1   4.6   52   80-146     3-56  (60)
 96 PF08746 zf-RING-like:  RING-li  50.7     7.4 0.00016   29.5   0.8   30  511-540     1-30  (43)
 97 COG5034 TNG2 Chromatin remodel  49.3      12 0.00026   39.3   2.3   40  499-538   211-252 (271)
 98 PF07227 DUF1423:  Protein of u  46.7     9.8 0.00021   43.2   1.4   14  470-483   152-165 (446)
 99 PHA02862 5L protein; Provision  44.0     7.7 0.00017   37.1   0.0   50  397-462     3-52  (156)
100 PF09926 DUF2158:  Uncharacteri  43.5      45 0.00097   26.6   4.2   41   82-132     1-41  (53)
101 PF13639 zf-RING_2:  Ring finge  42.9      13 0.00027   28.0   1.1   31  397-429     1-31  (44)
102 PF10080 DUF2318:  Predicted me  42.5      13 0.00028   33.9   1.2   34  396-429    35-68  (102)
103 PF05641 Agenet:  Agenet domain  41.9      22 0.00048   29.6   2.5   26    3-29     19-44  (68)
104 cd04712 BAH_DCM_I BAH, or Brom  41.3      24 0.00052   33.6   3.0   47   79-130     3-58  (130)
105 KOG0955 PHD finger protein BR1  41.2      19 0.00042   45.6   2.8   64   82-145   943-1025(1051)
106 PF07649 C1_3:  C1-like domain;  41.1      14  0.0003   25.5   1.0   27  510-538     2-30  (30)
107 PF10447 EXOSC1:  Exosome compo  41.0      17 0.00037   31.7   1.7   14   79-92     66-79  (82)
108 COG5194 APC11 Component of SCF  39.0     9.6 0.00021   32.6  -0.1   31  510-540    22-65  (88)
109 PF08666 SAF:  SAF domain;  Int  39.0      18 0.00038   29.2   1.5   16  796-811     3-18  (63)
110 PLN00104 MYST -like histone ac  38.3      45 0.00097   38.4   4.9   61   78-143    50-113 (450)
111 PF14446 Prok-RING_1:  Prokaryo  37.8      30 0.00065   27.6   2.4   37  508-546     5-44  (54)
112 PLN02436 cellulose synthase A   36.8      21 0.00046   45.0   2.3   49  396-446    36-86  (1094)
113 PF13771 zf-HC5HC2H:  PHD-like   35.2      34 0.00073   30.0   2.8   34  452-485    36-72  (90)
114 PF12906 RINGv:  RING-variant d  34.9      13 0.00029   28.6   0.1   30  399-429     1-33  (47)
115 PLN02400 cellulose synthase     34.7      27 0.00057   44.3   2.6   50  395-446    35-86  (1085)
116 PLN02638 cellulose synthase A   34.6      23  0.0005   44.8   2.1   50  395-446    16-67  (1079)
117 KOG0954 PHD finger protein [Ge  34.4      20 0.00043   43.0   1.4   30  510-541   325-354 (893)
118 PF08274 PhnA_Zn_Ribbon:  PhnA   34.3      20 0.00044   25.0   0.9   25  397-421     3-27  (30)
119 PF12773 DZR:  Double zinc ribb  33.5      32 0.00068   26.6   2.0   38  413-458    12-49  (50)
120 PF13832 zf-HC5HC2H_2:  PHD-zin  33.2      24 0.00053   32.3   1.6   29  396-428    55-85  (110)
121 smart00547 ZnF_RBZ Zinc finger  32.6      25 0.00054   23.1   1.1   23  438-460     2-24  (26)
122 cd04370 BAH BAH, or Bromo Adja  31.2      53  0.0011   30.1   3.6   41   80-128     2-46  (123)
123 PLN02915 cellulose synthase A   29.9      32 0.00069   43.5   2.2   49  396-446    15-65  (1044)
124 KOG0383 Predicted helicase [Ge  29.1      29 0.00062   42.3   1.6   31  509-542    48-78  (696)
125 KOG3409 Exosomal 3'-5' exoribo  28.8      74  0.0016   31.6   4.0   16   78-93    119-134 (193)
126 PF07227 DUF1423:  Protein of u  28.6      45 0.00097   38.1   2.9   31  398-428   130-161 (446)
127 PF12861 zf-Apc11:  Anaphase-pr  27.8      25 0.00054   30.8   0.6   34  395-429    20-62  (85)
128 PLN02195 cellulose synthase A   27.7      36 0.00079   42.7   2.2   48  397-446     7-56  (977)
129 KOG3038 Histone acetyltransfer  27.7 2.9E+02  0.0063   29.3   8.3   43   80-132   197-240 (264)
130 PF04810 zf-Sec23_Sec24:  Sec23  27.4      33 0.00071   25.5   1.1   31  413-446     2-32  (40)
131 smart00744 RINGv The RING-vari  26.9      24 0.00052   27.5   0.3   31  398-429     1-34  (49)
132 PF10497 zf-4CXXC_R1:  Zinc-fin  26.8      47   0.001   30.4   2.2   61  396-464     7-82  (105)
133 PF11302 DUF3104:  Protein of u  26.7 1.6E+02  0.0036   25.2   5.2   59   81-144     5-70  (75)
134 cd04721 BAH_plant_1 BAH, or Br  26.5      65  0.0014   30.7   3.2   53   78-138     4-58  (130)
135 KOG0696 Serine/threonine prote  26.3      19  0.0004   40.6  -0.5   33  397-429    57-89  (683)
136 PF07282 OrfB_Zn_ribbon:  Putat  26.1      60  0.0013   26.9   2.6   35  395-429    27-62  (69)
137 PLN02189 cellulose synthase     26.0      41 0.00088   42.6   2.2   49  396-446    34-84  (1040)
138 PF00641 zf-RanBP:  Zn-finger i  25.6      22 0.00048   24.4  -0.1   22  438-459     4-25  (30)
139 KOG2807 RNA polymerase II tran  25.2      48   0.001   36.1   2.2   84  434-539   272-361 (378)
140 PF09038 53-BP1_Tudor:  Tumour   25.2      62  0.0013   30.3   2.7   28    5-33     20-47  (122)
141 cd05791 S1_CSL4 S1_CSL4: CSL4,  25.1      46   0.001   29.6   1.9   17   80-96     59-75  (92)
142 PF11793 FANCL_C:  FANCL C-term  24.6      34 0.00074   28.8   0.8   33  509-541     3-40  (70)
143 PF08792 A2L_zn_ribbon:  A2L zi  23.7      69  0.0015   22.8   2.1   27  396-422     3-30  (33)
144 KOG2752 Uncharacterized conser  23.0      89  0.0019   34.0   3.7  112  398-543    57-169 (345)
145 KOG0782 Predicted diacylglycer  22.8      23 0.00051   40.9  -0.6   20  413-432   199-218 (1004)
146 KOG0695 Serine/threonine prote  22.7      32  0.0007   37.6   0.4   35  397-431   142-176 (593)
147 KOG4299 PHD Zn-finger protein   21.8      33 0.00071   40.6   0.3   46  396-446    47-94  (613)
148 KOG1844 PHD Zn-finger proteins  21.7      56  0.0012   38.6   2.2   49  397-447    87-135 (508)
149 PF10367 Vps39_2:  Vacuolar sor  21.4      76  0.0017   28.5   2.6   31  395-428    77-107 (109)
150 PF10367 Vps39_2:  Vacuolar sor  21.3      39 0.00085   30.4   0.6   30  509-539    79-108 (109)
151 PF09465 LBR_tudor:  Lamin-B re  21.0 1.7E+02  0.0037   23.5   4.0   50   80-141     4-54  (55)
152 PF12148 DUF3590:  Protein of u  20.5   1E+02  0.0022   27.1   2.9   31    1-31      9-45  (85)
153 smart00439 BAH Bromo adjacent   20.2 1.2E+02  0.0027   27.6   3.8   39   82-128     2-43  (120)

No 1  
>KOG1080 consensus Histone H3 (Lys4) methyltransferase complex, subunit SET1 and related methyltransferases [Chromatin structure and dynamics; Transcription]
Probab=100.00  E-value=2.2e-64  Score=599.30  Aligned_cols=506  Identities=35%  Similarity=0.617  Sum_probs=377.4

Q ss_pred             CceecCCchHHHHHHHHHHHhhccCCCCCcccccccccccccCccccCCCcHHHHHHHhhcCCCCCCCchhhhhhcc-cc
Q 002995          301 EQFTGSTPSTCWSKICMKIREGQNNTSDDFSAEGAAEKISESGSDMFGFSNPEVMKLILGLTKSRPTSKSSLCKLTS-KY  379 (860)
Q Consensus       301 ~~~~~~s~~~~W~~i~~~i~~~~~~~~~~~~~~~~~~~~~~sG~~~FGls~p~v~~li~~Lp~a~~c~k~~~~k~~s-~~  379 (860)
                      ..+-..+++.+|....+...+.+.........   ...+..++..+|++..+.....+.........+....++-.. +.
T Consensus       480 ~~~~~~~~~~~~k~~~~e~~k~~~~~~~~~k~---~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~p~s~~~~~~~s~~~~  556 (1005)
T KOG1080|consen  480 RVFHMLRYSVKKKKFLSEWERHTGATAKIWKD---SSRVKDELLPLPKWVESRGRSIMNTYNSEKPKSIVLMGKTSVQRM  556 (1005)
T ss_pred             eeecccCcchhhhhcccchhhhhccccccccc---ccccccccccchhhhhhccccccccccccCCcchhhhccchhhhh
Confidence            34778889999999988877666555433211   112334666777777776655566655555555544443322 23


Q ss_pred             cCCCCCCCCccccccCCCccccccCCcccCCCeeEEccccCceeccccccccccCCCCceeeecccCCCCCCCCCccccC
Q 002995          380 RDLPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLCP  459 (860)
Q Consensus       380 ~~~p~~~~~~~~~~~~~~~C~vC~~~~~~~~n~ll~C~~C~~~vH~~CYg~~~~~~~~~W~C~~C~~~~~~~~~~C~lC~  459 (860)
                      +..+..+......|.+...|.+|.+.+.+..|.++.|++|++.+|+.|||...++.+..|+|+.|...  ...+.|+||+
T Consensus       557 ~~~~~~~~~~~l~~~~t~~c~~~~~~~~~~~n~~~~~~~~~~~~~s~~~g~~~~~~~~~~~~~~~~~~--~~~r~~~l~~  634 (1005)
T KOG1080|consen  557 LLELIEKREPRLSKWTTERCAVCRDDEDWEKNVSIICDRCTRSVHSECYGNLKSYDGTSWVCDSCETL--DIKRSCCLCP  634 (1005)
T ss_pred             cCcccccchhhhcCCCcccccccccccccccceeeeeccccccCCCcccccCCCCCCCcchhhccccc--cCCchhhhcc
Confidence            44455566666666677899999999999999999999999999999999999898999999999864  4467899999


Q ss_pred             CCCCCceecCCCceeccccccccCceEEccCCCccccccchhhhccccCccceecCCcceeeEEcCCCCcCccccchhhh
Q 002995          460 VVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCAR  539 (860)
Q Consensus       460 ~~gGalk~t~~g~wvH~~Ca~~~p~~~~~~~~~~~~v~~i~~i~~~~~~~~C~~C~~~~Ga~iqC~~~~C~~~fH~~CA~  539 (860)
                      .+||||+++..|+|+|+.||.|.|++.+.++..|+|..++..++...+...|.+    .|.|.||.  .|.+.||.+||+
T Consensus       635 ~~g~al~p~d~gr~~~~e~a~~~~e~~~~~~~~~~p~~~~~~~p~~~~~~~~~~----~~~~~~~~--~~~~~~~~~~a~  708 (1005)
T KOG1080|consen  635 VKGGALKPTDEGRWVHVECAWFRPEVCLASPERMEPAVGTFKIPALSFLKICFI----HGSCRQCC--KCETGSHAMCAS  708 (1005)
T ss_pred             ccCcccCCCCccchhhhhchhccccccCCCccCCCCcccccccCccchhhhccc----cccccccc--hhhhcceehhhc
Confidence            999999999999999999999999999999999999999999999988888877    78899998  899999999999


Q ss_pred             hcCeeEEEeccccc----------------cccccCCCccccccceeeccccCCCcccchhhhhhccccccccccCCCCC
Q 002995          540 AAGLCVELEDEDRL----------------NLLSLDEDDEDQCIRLLSFCKKHKQPLNDRLAVDERLVQVTRRCCDYIPP  603 (860)
Q Consensus       540 ~~g~~~~~~~~~~~----------------~~~~~~~~~~~~~~~~~~~C~~H~~~~~~~~~~~~~~~~~~~~~~~~~p~  603 (860)
                      .+|+.++...-...                .++.++........+..+++.+|+...... .+..         .++.+ 
T Consensus       709 ~~~~~~~~~~l~~~~~~~~~~~~~~~~~~d~~l~~~~~~~~~~~~~~~~~k~~~~~g~~~-~~~~---------~~~~~-  777 (1005)
T KOG1080|consen  709 RAGYIMEAVSLEEVSQQTTSYVKEDGPGPDSVLKVNTPSGKFGAENLSQNKKSRTDGVRL-VLTY---------KEYAP-  777 (1005)
T ss_pred             CccChhhhhhhhhhhhhhhhhhhhccCCcccceeecCccccccccchhhhhhcccccccc-cccc---------ccccc-
Confidence            99987644311111                111121111122222333344433321111 0000         11111 


Q ss_pred             CCCCCcccCCcccccc--cCCCCChhHHHHhhhccccccCCCcccCCcccCCCCCCCcCcccccccccccccccCCCCcC
Q 002995          604 SNPSGCARSEPYNYFG--RRGRKEPEALAAASLKRLFVENQPYLVGGYCQNGLSGNTLPSIRVIGSKFSFSLHRDAPNFL  681 (860)
Q Consensus       604 ~npsGcaRsE~~~~~~--Rr~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  681 (860)
                       .+.-|+|.+|++...  ...+..+..++.+..+++..+..++..++..++.+.....    .......  +.-..+...
T Consensus       778 -~~~~~~~~~~~~~~~~~s~~~~~~~~r~~~~~~~~~~~s~~~~~s~~~s~~~s~~~r----l~q~rl~--a~~~~~~~~  850 (1005)
T KOG1080|consen  778 -LREIKSRAEPLNRIDFSSEARCRSESRSDNSKSPLAEESESDITSGGSSHDLSAEER----LNQFRLS--ASFTASFIL  850 (1005)
T ss_pred             -cccchhcccCCcccCccccccccchhhcccccccccccccccccccccccchhHHhh----hHHHHhh--hhccccccc
Confidence             123366667765331  1122333445566667777777788777666665432111    1011111  011223446


Q ss_pred             ChhhhhHHhhhcccee-EEEEEecccCeeEEeeecCCCCCEEEEecCEeeChhhHhhhHHHhhhcccCCcceeeeecCcc
Q 002995          682 SMADKYKHMKETFRKR-LAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDER  760 (860)
Q Consensus       682 ~~~~~~~~l~~~~~~~-l~v~~s~~~G~GvfA~~~i~~g~~I~ey~Gevi~~~~~~~r~~~~~~~~~~~~~y~~~~~~~~  760 (860)
                      ..+...+.+|..++++ |.|++|.||||||||.++|.+||||+||+||+|++..||.|+.+ |......++|+|++|++.
T Consensus       851 ~~~~~~~~~~~~~rkk~~~F~~s~iH~wglfa~~~i~~~dmViEY~Ge~vR~~iad~RE~~-Y~~~gi~~sYlfrid~~~  929 (1005)
T KOG1080|consen  851 DEAEVLRYNQLKFRKKYVKFGRSGIHGWGLFAMENIAAGDMVIEYRGELVRSSIADLREAR-YERMGIGDSYLFRIDDEV  929 (1005)
T ss_pred             chHHHHHHHHHhhhhhhhccccccccccceeeccCccccceEEEeeceehhhhHHHHHHHH-HhccCcccceeeecccce
Confidence            6777888888887755 99999999999999999999999999999999999999999975 877766889999999999


Q ss_pred             eeeccccCCccccCCCCCCCCeeEEEEEECCeeEEEEEEccCCCCCCeEEEecCCCCCCCCeEEeeCCCCCccccC
Q 002995          761 VIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVN  836 (860)
Q Consensus       761 ~iDa~~~gn~aRfiNHSC~PN~~~~~~~~~~~~~i~i~A~RdI~~GEELt~dY~~~~~~~~~~C~Cg~~~Cr~~~~  836 (860)
                      |||||++||+||||||||+|||+++++.|+|+.+|+|||.|+|.+||||||||.|+.++.+++|+||+++|||+||
T Consensus       930 ViDAtk~gniAr~InHsC~PNCyakvi~V~g~~~IvIyakr~I~~~EElTYDYkF~~e~~kipClCgap~Crg~~n 1005 (1005)
T KOG1080|consen  930 VVDATKKGNIARFINHSCNPNCYAKVITVEGDKRIVIYSKRDIAAGEELTYDYKFPTEDDKIPCLCGAPNCRGFLN 1005 (1005)
T ss_pred             EEeccccCchhheeecccCCCceeeEEEecCeeEEEEEEecccccCceeeeeccccccccccccccCCCccccccC
Confidence            9999999999999999999999999999999999999999999999999999999999999999999999999997


No 2  
>KOG0954 consensus PHD finger protein [General function prediction only]
Probab=100.00  E-value=9.7e-40  Score=362.92  Aligned_cols=313  Identities=30%  Similarity=0.618  Sum_probs=226.5

Q ss_pred             CcccccceeeCCEEEEeeeeeeeCCCCCCCCCcccCCCeEE-----EEEeccCCC-CCCcceEEEEEEecCCCCccceEE
Q 002995          221 GSIGISPYVFGDLQILSLGKIVKDSEYFQDDRFIWPEGYTA-----VRKFTSLAD-PRVCNSYKMEVLRDTESKIRPLFR  294 (860)
Q Consensus       221 ~~~~~~P~~~g~l~i~slG~iv~~~~~fh~~~~i~P~Gy~s-----~r~~~S~~d-p~~~~~y~~~I~~~~~~~~~P~F~  294 (860)
                      .+....|-++..-.+++.+++++.-+.-.+++++||.-++-     .+...|..- |.-.+.|..+=+.-...-.+|.-.
T Consensus       103 ~~adkkpkel~Rqdli~amkl~dse~~~p~~y~~~~D~wk~ewekgvQvpaspd~lpqp~v~~dse~v~~~~~fs~pkky  182 (893)
T KOG0954|consen  103 RGADKKPKELYRQDLISAMKLPDSEHLNPDEYLDFADPWKQEWEKGVQVPASPDTLPQPSVRVDSEDVQPETDFSRPKKY  182 (893)
T ss_pred             cccccchHHHHHhhhhhhccCccccccCccceeecCCccchhhhccccccCCCCcCCCcceeccchhcchhhhhcCCcce
Confidence            45567788887777778777765433333666776554441     222222211 233344444433222222456655


Q ss_pred             EEcCCCC------------------ceecCCchHHHHHHHHHHHhhccCCCCCcccccccccccccCccccCCCcHHHHH
Q 002995          295 VTLDNGE------------------QFTGSTPSTCWSKICMKIREGQNNTSDDFSAEGAAEKISESGSDMFGFSNPEVMK  356 (860)
Q Consensus       295 Vt~~~~~------------------~~~~~s~~~~W~~i~~~i~~~~~~~~~~~~~~~~~~~~~~sG~~~FGls~p~v~~  356 (860)
                      |...++.                  -|+-+-.+.+|...++.                   .+..+|  +|-+..-++.+
T Consensus       183 ivc~~~~~~e~~yn~~~~~lae~tcrydid~~d~awL~~~n~-------------------e~~~~G--~~~l~~~~~eR  241 (893)
T KOG0954|consen  183 IVCSDGEVPELGYNLLIKDLAESTCRYDIDDMDPAWLQLVNE-------------------ERAEMG--SLELDEGTFER  241 (893)
T ss_pred             EEeCCCCCcccchhhhHHHHhhhhhhcccccccHHHHHHhcc-------------------hHHhhC--CcccchHHHHH
Confidence            5543332                  13334455666655541                   123355  56666777888


Q ss_pred             HHhhcCCCCCCCchhhhhhcccccCCCCCCCCccccccCCCccccccCCcccCCCeeEEccccCceeccccccccccCCC
Q 002995          357 LILGLTKSRPTSKSSLCKLTSKYRDLPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNG  436 (860)
Q Consensus       357 li~~Lp~a~~c~k~~~~k~~s~~~~~p~~~~~~~~~~~~~~~C~vC~~~~~~~~n~ll~C~~C~~~vH~~CYg~~~~~~~  436 (860)
                      +|++|..-  |-+   +.......++-.+     +..+++..|+||...++++.|+||+||+|++.|||.|||+..+|++
T Consensus       242 iieelE~~--c~k---qi~~~l~~eeglg-----ie~dedviCDvCrspD~e~~neMVfCd~Cn~cVHqaCyGIle~p~g  311 (893)
T KOG0954|consen  242 IIEELERR--CKK---QINHALETEEGLG-----IEYDEDVICDVCRSPDSEEANEMVFCDKCNICVHQACYGILEVPEG  311 (893)
T ss_pred             HHHHHHHH--HHH---HHHhhhhhcccce-----eeccccceeceecCCCccccceeEEeccchhHHHHhhhceeecCCC
Confidence            99888521  221   2222222233333     3346889999999999999999999999999999999999998987


Q ss_pred             CceeeecccCCCCCCCCCccccCCCCCCceecCCC-ceeccccccccCceEEccCCCccccccchhhhccccCccceecC
Q 002995          437 VLWLCNLCRPGAPEPPPPCCLCPVVGGAMKPTTDG-RWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICG  515 (860)
Q Consensus       437 ~~W~C~~C~~~~~~~~~~C~lC~~~gGalk~t~~g-~wvH~~Ca~~~p~~~~~~~~~~~~v~~i~~i~~~~~~~~C~~C~  515 (860)
                       +|+|..|..+   ..+.|+|||.+|||||++..| +|+|+.||||+||++|.++..|+||..++.|+..||.+.|.+|+
T Consensus       312 -pWlCr~Calg---~~ppCvLCPkkGGamK~~~sgT~wAHvsCALwIPEVsie~~ekmePItkfs~IpesRwslvC~LCk  387 (893)
T KOG0954|consen  312 -PWLCRTCALG---IEPPCVLCPKKGGAMKPTKSGTKWAHVSCALWIPEVSIECPEKMEPITKFSHIPESRWSLVCNLCK  387 (893)
T ss_pred             -Ceeehhcccc---CCCCeeeccccCCcccccCCCCeeeEeeeeeccceeeccCHhhcCcccccCCCcHHHHHHHHHHhc
Confidence             9999999986   468999999999999999866 79999999999999999999999999999999999999999999


Q ss_pred             CcceeeEEcCCCCcCccccchhhhhcCeeEEEeccccccccccCCCccccccceeeccccCCCcc
Q 002995          516 VSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQPL  580 (860)
Q Consensus       516 ~~~Ga~iqC~~~~C~~~fH~~CA~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~H~~~~  580 (860)
                      .+.||||||+...|.++||+.||+.+|+.|++....            -+.+.+..||.+|+.-.
T Consensus       388 ~k~GACIqCs~k~C~t~fHv~CA~~aG~~~~~~~~~------------~D~v~~~s~c~khs~~~  440 (893)
T KOG0954|consen  388 VKSGACIQCSNKTCRTAFHVTCAFEAGLEMKTILKE------------NDEVKFKSYCSKHSDHR  440 (893)
T ss_pred             ccCcceEEecccchhhhccchhhhhcCCeeeeeecc------------CCchhheeecccccccc
Confidence            999999999999999999999999999998876432            11267889999998644


No 3  
>KOG4442 consensus Clathrin coat binding protein/Huntingtin interacting protein HIP1, involved in regulation of endocytosis [Intracellular trafficking, secretion, and vesicular transport]
Probab=100.00  E-value=2.5e-37  Score=345.47  Aligned_cols=160  Identities=28%  Similarity=0.491  Sum_probs=151.0

Q ss_pred             CChhhhhHHhhhccceeEEEEEecccCeeEEeeecCCCCCEEEEecCEeeChhhHhhhHHHhhhcccCCcceeeeecCcc
Q 002995          681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDER  760 (860)
Q Consensus       681 ~~~~~~~~~l~~~~~~~l~v~~s~~~G~GvfA~~~i~~g~~I~ey~Gevi~~~~~~~r~~~~~~~~~~~~~y~~~~~~~~  760 (860)
                      +-..|+|+..|+.-..+++|+.+..+||||.|..+|++|+||+||.||||...+++.|... |......+.|+|.+..+.
T Consensus       105 cg~~C~NQRFQkkqyA~vevF~Te~KG~GLRA~~dI~~g~FI~EY~GEVI~~~Ef~kR~~~-Y~~d~~kh~Yfm~L~~~e  183 (729)
T KOG4442|consen  105 CGVYCKNQRFQKKQYAKVEVFLTEKKGCGLRAEEDIPKGQFILEYIGEVIEEKEFEKRVKR-YAKDGIKHYYFMALQGGE  183 (729)
T ss_pred             ccccccchhhhhhccCceeEEEecCcccceeeccccCCCcEEeeeccccccHHHHHHHHHH-HHhcCCceEEEEEecCCc
Confidence            3467899999998889999999999999999999999999999999999999999999986 777777889999999999


Q ss_pred             eeeccccCCccccCCCCCCCCeeEEEEEECCeeEEEEEEccCCCCCCeEEEecCCCCCC-CCeEEeeCCCCCccccCCCh
Q 002995          761 VIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSID-EQLACYCGFPRCRGVVNDTE  839 (860)
Q Consensus       761 ~iDa~~~gn~aRfiNHSC~PN~~~~~~~~~~~~~i~i~A~RdI~~GEELt~dY~~~~~~-~~~~C~Cg~~~Cr~~~~~~~  839 (860)
                      +||||.+||+||||||||+|||++..|.|++..||+|||.|.|.+||||||||+|...+ ..++|+||+++|+|||++..
T Consensus       184 ~IDAT~KGnlaRFiNHSC~PNa~~~KWtV~~~lRvGiFakk~I~~GEEITFDYqf~rYGr~AQ~CyCgeanC~G~IGgk~  263 (729)
T KOG4442|consen  184 YIDATKKGNLARFINHSCDPNAEVQKWTVPDELRVGIFAKKVIKPGEEITFDYQFDRYGRDAQPCYCGEANCRGWIGGKP  263 (729)
T ss_pred             eecccccCcHHHhhcCCCCCCceeeeeeeCCeeEEEEeEecccCCCceeeEecccccccccccccccCCcccccccCCCC
Confidence            99999999999999999999999999999999999999999999999999999998876 67999999999999999995


Q ss_pred             HH
Q 002995          840 AE  841 (860)
Q Consensus       840 ~~  841 (860)
                      .+
T Consensus       264 q~  265 (729)
T KOG4442|consen  264 QT  265 (729)
T ss_pred             cc
Confidence            54


No 4  
>KOG0956 consensus PHD finger protein AF10 [General function prediction only]
Probab=100.00  E-value=3.7e-37  Score=339.25  Aligned_cols=167  Identities=37%  Similarity=0.839  Sum_probs=150.0

Q ss_pred             ccccccCCcccCCCeeEEcc--ccCceeccccccccccCCCCceeeecccCCCCCCCCCccccCCCCCCceecCCCceec
Q 002995          398 KCSVCHMDEEYQNNLFLQCD--KCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLCPVVGGAMKPTTDGRWAH  475 (860)
Q Consensus       398 ~C~vC~~~~~~~~n~ll~C~--~C~~~vH~~CYg~~~~~~~~~W~C~~C~~~~~~~~~~C~lC~~~gGalk~t~~g~wvH  475 (860)
                      -|+||-+...|.+|+||+||  +|-++|||.||||..+|.| +|||++|+..-......|.|||.++||||+|++|.|+|
T Consensus         7 GCCVCSDErGWaeNPLVYCDG~nCsVAVHQaCYGIvqVPtG-pWfCrKCesqeraarvrCeLCP~kdGALKkTDn~GWAH   85 (900)
T KOG0956|consen    7 GCCVCSDERGWAENPLVYCDGHNCSVAVHQACYGIVQVPTG-PWFCRKCESQERAARVRCELCPHKDGALKKTDNGGWAH   85 (900)
T ss_pred             ceeeecCcCCCccCceeeecCCCceeeeehhcceeEecCCC-chhhhhhhhhhhhccceeecccCcccceecccCCCceE
Confidence            69999988899999999999  7999999999999999999 99999999765445678999999999999999999999


Q ss_pred             cccccccCceEEccCCCccccccchhhhccccCccceecCC-------cceeeEEcCCCCcCccccchhhhhcCeeEEEe
Q 002995          476 LACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV-------SYGACIQCSNTTCRVAYHPLCARAAGLCVELE  548 (860)
Q Consensus       476 ~~Ca~~~p~~~~~~~~~~~~v~~i~~i~~~~~~~~C~~C~~-------~~Ga~iqC~~~~C~~~fH~~CA~~~g~~~~~~  548 (860)
                      |+|||+||||.|.++..||||. +..++..|+.++|.||..       ..|||++|...+|+++||++||+.+||..|.+
T Consensus        86 VVCALYIPEVrFgNV~TMEPIi-Lq~VP~dRfnKtCYIC~E~GrpnkA~~GACMtCNKs~CkqaFHVTCAQ~~GLLCEE~  164 (900)
T KOG0956|consen   86 VVCALYIPEVRFGNVHTMEPII-LQDVPHDRFNKTCYICNEEGRPNKAAKGACMTCNKSGCKQAFHVTCAQRAGLLCEEE  164 (900)
T ss_pred             EEEEeeccceeeccccccccee-eccCchhhhcceeeeecccCCccccccccceecccccchhhhhhhHhhhhccceecc
Confidence            9999999999999999999998 999999999999999975       48999999999999999999999999988654


Q ss_pred             ccccccccccCCCccccccceeeccccCCC
Q 002995          549 DEDRLNLLSLDEDDEDQCIRLLSFCKKHKQ  578 (860)
Q Consensus       549 ~~~~~~~~~~~~~~~~~~~~~~~~C~~H~~  578 (860)
                      ..            ...++++--||+.|-.
T Consensus       165 gn------------~~dNVKYCGYCk~Hfs  182 (900)
T KOG0956|consen  165 GN------------ISDNVKYCGYCKYHFS  182 (900)
T ss_pred             cc------------ccccceechhHHHHHH
Confidence            21            1223566668888864


No 5  
>COG5141 PHD zinc finger-containing protein [General function prediction only]
Probab=100.00  E-value=3.8e-37  Score=327.89  Aligned_cols=192  Identities=36%  Similarity=0.714  Sum_probs=169.4

Q ss_pred             CCCCC-CCCccccccCCCccccccCCcccCCCeeEEccccCceeccccccccccCCCCceeeecccCCCCCCCCCccccC
Q 002995          381 DLPGG-YRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLCP  459 (860)
Q Consensus       381 ~~p~~-~~~~~~~~~~~~~C~vC~~~~~~~~n~ll~C~~C~~~vH~~CYg~~~~~~~~~W~C~~C~~~~~~~~~~C~lC~  459 (860)
                      .+|+. ++|++++..-+..|.+|.+.+.++.|.||+||+|.+.|||+|||+...|+| .|+|++|..+.. ....|.+||
T Consensus       177 ~lp~k~vepi~~~d~~d~~C~~c~~t~~eN~naiVfCdgC~i~VHq~CYGI~f~peG-~WlCrkCi~~~~-~i~~C~fCp  254 (669)
T COG5141         177 GLPDKHVEPIEPSDEFDDICTKCTSTHNENSNAIVFCDGCEICVHQSCYGIQFLPEG-FWLCRKCIYGEY-QIRCCSFCP  254 (669)
T ss_pred             cCccccccccCCchhhhhhhHhccccccCCcceEEEecCcchhhhhhcccceecCcc-hhhhhhhccccc-ceeEEEecc
Confidence            46644 888888877778999999999988999999999999999999999999988 999999998643 345699999


Q ss_pred             CCCCCceecCCCceeccccccccCceEEccCCCccccccchhhhccccCccceecCCcceeeEEcCCCCcCccccchhhh
Q 002995          460 VVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCAR  539 (860)
Q Consensus       460 ~~gGalk~t~~g~wvH~~Ca~~~p~~~~~~~~~~~~v~~i~~i~~~~~~~~C~~C~~~~Ga~iqC~~~~C~~~fH~~CA~  539 (860)
                      .+.||+|+|.+|+|+|+.||+|+|+.+|.+...++||+||..++..||++.|.||+...|+||||++.+|.++||++||+
T Consensus       255 s~dGaFkqT~dgrW~H~iCA~~~pelsF~~l~~~dpI~~i~sVs~srwkl~C~iCk~~~GtcIqCs~~nC~~aYHVtCAr  334 (669)
T COG5141         255 SSDGAFKQTSDGRWGHVICAMFNPELSFGHLLSKDPIDNIASVSSSRWKLGCLICKEFGGTCIQCSYFNCTRAYHVTCAR  334 (669)
T ss_pred             CCCCceeeccCCchHhHhHHHhcchhccccccccchhhhhcccchhhHhheeeEEcccCcceeeecccchhhhhhhhhhh
Confidence            99999999999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             hcCeeEEEeccccccccccCCCccccccceeeccccCCCcccch
Q 002995          540 AAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQPLNDR  583 (860)
Q Consensus       540 ~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~H~~~~~~~  583 (860)
                      ++|+++.-       +  ..-++-.+++....||.+|.++...+
T Consensus       335 rag~f~~~-------~--~s~n~~s~~id~e~~c~kh~p~gy~~  369 (669)
T COG5141         335 RAGYFDLN-------I--YSHNGISYCIDHEPLCRKHYPLGYGR  369 (669)
T ss_pred             hcchhhhh-------h--hcccccceeecchhhhcCCCCcchhc
Confidence            99997631       0  11123355667788999999876544


No 6  
>KOG0955 consensus PHD finger protein BR140/LIN-49 [General function prediction only]
Probab=100.00  E-value=5.1e-35  Score=348.53  Aligned_cols=178  Identities=42%  Similarity=0.902  Sum_probs=158.8

Q ss_pred             CCCccccccCCcccCCCeeEEccccCceeccccccccccCCCCceeeecccCCCCCCCCCccccCCCCCCceecCCCcee
Q 002995          395 DLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLCPVVGGAMKPTTDGRWA  474 (860)
Q Consensus       395 ~~~~C~vC~~~~~~~~n~ll~C~~C~~~vH~~CYg~~~~~~~~~W~C~~C~~~~~~~~~~C~lC~~~gGalk~t~~g~wv  474 (860)
                      ++.+|+||.+.+....|.||+||+|+++|||+|||+..+|+| .|+|.+|..+.. ....|.|||.++||+|+|.+|+|+
T Consensus       218 ~D~~C~iC~~~~~~n~n~ivfCD~Cnl~VHq~Cygi~~ipeg-~WlCr~Cl~s~~-~~v~c~~cp~~~gAFkqt~dgrw~  295 (1051)
T KOG0955|consen  218 EDAVCCICLDGECQNSNVIVFCDGCNLAVHQECYGIPFIPEG-QWLCRRCLQSPQ-RPVRCLLCPSKGGAFKQTDDGRWA  295 (1051)
T ss_pred             CCccceeecccccCCCceEEEcCCCcchhhhhccCCCCCCCC-cEeehhhccCcC-cccceEeccCCCCcceeccCCcee
Confidence            457999999999998999999999999999999998888888 999999987443 447899999999999999999999


Q ss_pred             ccccccccCceEEccCCCccccccchhhhccccCccceecCCcc-eeeEEcCCCCcCccccchhhhhcCeeEEEeccccc
Q 002995          475 HLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSY-GACIQCSNTTCRVAYHPLCARAAGLCVELEDEDRL  553 (860)
Q Consensus       475 H~~Ca~~~p~~~~~~~~~~~~v~~i~~i~~~~~~~~C~~C~~~~-Ga~iqC~~~~C~~~fH~~CA~~~g~~~~~~~~~~~  553 (860)
                      |++||+|+|++.|.+...+++|++|+.|+..||++.|.+|++.. |+||||+..+|.++||++||+++|++|.+....  
T Consensus       296 Hv~caiwipev~F~nt~~~E~I~~i~~i~~aRwkL~cy~cK~~~~gaciqcs~~~c~~a~hvtca~~agl~m~~~~~~--  373 (1051)
T KOG0955|consen  296 HVVCAIWIPEVSFANTVFLEPIDSIENIPPARWKLTCYICKQKGLGACIQCSKANCYTAFHVTCARRAGLYMKSNTVK--  373 (1051)
T ss_pred             eeehhhcccccccccchhhccccchhcCcHhhhhceeeeeccCCCCcceecchhhhhhhhhhhhHhhcCceEeecccc--
Confidence            99999999999999999999999999999999999999999988 999999999999999999999999999844221  


Q ss_pred             cccccCCCccccccceeeccccCCCc
Q 002995          554 NLLSLDEDDEDQCIRLLSFCKKHKQP  579 (860)
Q Consensus       554 ~~~~~~~~~~~~~~~~~~~C~~H~~~  579 (860)
                         ....++-.+.+...+||++|.++
T Consensus       374 ---~~s~~~~s~~v~~~syC~~H~pp  396 (1051)
T KOG0955|consen  374 ---ELSKNGTSQSVNKISYCDKHTPP  396 (1051)
T ss_pred             ---cccccccccccceeeeccCCCCc
Confidence               11112223456789999999998


No 7  
>KOG1079 consensus Transcriptional repressor EZH1 [Transcription]
Probab=99.97  E-value=1.8e-32  Score=304.43  Aligned_cols=179  Identities=31%  Similarity=0.546  Sum_probs=154.3

Q ss_pred             ccccCCCcccCCcccCCCCCCCcCcccccccccccccccCCC-CcC------------ChhhhhHHhhhccceeEEEEEe
Q 002995          637 LFVENQPYLVGGYCQNGLSGNTLPSIRVIGSKFSFSLHRDAP-NFL------------SMADKYKHMKETFRKRLAFGKS  703 (860)
Q Consensus       637 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~------------~~~~~~~~l~~~~~~~l~v~~s  703 (860)
                      .|.+..- .|...|+++|+||.|...+..+.|.+..+.|+|. +++            ...|+|..|+++.++++.++.|
T Consensus       523 ~~CEk~C-~C~~dC~nrF~GC~Ck~QC~tkqCpC~~A~rECdPd~Cl~cg~~~~~d~~~~~C~N~~l~~~~qkr~llapS  601 (739)
T KOG1079|consen  523 TFCEKFC-YCSPDCRNRFPGCRCKAQCNTKQCPCYLAVRECDPDVCLMCGNVDHFDSSKISCKNTNLQRGEQKRVLLAPS  601 (739)
T ss_pred             cchhhcc-cCCHHHHhcCCCCCcccccccCcCchhhhccccCchHHhccCcccccccCccccccchhhhhhhcceeechh
Confidence            3444443 3778899999999997554444444444445443 222            3489999999999999999999


Q ss_pred             cccCeeEEeeecCCCCCEEEEecCEeeChhhHhhhHHHhhhcccCCcceeeeecCcceeeccccCCccccCCCCCCCCee
Q 002995          704 GIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCY  783 (860)
Q Consensus       704 ~~~G~GvfA~~~i~~g~~I~ey~Gevi~~~~~~~r~~~~~~~~~~~~~y~~~~~~~~~iDa~~~gn~aRfiNHSC~PN~~  783 (860)
                      .+.|||||+++.+.+++||.||+||+|++.|||+|.+ +|+..  ..+|+|+++.+++|||++.||.+||+|||-+|||+
T Consensus       602 dVaGwGlFlKe~v~KnefisEY~GE~IS~dEADrRGk-iYDr~--~cSflFnln~dyviDs~rkGnk~rFANHS~nPNCY  678 (739)
T KOG1079|consen  602 DVAGWGLFLKESVSKNEFISEYTGEIISHDEADRRGK-IYDRY--MCSFLFNLNNDYVIDSTRKGNKIRFANHSFNPNCY  678 (739)
T ss_pred             hccccceeeccccCCCceeeeecceeccchhhhhccc-ccccc--cceeeeeccccceEeeeeecchhhhccCCCCCCcE
Confidence            9999999999999999999999999999999999998 57754  47999999999999999999999999999999999


Q ss_pred             EEEEEECCeeEEEEEEccCCCCCCeEEEecCCCCCC
Q 002995          784 SRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSID  819 (860)
Q Consensus       784 ~~~~~~~~~~~i~i~A~RdI~~GEELt~dY~~~~~~  819 (860)
                      +++++|.|.+||.|||+|+|.+||||||||++..+.
T Consensus       679 Akvm~V~GdhRIGifAkRaIeagEELffDYrYs~~~  714 (739)
T KOG1079|consen  679 AKVMMVAGDHRIGIFAKRAIEAGEELFFDYRYSPEH  714 (739)
T ss_pred             EEEEEecCCcceeeeehhhcccCceeeeeeccCccc
Confidence            999999999999999999999999999999987554


No 8  
>KOG0957 consensus PHD finger protein [General function prediction only]
Probab=99.96  E-value=5.9e-30  Score=273.46  Aligned_cols=171  Identities=35%  Similarity=0.732  Sum_probs=146.9

Q ss_pred             CCCccccccCCcccCCCeeEEccccCceeccccccccc---cCCC------CceeeecccCCCCCCCCCccccCCCCCCc
Q 002995          395 DLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELE---PVNG------VLWLCNLCRPGAPEPPPPCCLCPVVGGAM  465 (860)
Q Consensus       395 ~~~~C~vC~~~~~~~~n~ll~C~~C~~~vH~~CYg~~~---~~~~------~~W~C~~C~~~~~~~~~~C~lC~~~gGal  465 (860)
                      +...|+||.+..+.+-+.||+||+||+.||..|||+..   .+.+      .+|||+.|..+..  .+.|-|||.++|++
T Consensus       118 k~~iCcVClg~rs~da~ei~qCd~CGi~VHEgCYGv~dn~si~s~~s~~stepWfCeaC~~Gvs--~P~CElCPn~~Gif  195 (707)
T KOG0957|consen  118 KAVICCVCLGQRSVDAGEILQCDKCGINVHEGCYGVLDNVSIPSGSSDCSTEPWFCEACLYGVS--LPHCELCPNRFGIF  195 (707)
T ss_pred             cceEEEEeecCccccccceeeccccCceecccccccccccccCCCCccCCCCchhhhhHhcCCC--CCccccCCCcCCcc
Confidence            34599999998888889999999999999999999751   1211      4799999998865  48999999999999


Q ss_pred             eecCCCceeccccccccCceEEccCCCccccccchhhhccccC-ccceecCC----cceeeEEcCCCCcCccccchhhhh
Q 002995          466 KPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWK-LLCSICGV----SYGACIQCSNTTCRVAYHPLCARA  540 (860)
Q Consensus       466 k~t~~g~wvH~~Ca~~~p~~~~~~~~~~~~v~~i~~i~~~~~~-~~C~~C~~----~~Ga~iqC~~~~C~~~fH~~CA~~  540 (860)
                      |.|+-|+|||++|||++|+|.|.....+.+|. +.......|. ..|++|..    +.|.||.|..+.|..+||++||+.
T Consensus       196 KetDigrWvH~iCALYvpGVafg~~~~l~~Vt-l~em~ysk~Gak~Cs~Ced~~fARtGvci~CdaGMCk~YfHVTCAQk  274 (707)
T KOG0957|consen  196 KETDIGRWVHAICALYVPGVAFGQTHTLCGVT-LEEMDYSKFGAKTCSACEDKIFARTGVCIRCDAGMCKEYFHVTCAQK  274 (707)
T ss_pred             cccchhhHHHHHHHhhcCcccccccccccccc-HHHhhhhhhccchhccccchhhhhcceeeeccchhhhhhhhhhHHhh
Confidence            99999999999999999999999999999887 6655555554 78999985    589999999999999999999999


Q ss_pred             cCeeEEEeccccccccccCCCccccccceeeccccCCCcc
Q 002995          541 AGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKHKQPL  580 (860)
Q Consensus       541 ~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~H~~~~  580 (860)
                      +|+.++..+++            +....+.+||++|+...
T Consensus       275 ~GlLvea~~e~------------DiAdpfya~CK~Ht~r~  302 (707)
T KOG0957|consen  275 LGLLVEATDEN------------DIADPFYAFCKKHTNRD  302 (707)
T ss_pred             hcceeeccccc------------cchhhHHHHHHhhcchh
Confidence            99998877654            33457889999999754


No 9  
>PF13832 zf-HC5HC2H_2:  PHD-zinc-finger like domain
Probab=99.91  E-value=9.4e-25  Score=202.43  Aligned_cols=109  Identities=50%  Similarity=1.122  Sum_probs=101.9

Q ss_pred             CccccCCCCCCceecCCCceeccccccccCceEEccCCCccccccchhhhccccCccceecCCcceeeEEcCCCCcCccc
Q 002995          454 PCCLCPVVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRVAY  533 (860)
Q Consensus       454 ~C~lC~~~gGalk~t~~g~wvH~~Ca~~~p~~~~~~~~~~~~v~~i~~i~~~~~~~~C~~C~~~~Ga~iqC~~~~C~~~f  533 (860)
                      .|+|||.+|||||+|.++.|||++||+|+|++.|.+...+++++ ++.++.+++++.|.+|++..|++|+|..++|.++|
T Consensus         2 ~C~lC~~~~Galk~t~~~~WvHv~Cal~~~~~~~~~~~~~~~v~-~~~i~~~~~~~~C~iC~~~~G~~i~C~~~~C~~~f   80 (110)
T PF13832_consen    2 SCVLCPKRGGALKRTSDGQWVHVLCALWIPEVIFNNGESMEPVD-ISNIPPSRFKLKCSICGKSGGACIKCSHPGCSTAF   80 (110)
T ss_pred             ccEeCCCCCCcccCccCCcEEEeEccceeCccEEeechhcCccc-ceeecchhcCCcCcCCCCCCceeEEcCCCCCCcCC
Confidence            69999999999999999999999999999999999999999999 99999999999999999999999999999999999


Q ss_pred             cchhhhhcCeeEEEeccccccccccCCCccccccceeeccccC
Q 002995          534 HPLCARAAGLCVELEDEDRLNLLSLDEDDEDQCIRLLSFCKKH  576 (860)
Q Consensus       534 H~~CA~~~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~C~~H  576 (860)
                      ||+||+.+|+.+++...+.             ...+.+||++|
T Consensus        81 H~~CA~~~g~~~~~~~~~~-------------~~~~~~~C~~H  110 (110)
T PF13832_consen   81 HPTCARKAGLYFEIENEED-------------NVQFIAYCPKH  110 (110)
T ss_pred             CHHHHHHCCCeEEeeecCC-------------CceEEEECCCC
Confidence            9999999999998775421             35789999999


No 10 
>cd05834 HDGF_related The PWWP domain is an essential part of the Hepatoma Derived Growth Factor (HDGF) family of proteins, and is necessary for DNA binding by HDGF. This family of endogenous nuclear-targeted mitogens includes HRP (HDGF-related proteins 1, 2, 3, 4, or HPR1, HPR2, HPR3, HPR4, respectively) and lens epithelium-derived growth factor, LEDGF. Members of the HDGF family have been linked to human diseases, and HDGF is a prognostic factor in several types of cancer. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes.
Probab=99.89  E-value=9.2e-24  Score=183.34  Aligned_cols=81  Identities=27%  Similarity=0.515  Sum_probs=73.0

Q ss_pred             CCCCCCEEEEEecccCCCceEeeCCCCcCccccccccCCCCcEEEEEeCCCCEEEEeCCCcccccccccccccccCChHH
Q 002995           81 ELEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLNKISGGRSIPVQFFGTHDFARINVKQVISFLKGLLSSFHLKCKKPR  160 (860)
Q Consensus        81 ~f~~GdlVwaK~~g~PwWPa~V~~~~~~~~~~~~~~~~~~~~~~V~FFg~~~~awv~~~~l~~f~~~~~~~~~~k~k~~~  160 (860)
                      +|++||||||||+|||||||+|++++..        .+..+.|+|+|||++++|||++++|+||+++ ..++..+++++.
T Consensus         2 ~f~~GdlVwaK~kGyp~WPa~I~~~~~~--------~~~~~~~~V~FfGt~~~a~v~~~~l~pf~~~-~~~~~~~~k~k~   72 (83)
T cd05834           2 QFKAGDLVFAKVKGYPAWPARVDEPEDW--------KPPGKKYPVYFFGTHETAFLKPEDLFPYTEN-KKKFGKPKKRKG   72 (83)
T ss_pred             CCCCCCEEEEecCCCCCCCEEEeccccc--------CCCCCEEEEEEeCCCCEeEECHHHceecccc-hhhhccccchHH
Confidence            6999999999999999999999998653        2345779999999999999999999999997 677888889999


Q ss_pred             HHHHHHHHHH
Q 002995          161 FTQSLEEAKV  170 (860)
Q Consensus       161 ~~~A~~ea~~  170 (860)
                      |++||+|+++
T Consensus        73 F~~Av~eie~   82 (83)
T cd05834          73 FNEAVWEIEK   82 (83)
T ss_pred             HHHHHHHHhh
Confidence            9999999975


No 11 
>cd05836 N_Pac_NP60 The PWWP domain is an essential part of the cytokine-like nuclear factor n-pac protein, or NP60, which enhances the activity of MAP2K4 and MAP2K6 kinases to phosphorylate p38-alpha.  In a variety of cell lines, NP60 has been shown to localize to the nucleus. In addition to the PWWP domain, NP60 also contains an AT-hook and a C-terminal NAD-binding domain. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding proteins, that function as transcription factors regulating a variety of developmental processes.
Probab=99.89  E-value=8.5e-24  Score=185.03  Aligned_cols=86  Identities=24%  Similarity=0.458  Sum_probs=73.3

Q ss_pred             CCCCCEEEEEecccCCCceEeeCCCCcCccccccccCCCCcEEEEEeCCCCEEEEeCCCcccccccccccccccCChHHH
Q 002995           82 LEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLNKISGGRSIPVQFFGTHDFARINVKQVISFLKGLLSSFHLKCKKPRF  161 (860)
Q Consensus        82 f~~GdlVwaK~~g~PwWPa~V~~~~~~~~~~~~~~~~~~~~~~V~FFg~~~~awv~~~~l~~f~~~~~~~~~~k~k~~~~  161 (860)
                      |++||||||||+|||||||+|++++..    ..+.+.+.+.++|+|||+++||||.+++|+||+++ ...+...+|.+.|
T Consensus         1 f~~GDlVwaK~~g~P~WPa~V~~~~~~----~~~~~~~~~~~~V~FFG~~~~~wv~~~~l~pF~~~-~~~~~~~~k~~~F   75 (86)
T cd05836           1 LKLGDLVWAKMKGFPPWPGRIVKPPKD----LKKPRGKAKCFFVFFFGSENHAWIKEENIKPYHEH-KEEMIKLNKGARF   75 (86)
T ss_pred             CCCCCEEEEeCCCCCCCCEEEechhhh----cccccCCCCeEEEEEeCCCCEEEECHHhCeechhh-HHHHhcccchHHH
Confidence            789999999999999999999986532    22222335889999999999999999999999997 6677777888999


Q ss_pred             HHHHHHHHHHH
Q 002995          162 TQSLEEAKVYL  172 (860)
Q Consensus       162 ~~A~~ea~~~~  172 (860)
                      ++||+|+++++
T Consensus        76 ~~Av~~ie~~~   86 (86)
T cd05836          76 QQAVDAIEEYI   86 (86)
T ss_pred             HHHHHHHHHhC
Confidence            99999999873


No 12 
>KOG1082 consensus Histone H3 (Lys9) methyltransferase SUV39H1/Clr4, required for transcriptional silencing [Chromatin structure and dynamics; Transcription]
Probab=99.89  E-value=1.5e-23  Score=233.91  Aligned_cols=159  Identities=30%  Similarity=0.429  Sum_probs=128.9

Q ss_pred             CChhhhhHHhhhccceeEEEEEecccCeeEEeeecCCCCCEEEEecCEeeChhhHhhhHHHh-hhcccCCccee------
Q 002995          681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFI-YNSLVGAGTYM------  753 (860)
Q Consensus       681 ~~~~~~~~~l~~~~~~~l~v~~s~~~G~GvfA~~~i~~g~~I~ey~Gevi~~~~~~~r~~~~-~~~~~~~~~y~------  753 (860)
                      +...|.|+.+|.+.+.+|+|.+++.+||||++.+.|++|+||+||+||+++..+++.+.... |........+.      
T Consensus       161 C~~~C~nRv~q~g~~~~leIfrt~~kGwgvRs~~~I~~G~fvcEyaGe~~t~~e~~~~~~~~~~~~~~~~~~~~~~~~~~  240 (364)
T KOG1082|consen  161 CHPDCANRVVQKGLQFHLEVFRTPEKGWGVRTLDPIPAGEFVCEYAGEVLTSEEAQRRTHLREYLDDDCDAYSIADREWV  240 (364)
T ss_pred             CCCcCcchhhccccccceEEEecCCceeeecccccccCCCeeEEEeeEecChHHhhhccccccccccccccchhhhcccc
Confidence            34578999999999999999999999999999999999999999999999999998774211 11100111111      


Q ss_pred             ---------------eeecCcceeeccccCCccccCCCCCCCCeeEEEEEECCe----eEEEEEEccCCCCCCeEEEecC
Q 002995          754 ---------------FRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGD----EHIIIFAKRDIKQWEELTYDYR  814 (860)
Q Consensus       754 ---------------~~~~~~~~iDa~~~gn~aRfiNHSC~PN~~~~~~~~~~~----~~i~i~A~RdI~~GEELt~dY~  814 (860)
                                     ......+.|||...||++|||||||.||+.+..+..++.    .+|+|||+++|.+||||||||+
T Consensus       241 ~~~~~~~~~~~~~~~~~~~~~~~ida~~~GNv~RfinHSC~PN~~~~~v~~~~~~~~~~~i~ffa~~~I~p~~ELT~dYg  320 (364)
T KOG1082|consen  241 DESPVGNTFVAPSLPGGPGRELLIDAKPHGNVARFINHSCSPNLLYQAVFQDEFVLLYLRIGFFALRDISPGEELTLDYG  320 (364)
T ss_pred             ccccccccccccccccCCCcceEEchhhcccccccccCCCCccceeeeeeecCCccchheeeeeeccccCCCcccchhhc
Confidence                           122467789999999999999999999999988877633    5899999999999999999999


Q ss_pred             CCCC------------CCCeEEeeCCCCCccccCCCh
Q 002995          815 FFSI------------DEQLACYCGFPRCRGVVNDTE  839 (860)
Q Consensus       815 ~~~~------------~~~~~C~Cg~~~Cr~~~~~~~  839 (860)
                      ....            ..+..|.||+.+||+.+....
T Consensus       321 ~~~~~~~~~~~~~~~~~~~~~c~c~~~~cr~~~~~~~  357 (364)
T KOG1082|consen  321 KAYKLLVQDGANIYTPVMKKNCNCGLEKCRGLLGSAP  357 (364)
T ss_pred             ccccccccccccccccccchhhcCCCHHhCcccCCCc
Confidence            6532            245789999999999886654


No 13 
>cd05838 WHSC1_related The PWWP domain was first identified in the WHSC1 (Wolf-Hirschhorn syndrome candidate 1) protein, a protein implicated in Wolf-Hirschhorn syndrome (WHS).  When translocated, WHSC1 plays a role in lymphoid multiple myeloma (MM) disease, also known as plasmacytoma. WHCS1 proteins typically contain two copies of the PWWP domain.  The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes.
Probab=99.87  E-value=9.8e-23  Score=181.95  Aligned_cols=92  Identities=29%  Similarity=0.404  Sum_probs=74.7

Q ss_pred             CCCCCEEEEEecccCCCceEeeCCCCcCccccccccCCCCcEEEEEeCCCCEEEEeCCCccccccccccccc--ccCChH
Q 002995           82 LEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLNKISGGRSIPVQFFGTHDFARINVKQVISFLKGLLSSFH--LKCKKP  159 (860)
Q Consensus        82 f~~GdlVwaK~~g~PwWPa~V~~~~~~~~~~~~~~~~~~~~~~V~FFg~~~~awv~~~~l~~f~~~~~~~~~--~k~k~~  159 (860)
                      +.+||||||||+|||||||+|++++.++. +....+++.+.++|+|||+++|+||.+++|+||.+++.....  ++++.+
T Consensus         1 ~~~GdlVWaK~~g~pwWPa~V~~~~~~p~-~~~~~~~~~~~~~V~Ffgs~~y~Wv~~~~l~pf~e~~~~~~~~~~~~~~~   79 (95)
T cd05838           1 PLYGDIVWAKLGNFRWWPAIICDPREVPP-NIQVLRHCIGEFCVMFFGTHDYYWVHRGRVFPYQEGDKGFKEQTKSYLAK   79 (95)
T ss_pred             CCcCCEEEEECCCCCCCCeEEcChhhcCh-hHhhccCCCCeEEEEEeCCCCEEEeccccccchhhhhhhhhhhhhhhhHH
Confidence            35899999999999999999999887766 333334556889999999999999999999999987433222  224669


Q ss_pred             HHHHHHHHHHHHHHh
Q 002995          160 RFTQSLEEAKVYLSE  174 (860)
Q Consensus       160 ~~~~A~~ea~~~~~~  174 (860)
                      .|++||+||.++++.
T Consensus        80 ~f~~AleEA~~~~~~   94 (95)
T cd05838          80 RFRKALEEASLAFKG   94 (95)
T ss_pred             HHHHHHHHHHHHhhc
Confidence            999999999998753


No 14 
>cd05840 SPBC215_ISWI_like The PWWP domain is a component of the S. pombe hypothetical protein SPBC215, as well as ISWI complex protein 4.  The ISWI (imitation switch) proteins are ATPases responsible for chromatin remodeling in eukaryotes, and SPBC215 is proposed to also bind chromatin.   The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding,  proteins that function as transcription factors regulating a variety of developmental processes.
Probab=99.87  E-value=6.5e-23  Score=181.82  Aligned_cols=88  Identities=32%  Similarity=0.528  Sum_probs=76.2

Q ss_pred             CCCCCEEEEEecccCCCceEeeCCCCcCccccc--cccCCCCcEEEEEeCCCCEEEEeCCCcccccccccccccccC--C
Q 002995           82 LEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGL--NKISGGRSIPVQFFGTHDFARINVKQVISFLKGLLSSFHLKC--K  157 (860)
Q Consensus        82 f~~GdlVwaK~~g~PwWPa~V~~~~~~~~~~~~--~~~~~~~~~~V~FFg~~~~awv~~~~l~~f~~~~~~~~~~k~--k  157 (860)
                      |++||||||||+|||||||+|++++.++. ..+  +.+++.+.|+|+|||+++|+||.+++|+||+++.+..+..+.  |
T Consensus         1 f~~GDlVwaK~~GyPwWPA~V~~~~~~p~-~~l~~~~~~~~~~~~V~FFg~~~~~Wv~~~~l~pl~~~~~~~~l~~~~~k   79 (93)
T cd05840           1 FQPGDRVLAKVKGFPAWPAIVVPEEMLPD-SVLKGKKKKNKRTYPVMFFPDGDYYWVPNKDLKPLTEEKIAKFLKKPKRK   79 (93)
T ss_pred             CCCCCEEEEeCCCCCCCCEEECChHHCCH-HHHhcccCCCCCeEEEEEeCCCcEEEEChhhcccCCHHHHHHHhhcCCCC
Confidence            78999999999999999999999988877 445  445677999999999999999999999999977677777654  5


Q ss_pred             hHHHHHHHHHHHH
Q 002995          158 KPRFTQSLEEAKV  170 (860)
Q Consensus       158 ~~~~~~A~~ea~~  170 (860)
                      ++.+.+|++.|.+
T Consensus        80 ~k~l~~ay~~A~~   92 (93)
T cd05840          80 DKELIKAYKAAKD   92 (93)
T ss_pred             CHHHHHHHHHhcC
Confidence            6889999998853


No 15 
>smart00317 SET SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain. Putative methyl transferase, based on outlier plant homologues
Probab=99.87  E-value=9.7e-22  Score=183.20  Aligned_cols=115  Identities=45%  Similarity=0.712  Sum_probs=100.4

Q ss_pred             EEEEEecccCeeEEeeecCCCCCEEEEecCEeeChhhHhhhHHHhhhcccCCcceeeeecCcceeeccccCCccccCCCC
Q 002995          698 LAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHS  777 (860)
Q Consensus       698 l~v~~s~~~G~GvfA~~~i~~g~~I~ey~Gevi~~~~~~~r~~~~~~~~~~~~~y~~~~~~~~~iDa~~~gn~aRfiNHS  777 (860)
                      +.+.+++++|+||||+++|++|++|++|.|.++...++..+... +........|+|......+||+...||++||||||
T Consensus         2 ~~~~~~~~~G~gl~a~~~i~~g~~i~~~~g~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~id~~~~~~~~~~iNHs   80 (116)
T smart00317        2 LEVFKSPGKGWGVRATEDIPKGEFIGEYVGEIITSEEAEERSKA-YDTDGADSFYLFEIDSDLCIDARRKGNIARFINHS   80 (116)
T ss_pred             cEEEecCCCcEEEEECCccCCCCEEEEEEeEEECHHHHHHHHHH-HHhcCCCCEEEEECCCCEEEeCCccCcHHHeeCCC
Confidence            56777789999999999999999999999999998887776542 22222225788998888999999999999999999


Q ss_pred             CCCCeeEEEEEECCeeEEEEEEccCCCCCCeEEEec
Q 002995          778 CEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDY  813 (860)
Q Consensus       778 C~PN~~~~~~~~~~~~~i~i~A~RdI~~GEELt~dY  813 (860)
                      |.|||.......++..++.|+|+|||++|||||+||
T Consensus        81 c~pN~~~~~~~~~~~~~~~~~a~r~I~~GeEi~i~Y  116 (116)
T smart00317       81 CEPNCELLFVEVNGDSRIVIFALRDIKPGEELTIDY  116 (116)
T ss_pred             CCCCEEEEEEEECCCcEEEEEECCCcCCCCEEeecC
Confidence            999999998888777799999999999999999999


No 16 
>cd05835 Dnmt3b_related The PWWP domain is an essential component of DNA methyltransferase 3 B (Dnmt3b) which is responsible for establishing DNA methylation patterns during embryogenesis and gametogenesis.  In tumorigenesis, DNA methylation by Dnmt3b is known to play a role in the inactivation of tumor suppressor genes.  In addition, a point mutation in the PWWP domain of Dnmt3b has been identified in patients with ICF syndrome (immunodeficiency, centromeric instability, and facial anomalies), a rare autosomal recessive disorder characterized by hypomethylation of classical satellite DNA. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes.
Probab=99.86  E-value=2.5e-22  Score=176.60  Aligned_cols=84  Identities=23%  Similarity=0.410  Sum_probs=71.1

Q ss_pred             CCCCCEEEEEecccCCCceEeeCCCCcCccccccccCCCCcEEEEEeCCCCEEEEeCCCccccccccccc--ccccCChH
Q 002995           82 LEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLNKISGGRSIPVQFFGTHDFARINVKQVISFLKGLLSS--FHLKCKKP  159 (860)
Q Consensus        82 f~~GdlVwaK~~g~PwWPa~V~~~~~~~~~~~~~~~~~~~~~~V~FFg~~~~awv~~~~l~~f~~~~~~~--~~~k~k~~  159 (860)
                      |.+||||||||+|||||||+|++++....     .+...+.++|+|||+++++||++++|+||.++ ...  ...++|+.
T Consensus         1 f~vGDlVWaK~kg~pwWP~~V~~~~~~~~-----~~~~~~~~~V~fFGs~~~a~v~~~~l~pf~e~-~~~f~~~~~~k~~   74 (87)
T cd05835           1 FNVGDLVWGKIKGFPWWPGRVVSITVTSK-----RPPVVGMRWVTWFGSGTFSEVSVDKLSPFSEF-FKAFSRYNRKKKG   74 (87)
T ss_pred             CCCCCEEEEecCCCCCCCeEEechhhccc-----ccCCCCeEEEEEeCCCCEeEECHHHCcChhHh-HHHHhhhhhhhhH
Confidence            78999999999999999999999865443     13556889999999999999999999999997 555  34445679


Q ss_pred             HHHHHHHHHHHH
Q 002995          160 RFTQSLEEAKVY  171 (860)
Q Consensus       160 ~~~~A~~ea~~~  171 (860)
                      .|++||+||.+.
T Consensus        75 ~f~~Ai~eA~e~   86 (87)
T cd05835          75 LYKKAIYEALEV   86 (87)
T ss_pred             HHHHHHHHHHHc
Confidence            999999999874


No 17 
>cd05162 PWWP The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids.  The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation.  Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes.  The function of the PWWP domain is still not known precisely; however, based on the fact that other regions of PWWP-domain proteins are responsible for nuclear localization and DNA-binding, is likely that the PWWP domain acts as a site for protein-protein binding interactions, influencing chromatin remodeling and thereby regulating transcriptional processes.  Some PWWP-domain proteins have been linked to cancer or other diseases; some are known to function as growth factors.
Probab=99.86  E-value=5.5e-22  Score=175.65  Aligned_cols=86  Identities=30%  Similarity=0.539  Sum_probs=74.8

Q ss_pred             CCCCCEEEEEecccCCCceEeeCCCCcCccccccccCCCCcEEEEEeCCCCEEEEeCCCcccccccccccccccCChHHH
Q 002995           82 LEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLNKISGGRSIPVQFFGTHDFARINVKQVISFLKGLLSSFHLKCKKPRF  161 (860)
Q Consensus        82 f~~GdlVwaK~~g~PwWPa~V~~~~~~~~~~~~~~~~~~~~~~V~FFg~~~~awv~~~~l~~f~~~~~~~~~~k~k~~~~  161 (860)
                      |++|||||||++|||||||+|+++...+...  ..+...+.++|+|||+++|+||+.++|.||.++... ...+++++.|
T Consensus         1 f~~GdlVwaK~~g~pwWPa~V~~~~~~~~~~--~~~~~~~~~~V~Ffg~~~~~wv~~~~l~pf~~~~~~-~~~~~k~~~f   77 (87)
T cd05162           1 FRPGDLVWAKMKGYPWWPALVVDPPKDSKKA--KKKAKEGKVLVLFFGDKTFAWVGAERLKPFTEHKES-EAKQSKRKGF   77 (87)
T ss_pred             CCCCCEEEEeCCCCCCCCEEEccccccchhh--hccCCCCEEEEEEeCCCcEEEeCccceeeccchHHh-hccCCccHHH
Confidence            7899999999999999999999998776622  334567889999999999999999999999998444 6667788999


Q ss_pred             HHHHHHHHH
Q 002995          162 TQSLEEAKV  170 (860)
Q Consensus       162 ~~A~~ea~~  170 (860)
                      ++||++|.+
T Consensus        78 ~~A~~eA~~   86 (87)
T cd05162          78 KKAYDEALE   86 (87)
T ss_pred             HHHHHHHHh
Confidence            999999976


No 18 
>cd05841 BS69_related The PWWP domain is part of BS69 protein, a nuclear protein that specifically binds adenoviral E1A and Epstein-Barr viral EBNA2 proteins, suppressing their transactivation functions.  BS69 is a multi-domain protein, containing bromo, PHD, PWWP, and MYND domains.  The specific role of the PWWP domain within BS69 is not clearly identified, but BS69 functions in chromatin remodeling, consistent with other PWWP-containing proteins. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes.
Probab=99.85  E-value=9.7e-22  Score=168.22  Aligned_cols=76  Identities=28%  Similarity=0.562  Sum_probs=67.4

Q ss_pred             CCCCCCEEEEEecccCCCceEeeCCCCcCccccccccCCCCcEEEEEeC-CCCEEEEeCCCcccccccccccccccCChH
Q 002995           81 ELEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLNKISGGRSIPVQFFG-THDFARINVKQVISFLKGLLSSFHLKCKKP  159 (860)
Q Consensus        81 ~f~~GdlVwaK~~g~PwWPa~V~~~~~~~~~~~~~~~~~~~~~~V~FFg-~~~~awv~~~~l~~f~~~~~~~~~~k~k~~  159 (860)
                      -++|||||||||+|||||||+|+.++             .+.|+|+||| ++++|||+.++|+||+++ +.++..++|++
T Consensus         6 c~~p~dLVwAK~kGyp~WPAkV~~~~-------------~~~~~V~FFG~t~~~a~v~~~~i~~~~~~-~~~~~~~~k~~   71 (83)
T cd05841           6 CRPPHELVWAKLKGFPYWPAKVMRVE-------------DNQVDVRFFGGQHDRAWIPSNNIQPISTE-IPQQLVKKRSR   71 (83)
T ss_pred             cCCCCCEEEEeCCCCCCCCEEEeecC-------------CCeEEEEEcCCCCCeEEEehHHeeehhhh-hhhhccccccH
Confidence            36899999999999999999998753             1459999999 999999999999999997 55677788889


Q ss_pred             HHHHHHHHHHH
Q 002995          160 RFTQSLEEAKV  170 (860)
Q Consensus       160 ~~~~A~~ea~~  170 (860)
                      .|++||+||+.
T Consensus        72 ~f~~A~~Eie~   82 (83)
T cd05841          72 GFNKAMDELEL   82 (83)
T ss_pred             HHHHHHHHHHh
Confidence            99999999975


No 19 
>KOG1083 consensus Putative transcription factor ASH1/LIN-59 [Transcription]
Probab=99.84  E-value=3.7e-22  Score=230.17  Aligned_cols=133  Identities=35%  Similarity=0.568  Sum_probs=118.3

Q ss_pred             hhhHHhhh-ccceeEEEEEecccCeeEEeeecCCCCCEEEEecCEeeChhhHhhhHHHhhhcccCCcceeeeecCcceee
Q 002995          685 DKYKHMKE-TFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVID  763 (860)
Q Consensus       685 ~~~~~l~~-~~~~~l~v~~s~~~G~GvfA~~~i~~g~~I~ey~Gevi~~~~~~~r~~~~~~~~~~~~~y~~~~~~~~~iD  763 (860)
                      |+|+.|++ +....|.+.+.+..||||.|+++|++||||+||+|+||...+++.+...+|.  ...+.|...++.+.+||
T Consensus      1166 c~nqrm~r~e~cp~L~v~~gp~~G~~v~tk~PikagtfI~EYvGeVit~ke~e~~mmtl~~--~d~~~~cL~I~p~l~id 1243 (1306)
T KOG1083|consen 1166 CSNQRMQRHEECPPLEVFRGPKKGWGVRTKEPIKAGTFIMEYVGEVITEKEFEPRMMTLYH--NDDDHYCLVIDPGLFID 1243 (1306)
T ss_pred             hhhHHhhhhccCCCcceeccCCCCccccccccccccchHHHHHHHHHHHHhhcccccccCC--CCCcccccccCccccCC
Confidence            66766655 5678999999999999999999999999999999999998877776332232  23467999999999999


Q ss_pred             ccccCCccccCCCCCCCCeeEEEEEECCeeEEEEEEccCCCCCCeEEEecCCCCCC
Q 002995          764 ATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSID  819 (860)
Q Consensus       764 a~~~gn~aRfiNHSC~PN~~~~~~~~~~~~~i~i~A~RdI~~GEELt~dY~~~~~~  819 (860)
                      +.++||.+|||||||.|||.++.+.|+|..||++||+|||.+||||||||+|...+
T Consensus      1244 ~~R~~n~~RfinhscKPNc~~qkwSVNG~~Rv~L~A~rDi~kGEELtYDYN~ks~~ 1299 (1306)
T KOG1083|consen 1244 IPRMGNGARFINHSCKPNCEMQKWSVNGEYRVGLFALRDLPKGEELTYDYNFKSFN 1299 (1306)
T ss_pred             hhhccccccccccccCCCCccccccccceeeeeeeecCCCCCCceEEEeccccccC
Confidence            99999999999999999999999999999999999999999999999999987655


No 20 
>PF00855 PWWP:  PWWP domain;  InterPro: IPR000313 Upon characterisation of WHSC1, a gene mapping to the Wolf-Hirschhornsyndrome critical region and at its C terminus similar to the Drosophila melanogaster ASH1/trithorax group proteins, a novel protein domain designated PWWP domain was identified []. The PWWP domain is named after a conserved Pro-Trp-Trp-Pro motif. It is present in proteins of nuclear origin and plays a role in cell growth and differentiation. Due to its position, the composition of amino acids close to the PWWP motif and the pattern of other domains present it has been suggested that the domain is involved in protein-protein interactions [].; PDB: 3LYI_B 2L89_A 2NLU_A 1RI0_A 1KHC_A 3QKJ_C 2DAQ_A 1N27_A 3PFS_B 3QJ6_A ....
Probab=99.83  E-value=2.7e-21  Score=171.17  Aligned_cols=84  Identities=32%  Similarity=0.579  Sum_probs=71.5

Q ss_pred             CCCCCEEEEEecccCCCceEeeCCCCcCccccccccCCCCcEEEEEeCCCCEEEEeCCCccccccccccccccc--CChH
Q 002995           82 LEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLNKISGGRSIPVQFFGTHDFARINVKQVISFLKGLLSSFHLK--CKKP  159 (860)
Q Consensus        82 f~~GdlVwaK~~g~PwWPa~V~~~~~~~~~~~~~~~~~~~~~~V~FFg~~~~awv~~~~l~~f~~~~~~~~~~k--~k~~  159 (860)
                      |++|||||||++|||||||+|++++....     .....+.++|+|||+++++||+.++|.||.+. ...+..+  ++++
T Consensus         1 f~~GdlVWaK~~g~pwWPa~V~~~~~~~~-----~~~~~~~~~V~Ffg~~~~~wv~~~~i~~f~~~-~~~~~~~~~~k~~   74 (86)
T PF00855_consen    1 FRPGDLVWAKLKGYPWWPARVCDPDEKSK-----KKRKDGHVLVRFFGDNDYAWVKPSNIKPFSEF-KEKLKKKKKKKRK   74 (86)
T ss_dssp             -STTEEEEEEETTSEEEEEEEEECCHCTS-----CSSSSTEEEEEETTTTEEEEEEGGGEEECCHH-HHHHHHHHHHHSH
T ss_pred             CCCCCEEEEEeCCCCCCceEEeecccccc-----cCCCCCEEEEEecCCCCEEEECHHHhhChhhh-HHHHHHhhccchH
Confidence            78999999999999999999999875444     24667889999999999999999999999954 5556654  5669


Q ss_pred             HHHHHHHHHHHH
Q 002995          160 RFTQSLEEAKVY  171 (860)
Q Consensus       160 ~~~~A~~ea~~~  171 (860)
                      .|++||+||+++
T Consensus        75 ~~~~Ai~eA~~~   86 (86)
T PF00855_consen   75 SFRKAIEEAEEA   86 (86)
T ss_dssp             HHHHHHHHHHHH
T ss_pred             HHHHHHHHHHhC
Confidence            999999999875


No 21 
>smart00293 PWWP domain with conserved PWWP motif. conservation of Pro-Trp-Trp-Pro residues
Probab=99.81  E-value=2.2e-20  Score=154.24  Aligned_cols=63  Identities=33%  Similarity=0.636  Sum_probs=56.2

Q ss_pred             CCCCCEEEEEecccCCCceEeeCCCCcCccccccccCCCCcEEEEEeCCCCEEEEeCCCccccc
Q 002995           82 LEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLNKISGGRSIPVQFFGTHDFARINVKQVISFL  145 (860)
Q Consensus        82 f~~GdlVwaK~~g~PwWPa~V~~~~~~~~~~~~~~~~~~~~~~V~FFg~~~~awv~~~~l~~f~  145 (860)
                      |++||||||||+|||||||+|++++..+. ...+.++..+.|+|+|||++++|||.+++|+||+
T Consensus         1 f~~GdlVwaK~~G~p~WPa~V~~~~~~~~-~~~~~~~~~~~~~V~Ffg~~~~awv~~~~l~p~~   63 (63)
T smart00293        1 FKPGDLVWAKMKGFPWWPALVVSPKETPD-NIRKRKRFENLYPVLFFGDKDTAWISSSKLFPLT   63 (63)
T ss_pred             CCCCCEEEEECCCCCCCCeEEcCcccCCh-hHhhccCCCCEEEEEEeCCCCEEEECccceeeCC
Confidence            78999999999999999999999987766 3444566778999999999999999999999985


No 22 
>KOG1085 consensus Predicted methyltransferase (contains a SET domain) [General function prediction only]
Probab=99.80  E-value=1.3e-19  Score=183.13  Aligned_cols=124  Identities=37%  Similarity=0.507  Sum_probs=110.2

Q ss_pred             ccceeEEEEEecccCeeEEeeecCCCCCEEEEecCEeeChhhHhhhHHHhhhcccCCcceeeee---cCcceeeccccC-
Q 002995          693 TFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRI---DDERVIDATRAG-  768 (860)
Q Consensus       693 ~~~~~l~v~~s~~~G~GvfA~~~i~~g~~I~ey~Gevi~~~~~~~r~~~~~~~~~~~~~y~~~~---~~~~~iDa~~~g-  768 (860)
                      +....+.+..-.++|.||.|++.+++|+||.||.|.+|.-.++..|+.. |......++|||.+   +..++||||+-- 
T Consensus       253 g~~egl~~~~~dgKGRGv~a~~~F~rgdFVVEY~Gdliei~eAk~rE~~-Ya~De~~GcYMYyF~h~sk~yCiDAT~et~  331 (392)
T KOG1085|consen  253 GTNEGLLEVYKDGKGRGVRAKVNFERGDFVVEYRGDLIEISEAKVREEQ-YANDEEIGCYMYYFEHNSKKYCIDATKETP  331 (392)
T ss_pred             ccccceeEEeeccccceeEeecccccCceEEEEecceeeechHHHHHHH-hccCcccceEEEeeeccCeeeeeecccccc
Confidence            3456677888888999999999999999999999999999999999986 66666778888765   467899999854 


Q ss_pred             CccccCCCCCCCCeeEEEEEECCeeEEEEEEccCCCCCCeEEEecCCCC
Q 002995          769 SIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFS  817 (860)
Q Consensus       769 n~aRfiNHSC~PN~~~~~~~~~~~~~i~i~A~RdI~~GEELt~dY~~~~  817 (860)
                      -+.|+||||-.+||..+++.++|.+|++++|.|||++||||+||||.-+
T Consensus       332 ~lGRLINHS~~gNl~TKvv~Idg~pHLiLvA~rdIa~GEELlYDYGDRS  380 (392)
T KOG1085|consen  332 WLGRLINHSVRGNLKTKVVEIDGSPHLILVARRDIAQGEELLYDYGDRS  380 (392)
T ss_pred             cchhhhcccccCcceeeEEEecCCceEEEEeccccccchhhhhhccccc
Confidence            4799999999999999999999999999999999999999999998643


No 23 
>cd05837 MSH6_like The PWWP domain is present in MSH6, a mismatch repair protein homologous to bacterial MutS.   The PWWP domain of histone-lysine N-methyltransferase, also known as Nuclear SET domain-containing protein 3, is also included. Mutations in MSH6 have been linked to increased cancer susceptibility, particularly in hereditary nonpolyposis colorectal cancer in humans.  The role of the PWWP domain in MSH6 is not clear; MSH6 orthologs found in S. cerevisiae, Caenorhabditis elegans and Arabidopsis thaliana lack the PWWP domain.   Histone methyltransferases (HMTases) induce the posttranslational methylation of lysine residues in histones and play a role in apoptosis.  In the HMTase Whistle, the PWWP domain is necessary for HMTase activity. The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain pro
Probab=99.78  E-value=3.1e-19  Score=164.05  Aligned_cols=90  Identities=26%  Similarity=0.416  Sum_probs=65.5

Q ss_pred             CCCCCCEEEEEecccCCCceEeeCCCCcCccc-cccccCCCCcEEEEEeCCC-CEEEEeCCCccccccccccc-------
Q 002995           81 ELEPGDIIWAKLTGHAMWPAIVVDESLIGDYK-GLNKISGGRSIPVQFFGTH-DFARINVKQVISFLKGLLSS-------  151 (860)
Q Consensus        81 ~f~~GdlVwaK~~g~PwWPa~V~~~~~~~~~~-~~~~~~~~~~~~V~FFg~~-~~awv~~~~l~~f~~~~~~~-------  151 (860)
                      .|.+||||||||+|||||||+|++++..+... .....+..+.|+|+|||++ ++|||+.++|.||++.....       
T Consensus         2 ~~~~GdlVWaK~~g~PwWPa~V~~~~~~~~~~~~~~~~~~~~~~~V~FFG~~~~~aWv~~~~l~pf~~~~~~~~~~~~~~   81 (110)
T cd05837           2 KYQVGDLVWAKVSGYPWWPCMVCSDPLLGTYTKTKRNKRKPRQYHVQFFGDNPERAWISEKSLKPFKGSKQFESEKGEKF   81 (110)
T ss_pred             CCCCCCEEEEeCCCCCCCCEEEecccccchhhhhhhccCCCCeEEEEEcCCCCCEEEecHHHccccCCchhhhhhhhhhh
Confidence            58999999999999999999999877665522 1133456688999999985 79999999999999752111       


Q ss_pred             -ccccCChHHHHHHHHHHHH
Q 002995          152 -FHLKCKKPRFTQSLEEAKV  170 (860)
Q Consensus       152 -~~~k~k~~~~~~A~~ea~~  170 (860)
                       ..+.+.++.+++|..+++.
T Consensus        82 ~~~K~~~~~~~~~a~~~~~~  101 (110)
T cd05837          82 KVRKPNIKKARQKADIAIMQ  101 (110)
T ss_pred             hccCCcchhHHHHHHHHHHH
Confidence             1122233566666666554


No 24 
>cd06080 MUM1_like Mutated melanoma-associated antigen 1 (MUM-1) is a melanoma-associated antigen (MAA).  MUM-1 belongs to the mutated or aberrantly expressed type of MAAs, along with antigens such as CDK4, beta-catenin, gp100-in4, p15, and N-acetylglucosaminyltransferase V.  It is highly expressed in several types of human cancers.  The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding, proteins that function as transcription factors regulating a variety of developmental processes.
Probab=99.70  E-value=1.7e-17  Score=141.30  Aligned_cols=75  Identities=25%  Similarity=0.476  Sum_probs=57.3

Q ss_pred             CCCCCEEEEEecccCCCceEeeCCCCcCccccccccCCCCcEEEEEeCCC-CEEEEeCCCcccccccccccccccCChHH
Q 002995           82 LEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLNKISGGRSIPVQFFGTH-DFARINVKQVISFLKGLLSSFHLKCKKPR  160 (860)
Q Consensus        82 f~~GdlVwaK~~g~PwWPa~V~~~~~~~~~~~~~~~~~~~~~~V~FFg~~-~~awv~~~~l~~f~~~~~~~~~~k~k~~~  160 (860)
                      |++|||||||++|||||||+|.+...           +.+.++|+||||+ +++|+..++|+||.++ .....+....+.
T Consensus         1 f~~gdlVWaK~~g~P~WPa~I~~~~~-----------~~~k~~V~FfG~~~~~a~~~~~~l~p~~~~-~~~~ek~~~~~k   68 (80)
T cd06080           1 FEKNDLVWAKIQGYPWWPAVIKSISR-----------KKQKARVNFIGDNMQSEKKGIRVVKRWLKH-FDCTEKQKLTNK   68 (80)
T ss_pred             CCCCCEEEEeCCCCCCCCEEEeeecC-----------CCCEEEEEEeCCCCceeccchhhccccccc-HHHHHHHHHHHH
Confidence            78999999999999999999987532           2457999999999 7999999999999997 333222223344


Q ss_pred             HHHHHHHH
Q 002995          161 FTQSLEEA  168 (860)
Q Consensus       161 ~~~A~~ea  168 (860)
                      .++++++|
T Consensus        69 ~ke~~~~a   76 (80)
T cd06080          69 AKESYEQA   76 (80)
T ss_pred             HHHHHHHH
Confidence            45555544


No 25 
>cd05839 BR140_related The PWWP domain is found in the BR140 family, which includes peregrin and BR140-like proteins 1 and 2.   BR140 is the only family to contain the PWWP domain at the C terminus, with PHD and bromo domains in the N-terminal region.  In myeloid leukemias, BR140 is disrupted by chromosomal translocations, similar to translocations of WHSC1 in lymphoid multiple myeloma.  The PWWP domain, named for a conserved Pro-Trp-Trp-Pro motif, is a small domain consisting of 100-150 amino acids. The PWWP domain is found in numerous proteins that are involved in cell division, growth and differentiation. Most PWWP-domain proteins seem to be nuclear, often DNA-binding proteins, that function as transcription factors regulating a variety of developmental processes.
Probab=99.69  E-value=1.9e-17  Score=149.61  Aligned_cols=66  Identities=23%  Similarity=0.394  Sum_probs=53.1

Q ss_pred             CCCCCEEEEEecccCCCceEeeCCCCcCcc---c-----------cccccCCCCcEEEEEeCC-CCEEEEeCCCcccccc
Q 002995           82 LEPGDIIWAKLTGHAMWPAIVVDESLIGDY---K-----------GLNKISGGRSIPVQFFGT-HDFARINVKQVISFLK  146 (860)
Q Consensus        82 f~~GdlVwaK~~g~PwWPa~V~~~~~~~~~---~-----------~~~~~~~~~~~~V~FFg~-~~~awv~~~~l~~f~~  146 (860)
                      +.||||||||++|||||||+|+++......   .           .++.......|+|+||++ .+++||+.++|.||..
T Consensus         1 ~~pg~lVwaK~~g~P~wPa~iidp~~~~~~~~~~~~p~~~l~~~~~~~~~~~~~~~lV~FFd~~~s~~Wv~~~~l~pl~~   80 (111)
T cd05839           1 LEPLTLVWAKCRGYPSYPALIIDPKMPRDGVFHNGVPPDVLTLGEARAQNADERLYLVLFFDNKRTWQWLPGDKLEPLGV   80 (111)
T ss_pred             CCCcCEeeeeecCCCCCCeEeeCCCCCCcccccCCCCchhhhHHHHHhccCCCcEEEEEEecCCCcceecCHHHCccccc
Confidence            469999999999999999999998653321   1           112345667899999998 6799999999999986


Q ss_pred             c
Q 002995          147 G  147 (860)
Q Consensus       147 ~  147 (860)
                      +
T Consensus        81 ~   81 (111)
T cd05839          81 D   81 (111)
T ss_pred             c
Confidence            4


No 26 
>PF05964 FYRN:  F/Y-rich N-terminus;  InterPro: IPR003888 The "FY-rich" domain N-terminal region is sometimes closely juxtaposed with the C-terminal region (IPR003889 from INTERPRO), but sometimes is far distant. It is of unknown function, but occurs frequently in chromatin-associated proteins like trithorax and its homologues.; GO: 0005634 nucleus; PDB: 2WZO_A.
Probab=99.67  E-value=5.4e-17  Score=128.93  Aligned_cols=54  Identities=43%  Similarity=0.746  Sum_probs=42.3

Q ss_pred             eCCEEEEeeeeeeeCCCCCCCCCcccCCCeEEEEEeccCCCCCCcceEEEEEEe
Q 002995          230 FGDLQILSLGKIVKDSEYFQDDRFIWPEGYTAVRKFTSLADPRVCNSYKMEVLR  283 (860)
Q Consensus       230 ~g~l~i~slG~iv~~~~~fh~~~~i~P~Gy~s~r~~~S~~dp~~~~~y~~~I~~  283 (860)
                      +|+|+|+|||+|++++++||+++||||+||+++|.|+|+.||.+++.|+|+|+|
T Consensus         1 igsl~v~sLG~i~~~~~~fh~~~~IyP~Gy~s~R~y~S~~~p~~~~~Y~~~Ild   54 (54)
T PF05964_consen    1 IGSLTVHSLGKIVPDRPAFHSERYIYPVGYKSSRLYWSTVDPRRRCRYTCEILD   54 (54)
T ss_dssp             -TTEEEEEEEE---SSGGGB-SS-B--EEEEEEEEEE-SS-TTSEEEEEEEEE-
T ss_pred             CCceEEEECeEEeCCCCCccCCCEEeeCCEEEEEEEccccCCCCEEEEEEEEeC
Confidence            589999999999999989999999999999999999999999999999999986


No 27 
>PF13771 zf-HC5HC2H:  PHD-like zinc-binding domain
Probab=99.62  E-value=3e-16  Score=140.03  Aligned_cols=88  Identities=38%  Similarity=0.759  Sum_probs=79.0

Q ss_pred             ccccccccCceEEccCCC--ccccccchhhhccccCccceecCCcceeeEEcCCCCcCccccchhhhhcCeeEEEecccc
Q 002995          475 HLACAIWIPETCLTDVKR--MEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHPLCARAAGLCVELEDEDR  552 (860)
Q Consensus       475 H~~Ca~~~p~~~~~~~~~--~~~v~~i~~i~~~~~~~~C~~C~~~~Ga~iqC~~~~C~~~fH~~CA~~~g~~~~~~~~~~  552 (860)
                      |+.||+|+|++.+.+...  +.++.+++.+..++++++|.+|+++.||+|+|..++|.+.||++||+.+|+.+++...  
T Consensus         1 H~~Calwsp~v~~~~~~~~~~~~i~~v~~~~~~~~~~~C~~C~~~~Ga~i~C~~~~C~~~fH~~CA~~~~~~~~~~~~--   78 (90)
T PF13771_consen    1 HENCALWSPEVYFDESEDIGGFSIEDVEKEIKRRRKLKCSICKKKGGACIGCSHPGCSRSFHVPCARKAGCFIEFDED--   78 (90)
T ss_pred             ChHHheecCceEEeCCCccccccHHhHHHHHHHHhCCCCcCCCCCCCeEEEEeCCCCCcEEChHHHccCCeEEEEccC--
Confidence            899999999999998764  6788999999999999999999998899999999999999999999999999876531  


Q ss_pred             ccccccCCCccccccceeeccccCC
Q 002995          553 LNLLSLDEDDEDQCIRLLSFCKKHK  577 (860)
Q Consensus       553 ~~~~~~~~~~~~~~~~~~~~C~~H~  577 (860)
                                   ...+.+||++|+
T Consensus        79 -------------~~~~~~~C~~H~   90 (90)
T PF13771_consen   79 -------------NGKFRIFCPKHS   90 (90)
T ss_pred             -------------CCceEEEChhcC
Confidence                         136889999996


No 28 
>COG2940 Proteins containing SET domain [General function prediction only]
Probab=99.60  E-value=1.3e-16  Score=185.14  Aligned_cols=141  Identities=40%  Similarity=0.674  Sum_probs=118.7

Q ss_pred             cceeEEEEEecccCeeEEeeecCCCCCEEEEecCEeeChhhHhhhHHHhhhcccCCcceeeeecC-cceeeccccCCccc
Q 002995          694 FRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDD-ERVIDATRAGSIAH  772 (860)
Q Consensus       694 ~~~~l~v~~s~~~G~GvfA~~~i~~g~~I~ey~Gevi~~~~~~~r~~~~~~~~~~~~~y~~~~~~-~~~iDa~~~gn~aR  772 (860)
                      ......+..+.++|+|+||++.|++|++|.+|.|++++..++..++... . ..+....++.+.. ..++|+...|+.+|
T Consensus       330 ~~~~~~~~~~~~~~~g~fa~~~i~~~e~i~~~~~~~~~~~~~~~~~~~~-~-~~~~~~~~~~~~~~~~~~d~~~~g~~~r  407 (480)
T COG2940         330 RREPNVVQESEIKGYGVFALESIKKGEFIIEYHGEIIRRKEAREREENY-D-LLGNEFSFGLLEDKDKVRDSQKAGDVAR  407 (480)
T ss_pred             ccchhhhhhhcccccceeehhhccchHHHHHhcCcccchHHHHhhhccc-c-ccccccchhhccccchhhhhhhcccccc
Confidence            3556778888999999999999999999999999999999998888753 2 2233333333333 78999999999999


Q ss_pred             cCCCCCCCCeeEEEEEECCeeEEEEEEccCCCCCCeEEEecCCCCCC---------CCeEEeeCCCCCccccC
Q 002995          773 LINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSID---------EQLACYCGFPRCRGVVN  836 (860)
Q Consensus       773 fiNHSC~PN~~~~~~~~~~~~~i~i~A~RdI~~GEELt~dY~~~~~~---------~~~~C~Cg~~~Cr~~~~  836 (860)
                      ||||||.|||......+.|..++.++|+|||.+||||++||+...++         ..+.|.|+++.|+++|+
T Consensus       408 ~~nHS~~pN~~~~~~~~~g~~~~~~~~~rDI~~geEl~~dy~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  480 (480)
T COG2940         408 FINHSCTPNCEASPIEVNGIFKISIYAIRDIKAGEELTYDYGPSLEDNRELKKLLEKRWGCACGEDRCSHTMS  480 (480)
T ss_pred             eeecCCCCCcceecccccccceeeecccccchhhhhhccccccccccchhhhhhhhhhhccccCCCccCCCCC
Confidence            99999999999998888777799999999999999999999886655         35789999999999874


No 29 
>PF05965 FYRC:  F/Y rich C-terminus;  InterPro: IPR003889 The "FY-rich" domain C-terminal region is sometimes closely juxtaposed with the N-terminal region (IPR003888 from INTERPRO), but sometimes is far distant. It is of unknown function, but occurs frequently in chromatin-associated proteins like trithorax and its homologues.; GO: 0005634 nucleus; PDB: 2WZO_A.
Probab=99.60  E-value=5.8e-16  Score=136.62  Aligned_cols=79  Identities=37%  Similarity=0.594  Sum_probs=57.7

Q ss_pred             cceEEEEcCC--CCceecCCchHHHHHHHHHHHhhccCCCCCcccccccccccccCccccCCCcHHHHHHHhhcCCCCCC
Q 002995          290 RPLFRVTLDN--GEQFTGSTPSTCWSKICMKIREGQNNTSDDFSAEGAAEKISESGSDMFGFSNPEVMKLILGLTKSRPT  367 (860)
Q Consensus       290 ~P~F~Vt~~~--~~~~~~~s~~~~W~~i~~~i~~~~~~~~~~~~~~~~~~~~~~sG~~~FGls~p~v~~li~~Lp~a~~c  367 (860)
                      +|+|+|+.+|  +..|.|+||++||.+|+++|++.+.....     .......++|++||||++|.|++|||+||+|++|
T Consensus         3 ~P~F~Vt~~d~p~~~~~g~s~~~~W~~i~~~v~~~r~~~~~-----~~~~~~~isG~~~FGls~p~V~~lie~Lp~a~~c   77 (86)
T PF05965_consen    3 GPLFEVTSEDDPGEVFEGSSPTEAWSEILERVNEARKQSGL-----LKLPPNSISGPEMFGLSNPAVQRLIESLPGADKC   77 (86)
T ss_dssp             SEEEEEEETT-GGG-EEESSHHHHHHHHHHHHHHHHT------------TT----HHHHHSTTSHHHHHHHTTSTTGGG-
T ss_pred             CCEEEEEECCCCCCEEEeCCHHHHHHHHHHHHHHHHhhccc-----cccCCCCCCHhHhcCCCCHHHHHHHHhCCCcchh
Confidence            3999999999  45699999999999999999987766431     1122457899999999999999999999999999


Q ss_pred             Cchhhh
Q 002995          368 SKSSLC  373 (860)
Q Consensus       368 ~k~~~~  373 (860)
                      .+|.++
T Consensus        78 ~~Y~f~   83 (86)
T PF05965_consen   78 SNYKFR   83 (86)
T ss_dssp             TT----
T ss_pred             hcCCcc
Confidence            999654


No 30 
>smart00542 FYRC "FY-rich" domain, C-terminal region. is sometimes closely juxtaposed with the N-terminal region (FYRN), but sometimes is far distant. Unknown function, but occurs frequently in chromatin-associated proteins.
Probab=99.60  E-value=8.6e-16  Score=134.73  Aligned_cols=79  Identities=30%  Similarity=0.498  Sum_probs=66.2

Q ss_pred             eEEEEcCC--CCceecCCchHHHHHHHHHHHhhccCCCCCcccccccccccccCccccCCCcHHHHHHHhhcCCCCCCCc
Q 002995          292 LFRVTLDN--GEQFTGSTPSTCWSKICMKIREGQNNTSDDFSAEGAAEKISESGSDMFGFSNPEVMKLILGLTKSRPTSK  369 (860)
Q Consensus       292 ~F~Vt~~~--~~~~~~~s~~~~W~~i~~~i~~~~~~~~~~~~~~~~~~~~~~sG~~~FGls~p~v~~li~~Lp~a~~c~k  369 (860)
                      +|+|+.++  +..|.|+||++||.+|+++|++.+......     ......+||++||||++|+|++|||+||+|++|.+
T Consensus         1 lF~v~~~~~~~~~~~~~S~~~~W~~vl~~v~~~r~~~~~~-----~~~~~~isG~~mFGls~p~V~~lie~Lpga~~C~~   75 (86)
T smart00542        1 LFRVEIESDPDEVFKGESPEKCWEMVLERVQEARIVARLL-----QLLPEGVSGEDMFGLSSPAVVKLIEQLPGVHQCTN   75 (86)
T ss_pred             CeEEEEecCCCCeEEeCCHHHHHHHHHHHHHHHHHHcccC-----CCCCCCCCcHHHhCCCcHHHHHHHHhCCCchhhhh
Confidence            58999877  567999999999999999999887664321     11234579999999999999999999999999999


Q ss_pred             hhhhhh
Q 002995          370 SSLCKL  375 (860)
Q Consensus       370 ~~~~k~  375 (860)
                      |.++..
T Consensus        76 Y~~~~~   81 (86)
T smart00542       76 YWFRYH   81 (86)
T ss_pred             hhhccC
Confidence            998754


No 31 
>KOG1141 consensus Predicted histone methyl transferase [Chromatin structure and dynamics]
Probab=99.57  E-value=1.4e-15  Score=171.85  Aligned_cols=76  Identities=34%  Similarity=0.711  Sum_probs=68.4

Q ss_pred             ceeeccccCCccccCCCCCCCCeeEEEEEECCe----eEEEEEEccCCCCCCeEEEecCCCCC---CCCeEEeeCCCCCc
Q 002995          760 RVIDATRAGSIAHLINHSCEPNCYSRVISVNGD----EHIIIFAKRDIKQWEELTYDYRFFSI---DEQLACYCGFPRCR  832 (860)
Q Consensus       760 ~~iDa~~~gn~aRfiNHSC~PN~~~~~~~~~~~----~~i~i~A~RdI~~GEELt~dY~~~~~---~~~~~C~Cg~~~Cr  832 (860)
                      ++|||+..||++||+||||.||+.++.++|+..    +-++|||.|-|+||.|||+||++.-.   .....|.||+.+||
T Consensus      1179 yvIDAk~eGNlGRfLNHSC~PNl~VQnVfvdTHdlrfPwVAFFt~kyVkAgtELTWDY~Ye~g~v~~keL~C~CGa~~Cr 1258 (1262)
T KOG1141|consen 1179 YVIDAKQEGNLGRFLNHSCDPNLHVQNVFVDTHDLRFPWVAFFTRKYVKAGTELTWDYQYEQGQVATKELTCHCGAENCR 1258 (1262)
T ss_pred             EEEecccccchhhhhccCCCccceeeeeeeeccccCCchhhhhhhhhhccCceeeeeccccccccccceEEEecChhhhh
Confidence            479999999999999999999999999999865    45899999999999999999998643   25689999999999


Q ss_pred             ccc
Q 002995          833 GVV  835 (860)
Q Consensus       833 ~~~  835 (860)
                      |.|
T Consensus      1259 grL 1261 (1262)
T KOG1141|consen 1259 GRL 1261 (1262)
T ss_pred             ccc
Confidence            976


No 32 
>KOG1904 consensus Transcription coactivator [Transcription]
Probab=99.45  E-value=6.5e-14  Score=156.80  Aligned_cols=88  Identities=23%  Similarity=0.374  Sum_probs=76.7

Q ss_pred             CCCCCCCCEEEEEecccCCCceEeeCCCCcCccccccccCCCCcEEEEEeCCCCEEEEeCCCcccccccccccccccCCh
Q 002995           79 CQELEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLNKISGGRSIPVQFFGTHDFARINVKQVISFLKGLLSSFHLKCKK  158 (860)
Q Consensus        79 ~~~f~~GdlVwaK~~g~PwWPa~V~~~~~~~~~~~~~~~~~~~~~~V~FFg~~~~awv~~~~l~~f~~~~~~~~~~k~k~  158 (860)
                      ...|++||||||||+|||.|||+|.+......      ++....|.|+|||++++|||.+++|+||..+ .+.+....++
T Consensus        10 ~~~~~~GDLV~AKlkgyp~WParI~~~~~~~~------kp~pkky~V~FfGT~e~Afl~p~dlqpy~~~-k~~~g~~~k~   82 (496)
T KOG1904|consen   10 AGNFKCGDLVFAKLKGYPPWPARIRNGPDGAV------KPPPKKYTVFFFGTKETAFLKPKDLQPYMLN-KEKLGKPNKR   82 (496)
T ss_pred             cCCCCCCceeeecccCCCCCcccccCcccccc------cCCCceeEEEEeccCcccccchhhccchhhh-hhhcccchhh
Confidence            45799999999999999999999988654322      2356779999999999999999999999998 7888888888


Q ss_pred             --HHHHHHHHHHHHHHH
Q 002995          159 --PRFTQSLEEAKVYLS  173 (860)
Q Consensus       159 --~~~~~A~~ea~~~~~  173 (860)
                        +.|++|++++.+++.
T Consensus        83 ~~k~F~~av~eI~~a~~   99 (496)
T KOG1904|consen   83 VWKGFIEAVEEIREAFN   99 (496)
T ss_pred             hhHHHHHHHHHHHHHhc
Confidence              999999999999873


No 33 
>PF00856 SET:  SET domain;  InterPro: IPR001214 The SET domain appears generally as one part of a larger multidomain protein, and recently there were described three structures of very different proteins with distinct domain compositions: Neurospora crassa DIM-5, a member of the Su(var) family of HKMTs which methylate histone H3 on lysine 9,human SET7 (also called SET9), which methylates H3 on lysine 4 and garden pea Rubisco LSMT, an enzyme that does not modify histones, but instead methylates lysine 14 in the flexible tail of the large subunit of the enzyme Rubisco. The SET domain itself turned out to be an uncommon structure. Although in all three studies, electron density maps revealed the location of the AdoMet or AdoHcy cofactor, the SET domain bears no similarity at all to the canonical/AdoMet-dependent methyltransferase fold. Strictly conserved in the C-terminal motif of the SET domain tyrosine could be involved in abstracting a proton from the protonated amino group of the substrate lysine, promoting its nucleophilic attack on the sulphonium methyl group of the AdoMet cofactor. In contrast to the AdoMet-dependent protein methyltranferases of the classical type, which tend to bind their polypeptide substrates on top of the cofactor, it is noted from the Rubisco LSMT structure that the AdoMet seems to bind in a separate cleft, suggesting how a polypeptide substrate could be subjected to multiple rounds of methylation without having to be released from the enzyme. In contrast, SET7/9 is able to add only a single methyl group to its substrate. It has been demonstrated that association of SET domain and myotubularin-related proteins modulates growth control []. The SET domain-containing Drosophila melanogaster (Fruit fly) protein, enhancer of zeste, has a function in segment determination and the mammalian homologue may be involved in the regulation of gene transcription and chromatin structure. Histone lysine methylation is part of the histone code that regulated chromatin function and epigenetic control of gene function. Histone lysine methyltransferases (HMTase) differ both in their substrate specificity for the various acceptor lysines as well as in their product specificity for the number of methyl groups (one, two, or three) they transfer. With just one exception [], the HMTases belong to SET family that can be classified according to the sequences surrounding the SET domain [, ]. Structural studies on the human SET7/9, a mono-methylase, have revealed the molecular basis for the specificity of the enzyme for the histone-target and the roles of the invariant residues in the SET domain in determining the methylation specificities [].  The pre-SET domain, as found in the SUV39 SET family, contains nine invariant cysteine residues that are grouped into two segments separated by a region of variable length. These 9 cysteines coordinate 3 zinc ions to form to form a triangular cluster, where each of the zinc ions is coordinated by 4 four cysteines to give a tetrahedral configuration. The function of this domain is structural, holding together 2 long segments of random coils. The C-terminal region including the post-SET domain is disordered when not interacting with a histone tail and in the absence of zinc. The three conserved cysteines in the post-SET domain form a zinc-binding site when coupled to a fourth conserved cysteine in the knot-like structure close to the SET domain active site []. The structured post-SET region brings in the C-terminal residues that participate in S-adenosylmethine-binding and histone tail interactions. The three conserved cysteine residues are essential for HMTase activity, as replacement with serine abolishes HMTase activity [], []. ; GO: 0005515 protein binding; PDB: 3TG5_A 3S7F_A 3RIB_B 3TG4_A 3S7J_A 3S7D_A 3S7B_A 3H6L_A 3SMT_A 3K5K_A ....
Probab=99.42  E-value=2.4e-13  Score=133.29  Aligned_cols=108  Identities=26%  Similarity=0.358  Sum_probs=74.8

Q ss_pred             CeeEEeeecCCCCCEEEEecCEeeChhhHhhh-------------------H---------H-----------Hhhhc--
Q 002995          707 GFGIFAKHPHRAGDMVIEYTGELVRPSIADRR-------------------E---------H-----------FIYNS--  745 (860)
Q Consensus       707 G~GvfA~~~i~~g~~I~ey~Gevi~~~~~~~r-------------------~---------~-----------~~~~~--  745 (860)
                      |+||||+++|++|++|++..+.+++.......                   .         .           .....  
T Consensus         1 GrGl~At~dI~~Ge~I~~p~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~   80 (162)
T PF00856_consen    1 GRGLFATRDIKAGEVILIPRPAILTPDEVSPQPELLRLQLSKALEEQSRSDFSIQKKQKAEKSERSPQLESLHSISLRSE   80 (162)
T ss_dssp             SEEEEESS-B-TTEEEEEESEEEEEHHHHHCHHHHSHHTTCSSSCSHHTTHHHHHHHHHHHHHHHHHHHHHHHHHCHTTT
T ss_pred             CEEEEECccCCCCCEEEEECcceEEehhhhhcccchhhhhhhhhcccccccccccccccccccccccccccccccccccc
Confidence            89999999999999999999999886665431                   0         0           00000  


Q ss_pred             c-------------cCCcceeeeecCcceeeccccCCccccCCCCCCCCeeEEEEEECCeeEEEEEEccCCCCCCeEEEe
Q 002995          746 L-------------VGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYD  812 (860)
Q Consensus       746 ~-------------~~~~~y~~~~~~~~~iDa~~~gn~aRfiNHSC~PN~~~~~~~~~~~~~i~i~A~RdI~~GEELt~d  812 (860)
                      .             ................++....+++.|+||||.|||.+..........+.|+|.|+|++|||||++
T Consensus        81 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~p~~d~~NHsc~pn~~~~~~~~~~~~~~~~~a~r~I~~GeEi~is  160 (162)
T PF00856_consen   81 LQFSQAFQWSWFISWTRSDFSSRSFSEDDRDGIALYPFADMLNHSCDPNCEVSFDFDGDGGCLVVRATRDIKKGEEIFIS  160 (162)
T ss_dssp             CCTCCHHHHHHHHHHHHHEEEEEEETTEEEEEEEEETGGGGSEEESSTSEEEEEEEETTTTEEEEEESS-B-TTSBEEEE
T ss_pred             ccccccccchhhccccceeeeccccccccccccccCcHhHheccccccccceeeEeecccceEEEEECCccCCCCEEEEE
Confidence            0             000011112223334566777889999999999999998877777899999999999999999999


Q ss_pred             cC
Q 002995          813 YR  814 (860)
Q Consensus       813 Y~  814 (860)
                      ||
T Consensus       161 YG  162 (162)
T PF00856_consen  161 YG  162 (162)
T ss_dssp             ST
T ss_pred             EC
Confidence            97


No 34 
>smart00541 FYRN "FY-rich" domain, N-terminal region. is sometimes closely juxtaposed with the C-terminal region (FYRC), but sometimes is far distant. Unknown function, but occurs frequently in chromatin-associated proteins.
Probab=99.31  E-value=2e-12  Score=97.54  Aligned_cols=41  Identities=27%  Similarity=0.553  Sum_probs=38.1

Q ss_pred             eeeCCCCCCCCCcccCCCeEEEEEeccCCCCCCcceEEEEE
Q 002995          241 IVKDSEYFQDDRFIWPEGYTAVRKFTSLADPRVCNSYKMEV  281 (860)
Q Consensus       241 iv~~~~~fh~~~~i~P~Gy~s~r~~~S~~dp~~~~~y~~~I  281 (860)
                      ++.++++||++++|||+||+++|.|+|+.||.++|.|+|.|
T Consensus         2 ~~~~~~~fh~~~~IyP~Gy~s~R~y~S~~dp~~~c~Y~c~i   42 (44)
T smart00541        2 LPIQGKLFHSEDAIFPVGYKSTRKYWSVKDPNRRCNYSCVI   42 (44)
T ss_pred             ccccCCCcccCCEEecCCEEEEEEEecccCCCCEEEEEEEE
Confidence            56788999999999999999999999999999999996655


No 35 
>PF13831 PHD_2:  PHD-finger; PDB: 2L43_A 2KU3_A.
Probab=99.07  E-value=2.5e-11  Score=87.57  Aligned_cols=36  Identities=44%  Similarity=1.013  Sum_probs=21.9

Q ss_pred             CCeeEEccccCceeccccccccccCCCCceeeeccc
Q 002995          410 NNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCR  445 (860)
Q Consensus       410 ~n~ll~C~~C~~~vH~~CYg~~~~~~~~~W~C~~C~  445 (860)
                      +|+||+|++|++.||+.|||+..++++..|+|++|+
T Consensus         1 ~n~ll~C~~C~v~VH~~CYGv~~~~~~~~W~C~~C~   36 (36)
T PF13831_consen    1 TNPLLFCDNCNVAVHQSCYGVSEVPDGDDWLCDRCE   36 (36)
T ss_dssp             -CEEEE-SSS--EEEHHHHT-SS--SS-----HHH-
T ss_pred             CCceEEeCCCCCcCChhhCCcccCCCCCcEECCcCC
Confidence            478999999999999999999988888789999995


No 36 
>KOG1081 consensus Transcription factor NSD1 and related SET domain proteins [Transcription]
Probab=98.82  E-value=1.1e-09  Score=124.98  Aligned_cols=127  Identities=30%  Similarity=0.481  Sum_probs=106.4

Q ss_pred             EeeecCCCCCEEEEecCEeeChhhHhhhHHHhhhcccCCcceeeeecCcceeeccccCCccccCCCCCCCCeeEEEEEEC
Q 002995          711 FAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYSRVISVN  790 (860)
Q Consensus       711 fA~~~i~~g~~I~ey~Gevi~~~~~~~r~~~~~~~~~~~~~y~~~~~~~~~iDa~~~gn~aRfiNHSC~PN~~~~~~~~~  790 (860)
                      +|..+|.+|      +|+++...+...+..+.-.. .....|+..+..+..||+...||+.||+||||+|||....+.+.
T Consensus       319 ~~~~~~~k~------vg~~i~~~e~~~~~~~~~~~-~~~~~~~~~~e~~~~id~~~~~n~sr~~nh~~~~~v~~~k~~~~  391 (463)
T KOG1081|consen  319 TAKADIRKG------VGEVIDDKECKARLQRVKES-DLVDFYMVFIQKDRIIDAGPKGNYSRFLNHSCQPNVETEKWQVI  391 (463)
T ss_pred             hhHHhhhcc------cCcccchhhheeehhhhhcc-chhhhhhhhhhcccccccccccchhhhhcccCCCceeechhhee
Confidence            888888888      89999888877776643322 23345555555555999999999999999999999999988899


Q ss_pred             CeeEEEEEEccCCCCCCeEEEecCCCCCCCCeEEeeCCCCCccccCCChHHHHH
Q 002995          791 GDEHIIIFAKRDIKQWEELTYDYRFFSIDEQLACYCGFPRCRGVVNDTEAEEQV  844 (860)
Q Consensus       791 ~~~~i~i~A~RdI~~GEELt~dY~~~~~~~~~~C~Cg~~~Cr~~~~~~~~~~~~  844 (860)
                      +..++.++|.+.|.+|||||++|+.......+.|.|++.+|.++++....+...
T Consensus       392 ~~t~~~~~a~~~i~~g~e~t~~~n~~~~~~~~~~~~~~e~~~~~~~k~~~~~~~  445 (463)
T KOG1081|consen  392 GDTRVGLFAPRQIEAGEELTFNYNGNCEGNEKRCCCGSENCTETKGKKKKKKKQ  445 (463)
T ss_pred             cccccccccccccccchhhhheeeccccCCcceEeecccccccCCcccccchhh
Confidence            999999999999999999999999887778899999999999999988774443


No 37 
>KOG2589 consensus Histone tail methylase [Chromatin structure and dynamics]
Probab=98.78  E-value=5.8e-09  Score=110.21  Aligned_cols=115  Identities=24%  Similarity=0.260  Sum_probs=83.4

Q ss_pred             ccCeeEEeeecCCCCCEEEEecCEeeChhhHhhhHHHhhhcccCCcceeeeecCcceeeccccCCccccCCCCCCCCeeE
Q 002995          705 IHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRIDDERVIDATRAGSIAHLINHSCEPNCYS  784 (860)
Q Consensus       705 ~~G~GvfA~~~i~~g~~I~ey~Gevi~~~~~~~r~~~~~~~~~~~~~y~~~~~~~~~iDa~~~gn~aRfiNHSC~PN~~~  784 (860)
                      ..|--|.+++.+.+||=|--.+|-|+.-.+++++....   ..+.+ |...-+ ..--.|...=+.|+||||.|.|||.+
T Consensus       136 ~~gAkivst~~w~~ndkIe~LvGcIaeLse~eE~~ll~---~g~nD-FSvmyS-tRk~caqLwLGPaafINHDCrpnCkF  210 (453)
T KOG2589|consen  136 QNGAKIVSTKSWSRNDKIELLVGCIAELSEAEERSLLR---GGGND-FSVMYS-TRKRCAQLWLGPAAFINHDCRPNCKF  210 (453)
T ss_pred             CCCceEEeeccccCCccHHHhhhhhhhcChhhhHHHHh---ccCCc-eeeeee-cccchhhheeccHHhhcCCCCCCcee
Confidence            45778999999999999999999998888888775321   11221 111111 01112333345799999999999988


Q ss_pred             EEEEECCeeEEEEEEccCCCCCCeEEEecCCCCCC-CCeEEeeC
Q 002995          785 RVISVNGDEHIIIFAKRDIKQWEELTYDYRFFSID-EQLACYCG  827 (860)
Q Consensus       785 ~~~~~~~~~~i~i~A~RdI~~GEELt~dY~~~~~~-~~~~C~Cg  827 (860)
                      ..   .|..++.+.++|||++|||||.-|+..+.+ ....|.|-
T Consensus       211 vs---~g~~tacvkvlRDIePGeEITcFYgs~fFG~~N~~CeC~  251 (453)
T KOG2589|consen  211 VS---TGRDTACVKVLRDIEPGEEITCFYGSGFFGENNEECECV  251 (453)
T ss_pred             ec---CCCceeeeehhhcCCCCceeEEeecccccCCCCceeEEe
Confidence            63   466899999999999999999999877665 45678773


No 38 
>PF00628 PHD:  PHD-finger;  InterPro: IPR019787 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  This entry represents the PHD (homeodomain) zinc finger domain [,], which is a C4HC3 zinc-finger-like motif found in nuclear proteins thought to be involved in chromatin-mediated transcriptional regulation. The PHD finger motif is reminiscent of, but distinct from the C3HC4 type RING finger. The function of this domain is not yet known but in analogy with the LIM domain it could be involved in protein-protein interaction and be important for the assembly or activity of multicomponent complexes involved in transcriptional activation or repression. Alternatively, the interactions could be intra-molecular and be important in maintaining the structural integrity of the protein. In similarity to the RING finger and the LIM domain, the PHD finger is thought to bind two zinc ions. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0005515 protein binding; PDB: 3ZVY_A 2LGG_A 3SOW_A 3SOU_B 3ASL_A 3ASK_A 3ZVZ_B 3T6R_A 2LGK_A 3SOX_B ....
Probab=97.60  E-value=1.6e-05  Score=62.66  Aligned_cols=45  Identities=36%  Similarity=0.842  Sum_probs=36.2

Q ss_pred             ccccccCCcccCCCeeEEccccCceecccccccccc----CCCCceeeeccc
Q 002995          398 KCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEP----VNGVLWLCNLCR  445 (860)
Q Consensus       398 ~C~vC~~~~~~~~n~ll~C~~C~~~vH~~CYg~~~~----~~~~~W~C~~C~  445 (860)
                      +|.||..  ..+.+.+|+|+.|+..+|..|++....    +.+ .|+|+.|.
T Consensus         1 ~C~vC~~--~~~~~~~i~C~~C~~~~H~~C~~~~~~~~~~~~~-~w~C~~C~   49 (51)
T PF00628_consen    1 YCPVCGQ--SDDDGDMIQCDSCNRWYHQECVGPPEKAEEIPSG-DWYCPNCR   49 (51)
T ss_dssp             EBTTTTS--SCTTSSEEEBSTTSCEEETTTSTSSHSHHSHHSS-SBSSHHHH
T ss_pred             eCcCCCC--cCCCCCeEEcCCCChhhCcccCCCChhhccCCCC-cEECcCCc
Confidence            4889998  344567999999999999999997533    223 89999996


No 39 
>KOG4323 consensus Polycomb-like PHD Zn-finger protein [General function prediction only]
Probab=97.56  E-value=6.3e-05  Score=84.30  Aligned_cols=117  Identities=24%  Similarity=0.345  Sum_probs=79.7

Q ss_pred             CCccccccCCcccCCCeeEEccccCceeccccccccccCCCCceeeecccCCCCCCCCCccccCCCCCCceecCCCceec
Q 002995          396 LDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLCPVVGGAMKPTTDGRWAH  475 (860)
Q Consensus       396 ~~~C~vC~~~~~~~~n~ll~C~~C~~~vH~~CYg~~~~~~~~~W~C~~C~~~~~~~~~~C~lC~~~gGalk~t~~g~wvH  475 (860)
                      ...|.||........|.+++|++|+...||.|--... +.-+.|.+..|.....         .+.||++|   .|.-+|
T Consensus        83 e~~~nv~~s~~~~p~~e~~~~~r~~~~~~q~~~i~~~-~~~~~~~~~~c~~~~~---------~~~g~a~K---~g~~a~  149 (464)
T KOG4323|consen   83 ELNPNVLTSETVLPENEKVICGRCKSGYHQGCNIPRF-PSLDIGESTECVFPIF---------SQEGGALK---KGRLAR  149 (464)
T ss_pred             ccCCcccccccccCchhhhhhhhhccCcccccCccCc-CcCCcccccccccccc---------cccccccc---cccccc
Confidence            4689999998877889999999999999999975433 3334788888887544         46788888   455565


Q ss_pred             cccccccCceEEccCCCccccccchhhhccccCccceecCC----cceeeEEcCCCCcCccccchhhhhc
Q 002995          476 LACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGV----SYGACIQCSNTTCRVAYHPLCARAA  541 (860)
Q Consensus       476 ~~Ca~~~p~~~~~~~~~~~~v~~i~~i~~~~~~~~C~~C~~----~~Ga~iqC~~~~C~~~fH~~CA~~~  541 (860)
                             |-+.+..       .+++.-...+..+.|++|..    ..-.+|||.  +|.++||-.|-+-.
T Consensus       150 -------~~l~y~~-------~~l~wD~~~~~n~qc~vC~~g~~~~~NrmlqC~--~C~~~fHq~Chqp~  203 (464)
T KOG4323|consen  150 -------PSLPYPE-------ASLDWDSGHKVNLQCSVCYCGGPGAGNRMLQCD--KCRQWYHQACHQPL  203 (464)
T ss_pred             -------ccccCcc-------cccccCccccccceeeeeecCCcCccceeeeec--ccccHHHHHhccCC
Confidence                   2111111       11111111222345999974    123899998  99999999997543


No 40 
>smart00249 PHD PHD zinc finger. The plant homeodomain (PHD) finger is a C4HC3 zinc-finger-like motif found in nuclear proteins thought to be involved in epigenetics and chromatin-mediated transcriptional regulation. The PHD finger binds two zinc ions using the so-called 'cross-brace' motif and is thus structurally related to the PF09465 LBR_tudor:  Lamin-B receptor of TUDOR domain;  InterPro: IPR019023  The Lamin-B receptor is a chromatin and lamin binding protein in the inner nuclear membrane. It is one of the integral inner nuclear envelope membrane proteins responsible for targeting nuclear membranes to chromatin, being a downstream effector of Ran, a small Ras-like nuclear GTPase which regulates NE assembly. Lamin-B receptor interacts with importin beta, a Ran-binding protein, thereby directly contributing to the fusion of membrane vesicles and the formation of the nuclear envelope []. ; PDB: 2L8D_A 2DIG_A.
Probab=97.47  E-value=0.0001  Score=57.61  Aligned_cols=32  Identities=44%  Similarity=0.634  Sum_probs=27.1

Q ss_pred             CCceeeeeeeecCCCCceEEEecCCCcccccc
Q 002995            2 QDWYSGFVVGYDSESNRHHVKYVDGDEEDLIL   33 (860)
Q Consensus         2 ~~~~~g~v~~~~~~~~~h~~~y~d~d~e~~~l   33 (860)
                      ..||.|.|.+||+.+++|.|.|+||++.++..
T Consensus        20 ~lYYe~kV~~~d~~~~~y~V~Y~DGtel~lke   51 (55)
T PF09465_consen   20 SLYYEGKVLSYDSKSDRYTVLYEDGTELELKE   51 (55)
T ss_dssp             S-EEEEEEEEEETTTTEEEEEETTS-EEEEEC
T ss_pred             CcEEEEEEEEecccCceEEEEEcCCCEEEecc
Confidence            46999999999999999999999999966554


No 42 
>KOG2461 consensus Transcription factor BLIMP-1/PRDI-BF1, contains C2H2-type Zn-finger and SET domains [Transcription]
Probab=97.37  E-value=0.00017  Score=81.00  Aligned_cols=110  Identities=22%  Similarity=0.273  Sum_probs=81.6

Q ss_pred             cceeEEEEEeccc--CeeEEeeecCCCCCEEEEecCEeeChhhHhhhHHHhhhcccCCcceeeeec----Ccceeeccc-
Q 002995          694 FRKRLAFGKSGIH--GFGIFAKHPHRAGDMVIEYTGELVRPSIADRREHFIYNSLVGAGTYMFRID----DERVIDATR-  766 (860)
Q Consensus       694 ~~~~l~v~~s~~~--G~GvfA~~~i~~g~~I~ey~Gevi~~~~~~~r~~~~~~~~~~~~~y~~~~~----~~~~iDa~~-  766 (860)
                      ....|.|..|.+-  |.||++...|.+|+--+=|.|+++...    ...      .....|++.+-    .-.+||++. 
T Consensus        26 LP~~l~i~~Ssv~~~~lgV~s~~~i~~G~~FGP~~G~~~~~~----~~~------~~n~~y~W~I~~~d~~~~~iDg~d~   95 (396)
T KOG2461|consen   26 LPPELRIKPSSVPVTGLGVWSNASILPGTSFGPFEGEIIASI----DSK------SANNRYMWEIFSSDNGYEYIDGTDE   95 (396)
T ss_pred             CCCceEeeccccCCccccccccccccCcccccCccCcccccc----ccc------cccCcceEEEEeCCCceEEeccCCh
Confidence            4566788887654  789999999999999999999982111    111      12234555543    235899875 


Q ss_pred             -cCCccccCCCCCCC---CeeEEEEEECCeeEEEEEEccCCCCCCeEEEecCCCC
Q 002995          767 -AGSIAHLINHSCEP---NCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRFFS  817 (860)
Q Consensus       767 -~gn~aRfiNHSC~P---N~~~~~~~~~~~~~i~i~A~RdI~~GEELt~dY~~~~  817 (860)
                       ..|+.||+|=.++.   |..+.    .....|.++|+|+|.+||||-+.|+-++
T Consensus        96 ~~sNWmRYV~~Ar~~eeQNL~A~----Q~~~~Ifyrt~r~I~p~eELlVWY~~e~  146 (396)
T KOG2461|consen   96 EHSNWMRYVNSARSEEEQNLLAF----QIGENIFYRTIRDIRPNEELLVWYGSEY  146 (396)
T ss_pred             hhcceeeeecccCChhhhhHHHH----hccCceEEEecccCCCCCeEEEEeccch
Confidence             88999999988865   75543    2456899999999999999999997654


No 43 
>KOG1244 consensus Predicted transcription factor Requiem/NEURO-D4 [Transcription]
Probab=97.09  E-value=0.0002  Score=73.46  Aligned_cols=49  Identities=35%  Similarity=0.868  Sum_probs=40.1

Q ss_pred             CCCccccccCCcccCCCeeEEccccCceeccccccc--cccCCCCceeeecccC
Q 002995          395 DLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGE--LEPVNGVLWLCNLCRP  446 (860)
Q Consensus       395 ~~~~C~vC~~~~~~~~n~ll~C~~C~~~vH~~CYg~--~~~~~~~~W~C~~C~~  446 (860)
                      +--.|+||+..+  ++++||+||.|+.++|++|+..  ..+|+| +|-|..|..
T Consensus       280 eck~csicgtse--nddqllfcddcdrgyhmyclsppm~eppeg-swsc~KOG~  330 (336)
T KOG1244|consen  280 ECKYCSICGTSE--NDDQLLFCDDCDRGYHMYCLSPPMVEPPEG-SWSCHLCLE  330 (336)
T ss_pred             ecceeccccCcC--CCceeEeecccCCceeeEecCCCcCCCCCC-chhHHHHHH
Confidence            445899998754  4578999999999999999984  456777 999999964


No 44 
>KOG4323 consensus Polycomb-like PHD Zn-finger protein [General function prediction only]
Probab=96.95  E-value=0.00045  Score=77.58  Aligned_cols=53  Identities=36%  Similarity=0.698  Sum_probs=42.4

Q ss_pred             CCccccccCCcccCCCeeEEccccCceecccccccc------ccCCCCceeeecccCCCC
Q 002995          396 LDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGEL------EPVNGVLWLCNLCRPGAP  449 (860)
Q Consensus       396 ~~~C~vC~~~~~~~~n~ll~C~~C~~~vH~~CYg~~------~~~~~~~W~C~~C~~~~~  449 (860)
                      +..|+||........|.||+|++|+-.+|+.|.--.      ..+ ...|+|..|..+..
T Consensus       168 n~qc~vC~~g~~~~~NrmlqC~~C~~~fHq~Chqp~i~~~l~~D~-~~~w~C~~C~~~~~  226 (464)
T KOG4323|consen  168 NLQCSVCYCGGPGAGNRMLQCDKCRQWYHQACHQPLIKDELAGDP-FYEWFCDVCNRGPK  226 (464)
T ss_pred             cceeeeeecCCcCccceeeeecccccHHHHHhccCCCCHhhccCc-cceEeehhhccchh
Confidence            457999998888888899999999999999998522      122 23799999997643


No 45 
>KOG1512 consensus PHD Zn-finger protein [General function prediction only]
Probab=96.81  E-value=0.00041  Score=71.54  Aligned_cols=49  Identities=24%  Similarity=0.529  Sum_probs=41.2

Q ss_pred             CCCccccccCCcccCCCeeEEccccCceeccccccccccCCCCceeee--cccC
Q 002995          395 DLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCN--LCRP  446 (860)
Q Consensus       395 ~~~~C~vC~~~~~~~~n~ll~C~~C~~~vH~~CYg~~~~~~~~~W~C~--~C~~  446 (860)
                      +-..|.||+...-+  +++++||.|+.++|..|.|....|.| .|+|+  .|+.
T Consensus       313 ~C~lC~IC~~P~~E--~E~~FCD~CDRG~HT~CVGL~~lP~G-~WICD~~C~~~  363 (381)
T KOG1512|consen  313 SCELCRICLGPVIE--SEHLFCDVCDRGPHTLCVGLQDLPRG-EWICDMRCREA  363 (381)
T ss_pred             ccHhhhccCCcccc--hheeccccccCCCCccccccccccCc-cchhhhHHHHh
Confidence            45689999987654  44999999999999999999888888 99999  5543


No 46 
>KOG1081 consensus Transcription factor NSD1 and related SET domain proteins [Transcription]
Probab=96.78  E-value=0.00062  Score=78.35  Aligned_cols=55  Identities=29%  Similarity=0.655  Sum_probs=46.1

Q ss_pred             CCCCCCCEEEEEecccCCCceEeeCCCCcCccccccccCCCCcEEEEEeCCCCEEEEeCCCcccc
Q 002995           80 QELEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLNKISGGRSIPVQFFGTHDFARINVKQVISF  144 (860)
Q Consensus        80 ~~f~~GdlVwaK~~g~PwWPa~V~~~~~~~~~~~~~~~~~~~~~~V~FFg~~~~awv~~~~l~~f  144 (860)
                      ..++.||+||-++..|+|||++|+.+..++    +.. .  . .+|+||+.  ++|+....+.++
T Consensus       134 ~~~~~~~~vw~~vg~~~~~~c~vc~~~~~~----~~~-~--~-~~~~f~~~--~~~~~~~~~~~~  188 (463)
T KOG1081|consen  134 KKREVGDLVWSKVGEYPWWPCMVCHDPLLP----KGM-K--H-DHVNFFGC--YAWTHEKRVFPY  188 (463)
T ss_pred             ccccceeEEeEEcCcccccccceecCcccc----hhh-c--c-ccceeccc--hhhHHHhhhhhc
Confidence            368999999999999999999999887665    111 1  1 18999999  999999999999


No 47 
>KOG1084 consensus Transcription factor TCF20 [Transcription]
Probab=96.50  E-value=0.00087  Score=75.03  Aligned_cols=85  Identities=26%  Similarity=0.440  Sum_probs=65.0

Q ss_pred             CCCccccCCCCCCceec-CCCceeccccccccCceEEccCCCccccccchhhhccccCccceecCCcceeeEEcCCCCcC
Q 002995          452 PPPCCLCPVVGGAMKPT-TDGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCR  530 (860)
Q Consensus       452 ~~~C~lC~~~gGalk~t-~~g~wvH~~Ca~~~p~~~~~~~~~~~~v~~i~~i~~~~~~~~C~~C~~~~Ga~iqC~~~~C~  530 (860)
                      ...|+|++...   .+. ....|+|+.|++|.|.+.+.....+..+.   ........+.|..|.+ .|+.+.|...+|.
T Consensus       221 e~~~~l~~~~~---~~d~~~~~~~h~~c~~~~~~~~~~q~~~l~~~~---~~v~r~~~~~c~~c~k-~ga~~~c~~~~~~  293 (375)
T KOG1084|consen  221 EFFCALSPKAT---IPDIGFELWYHRYCALWAPNVHESQGGQLTNVD---NAVIRFPSLQCILCQK-PGATLKCVQASLL  293 (375)
T ss_pred             hhhhhhcCCCc---CCccchhHHHHHHHHhcCCcceeccCccccCch---hhhhcccchhcccccC-CCCchhhhhhhhh
Confidence            34788886443   333 45689999999999999988876655444   3222333379999999 8999999999999


Q ss_pred             ccccchhhhhcCe
Q 002995          531 VAYHPLCARAAGL  543 (860)
Q Consensus       531 ~~fH~~CA~~~g~  543 (860)
                      ..+|.+|+..+-.
T Consensus       294 ~~~h~~c~~~~~~  306 (375)
T KOG1084|consen  294 SNAHFPCARAKNG  306 (375)
T ss_pred             cccCcccccCccc
Confidence            9999999987755


No 48 
>smart00508 PostSET Cysteine-rich motif following a subset of SET domains.
Probab=96.02  E-value=0.0036  Score=41.49  Aligned_cols=19  Identities=37%  Similarity=1.015  Sum_probs=16.3

Q ss_pred             CeEEeeCCCCCccccCCCh
Q 002995          821 QLACYCGFPRCRGVVNDTE  839 (860)
Q Consensus       821 ~~~C~Cg~~~Cr~~~~~~~  839 (860)
                      .++|+||+++|||+|+...
T Consensus         2 ~~~C~CGs~~CRG~l~~~~   20 (26)
T smart00508        2 KQPCLCGAPNCRGFLGXXX   20 (26)
T ss_pred             CeeeeCCCccccceecccc
Confidence            5799999999999996643


No 49 
>PF15057 DUF4537:  Domain of unknown function (DUF4537)
Probab=95.63  E-value=0.034  Score=52.52  Aligned_cols=100  Identities=22%  Similarity=0.335  Sum_probs=66.4

Q ss_pred             ceeeeeeeecCCCCceEEEecCCCccccccccceEEEEEccchhhhhhcccCCCCCCCCCCChhhHHHhhcCccCCCCCC
Q 002995            4 WYSGFVVGYDSESNRHHVKYVDGDEEDLILSNERIKFYISQEEMDCLKLSFSINNVDNDGYDYDEMVVLAASLDDCQELE   83 (860)
Q Consensus         4 ~~~g~v~~~~~~~~~h~~~y~d~d~e~~~l~~e~i~~~~s~~e~~~l~~~~~~~~~~~~~~~~de~~~~~~~~~~~~~f~   83 (860)
                      +|.|.|++.- ....-.|.+++++.+.+...+                                 +..+..  .-...++
T Consensus        14 YY~GtV~~~~-~~~~~lV~f~~~~~~~v~~~~---------------------------------iI~~~~--~~~~~L~   57 (124)
T PF15057_consen   14 YYPGTVKKCV-SSGQFLVEFDDGDTQEVPISD---------------------------------IIALSD--AMRHSLQ   57 (124)
T ss_pred             EEeEEEEEcc-CCCEEEEEECCCCEEEeChHH---------------------------------eEEccC--cccCcCC
Confidence            6889999887 666777778777766654411                                 111111  1156789


Q ss_pred             CCCEEEEEec--ccCCCceEeeCCCCcCccccccccCCCCcEEEEEeCCCCEEEEeCCCccccccc
Q 002995           84 PGDIIWAKLT--GHAMWPAIVVDESLIGDYKGLNKISGGRSIPVQFFGTHDFARINVKQVISFLKG  147 (860)
Q Consensus        84 ~GdlVwaK~~--g~PwWPa~V~~~~~~~~~~~~~~~~~~~~~~V~FFg~~~~awv~~~~l~~f~~~  147 (860)
                      +||-|+|+..  ++.|=||+|..-.+..       ......|.|.||.. ..+.++...+....+.
T Consensus        58 ~GD~VLA~~~~~~~~Y~Pg~V~~~~~~~-------~~~~~~~~V~f~ng-~~~~vp~~~~~~I~~~  115 (124)
T PF15057_consen   58 VGDKVLAPWEPDDCRYGPGTVIAGPERR-------ASEDKEYTVRFYNG-KTAKVPRGEVIWISPS  115 (124)
T ss_pred             CCCEEEEecCcCCCEEeCEEEEECcccc-------ccCCceEEEEEECC-CCCccchhhEEECCHH
Confidence            9999999953  7778899998632211       34556799999954 4677777776665543


No 50 
>PF15446 zf-PHD-like:  PHD/FYVE-zinc-finger like domain
Probab=95.41  E-value=0.015  Score=56.44  Aligned_cols=70  Identities=21%  Similarity=0.485  Sum_probs=48.0

Q ss_pred             ccccccC-CcccCCCeeEEccccCceeccccccccc--------cCCC-CceeeecccCCCC------CCCCCccccCCC
Q 002995          398 KCSVCHM-DEEYQNNLFLQCDKCRMMVHARCYGELE--------PVNG-VLWLCNLCRPGAP------EPPPPCCLCPVV  461 (860)
Q Consensus       398 ~C~vC~~-~~~~~~n~ll~C~~C~~~vH~~CYg~~~--------~~~~-~~W~C~~C~~~~~------~~~~~C~lC~~~  461 (860)
                      .|.+|.. +.+.+..+||+|.+|..++|+.|.|...        +-++ ----|.+|.....      ...-.|.-|...
T Consensus         1 ~C~~C~~~g~~~~kG~Lv~CQGCs~sYHk~CLG~Rs~ReHlVTKVg~d~FVLQCr~Cig~~~kKD~~aP~~~~C~~C~~~   80 (175)
T PF15446_consen    1 TCDTCGYEGDDRNKGPLVYCQGCSSSYHKACLGPRSQREHLVTKVGDDDFVLQCRRCIGIAHKKDPRAPHHGMCQQCKKP   80 (175)
T ss_pred             CcccccCCCCCccCCCeEEcCccChHHHhhhcCCccccceeeEEEcCCceEEechhhcChhhcccCCCCCCCcccccCCC
Confidence            5999976 4566677899999999999999999432        2221 1355999975321      123469999887


Q ss_pred             CCCcee
Q 002995          462 GGAMKP  467 (860)
Q Consensus       462 gGalk~  467 (860)
                      |-+-++
T Consensus        81 G~~c~p   86 (175)
T PF15446_consen   81 GPSCKP   86 (175)
T ss_pred             CCCCcc
Confidence            755443


No 51 
>KOG0825 consensus PHD Zn-finger protein [General function prediction only]
Probab=95.11  E-value=0.0095  Score=69.49  Aligned_cols=49  Identities=27%  Similarity=0.713  Sum_probs=39.1

Q ss_pred             CCCccccccCCcccCCCeeEEccccCce-ecccccccc--ccCCCCceeeecccC
Q 002995          395 DLDKCSVCHMDEEYQNNLFLQCDKCRMM-VHARCYGEL--EPVNGVLWLCNLCRP  446 (860)
Q Consensus       395 ~~~~C~vC~~~~~~~~n~ll~C~~C~~~-vH~~CYg~~--~~~~~~~W~C~~C~~  446 (860)
                      +...|.||...+.+  +.||.||.|+.+ +|.+|+...  .++-+ .|+|+-|..
T Consensus       214 E~~~C~IC~~~DpE--dVLLLCDsCN~~~YH~YCLDPdl~eiP~~-eWYC~NC~d  265 (1134)
T KOG0825|consen  214 EEVKCDICTVHDPE--DVLLLCDSCNKVYYHVYCLDPDLSESPVN-EWYCTNCSL  265 (1134)
T ss_pred             ccccceeeccCChH--HhheeecccccceeeccccCccccccccc-ceecCcchh
Confidence            34689999987654  569999999999 999999753  24555 899999965


No 52 
>KOG4299 consensus PHD Zn-finger protein [General function prediction only]
Probab=93.79  E-value=0.021  Score=66.04  Aligned_cols=48  Identities=29%  Similarity=0.684  Sum_probs=40.0

Q ss_pred             CccccccCCcccCCCeeEEccccCceecccccccc----ccCCCCceeeecccCC
Q 002995          397 DKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGEL----EPVNGVLWLCNLCRPG  447 (860)
Q Consensus       397 ~~C~vC~~~~~~~~n~ll~C~~C~~~vH~~CYg~~----~~~~~~~W~C~~C~~~  447 (860)
                      ++|+-|.+.+..  |.||.||.|...|||.|.--+    ..|.| .|+|..|...
T Consensus       254 ~fCsaCn~~~~F--~~~i~CD~Cp~sFH~~CLePPl~~eniP~g-~W~C~ec~~k  305 (613)
T KOG4299|consen  254 DFCSACNGSGLF--NDIICCDGCPRSFHQTCLEPPLEPENIPPG-SWFCPECKIK  305 (613)
T ss_pred             HHHHHhCCcccc--ccceeecCCchHHHHhhcCCCCCcccCCCC-ccccCCCeee
Confidence            599999998776  779999999999999998643    34555 8999999753


No 53 
>PF08169 RBB1NT:  RBB1NT (NUC162) domain;  InterPro: IPR012603 This domain is found N-terminal to the ARID/BRIGHT domain in DNA-binding proteins of the Retinoblastoma-binding protein 1 family [].; PDB: 2YRV_A.
Probab=92.32  E-value=0.24  Score=44.00  Aligned_cols=85  Identities=21%  Similarity=0.289  Sum_probs=48.8

Q ss_pred             CCCEEEEEe--cccCCCceEeeCCCCcCccccccccCCCCcEEEEEeCCCCEEEEeCCCcccccccccccccccCChHHH
Q 002995           84 PGDIIWAKL--TGHAMWPAIVVDESLIGDYKGLNKISGGRSIPVQFFGTHDFARINVKQVISFLKGLLSSFHLKCKKPRF  161 (860)
Q Consensus        84 ~GdlVwaK~--~g~PwWPa~V~~~~~~~~~~~~~~~~~~~~~~V~FFg~~~~awv~~~~l~~f~~~~~~~~~~k~k~~~~  161 (860)
                      +|-+|-...  +.-.|-||.|+.++.-..  .   .-++.+++|+=|-|..|+=|+.+++..|..+     .........
T Consensus         8 lGkVV~V~~~~~k~~W~PALVVsPsc~dd--v---~VkKD~~lVRSFkD~KfysV~rkd~~e~~~~-----~~~k~e~s~   77 (96)
T PF08169_consen    8 LGKVVCVESTKKKTSWFPALVVSPSCNDD--V---TVKKDQCLVRSFKDGKFYSVARKDVREFDID-----SLPKSESSL   77 (96)
T ss_dssp             TTSEEEEE-SS-SS-EEEEEEE--SS-SS---------TT-EEEEESSS--EEEE-TTTEE---ST-----TS-HHHHHH
T ss_pred             cCcEEEEEcCCCCCceeeEEEEcCCccce--e---eeccceEEEEEeccCceEEEEhhhhhhcccc-----cCCcccchh
Confidence            788887764  344699999999754333  1   1346789999999999999999999999864     111223467


Q ss_pred             HHHHHHHHHHHHhCCCc
Q 002995          162 TQSLEEAKVYLSEQKLP  178 (860)
Q Consensus       162 ~~A~~ea~~~~~~~~~~  178 (860)
                      +.|++.|..+++....|
T Consensus        78 k~al~~A~~Fl~~~~vP   94 (96)
T PF08169_consen   78 KPALDKASTFLKTGVVP   94 (96)
T ss_dssp             -HHHHHHHHHHHS----
T ss_pred             hHHHHHHHHHHhcCCCC
Confidence            89999999998887665


No 54 
>KOG1141 consensus Predicted histone methyl transferase [Chromatin structure and dynamics]
Probab=91.95  E-value=0.15  Score=60.32  Aligned_cols=59  Identities=20%  Similarity=0.236  Sum_probs=53.3

Q ss_pred             CChhhhhHHhhhccceeEEEEEecccCeeEEeeecCCCCCEEEEecCEeeChhhHhhhH
Q 002995          681 LSMADKYKHMKETFRKRLAFGKSGIHGFGIFAKHPHRAGDMVIEYTGELVRPSIADRRE  739 (860)
Q Consensus       681 ~~~~~~~~~l~~~~~~~l~v~~s~~~G~GvfA~~~i~~g~~I~ey~Gevi~~~~~~~r~  739 (860)
                      +..-|.|+..|-+.+.+|.+.+...+|||++...+|..|.||+-|.|.+++...++.-+
T Consensus       784 ~~~~C~nrmvqhg~qvRlq~fkt~~kGWg~rclddi~~g~fVciy~g~~l~~~~sdks~  842 (1262)
T KOG1141|consen  784 CGPDCLNRMVQHGYQVRLQRFKTIHKGWGRRCLDDITGGNFVCIYPGGALLHQISDKSE  842 (1262)
T ss_pred             CcHHHHHHHhhcCceeEeeeccccccccceEeeeecCCceEEEEecchhhhhhhchhhh
Confidence            34467899999999999999999999999999999999999999999999888777764


No 55 
>KOG1973 consensus Chromatin remodeling protein, contains PHD Zn-finger [Chromatin structure and dynamics]
Probab=91.66  E-value=0.079  Score=57.25  Aligned_cols=36  Identities=28%  Similarity=0.751  Sum_probs=32.0

Q ss_pred             CeeEEccc--cC-ceeccccccccccCCCCceeeecccCC
Q 002995          411 NLFLQCDK--CR-MMVHARCYGELEPVNGVLWLCNLCRPG  447 (860)
Q Consensus       411 n~ll~C~~--C~-~~vH~~CYg~~~~~~~~~W~C~~C~~~  447 (860)
                      ..||-||+  |. -=||..|.|+..+|.| .|+|..|...
T Consensus       230 g~Mi~CDn~~C~~eWFH~~CVGL~~~Pkg-kWyC~~C~~~  268 (274)
T KOG1973|consen  230 GKMIGCDNPGCPIEWFHFTCVGLKTKPKG-KWYCPRCKAE  268 (274)
T ss_pred             ccccccCCCCCCcceEEEeccccccCCCC-cccchhhhhh
Confidence            34999997  99 8899999999999988 6999999864


No 56 
>PF14446 Prok-RING_1:  Prokaryotic RING finger family 1
Probab=91.47  E-value=0.12  Score=40.76  Aligned_cols=33  Identities=24%  Similarity=0.568  Sum_probs=26.6

Q ss_pred             CCCccccccCCcccCCCeeEEccccCceeccccc
Q 002995          395 DLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCY  428 (860)
Q Consensus       395 ~~~~C~vC~~~~~~~~n~ll~C~~C~~~vH~~CY  428 (860)
                      ...+|.+|+..-.. .+.+|+|..|+...|..||
T Consensus         4 ~~~~C~~Cg~~~~~-~dDiVvCp~CgapyHR~C~   36 (54)
T PF14446_consen    4 EGCKCPVCGKKFKD-GDDIVVCPECGAPYHRDCW   36 (54)
T ss_pred             cCccChhhCCcccC-CCCEEECCCCCCcccHHHH
Confidence            34689999986532 2349999999999999999


No 57 
>COG5034 TNG2 Chromatin remodeling protein, contains PhD zinc finger [Chromatin structure and dynamics]
Probab=91.09  E-value=0.13  Score=53.11  Aligned_cols=54  Identities=26%  Similarity=0.606  Sum_probs=40.0

Q ss_pred             CccccccCCCccccccCCcccCCCeeEEcc--ccCc-eeccccccccccCCCCceeeecccC
Q 002995          388 PVRVDWKDLDKCSVCHMDEEYQNNLFLQCD--KCRM-MVHARCYGELEPVNGVLWLCNLCRP  446 (860)
Q Consensus       388 ~~~~~~~~~~~C~vC~~~~~~~~n~ll~C~--~C~~-~vH~~CYg~~~~~~~~~W~C~~C~~  446 (860)
                      ++....++...|= |...-.   .+||-||  +|.. =||..|.|+..+|.| .|+|+.|+.
T Consensus       213 s~d~se~e~lYCf-CqqvSy---GqMVaCDn~nCkrEWFH~~CVGLk~pPKG-~WYC~eCk~  269 (271)
T COG5034         213 SEDNSEGEELYCF-CQQVSY---GQMVACDNANCKREWFHLECVGLKEPPKG-KWYCPECKK  269 (271)
T ss_pred             ccccccCceeEEE-eccccc---ccceecCCCCCchhheeccccccCCCCCC-cEeCHHhHh
Confidence            3343444555664 776532   3499999  6876 499999999999998 999999974


No 58 
>PF11717 Tudor-knot:  RNA binding activity-knot of a chromodomain ; PDB: 2EKO_A 2RO0_A 2RNZ_A 1WGS_A 3E9G_A 3E9F_A 2K3X_A 2K3Y_A 2EFI_A 2F5K_F ....
Probab=90.85  E-value=0.28  Score=39.25  Aligned_cols=52  Identities=15%  Similarity=0.209  Sum_probs=40.7

Q ss_pred             CCCCCEEEEEecccCCCceEeeCCCCcCccccccccCCCCcEEEEEeCCCCE--EEEeCCCcc
Q 002995           82 LEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLNKISGGRSIPVQFFGTHDF--ARINVKQVI  142 (860)
Q Consensus        82 f~~GdlVwaK~~g~PwWPa~V~~~~~~~~~~~~~~~~~~~~~~V~FFg~~~~--awv~~~~l~  142 (860)
                      |++|+.|+++.+.--|-+|.|++...         ..+...|.|.|.|-+.+  -||+.++|.
T Consensus         1 ~~vG~~v~~~~~~~~~y~A~I~~~r~---------~~~~~~YyVHY~g~nkR~DeWV~~~~i~   54 (55)
T PF11717_consen    1 FEVGEKVLCKYKDGQWYEAKILDIRE---------KNGEPEYYVHYQGWNKRLDEWVPESRIR   54 (55)
T ss_dssp             --TTEEEEEEETTTEEEEEEEEEEEE---------CTTCEEEEEEETTSTGCC-EEEETTTEE
T ss_pred             CCcCCEEEEEECCCcEEEEEEEEEEe---------cCCCEEEEEEcCCCCCCceeeecHHHcc
Confidence            67999999999777788999987432         23446799999999885  899999875


No 59 
>smart00743 Agenet Tudor-like domain present in plant sequences. Domain in plant sequences with possible chromatin-associated functions.
Probab=90.48  E-value=0.41  Score=39.04  Aligned_cols=53  Identities=13%  Similarity=0.206  Sum_probs=38.5

Q ss_pred             CCCCCCEEEEEe-cccCCCceEeeCCCCcCccccccccCCCCcEEEEEeC--CCCEEEEeCCCccccc
Q 002995           81 ELEPGDIIWAKL-TGHAMWPAIVVDESLIGDYKGLNKISGGRSIPVQFFG--THDFARINVKQVISFL  145 (860)
Q Consensus        81 ~f~~GdlVwaK~-~g~PwWPa~V~~~~~~~~~~~~~~~~~~~~~~V~FFg--~~~~awv~~~~l~~f~  145 (860)
                      .|++||+|-|+. ..-.||+|+|....            +.+.|.|.|.+  ....-=++.++|.|-.
T Consensus         2 ~~~~G~~Ve~~~~~~~~W~~a~V~~~~------------~~~~~~V~~~~~~~~~~e~v~~~~LRp~~   57 (61)
T smart00743        2 DFKKGDRVEVFSKEEDSWWEAVVTKVL------------GDGKYLVRYLTESEPLKETVDWSDLRPHP   57 (61)
T ss_pred             CcCCCCEEEEEECCCCEEEEEEEEEEC------------CCCEEEEEECCCCcccEEEEeHHHcccCC
Confidence            589999999998 45679999997631            13458999988  5555556666666643


No 60 
>smart00249 PHD PHD zinc finger. The plant homeodomain (PHD) finger is a C4HC3 zinc-finger-like motif found in nuclear proteins thought to be involved in epigenetics and chromatin-mediated transcriptional regulation. The PHD finger binds two zinc ions using the so-called 'cross-brace' motif and is thus structurally related to the smart00333 TUDOR Tudor domain. Domain of unknown function present in several RNA-binding proteins. 10 copies in the Drosophila Tudor protein. Initial proposal that the survival motor neuron gene product contain a Tudor domain are corroborated by more recent database search techniques such as PSI-BLAST (unpublished).
Probab=89.34  E-value=0.29  Score=39.13  Aligned_cols=33  Identities=30%  Similarity=0.582  Sum_probs=29.1

Q ss_pred             CCceeeeeeeecCCCCceEEEecC-CCcccccccc
Q 002995            2 QDWYSGFVVGYDSESNRHHVKYVD-GDEEDLILSN   35 (860)
Q Consensus         2 ~~~~~g~v~~~~~~~~~h~~~y~d-~d~e~~~l~~   35 (860)
                      ..||.|.|+++++. +..+|.|.| |..|++...+
T Consensus        16 ~~wyra~I~~~~~~-~~~~V~f~D~G~~~~v~~~~   49 (57)
T smart00333       16 GEWYRARIIKVDGE-QLYEVFFIDYGNEEVVPPSD   49 (57)
T ss_pred             CCEEEEEEEEECCC-CEEEEEEECCCccEEEeHHH
Confidence            36999999999998 889999999 9999887643


No 62 
>smart00333 TUDOR Tudor domain. Domain of unknown function present in several RNA-binding proteins. 10 copies in the Drosophila Tudor protein. Initial proposal that the survival motor neuron gene product contain a Tudor domain are corroborated by more recent database search techniques such as PSI-BLAST (unpublished).
Probab=88.40  E-value=0.69  Score=36.92  Aligned_cols=54  Identities=11%  Similarity=0.110  Sum_probs=44.3

Q ss_pred             CCCCCCEEEEEecccCCCceEeeCCCCcCccccccccCCCCcEEEEEeCCCCEEEEeCCCcccccc
Q 002995           81 ELEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLNKISGGRSIPVQFFGTHDFARINVKQVISFLK  146 (860)
Q Consensus        81 ~f~~GdlVwaK~~g~PwWPa~V~~~~~~~~~~~~~~~~~~~~~~V~FFg~~~~awv~~~~l~~f~~  146 (860)
                      .|++|++|-|+...=-|.+|+|.....            .+.+.|.|.+.....||+..+|.++..
T Consensus         2 ~~~~G~~~~a~~~d~~wyra~I~~~~~------------~~~~~V~f~D~G~~~~v~~~~l~~l~~   55 (57)
T smart00333        2 TFKVGDKVAARWEDGEWYRARIIKVDG------------EQLYEVFFIDYGNEEVVPPSDLRPLPE   55 (57)
T ss_pred             CCCCCCEEEEEeCCCCEEEEEEEEECC------------CCEEEEEEECCCccEEEeHHHeecCCC
Confidence            478999999999666699999987421            146899999989999999999988754


No 63 
>KOG0383 consensus Predicted helicase [General function prediction only]
Probab=87.68  E-value=0.22  Score=59.78  Aligned_cols=50  Identities=32%  Similarity=0.684  Sum_probs=39.6

Q ss_pred             ccccCCCccccccCCcccCCCeeEEccccCceecccccccc--ccCCCCceeeecccC
Q 002995          391 VDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGEL--EPVNGVLWLCNLCRP  446 (860)
Q Consensus       391 ~~~~~~~~C~vC~~~~~~~~n~ll~C~~C~~~vH~~CYg~~--~~~~~~~W~C~~C~~  446 (860)
                      ..|.+...|.||...+.     +|-|+.|-..+|..|.+..  ..+.++ |+|.+|..
T Consensus        42 ~~~~~~e~c~ic~~~g~-----~l~c~tC~~s~h~~cl~~pl~~~p~~~-~~c~Rc~~   93 (696)
T KOG0383|consen   42 WDDAEQEACRICADGGE-----LLWCDTCPASFHASCLGPPLTPQPNGE-FICPRCFC   93 (696)
T ss_pred             cchhhhhhhhhhcCCCc-----EEEeccccHHHHHHccCCCCCcCCccc-eeeeeecc
Confidence            34566789999987643     7889999999999999854  345565 99999954


No 64 
>KOG0957 consensus PHD finger protein [General function prediction only]
Probab=85.02  E-value=0.69  Score=52.00  Aligned_cols=80  Identities=23%  Similarity=0.351  Sum_probs=49.9

Q ss_pred             HHhhcCCCCCCCchhh-hhhcccccCCCCCCCCccccccCCCccccccCCcccCCCeeEEccccCceecccccccc--cc
Q 002995          357 LILGLTKSRPTSKSSL-CKLTSKYRDLPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGEL--EP  433 (860)
Q Consensus       357 li~~Lp~a~~c~k~~~-~k~~s~~~~~p~~~~~~~~~~~~~~~C~vC~~~~~~~~n~ll~C~~C~~~vH~~CYg~~--~~  433 (860)
                      |++.-|+...++++.+ +|..++  ..|...    ....-...|.||....+  ..++++||.|.+.+|..|+.-+  ..
T Consensus       510 l~e~~pks~~p~~~~~~rk~g~~--st~~~s----~~~a~~ysCgiCkks~d--QHll~~CDtC~lhYHlGCL~PPLTR~  581 (707)
T KOG0957|consen  510 LTELGPKSGLPLSDFNNRKSGSR--STPAIS----APKAMNYSCGICKKSTD--QHLLTQCDTCHLHYHLGCLSPPLTRL  581 (707)
T ss_pred             hhhcCCccCCcCccccccccccc--CCcccc----cccccceeeeeeccchh--hHHHhhcchhhceeeccccCCccccC
Confidence            5666677777766433 222211  112111    11233458999987644  4569999999999999999732  33


Q ss_pred             CC---CCceeeecc
Q 002995          434 VN---GVLWLCNLC  444 (860)
Q Consensus       434 ~~---~~~W~C~~C  444 (860)
                      |.   ...|.|..|
T Consensus       582 Pkk~kn~gWqCsEC  595 (707)
T KOG0957|consen  582 PKKNKNFGWQCSEC  595 (707)
T ss_pred             cccccCcceeeccc
Confidence            32   347999999


No 65 
>PF00628 PHD:  PHD-finger;  InterPro: IPR019787 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  This entry represents the PHD (homeodomain) zinc finger domain [,], which is a C4HC3 zinc-finger-like motif found in nuclear proteins thought to be involved in chromatin-mediated transcriptional regulation. The PHD finger motif is reminiscent of, but distinct from the C3HC4 type RING finger. The function of this domain is not yet known but in analogy with the LIM domain it could be involved in protein-protein interaction and be important for the assembly or activity of multicomponent complexes involved in transcriptional activation or repression. Alternatively, the interactions could be intra-molecular and be important in maintaining the structural integrity of the protein. In similarity to the RING finger and the LIM domain, the PHD finger is thought to bind two zinc ions. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0005515 protein binding; PDB: 3ZVY_A 2LGG_A 3SOW_A 3SOU_B 3ASL_A 3ASK_A 3ZVZ_B 3T6R_A 2LGK_A 3SOX_B ....
Probab=84.63  E-value=0.51  Score=36.88  Aligned_cols=29  Identities=38%  Similarity=0.954  Sum_probs=24.2

Q ss_pred             ceecCC--cceeeEEcCCCCcCccccchhhhhc
Q 002995          511 CSICGV--SYGACIQCSNTTCRVAYHPLCARAA  541 (860)
Q Consensus       511 C~~C~~--~~Ga~iqC~~~~C~~~fH~~CA~~~  541 (860)
                      |.+|++  ..+.+|+|.  .|..+||..|....
T Consensus         2 C~vC~~~~~~~~~i~C~--~C~~~~H~~C~~~~   32 (51)
T PF00628_consen    2 CPVCGQSDDDGDMIQCD--SCNRWYHQECVGPP   32 (51)
T ss_dssp             BTTTTSSCTTSSEEEBS--TTSCEEETTTSTSS
T ss_pred             CcCCCCcCCCCCeEEcC--CCChhhCcccCCCC
Confidence            677876  478999998  89999999998544


No 66 
>KOG1245 consensus Chromatin remodeling complex WSTF-ISWI, large subunit (contains heterochromatin localization, PHD and BROMO domains) [Chromatin structure and dynamics]
Probab=84.07  E-value=0.25  Score=64.19  Aligned_cols=49  Identities=29%  Similarity=0.636  Sum_probs=40.4

Q ss_pred             CCccccccCCcccCCCeeEEccccCceeccccccc--cccCCCCceeeecccCC
Q 002995          396 LDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGE--LEPVNGVLWLCNLCRPG  447 (860)
Q Consensus       396 ~~~C~vC~~~~~~~~n~ll~C~~C~~~vH~~CYg~--~~~~~~~~W~C~~C~~~  447 (860)
                      ...|.||......  +.++.|+.|..++|..|...  ..++.+ .|+|..|...
T Consensus      1108 ~~~c~~cr~k~~~--~~m~lc~~c~~~~h~~C~rp~~~~~~~~-dW~C~~c~~e 1158 (1404)
T KOG1245|consen 1108 NALCKVCRRKKQD--EKMLLCDECLSGFHLFCLRPALSSVPPG-DWMCPSCRKE 1158 (1404)
T ss_pred             hhhhhhhhhcccc--hhhhhhHhhhhhHHHHhhhhhhccCCcC-CccCCccchh
Confidence            3589999987654  44999999999999999984  456767 8999999864


No 67 
>smart00743 Agenet Tudor-like domain present in plant sequences. Domain in plant sequences with possible chromatin-associated functions.
Probab=84.06  E-value=0.78  Score=37.34  Aligned_cols=23  Identities=17%  Similarity=0.484  Sum_probs=21.0

Q ss_pred             CCceeeeeeeecCCCCceEEEecC
Q 002995            2 QDWYSGFVVGYDSESNRHHVKYVD   25 (860)
Q Consensus         2 ~~~~~g~v~~~~~~~~~h~~~y~d   25 (860)
                      ..||.|.|++++. .+++.|.|.+
T Consensus        17 ~~W~~a~V~~~~~-~~~~~V~~~~   39 (61)
T smart00743       17 DSWWEAVVTKVLG-DGKYLVRYLT   39 (61)
T ss_pred             CEEEEEEEEEECC-CCEEEEEECC
Confidence            4799999999998 7789999998


No 68 
>KOG1244 consensus Predicted transcription factor Requiem/NEURO-D4 [Transcription]
Probab=83.76  E-value=0.49  Score=49.32  Aligned_cols=75  Identities=21%  Similarity=0.477  Sum_probs=51.9

Q ss_pred             CccccccCCccc-----CCCeeEEccccCceecccccccc----ccCCCCceeeecccCCCCCCCCCccccCCCCCC--c
Q 002995          397 DKCSVCHMDEEY-----QNNLFLQCDKCRMMVHARCYGEL----EPVNGVLWLCNLCRPGAPEPPPPCCLCPVVGGA--M  465 (860)
Q Consensus       397 ~~C~vC~~~~~~-----~~n~ll~C~~C~~~vH~~CYg~~----~~~~~~~W~C~~C~~~~~~~~~~C~lC~~~gGa--l  465 (860)
                      ..|+-|+++...     -...||.|..|+..-|.+|+-..    ..+....|-|..|+.        |.+|+.....  |
T Consensus       225 ~YCDFclgdsr~nkkt~~peelvscsdcgrsghpsclqft~nm~~avk~yrwqcieck~--------csicgtsenddql  296 (336)
T KOG1244|consen  225 PYCDFCLGDSRENKKTGMPEELVSCSDCGRSGHPSCLQFTANMIAAVKTYRWQCIECKY--------CSICGTSENDDQL  296 (336)
T ss_pred             cccceeccccccccccCCchhhcchhhcCCCCCcchhhhhHHHHHHHHhheeeeeecce--------eccccCcCCCcee
Confidence            489999876522     12459999999999999999642    234456899999984        8888765432  2


Q ss_pred             eec-CCCceeccccc
Q 002995          466 KPT-TDGRWAHLACA  479 (860)
Q Consensus       466 k~t-~~g~wvH~~Ca  479 (860)
                      .-. ...+=.|+-|.
T Consensus       297 lfcddcdrgyhmycl  311 (336)
T KOG1244|consen  297 LFCDDCDRGYHMYCL  311 (336)
T ss_pred             EeecccCCceeeEec
Confidence            222 23456787775


No 69 
>PF07649 C1_3:  C1-like domain;  InterPro: IPR011424 This short domain is rich in cysteines and histidines. The pattern of conservation is similar to that found in IPR002219 from INTERPRO. C1 domains are protein kinase C-like zinc finger structures. Diacylglycerol (DAG) kinases (DGKs) have a two or three commonly conserved cysteine-rich C1 domains []. DGKs modulate the balance between the two signaling lipids, DAG and phosphatidic acid (PA), by phosphorylating DAG to yield PA []. The PKD (protein kinase D) family are novel DAG receptors. They have twin C1 domains, designated C1a and C1b, which bind DAG or phorbol esters. Individual C1 domains differ in ligand-binding activity and selectivity []. ; GO: 0047134 protein-disulfide reductase activity, 0055114 oxidation-reduction process; PDB: 1V5N_A.
Probab=82.97  E-value=0.54  Score=32.54  Aligned_cols=28  Identities=25%  Similarity=0.829  Sum_probs=12.5

Q ss_pred             ccccccCCcccCCCeeEEccccCceecccc
Q 002995          398 KCSVCHMDEEYQNNLFLQCDKCRMMVHARC  427 (860)
Q Consensus       398 ~C~vC~~~~~~~~n~ll~C~~C~~~vH~~C  427 (860)
                      .|++|......  .....|..|+..+|..|
T Consensus         2 ~C~~C~~~~~~--~~~Y~C~~Cdf~lH~~C   29 (30)
T PF07649_consen    2 RCDACGKPIDG--GWFYRCSECDFDLHEEC   29 (30)
T ss_dssp             --TTTS----S----EEE-TTT-----HHH
T ss_pred             cCCcCCCcCCC--CceEECccCCCccChhc
Confidence            59999876543  46899999999999988


No 70 
>cd04508 TUDOR Tudor domains are found in many eukaryotic organisms and have been implicated in protein-protein interactions in which methylated protein substrates bind to these domains. For example, the Tudor domain of Survival of Motor Neuron (SMN) binds to symmetrically dimethylated arginines of arginine-glycine (RG) rich sequences found in the C-terminal tails of Sm proteins. The SMN protein is linked to spinal muscular atrophy. Another example is the tandem tudor domains of 53BP1, which bind to histone H4 specifically dimethylated at Lys20 (H4-K20me2). 53BP1 is a key transducer of the DNA damage checkpoint signal.
Probab=81.43  E-value=1.3  Score=34.03  Aligned_cols=32  Identities=31%  Similarity=0.517  Sum_probs=28.0

Q ss_pred             CCceeeeeeeecCCCCceEEEecC-CCccccccc
Q 002995            2 QDWYSGFVVGYDSESNRHHVKYVD-GDEEDLILS   34 (860)
Q Consensus         2 ~~~~~g~v~~~~~~~~~h~~~y~d-~d~e~~~l~   34 (860)
                      ..||.|.|.+.++ .+.-.|.|.| |..|.+.+.
T Consensus        12 ~~wyra~V~~~~~-~~~~~V~f~DyG~~~~v~~~   44 (48)
T cd04508          12 GKWYRAKITSILS-DGKVEVFFVDYGNTEVVPLS   44 (48)
T ss_pred             CeEEEEEEEEECC-CCcEEEEEEcCCCcEEEeHH
Confidence            4799999999998 7789999999 999887653


No 71 
>KOG1512 consensus PHD Zn-finger protein [General function prediction only]
Probab=81.16  E-value=0.55  Score=49.18  Aligned_cols=49  Identities=16%  Similarity=0.455  Sum_probs=37.1

Q ss_pred             CccccccCCccc----CCCeeEEccccCceecccccccc----ccCCCCceeeeccc
Q 002995          397 DKCSVCHMDEEY----QNNLFLQCDKCRMMVHARCYGEL----EPVNGVLWLCNLCR  445 (860)
Q Consensus       397 ~~C~vC~~~~~~----~~n~ll~C~~C~~~vH~~CYg~~----~~~~~~~W~C~~C~  445 (860)
                      ..|.+|......    .-|-+|.|..|....|..|....    .......|-|-.|+
T Consensus       259 ~~~~~~~~~~~~~~~~r~~S~I~C~~C~~~~HP~Ci~M~~elv~~~KTY~W~C~~C~  315 (381)
T KOG1512|consen  259 NERKHFWDIQTNIIQSRRNSWIVCKPCATRPHPYCVAMIPELVGQYKTYFWKCSSCE  315 (381)
T ss_pred             hhhhhhhcchhhhhhhhhccceeecccccCCCCcchhcCHHHHhHHhhcchhhcccH
Confidence            579999875432    34669999999999999998643    22344589999997


No 72 
>PF00130 C1_1:  Phorbol esters/diacylglycerol binding domain (C1 domain);  InterPro: IPR002219 Diacylglycerol (DAG) is an important second messenger. Phorbol esters (PE) are analogues of DAG and potent tumour promoters that cause a variety of physiological changes when administered to both cells and tissues. DAG activates a family of serine/threonine protein kinases, collectively known as protein kinase C (PKC) []. Phorbol esters can directly stimulate PKC. The N-terminal region of PKC, known as C1, has been shown [] to bind PE and DAG in a phospholipid and zinc-dependent fashion. The C1 region contains one or two copies (depending on the isozyme of PKC) of a cysteine-rich domain, which is about 50 amino-acid residues long, and which is essential for DAG/PE-binding. The DAG/PE-binding domain binds two zinc ions; the ligands of these metal ions are probably the six cysteines and two histidines that are conserved in this domain.; GO: 0035556 intracellular signal transduction; PDB: 1RFH_A 2FNF_X 3PFQ_A 1PTQ_A 1PTR_A 2VRW_B 1XA6_A 2ENN_A 1TBN_A 1TBO_A ....
Probab=80.17  E-value=1.9  Score=33.90  Aligned_cols=35  Identities=20%  Similarity=0.414  Sum_probs=25.6

Q ss_pred             CCccccccCCcccCCCeeEEccccCceeccccccc
Q 002995          396 LDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGE  430 (860)
Q Consensus       396 ~~~C~vC~~~~~~~~n~ll~C~~C~~~vH~~CYg~  430 (860)
                      ...|++|...--.....-+.|..|++.+|..|...
T Consensus        11 ~~~C~~C~~~i~g~~~~g~~C~~C~~~~H~~C~~~   45 (53)
T PF00130_consen   11 PTYCDVCGKFIWGLGKQGYRCSWCGLVCHKKCLSK   45 (53)
T ss_dssp             TEB-TTSSSBECSSSSCEEEETTTT-EEETTGGCT
T ss_pred             CCCCcccCcccCCCCCCeEEECCCCChHhhhhhhh
Confidence            36899998754224455789999999999999864


No 73 
>KOG2041 consensus WD40 repeat protein [General function prediction only]
Probab=80.01  E-value=1  Score=52.89  Aligned_cols=110  Identities=23%  Similarity=0.412  Sum_probs=63.4

Q ss_pred             CccccCCCcHHHHH--HHhhcCCCCCCCchhhhhhcccccCCCCCCCCccccccCCCccccccCCcccCCCeeEEccccC
Q 002995          343 GSDMFGFSNPEVMK--LILGLTKSRPTSKSSLCKLTSKYRDLPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCDKCR  420 (860)
Q Consensus       343 G~~~FGls~p~v~~--li~~Lp~a~~c~k~~~~k~~s~~~~~p~~~~~~~~~~~~~~~C~vC~~~~~~~~n~ll~C~~C~  420 (860)
                      ...-||.-...+++  ..+.||.|.+      +.+-...-++-+.+.|+..+. ...-|.+|+..-+.-   -++|..|+
T Consensus      1069 a~raFGtCSKAfmkLe~~e~l~~a~k------q~ye~La~~iFsk~~p~d~~~-~~vdc~~cg~~i~~~---~~~c~ec~ 1138 (1189)
T KOG2041|consen 1069 AVRAFGTCSKAFMKLEAFEELDDAEK------QEYENLAFRIFSKNPPVDPNS-AKVDCSVCGAKIDPY---DLQCSECQ 1138 (1189)
T ss_pred             hhhhhhhhHHHHHHHHhhhhCCHHHH------HHHHHHHHHHhccCCCCCCCc-cceeeeecCCcCCcc---CCCChhhc
Confidence            34467777777765  3556665553      111112223344455555443 346899998753321   36899998


Q ss_pred             ceecccccccccc-CCCCceeeecccCCCCC----CCCCccccCCCCC
Q 002995          421 MMVHARCYGELEP-VNGVLWLCNLCRPGAPE----PPPPCCLCPVVGG  463 (860)
Q Consensus       421 ~~vH~~CYg~~~~-~~~~~W~C~~C~~~~~~----~~~~C~lC~~~gG  463 (860)
                      -.+- -|-....+ .+..-|+|.+|+.....    .-..|.||...-+
T Consensus      1139 ~kfP-~CiasG~pIt~~~fWlC~~CkH~a~~~EIs~y~~CPLCHs~~~ 1185 (1189)
T KOG2041|consen 1139 TKFP-VCIASGRPITDNIFWLCPRCKHRAHQHEISKYNCCPLCHSMES 1185 (1189)
T ss_pred             CcCc-eeeccCCccccceEEEccccccccccccccccccCccccChhh
Confidence            7764 46543222 23347999999986543    2245888875433


No 74 
>PF03107 C1_2:  C1 domain;  InterPro: IPR004146 This short domain is rich in cysteines and histidines. The pattern of conservation is similar to that found in DAG_PE-bind (IPR002219 from INTERPRO), therefore we have termed this domain DC1 for divergent C1 domain. This domain probably also binds to two zinc ions. The function of proteins with this domain is uncertain, however this domain may bind to molecules such as diacylglycerol. This family are found in plant proteins.
Probab=79.71  E-value=1.5  Score=30.33  Aligned_cols=28  Identities=32%  Similarity=0.840  Sum_probs=22.4

Q ss_pred             ccccccCCcccCCCeeEEccccCceecccc
Q 002995          398 KCSVCHMDEEYQNNLFLQCDKCRMMVHARC  427 (860)
Q Consensus       398 ~C~vC~~~~~~~~n~ll~C~~C~~~vH~~C  427 (860)
                      .|+||+..-+...  ...|+.|+..+|..|
T Consensus         2 ~C~~C~~~~~~~~--~Y~C~~c~f~lh~~C   29 (30)
T PF03107_consen    2 WCDVCRRKIDGFY--FYHCSECCFTLHVRC   29 (30)
T ss_pred             CCCCCCCCcCCCE--eEEeCCCCCeEcCcc
Confidence            5999976544322  899999999999988


No 75 
>KOG2084 consensus Predicted histone tail methylase containing SET domain [Chromatin structure and dynamics]
Probab=79.21  E-value=2.8  Score=48.81  Aligned_cols=40  Identities=33%  Similarity=0.422  Sum_probs=28.8

Q ss_pred             cCCCCCCCCeeEEEEEECCeeEEEEEEccCCCCCC-eEEEecCCC
Q 002995          773 LINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWE-ELTYDYRFF  816 (860)
Q Consensus       773 fiNHSC~PN~~~~~~~~~~~~~i~i~A~RdI~~GE-ELt~dY~~~  816 (860)
                      ++||||.||+..   ..+ .....+++...+.+++ ||++.|-..
T Consensus       208 ~~~hsC~pn~~~---~~~-~~~~~~~~~~~~~~~~~~l~~~y~~~  248 (482)
T KOG2084|consen  208 LFNHSCFPNISV---IFD-GRGLALLVPAGIDAGEEELTISYTDP  248 (482)
T ss_pred             hcccCCCCCeEE---EEC-CceeEEEeecccCCCCCEEEEeeccc
Confidence            679999999872   233 3445566777777776 999999543


No 76 
>KOG1973 consensus Chromatin remodeling protein, contains PHD Zn-finger [Chromatin structure and dynamics]
Probab=78.51  E-value=0.93  Score=49.02  Aligned_cols=29  Identities=38%  Similarity=0.787  Sum_probs=24.7

Q ss_pred             cceecCC-cceeeEEcCCCCcC-ccccchhh
Q 002995          510 LCSICGV-SYGACIQCSNTTCR-VAYHPLCA  538 (860)
Q Consensus       510 ~C~~C~~-~~Ga~iqC~~~~C~-~~fH~~CA  538 (860)
                      .-++|.+ .+|-||.|...+|. .|||..|-
T Consensus       220 ~yC~Cnqvsyg~Mi~CDn~~C~~eWFH~~CV  250 (274)
T KOG1973|consen  220 TYCICNQVSYGKMIGCDNPGCPIEWFHFTCV  250 (274)
T ss_pred             EEEEecccccccccccCCCCCCcceEEEecc
Confidence            3456665 69999999999999 99999996


No 77 
>cd04508 TUDOR Tudor domains are found in many eukaryotic organisms and have been implicated in protein-protein interactions in which methylated protein substrates bind to these domains. For example, the Tudor domain of Survival of Motor Neuron (SMN) binds to symmetrically dimethylated arginines of arginine-glycine (RG) rich sequences found in the C-terminal tails of Sm proteins. The SMN protein is linked to spinal muscular atrophy. Another example is the tandem tudor domains of 53BP1, which bind to histone H4 specifically dimethylated at Lys20 (H4-K20me2). 53BP1 is a key transducer of the DNA damage checkpoint signal.
Probab=77.91  E-value=3  Score=31.92  Aligned_cols=47  Identities=15%  Similarity=0.205  Sum_probs=37.3

Q ss_pred             CCEEEEEecc-cCCCceEeeCCCCcCccccccccCCCCcEEEEEeCCCCEEEEeCCCccc
Q 002995           85 GDIIWAKLTG-HAMWPAIVVDESLIGDYKGLNKISGGRSIPVQFFGTHDFARINVKQVIS  143 (860)
Q Consensus        85 GdlVwaK~~g-~PwWPa~V~~~~~~~~~~~~~~~~~~~~~~V~FFg~~~~awv~~~~l~~  143 (860)
                      |++|.|+... --|.+|+|....            ..+.+.|.|.+-.....|+.++|.+
T Consensus         1 G~~c~a~~~~d~~wyra~V~~~~------------~~~~~~V~f~DyG~~~~v~~~~l~~   48 (48)
T cd04508           1 GDLCLAKYSDDGKWYRAKITSIL------------SDGKVEVFFVDYGNTEVVPLSDLRP   48 (48)
T ss_pred             CCEEEEEECCCCeEEEEEEEEEC------------CCCcEEEEEEcCCCcEEEeHHHcCC
Confidence            7899999986 779999997632            1455899999888888888877754


No 78 
>PF11793 FANCL_C:  FANCL C-terminal domain; PDB: 3K1L_A.
Probab=77.71  E-value=1.6  Score=36.90  Aligned_cols=50  Identities=18%  Similarity=0.365  Sum_probs=19.0

Q ss_pred             CccccccCCcc-cCCCeeEEcc--ccCceeccccccc-----ccc---CCCCceeeecccC
Q 002995          397 DKCSVCHMDEE-YQNNLFLQCD--KCRMMVHARCYGE-----LEP---VNGVLWLCNLCRP  446 (860)
Q Consensus       397 ~~C~vC~~~~~-~~~n~ll~C~--~C~~~vH~~CYg~-----~~~---~~~~~W~C~~C~~  446 (860)
                      ..|.||..... .+.-+.+.|+  .|+..+|..|.-.     ...   ..-..+.|+.|..
T Consensus         3 ~~C~IC~~~~~~~~~~p~~~C~n~~C~~~fH~~CL~~wf~~~~~~~~~~~~~~G~CP~C~~   63 (70)
T PF11793_consen    3 LECGICYSYRLDDGEIPDVVCPNPSCGKKFHLLCLSEWFLSLEKSRQSFIPIFGECPYCSS   63 (70)
T ss_dssp             -S-SSS--SS-TT-----B--S-TT----B-SGGGHHHHHHHHSSS-TTT--EEE-TTT-S
T ss_pred             CCCCcCCcEecCCCCcCceEcCCcccCCHHHHHHHHHHHHHcccCCeeecccccCCcCCCC
Confidence            57999998654 3344578998  9999999999852     111   1112467888874


No 79 
>cd00029 C1 Protein kinase C conserved region 1 (C1) . Cysteine-rich zinc binding domain. Some members of this domain family bind phorbol esters and diacylglycerol, some are reported to bind RasGTP. May occur in tandem arrangement. Diacylglycerol (DAG) is a second messenger, released by activation of Phospholipase D. Phorbol Esters (PE) can act as analogues of DAG and mimic its downstream effects in, for example, tumor promotion. Protein Kinases C are activated by DAG/PE, this activation is mediated by their N-terminal conserved region (C1). DAG/PE binding may be phospholipid dependent. C1 domains may also mediate DAG/PE signals in chimaerins (a family of Rac GTPase activating proteins), RasGRPs (exchange factors for Ras/Rap1), and Munc13 isoforms (scaffolding proteins involved in exocytosis).
Probab=75.21  E-value=1.4  Score=33.96  Aligned_cols=34  Identities=24%  Similarity=0.480  Sum_probs=25.7

Q ss_pred             CccccccCCcccCCCeeEEccccCceeccccccc
Q 002995          397 DKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGE  430 (860)
Q Consensus       397 ~~C~vC~~~~~~~~n~ll~C~~C~~~vH~~CYg~  430 (860)
                      ..|++|...--.-...-+.|+.|++.||..|...
T Consensus        12 ~~C~~C~~~i~~~~~~~~~C~~C~~~~H~~C~~~   45 (50)
T cd00029          12 TFCDVCRKSIWGLFKQGLRCSWCKVKCHKKCADK   45 (50)
T ss_pred             CChhhcchhhhccccceeEcCCCCCchhhhhhcc
Confidence            5899998643222245678999999999999863


No 80 
>KOG1337 consensus N-methyltransferase [General function prediction only]
Probab=73.55  E-value=2.3  Score=49.86  Aligned_cols=40  Identities=33%  Similarity=0.359  Sum_probs=30.5

Q ss_pred             cCCCCCCCCeeEEEEEECCeeEEEEEEccCCCCCCeEEEecCC
Q 002995          773 LINHSCEPNCYSRVISVNGDEHIIIFAKRDIKQWEELTYDYRF  815 (860)
Q Consensus       773 fiNHSC~PN~~~~~~~~~~~~~i~i~A~RdI~~GEELt~dY~~  815 (860)
                      +.||++.+.   ...+......+.+++.++|.+|||++++||.
T Consensus       239 ~~NH~~~~~---~~~~~~~d~~~~l~~~~~v~~geevfi~YG~  278 (472)
T KOG1337|consen  239 LLNHSPEVI---KAGYNQEDEAVELVAERDVSAGEEVFINYGP  278 (472)
T ss_pred             hhccCchhc---cccccCCCCcEEEEEeeeecCCCeEEEecCC
Confidence            569999992   1122222348999999999999999999976


No 81 
>PF07039 DUF1325:  SGF29 tudor-like domain;  InterPro: IPR010750  SAGA-associated factor 29 is involved in transcriptional regulation, probably through association with histone acetyltransferase (HAT) complexes like the TFTC-HAT or STAGA complexes. It also may be involved in MYC-mediated oncogenic transformation. It is a component of the ATAC complex, which is a complex with histone acetyltransferase activity on histones H3 and H4 [].   This entry represents a domain found in yeast and human SAGA-associated factor 29 proteins that is related to the tudor domain. ; PDB: 3MP6_A 3MP1_A 3MP8_A 3MET_B 3ME9_A 3MEU_B 3MEA_A 3MEV_B 3LX7_A 3MEW_A.
Probab=72.77  E-value=16  Score=34.79  Aligned_cols=107  Identities=22%  Similarity=0.351  Sum_probs=59.6

Q ss_pred             CCceeeeeeeecCCCCceEEEecCCCccccccccceEEEEEccchhhhhhcccCCCCCCCCCCChhhHHHhhcCccCCCC
Q 002995            2 QDWYSGFVVGYDSESNRHHVKYVDGDEEDLILSNERIKFYISQEEMDCLKLSFSINNVDNDGYDYDEMVVLAASLDDCQE   81 (860)
Q Consensus         2 ~~~~~g~v~~~~~~~~~h~~~y~d~d~e~~~l~~e~i~~~~s~~e~~~l~~~~~~~~~~~~~~~~de~~~~~~~~~~~~~   81 (860)
                      +.|.-+.|++|+.++++..|.=.|-++       +.-++.++....--|-.+..                  ........
T Consensus        17 ~~WIla~Vv~~~~~~~rYeV~D~d~~~-------~~~~~~~~~~~iIPLP~~~~------------------~~~~~~~~   71 (130)
T PF07039_consen   17 EEWILAEVVKYNSDGNRYEVEDPDPEE-------EKKRYKLSRKQIIPLPKKAP------------------PDTDPLAE   71 (130)
T ss_dssp             CEEEEEEEEEEETTTTEEEEEETTTCT-------TTEEEEEEGGGEEEE-SB--------------------TTT-GGGS
T ss_pred             CCEEEEEEEEEeCCCCEEEEecCCCCC-------CCceEEeCHHHEEECCCccC------------------CCCCchhh
Confidence            469999999999999877776544443       11144444433333322111                  01122457


Q ss_pred             CCCCCEEEEEe-cccCCCceEeeCCCCcCccccccccCCCCcEEEEEeCCCCE---EEEeCCCccc
Q 002995           82 LEPGDIIWAKL-TGHAMWPAIVVDESLIGDYKGLNKISGGRSIPVQFFGTHDF---ARINVKQVIS  143 (860)
Q Consensus        82 f~~GdlVwaK~-~g~PwWPa~V~~~~~~~~~~~~~~~~~~~~~~V~FFg~~~~---awv~~~~l~~  143 (860)
                      |..|..|+|.. ..--.-||.|...+          ....+.|.|.|=|+.+-   --|+..-+.+
T Consensus        72 f~~g~~VLAlYP~TT~FY~A~V~~~p----------~~~~~~y~l~Fedd~~~~~~~~V~~r~Vv~  127 (130)
T PF07039_consen   72 FPKGTKVLALYPDTTCFYPATVVSPP----------KKKSGEYKLKFEDDEDADGYREVPQRYVVA  127 (130)
T ss_dssp             --TT-EEEEE-TTSSEEEEEEEEEE-----------SSTTS-EEEEECTTTSTTSBEEE-GGGEEE
T ss_pred             CCCCCEEEEECCCCceEEEEEEEeCC----------CCCCCcEEEEEeCCCCcCCcEEEccceEEc
Confidence            99999999995 33346789997742          13456799999988662   3444444443


No 82 
>PF02318 FYVE_2:  FYVE-type zinc finger;  InterPro: IPR003315 This entry represents the zinc-binding domain found in rabphilin Rab3A. The small G protein Rab3A plays an important role in the regulation of neurotransmitter release. The crystal structure of the small G protein Rab3A complexed with the effector domain of rabphilin-3A shows that the effector domain of rabphilin-3A contacts Rab3A in two distinct areas. The first interface involves the Rab3A switch I and switch II regions, which are sensitive to the nucleotide-binding state of Rab3A. The second interface consists of a deep pocket in Rab3A that interacts with a SGAWFF structural element of rabphilin-3A. Sequence and structure analysis, and biochemical data suggest that this pocket, or Rab complementarity-determining region (RabCDR), establishes a specific interaction between each Rab protein and its effectors. It has been suggested that RabCDRs could be major determinants of effector specificity during vesicle trafficking and fusion [].; GO: 0008270 zinc ion binding, 0017137 Rab GTPase binding, 0006886 intracellular protein transport; PDB: 2CSZ_A 2ZET_C 1ZBD_B 3BC1_B 2CJS_C 2A20_A.
Probab=70.86  E-value=3.5  Score=38.57  Aligned_cols=49  Identities=22%  Similarity=0.510  Sum_probs=35.1

Q ss_pred             CCccccccCCcccCCCeeEEccccCceeccccccccccCCCCceeeecccC
Q 002995          396 LDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRP  446 (860)
Q Consensus       396 ~~~C~vC~~~~~~~~n~ll~C~~C~~~vH~~CYg~~~~~~~~~W~C~~C~~  446 (860)
                      ...|..|......-.|.-..|..|...|=..|-.. . .....|+|..|..
T Consensus        54 ~~~C~~C~~~fg~l~~~~~~C~~C~~~VC~~C~~~-~-~~~~~WlC~vC~k  102 (118)
T PF02318_consen   54 ERHCARCGKPFGFLFNRGRVCVDCKHRVCKKCGVY-S-KKEPIWLCKVCQK  102 (118)
T ss_dssp             CSB-TTTS-BCSCTSTTCEEETTTTEEEETTSEEE-T-SSSCCEEEHHHHH
T ss_pred             CcchhhhCCcccccCCCCCcCCcCCccccCccCCc-C-CCCCCEEChhhHH
Confidence            35899998765444555689999999999999543 2 2334899999975


No 83 
>KOG4443 consensus Putative transcription factor HALR/MLL3, involved in embryonic development [General function prediction only]
Probab=69.82  E-value=2.1  Score=50.31  Aligned_cols=81  Identities=21%  Similarity=0.338  Sum_probs=51.8

Q ss_pred             ccccCccccCCCcHHHHHH-HhhcCCCCCCCchhhhhhcccccCCCCCCCCccccccCCCccccccCCcccCCCeeEEcc
Q 002995          339 ISESGSDMFGFSNPEVMKL-ILGLTKSRPTSKSSLCKLTSKYRDLPGGYRPVRVDWKDLDKCSVCHMDEEYQNNLFLQCD  417 (860)
Q Consensus       339 ~~~sG~~~FGls~p~v~~l-i~~Lp~a~~c~k~~~~k~~s~~~~~p~~~~~~~~~~~~~~~C~vC~~~~~~~~n~ll~C~  417 (860)
                      ....+.+.|++.||....+ ++.+.             -..+...           ..-.+|..|+..++  ...+++|+
T Consensus        34 ~m~ac~~c~~~yH~~cvt~~~~~~~-------------l~~gWrC-----------~~crvCe~c~~~gD--~~kf~~Ck   87 (694)
T KOG4443|consen   34 RLLACSDCGQKYHPYCVTSWAQHAV-------------LSGGWRC-----------PSCRVCEACGTTGD--PKKFLLCK   87 (694)
T ss_pred             cchhhhhhcccCCcchhhHHHhHHH-------------hcCCccc-----------CCceeeeeccccCC--cccccccc
Confidence            3468889999999986653 22110             0011111           23357888874443  45699999


Q ss_pred             ccCceeccccccc--cccCCCCceeeecccC
Q 002995          418 KCRMMVHARCYGE--LEPVNGVLWLCNLCRP  446 (860)
Q Consensus       418 ~C~~~vH~~CYg~--~~~~~~~~W~C~~C~~  446 (860)
                      +|.+.+|-+|---  ..++.+ +|+|..|..
T Consensus        88 ~cDvsyh~yc~~P~~~~v~sg-~~~ckk~~~  117 (694)
T KOG4443|consen   88 RCDVSYHCYCQKPPNDKVPSG-PWLCKKCTR  117 (694)
T ss_pred             cccccccccccCCccccccCc-ccccHHHHh
Confidence            9999999999642  234555 899998853


No 84 
>KOG1473 consensus Nucleosome remodeling factor, subunit NURF301/BPTF [Chromatin structure and dynamics; Transcription]
Probab=68.54  E-value=5.4  Score=49.59  Aligned_cols=110  Identities=25%  Similarity=0.423  Sum_probs=70.0

Q ss_pred             CCccccccCCcccCCCeeEEccccCceecccccccc--ccCCCCceeeecccCCCCCCCCCccccCCCCCCceecC-CCc
Q 002995          396 LDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGEL--EPVNGVLWLCNLCRPGAPEPPPPCCLCPVVGGAMKPTT-DGR  472 (860)
Q Consensus       396 ~~~C~vC~~~~~~~~n~ll~C~~C~~~vH~~CYg~~--~~~~~~~W~C~~C~~~~~~~~~~C~lC~~~gGalk~t~-~g~  472 (860)
                      ++.|-+|....+     ++.|..|...||..|.--+  .++. ..|-|..|..-+.+....|+|=+.+.+...+.+ .|.
T Consensus       344 ddhcrf~~d~~~-----~lc~Et~prvvhlEcv~hP~~~~~s-~~~e~evc~~hkvngvvd~vl~~~K~~~~iR~~~iG~  417 (1414)
T KOG1473|consen  344 DDHCRFCHDLGD-----LLCCETCPRVVHLECVFHPRFAVPS-AFWECEVCNIHKVNGVVDCVLPPSKNVDSIRHTPIGR  417 (1414)
T ss_pred             cccccccCcccc-----eeecccCCceEEeeecCCccccCCC-ccchhhhhhhhccCcccccccChhhcccceeccCCCc
Confidence            468999986544     8899999999999996422  2333 389999998766555566887777766663332 111


Q ss_pred             eeccccccccCceEEccCCCccccccchhhhccccCccceecCCcceeeEEcCCCCcCccccc-hhh
Q 002995          473 WAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYHP-LCA  538 (860)
Q Consensus       473 wvH~~Ca~~~p~~~~~~~~~~~~v~~i~~i~~~~~~~~C~~C~~~~Ga~iqC~~~~C~~~fH~-~CA  538 (860)
                      ==|-. -+|                   .+     .-.|.||.. .+..+ |+++.|...||. .|-
T Consensus       418 dr~gr-~yw-------------------fi-----~rrl~Ie~~-det~l-~yysT~pqly~ll~cL  457 (1414)
T KOG1473|consen  418 DRYGR-KYW-------------------FI-----SRRLRIEGM-DETLL-WYYSTCPQLYHLLRCL  457 (1414)
T ss_pred             Ccccc-chh-------------------ce-----eeeeEEecC-CCcEE-EEecCcHHHHHHHHHh
Confidence            00000 000                   00     126888886 44444 556789999999 664


No 85 
>cd04718 BAH_plant_2 BAH, or Bromo Adjacent Homology domain, plant-specific sub-family with unknown function. BAH domains are found in a variety of proteins playing roles in transcriptional silencing and the remodeling of chromatin. It is assumed that in most or all of these instances the BAH domain mediates protein-protein interactions.
Probab=68.30  E-value=3.2  Score=40.14  Aligned_cols=26  Identities=27%  Similarity=0.600  Sum_probs=20.6

Q ss_pred             eecccccc--ccccCCCCceeeecccCCC
Q 002995          422 MVHARCYG--ELEPVNGVLWLCNLCRPGA  448 (860)
Q Consensus       422 ~vH~~CYg--~~~~~~~~~W~C~~C~~~~  448 (860)
                      ++|..|..  +..+|+| .|+|+.|....
T Consensus         1 g~H~~CL~Ppl~~~P~g-~W~Cp~C~~~~   28 (148)
T cd04718           1 GFHLCCLRPPLKEVPEG-DWICPFCEVEK   28 (148)
T ss_pred             CcccccCCCCCCCCCCC-CcCCCCCcCCC
Confidence            47999997  4457777 89999998653


No 86 
>KOG1044 consensus Actin-binding LIM Zn-finger protein Limatin involved in axon guidance [Signal transduction mechanisms; Cytoskeleton]
Probab=68.01  E-value=2.9  Score=48.35  Aligned_cols=102  Identities=23%  Similarity=0.472  Sum_probs=51.0

Q ss_pred             CceeeecccCCCCC------CCCCccccC---CCCCCceecCCCceeccccccccCceEEccCCCcc--ccccchhhhcc
Q 002995          437 VLWLCNLCRPGAPE------PPPPCCLCP---VVGGAMKPTTDGRWAHLACAIWIPETCLTDVKRME--PIDGLNRVSKD  505 (860)
Q Consensus       437 ~~W~C~~C~~~~~~------~~~~C~lC~---~~gGalk~t~~g~wvH~~Ca~~~p~~~~~~~~~~~--~v~~i~~i~~~  505 (860)
                      ..-+|+.|....+.      ....|+-|.   ..|++|.-. +.+| |+.|..---...+.+.+.|.  -+---..-...
T Consensus       112 k~~~c~~c~~~~~~~p~~~~~ps~cagc~~~lk~gq~llal-d~qw-hv~cfkc~~c~~vL~gey~skdg~pyce~dy~~  189 (670)
T KOG1044|consen  112 KECLCQTCSQPMPVSPAESYGPSTCAGCGEELKNGQALLAL-DKQW-HVSCFKCKSCSAVLNGEYMSKDGVPYCEKDYQA  189 (670)
T ss_pred             hhhhhhhhcCcccCCcccccCCccccchhhhhhccceeeee-ccce-eeeeeehhhhcccccceeeccCCCcchhhhhhh
Confidence            34667777654332      234677775   344554433 3456 44443221111111111111  01112233445


Q ss_pred             ccCccceecCC-cceeeEEcCCCCcCccccchhhhhcCee
Q 002995          506 RWKLLCSICGV-SYGACIQCSNTTCRVAYHPLCARAAGLC  544 (860)
Q Consensus       506 ~~~~~C~~C~~-~~Ga~iqC~~~~C~~~fH~~CA~~~g~~  544 (860)
                      .+..+|..|.+ ..|..+|=.  .  ..|||+||+=.-+.
T Consensus       190 ~fgvkc~~c~~fisgkvLqag--~--kh~HPtCARCsRCg  225 (670)
T KOG1044|consen  190 KFGVKCEECEKFISGKVLQAG--D--KHFHPTCARCSRCG  225 (670)
T ss_pred             hcCeehHHhhhhhhhhhhhcc--C--cccCcchhhhhhhc
Confidence            56678888875 456666653  3  78999999776554


No 87 
>smart00109 C1 Protein kinase C conserved region 1 (C1) domains (Cysteine-rich domains). Some bind phorbol esters and diacylglycerol. Some bind RasGTP. Zinc-binding domains.
Probab=65.67  E-value=2.4  Score=32.34  Aligned_cols=33  Identities=27%  Similarity=0.522  Sum_probs=24.9

Q ss_pred             CCccccccCCcccCCCeeEEccccCceecccccc
Q 002995          396 LDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYG  429 (860)
Q Consensus       396 ~~~C~vC~~~~~~~~n~ll~C~~C~~~vH~~CYg  429 (860)
                      ...|++|...-.... .-+.|..|++.+|..|..
T Consensus        11 ~~~C~~C~~~i~~~~-~~~~C~~C~~~~H~~C~~   43 (49)
T smart00109       11 PTKCCVCRKSIWGSF-QGLRCSWCKVKCHKKCAE   43 (49)
T ss_pred             CCCccccccccCcCC-CCcCCCCCCchHHHHHHh
Confidence            358999987532211 367899999999999975


No 88 
>PF13901 DUF4206:  Domain of unknown function (DUF4206)
Probab=64.39  E-value=4.5  Score=41.67  Aligned_cols=42  Identities=26%  Similarity=0.789  Sum_probs=32.4

Q ss_pred             CccccccCCc---ccCCCeeEEccccCceeccccccccccCCCCceeeecccC
Q 002995          397 DKCSVCHMDE---EYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRP  446 (860)
Q Consensus       397 ~~C~vC~~~~---~~~~n~ll~C~~C~~~vH~~CYg~~~~~~~~~W~C~~C~~  446 (860)
                      ..|.+|..++   ..+.+..+.|.+|+..+|..|+.-     .   -|++|..
T Consensus       153 fiCe~C~~~~~IfPF~~~~~~~C~~C~~v~H~~C~~~-----~---~CpkC~R  197 (202)
T PF13901_consen  153 FICEICNSDDIIFPFQIDTTVRCPKCKSVFHKSCFRK-----K---SCPKCAR  197 (202)
T ss_pred             CCCccCCCCCCCCCCCCCCeeeCCcCccccchhhcCC-----C---CCCCcHh
Confidence            5899998764   234456889999999999999972     1   2888864


No 89 
>PF14569 zf-UDP:  Zinc-binding RING-finger; PDB: 1WEO_A.
Probab=60.67  E-value=3.9  Score=34.73  Aligned_cols=50  Identities=24%  Similarity=0.616  Sum_probs=23.2

Q ss_pred             CCCccccccCC--cccCCCeeEEccccCceeccccccccccCCCCceeeecccC
Q 002995          395 DLDKCSVCHMD--EEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRP  446 (860)
Q Consensus       395 ~~~~C~vC~~~--~~~~~n~ll~C~~C~~~vH~~CYg~~~~~~~~~W~C~~C~~  446 (860)
                      ....|.||+..  -..+.+.++.|..|+..|-..||--.. .+| .-.|..|..
T Consensus         8 ~~qiCqiCGD~VGl~~~Ge~FVAC~eC~fPvCr~CyEYEr-keg-~q~CpqCkt   59 (80)
T PF14569_consen    8 NGQICQICGDDVGLTENGEVFVACHECAFPVCRPCYEYER-KEG-NQVCPQCKT   59 (80)
T ss_dssp             SS-B-SSS--B--B-SSSSB--S-SSS-----HHHHHHHH-HTS--SB-TTT--
T ss_pred             CCcccccccCccccCCCCCEEEEEcccCCccchhHHHHHh-hcC-cccccccCC
Confidence            34699999874  245678999999999999999996322 223 567888864


No 90 
>PF11764 N-SET:  COMPASS (Complex proteins associated with Set1p) component N;  InterPro: IPR024657 The COMPASS complex (complex proteins associated with Set1p) is conserved in yeasts and in other eukaryotes up to humans. The COMPASS complex functions to methylate the fourth lysine of Histone 3 and for the silencing of genes close to the telomeres of chromosomes [].  Five domains are conserved in Saccharomyces cerevisiae Set1 and other eukaryotic Set1-related proteins: an amino-terminal RNA-recognition motif (RRM), a semi-conserved central domain, an N-SET domain, the catalytic SET domain, and the C-terminal post-SET domain. This entry represents the N-SET domain, which promotes trimethylation in conjunction with the RRM domain [] and is necessary for binding of the Spp1 component of COMPASS into the complex [].; PDB: 3UVO_B 4ES0_C 3UVN_D 4EWR_C.
Probab=57.65  E-value=2.5  Score=42.12  Aligned_cols=16  Identities=31%  Similarity=0.507  Sum_probs=6.0

Q ss_pred             CCCCCCcccCCccccc
Q 002995          603 PSNPSGCARSEPYNYF  618 (860)
Q Consensus       603 ~~npsGcaRsE~~~~~  618 (860)
                      ..|++||||+|||+++
T Consensus        78 ~~h~TGsARtEGy~KI   93 (167)
T PF11764_consen   78 VPHSTGSARTEGYYKI   93 (167)
T ss_dssp             -----SSHHHH-----
T ss_pred             ccCCCcCcccccceeC
Confidence            3578999999999754


No 91 
>KOG4443 consensus Putative transcription factor HALR/MLL3, involved in embryonic development [General function prediction only]
Probab=56.27  E-value=4.1  Score=47.98  Aligned_cols=48  Identities=25%  Similarity=0.637  Sum_probs=38.0

Q ss_pred             CccccccCCcccCCCeeEEccccCceecccccccc---ccCCCCceeeeccc
Q 002995          397 DKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGEL---EPVNGVLWLCNLCR  445 (860)
Q Consensus       397 ~~C~vC~~~~~~~~n~ll~C~~C~~~vH~~CYg~~---~~~~~~~W~C~~C~  445 (860)
                      ..|-+|...+....+.|+.|..|+...|..|....   .+..+ .|.|..|.
T Consensus        19 ~mc~l~~s~G~~~ag~m~ac~~c~~~yH~~cvt~~~~~~~l~~-gWrC~~cr   69 (694)
T KOG4443|consen   19 LMCPLCGSSGKGRAGRLLACSDCGQKYHPYCVTSWAQHAVLSG-GWRCPSCR   69 (694)
T ss_pred             hhhhhhccccccccCcchhhhhhcccCCcchhhHHHhHHHhcC-CcccCCce
Confidence            47888888777778889999999999999998731   22333 59999997


No 92 
>PF05502 Dynactin_p62:  Dynactin p62 family;  InterPro: IPR008603 Dynactin is a multi-subunit complex and a required cofactor for most, or all, o f the cellular processes powered by the microtubule-based motor cytoplasmic dyn ein. p62 binds directly to the Arp1 subunit of dynactin [, ].
Probab=55.82  E-value=8.5  Score=45.10  Aligned_cols=53  Identities=21%  Similarity=0.478  Sum_probs=37.4

Q ss_pred             eeEEccccCceeccccccccccCCCCceeeecccCCCCC--------CC-CCccccCCCCCCceec
Q 002995          412 LFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPE--------PP-PPCCLCPVVGGAMKPT  468 (860)
Q Consensus       412 ~ll~C~~C~~~vH~~CYg~~~~~~~~~W~C~~C~~~~~~--------~~-~~C~lC~~~gGalk~t  468 (860)
                      .|++|..|.......|...    +-..|||..|....+.        .- ..|+-||.=.+.|.-+
T Consensus         4 ~L~fC~~C~~irc~~c~~~----Ei~~~yCp~CL~~~p~~e~~~~~nrC~r~Cf~CP~C~~~L~~~   65 (483)
T PF05502_consen    4 ELYFCEHCHKIRCPRCVSE----EIDSYYCPNCLFEVPSSEARSEKNRCSRNCFDCPICFSPLSVR   65 (483)
T ss_pred             cceecccccccCChhhccc----ccceeECccccccCChhhheeccceeccccccCCCCCCcceeE
Confidence            4889999999888888542    2237999999875442        12 3788998766666543


No 93 
>KOG3799 consensus Rab3 effector RIM1 and related proteins, contain Rab3a binding domain [Intracellular trafficking, secretion, and vesicular transport]
Probab=54.98  E-value=5.5  Score=37.29  Aligned_cols=52  Identities=35%  Similarity=0.749  Sum_probs=35.9

Q ss_pred             cCCCccccccCCcccCCCeeEEccccCceecccccccc-ccCCCCceeeecccC
Q 002995          394 KDLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGEL-EPVNGVLWLCNLCRP  446 (860)
Q Consensus       394 ~~~~~C~vC~~~~~~~~n~ll~C~~C~~~vH~~CYg~~-~~~~~~~W~C~~C~~  446 (860)
                      .++..|.||....-.+. .=-.|.-|++.+-+.|-|-. .-....-|+|..|..
T Consensus        63 ~ddatC~IC~KTKFADG-~GH~C~YCq~r~CARCGGrv~lrsNKv~wvcnlc~k  115 (169)
T KOG3799|consen   63 GDDATCGICHKTKFADG-CGHNCSYCQTRFCARCGGRVSLRSNKVMWVCNLCRK  115 (169)
T ss_pred             CcCcchhhhhhcccccc-cCcccchhhhhHHHhcCCeeeeccCceEEeccCCcH
Confidence            35679999987432221 12358999999999998743 333455899999975


No 94 
>PF11781 RRN7:  RNA polymerase I-specific transcription initiation factor Rrn7;  InterPro: IPR021752  Rrn7 is a transcription binding factor that associates strongly with both Rrn6 and Rrn11 to form a complex which itself binds the TATA-binding protein and is required for transcription by the core domain of the RNA PolI promoter [],[]. 
Probab=51.92  E-value=11  Score=27.45  Aligned_cols=26  Identities=31%  Similarity=0.744  Sum_probs=20.0

Q ss_pred             CccccccCCcccCCCeeEEccccCce
Q 002995          397 DKCSVCHMDEEYQNNLFLQCDKCRMM  422 (860)
Q Consensus       397 ~~C~vC~~~~~~~~n~ll~C~~C~~~  422 (860)
                      ..|.+|....-..++-..+|+.|+-.
T Consensus         9 ~~C~~C~~~~~~~~dG~~yC~~cG~~   34 (36)
T PF11781_consen    9 EPCPVCGSRWFYSDDGFYYCDRCGHQ   34 (36)
T ss_pred             CcCCCCCCeEeEccCCEEEhhhCceE
Confidence            45999998765566668899999854


No 95 
>COG5475 Uncharacterized small protein [Function unknown]
Probab=51.13  E-value=35  Score=27.14  Aligned_cols=52  Identities=15%  Similarity=0.232  Sum_probs=40.6

Q ss_pred             CCCCCCCEEEEEecccCCCceEeeCCCCcCccccccccCCCCcEEEEEeCC--CCEEEEeCCCcccccc
Q 002995           80 QELEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLNKISGGRSIPVQFFGT--HDFARINVKQVISFLK  146 (860)
Q Consensus        80 ~~f~~GdlVwaK~~g~PwWPa~V~~~~~~~~~~~~~~~~~~~~~~V~FFg~--~~~awv~~~~l~~f~~  146 (860)
                      ..|..||+|-.|-+|    |++++..-           ..++.|..+||.-  ..++=.+++.|+|+..
T Consensus         3 ~~FstgdvV~lKsGG----P~Mtvs~~-----------ss~Gmy~C~Wf~g~g~~~~~F~ed~Lvp~~a   56 (60)
T COG5475           3 MSFSTGDVVTLKSGG----PRMTVSGY-----------SSDGMYECRWFDGYGVKREAFHEDELVPGEA   56 (60)
T ss_pred             ceeecCcEEEeecCC----ceEEEecc-----------ccCCeEEEEEecCCCcccccccccceecccc
Confidence            469999999999988    88998731           2347899999964  3477788889988764


No 96 
>PF08746 zf-RING-like:  RING-like domain;  InterPro: IPR014857 This is a zinc finger domain that is related to the C3HC4 RING finger domain (IPR001841 from INTERPRO). ; PDB: 3NW0_A 2CT0_A.
Probab=50.70  E-value=7.4  Score=29.46  Aligned_cols=30  Identities=30%  Similarity=0.639  Sum_probs=17.0

Q ss_pred             ceecCCcceeeEEcCCCCcCccccchhhhh
Q 002995          511 CSICGVSYGACIQCSNTTCRVAYHPLCARA  540 (860)
Q Consensus       511 C~~C~~~~Ga~iqC~~~~C~~~fH~~CA~~  540 (860)
                      |.+|+...-.-+.|....|...+|..|+..
T Consensus         1 C~~C~~iv~~G~~C~~~~C~~r~H~~C~~~   30 (43)
T PF08746_consen    1 CEACKEIVTQGQRCSNRDCNVRLHDDCFKK   30 (43)
T ss_dssp             -TTT-SB-SSSEE-SS--S--EE-HHHHHH
T ss_pred             CcccchhHeeeccCCCCccCchHHHHHHHH
Confidence            667877666668999999999999999854


No 97 
>COG5034 TNG2 Chromatin remodeling protein, contains PhD zinc finger [Chromatin structure and dynamics]
Probab=49.28  E-value=12  Score=39.27  Aligned_cols=40  Identities=25%  Similarity=0.603  Sum_probs=29.9

Q ss_pred             chhhhccccCccceecCC-cceeeEEcCCCCcCc-cccchhh
Q 002995          499 LNRVSKDRWKLLCSICGV-SYGACIQCSNTTCRV-AYHPLCA  538 (860)
Q Consensus       499 i~~i~~~~~~~~C~~C~~-~~Ga~iqC~~~~C~~-~fH~~CA  538 (860)
                      ++....+-.....++|++ ++|-||.|...+|.. |||..|-
T Consensus       211 vss~d~se~e~lYCfCqqvSyGqMVaCDn~nCkrEWFH~~CV  252 (271)
T COG5034         211 VSSEDNSEGEELYCFCQQVSYGQMVACDNANCKREWFHLECV  252 (271)
T ss_pred             cCccccccCceeEEEecccccccceecCCCCCchhheecccc
Confidence            333333344455678987 699999999999975 6999996


No 98 
>PF07227 DUF1423:  Protein of unknown function (DUF1423);  InterPro: IPR004082 A total of 715 potential protein-coding genes have been identified in the nucleotide sequence of Arabidopsis thaliana chromosome 5, with an average gene density of 1 gene per 4001 bp []. Amongst the gene products is a well-conserved family of 130.7kDa proteins that share no sequence similarity with any other known proteins, other than in plants. The sequences are characterised by an N-terminal domain of variable length, a central cysteine-rich region and a relatively acidic C-terminal domain. The sequences may possess a PHD finger.
Probab=46.75  E-value=9.8  Score=43.24  Aligned_cols=14  Identities=43%  Similarity=0.971  Sum_probs=11.8

Q ss_pred             CCceeccccccccC
Q 002995          470 DGRWAHLACAIWIP  483 (860)
Q Consensus       470 ~g~wvH~~Ca~~~p  483 (860)
                      .|.|+|+-||+-..
T Consensus       152 CgH~cH~dCALr~~  165 (446)
T PF07227_consen  152 CGHWCHLDCALRHE  165 (446)
T ss_pred             CCceehhhhhcccc
Confidence            67999999998743


No 99 
>PHA02862 5L protein; Provisional
Probab=43.98  E-value=7.7  Score=37.10  Aligned_cols=50  Identities=28%  Similarity=0.606  Sum_probs=34.2

Q ss_pred             CccccccCCcccCCCeeEEccccCceeccccccccccCCCCceeeecccCCCCCCCCCccccCCCC
Q 002995          397 DKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLCPVVG  462 (860)
Q Consensus       397 ~~C~vC~~~~~~~~n~ll~C~~C~~~vH~~CYg~~~~~~~~~W~C~~C~~~~~~~~~~C~lC~~~g  462 (860)
                      +.|=||+..++++. .--.|.+-..-||+.|..        .|+       .......|.||....
T Consensus         3 diCWIC~~~~~e~~-~PC~C~GS~K~VHq~CL~--------~WI-------n~S~k~~CeLCkteY   52 (156)
T PHA02862          3 DICWICNDVCDERN-NFCGCNEEYKVVHIKCMQ--------LWI-------NYSKKKECNLCKTKY   52 (156)
T ss_pred             CEEEEecCcCCCCc-ccccccCcchhHHHHHHH--------HHH-------hcCCCcCccCCCCeE
Confidence            58999998765443 345688889999999986        455       122344677776543


No 100
>PF09926 DUF2158:  Uncharacterized small protein (DUF2158);  InterPro: IPR019226 This entry represents a family of predominantly prokaryotic proteins with no known function. 
Probab=43.53  E-value=45  Score=26.56  Aligned_cols=41  Identities=32%  Similarity=0.570  Sum_probs=27.7

Q ss_pred             CCCCCEEEEEecccCCCceEeeCCCCcCccccccccCCCCcEEEEEeCCCC
Q 002995           82 LEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLNKISGGRSIPVQFFGTHD  132 (860)
Q Consensus        82 f~~GdlVwaK~~g~PwWPa~V~~~~~~~~~~~~~~~~~~~~~~V~FFg~~~  132 (860)
                      |++||+|--|-+|    |.|++..  +..    ..-...+.+..+||..+.
T Consensus         1 f~~GDvV~LKSGG----p~MTV~~--v~~----~~~~~~~~v~C~WFd~~~   41 (53)
T PF09926_consen    1 FKIGDVVQLKSGG----PRMTVTE--VGP----NAGASGGWVECQWFDGHG   41 (53)
T ss_pred             CCCCCEEEEccCC----CCeEEEE--ccc----cccCCCCeEEEEeCCCCC
Confidence            7899999999998    6677762  111    111234568899997654


No 101
>PF13639 zf-RING_2:  Ring finger domain; PDB: 2KIZ_A 4EPO_C 1IYM_A 2EP4_A 2ECT_A 2JRJ_A 2ECN_A 2ECM_A 3NG2_A 2EA6_A ....
Probab=42.90  E-value=13  Score=27.97  Aligned_cols=31  Identities=23%  Similarity=0.584  Sum_probs=22.6

Q ss_pred             CccccccCCcccCCCeeEEccccCceecccccc
Q 002995          397 DKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYG  429 (860)
Q Consensus       397 ~~C~vC~~~~~~~~n~ll~C~~C~~~vH~~CYg  429 (860)
                      +.|.||...-.. .+.++... |+-.+|..|..
T Consensus         1 d~C~IC~~~~~~-~~~~~~l~-C~H~fh~~Ci~   31 (44)
T PF13639_consen    1 DECPICLEEFED-GEKVVKLP-CGHVFHRSCIK   31 (44)
T ss_dssp             -CETTTTCBHHT-TSCEEEET-TSEEEEHHHHH
T ss_pred             CCCcCCChhhcC-CCeEEEcc-CCCeeCHHHHH
Confidence            379999987644 33455555 99999999975


No 102
>PF10080 DUF2318:  Predicted membrane protein (DUF2318);  InterPro: IPR018758 This domain of unknown function is found in hypothetical bacterial membrane proteins with no known function. 
Probab=42.49  E-value=13  Score=33.86  Aligned_cols=34  Identities=26%  Similarity=0.601  Sum_probs=26.3

Q ss_pred             CCccccccCCcccCCCeeEEccccCceecccccc
Q 002995          396 LDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYG  429 (860)
Q Consensus       396 ~~~C~vC~~~~~~~~n~ll~C~~C~~~vH~~CYg  429 (860)
                      .+.|+||....-.....-|.|..|++.++..=.|
T Consensus        35 ~daCeiC~~~GY~q~g~~lvC~~C~~~~~~~~ig   68 (102)
T PF10080_consen   35 FDACEICGPKGYYQEGDQLVCKNCGVRFNLPTIG   68 (102)
T ss_pred             EEeccccCCCceEEECCEEEEecCCCEEehhhcc
Confidence            3689999776655555577899999999987554


No 103
>PF05641 Agenet:  Agenet domain;  InterPro: IPR008395 This domain is related to the TUDOR domain IPR008191 from INTERPRO []. The function of the agenet domain is unknown. This signature matches one of the two Agenet domains in the FMR proteins [].; GO: 0003723 RNA binding; PDB: 2BKD_N 3O8V_A 3KUF_A 3H8Z_A.
Probab=41.92  E-value=22  Score=29.65  Aligned_cols=26  Identities=23%  Similarity=0.500  Sum_probs=20.5

Q ss_pred             CceeeeeeeecCCCCceEEEecCCCcc
Q 002995            3 DWYSGFVVGYDSESNRHHVKYVDGDEE   29 (860)
Q Consensus         3 ~~~~g~v~~~~~~~~~h~~~y~d~d~e   29 (860)
                      +||.|.|.+.+... +..|+|+|-..+
T Consensus        19 aWf~a~V~~~~~~~-~~~V~Y~~~~~~   44 (68)
T PF05641_consen   19 AWFPATVLKENGDD-KYLVEYDDLPDE   44 (68)
T ss_dssp             EEEEEEEEEEETT--EEEEEETT-SS-
T ss_pred             EEEEEEEEEeCCCc-EEEEEECCcccc
Confidence            69999999999888 999999765444


No 104
>cd04712 BAH_DCM_I BAH, or Bromo Adjacent Homology domain, as present in DNA (Cytosine-5)-methyltransferases (DCM) 1. DNA methylation, or the covalent addition of a methyl group to cytosine within the context of the CpG dinucleotide, has profound effects on the genome. These effects include transcriptional repression via inhibition of transcription factor binding, the recruitment of methyl-binding proteins and their associated chromatin remodeling factors, X chromosome inactivation, imprinting, and the suppression of parasitic DNA sequences. DNA methylation is also essential for proper embryonic development and is an important player in both DNA repair and genome stability. BAH domains are found in a variety of proteins playing roles in transcriptional silencing and the remodeling of chromatin. It is assumed that in most or all of these instances the BAH domain mediates protein-protein interactions.
Probab=41.32  E-value=24  Score=33.64  Aligned_cols=47  Identities=11%  Similarity=0.191  Sum_probs=30.7

Q ss_pred             CCCCCCCCEEEEEecccCC---------CceEeeCCCCcCccccccccCCCCcEEEEEeCC
Q 002995           79 CQELEPGDIIWAKLTGHAM---------WPAIVVDESLIGDYKGLNKISGGRSIPVQFFGT  130 (860)
Q Consensus        79 ~~~f~~GdlVwaK~~g~Pw---------WPa~V~~~~~~~~~~~~~~~~~~~~~~V~FFg~  130 (860)
                      +..+++||.|..+-.+-+-         ||+.|.....+..     ...+...++|+||--
T Consensus         3 ~~~i~vGD~V~v~~d~~~~~~~~~~~~~~~~~i~~V~~~~e-----~~~g~~~~h~~W~yr   58 (130)
T cd04712           3 GLTIRVGDVVSVERDDADSTTKWNDDHRWLPLVQFVEYMKK-----GSDGSKMFHGRWLYR   58 (130)
T ss_pred             CCEEeCCCEEEEcCCCCCccccccccccccceEEEEEEeee-----cCCCceEEEEEEEEc
Confidence            4568999999999876553         5777765433222     223445788998843


No 105
>KOG0955 consensus PHD finger protein BR140/LIN-49 [General function prediction only]
Probab=41.18  E-value=19  Score=45.62  Aligned_cols=64  Identities=22%  Similarity=0.365  Sum_probs=46.3

Q ss_pred             CCCCCEEEEEecccCCCceEeeCCCCcCcccccc------------------ccCCCCcEEEEEeCCCC-EEEEeCCCcc
Q 002995           82 LEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLN------------------KISGGRSIPVQFFGTHD-FARINVKQVI  142 (860)
Q Consensus        82 f~~GdlVwaK~~g~PwWPa~V~~~~~~~~~~~~~------------------~~~~~~~~~V~FFg~~~-~awv~~~~l~  142 (860)
                      +.+=++||||..||||-|+.++++.........+                  .+.....+.|-||.+.- .-|++...+.
T Consensus       943 l~~~~~~~akc~g~~s~~~l~l~p~~~~~~~~~~g~~~~p~~dv~~l~eq~~~~~~~~l~~~L~~~n~~~~~~~~~s~~~ 1022 (1051)
T KOG0955|consen  943 LEELKLVWAKCRGYPSYPALILDPKMPREGNFHNGPDPAPPTDVLALPEQRTNKAPETLFLVLFFDNKRCWQWLPRSKVL 1022 (1051)
T ss_pred             eeehhceeehhcCCccchhhhcccccccccCccCCCCCCCCcccccchHHHhcccChhheEEEeecccccccccCCCCcc
Confidence            7777899999999999999999986655433211                  11122345689997755 5799999887


Q ss_pred             ccc
Q 002995          143 SFL  145 (860)
Q Consensus       143 ~f~  145 (860)
                      +.-
T Consensus      1023 ~l~ 1025 (1051)
T KOG0955|consen 1023 ELG 1025 (1051)
T ss_pred             ccc
Confidence            764


No 106
>PF07649 C1_3:  C1-like domain;  InterPro: IPR011424 This short domain is rich in cysteines and histidines. The pattern of conservation is similar to that found in IPR002219 from INTERPRO. C1 domains are protein kinase C-like zinc finger structures. Diacylglycerol (DAG) kinases (DGKs) have a two or three commonly conserved cysteine-rich C1 domains []. DGKs modulate the balance between the two signaling lipids, DAG and phosphatidic acid (PA), by phosphorylating DAG to yield PA []. The PKD (protein kinase D) family are novel DAG receptors. They have twin C1 domains, designated C1a and C1b, which bind DAG or phorbol esters. Individual C1 domains differ in ligand-binding activity and selectivity []. ; GO: 0047134 protein-disulfide reductase activity, 0055114 oxidation-reduction process; PDB: 1V5N_A.
Probab=41.12  E-value=14  Score=25.46  Aligned_cols=27  Identities=33%  Similarity=0.775  Sum_probs=12.2

Q ss_pred             cceecCCcce--eeEEcCCCCcCccccchhh
Q 002995          510 LCSICGVSYG--ACIQCSNTTCRVAYHPLCA  538 (860)
Q Consensus       510 ~C~~C~~~~G--a~iqC~~~~C~~~fH~~CA  538 (860)
                      .|..|+.+..  ....|.  .|.-.+|..||
T Consensus         2 ~C~~C~~~~~~~~~Y~C~--~Cdf~lH~~Ca   30 (30)
T PF07649_consen    2 RCDACGKPIDGGWFYRCS--ECDFDLHEECA   30 (30)
T ss_dssp             --TTTS----S--EEE-T--TT-----HHHH
T ss_pred             cCCcCCCcCCCCceEECc--cCCCccChhcC
Confidence            5888988644  577897  89999999997


No 107
>PF10447 EXOSC1:  Exosome component EXOSC1/CSL4;  InterPro: IPR019495  The exosome mediates degradation of unstable mRNAs that contain AU-rich elements (AREs) within their 3' untranslated regions []. The proteins in this entry are components of the exosome 3'->5' exoribonuclease complex. They do not have exonuclease activity, but are required for the 3'-processing of the 7S pre-RNA to the mature 5.8S rRNA and for mRNA decay [, ].; PDB: 2NN6_I.
Probab=40.97  E-value=17  Score=31.72  Aligned_cols=14  Identities=36%  Similarity=0.741  Sum_probs=10.4

Q ss_pred             CCCCCCCCEEEEEe
Q 002995           79 CQELEPGDIIWAKL   92 (860)
Q Consensus        79 ~~~f~~GdlVwaK~   92 (860)
                      .+-|+|||+|-|||
T Consensus        66 ~~~FrpGDIVrA~V   79 (82)
T PF10447_consen   66 YDCFRPGDIVRARV   79 (82)
T ss_dssp             GGT--SSSEEEEEE
T ss_pred             HhccCCCCEEEEEE
Confidence            45699999999997


No 108
>COG5194 APC11 Component of SCF ubiquitin ligase and anaphase-promoting complex [Posttranslational modification, protein turnover, chaperones / Cell division and chromosome partitioning]
Probab=39.05  E-value=9.6  Score=32.58  Aligned_cols=31  Identities=32%  Similarity=0.866  Sum_probs=25.5

Q ss_pred             cceecCCc-ceeeEEcCC------------CCcCccccchhhhh
Q 002995          510 LCSICGVS-YGACIQCSN------------TTCRVAYHPLCARA  540 (860)
Q Consensus       510 ~C~~C~~~-~Ga~iqC~~------------~~C~~~fH~~CA~~  540 (860)
                      .|.||+.. .|.|++|..            +.|..+||..|-.+
T Consensus        22 ~CaICRnhim~~C~eCq~~~~~~~eC~v~wG~CnHaFH~HCI~r   65 (88)
T COG5194          22 VCAICRNHIMGTCPECQFGMTPGDECPVVWGVCNHAFHDHCIYR   65 (88)
T ss_pred             hhhhhhccccCcCcccccCCCCCCcceEEEEecchHHHHHHHHH
Confidence            69999863 789999987            35999999999854


No 109
>PF08666 SAF:  SAF domain;  InterPro: IPR013974  This entry includes a range of different proteins, such as antifreeze proteins, flagellar FlgA proteins, and CpaB pilus proteins. ; PDB: 1C89_A 3NLA_A 3RDN_A 1C8A_A 3FRN_A 1WVO_A 3K3S_H 3G8R_B 1XUU_A 1XUZ_A ....
Probab=38.98  E-value=18  Score=29.24  Aligned_cols=16  Identities=31%  Similarity=0.376  Sum_probs=11.9

Q ss_pred             EEEEccCCCCCCeEEE
Q 002995          796 IIFAKRDIKQWEELTY  811 (860)
Q Consensus       796 ~i~A~RdI~~GEELt~  811 (860)
                      +++|.|||++|+.|+-
T Consensus         3 vvVA~~di~~G~~i~~   18 (63)
T PF08666_consen    3 VVVAARDIPAGTVITA   18 (63)
T ss_dssp             EEEESSTB-TT-BECT
T ss_pred             EEEEeCccCCCCEEcc
Confidence            4799999999999963


No 110
>PLN00104 MYST -like histone acetyltransferase; Provisional
Probab=38.31  E-value=45  Score=38.41  Aligned_cols=61  Identities=13%  Similarity=0.104  Sum_probs=45.7

Q ss_pred             CCCCCCCCCEEEEEecccCCC-ceEeeCCCCcCccccccccCCCCcEEEEEeCCCCE--EEEeCCCccc
Q 002995           78 DCQELEPGDIIWAKLTGHAMW-PAIVVDESLIGDYKGLNKISGGRSIPVQFFGTHDF--ARINVKQVIS  143 (860)
Q Consensus        78 ~~~~f~~GdlVwaK~~g~PwW-Pa~V~~~~~~~~~~~~~~~~~~~~~~V~FFg~~~~--awv~~~~l~~  143 (860)
                      ....|.+|+.|.|+..+-+-| +|.|+.-....     ....+...|.|.|-|.+.+  -||..++|..
T Consensus        50 ~~~~~~VGekVla~~~~Dg~~~~A~VI~~R~~~-----~~~~~~~~YYVHY~g~nrRlDEWV~~~rLdl  113 (450)
T PLN00104         50 VMLPLEVGTRVMCRWRFDGKYHPVKVIERRRGG-----SGGPNDYEYYVHYTEFNRRLDEWVKLEQLDL  113 (450)
T ss_pred             ccceeccCCEEEEEECCCCCEEEEEEEEEeccC-----CCCCCCceEEEEEecCCccHhhccCHhhccc
Confidence            344699999999999977877 99998854211     0012334699999999988  8999999853


No 111
>PF14446 Prok-RING_1:  Prokaryotic RING finger family 1
Probab=37.76  E-value=30  Score=27.60  Aligned_cols=37  Identities=30%  Similarity=0.745  Sum_probs=31.0

Q ss_pred             CccceecCCc---ceeeEEcCCCCcCccccchhhhhcCeeEE
Q 002995          508 KLLCSICGVS---YGACIQCSNTTCRVAYHPLCARAAGLCVE  546 (860)
Q Consensus       508 ~~~C~~C~~~---~Ga~iqC~~~~C~~~fH~~CA~~~g~~~~  546 (860)
                      ..+|.+|+..   .+-.++|-  .|.+.||-.|...+|-...
T Consensus         5 ~~~C~~Cg~~~~~~dDiVvCp--~CgapyHR~C~~~~g~C~~   44 (54)
T PF14446_consen    5 GCKCPVCGKKFKDGDDIVVCP--ECGAPYHRDCWEKAGGCIN   44 (54)
T ss_pred             CccChhhCCcccCCCCEEECC--CCCCcccHHHHhhCCceEe
Confidence            3479999974   57889997  9999999999999887653


No 112
>PLN02436 cellulose synthase A
Probab=36.78  E-value=21  Score=44.99  Aligned_cols=49  Identities=24%  Similarity=0.643  Sum_probs=38.0

Q ss_pred             CCccccccCCc--ccCCCeeEEccccCceeccccccccccCCCCceeeecccC
Q 002995          396 LDKCSVCHMDE--EYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRP  446 (860)
Q Consensus       396 ~~~C~vC~~~~--~~~~n~ll~C~~C~~~vH~~CYg~~~~~~~~~W~C~~C~~  446 (860)
                      ..+|.||+++-  ..++++.|-|.-|+..|-..||- ..-.+| .=.|+.|+.
T Consensus        36 ~~iCqICGD~Vg~t~dGe~FVACn~C~fpvCr~Cye-yer~eg-~~~Cpqckt   86 (1094)
T PLN02436         36 GQTCQICGDEIELTVDGEPFVACNECAFPVCRPCYE-YERREG-NQACPQCKT   86 (1094)
T ss_pred             CccccccccccCcCCCCCEEEeeccCCCccccchhh-hhhhcC-CccCcccCC
Confidence            35999998752  55678899999999999999994 222333 678999975


No 113
>PF13771 zf-HC5HC2H:  PHD-like zinc-binding domain
Probab=35.21  E-value=34  Score=29.95  Aligned_cols=34  Identities=26%  Similarity=0.346  Sum_probs=27.3

Q ss_pred             CCCccccCCCCCCceecC---CCceeccccccccCce
Q 002995          452 PPPCCLCPVVGGAMKPTT---DGRWAHLACAIWIPET  485 (860)
Q Consensus       452 ~~~C~lC~~~gGalk~t~---~g~wvH~~Ca~~~p~~  485 (860)
                      ...|.+|...+||..+..   .....|+.||+...-.
T Consensus        36 ~~~C~~C~~~~Ga~i~C~~~~C~~~fH~~CA~~~~~~   72 (90)
T PF13771_consen   36 KLKCSICKKKGGACIGCSHPGCSRSFHVPCARKAGCF   72 (90)
T ss_pred             CCCCcCCCCCCCeEEEEeCCCCCcEEChHHHccCCeE
Confidence            468999999988887765   4689999999886543


No 114
>PF12906 RINGv:  RING-variant domain; PDB: 2D8S_A 1VYX_A.
Probab=34.92  E-value=13  Score=28.63  Aligned_cols=30  Identities=33%  Similarity=0.824  Sum_probs=18.0

Q ss_pred             cccccCCcccCCCeeEEccccCc---eecccccc
Q 002995          399 CSVCHMDEEYQNNLFLQCDKCRM---MVHARCYG  429 (860)
Q Consensus       399 C~vC~~~~~~~~n~ll~C~~C~~---~vH~~CYg  429 (860)
                      |-||+.+++.+...+.-| +|.-   .||+.|+-
T Consensus         1 CrIC~~~~~~~~~li~pC-~C~Gs~~~vH~~CL~   33 (47)
T PF12906_consen    1 CRICLEGEEEDEPLISPC-RCKGSMKYVHRSCLE   33 (47)
T ss_dssp             ETTTTEE-SSSS-EE-SS-S-SSCCGSEECCHHH
T ss_pred             CeEeCCcCCCCCceeccc-ccCCCcchhHHHHHH
Confidence            678988766544233344 3555   99999985


No 115
>PLN02400 cellulose synthase
Probab=34.69  E-value=27  Score=44.32  Aligned_cols=50  Identities=26%  Similarity=0.688  Sum_probs=38.2

Q ss_pred             CCCccccccCC--cccCCCeeEEccccCceeccccccccccCCCCceeeecccC
Q 002995          395 DLDKCSVCHMD--EEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRP  446 (860)
Q Consensus       395 ~~~~C~vC~~~--~~~~~n~ll~C~~C~~~vH~~CYg~~~~~~~~~W~C~~C~~  446 (860)
                      ...+|.||+++  -..+.++.|-|..|+..|-.-||- -.-.+| .=.|+.|+.
T Consensus        35 ~gqiCqICGD~VG~t~dGe~FVAC~eCaFPVCRpCYE-YERkeG-nq~CPQCkT   86 (1085)
T PLN02400         35 NGQICQICGDDVGVTETGDVFVACNECAFPVCRPCYE-YERKDG-TQCCPQCKT   86 (1085)
T ss_pred             CCceeeecccccCcCCCCCEEEEEccCCCccccchhh-eecccC-CccCcccCC
Confidence            34599999874  245677899999999999999994 222334 678999975


No 116
>PLN02638 cellulose synthase A (UDP-forming), catalytic subunit
Probab=34.55  E-value=23  Score=44.82  Aligned_cols=50  Identities=26%  Similarity=0.644  Sum_probs=38.0

Q ss_pred             CCCccccccCC--cccCCCeeEEccccCceeccccccccccCCCCceeeecccC
Q 002995          395 DLDKCSVCHMD--EEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRP  446 (860)
Q Consensus       395 ~~~~C~vC~~~--~~~~~n~ll~C~~C~~~vH~~CYg~~~~~~~~~W~C~~C~~  446 (860)
                      ...+|.||+++  -..+.++.|-|..|+..|-..||- ..-.+| .=.|+.|+.
T Consensus        16 ~~qiCqICGD~vg~~~~Ge~FVAC~eC~FPVCrpCYE-YEr~eG-~q~CPqCkt   67 (1079)
T PLN02638         16 GGQVCQICGDNVGKTVDGEPFVACDVCAFPVCRPCYE-YERKDG-NQSCPQCKT   67 (1079)
T ss_pred             CCceeeecccccCcCCCCCEEEEeccCCCccccchhh-hhhhcC-CccCCccCC
Confidence            34599999874  245677899999999999999994 222334 678999975


No 117
>KOG0954 consensus PHD finger protein [General function prediction only]
Probab=34.37  E-value=20  Score=43.00  Aligned_cols=30  Identities=23%  Similarity=0.238  Sum_probs=23.0

Q ss_pred             cceecCCcceeeEEcCCCCcCccccchhhhhc
Q 002995          510 LCSICGVSYGACIQCSNTTCRVAYHPLCARAA  541 (860)
Q Consensus       510 ~C~~C~~~~Ga~iqC~~~~C~~~fH~~CA~~~  541 (860)
                      .|.+|-..+|++-.=..  =..|-|+.||+.-
T Consensus       325 pCvLCPkkGGamK~~~s--gT~wAHvsCALwI  354 (893)
T KOG0954|consen  325 PCVLCPKKGGAMKPTKS--GTKWAHVSCALWI  354 (893)
T ss_pred             CeeeccccCCcccccCC--CCeeeEeeeeecc
Confidence            69999998888876553  3488999999543


No 118
>PF08274 PhnA_Zn_Ribbon:  PhnA Zinc-Ribbon ;  InterPro: IPR013987 The PhnA protein family includes the uncharacterised Escherichia coli protein PhnA and its homologues. The E. coli phnA gene is part of a large operon associated with alkylphosphonate uptake and carbon-phosphorus bond cleavage []. The protein is not related to the characterised phosphonoacetate hydrolase designated PhnA []. This entry represents the N-terminal domain of PhnA, which is predicted to form a zinc-ribbon.; PDB: 2AKL_A.
Probab=34.26  E-value=20  Score=24.96  Aligned_cols=25  Identities=28%  Similarity=0.797  Sum_probs=13.4

Q ss_pred             CccccccCCcccCCCeeEEccccCc
Q 002995          397 DKCSVCHMDEEYQNNLFLQCDKCRM  421 (860)
Q Consensus       397 ~~C~vC~~~~~~~~n~ll~C~~C~~  421 (860)
                      ..|-.|.+.....++.++.|..|+-
T Consensus         3 p~Cp~C~se~~y~D~~~~vCp~C~~   27 (30)
T PF08274_consen    3 PKCPLCGSEYTYEDGELLVCPECGH   27 (30)
T ss_dssp             ---TTT-----EE-SSSEEETTTTE
T ss_pred             CCCCCCCCcceeccCCEEeCCcccc
Confidence            4788999888888888889998864


No 119
>PF12773 DZR:  Double zinc ribbon
Probab=33.50  E-value=32  Score=26.59  Aligned_cols=38  Identities=21%  Similarity=0.458  Sum_probs=23.0

Q ss_pred             eEEccccCceeccccccccccCCCCceeeecccCCCCCCCCCcccc
Q 002995          413 FLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLC  458 (860)
Q Consensus       413 ll~C~~C~~~vH~~CYg~~~~~~~~~W~C~~C~~~~~~~~~~C~lC  458 (860)
                      ..+|..|+..+-        ......++|..|..........|.-|
T Consensus        12 ~~fC~~CG~~l~--------~~~~~~~~C~~Cg~~~~~~~~fC~~C   49 (50)
T PF12773_consen   12 AKFCPHCGTPLP--------PPDQSKKICPNCGAENPPNAKFCPNC   49 (50)
T ss_pred             ccCChhhcCChh--------hccCCCCCCcCCcCCCcCCcCccCcc
Confidence            456777776665        22334678888877655445556555


No 120
>PF13832 zf-HC5HC2H_2:  PHD-zinc-finger like domain
Probab=33.15  E-value=24  Score=32.26  Aligned_cols=29  Identities=28%  Similarity=0.711  Sum_probs=24.4

Q ss_pred             CCccccccCCcccCCCeeEEccc--cCceeccccc
Q 002995          396 LDKCSVCHMDEEYQNNLFLQCDK--CRMMVHARCY  428 (860)
Q Consensus       396 ~~~C~vC~~~~~~~~n~ll~C~~--C~~~vH~~CY  428 (860)
                      ...|.+|.....    ..+.|..  |...+|..|-
T Consensus        55 ~~~C~iC~~~~G----~~i~C~~~~C~~~fH~~CA   85 (110)
T PF13832_consen   55 KLKCSICGKSGG----ACIKCSHPGCSTAFHPTCA   85 (110)
T ss_pred             CCcCcCCCCCCc----eeEEcCCCCCCcCCCHHHH
Confidence            469999998633    3899997  9999999996


No 121
>smart00547 ZnF_RBZ Zinc finger domain. Zinc finger domain in Ran-binding proteins (RanBPs), and other proteins. In RanBPs, this domain binds RanGDP.
Probab=32.64  E-value=25  Score=23.12  Aligned_cols=23  Identities=22%  Similarity=0.502  Sum_probs=18.3

Q ss_pred             ceeeecccCCCCCCCCCccccCC
Q 002995          438 LWLCNLCRPGAPEPPPPCCLCPV  460 (860)
Q Consensus       438 ~W~C~~C~~~~~~~~~~C~lC~~  460 (860)
                      .|.|..|..........|..|..
T Consensus         2 ~W~C~~C~~~N~~~~~~C~~C~~   24 (26)
T smart00547        2 DWECPACTFLNFASRSKCFACGA   24 (26)
T ss_pred             cccCCCCCCcChhhhccccccCC
Confidence            79999998766666678888864


No 122
>cd04370 BAH BAH, or Bromo Adjacent Homology domain (also called ELM1 and BAM for Bromo Adjacent Motif). BAH domains have first been described as domains found in the polybromo protein and Yeast Rsc1/Rsc2 (Remodeling of the Structure of Chromatin). They also occur in mammalian DNA methyltransferases and the MTA1 subunits of histone deacetylase complexes. A BAH domain is also found in Yeast Sir3p and in the origin receptor complex protein 1 (Orc1p), where it was found to interact with the N-terminal lobe of the silence information regulator 1 protein (Sir1p), confirming the initial hypothesis that BAH plays a role in protein-protein interactions.
Probab=31.23  E-value=53  Score=30.07  Aligned_cols=41  Identities=20%  Similarity=0.353  Sum_probs=29.4

Q ss_pred             CCCCCCCEEEEEecc----cCCCceEeeCCCCcCccccccccCCCCcEEEEEe
Q 002995           80 QELEPGDIIWAKLTG----HAMWPAIVVDESLIGDYKGLNKISGGRSIPVQFF  128 (860)
Q Consensus        80 ~~f~~GdlVwaK~~g----~PwWPa~V~~~~~~~~~~~~~~~~~~~~~~V~FF  128 (860)
                      ..|++||.|..+-..    -|.|.|+|..--.  .      ..+...+.|+||
T Consensus         2 ~~y~vgd~V~v~~~~~~~~~~~~i~~I~~i~~--~------~~~~~~~~v~wf   46 (123)
T cd04370           2 ITYEVGDSVYVEPDDSIKSDPPYIARIEELWE--D------TNGSKQVKVRWF   46 (123)
T ss_pred             CEEecCCEEEEecCCcCCCCCCEEEEEeeeeE--C------CCCCEEEEEEEE
Confidence            468899999999766    6899999965321  1      123346788888


No 123
>PLN02915 cellulose synthase A [UDP-forming], catalytic subunit
Probab=29.90  E-value=32  Score=43.48  Aligned_cols=49  Identities=27%  Similarity=0.619  Sum_probs=37.5

Q ss_pred             CCccccccCC--cccCCCeeEEccccCceeccccccccccCCCCceeeecccC
Q 002995          396 LDKCSVCHMD--EEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRP  446 (860)
Q Consensus       396 ~~~C~vC~~~--~~~~~n~ll~C~~C~~~vH~~CYg~~~~~~~~~W~C~~C~~  446 (860)
                      ..+|.||+++  -..+.++.|-|.-|+..|-..||- -...+| .=.|+.|+.
T Consensus        15 ~~~c~iCGd~vg~~~~Ge~FVAC~eC~fpvCr~cye-ye~~~g-~~~cp~c~t   65 (1044)
T PLN02915         15 AKTCRVCGDEVGVKEDGQPFVACHVCGFPVCKPCYE-YERSEG-NQCCPQCNT   65 (1044)
T ss_pred             cchhhccccccCcCCCCCEEEEeccCCCccccchhh-hhhhcC-CccCCccCC
Confidence            3699999874  245667899999999999999994 222334 678888874


No 124
>KOG0383 consensus Predicted helicase [General function prediction only]
Probab=29.06  E-value=29  Score=42.26  Aligned_cols=31  Identities=29%  Similarity=0.674  Sum_probs=26.0

Q ss_pred             ccceecCCcceeeEEcCCCCcCccccchhhhhcC
Q 002995          509 LLCSICGVSYGACIQCSNTTCRVAYHPLCARAAG  542 (860)
Q Consensus       509 ~~C~~C~~~~Ga~iqC~~~~C~~~fH~~CA~~~g  542 (860)
                      ..|.+|.. +|-.|-|.  .|..+||..|.-.-+
T Consensus        48 e~c~ic~~-~g~~l~c~--tC~~s~h~~cl~~pl   78 (696)
T KOG0383|consen   48 EACRICAD-GGELLWCD--TCPASFHASCLGPPL   78 (696)
T ss_pred             hhhhhhcC-CCcEEEec--cccHHHHHHccCCCC
Confidence            47999998 78888898  999999999984443


No 125
>KOG3409 consensus Exosomal 3'-5' exoribonuclease complex, subunit ski4 (Csl4) [Translation, ribosomal structure and biogenesis]
Probab=28.78  E-value=74  Score=31.58  Aligned_cols=16  Identities=38%  Similarity=0.684  Sum_probs=13.1

Q ss_pred             CCCCCCCCCEEEEEec
Q 002995           78 DCQELEPGDIIWAKLT   93 (860)
Q Consensus        78 ~~~~f~~GdlVwaK~~   93 (860)
                      -...|+|||+|.|||=
T Consensus       119 v~ksFrPgDiVlAkVi  134 (193)
T KOG3409|consen  119 VYKSFRPGDIVLAKVI  134 (193)
T ss_pred             hhhccCCCcEEEEEEe
Confidence            3557999999999973


No 126
>PF07227 DUF1423:  Protein of unknown function (DUF1423);  InterPro: IPR004082 A total of 715 potential protein-coding genes have been identified in the nucleotide sequence of Arabidopsis thaliana chromosome 5, with an average gene density of 1 gene per 4001 bp []. Amongst the gene products is a well-conserved family of 130.7kDa proteins that share no sequence similarity with any other known proteins, other than in plants. The sequences are characterised by an N-terminal domain of variable length, a central cysteine-rich region and a relatively acidic C-terminal domain. The sequences may possess a PHD finger.
Probab=28.64  E-value=45  Score=38.14  Aligned_cols=31  Identities=23%  Similarity=0.639  Sum_probs=23.0

Q ss_pred             ccccccCCc-ccCCCeeEEccccCceeccccc
Q 002995          398 KCSVCHMDE-EYQNNLFLQCDKCRMMVHARCY  428 (860)
Q Consensus       398 ~C~vC~~~~-~~~~n~ll~C~~C~~~vH~~CY  428 (860)
                      -|.||...+ ..+.-.-|.||-|+...|..|-
T Consensus       130 ~C~iC~kfD~~~n~~~Wi~Cd~CgH~cH~dCA  161 (446)
T PF07227_consen  130 MCCICSKFDDNKNTCSWIGCDVCGHWCHLDCA  161 (446)
T ss_pred             CccccCCcccCCCCeeEEeccCCCceehhhhh
Confidence            577887643 2334457899999999999994


No 127
>PF12861 zf-Apc11:  Anaphase-promoting complex subunit 11 RING-H2 finger
Probab=27.80  E-value=25  Score=30.85  Aligned_cols=34  Identities=21%  Similarity=0.447  Sum_probs=23.7

Q ss_pred             CCCccccccCCcc---------cCCCeeEEccccCceecccccc
Q 002995          395 DLDKCSVCHMDEE---------YQNNLFLQCDKCRMMVHARCYG  429 (860)
Q Consensus       395 ~~~~C~vC~~~~~---------~~~n~ll~C~~C~~~vH~~CYg  429 (860)
                      +++.|.||+...+         .++.+|+.+. |+-.||..|.-
T Consensus        20 ~dd~CgICr~~fdg~Cp~Ck~Pgd~Cplv~g~-C~H~FH~hCI~   62 (85)
T PF12861_consen   20 NDDVCGICRMPFDGCCPDCKFPGDDCPLVWGK-CSHNFHMHCIL   62 (85)
T ss_pred             CCCceeeEecccccCCCCccCCCCCCceeecc-CccHHHHHHHH
Confidence            4678888876443         2234566555 99999999964


No 128
>PLN02195 cellulose synthase A
Probab=27.71  E-value=36  Score=42.69  Aligned_cols=48  Identities=21%  Similarity=0.549  Sum_probs=36.9

Q ss_pred             CccccccCC--cccCCCeeEEccccCceeccccccccccCCCCceeeecccC
Q 002995          397 DKCSVCHMD--EEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRP  446 (860)
Q Consensus       397 ~~C~vC~~~--~~~~~n~ll~C~~C~~~vH~~CYg~~~~~~~~~W~C~~C~~  446 (860)
                      ..|.||++.  -..+.++.|-|..|+..|-..||- ..-.+| .=-|+.|+.
T Consensus         7 ~~c~~cgd~~~~~~~g~~fvaC~eC~~pvCrpCye-yer~eg-~q~CpqCkt   56 (977)
T PLN02195          7 PICATCGEEVGVDSNGEAFVACHECSYPLCKACLE-YEIKEG-RKVCLRCGG   56 (977)
T ss_pred             ccceecccccCcCCCCCeEEEeccCCCccccchhh-hhhhcC-CccCCccCC
Confidence            589999873  345667899999999999999994 222334 678888864


No 129
>KOG3038 consensus Histone acetyltransferase SAGA associated factor SGF29 [General function prediction only]
Probab=27.69  E-value=2.9e+02  Score=29.35  Aligned_cols=43  Identities=28%  Similarity=0.461  Sum_probs=31.5

Q ss_pred             CCCCCCCEEEEEecccC-CCceEeeCCCCcCccccccccCCCCcEEEEEeCCCC
Q 002995           80 QELEPGDIIWAKLTGHA-MWPAIVVDESLIGDYKGLNKISGGRSIPVQFFGTHD  132 (860)
Q Consensus        80 ~~f~~GdlVwaK~~g~P-wWPa~V~~~~~~~~~~~~~~~~~~~~~~V~FFg~~~  132 (860)
                      .-|.+|.+|+|...|-- ..+|+|..++.          .+...|.|.||.+..
T Consensus       197 ~~fpp~~~VLA~YP~TTcFY~aiVh~tp~----------d~s~~y~vlffD~~e  240 (264)
T KOG3038|consen  197 ALFPPGTIVLAVYPGTTCFYKAIVHSTPR----------DGSCDYYVLFFDDEE  240 (264)
T ss_pred             cCCCCCCEEEEEcCCcceeeeeEeecCCC----------CCCCcceeeeecCcc
Confidence            46999999999975543 77899987532          334568999996643


No 130
>PF04810 zf-Sec23_Sec24:  Sec23/Sec24 zinc finger;  InterPro: IPR006895 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  COPII (coat protein complex II)-coated vesicles carry proteins from the endoplasmic reticulum (ER) to the Golgi complex []. COPII-coated vesicles form on the ER by the stepwise recruitment of three cytosolic components: Sar1-GTP to initiate coat formation, Sec23/24 heterodimer to select SNARE and cargo molecules, and Sec13/31 to induce coat polymerisation and membrane deformation [].  Sec23 p and Sec24p are structurally related, folding into five distinct domains: a beta-barrel, a zinc-finger, an alpha/beta trunk domain (IPR006896 from INTERPRO), an all-helical region (IPR006900 from INTERPRO), and a C-terminal gelsolin-like domain (IPR007123 from INTERPRO). This entry describes an approximately 55-residue Sec23/24 zinc-binding domain, which lies against the beta-barrel at the periphery of the complex. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0006886 intracellular protein transport, 0006888 ER to Golgi vesicle-mediated transport, 0030127 COPII vesicle coat; PDB: 3EFO_B 3EG9_B 3EGD_A 2YRC_A 2NUP_A 2YRD_A 3EGX_A 2NUT_A 3EH1_A 1PD0_A ....
Probab=27.41  E-value=33  Score=25.48  Aligned_cols=31  Identities=29%  Similarity=0.860  Sum_probs=17.2

Q ss_pred             eEEccccCceeccccccccccCCCCceeeecccC
Q 002995          413 FLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRP  446 (860)
Q Consensus       413 ll~C~~C~~~vH~~CYg~~~~~~~~~W~C~~C~~  446 (860)
                      ++.|.+|+.-+-..|-   ....+..|.|..|..
T Consensus         2 p~rC~~C~aylNp~~~---~~~~~~~w~C~~C~~   32 (40)
T PF04810_consen    2 PVRCRRCRAYLNPFCQ---FDDGGKTWICNFCGT   32 (40)
T ss_dssp             S-B-TTT--BS-TTSE---EETTTTEEEETTT--
T ss_pred             ccccCCCCCEECCcce---EcCCCCEEECcCCCC
Confidence            5678888887777763   223456899999975


No 131
>smart00744 RINGv The RING-variant domain is a C4HC3 zinc-finger like motif found in a number of cellular and viral proteins. Some of these proteins have been shown both in vivo and in vitro to have ubiquitin E3 ligase activity. The RING-variant domain is reminiscent of both the RING and the PHD domains and may represent an evolutionary intermediate. To describe this domain the term PHD/LAP domain has been used in the past. Extended description: The RING-variant (RINGv) domain contains a C4HC3 zinc-finger-like motif similar to the PHD domain, while some of the spacing between the Cys/His residues follow a pattern somewhat closer to that found in the RING domain. The RINGv domain, similar to the RING, PHD and LIM domains, is thought to bind two zinc ions co-ordinated by the highly conserved Cys and His residues. RING variant domain: C-x (2) -C-x(10-45)-C-x (1) -C-x (7) -H-x(2)-C-x(11-25)-C-x(2)-C As opposed to a PHD: C-x(1-2) -C-x (7-13)-C-x(2-4)-C-x(4-5)-H-x(2)-C-x(10-21)-C-x(2)-C Class
Probab=26.89  E-value=24  Score=27.50  Aligned_cols=31  Identities=23%  Similarity=0.685  Sum_probs=18.9

Q ss_pred             ccccccCCcccCCCeeE-Ecc--ccCceecccccc
Q 002995          398 KCSVCHMDEEYQNNLFL-QCD--KCRMMVHARCYG  429 (860)
Q Consensus       398 ~C~vC~~~~~~~~n~ll-~C~--~C~~~vH~~CYg  429 (860)
                      .|-||++.++ ++++++ -|.  +-...||+.|.-
T Consensus         1 ~CrIC~~~~~-~~~~l~~PC~C~G~~~~vH~~Cl~   34 (49)
T smart00744        1 ICRICHDEGD-EGDPLVSPCRCKGSLKYVHQECLE   34 (49)
T ss_pred             CccCCCCCCC-CCCeeEeccccCCchhHHHHHHHH
Confidence            4889998333 334443 333  223679999975


No 132
>PF10497 zf-4CXXC_R1:  Zinc-finger domain of monoamine-oxidase A repressor R1;  InterPro: IPR018866  R1 is a transcription factor repressor that inhibits monoamine oxidase A gene expression. This domain is a four-CXXC zinc finger putative DNA-binding domain found at the C-terminal end of R1. The domain carries 12 cysteines of which four pairs are of the CXXC type []. 
Probab=26.78  E-value=47  Score=30.43  Aligned_cols=61  Identities=26%  Similarity=0.510  Sum_probs=38.0

Q ss_pred             CCccccccCCcccCCCeeEEc------cccCceecccccc-------c--cccCCCCceeeecccCCCCCCCCCccccCC
Q 002995          396 LDKCSVCHMDEEYQNNLFLQC------DKCRMMVHARCYG-------E--LEPVNGVLWLCNLCRPGAPEPPPPCCLCPV  460 (860)
Q Consensus       396 ~~~C~vC~~~~~~~~n~ll~C------~~C~~~vH~~CYg-------~--~~~~~~~~W~C~~C~~~~~~~~~~C~lC~~  460 (860)
                      ...|-.|.....   +....|      ..|...-=+.|++       .  ..+.....|.|+.|+..     -.|..|..
T Consensus         7 g~~CHqCrqKt~---~~~~~C~~~~~~~~C~~~~~~fC~~CL~~ryge~~~ev~~~~~W~CP~Crgi-----CnCs~Crr   78 (105)
T PF10497_consen    7 GKTCHQCRQKTL---DFKTICTGHWKNSSCRGCRGKFCGGCLRNRYGENVEEVLEDPNWKCPKCRGI-----CNCSFCRR   78 (105)
T ss_pred             CCCchhhcCCCC---CCceEcCCCCCCCCCccCcceehHhHHHHHHhhhHHHHhcCCceECCCCCCe-----eCCHhhhc
Confidence            357888887543   234678      7783333345654       2  12334458999999963     36889977


Q ss_pred             CCCC
Q 002995          461 VGGA  464 (860)
Q Consensus       461 ~gGa  464 (860)
                      +.|.
T Consensus        79 k~g~   82 (105)
T PF10497_consen   79 KRGW   82 (105)
T ss_pred             cCCC
Confidence            7653


No 133
>PF11302 DUF3104:  Protein of unknown function (DUF3104);  InterPro: IPR021453  This family of proteins with unknown function appears to be restricted to Cyanobacteria. 
Probab=26.73  E-value=1.6e+02  Score=25.17  Aligned_cols=59  Identities=17%  Similarity=0.198  Sum_probs=43.0

Q ss_pred             CCCCCCEEEEEecc-------cCCCceEeeCCCCcCccccccccCCCCcEEEEEeCCCCEEEEeCCCcccc
Q 002995           81 ELEPGDIIWAKLTG-------HAMWPAIVVDESLIGDYKGLNKISGGRSIPVQFFGTHDFARINVKQVISF  144 (860)
Q Consensus        81 ~f~~GdlVwaK~~g-------~PwWPa~V~~~~~~~~~~~~~~~~~~~~~~V~FFg~~~~awv~~~~l~~f  144 (860)
                      ..++||.|..+-..       --||=|.|+.-..     +.+.+.....+.|.=-.+...-||+.+.+...
T Consensus         5 ~Vk~Gd~ViV~~~~~~~~~~~~dWWmg~Vi~~~g-----gaR~P~~~tlFQVadVDtG~I~wVnaD~Vt~I   70 (75)
T PF11302_consen    5 SVKPGDTVIVQDEQEVGQKQDKDWWMGQVIHCEG-----GARDPKVPTLFQVADVDTGVIRWVNADEVTHI   70 (75)
T ss_pred             ccCCCCEEEEecCccccccCCCCcEEEEEEEEec-----cccCCCCCceEEEEEccCCeEEEEEchheeee
Confidence            46899999998766       7899999987532     22333334446788888999999998766543


No 134
>cd04721 BAH_plant_1 BAH, or Bromo Adjacent Homology domain, plant-specific sub-family with unknown function. BAH domains are found in a variety of proteins playing roles in transcriptional silencing and the remodeling of chromatin. It is assumed that in most or all of these instances the BAH domain mediates protein-protein interactions.
Probab=26.47  E-value=65  Score=30.71  Aligned_cols=53  Identities=19%  Similarity=0.266  Sum_probs=35.7

Q ss_pred             CCCCCCCCCEEEEEecccCCCceEeeCCCCcCccccccccCCCCcEEEEEeCCCC--EEEEeC
Q 002995           78 DCQELEPGDIIWAKLTGHAMWPAIVVDESLIGDYKGLNKISGGRSIPVQFFGTHD--FARINV  138 (860)
Q Consensus        78 ~~~~f~~GdlVwaK~~g~PwWPa~V~~~~~~~~~~~~~~~~~~~~~~V~FFg~~~--~awv~~  138 (860)
                      ++..|++||.|..+-.+-..|=|+|.+--  ..      ..+...+.|+||-..+  .+..++
T Consensus         4 ~~~~i~vGD~V~v~~~~~~~~va~Ie~i~--ed------~~g~~~v~v~WF~~p~E~~~~~~~   58 (130)
T cd04721           4 NGVTISVHDFVYVLSEEEDRYVAYIEDLY--ED------KKGSKMVKVRWFHTTDEVGAALSP   58 (130)
T ss_pred             CCEEEECCCEEEEeCCCCCcEEEEEEEEE--Ec------CCCCEEEEEEEecCHHHhccccCC
Confidence            34568999999999877677999997631  11      1233468899997654  344443


No 135
>KOG0696 consensus Serine/threonine protein kinase [Signal transduction mechanisms]
Probab=26.33  E-value=19  Score=40.61  Aligned_cols=33  Identities=30%  Similarity=0.444  Sum_probs=23.7

Q ss_pred             CccccccCCcccCCCeeEEccccCceecccccc
Q 002995          397 DKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYG  429 (860)
Q Consensus       397 ~~C~vC~~~~~~~~n~ll~C~~C~~~vH~~CYg  429 (860)
                      ..|+-|-.---.=.-+=.+|.-|...||+.|.-
T Consensus        57 TfCsHCkDFiwGfgKQGfQCqvC~fvvHkrChe   89 (683)
T KOG0696|consen   57 TFCSHCKDFIWGFGKQGFQCQVCCFVVHKRCHE   89 (683)
T ss_pred             chhhhhhhheeccccCceeeeEEeehhhhhhcc
Confidence            689999643211223357999999999999974


No 136
>PF07282 OrfB_Zn_ribbon:  Putative transposase DNA-binding domain;  InterPro: IPR010095 This entry represents a region of a sequence similarity between a family of putative transposases of Thermoanaerobacter tengcongensis, smaller related proteins from Bacillus anthracis, putative transposes described by IPR001959 from INTERPRO, and other proteins. More information about these proteins can be found at Protein of the Month: Transposase [].
Probab=26.14  E-value=60  Score=26.89  Aligned_cols=35  Identities=11%  Similarity=0.400  Sum_probs=26.6

Q ss_pred             CCCccccccCCccc-CCCeeEEccccCceecccccc
Q 002995          395 DLDKCSVCHMDEEY-QNNLFLQCDKCRMMVHARCYG  429 (860)
Q Consensus       395 ~~~~C~vC~~~~~~-~~n~ll~C~~C~~~vH~~CYg  429 (860)
                      ....|..|+..... ..+....|..|+...|..--+
T Consensus        27 TSq~C~~CG~~~~~~~~~r~~~C~~Cg~~~~rD~na   62 (69)
T PF07282_consen   27 TSQTCPRCGHRNKKRRSGRVFTCPNCGFEMDRDVNA   62 (69)
T ss_pred             CccCccCcccccccccccceEEcCCCCCEECcHHHH
Confidence            44689999986544 566789999999999876443


No 137
>PLN02189 cellulose synthase
Probab=25.99  E-value=41  Score=42.55  Aligned_cols=49  Identities=27%  Similarity=0.679  Sum_probs=37.5

Q ss_pred             CCccccccCCc--ccCCCeeEEccccCceeccccccccccCCCCceeeecccC
Q 002995          396 LDKCSVCHMDE--EYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRP  446 (860)
Q Consensus       396 ~~~C~vC~~~~--~~~~n~ll~C~~C~~~vH~~CYg~~~~~~~~~W~C~~C~~  446 (860)
                      ...|.||++.-  ..+++..+-|.-|+..|-..||- ..-.+| .=.|+.|+.
T Consensus        34 ~~~C~iCgd~vg~~~~g~~fvaC~~C~fpvCr~Cye-yer~eg-~q~CpqCkt   84 (1040)
T PLN02189         34 GQVCEICGDEIGLTVDGDLFVACNECGFPVCRPCYE-YERREG-TQNCPQCKT   84 (1040)
T ss_pred             CccccccccccCcCCCCCEEEeeccCCCccccchhh-hhhhcC-CccCcccCC
Confidence            35999999753  35667899999999999999994 222334 678999975


No 138
>PF00641 zf-RanBP:  Zn-finger in Ran binding protein and others;  InterPro: IPR001876 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  This entry represents the zinc finger domain found in RanBP2 proteins. Ran is an evolutionary conserved member of the Ras superfamily that regulates all receptor-mediated transport between the nucleus and the cytoplasm. Ran binding protein 2 (RanBP2) is a 358kDa nucleoporin located on the cytoplasmic side of the nuclear pore complex which plays a role in nuclear protein import []. RanBP2 contains multiple zinc fingers which mediate binding to RanGDP []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 2D9G_A 2EBR_A 2WX0_C 2WX1_C 2WWZ_C 3GJ6_B 2LK0_A 2LK1_A 3GJ5_B 3GJ8_B ....
Probab=25.60  E-value=22  Score=24.40  Aligned_cols=22  Identities=27%  Similarity=0.718  Sum_probs=16.7

Q ss_pred             ceeeecccCCCCCCCCCccccC
Q 002995          438 LWLCNLCRPGAPEPPPPCCLCP  459 (860)
Q Consensus       438 ~W~C~~C~~~~~~~~~~C~lC~  459 (860)
                      .|.|..|..........|..|.
T Consensus         4 ~W~C~~C~~~N~~~~~~C~~C~   25 (30)
T PF00641_consen    4 DWKCPSCTFMNPASRSKCVACG   25 (30)
T ss_dssp             SEEETTTTEEEESSSSB-TTT-
T ss_pred             CccCCCCcCCchHHhhhhhCcC
Confidence            8999999986666667899885


No 139
>KOG2807 consensus RNA polymerase II transcription initiation/nucleotide excision repair factor TFIIH, subunit SSL1 [Transcription; Replication, recombination and repair]
Probab=25.24  E-value=48  Score=36.11  Aligned_cols=84  Identities=15%  Similarity=0.501  Sum_probs=51.0

Q ss_pred             CCCCceeeecccCCCCCCCCCccccCCCCCCceecC--CCceeccccccccCceEEccCCCccccccchhhhccccC--c
Q 002995          434 VNGVLWLCNLCRPGAPEPPPPCCLCPVVGGAMKPTT--DGRWAHLACAIWIPETCLTDVKRMEPIDGLNRVSKDRWK--L  509 (860)
Q Consensus       434 ~~~~~W~C~~C~~~~~~~~~~C~lC~~~gGalk~t~--~g~wvH~~Ca~~~p~~~~~~~~~~~~v~~i~~i~~~~~~--~  509 (860)
                      +.+..++|+.|+...-.....|..|...   |.-+.  ...+-|                 +-|+.....++.....  .
T Consensus       272 ~~~~Gy~CP~CkakvCsLP~eCpiC~lt---LVss~hLARSyhh-----------------L~PL~~F~Eip~~~~~~~~  331 (378)
T KOG2807|consen  272 LSGGGYFCPQCKAKVCSLPIECPICSLT---LVSSPHLARSYHH-----------------LFPLKPFVEIPETEYNGSR  331 (378)
T ss_pred             cccCceeCCcccCeeecCCccCCcccee---EecchHHHHHHHh-----------------hcCCcchhhccccccCCCc
Confidence            4455899999997655556788888643   22111  011222                 2244445555555433  2


Q ss_pred             cceecCC--cceeeEEcCCCCcCccccchhhh
Q 002995          510 LCSICGV--SYGACIQCSNTTCRVAYHPLCAR  539 (860)
Q Consensus       510 ~C~~C~~--~~Ga~iqC~~~~C~~~fH~~CA~  539 (860)
                      .|..|+.  ..+...+|.  .|...|=.-|-.
T Consensus       332 ~Cf~C~~~~~~~~~y~C~--~Ck~~FCldCDv  361 (378)
T KOG2807|consen  332 FCFACQGELLSSGRYRCE--SCKNVFCLDCDV  361 (378)
T ss_pred             ceeeeccccCCCCcEEch--hccceeeccchH
Confidence            4999943  356778997  898888877763


No 140
>PF09038 53-BP1_Tudor:  Tumour suppressor p53-binding protein-1 Tudor;  InterPro: IPR015125 This domain consist of ten beta-strands and a carboxy-terminal alpha-helix. The amino-terminal five beta-strands and the C-terminal five beta-strands adopt folds that are identical to each other. The domain is essential for the recruitment of proteins to double stranded breaks in DNA, which is mediated by interaction with methylated Lys 79 of histone H3 []. ; PDB: 3LGL_A 1XNI_B 3LGF_A 2G3R_A 2IG0_A 3LH0_A 1SSF_A.
Probab=25.20  E-value=62  Score=30.28  Aligned_cols=28  Identities=29%  Similarity=0.436  Sum_probs=22.7

Q ss_pred             eeeeeeeecCCCCceEEEecCCCcccccc
Q 002995            5 YSGFVVGYDSESNRHHVKYVDGDEEDLIL   33 (860)
Q Consensus         5 ~~g~v~~~~~~~~~h~~~y~d~d~e~~~l   33 (860)
                      |+|.|+. |....+..|..+||.+-.+..
T Consensus        20 Y~G~I~~-~~~~~kykv~FdDG~~~~v~~   47 (122)
T PF09038_consen   20 YPGKITS-DKGKNKYKVLFDDGYECRVLG   47 (122)
T ss_dssp             EEEEEEE-EETTTEEEEEETTS-EEEEEC
T ss_pred             cCceEee-cCCCCeEEEEecCCccceecc
Confidence            7999999 578889999999999865543


No 141
>cd05791 S1_CSL4 S1_CSL4: CSL4, S1-like RNA-binding domain. S1-like RNA-binding domains are found in a wide variety of RNA-associated proteins. ScCSL4 protein is a subunit of the exosome complex. The exosome plays a central role in 3' to 5' RNA processing and degradation in eukarytes and archaea. Its functions include the removal of incorrectly processed RNA and the maintenance of proper levels of mRNA, rRNA and a number of small RNA species. In S. cerevisiae, the exosome includes nine core components, six of which are homologous to bacterial RNase PH. These form a hexameric ring structure. The other three subunits (RrP4, Rrp40, and Csl4) contain an S1 RNA binding domain and are part of the "S1 pore structure".
Probab=25.14  E-value=46  Score=29.58  Aligned_cols=17  Identities=35%  Similarity=0.610  Sum_probs=14.3

Q ss_pred             CCCCCCCEEEEEecccC
Q 002995           80 QELEPGDIIWAKLTGHA   96 (860)
Q Consensus        80 ~~f~~GdlVwaK~~g~P   96 (860)
                      .-|+|||+|-||+-++-
T Consensus        59 ~~f~~GDiV~AkVis~~   75 (92)
T cd05791          59 KCFRPGDIVRAKVISLG   75 (92)
T ss_pred             hhcCCCCEEEEEEEEcC
Confidence            45999999999997664


No 142
>PF11793 FANCL_C:  FANCL C-terminal domain; PDB: 3K1L_A.
Probab=24.57  E-value=34  Score=28.80  Aligned_cols=33  Identities=33%  Similarity=0.699  Sum_probs=13.5

Q ss_pred             ccceecCCc---ce--eeEEcCCCCcCccccchhhhhc
Q 002995          509 LLCSICGVS---YG--ACIQCSNTTCRVAYHPLCARAA  541 (860)
Q Consensus       509 ~~C~~C~~~---~G--a~iqC~~~~C~~~fH~~CA~~~  541 (860)
                      ..|.||...   .+  .-+.|....|...||..|-...
T Consensus         3 ~~C~IC~~~~~~~~~~p~~~C~n~~C~~~fH~~CL~~w   40 (70)
T PF11793_consen    3 LECGICYSYRLDDGEIPDVVCPNPSCGKKFHLLCLSEW   40 (70)
T ss_dssp             -S-SSS--SS-TT-----B--S-TT----B-SGGGHHH
T ss_pred             CCCCcCCcEecCCCCcCceEcCCcccCCHHHHHHHHHH
Confidence            468889752   22  3579999999999999997543


No 143
>PF08792 A2L_zn_ribbon:  A2L zinc ribbon domain;  InterPro: IPR014900 This zinc ribbon protein is found associated with some viral A2L transcription factors []. 
Probab=23.69  E-value=69  Score=22.83  Aligned_cols=27  Identities=26%  Similarity=0.463  Sum_probs=19.0

Q ss_pred             CCccccccCCccc-CCCeeEEccccCce
Q 002995          396 LDKCSVCHMDEEY-QNNLFLQCDKCRMM  422 (860)
Q Consensus       396 ~~~C~vC~~~~~~-~~n~ll~C~~C~~~  422 (860)
                      +..|..|...... .++-.++|-.|+..
T Consensus         3 ~~~C~~C~~~~i~~~~~~~~~C~~Cg~~   30 (33)
T PF08792_consen    3 LKKCSKCGGNGIVNKEDDYEVCIFCGSS   30 (33)
T ss_pred             ceEcCCCCCCeEEEecCCeEEcccCCcE
Confidence            3578889876654 55557788888754


No 144
>KOG2752 consensus Uncharacterized conserved protein, contains N-recognin-type Zn-finger [General function prediction only]
Probab=23.03  E-value=89  Score=33.99  Aligned_cols=112  Identities=16%  Similarity=0.288  Sum_probs=60.7

Q ss_pred             ccccccCCcccCCCeeEEccccCceeccccccccccCCCCceeeecccCCCCCCCCCccccCCCCCCceecCCCceeccc
Q 002995          398 KCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPGAPEPPPPCCLCPVVGGAMKPTTDGRWAHLA  477 (860)
Q Consensus       398 ~C~vC~~~~~~~~n~ll~C~~C~~~vH~~CYg~~~~~~~~~W~C~~C~~~~~~~~~~C~lC~~~gGalk~t~~g~wvH~~  477 (860)
                      .|--|--...    ...+|-.|.+..|..=-++...... .|.|+.|.....    .|.+=+...+.-+ -+.+.|.|-+
T Consensus        57 sClTC~P~~~----~agvC~~C~~~CH~~H~lveL~tKR-~FrCDCg~sk~g----~~sc~l~~~~~~~-n~~N~YNhNf  126 (345)
T KOG2752|consen   57 SCLTCTPAPE----MAGVCYACSLSCHDGHELVELYTKR-NFRCDCGNSKFG----RCSCNLLEDKDAE-NSENLYNHNF  126 (345)
T ss_pred             EeecccCChh----hceeEEEeeeeecCCceeeeccccC-Cccccccccccc----ccccccccccccc-cchhhhhhhh
Confidence            4555643221    3678999999999876555444444 899998875332    2222222222222 3344566633


Q ss_pred             cccccCceEEccCCCccccccchhhhccccCccceecCCcceeeEEcCCCCcCcccc-chhhhhcCe
Q 002995          478 CAIWIPETCLTDVKRMEPIDGLNRVSKDRWKLLCSICGVSYGACIQCSNTTCRVAYH-PLCARAAGL  543 (860)
Q Consensus       478 Ca~~~p~~~~~~~~~~~~v~~i~~i~~~~~~~~C~~C~~~~Ga~iqC~~~~C~~~fH-~~CA~~~g~  543 (860)
                      =.++.    .-+.....|+..                  -.|.++||.  -|..||| -.|-....+
T Consensus       127 qG~~C----~Cd~~Ypdp~~~------------------~e~~m~QC~--iCEDWFHce~c~~~~~~  169 (345)
T KOG2752|consen  127 QGLFC----KCDTPYPDPVRT------------------EEGEMLQCV--ICEDWFHCEGCMQAKTF  169 (345)
T ss_pred             cceeE----EecCCCCCcccc------------------ccceeeeEE--eccchhcccccCcccch
Confidence            22221    011111112211                  268899998  8999999 666655544


No 145
>KOG0782 consensus Predicted diacylglycerol kinase [Signal transduction mechanisms]
Probab=22.84  E-value=23  Score=40.92  Aligned_cols=20  Identities=35%  Similarity=0.913  Sum_probs=16.2

Q ss_pred             eEEccccCceeccccccccc
Q 002995          413 FLQCDKCRMMVHARCYGELE  432 (860)
Q Consensus       413 ll~C~~C~~~vH~~CYg~~~  432 (860)
                      --.|.-|.+.||..|.....
T Consensus       199 rRkCAaCkIVvHT~CieqLe  218 (1004)
T KOG0782|consen  199 RRKCAACKIVVHTNCIEQLE  218 (1004)
T ss_pred             hccceeeeEEEechHHHHHH
Confidence            34799999999999986443


No 146
>KOG0695 consensus Serine/threonine protein kinase [Signal transduction mechanisms]
Probab=22.68  E-value=32  Score=37.65  Aligned_cols=35  Identities=23%  Similarity=0.556  Sum_probs=25.8

Q ss_pred             CccccccCCcccCCCeeEEccccCceecccccccc
Q 002995          397 DKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGEL  431 (860)
Q Consensus       397 ~~C~vC~~~~~~~~n~ll~C~~C~~~vH~~CYg~~  431 (860)
                      ..|.||..--=.-..+=..|-+|.+.||.+|.+..
T Consensus       142 ~~c~ic~d~iwglgrqgyrcinckl~vhkkch~~v  176 (593)
T KOG0695|consen  142 AYCGICSDRIWGLGRQGYRCINCKLLVHKKCHGLV  176 (593)
T ss_pred             eeeeechhhhhhcccccceeecceeehhhhhcccc
Confidence            58999975322223345689999999999999854


No 147
>KOG4299 consensus PHD Zn-finger protein [General function prediction only]
Probab=21.84  E-value=33  Score=40.62  Aligned_cols=46  Identities=24%  Similarity=0.570  Sum_probs=30.0

Q ss_pred             CCccccccCCcccCCCeeEEccccCceeccccccccccC--CCCceeeecccC
Q 002995          396 LDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPV--NGVLWLCNLCRP  446 (860)
Q Consensus       396 ~~~C~vC~~~~~~~~n~ll~C~~C~~~vH~~CYg~~~~~--~~~~W~C~~C~~  446 (860)
                      ...|.+|...+.     ++-|+.|.-.+|+.|-+....+  .+..|.|..|..
T Consensus        47 ~ts~~~~~~~gn-----~~~~~~~~~s~h~~~~~~~~sp~~~~~~~~~~~~~~   94 (613)
T KOG4299|consen   47 ATSCGICKSGGN-----LLCCDHCPASFHLECDKPPLSPDLKGSEINCSRCPK   94 (613)
T ss_pred             hhhcchhhhcCC-----ccccccCccccchhccCcccCcccccccccccCCCc
Confidence            567888876543     5678888888888886643322  234677777754


No 148
>KOG1844 consensus PHD Zn-finger proteins [General function prediction only]
Probab=21.71  E-value=56  Score=38.62  Aligned_cols=49  Identities=27%  Similarity=0.851  Sum_probs=36.1

Q ss_pred             CccccccCCcccCCCeeEEccccCceeccccccccccCCCCceeeecccCC
Q 002995          397 DKCSVCHMDEEYQNNLFLQCDKCRMMVHARCYGELEPVNGVLWLCNLCRPG  447 (860)
Q Consensus       397 ~~C~vC~~~~~~~~n~ll~C~~C~~~vH~~CYg~~~~~~~~~W~C~~C~~~  447 (860)
                      ..|. |...++. +..++.|+.|+.--|..|+|.....+-....|..|...
T Consensus        87 ~~c~-c~~~~~~-~g~~i~c~~c~~Wqh~~C~g~~~~~~p~~y~c~~c~~~  135 (508)
T KOG1844|consen   87 SRCD-CGLEDDM-EGLMIQCDWCGRWQHKICCGSFKSTKPDKYVCEICTPR  135 (508)
T ss_pred             cccc-cccccCC-CceeeCCcccCcccCceeeeecCCCCchhceeeeeccc
Confidence            3554 7765542 66799999999999999999654333236889999754


No 149
>PF10367 Vps39_2:  Vacuolar sorting protein 39 domain 2;  InterPro: IPR019453  This entry represents a domain found in the vacuolar sorting protein Vps39 and transforming growth factor beta receptor-associated protein Trap1. Vps39, a component of the C-Vps complex, is thought to be required for the fusion of endosomes and other types of transport intermediates with the vacuole [, ]. In Saccharomyces cerevisiae (Baker's yeast), Vps39 has been shown to stimulate nucleotide exchange []. Trap1 plays a role in the TGF-beta/activin signaling pathway. It associates with inactive heteromeric TGF-beta and activin receptor complexes, mainly through the type II receptor, and is released upon activation of signaling [, ]. The precise function of this domain has not been characterised In Vps39 this domain is involved in localisation and in mediating the interactions with Vps11 []. 
Probab=21.40  E-value=76  Score=28.49  Aligned_cols=31  Identities=32%  Similarity=0.634  Sum_probs=21.2

Q ss_pred             CCCccccccCCcccCCCeeEEccccCceeccccc
Q 002995          395 DLDKCSVCHMDEEYQNNLFLQCDKCRMMVHARCY  428 (860)
Q Consensus       395 ~~~~C~vC~~~~~~~~n~ll~C~~C~~~vH~~CY  428 (860)
                      +...|.||+..-..  ..++ -.-|+..||..|.
T Consensus        77 ~~~~C~vC~k~l~~--~~f~-~~p~~~v~H~~C~  107 (109)
T PF10367_consen   77 ESTKCSVCGKPLGN--SVFV-VFPCGHVVHYSCI  107 (109)
T ss_pred             CCCCccCcCCcCCC--ceEE-EeCCCeEEecccc
Confidence            45689999875432  2333 3366799999996


No 150
>PF10367 Vps39_2:  Vacuolar sorting protein 39 domain 2;  InterPro: IPR019453  This entry represents a domain found in the vacuolar sorting protein Vps39 and transforming growth factor beta receptor-associated protein Trap1. Vps39, a component of the C-Vps complex, is thought to be required for the fusion of endosomes and other types of transport intermediates with the vacuole [, ]. In Saccharomyces cerevisiae (Baker's yeast), Vps39 has been shown to stimulate nucleotide exchange []. Trap1 plays a role in the TGF-beta/activin signaling pathway. It associates with inactive heteromeric TGF-beta and activin receptor complexes, mainly through the type II receptor, and is released upon activation of signaling [, ]. The precise function of this domain has not been characterised In Vps39 this domain is involved in localisation and in mediating the interactions with Vps11 []. 
Probab=21.27  E-value=39  Score=30.43  Aligned_cols=30  Identities=27%  Similarity=0.609  Sum_probs=20.4

Q ss_pred             ccceecCCcceeeEEcCCCCcCccccchhhh
Q 002995          509 LLCSICGVSYGACIQCSNTTCRVAYHPLCAR  539 (860)
Q Consensus       509 ~~C~~C~~~~Ga~iqC~~~~C~~~fH~~CA~  539 (860)
                      ..|.+|+++-|...--.+ -|...||..|+.
T Consensus        79 ~~C~vC~k~l~~~~f~~~-p~~~v~H~~C~~  108 (109)
T PF10367_consen   79 TKCSVCGKPLGNSVFVVF-PCGHVVHYSCIK  108 (109)
T ss_pred             CCccCcCCcCCCceEEEe-CCCeEEeccccc
Confidence            469999997553322222 456899999985


No 151
>PF09465 LBR_tudor:  Lamin-B receptor of TUDOR domain;  InterPro: IPR019023  The Lamin-B receptor is a chromatin and lamin binding protein in the inner nuclear membrane. It is one of the integral inner nuclear envelope membrane proteins responsible for targeting nuclear membranes to chromatin, being a downstream effector of Ran, a small Ras-like nuclear GTPase which regulates NE assembly. Lamin-B receptor interacts with importin beta, a Ran-binding protein, thereby directly contributing to the fusion of membrane vesicles and the formation of the nuclear envelope []. ; PDB: 2L8D_A 2DIG_A.
Probab=20.96  E-value=1.7e+02  Score=23.53  Aligned_cols=50  Identities=10%  Similarity=0.258  Sum_probs=29.6

Q ss_pred             CCCCCCCEEEEEecccCCC-ceEeeCCCCcCccccccccCCCCcEEEEEeCCCCEEEEeCCCc
Q 002995           80 QELEPGDIIWAKLTGHAMW-PAIVVDESLIGDYKGLNKISGGRSIPVQFFGTHDFARINVKQV  141 (860)
Q Consensus        80 ~~f~~GdlVwaK~~g~PwW-Pa~V~~~~~~~~~~~~~~~~~~~~~~V~FFg~~~~awv~~~~l  141 (860)
                      ..|..|++|-+|-.|-.-| +|+|.+-+           .....|.|.|= |++--=++..+|
T Consensus         4 ~k~~~Ge~V~~rWP~s~lYYe~kV~~~d-----------~~~~~y~V~Y~-DGtel~lke~di   54 (55)
T PF09465_consen    4 RKFAIGEVVMVRWPGSSLYYEGKVLSYD-----------SKSDRYTVLYE-DGTELELKENDI   54 (55)
T ss_dssp             SSS-SS-EEEEE-TTTS-EEEEEEEEEE-----------TTTTEEEEEET-TS-EEEEECCCE
T ss_pred             ccccCCCEEEEECCCCCcEEEEEEEEec-----------ccCceEEEEEc-CCCEEEeccccc
Confidence            6799999999998888877 99997621           12334667764 444444444443


No 152
>PF12148 DUF3590:  Protein of unknown function (DUF3590);  InterPro: IPR021991  This domain is found in eukaryotes, and is typically between 83 and 97 amino acids in length. It is found in association with PF00097 from PFAM, PF02182 from PFAM, PF00628 from PFAM, PF00240 from PFAM. There are two conserved sequence motifs: RAR and NYN. The domain is part of the protein NIRF which has zinc finger and ubiquitinating domains. The function of this domain is likely to be mainly structural, however this has not been confirmed. ; PDB: 3DB4_A 3ASK_A 3DB3_A 2L3R_A.
Probab=20.48  E-value=1e+02  Score=27.10  Aligned_cols=31  Identities=26%  Similarity=0.467  Sum_probs=20.0

Q ss_pred             CCCceeeeeeeecCCC------CceEEEecCCCcccc
Q 002995            1 MQDWYSGFVVGYDSES------NRHHVKYVDGDEEDL   31 (860)
Q Consensus         1 ~~~~~~g~v~~~~~~~------~~h~~~y~d~d~e~~   31 (860)
                      |-+||.+.|+.--...      =..||||||=.++-+
T Consensus         9 ~gAWfEa~i~~i~~~~~~~~e~viYhIkyddype~gv   45 (85)
T PF12148_consen    9 MGAWFEAQIVTITKKCMSDDEDVIYHIKYDDYPENGV   45 (85)
T ss_dssp             T-EEEEEEEEEEEES-SSSSTTEEEEEEETT-GGG-E
T ss_pred             CcceEEEEEEEeeccCCCCCCCEEEEEEeccCCCcCc
Confidence            4579999887655433      278999998765544


No 153
>smart00439 BAH Bromo adjacent homology domain.
Probab=20.25  E-value=1.2e+02  Score=27.58  Aligned_cols=39  Identities=10%  Similarity=0.250  Sum_probs=28.0

Q ss_pred             CCCCCEEEEEecc--cCCCceEeeCCCCcCccccccccCCC-CcEEEEEe
Q 002995           82 LEPGDIIWAKLTG--HAMWPAIVVDESLIGDYKGLNKISGG-RSIPVQFF  128 (860)
Q Consensus        82 f~~GdlVwaK~~g--~PwWPa~V~~~~~~~~~~~~~~~~~~-~~~~V~FF  128 (860)
                      |++||.|..+-..  =|.|-|+|..--...        .+. ..+.|+||
T Consensus         2 ~~vgd~V~v~~~~~~~~~~i~~I~~i~~~~--------~~~~~~~~v~Wf   43 (120)
T smart00439        2 IRVGDFVLVEPDDADEPYYIGRIEEIFETK--------KNSEKMVRVRWF   43 (120)
T ss_pred             cccCCEEEEeCCCCCCCCEEEEEEEEEECC--------CCCEEEEEEEEE
Confidence            7899999999764  689999997632211        122 36789999


Done!