Query         047869
Match_columns 2233
No_of_seqs    157 out of 219
Neff          3.2 
Searched_HMMs 46136
Date          Fri Mar 29 04:01:23 2013
Command       hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/047869.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/047869hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG2752 Uncharacterized conser  99.4 4.5E-14 9.7E-19  161.9   2.1  102 1574-1681   26-130 (345)
  2 KOG1777 Putative Zn-finger pro  99.4 3.2E-13   7E-18  159.1   4.5   86 1573-1659  528-614 (625)
  3 KOG0943 Predicted ubiquitin-pr  99.2 2.6E-12 5.6E-17  160.1   2.8   71 1587-1659 1239-1310(3015)
  4 PF02207 zf-UBR:  Putative zinc  99.2 7.3E-12 1.6E-16  118.6   3.7   58 1589-1650    1-60  (71)
  5 smart00396 ZnF_UBR1 Putative z  99.0 4.8E-10   1E-14  107.0   5.5   64 1589-1656    1-68  (71)
  6 KOG1776 Zn-binding protein Pus  99.0 2.3E-11 4.9E-16  148.3  -4.9   62 1588-1650  764-826 (1110)
  7 PF10168 Nup88:  Nuclear pore c  97.9 0.00077 1.7E-08   87.7  22.4  218 1809-2036   32-312 (717)
  8 cd00200 WD40 WD40 domain, foun  97.7   0.068 1.5E-06   56.0  28.9  185 1818-2039   20-207 (289)
  9 cd00200 WD40 WD40 domain, foun  97.6   0.057 1.2E-06   56.6  26.8  153 1865-2039   11-165 (289)
 10 PLN00181 protein SPA1-RELATED;  97.0     1.1 2.4E-05   59.4  33.8  156 1814-1987  489-648 (793)
 11 PF10282 Lactonase:  Lactonase,  96.9    0.61 1.3E-05   55.9  28.8  259 1814-2097   42-325 (345)
 12 PRK11028 6-phosphogluconolacto  96.6     1.7 3.7E-05   51.0  28.5  197 1814-2030   40-248 (330)
 13 PRK11028 6-phosphogluconolacto  96.3     3.3 7.1E-05   48.7  31.6  253 1867-2145   38-305 (330)
 14 PLN00181 protein SPA1-RELATED;  96.1     1.5 3.3E-05   58.1  27.0  160 1863-2039  483-648 (793)
 15 KOG0291 WD40-repeat-containing  95.8     5.7 0.00012   52.7  28.9  150 1814-1987   61-219 (893)
 16 PTZ00421 coronin; Provisional   95.2     2.6 5.6E-05   53.9  23.2  196 1819-2040   88-291 (493)
 17 KOG0318 WD40 repeat stress pro  93.2      22 0.00049   46.1  24.7  262 1865-2144  192-473 (603)
 18 KOG0283 WD40 repeat-containing  92.9    0.63 1.4E-05   61.2  11.6  117 1866-1987  412-576 (712)
 19 KOG2110 Uncharacterized conser  92.9     9.5 0.00021   47.6  20.4  164 1848-2041   75-250 (391)
 20 TIGR03866 PQQ_ABC_repeats PQQ-  92.7      10 0.00022   42.3  19.1  191 1814-2038   78-278 (300)
 21 PTZ00421 coronin; Provisional   92.5      43 0.00092   43.4  28.1  153 1873-2038   37-197 (493)
 22 KOG0289 mRNA splicing factor [  91.4     4.3 9.3E-05   51.2  15.4  136 1855-1999  295-431 (506)
 23 PF10214 Rrn6:  RNA polymerase   91.1     8.4 0.00018   51.8  19.1  172 1805-1988   77-277 (765)
 24 KOG0294 WD40 repeat-containing  90.8     3.8 8.3E-05   50.1  14.0  170 1798-1999  121-293 (362)
 25 PF04762 IKI3:  IKI3 family;  I  90.6      28 0.00062   48.2  23.5  122 1862-1987  303-456 (928)
 26 TIGR02658 TTQ_MADH_Hv methylam  90.3      59  0.0013   40.8  26.9   26 2016-2042  200-226 (352)
 27 KOG1446 Histone H3 (Lys4) meth  89.2      10 0.00023   46.3  15.8  106 1926-2039  155-262 (311)
 28 PF14727 PHTB1_N:  PTHB1 N-term  88.8      48  0.0011   42.4  21.9  261 1866-2143   27-317 (418)
 29 KOG0310 Conserved WD40 repeat-  87.9      21 0.00045   45.9  17.7  193 1819-2037   79-307 (487)
 30 PF08662 eIF2A:  Eukaryotic tra  87.8      25 0.00054   39.8  17.0  138 1868-2029   10-163 (194)
 31 PF10282 Lactonase:  Lactonase,  87.7      75  0.0016   38.6  22.3  225 1812-2049   90-332 (345)
 32 TIGR03866 PQQ_ABC_repeats PQQ-  87.1      56  0.0012   36.5  28.1  148 1867-2038   34-186 (300)
 33 KOG2110 Uncharacterized conser  86.5      12 0.00025   46.9  14.2  149 1826-2001  151-343 (391)
 34 KOG4460 Nuclear pore complex,   86.3     2.6 5.6E-05   54.0   8.9   87 1857-1944   97-202 (741)
 35 PTZ00420 coronin; Provisional   85.8      31 0.00067   45.5  18.5  195 1819-2039   87-293 (568)
 36 PF15492 Nbas_N:  Neuroblastoma  85.8      69  0.0015   39.3  19.7  237 1875-2142    8-257 (282)
 37 KOG4378 Nuclear protein COP1 [  85.6      24 0.00051   45.7  16.4  243 1823-2123    7-254 (673)
 38 PTZ00420 coronin; Provisional   85.4 1.4E+02  0.0031   39.7  27.3  116 1910-2038   74-196 (568)
 39 KOG0650 WD40 repeat nucleolar   84.8      22 0.00047   46.8  15.9  261 1810-2144  402-680 (733)
 40 PF04053 Coatomer_WDAD:  Coatom  84.3      20 0.00044   45.8  15.6  148 1859-2040  104-263 (443)
 41 KOG0650 WD40 repeat nucleolar   84.1     9.1  0.0002   50.0  12.3  148 1866-2036  524-677 (733)
 42 PF11715 Nup160:  Nucleoporin N  82.8     4.2   9E-05   51.8   8.9  102 1939-2040   65-177 (547)
 43 KOG0264 Nucleosome remodeling   78.8      11 0.00023   47.8  10.2  140 1823-1987  245-404 (422)
 44 KOG0286 G-protein beta subunit  78.4 1.4E+02  0.0031   37.2  18.6  181 1826-2035   75-299 (343)
 45 PF08596 Lgl_C:  Lethal giant l  78.2      49  0.0011   41.8  15.7  171 1811-1987   89-290 (395)
 46 PF00643 zf-B_box:  B-box zinc   77.4     1.9 4.1E-05   37.4   2.5   28 1604-1635   14-41  (42)
 47 KOG0279 G protein beta subunit  76.7      66  0.0014   39.6  15.3   90 1865-1962  107-200 (315)
 48 KOG0269 WD40 repeat-containing  75.0      14 0.00031   49.4  10.1  144 1822-1987  193-340 (839)
 49 PF08662 eIF2A:  Eukaryotic tra  72.9      33 0.00073   38.8  11.4   98 1866-1976   62-163 (194)
 50 KOG1274 WD40 repeat protein [G  72.2      23 0.00051   48.2  11.2  118 1863-1987  138-262 (933)
 51 KOG1445 Tumor-specific antigen  68.5      67  0.0015   42.8  13.6  154 1818-1987  590-750 (1012)
 52 KOG0315 G-protein beta subunit  66.9      24 0.00051   42.7   8.8   75 1910-1988   40-114 (311)
 53 PF03178 CPSF_A:  CPSF A subuni  64.0 1.1E+02  0.0024   36.6  13.7  149 1819-1987   98-267 (321)
 54 KOG2055 WD40 repeat protein [G  63.5 1.7E+02  0.0036   38.3  15.4  158 1866-2041  216-376 (514)
 55 KOG0293 WD40 repeat-containing  62.9      63  0.0014   41.4  11.6  155 1864-2038  270-424 (519)
 56 KOG4378 Nuclear protein COP1 [  62.1      48   0.001   43.1  10.6  119 1804-1988  161-281 (673)
 57 PRK01742 tolB translocation pr  61.9 4.2E+02  0.0091   33.5  18.8  108 1865-1981  205-316 (429)
 58 PF04841 Vps16_N:  Vps16, N-ter  61.7      29 0.00063   43.7   8.8   86 1888-1984   62-152 (410)
 59 KOG1587 Cytoplasmic dynein int  61.4      33 0.00072   45.2   9.5  115 1864-1987  399-516 (555)
 60 PF04053 Coatomer_WDAD:  Coatom  60.3      71  0.0015   41.1  11.9   37 1852-1891   23-59  (443)
 61 KOG1539 WD repeat protein [Gen  57.4 7.4E+02   0.016   34.9  23.1  196 1849-2100   63-271 (910)
 62 KOG0266 WD40 repeat-containing  57.2 5.3E+02   0.011   33.1  24.2  241 1869-2143  165-408 (456)
 63 KOG0277 Peroxisomal targeting   56.8      38 0.00083   41.2   8.1   93 1864-1960    9-110 (311)
 64 smart00336 BBOX B-Box-type zin  55.5     7.7 0.00017   33.1   1.8   30 1602-1635   12-41  (42)
 65 PF00780 CNH:  CNH domain;  Int  55.3 3.9E+02  0.0084   31.0  24.6  142 1816-1982    4-161 (275)
 66 PF08596 Lgl_C:  Lethal giant l  51.8 4.3E+02  0.0093   33.9  16.5  157 1865-2040    3-174 (395)
 67 KOG0289 mRNA splicing factor [  51.7 1.5E+02  0.0034   38.4  12.4  141 1808-1983  313-458 (506)
 68 KOG1897 Damage-specific DNA bi  51.1 9.8E+02   0.021   34.4  22.5  110 1866-1990  409-520 (1096)
 69 KOG0266 WD40 repeat-containing  50.4 6.7E+02   0.014   32.3  24.5  162 1860-2038  200-363 (456)
 70 KOG0290 Conserved WD40 repeat-  50.3      44 0.00096   41.2   7.4  117 1819-1951  209-333 (364)
 71 cd00021 BBOX B-Box-type zinc f  50.2      11 0.00023   31.9   1.8   29 1603-1635   10-38  (39)
 72 PF02239 Cytochrom_D1:  Cytochr  50.2 6.3E+02   0.014   31.9  18.0  132 1825-1987   13-158 (369)
 73 PF00780 CNH:  CNH domain;  Int  49.8 4.7E+02    0.01   30.3  15.5  132 1810-1948   38-173 (275)
 74 KOG0315 G-protein beta subunit  47.2 6.6E+02   0.014   31.3  18.1  198 1813-2041   45-247 (311)
 75 PF04841 Vps16_N:  Vps16, N-ter  46.8 7.4E+02   0.016   31.7  18.0   48 1933-1987   62-109 (410)
 76 KOG2048 WD40 repeat protein [G  46.3 9.8E+02   0.021   33.0  21.5  200 1818-2037  393-599 (691)
 77 KOG0264 Nucleosome remodeling   45.6 2.8E+02  0.0061   36.0  13.4  156 1819-1991  190-351 (422)
 78 KOG1063 RNA polymerase II elon  45.4 1.3E+02  0.0027   40.8  10.7  153 1814-1987  531-699 (764)
 79 KOG0278 Serine/threonine kinas  45.3 3.9E+02  0.0084   33.1  13.7   89 1933-2036  123-211 (334)
 80 PF04762 IKI3:  IKI3 family;  I  44.7 1.2E+03   0.025   33.4  24.6  117 1868-1987  261-379 (928)
 81 PF06977 SdiA-regulated:  SdiA-  43.6 6.8E+02   0.015   30.4  19.0  117 1860-2030   18-138 (248)
 82 KOG1446 Histone H3 (Lys4) meth  43.4 7.2E+02   0.016   31.5  15.9  160 1810-1986  142-303 (311)
 83 KOG0647 mRNA export protein (c  42.3 7.3E+02   0.016   31.6  15.6   87 1865-1954   29-157 (347)
 84 KOG1140 N-end rule pathway, re  41.5      14  0.0003   53.3   1.9   65 1588-1657   13-80  (1738)
 85 KOG0279 G protein beta subunit  40.3 2.1E+02  0.0045   35.6  10.8  114 1810-1941  194-314 (315)
 86 PF14761 HPS3_N:  Hermansky-Pud  40.2      66  0.0014   38.2   6.6   74 1813-1893  140-214 (215)
 87 KOG0276 Vesicle coat complex C  40.0 5.3E+02   0.011   35.2  14.8  224 1866-2143  143-379 (794)
 88 KOG4532 WD40-like repeat conta  39.8 5.5E+02   0.012   32.2  13.9  112 1866-1985  161-278 (344)
 89 KOG0269 WD40 repeat-containing  39.8 1.3E+03   0.027   32.6  18.3  128 1852-1987   76-207 (839)
 90 KOG0307 Vesicle coat complex C  38.7      71  0.0015   44.7   7.5  195 1820-2039   81-284 (1049)
 91 KOG0772 Uncharacterized conser  38.2 1.1E+03   0.024   31.8  17.0  258 1866-2165  217-508 (641)
 92 COG2319 FOG: WD40 repeat [Gene  36.2 4.9E+02   0.011   29.0  12.1  113 1864-1984  110-226 (466)
 93 PRK02889 tolB translocation pr  36.0   1E+03   0.022   30.2  17.0  175 1866-2068  242-423 (427)
 94 KOG1240 Protein kinase contain  35.2   7E+02   0.015   36.5  15.4  148 1825-1987 1068-1225(1431)
 95 PF11768 DUF3312:  Protein of u  35.1 1.3E+03   0.029   31.3  18.7   63 2082-2145  202-290 (545)
 96 KOG0293 WD40 repeat-containing  35.1 1.2E+03   0.026   30.8  17.1  113 1865-1987  226-342 (519)
 97 KOG2041 WD40 repeat protein [G  33.7 1.7E+02  0.0037   39.8   9.3   98 1818-1937  228-334 (1189)
 98 PF12657 TFIIIC_delta:  Transcr  33.7 1.7E+02  0.0036   32.8   8.2  108 1811-1942    7-123 (173)
 99 PRK04922 tolB translocation pr  33.4 1.1E+03   0.024   29.9  21.1  112 1865-1987  205-324 (433)
100 KOG3339 Predicted glycosyltran  33.0      55  0.0012   38.4   4.4  108  289-398    51-173 (211)
101 KOG1332 Vesicle coat complex C  32.8 3.9E+02  0.0084   33.0  11.3  162 1866-2037  105-284 (299)
102 KOG0268 Sof1-like rRNA process  29.5 3.3E+02  0.0071   35.0  10.3  160 1855-2033  178-339 (433)
103 PRK03629 tolB translocation pr  29.4 1.3E+03   0.028   29.4  21.3  155 1865-2042  200-364 (429)
104 KOG2096 WD40 repeat protein [G  28.7 3.4E+02  0.0073   34.5  10.1  147 1814-1982  193-346 (420)
105 KOG1240 Protein kinase contain  28.3 3.8E+02  0.0082   38.9  11.4  135 1892-2035 1034-1177(1431)
106 KOG1274 WD40 repeat protein [G  28.3 5.8E+02   0.013   36.0  12.9  118 1814-1941  144-263 (933)
107 KOG0641 WD40 repeat protein [G  28.1 6.1E+02   0.013   31.0  11.7   75 1912-1987   91-171 (350)
108 PF10168 Nup88:  Nuclear pore c  27.5 1.8E+02  0.0038   39.9   8.3   34 2111-2144  146-179 (717)
109 KOG3881 Uncharacterized conser  27.5   4E+02  0.0087   34.5  10.6  103 1877-1987  173-277 (412)
110 PF06977 SdiA-regulated:  SdiA-  27.2 3.2E+02   0.007   33.0   9.6   71 1817-1894   73-148 (248)
111 KOG0321 WD40 repeat-containing  27.2 7.7E+02   0.017   33.8  13.3  110 1868-1979   54-178 (720)
112 KOG0973 Histone transcription   26.9 8.6E+02   0.019   34.8  14.3  118 1862-1986  128-261 (942)
113 TIGR01171 rplB_bact ribosomal   26.5 1.3E+03   0.029   28.7  14.6  130 1850-1984   61-212 (273)
114 KOG0307 Vesicle coat complex C  26.3      62  0.0013   45.2   4.0   71 1866-1941  256-328 (1049)
115 PF06433 Me-amine-dh_H:  Methyl  26.3 8.3E+02   0.018   31.3  13.0   93 1932-2042  118-216 (342)
116 PF00400 WD40:  WD domain, G-be  25.4 1.4E+02  0.0029   24.7   4.5   28 1910-1938   11-39  (39)
117 PRK03629 tolB translocation pr  24.9 9.8E+02   0.021   30.5  13.7  151 1868-2039  247-404 (429)
118 KOG1900 Nuclear pore complex,   24.4 1.4E+03   0.031   33.8  15.8  192 1818-2027   89-328 (1311)
119 KOG2445 Nuclear pore complex c  24.4 2.5E+02  0.0054   35.4   8.0   85 1856-1941  164-257 (361)
120 KOG1007 WD repeat protein TSSC  24.4 1.6E+03   0.034   28.8  15.1  231 1785-2036   40-286 (370)
121 CHL00052 rpl2 ribosomal protei  23.9 1.4E+03   0.029   28.7  13.9  130 1850-1984   61-212 (273)
122 KOG0642 Cell-cycle nuclear pro  23.2 8.7E+02   0.019   32.8  12.7  118 1913-2036  296-423 (577)
123 KOG1273 WD40 repeat protein [G  23.1 1.4E+03   0.031   29.4  13.8  188 1797-2012   14-208 (405)
124 KOG0319 WD40-repeat-containing  22.4 2.3E+03    0.05   30.1  18.8  160 1805-1987  104-268 (775)
125 KOG0296 Angio-associated migra  22.3 8.2E+02   0.018   31.7  11.8  141 1879-2053  247-396 (399)
126 KOG1407 WD40 repeat protein [F  21.9 4.3E+02  0.0093   32.9   9.2  143 1818-1987  117-261 (313)
127 KOG0639 Transducin-like enhanc  21.6 6.3E+02   0.014   33.8  10.9  159 1877-2038  432-621 (705)
128 KOG1587 Cytoplasmic dynein int  20.6 1.1E+03   0.023   32.0  13.2  162 1867-2041  351-518 (555)
129 KOG3334 Transcription initiati  20.3      76  0.0017   35.8   2.6   50   92-144    14-77  (148)
130 PF09826 Beta_propel:  Beta pro  20.2 5.7E+02   0.012   34.0  10.6  108 1824-1940  398-518 (521)

No 1  
>KOG2752 consensus Uncharacterized conserved protein, contains N-recognin-type Zn-finger [General function prediction only]
Probab=99.42  E-value=4.5e-14  Score=161.86  Aligned_cols=102  Identities=26%  Similarity=0.573  Sum_probs=81.2

Q ss_pred             CCCCCCcchhhcccCCCcceeccCCccc-ccceEeeccCCCCC-CceeehhhhhhhcCCCcEEEE-eecceeeecCCCCC
Q 047869         1574 KDEEDDPNSERALASKVCTFTSSGSNFM-EQHWYFCYTCDLTV-SKGCCSVCAKVCHRGHRVVYS-RSSRFFCDCGAGGV 1650 (2233)
Q Consensus      1574 ~~~~~~~~~e~al~~~~CTFt~TG~~fi-~Q~~Y~C~TC~l~~-~~GVC~aCA~vCHkGHdVvyl-~k~~FfCDCGa~~~ 1650 (2233)
                      +.++..+-+..+.+.+.|||.   ++|+ ||.+|.|+||.+.. ..|||++|+..||.||+++++ ++|+|+||||+.++
T Consensus        26 ~lE~~a~~vL~~~~~~~CTy~---~Gy~~rQ~l~sClTC~P~~~~agvC~~C~~~CH~~H~lveL~tKR~FrCDCg~sk~  102 (345)
T KOG2752|consen   26 ELEDEADVVLGTQNPDVCTYA---KGYKKRQALFSCLTCTPAPEMAGVCYACSLSCHDGHELVELYTKRNFRCDCGNSKF  102 (345)
T ss_pred             HHHHHHHhhcCCCCCcccccc---cCcccccceeEeecccCChhhceeEEEeeeeecCCceeeeccccCCcccccccccc
Confidence            333445556667788999999   5566 89999999999975 789999999999999999999 99999999999998


Q ss_pred             CCCCceeCCCCCCCCCCCccccccCcccccC
Q 047869         1651 RGSSCQCLKPRKYTGSDSASSRAASNFQSFL 1681 (2233)
Q Consensus      1651 ~~~~Cqclk~r~~~~~~~as~r~s~nf~~~~ 1681 (2233)
                      ...+|.++......   ...+.+++||++.+
T Consensus       103 g~~sc~l~~~~~~~---n~~N~YNhNfqG~~  130 (345)
T KOG2752|consen  103 GRCSCNLLEDKDAE---NSENLYNHNFQGLF  130 (345)
T ss_pred             cccccccccccccc---cchhhhhhhhccee
Confidence            77667665432111   33567889999876


No 2  
>KOG1777 consensus Putative Zn-finger protein [General function prediction only]
Probab=99.36  E-value=3.2e-13  Score=159.09  Aligned_cols=86  Identities=34%  Similarity=0.743  Sum_probs=77.0

Q ss_pred             CCCCCCCcchhhcccCCCcceeccCCc-ccccceEeeccCCCCCCceeehhhhhhhcCCCcEEEEeecceeeecCCCCCC
Q 047869         1573 DKDEEDDPNSERALASKVCTFTSSGSN-FMEQHWYFCYTCDLTVSKGCCSVCAKVCHRGHRVVYSRSSRFFCDCGAGGVR 1651 (2233)
Q Consensus      1573 d~~~~~~~~~e~al~~~~CTFt~TG~~-fi~Q~~Y~C~TC~l~~~~GVC~aCA~vCHkGHdVvyl~k~~FfCDCGa~~~~ 1651 (2233)
                      ++.-++.|.+|+|+..+.|.|.+++.. |+++.+|.|.||+.++..+||..|.+.||+||+|.+++..+||||||++.. 
T Consensus       528 N~iydN~D~vekAik~GqCLfkvSs~~syPMHnFYRC~TCNttdRNAIC~nCI~~CH~GH~Vefir~Drffcdcgagtl-  606 (625)
T KOG1777|consen  528 NQIYDNLDHVEKAIKKGQCLFKVSSYTSYPMHNFYRCITCNTTDRNAICVNCIKRCHEGHDVEFIRHDRFFCDCGAGTL-  606 (625)
T ss_pred             cccccchHHHHHHhhcCceEEEecCCCcccccceeEeeecCCccccHHHHHHHHHhcCCCceEEEeeceEEEecCCcee-
Confidence            455567889999999999999977666 669999999999999999999999999999999999999999999999876 


Q ss_pred             CCCceeCC
Q 047869         1652 GSSCQCLK 1659 (2233)
Q Consensus      1652 ~~~Cqclk 1659 (2233)
                      ...|++..
T Consensus       607 ~~~c~lq~  614 (625)
T KOG1777|consen  607 SNVCDLQG  614 (625)
T ss_pred             cceeeccC
Confidence            45588754


No 3  
>KOG0943 consensus Predicted ubiquitin-protein ligase/hyperplastic discs protein, HECT superfamily [Posttranslational modification, protein turnover, chaperones]
Probab=99.23  E-value=2.6e-12  Score=160.13  Aligned_cols=71  Identities=37%  Similarity=0.829  Sum_probs=63.1

Q ss_pred             cCCCcceeccCCcccccceEeeccCCCCCCceeehhhhhhhcCCCcEEEEeec-ceeeecCCCCCCCCCceeCC
Q 047869         1587 ASKVCTFTSSGSNFMEQHWYFCYTCDLTVSKGCCSVCAKVCHRGHRVVYSRSS-RFFCDCGAGGVRGSSCQCLK 1659 (2233)
Q Consensus      1587 ~~~~CTFt~TG~~fi~Q~~Y~C~TC~l~~~~GVC~aCA~vCHkGHdVvyl~k~-~FfCDCGa~~~~~~~Cqclk 1659 (2233)
                      .+..|+|++||.++++|+.|.|.||++.++.+||+.||.+||+|||....+.+ ..||||+.+.  .++|+.+.
T Consensus      1239 ~NDtCSFTWTGadHINQDIfECkTCGL~~SLCCCsECAltCHk~HDCkLKRTSPTAYCDCWEKs--sCkCKaLI 1310 (3015)
T KOG0943|consen 1239 CNDTCSFTWTGADHINQDIFECKTCGLLESLCCCSECALTCHKGHDCKLKRTSPTAYCDCWEKS--SCKCKALI 1310 (3015)
T ss_pred             ecCccceeecchhhccchhhhhcccccchhhhhhHHHHHHhccCCccceeccCCcceeehhhcc--cccchhhh
Confidence            68899999999999999999999999999999999999999999999999655 6999999853  45555543


No 4  
>PF02207 zf-UBR:  Putative zinc finger in N-recognin (UBR box);  InterPro: IPR003126 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  The N-end rule-based degradation signal, which targets a protein for ubiquitin-dependent proteolysis, comprises a destabilising amino-terminal residue and a specific internal lysine residue. This entry describes a putative zinc finger in N-recognin, a recognition component of the N-end rule pathway. 
More information about these proteins can be found at Protein of the Month: Zinc Fingers.; GO: 0004842 ubiquitin-protein ligase activity, 0008270 zinc ion binding; PDB: 3NY1_B 3NIS_F 3NIM_A 3NIK_A 3NII_A 3NIH_A 3NIL_D 3NIN_B 3NIJ_A 3NIT_A ....
Probab=99.21  E-value=7.3e-12  Score=118.59  Aligned_cols=58  Identities=50%  Similarity=0.973  Sum_probs=44.0

Q ss_pred             CCcceeccCCcccccceEeeccCCCCCCceeehhh-hhhhcCCCcEEEEeec-ceeeecCCCCC
Q 047869         1589 KVCTFTSSGSNFMEQHWYFCYTCDLTVSKGCCSVC-AKVCHRGHRVVYSRSS-RFFCDCGAGGV 1650 (2233)
Q Consensus      1589 ~~CTFt~TG~~fi~Q~~Y~C~TC~l~~~~GVC~aC-A~vCHkGHdVvyl~k~-~FfCDCGa~~~ 1650 (2233)
                      +.|++.++.+    |.+|.|+||.+++..++|..| ++.||+||++++.+.. +|+||||+...
T Consensus         1 ~~C~~~~~~~----q~~y~C~tC~~~~~~~iC~~CF~~~~H~gH~~~~~~~~~~~~CDCG~~~~   60 (71)
T PF02207_consen    1 KKCTYVWTSG----QIFYRCLTCSLDESSGICEECFANSCHEGHRVVYYRSSSGGCCDCGDPEA   60 (71)
T ss_dssp             -SS--B--TT-----EEEEETTTBSSTT-BBEHHHHCTSGGGGSSEEEEE--SCEBB-TT-GGG
T ss_pred             CcCCCCCcCC----CEEEECccCCCCCCEEEchhhCCCCCcCCCcEEEEEeCCCeEEeCCCCcc
Confidence            4799987755    999999999999999999999 9999999999999877 99999998765


No 5  
>smart00396 ZnF_UBR1 Putative zinc finger in N-recognin, a recognition component of the N-end rule pathway. Domain is involved in recognition of N-end rule substrates in yeast Ubr1p
Probab=98.99  E-value=4.8e-10  Score=106.96  Aligned_cols=64  Identities=34%  Similarity=0.770  Sum_probs=54.9

Q ss_pred             CCcceeccCCcccccceEeeccCCCCCCceeehhhhh-hhcCCCcEEEEeecc-eeeecCCCCC--CCCCce
Q 047869         1589 KVCTFTSSGSNFMEQHWYFCYTCDLTVSKGCCSVCAK-VCHRGHRVVYSRSSR-FFCDCGAGGV--RGSSCQ 1656 (2233)
Q Consensus      1589 ~~CTFt~TG~~fi~Q~~Y~C~TC~l~~~~GVC~aCA~-vCHkGHdVvyl~k~~-FfCDCGa~~~--~~~~Cq 1656 (2233)
                      ..|+|..+.++.+    |.|+||.+.+..++|..|++ .||+||++.+.+.++ |+||||+...  +++.|+
T Consensus         1 ~~C~~~~~~~~~~----y~C~tC~~~~~~~iC~~Cf~~~~H~gH~~~~~~~~~~~~CDCG~~~~~~~~~~C~   68 (71)
T smart00396        1 DVCTYKFTGGEVI----YRCKTCGLDPTCVLCSDCFRSNCHKGHDYSLKTSRGSGICDCGDKEAWNEDLKCK   68 (71)
T ss_pred             CCCCCccCCCCEE----EECcCCCCCCCEeEChHHCCCCCCCCCCEEEEEecCCEEECCCChhccCCCcccc
Confidence            3699998877655    99999999999999999999 999999999998888 9999999742  445554


No 6  
>KOG1776 consensus Zn-binding protein Push [Signal transduction mechanisms]
Probab=98.97  E-value=2.3e-11  Score=148.34  Aligned_cols=62  Identities=16%  Similarity=0.070  Sum_probs=57.4

Q ss_pred             CCCcceeccCCcccccceEeeccCCCCCCc-eeehhhhhhhcCCCcEEEEeecceeeecCCCCC
Q 047869         1588 SKVCTFTSSGSNFMEQHWYFCYTCDLTVSK-GCCSVCAKVCHRGHRVVYSRSSRFFCDCGAGGV 1650 (2233)
Q Consensus      1588 ~~~CTFt~TG~~fi~Q~~Y~C~TC~l~~~~-GVC~aCA~vCHkGHdVvyl~k~~FfCDCGa~~~ 1650 (2233)
                      -..|||.++|+-++-|+||.|+||+|..+. |+|.+||++||+||++-|. ++.|+||||.+..
T Consensus       764 v~~~T~Kkk~q~~m~n~~~q~~k~~M~~~~gG~~kV~s~t~H~~~~i~~S-~~~~~C~C~Es~~  826 (1110)
T KOG1776|consen  764 VRDETEKKKKQMAMLNREKQLTKMRMKVGTGGQIKVSSRTLHNEPSIDDS-DSLPCCICRESVI  826 (1110)
T ss_pred             HHHHHHhhhhhHHHHHHHhhhhhheeeeccCceEEEeeecccCCCCcccc-CCCceeecccccc
Confidence            456999999999999999999999998776 8999999999999999999 9999999998764


No 7  
>PF10168 Nup88:  Nuclear pore component;  InterPro: IPR019321  Nup88 can be divided into two structural domains; the N-terminal two-thirds of the protein have no obvious structural motifs. It is, however, where it binds to Nup98, one of the components of the nuclear pore. The C-terminal end is a predicted coiled-coil domain. Nup88 is overexpressed in tumour cells. 
Probab=97.90  E-value=0.00077  Score=87.69  Aligned_cols=218  Identities=23%  Similarity=0.328  Sum_probs=136.7

Q ss_pred             ccccceecccCceEEEe--eCCeEEEEechhhhcccccCCccccccccccccccccceEEEEeecccCccceEEeecccc
Q 047869         1809 LVKSLLSVSSRGRLAVG--EGDKVAIFDVGQLIGQATIQPVTADKTNVKPLSRNIVRFEIVHLAFNSIVENYLTVAGYED 1886 (2233)
Q Consensus      1809 ~iRqLLSas~rGrLAVa--EgdKVTILqlsaLLkQad~s~~skdKlTLtrLSsa~VpFeVlsLafNP~nEdyLAVcGLkD 1886 (2233)
                      ..|+++.+.. |+|.|-  +...+..+++..+...++..  .+. ---+-+-++++.|+|-+|..|| ++.+||++|-+.
T Consensus        32 ~~rNLl~~~d-~~L~vWd~~e~~l~~~nlr~~~~~~~~~--~~~-~~q~L~~~~~~~f~v~~i~~n~-~g~~lal~G~~~  106 (717)
T PF10168_consen   32 HTRNLLACRD-GDLFVWDSSECCLLTVNLRSLESDAEGP--AKS-SYQKLLPSNPPLFEVHQISLNP-TGSLLALVGPRG  106 (717)
T ss_pred             cceeeEEEeC-CEEEEEECCCCEEEEEeeccccccccCc--ccc-CcceeecCCCCceeEEEEEECC-CCCEEEEEcCCc
Confidence            4588888885 787764  55556666777776554421  111 1112234567889999999999 999999999999


Q ss_pred             eEEEEec----CCCceee-------e-eeeeecc----CCceEEEeEEecCC--CceEEEEecC-eEEEEeCcCCCCCCc
Q 047869         1887 CQVLTLN----PRGEVTD-------R-LAIELAL----QGAYIRRVDWVPGS--PVQLMVVTNK-FVKIYDLSQDNISPL 1947 (2233)
Q Consensus      1887 C~VLTfs----s~GeV~D-------R-L~LeL~L----eg~fIIKa~WLPGS--Qt~LAVVT~~-FVKIYDLS~D~lSPv 1947 (2233)
                      +.|+-+=    ++|...|       | ..|.-.+    .+.-|.+|.|=|+|  .+-|+|-|++ -+++||++ +.-.|.
T Consensus       107 v~V~~LP~r~g~~~~~~~g~~~i~Crt~~v~~~~~~~~~~~~i~qv~WhP~s~~~~~l~vLtsdn~lR~y~~~-~~~~p~  185 (717)
T PF10168_consen  107 VVVLELPRRWGKNGEFEDGKKEINCRTVPVDERFFTSNSSLEIKQVRWHPWSESDSHLVVLTSDNTLRLYDIS-DPQHPW  185 (717)
T ss_pred             EEEEEeccccCccccccCCCcceeEEEEEechhhccCCCCceEEEEEEcCCCCCCCeEEEEecCCEEEEEecC-CCCCCe
Confidence            9999983    3443322       1 1222222    25689999999997  4667777776 89999997 666776


Q ss_pred             EEEEcCCCC-------------------eeEEEEEEec--C--------C----cEEEEEEecCCceEEEEeccc---CC
Q 047869         1948 HYFTLPDDM-------------------IVDATLVIAS--R--------G----KMFLIVLSECGSLYRLELSVE---GN 1991 (2233)
Q Consensus      1948 yyF~LpsGk-------------------IrDaTfv~~e--~--------G----~~~ILVLSS~G~LY~Qels~s---~d 1991 (2233)
                      -.+.+.++.                   =.|+.|....  .        +    ..-|+|+-..|.+|+--.+..   .+
T Consensus       186 ~v~~~~~~~~~~~~~~~~~~~~~slge~AV~FDfgP~~~~~~~~~~~~~~~~~~~~p~~vL~~ng~v~~~~~~l~~~~~~  265 (717)
T PF10168_consen  186 QVLSLSPGEKSSSLSSRGRSFLASLGETAVDFDFGPLDTSPKTLTGQKSKQEKIEWPIFVLRENGDVYLLYTSLQDENSN  265 (717)
T ss_pred             EEEEcccCcccccccCCCccccccchheeeecccccccccccccccccCCCCceeccEEEEecCCCEEEEEEecccCccc
Confidence            666654211                   1333443311  1        1    235899999999998655541   11


Q ss_pred             ----Cccccceeeeecccccc-cCCeEEEEeccc-cceeeEEecCCcEEEE
Q 047869         1992 ----VGATPLKEIIQFNDREI-HAKGLSLYFSST-YKLLFLSFQDGTTLVG 2036 (2233)
Q Consensus      1992 ----~g~~~ltEvvq~~~~q~-~~~GVSVyYS~t-l~LLF~SY~~G~Sf~a 2036 (2233)
                          .|+..|.    .+..++ +..+=||.+-++ =.+|.+++++|+-+=|
T Consensus       266 ~~~~~gpl~~~----p~~~dnyg~d~c~i~~l~~~p~~~via~~~G~l~h~  312 (717)
T PF10168_consen  266 LPKLQGPLPMQ----PPADDNYGLDACSILCLPSLPPVLVIATSNGKLYHC  312 (717)
T ss_pred             cceecCceecC----CCCcccCCCceeeEEEecCCCCEEEEEecCCeEEEE
Confidence                1222222    222222 334555555444 3788999999999943


No 8  
>cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto
Probab=97.71  E-value=0.068  Score=56.04  Aligned_cols=185  Identities=19%  Similarity=0.181  Sum_probs=107.9

Q ss_pred             cCceEEEe-eCCeEEEEechhhhcccccCCccccccccccccccccceEEEEeecccCccceEEeecc-cceEEEEecCC
Q 047869         1818 SRGRLAVG-EGDKVAIFDVGQLIGQATIQPVTADKTNVKPLSRNIVRFEIVHLAFNSIVENYLTVAGY-EDCQVLTLNPR 1895 (2233)
Q Consensus      1818 ~rGrLAVa-EgdKVTILqlsaLLkQad~s~~skdKlTLtrLSsa~VpFeVlsLafNP~nEdyLAVcGL-kDC~VLTfss~ 1895 (2233)
                      .++.+|++ +.+.|.++++..--.             ...+.  .-...+..+.++|.. ++|+++|. ..+.|..+...
T Consensus        20 ~~~~l~~~~~~g~i~i~~~~~~~~-------------~~~~~--~~~~~i~~~~~~~~~-~~l~~~~~~~~i~i~~~~~~   83 (289)
T cd00200          20 DGKLLATGSGDGTIKVWDLETGEL-------------LRTLK--GHTGPVRDVAASADG-TYLASGSSDKTIRLWDLETG   83 (289)
T ss_pred             CCCEEEEeecCcEEEEEEeeCCCc-------------EEEEe--cCCcceeEEEECCCC-CEEEEEcCCCeEEEEEcCcc
Confidence            34667765 688999998753110             00000  011234688888854 78888774 34444444322


Q ss_pred             CceeeeeeeeeccCCceEEEeEEecCCCceEEEEe-cCeEEEEeCcCCCCCCcEEEEcCCCCeeEEEEEEecCCcEEEEE
Q 047869         1896 GEVTDRLAIELALQGAYIRRVDWVPGSPVQLMVVT-NKFVKIYDLSQDNISPLHYFTLPDDMIVDATLVIASRGKMFLIV 1974 (2233)
Q Consensus      1896 GeV~DRL~LeL~Leg~fIIKa~WLPGSQt~LAVVT-~~FVKIYDLS~D~lSPvyyF~LpsGkIrDaTfv~~e~G~~~ILV 1974 (2233)
                       +    ....+......|..+.|.|.. ..++..+ ...|+|||+..  ..+.+.+....+.|....+-.  ++ .+++.
T Consensus        84 -~----~~~~~~~~~~~i~~~~~~~~~-~~~~~~~~~~~i~~~~~~~--~~~~~~~~~~~~~i~~~~~~~--~~-~~l~~  152 (289)
T cd00200          84 -E----CVRTLTGHTSYVSSVAFSPDG-RILSSSSRDKTIKVWDVET--GKCLTTLRGHTDWVNSVAFSP--DG-TFVAS  152 (289)
T ss_pred             -c----ceEEEeccCCcEEEEEEcCCC-CEEEEecCCCeEEEEECCC--cEEEEEeccCCCcEEEEEEcC--cC-CEEEE
Confidence             1    122222334579999999984 3345555 56999999983  345566665566788777543  34 35555


Q ss_pred             EecCCceEEEEecccCCCccccceeeeecccccccCCeEEEEeccccceeeEEecCCcEEEEEcC
Q 047869         1975 LSECGSLYRLELSVEGNVGATPLKEIIQFNDREIHAKGLSLYFSSTYKLLFLSFQDGTTLVGRLS 2039 (2233)
Q Consensus      1975 LSS~G~LY~Qels~s~d~g~~~ltEvvq~~~~q~~~~GVSVyYS~tl~LLF~SY~~G~Sf~a~Ls 2039 (2233)
                      .+.+|.++.-++.....      ...++.    ..+.-.++.+++.-+.+++...+|...+-.+.
T Consensus       153 ~~~~~~i~i~d~~~~~~------~~~~~~----~~~~i~~~~~~~~~~~l~~~~~~~~i~i~d~~  207 (289)
T cd00200         153 SSQDGTIKLWDLRTGKC------VATLTG----HTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLS  207 (289)
T ss_pred             EcCCCcEEEEEcccccc------ceeEec----CccccceEEECCCcCEEEEecCCCcEEEEECC
Confidence            55699999887762111      111111    11234467888887778887778777665553


No 9  
>cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto
Probab=97.62  E-value=0.057  Score=56.61  Aligned_cols=153  Identities=16%  Similarity=0.132  Sum_probs=94.0

Q ss_pred             EEEEeecccCccceEEeec-ccceEEEEecCCCceeeeeeeeeccCCceEEEeEEecCCCceEEEEe-cCeEEEEeCcCC
Q 047869         1865 EIVHLAFNSIVENYLTVAG-YEDCQVLTLNPRGEVTDRLAIELALQGAYIRRVDWVPGSPVQLMVVT-NKFVKIYDLSQD 1942 (2233)
Q Consensus      1865 eVlsLafNP~nEdyLAVcG-LkDC~VLTfss~GeV~DRL~LeL~Leg~fIIKa~WLPGSQt~LAVVT-~~FVKIYDLS~D 1942 (2233)
                      .|.++.++|. +++||+++ -..+.++.+.... .    ..........|..+.|.|..+. |++++ ...|+|||+...
T Consensus        11 ~i~~~~~~~~-~~~l~~~~~~g~i~i~~~~~~~-~----~~~~~~~~~~i~~~~~~~~~~~-l~~~~~~~~i~i~~~~~~   83 (289)
T cd00200          11 GVTCVAFSPD-GKLLATGSGDGTIKVWDLETGE-L----LRTLKGHTGPVRDVAASADGTY-LASGSSDKTIRLWDLETG   83 (289)
T ss_pred             CEEEEEEcCC-CCEEEEeecCcEEEEEEeeCCC-c----EEEEecCCcceeEEEECCCCCE-EEEEcCCCeEEEEEcCcc
Confidence            4778899885 67888877 4455666664332 1    1112223455789999999844 55555 669999999865


Q ss_pred             CCCCcEEEEcCCCCeeEEEEEEecCCcEEEEEEecCCceEEEEecccCCCccccceeeeecccccccCCeEEEEeccccc
Q 047869         1943 NISPLHYFTLPDDMIVDATLVIASRGKMFLIVLSECGSLYRLELSVEGNVGATPLKEIIQFNDREIHAKGLSLYFSSTYK 2022 (2233)
Q Consensus      1943 ~lSPvyyF~LpsGkIrDaTfv~~e~G~~~ILVLSS~G~LY~Qels~s~d~g~~~ltEvvq~~~~q~~~~GVSVyYS~tl~ 2022 (2233)
                        .+...|.-..+.|....+.  +++ .+++..+.+|.++.-++....      ....++  .  ..+.-.++.+++.-+
T Consensus        84 --~~~~~~~~~~~~i~~~~~~--~~~-~~~~~~~~~~~i~~~~~~~~~------~~~~~~--~--~~~~i~~~~~~~~~~  148 (289)
T cd00200          84 --ECVRTLTGHTSYVSSVAFS--PDG-RILSSSSRDKTIKVWDVETGK------CLTTLR--G--HTDWVNSVAFSPDGT  148 (289)
T ss_pred             --cceEEEeccCCcEEEEEEc--CCC-CEEEEecCCCeEEEEECCCcE------EEEEec--c--CCCcEEEEEEcCcCC
Confidence              3555666666678777754  333 344455559999988776211      001111  1  122345677888866


Q ss_pred             eeeEEecCCcEEEEEcC
Q 047869         2023 LLFLSFQDGTTLVGRLS 2039 (2233)
Q Consensus      2023 LLF~SY~~G~Sf~a~Ls 2039 (2233)
                      +++.+..+|...+-.+.
T Consensus       149 ~l~~~~~~~~i~i~d~~  165 (289)
T cd00200         149 FVASSSQDGTIKLWDLR  165 (289)
T ss_pred             EEEEEcCCCcEEEEEcc
Confidence            66666668877776653


No 10 
>PLN00181 protein SPA1-RELATED; Provisional
Probab=96.96  E-value=1.1  Score=59.36  Aligned_cols=156  Identities=16%  Similarity=0.266  Sum_probs=94.8

Q ss_pred             eecccCc-eEEE-eeCCeEEEEechhhhcccccCCccccccccccccccccceEEEEeecccCccceEEeecccceEEEE
Q 047869         1814 LSVSSRG-RLAV-GEGDKVAIFDVGQLIGQATIQPVTADKTNVKPLSRNIVRFEIVHLAFNSIVENYLTVAGYEDCQVLT 1891 (2233)
Q Consensus      1814 LSas~rG-rLAV-aEgdKVTILqlsaLLkQad~s~~skdKlTLtrLSsa~VpFeVlsLafNP~nEdyLAVcGLkDC~VLT 1891 (2233)
                      ++.+..| .+|. ++.++|-|+++........        ....|.....-...|..+++||..+++||.+|. |-.|.-
T Consensus       489 i~fs~dg~~latgg~D~~I~iwd~~~~~~~~~--------~~~~~~~~~~~~~~v~~l~~~~~~~~~las~~~-Dg~v~l  559 (793)
T PLN00181        489 IGFDRDGEFFATAGVNKKIKIFECESIIKDGR--------DIHYPVVELASRSKLSGICWNSYIKSQVASSNF-EGVVQV  559 (793)
T ss_pred             EEECCCCCEEEEEeCCCEEEEEECCccccccc--------ccccceEEecccCceeeEEeccCCCCEEEEEeC-CCeEEE
Confidence            3444444 4665 4889999999865432211        000111111113468899999998999988876 445555


Q ss_pred             ecCC-CceeeeeeeeeccCCceEEEeEEecCCCceEEEEecC-eEEEEeCcCCCCCCcEEEEcCCCCeeEEEEEEecCCc
Q 047869         1892 LNPR-GEVTDRLAIELALQGAYIRRVDWVPGSPVQLMVVTNK-FVKIYDLSQDNISPLHYFTLPDDMIVDATLVIASRGK 1969 (2233)
Q Consensus      1892 fss~-GeV~DRL~LeL~Leg~fIIKa~WLPGSQt~LAVVT~~-FVKIYDLS~D~lSPvyyF~LpsGkIrDaTfv~~e~G~ 1969 (2233)
                      ++-. |...-    .+.-....|..+.|-|.....|+....+ .|+|||+....  +...+.. .+.|..+.|. ..+| 
T Consensus       560 Wd~~~~~~~~----~~~~H~~~V~~l~~~p~~~~~L~Sgs~Dg~v~iWd~~~~~--~~~~~~~-~~~v~~v~~~-~~~g-  630 (793)
T PLN00181        560 WDVARSQLVT----EMKEHEKRVWSIDYSSADPTLLASGSDDGSVKLWSINQGV--SIGTIKT-KANICCVQFP-SESG-  630 (793)
T ss_pred             EECCCCeEEE----EecCCCCCEEEEEEcCCCCCEEEEEcCCCEEEEEECCCCc--EEEEEec-CCCeEEEEEe-CCCC-
Confidence            5432 32221    2222356799999998666667766665 89999997543  3444443 2355555442 3445 


Q ss_pred             EEEEEEecCCceEEEEec
Q 047869         1970 MFLIVLSECGSLYRLELS 1987 (2233)
Q Consensus      1970 ~~ILVLSS~G~LY~Qels 1987 (2233)
                      .++++-|.+|.||+-++.
T Consensus       631 ~~latgs~dg~I~iwD~~  648 (793)
T PLN00181        631 RSLAFGSADHKVYYYDLR  648 (793)
T ss_pred             CEEEEEeCCCeEEEEECC
Confidence            467788899999988876


No 11 
>PF10282 Lactonase:  Lactonase, 7-bladed beta-propeller;  InterPro: IPR019405  6-phosphogluconolactonases (6PGL) 3.1.1.31 from EC, which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate is one of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE IPR022528 from INTERPRO [] and the Pseudomonas aeruginosa DevB IPR005900 from INTERPRO [] types.  This entry contains bacterial 6-phosphogluconolactonases (6PGL) YbhE-type 3.1.1.31 from EC which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase 5.5.1.5 from EC and muconate cycloisomerase 5.5.1.1 from EC, which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold [].; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H.
Probab=96.95  E-value=0.61  Score=55.92  Aligned_cols=259  Identities=18%  Similarity=0.270  Sum_probs=157.7

Q ss_pred             eecc-cCceEEE-ee----CCeEEEEechhhhcccccCCccccccccccccccc-cceEEEEeecccCccceEEeeccc-
Q 047869         1814 LSVS-SRGRLAV-GE----GDKVAIFDVGQLIGQATIQPVTADKTNVKPLSRNI-VRFEIVHLAFNSIVENYLTVAGYE- 1885 (2233)
Q Consensus      1814 LSas-~rGrLAV-aE----gdKVTILqlsaLLkQad~s~~skdKlTLtrLSsa~-VpFeVlsLafNP~nEdyLAVcGLk- 1885 (2233)
                      |..+ .+++|++ .|    .+.|+.+.+..            ++-+++.+++.+ .+-.-.+++.+| ++++|.|+-|. 
T Consensus        42 l~~~~~~~~LY~~~e~~~~~g~v~~~~i~~------------~~g~L~~~~~~~~~g~~p~~i~~~~-~g~~l~vany~~  108 (345)
T PF10282_consen   42 LAVSPDGRRLYVVNEGSGDSGGVSSYRIDP------------DTGTLTLLNSVPSGGSSPCHIAVDP-DGRFLYVANYGG  108 (345)
T ss_dssp             EEE-TTSSEEEEEETTSSTTTEEEEEEEET------------TTTEEEEEEEEEESSSCEEEEEECT-TSSEEEEEETTT
T ss_pred             EEEEeCCCEEEEEEccccCCCCEEEEEECC------------CcceeEEeeeeccCCCCcEEEEEec-CCCEEEEEEccC
Confidence            4443 3455554 35    46887777632            223455555555 555666888988 78999998664 


Q ss_pred             -ceEEEEecCCCceeeeeeee-e-------c-cCCceEEEeEEecCCCceEE-EEecCeEEEEeCcCCC--CCCcEEEEc
Q 047869         1886 -DCQVLTLNPRGEVTDRLAIE-L-------A-LQGAYIRRVDWVPGSPVQLM-VVTNKFVKIYDLSQDN--ISPLHYFTL 1952 (2233)
Q Consensus      1886 -DC~VLTfss~GeV~DRL~Le-L-------~-Leg~fIIKa~WLPGSQt~LA-VVT~~FVKIYDLS~D~--lSPvyyF~L 1952 (2233)
                       ...|+.++.+|.+.....+- .       . .++.+.-.+.|-|..+..++ =.-++.|.+|++..+.  +.|...+.+
T Consensus       109 g~v~v~~l~~~g~l~~~~~~~~~~g~g~~~~rq~~~h~H~v~~~pdg~~v~v~dlG~D~v~~~~~~~~~~~l~~~~~~~~  188 (345)
T PF10282_consen  109 GSVSVFPLDDDGSLGEVVQTVRHEGSGPNPDRQEGPHPHQVVFSPDGRFVYVPDLGADRVYVYDIDDDTGKLTPVDSIKV  188 (345)
T ss_dssp             TEEEEEEECTTSEEEEEEEEEESEEEESSTTTTSSTCEEEEEE-TTSSEEEEEETTTTEEEEEEE-TTS-TEEEEEEEEC
T ss_pred             CeEEEEEccCCcccceeeeecccCCCCCcccccccccceeEEECCCCCEEEEEecCCCEEEEEEEeCCCceEEEeecccc
Confidence             56678999999998886431 1       1 23778888999998765332 1237899999999887  888889999


Q ss_pred             CCCC-eeEEEEEEecCCcEEEEEEecCCceEEEEecccCCCccccceeeee-cccc-cccCCeEEEEeccccceeeEEec
Q 047869         1953 PDDM-IVDATLVIASRGKMFLIVLSECGSLYRLELSVEGNVGATPLKEIIQ-FNDR-EIHAKGLSLYFSSTYKLLFLSFQ 2029 (2233)
Q Consensus      1953 psGk-IrDaTfv~~e~G~~~ILVLSS~G~LY~Qels~s~d~g~~~ltEvvq-~~~~-q~~~~GVSVyYS~tl~LLF~SY~ 2029 (2233)
                      |.|. =|..+|  .++|++.-++---.+.|...++....  +.......+. .|.. .....+--|..|++-+.||+|-.
T Consensus       189 ~~G~GPRh~~f--~pdg~~~Yv~~e~s~~v~v~~~~~~~--g~~~~~~~~~~~~~~~~~~~~~~~i~ispdg~~lyvsnr  264 (345)
T PF10282_consen  189 PPGSGPRHLAF--SPDGKYAYVVNELSNTVSVFDYDPSD--GSLTEIQTISTLPEGFTGENAPAEIAISPDGRFLYVSNR  264 (345)
T ss_dssp             STTSSEEEEEE---TTSSEEEEEETTTTEEEEEEEETTT--TEEEEEEEEESCETTSCSSSSEEEEEE-TTSSEEEEEEC
T ss_pred             ccCCCCcEEEE--cCCcCEEEEecCCCCcEEEEeecccC--CceeEEEEeeeccccccccCCceeEEEecCCCEEEEEec
Confidence            9887 455554  46787666666667778877776322  2333333222 2221 11225777899999999999998


Q ss_pred             CCcEEEE-EcCCCcccccceeEEEEccCCCCCCCcccceeeccCCCceEEEEeccCCCceEEEEecCCc
Q 047869         2030 DGTTLVG-RLSPNAASLSEVSYVFEEQDGKLRSAGLHRWKELLASSGLFFCFSSLKSNAAVAVSLGTNE 2097 (2233)
Q Consensus      2030 ~G~Sf~a-~Ls~~~~sv~eis~Vfe~~~gk~~~a~L~qWsEV~~hPGLf~cls~~~sn~pvvv~l~pd~ 2097 (2233)
                      ...++.. .++..++.++.+..+-..  |+      .-+.-.+.+-|=+..++.+.+|...++.+.++.
T Consensus       265 ~~~sI~vf~~d~~~g~l~~~~~~~~~--G~------~Pr~~~~s~~g~~l~Va~~~s~~v~vf~~d~~t  325 (345)
T PF10282_consen  265 GSNSISVFDLDPATGTLTLVQTVPTG--GK------FPRHFAFSPDGRYLYVANQDSNTVSVFDIDPDT  325 (345)
T ss_dssp             TTTEEEEEEECTTTTTEEEEEEEEES--SS------SEEEEEE-TTSSEEEEEETTTTEEEEEEEETTT
T ss_pred             cCCEEEEEEEecCCCceEEEEEEeCC--CC------CccEEEEeCCCCEEEEEecCCCeEEEEEEeCCC
Confidence            7665543 455555445444433221  11      112223344555555567788888887776544


No 12 
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=96.61  E-value=1.7  Score=51.01  Aligned_cols=197  Identities=12%  Similarity=0.102  Sum_probs=105.8

Q ss_pred             eecccCc-eEEEe--eCCeEEEEechhhhcccccCCccccccccccccccccceEEEEeecccCccceEEeecccc--eE
Q 047869         1814 LSVSSRG-RLAVG--EGDKVAIFDVGQLIGQATIQPVTADKTNVKPLSRNIVRFEIVHLAFNSIVENYLTVAGYED--CQ 1888 (2233)
Q Consensus      1814 LSas~rG-rLAVa--EgdKVTILqlsaLLkQad~s~~skdKlTLtrLSsa~VpFeVlsLafNP~nEdyLAVcGLkD--C~ 1888 (2233)
                      +..+..| +|+++  ..+.|.++++..     +      .+  ++.....+.+-...+|+++| ++++|.|+++.+  +.
T Consensus        40 l~~spd~~~lyv~~~~~~~i~~~~~~~-----~------g~--l~~~~~~~~~~~p~~i~~~~-~g~~l~v~~~~~~~v~  105 (330)
T PRK11028         40 MVISPDKRHLYVGVRPEFRVLSYRIAD-----D------GA--LTFAAESPLPGSPTHISTDH-QGRFLFSASYNANCVS  105 (330)
T ss_pred             EEECCCCCEEEEEECCCCcEEEEEECC-----C------Cc--eEEeeeecCCCCceEEEECC-CCCEEEEEEcCCCeEE
Confidence            4555444 46664  467787787731     1      00  11111222222346899998 778888888744  45


Q ss_pred             EEEecCCCceeeeeeeeeccCCceEEEeEEecCCCceEEEE-ecCeEEEEeCcCCC-CCC--cEEEEcCCCC-eeEEEEE
Q 047869         1889 VLTLNPRGEVTDRLAIELALQGAYIRRVDWVPGSPVQLMVV-TNKFVKIYDLSQDN-ISP--LHYFTLPDDM-IVDATLV 1963 (2233)
Q Consensus      1889 VLTfss~GeV~DRL~LeL~Leg~fIIKa~WLPGSQt~LAVV-T~~FVKIYDLS~D~-lSP--vyyF~LpsGk-IrDaTfv 1963 (2233)
                      |+-++.+|.+...+..-..  .....-+.+-|+.+..++.- -.+.|.|||+..+. +.|  .....++.|. .+.++| 
T Consensus       106 v~~~~~~g~~~~~~~~~~~--~~~~~~~~~~p~g~~l~v~~~~~~~v~v~d~~~~g~l~~~~~~~~~~~~g~~p~~~~~-  182 (330)
T PRK11028        106 VSPLDKDGIPVAPIQIIEG--LEGCHSANIDPDNRTLWVPCLKEDRIRLFTLSDDGHLVAQEPAEVTTVEGAGPRHMVF-  182 (330)
T ss_pred             EEEECCCCCCCCceeeccC--CCcccEeEeCCCCCEEEEeeCCCCEEEEEEECCCCcccccCCCceecCCCCCCceEEE-
Confidence            6666666765443332111  12233455677765433211 23689999998642 332  2344455453 455543 


Q ss_pred             EecCCcEEEEEEecCCceEEEEecccCCCccccceeeee-cccc-cccCCeEEEEeccccceeeEEecC
Q 047869         1964 IASRGKMFLIVLSECGSLYRLELSVEGNVGATPLKEIIQ-FNDR-EIHAKGLSLYFSSTYKLLFLSFQD 2030 (2233)
Q Consensus      1964 ~~e~G~~~ILVLSS~G~LY~Qels~s~d~g~~~ltEvvq-~~~~-q~~~~GVSVyYS~tl~LLF~SY~~ 2030 (2233)
                       .++|++..++...+|.|..-++....  +...+...+. .|.. ....-+..|.++++-+.||++-..
T Consensus       183 -~pdg~~lyv~~~~~~~v~v~~~~~~~--~~~~~~~~~~~~p~~~~~~~~~~~i~~~pdg~~lyv~~~~  248 (330)
T PRK11028        183 -HPNQQYAYCVNELNSSVDVWQLKDPH--GEIECVQTLDMMPADFSDTRWAADIHITPDGRHLYACDRT  248 (330)
T ss_pred             -CCCCCEEEEEecCCCEEEEEEEeCCC--CCEEEEEEEecCCCcCCCCccceeEEECCCCCEEEEecCC
Confidence             47776444444448999988886321  1222222221 1111 112224458899999999998543


No 13 
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=96.33  E-value=3.3  Score=48.72  Aligned_cols=253  Identities=13%  Similarity=0.157  Sum_probs=128.8

Q ss_pred             EEeecccCccceEEeecccceEE--EEecCCCceeeeeeeeeccCCceEEEeEEecCCCceEEEEe--cCeEEEEeCcCC
Q 047869         1867 VHLAFNSIVENYLTVAGYEDCQV--LTLNPRGEVTDRLAIELALQGAYIRRVDWVPGSPVQLMVVT--NKFVKIYDLSQD 1942 (2233)
Q Consensus      1867 lsLafNP~nEdyLAVcGLkDC~V--LTfss~GeV~DRL~LeL~Leg~fIIKa~WLPGSQt~LAVVT--~~FVKIYDLS~D 1942 (2233)
                      ..|+++| +.++|+|.+..+-.|  +.++.+|.+...-.+  ...+. .--+.+-|..+. |.++.  ...|.+||+..|
T Consensus        38 ~~l~~sp-d~~~lyv~~~~~~~i~~~~~~~~g~l~~~~~~--~~~~~-p~~i~~~~~g~~-l~v~~~~~~~v~v~~~~~~  112 (330)
T PRK11028         38 QPMVISP-DKRHLYVGVRPEFRVLSYRIADDGALTFAAES--PLPGS-PTHISTDHQGRF-LFSASYNANCVSVSPLDKD  112 (330)
T ss_pred             ccEEECC-CCCEEEEEECCCCcEEEEEECCCCceEEeeee--cCCCC-ceEEEECCCCCE-EEEEEcCCCeEEEEEECCC
Confidence            3678888 778998877755555  555656665422222  12222 223556666654 33333  468999999765


Q ss_pred             CCC--CcEEEEcCCCCeeEEEEEEecCCcEEEEEEecCCceEEEEecccCCCccccceeeeecccccccCCeEEEEeccc
Q 047869         1943 NIS--PLHYFTLPDDMIVDATLVIASRGKMFLIVLSECGSLYRLELSVEGNVGATPLKEIIQFNDREIHAKGLSLYFSST 2020 (2233)
Q Consensus      1943 ~lS--PvyyF~LpsGkIrDaTfv~~e~G~~~ILVLSS~G~LY~Qels~s~d~g~~~ltEvvq~~~~q~~~~GVSVyYS~t 2020 (2233)
                      ...  +...+.-.+ ..+.++  +.++|+...+.-..+|.|+.-++...+.- .......++.+.+   .+--.+.++++
T Consensus       113 g~~~~~~~~~~~~~-~~~~~~--~~p~g~~l~v~~~~~~~v~v~d~~~~g~l-~~~~~~~~~~~~g---~~p~~~~~~pd  185 (330)
T PRK11028        113 GIPVAPIQIIEGLE-GCHSAN--IDPDNRTLWVPCLKEDRIRLFTLSDDGHL-VAQEPAEVTTVEG---AGPRHMVFHPN  185 (330)
T ss_pred             CCCCCceeeccCCC-cccEeE--eCCCCCEEEEeeCCCCEEEEEEECCCCcc-cccCCCceecCCC---CCCceEEECCC
Confidence            422  222221111 123333  36778766666666788998888632210 0001122233322   11224678899


Q ss_pred             cceeeEEec-CCcEEEEEcCCCcccccceeEEEEccCCCCCCCccccee-eccCCC-c--eEEEEeccCCCceEEEEecC
Q 047869         2021 YKLLFLSFQ-DGTTLVGRLSPNAASLSEVSYVFEEQDGKLRSAGLHRWK-ELLASS-G--LFFCFSSLKSNAAVAVSLGT 2095 (2233)
Q Consensus      2021 l~LLF~SY~-~G~Sf~a~Ls~~~~sv~eis~Vfe~~~gk~~~a~L~qWs-EV~~hP-G--Lf~cls~~~sn~pvvv~l~p 2095 (2233)
                      -+.||++-. +|+..+-.++..++.++.+..+....  .....  .+|. ++.-+| |  ++++  ...+|..-++.+..
T Consensus       186 g~~lyv~~~~~~~v~v~~~~~~~~~~~~~~~~~~~p--~~~~~--~~~~~~i~~~pdg~~lyv~--~~~~~~I~v~~i~~  259 (330)
T PRK11028        186 QQYAYCVNELNSSVDVWQLKDPHGEIECVQTLDMMP--ADFSD--TRWAADIHITPDGRHLYAC--DRTASLISVFSVSE  259 (330)
T ss_pred             CCEEEEEecCCCEEEEEEEeCCCCCEEEEEEEecCC--CcCCC--CccceeEEECCCCCEEEEe--cCCCCeEEEEEEeC
Confidence            999999987 77777778865443333333221111  11111  1342 333233 3  4443  44566666666655


Q ss_pred             Cce---eeeccccccCCCCCeEEEEEeecCCCCCeEEEE-EeeCCceeEEeccC
Q 047869         2096 NEL---IAQNMRHAAGSTSPLVGVTAYKPLSKDKVHCLV-LHDDGSLQIYSHVP 2145 (2233)
Q Consensus      2096 d~I---~iQeiK~~~~sSs~vdgva~y~p~s~~rttlLL-LcEDGSLrIYsa~P 2145 (2233)
                      +.-   .++.+...  ..+  .+++    ++.+-..+++ ...||++.+|...+
T Consensus       260 ~~~~~~~~~~~~~~--~~p--~~~~----~~~dg~~l~va~~~~~~v~v~~~~~  305 (330)
T PRK11028        260 DGSVLSFEGHQPTE--TQP--RGFN----IDHSGKYLIAAGQKSHHISVYEIDG  305 (330)
T ss_pred             CCCeEEEeEEEecc--ccC--CceE----ECCCCCEEEEEEccCCcEEEEEEcC
Confidence            442   33333321  111  1332    2334445544 44599999997543


No 14 
>PLN00181 protein SPA1-RELATED; Provisional
Probab=96.09  E-value=1.5  Score=58.08  Aligned_cols=160  Identities=13%  Similarity=0.160  Sum_probs=97.7

Q ss_pred             ceEEEEeecccCccceEEeecccceEEEEecCCCceeeeeeee---eccC-CceEEEeEEecCCCceEEEEecC-eEEEE
Q 047869         1863 RFEIVHLAFNSIVENYLTVAGYEDCQVLTLNPRGEVTDRLAIE---LALQ-GAYIRRVDWVPGSPVQLMVVTNK-FVKIY 1937 (2233)
Q Consensus      1863 pFeVlsLafNP~nEdyLAVcGLkDC~VLTfss~GeV~DRL~Le---L~Le-g~fIIKa~WLPGSQt~LAVVT~~-FVKIY 1937 (2233)
                      ...|.+++|+| ++++||.+|. |..|.-++.+..+.+...++   ..+. ..-|..+.|.|.....||....+ .|+||
T Consensus       483 ~~~V~~i~fs~-dg~~latgg~-D~~I~iwd~~~~~~~~~~~~~~~~~~~~~~~v~~l~~~~~~~~~las~~~Dg~v~lW  560 (793)
T PLN00181        483 SNLVCAIGFDR-DGEFFATAGV-NKKIKIFECESIIKDGRDIHYPVVELASRSKLSGICWNSYIKSQVASSNFEGVVQVW  560 (793)
T ss_pred             CCcEEEEEECC-CCCEEEEEeC-CCEEEEEECCcccccccccccceEEecccCceeeEEeccCCCCEEEEEeCCCeEEEE
Confidence            34478899998 7899999884 55555555433222211111   1112 33578899999766677766655 89999


Q ss_pred             eCcCCCCCCcEEEEcCCCCeeEEEEEEecCCcEEEEEEecCCceEEEEecccCCCccccceeeeecccccccCCeEEEEe
Q 047869         1938 DLSQDNISPLHYFTLPDDMIVDATLVIASRGKMFLIVLSECGSLYRLELSVEGNVGATPLKEIIQFNDREIHAKGLSLYF 2017 (2233)
Q Consensus      1938 DLS~D~lSPvyyF~LpsGkIrDaTfv~~e~G~~~ILVLSS~G~LY~Qels~s~d~g~~~ltEvvq~~~~q~~~~GVSVyY 2017 (2233)
                      |+....  ....+.--.+.|.++.|-. .+| .+++.-|.+|.+..-++......      ..++  .   .....++.|
T Consensus       561 d~~~~~--~~~~~~~H~~~V~~l~~~p-~~~-~~L~Sgs~Dg~v~iWd~~~~~~~------~~~~--~---~~~v~~v~~  625 (793)
T PLN00181        561 DVARSQ--LVTEMKEHEKRVWSIDYSS-ADP-TLLASGSDDGSVKLWSINQGVSI------GTIK--T---KANICCVQF  625 (793)
T ss_pred             ECCCCe--EEEEecCCCCCEEEEEEcC-CCC-CEEEEEcCCCEEEEEECCCCcEE------EEEe--c---CCCeEEEEE
Confidence            997543  3444544567788777542 344 46778888999998888632111      1111  1   123445666


Q ss_pred             -ccccceeeEEecCCcEEEEEcC
Q 047869         2018 -SSTYKLLFLSFQDGTTLVGRLS 2039 (2233)
Q Consensus      2018 -S~tl~LLF~SY~~G~Sf~a~Ls 2039 (2233)
                       ++.-++|..+..+|+..+-.+.
T Consensus       626 ~~~~g~~latgs~dg~I~iwD~~  648 (793)
T PLN00181        626 PSESGRSLAFGSADHKVYYYDLR  648 (793)
T ss_pred             eCCCCCEEEEEeCCCeEEEEECC
Confidence             3456677777777777766553


No 15 
>KOG0291 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=95.79  E-value=5.7  Score=52.75  Aligned_cols=150  Identities=16%  Similarity=0.274  Sum_probs=96.4

Q ss_pred             eecccCceE--EEeeCCeEEEEechhhhcccccCCccccccccccccccccceEEEEeecccCccceEEeecccceEEEE
Q 047869         1814 LSVSSRGRL--AVGEGDKVAIFDVGQLIGQATIQPVTADKTNVKPLSRNIVRFEIVHLAFNSIVENYLTVAGYEDCQVLT 1891 (2233)
Q Consensus      1814 LSas~rGrL--AVaEgdKVTILqlsaLLkQad~s~~skdKlTLtrLSsa~VpFeVlsLafNP~nEdyLAVcGLkDC~VLT 1891 (2233)
                      |+.+..|++  ||-|+|..-+..+-     .....   ...++        .-.|-.|.|+| |+.++||+==+..+|..
T Consensus        61 ialSp~g~lllavdE~g~~~lvs~~-----~r~Vl---h~f~f--------k~~v~~i~fSP-ng~~fav~~gn~lqiw~  123 (893)
T KOG0291|consen   61 IALSPDGTLLLAVDERGRALLVSLL-----SRSVL---HRFNF--------KRGVGAIKFSP-NGKFFAVGCGNLLQIWH  123 (893)
T ss_pred             EEeCCCceEEEEEcCCCcEEEEecc-----cceee---EEEee--------cCccceEEECC-CCcEEEEEecceeEEEe
Confidence            566677765  45688887655541     11100   00111        12356899988 99999999999999998


Q ss_pred             ecCCCceeeeee---eeeccCCce--EEEeEEecCCCceEEEEecC--eEEEEeCcCCCCCCcEEEEcCCCCeeEEEEEE
Q 047869         1892 LNPRGEVTDRLA---IELALQGAY--IRRVDWVPGSPVQLMVVTNK--FVKIYDLSQDNISPLHYFTLPDDMIVDATLVI 1964 (2233)
Q Consensus      1892 fss~GeV~DRL~---LeL~Leg~f--IIKa~WLPGSQt~LAVVT~~--FVKIYDLS~D~lSPvyyF~LpsGkIrDaTfv~ 1964 (2233)
                      .  -|++.+...   ++=..-|-|  |+-+.|...|..  -++|.+  .+|||+...-.---.|.+.=-.|.|..+-|..
T Consensus       124 ~--P~~~~~~~~pFvl~r~~~g~fddi~si~Ws~DSr~--l~~gsrD~s~rl~~v~~~k~~~~~~l~gHkd~VvacfF~~  199 (893)
T KOG0291|consen  124 A--PGEIKNEFNPFVLHRTYLGHFDDITSIDWSDDSRL--LVTGSRDLSARLFGVDGNKNLFTYALNGHKDYVVACFFGA  199 (893)
T ss_pred             c--CcchhcccCcceEeeeecCCccceeEEEeccCCce--EEeccccceEEEEEeccccccceEeccCCCcceEEEEecc
Confidence            7  234444221   121222555  999999999954  445444  89999987765544444444466777776665


Q ss_pred             ecCCcEEEEEEecCCceEEEEec
Q 047869         1965 ASRGKMFLIVLSECGSLYRLELS 1987 (2233)
Q Consensus      1965 ~e~G~~~ILVLSS~G~LY~Qels 1987 (2233)
                      +   .+.++..|.+|+|++=...
T Consensus       200 ~---~~~l~tvskdG~l~~W~~~  219 (893)
T KOG0291|consen  200 N---SLDLYTVSKDGALFVWTCD  219 (893)
T ss_pred             C---cceEEEEecCceEEEEEec
Confidence            3   3669999999999976555


No 16 
>PTZ00421 coronin; Provisional
Probab=95.19  E-value=2.6  Score=53.91  Aligned_cols=196  Identities=14%  Similarity=0.153  Sum_probs=111.5

Q ss_pred             CceEEEe-eCCeEEEEechhh-hcccccCCccccccccccccccccceEEEEeecccCccceEEeecccceEEEEecC-C
Q 047869         1819 RGRLAVG-EGDKVAIFDVGQL-IGQATIQPVTADKTNVKPLSRNIVRFEIVHLAFNSIVENYLTVAGYEDCQVLTLNP-R 1895 (2233)
Q Consensus      1819 rGrLAVa-EgdKVTILqlsaL-LkQad~s~~skdKlTLtrLSsa~VpFeVlsLafNP~nEdyLAVcGLkDC~VLTfss-~ 1895 (2233)
                      ..+||.+ +.++|.|+++..- +.....       ..+..+.  .-.-.|..++|+|..+++||.+|. |..|..++- .
T Consensus        88 ~~~LaSgS~DgtIkIWdi~~~~~~~~~~-------~~l~~L~--gH~~~V~~l~f~P~~~~iLaSgs~-DgtVrIWDl~t  157 (493)
T PTZ00421         88 PQKLFTASEDGTIMGWGIPEEGLTQNIS-------DPIVHLQ--GHTKKVGIVSFHPSAMNVLASAGA-DMVVNVWDVER  157 (493)
T ss_pred             CCEEEEEeCCCEEEEEecCCCccccccC-------cceEEec--CCCCcEEEEEeCcCCCCEEEEEeC-CCEEEEEECCC
Confidence            3456654 8889999998431 000000       0011111  113457899999988889998886 444444442 3


Q ss_pred             CceeeeeeeeeccCCceEEEeEEecCCCceEEEEecC-eEEEEeCcCCCCCCcEEEEcCCCCeeEEEEEEecCCcEEEEE
Q 047869         1896 GEVTDRLAIELALQGAYIRRVDWVPGSPVQLMVVTNK-FVKIYDLSQDNISPLHYFTLPDDMIVDATLVIASRGKMFLIV 1974 (2233)
Q Consensus      1896 GeV~DRL~LeL~Leg~fIIKa~WLPGSQt~LAVVT~~-FVKIYDLS~D~lSPvyyF~LpsGkIrDaTfv~~e~G~~~ILV 1974 (2233)
                      |+...    .+.-....|..+.|-|... .||....+ .|+|||+....  +++.+.-..|.+...+++. .++..++.+
T Consensus       158 g~~~~----~l~~h~~~V~sla~spdG~-lLatgs~Dg~IrIwD~rsg~--~v~tl~~H~~~~~~~~~w~-~~~~~ivt~  229 (493)
T PTZ00421        158 GKAVE----VIKCHSDQITSLEWNLDGS-LLCTTSKDKKLNIIDPRDGT--IVSSVEAHASAKSQRCLWA-KRKDLIITL  229 (493)
T ss_pred             CeEEE----EEcCCCCceEEEEEECCCC-EEEEecCCCEEEEEECCCCc--EEEEEecCCCCcceEEEEc-CCCCeEEEE
Confidence            33222    2222356799999999754 46655554 99999997543  5666655555544444443 333333322


Q ss_pred             E---ecCCceEEEEecccCCCccccceeeeecccccccCCeEEEEeccccceeeEEe-cCCcEEEEEcCC
Q 047869         1975 L---SECGSLYRLELSVEGNVGATPLKEIIQFNDREIHAKGLSLYFSSTYKLLFLSF-QDGTTLVGRLSP 2040 (2233)
Q Consensus      1975 L---SS~G~LY~Qels~s~d~g~~~ltEvvq~~~~q~~~~GVSVyYS~tl~LLF~SY-~~G~Sf~a~Ls~ 2040 (2233)
                      -   +++|.|..=++.....  ....      ...+....-...+|+++-++|++.- .+|...+-.+..
T Consensus       230 G~s~s~Dr~VklWDlr~~~~--p~~~------~~~d~~~~~~~~~~d~d~~~L~lggkgDg~Iriwdl~~  291 (493)
T PTZ00421        230 GCSKSQQRQIMLWDTRKMAS--PYST------VDLDQSSALFIPFFDEDTNLLYIGSKGEGNIRCFELMN  291 (493)
T ss_pred             ecCCCCCCeEEEEeCCCCCC--ceeE------eccCCCCceEEEEEcCCCCEEEEEEeCCCeEEEEEeeC
Confidence            2   3468888777763221  1111      1111223344568999999998875 478777766644


No 17 
>KOG0318 consensus WD40 repeat stress protein/actin interacting protein [Cytoskeleton]
Probab=93.20  E-value=22  Score=46.09  Aligned_cols=262  Identities=21%  Similarity=0.248  Sum_probs=140.9

Q ss_pred             EEEEeecccCccceEEeecccceEEEEecCCCceeeeeeeeeccCCceEEEeEEecCCCceEEEEecCeEEEEeCcCCCC
Q 047869         1865 EIVHLAFNSIVENYLTVAGYEDCQVLTLNPRGEVTDRLAIELALQGAYIRRVDWVPGSPVQLMVVTNKFVKIYDLSQDNI 1944 (2233)
Q Consensus      1865 eVlsLafNP~nEdyLAVcGLkDC~VLTfss~GeV~DRL~LeL~Leg~fIIKa~WLPGSQt~LAVVT~~FVKIYDLS~D~l 1944 (2233)
                      =|-.+-|||- +.+.|-+|-.-=-++-=+..|+.+-.+.=..+..| -|--+.|-|.|+..+-+-.-..+||||.+...+
T Consensus       192 FV~~VRysPD-G~~Fat~gsDgki~iyDGktge~vg~l~~~~aHkG-sIfalsWsPDs~~~~T~SaDkt~KIWdVs~~sl  269 (603)
T KOG0318|consen  192 FVNCVRYSPD-GSRFATAGSDGKIYIYDGKTGEKVGELEDSDAHKG-SIFALSWSPDSTQFLTVSADKTIKIWDVSTNSL  269 (603)
T ss_pred             ceeeEEECCC-CCeEEEecCCccEEEEcCCCccEEEEecCCCCccc-cEEEEEECCCCceEEEecCCceEEEEEeeccce
Confidence            3678999995 77777777654333333555655544433333332 244678999987655455555999999999955


Q ss_pred             CCcEEEEcCCCCeeEEEEEEecCCcEEEEEEecCCceEEEEecccCCCcccc---ceeeeecccccccCCeEEEEecccc
Q 047869         1945 SPLHYFTLPDDMIVDATLVIASRGKMFLIVLSECGSLYRLELSVEGNVGATP---LKEIIQFNDREIHAKGLSLYFSSTY 2021 (2233)
Q Consensus      1945 SPvyyF~LpsGkIrDaTfv~~e~G~~~ILVLSS~G~LY~Qels~s~d~g~~~---ltEvvq~~~~q~~~~GVSVyYS~tl 2021 (2233)
                      .-  +|.+.+ +|.|--+.+-+. +-+||..|-.|.|=+-+.+.-. .....   .+.+   ...-+..+| +--||-.+
T Consensus       270 v~--t~~~~~-~v~dqqvG~lWq-kd~lItVSl~G~in~ln~~d~~-~~~~i~GHnK~I---TaLtv~~d~-~~i~Sgsy  340 (603)
T KOG0318|consen  270 VS--TWPMGS-TVEDQQVGCLWQ-KDHLITVSLSGTINYLNPSDPS-VLKVISGHNKSI---TALTVSPDG-KTIYSGSY  340 (603)
T ss_pred             EE--EeecCC-chhceEEEEEEe-CCeEEEEEcCcEEEEecccCCC-hhheecccccce---eEEEEcCCC-CEEEeecc
Confidence            43  444443 477766655444 3478889999999877665211 10000   0000   011123334 56899999


Q ss_pred             ceeeEEecCCcEEEEEcCCC--cccc-----cceeEEEEcc-CCCCCCCcc--cce-----eeccCCCceEEEEeccCCC
Q 047869         2022 KLLFLSFQDGTTLVGRLSPN--AASL-----SEVSYVFEEQ-DGKLRSAGL--HRW-----KELLASSGLFFCFSSLKSN 2086 (2233)
Q Consensus      2022 ~LLF~SY~~G~Sf~a~Ls~~--~~sv-----~eis~Vfe~~-~gk~~~a~L--~qW-----sEV~~hPGLf~cls~~~sn 2086 (2233)
                      .-++.+...|+=+..++-..  +.-+     .+-..+|... |...+.-++  .+.     -++-.-|- -.|+..  .+
T Consensus       341 DG~I~~W~~~~g~~~~~~g~~h~nqI~~~~~~~~~~~~t~g~Dd~l~~~~~~~~~~t~~~~~~lg~QP~-~lav~~--d~  417 (603)
T KOG0318|consen  341 DGHINSWDSGSGTSDRLAGKGHTNQIKGMAASESGELFTIGWDDTLRVISLKDNGYTKSEVVKLGSQPK-GLAVLS--DG  417 (603)
T ss_pred             CceEEEEecCCccccccccccccceEEEEeecCCCcEEEEecCCeEEEEecccCcccccceeecCCCce-eEEEcC--CC
Confidence            99999999888888777321  1001     1112222221 222222111  111     12222232 234222  44


Q ss_pred             ceEEEEecCCceeeeccccccCCCCC--eEEEEEeecCCCCCeEEEEEeeCCceeEEecc
Q 047869         2087 AAVAVSLGTNELIAQNMRHAAGSTSP--LVGVTAYKPLSKDKVHCLVLHDDGSLQIYSHV 2144 (2233)
Q Consensus      2087 ~pvvv~l~pd~I~iQeiK~~~~sSs~--vdgva~y~p~s~~rttlLLLcEDGSLrIYsa~ 2144 (2233)
                      +..+|....+-.+.|..+-.....-.  ..++|+    +.++....+=-+||-++||+-+
T Consensus       418 ~~avv~~~~~iv~l~~~~~~~~~~~~y~~s~vAv----~~~~~~vaVGG~Dgkvhvysl~  473 (603)
T KOG0318|consen  418 GTAVVACISDIVLLQDQTKVSSIPIGYESSAVAV----SPDGSEVAVGGQDGKVHVYSLS  473 (603)
T ss_pred             CEEEEEecCcEEEEecCCcceeeccccccceEEE----cCCCCEEEEecccceEEEEEec
Confidence            44555555555555544432111100  112222    3477788888999999999843


No 18 
>KOG0283 consensus WD40 repeat-containing protein [Function unknown]
Probab=92.89  E-value=0.63  Score=61.17  Aligned_cols=117  Identities=26%  Similarity=0.450  Sum_probs=80.2

Q ss_pred             EEEeecccCccceEE---------eecccceEEE------------EecCCCc--eee------------------eeee
Q 047869         1866 IVHLAFNSIVENYLT---------VAGYEDCQVL------------TLNPRGE--VTD------------------RLAI 1904 (2233)
Q Consensus      1866 VlsLafNP~nEdyLA---------VcGLkDC~VL------------Tfss~Ge--V~D------------------RL~L 1904 (2233)
                      |-.|+|||.+++|.+         .|.+-||+|.            ++.|+|+  |+-                  ..+|
T Consensus       412 VTcVaFnPvDDryFiSGSLD~KvRiWsI~d~~Vv~W~Dl~~lITAvcy~PdGk~avIGt~~G~C~fY~t~~lk~~~~~~I  491 (712)
T KOG0283|consen  412 VTCVAFNPVDDRYFISGSLDGKVRLWSISDKKVVDWNDLRDLITAVCYSPDGKGAVIGTFNGYCRFYDTEGLKLVSDFHI  491 (712)
T ss_pred             eEEEEecccCCCcEeecccccceEEeecCcCeeEeehhhhhhheeEEeccCCceEEEEEeccEEEEEEccCCeEEEeeeE
Confidence            568999999999976         6888888875            4556663  222                  1122


Q ss_pred             eecc----CCceEEEeEEecCCCceEEEEecC-eEEEEeCcCCCCCCcEEEEcC--CCCeeEEEEEEecCCcEEEEEEec
Q 047869         1905 ELAL----QGAYIRRVDWVPGSPVQLMVVTNK-FVKIYDLSQDNISPLHYFTLP--DDMIVDATLVIASRGKMFLIVLSE 1977 (2233)
Q Consensus      1905 eL~L----eg~fIIKa~WLPGSQt~LAVVT~~-FVKIYDLS~D~lSPvyyF~Lp--sGkIrDaTfv~~e~G~~~ILVLSS 1977 (2233)
                      .+.-    .+--|.-.+-.|+...++.|+|+| .|+|||+-.-+  +++-|.=+  .+.-..|.|.  .+|+ +||..|+
T Consensus       492 ~~~~~Kk~~~~rITG~Q~~p~~~~~vLVTSnDSrIRI~d~~~~~--lv~KfKG~~n~~SQ~~Asfs--~Dgk-~IVs~se  566 (712)
T KOG0283|consen  492 RLHNKKKKQGKRITGLQFFPGDPDEVLVTSNDSRIRIYDGRDKD--LVHKFKGFRNTSSQISASFS--SDGK-HIVSASE  566 (712)
T ss_pred             eeccCccccCceeeeeEecCCCCCeEEEecCCCceEEEeccchh--hhhhhcccccCCcceeeeEc--cCCC-EEEEeec
Confidence            2221    133699999999999999999999 89999996544  33444422  2233455554  4785 7888889


Q ss_pred             CCceEEEEec
Q 047869         1978 CGSLYRLELS 1987 (2233)
Q Consensus      1978 ~G~LY~Qels 1987 (2233)
                      +-++|+=.+.
T Consensus       567 Ds~VYiW~~~  576 (712)
T KOG0283|consen  567 DSWVYIWKND  576 (712)
T ss_pred             CceEEEEeCC
Confidence            9999976654


No 19 
>KOG2110 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=92.87  E-value=9.5  Score=47.57  Aligned_cols=164  Identities=19%  Similarity=0.226  Sum_probs=105.4

Q ss_pred             cccccccccccccccceEEEEeecccCccceEEeecccceEEEEecCCCceeee-----------eeeeeccCCceEEEe
Q 047869         1848 TADKTNVKPLSRNIVRFEIVHLAFNSIVENYLTVAGYEDCQVLTLNPRGEVTDR-----------LAIELALQGAYIRRV 1916 (2233)
Q Consensus      1848 skdKlTLtrLSsa~VpFeVlsLafNP~nEdyLAVcGLkDC~VLTfss~GeV~DR-----------L~LeL~Leg~fIIKa 1916 (2233)
                      .|+|.+++.+   --|-+|++|..   |.+.|+||=-++.+|+-|+. =...+-           .++.+..++.|+   
T Consensus        75 ~Kk~~~ICe~---~fpt~IL~Vrm---Nr~RLvV~Lee~IyIydI~~-MklLhTI~t~~~n~~gl~AlS~n~~n~yl---  144 (391)
T KOG2110|consen   75 FKKKTTICEI---FFPTSILAVRM---NRKRLVVCLEESIYIYDIKD-MKLLHTIETTPPNPKGLCALSPNNANCYL---  144 (391)
T ss_pred             cccCceEEEE---ecCCceEEEEE---ccceEEEEEcccEEEEeccc-ceeehhhhccCCCccceEeeccCCCCceE---
Confidence            4777888776   46788999988   88999999988888887731 111111           233444445564   


Q ss_pred             EEecCCCceEEEEecCeEEEEeCcCCCCCCcEEEEcCCCCeeEEEEEEecCCcEEEEEEecCCceEE-EEecccCCCccc
Q 047869         1917 DWVPGSPVQLMVVTNKFVKIYDLSQDNISPLHYFTLPDDMIVDATLVIASRGKMFLIVLSECGSLYR-LELSVEGNVGAT 1995 (2233)
Q Consensus      1917 ~WLPGSQt~LAVVT~~FVKIYDLS~D~lSPvyyF~LpsGkIrDaTfv~~e~G~~~ILVLSS~G~LY~-Qels~s~d~g~~ 1995 (2233)
                       =.|+|++      .--|.|||+  +++.|+-++-.=.|.+.-.+|  ..+|+ .|--.|+.|-|-+ ..++..+-.   
T Consensus       145 -Ayp~s~t------~GdV~l~d~--~nl~~v~~I~aH~~~lAalaf--s~~G~-llATASeKGTVIRVf~v~~G~kl---  209 (391)
T KOG2110|consen  145 -AYPGSTT------SGDVVLFDT--INLQPVNTINAHKGPLAALAF--SPDGT-LLATASEKGTVIRVFSVPEGQKL---  209 (391)
T ss_pred             -EecCCCC------CceEEEEEc--ccceeeeEEEecCCceeEEEE--CCCCC-EEEEeccCceEEEEEEcCCccEe---
Confidence             3578855      788999996  578899888888888876664  34553 3444455555442 222111000   


Q ss_pred             cceeeeecccccccCCeEEEEeccccceeeEEecCCcEEEEEcCCC
Q 047869         1996 PLKEIIQFNDREIHAKGLSLYFSSTYKLLFLSFQDGTTLVGRLSPN 2041 (2233)
Q Consensus      1996 ~ltEvvq~~~~q~~~~GVSVyYS~tl~LLF~SY~~G~Sf~a~Ls~~ 2041 (2233)
                        .|   .--+-....=.|+-||++-++|-+|=+.++..+.+|+..
T Consensus       210 --~e---FRRG~~~~~IySL~Fs~ds~~L~~sS~TeTVHiFKL~~~  250 (391)
T KOG2110|consen  210 --YE---FRRGTYPVSIYSLSFSPDSQFLAASSNTETVHIFKLEKV  250 (391)
T ss_pred             --ee---eeCCceeeEEEEEEECCCCCeEEEecCCCeEEEEEeccc
Confidence              00   001111233567889999999999999999999888554


No 20 
>TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol.
Probab=92.72  E-value=10  Score=42.26  Aligned_cols=191  Identities=13%  Similarity=0.168  Sum_probs=101.7

Q ss_pred             eecccCc-eEEEe--eCCeEEEEechhhhcccccCCccccccccccccccccceEEEEeecccCccceEEeecccceEEE
Q 047869         1814 LSVSSRG-RLAVG--EGDKVAIFDVGQLIGQATIQPVTADKTNVKPLSRNIVRFEIVHLAFNSIVENYLTVAGYEDCQVL 1890 (2233)
Q Consensus      1814 LSas~rG-rLAVa--EgdKVTILqlsaLLkQad~s~~skdKlTLtrLSsa~VpFeVlsLafNP~nEdyLAVcGLkDC~VL 1890 (2233)
                      +..+..| .+++.  ..+.+.++++..-   ..             +...+.+..+.+++++| ++++++++.-..-.+.
T Consensus        78 ~~~~~~g~~l~~~~~~~~~l~~~d~~~~---~~-------------~~~~~~~~~~~~~~~~~-dg~~l~~~~~~~~~~~  140 (300)
T TIGR03866        78 FALHPNGKILYIANEDDNLVTVIDIETR---KV-------------LAEIPVGVEPEGMAVSP-DGKIVVNTSETTNMAH  140 (300)
T ss_pred             EEECCCCCEEEEEcCCCCeEEEEECCCC---eE-------------EeEeeCCCCcceEEECC-CCCEEEEEecCCCeEE
Confidence            3444444 46554  4568888887431   00             01111122345788888 7788888776532333


Q ss_pred             EecCC-CceeeeeeeeeccCCceEEEeEEecCCCceEEEEe--cCeEEEEeCcCCCCCCcEEEEcC---CCCeeEEEEEE
Q 047869         1891 TLNPR-GEVTDRLAIELALQGAYIRRVDWVPGSPVQLMVVT--NKFVKIYDLSQDNISPLHYFTLP---DDMIVDATLVI 1964 (2233)
Q Consensus      1891 Tfss~-GeV~DRL~LeL~Leg~fIIKa~WLPGSQt~LAVVT--~~FVKIYDLS~D~lSPvyyF~Lp---sGkIrDaTfv~ 1964 (2233)
                      .++.+ |++...+.     .+.-+..+.|-|..+. |++.+  ...|+|||+.....--.+.+..+   ++.+.-..+.+
T Consensus       141 ~~d~~~~~~~~~~~-----~~~~~~~~~~s~dg~~-l~~~~~~~~~v~i~d~~~~~~~~~~~~~~~~~~~~~~~~~~i~~  214 (300)
T TIGR03866       141 FIDTKTYEIVDNVL-----VDQRPRFAEFTADGKE-LWVSSEIGGTVSVIDVATRKVIKKITFEIPGVHPEAVQPVGIKL  214 (300)
T ss_pred             EEeCCCCeEEEEEE-----cCCCccEEEECCCCCE-EEEEcCCCCEEEEEEcCcceeeeeeeecccccccccCCccceEE
Confidence            33332 33332221     1233456789888765 43433  56899999987654333333322   12222223445


Q ss_pred             ecCCcEEEEEEecCCceEEEEecccCCCccccceeeeecccccccCCeEEEEeccccceeeEE-ecCCcEEEEEc
Q 047869         1965 ASRGKMFLIVLSECGSLYRLELSVEGNVGATPLKEIIQFNDREIHAKGLSLYFSSTYKLLFLS-FQDGTTLVGRL 2038 (2233)
Q Consensus      1965 ~e~G~~~ILVLSS~G~LY~Qels~s~d~g~~~ltEvvq~~~~q~~~~GVSVyYS~tl~LLF~S-Y~~G~Sf~a~L 2038 (2233)
                      .++|+...+....++.++.-++. .+   ..  ...+.  .   .+.-.++.++++-+.|+++ ..+|+-.+-.+
T Consensus       215 s~dg~~~~~~~~~~~~i~v~d~~-~~---~~--~~~~~--~---~~~~~~~~~~~~g~~l~~~~~~~~~i~v~d~  278 (300)
T TIGR03866       215 TKDGKTAFVALGPANRVAVVDAK-TY---EV--LDYLL--V---GQRVWQLAFTPDEKYLLTTNGVSNDVSVIDV  278 (300)
T ss_pred             CCCCCEEEEEcCCCCeEEEEECC-CC---cE--EEEEE--e---CCCcceEEECCCCCEEEEEcCCCCeEEEEEC
Confidence            67787655556667777765553 11   11  11111  1   1223467788988888875 45777655444


No 21 
>PTZ00421 coronin; Provisional
Probab=92.51  E-value=43  Score=43.40  Aligned_cols=153  Identities=12%  Similarity=0.079  Sum_probs=94.0

Q ss_pred             cCccceEEee--cccceEEEEecCCCceeeeeeeeeccCCceEEEeEEecCCCceEEEEecC-eEEEEeCcCCCC-----
Q 047869         1873 SIVENYLTVA--GYEDCQVLTLNPRGEVTDRLAIELALQGAYIRRVDWVPGSPVQLMVVTNK-FVKIYDLSQDNI----- 1944 (2233)
Q Consensus      1873 P~nEdyLAVc--GLkDC~VLTfss~GeV~DRL~LeL~Leg~fIIKa~WLPGSQt~LAVVT~~-FVKIYDLS~D~l----- 1944 (2233)
                      -+|.+|+|+.  +.-..-|+.++..|.+.+.. ..+.--...|..+.|-|.....||....+ .|+|||+.....     
T Consensus        37 ~~n~~~~a~~w~~~gg~~v~~~~~~G~~~~~~-~~l~GH~~~V~~v~fsP~d~~~LaSgS~DgtIkIWdi~~~~~~~~~~  115 (493)
T PTZ00421         37 ACNDRFIAVPWQQLGSTAVLKHTDYGKLASNP-PILLGQEGPIIDVAFNPFDPQKLFTASEDGTIMGWGIPEEGLTQNIS  115 (493)
T ss_pred             eECCceEEEEEecCCceEEeeccccccCCCCC-ceEeCCCCCEEEEEEcCCCCCEEEEEeCCCEEEEEecCCCccccccC
Confidence            3477888762  11123567777777654421 11111256799999999444457766665 899999986543     


Q ss_pred             CCcEEEEcCCCCeeEEEEEEecCCcEEEEEEecCCceEEEEecccCCCccccceeeeecccccccCCeEEEEecccccee
Q 047869         1945 SPLHYFTLPDDMIVDATLVIASRGKMFLIVLSECGSLYRLELSVEGNVGATPLKEIIQFNDREIHAKGLSLYFSSTYKLL 2024 (2233)
Q Consensus      1945 SPvyyF~LpsGkIrDaTfv~~e~G~~~ILVLSS~G~LY~Qels~s~d~g~~~ltEvvq~~~~q~~~~GVSVyYS~tl~LL 2024 (2233)
                      .|...+.--.+.|..+.|-  +.+..+++..+.+|.|.+-++.....      ..  ...+  -...-.++-++++-++|
T Consensus       116 ~~l~~L~gH~~~V~~l~f~--P~~~~iLaSgs~DgtVrIWDl~tg~~------~~--~l~~--h~~~V~sla~spdG~lL  183 (493)
T PTZ00421        116 DPIVHLQGHTKKVGIVSFH--PSAMNVLASAGADMVVNVWDVERGKA------VE--VIKC--HSDQITSLEWNLDGSLL  183 (493)
T ss_pred             cceEEecCCCCcEEEEEeC--cCCCCEEEEEeCCCEEEEEECCCCeE------EE--EEcC--CCCceEEEEEECCCCEE
Confidence            4666666667778777653  44445777778899999888863210      01  1111  11223467778877777


Q ss_pred             eEEecCCcEEEEEc
Q 047869         2025 FLSFQDGTTLVGRL 2038 (2233)
Q Consensus      2025 F~SY~~G~Sf~a~L 2038 (2233)
                      ..+-.+|+..+-.+
T Consensus       184 atgs~Dg~IrIwD~  197 (493)
T PTZ00421        184 CTTSKDKKLNIIDP  197 (493)
T ss_pred             EEecCCCEEEEEEC
Confidence            77777777666554


No 22 
>KOG0289 consensus mRNA splicing factor [General function prediction only]
Probab=91.36  E-value=4.3  Score=51.21  Aligned_cols=136  Identities=18%  Similarity=0.204  Sum_probs=95.5

Q ss_pred             ccccccccceEEEEeecccCccceEEeecccceEEEEecCCCceeeeeeeeeccCCceEEEeEEecCCCceEEEEecC-e
Q 047869         1855 KPLSRNIVRFEIVHLAFNSIVENYLTVAGYEDCQVLTLNPRGEVTDRLAIELALQGAYIRRVDWVPGSPVQLMVVTNK-F 1933 (2233)
Q Consensus      1855 trLSsa~VpFeVlsLafNP~nEdyLAVcGLkDC~VLTfss~GeV~DRL~LeL~Leg~fIIKa~WLPGSQt~LAVVT~~-F 1933 (2233)
                      .|.++.+..=.|..+.-+| +++||.-+..+.|..+.-=++|...=...-+  ..+-=+-.+..=|..-- ++.=|.+ -
T Consensus       295 ~~~~~~~h~~~V~~ls~h~-tgeYllsAs~d~~w~Fsd~~~g~~lt~vs~~--~s~v~~ts~~fHpDgLi-fgtgt~d~~  370 (506)
T KOG0289|consen  295 EPTSSRPHEEPVTGLSLHP-TGEYLLSASNDGTWAFSDISSGSQLTVVSDE--TSDVEYTSAAFHPDGLI-FGTGTPDGV  370 (506)
T ss_pred             Cccccccccccceeeeecc-CCcEEEEecCCceEEEEEccCCcEEEEEeec--cccceeEEeeEcCCceE-EeccCCCce
Confidence            4566666666777777666 8899999999999999988887654333332  11223455556666522 3444444 7


Q ss_pred             EEEEeCcCCCCCCcEEEEcCCCCeeEEEEEEecCCcEEEEEEecCCceEEEEecccCCCcccccee
Q 047869         1934 VKIYDLSQDNISPLHYFTLPDDMIVDATLVIASRGKMFLIVLSECGSLYRLELSVEGNVGATPLKE 1999 (2233)
Q Consensus      1934 VKIYDLS~D~lSPvyyF~LpsGkIrDaTfv~~e~G~~~ILVLSS~G~LY~Qels~s~d~g~~~ltE 1999 (2233)
                      ||||||+.-+  -+-.|-.-+|.|+...|-  ++| |++.+.+++|.+..=++++.-+...+.+.|
T Consensus       371 vkiwdlks~~--~~a~Fpght~~vk~i~Fs--ENG-Y~Lat~add~~V~lwDLRKl~n~kt~~l~~  431 (506)
T KOG0289|consen  371 VKIWDLKSQT--NVAKFPGHTGPVKAISFS--ENG-YWLATAADDGSVKLWDLRKLKNFKTIQLDE  431 (506)
T ss_pred             EEEEEcCCcc--ccccCCCCCCceeEEEec--cCc-eEEEEEecCCeEEEEEehhhcccceeeccc
Confidence            9999998877  334565678999988864  777 999999999999999999776554444444


No 23 
>PF10214 Rrn6:  RNA polymerase I-specific transcription-initiation factor;  InterPro: IPR019350  RNA polymerase I-specific transcription-initiation factor Rrn6 and Rrn7 represent components of a multisubunit transcription factor essential for the initiation of rDNA transcription by Pol I []. These proteins are found in fungi. 
Probab=91.15  E-value=8.4  Score=51.78  Aligned_cols=172  Identities=16%  Similarity=0.231  Sum_probs=105.9

Q ss_pred             hcCcccccceec---c---cC-ceEEEeeCCeEEEEec--hhhhcccccCCccccccccccccccc----cceEEEEeec
Q 047869         1805 ASGSLVKSLLSV---S---SR-GRLAVGEGDKVAIFDV--GQLIGQATIQPVTADKTNVKPLSRNI----VRFEIVHLAF 1871 (2233)
Q Consensus      1805 ~sGq~iRqLLSa---s---~r-GrLAVaEgdKVTILql--saLLkQad~s~~skdKlTLtrLSsa~----VpFeVlsLaf 1871 (2233)
                      .-|.-|+|+--+   .   .. +-|||.-.-+++|++.  ...+.-..   ....++...|+..-+    -+|+...++|
T Consensus        77 ~~~~PI~qI~fa~~~~~~~~~~~~l~Vrt~~st~I~~p~~~~~~~~~~---~~~s~i~~~~l~~i~~~~tgg~~~aDv~F  153 (765)
T PF10214_consen   77 DDGSPIKQIKFATLSESFDEKSRWLAVRTETSTTILRPEYHRVISSIR---SRPSRIDPNPLLTISSSDTGGFPHADVAF  153 (765)
T ss_pred             CCCCCeeEEEecccccccCCcCcEEEEEcCCEEEEEEccccccccccc---CCccccccceeEEechhhcCCCccceEEe
Confidence            467788887665   2   22 4689999999999993  33311111   123445666665433    6899999999


Q ss_pred             ccCccceEEee---cccceEEEEe-cCCCceeeeeeeeec----c--C---CceEEEeEEecCCCceEEEEecCeEEEEe
Q 047869         1872 NSIVENYLTVA---GYEDCQVLTL-NPRGEVTDRLAIELA----L--Q---GAYIRRVDWVPGSPVQLMVVTNKFVKIYD 1938 (2233)
Q Consensus      1872 NP~nEdyLAVc---GLkDC~VLTf-ss~GeV~DRL~LeL~----L--e---g~fIIKa~WLPGSQt~LAVVT~~FVKIYD 1938 (2233)
                      ||++...+||+   |+.-  |..+ .......+.+.+...    +  +   -.-..++.|++.... |.|.+...+.+||
T Consensus       154 nP~~~~q~AiVD~~G~Ws--vw~i~~~~~~~~~~~~~~~~~~gsi~~d~~e~s~w~rI~W~~~~~~-lLv~~r~~l~~~d  230 (765)
T PF10214_consen  154 NPWDQRQFAIVDEKGNWS--VWDIKGRPKRKSSNLRLSRNISGSIIFDPEELSNWKRILWVSDSNR-LLVCNRSKLMLID  230 (765)
T ss_pred             ccCccceEEEEeccCcEE--EEEeccccccCCcceeeccCCCccccCCCcccCcceeeEecCCCCE-EEEEcCCceEEEE
Confidence            99999999985   4433  3333 000001111111110    0  1   122569999877654 7789999999999


Q ss_pred             CcCCCCCCcEEEEcC---CCCeeEEEEEEecCCcEEEEEEecCCceEEEEecc
Q 047869         1939 LSQDNISPLHYFTLP---DDMIVDATLVIASRGKMFLIVLSECGSLYRLELSV 1988 (2233)
Q Consensus      1939 LS~D~lSPvyyF~Lp---sGkIrDaTfv~~e~G~~~ILVLSS~G~LY~Qels~ 1988 (2233)
                      +..+...+.   ++.   ...|+|+.-.  .....+++|||+. .|+.-++..
T Consensus       231 ~~~~~~~~~---l~~~~~~~~IlDv~~~--~~~~~~~FiLTs~-eiiw~~~~~  277 (765)
T PF10214_consen  231 FESNWQTEY---LVTAKTWSWILDVKRS--PDNPSHVFILTSK-EIIWLDVKS  277 (765)
T ss_pred             CCCCCccch---hccCCChhheeeEEec--CCccceEEEEecC-eEEEEEccC
Confidence            997776554   333   3569998854  3344678888774 566555553


No 24 
>KOG0294 consensus WD40 repeat-containing protein [Function unknown]
Probab=90.82  E-value=3.8  Score=50.07  Aligned_cols=170  Identities=20%  Similarity=0.305  Sum_probs=108.5

Q ss_pred             HHhHHHhhcCcccccceecccCceEEEe--eCCeEEEEechhhhcccccCCccccccccccccccccceEEEEeecccCc
Q 047869         1798 RELKSHLASGSLVKSLLSVSSRGRLAVG--EGDKVAIFDVGQLIGQATIQPVTADKTNVKPLSRNIVRFEIVHLAFNSIV 1875 (2233)
Q Consensus      1798 relks~l~sGq~iRqLLSas~rGrLAVa--EgdKVTILqlsaLLkQad~s~~skdKlTLtrLSsa~VpFeVlsLafNP~n 1875 (2233)
                      .-+|+|  .||  =.-|++++-|+||+.  -.+++-.++|-.           .+..-+++|+..+-     .|.|.| .
T Consensus       121 ~slK~H--~~~--Vt~lsiHPS~KLALsVg~D~~lr~WNLV~-----------Gr~a~v~~L~~~at-----~v~w~~-~  179 (362)
T KOG0294|consen  121 KSLKAH--KGQ--VTDLSIHPSGKLALSVGGDQVLRTWNLVR-----------GRVAFVLNLKNKAT-----LVSWSP-Q  179 (362)
T ss_pred             eeeccc--ccc--cceeEecCCCceEEEEcCCceeeeehhhc-----------CccceeeccCCcce-----eeEEcC-C
Confidence            344454  444  345889999998865  344555555522           22223333433221     266664 5


Q ss_pred             cceEEeecccceEEEEecCCCceeeeeeeeeccCCceEEEeEEecCCCceEEEEe-cCeEEEEeCcCCCCCCcEEEEcCC
Q 047869         1876 ENYLTVAGYEDCQVLTLNPRGEVTDRLAIELALQGAYIRRVDWVPGSPVQLMVVT-NKFVKIYDLSQDNISPLHYFTLPD 1954 (2233)
Q Consensus      1876 EdyLAVcGLkDC~VLTfss~GeV~DRL~LeL~Leg~fIIKa~WLPGSQt~LAVVT-~~FVKIYDLS~D~lSPvyyF~Lps 1954 (2233)
                      +++.+|.|-+-+-|+-+... .|.-    +... ..-|..+-|+.++  .|+|-- ++-|+.+|=..  ..|-+.|.--+
T Consensus       180 Gd~F~v~~~~~i~i~q~d~A-~v~~----~i~~-~~r~l~~~~l~~~--~L~vG~d~~~i~~~D~ds--~~~~~~~~AH~  249 (362)
T KOG0294|consen  180 GDHFVVSGRNKIDIYQLDNA-SVFR----EIEN-PKRILCATFLDGS--ELLVGGDNEWISLKDTDS--DTPLTEFLAHE  249 (362)
T ss_pred             CCEEEEEeccEEEEEecccH-hHhh----hhhc-cccceeeeecCCc--eEEEecCCceEEEeccCC--Cccceeeecch
Confidence            66667777666666665322 1111    1111 1447888898887  666554 45899999655  88999999999


Q ss_pred             CCeeEEEEEEecCCcEEEEEEecCCceEEEEecccCCCcccccee
Q 047869         1955 DMIVDATLVIASRGKMFLIVLSECGSLYRLELSVEGNVGATPLKE 1999 (2233)
Q Consensus      1955 GkIrDaTfv~~e~G~~~ILVLSS~G~LY~Qels~s~d~g~~~ltE 1999 (2233)
                      -.|.|..++-++++ .+|+-.||+|.|-+=++++.....+..+.|
T Consensus       250 ~RVK~i~~~~~~~~-~~lvTaSSDG~I~vWd~~~~~k~~~~~l~e  293 (362)
T KOG0294|consen  250 NRVKDIASYTNPEH-EYLVTASSDGFIKVWDIDMETKKRPTLLAE  293 (362)
T ss_pred             hheeeeEEEecCCc-eEEEEeccCceEEEEEccccccCCcceeEE
Confidence            99999998877776 688899999999999998765544444443


No 25 
>PF04762 IKI3:  IKI3 family;  InterPro: IPR006849 Members of this family are components of the elongator multi-subunit component of a novel RNA polymerase II holoenzyme for transcriptional elongation [].
Probab=90.62  E-value=28  Score=48.18  Aligned_cols=122  Identities=20%  Similarity=0.268  Sum_probs=81.4

Q ss_pred             cceEEEEeecccCccceEEeecccceEEEEecC-------------CCcee-------eeeeeeeccC-Cce-EEEeEEe
Q 047869         1862 VRFEIVHLAFNSIVENYLTVAGYEDCQVLTLNP-------------RGEVT-------DRLAIELALQ-GAY-IRRVDWV 1919 (2233)
Q Consensus      1862 VpFeVlsLafNP~nEdyLAVcGLkDC~VLTfss-------------~GeV~-------DRL~LeL~Le-g~f-IIKa~WL 1919 (2233)
                      -...|..|..|+ +.+.|||+--..+|+.|.+.             ...+.       +-+.++.... |.| +.+..|.
T Consensus       303 ~~~~v~~l~Wn~-ds~iLAv~~~~~vqLWt~~NYHWYLKqei~~~~~~~~~~~~Wdpe~p~~L~v~t~~g~~~~~~~~~~  381 (928)
T PF04762_consen  303 EEEKVIELAWNS-DSEILAVWLEDRVQLWTRSNYHWYLKQEIRFSSSESVNFVKWDPEKPLRLHVLTSNGQYEIYDFAWD  381 (928)
T ss_pred             CCceeeEEEECC-CCCEEEEEecCCceEEEeeCCEEEEEEEEEccCCCCCCceEECCCCCCEEEEEecCCcEEEEEEEEE
Confidence            344679999998 77899998776788877632             21111       1123344444 333 5555565


Q ss_pred             cC--------CCceEEEEecCeEEEEeCcCCCCC-CcEEEEcC-CCCeeEEEEEEecCCcEEEEEEecCCceEEEEec
Q 047869         1920 PG--------SPVQLMVVTNKFVKIYDLSQDNIS-PLHYFTLP-DDMIVDATLVIASRGKMFLIVLSECGSLYRLELS 1987 (2233)
Q Consensus      1920 PG--------SQt~LAVVT~~FVKIYDLS~D~lS-PvyyF~Lp-sGkIrDaTfv~~e~G~~~ILVLSS~G~LY~Qels 1987 (2233)
                      ..        ..+..||+-.+.+++.+|..-++. |++.|.+. +..|.+++|-... .  .+.+++++|.|+.-...
T Consensus       382 v~~s~~~~~~D~g~vaVIDG~~lllTpf~~a~VPPPMs~~~l~~~~~v~~vaf~~~~-~--~~avl~~d~~l~~~~~~  456 (928)
T PF04762_consen  382 VSRSPGSSPNDNGTVAVIDGNKLLLTPFRRAVVPPPMSSYELELPSPVNDVAFSPSN-S--RFAVLTSDGSLSIYEWD  456 (928)
T ss_pred             EEecCCCCccCceEEEEEeCCeEEEecccccCCCchHhceEEcCCCCcEEEEEeCCC-C--eEEEEECCCCEEEEEec
Confidence            44        234578888888999999988887 67666665 5579999977432 2  28899999988866554


No 26 
>TIGR02658 TTQ_MADH_Hv methylamine dehydrogenase heavy chain. This family consists of the heavy chain of methylamine dehydrogenase, a periplasmic enzyme. The enzyme contains a tryptophan tryptophylquinone (TTQ) prosthetic group derived from two Trp residues in the light subunit. The enzyme forms a complex with the type I blue copper protein amicyanin and a cytochrome. Electron transfer proceeds from TTQ to the copper and then to the heme group of the cytochrome.
Probab=90.27  E-value=59  Score=40.79  Aligned_cols=26  Identities=19%  Similarity=0.365  Sum_probs=21.1

Q ss_pred             Eecc-ccceeeEEecCCcEEEEEcCCCc
Q 047869         2016 YFSS-TYKLLFLSFQDGTTLVGRLSPNA 2042 (2233)
Q Consensus      2016 yYS~-tl~LLF~SY~~G~Sf~a~Ls~~~ 2042 (2233)
                      +|+. +=+.+|+||+ |+.+...++...
T Consensus       200 ~~~~~dg~~~~vs~e-G~V~~id~~~~~  226 (352)
T TIGR02658       200 AYSNKSGRLVWPTYT-GKIFQIDLSSGD  226 (352)
T ss_pred             ceEcCCCcEEEEecC-CeEEEEecCCCc
Confidence            5566 7899999999 999998876554


No 27 
>KOG1446 consensus Histone H3 (Lys4) methyltransferase complex and RNA cleavage factor II complex, subunit SWD2 [RNA processing and modification; Chromatin structure and dynamics; Posttranslational modification, protein turnover, chaperones]
Probab=89.21  E-value=10  Score=46.34  Aligned_cols=106  Identities=22%  Similarity=0.328  Sum_probs=72.6

Q ss_pred             EEEEecC-eEEEEeCcCCCCCCcEEEEcCCCCeeEEE-EEEecCCcEEEEEEecCCceEEEEecccCCCccccceeeeec
Q 047869         1926 LMVVTNK-FVKIYDLSQDNISPLHYFTLPDDMIVDAT-LVIASRGKMFLIVLSECGSLYRLELSVEGNVGATPLKEIIQF 2003 (2233)
Q Consensus      1926 LAVVT~~-FVKIYDLS~D~lSPvyyF~LpsGkIrDaT-fv~~e~G~~~ILVLSS~G~LY~Qels~s~d~g~~~ltEvvq~ 2003 (2233)
                      +|++... .||+||+-.=---|--.|.++.+.-...+ +=+..+| .+|++.+..|.+|.-+=- .|   .  +....+.
T Consensus       155 fA~~~~~~~IkLyD~Rs~dkgPF~tf~i~~~~~~ew~~l~FS~dG-K~iLlsT~~s~~~~lDAf-~G---~--~~~tfs~  227 (311)
T KOG1446|consen  155 FALANGSELIKLYDLRSFDKGPFTTFSITDNDEAEWTDLEFSPDG-KSILLSTNASFIYLLDAF-DG---T--VKSTFSG  227 (311)
T ss_pred             EEEecCCCeEEEEEecccCCCCceeEccCCCCccceeeeEEcCCC-CEEEEEeCCCcEEEEEcc-CC---c--EeeeEee
Confidence            5666555 89999999888899999999954433333 3334677 589999999998864432 11   1  2222221


Q ss_pred             ccccccCCeEEEEeccccceeeEEecCCcEEEEEcC
Q 047869         2004 NDREIHAKGLSLYFSSTYKLLFLSFQDGTTLVGRLS 2039 (2233)
Q Consensus      2004 ~~~q~~~~GVSVyYS~tl~LLF~SY~~G~Sf~a~Ls 2039 (2233)
                      .-... +--++-.|+++-+.++.++.+|+..+=++.
T Consensus       228 ~~~~~-~~~~~a~ftPds~Fvl~gs~dg~i~vw~~~  262 (311)
T KOG1446|consen  228 YPNAG-NLPLSATFTPDSKFVLSGSDDGTIHVWNLE  262 (311)
T ss_pred             ccCCC-CcceeEEECCCCcEEEEecCCCcEEEEEcC
Confidence            11111 112788999999999999999999998873


No 28 
>PF14727 PHTB1_N:  PTHB1 N-terminus
Probab=88.78  E-value=48  Score=42.39  Aligned_cols=261  Identities=18%  Similarity=0.165  Sum_probs=141.9

Q ss_pred             EEEeecccCccceEEeecccce-EEEEecCCCceeeeeeeeeccCCceEEEeEE---ecCCC-ceEEEEecCeEEEEeCc
Q 047869         1866 IVHLAFNSIVENYLTVAGYEDC-QVLTLNPRGEVTDRLAIELALQGAYIRRVDW---VPGSP-VQLMVVTNKFVKIYDLS 1940 (2233)
Q Consensus      1866 VlsLafNP~nEdyLAVcGLkDC-~VLTfss~GeV~DRL~LeL~Leg~fIIKa~W---LPGSQ-t~LAVVT~~FVKIYDLS 1940 (2233)
                      |-++-.++.+.|.++|-.+.-. .|+.=+.+|.-.+++-+|-+++ .-|..+++   +++++ .+|||...+-+-||.+.
T Consensus        27 v~~~~~~~~~~d~IivGS~~G~LrIy~P~~~~~~~~~lllE~~l~-~PILqv~~G~F~s~~~~~~LaVLhP~kl~vY~v~  105 (418)
T PF14727_consen   27 VGNLDNSPSGSDKIIVGSYSGILRIYDPSGNEFQPEDLLLETQLK-DPILQVECGKFVSGSEDLQLAVLHPRKLSVYSVS  105 (418)
T ss_pred             EEcccCCCCCccEEEEeccccEEEEEccCCCCCCCccEEEEEecC-CcEEEEEeccccCCCCcceEEEecCCEEEEEEEE
Confidence            3344445777888888776544 3444455553444666665553 12222222   67765 58999999999999993


Q ss_pred             CC----------CCCCcEEEEcCCCCeeEEEEEEe--cCCcEEEEEEecCCceEEEEecccCCCccccceeeeecccccc
Q 047869         1941 QD----------NISPLHYFTLPDDMIVDATLVIA--SRGKMFLIVLSECGSLYRLELSVEGNVGATPLKEIIQFNDREI 2008 (2233)
Q Consensus      1941 ~D----------~lSPvyyF~LpsGkIrDaTfv~~--e~G~~~ILVLSS~G~LY~Qels~s~d~g~~~ltEvvq~~~~q~ 2008 (2233)
                      ..          .+...|.-.++- .--.+|.+.-  ..|+-.|.|-|=+|.|..-+=+.-  .-...+.+ .-+|+.  
T Consensus       106 ~~~g~~~~g~~~~L~~~yeh~l~~-~a~nm~~G~Fgg~~~~~~IcVQS~DG~L~~feqe~~--~f~~~lp~-~llPgP--  179 (418)
T PF14727_consen  106 LVDGTVEHGNQYQLELIYEHSLQR-TAYNMCCGPFGGVKGRDFICVQSMDGSLSFFEQESF--AFSRFLPD-FLLPGP--  179 (418)
T ss_pred             ecCCCcccCcEEEEEEEEEEeccc-ceeEEEEEECCCCCCceEEEEEecCceEEEEeCCcE--EEEEEcCC-CCCCcC--
Confidence            32          234555555543 2233343332  356899999999999975332210  01122333 333443  


Q ss_pred             cCCeEEEEeccccceeeEEecCCcEEEEEcCCCccccc-ceeEEEEccCCCCCCCcccceeeccCCCceEEEEeccCCCc
Q 047869         2009 HAKGLSLYFSSTYKLLFLSFQDGTTLVGRLSPNAASLS-EVSYVFEEQDGKLRSAGLHRWKELLASSGLFFCFSSLKSNA 2087 (2233)
Q Consensus      2009 ~~~GVSVyYS~tl~LLF~SY~~G~Sf~a~Ls~~~~sv~-eis~Vfe~~~gk~~~a~L~qWsEV~~hPGLf~cls~~~sn~ 2087 (2233)
                            +-|.+..+-++..-.+.+--+-+...-+.+-. +-..--..++...+..-...|+=..|-+-+=+++...+++-
T Consensus       180 ------l~Y~~~tDsfvt~sss~~l~~Yky~~La~~s~~~~~~~~~~~~~~~~k~l~~dWs~nlGE~~l~i~v~~~~~~~  253 (418)
T PF14727_consen  180 ------LCYCPRTDSFVTASSSWTLECYKYQDLASASEASSRQSGTEQDISSGKKLNPDWSFNLGEQALDIQVVRFSSSE  253 (418)
T ss_pred             ------eEEeecCCEEEEecCceeEEEecHHHhhhccccccccccccccccccccccceeEEECCceeEEEEEEEcCCCC
Confidence                  56777776666554433322222111110000 00000001111122223678998888877766666655677


Q ss_pred             eEEEEecCCceeee----ccccccCCCCCeE----EEEEeecCCCCC----eEEEEEeeCCceeEEec
Q 047869         2088 AVAVSLGTNELIAQ----NMRHAAGSTSPLV----GVTAYKPLSKDK----VHCLVLHDDGSLQIYSH 2143 (2233)
Q Consensus      2088 pvvv~l~pd~I~iQ----eiK~~~~sSs~vd----gva~y~p~s~~r----ttlLLLcEDGSLrIYsa 2143 (2233)
                      +-++.++...++.=    .+|.    .++++    .+..|.......    -.+|+-.++|+|.||.-
T Consensus       254 ~~IvvLger~Lf~l~~~G~l~~----~krLd~~p~~~~~Y~~~~~~~~~~~~~llV~t~t~~LlVy~d  317 (418)
T PF14727_consen  254 SDIVVLGERSLFCLKDNGSLRF----QKRLDYNPSCFCPYRVPWYNEPSTRLNLLVGTHTGTLLVYED  317 (418)
T ss_pred             ceEEEEecceEEEEcCCCeEEE----EEecCCceeeEEEEEeecccCCCCceEEEEEecCCeEEEEeC
Confidence            77777777666541    2333    22222    455677754333    34899999999999964


No 29 
>KOG0310 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=87.85  E-value=21  Score=45.87  Aligned_cols=193  Identities=21%  Similarity=0.255  Sum_probs=112.8

Q ss_pred             CceEEE-e-eCCeEEEEechh--hhcccccCCccccccccccccccccceEEEEeecccCccceEEeecccceEEEEecC
Q 047869         1819 RGRLAV-G-EGDKVAIFDVGQ--LIGQATIQPVTADKTNVKPLSRNIVRFEIVHLAFNSIVENYLTVAGYEDCQVLTLNP 1894 (2233)
Q Consensus      1819 rGrLAV-a-EgdKVTILqlsa--LLkQad~s~~skdKlTLtrLSsa~VpFeVlsLafNP~nEdyLAVcGLkDC~VLTfss 1894 (2233)
                      .|+|+. + |.|+|-|++...  .|.|-.++                 -++|..+.|.| .++-++|-|=.|=.+.-+.-
T Consensus        79 DG~LlaaGD~sG~V~vfD~k~r~iLR~~~ah-----------------~apv~~~~f~~-~d~t~l~s~sDd~v~k~~d~  140 (487)
T KOG0310|consen   79 DGRLLAAGDESGHVKVFDMKSRVILRQLYAH-----------------QAPVHVTKFSP-QDNTMLVSGSDDKVVKYWDL  140 (487)
T ss_pred             CCeEEEccCCcCcEEEeccccHHHHHHHhhc-----------------cCceeEEEecc-cCCeEEEecCCCceEEEEEc
Confidence            466654 3 999999999765  45443321                 23455677866 55555566656655555555


Q ss_pred             CCceeeeeeeeeccCCceEEEeEEecCCCceEEEEecC---eEEEEeCcCCCCCCcEEEE----------cCCCCe----
Q 047869         1895 RGEVTDRLAIELALQGAYIRRVDWVPGSPVQLMVVTNK---FVKIYDLSQDNISPLHYFT----------LPDDMI---- 1957 (2233)
Q Consensus      1895 ~GeV~DRL~LeL~Leg~fIIKa~WLPGSQt~LAVVT~~---FVKIYDLS~D~lSPvyyF~----------LpsGkI---- 1957 (2233)
                      ++..+   +.+++--..|||...|.|+..-  +|+|--   +||.||.-... +++++|-          ||+|.+    
T Consensus       141 s~a~v---~~~l~~htDYVR~g~~~~~~~h--ivvtGsYDg~vrl~DtR~~~-~~v~elnhg~pVe~vl~lpsgs~iasA  214 (487)
T KOG0310|consen  141 STAYV---QAELSGHTDYVRCGDISPANDH--IVVTGSYDGKVRLWDTRSLT-SRVVELNHGCPVESVLALPSGSLIASA  214 (487)
T ss_pred             CCcEE---EEEecCCcceeEeeccccCCCe--EEEecCCCceEEEEEeccCC-ceeEEecCCCceeeEEEcCCCCEEEEc
Confidence            44442   4455555789999999999744  456654   89999997775 6666553          344322    


Q ss_pred             -------eEEEEE-------EecCCcEEEEEEecCC-ceEEEEecccCCCccccceeeeecccccccCCeEEEEeccccc
Q 047869         1958 -------VDATLV-------IASRGKMFLIVLSECG-SLYRLELSVEGNVGATPLKEIIQFNDREIHAKGLSLYFSSTYK 2022 (2233)
Q Consensus      1958 -------rDaTfv-------~~e~G~~~ILVLSS~G-~LY~Qels~s~d~g~~~ltEvvq~~~~q~~~~GVSVyYS~tl~ 2022 (2233)
                             -|.+-.       .+.+.+.-.+.+.++| .||.--+  .+....+..++.--+......+.-+||--|+.-+
T Consensus       215 gGn~vkVWDl~~G~qll~~~~~H~KtVTcL~l~s~~~rLlS~sL--D~~VKVfd~t~~Kvv~s~~~~~pvLsiavs~dd~  292 (487)
T KOG0310|consen  215 GGNSVKVWDLTTGGQLLTSMFNHNKTVTCLRLASDSTRLLSGSL--DRHVKVFDTTNYKVVHSWKYPGPVLSIAVSPDDQ  292 (487)
T ss_pred             CCCeEEEEEecCCceehhhhhcccceEEEEEeecCCceEeeccc--ccceEEEEccceEEEEeeecccceeeEEecCCCc
Confidence                   122100       0011122233333333 1111111  1233344444444444556677788999999889


Q ss_pred             eeeEEecCCcEEEEE
Q 047869         2023 LLFLSFQDGTTLVGR 2037 (2233)
Q Consensus      2023 LLF~SY~~G~Sf~a~ 2037 (2233)
                      .+-+...+|..++.+
T Consensus       293 t~viGmsnGlv~~rr  307 (487)
T KOG0310|consen  293 TVVIGMSNGLVSIRR  307 (487)
T ss_pred             eEEEecccceeeeeh
Confidence            999999998888863


No 30 
>PF08662 eIF2A:  Eukaryotic translation initiation factor eIF2A;  InterPro: IPR013979  This entry contains beta-propeller domains found in eukaryotic translation initiation factors and TolB domain-containing proteins. 
Probab=87.84  E-value=25  Score=39.76  Aligned_cols=138  Identities=18%  Similarity=0.291  Sum_probs=80.2

Q ss_pred             EeecccCccceEEeecc-----------cceEEEEecCCCceeeeeeeeeccCCceEEEeEEecCCCceEEEEec---Ce
Q 047869         1868 HLAFNSIVENYLTVAGY-----------EDCQVLTLNPRGEVTDRLAIELALQGAYIRRVDWVPGSPVQLMVVTN---KF 1933 (2233)
Q Consensus      1868 sLafNP~nEdyLAVcGL-----------kDC~VLTfss~GeV~DRL~LeL~Leg~fIIKa~WLPGSQt~LAVVT~---~F 1933 (2233)
                      .|.-|| +++||+|-=-           -+-.++.++.++.-.+.+.++  -+| -|..+.|=|.++. +||++.   ..
T Consensus        10 ~~~W~~-~G~~l~~~~~~~~~~~~ks~~~~~~l~~~~~~~~~~~~i~l~--~~~-~I~~~~WsP~g~~-favi~g~~~~~   84 (194)
T PF08662_consen   10 KLHWQP-SGDYLLVKVQTRVDKSGKSYYGEFELFYLNEKNIPVESIELK--KEG-PIHDVAWSPNGNE-FAVIYGSMPAK   84 (194)
T ss_pred             EEEecc-cCCEEEEEEEEeeccCcceEEeeEEEEEEecCCCccceeecc--CCC-ceEEEEECcCCCE-EEEEEccCCcc
Confidence            555666 5666665433           245566665555444333332  223 4999999998754 776653   37


Q ss_pred             EEEEeCcCCCCCCcEEEEcCCCCeeEEEEEEecCCcEEEEEE--ecCCceEEEEecccCCCccccceeeeecccccccCC
Q 047869         1934 VKIYDLSQDNISPLHYFTLPDDMIVDATLVIASRGKMFLIVL--SECGSLYRLELSVEGNVGATPLKEIIQFNDREIHAK 2011 (2233)
Q Consensus      1934 VKIYDLS~D~lSPvyyF~LpsGkIrDaTfv~~e~G~~~ILVL--SS~G~LY~Qels~s~d~g~~~ltEvvq~~~~q~~~~ 2011 (2233)
                      |.|||+.   ..|++.  ++.+.+-  ++...+.|++.++.-  ...|.|+.-++..        ...+.+.    ....
T Consensus        85 v~lyd~~---~~~i~~--~~~~~~n--~i~wsP~G~~l~~~g~~n~~G~l~~wd~~~--------~~~i~~~----~~~~  145 (194)
T PF08662_consen   85 VTLYDVK---GKKIFS--FGTQPRN--TISWSPDGRFLVLAGFGNLNGDLEFWDVRK--------KKKISTF----EHSD  145 (194)
T ss_pred             cEEEcCc---ccEeEe--ecCCCce--EEEECCCCCEEEEEEccCCCcEEEEEECCC--------CEEeecc----ccCc
Confidence            9999995   445554  4565554  344568897555543  2357777766651        1111111    1233


Q ss_pred             eEEEEeccccceeeEEec
Q 047869         2012 GLSLYFSSTYKLLFLSFQ 2029 (2233)
Q Consensus      2012 GVSVyYS~tl~LLF~SY~ 2029 (2233)
                      ...+.+|++-+.+..+.+
T Consensus       146 ~t~~~WsPdGr~~~ta~t  163 (194)
T PF08662_consen  146 ATDVEWSPDGRYLATATT  163 (194)
T ss_pred             EEEEEEcCCCCEEEEEEe
Confidence            577889988777776654


No 31 
>PF10282 Lactonase:  Lactonase, 7-bladed beta-propeller;  InterPro: IPR019405  6-phosphogluconolactonase (6PGL) 3.1.1.31 from EC, which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate, is one of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE IPR022528 from INTERPRO [] and the Pseudomonas aeruginosa DevB IPR005900 from INTERPRO [] types.  This entry contains bacterial 6-phosphogluconolactonases (6PGL) YbhE-type 3.1.1.31 from EC which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase 5.5.1.5 from EC and muconate cycloisomerase 5.5.1.1 from EC, which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold [].; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H.
Probab=87.68  E-value=75  Score=38.64  Aligned_cols=225  Identities=17%  Similarity=0.280  Sum_probs=118.8

Q ss_pred             cceeccc-CceEEEe--eCCeEEEEechhh--hcccccCCccccccccccccccccceEEEEeecccCccceEEee--cc
Q 047869         1812 SLLSVSS-RGRLAVG--EGDKVAIFDVGQL--IGQATIQPVTADKTNVKPLSRNIVRFEIVHLAFNSIVENYLTVA--GY 1884 (2233)
Q Consensus      1812 qLLSas~-rGrLAVa--EgdKVTILqlsaL--LkQad~s~~skdKlTLtrLSsa~VpFeVlsLafNP~nEdyLAVc--GL 1884 (2233)
                      .-++.+. ++.|+|+  .++.|+++++..-  ++....   ...-..--|-.....+-...++.+.| +++|+.|+  |.
T Consensus        90 ~~i~~~~~g~~l~vany~~g~v~v~~l~~~g~l~~~~~---~~~~~g~g~~~~rq~~~h~H~v~~~p-dg~~v~v~dlG~  165 (345)
T PF10282_consen   90 CHIAVDPDGRFLYVANYGGGSVSVFPLDDDGSLGEVVQ---TVRHEGSGPNPDRQEGPHPHQVVFSP-DGRFVYVPDLGA  165 (345)
T ss_dssp             EEEEECTTSSEEEEEETTTTEEEEEEECTTSEEEEEEE---EEESEEEESSTTTTSSTCEEEEEE-T-TSSEEEEEETTT
T ss_pred             EEEEEecCCCEEEEEEccCCeEEEEEccCCcccceeee---ecccCCCCCcccccccccceeEEECC-CCCEEEEEecCC
Confidence            3456654 4557776  7999999999652  111110   00000001111122344567899988 78888887  78


Q ss_pred             cceEEEEecCCC-ceeeeeeeeeccC-CceEEEeEEecCCCceEEEEe--cCeEEEEeCcCCC--CCCcEEEEc-CC---
Q 047869         1885 EDCQVLTLNPRG-EVTDRLAIELALQ-GAYIRRVDWVPGSPVQLMVVT--NKFVKIYDLSQDN--ISPLHYFTL-PD--- 1954 (2233)
Q Consensus      1885 kDC~VLTfss~G-eV~DRL~LeL~Le-g~fIIKa~WLPGSQt~LAVVT--~~FVKIYDLS~D~--lSPvyyF~L-ps--- 1954 (2233)
                      ..+.++.++..+ .+.....+  .++ |.==|.+.|-|..+. +-|+.  +..|.+|++....  +........ |.   
T Consensus       166 D~v~~~~~~~~~~~l~~~~~~--~~~~G~GPRh~~f~pdg~~-~Yv~~e~s~~v~v~~~~~~~g~~~~~~~~~~~~~~~~  242 (345)
T PF10282_consen  166 DRVYVYDIDDDTGKLTPVDSI--KVPPGSGPRHLAFSPDGKY-AYVVNELSNTVSVFDYDPSDGSLTEIQTISTLPEGFT  242 (345)
T ss_dssp             TEEEEEEE-TTS-TEEEEEEE--ECSTTSSEEEEEE-TTSSE-EEEEETTTTEEEEEEEETTTTEEEEEEEEESCETTSC
T ss_pred             CEEEEEEEeCCCceEEEeecc--ccccCCCCcEEEEcCCcCE-EEEecCCCCcEEEEeecccCCceeEEEEeeecccccc
Confidence            889999998776 44442333  333 555677888886543 33443  4489999998333  333333332 22   


Q ss_pred             CCeeEEEEEEecCCcEEEEEEecCCceEEEEecccCCCccccceeeeecccccccCCeEEEEeccccceeeEEec-CCcE
Q 047869         1955 DMIVDATLVIASRGKMFLIVLSECGSLYRLELSVEGNVGATPLKEIIQFNDREIHAKGLSLYFSSTYKLLFLSFQ-DGTT 2033 (2233)
Q Consensus      1955 GkIrDaTfv~~e~G~~~ILVLSS~G~LY~Qels~s~d~g~~~ltEvvq~~~~q~~~~GVSVyYS~tl~LLF~SY~-~G~S 2033 (2233)
                      |.-.-+.+...++|+...+.--..+.|-.-++...  .+.......  ++.+-..  --.+-.+++=++|+++-+ +++.
T Consensus       243 ~~~~~~~i~ispdg~~lyvsnr~~~sI~vf~~d~~--~g~l~~~~~--~~~~G~~--Pr~~~~s~~g~~l~Va~~~s~~v  316 (345)
T PF10282_consen  243 GENAPAEIAISPDGRFLYVSNRGSNSISVFDLDPA--TGTLTLVQT--VPTGGKF--PRHFAFSPDGRYLYVANQDSNTV  316 (345)
T ss_dssp             SSSSEEEEEE-TTSSEEEEEECTTTEEEEEEECTT--TTTEEEEEE--EEESSSS--EEEEEE-TTSSEEEEEETTTTEE
T ss_pred             ccCCceeEEEecCCCEEEEEeccCCEEEEEEEecC--CCceEEEEE--EeCCCCC--ccEEEEeCCCCEEEEEecCCCeE
Confidence            22234555556777643333334555666666422  222222222  2221111  234455999999999875 4455


Q ss_pred             EEEEcCCCccccccee
Q 047869         2034 LVGRLSPNAASLSEVS 2049 (2233)
Q Consensus      2034 f~a~Ls~~~~sv~eis 2049 (2233)
                      .+-+++..++.++.+.
T Consensus       317 ~vf~~d~~tG~l~~~~  332 (345)
T PF10282_consen  317 SVFDIDPDTGKLTPVG  332 (345)
T ss_dssp             EEEEEETTTTEEEEEE
T ss_pred             EEEEEeCCCCcEEEec
Confidence            5556666665554444


No 32 
>TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol.
Probab=87.14  E-value=56  Score=36.55  Aligned_cols=148  Identities=16%  Similarity=0.188  Sum_probs=78.1

Q ss_pred             EEeecccCccceEEeecccceEEEEecCC-CceeeeeeeeeccCCceEEEeEEecCCCceEEEEe--cCeEEEEeCcCCC
Q 047869         1867 VHLAFNSIVENYLTVAGYEDCQVLTLNPR-GEVTDRLAIELALQGAYIRRVDWVPGSPVQLMVVT--NKFVKIYDLSQDN 1943 (2233)
Q Consensus      1867 lsLafNP~nEdyLAVcGLkDC~VLTfss~-GeV~DRL~LeL~Leg~fIIKa~WLPGSQt~LAVVT--~~FVKIYDLS~D~ 1943 (2233)
                      .+++++| ++++|++++..+-.|..++.+ |++...+.    . +.-+..+.|-|..+. ++++.  ...|++||+....
T Consensus        34 ~~l~~~~-dg~~l~~~~~~~~~v~~~d~~~~~~~~~~~----~-~~~~~~~~~~~~g~~-l~~~~~~~~~l~~~d~~~~~  106 (300)
T TIGR03866        34 RGITLSK-DGKLLYVCASDSDTIQVIDLATGEVIGTLP----S-GPDPELFALHPNGKI-LYIANEDDNLVTVIDIETRK  106 (300)
T ss_pred             CceEECC-CCCEEEEEECCCCeEEEEECCCCcEEEecc----C-CCCccEEEECCCCCE-EEEEcCCCCeEEEEECCCCe
Confidence            4688887 566777777666666666644 44433222    1 222455678888764 44443  3589999997632


Q ss_pred             CCCcEEEEcCCCCeeEEEEEEecCCcEEEEEEecCC-ceEEEEecccCCCccccceeeeecccccccCCeEEEEeccccc
Q 047869         1944 ISPLHYFTLPDDMIVDATLVIASRGKMFLIVLSECG-SLYRLELSVEGNVGATPLKEIIQFNDREIHAKGLSLYFSSTYK 2022 (2233)
Q Consensus      1944 lSPvyyF~LpsGkIrDaTfv~~e~G~~~ILVLSS~G-~LY~Qels~s~d~g~~~ltEvvq~~~~q~~~~GVSVyYS~tl~ 2022 (2233)
                        +...+... ..+..  +.+.++|+.+ ++.+..+ .++..+...    +  .....+..     .....++.++++-+
T Consensus       107 --~~~~~~~~-~~~~~--~~~~~dg~~l-~~~~~~~~~~~~~d~~~----~--~~~~~~~~-----~~~~~~~~~s~dg~  169 (300)
T TIGR03866       107 --VLAEIPVG-VEPEG--MAVSPDGKIV-VNTSETTNMAHFIDTKT----Y--EIVDNVLV-----DQRPRFAEFTADGK  169 (300)
T ss_pred             --EEeEeeCC-CCcce--EEECCCCCEE-EEEecCCCeEEEEeCCC----C--eEEEEEEc-----CCCccEEEECCCCC
Confidence              33333221 12333  3335677643 3344433 344433321    1  11111111     11223467888888


Q ss_pred             eeeEEec-CCcEEEEEc
Q 047869         2023 LLFLSFQ-DGTTLVGRL 2038 (2233)
Q Consensus      2023 LLF~SY~-~G~Sf~a~L 2038 (2233)
                      .|+++.. +|+.++-.+
T Consensus       170 ~l~~~~~~~~~v~i~d~  186 (300)
T TIGR03866       170 ELWVSSEIGGTVSVIDV  186 (300)
T ss_pred             EEEEEcCCCCEEEEEEc
Confidence            8887764 666555444


No 33 
>KOG2110 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=86.48  E-value=12  Score=46.88  Aligned_cols=149  Identities=23%  Similarity=0.279  Sum_probs=93.0

Q ss_pred             eCCeEEEEechhhhcccccCCccccccccccccccccceEEEEeecccCccceEEeecccceEEEEecCCCceeeeeee-
Q 047869         1826 EGDKVAIFDVGQLIGQATIQPVTADKTNVKPLSRNIVRFEIVHLAFNSIVENYLTVAGYEDCQVLTLNPRGEVTDRLAI- 1904 (2233)
Q Consensus      1826 EgdKVTILqlsaLLkQad~s~~skdKlTLtrLSsa~VpFeVlsLafNP~nEdyLAVcGLkDC~VLTfss~GeV~DRL~L- 1904 (2233)
                      -.|.|-|+++..|---..+.   +.            .=+|.-|+||| ++.+||          |=+.+|+|+.=..+ 
T Consensus       151 t~GdV~l~d~~nl~~v~~I~---aH------------~~~lAalafs~-~G~llA----------TASeKGTVIRVf~v~  204 (391)
T KOG2110|consen  151 TSGDVVLFDTINLQPVNTIN---AH------------KGPLAALAFSP-DGTLLA----------TASEKGTVIRVFSVP  204 (391)
T ss_pred             CCceEEEEEcccceeeeEEE---ec------------CCceeEEEECC-CCCEEE----------EeccCceEEEEEEcC
Confidence            46677777775553332211   11            22455788988 788887          44677777765444 


Q ss_pred             eecc--C---Cce---EEEeEEecCCCceEEEEecCeEEEEeCcCCCCCCcE----------------------------
Q 047869         1905 ELAL--Q---GAY---IRRVDWVPGSPVQLMVVTNKFVKIYDLSQDNISPLH---------------------------- 1948 (2233)
Q Consensus      1905 eL~L--e---g~f---IIKa~WLPGSQt~LAVVT~~FVKIYDLS~D~lSPvy---------------------------- 1948 (2233)
                      +.+.  +   |.+   |--...=|.+|..-|.--++.|.||-|.+-..+|..                            
T Consensus       205 ~G~kl~eFRRG~~~~~IySL~Fs~ds~~L~~sS~TeTVHiFKL~~~~~~~~~~p~~~~~~~~~~sk~~~sylps~V~~~~  284 (391)
T KOG2110|consen  205 EGQKLYEFRRGTYPVSIYSLSFSPDSQFLAASSNTETVHIFKLEKVSNNPPESPTAGTSWFGKVSKAATSYLPSQVSSVL  284 (391)
T ss_pred             CccEeeeeeCCceeeEEEEEEECCCCCeEEEecCCCeEEEEEecccccCCCCCCCCCCcccchhhhhhhhhcchhhhhhh
Confidence            2221  2   554   666778889987556666779999999887655433                            


Q ss_pred             -------EEEcCCCCeeEEEEEEecCCcEEEEEEecCCceEEEEecccCCCccccceeee
Q 047869         1949 -------YFTLPDDMIVDATLVIASRGKMFLIVLSECGSLYRLELSVEGNVGATPLKEII 2001 (2233)
Q Consensus      1949 -------yF~LpsGkIrDaTfv~~e~G~~~ILVLSS~G~LY~Qels~s~d~g~~~ltEvv 2001 (2233)
                             |-.+|.+..+-.+.+..-.-..++.|.|++|++|+..++.. ++|.+.+.+.-
T Consensus       285 ~~~R~FAt~~l~~s~~~~~~~l~~~~~~~~v~vas~dG~~y~y~l~~~-~gGec~lik~h  343 (391)
T KOG2110|consen  285 DQSRKFATAKLPESGRKNICSLSSIQKIPRVLVASYDGHLYSYRLPPK-EGGECALIKRH  343 (391)
T ss_pred             hhccceeEEEccCCCccceEEeeccCCCCEEEEEEcCCeEEEEEcCCC-CCceeEEEEee
Confidence                   22333333322232322222358899999999999999965 66778877753


No 34 
>KOG4460 consensus Nuclear pore complex, Nup88/rNup84 component [Nuclear structure; Intracellular trafficking, secretion, and vesicular transport]
Probab=86.28  E-value=2.6  Score=54.05  Aligned_cols=87  Identities=22%  Similarity=0.422  Sum_probs=68.7

Q ss_pred             ccccccceEEEEeecccCccceEEeecccceEEEEe----cCCCceeeee-ee---eecc--------CCceEEEeEEec
Q 047869         1857 LSRNIVRFEIVHLAFNSIVENYLTVAGYEDCQVLTL----NPRGEVTDRL-AI---ELAL--------QGAYIRRVDWVP 1920 (2233)
Q Consensus      1857 LSsa~VpFeVlsLafNP~nEdyLAVcGLkDC~VLTf----ss~GeV~DRL-~L---eL~L--------eg~fIIKa~WLP 1920 (2233)
                      +-.++|-|+|-+|..|| .+...|..|.+-.-|+-+    +++|.+.|-- +|   ...+        ...-++.|.|=|
T Consensus        97 ~P~~~V~feV~~vl~s~-~GS~VaL~G~~Gi~vMeLp~rwG~~s~~eDgk~~v~CRt~~i~~~~ftss~~ltl~Qa~WHP  175 (741)
T KOG4460|consen   97 LPINPVLFEVYQVLLSP-TGSHVALIGIKGLMVMELPKRWGKNSEFEDGKSTVNCRTTPVAERFFTSSTSLTLKQAAWHP  175 (741)
T ss_pred             ccCCcceEEEEEEEecC-CCceEEEecCCeeEEEEchhhcCccceecCCCceEEEEeecccceeeccCCceeeeeccccC
Confidence            45689999999999999 789999999999999876    8888888862 11   1111        133589999999


Q ss_pred             CC--CceEEEEecC-eEEEEeCcCCCC
Q 047869         1921 GS--PVQLMVVTNK-FVKIYDLSQDNI 1944 (2233)
Q Consensus      1921 GS--Qt~LAVVT~~-FVKIYDLS~D~l 1944 (2233)
                      .|  -+-|.|.|++ -++|||||++.-
T Consensus       176 ~S~~D~hL~iL~sdnviRiy~lS~~te  202 (741)
T KOG4460|consen  176 SSILDPHLVLLTSDNVIRIYSLSEPTE  202 (741)
T ss_pred             CccCCceEEEEecCcEEEEEecCCcch
Confidence            99  7777777766 889999998763


No 35 
>PTZ00420 coronin; Provisional
Probab=85.83  E-value=31  Score=45.52  Aligned_cols=195  Identities=11%  Similarity=0.134  Sum_probs=108.0

Q ss_pred             CceEEE-eeCCeEEEEechhhhcccccCCccccccccccccc-cccceEEEEeecccCccceEEeeccc-ceEEEEecCC
Q 047869         1819 RGRLAV-GEGDKVAIFDVGQLIGQATIQPVTADKTNVKPLSR-NIVRFEIVHLAFNSIVENYLTVAGYE-DCQVLTLNPR 1895 (2233)
Q Consensus      1819 rGrLAV-aEgdKVTILqlsaLLkQad~s~~skdKlTLtrLSs-a~VpFeVlsLafNP~nEdyLAVcGLk-DC~VLTfss~ 1895 (2233)
                      ...||. ++.++|.|+++..--..       ..++. .|+.. ..-.-.|-.++|+|...++||.+|.. .+.|.-+. +
T Consensus        87 ~~lLASgS~DgtIrIWDi~t~~~~-------~~~i~-~p~~~L~gH~~~V~sVaf~P~g~~iLaSgS~DgtIrIWDl~-t  157 (568)
T PTZ00420         87 SEILASGSEDLTIRVWEIPHNDES-------VKEIK-DPQCILKGHKKKISIIDWNPMNYYIMCSSGFDSFVNIWDIE-N  157 (568)
T ss_pred             CCEEEEEeCCCeEEEEECCCCCcc-------ccccc-cceEEeecCCCcEEEEEECCCCCeEEEEEeCCCeEEEEECC-C
Confidence            345665 48889999998531000       00000 01100 01123588999999877777777753 34444443 3


Q ss_pred             CceeeeeeeeeccCCceEEEeEEecCCCceEEEEecC-eEEEEeCcCCCCCCcEEEEcCCCCeeEEEEEEe---cCCcEE
Q 047869         1896 GEVTDRLAIELALQGAYIRRVDWVPGSPVQLMVVTNK-FVKIYDLSQDNISPLHYFTLPDDMIVDATLVIA---SRGKMF 1971 (2233)
Q Consensus      1896 GeV~DRL~LeL~Leg~fIIKa~WLPGSQt~LAVVT~~-FVKIYDLS~D~lSPvyyF~LpsGkIrDaTfv~~---e~G~~~ 1971 (2233)
                      |...    ..+.. +.-|..+.|-|... .||.++.+ .|+|||+...  .+...|.-..|.+...+++..   .++. +
T Consensus       158 g~~~----~~i~~-~~~V~SlswspdG~-lLat~s~D~~IrIwD~Rsg--~~i~tl~gH~g~~~s~~v~~~~fs~d~~-~  228 (568)
T PTZ00420        158 EKRA----FQINM-PKKLSSLKWNIKGN-LLSGTCVGKHMHIIDPRKQ--EIASSFHIHDGGKNTKNIWIDGLGGDDN-Y  228 (568)
T ss_pred             CcEE----EEEec-CCcEEEEEECCCCC-EEEEEecCCEEEEEECCCC--cEEEEEecccCCceeEEEEeeeEcCCCC-E
Confidence            3322    12222 34588999999765 45555544 8999999764  455677777777655555542   3444 4


Q ss_pred             EEEEecCC----ceEEEEecccCCCccccceeeeecccccccCCeEEEEeccccceeeEEe-cCCcEEEEEcC
Q 047869         1972 LIVLSECG----SLYRLELSVEGNVGATPLKEIIQFNDREIHAKGLSLYFSSTYKLLFLSF-QDGTTLVGRLS 2039 (2233)
Q Consensus      1972 ILVLSS~G----~LY~Qels~s~d~g~~~ltEvvq~~~~q~~~~GVSVyYS~tl~LLF~SY-~~G~Sf~a~Ls 2039 (2233)
                      |+..+.++    .|+.=++...+.    .+    .....+...+.+.-+|-+..+++|++= .+|+..+-.+.
T Consensus       229 IlTtG~d~~~~R~VkLWDlr~~~~----pl----~~~~ld~~~~~L~p~~D~~tg~l~lsGkGD~tIr~~e~~  293 (568)
T PTZ00420        229 ILSTGFSKNNMREMKLWDLKNTTS----AL----VTMSIDNASAPLIPHYDESTGLIYLIGKGDGNCRYYQHS  293 (568)
T ss_pred             EEEEEcCCCCccEEEEEECCCCCC----ce----EEEEecCCccceEEeeeCCCCCEEEEEECCCeEEEEEcc
Confidence            55544443    566666653221    11    111223344455567777777777554 56666666664


No 36 
>PF15492 Nbas_N:  Neuroblastoma-amplified sequence, N terminal
Probab=85.79  E-value=69  Score=39.33  Aligned_cols=237  Identities=18%  Similarity=0.238  Sum_probs=122.9

Q ss_pred             ccceEEeecccceEEEEecCC--CceeeeeeeeeccCCceEEEeEEecCCCceEEEEecC-eEEEEeCcCCC---CCCcE
Q 047869         1875 VENYLTVAGYEDCQVLTLNPR--GEVTDRLAIELALQGAYIRRVDWVPGSPVQLMVVTNK-FVKIYDLSQDN---ISPLH 1948 (2233)
Q Consensus      1875 nEdyLAVcGLkDC~VLTfss~--GeV~DRL~LeL~Leg~fIIKa~WLPGSQt~LAVVT~~-FVKIYDLS~D~---lSPvy 1948 (2233)
                      |++.|||.= +.|.-++=..+  ++++-+-+| +.-..--=||+.|=|+. +.||.+.+. .|+||||--.+   ++|..
T Consensus         8 ~Gk~lAi~q-d~~iEiRsa~Ddf~si~~kcqV-pkD~~PQWRkl~WSpD~-tlLa~a~S~G~i~vfdl~g~~lf~I~p~~   84 (282)
T PF15492_consen    8 DGKLLAILQ-DQCIEIRSAKDDFSSIIGKCQV-PKDPNPQWRKLAWSPDC-TLLAYAESTGTIRVFDLMGSELFVIPPAM   84 (282)
T ss_pred             CCcEEEEEe-ccEEEEEeccCCchheeEEEec-CCCCCchheEEEECCCC-cEEEEEcCCCeEEEEecccceeEEcCccc
Confidence            667777653 23333332222  233344433 11123347999998885 568887766 99999998543   34544


Q ss_pred             EEEcC-CCCeeEEEEEEecC-C--cEEEEEEecCCceEEEEecccCCCccccceeeeecccccccCCeEEEEecccccee
Q 047869         1949 YFTLP-DDMIVDATLVIASR-G--KMFLIVLSECGSLYRLELSVEGNVGATPLKEIIQFNDREIHAKGLSLYFSSTYKLL 2024 (2233)
Q Consensus      1949 yF~Lp-sGkIrDaTfv~~e~-G--~~~ILVLSS~G~LY~Qels~s~d~g~~~ltEvvq~~~~q~~~~GVSVyYS~tl~LL 2024 (2233)
                      .|... +..|....|.-... -  ...++|++=.|.|=..-++...+.+ +.-.-..+... .-..|--++-|.+.++||
T Consensus        85 ~~~~d~~~Aiagl~Fl~~~~s~~ws~ELlvi~Y~G~L~Sy~vs~gt~q~-y~e~hsfsf~~-~yp~Gi~~~vy~p~h~LL  162 (282)
T PF15492_consen   85 SFPGDLSDAIAGLIFLEYKKSAQWSYELLVINYRGQLRSYLVSVGTNQG-YQENHSFSFSS-HYPHGINSAVYHPKHRLL  162 (282)
T ss_pred             ccCCccccceeeeEeeccccccccceeEEEEeccceeeeEEEEcccCCc-ceeeEEEEecc-cCCCceeEEEEcCCCCEE
Confidence            33211 23466666654321 1  3477888877777655554221111 11111112111 112334457899999999


Q ss_pred             eEEecCCcEEEEEcCCCcccccceeEEEEccCCCCCCCcccceeeccCCCceEEEEeccCCCceEEEEecCCceeeec--
Q 047869         2025 FLSFQDGTTLVGRLSPNAASLSEVSYVFEEQDGKLRSAGLHRWKELLASSGLFFCFSSLKSNAAVAVSLGTNELIAQN-- 2102 (2233)
Q Consensus      2025 F~SY~~G~Sf~a~Ls~~~~sv~eis~Vfe~~~gk~~~a~L~qWsEV~~hPGLf~cls~~~sn~pvvv~l~pd~I~iQe-- 2102 (2233)
                      +++=..        +.            +...++...+|++.|+-+-+-|=-....++   +..+...  +.+-..++  
T Consensus       163 lVgG~~--------~~------------~~~~s~a~~~GLtaWRiL~~~Pyyk~v~~~---~~~~~~~--~~~~~~~~~~  217 (282)
T PF15492_consen  163 LVGGCE--------QN------------QDGMSKASSCGLTAWRILSDSPYYKQVTSS---EDDITAS--SKRRGLLRIP  217 (282)
T ss_pred             EEeccC--------CC------------CCccccccccCceEEEEcCCCCcEEEcccc---Ccccccc--ccccceeecc
Confidence            964221        11            111234567899999988777755433222   2222111  11111111  


Q ss_pred             -cccccCCCCCeEEEEEeecCCCCCeEEEEEeeCCceeEEe
Q 047869         2103 -MRHAAGSTSPLVGVTAYKPLSKDKVHCLVLHDDGSLQIYS 2142 (2233)
Q Consensus      2103 -iK~~~~sSs~vdgva~y~p~s~~rttlLLLcEDGSLrIYs 2142 (2233)
                       .|.-.......+++-- =.++-+.+.+..++-+|+|-+|.
T Consensus       218 ~~~~fs~~~~~~d~i~k-mSlSPdg~~La~ih~sG~lsLW~  257 (282)
T PF15492_consen  218 SFKFFSRQGQEQDGIFK-MSLSPDGSLLACIHFSGSLSLWE  257 (282)
T ss_pred             ceeeeeccccCCCceEE-EEECCCCCEEEEEEcCCeEEEEe
Confidence             1111111222233221 23467888999999999999998


No 37 
>KOG4378 consensus Nuclear protein COP1 [Signal transduction mechanisms]
Probab=85.60  E-value=24  Score=45.66  Aligned_cols=243  Identities=19%  Similarity=0.232  Sum_probs=141.5

Q ss_pred             EEeeCCeEEEEechhhhcccccCCccccccccccccccccceEEEEeecccCccceEEeecccceEEEEecCC-Cceeee
Q 047869         1823 AVGEGDKVAIFDVGQLIGQATIQPVTADKTNVKPLSRNIVRFEIVHLAFNSIVENYLTVAGYEDCQVLTLNPR-GEVTDR 1901 (2233)
Q Consensus      1823 AVaEgdKVTILqlsaLLkQad~s~~skdKlTLtrLSsa~VpFeVlsLafNP~nEdyLAVcGLkDC~VLTfss~-GeV~DR 1901 (2233)
                      +..-|||+-|+|.+++-....-.+.+            +-| .+-++..|.-  ++|+|+---+=.|+++..+ |. +-.
T Consensus         7 ~aS~gd~~kl~D~s~~~~~~~~~~~t------------~~p-g~~s~~w~~~--n~lvvas~~gdk~~~~~~K~g~-~~~   70 (673)
T KOG4378|consen    7 VASTGDKTKLSDFSDLETKSEYVHQT------------AEP-GDFSFNWQRR--NFLVVASMAGDKVMRIKEKDGK-TPE   70 (673)
T ss_pred             eeccCCceEEeecccccCccccccCC------------CCC-cceeeecccc--ceEEEeecCCceeEEEecccCC-CCc
Confidence            44579999999998776555432211            111 1567766654  4599988887788877443 33 222


Q ss_pred             eee-eecc-CCceEEEeEEecCCCceEEEEecCeEEEEeCcCCCCCCcEEEEcCCCCeeEEEEEEecCCcEEEEEEecCC
Q 047869         1902 LAI-ELAL-QGAYIRRVDWVPGSPVQLMVVTNKFVKIYDLSQDNISPLHYFTLPDDMIVDATLVIASRGKMFLIVLSECG 1979 (2233)
Q Consensus      1902 L~L-eL~L-eg~fIIKa~WLPGSQt~LAVVT~~FVKIYDLS~D~lSPvyyF~LpsGkIrDaTfv~~e~G~~~ILVLSS~G 1979 (2233)
                      +.+ +... +.+++..+  ...|.++++.=+++-||||||-..-   .|-|  +-|---.+|.+-..+.--+|--.|-.|
T Consensus        71 Vp~~~k~~gd~~~Cv~~--~s~S~y~~sgG~~~~Vkiwdl~~kl---~hr~--lkdh~stvt~v~YN~~DeyiAsvs~gG  143 (673)
T KOG4378|consen   71 VPRVRKLTGDNAFCVAC--ASQSLYEISGGQSGCVKIWDLRAKL---IHRF--LKDHQSTVTYVDYNNTDEYIASVSDGG  143 (673)
T ss_pred             cceeeccccchHHHHhh--hhcceeeeccCcCceeeehhhHHHH---Hhhh--ccCCcceeEEEEecCCcceeEEeccCC
Confidence            222 2211 23343332  3456777888899999999997332   2222  223223344443444446888889999


Q ss_pred             ceEEEEecccCCCccccceeeeecccccccCCeEEEEeccccceeeEEec-CCcEEEEEcCCCcccccceeEEEEccCCC
Q 047869         1980 SLYRLELSVEGNVGATPLKEIIQFNDREIHAKGLSLYFSSTYKLLFLSFQ-DGTTLVGRLSPNAASLSEVSYVFEEQDGK 2058 (2233)
Q Consensus      1980 ~LY~Qels~s~d~g~~~ltEvvq~~~~q~~~~GVSVyYS~tl~LLF~SY~-~G~Sf~a~Ls~~~~sv~eis~Vfe~~~gk 2058 (2233)
                      .|-.+.+.-.+..      +....+.+|.-   ==+.||..-+.|..+-. +|..-+=-+       +            
T Consensus       144 diiih~~~t~~~t------t~f~~~sgqsv---Rll~ys~skr~lL~~asd~G~VtlwDv-------~------------  195 (673)
T KOG4378|consen  144 DIIIHGTKTKQKT------TTFTIDSGQSV---RLLRYSPSKRFLLSIASDKGAVTLWDV-------Q------------  195 (673)
T ss_pred             cEEEEecccCccc------cceecCCCCeE---EEeecccccceeeEeeccCCeEEEEec-------c------------
Confidence            9999887643322      11222322221   13578887777766543 333222111       1            


Q ss_pred             CCCCcccceeeccCCCceEEEEeccCCCceEEEEecCC-ceeeeccccccCCCCCeEEEEEeecCC
Q 047869         2059 LRSAGLHRWKELLASSGLFFCFSSLKSNAAVAVSLGTN-ELIAQNMRHAAGSTSPLVGVTAYKPLS 2123 (2233)
Q Consensus      2059 ~~~a~L~qWsEV~~hPGLf~cls~~~sn~pvvv~l~pd-~I~iQeiK~~~~sSs~vdgva~y~p~s 2123 (2233)
                       ...+.++|.|+-.-|-==+|++-  +|-.+++.++.| .|.+-.++.    .+..+-++.-||++
T Consensus       196 -g~sp~~~~~~~HsAP~~gicfsp--sne~l~vsVG~Dkki~~yD~~s----~~s~~~l~y~~Pls  254 (673)
T KOG4378|consen  196 -GMSPIFHASEAHSAPCRGICFSP--SNEALLVSVGYDKKINIYDIRS----QASTDRLTYSHPLS  254 (673)
T ss_pred             -CCCcccchhhhccCCcCcceecC--CccceEEEecccceEEEeeccc----ccccceeeecCCcc
Confidence             22458999998666644347555  899999999875 566767774    34445666667765


No 38 
>PTZ00420 coronin; Provisional
Probab=85.36  E-value=1.4e+02  Score=39.66  Aligned_cols=116  Identities=11%  Similarity=0.029  Sum_probs=76.3

Q ss_pred             CceEEEeEEecCCCceEEEEecC-eEEEEeCcCCCC------CCcEEEEcCCCCeeEEEEEEecCCcEEEEEEecCCceE
Q 047869         1910 GAYIRRVDWVPGSPVQLMVVTNK-FVKIYDLSQDNI------SPLHYFTLPDDMIVDATLVIASRGKMFLIVLSECGSLY 1982 (2233)
Q Consensus      1910 g~fIIKa~WLPGSQt~LAVVT~~-FVKIYDLS~D~l------SPvyyF~LpsGkIrDaTfv~~e~G~~~ILVLSS~G~LY 1982 (2233)
                      ..-|..+.|-|.....||....+ .|||||+.....      .|...+.--.+.|..+.|  .+.|..+++..|.+|.|.
T Consensus        74 ~~~V~~lafsP~~~~lLASgS~DgtIrIWDi~t~~~~~~~i~~p~~~L~gH~~~V~sVaf--~P~g~~iLaSgS~DgtIr  151 (568)
T PTZ00420         74 TSSILDLQFNPCFSEILASGSEDLTIRVWEIPHNDESVKEIKDPQCILKGHKKKISIIDW--NPMNYYIMCSSGFDSFVN  151 (568)
T ss_pred             CCCEEEEEEcCCCCCEEEEEeCCCeEEEEECCCCCccccccccceEEeecCCCcEEEEEE--CCCCCeEEEEEeCCCeEE
Confidence            56799999999754567766655 999999975432      355555544667776664  356766666778899999


Q ss_pred             EEEecccCCCccccceeeeecccccccCCeEEEEeccccceeeEEecCCcEEEEEc
Q 047869         1983 RLELSVEGNVGATPLKEIIQFNDREIHAKGLSLYFSSTYKLLFLSFQDGTTLVGRL 2038 (2233)
Q Consensus      1983 ~Qels~s~d~g~~~ltEvvq~~~~q~~~~GVSVyYS~tl~LLF~SY~~G~Sf~a~L 2038 (2233)
                      .-+++....        ...+..   ...-.++-|+++-++|..+..+|+..+--+
T Consensus       152 IWDl~tg~~--------~~~i~~---~~~V~SlswspdG~lLat~s~D~~IrIwD~  196 (568)
T PTZ00420        152 IWDIENEKR--------AFQINM---PKKLSSLKWNIKGNLLSGTCVGKHMHIIDP  196 (568)
T ss_pred             EEECCCCcE--------EEEEec---CCcEEEEEECCCCCEEEEEecCCEEEEEEC
Confidence            888763211        011111   223567888888887777777777666554


No 39 
>KOG0650 consensus WD40 repeat nucleolar protein Bop1, involved in ribosome biogenesis [Translation, ribosomal structure and biogenesis]
Probab=84.81  E-value=22  Score=46.79  Aligned_cols=261  Identities=19%  Similarity=0.254  Sum_probs=155.1

Q ss_pred             cccceecccCceEEE-e-eCCeEEEEechhhhcccccCCccccccccccccccccceEEEEeecccCccceEEeecccce
Q 047869         1810 VKSLLSVSSRGRLAV-G-EGDKVAIFDVGQLIGQATIQPVTADKTNVKPLSRNIVRFEIVHLAFNSIVENYLTVAGYEDC 1887 (2233)
Q Consensus      1810 iRqLLSas~rGrLAV-a-EgdKVTILqlsaLLkQad~s~~skdKlTLtrLSsa~VpFeVlsLafNP~nEdyLAVcGLkDC 1887 (2233)
                      +...||++.+|...+ + +-|.|-|+-+.           |.+++-...+-.     +|.+|+|||....-+..+-..+|
T Consensus       402 ~Vr~iSvdp~G~wlasGsdDGtvriWEi~-----------TgRcvr~~~~d~-----~I~~vaw~P~~~~~vLAvA~~~~  465 (733)
T KOG0650|consen  402 LVRSISVDPSGEWLASGSDDGTVRIWEIA-----------TGRCVRTVQFDS-----EIRSVAWNPLSDLCVLAVAVGEC  465 (733)
T ss_pred             eEEEEEecCCcceeeecCCCCcEEEEEee-----------cceEEEEEeecc-----eeEEEEecCCCCceeEEEEecCc
Confidence            456678887776433 2 55556666651           234443333333     89999999999988888888899


Q ss_pred             EEEEecCCCceeeeeeeeeccC--Cce-------EEEeEEecCCCceEEEEecCeEEEEeCcCCCCCCcEEEEcCCCCee
Q 047869         1888 QVLTLNPRGEVTDRLAIELALQ--GAY-------IRRVDWVPGSPVQLMVVTNKFVKIYDLSQDNISPLHYFTLPDDMIV 1958 (2233)
Q Consensus      1888 ~VLTfss~GeV~DRL~LeL~Le--g~f-------IIKa~WLPGSQt~LAVVT~~FVKIYDLS~D~lSPvyyF~LpsGkIr 1958 (2233)
                       ++.+|+.  +.||+...+.-+  +..       ---|.|.++++-++-.-    |+             ...--.-.|+
T Consensus       466 -~~ivnp~--~G~~~e~~~t~ell~~~~~~~~p~~~~~~W~~~~~~e~~~~----v~-------------~~I~~~k~i~  525 (733)
T KOG0650|consen  466 -VLIVNPI--FGDRLEVGPTKELLASAPNESEPDAAVVTWSRASLDELEKG----VC-------------IVIKHPKSIR  525 (733)
T ss_pred             -eEEeCcc--ccchhhhcchhhhhhcCCCccCCcccceeechhhhhhhccc----eE-------------EEEecCCccc
Confidence             7777664  336666644432  111       12368999987665311    11             1111223688


Q ss_pred             EEEEEEecCCcEEEEEEecCCc--eEEEEecccCCCccccceeeeecccccccCCeEEEEeccccceeeEEecCCcEEEE
Q 047869         1959 DATLVIASRGKMFLIVLSECGS--LYRLELSVEGNVGATPLKEIIQFNDREIHAKGLSLYFSSTYKLLFLSFQDGTTLVG 2036 (2233)
Q Consensus      1959 DaTfv~~e~G~~~ILVLSS~G~--LY~Qels~s~d~g~~~ltEvvq~~~~q~~~~GVSVyYS~tl~LLF~SY~~G~Sf~a 2036 (2233)
                      ++| |. .+|.++-.||-+.|.  ++++++++..          .|.|.....|--+-+.|-++...||++.+.---.. 
T Consensus       526 ~vt-WH-rkGDYlatV~~~~~~~~VliHQLSK~~----------sQ~PF~kskG~vq~v~FHPs~p~lfVaTq~~vRiY-  592 (733)
T KOG0650|consen  526 QVT-WH-RKGDYLATVMPDSGNKSVLIHQLSKRK----------SQSPFRKSKGLVQRVKFHPSKPYLFVATQRSVRIY-  592 (733)
T ss_pred             eee-ee-cCCceEEEeccCCCcceEEEEeccccc----------ccCchhhcCCceeEEEecCCCceEEEEeccceEEE-
Confidence            888 64 458888888887664  8899999754          23344444455566788888889999876532222 


Q ss_pred             EcCCCcccccceeEEEEccCCCCCCCcccce-eeccCCC---ceEEEEeccCCCceEEEEecCCceeeeccccccCCCCC
Q 047869         2037 RLSPNAASLSEVSYVFEEQDGKLRSAGLHRW-KELLASS---GLFFCFSSLKSNAAVAVSLGTNELIAQNMRHAAGSTSP 2112 (2233)
Q Consensus      2037 ~Ls~~~~sv~eis~Vfe~~~gk~~~a~L~qW-sEV~~hP---GLf~cls~~~sn~pvvv~l~pd~I~iQeiK~~~~sSs~ 2112 (2233)
                      .|... .-|             ++.-.-|.| +-+..||   .||+. +.  .+..+.+-+.-...-.|.+|+...    
T Consensus       593 dL~kq-elv-------------KkL~tg~kwiS~msihp~GDnli~g-s~--d~k~~WfDldlsskPyk~lr~H~~----  651 (733)
T KOG0650|consen  593 DLSKQ-ELV-------------KKLLTGSKWISSMSIHPNGDNLILG-SY--DKKMCWFDLDLSSKPYKTLRLHEK----  651 (733)
T ss_pred             ehhHH-HHH-------------HHHhcCCeeeeeeeecCCCCeEEEe-cC--CCeeEEEEcccCcchhHHhhhhhh----
Confidence            12110 001             112224666 5566666   56544 44  677777777777777788887321    


Q ss_pred             eEEEEEeecCCCCCeEEEEEe-eCCceeEEecc
Q 047869         2113 LVGVTAYKPLSKDKVHCLVLH-DDGSLQIYSHV 2144 (2233)
Q Consensus      2113 vdgva~y~p~s~~rttlLLLc-EDGSLrIYsa~ 2144 (2233)
                      .+--++||+    |-+++.-+ |||.+.||-..
T Consensus       652 avr~Va~H~----ryPLfas~sdDgtv~Vfhg~  680 (733)
T KOG0650|consen  652 AVRSVAFHK----RYPLFASGSDDGTVIVFHGM  680 (733)
T ss_pred             hhhhhhhcc----ccceeeeecCCCcEEEEeee
Confidence            111122332    55555444 56999999653


No 40 
>PF04053 Coatomer_WDAD:  Coatomer WD associated region ;  InterPro: IPR006692 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and budding from Golgi membranes. Activated small guanosine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated with ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least alpha, beta, beta', gamma, delta, epsilon and zeta subunits.  This entry represents the WD-associated region found in coatomer alpha, beta and beta' subunits. 
The alpha-subunit (RET1P) of the coatomer complex in Saccharomyces cerevisiae (Baker's yeast) participates in membrane transport between the endoplasmic reticulum and Golgi apparatus. The protein contains six WD-40 repeat motifs in its N-terminal region. More information about these proteins can be found at Protein of the Month: Clathrin.; GO: 0005198 structural molecule activity, 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030117 membrane coat; PDB: 3MKQ_B.
Probab=84.29  E-value=20  Score=45.76  Aligned_cols=148  Identities=16%  Similarity=0.200  Sum_probs=86.5

Q ss_pred             ccccceEEEEeecccCccceEEeecccceEEEEecCCCceeeeeeeeeccCCceEEEeEEecCCCceEEEEecCeEEEEe
Q 047869         1859 RNIVRFEIVHLAFNSIVENYLTVAGYEDCQVLTLNPRGEVTDRLAIELALQGAYIRRVDWVPGSPVQLMVVTNKFVKIYD 1938 (2233)
Q Consensus      1859 sa~VpFeVlsLafNP~nEdyLAVcGLkDC~VLTfss~GeV~DRL~LeL~Leg~fIIKa~WLPGSQt~LAVVT~~FVKIYD 1938 (2233)
                      +-.++|.+..|-+    +..|+|.|-.-...+.+ .+|.++-++.+..      |+++.|-+. ...+|++|.+.|-|++
T Consensus       104 ~i~~~~~~~~If~----G~LL~~~~~~~i~~yDw-~~~~~i~~i~v~~------vk~V~Ws~~-g~~val~t~~~i~il~  171 (443)
T PF04053_consen  104 SIKLPFSVEKIFG----GNLLGVKSSDFICFYDW-ETGKLIRRIDVSA------VKYVIWSDD-GELVALVTKDSIYILK  171 (443)
T ss_dssp             ----SS-EEEEE-----SSSEEEEETTEEEEE-T-TT--EEEEESS-E-------EEEEE-TT-SSEEEEE-S-SEEEEE
T ss_pred             EEcCCcccceEEc----CcEEEEECCCCEEEEEh-hHcceeeEEecCC------CcEEEEECC-CCEEEEEeCCeEEEEE
Confidence            3455667777866    88999998886777777 5568888877652      888999877 4569999999999998


Q ss_pred             CcCC------------CCCCcEEEEcCCCCeeEEEEEEecCCcEEEEEEecCCceEEEEecccCCCccccceeeeecccc
Q 047869         1939 LSQD------------NISPLHYFTLPDDMIVDATLVIASRGKMFLIVLSECGSLYRLELSVEGNVGATPLKEIIQFNDR 2006 (2233)
Q Consensus      1939 LS~D------------~lSPvyyF~LpsGkIrDaTfv~~e~G~~~ILVLSS~G~LY~Qels~s~d~g~~~ltEvvq~~~~ 2006 (2233)
                      -..+            ++...++.   ..+|.+.++.    |.  +++-|+.++|.|  +- .|+.+.....+.      
T Consensus       172 ~~~~~~~~~~~~g~e~~f~~~~E~---~~~IkSg~W~----~d--~fiYtT~~~lkY--l~-~Ge~~~i~~ld~------  233 (443)
T PF04053_consen  172 YNLEAVAAIPEEGVEDAFELIHEI---SERIKSGCWV----ED--CFIYTTSNHLKY--LV-NGETGIIAHLDK------  233 (443)
T ss_dssp             E-HHHHHHBTTTB-GGGEEEEEEE----S--SEEEEE----TT--EEEEE-TTEEEE--EE-TTEEEEEEE-SS------
T ss_pred             ecchhcccccccCchhceEEEEEe---cceeEEEEEE----cC--EEEEEcCCeEEE--EE-cCCcceEEEcCC------
Confidence            8888            76666665   5577777744    33  566666669998  33 344444433331      


Q ss_pred             cccCCeEEEEeccccceeeEEecCCcEEEEEcCC
Q 047869         2007 EIHAKGLSLYFSSTYKLLFLSFQDGTTLVGRLSP 2040 (2233)
Q Consensus      2007 q~~~~GVSVyYS~tl~LLF~SY~~G~Sf~a~Ls~ 2040 (2233)
                          .---+.|.+..+.||+-=.++..+.-+++.
T Consensus       234 ----~~yllgy~~~~~~ly~~Dr~~~v~~~~ld~  263 (443)
T PF04053_consen  234 ----PLYLLGYLPKENRLYLIDRDGNVISYELDL  263 (443)
T ss_dssp             ------EEEEEETTTTEEEEE-TT--EEEEE--H
T ss_pred             ----ceEEEEEEccCCEEEEEECCCCEEEEEECH
Confidence                134466777778888887888777776643


No 41 
>KOG0650 consensus WD40 repeat nucleolar protein Bop1, involved in ribosome biogenesis [Translation, ribosomal structure and biogenesis]
Probab=84.15  E-value=9.1  Score=49.96  Aligned_cols=148  Identities=22%  Similarity=0.252  Sum_probs=92.3

Q ss_pred             EEEeecccCccceEEeecc----cceEEEEecCCCceeeeeeeeec-cCCceEEEeEEecCCCceEEEEecCeEEEEeCc
Q 047869         1866 IVHLAFNSIVENYLTVAGY----EDCQVLTLNPRGEVTDRLAIELA-LQGAYIRRVDWVPGSPVQLMVVTNKFVKIYDLS 1940 (2233)
Q Consensus      1866 VlsLafNP~nEdyLAVcGL----kDC~VLTfss~GeV~DRL~LeL~-Leg~fIIKa~WLPGSQt~LAVVT~~FVKIYDLS 1940 (2233)
                      |-+|+. +-.+||||++-.    +.+.|.-++.+      ....|- .-+..|.++-.=|-. -.|-|+|..+|+||||+
T Consensus       524 i~~vtW-HrkGDYlatV~~~~~~~~VliHQLSK~------~sQ~PF~kskG~vq~v~FHPs~-p~lfVaTq~~vRiYdL~  595 (733)
T KOG0650|consen  524 IRQVTW-HRKGDYLATVMPDSGNKSVLIHQLSKR------KSQSPFRKSKGLVQRVKFHPSK-PYLFVATQRSVRIYDLS  595 (733)
T ss_pred             cceeee-ecCCceEEEeccCCCcceEEEEecccc------cccCchhhcCCceeEEEecCCC-ceEEEEeccceEEEehh
Confidence            334444 557899998766    66777766544      222222 336667777777664 55889999999999999


Q ss_pred             CCCCCCcEEEEcCCC-CeeEEEEEEecCCcEEEEEEecCCceEEEEecccCCCccccceeeeecccccccCCeEEEEecc
Q 047869         1941 QDNISPLHYFTLPDD-MIVDATLVIASRGKMFLIVLSECGSLYRLELSVEGNVGATPLKEIIQFNDREIHAKGLSLYFSS 2019 (2233)
Q Consensus      1941 ~D~lSPvyyF~LpsG-kIrDaTfv~~e~G~~~ILVLSS~G~LY~Qels~s~d~g~~~ltEvvq~~~~q~~~~GVSVyYS~ 2019 (2233)
                      +-.+.=    .|-+| ++.+.--+ ...|- -+|+.|-++.+-..++..+...+..-..         -...+=+|-|-.
T Consensus       596 kqelvK----kL~tg~kwiS~msi-hp~GD-nli~gs~d~k~~WfDldlsskPyk~lr~---------H~~avr~Va~H~  660 (733)
T KOG0650|consen  596 KQELVK----KLLTGSKWISSMSI-HPNGD-NLILGSYDKKMCWFDLDLSSKPYKTLRL---------HEKAVRSVAFHK  660 (733)
T ss_pred             HHHHHH----HHhcCCeeeeeeee-cCCCC-eEEEecCCCeeEEEEcccCcchhHHhhh---------hhhhhhhhhhcc
Confidence            855322    12233 34433323 35564 5778899999999999866543222111         122344566667


Q ss_pred             ccceeeEEecCCcEEEE
Q 047869         2020 TYKLLFLSFQDGTTLVG 2036 (2233)
Q Consensus      2020 tl~LLF~SY~~G~Sf~a 2036 (2233)
                      .+.|.=..+.+|+.++.
T Consensus       661 ryPLfas~sdDgtv~Vf  677 (733)
T KOG0650|consen  661 RYPLFASGSDDGTVIVF  677 (733)
T ss_pred             ccceeeeecCCCcEEEE
Confidence            77776667777887664


No 42 
>PF11715 Nup160:  Nucleoporin Nup120/160;  InterPro: IPR021717  Nup120 is conserved from fungi to plants to humans, and is homologous with the Nup160 of vertebrates. The nuclear pore complex, or NPC, mediates macromolecular transport across the nuclear envelope. Deletion of the NUP120 gene causes clustering of NPCs at one side of the nuclear envelope, moderate nucleolar fragmentation and slower cell growth. The vertebrate NPC is estimated to contain between 30 and 60 different proteins, most of which are not known. Two important ones in creating the nucleoporin basket are Nup98 and Nup153; Nup120, in conjunction with Nup133, interacts with these two and itself plays a role in mRNA export. Nup160, Nup133, Nup96, and Nup107 are all targets of phosphorylation. The phosphorylation sites are clustered mainly at the N-terminal regions of these proteins, which are predicted to be natively disordered. The entire Nup107-160 subcomplex is stable throughout the cell cycle; thus it seems unlikely that phosphorylation affects interactions within the Nup107-160 subcomplex, but rather that it regulates the association of the subcomplex with the NPC and other proteins.; PDB: 3F7F_D 3H7N_D 3HXR_A.
Probab=82.81  E-value=4.2  Score=51.80  Aligned_cols=102  Identities=17%  Similarity=0.205  Sum_probs=49.3

Q ss_pred             CcCCCCCCcEEEEcCCCCe-eEEEEEEe-cCCcEEEEEEecCCceEEEEeccc-----CCCccccceeeee--ccccccc
Q 047869         1939 LSQDNISPLHYFTLPDDMI-VDATLVIA-SRGKMFLIVLSECGSLYRLELSVE-----GNVGATPLKEIIQ--FNDREIH 2009 (2233)
Q Consensus      1939 LS~D~lSPvyyF~LpsGkI-rDaTfv~~-e~G~~~ILVLSS~G~LY~Qels~s-----~d~g~~~ltEvvq--~~~~q~~ 2009 (2233)
                      +.+....+...|..|..-+ -.++.+.. ++..++|+|++++|++|+-.++..     .+.....+.++.+  .|..-..
T Consensus        65 ~~~~~~~~~lri~Fp~~~~~~~~v~~~~~~~~~~~v~v~t~s~~~~~l~l~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~  144 (547)
T PF11715_consen   65 LDKNLLNNTLRIHFPSPIILPGCVAFSETEDHVLIVFVTTSSGHLYTLTLPSDFFRSFSDLSEDNFEDWCRSYVPYSFSF  144 (547)
T ss_dssp             --------EEEEE-SS-BTT-GGGGEEEEE-SEEEEEEEBTTS-EEEEEEEHHHHHS---S---S--S-EE-B-SS-TTT
T ss_pred             ccccccCCeEEEECCCcCeeCCeEEEEECCCCEEEEEEEeCCCEEEEEECCChhhccccccccccccCccEeeeCCCCCc
Confidence            3334444777888888666 34443433 334789999999999999888743     2333333344422  2222122


Q ss_pred             CCeEEEEec--cccceeeEEecCCcEEEEEcCC
Q 047869         2010 AKGLSLYFS--STYKLLFLSFQDGTTLVGRLSP 2040 (2233)
Q Consensus      2010 ~~GVSVyYS--~tl~LLF~SY~~G~Sf~a~Ls~ 2040 (2233)
                      ..-.-++.+  ..-..+++++.+|.-+.-.+..
T Consensus       145 ~~~~~~~~~~~~~~~~l~v~~~dG~ll~l~~~~  177 (547)
T PF11715_consen  145 RSPHRLAAVTHDSEANLVVSLQDGGLLRLKRSS  177 (547)
T ss_dssp             S-EEEEEEE---SSSBEEEEESSS-EEEEEES-
T ss_pred             cCCCeEEEEEecCCCEEEEEECCCCeEEEECCc
Confidence            222223333  2778999999999888776644


No 43 
>KOG0264 consensus Nucleosome remodeling factor, subunit CAF1/NURF55/MSI1 [Chromatin structure and dynamics]
Probab=78.83  E-value=11  Score=47.80  Aligned_cols=140  Identities=18%  Similarity=0.228  Sum_probs=87.4

Q ss_pred             EEeeCCeEEEEechhhhcccccCCccccccccccccccccceEEEEeecccCccceEEeecccceEEEEecCCCcee--e
Q 047869         1823 AVGEGDKVAIFDVGQLIGQATIQPVTADKTNVKPLSRNIVRFEIVHLAFNSIVENYLTVAGYEDCQVLTLNPRGEVT--D 1900 (2233)
Q Consensus      1823 AVaEgdKVTILqlsaLLkQad~s~~skdKlTLtrLSsa~VpFeVlsLafNP~nEdyLAVcGLkDC~VLTfss~GeV~--D 1900 (2233)
                      +|++.+++-|+|++.  ++...+ .+.+          +=.=+|-.++|||-|+..||-++          ++|+|-  |
T Consensus       245 sv~dd~~L~iwD~R~--~~~~~~-~~~~----------ah~~~vn~~~fnp~~~~ilAT~S----------~D~tV~LwD  301 (422)
T KOG0264|consen  245 SVGDDGKLMIWDTRS--NTSKPS-HSVK----------AHSAEVNCVAFNPFNEFILATGS----------ADKTVALWD  301 (422)
T ss_pred             eecCCCeEEEEEcCC--CCCCCc-cccc----------ccCCceeEEEeCCCCCceEEecc----------CCCcEEEee
Confidence            478999999999986  322211 1111          11346789999999988888766          233221  2


Q ss_pred             ee----eeeeccC-CceEEEeEEecCCCceEEEEecC-eEEEEeCcCCCCC----------CcEEEEcC--CCCeeEEEE
Q 047869         1901 RL----AIELALQ-GAYIRRVDWVPGSPVQLMVVTNK-FVKIYDLSQDNIS----------PLHYFTLP--DDMIVDATL 1962 (2233)
Q Consensus      1901 RL----~LeL~Le-g~fIIKa~WLPGSQt~LAVVT~~-FVKIYDLS~D~lS----------PvyyF~Lp--sGkIrDaTf 1962 (2233)
                      +-    .++-... +.=|-+++|=|...+-||....+ .+.||||++----          |-.-|+.-  .++|.|.+.
T Consensus       302 lRnL~~~lh~~e~H~dev~~V~WSPh~etvLASSg~D~rl~vWDls~ig~eq~~eda~dgppEllF~HgGH~~kV~DfsW  381 (422)
T KOG0264|consen  302 LRNLNKPLHTFEGHEDEVFQVEWSPHNETVLASSGTDRRLNVWDLSRIGEEQSPEDAEDGPPELLFIHGGHTAKVSDFSW  381 (422)
T ss_pred             chhcccCceeccCCCcceEEEEeCCCCCceeEecccCCcEEEEeccccccccChhhhccCCcceeEEecCcccccccccC
Confidence            21    2222222 56689999999999999966555 9999999975432          33345544  456777773


Q ss_pred             EEecCCcEEEEEEecCCceEEEEec
Q 047869         1963 VIASRGKMFLIVLSECGSLYRLELS 1987 (2233)
Q Consensus      1963 v~~e~G~~~ILVLSS~G~LY~Qels 1987 (2233)
                        ++.---.|.-.++++.|-+=++.
T Consensus       382 --np~ePW~I~SvaeDN~LqIW~~s  404 (422)
T KOG0264|consen  382 --NPNEPWTIASVAEDNILQIWQMA  404 (422)
T ss_pred             --CCCCCeEEEEecCCceEEEeecc
Confidence              23333456666677666654444


No 44 
>KOG0286 consensus G-protein beta subunit [General function prediction only]
Probab=78.38  E-value=1.4e+02  Score=37.18  Aligned_cols=181  Identities=20%  Similarity=0.307  Sum_probs=98.2

Q ss_pred             eCCeEEEEechhhhcccccCCccccccccccccccccceEEEEeecccCccceEEeeccc-ceEEEEecCC-Cceeeeee
Q 047869         1826 EGDKVAIFDVGQLIGQATIQPVTADKTNVKPLSRNIVRFEIVHLAFNSIVENYLTVAGYE-DCQVLTLNPR-GEVTDRLA 1903 (2233)
Q Consensus      1826 EgdKVTILqlsaLLkQad~s~~skdKlTLtrLSsa~VpFeVlsLafNP~nEdyLAVcGLk-DC~VLTfss~-GeV~DRL~ 1903 (2233)
                      ..||+.|++.-.           .-|...-||-++    =||..+|.| +.+|.|--|+. .|.|+.++.+ -+..-+..
T Consensus        75 qDGklIvWDs~T-----------tnK~haipl~s~----WVMtCA~sP-Sg~~VAcGGLdN~Csiy~ls~~d~~g~~~v~  138 (343)
T KOG0286|consen   75 QDGKLIVWDSFT-----------TNKVHAIPLPSS----WVMTCAYSP-SGNFVACGGLDNKCSIYPLSTRDAEGNVRVS  138 (343)
T ss_pred             cCCeEEEEEccc-----------ccceeEEecCce----eEEEEEECC-CCCeEEecCcCceeEEEecccccccccceee
Confidence            556666776522           233444444332    478999999 89999999996 5889999733 22222222


Q ss_pred             eeeccCCceEEEeEEecCCCc------------------eEEE------------------------EecCeEEEEeCcC
Q 047869         1904 IELALQGAYIRRVDWVPGSPV------------------QLMV------------------------VTNKFVKIYDLSQ 1941 (2233)
Q Consensus      1904 LeL~Leg~fIIKa~WLPGSQt------------------~LAV------------------------VT~~FVKIYDLS~ 1941 (2233)
                      =++.--..|+-.++.++..|-                  ++.+                        ..-...|+||+-.
T Consensus       139 r~l~gHtgylScC~f~dD~~ilT~SGD~TCalWDie~g~~~~~f~GH~gDV~slsl~p~~~ntFvSg~cD~~aklWD~R~  218 (343)
T KOG0286|consen  139 RELAGHTGYLSCCRFLDDNHILTGSGDMTCALWDIETGQQTQVFHGHTGDVMSLSLSPSDGNTFVSGGCDKSAKLWDVRS  218 (343)
T ss_pred             eeecCccceeEEEEEcCCCceEecCCCceEEEEEcccceEEEEecCCcccEEEEecCCCCCCeEEecccccceeeeeccC
Confidence            223323455555555553321                  1110                        0111223343322


Q ss_pred             CCCCCcEEEEcCCCCeeEEEEEEecCCcEEEEEEecCCceEEEEecccCCCccccceeeeecccccccCCeEEEEecccc
Q 047869         1942 DNISPLHYFTLPDDMIVDATLVIASRGKMFLIVLSECGSLYRLELSVEGNVGATPLKEIIQFNDREIHAKGLSLYFSSTY 2021 (2233)
Q Consensus      1942 D~lSPvyyF~LpsGkIrDaTfv~~e~G~~~ILVLSS~G~LY~Qels~s~d~g~~~ltEvvq~~~~q~~~~GVSVyYS~tl 2021 (2233)
                      -  .-+-.|---+..|-.+.|+  ++| ..+.-=|++|-.-+.+|+-......+        .......+-.||-||-.=
T Consensus       219 ~--~c~qtF~ghesDINsv~ff--P~G-~afatGSDD~tcRlyDlRaD~~~a~y--------s~~~~~~gitSv~FS~SG  285 (343)
T KOG0286|consen  219 G--QCVQTFEGHESDINSVRFF--PSG-DAFATGSDDATCRLYDLRADQELAVY--------SHDSIICGITSVAFSKSG  285 (343)
T ss_pred             c--ceeEeecccccccceEEEc--cCC-CeeeecCCCceeEEEeecCCcEEeee--------ccCcccCCceeEEEcccc
Confidence            2  1122333334455555554  344 34555566666666666532211111        122335567899999999


Q ss_pred             ceeeEEecCCcEEE
Q 047869         2022 KLLFLSFQDGTTLV 2035 (2233)
Q Consensus      2022 ~LLF~SY~~G~Sf~ 2035 (2233)
                      ++||..|.+++...
T Consensus       286 RlLfagy~d~~c~v  299 (343)
T KOG0286|consen  286 RLLFAGYDDFTCNV  299 (343)
T ss_pred             cEEEeeecCCceeE
Confidence            99999999988765


No 45 
>PF08596 Lgl_C:  Lethal giant larvae(Lgl) like, C-terminal;  InterPro: IPR013905  The Lethal giant larvae (Lgl) tumour suppressor protein is conserved from yeast to mammals. The Lgl protein functions in cell polarity, at least in part, by regulating SNARE-mediated membrane delivery events at the cell surface. The N-terminal half of Lgl members contains WD40 repeats (see IPR001680 from INTERPRO), while the C-terminal half appears specific to the protein.; PDB: 2OAJ_A.
Probab=78.24  E-value=49  Score=41.84  Aligned_cols=171  Identities=18%  Similarity=0.250  Sum_probs=86.9

Q ss_pred             ccceecccCceEEEe-eCCeEEEEech--hhhcccccCCccccccccccccccccceEEEEeecccCccceEEeecccce
Q 047869         1811 KSLLSVSSRGRLAVG-EGDKVAIFDVG--QLIGQATIQPVTADKTNVKPLSRNIVRFEIVHLAFNSIVENYLTVAGYEDC 1887 (2233)
Q Consensus      1811 RqLLSas~rGrLAVa-EgdKVTILqls--aLLkQad~s~~skdKlTLtrLSsa~VpFeVlsLafNP~nEdyLAVcGLkDC 1887 (2233)
                      =.+++.+..|.|||+ |.|.++|+|++  +++=++....+.  .-....-.-..+-|.|+.+.. -..-.-+..||...=
T Consensus        89 vtal~~S~iGFvaigy~~G~l~viD~RGPavI~~~~i~~~~--~~~~~~~~vt~ieF~vm~~~~-D~ySSi~L~vGTn~G  165 (395)
T PF08596_consen   89 VTALKNSDIGFVAIGYESGSLVVIDLRGPAVIYNENIRESF--LSKSSSSYVTSIEFSVMTLGG-DGYSSICLLVGTNSG  165 (395)
T ss_dssp             EEEEEE-BTSEEEEEETTSEEEEEETTTTEEEEEEEGGG----T-SS----EEEEEEEEEE-TT-SSSEEEEEEEEETTS
T ss_pred             EeEEecCCCcEEEEEecCCcEEEEECCCCeEEeeccccccc--cccccccCeeEEEEEEEecCC-CcccceEEEEEeCCC
Confidence            356777899999999 99999999994  233222211100  000011111136677777632 112235777788777


Q ss_pred             EEEEe----cCCCceeeeeeeeeccCCceEEEeEEe-------------------cC--CCceEEEEecCeEEEEeCcCC
Q 047869         1888 QVLTL----NPRGEVTDRLAIELALQGAYIRRVDWV-------------------PG--SPVQLMVVTNKFVKIYDLSQD 1942 (2233)
Q Consensus      1888 ~VLTf----ss~GeV~DRL~LeL~Leg~fIIKa~WL-------------------PG--SQt~LAVVT~~FVKIYDLS~D 1942 (2233)
                      .+++|    +++|.-.-..+-.......-|+++.=+                   +|  -+..+.++|..-||||.+.+.
T Consensus       166 ~v~~fkIlp~~~g~f~v~~~~~~~~~~~~i~~I~~i~~~~G~~a~At~~~~~~l~~g~~i~g~vVvvSe~~irv~~~~~~  245 (395)
T PF08596_consen  166 NVLTFKILPSSNGRFSVQFAGATTNHDSPILSIIPINADTGESALATISAMQGLSKGISIPGYVVVVSESDIRVFKPPKS  245 (395)
T ss_dssp             EEEEEEEEE-GGG-EEEEEEEEE--SS----EEEEEETTT--B-B-BHHHHHGGGGT----EEEEEE-SSEEEEE-TT--
T ss_pred             CEEEEEEecCCCCceEEEEeeccccCCCceEEEEEEECCCCCcccCchhHhhccccCCCcCcEEEEEcccceEEEeCCCC
Confidence            77666    455543332222221122222222222                   11  123577888889999999987


Q ss_pred             CCCCcEEEEcCCCCeeEEEEEE-e--cCCcEEEEEEecCCceEEEEec
Q 047869         1943 NISPLHYFTLPDDMIVDATLVI-A--SRGKMFLIVLSECGSLYRLELS 1987 (2233)
Q Consensus      1943 ~lSPvyyF~LpsGkIrDaTfv~-~--e~G~~~ILVLSS~G~LY~Qels 1987 (2233)
                      ...-+.+   -.+...+++-+. .  ..+...++.+..+|.+..--++
T Consensus       246 k~~~K~~---~~~~~~~~~~vv~~~~~~~~~~Lv~l~~~G~i~i~SLP  290 (395)
T PF08596_consen  246 KGAHKSF---DDPFLCSSASVVPTISRNGGYCLVCLFNNGSIRIYSLP  290 (395)
T ss_dssp             -EEEEE----SS-EEEEEEEEEEEE-EEEEEEEEEEETTSEEEEEETT
T ss_pred             cccceee---ccccccceEEEEeecccCCceEEEEEECCCcEEEEECC
Confidence            7533333   333455544332 2  3466899999999999988776


No 46 
>PF00643 zf-B_box:  B-box zinc finger;  InterPro: IPR000315 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  This entry represents B-box-type zinc finger domains, which are around 40 residues in length. B-box zinc fingers can be divided into two groups, where types 1 and 2 B-box domains differ in their consensus sequence and in the spacing of the 7-8 zinc-binding residues. Several proteins contain both types 1 and 2 B-boxes, suggesting some level of cooperativity between these two domains.
B-box domains are found in over 1500 proteins from a variety of organisms. They are found in TRIM (tripartite motif) proteins that consist of an N-terminal RING finger (originally called an A-box), followed by 1-2 B-box domains and a coiled-coil domain (also called RBCC for Ring, B-box, Coiled-Coil). TRIM proteins contain a type 2 B-box domain, and may also contain a type 1 B-box. In proteins that do not contain RING or coiled-coil domains, the B-box domain is primarily type 2. Many type 2 B-box proteins are involved in ubiquitinylation. Proteins containing a B-box zinc finger domain include transcription factors, ribonucleoproteins and proto-oncoproteins; for example, MID1, MID2, TRIM9, TNL, TRIM36, TRIM63, TRIFIC, NCL1 and CONSTANS-like proteins. The microtubule-associated E3 ligase MID1 (6.3.2 from EC) contains a type 1 B-box zinc finger domain. MID1 specifically binds Alpha-4, which in turn recruits the catalytic subunit of phosphatase 2A (PP2Ac). This complex is required for targeting of PP2Ac for proteasome-mediated degradation. The MID1 B-box coordinates two zinc ions and adopts a beta/beta/alpha cross-brace structure similar to that of ZZ, PHD, RING and FYVE zinc fingers. More information about these proteins can be found at Protein of the Month: Zinc Fingers.; GO: 0008270 zinc ion binding, 0005622 intracellular; PDB: 3DDT_B 2D8U_A 3Q1D_A 2EGM_A 2YVR_B 2DJA_A 2DQ5_A 2JUN_A 2YRG_A 2DID_A ....
Probab=77.37  E-value=1.9  Score=37.41  Aligned_cols=28  Identities=36%  Similarity=0.774  Sum_probs=25.8

Q ss_pred             ceEeeccCCCCCCceeehhhhhhhcCCCcEEE
Q 047869         1604 HWYFCYTCDLTVSKGCCSVCAKVCHRGHRVVY 1635 (2233)
Q Consensus      1604 ~~Y~C~TC~l~~~~GVC~aCA~vCHkGHdVvy 1635 (2233)
                      .-|.|.+|..    .+|..|+..=|+||+++.
T Consensus        14 ~~~~C~~C~~----~~C~~C~~~~H~~H~~~~   41 (42)
T PF00643_consen   14 LSLFCEDCNE----PLCSECTVSGHKGHKIVP   41 (42)
T ss_dssp             EEEEETTTTE----EEEHHHHHTSTTTSEEEE
T ss_pred             eEEEecCCCC----ccCccCCCCCCCCCEEeE
Confidence            6899999997    899999999999999875


No 47 
>KOG0279 consensus G protein beta subunit-like protein [Signal transduction mechanisms]
Probab=76.70  E-value=66  Score=39.62  Aligned_cols=90  Identities=17%  Similarity=0.255  Sum_probs=61.3

Q ss_pred             EEEEeecccCccceEEeecccceEEEEecCCCceeeeeeeeeccC--CceEEEeEEecCCCceEEEEe-cC-eEEEEeCc
Q 047869         1865 EIVHLAFNSIVENYLTVAGYEDCQVLTLNPRGEVTDRLAIELALQ--GAYIRRVDWVPGSPVQLMVVT-NK-FVKIYDLS 1940 (2233)
Q Consensus      1865 eVlsLafNP~nEdyLAVcGLkDC~VLTfss~GeV~DRL~LeL~Le--g~fIIKa~WLPGSQt~LAVVT-~~-FVKIYDLS 1940 (2233)
                      +|++++|||-|  .=+|-|-+|-.|.-.|--|+-    ..+.+-+  .+.|-.+.|.|..-..+.|-+ .| -||||||.
T Consensus       107 dVlsva~s~dn--~qivSGSrDkTiklwnt~g~c----k~t~~~~~~~~WVscvrfsP~~~~p~Ivs~s~DktvKvWnl~  180 (315)
T KOG0279|consen  107 DVLSVAFSTDN--RQIVSGSRDKTIKLWNTLGVC----KYTIHEDSHREWVSCVRFSPNESNPIIVSASWDKTVKVWNLR  180 (315)
T ss_pred             ceEEEEecCCC--ceeecCCCcceeeeeeecccE----EEEEecCCCcCcEEEEEEcCCCCCcEEEEccCCceEEEEccC
Confidence            58999999854  457889999999888877643    2333333  578999999999633333333 33 99999997


Q ss_pred             CCCCCCcEEEEcCCCCeeEEEE
Q 047869         1941 QDNISPLHYFTLPDDMIVDATL 1962 (2233)
Q Consensus      1941 ~D~lSPvyyF~LpsGkIrDaTf 1962 (2233)
                      .=.+  .+.|.=-+|.+..+|+
T Consensus       181 ~~~l--~~~~~gh~~~v~t~~v  200 (315)
T KOG0279|consen  181 NCQL--RTTFIGHSGYVNTVTV  200 (315)
T ss_pred             Ccch--hhccccccccEEEEEE
Confidence            5443  3344445666666665


No 48 
>KOG0269 consensus WD40 repeat-containing protein [Function unknown]
Probab=75.03  E-value=14  Score=49.43  Aligned_cols=144  Identities=19%  Similarity=0.300  Sum_probs=98.3

Q ss_pred             EEEeeCCeEEEEechhhhcccccCCccccccccccccccccceEEEEeecccCccceEEeecccceEEEEecCCCceeee
Q 047869         1822 LAVGEGDKVAIFDVGQLIGQATIQPVTADKTNVKPLSRNIVRFEIVHLAFNSIVENYLTVAGYEDCQVLTLNPRGEVTDR 1901 (2233)
Q Consensus      1822 LAVaEgdKVTILqlsaLLkQad~s~~skdKlTLtrLSsa~VpFeVlsLafNP~nEdyLAVcGLkDC~VLTfss~GeV~DR 1901 (2233)
                      .+++|+|-+-++|+++    .+-   ...|+|    +.+   =.|.-+..+| |..|||-+| +|=.|..++-.|   -|
T Consensus       193 ~s~~dsG~lqlWDlRq----p~r---~~~k~~----AH~---GpV~c~nwhP-nr~~lATGG-RDK~vkiWd~t~---~~  253 (839)
T KOG0269|consen  193 ASIHDSGYLQLWDLRQ----PDR---CEKKLT----AHN---GPVLCLNWHP-NREWLATGG-RDKMVKIWDMTD---SR  253 (839)
T ss_pred             EEecCCceEEEeeccC----chh---HHHHhh----ccc---CceEEEeecC-CCceeeecC-CCccEEEEeccC---CC
Confidence            5567889888888753    220   122222    111   1356788889 999999999 999999997775   22


Q ss_pred             eeeeeccC-CceEEEeEEecCCCceEE---EEecCeEEEEeCcCCCCCCcEEEEcCCCCeeEEEEEEecCCcEEEEEEec
Q 047869         1902 LAIELALQ-GAYIRRVDWVPGSPVQLM---VVTNKFVKIYDLSQDNISPLHYFTLPDDMIVDATLVIASRGKMFLIVLSE 1977 (2233)
Q Consensus      1902 L~LeL~Le-g~fIIKa~WLPGSQt~LA---VVT~~FVKIYDLS~D~lSPvyyF~LpsGkIrDaTfv~~e~G~~~ILVLSS 1977 (2233)
                      .-.....+ ++-+-|+.|=|..+..||   .|.--.|.|||+....+ |-+.|.--.+.+.+++ | ++.....+.--|.
T Consensus       254 ~~~~~tInTiapv~rVkWRP~~~~hLAtcsmv~dtsV~VWDvrRPYI-P~~t~~eH~~~vt~i~-W-~~~d~~~l~s~sK  330 (839)
T KOG0269|consen  254 AKPKHTINTIAPVGRVKWRPARSYHLATCSMVVDTSVHVWDVRRPYI-PYATFLEHTDSVTGIA-W-DSGDRINLWSCSK  330 (839)
T ss_pred             ccceeEEeecceeeeeeeccCccchhhhhhccccceEEEEeeccccc-cceeeeccCcccccee-c-cCCCceeeEeecC
Confidence            22223333 667999999999999998   45556999999998765 6677766565555554 3 2333556666788


Q ss_pred             CCceEEEEec
Q 047869         1978 CGSLYRLELS 1987 (2233)
Q Consensus      1978 ~G~LY~Qels 1987 (2233)
                      +|-+|-+.+.
T Consensus       331 D~tv~qh~~k  340 (839)
T KOG0269|consen  331 DGTVLQHLFK  340 (839)
T ss_pred             ccHHHHhhhh
Confidence            8988866554


No 49 
>PF08662 eIF2A:  Eukaryotic translation initiation factor eIF2A;  InterPro: IPR013979  This entry contains beta propeller domains found in eukaryotic translation initiation factors and TolB domain-containing proteins.
Probab=72.92  E-value=33  Score=38.77  Aligned_cols=98  Identities=12%  Similarity=0.224  Sum_probs=63.7

Q ss_pred             EEEeecccCccceEEeecccceEEEEecCCCceeeeeeeeeccCCceEEEeEEecCCCceEEEEec----CeEEEEeCcC
Q 047869         1866 IVHLAFNSIVENYLTVAGYEDCQVLTLNPRGEVTDRLAIELALQGAYIRRVDWVPGSPVQLMVVTN----KFVKIYDLSQ 1941 (2233)
Q Consensus      1866 VlsLafNP~nEdyLAVcGLkDC~VLTfss~GeV~DRL~LeL~Leg~fIIKa~WLPGSQt~LAVVT~----~FVKIYDLS~ 1941 (2233)
                      |..++++|-.+.|.++.|..+..|--++.+|+.+-.      ++...+..+.|=|..+. |+++..    -.|.|||.. 
T Consensus        62 I~~~~WsP~g~~favi~g~~~~~v~lyd~~~~~i~~------~~~~~~n~i~wsP~G~~-l~~~g~~n~~G~l~~wd~~-  133 (194)
T PF08662_consen   62 IHDVAWSPNGNEFAVIYGSMPAKVTLYDVKGKKIFS------FGTQPRNTISWSPDGRF-LVLAGFGNLNGDLEFWDVR-  133 (194)
T ss_pred             eEEEEECcCCCEEEEEEccCCcccEEEcCcccEeEe------ecCCCceEEEECCCCCE-EEEEEccCCCcEEEEEECC-
Confidence            899999996666666669777777777776544432      23456677899999875 555542    259999998 


Q ss_pred             CCCCCcEEEEcCCCCeeEEEEEEecCCcEEEEEEe
Q 047869         1942 DNISPLHYFTLPDDMIVDATLVIASRGKMFLIVLS 1976 (2233)
Q Consensus      1942 D~lSPvyyF~LpsGkIrDaTfv~~e~G~~~ILVLS 1976 (2233)
                       +..++..+..+  .+.++.  ..++|++++...|
T Consensus       134 -~~~~i~~~~~~--~~t~~~--WsPdGr~~~ta~t  163 (194)
T PF08662_consen  134 -KKKKISTFEHS--DATDVE--WSPDGRYLATATT  163 (194)
T ss_pred             -CCEEeeccccC--cEEEEE--EcCCCCEEEEEEe
Confidence             44455554433  344443  2578976655443


No 50 
>KOG1274 consensus WD40 repeat protein [General function prediction only]
Probab=72.16  E-value=23  Score=48.19  Aligned_cols=118  Identities=15%  Similarity=0.291  Sum_probs=74.7

Q ss_pred             ceEEEEeecccCccceEEeecc-cceEEEEecCCCceeeee-eeeeccC---CceEEEeEEecCCCceEEEEecCeEEEE
Q 047869         1863 RFEIVHLAFNSIVENYLTVAGY-EDCQVLTLNPRGEVTDRL-AIELALQ---GAYIRRVDWVPGSPVQLMVVTNKFVKIY 1937 (2233)
Q Consensus      1863 pFeVlsLafNP~nEdyLAVcGL-kDC~VLTfss~GeV~DRL-~LeL~Le---g~fIIKa~WLPGSQt~LAVVT~~FVKIY 1937 (2233)
                      .=+|.+|.|+| +++||||.-. -.++|.-|. +|.+.--+ -+.+-.+   ..-+.+..|=|.+-+.+++-+-++||||
T Consensus       138 ~apVl~l~~~p-~~~fLAvss~dG~v~iw~~~-~~~~~~tl~~v~k~n~~~~s~i~~~~aW~Pk~g~la~~~~d~~Vkvy  215 (933)
T KOG1274|consen  138 DAPVLQLSYDP-KGNFLAVSSCDGKVQIWDLQ-DGILSKTLTGVDKDNEFILSRICTRLAWHPKGGTLAVPPVDNTVKVY  215 (933)
T ss_pred             CCceeeeeEcC-CCCEEEEEecCceEEEEEcc-cchhhhhcccCCccccccccceeeeeeecCCCCeEEeeccCCeEEEE
Confidence            34688999999 8899998754 345666663 33222211 1111122   3357899999999998888899999999


Q ss_pred             eCcCCCCCCcEEEEcC--CCCeeEEEEEEecCCcEEEEEEecCCceEEEEec
Q 047869         1938 DLSQDNISPLHYFTLP--DDMIVDATLVIASRGKMFLIVLSECGSLYRLELS 1987 (2233)
Q Consensus      1938 DLS~D~lSPvyyF~Lp--sGkIrDaTfv~~e~G~~~ILVLSS~G~LY~Qels 1987 (2233)
                      +-.  .-++.+.+..-  +-++.|.++  .+.|+ ||-..+-+|.|-.=++.
T Consensus       216 ~r~--~we~~f~Lr~~~~ss~~~~~~w--sPnG~-YiAAs~~~g~I~vWnv~  262 (933)
T KOG1274|consen  216 SRK--GWELQFKLRDKLSSSKFSDLQW--SPNGK-YIAASTLDGQILVWNVD  262 (933)
T ss_pred             ccC--CceeheeecccccccceEEEEE--cCCCc-EEeeeccCCcEEEEecc
Confidence            754  44444444322  223556663  35674 67777778877764444


No 51 
>KOG1445 consensus Tumor-specific antigen (contains WD repeats) [Cytoskeleton]
Probab=68.54  E-value=67  Score=42.79  Aligned_cols=154  Identities=18%  Similarity=0.261  Sum_probs=101.3

Q ss_pred             cCceEEE---eeCCeEEEEechhhhcccccCCccccccccccccccccceEEEEeecccCccceEEeeccc-ceEEEEec
Q 047869         1818 SRGRLAV---GEGDKVAIFDVGQLIGQATIQPVTADKTNVKPLSRNIVRFEIVHLAFNSIVENYLTVAGYE-DCQVLTLN 1893 (2233)
Q Consensus      1818 ~rGrLAV---aEgdKVTILqlsaLLkQad~s~~skdKlTLtrLSsa~VpFeVlsLafNP~nEdyLAVcGLk-DC~VLTfs 1893 (2233)
                      +.-|+||   +-||+|.|+.++.=-+-.|..           +-.--=+--|..+..||-....|||++=. .+.+-++.
T Consensus       590 n~~rvAVPL~g~gG~iai~el~~PGrLPDgv-----------~p~l~Ngt~vtDl~WdPFD~~rLAVa~ddg~i~lWr~~  658 (1012)
T KOG1445|consen  590 NNKRVAVPLAGSGGVIAIYELNEPGRLPDGV-----------MPGLFNGTLVTDLHWDPFDDERLAVATDDGQINLWRLT  658 (1012)
T ss_pred             ccceEEEEecCCCceEEEEEcCCCCCCCccc-----------ccccccCceeeecccCCCChHHeeecccCceEEEEEec
Confidence            3457887   479999999997655555522           11222244578899999999999998732 34566676


Q ss_pred             CCCceeeeeeeeecc--CCceEEEeEEecCCCceEEEEecC-eEEEEeCcCCCCCCcEEEEcCCCCeeEEEEEEecCCcE
Q 047869         1894 PRGEVTDRLAIELAL--QGAYIRRVDWVPGSPVQLMVVTNK-FVKIYDLSQDNISPLHYFTLPDDMIVDATLVIASRGKM 1970 (2233)
Q Consensus      1894 s~GeV~DRL~LeL~L--eg~fIIKa~WLPGSQt~LAVVT~~-FVKIYDLS~D~lSPvyyF~LpsGkIrDaTfv~~e~G~~ 1970 (2233)
                      .+|--.-....+-.+  .++-|--++|=|=-.--||++..+ .|++|||..-..-  .-|.=-.|.|-+.+ | ..+|+ 
T Consensus       659 a~gl~e~~~tPe~~lt~h~eKI~slRfHPLAadvLa~asyd~Ti~lWDl~~~~~~--~~l~gHtdqIf~~A-W-SpdGr-  733 (1012)
T KOG1445|consen  659 ANGLPENEMTPEKILTIHGEKITSLRFHPLAADVLAVASYDSTIELWDLANAKLY--SRLVGHTDQIFGIA-W-SPDGR-  733 (1012)
T ss_pred             cCCCCcccCCcceeeecccceEEEEEecchhhhHhhhhhccceeeeeehhhhhhh--heeccCcCceeEEE-E-CCCCc-
Confidence            666433222333223  267888999999887778888777 8999999865521  12222366777766 4 35675 


Q ss_pred             EEEEEecCCceEEEEec
Q 047869         1971 FLIVLSECGSLYRLELS 1987 (2233)
Q Consensus      1971 ~ILVLSS~G~LY~Qels 1987 (2233)
                      .+--.-.+|.|..++=.
T Consensus       734 ~~AtVcKDg~~rVy~Pr  750 (1012)
T KOG1445|consen  734 RIATVCKDGTLRVYEPR  750 (1012)
T ss_pred             ceeeeecCceEEEeCCC
Confidence            44555689998876544


No 52 
>KOG0315 consensus G-protein beta subunit-like protein (contains WD40 repeats) [General function prediction only]
Probab=66.91  E-value=24  Score=42.70  Aligned_cols=75  Identities=21%  Similarity=0.368  Sum_probs=59.6

Q ss_pred             CceEEEeEEecCCCceEEEEecCeEEEEeCcCCCCCCcEEEEcCCCCeeEEEEEEecCCcEEEEEEecCCceEEEEecc
Q 047869         1910 GAYIRRVDWVPGSPVQLMVVTNKFVKIYDLSQDNISPLHYFTLPDDMIVDATLVIASRGKMFLIVLSECGSLYRLELSV 1988 (2233)
Q Consensus      1910 g~fIIKa~WLPGSQt~LAVVT~~FVKIYDLS~D~lSPvyyF~LpsGkIrDaTfv~~e~G~~~ILVLSS~G~LY~Qels~ 1988 (2233)
                      +..+.+.+=-|. ...||++.++.|++||++..+-.|+-.|-.+..+|..+.|-  .+| .-+.--|++|-+=+=+|+.
T Consensus        40 dsqVNrLeiTpd-k~~LAaa~~qhvRlyD~~S~np~Pv~t~e~h~kNVtaVgF~--~dg-rWMyTgseDgt~kIWdlR~  114 (311)
T KOG0315|consen   40 DSQVNRLEITPD-KKDLAAAGNQHVRLYDLNSNNPNPVATFEGHTKNVTAVGFQ--CDG-RWMYTGSEDGTVKIWDLRS  114 (311)
T ss_pred             ccceeeEEEcCC-cchhhhccCCeeEEEEccCCCCCceeEEeccCCceEEEEEe--ecC-eEEEecCCCceEEEEeccC
Confidence            444445554444 34699999999999999999999999999999999887764  456 5788888999888877763


No 53 
>PF03178 CPSF_A:  CPSF A subunit region;  InterPro: IPR004871 This family includes a region that lies towards the C terminus of the cleavage and polyadenylation specificity factor (CPSF) A (160 kDa) subunit. CPSF is involved in mRNA polyadenylation and binds the AAUAAA conserved sequence in pre-mRNA. CPSF has also been found to be necessary for splicing of single-intron pre-mRNAs. The function of the aligned region is unknown but may be involved in RNA/DNA binding.; GO: 0003676 nucleic acid binding, 0005634 nucleus; PDB: 2B5M_A 4A0K_C 4A0B_C 3I7L_A 3I8E_A 4A09_A 4A0A_A 3EI4_C 2B5L_A 3I7O_A ....
Probab=63.97  E-value=1.1e+02  Score=36.65  Aligned_cols=149  Identities=18%  Similarity=0.283  Sum_probs=92.7

Q ss_pred             CceEEEeeCCeEEEEechhhhcccccCCccccccccccccccccceEEEEeecccCccceEEeec-ccceEEEEecCCCc
Q 047869         1819 RGRLAVGEGDKVAIFDVGQLIGQATIQPVTADKTNVKPLSRNIVRFEIVHLAFNSIVENYLTVAG-YEDCQVLTLNPRGE 1897 (2233)
Q Consensus      1819 rGrLAVaEgdKVTILqlsaLLkQad~s~~skdKlTLtrLSsa~VpFeVlsLafNP~nEdyLAVcG-LkDC~VLTfss~Ge 1897 (2233)
                      .|+|+++-|.+|.++++.     .+       + ++.+.+....++.|.+|..   .+|+++|.. .+=..++.++..+.
T Consensus        98 ~~~lv~~~g~~l~v~~l~-----~~-------~-~l~~~~~~~~~~~i~sl~~---~~~~I~vgD~~~sv~~~~~~~~~~  161 (321)
T PF03178_consen   98 NGRLVVAVGNKLYVYDLD-----NS-------K-TLLKKAFYDSPFYITSLSV---FKNYILVGDAMKSVSLLRYDEENN  161 (321)
T ss_dssp             TTEEEEEETTEEEEEEEE-----TT-------S-SEEEEEEE-BSSSEEEEEE---ETTEEEEEESSSSEEEEEEETTTE
T ss_pred             CCEEEEeecCEEEEEEcc-----Cc-------c-cchhhheecceEEEEEEec---cccEEEEEEcccCEEEEEEEccCC
Confidence            788999999999999982     22       1 6888888888999999987   577888776 47789999988543


Q ss_pred             eeeeeeeeeccCCceEEEeEEecCCCceEEEEec-CeEEEEeCcCC---------CCCCcEEEEcCCCCeeEE---EEEE
Q 047869         1898 VTDRLAIELALQGAYIRRVDWVPGSPVQLMVVTN-KFVKIYDLSQD---------NISPLHYFTLPDDMIVDA---TLVI 1964 (2233)
Q Consensus      1898 V~DRL~LeL~Leg~fIIKa~WLPGSQt~LAVVT~-~FVKIYDLS~D---------~lSPvyyF~LpsGkIrDa---Tfv~ 1964 (2233)
                      -...+.=+  .....+..+.-++... .++++.. ..+.++....+         .+.+...|-+.+ .|...   ++..
T Consensus       162 ~l~~va~d--~~~~~v~~~~~l~d~~-~~i~~D~~gnl~~l~~~~~~~~~~~~~~~L~~~~~f~lg~-~v~~~~~~~l~~  237 (321)
T PF03178_consen  162 KLILVARD--YQPRWVTAAEFLVDED-TIIVGDKDGNLFVLRYNPEIPNSRDGDPKLERISSFHLGD-IVNSFRRGSLIP  237 (321)
T ss_dssp             -EEEEEEE--SS-BEEEEEEEE-SSS-EEEEEETTSEEEEEEE-SS-SSTTTTTTBEEEEEEEE-SS--EEEEEE--SS-
T ss_pred             EEEEEEec--CCCccEEEEEEecCCc-EEEEEcCCCeEEEEEECCCCcccccccccceeEEEEECCC-ccceEEEEEeee
Confidence            22222111  1256788899995544 3433333 37777777643         344677787775 56555   3332


Q ss_pred             ecCCc-----EEEEEEecCCceE-EEE-ec
Q 047869         1965 ASRGK-----MFLIVLSECGSLY-RLE-LS 1987 (2233)
Q Consensus      1965 ~e~G~-----~~ILVLSS~G~LY-~Qe-ls 1987 (2233)
                      ...+.     ..++..|.+|.|| ..+ ++
T Consensus       238 ~~~~~~~~~~~~i~~~T~~G~Ig~l~p~l~  267 (321)
T PF03178_consen  238 RSGSSESPNRPQILYGTVDGSIGVLIPFLS  267 (321)
T ss_dssp             -SSSS-TTEEEEEEEEETTS-EEEEEE-E-
T ss_pred             cCCCCcccccceEEEEecCCEEEEEEecCC
Confidence            21122     3588889999999 556 44


No 54 
>KOG2055 consensus WD40 repeat protein [General function prediction only]
Probab=63.51  E-value=1.7e+02  Score=38.32  Aligned_cols=158  Identities=19%  Similarity=0.308  Sum_probs=100.5

Q ss_pred             EEEeecccCccceEEeecccc-eEEEEecCCCceeeeeeeeeccCCceEEEeEEecCCCceEEEEecC-eEEEEeCcCCC
Q 047869         1866 IVHLAFNSIVENYLTVAGYED-CQVLTLNPRGEVTDRLAIELALQGAYIRRVDWVPGSPVQLMVVTNK-FVKIYDLSQDN 1943 (2233)
Q Consensus      1866 VlsLafNP~nEdyLAVcGLkD-C~VLTfss~GeV~DRL~LeL~Leg~fIIKa~WLPGSQt~LAVVT~~-FVKIYDLS~D~ 1943 (2233)
                      |-+|.|-| +-..|.|||+.. ..|+-++  |.+.-+ .-.+-+.+--|.++...|+-+..++..+.. |.-+|||-...
T Consensus       216 I~sv~FHp-~~plllvaG~d~~lrifqvD--Gk~N~~-lqS~~l~~fPi~~a~f~p~G~~~i~~s~rrky~ysyDle~ak  291 (514)
T KOG2055|consen  216 ITSVQFHP-TAPLLLVAGLDGTLRIFQVD--GKVNPK-LQSIHLEKFPIQKAEFAPNGHSVIFTSGRRKYLYSYDLETAK  291 (514)
T ss_pred             ceEEEecC-CCceEEEecCCCcEEEEEec--CccChh-heeeeeccCccceeeecCCCceEEEecccceEEEEeeccccc
Confidence            45788866 788999999964 5577663  444443 334445666799999999988777766655 99999997776


Q ss_pred             CCCcEEE-EcCCCCeeEEEEEEecCCcEEEEEEecCCceEEEEecccCCCccccceeeeecccccccCCeEEEEeccccc
Q 047869         1944 ISPLHYF-TLPDDMIVDATLVIASRGKMFLIVLSECGSLYRLELSVEGNVGATPLKEIIQFNDREIHAKGLSLYFSSTYK 2022 (2233)
Q Consensus      1944 lSPvyyF-~LpsGkIrDaTfv~~e~G~~~ILVLSS~G~LY~Qels~s~d~g~~~ltEvvq~~~~q~~~~GVSVyYS~tl~ 2022 (2233)
                      +.+..-. .++.-.++-..+  ...|. +|++--..||||.--.. +     ..+...+     +++|.--.+.||.+-+
T Consensus       292 ~~k~~~~~g~e~~~~e~FeV--Shd~~-fia~~G~~G~I~lLhak-T-----~eli~s~-----KieG~v~~~~fsSdsk  357 (514)
T KOG2055|consen  292 VTKLKPPYGVEEKSMERFEV--SHDSN-FIAIAGNNGHIHLLHAK-T-----KELITSF-----KIEGVVSDFTFSSDSK  357 (514)
T ss_pred             cccccCCCCcccchhheeEe--cCCCC-eEEEcccCceEEeehhh-h-----hhhhhee-----eeccEEeeEEEecCCc
Confidence            6643311 111222333321  23443 66677777877743211 0     0011112     2344455678888889


Q ss_pred             eeeEEecCCcEEEEEcCCC
Q 047869         2023 LLFLSFQDGTTLVGRLSPN 2041 (2233)
Q Consensus      2023 LLF~SY~~G~Sf~a~Ls~~ 2041 (2233)
                      .|.+|=..|+.+...|...
T Consensus       358 ~l~~~~~~GeV~v~nl~~~  376 (514)
T KOG2055|consen  358 ELLASGGTGEVYVWNLRQN  376 (514)
T ss_pred             EEEEEcCCceEEEEecCCc
Confidence            9999999999999988544


No 55 
>KOG0293 consensus WD40 repeat-containing protein [Function unknown]
Probab=62.89  E-value=63  Score=41.37  Aligned_cols=155  Identities=14%  Similarity=0.161  Sum_probs=84.1

Q ss_pred             eEEEEeecccCccceEEeecccceEEEEecCCCceeeeeeeeeccCCceEEEeEEecCCCceEEEEecCeEEEEeCcCCC
Q 047869         1864 FEIVHLAFNSIVENYLTVAGYEDCQVLTLNPRGEVTDRLAIELALQGAYIRRVDWVPGSPVQLMVVTNKFVKIYDLSQDN 1943 (2233)
Q Consensus      1864 FeVlsLafNP~nEdyLAVcGLkDC~VLTfss~GeV~DRL~LeL~Leg~fIIKa~WLPGSQt~LAVVT~~FVKIYDLS~D~ 1943 (2233)
                      ..|.-|.++| .++||+.||..||..+   .+-...|....-+.--|.-...+.|.|..+..++=-+.+.+--|||.-..
T Consensus       270 ~~V~yi~wSP-DdryLlaCg~~e~~~l---wDv~tgd~~~~y~~~~~~S~~sc~W~pDg~~~V~Gs~dr~i~~wdlDgn~  345 (519)
T KOG0293|consen  270 QPVSYIMWSP-DDRYLLACGFDEVLSL---WDVDTGDLRHLYPSGLGFSVSSCAWCPDGFRFVTGSPDRTIIMWDLDGNI  345 (519)
T ss_pred             CceEEEEECC-CCCeEEecCchHheee---ccCCcchhhhhcccCcCCCcceeEEccCCceeEecCCCCcEEEecCCcch
Confidence            4567788988 8999999999999333   22222333333222236778999999999874333334455556665443


Q ss_pred             CCCcEEEEcCCCCeeEEEEEEecCCcEEEEEEecCCceEEEEecccCCCccccceeeeecccccccCCeEEEEeccccce
Q 047869         1944 ISPLHYFTLPDDMIVDATLVIASRGKMFLIVLSECGSLYRLELSVEGNVGATPLKEIIQFNDREIHAKGLSLYFSSTYKL 2023 (2233)
Q Consensus      1944 lSPvyyF~LpsGkIrDaTfv~~e~G~~~ILVLSS~G~LY~Qels~s~d~g~~~ltEvvq~~~~q~~~~GVSVyYS~tl~L 2023 (2233)
                      +--=.+...  -+|.|.++  +.+|+.. +.+.++-.|..-+.+-.-+-+         +.  .-..+=-|..-|.+-++
T Consensus       346 ~~~W~gvr~--~~v~dlai--t~Dgk~v-l~v~~d~~i~l~~~e~~~dr~---------li--se~~~its~~iS~d~k~  409 (519)
T KOG0293|consen  346 LGNWEGVRD--PKVHDLAI--TYDGKYV-LLVTVDKKIRLYNREARVDRG---------LI--SEEQPITSFSISKDGKL  409 (519)
T ss_pred             hhccccccc--ceeEEEEE--cCCCcEE-EEEecccceeeechhhhhhhc---------cc--cccCceeEEEEcCCCcE
Confidence            222222222  25666663  3567644 444455555544333111111         01  11222345555666666


Q ss_pred             eeEEecCCcEEEEEc
Q 047869         2024 LFLSFQDGTTLVGRL 2038 (2233)
Q Consensus      2024 LF~SY~~G~Sf~a~L 2038 (2233)
                      ..++.++.+.++=.+
T Consensus       410 ~LvnL~~qei~LWDl  424 (519)
T KOG0293|consen  410 ALVNLQDQEIHLWDL  424 (519)
T ss_pred             EEEEcccCeeEEeec
Confidence            666666666666544


No 56 
>KOG4378 consensus Nuclear protein COP1 [Signal transduction mechanisms]
Probab=62.12  E-value=48  Score=43.06  Aligned_cols=119  Identities=21%  Similarity=0.280  Sum_probs=72.2

Q ss_pred             hhcCcccccc-eecccCceEE-EeeCCeEEEEechhhhcccccCCccccccccccccccccceEEEEeecccCccceEEe
Q 047869         1804 LASGSLVKSL-LSVSSRGRLA-VGEGDKVAIFDVGQLIGQATIQPVTADKTNVKPLSRNIVRFEIVHLAFNSIVENYLTV 1881 (2233)
Q Consensus      1804 l~sGq~iRqL-LSas~rGrLA-VaEgdKVTILqlsaLLkQad~s~~skdKlTLtrLSsa~VpFeVlsLafNP~nEdyLAV 1881 (2233)
                      +.+||.+|-+ -+-+.|-.|- +..+|-|+++|+..   +..  ....++     .-+||-    -.|.|+|.||..||-
T Consensus       161 ~~sgqsvRll~ys~skr~lL~~asd~G~VtlwDv~g---~sp--~~~~~~-----~HsAP~----~gicfspsne~l~vs  226 (673)
T KOG4378|consen  161 IDSGQSVRLLRYSPSKRFLLSIASDKGAVTLWDVQG---MSP--IFHASE-----AHSAPC----RGICFSPSNEALLVS  226 (673)
T ss_pred             cCCCCeEEEeecccccceeeEeeccCCeEEEEeccC---CCc--ccchhh-----hccCCc----CcceecCCccceEEE
Confidence            3678888654 2233333343 35999999999832   111  111222     222222    278999999999999


Q ss_pred             ecccceEEEEecCCCceeeeeeeeeccCCceEEEeEEecCCCceEEEEecCeEEEEeCcCCCCCCcEEEEcCCCCeeEEE
Q 047869         1882 AGYEDCQVLTLNPRGEVTDRLAIELALQGAYIRRVDWVPGSPVQLMVVTNKFVKIYDLSQDNISPLHYFTLPDDMIVDAT 1961 (2233)
Q Consensus      1882 cGLkDC~VLTfss~GeV~DRL~LeL~Leg~fIIKa~WLPGSQt~LAVVT~~FVKIYDLS~D~lSPvyyF~LpsGkIrDaT 1961 (2233)
                      +|| ||.|.++..+                                             .-..+-...|..|-     .|
T Consensus       227 VG~-Dkki~~yD~~---------------------------------------------s~~s~~~l~y~~Pl-----st  255 (673)
T KOG4378|consen  227 VGY-DKKINIYDIR---------------------------------------------SQASTDRLTYSHPL-----ST  255 (673)
T ss_pred             ecc-cceEEEeecc---------------------------------------------cccccceeeecCCc-----ce
Confidence            997 4666655433                                             33333333444443     12


Q ss_pred             EEEecCCcEEEEEEecCCceEEEEecc
Q 047869         1962 LVIASRGKMFLIVLSECGSLYRLELSV 1988 (2233)
Q Consensus      1962 fv~~e~G~~~ILVLSS~G~LY~Qels~ 1988 (2233)
                      +-+-+.| .++.+=++.|.||..+|+.
T Consensus       256 vaf~~~G-~~L~aG~s~G~~i~YD~R~  281 (673)
T KOG4378|consen  256 VAFSECG-TYLCAGNSKGELIAYDMRS  281 (673)
T ss_pred             eeecCCc-eEEEeecCCceEEEEeccc
Confidence            3334667 6888899999999999984


No 57 
>PRK01742 tolB translocation protein TolB; Provisional
Probab=61.90  E-value=4.2e+02  Score=33.46  Aligned_cols=108  Identities=15%  Similarity=0.244  Sum_probs=57.2

Q ss_pred             EEEEeecccCccceEEeecccc--eEEEEecCCCceeeeeeeeeccCCceEEEeEEecCCCceEEEEe--cCeEEEEeCc
Q 047869         1865 EIVHLAFNSIVENYLTVAGYED--CQVLTLNPRGEVTDRLAIELALQGAYIRRVDWVPGSPVQLMVVT--NKFVKIYDLS 1940 (2233)
Q Consensus      1865 eVlsLafNP~nEdyLAVcGLkD--C~VLTfss~GeV~DRL~LeL~Leg~fIIKa~WLPGSQt~LAVVT--~~FVKIYDLS 1940 (2233)
                      .|.+.+++| ++++||.++..+  -+|..++-.+.-..++  . ..+| ......|-|..+. ||++.  ...++||.+.
T Consensus       205 ~v~~p~wSP-DG~~la~~s~~~~~~~i~i~dl~tg~~~~l--~-~~~g-~~~~~~wSPDG~~-La~~~~~~g~~~Iy~~d  278 (429)
T PRK01742        205 PLMSPAWSP-DGSKLAYVSFENKKSQLVVHDLRSGARKVV--A-SFRG-HNGAPAFSPDGSR-LAFASSKDGVLNIYVMG  278 (429)
T ss_pred             ccccceEcC-CCCEEEEEEecCCCcEEEEEeCCCCceEEE--e-cCCC-ccCceeECCCCCE-EEEEEecCCcEEEEEEE
Confidence            367899999 788999887642  4455554433211111  1 1222 2235789998765 55443  3467777554


Q ss_pred             CCCCCCcEEEEcCCCCeeEEEEEEecCCcEEEEEEecCCce
Q 047869         1941 QDNISPLHYFTLPDDMIVDATLVIASRGKMFLIVLSECGSL 1981 (2233)
Q Consensus      1941 ~D~lSPvyyF~LpsGkIrDaTfv~~e~G~~~ILVLSS~G~L 1981 (2233)
                      .+.-.+   -.+..+.-.+....+.++|+.++++....|..
T Consensus       279 ~~~~~~---~~lt~~~~~~~~~~wSpDG~~i~f~s~~~g~~  316 (429)
T PRK01742        279 ANGGTP---SQLTSGAGNNTEPSWSPDGQSILFTSDRSGSP  316 (429)
T ss_pred             CCCCCe---EeeccCCCCcCCEEECCCCCEEEEEECCCCCc
Confidence            433222   12333332233334457787666665556643


No 58 
>PF04841 Vps16_N:  Vps16, N-terminal region;  InterPro: IPR006926 This protein forms part of the Class C vacuolar protein sorting (Vps) complex. Vps16 is essential for vacuolar protein sorting, which is essential for viability in plants, but not yeast. The Class C Vps complex is required for SNARE-mediated membrane fusion at the lysosome-like yeast vacuole. It is thought to play essential roles in membrane docking and fusion at the Golgi-to-endosome and endosome-to-vacuole stages of transport. The role of VPS16 in this complex is not known.; GO: 0006886 intracellular protein transport, 0005737 cytoplasm
Probab=61.66  E-value=29  Score=43.65  Aligned_cols=86  Identities=19%  Similarity=0.357  Sum_probs=58.8

Q ss_pred             EEEEecCCCceeeeeeeeeccCCceEEEeEEecCCCceEEEEecC-eEEEEeCcCCCCCCcEEEEcC----CCCeeEEEE
Q 047869         1888 QVLTLNPRGEVTDRLAIELALQGAYIRRVDWVPGSPVQLMVVTNK-FVKIYDLSQDNISPLHYFTLP----DDMIVDATL 1962 (2233)
Q Consensus      1888 ~VLTfss~GeV~DRL~LeL~Leg~fIIKa~WLPGSQt~LAVVT~~-FVKIYDLS~D~lSPvyyF~Lp----sGkIrDaTf 1962 (2233)
                      .|..+|.+|...-++..+.    .-|+.+.|-.. . +|.||+.+ +|+|||+--..     .|.++    ..+|.++-+
T Consensus        62 ~I~iys~sG~ll~~i~w~~----~~iv~~~wt~~-e-~LvvV~~dG~v~vy~~~G~~-----~fsl~~~i~~~~v~e~~i  130 (410)
T PF04841_consen   62 SIQIYSSSGKLLSSIPWDS----GRIVGMGWTDD-E-ELVVVQSDGTVRVYDLFGEF-----QFSLGEEIEEEKVLECRI  130 (410)
T ss_pred             EEEEECCCCCEeEEEEECC----CCEEEEEECCC-C-eEEEEEcCCEEEEEeCCCce-----eechhhhccccCcccccc
Confidence            5778899999888865544    68999999774 4 45555555 99999986433     45544    345667744


Q ss_pred             EEecCCcEEEEEEecCCceEEE
Q 047869         1963 VIASRGKMFLIVLSECGSLYRL 1984 (2233)
Q Consensus      1963 v~~e~G~~~ILVLSS~G~LY~Q 1984 (2233)
                      ...+.+..=++||++++.+|.-
T Consensus       131 ~~~~~~~~GivvLt~~~~~~~v  152 (410)
T PF04841_consen  131 FAIWFYKNGIVVLTGNNRFYVV  152 (410)
T ss_pred             cccccCCCCEEEECCCCeEEEE
Confidence            3333333347888999999953


No 59 
>KOG1587 consensus Cytoplasmic dynein intermediate chain [Cytoskeleton]
Probab=61.39  E-value=33  Score=45.23  Aligned_cols=115  Identities=22%  Similarity=0.260  Sum_probs=84.5

Q ss_pred             eEEEEeecccCccceEEeecccceEEEEecCCCceeeeeeeeecc--CCceEEEeEEecCCCceEEEEecC-eEEEEeCc
Q 047869         1864 FEIVHLAFNSIVENYLTVAGYEDCQVLTLNPRGEVTDRLAIELAL--QGAYIRRVDWVPGSPVQLMVVTNK-FVKIYDLS 1940 (2233)
Q Consensus      1864 FeVlsLafNP~nEdyLAVcGLkDC~VLTfss~GeV~DRL~LeL~L--eg~fIIKa~WLPGSQt~LAVVT~~-FVKIYDLS 1940 (2233)
                      =.|..|.+||-.+.++..||  |=.|-.++..=    +...-+.+  .-+|+-.+.|=|.+.+.+|++..+ .+-||||-
T Consensus       399 g~v~~v~~nPF~~k~fls~g--DW~vriWs~~~----~~~Pl~~~~~~~~~v~~vaWSptrpavF~~~d~~G~l~iWDLl  472 (555)
T KOG1587|consen  399 GPVYAVSRNPFYPKNFLSVG--DWTVRIWSEDV----IASPLLSLDSSPDYVTDVAWSPTRPAVFATVDGDGNLDIWDLL  472 (555)
T ss_pred             cceEeeecCCCccceeeeec--cceeEeccccC----CCCcchhhhhccceeeeeEEcCcCceEEEEEcCCCceehhhhh
Confidence            35778999999999999999  55554443220    11122222  367899999999999999999866 99999999


Q ss_pred             CCCCCCcEEEEcCCCCeeEEEEEEecCCcEEEEEEecCCceEEEEec
Q 047869         1941 QDNISPLHYFTLPDDMIVDATLVIASRGKMFLIVLSECGSLYRLELS 1987 (2233)
Q Consensus      1941 ~D~lSPvyyF~LpsGkIrDaTfv~~e~G~~~ILVLSS~G~LY~Qels 1987 (2233)
                      .+..-|+-.-.+- +.+....++ ...| ..+.+-...|.+++-+++
T Consensus       473 ~~~~~Pv~s~~~~-~~~l~~~~~-s~~g-~~lavGd~~G~~~~~~l~  516 (555)
T KOG1587|consen  473 QDDEEPVLSQKVC-SPALTRVRW-SPNG-KLLAVGDANGTTHILKLS  516 (555)
T ss_pred             ccccCCccccccc-ccccceeec-CCCC-cEEEEecCCCcEEEEEcC
Confidence            9999998876555 344444444 3445 588889999999988885


No 60 
>PF04053 Coatomer_WDAD:  Coatomer WD associated region ;  InterPro: IPR006692 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits.  This entry represents the WD-associated region found in coatomer subunits alpha, beta and beta' subunits. 
The alpha-subunit (RET1P) of the coatomer complex in Saccharomyces cerevisiae (Baker's yeast), participates in membrane transport between the endoplasmic reticulum and Golgi apparatus. The protein contains six WD-40 repeat motifs in its N-terminal region. More information about these proteins can be found at Protein of the Month: Clathrin.; GO: 0005198 structural molecule activity, 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030117 membrane coat; PDB: 3MKQ_B.
Probab=60.33  E-value=71  Score=41.11  Aligned_cols=37  Identities=22%  Similarity=0.280  Sum_probs=24.9

Q ss_pred             cccccccccccceEEEEeecccCccceEEeecccceEEEE
Q 047869         1852 TNVKPLSRNIVRFEIVHLAFNSIVENYLTVAGYEDCQVLT 1891 (2233)
Q Consensus      1852 lTLtrLSsa~VpFeVlsLafNP~nEdyLAVcGLkDC~VLT 1891 (2233)
                      +.++.+++..+.  .-+|.+|| |+++++|||-.+-.|+|
T Consensus        23 l~~k~lg~~~~~--p~~ls~np-ngr~v~V~g~geY~iyt   59 (443)
T PF04053_consen   23 LSVKELGSCEIY--PQSLSHNP-NGRFVLVCGDGEYEIYT   59 (443)
T ss_dssp             ---EEEEE-SS----SEEEE-T-TSSEEEEEETTEEEEEE
T ss_pred             EEeccCCCCCcC--CeeEEECC-CCCEEEEEcCCEEEEEE
Confidence            334445544333  55888899 99999999999999999


No 61 
>KOG1539 consensus WD repeat protein [General function prediction only]
Probab=57.38  E-value=7.4e+02  Score=34.86  Aligned_cols=196  Identities=17%  Similarity=0.191  Sum_probs=118.9

Q ss_pred             ccccccccccccccceEEEEeecccCccceEEeecccceEEEEecCCCceeeee------eeeeccC-CceEEEeEEecC
Q 047869         1849 ADKTNVKPLSRNIVRFEIVHLAFNSIVENYLTVAGYEDCQVLTLNPRGEVTDRL------AIELALQ-GAYIRRVDWVPG 1921 (2233)
Q Consensus      1849 kdKlTLtrLSsa~VpFeVlsLafNP~nEdyLAVcGLkDC~VLTfss~GeV~DRL------~LeL~Le-g~fIIKa~WLPG 1921 (2233)
                      ..|+++--.| .++|=+|..|+.   ..+|--|+-.+++.+++.   |.++.|.      .|++.+. |.          
T Consensus        63 ~~kl~ll~vs-~~lp~~I~alas---~~~~vy~A~g~~i~~~~r---gk~i~~~~~~~~a~v~~l~~fGe----------  125 (910)
T KOG1539|consen   63 VNKLNLLFVS-KPLPDKITALAS---DKDYVYVASGNKIYAYAR---GKHIRHTTLLHGAKVHLLLPFGE----------  125 (910)
T ss_pred             ccceEEEEec-CCCCCceEEEEe---cCceEEEecCcEEEEEEc---cceEEEEeccccceEEEEeeecc----------
Confidence            3556665555 688889999977   889999999999999876   4334432      2222222 33          


Q ss_pred             CCceEEEEecCeEEEEeCcCCCCCCcEE---EEcCCCC-eeEEEEEEec-CCcEEEEEEecCCceEEEEecccCCCcccc
Q 047869         1922 SPVQLMVVTNKFVKIYDLSQDNISPLHY---FTLPDDM-IVDATLVIAS-RGKMFLIVLSECGSLYRLELSVEGNVGATP 1996 (2233)
Q Consensus      1922 SQt~LAVVT~~FVKIYDLS~D~lSPvyy---F~LpsGk-IrDaTfv~~e-~G~~~ILVLSS~G~LY~Qels~s~d~g~~~ 1996 (2233)
                        ..+|+.+.+-+.||+.+.. ..|.|.   |.=-+|+ |+.   +..+ .=-+-|+|-|++|.+-.-++...--.  +.
T Consensus       126 --~lia~d~~~~l~vw~~s~~-~~e~~l~~~~~~~~~~~Ita---l~HP~TYLNKIvvGs~~G~lql~Nvrt~K~v--~~  197 (910)
T KOG1539|consen  126 --HLIAVDISNILFVWKTSSI-QEELYLQSTFLKVEGDFITA---LLHPSTYLNKIVVGSSQGRLQLWNVRTGKVV--YT  197 (910)
T ss_pred             --eEEEEEccCcEEEEEeccc-cccccccceeeeccCCceee---EecchhheeeEEEeecCCcEEEEEeccCcEE--EE
Confidence              3589999999999999886 444332   3333444 332   2222 22235677789998887777632111  11


Q ss_pred             ceeeeecccccccCCeEEEEeccccceeeEEecCCcEEEEEcCCCcccccceeEEEEccCCCCCCCcccceeeccCCCce
Q 047869         1997 LKEIIQFNDREIHAKGLSLYFSSTYKLLFLSFQDGTTLVGRLSPNAASLSEVSYVFEEQDGKLRSAGLHRWKELLASSGL 2076 (2233)
Q Consensus      1997 ltEvvq~~~~q~~~~GVSVyYS~tl~LLF~SY~~G~Sf~a~Ls~~~~sv~eis~Vfe~~~gk~~~a~L~qWsEV~~hPGL 2076 (2233)
                      ..++        ..+=-++-=|+.++.+=+...+|+..+.++..+.     +-..|..+           |       |=
T Consensus       198 f~~~--------~s~IT~ieqsPaLDVVaiG~~~G~ViifNlK~dk-----il~sFk~d-----------~-------g~  246 (910)
T KOG1539|consen  198 FQEF--------FSRITAIEQSPALDVVAIGLENGTVIIFNLKFDK-----ILMSFKQD-----------W-------GR  246 (910)
T ss_pred             eccc--------ccceeEeccCCcceEEEEeccCceEEEEEcccCc-----EEEEEEcc-----------c-------cc
Confidence            1111        1112234557779999999999999999986553     22223221           3       33


Q ss_pred             EEEEeccCCCceEEEEecCCc-eee
Q 047869         2077 FFCFSSLKSNAAVAVSLGTNE-LIA 2100 (2233)
Q Consensus      2077 f~cls~~~sn~pvvv~l~pd~-I~i 2100 (2233)
                      +..+|-.++|+|+...=++.. +.+
T Consensus       247 VtslSFrtDG~p~las~~~~G~m~~  271 (910)
T KOG1539|consen  247 VTSLSFRTDGNPLLASGRSNGDMAF  271 (910)
T ss_pred             eeEEEeccCCCeeEEeccCCceEEE
Confidence            466677777777777766633 444


No 62 
>KOG0266 consensus WD40 repeat-containing protein [General function prediction only]
Probab=57.23  E-value=5.3e+02  Score=33.13  Aligned_cols=241  Identities=15%  Similarity=0.133  Sum_probs=125.8

Q ss_pred             eecccCccceEEeecccceEEEEecC-CCceeeeeeeeeccCCceEEEeEEecCCCceEEEEecCeEEEEeCcCCCCCCc
Q 047869         1869 LAFNSIVENYLTVAGYEDCQVLTLNP-RGEVTDRLAIELALQGAYIRRVDWVPGSPVQLMVVTNKFVKIYDLSQDNISPL 1947 (2233)
Q Consensus      1869 LafNP~nEdyLAVcGLkDC~VLTfss-~GeV~DRL~LeL~Leg~fIIKa~WLPGSQt~LAVVT~~FVKIYDLS~D~lSPv 1947 (2233)
                      +.|.| ++++|| .+-.+-.|..++. .+.  +++..++.-....|..+.|=|..+..+...--..+||||+ .+...=+
T Consensus       165 ~~fs~-~g~~l~-~~~~~~~i~~~~~~~~~--~~~~~~l~~h~~~v~~~~fs~d~~~l~s~s~D~tiriwd~-~~~~~~~  239 (456)
T KOG0266|consen  165 VDFSP-DGRALA-AASSDGLIRIWKLEGIK--SNLLRELSGHTRGVSDVAFSPDGSYLLSGSDDKTLRIWDL-KDDGRNL  239 (456)
T ss_pred             EEEcC-CCCeEE-EccCCCcEEEeeccccc--chhhccccccccceeeeEECCCCcEEEEecCCceEEEeec-cCCCeEE
Confidence            66766 666644 4444444444444 221  1222233222667999999999984333333349999999 3333333


Q ss_pred             EEEEcCCCCeeEEEEEEecCCcEEEEEEecCCceEEEEecccCCCccccceeeeecccccccCCeEEEEeccccceeeEE
Q 047869         1948 HYFTLPDDMIVDATLVIASRGKMFLIVLSECGSLYRLELSVEGNVGATPLKEIIQFNDREIHAKGLSLYFSSTYKLLFLS 2027 (2233)
Q Consensus      1948 yyF~LpsGkIrDaTfv~~e~G~~~ILVLSS~G~LY~Qels~s~d~g~~~ltEvvq~~~~q~~~~GVSVyYS~tl~LLF~S 2027 (2233)
                      ..+.=-+..|.+++|  ...| ..++--|.+|.+++=++.. +     .....+  .+.  .++..++.++.+-+.|-.+
T Consensus       240 ~~l~gH~~~v~~~~f--~p~g-~~i~Sgs~D~tvriWd~~~-~-----~~~~~l--~~h--s~~is~~~f~~d~~~l~s~  306 (456)
T KOG0266|consen  240 KTLKGHSTYVTSVAF--SPDG-NLLVSGSDDGTVRIWDVRT-G-----ECVRKL--KGH--SDGISGLAFSPDGNLLVSA  306 (456)
T ss_pred             EEecCCCCceEEEEe--cCCC-CEEEEecCCCcEEEEeccC-C-----eEEEee--ecc--CCceEEEEECCCCCEEEEc
Confidence            444434556766664  4677 7888999999999888873 1     111111  111  2234447888888887777


Q ss_pred             ecCCcEEEEEcCCCcccccceeEEEEccCCCCC-CCcccceeeccCCCceEEEEeccCCCceEEEEecCCceeeeccccc
Q 047869         2028 FQDGTTLVGRLSPNAASLSEVSYVFEEQDGKLR-SAGLHRWKELLASSGLFFCFSSLKSNAAVAVSLGTNELIAQNMRHA 2106 (2233)
Q Consensus      2028 Y~~G~Sf~a~Ls~~~~sv~eis~Vfe~~~gk~~-~a~L~qWsEV~~hPGLf~cls~~~sn~pvvv~l~pd~I~iQeiK~~ 2106 (2233)
                      =.+|...+--+....  ..   .+.+..+.... +-..++|+    +.|.+.-...  .++-+.+.--...-.++..+-.
T Consensus       307 s~d~~i~vwd~~~~~--~~---~~~~~~~~~~~~~~~~~~fs----p~~~~ll~~~--~d~~~~~w~l~~~~~~~~~~~~  375 (456)
T KOG0266|consen  307 SYDGTIRVWDLETGS--KL---CLKLLSGAENSAPVTSVQFS----PNGKYLLSAS--LDRTLKLWDLRSGKSVGTYTGH  375 (456)
T ss_pred             CCCccEEEEECCCCc--ee---eeecccCCCCCCceeEEEEC----CCCcEEEEec--CCCeEEEEEccCCcceeeeccc
Confidence            446666554442221  00   11111111111 22233333    4455444222  3444444332222222222221


Q ss_pred             cCCCCCeEEEEEeecCC-CCCeEEEEEeeCCceeEEec
Q 047869         2107 AGSTSPLVGVTAYKPLS-KDKVHCLVLHDDGSLQIYSH 2143 (2233)
Q Consensus      2107 ~~sSs~vdgva~y~p~s-~~rttlLLLcEDGSLrIYsa 2143 (2233)
                      ..  .   ...++++.. .+...++.=.+||++.+|.-
T Consensus       376 ~~--~---~~~~~~~~~~~~~~~i~sg~~d~~v~~~~~  408 (456)
T KOG0266|consen  376 SN--L---VRCIFSPTLSTGGKLIYSGSEDGSVYVWDS  408 (456)
T ss_pred             CC--c---ceeEecccccCCCCeEEEEeCCceEEEEeC
Confidence            11  1   134555553 35666788999999999983


No 63 
>KOG0277 consensus Peroxisomal targeting signal type 2 receptor [Intracellular trafficking, secretion, and vesicular transport]
Probab=56.82  E-value=38  Score=41.17  Aligned_cols=93  Identities=20%  Similarity=0.396  Sum_probs=61.8

Q ss_pred             eEEEEeecccCccceEEee--------cccceEEEEecCCCceeeeeeeeeccCCceEEEeEEecCCCceEEEEecC-eE
Q 047869         1864 FEIVHLAFNSIVENYLTVA--------GYEDCQVLTLNPRGEVTDRLAIELALQGAYIRRVDWVPGSPVQLMVVTNK-FV 1934 (2233)
Q Consensus      1864 FeVlsLafNP~nEdyLAVc--------GLkDC~VLTfss~GeV~DRL~LeL~Leg~fIIKa~WLPGSQt~LAVVT~~-FV 1934 (2233)
                      |-=-++.|+|-.+++|||+        |--.-+||.++..+.+.+-...+. .+|-|  .+.|-+....+++++.-+ .+
T Consensus         9 f~GysvqfSPf~~nrLavAt~q~yGl~G~G~L~ile~~~~~gi~e~~s~d~-~D~Lf--dV~Wse~~e~~~~~a~GDGSL   85 (311)
T KOG0277|consen    9 FHGYSVQFSPFVENRLAVATAQHYGLAGNGRLFILEVTDPKGIQECQSYDT-EDGLF--DVAWSENHENQVIAASGDGSL   85 (311)
T ss_pred             cccceeEecccccchhheeehhhcccccCceEEEEecCCCCCeEEEEeeec-cccee--EeeecCCCcceEEEEecCceE
Confidence            4456899999999999996        555667777763333332222111 12444  888999998888777766 99


Q ss_pred             EEEeCcCCCCCCcEEEEcCCCCeeEE
Q 047869         1935 KIYDLSQDNISPLHYFTLPDDMIVDA 1960 (2233)
Q Consensus      1935 KIYDLS~D~lSPvyyF~LpsGkIrDa 1960 (2233)
                      ||||+...+ .|.+-|.=-.-.|-.+
T Consensus        86 rl~d~~~~s-~Pi~~~kEH~~EV~Sv  110 (311)
T KOG0277|consen   86 RLFDLTMPS-KPIHKFKEHKREVYSV  110 (311)
T ss_pred             EEeccCCCC-cchhHHHhhhhheEEe
Confidence            999977665 4777665444444433


No 64 
>smart00336 BBOX B-Box-type zinc finger.
Probab=55.52  E-value=7.7  Score=33.14  Aligned_cols=30  Identities=40%  Similarity=0.825  Sum_probs=25.3

Q ss_pred             ccceEeeccCCCCCCceeehhhhhhhcCCCcEEE
Q 047869         1602 EQHWYFCYTCDLTVSKGCCSVCAKVCHRGHRVVY 1635 (2233)
Q Consensus      1602 ~Q~~Y~C~TC~l~~~~GVC~aCA~vCHkGHdVvy 1635 (2233)
                      +..+|.|.||..    .+|..|...-|+||.++.
T Consensus        12 ~~~~~~C~~c~~----~iC~~C~~~~H~~H~~~~   41 (42)
T smart00336       12 EPAEFFCEECGA----LLCRTCDEAEHRGHTVVL   41 (42)
T ss_pred             CceEEECCCCCc----ccccccChhhcCCCceec
Confidence            444888999886    799999988999999864


No 65 
>PF00780 CNH:  CNH domain;  InterPro: IPR001180 Based on sequence similarities a domain of homology has been identified in the following proteins:  Citron and Citron kinase. These two proteins interact with the GTP-bound forms of the small GTPases Rho and Rac but not with Cdc42. Myotonic dystrophy kinase-related Cdc42-binding kinase (MRCKalpha). This serine/threonine kinase interacts with the GTP-bound form of the small GTPase Cdc42 and to a lesser extent with that of Rac. NCK Interacting Kinase (NIK), a serine/threonine protein kinase. ROM-1 and ROM-2, from yeast. These proteins are GDP/GTP exchange proteins (GEPs) for the small GTP binding protein Rho1.  This domain, called the citron homology domain, is often found after cysteine rich and pleckstrin homology (PH) domains at the C-terminal end of the proteins. It acts as a regulatory domain and could be involved in macromolecular interactions.; GO: 0005083 small GTPase regulator activity
Probab=55.29  E-value=3.9e+02  Score=30.99  Aligned_cols=142  Identities=21%  Similarity=0.247  Sum_probs=88.8

Q ss_pred             cccCceEEEe-eCCeEEEEechhhhcccccCCccccccccccccccccceEEEEeecccCccceEEeecccceEEEEecC
Q 047869         1816 VSSRGRLAVG-EGDKVAIFDVGQLIGQATIQPVTADKTNVKPLSRNIVRFEIVHLAFNSIVENYLTVAGYEDCQVLTLNP 1894 (2233)
Q Consensus      1816 as~rGrLAVa-EgdKVTILqlsaLLkQad~s~~skdKlTLtrLSsa~VpFeVlsLafNP~nEdyLAVcGLkDC~VLTfss 1894 (2233)
                      +..+.+|+|+ |+| +-+++.    ++.      .....+..+.+      |.+|+..|.-+-.|+++| +..+++.++.
T Consensus         4 ~~~~~~L~vGt~~G-l~~~~~----~~~------~~~~~i~~~~~------I~ql~vl~~~~~llvLsd-~~l~~~~L~~   65 (275)
T PF00780_consen    4 DSWGDRLLVGTEDG-LYVYDL----SDP------SKPTRILKLSS------ITQLSVLPELNLLLVLSD-GQLYVYDLDS   65 (275)
T ss_pred             ccCCCEEEEEECCC-EEEEEe----cCC------ccceeEeecce------EEEEEEecccCEEEEEcC-CccEEEEchh
Confidence            4557789998 655 777777    111      12222222222      999999999999999999 8888888744


Q ss_pred             CCceeeee----------eeeecc-CCceEEE-eEEecCCCceEEEEecCeEEEEeCcCC--CC-CCcEEEEcCCCCeeE
Q 047869         1895 RGEVTDRL----------AIELAL-QGAYIRR-VDWVPGSPVQLMVVTNKFVKIYDLSQD--NI-SPLHYFTLPDDMIVD 1959 (2233)
Q Consensus      1895 ~GeV~DRL----------~LeL~L-eg~fIIK-a~WLPGSQt~LAVVT~~FVKIYDLS~D--~l-SPvyyF~LpsGkIrD 1959 (2233)
                      =.....+-          ...+.. .|....+ .... .....|+|+....|.||....+  .. ....+|.+| +.+.+
T Consensus        66 l~~~~~~~~~~~~~~~~~~~~~~~~~~v~~f~~~~~~-~~~~~L~va~kk~i~i~~~~~~~~~f~~~~ke~~lp-~~~~~  143 (275)
T PF00780_consen   66 LEPVSTSAPLAFPKSRSLPTKLPETKGVSFFAVNGGH-EGSRRLCVAVKKKILIYEWNDPRNSFSKLLKEISLP-DPPSS  143 (275)
T ss_pred             hccccccccccccccccccccccccCCeeEEeecccc-ccceEEEEEECCEEEEEEEECCcccccceeEEEEcC-CCcEE
Confidence            33333211          011111 1322222 2333 3446689999999999999985  33 578889999 58899


Q ss_pred             EEEEEecCCcEEEEEEecCCceE
Q 047869         1960 ATLVIASRGKMFLIVLSECGSLY 1982 (2233)
Q Consensus      1960 aTfv~~e~G~~~ILVLSS~G~LY 1982 (2233)
                      .++.    + ..++|.++.||..
T Consensus       144 i~~~----~-~~i~v~~~~~f~~  161 (275)
T PF00780_consen  144 IAFL----G-NKICVGTSKGFYL  161 (275)
T ss_pred             EEEe----C-CEEEEEeCCceEE
Confidence            9877    3 3455555666443


No 66 
>PF08596 Lgl_C:  Lethal giant larvae(Lgl) like, C-terminal;  InterPro: IPR013905  The Lethal giant larvae (Lgl) tumour suppressor protein is conserved from yeast to mammals. The Lgl protein functions in cell polarity, at least in part, by regulating SNARE-mediated membrane delivery events at the cell surface. The N-terminal half of Lgl members contains WD40 repeats (see IPR001680 from INTERPRO), while the C-terminal half appears specific to the protein. ; PDB: 2OAJ_A.
Probab=51.84  E-value=4.3e+02  Score=33.89  Aligned_cols=157  Identities=18%  Similarity=0.212  Sum_probs=76.8

Q ss_pred             EEEEeecccCccceEEeecccceEEEEecCCCceeeeeeeeeccCCceE-EEeEEecCCCceEEEEecCeEEEEeCcCCC
Q 047869         1865 EIVHLAFNSIVENYLTVAGYEDCQVLTLNPRGEVTDRLAIELALQGAYI-RRVDWVPGSPVQLMVVTNKFVKIYDLSQDN 1943 (2233)
Q Consensus      1865 eVlsLafNP~nEdyLAVcGLkDC~VLTfss~GeV~DRL~LeL~Leg~fI-IKa~WLPGSQt~LAVVT~~FVKIYDLS~D~ 1943 (2233)
                      +|.+|.|.+....+.+.-.--|+-|++|+.+..-..+ ...+.++-.+- ....|-|+          .-+.|-|.+..+
T Consensus         3 ~v~~vs~a~~t~Elav~~~~GeVv~~k~~~n~~~~~~-~~~~~~~~~~~~~~~~~~~~----------~l~di~~r~~~~   71 (395)
T PF08596_consen    3 SVTHVSFAPETLELAVGLESGEVVLFKFGKNQNYGNR-EQPPDLDYNFRRFSLNNSPG----------KLTDISDRAPPS   71 (395)
T ss_dssp             -EEEEEEETTTTEEEEEETTS-EEEEEEEE-------------------S--GGGSS-----------SEEE-GGG--TT
T ss_pred             eEEEEEecCCCceEEEEccCCcEEEEEcccCCCCCcc-CCCcccCcccccccccCCCc----------ceEEehhhCCcc
Confidence            5677888888888988888899999999766443311 11111110000 00000011          123334433333


Q ss_pred             ----CCCcEEEEcCCCCeeEEEEEEecCCcEEEEEEecCCceEEEEecccCCCccccceeeeec--ccccccCCeEEEEe
Q 047869         1944 ----ISPLHYFTLPDDMIVDATLVIASRGKMFLIVLSECGSLYRLELSVEGNVGATPLKEIIQF--NDREIHAKGLSLYF 2017 (2233)
Q Consensus      1944 ----lSPvyyF~LpsGkIrDaTfv~~e~G~~~ILVLSS~G~LY~Qels~s~d~g~~~ltEvvq~--~~~q~~~~GVSVyY 2017 (2233)
                          +-|.+.+-.-.|.|.....  ..-|  ++-|-.++|.|-.=+++    ..++..++.+..  ........--++.|
T Consensus        72 ~~~gf~P~~l~~~~~g~vtal~~--S~iG--Fvaigy~~G~l~viD~R----GPavI~~~~i~~~~~~~~~~~~vt~ieF  143 (395)
T PF08596_consen   72 LKEGFLPLTLLDAKQGPVTALKN--SDIG--FVAIGYESGSLVVIDLR----GPAVIYNENIRESFLSKSSSSYVTSIEF  143 (395)
T ss_dssp             -SEEEEEEEEE---S-SEEEEEE---BTS--EEEEEETTSEEEEEETT----TTEEEEEEEGGG--T-SS----EEEEEE
T ss_pred             cccccCchhheeccCCcEeEEec--CCCc--EEEEEecCCcEEEEECC----CCeEEeeccccccccccccccCeeEEEE
Confidence                4588888888888877663  2335  77788888888777775    111222222222  11111111223433


Q ss_pred             c--------cccceeeEEecCCcEEEEEcCC
Q 047869         2018 S--------STYKLLFLSFQDGTTLVGRLSP 2040 (2233)
Q Consensus      2018 S--------~tl~LLF~SY~~G~Sf~a~Ls~ 2040 (2233)
                      +        ...-.||+.++.|+.++-++.+
T Consensus       144 ~vm~~~~D~ySSi~L~vGTn~G~v~~fkIlp  174 (395)
T PF08596_consen  144 SVMTLGGDGYSSICLLVGTNSGNVLTFKILP  174 (395)
T ss_dssp             EEEE-TTSSSEEEEEEEEETTSEEEEEEEEE
T ss_pred             EEEecCCCcccceEEEEEeCCCCEEEEEEec
Confidence            3        2346899999999999998865


No 67 
>KOG0289 consensus mRNA splicing factor [General function prediction only]
Probab=51.66  E-value=1.5e+02  Score=38.35  Aligned_cols=141  Identities=16%  Similarity=0.170  Sum_probs=80.5

Q ss_pred             cccccceecccCceEEEeeC---CeEEEEechhhhcccccCCccccccccccccccccceEEEEeecccCccceEEeecc
Q 047869         1808 SLVKSLLSVSSRGRLAVGEG---DKVAIFDVGQLIGQATIQPVTADKTNVKPLSRNIVRFEIVHLAFNSIVENYLTVAGY 1884 (2233)
Q Consensus      1808 q~iRqLLSas~rGrLAVaEg---dKVTILqlsaLLkQad~s~~skdKlTLtrLSsa~VpFeVlsLafNP~nEdyLAVcGL 1884 (2233)
                      .+=.-+|+++.+|..|....   ..+|+.+.              +          .-..++-+.+|-|  +-.|-+.|.
T Consensus       313 ~tgeYllsAs~d~~w~Fsd~~~g~~lt~vs~--------------~----------~s~v~~ts~~fHp--DgLifgtgt  366 (506)
T KOG0289|consen  313 PTGEYLLSASNDGTWAFSDISSGSQLTVVSD--------------E----------TSDVEYTSAAFHP--DGLIFGTGT  366 (506)
T ss_pred             cCCcEEEEecCCceEEEEEccCCcEEEEEee--------------c----------cccceeEEeeEcC--CceEEeccC
Confidence            33455788999999887632   22222211              0          0123455778877  456777888


Q ss_pred             cceEEEEec-CCCceeeeeeeeeccCCceEEEeEEecCCCceEEEEecCe-EEEEeCcCCCCCCcEEEEcCCCCeeEEEE
Q 047869         1885 EDCQVLTLN-PRGEVTDRLAIELALQGAYIRRVDWVPGSPVQLMVVTNKF-VKIYDLSQDNISPLHYFTLPDDMIVDATL 1962 (2233)
Q Consensus      1885 kDC~VLTfs-s~GeV~DRL~LeL~Leg~fIIKa~WLPGSQt~LAVVT~~F-VKIYDLS~D~lSPvyyF~LpsGkIrDaTf 1962 (2233)
                      .|-.|=.++ +++..+-+...|    ..-|.-+.. ...-+-||+++.|- ||+|||-++..=|  .|.+++++ .-.++
T Consensus       367 ~d~~vkiwdlks~~~~a~Fpgh----t~~vk~i~F-sENGY~Lat~add~~V~lwDLRKl~n~k--t~~l~~~~-~v~s~  438 (506)
T KOG0289|consen  367 PDGVVKIWDLKSQTNVAKFPGH----TGPVKAISF-SENGYWLATAADDGSVKLWDLRKLKNFK--TIQLDEKK-EVNSL  438 (506)
T ss_pred             CCceEEEEEcCCccccccCCCC----CCceeEEEe-ccCceEEEEEecCCeEEEEEehhhcccc--eeeccccc-cceeE
Confidence            876654442 222222222111    111222221 12223489999997 9999999999544  56778776 55566


Q ss_pred             EEecCCcEEEEEEecCCceEE
Q 047869         1963 VIASRGKMFLIVLSECGSLYR 1983 (2233)
Q Consensus      1963 v~~e~G~~~ILVLSS~G~LY~ 1983 (2233)
                      -++..|++.++- +++=++|.
T Consensus       439 ~fD~SGt~L~~~-g~~l~Vy~  458 (506)
T KOG0289|consen  439 SFDQSGTYLGIA-GSDLQVYI  458 (506)
T ss_pred             EEcCCCCeEEee-cceeEEEE
Confidence            667889755444 67667774


No 68 
>KOG1897 consensus Damage-specific DNA binding complex, subunit DDB1 [Replication, recombination and repair]
Probab=51.06  E-value=9.8e+02  Score=34.42  Aligned_cols=110  Identities=16%  Similarity=0.216  Sum_probs=69.1

Q ss_pred             EEEeecccCccceEEeecccceEEEEecCCCceeeeeeeeeccC--CceEEEeEEecCCCceEEEEecCeEEEEeCcCCC
Q 047869         1866 IVHLAFNSIVENYLTVAGYEDCQVLTLNPRGEVTDRLAIELALQ--GAYIRRVDWVPGSPVQLMVVTNKFVKIYDLSQDN 1943 (2233)
Q Consensus      1866 VlsLafNP~nEdyLAVcGLkDC~VLTfss~GeV~DRL~LeL~Le--g~fIIKa~WLPGSQt~LAVVT~~FVKIYDLS~D~ 1943 (2233)
                      .++--+++..++||.|.=..|=.||.++..  +.+-.-..+.-+  ..|   +-=+-|.|  |+=||+.+|++||=+   
T Consensus       409 ~lk~~v~~~~d~ylvlsf~~eTrvl~i~~e--~ee~~~~gf~~~~~Tif---~S~i~g~~--lvQvTs~~iRl~ss~---  478 (1096)
T KOG1897|consen  409 SLKSMVDENYDNYLVLSFISETRVLNISEE--VEETEDPGFSTDEQTIF---CSTINGNQ--LVQVTSNSIRLVSSA---  478 (1096)
T ss_pred             EeeccccccCCcEEEEEeccceEEEEEccc--eEEeccccccccCceEE---EEccCCce--EEEEecccEEEEcch---
Confidence            334447788888999999999999999877  333322222221  223   22333443  788999999999976   


Q ss_pred             CCCcEEEEcCCCCeeEEEEEEecCCcEEEEEEecCCceEEEEecccC
Q 047869         1944 ISPLHYFTLPDDMIVDATLVIASRGKMFLIVLSECGSLYRLELSVEG 1990 (2233)
Q Consensus      1944 lSPvyyF~LpsGkIrDaTfv~~e~G~~~ILVLSS~G~LY~Qels~s~ 1990 (2233)
                        -.-.+-.|++++.=.....   ....|+|...+|.+|+.++.+.+
T Consensus       479 --~~~~~W~~p~~~ti~~~~~---n~sqVvvA~~~~~l~y~~i~~~~  520 (1096)
T KOG1897|consen  479 --GLRSEWRPPGKITIGVVSA---NASQVVVAGGGLALFYLEIEDGG  520 (1096)
T ss_pred             --hhhhcccCCCceEEEEEee---cceEEEEecCccEEEEEEeeccc
Confidence              1223445555554333222   23478888888888888776443


No 69 
>KOG0266 consensus WD40 repeat-containing protein [General function prediction only]
Probab=50.37  E-value=6.7e+02  Score=32.26  Aligned_cols=162  Identities=18%  Similarity=0.188  Sum_probs=103.1

Q ss_pred             cccceEEEEeecccCccceEEeecccceEEEEecCCCceeeeeeeeeccC-CceEEEeEEecCCCceEEEEecC-eEEEE
Q 047869         1860 NIVRFEIVHLAFNSIVENYLTVAGYEDCQVLTLNPRGEVTDRLAIELALQ-GAYIRRVDWVPGSPVQLMVVTNK-FVKIY 1937 (2233)
Q Consensus      1860 a~VpFeVlsLafNP~nEdyLAVcGLkDC~VLTfss~GeV~DRL~LeL~Le-g~fIIKa~WLPGSQt~LAVVT~~-FVKIY 1937 (2233)
                      ..=...|-.++|.|...  ..|.|-.|-.|-.+...   .+...+.--.. .+||..+.+-|.+ .+++-...| -||||
T Consensus       200 ~~h~~~v~~~~fs~d~~--~l~s~s~D~tiriwd~~---~~~~~~~~l~gH~~~v~~~~f~p~g-~~i~Sgs~D~tvriW  273 (456)
T KOG0266|consen  200 SGHTRGVSDVAFSPDGS--YLLSGSDDKTLRIWDLK---DDGRNLKTLKGHSTYVTSVAFSPDG-NLLVSGSDDGTVRIW  273 (456)
T ss_pred             cccccceeeeEECCCCc--EEEEecCCceEEEeecc---CCCeEEEEecCCCCceEEEEecCCC-CEEEEecCCCcEEEE
Confidence            33456788999988665  66677777776666441   22122222223 6789999999999 666666666 99999


Q ss_pred             eCcCCCCCCcEEEEcCCCCeeEEEEEEecCCcEEEEEEecCCceEEEEecccCCCccccceeeeecccccccCCeEEEEe
Q 047869         1938 DLSQDNISPLHYFTLPDDMIVDATLVIASRGKMFLIVLSECGSLYRLELSVEGNVGATPLKEIIQFNDREIHAKGLSLYF 2017 (2233)
Q Consensus      1938 DLS~D~lSPvyyF~LpsGkIrDaTfv~~e~G~~~ILVLSS~G~LY~Qels~s~d~g~~~ltEvvq~~~~q~~~~GVSVyY 2017 (2233)
                      |+-.  -.++--+..-++.|..+.|  ..+|+ .++..|.+|.|.+=++......   -+.+.......  . ---++.+
T Consensus       274 d~~~--~~~~~~l~~hs~~is~~~f--~~d~~-~l~s~s~d~~i~vwd~~~~~~~---~~~~~~~~~~~--~-~~~~~~f  342 (456)
T KOG0266|consen  274 DVRT--GECVRKLKGHSDGISGLAF--SPDGN-LLVSASYDGTIRVWDLETGSKL---CLKLLSGAENS--A-PVTSVQF  342 (456)
T ss_pred             eccC--CeEEEeeeccCCceEEEEE--CCCCC-EEEEcCCCccEEEEECCCCcee---eeecccCCCCC--C-ceeEEEE
Confidence            9988  3455555556777776664  46675 5555588999998887733211   11111111111  0 2456888


Q ss_pred             ccccceeeEEecCCcEEEEEc
Q 047869         2018 SSTYKLLFLSFQDGTTLVGRL 2038 (2233)
Q Consensus      2018 S~tl~LLF~SY~~G~Sf~a~L 2038 (2233)
                      |+.-..|+..+.+++-=+=.+
T Consensus       343 sp~~~~ll~~~~d~~~~~w~l  363 (456)
T KOG0266|consen  343 SPNGKYLLSASLDRTLKLWDL  363 (456)
T ss_pred             CCCCcEEEEecCCCeEEEEEc
Confidence            999999999988877655444


No 70 
>KOG0290 consensus Conserved WD40 repeat-containing protein AN11 [Function unknown]
Probab=50.26  E-value=44  Score=41.24  Aligned_cols=117  Identities=23%  Similarity=0.378  Sum_probs=77.8

Q ss_pred             CceEE-EeeCCeEEEEechhhhcccccCCccccccccccccccccceEEEEeecccCccceEEeecccceEEEEecCC--
Q 047869         1819 RGRLA-VGEGDKVAIFDVGQLIGQATIQPVTADKTNVKPLSRNIVRFEIVHLAFNSIVENYLTVAGYEDCQVLTLNPR-- 1895 (2233)
Q Consensus      1819 rGrLA-VaEgdKVTILqlsaLLkQad~s~~skdKlTLtrLSsa~VpFeVlsLafNP~nEdyLAVcGLkDC~VLTfss~-- 1895 (2233)
                      ++..| |+..|.|-+|+++.|=--.-   .+.+--+         .-..++++.|+...+|+|-.+-.-+.|..+.=+  
T Consensus       209 ~~~FASvgaDGSvRmFDLR~leHSTI---IYE~p~~---------~~pLlRLswnkqDpnymATf~~dS~~V~iLDiR~P  276 (364)
T KOG0290|consen  209 RDVFASVGADGSVRMFDLRSLEHSTI---IYEDPSP---------STPLLRLSWNKQDPNYMATFAMDSNKVVILDIRVP  276 (364)
T ss_pred             cceEEEecCCCcEEEEEecccccceE---EecCCCC---------CCcceeeccCcCCchHHhhhhcCCceEEEEEecCC
Confidence            33444 78999999999987643222   2333222         223478999999999999887776666555221  


Q ss_pred             CceeeeeeeeeccCCceEEEeEEecCCCceEEEEecC-eEEEEeCcCCCC----CCcEEEE
Q 047869         1896 GEVTDRLAIELALQGAYIRRVDWVPGSPVQLMVVTNK-FVKIYDLSQDNI----SPLHYFT 1951 (2233)
Q Consensus      1896 GeV~DRL~LeL~Leg~fIIKa~WLPGSQt~LAVVT~~-FVKIYDLS~D~l----SPvyyF~ 1951 (2233)
                      +..    +.+|+--++-+.-+.|-|+|..-|.-+..+ .+-||||.+-.-    .|..-|+
T Consensus       277 ~tp----va~L~~H~a~VNgIaWaPhS~~hictaGDD~qaliWDl~q~~~~~~~dPilay~  333 (364)
T KOG0290|consen  277 CTP----VARLRNHQASVNGIAWAPHSSSHICTAGDDCQALIWDLQQMPRENGEDPILAYT  333 (364)
T ss_pred             Ccc----eehhhcCcccccceEecCCCCceeeecCCcceEEEEecccccccCCCCchhhhh
Confidence            111    223333367788999999999999877777 788999987543    3444444


No 71 
>cd00021 BBOX B-Box-type zinc finger; zinc binding domain (CHC3H2); often present in combination with other motifs, like RING zinc finger, NHL motif, coiled-coil or RFP domain in functionally unrelated proteins, most likely mediating protein-protein interaction.
Probab=50.23  E-value=11  Score=31.89  Aligned_cols=29  Identities=34%  Similarity=0.454  Sum_probs=24.2

Q ss_pred             cceEeeccCCCCCCceeehhhhhhhcCCCcEEE
Q 047869         1603 QHWYFCYTCDLTVSKGCCSVCAKVCHRGHRVVY 1635 (2233)
Q Consensus      1603 Q~~Y~C~TC~l~~~~GVC~aCA~vCHkGHdVvy 1635 (2233)
                      ...|.|.+|..    .+|..|...=|+||.++.
T Consensus        10 ~~~~fC~~~~~----~iC~~C~~~~H~~H~~~~   38 (39)
T cd00021          10 PLSLFCETDRA----LLCVDCDLSVHSGHRRVP   38 (39)
T ss_pred             ceEEEeCccCh----hhhhhcChhhcCCCCEee
Confidence            44889999887    799999866699999875


No 72 
>PF02239 Cytochrom_D1:  Cytochrome D1 heme domain; PDB: 1NNO_B 1HZU_A 1N15_B 1N50_A 1GJQ_A 1BL9_B 1NIR_B 1N90_B 1HZV_A 1AOQ_A ....
Probab=50.20  E-value=6.3e+02  Score=31.91  Aligned_cols=132  Identities=21%  Similarity=0.337  Sum_probs=72.1

Q ss_pred             eeCCeEEEEechhhhcccccCCccccccccccccccccceEE-EEeecccCccceEEeecccceEEEEec-CCCceeeee
Q 047869         1825 GEGDKVAIFDVGQLIGQATIQPVTADKTNVKPLSRNIVRFEI-VHLAFNSIVENYLTVAGYEDCQVLTLN-PRGEVTDRL 1902 (2233)
Q Consensus      1825 aEgdKVTILqlsaLLkQad~s~~skdKlTLtrLSsa~VpFeV-lsLafNP~nEdyLAVcGLkDC~VLTfs-s~GeV~DRL 1902 (2233)
                      .+.++|+|++...                .+.+++-+.+..+ .+++|.| +++|+.|++ +|..|--++ .+++++.++
T Consensus        13 ~~~~~v~viD~~t----------------~~~~~~i~~~~~~h~~~~~s~-Dgr~~yv~~-rdg~vsviD~~~~~~v~~i   74 (369)
T PF02239_consen   13 RGSGSVAVIDGAT----------------NKVVARIPTGGAPHAGLKFSP-DGRYLYVAN-RDGTVSVIDLATGKVVATI   74 (369)
T ss_dssp             GGGTEEEEEETTT-----------------SEEEEEE-STTEEEEEE-TT--SSEEEEEE-TTSEEEEEETTSSSEEEEE
T ss_pred             cCCCEEEEEECCC----------------CeEEEEEcCCCCceeEEEecC-CCCEEEEEc-CCCeEEEEECCcccEEEEE
Confidence            4678999888622                2223444444444 3456655 888999998 566666664 356677776


Q ss_pred             ee-------eeccCCceEEEeEEecCCCceEEEEecCeEEEEeCcCCCCCCcEEEEcCC-----CCeeEEEEEEecCCcE
Q 047869         1903 AI-------ELALQGAYIRRVDWVPGSPVQLMVVTNKFVKIYDLSQDNISPLHYFTLPD-----DMIVDATLVIASRGKM 1970 (2233)
Q Consensus      1903 ~L-------eL~Leg~fIIKa~WLPGSQt~LAVVT~~FVKIYDLS~D~lSPvyyF~Lps-----GkIrDaTfv~~e~G~~ 1970 (2233)
                      .+       ...-+|.|+.=+-|.|+           .|.|+|...  +.|+.......     ..=|-++++....+..
T Consensus        75 ~~G~~~~~i~~s~DG~~~~v~n~~~~-----------~v~v~D~~t--le~v~~I~~~~~~~~~~~~Rv~aIv~s~~~~~  141 (369)
T PF02239_consen   75 KVGGNPRGIAVSPDGKYVYVANYEPG-----------TVSVIDAET--LEPVKTIPTGGMPVDGPESRVAAIVASPGRPE  141 (369)
T ss_dssp             E-SSEEEEEEE--TTTEEEEEEEETT-----------EEEEEETTT----EEEEEE--EE-TTTS---EEEEEE-SSSSE
T ss_pred             ecCCCcceEEEcCCCCEEEEEecCCC-----------ceeEecccc--ccceeecccccccccccCCCceeEEecCCCCE
Confidence            44       22224556555555555           455666532  44444332221     1124566666666777


Q ss_pred             EEEEEecCCceEEEEec
Q 047869         1971 FLIVLSECGSLYRLELS 1987 (2233)
Q Consensus      1971 ~ILVLSS~G~LY~Qels 1987 (2233)
                      +++-+-+.|.|+.=+.+
T Consensus       142 fVv~lkd~~~I~vVdy~  158 (369)
T PF02239_consen  142 FVVNLKDTGEIWVVDYS  158 (369)
T ss_dssp             EEEEETTTTEEEEEETT
T ss_pred             EEEEEccCCeEEEEEec
Confidence            88888899999866555


No 73 
>PF00780 CNH:  CNH domain;  InterPro: IPR001180 Based on sequence similarities a domain of homology has been identified in the following proteins:  Citron and Citron kinase. These two proteins interact with the GTP-bound forms of the small GTPases Rho and Rac but not with Cdc42. Myotonic dystrophy kinase-related Cdc42-binding kinase (MRCKalpha). This serine/threonine kinase interacts with the GTP-bound form of the small GTPase Cdc42 and to a lesser extent with that of Rac. NCK Interacting Kinase (NIK), a serine/threonine protein kinase. ROM-1 and ROM-2, from yeast. These proteins are GDP/GTP exchange proteins (GEPs) for the small GTP binding protein Rho1.  This domain, called the citron homology domain, is often found after cysteine rich and pleckstrin homology (PH) domains at the C-terminal end of the proteins. It acts as a regulatory domain and could be involved in macromolecular interactions.; GO: 0005083 small GTPase regulator activity
Probab=49.77  E-value=4.7e+02  Score=30.33  Aligned_cols=132  Identities=18%  Similarity=0.173  Sum_probs=87.5

Q ss_pred             cccceecccCceEEEeeCCeEEEEechhhhcccccCCcccccc----ccccccccccceEEEEeecccCccceEEeeccc
Q 047869         1810 VKSLLSVSSRGRLAVGEGDKVAIFDVGQLIGQATIQPVTADKT----NVKPLSRNIVRFEIVHLAFNSIVENYLTVAGYE 1885 (2233)
Q Consensus      1810 iRqLLSas~rGrLAVaEgdKVTILqlsaLLkQad~s~~skdKl----TLtrLSsa~VpFeVlsLafNP~nEdyLAVcGLk 1885 (2233)
                      |+|+..+..-+.+.|--++.+.++++..+-......+..+.|.    ..-+..+..--|.   ..--+....+|+|+==+
T Consensus        38 I~ql~vl~~~~~llvLsd~~l~~~~L~~l~~~~~~~~~~~~~~~~~~~~~~~~~~v~~f~---~~~~~~~~~~L~va~kk  114 (275)
T PF00780_consen   38 ITQLSVLPELNLLLVLSDGQLYVYDLDSLEPVSTSAPLAFPKSRSLPTKLPETKGVSFFA---VNGGHEGSRRLCVAVKK  114 (275)
T ss_pred             EEEEEEecccCEEEEEcCCccEEEEchhhccccccccccccccccccccccccCCeeEEe---eccccccceEEEEEECC
Confidence            8888888888888877779999999999987765332222222    1122333333333   22346677888888888


Q ss_pred             ceEEEEecCCCceeeeeeeeeccCCceEEEeEEecCCCceEEEEecCeEEEEeCcCCCCCCcE
Q 047869         1886 DCQVLTLNPRGEVTDRLAIELALQGAYIRRVDWVPGSPVQLMVVTNKFVKIYDLSQDNISPLH 1948 (2233)
Q Consensus      1886 DC~VLTfss~GeV~DRL~LeL~Leg~fIIKa~WLPGSQt~LAVVT~~FVKIYDLS~D~lSPvy 1948 (2233)
                      ..+|+++..+..-..+..-|..++ .-+..+.|.   ...+.|.+.+-.-++|+...+..+.+
T Consensus       115 ~i~i~~~~~~~~~f~~~~ke~~lp-~~~~~i~~~---~~~i~v~~~~~f~~idl~~~~~~~l~  173 (275)
T PF00780_consen  115 KILIYEWNDPRNSFSKLLKEISLP-DPPSSIAFL---GNKICVGTSKGFYLIDLNTGSPSELL  173 (275)
T ss_pred             EEEEEEEECCcccccceeEEEEcC-CCcEEEEEe---CCEEEEEeCCceEEEecCCCCceEEe
Confidence            999999977421110233333343 457788899   33689999999999999976665554


No 74 
>KOG0315 consensus G-protein beta subunit-like protein (contains WD40 repeats) [General function prediction only]
Probab=47.16  E-value=6.6e+02  Score=31.29  Aligned_cols=198  Identities=18%  Similarity=0.219  Sum_probs=114.8

Q ss_pred             ceeccc-CceEEEeeCCeEEEEechhhhcccccCCccccccccccccccc-cceEEEEeecccCccceEEeecccceEEE
Q 047869         1813 LLSVSS-RGRLAVGEGDKVAIFDVGQLIGQATIQPVTADKTNVKPLSRNI-VRFEIVHLAFNSIVENYLTVAGYEDCQVL 1890 (2233)
Q Consensus      1813 LLSas~-rGrLAVaEgdKVTILqlsaLLkQad~s~~skdKlTLtrLSsa~-VpFeVlsLafNP~nEdyLAVcGLkDC~VL 1890 (2233)
                      .|++.+ ++.||++-..+|-++|++.-=-+              |+.+-. ..--|..+.| .+.++ --+-|=+||.|=
T Consensus        45 rLeiTpdk~~LAaa~~qhvRlyD~~S~np~--------------Pv~t~e~h~kNVtaVgF-~~dgr-WMyTgseDgt~k  108 (311)
T KOG0315|consen   45 RLEITPDKKDLAAAGNQHVRLYDLNSNNPN--------------PVATFEGHTKNVTAVGF-QCDGR-WMYTGSEDGTVK  108 (311)
T ss_pred             eEEEcCCcchhhhccCCeeEEEEccCCCCC--------------ceeEEeccCCceEEEEE-eecCe-EEEecCCCceEE
Confidence            455554 56689999999999998532111              111111 1133445555 33333 234455665554


Q ss_pred             EecCCCceeeeeeeeeccCCceEEEeEEecCCCceEEEEecC-eEEEEeCcCCCCCCcEEEEcCCCC--eeEEEEEEecC
Q 047869         1891 TLNPRGEVTDRLAIELALQGAYIRRVDWVPGSPVQLMVVTNK-FVKIYDLSQDNISPLHYFTLPDDM--IVDATLVIASR 1967 (2233)
Q Consensus      1891 Tfss~GeV~DRL~LeL~Leg~fIIKa~WLPGSQt~LAVVT~~-FVKIYDLS~D~lSPvyyF~LpsGk--IrDaTfv~~e~ 1967 (2233)
                      .+.=+--...|+   .+. ..-|..+. +--.|++|.+-++. .|+||||-.+..+   .=++|++.  |+..|+.  ++
T Consensus       109 IWdlR~~~~qR~---~~~-~spVn~vv-lhpnQteLis~dqsg~irvWDl~~~~c~---~~liPe~~~~i~sl~v~--~d  178 (311)
T KOG0315|consen  109 IWDLRSLSCQRN---YQH-NSPVNTVV-LHPNQTELISGDQSGNIRVWDLGENSCT---HELIPEDDTSIQSLTVM--PD  178 (311)
T ss_pred             EEeccCcccchh---ccC-CCCcceEE-ecCCcceEEeecCCCcEEEEEccCCccc---cccCCCCCcceeeEEEc--CC
Confidence            443332111111   000 12244443 34459999887776 9999999988654   23456554  6666644  77


Q ss_pred             CcEEEEEEecCCceEEEEecccCCCccccceeeeecccccccCCeEEEEeccccceeeEEecCCcEEEEEcCCC
Q 047869         1968 GKMFLIVLSECGSLYRLELSVEGNVGATPLKEIIQFNDREIHAKGLSLYFSSTYKLLFLSFQDGTTLVGRLSPN 2041 (2233)
Q Consensus      1968 G~~~ILVLSS~G~LY~Qels~s~d~g~~~ltEvvq~~~~q~~~~GVSVyYS~tl~LLF~SY~~G~Sf~a~Ls~~ 2041 (2233)
                      |. .+....+.|..|.=++--+.  ....+.-+-+.+-  -++-+.+.-||++-+.|--+=.+-+..+-+.+..
T Consensus       179 gs-ml~a~nnkG~cyvW~l~~~~--~~s~l~P~~k~~a--h~~~il~C~lSPd~k~lat~ssdktv~iwn~~~~  247 (311)
T KOG0315|consen  179 GS-MLAAANNKGNCYVWRLLNHQ--TASELEPVHKFQA--HNGHILRCLLSPDVKYLATCSSDKTVKIWNTDDF  247 (311)
T ss_pred             Cc-EEEEecCCccEEEEEccCCC--ccccceEhhheec--ccceEEEEEECCCCcEEEeecCCceEEEEecCCc
Confidence            85 66777899999988776422  2222222222211  1556899999999998888888888777766443


No 75 
>PF04841 Vps16_N:  Vps16, N-terminal region;  InterPro: IPR006926 This protein forms part of the Class C vacuolar protein sorting (Vps) complex. Vps16 is essential for vacuolar protein sorting, which is essential for viability in plants, but not yeast. The Class C Vps complex is required for SNARE-mediated membrane fusion at the lysosome-like yeast vacuole. It is thought to play essential roles in membrane docking and fusion at the Golgi-to-endosome and endosome-to-vacuole stages of transport. The role of VPS16 in this complex is not known.; GO: 0006886 intracellular protein transport, 0005737 cytoplasm
Probab=46.79  E-value=7.4e+02  Score=31.72  Aligned_cols=48  Identities=21%  Similarity=0.218  Sum_probs=34.8

Q ss_pred             eEEEEeCcCCCCCCcEEEEcCCCCeeEEEEEEecCCcEEEEEEecCCceEEEEec
Q 047869         1933 FVKIYDLSQDNISPLHYFTLPDDMIVDATLVIASRGKMFLIVLSECGSLYRLELS 1987 (2233)
Q Consensus      1933 FVKIYDLS~D~lSPvyyF~LpsGkIrDaTfv~~e~G~~~ILVLSS~G~LY~Qels 1987 (2233)
                      .|+||+.+-..++   ......|+|.+..+-  .+  -.++|++++|.+++.++.
T Consensus        62 ~I~iys~sG~ll~---~i~w~~~~iv~~~wt--~~--e~LvvV~~dG~v~vy~~~  109 (410)
T PF04841_consen   62 SIQIYSSSGKLLS---SIPWDSGRIVGMGWT--DD--EELVVVQSDGTVRVYDLF  109 (410)
T ss_pred             EEEEECCCCCEeE---EEEECCCCEEEEEEC--CC--CeEEEEEcCCEEEEEeCC
Confidence            7999998887554   466667999988853  22  256677799998877654


No 76 
>KOG2048 consensus WD40 repeat protein [General function prediction only]
Probab=46.34  E-value=9.8e+02  Score=33.02  Aligned_cols=200  Identities=12%  Similarity=0.084  Sum_probs=115.3

Q ss_pred             cCceEEEeeCCeEEEEechhhhcccccCCccccccccccccccccceEEEEeecccCccceEEeecccceEEEEecCCC-
Q 047869         1818 SRGRLAVGEGDKVAIFDVGQLIGQATIQPVTADKTNVKPLSRNIVRFEIVHLAFNSIVENYLTVAGYEDCQVLTLNPRG- 1896 (2233)
Q Consensus      1818 ~rGrLAVaEgdKVTILqlsaLLkQad~s~~skdKlTLtrLSsa~VpFeVlsLafNP~nEdyLAVcGLkDC~VLTfss~G- 1896 (2233)
                      .+..+|++--..+.|+++     |.+. ...+.++..-|++..++--.+..+     ..+.|.+|-.....+.++--.+ 
T Consensus       393 dg~~Ia~st~~~~~iy~L-----~~~~-~vk~~~v~~~~~~~~~a~~i~fti-----d~~k~~~~s~~~~~le~~el~~p  461 (691)
T KOG2048|consen  393 DGNLIAISTVSRTKIYRL-----QPDP-NVKVINVDDVPLALLDASAISFTI-----DKNKLFLVSKNIFSLEEFELETP  461 (691)
T ss_pred             CCCEEEEeeccceEEEEe-----ccCc-ceeEEEeccchhhhccceeeEEEe-----cCceEEEEecccceeEEEEecCc
Confidence            456678887778888887     3332 122333444444444333333333     4566666666666666663332 


Q ss_pred             ceeeeeeeeeccCCceEEEeEEecCCCceEEEEecCeEEEEeCcCCCCCCcEEEEcCCCCeeEEEEEEecCCcEEEEEEe
Q 047869         1897 EVTDRLAIELALQGAYIRRVDWVPGSPVQLMVVTNKFVKIYDLSQDNISPLHYFTLPDDMIVDATLVIASRGKMFLIVLS 1976 (2233)
Q Consensus      1897 eV~DRL~LeL~Leg~fIIKa~WLPGSQt~LAVVT~~FVKIYDLS~D~lSPvyyF~LpsGkIrDaTfv~~e~G~~~ILVLS 1976 (2233)
                      +--+...+.++-.-.+|-++.=-|.-|..-|+.|..-|-||+|..-...+.-  ..+.-.++.+.|.  ......++|.+
T Consensus       462 s~kel~~~~~~~~~~~I~~l~~SsdG~yiaa~~t~g~I~v~nl~~~~~~~l~--~rln~~vTa~~~~--~~~~~~lvvat  537 (691)
T KOG2048|consen  462 SFKELKSIQSQAKCPSISRLVVSSDGNYIAAISTRGQIFVYNLETLESHLLK--VRLNIDVTAAAFS--PFVRNRLVVAT  537 (691)
T ss_pred             chhhhhccccccCCCcceeEEEcCCCCEEEEEeccceEEEEEcccceeecch--hccCcceeeeecc--ccccCcEEEEe
Confidence            2233334444433567888888888888556667779999999876633211  0222233333332  35667899999


Q ss_pred             cCCceEEEEecccC------CCccccceeeeecccccccCCeEEEEeccccceeeEEecCCcEEEEE
Q 047869         1977 ECGSLYRLELSVEG------NVGATPLKEIIQFNDREIHAKGLSLYFSSTYKLLFLSFQDGTTLVGR 2037 (2233)
Q Consensus      1977 S~G~LY~Qels~s~------d~g~~~ltEvvq~~~~q~~~~GVSVyYS~tl~LLF~SY~~G~Sf~a~ 2037 (2233)
                      +++++|..+|+...      ++-...-.+..+.++   +.-|.|.  .+.-..-|..|+.+-..+--
T Consensus       538 s~nQv~efdi~~~~l~~ws~~nt~nlpk~~~~l~~---~~~gisf--d~~n~s~~~~~~a~w~~~id  599 (691)
T KOG2048|consen  538 SNNQVFEFDIEARNLTRWSKNNTRNLPKEPKTLIP---GIPGISF--DPKNSSRFIVYDAHWSCLID  599 (691)
T ss_pred             cCCeEEEEecchhhhhhhhhccccccccChhhcCC---CCceEEe--CCCCccEEEEEcCcEEEEEe
Confidence            99999999995221      111111122222222   2235544  48888999999887666653


No 77 
>KOG0264 consensus Nucleosome remodeling factor, subunit CAF1/NURF55/MSI1 [Chromatin structure and dynamics]
Probab=45.57  E-value=2.8e+02  Score=35.98  Aligned_cols=156  Identities=17%  Similarity=0.164  Sum_probs=105.2

Q ss_pred             CceEE-EeeCCeEEEEechhhhcccccCCccccccccccccccccceEEEEeecccCccceEEeecccceEEEEecCCCc
Q 047869         1819 RGRLA-VGEGDKVAIFDVGQLIGQATIQPVTADKTNVKPLSRNIVRFEIVHLAFNSIVENYLTVAGYEDCQVLTLNPRGE 1897 (2233)
Q Consensus      1819 rGrLA-VaEgdKVTILqlsaLLkQad~s~~skdKlTLtrLSsa~VpFeVlsLafNP~nEdyLAVcGLkDC~VLTfss~Ge 1897 (2233)
                      -|+|+ -++.++|+++++++-=..        +|.-.++---....-.|..++|.|-+++.++-||-..|-++-=...+ 
T Consensus       190 ~g~Lls~~~d~~i~lwdi~~~~~~--------~~~~~p~~~~~~h~~~VeDV~~h~~h~~lF~sv~dd~~L~iwD~R~~-  260 (422)
T KOG0264|consen  190 EGTLLSGSDDHTICLWDINAESKE--------DKVVDPKTIFSGHEDVVEDVAWHPLHEDLFGSVGDDGKLMIWDTRSN-  260 (422)
T ss_pred             ceeEeeccCCCcEEEEeccccccC--------CccccceEEeecCCcceehhhccccchhhheeecCCCeEEEEEcCCC-
Confidence            35544 458999999999653322        11111122223334457899999999999999998777655332222 


Q ss_pred             eeeeeeeeeccC----CceEEEeEEecCCCceEEEEecC-eEEEEeCcCCCCCCcEEEEcCCCCeeEEEEEEecCCcEEE
Q 047869         1898 VTDRLAIELALQ----GAYIRRVDWVPGSPVQLMVVTNK-FVKIYDLSQDNISPLHYFTLPDDMIVDATLVIASRGKMFL 1972 (2233)
Q Consensus      1898 V~DRL~LeL~Le----g~fIIKa~WLPGSQt~LAVVT~~-FVKIYDLS~D~lSPvyyF~LpsGkIrDaTfv~~e~G~~~I 1972 (2233)
                           .-+++..    ++=|..+.|=|-+..-||-..++ .|+.|||-.=+. |.|.|.-..+.|--+.+-.  ...-++
T Consensus       261 -----~~~~~~~~~ah~~~vn~~~fnp~~~~ilAT~S~D~tV~LwDlRnL~~-~lh~~e~H~dev~~V~WSP--h~etvL  332 (422)
T KOG0264|consen  261 -----TSKPSHSVKAHSAEVNCVAFNPFNEFILATGSADKTVALWDLRNLNK-PLHTFEGHEDEVFQVEWSP--HNETVL  332 (422)
T ss_pred             -----CCCCcccccccCCceeEEEeCCCCCceEEeccCCCcEEEeechhccc-CceeccCCCcceEEEEeCC--CCCcee
Confidence                 2222221    56688999999998888877755 999999976665 9999999999998887543  233355


Q ss_pred             EEEecCCceEEEEecccCC
Q 047869         1973 IVLSECGSLYRLELSVEGN 1991 (2233)
Q Consensus      1973 LVLSS~G~LY~Qels~s~d 1991 (2233)
                      --.+++|.+-+=++++-+.
T Consensus       333 ASSg~D~rl~vWDls~ig~  351 (422)
T KOG0264|consen  333 ASSGTDRRLNVWDLSRIGE  351 (422)
T ss_pred             EecccCCcEEEEecccccc
Confidence            5666788888888875443


No 78 
>KOG1063 consensus RNA polymerase II elongator complex, subunit ELP2, WD repeat superfamily [Chromatin structure and dynamics; Transcription]
Probab=45.35  E-value=1.3e+02  Score=40.83  Aligned_cols=153  Identities=14%  Similarity=0.206  Sum_probs=88.6

Q ss_pred             eecccCceEEE-------eeCCeEEEEechhhhcccccCCccccccccccccccccceEEEEeecccCccceEEeecccc
Q 047869         1814 LSVSSRGRLAV-------GEGDKVAIFDVGQLIGQATIQPVTADKTNVKPLSRNIVRFEIVHLAFNSIVENYLTVAGYED 1886 (2233)
Q Consensus      1814 LSas~rGrLAV-------aEgdKVTILqlsaLLkQad~s~~skdKlTLtrLSsa~VpFeVlsLafNP~nEdyLAVcGLkD 1886 (2233)
                      +++++.|.+++       .|---|-+++.+.=+.+.-...+               ..+|-+|+|||-..-.|+|+-=+-
T Consensus       531 l~~s~~gnliASaCKS~~~ehAvI~lw~t~~W~~~~~L~~H---------------sLTVT~l~FSpdg~~LLsvsRDRt  595 (764)
T KOG1063|consen  531 LAISPTGNLIASACKSSLKEHAVIRLWNTANWLQVQELEGH---------------SLTVTRLAFSPDGRYLLSVSRDRT  595 (764)
T ss_pred             EEecCCCCEEeehhhhCCccceEEEEEeccchhhhheeccc---------------ceEEEEEEECCCCcEEEEeecCce
Confidence            44555555544       24445666666555544432221               246778899996555566665555


Q ss_pred             eEEEEecCCCceeeeeeeeeccC---CceEEEeEEecCCCceEEEEecC-eEEEEeCcCC--CCCCcEEEEcC-CCCeeE
Q 047869         1887 CQVLTLNPRGEVTDRLAIELALQ---GAYIRRVDWVPGSPVQLMVVTNK-FVKIYDLSQD--NISPLHYFTLP-DDMIVD 1959 (2233)
Q Consensus      1887 C~VLTfss~GeV~DRL~LeL~Le---g~fIIKa~WLPGSQt~LAVVT~~-FVKIYDLS~D--~lSPvyyF~Lp-sGkIrD 1959 (2233)
                      ..+++.  +-++.|-..  .+..   .--|=.+.|-|.+.+ +|-+..| +||||-...+  ..-|.. -.++ ++.++.
T Consensus       596 ~sl~~~--~~~~~~e~~--fa~~k~HtRIIWdcsW~pde~~-FaTaSRDK~VkVW~~~~~~d~~i~~~-a~~~~~~aVTA  669 (764)
T KOG1063|consen  596 VSLYEV--QEDIKDEFR--FACLKAHTRIIWDCSWSPDEKY-FATASRDKKVKVWEEPDLRDKYISRF-ACLKFSLAVTA  669 (764)
T ss_pred             EEeeee--ecccchhhh--hccccccceEEEEcccCcccce-eEEecCCceEEEEeccCchhhhhhhh-chhccCCceee
Confidence            666655  222232222  1111   335779999999988 7766666 9999999888  433433 2223 334544


Q ss_pred             EEEEE--ecCCcEEEEEEecCCceEEEEec
Q 047869         1960 ATLVI--ASRGKMFLIVLSECGSLYRLELS 1987 (2233)
Q Consensus      1960 aTfv~--~e~G~~~ILVLSS~G~LY~Qels 1987 (2233)
                      .++..  ..+....+.|=-+.|.||.-...
T Consensus       670 v~~~~~~~~e~~~~vavGle~GeI~l~~~~  699 (764)
T KOG1063|consen  670 VAYLPVDHNEKGDVVAVGLEKGEIVLWRRK  699 (764)
T ss_pred             EEeeccccccccceEEEEecccEEEEEecc
Confidence            44432  11222477888899999976544


No 79 
>KOG0278 consensus Serine/threonine kinase receptor-associated protein [Lipid transport and metabolism]
Probab=45.29  E-value=3.9e+02  Score=33.11  Aligned_cols=89  Identities=11%  Similarity=0.184  Sum_probs=60.7

Q ss_pred             eEEEEeCcCCCCCCcEEEEcCCCCeeEEEEEEecCCcEEEEEEecCCceEEEEecccCCCccccceeeeecccccccCCe
Q 047869         1933 FVKIYDLSQDNISPLHYFTLPDDMIVDATLVIASRGKMFLIVLSECGSLYRLELSVEGNVGATPLKEIIQFNDREIHAKG 2012 (2233)
Q Consensus      1933 FVKIYDLS~D~lSPvyyF~LpsGkIrDaTfv~~e~G~~~ILVLSS~G~LY~Qels~s~d~g~~~ltEvvq~~~~q~~~~G 2012 (2233)
                      -++||||.+..-.|+ +|.=-+|.||++.+. .++  .+||-.+.+|-+-+=+..- +     .....++.     +..-
T Consensus       123 llrvfdln~p~App~-E~~ghtg~Ir~v~wc-~eD--~~iLSSadd~tVRLWD~rT-g-----t~v~sL~~-----~s~V  187 (334)
T KOG0278|consen  123 LLRVFDLNRPKAPPK-EISGHTGGIRTVLWC-HED--KCILSSADDKTVRLWDHRT-G-----TEVQSLEF-----NSPV  187 (334)
T ss_pred             HhhhhhccCCCCCch-hhcCCCCcceeEEEe-ccC--ceEEeeccCCceEEEEecc-C-----cEEEEEec-----CCCC
Confidence            589999999887776 556667899999966 343  3666667777766555441 1     11112232     3335


Q ss_pred             EEEEeccccceeeEEecCCcEEEE
Q 047869         2013 LSLYFSSTYKLLFLSFQDGTTLVG 2036 (2233)
Q Consensus      2013 VSVyYS~tl~LLF~SY~~G~Sf~a 2036 (2233)
                      -|+-||++-+.|-++|-.+-.|.-
T Consensus       188 tSlEvs~dG~ilTia~gssV~Fwd  211 (334)
T KOG0278|consen  188 TSLEVSQDGRILTIAYGSSVKFWD  211 (334)
T ss_pred             cceeeccCCCEEEEecCceeEEec
Confidence            578999999999999988877763


No 80 
>PF04762 IKI3:  IKI3 family;  InterPro: IPR006849 Members of this family are components of the elongator multi-subunit component of a novel RNA polymerase II holoenzyme for transcriptional elongation.
Probab=44.74  E-value=1.2e+03  Score=33.43  Aligned_cols=117  Identities=13%  Similarity=0.184  Sum_probs=74.5

Q ss_pred             EeecccCccceEEeecc--cceEEEEecCCCceeeeeeeeeccCCceEEEeEEecCCCceEEEEecCeEEEEeCcCCCCC
Q 047869         1868 HLAFNSIVENYLTVAGY--EDCQVLTLNPRGEVTDRLAIELALQGAYIRRVDWVPGSPVQLMVVTNKFVKIYDLSQDNIS 1945 (2233)
Q Consensus      1868 sLafNP~nEdyLAVcGL--kDC~VLTfss~GeV~DRL~LeL~Leg~fIIKa~WLPGSQt~LAVVT~~FVKIYDLS~D~lS 1945 (2233)
                      .++.-| +++++|..--  ..-.|.-|-.+|-.-....+....++.-|+...|=+.|.. |||.+.+.|.+|-.+-=---
T Consensus       261 ~l~WrP-sG~lIA~~q~~~~~~~VvFfErNGLrhgeF~l~~~~~~~~v~~l~Wn~ds~i-LAv~~~~~vqLWt~~NYHWY  338 (928)
T PF04762_consen  261 ALSWRP-SGNLIASSQRLPDRHDVVFFERNGLRHGEFTLRFDPEEEKVIELAWNSDSEI-LAVWLEDRVQLWTRSNYHWY  338 (928)
T ss_pred             CccCCC-CCCEEEEEEEcCCCcEEEEEecCCcEeeeEecCCCCCCceeeEEEECCCCCE-EEEEecCCceEEEeeCCEEE
Confidence            455555 6777777653  1244666778887766666665456778999999999987 89999989999976521111


Q ss_pred             CcEEEEcCCCCeeEEEEEEecCCcEEEEEEecCCceEEEEec
Q 047869         1946 PLHYFTLPDDMIVDATLVIASRGKMFLIVLSECGSLYRLELS 1987 (2233)
Q Consensus      1946 PvyyF~LpsGkIrDaTfv~~e~G~~~ILVLSS~G~LY~Qels 1987 (2233)
                      =+++...+.+.-... +...++..+.+.++++.|.++..++.
T Consensus       339 LKqei~~~~~~~~~~-~~Wdpe~p~~L~v~t~~g~~~~~~~~  379 (928)
T PF04762_consen  339 LKQEIRFSSSESVNF-VKWDPEKPLRLHVLTSNGQYEIYDFA  379 (928)
T ss_pred             EEEEEEccCCCCCCc-eEECCCCCCEEEEEecCCcEEEEEEE
Confidence            122223333221111 33355566788888888888776664


No 81 
>PF06977 SdiA-regulated:  SdiA-regulated;  InterPro: IPR009722 This entry represents a conserved region approximately 100 residues long within a number of hypothetical bacterial proteins that may be regulated by SdiA, a member of the LuxR family of transcriptional regulators. Some proteins contain the IPR001258 from INTERPRO repeat.; PDB: 3QQZ_A.
Probab=43.62  E-value=6.8e+02  Score=30.39  Aligned_cols=117  Identities=16%  Similarity=0.175  Sum_probs=59.8

Q ss_pred             cccceEEEEeecccCccceEEeecccceEEEEecCCCceeeeeeeeeccCCceEEEeEEecCCCceEEEEecCeEEEEeC
Q 047869         1860 NIVRFEIVHLAFNSIVENYLTVAGYEDCQVLTLNPRGEVTDRLAIELALQGAYIRRVDWVPGSPVQLMVVTNKFVKIYDL 1939 (2233)
Q Consensus      1860 a~VpFeVlsLafNP~nEdyLAVcGLkDC~VLTfss~GeV~DRL~LeL~Leg~fIIKa~WLPGSQt~LAVVT~~FVKIYDL 1939 (2233)
                      ..+.=++-+|++||..+. |+++.=+...|+.++.+|+|..++.+...                                
T Consensus        18 ~g~~~e~SGLTy~pd~~t-LfaV~d~~~~i~els~~G~vlr~i~l~g~--------------------------------   64 (248)
T PF06977_consen   18 PGILDELSGLTYNPDTGT-LFAVQDEPGEIYELSLDGKVLRRIPLDGF--------------------------------   64 (248)
T ss_dssp             TT--S-EEEEEEETTTTE-EEEEETTTTEEEEEETT--EEEEEE-SS---------------------------------
T ss_pred             CCccCCccccEEcCCCCe-EEEEECCCCEEEEEcCCCCEEEEEeCCCC--------------------------------
Confidence            334446889999995555 55555568888999888888877655321                                


Q ss_pred             cCCCCCCcEEEEcCCCCeeEEEEEEecCCcEEEEEEec-CCceEEEEecccCCCccccceeeeeccccccc---CCeEEE
Q 047869         1940 SQDNISPLHYFTLPDDMIVDATLVIASRGKMFLIVLSE-CGSLYRLELSVEGNVGATPLKEIIQFNDREIH---AKGLSL 2015 (2233)
Q Consensus      1940 S~D~lSPvyyF~LpsGkIrDaTfv~~e~G~~~ILVLSS-~G~LY~Qels~s~d~g~~~ltEvvq~~~~q~~---~~GVSV 2015 (2233)
                                     |.-.+.|++  .+| .+ ++.++ .|.||.-++......-...-...+++......   -+|  |
T Consensus        65 ---------------~D~EgI~y~--g~~-~~-vl~~Er~~~L~~~~~~~~~~~~~~~~~~~~~l~~~~~~N~G~EG--l  123 (248)
T PF06977_consen   65 ---------------GDYEGITYL--GNG-RY-VLSEERDQRLYIFTIDDDTTSLDRADVQKISLGFPNKGNKGFEG--L  123 (248)
T ss_dssp             ---------------SSEEEEEE---STT-EE-EEEETTTTEEEEEEE----TT--EEEEEEEE---S---SS--EE--E
T ss_pred             ---------------CCceeEEEE--CCC-EE-EEEEcCCCcEEEEEEeccccccchhhceEEecccccCCCcceEE--E
Confidence                           456677754  334 22 22232 78888877754322111111111222221112   235  4


Q ss_pred             EeccccceeeEEecC
Q 047869         2016 YFSSTYKLLFLSFQD 2030 (2233)
Q Consensus      2016 yYS~tl~LLF~SY~~ 2030 (2233)
                      .|.+..+-||+.-+.
T Consensus       124 a~D~~~~~L~v~kE~  138 (248)
T PF06977_consen  124 AYDPKTNRLFVAKER  138 (248)
T ss_dssp             EEETTTTEEEEEEES
T ss_pred             EEcCCCCEEEEEeCC
Confidence            999999999988654


No 82 
>KOG1446 consensus Histone H3 (Lys4) methyltransferase complex and RNA cleavage factor II complex, subunit SWD2 [RNA processing and modification; Chromatin structure and dynamics; Posttranslational modification, protein turnover, chaperones]
Probab=43.42  E-value=7.2e+02  Score=31.46  Aligned_cols=160  Identities=11%  Similarity=0.108  Sum_probs=100.7

Q ss_pred             cccceecccCceEE-Ee-eCCeEEEEechhhhcccccCCccccccccccccccccceEEEEeecccCccceEEeecccce
Q 047869         1810 VKSLLSVSSRGRLA-VG-EGDKVAIFDVGQLIGQATIQPVTADKTNVKPLSRNIVRFEIVHLAFNSIVENYLTVAGYEDC 1887 (2233)
Q Consensus      1810 iRqLLSas~rGrLA-Va-EgdKVTILqlsaLLkQad~s~~skdKlTLtrLSsa~VpFeVlsLafNP~nEdyLAVcGLkDC 1887 (2233)
                      -|.+.+-+..|.++ ++ +++.|-++|++.-=+.+-      ++..+.-    +---+.-.|.|+| ++.+|.++--...
T Consensus       142 ~~pi~AfDp~GLifA~~~~~~~IkLyD~Rs~dkgPF------~tf~i~~----~~~~ew~~l~FS~-dGK~iLlsT~~s~  210 (311)
T KOG1446|consen  142 GRPIAAFDPEGLIFALANGSELIKLYDLRSFDKGPF------TTFSITD----NDEAEWTDLEFSP-DGKSILLSTNASF  210 (311)
T ss_pred             CCcceeECCCCcEEEEecCCCeEEEEEecccCCCCc------eeEccCC----CCccceeeeEEcC-CCCEEEEEeCCCc
Confidence            35567778888864 44 555999999976533322      1111111    2234677889988 8888888877776


Q ss_pred             EEEEecCCCceeeeeeeeeccCCceEEEeEEecCCCceEEEEecCeEEEEeCcCCCCCCcEEEEcCCCCeeEEEEEEecC
Q 047869         1888 QVLTLNPRGEVTDRLAIELALQGAYIRRVDWVPGSPVQLMVVTNKFVKIYDLSQDNISPLHYFTLPDDMIVDATLVIASR 1967 (2233)
Q Consensus      1888 ~VLTfss~GeV~DRL~LeL~Leg~fIIKa~WLPGSQt~LAVVT~~FVKIYDLS~D~lSPvyyF~LpsGkIrDaTfv~~e~ 1967 (2233)
                      +-+--.-+|++..-...++.- ++.=..|...|.||.-|.=.....|.||++....  +++-+.=|  .+-..+.+ +.+
T Consensus       211 ~~~lDAf~G~~~~tfs~~~~~-~~~~~~a~ftPds~Fvl~gs~dg~i~vw~~~tg~--~v~~~~~~--~~~~~~~~-~fn  284 (311)
T KOG1446|consen  211 IYLLDAFDGTVKSTFSGYPNA-GNLPLSATFTPDSKFVLSGSDDGTIHVWNLETGK--KVAVLRGP--NGGPVSCV-RFN  284 (311)
T ss_pred             EEEEEccCCcEeeeEeeccCC-CCcceeEEECCCCcEEEEecCCCcEEEEEcCCCc--EeeEecCC--CCCCcccc-ccC
Confidence            666667888876666655543 3344888899999775544455699999994433  44444333  34444433 344


Q ss_pred             CcEEEEEEecCCceEEEEe
Q 047869         1968 GKMFLIVLSECGSLYRLEL 1986 (2233)
Q Consensus      1968 G~~~ILVLSS~G~LY~Qel 1986 (2233)
                      =++.++|.++.--.++-+.
T Consensus       285 P~~~mf~sa~s~l~fw~p~  303 (311)
T KOG1446|consen  285 PRYAMFVSASSNLVFWLPD  303 (311)
T ss_pred             CceeeeeecCceEEEEecc
Confidence            5567777777666665443


No 83 
>KOG0647 consensus mRNA export protein (contains WD40 repeats) [RNA processing and modification]
Probab=42.28  E-value=7.3e+02  Score=31.58  Aligned_cols=87  Identities=29%  Similarity=0.453  Sum_probs=59.5

Q ss_pred             EEEEeecccCccceEEeecccceEEEEe--cCCCceeeeeeee------------------------------e------
Q 047869         1865 EIVHLAFNSIVENYLTVAGYEDCQVLTL--NPRGEVTDRLAIE------------------------------L------ 1906 (2233)
Q Consensus      1865 eVlsLafNP~nEdyLAVcGLkDC~VLTf--ss~GeV~DRL~Le------------------------------L------ 1906 (2233)
                      +|-.|+|+| -.++|++||=.|..|=+.  ..+|..+-+.+.+                              |      
T Consensus        29 sIS~l~FSP-~~~~~~~A~SWD~tVR~wevq~~g~~~~ka~~~~~~PvL~v~WsddgskVf~g~~Dk~~k~wDL~S~Q~~  107 (347)
T KOG0647|consen   29 SISALAFSP-QADNLLAAGSWDGTVRIWEVQNSGQLVPKAQQSHDGPVLDVCWSDDGSKVFSGGCDKQAKLWDLASGQVS  107 (347)
T ss_pred             chheeEecc-ccCceEEecccCCceEEEEEecCCcccchhhhccCCCeEEEEEccCCceEEeeccCCceEEEEccCCCee
Confidence            466899999 677788899988876554  4434333322111                              1      


Q ss_pred             --ccCCceEEEeEEecCCCceEEEEec--CeEEEEeCcCCCCCCcEEEEcCC
Q 047869         1907 --ALQGAYIRRVDWVPGSPVQLMVVTN--KFVKIYDLSQDNISPLHYFTLPD 1954 (2233)
Q Consensus      1907 --~Leg~fIIKa~WLPGSQt~LAVVT~--~FVKIYDLS~D~lSPvyyF~Lps 1954 (2233)
                        ++-+.=|+-+.||++-.+++.++++  ..+|.||.-  +-.|++..-||+
T Consensus       108 ~v~~Hd~pvkt~~wv~~~~~~cl~TGSWDKTlKfWD~R--~~~pv~t~~LPe  157 (347)
T KOG0647|consen  108 QVAAHDAPVKTCHWVPGMNYQCLVTGSWDKTLKFWDTR--SSNPVATLQLPE  157 (347)
T ss_pred             eeeecccceeEEEEecCCCcceeEecccccceeecccC--CCCeeeeeeccc
Confidence              1112358999999998877666554  389999987  455889988886


No 84 
>KOG1140 consensus N-end rule pathway, recognition component UBR1 [Posttranslational modification, protein turnover, chaperones]
Probab=41.46  E-value=14  Score=53.30  Aligned_cols=65  Identities=23%  Similarity=0.436  Sum_probs=47.8

Q ss_pred             CCCcceeccCCcccccceEeeccCCCCCCceeehhhhhhhcCCCcEEEE---eecceeeecCCCCCCCCCcee
Q 047869         1588 SKVCTFTSSGSNFMEQHWYFCYTCDLTVSKGCCSVCAKVCHRGHRVVYS---RSSRFFCDCGAGGVRGSSCQC 1657 (2233)
Q Consensus      1588 ~~~CTFt~TG~~fi~Q~~Y~C~TC~l~~~~GVC~aCA~vCHkGHdVvyl---~k~~FfCDCGa~~~~~~~Cqc 1657 (2233)
                      -..|+-.    -++-|..|.|++|+..+.--+|.-| ..=|..|-+.+.   ......||||+-.--..+|.|
T Consensus        13 g~~c~~~----~~~~e~~y~c~~c~~~~~~~~c~~c-~~~~~~~~~~~~v~~~~~~~~c~cgd~da~n~~~~~   80 (1738)
T KOG1140|consen   13 GRNCGRV----FKIGEPTYRCHECGTDDTCVLCIHC-PEVHVNHSVCTKVHTEFTSGICDCGDEDAWNSPLHC   80 (1738)
T ss_pred             ccccccc----cccCCceEEEEecCCCcchhHHHhc-chhhhhhhhcceeEecccccccCCCChhhccCcchH
Confidence            3445543    4567899999999998777788888 555888888876   456789999986554445555


No 85 
>KOG0279 consensus G protein beta subunit-like protein [Signal transduction mechanisms]
Probab=40.28  E-value=2.1e+02  Score=35.61  Aligned_cols=114  Identities=14%  Similarity=0.252  Sum_probs=84.5

Q ss_pred             cccceecccCceEEE--eeCCeEEEEechhhhcccccCCccccccccccccccccceEEEEeecccCccceEEeecccce
Q 047869         1810 VKSLLSVSSRGRLAV--GEGDKVAIFDVGQLIGQATIQPVTADKTNVKPLSRNIVRFEIVHLAFNSIVENYLTVAGYEDC 1887 (2233)
Q Consensus      1810 iRqLLSas~rGrLAV--aEgdKVTILqlsaLLkQad~s~~skdKlTLtrLSsa~VpFeVlsLafNP~nEdyLAVcGLkDC 1887 (2233)
                      .-....++..|.+-.  ++++++.++++.+     .      ++     +-+-...=+|-+++|.| |+-.|+.+=-.-+
T Consensus       194 ~v~t~~vSpDGslcasGgkdg~~~LwdL~~-----~------k~-----lysl~a~~~v~sl~fsp-nrywL~~at~~sI  256 (315)
T KOG0279|consen  194 YVNTVTVSPDGSLCASGGKDGEAMLWDLNE-----G------KN-----LYSLEAFDIVNSLCFSP-NRYWLCAATATSI  256 (315)
T ss_pred             cEEEEEECCCCCEEecCCCCceEEEEEccC-----C------ce-----eEeccCCCeEeeEEecC-CceeEeeccCCce
Confidence            445677888888765  3889999999842     1      11     33334445788999988 6666666655567


Q ss_pred             EEEEecCCCceeeeeeeeeccC-----CceEEEeEEecCCCceEEEEecCeEEEEeCcC
Q 047869         1888 QVLTLNPRGEVTDRLAIELALQ-----GAYIRRVDWVPGSPVQLMVVTNKFVKIYDLSQ 1941 (2233)
Q Consensus      1888 ~VLTfss~GeV~DRL~LeL~Le-----g~fIIKa~WLPGSQt~LAVVT~~FVKIYDLS~ 1941 (2233)
                      .|.-+ ..+.+++.+.++..=+     +.+.+-..|-+.-|++++=-|.+-|++|.+++
T Consensus       257 kIwdl-~~~~~v~~l~~d~~g~s~~~~~~~clslaws~dG~tLf~g~td~~irv~qv~~  314 (315)
T KOG0279|consen  257 KIWDL-ESKAVVEELKLDGIGPSSKAGDPICLSLAWSADGQTLFAGYTDNVIRVWQVAK  314 (315)
T ss_pred             EEEec-cchhhhhhccccccccccccCCcEEEEEEEcCCCcEEEeeecCCcEEEEEeec
Confidence            77776 4557777777766544     77899999999999999999999999998764


No 86 
>PF14761 HPS3_N:  Hermansky-Pudlak syndrome 3
Probab=40.19  E-value=66  Score=38.19  Aligned_cols=74  Identities=24%  Similarity=0.358  Sum_probs=57.2

Q ss_pred             ceecccCceEEEeeCCeEEEEechhhhcccccCCcccccccc-ccccccccceEEEEeecccCccceEEeecccceEEEE
Q 047869         1813 LLSVSSRGRLAVGEGDKVAIFDVGQLIGQATIQPVTADKTNV-KPLSRNIVRFEIVHLAFNSIVENYLTVAGYEDCQVLT 1891 (2233)
Q Consensus      1813 LLSas~rGrLAVaEgdKVTILqlsaLLkQad~s~~skdKlTL-trLSsa~VpFeVlsLafNP~nEdyLAVcGLkDC~VLT 1891 (2233)
                      +-+|..-|.|+|+=++|+.|+.+..-..|..    ..+=+++ ..+-.....|...+|++   -++|+|+..-.|++|+.
T Consensus       140 iaCC~~tG~LlVg~~~~l~lf~l~~~~~~~~----~~~~lDFe~~l~~~~~~~~p~~v~i---c~~yiA~~s~~ev~Vlk  212 (215)
T PF14761_consen  140 IACCPVTGNLLVGCGNKLVLFTLKYQTIQSE----KFSFLDFERSLIDHIDNFKPTQVAI---CEGYIAVMSDLEVLVLK  212 (215)
T ss_pred             EEecCCCCCEEEEcCCEEEEEEEEEEEEecc----cccEEechhhhhheecCceEEEEEE---EeeEEEEecCCEEEEEE
Confidence            3456788999999999999999976665421    2233344 34456677888999999   79999999999999998


Q ss_pred             ec
Q 047869         1892 LN 1893 (2233)
Q Consensus      1892 fs 1893 (2233)
                      +.
T Consensus       213 l~  214 (215)
T PF14761_consen  213 LE  214 (215)
T ss_pred             Ee
Confidence            74


No 87 
>KOG0276 consensus Vesicle coat complex COPI, beta' subunit [Intracellular trafficking, secretion, and vesicular transport]
Probab=39.96  E-value=5.3e+02  Score=35.23  Aligned_cols=224  Identities=19%  Similarity=0.286  Sum_probs=0.0

Q ss_pred             EEEeecccCccceEEeecc-cceEEEEecCCCceeeeeeeeeccCCce--EEEeEEecCCCceEEEEecC--eEEEEeCc
Q 047869         1866 IVHLAFNSIVENYLTVAGY-EDCQVLTLNPRGEVTDRLAIELALQGAY--IRRVDWVPGSPVQLMVVTNK--FVKIYDLS 1940 (2233)
Q Consensus      1866 VlsLafNP~nEdyLAVcGL-kDC~VLTfss~GeV~DRL~LeL~Leg~f--IIKa~WLPGSQt~LAVVT~~--FVKIYDLS 1940 (2233)
                      ||+|+|||-..+-.|-|-| +-+.|-.+.+.       +....|+|--  |..+...||-.-.-.|..++  -|||||--
T Consensus       143 VMqv~fnPkD~ntFaS~sLDrTVKVWslgs~-------~~nfTl~gHekGVN~Vdyy~~gdkpylIsgaDD~tiKvWDyQ  215 (794)
T KOG0276|consen  143 VMQVAFNPKDPNTFASASLDRTVKVWSLGSP-------HPNFTLEGHEKGVNCVDYYTGGDKPYLISGADDLTIKVWDYQ  215 (794)
T ss_pred             EEEEEecCCCccceeeeeccccEEEEEcCCC-------CCceeeeccccCcceEEeccCCCcceEEecCCCceEEEeecc


Q ss_pred             CCCCCCcEEEEcCCCCeeEEEEEEecCCcEEEEEEecCCceEEEEecccCCCccccceeeeecccccccCCeEEEEeccc
Q 047869         1941 QDNISPLHYFTLPDDMIVDATLVIASRGKMFLIVLSECGSLYRLELSVEGNVGATPLKEIIQFNDREIHAKGLSLYFSST 2020 (2233)
Q Consensus      1941 ~D~lSPvyyF~LpsGkIrDaTfv~~e~G~~~ILVLSS~G~LY~Qels~s~d~g~~~ltEvvq~~~~q~~~~GVSVyYS~t 2020 (2233)
                      ...     .....+|--....+++-..---+|+--|++|-+=      ==+...+.+..+++.-.    ...-.|-+-..
T Consensus       216 tk~-----CV~TLeGHt~Nvs~v~fhp~lpiiisgsEDGTvr------iWhs~Ty~lE~tLn~gl----eRvW~I~~~k~  280 (794)
T KOG0276|consen  216 TKS-----CVQTLEGHTNNVSFVFFHPELPIIISGSEDGTVR------IWNSKTYKLEKTLNYGL----ERVWCIAAHKG  280 (794)
T ss_pred             hHH-----HHHHhhcccccceEEEecCCCcEEEEecCCccEE------EecCcceehhhhhhcCC----ceEEEEeecCC


Q ss_pred             cceeeEEecCCcEEEEEcCCCccccc--ceeEEE-----EccCCCCCCCcccceeeccCCCceEEEEeccCCCceEEEEe
Q 047869         2021 YKLLFLSFQDGTTLVGRLSPNAASLS--EVSYVF-----EEQDGKLRSAGLHRWKELLASSGLFFCFSSLKSNAAVAVSL 2093 (2233)
Q Consensus      2021 l~LLF~SY~~G~Sf~a~Ls~~~~sv~--eis~Vf-----e~~~gk~~~a~L~qWsEV~~hPGLf~cls~~~sn~pvvv~l 2093 (2233)
                      -+.+-+.|++|..++ .+-+..-.++  ....++     +.+..+.++.+---  |+           ...+--||.++-
T Consensus       281 ~~~i~vG~Deg~i~v-~lgreeP~vsMd~~gKIiwa~~~ei~~~~~ks~~~~~--ev-----------~DgErL~LsvKe  346 (794)
T KOG0276|consen  281 DGKIAVGFDEGSVTV-KLGREEPAVSMDSNGKIIWAVHSEIQAVNLKSVGAQK--EV-----------TDGERLPLSVKE  346 (794)
T ss_pred             CCeEEEeccCCcEEE-EccCCCCceeecCCccEEEEcCceeeeeeceeccCcc--cc-----------cCCccccchhhh


Q ss_pred             -cCCceeeeccccccCCCCCeEEEEEeecCCCCCeEEEEEeeCCceeEEec
Q 047869         2094 -GTNELIAQNMRHAAGSTSPLVGVTAYKPLSKDKVHCLVLHDDGSLQIYSH 2143 (2233)
Q Consensus      2094 -~pd~I~iQeiK~~~~sSs~vdgva~y~p~s~~rttlLLLcEDGSLrIYsa 2143 (2233)
                       +.-+|+-|.|+|                  .-.-...+.|-||--.||.+
T Consensus       347 Lgs~eiyPq~L~h------------------sPNGrfV~VcgdGEyiIyTa  379 (794)
T KOG0276|consen  347 LGSVEIYPQTLAH------------------SPNGRFVVVCGDGEYIIYTA  379 (794)
T ss_pred             ccccccchHHhcc------------------CCCCcEEEEecCccEEEEEe


No 88 
>KOG4532 consensus WD40-like repeat containing protein [General function prediction only]
Probab=39.82  E-value=5.5e+02  Score=32.23  Aligned_cols=112  Identities=11%  Similarity=0.114  Sum_probs=78.6

Q ss_pred             EEEeecccCccceEEeec-ccceEEEEecCCCceeeeeeeeeccCCceEEEeEEecCCCceEEEEecC-eEEEEeCcCCC
Q 047869         1866 IVHLAFNSIVENYLTVAG-YEDCQVLTLNPRGEVTDRLAIELALQGAYIRRVDWVPGSPVQLMVVTNK-FVKIYDLSQDN 1943 (2233)
Q Consensus      1866 VlsLafNP~nEdyLAVcG-LkDC~VLTfss~GeV~DRL~LeL~Leg~fIIKa~WLPGSQt~LAVVT~~-FVKIYDLS~D~ 1943 (2233)
                      +-+....| .+.+++++| -+++.-+.|...|+-..++.+.+..++.|..  .|=..+ .++||++.+ .+.|||.-.+.
T Consensus       161 ~ns~~~sn-d~~~~~~Vgds~~Vf~y~id~~sey~~~~~~a~t~D~gF~~--S~s~~~-~~FAv~~Qdg~~~I~DVR~~~  236 (344)
T KOG4532|consen  161 QNSLHYSN-DPSWGSSVGDSRRVFRYAIDDESEYIENIYEAPTSDHGFYN--SFSEND-LQFAVVFQDGTCAIYDVRNMA  236 (344)
T ss_pred             eeeeEEcC-CCceEEEecCCCcceEEEeCCccceeeeeEecccCCCceee--eeccCc-ceEEEEecCCcEEEEEecccc
Confidence            44556655 667888777 5789999999999999998888888888854  354443 458999988 99999998777


Q ss_pred             CCCcEEEE----cCCCCeeEEEEEEecCCcEEEEEEecCCceEEEE
Q 047869         1944 ISPLHYFT----LPDDMIVDATLVIASRGKMFLIVLSECGSLYRLE 1985 (2233)
Q Consensus      1944 lSPvyyF~----LpsGkIrDaTfv~~e~G~~~ILVLSS~G~LY~Qe 1985 (2233)
                      .. +-+-+    -+-|.||.+-|-.  .|-.-++..| +|.=|.+-
T Consensus       237 tp-m~~~sstrp~hnGa~R~c~Fsl--~g~lDLLf~s-Ehfs~~hv  278 (344)
T KOG4532|consen  237 TP-MAEISSTRPHHNGAFRVCRFSL--YGLLDLLFIS-EHFSRVHV  278 (344)
T ss_pred             cc-hhhhcccCCCCCCceEEEEecC--CCcceEEEEe-cCcceEEE
Confidence            43 22221    1578899888753  4555555554 44444443


No 89 
>KOG0269 consensus WD40 repeat-containing protein [Function unknown]
Probab=39.78  E-value=1.3e+03  Score=32.57  Aligned_cols=128  Identities=14%  Similarity=0.207  Sum_probs=75.8

Q ss_pred             cccccccccccceEEEEeecccCccceEEeecccceE-EEEecCCCceeeeeeeeeccC-CceEEEeEEecCCCceEEEE
Q 047869         1852 TNVKPLSRNIVRFEIVHLAFNSIVENYLTVAGYEDCQ-VLTLNPRGEVTDRLAIELALQ-GAYIRRVDWVPGSPVQLMVV 1929 (2233)
Q Consensus      1852 lTLtrLSsa~VpFeVlsLafNP~nEdyLAVcGLkDC~-VLTfss~GeV~DRL~LeL~Le-g~fIIKa~WLPGSQt~LAVV 1929 (2233)
                      +.+..=++.-.-+..-.|+-+.-.+|+||+|--.-.. |.-+|..   ++.-.+..--+ .--+.+..|=+-... +.|.
T Consensus        76 ~~~~~k~kqn~~~S~~DVkW~~~~~NlIAT~s~nG~i~vWdlnk~---~rnk~l~~f~EH~Rs~~~ldfh~tep~-iliS  151 (839)
T KOG0269|consen   76 CNHRFKTKQNKFYSAADVKWGQLYSNLIATCSTNGVISVWDLNKS---IRNKLLTVFNEHERSANKLDFHSTEPN-ILIS  151 (839)
T ss_pred             eeeecccccceeeehhhcccccchhhhheeecCCCcEEEEecCcc---ccchhhhHhhhhccceeeeeeccCCcc-EEEe
Confidence            3333333333444555777788899999988544322 2333332   11111111112 345888888877665 4444


Q ss_pred             ecC--eEEEEeCcCCCCCCcEEEEcCCCCeeEEEEEEecCCcEEEEEEecCCceEEEEec
Q 047869         1930 TNK--FVKIYDLSQDNISPLHYFTLPDDMIVDATLVIASRGKMFLIVLSECGSLYRLELS 1987 (2233)
Q Consensus      1930 T~~--FVKIYDLS~D~lSPvyyF~LpsGkIrDaTfv~~e~G~~~ILVLSS~G~LY~Qels 1987 (2233)
                      .+|  +||+|||-.+.-  +-.|.--+..|||+.|.... | +.-.-..+.|+|--=+|+
T Consensus       152 GSQDg~vK~~DlR~~~S--~~t~~~nSESiRDV~fsp~~-~-~~F~s~~dsG~lqlWDlR  207 (839)
T KOG0269|consen  152 GSQDGTVKCWDLRSKKS--KSTFRSNSESIRDVKFSPGY-G-NKFASIHDSGYLQLWDLR  207 (839)
T ss_pred             cCCCceEEEEeeecccc--cccccccchhhhceeeccCC-C-ceEEEecCCceEEEeecc
Confidence            444  999999976542  23444488899999988654 4 455566688887766666


No 90 
>KOG0307 consensus Vesicle coat complex COPII, subunit SEC31 [Intracellular trafficking, secretion, and vesicular transport]
Probab=38.73  E-value=71  Score=44.71  Aligned_cols=195  Identities=19%  Similarity=0.226  Sum_probs=122.8

Q ss_pred             ceEEEe-eCCeEEEEechhhhcccccCCccccccccccccccccceEEEEeecccCccceEEeecccceEEEEecCCCce
Q 047869         1820 GRLAVG-EGDKVAIFDVGQLIGQATIQPVTADKTNVKPLSRNIVRFEIVHLAFNSIVENYLTVAGYEDCQVLTLNPRGEV 1898 (2233)
Q Consensus      1820 GrLAVa-EgdKVTILqlsaLLkQad~s~~skdKlTLtrLSsa~VpFeVlsLafNP~nEdyLAVcGLkDC~VLTfss~GeV 1898 (2233)
                      |.+|=+ |.|+|.+++..++++-...       --+.+.+....  .|..+-|||--+|.||-+| .|=.|+.++=+   
T Consensus        81 GlIaGG~edG~I~ly~p~~~~~~~~~-------~~la~~~~h~G--~V~gLDfN~~q~nlLASGa-~~geI~iWDln---  147 (1049)
T KOG0307|consen   81 GLIAGGLEDGNIVLYDPASIIANASE-------EVLATKSKHTG--PVLGLDFNPFQGNLLASGA-DDGEILIWDLN---  147 (1049)
T ss_pred             ceeeccccCCceEEecchhhccCcch-------HHHhhhcccCC--ceeeeeccccCCceeeccC-CCCcEEEeccC---
Confidence            566655 9999999999998543321       11222222222  3558999999999998765 33444444222   


Q ss_pred             eeeeeeeecc------CCceEEEeEEecCCCceEEEEecC-eEEEEeCcCCCCCCcEEEEcCCCCeeEEEEEEecCCcEE
Q 047869         1899 TDRLAIELAL------QGAYIRRVDWVPGSPVQLMVVTNK-FVKIYDLSQDNISPLHYFTLPDDMIVDATLVIASRGKMF 1971 (2233)
Q Consensus      1899 ~DRL~LeL~L------eg~fIIKa~WLPGSQt~LAVVT~~-FVKIYDLS~D~lSPvyyF~LpsGkIrDaTfv~~e~G~~~ 1971 (2233)
                          ..+-..      .-.-|..+.|=-.-|--||=++.. ++-||||-+.  .|+..|.-..+..+=..+--..++.-.
T Consensus       148 ----n~~tP~~~~~~~~~~eI~~lsWNrkvqhILAS~s~sg~~~iWDlr~~--~pii~ls~~~~~~~~S~l~WhP~~aTq  221 (1049)
T KOG0307|consen  148 ----KPETPFTPGSQAPPSEIKCLSWNRKVSHILASGSPSGRAVIWDLRKK--KPIIKLSDTPGRMHCSVLAWHPDHATQ  221 (1049)
T ss_pred             ----CcCCCCCCCCCCCcccceEeccchhhhHHhhccCCCCCceeccccCC--CcccccccCCCccceeeeeeCCCCcee
Confidence                111111      124589999988888888877777 9999999988  899999999988433333335678888


Q ss_pred             EEEEecCCceEEEEecccCCCccccceeeeecccccccCCeEEEEecccc-ceeeEEecCCcEEEEEcC
Q 047869         1972 LIVLSECGSLYRLELSVEGNVGATPLKEIIQFNDREIHAKGLSLYFSSTY-KLLFLSFQDGTTLVGRLS 2039 (2233)
Q Consensus      1972 ILVLSS~G~LY~Qels~s~d~g~~~ltEvvq~~~~q~~~~GVSVyYS~tl-~LLF~SY~~G~Sf~a~Ls 2039 (2233)
                      |++.|.+-+.=.=.+-.-. +-..+    +++-.+ -..|-+||-..++= .||+=|=.||+.++=..+
T Consensus       222 l~~As~dd~~PviqlWDlR-~assP----~k~~~~-H~~GilslsWc~~D~~lllSsgkD~~ii~wN~~  284 (1049)
T KOG0307|consen  222 LLVASGDDSAPVIQLWDLR-FASSP----LKILEG-HQRGILSLSWCPQDPRLLLSSGKDNRIICWNPN  284 (1049)
T ss_pred             eeeecCCCCCceeEeeccc-ccCCc----hhhhcc-cccceeeeccCCCCchhhhcccCCCCeeEecCC
Confidence            8888876554432222100 00011    111111 13456788888877 777777889998885543


No 91 
>KOG0772 consensus Uncharacterized conserved protein, contains WD40 repeat [Function unknown]
Probab=38.24  E-value=1.1e+03  Score=31.79  Aligned_cols=258  Identities=17%  Similarity=0.183  Sum_probs=0.0

Q ss_pred             EEEeecccCccceEEeecccceEEEEecCCCceeee---eeeeeccCCceEEEe-------------EEecCCCceEEEE
Q 047869         1866 IVHLAFNSIVENYLTVAGYEDCQVLTLNPRGEVTDR---LAIELALQGAYIRRV-------------DWVPGSPVQLMVV 1929 (2233)
Q Consensus      1866 VlsLafNP~nEdyLAVcGLkDC~VLTfss~GeV~DR---L~LeL~Leg~fIIKa-------------~WLPGSQt~LAVV 1929 (2233)
                      |-++.|.|.-+.+|+|-|         +++-.+.||   ..++..-.+-||+++             .|=|.+..++.-.
T Consensus       217 i~sl~ys~Tg~~iLvvsg---------~aqakl~DRdG~~~~e~~KGDQYI~Dm~nTKGHia~lt~g~whP~~k~~FlT~  287 (641)
T KOG0772|consen  217 INSLQYSVTGDQILVVSG---------SAQAKLLDRDGFEIVEFSKGDQYIRDMYNTKGHIAELTCGCWHPDNKEEFLTC  287 (641)
T ss_pred             cceeeecCCCCeEEEEec---------CcceeEEccCCceeeeeeccchhhhhhhccCCceeeeeccccccCcccceEEe


Q ss_pred             ecC-eEEEEeCcCCCCCCcEEEEcCCC--CeeEEEEEEecCCcEEEEEEecCCceEEEEecccCCCccccceeeeecccc
Q 047869         1930 TNK-FVKIYDLSQDNISPLHYFTLPDD--MIVDATLVIASRGKMFLIVLSECGSLYRLELSVEGNVGATPLKEIIQFNDR 2006 (2233)
Q Consensus      1930 T~~-FVKIYDLS~D~lSPvyyF~LpsG--kIrDaTfv~~e~G~~~ILVLSS~G~LY~Qels~s~d~g~~~ltEvvq~~~~ 2006 (2233)
                      ..+ .++|||+....--=.-.-..+.|  +|--.+--++.+|+. |-.---+|.|-+=+...-.-+..+...+     -.
T Consensus       288 s~DgtlRiWdv~~~k~q~qVik~k~~~g~Rv~~tsC~~nrdg~~-iAagc~DGSIQ~W~~~~~~v~p~~~vk~-----AH  361 (641)
T KOG0772|consen  288 SYDGTLRIWDVNNTKSQLQVIKTKPAGGKRVPVTSCAWNRDGKL-IAAGCLDGSIQIWDKGSRTVRPVMKVKD-----AH  361 (641)
T ss_pred             cCCCcEEEEecCCchhheeEEeeccCCCcccCceeeecCCCcch-hhhcccCCceeeeecCCcccccceEeee-----cc


Q ss_pred             cccCCeEEEEeccccceeeEEecCCcEEEEEcCCCcccccceeEEEEccCCCCCCCcccceeeccCC-CceEEEEe----
Q 047869         2007 EIHAKGLSLYFSSTYKLLFLSFQDGTTLVGRLSPNAASLSEVSYVFEEQDGKLRSAGLHRWKELLAS-SGLFFCFS---- 2081 (2233)
Q Consensus      2007 q~~~~GVSVyYS~tl~LLF~SY~~G~Sf~a~Ls~~~~sv~eis~Vfe~~~gk~~~a~L~qWsEV~~h-PGLf~cls---- 2081 (2233)
                      +-..+=-||.||.+-+.|.--=.+++-=+=-|....                   .+|.-|+-+++. ||.=||+|    
T Consensus       362 ~~g~~Itsi~FS~dg~~LlSRg~D~tLKvWDLrq~k-------------------kpL~~~tgL~t~~~~tdc~FSPd~k  422 (641)
T KOG0772|consen  362 LPGQDITSISFSYDGNYLLSRGFDDTLKVWDLRQFK-------------------KPLNVRTGLPTPFPGTDCCFSPDDK  422 (641)
T ss_pred             CCCCceeEEEeccccchhhhccCCCceeeeeccccc-------------------cchhhhcCCCccCCCCccccCCCce


Q ss_pred             ----------ccCCCceEEEEecCCceeeeccccccCCCCCeEEEEEeecCCCCCeEEEEEeeCCceeEEeccCCCCCcc
Q 047869         2082 ----------SLKSNAAVAVSLGTNELIAQNMRHAAGSTSPLVGVTAYKPLSKDKVHCLVLHDDGSLQIYSHVPHGVDAA 2151 (2233)
Q Consensus      2082 ----------~~~sn~pvvv~l~pd~I~iQeiK~~~~sSs~vdgva~y~p~s~~rttlLLLcEDGSLrIYsa~P~~~~a~ 2151 (2233)
                                .++.+ -|.+.....-=.+|.|-.   +++.|+-+.|--.+++    +++=+-||...||--.-...-.+
T Consensus       423 li~TGtS~~~~~~~g-~L~f~d~~t~d~v~ki~i---~~aSvv~~~WhpkLNQ----i~~gsgdG~~~vyYdp~~S~RGa  494 (641)
T KOG0772|consen  423 LILTGTSAPNGMTAG-TLFFFDRMTLDTVYKIDI---STASVVRCLWHPKLNQ----IFAGSGDGTAHVYYDPNESIRGA  494 (641)
T ss_pred             EEEecccccCCCCCc-eEEEEeccceeeEEEecC---CCceEEEEeecchhhh----eeeecCCCceEEEECccccccch


Q ss_pred             cchhhhhhhccccc
Q 047869         2152 TSVTAEKVKKLGSN 2165 (2233)
Q Consensus      2152 ~s~~~~k~kk~ga~ 2165 (2233)
                      ..--.++.||...+
T Consensus       495 k~cv~k~~rkk~~~  508 (641)
T KOG0772|consen  495 KLCVVKPPRKKHID  508 (641)
T ss_pred             hheeecCccccchh


No 92 
>COG2319 FOG: WD40 repeat [General function prediction only]
Probab=36.16  E-value=4.9e+02  Score=29.03  Aligned_cols=113  Identities=18%  Similarity=0.210  Sum_probs=69.3

Q ss_pred             eEEEEeec-ccCccceEEeecccceEEEEecCCCceeeeeeeeeccCCceEEEeEEecCCCceEEEEe--cCeEEEEeCc
Q 047869         1864 FEIVHLAF-NSIVENYLTVAGYEDCQVLTLNPRGEVTDRLAIELALQGAYIRRVDWVPGSPVQLMVVT--NKFVKIYDLS 1940 (2233)
Q Consensus      1864 FeVlsLaf-NP~nEdyLAVcGLkDC~VLTfss~GeV~DRL~LeL~Leg~fIIKa~WLPGSQt~LAVVT--~~FVKIYDLS 1940 (2233)
                      ..+..+.. +|....+++.++..+..+..++..+  ..............|..+.|-|..+. ++...  ...+++||+.
T Consensus       110 ~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~--~~~~~~~~~~~~~~v~~~~~~~~~~~-~~~~~~~~~~~~~~~~~  186 (466)
T COG2319         110 SSVSKLALSSPDGNSILLASSSLDGTVKLWDLST--PGKLIRTLEGHSESVTSLAFSPDGKL-LASGSSLDGTIKLWDLR  186 (466)
T ss_pred             CceeeEEEECCCcceEEeccCCCCccEEEEEecC--CCeEEEEEecCcccEEEEEECCCCCE-EEecCCCCCceEEEEcC
Confidence            34444444 5655557777777676666665554  01111122223556778999999984 44443  6799999999


Q ss_pred             CCCCCCcEEEEcCCCCeeEEEEEEecCCcEEEEEE-ecCCceEEE
Q 047869         1941 QDNISPLHYFTLPDDMIVDATLVIASRGKMFLIVL-SECGSLYRL 1984 (2233)
Q Consensus      1941 ~D~lSPvyyF~LpsGkIrDaTfv~~e~G~~~ILVL-SS~G~LY~Q 1984 (2233)
                      .  ..+...+....+.|...++.  ++|. .+++. +.+|.++.-
T Consensus       187 ~--~~~~~~~~~~~~~v~~~~~~--~~~~-~~~~~~~~d~~i~~w  226 (466)
T COG2319         187 T--GKPLSTLAGHTDPVSSLAFS--PDGG-LLIASGSSDGTIRLW  226 (466)
T ss_pred             C--CceEEeeccCCCceEEEEEc--CCcc-eEEEEecCCCcEEEE
Confidence            8  44455555566778888866  5565 33333 788888733


No 93 
>PRK02889 tolB translocation protein TolB; Provisional
Probab=36.02  E-value=1e+03  Score=30.23  Aligned_cols=175  Identities=18%  Similarity=0.282  Sum_probs=0.0

Q ss_pred             EEEeecccCccceEEeecccc--eEEEEecCCCceeeeeeeeeccCCceEEEeEEecCCCceEEEEecC--eEEEEeCcC
Q 047869         1866 IVHLAFNSIVENYLTVAGYED--CQVLTLNPRGEVTDRLAIELALQGAYIRRVDWVPGSPVQLMVVTNK--FVKIYDLSQ 1941 (2233)
Q Consensus      1866 VlsLafNP~nEdyLAVcGLkD--C~VLTfss~GeV~DRL~LeL~Leg~fIIKa~WLPGSQt~LAVVT~~--FVKIYDLS~ 1941 (2233)
                      +...+++| +++.||+..-++  .+|+.++.+|....++    .-.........|-|..+. |+.++++  ...||.+..
T Consensus       242 ~~~~~~SP-DG~~la~~~~~~g~~~Iy~~d~~~~~~~~l----t~~~~~~~~~~wSpDG~~-l~f~s~~~g~~~Iy~~~~  315 (427)
T PRK02889        242 NSAPAWSP-DGRTLAVALSRDGNSQIYTVNADGSGLRRL----TQSSGIDTEPFFSPDGRS-IYFTSDRGGAPQIYRMPA  315 (427)
T ss_pred             ccceEECC-CCCEEEEEEccCCCceEEEEECCCCCcEEC----CCCCCCCcCeEEcCCCCE-EEEEecCCCCcEEEEEEC


Q ss_pred             CCCCCcEEEEcCCCCeeEEEEEEecCCcEEEEEEecCC--ceEEEEecccCCCccccceeeeecccccccCCeEEEEecc
Q 047869         1942 DNISPLHYFTLPDDMIVDATLVIASRGKMFLIVLSECG--SLYRLELSVEGNVGATPLKEIIQFNDREIHAKGLSLYFSS 2019 (2233)
Q Consensus      1942 D~lSPvyyF~LpsGkIrDaTfv~~e~G~~~ILVLSS~G--~LY~Qels~s~d~g~~~ltEvvq~~~~q~~~~GVSVyYS~ 2019 (2233)
                      +.-.+.--. ...+......+  .++|+.+.++....|  +||..++....   ...+++         ....-+..+|+
T Consensus       316 ~~g~~~~lt-~~g~~~~~~~~--SpDG~~Ia~~s~~~g~~~I~v~d~~~g~---~~~lt~---------~~~~~~p~~sp  380 (427)
T PRK02889        316 SGGAAQRVT-FTGSYNTSPRI--SPDGKLLAYISRVGGAFKLYVQDLATGQ---VTALTD---------TTRDESPSFAP  380 (427)
T ss_pred             CCCceEEEe-cCCCCcCceEE--CCCCCEEEEEEccCCcEEEEEEECCCCC---eEEccC---------CCCccCceECC


Q ss_pred             ccceeeEEe-cCCcEEEEEcCCCcccccceeEEEEccCCCCCCCccccee
Q 047869         2020 TYKLLFLSF-QDGTTLVGRLSPNAASLSEVSYVFEEQDGKLRSAGLHRWK 2068 (2233)
Q Consensus      2020 tl~LLF~SY-~~G~Sf~a~Ls~~~~sv~eis~Vfe~~~gk~~~a~L~qWs 2068 (2233)
                      +-++|+++- ..|++.+..++.+.    ..........|....+   .|+
T Consensus       381 dg~~l~~~~~~~g~~~l~~~~~~g----~~~~~l~~~~g~~~~p---~ws  423 (427)
T PRK02889        381 NGRYILYATQQGGRSVLAAVSSDG----RIKQRLSVQGGDVREP---SWG  423 (427)
T ss_pred             CCCEEEEEEecCCCEEEEEEECCC----CceEEeecCCCCCCCC---ccC


No 94 
>KOG1240 consensus Protein kinase containing WD40 repeats [Signal transduction mechanisms]
Probab=35.18  E-value=7e+02  Score=36.48  Aligned_cols=148  Identities=15%  Similarity=0.171  Sum_probs=98.9

Q ss_pred             eeCCeEEEEechhhhcccccCCccccccccccccccccceEEEEeecccCccceEEeecccc--eEEEEecCCCceeeee
Q 047869         1825 GEGDKVAIFDVGQLIGQATIQPVTADKTNVKPLSRNIVRFEIVHLAFNSIVENYLTVAGYED--CQVLTLNPRGEVTDRL 1902 (2233)
Q Consensus      1825 aEgdKVTILqlsaLLkQad~s~~skdKlTLtrLSsa~VpFeVlsLafNP~nEdyLAVcGLkD--C~VLTfss~GeV~DRL 1902 (2233)
                      ...|-|-|+|+.++.+...   +....+|.-|   ..-+++.+.+.+   |.|.+||. -+|  +++++++..  ...+.
T Consensus      1068 S~DGtVKvW~~~k~~~~~~---s~rS~ltys~---~~sr~~~vt~~~---~~~~~Av~-t~DG~v~~~~id~~--~~~~~ 1135 (1431)
T KOG1240|consen 1068 SDDGTVKVWNLRKLEGEGG---SARSELTYSP---EGSRVEKVTMCG---NGDQFAVS-TKDGSVRVLRIDHY--NVSKR 1135 (1431)
T ss_pred             cCCceEEEeeehhhhcCcc---eeeeeEEEec---cCCceEEEEecc---CCCeEEEE-cCCCeEEEEEcccc--ccccc
Confidence            4788999999999998854   2345555554   344556666555   78899988 444  566777653  12222


Q ss_pred             ----eeeecc--CCceEEEeEEecCCCc-eEEEEecC-eEEEEeCcCCCCCCcEEEEcCCCCeeEEEEEEecCCcEEEEE
Q 047869         1903 ----AIELAL--QGAYIRRVDWVPGSPV-QLMVVTNK-FVKIYDLSQDNISPLHYFTLPDDMIVDATLVIASRGKMFLIV 1974 (2233)
Q Consensus      1903 ----~LeL~L--eg~fIIKa~WLPGSQt-~LAVVT~~-FVKIYDLS~D~lSPvyyF~LpsGkIrDaTfv~~e~G~~~ILV 1974 (2233)
                          ...+.+  +|.++.---..-..|+ .|+.+|.. .|-+||.-.+.-.=.-.+-+-.|.|...+  .++.| .-+++
T Consensus      1136 ~~~~~ri~n~~~~g~vv~m~a~~~~~~S~~lvy~T~~~~iv~~D~r~~~~~w~lk~~~~hG~vTSi~--idp~~-~Wlvi 1212 (1431)
T KOG1240|consen 1136 VATQVRIPNLKKDGVVVSMHAFTAIVQSHVLVYATDLSRIVSWDTRMRHDAWRLKNQLRHGLVTSIV--IDPWC-NWLVI 1212 (1431)
T ss_pred             eeeeeecccccCCCceEEeecccccccceeEEEEEeccceEEecchhhhhHHhhhcCccccceeEEE--ecCCc-eEEEE
Confidence                223333  4776666666677777 66666655 88889887766655556666678877655  35655 58889


Q ss_pred             EecCCceEEEEec
Q 047869         1975 LSECGSLYRLELS 1987 (2233)
Q Consensus      1975 LSS~G~LY~Qels 1987 (2233)
                      =|+.|.+-.=+++
T Consensus      1213 Gts~G~l~lWDLR 1225 (1431)
T KOG1240|consen 1213 GTSRGQLVLWDLR 1225 (1431)
T ss_pred             ecCCceEEEEEee
Confidence            9999998887776


No 95 
>PF11768 DUF3312:  Protein of unknown function (DUF3312);  InterPro: IPR024511 This is a eukaryotic family of uncharacterised proteins that contain WD40 repeats.
Probab=35.07  E-value=1.3e+03  Score=31.29  Aligned_cols=63  Identities=17%  Similarity=0.274  Sum_probs=38.8

Q ss_pred             ccCCCceEEEEe---cCCceeeeccccccCCCCCeEEEEEee--------------cC---------CCCCeEEEEEeeC
Q 047869         2082 SLKSNAAVAVSL---GTNELIAQNMRHAAGSTSPLVGVTAYK--------------PL---------SKDKVHCLVLHDD 2135 (2233)
Q Consensus      2082 ~~~sn~pvvv~l---~pd~I~iQeiK~~~~sSs~vdgva~y~--------------p~---------s~~rttlLLLcED 2135 (2233)
                      .++++.|+-+.+   .|.+++.=|.+.... ...++...+|.              |+         +.+..++++=|+|
T Consensus       202 irTE~dPl~~~Fs~~~~~qi~tVE~s~s~~-g~~~~d~ciYE~~r~klqrvsvtsipL~s~v~~ca~sp~E~kLvlGC~D  280 (545)
T PF11768_consen  202 IRTENDPLDVEFSLNQPYQIHTVEQSISVK-GEPSADSCIYECSRNKLQRVSVTSIPLPSQVICCARSPSEDKLVLGCED  280 (545)
T ss_pred             EEecCCcEEEEccCCCCcEEEEEEEecCCC-CCceeEEEEEEeecCceeEEEEEEEecCCcceEEecCcccceEEEEecC
Confidence            557888888887   567776666554211 11222222211              11         2356678999999


Q ss_pred             CceeEEeccC
Q 047869         2136 GSLQIYSHVP 2145 (2233)
Q Consensus      2136 GSLrIYsa~P 2145 (2233)
                      ||+.+|..+.
T Consensus       281 gSiiLyD~~~  290 (545)
T PF11768_consen  281 GSIILYDTTR  290 (545)
T ss_pred             CeEEEEEcCC
Confidence            9999999764


No 96 
>KOG0293 consensus WD40 repeat-containing protein [Function unknown]
Probab=35.05  E-value=1.2e+03  Score=30.83  Aligned_cols=113  Identities=15%  Similarity=0.216  Sum_probs=77.0

Q ss_pred             EEEEeecccCccceEEeecccceEEEEe--cCCCceeeeeeeeeccCC--ceEEEeEEecCCCceEEEEecCeEEEEeCc
Q 047869         1865 EIVHLAFNSIVENYLTVAGYEDCQVLTL--NPRGEVTDRLAIELALQG--AYIRRVDWVPGSPVQLMVVTNKFVKIYDLS 1940 (2233)
Q Consensus      1865 eVlsLafNP~nEdyLAVcGLkDC~VLTf--ss~GeV~DRL~LeL~Leg--~fIIKa~WLPGSQt~LAVVT~~FVKIYDLS 1940 (2233)
                      ||=-+.|+| |+.|||- +-+||.+..+  .+.+.    +.+.-.+.|  .-+.=+.|=|.+...+|---.+-++.||..
T Consensus       226 EVWfl~FS~-nGkyLAs-aSkD~Taiiw~v~~d~~----~kl~~tlvgh~~~V~yi~wSPDdryLlaCg~~e~~~lwDv~  299 (519)
T KOG0293|consen  226 EVWFLQFSH-NGKYLAS-ASKDSTAIIWIVVYDVH----FKLKKTLVGHSQPVSYIMWSPDDRYLLACGFDEVLSLWDVD  299 (519)
T ss_pred             cEEEEEEcC-CCeeEee-ccCCceEEEEEEecCcc----eeeeeeeecccCceEEEEECCCCCeEEecCchHheeeccCC
Confidence            567788987 9999996 5678877666  22222    344444553  347778899999999998888899999987


Q ss_pred             CCCCCCcEEEEcCCCCeeEEEEEEecCCcEEEEEEecCCceEEEEec
Q 047869         1941 QDNISPLHYFTLPDDMIVDATLVIASRGKMFLIVLSECGSLYRLELS 1987 (2233)
Q Consensus      1941 ~D~lSPvyyF~LpsGkIrDaTfv~~e~G~~~ILVLSS~G~LY~Qels 1987 (2233)
                      .-..--+|  .--.|.-..++.|+ ++|. ++++-|.++.+|+=++.
T Consensus       300 tgd~~~~y--~~~~~~S~~sc~W~-pDg~-~~V~Gs~dr~i~~wdlD  342 (519)
T KOG0293|consen  300 TGDLRHLY--PSGLGFSVSSCAWC-PDGF-RFVTGSPDRTIIMWDLD  342 (519)
T ss_pred             cchhhhhc--ccCcCCCcceeEEc-cCCc-eeEecCCCCcEEEecCC
Confidence            65533222  21123444555564 7784 58888899999976665


No 97 
>KOG2041 consensus WD40 repeat protein [General function prediction only]
Probab=33.70  E-value=1.7e+02  Score=39.84  Aligned_cols=98  Identities=23%  Similarity=0.414  Sum_probs=68.7

Q ss_pred             cCceEEEe-eCCeEEEEechhhhcccccCCccccccccccccccccceEEEEeecccCccceEEeeccc-------ce-E
Q 047869         1818 SRGRLAVG-EGDKVAIFDVGQLIGQATIQPVTADKTNVKPLSRNIVRFEIVHLAFNSIVENYLTVAGYE-------DC-Q 1888 (2233)
Q Consensus      1818 ~rGrLAVa-EgdKVTILqlsaLLkQad~s~~skdKlTLtrLSsa~VpFeVlsLafNP~nEdyLAVcGLk-------DC-~ 1888 (2233)
                      .|-++||+ ++|.+-|..-     ..|-.|+           --..||.|+..+.|| |+.+|||||..       |. +
T Consensus       228 drP~lavcy~nGr~QiMR~-----eND~~Pv-----------v~dtgm~~vgakWnh-~G~vLAvcG~~~da~~~~d~n~  290 (1189)
T KOG2041|consen  228 DRPRLAVCYANGRMQIMRS-----ENDPEPV-----------VVDTGMKIVGAKWNH-NGAVLAVCGNDSDADEPTDSNK  290 (1189)
T ss_pred             CCCEEEEEEcCceehhhhh-----cCCCCCe-----------EEecccEeecceecC-CCcEEEEccCcccccCccccce
Confidence            46678998 8888833221     2232222           223579999999999 99999999974       34 5


Q ss_pred             EEEecCCCceeeeeeeeeccCCceEEEeEEecCCCceEEEEecCeEEEE
Q 047869         1889 VLTLNPRGEVTDRLAIELALQGAYIRRVDWVPGSPVQLMVVTNKFVKIY 1937 (2233)
Q Consensus      1889 VLTfss~GeV~DRL~LeL~Leg~fIIKa~WLPGSQt~LAVVT~~FVKIY 1937 (2233)
                      |.-.++=|.+..    .+..+|.-|.-.-|=-++ -.+|++...||=+=
T Consensus       291 v~Fysp~G~i~g----tlkvpg~~It~lsWEg~g-LriA~Avdsfiyfa  334 (1189)
T KOG2041|consen  291 VHFYSPYGHIVG----TLKVPGSCITGLSWEGTG-LRIAIAVDSFIYFA  334 (1189)
T ss_pred             EEEeccchhheE----EEecCCceeeeeEEcCCc-eEEEEEecceEEEE
Confidence            555667766554    445578889999997765 46899998888664


No 98 
>PF12657 TFIIIC_delta:  Transcription factor IIIC subunit delta N-term;  InterPro: IPR024761  This entry represents a domain found towards the N terminus of the 90 kDa subunit of transcription factor IIIC (also known as subunit 9 in yeast []). The whole subunit is involved in RNA polymerase III-mediated transcription. It is possible that this N-terminal domain interacts with TFIIIC subunit 8 [].
Probab=33.70  E-value=1.7e+02  Score=32.83  Aligned_cols=108  Identities=22%  Similarity=0.368  Sum_probs=57.4

Q ss_pred             ccceecccCceEEEeeCCeEEEE---echhhhcccccCCccccccccccccccccceEEEEeecccCccceEEeecccce
Q 047869         1811 KSLLSVSSRGRLAVGEGDKVAIF---DVGQLIGQATIQPVTADKTNVKPLSRNIVRFEIVHLAFNSIVENYLTVAGYEDC 1887 (2233)
Q Consensus      1811 RqLLSas~rGrLAVaEgdKVTIL---qlsaLLkQad~s~~skdKlTLtrLSsa~VpFeVlsLafNP~nEdyLAVcGLkDC 1887 (2233)
                      .+.|+-+..|+|||+-++.|.||   ....+-+...      ..-...+.  ..+++++..+..|-     .+.+++.  
T Consensus         7 ~~~l~WS~Dg~laV~t~~~v~IL~~~~P~~~~~~~~------~~~~~~~~--~~~~~~~~~~~~~~-----~~~~~~p--   71 (173)
T PF12657_consen    7 PNALAWSEDGQLAVATGESVHILDPQTPNSLSKSFI------PRPLTLPP--SSIQWPITSIRRNL-----FTSSEWP--   71 (173)
T ss_pred             CcCeeECCCCCEEEEcCCeEEEEeccCCcccccccc------cCCccccc--ccCCCccceEecCc-----cccccCc--
Confidence            46788899999999999999999   3320111100      00001111  22233333333321     1223332  


Q ss_pred             EEEEecCCCceeeeeeeeeccCCceEEEeEEec-----CCCceEEEEecC-eEEEEeCcCC
Q 047869         1888 QVLTLNPRGEVTDRLAIELALQGAYIRRVDWVP-----GSPVQLMVVTNK-FVKIYDLSQD 1942 (2233)
Q Consensus      1888 ~VLTfss~GeV~DRL~LeL~Leg~fIIKa~WLP-----GSQt~LAVVT~~-FVKIYDLS~D 1942 (2233)
                         +.++. ...+     .......|+.+.|=|     .....|||.|++ -|+||--..+
T Consensus        72 ---~~~~~-~~~~-----~~~s~~~vv~~aWSP~Gl~~~~rClLavLTs~~~l~l~~~~~~  123 (173)
T PF12657_consen   72 ---TESPR-SMDD-----EEISSSQVVSAAWSPSGLGPNGRCLLAVLTSNGRLSLYGPPGN  123 (173)
T ss_pred             ---eeccc-cccc-----cccccccEEEEEECCCCCCCCCceEEEEEcCCCeEEEEecCCC
Confidence               11111 1111     111134899999988     346788887777 8999987654


No 99 
>PRK04922 tolB translocation protein TolB; Provisional
Probab=33.39  E-value=1.1e+03  Score=29.87  Aligned_cols=112  Identities=16%  Similarity=0.241  Sum_probs=61.1

Q ss_pred             EEEEeecccCccceEEeecccc--eEEEEecCCCceeeeeeeeeccCCceEEEeEEecCCCceEEEEe-cC---eEEEEe
Q 047869         1865 EIVHLAFNSIVENYLTVAGYED--CQVLTLNPRGEVTDRLAIELALQGAYIRRVDWVPGSPVQLMVVT-NK---FVKIYD 1938 (2233)
Q Consensus      1865 eVlsLafNP~nEdyLAVcGLkD--C~VLTfss~GeV~DRL~LeL~Leg~fIIKa~WLPGSQt~LAVVT-~~---FVKIYD 1938 (2233)
                      .+.+.+++| ++++||.+.+.+  .+|+.++-++.-..++.   ..+| ......|-|..+. |+++. .+   -|.+||
T Consensus       205 ~v~~p~wSp-Dg~~la~~s~~~~~~~l~~~dl~~g~~~~l~---~~~g-~~~~~~~SpDG~~-l~~~~s~~g~~~Iy~~d  278 (433)
T PRK04922        205 PILSPAWSP-DGKKLAYVSFERGRSAIYVQDLATGQRELVA---SFRG-INGAPSFSPDGRR-LALTLSRDGNPEIYVMD  278 (433)
T ss_pred             ccccccCCC-CCCEEEEEecCCCCcEEEEEECCCCCEEEec---cCCC-CccCceECCCCCE-EEEEEeCCCCceEEEEE
Confidence            467888988 677888876543  45555554432222221   1122 2346789997655 55443 22   478888


Q ss_pred             CcCCCCCCcEEEEcCCCCeeEEEEEEecCCcEEEEEEecCCc--eEEEEec
Q 047869         1939 LSQDNISPLHYFTLPDDMIVDATLVIASRGKMFLIVLSECGS--LYRLELS 1987 (2233)
Q Consensus      1939 LS~D~lSPvyyF~LpsGkIrDaTfv~~e~G~~~ILVLSS~G~--LY~Qels 1987 (2233)
                      +......+...   -.+...+.  -+.++|+.++++....|.  ||..++.
T Consensus       279 ~~~g~~~~lt~---~~~~~~~~--~~spDG~~l~f~sd~~g~~~iy~~dl~  324 (433)
T PRK04922        279 LGSRQLTRLTN---HFGIDTEP--TWAPDGKSIYFTSDRGGRPQIYRVAAS  324 (433)
T ss_pred             CCCCCeEECcc---CCCCccce--EECCCCCEEEEEECCCCCceEEEEECC
Confidence            87665432211   11222333  335778766666545554  7876654


No 100
>KOG3339 consensus Predicted glycosyltransferase [General function prediction only]
Probab=32.96  E-value=55  Score=38.38  Aligned_cols=108  Identities=19%  Similarity=0.157  Sum_probs=78.7

Q ss_pred             HHHHHHHhhhhhccCCcc-----cchHHHHHHHHhhccccccccccccccccccccc----cCchhHHHHHHHHHHHHHh
Q 047869          289 SLRIMKLLGSLVKDMPYV-----KYDALILHAIASFADVLPSLFQPCFEFANNHCAA----EGSFESIILLLLEEFLHIV  359 (2233)
Q Consensus       289 ~~r~lkl~~~l~~~~~~~-----~~d~~~l~~va~~~d~lp~lf~~~f~f~~~h~~~----~~~~~~~~l~l~e~fL~~~  359 (2233)
                      +--||+||+.|.+-+...     ..|.|=.+-+++|-+.++..=-.++++ -.--.|    -.++.+.+-+++-+| -++
T Consensus        51 T~EMlrLl~~l~~~y~~r~yI~a~tD~mS~~k~~~F~~~~a~~~a~~~~i-pRsReVgQS~ltSv~Tti~all~s~-~lv  128 (211)
T KOG3339|consen   51 TGEMLRLLEALQDLYSPRSYIAADTDEMSEQKARSFELSLAHCKAKNYEI-PRSREVGQSWLTSVFTTIWALLQSF-VLV  128 (211)
T ss_pred             HHHHHHHHHHHHhhcCceEEEEecCchhhHHHHHhhhccccccchhheec-chhhhhhhhhhhhHHHHHHHHHHHh-eEE
Confidence            356778888876555443     459999999999999999988877776 222223    356777888888888 566


Q ss_pred             hhhccCccccch----hHHHHHHHhhhccCCCcce--ecCCCcCC
Q 047869          360 QVIFCSGNFFQN----IRACIMASILDNLDPSIWR--YDNSSANL  398 (2233)
Q Consensus       360 ~~if~~~~v~qn----v~~ci~as~l~~l~~~~wr--~~~~~~~~  398 (2233)
                      -.|+|+-..|.-    |..|.+|-+.++|+...|+  |..|-|-.
T Consensus       129 ~RirPdlil~NGPGTCv~i~~~a~l~~iL~~~~~~IvyvES~cRV  173 (211)
T KOG3339|consen  129 WRIRPDLILCNGPGTCVPICLSAYLMEILGLKSSHIVYVESICRV  173 (211)
T ss_pred             EecCCCEEEECCCCcEeHHHHHHHHHHHhCcCceEEEEEeeeeEe
Confidence            667887766654    7889999999999987776  44454433


No 101
>KOG1332 consensus Vesicle coat complex COPII, subunit SEC13 [Intracellular trafficking, secretion, and vesicular transport]
Probab=32.82  E-value=3.9e+02  Score=33.04  Aligned_cols=162  Identities=18%  Similarity=0.167  Sum_probs=102.8

Q ss_pred             EEEeecccCccceEEeecccc--eEEEEecCC-CceeeeeeeeeccCCceEEEeEEecCCCc-------------eEEEE
Q 047869         1866 IVHLAFNSIVENYLTVAGYED--CQVLTLNPR-GEVTDRLAIELALQGAYIRRVDWVPGSPV-------------QLMVV 1929 (2233)
Q Consensus      1866 VlsLafNP~nEdyLAVcGLkD--C~VLTfss~-GeV~DRL~LeL~Leg~fIIKa~WLPGSQt-------------~LAVV 1929 (2233)
                      |-+|++-|-.=-++..||--|  +-|||++.+ |-.+.++.--...   =+.-+-|-|.+.-             .||-.
T Consensus       105 VNsV~wapheygl~LacasSDG~vsvl~~~~~g~w~t~ki~~aH~~---GvnsVswapa~~~g~~~~~~~~~~~krlvSg  181 (299)
T KOG1332|consen  105 VNSVAWAPHEYGLLLACASSDGKVSVLTYDSSGGWTTSKIVFAHEI---GVNSVSWAPASAPGSLVDQGPAAKVKRLVSG  181 (299)
T ss_pred             ceeecccccccceEEEEeeCCCcEEEEEEcCCCCccchhhhhcccc---ccceeeecCcCCCccccccCcccccceeecc
Confidence            447778787777788888765  678999888 4444444321111   1556678887533             24444


Q ss_pred             ecC-eEEEEeCcCCCCCCcEEEEcCCCCeeEEEEEEec-CCcEEEEEEecCCceEEEEecccCCCccccceeeeeccccc
Q 047869         1930 TNK-FVKIYDLSQDNISPLHYFTLPDDMIVDATLVIAS-RGKMFLIVLSECGSLYRLELSVEGNVGATPLKEIIQFNDRE 2007 (2233)
Q Consensus      1930 T~~-FVKIYDLS~D~lSPvyyF~LpsGkIrDaTfv~~e-~G~~~ILVLSS~G~LY~Qels~s~d~g~~~ltEvvq~~~~q 2007 (2233)
                      -.+ .||||+...|.----.++-=-.|-+||++--..- ..+.+|.--|.+|.+.+-..+...+.-...+.+.       
T Consensus       182 GcDn~VkiW~~~~~~w~~e~~l~~H~dwVRDVAwaP~~gl~~s~iAS~SqDg~viIwt~~~e~e~wk~tll~~-------  254 (299)
T KOG1332|consen  182 GCDNLVKIWKFDSDSWKLERTLEGHKDWVRDVAWAPSVGLPKSTIASCSQDGTVIIWTKDEEYEPWKKTLLEE-------  254 (299)
T ss_pred             CCccceeeeecCCcchhhhhhhhhcchhhhhhhhccccCCCceeeEEecCCCcEEEEEecCccCccccccccc-------
Confidence            444 8999999998433333333346789999965442 4677888999999987654443332222222221       


Q ss_pred             ccCCeEEEEeccccceeeEEecCCcEEEEE
Q 047869         2008 IHAKGLSLYFSSTYKLLFLSFQDGTTLVGR 2037 (2233)
Q Consensus      2008 ~~~~GVSVyYS~tl~LLF~SY~~G~Sf~a~ 2037 (2233)
                      ...---++-+|.+-++|=+|+-+++..+-+
T Consensus       255 f~~~~w~vSWS~sGn~LaVs~GdNkvtlwk  284 (299)
T KOG1332|consen  255 FPDVVWRVSWSLSGNILAVSGGDNKVTLWK  284 (299)
T ss_pred             CCcceEEEEEeccccEEEEecCCcEEEEEE
Confidence            223366889999999999999887665543


No 102
>KOG0268 consensus Sof1-like rRNA processing protein (contains WD40 repeats) [RNA processing and modification]
Probab=29.48  E-value=3.3e+02  Score=35.03  Aligned_cols=160  Identities=17%  Similarity=0.217  Sum_probs=90.8

Q ss_pred             ccccccccceE-EEEeecccCccceEEeecccceEEEEecCCCceeeeeeeeeccCCceEEEeEEecCCCceEEEEecCe
Q 047869         1855 KPLSRNIVRFE-IVHLAFNSIVENYLTVAGYEDCQVLTLNPRGEVTDRLAIELALQGAYIRRVDWVPGSPVQLMVVTNKF 1933 (2233)
Q Consensus      1855 trLSsa~VpFe-VlsLafNP~nEdyLAVcGLkDC~VLTfss~GeV~DRL~LeL~Leg~fIIKa~WLPGSQt~LAVVT~~F 1933 (2233)
                      +|+++-.-+++ |.++.|||...+.||+||=..=.||-=-..+...-++.+....++.     -|=|  ....-++.++=
T Consensus       178 ~Pv~smswG~Dti~svkfNpvETsILas~~sDrsIvLyD~R~~~Pl~KVi~~mRTN~I-----swnP--eafnF~~a~ED  250 (433)
T KOG0268|consen  178 NPVSSMSWGADSISSVKFNPVETSILASCASDRSIVLYDLRQASPLKKVILTMRTNTI-----CWNP--EAFNFVAANED  250 (433)
T ss_pred             CccceeecCCCceeEEecCCCcchheeeeccCCceEEEecccCCccceeeeeccccce-----ecCc--cccceeecccc
Confidence            68888888887 7899999999999999987666665555666666666666666554     4999  33444556654


Q ss_pred             EEEEeCcCCCCC-CcEEEEcCCCCeeEEEEEEecCCcEEEEEEecCCceEEEEecccCCCccccceeeeecccccccCCe
Q 047869         1934 VKIYDLSQDNIS-PLHYFTLPDDMIVDATLVIASRGKMFLIVLSECGSLYRLELSVEGNVGATPLKEIIQFNDREIHAKG 2012 (2233)
Q Consensus      1934 VKIYDLS~D~lS-PvyyF~LpsGkIrDaTfv~~e~G~~~ILVLSS~G~LY~Qels~s~d~g~~~ltEvvq~~~~q~~~~G 2012 (2233)
                      -.+|-.---++. |..-|.=--..+.|+.|-  +.|+-+ +--|=+-.|=+.... ++---.++.|-        ....-
T Consensus       251 ~nlY~~DmR~l~~p~~v~~dhvsAV~dVdfs--ptG~Ef-vsgsyDksIRIf~~~-~~~SRdiYhtk--------RMq~V  318 (433)
T KOG0268|consen  251 HNLYTYDMRNLSRPLNVHKDHVSAVMDVDFS--PTGQEF-VSGSYDKSIRIFPVN-HGHSRDIYHTK--------RMQHV  318 (433)
T ss_pred             ccceehhhhhhcccchhhcccceeEEEeccC--CCcchh-ccccccceEEEeecC-CCcchhhhhHh--------hhhee
Confidence            455543333332 333222222244555432  444322 222222222222222 11111123332        23447


Q ss_pred             EEEEeccccceeeEEecCCcE
Q 047869         2013 LSLYFSSTYKLLFLSFQDGTT 2033 (2233)
Q Consensus      2013 VSVyYS~tl~LLF~SY~~G~S 2033 (2233)
                      ++|.||++.+.+|-.-++|-.
T Consensus       319 ~~Vk~S~Dskyi~SGSdd~nv  339 (433)
T KOG0268|consen  319 FCVKYSMDSKYIISGSDDGNV  339 (433)
T ss_pred             eEEEEeccccEEEecCCCcce
Confidence            788999999988866666543


No 103
>PRK03629 tolB translocation protein TolB; Provisional
Probab=29.43  E-value=1.3e+03  Score=29.45  Aligned_cols=155  Identities=12%  Similarity=0.203  Sum_probs=75.6

Q ss_pred             EEEEeecccCccceEEeecc--cceEEEEecCC-CceeeeeeeeeccCCceEEEeEEecCCCceEEEEec----CeEEEE
Q 047869         1865 EIVHLAFNSIVENYLTVAGY--EDCQVLTLNPR-GEVTDRLAIELALQGAYIRRVDWVPGSPVQLMVVTN----KFVKIY 1937 (2233)
Q Consensus      1865 eVlsLafNP~nEdyLAVcGL--kDC~VLTfss~-GeV~DRL~LeL~Leg~fIIKa~WLPGSQt~LAVVT~----~FVKIY 1937 (2233)
                      .+.+.+++| +++.||....  .+-+|+..+-. |+. .++.   ...+ ......|-|..+. ||.+..    ..|.+|
T Consensus       200 ~~~~p~wSP-DG~~la~~s~~~g~~~i~i~dl~~G~~-~~l~---~~~~-~~~~~~~SPDG~~-La~~~~~~g~~~I~~~  272 (429)
T PRK03629        200 PLMSPAWSP-DGSKLAYVTFESGRSALVIQTLANGAV-RQVA---SFPR-HNGAPAFSPDGSK-LAFALSKTGSLNLYVM  272 (429)
T ss_pred             ceeeeEEcC-CCCEEEEEEecCCCcEEEEEECCCCCe-EEcc---CCCC-CcCCeEECCCCCE-EEEEEcCCCCcEEEEE
Confidence            477899998 6788887633  12344444333 332 2221   1122 2335689998765 554432    247778


Q ss_pred             eCcCCCCCCcEEEEcCCCCeeEEEEEEecCCcEEEEEEecCC--ceEEEEecccCCCccccceeeeecccccccCCeEEE
Q 047869         1938 DLSQDNISPLHYFTLPDDMIVDATLVIASRGKMFLIVLSECG--SLYRLELSVEGNVGATPLKEIIQFNDREIHAKGLSL 2015 (2233)
Q Consensus      1938 DLS~D~lSPvyyF~LpsGkIrDaTfv~~e~G~~~ILVLSS~G--~LY~Qels~s~d~g~~~ltEvvq~~~~q~~~~GVSV 2015 (2233)
                      |+......     .+..+.-.+......++|+.++++....|  .||..++.. +.  ...++    ...    +.-.+.
T Consensus       273 d~~tg~~~-----~lt~~~~~~~~~~wSPDG~~I~f~s~~~g~~~Iy~~d~~~-g~--~~~lt----~~~----~~~~~~  336 (429)
T PRK03629        273 DLASGQIR-----QVTDGRSNNTEPTWFPDSQNLAYTSDQAGRPQVYKVNING-GA--PQRIT----WEG----SQNQDA  336 (429)
T ss_pred             ECCCCCEE-----EccCCCCCcCceEECCCCCEEEEEeCCCCCceEEEEECCC-CC--eEEee----cCC----CCccCE
Confidence            88655432     22322222233344577876555554444  577665531 11  11111    111    111234


Q ss_pred             Eecccccee-eEEecCCcEEEEEcCCCc
Q 047869         2016 YFSSTYKLL-FLSFQDGTTLVGRLSPNA 2042 (2233)
Q Consensus      2016 yYS~tl~LL-F~SY~~G~Sf~a~Ls~~~ 2042 (2233)
                      .+|++=+.| |.+..+|...+..++...
T Consensus       337 ~~SpDG~~Ia~~~~~~g~~~I~~~dl~~  364 (429)
T PRK03629        337 DVSSDGKFMVMVSSNGGQQHIAKQDLAT  364 (429)
T ss_pred             EECCCCCEEEEEEccCCCceEEEEECCC
Confidence            567765554 445556655555444443


No 104
>KOG2096 consensus WD40 repeat protein [General function prediction only]
Probab=28.73  E-value=3.4e+02  Score=34.51  Aligned_cols=147  Identities=15%  Similarity=0.233  Sum_probs=0.0

Q ss_pred             eecccCceEEEe--eCCeEEEEec-hhhhcccccCCccccccccccccccccceEEEEeecccCccceEEeeccc-ceEE
Q 047869         1814 LSVSSRGRLAVG--EGDKVAIFDV-GQLIGQATIQPVTADKTNVKPLSRNIVRFEIVHLAFNSIVENYLTVAGYE-DCQV 1889 (2233)
Q Consensus      1814 LSas~rGrLAVa--EgdKVTILql-saLLkQad~s~~skdKlTLtrLSsa~VpFeVlsLafNP~nEdyLAVcGLk-DC~V 1889 (2233)
                      +-....+.+.|.  ++-||+++++ +++|+.-+.+-.+-.-..+.|                  +++|+||||+- |+.|
T Consensus       193 iGiA~~~k~imsas~dt~i~lw~lkGq~L~~idtnq~~n~~aavSP------------------~GRFia~~gFTpDVkV  254 (420)
T KOG2096|consen  193 IGIAGNAKYIMSASLDTKICLWDLKGQLLQSIDTNQSSNYDAAVSP------------------DGRFIAVSGFTPDVKV  254 (420)
T ss_pred             EeecCCceEEEEecCCCcEEEEecCCceeeeeccccccccceeeCC------------------CCcEEEEecCCCCceE


Q ss_pred             EEe--cCCCceeeeeee-eeccCCceEEEeEEecCCCceEEEEecCeEEEEeCcCCCCCCcEEEEcCCCCeeEEEEEEec
Q 047869         1890 LTL--NPRGEVTDRLAI-ELALQGAYIRRVDWVPGSPVQLMVVTNKFVKIYDLSQDNISPLHYFTLPDDMIVDATLVIAS 1966 (2233)
Q Consensus      1890 LTf--ss~GeV~DRL~L-eL~Leg~fIIKa~WLPGSQt~LAVVT~~FVKIYDLS~D~lSPvyyF~LpsGkIrDaTfv~~e 1966 (2233)
                      ...  ..+|++-+-..+ +|.=-..=+.-+-.=|.|...+.|--...+||||...-.---+--+.|-.|.    .+..+.
T Consensus       255 wE~~f~kdG~fqev~rvf~LkGH~saV~~~aFsn~S~r~vtvSkDG~wriwdtdVrY~~~qDpk~Lk~g~----~pl~aa  330 (420)
T KOG2096|consen  255 WEPIFTKDGTFQEVKRVFSLKGHQSAVLAAAFSNSSTRAVTVSKDGKWRIWDTDVRYEAGQDPKILKEGS----APLHAA  330 (420)
T ss_pred             EEEEeccCcchhhhhhhheeccchhheeeeeeCCCcceeEEEecCCcEEEeeccceEecCCCchHhhcCC----cchhhc


Q ss_pred             CCcEEEEEEecCCceE
Q 047869         1967 RGKMFLIVLSECGSLY 1982 (2233)
Q Consensus      1967 ~G~~~ILVLSS~G~LY 1982 (2233)
                      .+.-.=+-||-+|.++
T Consensus       331 g~~p~RL~lsP~g~~l  346 (420)
T KOG2096|consen  331 GSEPVRLELSPSGDSL  346 (420)
T ss_pred             CCCceEEEeCCCCcEE


No 105
>KOG1240 consensus Protein kinase containing WD40 repeats [Signal transduction mechanisms]
Probab=28.34  E-value=3.8e+02  Score=38.86  Aligned_cols=135  Identities=12%  Similarity=0.125  Sum_probs=87.5

Q ss_pred             ecCCCceeeeeeeeeccCCceEEEeEEecCCCceEEEEecC-eEEEEeCcCCCCC-----CcEEEEcCCCCeeEEEEEEe
Q 047869         1892 LNPRGEVTDRLAIELALQGAYIRRVDWVPGSPVQLMVVTNK-FVKIYDLSQDNIS-----PLHYFTLPDDMIVDATLVIA 1965 (2233)
Q Consensus      1892 fss~GeV~DRL~LeL~Leg~fIIKa~WLPGSQt~LAVVT~~-FVKIYDLS~D~lS-----PvyyF~LpsGkIrDaTfv~~ 1965 (2233)
                      ++++|..+-||+-|=    .-++|..=.+++...++-...| -|||||+.+-.-.     .-.+|..-.+.+.-.|..  
T Consensus      1034 W~p~G~lVAhL~Ehs----~~v~k~a~s~~~~s~FvsgS~DGtVKvW~~~k~~~~~~s~rS~ltys~~~sr~~~vt~~-- 1107 (1431)
T KOG1240|consen 1034 WNPRGILVAHLHEHS----SAVIKLAVSSEHTSLFVSGSDDGTVKVWNLRKLEGEGGSARSELTYSPEGSRVEKVTMC-- 1107 (1431)
T ss_pred             CCccceEeehhhhcc----ccccceeecCCCCceEEEecCCceEEEeeehhhhcCcceeeeeEEEeccCCceEEEEec--
Confidence            788888877765432    2234777788887777655555 8999999875543     555666666667666655  


Q ss_pred             cCCcEEEEEEecCCceEEEEecccCCCccccceeeeecccccccCCeEEEEeccc--c-ceeeEEecCCcEEE
Q 047869         1966 SRGKMFLIVLSECGSLYRLELSVEGNVGATPLKEIIQFNDREIHAKGLSLYFSST--Y-KLLFLSFQDGTTLV 2035 (2233)
Q Consensus      1966 e~G~~~ILVLSS~G~LY~Qels~s~d~g~~~ltEvvq~~~~q~~~~GVSVyYS~t--l-~LLF~SY~~G~Sf~ 2035 (2233)
                      .+|+ ...|.|++|.+-+..+...  +.+....+...+++.+..|..|+|+-.-.  . .+|-++...|.-+.
T Consensus      1108 ~~~~-~~Av~t~DG~v~~~~id~~--~~~~~~~~~~ri~n~~~~g~vv~m~a~~~~~~S~~lvy~T~~~~iv~ 1177 (1431)
T KOG1240|consen 1108 GNGD-QFAVSTKDGSVRVLRIDHY--NVSKRVATQVRIPNLKKDGVVVSMHAFTAIVQSHVLVYATDLSRIVS 1177 (1431)
T ss_pred             cCCC-eEEEEcCCCeEEEEEcccc--ccccceeeeeecccccCCCceEEeecccccccceeEEEEEeccceEE
Confidence            3454 4455599999999988743  44455555566666666777888875432  2 25555555554443


No 106
>KOG1274 consensus WD40 repeat protein [General function prediction only]
Probab=28.33  E-value=5.8e+02  Score=36.02  Aligned_cols=118  Identities=20%  Similarity=0.212  Sum_probs=70.7

Q ss_pred             eecccCce-EEEe-eCCeEEEEechhhhcccccCCccccccccccccccccceEEEEeecccCccceEEeecccceEEEE
Q 047869         1814 LSVSSRGR-LAVG-EGDKVAIFDVGQLIGQATIQPVTADKTNVKPLSRNIVRFEIVHLAFNSIVENYLTVAGYEDCQVLT 1891 (2233)
Q Consensus      1814 LSas~rGr-LAVa-EgdKVTILqlsaLLkQad~s~~skdKlTLtrLSsa~VpFeVlsLafNP~nEdyLAVcGLkDC~VLT 1891 (2233)
                      |+-+..|. |||. -.|+|.|+++..-.---+....-++  +=..+     .=-+-.+++.|-++.++|++==+++.|+.
T Consensus       144 l~~~p~~~fLAvss~dG~v~iw~~~~~~~~~tl~~v~k~--n~~~~-----s~i~~~~aW~Pk~g~la~~~~d~~Vkvy~  216 (933)
T KOG1274|consen  144 LSYDPKGNFLAVSSCDGKVQIWDLQDGILSKTLTGVDKD--NEFIL-----SRICTRLAWHPKGGTLAVPPVDNTVKVYS  216 (933)
T ss_pred             eeEcCCCCEEEEEecCceEEEEEcccchhhhhcccCCcc--ccccc-----cceeeeeeecCCCCeEEeeccCCeEEEEc
Confidence            55566554 7775 8999999999532221111111111  11111     11245788999888888777777777776


Q ss_pred             ecCCCceeeeeeeeeccCCceEEEeEEecCCCceEEEEecCeEEEEeCcC
Q 047869         1892 LNPRGEVTDRLAIELALQGAYIRRVDWVPGSPVQLMVVTNKFVKIYDLSQ 1941 (2233)
Q Consensus      1892 fss~GeV~DRL~LeL~Leg~fIIKa~WLPGSQt~LAVVT~~FVKIYDLS~ 1941 (2233)
                      - .+++..-.|..+..-.+  +..+.|-|+-++.=|..+..-|-|||--.
T Consensus       217 r-~~we~~f~Lr~~~~ss~--~~~~~wsPnG~YiAAs~~~g~I~vWnv~t  263 (933)
T KOG1274|consen  217 R-KGWELQFKLRDKLSSSK--FSDLQWSPNGKYIAASTLDGQILVWNVDT  263 (933)
T ss_pred             c-CCceeheeecccccccc--eEEEEEcCCCcEEeeeccCCcEEEEeccc
Confidence            5 33333333322222223  88899999977744566666999999876


No 107
>KOG0641 consensus WD40 repeat protein [General function prediction only]
Probab=28.09  E-value=6.1e+02  Score=30.99  Aligned_cols=75  Identities=16%  Similarity=0.271  Sum_probs=56.5

Q ss_pred             eEEEeEEecCCCceEEEEecC-eEEEEeCcCCCCC---CcEEEEcCCCCeeEEEEEEecCCcEEEEEEecCC--ceEEEE
Q 047869         1912 YIRRVDWVPGSPVQLMVVTNK-FVKIYDLSQDNIS---PLHYFTLPDDMIVDATLVIASRGKMFLIVLSECG--SLYRLE 1985 (2233)
Q Consensus      1912 fIIKa~WLPGSQt~LAVVT~~-FVKIYDLS~D~lS---PvyyF~LpsGkIrDaTfv~~e~G~~~ILVLSS~G--~LY~Qe 1985 (2233)
                      -|-...|-|.-|- ||.-.|+ .||+--.+-|..+   |-.+|..-.|-|||.+|+-.++-.-.|++..-+|  .||+-+
T Consensus        91 siyc~~ws~~gel-iatgsndk~ik~l~fn~dt~~~~g~dle~nmhdgtirdl~fld~~~s~~~il~s~gagdc~iy~td  169 (350)
T KOG0641|consen   91 SIYCTAWSPCGEL-IATGSNDKTIKVLPFNADTCNATGHDLEFNMHDGTIRDLAFLDDPESGGAILASAGAGDCKIYITD  169 (350)
T ss_pred             cEEEEEecCccCe-EEecCCCceEEEEecccccccccCcceeeeecCCceeeeEEecCCCcCceEEEecCCCcceEEEee
Confidence            4677889999753 6666666 9999999888766   7789999999999999997765555677665555  356544


Q ss_pred             ec
Q 047869         1986 LS 1987 (2233)
Q Consensus      1986 ls 1987 (2233)
                      -.
T Consensus       170 c~  171 (350)
T KOG0641|consen  170 CG  171 (350)
T ss_pred             cC
Confidence            43


No 108
>PF10168 Nup88:  Nuclear pore component;  InterPro: IPR019321  Nup88 can be divided into two structural domains; the N-terminal two-thirds of the protein has no obvious structural motifs. It is, however, where it binds to Nup98, one of the components of the nuclear pore. The C-terminal end is a predicted coiled-coil domain. Nup88 is overexpressed in tumour cells.
Probab=27.47  E-value=1.8e+02  Score=39.94  Aligned_cols=34  Identities=26%  Similarity=0.451  Sum_probs=28.4

Q ss_pred             CCeEEEEEeecCCCCCeEEEEEeeCCceeEEecc
Q 047869         2111 SPLVGVTAYKPLSKDKVHCLVLHDDGSLQIYSHV 2144 (2233)
Q Consensus      2111 s~vdgva~y~p~s~~rttlLLLcEDGSLrIYsa~ 2144 (2233)
                      +..+-=+.+||.+-..+|+++|..|+.||+|...
T Consensus       146 ~~~i~qv~WhP~s~~~~~l~vLtsdn~lR~y~~~  179 (717)
T PF10168_consen  146 SLEIKQVRWHPWSESDSHLVVLTSDNTLRLYDIS  179 (717)
T ss_pred             CceEEEEEEcCCCCCCCeEEEEecCCEEEEEecC
Confidence            3444556779999899999999999999999874


No 109
>KOG3881 consensus Uncharacterized conserved protein [Function unknown]
Probab=27.46  E-value=4e+02  Score=34.46  Aligned_cols=103  Identities=19%  Similarity=0.347  Sum_probs=78.4

Q ss_pred             ceEEeecccceEEEEecCCCceeeeeeeeeccCCceEEEeEEecC--CCceEEEEecCeEEEEeCcCCCCCCcEEEEcCC
Q 047869         1877 NYLTVAGYEDCQVLTLNPRGEVTDRLAIELALQGAYIRRVDWVPG--SPVQLMVVTNKFVKIYDLSQDNISPLHYFTLPD 1954 (2233)
Q Consensus      1877 dyLAVcGLkDC~VLTfss~GeV~DRL~LeL~Leg~fIIKa~WLPG--SQt~LAVVT~~FVKIYDLS~D~lSPvyyF~Lps 1954 (2233)
                      |-|=+|-+.+| .=+|+++---.|+|.+...   ..|-.+..+||  .+....++-...|++||-. .-=-|+-.|-.-+
T Consensus       173 n~lkiwdle~~-~qiw~aKNvpnD~L~LrVP---vW~tdi~Fl~g~~~~~fat~T~~hqvR~YDt~-~qRRPV~~fd~~E  247 (412)
T KOG3881|consen  173 NELKIWDLEQS-KQIWSAKNVPNDRLGLRVP---VWITDIRFLEGSPNYKFATITRYHQVRLYDTR-HQRRPVAQFDFLE  247 (412)
T ss_pred             cceeeeecccc-eeeeeccCCCCccccceee---eeeccceecCCCCCceEEEEecceeEEEecCc-ccCcceeEecccc
Confidence            66777888888 7777777777777766554   45777888999  5554556667799999998 6667999999988


Q ss_pred             CCeeEEEEEEecCCcEEEEEEecCCceEEEEec
Q 047869         1955 DMIVDATLVIASRGKMFLIVLSECGSLYRLELS 1987 (2233)
Q Consensus      1955 GkIrDaTfv~~e~G~~~ILVLSS~G~LY~Qels 1987 (2233)
                      -.|...+..  ..| +.|++-.+.|.|..-+++
T Consensus       248 ~~is~~~l~--p~g-n~Iy~gn~~g~l~~FD~r  277 (412)
T KOG3881|consen  248 NPISSTGLT--PSG-NFIYTGNTKGQLAKFDLR  277 (412)
T ss_pred             Ccceeeeec--CCC-cEEEEecccchhheeccc
Confidence            777776655  445 468899999999877776


No 110
>PF06977 SdiA-regulated:  SdiA-regulated;  InterPro: IPR009722 This entry represents a conserved region approximately 100 residues long within a number of hypothetical bacterial proteins that may be regulated by SdiA, a member of the LuxR family of transcriptional regulators. Some proteins contain the IPR001258 repeat (from InterPro).; PDB: 3QQZ_A.
Probab=27.20  E-value=3.2e+02  Score=32.96  Aligned_cols=71  Identities=15%  Similarity=0.225  Sum_probs=39.2

Q ss_pred             ccCceEEEee--CCeEEEEechh---hhcccccCCccccccccccccccccceEEEEeecccCccceEEeecccceEEEE
Q 047869         1817 SSRGRLAVGE--GDKVAIFDVGQ---LIGQATIQPVTADKTNVKPLSRNIVRFEIVHLAFNSIVENYLTVAGYEDCQVLT 1891 (2233)
Q Consensus      1817 s~rGrLAVaE--gdKVTILqlsa---LLkQad~s~~skdKlTLtrLSsa~VpFeVlsLafNP~nEdyLAVcGLkDC~VLT 1891 (2233)
                      -..|+++|.+  .+.+.++++..   .+..++     ..++++-.-  ..-...+.+|+++|.++.+++|.-=+--.|+.
T Consensus        73 ~g~~~~vl~~Er~~~L~~~~~~~~~~~~~~~~-----~~~~~l~~~--~~~N~G~EGla~D~~~~~L~v~kE~~P~~l~~  145 (248)
T PF06977_consen   73 LGNGRYVLSEERDQRLYIFTIDDDTTSLDRAD-----VQKISLGFP--NKGNKGFEGLAYDPKTNRLFVAKERKPKRLYE  145 (248)
T ss_dssp             -STTEEEEEETTTTEEEEEEE----TT--EEE-----EEEEE---S-----SS--EEEEEETTTTEEEEEEESSSEEEEE
T ss_pred             ECCCEEEEEEcCCCcEEEEEEeccccccchhh-----ceEEecccc--cCCCcceEEEEEcCCCCEEEEEeCCCChhhEE
Confidence            3567777764  67788888843   121111     122222111  22334467999999999999997666667777


Q ss_pred             ecC
Q 047869         1892 LNP 1894 (2233)
Q Consensus      1892 fss 1894 (2233)
                      ++.
T Consensus       146 ~~~  148 (248)
T PF06977_consen  146 VNG  148 (248)
T ss_dssp             EES
T ss_pred             Ecc
Confidence            765


No 111
>KOG0321 consensus WD40 repeat-containing protein L2DTL [Function unknown]
Probab=27.18  E-value=7.7e+02  Score=33.85  Aligned_cols=110  Identities=14%  Similarity=0.200  Sum_probs=66.1

Q ss_pred             Eeecc--cCccceEEeecccceEEEEecCCCceeee----eeeeeccCCceEEEeEEecCCCceEEEEecCeEEEEeCcC
Q 047869         1868 HLAFN--SIVENYLTVAGYEDCQVLTLNPRGEVTDR----LAIELALQGAYIRRVDWVPGSPVQLMVVTNKFVKIYDLSQ 1941 (2233)
Q Consensus      1868 sLafN--P~nEdyLAVcGLkDC~VLTfss~GeV~DR----L~LeL~Leg~fIIKa~WLPGSQt~LAVVT~~FVKIYDLS~ 1941 (2233)
                      +..|.  |-++..||| +.+|=+|--++.+- +.+|    .-.+|..-.+-|.++.|+||....+-+.--+.+|.||+..
T Consensus        54 ~~sFs~~~n~eHiLav-adE~G~i~l~dt~~-~~fr~ee~~lk~~~aH~nAifDl~wapge~~lVsasGDsT~r~Wdvk~  131 (720)
T KOG0321|consen   54 ADSFSAAPNKEHILAV-ADEDGGIILFDTKS-IVFRLEERQLKKPLAHKNAIFDLKWAPGESLLVSASGDSTIRPWDVKT  131 (720)
T ss_pred             cccccCCCCccceEEE-ecCCCceeeecchh-hhcchhhhhhcccccccceeEeeccCCCceeEEEccCCceeeeeeecc
Confidence            44454  555666665 56777776665442 2222    1234444578899999999754444455667999999999


Q ss_pred             CCCCCcEEEEcCCCCeeEEEEEEec---------CCcEEEEEEecCC
Q 047869         1942 DNISPLHYFTLPDDMIVDATLVIAS---------RGKMFLIVLSECG 1979 (2233)
Q Consensus      1942 D~lSPvyyF~LpsGkIrDaTfv~~e---------~G~~~ILVLSS~G 1979 (2233)
                      ..+-=+--|.=-+|.+..++|....         +|.+.|-.+--.|
T Consensus       132 s~l~G~~~~~GH~~SvkS~cf~~~n~~vF~tGgRDg~illWD~R~n~  178 (720)
T KOG0321|consen  132 SRLVGGRLNLGHTGSVKSECFMPTNPAVFCTGGRDGEILLWDCRCNG  178 (720)
T ss_pred             ceeecceeecccccccchhhhccCCCcceeeccCCCcEEEEEEeccc
Confidence            8876553333345555666664322         4444555555444


No 112
>KOG0973 consensus Histone transcription regulator HIRA, WD repeat superfamily [Cell cycle control, cell division, chromosome partitioning; Transcription]
Probab=26.88  E-value=8.6e+02  Score=34.77  Aligned_cols=118  Identities=15%  Similarity=0.231  Sum_probs=70.3

Q ss_pred             cceEEEEeecccCccceEEeecccceEEEEecCCCceeeeeeeeeccCCceEEEeEEecCCCceEEEEecC-eEEEEeCc
Q 047869         1862 VRFEIVHLAFNSIVENYLTVAGYEDCQVLTLNPRGEVTDRLAIELALQGAYIRRVDWVPGSPVQLMVVTNK-FVKIYDLS 1940 (2233)
Q Consensus      1862 VpFeVlsLafNP~nEdyLAVcGLkDC~VLTfss~GeV~DRL~LeL~Leg~fIIKa~WLPGSQt~LAVVT~~-FVKIYDLS 1940 (2233)
                      =+-+|+.+...| ++.|||-||+.+=-|+-=....+.+-++.-|-    .+++-+.|.|--|+ ||.-+.| .||||+.+
T Consensus       128 H~~DV~Dv~Wsp-~~~~lvS~s~DnsViiwn~~tF~~~~vl~~H~----s~VKGvs~DP~Gky-~ASqsdDrtikvwrt~  201 (942)
T KOG0973|consen  128 HDSDVLDVNWSP-DDSLLVSVSLDNSVIIWNAKTFELLKVLRGHQ----SLVKGVSWDPIGKY-FASQSDDRTLKVWRTS  201 (942)
T ss_pred             CCCccceeccCC-CccEEEEecccceEEEEccccceeeeeeeccc----ccccceEECCccCe-eeeecCCceEEEEEcc
Confidence            345788999999 99999999998754432223333333333333    34666789999887 8877777 89999966


Q ss_pred             CC----------CCCCcEEEEc-----CCCCeeEEEEEEecCCcEEEEEEecCCceEEEEe
Q 047869         1941 QD----------NISPLHYFTL-----PDDMIVDATLVIASRGKMFLIVLSECGSLYRLEL 1986 (2233)
Q Consensus      1941 ~D----------~lSPvyyF~L-----psGkIrDaTfv~~e~G~~~ILVLSS~G~LY~Qel 1986 (2233)
                      .=          .-+|.++|.+     |.|++.-+.-.++ +|...+-++.-+++-+-+.+
T Consensus       202 dw~i~k~It~pf~~~~~~T~f~RlSWSPDG~~las~nA~n-~~~~~~~IieR~tWk~~~~L  261 (942)
T KOG0973|consen  202 DWGIEKSITKPFEESPLTTFFLRLSWSPDGHHLASPNAVN-GGKSTIAIIERGTWKVDKDL  261 (942)
T ss_pred             cceeeEeeccchhhCCCcceeeecccCCCcCeecchhhcc-CCcceeEEEecCCceeeeee
Confidence            51          1134444332     4555444433322 34455555555555554444


No 113
>TIGR01171 rplB_bact ribosomal protein L2, bacterial/organellar. This model distinguishes bacterial and organellar ribosomal protein L2 from its counterparts in the archaea and in the eukaryotic cytosol. Plant mitochondrial examples tend to have long, variable inserts.
Probab=26.47  E-value=1.3e+03  Score=28.70  Aligned_cols=130  Identities=20%  Similarity=0.272  Sum_probs=88.5

Q ss_pred             cccccccccccccceEEEEeecccCccceEEeecccc-eEEEEecCCCc-eeeeee------ee----ecc----CCceE
Q 047869         1850 DKTNVKPLSRNIVRFEIVHLAFNSIVENYLTVAGYED-CQVLTLNPRGE-VTDRLA------IE----LAL----QGAYI 1913 (2233)
Q Consensus      1850 dKlTLtrLSsa~VpFeVlsLafNP~nEdyLAVcGLkD-C~VLTfss~Ge-V~DRL~------Le----L~L----eg~fI 1913 (2233)
                      +.++..+- ...++=.|++|-++|.-..+||.+=+.| ..-+.+.+.|- +.|.+.      ++    +.|    .|.+|
T Consensus        61 R~IDf~r~-~~~i~g~V~~IeyDP~Rsa~IAlv~~~~g~~~YIlap~gl~~Gd~I~~g~~~~i~~Gn~lpL~~IP~Gt~I  139 (273)
T TIGR01171        61 RIIDFKRN-KDGIPAKVAAIEYDPNRSARIALLHYADGEKRYILAPKGLKVGDTVISGPEAPIKPGNALPLRNIPVGTTV  139 (273)
T ss_pred             ceeecccc-cCCCcEEEEEEEeCCCCCcCEEEEEecCCcEEEEEccCCCCCCCEEEECCCCCCCCcCCcccccCCCCCEE
Confidence            44444442 1234457999999999999999997765 55677767762 222222      11    111    38899


Q ss_pred             EEeEEecCCCceEEEEecCeEEEEeCcCCCCCCcEEEEcCCCCeeE------EEEEEecCCcEEEEEEecCCceEEE
Q 047869         1914 RRVDWVPGSPVQLMVVTNKFVKIYDLSQDNISPLHYFTLPDDMIVD------ATLVIASRGKMFLIVLSECGSLYRL 1984 (2233)
Q Consensus      1914 IKa~WLPGSQt~LAVVT~~FVKIYDLS~D~lSPvyyF~LpsGkIrD------aTfv~~e~G~~~ILVLSS~G~LY~Q 1984 (2233)
                      -+++--||.-.+||=+.-.|..|-.  ++  .=.....||+|+++.      ||+....++......+-.+|.-+.-
T Consensus       140 ~NIE~~pg~Ggkl~RsAGt~A~ii~--k~--~~~~~vkLPSGe~r~i~~~c~AtiG~Vsn~~~~~~~~gKAG~~r~l  212 (273)
T TIGR01171       140 HNIELKPGKGGQLARSAGTSAQILA--KE--GGYVTLRLPSGEMRMVLKECRATIGEVGNEDHNNIVLGKAGRSRWL  212 (273)
T ss_pred             EEEEecCCCCceEEEecCCeEEEEE--ec--CCEEEEECCCCCeEEECCcCeEEEEEccCCchhccEeccchhheeC
Confidence            9999999999999998888988873  33  234467899999865      5665555555555666678877764


No 114
>KOG0307 consensus Vesicle coat complex COPII, subunit SEC31 [Intracellular trafficking, secretion, and vesicular transport]
Probab=26.33  E-value=62  Score=45.24  Aligned_cols=71  Identities=28%  Similarity=0.524  Sum_probs=58.3

Q ss_pred             EEEeecccCccceEEeecccceEEEEecCC-CceeeeeeeeeccCCceEEEeEEecCCCceEEEEecC-eEEEEeCcC
Q 047869         1866 IVHLAFNSIVENYLTVAGYEDCQVLTLNPR-GEVTDRLAIELALQGAYIRRVDWVPGSPVQLMVVTNK-FVKIYDLSQ 1941 (2233)
Q Consensus      1866 VlsLafNP~nEdyLAVcGLkDC~VLTfss~-GeV~DRL~LeL~Leg~fIIKa~WLPGSQt~LAVVT~~-FVKIYDLS~ 1941 (2233)
                      |++|.-||..+++|+-|| ||-.|+..|.+ |+|...+    .-.|+.+.++.|-|.....+|++.-+ -|.||-|-.
T Consensus       256 ilslsWc~~D~~lllSsg-kD~~ii~wN~~tgEvl~~~----p~~~nW~fdv~w~pr~P~~~A~asfdgkI~I~sl~~  328 (1049)
T KOG0307|consen  256 ILSLSWCPQDPRLLLSSG-KDNRIICWNPNTGEVLGEL----PAQGNWCFDVQWCPRNPSVMAAASFDGKISIYSLQG  328 (1049)
T ss_pred             eeeeccCCCCchhhhccc-CCCCeeEecCCCceEeeec----CCCCcceeeeeecCCCcchhhhheeccceeeeeeec
Confidence            789999999999999999 56677777654 6666654    44899999999999999999887766 899998743


No 115
>PF06433 Me-amine-dh_H:  Methylamine dehydrogenase heavy chain (MADH);  InterPro: IPR009451 Methylamine dehydrogenase (1.4.99.3 from EC) is a periplasmic quinoprotein found in several methylotrophic bacteria. It is induced when the bacteria are grown on methylamine as a carbon source. MADH catalyses the oxidative deamination of amines to their corresponding aldehydes. The redox cofactor of this enzyme is tryptophan tryptophylquinone (TTQ). Electrons derived from the oxidation of methylamine are passed to an electron acceptor, which is usually the blue-copper protein amicyanin (IPR002386 from INTERPRO).  RCH2NH2 + H2O + acceptor = RCHO + NH3 + reduced acceptor  MADH is a hetero-tetramer, composed of two heavy subunits and two light subunits. The heavy subunit forms a seven-bladed beta-propeller-like structure.; GO: 0030058 amine dehydrogenase activity, 0030416 methylamine metabolic process, 0055114 oxidation-reduction process, 0042597 periplasmic space; PDB: 3RN1_F 3SVW_F 3PXT_F 3L4O_F 3L4M_D 3SJL_F 3PXS_D 3ORV_F 3RMZ_F 3RLM_F ....
Probab=26.26  E-value=8.3e+02  Score=31.34  Aligned_cols=93  Identities=18%  Similarity=0.302  Sum_probs=53.8

Q ss_pred             CeEEEEeCcCCCCC-----CcEEEEcCCCCeeEEEEEEecCCcEEEEEEecCCceEEEEecccCCCccccceeeeecccc
Q 047869         1932 KFVKIYDLSQDNIS-----PLHYFTLPDDMIVDATLVIASRGKMFLIVLSECGSLYRLELSVEGNVGATPLKEIIQFNDR 2006 (2233)
Q Consensus      1932 ~FVKIYDLS~D~lS-----PvyyF~LpsGkIrDaTfv~~e~G~~~ILVLSS~G~LY~Qels~s~d~g~~~ltEvvq~~~~ 2006 (2233)
                      ..|.|-||.....-     |-|+-..|.|+= ..+-+ ..+|.+.-+.+=++|... |..+.-.+...=++-        
T Consensus       118 ~SVtVVDl~~~kvv~ei~~PGC~~iyP~~~~-~F~~l-C~DGsl~~v~Ld~~Gk~~-~~~t~~F~~~~dp~f--------  186 (342)
T PF06433_consen  118 TSVTVVDLAAKKVVGEIDTPGCWLIYPSGNR-GFSML-CGDGSLLTVTLDADGKEA-QKSTKVFDPDDDPLF--------  186 (342)
T ss_dssp             EEEEEEETTTTEEEEEEEGTSEEEEEEEETT-EEEEE-ETTSCEEEEEETSTSSEE-EEEEEESSTTTS-B---------
T ss_pred             CeEEEEECCCCceeeeecCCCEEEEEecCCC-ceEEE-ecCCceEEEEECCCCCEe-EeeccccCCCCcccc--------
Confidence            46677777766553     778888887763 34433 367888888888888886 443321111000000        


Q ss_pred             cccCCeEEEEecc-ccceeeEEecCCcEEEEEcCCCc
Q 047869         2007 EIHAKGLSLYFSS-TYKLLFLSFQDGTTLVGRLSPNA 2042 (2233)
Q Consensus      2007 q~~~~GVSVyYS~-tl~LLF~SY~~G~Sf~a~Ls~~~ 2042 (2233)
                        .. +   .|+. .-+++|+|| +|..|-+.++...
T Consensus       187 --~~-~---~~~~~~~~~~F~Sy-~G~v~~~dlsg~~  216 (342)
T PF06433_consen  187 --EH-P---AYSRDGGRLYFVSY-EGNVYSADLSGDS  216 (342)
T ss_dssp             --S------EEETTTTEEEEEBT-TSEEEEEEETTSS
T ss_pred             --cc-c---ceECCCCeEEEEec-CCEEEEEeccCCc
Confidence              11 1   1222 346889998 5788888886654


No 116
>PF00400 WD40:  WD domain, G-beta repeat;  InterPro: IPR019781 WD-40 repeats (also known as WD or beta-transducin repeats) are short ~40 amino acid motifs, often terminating in a Trp-Asp (W-D) dipeptide. WD40 repeats usually assume a 7-8 bladed beta-propeller fold, but proteins have been found with 4 to 16 repeated units, which also form a circularised beta-propeller structure. WD-repeat proteins are a large family found in all eukaryotes and are implicated in a variety of functions ranging from signal transduction and transcription regulation to cell cycle control and apoptosis. Repeated WD40 motifs act as a site for protein-protein interaction, and proteins containing WD40 repeats are known to serve as platforms for the assembly of protein complexes or mediators of transient interplay among other proteins. The specificity of the proteins is determined by the sequences outside the repeats themselves. Examples of such complexes are G proteins (beta subunit is a beta-propeller), TAFII transcription factor, and E3 ubiquitin ligase. In Arabidopsis spp., several WD40-containing proteins act as key regulators of plant-specific developmental events.; PDB: 2ZKQ_a 3CFV_B 3CFS_B 1PEV_A 1NR0_A 1VYH_T 3RFH_A 3O2Z_T 3FRX_C 3U5G_g ....
Probab=25.41  E-value=1.4e+02  Score=24.69  Aligned_cols=28  Identities=18%  Similarity=0.510  Sum_probs=21.9

Q ss_pred             CceEEEeEEecCCCceEEEEec-CeEEEEe
Q 047869         1910 GAYIRRVDWVPGSPVQLMVVTN-KFVKIYD 1938 (2233)
Q Consensus      1910 g~fIIKa~WLPGSQt~LAVVT~-~FVKIYD 1938 (2233)
                      ...|..+.|-|..+. ||.+.. ..|+|||
T Consensus        11 ~~~i~~i~~~~~~~~-~~s~~~D~~i~vwd   39 (39)
T PF00400_consen   11 SSSINSIAWSPDGNF-LASGSSDGTIRVWD   39 (39)
T ss_dssp             SSSEEEEEEETTSSE-EEEEETTSEEEEEE
T ss_pred             CCcEEEEEEeccccc-ceeeCCCCEEEEEC
Confidence            567999999999655 554444 6999998


No 117
>PRK03629 tolB translocation protein TolB; Provisional
Probab=24.91  E-value=9.8e+02  Score=30.48  Aligned_cols=151  Identities=17%  Similarity=0.226  Sum_probs=76.0

Q ss_pred             EeecccCccceEEeec--ccceEEEEecCCCceeeeeeeeeccCCceEEEeEEecCCCceEEEEecC--eEEEEeCcCCC
Q 047869         1868 HLAFNSIVENYLTVAG--YEDCQVLTLNPRGEVTDRLAIELALQGAYIRRVDWVPGSPVQLMVVTNK--FVKIYDLSQDN 1943 (2233)
Q Consensus      1868 sLafNP~nEdyLAVcG--LkDC~VLTfss~GeV~DRL~LeL~Leg~fIIKa~WLPGSQt~LAVVT~~--FVKIYDLS~D~ 1943 (2233)
                      +.+++| ++++||+..  -.+.+|..++.++.-..++.    -....+....|-|+.+. |+.+..+  ..+||.+..+.
T Consensus       247 ~~~~SP-DG~~La~~~~~~g~~~I~~~d~~tg~~~~lt----~~~~~~~~~~wSPDG~~-I~f~s~~~g~~~Iy~~d~~~  320 (429)
T PRK03629        247 APAFSP-DGSKLAFALSKTGSLNLYVMDLASGQIRQVT----DGRSNNTEPTWFPDSQN-LAYTSDQAGRPQVYKVNING  320 (429)
T ss_pred             CeEECC-CCCEEEEEEcCCCCcEEEEEECCCCCEEEcc----CCCCCcCceEECCCCCE-EEEEeCCCCCceEEEEECCC
Confidence            467888 778888752  22234555544432222221    12334667899998765 6655543  46788655444


Q ss_pred             CCCcEEEEcCCCCeeEEEEEEecCCcEEEEEEecCC--ceEEEEecccCCCccccceeeeecccccccCCeEEEEecccc
Q 047869         1944 ISPLHYFTLPDDMIVDATLVIASRGKMFLIVLSECG--SLYRLELSVEGNVGATPLKEIIQFNDREIHAKGLSLYFSSTY 2021 (2233)
Q Consensus      1944 lSPvyyF~LpsGkIrDaTfv~~e~G~~~ILVLSS~G--~LY~Qels~s~d~g~~~ltEvvq~~~~q~~~~GVSVyYS~tl 2021 (2233)
                      -.+. ...-..+....  ....++|+.++++....|  +||..++.. +  ....+++.    .     ...+..+|++-
T Consensus       321 g~~~-~lt~~~~~~~~--~~~SpDG~~Ia~~~~~~g~~~I~~~dl~~-g--~~~~Lt~~----~-----~~~~p~~SpDG  385 (429)
T PRK03629        321 GAPQ-RITWEGSQNQD--ADVSSDGKFMVMVSSNGGQQHIAKQDLAT-G--GVQVLTDT----F-----LDETPSIAPNG  385 (429)
T ss_pred             CCeE-EeecCCCCccC--EEECCCCCEEEEEEccCCCceEEEEECCC-C--CeEEeCCC----C-----CCCCceECCCC
Confidence            3221 12222222222  333578876665554444  466666541 1  12222221    1     12245678888


Q ss_pred             ceeeEEecCC-cEEEEEcC
Q 047869         2022 KLLFLSFQDG-TTLVGRLS 2039 (2233)
Q Consensus      2022 ~LLF~SY~~G-~Sf~a~Ls 2039 (2233)
                      ++|.++-.+| ...+.-++
T Consensus       386 ~~i~~~s~~~~~~~l~~~~  404 (429)
T PRK03629        386 TMVIYSSSQGMGSVLNLVS  404 (429)
T ss_pred             CEEEEEEcCCCceEEEEEE
Confidence            7666666544 44444443


No 118
>KOG1900 consensus Nuclear pore complex, Nup155 component (D Nup154, sc Nup157/Nup170) [Nuclear structure; Intracellular trafficking, secretion, and vesicular transport]
Probab=24.43  E-value=1.4e+03  Score=33.80  Aligned_cols=192  Identities=17%  Similarity=0.224  Sum_probs=117.0

Q ss_pred             cCceEEEeeCCeEEEEechhhhcccccCCccccccccccccccccceEEEEe---ecccCccceEEeecccceEEEEecC
Q 047869         1818 SRGRLAVGEGDKVAIFDVGQLIGQATIQPVTADKTNVKPLSRNIVRFEIVHL---AFNSIVENYLTVAGYEDCQVLTLNP 1894 (2233)
Q Consensus      1818 ~rGrLAVaEgdKVTILqlsaLLkQad~s~~skdKlTLtrLSsa~VpFeVlsL---afNP~nEdyLAVcGLkDC~VLTfss 1894 (2233)
                      ..||.++-.+.++-+++-.+  ++..   ...|-+..+.|.     -.++.-   .|+|.-...|+|+=..|..|+.++-
T Consensus        89 eI~RaWiTiDn~L~lWny~~--~~e~---~~~d~~shtIl~-----V~LvkPkpgvFv~~IqhlLvvaT~~ei~ilgV~~  158 (1311)
T KOG1900|consen   89 EIGRAWITIDNNLFLWNYES--DNEL---AEYDGLSHTILK-----VGLVKPKPGVFVPEIQHLLVVATPVEIVILGVSF  158 (1311)
T ss_pred             hhcceEEEeCCeEEEEEcCC--CCcc---ccccchhhhhee-----eeeecCCCCcchhhhheeEEecccceEEEEEEEe
Confidence            46889999999999999865  2222   234444444443     333333   4779999999999999999999832


Q ss_pred             C------CceeeeeeeeeccCCceEEEeEEecCCCceEEEEecCe-----------------EEEEeCcCCCC---CCcE
Q 047869         1895 R------GEVTDRLAIELALQGAYIRRVDWVPGSPVQLMVVTNKF-----------------VKIYDLSQDNI---SPLH 1948 (2233)
Q Consensus      1895 ~------GeV~DRL~LeL~Leg~fIIKa~WLPGSQt~LAVVT~~F-----------------VKIYDLS~D~l---SPvy 1948 (2233)
                      .      +...++  +....+|..++.+.-..+-+.  -.+.++-                 -+-|+|++..+   .|+ 
T Consensus       159 ~~~~~~~~~f~~~--~~i~~dg~~V~~I~~t~nGRI--F~~G~dg~lyEl~Yq~~~gWf~~rc~Kiclt~s~ls~lvPs-  233 (1311)
T KOG1900|consen  159 DEFTGELSIFNTS--FKISVDGVSVNCITYTENGRI--FFAGRDGNLYELVYQAEDGWFGSRCRKICLTKSVLSSLVPS-  233 (1311)
T ss_pred             ccccCcccccccc--eeeecCCceEEEEEeccCCcE--EEeecCCCEEEEEEeccCchhhcccccccCchhHHHHhhhh-
Confidence            2      122223  455668999999885444433  3333332                 33455555443   377 


Q ss_pred             EEEcC---CCCeeEEEEEEecCCcEEEEEEecCCceEEEEecccCCCcc--------------ccceeeeecc--ccccc
Q 047869         1949 YFTLP---DDMIVDATLVIASRGKMFLIVLSECGSLYRLELSVEGNVGA--------------TPLKEIIQFN--DREIH 2009 (2233)
Q Consensus      1949 yF~Lp---sGkIrDaTfv~~e~G~~~ILVLSS~G~LY~Qels~s~d~g~--------------~~ltEvvq~~--~~q~~ 2009 (2233)
                      .+.+|   .+-|+-.++- +++  .++.++|++|-+=.-+++..|-.+.              ...++.+..+  ...+.
T Consensus       234 ~~~~~~~~~dpI~qi~ID-~SR--~IlY~lsek~~v~~Y~i~~~G~~~~r~~~~~~~~i~~qa~~~~~~~~~s~f~~Ivs  310 (1311)
T KOG1900|consen  234 LLSVPGSSKDPIRQITID-NSR--NILYVLSEKGTVSAYDIGGNGLGGPRFVSVSRNYIDVQALSLKNPLDDSVFFSIVS  310 (1311)
T ss_pred             hhcCCCCCCCcceeeEec-ccc--ceeeeeccCceEEEEEccCCCccceeeeehhHHHHHHHhhhccccCCCcccceeEE
Confidence            66677   4457777754 343  4888899999888888875443222              2222221111  11224


Q ss_pred             CCeEEEEeccccceeeEE
Q 047869         2010 AKGLSLYFSSTYKLLFLS 2027 (2233)
Q Consensus      2010 ~~GVSVyYS~tl~LLF~S 2027 (2233)
                      -.+++.++|.++.++-+.
T Consensus       311 I~~l~~~es~~l~LvA~t  328 (1311)
T KOG1900|consen  311 ISPLSASESNDLHLVAIT  328 (1311)
T ss_pred             ecccCcccccceeEEEEe
Confidence            457888888888775443


No 119
>KOG2445 consensus Nuclear pore complex component (sc Seh1) [Nuclear structure; Intracellular trafficking, secretion, and vesicular transport]
Probab=24.42  E-value=2.5e+02  Score=35.42  Aligned_cols=85  Identities=24%  Similarity=0.291  Sum_probs=59.1

Q ss_pred             cccccccceEEEEeecccCccceEEeeccc------ceEEEEecCCCceeeeeeeeeccCCceEEEeEEecCC---CceE
Q 047869         1856 PLSRNIVRFEIVHLAFNSIVENYLTVAGYE------DCQVLTLNPRGEVTDRLAIELALQGAYIRRVDWVPGS---PVQL 1926 (2233)
Q Consensus      1856 rLSsa~VpFeVlsLafNP~nEdyLAVcGLk------DC~VLTfss~GeV~DRL~LeL~Leg~fIIKa~WLPGS---Qt~L 1926 (2233)
                      |++++.-+.--++=...-..+.+|||--.+      +.+|+..+.+|.---+++ +|.--+.-|+.+-|-|..   -..|
T Consensus       164 pp~~~~~~~~CvsWn~sr~~~p~iAvgs~e~a~~~~~~~Iye~~e~~rKw~kva-~L~d~~dpI~di~wAPn~Gr~y~~l  242 (361)
T KOG2445|consen  164 PPGKNKQPCFCVSWNPSRMHEPLIAVGSDEDAPHLNKVKIYEYNENGRKWLKVA-ELPDHTDPIRDISWAPNIGRSYHLL  242 (361)
T ss_pred             CcccccCcceEEeeccccccCceEEEEcccCCccccceEEEEecCCcceeeeeh-hcCCCCCcceeeeeccccCCceeeE
Confidence            555555554444444445678899998877      899999999973222211 222227789999999975   3457


Q ss_pred             EEEecCeEEEEeCcC
Q 047869         1927 MVVTNKFVKIYDLSQ 1941 (2233)
Q Consensus      1927 AVVT~~FVKIYDLS~ 1941 (2233)
                      ||+|-+-|+||.+-.
T Consensus       243 AvA~kDgv~I~~v~~  257 (361)
T KOG2445|consen  243 AVATKDGVRIFKVKV  257 (361)
T ss_pred             EEeecCcEEEEEEee
Confidence            999999999999973


No 120
>KOG1007 consensus WD repeat protein TSSC1, WD repeat superfamily [Function unknown]
Probab=24.39  E-value=1.6e+03  Score=28.77  Aligned_cols=231  Identities=15%  Similarity=0.169  Sum_probs=142.4

Q ss_pred             cccccccccccchHHhHH-HhhcCcccccceecccCce-EEEe---eC-----CeEEEEechhhhcccccCCcccccccc
Q 047869         1785 SLDLKIKADYSNARELKS-HLASGSLVKSLLSVSSRGR-LAVG---EG-----DKVAIFDVGQLIGQATIQPVTADKTNV 1854 (2233)
Q Consensus      1785 SFE~kir~d~~~~relks-~l~sGq~iRqLLSas~rGr-LAVa---Eg-----dKVTILqlsaLLkQad~s~~skdKlTL 1854 (2233)
                      -+-..++++-++.+-+.. ....--.|+.+-++...+| ||-.   -+     -+.+|+++-.=++|+..+    .=-.+
T Consensus        40 NqVhll~~d~e~s~l~skvf~h~agEvw~las~P~d~~ilaT~yn~~s~s~vl~~aaiw~ipe~~~~S~~~----tlE~v  115 (370)
T KOG1007|consen   40 NQVHLLRLDSEGSELLSKVFFHHAGEVWDLASSPFDQRILATVYNDTSDSGVLTGAAIWQIPEPLGQSNSS----TLECV  115 (370)
T ss_pred             ceeEEEEecCccchhhhhhhhcCCcceehhhcCCCCCceEEEEEeccCCCcceeeEEEEecccccCccccc----hhhHh
Confidence            333445555555443222 2233446777766665544 4421   11     368899998888885532    22344


Q ss_pred             ccccccccceEEEEeecccCccceEEeecccceEEEEecCCCceeeeeeeeeccCC-ceEEEeEEecCCCceEEEEecC-
Q 047869         1855 KPLSRNIVRFEIVHLAFNSIVENYLTVAGYEDCQVLTLNPRGEVTDRLAIELALQG-AYIRRVDWVPGSPVQLMVVTNK- 1932 (2233)
Q Consensus      1855 trLSsa~VpFeVlsLafNP~nEdyLAVcGLkDC~VLTfss~GeV~DRL~LeL~Leg-~fIIKa~WLPGSQt~LAVVT~~- 1932 (2233)
                      .+|-...|+ .|--|.+.| |.+.||-.-=.+..++.+..+-+++-.+......++ --.--..|=|-.+.....+|++ 
T Consensus       116 ~~Ldteavg-~i~cvew~P-ns~klasm~dn~i~l~~l~ess~~vaev~ss~s~e~~~~ftsg~WspHHdgnqv~tt~d~  193 (370)
T KOG1007|consen  116 ASLDTEAVG-KINCVEWEP-NSDKLASMDDNNIVLWSLDESSKIVAEVLSSESAEMRHSFTSGAWSPHHDGNQVATTSDS  193 (370)
T ss_pred             hcCCHHHhC-ceeeEEEcC-CCCeeEEeccCceEEEEcccCcchheeecccccccccceecccccCCCCccceEEEeCCC
Confidence            666667788 888899999 888888777778888888777666555544444442 2356677999777666555655 


Q ss_pred             eEEEEeCcCCCCCCcEEEEcCCCC---eeEEEEEEecCCcEEEEEEecCCceEEEEecccCCCccccceeeeeccccccc
Q 047869         1933 FVKIYDLSQDNISPLHYFTLPDDM---IVDATLVIASRGKMFLIVLSECGSLYRLELSVEGNVGATPLKEIIQFNDREIH 2009 (2233)
Q Consensus      1933 FVKIYDLS~D~lSPvyyF~LpsGk---IrDaTfv~~e~G~~~ILVLSS~G~LY~Qels~s~d~g~~~ltEvvq~~~~q~~ 2009 (2233)
                      .+.-||+-.++    .++.+-.--   +||.-|  +++.+.+++---.+||+-+=+.++..    ++++|.   ++  -.
T Consensus       194 tl~~~D~RT~~----~~~sI~dAHgq~vrdlDf--Npnkq~~lvt~gDdgyvriWD~R~tk----~pv~el---~~--Hs  258 (370)
T KOG1007|consen  194 TLQFWDLRTMK----KNNSIEDAHGQRVRDLDF--NPNKQHILVTCGDDGYVRIWDTRKTK----FPVQEL---PG--HS  258 (370)
T ss_pred             cEEEEEccchh----hhcchhhhhcceeeeccC--CCCceEEEEEcCCCccEEEEeccCCC----cccccc---CC--Cc
Confidence            89999998665    234443322   566653  46666677777788999887776422    222221   11  12


Q ss_pred             CCeEEEEeccccceeeEEec-CCcEEEE
Q 047869         2010 AKGLSLYFSSTYKLLFLSFQ-DGTTLVG 2036 (2233)
Q Consensus      2010 ~~GVSVyYS~tl~LLF~SY~-~G~Sf~a 2036 (2233)
                      .=--+|.|.+.+.-|++|=. +-+..+.
T Consensus       259 HWvW~VRfn~~hdqLiLs~~SDs~V~Ls  286 (370)
T KOG1007|consen  259 HWVWAVRFNPEHDQLILSGGSDSAVNLS  286 (370)
T ss_pred             eEEEEEEecCccceEEEecCCCceeEEE
Confidence            22457889999988888864 3344433


No 121
>CHL00052 rpl2 ribosomal protein L2
Probab=23.89  E-value=1.4e+03  Score=28.69  Aligned_cols=130  Identities=22%  Similarity=0.319  Sum_probs=87.3

Q ss_pred             cccccccccccccceEEEEeecccCccceEEeecccc-eEEEEecCCCc-eeeeee------e----eecc----CCceE
Q 047869         1850 DKTNVKPLSRNIVRFEIVHLAFNSIVENYLTVAGYED-CQVLTLNPRGE-VTDRLA------I----ELAL----QGAYI 1913 (2233)
Q Consensus      1850 dKlTLtrLSsa~VpFeVlsLafNP~nEdyLAVcGLkD-C~VLTfss~Ge-V~DRL~------L----eL~L----eg~fI 1913 (2233)
                      +.++.++ ....++-.|++|-++|.-.-+||.+=+.| -.-+.+.+.|- +.|.+.      +    -+.|    +|.+|
T Consensus        61 R~IDf~r-~~~~i~~~V~~IeyDP~Rsa~IAlv~~~~g~~~YIlAp~gl~~Gd~I~~g~~~~i~~Gn~lpL~~IP~Gt~I  139 (273)
T CHL00052         61 RKIDFRR-NKKDIYGRIVTIEYDPNRNAYICLIHYGDGEKRYILHPRGLKIGDTIVSGTEAPIKIGNALPLTNIPLGTAI  139 (273)
T ss_pred             ceecccc-ccCCCcEEEEEEEECCCCCccEEEEEeCCCcEEEEEccCCCCCCCEEEeCCCCCCCcccccccccCCCCCEE
Confidence            3444444 23457899999999999999999997776 45566666652 222221      1    1111    38899


Q ss_pred             EEeEEecCCCceEEEEecCeEEEEeCcCCCCCCcEEEEcCCCCeeE------EEEEEecCCcEEEEEEecCCceEEE
Q 047869         1914 RRVDWVPGSPVQLMVVTNKFVKIYDLSQDNISPLHYFTLPDDMIVD------ATLVIASRGKMFLIVLSECGSLYRL 1984 (2233)
Q Consensus      1914 IKa~WLPGSQt~LAVVT~~FVKIYDLS~D~lSPvyyF~LpsGkIrD------aTfv~~e~G~~~ILVLSS~G~LY~Q 1984 (2233)
                      -+++=.||.-.+||=+.-.|..|-.  ++  .=.....||+|+++.      ||+....++......+-.+|.-+..
T Consensus       140 ~NIE~~pg~Ggk~~RsAGt~A~ii~--k~--~~~~~vkLPSGe~r~v~~~c~AtIG~Vsn~~~~~~~lgKAG~~r~l  212 (273)
T CHL00052        140 HNIEITPGKGGQLARAAGAVAKLIA--KE--GKSATLKLPSGEVRLISKNCSATIGQVGNVDVNNKSLGKAGSKRWL  212 (273)
T ss_pred             EEEEecCCCCceEEEecCCeEEEEE--ec--CCEEEEECCCCCeEEECCcCeEEEEEccCCchhhcEecchhhhhcC
Confidence            9999999999999988888888864  22  224567899999865      4554444444444555667766653


No 122
>KOG0642 consensus Cell-cycle nuclear protein, contains WD-40 repeats [Cell cycle control, cell division, chromosome partitioning]
Probab=23.15  E-value=8.7e+02  Score=32.85  Aligned_cols=118  Identities=19%  Similarity=0.160  Sum_probs=71.3

Q ss_pred             EEEeEEecCCCceEEEEecC-eEEEEeCcCCC-------CCCcEEEEcCCCCeeEEEEEEecCCcEEEEEEecCCceEEE
Q 047869         1913 IRRVDWVPGSPVQLMVVTNK-FVKIYDLSQDN-------ISPLHYFTLPDDMIVDATLVIASRGKMFLIVLSECGSLYRL 1984 (2233)
Q Consensus      1913 IIKa~WLPGSQt~LAVVT~~-FVKIYDLS~D~-------lSPvyyF~LpsGkIrDaTfv~~e~G~~~ILVLSS~G~LY~Q 1984 (2233)
                      .|++.=.|+++.-|+.++-+ ++|+|.| +++       +-|.|+|.-=.|.|.-.++.  ++| -+++--+-+|.|-.=
T Consensus       296 ~ir~l~~~~sep~lit~sed~~lk~WnL-qk~~~s~~~~~epi~tfraH~gPVl~v~v~--~n~-~~~ysgg~Dg~I~~w  371 (577)
T KOG0642|consen  296 CIRALAFHPSEPVLITASEDGTLKLWNL-QKAKKSAEKDVEPILTFRAHEGPVLCVVVP--SNG-EHCYSGGIDGTIRCW  371 (577)
T ss_pred             hhhhhhcCCCCCeEEEeccccchhhhhh-cccCCccccceeeeEEEecccCceEEEEec--CCc-eEEEeeccCceeeee
Confidence            55566667787766655544 8899999 332       22888888888887655533  445 366777778888777


Q ss_pred             EecccCCCccccceeeeecccccccCCeE-EEEeccccceeeEEe-cCCcEEEE
Q 047869         1985 ELSVEGNVGATPLKEIIQFNDREIHAKGL-SLYFSSTYKLLFLSF-QDGTTLVG 2036 (2233)
Q Consensus      1985 els~s~d~g~~~ltEvvq~~~~q~~~~GV-SVyYS~tl~LLF~SY-~~G~Sf~a 2036 (2233)
                      .++..++....+-..++.-.-. -..+.| .+.||.+-+-| +|+ .+||...=
T Consensus       372 ~~p~n~dp~ds~dp~vl~~~l~-Ghtdavw~l~~s~~~~~L-lscs~DgTvr~w  423 (577)
T KOG0642|consen  372 NLPPNQDPDDSYDPSVLSGTLL-GHTDAVWLLALSSTKDRL-LSCSSDGTVRLW  423 (577)
T ss_pred             ccCCCCCcccccCcchhcccee-ccccceeeeeecccccce-eeecCCceEEee
Confidence            7775555444442222211111 122244 67888887774 344 57776653


No 123
>KOG1273 consensus WD40 repeat protein [General function prediction only]
Probab=23.06  E-value=1.4e+03  Score=29.42  Aligned_cols=188  Identities=21%  Similarity=0.271  Sum_probs=0.0

Q ss_pred             hHHhHHHhhcCcccccceecccCce-EEEe-eCCeEEEEechhhhcccccCCccccccccccccccccceEEEEeecccC
Q 047869         1797 ARELKSHLASGSLVKSLLSVSSRGR-LAVG-EGDKVAIFDVGQLIGQATIQPVTADKTNVKPLSRNIVRFEIVHLAFNSI 1874 (2233)
Q Consensus      1797 ~relks~l~sGq~iRqLLSas~rGr-LAVa-EgdKVTILqlsaLLkQad~s~~skdKlTLtrLSsa~VpFeVlsLafNP~ 1874 (2233)
                      .+++-..++.+.  ...+.-++-|. |||+ -.|.|.|++.              ++..+-+.=++-++ .|-++..+|+
T Consensus        14 PEel~~tld~~~--a~~~~Fs~~G~~lAvGc~nG~vvI~D~--------------~T~~iar~lsaH~~-pi~sl~WS~d   76 (405)
T KOG1273|consen   14 PEELTHTLDNPL--AECCQFSRWGDYLAVGCANGRVVIYDF--------------DTFRIARMLSAHVR-PITSLCWSRD   76 (405)
T ss_pred             hHhhceeccCCc--cceEEeccCcceeeeeccCCcEEEEEc--------------cccchhhhhhcccc-ceeEEEecCC


Q ss_pred             ccceEEeecccceEEEEecCCCceeeeeeeeeccCCceEEEeEEecCCCceEEEEecC-eEEEEeCcCCCCCCcEEEEcC
Q 047869         1875 VENYLTVAGYEDCQVLTLNPRGEVTDRLAIELALQGAYIRRVDWVPGSPVQLMVVTNK-FVKIYDLSQDNISPLHYFTLP 1953 (2233)
Q Consensus      1875 nEdyLAVcGLkDC~VLTfss~GeV~DRL~LeL~Leg~fIIKa~WLPGSQt~LAVVT~~-FVKIYDLS~D~lSPvyyF~Lp 1953 (2233)
                      -...|----=+-|...-+ .+|+..-|+..     ..-|-.|.|.|..+....+.-.+ .--+-+++-    |+|.++-.
T Consensus        77 gr~LltsS~D~si~lwDl-~~gs~l~rirf-----~spv~~~q~hp~k~n~~va~~~~~sp~vi~~s~----~~h~~Lp~  146 (405)
T KOG1273|consen   77 GRKLLTSSRDWSIKLWDL-LKGSPLKRIRF-----DSPVWGAQWHPRKRNKCVATIMEESPVVIDFSD----PKHSVLPK  146 (405)
T ss_pred             CCEeeeecCCceeEEEec-cCCCceeEEEc-----cCccceeeeccccCCeEEEEEecCCcEEEEecC----CceeeccC


Q ss_pred             CC----CeeEEEEEEecCCcEEEEEEecCCceEEEEecccCCCccccceeeeecccccccCCe
Q 047869         1954 DD----MIVDATLVIASRGKMFLIVLSECGSLYRLELSVEGNVGATPLKEIIQFNDREIHAKG 2012 (2233)
Q Consensus      1954 sG----kIrDaTfv~~e~G~~~ILVLSS~G~LY~Qels~s~d~g~~~ltEvvq~~~~q~~~~G 2012 (2233)
                      ++    ++.+..-+++..| .+|++-++.|.+..-+-+.--=..++..|.+.++....+..+|
T Consensus       147 d~d~dln~sas~~~fdr~g-~yIitGtsKGkllv~~a~t~e~vas~rits~~~IK~I~~s~~g  208 (405)
T KOG1273|consen  147 DDDGDLNSSASHGVFDRRG-KYIITGTSKGKLLVYDAETLECVASFRITSVQAIKQIIVSRKG  208 (405)
T ss_pred             CCccccccccccccccCCC-CEEEEecCcceEEEEecchheeeeeeeechheeeeEEEEeccC


No 124
>KOG0319 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=22.41  E-value=2.3e+03  Score=30.06  Aligned_cols=160  Identities=17%  Similarity=0.202  Sum_probs=99.9

Q ss_pred             hcCcccccceecccCceEEEe-eCCeEEEEechhhhcccccCCccccccccccccccccceEEEEeecccCccceEEeec
Q 047869         1805 ASGSLVKSLLSVSSRGRLAVG-EGDKVAIFDVGQLIGQATIQPVTADKTNVKPLSRNIVRFEIVHLAFNSIVENYLTVAG 1883 (2233)
Q Consensus      1805 ~sGq~iRqLLSas~rGrLAVa-EgdKVTILqlsaLLkQad~s~~skdKlTLtrLSsa~VpFeVlsLafNP~nEdyLAVcG 1883 (2233)
                      ..|+.++-...... |.||.+ --++|-|++..-   +...       .     +-..-|=.|-++.|+|--...|.+.|
T Consensus       104 He~Pvi~ma~~~~g-~LlAtggaD~~v~VWdi~~---~~~t-------h-----~fkG~gGvVssl~F~~~~~~~lL~sg  167 (775)
T KOG0319|consen  104 HEAPVITMAFDPTG-TLLATGGADGRVKVWDIKN---GYCT-------H-----SFKGHGGVVSSLLFHPHWNRWLLASG  167 (775)
T ss_pred             cCCCeEEEEEcCCC-ceEEeccccceEEEEEeeC---CEEE-------E-----EecCCCceEEEEEeCCccchhheeec
Confidence            34555544333333 666764 666777777632   1110       1     11233456889999998777899999


Q ss_pred             ccceEEEEecCCCceeeeeeeeec-cCCceEEEeEEecCCCceEEEEecCeEEEEeCcCCCCCCcEEEEcC-CCCeeEEE
Q 047869         1884 YEDCQVLTLNPRGEVTDRLAIELA-LQGAYIRRVDWVPGSPVQLMVVTNKFVKIYDLSQDNISPLHYFTLP-DDMIVDAT 1961 (2233)
Q Consensus      1884 LkDC~VLTfss~GeV~DRL~LeL~-Leg~fIIKa~WLPGSQt~LAVVT~~FVKIYDLS~D~lSPvyyF~Lp-sGkIrDaT 1961 (2233)
                      .-|-.|.-.|-+-...   .++.. ....-++-..-.+.+.+.+++-.-+-+-||||-    .-..-=++| ...+..++
T Consensus       168 ~~D~~v~vwnl~~~~t---cl~~~~~H~S~vtsL~~~~d~~~~ls~~RDkvi~vwd~~----~~~~l~~lp~ye~~E~vv  240 (775)
T KOG0319|consen  168 ATDGTVRVWNLNDKRT---CLHTMILHKSAVTSLAFSEDSLELLSVGRDKVIIVWDLV----QYKKLKTLPLYESLESVV  240 (775)
T ss_pred             CCCceEEEEEcccCch---HHHHHHhhhhheeeeeeccCCceEEEeccCcEEEEeehh----hhhhhheechhhheeeEE
Confidence            9999999988773222   33222 235557777778889888888888899999992    112222334 23567777


Q ss_pred             EEEecCCcE--EEEEEecCCceEEEEec
Q 047869         1962 LVIASRGKM--FLIVLSECGSLYRLELS 1987 (2233)
Q Consensus      1962 fv~~e~G~~--~ILVLSS~G~LY~Qels 1987 (2233)
                      +...+.|..  +++...++|.+=+-+.+
T Consensus       241 ~l~~~~~~~~~~~~TaG~~g~~~~~d~e  268 (775)
T KOG0319|consen  241 RLREELGGKGEYIITAGGSGVVQYWDSE  268 (775)
T ss_pred             EechhcCCcceEEEEecCCceEEEEecc
Confidence            766555544  77777777776554443


No 125
>KOG0296 consensus Angio-associated migratory cell protein (contains WD40 repeats) [Function unknown]
Probab=22.26  E-value=8.2e+02  Score=31.73  Aligned_cols=141  Identities=15%  Similarity=0.201  Sum_probs=0.0

Q ss_pred             EEeecccceEEEEe-cCCCceeeeee-----eeeccC--CceEEEeEEecCCCceEEEEecC-eEEEEeCcCCCCCCcEE
Q 047869         1879 LTVAGYEDCQVLTL-NPRGEVTDRLA-----IELALQ--GAYIRRVDWVPGSPVQLMVVTNK-FVKIYDLSQDNISPLHY 1949 (2233)
Q Consensus      1879 LAVcGLkDC~VLTf-ss~GeV~DRL~-----LeL~Le--g~fIIKa~WLPGSQt~LAVVT~~-FVKIYDLS~D~lSPvyy 1949 (2233)
                      +++.|-+||.++-. +..|.|+--..     +.++.+  .+-|--+-|-..-.- .|+-..+ .|-|||++...  |.|-
T Consensus       247 ~~~~g~~e~~~~~~~~~sgKVv~~~n~~~~~l~~~~e~~~esve~~~~ss~lpL-~A~G~vdG~i~iyD~a~~~--~R~~  323 (399)
T KOG0296|consen  247 TLTKGNSEGVACGVNNGSGKVVNCNNGTVPELKPSQEELDESVESIPSSSKLPL-AACGSVDGTIAIYDLAAST--LRHI  323 (399)
T ss_pred             eeEeccCCccEEEEccccceEEEecCCCCccccccchhhhhhhhhcccccccch-hhcccccceEEEEecccch--hhee


Q ss_pred             EEcCCCCeeEEEEEEecCCcEEEEEEecCCceEEEEecccCCCccccceeeeecccccccCCeEEEEeccccceeeEEec
Q 047869         1950 FTLPDDMIVDATLVIASRGKMFLIVLSECGSLYRLELSVEGNVGATPLKEIIQFNDREIHAKGLSLYFSSTYKLLFLSFQ 2029 (2233)
Q Consensus      1950 F~LpsGkIrDaTfv~~e~G~~~ILVLSS~G~LY~Qels~s~d~g~~~ltEvvq~~~~q~~~~GVSVyYS~tl~LLF~SY~ 2029 (2233)
                      +..+.| |++..+.    +.-+++..+.+|.+|.-+.+                     .|.-+-.|--|+...|=|+|.
T Consensus       324 c~he~~-V~~l~w~----~t~~l~t~c~~g~v~~wDaR---------------------tG~l~~~y~GH~~~Il~f~ls  377 (399)
T KOG0296|consen  324 CEHEDG-VTKLKWL----NTDYLLTACANGKVRQWDAR---------------------TGQLKFTYTGHQMGILDFALS  377 (399)
T ss_pred             ccCCCc-eEEEEEc----CcchheeeccCceEEeeecc---------------------ccceEEEEecCchheeEEEEc


Q ss_pred             CCcEEEEEcCCCcccccceeEEEE
Q 047869         2030 DGTTLVGRLSPNAASLSEVSYVFE 2053 (2233)
Q Consensus      2030 ~G~Sf~a~Ls~~~~sv~eis~Vfe 2053 (2233)
                      -.+.++-..+..+     ...||+
T Consensus       378 ~~~~~vvT~s~D~-----~a~VF~  396 (399)
T KOG0296|consen  378 PQKRLVVTVSDDN-----TALVFE  396 (399)
T ss_pred             CCCcEEEEecCCC-----eEEEEe


No 126
>KOG1407 consensus WD40 repeat protein [Function unknown]
Probab=21.94  E-value=4.3e+02  Score=32.93  Aligned_cols=143  Identities=15%  Similarity=0.237  Sum_probs=93.3

Q ss_pred             cCceEEEe-eCCeEEEEechhhhcccccCCccccccccccccccccceEEEEeecccCccceEEeecccceEEEEecCCC
Q 047869         1818 SRGRLAVG-EGDKVAIFDVGQLIGQATIQPVTADKTNVKPLSRNIVRFEIVHLAFNSIVENYLTVAGYEDCQVLTLNPRG 1896 (2233)
Q Consensus      1818 ~rGrLAVa-EgdKVTILqlsaLLkQad~s~~skdKlTLtrLSsa~VpFeVlsLafNP~nEdyLAVcGLkDC~VLTfss~G 1896 (2233)
                      .++..||. ..|.|+.++.                .+.+++....-+|+|..+.+|-.|+=|.+-.|+--++||.+ |.=
T Consensus       117 ~g~~~~~~~kdD~it~id~----------------r~~~~~~~~~~~~e~ne~~w~~~nd~Fflt~GlG~v~ILsy-psL  179 (313)
T KOG1407|consen  117 DGEYIAVGNKDDRITFIDA----------------RTYKIVNEEQFKFEVNEISWNNSNDLFFLTNGLGCVEILSY-PSL  179 (313)
T ss_pred             CCCEEEEecCcccEEEEEe----------------cccceeehhcccceeeeeeecCCCCEEEEecCCceEEEEec-ccc
Confidence            45566665 6667766654                45567888899999999999988888999999999999998 432


Q ss_pred             ceeeeeeeeeccCCceEEEeEEecCCCceEEEEecC-eEEEEeCcCCCCCCcEEEEcCCCCeeEEEEEEecCCcEEEEEE
Q 047869         1897 EVTDRLAIELALQGAYIRRVDWVPGSPVQLMVVTNK-FVKIYDLSQDNISPLHYFTLPDDMIVDATLVIASRGKMFLIVL 1975 (2233)
Q Consensus      1897 eV~DRL~LeL~Leg~fIIKa~WLPGSQt~LAVVT~~-FVKIYDLS~D~lSPvyyF~LpsGkIrDaTfv~~e~G~~~ILVL 1975 (2233)
                      +-++-+.-|+    .=++.+..-|.-.. +|+=.++ -|..||+..=..  .-.|.=.+=.||..+|-+  +|+ +|--.
T Consensus       180 kpv~si~AH~----snCicI~f~p~Gry-fA~GsADAlvSLWD~~ELiC--~R~isRldwpVRTlSFS~--dg~-~lASa  249 (313)
T KOG1407|consen  180 KPVQSIKAHP----SNCICIEFDPDGRY-FATGSADALVSLWDVDELIC--ERCISRLDWPVRTLSFSH--DGR-MLASA  249 (313)
T ss_pred             ccccccccCC----cceEEEEECCCCce-EeeccccceeeccChhHhhh--heeeccccCceEEEEecc--Ccc-eeecc
Confidence            2222222232    33667777788766 8888888 899999865332  222222233578777653  453 34444


Q ss_pred             ecCCceEEEEec
Q 047869         1976 SECGSLYRLELS 1987 (2233)
Q Consensus      1976 SS~G~LY~Qels 1987 (2233)
                      |++=+|=+-+++
T Consensus       250 SEDh~IDIA~ve  261 (313)
T KOG1407|consen  250 SEDHFIDIAEVE  261 (313)
T ss_pred             CccceEEeEecc
Confidence            455444444444


No 127
>KOG0639 consensus Transducin-like enhancer of split protein (contains WD40 repeats) [Chromatin structure and dynamics]
Probab=21.57  E-value=6.3e+02  Score=33.80  Aligned_cols=159  Identities=20%  Similarity=0.248  Sum_probs=94.5

Q ss_pred             ceEEeecccceEEEEecCCCceeeeeeeeeccCCceEEEeEEecCCCceEEEEecCeEEEEeCcCCCCC---------Cc
Q 047869         1877 NYLTVAGYEDCQVLTLNPRGEVTDRLAIELALQGAYIRRVDWVPGSPVQLMVVTNKFVKIYDLSQDNIS---------PL 1947 (2233)
Q Consensus      1877 dyLAVcGLkDC~VLTfss~GeV~DRL~LeL~Leg~fIIKa~WLPGSQt~LAVVT~~FVKIYDLS~D~lS---------Pv 1947 (2233)
                      .+.--||-.=+.|--++..|.-.---++.-...+||||-+--+|++.+.|.==-+-.|.||||+....-         |-
T Consensus       432 rhVyTgGkgcVKVWdis~pg~k~PvsqLdcl~rdnyiRSckL~pdgrtLivGGeastlsiWDLAapTprikaeltssapa  511 (705)
T KOG0639|consen  432 RHVYTGGKGCVKVWDISQPGNKSPVSQLDCLNRDNYIRSCKLLPDGRTLIVGGEASTLSIWDLAAPTPRIKAELTSSAPA  511 (705)
T ss_pred             ceeEecCCCeEEEeeccCCCCCCccccccccCcccceeeeEecCCCceEEeccccceeeeeeccCCCcchhhhcCCcchh
Confidence            344455666667888877755444444455556999999999999999886555668999999865421         22


Q ss_pred             EE-----------E-EcCCCCee-----EEEEEEe----cCCcEEEEEEecCCceEE-EEecccCCCccccceeeeeccc
Q 047869         1948 HY-----------F-TLPDDMIV-----DATLVIA----SRGKMFLIVLSECGSLYR-LELSVEGNVGATPLKEIIQFND 2005 (2233)
Q Consensus      1948 yy-----------F-~LpsGkIr-----DaTfv~~----e~G~~~ILVLSS~G~LY~-Qels~s~d~g~~~ltEvvq~~~ 2005 (2233)
                      +|           | -+.+|+|+     +-|.+-+    .+|..+ |.+|.+|+-.+ --+.  .-.-.-.+-|..|+--
T Consensus       512 CyALa~spDakvcFsccsdGnI~vwDLhnq~~VrqfqGhtDGasc-Idis~dGtklWTGGlD--ntvRcWDlregrqlqq  588 (705)
T KOG0639|consen  512 CYALAISPDAKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASC-IDISKDGTKLWTGGLD--NTVRCWDLREGRQLQQ  588 (705)
T ss_pred             hhhhhcCCccceeeeeccCCcEEEEEcccceeeecccCCCCCcee-EEecCCCceeecCCCc--cceeehhhhhhhhhhh
Confidence            21           2 23345442     2333321    245433 44456665332 1110  0001112334445545


Q ss_pred             ccccCCeEEEEeccccceeeEEecCCcEEEEEc
Q 047869         2006 REIHAKGLSLYFSSTYKLLFLSFQDGTTLVGRL 2038 (2233)
Q Consensus      2006 ~q~~~~GVSVyYS~tl~LLF~SY~~G~Sf~a~L 2038 (2233)
                      .+...-=+|+-|+++-..|-+.++++..-+-..
T Consensus       589 hdF~SQIfSLg~cP~~dWlavGMens~vevlh~  621 (705)
T KOG0639|consen  589 HDFSSQIFSLGYCPTGDWLAVGMENSNVEVLHT  621 (705)
T ss_pred             hhhhhhheecccCCCccceeeecccCcEEEEec
Confidence            556677889999999999999999887766443


No 128
>KOG1587 consensus Cytoplasmic dynein intermediate chain [Cytoskeleton]
Probab=20.60  E-value=1.1e+03  Score=32.04  Aligned_cols=162  Identities=19%  Similarity=0.173  Sum_probs=107.4

Q ss_pred             EEeecccCccceEEeecccceEEEEecCCCceeeee-----eeeeccCCceEEEeEEecCCCceEEEEecCeEEEEeCcC
Q 047869         1867 VHLAFNSIVENYLTVAGYEDCQVLTLNPRGEVTDRL-----AIELALQGAYIRRVDWVPGSPVQLMVVTNKFVKIYDLSQ 1941 (2233)
Q Consensus      1867 lsLafNP~nEdyLAVcGLkDC~VLTfss~GeV~DRL-----~LeL~Leg~fIIKa~WLPGSQt~LAVVT~~FVKIYDLS~ 1941 (2233)
                      .+|+|=|. +.+.-+.|-.+=+|++-+..|.-..--     +..+..-...|.-+.|-|=+.-.+..+-.-.||||.-..
T Consensus       351 t~~~F~~~-~p~~FiVGTe~G~v~~~~r~g~~~~~~~~~~~~~~~~~h~g~v~~v~~nPF~~k~fls~gDW~vriWs~~~  429 (555)
T KOG1587|consen  351 TSLKFEPT-DPNHFIVGTEEGKVYKGCRKGYTPAPEVSYKGHSTFITHIGPVYAVSRNPFYPKNFLSVGDWTVRIWSEDV  429 (555)
T ss_pred             eeEeeccC-CCceEEEEcCCcEEEEEeccCCcccccccccccccccccCcceEeeecCCCccceeeeeccceeEeccccC
Confidence            35667554 444467799999999988777544431     222223356788889999887777777777999998765


Q ss_pred             CCCCCcEEEEcCCCCeeEEEEEEecCCcEEEEEE-ecCCceEEEEecccCCCccccceeeeecccccccCCeEEEEeccc
Q 047869         1942 DNISPLHYFTLPDDMIVDATLVIASRGKMFLIVL-SECGSLYRLELSVEGNVGATPLKEIIQFNDREIHAKGLSLYFSST 2020 (2233)
Q Consensus      1942 D~lSPvyyF~LpsGkIrDaTfv~~e~G~~~ILVL-SS~G~LY~Qels~s~d~g~~~ltEvvq~~~~q~~~~GVSVyYS~t 2020 (2233)
                       ..+|.+-|-.-++.+.|+. |.  -.+-.+++. ..+|+|+.=++..+.........  +.  ....+    -+..++.
T Consensus       430 -~~~Pl~~~~~~~~~v~~va-WS--ptrpavF~~~d~~G~l~iWDLl~~~~~Pv~s~~--~~--~~~l~----~~~~s~~  497 (555)
T KOG1587|consen  430 -IASPLLSLDSSPDYVTDVA-WS--PTRPAVFATVDGDGNLDIWDLLQDDEEPVLSQK--VC--SPALT----RVRWSPN  497 (555)
T ss_pred             -CCCcchhhhhccceeeeeE-Ec--CcCceEEEEEcCCCceehhhhhccccCCccccc--cc--ccccc----eeecCCC
Confidence             5689999988888888877 53  233344444 45999998777643222111100  00  00011    1466777


Q ss_pred             cceeeEEecCCcEEEEEcCCC
Q 047869         2021 YKLLFLSFQDGTTLVGRLSPN 2041 (2233)
Q Consensus      2021 l~LLF~SY~~G~Sf~a~Ls~~ 2041 (2233)
                      .++|.+.=.+|++++-.|+.+
T Consensus       498 g~~lavGd~~G~~~~~~l~~~  518 (555)
T KOG1587|consen  498 GKLLAVGDANGTTHILKLSES  518 (555)
T ss_pred             CcEEEEecCCCcEEEEEcCch
Confidence            899999999999999888544


No 129
>KOG3334 consensus Transcription initiation factor TFIID, subunit TAF9 (also component of histone acetyltransferase SAGA) [Transcription]
Probab=20.32  E-value=76  Score=35.81  Aligned_cols=50  Identities=32%  Similarity=0.527  Sum_probs=36.7

Q ss_pred             hhhHHHHhhccccceecccchhHHHHHHHHHHHhhhh--------------hccccccCccccchhH
Q 047869           92 SLGHVIASASRSLAVEQAGPVIVAVMQELLEFAVCYL--------------ERSEFDNDDFSVQNHM  144 (2233)
Q Consensus        92 sl~~~i~~~~rslsv~q~~p~~v~v~q~~~ef~~~~l--------------e~s~~~~~d~~~~~~~  144 (2233)
                      -=+.+|++..|||.+++.||.+   +-++||||-.|-              +|...+.+|....+.|
T Consensus        14 kDa~~i~~iL~s~GI~eyEprV---i~qlLefa~rYtt~vL~DA~vys~HA~ka~i~~eDVrlA~~~   77 (148)
T KOG3334|consen   14 KDARVIASILKSLGIQEYEPRV---INQLLEFAYRYTTTVLDDAKVYSSHAKKATIDAEDVRLAIQM   77 (148)
T ss_pred             HHHHHHHHHHHHcCccccChHH---HHHHHHHHHHHHHHHHHHHHHHHHhccCCCCcHHHHHHHHHH
Confidence            3468899999999999999953   457777777663              3555666676666555


No 130
>PF09826 Beta_propel:  Beta propeller domain;  InterPro: IPR019198 This entry consists of predicted secreted proteins containing a C-terminal beta-propeller domain distantly related to WD-40 repeats. 
Probab=20.19  E-value=5.7e+02  Score=34.01  Aligned_cols=108  Identities=22%  Similarity=0.311  Sum_probs=80.7

Q ss_pred             EeeCCeEEEEechhhhcccccCCcccccccc-ccccccccceEEEEeecccCccceEEeecc--------cceEEEEecC
Q 047869         1824 VGEGDKVAIFDVGQLIGQATIQPVTADKTNV-KPLSRNIVRFEIVHLAFNSIVENYLTVAGY--------EDCQVLTLNP 1894 (2233)
Q Consensus      1824 VaEgdKVTILqlsaLLkQad~s~~skdKlTL-trLSsa~VpFeVlsLafNP~nEdyLAVcGL--------kDC~VLTfss 1894 (2233)
                      ...|=||++||++..-.     |..++|..+ .+-+..++-.+=..+.|.| ..+.||+-=.        ...+|+.+++
T Consensus       398 ~~~GlKisLFDVSD~~~-----P~e~~~~~iG~~~s~S~a~~dhkAfl~~~-~~~ll~~Pv~~~~~~~~~~g~~v~~i~~  471 (521)
T PF09826_consen  398 WTQGLKISLFDVSDPAN-----PKELDKEVIGDRGSYSEALYDHKAFLFDK-EKNLLAFPVSSSYGYFNFQGAYVFSIDP  471 (521)
T ss_pred             ccceeEEEEEecCCCCC-----ccEeEEEEcCCCCccCccccCceEEEEeC-CCCEEEEEEEEccCccccceEEEEEEeC
Confidence            34556999999976543     334667777 6677777777777888877 4466666544        6788999997


Q ss_pred             CCceeeeeeeeeccC----CceEEEeEEecCCCceEEEEecCeEEEEeCc
Q 047869         1895 RGEVTDRLAIELALQ----GAYIRRVDWVPGSPVQLMVVTNKFVKIYDLS 1940 (2233)
Q Consensus      1895 ~GeV~DRL~LeL~Le----g~fIIKa~WLPGSQt~LAVVT~~FVKIYDLS 1940 (2233)
                      +..+..+-.|+..-+    ...|.|+.-+.+.   |-.++...||+|||.
T Consensus       472 ~~g~~~~g~i~h~~~~~~~~~~~~R~lyi~d~---lYtvS~~~i~~~~l~  518 (521)
T PF09826_consen  472 EDGFTLKGKITHPSPDYYYSYQIQRSLYIGDT---LYTVSDNGIKAYDLN  518 (521)
T ss_pred             CCCeEEEEEEEccCcccccccceeEEEEECCE---EEEEECCEEEEEehH
Confidence            777887777765543    3458888888874   889999999999986


Done!