Query         001853
Match_columns 1004
No_of_seqs    226 out of 560
Neff          7.3 
Searched_HMMs 46136
Date          Fri Mar 29 10:52:35 2013
Command       hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/001853.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/001853hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG1896 mRNA cleavage and poly 100.0  1E-149  3E-154 1317.6  74.5  875    2-1001    1-953 (1366)
  2 KOG1897 Damage-specific DNA bi 100.0 2.2E-89 4.7E-94  797.3  68.3  693    2-1001    1-714 (1096)
  3 COG5161 SFT1 Pre-mRNA cleavage 100.0 2.4E-79 5.2E-84  697.3  45.9  826    1-1001    1-908 (1319)
  4 KOG1898 Splicing factor 3b, su 100.0 1.4E-68   3E-73  625.1  49.8  737    3-1001    2-778 (1205)
  5 PF10433 MMS1_N:  Mono-function 100.0 2.4E-55 5.2E-60  525.0  48.2  434  131-693     1-483 (504)
  6 PF08596 Lgl_C:  Lethal giant l  90.3     8.9 0.00019   44.9  16.7   75  753-857    99-174 (395)
  7 COG4247 Phy 3-phytase (myo-ino  90.1     4.4 9.5E-05   43.6  12.4  128  606-770    52-187 (364)
  8 PF14727 PHTB1_N:  PTHB1 N-term  90.1      31 0.00067   40.7  20.8   69  913-982   243-316 (418)
  9 KOG0294 WD40 repeat-containing  87.8      16 0.00035   40.6  15.0   81  659-778    44-124 (362)
 10 PF14727 PHTB1_N:  PTHB1 N-term  85.3      87  0.0019   37.1  25.9   76   56-149    89-164 (418)
 11 COG5161 SFT1 Pre-mRNA cleavage  84.9    0.22 4.9E-06   60.7  -0.8   90  100-206    88-177 (1319)
 12 KOG1274 WD40 repeat protein [G  81.8   1E+02  0.0022   39.1  19.6  147  578-781    27-180 (933)
 13 COG2706 3-carboxymuconate cycl  80.8 1.1E+02  0.0024   34.9  24.3   69  129-210    50-122 (346)
 14 PF14783 BBS2_Mid:  Ciliary BBS  80.8      51  0.0011   31.4  13.3   92  623-765    19-110 (111)
 15 KOG1539 WD repeat protein [Gen  78.3   1E+02  0.0022   38.8  17.9   81  661-781   205-287 (910)
 16 KOG0649 WD40 repeat protein [G  77.4   1E+02  0.0023   33.4  15.6   73  667-776   167-242 (325)
 17 PF03178 CPSF_A:  CPSF A subuni  73.9      19 0.00042   40.6  10.3   69   56-158    99-167 (321)
 18 KOG0294 WD40 repeat-containing  70.3 1.9E+02  0.0042   32.6  22.6   95   63-207    62-157 (362)
 19 COG2706 3-carboxymuconate cycl  69.7      90  0.0019   35.5  13.7   71  130-207   202-274 (346)
 20 PF02333 Phytase:  Phytase;  In  69.5      73  0.0016   37.1  13.5   61  678-769   127-189 (381)
 21 PF03178 CPSF_A:  CPSF A subuni  68.1 2.1E+02  0.0046   32.2  19.6   63   64-158    62-124 (321)
 22 KOG0310 Conserved WD40 repeat-  67.7 2.6E+02  0.0057   33.1  18.7  101  628-777   175-276 (487)
 23 PF10282 Lactonase:  Lactonase,  67.2 1.2E+02  0.0026   34.7  15.0   89  100-209    26-119 (345)
 24 KOG2048 WD40 repeat protein [G  67.1 2.8E+02  0.0061   34.3  17.7   29  180-208   478-506 (691)
 25 KOG0318 WD40 repeat stress pro  66.5 2.8E+02  0.0062   33.3  17.2  118  606-772   403-520 (603)
 26 KOG2110 Uncharacterized conser  63.4 2.8E+02  0.0061   31.9  18.4  156  611-857    89-249 (391)
 27 cd00200 WD40 WD40 domain, foun  61.9   2E+02  0.0044   29.8  29.9   75  661-776   180-256 (289)
 28 KOG0289 mRNA splicing factor [  61.2 3.3E+02  0.0071   32.0  16.5  113  659-855   306-418 (506)
 29 KOG2048 WD40 repeat protein [G  57.5 4.6E+02  0.0099   32.5  44.4   97   56-204    38-137 (691)
 30 KOG2111 Uncharacterized conser  56.9      99  0.0021   34.7  10.9   22  749-770   236-257 (346)
 31 KOG2110 Uncharacterized conser  54.5 3.9E+02  0.0085   30.8  15.4   28  830-857   304-331 (391)
 32 KOG2055 WD40 repeat protein [G  54.1      51  0.0011   38.6   8.5   91  827-960   216-309 (514)
 33 PF07569 Hira:  TUP1-like enhan  51.5      54  0.0012   35.2   8.0   73  751-855    22-94  (219)
 34 KOG0285 Pleiotropic regulator   50.2 2.3E+02  0.0049   32.5  12.4  123  630-770   258-390 (460)
 35 KOG0306 WD40-repeat-containing  49.7 5.1E+02   0.011   32.6  16.1  122  660-856   414-538 (888)
 36 KOG0772 Uncharacterized conser  48.8 1.7E+02  0.0037   35.0  11.6  108  574-687   225-348 (641)
 37 KOG1897 Damage-specific DNA bi  48.4      66  0.0014   41.1   8.8   84    5-150   764-858 (1096)
 38 KOG2055 WD40 repeat protein [G  46.4 5.7E+02   0.012   30.4  17.6   25  752-776   316-340 (514)
 39 KOG0772 Uncharacterized conser  46.2      75  0.0016   37.8   8.2  111  668-855   281-393 (641)
 40 PF08596 Lgl_C:  Lethal giant l  46.2 1.6E+02  0.0036   34.5  11.4   96  668-777   155-251 (395)
 41 KOG0319 WD40-repeat-containing  45.5 3.6E+02  0.0078   33.7  14.0  117  607-770   322-443 (775)
 42 KOG0283 WD40 repeat-containing  45.4 7.1E+02   0.015   31.4  16.8  196  660-986   371-568 (712)
 43 KOG1407 WD40 repeat protein [F  44.7 1.9E+02  0.0041   31.8  10.4   98  629-778    87-186 (313)
 44 KOG0641 WD40 repeat protein [G  44.5 4.3E+02  0.0093   28.4  15.6   53  184-255    96-148 (350)
 45 PF14781 BBS2_N:  Ciliary BBSom  43.8 1.6E+02  0.0034   29.2   8.9   72   56-152    11-83  (136)
 46 KOG0279 G protein beta subunit  43.5 5.1E+02   0.011   28.9  17.1  117  661-856   195-313 (315)
 47 KOG4378 Nuclear protein COP1 [  41.1 3.6E+02  0.0078   32.2  12.6   87  659-784   122-210 (673)
 48 KOG1446 Histone H3 (Lys4) meth  41.0 5.6E+02   0.012   28.9  13.5  113  628-784   161-277 (311)
 49 PF12894 Apc4_WD40:  Anaphase-p  39.3      54  0.0012   26.2   4.2   41  101-149     2-42  (47)
 50 PF14779 BBS1:  Ciliary BBSome   38.7 1.8E+02  0.0039   32.0   9.5   62   55-146   195-256 (257)
 51 PF02239 Cytochrom_D1:  Cytochr  38.0 2.4E+02  0.0052   32.8  11.1   81  100-205    25-106 (369)
 52 KOG0289 mRNA splicing factor [  37.8 7.5E+02   0.016   29.2  17.6  100  575-687   314-420 (506)
 53 KOG0291 WD40-repeat-containing  37.5 9.6E+02   0.021   30.4  56.5  160  535-719   285-448 (893)
 54 KOG1538 Uncharacterized conser  37.0 5.2E+02   0.011   32.1  13.3   18   56-73     25-42  (1081)
 55 KOG0647 mRNA export protein (c  37.0 1.9E+02  0.0042   32.3   9.2   55  660-718   158-212 (347)
 56 PRK11028 6-phosphogluconolacto  36.7 6.1E+02   0.013   28.3  14.1   87  100-208    69-157 (330)
 57 KOG0295 WD40 repeat-containing  36.3 4.5E+02  0.0097   30.3  12.0   62  750-856   303-364 (406)
 58 KOG0296 Angio-associated migra  35.6 7.5E+02   0.016   28.6  13.8  118  602-778   111-229 (399)
 59 PTZ00421 coronin; Provisional   34.6 9.1E+02    0.02   29.3  16.2  119  658-855    75-197 (493)
 60 KOG0276 Vesicle coat complex C  34.5 2.9E+02  0.0062   34.0  10.7   97  668-852    25-121 (794)
 61 KOG0316 Conserved WD40 repeat-  33.5 6.7E+02   0.014   27.4  16.4  166  577-776    81-264 (307)
 62 cd00200 WD40 WD40 domain, foun  32.8 5.6E+02   0.012   26.3  26.8   29  660-688   137-167 (289)
 63 KOG1274 WD40 repeat protein [G  32.0 1.2E+03   0.027   30.0  20.3   53  628-689   117-171 (933)
 64 PF06977 SdiA-regulated:  SdiA-  30.9      82  0.0018   34.5   5.4   60  375-438   184-248 (248)
 65 KOG0643 Translation initiation  29.7 7.2E+02   0.016   27.6  11.9   66  611-687    54-129 (327)
 66 KOG0295 WD40 repeat-containing  29.1 2.8E+02   0.006   31.9   9.0   70  668-778   304-373 (406)
 67 PF07569 Hira:  TUP1-like enhan  28.2 2.6E+02  0.0057   29.9   8.6   33  660-692    14-46  (219)
 68 PF12894 Apc4_WD40:  Anaphase-p  27.7      61  0.0013   25.9   2.8   24  751-775    23-46  (47)
 69 KOG1898 Splicing factor 3b, su  27.7 1.6E+03   0.034   29.8  21.4   58  131-200   299-356 (1205)
 70 KOG0288 WD40 repeat protein Ti  27.4 6.6E+02   0.014   29.5  11.6   27  751-777   399-425 (459)
 71 KOG0641 WD40 repeat protein [G  26.3 8.4E+02   0.018   26.3  11.5   19  106-124   131-150 (350)
 72 KOG0639 Transducin-like enhanc  25.4 5.9E+02   0.013   30.6  11.0   36  751-786   521-556 (705)
 73 KOG0285 Pleiotropic regulator   25.0 2.4E+02  0.0051   32.4   7.5   61  750-855   162-222 (460)
 74 PRK11028 6-phosphogluconolacto  24.6   1E+03   0.022   26.6  31.4   99   62-207    10-110 (330)
 75 PF10282 Lactonase:  Lactonase,  24.1 1.1E+03   0.024   26.8  31.1   96  100-208    75-175 (345)
 76 KOG0288 WD40 repeat protein Ti  24.0 4.7E+02    0.01   30.6   9.7   85  102-207   333-417 (459)
 77 KOG4649 PQQ (pyrrolo-quinoline  24.0   6E+02   0.013   28.1  10.0   73  361-439    52-124 (354)
 78 KOG0283 WD40 repeat-containing  23.5 4.6E+02    0.01   33.0  10.3   75  660-776   411-488 (712)
 79 PTZ00420 coronin; Provisional   23.4 1.5E+03   0.032   28.1  18.7   84  660-777    76-164 (568)
 80 PTZ00420 coronin; Provisional   22.9 1.5E+03   0.033   28.0  23.0   50  939-989   283-333 (568)
 81 KOG0263 Transcription initiati  22.6 8.7E+02   0.019   30.6  12.3  113  611-770   529-650 (707)
 82 KOG0278 Serine/threonine kinas  22.0 1.9E+02   0.004   31.6   5.8   69  924-998   236-310 (334)
 83 PF02239 Cytochrom_D1:  Cytochr  21.7 8.3E+02   0.018   28.3  11.8   83  100-206   109-201 (369)
 84 PF11715 Nup160:  Nucleoporin N  21.7 4.6E+02    0.01   32.0  10.3   30  751-780   230-259 (547)
 85 KOG3881 Uncharacterized conser  20.3 1.1E+03   0.023   27.6  11.5   29  659-687   106-134 (412)

No 1  
>KOG1896 consensus mRNA cleavage and polyadenylation factor II complex, subunit CFT1 (CPSF subunit) [RNA processing and modification]
Probab=100.00  E-value=1.4e-149  Score=1317.60  Aligned_cols=875  Identities=40%  Similarity=0.664  Sum_probs=730.2

Q ss_pred             chhhhhhccCCCceeeEEEEEEecCCCCCCCCccccccccccccCCCCCCCCCCCeEEEEcCCEEEEEEEEecccccccc
Q 001853            2 SFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTAANVIEIYVVRVQEEGSKES   81 (1004)
Q Consensus         2 ~~~~~~~~~~pT~V~hsv~~~Ft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~nLVvak~n~LeIy~v~~~~~~~~~~   81 (1004)
                      +|++|++.|+||+|+||++|+||....                           +||||+++|.|+||++.++.+..+..
T Consensus         1 m~~vykq~h~~T~ve~s~ag~Ft~~~~---------------------------~nlvV~~~N~L~vyri~~~~e~~t~~   53 (1366)
T KOG1896|consen    1 MFAVYKQEHDPTVVENSSAGLFTNNRT---------------------------ENLVVAGTNILRVYRISRDAEALTKN   53 (1366)
T ss_pred             CcchhhhccCchhhccceeeeEecCCC---------------------------cceEEecccEEEEEEeccchhhcccc
Confidence            468999999999999999999998877                           99999999999999998764221110


Q ss_pred             cCCccccccccccccccccEEEEEEEEeeeeeeEeEEEecCCCCCCCCccEEEEEECCCeEEEEEEeCCCCcEEEEEeee
Q 001853           82 KNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHC  161 (1004)
Q Consensus        82 ~~~~~~~~~~~~~~~~~~kL~lv~e~~l~G~I~~l~~vr~~~s~~~~~~D~Llv~~~~aklsile~d~~~~~l~TvSlh~  161 (1004)
                       +      +..|++....+|+|+++|++||+|++|++++.+++    .+|+|+++|++||+|++|||+.+|.|+|+||||
T Consensus        54 -~------~~~~~~~~~~~LeLv~~~~l~GnV~si~~~~~~gs----~rD~LlL~f~~AKiSvlefD~~t~sl~TlSLHy  122 (1366)
T KOG1896|consen   54 -D------PGDMGKAHRKKLELVAEFKLFGNVTSIAKLPLKGS----NRDALLLLFKDAKISVLEFDPQTNSLRTLSLHY  122 (1366)
T ss_pred             -C------ccccccccceEEEEEEEEEeecceeeEEEeecCCC----CcceEEEEeccceEEEEEecCCccceeeeeeEE
Confidence             1      11222223347999999999999999999999999    899999999999999999999999999999999


Q ss_pred             ecCcchhcccCCcccccCCCeEEECCCCCEEEEEEecCeEEEEEcccCCCCCCCCCCCCCCCCCcccceeeeEEEEeccc
Q 001853          162 FESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDL  241 (1004)
Q Consensus       162 ~E~~~~~~~k~g~~~~~~~~~l~VDP~~Rca~l~~~~~~L~ilP~~~~~~~l~~~d~~~~~~~~~~~~~~~s~~i~l~~l  241 (1004)
                      ||+++   .+.|++....+|.++|||++||++|++|+..|+||||++.+ .+++++.- ..+...++++.+||+|.+++|
T Consensus       123 fE~~~---~~~~~~~~~~~p~vrvDPdsrCa~llvyg~~m~iLpf~~~e-~~~~~~~~-~~~~~~ss~~~pSyvi~~reL  197 (1366)
T KOG1896|consen  123 FEGPE---FRKGLVGRAKIPTVRVDPDSRCALLLVYGLRMAILPFRVNE-HLDDEELF-PSGFSKSSFTAPSYVIALREL  197 (1366)
T ss_pred             ecccc---ccccccccccCceEEECCCCCeEEEEEecceEEEeeccccc-cccccccc-cccccccccccceeEEEhhhh
Confidence            99998   55677766789999999999999999999999999998864 45444321 123344568999999999999


Q ss_pred             C--CCceeeEEEecCCCCceEEEEEEccCCcccceeeeeeeEEEEEEeeccccccccceeeeccCCCCCcEEEEecCCCC
Q 001853          242 D--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIG  319 (1004)
Q Consensus       242 d--i~nViD~~FL~gy~ePtlaiLye~~~tw~gr~~~r~dt~~~~~~sLn~~~k~~~~i~s~~~LP~d~~~lipvP~plG  319 (1004)
                      |  |+||+|++|||||+|||+||||||.+||+||+..|+|||.+.+++||+.+|.||+||++.+||+||+++.++|.|+|
T Consensus       198 deki~niiD~qFLhgY~ePTl~ILyep~~tw~grv~~r~dt~~~vaisLni~q~~hpVI~sv~sLP~D~~~~~~vp~piG  277 (1366)
T KOG1896|consen  198 DEKIKNIIDFQFLHGYYEPTLAILYEPEQTWAGRVILRKDTCVLVAISLNITQKVHPVIWSVLSLPFDCYQATAVPTPIG  277 (1366)
T ss_pred             hhhhccceeEEeecCcccceEEEEecccccccceEEEecCcEEEEEEEcCccccccceEeeeccCChhhhhceeecccCc
Confidence            9  99999999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             eEEEEecCeEEEEeCCC-ceeEeecccccccCCCcCCCCCccEEEecceeEEEeeCCEEEEEeCCCCEEEEEEEEC-Cce
Q 001853          320 GVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYD-GRV  397 (1004)
Q Consensus       320 GvLVig~n~Iiy~dq~~-~~~v~vN~~~~~~t~~~~~~~~~~~i~l~~~~~~~l~~~~~Ll~~~~G~L~~L~l~~d-gr~  397 (1004)
                      ||||++.|.++|++|++ +++|++|.++...+.||+.+|+.+.|.|+|+..+|++.++++++..+|++|+|+|.+| +|.
T Consensus       278 gvLv~~~n~~iy~nqsv~~~gv~LNs~a~~~t~fpl~~qs~v~i~ld~a~~t~i~~dk~vis~~~Gd~y~Ltl~~D~~r~  357 (1366)
T KOG1896|consen  278 GVLVFTVNNLIYLNQSVSPYGVALNSYASKYTAFPLIPQSGVRIELDCANATWISNDKCVISLKNGDLYLLTLILDIGRS  357 (1366)
T ss_pred             cEEEEeeeeEEEEccCCCceeEEecchhhcccCCccccccceEEEEeeccceeecCCeEEEecCCCcEEEEEEEeccccc
Confidence            99999999999999999 7999999999999999999999999999999999999999999999999999999999 799


Q ss_pred             EeEEEEEecCCCcccceEEEEcCCeEEEEeeeCCeeEEEEeeCCCcccccCCCccccCCcccCCccccccccCCcccccc
Q 001853          398 VQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQD  477 (1004)
Q Consensus       398 V~~l~l~~~g~~~~~S~l~~l~~g~lFvGS~~GDS~Ll~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  477 (1004)
                      |+.+++.++...++++|++..+|++||+||+.|||+|++|+++....  ..+...+..+.+.+....+++        ++
T Consensus       358 V~~~~f~k~~asvl~t~~v~~~n~llFlGSrlgnSlll~~s~~~~~~--~e~~~re~~d~~~~~~~~~~~--------d~  427 (1366)
T KOG1896|consen  358 VQLLHFDKFKASVLATSIVGHGNNLLFLGSRLGNSLLLRFSELLQRA--SEGVRREEGDTESDGYSKKRV--------DD  427 (1366)
T ss_pred             hhhhhhhhhhcccceeeeeccCCccEEEEecCCCEEEEEehhccccC--CccccccccCCcCCcchhhcc--------cc
Confidence            99999999999999999999999999999999999999999875421  111111111111111111111        11


Q ss_pred             cccccccc-------------cccCCCCCcccccceeeEEEeeeecccCCcccccccccccCC---------------CC
Q 001853          478 MVNGEELS-------------LYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD---------------AS  529 (1004)
Q Consensus       478 ~~d~~~~~-------------ly~~~~~~~~~~~~~~~l~v~Dsl~NigPI~D~~vg~~~~~~---------------~~  529 (1004)
                      +.|...++             -||++...   ....|.|++||+|+|||||.||++|+....+               +.
T Consensus       428 ~~d~~~~d~~~~~~~~~g~~~~~g~~a~~---t~~~f~fevcDsL~NIGPi~~~avG~~~~~~~~~~gl~~~~~~~elV~  504 (1366)
T KOG1896|consen  428 TQDVRRDDEKSAELFEAGSEENYGSGAQE---TVQPFSFEVCDSLPNIGPITDFAVGKRSSASEAVEGLSPHNKCLELVA  504 (1366)
T ss_pred             hhhhhhhhhhccchhhccccccCCcccce---eeeeeEEeehhccccccccccceeccccchhhhccCCCCCCCeEEEEE
Confidence            11111111             22222221   1233899999999999999999999876543               13


Q ss_pred             ceeccCCceEEE------------EeCCCccEeEEEeecCCCCCCCCcccccccCcccccEEEEEecCceEEEEecCcee
Q 001853          530 ATGISKQSNYEL------------VELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLT  597 (1004)
Q Consensus       530 ~sG~g~~GsL~v------------~~lpg~~~iWtv~~~~~~~~~~~~~~~~~~~~~~~~yLilS~~~~T~Vl~~~~~l~  597 (1004)
                      |+|+|++|+|++            |+||||.++|||..+..+++         .++..|.||++|..++|+||++++++.
T Consensus       505 ~sGhgkngaL~V~r~sI~P~i~t~fel~Gc~~iWtV~~~~~~~~---------~~~~~h~~lilS~e~~t~il~tge~~~  575 (1366)
T KOG1896|consen  505 TSGHGKNGALSVIRRSIRPEIATEFELPGCVDIWTVFIKGRKRE---------EDNTQHLYLILSTESRTMILETGEELL  575 (1366)
T ss_pred             eccCCCCcceEEEeecccceeeEEEEecCeeeEEEEEEeccccc---------cccCcceEEEeecccchhhhhccchhh
Confidence            999999999999            78999999999998654432         223459999999999999999999999


Q ss_pred             EEecCCCccccCCeEEEEEeCCCcEEEEEecCcEEEEeCC-cceeEEeCCCCCCCCCCCCCCccEEEEEEcCCEEEEEEe
Q 001853          598 EVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS-YMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMS  676 (1004)
Q Consensus       598 ev~~~~~F~~~~~TI~ag~l~~~~~IvQVt~~~vrl~~~~-~~~q~~~~~~~~~e~g~~~~~~~I~~As~~dpyvll~~~  676 (1004)
                      |++ .++|..+++||+||++|++++|||||++++|++|++ ++.|.+++.     .|     ..+++++++||||++...
T Consensus       576 Ev~-~s~f~~~~~Tl~~gnlg~~rriVQVtp~~~rllDg~~r~lq~i~fd-----~~-----~~vv~~sv~dpyv~v~~~  644 (1366)
T KOG1896|consen  576 EVS-GSGFTRDGPTLFAGNLGNERRIVQVTPSGLRLLDGDLRMLQRIPFD-----SG-----AIVVQTSVADPYVAVRSS  644 (1366)
T ss_pred             hcc-cceeEeccceEEEEecCCceEEEEEccceeEEecCcchheeEeccc-----cC-----CcEEEEeccCceEEEEEc
Confidence            999 999999999999999999999999999999999995 689999883     33     459999999999999999


Q ss_pred             CCeEEEEEecCCCceEeeecccccccCCCceeEEEEeecCCCCcceecccc------cccccCc-------cccccCCCC
Q 001853          677 DGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTST------DAWLSTG-------VGEAIDGAD  743 (1004)
Q Consensus       677 ~g~I~~l~~d~~~~~l~~~~~~~~~~~~~~i~~~~l~~d~~g~~~f~~~~~------~~~~~~~-------~~~~~~~~~  743 (1004)
                      .|.|.+|.++.+..+|-+..+   .  +.++.++|++.|.+|  +|.+-..      ..++-.+       ....+++++
T Consensus       645 ~g~i~~~~l~~~s~rl~~~~~---~--s~~~~sv~~~~dlsg--~f~~~s~l~~k~~~~~gr~~~~~~~~~~~~kv~~~e  717 (1366)
T KOG1896|consen  645 EGRITLYDLEEKSHRLALHDP---M--SFKVVSVSLPADLSG--MFTTLSDLSLKGNEANGRSSEAEGLQSLPCKVDDEE  717 (1366)
T ss_pred             CCceEEEEeccccchhhccCc---c--cceeEEEechhhhcc--ceEEEeeecccCcccccccccccccccCCccccCCC
Confidence            999999999998877776665   2  667999999999999  7765441      1111111       012344443


Q ss_pred             CCCCCCCcEEEEEEecCCeEEEEEcCCCeEEEEecCcCccccccccccccccccccchhccCCCccccCCCCcccccccc
Q 001853          744 GGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMK  823 (1004)
Q Consensus       744 ~~~~~~~~~~l~~~~~~g~l~I~sLp~~~~v~~~~~~~~~~~~L~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  823 (1004)
                      .....+..+||++++++|.|+||++|++++||.++.|+.++++|.|......+.+                  .....+.
T Consensus       718 gg~~~~~~~~~~~~~e~g~leiy~~pd~~lVf~v~~f~~~~~~L~~~~~~~~~~~------------------~~s~~~~  779 (1366)
T KOG1896|consen  718 GGSPEQEPYWCVFVTESGTLEIYALPDFDLVFEVDMFDTGNRVLMDSRLRGPTTN------------------KESEDLE  779 (1366)
T ss_pred             CCCcccCceEEEEEcCCCceEEEccCCcceEEEeeccCCCcceEEeecccCcccc------------------ccccchH
Confidence            2111222399999999999999999999999999999999999988654443221                  0012357


Q ss_pred             eEEEEEeecCCC--CCCcEEEEEeeCCcEEEEEEEeecCCCCCCCCCCCCcccccccccccccccccceeEEeccCCcCC
Q 001853          824 VVELAMQRWSAH--HSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYT  901 (1004)
Q Consensus       824 i~eill~~lg~~--~~~p~L~v~~~~g~l~iY~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lrf~Kv~~~~~~  901 (1004)
                      ++++.+..||.+  ..+|||++.+.+|++++|++|+....                        +.++++|+|+|+....
T Consensus       780 l~q~~~~~L~~e~~~~e~~L~lv~~~~eil~Ykaf~~~~~------------------------~~~~~~f~kvp~~~~~  835 (1366)
T KOG1896|consen  780 LKQLFVNPLGSEIVFKEPHLFLVVSDNEILIYKAFPQLSQ------------------------GNLKVFFKKVPHNLNI  835 (1366)
T ss_pred             HHHhhccccchhhhccCCceEEEEeCceEEEEeeccccCc------------------------cchhhhhhhCCHhhcc
Confidence            899999999988  77899999999999999999961110                        2388999999985332


Q ss_pred             C----------C------CCC-CCCCccceEEecccCCceEEEecCCCCeEE-EEcccccEEEeccCCCceEEEecCCCC
Q 001853          902 R----------E------ETP-HGAPCQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNV  963 (1004)
Q Consensus       902 ~----------~------~~~-~~~~~~~l~~f~~i~g~sgVFv~G~~P~~i-~~~~~~l~~~~~~~~~~v~~f~~F~~~  963 (1004)
                      +          +      +.+ .+...++|++|++|+||+||||||.+|+|| .+.||.+|+||+.++|+|.+|+||||+
T Consensus       836 ~~~~p~~~~~~~~~~~~e~~~~~~~~~~~m~~f~~i~ghsgvfv~Gs~P~~il~t~rg~lr~h~~~gngpv~sfapfhnv  915 (1366)
T KOG1896|consen  836 RTDKPHFLCKKREGGGAEEGASVSVIVQRMTYFEDIGGHSGVFVTGSKPYLILLTFRGVLRFHPVFGNGPVGSFAPFHNV  915 (1366)
T ss_pred             cccCCcccchhhccccccccccccceeeeEEeeccccCeeEEEEecCCceEEEEEcccccceeeeecCCcceeeeeeecc
Confidence            1          1      111 346788999999999999999999999999 689999999999999999999999999


Q ss_pred             CCCCcEEEEecCCcEEEEECCCCCccccCccceEEeee
Q 001853          964 NCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVVFF 1001 (1004)
Q Consensus       964 ~~~~gfiy~~~~~~lri~~lp~~~~~d~~wp~rkvpl~ 1001 (1004)
                      |||+||||+|.+|.+|||++|..+.||+.||+||||||
T Consensus       916 n~p~gfiyvd~~~~l~i~~lp~~~~Ydn~wPvkkIpl~  953 (1366)
T KOG1896|consen  916 NCPRGFIYVDRQGELVICVLPEALSYDNKWPVKKIPLR  953 (1366)
T ss_pred             CCCcceEEECCCceEEEEEcchhcccCCCCcccccccc
Confidence            99999999999999999999999999999999999998


No 2  
>KOG1897 consensus Damage-specific DNA binding complex, subunit DDB1 [Replication, recombination and repair]
Probab=100.00  E-value=2.2e-89  Score=797.26  Aligned_cols=693  Identities=19%  Similarity=0.308  Sum_probs=569.6

Q ss_pred             chhhhhhccCCCceeeEEEEEEecCCCCCCCCccccccccccccCCCCCCCCCCCeEEEEcCCEEEEEEEEecccccccc
Q 001853            2 SFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTAANVIEIYVVRVQEEGSKES   81 (1004)
Q Consensus         2 ~~~~~~~~~~pT~V~hsv~~~Ft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~nLVvak~n~LeIy~v~~~~~~~~~~   81 (1004)
                      +|+|..++|+||+|.+|+.|||+++..                           .||+|||+|+|+||.+.++|      
T Consensus         1 ~~~Y~vtaqkpT~V~~av~gnFts~e~---------------------------~nlivAk~~~lei~~~~~~G------   47 (1096)
T KOG1897|consen    1 SMNYVVTAQKPTAVVTAVVGNFTSPEN---------------------------LNLIVAKGNRLEILLVEPNG------   47 (1096)
T ss_pred             CeeEEEEecCCceEeEEEeecccCccc---------------------------eeeeeeccceEEEEeecccc------
Confidence            477888899999999999999999988                           99999999999999998876      


Q ss_pred             cCCccccccccccccccccEEEEEEEEeeeeeeEeEEEecCCCCCCCCccEEEEEECCCeEEEEEEeCCCCcEEEEEeee
Q 001853           82 KNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHC  161 (1004)
Q Consensus        82 ~~~~~~~~~~~~~~~~~~kL~lv~e~~l~G~I~~l~~vr~~~s~~~~~~D~Llv~~~~aklsile~d~~~~~l~TvSlh~  161 (1004)
                                         |+.+++.++||+|..|+.+|++++    .+|+|+|+|+++++++|+||.+..+..|+.+..
T Consensus        48 -------------------Lq~i~sv~ifg~I~~i~~fRp~g~----~kD~LfV~t~~~~~~iL~~d~~~~~vv~~a~~~  104 (1096)
T KOG1897|consen   48 -------------------LQPITSVPIFGTIATIALFRPPGS----DKDYLFVATDSYRYFILEWDEESIQVVTRAHGD  104 (1096)
T ss_pred             -------------------ceeeEeeccceeEEEEEeecCCCC----CcceEEEEECcceEEEEEEccccceEEEEeccc
Confidence                               999999999999999999999999    999999999999999999999877788887655


Q ss_pred             ecCcchhcccCCcccccCCCeEEECCCCCEEEEEEecCeEEEEEcccCCCCCCCCCCCCCCCCCcccceeeeEEEEeccc
Q 001853          162 FESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDL  241 (1004)
Q Consensus       162 ~E~~~~~~~k~g~~~~~~~~~l~VDP~~Rca~l~~~~~~L~ilP~~~~~~~l~~~d~~~~~~~~~~~~~~~s~~i~l~~l  241 (1004)
                      ..      -|.| |+...++++.|||.+|.+++++|++.+.+||+....+             .........|.+++.+ 
T Consensus       105 v~------dr~g-r~s~~g~~~~VDp~~R~Igl~~yqgl~~vIp~d~~~s-------------ht~~s~l~~fn~rfde-  163 (1096)
T KOG1897|consen  105 VS------DRSG-RPSDNGQILLVDPKGRVIGLHLYQGLFKVIPIDSDES-------------HTGGSLLKAFNVRFDE-  163 (1096)
T ss_pred             cc------cccc-ccCCCceEEEECCCCcEEEEEeecCeEEEEEeccccc-------------ccCcccccccccccCc-
Confidence            42      3567 6678899999999999999999999999999975421             0011234578888764 


Q ss_pred             CCCceeeEEEecCCCCceEEEEEEccCCcccceeeeeeeEEEEEEeeccccccc-cceeeeccCCCCCcEEEEecCCCCe
Q 001853          242 DMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQH-PLIWSAMNLPHDAYKLLAVPSPIGG  320 (1004)
Q Consensus       242 di~nViD~~FL~gy~ePtlaiLye~~~tw~gr~~~r~dt~~~~~~sLn~~~k~~-~~i~s~~~LP~d~~~lipvP~plGG  320 (1004)
                        .||.||+|||+...||+|+||++..   |    |+.++|    .||+..|.+ ...|+ .++..++..+||||.|.||
T Consensus       164 --l~v~Di~fly~~s~pt~~vly~Ds~---~----~Hv~~y----elnl~~ke~~~~~w~-~~v~~~a~~li~VP~~~gG  229 (1096)
T KOG1897|consen  164 --LNVYDIKFLYGCSDPTLAVLYKDSD---G----RHVKTY----ELNLRDKEFVKGPWS-NNVDNGASMLIPVPSPIGG  229 (1096)
T ss_pred             --ceEEEEEEEcCCCCCceEEEEEcCC---C----cEEEEE----Eeccchhhccccccc-cccccCCceeeecCCCCce
Confidence              9999999999999999999999974   4    344444    567765543 46799 8999999999999999999


Q ss_pred             EEEEecCeEEEEeCCCceeEeecccccccCCCcCCCCCccEEEecceeEEEee--CCEEEEEeCCCCEEEEEEEECCceE
Q 001853          321 VLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQ--NDVALLSTKTGDLVLLTVVYDGRVV  398 (1004)
Q Consensus       321 vLVig~n~Iiy~dq~~~~~v~vN~~~~~~t~~~~~~~~~~~i~l~~~~~~~l~--~~~~Ll~~~~G~L~~L~l~~dgr~V  398 (1004)
                      |||+|+++|+|+++....++  ++.+.+        +.  .+.    ++..++  ..+|||+|++|+||+|.+...+.+|
T Consensus       230 vlV~ge~~I~Y~~~~~~~ai--~p~~~~--------~~--t~~----~~~~v~~~~~~yLl~d~~G~Lf~l~l~~~~e~~  293 (1096)
T KOG1897|consen  230 VLVIGEEFIVYMSGDNFVAI--APLTAE--------QS--TIV----CYGRVDLQGSRYLLGDEDGMLFKLLLSHTGETV  293 (1096)
T ss_pred             EEEEeeeEEEEeeCCceeEe--cccccC--------Cc--eEE----EcccccCCccEEEEecCCCcEEEEEeecccccc
Confidence            99999999999998654433  333211        11  121    344443  4589999999999999999889888


Q ss_pred             eE--EEEEecCCCcccceEEEEcCCeEEEEeeeCCeeEEEEeeCCCcccccCCCccccCCcccCCccccccccCCccccc
Q 001853          399 QR--LDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQ  476 (1004)
Q Consensus       399 ~~--l~l~~~g~~~~~S~l~~l~~g~lFvGS~~GDS~Ll~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  476 (1004)
                      ++  |+++++|++++|+||+||++|+||+||++|||+|+++...+                                   
T Consensus       294 s~~~lkve~lge~siassi~~L~ng~lFvGS~~gdSqLi~L~~e~-----------------------------------  338 (1096)
T KOG1897|consen  294 SGLDLKVEYLGETSIASSINYLDNGVLFVGSRFGDSQLIKLNTEP-----------------------------------  338 (1096)
T ss_pred             cceEEEEEecCCcchhhhhhcccCceEEEeccCCceeeEEccccC-----------------------------------
Confidence            88  99999999999999999999999999999999999986420                                   


Q ss_pred             ccccccccccccCCCCCcccccceeeEEEeeeecccCCcccccccccccCC----CCceeccCCceEEE-----------
Q 001853          477 DMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINAD----ASATGISKQSNYEL-----------  541 (1004)
Q Consensus       477 ~~~d~~~~~ly~~~~~~~~~~~~~~~l~v~Dsl~NigPI~D~~vg~~~~~~----~~~sG~g~~GsL~v-----------  541 (1004)
                         |                . ++| ..+++++.|||||.||+|-+.....    .+|||++|+|+||+           
T Consensus       339 ---d----------------~-gsy-~~ilet~~NLgPI~Dm~Vvd~d~q~q~qivtCsGa~kdgSLRiiRngi~I~e~A  397 (1096)
T KOG1897|consen  339 ---D----------------V-GSY-VVILETFVNLGPIVDMCVVDLDRQGQGQIVTCSGAFKDGSLRIIRNGIGIDELA  397 (1096)
T ss_pred             ---C----------------C-Cch-hhhhhhcccccceeeEEEEeccccCCceEEEEeCCCCCCcEEEEecccccceee
Confidence               1                1 334 6889999999999999997643111    25999999999999           


Q ss_pred             -EeCCCccEeEEEeecCCCCCCCCcccccccCcccccEEEEEecCceEEEEecCceeEEecCCCccccCCeEEEEEeCCC
Q 001853          542 -VELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGR  620 (1004)
Q Consensus       542 -~~lpg~~~iWtv~~~~~~~~~~~~~~~~~~~~~~~~yLilS~~~~T~Vl~~~~~l~ev~~~~~F~~~~~TI~ag~l~~~  620 (1004)
                       ++|||+++||+++..              -++++|.|||+||.++|++|.++++++|+. ..||.++++||+|++++++
T Consensus       398 ~i~l~Gikg~w~lk~~--------------v~~~~d~ylvlsf~~eTrvl~i~~e~ee~~-~~gf~~~~~Tif~S~i~g~  462 (1096)
T KOG1897|consen  398 SIDLPGIKGMWSLKSM--------------VDENYDNYLVLSFISETRVLNISEEVEETE-DPGFSTDEQTIFCSTINGN  462 (1096)
T ss_pred             EeecCCccceeEeecc--------------ccccCCcEEEEEeccceEEEEEccceEEec-cccccccCceEEEEccCCc
Confidence             589999999999853              467889999999999999999998899998 9999999999999999888


Q ss_pred             cEEEEEecCcEEEEeCCcceeEEeCCCCCCCCCCCCCCccEEEEEEcCCEEEEEEeCCeEEEEEecCCCceEeeeccccc
Q 001853          621 RRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAI  700 (1004)
Q Consensus       621 ~~IvQVt~~~vrl~~~~~~~q~~~~~~~~~e~g~~~~~~~I~~As~~dpyvll~~~~g~I~~l~~d~~~~~l~~~~~~~~  700 (1004)
                       .++|||+++||++++.++..+|..+          ++..|..|+++..+|+|+..++.+.+++++..+. -++....+ 
T Consensus       463 -~lvQvTs~~iRl~ss~~~~~~W~~p----------~~~ti~~~~~n~sqVvvA~~~~~l~y~~i~~~~l-~e~~~~~~-  529 (1096)
T KOG1897|consen  463 -QLVQVTSNSIRLVSSAGLRSEWRPP----------GKITIGVVSANASQVVVAGGGLALFYLEIEDGGL-REVSHKEF-  529 (1096)
T ss_pred             -eEEEEecccEEEEcchhhhhcccCC----------CceEEEEEeecceEEEEecCccEEEEEEeeccce-eeeeehee-
Confidence             7999999999999999778889764          3477999999999999999989999999987762 22222222 


Q ss_pred             ccCCCceeEEEEeecCCCCcceecccccccccCccccccCCCCCCCCCCCcEEEEEEecCCeEEEEEcCCCeEEEEecCc
Q 001853          701 ESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKF  780 (1004)
Q Consensus       701 ~~~~~~i~~~~l~~d~~g~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~l~I~sLp~~~~v~~~~~~  780 (1004)
                         +  ...+||--.+-|                      +    ....+....+.+|++-.+.|..+||+.+++.. .+
T Consensus       530 ---e--~evaCLDisp~~----------------------d----~~~~s~~~aVG~Ws~~~~~l~~~pd~~~~~~~-~l  577 (1096)
T KOG1897|consen  530 ---E--YEVACLDISPLG----------------------D----APNKSRLLAVGLWSDISMILTFLPDLILITHE-QL  577 (1096)
T ss_pred             ---c--ceeEEEecccCC----------------------C----CCCcceEEEEEeecceEEEEEECCCcceeeee-cc
Confidence               2  334477322222                      1    11345689999999999999999999887762 11


Q ss_pred             CccccccccccccccccccchhccCCCccccCCCCcccccccceEEEEEeecCCCCCCcEEEEEeeCCcEEEEEEEeecC
Q 001853          781 VSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEG  860 (1004)
Q Consensus       781 ~~~~~~L~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~eill~~lg~~~~~p~L~v~~~~g~l~iY~~f~~~~  860 (1004)
                      +                                      ....++.|++..++.+  +-||+|.+.||.++-|..+...+
T Consensus       578 ~--------------------------------------~~~iPRSIl~~~~e~d--~~yLlvalgdG~l~~fv~d~~tg  617 (1096)
T KOG1897|consen  578 S--------------------------------------GEIIPRSILLTTFEGD--IHYLLVALGDGALLYFVLDINTG  617 (1096)
T ss_pred             C--------------------------------------CCccchheeeEEeecc--ceEEEEEcCCceEEEEEEEcccc
Confidence            1                                      2235788999999754  68999999999999887775333


Q ss_pred             CCCCCCCCCCCcccccccccccccccccceeEEeccCCcCCCCCCCCCCCccceEEecccCCceEEEecCCCCeEEEEcc
Q 001853          861 PENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVFR  940 (1004)
Q Consensus       861 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lrf~Kv~~~~~~~~~~~~~~~~~~l~~f~~i~g~sgVFv~G~~P~~i~~~~  940 (1004)
                      ..++                     +      ||+.          .|.++..||.|. ..+.+.||+++++|..||+++
T Consensus       618 ~lsd---------------------~------Kk~~----------lGt~P~~Lr~f~-sk~~t~vfa~sdrP~viY~~n  659 (1096)
T KOG1897|consen  618 QLSD---------------------R------KKVT----------LGTQPISLRTFS-SKSRTAVFALSDRPTVIYSSN  659 (1096)
T ss_pred             eEcc---------------------c------cccc----------cCCCCcEEEEEe-eCCceEEEEeCCCCEEEEecC
Confidence            2111                     2      4554          678899999995 567899999999999999999


Q ss_pred             cccEEEeccCCCceEEEecCCCCCCCCcEEEEecCCcEEEEECCCCCccccCccceEEeee
Q 001853          941 ERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVVFF 1001 (1004)
Q Consensus       941 ~~l~~~~~~~~~~v~~f~~F~~~~~~~gfiy~~~~~~lri~~lp~~~~~d~~wp~rkvpl~ 1001 (1004)
                      +.+.+.|++.+ .+..+|||++..||++.++++..+ |+|.++++...    ..+|+||++
T Consensus       660 ~kLv~spls~k-ev~~~c~f~s~a~~d~l~~~~~~~-l~i~tid~iqk----l~irtvpl~  714 (1096)
T KOG1897|consen  660 GKLVYSPLSLK-EVNHMCPFNSDAYPDSLASANGGA-LTIGTIDEIQK----LHIRTVPLG  714 (1096)
T ss_pred             CcEEEeccchH-HhhhhcccccccCCceEEEecCCc-eEEEEecchhh----cceeeecCC
Confidence            99999999998 899999999999999999999885 99999998765    456777765


No 3  
>COG5161 SFT1 Pre-mRNA cleavage and polyadenylation specificity factor [RNA processing and modification]
Probab=100.00  E-value=2.4e-79  Score=697.29  Aligned_cols=826  Identities=18%  Similarity=0.223  Sum_probs=624.0

Q ss_pred             CchhhhhhccCCCceeeEEEEEEecCCCCCCCCccccccccccccCCCCCCCCCCCeEEEEcCCEEEEEEEEeccccccc
Q 001853            1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTAANVIEIYVVRVQEEGSKE   80 (1004)
Q Consensus         1 m~~~~~~~~~~pT~V~hsv~~~Ft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~nLVvak~n~LeIy~v~~~~~~~~~   80 (1004)
                      |+ .+|.++..+|.++||+.|+||+.+.                           ++|+|.|+|.|+||+...++     
T Consensus         1 m~-~~y~d~~d~tv~~~~~ag~Ft~s~~---------------------------~~llv~~~Nil~v~~~~~d~-----   47 (1319)
T COG5161           1 MN-YLYSDESDWTVTEGCSAGLFTPSRT---------------------------CSLLVYNGNILAVRLWKYDS-----   47 (1319)
T ss_pred             Cc-chhhhhhHHHHhhccccceeecccc---------------------------ceEEEEeccEEEEEEeeccC-----
Confidence            44 5788899999999999999999887                           99999999999999998876     


Q ss_pred             ccCCccccccccccccccccEEEEEEEEeeeeeeEeEEEecCCCCCCCCccEEEEEECCCeEEEEEEeCCCCcEEEEEee
Q 001853           81 SKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH  160 (1004)
Q Consensus        81 ~~~~~~~~~~~~~~~~~~~kL~lv~e~~l~G~I~~l~~vr~~~s~~~~~~D~Llv~~~~aklsile~d~~~~~l~TvSlh  160 (1004)
                                         +|.++.++.++|.|++|....-..+    .+|.|++.|..||+++++||.+.+.|.|+|+|
T Consensus        48 -------------------~l~l~de~~~~e~~t~I~~~pq~~s----e~~~lll~t~~akis~lrf~sq~n~f~Tislh  104 (1319)
T COG5161          48 -------------------GLVLVDEHMLLEKVTQIEKYPQISS----EQDGLLLLTHRAKISLLRFDSQANEFRTISLH  104 (1319)
T ss_pred             -------------------CeeEchHHhhhhhhhhhhhcccccC----ccceEEEEeccceEEEEEehhhcccceeEEEe
Confidence                               7999999999999999999988888    89999999999999999999999999999999


Q ss_pred             eecCcchhcccCCc-ccccCCCeEEECCCCCEEEEEEecCeEEEEEcccCCC--CCCCCCCCCCC--------------C
Q 001853          161 CFESPEWLHLKRGR-ESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGS--GLVGDEDTFGS--------------G  223 (1004)
Q Consensus       161 ~~E~~~~~~~k~g~-~~~~~~~~l~VDP~~Rca~l~~~~~~L~ilP~~~~~~--~l~~~d~~~~~--------------~  223 (1004)
                      |||...    |.-. ........++-||++-|+ |+++++..+++||+-+..  ++++.|.++..              |
T Consensus       105 yyeGKf----kgksLvelak~stle~D~~ssca-LlfneDi~~flpfhvnkndddev~~d~D~~~~~~~~~h~~i~psqg  179 (1319)
T COG5161         105 YYEGKF----KGKSLVELAKFSTLEFDIRSSCA-LLFNEDIGNFLPFHVNKNDDDEVRIDVDLGMFQMSKRHFSIFPSQG  179 (1319)
T ss_pred             eecccc----CCchhhhhhhhhheeeccCccch-hhhhhhhhhcccccccCCccccccccccccHHHHHHHHhhcCCCCC
Confidence            998862    2211 223445679999999887 688899999999974332  22222211100              0


Q ss_pred             C----------CcccceeeeEEEEecccC--CCceeeEEEecCCCCceEEEEEEccCCcccceeeeeeeEEEEEEeeccc
Q 001853          224 G----------GFSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTT  291 (1004)
Q Consensus       224 ~----------~~~~~~~~s~~i~l~~ld--i~nViD~~FL~gy~ePtlaiLye~~~tw~gr~~~r~dt~~~~~~sLn~~  291 (1004)
                      .          -...--.||+++..++||  |.||+|++||++|++||+|+||+|.++|++....+|+++.+.+++||+.
T Consensus       180 tntfnkrkrt~~~~kfsaPs~Vl~~seld~~ikniiD~~FL~ny~~PTvallY~Pkl~~~~~~ti~k~p~~~~v~Tldl~  259 (1319)
T COG5161         180 TNTFNKRKRTLFPGKFSAPSKVLKFSELDGKIKNIIDFVFLENYSIPTVALLYDPKLSLPRKYTILKNPYNAIVFTLDLG  259 (1319)
T ss_pred             ccccchhhhhhcCCcccCceeEEEehhhhccccccEEEEeeccCCCceEEEEecccccccceeEeecCceeEEEEEEecC
Confidence            0          001112579999999999  9999999999999999999999999999999999999999999999999


Q ss_pred             cccccceeeeccCCCCCcEEEEecCCCCeEEEEecCeEEEEeCCC-ceeEeecccccccCCCc-CCCCC--ccEEEecce
Q 001853          292 LKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ-ELPRS--SFSVELDAA  367 (1004)
Q Consensus       292 ~k~~~~i~s~~~LP~d~~~lipvP~plGGvLVig~n~Iiy~dq~~-~~~v~vN~~~~~~t~~~-~~~~~--~~~i~l~~~  367 (1004)
                      ++++.+|-.+..||+|.+..+|+|.   |+|++|.|+++|+|..| .+++.+|.++...+.++ +.+++  ++++.+.|.
T Consensus       260 ~~~saVI~~~~~lP~d~~~~v~~p~---Gall~g~neli~idstg~~~~I~lNs~~~k~~~~~~v~d~s~~d~n~~~~gt  336 (1319)
T COG5161         260 AGRSAVIDEFLVLPRDFRVTVAGPV---GALLFGSNELILIDSTGSSYTIPLNSMSEKYGGNKIVEDISLSDVNCFSRGT  336 (1319)
T ss_pred             cchhhhhHhHhcCCceEEEEEeccc---ceEEEecccEEEEecCCcEEEeechhhHHHhcCCceEeecccceeeEeecCc
Confidence            9999999999999999999999985   99999999999999999 78999999999988887 55666  567777887


Q ss_pred             eEEEeeC-----CEEEEEeCCCCEEEEEEEECCceEeEEEEEec---C----CCcccceEEEEcCCeEEEEeeeCCeeEE
Q 001853          368 HATWLQN-----DVALLSTKTGDLVLLTVVYDGRVVQRLDLSKT---N----PSVLTSDITTIGNSLFFLGSRLGDSLLV  435 (1004)
Q Consensus       368 ~~~~l~~-----~~~Ll~~~~G~L~~L~l~~dgr~V~~l~l~~~---g----~~~~~S~l~~l~~g~lFvGS~~GDS~Ll  435 (1004)
                      ...|+-.     +.+++++-+|+.|.|.+.+||++|.++.+..+   +    ..+-++|+..+++.++|+|+..+||.++
T Consensus       337 tsIwipsSK~~~etl~l~dl~g~~yyl~~~~dgk~iigfdi~~L~~e~dllk~~s~~~Cv~~~n~~l~f~g~g~~ns~vl  416 (1319)
T COG5161         337 TSIWIPSSKCLIETLFLGDLNGDRYYLRISMDGKRIIGFDIASLEFEGDLLKKGSAVSCVGHVNNLLFFGGVGDSNSRVL  416 (1319)
T ss_pred             eeeeccCcccccceEEEEecCCCEEEEEEEeccceeeccceeeeeeeccccccCCCCeeEEEcCceEEEEEecCCceEEE
Confidence            7777744     46899999999999999999999999777654   2    5688999999999999999999999999


Q ss_pred             EEeeCCCcccccCCCccccCCcccCCccccccccCCcccccccccccccccccCCCCCcccccceeeEEEeeeecccCCc
Q 001853          436 QFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPL  515 (1004)
Q Consensus       436 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~ly~~~~~~~~~~~~~~~l~v~Dsl~NigPI  515 (1004)
                      +|++..+...  ...+|....+.+.          .+++|||.-+..|-++++...+....+.++|.+++|+.+.|+|||
T Consensus       417 r~~~l~~tiE--tR~~eG~~~l~g~----------nDeEmdD~y~apEn~l~~n~~~~v~~~~~p~d~el~~~l~n~gpi  484 (1319)
T COG5161         417 RIKSLLPTIE--TRASEGVGPLEGG----------NDEEMDDEYSAPENKLFGNKEQEVRRQDEPYDAELFNALSNAGPI  484 (1319)
T ss_pred             EecccCCchh--hhhhcCCCcccCC----------ChhhhhhhhcccccccccCcccceeeccCcchhHHhhhhccCCcc
Confidence            9998654321  0111111111100          001111110001112222222222236678899999999999999


Q ss_pred             ccccccccccCC------------CCceeccCCceEEE------------EeCCCccEeEEEeecCCCCCCCCccccccc
Q 001853          516 KDFSYGLRINAD------------ASATGISKQSNYEL------------VELPGCKGIWTVYHKSSRGHNADSSRMAAY  571 (1004)
Q Consensus       516 ~D~~vg~~~~~~------------~~~sG~g~~GsL~v------------~~lpg~~~iWtv~~~~~~~~~~~~~~~~~~  571 (1004)
                      .||+||+.....            +.++|++..|+|.|            +.+-++..+|+++.+...           .
T Consensus       485 tdfavgkv~v~kglP~pN~g~l~lV~t~G~ds~~~l~V~~ts~~P~I~~~~~fi~~e~vw~~kI~g~l-----------r  553 (1319)
T COG5161         485 TDFAVGKVDVEKGLPIPNIGLLNLVVTKGSDSEAALAVEGTSLEPCICTVSSFIPLEIVWSQKIRGYL-----------R  553 (1319)
T ss_pred             cceeeeeccceecCCCCCccceeeEEeccCCCcceEEEEeccccceeeehccccchhheeehhcccee-----------h
Confidence            999999865211            13889999999999            345578999999986421           1


Q ss_pred             CcccccEEEEEecCceEEEEecCceeEEecCCCccccCCeEEEEEeCCCcEEEEEecCcEEEEeCC-cceeEEeCCCCCC
Q 001853          572 DDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS-YMTQDLSFGPSNS  650 (1004)
Q Consensus       572 ~~~~~~yLilS~~~~T~Vl~~~~~l~ev~~~~~F~~~~~TI~ag~l~~~~~IvQVt~~~vrl~~~~-~~~q~~~~~~~~~  650 (1004)
                      ....-.|+++|..+.|.||+.++++.+.. ..+|..+..|+.++.++.++++|||||+.+++||.+ ++.+.+.+.    
T Consensus       554 ~~~~~~~~~ls~~s~S~If~~~e~f~l~~-~g~~~rd~~Tl~~~~fgee~rvVQvtp~~l~~yD~~lR~l~~~~F~----  628 (1319)
T COG5161         554 CSRALDFYILSRVSDSRIFRWSEEFLLEV-SGEYTRDVNTLLFVEFGEENRVVQVTPSYLLRYDQDLRMLGRVEFA----  628 (1319)
T ss_pred             hcceeeEEEeecccccceeeccccceeee-cceeeccccEEEeeeccCcceEEEecchHhhhhcccceeeeeEeec----
Confidence            23345799999999999999999999988 899999999999999999999999999999999988 467777773    


Q ss_pred             CCCCCCCCccEEEEEEcCCEEEEEEeCCeEEEEEecCCCce-EeeecccccccCCCceeEEEEeecCCCCcceecccccc
Q 001853          651 ESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCT-VSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDA  729 (1004)
Q Consensus       651 e~g~~~~~~~I~~As~~dpyvll~~~~g~I~~l~~d~~~~~-l~~~~~~~~~~~~~~i~~~~l~~d~~g~~~f~~~~~~~  729 (1004)
                             ...|++.|++|||+++..+.|.|.+|.+++.+++ +.+..+..+.  +-++.++-+ .|.+---.|...    
T Consensus       629 -------~~~V~~~Sv~Dp~ilvv~~~g~i~~f~~~ekn~rL~k~dl~~~l~--d~k~~s~v~-~dsN~~g~f~ig----  694 (1319)
T COG5161         629 -------SRAVEARSVRDPLILVVRDSGKILTFYDREKNMRLFKIDLVTCLA--DAKNKSFVL-SDSNSLGIFDIG----  694 (1319)
T ss_pred             -------eeeeEEEeccCCEEEEEEecCceEEEEehhhhchhccCChHHHHH--hhhhheEec-cCcccccceecc----
Confidence                   1249999999999999999999999999998776 4455555554  444443222 221100022100    


Q ss_pred             cccCccccccCCCCCCCCCCCcEEEE-EEecCCeEEEEEcCCCeEEEEecCcCccccccccccccccccccchhccCCCc
Q 001853          730 WLSTGVGEAIDGADGGPLDQGDIYSV-VCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSE  808 (1004)
Q Consensus       730 ~~~~~~~~~~~~~~~~~~~~~~~~l~-~~~~~g~l~I~sLp~~~~v~~~~~~~~~~~~L~~~~~~~~~~~~~~~~~~~~~  808 (1004)
                                     ....+...+++ .+..+-.+.-..-|.+..+++.++++.+    .+.+....+.         ..
T Consensus       695 ---------------~~~Sq~e~~l~~~~~~~~q~~~~~s~~~D~~~e~dg~dQl----te~~~~~tyn---------l~  746 (1319)
T COG5161         695 ---------------KRISQLEPCLVKGLPYAIQFSPEASPAMDLAGEEDGDDQL----TEISMSLTYN---------LI  746 (1319)
T ss_pred             ---------------cchhhhchhhhhcCcccceeccccCcchhhccccccchhh----hhHHHHHHHh---------hh
Confidence                           00011112222 2223334344445667777777666532    2211111100         00


Q ss_pred             cccCCCCcccccccceEEEEEeecCCCCCCcEEEEEeeCCcEEEEEEEeecCCCCCCCCCCCCccccccccccccccccc
Q 001853          809 EGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLR  888 (1004)
Q Consensus       809 ~~~~~~~~~~~~~~~i~eill~~lg~~~~~p~L~v~~~~g~l~iY~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  888 (1004)
                      .       .--..+.|.+++++.||++-..|||+.+...++++.|+.|.+..                            
T Consensus       747 d-------~~f~lpsi~~~mVa~lg~D~keeyLf~~s~~~EI~~yk~~l~r~----------------------------  791 (1319)
T COG5161         747 D-------MLFRLPSIGNYMVAYLGLDLKEEYLFDNSLSSEIVFYKTHLPRH----------------------------  791 (1319)
T ss_pred             h-------hhccChhhhhhhhHhhcccccchheehhhcCceEEEEeeccccc----------------------------
Confidence            0       01134578999999999999899999999999999999995332                            


Q ss_pred             ceeEEec------cCCcCCCCCCCC--CCCccceEEecccCCceEEEecCCCCeEE-EEcccccEEEeccCCCceEEEec
Q 001853          889 NLRFSRT------PLDAYTREETPH--GAPCQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTV  959 (1004)
Q Consensus       889 ~lrf~Kv------~~~~~~~~~~~~--~~~~~~l~~f~~i~g~sgVFv~G~~P~~i-~~~~~~l~~~~~~~~~~v~~f~~  959 (1004)
                       .+|-|=      .+...|+..+.+  +...+-...|+...||+.||+||..|++| ...++...+.+. ++-|+.+.+|
T Consensus       792 -~~f~~nvTRndlAitGaPdna~~Ka~sSV~ri~m~f~~~vghs~~fvTg~~pfl~~s~~~s~~k~f~~-gNIPlvsv~p  869 (1319)
T COG5161         792 -VSFNLNVTRNDLAITGAPDNADIKAFSSVGRIDMVFIKAVGHSFMFVTGKGPFLCRSRYTSSSKAFHR-GNIPLVSVIP  869 (1319)
T ss_pred             -chhhhhcchhhhhccCCCcchhhhhcccccceeEEEeeccCeEEEEEcCCccEEEEEeccCCcceeec-CCCceeeeee
Confidence             222221      111122221111  24455678899999999999999999999 788888888887 4779999999


Q ss_pred             CCCCCCCCcEEEEecCCcEEEEECCCCCccc-cCccceEEeee
Q 001853          960 LHNVNCNHGFIYVTSQGILKICQLPSGSTYD-NYWPVQKVVFF 1001 (1004)
Q Consensus       960 F~~~~~~~gfiy~~~~~~lri~~lp~~~~~d-~~wp~rkvpl~ 1001 (1004)
                      ||-    +|++|+++...+|+|++-.+..|+ |-||++|+|+.
T Consensus       870 ~s~----rgy~~Vd~~~~vr~~~~~~dn~y~gnK~p~k~~~~~  908 (1319)
T COG5161         870 LSK----RGYLMVDNVLGVRASQYVFDNGYVGNKNPVKRTPKH  908 (1319)
T ss_pred             ccc----ccEEEEecccceeEEEEEeccceecccCceeecccc
Confidence            996    999999998779999999999998 99999999986


No 4  
>KOG1898 consensus Splicing factor 3b, subunit 3 [RNA processing and modification]
Probab=100.00  E-value=1.4e-68  Score=625.14  Aligned_cols=737  Identities=20%  Similarity=0.315  Sum_probs=571.9

Q ss_pred             hhhhhhccCCCceeeEEEEEEecCCCCCCCCccccccccccccCCCCCCCCCCCeEEEEcCCEEEEEEEEec-ccccccc
Q 001853            3 FAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTAANVIEIYVVRVQ-EEGSKES   81 (1004)
Q Consensus         3 ~~~~~~~~~pT~V~hsv~~~Ft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~nLVvak~n~LeIy~v~~~-~~~~~~~   81 (1004)
                      |.|..+++.||+|.||++|+|.+++.                           +++|+++++.|++|++.++ |      
T Consensus         2 ~lysltlq~~t~i~~~~~g~fs~~k~---------------------------qeIv~~~~s~l~L~~~d~~~G------   48 (1205)
T KOG1898|consen    2 FLYSLTLQNQTGIVQAIYGNFSGPKA---------------------------QEIVLGRGSILELYRIDENDG------   48 (1205)
T ss_pred             chhhhhhhcccceeeeehhhccCCch---------------------------heEEEEeeeEEEEEEecCCCc------
Confidence            56788899999999999999999987                           8999999999999999865 3      


Q ss_pred             cCCccccccccccccccccEEEEEEEEeeeeeeEeEEEecCCCCCCCCccEEEEEECCCeEEEEEEeCCCCcEEEEEeee
Q 001853           82 KNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHC  161 (1004)
Q Consensus        82 ~~~~~~~~~~~~~~~~~~kL~lv~e~~l~G~I~~l~~vr~~~s~~~~~~D~Llv~~~~aklsile~d~~~~~l~TvSlh~  161 (1004)
                                        ||+.++++.+||+|++|+.+|..+.    .+|+|+|++|+++++|++|+.+++.++++..|.
T Consensus        49 ------------------~l~~i~~~~vFg~Irsla~~~lt~~----~kD~LaV~SDSGri~il~y~~ek~~~~~~~qet  106 (1205)
T KOG1898|consen   49 ------------------RLKTICRQEVFGTIRSLAAFRLTGG----TKDYLAVGSDSGRISILEYNNEKNHFEKLHQET  106 (1205)
T ss_pred             ------------------eEEEEEEEeehhhhhhhhccccCCC----CccEEEEEcCCceEEEEEechhhhccccccccc
Confidence                              8999999999999999999999998    999999999999999999999999999886666


Q ss_pred             ecCcchhcccCCcccccCCCeEEECCCCCEEEEEEe-cCeEEEEEcccCCCCCCCCCCCCCCCCCcccceeeeEEEEecc
Q 001853          162 FESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRD  240 (1004)
Q Consensus       162 ~E~~~~~~~k~g~~~~~~~~~l~VDP~~Rca~l~~~-~~~L~ilP~~~~~~~l~~~d~~~~~~~~~~~~~~~s~~i~l~~  240 (1004)
                      |       +|+|+|+..++.|+.+||.|||+++++. +++|+++-.+...       ..+..++++++....+.++.+..
T Consensus       107 f-------Gks~~rrivpG~y~~idp~Gra~misave~~kLvyvlnrD~~-------a~ltisSpleahk~~sic~~l~~  172 (1205)
T KOG1898|consen  107 F-------GKSGCRRIVPGQYLAIDPKGRAVMISAVEKQKLVYVLNRDGA-------ARLTISSPLEAHKAHSICLDLVG  172 (1205)
T ss_pred             c-------CcccceEeccccEEEEcCCccceeeehhhcCcEEEEEccchh-------hhceecCchhhccCCcEEEEEEE
Confidence            6       8999999999999999999999999987 9999998776543       24445678888888999999999


Q ss_pred             cCCCceeeEEEecCCCCceEEEEEEcc----CCcccce---eeeeeeEEEEEEeeccccccccceeeeccCCCCCcEEEE
Q 001853          241 LDMKHVKDFIFVHGYIEPVMVILHERE----LTWAGRV---SWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLA  313 (1004)
Q Consensus       241 ldi~nViD~~FL~gy~ePtlaiLye~~----~tw~gr~---~~r~dt~~~~~~sLn~~~k~~~~i~s~~~LP~d~~~lip  313 (1004)
                      +|.          ||.||+||.|+-+.    ...+|..   ..+..++|..+++||++.|+    |+ .-+....+.+++
T Consensus       173 Vd~----------gf~np~fa~LE~dy~~a~~d~tgeaa~~~~~~l~fYeldlglnhvvrk----~s-~p~~~~~n~l~~  237 (1205)
T KOG1898|consen  173 VDV----------GFENPIFAALERDYSEADNDPTGEAATMTQKVLTFYELDLGLNHVVRK----AS-EPVNHFGNFLLT  237 (1205)
T ss_pred             Eec----------cCCCceEEEEeechhhcccCchhhhhhccccceeEEEEecccceeEEE----cc-cccCCCceEEEE
Confidence            888          99999999999762    1122322   25778999999999999998    77 346677999999


Q ss_pred             ecCC---CCeEEEEecCeEEEEeCCC--ceeEeecccccccCCCcCCCCCccEEEecceeEEEeeCCEEEEEeCCCCEEE
Q 001853          314 VPSP---IGGVLVVGANTIHYHSQSA--SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVL  388 (1004)
Q Consensus       314 vP~p---lGGvLVig~n~Iiy~dq~~--~~~v~vN~~~~~~t~~~~~~~~~~~i~l~~~~~~~l~~~~~Ll~~~~G~L~~  388 (1004)
                      ||..   ..||+|++.|++.|.+..-  .+.++.   +++.+..+. ..+.+-+. .+......+..++|+++++||+|+
T Consensus       238 VP~G~D~ps~v~vc~~n~~~y~~~~d~p~~ri~~---~rr~~~L~~-~~~~vliv-~s~~hk~k~~ff~llqt~~GD~fk  312 (1205)
T KOG1898|consen  238 VPGGSDGPSGVLVCAENYLLYRNLGDHPDVRIPI---ERRINELSD-AEDGVLIV-SSAEHKTKSMFFFLLQTEYGDLFK  312 (1205)
T ss_pred             ecCCCCCCcceEEecCceeeccccccCCCEEecc---ccccccCCc-cccccEEE-EeecccccCCeEEEEEecCCceEE
Confidence            9975   3499999999999999873  345533   555432221 12233221 222222334459999999999999


Q ss_pred             EEEEECCceEeEEEEEecCCCcccceEEEEcCCeEEEEeeeCCeeEEEEeeCCCcccccCCCccccCCcccCCccccccc
Q 001853          389 LTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLR  468 (1004)
Q Consensus       389 L~l~~dgr~V~~l~l~~~g~~~~~S~l~~l~~g~lFvGS~~GDS~Ll~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  468 (1004)
                      ++|..|+..|..+++.|+++.+.+..|+++++|+||+.|++||..|||+...        |++++  +..+         
T Consensus       313 ~tl~~d~d~v~el~lkYfDtvp~a~~L~I~k~GfLf~~sE~~n~~lyq~~~L--------G~~~~--~~s~---------  373 (1205)
T KOG1898|consen  313 LTLEHDGDNVVELRLKYFDTVPCALQLCILKTGFLFVASEFGNHRLYQFEKL--------GEEDD--DFSN---------  373 (1205)
T ss_pred             EEEecCCCcceeeeeehhcCCccceEEEEeccceEEEhhhccCcceeehhhc--------CCCcc--chhh---------
Confidence            9999999999999999999999999999999999999999999999999875        32211  1110         


Q ss_pred             cCCcccccccccccccccccCCCCCcccccceeeEEEeeeecccCCcccccccccccCCC----CceeccCCceEEE---
Q 001853          469 RSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADA----SATGISKQSNYEL---  541 (1004)
Q Consensus       469 ~~~~~~~~~~~d~~~~~ly~~~~~~~~~~~~~~~l~v~Dsl~NigPI~D~~vg~~~~~~~----~~sG~g~~GsL~v---  541 (1004)
                           +|+- ++.  ..++ ++|+..    +  +|..++++.|+.|+.|+.+|+..+.+.    .|||+|.+++|++   
T Consensus       374 -----~~~~-~~~--~~~~-f~p~~l----~--nL~~~~~i~sl~p~~d~~I~~~~ne~~~qi~~~cg~~~~sslr~lR~  438 (1205)
T KOG1898|consen  374 -----AMTS-EEG--KSVF-FEPRIL----K--NLSPVSSVESLSPLLDISIGDDSNEDTPQIYSACGRGPRSSLRILRN  438 (1205)
T ss_pred             -----hccc-ccC--ccee-cccccc----c--cccchhhhhccCccceeEeeccCcccchhhhhhhCcCccccchhhcc
Confidence                 1110 011  1222 334432    2  688899999999999999998665442    3999999999998   


Q ss_pred             ---------EeCCC-ccEeEEEeecCCCCCCCCcccccccCcccccEEEEEecCceEEEEecCceeEEecCCCccccCCe
Q 001853          542 ---------VELPG-CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT  611 (1004)
Q Consensus       542 ---------~~lpg-~~~iWtv~~~~~~~~~~~~~~~~~~~~~~~~yLilS~~~~T~Vl~~~~~l~ev~~~~~F~~~~~T  611 (1004)
                               .+||+ ++++||++.+              ..+.||.||++||.+.|+||++|+.+||++ ++||..+.+|
T Consensus       439 gle~sel~~t~lp~~~ta~WTvk~~--------------~td~ydsyivvsF~n~TlVLsIgesveEvt-dsgFls~~~T  503 (1205)
T KOG1898|consen  439 GLEVSELLVTELPGNPTATWTVKKN--------------ITDVYDSYIVVSFVNGTLVLSIGESVEEVT-DSGFLSTTPT  503 (1205)
T ss_pred             ccchHHHhhhccCCCCceEEEEcCc--------------cccccceEEEEEeeccEEEEEcchhHHHhh-hcccccCCce
Confidence                     25787 9999999863              467899999999999999999999999999 9999999999


Q ss_pred             EEEEEeCCCcEEEEEecCcEEEEeCCcceeEEeCCCCCCCCCCCCCCccEEEEEEcCCEEEEEEeCCeEEEEEecCCCce
Q 001853          612 IAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCT  691 (1004)
Q Consensus       612 I~ag~l~~~~~IvQVt~~~vrl~~~~~~~q~~~~~~~~~e~g~~~~~~~I~~As~~dpyvll~~~~g~I~~l~~d~~~~~  691 (1004)
                      |+|+.||++ .+|||++.+||++-..+++.+|.+|          ++.+|+.+.++..+|++++++|+++||++|.++++
T Consensus       504 l~~~l~Gd~-slVQi~~d~iRhi~~~~r~~ew~~P----------~~~~Iv~~avnr~qiVvalSngelvyfe~d~sgql  572 (1205)
T KOG1898|consen  504 LACSLMGDD-SLVQIHPDGIRHIRPTKRINEWKTP----------ERVRIVKCAVNRRQIVVALSNGELVYFEGDVSGQL  572 (1205)
T ss_pred             EEEEEecCC-cEEEEchhhhhhcccccccccccCC----------CceEEEEEeecceEEEEEccCCeEEEEEeccCccc
Confidence            999999999 8999999999999988888889875          46889999999999999999999999999988887


Q ss_pred             Eeee-cccccccCCCceeEEEEeecCCCCcceecccccccccCccccccCCCCCCCCCCCcEEEEEEecCCeEEEEEcCC
Q 001853          692 VSVQ-TPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPN  770 (1004)
Q Consensus       692 l~~~-~~~~~~~~~~~i~~~~l~~d~~g~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~l~I~sLp~  770 (1004)
                      .|.. ++.+    +..+++.++-.+.-|                             .+.+-+|.+...++.++|++|.-
T Consensus       573 ~E~~er~tl----~~~vac~ai~~~~~g-----------------------------~krsrfla~a~~d~~vriisL~p  619 (1205)
T KOG1898|consen  573 NEFTERVTL----STDVACLAIGQDPEG-----------------------------EKRSRFLALASVDNMVRIISLDP  619 (1205)
T ss_pred             eeeeeeeee----ceeehhhccCCCCcc-----------------------------hhhcceeeeeccccceeEEEecC
Confidence            7764 4433    334554444333322                             23455899999999999999863


Q ss_pred             CeE--EEEecCcCccccccccccccccccccchhccCCCccccCCCCcccccccceEEEEEeecCCCCC----CcEEEEE
Q 001853          771 FNC--VFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHS----RPFLFAI  844 (1004)
Q Consensus       771 ~~~--v~~~~~~~~~~~~L~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~eill~~lg~~~~----~p~L~v~  844 (1004)
                      -.+  .++..+++                                        ..+..+++..+.....    .=||.+.
T Consensus       620 ~d~l~~ls~q~l~----------------------------------------~~~~s~~iv~~~~~~~~~~~~L~l~~G  659 (1205)
T KOG1898|consen  620 SDCLQPLSVQGLS----------------------------------------SPPESLCIVEMEATGGTDVAQLYLLIG  659 (1205)
T ss_pred             cceEEEccccccC----------------------------------------CCccceEEEEecccCCccceeEEEEec
Confidence            222  22211111                                        1234466666654432    5788888


Q ss_pred             eeCCcEEEEEEEeecCCCCCCCCCCCCcccccccccccccccccceeEEeccCCcCCCCCCCCCCCccceEEecccCCce
Q 001853          845 LTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFKNISGHQ  924 (1004)
Q Consensus       845 ~~~g~l~iY~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lrf~Kv~~~~~~~~~~~~~~~~~~l~~f~~i~g~s  924 (1004)
                      +.+|-++=+..   +..            .|          ..+.+|=++            .|.++..|.+|. ..|-+
T Consensus       660 L~NGvllR~~i---d~v------------~G----------~l~d~rtR~------------lG~~pvkLf~~~-~~~~s  701 (1205)
T KOG1898|consen  660 LRNGVLLRFVI---DTV------------TG----------QLLDIRTRF------------LGLRPVKLFPIS-MRGQS  701 (1205)
T ss_pred             ccccEEEEEEe---ccc------------cc----------ceeeeheee------------eccccceEEEEe-ecCcc
Confidence            88886654432   221            11          112222222            345566677774 57888


Q ss_pred             EEEecCCCCeEEEEcccccEEEeccCCCceEEEecCCCCCCCCcEEEEecCCcEEEEECCCC-Ccc-ccCccceEEeee
Q 001853          925 GFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSG-STY-DNYWPVQKVVFF 1001 (1004)
Q Consensus       925 gVFv~G~~P~~i~~~~~~l~~~~~~~~~~v~~f~~F~~~~~~~gfiy~~~~~~lri~~lp~~-~~~-d~~wp~rkvpl~ 1001 (1004)
                      .|....++|-.+++.+.++++.|..-+ +..-.+||-+..||.|..+.... .+||-++... ..+ .-.||.+--|.+
T Consensus       702 ~vL~lSsr~wl~y~~~~~~h~t~Isy~-~l~~as~~~S~qcpeGiv~i~~n-~l~i~~~~~~g~~~n~~~~~l~~tprk  778 (1205)
T KOG1898|consen  702 DVLALSSRPWLLYTYQQEFHLTPISYS-TLEHASPFCSEQCPEGIVAISKN-TLRIIALDKLGKVLNVDGFPLAYTPRK  778 (1205)
T ss_pred             eeEEecCChhhhhhhcceeeeeccccc-chhccccccccCCCcchhhhhhh-hhheeeehhhcccccccccccccCcce
Confidence            888888888666999999999999777 78899999999999998877666 7999888765 333 344555544443


No 5  
>PF10433 MMS1_N:  Mono-functional DNA-alkylating methyl methanesulfonate N-term; PDB: 2B5M_A 4A0K_C 4A0B_C 3I7L_A 2B5N_C 3I8E_A 4A09_A 4A0A_A 3EI4_C 2B5L_A ....
Probab=100.00  E-value=2.4e-55  Score=524.97  Aligned_cols=434  Identities=29%  Similarity=0.450  Sum_probs=299.4

Q ss_pred             cEEEEEECCCeEEEEEEeCCCCcEEEEEeeeecCcchhcccCCcccccCCCeEEECCCCCEEEEEEecCeEEEEEcccCC
Q 001853          131 DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGG  210 (1004)
Q Consensus       131 D~Llv~~~~aklsile~d~~~~~l~TvSlh~~E~~~~~~~k~g~~~~~~~~~l~VDP~~Rca~l~~~~~~L~ilP~~~~~  210 (1004)
                      |+|+|+|+++|+++|+||++++++.+.+.|+++.-    .++|.|+..++++++|||.|||+|+.+|++.+.|+|+.+..
T Consensus         1 D~L~v~tdsg~l~~l~~~~~~~~~~~~~v~~~~~~----~~~~~r~~~~G~~l~vDP~~R~i~v~a~e~~~~v~~l~~~~   76 (504)
T PF10433_consen    1 DSLVVTTDSGKLSILEYDPSTHGFFKEFVHQWEPL----SKSGSRLSQPGQYLAVDPSGRCIAVSAYEGNFLVYPLNRSL   76 (504)
T ss_dssp             -EEEEEETTTEEEEEEEEEETTEE-E-EEEEEEE-------SSSEB-TT--EEEE-TTSSEEEEEEBTTEEEEEE-SS--
T ss_pred             CEEEEEECCCCEEEEEEECCCCccceeeEEEeEec----CCCCCChhcCCcEEEECCcCCEEEEEecCCeEEEEEecccc
Confidence            79999999999999999999998866566776443    57888999999999999999999999999999999998711


Q ss_pred             CCCCCCCCCCCCCCCcccceeeeEEEEecccCCCceeeEEEec---CCCCceEEEEEEccCCcccceeeeeeeEEE--EE
Q 001853          211 SGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVH---GYIEPVMVILHERELTWAGRVSWKHHTCMI--SA  285 (1004)
Q Consensus       211 ~~l~~~d~~~~~~~~~~~~~~~s~~i~l~~ldi~nViD~~FL~---gy~ePtlaiLye~~~tw~gr~~~r~dt~~~--~~  285 (1004)
                       .     ...        .....+..++.+  ..+|+||||||   ||++|+||+||.+.+.|.      +..++.  ..
T Consensus        77 -~-----~~~--------~~~~~~~~pi~s--~~~i~~~~FL~~~~~~~~p~la~L~~~~~~~~------~~~~y~w~~~  134 (504)
T PF10433_consen   77 -D-----SDI--------AFSPHINSPIKS--EGNILDMCFLHPSVGYDNPTLAILYVDSQRRT------HLVTYEWSLD  134 (504)
T ss_dssp             --------T---------TT---EEEE--S---SEEEEEEEES---S-SS-EEEEEEEETT-EE------EEEEEE----
T ss_pred             -c-----ccc--------cccccccccccC--CceEEEEEEEecccCCCCceEEEEEEEecccc------eeEEEeeecc
Confidence             0     000        011222223311  49999999999   999999999999976522      222332  33


Q ss_pred             Eeecccccccc-c--eeeeccCCCCCcEEEEecCCCCeEEEEecCeEEEEeCCCc----eeEeecccccccCCCcCCCCC
Q 001853          286 LSISTTLKQHP-L--IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSAS----CALALNNYAVSLDSSQELPRS  358 (1004)
Q Consensus       286 ~sLn~~~k~~~-~--i~s~~~LP~d~~~lipvP~plGGvLVig~n~Iiy~dq~~~----~~v~vN~~~~~~t~~~~~~~~  358 (1004)
                      ..++...++.+ .  +|...++|   .+|||||.|.||+||++++.++|.++...    ...+++.....        +.
T Consensus       135 ~~l~~~~~~~~~~~~l~~~~~~p---~~LIPlp~~~ggllV~~~~~i~y~~~~~~~~~~~~~~~~~~~~~--------~~  203 (504)
T PF10433_consen  135 DGLNHVISKSTLPIRLPNEDELP---SFLIPLPNPPGGLLVGGENIIIYKNHLIGSGDYSFLSIPSPPSS--------SS  203 (504)
T ss_dssp             ----EETTTTEEEE--EEEE-TT---EEEEEE-TTT-SEEEEESSEEEEEE------TTEEEEE--H-HH--------HT
T ss_pred             cccceeeeeccccccccccCCCc---cEEEEcCCCCcEEEEECCEEEEEecccccccccccccccCCccC--------CC
Confidence            45555544433 2  66767777   99999999999999999999999976432    22222110000        11


Q ss_pred             ccEEEecc---eeEEEeeCCEEEEEeCCCCEEEEEEEECCceEeEEEEEecCC-CcccceEEEEcCC--eEEEEeeeCCe
Q 001853          359 SFSVELDA---AHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP-SVLTSDITTIGNS--LFFLGSRLGDS  432 (1004)
Q Consensus       359 ~~~i~l~~---~~~~~l~~~~~Ll~~~~G~L~~L~l~~dgr~V~~l~l~~~g~-~~~~S~l~~l~~g--~lFvGS~~GDS  432 (1004)
                      .+.+....   ......+.+++||++++|+||+|.+..+++   ++++.++|+ .+++++++++++|  +||+||++|||
T Consensus       204 ~~~~~~~~p~~~~~~~~~~~~~lL~~e~G~l~~l~l~~~~~---~i~i~~~g~~~~~~s~l~~l~~g~d~lf~gs~~gds  280 (504)
T PF10433_consen  204 SLWTSWARPERNISYDKDGDRILLQDEDGDLYLLTLDNDGG---SISITYLGTLCSIASSLTYLKNGGDYLFVGSEFGDS  280 (504)
T ss_dssp             S-EEEEEE------SSTTSSEEEEEETTSEEEEEEEEEEEE---EEEEEEEEE--S-ESEEEEESTT--EEEEEESSS-E
T ss_pred             ceEEEEEeccccceecCCCCEEEEEeCCCeEEEEEEEECCC---eEEEEEcCCcCChhheEEEEcCCCEEEEEEEecCCc
Confidence            22221000   000233457999999999999999999877   799999999 9999999999999  99999999999


Q ss_pred             eEEEEeeCCCcccccCCCccccCCcccCCccccccccCCcccccccccccccccccCCCCCcccccceeeEEEeeeeccc
Q 001853          433 LLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNI  512 (1004)
Q Consensus       433 ~Ll~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~ly~~~~~~~~~~~~~~~l~v~Dsl~Ni  512 (1004)
                      +|+++...                                                             .++++|+++|+
T Consensus       281 ~l~~~~~~-------------------------------------------------------------~l~~~~~~~N~  299 (504)
T PF10433_consen  281 QLLQISLS-------------------------------------------------------------NLEVLDSLPNW  299 (504)
T ss_dssp             EEEEEESE-------------------------------------------------------------SEEEEEEE---
T ss_pred             EEEEEeCC-------------------------------------------------------------CcEEEEeccCc
Confidence            99998630                                                             48999999999


Q ss_pred             CCcccccccccccCC----------CCceeccCCceEEE--------------EeCCCccEeEEEeecCCCCCCCCcccc
Q 001853          513 GPLKDFSYGLRINAD----------ASATGISKQSNYEL--------------VELPGCKGIWTVYHKSSRGHNADSSRM  568 (1004)
Q Consensus       513 gPI~D~~vg~~~~~~----------~~~sG~g~~GsL~v--------------~~lpg~~~iWtv~~~~~~~~~~~~~~~  568 (1004)
                      |||.||++++.....          .+|||.|++|+|++              .+++++++||+++...           
T Consensus       300 ~Pi~D~~v~~~~~~~~~~~~~~~~lv~~sG~g~~gsL~~lr~Gi~~~~~~~~~~~l~~v~~iW~l~~~~-----------  368 (504)
T PF10433_consen  300 GPIVDFCVVDSSNSGQPSNPSSDQLVACSGAGKRGSLRILRNGIGIEGLELASSELPGVTGIWTLKLSS-----------  368 (504)
T ss_dssp             -SEEEEEEE-TSSSSS-------EEEEEESSGGG-EEEEEEESBEEE--EEEEEEESTEEEEEEE-SSS-----------
T ss_pred             CCccceEEeccccCCCCcccccceEEEEECcCCCCcEEEEeccCCceeeeeeccCCCCceEEEEeeecC-----------
Confidence            999999998653221          25999999999999              3688999999998531           


Q ss_pred             cccCcccccEEEEEecCceEEEEec-----CceeEEecCCCccccCCeEEEEEeCCCcEEEEEecCcEEEEeCC--ccee
Q 001853          569 AAYDDEYHAYLIISLEARTMVLETA-----DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS--YMTQ  641 (1004)
Q Consensus       569 ~~~~~~~~~yLilS~~~~T~Vl~~~-----~~l~ev~~~~~F~~~~~TI~ag~l~~~~~IvQVt~~~vrl~~~~--~~~q  641 (1004)
                          +. |.|||+|+.++|+||+++     ++++|++ ..+|.++++||+||+++++ ++||||+++||+++..  ...+
T Consensus       369 ----~~-~~~lv~S~~~~T~vl~~~~~d~~e~~~e~~-~~~f~~~~~Tl~~~~~~~~-~ivQVt~~~i~l~~~~~~~~~~  441 (504)
T PF10433_consen  369 ----SD-HSYLVLSFPNETRVLQISEGDDGEEVEEVE-EDGFDTDEPTLAAGNVGDG-RIVQVTPKGIRLIDLEDGKLTQ  441 (504)
T ss_dssp             ----SS-BSEEEEEESSEEEEEEES----SSEEEEE----TS-SSS-EEEEEEETTT-EEEEEESSEEEEEESSSTSEEE
T ss_pred             ----CC-ceEEEEEcCCceEEEEEecccCCcchhhhh-hccCCCCCCCeEEEEcCCC-eEEEEecCeEEEEECCCCeEEE
Confidence                12 999999999999999984     5677775 4499999999999999966 9999999999999844  4577


Q ss_pred             EEeCCCCCCCCCCCCCCccEEEEEEcCCEEEEEEeCCeEEEEEecCCCceEe
Q 001853          642 DLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVS  693 (1004)
Q Consensus       642 ~~~~~~~~~e~g~~~~~~~I~~As~~dpyvll~~~~g~I~~l~~d~~~~~l~  693 (1004)
                      .|.++          .+..|++|+++++|++|++.++.+.+|+++......+
T Consensus       442 ~w~~~----------~~~~I~~a~~~~~~v~v~~~~~~~~~~~~~~~~~~~~  483 (504)
T PF10433_consen  442 EWKPP----------AGSIIVAASINDPQVLVALSGGELVYFELDDNKISVS  483 (504)
T ss_dssp             EEE-T----------TS---SEEEESSSEEEEEE-TTEEEEEEEETTEEEEE
T ss_pred             EEeCC----------CCCeEEEEEECCCEEEEEEeCCcEEEEEEECCceeee
Confidence            89874          2467999999999999999999999999998755444


No 6  
>PF08596 Lgl_C:  Lethal giant larvae(Lgl) like, C-terminal;  InterPro: IPR013905  The Lethal giant larvae (Lgl) tumour suppressor protein is conserved from yeast to mammals. The Lgl protein functions in cell polarity, at least in part, by regulating SNARE-mediated membrane delivery events at the cell surface []. The N-terminal half of Lgl members contains WD40 repeats (see IPR001680 from INTERPRO), while the C-terminal half appears specific to the protein []. ; PDB: 2OAJ_A.
Probab=90.32  E-value=8.9  Score=44.94  Aligned_cols=75  Identities=16%  Similarity=0.348  Sum_probs=50.4

Q ss_pred             EEEEEecCCeEEEEEcCCCeEEEEecCcCccccccccccccccccccchhccCCCccccCCCCcccccccceEEEEEeec
Q 001853          753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRW  832 (1004)
Q Consensus       753 ~l~~~~~~g~l~I~sLp~~~~v~~~~~~~~~~~~L~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~eill~~l  832 (1004)
                      ++++..++|.|.|+.+-.-.++|. +++..  ..+         .                  +........-|-.+..+
T Consensus        99 Fvaigy~~G~l~viD~RGPavI~~-~~i~~--~~~---------~------------------~~~~~~vt~ieF~vm~~  148 (395)
T PF08596_consen   99 FVAIGYESGSLVVIDLRGPAVIYN-ENIRE--SFL---------S------------------KSSSSYVTSIEFSVMTL  148 (395)
T ss_dssp             EEEEEETTSEEEEEETTTTEEEEE-EEGGG----T----------------------------SS----EEEEEEEEEE-
T ss_pred             EEEEEecCCcEEEEECCCCeEEee-ccccc--ccc---------c------------------cccccCeeEEEEEEEec
Confidence            677888999999999988888888 54442  000         0                  00011122345556667


Q ss_pred             CCCC-CCcEEEEEeeCCcEEEEEEEe
Q 001853          833 SAHH-SRPFLFAILTDGTILCYQAYL  857 (1004)
Q Consensus       833 g~~~-~~p~L~v~~~~g~l~iY~~f~  857 (1004)
                      |++. +.|.|+|.+..|++++|+..+
T Consensus       149 ~~D~ySSi~L~vGTn~G~v~~fkIlp  174 (395)
T PF08596_consen  149 GGDGYSSICLLVGTNSGNVLTFKILP  174 (395)
T ss_dssp             TTSSSEEEEEEEEETTSEEEEEEEEE
T ss_pred             CCCcccceEEEEEeCCCCEEEEEEec
Confidence            7654 679999999999999999975


No 7  
>COG4247 Phy 3-phytase (myo-inositol-hexaphosphate 3-phosphohydrolase) [Lipid metabolism]
Probab=90.08  E-value=4.4  Score=43.62  Aligned_cols=128  Identities=18%  Similarity=0.191  Sum_probs=79.7

Q ss_pred             cccCCeEEEEEeCCCc--EEEEEecCcEEEEeCCc-ceeEEeCCCCC-CC--CCCCCCCccEEEEEEcCCEEEEEEeCCe
Q 001853          606 FVQGRTIAAGNLFGRR--RVIQVFERGARILDGSY-MTQDLSFGPSN-SE--SGSGSENSTVLSVSIADPYVLLGMSDGS  679 (1004)
Q Consensus       606 ~~~~~TI~ag~l~~~~--~IvQVt~~~vrl~~~~~-~~q~~~~~~~~-~e--~g~~~~~~~I~~As~~dpyvll~~~~g~  679 (1004)
                      ..+.|-|++..-.-.+  .|--+-..++|+||-.+ +.|.+++...+ .|  -|.+..|..|.-|...|.+      ...
T Consensus        52 aADDPAIwVh~t~P~kS~vItt~Kk~Gl~VYDLsGkqLqs~~~Gk~NNVDLrygF~LgG~~idiaaASdR~------~~~  125 (364)
T COG4247          52 AADDPAIWVHATNPDKSLVITTVKKAGLRVYDLSGKQLQSVNPGKYNNVDLRYGFQLGGQSIDIAAASDRQ------NDK  125 (364)
T ss_pred             ccCCcceEeccCCcCcceEEEeeccCCeEEEecCCCeeeecCCCcccccccccCcccCCeEEEEEeccccc------CCe
Confidence            3566777777654332  34445577899999764 56666543222 12  1222234456655555543      789


Q ss_pred             EEEEEecCCCceEeee-ccc-ccccCCCceeEEEEeecCCCCcceecccccccccCccccccCCCCCCCCCCCcEEEEEE
Q 001853          680 IRLLVGDPSTCTVSVQ-TPA-AIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVC  757 (1004)
Q Consensus       680 I~~l~~d~~~~~l~~~-~~~-~~~~~~~~i~~~~l~~d~~g~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~  757 (1004)
                      |.+|.+|++...|+-. .+. ...+..+..-.+|||++.                               ....+++|+.
T Consensus       126 i~~y~Idp~~~~L~sitD~n~p~ss~~s~~YGl~lyrs~-------------------------------ktgd~yvfV~  174 (364)
T COG4247         126 IVFYKIDPNPQYLESITDSNAPYSSSSSSAYGLALYRSP-------------------------------KTGDYYVFVN  174 (364)
T ss_pred             EEEEEeCCCccceeeccCCCCccccCcccceeeEEEecC-------------------------------CcCcEEEEEe
Confidence            9999999988777733 221 111113445678898875                               3357899999


Q ss_pred             ecCCeEEEEEcCC
Q 001853          758 YESGALEIFDVPN  770 (1004)
Q Consensus       758 ~~~g~l~I~sLp~  770 (1004)
                      +..|.++=|+|-+
T Consensus       175 ~~qG~~~Qy~l~d  187 (364)
T COG4247         175 RRQGDIAQYKLID  187 (364)
T ss_pred             cCCCceeEEEEEe
Confidence            9999999888754


No 8  
>PF14727 PHTB1_N:  PTHB1 N-terminus
Probab=90.06  E-value=31  Score=40.70  Aligned_cols=69  Identities=19%  Similarity=0.268  Sum_probs=54.1

Q ss_pred             ceEEecccCCceEEEecCCCCeEEEEcccccEEEeccCCCceEEEecCCC--CCCCC---cEEEEecCCcEEEEE
Q 001853          913 RITIFKNISGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHN--VNCNH---GFIYVTSQGILKICQ  982 (1004)
Q Consensus       913 ~l~~f~~i~g~sgVFv~G~~P~~i~~~~~~l~~~~~~~~~~v~~f~~F~~--~~~~~---gfiy~~~~~~lri~~  982 (1004)
                      .+.....-+.-+.|+|-|+|-.+.....|.+++-.. .|..-.||++|..  .+-++   -+|+.|+++.|.|-+
T Consensus       243 ~i~v~~~~~~~~~IvvLger~Lf~l~~~G~l~~~kr-Ld~~p~~~~~Y~~~~~~~~~~~~~llV~t~t~~LlVy~  316 (418)
T PF14727_consen  243 DIQVVRFSSSESDIVVLGERSLFCLKDNGSLRFQKR-LDYNPSCFCPYRVPWYNEPSTRLNLLVGTHTGTLLVYE  316 (418)
T ss_pred             EEEEEEcCCCCceEEEEecceEEEEcCCCeEEEEEe-cCCceeeEEEEEeecccCCCCceEEEEEecCCeEEEEe
Confidence            344443334778999999999999999999999864 7889999999998  44443   299999999887754


No 9  
>KOG0294 consensus WD40 repeat-containing protein [Function unknown]
Probab=87.76  E-value=16  Score=40.65  Aligned_cols=81  Identities=14%  Similarity=0.274  Sum_probs=60.6

Q ss_pred             ccEEEEEEcCCEEEEEEeCCeEEEEEecCCCceEeeecccccccCCCceeEEEEeecCCCCcceecccccccccCccccc
Q 001853          659 STVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEA  738 (1004)
Q Consensus       659 ~~I~~As~~dpyvll~~~~g~I~~l~~d~~~~~l~~~~~~~~~~~~~~i~~~~l~~d~~g~~~f~~~~~~~~~~~~~~~~  738 (1004)
                      ..|++.+++.||++=..+|.+|.+|.+..... +.    ..+. ....|+|+..|.+.+                     
T Consensus        44 ~sitavAVs~~~~aSGssDetI~IYDm~k~~q-lg----~ll~-HagsitaL~F~~~~S---------------------   96 (362)
T KOG0294|consen   44 GSITALAVSGPYVASGSSDETIHIYDMRKRKQ-LG----ILLS-HAGSITALKFYPPLS---------------------   96 (362)
T ss_pred             cceeEEEecceeEeccCCCCcEEEEeccchhh-hc----ceec-cccceEEEEecCCcc---------------------
Confidence            34999999999999999999999999876422 11    1111 155688776654432                     


Q ss_pred             cCCCCCCCCCCCcEEEEEEecCCeEEEEEcCCCeEEEEec
Q 001853          739 IDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVD  778 (1004)
Q Consensus       739 ~~~~~~~~~~~~~~~l~~~~~~g~l~I~sLp~~~~v~~~~  778 (1004)
                                  .-||+-+-+||.+.||+..+++++-..+
T Consensus        97 ------------~shLlS~sdDG~i~iw~~~~W~~~~slK  124 (362)
T KOG0294|consen   97 ------------KSHLLSGSDDGHIIIWRVGSWELLKSLK  124 (362)
T ss_pred             ------------hhheeeecCCCcEEEEEcCCeEEeeeec
Confidence                        1289999999999999999998877654


No 10 
>PF14727 PHTB1_N:  PTHB1 N-terminus
Probab=85.34  E-value=87  Score=37.06  Aligned_cols=76  Identities=21%  Similarity=0.275  Sum_probs=60.3

Q ss_pred             CeEEEEcCCEEEEEEEEecccccccccCCccccccccccccccccEEEEEEEEeeeeeeEeEEEecCCCCCCCCccEEEE
Q 001853           56 PNLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIIL  135 (1004)
Q Consensus        56 ~nLVvak~n~LeIy~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~kL~lv~e~~l~G~I~~l~~vr~~~s~~~~~~D~Llv  135 (1004)
                      ..|.|--.+.|.||.+...+..         ...      .+..+|+++.++.+.-+.-+|..-+..+..   ++|.|.|
T Consensus        89 ~~LaVLhP~kl~vY~v~~~~g~---------~~~------g~~~~L~~~yeh~l~~~a~nm~~G~Fgg~~---~~~~IcV  150 (418)
T PF14727_consen   89 LQLAVLHPRKLSVYSVSLVDGT---------VEH------GNQYQLELIYEHSLQRTAYNMCCGPFGGVK---GRDFICV  150 (418)
T ss_pred             ceEEEecCCEEEEEEEEecCCC---------ccc------CcEEEEEEEEEEecccceeEEEEEECCCCC---CceEEEE
Confidence            6899999999999999643210         000      112479999999999999999999988872   4999999


Q ss_pred             EECCCeEEEEEEeC
Q 001853          136 AFEDAKISVLEFDD  149 (1004)
Q Consensus       136 ~~~~aklsile~d~  149 (1004)
                      =+-|++|++.+-|.
T Consensus       151 QS~DG~L~~feqe~  164 (418)
T PF14727_consen  151 QSMDGSLSFFEQES  164 (418)
T ss_pred             EecCceEEEEeCCc
Confidence            99999999997664


No 11 
>COG5161 SFT1 Pre-mRNA cleavage and polyadenylation specificity factor [RNA processing and modification]
Probab=84.85  E-value=0.22  Score=60.68  Aligned_cols=90  Identities=16%  Similarity=-0.025  Sum_probs=71.8

Q ss_pred             cEEEEEEEEeeeeeeEeEEEecCCCCCCCCccEEEEEECCCeEEEEEEeCCCCcEEEEEeeeecCcchhcccCCcccccC
Q 001853          100 SLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFAR  179 (1004)
Q Consensus       100 kL~lv~e~~l~G~I~~l~~vr~~~s~~~~~~D~Llv~~~~aklsile~d~~~~~l~TvSlh~~E~~~~~~~k~g~~~~~~  179 (1004)
                      -|++..+.+.|++|. |..++.+.+    +    ...++-||++.+|||+..    +-++||+|+-.   --..+ ....
T Consensus        88 ~lrf~sq~n~f~Tis-lhyyeGKfk----g----ksLvelak~stle~D~~s----scaLlfneDi~---~flpf-hvnk  150 (1319)
T COG5161          88 LLRFDSQANEFRTIS-LHYYEGKFK----G----KSLVELAKFSTLEFDIRS----SCALLFNEDIG---NFLPF-HVNK  150 (1319)
T ss_pred             EEEehhhcccceeEE-EeeeccccC----C----chhhhhhhhhheeeccCc----cchhhhhhhhh---hcccc-cccC
Confidence            588889999999999 999998877    4    456788999999999986    66889999852   00111 1233


Q ss_pred             CCeEEECCCCCEEEEEEecCeEEEEEc
Q 001853          180 GPLVKVDPQGRCGGVLVYGLQMIILKA  206 (1004)
Q Consensus       180 ~~~l~VDP~~Rca~l~~~~~~L~ilP~  206 (1004)
                      .....|||+..|.++.+-.++++++|-
T Consensus       151 ndddev~~d~D~~~~~~~~~h~~i~ps  177 (1319)
T COG5161         151 NDDDEVRIDVDLGMFQMSKRHFSIFPS  177 (1319)
T ss_pred             CccccccccccccHHHHHHHHhhcCCC
Confidence            557889999999999999999999995


No 12 
>KOG1274 consensus WD40 repeat protein [General function prediction only]
Probab=81.80  E-value=1e+02  Score=39.14  Aligned_cols=147  Identities=13%  Similarity=0.086  Sum_probs=84.7

Q ss_pred             EEEEEecC-ceEEEEecCceeEEecCCCccc-cCCeEEEEEeCCCcEEEEEecCcEEEEeCCc-----ceeEEeCCCCCC
Q 001853          578 YLIISLEA-RTMVLETADLLTEVTESVDYFV-QGRTIAAGNLFGRRRVIQVFERGARILDGSY-----MTQDLSFGPSNS  650 (1004)
Q Consensus       578 yLilS~~~-~T~Vl~~~~~l~ev~~~~~F~~-~~~TI~ag~l~~~~~IvQVt~~~vrl~~~~~-----~~q~~~~~~~~~  650 (1004)
                      ||+....+ .+++++.....+    .++++. .+.++.+-....+..+.=.-.+-|.+|.-..     .+....++    
T Consensus        27 fi~tcgsdg~ir~~~~~sd~e----~P~ti~~~g~~v~~ia~~s~~f~~~s~~~tv~~y~fps~~~~~iL~Rftlp----   98 (933)
T KOG1274|consen   27 FICTCGSDGDIRKWKTNSDEE----EPETIDISGELVSSIACYSNHFLTGSEQNTVLRYKFPSGEEDTILARFTLP----   98 (933)
T ss_pred             EEEEecCCCceEEeecCCccc----CCchhhccCceeEEEeecccceEEeeccceEEEeeCCCCCccceeeeeecc----
Confidence            66666554 466776544332    344543 4444444433333234333344454444221     11122221    


Q ss_pred             CCCCCCCCccEEEEEEcCCEEEEEEeCCeEEEEEecCCCceEeeecccccccCCCceeEEEEeecCCCCcceeccccccc
Q 001853          651 ESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAW  730 (1004)
Q Consensus       651 e~g~~~~~~~I~~As~~dpyvll~~~~g~I~~l~~d~~~~~l~~~~~~~~~~~~~~i~~~~l~~d~~g~~~f~~~~~~~~  730 (1004)
                              .+.++.+..+.+++.+.+|-.|.++.++..+....+...      +.+++++++  ++              
T Consensus        99 --------~r~~~v~g~g~~iaagsdD~~vK~~~~~D~s~~~~lrgh------~apVl~l~~--~p--------------  148 (933)
T KOG1274|consen   99 --------IRDLAVSGSGKMIAAGSDDTAVKLLNLDDSSQEKVLRGH------DAPVLQLSY--DP--------------  148 (933)
T ss_pred             --------ceEEEEecCCcEEEeecCceeEEEEeccccchheeeccc------CCceeeeeE--cC--------------
Confidence                    334555555679999999999999999876543322211      455775544  22              


Q ss_pred             ccCccccccCCCCCCCCCCCcEEEEEEecCCeEEEEEcCCCeEEEEecCcC
Q 001853          731 LSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFV  781 (1004)
Q Consensus       731 ~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~l~I~sLp~~~~v~~~~~~~  781 (1004)
                                         ...+|++..-||.|.||++.+..+.++..++.
T Consensus       149 -------------------~~~fLAvss~dG~v~iw~~~~~~~~~tl~~v~  180 (933)
T KOG1274|consen  149 -------------------KGNFLAVSSCDGKVQIWDLQDGILSKTLTGVD  180 (933)
T ss_pred             -------------------CCCEEEEEecCceEEEEEcccchhhhhcccCC
Confidence                               23578888999999999999988777655443


No 13 
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=80.81  E-value=1.1e+02  Score=34.89  Aligned_cols=69  Identities=17%  Similarity=0.179  Sum_probs=49.8

Q ss_pred             CccEEEEEEC---CCeEEEEEEeCCCCcEEEEEeeeecCcchhcccCCcccccCCCeEEECCCCCEEEEEEe-cCeEEEE
Q 001853          129 RRDSIILAFE---DAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIIL  204 (1004)
Q Consensus       129 ~~D~Llv~~~---~aklsile~d~~~~~l~TvSlh~~E~~~~~~~k~g~~~~~~~~~l~VDP~~Rca~l~~~-~~~L~il  204 (1004)
                      +...|.+.-+   .+.++..+||++.++|.-+. +..        -.|    .++-++.+|++||.....-| .+.+.++
T Consensus        50 ~~~~LY~v~~~~~~ggvaay~iD~~~G~Lt~ln-~~~--------~~g----~~p~yvsvd~~g~~vf~AnY~~g~v~v~  116 (346)
T COG2706          50 DQRHLYVVNEPGEEGGVAAYRIDPDDGRLTFLN-RQT--------LPG----SPPCYVSVDEDGRFVFVANYHSGSVSVY  116 (346)
T ss_pred             CCCEEEEEEecCCcCcEEEEEEcCCCCeEEEee-ccc--------cCC----CCCeEEEECCCCCEEEEEEccCceEEEE
Confidence            3445554433   69999999999988875442 221        112    22368999999999999999 8999999


Q ss_pred             EcccCC
Q 001853          205 KASQGG  210 (1004)
Q Consensus       205 P~~~~~  210 (1004)
                      |+...+
T Consensus       117 p~~~dG  122 (346)
T COG2706         117 PLQADG  122 (346)
T ss_pred             EcccCC
Confidence            997654


No 14 
>PF14783 BBS2_Mid:  Ciliary BBSome complex subunit 2, middle region
Probab=80.75  E-value=51  Score=31.44  Aligned_cols=92  Identities=13%  Similarity=0.218  Sum_probs=59.1

Q ss_pred             EEEEecCcEEEEeCCcceeEEeCCCCCCCCCCCCCCccEEEEEEcCCEEEEEEeCCeEEEEEecCCCceEeeeccccccc
Q 001853          623 VIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIES  702 (1004)
Q Consensus       623 IvQVt~~~vrl~~~~~~~q~~~~~~~~~e~g~~~~~~~I~~As~~dpyvll~~~~g~I~~l~~d~~~~~l~~~~~~~~~~  702 (1004)
                      +|==....||+++.+..+.++.-.      +     .-+.-+.+...+.+-++.+|+|.+|+....   +=..+.     
T Consensus        19 lvGs~D~~IRvf~~~e~~~Ei~e~------~-----~v~~L~~~~~~~F~Y~l~NGTVGvY~~~~R---lWRiKS-----   79 (111)
T PF14783_consen   19 LVGSDDFEIRVFKGDEIVAEITET------D-----KVTSLCSLGGGRFAYALANGTVGVYDRSQR---LWRIKS-----   79 (111)
T ss_pred             EEecCCcEEEEEeCCcEEEEEecc------c-----ceEEEEEcCCCEEEEEecCCEEEEEeCcce---eeeecc-----
Confidence            333345678999888766555432      1     235566677888999999999999976432   211111     


Q ss_pred             CCCceeEEEEeecCCCCcceecccccccccCccccccCCCCCCCCCCCcEEEEEEecCCeEEE
Q 001853          703 SKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEI  765 (1004)
Q Consensus       703 ~~~~i~~~~l~~d~~g~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~l~I  765 (1004)
                       +.++.+++.|.- +|                              ....=|++.|.||.+++
T Consensus        80 -K~~~~~~~~~D~-~g------------------------------dG~~eLI~GwsnGkve~  110 (111)
T PF14783_consen   80 -KNQVTSMAFYDI-NG------------------------------DGVPELIVGWSNGKVEV  110 (111)
T ss_pred             -CCCeEEEEEEcC-CC------------------------------CCceEEEEEecCCeEEe
Confidence             455777776632 22                              12345899999999986


No 15 
>KOG1539 consensus WD repeat protein [General function prediction only]
Probab=78.29  E-value=1e+02  Score=38.79  Aligned_cols=81  Identities=14%  Similarity=0.183  Sum_probs=57.1

Q ss_pred             EEEEEE--cCCEEEEEEeCCeEEEEEecCCCceEeeecccccccCCCceeEEEEeecCCCCcceecccccccccCccccc
Q 001853          661 VLSVSI--ADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEA  738 (1004)
Q Consensus       661 I~~As~--~dpyvll~~~~g~I~~l~~d~~~~~l~~~~~~~~~~~~~~i~~~~l~~d~~g~~~f~~~~~~~~~~~~~~~~  738 (1004)
                      |+++.-  +=..|+|.+.+|+|++|.+..+....+.. .+     ..+|+++++-.|                       
T Consensus       205 IT~ieqsPaLDVVaiG~~~G~ViifNlK~dkil~sFk-~d-----~g~VtslSFrtD-----------------------  255 (910)
T KOG1539|consen  205 ITAIEQSPALDVVAIGLENGTVIIFNLKFDKILMSFK-QD-----WGRVTSLSFRTD-----------------------  255 (910)
T ss_pred             eeEeccCCcceEEEEeccCceEEEEEcccCcEEEEEE-cc-----ccceeEEEeccC-----------------------
Confidence            554443  24577889999999999997764333322 11     356888775322                       


Q ss_pred             cCCCCCCCCCCCcEEEEEEecCCeEEEEEcCCCeEEEEecCcC
Q 001853          739 IDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFV  781 (1004)
Q Consensus       739 ~~~~~~~~~~~~~~~l~~~~~~g~l~I~sLp~~~~v~~~~~~~  781 (1004)
                                 +...++..+.+|.|.||.|.+-+++.++.+..
T Consensus       256 -----------G~p~las~~~~G~m~~wDLe~kkl~~v~~nah  287 (910)
T KOG1539|consen  256 -----------GNPLLASGRSNGDMAFWDLEKKKLINVTRNAH  287 (910)
T ss_pred             -----------CCeeEEeccCCceEEEEEcCCCeeeeeeeccc
Confidence                       24678889999999999999988888776554


No 16 
>KOG0649 consensus WD40 repeat protein [General function prediction only]
Probab=77.41  E-value=1e+02  Score=33.37  Aligned_cols=73  Identities=15%  Similarity=0.223  Sum_probs=44.0

Q ss_pred             cCCEEEEEEeCCeEEEEEecCCCc--eEeee-cccccccCCCceeEEEEeecCCCCcceecccccccccCccccccCCCC
Q 001853          667 ADPYVLLGMSDGSIRLLVGDPSTC--TVSVQ-TPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGAD  743 (1004)
Q Consensus       667 ~dpyvll~~~~g~I~~l~~d~~~~--~l~~~-~~~~~~~~~~~i~~~~l~~d~~g~~~f~~~~~~~~~~~~~~~~~~~~~  743 (1004)
                      ..++|+-..+||++.++......+  .|+.. .+..+.+.-++|. +||..                             
T Consensus       167 ~~~qilsG~EDGtvRvWd~kt~k~v~~ie~yk~~~~lRp~~g~wi-gala~-----------------------------  216 (325)
T KOG0649|consen  167 ANGQILSGAEDGTVRVWDTKTQKHVSMIEPYKNPNLLRPDWGKWI-GALAV-----------------------------  216 (325)
T ss_pred             cCcceeecCCCccEEEEeccccceeEEeccccChhhcCcccCcee-EEEec-----------------------------
Confidence            488999999999999998865432  23322 2222221122222 23311                             


Q ss_pred             CCCCCCCcEEEEEEecCCeEEEEEcCCCeEEEE
Q 001853          744 GGPLDQGDIYSVVCYESGALEIFDVPNFNCVFT  776 (1004)
Q Consensus       744 ~~~~~~~~~~l~~~~~~g~l~I~sLp~~~~v~~  776 (1004)
                            ...|+ +|...-.+.+|.||..+++..
T Consensus       217 ------~edWl-vCGgGp~lslwhLrsse~t~v  242 (325)
T KOG0649|consen  217 ------NEDWL-VCGGGPKLSLWHLRSSESTCV  242 (325)
T ss_pred             ------cCceE-EecCCCceeEEeccCCCceEE
Confidence                  13475 456677899999999777555


No 17 
>PF03178 CPSF_A:  CPSF A subunit region;  InterPro: IPR004871 This family includes a region that lies towards the C terminus of the cleavage and polyadenylation specificity factor (CPSF) A (160 kDa) subunit. CPSF is involved in mRNA polyadenylation and binds the AAUAAA conserved sequence in pre-mRNA. CPSF has also been found to be necessary for splicing of single-intron pre-mRNAs []. The function of the aligned region is unknown but may be involved in RNA/DNA binding.; GO: 0003676 nucleic acid binding, 0005634 nucleus; PDB: 2B5M_A 4A0K_C 4A0B_C 3I7L_A 3I8E_A 4A09_A 4A0A_A 3EI4_C 2B5L_A 3I7O_A ....
Probab=73.86  E-value=19  Score=40.65  Aligned_cols=69  Identities=20%  Similarity=0.317  Sum_probs=57.1

Q ss_pred             CeEEEEcCCEEEEEEEEecccccccccCCccccccccccccccccEEEEEEEEeeeeeeEeEEEecCCCCCCCCccEEEE
Q 001853           56 PNLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIIL  135 (1004)
Q Consensus        56 ~nLVvak~n~LeIy~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~kL~lv~e~~l~G~I~~l~~vr~~~s~~~~~~D~Llv  135 (1004)
                      ..||+|-++.|.||++..+.                        +|..++.+...-.|++|..+          .+.|+|
T Consensus        99 ~~lv~~~g~~l~v~~l~~~~------------------------~l~~~~~~~~~~~i~sl~~~----------~~~I~v  144 (321)
T PF03178_consen   99 GRLVVAVGNKLYVYDLDNSK------------------------TLLKKAFYDSPFYITSLSVF----------KNYILV  144 (321)
T ss_dssp             TEEEEEETTEEEEEEEETTS------------------------SEEEEEEE-BSSSEEEEEEE----------TTEEEE
T ss_pred             CEEEEeecCEEEEEEccCcc------------------------cchhhheecceEEEEEEecc----------ccEEEE
Confidence            56999999999999997542                        39999999998899999885          369999


Q ss_pred             EECCCeEEEEEEeCCCCcEEEEE
Q 001853          136 AFEDAKISVLEFDDSIHGLRITS  158 (1004)
Q Consensus       136 ~~~~aklsile~d~~~~~l~TvS  158 (1004)
                      +.-..-+++++|+.+.++|.-++
T Consensus       145 gD~~~sv~~~~~~~~~~~l~~va  167 (321)
T PF03178_consen  145 GDAMKSVSLLRYDEENNKLILVA  167 (321)
T ss_dssp             EESSSSEEEEEEETTTE-EEEEE
T ss_pred             EEcccCEEEEEEEccCCEEEEEE
Confidence            99999999999999777787665


No 18 
>KOG0294 consensus WD40 repeat-containing protein [Function unknown]
Probab=70.30  E-value=1.9e+02  Score=32.55  Aligned_cols=95  Identities=14%  Similarity=0.244  Sum_probs=63.5

Q ss_pred             CCEEEEEEEEecccccccccCCccccccccccccccccEEEEEEEEeeeeeeEeEEEecCCCCCCCCccEEEEEECCCeE
Q 001853           63 ANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKI  142 (1004)
Q Consensus        63 ~n~LeIy~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~kL~lv~e~~l~G~I~~l~~vr~~~s~~~~~~D~Llv~~~~akl  142 (1004)
                      ..+|.||++....                        .|.-+..+  .|.|+.|+-..+..      ...||=+.+|+++
T Consensus        62 DetI~IYDm~k~~------------------------qlg~ll~H--agsitaL~F~~~~S------~shLlS~sdDG~i  109 (362)
T KOG0294|consen   62 DETIHIYDMRKRK------------------------QLGILLSH--AGSITALKFYPPLS------KSHLLSGSDDGHI  109 (362)
T ss_pred             CCcEEEEeccchh------------------------hhcceecc--ccceEEEEecCCcc------hhheeeecCCCcE
Confidence            4589999987532                        24445554  79999988666553      3499999999999


Q ss_pred             EEEEEeCCCCcEEEEEeeeecCcchhcccCCcccccCCCeEEECCCCCEEEEEEe-cCeEEEEEcc
Q 001853          143 SVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKAS  207 (1004)
Q Consensus       143 sile~d~~~~~l~TvSlh~~E~~~~~~~k~g~~~~~~~~~l~VDP~~Rca~l~~~-~~~L~ilP~~  207 (1004)
                      ++.+-    ..++++  |.+        |..--+   ...+.++|.|+.| |.++ +..|...-+.
T Consensus       110 ~iw~~----~~W~~~--~sl--------K~H~~~---Vt~lsiHPS~KLA-LsVg~D~~lr~WNLV  157 (362)
T KOG0294|consen  110 IIWRV----GSWELL--KSL--------KAHKGQ---VTDLSIHPSGKLA-LSVGGDQVLRTWNLV  157 (362)
T ss_pred             EEEEc----CCeEEe--eee--------cccccc---cceeEecCCCceE-EEEcCCceeeeehhh
Confidence            88653    345554  554        211111   4679999999987 5666 6667666554


No 19 
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=69.66  E-value=90  Score=35.55  Aligned_cols=71  Identities=13%  Similarity=0.134  Sum_probs=45.2

Q ss_pred             ccEEEEEEC-CCeEEEEEEeCCCCcEEEEEeeeecCcchhcccCCcccccCCCeEEECCCCCEEEEEEe-cCeEEEEEcc
Q 001853          130 RDSIILAFE-DAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKAS  207 (1004)
Q Consensus       130 ~D~Llv~~~-~aklsile~d~~~~~l~TvSlh~~E~~~~~~~k~g~~~~~~~~~l~VDP~~Rca~l~~~-~~~L~ilP~~  207 (1004)
                      ..+.-+..+ +..+.+++||+..++|+.+=-+.-       +...+....+..-+.+.|+||+.-.+=- .+.|+++-..
T Consensus       202 ~k~aY~v~EL~stV~v~~y~~~~g~~~~lQ~i~t-------lP~dF~g~~~~aaIhis~dGrFLYasNRg~dsI~~f~V~  274 (346)
T COG2706         202 GKYAYLVNELNSTVDVLEYNPAVGKFEELQTIDT-------LPEDFTGTNWAAAIHISPDGRFLYASNRGHDSIAVFSVD  274 (346)
T ss_pred             CcEEEEEeccCCEEEEEEEcCCCceEEEeeeecc-------CccccCCCCceeEEEECCCCCEEEEecCCCCeEEEEEEc
Confidence            445556666 889999999999888776622211       2233333445567999999999865433 4555554443


No 20 
>PF02333 Phytase:  Phytase;  InterPro: IPR003431 Phytase (3.1.3.8 from EC) (phytate 3-phosphatase) is a secreted enzyme which hydrolyses phytate to release inorganic phosphate. This family appears to represent a novel enzyme that shows phytase activity () and has been shown to consist of a single structural unit with a six-bladed propeller folding architecture ().; GO: 0016158 3-phytase activity; PDB: 3AMS_A 3AMR_A 1QLG_A 2POO_A 1H6L_A 1CVM_A 1POO_A.
Probab=69.48  E-value=73  Score=37.06  Aligned_cols=61  Identities=26%  Similarity=0.436  Sum_probs=38.1

Q ss_pred             CeEEEEEecCCCceEeee-ccc-ccccCCCceeEEEEeecCCCCcceecccccccccCccccccCCCCCCCCCCCcEEEE
Q 001853          678 GSIRLLVGDPSTCTVSVQ-TPA-AIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSV  755 (1004)
Q Consensus       678 g~I~~l~~d~~~~~l~~~-~~~-~~~~~~~~i~~~~l~~d~~g~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~  755 (1004)
                      .+|.+|.+++.+..|... .+. .+...-..+-.+|||++...                               ..+++|
T Consensus       127 n~l~~f~id~~~g~L~~v~~~~~p~~~~~~e~yGlcly~~~~~-------------------------------g~~ya~  175 (381)
T PF02333_consen  127 NSLRLFRIDPDTGELTDVTDPAAPIATDLSEPYGLCLYRSPST-------------------------------GALYAF  175 (381)
T ss_dssp             -EEEEEEEETTTTEEEE-CBTTC-EE-SSSSEEEEEEEE-TTT---------------------------------EEEE
T ss_pred             CeEEEEEecCCCCcceEcCCCCcccccccccceeeEEeecCCC-------------------------------CcEEEE
Confidence            579999999865556532 211 11111233678999987521                               358999


Q ss_pred             EEecCCeEEEEEcC
Q 001853          756 VCYESGALEIFDVP  769 (1004)
Q Consensus       756 ~~~~~g~l~I~sLp  769 (1004)
                      +.+++|.++-|.|-
T Consensus       176 v~~k~G~~~Qy~L~  189 (381)
T PF02333_consen  176 VNGKDGRVEQYELT  189 (381)
T ss_dssp             EEETTSEEEEEEEE
T ss_pred             EecCCceEEEEEEE
Confidence            99999999988874


No 21 
>PF03178 CPSF_A:  CPSF A subunit region;  InterPro: IPR004871 This family includes a region that lies towards the C terminus of the cleavage and polyadenylation specificity factor (CPSF) A (160 kDa) subunit. CPSF is involved in mRNA polyadenylation and binds the AAUAAA conserved sequence in pre-mRNA. CPSF has also been found to be necessary for splicing of single-intron pre-mRNAs []. The function of the aligned region is unknown but may be involved in RNA/DNA binding.; GO: 0003676 nucleic acid binding, 0005634 nucleus; PDB: 2B5M_A 4A0K_C 4A0B_C 3I7L_A 3I8E_A 4A09_A 4A0A_A 3EI4_C 2B5L_A 3I7O_A ....
Probab=68.12  E-value=2.1e+02  Score=32.18  Aligned_cols=63  Identities=19%  Similarity=0.336  Sum_probs=45.7

Q ss_pred             CEEEEEEEEecccccccccCCccccccccccccccccEEEEEEEEeeeeeeEeEEEecCCCCCCCCccEEEEEECCCeEE
Q 001853           64 NVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKIS  143 (1004)
Q Consensus        64 n~LeIy~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~kL~lv~e~~l~G~I~~l~~vr~~~s~~~~~~D~Llv~~~~akls  143 (1004)
                      .+|-+|++...+.+                    ..+|+++++..+.|.|++|+.++          +.|+++. ..++.
T Consensus        62 Gri~v~~i~~~~~~--------------------~~~l~~i~~~~~~g~V~ai~~~~----------~~lv~~~-g~~l~  110 (321)
T PF03178_consen   62 GRILVFEISESPEN--------------------NFKLKLIHSTEVKGPVTAICSFN----------GRLVVAV-GNKLY  110 (321)
T ss_dssp             EEEEEEEECSS-------------------------EEEEEEEEEESS-EEEEEEET----------TEEEEEE-TTEEE
T ss_pred             cEEEEEEEEccccc--------------------ceEEEEEEEEeecCcceEhhhhC----------CEEEEee-cCEEE
Confidence            67889998753110                    12799999999999999999982          3566655 59999


Q ss_pred             EEEEeCCCCcEEEEE
Q 001853          144 VLEFDDSIHGLRITS  158 (1004)
Q Consensus       144 ile~d~~~~~l~TvS  158 (1004)
                      +.+|+... .|...+
T Consensus       111 v~~l~~~~-~l~~~~  124 (321)
T PF03178_consen  111 VYDLDNSK-TLLKKA  124 (321)
T ss_dssp             EEEEETTS-SEEEEE
T ss_pred             EEEccCcc-cchhhh
Confidence            99999876 566554


No 22 
>KOG0310 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=67.68  E-value=2.6e+02  Score=33.13  Aligned_cols=101  Identities=16%  Similarity=0.159  Sum_probs=65.2

Q ss_pred             cCcEEEEeCCcceeEEeCCCCCCCCCCCCCCccEEEEEEcCC-EEEEEEeCCeEEEEEecCCCceEeeecccccccCCCc
Q 001853          628 ERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADP-YVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKP  706 (1004)
Q Consensus       628 ~~~vrl~~~~~~~q~~~~~~~~~e~g~~~~~~~I~~As~~dp-yvll~~~~g~I~~l~~d~~~~~l~~~~~~~~~~~~~~  706 (1004)
                      ...||++|..... .|-...   +.|.     .|.....-.+ .+++...++++.+|.+...+.++.....   +  ...
T Consensus       175 Dg~vrl~DtR~~~-~~v~el---nhg~-----pVe~vl~lpsgs~iasAgGn~vkVWDl~~G~qll~~~~~---H--~Kt  240 (487)
T KOG0310|consen  175 DGKVRLWDTRSLT-SRVVEL---NHGC-----PVESVLALPSGSLIASAGGNSVKVWDLTTGGQLLTSMFN---H--NKT  240 (487)
T ss_pred             CceEEEEEeccCC-ceeEEe---cCCC-----ceeeEEEcCCCCEEEEcCCCeEEEEEecCCceehhhhhc---c--cce
Confidence            4568998865431 222210   2333     3666666555 6777778999999999876654432211   2  456


Q ss_pred             eeEEEEeecCCCCcceecccccccccCccccccCCCCCCCCCCCcEEEEEEecCCeEEEEEcCCCeEEEEe
Q 001853          707 VSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTV  777 (1004)
Q Consensus       707 i~~~~l~~d~~g~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~l~I~sLp~~~~v~~~  777 (1004)
                      ++|++++.|.+                                   -|+-.-=+|.++||++-+++.+|..
T Consensus       241 VTcL~l~s~~~-----------------------------------rLlS~sLD~~VKVfd~t~~Kvv~s~  276 (487)
T KOG0310|consen  241 VTCLRLASDST-----------------------------------RLLSGSLDRHVKVFDTTNYKVVHSW  276 (487)
T ss_pred             EEEEEeecCCc-----------------------------------eEeecccccceEEEEccceEEEEee
Confidence            99888865432                                   3444455899999999999999884


No 23 
>PF10282 Lactonase:  Lactonase, 7-bladed beta-propeller;  InterPro: IPR019405  6-phosphogluconolactonases (6PGL) 3.1.1.31 from EC, which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate, is one of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE IPR022528 from INTERPRO [] and the Pseudomonas aeruginosa DevB IPR005900 from INTERPRO [] types.  This entry contains bacterial 6-phosphogluconolactonases (6PGL) YbhE-type 3.1.1.31 from EC which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase 5.5.1.5 from EC and muconate cycloisomerase 5.5.1.1 from EC, which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold [].; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H.
Probab=67.21  E-value=1.2e+02  Score=34.71  Aligned_cols=89  Identities=21%  Similarity=0.204  Sum_probs=64.5

Q ss_pred             cEEEEEEEEeeeeeeEeEEEecCCCCCCCCccEEEEEEC----CCeEEEEEEeCCCCcEEEEEeeeecCcchhcccCCcc
Q 001853          100 SLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFE----DAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRE  175 (1004)
Q Consensus       100 kL~lv~e~~l~G~I~~l~~vr~~~s~~~~~~D~Llv~~~----~aklsile~d~~~~~l~TvSlh~~E~~~~~~~k~g~~  175 (1004)
                      +|.++.....-+...-|+.    +.    ....|.++.+    .+.++.+.+++++..|.-++-...         .|. 
T Consensus        26 ~l~~~~~~~~~~~Ps~l~~----~~----~~~~LY~~~e~~~~~g~v~~~~i~~~~g~L~~~~~~~~---------~g~-   87 (345)
T PF10282_consen   26 TLTLVQTVAEGENPSWLAV----SP----DGRRLYVVNEGSGDSGGVSSYRIDPDTGTLTLLNSVPS---------GGS-   87 (345)
T ss_dssp             EEEEEEEEEESSSECCEEE-----T----TSSEEEEEETTSSTTTEEEEEEEETTTTEEEEEEEEEE---------SSS-
T ss_pred             CceEeeeecCCCCCceEEE----Ee----CCCEEEEEEccccCCCCEEEEEECCCcceeEEeeeecc---------CCC-
Confidence            5888888665555566554    22    5678888887    479999999999887876642221         121 


Q ss_pred             cccCCCeEEECCCCCEEEEEEe-cCeEEEEEcccC
Q 001853          176 SFARGPLVKVDPQGRCGGVLVY-GLQMIILKASQG  209 (1004)
Q Consensus       176 ~~~~~~~l~VDP~~Rca~l~~~-~~~L~ilP~~~~  209 (1004)
                         .+-++.+||++|.+.+.-| .+.+.++++...
T Consensus        88 ---~p~~i~~~~~g~~l~vany~~g~v~v~~l~~~  119 (345)
T PF10282_consen   88 ---SPCHIAVDPDGRFLYVANYGGGSVSVFPLDDD  119 (345)
T ss_dssp             ---CEEEEEECTTSSEEEEEETTTTEEEEEEECTT
T ss_pred             ---CcEEEEEecCCCEEEEEEccCCeEEEEEccCC
Confidence               1347999999999999998 899999998654


No 24 
>KOG2048 consensus WD40 repeat protein [General function prediction only]
Probab=67.11  E-value=2.8e+02  Score=34.25  Aligned_cols=29  Identities=10%  Similarity=0.061  Sum_probs=22.0

Q ss_pred             CCeEEECCCCCEEEEEEecCeEEEEEccc
Q 001853          180 GPLVKVDPQGRCGGVLVYGLQMIILKASQ  208 (1004)
Q Consensus       180 ~~~l~VDP~~Rca~l~~~~~~L~ilP~~~  208 (1004)
                      ...+.+-|.|..+|..--.+.+-++-+.+
T Consensus       478 I~~l~~SsdG~yiaa~~t~g~I~v~nl~~  506 (691)
T KOG2048|consen  478 ISRLVVSSDGNYIAAISTRGQIFVYNLET  506 (691)
T ss_pred             ceeEEEcCCCCEEEEEeccceEEEEEccc
Confidence            44688999999998887777777766643


No 25 
>KOG0318 consensus WD40 repeat stress protein/actin interacting protein [Cytoskeleton]
Probab=66.46  E-value=2.8e+02  Score=33.27  Aligned_cols=118  Identities=12%  Similarity=0.135  Sum_probs=77.3

Q ss_pred             cccCCeEEEEEeCCCcEEEEEecCcEEEEeCCcceeEEeCCCCCCCCCCCCCCccEEEEEEcCCEEEEEEeCCeEEEEEe
Q 001853          606 FVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVG  685 (1004)
Q Consensus       606 ~~~~~TI~ag~l~~~~~IvQVt~~~vrl~~~~~~~q~~~~~~~~~e~g~~~~~~~I~~As~~dpyvll~~~~g~I~~l~~  685 (1004)
                      ....+-+.++...++...|-+|-.+|.++...+.....++.-          ....++.+-...+++|.-.|+.|.+|.+
T Consensus       403 ~lg~QP~~lav~~d~~~avv~~~~~iv~l~~~~~~~~~~~~y----------~~s~vAv~~~~~~vaVGG~Dgkvhvysl  472 (603)
T KOG0318|consen  403 KLGSQPKGLAVLSDGGTAVVACISDIVLLQDQTKVSSIPIGY----------ESSAVAVSPDGSEVAVGGQDGKVHVYSL  472 (603)
T ss_pred             ecCCCceeEEEcCCCCEEEEEecCcEEEEecCCcceeecccc----------ccceEEEcCCCCEEEEecccceEEEEEe
Confidence            344455566666676689999999999998665554554420          1235555666899999999999999999


Q ss_pred             cCCCceEeeecccccccCCCceeEEEEeecCCCCcceecccccccccCccccccCCCCCCCCCCCcEEEEEEecCCeEEE
Q 001853          686 DPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEI  765 (1004)
Q Consensus       686 d~~~~~l~~~~~~~~~~~~~~i~~~~l~~d~~g~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~l~I  765 (1004)
                      .....+-+....+-    ...|+.++.  .+                                 ...|++.+..++.+-+
T Consensus       473 ~g~~l~ee~~~~~h----~a~iT~vay--Sp---------------------------------d~~yla~~Da~rkvv~  513 (603)
T KOG0318|consen  473 SGDELKEEAKLLEH----RAAITDVAY--SP---------------------------------DGAYLAAGDASRKVVL  513 (603)
T ss_pred             cCCcccceeeeecc----cCCceEEEE--CC---------------------------------CCcEEEEeccCCcEEE
Confidence            76543222221111    334554332  11                                 1358899999999999


Q ss_pred             EEcCCCe
Q 001853          766 FDVPNFN  772 (1004)
Q Consensus       766 ~sLp~~~  772 (1004)
                      |++.+-+
T Consensus       514 yd~~s~~  520 (603)
T KOG0318|consen  514 YDVASRE  520 (603)
T ss_pred             EEcccCc
Confidence            9987644


No 26 
>KOG2110 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=63.36  E-value=2.8e+02  Score=31.90  Aligned_cols=156  Identities=18%  Similarity=0.238  Sum_probs=90.0

Q ss_pred             eEEEEEeCCCcEEEEEecCcEEEEeCCcceeEEeCCCCCCCCCCCCCCccEEEEEEcC--CEEEE--EEeCCeEEEEEec
Q 001853          611 TIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIAD--PYVLL--GMSDGSIRLLVGD  686 (1004)
Q Consensus       611 TI~ag~l~~~~~IvQVt~~~vrl~~~~~~~q~~~~~~~~~e~g~~~~~~~I~~As~~d--pyvll--~~~~g~I~~l~~d  686 (1004)
                      .|.+-.|. ++|+|-+.+..|-++|-..+.---.+..| ++.     .....+-|.+.  .|++.  .+..|+|++|...
T Consensus        89 ~IL~VrmN-r~RLvV~Lee~IyIydI~~MklLhTI~t~-~~n-----~~gl~AlS~n~~n~ylAyp~s~t~GdV~l~d~~  161 (391)
T KOG2110|consen   89 SILAVRMN-RKRLVVCLEESIYIYDIKDMKLLHTIETT-PPN-----PKGLCALSPNNANCYLAYPGSTTSGDVVLFDTI  161 (391)
T ss_pred             ceEEEEEc-cceEEEEEcccEEEEecccceeehhhhcc-CCC-----ccceEeeccCCCCceEEecCCCCCceEEEEEcc
Confidence            46666774 45899999999999997643211112111 011     12255555554  47776  4578999999876


Q ss_pred             CCCceEeeecccccccCCCceeEEEEeecCCCCcceecccccccccCccccccCCCCCCCCCCCcEEEEEEecCCe-EEE
Q 001853          687 PSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGA-LEI  765 (1004)
Q Consensus       687 ~~~~~l~~~~~~~~~~~~~~i~~~~l~~d~~g~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~-l~I  765 (1004)
                      +-..   +..... +  ++.+.|+.+  ..+|                                 .+++.+.+.|+ +.+
T Consensus       162 nl~~---v~~I~a-H--~~~lAalaf--s~~G---------------------------------~llATASeKGTVIRV  200 (391)
T KOG2110|consen  162 NLQP---VNTINA-H--KGPLAALAF--SPDG---------------------------------TLLATASEKGTVIRV  200 (391)
T ss_pred             ccee---eeEEEe-c--CCceeEEEE--CCCC---------------------------------CEEEEeccCceEEEE
Confidence            5311   111111 1  455664333  3333                                 35556666665 478


Q ss_pred             EEcCCCeEEEEecCcCccccccccccccccccccchhccCCCccccCCCCcccccccceEEEEEeecCCCCCCcEEEEEe
Q 001853          766 FDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAIL  845 (1004)
Q Consensus       766 ~sLp~~~~v~~~~~~~~~~~~L~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~eill~~lg~~~~~p~L~v~~  845 (1004)
                      |+.|+=+.+|+..                                      +......|-+|.   |+++  .++|.+.-
T Consensus       201 f~v~~G~kl~eFR--------------------------------------RG~~~~~IySL~---Fs~d--s~~L~~sS  237 (391)
T KOG2110|consen  201 FSVPEGQKLYEFR--------------------------------------RGTYPVSIYSLS---FSPD--SQFLAASS  237 (391)
T ss_pred             EEcCCccEeeeee--------------------------------------CCceeeEEEEEE---ECCC--CCeEEEec
Confidence            8888888888742                                      111112233333   3332  47999999


Q ss_pred             eCCcEEEEEEEe
Q 001853          846 TDGTILCYQAYL  857 (1004)
Q Consensus       846 ~~g~l~iY~~f~  857 (1004)
                      ..++|-+|+.-.
T Consensus       238 ~TeTVHiFKL~~  249 (391)
T KOG2110|consen  238 NTETVHIFKLEK  249 (391)
T ss_pred             CCCeEEEEEecc
Confidence            999999998764


No 27 
>cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto
Probab=61.91  E-value=2e+02  Score=29.76  Aligned_cols=75  Identities=24%  Similarity=0.321  Sum_probs=45.0

Q ss_pred             EEEEEEcCC--EEEEEEeCCeEEEEEecCCCceEeeecccccccCCCceeEEEEeecCCCCcceecccccccccCccccc
Q 001853          661 VLSVSIADP--YVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEA  738 (1004)
Q Consensus       661 I~~As~~dp--yvll~~~~g~I~~l~~d~~~~~l~~~~~~~~~~~~~~i~~~~l~~d~~g~~~f~~~~~~~~~~~~~~~~  738 (1004)
                      |.+..+...  +++++..++.|.+|.+..... +...  . ..  ...+.++++.  .                      
T Consensus       180 i~~~~~~~~~~~l~~~~~~~~i~i~d~~~~~~-~~~~--~-~~--~~~i~~~~~~--~----------------------  229 (289)
T cd00200         180 VNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKC-LGTL--R-GH--ENGVNSVAFS--P----------------------  229 (289)
T ss_pred             cceEEECCCcCEEEEecCCCcEEEEECCCCce-ecch--h-hc--CCceEEEEEc--C----------------------
Confidence            666666543  788887899999998865322 1110  0 11  2235543331  1                      


Q ss_pred             cCCCCCCCCCCCcEEEEEEecCCeEEEEEcCCCeEEEE
Q 001853          739 IDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFT  776 (1004)
Q Consensus       739 ~~~~~~~~~~~~~~~l~~~~~~g~l~I~sLp~~~~v~~  776 (1004)
                                 ...+++.+..+|.+.+|.+...+++..
T Consensus       230 -----------~~~~~~~~~~~~~i~i~~~~~~~~~~~  256 (289)
T cd00200         230 -----------DGYLLASGSEDGTIRVWDLRTGECVQT  256 (289)
T ss_pred             -----------CCcEEEEEcCCCcEEEEEcCCceeEEE
Confidence                       134666777799999999887665544


No 28 
>KOG0289 consensus mRNA splicing factor [General function prediction only]
Probab=61.22  E-value=3.3e+02  Score=32.02  Aligned_cols=113  Identities=19%  Similarity=0.217  Sum_probs=69.2

Q ss_pred             ccEEEEEEcCCEEEEEEeCCeEEEEEecCCCceEeeecccccccCCCceeEEEEeecCCCCcceecccccccccCccccc
Q 001853          659 STVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEA  738 (1004)
Q Consensus       659 ~~I~~As~~dpyvll~~~~g~I~~l~~d~~~~~l~~~~~~~~~~~~~~i~~~~l~~d~~g~~~f~~~~~~~~~~~~~~~~  738 (1004)
                      ..-.++-.++.|++=+..|+...+..+........+....  .  +-.++++-++.|                       
T Consensus       306 V~~ls~h~tgeYllsAs~d~~w~Fsd~~~g~~lt~vs~~~--s--~v~~ts~~fHpD-----------------------  358 (506)
T KOG0289|consen  306 VTGLSLHPTGEYLLSASNDGTWAFSDISSGSQLTVVSDET--S--DVEYTSAAFHPD-----------------------  358 (506)
T ss_pred             ceeeeeccCCcEEEEecCCceEEEEEccCCcEEEEEeecc--c--cceeEEeeEcCC-----------------------
Confidence            3455666779999988888888877776655433332210  1  334665555433                       


Q ss_pred             cCCCCCCCCCCCcEEEEEEecCCeEEEEEcCCCeEEEEecCcCccccccccccccccccccchhccCCCccccCCCCccc
Q 001853          739 IDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKEN  818 (1004)
Q Consensus       739 ~~~~~~~~~~~~~~~l~~~~~~g~l~I~sLp~~~~v~~~~~~~~~~~~L~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  818 (1004)
                                  ...+.....+|.|.||.|.+-.-   +..|+   .                                 
T Consensus       359 ------------gLifgtgt~d~~vkiwdlks~~~---~a~Fp---g---------------------------------  387 (506)
T KOG0289|consen  359 ------------GLIFGTGTPDGVVKIWDLKSQTN---VAKFP---G---------------------------------  387 (506)
T ss_pred             ------------ceEEeccCCCceEEEEEcCCccc---cccCC---C---------------------------------
Confidence                        34555667899999999986441   12222   1                                 


Q ss_pred             ccccceEEEEEeecCCCCCCcEEEEEeeCCcEEEEEE
Q 001853          819 IHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQA  855 (1004)
Q Consensus       819 ~~~~~i~eill~~lg~~~~~p~L~v~~~~g~l~iY~~  855 (1004)
                       ....|++|.+..=|     -||.+...||.|.++..
T Consensus       388 -ht~~vk~i~FsENG-----Y~Lat~add~~V~lwDL  418 (506)
T KOG0289|consen  388 -HTGPVKAISFSENG-----YWLATAADDGSVKLWDL  418 (506)
T ss_pred             -CCCceeEEEeccCc-----eEEEEEecCCeEEEEEe
Confidence             12246777665444     57888777777887744


No 29 
>KOG2048 consensus WD40 repeat protein [General function prediction only]
Probab=57.51  E-value=4.6e+02  Score=32.50  Aligned_cols=97  Identities=20%  Similarity=0.155  Sum_probs=60.9

Q ss_pred             CeEEEEcCC-EEEEEEEEecccccccccCCccccccccccccccccEEEEEEEEeeeeeeEeEEEecCCCCCCCCccEEE
Q 001853           56 PNLVVTAAN-VIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSII  134 (1004)
Q Consensus        56 ~nLVvak~n-~LeIy~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~kL~lv~e~~l~G~I~~l~~vr~~~s~~~~~~D~Ll  134 (1004)
                      .+|-|++++ .||||.+..+=                        -++.+..-+-.+.|.+|+=.   .+      -.| 
T Consensus        38 ~~lAvsRt~g~IEiwN~~~~w------------------------~~~~vi~g~~drsIE~L~W~---e~------~RL-   83 (691)
T KOG2048|consen   38 NQLAVSRTDGNIEIWNLSNNW------------------------FLEPVIHGPEDRSIESLAWA---EG------GRL-   83 (691)
T ss_pred             CceeeeccCCcEEEEccCCCc------------------------eeeEEEecCCCCceeeEEEc---cC------CeE-
Confidence            679999865 89999987531                        36666666666777777644   11      122 


Q ss_pred             EEECCCeEEEEEEeCCCCcEEEEEeeeecCcchhcccCCcccccCCC--eEEECCCCCEEEEEEecCeEEEE
Q 001853          135 LAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGP--LVKVDPQGRCGGVLVYGLQMIIL  204 (1004)
Q Consensus       135 v~~~~aklsile~d~~~~~l~TvSlh~~E~~~~~~~k~g~~~~~~~~--~l~VDP~~Rca~l~~~~~~L~il  204 (1004)
                       -+-.+.=+|.|||..+.+-.-.  |.   .      .|      ++  -+.+.|.+.-+++.+-++.|.++
T Consensus        84 -FS~g~sg~i~EwDl~~lk~~~~--~d---~------~g------g~IWsiai~p~~~~l~IgcddGvl~~~  137 (691)
T KOG2048|consen   84 -FSSGLSGSITEWDLHTLKQKYN--ID---S------NG------GAIWSIAINPENTILAIGCDDGVLYDF  137 (691)
T ss_pred             -EeecCCceEEEEecccCceeEE--ec---C------CC------cceeEEEeCCccceEEeecCCceEEEE
Confidence             2335666789999865433211  11   0      11      22  27888999888888777855443


No 30 
>KOG2111 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=56.90  E-value=99  Score=34.74  Aligned_cols=22  Identities=23%  Similarity=0.480  Sum_probs=19.6

Q ss_pred             CCcEEEEEEecCCeEEEEEcCC
Q 001853          749 QGDIYSVVCYESGALEIFDVPN  770 (1004)
Q Consensus       749 ~~~~~l~~~~~~g~l~I~sLp~  770 (1004)
                      ...-||++..+.|+|+||+|.+
T Consensus       236 p~~s~LavsSdKgTlHiF~l~~  257 (346)
T KOG2111|consen  236 PNSSWLAVSSDKGTLHIFSLRD  257 (346)
T ss_pred             CCccEEEEEcCCCeEEEEEeec
Confidence            3467999999999999999987


No 31 
>KOG2110 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=54.55  E-value=3.9e+02  Score=30.79  Aligned_cols=28  Identities=18%  Similarity=0.202  Sum_probs=21.4

Q ss_pred             eecCCCCCCcEEEEEeeCCcEEEEEEEe
Q 001853          830 QRWSAHHSRPFLFAILTDGTILCYQAYL  857 (1004)
Q Consensus       830 ~~lg~~~~~p~L~v~~~~g~l~iY~~f~  857 (1004)
                      +-|+.....|++.|+..||.+.+|+.-.
T Consensus       304 ~~l~~~~~~~~v~vas~dG~~y~y~l~~  331 (391)
T KOG2110|consen  304 CSLSSIQKIPRVLVASYDGHLYSYRLPP  331 (391)
T ss_pred             EEeeccCCCCEEEEEEcCCeEEEEEcCC
Confidence            3344444579999999999999997653


No 32 
>KOG2055 consensus WD40 repeat protein [General function prediction only]
Probab=54.10  E-value=51  Score=38.55  Aligned_cols=91  Identities=18%  Similarity=0.304  Sum_probs=59.8

Q ss_pred             EEEeecCCCCCCcEEEEEeeCCcEEEEEEEeecCCCCCCCCCCCCcccccccccccccccccceeEEeccCCcCCCCCCC
Q 001853          827 LAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETP  906 (1004)
Q Consensus       827 ill~~lg~~~~~p~L~v~~~~g~l~iY~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lrf~Kv~~~~~~~~~~~  906 (1004)
                      |..+.|.+  .+|.|+++--||.|-+|+.--....                        +..+++|+|.|..        
T Consensus       216 I~sv~FHp--~~plllvaG~d~~lrifqvDGk~N~------------------------~lqS~~l~~fPi~--------  261 (514)
T KOG2055|consen  216 ITSVQFHP--TAPLLLVAGLDGTLRIFQVDGKVNP------------------------KLQSIHLEKFPIQ--------  261 (514)
T ss_pred             ceEEEecC--CCceEEEecCCCcEEEEEecCccCh------------------------hheeeeeccCccc--------
Confidence            44455544  3799999999999999977522111                        2356888887743        


Q ss_pred             CCCCccceEEecccCCceEEEecCCCCeEE-E--EcccccEEEeccCCCceEEEecC
Q 001853          907 HGAPCQRITIFKNISGHQGFFLSGSRPCWC-M--VFRERLRVHPQLCDGSIVAFTVL  960 (1004)
Q Consensus       907 ~~~~~~~l~~f~~i~g~sgVFv~G~~P~~i-~--~~~~~l~~~~~~~~~~v~~f~~F  960 (1004)
                          ...|.    -+|.+-||.+|.++++- +  -......++|+.+. +=.+|-.|
T Consensus       262 ----~a~f~----p~G~~~i~~s~rrky~ysyDle~ak~~k~~~~~g~-e~~~~e~F  309 (514)
T KOG2055|consen  262 ----KAEFA----PNGHSVIFTSGRRKYLYSYDLETAKVTKLKPPYGV-EEKSMERF  309 (514)
T ss_pred             ----eeeec----CCCceEEEecccceEEEEeeccccccccccCCCCc-ccchhhee
Confidence                22222    27899999999999987 3  44566667777655 33344444


No 33 
>PF07569 Hira:  TUP1-like enhancer of split;  InterPro: IPR011494 The Hira proteins are found in a range of eukaryotes and are implicated in the assembly of repressive chromatin. These proteins also contain IPR001680 from INTERPRO.; GO: 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=51.54  E-value=54  Score=35.16  Aligned_cols=73  Identities=14%  Similarity=0.244  Sum_probs=46.6

Q ss_pred             cEEEEEEecCCeEEEEEcCCCeEEEEecCcCccccccccccccccccccchhccCCCccccCCCCcccccccceEEEEEe
Q 001853          751 DIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQ  830 (1004)
Q Consensus       751 ~~~l~~~~~~g~l~I~sLp~~~~v~~~~~~~~~~~~L~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~eill~  830 (1004)
                      ..||.+.+.+|.+.+|.++..++++.-.++.  | +|..    ....                   .....+.|+.+.|.
T Consensus        22 ~~~Ll~iT~~G~l~vWnl~~~k~~~~~~Si~--p-ll~~----~~~~-------------------~~~~~~~i~~~~lt   75 (219)
T PF07569_consen   22 GSYLLAITSSGLLYVWNLKKGKAVLPPVSIA--P-LLNS----SPVS-------------------DKSSSPNITSCSLT   75 (219)
T ss_pred             CCEEEEEeCCCeEEEEECCCCeeccCCccHH--H-Hhcc----cccc-------------------cCCCCCcEEEEEEc
Confidence            4579999999999999999999988743332  3 4321    1100                   00234556666666


Q ss_pred             ecCCCCCCcEEEEEeeCCcEEEEEE
Q 001853          831 RWSAHHSRPFLFAILTDGTILCYQA  855 (1004)
Q Consensus       831 ~lg~~~~~p~L~v~~~~g~l~iY~~  855 (1004)
                      .=|    .|.  |.+.+|+..+|..
T Consensus        76 ~~G----~Pi--V~lsng~~y~y~~   94 (219)
T PF07569_consen   76 SNG----VPI--VTLSNGDSYSYSP   94 (219)
T ss_pred             CCC----CEE--EEEeCCCEEEecc
Confidence            333    464  4578899888854


No 34 
>KOG0285 consensus Pleiotropic regulator 1 [RNA processing and modification]
Probab=50.22  E-value=2.3e+02  Score=32.49  Aligned_cols=123  Identities=20%  Similarity=0.235  Sum_probs=66.9

Q ss_pred             cEEEEeCCcceeEEeCCCCCCCCCCCCCCccEEEEEEc--CCEEEEEEeCCeEEEEEecCCCceEeeecccccccCCCce
Q 001853          630 GARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA--DPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPV  707 (1004)
Q Consensus       630 ~vrl~~~~~~~q~~~~~~~~~e~g~~~~~~~I~~As~~--dpyvll~~~~g~I~~l~~d~~~~~l~~~~~~~~~~~~~~i  707 (1004)
                      -+|+.|-....+...+.      |   ....|...-++  ||+|+-+..|++|.++.+-.+.....+...      +..+
T Consensus       258 t~RvWDiRtr~~V~~l~------G---H~~~V~~V~~~~~dpqvit~S~D~tvrlWDl~agkt~~tlt~h------kksv  322 (460)
T KOG0285|consen  258 TIRVWDIRTRASVHVLS------G---HTNPVASVMCQPTDPQVITGSHDSTVRLWDLRAGKTMITLTHH------KKSV  322 (460)
T ss_pred             eEEEeeecccceEEEec------C---CCCcceeEEeecCCCceEEecCCceEEEeeeccCceeEeeecc------ccee
Confidence            46666655444444442      2   11235555555  999999999999999998776543433322      4457


Q ss_pred             eEEEEeecCCCCcceecccccccc----cCccc-cccCCCCC--CC-CCCCcEEEEEEecCCeEEEEEcCC
Q 001853          708 SSCTLYHDKGPEPWLRKTSTDAWL----STGVG-EAIDGADG--GP-LDQGDIYSVVCYESGALEIFDVPN  770 (1004)
Q Consensus       708 ~~~~l~~d~~g~~~f~~~~~~~~~----~~~~~-~~~~~~~~--~~-~~~~~~~l~~~~~~g~l~I~sLp~  770 (1004)
                      .|.||+-..+   .|....++.-.    +.+.. .+....+.  .. ....+-++|..-++|.|..|.-.+
T Consensus       323 ral~lhP~e~---~fASas~dnik~w~~p~g~f~~nlsgh~~iintl~~nsD~v~~~G~dng~~~fwdwks  390 (460)
T KOG0285|consen  323 RALCLHPKEN---LFASASPDNIKQWKLPEGEFLQNLSGHNAIINTLSVNSDGVLVSGGDNGSIMFWDWKS  390 (460)
T ss_pred             eEEecCCchh---hhhccCCccceeccCCccchhhccccccceeeeeeeccCceEEEcCCceEEEEEecCc
Confidence            7888875443   55444332110    11110 01110000  00 012345778888899999988544


No 35 
>KOG0306 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=49.68  E-value=5.1e+02  Score=32.61  Aligned_cols=122  Identities=18%  Similarity=0.175  Sum_probs=72.0

Q ss_pred             cEEEEEEc--CCEEEEEEeCCeEEEEEecCCCceEeeecccccccCCCceeEEEEeecCCCCcceecccccccccCcccc
Q 001853          660 TVLSVSIA--DPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGE  737 (1004)
Q Consensus       660 ~I~~As~~--dpyvll~~~~g~I~~l~~d~~~~~l~~~~~~~~~~~~~~i~~~~l~~d~~g~~~f~~~~~~~~~~~~~~~  737 (1004)
                      .|.++.++  |.||++...+|.+.+|.+-.... +|.  .++.   +.-|.++++-.|.                     
T Consensus       414 y~l~~~Fvpgd~~Iv~G~k~Gel~vfdlaS~~l-~Et--i~AH---dgaIWsi~~~pD~---------------------  466 (888)
T KOG0306|consen  414 YILASKFVPGDRYIVLGTKNGELQVFDLASASL-VET--IRAH---DGAIWSISLSPDN---------------------  466 (888)
T ss_pred             cEEEEEecCCCceEEEeccCCceEEEEeehhhh-hhh--hhcc---ccceeeeeecCCC---------------------
Confidence            35666664  99999999999999999976532 332  1221   4446655553332                     


Q ss_pred             ccCCCCCCCCCCCcEEEEEEecCCeEEEEEcCCCeEEEEecCcCccc-cccccccccccccccchhccCCCccccCCCCc
Q 001853          738 AIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGR-THIVDTYMREALKDSETEINSSSEEGTGQGRK  816 (1004)
Q Consensus       738 ~~~~~~~~~~~~~~~~l~~~~~~g~l~I~sLp~~~~v~~~~~~~~~~-~~L~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  816 (1004)
                                    -.++....+-++.+|++   +++..+   +.-+ .+|.         .                 +
T Consensus       467 --------------~g~vT~saDktVkfWdf---~l~~~~---~gt~~k~ls---------l-----------------~  500 (888)
T KOG0306|consen  467 --------------KGFVTGSADKTVKFWDF---KLVVSV---PGTQKKVLS---------L-----------------K  500 (888)
T ss_pred             --------------CceEEecCCcEEEEEeE---EEEecc---Ccccceeee---------e-----------------c
Confidence                          24566677888899874   555542   1111 1110         0                 0


Q ss_pred             ccccccceEEEEEeecCCCCCCcEEEEEeeCCcEEEEEEE
Q 001853          817 ENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAY  856 (1004)
Q Consensus       817 ~~~~~~~i~eill~~lg~~~~~p~L~v~~~~g~l~iY~~f  856 (1004)
                      ....-+.--+|+-+.+-++  .-||.|.+-|.++-+|-.-
T Consensus       501 ~~rtLel~ddvL~v~~Spd--gk~LaVsLLdnTVkVyflD  538 (888)
T KOG0306|consen  501 HTRTLELEDDVLCVSVSPD--GKLLAVSLLDNTVKVYFLD  538 (888)
T ss_pred             cceEEeccccEEEEEEcCC--CcEEEEEeccCeEEEEEec
Confidence            0001111235666666554  4799999999999999543


No 36 
>KOG0772 consensus Uncharacterized conserved protein, contains WD40 repeat [Function unknown]
Probab=48.82  E-value=1.7e+02  Score=34.96  Aligned_cols=108  Identities=19%  Similarity=0.239  Sum_probs=56.7

Q ss_pred             ccccEEEEEecCceEEEE-ecCceeEEecCCCcc-----ccC--CeEEEEEeCC---CcEEEEEecCcEEEEeCC---cc
Q 001853          574 EYHAYLIISLEARTMVLE-TADLLTEVTESVDYF-----VQG--RTIAAGNLFG---RRRVIQVFERGARILDGS---YM  639 (1004)
Q Consensus       574 ~~~~yLilS~~~~T~Vl~-~~~~l~ev~~~~~F~-----~~~--~TI~ag~l~~---~~~IvQVt~~~vrl~~~~---~~  639 (1004)
                      .-+.+|++|....-.||- -|-++.|.--...++     +.+  .+|.+|...-   +.++---....+|+.+.+   .+
T Consensus       225 Tg~~iLvvsg~aqakl~DRdG~~~~e~~KGDQYI~Dm~nTKGHia~lt~g~whP~~k~~FlT~s~DgtlRiWdv~~~k~q  304 (641)
T KOG0772|consen  225 TGDQILVVSGSAQAKLLDRDGFEIVEFSKGDQYIRDMYNTKGHIAELTCGCWHPDNKEEFLTCSYDGTLRIWDVNNTKSQ  304 (641)
T ss_pred             CCCeEEEEecCcceeEEccCCceeeeeeccchhhhhhhccCCceeeeeccccccCcccceEEecCCCcEEEEecCCchhh
Confidence            346788888887777774 344555543122222     112  2444444321   111111223457777755   24


Q ss_pred             eeEEeCCCCCCCCCCCCCCccEEEEEEc--CCEEEEEEeCCeEEEEEecC
Q 001853          640 TQDLSFGPSNSESGSGSENSTVLSVSIA--DPYVLLGMSDGSIRLLVGDP  687 (1004)
Q Consensus       640 ~q~~~~~~~~~e~g~~~~~~~I~~As~~--dpyvll~~~~g~I~~l~~d~  687 (1004)
                      .|.+..-    ..|  +....++.|..+  .+.++-++.||+|.+|....
T Consensus       305 ~qVik~k----~~~--g~Rv~~tsC~~nrdg~~iAagc~DGSIQ~W~~~~  348 (641)
T KOG0772|consen  305 LQVIKTK----PAG--GKRVPVTSCAWNRDGKLIAAGCLDGSIQIWDKGS  348 (641)
T ss_pred             eeEEeec----cCC--CcccCceeeecCCCcchhhhcccCCceeeeecCC
Confidence            5555542    112  112223444443  67888889999999999744


No 37 
>KOG1897 consensus Damage-specific DNA binding complex, subunit DDB1 [Replication, recombination and repair]
Probab=48.41  E-value=66  Score=41.14  Aligned_cols=84  Identities=17%  Similarity=0.188  Sum_probs=63.0

Q ss_pred             hhhhccCCCceeeEEEEEEecCCCCCCCCccccccccccccCCCCCCCCCCCeEEEE-----------cCCEEEEEEEEe
Q 001853            5 AYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVT-----------AANVIEIYVVRV   73 (1004)
Q Consensus         5 ~~~~~~~pT~V~hsv~~~Ft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~nLVva-----------k~n~LeIy~v~~   73 (1004)
                      .+.++.++-.+..+++|+|++...                           .-+||+           +..+|-||++.+
T Consensus       764 ~~hef~~~E~~~Si~s~~~~~d~~---------------------------t~~vVGT~~v~Pde~ep~~GRIivfe~~e  816 (1096)
T KOG1897|consen  764 SSHEFERNETALSIISCKFTDDPN---------------------------TYYVVGTGLVYPDENEPVNGRIIVFEFEE  816 (1096)
T ss_pred             eeccccccceeeeeeeeeecCCCc---------------------------eEEEEEEEeeccCCCCcccceEEEEEEec
Confidence            345688888999999999997654                           345543           345788888875


Q ss_pred             cccccccccCCccccccccccccccccEEEEEEEEeeeeeeEeEEEecCCCCCCCCccEEEEEECCCeEEEEEEeCC
Q 001853           74 QEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDS  150 (1004)
Q Consensus        74 ~~~~~~~~~~~~~~~~~~~~~~~~~~kL~lv~e~~l~G~I~~l~~vr~~~s~~~~~~D~Llv~~~~aklsile~d~~  150 (1004)
                      .+                        +|++++|..+-|.+.+|..+.          +.|+.++ ...+.+.+|-.+
T Consensus       817 ~~------------------------~L~~v~e~~v~Gav~aL~~fn----------gkllA~I-n~~vrLye~t~~  858 (1096)
T KOG1897|consen  817 LN------------------------SLELVAETVVKGAVYALVEFN----------GKLLAGI-NQSVRLYEWTTE  858 (1096)
T ss_pred             CC------------------------ceeeeeeeeeccceeehhhhC----------CeEEEec-CcEEEEEEcccc
Confidence            22                        799999999999999987654          3455444 688999999765


No 38 
>KOG2055 consensus WD40 repeat protein [General function prediction only]
Probab=46.39  E-value=5.7e+02  Score=30.35  Aligned_cols=25  Identities=4%  Similarity=0.198  Sum_probs=19.2

Q ss_pred             EEEEEEecCCeEEEEEcCCCeEEEE
Q 001853          752 IYSVVCYESGALEIFDVPNFNCVFT  776 (1004)
Q Consensus       752 ~~l~~~~~~g~l~I~sLp~~~~v~~  776 (1004)
                      .++++...+|.+.++.-.+.+++-+
T Consensus       316 ~fia~~G~~G~I~lLhakT~eli~s  340 (514)
T KOG2055|consen  316 NFIAIAGNNGHIHLLHAKTKELITS  340 (514)
T ss_pred             CeEEEcccCceEEeehhhhhhhhhe
Confidence            4888899999999988776665443


No 39 
>KOG0772 consensus Uncharacterized conserved protein, contains WD40 repeat [Function unknown]
Probab=46.19  E-value=75  Score=37.78  Aligned_cols=111  Identities=14%  Similarity=0.214  Sum_probs=68.7

Q ss_pred             CCEEEEEEeCCeEEEEEecCCCceEeeecccccccCCCceeEEEEeecCCCCcceecccccccccCccccccCCCCCCCC
Q 001853          668 DPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPL  747 (1004)
Q Consensus       668 dpyvll~~~~g~I~~l~~d~~~~~l~~~~~~~~~~~~~~i~~~~l~~d~~g~~~f~~~~~~~~~~~~~~~~~~~~~~~~~  747 (1004)
                      -..+|-+..||++.+|.++.....+++.++.........++ .|-|.-                                
T Consensus       281 k~~FlT~s~DgtlRiWdv~~~k~q~qVik~k~~~g~Rv~~t-sC~~nr--------------------------------  327 (641)
T KOG0772|consen  281 KEEFLTCSYDGTLRIWDVNNTKSQLQVIKTKPAGGKRVPVT-SCAWNR--------------------------------  327 (641)
T ss_pred             ccceEEecCCCcEEEEecCCchhheeEEeeccCCCcccCce-eeecCC--------------------------------
Confidence            34455566899999999998766677665554332122233 344321                                


Q ss_pred             CCCcEEEEEEecCCeEEEEEcCCCe--EEEEecCcCccccccccccccccccccchhccCCCccccCCCCcccccccceE
Q 001853          748 DQGDIYSVVCYESGALEIFDVPNFN--CVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVV  825 (1004)
Q Consensus       748 ~~~~~~l~~~~~~g~l~I~sLp~~~--~v~~~~~~~~~~~~L~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~  825 (1004)
                        ...|++..+.+|++.||+++.+.  ++|.+.                                     +.|.....|.
T Consensus       328 --dg~~iAagc~DGSIQ~W~~~~~~v~p~~~vk-------------------------------------~AH~~g~~It  368 (641)
T KOG0772|consen  328 --DGKLIAAGCLDGSIQIWDKGSRTVRPVMKVK-------------------------------------DAHLPGQDIT  368 (641)
T ss_pred             --CcchhhhcccCCceeeeecCCcccccceEee-------------------------------------eccCCCCcee
Confidence              12367778889999999998743  233332                                     1222333577


Q ss_pred             EEEEeecCCCCCCcEEEEEeeCCcEEEEEE
Q 001853          826 ELAMQRWSAHHSRPFLFAILTDGTILCYQA  855 (1004)
Q Consensus       826 eill~~lg~~~~~p~L~v~~~~g~l~iY~~  855 (1004)
                      .|.+..-|.     ||+-+-.|+.|-++..
T Consensus       369 si~FS~dg~-----~LlSRg~D~tLKvWDL  393 (641)
T KOG0772|consen  369 SISFSYDGN-----YLLSRGFDDTLKVWDL  393 (641)
T ss_pred             EEEeccccc-----hhhhccCCCceeeeec
Confidence            777776663     5777777777776644


No 40 
>PF08596 Lgl_C:  Lethal giant larvae(Lgl) like, C-terminal;  InterPro: IPR013905  The Lethal giant larvae (Lgl) tumour suppressor protein is conserved from yeast to mammals. The Lgl protein functions in cell polarity, at least in part, by regulating SNARE-mediated membrane delivery events at the cell surface []. The N-terminal half of Lgl members contains WD40 repeats (see IPR001680 from INTERPRO), while the C-terminal half appears specific to the protein []. ; PDB: 2OAJ_A.
Probab=46.17  E-value=1.6e+02  Score=34.51  Aligned_cols=96  Identities=19%  Similarity=0.176  Sum_probs=41.1

Q ss_pred             CCEEEEEEeCCeEEEEEecC-CCceEeeecccccccCCCceeEEEEeecCCCCcceecccccccccCccccccCCCCCCC
Q 001853          668 DPYVLLGMSDGSIRLLVGDP-STCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGP  746 (1004)
Q Consensus       668 dpyvll~~~~g~I~~l~~d~-~~~~l~~~~~~~~~~~~~~i~~~~l~~d~~g~~~f~~~~~~~~~~~~~~~~~~~~~~~~  746 (1004)
                      .+.+++.++.|.+..|.+.+ .+.+..+.........++++.+++.+...+|.+-.  ++....+      ...      
T Consensus       155 Si~L~vGTn~G~v~~fkIlp~~~g~f~v~~~~~~~~~~~~i~~I~~i~~~~G~~a~--At~~~~~------~l~------  220 (395)
T PF08596_consen  155 SICLLVGTNSGNVLTFKILPSSNGRFSVQFAGATTNHDSPILSIIPINADTGESAL--ATISAMQ------GLS------  220 (395)
T ss_dssp             EEEEEEEETTSEEEEEEEEE-GGG-EEEEEEEEE--SS----EEEEEETTT--B-B---BHHHHH------GGG------
T ss_pred             ceEEEEEeCCCCEEEEEEecCCCCceEEEEeeccccCCCceEEEEEEECCCCCccc--CchhHhh------ccc------
Confidence            35677788999999999974 33334433222111115677877777655552100  0000000      000      


Q ss_pred             CCCCcEEEEEEecCCeEEEEEcCCCeEEEEe
Q 001853          747 LDQGDIYSVVCYESGALEIFDVPNFNCVFTV  777 (1004)
Q Consensus       747 ~~~~~~~l~~~~~~g~l~I~sLp~~~~v~~~  777 (1004)
                      ....-..++++..+-.++||++|+.+..++.
T Consensus       221 ~g~~i~g~vVvvSe~~irv~~~~~~k~~~K~  251 (395)
T PF08596_consen  221 KGISIPGYVVVVSESDIRVFKPPKSKGAHKS  251 (395)
T ss_dssp             GT----EEEEEE-SSEEEEE-TT---EEEEE
T ss_pred             cCCCcCcEEEEEcccceEEEeCCCCccccee
Confidence            0111223444555667799999998876653


No 41 
>KOG0319 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=45.48  E-value=3.6e+02  Score=33.72  Aligned_cols=117  Identities=16%  Similarity=0.118  Sum_probs=65.0

Q ss_pred             ccCCeEEEEEeCCCcEEEEEec--CcEEEEeCCcc-eeEEeCCCCCCCCCCCCCCccEEEEE--EcCCEEEEEEeCCeEE
Q 001853          607 VQGRTIAAGNLFGRRRVIQVFE--RGARILDGSYM-TQDLSFGPSNSESGSGSENSTVLSVS--IADPYVLLGMSDGSIR  681 (1004)
Q Consensus       607 ~~~~TI~ag~l~~~~~IvQVt~--~~vrl~~~~~~-~q~~~~~~~~~e~g~~~~~~~I~~As--~~dpyvll~~~~g~I~  681 (1004)
                      .++.-..+.-+|.....+=|-+  .++|+|+-..+ -|.++--           .-.|-+.+  ..+-+++-+..|.++.
T Consensus       322 ~ndEI~Dm~~lG~e~~~laVATNs~~lr~y~~~~~~c~ii~GH-----------~e~vlSL~~~~~g~llat~sKD~svi  390 (775)
T KOG0319|consen  322 YNDEILDMKFLGPEESHLAVATNSPELRLYTLPTSYCQIIPGH-----------TEAVLSLDVWSSGDLLATGSKDKSVI  390 (775)
T ss_pred             CchhheeeeecCCccceEEEEeCCCceEEEecCCCceEEEeCc-----------hhheeeeeecccCcEEEEecCCceEE
Confidence            3444555666664444444443  35999975542 3333211           11244444  3343455556899999


Q ss_pred             EEEecCCCceEeeecccccccCCCceeEEEEeecCCCCcceecccccccccCccccccCCCCCCCCCCCcEEEEEEecCC
Q 001853          682 LLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESG  761 (1004)
Q Consensus       682 ~l~~d~~~~~l~~~~~~~~~~~~~~i~~~~l~~d~~g~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~g  761 (1004)
                      +++++++-.+.........+  ...+.+.++  ...                                .--+++...+++
T Consensus       391 lWr~~~~~~~~~~~a~~~gH--~~svgava~--~~~--------------------------------~asffvsvS~D~  434 (775)
T KOG0319|consen  391 LWRLNNNCSKSLCVAQANGH--TNSVGAVAG--SKL--------------------------------GASFFVSVSQDC  434 (775)
T ss_pred             EEEecCCcchhhhhhhhccc--ccccceeee--ccc--------------------------------CccEEEEecCCc
Confidence            99996654322211111222  445666555  221                                123678888999


Q ss_pred             eEEEEEcCC
Q 001853          762 ALEIFDVPN  770 (1004)
Q Consensus       762 ~l~I~sLp~  770 (1004)
                      +|++|.||.
T Consensus       435 tlK~W~l~~  443 (775)
T KOG0319|consen  435 TLKLWDLPK  443 (775)
T ss_pred             eEEEecCCC
Confidence            999999998


No 42 
>KOG0283 consensus WD40 repeat-containing protein [Function unknown]
Probab=45.43  E-value=7.1e+02  Score=31.44  Aligned_cols=196  Identities=15%  Similarity=0.160  Sum_probs=112.4

Q ss_pred             cEEEEEEcCCEEEEEE-eCCeEEEEEecCCCceEeeecccccccCCCceeEEEEeecCCCCcceecccccccccCccccc
Q 001853          660 TVLSVSIADPYVLLGM-SDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEA  738 (1004)
Q Consensus       660 ~I~~As~~dpyvll~~-~~g~I~~l~~d~~~~~l~~~~~~~~~~~~~~i~~~~l~~d~~g~~~f~~~~~~~~~~~~~~~~  738 (1004)
                      .|...+-...+.||.. -|.++.|++...+.+ |.+-..      .+-++|+.          |+              +
T Consensus       371 DILDlSWSKn~fLLSSSMDKTVRLWh~~~~~C-L~~F~H------ndfVTcVa----------Fn--------------P  419 (712)
T KOG0283|consen  371 DILDLSWSKNNFLLSSSMDKTVRLWHPGRKEC-LKVFSH------NDFVTCVA----------FN--------------P  419 (712)
T ss_pred             hheecccccCCeeEeccccccEEeecCCCcce-eeEEec------CCeeEEEE----------ec--------------c
Confidence            3777777777776644 699999999988776 432211      33466432          21              2


Q ss_pred             cCCCCCCCCCCCcEEEEEEecCCeEEEEEcCCCeEEEEecCcCccccccccccccccccccchhccCCCccccCCCCccc
Q 001853          739 IDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKEN  818 (1004)
Q Consensus       739 ~~~~~~~~~~~~~~~l~~~~~~g~l~I~sLp~~~~v~~~~~~~~~~~~L~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  818 (1004)
                      +|          .-|.+-..=+|.+.||++|+.+.++=.+ +.                                     
T Consensus       420 vD----------DryFiSGSLD~KvRiWsI~d~~Vv~W~D-l~-------------------------------------  451 (712)
T KOG0283|consen  420 VD----------DRYFISGSLDGKVRLWSISDKKVVDWND-LR-------------------------------------  451 (712)
T ss_pred             cC----------CCcEeecccccceEEeecCcCeeEeehh-hh-------------------------------------
Confidence            22          2244555669999999999999877522 11                                     


Q ss_pred             ccccceEEEEEeecCCCCCCcEEEEEeeCCcEEEEEEEeecCCCCCCCCCCCCcccccccccccccccccceeEEeccCC
Q 001853          819 IHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLD  898 (1004)
Q Consensus       819 ~~~~~i~eill~~lg~~~~~p~L~v~~~~g~l~iY~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lrf~Kv~~~  898 (1004)
                         ..|+-+++..=|     -+-+|.+-+|...+|...-...+             -+..+..++.        ||..  
T Consensus       452 ---~lITAvcy~PdG-----k~avIGt~~G~C~fY~t~~lk~~-------------~~~~I~~~~~--------Kk~~--  500 (712)
T KOG0283|consen  452 ---DLITAVCYSPDG-----KGAVIGTFNGYCRFYDTEGLKLV-------------SDFHIRLHNK--------KKKQ--  500 (712)
T ss_pred             ---hhheeEEeccCC-----ceEEEEEeccEEEEEEccCCeEE-------------EeeeEeeccC--------cccc--
Confidence               234445554333     57889999999999976521110             0000000000        1111  


Q ss_pred             cCCCCCCCCCCCccceEEecccCCceEEEecCCCCeEE-EEcccccEEEeccCCCceEEEecCCCCCCCCcEEEEecCCc
Q 001853          899 AYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWC-MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGI  977 (1004)
Q Consensus       899 ~~~~~~~~~~~~~~~l~~f~~i~g~sgVFv~G~~P~~i-~~~~~~l~~~~~~~~~~v~~f~~F~~~~~~~gfiy~~~~~~  977 (1004)
                               +.   +++-|.        |.+|..--.| -+..+++|++-......|.-|--|+|.+ .+-.+.|+.+|.
T Consensus       501 ---------~~---rITG~Q--------~~p~~~~~vLVTSnDSrIRI~d~~~~~lv~KfKG~~n~~-SQ~~Asfs~Dgk  559 (712)
T KOG0283|consen  501 ---------GK---RITGLQ--------FFPGDPDEVLVTSNDSRIRIYDGRDKDLVHKFKGFRNTS-SQISASFSSDGK  559 (712)
T ss_pred             ---------Cc---eeeeeE--------ecCCCCCeEEEecCCCceEEEeccchhhhhhhcccccCC-cceeeeEccCCC
Confidence                     11   222221        3334333344 5778999999764454677788888843 455677777777


Q ss_pred             EEEEECCCC
Q 001853          978 LKICQLPSG  986 (1004)
Q Consensus       978 lri~~lp~~  986 (1004)
                      --||--...
T Consensus       560 ~IVs~seDs  568 (712)
T KOG0283|consen  560 HIVSASEDS  568 (712)
T ss_pred             EEEEeecCc
Confidence            666666443


No 43 
>KOG1407 consensus WD40 repeat protein [Function unknown]
Probab=44.73  E-value=1.9e+02  Score=31.83  Aligned_cols=98  Identities=15%  Similarity=0.191  Sum_probs=0.0

Q ss_pred             CcEEEEeCC--cceeEEeCCCCCCCCCCCCCCccEEEEEEcCCEEEEEEeCCeEEEEEecCCCceEeeecccccccCCCc
Q 001853          629 RGARILDGS--YMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKP  706 (1004)
Q Consensus       629 ~~vrl~~~~--~~~q~~~~~~~~~e~g~~~~~~~I~~As~~dpyvll~~~~g~I~~l~~d~~~~~l~~~~~~~~~~~~~~  706 (1004)
                      +.||+.+..  +.++.+...          .+..++..+-.+.|+++.-.|..|..+..-......+.+.....+     
T Consensus        87 k~ir~wd~r~~k~~~~i~~~----------~eni~i~wsp~g~~~~~~~kdD~it~id~r~~~~~~~~~~~~e~n-----  151 (313)
T KOG1407|consen   87 KTIRIWDIRSGKCTARIETK----------GENINITWSPDGEYIAVGNKDDRITFIDARTYKIVNEEQFKFEVN-----  151 (313)
T ss_pred             ceEEEEEeccCcEEEEeecc----------CcceEEEEcCCCCEEEEecCcccEEEEEecccceeehhcccceee-----


Q ss_pred             eeEEEEeecCCCCcceecccccccccCccccccCCCCCCCCCCCcEEEEEEecCCeEEEEEcCCCeEEEEec
Q 001853          707 VSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVD  778 (1004)
Q Consensus       707 i~~~~l~~d~~g~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~l~I~sLp~~~~v~~~~  778 (1004)
                        -+|-                                   ..+..+.|+.+..|.++|++-|.|++|+..+
T Consensus       152 --e~~w-----------------------------------~~~nd~Fflt~GlG~v~ILsypsLkpv~si~  186 (313)
T KOG1407|consen  152 --EISW-----------------------------------NNSNDLFFLTNGLGCVEILSYPSLKPVQSIK  186 (313)
T ss_pred             --eeee-----------------------------------cCCCCEEEEecCCceEEEEeccccccccccc


No 44 
>KOG0641 consensus WD40 repeat protein [General function prediction only]
Probab=44.55  E-value=4.3e+02  Score=28.35  Aligned_cols=53  Identities=13%  Similarity=0.154  Sum_probs=34.5

Q ss_pred             EECCCCCEEEEEEecCeEEEEEcccCCCCCCCCCCCCCCCCCcccceeeeEEEEecccCCCceeeEEEecCC
Q 001853          184 KVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLDMKHVKDFIFVHGY  255 (1004)
Q Consensus       184 ~VDP~~Rca~l~~~~~~L~ilP~~~~~~~l~~~d~~~~~~~~~~~~~~~s~~i~l~~ldi~nViD~~FL~gy  255 (1004)
                      .--|.|..++-.-.+..++++||+.+.-.                ...+-..+++-+   ..|+|||||.+-
T Consensus        96 ~ws~~geliatgsndk~ik~l~fn~dt~~----------------~~g~dle~nmhd---gtirdl~fld~~  148 (350)
T KOG0641|consen   96 AWSPCGELIATGSNDKTIKVLPFNADTCN----------------ATGHDLEFNMHD---GTIRDLAFLDDP  148 (350)
T ss_pred             EecCccCeEEecCCCceEEEEeccccccc----------------ccCcceeeeecC---CceeeeEEecCC
Confidence            45777887777767888999999754310                122333444443   788899998663


No 45 
>PF14781 BBS2_N:  Ciliary BBSome complex subunit 2, N-terminal
Probab=43.79  E-value=1.6e+02  Score=29.19  Aligned_cols=72  Identities=15%  Similarity=0.203  Sum_probs=49.4

Q ss_pred             CeEEEEc-CCEEEEEEEEecccccccccCCccccccccccccccccEEEEEEEEeeeeeeEeEEEecCCCCCCCCccEEE
Q 001853           56 PNLVVTA-ANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSII  134 (1004)
Q Consensus        56 ~nLVvak-~n~LeIy~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~kL~lv~e~~l~G~I~~l~~vr~~~s~~~~~~D~Ll  134 (1004)
                      ++|+.|. +..+=||...........                   .=.-+....+.-.|++|++=|+..+.   .+|.|+
T Consensus        11 pcL~~aT~~gKV~IH~ph~~~~~~~~-------------------~~~~i~~LNin~~italaaG~l~~~~---~~D~Ll   68 (136)
T PF14781_consen   11 PCLACATTGGKVFIHNPHERGQRTGR-------------------QDSDISFLNINQEITALAAGRLKPDD---GRDCLL   68 (136)
T ss_pred             eeEEEEecCCEEEEECCCcccccccc-------------------ccCceeEEECCCceEEEEEEecCCCC---CcCEEE
Confidence            7888875 678888876533210000                   01235666788889999988886432   899999


Q ss_pred             EEECCCeEEEEEEeCCCC
Q 001853          135 LAFEDAKISVLEFDDSIH  152 (1004)
Q Consensus       135 v~~~~aklsile~d~~~~  152 (1004)
                      |+|..   +|+-||-+.+
T Consensus        69 iGt~t---~llaYDV~~N   83 (136)
T PF14781_consen   69 IGTQT---SLLAYDVENN   83 (136)
T ss_pred             Eeccc---eEEEEEcccC
Confidence            99976   6888997665


No 46 
>KOG0279 consensus G protein beta subunit-like protein [Signal transduction mechanisms]
Probab=43.46  E-value=5.1e+02  Score=28.90  Aligned_cols=117  Identities=19%  Similarity=0.178  Sum_probs=71.6

Q ss_pred             EEEEEEc-CCEEEE-EEeCCeEEEEEecCCCceEeeecccccccCCCceeEEEEeecCCCCcceecccccccccCccccc
Q 001853          661 VLSVSIA-DPYVLL-GMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEA  738 (1004)
Q Consensus       661 I~~As~~-dpyvll-~~~~g~I~~l~~d~~~~~l~~~~~~~~~~~~~~i~~~~l~~d~~g~~~f~~~~~~~~~~~~~~~~  738 (1004)
                      +..+.+. |--++. .=.||++.++.+++....-.+..       ...|.++|+                          
T Consensus       195 v~t~~vSpDGslcasGgkdg~~~LwdL~~~k~lysl~a-------~~~v~sl~f--------------------------  241 (315)
T KOG0279|consen  195 VNTVTVSPDGSLCASGGKDGEAMLWDLNEGKNLYSLEA-------FDIVNSLCF--------------------------  241 (315)
T ss_pred             EEEEEECCCCCEEecCCCCceEEEEEccCCceeEeccC-------CCeEeeEEe--------------------------
Confidence            4444443 333333 33688999999988755222211       233555554                          


Q ss_pred             cCCCCCCCCCCCcEEEEEEecCCeEEEEEcCCCeEEEEecCcCccccccccccccccccccchhccCCCccccCCCCccc
Q 001853          739 IDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKEN  818 (1004)
Q Consensus       739 ~~~~~~~~~~~~~~~l~~~~~~g~l~I~sLp~~~~v~~~~~~~~~~~~L~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  818 (1004)
                               ....+||..++..+ ++||.|-.-.+|+..+     +...         ..                   .
T Consensus       242 ---------spnrywL~~at~~s-IkIwdl~~~~~v~~l~-----~d~~---------g~-------------------s  278 (315)
T KOG0279|consen  242 ---------SPNRYWLCAATATS-IKIWDLESKAVVEELK-----LDGI---------GP-------------------S  278 (315)
T ss_pred             ---------cCCceeEeeccCCc-eEEEeccchhhhhhcc-----cccc---------cc-------------------c
Confidence                     22368998888877 7999998877766532     1110         00                   0


Q ss_pred             ccccceEEEEEeecCCCCCCcEEEEEeeCCcEEEEEEE
Q 001853          819 IHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAY  856 (1004)
Q Consensus       819 ~~~~~i~eill~~lg~~~~~p~L~v~~~~g~l~iY~~f  856 (1004)
                      .....+..+.++-.-+   ..+||+...||-|.++|.-
T Consensus       279 ~~~~~~~clslaws~d---G~tLf~g~td~~irv~qv~  313 (315)
T KOG0279|consen  279 SKAGDPICLSLAWSAD---GQTLFAGYTDNVIRVWQVA  313 (315)
T ss_pred             cccCCcEEEEEEEcCC---CcEEEeeecCCcEEEEEee
Confidence            1123466777776643   4799999999999998764


No 47 
>KOG4378 consensus Nuclear protein COP1 [Signal transduction mechanisms]
Probab=41.14  E-value=3.6e+02  Score=32.22  Aligned_cols=87  Identities=18%  Similarity=0.197  Sum_probs=54.1

Q ss_pred             ccEEEEEEc--CCEEEEEEeCCeEEEEEecCCCceEeeecccccccCCCceeEEEEeecCCCCcceecccccccccCccc
Q 001853          659 STVLSVSIA--DPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVG  736 (1004)
Q Consensus       659 ~~I~~As~~--dpyvll~~~~g~I~~l~~d~~~~~l~~~~~~~~~~~~~~i~~~~l~~d~~g~~~f~~~~~~~~~~~~~~  736 (1004)
                      ..|+....+  |.||+-...+|.|++.....+-..-+...+      ..+..  -|. +                     
T Consensus       122 stvt~v~YN~~DeyiAsvs~gGdiiih~~~t~~~tt~f~~~------sgqsv--Rll-~---------------------  171 (673)
T KOG4378|consen  122 STVTYVDYNNTDEYIASVSDGGDIIIHGTKTKQKTTTFTID------SGQSV--RLL-R---------------------  171 (673)
T ss_pred             ceeEEEEecCCcceeEEeccCCcEEEEecccCccccceecC------CCCeE--EEe-e---------------------
Confidence            347777654  999998888999998877554211111000      11211  010 0                     


Q ss_pred             cccCCCCCCCCCCCcEEEEEEecCCeEEEEEcCCCeEEEEecCcCccc
Q 001853          737 EAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGR  784 (1004)
Q Consensus       737 ~~~~~~~~~~~~~~~~~l~~~~~~g~l~I~sLp~~~~v~~~~~~~~~~  784 (1004)
                               ........|.++.++|.+.+|....+.+.|........|
T Consensus       172 ---------ys~skr~lL~~asd~G~VtlwDv~g~sp~~~~~~~HsAP  210 (673)
T KOG4378|consen  172 ---------YSPSKRFLLSIASDKGAVTLWDVQGMSPIFHASEAHSAP  210 (673)
T ss_pred             ---------cccccceeeEeeccCCeEEEEeccCCCcccchhhhccCC
Confidence                     013346789999999999999999888888765554444


No 48 
>KOG1446 consensus Histone H3 (Lys4) methyltransferase complex and RNA cleavage factor II complex, subunit SWD2 [RNA processing and modification; Chromatin structure and dynamics; Posttranslational modification, protein turnover, chaperones]
Probab=40.97  E-value=5.6e+02  Score=28.89  Aligned_cols=113  Identities=12%  Similarity=0.105  Sum_probs=0.0

Q ss_pred             cCcEEEEe----CCcceeEEeCCCCCCCCCCCCCCccEEEEEEcCCEEEEEEeCCeEEEEEecCCCceEeeecccccccC
Q 001853          628 ERGARILD----GSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESS  703 (1004)
Q Consensus       628 ~~~vrl~~----~~~~~q~~~~~~~~~e~g~~~~~~~I~~As~~dpyvll~~~~g~I~~l~~d~~~~~l~~~~~~~~~~~  703 (1004)
                      .+.|+|+|    ..+--+.+.+.    +..  ..+-+-+.-|=++.|+||...++.+.++..=....+-.....+.    
T Consensus       161 ~~~IkLyD~Rs~dkgPF~tf~i~----~~~--~~ew~~l~FS~dGK~iLlsT~~s~~~~lDAf~G~~~~tfs~~~~----  230 (311)
T KOG1446|consen  161 SELIKLYDLRSFDKGPFTTFSIT----DND--EAEWTDLEFSPDGKSILLSTNASFIYLLDAFDGTVKSTFSGYPN----  230 (311)
T ss_pred             CCeEEEEEecccCCCCceeEccC----CCC--ccceeeeEEcCCCCEEEEEeCCCcEEEEEccCCcEeeeEeeccC----


Q ss_pred             CCceeEEEEeecCCCCcceecccccccccCccccccCCCCCCCCCCCcEEEEEEecCCeEEEEEcCCCeEEEEecCcCcc
Q 001853          704 KKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSG  783 (1004)
Q Consensus       704 ~~~i~~~~l~~d~~g~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~g~l~I~sLp~~~~v~~~~~~~~~  783 (1004)
                      ...+...+-|...                                  +.+++.+-.+|+++||.+.+-+.|....+...+
T Consensus       231 ~~~~~~~a~ftPd----------------------------------s~Fvl~gs~dg~i~vw~~~tg~~v~~~~~~~~~  276 (311)
T KOG1446|consen  231 AGNLPLSATFTPD----------------------------------SKFVLSGSDDGTIHVWNLETGKKVAVLRGPNGG  276 (311)
T ss_pred             CCCcceeEEECCC----------------------------------CcEEEEecCCCcEEEEEcCCCcEeeEecCCCCC


Q ss_pred             c
Q 001853          784 R  784 (1004)
Q Consensus       784 ~  784 (1004)
                      |
T Consensus       277 ~  277 (311)
T KOG1446|consen  277 P  277 (311)
T ss_pred             C


No 49 
>PF12894 Apc4_WD40:  Anaphase-promoting complex subunit 4 WD40 domain
Probab=39.25  E-value=54  Score=26.17  Aligned_cols=41  Identities=20%  Similarity=0.242  Sum_probs=31.0

Q ss_pred             EEEEEEEEeeeeeeEeEEEecCCCCCCCCccEEEEEECCCeEEEEEEeC
Q 001853          101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDD  149 (1004)
Q Consensus       101 L~lv~e~~l~G~I~~l~~vr~~~s~~~~~~D~Llv~~~~aklsile~d~  149 (1004)
                      ++++.+..+...|..++ .-|       ..|.|.+++.++.+.+-+.+-
T Consensus         2 f~~~~~k~l~~~v~~~~-w~P-------~mdLiA~~t~~g~v~v~Rl~~   42 (47)
T PF12894_consen    2 FRQLGEKNLPSRVSCMS-WCP-------TMDLIALGTEDGEVLVYRLNW   42 (47)
T ss_pred             cceecccCCCCcEEEEE-ECC-------CCCEEEEEECCCeEEEEECCC
Confidence            56777888877777443 222       679999999999999988753


No 50 
>PF14779 BBS1:  Ciliary BBSome complex subunit 1
Probab=38.74  E-value=1.8e+02  Score=32.01  Aligned_cols=62  Identities=23%  Similarity=0.291  Sum_probs=44.4

Q ss_pred             CCeEEEEcCCEEEEEEEEecccccccccCCccccccccccccccccEEEEEEEEeeeeeeEeEEEecCCCCCCCCccEEE
Q 001853           55 VPNLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSII  134 (1004)
Q Consensus        55 ~~nLVvak~n~LeIy~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~kL~lv~e~~l~G~I~~l~~vr~~~s~~~~~~D~Ll  134 (1004)
                      ..+|||+..+. +||-+.+++                         +..+.++.+-+..+.|.+.=.-..    .-=+|+
T Consensus       195 ~scLViGTE~~-~i~iLd~~a-------------------------f~il~~~~lpsvPv~i~~~G~~de----vdyRI~  244 (257)
T PF14779_consen  195 VSCLVIGTESG-EIYILDPQA-------------------------FTILKQVQLPSVPVFISVSGQYDE----VDYRIV  244 (257)
T ss_pred             cceEEEEecCC-eEEEECchh-------------------------heeEEEEecCCCceEEEEEeeeec----cceEEE
Confidence            37999998764 367666654                         778889999888777665421110    122899


Q ss_pred             EEECCCeEEEEE
Q 001853          135 LAFEDAKISVLE  146 (1004)
Q Consensus       135 v~~~~aklsile  146 (1004)
                      |+++++++-+++
T Consensus       245 Va~Rdg~iy~ir  256 (257)
T PF14779_consen  245 VACRDGKIYTIR  256 (257)
T ss_pred             EEeCCCEEEEEe
Confidence            999999998875


No 51 
>PF02239 Cytochrom_D1:  Cytochrome D1 heme domain; PDB: 1NNO_B 1HZU_A 1N15_B 1N50_A 1GJQ_A 1BL9_B 1NIR_B 1N90_B 1HZV_A 1AOQ_A ....
Probab=38.02  E-value=2.4e+02  Score=32.78  Aligned_cols=81  Identities=20%  Similarity=0.197  Sum_probs=47.5

Q ss_pred             cEEEEEEEEeeeeeeEeEEEecCCCCCCCCccEEEEEECCCeEEEEEEeCCCCcEEEEEeeeecCcchhcccCCcccccC
Q 001853          100 SLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFAR  179 (1004)
Q Consensus       100 kL~lv~e~~l~G~I~~l~~vr~~~s~~~~~~D~Llv~~~~aklsile~d~~~~~l~TvSlh~~E~~~~~~~k~g~~~~~~  179 (1004)
                      ..+.+.+.+..|.+-....+.+       ..-+++|+.+++.+++  +|..++++.-.            .+.|..    
T Consensus        25 t~~~~~~i~~~~~~h~~~~~s~-------Dgr~~yv~~rdg~vsv--iD~~~~~~v~~------------i~~G~~----   79 (369)
T PF02239_consen   25 TNKVVARIPTGGAPHAGLKFSP-------DGRYLYVANRDGTVSV--IDLATGKVVAT------------IKVGGN----   79 (369)
T ss_dssp             T-SEEEEEE-STTEEEEEE-TT--------SSEEEEEETTSEEEE--EETTSSSEEEE------------EE-SSE----
T ss_pred             CCeEEEEEcCCCCceeEEEecC-------CCCEEEEEcCCCeEEE--EECCcccEEEE------------EecCCC----
Confidence            3667777777655422222222       2348999999997665  58877764422            122332    


Q ss_pred             CCeEEECCCCCEEEEEEe-cCeEEEEE
Q 001853          180 GPLVKVDPQGRCGGVLVY-GLQMIILK  205 (1004)
Q Consensus       180 ~~~l~VDP~~Rca~l~~~-~~~L~ilP  205 (1004)
                      ..-+.+.|+||++++..| .+.+.|+-
T Consensus        80 ~~~i~~s~DG~~~~v~n~~~~~v~v~D  106 (369)
T PF02239_consen   80 PRGIAVSPDGKYVYVANYEPGTVSVID  106 (369)
T ss_dssp             EEEEEE--TTTEEEEEEEETTEEEEEE
T ss_pred             cceEEEcCCCCEEEEEecCCCceeEec
Confidence            123788899999999988 78888863


No 52 
>KOG0289 consensus mRNA splicing factor [General function prediction only]
Probab=37.80  E-value=7.5e+02  Score=29.24  Aligned_cols=100  Identities=14%  Similarity=0.133  Sum_probs=56.4

Q ss_pred             cccEEEEEecCceEEEEe---cCceeEEecCCCccccCCeEEEEEeCCCcEEEEEecCcEEEEeCC--cceeEEeCCCCC
Q 001853          575 YHAYLIISLEARTMVLET---ADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS--YMTQDLSFGPSN  649 (1004)
Q Consensus       575 ~~~yLilS~~~~T~Vl~~---~~~l~ev~~~~~F~~~~~TI~ag~l~~~~~IvQVt~~~vrl~~~~--~~~q~~~~~~~~  649 (1004)
                      ...||+-+-.++|-.|+.   +..+..+. +.   +++--+.         -.|++|.|+.+-.+.  +.+..|.+..-+
T Consensus       314 tgeYllsAs~d~~w~Fsd~~~g~~lt~vs-~~---~s~v~~t---------s~~fHpDgLifgtgt~d~~vkiwdlks~~  380 (506)
T KOG0289|consen  314 TGEYLLSASNDGTWAFSDISSGSQLTVVS-DE---TSDVEYT---------SAAFHPDGLIFGTGTPDGVVKIWDLKSQT  380 (506)
T ss_pred             CCcEEEEecCCceEEEEEccCCcEEEEEe-ec---cccceeE---------EeeEcCCceEEeccCCCceEEEEEcCCcc
Confidence            346888887788888873   44444443 10   1212222         345566665554432  355566553110


Q ss_pred             CCCCCCCCCccEEEEEE--cCCEEEEEEeCCeEEEEEecC
Q 001853          650 SESGSGSENSTVLSVSI--ADPYVLLGMSDGSIRLLVGDP  687 (1004)
Q Consensus       650 ~e~g~~~~~~~I~~As~--~dpyvll~~~~g~I~~l~~d~  687 (1004)
                      .-...|++...|...++  |+=|+++.++|++|.+|.+-.
T Consensus       381 ~~a~Fpght~~vk~i~FsENGY~Lat~add~~V~lwDLRK  420 (506)
T KOG0289|consen  381 NVAKFPGHTGPVKAISFSENGYWLATAADDGSVKLWDLRK  420 (506)
T ss_pred             ccccCCCCCCceeEEEeccCceEEEEEecCCeEEEEEehh
Confidence            00111223345777777  466888899999999999854


No 53 
>KOG0291 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=37.48  E-value=9.6e+02  Score=30.40  Aligned_cols=160  Identities=16%  Similarity=0.187  Sum_probs=80.4

Q ss_pred             CCceEEEEeCCCccEeEEEeecCCCCCC----CCcccccccCcccccEEEEEecCceEEEEecCceeEEecCCCccccCC
Q 001853          535 KQSNYELVELPGCKGIWTVYHKSSRGHN----ADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGR  610 (1004)
Q Consensus       535 ~~GsL~v~~lpg~~~iWtv~~~~~~~~~----~~~~~~~~~~~~~~~yLilS~~~~T~Vl~~~~~l~ev~~~~~F~~~~~  610 (1004)
                      .+|-+.+.+||+..-|-.+.....+-..    ..++-..-...+--..||---..++.||+-.+....++ .-.+..+++
T Consensus       285 ssG~f~LyelP~f~lih~LSis~~~I~t~~~N~tGDWiA~g~~klgQLlVweWqsEsYVlKQQgH~~~i~-~l~YSpDgq  363 (893)
T KOG0291|consen  285 SSGEFGLYELPDFNLIHSLSISDQKILTVSFNSTGDWIAFGCSKLGQLLVWEWQSESYVLKQQGHSDRIT-SLAYSPDGQ  363 (893)
T ss_pred             cCCeeEEEecCCceEEEEeecccceeeEEEecccCCEEEEcCCccceEEEEEeeccceeeecccccccee-eEEECCCCc
Confidence            3444445788886666656543211000    00000000112223445555556777776655555554 344445555


Q ss_pred             eEEEEEeCCCcEEEEEecCcEEEEeCCcceeEEeCCCCCCCCCCCCCCccEEEEEEcCCEEEEEEeCCeEEEEEecCCCc
Q 001853          611 TIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTC  690 (1004)
Q Consensus       611 TI~ag~l~~~~~IvQVt~~~vrl~~~~~~~q~~~~~~~~~e~g~~~~~~~I~~As~~dpyvll~~~~g~I~~l~~d~~~~  690 (1004)
                      -|+.|-=          ...|++++....-=...+.    |.   ..+...++-+.....++-+.-||++..|.+..-..
T Consensus       364 ~iaTG~e----------DgKVKvWn~~SgfC~vTFt----eH---ts~Vt~v~f~~~g~~llssSLDGtVRAwDlkRYrN  426 (893)
T KOG0291|consen  364 LIATGAE----------DGKVKVWNTQSGFCFVTFT----EH---TSGVTAVQFTARGNVLLSSSLDGTVRAWDLKRYRN  426 (893)
T ss_pred             EEEeccC----------CCcEEEEeccCceEEEEec----cC---CCceEEEEEEecCCEEEEeecCCeEEeeeecccce
Confidence            5544431          3346777654311112222    21   12344566666666666677899999999865321


Q ss_pred             eEeeecccccccCCCceeEEEEeecCCCC
Q 001853          691 TVSVQTPAAIESSKKPVSSCTLYHDKGPE  719 (1004)
Q Consensus       691 ~l~~~~~~~~~~~~~~i~~~~l~~d~~g~  719 (1004)
                      -=....       ..++...|+..|++|+
T Consensus       427 fRTft~-------P~p~QfscvavD~sGe  448 (893)
T KOG0291|consen  427 FRTFTS-------PEPIQFSCVAVDPSGE  448 (893)
T ss_pred             eeeecC-------CCceeeeEEEEcCCCC
Confidence            111112       2345566999999884


No 54 
>KOG1538 consensus Uncharacterized conserved protein WDR10, contains WD40 repeats [General function prediction only]
Probab=37.02  E-value=5.2e+02  Score=32.08  Aligned_cols=18  Identities=17%  Similarity=0.316  Sum_probs=16.4

Q ss_pred             CeEEEEcCCEEEEEEEEe
Q 001853           56 PNLVVTAANVIEIYVVRV   73 (1004)
Q Consensus        56 ~nLVvak~n~LeIy~v~~   73 (1004)
                      .+||+|.+|+|-||+++.
T Consensus        25 sqL~lAAg~rlliyD~nd   42 (1081)
T KOG1538|consen   25 TQLILAAGSRLLVYDTSD   42 (1081)
T ss_pred             ceEEEecCCEEEEEeCCC
Confidence            799999999999999863


No 55 
>KOG0647 consensus mRNA export protein (contains WD40 repeats) [RNA processing and modification]
Probab=36.98  E-value=1.9e+02  Score=32.34  Aligned_cols=55  Identities=11%  Similarity=0.177  Sum_probs=41.6

Q ss_pred             cEEEEEEcCCEEEEEEeCCeEEEEEecCCCceEeeecccccccCCCceeEEEEeecCCC
Q 001853          660 TVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGP  718 (1004)
Q Consensus       660 ~I~~As~~dpyvll~~~~g~I~~l~~d~~~~~l~~~~~~~~~~~~~~i~~~~l~~d~~g  718 (1004)
                      ++=+|.+-.|.+++++.+..|.+|.+.+..........+ +   +.++-++++|.|..+
T Consensus       158 RvYa~Dv~~pm~vVata~r~i~vynL~n~~te~k~~~Sp-L---k~Q~R~va~f~d~~~  212 (347)
T KOG0647|consen  158 RVYAADVLYPMAVVATAERHIAVYNLENPPTEFKRIESP-L---KWQTRCVACFQDKDG  212 (347)
T ss_pred             eeeehhccCceeEEEecCCcEEEEEcCCCcchhhhhcCc-c---cceeeEEEEEecCCc
Confidence            478899999999999999999999997653322222222 2   667888889999876


No 56 
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=36.66  E-value=6.1e+02  Score=28.33  Aligned_cols=87  Identities=16%  Similarity=0.137  Sum_probs=48.3

Q ss_pred             cEEEEEEEEeeeeeeEeEEEecCCCCCCCCccEEEEEEC-CCeEEEEEEeCCCCcEEEEEeeeecCcchhcccCCccccc
Q 001853          100 SLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFE-DAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFA  178 (1004)
Q Consensus       100 kL~lv~e~~l~G~I~~l~~vr~~~s~~~~~~D~Llv~~~-~aklsile~d~~~~~l~TvSlh~~E~~~~~~~k~g~~~~~  178 (1004)
                      +|.++.+.++-|....|+.- +       ....|+++.. +++++++.++....-...+  +.++         +   ..
T Consensus        69 ~l~~~~~~~~~~~p~~i~~~-~-------~g~~l~v~~~~~~~v~v~~~~~~g~~~~~~--~~~~---------~---~~  126 (330)
T PRK11028         69 ALTFAAESPLPGSPTHISTD-H-------QGRFLFSASYNANCVSVSPLDKDGIPVAPI--QIIE---------G---LE  126 (330)
T ss_pred             ceEEeeeecCCCCceEEEEC-C-------CCCEEEEEEcCCCeEEEEEECCCCCCCCce--eecc---------C---CC
Confidence            47666666665555544421 1       3446776654 6777776665321111111  1110         1   01


Q ss_pred             CCCeEEECCCCCEEEEEEe-cCeEEEEEccc
Q 001853          179 RGPLVKVDPQGRCGGVLVY-GLQMIILKASQ  208 (1004)
Q Consensus       179 ~~~~l~VDP~~Rca~l~~~-~~~L~ilP~~~  208 (1004)
                      ...-+.++|+|+.+.+.-+ .+.+.++.+..
T Consensus       127 ~~~~~~~~p~g~~l~v~~~~~~~v~v~d~~~  157 (330)
T PRK11028        127 GCHSANIDPDNRTLWVPCLKEDRIRLFTLSD  157 (330)
T ss_pred             cccEeEeCCCCCEEEEeeCCCCEEEEEEECC
Confidence            1234679999999976666 68888887754


No 57 
>KOG0295 consensus WD40 repeat-containing protein [Function unknown]
Probab=36.35  E-value=4.5e+02  Score=30.30  Aligned_cols=62  Identities=18%  Similarity=0.204  Sum_probs=48.7

Q ss_pred             CcEEEEEEecCCeEEEEEcCCCeEEEEecCcCccccccccccccccccccchhccCCCccccCCCCcccccccceEEEEE
Q 001853          750 GDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAM  829 (1004)
Q Consensus       750 ~~~~l~~~~~~g~l~I~sLp~~~~v~~~~~~~~~~~~L~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~eill  829 (1004)
                      +..+|+....+++++||.++.-.|+|+..+..                                        .=|+++++
T Consensus       303 ~~~~l~s~SrDktIk~wdv~tg~cL~tL~ghd----------------------------------------nwVr~~af  342 (406)
T KOG0295|consen  303 GGQVLGSGSRDKTIKIWDVSTGMCLFTLVGHD----------------------------------------NWVRGVAF  342 (406)
T ss_pred             CccEEEeecccceEEEEeccCCeEEEEEeccc----------------------------------------ceeeeeEE
Confidence            56899999999999999999999999864332                                        13566766


Q ss_pred             eecCCCCCCcEEEEEeeCCcEEEEEEE
Q 001853          830 QRWSAHHSRPFLFAILTDGTILCYQAY  856 (1004)
Q Consensus       830 ~~lg~~~~~p~L~v~~~~g~l~iY~~f  856 (1004)
                      ..=|     -||+-...|+.|-+|..-
T Consensus       343 ~p~G-----kyi~ScaDDktlrvwdl~  364 (406)
T KOG0295|consen  343 SPGG-----KYILSCADDKTLRVWDLK  364 (406)
T ss_pred             cCCC-----eEEEEEecCCcEEEEEec
Confidence            5433     689988999999999654


No 58 
>KOG0296 consensus Angio-associated migratory cell protein (contains WD40 repeats) [Function unknown]
Probab=35.64  E-value=7.5e+02  Score=28.59  Aligned_cols=118  Identities=15%  Similarity=0.248  Sum_probs=73.6

Q ss_pred             CCCccccCCeEEEEEeCCCcEEEEEecCcEEE-EeCCcceeEEeCCCCCCCCCCCCCCccEEEEEEcCCEEEEEEeCCeE
Q 001853          602 SVDYFVQGRTIAAGNLFGRRRVIQVFERGARI-LDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSI  680 (1004)
Q Consensus       602 ~~~F~~~~~TI~ag~l~~~~~IvQVt~~~vrl-~~~~~~~q~~~~~~~~~e~g~~~~~~~I~~As~~dpyvll~~~~g~I  680 (1004)
                      ...|..++.-|+-|-|.+.-+|-+|-+.+.+. ++..-.-.+|--|  -       .         ..+.++-...||++
T Consensus       111 ~~~FshdgtlLATGdmsG~v~v~~~stg~~~~~~~~e~~dieWl~W--H-------p---------~a~illAG~~DGsv  172 (399)
T KOG0296|consen  111 CCSFSHDGTLLATGDMSGKVLVFKVSTGGEQWKLDQEVEDIEWLKW--H-------P---------RAHILLAGSTDGSV  172 (399)
T ss_pred             EEEEccCceEEEecCCCccEEEEEcccCceEEEeecccCceEEEEe--c-------c---------cccEEEeecCCCcE
Confidence            35588888888888887665666666655443 3322111244332  0       0         24455667789999


Q ss_pred             EEEEecCCCceEeeecccccccCCCceeEEEEeecCCCCcceecccccccccCccccccCCCCCCCCCCCcEEEEEEecC
Q 001853          681 RLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYES  760 (1004)
Q Consensus       681 ~~l~~d~~~~~l~~~~~~~~~~~~~~i~~~~l~~d~~g~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~  760 (1004)
                      -+|++.+..  +.    +........++++++..|  |                                 .-++...++
T Consensus       173 Wmw~ip~~~--~~----kv~~Gh~~~ct~G~f~pd--G---------------------------------Kr~~tgy~d  211 (399)
T KOG0296|consen  173 WMWQIPSQA--LC----KVMSGHNSPCTCGEFIPD--G---------------------------------KRILTGYDD  211 (399)
T ss_pred             EEEECCCcc--ee----eEecCCCCCcccccccCC--C---------------------------------ceEEEEecC
Confidence            999998852  22    111112556777776433  2                                 234555669


Q ss_pred             CeEEEEEcCCCeEEEEec
Q 001853          761 GALEIFDVPNFNCVFTVD  778 (1004)
Q Consensus       761 g~l~I~sLp~~~~v~~~~  778 (1004)
                      |+|.+|.+...++.+...
T Consensus       212 gti~~Wn~ktg~p~~~~~  229 (399)
T KOG0296|consen  212 GTIIVWNPKTGQPLHKIT  229 (399)
T ss_pred             ceEEEEecCCCceeEEec
Confidence            999999999988888866


No 59 
>PTZ00421 coronin; Provisional
Probab=34.62  E-value=9.1e+02  Score=29.28  Aligned_cols=119  Identities=11%  Similarity=0.075  Sum_probs=0.0

Q ss_pred             CccEEEEEE---cCCEEEEEEeCCeEEEEEecCCCceEeee-cccccccCCCceeEEEEeecCCCCcceecccccccccC
Q 001853          658 NSTVLSVSI---ADPYVLLGMSDGSIRLLVGDPSTCTVSVQ-TPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLST  733 (1004)
Q Consensus       658 ~~~I~~As~---~dpyvll~~~~g~I~~l~~d~~~~~l~~~-~~~~~~~~~~~i~~~~l~~d~~g~~~f~~~~~~~~~~~  733 (1004)
                      ...|.+.++   .+.+++.+..|++|.+|.+...+..-... ....+......|.+++...+...               
T Consensus        75 ~~~V~~v~fsP~d~~~LaSgS~DgtIkIWdi~~~~~~~~~~~~l~~L~gH~~~V~~l~f~P~~~~---------------  139 (493)
T PTZ00421         75 EGPIIDVAFNPFDPQKLFTASEDGTIMGWGIPEEGLTQNISDPIVHLQGHTKKVGIVSFHPSAMN---------------  139 (493)
T ss_pred             CCCEEEEEEcCCCCCEEEEEeCCCEEEEEecCCCccccccCcceEEecCCCCcEEEEEeCcCCCC---------------


Q ss_pred             ccccccCCCCCCCCCCCcEEEEEEecCCeEEEEEcCCCeEEEEecCcCccccccccccccccccccchhccCCCccccCC
Q 001853          734 GVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQ  813 (1004)
Q Consensus       734 ~~~~~~~~~~~~~~~~~~~~l~~~~~~g~l~I~sLp~~~~v~~~~~~~~~~~~L~~~~~~~~~~~~~~~~~~~~~~~~~~  813 (1004)
                                         +++.+..+|.+.||.+.+-+++.....-...                              
T Consensus       140 -------------------iLaSgs~DgtVrIWDl~tg~~~~~l~~h~~~------------------------------  170 (493)
T PTZ00421        140 -------------------VLASAGADMVVNVWDVERGKAVEVIKCHSDQ------------------------------  170 (493)
T ss_pred             -------------------EEEEEeCCCEEEEEECCCCeEEEEEcCCCCc------------------------------


Q ss_pred             CCcccccccceEEEEEeecCCCCCCcEEEEEeeCCcEEEEEE
Q 001853          814 GRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQA  855 (1004)
Q Consensus       814 ~~~~~~~~~~i~eill~~lg~~~~~p~L~v~~~~g~l~iY~~  855 (1004)
                                |..+.+..-|     .+|+....||.|-+|.+
T Consensus       171 ----------V~sla~spdG-----~lLatgs~Dg~IrIwD~  197 (493)
T PTZ00421        171 ----------ITSLEWNLDG-----SLLCTTSKDKKLNIIDP  197 (493)
T ss_pred             ----------eEEEEEECCC-----CEEEEecCCCEEEEEEC


No 60 
>KOG0276 consensus Vesicle coat complex COPI, beta' subunit [Intracellular trafficking, secretion, and vesicular transport]
Probab=34.51  E-value=2.9e+02  Score=33.97  Aligned_cols=97  Identities=14%  Similarity=0.212  Sum_probs=0.0

Q ss_pred             CCEEEEEEeCCeEEEEEecCCCceEeeecccccccCCCceeEEEEeecCCCCcceecccccccccCccccccCCCCCCCC
Q 001853          668 DPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPL  747 (1004)
Q Consensus       668 dpyvll~~~~g~I~~l~~d~~~~~l~~~~~~~~~~~~~~i~~~~l~~d~~g~~~f~~~~~~~~~~~~~~~~~~~~~~~~~  747 (1004)
                      .|+++.++-+|++.++..+....                +.++-+..-+-..+.|                         
T Consensus        25 ePw~la~LynG~V~IWnyetqtm----------------VksfeV~~~PvRa~kf-------------------------   63 (794)
T KOG0276|consen   25 EPWILAALYNGDVQIWNYETQTM----------------VKSFEVSEVPVRAAKF-------------------------   63 (794)
T ss_pred             CceEEEeeecCeeEEEeccccee----------------eeeeeecccchhhhee-------------------------


Q ss_pred             CCCcEEEEEEecCCeEEEEEcCCCeEEEEecCcCccccccccccccccccccchhccCCCccccCCCCcccccccceEEE
Q 001853          748 DQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVEL  827 (1004)
Q Consensus       748 ~~~~~~l~~~~~~g~l~I~sLp~~~~v~~~~~~~~~~~~L~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~ei  827 (1004)
                      -....|+++..+++.+.||..-+++.|.+.+                                        .....|+.|
T Consensus        64 iaRknWiv~GsDD~~IrVfnynt~ekV~~Fe----------------------------------------AH~DyIR~i  103 (794)
T KOG0276|consen   64 IARKNWIVTGSDDMQIRVFNYNTGEKVKTFE----------------------------------------AHSDYIRSI  103 (794)
T ss_pred             eeccceEEEecCCceEEEEecccceeeEEee----------------------------------------ccccceeee


Q ss_pred             EEeecCCCCCCcEEEEEeeCCcEEE
Q 001853          828 AMQRWSAHHSRPFLFAILTDGTILC  852 (1004)
Q Consensus       828 ll~~lg~~~~~p~L~v~~~~g~l~i  852 (1004)
                      .++.--     ||++  ++.++++|
T Consensus       104 avHPt~-----P~vL--tsSDDm~i  121 (794)
T KOG0276|consen  104 AVHPTL-----PYVL--TSSDDMTI  121 (794)
T ss_pred             eecCCC-----CeEE--ecCCccEE


No 61 
>KOG0316 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=33.52  E-value=6.7e+02  Score=27.39  Aligned_cols=166  Identities=18%  Similarity=0.166  Sum_probs=88.1

Q ss_pred             cEEEEEecCceEEEE-ecCceeEEecCCCccccCCeEEEEEeCCCcEEEEEecCcEEEEeCC----cceeEEeCCCCCCC
Q 001853          577 AYLIISLEARTMVLE-TADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS----YMTQDLSFGPSNSE  651 (1004)
Q Consensus       577 ~yLilS~~~~T~Vl~-~~~~l~ev~~~~~F~~~~~TI~ag~l~~~~~IvQVt~~~vrl~~~~----~~~q~~~~~~~~~e  651 (1004)
                      +.+.+=..+.-.|.+ ..+...+++ ...|+-+..-++-|.|          ...+|+.|..    ..+|.+.-      
T Consensus        81 k~v~vwDV~TGkv~Rr~rgH~aqVN-tV~fNeesSVv~Sgsf----------D~s~r~wDCRS~s~ePiQilde------  143 (307)
T KOG0316|consen   81 KAVQVWDVNTGKVDRRFRGHLAQVN-TVRFNEESSVVASGSF----------DSSVRLWDCRSRSFEPIQILDE------  143 (307)
T ss_pred             ceEEEEEcccCeeeeecccccceee-EEEecCcceEEEeccc----------cceeEEEEcccCCCCccchhhh------
Confidence            344444444444544 455666666 5666655555665555          3457778754    23555421      


Q ss_pred             CCCCCCCccEEEEEEcCCEEEEEEeCCeEEEEEecCCCceEeeecccccccCCCceeEEEEeecCCCCccee--------
Q 001853          652 SGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLR--------  723 (1004)
Q Consensus       652 ~g~~~~~~~I~~As~~dpyvll~~~~g~I~~l~~d~~~~~l~~~~~~~~~~~~~~i~~~~l~~d~~g~~~f~--------  723 (1004)
                      .     ...|.+..+++.-|+-...||++..|.+-..+..-..     +   ..+|++.|+-+|.+-  .+.        
T Consensus       144 a-----~D~V~Si~v~~heIvaGS~DGtvRtydiR~G~l~sDy-----~---g~pit~vs~s~d~nc--~La~~l~stlr  208 (307)
T KOG0316|consen  144 A-----KDGVSSIDVAEHEIVAGSVDGTVRTYDIRKGTLSSDY-----F---GHPITSVSFSKDGNC--SLASSLDSTLR  208 (307)
T ss_pred             h-----cCceeEEEecccEEEeeccCCcEEEEEeecceeehhh-----c---CCcceeEEecCCCCE--EEEeeccceee
Confidence            1     1238888889999999999999999988654321111     1   233554444333211  110        


Q ss_pred             ---ccccc-ccccCccc-cccCCCCCCCCCCCcEEEEEEecCCeEEEEEcCCCeEEEE
Q 001853          724 ---KTSTD-AWLSTGVG-EAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFT  776 (1004)
Q Consensus       724 ---~~~~~-~~~~~~~~-~~~~~~~~~~~~~~~~~l~~~~~~g~l~I~sLp~~~~v~~  776 (1004)
                         +.+.. -....|.. +..+-  +--..+...-++.+.++|.+.+|+|-+-+.+-.
T Consensus       209 LlDk~tGklL~sYkGhkn~eykl--dc~l~qsdthV~sgSEDG~Vy~wdLvd~~~~sk  264 (307)
T KOG0316|consen  209 LLDKETGKLLKSYKGHKNMEYKL--DCCLNQSDTHVFSGSEDGKVYFWDLVDETQISK  264 (307)
T ss_pred             ecccchhHHHHHhcccccceeee--eeeecccceeEEeccCCceEEEEEeccceeeee
Confidence               00000 00000000 00000  001234556789999999999999988766554


No 62 
>cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto
Probab=32.76  E-value=5.6e+02  Score=26.28  Aligned_cols=29  Identities=28%  Similarity=0.291  Sum_probs=19.2

Q ss_pred             cEEEEEEcC--CEEEEEEeCCeEEEEEecCC
Q 001853          660 TVLSVSIAD--PYVLLGMSDGSIRLLVGDPS  688 (1004)
Q Consensus       660 ~I~~As~~d--pyvll~~~~g~I~~l~~d~~  688 (1004)
                      .|.+..+..  .+++....++.|.+|.+...
T Consensus       137 ~i~~~~~~~~~~~l~~~~~~~~i~i~d~~~~  167 (289)
T cd00200         137 WVNSVAFSPDGTFVASSSQDGTIKLWDLRTG  167 (289)
T ss_pred             cEEEEEEcCcCCEEEEEcCCCcEEEEEcccc
Confidence            377777764  44444444999999988643


No 63 
>KOG1274 consensus WD40 repeat protein [General function prediction only]
Probab=31.96  E-value=1.2e+03  Score=30.04  Aligned_cols=53  Identities=15%  Similarity=0.310  Sum_probs=35.0

Q ss_pred             cCcEEEEeCCcceeEEeCCCCCCCCCCCCCCccEEEEEE--cCCEEEEEEeCCeEEEEEecCCC
Q 001853          628 ERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSI--ADPYVLLGMSDGSIRLLVGDPST  689 (1004)
Q Consensus       628 ~~~vrl~~~~~~~q~~~~~~~~~e~g~~~~~~~I~~As~--~dpyvll~~~~g~I~~l~~d~~~  689 (1004)
                      ..+|++++.....|+..+-      |   ....|.+.+.  ++.+++++.-||.|.+|++++..
T Consensus       117 D~~vK~~~~~D~s~~~~lr------g---h~apVl~l~~~p~~~fLAvss~dG~v~iw~~~~~~  171 (933)
T KOG1274|consen  117 DTAVKLLNLDDSSQEKVLR------G---HDAPVLQLSYDPKGNFLAVSSCDGKVQIWDLQDGI  171 (933)
T ss_pred             ceeEEEEeccccchheeec------c---cCCceeeeeEcCCCCEEEEEecCceEEEEEcccch
Confidence            4567777765433333221      1   1133777776  58899999999999999998653


No 64 
>PF06977 SdiA-regulated:  SdiA-regulated;  InterPro: IPR009722 This entry represents a conserved region approximately 100 residues long within a number of hypothetical bacterial proteins that may be regulated by SdiA, a member of the LuxR family of transcriptional regulators []. Some proteins contain the IPR001258 from INTERPRO repeat.; PDB: 3QQZ_A.
Probab=30.94  E-value=82  Score=34.51  Aligned_cols=60  Identities=23%  Similarity=0.303  Sum_probs=36.1

Q ss_pred             CEEEEEeCCCCEEEEEEEECCceEeEEEEEec-----CCCcccceEEEEcCCeEEEEeeeCCeeEEEEe
Q 001853          375 DVALLSTKTGDLVLLTVVYDGRVVQRLDLSKT-----NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT  438 (1004)
Q Consensus       375 ~~~Ll~~~~G~L~~L~l~~dgr~V~~l~l~~~-----g~~~~~S~l~~l~~g~lFvGS~~GDS~Ll~~~  438 (1004)
                      +.++|.++...|..+  ..+|+-++.+.|..-     ...+.|.-|+.-.+|.|||-|+  -.++|+|+
T Consensus       184 ~lliLS~es~~l~~~--d~~G~~~~~~~L~~g~~gl~~~~~QpEGIa~d~~G~LYIvsE--pNlfy~f~  248 (248)
T PF06977_consen  184 HLLILSDESRLLLEL--DRQGRVVSSLSLDRGFHGLSKDIPQPEGIAFDPDGNLYIVSE--PNLFYRFE  248 (248)
T ss_dssp             EEEEEETTTTEEEEE---TT--EEEEEE-STTGGG-SS---SEEEEEE-TT--EEEEET--TTEEEEEE
T ss_pred             eEEEEECCCCeEEEE--CCCCCEEEEEEeCCcccCcccccCCccEEEECCCCCEEEEcC--CceEEEeC
Confidence            456777776666444  467777777777652     3457799999999999999998  34777763


No 65 
>KOG0643 consensus Translation initiation factor 3, subunit i (eIF-3i)/TGF-beta receptor-interacting protein (TRIP-1) [Translation, ribosomal structure and biogenesis; Signal transduction mechanisms]
Probab=29.66  E-value=7.2e+02  Score=27.62  Aligned_cols=66  Identities=8%  Similarity=-0.009  Sum_probs=38.6

Q ss_pred             eEEEEEeCCCcEEEEEe---cCcEEEEeCCc--ceeEEeCCCCCCCCCCCCCCccEEEEEEcCCEEEEEEe-----CCeE
Q 001853          611 TIAAGNLFGRRRVIQVF---ERGARILDGSY--MTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMS-----DGSI  680 (1004)
Q Consensus       611 TI~ag~l~~~~~IvQVt---~~~vrl~~~~~--~~q~~~~~~~~~e~g~~~~~~~I~~As~~dpyvll~~~-----~g~I  680 (1004)
                      +|.+..+...+ =.-||   .+.++|.|-+.  ++-.|+.+       ++   .+.+.-+..+.+++++++     -+.|
T Consensus        54 avW~~Did~~s-~~liTGSAD~t~kLWDv~tGk~la~~k~~-------~~---Vk~~~F~~~gn~~l~~tD~~mg~~~~v  122 (327)
T KOG0643|consen   54 AVWCCDIDWDS-KHLITGSADQTAKLWDVETGKQLATWKTN-------SP---VKRVDFSFGGNLILASTDKQMGYTCFV  122 (327)
T ss_pred             eEEEEEecCCc-ceeeeccccceeEEEEcCCCcEEEEeecC-------Ce---eEEEeeccCCcEEEEEehhhcCcceEE
Confidence            45555554332 11222   45678887652  45556542       22   556777778999998874     3467


Q ss_pred             EEEEecC
Q 001853          681 RLLVGDP  687 (1004)
Q Consensus       681 ~~l~~d~  687 (1004)
                      .+|.+..
T Consensus       123 ~~fdi~~  129 (327)
T KOG0643|consen  123 SVFDIRD  129 (327)
T ss_pred             EEEEccC
Confidence            7777743


No 66 
>KOG0295 consensus WD40 repeat-containing protein [Function unknown]
Probab=29.10  E-value=2.8e+02  Score=31.88  Aligned_cols=70  Identities=17%  Similarity=0.377  Sum_probs=50.3

Q ss_pred             CCEEEEEEeCCeEEEEEecCCCceEeeecccccccCCCceeEEEEeecCCCCcceecccccccccCccccccCCCCCCCC
Q 001853          668 DPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPL  747 (1004)
Q Consensus       668 dpyvll~~~~g~I~~l~~d~~~~~l~~~~~~~~~~~~~~i~~~~l~~d~~g~~~f~~~~~~~~~~~~~~~~~~~~~~~~~  747 (1004)
                      .+++.....|++|.++.+....+++++...       ..|....++                                  
T Consensus       304 ~~~l~s~SrDktIk~wdv~tg~cL~tL~gh-------dnwVr~~af----------------------------------  342 (406)
T KOG0295|consen  304 GQVLGSGSRDKTIKIWDVSTGMCLFTLVGH-------DNWVRGVAF----------------------------------  342 (406)
T ss_pred             ccEEEeecccceEEEEeccCCeEEEEEecc-------cceeeeeEE----------------------------------
Confidence            467777888999999999887766664421       224433222                                  


Q ss_pred             CCCcEEEEEEecCCeEEEEEcCCCeEEEEec
Q 001853          748 DQGDIYSVVCYESGALEIFDVPNFNCVFTVD  778 (1004)
Q Consensus       748 ~~~~~~l~~~~~~g~l~I~sLp~~~~v~~~~  778 (1004)
                      .....|++-|-+|++|.||+|.+.++.-..+
T Consensus       343 ~p~Gkyi~ScaDDktlrvwdl~~~~cmk~~~  373 (406)
T KOG0295|consen  343 SPGGKYILSCADDKTLRVWDLKNLQCMKTLE  373 (406)
T ss_pred             cCCCeEEEEEecCCcEEEEEeccceeeeccC
Confidence            2235799999999999999999988766543


No 67 
>PF07569 Hira:  TUP1-like enhancer of split;  InterPro: IPR011494 The Hira proteins are found in a range of eukaryotes and are implicated in the assembly of repressive chromatin. These proteins also contain IPR001680 from INTERPRO.; GO: 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=28.15  E-value=2.6e+02  Score=29.91  Aligned_cols=33  Identities=15%  Similarity=0.127  Sum_probs=27.9

Q ss_pred             cEEEEEEcCCEEEEEEeCCeEEEEEecCCCceE
Q 001853          660 TVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTV  692 (1004)
Q Consensus       660 ~I~~As~~dpyvll~~~~g~I~~l~~d~~~~~l  692 (1004)
                      .++...+++.|+++.+.+|.+.++.+......+
T Consensus        14 ~~~~l~~~~~~Ll~iT~~G~l~vWnl~~~k~~~   46 (219)
T PF07569_consen   14 PVSFLECNGSYLLAITSSGLLYVWNLKKGKAVL   46 (219)
T ss_pred             ceEEEEeCCCEEEEEeCCCeEEEEECCCCeecc
Confidence            378888999999999999999999998764433


No 68 
>PF12894 Apc4_WD40:  Anaphase-promoting complex subunit 4 WD40 domain
Probab=27.73  E-value=61  Score=25.87  Aligned_cols=24  Identities=13%  Similarity=0.299  Sum_probs=20.3

Q ss_pred             cEEEEEEecCCeEEEEEcCCCeEEE
Q 001853          751 DIYSVVCYESGALEIFDVPNFNCVF  775 (1004)
Q Consensus       751 ~~~l~~~~~~g~l~I~sLp~~~~v~  775 (1004)
                      ...+++.+.+|.+.||++ +.+.+|
T Consensus        23 mdLiA~~t~~g~v~v~Rl-~~qriw   46 (47)
T PF12894_consen   23 MDLIALGTEDGEVLVYRL-NWQRIW   46 (47)
T ss_pred             CCEEEEEECCCeEEEEEC-CCcCcc
Confidence            348899999999999999 777666


No 69 
>KOG1898 consensus Splicing factor 3b, subunit 3 [RNA processing and modification]
Probab=27.67  E-value=1.6e+03  Score=29.85  Aligned_cols=58  Identities=10%  Similarity=0.033  Sum_probs=40.7

Q ss_pred             cEEEEEECCCeEEEEEEeCCCCcEEEEEeeeecCcchhcccCCcccccCCCeEEECCCCCEEEEEEecCe
Q 001853          131 DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQ  200 (1004)
Q Consensus       131 D~Llv~~~~aklsile~d~~~~~l~TvSlh~~E~~~~~~~k~g~~~~~~~~~l~VDP~~Rca~l~~~~~~  200 (1004)
                      =..++-|+.+.++-++-+++.....-+++.||..            .++...|.|+-.|-..++.-+.++
T Consensus       299 ff~llqt~~GD~fk~tl~~d~d~v~el~lkYfDt------------vp~a~~L~I~k~GfLf~~sE~~n~  356 (1205)
T KOG1898|consen  299 FFFLLQTEYGDLFKLTLEHDGDNVVELRLKYFDT------------VPCALQLCILKTGFLFVASEFGNH  356 (1205)
T ss_pred             eEEEEEecCCceEEEEEecCCCcceeeeeehhcC------------CccceEEEEeccceEEEhhhccCc
Confidence            3566778888888888887777666677788733            233456788877877777766544


No 70 
>KOG0288 consensus WD40 repeat protein TipD [General function prediction only]
Probab=27.37  E-value=6.6e+02  Score=29.46  Aligned_cols=27  Identities=15%  Similarity=0.096  Sum_probs=21.9

Q ss_pred             cEEEEEEecCCeEEEEEcCCCeEEEEe
Q 001853          751 DIYSVVCYESGALEIFDVPNFNCVFTV  777 (1004)
Q Consensus       751 ~~~l~~~~~~g~l~I~sLp~~~~v~~~  777 (1004)
                      +-|++....||+++||++-.-++.+..
T Consensus       399 ~~YvaAGS~dgsv~iW~v~tgKlE~~l  425 (459)
T KOG0288|consen  399 GSYVAAGSADGSVYIWSVFTGKLEKVL  425 (459)
T ss_pred             CceeeeccCCCcEEEEEccCceEEEEe
Confidence            457788888999999999887776653


No 71 
>KOG0641 consensus WD40 repeat protein [General function prediction only]
Probab=26.34  E-value=8.4e+02  Score=26.25  Aligned_cols=19  Identities=26%  Similarity=0.653  Sum_probs=13.6

Q ss_pred             EEEee-eeeeEeEEEecCCC
Q 001853          106 HYRLH-GNVESLAILSQGGA  124 (1004)
Q Consensus       106 e~~l~-G~I~~l~~vr~~~s  124 (1004)
                      |+.++ |+|.+|+-+.-+.+
T Consensus       131 e~nmhdgtirdl~fld~~~s  150 (350)
T KOG0641|consen  131 EFNMHDGTIRDLAFLDDPES  150 (350)
T ss_pred             eeeecCCceeeeEEecCCCc
Confidence            45555 99999988765554


No 72 
>KOG0639 consensus Transducin-like enhancer of split protein (contains WD40 repeats) [Chromatin structure and dynamics]
Probab=25.41  E-value=5.9e+02  Score=30.56  Aligned_cols=36  Identities=17%  Similarity=0.330  Sum_probs=29.4

Q ss_pred             cEEEEEEecCCeEEEEEcCCCeEEEEecCcCccccc
Q 001853          751 DIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTH  786 (1004)
Q Consensus       751 ~~~l~~~~~~g~l~I~sLp~~~~v~~~~~~~~~~~~  786 (1004)
                      ...||-|..+|.+.||.|.+..+|-+..+-..+-..
T Consensus       521 akvcFsccsdGnI~vwDLhnq~~VrqfqGhtDGasc  556 (705)
T KOG0639|consen  521 AKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASC  556 (705)
T ss_pred             cceeeeeccCCcEEEEEcccceeeecccCCCCCcee
Confidence            458999999999999999999988877666555443


No 73 
>KOG0285 consensus Pleiotropic regulator 1 [RNA processing and modification]
Probab=24.99  E-value=2.4e+02  Score=32.38  Aligned_cols=61  Identities=20%  Similarity=0.384  Sum_probs=44.5

Q ss_pred             CcEEEEEEecCCeEEEEEcCCCeEEEEecCcCccccccccccccccccccchhccCCCccccCCCCcccccccceEEEEE
Q 001853          750 GDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAM  829 (1004)
Q Consensus       750 ~~~~l~~~~~~g~l~I~sLp~~~~v~~~~~~~~~~~~L~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~eill  829 (1004)
                      .+.|++....++++.||.|.+-++..++.+                                        ..+.++++.+
T Consensus       162 ~n~wf~tgs~DrtikIwDlatg~LkltltG----------------------------------------hi~~vr~vav  201 (460)
T KOG0285|consen  162 GNEWFATGSADRTIKIWDLATGQLKLTLTG----------------------------------------HIETVRGVAV  201 (460)
T ss_pred             CceeEEecCCCceeEEEEcccCeEEEeecc----------------------------------------hhheeeeeee
Confidence            356888888999999999988666554321                                        1124566666


Q ss_pred             eecCCCCCCcEEEEEeeCCcEEEEEE
Q 001853          830 QRWSAHHSRPFLFAILTDGTILCYQA  855 (1004)
Q Consensus       830 ~~lg~~~~~p~L~v~~~~g~l~iY~~  855 (1004)
                      ..-     .||||....|++|-+|..
T Consensus       202 S~r-----HpYlFs~gedk~VKCwDL  222 (460)
T KOG0285|consen  202 SKR-----HPYLFSAGEDKQVKCWDL  222 (460)
T ss_pred             ccc-----CceEEEecCCCeeEEEec
Confidence            532     499999999999999964


No 74 
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=24.58  E-value=1e+03  Score=26.55  Aligned_cols=99  Identities=13%  Similarity=0.180  Sum_probs=62.9

Q ss_pred             cCCEEEEEEEEecccccccccCCccccccccccccccccEEEEEEEEeeeeeeEeEEEecCCCCCCCCccEEEEEEC-CC
Q 001853           62 AANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFE-DA  140 (1004)
Q Consensus        62 k~n~LeIy~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~kL~lv~e~~l~G~I~~l~~vr~~~s~~~~~~D~Llv~~~-~a  140 (1004)
                      ..+.|.+|++..++                        +|+++...+..|....|+. .+       ..+.|+++.. +.
T Consensus        10 ~~~~I~~~~~~~~g------------------------~l~~~~~~~~~~~~~~l~~-sp-------d~~~lyv~~~~~~   57 (330)
T PRK11028         10 ESQQIHVWNLNHEG------------------------ALTLLQVVDVPGQVQPMVI-SP-------DKRHLYVGVRPEF   57 (330)
T ss_pred             CCCCEEEEEECCCC------------------------ceeeeeEEecCCCCccEEE-CC-------CCCEEEEEECCCC
Confidence            35678999985333                        5788877776666665532 21       4568887754 56


Q ss_pred             eEEEEEEeCCCCcEEEEEeeeecCcchhcccCCcccccCCCeEEECCCCCEEEEEEe-cCeEEEEEcc
Q 001853          141 KISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKAS  207 (1004)
Q Consensus       141 klsile~d~~~~~l~TvSlh~~E~~~~~~~k~g~~~~~~~~~l~VDP~~Rca~l~~~-~~~L~ilP~~  207 (1004)
                      .+.+++++ ++..+..+..+..          +    ..+..+..||+||.+.+.-+ .+.+.++.+.
T Consensus        58 ~i~~~~~~-~~g~l~~~~~~~~----------~----~~p~~i~~~~~g~~l~v~~~~~~~v~v~~~~  110 (330)
T PRK11028         58 RVLSYRIA-DDGALTFAAESPL----------P----GSPTHISTDHQGRFLFSASYNANCVSVSPLD  110 (330)
T ss_pred             cEEEEEEC-CCCceEEeeeecC----------C----CCceEEEECCCCCEEEEEEcCCCeEEEEEEC
Confidence            66666665 3455543321111          1    12347999999998887776 7888888774


No 75 
>PF10282 Lactonase:  Lactonase, 7-bladed beta-propeller;  InterPro: IPR019405  6-phosphogluconolactonases (6PGL) 3.1.1.31 from EC, which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate is one of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE IPR022528 from INTERPRO [] and the Pseudomonas aeruginosa DevB IPR005900 from INTERPRO [] types.  This entry contains bacterial 6-phosphogluconolactonases (6PGL) YbhE-type 3.1.1.31 from EC which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase 5.5.1.5 from EC and muconate cycloisomerase 5.5.1.1 from EC, which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold [].; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H.
Probab=24.06  E-value=1.1e+03  Score=26.78  Aligned_cols=96  Identities=17%  Similarity=0.136  Sum_probs=56.4

Q ss_pred             cEEEEEEEEeeeee-eEeEEEecCCCCCCCCccEEEEEE-CCCeEEEEEEeCCCCcEEEE-EeeeecCcchhcccCCccc
Q 001853          100 SLELVCHYRLHGNV-ESLAILSQGGADNSRRRDSIILAF-EDAKISVLEFDDSIHGLRIT-SMHCFESPEWLHLKRGRES  176 (1004)
Q Consensus       100 kL~lv~e~~l~G~I-~~l~~vr~~~s~~~~~~D~Llv~~-~~aklsile~d~~~~~l~Tv-Slh~~E~~~~~~~k~g~~~  176 (1004)
                      +|+++.+.+..|.- ..++ +-+       ...+|+++- ..+.++++..+.. +.+... .+..++...    ....|+
T Consensus        75 ~L~~~~~~~~~g~~p~~i~-~~~-------~g~~l~vany~~g~v~v~~l~~~-g~l~~~~~~~~~~g~g----~~~~rq  141 (345)
T PF10282_consen   75 TLTLLNSVPSGGSSPCHIA-VDP-------DGRFLYVANYGGGSVSVFPLDDD-GSLGEVVQTVRHEGSG----PNPDRQ  141 (345)
T ss_dssp             EEEEEEEEEESSSCEEEEE-ECT-------TSSEEEEEETTTTEEEEEEECTT-SEEEEEEEEEESEEEE----SSTTTT
T ss_pred             eeEEeeeeccCCCCcEEEE-Eec-------CCCEEEEEEccCCeEEEEEccCC-cccceeeeecccCCCC----Cccccc
Confidence            68888888866553 2222 221       456777775 6899999999887 544443 233333221    111123


Q ss_pred             ccCC-CeEEECCCCCEEEEEEe-cCeEEEEEccc
Q 001853          177 FARG-PLVKVDPQGRCGGVLVY-GLQMIILKASQ  208 (1004)
Q Consensus       177 ~~~~-~~l~VDP~~Rca~l~~~-~~~L~ilP~~~  208 (1004)
                      ..+. ..+..+|+||.+.+.-. .+.+.++-+..
T Consensus       142 ~~~h~H~v~~~pdg~~v~v~dlG~D~v~~~~~~~  175 (345)
T PF10282_consen  142 EGPHPHQVVFSPDGRFVYVPDLGADRVYVYDIDD  175 (345)
T ss_dssp             SSTCEEEEEE-TTSSEEEEEETTTTEEEEEEE-T
T ss_pred             ccccceeEEECCCCCEEEEEecCCCEEEEEEEeC
Confidence            2223 35789999998877555 77777766643


No 76 
>KOG0288 consensus WD40 repeat protein TipD [General function prediction only]
Probab=24.04  E-value=4.7e+02  Score=30.61  Aligned_cols=85  Identities=18%  Similarity=0.169  Sum_probs=57.8

Q ss_pred             EEEEEEEeeeeeeEeEEEecCCCCCCCCccEEEEEECCCeEEEEEEeCCCCcEEEEEeeeecCcchhcccCCcccccCCC
Q 001853          102 ELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGP  181 (1004)
Q Consensus       102 ~lv~e~~l~G~I~~l~~vr~~~s~~~~~~D~Llv~~~~aklsile~d~~~~~l~TvSlh~~E~~~~~~~k~g~~~~~~~~  181 (1004)
                      .++.+.++.|.|+++....        ++-.|+.++++-.+.++  |-.+.++.    |.|--       .|++-..--.
T Consensus       333 ~~~~sv~~gg~vtSl~ls~--------~g~~lLsssRDdtl~vi--DlRt~eI~----~~~sA-------~g~k~asDwt  391 (459)
T KOG0288|consen  333 DKTRSVPLGGRVTSLDLSM--------DGLELLSSSRDDTLKVI--DLRTKEIR----QTFSA-------EGFKCASDWT  391 (459)
T ss_pred             ceeeEeecCcceeeEeecc--------CCeEEeeecCCCceeee--ecccccEE----EEeec-------cccccccccc
Confidence            3577899999999987644        45577788888888876  33333333    66633       3443333346


Q ss_pred             eEEECCCCCEEEEEEecCeEEEEEcc
Q 001853          182 LVKVDPQGRCGGVLVYGLQMIILKAS  207 (1004)
Q Consensus       182 ~l~VDP~~Rca~l~~~~~~L~ilP~~  207 (1004)
                      .+..-|++++++-.-.++.+.|.-..
T Consensus       392 rvvfSpd~~YvaAGS~dgsv~iW~v~  417 (459)
T KOG0288|consen  392 RVVFSPDGSYVAAGSADGSVYIWSVF  417 (459)
T ss_pred             eeEECCCCceeeeccCCCcEEEEEcc
Confidence            67888999999877778887776543


No 77 
>KOG4649 consensus PQQ (pyrrolo-quinoline quinone) repeat protein [Secondary metabolites biosynthesis, transport and catabolism]
Probab=23.97  E-value=6e+02  Score=28.13  Aligned_cols=73  Identities=18%  Similarity=0.249  Sum_probs=47.4

Q ss_pred             EEEecceeEEEeeCCEEEEEeCCCCEEEEEEEECCceEeEEEEEecCCCcccceEEEEcCCeEEEEeeeCCeeEEEEee
Q 001853          361 SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC  439 (1004)
Q Consensus       361 ~i~l~~~~~~~l~~~~~Ll~~~~G~L~~L~l~~dgr~V~~l~l~~~g~~~~~S~l~~l~~g~lFvGS~~GDS~Ll~~~~  439 (1004)
                      ..+.|++.. . =.++++|+-.+|-||.|.+.. |.....+  ...+ .+-.+..+-.+.|+++.||+.|+-+.+.+..
T Consensus        52 g~RiE~sa~-v-vgdfVV~GCy~g~lYfl~~~t-Gs~~w~f--~~~~-~vk~~a~~d~~~glIycgshd~~~yalD~~~  124 (354)
T KOG4649|consen   52 GVRIECSAI-V-VGDFVVLGCYSGGLYFLCVKT-GSQIWNF--VILE-TVKVRAQCDFDGGLIYCGSHDGNFYALDPKT  124 (354)
T ss_pred             CceeeeeeE-E-ECCEEEEEEccCcEEEEEecc-hhheeee--eehh-hhccceEEcCCCceEEEecCCCcEEEecccc
Confidence            345566533 2 345699999999999999865 3323222  2222 2334455667899999999999877665543


No 78 
>KOG0283 consensus WD40 repeat-containing protein [Function unknown]
Probab=23.46  E-value=4.6e+02  Score=33.02  Aligned_cols=75  Identities=20%  Similarity=0.259  Sum_probs=54.6

Q ss_pred             cEEEEEEc---CCEEEEEEeCCeEEEEEecCCCceEeeecccccccCCCceeEEEEeecCCCCcceecccccccccCccc
Q 001853          660 TVLSVSIA---DPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVG  736 (1004)
Q Consensus       660 ~I~~As~~---dpyvll~~~~g~I~~l~~d~~~~~l~~~~~~~~~~~~~~i~~~~l~~d~~g~~~f~~~~~~~~~~~~~~  736 (1004)
                      .++++.++   |.|.+=..=||.|.++.+..... ..-   ..+   ...|+|+|+..|  |                  
T Consensus       411 fVTcVaFnPvDDryFiSGSLD~KvRiWsI~d~~V-v~W---~Dl---~~lITAvcy~Pd--G------------------  463 (712)
T KOG0283|consen  411 FVTCVAFNPVDDRYFISGSLDGKVRLWSISDKKV-VDW---NDL---RDLITAVCYSPD--G------------------  463 (712)
T ss_pred             eeEEEEecccCCCcEeecccccceEEeecCcCee-Eee---hhh---hhhheeEEeccC--C------------------
Confidence            48888886   89999988999999999876421 111   111   345899998544  3                  


Q ss_pred             cccCCCCCCCCCCCcEEEEEEecCCeEEEEEcCCCeEEEE
Q 001853          737 EAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFT  776 (1004)
Q Consensus       737 ~~~~~~~~~~~~~~~~~l~~~~~~g~l~I~sLp~~~~v~~  776 (1004)
                                     .+.++.+=+|...+|.-.+++++-.
T Consensus       464 ---------------k~avIGt~~G~C~fY~t~~lk~~~~  488 (712)
T KOG0283|consen  464 ---------------KGAVIGTFNGYCRFYDTEGLKLVSD  488 (712)
T ss_pred             ---------------ceEEEEEeccEEEEEEccCCeEEEe
Confidence                           3677788899999999888886554


No 79 
>PTZ00420 coronin; Provisional
Probab=23.41  E-value=1.5e+03  Score=28.10  Aligned_cols=84  Identities=17%  Similarity=0.211  Sum_probs=48.1

Q ss_pred             cEEEEEEc---CCEEEEEEeCCeEEEEEecCCCceEe-eecc-cccccCCCceeEEEEeecCCCCcceecccccccccCc
Q 001853          660 TVLSVSIA---DPYVLLGMSDGSIRLLVGDPSTCTVS-VQTP-AAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTG  734 (1004)
Q Consensus       660 ~I~~As~~---dpyvll~~~~g~I~~l~~d~~~~~l~-~~~~-~~~~~~~~~i~~~~l~~d~~g~~~f~~~~~~~~~~~~  734 (1004)
                      .|.+++++   +.+++-+..||+|.+|.+...+.... +..+ ..+......|.+++..                     
T Consensus        76 ~V~~lafsP~~~~lLASgS~DgtIrIWDi~t~~~~~~~i~~p~~~L~gH~~~V~sVaf~---------------------  134 (568)
T PTZ00420         76 SILDLQFNPCFSEILASGSEDLTIRVWEIPHNDESVKEIKDPQCILKGHKKKISIIDWN---------------------  134 (568)
T ss_pred             CEEEEEEcCCCCCEEEEEeCCCeEEEEECCCCCccccccccceEEeecCCCcEEEEEEC---------------------
Confidence            37777775   35666777899999999865432111 0000 0111112334433321                     


Q ss_pred             cccccCCCCCCCCCCCcEEEEEEecCCeEEEEEcCCCeEEEEe
Q 001853          735 VGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTV  777 (1004)
Q Consensus       735 ~~~~~~~~~~~~~~~~~~~l~~~~~~g~l~I~sLp~~~~v~~~  777 (1004)
                                   +....+++.+..+|.+.||.+...+.++..
T Consensus       135 -------------P~g~~iLaSgS~DgtIrIWDl~tg~~~~~i  164 (568)
T PTZ00420        135 -------------PMNYYIMCSSGFDSFVNIWDIENEKRAFQI  164 (568)
T ss_pred             -------------CCCCeEEEEEeCCCeEEEEECCCCcEEEEE
Confidence                         112345667778999999999887766654


No 80 
>PTZ00420 coronin; Provisional
Probab=22.86  E-value=1.5e+03  Score=28.02  Aligned_cols=50  Identities=14%  Similarity=0.048  Sum_probs=30.9

Q ss_pred             cccccEEEeccCCCceEEEecCCCCCCCCcEEEEecCC-cEEEEECCCCCcc
Q 001853          939 FRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQG-ILKICQLPSGSTY  989 (1004)
Q Consensus       939 ~~~~l~~~~~~~~~~v~~f~~F~~~~~~~gfiy~~~~~-~lri~~lp~~~~~  989 (1004)
                      ..+.+|++-+.. +.+..++.|+....-+|+.++-..+ .++-|++-.-+..
T Consensus       283 GD~tIr~~e~~~-~~~~~l~~~~s~~p~~g~~f~Pkr~~dv~~cEi~R~~kl  333 (568)
T PTZ00420        283 GDGNCRYYQHSL-GSIRKVNEYKSCSPFRSFGFLPKQICDVYKCEIGRVYKN  333 (568)
T ss_pred             CCCeEEEEEccC-CcEEeecccccCCCccceEEccccccCchhhhHhHHhhh
Confidence            345566666643 3677777888777777887776654 3555555554443


No 81 
>KOG0263 consensus Transcription initiation factor TFIID, subunit TAF5 (also component of histone acetyltransferase SAGA) [Transcription]
Probab=22.62  E-value=8.7e+02  Score=30.61  Aligned_cols=113  Identities=16%  Similarity=0.202  Sum_probs=70.9

Q ss_pred             eEEEEEeCCCcEEEEEecCcEEEEeCC--cceeEEeCCCCCCCCCCC-----CCC--ccEEEEEEcCCEEEEEEeCCeEE
Q 001853          611 TIAAGNLFGRRRVIQVFERGARILDGS--YMTQDLSFGPSNSESGSG-----SEN--STVLSVSIADPYVLLGMSDGSIR  681 (1004)
Q Consensus       611 TI~ag~l~~~~~IvQVt~~~vrl~~~~--~~~q~~~~~~~~~e~g~~-----~~~--~~I~~As~~dpyvll~~~~g~I~  681 (1004)
                      -|+||.+.+- ..|+++||+--+..+.  .-++.|.+.     .|..     +..  ...++-|.|+-|++.+-++|.|.
T Consensus       529 RifaghlsDV-~cv~FHPNs~Y~aTGSsD~tVRlWDv~-----~G~~VRiF~GH~~~V~al~~Sp~Gr~LaSg~ed~~I~  602 (707)
T KOG0263|consen  529 RIFAGHLSDV-DCVSFHPNSNYVATGSSDRTVRLWDVS-----TGNSVRIFTGHKGPVTALAFSPCGRYLASGDEDGLIK  602 (707)
T ss_pred             hhhccccccc-ceEEECCcccccccCCCCceEEEEEcC-----CCcEEEEecCCCCceEEEEEcCCCceEeecccCCcEE
Confidence            4888888876 6888888887776653  224555432     2210     111  23444555799999999999999


Q ss_pred             EEEecCCCceEeeecccccccCCCceeEEEEeecCCCCcceecccccccccCccccccCCCCCCCCCCCcEEEEEEecCC
Q 001853          682 LLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESG  761 (1004)
Q Consensus       682 ~l~~d~~~~~l~~~~~~~~~~~~~~i~~~~l~~d~~g~~~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~g  761 (1004)
                      +|.+.......++...      ...|.++++-.|                                   +..|++...+.
T Consensus       603 iWDl~~~~~v~~l~~H------t~ti~SlsFS~d-----------------------------------g~vLasgg~Dn  641 (707)
T KOG0263|consen  603 IWDLANGSLVKQLKGH------TGTIYSLSFSRD-----------------------------------GNVLASGGADN  641 (707)
T ss_pred             EEEcCCCcchhhhhcc------cCceeEEEEecC-----------------------------------CCEEEecCCCC
Confidence            9999875432222111      333555544211                                   34678888999


Q ss_pred             eEEEEEcCC
Q 001853          762 ALEIFDVPN  770 (1004)
Q Consensus       762 ~l~I~sLp~  770 (1004)
                      ++.+|++-.
T Consensus       642 sV~lWD~~~  650 (707)
T KOG0263|consen  642 SVRLWDLTK  650 (707)
T ss_pred             eEEEEEchh
Confidence            999997643


No 82 
>KOG0278 consensus Serine/threonine kinase receptor-associated protein [Lipid transport and metabolism]
Probab=21.98  E-value=1.9e+02  Score=31.64  Aligned_cols=69  Identities=20%  Similarity=0.360  Sum_probs=0.0

Q ss_pred             eEEEecCCCCeEE----EEcccccEEEeccCCCceEEEecCCCCCCCCcEEEE--ecCCcEEEEECCCCCccccCccceE
Q 001853          924 QGFFLSGSRPCWC----MVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYV--TSQGILKICQLPSGSTYDNYWPVQK  997 (1004)
Q Consensus       924 sgVFv~G~~P~~i----~~~~~~l~~~~~~~~~~v~~f~~F~~~~~~~gfiy~--~~~~~lri~~lp~~~~~d~~wp~rk  997 (1004)
                      .++||||..=.|.    |-.-..+-.|.-.-.|||+|..     =-|+|-+|+  .++|++||=|.-+.-.|. -|-.+|
T Consensus       236 k~~fVaGged~~~~kfDy~TgeEi~~~nkgh~gpVhcVr-----FSPdGE~yAsGSEDGTirlWQt~~~~~~~-~~~~~~  309 (334)
T KOG0278|consen  236 KEFFVAGGEDFKVYKFDYNTGEEIGSYNKGHFGPVHCVR-----FSPDGELYASGSEDGTIRLWQTTPGKTYG-LWKCVK  309 (334)
T ss_pred             CceEEecCcceEEEEEeccCCceeeecccCCCCceEEEE-----ECCCCceeeccCCCceEEEEEecCCCchh-hccccC


Q ss_pred             E
Q 001853          998 V  998 (1004)
Q Consensus       998 v  998 (1004)
                      +
T Consensus       310 ~  310 (334)
T KOG0278|consen  310 P  310 (334)
T ss_pred             h


No 83 
>PF02239 Cytochrom_D1:  Cytochrome D1 heme domain; PDB: 1NNO_B 1HZU_A 1N15_B 1N50_A 1GJQ_A 1BL9_B 1NIR_B 1N90_B 1HZV_A 1AOQ_A ....
Probab=21.72  E-value=8.3e+02  Score=28.33  Aligned_cols=83  Identities=20%  Similarity=0.287  Sum_probs=45.2

Q ss_pred             cEEEEEEEEeeeee--------eEeEEEecCCCCCCCCccEEEE-EECCCeEEEEEEeCCCCcEEEEEeeeecCcchhcc
Q 001853          100 SLELVCHYRLHGNV--------ESLAILSQGGADNSRRRDSIIL-AFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHL  170 (1004)
Q Consensus       100 kL~lv~e~~l~G~I--------~~l~~vr~~~s~~~~~~D~Llv-~~~~aklsile~d~~~~~l~TvSlh~~E~~~~~~~  170 (1004)
                      .|+++...+.-+..        .+|-.-  +      .++..++ ..+.+++.++.|... ..+.+..++-         
T Consensus       109 tle~v~~I~~~~~~~~~~~~Rv~aIv~s--~------~~~~fVv~lkd~~~I~vVdy~d~-~~~~~~~i~~---------  170 (369)
T PF02239_consen  109 TLEPVKTIPTGGMPVDGPESRVAAIVAS--P------GRPEFVVNLKDTGEIWVVDYSDP-KNLKVTTIKV---------  170 (369)
T ss_dssp             T--EEEEEE--EE-TTTS---EEEEEE---S------SSSEEEEEETTTTEEEEEETTTS-SCEEEEEEE----------
T ss_pred             cccceeecccccccccccCCCceeEEec--C------CCCEEEEEEccCCeEEEEEeccc-cccceeeecc---------
Confidence            58888888876543        333221  1      3444444 455699999987765 3343332221         


Q ss_pred             cCCcccccCCCeEEECCCCCEEEEEEe-cCeEEEEEc
Q 001853          171 KRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILKA  206 (1004)
Q Consensus       171 k~g~~~~~~~~~l~VDP~~Rca~l~~~-~~~L~ilP~  206 (1004)
                        |.    ...-...||.+|+.++.+. .++++++-.
T Consensus       171 --g~----~~~D~~~dpdgry~~va~~~sn~i~viD~  201 (369)
T PF02239_consen  171 --GR----FPHDGGFDPDGRYFLVAANGSNKIAVIDT  201 (369)
T ss_dssp             ---T----TEEEEEE-TTSSEEEEEEGGGTEEEEEET
T ss_pred             --cc----cccccccCcccceeeecccccceeEEEee
Confidence              11    1123688999999888777 788888754


No 84 
>PF11715 Nup160:  Nucleoporin Nup120/160;  InterPro: IPR021717  Nup120 is conserved from fungi to plants to humans, and is homologous with the Nup160 of vertebrates. The nuclear pore complex, or NPC, mediates macromolecular transport across the nuclear envelope. Deletion of the NUP120 gene causes clustering of NPCs at one side of the nuclear envelope, moderate nucleolar fragmentation and slower cell growth. The vertebrate NPC is estimated to contain between 30 and 60 different proteins, most of which are not known. Two important ones in creating the nucleoporin basket are Nup98 and Nup153, and Nup120, in conjunction with Nup133, interacts with these two and itself plays a role in mRNA export. Nup160, Nup133, Nup96, and Nup107 are all targets of phosphorylation. The phosphorylation sites are clustered mainly at the N-terminal regions of these proteins, which are predicted to be natively disordered. The entire Nup107-160 subcomplex is stable throughout the cell cycle, thus it seems unlikely that phosphorylation affects interactions within the Nup107-160 subcomplex, but rather that it regulates the association of the subcomplex with the NPC and other proteins; PDB: 3F7F_D 3H7N_D 3HXR_A.
Probab=21.69  E-value=4.6e+02  Score=31.96  Aligned_cols=30  Identities=20%  Similarity=0.389  Sum_probs=25.4

Q ss_pred             cEEEEEEecCCeEEEEEcCCCeEEEEecCc
Q 001853          751 DIYSVVCYESGALEIFDVPNFNCVFTVDKF  780 (1004)
Q Consensus       751 ~~~l~~~~~~g~l~I~sLp~~~~v~~~~~~  780 (1004)
                      ..++|.++.|+.|+||+|.+.+++++.+-+
T Consensus       230 ~~~l~tl~~D~~LRiW~l~t~~~~~~~~~~  259 (547)
T PF11715_consen  230 DTFLFTLSRDHTLRIWSLETGQCLATIDLL  259 (547)
T ss_dssp             TTEEEEEETTSEEEEEETTTTCEEEEEETT
T ss_pred             CCEEEEEeCCCeEEEEECCCCeEEEEeccc
Confidence            347889999999999999999998886544


No 85 
>KOG3881 consensus Uncharacterized conserved protein [Function unknown]
Probab=20.26  E-value=1.1e+03  Score=27.64  Aligned_cols=29  Identities=14%  Similarity=0.315  Sum_probs=24.9

Q ss_pred             ccEEEEEEcCCEEEEEEeCCeEEEEEecC
Q 001853          659 STVLSVSIADPYVLLGMSDGSIRLLVGDP  687 (1004)
Q Consensus       659 ~~I~~As~~dpyvll~~~~g~I~~l~~d~  687 (1004)
                      ..|..-.-.|..++.+.++|.+.++....
T Consensus       106 ~~I~gl~~~dg~Litc~~sG~l~~~~~k~  134 (412)
T KOG3881|consen  106 KSIKGLKLADGTLITCVSSGNLQVRHDKS  134 (412)
T ss_pred             ccccchhhcCCEEEEEecCCcEEEEeccC
Confidence            45888888899999999999999998764


Done!