Query         001003
Match_columns 1192
No_of_seqs    196 out of 586
Neff          7.4 
Searched_HMMs 46136
Date          Thu Mar 28 13:05:23 2013
Command       hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/001003.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/001003hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG1896 mRNA cleavage and poly 100.0  1E-169  2E-174 1511.9  89.6 1038    3-1180    2-1123(1366)
  2 KOG1897 Damage-specific DNA bi 100.0  2E-117  5E-122 1047.3  81.3  841    2-1188    1-863 (1096)
  3 KOG1898 Splicing factor 3b, su 100.0 5.9E-95 1.3E-99  859.6  63.0  890    3-1172    2-954 (1205)
  4 COG5161 SFT1 Pre-mRNA cleavage 100.0   9E-94 1.9E-98  827.3  60.5  974    1-1173    1-1061(1319)
  5 PF10433 MMS1_N:  Mono-function 100.0 7.3E-55 1.6E-59  532.3  44.1  451  131-712     1-501 (504)
  6 PF03178 CPSF_A:  CPSF A subuni  99.3 2.1E-12 4.6E-17  149.5  10.6  114 1063-1184    1-118 (321)
  7 KOG2055 WD40 repeat protein [G  95.7    0.76 1.6E-05   53.8  19.1  223  837-1173  224-456 (514)
  8 cd00200 WD40 WD40 domain, foun  95.3     5.4 0.00012   43.1  32.4   31  659-689    10-42  (289)
  9 KOG1539 WD repeat protein [Gen  95.0     8.6 0.00019   48.5  25.9   81  661-781   205-287 (910)
 10 PF08596 Lgl_C:  Lethal giant l  94.8     4.2 9.1E-05   48.7  22.8   75  753-857    99-174 (395)
 11 KOG1274 WD40 repeat protein [G  94.6     4.6 9.9E-05   51.3  22.7  122  950-1137  137-263 (933)
 12 KOG2055 WD40 repeat protein [G  93.8     4.1 8.9E-05   48.0  18.9   44  975-1021  458-501 (514)
 13 PF14727 PHTB1_N:  PTHB1 N-term  93.4     4.8  0.0001   48.3  19.3   95  921-1017  251-354 (418)
 14 cd00200 WD40 WD40 domain, foun  92.8      16 0.00036   39.3  31.4   47  967-1020  147-195 (289)
 15 PF14727 PHTB1_N:  PTHB1 N-term  91.7      30 0.00064   41.7  23.1   75   57-149    90-164 (418)
 16 PF14783 BBS2_Mid:  Ciliary BBS  90.8     5.8 0.00013   38.4  12.8   87  628-765    24-110 (111)
 17 KOG0294 WD40 repeat-containing  89.8      41 0.00089   38.3  21.4   65 1108-1183  218-283 (362)
 18 KOG0318 WD40 repeat stress pro  87.5      72  0.0016   38.7  21.0  145  577-773   376-521 (603)
 19 KOG0294 WD40 repeat-containing  80.7     5.9 0.00013   44.7   8.1   83  657-778    42-124 (362)
 20 KOG0283 WD40 repeat-containing  79.6      50  0.0011   41.9  16.4  206  659-997   370-577 (712)
 21 KOG0318 WD40 repeat stress pro  78.6 1.7E+02  0.0036   35.8  30.8   60 1108-1178  454-514 (603)
 22 KOG0650 WD40 repeat nucleolar   78.0      87  0.0019   38.6  17.0   27  752-778   413-439 (733)
 23 COG5161 SFT1 Pre-mRNA cleavage  77.8     1.1 2.4E-05   56.0   1.5   91   99-206    87-177 (1319)
 24 PF10282 Lactonase:  Lactonase,  75.9 1.7E+02  0.0036   34.3  27.4   52  968-1021  156-210 (345)
 25 KOG2110 Uncharacterized conser  72.5   2E+02  0.0043   33.7  17.6  154  611-857    89-249 (391)
 26 PTZ00420 coronin; Provisional   71.5 1.2E+02  0.0026   38.2  17.1  148  974-1183  146-295 (568)
 27 KOG2096 WD40 repeat protein [G  69.4 1.1E+02  0.0023   35.1  14.0   53  968-1022  100-152 (420)
 28 KOG0310 Conserved WD40 repeat-  68.4      94   0.002   37.4  14.1  102  628-778   175-277 (487)
 29 PF03178 CPSF_A:  CPSF A subuni  67.7 2.4E+02  0.0051   32.6  21.0  144  575-770    41-203 (321)
 30 KOG0316 Conserved WD40 repeat-  67.2 1.5E+02  0.0033   32.6  14.1   25  753-777   241-265 (307)
 31 COG4247 Phy 3-phytase (myo-ino  66.5      77  0.0017   35.2  11.9  130  604-770    50-187 (364)
 32 COG2706 3-carboxymuconate cycl  66.1   1E+02  0.0023   35.7  13.5   72  130-208   202-275 (346)
 33 KOG0772 Uncharacterized conser  63.8      60  0.0013   39.2  11.3   69  670-773   283-351 (641)
 34 KOG0295 WD40 repeat-containing  63.4 1.2E+02  0.0026   35.3  13.2   66  668-774   304-369 (406)
 35 KOG0285 Pleiotropic regulator   61.9      94   0.002   36.0  12.0   60  751-855   163-222 (460)
 36 KOG1273 WD40 repeat protein [G  61.6 2.2E+02  0.0048   32.7  14.6   19  838-856    35-53  (405)
 37 KOG2048 WD40 repeat protein [G  61.4 4.3E+02  0.0093   33.4  30.6   60 1108-1178  486-545 (691)
 38 KOG0645 WD40 repeat protein [G  61.4 2.8E+02  0.0061   31.3  15.7  135  976-1179   37-178 (312)
 39 KOG0299 U3 snoRNP-associated p  59.3      88  0.0019   37.4  11.6  107  620-778   257-365 (479)
 40 KOG0289 mRNA splicing factor [  59.0 3.8E+02  0.0082   32.1  16.4   99  575-686   314-419 (506)
 41 KOG3881 Uncharacterized conser  57.5      22 0.00048   41.4   6.3   92 1064-1179  226-318 (412)
 42 PF08662 eIF2A:  Eukaryotic tra  57.3 2.4E+02  0.0052   30.2  14.1  121  994-1181   50-179 (194)
 43 KOG2110 Uncharacterized conser  56.4 3.6E+02  0.0078   31.7  15.5   26  832-857   306-331 (391)
 44 KOG1898 Splicing factor 3b, su  56.1 4.4E+02  0.0095   35.2  17.6   64  923-986   732-796 (1205)
 45 KOG1036 Mitotic spindle checkp  55.4 3.7E+02  0.0081   30.8  22.4   74  620-712    26-102 (323)
 46 PLN00181 protein SPA1-RELATED;  55.0 6.3E+02   0.014   33.3  35.2   28  659-686   484-513 (793)
 47 PRK11028 6-phosphogluconolacto  54.0 3.9E+02  0.0085   30.7  17.2   52  969-1021  139-193 (330)
 48 KOG1036 Mitotic spindle checkp  53.8   4E+02  0.0086   30.6  27.0  170  915-1142  137-322 (323)
 49 KOG1539 WD repeat protein [Gen  51.3 6.8E+02   0.015   32.6  42.5   77  659-773   451-527 (910)
 50 KOG1446 Histone H3 (Lys4) meth  50.3 4.5E+02  0.0097   30.2  24.0  162  939-1136   89-262 (311)
 51 KOG0291 WD40-repeat-containing  50.1 6.8E+02   0.015   32.3  43.7  263  605-1024  261-542 (893)
 52 KOG0276 Vesicle coat complex C  50.0 6.3E+02   0.014   31.8  20.9   45  969-1022  437-481 (794)
 53 KOG0296 Angio-associated migra  49.2 4.4E+02  0.0095   31.0  14.7  117  603-778   112-229 (399)
 54 PF10282 Lactonase:  Lactonase,  49.1 4.9E+02   0.011   30.4  29.6   52  966-1021  254-310 (345)
 55 KOG1274 WD40 repeat protein [G  48.4 7.7E+02   0.017   32.4  19.8   53  628-689   117-171 (933)
 56 KOG1273 WD40 repeat protein [G  48.1 4.9E+02   0.011   30.1  18.4   61 1108-1179  260-320 (405)
 57 KOG0650 WD40 repeat nucleolar   47.9 6.6E+02   0.014   31.5  20.0   92  924-1021  570-669 (733)
 58 KOG0263 Transcription initiati  47.5 1.3E+02  0.0029   38.1  11.1   66 1108-1186  588-654 (707)
 59 KOG0263 Transcription initiati  46.2 4.7E+02    0.01   33.5  15.5  110  611-770   529-650 (707)
 60 PF02333 Phytase:  Phytase;  In  45.5 2.9E+02  0.0064   32.9  13.3   61  678-769   127-189 (381)
 61 KOG0288 WD40 repeat protein Ti  45.4 2.1E+02  0.0046   33.9  11.6   93 1066-1178  365-458 (459)
 62 KOG0291 WD40-repeat-containing  44.5 8.2E+02   0.018   31.6  52.6  137  536-690   286-426 (893)
 63 KOG1897 Damage-specific DNA bi  42.0 2.4E+02  0.0052   37.2  12.3   64    5-119   764-838 (1096)
 64 KOG0277 Peroxisomal targeting   41.2      34 0.00075   37.8   4.4   67 1109-1180   21-90  (311)
 65 KOG2048 WD40 repeat protein [G  40.6 8.7E+02   0.019   30.8  19.6   28  179-206   477-504 (691)
 66 TIGR02276 beta_rpt_yvtn 40-res  40.5      64  0.0014   24.6   4.8   40  967-1011    3-42  (42)
 67 PF08596 Lgl_C:  Lethal giant l  39.4   1E+02  0.0022   37.0   8.5   28  751-778   272-299 (395)
 68 KOG2066 Vacuolar assembly/sort  38.5 1.4E+02   0.003   38.3   9.4   85  921-1017  135-220 (846)
 69 PTZ00421 coronin; Provisional   38.5 7.3E+02   0.016   30.8  16.0   31  658-688    75-108 (493)
 70 KOG0319 WD40-repeat-containing  38.1 9.9E+02   0.021   30.7  28.7  164  941-1180  225-394 (775)
 71 KOG0278 Serine/threonine kinas  37.7 5.3E+02   0.011   28.9  12.5  117  994-1178  177-294 (334)
 72 KOG0293 WD40 repeat-containing  37.7 7.9E+02   0.017   29.5  18.3  148  826-1024  226-376 (519)
 73 KOG0295 WD40 repeat-containing  36.7 5.2E+02   0.011   30.4  12.8   62  750-856   303-364 (406)
 74 KOG0293 WD40 repeat-containing  36.7 4.4E+02  0.0096   31.4  12.4  102  577-690   367-474 (519)
 75 KOG0283 WD40 repeat-containing  36.5   2E+02  0.0044   36.7  10.5   77  659-777   410-489 (712)
 76 COG2706 3-carboxymuconate cycl  36.4 7.7E+02   0.017   28.9  26.9   68  130-210    51-122 (346)
 77 PF12341 DUF3639:  Protein of u  36.1      74  0.0016   22.9   3.9   24  660-683     3-26  (27)
 78 KOG0647 mRNA export protein (c  35.5 7.4E+02   0.016   28.5  21.3   93  925-1023  169-272 (347)
 79 KOG0772 Uncharacterized conser  33.6 2.3E+02   0.005   34.6   9.8  161  574-780   225-406 (641)
 80 KOG4649 PQQ (pyrrolo-quinoline  32.9 4.5E+02  0.0098   29.6  11.1   73  361-439    52-124 (354)
 81 KOG4378 Nuclear protein COP1 [  32.9 6.2E+02   0.013   30.9  13.0   82  659-779   122-205 (673)
 82 PLN00181 protein SPA1-RELATED;  32.7 1.3E+03   0.028   30.4  27.2   73  658-771   575-650 (793)
 83 KOG2445 Nuclear pore complex c  31.8 3.5E+02  0.0076   31.1  10.3   23  751-773   126-149 (361)
 84 KOG0299 U3 snoRNP-associated p  30.9 6.7E+02   0.015   30.3  12.8   32  659-690   328-360 (479)
 85 KOG1407 WD40 repeat protein [F  30.9 4.5E+02  0.0097   29.6  10.7   40  977-1023  254-293 (313)
 86 KOG0289 mRNA splicing factor [  30.3   7E+02   0.015   30.0  12.7  110  662-855   309-418 (506)
 87 PTZ00420 coronin; Provisional   29.7 1.3E+03   0.027   29.4  29.3   29  659-687    75-106 (568)
 88 PRK11028 6-phosphogluconolacto  29.6   6E+02   0.013   29.1  12.8   84  101-207    25-110 (330)
 89 PF00780 CNH:  CNH domain;  Int  29.5 8.1E+02   0.018   27.1  24.1   22  663-685     2-23  (275)
 90 KOG0645 WD40 repeat protein [G  28.9 6.5E+02   0.014   28.5  11.6   91 1065-1180   38-134 (312)
 91 KOG0292 Vesicle coat complex C  28.0 1.5E+03   0.034   29.9  22.2   97  910-1021  281-384 (1202)
 92 PF12894 Apc4_WD40:  Anaphase-p  27.9      95  0.0021   25.3   3.9   40  101-148     2-41  (47)
 93 PF06977 SdiA-regulated:  SdiA-  27.5      96  0.0021   34.8   5.3   60  375-438   184-248 (248)
 94 KOG0319 WD40-repeat-containing  27.0 9.3E+02    0.02   31.0  13.7   74  661-770   368-443 (775)
 95 KOG0296 Angio-associated migra  26.6 4.8E+02    0.01   30.7  10.4   56  753-855   300-355 (399)
 96 KOG0266 WD40 repeat-containing  25.5 1.3E+03   0.028   28.1  17.0   77  659-774   289-369 (456)
 97 TIGR02658 TTQ_MADH_Hv methylam  25.4 1.2E+03   0.026   27.7  27.1  139  921-1096  147-305 (352)
 98 KOG0266 WD40 repeat-containing  25.3 1.3E+03   0.028   28.1  18.2   74  659-773   247-322 (456)
 99 KOG0285 Pleiotropic regulator   25.3 5.3E+02   0.012   30.3  10.4   68  631-713   259-328 (460)
100 KOG0275 Conserved WD40 repeat-  24.9 9.6E+02   0.021   27.7  12.1   53  660-717   265-319 (508)
101 KOG0279 G protein beta subunit  24.4 1.1E+03   0.024   26.9  16.5  119  659-856   193-313 (315)
102 KOG0276 Vesicle coat complex C  23.8 1.6E+03   0.034   28.6  17.8  102  629-778    77-180 (794)
103 PF02239 Cytochrom_D1:  Cytochr  22.9 4.6E+02    0.01   31.2  10.2   67  969-1085   50-117 (369)
104 KOG0282 mRNA splicing factor [  22.8   3E+02  0.0065   33.3   8.2   78  935-1023  242-321 (503)
105 PTZ00421 coronin; Provisional   22.4 1.5E+03   0.033   28.0  23.3   77  659-776   126-205 (493)
106 PF14781 BBS2_N:  Ciliary BBSom  22.3 2.1E+02  0.0046   28.9   5.9   44  103-152    40-83  (136)
107 PF06977 SdiA-regulated:  SdiA-  22.0 6.5E+02   0.014   28.2  10.5   92  925-1022   35-137 (248)
108 COG4257 Vgb Streptogramin lyas  22.0 8.5E+02   0.018   27.8  10.9   80  925-1019   39-120 (353)
109 KOG0288 WD40 repeat protein Ti  21.5 8.5E+02   0.018   29.2  11.3   24  753-776   401-424 (459)
110 KOG0306 WD40-repeat-containing  21.2 1.9E+03   0.041   28.5  24.7  113  310-439   327-443 (888)
111 KOG2096 WD40 repeat protein [G  21.1 1.3E+03   0.029   26.7  18.2   72  947-1021  273-350 (420)
112 PF14781 BBS2_N:  Ciliary BBSom  20.8 2.7E+02  0.0058   28.2   6.3   39 1089-1140   47-85  (136)
113 KOG0639 Transducin-like enhanc  20.8 6.5E+02   0.014   30.8  10.3   26  752-777   522-547 (705)
114 KOG2314 Translation initiation  20.4 7.1E+02   0.015   31.0  10.6  129  977-1173  426-558 (698)
115 KOG1517 Guanine nucleotide bin  20.2 1.9E+03   0.041   29.8  14.8  192  578-854  1179-1379(1387)
116 COG3204 Uncharacterized protei  20.1 5.3E+02   0.012   29.6   9.1   91  924-1021   98-199 (316)

No 1  
>KOG1896 consensus mRNA cleavage and polyadenylation factor II complex, subunit CFT1 (CPSF subunit) [RNA processing and modification]
Probab=100.00  E-value=1e-169  Score=1511.92  Aligned_cols=1038  Identities=39%  Similarity=0.642  Sum_probs=850.9

Q ss_pred             chhhhhccCCCceeeEEEEEeecCCCCCCCCCcccccccccccCCCCCCCCCCCeEEEEcCCeEEEEEEEEeccCCcccc
Q 001003            3 FAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTAANVIEIYVVRVQEEGSKESK   82 (1192)
Q Consensus         3 ~~~~~~~~~pT~V~hsv~~~Ft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~nLVvak~n~LeIy~v~~~~~g~~~~~   82 (1192)
                      |++|++.|+||+|+||++|+||+....                           ||||+++|.|+||+++.++.+.+.  
T Consensus         2 ~~vykq~h~~T~ve~s~ag~Ft~~~~~---------------------------nlvV~~~N~L~vyri~~~~e~~t~--   52 (1366)
T KOG1896|consen    2 FAVYKQEHDPTVVENSSAGLFTNNRTE---------------------------NLVVAGTNILRVYRISRDAEALTK--   52 (1366)
T ss_pred             cchhhhccCchhhccceeeeEecCCCc---------------------------ceEEecccEEEEEEeccchhhccc--
Confidence            689999999999999999999987765                           999999999999999865322100  


Q ss_pred             CCccccccccccccccccEEEEEEEEeeeEEeEEEEEecCCCCCCCCccEEEEEeCCCeEEEEEEeCCCCCEEEEeeeee
Q 001003           83 NSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCF  162 (1192)
Q Consensus        83 ~~~~~~~~~~~~~~~~~~L~lv~e~~l~G~I~~l~~~r~~~~~~~~~~D~Llv~~~~aklsil~~d~~~~~l~t~Slh~~  162 (1192)
                           .....+.+..+.||+|+++|.+||+|++|++++..|+    .+|+|+++|++||+|++|||+.+|+|+|.|||||
T Consensus        53 -----~~~~~~~~~~~~~LeLv~~~~l~GnV~si~~~~~~gs----~rD~LlL~f~~AKiSvlefD~~t~sl~TlSLHyf  123 (1366)
T KOG1896|consen   53 -----NDPGDMGKAHRKKLELVAEFKLFGNVTSIAKLPLKGS----NRDALLLLFKDAKISVLEFDPQTNSLRTLSLHYF  123 (1366)
T ss_pred             -----cCccccccccceEEEEEEEEEeecceeeEEEeecCCC----CcceEEEEeccceEEEEEecCCccceeeeeeEEe
Confidence                 1122344445567999999999999999999999987    6999999999999999999999999999999999


Q ss_pred             eccccccccCCcccccCCCeEEECCCCcEEEEEEcCceEEEEeCccCCCCCCCCCCCCCCCCCccceeeceEEEEccccC
Q 001003          163 ESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDLD  242 (1192)
Q Consensus       163 E~~~~~~~~~G~~~~~~~~~l~VDP~~Rc~~l~~y~~~L~ilP~~~~~~~l~~~~~~~~~~~~~~~~~~~s~~i~l~~ld  242 (1192)
                      |.++   .+.|++....+|.++|||++||++|++|+..|+||||++.+ .+++++.. ..+....+.+.+||+|.+.+||
T Consensus       124 E~~~---~~~~~~~~~~~p~vrvDPdsrCa~llvyg~~m~iLpf~~~e-~~~~~~~~-~~~~~~ss~~~pSyvi~~reLd  198 (1366)
T KOG1896|consen  124 EGPE---FRKGLVGRAKIPTVRVDPDSRCALLLVYGLRMAILPFRVNE-HLDDEELF-PSGFSKSSFTAPSYVIALRELD  198 (1366)
T ss_pred             cccc---ccccccccccCceEEECCCCCeEEEEEecceEEEeeccccc-cccccccc-cccccccccccceeEEEhhhhh
Confidence            9986   45566666778999999999999999999999999998863 24433322 2223334578999999999998


Q ss_pred             --ccceeeeeeccCCcccEEEEEEecCCCcccceeeeeeeeEEEEEEEeeccceeeeeeeeccCCcccceeEEecCCCCe
Q 001003          243 --MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPSPIGG  320 (1192)
Q Consensus       243 --i~~V~D~~FL~gy~~PtlaiL~e~~~tw~gr~~~~~dt~~l~~~sLdl~~k~~~~i~s~~~Lp~d~~~LipvP~p~GG  320 (1192)
                        |+||+|++|||||++||+||||||.+||+||+..|+|||.+++++||+++|.||+||++.+||+||+.+.+||.|+||
T Consensus       199 eki~niiD~qFLhgY~ePTl~ILyep~~tw~grv~~r~dt~~~vaisLni~q~~hpVI~sv~sLP~D~~~~~~vp~piGg  278 (1366)
T KOG1896|consen  199 EKIKNIIDFQFLHGYYEPTLAILYEPEQTWAGRVILRKDTCVLVAISLNITQKVHPVIWSVLSLPFDCYQATAVPTPIGG  278 (1366)
T ss_pred             hhhccceeEEeecCcccceEEEEecccccccceEEEecCcEEEEEEEcCccccccceEeeeccCChhhhhceeecccCcc
Confidence              889999999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             EEEEecceEEEEeCCC-ceeEeccccccccCCCccCcCCCceeEeeceeeEEeeCcEEEEEcCCCCEEEEEEEEC-CceE
Q 001003          321 VLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYD-GRVV  398 (1192)
Q Consensus       321 vLVig~n~I~y~d~~~-~~~~a~N~~~~~~~~~~~~~~~~~~l~l~~~~~~~~~~~~~Ll~~~~G~L~~l~l~~d-g~~V  398 (1192)
                      |||++.|.++|++|++ ++++++|++++..+.++.+||+.+++.|+++..+|++.+++++++.+|++|+|+|.+| +|.|
T Consensus       279 vLv~~~n~~iy~nqsv~~~gv~LNs~a~~~t~fpl~~qs~v~i~ld~a~~t~i~~dk~vis~~~Gd~y~Ltl~~D~~r~V  358 (1366)
T KOG1896|consen  279 VLVFTVNNLIYLNQSVSPYGVALNSYASKYTAFPLIPQSGVRIELDCANATWISNDKCVISLKNGDLYLLTLILDIGRSV  358 (1366)
T ss_pred             EEEEeeeeEEEEccCCCceeEEecchhhcccCCccccccceEEEEeeccceeecCCeEEEecCCCcEEEEEEEeccccch
Confidence            9999999999999998 5999999999999999999999999999999999999999999999999999999999 7999


Q ss_pred             eeEEEEecCCCccccceEEecCCeEEEEeeeCCeEEEEEeeCCCcccccCCCccccCCcccCCccchhccCCCcchhhcc
Q 001003          399 QRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDM  478 (1192)
Q Consensus       399 ~~l~l~~~~~~~~~s~l~~l~~g~lFvGS~~GDS~Ll~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~d~~  478 (1192)
                      +.+++.++..+++++|++-..+++||+||+.|||+|+||+++...+.  .+...|+.+.+....+.++.+...+..+||.
T Consensus       359 ~~~~f~k~~asvl~t~~v~~~n~llFlGSrlgnSlll~~s~~~~~~~--e~~~re~~d~~~~~~~~~~~d~~~d~~~~d~  436 (1366)
T KOG1896|consen  359 QLLHFDKFKASVLATSIVGHGNNLLFLGSRLGNSLLLRFSELLQRAS--EGVRREEGDTESDGYSKKRVDDTQDVRRDDE  436 (1366)
T ss_pred             hhhhhhhhhcccceeeeeccCCccEEEEecCCCEEEEEehhccccCC--ccccccccCCcCCcchhhcccchhhhhhhhh
Confidence            99999999999999999999999999999999999999998764221  2222222222222223333221111111111


Q ss_pred             cCcccc------ccccCCCCCccccccceeEEEEeeecccCCccccccccccCCCC---------------CccCCCCCC
Q 001003          479 VNGEEL------SLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADA---------------SATGISKQS  537 (1192)
Q Consensus       479 ~~~~e~------~ly~~~~~~~~~~~~~~~~~v~Dsl~NigPI~D~~vg~~~~~~~---------------~~sG~g~~g  537 (1192)
                       .-++.      .-||+++..+.   ..+.|++||+|+|||||.||++|...+.+.               .|+|.|++|
T Consensus       437 -~~~~~~~~g~~~~~g~~a~~t~---~~f~fevcDsL~NIGPi~~~avG~~~~~~~~~~gl~~~~~~~elV~~sGhgkng  512 (1366)
T KOG1896|consen  437 -KSAELFEAGSEENYGSGAQETV---QPFSFEVCDSLPNIGPITDFAVGKRSSASEAVEGLSPHNKCLELVATSGHGKNG  512 (1366)
T ss_pred             -hccchhhccccccCCcccceee---eeeEEeehhccccccccccceeccccchhhhccCCCCCCCeEEEEEeccCCCCc
Confidence             00111      22333322211   238899999999999999999998654221               278999999


Q ss_pred             ceEE------------EeCCCcCEEEEEEecCCCCCCCCcccccccCCCcceEEEEEeccceEEEEecCceeeeecccCc
Q 001003          538 NYEL------------VELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDY  605 (1192)
Q Consensus       538 sL~i------------~eLpg~~~iWtv~~~~~~~~~~~~~~~~~~~~~~~~yLvlS~~~~T~Vl~~g~~~eEv~~~~gF  605 (1192)
                      .|.+            ++||||.++|||..+....+         .++.-|.||++|..++|+||++|+++.|++. .+|
T Consensus       513 aL~V~r~sI~P~i~t~fel~Gc~~iWtV~~~~~~~~---------~~~~~h~~lilS~e~~t~il~tge~~~Ev~~-s~f  582 (1366)
T KOG1896|consen  513 ALSVIRRSIRPEIATEFELPGCVDIWTVFIKGRKRE---------EDNTQHLYLILSTESRTMILETGEELLEVSG-SGF  582 (1366)
T ss_pred             ceEEEeecccceeeEEEEecCeeeEEEEEEeccccc---------cccCcceEEEeecccchhhhhccchhhhccc-cee
Confidence            9987            78999999999998654322         2334599999999999999999999999975 589


Q ss_pred             cccCCcEEEEeeCCCCEEEEEecCcEEEEeCC-cceeeeecCCCCCCCCCCCCCCcEEEEEEcCCEEEEEEeCCcEEEEE
Q 001003          606 FVQGRTIAAGNLFGRRRVIQVFERGARILDGS-YMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLV  684 (1192)
Q Consensus       606 ~~~~~TI~ag~l~~~~~IvQVt~~~vrli~~~-~~~q~i~~~~~~~e~~~~~~~~~Iv~asi~dpyvlv~~~dg~i~~l~  684 (1192)
                      ..+++||++|+++++++|||||++++|++|++ .+.|.+++.          .+..+++++++||||+|....|.|.+|.
T Consensus       583 ~~~~~Tl~~gnlg~~rriVQVtp~~~rllDg~~r~lq~i~fd----------~~~~vv~~sv~dpyv~v~~~~g~i~~~~  652 (1366)
T KOG1896|consen  583 TRDGPTLFAGNLGNERRIVQVTPSGLRLLDGDLRMLQRIPFD----------SGAIVVQTSVADPYVAVRSSEGRITLYD  652 (1366)
T ss_pred             EeccceEEEEecCCceEEEEEccceeEEecCcchheeEeccc----------cCCcEEEEeccCceEEEEEcCCceEEEE
Confidence            99999999999999999999999999999995 578888882          3456999999999999999999999999


Q ss_pred             ecCCCceEeeccccccccCCCceeEEEeeccCCC-------------CCcccccccccccccCccccccCCCCCCCCCCc
Q 001003          685 GDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGP-------------EPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGD  751 (1192)
Q Consensus       685 ~d~~~~~l~~~~~~~l~~~~~~i~~~~l~~d~~~-------------~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  751 (1192)
                      ++.+..+|-+.++  +   ...+.++++|.|.+.             .+|.+.. ++.+..... ..+++.+++...+..
T Consensus       653 l~~~s~rl~~~~~--~---s~~~~sv~~~~dlsg~f~~~s~l~~k~~~~~gr~~-~~~~~~~~~-~kv~~~egg~~~~~~  725 (1366)
T KOG1896|consen  653 LEEKSHRLALHDP--M---SFKVVSVSLPADLSGMFTTLSDLSLKGNEANGRSS-EAEGLQSLP-CKVDDEEGGSPEQEP  725 (1366)
T ss_pred             eccccchhhccCc--c---cceeEEEechhhhccceEEEeeecccCcccccccc-cccccccCC-ccccCCCCCCcccCc
Confidence            9887666655554  2   345666677766542             2222222 111111111 334433322222223


Q ss_pred             EEEEEEecCCeEEEEECCCceeeEEeeccccccceecccccccccccccccccCCCccCCCCCcccccccccEEEEEEee
Q 001003          752 IYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQR  831 (1192)
Q Consensus       752 ~~l~v~~~~g~l~I~sLP~~~~v~~~~~l~~~~~~l~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~eil~~~  831 (1192)
                      +||++++++|.|+||++|++++||.++.|+.++++|.|+....++..                  +...+..+.++...+
T Consensus       726 ~~~~~~~e~g~leiy~~pd~~lVf~v~~f~~~~~~L~~~~~~~~~~~------------------~~s~~~~l~q~~~~~  787 (1366)
T KOG1896|consen  726 YWCVFVTESGTLEIYALPDFDLVFEVDMFDTGNRVLMDSRLRGPTTN------------------KESEDLELKQLFVNP  787 (1366)
T ss_pred             eEEEEEcCCCceEEEccCCcceEEEeeccCCCcceEEeecccCcccc------------------ccccchHHHHhhccc
Confidence            99999999999999999999999999999999999988533222100                  001124567777888


Q ss_pred             cCCC--CCccEEEEEecCCcEEEEEEeeecCCCCCCCCCCCCccccccccccccccccceeeEEecCCCccCCCCC----
Q 001003          832 WSAH--HSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREET----  905 (1192)
Q Consensus       832 ~g~~--~~~p~Llv~l~dG~l~~Y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lrF~kv~~~~~~~~~~----  905 (1192)
                      +|.+  ..+|||+..+.+|++++|++|+..   +                     ++..+++|+|+|++...++.+    
T Consensus       788 L~~e~~~~e~~L~lv~~~~eil~Ykaf~~~---~---------------------~~~~~~~f~kvp~~~~~~~~~p~~~  843 (1366)
T KOG1896|consen  788 LGSEIVFKEPHLFLVVSDNEILIYKAFPQL---S---------------------QGNLKVFFKKVPHNLNIRTDKPHFL  843 (1366)
T ss_pred             cchhhhccCCceEEEEeCceEEEEeecccc---C---------------------ccchhhhhhhCCHhhcccccCCccc
Confidence            8877  789999999999999999999611   1                     111256899999866543321    


Q ss_pred             -------------CCCCCccceEEeeccCCceEEEEcCCCCeEEEE-eCCceEEEecCCCCceeEEecccCCCCCCcEEE
Q 001003          906 -------------PHGAPCQRITIFKNISGHQGFFLSGSRPCWCMV-FRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIY  971 (1192)
Q Consensus       906 -------------~~~~g~~~l~~f~~i~G~~gVF~~G~rP~wi~~-~~g~l~~~p~~~~~~v~~~t~F~~~~c~~Gfi~  971 (1192)
                                   ..+...+++++|++++||+|||+||++|+||+. .+|.+++||+.+.++|.+|+||||+|||+||+|
T Consensus       844 ~~~~~~~~~e~~~~~~~~~~~m~~f~~i~ghsgvfv~Gs~P~~il~t~rg~lr~h~~~gngpv~sfapfhnvn~p~gfiy  923 (1366)
T KOG1896|consen  844 CKKREGGGAEEGASVSVIVQRMTYFEDIGGHSGVFVTGSKPYLILLTFRGVLRFHPVFGNGPVGSFAPFHNVNCPRGFIY  923 (1366)
T ss_pred             chhhccccccccccccceeeeEEeeccccCeeEEEEecCCceEEEEEcccccceeeeecCCcceeeeeeeccCCCcceEE
Confidence                         122345689999999999999999999999998 599999999999999999999999999999999


Q ss_pred             EEecCeEEEEEcCCCCccCCccceEEEecCCCccCeEEEecCCCEEEEEEeecCccccccccccccccccccccccCCCC
Q 001003          972 VTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLS 1051 (1192)
Q Consensus       972 ~~~~~~LrI~~l~~~~~~d~~~~vrk~ipL~~tp~~Iay~~~~~~y~v~~s~~~~~~~~~~~~~~~~ee~~~~~~~~~~~ 1051 (1192)
                      |+.++.++||.+|+...||+.||+|| |||++|||+++||++.++|+|+++.+.  ++   +.  .++|..+    +..+
T Consensus       924 vd~~~~l~i~~lp~~~~Ydn~wPvkk-Ipl~~T~~~vvYh~e~~vy~v~t~~~~--~~---~~--~~~d~~e----~~~~  991 (1366)
T KOG1896|consen  924 VDRQGELVICVLPEALSYDNKWPVKK-IPLRKTPHQVVYHYEKKVYAVITSTPV--PY---ER--LGEDGEE----EVIS  991 (1366)
T ss_pred             ECCCceEEEEEcchhcccCCCCcccc-cccccchhheeeeccceEEEEEEeccc--ee---ee--ccccccc----cccc
Confidence            99999999999999999999999999 999999999999999999999998752  22   22  2333221    1345


Q ss_pred             ccccccCcceeeeEEEEEcCCCCCCCceeeeeEECCCCcceEEEEEEEecc-ccCCCCceEEEEEeccccCcccccCceE
Q 001003         1052 SVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFN-TTTKENETLLAIGTAYVQGEDVAARGRV 1130 (1192)
Q Consensus      1052 ~~~~~~~p~~~~~sv~Lldp~~~~~twe~id~~el~~~E~v~sv~~v~L~s-~~t~~~~~ylaVGTa~~~gEd~~~rGRI 1130 (1192)
                      .+|..+.|+.++|+|+|++|.    +|++++.|+|++||||++||.|.|.+ ++++++|+||||||++++|||.|+||||
T Consensus       992 ~de~~~~p~~~~f~i~LisP~----sw~vi~~iefq~~E~v~~~k~v~L~~~~t~~~~k~ylavGT~~~~gEDv~~RGr~ 1067 (1366)
T KOG1896|consen  992 RDENVIHPEGEQFSIQLISPE----SWEVIDKIEFQENEHVLHMKYVILDDEETTKGKKPYLAVGTAFIQGEDVPARGRI 1067 (1366)
T ss_pred             ccccccccccccceeEEecCC----ccccccccccCccceeeEEEEEEEEecccccCCcceEEEEEeecccccccCcccE
Confidence            678889999999999999995    99999999999999999999999995 4567789999999999999999999999


Q ss_pred             EEEEeee---CCCCCceeEeecccCcccccchhcccCceEEEeecc---------eEEeeeh
Q 001003         1131 LLFSTGR---NADNPQNLVLSGSYGPLFSSVQIDFASHFFAICSNS---------FVFVFLF 1180 (1192)
Q Consensus      1131 lvfev~~---~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~---------~~~~~~~ 1180 (1192)
                      +|||||+   +|++|.|.+   |+|-+|.   +|.||++.|.|.++         ||+||-|
T Consensus      1068 hi~diIeVVPepgkP~t~~---KlKel~~---eE~KGtVsavceV~G~l~~~~GqKI~v~~l 1123 (1366)
T KOG1896|consen 1068 HIFDIIEVVPEPGKPFTKN---KLKELYI---EEQKGTVSAVCEVRGHLLSSQGQKIIVRKL 1123 (1366)
T ss_pred             EEEEEEEecCCCCCCcccc---eeeeeeh---hhcccceEEEEEeccEEEEccCcEEEEEEe
Confidence            9999997   999999999   9999999   99999999999988         7888777


No 2  
>KOG1897 consensus Damage-specific DNA binding complex, subunit DDB1 [Replication, recombination and repair]
Probab=100.00  E-value=2.5e-117  Score=1047.32  Aligned_cols=841  Identities=20%  Similarity=0.304  Sum_probs=688.1

Q ss_pred             cchhhhhccCCCceeeEEEEEeecCCCCCCCCCcccccccccccCCCCCCCCCCCeEEEEcCCeEEEEEEEEeccCCccc
Q 001003            2 SFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTAANVIEIYVVRVQEEGSKES   81 (1192)
Q Consensus         2 ~~~~~~~~~~pT~V~hsv~~~Ft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~nLVvak~n~LeIy~v~~~~~g~~~~   81 (1192)
                      +|+|+.++|+||+|.+|+.||||++...                           ||+|||+|+|+||.+.++  |    
T Consensus         1 ~~~Y~vtaqkpT~V~~av~gnFts~e~~---------------------------nlivAk~~~lei~~~~~~--G----   47 (1096)
T KOG1897|consen    1 SMNYVVTAQKPTAVVTAVVGNFTSPENL---------------------------NLIVAKGNRLEILLVEPN--G----   47 (1096)
T ss_pred             CeeEEEEecCCceEeEEEeecccCccce---------------------------eeeeeccceEEEEeeccc--c----
Confidence            5889999999999999999999999876                           999999999999998643  6    


Q ss_pred             cCCccccccccccccccccEEEEEEEEeeeEEeEEEEEecCCCCCCCCccEEEEEeCCCeEEEEEEeCCCCCEEEEeeee
Q 001003           82 KNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHC  161 (1192)
Q Consensus        82 ~~~~~~~~~~~~~~~~~~~L~lv~e~~l~G~I~~l~~~r~~~~~~~~~~D~Llv~~~~aklsil~~d~~~~~l~t~Slh~  161 (1192)
                                         |+.+++.++||+|..|+.||+++.    .+|+|+|+|+++++++|+||.+..+..|+....
T Consensus        48 -------------------Lq~i~sv~ifg~I~~i~~fRp~g~----~kD~LfV~t~~~~~~iL~~d~~~~~vv~~a~~~  104 (1096)
T KOG1897|consen   48 -------------------LQPITSVPIFGTIATIALFRPPGS----DKDYLFVATDSYRYFILEWDEESIQVVTRAHGD  104 (1096)
T ss_pred             -------------------ceeeEeeccceeEEEEEeecCCCC----CcceEEEEECcceEEEEEEccccceEEEEeccc
Confidence                               999999999999999999999987    799999999999999999999767777766655


Q ss_pred             eeccccccccCCcccccCCCeEEECCCCcEEEEEEcCceEEEEeCccCCCCCCCCCCCCCCCCCccceeeceEEEEcccc
Q 001003          162 FESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSGLVGDEDTFGSGGGFSARIESSHVINLRDL  241 (1192)
Q Consensus       162 ~E~~~~~~~~~G~~~~~~~~~l~VDP~~Rc~~l~~y~~~L~ilP~~~~~~~l~~~~~~~~~~~~~~~~~~~s~~i~l~~l  241 (1192)
                      .      ..|.| |+..+|++++|||.+|.|++++|++.+.+||+...+..             ........|.+++.++
T Consensus       105 v------~dr~g-r~s~~g~~~~VDp~~R~Igl~~yqgl~~vIp~d~~~sh-------------t~~s~l~~fn~rfdel  164 (1096)
T KOG1897|consen  105 V------SDRSG-RPSDNGQILLVDPKGRVIGLHLYQGLFKVIPIDSDESH-------------TGGSLLKAFNVRFDEL  164 (1096)
T ss_pred             c------ccccc-ccCCCceEEEECCCCcEEEEEeecCeEEEEEecccccc-------------cCcccccccccccCcc
Confidence            2      35788 55799999999999999999999999999999754210             1112346788999888


Q ss_pred             CccceeeeeeccCCcccEEEEEEecCCCcccceeeeeeeeEEEEEEEeeccce-eeeeeeeccCCcccceeEEecCCCCe
Q 001003          242 DMKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSISTTLKQ-HPLIWSAMNLPHDAYKLLAVPSPIGG  320 (1192)
Q Consensus       242 di~~V~D~~FL~gy~~PtlaiL~e~~~tw~gr~~~~~dt~~l~~~sLdl~~k~-~~~i~s~~~Lp~d~~~LipvP~p~GG  320 (1192)
                         ||.||+||||...||+|+||++..   ||        ++..|.||+..|. ....|+ +++..++..+||||.|.||
T Consensus       165 ---~v~Di~fly~~s~pt~~vly~Ds~---~~--------Hv~~yelnl~~ke~~~~~w~-~~v~~~a~~li~VP~~~gG  229 (1096)
T KOG1897|consen  165 ---NVYDIKFLYGCSDPTLAVLYKDSD---GR--------HVKTYELNLRDKEFVKGPWS-NNVDNGASMLIPVPSPIGG  229 (1096)
T ss_pred             ---eEEEEEEEcCCCCCceEEEEEcCC---Cc--------EEEEEEeccchhhccccccc-cccccCCceeeecCCCCce
Confidence               999999999999999999999874   43        4558899998665 456899 8999999999999999999


Q ss_pred             EEEEecceEEEEeCCCceeEeccccccccCCCccCcCCCceeEeeceeeEEeeCcEEEEEcCCCCEEEEEEEECCceEee
Q 001003          321 VLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQR  400 (1192)
Q Consensus       321 vLVig~n~I~y~d~~~~~~~a~N~~~~~~~~~~~~~~~~~~l~l~~~~~~~~~~~~~Ll~~~~G~L~~l~l~~dg~~V~~  400 (1192)
                      |||+|++.|+|.++....+++  ++        ..++..  +.  +....-.+..+|||+|++|.||+|.+...+.+|++
T Consensus       230 vlV~ge~~I~Y~~~~~~~ai~--p~--------~~~~~t--~~--~~~~v~~~~~~yLl~d~~G~Lf~l~l~~~~e~~s~  295 (1096)
T KOG1897|consen  230 VLVIGEEFIVYMSGDNFVAIA--PL--------TAEQST--IV--CYGRVDLQGSRYLLGDEDGMLFKLLLSHTGETVSG  295 (1096)
T ss_pred             EEEEeeeEEEEeeCCceeEec--cc--------ccCCce--EE--EcccccCCccEEEEecCCCcEEEEEeecccccccc
Confidence            999999999999997544332  21        112221  10  00011134457999999999999999988888888


Q ss_pred             --EEEEecCCCccccceEEecCCeEEEEeeeCCeEEEEEeeCCCcccccCCCccccCCcccCCccchhccCCCcchhhcc
Q 001003          401 --LDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDM  478 (1192)
Q Consensus       401 --l~l~~~~~~~~~s~l~~l~~g~lFvGS~~GDS~Ll~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~d~~  478 (1192)
                        |+++|+|++++|+||+||++|+||+||++|||||+++...+                                  |  
T Consensus       296 ~~lkve~lge~siassi~~L~ng~lFvGS~~gdSqLi~L~~e~----------------------------------d--  339 (1096)
T KOG1897|consen  296 LDLKVEYLGETSIASSINYLDNGVLFVGSRFGDSQLIKLNTEP----------------------------------D--  339 (1096)
T ss_pred             eEEEEEecCCcchhhhhhcccCceEEEeccCCceeeEEccccC----------------------------------C--
Confidence              99999999999999999999999999999999999987531                                  0  


Q ss_pred             cCccccccccCCCCCccccccceeEEEEeeecccCCcccccccccc--CCCC--CccCCCCCCceEE------------E
Q 001003          479 VNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRI--NADA--SATGISKQSNYEL------------V  542 (1192)
Q Consensus       479 ~~~~e~~ly~~~~~~~~~~~~~~~~~v~Dsl~NigPI~D~~vg~~~--~~~~--~~sG~g~~gsL~i------------~  542 (1192)
                                          ...+..++++++|||||.||+|.+..  ++.+  +|||++++|+||+            +
T Consensus       340 --------------------~gsy~~ilet~~NLgPI~Dm~Vvd~d~q~q~qivtCsGa~kdgSLRiiRngi~I~e~A~i  399 (1096)
T KOG1897|consen  340 --------------------VGSYVVILETFVNLGPIVDMCVVDLDRQGQGQIVTCSGAFKDGSLRIIRNGIGIDELASI  399 (1096)
T ss_pred             --------------------CCchhhhhhhcccccceeeEEEEeccccCCceEEEEeCCCCCCcEEEEecccccceeeEe
Confidence                                02346789999999999999997754  2222  6999999999999            6


Q ss_pred             eCCCcCEEEEEEecCCCCCCCCcccccccCCCcceEEEEEeccceEEEEecCceeeeecccCccccCCcEEEEeeCCCCE
Q 001003          543 ELPGCKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRR  622 (1192)
Q Consensus       543 eLpg~~~iWtv~~~~~~~~~~~~~~~~~~~~~~~~yLvlS~~~~T~Vl~~g~~~eEv~~~~gF~~~~~TI~ag~l~~~~~  622 (1192)
                      +|||+++||++|..              .+++||.|||+||.++|+||.+++++||.. ..||.++++||+|++++ ++.
T Consensus       400 ~l~Gikg~w~lk~~--------------v~~~~d~ylvlsf~~eTrvl~i~~e~ee~~-~~gf~~~~~Tif~S~i~-g~~  463 (1096)
T KOG1897|consen  400 DLPGIKGMWSLKSM--------------VDENYDNYLVLSFISETRVLNISEEVEETE-DPGFSTDEQTIFCSTIN-GNQ  463 (1096)
T ss_pred             ecCCccceeEeecc--------------ccccCCcEEEEEeccceEEEEEccceEEec-cccccccCceEEEEccC-Cce
Confidence            89999999999964              567899999999999999999999999985 47999999999999995 566


Q ss_pred             EEEEecCcEEEEeCCcceeeeecCCCCCCCCCCCCCCcEEEEEEcCCEEEEEEeCCcEEEEEecCCCceEeecccccccc
Q 001003          623 VIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIES  702 (1192)
Q Consensus       623 IvQVt~~~vrli~~~~~~q~i~~~~~~~e~~~~~~~~~Iv~asi~dpyvlv~~~dg~i~~l~~d~~~~~l~~~~~~~l~~  702 (1192)
                      |+|||+++|||+++..+..         +|.+| .+..|..++.+..+|+|+..++.+.++..+..+.  .......+  
T Consensus       464 lvQvTs~~iRl~ss~~~~~---------~W~~p-~~~ti~~~~~n~sqVvvA~~~~~l~y~~i~~~~l--~e~~~~~~--  529 (1096)
T KOG1897|consen  464 LVQVTSNSIRLVSSAGLRS---------EWRPP-GKITIGVVSANASQVVVAGGGLALFYLEIEDGGL--REVSHKEF--  529 (1096)
T ss_pred             EEEEecccEEEEcchhhhh---------cccCC-CceEEEEEeecceEEEEecCccEEEEEEeeccce--eeeeehee--
Confidence            9999999999999874433         78887 7778888999999999999888888888776552  22233333  


Q ss_pred             CCCceeEEEeeccCCCCCcccccccccccccCccccccCCCCCCCCCCcEEEEEEecCCeEEEEECCCceeeEEeecccc
Q 001003          703 SKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVS  782 (1192)
Q Consensus       703 ~~~~i~~~~l~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~v~~~~g~l~I~sLP~~~~v~~~~~l~~  782 (1192)
                       ..+++|+    |.+|.+                        .....+....+..|++-++.|..+||+.+++....   
T Consensus       530 -e~evaCL----Disp~~------------------------d~~~~s~~~aVG~Ws~~~~~l~~~pd~~~~~~~~l---  577 (1096)
T KOG1897|consen  530 -EYEVACL----DISPLG------------------------DAPNKSRLLAVGLWSDISMILTFLPDLILITHEQL---  577 (1096)
T ss_pred             -cceeEEE----ecccCC------------------------CCCCcceEEEEEeecceEEEEEECCCcceeeeecc---
Confidence             4566654    666531                        00112334444599999999999999888766531   


Q ss_pred             ccceecccccccccccccccccCCCccCCCCCcccccccccEEEEEEeecCCCCCccEEEEEecCCcEEEEEEeeecCCC
Q 001003          783 GRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPE  862 (1192)
Q Consensus       783 ~~~~l~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~eil~~~~g~~~~~p~Llv~l~dG~l~~Y~~~~~~~~~  862 (1192)
                       . .                                  +..+++|++..++  .++-||++.++||.|+.|.++...+  
T Consensus       578 -~-~----------------------------------~~iPRSIl~~~~e--~d~~yLlvalgdG~l~~fv~d~~tg--  617 (1096)
T KOG1897|consen  578 -S-G----------------------------------EIIPRSILLTTFE--GDIHYLLVALGDGALLYFVLDINTG--  617 (1096)
T ss_pred             -C-C----------------------------------CccchheeeEEee--ccceEEEEEcCCceEEEEEEEcccc--
Confidence             1 0                                  1234468888885  3489999999999999999874221  


Q ss_pred             CCCCCCCCCccccccccccccccccceeeEEecCCCccCCCCCCCCCCccceE--EeeccCCceEEEEcCCCCeEEEEeC
Q 001003          863 NTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRIT--IFKNISGHQGFFLSGSRPCWCMVFR  940 (1192)
Q Consensus       863 ~~~~~~~~~~~~~~~~~~~~~~~~~~~lrF~kv~~~~~~~~~~~~~~g~~~l~--~f~~i~G~~gVF~~G~rP~wi~~~~  940 (1192)
                         .+               +++|.   +                .+|.+|+.  .|.+.+ .++||+||+||+.+|+++
T Consensus       618 ---~l---------------sd~Kk---~----------------~lGt~P~~Lr~f~sk~-~t~vfa~sdrP~viY~~n  659 (1096)
T KOG1897|consen  618 ---QL---------------SDRKK---V----------------TLGTQPISLRTFSSKS-RTAVFALSDRPTVIYSSN  659 (1096)
T ss_pred             ---eE---------------ccccc---c----------------ccCCCCcEEEEEeeCC-ceEEEEeCCCCEEEEecC
Confidence               11               12221   1                15777655  555544 489999999996667789


Q ss_pred             CceEEEecCCCCceeEEecccCCCCCCcEEEEEecCeEEEEEcCCCCccCCccceEEEecCCCccCeEEEecCCCEEEEE
Q 001003          941 ERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLI 1020 (1192)
Q Consensus       941 g~l~~~p~~~~~~v~~~t~F~~~~c~~Gfi~~~~~~~LrI~~l~~~~~~d~~~~vrk~ipL~~tp~~Iay~~~~~~y~v~ 1020 (1192)
                      |+|.+.|+ +.+.+..+|||++..||+++++++.. +|+|++|++++++    ++|+ ||++++||||+||+.+.+|+|.
T Consensus       660 ~kLv~spl-s~kev~~~c~f~s~a~~d~l~~~~~~-~l~i~tid~iqkl----~irt-vpl~~~prrI~~q~~sl~~~v~  732 (1096)
T KOG1897|consen  660 GKLVYSPL-SLKEVNHMCPFNSDAYPDSLASANGG-ALTIGTIDEIQKL----HIRT-VPLGESPRRICYQESSLTFGVL  732 (1096)
T ss_pred             CcEEEecc-chHHhhhhcccccccCCceEEEecCC-ceEEEEecchhhc----ceee-ecCCCChhheEecccceEEEEE
Confidence            99999997 77999999999999999998888776 9999999999976    9999 9999999999999999999998


Q ss_pred             EeecCccccccccccccccccccccccCCCCccccccCcceeeeEEEEEcCCCCCCCceeeeeEECCCCcceEEEEEEEe
Q 001003         1021 VSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL 1100 (1192)
Q Consensus      1021 ~s~~~~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~~~p~~~~~sv~Lldp~~~~~twe~id~~el~~~E~v~sv~~v~L 1100 (1192)
                      +.+-+.          ..+.                ..++.+.++++++|+.    ||++++.++|++||.+.|+.+++|
T Consensus       733 s~r~e~----------~~~~----------------~~ee~~~s~l~vlD~n----Tf~vl~~hef~~~E~~~Si~s~~~  782 (1096)
T KOG1897|consen  733 SNRIES----------SAEY----------------YGEEYEVSFLRVLDQN----TFEVLSSHEFERNETALSIISCKF  782 (1096)
T ss_pred             eccccc----------chhh----------------cCCcceEEEEEEecCC----ceeEEeeccccccceeeeeeeeee
Confidence            875310          1110                0122578899999985    999999999999999999999999


Q ss_pred             ccccCCCCceEEEEEeccccC-cccccCceEEEEEeeeCCCCCceeEeecccCcccccchhcccCceEEEeecceEEeee
Q 001003         1101 FNTTTKENETLLAIGTAYVQG-EDVAARGRVLLFSTGRNADNPQNLVLSGSYGPLFSSVQIDFASHFFAICSNSFVFVFL 1179 (1192)
Q Consensus      1101 ~s~~t~~~~~ylaVGTa~~~g-Ed~~~rGRIlvfev~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~ 1179 (1192)
                      .   .+ ...|++|||+++++ |++|..|||+||++.+ ..++|.+|++.+-|+++++++  |||+++|.. +++|++|+
T Consensus       783 ~---~d-~~t~~vVGT~~v~Pde~ep~~GRIivfe~~e-~~~L~~v~e~~v~Gav~aL~~--fngkllA~I-n~~vrLye  854 (1096)
T KOG1897|consen  783 T---DD-PNTYYVVGTGLVYPDENEPVNGRIIVFEFEE-LNSLELVAETVVKGAVYALVE--FNGKLLAGI-NQSVRLYE  854 (1096)
T ss_pred             c---CC-CceEEEEEEEeeccCCCCcccceEEEEEEec-CCceeeeeeeeeccceeehhh--hCCeEEEec-CcEEEEEE
Confidence            6   22 36899999999998 7789999999999999 899999999999999999999  999999998 57899999


Q ss_pred             hhhheeeee
Q 001003         1180 FSFLRSLFI 1188 (1192)
Q Consensus      1180 ~~~~~~~~~ 1188 (1192)
                      |--.|+|-+
T Consensus       855 ~t~~~eLr~  863 (1096)
T KOG1897|consen  855 WTTERELRI  863 (1096)
T ss_pred             ccccceehh
Confidence            999998865


No 3  
>KOG1898 consensus Splicing factor 3b, subunit 3 [RNA processing and modification]
Probab=100.00  E-value=5.9e-95  Score=859.59  Aligned_cols=890  Identities=20%  Similarity=0.309  Sum_probs=703.7

Q ss_pred             chhhhhccCCCceeeEEEEEeecCCCCCCCCCcccccccccccCCCCCCCCCCCeEEEEcCCeEEEEEEEEeccCCcccc
Q 001003            3 FAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTAANVIEIYVVRVQEEGSKESK   82 (1192)
Q Consensus         3 ~~~~~~~~~pT~V~hsv~~~Ft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~nLVvak~n~LeIy~v~~~~~g~~~~~   82 (1192)
                      |.|..+++.+|+|.||++|+|.+++.+                           ++++++++.|++|++.++ +|     
T Consensus         2 ~lysltlq~~t~i~~~~~g~fs~~k~q---------------------------eIv~~~~s~l~L~~~d~~-~G-----   48 (1205)
T KOG1898|consen    2 FLYSLTLQNQTGIVQAIYGNFSGPKAQ---------------------------EIVLGRGSILELYRIDEN-DG-----   48 (1205)
T ss_pred             chhhhhhhcccceeeeehhhccCCchh---------------------------eEEEEeeeEEEEEEecCC-Cc-----
Confidence            678899999999999999999999876                           999999999999998632 35     


Q ss_pred             CCccccccccccccccccEEEEEEEEeeeEEeEEEEEecCCCCCCCCccEEEEEeCCCeEEEEEEeCCCCCEEEEeeeee
Q 001003           83 NSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCF  162 (1192)
Q Consensus        83 ~~~~~~~~~~~~~~~~~~L~lv~e~~l~G~I~~l~~~r~~~~~~~~~~D~Llv~~~~aklsil~~d~~~~~l~t~Slh~~  162 (1192)
                                       ||++++++.+||+|++++.+|+.+.    .+|+|+|++|+++++|++|+.+++.+++  +|+ 
T Consensus        49 -----------------~l~~i~~~~vFg~Irsla~~~lt~~----~kD~LaV~SDSGri~il~y~~ek~~~~~--~~q-  104 (1205)
T KOG1898|consen   49 -----------------RLKTICRQEVFGTIRSLAAFRLTGG----TKDYLAVGSDSGRISILEYNNEKNHFEK--LHQ-  104 (1205)
T ss_pred             -----------------eEEEEEEEeehhhhhhhhccccCCC----CccEEEEEcCCceEEEEEechhhhcccc--ccc-
Confidence                             4999999999999999999999986    8999999999999999999999988865  888 


Q ss_pred             eccccccccCCcccccCCCeEEECCCCcEEEEE-EcCceEEEEeCccCCCCCCCC-CCCCCCCCCccceeeceEEEEccc
Q 001003          163 ESPEWLHLKRGRESFARGPLVKVDPQGRCGGVL-VYGLQMIILKASQGGSGLVGD-EDTFGSGGGFSARIESSHVINLRD  240 (1192)
Q Consensus       163 E~~~~~~~~~G~~~~~~~~~l~VDP~~Rc~~l~-~y~~~L~ilP~~~~~~~l~~~-~~~~~~~~~~~~~~~~s~~i~l~~  240 (1192)
                      |+    ++|+|+|+..||+|+.+||.|||+++. +|+++|+++        +++| ....++++|+++++.++.++++..
T Consensus       105 et----fGks~~rrivpG~y~~idp~Gra~misave~~kLvyv--------lnrD~~a~ltisSpleahk~~sic~~l~~  172 (1205)
T KOG1898|consen  105 ET----FGKSGCRRIVPGQYLAIDPKGRAVMISAVEKQKLVYV--------LNRDGAARLTISSPLEAHKAHSICLDLVG  172 (1205)
T ss_pred             cc----cCcccceEeccccEEEEcCCccceeeehhhcCcEEEE--------EccchhhhceecCchhhccCCcEEEEEEE
Confidence            66    699999999999999999999999998 999999998        3222 236678999999999999999999


Q ss_pred             cCccceeeeeeccCCcccEEEEEEec----CCCcccceeeeeeeeEEEEEEEeeccceeeeeeeeccCCcccceeEEecC
Q 001003          241 LDMKHVKDFIFVHGYIEPVMVILHER----ELTWAGRVSWKHHTCMISALSISTTLKQHPLIWSAMNLPHDAYKLLAVPS  316 (1192)
Q Consensus       241 ldi~~V~D~~FL~gy~~PtlaiL~e~----~~tw~gr~~~~~dt~~l~~~sLdl~~k~~~~i~s~~~Lp~d~~~LipvP~  316 (1192)
                      +|.          ||.||+||.|+-+    ....+|... ..+-..+++|.||+..++..+.|+. -+...++++++||+
T Consensus       173 Vd~----------gf~np~fa~LE~dy~~a~~d~tgeaa-~~~~~~l~fYeldlglnhvvrk~s~-p~~~~~n~l~~VP~  240 (1205)
T KOG1898|consen  173 VDV----------GFENPIFAALERDYSEADNDPTGEAA-TMTQKVLTFYELDLGLNHVVRKASE-PVNHFGNFLLTVPG  240 (1205)
T ss_pred             Eec----------cCCCceEEEEeechhhcccCchhhhh-hccccceeEEEEecccceeEEEccc-ccCCCceEEEEecC
Confidence            998          9999999999954    222334332 2233456799999999999999987 47788999999998


Q ss_pred             C---CCeEEEEecceEEEEeCC-CceeEeccccccccCCCccCcCC--------CceeEeeceeeEEeeCcEEEEEcCCC
Q 001003          317 P---IGGVLVVGANTIHYHSQS-ASCALALNNYAVSLDSSQELPRS--------SFSVELDAAHATWLQNDVALLSTKTG  384 (1192)
Q Consensus       317 p---~GGvLVig~n~I~y~d~~-~~~~~a~N~~~~~~~~~~~~~~~--------~~~l~l~~~~~~~~~~~~~Ll~~~~G  384 (1192)
                      .   ..|||||+.|++.|++.. .+.            .+.+++++        ...+...++.+.-++.+++|+|+++|
T Consensus       241 G~D~ps~v~vc~~n~~~y~~~~d~p~------------~ri~~~rr~~~L~~~~~~vliv~s~~hk~k~~ff~llqt~~G  308 (1205)
T KOG1898|consen  241 GSDGPSGVLVCAENYLLYRNLGDHPD------------VRIPIERRINELSDAEDGVLIVSSAEHKTKSMFFFLLQTEYG  308 (1205)
T ss_pred             CCCCCcceEEecCceeeccccccCCC------------EEeccccccccCCccccccEEEEeecccccCCeEEEEEecCC
Confidence            6   359999999999999986 321            22233332        22344544444456778999999999


Q ss_pred             CEEEEEEEECCceEeeEEEEecCCCccccceEEecCCeEEEEeeeCCeEEEEEeeCCCcccccCCCccccCCcccCCccc
Q 001003          385 DLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTCGSGTSMLSSGLKEEFGDIEADAPST  464 (1192)
Q Consensus       385 ~L~~l~l~~dg~~V~~l~l~~~~~~~~~s~l~~l~~g~lFvGS~~GDS~Ll~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  464 (1192)
                      |+|+++|..|+..|..+++.|+++.|.+..||++++|+||+.|++||+.|||+.+.+          +++++.       
T Consensus       309 D~fk~tl~~d~d~v~el~lkYfDtvp~a~~L~I~k~GfLf~~sE~~n~~lyq~~~LG----------~~~~~~-------  371 (1205)
T KOG1898|consen  309 DLFKLTLEHDGDNVVELRLKYFDTVPCALQLCILKTGFLFVASEFGNHRLYQFEKLG----------EEDDDF-------  371 (1205)
T ss_pred             ceEEEEEecCCCcceeeeeehhcCCccceEEEEeccceEEEhhhccCcceeehhhcC----------CCccch-------
Confidence            999999999999999999999999999999999999999999999999999999864          333221       


Q ss_pred             hhccCCCcchhhcccCccccccccCCCCCccccccceeEEEEeeecccCCccccccccccCCCC----CccCCCCCCceE
Q 001003          465 KRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGPLKDFSYGLRINADA----SATGISKQSNYE  540 (1192)
Q Consensus       465 k~~~~~~~~~~d~~~~~~e~~ly~~~~~~~~~~~~~~~~~v~Dsl~NigPI~D~~vg~~~~~~~----~~sG~g~~gsL~  540 (1192)
                             ++.++..  +.+...|+++        ...+|..+++++||.|+.|+.+++..+++.    .|||.+.+++||
T Consensus       372 -------s~~~~~~--~~~~~~f~p~--------~l~nL~~~~~i~sl~p~~d~~I~~~~ne~~~qi~~~cg~~~~sslr  434 (1205)
T KOG1898|consen  372 -------SNAMTSE--EGKSVFFEPR--------ILKNLSPVSSVESLSPLLDISIGDDSNEDTPQIYSACGRGPRSSLR  434 (1205)
T ss_pred             -------hhhcccc--cCcceecccc--------ccccccchhhhhccCccceeEeeccCcccchhhhhhhCcCccccch
Confidence                   1111111  0122344443        245788999999999999999998766554    399999999998


Q ss_pred             E------------EeCCC-cCEEEEEEecCCCCCCCCcccccccCCCcceEEEEEeccceEEEEecCceeeeecccCccc
Q 001003          541 L------------VELPG-CKGIWTVYHKSSRGHNADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFV  607 (1192)
Q Consensus       541 i------------~eLpg-~~~iWtv~~~~~~~~~~~~~~~~~~~~~~~~yLvlS~~~~T~Vl~~g~~~eEv~~~~gF~~  607 (1192)
                      +            .+||| ++++||++.+              ..+.||+|||+||.++|+||++|+.+||+++ .||..
T Consensus       435 ~lR~gle~sel~~t~lp~~~ta~WTvk~~--------------~td~ydsyivvsF~n~TlVLsIgesveEvtd-sgFls  499 (1205)
T KOG1898|consen  435 ILRNGLEVSELLVTELPGNPTATWTVKKN--------------ITDVYDSYIVVSFVNGTLVLSIGESVEEVTD-SGFLS  499 (1205)
T ss_pred             hhccccchHHHhhhccCCCCceEEEEcCc--------------cccccceEEEEEeeccEEEEEcchhHHHhhh-ccccc
Confidence            8            25887 9999999875              5789999999999999999999999999985 69999


Q ss_pred             cCCcEEEEeeCCCCEEEEEecCcEEEEeCCcceeeeecCCCCCCCCCCCCCCcEEEEEEcCCEEEEEEeCCcEEEEEecC
Q 001003          608 QGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDP  687 (1192)
Q Consensus       608 ~~~TI~ag~l~~~~~IvQVt~~~vrli~~~~~~q~i~~~~~~~e~~~~~~~~~Iv~asi~dpyvlv~~~dg~i~~l~~d~  687 (1192)
                      +.+||+|+.| |+..+|||++.+||++..+.++.         +|..| ++.+|+.++++..+|+|++++|.+++|.++.
T Consensus       500 ~~~Tl~~~l~-Gd~slVQi~~d~iRhi~~~~r~~---------ew~~P-~~~~Iv~~avnr~qiVvalSngelvyfe~d~  568 (1205)
T KOG1898|consen  500 TTPTLACSLM-GDDSLVQIHPDGIRHIRPTKRIN---------EWKTP-ERVRIVKCAVNRRQIVVALSNGELVYFEGDV  568 (1205)
T ss_pred             CCceEEEEEe-cCCcEEEEchhhhhhcccccccc---------cccCC-CceEEEEEeecceEEEEEccCCeEEEEEecc
Confidence            9999999999 67889999999999998776432         78887 8899999999999999999999999999997


Q ss_pred             CCceEeeccccccccCCCceeEEEeeccCCCCCcccccccccccccCccccccCCCCCCCCCCcEEEEEEecCCeEEEEE
Q 001003          688 STCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFD  767 (1192)
Q Consensus       688 ~~~~l~~~~~~~l~~~~~~i~~~~l~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~v~~~~g~l~I~s  767 (1192)
                      .+.+.+......+   +..++|+++..+.-                            +.+ -+-+|+++..++.++|++
T Consensus       569 sgql~E~~er~tl---~~~vac~ai~~~~~----------------------------g~k-rsrfla~a~~d~~vriis  616 (1205)
T KOG1898|consen  569 SGQLNEFTERVTL---STDVACLAIGQDPE----------------------------GEK-RSRFLALASVDNMVRIIS  616 (1205)
T ss_pred             Cccceeeeeeeee---ceeehhhccCCCCc----------------------------chh-hcceeeeeccccceeEEE
Confidence            7665665444444   44567665543220                            111 234899999999999999


Q ss_pred             CCCcee--eEEeeccccccceecccccccccccccccccCCCccCCCCCcccccccccEEEEEEeecCCCCC----ccEE
Q 001003          768 VPNFNC--VFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHS----RPFL  841 (1192)
Q Consensus       768 LP~~~~--v~~~~~l~~~~~~l~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~eil~~~~g~~~~----~p~L  841 (1192)
                      |-.-..  .++...++                                        ..+..++++.++....    .-||
T Consensus       617 L~p~d~l~~ls~q~l~----------------------------------------~~~~s~~iv~~~~~~~~~~~~L~l  656 (1205)
T KOG1898|consen  617 LDPSDCLQPLSVQGLS----------------------------------------SPPESLCIVEMEATGGTDVAQLYL  656 (1205)
T ss_pred             ecCcceEEEccccccC----------------------------------------CCccceEEEEecccCCccceeEEE
Confidence            864222  22222211                                        1233456666654443    7899


Q ss_pred             EEEecCCcEEEEEEeeecCCCCCCCCCCCCccccccccccccccccceeeEEecCCCccCCCCCCCCCCccceEEee-cc
Q 001003          842 FAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIFK-NI  920 (1192)
Q Consensus       842 lv~l~dG~l~~Y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lrF~kv~~~~~~~~~~~~~~g~~~l~~f~-~i  920 (1192)
                      .++|.||-++.+.+..-.+                    +..+.|+   ||                +|.+|++.|+ ..
T Consensus       657 ~~GL~NGvllR~~id~v~G--------------------~l~d~rt---R~----------------lG~~pvkLf~~~~  697 (1205)
T KOG1898|consen  657 LIGLRNGVLLRFVIDTVTG--------------------QLLDIRT---RF----------------LGLRPVKLFPISM  697 (1205)
T ss_pred             EecccccEEEEEEeccccc--------------------ceeeehe---ee----------------eccccceEEEEee
Confidence            9999999999887753221                    1245555   66                4899999998 77


Q ss_pred             CCceEEEEcCCCCeEEEEe-CCceEEEecCCCCceeEEecccCCCCCCcEEEEEecCeEEEEEcCCCCccCCccceEEEe
Q 001003          921 SGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVI  999 (1192)
Q Consensus       921 ~G~~gVF~~G~rP~wi~~~-~g~l~~~p~~~~~~v~~~t~F~~~~c~~Gfi~~~~~~~LrI~~l~~~~~~d~~~~vrk~i  999 (1192)
                      .|.+.|++.++|| |+.++ +.++.+.|+ ++.+...++||.+..||.|.+++..+ .|||.++....+   .++++. +
T Consensus       698 ~~~s~vL~lSsr~-wl~y~~~~~~h~t~I-sy~~l~~as~~~S~qcpeGiv~i~~n-~l~i~~~~~~g~---~~n~~~-~  770 (1205)
T KOG1898|consen  698 RGQSDVLALSSRP-WLLYTYQQEFHLTPI-SYSTLEHASPFCSEQCPEGIVAISKN-TLRIIALDKLGK---VLNVDG-F  770 (1205)
T ss_pred             cCcceeEEecCCh-hhhhhhcceeeeecc-cccchhccccccccCCCcchhhhhhh-hhheeeehhhcc---cccccc-c
Confidence            8899999999999 99875 889999998 77899999999999999998888777 999999998863   579999 9


Q ss_pred             cCCCccCeEEEecCCCEEEEEEeecCccc-----cccccccc---cccccccccccC-------CCCccccccCcc---e
Q 001003         1000 PLKATPHQITYFAEKNLYPLIVSVPVLKP-----LNQVLSLL---IDQEVGHQIDNH-------NLSSVDLHRTYT---V 1061 (1192)
Q Consensus      1000 pL~~tp~~Iay~~~~~~y~v~~s~~~~~~-----~~~~~~~~---~~ee~~~~~~~~-------~~~~~~~~~~p~---~ 1061 (1192)
                      |+++|||+++|||+++..+++++.....-     .++.....   ..+++..|.+.+       +...+.....|.   .
T Consensus       771 ~l~~tprkvv~h~es~lLii~~td~~~~~~~~a~~~~~~~g~v~~s~~~~e~e~g~em~~~~~~~~~~~~v~~~p~a~~~  850 (1205)
T KOG1898|consen  771 PLAYTPRKVVIHPESGLLIIGRTDHNATLTKDARKNQMEAGGVLESGEEKEDEMGGEMEIIGREEVLPENVYGSPRAGNG  850 (1205)
T ss_pred             ccccCcceEEEecCCCeEEEEEecccchhhHHHhhhhhhcccccccccccchhhccchhhhccccccccccccCcccccC
Confidence            99999999999999999999988642110     01000000   011222222211       011111122221   3


Q ss_pred             eeeEEEEEcCCCCCCCceeeeeEECCCCcceEEEEEEEeccccCCCCceEEEEEeccccCcc--cccCceEEEEEeeeCC
Q 001003         1062 EEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGED--VAARGRVLLFSTGRNA 1139 (1192)
Q Consensus      1062 ~~~sv~Lldp~~~~~twe~id~~el~~~E~v~sv~~v~L~s~~t~~~~~ylaVGTa~~~gEd--~~~rGRIlvfev~~~~ 1139 (1192)
                      ..++|+.+|+.    +-.+++.+++.+||.++|++.+.+++   ++...|++||++...-+|  .-++|++|.|+++.+.
T Consensus       851 w~s~I~~~d~~----s~~~~~~~~l~~ne~a~~v~~~~fs~---~~~~~~~~v~~~~~~~l~~~~~~~g~~ytyk~~~~g  923 (1205)
T KOG1898|consen  851 WVSSIRVFDPK----SGKIICLVELGQNEAAFSVCAVDFSS---SEYQPFVAVGVATTEQLDSKSISSGFVYTYKFVRNG  923 (1205)
T ss_pred             ccceEEEEcCC----CCceEEEEeecCCcchhheeeeeecc---CCCceEEEEEeeccccccccccCCCceEEEEEEecC
Confidence            77899999997    55889999999999999999999983   334489999999988766  2389999999999999


Q ss_pred             CCCceeEeecccCcccccchhcccCceEEEeec
Q 001003         1140 DNPQNLVLSGSYGPLFSSVQIDFASHFFAICSN 1172 (1192)
Q Consensus      1140 ~~~e~~~~~~~~~~~~~~~~~~~~~~~~a~~~~ 1172 (1192)
                      +++|.+|.+..-+.+.++.+  |+|.+||.-+.
T Consensus       924 ~~lellh~T~~~~~v~Ai~~--f~~~~LagvG~  954 (1205)
T KOG1898|consen  924 DKLELLHKTEIPGPVGAICP--FQGRVLAGVGR  954 (1205)
T ss_pred             ceeeeeeccCCCccceEEec--cCCEEEEeccc
Confidence            99999998888888888887  88887776553


No 4  
>COG5161 SFT1 Pre-mRNA cleavage and polyadenylation specificity factor [RNA processing and modification]
Probab=100.00  E-value=9e-94  Score=827.32  Aligned_cols=974  Identities=17%  Similarity=0.215  Sum_probs=716.3

Q ss_pred             CcchhhhhccCCCceeeEEEEEeecCCCCCCCCCcccccccccccCCCCCCCCCCCeEEEEcCCeEEEEEEEEeccCCcc
Q 001003            1 MSFAAYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVTAANVIEIYVVRVQEEGSKE   80 (1192)
Q Consensus         1 m~~~~~~~~~~pT~V~hsv~~~Ft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~nLVvak~n~LeIy~v~~~~~g~~~   80 (1192)
                      |+ -+|.+....|.++||+.|+||+.+..                           ||+|.|+|.|+||+...|  +   
T Consensus         1 m~-~~y~d~~d~tv~~~~~ag~Ft~s~~~---------------------------~llv~~~Nil~v~~~~~d--~---   47 (1319)
T COG5161           1 MN-YLYSDESDWTVTEGCSAGLFTPSRTC---------------------------SLLVYNGNILAVRLWKYD--S---   47 (1319)
T ss_pred             Cc-chhhhhhHHHHhhccccceeeccccc---------------------------eEEEEeccEEEEEEeecc--C---
Confidence            54 46788999999999999999998876                           999999999999998744  4   


Q ss_pred             ccCCccccccccccccccccEEEEEEEEeeeEEeEEEEEecCCCCCCCCccEEEEEeCCCeEEEEEEeCCCCCEEEEeee
Q 001003           81 SKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMH  160 (1192)
Q Consensus        81 ~~~~~~~~~~~~~~~~~~~~L~lv~e~~l~G~I~~l~~~r~~~~~~~~~~D~Llv~~~~aklsil~~d~~~~~l~t~Slh  160 (1192)
                                         +|.++-++.++|.|++|...+...+    .+|.|++.|..||+++++||.+.+.|.|+|+|
T Consensus        48 -------------------~l~l~de~~~~e~~t~I~~~pq~~s----e~~~lll~t~~akis~lrf~sq~n~f~Tislh  104 (1319)
T COG5161          48 -------------------GLVLVDEHMLLEKVTQIEKYPQISS----EQDGLLLLTHRAKISLLRFDSQANEFRTISLH  104 (1319)
T ss_pred             -------------------CeeEchHHhhhhhhhhhhhcccccC----ccceEEEEeccceEEEEEehhhcccceeEEEe
Confidence                               2999999999999999999977765    79999999999999999999999999999999


Q ss_pred             eeeccccccccCCcc--cccCCCeEEECCCCcEEEEEEcCceEEEEeCccCCCC--CCCCCCCCC---------CCCC--
Q 001003          161 CFESPEWLHLKRGRE--SFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGGSG--LVGDEDTFG---------SGGG--  225 (1192)
Q Consensus       161 ~~E~~~~~~~~~G~~--~~~~~~~l~VDP~~Rc~~l~~y~~~L~ilP~~~~~~~--l~~~~~~~~---------~~~~--  225 (1192)
                      |||..    . .|..  .......+.-||++-|+ |++|++..+.+||+-...+  +.+.|.+..         ...|  
T Consensus       105 yyeGK----f-kgksLvelak~stle~D~~ssca-LlfneDi~~flpfhvnkndddev~~d~D~~~~~~~~~h~~i~psq  178 (1319)
T COG5161         105 YYEGK----F-KGKSLVELAKFSTLEFDIRSSCA-LLFNEDIGNFLPFHVNKNDDDEVRIDVDLGMFQMSKRHFSIFPSQ  178 (1319)
T ss_pred             eeccc----c-CCchhhhhhhhhheeeccCccch-hhhhhhhhhcccccccCCccccccccccccHHHHHHHHhhcCCCC
Confidence            99974    1 1222  12345678999999887 6788999999999754322  211111100         0000  


Q ss_pred             -------------ccceeeceEEEEccccC--ccceeeeeeccCCcccEEEEEEecCCCcccceeeeeeeeEEEEEEEee
Q 001003          226 -------------FSARIESSHVINLRDLD--MKHVKDFIFVHGYIEPVMVILHERELTWAGRVSWKHHTCMISALSIST  290 (1192)
Q Consensus       226 -------------~~~~~~~s~~i~l~~ld--i~~V~D~~FL~gy~~PtlaiL~e~~~tw~gr~~~~~dt~~l~~~sLdl  290 (1192)
                                   ...-..||+++...+||  |+||+|++||.||.+||+|+||+|.++|++....+|+++.+.+++||+
T Consensus       179 gtntfnkrkrt~~~~kfsaPs~Vl~~seld~~ikniiD~~FL~ny~~PTvallY~Pkl~~~~~~ti~k~p~~~~v~Tldl  258 (1319)
T COG5161         179 GTNTFNKRKRTLFPGKFSAPSKVLKFSELDGKIKNIIDFVFLENYSIPTVALLYDPKLSLPRKYTILKNPYNAIVFTLDL  258 (1319)
T ss_pred             CccccchhhhhhcCCcccCceeEEEehhhhccccccEEEEeeccCCCceEEEEecccccccceeEeecCceeEEEEEEec
Confidence                         01123589999999998  999999999999999999999999999999999999999999999999


Q ss_pred             ccceeeeeeeeccCCcccceeEEecCCCCeEEEEecceEEEEeCCC-ceeEeccccccccCCCc-cCcCC--CceeEeec
Q 001003          291 TLKQHPLIWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSA-SCALALNNYAVSLDSSQ-ELPRS--SFSVELDA  366 (1192)
Q Consensus       291 ~~k~~~~i~s~~~Lp~d~~~LipvP~p~GGvLVig~n~I~y~d~~~-~~~~a~N~~~~~~~~~~-~~~~~--~~~l~l~~  366 (1192)
                      +++.+.+|-.+..||+|.+..+|+|.   |.|++|.|+++|+|..+ .+++.+|+++.+...+. ..+++  ++++...|
T Consensus       259 ~~~~saVI~~~~~lP~d~~~~v~~p~---Gall~g~neli~idstg~~~~I~lNs~~~k~~~~~~v~d~s~~d~n~~~~g  335 (1319)
T COG5161         259 GAGRSAVIDEFLVLPRDFRVTVAGPV---GALLFGSNELILIDSTGSSYTIPLNSMSEKYGGNKIVEDISLSDVNCFSRG  335 (1319)
T ss_pred             CcchhhhhHhHhcCCceEEEEEeccc---ceEEEecccEEEEecCCcEEEeechhhHHHhcCCceEeecccceeeEeecC
Confidence            99999999888889999999999984   99999999999999988 67999999997765555 44555  56777888


Q ss_pred             eeeEEeeC-----cEEEEEcCCCCEEEEEEEECCceEeeEEEEec-------CCCccccceEEecCCeEEEEeeeCCeEE
Q 001003          367 AHATWLQN-----DVALLSTKTGDLVLLTVVYDGRVVQRLDLSKT-------NPSVLTSDITTIGNSLFFLGSRLGDSLL  434 (1192)
Q Consensus       367 ~~~~~~~~-----~~~Ll~~~~G~L~~l~l~~dg~~V~~l~l~~~-------~~~~~~s~l~~l~~g~lFvGS~~GDS~L  434 (1192)
                      ...-|+..     .++++++-+|+.|.|.+.+||+++.++.|..+       ...+-++|+..+++..+|+|+..+||.+
T Consensus       336 ttsIwipsSK~~~etl~l~dl~g~~yyl~~~~dgk~iigfdi~~L~~e~dllk~~s~~~Cv~~~n~~l~f~g~g~~ns~v  415 (1319)
T COG5161         336 TTSIWIPSSKCLIETLFLGDLNGDRYYLRISMDGKRIIGFDIASLEFEGDLLKKGSAVSCVGHVNNLLFFGGVGDSNSRV  415 (1319)
T ss_pred             ceeeeccCcccccceEEEEecCCCEEEEEEEeccceeeccceeeeeeeccccccCCCCeeEEEcCceEEEEEecCCceEE
Confidence            88888754     46899999999999999999999998777654       3577899999999999999999999999


Q ss_pred             EEEeeCCCcccccCCCccccCCcccCCccchhccCCCcchhhcccCccccccccCCCCCccccccceeEEEEeeecccCC
Q 001003          435 VQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVNIGP  514 (1192)
Q Consensus       435 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~d~~~~~~e~~ly~~~~~~~~~~~~~~~~~v~Dsl~NigP  514 (1192)
                      +||.+.........+.|.+.  ..+          -.+++|||.++--|..+++........+..++.+++|+.+.|+||
T Consensus       416 lr~~~l~~tiEtR~~eG~~~--l~g----------~nDeEmdD~y~apEn~l~~n~~~~v~~~~~p~d~el~~~l~n~gp  483 (1319)
T COG5161         416 LRIKSLLPTIETRASEGVGP--LEG----------GNDEEMDDEYSAPENKLFGNKEQEVRRQDEPYDAELFNALSNAGP  483 (1319)
T ss_pred             EEecccCCchhhhhhcCCCc--ccC----------CChhhhhhhhcccccccccCcccceeeccCcchhHHhhhhccCCc
Confidence            99998753221100000000  000          001223333332222333332222222335678999999999999


Q ss_pred             ccccccccccCC---CC---------CccCCCCCCceEE------------EeCCCcCEEEEEEecCCCCCCCCcccccc
Q 001003          515 LKDFSYGLRINA---DA---------SATGISKQSNYEL------------VELPGCKGIWTVYHKSSRGHNADSSRMAA  570 (1192)
Q Consensus       515 I~D~~vg~~~~~---~~---------~~sG~g~~gsL~i------------~eLpg~~~iWtv~~~~~~~~~~~~~~~~~  570 (1192)
                      |.||+||+....   +.         ...|++..+.|.+            ..+-++..+|+++.++..           
T Consensus       484 itdfavgkv~v~kglP~pN~g~l~lV~t~G~ds~~~l~V~~ts~~P~I~~~~~fi~~e~vw~~kI~g~l-----------  552 (1319)
T COG5161         484 ITDFAVGKVDVEKGLPIPNIGLLNLVVTKGSDSEAALAVEGTSLEPCICTVSSFIPLEIVWSQKIRGYL-----------  552 (1319)
T ss_pred             ccceeeeeccceecCCCCCccceeeEEeccCCCcceEEEEeccccceeeehccccchhheeehhcccee-----------
Confidence            999999986532   11         1467777788877            234578999999986421           


Q ss_pred             cCCCcceEEEEEeccceEEEEecCceeeeecccCccccCCcEEEEeeCCCCEEEEEecCcEEEEeCCc-ceeeeecCCCC
Q 001003          571 YDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSY-MTQDLSFGPSN  649 (1192)
Q Consensus       571 ~~~~~~~yLvlS~~~~T~Vl~~g~~~eEv~~~~gF~~~~~TI~ag~l~~~~~IvQVt~~~vrli~~~~-~~q~i~~~~~~  649 (1192)
                      ....--.|+++|..+.|.||+-++++.+.. +..|..+..|+.++.++.++++|||||+.+++||.+. +.+.+.+.   
T Consensus       553 r~~~~~~~~~ls~~s~S~If~~~e~f~l~~-~g~~~rd~~Tl~~~~fgee~rvVQvtp~~l~~yD~~lR~l~~~~F~---  628 (1319)
T COG5161         553 RCSRALDFYILSRVSDSRIFRWSEEFLLEV-SGEYTRDVNTLLFVEFGEENRVVQVTPSYLLRYDQDLRMLGRVEFA---  628 (1319)
T ss_pred             hhcceeeEEEeecccccceeeccccceeee-cceeeccccEEEeeeccCcceEEEecchHhhhhcccceeeeeEeec---
Confidence            112234799999999999999999998874 5689999999999999889999999999999999885 45545442   


Q ss_pred             CCCCCCCCCCcEEEEEEcCCEEEEEEeCCcEEEEEecCCCce-EeeccccccccCCCceeEEEeeccCCCCCcccccccc
Q 001003          650 SESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCT-VSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTD  728 (1192)
Q Consensus       650 ~e~~~~~~~~~Iv~asi~dpyvlv~~~dg~i~~l~~d~~~~~-l~~~~~~~l~~~~~~i~~~~l~~d~~~~~~~~~~~~~  728 (1192)
                              ..-|++.|++||++++....|.|.+|..+..+.+ +++..+..+.  +-...+.-+. |....         
T Consensus       629 --------~~~V~~~Sv~Dp~ilvv~~~g~i~~f~~~ekn~rL~k~dl~~~l~--d~k~~s~v~~-dsN~~---------  688 (1319)
T COG5161         629 --------SRAVEARSVRDPLILVVRDSGKILTFYDREKNMRLFKIDLVTCLA--DAKNKSFVLS-DSNSL---------  688 (1319)
T ss_pred             --------eeeeEEEeccCCEEEEEEecCceEEEEehhhhchhccCChHHHHH--hhhhheEecc-Ccccc---------
Confidence                    1249999999999999999999999998876655 3433332231  1112221111 11100         


Q ss_pred             cccccCccccccCCCCCCCCCCcEEEEEE-ecCCeEEEEECCCceeeEEeeccccccceecccccccccccccccccCCC
Q 001003          729 AWLSTGVGEAIDGADGGPLDQGDIYSVVC-YESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSS  807 (1192)
Q Consensus       729 ~~~~~~~~~~~~~~~~~~~~~~~~~l~v~-~~~g~l~I~sLP~~~~v~~~~~l~~~~~~l~d~~~~~~~~~~~~~~~~~~  807 (1192)
                           |.+ .++...   +. .-..++.. ..+..+--..-|.+..++.+++.+.+.+...-    .     +.      
T Consensus       689 -----g~f-~ig~~~---Sq-~e~~l~~~~~~~~q~~~~~s~~~D~~~e~dg~dQlte~~~~----~-----ty------  743 (1319)
T COG5161         689 -----GIF-DIGKRI---SQ-LEPCLVKGLPYAIQFSPEASPAMDLAGEEDGDDQLTEISMS----L-----TY------  743 (1319)
T ss_pred             -----cce-ecccch---hh-hchhhhhcCcccceeccccCcchhhccccccchhhhhHHHH----H-----HH------
Confidence                 000 000000   00 00112221 12223322334446666666544332221100    0     00      


Q ss_pred             ccCCCCCcccccccccEEEEEEeecCCCCCccEEEEEecCCcEEEEEEeeecCCCCCCCCCCCCcccccccccccccccc
Q 001003          808 EEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRL  887 (1192)
Q Consensus       808 ~~~~~~~~~~~~~~~~v~eil~~~~g~~~~~p~Llv~l~dG~l~~Y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  887 (1192)
                       .-++    +--..|.|.++++..+|.+-+.|||+.....++++.|+.+++..                           
T Consensus       744 -nl~d----~~f~lpsi~~~mVa~lg~D~keeyLf~~s~~~EI~~yk~~l~r~---------------------------  791 (1319)
T COG5161         744 -NLID----MLFRLPSIGNYMVAYLGLDLKEEYLFDNSLSSEIVFYKTHLPRH---------------------------  791 (1319)
T ss_pred             -hhhh----hhccChhhhhhhhHhhcccccchheehhhcCceEEEEeeccccc---------------------------
Confidence             0000    01124678899999999999999999999999999999985221                           


Q ss_pred             ceeeEEec-C--CCccCC-----CCCCCCCCccceEEeeccCCceEEEEcCCCCeEEEEe-CCceEEEecCCCCceeEEe
Q 001003          888 RNLRFSRT-P--LDAYTR-----EETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFT  958 (1192)
Q Consensus       888 ~~lrF~kv-~--~~~~~~-----~~~~~~~g~~~l~~f~~i~G~~gVF~~G~rP~wi~~~-~g~l~~~p~~~~~~v~~~t  958 (1192)
                        .+|.|- .  ..+...     +.+..+.-.+-...|....||+.+|++|..|..+.+. ....++.+. +.-|+.+.+
T Consensus       792 --~~f~~nvTRndlAitGaPdna~~Ka~sSV~ri~m~f~~~vghs~~fvTg~~pfl~~s~~~s~~k~f~~-gNIPlvsv~  868 (1319)
T COG5161         792 --VSFNLNVTRNDLAITGAPDNADIKAFSSVGRIDMVFIKAVGHSFMFVTGKGPFLCRSRYTSSSKAFHR-GNIPLVSVI  868 (1319)
T ss_pred             --chhhhhcchhhhhccCCCcchhhhhcccccceeEEEeeccCeEEEEEcCCccEEEEEeccCCcceeec-CCCceeeee
Confidence              123221 0  000000     0111111112234777778899999999999777663 344444454 347999999


Q ss_pred             cccCCCCCCcEEEEEecCeEEEEEcCCCCccC-CccceEEEecCCCccCeEEEecCCCEEEEEEeecCcccccccccccc
Q 001003          959 VLHNVNCNHGFIYVTSQGILKICQLPSGSTYD-NYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLI 1037 (1192)
Q Consensus       959 ~F~~~~c~~Gfi~~~~~~~LrI~~l~~~~~~d-~~~~vrk~ipL~~tp~~Iay~~~~~~y~v~~s~~~~~~~~~~~~~~~ 1037 (1192)
                      |||-    .|.+|++.....|+|++-...-|+ +.||+++ +|++.|..+++||+..++|+|....+  .++.     ..
T Consensus       869 p~s~----rgy~~Vd~~~~vr~~~~~~dn~y~gnK~p~k~-~~~~Ktlqklvyh~~~~~~~Vgsc~~--~~f~-----~~  936 (1319)
T COG5161         869 PLSK----RGYLMVDNVLGVRASQYVFDNGYVGNKNPVKR-TPKHKTLQKLVYHCAGRYMVVGSCEE--AGFS-----PK  936 (1319)
T ss_pred             eccc----ccEEEEecccceeEEEEEeccceecccCceee-ccccccccceeeeccceEEEEEeeee--cCcc-----cc
Confidence            9986    899999999999999999998887 9999999 99999999999999999999976543  3331     23


Q ss_pred             ccccccccccCCCCccccccCcceeeeEEEEEcCCCCCCCceeeeeEECCCCcceEEEEEEEec-cccCCCCceEEEEEe
Q 001003         1038 DQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLF-NTTTKENETLLAIGT 1116 (1192)
Q Consensus      1038 ~ee~~~~~~~~~~~~~~~~~~p~~~~~sv~Lldp~~~~~twe~id~~el~~~E~v~sv~~v~L~-s~~t~~~~~ylaVGT 1116 (1192)
                      +||+++.+.-     ++-...|+..+++|-|++|.    +|++||+|||++||.|++|+.+.|+ +++|+.+++||+|||
T Consensus       937 gEdgE~~i~~-----D~Nvphaeg~~~~vdL~spk----sw~vID~yef~~ne~v~~i~~~~l~~~~~tk~k~pyi~vgt 1007 (1319)
T COG5161         937 GEDGESGIPV-----DTNVPHAEGYRFYVDLYSPK----SWEVIDTYEFDENEYVFHIKYLILDDMQGTKGKSPYILVGT 1007 (1319)
T ss_pred             CCCCCccCcc-----CCCCcccccceeeEEEecCc----ceeEeeeeecccceeeeeeeeeeeeccccccCCCceEEEEe
Confidence            5554333221     12235567889999999996    9999999999999999999999999 788899999999999


Q ss_pred             ccccCcccccCceEEEEEeee---CCCCCceeEeecccCcccccchhcccCceEEEeecc
Q 001003         1117 AYVQGEDVAARGRVLLFSTGR---NADNPQNLVLSGSYGPLFSSVQIDFASHFFAICSNS 1173 (1192)
Q Consensus      1117 a~~~gEd~~~rGRIlvfev~~---~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~ 1173 (1192)
                      ++..|||.|.|||.+|||||+   +|++|+|.+   |+|-+..   |+.||-+...|-++
T Consensus      1008 t~~~gED~p~rG~~hv~eII~VVP~pg~P~t~~---KLK~~~~---Ee~kGTV~~vcEV~ 1061 (1319)
T COG5161        1008 TFIEGEDRPARGRLHVLEIISVVPSPGSPFTDC---KLKVLGI---EETKGTVVRVCEVR 1061 (1319)
T ss_pred             eecccCccCCcCceEEEEEEEecCCCCCCcccc---eeeEEeh---hhcccEEEEEEEEc
Confidence            999999999999999999985   999999999   8888877   99999998888776


No 5  
>PF10433 MMS1_N:  Mono-functional DNA-alkylating methyl methanesulfonate N-term; PDB: 2B5M_A 4A0K_C 4A0B_C 3I7L_A 2B5N_C 3I8E_A 4A09_A 4A0A_A 3EI4_C 2B5L_A ....
Probab=100.00  E-value=7.3e-55  Score=532.32  Aligned_cols=451  Identities=29%  Similarity=0.443  Sum_probs=300.1

Q ss_pred             cEEEEEeCCCeEEEEEEeCCCCCEEEEeeeeeeccccccccCCcccccCCCeEEECCCCcEEEEEEcCceEEEEeCccCC
Q 001003          131 DSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVYGLQMIILKASQGG  210 (1192)
Q Consensus       131 D~Llv~~~~aklsil~~d~~~~~l~t~Slh~~E~~~~~~~~~G~~~~~~~~~l~VDP~~Rc~~l~~y~~~L~ilP~~~~~  210 (1192)
                      |+|+|+|+++++++|+||++++++.+.++|+++.    +.+.|.|+..+|++++|||.|||+|+.+|++.+.|+|+.+..
T Consensus         1 D~L~v~tdsg~l~~l~~~~~~~~~~~~~v~~~~~----~~~~~~r~~~~G~~l~vDP~~R~i~v~a~e~~~~v~~l~~~~   76 (504)
T PF10433_consen    1 DSLVVTTDSGKLSILEYDPSTHGFFKEFVHQWEP----LSKSGSRLSQPGQYLAVDPSGRCIAVSAYEGNFLVYPLNRSL   76 (504)
T ss_dssp             -EEEEEETTTEEEEEEEEEETTEE-E-EEEEEEE-------SSSEB-TT--EEEE-TTSSEEEEEEBTTEEEEEE-SS--
T ss_pred             CEEEEEECCCCEEEEEEECCCCccceeeEEEeEe----cCCCCCChhcCCcEEEECCcCCEEEEEecCCeEEEEEecccc
Confidence            8999999999999999999999886557787755    478999999999999999999999999999999999998711


Q ss_pred             CCCCCCCCCCCCCCCccceeeceEEEEc-cccCccceeeeeecc---CCcccEEEEEEecCCCcccceeeeeeeeEE--E
Q 001003          211 SGLVGDEDTFGSGGGFSARIESSHVINL-RDLDMKHVKDFIFVH---GYIEPVMVILHERELTWAGRVSWKHHTCMI--S  284 (1192)
Q Consensus       211 ~~l~~~~~~~~~~~~~~~~~~~s~~i~l-~~ldi~~V~D~~FL~---gy~~PtlaiL~e~~~tw~gr~~~~~dt~~l--~  284 (1192)
                         +.+.   .        ....+..++ ++   .+|+||+|||   ||++||||+||++.+.|..      ..++-  .
T Consensus        77 ---~~~~---~--------~~~~~~~pi~s~---~~i~~~~FL~~~~~~~~p~la~L~~~~~~~~~------~~~y~w~~  133 (504)
T PF10433_consen   77 ---DSDI---A--------FSPHINSPIKSE---GNILDMCFLHPSVGYDNPTLAILYVDSQRRTH------LVTYEWSL  133 (504)
T ss_dssp             -----T----T--------T---EEEE--S----SEEEEEEEES---S-SS-EEEEEEEETT-EEE------EEEEE---
T ss_pred             ---cccc---c--------ccccccccccCC---ceEEEEEEEecccCCCCceEEEEEEEecccce------eEEEeeec
Confidence               0000   0        112222223 23   3999999999   9999999999999664221      11110  1


Q ss_pred             EEEEeeccceee-e--eeeeccCCcccceeEEecCCCCeEEEEecceEEEEeCCCce----eEeccccccccCCCccCcC
Q 001003          285 ALSISTTLKQHP-L--IWSAMNLPHDAYKLLAVPSPIGGVLVVGANTIHYHSQSASC----ALALNNYAVSLDSSQELPR  357 (1192)
Q Consensus       285 ~~sLdl~~k~~~-~--i~s~~~Lp~d~~~LipvP~p~GGvLVig~n~I~y~d~~~~~----~~a~N~~~~~~~~~~~~~~  357 (1192)
                      ...++...+..+ .  +|....+|   ++|||||.|.||+||++++.|+|.++....    ...++..        ...+
T Consensus       134 ~~~l~~~~~~~~~~~~l~~~~~~p---~~LIPlp~~~ggllV~~~~~i~y~~~~~~~~~~~~~~~~~~--------~~~~  202 (504)
T PF10433_consen  134 DDGLNHVISKSTLPIRLPNEDELP---SFLIPLPNPPGGLLVGGENIIIYKNHLIGSGDYSFLSIPSP--------PSSS  202 (504)
T ss_dssp             -----EETTTTEEEE--EEEE-TT---EEEEEE-TTT-SEEEEESSEEEEEE------TTEEEEE--H---------HHH
T ss_pred             ccccceeeeeccccccccccCCCc---cEEEEcCCCCcEEEEECCEEEEEecccccccccccccccCC--------ccCC
Confidence            222333333222 2  66666677   999999999999999999999999764321    1111100        0001


Q ss_pred             CCceeEeece---eeEEeeCcEEEEEcCCCCEEEEEEEECCceEeeEEEEecCC-CccccceEEecCC--eEEEEeeeCC
Q 001003          358 SSFSVELDAA---HATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNP-SVLTSDITTIGNS--LFFLGSRLGD  431 (1192)
Q Consensus       358 ~~~~l~l~~~---~~~~~~~~~~Ll~~~~G~L~~l~l~~dg~~V~~l~l~~~~~-~~~~s~l~~l~~g--~lFvGS~~GD  431 (1192)
                      ..+.......   .....+.+++||++++|+||+|.+..+++   +++++++|+ ++++++++++++|  +||+||++||
T Consensus       203 ~~~~~~~~~p~~~~~~~~~~~~~lL~~e~G~l~~l~l~~~~~---~i~i~~~g~~~~~~s~l~~l~~g~d~lf~gs~~gd  279 (504)
T PF10433_consen  203 SSLWTSWARPERNISYDKDGDRILLQDEDGDLYLLTLDNDGG---SISITYLGTLCSIASSLTYLKNGGDYLFVGSEFGD  279 (504)
T ss_dssp             TS-EEEEEE------SSTTSSEEEEEETTSEEEEEEEEEEEE---EEEEEEEEE--S-ESEEEEESTT--EEEEEESSS-
T ss_pred             CceEEEEEeccccceecCCCCEEEEEeCCCeEEEEEEEECCC---eEEEEEcCCcCChhheEEEEcCCCEEEEEEEecCC
Confidence            1111110000   00124567999999999999999999877   799999999 9999999999999  9999999999


Q ss_pred             eEEEEEeeCCCcccccCCCccccCCcccCCccchhccCCCcchhhcccCccccccccCCCCCccccccceeEEEEeeecc
Q 001003          432 SLLVQFTCGSGTSMLSSGLKEEFGDIEADAPSTKRLRRSSSDALQDMVNGEELSLYGSASNNTESAQKTFSFAVRDSLVN  511 (1192)
Q Consensus       432 S~Ll~~~~~~~~~~~~~~~~~~~~~~~~~~~~~k~~~~~~~~~~d~~~~~~e~~ly~~~~~~~~~~~~~~~~~v~Dsl~N  511 (1192)
                      |+|+|+..                                                             ..++++|+++|
T Consensus       280 s~l~~~~~-------------------------------------------------------------~~l~~~~~~~N  298 (504)
T PF10433_consen  280 SQLLQISL-------------------------------------------------------------SNLEVLDSLPN  298 (504)
T ss_dssp             EEEEEEES-------------------------------------------------------------ESEEEEEEE--
T ss_pred             cEEEEEeC-------------------------------------------------------------CCcEEEEeccC
Confidence            99999963                                                             24889999999


Q ss_pred             cCCccccccccccCC--C--------CCccCCCCCCceEE--------------EeCCCcCEEEEEEecCCCCCCCCccc
Q 001003          512 IGPLKDFSYGLRINA--D--------ASATGISKQSNYEL--------------VELPGCKGIWTVYHKSSRGHNADSSR  567 (1192)
Q Consensus       512 igPI~D~~vg~~~~~--~--------~~~sG~g~~gsL~i--------------~eLpg~~~iWtv~~~~~~~~~~~~~~  567 (1192)
                      +|||.||++++....  .        .+|||.|++|+|++              .+|||+++||+++.+           
T Consensus       299 ~~Pi~D~~v~~~~~~~~~~~~~~~~lv~~sG~g~~gsL~~lr~Gi~~~~~~~~~~~l~~v~~iW~l~~~-----------  367 (504)
T PF10433_consen  299 WGPIVDFCVVDSSNSGQPSNPSSDQLVACSGAGKRGSLRILRNGIGIEGLELASSELPGVTGIWTLKLS-----------  367 (504)
T ss_dssp             --SEEEEEEE-TSSSSS-------EEEEEESSGGG-EEEEEEESBEEE--EEEEEEESTEEEEEEE-SS-----------
T ss_pred             cCCccceEEeccccCCCCcccccceEEEEECcCCCCcEEEEeccCCceeeeeeccCCCCceEEEEeeec-----------
Confidence            999999999865322  1        15999999999988              368899999999864           


Q ss_pred             ccccCCCcceEEEEEeccceEEEEec-----CceeeeecccCccccCCcEEEEeeCCCCEEEEEecCcEEEEeCC--cce
Q 001003          568 MAAYDDEYHAYLIISLEARTMVLETA-----DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS--YMT  640 (1192)
Q Consensus       568 ~~~~~~~~~~yLvlS~~~~T~Vl~~g-----~~~eEv~~~~gF~~~~~TI~ag~l~~~~~IvQVt~~~vrli~~~--~~~  640 (1192)
                         ..+  |+|||+|+.++|+||+++     ++++|++.. ||.++++||+||++ ++++|||||+++||+++..  +..
T Consensus       368 ---~~~--~~~lv~S~~~~T~vl~~~~~d~~e~~~e~~~~-~f~~~~~Tl~~~~~-~~~~ivQVt~~~i~l~~~~~~~~~  440 (504)
T PF10433_consen  368 ---SSD--HSYLVLSFPNETRVLQISEGDDGEEVEEVEED-GFDTDEPTLAAGNV-GDGRIVQVTPKGIRLIDLEDGKLT  440 (504)
T ss_dssp             ---SSS--BSEEEEEESSEEEEEEES----SSEEEEE----TS-SSS-EEEEEEE-TTTEEEEEESSEEEEEESSSTSEE
T ss_pred             ---CCC--ceEEEEEcCCceEEEEEecccCCcchhhhhhc-cCCCCCCCeEEEEc-CCCeEEEEecCeEEEEECCCCeEE
Confidence               122  899999999999999984     567777434 99999999999999 5899999999999999843  233


Q ss_pred             eeeecCCCCCCCCCCCCCCcEEEEEEcCCEEEEEEeCCcEEEEEecCCCceEeeccccccccCCCceeEEEe
Q 001003          641 QDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTL  712 (1192)
Q Consensus       641 q~i~~~~~~~e~~~~~~~~~Iv~asi~dpyvlv~~~dg~i~~l~~d~~~~~l~~~~~~~l~~~~~~i~~~~l  712 (1192)
                      +         +|.+| .+..|++|+++++|++|++.++.+.+|+++......+......+. ...+|+|+.+
T Consensus       441 ~---------~w~~~-~~~~I~~a~~~~~~v~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~eis~l~i  501 (504)
T PF10433_consen  441 Q---------EWKPP-AGSIIVAASINDPQVLVALSGGELVYFELDDNKISVSDNDETILE-LDNEISCLSI  501 (504)
T ss_dssp             E---------EEE-T-TS---SEEEESSSEEEEEE-TTEEEEEEEETTEEEEEEE----EE--SS-EEEEE-
T ss_pred             E---------EEeCC-CCCeEEEEEECCCEEEEEEeCCcEEEEEEECCceeeeeecccccc-CCCceEEEEe
Confidence            3         34444 677899999999999999999999999998765433332221111 2668888754


No 6  
>PF03178 CPSF_A:  CPSF A subunit region;  InterPro: IPR004871 This family includes a region that lies towards the C terminus of the cleavage and polyadenylation specificity factor (CPSF) A (160 kDa) subunit. CPSF is involved in mRNA polyadenylation and binds the AAUAAA conserved sequence in pre-mRNA. CPSF has also been found to be necessary for splicing of single-intron pre-mRNAs []. The function of the aligned region is unknown but may be involved in RNA/DNA binding.; GO: 0003676 nucleic acid binding, 0005634 nucleus; PDB: 2B5M_A 4A0K_C 4A0B_C 3I7L_A 3I8E_A 4A09_A 4A0A_A 3EI4_C 2B5L_A 3I7O_A ....
Probab=99.35  E-value=2.1e-12  Score=149.47  Aligned_cols=114  Identities=18%  Similarity=0.380  Sum_probs=89.9

Q ss_pred             eeEEEEEcCCCCCCCceeeeeEECCCCcceEEEEEEEeccccCCCCceEEEEEeccccCcccccC-ceEEEEEeeeCC--
Q 001003         1063 EYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAAR-GRVLLFSTGRNA-- 1139 (1192)
Q Consensus      1063 ~~sv~Lldp~~~~~twe~id~~el~~~E~v~sv~~v~L~s~~t~~~~~ylaVGTa~~~gEd~~~r-GRIlvfev~~~~-- 1139 (1192)
                      +|+|||+||.    +|+++++|+|+++|+++|++.++|.+..+ +.++||||||++..+|+..++ |||++|++.+.+  
T Consensus         1 ~s~i~l~d~~----~~~~~~~~~l~~~E~~~s~~~~~l~~~~~-~~~~~ivVGT~~~~~~~~~~~~Gri~v~~i~~~~~~   75 (321)
T PF03178_consen    1 ASSIRLVDPT----TFEVLDSFELEPNEHVTSLCSVKLKGDST-GKKEYIVVGTAFNYGEDPEPSSGRILVFEISESPEN   75 (321)
T ss_dssp             --EEEEEETT----TSSEEEEEEEETTEEEEEEEEEEETTS----SSEEEEEEEEE--TTSSS-S-EEEEEEEECSS---
T ss_pred             CcEEEEEeCC----CCeEEEEEECCCCceEEEEEEEEEcCccc-cccCEEEEEecccccccccccCcEEEEEEEEccccc
Confidence            4789999995    99999999999999999999999984433 458999999999999999888 999999999863  


Q ss_pred             -CCCceeEeecccCcccccchhcccCceEEEeecceEEeeehhhhe
Q 001003         1140 -DNPQNLVLSGSYGPLFSSVQIDFASHFFAICSNSFVFVFLFSFLR 1184 (1192)
Q Consensus      1140 -~~~e~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~~~~~ 1184 (1192)
                       .+++.++.+...+++++..+  ++|. +++|.++++++|.|.--.
T Consensus        76 ~~~l~~i~~~~~~g~V~ai~~--~~~~-lv~~~g~~l~v~~l~~~~  118 (321)
T PF03178_consen   76 NFKLKLIHSTEVKGPVTAICS--FNGR-LVVAVGNKLYVYDLDNSK  118 (321)
T ss_dssp             --EEEEEEEEEESS-EEEEEE--ETTE-EEEEETTEEEEEEEETTS
T ss_pred             ceEEEEEEEEeecCcceEhhh--hCCE-EEEeecCEEEEEEccCcc
Confidence             37788887777777777665  8888 777778999999886444


No 7  
>KOG2055 consensus WD40 repeat protein [General function prediction only]
Probab=95.67  E-value=0.76  Score=53.79  Aligned_cols=223  Identities=17%  Similarity=0.269  Sum_probs=130.8

Q ss_pred             CccEEEEEecCCcEEEEEEeeecCCCCCCCCCCCCccccccccccccccccceeeEEecCCCccCCCCCCCCCCccceEE
Q 001003          837 SRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITI  916 (1192)
Q Consensus       837 ~~p~Llv~l~dG~l~~Y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lrF~kv~~~~~~~~~~~~~~g~~~l~~  916 (1192)
                      .+|.|++.=-||.|-+|++.   +.                     ..+++..++|.|.|.            ...   .
T Consensus       224 ~~plllvaG~d~~lrifqvD---Gk---------------------~N~~lqS~~l~~fPi------------~~a---~  264 (514)
T KOG2055|consen  224 TAPLLLVAGLDGTLRIFQVD---GK---------------------VNPKLQSIHLEKFPI------------QKA---E  264 (514)
T ss_pred             CCceEEEecCCCcEEEEEec---Cc---------------------cChhheeeeeccCcc------------cee---e
Confidence            57877776679999999984   11                     123455778877653            111   1


Q ss_pred             eeccCCceEEEEcCCCCeEEEE---eCCceEEEecCCCCceeEEecccCCCCCCcEEEEEec-CeEEEEEcCCCCccCCc
Q 001003          917 FKNISGHQGFFLSGSRPCWCMV---FRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQ-GILKICQLPSGSTYDNY  992 (1192)
Q Consensus       917 f~~i~G~~gVF~~G~rP~wi~~---~~g~l~~~p~~~~~~v~~~t~F~~~~c~~Gfi~~~~~-~~LrI~~l~~~~~~d~~  992 (1192)
                      | -.+|.+-||..|.|+++-.+   +..--.++|+... +=.+|-.|--..|.+ ||++... |.+.+...-...     
T Consensus       265 f-~p~G~~~i~~s~rrky~ysyDle~ak~~k~~~~~g~-e~~~~e~FeVShd~~-fia~~G~~G~I~lLhakT~e-----  336 (514)
T KOG2055|consen  265 F-APNGHSVIFTSGRRKYLYSYDLETAKVTKLKPPYGV-EEKSMERFEVSHDSN-FIAIAGNNGHIHLLHAKTKE-----  336 (514)
T ss_pred             e-cCCCceEEEecccceEEEEeeccccccccccCCCCc-ccchhheeEecCCCC-eEEEcccCceEEeehhhhhh-----
Confidence            2 22787899999999944334   2444556676442 233455554444555 7777775 555555544433     


Q ss_pred             cceEEEecCCCccCeEEEecCCCEEEEEEeecCccccccccccccccccccccccCCCCccccccCcceeeeEEEEEcCC
Q 001003          993 WPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPD 1072 (1192)
Q Consensus       993 ~~vrk~ipL~~tp~~Iay~~~~~~y~v~~s~~~~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~~~p~~~~~sv~Lldp~ 1072 (1192)
                       .+.. +.+..-.+-++++..++.+.+.+...               +                    +.     +.+-.
T Consensus       337 -li~s-~KieG~v~~~~fsSdsk~l~~~~~~G---------------e--------------------V~-----v~nl~  374 (514)
T KOG2055|consen  337 -LITS-FKIEGVVSDFTFSSDSKELLASGGTG---------------E--------------------VY-----VWNLR  374 (514)
T ss_pred             -hhhe-eeeccEEeeEEEecCCcEEEEEcCCc---------------e--------------------EE-----EEecC
Confidence             5666 77888889999998887777754321               0                    01     11110


Q ss_pred             CCCCCceeeeeEECCCCcceEEEEEEEeccccCCCCceEEEEEeccccCcccccCceEEEEEeee----CCCCCceeEee
Q 001003         1073 RAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGR----NADNPQNLVLS 1148 (1192)
Q Consensus      1073 ~~~~twe~id~~el~~~E~v~sv~~v~L~s~~t~~~~~ylaVGTa~~~gEd~~~rGRIlvfev~~----~~~~~e~~~~~ 1148 (1192)
                          .-..+.++.=+..=+.++++. .+       ...|+|+|         +-+|-+=||+-..    .-.+|.+..  
T Consensus       375 ----~~~~~~rf~D~G~v~gts~~~-S~-------ng~ylA~G---------S~~GiVNIYd~~s~~~s~~PkPik~~--  431 (514)
T KOG2055|consen  375 ----QNSCLHRFVDDGSVHGTSLCI-SL-------NGSYLATG---------SDSGIVNIYDGNSCFASTNPKPIKTV--  431 (514)
T ss_pred             ----CcceEEEEeecCccceeeeee-cC-------CCceEEec---------cCcceEEEeccchhhccCCCCchhhh--
Confidence                001222222222225555432 12       24599999         5789999998643    333444322  


Q ss_pred             cccCcccccchhccc--CceEEEeecc
Q 001003         1149 GSYGPLFSSVQIDFA--SHFFAICSNS 1173 (1192)
Q Consensus      1149 ~~~~~~~~~~~~~~~--~~~~a~~~~~ 1173 (1192)
                      .-+....+++|  ||  +++||+|+.-
T Consensus       432 dNLtt~Itsl~--Fn~d~qiLAiaS~~  456 (514)
T KOG2055|consen  432 DNLTTAITSLQ--FNHDAQILAIASRV  456 (514)
T ss_pred             hhhheeeeeee--eCcchhhhhhhhhc
Confidence            24556667888  65  7899999865


No 8  
>cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto
Probab=95.28  E-value=5.4  Score=43.14  Aligned_cols=31  Identities=19%  Similarity=0.304  Sum_probs=24.7

Q ss_pred             CcEEEEEEcC--CEEEEEEeCCcEEEEEecCCC
Q 001003          659 STVLSVSIAD--PYVLLGMSDGSIRLLVGDPST  689 (1192)
Q Consensus       659 ~~Iv~asi~d--pyvlv~~~dg~i~~l~~d~~~  689 (1192)
                      ..|.++++..  .+++.+..||.+.+|..+...
T Consensus        10 ~~i~~~~~~~~~~~l~~~~~~g~i~i~~~~~~~   42 (289)
T cd00200          10 GGVTCVAFSPDGKLLATGSGDGTIKVWDLETGE   42 (289)
T ss_pred             CCEEEEEEcCCCCEEEEeecCcEEEEEEeeCCC
Confidence            4588888865  788888889999999887543


No 9  
>KOG1539 consensus WD repeat protein [General function prediction only]
Probab=94.95  E-value=8.6  Score=48.47  Aligned_cols=81  Identities=12%  Similarity=0.181  Sum_probs=56.3

Q ss_pred             EEEEEEc--CCEEEEEEeCCcEEEEEecCCCceEeeccccccccCCCceeEEEeeccCCCCCcccccccccccccCcccc
Q 001003          661 VLSVSIA--DPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEA  738 (1192)
Q Consensus       661 Iv~asi~--dpyvlv~~~dg~i~~l~~d~~~~~l~~~~~~~l~~~~~~i~~~~l~~d~~~~~~~~~~~~~~~~~~~~~~~  738 (1192)
                      |+++.-+  =.-|+|++.+|+|.+|.+.-+..+.+..-      ...+|+++++-+|                       
T Consensus       205 IT~ieqsPaLDVVaiG~~~G~ViifNlK~dkil~sFk~------d~g~VtslSFrtD-----------------------  255 (910)
T KOG1539|consen  205 ITAIEQSPALDVVAIGLENGTVIIFNLKFDKILMSFKQ------DWGRVTSLSFRTD-----------------------  255 (910)
T ss_pred             eeEeccCCcceEEEEeccCceEEEEEcccCcEEEEEEc------cccceeEEEeccC-----------------------
Confidence            6655432  24688999999999999876544333221      1356888776433                       


Q ss_pred             ccCCCCCCCCCCcEEEEEEecCCeEEEEECCCceeeEEeeccc
Q 001003          739 IDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFV  781 (1192)
Q Consensus       739 ~~~~~~~~~~~~~~~l~v~~~~g~l~I~sLP~~~~v~~~~~l~  781 (1192)
                                 +...++.+..+|.|.+|.|.+-+++....+..
T Consensus       256 -----------G~p~las~~~~G~m~~wDLe~kkl~~v~~nah  287 (910)
T KOG1539|consen  256 -----------GNPLLASGRSNGDMAFWDLEKKKLINVTRNAH  287 (910)
T ss_pred             -----------CCeeEEeccCCceEEEEEcCCCeeeeeeeccc
Confidence                       23578889999999999999988876665443


No 10 
>PF08596 Lgl_C:  Lethal giant larvae(Lgl) like, C-terminal;  InterPro: IPR013905  The Lethal giant larvae (Lgl) tumour suppressor protein is conserved from yeast to mammals. The Lgl protein functions in cell polarity, at least in part, by regulating SNARE-mediated membrane delivery events at the cell surface []. The N-terminal half of Lgl members contains WD40 repeats (see IPR001680 from INTERPRO), while the C-terminal half appears specific to the protein []. ; PDB: 2OAJ_A.
Probab=94.77  E-value=4.2  Score=48.66  Aligned_cols=75  Identities=17%  Similarity=0.357  Sum_probs=46.7

Q ss_pred             EEEEEecCCeEEEEECCCceeeEEeeccccccceecccccccccccccccccCCCccCCCCCcccccccccEEEEEEeec
Q 001003          753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRW  832 (1192)
Q Consensus       753 ~l~v~~~~g~l~I~sLP~~~~v~~~~~l~~~~~~l~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~eil~~~~  832 (1192)
                      +++|+.++|.|.|+.+--=.++|... +   .+....                          +.....+...|--+..+
T Consensus        99 Fvaigy~~G~l~viD~RGPavI~~~~-i---~~~~~~--------------------------~~~~~~vt~ieF~vm~~  148 (395)
T PF08596_consen   99 FVAIGYESGSLVVIDLRGPAVIYNEN-I---RESFLS--------------------------KSSSSYVTSIEFSVMTL  148 (395)
T ss_dssp             EEEEEETTSEEEEEETTTTEEEEEEE-G---GG--T---------------------------SS----EEEEEEEEEE-
T ss_pred             EEEEEecCCcEEEEECCCCeEEeecc-c---cccccc--------------------------cccccCeeEEEEEEEec
Confidence            88899999999999986555566532 2   110000                          00000122233344456


Q ss_pred             CCCC-CccEEEEEecCCcEEEEEEee
Q 001003          833 SAHH-SRPFLFAILTDGTILCYQAYL  857 (1192)
Q Consensus       833 g~~~-~~p~Llv~l~dG~l~~Y~~~~  857 (1192)
                      +++. ..|.|+|++..|++++|++.+
T Consensus       149 ~~D~ySSi~L~vGTn~G~v~~fkIlp  174 (395)
T PF08596_consen  149 GGDGYSSICLLVGTNSGNVLTFKILP  174 (395)
T ss_dssp             TTSSSEEEEEEEEETTSEEEEEEEEE
T ss_pred             CCCcccceEEEEEeCCCCEEEEEEec
Confidence            6554 789999999999999999975


No 11 
>KOG1274 consensus WD40 repeat protein [General function prediction only]
Probab=94.60  E-value=4.6  Score=51.26  Aligned_cols=122  Identities=15%  Similarity=0.241  Sum_probs=79.8

Q ss_pred             CCCceeEEecccCCCCCCc-EEEEEe-cCeEEEEEcCCCCc---cCCccceEEEecCCCccCeEEEecCCCEEEEEEeec
Q 001003          950 CDGSIVAFTVLHNVNCNHG-FIYVTS-QGILKICQLPSGST---YDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVP 1024 (1192)
Q Consensus       950 ~~~~v~~~t~F~~~~c~~G-fi~~~~-~~~LrI~~l~~~~~---~d~~~~vrk~ipL~~tp~~Iay~~~~~~y~v~~s~~ 1024 (1192)
                      .+++|.+..=+     |+| |+.+.+ +|.++|-.+.+..-   +..-.+..- .-+.+...+.++||.++.+++.+.+ 
T Consensus       137 h~apVl~l~~~-----p~~~fLAvss~dG~v~iw~~~~~~~~~tl~~v~k~n~-~~~s~i~~~~aW~Pk~g~la~~~~d-  209 (933)
T KOG1274|consen  137 HDAPVLQLSYD-----PKGNFLAVSSCDGKVQIWDLQDGILSKTLTGVDKDNE-FILSRICTRLAWHPKGGTLAVPPVD-  209 (933)
T ss_pred             cCCceeeeeEc-----CCCCEEEEEecCceEEEEEcccchhhhhcccCCcccc-ccccceeeeeeecCCCCeEEeeccC-
Confidence            34677776542     333 454433 69999999986531   122222222 3334457889999999999996432 


Q ss_pred             CccccccccccccccccccccccCCCCccccccCcceeeeEEEEEcCCCCCCCceeeeeEECCCCcceEEEEEEEecccc
Q 001003         1025 VLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTT 1104 (1192)
Q Consensus      1025 ~~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~~~p~~~~~sv~Lldp~~~~~twe~id~~el~~~E~v~sv~~v~L~s~~ 1104 (1192)
                                                             .+|+++++.    +|+..-.+.....+..  +..+++.   
T Consensus       210 ---------------------------------------~~Vkvy~r~----~we~~f~Lr~~~~ss~--~~~~~ws---  241 (933)
T KOG1274|consen  210 ---------------------------------------NTVKVYSRK----GWELQFKLRDKLSSSK--FSDLQWS---  241 (933)
T ss_pred             ---------------------------------------CeEEEEccC----Cceeheeecccccccc--eEEEEEc---
Confidence                                                   268899985    9998655555555554  4555553   


Q ss_pred             CCCCceEEEEEeccccCcccccCceEEEEEeee
Q 001003         1105 TKENETLLAIGTAYVQGEDVAARGRVLLFSTGR 1137 (1192)
Q Consensus      1105 t~~~~~ylaVGTa~~~gEd~~~rGRIlvfev~~ 1137 (1192)
                        ....|||-||         ..|.|+||++..
T Consensus       242 --PnG~YiAAs~---------~~g~I~vWnv~t  263 (933)
T KOG1274|consen  242 --PNGKYIAAST---------LDGQILVWNVDT  263 (933)
T ss_pred             --CCCcEEeeec---------cCCcEEEEeccc
Confidence              2467999995         688999999975


No 12 
>KOG2055 consensus WD40 repeat protein [General function prediction only]
Probab=93.83  E-value=4.1  Score=47.97  Aligned_cols=44  Identities=9%  Similarity=0.277  Sum_probs=39.4

Q ss_pred             cCeEEEEEcCCCCccCCccceEEEecCCCccCeEEEecCCCEEEEEE
Q 001003          975 QGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIV 1021 (1192)
Q Consensus       975 ~~~LrI~~l~~~~~~d~~~~vrk~ipL~~tp~~Iay~~~~~~y~v~~ 1021 (1192)
                      ++.||+..+|+-..| ..||.+. -|++. +|.++++|.++.++|+.
T Consensus       458 knalrLVHvPS~TVF-sNfP~~n-~~vg~-vtc~aFSP~sG~lAvGN  501 (514)
T KOG2055|consen  458 KNALRLVHVPSCTVF-SNFPTSN-TKVGH-VTCMAFSPNSGYLAVGN  501 (514)
T ss_pred             ccceEEEeccceeee-ccCCCCC-Ccccc-eEEEEecCCCceEEeec
Confidence            369999999999888 7899998 99988 89999999999999964


No 13 
>PF14727 PHTB1_N:  PTHB1 N-terminus
Probab=93.36  E-value=4.8  Score=48.33  Aligned_cols=95  Identities=14%  Similarity=0.120  Sum_probs=61.8

Q ss_pred             CCceEEEEcCCCCeEEEEeCCceEEEecCCCCceeEEecccC--CCCCC---cEEEEEecCeEEEEEcCCCCc----cCC
Q 001003          921 SGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHN--VNCNH---GFIYVTSQGILKICQLPSGST----YDN  991 (1192)
Q Consensus       921 ~G~~gVF~~G~rP~wi~~~~g~l~~~p~~~~~~v~~~t~F~~--~~c~~---Gfi~~~~~~~LrI~~l~~~~~----~d~  991 (1192)
                      +..+.|+|.|+|-.+.+..+|.+++.-- .+-.-.||++|..  .+-++   -++..+.++.|.|.+=..+..    -..
T Consensus       251 ~~~~~IvvLger~Lf~l~~~G~l~~~kr-Ld~~p~~~~~Y~~~~~~~~~~~~~llV~t~t~~LlVy~d~~L~WsA~l~~~  329 (418)
T PF14727_consen  251 SSESDIVVLGERSLFCLKDNGSLRFQKR-LDYNPSCFCPYRVPWYNEPSTRLNLLVGTHTGTLLVYEDTTLVWSAQLPHV  329 (418)
T ss_pred             CCCceEEEEecceEEEEcCCCeEEEEEe-cCCceeeEEEEEeecccCCCCceEEEEEecCCeEEEEeCCeEEEecCCCCC
Confidence            3667899999999888888999998663 5677889999998  33333   277778888888877333210    013


Q ss_pred             ccceEEEecCCCccCeEEEecCCCEE
Q 001003          992 YWPVQKVIPLKATPHQITYFAEKNLY 1017 (1192)
Q Consensus       992 ~~~vrk~ipL~~tp~~Iay~~~~~~y 1017 (1192)
                      +..++. -.+...+--|+-..+.+..
T Consensus       330 PVal~v-~~~~~~~G~IV~Ls~~G~L  354 (418)
T PF14727_consen  330 PVALSV-ANFNGLKGLIVSLSDEGQL  354 (418)
T ss_pred             CEEEEe-cccCCCCceEEEEcCCCcE
Confidence            334444 4444444445554444443


No 14 
>cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto
Probab=92.76  E-value=16  Score=39.27  Aligned_cols=47  Identities=15%  Similarity=0.249  Sum_probs=31.6

Q ss_pred             CcEEEEEe-cCeEEEEEcCCCCccCCccceEEEecCCC-ccCeEEEecCCCEEEEE
Q 001003          967 HGFIYVTS-QGILKICQLPSGSTYDNYWPVQKVIPLKA-TPHQITYFAEKNLYPLI 1020 (1192)
Q Consensus       967 ~Gfi~~~~-~~~LrI~~l~~~~~~d~~~~vrk~ipL~~-tp~~Iay~~~~~~y~v~ 1020 (1192)
                      +.+++... ++.+++..+....      +++. +.... .+..+++++..+.++++
T Consensus       147 ~~~l~~~~~~~~i~i~d~~~~~------~~~~-~~~~~~~i~~~~~~~~~~~l~~~  195 (289)
T cd00200         147 GTFVASSSQDGTIKLWDLRTGK------CVAT-LTGHTGEVNSVAFSPDGEKLLSS  195 (289)
T ss_pred             CCEEEEEcCCCcEEEEEccccc------ccee-EecCccccceEEECCCcCEEEEe
Confidence            45666665 7889998887543      4455 55444 57888898887666664


No 15 
>PF14727 PHTB1_N:  PTHB1 N-terminus
Probab=91.74  E-value=30  Score=41.73  Aligned_cols=75  Identities=24%  Similarity=0.333  Sum_probs=58.9

Q ss_pred             eEEEEcCCeEEEEEEEEeccCCccccCCccccccccccccccccEEEEEEEEeeeEEeEEEEEecCCCCCCCCccEEEEE
Q 001003           57 NLVVTAANVIEIYVVRVQEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILA  136 (1192)
Q Consensus        57 nLVvak~n~LeIy~v~~~~~g~~~~~~~~~~~~~~~~~~~~~~~L~lv~e~~l~G~I~~l~~~r~~~~~~~~~~D~Llv~  136 (1192)
                      .|+|--.+.|.||.+... +|..+.              .+..+|+++.++.|--..-+|..-++.+..   ++|.|.|=
T Consensus        90 ~LaVLhP~kl~vY~v~~~-~g~~~~--------------g~~~~L~~~yeh~l~~~a~nm~~G~Fgg~~---~~~~IcVQ  151 (418)
T PF14727_consen   90 QLAVLHPRKLSVYSVSLV-DGTVEH--------------GNQYQLELIYEHSLQRTAYNMCCGPFGGVK---GRDFICVQ  151 (418)
T ss_pred             eEEEecCCEEEEEEEEec-CCCccc--------------CcEEEEEEEEEEecccceeEEEEEECCCCC---CceEEEEE
Confidence            788889999999999532 121000              123569999999999999999998988762   58999999


Q ss_pred             eCCCeEEEEEEeC
Q 001003          137 FEDAKISVLEFDD  149 (1192)
Q Consensus       137 ~~~aklsil~~d~  149 (1192)
                      +-||+|++.+.|.
T Consensus       152 S~DG~L~~feqe~  164 (418)
T PF14727_consen  152 SMDGSLSFFEQES  164 (418)
T ss_pred             ecCceEEEEeCCc
Confidence            9999999988763


No 16 
>PF14783 BBS2_Mid:  Ciliary BBSome complex subunit 2, middle region
Probab=90.76  E-value=5.8  Score=38.45  Aligned_cols=87  Identities=16%  Similarity=0.289  Sum_probs=55.7

Q ss_pred             cCcEEEEeCCcceeeeecCCCCCCCCCCCCCCcEEEEEEcCCEEEEEEeCCcEEEEEecCCCceEeeccccccccCCCce
Q 001003          628 ERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPV  707 (1192)
Q Consensus       628 ~~~vrli~~~~~~q~i~~~~~~~e~~~~~~~~~Iv~asi~dpyvlv~~~dg~i~~l~~d~~~~~l~~~~~~~l~~~~~~i  707 (1192)
                      ...||+++.+..+.++.-.           +.-+.-+.+...+.+-++++|+|.+|....  .+-.+.       ..+++
T Consensus        24 D~~IRvf~~~e~~~Ei~e~-----------~~v~~L~~~~~~~F~Y~l~NGTVGvY~~~~--RlWRiK-------SK~~~   83 (111)
T PF14783_consen   24 DFEIRVFKGDEIVAEITET-----------DKVTSLCSLGGGRFAYALANGTVGVYDRSQ--RLWRIK-------SKNQV   83 (111)
T ss_pred             CcEEEEEeCCcEEEEEecc-----------cceEEEEEcCCCEEEEEecCCEEEEEeCcc--eeeeec-------cCCCe
Confidence            5568998888766554431           112344556778899999999999997632  112222       14457


Q ss_pred             eEEEeeccCCCCCcccccccccccccCccccccCCCCCCCCCCcEEEEEEecCCeEEE
Q 001003          708 SSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEI  765 (1192)
Q Consensus       708 ~~~~l~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~v~~~~g~l~I  765 (1192)
                      .|++.| |...                              ....=++++|.||.+++
T Consensus        84 ~~~~~~-D~~g------------------------------dG~~eLI~GwsnGkve~  110 (111)
T PF14783_consen   84 TSMAFY-DING------------------------------DGVPELIVGWSNGKVEV  110 (111)
T ss_pred             EEEEEE-cCCC------------------------------CCceEEEEEecCCeEEe
Confidence            777777 4310                              12346889999999975


No 17 
>KOG0294 consensus WD40 repeat-containing protein [Function unknown]
Probab=89.78  E-value=41  Score=38.27  Aligned_cols=65  Identities=15%  Similarity=0.228  Sum_probs=46.6

Q ss_pred             CceEEEEEeccccCcccccCceEEEEEeeeCCCCCceeEeecccCcccccchhcccCceEEEeecc-eEEeeehhhh
Q 001003         1108 NETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSYGPLFSSVQIDFASHFFAICSNS-FVFVFLFSFL 1183 (1192)
Q Consensus      1108 ~~~ylaVGTa~~~gEd~~~rGRIlvfev~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~-~~~~~~~~~~ 1183 (1192)
                      .+.+++||     |+|    +-|.+||-.+..---+.+.++...|..+. + .+--+.+|++.++- ++.||--.|.
T Consensus       218 ~~~~L~vG-----~d~----~~i~~~D~ds~~~~~~~~AH~~RVK~i~~-~-~~~~~~~lvTaSSDG~I~vWd~~~~  283 (362)
T KOG0294|consen  218 DGSELLVG-----GDN----EWISLKDTDSDTPLTEFLAHENRVKDIAS-Y-TNPEHEYLVTASSDGFIKVWDIDME  283 (362)
T ss_pred             CCceEEEe-----cCC----ceEEEeccCCCccceeeecchhheeeeEE-E-ecCCceEEEEeccCceEEEEEcccc
Confidence            35788888     444    78899988863333466777888888875 2 23346788888776 9999988876


No 18 
>KOG0318 consensus WD40 repeat stress protein/actin interacting protein [Cytoskeleton]
Probab=87.47  E-value=72  Score=38.70  Aligned_cols=145  Identities=14%  Similarity=0.147  Sum_probs=89.6

Q ss_pred             eEEEEEeccceEEEEec-CceeeeecccCccccCCcEEEEeeCCCCEEEEEecCcEEEEeCCcceeeeecCCCCCCCCCC
Q 001003          577 AYLIISLEARTMVLETA-DLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSG  655 (1192)
Q Consensus       577 ~yLvlS~~~~T~Vl~~g-~~~eEv~~~~gF~~~~~TI~ag~l~~~~~IvQVt~~~vrli~~~~~~q~i~~~~~~~e~~~~  655 (1192)
                      .++.++..+.-++..+. ..+   +...-+....+-+.++...++...|-+|-++|.++........+++.         
T Consensus       376 ~~~t~g~Dd~l~~~~~~~~~~---t~~~~~~lg~QP~~lav~~d~~~avv~~~~~iv~l~~~~~~~~~~~~---------  443 (603)
T KOG0318|consen  376 ELFTIGWDDTLRVISLKDNGY---TKSEVVKLGSQPKGLAVLSDGGTAVVACISDIVLLQDQTKVSSIPIG---------  443 (603)
T ss_pred             cEEEEecCCeEEEEecccCcc---cccceeecCCCceeEEEcCCCCEEEEEecCcEEEEecCCcceeeccc---------
Confidence            45555555555565542 222   11112345555666666656678888999999999866555555541         


Q ss_pred             CCCCcEEEEEEcCCEEEEEEeCCcEEEEEecCCCceEeeccccccccCCCceeEEEeeccCCCCCcccccccccccccCc
Q 001003          656 SENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGV  735 (1192)
Q Consensus       656 ~~~~~Iv~asi~dpyvlv~~~dg~i~~l~~d~~~~~l~~~~~~~l~~~~~~i~~~~l~~d~~~~~~~~~~~~~~~~~~~~  735 (1192)
                       -....++.+--..+++|+-.||.|.+|.+......-+.....    ....|++++.-.|                    
T Consensus       444 -y~~s~vAv~~~~~~vaVGG~Dgkvhvysl~g~~l~ee~~~~~----h~a~iT~vaySpd--------------------  498 (603)
T KOG0318|consen  444 -YESSAVAVSPDGSEVAVGGQDGKVHVYSLSGDELKEEAKLLE----HRAAITDVAYSPD--------------------  498 (603)
T ss_pred             -cccceEEEcCCCCEEEEecccceEEEEEecCCcccceeeeec----ccCCceEEEECCC--------------------
Confidence             122356666678899999999999999987654322211111    1335666643211                    


Q ss_pred             cccccCCCCCCCCCCcEEEEEEecCCeEEEEECCCcee
Q 001003          736 GEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNC  773 (1192)
Q Consensus       736 ~~~~~~~~~~~~~~~~~~l~v~~~~g~l~I~sLP~~~~  773 (1192)
                                    . .+++.+-.++.+.+|++.+-+.
T Consensus       499 --------------~-~yla~~Da~rkvv~yd~~s~~~  521 (603)
T KOG0318|consen  499 --------------G-AYLAAGDASRKVVLYDVASREV  521 (603)
T ss_pred             --------------C-cEEEEeccCCcEEEEEcccCce
Confidence                          1 2888899999999999877444


No 19 
>KOG0294 consensus WD40 repeat-containing protein [Function unknown]
Probab=80.71  E-value=5.9  Score=44.71  Aligned_cols=83  Identities=14%  Similarity=0.256  Sum_probs=60.9

Q ss_pred             CCCcEEEEEEcCCEEEEEEeCCcEEEEEecCCCceEeeccccccccCCCceeEEEeeccCCCCCcccccccccccccCcc
Q 001003          657 ENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVG  736 (1192)
Q Consensus       657 ~~~~Iv~asi~dpyvlv~~~dg~i~~l~~d~~~~~l~~~~~~~l~~~~~~i~~~~l~~d~~~~~~~~~~~~~~~~~~~~~  736 (1192)
                      ....|+++++..||++=+.+|.+|.+|.+.....      ...+.+....|.|+.+|...+                   
T Consensus        42 H~~sitavAVs~~~~aSGssDetI~IYDm~k~~q------lg~ll~HagsitaL~F~~~~S-------------------   96 (362)
T KOG0294|consen   42 HAGSITALAVSGPYVASGSSDETIHIYDMRKRKQ------LGILLSHAGSITALKFYPPLS-------------------   96 (362)
T ss_pred             cccceeEEEecceeEeccCCCCcEEEEeccchhh------hcceeccccceEEEEecCCcc-------------------
Confidence            4567999999999999999999999998754321      112223355688877764321                   


Q ss_pred             ccccCCCCCCCCCCcEEEEEEecCCeEEEEECCCceeeEEee
Q 001003          737 EAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVD  778 (1192)
Q Consensus       737 ~~~~~~~~~~~~~~~~~l~v~~~~g~l~I~sLP~~~~v~~~~  778 (1192)
                                   .+ |+.-+-+||.+.||+.-+++++-+.+
T Consensus        97 -------------~s-hLlS~sdDG~i~iw~~~~W~~~~slK  124 (362)
T KOG0294|consen   97 -------------KS-HLLSGSDDGHIIIWRVGSWELLKSLK  124 (362)
T ss_pred             -------------hh-heeeecCCCcEEEEEcCCeEEeeeec
Confidence                         12 78889999999999999988875554


No 20 
>KOG0283 consensus WD40 repeat-containing protein [Function unknown]
Probab=79.62  E-value=50  Score=41.91  Aligned_cols=206  Identities=15%  Similarity=0.181  Sum_probs=116.7

Q ss_pred             CcEEEEEEcCCEEEE-EEeCCcEEEEEecCCCceEeeccccccccCCCceeEEEeeccCCCCCcccccccccccccCccc
Q 001003          659 STVLSVSIADPYVLL-GMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGE  737 (1192)
Q Consensus       659 ~~Iv~asi~dpyvlv-~~~dg~i~~l~~d~~~~~l~~~~~~~l~~~~~~i~~~~l~~d~~~~~~~~~~~~~~~~~~~~~~  737 (1192)
                      ..|..++=...+.|| +.-|.|+++|....+.| |.+     +. ...-|+|+.+.                        
T Consensus       370 ~DILDlSWSKn~fLLSSSMDKTVRLWh~~~~~C-L~~-----F~-HndfVTcVaFn------------------------  418 (712)
T KOG0283|consen  370 ADILDLSWSKNNFLLSSSMDKTVRLWHPGRKEC-LKV-----FS-HNDFVTCVAFN------------------------  418 (712)
T ss_pred             hhheecccccCCeeEeccccccEEeecCCCcce-eeE-----Ee-cCCeeEEEEec------------------------
Confidence            346666666555555 66799999999987777 332     21 24457776542                        


Q ss_pred             cccCCCCCCCCCCcEEEEEEecCCeEEEEECCCceeeEEeeccccccceecccccccccccccccccCCCccCCCCCccc
Q 001003          738 AIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKE  817 (1192)
Q Consensus       738 ~~~~~~~~~~~~~~~~l~v~~~~g~l~I~sLP~~~~v~~~~~l~~~~~~l~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~  817 (1192)
                      |+|         ..|+ +-+--+|.++||++|+.+.++=.+    ++.                                
T Consensus       419 PvD---------DryF-iSGSLD~KvRiWsI~d~~Vv~W~D----l~~--------------------------------  452 (712)
T KOG0283|consen  419 PVD---------DRYF-ISGSLDGKVRLWSISDKKVVDWND----LRD--------------------------------  452 (712)
T ss_pred             ccC---------CCcE-eecccccceEEeecCcCeeEeehh----hhh--------------------------------
Confidence            222         1333 334458999999999988765443    222                                


Q ss_pred             ccccccEEEEEEeecCCCCCccEEEEEecCCcEEEEEEeeecCCCCCCCCCCCCccccccccccccccccceeeEEecCC
Q 001003          818 NIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPL  897 (1192)
Q Consensus       818 ~~~~~~v~eil~~~~g~~~~~p~Llv~l~dG~l~~Y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lrF~kv~~  897 (1192)
                           .|.-+++.+     ..-|-++++-+|....|...-..-.                     .+.   .|+-     
T Consensus       453 -----lITAvcy~P-----dGk~avIGt~~G~C~fY~t~~lk~~---------------------~~~---~I~~-----  493 (712)
T KOG0283|consen  453 -----LITAVCYSP-----DGKGAVIGTFNGYCRFYDTEGLKLV---------------------SDF---HIRL-----  493 (712)
T ss_pred             -----hheeEEecc-----CCceEEEEEeccEEEEEEccCCeEE---------------------Eee---eEee-----
Confidence                 233333322     2456678888999999986410000                     000   0010     


Q ss_pred             CccCCCCCCCCCCccceEEeeccCCceEEEEcCCCCeEEEEe-CCceEEEecCCCCceeEEecccCCCCCCcEEEEEecC
Q 001003          898 DAYTREETPHGAPCQRITIFKNISGHQGFFLSGSRPCWCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQG  976 (1192)
Q Consensus       898 ~~~~~~~~~~~~g~~~l~~f~~i~G~~gVF~~G~rP~wi~~~-~g~l~~~p~~~~~~v~~~t~F~~~~c~~Gfi~~~~~~  976 (1192)
                          +..+ +..+. +|+-|+-.        .|+.--+|+.+ ..++|++-......|.-|-.|+|.+-+.- ..++.+|
T Consensus       494 ----~~~K-k~~~~-rITG~Q~~--------p~~~~~vLVTSnDSrIRI~d~~~~~lv~KfKG~~n~~SQ~~-Asfs~Dg  558 (712)
T KOG0283|consen  494 ----HNKK-KKQGK-RITGLQFF--------PGDPDEVLVTSNDSRIRIYDGRDKDLVHKFKGFRNTSSQIS-ASFSSDG  558 (712)
T ss_pred             ----ccCc-cccCc-eeeeeEec--------CCCCCeEEEecCCCceEEEeccchhhhhhhcccccCCccee-eeEccCC
Confidence                0000 00111 34433321        12221234443 68899988643455777888888776554 7888888


Q ss_pred             eEEEEEcCCCCccCCccceEE
Q 001003          977 ILKICQLPSGSTYDNYWPVQK  997 (1192)
Q Consensus       977 ~LrI~~l~~~~~~d~~~~vrk  997 (1192)
                      .--||.-++.+-|  -|....
T Consensus       559 k~IVs~seDs~VY--iW~~~~  577 (712)
T KOG0283|consen  559 KHIVSASEDSWVY--IWKNDS  577 (712)
T ss_pred             CEEEEeecCceEE--EEeCCC
Confidence            8888888877766  666543


No 21 
>KOG0318 consensus WD40 repeat stress protein/actin interacting protein [Cytoskeleton]
Probab=78.63  E-value=1.7e+02  Score=35.76  Aligned_cols=60  Identities=18%  Similarity=0.282  Sum_probs=39.5

Q ss_pred             CceEEEEEeccccCcccccCceEEEEEeeeCCCCCceeEeecccCcccccchhcccCceEEEeecc-eEEee
Q 001003         1108 NETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSYGPLFSSVQIDFASHFFAICSNS-FVFVF 1178 (1192)
Q Consensus      1108 ~~~ylaVGTa~~~gEd~~~rGRIlvfev~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~-~~~~~ 1178 (1192)
                      .+.++|||     |||    |.++||.+.-..-..|  ....+-.+..+.|.+--.|..||+|-.+ +|-+|
T Consensus       454 ~~~~vaVG-----G~D----gkvhvysl~g~~l~ee--~~~~~h~a~iT~vaySpd~~yla~~Da~rkvv~y  514 (603)
T KOG0318|consen  454 DGSEVAVG-----GQD----GKVHVYSLSGDELKEE--AKLLEHRAAITDVAYSPDGAYLAAGDASRKVVLY  514 (603)
T ss_pred             CCCEEEEe-----ccc----ceEEEEEecCCcccce--eeeecccCCceEEEECCCCcEEEEeccCCcEEEE
Confidence            36799999     565    6799999975332223  1223444555555566789999999888 55544


No 22 
>KOG0650 consensus WD40 repeat nucleolar protein Bop1, involved in ribosome biogenesis [Translation, ribosomal structure and biogenesis]
Probab=78.00  E-value=87  Score=38.62  Aligned_cols=27  Identities=22%  Similarity=0.432  Sum_probs=23.7

Q ss_pred             EEEEEEecCCeEEEEECCCceeeEEee
Q 001003          752 IYSVVCYESGALEIFDVPNFNCVFTVD  778 (1192)
Q Consensus       752 ~~l~v~~~~g~l~I~sLP~~~~v~~~~  778 (1192)
                      .|++-+.++|+++||.+-...++.++.
T Consensus       413 ~wlasGsdDGtvriWEi~TgRcvr~~~  439 (733)
T KOG0650|consen  413 EWLASGSDDGTVRIWEIATGRCVRTVQ  439 (733)
T ss_pred             ceeeecCCCCcEEEEEeecceEEEEEe
Confidence            499999999999999999988876664


No 23 
>COG5161 SFT1 Pre-mRNA cleavage and polyadenylation specificity factor [RNA processing and modification]
Probab=77.78  E-value=1.1  Score=55.97  Aligned_cols=91  Identities=15%  Similarity=-0.027  Sum_probs=69.4

Q ss_pred             ccEEEEEEEEeeeEEeEEEEEecCCCCCCCCccEEEEEeCCCeEEEEEEeCCCCCEEEEeeeeeeccccccccCCccccc
Q 001003           99 ASLELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFA  178 (1192)
Q Consensus        99 ~~L~lv~e~~l~G~I~~l~~~r~~~~~~~~~~D~Llv~~~~aklsil~~d~~~~~l~t~Slh~~E~~~~~~~~~G~~~~~  178 (1192)
                      .-|++..+.+.||+|. |..+.-+++    +    .-+++-||++.+|||+..    +-+||++|+-    ...-.-...
T Consensus        87 s~lrf~sq~n~f~Tis-lhyyeGKfk----g----ksLvelak~stle~D~~s----scaLlfneDi----~~flpfhvn  149 (1319)
T COG5161          87 SLLRFDSQANEFRTIS-LHYYEGKFK----G----KSLVELAKFSTLEFDIRS----SCALLFNEDI----GNFLPFHVN  149 (1319)
T ss_pred             EEEEehhhcccceeEE-EeeeccccC----C----chhhhhhhhhheeeccCc----cchhhhhhhh----hhccccccc
Confidence            4588888999999998 888877764    3    345778999999999986    4589999983    111111123


Q ss_pred             CCCeEEECCCCcEEEEEEcCceEEEEeC
Q 001003          179 RGPLVKVDPQGRCGGVLVYGLQMIILKA  206 (1192)
Q Consensus       179 ~~~~l~VDP~~Rc~~l~~y~~~L~ilP~  206 (1192)
                      .+....|||+.-|.++....++++|+|-
T Consensus       150 kndddev~~d~D~~~~~~~~~h~~i~ps  177 (1319)
T COG5161         150 KNDDDEVRIDVDLGMFQMSKRHFSIFPS  177 (1319)
T ss_pred             CCccccccccccccHHHHHHHHhhcCCC
Confidence            3456789999999999999999999985


No 24 
>PF10282 Lactonase:  Lactonase, 7-bladed beta-propeller;  InterPro: IPR019405  6-phosphogluconolactonases (6PGL) 3.1.1.31 from EC, which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate, is one of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE IPR022528 from INTERPRO [] and the Pseudomonas aeruginosa DevB IPR005900 from INTERPRO [] types.  This entry contains bacterial 6-phosphogluconolactonases (6PGL) YbhE-type 3.1.1.31 from EC which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase 5.5.1.5 from EC and muconate cycloisomerase 5.5.1.1 from EC, which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold [].; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H.
Probab=75.86  E-value=1.7e+02  Score=34.32  Aligned_cols=52  Identities=12%  Similarity=0.278  Sum_probs=34.9

Q ss_pred             cEEEEEec--CeEEEEEcCCCC-ccCCccceEEEecCCCccCeEEEecCCCEEEEEE
Q 001003          968 GFIYVTSQ--GILKICQLPSGS-TYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIV 1021 (1192)
Q Consensus       968 Gfi~~~~~--~~LrI~~l~~~~-~~d~~~~vrk~ipL~~tp~~Iay~~~~~~y~v~~ 1021 (1192)
                      .++|+..-  +.+++..++... +|.. ...-+ +|.|.-||++++||+.+.+.|++
T Consensus       156 ~~v~v~dlG~D~v~~~~~~~~~~~l~~-~~~~~-~~~G~GPRh~~f~pdg~~~Yv~~  210 (345)
T PF10282_consen  156 RFVYVPDLGADRVYVYDIDDDTGKLTP-VDSIK-VPPGSGPRHLAFSPDGKYAYVVN  210 (345)
T ss_dssp             SEEEEEETTTTEEEEEEE-TTS-TEEE-EEEEE-CSTTSSEEEEEE-TTSSEEEEEE
T ss_pred             CEEEEEecCCCEEEEEEEeCCCceEEE-eeccc-cccCCCCcEEEEcCCcCEEEEec
Confidence            47777653  478888887654 2322 23346 89999999999999988766654


No 25 
>KOG2110 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=72.54  E-value=2e+02  Score=33.68  Aligned_cols=154  Identities=18%  Similarity=0.275  Sum_probs=92.5

Q ss_pred             cEEEEeeCCCCEEEEEecCcEEEEeCCc--ceeeeecCCCCCCCCCCCCCCcEEEEEEcCC--EEEE--EEeCCcEEEEE
Q 001003          611 TIAAGNLFGRRRVIQVFERGARILDGSY--MTQDLSFGPSNSESGSGSENSTVLSVSIADP--YVLL--GMSDGSIRLLV  684 (1192)
Q Consensus       611 TI~ag~l~~~~~IvQVt~~~vrli~~~~--~~q~i~~~~~~~e~~~~~~~~~Iv~asi~dp--yvlv--~~~dg~i~~l~  684 (1192)
                      .|.+-.| +++|+|-+....|-++|-..  .++.|.      +|. | ......+.|++..  |++.  ....|+|++|.
T Consensus        89 ~IL~Vrm-Nr~RLvV~Lee~IyIydI~~MklLhTI~------t~~-~-n~~gl~AlS~n~~n~ylAyp~s~t~GdV~l~d  159 (391)
T KOG2110|consen   89 SILAVRM-NRKRLVVCLEESIYIYDIKDMKLLHTIE------TTP-P-NPKGLCALSPNNANCYLAYPGSTTSGDVVLFD  159 (391)
T ss_pred             ceEEEEE-ccceEEEEEcccEEEEecccceeehhhh------ccC-C-CccceEeeccCCCCceEEecCCCCCceEEEEE
Confidence            4666667 78899989999999998653  233222      331 1 2233666666544  8777  45578999998


Q ss_pred             ecCCCceEeeccccccccCCCceeEEEeeccCCCCCcccccccccccccCccccccCCCCCCCCCCcEEEEEEecCCe-E
Q 001003          685 GDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGA-L  763 (1192)
Q Consensus       685 ~d~~~~~l~~~~~~~l~~~~~~i~~~~l~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~v~~~~g~-l  763 (1192)
                      +..-...-      .+...+..++|+.+-.|                                  + .+++-+-+.|+ +
T Consensus       160 ~~nl~~v~------~I~aH~~~lAalafs~~----------------------------------G-~llATASeKGTVI  198 (391)
T KOG2110|consen  160 TINLQPVN------TINAHKGPLAALAFSPD----------------------------------G-TLLATASEKGTVI  198 (391)
T ss_pred             cccceeee------EEEecCCceeEEEECCC----------------------------------C-CEEEEeccCceEE
Confidence            75422111      12224567777654211                                  1 24555555554 5


Q ss_pred             EEEECCCceeeEEeeccccccceecccccccccccccccccCCCccCCCCCcccccccccEEEEEEeecCCCCCccEEEE
Q 001003          764 EIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFA  843 (1192)
Q Consensus       764 ~I~sLP~~~~v~~~~~l~~~~~~l~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~eil~~~~g~~~~~p~Llv  843 (1192)
                      ++|+.|+-+.++...      +-..                                -..|.++.   |+  .+.+||.+
T Consensus       199 RVf~v~~G~kl~eFR------RG~~--------------------------------~~~IySL~---Fs--~ds~~L~~  235 (391)
T KOG2110|consen  199 RVFSVPEGQKLYEFR------RGTY--------------------------------PVSIYSLS---FS--PDSQFLAA  235 (391)
T ss_pred             EEEEcCCccEeeeee------CCce--------------------------------eeEEEEEE---EC--CCCCeEEE
Confidence            788888877766553      1100                                12344443   43  45679999


Q ss_pred             EecCCcEEEEEEee
Q 001003          844 ILTDGTILCYQAYL  857 (1192)
Q Consensus       844 ~l~dG~l~~Y~~~~  857 (1192)
                      .-..+++.+|++-.
T Consensus       236 sS~TeTVHiFKL~~  249 (391)
T KOG2110|consen  236 SSNTETVHIFKLEK  249 (391)
T ss_pred             ecCCCeEEEEEecc
Confidence            99999999999853


No 26 
>PTZ00420 coronin; Provisional
Probab=71.53  E-value=1.2e+02  Score=38.20  Aligned_cols=148  Identities=7%  Similarity=0.130  Sum_probs=77.4

Q ss_pred             ecCeEEEEEcCCCCccCCccceEEEecCCCccCeEEEecCCCEEEEEEeecCccccccccccccccccccccccCCCCcc
Q 001003          974 SQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSV 1053 (1192)
Q Consensus       974 ~~~~LrI~~l~~~~~~d~~~~vrk~ipL~~tp~~Iay~~~~~~y~v~~s~~~~~~~~~~~~~~~~ee~~~~~~~~~~~~~ 1053 (1192)
                      .++.++|-.+....      .+.. +.....+..+++++..+.++..+.                               
T Consensus       146 ~DgtIrIWDl~tg~------~~~~-i~~~~~V~SlswspdG~lLat~s~-------------------------------  187 (568)
T PTZ00420        146 FDSFVNIWDIENEK------RAFQ-INMPKKLSSLKWNIKGNLLSGTCV-------------------------------  187 (568)
T ss_pred             CCCeEEEEECCCCc------EEEE-EecCCcEEEEEECCCCCEEEEEec-------------------------------
Confidence            35688887776554      3455 655566888999998876654321                               


Q ss_pred             ccccCcceeeeEEEEEcCCCCCCCceeeeeEECCCCcceEEEEEEEeccccCCCCceEEEEEeccccCcccccCceEEEE
Q 001003         1054 DLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLF 1133 (1192)
Q Consensus      1054 ~~~~~p~~~~~sv~Lldp~~~~~twe~id~~el~~~E~v~sv~~v~L~s~~t~~~~~ylaVGTa~~~gEd~~~rGRIlvf 1133 (1192)
                               ...|+++|+.    +++.+.+++-  ++.....+.+.+...  ++...+|+.+. +    |-...+.|.||
T Consensus       188 ---------D~~IrIwD~R----sg~~i~tl~g--H~g~~~s~~v~~~~f--s~d~~~IlTtG-~----d~~~~R~VkLW  245 (568)
T PTZ00420        188 ---------GKHMHIIDPR----KQEIASSFHI--HDGGKNTKNIWIDGL--GGDDNYILSTG-F----SKNNMREMKLW  245 (568)
T ss_pred             ---------CCEEEEEECC----CCcEEEEEec--ccCCceeEEEEeeeE--cCCCCEEEEEE-c----CCCCccEEEEE
Confidence                     1157888885    6677655443  333222233333211  11223444431 1    11233469999


Q ss_pred             EeeeCCCCCce-eEeecccCcccccchhcccCceEEEe-ecceEEeeehhhh
Q 001003         1134 STGRNADNPQN-LVLSGSYGPLFSSVQIDFASHFFAIC-SNSFVFVFLFSFL 1183 (1192)
Q Consensus      1134 ev~~~~~~~e~-~~~~~~~~~~~~~~~~~~~~~~~a~~-~~~~~~~~~~~~~ 1183 (1192)
                      ++-... +|-. .......+.+.... ....|.+++++ +-+.+++|++...
T Consensus       246 Dlr~~~-~pl~~~~ld~~~~~L~p~~-D~~tg~l~lsGkGD~tIr~~e~~~~  295 (568)
T PTZ00420        246 DLKNTT-SALVTMSIDNASAPLIPHY-DESTGLIYLIGKGDGNCRYYQHSLG  295 (568)
T ss_pred             ECCCCC-CceEEEEecCCccceEEee-eCCCCCEEEEEECCCeEEEEEccCC
Confidence            987422 2222 22222223322211 12346655544 6779999998643


No 27 
>KOG2096 consensus WD40 repeat protein [General function prediction only]
Probab=69.36  E-value=1.1e+02  Score=35.15  Aligned_cols=53  Identities=6%  Similarity=0.136  Sum_probs=41.0

Q ss_pred             cEEEEEecCeEEEEEcCCCCccCCccceEEEecCCCccCeEEEecCCCEEEEEEe
Q 001003          968 GFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVS 1022 (1192)
Q Consensus       968 Gfi~~~~~~~LrI~~l~~~~~~d~~~~vrk~ipL~~tp~~Iay~~~~~~y~v~~s 1022 (1192)
                      -|+-+..++.+||-.+.+-..-++..--+. +|++ +|.+++|.|+-+.++|.+-
T Consensus       100 ~lat~~~Dr~Ir~w~~~DF~~~eHr~~R~n-ve~d-hpT~V~FapDc~s~vv~~~  152 (420)
T KOG2096|consen  100 KLATISGDRSIRLWDVRDFENKEHRCIRQN-VEYD-HPTRVVFAPDCKSVVVSVK  152 (420)
T ss_pred             eeEEEeCCceEEEEecchhhhhhhhHhhcc-ccCC-CceEEEECCCcceEEEEEc
Confidence            455556667899988887654455555666 8888 9999999999999999775


No 28 
>KOG0310 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=68.44  E-value=94  Score=37.38  Aligned_cols=102  Identities=15%  Similarity=0.157  Sum_probs=61.6

Q ss_pred             cCcEEEEeCCcceeeeecCCCCCCCCCCCCCCcEEEEEEcCC-EEEEEEeCCcEEEEEecCCCceEeeccccccccCCCc
Q 001003          628 ERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADP-YVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKP  706 (1192)
Q Consensus       628 ~~~vrli~~~~~~q~i~~~~~~~e~~~~~~~~~Iv~asi~dp-yvlv~~~dg~i~~l~~d~~~~~l~~~~~~~l~~~~~~  706 (1192)
                      ..-||++|..... .+.+ ++  .     .|..|-.+-.-.+ .+++.+.+.++.+|.+...+.++-     .+.+..+.
T Consensus       175 Dg~vrl~DtR~~~-~~v~-el--n-----hg~pVe~vl~lpsgs~iasAgGn~vkVWDl~~G~qll~-----~~~~H~Kt  240 (487)
T KOG0310|consen  175 DGKVRLWDTRSLT-SRVV-EL--N-----HGCPVESVLALPSGSLIASAGGNSVKVWDLTTGGQLLT-----SMFNHNKT  240 (487)
T ss_pred             CceEEEEEeccCC-ceeE-Ee--c-----CCCceeeEEEcCCCCEEEEcCCCeEEEEEecCCceehh-----hhhcccce
Confidence            4568998865431 1111 11  1     2333555555444 666677788999998875544332     12224678


Q ss_pred             eeEEEeeccCCCCCcccccccccccccCccccccCCCCCCCCCCcEEEEEEecCCeEEEEECCCceeeEEee
Q 001003          707 VSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVD  778 (1192)
Q Consensus       707 i~~~~l~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~v~~~~g~l~I~sLP~~~~v~~~~  778 (1192)
                      |+|++++.+.                                  +.++-.. -+|.+.||++-+++.++.-.
T Consensus       241 VTcL~l~s~~----------------------------------~rLlS~s-LD~~VKVfd~t~~Kvv~s~~  277 (487)
T KOG0310|consen  241 VTCLRLASDS----------------------------------TRLLSGS-LDRHVKVFDTTNYKVVHSWK  277 (487)
T ss_pred             EEEEEeecCC----------------------------------ceEeecc-cccceEEEEccceEEEEeee
Confidence            9999887432                                  1233333 37999999999999987775


No 29 
>PF03178 CPSF_A:  CPSF A subunit region;  InterPro: IPR004871 This family includes a region that lies towards the C terminus of the cleavage and polyadenylation specificity factor (CPSF) A (160 kDa) subunit. CPSF is involved in mRNA polyadenylation and binds the AAUAAA conserved sequence in pre-mRNA. CPSF has also been found to be necessary for splicing of single-intron pre-mRNAs []. The function of the aligned region is unknown but may be involved in RNA/DNA binding.; GO: 0003676 nucleic acid binding, 0005634 nucleus; PDB: 2B5M_A 4A0K_C 4A0B_C 3I7L_A 3I8E_A 4A09_A 4A0A_A 3EI4_C 2B5L_A 3I7O_A ....
Probab=67.74  E-value=2.4e+02  Score=32.58  Aligned_cols=144  Identities=18%  Similarity=0.233  Sum_probs=88.7

Q ss_pred             cceEEEEEec-----------cceEEEEecC------ceeeeecccCccccCCcEEEEeeCCCCEEEEEecCcEEEEeCC
Q 001003          575 YHAYLIISLE-----------ARTMVLETAD------LLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS  637 (1192)
Q Consensus       575 ~~~yLvlS~~-----------~~T~Vl~~g~------~~eEv~~~~gF~~~~~TI~ag~l~~~~~IvQVt~~~vrli~~~  637 (1192)
                      ...||++...           ..-.+|++.+      .++.+..   ...+++-.++..+ ++ ++|=-..+.|++++-+
T Consensus        41 ~~~~ivVGT~~~~~~~~~~~~Gri~v~~i~~~~~~~~~l~~i~~---~~~~g~V~ai~~~-~~-~lv~~~g~~l~v~~l~  115 (321)
T PF03178_consen   41 KKEYIVVGTAFNYGEDPEPSSGRILVFEISESPENNFKLKLIHS---TEVKGPVTAICSF-NG-RLVVAVGNKLYVYDLD  115 (321)
T ss_dssp             SSEEEEEEEEE--TTSSS-S-EEEEEEEECSS-----EEEEEEE---EEESS-EEEEEEE-TT-EEEEEETTEEEEEEEE
T ss_pred             ccCEEEEEecccccccccccCcEEEEEEEEcccccceEEEEEEE---EeecCcceEhhhh-CC-EEEEeecCEEEEEEcc
Confidence            3578888543           3456777765      5666542   2346677777777 34 4776778888888743


Q ss_pred             c-c-eeeeecCCCCCCCCCCCCCCcEEEEEEcCCEEEEEEeCCcEEEEEecCCCceEeeccccccccCCCceeEEEeecc
Q 001003          638 Y-M-TQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHD  715 (1192)
Q Consensus       638 ~-~-~q~i~~~~~~~e~~~~~~~~~Iv~asi~dpyvlv~~~dg~i~~l~~d~~~~~l~~~~~~~l~~~~~~i~~~~l~~d  715 (1192)
                      . . +.....      |.   ....|++.++.+++++++..-.++.++..+++...+.......   ...++.++++..|
T Consensus       116 ~~~~l~~~~~------~~---~~~~i~sl~~~~~~I~vgD~~~sv~~~~~~~~~~~l~~va~d~---~~~~v~~~~~l~d  183 (321)
T PF03178_consen  116 NSKTLLKKAF------YD---SPFYITSLSVFKNYILVGDAMKSVSLLRYDEENNKLILVARDY---QPRWVTAAEFLVD  183 (321)
T ss_dssp             TTSSEEEEEE------E----BSSSEEEEEEETTEEEEEESSSSEEEEEEETTTE-EEEEEEES---S-BEEEEEEEE-S
T ss_pred             Ccccchhhhe------ec---ceEEEEEEeccccEEEEEEcccCEEEEEEEccCCEEEEEEecC---CCccEEEEEEecC
Confidence            2 2 222222      12   2346999999999999999999999998877555443222211   2445666654311


Q ss_pred             CCCCCcccccccccccccCccccccCCCCCCCCCCcEEEEEEecCCeEEEEECCC
Q 001003          716 KGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPN  770 (1192)
Q Consensus       716 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~v~~~~g~l~I~sLP~  770 (1192)
                                                       . . .++++..+|.+.+++.|.
T Consensus       184 ---------------------------------~-~-~~i~~D~~gnl~~l~~~~  203 (321)
T PF03178_consen  184 ---------------------------------E-D-TIIVGDKDGNLFVLRYNP  203 (321)
T ss_dssp             ---------------------------------S-S-EEEEEETTSEEEEEEE-S
T ss_pred             ---------------------------------C-c-EEEEEcCCCeEEEEEECC
Confidence                                             1 2 566788899999999873


No 30 
>KOG0316 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=67.18  E-value=1.5e+02  Score=32.64  Aligned_cols=25  Identities=12%  Similarity=0.126  Sum_probs=19.0

Q ss_pred             EEEEEecCCeEEEEECCCceeeEEe
Q 001003          753 YSVVCYESGALEIFDVPNFNCVFTV  777 (1192)
Q Consensus       753 ~l~v~~~~g~l~I~sLP~~~~v~~~  777 (1192)
                      .++-+-++|.+.+|+|-+-.++...
T Consensus       241 hV~sgSEDG~Vy~wdLvd~~~~sk~  265 (307)
T KOG0316|consen  241 HVFSGSEDGKVYFWDLVDETQISKL  265 (307)
T ss_pred             eEEeccCCceEEEEEeccceeeeee
Confidence            5666889999999999876655433


No 31 
>COG4247 Phy 3-phytase (myo-inositol-hexaphosphate 3-phosphohydrolase) [Lipid metabolism]
Probab=66.50  E-value=77  Score=35.18  Aligned_cols=130  Identities=18%  Similarity=0.185  Sum_probs=76.4

Q ss_pred             CccccCCcEEEEeeCCCC--EEEEEecCcEEEEeCC-cceeeeecCCCC---CCCCCCCCCCcEEEEEEcCCEEEEEEeC
Q 001003          604 DYFVQGRTIAAGNLFGRR--RVIQVFERGARILDGS-YMTQDLSFGPSN---SESGSGSENSTVLSVSIADPYVLLGMSD  677 (1192)
Q Consensus       604 gF~~~~~TI~ag~l~~~~--~IvQVt~~~vrli~~~-~~~q~i~~~~~~---~e~~~~~~~~~Iv~asi~dpyvlv~~~d  677 (1192)
                      +=..+.|.|++..-....  .|--+-..|+|+||-. ++.|.++...+.   ...+.+-.+..|.-|..+|.+      .
T Consensus        50 ~daADDPAIwVh~t~P~kS~vItt~Kk~Gl~VYDLsGkqLqs~~~Gk~NNVDLrygF~LgG~~idiaaASdR~------~  123 (364)
T COG4247          50 NDAADDPAIWVHATNPDKSLVITTVKKAGLRVYDLSGKQLQSVNPGKYNNVDLRYGFQLGGQSIDIAAASDRQ------N  123 (364)
T ss_pred             CcccCCcceEeccCCcCcceEEEeeccCCeEEEecCCCeeeecCCCcccccccccCcccCCeEEEEEeccccc------C
Confidence            334677888887664333  2444556789999854 355544332211   011222235566666666654      7


Q ss_pred             CcEEEEEecCCCceEeeccccc--cccCCCceeEEEeeccCCCCCcccccccccccccCccccccCCCCCCCCCCcEEEE
Q 001003          678 GSIRLLVGDPSTCTVSVQTPAA--IESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSV  755 (1192)
Q Consensus       678 g~i~~l~~d~~~~~l~~~~~~~--l~~~~~~i~~~~l~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~  755 (1192)
                      ..|.+|.+|++...|+-..-..  .++..++.-..|||++..                               ...++++
T Consensus       124 ~~i~~y~Idp~~~~L~sitD~n~p~ss~~s~~YGl~lyrs~k-------------------------------tgd~yvf  172 (364)
T COG4247         124 DKIVFYKIDPNPQYLESITDSNAPYSSSSSSAYGLALYRSPK-------------------------------TGDYYVF  172 (364)
T ss_pred             CeEEEEEeCCCccceeeccCCCCccccCcccceeeEEEecCC-------------------------------cCcEEEE
Confidence            7899999999876665332221  112233344467875431                               2358999


Q ss_pred             EEecCCeEEEEECCC
Q 001003          756 VCYESGALEIFDVPN  770 (1192)
Q Consensus       756 v~~~~g~l~I~sLP~  770 (1192)
                      |.+..|.++=|+|-+
T Consensus       173 V~~~qG~~~Qy~l~d  187 (364)
T COG4247         173 VNRRQGDIAQYKLID  187 (364)
T ss_pred             EecCCCceeEEEEEe
Confidence            988889998888754


No 32 
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=66.08  E-value=1e+02  Score=35.75  Aligned_cols=72  Identities=13%  Similarity=0.139  Sum_probs=47.3

Q ss_pred             ccEEEEEeC-CCeEEEEEEeCCCCCEEEEeeeeeeccccccccCCcccccCCCeEEECCCCcEEEEE-EcCceEEEEeCc
Q 001003          130 RDSIILAFE-DAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVL-VYGLQMIILKAS  207 (1192)
Q Consensus       130 ~D~Llv~~~-~aklsil~~d~~~~~l~t~Slh~~E~~~~~~~~~G~~~~~~~~~l~VDP~~Rc~~l~-~y~~~L~ilP~~  207 (1192)
                      ..+..+..+ +..+.+++||+..++|..+--+.  .     ++.+........-+.+.|+||..-.+ =+.+.|++.-..
T Consensus       202 ~k~aY~v~EL~stV~v~~y~~~~g~~~~lQ~i~--t-----lP~dF~g~~~~aaIhis~dGrFLYasNRg~dsI~~f~V~  274 (346)
T COG2706         202 GKYAYLVNELNSTVDVLEYNPAVGKFEELQTID--T-----LPEDFTGTNWAAAIHISPDGRFLYASNRGHDSIAVFSVD  274 (346)
T ss_pred             CcEEEEEeccCCEEEEEEEcCCCceEEEeeeec--c-----CccccCCCCceeEEEECCCCCEEEEecCCCCeEEEEEEc
Confidence            556666666 78999999999988887643332  1     22223333344668999999998776 445667776443


Q ss_pred             c
Q 001003          208 Q  208 (1192)
Q Consensus       208 ~  208 (1192)
                      +
T Consensus       275 ~  275 (346)
T COG2706         275 P  275 (346)
T ss_pred             C
Confidence            3


No 33 
>KOG0772 consensus Uncharacterized conserved protein, contains WD40 repeat [Function unknown]
Probab=63.78  E-value=60  Score=39.24  Aligned_cols=69  Identities=17%  Similarity=0.262  Sum_probs=43.2

Q ss_pred             EEEEEEeCCcEEEEEecCCCceEeeccccccccCCCceeEEEeeccCCCCCcccccccccccccCccccccCCCCCCCCC
Q 001003          670 YVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQ  749 (1192)
Q Consensus       670 yvlv~~~dg~i~~l~~d~~~~~l~~~~~~~l~~~~~~i~~~~l~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  749 (1192)
                      .++-...||++++|.++.....+++.++.......-.+++ |-|+-                                 .
T Consensus       283 ~FlT~s~DgtlRiWdv~~~k~q~qVik~k~~~g~Rv~~ts-C~~nr---------------------------------d  328 (641)
T KOG0772|consen  283 EFLTCSYDGTLRIWDVNNTKSQLQVIKTKPAGGKRVPVTS-CAWNR---------------------------------D  328 (641)
T ss_pred             ceEEecCCCcEEEEecCCchhheeEEeeccCCCcccCcee-eecCC---------------------------------C
Confidence            3445557999999999876555666666555322223333 33311                                 1


Q ss_pred             CcEEEEEEecCCeEEEEECCCcee
Q 001003          750 GDIYSVVCYESGALEIFDVPNFNC  773 (1192)
Q Consensus       750 ~~~~l~v~~~~g~l~I~sLP~~~~  773 (1192)
                      . .|.+.+..+|+++||+++++..
T Consensus       329 g-~~iAagc~DGSIQ~W~~~~~~v  351 (641)
T KOG0772|consen  329 G-KLIAAGCLDGSIQIWDKGSRTV  351 (641)
T ss_pred             c-chhhhcccCCceeeeecCCccc
Confidence            1 2566677799999999987543


No 34 
>KOG0295 consensus WD40 repeat-containing protein [Function unknown]
Probab=63.38  E-value=1.2e+02  Score=35.33  Aligned_cols=66  Identities=20%  Similarity=0.436  Sum_probs=47.9

Q ss_pred             CCEEEEEEeCCcEEEEEecCCCceEeeccccccccCCCceeEEEeeccCCCCCcccccccccccccCccccccCCCCCCC
Q 001003          668 DPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPL  747 (1192)
Q Consensus       668 dpyvlv~~~dg~i~~l~~d~~~~~l~~~~~~~l~~~~~~i~~~~l~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  747 (1192)
                      .+|+..+..|++|+++.+....+++.+..      .++.|..+.+-    |                             
T Consensus       304 ~~~l~s~SrDktIk~wdv~tg~cL~tL~g------hdnwVr~~af~----p-----------------------------  344 (406)
T KOG0295|consen  304 GQVLGSGSRDKTIKIWDVSTGMCLFTLVG------HDNWVRGVAFS----P-----------------------------  344 (406)
T ss_pred             ccEEEeecccceEEEEeccCCeEEEEEec------ccceeeeeEEc----C-----------------------------
Confidence            46888899999999999887667666443      24455544331    0                             


Q ss_pred             CCCcEEEEEEecCCeEEEEECCCceee
Q 001003          748 DQGDIYSVVCYESGALEIFDVPNFNCV  774 (1192)
Q Consensus       748 ~~~~~~l~v~~~~g~l~I~sLP~~~~v  774 (1192)
                        ...|++-|-+|++|+||+|.+.++.
T Consensus       345 --~Gkyi~ScaDDktlrvwdl~~~~cm  369 (406)
T KOG0295|consen  345 --GGKYILSCADDKTLRVWDLKNLQCM  369 (406)
T ss_pred             --CCeEEEEEecCCcEEEEEeccceee
Confidence              1248888999999999999887764


No 35 
>KOG0285 consensus Pleiotropic regulator 1 [RNA processing and modification]
Probab=61.85  E-value=94  Score=36.05  Aligned_cols=60  Identities=17%  Similarity=0.316  Sum_probs=42.5

Q ss_pred             cEEEEEEecCCeEEEEECCCceeeEEeeccccccceecccccccccccccccccCCCccCCCCCcccccccccEEEEEEe
Q 001003          751 DIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQ  830 (1192)
Q Consensus       751 ~~~l~v~~~~g~l~I~sLP~~~~v~~~~~l~~~~~~l~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~eil~~  830 (1192)
                      +.|++-+-.++++.||.|.+-++..+..+.                                        -..++++.+.
T Consensus       163 n~wf~tgs~DrtikIwDlatg~LkltltGh----------------------------------------i~~vr~vavS  202 (460)
T KOG0285|consen  163 NEWFATGSADRTIKIWDLATGQLKLTLTGH----------------------------------------IETVRGVAVS  202 (460)
T ss_pred             ceeEEecCCCceeEEEEcccCeEEEeecch----------------------------------------hheeeeeeec
Confidence            358888888999999999886665444311                                        0123334332


Q ss_pred             ecCCCCCccEEEEEecCCcEEEEEE
Q 001003          831 RWSAHHSRPFLFAILTDGTILCYQA  855 (1192)
Q Consensus       831 ~~g~~~~~p~Llv~l~dG~l~~Y~~  855 (1192)
                           ...||||....|+++-.|.+
T Consensus       203 -----~rHpYlFs~gedk~VKCwDL  222 (460)
T KOG0285|consen  203 -----KRHPYLFSAGEDKQVKCWDL  222 (460)
T ss_pred             -----ccCceEEEecCCCeeEEEec
Confidence                 35799999999999998886


No 36 
>KOG1273 consensus WD40 repeat protein [General function prediction only]
Probab=61.56  E-value=2.2e+02  Score=32.71  Aligned_cols=19  Identities=16%  Similarity=0.279  Sum_probs=17.0

Q ss_pred             ccEEEEEecCCcEEEEEEe
Q 001003          838 RPFLFAILTDGTILCYQAY  856 (1192)
Q Consensus       838 ~p~Llv~l~dG~l~~Y~~~  856 (1192)
                      .-||-++..||.|++|.+-
T Consensus        35 G~~lAvGc~nG~vvI~D~~   53 (405)
T KOG1273|consen   35 GDYLAVGCANGRVVIYDFD   53 (405)
T ss_pred             cceeeeeccCCcEEEEEcc
Confidence            4799999999999999974


No 37 
>KOG2048 consensus WD40 repeat protein [General function prediction only]
Probab=61.41  E-value=4.3e+02  Score=33.39  Aligned_cols=60  Identities=17%  Similarity=0.179  Sum_probs=37.0

Q ss_pred             CceEEEEEeccccCcccccCceEEEEEeeeCCCCCceeEeecccCcccccchhcccCceEEEeecceEEee
Q 001003         1108 NETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSYGPLFSSVQIDFASHFFAICSNSFVFVF 1178 (1192)
Q Consensus      1108 ~~~ylaVGTa~~~gEd~~~rGRIlvfev~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~ 1178 (1192)
                      ..+||||-         .++|-|++|.+-...-++=-.+.-....++..+  -+-++.+..+=+++.||=|
T Consensus       486 dG~yiaa~---------~t~g~I~v~nl~~~~~~~l~~rln~~vTa~~~~--~~~~~~lvvats~nQv~ef  545 (691)
T KOG2048|consen  486 DGNYIAAI---------STRGQIFVYNLETLESHLLKVRLNIDVTAAAFS--PFVRNRLVVATSNNQVFEF  545 (691)
T ss_pred             CCCEEEEE---------eccceEEEEEcccceeecchhccCcceeeeecc--ccccCcEEEEecCCeEEEE
Confidence            35799987         478999999986533333222222233343332  2467888888888887644


No 38 
>KOG0645 consensus WD40 repeat protein [General function prediction only]
Probab=61.39  E-value=2.8e+02  Score=31.29  Aligned_cols=135  Identities=16%  Similarity=0.243  Sum_probs=79.8

Q ss_pred             CeEEEEEcCCCCccCCccceEEEec---CCCccCeEEEecCCCEEEEEEeecCccccccccccccccccccccccCCCCc
Q 001003          976 GILKICQLPSGSTYDNYWPVQKVIP---LKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSS 1052 (1192)
Q Consensus       976 ~~LrI~~l~~~~~~d~~~~vrk~ip---L~~tp~~Iay~~~~~~y~v~~s~~~~~~~~~~~~~~~~ee~~~~~~~~~~~~ 1052 (1192)
                      ..+||-.....    ..|..|. +-   -.+++|+||+.|..++++.++-..                            
T Consensus        37 k~vriw~~~~~----~s~~ck~-vld~~hkrsVRsvAwsp~g~~La~aSFD~----------------------------   83 (312)
T KOG0645|consen   37 KAVRIWSTSSG----DSWTCKT-VLDDGHKRSVRSVAWSPHGRYLASASFDA----------------------------   83 (312)
T ss_pred             ceEEEEecCCC----CcEEEEE-eccccchheeeeeeecCCCcEEEEeeccc----------------------------
Confidence            36777666642    2477776 43   356899999999999555543211                            


Q ss_pred             cccccCcceeeeEEEEEcCCCCCCCceeeeeEECCCCcceEEEEEEEeccccCCCCceEEEEEeccccCcccccCc-eEE
Q 001003         1053 VDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARG-RVL 1131 (1192)
Q Consensus      1053 ~~~~~~p~~~~~sv~Lldp~~~~~twe~id~~el~~~E~v~sv~~v~L~s~~t~~~~~ylaVGTa~~~gEd~~~rG-RIl 1131 (1192)
                                  .+-+....  +..||.++.+|=.+||    +|+|...   .+  ..|||-.+          |+ .+.
T Consensus        84 ------------t~~Iw~k~--~~efecv~~lEGHEnE----VK~Vaws---~s--G~~LATCS----------RDKSVW  130 (312)
T KOG0645|consen   84 ------------TVVIWKKE--DGEFECVATLEGHENE----VKCVAWS---AS--GNYLATCS----------RDKSVW  130 (312)
T ss_pred             ------------eEEEeecC--CCceeEEeeeeccccc----eeEEEEc---CC--CCEEEEee----------CCCeEE
Confidence                        11122222  2488888888888888    4888775   22  36888763          33 488


Q ss_pred             EEEeeeCCCCCceeEeecccCcccccchhcccC--ceEEEeecc-eEEeee
Q 001003         1132 LFSTGRNADNPQNLVLSGSYGPLFSSVQIDFAS--HFFAICSNS-FVFVFL 1179 (1192)
Q Consensus      1132 vfev~~~~~~~e~~~~~~~~~~~~~~~~~~~~~--~~~a~~~~~-~~~~~~ 1179 (1192)
                      |+|+.+ .+..|....-..-..=+-+|+  ++.  .||++|+-- .|.+|-
T Consensus       131 iWe~de-ddEfec~aVL~~HtqDVK~V~--WHPt~dlL~S~SYDnTIk~~~  178 (312)
T KOG0645|consen  131 IWEIDE-DDEFECIAVLQEHTQDVKHVI--WHPTEDLLFSCSYDNTIKVYR  178 (312)
T ss_pred             EEEecC-CCcEEEEeeeccccccccEEE--EcCCcceeEEeccCCeEEEEe
Confidence            999983 344444321111111111344  566  799999864 666653


No 39 
>KOG0299 consensus U3 snoRNP-associated protein (contains WD40 repeats) [RNA processing and modification]
Probab=59.31  E-value=88  Score=37.36  Aligned_cols=107  Identities=13%  Similarity=0.196  Sum_probs=66.9

Q ss_pred             CCEEEEEecCcEEEEeCCcceeeeecCCCCCCCCCCCCCCcEEEEEE--cCCEEEEEEeCCcEEEEEecCCCceEeeccc
Q 001003          620 RRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSI--ADPYVLLGMSDGSIRLLVGDPSTCTVSVQTP  697 (1192)
Q Consensus       620 ~~~IvQVt~~~vrli~~~~~~q~i~~~~~~~e~~~~~~~~~Iv~asi--~dpyvlv~~~dg~i~~l~~d~~~~~l~~~~~  697 (1192)
                      +.++.=-+..+|++++.+++...=.++      +++ .  .|...++  ...-+-|+-.|.++++|.+.+++. +-... 
T Consensus       257 ~~lys~s~Drsvkvw~~~~~s~vetly------GHq-d--~v~~IdaL~reR~vtVGgrDrT~rlwKi~eesq-lifrg-  325 (479)
T KOG0299|consen  257 SELYSASADRSVKVWSIDQLSYVETLY------GHQ-D--GVLGIDALSRERCVTVGGRDRTVRLWKIPEESQ-LIFRG-  325 (479)
T ss_pred             cceeeeecCCceEEEehhHhHHHHHHh------CCc-c--ceeeechhcccceEEeccccceeEEEeccccce-eeeeC-
Confidence            344555556778887765432111111      333 2  3555555  455666676799999999955443 22111 


Q ss_pred             cccccCCCceeEEEeeccCCCCCcccccccccccccCccccccCCCCCCCCCCcEEEEEEecCCeEEEEECCCceeeEEe
Q 001003          698 AAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTV  777 (1192)
Q Consensus       698 ~~l~~~~~~i~~~~l~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~v~~~~g~l~I~sLP~~~~v~~~  777 (1192)
                           ....+-|+|+.++.                                    -.+.+-+||++.+|++-.++++|+.
T Consensus       326 -----~~~sidcv~~In~~------------------------------------HfvsGSdnG~IaLWs~~KKkplf~~  364 (479)
T KOG0299|consen  326 -----GEGSIDCVAFINDE------------------------------------HFVSGSDNGSIALWSLLKKKPLFTS  364 (479)
T ss_pred             -----CCCCeeeEEEeccc------------------------------------ceeeccCCceEEEeeecccCceeEe
Confidence                 24468888886543                                    2345667999999999999999998


Q ss_pred             e
Q 001003          778 D  778 (1192)
Q Consensus       778 ~  778 (1192)
                      .
T Consensus       365 ~  365 (479)
T KOG0299|consen  365 R  365 (479)
T ss_pred             e
Confidence            6


No 40 
>KOG0289 consensus mRNA splicing factor [General function prediction only]
Probab=59.03  E-value=3.8e+02  Score=32.15  Aligned_cols=99  Identities=14%  Similarity=0.191  Sum_probs=55.0

Q ss_pred             cceEEEEEeccceEEEEe---cCceeeeecccCccccCCcEEEEeeCCCCEEEEEecCcEEEEeCC--cceeeeecCCCC
Q 001003          575 YHAYLIISLEARTMVLET---ADLLTEVTESVDYFVQGRTIAAGNLFGRRRVIQVFERGARILDGS--YMTQDLSFGPSN  649 (1192)
Q Consensus       575 ~~~yLvlS~~~~T~Vl~~---g~~~eEv~~~~gF~~~~~TI~ag~l~~~~~IvQVt~~~vrli~~~--~~~q~i~~~~~~  649 (1192)
                      .-+||+-.-.++|-.|+.   |..+.-+.+  .  +++--+.++.         +||.|..+-.+.  ..+..|.+.+.+
T Consensus       314 tgeYllsAs~d~~w~Fsd~~~g~~lt~vs~--~--~s~v~~ts~~---------fHpDgLifgtgt~d~~vkiwdlks~~  380 (506)
T KOG0289|consen  314 TGEYLLSASNDGTWAFSDISSGSQLTVVSD--E--TSDVEYTSAA---------FHPDGLIFGTGTPDGVVKIWDLKSQT  380 (506)
T ss_pred             CCcEEEEecCCceEEEEEccCCcEEEEEee--c--cccceeEEee---------EcCCceEEeccCCCceEEEEEcCCcc
Confidence            346888888888888873   444444432  1  2223334444         444444433222  122222222111


Q ss_pred             CCCCCCCCCCcEEEEEEcCC--EEEEEEeCCcEEEEEec
Q 001003          650 SESGSGSENSTVLSVSIADP--YVLLGMSDGSIRLLVGD  686 (1192)
Q Consensus       650 ~e~~~~~~~~~Iv~asi~dp--yvlv~~~dg~i~~l~~d  686 (1192)
                      .--..|...++|.+.++.++  |++++++|++|++|.+-
T Consensus       381 ~~a~Fpght~~vk~i~FsENGY~Lat~add~~V~lwDLR  419 (506)
T KOG0289|consen  381 NVAKFPGHTGPVKAISFSENGYWLATAADDGSVKLWDLR  419 (506)
T ss_pred             ccccCCCCCCceeEEEeccCceEEEEEecCCeEEEEEeh
Confidence            11122335678999999655  89999999999999873


No 41 
>KOG3881 consensus Uncharacterized conserved protein [Function unknown]
Probab=57.48  E-value=22  Score=41.35  Aligned_cols=92  Identities=14%  Similarity=0.166  Sum_probs=58.8

Q ss_pred             eEEEEEcCCCCCCCceeeeeEECCCCcceEEEEEEEeccccCCCCceEEEEEeccccCcccccCceEEEEEeeeCCCCCc
Q 001003         1064 YEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQ 1143 (1192)
Q Consensus      1064 ~sv~Lldp~~~~~twe~id~~el~~~E~v~sv~~v~L~s~~t~~~~~ylaVGTa~~~gEd~~~rGRIlvfev~~~~~~~e 1143 (1192)
                      +.||++|+..   -=..+.+|+|.  |++.+..  .|-   -+  ..+|.+|         .++|.+-.||+-...  .-
T Consensus       226 hqvR~YDt~~---qRRPV~~fd~~--E~~is~~--~l~---p~--gn~Iy~g---------n~~g~l~~FD~r~~k--l~  282 (412)
T KOG3881|consen  226 HQVRLYDTRH---QRRPVAQFDFL--ENPISST--GLT---PS--GNFIYTG---------NTKGQLAKFDLRGGK--LL  282 (412)
T ss_pred             eeEEEecCcc---cCcceeEeccc--cCcceee--eec---CC--CcEEEEe---------cccchhheecccCce--ee
Confidence            4688999852   22235666665  6665433  232   12  3578888         468889999886521  00


Q ss_pred             eeEeecccCcccccchhcccCceEEEeecc-eEEeee
Q 001003         1144 NLVLSGSYGPLFSSVQIDFASHFFAICSNS-FVFVFL 1179 (1192)
Q Consensus      1144 ~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~-~~~~~~ 1179 (1192)
                       -+--+.+++-.++++-.-.+|++|+|+.- .||||-
T Consensus       283 -g~~~kg~tGsirsih~hp~~~~las~GLDRyvRIhD  318 (412)
T KOG3881|consen  283 -GCGLKGITGSIRSIHCHPTHPVLASCGLDRYVRIHD  318 (412)
T ss_pred             -ccccCCccCCcceEEEcCCCceEEeeccceeEEEee
Confidence             11113566667778877888999999998 788774


No 42 
>PF08662 eIF2A:  Eukaryotic translation initiation factor eIF2A;  InterPro: IPR013979  This entry contains beta propellor domains found in eukaryotic translation initiation factors and TolB domain-containing proteins. 
Probab=57.34  E-value=2.4e+02  Score=30.15  Aligned_cols=121  Identities=14%  Similarity=0.220  Sum_probs=71.4

Q ss_pred             ceEEEecCCC--ccCeEEEecCCCEEEEEEeecCccccccccccccccccccccccCCCCccccccCcceeeeEEEEEcC
Q 001003          994 PVQKVIPLKA--TPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEP 1071 (1192)
Q Consensus       994 ~vrk~ipL~~--tp~~Iay~~~~~~y~v~~s~~~~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~~~p~~~~~sv~Lldp 1071 (1192)
                      ++.. ++|..  .++.++++|..+.++|+....   |                                  . .|+|++.
T Consensus        50 ~~~~-i~l~~~~~I~~~~WsP~g~~favi~g~~---~----------------------------------~-~v~lyd~   90 (194)
T PF08662_consen   50 PVES-IELKKEGPIHDVAWSPNGNEFAVIYGSM---P----------------------------------A-KVTLYDV   90 (194)
T ss_pred             ccce-eeccCCCceEEEEECcCCCEEEEEEccC---C----------------------------------c-ccEEEcC
Confidence            6777 88854  499999999999999875321   0                                  1 4556665


Q ss_pred             CCCCCCceeeeeEECCCCcceEEEEEEEeccccCCCCceEEEEEeccccCcccccCceEEEEEeeeCCCCCceeEeeccc
Q 001003         1072 DRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSY 1151 (1192)
Q Consensus      1072 ~~~~~twe~id~~el~~~E~v~sv~~v~L~s~~t~~~~~ylaVGTa~~~gEd~~~rGRIlvfev~~~~~~~e~~~~~~~~ 1151 (1192)
                           ..+.+..+  .. ..+.+   +...     ....|+|+|.--      ...|.|.+|++.+    .+.+...+. 
T Consensus        91 -----~~~~i~~~--~~-~~~n~---i~ws-----P~G~~l~~~g~~------n~~G~l~~wd~~~----~~~i~~~~~-  143 (194)
T PF08662_consen   91 -----KGKKIFSF--GT-QPRNT---ISWS-----PDGRFLVLAGFG------NLNGDLEFWDVRK----KKKISTFEH-  143 (194)
T ss_pred             -----cccEeEee--cC-CCceE---EEEC-----CCCCEEEEEEcc------CCCcEEEEEECCC----CEEeecccc-
Confidence                 23444333  22 23333   3332     234588887531      2448999999973    222221111 


Q ss_pred             CcccccchhcccCceEEEeecc-------eEEeeehh
Q 001003         1152 GPLFSSVQIDFASHFFAICSNS-------FVFVFLFS 1181 (1192)
Q Consensus      1152 ~~~~~~~~~~~~~~~~a~~~~~-------~~~~~~~~ 1181 (1192)
                       ...+.++=+-.|+.+|+++.+       .+.||-|.
T Consensus       144 -~~~t~~~WsPdGr~~~ta~t~~r~~~dng~~Iw~~~  179 (194)
T PF08662_consen  144 -SDATDVEWSPDGRYLATATTSPRLRVDNGFKIWSFQ  179 (194)
T ss_pred             -CcEEEEEEcCCCCEEEEEEeccceeccccEEEEEec
Confidence             123445567889999999875       56677664


No 43 
>KOG2110 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=56.37  E-value=3.6e+02  Score=31.70  Aligned_cols=26  Identities=19%  Similarity=0.250  Sum_probs=21.6

Q ss_pred             cCCCCCccEEEEEecCCcEEEEEEee
Q 001003          832 WSAHHSRPFLFAILTDGTILCYQAYL  857 (1192)
Q Consensus       832 ~g~~~~~p~Llv~l~dG~l~~Y~~~~  857 (1192)
                      |+.....|++++...||.+.+|++.+
T Consensus       306 l~~~~~~~~v~vas~dG~~y~y~l~~  331 (391)
T KOG2110|consen  306 LSSIQKIPRVLVASYDGHLYSYRLPP  331 (391)
T ss_pred             eeccCCCCEEEEEEcCCeEEEEEcCC
Confidence            44446789999999999999999864


No 44 
>KOG1898 consensus Splicing factor 3b, subunit 3 [RNA processing and modification]
Probab=56.11  E-value=4.4e+02  Score=35.17  Aligned_cols=64  Identities=16%  Similarity=0.124  Sum_probs=44.6

Q ss_pred             ceEEEEcCCCCeEEEE-eCCceEEEecCCCCceeEEecccCCCCCCcEEEEEecCeEEEEEcCCC
Q 001003          923 HQGFFLSGSRPCWCMV-FRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSG  986 (1192)
Q Consensus       923 ~~gVF~~G~rP~wi~~-~~g~l~~~p~~~~~~v~~~t~F~~~~c~~Gfi~~~~~~~LrI~~l~~~  986 (1192)
                      +..=|++=..|.||.+ +..-+++.-+...+.+.+...|...--|+++++..+-+-|-|.+-+..
T Consensus       732 ~as~~~S~qcpeGiv~i~~n~l~i~~~~~~g~~~n~~~~~l~~tprkvv~h~es~lLii~~td~~  796 (1205)
T KOG1898|consen  732 HASPFCSEQCPEGIVAISKNTLRIIALDKLGKVLNVDGFPLAYTPRKVVIHPESGLLIIGRTDHN  796 (1205)
T ss_pred             ccccccccCCCcchhhhhhhhhheeeehhhcccccccccccccCcceEEEecCCCeEEEEEeccc
Confidence            3455677778888875 456666666655567777777777777888888777777777775543


No 45 
>KOG1036 consensus Mitotic spindle checkpoint protein BUB3, WD repeat superfamily [Cell cycle control, cell division, chromosome partitioning]
Probab=55.38  E-value=3.7e+02  Score=30.82  Aligned_cols=74  Identities=14%  Similarity=0.210  Sum_probs=45.5

Q ss_pred             CCEEEEEecCcEEEEeCCcc--eeeeecCCCCCCCCCCCCCCcEEEEEEcC-CEEEEEEeCCcEEEEEecCCCceEeecc
Q 001003          620 RRRVIQVFERGARILDGSYM--TQDLSFGPSNSESGSGSENSTVLSVSIAD-PYVLLGMSDGSIRLLVGDPSTCTVSVQT  696 (1192)
Q Consensus       620 ~~~IvQVt~~~vrli~~~~~--~q~i~~~~~~~e~~~~~~~~~Iv~asi~d-pyvlv~~~dg~i~~l~~d~~~~~l~~~~  696 (1192)
                      +..+|=-....+|||+....  .+.+.            .+..|..+...| ..++.+..||.|+.+.++..+. ..   
T Consensus        26 ~~LLvssWDgslrlYdv~~~~l~~~~~------------~~~plL~c~F~d~~~~~~G~~dg~vr~~Dln~~~~-~~---   89 (323)
T KOG1036|consen   26 SDLLVSSWDGSLRLYDVPANSLKLKFK------------HGAPLLDCAFADESTIVTGGLDGQVRRYDLNTGNE-DQ---   89 (323)
T ss_pred             CcEEEEeccCcEEEEeccchhhhhhee------------cCCceeeeeccCCceEEEeccCceEEEEEecCCcc-ee---
Confidence            34444455677888886542  11111            233488888766 5888899999999998865433 12   


Q ss_pred             ccccccCCCceeEEEe
Q 001003          697 PAAIESSKKPVSSCTL  712 (1192)
Q Consensus       697 ~~~l~~~~~~i~~~~l  712 (1192)
                         ++.....+.|++-
T Consensus        90 ---igth~~~i~ci~~  102 (323)
T KOG1036|consen   90 ---IGTHDEGIRCIEY  102 (323)
T ss_pred             ---eccCCCceEEEEe
Confidence               2223556777654


No 46 
>PLN00181 protein SPA1-RELATED; Provisional
Probab=54.96  E-value=6.3e+02  Score=33.30  Aligned_cols=28  Identities=7%  Similarity=0.199  Sum_probs=22.1

Q ss_pred             CcEEEEEE--cCCEEEEEEeCCcEEEEEec
Q 001003          659 STVLSVSI--ADPYVLLGMSDGSIRLLVGD  686 (1192)
Q Consensus       659 ~~Iv~asi--~dpyvlv~~~dg~i~~l~~d  686 (1192)
                      ..|.++++  .+.+++.+..||+|.+|...
T Consensus       484 ~~V~~i~fs~dg~~latgg~D~~I~iwd~~  513 (793)
T PLN00181        484 NLVCAIGFDRDGEFFATAGVNKKIKIFECE  513 (793)
T ss_pred             CcEEEEEECCCCCEEEEEeCCCEEEEEECC
Confidence            34777777  36788899999999999764


No 47 
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=54.04  E-value=3.9e+02  Score=30.67  Aligned_cols=52  Identities=8%  Similarity=0.110  Sum_probs=32.1

Q ss_pred             EEEEEe--cCeEEEEEcCCCCccCCc-cceEEEecCCCccCeEEEecCCCEEEEEE
Q 001003          969 FIYVTS--QGILKICQLPSGSTYDNY-WPVQKVIPLKATPHQITYFAEKNLYPLIV 1021 (1192)
Q Consensus       969 fi~~~~--~~~LrI~~l~~~~~~d~~-~~vrk~ipL~~tp~~Iay~~~~~~y~v~~ 1021 (1192)
                      ++|+..  .+.+++..+.....+... -...+ +|.+..||++++||..+.++|+.
T Consensus       139 ~l~v~~~~~~~v~v~d~~~~g~l~~~~~~~~~-~~~g~~p~~~~~~pdg~~lyv~~  193 (330)
T PRK11028        139 TLWVPCLKEDRIRLFTLSDDGHLVAQEPAEVT-TVEGAGPRHMVFHPNQQYAYCVN  193 (330)
T ss_pred             EEEEeeCCCCEEEEEEECCCCcccccCCCcee-cCCCCCCceEEECCCCCEEEEEe
Confidence            455544  246666666653322110 12235 77899999999999988777754


No 48 
>KOG1036 consensus Mitotic spindle checkpoint protein BUB3, WD repeat superfamily [Cell cycle control, cell division, chromosome partitioning]
Probab=53.77  E-value=4e+02  Score=30.63  Aligned_cols=170  Identities=12%  Similarity=0.157  Sum_probs=89.1

Q ss_pred             EEee-ccCCceEEEEcCCCCeEEEEeCC---ce--EEEecCCCCceeEEecccCCCCCCcEEEEEecCeEEEEEcCCC--
Q 001003          915 TIFK-NISGHQGFFLSGSRPCWCMVFRE---RL--RVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSG--  986 (1192)
Q Consensus       915 ~~f~-~i~G~~gVF~~G~rP~wi~~~~g---~l--~~~p~~~~~~v~~~t~F~~~~c~~Gfi~~~~~~~LrI~~l~~~--  986 (1192)
                      +.|. +++|..-|..|.+|-..++--|.   .+  |-.++ . =.+.|.+.|-|   ..||+.-+-+|-.-+--+++.  
T Consensus       137 kVy~~~v~g~~LvVg~~~r~v~iyDLRn~~~~~q~reS~l-k-yqtR~v~~~pn---~eGy~~sSieGRVavE~~d~s~~  211 (323)
T KOG1036|consen  137 KVYCMDVSGNRLVVGTSDRKVLIYDLRNLDEPFQRRESSL-K-YQTRCVALVPN---GEGYVVSSIEGRVAVEYFDDSEE  211 (323)
T ss_pred             eEEEEeccCCEEEEeecCceEEEEEcccccchhhhccccc-e-eEEEEEEEecC---CCceEEEeecceEEEEccCCchH
Confidence            4444 56665555557777755543221   11  11111 1 14666766543   368888777887777777765  


Q ss_pred             ---CccCCccceEE--EecCCCccCeEEEecCCCEEEEEEeecCccccccccccccccccccccccCCCCccccccCcce
Q 001003          987 ---STYDNYWPVQK--VIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTV 1061 (1192)
Q Consensus       987 ---~~~d~~~~vrk--~ipL~~tp~~Iay~~~~~~y~v~~s~~~~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~~~p~~ 1061 (1192)
                         .+|-..-|-.+  -..+.+-++.|++||-.++|+-+-+.                ..                    
T Consensus       212 ~~skkyaFkCHr~~~~~~~~~yPVNai~Fhp~~~tfaTgGsD----------------G~--------------------  255 (323)
T KOG1036|consen  212 AQSKKYAFKCHRLSEKDTEIIYPVNAIAFHPIHGTFATGGSD----------------GI--------------------  255 (323)
T ss_pred             HhhhceeEEeeecccCCceEEEEeceeEeccccceEEecCCC----------------ce--------------------
Confidence               23321222111  01223346677777766666654321                01                    


Q ss_pred             eeeEEEEEcCCCCCCCceeeeeEECCCCcceEEEEEEEeccccCCCCceEEEEEeccccC---cccccCceEEEEEeeeC
Q 001003         1062 EEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQG---EDVAARGRVLLFSTGRN 1138 (1192)
Q Consensus      1062 ~~~sv~Lldp~~~~~twe~id~~el~~~E~v~sv~~v~L~s~~t~~~~~ylaVGTa~~~g---Ed~~~rGRIlvfev~~~ 1138 (1192)
                          |-+-++.    +=+.+  .+|...  -+||.+..|.   .  ....||||+.+.+.   ++..++-+|+|..+.+.
T Consensus       256 ----V~~Wd~~----~rKrl--~q~~~~--~~SI~slsfs---~--dG~~LAia~sy~ye~~~~~~~~~~~i~I~~l~d~  318 (323)
T KOG1036|consen  256 ----VNIWDLF----NRKRL--KQLAKY--ETSISSLSFS---M--DGSLLAIASSYQYERADTPTHERNAIFIRDLTDY  318 (323)
T ss_pred             ----EEEccCc----chhhh--hhccCC--CCceEEEEec---c--CCCeEEEEechhhhcCCCCCCCCCceEEEecccc
Confidence                1111110    11111  133333  2566666664   2  24699999999984   33778899999998764


Q ss_pred             CCCC
Q 001003         1139 ADNP 1142 (1192)
Q Consensus      1139 ~~~~ 1142 (1192)
                      -.+|
T Consensus       319 e~~p  322 (323)
T KOG1036|consen  319 ETKP  322 (323)
T ss_pred             ccCC
Confidence            4443


No 49 
>KOG1539 consensus WD repeat protein [General function prediction only]
Probab=51.26  E-value=6.8e+02  Score=32.59  Aligned_cols=77  Identities=19%  Similarity=0.248  Sum_probs=48.7

Q ss_pred             CcEEEEEEcCCEEEEEEeCCcEEEEEecCCCceEeeccccccccCCCceeEEEeeccCCCCCcccccccccccccCcccc
Q 001003          659 STVLSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEA  738 (1192)
Q Consensus       659 ~~Iv~asi~dpyvlv~~~dg~i~~l~~d~~~~~l~~~~~~~l~~~~~~i~~~~l~~d~~~~~~~~~~~~~~~~~~~~~~~  738 (1192)
                      ...++.+.|++|++++.+.|+|-.|.+.+.-.   ..+-..-+..+.+|..+++  |.                      
T Consensus       451 ~~av~vs~CGNF~~IG~S~G~Id~fNmQSGi~---r~sf~~~~ah~~~V~gla~--D~----------------------  503 (910)
T KOG1539|consen  451 ATAVCVSFCGNFVFIGYSKGTIDRFNMQSGIH---RKSFGDSPAHKGEVTGLAV--DG----------------------  503 (910)
T ss_pred             eEEEEEeccCceEEEeccCCeEEEEEcccCee---ecccccCccccCceeEEEe--cC----------------------
Confidence            34566667999999999999999998876422   1111000123445655543  21                      


Q ss_pred             ccCCCCCCCCCCcEEEEEEecCCeEEEEECCCcee
Q 001003          739 IDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNC  773 (1192)
Q Consensus       739 ~~~~~~~~~~~~~~~l~v~~~~g~l~I~sLP~~~~  773 (1192)
                                 .+..++-+..+|.+..|+.-...+
T Consensus       504 -----------~n~~~vsa~~~Gilkfw~f~~k~l  527 (910)
T KOG1539|consen  504 -----------TNRLLVSAGADGILKFWDFKKKVL  527 (910)
T ss_pred             -----------CCceEEEccCcceEEEEecCCcce
Confidence                       123556677789999999876554


No 50 
>KOG1446 consensus Histone H3 (Lys4) methyltransferase complex and RNA cleavage factor II complex, subunit SWD2 [RNA processing and modification; Chromatin structure and dynamics; Posttranslational modification, protein turnover, chaperones]
Probab=50.31  E-value=4.5e+02  Score=30.23  Aligned_cols=162  Identities=10%  Similarity=0.136  Sum_probs=89.5

Q ss_pred             eCCceEEEecCCCCceeEEecccCCCCCC--cEEEEEecCeEEEEEcCCCCccCCccceEEEecCCCccCeEEEecCCCE
Q 001003          939 FRERLRVHPQLCDGSIVAFTVLHNVNCNH--GFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNL 1016 (1192)
Q Consensus       939 ~~g~l~~~p~~~~~~v~~~t~F~~~~c~~--Gfi~~~~~~~LrI~~l~~~~~~d~~~~vrk~ipL~~tp~~Iay~~~~~~ 1016 (1192)
                      .|..+|+-+= -.+.|++.+.     ||.  .||-..-++++|+=.|....      +.-- +++.. +--+||+|+.-.
T Consensus        89 dNkylRYF~G-H~~~V~sL~~-----sP~~d~FlS~S~D~tvrLWDlR~~~------cqg~-l~~~~-~pi~AfDp~GLi  154 (311)
T KOG1446|consen   89 DNKYLRYFPG-HKKRVNSLSV-----SPKDDTFLSSSLDKTVRLWDLRVKK------CQGL-LNLSG-RPIAAFDPEGLI  154 (311)
T ss_pred             cCceEEEcCC-CCceEEEEEe-----cCCCCeEEecccCCeEEeeEecCCC------CceE-EecCC-CcceeECCCCcE
Confidence            4667777662 2245666554     554  45544445677765555322      1122 33333 235789998888


Q ss_pred             EEEEEeecC----------ccccccccccccccccccccccCCCCccccccCcceeeeEEEEEcCCCCCCCceeeeeEEC
Q 001003         1017 YPLIVSVPV----------LKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPM 1086 (1192)
Q Consensus      1017 y~v~~s~~~----------~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~~~p~~~~~sv~Lldp~~~~~twe~id~~el 1086 (1192)
                      ||+++-...          +.|+.-.  . .+++...+|..-+.+.+-..+.-....+.+.++|.-.    -+.+.++++
T Consensus       155 fA~~~~~~~IkLyD~Rs~dkgPF~tf--~-i~~~~~~ew~~l~FS~dGK~iLlsT~~s~~~~lDAf~----G~~~~tfs~  227 (311)
T KOG1446|consen  155 FALANGSELIKLYDLRSFDKGPFTTF--S-ITDNDEAEWTDLEFSPDGKSILLSTNASFIYLLDAFD----GTVKSTFSG  227 (311)
T ss_pred             EEEecCCCeEEEEEecccCCCCceeE--c-cCCCCccceeeeEEcCCCCEEEEEeCCCcEEEEEccC----CcEeeeEee
Confidence            888764311          1122100  0 1112234444322233222344445667788999863    358889999


Q ss_pred             CCCcceEEEEEEEeccccCCCCceEEEEEeccccCcccccCceEEEEEee
Q 001003         1087 QSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG 1136 (1192)
Q Consensus      1087 ~~~E~v~sv~~v~L~s~~t~~~~~ylaVGTa~~~gEd~~~rGRIlvfev~ 1136 (1192)
                      .+++--..+...-      .....||..|         ...|||++|.+.
T Consensus       228 ~~~~~~~~~~a~f------tPds~Fvl~g---------s~dg~i~vw~~~  262 (311)
T KOG1446|consen  228 YPNAGNLPLSATF------TPDSKFVLSG---------SDDGTIHVWNLE  262 (311)
T ss_pred             ccCCCCcceeEEE------CCCCcEEEEe---------cCCCcEEEEEcC
Confidence            9888755433321      1234688888         678999999993


No 51 
>KOG0291 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=50.06  E-value=6.8e+02  Score=32.26  Aligned_cols=263  Identities=15%  Similarity=0.253  Sum_probs=146.1

Q ss_pred             ccccCCcEEEEeeCCCCEEEE-EecCcE-EEEeCC--cceeeeecCCCCCCCCCCCCCCcEEEEEEc--CCEEEEEEeC-
Q 001003          605 YFVQGRTIAAGNLFGRRRVIQ-VFERGA-RILDGS--YMTQDLSFGPSNSESGSGSENSTVLSVSIA--DPYVLLGMSD-  677 (1192)
Q Consensus       605 F~~~~~TI~ag~l~~~~~IvQ-Vt~~~v-rli~~~--~~~q~i~~~~~~~e~~~~~~~~~Iv~asi~--dpyvlv~~~d-  677 (1192)
                      |..+..-+-|+.+..+..++- -..+|+ .|++-.  .+++.+.+.           ..+|..++++  +.+++++++. 
T Consensus       261 ln~~~~kvtaa~fH~~t~~lvvgFssG~f~LyelP~f~lih~LSis-----------~~~I~t~~~N~tGDWiA~g~~kl  329 (893)
T KOG0291|consen  261 LNQNSSKVTAAAFHKGTNLLVVGFSSGEFGLYELPDFNLIHSLSIS-----------DQKILTVSFNSTGDWIAFGCSKL  329 (893)
T ss_pred             ecccccceeeeeccCCceEEEEEecCCeeEEEecCCceEEEEeecc-----------cceeeEEEecccCCEEEEcCCcc
Confidence            333335566666644443333 234554 466633  356666662           3459999998  9999999876 


Q ss_pred             CcEEEEEecCCCceEeeccccccccCCCceeEEEeeccCCCCCcccccccccccccCccccccCCCCCCCCCCcEEEEEE
Q 001003          678 GSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVC  757 (1192)
Q Consensus       678 g~i~~l~~d~~~~~l~~~~~~~l~~~~~~i~~~~l~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~v~  757 (1192)
                      |.+.+|+...+.-.++.+..      -.+++|+. |   +|                              . ..+++.+
T Consensus       330 gQLlVweWqsEsYVlKQQgH------~~~i~~l~-Y---Sp------------------------------D-gq~iaTG  368 (893)
T KOG0291|consen  330 GQLLVWEWQSESYVLKQQGH------SDRITSLA-Y---SP------------------------------D-GQLIATG  368 (893)
T ss_pred             ceEEEEEeeccceeeecccc------ccceeeEE-E---CC------------------------------C-CcEEEec
Confidence            79999988777665654432      33455542 1   11                              1 1367778


Q ss_pred             ecCCeEEEEECCCceeeEEeeccccccceecccccccccccccccccCCCccCCCCCcccccccccEEEEEEeecCCCCC
Q 001003          758 YESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRWSAHHS  837 (1192)
Q Consensus       758 ~~~g~l~I~sLP~~~~v~~~~~l~~~~~~l~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~eil~~~~g~~~~  837 (1192)
                      -++|.++||..-+--|..+.+.    +..                                    -|.-+.+...     
T Consensus       369 ~eDgKVKvWn~~SgfC~vTFte----Hts------------------------------------~Vt~v~f~~~-----  403 (893)
T KOG0291|consen  369 AEDGKVKVWNTQSGFCFVTFTE----HTS------------------------------------GVTAVQFTAR-----  403 (893)
T ss_pred             cCCCcEEEEeccCceEEEEecc----CCC------------------------------------ceEEEEEEec-----
Confidence            8899999999766333222210    100                                    0111222211     


Q ss_pred             ccEEEEEecCCcEEEEEEeeecCCCCCCCCCCCCccccccccccccccccceeeEEecCCCccCCCCCCCCCCccceEEe
Q 001003          838 RPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREETPHGAPCQRITIF  917 (1192)
Q Consensus       838 ~p~Llv~l~dG~l~~Y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lrF~kv~~~~~~~~~~~~~~g~~~l~~f  917 (1192)
                      .-.|+..--||++-++.+..|.                             ++|-.               ..+.|+..-
T Consensus       404 g~~llssSLDGtVRAwDlkRYr-----------------------------NfRTf---------------t~P~p~Qfs  439 (893)
T KOG0291|consen  404 GNVLLSSSLDGTVRAWDLKRYR-----------------------------NFRTF---------------TSPEPIQFS  439 (893)
T ss_pred             CCEEEEeecCCeEEeeeecccc-----------------------------eeeee---------------cCCCceeee
Confidence            2233444458888777664322                             22211               123333322


Q ss_pred             e---ccCCceEEEEcCCCCeE-EEE-e--CCceE-EEecCCCCceeE--EecccCCCCCCcEEEEE--ecCeEEEEEcCC
Q 001003          918 K---NISGHQGFFLSGSRPCW-CMV-F--RERLR-VHPQLCDGSIVA--FTVLHNVNCNHGFIYVT--SQGILKICQLPS  985 (1192)
Q Consensus       918 ~---~i~G~~gVF~~G~rP~w-i~~-~--~g~l~-~~p~~~~~~v~~--~t~F~~~~c~~Gfi~~~--~~~~LrI~~l~~  985 (1192)
                      .   +.+| . +.++|+.-++ |+. +  .|++. +.+ --++||.+  |.|-.+       .+++  .+.++||=.+=.
T Consensus       440 cvavD~sG-e-lV~AG~~d~F~IfvWS~qTGqllDiLs-GHEgPVs~l~f~~~~~-------~LaS~SWDkTVRiW~if~  509 (893)
T KOG0291|consen  440 CVAVDPSG-E-LVCAGAQDSFEIFVWSVQTGQLLDILS-GHEGPVSGLSFSPDGS-------LLASGSWDKTVRIWDIFS  509 (893)
T ss_pred             EEEEcCCC-C-EEEeeccceEEEEEEEeecCeeeehhc-CCCCcceeeEEccccC-------eEEeccccceEEEEEeec
Confidence            2   2233 2 4444555444 543 3  46655 222 24578885  555322       3433  356888876654


Q ss_pred             CCccCCccceEEEecCCCccCeEEEecCCCEEEEEEeec
Q 001003          986 GSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVP 1024 (1192)
Q Consensus       986 ~~~~d~~~~vrk~ipL~~tp~~Iay~~~~~~y~v~~s~~ 1024 (1192)
                      ..     =.+-. +++.-..-.++++|+.+-.+|+|..-
T Consensus       510 s~-----~~vEt-l~i~sdvl~vsfrPdG~elaVaTldg  542 (893)
T KOG0291|consen  510 SS-----GTVET-LEIRSDVLAVSFRPDGKELAVATLDG  542 (893)
T ss_pred             cC-----ceeee-EeeccceeEEEEcCCCCeEEEEEecc
Confidence            42     25566 88888899999999999999999763


No 52 
>KOG0276 consensus Vesicle coat complex COPI, beta' subunit [Intracellular trafficking, secretion, and vesicular transport]
Probab=49.95  E-value=6.3e+02  Score=31.83  Aligned_cols=45  Identities=20%  Similarity=0.287  Sum_probs=33.2

Q ss_pred             EEEEEecCeEEEEEcCCCCccCCccceEEEecCCCccCeEEEecCCCEEEEEEe
Q 001003          969 FIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVS 1022 (1192)
Q Consensus       969 fi~~~~~~~LrI~~l~~~~~~d~~~~vrk~ipL~~tp~~Iay~~~~~~y~v~~s 1022 (1192)
                      ++.+.+.+.+++..++...      -||+ |  .-+|.++.+.......++++-
T Consensus       437 Llg~~ss~~~~fydW~~~~------lVrr-I--~v~~k~v~w~d~g~lVai~~d  481 (794)
T KOG0276|consen  437 LLGVRSSDFLCFYDWESGE------LVRR-I--EVTSKHVYWSDNGELVAIAGD  481 (794)
T ss_pred             eEEEEeCCeEEEEEcccce------EEEE-E--eeccceeEEecCCCEEEEEec
Confidence            4455667788888888765      7888 4  567899988877677666654


No 53 
>KOG0296 consensus Angio-associated migratory cell protein (contains WD40 repeats) [Function unknown]
Probab=49.24  E-value=4.4e+02  Score=30.96  Aligned_cols=117  Identities=19%  Similarity=0.300  Sum_probs=71.9

Q ss_pred             cCccccCCcEEEEeeCCCCEEEEEecCcEEE-EeCCcceeeeecCCCCCCCCCCCCCCcEEEEEEcCCEEEEEEeCCcEE
Q 001003          603 VDYFVQGRTIAAGNLFGRRRVIQVFERGARI-LDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIR  681 (1192)
Q Consensus       603 ~gF~~~~~TI~ag~l~~~~~IvQVt~~~vrl-i~~~~~~q~i~~~~~~~e~~~~~~~~~Iv~asi~dpyvlv~~~dg~i~  681 (1192)
                      ..|..++.-|+-|-|.+.-+|-++-+++.+. ++..-  .+       -+|-.-..         ..+.++-+..||++-
T Consensus       112 ~~FshdgtlLATGdmsG~v~v~~~stg~~~~~~~~e~--~d-------ieWl~WHp---------~a~illAG~~DGsvW  173 (399)
T KOG0296|consen  112 CSFSHDGTLLATGDMSGKVLVFKVSTGGEQWKLDQEV--ED-------IEWLKWHP---------RAHILLAGSTDGSVW  173 (399)
T ss_pred             EEEccCceEEEecCCCccEEEEEcccCceEEEeeccc--Cc-------eEEEEecc---------cccEEEeecCCCcEE
Confidence            4788999999999996666677776666543 22110  00       02221001         345667789999999


Q ss_pred             EEEecCCCceEeeccccccccCCCceeEEEeeccCCCCCcccccccccccccCccccccCCCCCCCCCCcEEEEEEecCC
Q 001003          682 LLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESG  761 (1192)
Q Consensus       682 ~l~~d~~~~~l~~~~~~~l~~~~~~i~~~~l~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~v~~~~g  761 (1192)
                      +|++.++.. .++     +......++|.++-.|                                  .. -++.+.++|
T Consensus       174 mw~ip~~~~-~kv-----~~Gh~~~ct~G~f~pd----------------------------------GK-r~~tgy~dg  212 (399)
T KOG0296|consen  174 MWQIPSQAL-CKV-----MSGHNSPCTCGEFIPD----------------------------------GK-RILTGYDDG  212 (399)
T ss_pred             EEECCCcce-eeE-----ecCCCCCcccccccCC----------------------------------Cc-eEEEEecCc
Confidence            999876522 221     2223445666655322                                  11 345566799


Q ss_pred             eEEEEECCCceeeEEee
Q 001003          762 ALEIFDVPNFNCVFTVD  778 (1192)
Q Consensus       762 ~l~I~sLP~~~~v~~~~  778 (1192)
                      ++.+|.+-+..+.+..+
T Consensus       213 ti~~Wn~ktg~p~~~~~  229 (399)
T KOG0296|consen  213 TIIVWNPKTGQPLHKIT  229 (399)
T ss_pred             eEEEEecCCCceeEEec
Confidence            99999998888877765


No 54 
>PF10282 Lactonase:  Lactonase, 7-bladed beta-propeller;  InterPro: IPR019405  6-phosphogluconolactonases (6PGL) 3.1.1.31 from EC, which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate, is one of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE IPR022528 from INTERPRO [] and the Pseudomonas aeruginosa DevB IPR005900 from INTERPRO [] types.  This entry contains bacterial 6-phosphogluconolactonases (6PGL) YbhE-type 3.1.1.31 from EC which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase 5.5.1.5 from EC and muconate cycloisomerase 5.5.1.1 from EC, which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold [].; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H.
Probab=49.07  E-value=4.9e+02  Score=30.36  Aligned_cols=52  Identities=19%  Similarity=0.386  Sum_probs=37.4

Q ss_pred             CCc-EEEEEec--CeEEEEEcCCC-CccCCccceEEEecC-CCccCeEEEecCCCEEEEEE
Q 001003          966 NHG-FIYVTSQ--GILKICQLPSG-STYDNYWPVQKVIPL-KATPHQITYFAEKNLYPLIV 1021 (1192)
Q Consensus       966 ~~G-fi~~~~~--~~LrI~~l~~~-~~~d~~~~vrk~ipL-~~tp~~Iay~~~~~~y~v~~ 1021 (1192)
                      |+| |+|++..  +.+.+..+++. .++   -.++. +|. |..||.++.+|+.+.++|+.
T Consensus       254 pdg~~lyvsnr~~~sI~vf~~d~~~g~l---~~~~~-~~~~G~~Pr~~~~s~~g~~l~Va~  310 (345)
T PF10282_consen  254 PDGRFLYVSNRGSNSISVFDLDPATGTL---TLVQT-VPTGGKFPRHFAFSPDGRYLYVAN  310 (345)
T ss_dssp             TTSSEEEEEECTTTEEEEEEECTTTTTE---EEEEE-EEESSSSEEEEEE-TTSSEEEEEE
T ss_pred             cCCCEEEEEeccCCEEEEEEEecCCCce---EEEEE-EeCCCCCccEEEEeCCCCEEEEEe
Confidence            444 8999775  47889999543 333   25666 888 66799999999988887754


No 55 
>KOG1274 consensus WD40 repeat protein [General function prediction only]
Probab=48.40  E-value=7.7e+02  Score=32.41  Aligned_cols=53  Identities=13%  Similarity=0.271  Sum_probs=36.7

Q ss_pred             cCcEEEEeCCcceeeeecCCCCCCCCCCCCCCcEEEEEE--cCCEEEEEEeCCcEEEEEecCCC
Q 001003          628 ERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSI--ADPYVLLGMSDGSIRLLVGDPST  689 (1192)
Q Consensus       628 ~~~vrli~~~~~~q~i~~~~~~~e~~~~~~~~~Iv~asi--~dpyvlv~~~dg~i~~l~~d~~~  689 (1192)
                      ...|++++.+..-|+..+-         .....|.+.+.  .+.++++..-||.|.+|.++...
T Consensus       117 D~~vK~~~~~D~s~~~~lr---------gh~apVl~l~~~p~~~fLAvss~dG~v~iw~~~~~~  171 (933)
T KOG1274|consen  117 DTAVKLLNLDDSSQEKVLR---------GHDAPVLQLSYDPKGNFLAVSSCDGKVQIWDLQDGI  171 (933)
T ss_pred             ceeEEEEeccccchheeec---------ccCCceeeeeEcCCCCEEEEEecCceEEEEEcccch
Confidence            4467777766443433331         12345888887  57899999999999999987643


No 56 
>KOG1273 consensus WD40 repeat protein [General function prediction only]
Probab=48.13  E-value=4.9e+02  Score=30.06  Aligned_cols=61  Identities=13%  Similarity=0.242  Sum_probs=36.0

Q ss_pred             CceEEEEEeccccCcccccCceEEEEEeeeCCCCCceeEeecccCcccccchhcccCceEEEeecceEEeee
Q 001003         1108 NETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSYGPLFSSVQIDFASHFFAICSNSFVFVFL 1179 (1192)
Q Consensus      1108 ~~~ylaVGTa~~~gEd~~~rGRIlvfev~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~ 1179 (1192)
                      ..+|++-||+        ..=.|||||=.-  +.+= +..|+-.|-..--|+-++-.|.+|....-.|+||-
T Consensus       260 dgeYv~a~s~--------~aHaLYIWE~~~--GsLV-KILhG~kgE~l~DV~whp~rp~i~si~sg~v~iw~  320 (405)
T KOG1273|consen  260 DGEYVCAGSA--------RAHALYIWEKSI--GSLV-KILHGTKGEELLDVNWHPVRPIIASIASGVVYIWA  320 (405)
T ss_pred             CccEEEeccc--------cceeEEEEecCC--ccee-eeecCCchhheeecccccceeeeeeccCCceEEEE
Confidence            4689999984        344599998643  2221 11223332222234456667788888777888884


No 57 
>KOG0650 consensus WD40 repeat nucleolar protein Bop1, involved in ribosome biogenesis [Translation, ribosomal structure and biogenesis]
Probab=47.91  E-value=6.6e+02  Score=31.49  Aligned_cols=92  Identities=11%  Similarity=0.117  Sum_probs=52.3

Q ss_pred             eEEEEcCCCCeEEEEeCCceEEEecCCCC-------ceeEEecccCCCCCCcEEEEEecCeEEEEEcCCCCccCCccceE
Q 001003          924 QGFFLSGSRPCWCMVFRERLRVHPQLCDG-------SIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQ  996 (1192)
Q Consensus       924 ~gVF~~G~rP~wi~~~~g~l~~~p~~~~~-------~v~~~t~F~~~~c~~Gfi~~~~~~~LrI~~l~~~~~~d~~~~vr  996 (1192)
                      ..|-.-+.+|+++++++..+|++-+....       -...++.|.-..-.+.+|+-+-+  =|+|-.+-.+   ..-|++
T Consensus       570 q~v~FHPs~p~lfVaTq~~vRiYdL~kqelvKkL~tg~kwiS~msihp~GDnli~gs~d--~k~~WfDldl---sskPyk  644 (733)
T KOG0650|consen  570 QRVKFHPSKPYLFVATQRSVRIYDLSKQELVKKLLTGSKWISSMSIHPNGDNLILGSYD--KKMCWFDLDL---SSKPYK  644 (733)
T ss_pred             eEEEecCCCceEEEEeccceEEEehhHHHHHHHHhcCCeeeeeeeecCCCCeEEEecCC--CeeEEEEccc---CcchhH
Confidence            45666677887777777778877754211       12222222221122333443333  3444444333   223778


Q ss_pred             EEecCCC-ccCeEEEecCCCEEEEEE
Q 001003          997 KVIPLKA-TPHQITYFAEKNLYPLIV 1021 (1192)
Q Consensus       997 k~ipL~~-tp~~Iay~~~~~~y~v~~ 1021 (1192)
                      + +-+.. -.|.||||+.-.+|+.+.
T Consensus       645 ~-lr~H~~avr~Va~H~ryPLfas~s  669 (733)
T KOG0650|consen  645 T-LRLHEKAVRSVAFHKRYPLFASGS  669 (733)
T ss_pred             H-hhhhhhhhhhhhhccccceeeeec
Confidence            7 77766 599999999888888764


No 58 
>KOG0263 consensus Transcription initiation factor TFIID, subunit TAF5 (also component of histone acetyltransferase SAGA) [Transcription]
Probab=47.46  E-value=1.3e+02  Score=38.13  Aligned_cols=66  Identities=15%  Similarity=0.091  Sum_probs=45.0

Q ss_pred             CceEEEEEeccccCcccccCceEEEEEeeeCCCCCceeEeecccCcccccchhcccCceEEEeecc-eEEeeehhhheee
Q 001003         1108 NETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSYGPLFSSVQIDFASHFFAICSNS-FVFVFLFSFLRSL 1186 (1192)
Q Consensus      1108 ~~~ylaVGTa~~~gEd~~~rGRIlvfev~~~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~-~~~~~~~~~~~~~ 1186 (1192)
                      .+.|||-|         ..-|+|.|||+-...--.+ +..|   ++..+|+.--.-|-+||.|+.. .|++|-+--.+.+
T Consensus       588 ~Gr~LaSg---------~ed~~I~iWDl~~~~~v~~-l~~H---t~ti~SlsFS~dg~vLasgg~DnsV~lWD~~~~~~~  654 (707)
T KOG0263|consen  588 CGRYLASG---------DEDGLIKIWDLANGSLVKQ-LKGH---TGTIYSLSFSRDGNVLASGGADNSVRLWDLTKVIEL  654 (707)
T ss_pred             CCceEeec---------ccCCcEEEEEcCCCcchhh-hhcc---cCceeEEEEecCCCEEEecCCCCeEEEEEchhhccc
Confidence            35688888         5679999999976222222 2222   4444455555789999999865 9999977666655


No 59 
>KOG0263 consensus Transcription initiation factor TFIID, subunit TAF5 (also component of histone acetyltransferase SAGA) [Transcription]
Probab=46.20  E-value=4.7e+02  Score=33.49  Aligned_cols=110  Identities=15%  Similarity=0.194  Sum_probs=64.5

Q ss_pred             cEEEEeeCCCCEEEEEecCcEEEEeCCc--ceeee--------ecCCCCCCCCCCCCCCcEEEEEE--cCCEEEEEEeCC
Q 001003          611 TIAAGNLFGRRRVIQVFERGARILDGSY--MTQDL--------SFGPSNSESGSGSENSTVLSVSI--ADPYVLLGMSDG  678 (1192)
Q Consensus       611 TI~ag~l~~~~~IvQVt~~~vrli~~~~--~~q~i--------~~~~~~~e~~~~~~~~~Iv~asi--~dpyvlv~~~dg  678 (1192)
                      -|+||-+ .+-.-|+++|++-.+..+..  .++-|        .+|.     |   ....|+++.+  ++-|++-+-.||
T Consensus       529 Rifaghl-sDV~cv~FHPNs~Y~aTGSsD~tVRlWDv~~G~~VRiF~-----G---H~~~V~al~~Sp~Gr~LaSg~ed~  599 (707)
T KOG0263|consen  529 RIFAGHL-SDVDCVSFHPNSNYVATGSSDRTVRLWDVSTGNSVRIFT-----G---HKGPVTALAFSPCGRYLASGDEDG  599 (707)
T ss_pred             hhhcccc-cccceEEECCcccccccCCCCceEEEEEcCCCcEEEEec-----C---CCCceEEEEEcCCCceEeecccCC
Confidence            4667777 45556777777655544321  12211        1220     1   2334555544  789999999999


Q ss_pred             cEEEEEecCCCceEeeccccccccCCCceeEEEeeccCCCCCcccccccccccccCccccccCCCCCCCCCCcEEEEEEe
Q 001003          679 SIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCY  758 (1192)
Q Consensus       679 ~i~~l~~d~~~~~l~~~~~~~l~~~~~~i~~~~l~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~v~~  758 (1192)
                      .|.+|.+.......++...      ...|.++++-.|                                   ..+++++-
T Consensus       600 ~I~iWDl~~~~~v~~l~~H------t~ti~SlsFS~d-----------------------------------g~vLasgg  638 (707)
T KOG0263|consen  600 LIKIWDLANGSLVKQLKGH------TGTIYSLSFSRD-----------------------------------GNVLASGG  638 (707)
T ss_pred             cEEEEEcCCCcchhhhhcc------cCceeEEEEecC-----------------------------------CCEEEecC
Confidence            9999988664442222222      223444444211                                   12788888


Q ss_pred             cCCeEEEEECCC
Q 001003          759 ESGALEIFDVPN  770 (1192)
Q Consensus       759 ~~g~l~I~sLP~  770 (1192)
                      .++++.+|++-.
T Consensus       639 ~DnsV~lWD~~~  650 (707)
T KOG0263|consen  639 ADNSVRLWDLTK  650 (707)
T ss_pred             CCCeEEEEEchh
Confidence            899999997643


No 60 
>PF02333 Phytase:  Phytase;  InterPro: IPR003431 Phytase (3.1.3.8 from EC) (phytate 3-phosphatase) is a secreted enzyme which hydrolyses phytate to release inorganic phosphate. This family appears to represent a novel enzyme that shows phytase activity () and has been shown to consist of a single structural unit with a six-bladed propeller folding architecture ().; GO: 0016158 3-phytase activity; PDB: 3AMS_A 3AMR_A 1QLG_A 2POO_A 1H6L_A 1CVM_A 1POO_A.
Probab=45.51  E-value=2.9e+02  Score=32.93  Aligned_cols=61  Identities=28%  Similarity=0.449  Sum_probs=37.0

Q ss_pred             CcEEEEEecCCCceEeecc-cc-ccccCCCceeEEEeeccCCCCCcccccccccccccCccccccCCCCCCCCCCcEEEE
Q 001003          678 GSIRLLVGDPSTCTVSVQT-PA-AIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSV  755 (1192)
Q Consensus       678 g~i~~l~~d~~~~~l~~~~-~~-~l~~~~~~i~~~~l~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~  755 (1192)
                      .+|.+|.+++.+..|.... +. .+...-.++-.+|||.+..                               ...++++
T Consensus       127 n~l~~f~id~~~g~L~~v~~~~~p~~~~~~e~yGlcly~~~~-------------------------------~g~~ya~  175 (381)
T PF02333_consen  127 NSLRLFRIDPDTGELTDVTDPAAPIATDLSEPYGLCLYRSPS-------------------------------TGALYAF  175 (381)
T ss_dssp             -EEEEEEEETTTTEEEE-CBTTC-EE-SSSSEEEEEEEE-TT-------------------------------T--EEEE
T ss_pred             CeEEEEEecCCCCcceEcCCCCcccccccccceeeEEeecCC-------------------------------CCcEEEE
Confidence            4699999998655454221 11 1222234567789996531                               1357999


Q ss_pred             EEecCCeEEEEECC
Q 001003          756 VCYESGALEIFDVP  769 (1192)
Q Consensus       756 v~~~~g~l~I~sLP  769 (1192)
                      +...+|.++-|.|-
T Consensus       176 v~~k~G~~~Qy~L~  189 (381)
T PF02333_consen  176 VNGKDGRVEQYELT  189 (381)
T ss_dssp             EEETTSEEEEEEEE
T ss_pred             EecCCceEEEEEEE
Confidence            99999999988873


No 61 
>KOG0288 consensus WD40 repeat protein TipD [General function prediction only]
Probab=45.39  E-value=2.1e+02  Score=33.93  Aligned_cols=93  Identities=13%  Similarity=0.167  Sum_probs=50.9

Q ss_pred             EEEEcCCCCCCCceeeeeEECCCCcceEEEEEEEeccccCCCCceEEEEEeccccCcccccCceEEEEEeeeCCCCCcee
Q 001003         1066 VRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNL 1145 (1192)
Q Consensus      1066 v~Lldp~~~~~twe~id~~el~~~E~v~sv~~v~L~s~~t~~~~~ylaVGTa~~~gEd~~~rGRIlvfev~~~~~~~e~~ 1145 (1192)
                      ++++|-.    +.++...|.-+.+--.-....|.|.     ....|++-|         .+-|+||||++..  ++.|.+
T Consensus       365 l~viDlR----t~eI~~~~sA~g~k~asDwtrvvfS-----pd~~YvaAG---------S~dgsv~iW~v~t--gKlE~~  424 (459)
T KOG0288|consen  365 LKVIDLR----TKEIRQTFSAEGFKCASDWTRVVFS-----PDGSYVAAG---------SADGSVYIWSVFT--GKLEKV  424 (459)
T ss_pred             eeeeecc----cccEEEEeeccccccccccceeEEC-----CCCceeeec---------cCCCcEEEEEccC--ceEEEE
Confidence            4455543    6666666654443333334444453     246899999         7899999999976  455554


Q ss_pred             EeecccCcccccchhcccCceEEEeecc-eEEee
Q 001003         1146 VLSGSYGPLFSSVQIDFASHFFAICSNS-FVFVF 1178 (1192)
Q Consensus      1146 ~~~~~~~~~~~~~~~~~~~~~~a~~~~~-~~~~~ 1178 (1192)
                      ...----++..++.-+--|.=|++|... +|.+|
T Consensus       425 l~~s~s~~aI~s~~W~~sG~~Llsadk~~~v~lW  458 (459)
T KOG0288|consen  425 LSLSTSNAAITSLSWNPSGSGLLSADKQKAVTLW  458 (459)
T ss_pred             eccCCCCcceEEEEEcCCCchhhcccCCcceEec
Confidence            3111111122223323345556666665 56665


No 62 
>KOG0291 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=44.48  E-value=8.2e+02  Score=31.59  Aligned_cols=137  Identities=17%  Similarity=0.187  Sum_probs=74.4

Q ss_pred             CCceEEEeCCCcCEEEEEEecCCCCC----CCCcccccccCCCcceEEEEEeccceEEEEecCceeeeecccCccccCCc
Q 001003          536 QSNYELVELPGCKGIWTVYHKSSRGH----NADSSRMAAYDDEYHAYLIISLEARTMVLETADLLTEVTESVDYFVQGRT  611 (1192)
Q Consensus       536 ~gsL~i~eLpg~~~iWtv~~~~~~~~----~~~~~~~~~~~~~~~~yLvlS~~~~T~Vl~~g~~~eEv~~~~gF~~~~~T  611 (1192)
                      +|-+.+.+||+..=|-.+......-.    |..+.-.+-.....-..||---.+++.||+-...+..++ ...+..+++-
T Consensus       286 sG~f~LyelP~f~lih~LSis~~~I~t~~~N~tGDWiA~g~~klgQLlVweWqsEsYVlKQQgH~~~i~-~l~YSpDgq~  364 (893)
T KOG0291|consen  286 SGEFGLYELPDFNLIHSLSISDQKILTVSFNSTGDWIAFGCSKLGQLLVWEWQSESYVLKQQGHSDRIT-SLAYSPDGQL  364 (893)
T ss_pred             CCeeEEEecCCceEEEEeecccceeeEEEecccCCEEEEcCCccceEEEEEeeccceeeecccccccee-eEEECCCCcE
Confidence            34444568887666666654321000    000000000112233455556777888888776666654 3456667677


Q ss_pred             EEEEeeCCCCEEEEEecCcEEEEeCCcceeeeecCCCCCCCCCCCCCCcEEEEEEcCCEEEEEEeCCcEEEEEecCCCc
Q 001003          612 IAAGNLFGRRRVIQVFERGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIADPYVLLGMSDGSIRLLVGDPSTC  690 (1192)
Q Consensus       612 I~ag~l~~~~~IvQVt~~~vrli~~~~~~q~i~~~~~~~e~~~~~~~~~Iv~asi~dpyvlv~~~dg~i~~l~~d~~~~  690 (1192)
                      |+.|.=          .+.|++++....--.+.+.       .+..+...++-+.....++-.+-||+|+.|.+.....
T Consensus       365 iaTG~e----------DgKVKvWn~~SgfC~vTFt-------eHts~Vt~v~f~~~g~~llssSLDGtVRAwDlkRYrN  426 (893)
T KOG0291|consen  365 IATGAE----------DGKVKVWNTQSGFCFVTFT-------EHTSGVTAVQFTARGNVLLSSSLDGTVRAWDLKRYRN  426 (893)
T ss_pred             EEeccC----------CCcEEEEeccCceEEEEec-------cCCCceEEEEEEecCCEEEEeecCCeEEeeeecccce
Confidence            766653          3347777644311112221       1113444566666777788788899999998875543


No 63 
>KOG1897 consensus Damage-specific DNA binding complex, subunit DDB1 [Replication, recombination and repair]
Probab=41.98  E-value=2.4e+02  Score=37.20  Aligned_cols=64  Identities=20%  Similarity=0.131  Sum_probs=45.8

Q ss_pred             hhhhccCCCceeeEEEEEeecCCCCCCCCCcccccccccccCCCCCCCCCCCeEEEE-----------cCCeEEEEEEEE
Q 001003            5 AYKMMHWPTGIANCGSGFITHSRADYVPQIPLIQTEELDSELPSKRGIGPVPNLVVT-----------AANVIEIYVVRV   73 (1192)
Q Consensus         5 ~~~~~~~pT~V~hsv~~~Ft~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~nLVva-----------k~n~LeIy~v~~   73 (1192)
                      .+.+..++-.+..++.|+|++....                           -+||+           +..+|-||++..
T Consensus       764 ~~hef~~~E~~~Si~s~~~~~d~~t---------------------------~~vVGT~~v~Pde~ep~~GRIivfe~~e  816 (1096)
T KOG1897|consen  764 SSHEFERNETALSIISCKFTDDPNT---------------------------YYVVGTGLVYPDENEPVNGRIIVFEFEE  816 (1096)
T ss_pred             eeccccccceeeeeeeeeecCCCce---------------------------EEEEEEEeeccCCCCcccceEEEEEEec
Confidence            3457888889999999999976532                           44544           234666676541


Q ss_pred             eccCCccccCCccccccccccccccccEEEEEEEEeeeEEeEEEEE
Q 001003           74 QEEGSKESKNSGETKRRVLMDGISAASLELVCHYRLHGNVESLAIL  119 (1192)
Q Consensus        74 ~~~g~~~~~~~~~~~~~~~~~~~~~~~L~lv~e~~l~G~I~~l~~~  119 (1192)
                                              ..||++++|..+-|.+.+|..+
T Consensus       817 ------------------------~~~L~~v~e~~v~Gav~aL~~f  838 (1096)
T KOG1897|consen  817 ------------------------LNSLELVAETVVKGAVYALVEF  838 (1096)
T ss_pred             ------------------------CCceeeeeeeeeccceeehhhh
Confidence                                    1259999999999999887643


No 64 
>KOG0277 consensus Peroxisomal targeting signal type 2 receptor [Intracellular trafficking, secretion, and vesicular transport]
Probab=41.21  E-value=34  Score=37.81  Aligned_cols=67  Identities=19%  Similarity=0.289  Sum_probs=47.8

Q ss_pred             ceEEEEEeccccCcccccCceEEEEEeeeCCCCCceeEee---cccCcccccchhcccCceEEEeecceEEeeeh
Q 001003         1109 ETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLS---GSYGPLFSSVQIDFASHFFAICSNSFVFVFLF 1180 (1192)
Q Consensus      1109 ~~ylaVGTa~~~gEd~~~rGRIlvfev~~~~~~~e~~~~~---~~~~~~~~~~~~~~~~~~~a~~~~~~~~~~~~ 1180 (1192)
                      ..+|||-|+-++|  +.-.|||+|.|+...++--|..+..   .-|.-+-+   +--...++|+|+--+++||--
T Consensus        21 ~nrLavAt~q~yG--l~G~G~L~ile~~~~~gi~e~~s~d~~D~LfdV~Ws---e~~e~~~~~a~GDGSLrl~d~   90 (311)
T KOG0277|consen   21 ENRLAVATAQHYG--LAGNGRLFILEVTDPKGIQECQSYDTEDGLFDVAWS---ENHENQVIAASGDGSLRLFDL   90 (311)
T ss_pred             cchhheeehhhcc--cccCceEEEEecCCCCCeEEEEeeecccceeEeeec---CCCcceEEEEecCceEEEecc
Confidence            5789999999999  7889999999998544444443321   22333334   445677889999999999963


No 65 
>KOG2048 consensus WD40 repeat protein [General function prediction only]
Probab=40.62  E-value=8.7e+02  Score=30.82  Aligned_cols=28  Identities=11%  Similarity=0.058  Sum_probs=19.5

Q ss_pred             CCCeEEECCCCcEEEEEEcCceEEEEeC
Q 001003          179 RGPLVKVDPQGRCGGVLVYGLQMIILKA  206 (1192)
Q Consensus       179 ~~~~l~VDP~~Rc~~l~~y~~~L~ilP~  206 (1192)
                      +...+.+-|.|..+|+.=-.+.+-++-+
T Consensus       477 ~I~~l~~SsdG~yiaa~~t~g~I~v~nl  504 (691)
T KOG2048|consen  477 SISRLVVSSDGNYIAAISTRGQIFVYNL  504 (691)
T ss_pred             cceeEEEcCCCCEEEEEeccceEEEEEc
Confidence            3467889999998888755555555544


No 66 
>TIGR02276 beta_rpt_yvtn 40-residue YVTN family beta-propeller repeat. This repeat of about 40 amino acids is found in up to 14 copies per protein. Archaea Methanosarcina mazei and Methanosarcina acetivorans each have over 10 genes that encode tandem copies of this repeat, which is also found in other species. PSIPRED predicts with high confidence that each 40-residue repeats contains four beta strands. This model overlaps somewhat with the NHL repeat (Pfam pfam01436) and also shows sequence similarity to the WD domain, G-beta repeat (Pfam pfam00400).
Probab=40.54  E-value=64  Score=24.63  Aligned_cols=40  Identities=18%  Similarity=0.281  Sum_probs=27.0

Q ss_pred             CcEEEEEecCeEEEEEcCCCCccCCccceEEEecCCCccCeEEEe
Q 001003          967 HGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYF 1011 (1192)
Q Consensus       967 ~Gfi~~~~~~~LrI~~l~~~~~~d~~~~vrk~ipL~~tp~~Iay~ 1011 (1192)
                      ..++|++..+.=.|..++..    ....+++ ++++..|+.|+++
T Consensus         3 ~~~lyv~~~~~~~v~~id~~----~~~~~~~-i~vg~~P~~i~~~   42 (42)
T TIGR02276         3 GTKLYVTNSGSNTVSVIDTA----TNKVIAT-IPVGGYPFGVAVS   42 (42)
T ss_pred             CCEEEEEeCCCCEEEEEECC----CCeEEEE-EECCCCCceEEeC
Confidence            35678876544444445543    2347788 9999999999874


No 67 
>PF08596 Lgl_C:  Lethal giant larvae(Lgl) like, C-terminal;  InterPro: IPR013905  The Lethal giant larvae (Lgl) tumour suppressor protein is conserved from yeast to mammals. The Lgl protein functions in cell polarity, at least in part, by regulating SNARE-mediated membrane delivery events at the cell surface []. The N-terminal half of Lgl members contains WD40 repeats (see IPR001680 from INTERPRO), while the C-terminal half appears specific to the protein []. ; PDB: 2OAJ_A.
Probab=39.37  E-value=1e+02  Score=37.03  Aligned_cols=28  Identities=18%  Similarity=0.494  Sum_probs=22.0

Q ss_pred             cEEEEEEecCCeEEEEECCCceeeEEee
Q 001003          751 DIYSVVCYESGALEIFDVPNFNCVFTVD  778 (1192)
Q Consensus       751 ~~~l~v~~~~g~l~I~sLP~~~~v~~~~  778 (1192)
                      ...++++..+|.+++||||+|+.+...+
T Consensus       272 ~~~Lv~l~~~G~i~i~SLP~Lkei~~~~  299 (395)
T PF08596_consen  272 GYCLVCLFNNGSIRIYSLPSLKEIKSVS  299 (395)
T ss_dssp             EEEEEEEETTSEEEEEETTT--EEEEEE
T ss_pred             ceEEEEEECCCcEEEEECCCchHhhccc
Confidence            3566678899999999999999988776


No 68 
>KOG2066 consensus Vacuolar assembly/sorting protein VPS41 [Intracellular trafficking, secretion, and vesicular transport]
Probab=38.50  E-value=1.4e+02  Score=38.30  Aligned_cols=85  Identities=16%  Similarity=0.248  Sum_probs=60.1

Q ss_pred             CCceEEEEcCCCCeEEEEeCCceEEEecCCCCceeEEecccCCCCCCcEEEEEecCeEEEEEcCCCCcc-CCccceEEEe
Q 001003          921 SGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTY-DNYWPVQKVI  999 (1192)
Q Consensus       921 ~G~~gVF~~G~rP~wi~~~~g~l~~~p~~~~~~v~~~t~F~~~~c~~Gfi~~~~~~~LrI~~l~~~~~~-d~~~~vrk~i  999 (1192)
                      ||..| .+..+|- |+-.-.. ...+.  .+|+|++.+=      -..+|+.+++-+.|++..+..+.+ .-+||.+. +
T Consensus       135 GG~ag-lvL~er~-wlgnk~~-v~l~~--~eG~I~~i~W------~g~lIAWand~Gv~vyd~~~~~~l~~i~~p~~~-~  202 (846)
T KOG2066|consen  135 GGMAG-LVLSERN-WLGNKDS-VVLSE--GEGPIHSIKW------RGNLIAWANDDGVKVYDTPTRQRLTNIPPPSQS-V  202 (846)
T ss_pred             cCcce-EEEehhh-hhcCccc-eeeec--CccceEEEEe------cCcEEEEecCCCcEEEeccccceeeccCCCCCC-C
Confidence            66667 6666666 6643222 33444  4578887754      345788888889999999988765 45778877 7


Q ss_pred             cCCCccCeEEEecCCCEE
Q 001003         1000 PLKATPHQITYFAEKNLY 1017 (1192)
Q Consensus      1000 pL~~tp~~Iay~~~~~~y 1017 (1192)
                      -....|-++.+.++.++.
T Consensus       203 R~e~fpphl~W~~~~~LV  220 (846)
T KOG2066|consen  203 RPELFPPHLHWQDEDRLV  220 (846)
T ss_pred             CcccCCCceEecCCCeEE
Confidence            777789999999887655


No 69 
>PTZ00421 coronin; Provisional
Probab=38.47  E-value=7.3e+02  Score=30.78  Aligned_cols=31  Identities=16%  Similarity=0.200  Sum_probs=23.5

Q ss_pred             CCcEEEEEEc--C-CEEEEEEeCCcEEEEEecCC
Q 001003          658 NSTVLSVSIA--D-PYVLLGMSDGSIRLLVGDPS  688 (1192)
Q Consensus       658 ~~~Iv~asi~--d-pyvlv~~~dg~i~~l~~d~~  688 (1192)
                      ...|.+++.+  + .+++.+..||+|.+|.+...
T Consensus        75 ~~~V~~v~fsP~d~~~LaSgS~DgtIkIWdi~~~  108 (493)
T PTZ00421         75 EGPIIDVAFNPFDPQKLFTASEDGTIMGWGIPEE  108 (493)
T ss_pred             CCCEEEEEEcCCCCCEEEEEeCCCEEEEEecCCC
Confidence            3458888875  3 47777889999999988653


No 70 
>KOG0319 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=38.13  E-value=9.9e+02  Score=30.72  Aligned_cols=164  Identities=13%  Similarity=0.110  Sum_probs=87.9

Q ss_pred             CceEEEecCCCCceeEEecccCCCCCCcEEEEEecCeEEEEEcCCCCccCCccceEEEecCCCccCeEEEecCCCEEEEE
Q 001003          941 ERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLI 1020 (1192)
Q Consensus       941 g~l~~~p~~~~~~v~~~t~F~~~~c~~Gfi~~~~~~~LrI~~l~~~~~~d~~~~vrk~ipL~~tp~~Iay~~~~~~y~v~ 1020 (1192)
                      ..++.-|+  +..+.+..-.+++-=..|-.+++.-+.-..+.+++...-  -.-.++ .|-.+-..+.+|-+.++.+.++
T Consensus       225 ~~l~~lp~--ye~~E~vv~l~~~~~~~~~~~~TaG~~g~~~~~d~es~~--~~~~~~-~~~~~e~~~~~~~~~~~~~l~v  299 (775)
T KOG0319|consen  225 KKLKTLPL--YESLESVVRLREELGGKGEYIITAGGSGVVQYWDSESGK--CVYKQR-QSDSEEIDHLLAIESMSQLLLV  299 (775)
T ss_pred             hhhheech--hhheeeEEEechhcCCcceEEEEecCCceEEEEecccch--hhhhhc-cCCchhhhcceeccccCceEEE
Confidence            55676674  467888887777443344444444334444445443210  112233 4433336777887777777777


Q ss_pred             EeecCccccccccccccccccccccccCCCCccccccCcceeeeEEEEEcCCCCCCCceeeeeEECCCCcceEEEEEEEe
Q 001003         1021 VSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTL 1100 (1192)
Q Consensus      1021 ~s~~~~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~~~p~~~~~sv~Lldp~~~~~twe~id~~el~~~E~v~sv~~v~L 1100 (1192)
                      +..++                                        +.|+|..    +-++... =..-||.++.|+.+  
T Consensus       300 taeQn----------------------------------------l~l~d~~----~l~i~k~-ivG~ndEI~Dm~~l--  332 (775)
T KOG0319|consen  300 TAEQN----------------------------------------LFLYDED----ELTIVKQ-IVGYNDEILDMKFL--  332 (775)
T ss_pred             Eccce----------------------------------------EEEEEcc----ccEEehh-hcCCchhheeeeec--
Confidence            65421                                        1123221    1122111 12347788888876  


Q ss_pred             ccccCCCCceEEEEEeccccCcccccCceEEEEEeee-----CCCCCceeEeecccCcccccchhcccCceEEEeecc-e
Q 001003         1101 FNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGR-----NADNPQNLVLSGSYGPLFSSVQIDFASHFFAICSNS-F 1174 (1192)
Q Consensus      1101 ~s~~t~~~~~ylaVGTa~~~gEd~~~rGRIlvfev~~-----~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~-~ 1174 (1192)
                           ++-..++||-|.         .+++.+|++-.     -+++-|.+-      ++..    --.|-+||+|+-- +
T Consensus       333 -----G~e~~~laVATN---------s~~lr~y~~~~~~c~ii~GH~e~vl------SL~~----~~~g~llat~sKD~s  388 (775)
T KOG0319|consen  333 -----GPEESHLAVATN---------SPELRLYTLPTSYCQIIPGHTEAVL------SLDV----WSSGDLLATGSKDKS  388 (775)
T ss_pred             -----CCccceEEEEeC---------CCceEEEecCCCceEEEeCchhhee------eeee----cccCcEEEEecCCce
Confidence                 223679999975         45666774432     223334332      1111    1356699999875 8


Q ss_pred             EEeeeh
Q 001003         1175 VFVFLF 1180 (1192)
Q Consensus      1175 ~~~~~~ 1180 (1192)
                      |++|-.
T Consensus       389 vilWr~  394 (775)
T KOG0319|consen  389 VILWRL  394 (775)
T ss_pred             EEEEEe
Confidence            999843


No 71 
>KOG0278 consensus Serine/threonine kinase receptor-associated protein [Lipid transport and metabolism]
Probab=37.68  E-value=5.3e+02  Score=28.88  Aligned_cols=117  Identities=15%  Similarity=0.289  Sum_probs=69.4

Q ss_pred             ceEEEecCCCccCeEEEecCCCEEEEEEeecCccccccccccccccccccccccCCCCccccccCcceeeeEEEEEcCCC
Q 001003          994 PVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYTVEEYEVRILEPDR 1073 (1192)
Q Consensus       994 ~vrk~ipL~~tp~~Iay~~~~~~y~v~~s~~~~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~~~p~~~~~sv~Lldp~~ 1073 (1192)
                      -+++ +-+...+..+-||++.+.+.+                                         ...++|+..|+. 
T Consensus       177 ~v~s-L~~~s~VtSlEvs~dG~ilTi-----------------------------------------a~gssV~Fwdak-  213 (334)
T KOG0278|consen  177 EVQS-LEFNSPVTSLEVSQDGRILTI-----------------------------------------AYGSSVKFWDAK-  213 (334)
T ss_pred             EEEE-EecCCCCcceeeccCCCEEEE-----------------------------------------ecCceeEEeccc-
Confidence            5666 777777777777776554433                                         134578899986 


Q ss_pred             CCCCceeeeeEECCCCcceEEEEEEEeccccCCCCceEEEEEeccccCcccccCceEEEEEeeeCCCCCceeEeecccCc
Q 001003         1074 AGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNADNPQNLVLSGSYGP 1153 (1192)
Q Consensus      1074 ~~~twe~id~~el~~~E~v~sv~~v~L~s~~t~~~~~ylaVGTa~~~gEd~~~rGRIlvfev~~~~~~~e~~~~~~~~~~ 1153 (1192)
                         +|+.+-+|+++-|     |.+..|+     ..++++|-|     |||    +.+|.||.+...+. + .+.++.+++
T Consensus       214 ---sf~~lKs~k~P~n-----V~SASL~-----P~k~~fVaG-----ged----~~~~kfDy~TgeEi-~-~~nkgh~gp  269 (334)
T KOG0278|consen  214 ---SFGLLKSYKMPCN-----VESASLH-----PKKEFFVAG-----GED----FKVYKFDYNTGEEI-G-SYNKGHFGP  269 (334)
T ss_pred             ---cccceeeccCccc-----ccccccc-----CCCceEEec-----Ccc----eEEEEEeccCCcee-e-ecccCCCCc
Confidence               8899999888753     3455554     346788877     566    56788887652211 1 134566666


Q ss_pred             ccccchhcccCceEEEeecc-eEEee
Q 001003         1154 LFSSVQIDFASHFFAICSNS-FVFVF 1178 (1192)
Q Consensus      1154 ~~~~~~~~~~~~~~a~~~~~-~~~~~ 1178 (1192)
                      +-+ |.----|-+-|+=+-- .++||
T Consensus       270 Vhc-VrFSPdGE~yAsGSEDGTirlW  294 (334)
T KOG0278|consen  270 VHC-VRFSPDGELYASGSEDGTIRLW  294 (334)
T ss_pred             eEE-EEECCCCceeeccCCCceEEEE
Confidence            655 2211223333332211 66776


No 72 
>KOG0293 consensus WD40 repeat-containing protein [Function unknown]
Probab=37.67  E-value=7.9e+02  Score=29.46  Aligned_cols=148  Identities=16%  Similarity=0.204  Sum_probs=83.7

Q ss_pred             EEEEeecCCCCCccEEEEEecCCcEEEEEEeeecCCCCCCCCCCCCccccccccccccccccceeeEEecCCCccCCCCC
Q 001003          826 ELAMQRWSAHHSRPFLFAILTDGTILCYQAYLFEGPENTSKSDDPVSTSRSLSVSNVSASRLRNLRFSRTPLDAYTREET  905 (1192)
Q Consensus       826 eil~~~~g~~~~~p~Llv~l~dG~l~~Y~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~lrF~kv~~~~~~~~~~  905 (1192)
                      ||=+..|.  +..-||--+..|-+.+++++..                          +.++ ++.+.-+.|        
T Consensus       226 EVWfl~FS--~nGkyLAsaSkD~Taiiw~v~~--------------------------d~~~-kl~~tlvgh--------  268 (519)
T KOG0293|consen  226 EVWFLQFS--HNGKYLASASKDSTAIIWIVVY--------------------------DVHF-KLKKTLVGH--------  268 (519)
T ss_pred             cEEEEEEc--CCCeeEeeccCCceEEEEEEec--------------------------Ccce-eeeeeeecc--------
Confidence            46666775  5677998888999999998852                          1221 222221111        


Q ss_pred             CCCCCccceEEeeccCCceEEEEcCCCCeEEEE--eCCceE-EEecCCCCceeEEecccCCCCCCcEEEEEecCeEEEEE
Q 001003          906 PHGAPCQRITIFKNISGHQGFFLSGSRPCWCMV--FRERLR-VHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQ  982 (1192)
Q Consensus       906 ~~~~g~~~l~~f~~i~G~~gVF~~G~rP~wi~~--~~g~l~-~~p~~~~~~v~~~t~F~~~~c~~Gfi~~~~~~~LrI~~  982 (1192)
                           ..++....=.-..+-+.+||.--...+.  .-|.++ .+|- +    ..|++=.-.=||+||=+++..-.=.|+.
T Consensus       269 -----~~~V~yi~wSPDdryLlaCg~~e~~~lwDv~tgd~~~~y~~-~----~~~S~~sc~W~pDg~~~V~Gs~dr~i~~  338 (519)
T KOG0293|consen  269 -----SQPVSYIMWSPDDRYLLACGFDEVLSLWDVDTGDLRHLYPS-G----LGFSVSSCAWCPDGFRFVTGSPDRTIIM  338 (519)
T ss_pred             -----cCceEEEEECCCCCeEEecCchHheeeccCCcchhhhhccc-C----cCCCcceeEEccCCceeEecCCCCcEEE
Confidence                 2233322211112458888855544444  234333 3342 1    2233322333799988887754556666


Q ss_pred             cCCCCccCCccceEEEecCCCccCeEEEecCCCEEEEEEeec
Q 001003          983 LPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVP 1024 (1192)
Q Consensus       983 l~~~~~~d~~~~vrk~ipL~~tp~~Iay~~~~~~y~v~~s~~ 1024 (1192)
                      ++-..+.-..|--.+ +|   -.+-+|..++.+..++++..+
T Consensus       339 wdlDgn~~~~W~gvr-~~---~v~dlait~Dgk~vl~v~~d~  376 (519)
T KOG0293|consen  339 WDLDGNILGNWEGVR-DP---KVHDLAITYDGKYVLLVTVDK  376 (519)
T ss_pred             ecCCcchhhcccccc-cc---eeEEEEEcCCCcEEEEEeccc
Confidence            666666556787776 53   356777777788777777653


No 73 
>KOG0295 consensus WD40 repeat-containing protein [Function unknown]
Probab=36.71  E-value=5.2e+02  Score=30.42  Aligned_cols=62  Identities=18%  Similarity=0.220  Sum_probs=45.8

Q ss_pred             CcEEEEEEecCCeEEEEECCCceeeEEeeccccccceecccccccccccccccccCCCccCCCCCcccccccccEEEEEE
Q 001003          750 GDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAM  829 (1192)
Q Consensus       750 ~~~~l~v~~~~g~l~I~sLP~~~~v~~~~~l~~~~~~l~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~eil~  829 (1192)
                      +..+|+.+-.+++++||.++...++|+..+..                                        .=|.++++
T Consensus       303 ~~~~l~s~SrDktIk~wdv~tg~cL~tL~ghd----------------------------------------nwVr~~af  342 (406)
T KOG0295|consen  303 GGQVLGSGSRDKTIKIWDVSTGMCLFTLVGHD----------------------------------------NWVRGVAF  342 (406)
T ss_pred             CccEEEeecccceEEEEeccCCeEEEEEeccc----------------------------------------ceeeeeEE
Confidence            34689998889999999999988887775321                                        12444554


Q ss_pred             eecCCCCCccEEEEEecCCcEEEEEEe
Q 001003          830 QRWSAHHSRPFLFAILTDGTILCYQAY  856 (1192)
Q Consensus       830 ~~~g~~~~~p~Llv~l~dG~l~~Y~~~  856 (1192)
                      .+     ..-||+--..|+.|-+|.+.
T Consensus       343 ~p-----~Gkyi~ScaDDktlrvwdl~  364 (406)
T KOG0295|consen  343 SP-----GGKYILSCADDKTLRVWDLK  364 (406)
T ss_pred             cC-----CCeEEEEEecCCcEEEEEec
Confidence            32     34688888899999999874


No 74 
>KOG0293 consensus WD40 repeat-containing protein [Function unknown]
Probab=36.68  E-value=4.4e+02  Score=31.45  Aligned_cols=102  Identities=7%  Similarity=0.124  Sum_probs=56.5

Q ss_pred             eEEEEEeccce-EEEEecCceeeeecccCccccCCcEEEEeeCCCC--EEEEEecCcEEEEeCCcceeeeecCCCCCCCC
Q 001003          577 AYLIISLEART-MVLETADLLTEVTESVDYFVQGRTIAAGNLFGRR--RVIQVFERGARILDGSYMTQDLSFGPSNSESG  653 (1192)
Q Consensus       577 ~yLvlS~~~~T-~Vl~~g~~~eEv~~~~gF~~~~~TI~ag~l~~~~--~IvQVt~~~vrli~~~~~~q~i~~~~~~~e~~  653 (1192)
                      +|+++.+.+.- .++... .-.   + .|.+..+.-|.--.+.+++  .+|-+-+.++++.|-........++      |
T Consensus       367 k~vl~v~~d~~i~l~~~e-~~~---d-r~lise~~~its~~iS~d~k~~LvnL~~qei~LWDl~e~~lv~kY~------G  435 (519)
T KOG0293|consen  367 KYVLLVTVDKKIRLYNRE-ARV---D-RGLISEEQPITSFSISKDGKLALVNLQDQEIHLWDLEENKLVRKYF------G  435 (519)
T ss_pred             cEEEEEecccceeeechh-hhh---h-hccccccCceeEEEEcCCCcEEEEEcccCeeEEeecchhhHHHHhh------c
Confidence            67777765532 333221 111   1 2445555555555664444  3555667788887754322111121      2


Q ss_pred             CCCCCCcEEEEEE---cCCEEEEEEeCCcEEEEEecCCCc
Q 001003          654 SGSENSTVLSVSI---ADPYVLLGMSDGSIRLLVGDPSTC  690 (1192)
Q Consensus       654 ~~~~~~~Iv~asi---~dpyvlv~~~dg~i~~l~~d~~~~  690 (1192)
                      .- .+.-|+...+   ++.||+=+.+|+.|.+|.......
T Consensus       436 hk-q~~fiIrSCFgg~~~~fiaSGSED~kvyIWhr~sgkl  474 (519)
T KOG0293|consen  436 HK-QGHFIIRSCFGGGNDKFIASGSEDSKVYIWHRISGKL  474 (519)
T ss_pred             cc-ccceEEEeccCCCCcceEEecCCCceEEEEEccCCce
Confidence            11 2334555444   468999999999999998766544


No 75 
>KOG0283 consensus WD40 repeat-containing protein [Function unknown]
Probab=36.54  E-value=2e+02  Score=36.72  Aligned_cols=77  Identities=19%  Similarity=0.246  Sum_probs=55.7

Q ss_pred             CcEEEEEEc---CCEEEEEEeCCcEEEEEecCCCceEeeccccccccCCCceeEEEeeccCCCCCcccccccccccccCc
Q 001003          659 STVLSVSIA---DPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGV  735 (1192)
Q Consensus       659 ~~Iv~asi~---dpyvlv~~~dg~i~~l~~d~~~~~l~~~~~~~l~~~~~~i~~~~l~~d~~~~~~~~~~~~~~~~~~~~  735 (1192)
                      .-++|++++   |.|.+=++-||.|++|.+..... ....+.      ..-|+|+|+..|                    
T Consensus       410 dfVTcVaFnPvDDryFiSGSLD~KvRiWsI~d~~V-v~W~Dl------~~lITAvcy~Pd--------------------  462 (712)
T KOG0283|consen  410 DFVTCVAFNPVDDRYFISGSLDGKVRLWSISDKKV-VDWNDL------RDLITAVCYSPD--------------------  462 (712)
T ss_pred             CeeEEEEecccCCCcEeecccccceEEeecCcCee-Eeehhh------hhhheeEEeccC--------------------
Confidence            348888884   88999999999999999754322 222222      334788888633                    


Q ss_pred             cccccCCCCCCCCCCcEEEEEEecCCeEEEEECCCceeeEEe
Q 001003          736 GEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTV  777 (1192)
Q Consensus       736 ~~~~~~~~~~~~~~~~~~l~v~~~~g~l~I~sLP~~~~v~~~  777 (1192)
                                     ..+.+|++=+|..++|..-+++++.+.
T Consensus       463 ---------------Gk~avIGt~~G~C~fY~t~~lk~~~~~  489 (712)
T KOG0283|consen  463 ---------------GKGAVIGTFNGYCRFYDTEGLKLVSDF  489 (712)
T ss_pred             ---------------CceEEEEEeccEEEEEEccCCeEEEee
Confidence                           126778888999999999888876554


No 76 
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=36.45  E-value=7.7e+02  Score=28.95  Aligned_cols=68  Identities=16%  Similarity=0.154  Sum_probs=47.9

Q ss_pred             ccEEEEE-eC--CCeEEEEEEeCCCCCEEEEeeeeeeccccccccCCcccccCCCeEEECCCCcEEEEEEc-CceEEEEe
Q 001003          130 RDSIILA-FE--DAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFARGPLVKVDPQGRCGGVLVY-GLQMIILK  205 (1192)
Q Consensus       130 ~D~Llv~-~~--~aklsil~~d~~~~~l~t~Slh~~E~~~~~~~~~G~~~~~~~~~l~VDP~~Rc~~l~~y-~~~L~ilP  205 (1192)
                      ...|.+. ..  .+.++..+||++.+.|.-   -+  ..    .-.|    .++.++.+|++||..+..-| .+.+.+.|
T Consensus        51 ~~~LY~v~~~~~~ggvaay~iD~~~G~Lt~---ln--~~----~~~g----~~p~yvsvd~~g~~vf~AnY~~g~v~v~p  117 (346)
T COG2706          51 QRHLYVVNEPGEEGGVAAYRIDPDDGRLTF---LN--RQ----TLPG----SPPCYVSVDEDGRFVFVANYHSGSVSVYP  117 (346)
T ss_pred             CCEEEEEEecCCcCcEEEEEEcCCCCeEEE---ee--cc----ccCC----CCCeEEEECCCCCEEEEEEccCceEEEEE
Confidence            4445443 33  699999999998887743   22  11    1122    23489999999999999988 56899999


Q ss_pred             CccCC
Q 001003          206 ASQGG  210 (1192)
Q Consensus       206 ~~~~~  210 (1192)
                      ++.++
T Consensus       118 ~~~dG  122 (346)
T COG2706         118 LQADG  122 (346)
T ss_pred             cccCC
Confidence            97654


No 77 
>PF12341 DUF3639:  Protein of unknown function (DUF3639) ;  InterPro: IPR022100  This domain family is found in eukaryotes, and is approximately 30 amino acids in length. The family is found in association with PF00400 from PFAM. There are two completely conserved residues (E and R) that may be functionally important. 
Probab=36.06  E-value=74  Score=22.93  Aligned_cols=24  Identities=21%  Similarity=0.517  Sum_probs=22.0

Q ss_pred             cEEEEEEcCCEEEEEEeCCcEEEE
Q 001003          660 TVLSVSIADPYVLLGMSDGSIRLL  683 (1192)
Q Consensus       660 ~Iv~asi~dpyvlv~~~dg~i~~l  683 (1192)
                      +|.+.+..+.+++++.+-+-+++|
T Consensus         3 ~i~aia~g~~~vavaTS~~~lRif   26 (27)
T PF12341_consen    3 EIEAIAAGDSWVAVATSAGYLRIF   26 (27)
T ss_pred             eEEEEEccCCEEEEEeCCCeEEec
Confidence            489999999999999999998887


No 78 
>KOG0647 consensus mRNA export protein (contains WD40 repeats) [RNA processing and modification]
Probab=35.49  E-value=7.4e+02  Score=28.52  Aligned_cols=93  Identities=10%  Similarity=0.021  Sum_probs=55.8

Q ss_pred             EEEEcCCCCeEEEE-eCCc----eEEEecCCCCceeEEecccCCCCCCcEEEEEecCeEEEEEcCCCCccCCccceEE--
Q 001003          925 GFFLSGSRPCWCMV-FRER----LRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQK--  997 (1192)
Q Consensus       925 gVF~~G~rP~wi~~-~~g~----l~~~p~~~~~~v~~~t~F~~~~c~~Gfi~~~~~~~LrI~~l~~~~~~d~~~~vrk--  997 (1192)
                      .|.++++|-..+|- .++.    ..-.|+ . -.+.|++-|..   .+||++-.-+|-.-|-.|++-.. -..+..|-  
T Consensus       169 ~vVata~r~i~vynL~n~~te~k~~~SpL-k-~Q~R~va~f~d---~~~~alGsiEGrv~iq~id~~~~-~~nFtFkCHR  242 (347)
T KOG0647|consen  169 AVVATAERHIAVYNLENPPTEFKRIESPL-K-WQTRCVACFQD---KDGFALGSIEGRVAIQYIDDPNP-KDNFTFKCHR  242 (347)
T ss_pred             eEEEecCCcEEEEEcCCCcchhhhhcCcc-c-ceeeEEEEEec---CCceEeeeecceEEEEecCCCCc-cCceeEEEec
Confidence            67788888844443 2211    112343 2 35778888776   45788878889999999987411 11223331  


Q ss_pred             ----EecCCCccCeEEEecCCCEEEEEEee
Q 001003          998 ----VIPLKATPHQITYFAEKNLYPLIVSV 1023 (1192)
Q Consensus       998 ----~ipL~~tp~~Iay~~~~~~y~v~~s~ 1023 (1192)
                          +-+.=+.+..|++||..++++-+-+.
T Consensus       243 ~~~~~~~~VYaVNsi~FhP~hgtlvTaGsD  272 (347)
T KOG0647|consen  243 STNSVNDDVYAVNSIAFHPVHGTLVTAGSD  272 (347)
T ss_pred             cCCCCCCceEEecceEeecccceEEEecCC
Confidence                01112346778999988888877654


No 79 
>KOG0772 consensus Uncharacterized conserved protein, contains WD40 repeat [Function unknown]
Probab=33.60  E-value=2.3e+02  Score=34.55  Aligned_cols=161  Identities=17%  Similarity=0.215  Sum_probs=83.9

Q ss_pred             CcceEEEEEeccceEEEEe-cCceeeeecccCcc-----ccC--CcEEEEeeCCCC---EEEEEecCcEEEEeCCc---c
Q 001003          574 EYHAYLIISLEARTMVLET-ADLLTEVTESVDYF-----VQG--RTIAAGNLFGRR---RVIQVFERGARILDGSY---M  639 (1192)
Q Consensus       574 ~~~~yLvlS~~~~T~Vl~~-g~~~eEv~~~~gF~-----~~~--~TI~ag~l~~~~---~IvQVt~~~vrli~~~~---~  639 (1192)
                      .-+.+|++|-...-.||-- |-++.|...---|+     |.+  .+|.||...-.+   .+---....+|+.+.+.   +
T Consensus       225 Tg~~iLvvsg~aqakl~DRdG~~~~e~~KGDQYI~Dm~nTKGHia~lt~g~whP~~k~~FlT~s~DgtlRiWdv~~~k~q  304 (641)
T KOG0772|consen  225 TGDQILVVSGSAQAKLLDRDGFEIVEFSKGDQYIRDMYNTKGHIAELTCGCWHPDNKEEFLTCSYDGTLRIWDVNNTKSQ  304 (641)
T ss_pred             CCCeEEEEecCcceeEEccCCceeeeeeccchhhhhhhccCCceeeeeccccccCcccceEEecCCCcEEEEecCCchhh
Confidence            3457899998888888864 55565553211122     222  355666553211   12222234577777553   2


Q ss_pred             eeeeecCCCCCCCCCCCCCCcEEEEEE----cCCEEEEEEeCCcEEEEEecCCCc--eEeeccccccccCCCceeEEEee
Q 001003          640 TQDLSFGPSNSESGSGSENSTVLSVSI----ADPYVLLGMSDGSIRLLVGDPSTC--TVSVQTPAAIESSKKPVSSCTLY  713 (1192)
Q Consensus       640 ~q~i~~~~~~~e~~~~~~~~~Iv~asi----~dpyvlv~~~dg~i~~l~~d~~~~--~l~~~~~~~l~~~~~~i~~~~l~  713 (1192)
                      .|-|.-       .. ..+.+|...++    ..+.++-+|.||+|.+|..-....  ...+.+  +- .....|+|+.+-
T Consensus       305 ~qVik~-------k~-~~g~Rv~~tsC~~nrdg~~iAagc~DGSIQ~W~~~~~~v~p~~~vk~--AH-~~g~~Itsi~FS  373 (641)
T KOG0772|consen  305 LQVIKT-------KP-AGGKRVPVTSCAWNRDGKLIAAGCLDGSIQIWDKGSRTVRPVMKVKD--AH-LPGQDITSISFS  373 (641)
T ss_pred             eeEEee-------cc-CCCcccCceeeecCCCcchhhhcccCCceeeeecCCcccccceEeee--cc-CCCCceeEEEec
Confidence            232221       11 12344444444    267888999999999998532111  111111  11 124578888663


Q ss_pred             ccCCCCCcccccccccccccCccccccCCCCCCCCCCcEEEEEEecCCeEEEEECCCcee-eEEeecc
Q 001003          714 HDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNC-VFTVDKF  780 (1192)
Q Consensus       714 ~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~v~~~~g~l~I~sLP~~~~-v~~~~~l  780 (1192)
                      .|-                                  . +|+---.+++|.+|+|-.++. ++.-.+|
T Consensus       374 ~dg----------------------------------~-~LlSRg~D~tLKvWDLrq~kkpL~~~tgL  406 (641)
T KOG0772|consen  374 YDG----------------------------------N-YLLSRGFDDTLKVWDLRQFKKPLNVRTGL  406 (641)
T ss_pred             ccc----------------------------------c-hhhhccCCCceeeeeccccccchhhhcCC
Confidence            221                                  1 222222478999999998665 4444433


No 80 
>KOG4649 consensus PQQ (pyrrolo-quinoline quinone) repeat protein [Secondary metabolites biosynthesis, transport and catabolism]
Probab=32.95  E-value=4.5e+02  Score=29.60  Aligned_cols=73  Identities=18%  Similarity=0.263  Sum_probs=47.0

Q ss_pred             eeEeeceeeEEeeCcEEEEEcCCCCEEEEEEEECCceEeeEEEEecCCCccccceEEecCCeEEEEeeeCCeEEEEEee
Q 001003          361 SVELDAAHATWLQNDVALLSTKTGDLVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC  439 (1192)
Q Consensus       361 ~l~l~~~~~~~~~~~~~Ll~~~~G~L~~l~l~~dg~~V~~l~l~~~~~~~~~s~l~~l~~g~lFvGS~~GDS~Ll~~~~  439 (1192)
                      ..+++++. .. -.++++|+-.+|-||.|.+.. |.....+.+  .+ .+-.+..+-.+.|+++.||+.|+-+.+.+..
T Consensus        52 g~RiE~sa-~v-vgdfVV~GCy~g~lYfl~~~t-Gs~~w~f~~--~~-~vk~~a~~d~~~glIycgshd~~~yalD~~~  124 (354)
T KOG4649|consen   52 GVRIECSA-IV-VGDFVVLGCYSGGLYFLCVKT-GSQIWNFVI--LE-TVKVRAQCDFDGGLIYCGSHDGNFYALDPKT  124 (354)
T ss_pred             Cceeeeee-EE-ECCEEEEEEccCcEEEEEecc-hhheeeeee--hh-hhccceEEcCCCceEEEecCCCcEEEecccc
Confidence            45666643 33 355699999999999998864 433332222  11 2234455667899999999988866655443


No 81 
>KOG4378 consensus Nuclear protein COP1 [Signal transduction mechanisms]
Probab=32.85  E-value=6.2e+02  Score=30.95  Aligned_cols=82  Identities=17%  Similarity=0.229  Sum_probs=49.1

Q ss_pred             CcEEEEEE--cCCEEEEEEeCCcEEEEEecCCCceEeeccccccccCCCceeEEEeeccCCCCCcccccccccccccCcc
Q 001003          659 STVLSVSI--ADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVG  736 (1192)
Q Consensus       659 ~~Iv~asi--~dpyvlv~~~dg~i~~l~~d~~~~~l~~~~~~~l~~~~~~i~~~~l~~d~~~~~~~~~~~~~~~~~~~~~  736 (1192)
                      ..|+++.-  .|-|++-...+|.|.+........      ...+.....+..  -+. +-+                   
T Consensus       122 stvt~v~YN~~DeyiAsvs~gGdiiih~~~t~~~------tt~f~~~sgqsv--Rll-~ys-------------------  173 (673)
T KOG4378|consen  122 STVTYVDYNNTDEYIASVSDGGDIIIHGTKTKQK------TTTFTIDSGQSV--RLL-RYS-------------------  173 (673)
T ss_pred             ceeEEEEecCCcceeEEeccCCcEEEEecccCcc------ccceecCCCCeE--EEe-ecc-------------------
Confidence            45777776  588999999999999887643311      111211111111  111 111                   


Q ss_pred             ccccCCCCCCCCCCcEEEEEEecCCeEEEEECCCceeeEEeec
Q 001003          737 EAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDK  779 (1192)
Q Consensus       737 ~~~~~~~~~~~~~~~~~l~v~~~~g~l~I~sLP~~~~v~~~~~  779 (1192)
                                 ....+.+.+.-++|.+.+|....+.+.|....
T Consensus       174 -----------~skr~lL~~asd~G~VtlwDv~g~sp~~~~~~  205 (673)
T KOG4378|consen  174 -----------PSKRFLLSIASDKGAVTLWDVQGMSPIFHASE  205 (673)
T ss_pred             -----------cccceeeEeeccCCeEEEEeccCCCcccchhh
Confidence                       11346777788899999999887777665543


No 82 
>PLN00181 protein SPA1-RELATED; Provisional
Probab=32.65  E-value=1.3e+03  Score=30.41  Aligned_cols=73  Identities=15%  Similarity=0.216  Sum_probs=45.9

Q ss_pred             CCcEEEEEEc--C-CEEEEEEeCCcEEEEEecCCCceEeeccccccccCCCceeEEEeeccCCCCCcccccccccccccC
Q 001003          658 NSTVLSVSIA--D-PYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTG  734 (1192)
Q Consensus       658 ~~~Iv~asi~--d-pyvlv~~~dg~i~~l~~d~~~~~l~~~~~~~l~~~~~~i~~~~l~~d~~~~~~~~~~~~~~~~~~~  734 (1192)
                      ...|.++++.  + .+++.+..||+|.+|.+......-.+.       ....+.++++..+                   
T Consensus       575 ~~~V~~l~~~p~~~~~L~Sgs~Dg~v~iWd~~~~~~~~~~~-------~~~~v~~v~~~~~-------------------  628 (793)
T PLN00181        575 EKRVWSIDYSSADPTLLASGSDDGSVKLWSINQGVSIGTIK-------TKANICCVQFPSE-------------------  628 (793)
T ss_pred             CCCEEEEEEcCCCCCEEEEEcCCCEEEEEECCCCcEEEEEe-------cCCCeEEEEEeCC-------------------
Confidence            3458888885  3 477778889999999886543321211       1223444433110                   


Q ss_pred             ccccccCCCCCCCCCCcEEEEEEecCCeEEEEECCCc
Q 001003          735 VGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNF  771 (1192)
Q Consensus       735 ~~~~~~~~~~~~~~~~~~~l~v~~~~g~l~I~sLP~~  771 (1192)
                                     ...+++.+..+|.+.||.+.+.
T Consensus       629 ---------------~g~~latgs~dg~I~iwD~~~~  650 (793)
T PLN00181        629 ---------------SGRSLAFGSADHKVYYYDLRNP  650 (793)
T ss_pred             ---------------CCCEEEEEeCCCeEEEEECCCC
Confidence                           1236778888999999998653


No 83 
>KOG2445 consensus Nuclear pore complex component (sc Seh1) [Nuclear structure; Intracellular trafficking, secretion, and vesicular transport]
Probab=31.79  E-value=3.5e+02  Score=31.11  Aligned_cols=23  Identities=22%  Similarity=0.491  Sum_probs=18.7

Q ss_pred             cEEEEEEecCCeEEEEECCC-cee
Q 001003          751 DIYSVVCYESGALEIFDVPN-FNC  773 (1192)
Q Consensus       751 ~~~l~v~~~~g~l~I~sLP~-~~~  773 (1192)
                      ..-++.+-.+|.|+||..|+ +++
T Consensus       126 GLklA~~~aDG~lRIYEA~dp~nL  149 (361)
T KOG2445|consen  126 GLKLAAASADGILRIYEAPDPMNL  149 (361)
T ss_pred             ceEEEEeccCcEEEEEecCCcccc
Confidence            45677788999999999998 554


No 84 
>KOG0299 consensus U3 snoRNP-associated protein (contains WD40 repeats) [RNA processing and modification]
Probab=30.86  E-value=6.7e+02  Score=30.35  Aligned_cols=32  Identities=25%  Similarity=0.227  Sum_probs=24.1

Q ss_pred             CcEEEEE-EcCCEEEEEEeCCcEEEEEecCCCc
Q 001003          659 STVLSVS-IADPYVLLGMSDGSIRLLVGDPSTC  690 (1192)
Q Consensus       659 ~~Iv~as-i~dpyvlv~~~dg~i~~l~~d~~~~  690 (1192)
                      ..|-|++ |++...+.+.+||+|.+|.+-....
T Consensus       328 ~sidcv~~In~~HfvsGSdnG~IaLWs~~KKkp  360 (479)
T KOG0299|consen  328 GSIDCVAFINDEHFVSGSDNGSIALWSLLKKKP  360 (479)
T ss_pred             CCeeeEEEecccceeeccCCceEEEeeecccCc
Confidence            3466655 5788899999999999998754433


No 85 
>KOG1407 consensus WD40 repeat protein [Function unknown]
Probab=30.85  E-value=4.5e+02  Score=29.64  Aligned_cols=40  Identities=15%  Similarity=0.379  Sum_probs=28.5

Q ss_pred             eEEEEEcCCCCccCCccceEEEecCCCccCeEEEecCCCEEEEEEee
Q 001003          977 ILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSV 1023 (1192)
Q Consensus       977 ~LrI~~l~~~~~~d~~~~vrk~ipL~~tp~~Iay~~~~~~y~v~~s~ 1023 (1192)
                      -+-|+..+...      .+-. ||-...--.||+||....++-+++.
T Consensus       254 ~IDIA~vetGd------~~~e-I~~~~~t~tVAWHPk~~LLAyA~dd  293 (313)
T KOG1407|consen  254 FIDIAEVETGD------RVWE-IPCEGPTFTVAWHPKRPLLAYACDD  293 (313)
T ss_pred             eEEeEecccCC------eEEE-eeccCCceeEEecCCCceeeEEecC
Confidence            34456666555      3455 6666666789999999999988875


No 86 
>KOG0289 consensus mRNA splicing factor [General function prediction only]
Probab=30.26  E-value=7e+02  Score=30.04  Aligned_cols=110  Identities=20%  Similarity=0.252  Sum_probs=67.0

Q ss_pred             EEEEEcCCEEEEEEeCCcEEEEEecCCCceEeeccccccccCCCceeEEEeeccCCCCCcccccccccccccCccccccC
Q 001003          662 LSVSIADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDG  741 (1192)
Q Consensus       662 v~asi~dpyvlv~~~dg~i~~l~~d~~~~~l~~~~~~~l~~~~~~i~~~~l~~d~~~~~~~~~~~~~~~~~~~~~~~~~~  741 (1192)
                      .++-.+..|++=+..||+..+...........++...    .+-+++++.+..|                          
T Consensus       309 ls~h~tgeYllsAs~d~~w~Fsd~~~g~~lt~vs~~~----s~v~~ts~~fHpD--------------------------  358 (506)
T KOG0289|consen  309 LSLHPTGEYLLSASNDGTWAFSDISSGSQLTVVSDET----SDVEYTSAAFHPD--------------------------  358 (506)
T ss_pred             eeeccCCcEEEEecCCceEEEEEccCCcEEEEEeecc----ccceeEEeeEcCC--------------------------
Confidence            3344478899999888888877666554433332210    1334565544322                          


Q ss_pred             CCCCCCCCCcEEEEEEecCCeEEEEECCCceeeEEeeccccccceecccccccccccccccccCCCccCCCCCccccccc
Q 001003          742 ADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHS  821 (1192)
Q Consensus       742 ~~~~~~~~~~~~l~v~~~~g~l~I~sLP~~~~v~~~~~l~~~~~~l~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  821 (1192)
                               ..++..+..+|.+.||.|.+-.-   +..|+   -                                  +.
T Consensus       359 ---------gLifgtgt~d~~vkiwdlks~~~---~a~Fp---g----------------------------------ht  389 (506)
T KOG0289|consen  359 ---------GLIFGTGTPDGVVKIWDLKSQTN---VAKFP---G----------------------------------HT  389 (506)
T ss_pred             ---------ceEEeccCCCceEEEEEcCCccc---cccCC---C----------------------------------CC
Confidence                     23555688999999999865331   11221   0                                  11


Q ss_pred             ccEEEEEEeecCCCCCccEEEEEecCCcEEEEEE
Q 001003          822 MKVVELAMQRWSAHHSRPFLFAILTDGTILCYQA  855 (1192)
Q Consensus       822 ~~v~eil~~~~g~~~~~p~Llv~l~dG~l~~Y~~  855 (1192)
                      -.|.+|   .|+  ++..||.+...||.+..+.+
T Consensus       390 ~~vk~i---~Fs--ENGY~Lat~add~~V~lwDL  418 (506)
T KOG0289|consen  390 GPVKAI---SFS--ENGYWLATAADDGSVKLWDL  418 (506)
T ss_pred             CceeEE---Eec--cCceEEEEEecCCeEEEEEe
Confidence            235555   555  66789999888888888776


No 87 
>PTZ00420 coronin; Provisional
Probab=29.69  E-value=1.3e+03  Score=29.38  Aligned_cols=29  Identities=21%  Similarity=0.239  Sum_probs=22.7

Q ss_pred             CcEEEEEEc---CCEEEEEEeCCcEEEEEecC
Q 001003          659 STVLSVSIA---DPYVLLGMSDGSIRLLVGDP  687 (1192)
Q Consensus       659 ~~Iv~asi~---dpyvlv~~~dg~i~~l~~d~  687 (1192)
                      ..|.+++.+   +.+++.+..||+|.+|.+..
T Consensus        75 ~~V~~lafsP~~~~lLASgS~DgtIrIWDi~t  106 (568)
T PTZ00420         75 SSILDLQFNPCFSEILASGSEDLTIRVWEIPH  106 (568)
T ss_pred             CCEEEEEEcCCCCCEEEEEeCCCeEEEEECCC
Confidence            458888775   35777788999999998764


No 88 
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=29.60  E-value=6e+02  Score=29.09  Aligned_cols=84  Identities=13%  Similarity=0.146  Sum_probs=50.8

Q ss_pred             EEEEEEEEeeeEEeEEEEEecCCCCCCCCccEEEEEeC-CCeEEEEEEeCCCCCEEEEeeeeeeccccccccCCcccccC
Q 001003          101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFE-DAKISVLEFDDSIHGLRITSMHCFESPEWLHLKRGRESFAR  179 (1192)
Q Consensus       101 L~lv~e~~l~G~I~~l~~~r~~~~~~~~~~D~Llv~~~-~aklsil~~d~~~~~l~t~Slh~~E~~~~~~~~~G~~~~~~  179 (1192)
                      |+++...+..|....|+.   ..     ..+.|+++.. +..+.+.+++ +.+.+..+.-+          ..+    ..
T Consensus        25 l~~~~~~~~~~~~~~l~~---sp-----d~~~lyv~~~~~~~i~~~~~~-~~g~l~~~~~~----------~~~----~~   81 (330)
T PRK11028         25 LTLLQVVDVPGQVQPMVI---SP-----DKRHLYVGVRPEFRVLSYRIA-DDGALTFAAES----------PLP----GS   81 (330)
T ss_pred             eeeeeEEecCCCCccEEE---CC-----CCCEEEEEECCCCcEEEEEEC-CCCceEEeeee----------cCC----CC
Confidence            777777766666655532   21     3578887754 5666666665 34445321111          111    12


Q ss_pred             CCeEEECCCCcEEEEEEc-CceEEEEeCc
Q 001003          180 GPLVKVDPQGRCGGVLVY-GLQMIILKAS  207 (1192)
Q Consensus       180 ~~~l~VDP~~Rc~~l~~y-~~~L~ilP~~  207 (1192)
                      +..+..||+||.+.+.-| .+.+.++.+.
T Consensus        82 p~~i~~~~~g~~l~v~~~~~~~v~v~~~~  110 (330)
T PRK11028         82 PTHISTDHQGRFLFSASYNANCVSVSPLD  110 (330)
T ss_pred             ceEEEECCCCCEEEEEEcCCCeEEEEEEC
Confidence            357899999998887754 6788888664


No 89 
>PF00780 CNH:  CNH domain;  InterPro: IPR001180 Based on sequence similarities a domain of homology has been identified in the following proteins []:  Citron and Citron kinase. These two proteins interact with the GTP-bound forms of the small GTPases Rho and Rac but not with Cdc42. Myotonic dystrophy kinase-related Cdc42-binding kinase (MRCKalpha). This serine/threonine kinase interacts with the GTP-bound form of the small GTPase Cdc42 and to a lesser extent with that of Rac. NCK Interacting Kinase (NIK), a serine/threonine protein kinase. ROM-1 and ROM-2, from yeast. These proteins are GDP/GTP exchange proteins (GEPs) for the small GTP binding protein Rho1.  This domain, called the citron homology domain, is often found after cysteine rich and pleckstrin homology (PH) domains at the C-terminal end of the proteins []. It acts as a regulatory domain and could be involved in macromolecular interactions [, ].; GO: 0005083 small GTPase regulator activity
Probab=29.54  E-value=8.1e+02  Score=27.12  Aligned_cols=22  Identities=18%  Similarity=0.206  Sum_probs=18.1

Q ss_pred             EEEEcCCEEEEEEeCCcEEEEEe
Q 001003          663 SVSIADPYVLLGMSDGSIRLLVG  685 (1192)
Q Consensus       663 ~asi~dpyvlv~~~dg~i~~l~~  685 (1192)
                      |+...+..++|++++| |.++..
T Consensus         2 c~~~~~~~L~vGt~~G-l~~~~~   23 (275)
T PF00780_consen    2 CADSWGDRLLVGTEDG-LYVYDL   23 (275)
T ss_pred             CcccCCCEEEEEECCC-EEEEEe
Confidence            5666788999999998 777777


No 90 
>KOG0645 consensus WD40 repeat protein [General function prediction only]
Probab=28.92  E-value=6.5e+02  Score=28.54  Aligned_cols=91  Identities=21%  Similarity=0.336  Sum_probs=47.3

Q ss_pred             EEEEEcCCCCCCCceeeeeEECCCCcceEEEEEEEeccccCCCCceEEEEEe-----ccccCcccccCceEEEEEeeeCC
Q 001003         1065 EVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGT-----AYVQGEDVAARGRVLLFSTGRNA 1139 (1192)
Q Consensus      1065 sv~Lldp~~~~~twe~id~~el~~~E~v~sv~~v~L~s~~t~~~~~ylaVGT-----a~~~gEd~~~rGRIlvfev~~~~ 1139 (1192)
                      .||+.+..+ +.+|...+.  |++ -|=.+|++|...   ..+  .|||.|.     ++..-||    |-   ||.+..-
T Consensus        38 ~vriw~~~~-~~s~~ck~v--ld~-~hkrsVRsvAws---p~g--~~La~aSFD~t~~Iw~k~~----~e---fecv~~l  101 (312)
T KOG0645|consen   38 AVRIWSTSS-GDSWTCKTV--LDD-GHKRSVRSVAWS---PHG--RYLASASFDATVVIWKKED----GE---FECVATL  101 (312)
T ss_pred             eEEEEecCC-CCcEEEEEe--ccc-cchheeeeeeec---CCC--cEEEEeeccceEEEeecCC----Cc---eeEEeee
Confidence            466666543 346776533  222 244566776654   222  3887762     1111121    11   3332211


Q ss_pred             CCCceeEeecccCcccccchhcccCceEEEeecc-eEEeeeh
Q 001003         1140 DNPQNLVLSGSYGPLFSSVQIDFASHFFAICSNS-FVFVFLF 1180 (1192)
Q Consensus      1140 ~~~e~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~-~~~~~~~ 1180 (1192)
                         | =|+.+++.-+++     -.|.+||+|+-- +|.||+-
T Consensus       102 ---E-GHEnEVK~Vaws-----~sG~~LATCSRDKSVWiWe~  134 (312)
T KOG0645|consen  102 ---E-GHENEVKCVAWS-----ASGNYLATCSRDKSVWIWEI  134 (312)
T ss_pred             ---e-ccccceeEEEEc-----CCCCEEEEeeCCCeEEEEEe
Confidence               1 233334444555     679999999987 7888874


No 91 
>KOG0292 consensus Vesicle coat complex COPI, alpha subunit [Intracellular trafficking, secretion, and vesicular transport]
Probab=27.98  E-value=1.5e+03  Score=29.91  Aligned_cols=97  Identities=13%  Similarity=0.235  Sum_probs=53.2

Q ss_pred             CccceEEeeccCCceEEEEcCCCCeE-EEEe--CCceEEEecCCCCceeEEecccCCCCCCcEEEEEecCeEEEEEcCCC
Q 001003          910 PCQRITIFKNISGHQGFFLSGSRPCW-CMVF--RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSG  986 (1192)
Q Consensus       910 g~~~l~~f~~i~G~~gVF~~G~rP~w-i~~~--~g~l~~~p~~~~~~v~~~t~F~~~~c~~Gfi~~~~~~~LrI~~l~~~  986 (1192)
                      .+..+..|+.-++.  .++...||.. |+..  .+.+...-+.-+.|+.+..       .+++.|+..+ -+|-..+...
T Consensus       281 kRt~v~tfrrendR--FW~laahP~lNLfAAgHDsGm~VFkleRErpa~~v~-------~n~LfYvkd~-~i~~~d~~t~  350 (1202)
T KOG0292|consen  281 KRTSVQTFRRENDR--FWILAAHPELNLFAAGHDSGMIVFKLERERPAYAVN-------GNGLFYVKDR-FIRSYDLRTQ  350 (1202)
T ss_pred             cccceeeeeccCCe--EEEEEecCCcceeeeecCCceEEEEEcccCceEEEc-------CCEEEEEccc-eEEeeecccc
Confidence            34567778754442  4444455554 2332  2333333433344454442       4677888754 7776666653


Q ss_pred             CccCCccceEEEecC----CCccCeEEEecCCCEEEEEE
Q 001003          987 STYDNYWPVQKVIPL----KATPHQITYFAEKNLYPLIV 1021 (1192)
Q Consensus       987 ~~~d~~~~vrk~ipL----~~tp~~Iay~~~~~~y~v~~ 1021 (1192)
                      .    ..++-+ +.-    ...|+.+.|.|..++..+.+
T Consensus       351 ~----d~~v~~-lr~~g~~~~~~~smsYNpae~~vlics  384 (1202)
T KOG0292|consen  351 K----DTAVAS-LRRPGTLWQPPRSLSYNPAENAVLICS  384 (1202)
T ss_pred             c----cceeEe-ccCCCcccCCcceeeeccccCeEEEEe
Confidence            3    234433 222    24689999999888766644


No 92 
>PF12894 Apc4_WD40:  Anaphase-promoting complex subunit 4 WD40 domain
Probab=27.87  E-value=95  Score=25.33  Aligned_cols=40  Identities=20%  Similarity=0.292  Sum_probs=30.0

Q ss_pred             EEEEEEEEeeeEEeEEEEEecCCCCCCCCccEEEEEeCCCeEEEEEEe
Q 001003          101 LELVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFD  148 (1192)
Q Consensus       101 L~lv~e~~l~G~I~~l~~~r~~~~~~~~~~D~Llv~~~~aklsil~~d  148 (1192)
                      ++++.+..+-..|..+. .-+       ..|.|.++++++.+.+-+.|
T Consensus         2 f~~~~~k~l~~~v~~~~-w~P-------~mdLiA~~t~~g~v~v~Rl~   41 (47)
T PF12894_consen    2 FRQLGEKNLPSRVSCMS-WCP-------TMDLIALGTEDGEVLVYRLN   41 (47)
T ss_pred             cceecccCCCCcEEEEE-ECC-------CCCEEEEEECCCeEEEEECC
Confidence            56677777777766333 332       58999999999999998875


No 93 
>PF06977 SdiA-regulated:  SdiA-regulated;  InterPro: IPR009722 This entry represents a conserved region approximately 100 residues long within a number of hypothetical bacterial proteins that may be regulated by SdiA, a member of the LuxR family of transcriptional regulators []. Some proteins contain the IPR001258 from INTERPRO repeat.; PDB: 3QQZ_A.
Probab=27.52  E-value=96  Score=34.76  Aligned_cols=60  Identities=23%  Similarity=0.321  Sum_probs=36.9

Q ss_pred             cEEEEEcCCCCEEEEEEEECCceEeeEEEEec-----CCCccccceEEecCCeEEEEeeeCCeEEEEEe
Q 001003          375 DVALLSTKTGDLVLLTVVYDGRVVQRLDLSKT-----NPSVLTSDITTIGNSLFFLGSRLGDSLLVQFT  438 (1192)
Q Consensus       375 ~~~Ll~~~~G~L~~l~l~~dg~~V~~l~l~~~-----~~~~~~s~l~~l~~g~lFvGS~~GDS~Ll~~~  438 (1192)
                      +.++|+++...|..+.  .+|+-++.+.|..-     ...+.|..|+.-.+|.|||-|+ .| ++|+|.
T Consensus       184 ~lliLS~es~~l~~~d--~~G~~~~~~~L~~g~~gl~~~~~QpEGIa~d~~G~LYIvsE-pN-lfy~f~  248 (248)
T PF06977_consen  184 HLLILSDESRLLLELD--RQGRVVSSLSLDRGFHGLSKDIPQPEGIAFDPDGNLYIVSE-PN-LFYRFE  248 (248)
T ss_dssp             EEEEEETTTTEEEEE---TT--EEEEEE-STTGGG-SS---SEEEEEE-TT--EEEEET-TT-EEEEEE
T ss_pred             eEEEEECCCCeEEEEC--CCCCEEEEEEeCCcccCcccccCCccEEEECCCCCEEEEcC-Cc-eEEEeC
Confidence            4577777777775444  56776777777652     4567799999999999999998 44 788773


No 94 
>KOG0319 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=27.05  E-value=9.3e+02  Score=30.96  Aligned_cols=74  Identities=19%  Similarity=0.223  Sum_probs=43.8

Q ss_pred             EEEEEE--cCCEEEEEEeCCcEEEEEecCCCceEeeccccccccCCCceeEEEeeccCCCCCcccccccccccccCcccc
Q 001003          661 VLSVSI--ADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEA  738 (1192)
Q Consensus       661 Iv~asi--~dpyvlv~~~dg~i~~l~~d~~~~~l~~~~~~~l~~~~~~i~~~~l~~d~~~~~~~~~~~~~~~~~~~~~~~  738 (1192)
                      |.+.++  .+-+++-+..|.++++|.++++..+.. .-.++. .....|.|+++.  ..                     
T Consensus       368 vlSL~~~~~g~llat~sKD~svilWr~~~~~~~~~-~~a~~~-gH~~svgava~~--~~---------------------  422 (775)
T KOG0319|consen  368 VLSLDVWSSGDLLATGSKDKSVILWRLNNNCSKSL-CVAQAN-GHTNSVGAVAGS--KL---------------------  422 (775)
T ss_pred             eeeeeecccCcEEEEecCCceEEEEEecCCcchhh-hhhhhc-ccccccceeeec--cc---------------------
Confidence            556553  343666677899999999955433211 111111 134456666551  10                     


Q ss_pred             ccCCCCCCCCCCcEEEEEEecCCeEEEEECCC
Q 001003          739 IDGADGGPLDQGDIYSVVCYESGALEIFDVPN  770 (1192)
Q Consensus       739 ~~~~~~~~~~~~~~~l~v~~~~g~l~I~sLP~  770 (1192)
                                ..+ +++..-.+++|.+|.||.
T Consensus       423 ----------~as-ffvsvS~D~tlK~W~l~~  443 (775)
T KOG0319|consen  423 ----------GAS-FFVSVSQDCTLKLWDLPK  443 (775)
T ss_pred             ----------Ccc-EEEEecCCceEEEecCCC
Confidence                      112 666677899999999998


No 95 
>KOG0296 consensus Angio-associated migratory cell protein (contains WD40 repeats) [Function unknown]
Probab=26.59  E-value=4.8e+02  Score=30.67  Aligned_cols=56  Identities=14%  Similarity=0.267  Sum_probs=40.1

Q ss_pred             EEEEEecCCeEEEEECCCceeeEEeeccccccceecccccccccccccccccCCCccCCCCCcccccccccEEEEEEeec
Q 001003          753 YSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRKENIHSMKVVELAMQRW  832 (1192)
Q Consensus       753 ~l~v~~~~g~l~I~sLP~~~~v~~~~~l~~~~~~l~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~eil~~~~  832 (1192)
                      +++++.-+|+|.||.+....+.+.+.+                                         +.-|.++..   
T Consensus       300 L~A~G~vdG~i~iyD~a~~~~R~~c~h-----------------------------------------e~~V~~l~w---  335 (399)
T KOG0296|consen  300 LAACGSVDGTIAIYDLAASTLRHICEH-----------------------------------------EDGVTKLKW---  335 (399)
T ss_pred             hhhcccccceEEEEecccchhheeccC-----------------------------------------CCceEEEEE---
Confidence            677888899999999988776655531                                         112444543   


Q ss_pred             CCCCCccEEEEEecCCcEEEEEE
Q 001003          833 SAHHSRPFLFAILTDGTILCYQA  855 (1192)
Q Consensus       833 g~~~~~p~Llv~l~dG~l~~Y~~  855 (1192)
                         ...+||+..-.||.|-.|.+
T Consensus       336 ---~~t~~l~t~c~~g~v~~wDa  355 (399)
T KOG0296|consen  336 ---LNTDYLLTACANGKVRQWDA  355 (399)
T ss_pred             ---cCcchheeeccCceEEeeec
Confidence               23799999999998876664


No 96 
>KOG0266 consensus WD40 repeat-containing protein [General function prediction only]
Probab=25.50  E-value=1.3e+03  Score=28.15  Aligned_cols=77  Identities=16%  Similarity=0.220  Sum_probs=48.2

Q ss_pred             CcEEEEEE--cCCEEEEEEeCCcEEEEEecCCCce--EeeccccccccCCCceeEEEeeccCCCCCcccccccccccccC
Q 001003          659 STVLSVSI--ADPYVLLGMSDGSIRLLVGDPSTCT--VSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTG  734 (1192)
Q Consensus       659 ~~Iv~asi--~dpyvlv~~~dg~i~~l~~d~~~~~--l~~~~~~~l~~~~~~i~~~~l~~d~~~~~~~~~~~~~~~~~~~  734 (1192)
                      ..|+++++  .+.+++.+..||.|++|.......+  -.+....   ... .+.+++.-  .                  
T Consensus       289 ~~is~~~f~~d~~~l~s~s~d~~i~vwd~~~~~~~~~~~~~~~~---~~~-~~~~~~fs--p------------------  344 (456)
T KOG0266|consen  289 DGISGLAFSPDGNLLVSASYDGTIRVWDLETGSKLCLKLLSGAE---NSA-PVTSVQFS--P------------------  344 (456)
T ss_pred             CceEEEEECCCCCEEEEcCCCccEEEEECCCCceeeeecccCCC---CCC-ceeEEEEC--C------------------
Confidence            34777777  4668888888999999988765421  1111111   111 34444331  0                  


Q ss_pred             ccccccCCCCCCCCCCcEEEEEEecCCeEEEEECCCceee
Q 001003          735 VGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCV  774 (1192)
Q Consensus       735 ~~~~~~~~~~~~~~~~~~~l~v~~~~g~l~I~sLP~~~~v  774 (1192)
                                     ...++++...++.+.+|.+.....+
T Consensus       345 ---------------~~~~ll~~~~d~~~~~w~l~~~~~~  369 (456)
T KOG0266|consen  345 ---------------NGKYLLSASLDRTLKLWDLRSGKSV  369 (456)
T ss_pred             ---------------CCcEEEEecCCCeEEEEEccCCcce
Confidence                           1237888888999999999875554


No 97 
>TIGR02658 TTQ_MADH_Hv methylamine dehydrogenase heavy chain. This family consists of the heavy chain of methylamine dehydrogenase, a periplasmic enzyme. The enzyme contains a tryptophan tryptophylquinone (TTQ) prosthetic group derived from two Trp residues in the light subunit. The enzyme forms a complex with the type I blue copper protein amicyanin and a cytochrome. Electron transfer proceeds from TTQ to the copper and then to the heme group of the cytochrome.
Probab=25.39  E-value=1.2e+03  Score=27.66  Aligned_cols=139  Identities=10%  Similarity=0.067  Sum_probs=74.2

Q ss_pred             CCceEEEEcCCCCeEEEEeCCceEEEecCCCCcee--EEecccC------CC---C--CCcEEEEEecCeEEEEEcCCCC
Q 001003          921 SGHQGFFLSGSRPCWCMVFRERLRVHPQLCDGSIV--AFTVLHN------VN---C--NHGFIYVTSQGILKICQLPSGS  987 (1192)
Q Consensus       921 ~G~~gVF~~G~rP~wi~~~~g~l~~~p~~~~~~v~--~~t~F~~------~~---c--~~Gfi~~~~~~~LrI~~l~~~~  987 (1192)
                      .+...+|++++.-.++.+-.|++...-+-..+.+.  +.--|+.      .+   .  ....+|++.+|.+.+..+....
T Consensus       147 p~~~~vy~t~e~~~~~~~~Dg~~~~v~~d~~g~~~~~~~~vf~~~~~~v~~rP~~~~~dg~~~~vs~eG~V~~id~~~~~  226 (352)
T TIGR02658       147 PDCYHIFPTANDTFFMHCRDGSLAKVGYGTKGNPKIKPTEVFHPEDEYLINHPAYSNKSGRLVWPTYTGKIFQIDLSSGD  226 (352)
T ss_pred             CCCcEEEEecCCccEEEeecCceEEEEecCCCceEEeeeeeecCCccccccCCceEcCCCcEEEEecCCeEEEEecCCCc
Confidence            55678888888887777766666553322222211  1111332      11   1  2467788888888888764432


Q ss_pred             -ccCCccceEEEecC---CCccCe---EEEecCCCEEEEEEeecCccccccccccccccccccccccCCCCccccccCcc
Q 001003          988 -TYDNYWPVQKVIPL---KATPHQ---ITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLHRTYT 1060 (1192)
Q Consensus       988 -~~d~~~~vrk~ipL---~~tp~~---Iay~~~~~~y~v~~s~~~~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~~~p~ 1060 (1192)
                       ..-..|..-. ..-   +-.|-.   |++|++.+.+.|+....             .+..|.                 
T Consensus       227 ~~~~~~~~~~~-~~~~~~~wrP~g~q~ia~~~dg~~lyV~~~~~-------------~~~thk-----------------  275 (352)
T TIGR02658       227 AKFLPAIEAFT-EAEKADGWRPGGWQQVAYHRARDRIYLLADQR-------------AKWTHK-----------------  275 (352)
T ss_pred             ceecceeeecc-ccccccccCCCcceeEEEcCCCCEEEEEecCC-------------cccccc-----------------
Confidence             1112233211 110   001222   99999888776643221             011111                 


Q ss_pred             eeeeEEEEEcCCCCCCCceeeeeEECCCCcceEEEE
Q 001003         1061 VEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVR 1096 (1192)
Q Consensus      1061 ~~~~sv~Lldp~~~~~twe~id~~el~~~E~v~sv~ 1096 (1192)
                      .....|-++|..    +++++..+.+..  .+..|.
T Consensus       276 ~~~~~V~ViD~~----t~kvi~~i~vG~--~~~~ia  305 (352)
T TIGR02658       276 TASRFLFVVDAK----TGKRLRKIELGH--EIDSIN  305 (352)
T ss_pred             CCCCEEEEEECC----CCeEEEEEeCCC--ceeeEE
Confidence            012257799985    999999999865  444443


No 98 
>KOG0266 consensus WD40 repeat-containing protein [General function prediction only]
Probab=25.31  E-value=1.3e+03  Score=28.12  Aligned_cols=74  Identities=19%  Similarity=0.344  Sum_probs=47.8

Q ss_pred             CcEEEEEEc--CCEEEEEEeCCcEEEEEecCCCceEeeccccccccCCCceeEEEeeccCCCCCcccccccccccccCcc
Q 001003          659 STVLSVSIA--DPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVG  736 (1192)
Q Consensus       659 ~~Iv~asi~--dpyvlv~~~dg~i~~l~~d~~~~~l~~~~~~~l~~~~~~i~~~~l~~d~~~~~~~~~~~~~~~~~~~~~  736 (1192)
                      ..|.+++++  +..++-+..|++|++|......++-.+.      .....|+++++-.|                     
T Consensus       247 ~~v~~~~f~p~g~~i~Sgs~D~tvriWd~~~~~~~~~l~------~hs~~is~~~f~~d---------------------  299 (456)
T KOG0266|consen  247 TYVTSVAFSPDGNLLVSGSDDGTVRIWDVRTGECVRKLK------GHSDGISGLAFSPD---------------------  299 (456)
T ss_pred             CceEEEEecCCCCEEEEecCCCcEEEEeccCCeEEEeee------ccCCceEEEEECCC---------------------
Confidence            447777774  3577779999999999987643422222      23557777755322                     


Q ss_pred             ccccCCCCCCCCCCcEEEEEEecCCeEEEEECCCcee
Q 001003          737 EAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNC  773 (1192)
Q Consensus       737 ~~~~~~~~~~~~~~~~~l~v~~~~g~l~I~sLP~~~~  773 (1192)
                                   .. +++..-.+|.++||++-....
T Consensus       300 -------------~~-~l~s~s~d~~i~vwd~~~~~~  322 (456)
T KOG0266|consen  300 -------------GN-LLVSASYDGTIRVWDLETGSK  322 (456)
T ss_pred             -------------CC-EEEEcCCCccEEEEECCCCce
Confidence                         12 344444489999999987664


No 99 
>KOG0285 consensus Pleiotropic regulator 1 [RNA processing and modification]
Probab=25.27  E-value=5.3e+02  Score=30.26  Aligned_cols=68  Identities=25%  Similarity=0.300  Sum_probs=41.3

Q ss_pred             EEEEeCCcceeeeecCCCCCCCCCCCCCCcEEEEEEc--CCEEEEEEeCCcEEEEEecCCCceEeeccccccccCCCcee
Q 001003          631 ARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSIA--DPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVS  708 (1192)
Q Consensus       631 vrli~~~~~~q~i~~~~~~~e~~~~~~~~~Iv~asi~--dpyvlv~~~dg~i~~l~~d~~~~~l~~~~~~~l~~~~~~i~  708 (1192)
                      +|+.|.....+...+.      |   ....|..+-+.  ||+|+-+.-|++|++|.+-......      .+......+.
T Consensus       259 ~RvWDiRtr~~V~~l~------G---H~~~V~~V~~~~~dpqvit~S~D~tvrlWDl~agkt~~------tlt~hkksvr  323 (460)
T KOG0285|consen  259 IRVWDIRTRASVHVLS------G---HTNPVASVMCQPTDPQVITGSHDSTVRLWDLRAGKTMI------TLTHHKKSVR  323 (460)
T ss_pred             EEEeeecccceEEEec------C---CCCcceeEEeecCCCceEEecCCceEEEeeeccCceeE------eeecccceee
Confidence            5666655444433332      1   22336666665  9999999999999999876543322      2223355667


Q ss_pred             EEEee
Q 001003          709 SCTLY  713 (1192)
Q Consensus       709 ~~~l~  713 (1192)
                      |+|+.
T Consensus       324 al~lh  328 (460)
T KOG0285|consen  324 ALCLH  328 (460)
T ss_pred             EEecC
Confidence            77765


No 100
>KOG0275 consensus Conserved WD40 repeat-containing protein [General function prediction only]
Probab=24.89  E-value=9.6e+02  Score=27.65  Aligned_cols=53  Identities=23%  Similarity=0.399  Sum_probs=34.6

Q ss_pred             cEEEEEEc-CC-EEEEEEeCCcEEEEEecCCCceEeeccccccccCCCceeEEEeeccCC
Q 001003          660 TVLSVSIA-DP-YVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKG  717 (1192)
Q Consensus       660 ~Iv~asi~-dp-yvlv~~~dg~i~~l~~d~~~~~l~~~~~~~l~~~~~~i~~~~l~~d~~  717 (1192)
                      .+.|.++. |. .++-+..||.|.+|.+....|+-....     .....|+|+++..|.+
T Consensus       265 aVlci~FSRDsEMlAsGsqDGkIKvWri~tG~ClRrFdr-----AHtkGvt~l~FSrD~S  319 (508)
T KOG0275|consen  265 AVLCISFSRDSEMLASGSQDGKIKVWRIETGQCLRRFDR-----AHTKGVTCLSFSRDNS  319 (508)
T ss_pred             ceEEEeecccHHHhhccCcCCcEEEEEEecchHHHHhhh-----hhccCeeEEEEccCcc
Confidence            36777664 33 455577899999999887766322111     1355688888877765


No 101
>KOG0279 consensus G protein beta subunit-like protein [Signal transduction mechanisms]
Probab=24.42  E-value=1.1e+03  Score=26.94  Aligned_cols=119  Identities=20%  Similarity=0.185  Sum_probs=69.2

Q ss_pred             CcEEEEEEc--CCEEEEEEeCCcEEEEEecCCCceEeeccccccccCCCceeEEEeeccCCCCCcccccccccccccCcc
Q 001003          659 STVLSVSIA--DPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGVG  736 (1192)
Q Consensus       659 ~~Iv~asi~--dpyvlv~~~dg~i~~l~~d~~~~~l~~~~~~~l~~~~~~i~~~~l~~d~~~~~~~~~~~~~~~~~~~~~  736 (1192)
                      .-+..+.+.  +..++=+=.||.+.++.+++..+...+..       ...|.++|+-    |                  
T Consensus       193 ~~v~t~~vSpDGslcasGgkdg~~~LwdL~~~k~lysl~a-------~~~v~sl~fs----p------------------  243 (315)
T KOG0279|consen  193 GYVNTVTVSPDGSLCASGGKDGEAMLWDLNEGKNLYSLEA-------FDIVNSLCFS----P------------------  243 (315)
T ss_pred             ccEEEEEECCCCCEEecCCCCceEEEEEccCCceeEeccC-------CCeEeeEEec----C------------------
Confidence            345555554  22333355688999999887665333221       2346666552    1                  


Q ss_pred             ccccCCCCCCCCCCcEEEEEEecCCeEEEEECCCceeeEEeeccccccceecccccccccccccccccCCCccCCCCCcc
Q 001003          737 EAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVDKFVSGRTHIVDTYMREALKDSETEINSSSEEGTGQGRK  816 (1192)
Q Consensus       737 ~~~~~~~~~~~~~~~~~l~v~~~~g~l~I~sLP~~~~v~~~~~l~~~~~~l~d~~~~~~~~~~~~~~~~~~~~~~~~~~~  816 (1192)
                                   ..+|+..++.. ++.||.|.+-.+++.-+     ++..-.         +               .+
T Consensus       244 -------------nrywL~~at~~-sIkIwdl~~~~~v~~l~-----~d~~g~---------s---------------~~  280 (315)
T KOG0279|consen  244 -------------NRYWLCAATAT-SIKIWDLESKAVVEELK-----LDGIGP---------S---------------SK  280 (315)
T ss_pred             -------------CceeEeeccCC-ceEEEeccchhhhhhcc-----cccccc---------c---------------cc
Confidence                         24788888865 48999998866664443     111000         0               00


Q ss_pred             cccccccEEEEEEeecCCCCCccEEEEEecCCcEEEEEEe
Q 001003          817 ENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQAY  856 (1192)
Q Consensus       817 ~~~~~~~v~eil~~~~g~~~~~p~Llv~l~dG~l~~Y~~~  856 (1192)
                          .-.+..+.+.-.   ....+||++..||-+-.+++-
T Consensus       281 ----~~~~~clslaws---~dG~tLf~g~td~~irv~qv~  313 (315)
T KOG0279|consen  281 ----AGDPICLSLAWS---ADGQTLFAGYTDNVIRVWQVA  313 (315)
T ss_pred             ----cCCcEEEEEEEc---CCCcEEEeeecCCcEEEEEee
Confidence                012344555433   457899999999999988863


No 102
>KOG0276 consensus Vesicle coat complex COPI, beta' subunit [Intracellular trafficking, secretion, and vesicular transport]
Probab=23.79  E-value=1.6e+03  Score=28.56  Aligned_cols=102  Identities=13%  Similarity=0.176  Sum_probs=60.2

Q ss_pred             CcEEEEeCCcceeeeecCCCCCCCCCCCCCCcEEEEEE--cCCEEEEEEeCCcEEEEEecCCCceEeeccccccccCCCc
Q 001003          629 RGARILDGSYMTQDLSFGPSNSESGSGSENSTVLSVSI--ADPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKP  706 (1192)
Q Consensus       629 ~~vrli~~~~~~q~i~~~~~~~e~~~~~~~~~Iv~asi--~dpyvlv~~~dg~i~~l~~d~~~~~l~~~~~~~l~~~~~~  706 (1192)
                      ..||++..+..- .+..|+-        ...-|.+..+  ..||++-..+|-+|.+|.-+.+=.     ..+.+....+-
T Consensus        77 ~~IrVfnynt~e-kV~~FeA--------H~DyIR~iavHPt~P~vLtsSDDm~iKlW~we~~wa-----~~qtfeGH~Hy  142 (794)
T KOG0276|consen   77 MQIRVFNYNTGE-KVKTFEA--------HSDYIRSIAVHPTLPYVLTSSDDMTIKLWDWENEWA-----CEQTFEGHEHY  142 (794)
T ss_pred             ceEEEEecccce-eeEEeec--------cccceeeeeecCCCCeEEecCCccEEEEeeccCcee-----eeeEEcCcceE
Confidence            358888766421 1222321        2234777777  699999999999999997664311     11223334667


Q ss_pred             eeEEEeeccCCCCCcccccccccccccCccccccCCCCCCCCCCcEEEEEEecCCeEEEEECCCceeeEEee
Q 001003          707 VSSCTLYHDKGPEPWLRKTSTDAWLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFTVD  778 (1192)
Q Consensus       707 i~~~~l~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~v~~~~g~l~I~sLP~~~~v~~~~  778 (1192)
                      |.++|++..                                 ..+.+.- +.-++++.||+|-+-.+.|+-+
T Consensus       143 VMqv~fnPk---------------------------------D~ntFaS-~sLDrTVKVWslgs~~~nfTl~  180 (794)
T KOG0276|consen  143 VMQVAFNPK---------------------------------DPNTFAS-ASLDRTVKVWSLGSPHPNFTLE  180 (794)
T ss_pred             EEEEEecCC---------------------------------Cccceee-eeccccEEEEEcCCCCCceeee
Confidence            777776521                                 0122322 3337899999997655555544


No 103
>PF02239 Cytochrom_D1:  Cytochrome D1 heme domain; PDB: 1NNO_B 1HZU_A 1N15_B 1N50_A 1GJQ_A 1BL9_B 1NIR_B 1N90_B 1HZV_A 1AOQ_A ....
Probab=22.87  E-value=4.6e+02  Score=31.16  Aligned_cols=67  Identities=21%  Similarity=0.425  Sum_probs=0.0

Q ss_pred             EEEEEec-CeEEEEEcCCCCccCCccceEEEecCCCccCeEEEecCCCEEEEEEeecCcccccccccccccccccccccc
Q 001003          969 FIYVTSQ-GILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDN 1047 (1192)
Q Consensus       969 fi~~~~~-~~LrI~~l~~~~~~d~~~~vrk~ipL~~tp~~Iay~~~~~~y~v~~s~~~~~~~~~~~~~~~~ee~~~~~~~ 1047 (1192)
                      ++|+.+. |.+.+-.+....      .+++ |+.|..|+.|+++++.+..++.+..+.                      
T Consensus        50 ~~yv~~rdg~vsviD~~~~~------~v~~-i~~G~~~~~i~~s~DG~~~~v~n~~~~----------------------  100 (369)
T PF02239_consen   50 YLYVANRDGTVSVIDLATGK------VVAT-IKVGGNPRGIAVSPDGKYVYVANYEPG----------------------  100 (369)
T ss_dssp             EEEEEETTSEEEEEETTSSS------EEEE-EE-SSEEEEEEE--TTTEEEEEEEETT----------------------
T ss_pred             EEEEEcCCCeEEEEECCccc------EEEE-EecCCCcceEEEcCCCCEEEEEecCCC----------------------


Q ss_pred             CCCCccccccCcceeeeEEEEEcCCCCCCCceeeeeEE
Q 001003         1048 HNLSSVDLHRTYTVEEYEVRILEPDRAGGPWQTRATIP 1085 (1192)
Q Consensus      1048 ~~~~~~~~~~~p~~~~~sv~Lldp~~~~~twe~id~~e 1085 (1192)
                                       .+.++|..    |++.+..+.
T Consensus       101 -----------------~v~v~D~~----tle~v~~I~  117 (369)
T PF02239_consen  101 -----------------TVSVIDAE----TLEPVKTIP  117 (369)
T ss_dssp             -----------------EEEEEETT----T--EEEEEE
T ss_pred             -----------------ceeEeccc----cccceeecc


No 104
>KOG0282 consensus mRNA splicing factor [Function unknown]
Probab=22.84  E-value=3e+02  Score=33.33  Aligned_cols=78  Identities=18%  Similarity=0.344  Sum_probs=53.2

Q ss_pred             EEEEe-CCceEEEecCCCCceeEEecccCCCCCCcEEEEEecCeEEEEEcCCCCccCCccceEEEecCCCccCeEEEecC
Q 001003          935 WCMVF-RERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAE 1013 (1192)
Q Consensus       935 wi~~~-~g~l~~~p~~~~~~v~~~t~F~~~~c~~Gfi~~~~~~~LrI~~l~~~~~~d~~~~vrk~ipL~~tp~~Iay~~~ 1013 (1192)
                      |=++. ++-+|.+. .-..+|..++. +  +|...|+-..=+..+++--....+      -..+ ..+++.|--+-+||+
T Consensus       242 W~vy~~~~~lrtf~-gH~k~Vrd~~~-s--~~g~~fLS~sfD~~lKlwDtETG~------~~~~-f~~~~~~~cvkf~pd  310 (503)
T KOG0282|consen  242 WNVYDDRRCLRTFK-GHRKPVRDASF-N--NCGTSFLSASFDRFLKLWDTETGQ------VLSR-FHLDKVPTCVKFHPD  310 (503)
T ss_pred             EEEecCcceehhhh-cchhhhhhhhc-c--ccCCeeeeeecceeeeeeccccce------EEEE-EecCCCceeeecCCC
Confidence            43444 44444333 22245555443 3  377777777667788887777665      5577 999999999999999


Q ss_pred             C-CEEEEEEee
Q 001003         1014 K-NLYPLIVSV 1023 (1192)
Q Consensus      1014 ~-~~y~v~~s~ 1023 (1192)
                      . ++|.+..+.
T Consensus       311 ~~n~fl~G~sd  321 (503)
T KOG0282|consen  311 NQNIFLVGGSD  321 (503)
T ss_pred             CCcEEEEecCC
Confidence            8 888888775


No 105
>PTZ00421 coronin; Provisional
Probab=22.40  E-value=1.5e+03  Score=27.96  Aligned_cols=77  Identities=14%  Similarity=0.077  Sum_probs=47.5

Q ss_pred             CcEEEEEEc---CCEEEEEEeCCcEEEEEecCCCceEeeccccccccCCCceeEEEeeccCCCCCcccccccccccccCc
Q 001003          659 STVLSVSIA---DPYVLLGMSDGSIRLLVGDPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDAWLSTGV  735 (1192)
Q Consensus       659 ~~Iv~asi~---dpyvlv~~~dg~i~~l~~d~~~~~l~~~~~~~l~~~~~~i~~~~l~~d~~~~~~~~~~~~~~~~~~~~  735 (1192)
                      ..|.+++..   +.+++.+..|++|.+|.+........+.      .....|.++++..|                    
T Consensus       126 ~~V~~l~f~P~~~~iLaSgs~DgtVrIWDl~tg~~~~~l~------~h~~~V~sla~spd--------------------  179 (493)
T PTZ00421        126 KKVGIVSFHPSAMNVLASAGADMVVNVWDVERGKAVEVIK------CHSDQITSLEWNLD--------------------  179 (493)
T ss_pred             CcEEEEEeCcCCCCEEEEEeCCCEEEEEECCCCeEEEEEc------CCCCceEEEEEECC--------------------
Confidence            346666664   2467778889999999887543322211      12345666544211                    


Q ss_pred             cccccCCCCCCCCCCcEEEEEEecCCeEEEEECCCceeeEE
Q 001003          736 GEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNCVFT  776 (1192)
Q Consensus       736 ~~~~~~~~~~~~~~~~~~l~v~~~~g~l~I~sLP~~~~v~~  776 (1192)
                                    .. +++.+..+|.++||.+-+.+.+..
T Consensus       180 --------------G~-lLatgs~Dg~IrIwD~rsg~~v~t  205 (493)
T PTZ00421        180 --------------GS-LLCTTSKDKKLNIIDPRDGTIVSS  205 (493)
T ss_pred             --------------CC-EEEEecCCCEEEEEECCCCcEEEE
Confidence                          12 566677799999999877665443


No 106
>PF14781 BBS2_N:  Ciliary BBSome complex subunit 2, N-terminal
Probab=22.28  E-value=2.1e+02  Score=28.91  Aligned_cols=44  Identities=18%  Similarity=0.296  Sum_probs=34.1

Q ss_pred             EEEEEEeeeEEeEEEEEecCCCCCCCCccEEEEEeCCCeEEEEEEeCCCC
Q 001003          103 LVCHYRLHGNVESLAILSQGGADNSRRRDSIILAFEDAKISVLEFDDSIH  152 (1192)
Q Consensus       103 lv~e~~l~G~I~~l~~~r~~~~~~~~~~D~Llv~~~~aklsil~~d~~~~  152 (1192)
                      -+...++.-.|++|+.=|+...+   .+|.|+|+|..   .||-||-+.+
T Consensus        40 ~i~~LNin~~italaaG~l~~~~---~~D~LliGt~t---~llaYDV~~N   83 (136)
T PF14781_consen   40 DISFLNINQEITALAAGRLKPDD---GRDCLLIGTQT---SLLAYDVENN   83 (136)
T ss_pred             ceeEEECCCceEEEEEEecCCCC---CcCEEEEeccc---eEEEEEcccC
Confidence            35556777789999988886433   79999999985   6888997655


No 107
>PF06977 SdiA-regulated:  SdiA-regulated;  InterPro: IPR009722 This entry represents a conserved region approximately 100 residues long within a number of hypothetical bacterial proteins that may be regulated by SdiA, a member of the LuxR family of transcriptional regulators. Some proteins contain the IPR001258 from INTERPRO repeat.; PDB: 3QQZ_A.
Probab=22.03  E-value=6.5e+02  Score=28.24  Aligned_cols=92  Identities=20%  Similarity=0.102  Sum_probs=54.9

Q ss_pred             EEEEcCCCCeEEEE-e-C-CceEEEecCCCCceeEEecccCCCCCCcEEEEEe--cCeEEEEEcCCCCccCCccceEEEe
Q 001003          925 GFFLSGSRPCWCMV-F-R-ERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTS--QGILKICQLPSGSTYDNYWPVQKVI  999 (1192)
Q Consensus       925 gVF~~G~rP~wi~~-~-~-g~l~~~p~~~~~~v~~~t~F~~~~c~~Gfi~~~~--~~~LrI~~l~~~~~~d~~~~vrk~i  999 (1192)
                      ..|+..+.|..|+. + + ..++-.|+...+....++-     ..+|.+++.+  .+.|.+.+++.....-..=.+++ +
T Consensus        35 tLfaV~d~~~~i~els~~G~vlr~i~l~g~~D~EgI~y-----~g~~~~vl~~Er~~~L~~~~~~~~~~~~~~~~~~~-~  108 (248)
T PF06977_consen   35 TLFAVQDEPGEIYELSLDGKVLRRIPLDGFGDYEGITY-----LGNGRYVLSEERDQRLYIFTIDDDTTSLDRADVQK-I  108 (248)
T ss_dssp             EEEEEETTTTEEEEEETT--EEEEEE-SS-SSEEEEEE------STTEEEEEETTTTEEEEEEE----TT--EEEEEE-E
T ss_pred             eEEEEECCCCEEEEEcCCCCEEEEEeCCCCCCceeEEE-----ECCCEEEEEEcCCCcEEEEEEeccccccchhhceE-E
Confidence            48888888888875 3 4 3456666666666777766     4556666666  47899999976543211223566 7


Q ss_pred             cCCCc------cCeEEEecCCCEEEEEEe
Q 001003         1000 PLKAT------PHQITYFAEKNLYPLIVS 1022 (1192)
Q Consensus      1000 pL~~t------p~~Iay~~~~~~y~v~~s 1022 (1192)
                      +|+-.      .--|||.+..+.+.++.-
T Consensus       109 ~l~~~~~~N~G~EGla~D~~~~~L~v~kE  137 (248)
T PF06977_consen  109 SLGFPNKGNKGFEGLAYDPKTNRLFVAKE  137 (248)
T ss_dssp             E---S---SS--EEEEEETTTTEEEEEEE
T ss_pred             ecccccCCCcceEEEEEcCCCCEEEEEeC
Confidence            77765      456999999998888754


No 108
>COG4257 Vgb Streptogramin lyase [Defense mechanisms]
Probab=21.99  E-value=8.5e+02  Score=27.84  Aligned_cols=80  Identities=20%  Similarity=0.230  Sum_probs=59.2

Q ss_pred             EEEEcCCCCeEEEEeCCceEEEecCCCCceeEEecccCCCCCCcEEEEEecCeEEEEEcCCCCccCCccceEEEecCCC-
Q 001003          925 GFFLSGSRPCWCMVFRERLRVHPQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNYWPVQKVIPLKA- 1003 (1192)
Q Consensus       925 gVF~~G~rP~wi~~~~g~l~~~p~~~~~~v~~~t~F~~~~c~~Gfi~~~~~~~LrI~~l~~~~~~d~~~~vrk~ipL~~- 1003 (1192)
                      ...+.|+.|    ...+...++|+.+     .-.||.-.--|+|-+.++.+++=-|+.|++..-     -+.+ +||+. 
T Consensus        39 a~~A~gs~p----a~~~s~~~fpvp~-----G~ap~dvapapdG~VWft~qg~gaiGhLdP~tG-----ev~~-ypLg~G  103 (353)
T COG4257          39 ATPAAGSSP----APDGSSAEFPVPN-----GSAPFDVAPAPDGAVWFTAQGTGAIGHLDPATG-----EVET-YPLGSG  103 (353)
T ss_pred             cchhhcCCC----CCCCccceeccCC-----CCCccccccCCCCceEEecCccccceecCCCCC-----ceEE-EecCCC
Confidence            344556666    3456667777644     244555555899999999999999999999763     6777 99976 


Q ss_pred             -ccCeEEEecCCCEEEE
Q 001003         1004 -TPHQITYFAEKNLYPL 1019 (1192)
Q Consensus      1004 -tp~~Iay~~~~~~y~v 1019 (1192)
                       .||.|.--|+....+.
T Consensus       104 a~Phgiv~gpdg~~Wit  120 (353)
T COG4257         104 ASPHGIVVGPDGSAWIT  120 (353)
T ss_pred             CCCceEEECCCCCeeEe
Confidence             7999999988777655


No 109
>KOG0288 consensus WD40 repeat protein TipD [General function prediction only]
Probab=21.47  E-value=8.5e+02  Score=29.18  Aligned_cols=24  Identities=17%  Similarity=0.106  Sum_probs=18.8

Q ss_pred             EEEEEecCCeEEEEECCCceeeEE
Q 001003          753 YSVVCYESGALEIFDVPNFNCVFT  776 (1192)
Q Consensus       753 ~l~v~~~~g~l~I~sLP~~~~v~~  776 (1192)
                      |++.+-.+|++.||++-.-++.+.
T Consensus       401 YvaAGS~dgsv~iW~v~tgKlE~~  424 (459)
T KOG0288|consen  401 YVAAGSADGSVYIWSVFTGKLEKV  424 (459)
T ss_pred             eeeeccCCCcEEEEEccCceEEEE
Confidence            777788899999999977555433


No 110
>KOG0306 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=21.20  E-value=1.9e+03  Score=28.53  Aligned_cols=113  Identities=13%  Similarity=0.257  Sum_probs=63.4

Q ss_pred             eeEEecCCCCeEEEEecceEEEEeCCCceeEeccccccccCCCccCcCCCceeEeeceeeE----EeeCcEEEEEcCCCC
Q 001003          310 KLLAVPSPIGGVLVVGANTIHYHSQSASCALALNNYAVSLDSSQELPRSSFSVELDAAHAT----WLQNDVALLSTKTGD  385 (1192)
Q Consensus       310 ~LipvP~p~GGvLVig~n~I~y~d~~~~~~~a~N~~~~~~~~~~~~~~~~~~l~l~~~~~~----~~~~~~~Ll~~~~G~  385 (1192)
                      -++|.++-..-++.+..|.+-|+......            ...+.+.+..++.+-|.+..    -++++..++....|+
T Consensus       327 dv~~~~~~~~~lv~l~nNtv~~ysl~~s~------------~~~p~~~~~~~i~~~GHR~dVRsl~vS~d~~~~~Sga~~  394 (888)
T KOG0306|consen  327 DVTPSGGTENTLVLLANNTVEWYSLENSG------------KTSPEADRTSNIEIGGHRSDVRSLCVSSDSILLASGAGE  394 (888)
T ss_pred             EEEecCCcceeEEEeecCceEEEEeccCC------------CCCccccccceeeeccchhheeEEEeecCceeeeecCCC
Confidence            35555544433334667777777654311            00122223334444443221    246677788888888


Q ss_pred             EEEEEEEECCceEeeEEEEecCCCccccceEEecCCeEEEEeeeCCeEEEEEee
Q 001003          386 LVLLTVVYDGRVVQRLDLSKTNPSVLTSDITTIGNSLFFLGSRLGDSLLVQFTC  439 (1192)
Q Consensus       386 L~~l~l~~dg~~V~~l~l~~~~~~~~~s~l~~l~~g~lFvGS~~GDS~Ll~~~~  439 (1192)
                      =+++-.....+.++.|...    ..++++++. ++.++-+|-..|.=++|-+.+
T Consensus       395 SikiWn~~t~kciRTi~~~----y~l~~~Fvp-gd~~Iv~G~k~Gel~vfdlaS  443 (888)
T KOG0306|consen  395 SIKIWNRDTLKCIRTITCG----YILASKFVP-GDRYIVLGTKNGELQVFDLAS  443 (888)
T ss_pred             cEEEEEccCcceeEEeccc----cEEEEEecC-CCceEEEeccCCceEEEEeeh
Confidence            7777665445555545443    234555544 677888888888888887765


No 111
>KOG2096 consensus WD40 repeat protein [General function prediction only]
Probab=21.08  E-value=1.3e+03  Score=26.74  Aligned_cols=72  Identities=13%  Similarity=0.097  Sum_probs=44.6

Q ss_pred             ecCCCCceeEEecccCCCCCCcEEEEEecCeEEEEEcCCCCccCCc---cceEEEecCC---CccCeEEEecCCCEEEEE
Q 001003          947 PQLCDGSIVAFTVLHNVNCNHGFIYVTSQGILKICQLPSGSTYDNY---WPVQKVIPLK---ATPHQITYFAEKNLYPLI 1020 (1192)
Q Consensus       947 p~~~~~~v~~~t~F~~~~c~~Gfi~~~~~~~LrI~~l~~~~~~d~~---~~vrk~ipL~---~tp~~Iay~~~~~~y~v~ 1020 (1192)
                      .+.+.........|+|.  ..-.+-++.+|..||-..+=..+.+..   +-.-. +||.   ..|-|++.+|+.+.+++.
T Consensus       273 ~LkGH~saV~~~aFsn~--S~r~vtvSkDG~wriwdtdVrY~~~qDpk~Lk~g~-~pl~aag~~p~RL~lsP~g~~lA~s  349 (420)
T KOG2096|consen  273 SLKGHQSAVLAAAFSNS--STRAVTVSKDGKWRIWDTDVRYEAGQDPKILKEGS-APLHAAGSEPVRLELSPSGDSLAVS  349 (420)
T ss_pred             eeccchhheeeeeeCCC--cceeEEEecCCcEEEeeccceEecCCCchHhhcCC-cchhhcCCCceEEEeCCCCcEEEee
Confidence            33344444444556553  234588888999999887654433211   12223 4553   358999999999999985


Q ss_pred             E
Q 001003         1021 V 1021 (1192)
Q Consensus      1021 ~ 1021 (1192)
                      .
T Consensus       350 ~  350 (420)
T KOG2096|consen  350 F  350 (420)
T ss_pred             c
Confidence            3


No 112
>PF14781 BBS2_N:  Ciliary BBSome complex subunit 2, N-terminal
Probab=20.80  E-value=2.7e+02  Score=28.21  Aligned_cols=39  Identities=21%  Similarity=0.309  Sum_probs=31.0

Q ss_pred             CcceEEEEEEEeccccCCCCceEEEEEeccccCcccccCceEEEEEeeeCCC
Q 001003         1089 SENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTGRNAD 1140 (1192)
Q Consensus      1089 ~E~v~sv~~v~L~s~~t~~~~~ylaVGTa~~~gEd~~~rGRIlvfev~~~~~ 1140 (1192)
                      |+.++|+..-.|+   ....++.|+|||..          .|+.|+|.++.+
T Consensus        47 n~~italaaG~l~---~~~~~D~LliGt~t----------~llaYDV~~N~d   85 (136)
T PF14781_consen   47 NQEITALAAGRLK---PDDGRDCLLIGTQT----------SLLAYDVENNSD   85 (136)
T ss_pred             CCceEEEEEEecC---CCCCcCEEEEeccc----------eEEEEEcccCch
Confidence            6789999999997   33457999999864          499999987544


No 113
>KOG0639 consensus Transducin-like enhancer of split protein (contains WD40 repeats) [Chromatin structure and dynamics]
Probab=20.77  E-value=6.5e+02  Score=30.84  Aligned_cols=26  Identities=19%  Similarity=0.378  Sum_probs=21.3

Q ss_pred             EEEEEEecCCeEEEEECCCceeeEEe
Q 001003          752 IYSVVCYESGALEIFDVPNFNCVFTV  777 (1192)
Q Consensus       752 ~~l~v~~~~g~l~I~sLP~~~~v~~~  777 (1192)
                      -+||-|..+|.+.||.|-+..+|=+.
T Consensus       522 kvcFsccsdGnI~vwDLhnq~~Vrqf  547 (705)
T KOG0639|consen  522 KVCFSCCSDGNIAVWDLHNQTLVRQF  547 (705)
T ss_pred             ceeeeeccCCcEEEEEcccceeeecc
Confidence            38999999999999999886665433


No 114
>KOG2314 consensus Translation initiation factor 3, subunit b (eIF-3b) [Translation, ribosomal structure and biogenesis]
Probab=20.38  E-value=7.1e+02  Score=30.97  Aligned_cols=129  Identities=16%  Similarity=0.231  Sum_probs=80.9

Q ss_pred             eEEEEEcCCCCccCCccceEEEecCCCccCeEEEecCCCEEEEEEeecCccccccccccccccccccccccCCCCccccc
Q 001003          977 ILKICQLPSGSTYDNYWPVQKVIPLKATPHQITYFAEKNLYPLIVSVPVLKPLNQVLSLLIDQEVGHQIDNHNLSSVDLH 1056 (1192)
Q Consensus       977 ~LrI~~l~~~~~~d~~~~vrk~ipL~~tp~~Iay~~~~~~y~v~~s~~~~~~~~~~~~~~~~ee~~~~~~~~~~~~~~~~ 1056 (1192)
                      .+-||+|..-     ..|+-. ++|+++.-..|+.|..+-|+|+.....   .+                       .  
T Consensus       426 n~eIfrireK-----dIpve~-velke~vi~FaWEP~gdkF~vi~g~~~---k~-----------------------t--  471 (698)
T KOG2314|consen  426 NLEIFRIREK-----DIPVEV-VELKESVIAFAWEPHGDKFAVISGNTV---KN-----------------------T--  471 (698)
T ss_pred             eEEEEEeecc-----CCCcee-eecchheeeeeeccCCCeEEEEEcccc---cc-----------------------c--
Confidence            7889999975     479999 999999999999999999999864310   00                       0  


Q ss_pred             cCcceeeeEEEEEcCCCCCCCceeeeeEECCCCcceEEEEEEEeccccCCCCceEEEEEeccccCcccccCceEEEEEee
Q 001003         1057 RTYTVEEYEVRILEPDRAGGPWQTRATIPMQSSENALTVRVVTLFNTTTKENETLLAIGTAYVQGEDVAARGRVLLFSTG 1136 (1192)
Q Consensus      1057 ~~p~~~~~sv~Lldp~~~~~twe~id~~el~~~E~v~sv~~v~L~s~~t~~~~~ylaVGTa~~~gEd~~~rGRIlvfev~ 1136 (1192)
                          ..-|.++- .    +..|..+-.++  . -+   +-.|..     .....|+||++--      ..+|.+..+|..
T Consensus       472 ----vsfY~~e~-~----~~~~~lVk~~d--k-~~---~N~vfw-----sPkG~fvvva~l~------s~~g~l~F~D~~  525 (698)
T KOG2314|consen  472 ----VSFYAVET-N----IKKPSLVKELD--K-KF---ANTVFW-----SPKGRFVVVAALV------SRRGDLEFYDTD  525 (698)
T ss_pred             ----eeEEEeec-C----CCchhhhhhhc--c-cc---cceEEE-----cCCCcEEEEEEec------ccccceEEEecc
Confidence                01111111 1    13555543332  1 11   122322     2234699998732      378999999887


Q ss_pred             e----CCCCCceeEeecccCcccccchhcccCceEEEeecc
Q 001003         1137 R----NADNPQNLVLSGSYGPLFSSVQIDFASHFFAICSNS 1173 (1192)
Q Consensus      1137 ~----~~~~~e~~~~~~~~~~~~~~~~~~~~~~~~a~~~~~ 1173 (1192)
                      -    ....||--        +++-+.=|--|+-.++|+.+
T Consensus       526 ~a~~k~~~~~eh~--------~at~veWDPtGRYvvT~ss~  558 (698)
T KOG2314|consen  526 YADLKDTASPEHF--------AATEVEWDPTGRYVVTSSSS  558 (698)
T ss_pred             hhhhhhccCcccc--------ccccceECCCCCEEEEeeeh
Confidence            2    33333321        23344468889999999988


No 115
>KOG1517 consensus Guanine nucleotide binding protein MIP1 [Cell cycle control, cell division, chromosome partitioning]
Probab=20.20  E-value=1.9e+03  Score=29.78  Aligned_cols=192  Identities=16%  Similarity=0.158  Sum_probs=0.0

Q ss_pred             EEEEEeccceEEEEe-cCceeeeecccCcc-ccCCcEEEEeeCCCCEEEEEecCc-EEEEeCCcceeeeecCCCCCCCCC
Q 001003          578 YLIISLEARTMVLET-ADLLTEVTESVDYF-VQGRTIAAGNLFGRRRVIQVFERG-ARILDGSYMTQDLSFGPSNSESGS  654 (1192)
Q Consensus       578 yLvlS~~~~T~Vl~~-g~~~eEv~~~~gF~-~~~~TI~ag~l~~~~~IvQVt~~~-vrli~~~~~~q~i~~~~~~~e~~~  654 (1192)
                      ||+++-.  +++.++ +-+-|.+-....+. ....|-.-+.+-.++.|+==+..| ||+||.....++-.+.    -|..
T Consensus      1179 ~Ll~tGd--~r~IRIWDa~~E~~~~diP~~s~t~vTaLS~~~~~gn~i~AGfaDGsvRvyD~R~a~~ds~v~----~~R~ 1252 (1387)
T KOG1517|consen 1179 HLLVTGD--VRSIRIWDAHKEQVVADIPYGSSTLVTALSADLVHGNIIAAGFADGSVRVYDRRMAPPDSLVC----VYRE 1252 (1387)
T ss_pred             eEEecCC--eeEEEEEecccceeEeecccCCCccceeecccccCCceEEEeecCCceEEeecccCCccccce----eecc


Q ss_pred             CCCCCcEEEEEEcC-CEE-EE-EEeCCcEEEEEe--cCCCceEeeccccccccCCCceeEEEeeccCCCCCccccccccc
Q 001003          655 GSENSTVLSVSIAD-PYV-LL-GMSDGSIRLLVG--DPSTCTVSVQTPAAIESSKKPVSSCTLYHDKGPEPWLRKTSTDA  729 (1192)
Q Consensus       655 ~~~~~~Iv~asi~d-pyv-lv-~~~dg~i~~l~~--d~~~~~l~~~~~~~l~~~~~~i~~~~l~~d~~~~~~~~~~~~~~  729 (1192)
                      -.....|+++++.. ++. +| ++.||.|.+|.+  .+....+.+..+-..   .+.++|+.+..+.             
T Consensus      1253 h~~~~~Iv~~slq~~G~~elvSgs~~G~I~~~DlR~~~~e~~~~iv~~~~y---Gs~lTal~VH~ha------------- 1316 (1387)
T KOG1517|consen 1253 HNDVEPIVHLSLQRQGLGELVSGSQDGDIQLLDLRMSSKETFLTIVAHWEY---GSALTALTVHEHA------------- 1316 (1387)
T ss_pred             cCCcccceeEEeecCCCcceeeeccCCeEEEEecccCcccccceeeecccc---CccceeeeeccCC-------------


Q ss_pred             ccccCccccccCCCCCCCCCCcEEEEEEecCCeEEEEECCCcee-eEEeeccccccceecccccccccccccccccCCCc
Q 001003          730 WLSTGVGEAIDGADGGPLDQGDIYSVVCYESGALEIFDVPNFNC-VFTVDKFVSGRTHIVDTYMREALKDSETEINSSSE  808 (1192)
Q Consensus       730 ~~~~~~~~~~~~~~~~~~~~~~~~l~v~~~~g~l~I~sLP~~~~-v~~~~~l~~~~~~l~d~~~~~~~~~~~~~~~~~~~  808 (1192)
                                            .+++.+.. +.+.||++.--.+ ++.....-+++++                      
T Consensus      1317 ----------------------piiAsGs~-q~ikIy~~~G~~l~~~k~n~~F~~q~~---------------------- 1351 (1387)
T KOG1517|consen 1317 ----------------------PIIASGSA-QLIKIYSLSGEQLNIIKYNPGFMGQRI---------------------- 1351 (1387)
T ss_pred             ----------------------CeeeecCc-ceEEEEecChhhhcccccCcccccCcC----------------------


Q ss_pred             cCCCCCcccccccccEEEEEEeecCCCCCccEEEEEecCCcEEEEE
Q 001003          809 EGTGQGRKENIHSMKVVELAMQRWSAHHSRPFLFAILTDGTILCYQ  854 (1192)
Q Consensus       809 ~~~~~~~~~~~~~~~v~eil~~~~g~~~~~p~Llv~l~dG~l~~Y~  854 (1192)
                                   ..+..+-+++.     +++|-++-.|..+-+|.
T Consensus      1352 -------------gs~scL~FHP~-----~~llAaG~~Ds~V~iYs 1379 (1387)
T KOG1517|consen 1352 -------------GSVSCLAFHPH-----RLLLAAGSADSTVSIYS 1379 (1387)
T ss_pred             -------------CCcceeeecch-----hHhhhhccCCceEEEee


No 116
>COG3204 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=20.11  E-value=5.3e+02  Score=29.62  Aligned_cols=91  Identities=16%  Similarity=0.155  Sum_probs=57.0

Q ss_pred             eEEEEcCCCCeEEEE--eCCc-eEEEecCCCCceeEEecccCCCCCCc-EEEEEe-cCeEEEEEcCCCCccCCccceEEE
Q 001003          924 QGFFLSGSRPCWCMV--FRER-LRVHPQLCDGSIVAFTVLHNVNCNHG-FIYVTS-QGILKICQLPSGSTYDNYWPVQKV  998 (1192)
Q Consensus       924 ~gVF~~G~rP~wi~~--~~g~-l~~~p~~~~~~v~~~t~F~~~~c~~G-fi~~~~-~~~LrI~~l~~~~~~d~~~~vrk~  998 (1192)
                      +..|+.+.+|.-|+.  .+|. ++-.|+.......++.=     ..+| |+..+. +..|.+.+++..... -..-..+ 
T Consensus        98 rtLFav~n~p~~iVElt~~GdlirtiPL~g~~DpE~Iey-----ig~n~fvi~dER~~~l~~~~vd~~t~~-~~~~~~~-  170 (316)
T COG3204          98 RTLFAVTNKPAAIVELTKEGDLIRTIPLTGFSDPETIEY-----IGGNQFVIVDERDRALYLFTVDADTTV-ISAKVQK-  170 (316)
T ss_pred             ceEEEecCCCceEEEEecCCceEEEecccccCChhHeEE-----ecCCEEEEEehhcceEEEEEEcCCccE-EeccceE-
Confidence            579999999988865  3444 55556543222111111     1334 333333 268888888887432 1223347 


Q ss_pred             ecCCCccC------eEEEecCCCEEEEEE
Q 001003          999 IPLKATPH------QITYFAEKNLYPLIV 1021 (1192)
Q Consensus       999 ipL~~tp~------~Iay~~~~~~y~v~~ 1021 (1192)
                      |||+.+++      -+||.|..+.+.++-
T Consensus       171 i~L~~~~k~N~GfEGlA~d~~~~~l~~aK  199 (316)
T COG3204         171 IPLGTTNKKNKGFEGLAWDPVDHRLFVAK  199 (316)
T ss_pred             EeccccCCCCcCceeeecCCCCceEEEEE
Confidence            99999988      699999999988864


Done!