Query         015021
Match_columns 414
No_of_seqs    97 out of 116
Neff          5.0 
Searched_HMMs 46136
Date          Fri Mar 29 02:20:04 2013
Command       hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/015021.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/015021hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PF14655 RAB3GAP2_N:  Rab3 GTPa 100.0  2E-100  5E-105  782.6  38.0  365   28-405     1-405 (415)
  2 KOG2727 Rab3 GTPase-activating 100.0 1.1E-90 2.3E-95  739.9  20.1  381    1-405     1-413 (1244)
  3 cd00200 WD40 WD40 domain, foun  97.9   0.018   4E-07   51.1  25.7   81  305-404   175-255 (289)
  4 KOG0273 Beta-transducin family  97.8 0.00026 5.7E-09   74.2  12.2   53  295-347   440-492 (524)
  5 KOG0271 Notchless-like WD40 re  97.4  0.0039 8.5E-08   64.6  14.8   79  303-402   363-443 (480)
  6 KOG0263 Transcription initiati  96.9   0.016 3.4E-07   64.0  13.8   83  298-399   568-650 (707)
  7 KOG0266 WD40 repeat-containing  96.7   0.011 2.3E-07   62.2  10.2   88  299-405   195-283 (456)
  8 PTZ00421 coronin; Provisional   96.6   0.014 3.1E-07   62.3  11.0   88  299-405   117-205 (493)
  9 KOG1539 WD repeat protein [Gen  96.6   0.066 1.4E-06   60.0  16.1  286   77-403   203-528 (910)
 10 PLN00181 protein SPA1-RELATED;  96.5     1.2 2.6E-05   49.9  25.9   93   33-129   487-585 (793)
 11 KOG0266 WD40 repeat-containing  96.5   0.014   3E-07   61.3  10.2   85  299-402   238-322 (456)
 12 KOG0299 U3 snoRNP-associated p  96.5  0.0039 8.5E-08   65.4   6.0   82  305-405   200-281 (479)
 13 KOG0319 WD40-repeat-containing  96.2    0.41 8.8E-06   53.3  19.2   86  299-405   497-584 (775)
 14 cd00200 WD40 WD40 domain, foun  96.2   0.034 7.5E-07   49.4   9.3   82  305-405   133-214 (289)
 15 KOG0649 WD40 repeat protein [G  96.0   0.035 7.6E-07   55.0   8.8   82  309-409   116-197 (325)
 16 KOG0263 Transcription initiati  95.9   0.021 4.5E-07   63.1   7.8   88  299-405   527-614 (707)
 17 KOG0271 Notchless-like WD40 re  95.9   0.026 5.7E-07   58.6   7.9   96  297-406   147-243 (480)
 18 PF12341 DUF3639:  Protein of u  95.8   0.014 3.1E-07   38.9   3.6   27   76-106     1-27  (27)
 19 KOG0291 WD40-repeat-containing  95.8     1.7 3.6E-05   49.0  21.3  112   34-160   101-221 (893)
 20 KOG0639 Transducin-like enhanc  95.0   0.027 5.8E-07   60.3   4.5   77  308-404   510-587 (705)
 21 KOG0265 U5 snRNP-specific prot  94.8   0.097 2.1E-06   53.0   7.6   79  310-406    93-171 (338)
 22 KOG2727 Rab3 GTPase-activating  94.7   0.039 8.5E-07   62.5   4.9   84  140-223   150-243 (1244)
 23 PTZ00421 coronin; Provisional   94.5    0.26 5.6E-06   52.9  10.5   81  303-402   164-249 (493)
 24 KOG0315 G-protein beta subunit  94.5    0.28 6.1E-06   48.9   9.8   87  299-404   207-294 (311)
 25 KOG1446 Histone H3 (Lys4) meth  94.2    0.26 5.7E-06   49.9   9.2   82  308-405   188-269 (311)
 26 KOG0279 G protein beta subunit  94.1    0.29 6.3E-06   49.3   9.2   88  301-405    99-187 (315)
 27 KOG0316 Conserved WD40 repeat-  94.1    0.18   4E-06   49.9   7.6   84  297-399    49-132 (307)
 28 KOG1273 WD40 repeat protein [G  93.9    0.18   4E-06   51.6   7.5   87  301-412    18-104 (405)
 29 TIGR03866 PQQ_ABC_repeats PQQ-  93.8    0.54 1.2E-05   43.7  10.1   76  310-405   209-286 (300)
 30 TIGR03866 PQQ_ABC_repeats PQQ-  93.8    0.49 1.1E-05   44.0   9.8   75  311-404    34-109 (300)
 31 PTZ00420 coronin; Provisional   93.8    0.47   1E-05   52.0  10.9   87  299-405   117-204 (568)
 32 KOG0295 WD40 repeat-containing  93.7     5.8 0.00013   41.5  17.8   92  309-406   237-330 (406)
 33 PF00400 WD40:  WD domain, G-be  93.0    0.18   4E-06   33.9   4.1   36  300-335     4-39  (39)
 34 PTZ00420 coronin; Provisional   93.0    0.99 2.2E-05   49.5  11.8   50  308-357   168-220 (568)
 35 PLN00181 protein SPA1-RELATED;  92.7    0.66 1.4E-05   51.9  10.3   79  309-405   534-613 (793)
 36 PF14727 PHTB1_N:  PTHB1 N-term  91.6     1.3 2.9E-05   46.8  10.4   70   80-154    23-102 (418)
 37 KOG0308 Conserved WD40 repeat-  91.2    0.73 1.6E-05   50.9   8.0   80  307-405   171-250 (735)
 38 KOG1274 WD40 repeat protein [G  91.0     0.7 1.5E-05   52.5   7.9  108    9-127   442-570 (933)
 39 KOG0272 U4/U6 small nuclear ri  90.6    0.93   2E-05   47.8   7.9   76  307-403   303-380 (459)
 40 KOG0284 Polyadenylation factor  90.6    0.44 9.5E-06   50.1   5.4   79  309-408   182-262 (464)
 41 PF02239 Cytochrom_D1:  Cytochr  89.9     1.9   4E-05   44.6   9.4   50  303-352    32-81  (369)
 42 KOG0646 WD40 repeat protein [G  89.8     3.6 7.7E-05   44.0  11.4  203   88-346    91-316 (476)
 43 KOG0279 G protein beta subunit  88.9     2.2 4.9E-05   43.1   8.8   78  307-403   148-227 (315)
 44 PF15492 Nbas_N:  Neuroblastoma  88.8    0.63 1.4E-05   46.7   4.8   43  304-346   226-268 (282)
 45 KOG0277 Peroxisomal targeting   88.5     1.5 3.2E-05   44.1   7.1   88  297-402    94-183 (311)
 46 PF15492 Nbas_N:  Neuroblastoma  88.2     2.5 5.4E-05   42.6   8.6   76   35-113     3-78  (282)
 47 KOG0282 mRNA splicing factor [  88.1     1.2 2.6E-05   47.7   6.6   67  310-396   435-503 (503)
 48 KOG0319 WD40-repeat-containing  88.1     2.2 4.7E-05   47.8   8.8   82  305-405   461-542 (775)
 49 KOG0283 WD40 repeat-containing  87.6     3.1 6.7E-05   46.7   9.7   69   88-164   461-541 (712)
 50 KOG0285 Pleiotropic regulator   86.8     4.7  0.0001   42.2   9.8   50  301-350   145-194 (460)
 51 KOG0318 WD40 repeat stress pro  86.8     3.3 7.1E-05   45.0   9.0   40  303-342   483-522 (603)
 52 KOG1407 WD40 repeat protein [F  86.7    0.58 1.3E-05   46.9   3.2   47  299-345   181-227 (313)
 53 KOG0291 WD40-repeat-containing  86.3      12 0.00026   42.5  13.1   88   35-131   313-404 (893)
 54 PF08662 eIF2A:  Eukaryotic tra  86.3     5.6 0.00012   37.1   9.4   75  310-403   103-183 (194)
 55 PF08662 eIF2A:  Eukaryotic tra  86.2     5.3 0.00011   37.3   9.2   76  309-405    61-140 (194)
 56 KOG0288 WD40 repeat protein Ti  86.0     1.5 3.2E-05   46.3   5.8   42  310-351   390-431 (459)
 57 TIGR02658 TTQ_MADH_Hv methylam  85.8     4.5 9.7E-05   41.9   9.2   67  318-404    11-91  (352)
 58 KOG0281 Beta-TrCP (transducin   84.9     2.6 5.6E-05   44.0   6.8   98  308-405   236-355 (499)
 59 PF08596 Lgl_C:  Lethal giant l  84.8     4.8  0.0001   42.2   9.0   64   43-106    99-171 (395)
 60 PF12894 Apc4_WD40:  Anaphase-p  84.5     1.9   4E-05   32.1   4.2   36   76-113    11-46  (47)
 61 KOG0296 Angio-associated migra  84.2      12 0.00026   39.2  11.3   98  303-403   102-225 (399)
 62 KOG0275 Conserved WD40 repeat-  84.1     1.2 2.5E-05   46.0   4.0   81  306-405   305-385 (508)
 63 KOG4640 Anaphase-promoting com  83.8     2.8 6.1E-05   46.3   6.9   84   34-126    25-112 (665)
 64 KOG0276 Vesicle coat complex C  83.8      16 0.00035   40.7  12.6  211   77-411    56-269 (794)
 65 KOG0306 WD40-repeat-containing  83.5     2.8 6.1E-05   47.2   6.8   89  299-406    99-187 (888)
 66 KOG0318 WD40 repeat stress pro  83.1       8 0.00017   42.2   9.8   88  302-405   185-272 (603)
 67 KOG1912 WD40 repeat protein [G  82.6     2.2 4.8E-05   48.3   5.7   66  320-403    80-148 (1062)
 68 KOG0307 Vesicle coat complex C  82.2     9.8 0.00021   44.5  10.7   76   57-134    47-131 (1049)
 69 KOG0316 Conserved WD40 repeat-  82.0     3.8 8.2E-05   40.9   6.4   77  310-403   186-262 (307)
 70 KOG0282 mRNA splicing factor [  81.8     1.2 2.5E-05   47.7   3.1   85  301-404   208-294 (503)
 71 COG2319 FOG: WD40 repeat [Gene  81.6      18  0.0004   33.2  10.6   86  302-405   150-236 (466)
 72 KOG2110 Uncharacterized conser  80.2      12 0.00026   39.2   9.6   82  308-413   130-214 (391)
 73 KOG0293 WD40 repeat-containing  80.2     6.5 0.00014   41.8   7.8  261   15-405   214-477 (519)
 74 KOG0646 WD40 repeat protein [G  80.1     3.4 7.4E-05   44.1   5.8   71  308-398    82-153 (476)
 75 KOG0286 G-protein beta subunit  79.7     9.2  0.0002   39.1   8.4   52  299-350   178-230 (343)
 76 KOG2321 WD40 repeat protein [G  79.5      12 0.00027   41.3   9.8  112   15-129   154-281 (703)
 77 KOG3914 WD repeat protein WDR4  79.3     5.9 0.00013   41.6   7.1   76  310-405   154-230 (390)
 78 KOG0286 G-protein beta subunit  79.1      13 0.00027   38.2   9.1   76  309-404    99-180 (343)
 79 KOG1408 WD40 repeat protein [F  79.0     2.7 5.8E-05   47.3   4.8   83  301-399   453-535 (1080)
 80 KOG0285 Pleiotropic regulator   78.8      11 0.00023   39.7   8.7   84  310-412   238-323 (460)
 81 KOG1538 Uncharacterized conser  78.7     5.9 0.00013   44.5   7.2   40  309-350    14-54  (1081)
 82 KOG0280 Uncharacterized conser  78.5      11 0.00024   38.5   8.6   77  309-403   123-202 (339)
 83 KOG1240 Protein kinase contain  78.3     8.6 0.00019   45.6   8.7   81  310-405  1198-1280(1431)
 84 KOG1188 WD40 repeat protein [G  78.1     3.7 7.9E-05   42.6   5.1   66  320-404    41-108 (376)
 85 KOG0289 mRNA splicing factor [  78.0      13 0.00029   39.7   9.3   79  308-404   304-383 (506)
 86 KOG1446 Histone H3 (Lys4) meth  78.0     4.2   9E-05   41.5   5.4   39  311-349   236-274 (311)
 87 KOG0264 Nucleosome remodeling   77.9      10 0.00022   40.3   8.4   46  306-351   271-318 (422)
 88 KOG0281 Beta-TrCP (transducin   77.7     1.9   4E-05   45.0   2.9   79  301-402   354-432 (499)
 89 KOG0306 WD40-repeat-containing  76.7      12 0.00026   42.5   8.9  115   33-156    24-147 (888)
 90 PF03178 CPSF_A:  CPSF A subuni  76.2      23  0.0005   35.0  10.2   52   75-127    22-96  (321)
 91 smart00320 WD40 WD40 repeats.   75.9     5.3 0.00011   23.7   3.7   33  303-335     8-40  (40)
 92 KOG1063 RNA polymerase II elon  75.2      19  0.0004   40.6   9.8   56   76-131   664-735 (764)
 93 PF14779 BBS1:  Ciliary BBSome   75.0     6.3 0.00014   39.3   5.7   53   77-129   177-234 (257)
 94 KOG0310 Conserved WD40 repeat-  74.4     6.3 0.00014   42.3   5.9   78  307-402    68-145 (487)
 95 PRK01742 tolB translocation pr  74.0      10 0.00022   39.4   7.3   44  311-355   336-379 (429)
 96 PF00780 CNH:  CNH domain;  Int  74.0      25 0.00054   33.5   9.5   68   38-111   102-169 (275)
 97 KOG0273 Beta-transducin family  74.0      17 0.00038   39.2   8.9   55   75-131   234-288 (524)
 98 KOG0265 U5 snRNP-specific prot  73.8      22 0.00049   36.5   9.3   86  299-403    39-125 (338)
 99 KOG1539 WD repeat protein [Gen  73.7      15 0.00033   42.0   8.8   58  303-362   572-629 (910)
100 KOG0275 Conserved WD40 repeat-  73.3     4.9 0.00011   41.6   4.6   87  306-403   212-298 (508)
101 KOG0296 Angio-associated migra  72.6     9.6 0.00021   39.9   6.5   53   77-129   284-337 (399)
102 KOG2048 WD40 repeat protein [G  72.6      23 0.00049   39.7   9.7   80  307-404   154-239 (691)
103 KOG0274 Cdc4 and related F-box  72.5      12 0.00027   40.8   7.7   78  306-405   330-408 (537)
104 KOG1963 WD40 repeat protein [G  71.4      12 0.00025   42.7   7.3   78  310-402   208-285 (792)
105 COG2319 FOG: WD40 repeat [Gene  70.7      36 0.00077   31.3   9.3   80  303-402   194-275 (466)
106 KOG4328 WD40 protein [Function  69.9 1.7E+02  0.0037   31.8  17.2   80   76-164   186-274 (498)
107 KOG0267 Microtubule severing p  69.7     6.1 0.00013   44.5   4.6   83  302-403   149-231 (825)
108 KOG0294 WD40 repeat-containing  69.4      34 0.00073   35.5   9.4   50   82-131   211-261 (362)
109 KOG1517 Guanine nucleotide bin  67.9      34 0.00073   40.6  10.0   57   75-132  1207-1269(1387)
110 KOG1274 WD40 repeat protein [G  67.8       6 0.00013   45.4   4.2   46  303-348   134-179 (933)
111 PF12894 Apc4_WD40:  Anaphase-p  66.2      15 0.00032   27.3   4.7   30  310-340    14-43  (47)
112 KOG2695 WD40 repeat protein [G  66.2      12 0.00026   39.2   5.6   74  315-405   307-383 (425)
113 TIGR02658 TTQ_MADH_Hv methylam  64.5      13 0.00028   38.6   5.6   39  307-345   298-338 (352)
114 KOG0264 Nucleosome remodeling   64.2      25 0.00054   37.4   7.6   82  299-398   308-404 (422)
115 KOG1034 Transcriptional repres  63.4      13 0.00028   38.7   5.2   69  319-405   105-173 (385)
116 KOG0771 Prolactin regulatory e  63.0      12 0.00026   39.4   5.1   49  308-359   282-330 (398)
117 PF10168 Nup88:  Nuclear pore c  62.9      67  0.0014   36.6  11.2   78   30-107    85-178 (717)
118 KOG0294 WD40 repeat-containing  62.8      37  0.0008   35.2   8.3   84  300-402    76-161 (362)
119 KOG0278 Serine/threonine kinas  62.8      64  0.0014   32.8   9.7   87   34-129   189-277 (334)
120 PRK03629 tolB translocation pr  61.8      32 0.00069   35.9   8.0   48  308-355   243-294 (429)
121 PRK04922 tolB translocation pr  61.2      37  0.0008   35.3   8.3   39  310-348   294-336 (433)
122 TIGR02276 beta_rpt_yvtn 40-res  61.1      22 0.00047   24.1   4.6   29  317-345     1-30  (42)
123 KOG0771 Prolactin regulatory e  60.8      65  0.0014   34.2   9.9  108   19-132   221-337 (398)
124 KOG0308 Conserved WD40 repeat-  60.4      21 0.00044   40.0   6.4   98  298-403   108-206 (735)
125 PRK01742 tolB translocation pr  59.7      41 0.00088   35.0   8.3   39  309-347   249-290 (429)
126 PF00400 WD40:  WD domain, G-be  58.9      18 0.00038   24.0   3.8   29   76-106    11-39  (39)
127 KOG0315 G-protein beta subunit  58.4      19 0.00042   36.3   5.3   54  303-356   254-307 (311)
128 KOG0313 Microtubule binding pr  57.9      26 0.00057   37.0   6.4   76  308-401   301-379 (423)
129 PF06977 SdiA-regulated:  SdiA-  57.7      82  0.0018   31.1   9.6   48  302-350    16-66  (248)
130 KOG0305 Anaphase promoting com  57.4      32  0.0007   37.3   7.2   86  303-405   297-383 (484)
131 KOG1408 WD40 repeat protein [F  57.3      49  0.0011   37.9   8.6   82  311-408   600-681 (1080)
132 PRK01029 tolB translocation pr  57.2      60  0.0013   34.1   9.1   46  303-348   322-370 (428)
133 KOG0295 WD40 repeat-containing  56.9      52  0.0011   34.7   8.3   69  318-405   303-371 (406)
134 KOG0274 Cdc4 and related F-box  56.6      84  0.0018   34.5  10.4   48  303-350   245-292 (537)
135 PF02239 Cytochrom_D1:  Cytochr  56.0      20 0.00044   37.0   5.3   39  308-346    78-117 (369)
136 PF02897 Peptidase_S9_N:  Proly  55.9      27  0.0006   35.5   6.3   49  308-356   124-178 (414)
137 PRK03629 tolB translocation pr  55.8      65  0.0014   33.6   9.1   47  309-355   332-381 (429)
138 KOG0269 WD40 repeat-containing  55.7      12 0.00026   42.4   3.8   67  319-404   100-170 (839)
139 COG4946 Uncharacterized protei  54.9      17 0.00037   39.5   4.6   38  309-346   403-440 (668)
140 KOG2048 WD40 repeat protein [G  53.7      16 0.00036   40.7   4.3   32  310-341   478-509 (691)
141 PRK04922 tolB translocation pr  53.4      69  0.0015   33.3   8.8   46  310-355   338-386 (433)
142 KOG0641 WD40 repeat protein [G  53.2      22 0.00048   35.5   4.7   47  302-348   223-272 (350)
143 KOG0292 Vesicle coat complex C  53.0      41  0.0009   39.2   7.4   86  301-405    45-130 (1202)
144 PRK11028 6-phosphogluconolacto  51.9      84  0.0018   30.8   8.7   30  310-339   128-158 (330)
145 KOG3881 Uncharacterized conser  51.8      29 0.00063   36.7   5.6   49   77-133   203-251 (412)
146 KOG0288 WD40 repeat protein Ti  51.6      15 0.00031   39.2   3.4   85  302-405   214-298 (459)
147 KOG0272 U4/U6 small nuclear ri  51.1      52  0.0011   35.2   7.3   84  303-404   341-424 (459)
148 PRK11028 6-phosphogluconolacto  49.6 1.1E+02  0.0024   30.0   9.1   28  311-338    83-111 (330)
149 KOG2096 WD40 repeat protein [G  49.3      29 0.00063   36.2   5.0   67   32-106   334-400 (420)
150 KOG2919 Guanine nucleotide-bin  49.2      49  0.0011   34.6   6.6   97  298-410   241-340 (406)
151 PF10313 DUF2415:  Uncharacteri  49.0      51  0.0011   24.3   5.0   29  311-339     4-35  (43)
152 PF14781 BBS2_N:  Ciliary BBSom  48.6      71  0.0015   29.1   6.9   73   76-159    47-124 (136)
153 KOG0640 mRNA cleavage stimulat  48.6      29 0.00062   36.1   4.8  103  299-406   163-299 (430)
154 PRK02889 tolB translocation pr  48.0      72  0.0016   33.2   7.9   40  305-344   193-235 (427)
155 PF01403 Sema:  Sema domain;  I  47.2      81  0.0018   32.9   8.2   52   78-129   365-431 (433)
156 PRK04792 tolB translocation pr  47.1 1.2E+02  0.0026   31.9   9.5   47  309-355   307-357 (448)
157 PF04841 Vps16_N:  Vps16, N-ter  47.0 2.3E+02   0.005   29.7  11.5   72   29-111    29-112 (410)
158 TIGR02800 propeller_TolB tol-p  46.6   1E+02  0.0022   31.1   8.6   43  302-344   184-229 (417)
159 KOG1407 WD40 repeat protein [F  46.4      39 0.00084   34.3   5.3   83  303-405    16-102 (313)
160 PF04841 Vps16_N:  Vps16, N-ter  46.3      65  0.0014   33.7   7.3   57   88-154    48-105 (410)
161 KOG1538 Uncharacterized conser  45.3 2.2E+02  0.0047   32.7  11.2   49  310-360   459-507 (1081)
162 KOG0301 Phospholipase A2-activ  43.8      84  0.0018   35.6   7.8   77  301-398   173-249 (745)
163 KOG0647 mRNA export protein (c  43.2 1.5E+02  0.0033   30.8   9.0   47  310-358    75-124 (347)
164 PF13464 DUF4115:  Domain of un  42.7      74  0.0016   25.2   5.6   55   96-160     5-60  (77)
165 KOG0284 Polyadenylation factor  41.7      37 0.00079   36.3   4.5   92  302-412   133-226 (464)
166 KOG0289 mRNA splicing factor [  41.7 1.1E+02  0.0024   33.1   8.0   78  309-405   349-426 (506)
167 PRK00178 tolB translocation pr  41.5 1.7E+02  0.0037   30.0   9.4   38  310-347   245-285 (430)
168 KOG0310 Conserved WD40 repeat-  41.4 1.6E+02  0.0035   32.0   9.3   86  300-403   103-190 (487)
169 PF04762 IKI3:  IKI3 family;  I  41.4 3.5E+02  0.0076   31.8  12.8  165    4-212   240-434 (928)
170 KOG4190 Uncharacterized conser  41.3      18 0.00039   40.0   2.3   51  309-362   878-928 (1034)
171 KOG1009 Chromatin assembly com  41.0   1E+02  0.0022   33.0   7.6   99  301-405    59-160 (434)
172 PRK02889 tolB translocation pr  40.6      54  0.0012   34.1   5.7   40  311-350   331-373 (427)
173 KOG2445 Nuclear pore complex c  40.0 1.2E+02  0.0026   31.5   7.8   34   76-110   223-258 (361)
174 KOG2394 WD40 protein DMR-N9 [G  39.9      74  0.0016   35.2   6.5   78  309-405   292-369 (636)
175 PF00780 CNH:  CNH domain;  Int  39.7      74  0.0016   30.3   6.1   61   91-161     8-68  (275)
176 KOG1645 RING-finger-containing  39.6      64  0.0014   34.5   5.9   80  302-401   188-269 (463)
177 PF11768 DUF3312:  Protein of u  39.3   1E+02  0.0022   34.1   7.5   73  308-401   260-332 (545)
178 KOG2110 Uncharacterized conser  39.2      66  0.0014   33.9   5.8   60  299-358   165-226 (391)
179 KOG0639 Transducin-like enhanc  38.9      32 0.00069   37.8   3.6   49   79-129   595-643 (705)
180 KOG2114 Vacuolar assembly/sort  38.5      69  0.0015   37.1   6.3   55   77-132   125-184 (933)
181 PF06200 tify:  tify domain;  I  38.2      57  0.0012   23.2   3.7   24  138-161     3-26  (36)
182 KOG0293 WD40 repeat-containing  37.8 1.2E+02  0.0026   32.8   7.5   85  302-405   219-306 (519)
183 PRK04792 tolB translocation pr  37.4 1.2E+02  0.0026   31.9   7.7   37  311-347   265-304 (448)
184 KOG0650 WD40 repeat nucleolar   37.2      61  0.0013   36.3   5.5   43  303-345   396-438 (733)
185 KOG1036 Mitotic spindle checkp  35.7 1.3E+02  0.0027   31.2   7.1   41  310-350   235-275 (323)
186 PRK05137 tolB translocation pr  35.5 1.9E+02   0.004   30.1   8.7   81  303-402   197-282 (435)
187 TIGR03300 assembly_YfgL outer   34.5 1.5E+02  0.0033   29.6   7.6   64  319-403   105-168 (377)
188 PF07433 DUF1513:  Protein of u  34.4      72  0.0016   32.7   5.2   40  306-345   215-255 (305)
189 KOG0321 WD40 repeat-containing  34.3      81  0.0018   35.5   5.9   85  314-412    58-148 (720)
190 KOG3881 Uncharacterized conser  34.2 1.2E+02  0.0027   32.1   7.0   76  307-401   247-323 (412)
191 KOG0283 WD40 repeat-containing  33.6 1.3E+02  0.0028   34.3   7.4   77  308-401   502-579 (712)
192 KOG4497 Uncharacterized conser  33.3      33 0.00071   36.0   2.6   32  313-345    14-45  (447)
193 COG3041 Uncharacterized protei  32.5      15 0.00032   31.3  -0.0   10  344-353    53-62  (91)
194 KOG4283 Transcription-coupled   32.0 1.6E+02  0.0034   30.7   7.1   64  319-400   158-221 (397)
195 KOG0300 WD40 repeat-containing  31.9 1.5E+02  0.0034   31.0   7.1   85  297-401   304-389 (481)
196 PLN02919 haloacid dehalogenase  31.9 2.1E+02  0.0046   34.0   9.2   82  311-402   807-892 (1057)
197 KOG0305 Anaphase promoting com  31.8 1.1E+02  0.0024   33.4   6.4   78  307-404   217-294 (484)
198 KOG0278 Serine/threonine kinas  31.6 2.1E+02  0.0045   29.3   7.7   80  304-402   221-301 (334)
199 KOG1273 WD40 repeat protein [G  31.4 4.4E+02  0.0096   27.8  10.2   41  303-343    61-101 (405)
200 KOG2111 Uncharacterized conser  30.5 3.7E+02   0.008   28.1   9.4   79  303-399   177-257 (346)
201 KOG1188 WD40 repeat protein [G  30.1      50  0.0011   34.5   3.3   39   91-129    41-80  (376)
202 KOG0267 Microtubule severing p  29.5      76  0.0016   36.2   4.7   58  302-359    65-124 (825)
203 PF06433 Me-amine-dh_H:  Methyl  29.2 2.9E+02  0.0062   28.9   8.6   57  330-404   270-326 (342)
204 KOG1334 WD40 repeat protein [G  28.8      47   0.001   36.2   2.9   69  316-403   403-471 (559)
205 KOG0277 Peroxisomal targeting   28.8 2.2E+02  0.0047   29.1   7.3   79  303-397     4-90  (311)
206 PF10282 Lactonase:  Lactonase,  28.7 2.7E+02  0.0059   28.0   8.3   80   33-115   248-332 (345)
207 TIGR02800 propeller_TolB tol-p  28.4 1.2E+02  0.0025   30.6   5.7   34  308-341   322-358 (417)
208 KOG0322 G-protein beta subunit  28.3      45 0.00098   33.9   2.6   40  297-336   283-322 (323)
209 PF13360 PQQ_2:  PQQ-like domai  27.9      81  0.0017   28.8   4.1   28  315-342   208-235 (238)
210 PF14783 BBS2_Mid:  Ciliary BBS  27.7   2E+02  0.0042   25.4   6.1   46   79-126     2-49  (111)
211 PRK00178 tolB translocation pr  27.3 3.1E+02  0.0066   28.1   8.6   41  311-351   334-377 (430)
212 KOG2096 WD40 repeat protein [G  26.8   1E+02  0.0023   32.3   4.9   42  297-338    74-117 (420)
213 PF12234 Rav1p_C:  RAVE protein  26.7 6.9E+02   0.015   28.3  11.5   93   34-132    34-128 (631)
214 PF01011 PQQ:  PQQ enzyme repea  26.6      94   0.002   21.3   3.3   18  325-342     6-23  (38)
215 PF13360 PQQ_2:  PQQ-like domai  26.4 3.5E+02  0.0075   24.6   8.0   85   55-155     6-92  (238)
216 KOG2114 Vacuolar assembly/sort  26.3 1.1E+02  0.0024   35.6   5.4   46  313-359    29-74  (933)
217 KOG0302 Ribosome Assembly prot  26.1 2.1E+02  0.0045   30.6   6.9   74  306-400   256-334 (440)
218 PF01436 NHL:  NHL repeat;  Int  26.1 1.2E+02  0.0025   19.7   3.4   20  323-342     7-26  (28)
219 COG3391 Uncharacterized conser  26.0 1.4E+02   0.003   30.8   5.7   45  311-355   163-214 (381)
220 PLN02919 haloacid dehalogenase  25.9 2.8E+02   0.006   33.0   8.8   36  310-345   742-778 (1057)
221 KOG1064 RAVE (regulator of V-A  25.8      61  0.0013   40.7   3.4   67  308-403  2337-2403(2439)
222 PF11396 DUF2874:  Protein of u  24.9 2.9E+02  0.0063   20.4   6.2   38   75-112    21-61  (61)
223 PF02393 US22:  US22 like;  Int  24.7 1.2E+02  0.0025   25.8   4.2   28  317-344    79-106 (125)
224 KOG2139 WD40 repeat protein [G  24.4 1.4E+02  0.0031   31.7   5.4   72   33-107   349-430 (445)
225 KOG4460 Nuclear pore complex,   24.4   3E+02  0.0065   30.9   7.9   40  314-354   470-509 (741)
226 TIGR03300 assembly_YfgL outer   23.8 2.7E+02  0.0059   27.8   7.3   49   63-118    85-134 (377)
227 PF10411 DsbC_N:  Disulfide bon  23.3 1.6E+02  0.0035   22.3   4.3   37   76-114    11-47  (57)
228 KOG0299 U3 snoRNP-associated p  23.1 3.1E+02  0.0067   29.9   7.6   88  301-405   136-239 (479)
229 PF14408 Actino_peptide:  Ribos  22.7      97  0.0021   24.4   2.9   22  312-333     5-26  (59)
230 KOG0973 Histone transcription   22.5 1.1E+02  0.0023   36.0   4.5   48  303-350   125-172 (942)
231 KOG0321 WD40 repeat-containing  22.3 2.3E+02  0.0049   32.1   6.6   98  303-403   140-253 (720)
232 smart00564 PQQ beta-propeller   21.9 1.5E+02  0.0033   18.9   3.4   21  322-342     9-29  (33)
233 KOG1523 Actin-related protein   21.9 2.9E+02  0.0063   29.0   6.9   84   24-117   204-289 (361)
234 KOG1912 WD40 repeat protein [G  21.7      71  0.0015   36.9   2.8   40   91-130   438-478 (1062)
235 KOG0270 WD40 repeat-containing  21.5 2.9E+02  0.0063   29.9   7.0   54   79-133   332-387 (463)
236 PRK04043 tolB translocation pr  21.4 1.6E+02  0.0035   30.9   5.3   43  313-355   324-375 (419)
237 KOG4378 Nuclear protein COP1 [  21.2 4.3E+02  0.0092   29.4   8.3   81  302-403   116-200 (673)
238 PF11768 DUF3312:  Protein of u  21.2   3E+02  0.0065   30.6   7.3   33   88-120   309-341 (545)
239 KOG1272 WD40-repeat-containing  20.9   1E+02  0.0022   33.6   3.6   36  303-338   289-324 (545)
240 PF07893 DUF1668:  Protein of u  20.9 1.7E+02  0.0037   29.9   5.2   38  305-344    64-101 (342)
241 KOG1517 Guanine nucleotide bin  20.8   1E+02  0.0022   36.9   3.8   69   34-107  1309-1380(1387)
242 KOG0647 mRNA export protein (c  20.5 1.4E+02  0.0031   30.9   4.4   52   74-126    25-79  (347)
243 KOG0772 Uncharacterized conser  20.3 1.2E+02  0.0025   33.6   3.9   77  310-402   320-398 (641)
244 KOG0322 G-protein beta subunit  20.3 1.8E+02  0.0038   29.9   4.9   69  310-397   254-322 (323)

No 1  
>PF14655 RAB3GAP2_N:  Rab3 GTPase-activating protein regulatory subunit N-terminus
Probab=100.00  E-value=2.1e-100  Score=782.62  Aligned_cols=365  Identities=31%  Similarity=0.452  Sum_probs=310.8

Q ss_pred             cccCCCeeeeccCcceeeeeecceEEE--EeecCCC------CCceeecccCCCCCCCeEEEEEeEEe---------CCc
Q 015021           28 WLVNDPNLLCALDMHTIALANRYQTVI--INWADPE------GLVAKIRPELSPIASEYITAIEWLVF---------EEM   90 (414)
Q Consensus        28 WL~~~~~~~~sp~~~~la~A~~~~~v~--~~w~~~~------~~~v~~~g~l~~~~~e~ITs~~~lpl---------~dw   90 (414)
                      ||+| |++++||+||+||+|+++|+||  .+|+.++      .+.+.|+|+|+.+++|+|||++||||         +||
T Consensus         1 WL~~-~~isls~~~d~laiA~~~r~vil~~~w~~~~~~~~~~~~~~~~~g~l~~~~~e~ITsi~clpl~s~~~s~~~~dw   79 (415)
T PF14655_consen    1 WLQD-CSISLSPDGDLLAIARGQRLVILTSKWDSSRKGENENTYSISWSGPLDDEPGECITSILCLPLSSQKRSTGGPDW   79 (415)
T ss_pred             Cccc-ceEEecCCCCEEEEEcCCEEEEEEeeccccccCCCCCeEEEEeeeeccCCCCCEEEEEEEEEeecccccCCCCCc
Confidence            9999 9999999999999999999999  5895533      24489999999977799999999999         899


Q ss_pred             EEEEEeccccEEEEEecCCcEEeecccCccceeEEEEecccCCCCCCCCCCeEEEEeCCeEEEEeChhHHHHHHHHHHhc
Q 015021           91 RALAVGTSRGYFLVYDLKGDLVHRQLIHPGRILKLRVRGSRRDLTQDTAEEEVCVVMPGVLARFDGSEIQKMLQRWFQDS  170 (414)
Q Consensus        91 ~~I~VG~ssG~vrfyte~G~LL~sQ~lh~~pV~~ik~r~~~~~~~~~~~~eel~Ily~~~iv~IdG~sL~~~Lr~c~~~~  170 (414)
                      +||||||++||||||||+|+|||+|+||++||++||||+++++..++...|||+|+||++||+|||++|+++|++|++|+
T Consensus        80 ~~I~VG~ssG~vrfyte~G~LL~~Q~~h~~pV~~ik~~~~~~~~~~~~~~eel~ily~~~v~~Idg~sL~~~L~~~~~~~  159 (415)
T PF14655_consen   80 TCIAVGTSSGYVRFYTENGVLLLSQLLHEEPVLKIKCRSTKIPRHPGDSSEELSILYPSAVVIIDGFSLFSVLRACRNQV  159 (415)
T ss_pred             EEEEEEecccEEEEEeccchHHHHHhcCccceEEEEecccCCCCCCcccccEEEEEECCEEEEEecHHHHHHHHHHHHhh
Confidence            99999999999999999999999999999999999999999866666666999999999999999999999999999999


Q ss_pred             cccccCCCCccCCCcccccccCCccceecccCCCCCceeeEEeCCCCCCchhhh--------------ccccceeEEEeC
Q 015021          171 NSNFWDQKPKQRDSEDLENSYERLPHQLWNVSKYGPCADAAITGLMPPPLMEVQ--------------SSQRYFCAVTIG  236 (414)
Q Consensus       171 ~~~~w~~~~~~~~~~~~~~~~~~L~ykKW~l~~~~~i~Daa~~G~~~p~~~d~~--------------s~~~~~~~i~vG  236 (414)
                      +++.|+.+       + .+.+++|+||||+|++++.|+|++++|+++|++||++              +...++++|++|
T Consensus       160 ~~~~~~~~-------~-~~~~~~L~ykKw~l~~~~~i~D~~~~G~~~~~~fd~l~~aS~~gf~a~~~~s~~~~~~~i~~G  231 (415)
T PF14655_consen  160 ARGAASGS-------D-SPAPPPLSYKKWNLQSQDTINDAAICGPMPPSTFDHLVTASIGGFNAKYRSSPPRMSRYITVG  231 (415)
T ss_pred             hhhhhccc-------c-cCCCCccceeEecCCCCCcEeeEEEecCCCCcHHHHHHhhhcccccceeecCCcceEEEEEec
Confidence            99865321       1 3347889999999999999999999999999999984              335778999999


Q ss_pred             CCceeEEEEeccCCCcchhhhhhhhhhH-HHHHHhhhhhhhhhcC--CCCCC-CC-----CCCCcccccCCCCccccCCC
Q 015021          237 EDSVISAFRLSEDRSRSLVGAILSKVVP-ATFSTISSLSKMIWRS--EQSPK-KS-----EPKPQSFARASPLTCLKDHP  307 (414)
Q Consensus       237 ~~P~la~y~~~~~~~~s~~~al~S~va~-av~S~v~S~ak~~W~~--~~~~~-~~-----e~kp~~~~~a~pl~~l~D~~  307 (414)
                      ++||+|||+.+|+.+++    ++++|+. +|.|+++++++.+|++  .++++ +.     ++++++.++..+.++|+|.+
T Consensus       232 ~~P~v~f~~~~e~~s~~----~ls~va~~aVas~l~sav~~~lg~~~~~~~~~~~~~~~~~~~~~~~~~l~~r~~l~D~~  307 (415)
T PF14655_consen  232 SSPFVSFYYASEGSSQP----LLSDVASSAVASKLTSAVSGWLGWGSWRSEQQPQEKQPPEPKPEPAAPLPMRFGLPDSK  307 (415)
T ss_pred             CCceEEEEEccCCCCcc----cHHHHHHHHHHHHHHhhhHhhcccCCCCCccccccccccccCcCCCcccceEEeeccCC
Confidence            99999999888877765    5667777 8888888876634432  22221 11     22333333333336799999


Q ss_pred             ceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEcC
Q 015021          308 RKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAP  387 (414)
Q Consensus       308 R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyap  387 (414)
                      |++++|++||+|+|||+||++|||+|+|+++++++|||||||||||+|+++.+..+...+..+....+.+++||||||||
T Consensus       308 R~~~~i~~sP~~~laA~tDslGRV~LiD~~~~~vvrmWKGYRdAqc~wi~~~~~~~~~~~~~~~~~~~~~~~l~LvIyap  387 (415)
T PF14655_consen  308 REGESICLSPSGRLAAVTDSLGRVLLIDVARGIVVRMWKGYRDAQCGWIEVPEEGDRDRSNSNSPKSSSRFALFLVIYAP  387 (415)
T ss_pred             ceEEEEEECCCCCEEEEEcCCCcEEEEECCCChhhhhhccCccceEEEEEeecccccccccccccCCCCcceEEEEEEec
Confidence            99999999999999999999999999999999999999999999999999998876432333444445678999999999


Q ss_pred             CCCeEEEeeCCCCcceee
Q 015021          388 RKGIIEHLAGMADENWTT  405 (414)
Q Consensus       388 rRg~lEvW~~~~g~~v~~  405 (414)
                      |||+||||+||||+||++
T Consensus       388 rRg~lEvW~~~~g~Rv~a  405 (415)
T PF14655_consen  388 RRGILEVWSMRQGPRVAA  405 (415)
T ss_pred             cCCeEEEEecCCCCEEEE
Confidence            999999999999999954


No 2  
>KOG2727 consensus Rab3 GTPase-activating protein, non-catalytic subunit [Intracellular trafficking, secretion, and vesicular transport]
Probab=100.00  E-value=1.1e-90  Score=739.94  Aligned_cols=381  Identities=44%  Similarity=0.626  Sum_probs=331.8

Q ss_pred             CCCCcccceeeeeeecccccccCCCCCcccCCCeeeeccCcceeeeeecceEEE--EeecCCCCC---ceeecccCCCCC
Q 015021            1 MSKRTHTTEVGSIACTDLSDLGAGKEGWLVNDPNLLCALDMHTIALANRYQTVI--INWADPEGL---VAKIRPELSPIA   75 (414)
Q Consensus         1 ~~~~~~~~~~~~~~~~~~~~~g~~~~~WL~~~~~~~~sp~~~~la~A~~~~~v~--~~w~~~~~~---~v~~~g~l~~~~   75 (414)
                      |.+|.|.+|.||+.|+|+.+.|+++  ||++ |+++++|++|++++|+++||||  .+|.+.+++   .|.|+|+|++++
T Consensus         1 ~w~~w~~~e~g~~e~ee~t~lg~~~--WL~~-cnl~l~s~~d~~~~A~e~rfvfL~~~Wk~pd~p~~~~Vgw~g~l~dpe   77 (1244)
T KOG2727|consen    1 MWKRWHLTELGCIECEELTELGAGK--WLLV-CNLNLLSALDSHSLALENRFVFLIVNWKDPDAPVYKRVGWRGDLSDPE   77 (1244)
T ss_pred             CcccccccccCchhhhhhhcccccc--hHHh-cCcccCcchHHHHHHhhcceEEEEecCCCCCCCceEEEEeccccCCcc
Confidence            7789999999999999999999988  9999 9999999999999999999999  679999866   499999999988


Q ss_pred             CCeEEEEEeEEe--------CCcEEEEEeccccEEEEEecCCcEEeecccCccceeEEEEecccCCCCCCCCCCeEEEEe
Q 015021           76 SEYITAIEWLVF--------EEMRALAVGTSRGYFLVYDLKGDLVHRQLIHPGRILKLRVRGSRRDLTQDTAEEEVCVVM  147 (414)
Q Consensus        76 ~e~ITs~~~lpl--------~dw~~I~VG~ssG~vrfyte~G~LL~sQ~lh~~pV~~ik~r~~~~~~~~~~~~eel~Ily  147 (414)
                      +|+|||++|+||        +|||||||||++|||+||||+|.|||+|.+|++||++||||+++.       +++++|+|
T Consensus        78 ~e~ITa~~clpl~n~s~dgr~dwtcVavGt~sGyV~FYTe~Gvllf~Q~~~edPVl~lk~r~~k~-------d~~l~i~y  150 (1244)
T KOG2727|consen   78 AESITAIECLPLDNVSHDGRVDWTCVAVGTSSGYVLFYTETGVLLFKQIVHEDPVLKLKVRGTKK-------DLMLEISY  150 (1244)
T ss_pred             cceeeeeeeeeccccccccccceeEEEEecccceEEEEecccHHHHHHHhccCccceEEEEEeec-------CcEEEEee
Confidence            999999999999        799999999999999999999999999999999999999999976       89999999


Q ss_pred             CCeEEEEeChhHHHHHHHHHHhccccccCCCCccCCCcccccccCCccceeccc-CC-CCCceeeEEeCCCCCCchhhh-
Q 015021          148 PGVLARFDGSEIQKMLQRWFQDSNSNFWDQKPKQRDSEDLENSYERLPHQLWNV-SK-YGPCADAAITGLMPPPLMEVQ-  224 (414)
Q Consensus       148 ~~~iv~IdG~sL~~~Lr~c~~~~~~~~w~~~~~~~~~~~~~~~~~~L~ykKW~l-~~-~~~i~Daa~~G~~~p~~~d~~-  224 (414)
                      |.++|+|+|++|++.|++|+++++++.      ..+  +.....++++||||.+ ++ ...|.|+++.+...||++|+. 
T Consensus       151 p~~~~~I~g~sl~~~L~ncq~~Vqkaa------~Ek--nsn~~~~~~~~qk~~l~qdi~~~I~hai~~~~~~ppt~Dq~v  222 (1244)
T KOG2727|consen  151 PEICIVIPGVSLRFDLSNCQSMVQKAA------QEK--NSNFWDQKNRKQKAELTQDIYQRIPHAIWNVNKNPPTVDQTV  222 (1244)
T ss_pred             cceEEEECCchhhhhHHHHHHHHHHHH------Hhc--cCCcCCccchhhhhhcccchhhccchheeecccCCccHHHhh
Confidence            999999999999999999999998873      111  1122457899999999 44 446777777666788989982 


Q ss_pred             --------------ccccceeEEEeCCCceeEEEEeccCCCcchhhhhhhhhhHHHHHHhhhhhhhhhcCCC--CCCCCC
Q 015021          225 --------------SSQRYFCAVTIGEDSVISAFRLSEDRSRSLVGAILSKVVPATFSTISSLSKMIWRSEQ--SPKKSE  288 (414)
Q Consensus       225 --------------s~~~~~~~i~vG~~P~la~y~~~~~~~~s~~~al~S~va~av~S~v~S~ak~~W~~~~--~~~~~e  288 (414)
                                    |-.+|++++++|++||++||+++++.+.+++++++.+|++++.|-+.|....+|+++.  +..+++
T Consensus       223 tas~~~gy~a~~k~SpPrySq~vt~ge~pf~gFf~a~eg~~~~llg~Vak~v~s~A~sn~asg~fgi~~ep~~sp~~kp~  302 (1244)
T KOG2727|consen  223 TASMPPGYLALQKPSPPRYSQAVTIGEDPFIGFFRASEGRGRSLLGAVAKKVVSAAASNIASGSFGIWREPDQSPKRKPE  302 (1244)
T ss_pred             hcccCchhhhhccCCCcceeeeEEecCCceeeeeeeccccccccHHHHHHHhhhhhhhhhhhheeeccCCCCcChhhcCC
Confidence                          2247889999999999999999999999877777777777665544443332444443  446677


Q ss_pred             CCCcccccCCCCccccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccc
Q 015021          289 PKPQSFARASPLTCLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSS  368 (414)
Q Consensus       289 ~kp~~~~~a~pl~~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~  368 (414)
                      |||++.++++.+.+|+|.+|+|++|++||+|+|||+||++|||+|+|+++++|||||||||||||+|+++.+..++.   
T Consensus       303 pk~~saapa~~R~~i~D~~R~ge~lslSP~gtlAAVTD~lgRVlLlDta~~ivvr~wKGYRDAsc~fv~vkek~~~s---  379 (1244)
T KOG2727|consen  303 PKTQSAAPASSRTCIKDYPRKGEKLSLSPSGTLAAVTDSLGRVLLLDTAALIVVRLWKGYRDASCVFVEVKEKKGKS---  379 (1244)
T ss_pred             CCCCcccccceeeccccCccccceeeeCCCccEEEEecccCcEEEEehhhhhHHHHhcccccceeEEEEcccccCCC---
Confidence            78877665666678999999999999999999999999999999999999999999999999999999988765432   


Q ss_pred             cccCCCCCcceEEEEEEcCCCCeEEEeeCCCCcceee
Q 015021          369 AYYAPVKSDYCLCLAIHAPRKGIIEHLAGMADENWTT  405 (414)
Q Consensus       369 ~~~~~~k~~~~l~LvIyaprRg~lEvW~~~~g~~v~~  405 (414)
                         ++.+.++||||||||||||+||||.||+|++|.+
T Consensus       380 ---~~~~sRvAlFLvIyAPRrgiLEVW~~q~gpRV~A  413 (1244)
T KOG2727|consen  380 ---EPVKSRVALFLVIYAPRRGILEVWQMQTGPRVLA  413 (1244)
T ss_pred             ---ccCcCceeEEEEEecccccHHHHHHhccCCeEEE
Confidence               5677889999999999999999999999999976


No 3  
>cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto
Probab=97.85  E-value=0.018  Score=51.12  Aligned_cols=81  Identities=17%  Similarity=0.138  Sum_probs=58.7

Q ss_pred             CCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEE
Q 015021          305 DHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAI  384 (414)
Q Consensus       305 D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvI  384 (414)
                      ........+..+|+++.++++...|.|.++|+.++..++.++++.+    .+....-.+          .  .   .+++
T Consensus       175 ~~~~~i~~~~~~~~~~~l~~~~~~~~i~i~d~~~~~~~~~~~~~~~----~i~~~~~~~----------~--~---~~~~  235 (289)
T cd00200         175 GHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHEN----GVNSVAFSP----------D--G---YLLA  235 (289)
T ss_pred             cCccccceEEECCCcCEEEEecCCCcEEEEECCCCceecchhhcCC----ceEEEEEcC----------C--C---cEEE
Confidence            3344677899999998777777799999999999999999987765    232221111          0  0   2555


Q ss_pred             EcCCCCeEEEeeCCCCccee
Q 015021          385 HAPRKGIIEHLAGMADENWT  404 (414)
Q Consensus       385 yaprRg~lEvW~~~~g~~v~  404 (414)
                      .+..+|.|.+|++++++.+.
T Consensus       236 ~~~~~~~i~i~~~~~~~~~~  255 (289)
T cd00200         236 SGSEDGTIRVWDLRTGECVQ  255 (289)
T ss_pred             EEcCCCcEEEEEcCCceeEE
Confidence            56679999999999877653


No 4  
>KOG0273 consensus Beta-transducin family (WD-40 repeat) protein [Chromatin structure and dynamics]
Probab=97.76  E-value=0.00026  Score=74.25  Aligned_cols=53  Identities=21%  Similarity=0.318  Sum_probs=47.0

Q ss_pred             ccCCCCccccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecc
Q 015021          295 ARASPLTCLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKG  347 (414)
Q Consensus       295 ~~a~pl~~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKG  347 (414)
                      +...|++.|.+++--.-++.-||+|+|.|..+..|+|.++++.++.+++-.+|
T Consensus       440 ~~gv~i~~f~kH~~pVysvafS~~g~ylAsGs~dg~V~iws~~~~~l~~s~~~  492 (524)
T KOG0273|consen  440 ESGVPIHTLMKHQEPVYSVAFSPNGRYLASGSLDGCVHIWSTKTGKLVKSYQG  492 (524)
T ss_pred             cCCceeEeeccCCCceEEEEecCCCcEEEecCCCCeeEeccccchheeEeecC
Confidence            33557778999999999999999999999999999999999999998876655


No 5  
>KOG0271 consensus Notchless-like WD40 repeat-containing protein [Function unknown]
Probab=97.42  E-value=0.0039  Score=64.56  Aligned_cols=79  Identities=15%  Similarity=0.192  Sum_probs=67.6

Q ss_pred             ccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccc--eeeEEEEEecccccccccccCCCCCcceE
Q 015021          303 LKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDA--SCVFMEMLVNKDAATSSAYYAPVKSDYCL  380 (414)
Q Consensus       303 l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdA--q~~wi~~~~~~~~~~~~~~~~~~k~~~~l  380 (414)
                      ..-+......+..|||++|+|.+.-+--|=|+|-.+|+-+-.++|+=+|  ||+|..     |            .    
T Consensus       363 mtgHq~lVn~V~fSPd~r~IASaSFDkSVkLW~g~tGk~lasfRGHv~~VYqvawsa-----D------------s----  421 (480)
T KOG0271|consen  363 MTGHQALVNHVSFSPDGRYIASASFDKSVKLWDGRTGKFLASFRGHVAAVYQVAWSA-----D------------S----  421 (480)
T ss_pred             hhchhhheeeEEECCCccEEEEeecccceeeeeCCCcchhhhhhhccceeEEEEecc-----C------------c----
Confidence            4445667889999999999999999999999999999999999999987  788863     1            1    


Q ss_pred             EEEEEcCCCCeEEEeeCCCCcc
Q 015021          381 CLAIHAPRKGIIEHLAGMADEN  402 (414)
Q Consensus       381 ~LvIyaprRg~lEvW~~~~g~~  402 (414)
                      =|++...++..|+||++++-+.
T Consensus       422 RLlVS~SkDsTLKvw~V~tkKl  443 (480)
T KOG0271|consen  422 RLLVSGSKDSTLKVWDVRTKKL  443 (480)
T ss_pred             cEEEEcCCCceEEEEEeeeeee
Confidence            1888999999999999998654


No 6  
>KOG0263 consensus Transcription initiation factor TFIID, subunit TAF5 (also component of histone acetyltransferase SAGA) [Transcription]
Probab=96.89  E-value=0.016  Score=64.01  Aligned_cols=83  Identities=17%  Similarity=0.146  Sum_probs=65.1

Q ss_pred             CCCccccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCc
Q 015021          298 SPLTCLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSD  377 (414)
Q Consensus       298 ~pl~~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~  377 (414)
                      ...+.|.-+++.+++++.||+|++.|..|..|+|++||+.+|..+...||+++.- .=+...-  + +            
T Consensus       568 ~~VRiF~GH~~~V~al~~Sp~Gr~LaSg~ed~~I~iWDl~~~~~v~~l~~Ht~ti-~SlsFS~--d-g------------  631 (707)
T KOG0263|consen  568 NSVRIFTGHKGPVTALAFSPCGRYLASGDEDGLIKIWDLANGSLVKQLKGHTGTI-YSLSFSR--D-G------------  631 (707)
T ss_pred             cEEEEecCCCCceEEEEEcCCCceEeecccCCcEEEEEcCCCcchhhhhcccCce-eEEEEec--C-C------------
Confidence            3345688899999999999999999999999999999999999999999996632 2222211  1 0            


Q ss_pred             ceEEEEEEcCCCCeEEEeeCCC
Q 015021          378 YCLCLAIHAPRKGIIEHLAGMA  399 (414)
Q Consensus       378 ~~l~LvIyaprRg~lEvW~~~~  399 (414)
                         -..+-+.-+..|.+||+..
T Consensus       632 ---~vLasgg~DnsV~lWD~~~  650 (707)
T KOG0263|consen  632 ---NVLASGGADNSVRLWDLTK  650 (707)
T ss_pred             ---CEEEecCCCCeEEEEEchh
Confidence               1445677789999999865


No 7  
>KOG0266 consensus WD40 repeat-containing protein [General function prediction only]
Probab=96.66  E-value=0.011  Score=62.19  Aligned_cols=88  Identities=16%  Similarity=0.156  Sum_probs=73.6

Q ss_pred             CCccccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEc-CcceeEeeecccccceeeEEEEEecccccccccccCCCCCc
Q 015021          299 PLTCLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDT-QALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSD  377 (414)
Q Consensus       299 pl~~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~-~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~  377 (414)
                      .+..+.++.+.+..++.+|++++++.+..++.|-++|+ ..+..+|..|||.+    ++....=.+++            
T Consensus       195 ~~~~l~~h~~~v~~~~fs~d~~~l~s~s~D~tiriwd~~~~~~~~~~l~gH~~----~v~~~~f~p~g------------  258 (456)
T KOG0266|consen  195 LLRELSGHTRGVSDVAFSPDGSYLLSGSDDKTLRIWDLKDDGRNLKTLKGHST----YVTSVAFSPDG------------  258 (456)
T ss_pred             hhccccccccceeeeEECCCCcEEEEecCCceEEEeeccCCCeEEEEecCCCC----ceEEEEecCCC------------
Confidence            34557889999999999999999999999999999999 77799999999999    66444322211            


Q ss_pred             ceEEEEEEcCCCCeEEEeeCCCCcceee
Q 015021          378 YCLCLAIHAPRKGIIEHLAGMADENWTT  405 (414)
Q Consensus       378 ~~l~LvIyaprRg~lEvW~~~~g~~v~~  405 (414)
                         -|++.+-.++.+.||++++|+++-+
T Consensus       259 ---~~i~Sgs~D~tvriWd~~~~~~~~~  283 (456)
T KOG0266|consen  259 ---NLLVSGSDDGTVRIWDVRTGECVRK  283 (456)
T ss_pred             ---CEEEEecCCCcEEEEeccCCeEEEe
Confidence               2889999999999999999988743


No 8  
>PTZ00421 coronin; Provisional
Probab=96.62  E-value=0.014  Score=62.30  Aligned_cols=88  Identities=15%  Similarity=0.126  Sum_probs=67.9

Q ss_pred             CCccccCCCceeeEEEECCCC-CEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCc
Q 015021          299 PLTCLKDHPRKGERLTLSPSG-SLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSD  377 (414)
Q Consensus       299 pl~~l~D~~R~~~~i~~sP~~-~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~  377 (414)
                      |+..|..+.+.+..|..+|++ +++|++...|.|.|+|+.++..++.++|+.+    .|....-.+++            
T Consensus       117 ~l~~L~gH~~~V~~l~f~P~~~~iLaSgs~DgtVrIWDl~tg~~~~~l~~h~~----~V~sla~spdG------------  180 (493)
T PTZ00421        117 PIVHLQGHTKKVGIVSFHPSAMNVLASAGADMVVNVWDVERGKAVEVIKCHSD----QITSLEWNLDG------------  180 (493)
T ss_pred             ceEEecCCCCcEEEEEeCcCCCCEEEEEeCCCEEEEEECCCCeEEEEEcCCCC----ceEEEEEECCC------------
Confidence            444577778889999999986 6888888899999999999999999999987    33322111111            


Q ss_pred             ceEEEEEEcCCCCeEEEeeCCCCcceee
Q 015021          378 YCLCLAIHAPRKGIIEHLAGMADENWTT  405 (414)
Q Consensus       378 ~~l~LvIyaprRg~lEvW~~~~g~~v~~  405 (414)
                         -+++.+..+|.|.||++++|+.+.+
T Consensus       181 ---~lLatgs~Dg~IrIwD~rsg~~v~t  205 (493)
T PTZ00421        181 ---SLLCTTSKDKKLNIIDPRDGTIVSS  205 (493)
T ss_pred             ---CEEEEecCCCEEEEEECCCCcEEEE
Confidence               1567788999999999999987743


No 9  
>KOG1539 consensus WD repeat protein [General function prediction only]
Probab=96.61  E-value=0.066  Score=59.99  Aligned_cols=286  Identities=18%  Similarity=0.162  Sum_probs=142.2

Q ss_pred             CeEEEEEeEEeCCcEEEEEeccccEEEEEecCCcEEeecccCc-cceeEEEEecccCCCCCCCCCCeEEEEeCCe-E--E
Q 015021           77 EYITAIEWLVFEEMRALAVGTSRGYFLVYDLKGDLVHRQLIHP-GRILKLRVRGSRRDLTQDTAEEEVCVVMPGV-L--A  152 (414)
Q Consensus        77 e~ITs~~~lpl~dw~~I~VG~ssG~vrfyte~G~LL~sQ~lh~-~pV~~ik~r~~~~~~~~~~~~eel~Ily~~~-i--v  152 (414)
                      .+||+++=-|.-|  |||||+-+|-|.+|...-+.++.-.=|+ ++|.+|.+|+-..        .-+.+--+.+ +  -
T Consensus       203 s~IT~ieqsPaLD--VVaiG~~~G~ViifNlK~dkil~sFk~d~g~VtslSFrtDG~--------p~las~~~~G~m~~w  272 (910)
T KOG1539|consen  203 SRITAIEQSPALD--VVAIGLENGTVIIFNLKFDKILMSFKQDWGRVTSLSFRTDGN--------PLLASGRSNGDMAFW  272 (910)
T ss_pred             cceeEeccCCcce--EEEEeccCceEEEEEcccCcEEEEEEccccceeEEEeccCCC--------eeEEeccCCceEEEE
Confidence            5688887666544  8999999999999996544444445565 9999999998553        2244444422 2  3


Q ss_pred             EEeChhHHHHHHHHH-HhccccccCCCCccCCCcccccccCCccceecccCCCCCc--eeeEEeCCCCCCchhh-hcccc
Q 015021          153 RFDGSEIQKMLQRWF-QDSNSNFWDQKPKQRDSEDLENSYERLPHQLWNVSKYGPC--ADAAITGLMPPPLMEV-QSSQR  228 (414)
Q Consensus       153 ~IdG~sL~~~Lr~c~-~~~~~~~w~~~~~~~~~~~~~~~~~~L~ykKW~l~~~~~i--~Daa~~G~~~p~~~d~-~s~~~  228 (414)
                      -+|+-.|..++++.. ..+...     .-..+.+=+-...+.=+.+.|-++..+-.  -==.=-|...||.+=- +-.++
T Consensus       273 DLe~kkl~~v~~nah~~sv~~~-----~fl~~epVl~ta~~DnSlk~~vfD~~dg~pR~LR~R~GHs~Pp~~irfy~~~g  347 (910)
T KOG1539|consen  273 DLEKKKLINVTRNAHYGSVTGA-----TFLPGEPVLVTAGADNSLKVWVFDSGDGVPRLLRSRGGHSAPPSCIRFYGSQG  347 (910)
T ss_pred             EcCCCeeeeeeeccccCCcccc-----eecCCCceEeeccCCCceeEEEeeCCCCcchheeeccCCCCCchheeeeccCc
Confidence            466777777777532 000000     00000000000112235577777643321  0001123334444322 11233


Q ss_pred             ceeEEEeCCCceeEEEEeccCC-CcchhhhhhhhhhHHH--HHH-------hhhhh----h-hhhcCCCC---C------
Q 015021          229 YFCAVTIGEDSVISAFRLSEDR-SRSLVGAILSKVVPAT--FST-------ISSLS----K-MIWRSEQS---P------  284 (414)
Q Consensus       229 ~~~~i~vG~~P~la~y~~~~~~-~~s~~~al~S~va~av--~S~-------v~S~a----k-~~W~~~~~---~------  284 (414)
                      ++ ++.+|.++.+=.|-..-++ ++++..-...+-++.+  .+.       +..|+    + .-|..-..   .      
T Consensus       348 ~~-ilsa~~Drt~r~fs~~~e~~~~~l~~~~~~~~~kk~~~~~~~~~k~p~i~~fa~~~~RE~~W~Nv~~~h~~~~~~~t  426 (910)
T KOG1539|consen  348 HF-ILSAKQDRTLRSFSVISESQSQELGQLHNKKRAKKVNVFSTEKLKLPPIVEFAFENAREKEWDNVITAHKGKRSAYT  426 (910)
T ss_pred             EE-EEecccCcchhhhhhhHHHHhHhhcccccccccccccccchhhhcCCcceeeecccchhhhhcceeEEecCcceEEE
Confidence            33 6777777755444222111 1111110000111111  000       00010    0 01200000   0      


Q ss_pred             -----CCC-CC--CCcccccCCCCccccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEE
Q 015021          285 -----KKS-EP--KPQSFARASPLTCLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFM  356 (414)
Q Consensus       285 -----~~~-e~--kp~~~~~a~pl~~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi  356 (414)
                           +.. +.  +|.+         +.-..-...++++|++|+++.+.-+-|-|-+++.+.|+...-+= -+-|.=  -
T Consensus       427 W~~~n~~~G~~~L~~~~---------~~~~~~~~~av~vs~CGNF~~IG~S~G~Id~fNmQSGi~r~sf~-~~~ah~--~  494 (910)
T KOG1539|consen  427 WNFRNKTSGRHVLDPKR---------FKKDDINATAVCVSFCGNFVFIGYSKGTIDRFNMQSGIHRKSFG-DSPAHK--G  494 (910)
T ss_pred             EeccCcccccEEecCcc---------ccccCcceEEEEEeccCceEEEeccCCeEEEEEcccCeeecccc-cCcccc--C
Confidence                 000 00  1111         22234678999999999999999999999999999998765441 000000  0


Q ss_pred             EEEecccccccccccCCCCCcceEEEEEEcCCCCeEEEeeCCCCcce
Q 015021          357 EMLVNKDAATSSAYYAPVKSDYCLCLAIHAPRKGIIEHLAGMADENW  403 (414)
Q Consensus       357 ~~~~~~~~~~~~~~~~~~k~~~~l~LvIyaprRg~lEvW~~~~g~~v  403 (414)
                      ++....           .  +..=-++|-+...|+|..|+...+..+
T Consensus       495 ~V~gla-----------~--D~~n~~~vsa~~~Gilkfw~f~~k~l~  528 (910)
T KOG1539|consen  495 EVTGLA-----------V--DGTNRLLVSAGADGILKFWDFKKKVLK  528 (910)
T ss_pred             ceeEEE-----------e--cCCCceEEEccCcceEEEEecCCccee
Confidence            000000           0  111127899999999999999887744


No 10 
>PLN00181 protein SPA1-RELATED; Provisional
Probab=96.55  E-value=1.2  Score=49.90  Aligned_cols=93  Identities=18%  Similarity=0.191  Sum_probs=62.1

Q ss_pred             CeeeeccCcceeeeeecceEEEEeecCCCC--Cceeec-c--cCCCCCCCeEEEEEeEEeCCcEEEEEeccccEEEEEe-
Q 015021           33 PNLLCALDMHTIALANRYQTVIINWADPEG--LVAKIR-P--ELSPIASEYITAIEWLVFEEMRALAVGTSRGYFLVYD-  106 (414)
Q Consensus        33 ~~~~~sp~~~~la~A~~~~~v~~~w~~~~~--~~v~~~-g--~l~~~~~e~ITs~~~lpl~dw~~I~VG~ssG~vrfyt-  106 (414)
                      +.++.+|+|++||.|-...-|- -|+....  ...... +  .+.  ....|+++.|-|.. ...++.|-.+|.|++|+ 
T Consensus       487 ~~i~fs~dg~~latgg~D~~I~-iwd~~~~~~~~~~~~~~~~~~~--~~~~v~~l~~~~~~-~~~las~~~Dg~v~lWd~  562 (793)
T PLN00181        487 CAIGFDRDGEFFATAGVNKKIK-IFECESIIKDGRDIHYPVVELA--SRSKLSGICWNSYI-KSQVASSNFEGVVQVWDV  562 (793)
T ss_pred             EEEEECCCCCEEEEEeCCCEEE-EEECCcccccccccccceEEec--ccCceeeEEeccCC-CCEEEEEeCCCeEEEEEC
Confidence            4577889999988887765443 2433210  000000 0  111  13568888888763 45688888999999999 


Q ss_pred             cCCcEEeecccCccceeEEEEec
Q 015021          107 LKGDLVHRQLIHPGRILKLRVRG  129 (414)
Q Consensus       107 e~G~LL~sQ~lh~~pV~~ik~r~  129 (414)
                      ++|.++..+.-|..+|..+.+..
T Consensus       563 ~~~~~~~~~~~H~~~V~~l~~~p  585 (793)
T PLN00181        563 ARSQLVTEMKEHEKRVWSIDYSS  585 (793)
T ss_pred             CCCeEEEEecCCCCCEEEEEEcC
Confidence            57888888788999999999753


No 11 
>KOG0266 consensus WD40 repeat-containing protein [General function prediction only]
Probab=96.55  E-value=0.014  Score=61.30  Aligned_cols=85  Identities=24%  Similarity=0.217  Sum_probs=68.4

Q ss_pred             CCccccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcc
Q 015021          299 PLTCLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDY  378 (414)
Q Consensus       299 pl~~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~  378 (414)
                      .+..|..+.--..+++.+|.|++.+..+.+|-|-++|+.++..++++||+.+    .|....-..+             .
T Consensus       238 ~~~~l~gH~~~v~~~~f~p~g~~i~Sgs~D~tvriWd~~~~~~~~~l~~hs~----~is~~~f~~d-------------~  300 (456)
T KOG0266|consen  238 NLKTLKGHSTYVTSVAFSPDGNLLVSGSDDGTVRIWDVRTGECVRKLKGHSD----GISGLAFSPD-------------G  300 (456)
T ss_pred             EEEEecCCCCceEEEEecCCCCEEEEecCCCcEEEEeccCCeEEEeeeccCC----ceEEEEECCC-------------C
Confidence            4456888888889999999999999999999999999999999999999999    5544322211             1


Q ss_pred             eEEEEEEcCCCCeEEEeeCCCCcc
Q 015021          379 CLCLAIHAPRKGIIEHLAGMADEN  402 (414)
Q Consensus       379 ~l~LvIyaprRg~lEvW~~~~g~~  402 (414)
                      . +| .-+-.+|.|.|||+.+|++
T Consensus       301 ~-~l-~s~s~d~~i~vwd~~~~~~  322 (456)
T KOG0266|consen  301 N-LL-VSASYDGTIRVWDLETGSK  322 (456)
T ss_pred             C-EE-EEcCCCccEEEEECCCCce
Confidence            1 23 3336699999999999994


No 12 
>KOG0299 consensus U3 snoRNP-associated protein (contains WD40 repeats) [RNA processing and modification]
Probab=96.54  E-value=0.0039  Score=65.42  Aligned_cols=82  Identities=18%  Similarity=0.113  Sum_probs=64.0

Q ss_pred             CCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEE
Q 015021          305 DHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAI  384 (414)
Q Consensus       305 D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvI  384 (414)
                      .+.++..++++||+|+|+|..+-.-.|.++|..++.-+..|||+|++=.+=.- .               +..-   =.+
T Consensus       200 ~h~keil~~avS~Dgkylatgg~d~~v~Iw~~~t~ehv~~~~ghr~~V~~L~f-r---------------~gt~---~ly  260 (479)
T KOG0299|consen  200 GHVKEILTLAVSSDGKYLATGGRDRHVQIWDCDTLEHVKVFKGHRGAVSSLAF-R---------------KGTS---ELY  260 (479)
T ss_pred             cccceeEEEEEcCCCcEEEecCCCceEEEecCcccchhhcccccccceeeeee-e---------------cCcc---cee
Confidence            67789999999999999999999999999999999999999999995433221 1               1000   114


Q ss_pred             EcCCCCeEEEeeCCCCcceee
Q 015021          385 HAPRKGIIEHLAGMADENWTT  405 (414)
Q Consensus       385 yaprRg~lEvW~~~~g~~v~~  405 (414)
                      -+..++.+.||++++-..|.|
T Consensus       261 s~s~Drsvkvw~~~~~s~vet  281 (479)
T KOG0299|consen  261 SASADRSVKVWSIDQLSYVET  281 (479)
T ss_pred             eeecCCceEEEehhHhHHHHH
Confidence            566788899999988776654


No 13 
>KOG0319 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=96.23  E-value=0.41  Score=53.27  Aligned_cols=86  Identities=20%  Similarity=0.189  Sum_probs=74.6

Q ss_pred             CCccccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccce--eeEEEEEecccccccccccCCCCC
Q 015021          299 PLTCLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDAS--CVFMEMLVNKDAATSSAYYAPVKS  376 (414)
Q Consensus       299 pl~~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq--~~wi~~~~~~~~~~~~~~~~~~k~  376 (414)
                      .+..|.-++|..-++..||.-++.|++..++.|=++.+.+...+.-+.|++.|=  +.|+.-               ++ 
T Consensus       497 l~~vLsGH~RGvw~V~Fs~~dq~laT~SgD~TvKIW~is~fSClkT~eGH~~aVlra~F~~~---------------~~-  560 (775)
T KOG0319|consen  497 LLGVLSGHTRGVWCVSFSKNDQLLATCSGDKTVKIWSISTFSCLKTFEGHTSAVLRASFIRN---------------GK-  560 (775)
T ss_pred             EEEEeeCCccceEEEEeccccceeEeccCCceEEEEEeccceeeeeecCccceeEeeeeeeC---------------Cc-
Confidence            345799999999999999999999999999999999999999999999999973  445530               12 


Q ss_pred             cceEEEEEEcCCCCeEEEeeCCCCcceee
Q 015021          377 DYCLCLAIHAPRKGIIEHLAGMADENWTT  405 (414)
Q Consensus       377 ~~~l~LvIyaprRg~lEvW~~~~g~~v~~  405 (414)
                           -.|.+-.+|+|++|++++++|+.|
T Consensus       561 -----qliS~~adGliKlWnikt~eC~~t  584 (775)
T KOG0319|consen  561 -----QLISAGADGLIKLWNIKTNECEMT  584 (775)
T ss_pred             -----EEEeccCCCcEEEEeccchhhhhh
Confidence                 458999999999999999999865


No 14 
>cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto
Probab=96.21  E-value=0.034  Score=49.36  Aligned_cols=82  Identities=17%  Similarity=0.137  Sum_probs=58.7

Q ss_pred             CCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEE
Q 015021          305 DHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAI  384 (414)
Q Consensus       305 D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvI  384 (414)
                      ........+..+|+++++++++..|.|.++|+.++..++.++++.+    .+....-.++         +   .   .++
T Consensus       133 ~~~~~i~~~~~~~~~~~l~~~~~~~~i~i~d~~~~~~~~~~~~~~~----~i~~~~~~~~---------~---~---~l~  193 (289)
T cd00200         133 GHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGKCVATLTGHTG----EVNSVAFSPD---------G---E---KLL  193 (289)
T ss_pred             cCCCcEEEEEEcCcCCEEEEEcCCCcEEEEEccccccceeEecCcc----ccceEEECCC---------c---C---EEE
Confidence            3445678899999999999998899999999999999999987664    2221111110         0   0   233


Q ss_pred             EcCCCCeEEEeeCCCCcceee
Q 015021          385 HAPRKGIIEHLAGMADENWTT  405 (414)
Q Consensus       385 yaprRg~lEvW~~~~g~~v~~  405 (414)
                      .+-.+|.|.+|+++.++.+.+
T Consensus       194 ~~~~~~~i~i~d~~~~~~~~~  214 (289)
T cd00200         194 SSSSDGTIKLWDLSTGKCLGT  214 (289)
T ss_pred             EecCCCcEEEEECCCCceecc
Confidence            444499999999998877643


No 15 
>KOG0649 consensus WD40 repeat protein [General function prediction only]
Probab=95.99  E-value=0.035  Score=55.03  Aligned_cols=82  Identities=16%  Similarity=0.237  Sum_probs=62.9

Q ss_pred             eeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEcCC
Q 015021          309 KGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAPR  388 (414)
Q Consensus       309 ~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyapr  388 (414)
                      ++.++.++|..+-+..+-.+|-++-+|+.+|.+-|.+||+-|    .+......+..               --+....-
T Consensus       116 eINam~ldP~enSi~~AgGD~~~y~~dlE~G~i~r~~rGHtD----YvH~vv~R~~~---------------~qilsG~E  176 (325)
T KOG0649|consen  116 EINAMWLDPSENSILFAGGDGVIYQVDLEDGRIQREYRGHTD----YVHSVVGRNAN---------------GQILSGAE  176 (325)
T ss_pred             ccceeEeccCCCcEEEecCCeEEEEEEecCCEEEEEEcCCcc----eeeeeeecccC---------------cceeecCC
Confidence            567899999976444444889999999999999999999999    77665432211               12355677


Q ss_pred             CCeEEEeeCCCCcceeeccce
Q 015021          389 KGIIEHLAGMADENWTTSLNY  409 (414)
Q Consensus       389 Rg~lEvW~~~~g~~v~~~~~~  409 (414)
                      +|.+.||+++++++|-+---|
T Consensus       177 DGtvRvWd~kt~k~v~~ie~y  197 (325)
T KOG0649|consen  177 DGTVRVWDTKTQKHVSMIEPY  197 (325)
T ss_pred             CccEEEEeccccceeEEeccc
Confidence            999999999999999664433


No 16 
>KOG0263 consensus Transcription initiation factor TFIID, subunit TAF5 (also component of histone acetyltransferase SAGA) [Transcription]
Probab=95.94  E-value=0.021  Score=63.08  Aligned_cols=88  Identities=18%  Similarity=0.212  Sum_probs=69.6

Q ss_pred             CCccccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcc
Q 015021          299 PLTCLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDY  378 (414)
Q Consensus       299 pl~~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~  378 (414)
                      |++.|-.+--...++..-|+..|.|..-++-+|=+||+.+|..||++-||+.    =|.+..-.+.|           + 
T Consensus       527 PlRifaghlsDV~cv~FHPNs~Y~aTGSsD~tVRlWDv~~G~~VRiF~GH~~----~V~al~~Sp~G-----------r-  590 (707)
T KOG0263|consen  527 PLRIFAGHLSDVDCVSFHPNSNYVATGSSDRTVRLWDVSTGNSVRIFTGHKG----PVTALAFSPCG-----------R-  590 (707)
T ss_pred             chhhhcccccccceEEECCcccccccCCCCceEEEEEcCCCcEEEEecCCCC----ceEEEEEcCCC-----------c-
Confidence            5666666667778899999999999999999999999999999999999888    34443322222           1 


Q ss_pred             eEEEEEEcCCCCeEEEeeCCCCcceee
Q 015021          379 CLCLAIHAPRKGIIEHLAGMADENWTT  405 (414)
Q Consensus       379 ~l~LvIyaprRg~lEvW~~~~g~~v~~  405 (414)
                        ||+ -+--.|+|.+||+.+|+.|.+
T Consensus       591 --~La-Sg~ed~~I~iWDl~~~~~v~~  614 (707)
T KOG0263|consen  591 --YLA-SGDEDGLIKIWDLANGSLVKQ  614 (707)
T ss_pred             --eEe-ecccCCcEEEEEcCCCcchhh
Confidence              444 566799999999999988753


No 17 
>KOG0271 consensus Notchless-like WD40 repeat-containing protein [Function unknown]
Probab=95.90  E-value=0.026  Score=58.60  Aligned_cols=96  Identities=19%  Similarity=0.165  Sum_probs=76.1

Q ss_pred             CCCCccccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcc-eeEeeecccccceeeEEEEEecccccccccccCCCC
Q 015021          297 ASPLTCLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQAL-VVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVK  375 (414)
Q Consensus       297 a~pl~~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~-~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k  375 (414)
                      .+|++...-++.-..+|+.||||+++|..--.|-|.|+|-.+| ++.|-.+|++-    ||....=.+-     |..+. 
T Consensus       147 eTp~~t~KgH~~WVlcvawsPDgk~iASG~~dg~I~lwdpktg~~~g~~l~gH~K----~It~Lawep~-----hl~p~-  216 (480)
T KOG0271|consen  147 ETPLFTCKGHKNWVLCVAWSPDGKKIASGSKDGSIRLWDPKTGQQIGRALRGHKK----WITALAWEPL-----HLVPP-  216 (480)
T ss_pred             CCcceeecCCccEEEEEEECCCcchhhccccCCeEEEecCCCCCcccccccCccc----ceeEEeeccc-----ccCCC-
Confidence            3577778888889999999999999999999999999998776 57799999999    9987643221     11111 


Q ss_pred             CcceEEEEEEcCCCCeEEEeeCCCCcceeec
Q 015021          376 SDYCLCLAIHAPRKGIIEHLAGMADENWTTS  406 (414)
Q Consensus       376 ~~~~l~LvIyaprRg~lEvW~~~~g~~v~~~  406 (414)
                        ..  ++-.+.++|.+.||++..|.++-+.
T Consensus       217 --~r--~las~skDg~vrIWd~~~~~~~~~l  243 (480)
T KOG0271|consen  217 --CR--RLASSSKDGSVRIWDTKLGTCVRTL  243 (480)
T ss_pred             --cc--ceecccCCCCEEEEEccCceEEEEe
Confidence              11  5567889999999999999998553


No 18 
>PF12341 DUF3639:  Protein of unknown function (DUF3639) ;  InterPro: IPR022100  This domain family is found in eukaryotes, and is approximately 30 amino acids in length. The family is found in association with PF00400 from PFAM. There are two completely conserved residues (E and R) that may be functionally important. 
Probab=95.79  E-value=0.014  Score=38.86  Aligned_cols=27  Identities=37%  Similarity=0.455  Sum_probs=20.7

Q ss_pred             CCeEEEEEeEEeCCcEEEEEeccccEEEEEe
Q 015021           76 SEYITAIEWLVFEEMRALAVGTSRGYFLVYD  106 (414)
Q Consensus        76 ~e~ITs~~~lpl~dw~~I~VG~ssG~vrfyt  106 (414)
                      +|.|+++.   + ....|+|+||.+|||+||
T Consensus         1 gE~i~aia---~-g~~~vavaTS~~~lRifs   27 (27)
T PF12341_consen    1 GEEIEAIA---A-GDSWVAVATSAGYLRIFS   27 (27)
T ss_pred             CceEEEEE---c-cCCEEEEEeCCCeEEecC
Confidence            46677664   3 334799999999999997


No 19 
>KOG0291 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=95.78  E-value=1.7  Score=48.95  Aligned_cols=112  Identities=14%  Similarity=0.146  Sum_probs=72.4

Q ss_pred             eeeeccCcceeeeeecceEEEEeecCCC----CCc--eeecccCCCCCCCeEEEEEeEEeCCcEEEEEeccccEEEEEec
Q 015021           34 NLLCALDMHTIALANRYQTVIINWADPE----GLV--AKIRPELSPIASEYITAIEWLVFEEMRALAVGTSRGYFLVYDL  107 (414)
Q Consensus        34 ~~~~sp~~~~la~A~~~~~v~~~w~~~~----~~~--v~~~g~l~~~~~e~ITs~~~lpl~dw~~I~VG~ssG~vrfyte  107 (414)
                      .|..||+|..+|++.+..+-|  |..++    ++|  +--+-.+..  -+.||++.|.  .|..++++|-.+=.+|+|..
T Consensus       101 ~i~fSPng~~fav~~gn~lqi--w~~P~~~~~~~~pFvl~r~~~g~--fddi~si~Ws--~DSr~l~~gsrD~s~rl~~v  174 (893)
T KOG0291|consen  101 AIKFSPNGKFFAVGCGNLLQI--WHAPGEIKNEFNPFVLHRTYLGH--FDDITSIDWS--DDSRLLVTGSRDLSARLFGV  174 (893)
T ss_pred             eEEECCCCcEEEEEecceeEE--EecCcchhcccCcceEeeeecCC--ccceeEEEec--cCCceEEeccccceEEEEEe
Confidence            467899999999999997776  22222    233  112223322  3568988886  68899999999999999996


Q ss_pred             CCcEE---eecccCccceeEEEEecccCCCCCCCCCCeEEEEeCCeEEEEeChhHH
Q 015021          108 KGDLV---HRQLIHPGRILKLRVRGSRRDLTQDTAEEEVCVVMPGVLARFDGSEIQ  160 (414)
Q Consensus       108 ~G~LL---~sQ~lh~~pV~~ik~r~~~~~~~~~~~~eel~Ily~~~iv~IdG~sL~  160 (414)
                      ++..=   +--.=|.++|+.--+..         .+.+++.+..++.+..=..++.
T Consensus       175 ~~~k~~~~~~l~gHkd~VvacfF~~---------~~~~l~tvskdG~l~~W~~~~~  221 (893)
T KOG0291|consen  175 DGNKNLFTYALNGHKDYVVACFFGA---------NSLDLYTVSKDGALFVWTCDLR  221 (893)
T ss_pred             ccccccceEeccCCCcceEEEEecc---------CcceEEEEecCceEEEEEecCC
Confidence            54432   12234567777654322         1566888888875544444443


No 20 
>KOG0639 consensus Transducin-like enhancer of split protein (contains WD40 repeats) [Chromatin structure and dynamics]
Probab=94.99  E-value=0.027  Score=60.27  Aligned_cols=77  Identities=18%  Similarity=0.219  Sum_probs=57.8

Q ss_pred             ceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeeccccc-ceeeEEEEEecccccccccccCCCCCcceEEEEEEc
Q 015021          308 RKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRD-ASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHA  386 (414)
Q Consensus       308 R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRd-Aq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIya  386 (414)
                      --+..++++||-+++-.+=+.|-|.+||+.+-.+||-+|||-| |.|  |.+   .+++            +   -.--.
T Consensus       510 paCyALa~spDakvcFsccsdGnI~vwDLhnq~~VrqfqGhtDGasc--Idi---s~dG------------t---klWTG  569 (705)
T KOG0639|consen  510 PACYALAISPDAKVCFSCCSDGNIAVWDLHNQTLVRQFQGHTDGASC--IDI---SKDG------------T---KLWTG  569 (705)
T ss_pred             hhhhhhhcCCccceeeeeccCCcEEEEEcccceeeecccCCCCCcee--EEe---cCCC------------c---eeecC
Confidence            4466789999999888888899999999999999999999999 222  111   1111            0   11224


Q ss_pred             CCCCeEEEeeCCCCccee
Q 015021          387 PRKGIIEHLAGMADENWT  404 (414)
Q Consensus       387 prRg~lEvW~~~~g~~v~  404 (414)
                      ..+..|..||+|.|+++.
T Consensus       570 GlDntvRcWDlregrqlq  587 (705)
T KOG0639|consen  570 GLDNTVRCWDLREGRQLQ  587 (705)
T ss_pred             CCccceeehhhhhhhhhh
Confidence            558899999999999874


No 21 
>KOG0265 consensus U5 snRNP-specific protein-like factor and related proteins [RNA processing and modification]
Probab=94.81  E-value=0.097  Score=53.01  Aligned_cols=79  Identities=18%  Similarity=0.034  Sum_probs=65.5

Q ss_pred             eeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEcCCC
Q 015021          310 GERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAPRK  389 (414)
Q Consensus       310 ~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyaprR  389 (414)
                      +-.+.-.++++-+.++-.+-||..+|+++|..+|=.||+-+    ++....+              .++.+.||.-+.-+
T Consensus        93 VM~l~~~~d~s~i~S~gtDk~v~~wD~~tG~~~rk~k~h~~----~vNs~~p--------------~rrg~~lv~SgsdD  154 (338)
T KOG0265|consen   93 VMELHGMRDGSHILSCGTDKTVRGWDAETGKRIRKHKGHTS----FVNSLDP--------------SRRGPQLVCSGSDD  154 (338)
T ss_pred             eEeeeeccCCCEEEEecCCceEEEEecccceeeehhccccc----eeeecCc--------------cccCCeEEEecCCC
Confidence            33456678888888888889999999999999999999999    8877643              23566899999999


Q ss_pred             CeEEEeeCCCCcceeec
Q 015021          390 GIIEHLAGMADENWTTS  406 (414)
Q Consensus       390 g~lEvW~~~~g~~v~~~  406 (414)
                      |.+.+||||..+.+-|-
T Consensus       155 ~t~kl~D~R~k~~~~t~  171 (338)
T KOG0265|consen  155 GTLKLWDIRKKEAIKTF  171 (338)
T ss_pred             ceEEEEeecccchhhcc
Confidence            99999999977776553


No 22 
>KOG2727 consensus Rab3 GTPase-activating protein, non-catalytic subunit [Intracellular trafficking, secretion, and vesicular transport]
Probab=94.67  E-value=0.039  Score=62.46  Aligned_cols=84  Identities=45%  Similarity=0.833  Sum_probs=72.5

Q ss_pred             CCeEEEEeCCeEEEEeChhHHHHHHHHHHhccccccCCCCccCCCcccccccCCccceecccCCCCCceeeEEeCCCC--
Q 015021          140 EEEVCVVMPGVLARFDGSEIQKMLQRWFQDSNSNFWDQKPKQRDSEDLENSYERLPHQLWNVSKYGPCADAAITGLMP--  217 (414)
Q Consensus       140 ~eel~Ily~~~iv~IdG~sL~~~Lr~c~~~~~~~~w~~~~~~~~~~~~~~~~~~L~ykKW~l~~~~~i~Daa~~G~~~--  217 (414)
                      -++.+++-|+...+.++...+.+.+.+..+.++..|.++-..++..........|++..|+.-+..++.|..+.+++.  
T Consensus       150 yp~~~~~I~g~sl~~~L~ncq~~Vqkaa~Eknsn~~~~~~~~qk~~l~qdi~~~I~hai~~~~~~ppt~Dq~vtas~~~g  229 (1244)
T KOG2727|consen  150 YPEICIVIPGVSLRFDLSNCQSMVQKAAQEKNSNFWDQKNRKQKAELTQDIYQRIPHAIWNVNKNPPTVDQTVTASMPPG  229 (1244)
T ss_pred             ecceEEEECCchhhhhHHHHHHHHHHHHHhccCCcCCccchhhhhhcccchhhccchheeecccCCccHHHhhhcccCch
Confidence            456899999999999999999999999999999999999988887665556788999999999888889988887664  


Q ss_pred             --------CCchhh
Q 015021          218 --------PPLMEV  223 (414)
Q Consensus       218 --------p~~~d~  223 (414)
                              ||.+++
T Consensus       230 y~a~~k~SpPrySq  243 (1244)
T KOG2727|consen  230 YLALQKPSPPRYSQ  243 (1244)
T ss_pred             hhhhccCCCcceee
Confidence                    677777


No 23 
>PTZ00421 coronin; Provisional
Probab=94.50  E-value=0.26  Score=52.86  Aligned_cols=81  Identities=16%  Similarity=0.182  Sum_probs=59.4

Q ss_pred             ccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccc---eeeEEEEEecccccccccccCCCCCcce
Q 015021          303 LKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDA---SCVFMEMLVNKDAATSSAYYAPVKSDYC  379 (414)
Q Consensus       303 l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdA---q~~wi~~~~~~~~~~~~~~~~~~k~~~~  379 (414)
                      +..+...+.+|+.+|+|+++|++...|.|-++|+.++..++-++|+..+   .|.|..     +              ..
T Consensus       164 l~~h~~~V~sla~spdG~lLatgs~Dg~IrIwD~rsg~~v~tl~~H~~~~~~~~~w~~-----~--------------~~  224 (493)
T PTZ00421        164 IKCHSDQITSLEWNLDGSLLCTTSKDKKLNIIDPRDGTIVSSVEAHASAKSQRCLWAK-----R--------------KD  224 (493)
T ss_pred             EcCCCCceEEEEEECCCCEEEEecCCCEEEEEECCCCcEEEEEecCCCCcceEEEEcC-----C--------------CC
Confidence            4445566889999999999999999999999999999999999988764   355542     0              01


Q ss_pred             EEEEEE--cCCCCeEEEeeCCCCcc
Q 015021          380 LCLAIH--APRKGIIEHLAGMADEN  402 (414)
Q Consensus       380 l~LvIy--aprRg~lEvW~~~~g~~  402 (414)
                      +++..-  .-.+|.|.+|++++..+
T Consensus       225 ~ivt~G~s~s~Dr~VklWDlr~~~~  249 (493)
T PTZ00421        225 LIITLGCSKSQQRQIMLWDTRKMAS  249 (493)
T ss_pred             eEEEEecCCCCCCeEEEEeCCCCCC
Confidence            122221  13478899999987653


No 24 
>KOG0315 consensus G-protein beta subunit-like protein (contains WD40 repeats) [General function prediction only]
Probab=94.47  E-value=0.28  Score=48.89  Aligned_cols=87  Identities=15%  Similarity=0.095  Sum_probs=64.4

Q ss_pred             CCccccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcc-eeEeeecccccceeeEEEEEecccccccccccCCCCCc
Q 015021          299 PLTCLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQAL-VVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSD  377 (414)
Q Consensus       299 pl~~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~-~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~  377 (414)
                      |+.-|.-..|++.++-+||++++.|++.+.-.|.++.+.+. ..=+..+|+-.    |+.--.=..           ...
T Consensus       207 P~~k~~ah~~~il~C~lSPd~k~lat~ssdktv~iwn~~~~~kle~~l~gh~r----WvWdc~FS~-----------dg~  271 (311)
T KOG0315|consen  207 PVHKFQAHNGHILRCLLSPDVKYLATCSSDKTVKIWNTDDFFKLELVLTGHQR----WVWDCAFSA-----------DGE  271 (311)
T ss_pred             EhhheecccceEEEEEECCCCcEEEeecCCceEEEEecCCceeeEEEeecCCc----eEEeeeecc-----------Ccc
Confidence            44447788899999999999999999999999999999987 33345555543    876332111           111


Q ss_pred             ceEEEEEEcCCCCeEEEeeCCCCccee
Q 015021          378 YCLCLAIHAPRKGIIEHLAGMADENWT  404 (414)
Q Consensus       378 ~~l~LvIyaprRg~lEvW~~~~g~~v~  404 (414)
                         |||- |..|+...+|++.+|+.|.
T Consensus       272 ---YlvT-assd~~~rlW~~~~~k~v~  294 (311)
T KOG0315|consen  272 ---YLVT-ASSDHTARLWDLSAGKEVR  294 (311)
T ss_pred             ---EEEe-cCCCCceeecccccCceee
Confidence               6655 5556999999999999874


No 25 
>KOG1446 consensus Histone H3 (Lys4) methyltransferase complex and RNA cleavage factor II complex, subunit SWD2 [RNA processing and modification; Chromatin structure and dynamics; Posttranslational modification, protein turnover, chaperones]
Probab=94.23  E-value=0.26  Score=49.89  Aligned_cols=82  Identities=20%  Similarity=0.246  Sum_probs=63.2

Q ss_pred             ceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEcC
Q 015021          308 RKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAP  387 (414)
Q Consensus       308 R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyap  387 (414)
                      -+-..|.-||+|++..++++.+-++|+|.-+|.++.=+-||+.++--=+++.=..|          ++      +|+-.-
T Consensus       188 ~ew~~l~FS~dGK~iLlsT~~s~~~~lDAf~G~~~~tfs~~~~~~~~~~~a~ftPd----------s~------Fvl~gs  251 (311)
T KOG1446|consen  188 AEWTDLEFSPDGKSILLSTNASFIYLLDAFDGTVKSTFSGYPNAGNLPLSATFTPD----------SK------FVLSGS  251 (311)
T ss_pred             cceeeeEEcCCCCEEEEEeCCCcEEEEEccCCcEeeeEeeccCCCCcceeEEECCC----------Cc------EEEEec
Confidence            34577899999999999999999999999999999999999887611122211111          11      556666


Q ss_pred             CCCeEEEeeCCCCcceee
Q 015021          388 RKGIIEHLAGMADENWTT  405 (414)
Q Consensus       388 rRg~lEvW~~~~g~~v~~  405 (414)
                      -+|.|-||++.+|++|+.
T Consensus       252 ~dg~i~vw~~~tg~~v~~  269 (311)
T KOG1446|consen  252 DDGTIHVWNLETGKKVAV  269 (311)
T ss_pred             CCCcEEEEEcCCCcEeeE
Confidence            789999999999999964


No 26 
>KOG0279 consensus G protein beta subunit-like protein [Signal transduction mechanisms]
Probab=94.12  E-value=0.29  Score=49.26  Aligned_cols=88  Identities=11%  Similarity=0.056  Sum_probs=74.1

Q ss_pred             ccccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeeccc-ccceeeEEEEEecccccccccccCCCCCcce
Q 015021          301 TCLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGY-RDASCVFMEMLVNKDAATSSAYYAPVKSDYC  379 (414)
Q Consensus       301 ~~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGy-RdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~  379 (414)
                      .-|.-+.+...+++++|+++-++....+--|.|+|+-..+...|=.+. ||    |+....-.+          ++..  
T Consensus        99 ~~f~GH~~dVlsva~s~dn~qivSGSrDkTiklwnt~g~ck~t~~~~~~~~----WVscvrfsP----------~~~~--  162 (315)
T KOG0279|consen   99 RRFVGHTKDVLSVAFSTDNRQIVSGSRDKTIKLWNTLGVCKYTIHEDSHRE----WVSCVRFSP----------NESN--  162 (315)
T ss_pred             EEEEecCCceEEEEecCCCceeecCCCcceeeeeeecccEEEEEecCCCcC----cEEEEEEcC----------CCCC--
Confidence            347788899999999999999999999999999999999999999998 99    998764322          2111  


Q ss_pred             EEEEEEcCCCCeEEEeeCCCCcceee
Q 015021          380 LCLAIHAPRKGIIEHLAGMADENWTT  405 (414)
Q Consensus       380 l~LvIyaprRg~lEvW~~~~g~~v~~  405 (414)
                       ..++-+.-++.|+||++++.+.-++
T Consensus       163 -p~Ivs~s~DktvKvWnl~~~~l~~~  187 (315)
T KOG0279|consen  163 -PIIVSASWDKTVKVWNLRNCQLRTT  187 (315)
T ss_pred             -cEEEEccCCceEEEEccCCcchhhc
Confidence             3778999999999999999887654


No 27 
>KOG0316 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=94.10  E-value=0.18  Score=49.88  Aligned_cols=84  Identities=12%  Similarity=0.031  Sum_probs=64.8

Q ss_pred             CCCCccccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCC
Q 015021          297 ASPLTCLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKS  376 (414)
Q Consensus       297 a~pl~~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~  376 (414)
                      .+.+....-..++...+.++.|+.-+|..-..--|.+||+++|.++|=|.|+ +||+-=+...|+.              
T Consensus        49 g~liktYsghG~EVlD~~~s~Dnskf~s~GgDk~v~vwDV~TGkv~Rr~rgH-~aqVNtV~fNees--------------  113 (307)
T KOG0316|consen   49 GALIKTYSGHGHEVLDAALSSDNSKFASCGGDKAVQVWDVNTGKVDRRFRGH-LAQVNTVRFNEES--------------  113 (307)
T ss_pred             cceeeeecCCCceeeeccccccccccccCCCCceEEEEEcccCeeeeecccc-cceeeEEEecCcc--------------
Confidence            3455668888999999999999977777777788999999999999999998 5677666654432              


Q ss_pred             cceEEEEEEcCCCCeEEEeeCCC
Q 015021          377 DYCLCLAIHAPRKGIIEHLAGMA  399 (414)
Q Consensus       377 ~~~l~LvIyaprRg~lEvW~~~~  399 (414)
                          -.|..+.-+..+.+|+.|.
T Consensus       114 ----SVv~SgsfD~s~r~wDCRS  132 (307)
T KOG0316|consen  114 ----SVVASGSFDSSVRLWDCRS  132 (307)
T ss_pred             ----eEEEeccccceeEEEEccc
Confidence                1455566677777777664


No 28 
>KOG1273 consensus WD40 repeat protein [General function prediction only]
Probab=93.94  E-value=0.18  Score=51.55  Aligned_cols=87  Identities=16%  Similarity=0.163  Sum_probs=66.5

Q ss_pred             ccccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceE
Q 015021          301 TCLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCL  380 (414)
Q Consensus       301 ~~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l  380 (414)
                      ++-.|..- .+++..+|+|.+.|+.=..|||.++|..+-.+-||.-||-.+=+.=-.    ..++           +   
T Consensus        18 ~~tld~~~-a~~~~Fs~~G~~lAvGc~nG~vvI~D~~T~~iar~lsaH~~pi~sl~W----S~dg-----------r---   78 (405)
T KOG1273|consen   18 THTLDNPL-AECCQFSRWGDYLAVGCANGRVVIYDFDTFRIARMLSAHVRPITSLCW----SRDG-----------R---   78 (405)
T ss_pred             ceeccCCc-cceEEeccCcceeeeeccCCcEEEEEccccchhhhhhccccceeEEEe----cCCC-----------C---
Confidence            34344444 899999999999999999999999999999999999887664332211    1111           1   


Q ss_pred             EEEEEcCCCCeEEEeeCCCCcceeeccceeee
Q 015021          381 CLAIHAPRKGIIEHLAGMADENWTTSLNYSVR  412 (414)
Q Consensus       381 ~LvIyaprRg~lEvW~~~~g~~v~~~~~~~~~  412 (414)
                       +..-+.|+..+..|++..|-++     |.+|
T Consensus        79 -~LltsS~D~si~lwDl~~gs~l-----~rir  104 (405)
T KOG1273|consen   79 -KLLTSSRDWSIKLWDLLKGSPL-----KRIR  104 (405)
T ss_pred             -EeeeecCCceeEEEeccCCCce-----eEEE
Confidence             4567899999999999999877     6665


No 29 
>TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol.
Probab=93.84  E-value=0.54  Score=43.74  Aligned_cols=76  Identities=16%  Similarity=0.138  Sum_probs=50.0

Q ss_pred             eeEEEECCCCCEEEEE-cCCCcEEEEEcCcceeEeeec-ccccceeeEEEEEecccccccccccCCCCCcceEEEEEEcC
Q 015021          310 GERLTLSPSGSLAAIT-DSLGRILLLDTQALVVVRLWK-GYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAP  387 (414)
Q Consensus       310 ~~~i~~sP~~~laa~t-D~lGRV~L~D~~~~~vvrmwK-GyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyap  387 (414)
                      ...|..+|+++++.++ ...++|.++|+.++.+++... |..--.+.|      .+.         ++     +|++-+.
T Consensus       209 ~~~i~~s~dg~~~~~~~~~~~~i~v~d~~~~~~~~~~~~~~~~~~~~~------~~~---------g~-----~l~~~~~  268 (300)
T TIGR03866       209 PVGIKLTKDGKTAFVALGPANRVAVVDAKTYEVLDYLLVGQRVWQLAF------TPD---------EK-----YLLTTNG  268 (300)
T ss_pred             ccceEECCCCCEEEEEcCCCCeEEEEECCCCcEEEEEEeCCCcceEEE------CCC---------CC-----EEEEEcC
Confidence            3468899999875543 456799999999999887643 321111111      111         11     3444456


Q ss_pred             CCCeEEEeeCCCCcceee
Q 015021          388 RKGIIEHLAGMADENWTT  405 (414)
Q Consensus       388 rRg~lEvW~~~~g~~v~~  405 (414)
                      ..|.|.||++++|+++.+
T Consensus       269 ~~~~i~v~d~~~~~~~~~  286 (300)
T TIGR03866       269 VSNDVSVIDVAALKVIKS  286 (300)
T ss_pred             CCCeEEEEECCCCcEEEE
Confidence            789999999999998754


No 30 
>TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol.
Probab=93.82  E-value=0.49  Score=44.02  Aligned_cols=75  Identities=15%  Similarity=0.127  Sum_probs=51.0

Q ss_pred             eEEEECCCCCEE-EEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEcCCC
Q 015021          311 ERLTLSPSGSLA-AITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAPRK  389 (414)
Q Consensus       311 ~~i~~sP~~~la-a~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyaprR  389 (414)
                      ..+.++|+|+.+ ++....++|.++|+.++.+++.+....+.  ..+.+.   ++         +  +   +|++-....
T Consensus        34 ~~l~~~~dg~~l~~~~~~~~~v~~~d~~~~~~~~~~~~~~~~--~~~~~~---~~---------g--~---~l~~~~~~~   94 (300)
T TIGR03866        34 RGITLSKDGKLLYVCASDSDTIQVIDLATGEVIGTLPSGPDP--ELFALH---PN---------G--K---ILYIANEDD   94 (300)
T ss_pred             CceEECCCCCEEEEEECCCCeEEEEECCCCcEEEeccCCCCc--cEEEEC---CC---------C--C---EEEEEcCCC
Confidence            468999999865 55677899999999999998877653331  122111   10         1  1   355555677


Q ss_pred             CeEEEeeCCCCccee
Q 015021          390 GIIEHLAGMADENWT  404 (414)
Q Consensus       390 g~lEvW~~~~g~~v~  404 (414)
                      +.|.+|++++++.+.
T Consensus        95 ~~l~~~d~~~~~~~~  109 (300)
T TIGR03866        95 NLVTVIDIETRKVLA  109 (300)
T ss_pred             CeEEEEECCCCeEEe
Confidence            899999999877653


No 31 
>PTZ00420 coronin; Provisional
Probab=93.79  E-value=0.47  Score=51.96  Aligned_cols=87  Identities=10%  Similarity=-0.059  Sum_probs=58.6

Q ss_pred             CCccccCCCceeeEEEECCCCCE-EEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCc
Q 015021          299 PLTCLKDHPRKGERLTLSPSGSL-AAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSD  377 (414)
Q Consensus       299 pl~~l~D~~R~~~~i~~sP~~~l-aa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~  377 (414)
                      |+..+..+.+.+..|+.+|++.. +|++...|.|.|+|+.++..+..++.. + .+.-+...   .++            
T Consensus       117 p~~~L~gH~~~V~sVaf~P~g~~iLaSgS~DgtIrIWDl~tg~~~~~i~~~-~-~V~Slsws---pdG------------  179 (568)
T PTZ00420        117 PQCILKGHKKKISIIDWNPMNYYIMCSSGFDSFVNIWDIENEKRAFQINMP-K-KLSSLKWN---IKG------------  179 (568)
T ss_pred             ceEEeecCCCcEEEEEECCCCCeEEEEEeCCCeEEEEECCCCcEEEEEecC-C-cEEEEEEC---CCC------------
Confidence            33346667788999999999964 566778899999999999877766532 1 22111111   111            


Q ss_pred             ceEEEEEEcCCCCeEEEeeCCCCcceee
Q 015021          378 YCLCLAIHAPRKGIIEHLAGMADENWTT  405 (414)
Q Consensus       378 ~~l~LvIyaprRg~lEvW~~~~g~~v~~  405 (414)
                       .  +++-+-++|.|.||++++|+.+.+
T Consensus       180 -~--lLat~s~D~~IrIwD~Rsg~~i~t  204 (568)
T PTZ00420        180 -N--LLSGTCVGKHMHIIDPRKQEIASS  204 (568)
T ss_pred             -C--EEEEEecCCEEEEEECCCCcEEEE
Confidence             1  334456788999999999887743


No 32 
>KOG0295 consensus WD40 repeat-containing protein [Function unknown]
Probab=93.72  E-value=5.8  Score=41.48  Aligned_cols=92  Identities=7%  Similarity=-0.102  Sum_probs=61.2

Q ss_pred             eeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccc--eeeEEEEEecccccccccccCCCCCcceEEEEEEc
Q 015021          309 KGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDA--SCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHA  386 (414)
Q Consensus       309 ~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdA--q~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIya  386 (414)
                      -...|.+.-||++.|..++.=+|-++=++++....+..++--.  -.+|..-.+..+    .....+.. ..+.+| .-+
T Consensus       237 wvr~v~v~~DGti~As~s~dqtl~vW~~~t~~~k~~lR~hEh~vEci~wap~~~~~~----i~~at~~~-~~~~~l-~s~  310 (406)
T KOG0295|consen  237 WVRMVRVNQDGTIIASCSNDQTLRVWVVATKQCKAELREHEHPVECIAWAPESSYPS----ISEATGST-NGGQVL-GSG  310 (406)
T ss_pred             hEEEEEecCCeeEEEecCCCceEEEEEeccchhhhhhhccccceEEEEecccccCcc----hhhccCCC-CCccEE-Eee
Confidence            5677889999999999999999999999999666665554331  123443222111    00011111 122244 568


Q ss_pred             CCCCeEEEeeCCCCcceeec
Q 015021          387 PRKGIIEHLAGMADENWTTS  406 (414)
Q Consensus       387 prRg~lEvW~~~~g~~v~~~  406 (414)
                      .|+++|++|+|++|.|+-|-
T Consensus       311 SrDktIk~wdv~tg~cL~tL  330 (406)
T KOG0295|consen  311 SRDKTIKIWDVSTGMCLFTL  330 (406)
T ss_pred             cccceEEEEeccCCeEEEEE
Confidence            89999999999999998663


No 33 
>PF00400 WD40:  WD domain, G-beta repeat;  InterPro: IPR019781 WD-40 repeats (also known as WD or beta-transducin repeats) are short ~40 amino acid motifs, often terminating in a Trp-Asp (W-D) dipeptide. WD40 repeats usually assume a 7-8 bladed beta-propeller fold, but proteins have been found with 4 to 16 repeated units, which also form a circularised beta-propeller structure. WD-repeat proteins are a large family found in all eukaryotes and are implicated in a variety of functions ranging from signal transduction and transcription regulation to cell cycle control and apoptosis. Repeated WD40 motifs act as a site for protein-protein interaction, and proteins containing WD40 repeats are known to serve as platforms for the assembly of protein complexes or mediators of transient interplay among other proteins. The specificity of the proteins is determined by the sequences outside the repeats themselves. Examples of such complexes are G proteins (beta subunit is a beta-propeller), TAFII transcription factor, and E3 ubiquitin ligase [, ]. In Arabidopsis spp., several WD40-containing proteins act as key regulators of plant-specific developmental events.; PDB: 2ZKQ_a 3CFV_B 3CFS_B 1PEV_A 1NR0_A 1VYH_T 3RFH_A 3O2Z_T 3FRX_C 3U5G_g ....
Probab=92.98  E-value=0.18  Score=33.93  Aligned_cols=36  Identities=25%  Similarity=0.389  Sum_probs=32.7

Q ss_pred             CccccCCCceeeEEEECCCCCEEEEEcCCCcEEEEE
Q 015021          300 LTCLKDHPRKGERLTLSPSGSLAAITDSLGRILLLD  335 (414)
Q Consensus       300 l~~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D  335 (414)
                      +..|......+.+|+.+|+++++|++...|.|.++|
T Consensus         4 ~~~~~~h~~~i~~i~~~~~~~~~~s~~~D~~i~vwd   39 (39)
T PF00400_consen    4 VRTFRGHSSSINSIAWSPDGNFLASGSSDGTIRVWD   39 (39)
T ss_dssp             EEEEESSSSSEEEEEEETTSSEEEEEETTSEEEEEE
T ss_pred             EEEEcCCCCcEEEEEEecccccceeeCCCCEEEEEC
Confidence            456788889999999999999999999999999987


No 34 
>PTZ00420 coronin; Provisional
Probab=92.96  E-value=0.99  Score=49.47  Aligned_cols=50  Identities=12%  Similarity=0.143  Sum_probs=42.4

Q ss_pred             ceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccc---eeeEEE
Q 015021          308 RKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDA---SCVFME  357 (414)
Q Consensus       308 R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdA---q~~wi~  357 (414)
                      ..+.+++.+|+|+++|++-..|.|.|+|+.++.+++.++||...   .+-|+.
T Consensus       168 ~~V~SlswspdG~lLat~s~D~~IrIwD~Rsg~~i~tl~gH~g~~~s~~v~~~  220 (568)
T PTZ00420        168 KKLSSLKWNIKGNLLSGTCVGKHMHIIDPRKQEIASSFHIHDGGKNTKNIWID  220 (568)
T ss_pred             CcEEEEEECCCCCEEEEEecCCEEEEEECCCCcEEEEEecccCCceeEEEEee
Confidence            35789999999999999888899999999999999999998763   444653


No 35 
>PLN00181 protein SPA1-RELATED; Provisional
Probab=92.73  E-value=0.66  Score=51.92  Aligned_cols=79  Identities=14%  Similarity=0.024  Sum_probs=57.9

Q ss_pred             eeeEEEECCC-CCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEcC
Q 015021          309 KGERLTLSPS-GSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAP  387 (414)
Q Consensus       309 ~~~~i~~sP~-~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyap  387 (414)
                      ...+++.+|. ++++|+++..|.|.|||+.++..++.++|+.+.    |....-.+          .  +.  -+++-+.
T Consensus       534 ~v~~l~~~~~~~~~las~~~Dg~v~lWd~~~~~~~~~~~~H~~~----V~~l~~~p----------~--~~--~~L~Sgs  595 (793)
T PLN00181        534 KLSGICWNSYIKSQVASSNFEGVVQVWDVARSQLVTEMKEHEKR----VWSIDYSS----------A--DP--TLLASGS  595 (793)
T ss_pred             ceeeEEeccCCCCEEEEEeCCCeEEEEECCCCeEEEEecCCCCC----EEEEEEcC----------C--CC--CEEEEEc
Confidence            4567777765 689999999999999999999999999999873    22211100          0  00  1456667


Q ss_pred             CCCeEEEeeCCCCcceee
Q 015021          388 RKGIIEHLAGMADENWTT  405 (414)
Q Consensus       388 rRg~lEvW~~~~g~~v~~  405 (414)
                      .+|.|.+|++++++++.+
T Consensus       596 ~Dg~v~iWd~~~~~~~~~  613 (793)
T PLN00181        596 DDGSVKLWSINQGVSIGT  613 (793)
T ss_pred             CCCEEEEEECCCCcEEEE
Confidence            799999999999887643


No 36 
>PF14727 PHTB1_N:  PTHB1 N-terminus
Probab=91.63  E-value=1.3  Score=46.82  Aligned_cols=70  Identities=19%  Similarity=0.301  Sum_probs=50.1

Q ss_pred             EEEEeEEe----CCcEEEEEeccccEEEEEecCC------cEEeecccCccceeEEEEecccCCCCCCCCCCeEEEEeCC
Q 015021           80 TAIEWLVF----EEMRALAVGTSRGYFLVYDLKG------DLVHRQLIHPGRILKLRVRGSRRDLTQDTAEEEVCVVMPG  149 (414)
Q Consensus        80 Ts~~~lpl----~dw~~I~VG~ssG~vrfyte~G------~LL~sQ~lh~~pV~~ik~r~~~~~~~~~~~~eel~Ily~~  149 (414)
                      -++++-++    .+.-.|+||--+|++|+|..++      +||+...+ ..||++|.+..=..    ......|.||.|.
T Consensus        23 ~~l~v~~~~~~~~~~d~IivGS~~G~LrIy~P~~~~~~~~~lllE~~l-~~PILqv~~G~F~s----~~~~~~LaVLhP~   97 (418)
T PF14727_consen   23 GSLCVGNLDNSPSGSDKIIVGSYSGILRIYDPSGNEFQPEDLLLETQL-KDPILQVECGKFVS----GSEDLQLAVLHPR   97 (418)
T ss_pred             ceEEEEcccCCCCCccEEEEeccccEEEEEccCCCCCCCccEEEEEec-CCcEEEEEeccccC----CCCcceEEEecCC
Confidence            34545555    2446999999999999999533      47777666 58999999876321    1235689999999


Q ss_pred             eEEEE
Q 015021          150 VLARF  154 (414)
Q Consensus       150 ~iv~I  154 (414)
                      .++..
T Consensus        98 kl~vY  102 (418)
T PF14727_consen   98 KLSVY  102 (418)
T ss_pred             EEEEE
Confidence            86543


No 37 
>KOG0308 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=91.16  E-value=0.73  Score=50.87  Aligned_cols=80  Identities=15%  Similarity=0.150  Sum_probs=66.7

Q ss_pred             CceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEc
Q 015021          307 PRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHA  386 (414)
Q Consensus       307 ~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIya  386 (414)
                      +-.+-+++.-|+|..++..+.-+-+.|+|-.++.=+-=.+||||    -+.+....+++           .    =++.+
T Consensus       171 k~siYSLA~N~t~t~ivsGgtek~lr~wDprt~~kimkLrGHTd----NVr~ll~~dDG-----------t----~~ls~  231 (735)
T KOG0308|consen  171 KDSIYSLAMNQTGTIIVSGGTEKDLRLWDPRTCKKIMKLRGHTD----NVRVLLVNDDG-----------T----RLLSA  231 (735)
T ss_pred             ccceeeeecCCcceEEEecCcccceEEeccccccceeeeecccc----ceEEEEEcCCC-----------C----eEeec
Confidence            34467889999999999999999999999998875555669999    88777655544           1    24789


Q ss_pred             CCCCeEEEeeCCCCcceee
Q 015021          387 PRKGIIEHLAGMADENWTT  405 (414)
Q Consensus       387 prRg~lEvW~~~~g~~v~~  405 (414)
                      ..+|.|.+|++.+-+|++|
T Consensus       232 sSDgtIrlWdLgqQrCl~T  250 (735)
T KOG0308|consen  232 SSDGTIRLWDLGQQRCLAT  250 (735)
T ss_pred             CCCceEEeeeccccceeee
Confidence            9999999999999999998


No 38 
>KOG1274 consensus WD40 repeat protein [General function prediction only]
Probab=91.03  E-value=0.7  Score=52.53  Aligned_cols=108  Identities=19%  Similarity=0.131  Sum_probs=65.7

Q ss_pred             eeeeeeecccc-----------cccCCCCCcccCCCeee---eccCcceeee-eecceEEE------EeecCCCCCceee
Q 015021            9 EVGSIACTDLS-----------DLGAGKEGWLVNDPNLL---CALDMHTIAL-ANRYQTVI------INWADPEGLVAKI   67 (414)
Q Consensus         9 ~~~~~~~~~~~-----------~~g~~~~~WL~~~~~~~---~sp~~~~la~-A~~~~~v~------~~w~~~~~~~v~~   67 (414)
                      ++|||-|-+-+           +.|.-+..=+.|.-.+-   +|-.|-++|- +-++|...      ..|++.    -.|
T Consensus       442 ~vGiI~t~~~e~~~ssIdVeFHD~sihr~~H~~d~~~y~lA~ls~~g~llAsp~s~sk~~sil~~~h~~w~s~----seW  517 (933)
T KOG1274|consen  442 EVGIIRTVVNEANDSSIDVEFHDTSIHRAYHFSDLFGYELADLSEKGTLLASPESESKLGSILYRAHFSWDSH----SEW  517 (933)
T ss_pred             ccceEEEEeccCcCceEEEEEeccCccceeeeeccccceeeeccccceEEecccccCCcceEEEEcccCcccc----cce
Confidence            57888853322           23433555554422222   3334433332 33444444      236553    357


Q ss_pred             cccCCCCCCCeEEEEEeEEeCCcEEEEEeccccEEEEEecCCcEEeecccCccceeEEEE
Q 015021           68 RPELSPIASEYITAIEWLVFEEMRALAVGTSRGYFLVYDLKGDLVHRQLIHPGRILKLRV  127 (414)
Q Consensus        68 ~g~l~~~~~e~ITs~~~lpl~dw~~I~VG~ssG~vrfyte~G~LL~sQ~lh~~pV~~ik~  127 (414)
                      .=.|..  +|.|++|++-+.    .|+|+|+.||||+||..|.-..-- =|+.||+...+
T Consensus       518 tm~lP~--~E~~~~V~~t~~----~Vav~TS~~~lRvFt~gGvq~~I~-t~~gP~vtaag  570 (933)
T KOG1274|consen  518 TMILPL--QESIEAVAATSG----WVAVATSLGYLRVFTIGGVQREIF-TLPGPVVTAAG  570 (933)
T ss_pred             eeecCC--CCceeEEEccCc----EEEEEeccceEEEEEecceeeeEe-ecccceEEeec
Confidence            767744  488999977544    799999999999999999754332 45789998873


No 39 
>KOG0272 consensus U4/U6 small nuclear ribonucleoprotein Prp4 (contains WD40 repeats) [RNA processing and modification]
Probab=90.63  E-value=0.93  Score=47.84  Aligned_cols=76  Identities=17%  Similarity=0.147  Sum_probs=58.9

Q ss_pred             CceeeEEEECCCCCEEEEE--cCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEE
Q 015021          307 PRKGERLTLSPSGSLAAIT--DSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAI  384 (414)
Q Consensus       307 ~R~~~~i~~sP~~~laa~t--D~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvI  384 (414)
                      .+...+|+--|||.++++.  |++|||  +|+.+|.-|-...||-+.=++ +..   .+.               -|.+.
T Consensus       303 s~~v~~iaf~~DGSL~~tGGlD~~~Rv--WDlRtgr~im~L~gH~k~I~~-V~f---sPN---------------Gy~lA  361 (459)
T KOG0272|consen  303 SKGVFSIAFQPDGSLAATGGLDSLGRV--WDLRTGRCIMFLAGHIKEILS-VAF---SPN---------------GYHLA  361 (459)
T ss_pred             ccccceeEecCCCceeeccCccchhhe--eecccCcEEEEecccccceee-EeE---CCC---------------ceEEe
Confidence            4667789999999999994  999996  899999999999998772111 111   111               17888


Q ss_pred             EcCCCCeEEEeeCCCCcce
Q 015021          385 HAPRKGIIEHLAGMADENW  403 (414)
Q Consensus       385 yaprRg~lEvW~~~~g~~v  403 (414)
                      -...+++..||+++--..+
T Consensus       362 Tgs~Dnt~kVWDLR~r~~l  380 (459)
T KOG0272|consen  362 TGSSDNTCKVWDLRMRSEL  380 (459)
T ss_pred             ecCCCCcEEEeeecccccc
Confidence            8899999999999876665


No 40 
>KOG0284 consensus Polyadenylation factor I complex, subunit PFS2 [RNA processing and modification]
Probab=90.55  E-value=0.44  Score=50.09  Aligned_cols=79  Identities=16%  Similarity=0.120  Sum_probs=62.4

Q ss_pred             eeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccc--cceeeEEEEEecccccccccccCCCCCcceEEEEEEc
Q 015021          309 KGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYR--DASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHA  386 (414)
Q Consensus       309 ~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyR--dAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIya  386 (414)
                      .+..++.||+-..++.+..+|+|.++|.....-=|+..|+=  =-.|.|..                .|     -|+.-+
T Consensus       182 aIRdlafSpnDskF~t~SdDg~ikiWdf~~~kee~vL~GHgwdVksvdWHP----------------~k-----gLiasg  240 (464)
T KOG0284|consen  182 AIRDLAFSPNDSKFLTCSDDGTIKIWDFRMPKEERVLRGHGWDVKSVDWHP----------------TK-----GLIASG  240 (464)
T ss_pred             hhheeccCCCCceeEEecCCCeEEEEeccCCchhheeccCCCCcceeccCC----------------cc-----ceeEEc
Confidence            45678999988888888889999999999988888887762  11223332                12     488999


Q ss_pred             CCCCeEEEeeCCCCcceeeccc
Q 015021          387 PRKGIIEHLAGMADENWTTSLN  408 (414)
Q Consensus       387 prRg~lEvW~~~~g~~v~~~~~  408 (414)
                      .++..|++||.++|+|++|...
T Consensus       241 skDnlVKlWDprSg~cl~tlh~  262 (464)
T KOG0284|consen  241 SKDNLVKLWDPRSGSCLATLHG  262 (464)
T ss_pred             cCCceeEeecCCCcchhhhhhh
Confidence            9999999999999999998753


No 41 
>PF02239 Cytochrom_D1:  Cytochrome D1 heme domain; PDB: 1NNO_B 1HZU_A 1N15_B 1N50_A 1GJQ_A 1BL9_B 1NIR_B 1N90_B 1HZV_A 1AOQ_A ....
Probab=89.89  E-value=1.9  Score=44.56  Aligned_cols=50  Identities=18%  Similarity=0.180  Sum_probs=36.5

Q ss_pred             ccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccce
Q 015021          303 LKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDAS  352 (414)
Q Consensus       303 l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq  352 (414)
                      ++-...--..+..+|||+++-++...|.|.+||+.++.+++-.|--.++.
T Consensus        32 i~~~~~~h~~~~~s~Dgr~~yv~~rdg~vsviD~~~~~~v~~i~~G~~~~   81 (369)
T PF02239_consen   32 IPTGGAPHAGLKFSPDGRYLYVANRDGTVSVIDLATGKVVATIKVGGNPR   81 (369)
T ss_dssp             EE-STTEEEEEE-TT-SSEEEEEETTSEEEEEETTSSSEEEEEE-SSEEE
T ss_pred             EcCCCCceeEEEecCCCCEEEEEcCCCeEEEEECCcccEEEEEecCCCcc
Confidence            33333333456789999999999889999999999999999888776653


No 42 
>KOG0646 consensus WD40 repeat protein [General function prediction only]
Probab=89.83  E-value=3.6  Score=43.97  Aligned_cols=203  Identities=18%  Similarity=0.187  Sum_probs=114.4

Q ss_pred             CCcEEEEEeccccEEEEEe-cCCcEEeecccCccceeEEEEecccCCCCCCCCCCeEEEEeCCe-EEEEeChhHHHHHHH
Q 015021           88 EEMRALAVGTSRGYFLVYD-LKGDLVHRQLIHPGRILKLRVRGSRRDLTQDTAEEEVCVVMPGV-LARFDGSEIQKMLQR  165 (414)
Q Consensus        88 ~dw~~I~VG~ssG~vrfyt-e~G~LL~sQ~lh~~pV~~ik~r~~~~~~~~~~~~eel~Ily~~~-iv~IdG~sL~~~Lr~  165 (414)
                      |+-.+++.|+-+|-+.... -+|.||-.-+-|.-+|.+|++-.-..         -+.=--.++ |..-+-.+|.+.   
T Consensus        91 n~G~~l~ag~i~g~lYlWelssG~LL~v~~aHYQ~ITcL~fs~dgs---------~iiTgskDg~V~vW~l~~lv~a---  158 (476)
T KOG0646|consen   91 NLGYFLLAGTISGNLYLWELSSGILLNVLSAHYQSITCLKFSDDGS---------HIITGSKDGAVLVWLLTDLVSA---  158 (476)
T ss_pred             CCceEEEeecccCcEEEEEeccccHHHHHHhhccceeEEEEeCCCc---------EEEecCCCccEEEEEEEeeccc---
Confidence            6678999999999999999 59999988899999999999654322         111111111 211111222211   


Q ss_pred             HHHhccccccCCCCccCCCcccccccCCccceecccCCCCCceeeEE-eCCCCCCchhhhccccceeEEEeCCCceeEEE
Q 015021          166 WFQDSNSNFWDQKPKQRDSEDLENSYERLPHQLWNVSKYGPCADAAI-TGLMPPPLMEVQSSQRYFCAVTIGEDSVISAF  244 (414)
Q Consensus       166 c~~~~~~~~w~~~~~~~~~~~~~~~~~~L~ykKW~l~~~~~i~Daa~-~G~~~p~~~d~~s~~~~~~~i~vG~~P~la~y  244 (414)
                                          +  +...+.|+.+|+= ..-+|.|.-| .|...            .++++++.+-.+-.|
T Consensus       159 --------------------~--~~~~~~p~~~f~~-HtlsITDl~ig~Gg~~------------~rl~TaS~D~t~k~w  203 (476)
T KOG0646|consen  159 --------------------D--NDHSVKPLHIFSD-HTLSITDLQIGSGGTN------------ARLYTASEDRTIKLW  203 (476)
T ss_pred             --------------------c--cCCCccceeeecc-CcceeEEEEecCCCcc------------ceEEEecCCceEEEE
Confidence                                1  1124466667752 3347888877 33211            125677777666666


Q ss_pred             EeccCCCcchhhhhhhhhhHHHHHHhhhhh--------------hhhhcCC--C-CCCCCCCCCccc-ccCCCCccccCC
Q 015021          245 RLSEDRSRSLVGAILSKVVPATFSTISSLS--------------KMIWRSE--Q-SPKKSEPKPQSF-ARASPLTCLKDH  306 (414)
Q Consensus       245 ~~~~~~~~s~~~al~S~va~av~S~v~S~a--------------k~~W~~~--~-~~~~~e~kp~~~-~~a~pl~~l~D~  306 (414)
                      -++.+.   +    +-.++.-+  .+++++              +..|-.+  + +.+..-.+.+.. +--+....|.-+
T Consensus       204 dlS~g~---L----Llti~fp~--si~av~lDpae~~~yiGt~~G~I~~~~~~~~~~~~~~v~~k~~~~~~t~~~~~~Gh  274 (476)
T KOG0646|consen  204 DLSLGV---L----LLTITFPS--SIKAVALDPAERVVYIGTEEGKIFQNLLFKLSGQSAGVNQKGRHEENTQINVLVGH  274 (476)
T ss_pred             Eeccce---e----eEEEecCC--cceeEEEcccccEEEecCCcceEEeeehhcCCcccccccccccccccceeeeeccc
Confidence            655442   1    11110000  011110              0011000  0 111110111110 112233446666


Q ss_pred             Cc--eeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeec
Q 015021          307 PR--KGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWK  346 (414)
Q Consensus       307 ~R--~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwK  346 (414)
                      ..  .+.+++++-||++.+..|-.|.|.++|+...++||.--
T Consensus       275 ~~~~~ITcLais~DgtlLlSGd~dg~VcvWdi~S~Q~iRtl~  316 (476)
T KOG0646|consen  275 ENESAITCLAISTDGTLLLSGDEDGKVCVWDIYSKQCIRTLQ  316 (476)
T ss_pred             cCCcceeEEEEecCccEEEeeCCCCCEEEEecchHHHHHHHh
Confidence            55  79999999999999999999999999999999999643


No 43 
>KOG0279 consensus G protein beta subunit-like protein [Signal transduction mechanisms]
Probab=88.95  E-value=2.2  Score=43.10  Aligned_cols=78  Identities=8%  Similarity=0.053  Sum_probs=66.6

Q ss_pred             CceeeEEEECCC--CCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEE
Q 015021          307 PRKGERLTLSPS--GSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAI  384 (414)
Q Consensus       307 ~R~~~~i~~sP~--~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvI  384 (414)
                      +--..++..+|+  +-+++.+-.++-|=++|+.+-++..++=|+-.    -+...+..++|               .|.+
T Consensus       148 ~~WVscvrfsP~~~~p~Ivs~s~DktvKvWnl~~~~l~~~~~gh~~----~v~t~~vSpDG---------------slca  208 (315)
T KOG0279|consen  148 REWVSCVRFSPNESNPIIVSASWDKTVKVWNLRNCQLRTTFIGHSG----YVNTVTVSPDG---------------SLCA  208 (315)
T ss_pred             cCcEEEEEEcCCCCCcEEEEccCCceEEEEccCCcchhhccccccc----cEEEEEECCCC---------------CEEe
Confidence            566899999999  57888899999999999999999999999766    66666555444               3789


Q ss_pred             EcCCCCeEEEeeCCCCcce
Q 015021          385 HAPRKGIIEHLAGMADENW  403 (414)
Q Consensus       385 yaprRg~lEvW~~~~g~~v  403 (414)
                      |..++|.+-+|++..|+++
T Consensus       209 sGgkdg~~~LwdL~~~k~l  227 (315)
T KOG0279|consen  209 SGGKDGEAMLWDLNEGKNL  227 (315)
T ss_pred             cCCCCceEEEEEccCCcee
Confidence            9999999999999999998


No 44 
>PF15492 Nbas_N:  Neuroblastoma-amplified sequence, N terminal
Probab=88.75  E-value=0.63  Score=46.74  Aligned_cols=43  Identities=30%  Similarity=0.428  Sum_probs=39.5

Q ss_pred             cCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeec
Q 015021          304 KDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWK  346 (414)
Q Consensus       304 ~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwK  346 (414)
                      .+.+-.+-++.+||+|++.|+...-|+|.|+++-+....|-||
T Consensus       226 ~~~~d~i~kmSlSPdg~~La~ih~sG~lsLW~iPsL~~~~~W~  268 (282)
T PF15492_consen  226 GQEQDGIFKMSLSPDGSLLACIHFSGSLSLWEIPSLRLQRSWK  268 (282)
T ss_pred             ccCCCceEEEEECCCCCEEEEEEcCCeEEEEecCcchhhcccc
Confidence            4556678899999999999999999999999999999999996


No 45 
>KOG0277 consensus Peroxisomal targeting signal type 2 receptor [Intracellular trafficking, secretion, and vesicular transport]
Probab=88.54  E-value=1.5  Score=44.05  Aligned_cols=88  Identities=15%  Similarity=0.109  Sum_probs=67.3

Q ss_pred             CCCCccccCCCceeeEEEECCCCC-EEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCC
Q 015021          297 ASPLTCLKDHPRKGERLTLSPSGS-LAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVK  375 (414)
Q Consensus       297 a~pl~~l~D~~R~~~~i~~sP~~~-laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k  375 (414)
                      +.|+.++.+++|++.++--.+..+ -..++.=+|.|=|+|-.+..-|+-++|+++.=.+=.  ..+.             
T Consensus        94 s~Pi~~~kEH~~EV~Svdwn~~~r~~~ltsSWD~TiKLW~~~r~~Sv~Tf~gh~~~Iy~a~--~sp~-------------  158 (311)
T KOG0277|consen   94 SKPIHKFKEHKREVYSVDWNTVRRRIFLTSSWDGTIKLWDPNRPNSVQTFNGHNSCIYQAA--FSPH-------------  158 (311)
T ss_pred             CcchhHHHhhhhheEEeccccccceeEEeeccCCceEeecCCCCcceEeecCCccEEEEEe--cCCC-------------
Confidence            458899999999999999999875 445567789999999999999999999998422111  1000             


Q ss_pred             CcceEEEEEEcCCCCeEEEeeCCC-Ccc
Q 015021          376 SDYCLCLAIHAPRKGIIEHLAGMA-DEN  402 (414)
Q Consensus       376 ~~~~l~LvIyaprRg~lEvW~~~~-g~~  402 (414)
                       ..  =|+-++.-+|.+.+|+++. |+.
T Consensus       159 -~~--nlfas~Sgd~~l~lwdvr~~gk~  183 (311)
T KOG0277|consen  159 -IP--NLFASASGDGTLRLWDVRSPGKF  183 (311)
T ss_pred             -CC--CeEEEccCCceEEEEEecCCCce
Confidence             01  2778999999999999886 443


No 46 
>PF15492 Nbas_N:  Neuroblastoma-amplified sequence, N terminal
Probab=88.25  E-value=2.5  Score=42.57  Aligned_cols=76  Identities=18%  Similarity=0.152  Sum_probs=49.0

Q ss_pred             eeeccCcceeeeeecceEEEEeecCCCCCceeecccCCCCCCCeEEEEEeEEeCCcEEEEEeccccEEEEEecCCcEEe
Q 015021           35 LLCALDMHTIALANRYQTVIINWADPEGLVAKIRPELSPIASEYITAIEWLVFEEMRALAVGTSRGYFLVYDLKGDLVH  113 (414)
Q Consensus        35 ~~~sp~~~~la~A~~~~~v~~~w~~~~~~~v~~~g~l~~~~~e~ITs~~~lpl~dw~~I~VG~ssG~vrfyte~G~LL~  113 (414)
                      ++++.+|.+||+-+++.+-|-.=.++...-+.=|..-.+...| =.=++|  -+|.+.+|.-.++|+|+||+..|..||
T Consensus         3 ~~~~~~Gk~lAi~qd~~iEiRsa~Ddf~si~~kcqVpkD~~PQ-WRkl~W--SpD~tlLa~a~S~G~i~vfdl~g~~lf   78 (282)
T PF15492_consen    3 LALSSDGKLLAILQDQCIEIRSAKDDFSSIIGKCQVPKDPNPQ-WRKLAW--SPDCTLLAYAESTGTIRVFDLMGSELF   78 (282)
T ss_pred             eeecCCCcEEEEEeccEEEEEeccCCchheeEEEecCCCCCch-heEEEE--CCCCcEEEEEcCCCeEEEEecccceeE
Confidence            6789999999999999988832222211111112211111112 111222  189999999999999999999998888


No 47 
>KOG0282 consensus mRNA splicing factor [Function unknown]
Probab=88.14  E-value=1.2  Score=47.67  Aligned_cols=67  Identities=18%  Similarity=0.220  Sum_probs=51.9

Q ss_pred             eeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccc--eeeEEEEEecccccccccccCCCCCcceEEEEEEcC
Q 015021          310 GERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDA--SCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAP  387 (414)
Q Consensus       310 ~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdA--q~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyap  387 (414)
                      ...+..||||++++..|+.|+|..+|-.+-.++.-||+|-..  +..|+...             +.       -||-+.
T Consensus       435 s~~v~fSpDG~~l~SGdsdG~v~~wdwkt~kl~~~lkah~~~ci~v~wHP~e-------------~S-------kvat~~  494 (503)
T KOG0282|consen  435 SCQVDFSPDGRTLCSGDSDGKVNFWDWKTTKLVSKLKAHDQPCIGVDWHPVE-------------PS-------KVATCG  494 (503)
T ss_pred             eeeEEEcCCCCeEEeecCCccEEEeechhhhhhhccccCCcceEEEEecCCC-------------cc-------eeEecc
Confidence            345778999999999999999999999999999999999331  23454321             12       346667


Q ss_pred             CCCeEEEee
Q 015021          388 RKGIIEHLA  396 (414)
Q Consensus       388 rRg~lEvW~  396 (414)
                      =.|.|.||+
T Consensus       495 w~G~Ikiwd  503 (503)
T KOG0282|consen  495 WDGLIKIWD  503 (503)
T ss_pred             cCceeEecC
Confidence            789999995


No 48 
>KOG0319 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=88.07  E-value=2.2  Score=47.79  Aligned_cols=82  Identities=13%  Similarity=0.133  Sum_probs=64.3

Q ss_pred             CCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEE
Q 015021          305 DHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAI  384 (414)
Q Consensus       305 D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvI  384 (414)
                      -+...+.+++++|+.+|+|+..-+--.=++|+.++.++-+.+|||.-  -|-.-+...              +   -++-
T Consensus       461 aHdKdIN~Vaia~ndkLiAT~SqDktaKiW~le~~~l~~vLsGH~RG--vw~V~Fs~~--------------d---q~la  521 (775)
T KOG0319|consen  461 AHDKDINCVAIAPNDKLIATGSQDKTAKIWDLEQLRLLGVLSGHTRG--VWCVSFSKN--------------D---QLLA  521 (775)
T ss_pred             hhcccccceEecCCCceEEecccccceeeecccCceEEEEeeCCccc--eEEEEeccc--------------c---ceeE
Confidence            34577899999999999999877777778888899999999999982  133222111              1   2666


Q ss_pred             EcCCCCeEEEeeCCCCcceee
Q 015021          385 HAPRKGIIEHLAGMADENWTT  405 (414)
Q Consensus       385 yaprRg~lEvW~~~~g~~v~~  405 (414)
                      -+.-+..|+||++.++.|+-|
T Consensus       522 T~SgD~TvKIW~is~fSClkT  542 (775)
T KOG0319|consen  522 TCSGDKTVKIWSISTFSCLKT  542 (775)
T ss_pred             eccCCceEEEEEeccceeeee
Confidence            778899999999999999866


No 49 
>KOG0283 consensus WD40 repeat-containing protein [Function unknown]
Probab=87.56  E-value=3.1  Score=46.71  Aligned_cols=69  Identities=23%  Similarity=0.370  Sum_probs=49.1

Q ss_pred             CCcEEEEEeccccEEEEEecCCcEEeecc-cCcc--------ceeEEEEecccCCCCCCCCCCeEEEEeCCe-EEEEeC-
Q 015021           88 EEMRALAVGTSRGYFLVYDLKGDLVHRQL-IHPG--------RILKLRVRGSRRDLTQDTAEEEVCVVMPGV-LARFDG-  156 (414)
Q Consensus        88 ~dw~~I~VG~ssG~vrfyte~G~LL~sQ~-lh~~--------pV~~ik~r~~~~~~~~~~~~eel~Ily~~~-iv~IdG-  156 (414)
                      ||-...+||+=.|+.|||+-.|..|..|. ++..        .|..|.+....        .++|.|--.+- |=++|| 
T Consensus       461 PdGk~avIGt~~G~C~fY~t~~lk~~~~~~I~~~~~Kk~~~~rITG~Q~~p~~--------~~~vLVTSnDSrIRI~d~~  532 (712)
T KOG0283|consen  461 PDGKGAVIGTFNGYCRFYDTEGLKLVSDFHIRLHNKKKKQGKRITGLQFFPGD--------PDEVLVTSNDSRIRIYDGR  532 (712)
T ss_pred             cCCceEEEEEeccEEEEEEccCCeEEEeeeEeeccCccccCceeeeeEecCCC--------CCeEEEecCCCceEEEecc
Confidence            66778999999999999999999888873 3222        45555544433        33788888886 778899 


Q ss_pred             -hhHHHHHH
Q 015021          157 -SEIQKMLQ  164 (414)
Q Consensus       157 -~sL~~~Lr  164 (414)
                       -+|-.-++
T Consensus       533 ~~~lv~KfK  541 (712)
T KOG0283|consen  533 DKDLVHKFK  541 (712)
T ss_pred             chhhhhhhc
Confidence             66654443


No 50 
>KOG0285 consensus Pleiotropic regulator 1 [RNA processing and modification]
Probab=86.82  E-value=4.7  Score=42.21  Aligned_cols=50  Identities=12%  Similarity=0.062  Sum_probs=44.3

Q ss_pred             ccccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeeccccc
Q 015021          301 TCLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRD  350 (414)
Q Consensus       301 ~~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRd  350 (414)
                      +.+.-+-.-+.+|.++|.+.|+|....++-|=++|+++|+..--.-|+-.
T Consensus       145 rVi~gHlgWVr~vavdP~n~wf~tgs~DrtikIwDlatg~LkltltGhi~  194 (460)
T KOG0285|consen  145 RVISGHLGWVRSVAVDPGNEWFATGSADRTIKIWDLATGQLKLTLTGHIE  194 (460)
T ss_pred             hhhhhccceEEEEeeCCCceeEEecCCCceeEEEEcccCeEEEeecchhh
Confidence            45777788899999999999999999999999999999999887777643


No 51 
>KOG0318 consensus WD40 repeat stress protein/actin interacting protein [Cytoskeleton]
Probab=86.76  E-value=3.3  Score=45.01  Aligned_cols=40  Identities=25%  Similarity=0.343  Sum_probs=37.3

Q ss_pred             ccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeE
Q 015021          303 LKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVV  342 (414)
Q Consensus       303 l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vv  342 (414)
                      +.+++-....|+.||++.++|++|..+.|+|+|+++..+-
T Consensus       483 ~~~h~a~iT~vaySpd~~yla~~Da~rkvv~yd~~s~~~~  522 (603)
T KOG0318|consen  483 LLEHRAAITDVAYSPDGAYLAAGDASRKVVLYDVASREVK  522 (603)
T ss_pred             eecccCCceEEEECCCCcEEEEeccCCcEEEEEcccCcee
Confidence            6777888999999999999999999999999999999983


No 52 
>KOG1407 consensus WD40 repeat protein [Function unknown]
Probab=86.67  E-value=0.58  Score=46.93  Aligned_cols=47  Identities=23%  Similarity=0.377  Sum_probs=40.6

Q ss_pred             CCccccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeee
Q 015021          299 PLTCLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLW  345 (414)
Q Consensus       299 pl~~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmw  345 (414)
                      |+.+|.-++-.+..|..+|+|+++|+.-.+.-|-|+|+..++.+|+.
T Consensus       181 pv~si~AH~snCicI~f~p~GryfA~GsADAlvSLWD~~ELiC~R~i  227 (313)
T KOG1407|consen  181 PVQSIKAHPSNCICIEFDPDGRYFATGSADALVSLWDVDELICERCI  227 (313)
T ss_pred             cccccccCCcceEEEEECCCCceEeeccccceeeccChhHhhhheee
Confidence            44456666667889999999999999888889999999999999987


No 53 
>KOG0291 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=86.29  E-value=12  Score=42.52  Aligned_cols=88  Identities=16%  Similarity=0.215  Sum_probs=63.9

Q ss_pred             eeeccCcceeeeeecc--eEEEEeecCCCCCceeecccCCCCCCCeEEEEEeEEe-CCcEEEEEeccccEEEEEec-CCc
Q 015021           35 LLCALDMHTIALANRY--QTVIINWADPEGLVAKIRPELSPIASEYITAIEWLVF-EEMRALAVGTSRGYFLVYDL-KGD  110 (414)
Q Consensus        35 ~~~sp~~~~la~A~~~--~~v~~~w~~~~~~~v~~~g~l~~~~~e~ITs~~~lpl-~dw~~I~VG~ssG~vrfyte-~G~  110 (414)
                      +++..+||.||++...  .++|=.|-+. .+-++=-|-.     .+||++   .. ||-..||-|-.+|-|++|.. +|-
T Consensus       313 ~~~N~tGDWiA~g~~klgQLlVweWqsE-sYVlKQQgH~-----~~i~~l---~YSpDgq~iaTG~eDgKVKvWn~~Sgf  383 (893)
T KOG0291|consen  313 VSFNSTGDWIAFGCSKLGQLLVWEWQSE-SYVLKQQGHS-----DRITSL---AYSPDGQLIATGAEDGKVKVWNTQSGF  383 (893)
T ss_pred             EEecccCCEEEEcCCccceEEEEEeecc-ceeeeccccc-----cceeeE---EECCCCcEEEeccCCCcEEEEeccCce
Confidence            4567889999999988  7777778554 2223333333     335544   44 78899999999999999995 777


Q ss_pred             EEeecccCccceeEEEEeccc
Q 015021          111 LVHRQLIHPGRILKLRVRGSR  131 (414)
Q Consensus       111 LL~sQ~lh~~pV~~ik~r~~~  131 (414)
                      ++..=-=|.+-|..+++--..
T Consensus       384 C~vTFteHts~Vt~v~f~~~g  404 (893)
T KOG0291|consen  384 CFVTFTEHTSGVTAVQFTARG  404 (893)
T ss_pred             EEEEeccCCCceEEEEEEecC
Confidence            777666678888888865544


No 54 
>PF08662 eIF2A:  Eukaryotic translation initiation factor eIF2A;  InterPro: IPR013979  This entry contains beta propellor domains found in eukaryotic translation initiation factors and TolB domain-containing proteins. 
Probab=86.28  E-value=5.6  Score=37.12  Aligned_cols=75  Identities=15%  Similarity=0.244  Sum_probs=50.6

Q ss_pred             eeEEEECCCCCEEEEE--cCC-CcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEc
Q 015021          310 GERLTLSPSGSLAAIT--DSL-GRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHA  386 (414)
Q Consensus       310 ~~~i~~sP~~~laa~t--D~l-GRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIya  386 (414)
                      ...|.-||+|+++|++  +++ |.|-++|+.+...+.-.+-..-..|.|-.      ++           ++ +.-..-+
T Consensus       103 ~n~i~wsP~G~~l~~~g~~n~~G~l~~wd~~~~~~i~~~~~~~~t~~~WsP------dG-----------r~-~~ta~t~  164 (194)
T PF08662_consen  103 RNTISWSPDGRFLVLAGFGNLNGDLEFWDVRKKKKISTFEHSDATDVEWSP------DG-----------RY-LATATTS  164 (194)
T ss_pred             ceEEEECCCCCEEEEEEccCCCcEEEEEECCCCEEeeccccCcEEEEEEcC------CC-----------CE-EEEEEec
Confidence            3579999999998886  344 99999999999998877655433444431      11           11 1222334


Q ss_pred             CC---CCeEEEeeCCCCcce
Q 015021          387 PR---KGIIEHLAGMADENW  403 (414)
Q Consensus       387 pr---Rg~lEvW~~~~g~~v  403 (414)
                      ||   +.-+.||+. +|+++
T Consensus       165 ~r~~~dng~~Iw~~-~G~~l  183 (194)
T PF08662_consen  165 PRLRVDNGFKIWSF-QGRLL  183 (194)
T ss_pred             cceeccccEEEEEe-cCeEe
Confidence            54   366899998 58888


No 55 
>PF08662 eIF2A:  Eukaryotic translation initiation factor eIF2A;  InterPro: IPR013979  This entry contains beta propellor domains found in eukaryotic translation initiation factors and TolB domain-containing proteins. 
Probab=86.21  E-value=5.3  Score=37.29  Aligned_cols=76  Identities=16%  Similarity=0.163  Sum_probs=48.6

Q ss_pred             eeeEEEECCCCCE-EEEE-cCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEE--
Q 015021          309 KGERLTLSPSGSL-AAIT-DSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAI--  384 (414)
Q Consensus       309 ~~~~i~~sP~~~l-aa~t-D~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvI--  384 (414)
                      .+..++=+|+|+. |+++ +.-++|.|+|+..-.+...=++.++ .+.|=.      .         +   .  +|++  
T Consensus        61 ~I~~~~WsP~g~~favi~g~~~~~v~lyd~~~~~i~~~~~~~~n-~i~wsP------~---------G---~--~l~~~g  119 (194)
T PF08662_consen   61 PIHDVAWSPNGNEFAVIYGSMPAKVTLYDVKGKKIFSFGTQPRN-TISWSP------D---------G---R--FLVLAG  119 (194)
T ss_pred             ceEEEEECcCCCEEEEEEccCCcccEEEcCcccEeEeecCCCce-EEEECC------C---------C---C--EEEEEE
Confidence            4889999999964 4553 5567999999973333333344444 233321      1         1   1  3444  


Q ss_pred             EcCCCCeEEEeeCCCCcceee
Q 015021          385 HAPRKGIIEHLAGMADENWTT  405 (414)
Q Consensus       385 yaprRg~lEvW~~~~g~~v~~  405 (414)
                      +.-..|.|++|+++..++|.+
T Consensus       120 ~~n~~G~l~~wd~~~~~~i~~  140 (194)
T PF08662_consen  120 FGNLNGDLEFWDVRKKKKIST  140 (194)
T ss_pred             ccCCCcEEEEEECCCCEEeec
Confidence            344568899999999998865


No 56 
>KOG0288 consensus WD40 repeat protein TipD [General function prediction only]
Probab=85.97  E-value=1.5  Score=46.32  Aligned_cols=42  Identities=17%  Similarity=0.186  Sum_probs=39.8

Q ss_pred             eeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccc
Q 015021          310 GERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDA  351 (414)
Q Consensus       310 ~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdA  351 (414)
                      -.+++.||++.|+|+.-..|+|.+|++.+|.+..+.|+-+..
T Consensus       390 wtrvvfSpd~~YvaAGS~dgsv~iW~v~tgKlE~~l~~s~s~  431 (459)
T KOG0288|consen  390 WTRVVFSPDGSYVAAGSADGSVYIWSVFTGKLEKVLSLSTSN  431 (459)
T ss_pred             cceeEECCCCceeeeccCCCcEEEEEccCceEEEEeccCCCC
Confidence            677899999999999999999999999999999999998886


No 57 
>TIGR02658 TTQ_MADH_Hv methylamine dehydrogenase heavy chain. This family consists of the heavy chain of methylamine dehydrogenase light chain, a periplasmic enzyme. The enzyme contains a tryptophan tryptophylquinone (TTQ) prothetic group derived from two Trp residues in the light subunity. The enzyme forms a complex with the type I blue copper protein amicyanin and a cytochrome. Electron transfer procedes from TQQ to the copper and then to the heme group of the cytochrome.
Probab=85.78  E-value=4.5  Score=41.95  Aligned_cols=67  Identities=13%  Similarity=0.074  Sum_probs=51.0

Q ss_pred             CCCEEEEEcCC-----CcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEcC-----
Q 015021          318 SGSLAAITDSL-----GRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAP-----  387 (414)
Q Consensus       318 ~~~laa~tD~l-----GRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyap-----  387 (414)
                      +++++-++|.-     |||.+||..++.++.|+..-+..+.  + +.  .|             ..  +|.+=.+     
T Consensus        11 ~~~~v~V~d~~~~~~~~~v~ViD~~~~~v~g~i~~G~~P~~--~-~s--pD-------------g~--~lyva~~~~~R~   70 (352)
T TIGR02658        11 DARRVYVLDPGHFAATTQVYTIDGEAGRVLGMTDGGFLPNP--V-VA--SD-------------GS--FFAHASTVYSRI   70 (352)
T ss_pred             CCCEEEEECCcccccCceEEEEECCCCEEEEEEEccCCCce--e-EC--CC-------------CC--EEEEEecccccc
Confidence            56889999987     9999999999999999976665553  2 11  11             11  4556666     


Q ss_pred             ----CCCeEEEeeCCCCccee
Q 015021          388 ----RKGIIEHLAGMADENWT  404 (414)
Q Consensus       388 ----rRg~lEvW~~~~g~~v~  404 (414)
                          |.+.|+|||+++++.+.
T Consensus        71 ~~G~~~d~V~v~D~~t~~~~~   91 (352)
T TIGR02658        71 ARGKRTDYVEVIDPQTHLPIA   91 (352)
T ss_pred             ccCCCCCEEEEEECccCcEEe
Confidence                88999999999998874


No 58 
>KOG0281 consensus Beta-TrCP (transducin repeats containing)/Slimb proteins [Function unknown]
Probab=84.91  E-value=2.6  Score=44.00  Aligned_cols=98  Identities=14%  Similarity=0.012  Sum_probs=68.9

Q ss_pred             ceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEE-----EEEeccccc-------ccc-------
Q 015021          308 RKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFM-----EMLVNKDAA-------TSS-------  368 (414)
Q Consensus       308 R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi-----~~~~~~~~~-------~~~-------  368 (414)
                      -.|..+|+..+.++.+...++..|.++|+++|.++...=|+-||-.+-.     .+.+..|..       +.+       
T Consensus       236 HtGSVLCLqyd~rviisGSSDsTvrvWDv~tge~l~tlihHceaVLhlrf~ng~mvtcSkDrsiaVWdm~sps~it~rrV  315 (499)
T KOG0281|consen  236 HTGSVLCLQYDERVIVSGSSDSTVRVWDVNTGEPLNTLIHHCEAVLHLRFSNGYMVTCSKDRSIAVWDMASPTDITLRRV  315 (499)
T ss_pred             CCCcEEeeeccceEEEecCCCceEEEEeccCCchhhHHhhhcceeEEEEEeCCEEEEecCCceeEEEeccCchHHHHHHH
Confidence            3477899999999999989999999999999999999999998866532     222222211       000       


Q ss_pred             ---cccCCCCCcceEEEEEEcCCCCeEEEeeCCCCcceee
Q 015021          369 ---AYYAPVKSDYCLCLAIHAPRKGIIEHLAGMADENWTT  405 (414)
Q Consensus       369 ---~~~~~~k~~~~l~LvIyaprRg~lEvW~~~~g~~v~~  405 (414)
                         ........++-==+++-|.-+..|+||++.+|+.|-|
T Consensus       316 LvGHrAaVNvVdfd~kyIVsASgDRTikvW~~st~efvRt  355 (499)
T KOG0281|consen  316 LVGHRAAVNVVDFDDKYIVSASGDRTIKVWSTSTCEFVRT  355 (499)
T ss_pred             HhhhhhheeeeccccceEEEecCCceEEEEeccceeeehh
Confidence               0111111111112778999999999999999999965


No 59 
>PF08596 Lgl_C:  Lethal giant larvae(Lgl) like, C-terminal;  InterPro: IPR013905  The Lethal giant larvae (Lgl) tumour suppressor protein is conserved from yeast to mammals. The Lgl protein functions in cell polarity, at least in part, by regulating SNARE-mediated membrane delivery events at the cell surface []. The N-terminal half of Lgl members contains WD40 repeats (see IPR001680 from INTERPRO), while the C-terminal half appears specific to the protein []. ; PDB: 2OAJ_A.
Probab=84.83  E-value=4.8  Score=42.20  Aligned_cols=64  Identities=31%  Similarity=0.435  Sum_probs=37.5

Q ss_pred             eeeeeecc-eEEEEeecCCC-CCceeecc-cCCCCCCCeEEEEEeEEe---CCc---EEEEEeccccEEEEEe
Q 015021           43 TIALANRY-QTVIINWADPE-GLVAKIRP-ELSPIASEYITAIEWLVF---EEM---RALAVGTSRGYFLVYD  106 (414)
Q Consensus        43 ~la~A~~~-~~v~~~w~~~~-~~~v~~~g-~l~~~~~e~ITs~~~lpl---~dw---~~I~VG~ssG~vrfyt  106 (414)
                      .+||||++ .++|.--.... .++...+. -++...++.||+++|-.+   .|-   .|+.|||+.|.|..|.
T Consensus        99 Fvaigy~~G~l~viD~RGPavI~~~~i~~~~~~~~~~~~vt~ieF~vm~~~~D~ySSi~L~vGTn~G~v~~fk  171 (395)
T PF08596_consen   99 FVAIGYESGSLVVIDLRGPAVIYNENIRESFLSKSSSSYVTSIEFSVMTLGGDGYSSICLLVGTNSGNVLTFK  171 (395)
T ss_dssp             EEEEEETTSEEEEEETTTTEEEEEEEGGG--T-SS----EEEEEEEEEE-TTSSSEEEEEEEEETTSEEEEEE
T ss_pred             EEEEEecCCcEEEEECCCCeEEeeccccccccccccccCeeEEEEEEEecCCCcccceEEEEEeCCCCEEEEE
Confidence            45788865 45555442222 12222222 133345689999999877   333   8999999999999996


No 60 
>PF12894 Apc4_WD40:  Anaphase-promoting complex subunit 4 WD40 domain
Probab=84.47  E-value=1.9  Score=32.06  Aligned_cols=36  Identities=25%  Similarity=0.478  Sum_probs=30.2

Q ss_pred             CCeEEEEEeEEeCCcEEEEEeccccEEEEEecCCcEEe
Q 015021           76 SEYITAIEWLVFEEMRALAVGTSRGYFLVYDLKGDLVH  113 (414)
Q Consensus        76 ~e~ITs~~~lpl~dw~~I~VG~ssG~vrfyte~G~LL~  113 (414)
                      .++|+++.|=|-.|  .||+|+.+|.|.+|-.+|..+.
T Consensus        11 ~~~v~~~~w~P~md--LiA~~t~~g~v~v~Rl~~qriw   46 (47)
T PF12894_consen   11 PSRVSCMSWCPTMD--LIALGTEDGEVLVYRLNWQRIW   46 (47)
T ss_pred             CCcEEEEEECCCCC--EEEEEECCCeEEEEECCCcCcc
Confidence            46688888877655  7999999999999999998765


No 61 
>KOG0296 consensus Angio-associated migratory cell protein (contains WD40 repeats) [Function unknown]
Probab=84.16  E-value=12  Score=39.17  Aligned_cols=98  Identities=14%  Similarity=0.051  Sum_probs=66.5

Q ss_pred             ccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecc-------ccc-------cc-
Q 015021          303 LKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNK-------DAA-------TS-  367 (414)
Q Consensus       303 l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~-------~~~-------~~-  367 (414)
                      +..++-....+.-|-+|.|.|+.|-.|.|++++..++...+..- |--..+.|+.-....       +++       .+ 
T Consensus       102 ltgHKDSVt~~~FshdgtlLATGdmsG~v~v~~~stg~~~~~~~-~e~~dieWl~WHp~a~illAG~~DGsvWmw~ip~~  180 (399)
T KOG0296|consen  102 LTGHKDSVTCCSFSHDGTLLATGDMSGKVLVFKVSTGGEQWKLD-QEVEDIEWLKWHPRAHILLAGSTDGSVWMWQIPSQ  180 (399)
T ss_pred             ecCCCCceEEEEEccCceEEEecCCCccEEEEEcccCceEEEee-cccCceEEEEecccccEEEeecCCCcEEEEECCCc
Confidence            56677778999999999999999999999999999998877554 455566676554321       111       00 


Q ss_pred             ---------ccccCCCC--CcceEEEEEEcCCCCeEEEeeCCCCcce
Q 015021          368 ---------SAYYAPVK--SDYCLCLAIHAPRKGIIEHLAGMADENW  403 (414)
Q Consensus       368 ---------~~~~~~~k--~~~~l~LvIyaprRg~lEvW~~~~g~~v  403 (414)
                               +.+-..++  ++.-..+.-|-  +|+|.+|+.++|+.+
T Consensus       181 ~~~kv~~Gh~~~ct~G~f~pdGKr~~tgy~--dgti~~Wn~ktg~p~  225 (399)
T KOG0296|consen  181 ALCKVMSGHNSPCTCGEFIPDGKRILTGYD--DGTIIVWNPKTGQPL  225 (399)
T ss_pred             ceeeEecCCCCCcccccccCCCceEEEEec--CceEEEEecCCCcee
Confidence                     01101111  11112455665  999999999999865


No 62 
>KOG0275 consensus Conserved WD40 repeat-containing protein [General function prediction only]
Probab=84.08  E-value=1.2  Score=46.04  Aligned_cols=81  Identities=15%  Similarity=0.160  Sum_probs=62.4

Q ss_pred             CCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEE
Q 015021          306 HPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIH  385 (414)
Q Consensus       306 ~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIy  385 (414)
                      +.....++..|-|+.-+..+..+-.|-+--+..|..++-++|+-.    ++.-.--++++               .-+|.
T Consensus       305 HtkGvt~l~FSrD~SqiLS~sfD~tvRiHGlKSGK~LKEfrGHsS----yvn~a~ft~dG---------------~~iis  365 (508)
T KOG0275|consen  305 HTKGVTCLSFSRDNSQILSASFDQTVRIHGLKSGKCLKEFRGHSS----YVNEATFTDDG---------------HHIIS  365 (508)
T ss_pred             hccCeeEEEEccCcchhhcccccceEEEeccccchhHHHhcCccc----cccceEEcCCC---------------CeEEE
Confidence            345577889999997666666666677778899999999999988    66422222222               25799


Q ss_pred             cCCCCeEEEeeCCCCcceee
Q 015021          386 APRKGIIEHLAGMADENWTT  405 (414)
Q Consensus       386 aprRg~lEvW~~~~g~~v~~  405 (414)
                      |..+|.++||++++++|+.|
T Consensus       366 aSsDgtvkvW~~KtteC~~T  385 (508)
T KOG0275|consen  366 ASSDGTVKVWHGKTTECLST  385 (508)
T ss_pred             ecCCccEEEecCcchhhhhh
Confidence            99999999999999999987


No 63 
>KOG4640 consensus Anaphase-promoting complex (APC), subunit 4 [Cell cycle control, cell division, chromosome partitioning; Posttranslational modification, protein turnover, chaperones]
Probab=83.83  E-value=2.8  Score=46.33  Aligned_cols=84  Identities=23%  Similarity=0.208  Sum_probs=57.5

Q ss_pred             eeeeccCcceeeeeecc-eEEEEeecCCCCCceeecccCCCCCCCeEE-EEEeEEeCCcEEEEEeccccEEEEEe-cCCc
Q 015021           34 NLLCALDMHTIALANRY-QTVIINWADPEGLVAKIRPELSPIASEYIT-AIEWLVFEEMRALAVGTSRGYFLVYD-LKGD  110 (414)
Q Consensus        34 ~~~~sp~~~~la~A~~~-~~v~~~w~~~~~~~v~~~g~l~~~~~e~IT-s~~~lpl~dw~~I~VG~ssG~vrfyt-e~G~  110 (414)
                      .+-..|--|+||.+.+. .+++-+...+.    -|  +++ +++|.+| +++|=  +|-..+||||.+|-+++.+ |+|.
T Consensus        25 ~~ewnP~~dLiA~~t~~gelli~R~n~qR----lw--tip-~p~~~v~~sL~W~--~DGkllaVg~kdG~I~L~Dve~~~   95 (665)
T KOG4640|consen   25 RIEWNPKMDLIATRTEKGELLIHRLNWQR----LW--TIP-IPGENVTASLCWR--PDGKLLAVGFKDGTIRLHDVEKGG   95 (665)
T ss_pred             EEEEcCccchhheeccCCcEEEEEeccce----eE--ecc-CCCCccceeeeec--CCCCEEEEEecCCeEEEEEccCCC
Confidence            34567889999987776 46664322221    01  122 2477787 66665  4789999999999999999 8999


Q ss_pred             EEeecccC-ccceeEEE
Q 015021          111 LVHRQLIH-PGRILKLR  126 (414)
Q Consensus       111 LL~sQ~lh-~~pV~~ik  126 (414)
                      .|.+..+. +++|.++-
T Consensus        96 ~l~~~~~s~e~~is~~~  112 (665)
T KOG4640|consen   96 RLVSFLFSVETDISKGI  112 (665)
T ss_pred             ceeccccccccchheee
Confidence            99887665 34555544


No 64 
>KOG0276 consensus Vesicle coat complex COPI, beta' subunit [Intracellular trafficking, secretion, and vesicular transport]
Probab=83.76  E-value=16  Score=40.74  Aligned_cols=211  Identities=17%  Similarity=0.237  Sum_probs=134.7

Q ss_pred             CeEEEEEeEEeCCcEEEEEeccccEEEEEe-cCCcEEeecccCccceeEEEEecccCCCCCCCCCCeEEEEeCCeEEEEe
Q 015021           77 EYITAIEWLVFEEMRALAVGTSRGYFLVYD-LKGDLVHRQLIHPGRILKLRVRGSRRDLTQDTAEEEVCVVMPGVLARFD  155 (414)
Q Consensus        77 e~ITs~~~lpl~dw~~I~VG~ssG~vrfyt-e~G~LL~sQ~lh~~pV~~ik~r~~~~~~~~~~~~eel~Ily~~~iv~Id  155 (414)
                      .-|.+.-.|+=.+|  |++|-++++||+|. .+|+.+..=--|++-|.+|.+..+.+.           ||-.       
T Consensus        56 ~PvRa~kfiaRknW--iv~GsDD~~IrVfnynt~ekV~~FeAH~DyIR~iavHPt~P~-----------vLts-------  115 (794)
T KOG0276|consen   56 VPVRAAKFIARKNW--IVTGSDDMQIRVFNYNTGEKVKTFEAHSDYIRSIAVHPTLPY-----------VLTS-------  115 (794)
T ss_pred             cchhhheeeeccce--EEEecCCceEEEEecccceeeEEeeccccceeeeeecCCCCe-----------EEec-------
Confidence            45777777777888  78999999999999 799999888889999999998776651           1110       


Q ss_pred             ChhHHHHHHHHHHhccccccCCCCccCCCcccccccCCccceecccCCCCCceeeEEeCCCCCCchhhhccccceeEEEe
Q 015021          156 GSEIQKMLQRWFQDSNSNFWDQKPKQRDSEDLENSYERLPHQLWNVSKYGPCADAAITGLMPPPLMEVQSSQRYFCAVTI  235 (414)
Q Consensus       156 G~sL~~~Lr~c~~~~~~~~w~~~~~~~~~~~~~~~~~~L~ykKW~l~~~~~i~Daa~~G~~~p~~~d~~s~~~~~~~i~v  235 (414)
                                                  +.       .+-.|.|+-+..=.|.- .|            .+|..|  |..
T Consensus       116 ----------------------------SD-------Dm~iKlW~we~~wa~~q-tf------------eGH~Hy--VMq  145 (794)
T KOG0276|consen  116 ----------------------------SD-------DMTIKLWDWENEWACEQ-TF------------EGHEHY--VMQ  145 (794)
T ss_pred             ----------------------------CC-------ccEEEEeeccCceeeee-EE------------cCcceE--EEE
Confidence                                        01       15567777765322211 11            123333  321


Q ss_pred             CCCceeEEEEeccCCCcchhhhhhhhhhHHHHHHhhhhhhhhhcCCCCCCCCCCCCcccccCCCCccccCCCceeeEEEE
Q 015021          236 GEDSVISAFRLSEDRSRSLVGAILSKVVPATFSTISSLSKMIWRSEQSPKKSEPKPQSFARASPLTCLKDHPRKGERLTL  315 (414)
Q Consensus       236 G~~P~la~y~~~~~~~~s~~~al~S~va~av~S~v~S~ak~~W~~~~~~~~~e~kp~~~~~a~pl~~l~D~~R~~~~i~~  315 (414)
                           + .|+.....+  |.++-+.           .-.| .|.=.        .      ..|-..|.-+.|...+|.-
T Consensus       146 -----v-~fnPkD~nt--FaS~sLD-----------rTVK-VWslg--------s------~~~nfTl~gHekGVN~Vdy  191 (794)
T KOG0276|consen  146 -----V-AFNPKDPNT--FASASLD-----------RTVK-VWSLG--------S------PHPNFTLEGHEKGVNCVDY  191 (794)
T ss_pred             -----E-EecCCCccc--eeeeecc-----------ccEE-EEEcC--------C------CCCceeeeccccCcceEEe
Confidence                 2 233332222  2221111           1123 55211        0      1233557777888888888


Q ss_pred             CCCC--CEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEcCCCCeEE
Q 015021          316 SPSG--SLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAPRKGIIE  393 (414)
Q Consensus       316 sP~~--~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyaprRg~lE  393 (414)
                      =|.|  -+....-.+--|-++|-++...|+-.+||-.    -+....-               |.-|=++|-..-+|.+.
T Consensus       192 y~~gdkpylIsgaDD~tiKvWDyQtk~CV~TLeGHt~----Nvs~v~f---------------hp~lpiiisgsEDGTvr  252 (794)
T KOG0276|consen  192 YTGGDKPYLISGADDLTIKVWDYQTKSCVQTLEGHTN----NVSFVFF---------------HPELPIIISGSEDGTVR  252 (794)
T ss_pred             ccCCCcceEEecCCCceEEEeecchHHHHHHhhcccc----cceEEEe---------------cCCCcEEEEecCCccEE
Confidence            7777  3666654455788999999999999999977    3333211               22334899999999999


Q ss_pred             EeeCCCCcceeeccceee
Q 015021          394 HLAGMADENWTTSLNYSV  411 (414)
Q Consensus       394 vW~~~~g~~v~~~~~~~~  411 (414)
                      ||+..+-+.. ..|||..
T Consensus       253 iWhs~Ty~lE-~tLn~gl  269 (794)
T KOG0276|consen  253 IWNSKTYKLE-KTLNYGL  269 (794)
T ss_pred             EecCcceehh-hhhhcCC
Confidence            9999998766 6688864


No 65 
>KOG0306 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=83.50  E-value=2.8  Score=47.22  Aligned_cols=89  Identities=12%  Similarity=0.104  Sum_probs=72.5

Q ss_pred             CCccccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcc
Q 015021          299 PLTCLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDY  378 (414)
Q Consensus       299 pl~~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~  378 (414)
                      ++-.|+-+++.+..+..+-.|...|..--+|-|+|+|+..-..+-=.+||-|+=.+..-..  .            +   
T Consensus        99 ~~~tfngHK~AVt~l~fd~~G~rlaSGskDt~IIvwDlV~E~Gl~rL~GHkd~iT~~~F~~--~------------~---  161 (888)
T KOG0306|consen   99 ILITFNGHKAAVTTLKFDKIGTRLASGSKDTDIIVWDLVGEEGLFRLRGHKDSITQALFLN--G------------D---  161 (888)
T ss_pred             eeeeecccccceEEEEEcccCceEeecCCCccEEEEEeccceeeEEeecchHHHhHHhccC--C------------C---
Confidence            4445888999999999999999999999999999999999999999999999644333221  0            1   


Q ss_pred             eEEEEEEcCCCCeEEEeeCCCCcceeec
Q 015021          379 CLCLAIHAPRKGIIEHLAGMADENWTTS  406 (414)
Q Consensus       379 ~l~LvIyaprRg~lEvW~~~~g~~v~~~  406 (414)
                        -++|-..+++.|++|++.+-+|+-|-
T Consensus       162 --~~lvS~sKDs~iK~WdL~tqhCf~Th  187 (888)
T KOG0306|consen  162 --SFLVSVSKDSMIKFWDLETQHCFETH  187 (888)
T ss_pred             --eEEEEeccCceEEEEecccceeeeEE
Confidence              27788899999999999998888553


No 66 
>KOG0318 consensus WD40 repeat stress protein/actin interacting protein [Cytoskeleton]
Probab=83.07  E-value=8  Score=42.16  Aligned_cols=88  Identities=20%  Similarity=0.139  Sum_probs=62.2

Q ss_pred             cccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEE
Q 015021          302 CLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLC  381 (414)
Q Consensus       302 ~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~  381 (414)
                      ++.++.|=..++--+|||.++|.+-++|.|+|+|=.+|..+-...+ -+|.=+=|-...=.+             +.-.|
T Consensus       185 s~r~HskFV~~VRysPDG~~Fat~gsDgki~iyDGktge~vg~l~~-~~aHkGsIfalsWsP-------------Ds~~~  250 (603)
T KOG0318|consen  185 SFREHSKFVNCVRYSPDGSRFATAGSDGKIYIYDGKTGEKVGELED-SDAHKGSIFALSWSP-------------DSTQF  250 (603)
T ss_pred             cccccccceeeEEECCCCCeEEEecCCccEEEEcCCCccEEEEecC-CCCccccEEEEEECC-------------CCceE
Confidence            4667778889999999999999999999999999999999988865 333332222221111             11114


Q ss_pred             EEEEcCCCCeEEEeeCCCCcceee
Q 015021          382 LAIHAPRKGIIEHLAGMADENWTT  405 (414)
Q Consensus       382 LvIyaprRg~lEvW~~~~g~~v~~  405 (414)
                      |-  +.-+-.++||++-+..++.|
T Consensus       251 ~T--~SaDkt~KIWdVs~~slv~t  272 (603)
T KOG0318|consen  251 LT--VSADKTIKIWDVSTNSLVST  272 (603)
T ss_pred             EE--ecCCceEEEEEeeccceEEE
Confidence            33  34467889999888877755


No 67 
>KOG1912 consensus WD40 repeat protein [General function prediction only]
Probab=82.61  E-value=2.2  Score=48.28  Aligned_cols=66  Identities=24%  Similarity=0.313  Sum_probs=46.3

Q ss_pred             CEEEEEcCCCcEEEEEcCcceeEeeecccc-c--ceeeEEEEEecccccccccccCCCCCcceEEEEEEcCCCCeEEEee
Q 015021          320 SLAAITDSLGRILLLDTQALVVVRLWKGYR-D--ASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAPRKGIIEHLA  396 (414)
Q Consensus       320 ~laa~tD~lGRV~L~D~~~~~vvrmwKGyR-d--Aq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyaprRg~lEvW~  396 (414)
                      -+.|++|-.|||+|+|...+.++- |=-.- |  -++.|++....               .+-+.|+||.|  ..|-.|+
T Consensus        80 lliAsaD~~GrIil~d~~~~s~~~-~l~~~~~~~qdl~W~~~rd~---------------Srd~LlaIh~s--s~lvLwn  141 (1062)
T KOG1912|consen   80 LLIASADISGRIILVDFVLASVIN-WLSHSNDSVQDLCWVPARDD---------------SRDVLLAIHGS--STLVLWN  141 (1062)
T ss_pred             eeEEeccccCcEEEEEehhhhhhh-hhcCCCcchhheeeeeccCc---------------chheeEEecCC--cEEEEEE
Confidence            478899999999999999876543 31111 1  23455554321               12457999998  6899999


Q ss_pred             CCCCcce
Q 015021          397 GMADENW  403 (414)
Q Consensus       397 ~~~g~~v  403 (414)
                      +.+|+++
T Consensus       142 tdtG~k~  148 (1062)
T KOG1912|consen  142 TDTGEKF  148 (1062)
T ss_pred             ccCCcee
Confidence            9999987


No 68 
>KOG0307 consensus Vesicle coat complex COPII, subunit SEC31 [Intracellular trafficking, secretion, and vesicular transport]
Probab=82.15  E-value=9.8  Score=44.46  Aligned_cols=76  Identities=20%  Similarity=0.268  Sum_probs=58.1

Q ss_pred             ecCCCCCceeecccCCCCCCCeEEEEEeEEeCCc--EEEEEeccccEEEEEecCC-------cEEeecccCccceeEEEE
Q 015021           57 WADPEGLVAKIRPELSPIASEYITAIEWLVFEEM--RALAVGTSRGYFLVYDLKG-------DLVHRQLIHPGRILKLRV  127 (414)
Q Consensus        57 w~~~~~~~v~~~g~l~~~~~e~ITs~~~lpl~dw--~~I~VG~ssG~vrfyte~G-------~LL~sQ~lh~~pV~~ik~  127 (414)
                      |..++.++++-+|.++.  .+|-.-+.|=++.+.  =+||=|+.+|.|-||+..-       .+|..+.-|.++|+.|.+
T Consensus        47 d~~~~~~dlk~~~s~~s--~~rF~kL~W~~~g~~~~GlIaGG~edG~I~ly~p~~~~~~~~~~~la~~~~h~G~V~gLDf  124 (1049)
T KOG0307|consen   47 DFSDESSDLKPVGSLQS--SNRFNKLAWGSYGSHSHGLIAGGLEDGNIVLYDPASIIANASEEVLATKSKHTGPVLGLDF  124 (1049)
T ss_pred             cccCccccccccccccc--cccceeeeecccCCCccceeeccccCCceEEecchhhccCcchHHHhhhcccCCceeeeec
Confidence            44444556677787766  588888888888444  4899999999999999544       467777899999999998


Q ss_pred             ecccCCC
Q 015021          128 RGSRRDL  134 (414)
Q Consensus       128 r~~~~~~  134 (414)
                      ..-+.++
T Consensus       125 N~~q~nl  131 (1049)
T KOG0307|consen  125 NPFQGNL  131 (1049)
T ss_pred             cccCCce
Confidence            7766543


No 69 
>KOG0316 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=81.96  E-value=3.8  Score=40.90  Aligned_cols=77  Identities=14%  Similarity=0.172  Sum_probs=57.8

Q ss_pred             eeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEcCCC
Q 015021          310 GERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAPRK  389 (414)
Q Consensus       310 ~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyaprR  389 (414)
                      +.+++.||+++-..++-..+-+=|+|-.+|.++..+||+-+-+..==-...              +++   --|+-..-+
T Consensus       186 it~vs~s~d~nc~La~~l~stlrLlDk~tGklL~sYkGhkn~eykldc~l~--------------qsd---thV~sgSED  248 (307)
T KOG0316|consen  186 ITSVSFSKDGNCSLASSLDSTLRLLDKETGKLLKSYKGHKNMEYKLDCCLN--------------QSD---THVFSGSED  248 (307)
T ss_pred             ceeEEecCCCCEEEEeeccceeeecccchhHHHHHhcccccceeeeeeeec--------------ccc---eeEEeccCC
Confidence            567899999986666666678889999999999999999884432111110              111   267888999


Q ss_pred             CeEEEeeCCCCcce
Q 015021          390 GIIEHLAGMADENW  403 (414)
Q Consensus       390 g~lEvW~~~~g~~v  403 (414)
                      |.+-+|++-.++.+
T Consensus       249 G~Vy~wdLvd~~~~  262 (307)
T KOG0316|consen  249 GKVYFWDLVDETQI  262 (307)
T ss_pred             ceEEEEEeccceee
Confidence            99999999988876


No 70 
>KOG0282 consensus mRNA splicing factor [Function unknown]
Probab=81.84  E-value=1.2  Score=47.75  Aligned_cols=85  Identities=14%  Similarity=0.077  Sum_probs=66.5

Q ss_pred             ccccCCCceeeEEEECC-CCCEEEEEcCCCcEEEEEcCc-ceeEeeecccccceeeEEEEEecccccccccccCCCCCcc
Q 015021          301 TCLKDHPRKGERLTLSP-SGSLAAITDSLGRILLLDTQA-LVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDY  378 (414)
Q Consensus       301 ~~l~D~~R~~~~i~~sP-~~~laa~tD~lGRV~L~D~~~-~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~  378 (414)
                      +.+.+++..+.+|...| .+.|+...-.+|.|.|+|+-. +..+|-+.||+.+=      .+..  .        .+ ..
T Consensus       208 ~~~~gH~kgvsai~~fp~~~hLlLS~gmD~~vklW~vy~~~~~lrtf~gH~k~V------rd~~--~--------s~-~g  270 (503)
T KOG0282|consen  208 HNLSGHTKGVSAIQWFPKKGHLLLSGGMDGLVKLWNVYDDRRCLRTFKGHRKPV------RDAS--F--------NN-CG  270 (503)
T ss_pred             eeccCCccccchhhhccceeeEEEecCCCceEEEEEEecCcceehhhhcchhhh------hhhh--c--------cc-cC
Confidence            45889988899999999 889999999999999999998 99999999998731      1100  0        00 00


Q ss_pred             eEEEEEEcCCCCeEEEeeCCCCccee
Q 015021          379 CLCLAIHAPRKGIIEHLAGMADENWT  404 (414)
Q Consensus       379 ~l~LvIyaprRg~lEvW~~~~g~~v~  404 (414)
                        .-..-+.-++.|.+||+.+|+++.
T Consensus       271 --~~fLS~sfD~~lKlwDtETG~~~~  294 (503)
T KOG0282|consen  271 --TSFLSASFDRFLKLWDTETGQVLS  294 (503)
T ss_pred             --CeeeeeecceeeeeeccccceEEE
Confidence              122457779999999999999883


No 71 
>COG2319 FOG: WD40 repeat [General function prediction only]
Probab=81.61  E-value=18  Score=33.25  Aligned_cols=86  Identities=22%  Similarity=0.197  Sum_probs=61.5

Q ss_pred             cccCCCceeeEEEECCCCCEEEEEcC-CCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceE
Q 015021          302 CLKDHPRKGERLTLSPSGSLAAITDS-LGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCL  380 (414)
Q Consensus       302 ~l~D~~R~~~~i~~sP~~~laa~tD~-lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l  380 (414)
                      .+....-.+..+..+|++++.+.... .|.+.++|+..+..++..+|+.+    .+....-.+.+           +   
T Consensus       150 ~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~v~~~~~~~~~-----------~---  211 (466)
T COG2319         150 TLEGHSESVTSLAFSPDGKLLASGSSLDGTIKLWDLRTGKPLSTLAGHTD----PVSSLAFSPDG-----------G---  211 (466)
T ss_pred             EEecCcccEEEEEECCCCCEEEecCCCCCceEEEEcCCCceEEeeccCCC----ceEEEEEcCCc-----------c---
Confidence            35555555669999999987777775 99999999999999999999555    33332211111           1   


Q ss_pred             EEEEEcCCCCeEEEeeCCCCcceee
Q 015021          381 CLAIHAPRKGIIEHLAGMADENWTT  405 (414)
Q Consensus       381 ~LvIyaprRg~lEvW~~~~g~~v~~  405 (414)
                      ++++-...+|.+.+|+.+.++.+.+
T Consensus       212 ~~~~~~~~d~~i~~wd~~~~~~~~~  236 (466)
T COG2319         212 LLIASGSSDGTIRLWDLSTGKLLRS  236 (466)
T ss_pred             eEEEEecCCCcEEEEECCCCcEEee
Confidence            3444468889999999888887754


No 72 
>KOG2110 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=80.22  E-value=12  Score=39.16  Aligned_cols=82  Identities=20%  Similarity=0.220  Sum_probs=56.7

Q ss_pred             ceeeEEEECCCCCEEEE--EcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEE
Q 015021          308 RKGERLTLSPSGSLAAI--TDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIH  385 (414)
Q Consensus       308 R~~~~i~~sP~~~laa~--tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIy  385 (414)
                      +....+++++.+.|.|.  +-+.|.|+|+|+.+.+.+-+...|++    =+.+..-..+|               -|.--
T Consensus       130 ~gl~AlS~n~~n~ylAyp~s~t~GdV~l~d~~nl~~v~~I~aH~~----~lAalafs~~G---------------~llAT  190 (391)
T KOG2110|consen  130 KGLCALSPNNANCYLAYPGSTTSGDVVLFDTINLQPVNTINAHKG----PLAALAFSPDG---------------TLLAT  190 (391)
T ss_pred             cceEeeccCCCCceEEecCCCCCceEEEEEcccceeeeEEEecCC----ceeEEEECCCC---------------CEEEE
Confidence            34445555555678887  34589999999999999998877777    34443322222               14445


Q ss_pred             cCCCCe-EEEeeCCCCcceeeccceeeec
Q 015021          386 APRKGI-IEHLAGMADENWTTSLNYSVRQ  413 (414)
Q Consensus       386 aprRg~-lEvW~~~~g~~v~~~~~~~~~~  413 (414)
                      |.-+|. |.|+++.+|+++     |..|.
T Consensus       191 ASeKGTVIRVf~v~~G~kl-----~eFRR  214 (391)
T KOG2110|consen  191 ASEKGTVIRVFSVPEGQKL-----YEFRR  214 (391)
T ss_pred             eccCceEEEEEEcCCccEe-----eeeeC
Confidence            666664 579999999999     88874


No 73 
>KOG0293 consensus WD40 repeat-containing protein [Function unknown]
Probab=80.18  E-value=6.5  Score=41.84  Aligned_cols=261  Identities=19%  Similarity=0.215  Sum_probs=156.6

Q ss_pred             ecccccccCC-CCCcccCCCeeeeccCcceeeeeecceEEEE-eecCCCCCceeecccCCCCCCCeEEEEEeEEeCCcEE
Q 015021           15 CTDLSDLGAG-KEGWLVNDPNLLCALDMHTIALANRYQTVII-NWADPEGLVAKIRPELSPIASEYITAIEWLVFEEMRA   92 (414)
Q Consensus        15 ~~~~~~~g~~-~~~WL~~~~~~~~sp~~~~la~A~~~~~v~~-~w~~~~~~~v~~~g~l~~~~~e~ITs~~~lpl~dw~~   92 (414)
                      |.|-..+-.- +|.|+     +..|+.|.+||-|-...-++. .-..+.+  ++..-+|-- -.+-|..|.|=| +|-..
T Consensus       214 ~qt~qil~~htdEVWf-----l~FS~nGkyLAsaSkD~Taiiw~v~~d~~--~kl~~tlvg-h~~~V~yi~wSP-DdryL  284 (519)
T KOG0293|consen  214 SQTWQILQDHTDEVWF-----LQFSHNGKYLASASKDSTAIIWIVVYDVH--FKLKKTLVG-HSQPVSYIMWSP-DDRYL  284 (519)
T ss_pred             chhhhhHhhCCCcEEE-----EEEcCCCeeEeeccCCceEEEEEEecCcc--eeeeeeeec-ccCceEEEEECC-CCCeE
Confidence            4555555555 88888     347899999999887755552 2222222  444444411 135688888877 46789


Q ss_pred             EEEeccccEEEEEe-cCCcEEeecccCccceeEEEEecccCCCCCCCCCCeEEEEeCCeEEEEeChhHHHHHHHHHHhcc
Q 015021           93 LAVGTSRGYFLVYD-LKGDLVHRQLIHPGRILKLRVRGSRRDLTQDTAEEEVCVVMPGVLARFDGSEIQKMLQRWFQDSN  171 (414)
Q Consensus        93 I~VG~ssG~vrfyt-e~G~LL~sQ~lh~~pV~~ik~r~~~~~~~~~~~~eel~Ily~~~iv~IdG~sL~~~Lr~c~~~~~  171 (414)
                      +++|++.- ++..+ .+|++.+.=- | .-=.++.                =+.=||++.-+|-|..=    |.|     
T Consensus       285 laCg~~e~-~~lwDv~tgd~~~~y~-~-~~~~S~~----------------sc~W~pDg~~~V~Gs~d----r~i-----  336 (519)
T KOG0293|consen  285 LACGFDEV-LSLWDVDTGDLRHLYP-S-GLGFSVS----------------SCAWCPDGFRFVTGSPD----RTI-----  336 (519)
T ss_pred             EecCchHh-eeeccCCcchhhhhcc-c-CcCCCcc----------------eeEEccCCceeEecCCC----CcE-----
Confidence            99999887 77777 5777654210 0 0011122                23334444333333221    111     


Q ss_pred             ccccCCCCccCCCcccccccCCccceecccCCCCCceeeEEeCCCCCCchhhhccccceeEEEeCCCceeEEEEeccCCC
Q 015021          172 SNFWDQKPKQRDSEDLENSYERLPHQLWNVSKYGPCADAAITGLMPPPLMEVQSSQRYFCAVTIGEDSVISAFRLSEDRS  251 (414)
Q Consensus       172 ~~~w~~~~~~~~~~~~~~~~~~L~ykKW~l~~~~~i~Daa~~G~~~p~~~d~~s~~~~~~~i~vG~~P~la~y~~~~~~~  251 (414)
                       ..|+       ..++      + ..+|.-.+.-.+.|.+++            .++.+ +++++.+|-+..|..+.-..
T Consensus       337 -~~wd-------lDgn------~-~~~W~gvr~~~v~dlait------------~Dgk~-vl~v~~d~~i~l~~~e~~~d  388 (519)
T KOG0293|consen  337 -IMWD-------LDGN------I-LGNWEGVRDPKVHDLAIT------------YDGKY-VLLVTVDKKIRLYNREARVD  388 (519)
T ss_pred             -EEec-------CCcc------h-hhcccccccceeEEEEEc------------CCCcE-EEEEecccceeeechhhhhh
Confidence             1121       1111      1 344655544456666664            23444 68888888887775432110


Q ss_pred             cchhhhhhhhhhHHHHHHhhhhhhhhhcCCCCCCCCCCCCcccccCCCCccccCCCceeeEEEECCCCCEEEEEcCCCcE
Q 015021          252 RSLVGAILSKVVPATFSTISSLSKMIWRSEQSPKKSEPKPQSFARASPLTCLKDHPRKGERLTLSPSGSLAAITDSLGRI  331 (414)
Q Consensus       252 ~s~~~al~S~va~av~S~v~S~ak~~W~~~~~~~~~e~kp~~~~~a~pl~~l~D~~R~~~~i~~sP~~~laa~tD~lGRV  331 (414)
                      .                                                 .+....-.+.++++|-+|+||.+.=..--|
T Consensus       389 r-------------------------------------------------~lise~~~its~~iS~d~k~~LvnL~~qei  419 (519)
T KOG0293|consen  389 R-------------------------------------------------GLISEEQPITSFSISKDGKLALVNLQDQEI  419 (519)
T ss_pred             h-------------------------------------------------ccccccCceeEEEEcCCCcEEEEEcccCee
Confidence            0                                                 011122335788999999999998777789


Q ss_pred             EEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEcCCCCeEEEeeCCCCcceee
Q 015021          332 LLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAPRKGIIEHLAGMADENWTT  405 (414)
Q Consensus       332 ~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyaprRg~lEvW~~~~g~~v~~  405 (414)
                      -|+|++...++|-++|++--.+-=-.++...+             +   -+|-...-++-|-||+-.+|+.+++
T Consensus       420 ~LWDl~e~~lv~kY~Ghkq~~fiIrSCFgg~~-------------~---~fiaSGSED~kvyIWhr~sgkll~~  477 (519)
T KOG0293|consen  420 HLWDLEENKLVRKYFGHKQGHFIIRSCFGGGN-------------D---KFIASGSEDSKVYIWHRISGKLLAV  477 (519)
T ss_pred             EEeecchhhHHHHhhcccccceEEEeccCCCC-------------c---ceEEecCCCceEEEEEccCCceeEe
Confidence            99999999999999999975443223331111             1   1777888899999999999999875


No 74 
>KOG0646 consensus WD40 repeat protein [General function prediction only]
Probab=80.12  E-value=3.4  Score=44.10  Aligned_cols=71  Identities=14%  Similarity=0.157  Sum_probs=54.3

Q ss_pred             ceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecc-cccceeeEEEEEecccccccccccCCCCCcceEEEEEEc
Q 015021          308 RKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKG-YRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHA  386 (414)
Q Consensus       308 R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKG-yRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIya  386 (414)
                      -....|+.+|+|.|.+..---|.++||.+.+|.++.+|-+ |++     |....-.+++               -.+|-+
T Consensus        82 g~v~al~s~n~G~~l~ag~i~g~lYlWelssG~LL~v~~aHYQ~-----ITcL~fs~dg---------------s~iiTg  141 (476)
T KOG0646|consen   82 GPVHALASSNLGYFLLAGTISGNLYLWELSSGILLNVLSAHYQS-----ITCLKFSDDG---------------SHIITG  141 (476)
T ss_pred             cceeeeecCCCceEEEeecccCcEEEEEeccccHHHHHHhhccc-----eeEEEEeCCC---------------cEEEec
Confidence            4467788999999999988899999999999999999954 444     3333222222               156899


Q ss_pred             CCCCeEEEeeCC
Q 015021          387 PRKGIIEHLAGM  398 (414)
Q Consensus       387 prRg~lEvW~~~  398 (414)
                      .++|.|-||.+-
T Consensus       142 skDg~V~vW~l~  153 (476)
T KOG0646|consen  142 SKDGAVLVWLLT  153 (476)
T ss_pred             CCCccEEEEEEE
Confidence            999999999863


No 75 
>KOG0286 consensus G-protein beta subunit [General function prediction only]
Probab=79.65  E-value=9.2  Score=39.13  Aligned_cols=52  Identities=19%  Similarity=0.211  Sum_probs=46.2

Q ss_pred             CCccccCCCceeeEEEECC-CCCEEEEEcCCCcEEEEEcCcceeEeeeccccc
Q 015021          299 PLTCLKDHPRKGERLTLSP-SGSLAAITDSLGRILLLDTQALVVVRLWKGYRD  350 (414)
Q Consensus       299 pl~~l~D~~R~~~~i~~sP-~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRd  350 (414)
                      ....|.++.-.+-+|.++| ++++++..--++--.|+|+..+..++++-|+-.
T Consensus       178 ~~~~f~GH~gDV~slsl~p~~~ntFvSg~cD~~aklWD~R~~~c~qtF~ghes  230 (343)
T KOG0286|consen  178 QTQVFHGHTGDVMSLSLSPSDGNTFVSGGCDKSAKLWDVRSGQCVQTFEGHES  230 (343)
T ss_pred             EEEEecCCcccEEEEecCCCCCCeEEecccccceeeeeccCcceeEeeccccc
Confidence            3356889999999999999 889999988888999999999999999999864


No 76 
>KOG2321 consensus WD40 repeat protein [General function prediction only]
Probab=79.48  E-value=12  Score=41.27  Aligned_cols=112  Identities=21%  Similarity=0.199  Sum_probs=64.6

Q ss_pred             ecccccccCCCCCcccC-----CCeeeeccCcceeeeeecc-eEEEEeecCCCCCceeecccCCC-------CCCCeEEE
Q 015021           15 CTDLSDLGAGKEGWLVN-----DPNLLCALDMHTIALANRY-QTVIINWADPEGLVAKIRPELSP-------IASEYITA   81 (414)
Q Consensus        15 ~~~~~~~g~~~~~WL~~-----~~~~~~sp~~~~la~A~~~-~~v~~~w~~~~~~~v~~~g~l~~-------~~~e~ITs   81 (414)
                      ..|+=.|-+-...||+-     .+.-+++....+=.||.|. -=+|--|++-....   +|.|+.       +.+..--+
T Consensus       154 g~evYRlNLEqGrfL~P~~~~~~~lN~v~in~~hgLla~Gt~~g~VEfwDpR~ksr---v~~l~~~~~v~s~pg~~~~~s  230 (703)
T KOG2321|consen  154 GSEVYRLNLEQGRFLNPFETDSGELNVVSINEEHGLLACGTEDGVVEFWDPRDKSR---VGTLDAASSVNSHPGGDAAPS  230 (703)
T ss_pred             CcceEEEEccccccccccccccccceeeeecCccceEEecccCceEEEecchhhhh---heeeecccccCCCccccccCc
Confidence            34444444555667753     1333333333333344444 33445687765322   122211       01111222


Q ss_pred             EEeEEeCC-cEEEEEeccccEEEEEec--CCcEEeecccCccceeEEEEec
Q 015021           82 IEWLVFEE-MRALAVGTSRGYFLVYDL--KGDLVHRQLIHPGRILKLRVRG  129 (414)
Q Consensus        82 ~~~lpl~d-w~~I~VG~ssG~vrfyte--~G~LL~sQ~lh~~pV~~ik~r~  129 (414)
                      +..|.|.| -.=++|||++|+|.||+.  .--||...+.++.||.+|....
T Consensus       231 vTal~F~d~gL~~aVGts~G~v~iyDLRa~~pl~~kdh~~e~pi~~l~~~~  281 (703)
T KOG2321|consen  231 VTALKFRDDGLHVAVGTSTGSVLIYDLRASKPLLVKDHGYELPIKKLDWQD  281 (703)
T ss_pred             ceEEEecCCceeEEeeccCCcEEEEEcccCCceeecccCCccceeeecccc
Confidence            33346655 688999999999999995  6778888899999999999743


No 77 
>KOG3914 consensus WD repeat protein WDR4 [Function unknown]
Probab=79.26  E-value=5.9  Score=41.57  Aligned_cols=76  Identities=14%  Similarity=0.169  Sum_probs=62.3

Q ss_pred             eeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeee-cccccceeeEEEEEecccccccccccCCCCCcceEEEEEEcCC
Q 015021          310 GERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLW-KGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAPR  388 (414)
Q Consensus       310 ~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmw-KGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyapr  388 (414)
                      ...|++|||.+.+..+|.++.|=+....+--+|.=| =||++    |+...+..+.                ++.+.+.-
T Consensus       154 l~dVavS~D~~~IitaDRDEkIRvs~ypa~f~IesfclGH~e----FVS~isl~~~----------------~~LlS~sG  213 (390)
T KOG3914|consen  154 LLDVAVSPDDQFIITADRDEKIRVSRYPATFVIESFCLGHKE----FVSTISLTDN----------------YLLLSGSG  213 (390)
T ss_pred             hheeeecCCCCEEEEecCCceEEEEecCcccchhhhccccHh----heeeeeeccC----------------ceeeecCC
Confidence            346899999999999999999999888888777754 57888    8776654331                46789999


Q ss_pred             CCeEEEeeCCCCcceee
Q 015021          389 KGIIEHLAGMADENWTT  405 (414)
Q Consensus       389 Rg~lEvW~~~~g~~v~~  405 (414)
                      ++.|.+|+..+|++..|
T Consensus       214 D~tlr~Wd~~sgk~L~t  230 (390)
T KOG3914|consen  214 DKTLRLWDITSGKLLDT  230 (390)
T ss_pred             CCcEEEEecccCCcccc
Confidence            99999999999999854


No 78 
>KOG0286 consensus G-protein beta subunit [General function prediction only]
Probab=79.07  E-value=13  Score=38.17  Aligned_cols=76  Identities=12%  Similarity=-0.004  Sum_probs=57.5

Q ss_pred             eeeEEEECCCCCEEEEEcCCCcEEEEEcCcc------eeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEE
Q 015021          309 KGERLTLSPSGSLAAITDSLGRILLLDTQAL------VVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCL  382 (414)
Q Consensus       309 ~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~------~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~L  382 (414)
                      -+-..+.+|+|+++|+.-..--..++++.+.      .+.|+.+||+.    ++..-.-.+++                .
T Consensus        99 WVMtCA~sPSg~~VAcGGLdN~Csiy~ls~~d~~g~~~v~r~l~gHtg----ylScC~f~dD~----------------~  158 (343)
T KOG0286|consen   99 WVMTCAYSPSGNFVACGGLDNKCSIYPLSTRDAEGNVRVSRELAGHTG----YLSCCRFLDDN----------------H  158 (343)
T ss_pred             eEEEEEECCCCCeEEecCcCceeEEEecccccccccceeeeeecCccc----eeEEEEEcCCC----------------c
Confidence            4567899999999999755556777788766      78899999998    77665433322                3


Q ss_pred             EEEcCCCCeEEEeeCCCCccee
Q 015021          383 AIHAPRKGIIEHLAGMADENWT  404 (414)
Q Consensus       383 vIyaprRg~lEvW~~~~g~~v~  404 (414)
                      +|-+.-+...-+|++.+|+++.
T Consensus       159 ilT~SGD~TCalWDie~g~~~~  180 (343)
T KOG0286|consen  159 ILTGSGDMTCALWDIETGQQTQ  180 (343)
T ss_pred             eEecCCCceEEEEEcccceEEE
Confidence            4556667788899999999874


No 79 
>KOG1408 consensus WD40 repeat protein [Function unknown]
Probab=79.02  E-value=2.7  Score=47.35  Aligned_cols=83  Identities=23%  Similarity=0.126  Sum_probs=58.5

Q ss_pred             ccccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceE
Q 015021          301 TCLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCL  380 (414)
Q Consensus       301 ~~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l  380 (414)
                      .++.|++-.+.+|++||+|+..|..|..|-+=++|++.....-           ++++.|.+--   ...+  .+....+
T Consensus       453 ~s~~d~r~G~R~~~vSp~gqhLAsGDr~GnlrVy~Lq~l~~~~-----------~~eAHesEil---cLey--S~p~~~~  516 (1080)
T KOG1408|consen  453 VSTCDSRFGFRALAVSPDGQHLASGDRGGNLRVYDLQELEYTC-----------FMEAHESEIL---CLEY--SFPVLTN  516 (1080)
T ss_pred             hhhcCcccceEEEEECCCcceecccCccCceEEEEehhhhhhh-----------heecccceeE---EEee--cCchhhh
Confidence            4689999999999999999999999999999999999887544           4444432200   0000  0111112


Q ss_pred             EEEEEcCCCCeEEEeeCCC
Q 015021          381 CLAIHAPRKGIIEHLAGMA  399 (414)
Q Consensus       381 ~LvIyaprRg~lEvW~~~~  399 (414)
                      -|.-.|.|+.+|-|+|+..
T Consensus       517 kLLASasrdRlIHV~Dv~r  535 (1080)
T KOG1408|consen  517 KLLASASRDRLIHVYDVKR  535 (1080)
T ss_pred             HhhhhccCCceEEEEeccc
Confidence            3667888899999998764


No 80 
>KOG0285 consensus Pleiotropic regulator 1 [RNA processing and modification]
Probab=78.78  E-value=11  Score=39.70  Aligned_cols=84  Identities=17%  Similarity=0.089  Sum_probs=63.6

Q ss_pred             eeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEcCCC
Q 015021          310 GERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAPRK  389 (414)
Q Consensus       310 ~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyaprR  389 (414)
                      ..++.+-|+-..++..-++--+=++|+.+..-|.+.+|+|..-+.= .+.+. +                 --||-...+
T Consensus       238 V~~L~lhPTldvl~t~grDst~RvWDiRtr~~V~~l~GH~~~V~~V-~~~~~-d-----------------pqvit~S~D  298 (460)
T KOG0285|consen  238 VYCLDLHPTLDVLVTGGRDSTIRVWDIRTRASVHVLSGHTNPVASV-MCQPT-D-----------------PQVITGSHD  298 (460)
T ss_pred             eEEEeccccceeEEecCCcceEEEeeecccceEEEecCCCCcceeE-EeecC-C-----------------CceEEecCC
Confidence            5668888999888887777777789999999999999999954332 22111 1                 046888999


Q ss_pred             CeEEEeeCCCCcceeeccc--eeee
Q 015021          390 GIIEHLAGMADENWTTSLN--YSVR  412 (414)
Q Consensus       390 g~lEvW~~~~g~~v~~~~~--~~~~  412 (414)
                      +.|.+||++.|+-..|-.|  -|||
T Consensus       299 ~tvrlWDl~agkt~~tlt~hkksvr  323 (460)
T KOG0285|consen  299 STVRLWDLRAGKTMITLTHHKKSVR  323 (460)
T ss_pred             ceEEEeeeccCceeEeeecccceee
Confidence            9999999999998776543  4555


No 81 
>KOG1538 consensus Uncharacterized conserved protein WDR10, contains WD40 repeats [General function prediction only]
Probab=78.70  E-value=5.9  Score=44.47  Aligned_cols=40  Identities=28%  Similarity=0.425  Sum_probs=34.9

Q ss_pred             eeeEEEECCCC-CEEEEEcCCCcEEEEEcCcceeEeeeccccc
Q 015021          309 KGERLTLSPSG-SLAAITDSLGRILLLDTQALVVVRLWKGYRD  350 (414)
Q Consensus       309 ~~~~i~~sP~~-~laa~tD~lGRV~L~D~~~~~vvrmwKGyRd  350 (414)
                      .+..|+-.||| +|+.++|+  |++++|+..|..+.-.||+.|
T Consensus        14 ci~d~afkPDGsqL~lAAg~--rlliyD~ndG~llqtLKgHKD   54 (1081)
T KOG1538|consen   14 CINDIAFKPDGTQLILAAGS--RLLVYDTSDGTLLQPLKGHKD   54 (1081)
T ss_pred             chheeEECCCCceEEEecCC--EEEEEeCCCcccccccccccc
Confidence            45568999999 56666665  999999999999999999999


No 82 
>KOG0280 consensus Uncharacterized conserved protein [Amino acid transport and metabolism]
Probab=78.52  E-value=11  Score=38.53  Aligned_cols=77  Identities=18%  Similarity=0.215  Sum_probs=58.9

Q ss_pred             eeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEe--eecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEc
Q 015021          309 KGERLTLSPSGSLAAITDSLGRILLLDTQALVVVR--LWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHA  386 (414)
Q Consensus       309 ~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvr--mwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIya  386 (414)
                      ...++-.+|.+.-++++|+-|++..++-..+.+-.  -||++- ++ +|+-.....+.                =||.+.
T Consensus       123 ~~lslD~~~~~~~i~vs~s~G~~~~v~~t~~~le~vq~wk~He-~E-~Wta~f~~~~p----------------nlvytG  184 (339)
T KOG0280|consen  123 EALSLDISTSGTKIFVSDSRGSISGVYETEMVLEKVQTWKVHE-FE-AWTAKFSDKEP----------------NLVYTG  184 (339)
T ss_pred             eeeEEEeeccCceEEEEcCCCcEEEEecceeeeeecccccccc-ee-eeeeecccCCC----------------ceEEec
Confidence            56778888999999999999999999988888877  899873 23 36644432211                288899


Q ss_pred             CCCCeEEEeeCC-CCcce
Q 015021          387 PRKGIIEHLAGM-ADENW  403 (414)
Q Consensus       387 prRg~lEvW~~~-~g~~v  403 (414)
                      .-+|.|-.|++| .++.|
T Consensus       185 gDD~~l~~~D~R~p~~~i  202 (339)
T KOG0280|consen  185 GDDGSLSCWDIRIPKTFI  202 (339)
T ss_pred             CCCceEEEEEecCCccee
Confidence            999999999999 45544


No 83 
>KOG1240 consensus Protein kinase containing WD40 repeats [Signal transduction mechanisms]
Probab=78.35  E-value=8.6  Score=45.64  Aligned_cols=81  Identities=14%  Similarity=0.121  Sum_probs=56.8

Q ss_pred             eeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEcC--
Q 015021          310 GERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAP--  387 (414)
Q Consensus       310 ~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyap--  387 (414)
                      +.+|++||+++|+++.-+-|-+.|||+.-+..++-|+===+|+.-=+.+..-..           +.    .-.|.+.  
T Consensus      1198 vTSi~idp~~~WlviGts~G~l~lWDLRF~~~i~sw~~P~~~~i~~v~~~~~~~-----------~~----S~~vs~~~~ 1262 (1431)
T KOG1240|consen 1198 VTSIVIDPWCNWLVIGTSRGQLVLWDLRFRVPILSWEHPARAPIRHVWLCPTYP-----------QE----SVSVSAGSS 1262 (1431)
T ss_pred             eeEEEecCCceEEEEecCCceEEEEEeecCceeecccCcccCCcceEEeeccCC-----------CC----ceEEEeccc
Confidence            689999999999999999999999999999999999743333333222211100           00    1223333  


Q ss_pred             CCCeEEEeeCCCCcceee
Q 015021          388 RKGIIEHLAGMADENWTT  405 (414)
Q Consensus       388 rRg~lEvW~~~~g~~v~~  405 (414)
                      -.+-|++|+|-+|.+=++
T Consensus      1263 ~~nevs~wn~~~g~~~~v 1280 (1431)
T KOG1240|consen 1263 SNNEVSTWNMETGLRQTV 1280 (1431)
T ss_pred             CCCceeeeecccCcceEE
Confidence            368899999999987643


No 84 
>KOG1188 consensus WD40 repeat protein [General function prediction only]
Probab=78.13  E-value=3.7  Score=42.57  Aligned_cols=66  Identities=15%  Similarity=0.125  Sum_probs=53.7

Q ss_pred             CEEEEEcCCCcEEEEEcCcceeEeeecccccc--eeeEEEEEecccccccccccCCCCCcceEEEEEEcCCCCeEEEeeC
Q 015021          320 SLAAITDSLGRILLLDTQALVVVRLWKGYRDA--SCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAPRKGIIEHLAG  397 (414)
Q Consensus       320 ~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdA--q~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyaprRg~lEvW~~  397 (414)
                      +.+|+.=+-|-|=|+|..+|..++.+|||-++  +++|+...                   ....|+-+.-+|.|.+|++
T Consensus        41 ~~vav~lSngsv~lyd~~tg~~l~~fk~~~~~~N~vrf~~~d-------------------s~h~v~s~ssDG~Vr~wD~  101 (376)
T KOG1188|consen   41 TAVAVSLSNGSVRLYDKGTGQLLEEFKGPPATTNGVRFISCD-------------------SPHGVISCSSDGTVRLWDI  101 (376)
T ss_pred             eeEEEEecCCeEEEEeccchhhhheecCCCCcccceEEecCC-------------------CCCeeEEeccCCeEEEEEe
Confidence            67788888999999999999999999999875  44444310                   1248899999999999999


Q ss_pred             CCCccee
Q 015021          398 MADENWT  404 (414)
Q Consensus       398 ~~g~~v~  404 (414)
                      |.-.+++
T Consensus       102 Rs~~e~a  108 (376)
T KOG1188|consen  102 RSQAESA  108 (376)
T ss_pred             ecchhhh
Confidence            9877764


No 85 
>KOG0289 consensus mRNA splicing factor [General function prediction only]
Probab=78.03  E-value=13  Score=39.72  Aligned_cols=79  Identities=13%  Similarity=0.139  Sum_probs=56.2

Q ss_pred             ceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCc-ceEEEEEEc
Q 015021          308 RKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSD-YCLCLAIHA  386 (414)
Q Consensus       308 R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~-~~l~LvIya  386 (414)
                      -....+.+-|+|.|+..+++.|....-|..+|..+-+---- -.+|+--                ....| -.|.|.-|.
T Consensus       304 ~~V~~ls~h~tgeYllsAs~d~~w~Fsd~~~g~~lt~vs~~-~s~v~~t----------------s~~fHpDgLifgtgt  366 (506)
T KOG0289|consen  304 EPVTGLSLHPTGEYLLSASNDGTWAFSDISSGSQLTVVSDE-TSDVEYT----------------SAAFHPDGLIFGTGT  366 (506)
T ss_pred             ccceeeeeccCCcEEEEecCCceEEEEEccCCcEEEEEeec-cccceeE----------------EeeEcCCceEEeccC
Confidence            34567889999999999999999999999999776543110 0111111                11111 256788888


Q ss_pred             CCCCeEEEeeCCCCccee
Q 015021          387 PRKGIIEHLAGMADENWT  404 (414)
Q Consensus       387 prRg~lEvW~~~~g~~v~  404 (414)
                      | +|.|+||++..+..++
T Consensus       367 ~-d~~vkiwdlks~~~~a  383 (506)
T KOG0289|consen  367 P-DGVVKIWDLKSQTNVA  383 (506)
T ss_pred             C-CceEEEEEcCCccccc
Confidence            8 9999999999988664


No 86 
>KOG1446 consensus Histone H3 (Lys4) methyltransferase complex and RNA cleavage factor II complex, subunit SWD2 [RNA processing and modification; Chromatin structure and dynamics; Posttranslational modification, protein turnover, chaperones]
Probab=77.96  E-value=4.2  Score=41.49  Aligned_cols=39  Identities=15%  Similarity=0.180  Sum_probs=36.6

Q ss_pred             eEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccc
Q 015021          311 ERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYR  349 (414)
Q Consensus       311 ~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyR  349 (414)
                      ..-+-+|+++++.+++..|||.++++.+|..+..|+|-+
T Consensus       236 ~~a~ftPds~Fvl~gs~dg~i~vw~~~tg~~v~~~~~~~  274 (311)
T KOG1446|consen  236 LSATFTPDSKFVLSGSDDGTIHVWNLETGKKVAVLRGPN  274 (311)
T ss_pred             eeEEECCCCcEEEEecCCCcEEEEEcCCCcEeeEecCCC
Confidence            577889999999999999999999999999999999973


No 87 
>KOG0264 consensus Nucleosome remodeling factor, subunit CAF1/NURF55/MSI1 [Chromatin structure and dynamics]
Probab=77.86  E-value=10  Score=40.29  Aligned_cols=46  Identities=17%  Similarity=0.247  Sum_probs=38.7

Q ss_pred             CCceeeEEEECCCC-CEEEEEcCCCcEEEEEcCcce-eEeeecccccc
Q 015021          306 HPRKGERLTLSPSG-SLAAITDSLGRILLLDTQALV-VVRLWKGYRDA  351 (414)
Q Consensus       306 ~~R~~~~i~~sP~~-~laa~tD~lGRV~L~D~~~~~-vvrmwKGyRdA  351 (414)
                      +.+...++...|.+ .++|+.++.|+|.|+|+.++. .+.-..|++|+
T Consensus       271 h~~~vn~~~fnp~~~~ilAT~S~D~tV~LwDlRnL~~~lh~~e~H~de  318 (422)
T KOG0264|consen  271 HSAEVNCVAFNPFNEFILATGSADKTVALWDLRNLNKPLHTFEGHEDE  318 (422)
T ss_pred             cCCceeEEEeCCCCCceEEeccCCCcEEEeechhcccCceeccCCCcc
Confidence            34677889999998 677778999999999999875 57788999996


No 88 
>KOG0281 consensus Beta-TrCP (transducin repeats containing)/Slimb proteins [Function unknown]
Probab=77.67  E-value=1.9  Score=45.01  Aligned_cols=79  Identities=19%  Similarity=0.170  Sum_probs=59.2

Q ss_pred             ccccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceE
Q 015021          301 TCLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCL  380 (414)
Q Consensus       301 ~~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l  380 (414)
                      +.|.-++|.+-.  +--.|++++...++-.|=|+|+..|..+||..|+-|    -+.+.--.+           |     
T Consensus       354 Rtl~gHkRGIAC--lQYr~rlvVSGSSDntIRlwdi~~G~cLRvLeGHEe----LvRciRFd~-----------k-----  411 (499)
T KOG0281|consen  354 RTLNGHKRGIAC--LQYRDRLVVSGSSDNTIRLWDIECGACLRVLEGHEE----LVRCIRFDN-----------K-----  411 (499)
T ss_pred             hhhhccccccee--hhccCeEEEecCCCceEEEEeccccHHHHHHhchHH----hhhheeecC-----------c-----
Confidence            457777777654  446789999988889999999999999999999988    343321111           1     


Q ss_pred             EEEEEcCCCCeEEEeeCCCCcc
Q 015021          381 CLAIHAPRKGIIEHLAGMADEN  402 (414)
Q Consensus       381 ~LvIyaprRg~lEvW~~~~g~~  402 (414)
                       -++....+|.|+||+++.+..
T Consensus       412 -rIVSGaYDGkikvWdl~aald  432 (499)
T KOG0281|consen  412 -RIVSGAYDGKIKVWDLQAALD  432 (499)
T ss_pred             -eeeeccccceEEEEecccccC
Confidence             235566799999999998753


No 89 
>KOG0306 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=76.71  E-value=12  Score=42.49  Aligned_cols=115  Identities=16%  Similarity=0.188  Sum_probs=69.3

Q ss_pred             CeeeeccC--cceeee-eecceEEE-EeecCCCCCceeecccCCCCCCCeEEEEEeEEeCCcEEEEEeccccEEEEEe-c
Q 015021           33 PNLLCALD--MHTIAL-ANRYQTVI-INWADPEGLVAKIRPELSPIASEYITAIEWLVFEEMRALAVGTSRGYFLVYD-L  107 (414)
Q Consensus        33 ~~~~~sp~--~~~la~-A~~~~~v~-~~w~~~~~~~v~~~g~l~~~~~e~ITs~~~lpl~dw~~I~VG~ssG~vrfyt-e  107 (414)
                      ++....-.  |.++|+ |.++.++- ++-..-+ ..+.-   +.  .-..||++.  +.+|---+||||.+|.|++|+ +
T Consensus        24 ~~~~~~~~~~Gr~va~~a~E~vn~WdlRtge~~-~~l~~---~~--~k~evt~l~--~~~d~l~lAVGYaDGsVqif~~~   95 (888)
T KOG0306|consen   24 INFVVKRSGKGRAVAVSALEQVNIWDLRTGEIE-KKLIL---LK--KKAEVTCLR--SSDDILLLAVGYADGSVQIFSLE   95 (888)
T ss_pred             eeEEEeecCCCcEEEEeccccEeEEeeecchhh-hhhhh---hc--ccceEEEee--ccCCcceEEEEecCceEEeeccC
Confidence            44444444  888888 77776654 3321000 00100   00  112355542  337778899999999999999 6


Q ss_pred             CCcEEeecccCccceeEEEEecccCCCCCCCCCCeEEEEeC----CeEEEEeC
Q 015021          108 KGDLVHRQLIHPGRILKLRVRGSRRDLTQDTAEEEVCVVMP----GVLARFDG  156 (414)
Q Consensus       108 ~G~LL~sQ~lh~~pV~~ik~r~~~~~~~~~~~~eel~Ily~----~~iv~IdG  156 (414)
                      ++..++.=-.|...|.-|++-.-...+ -..+.+.-+|||.    .+..++.|
T Consensus        96 s~~~~~tfngHK~AVt~l~fd~~G~rl-aSGskDt~IIvwDlV~E~Gl~rL~G  147 (888)
T KOG0306|consen   96 SEEILITFNGHKAAVTTLKFDKIGTRL-ASGSKDTDIIVWDLVGEEGLFRLRG  147 (888)
T ss_pred             CCceeeeecccccceEEEEEcccCceE-eecCCCccEEEEEeccceeeEEeec
Confidence            888888777799999988876543211 1112344455554    45677777


No 90 
>PF03178 CPSF_A:  CPSF A subunit region;  InterPro: IPR004871 This family includes a region that lies towards the C terminus of the cleavage and polyadenylation specificity factor (CPSF) A (160 kDa) subunit. CPSF is involved in mRNA polyadenylation and binds the AAUAAA conserved sequence in pre-mRNA. CPSF has also been found to be necessary for splicing of single-intron pre-mRNAs []. The function of the aligned region is unknown but may be involved in RNA/DNA binding.; GO: 0003676 nucleic acid binding, 0005634 nucleus; PDB: 2B5M_A 4A0K_C 4A0B_C 3I7L_A 3I8E_A 4A09_A 4A0A_A 3EI4_C 2B5L_A 3I7O_A ....
Probab=76.16  E-value=23  Score=35.04  Aligned_cols=52  Identities=21%  Similarity=0.381  Sum_probs=38.5

Q ss_pred             CCCeEEEEEeEEeCC-----cEEEEEecc---------c-cEEEEEecCC--------cEEeecccCccceeEEEE
Q 015021           75 ASEYITAIEWLVFEE-----MRALAVGTS---------R-GYFLVYDLKG--------DLVHRQLIHPGRILKLRV  127 (414)
Q Consensus        75 ~~e~ITs~~~lpl~d-----w~~I~VG~s---------s-G~vrfyte~G--------~LL~sQ~lh~~pV~~ik~  127 (414)
                      ++|.++|++-+.+.+     ..+|+|||.         + |.+.+|+-.-        .++++.- .+.||..|..
T Consensus        22 ~~E~~~s~~~~~l~~~~~~~~~~ivVGT~~~~~~~~~~~~Gri~v~~i~~~~~~~~~l~~i~~~~-~~g~V~ai~~   96 (321)
T PF03178_consen   22 PNEHVTSLCSVKLKGDSTGKKEYIVVGTAFNYGEDPEPSSGRILVFEISESPENNFKLKLIHSTE-VKGPVTAICS   96 (321)
T ss_dssp             TTEEEEEEEEEEETTS---SSEEEEEEEEE--TTSSS-S-EEEEEEEECSS-----EEEEEEEEE-ESS-EEEEEE
T ss_pred             CCceEEEEEEEEEcCccccccCEEEEEecccccccccccCcEEEEEEEEcccccceEEEEEEEEe-ecCcceEhhh
Confidence            479999999999964     799999997         3 9999999544        2333333 3678888873


No 91 
>smart00320 WD40 WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Probab=75.89  E-value=5.3  Score=23.72  Aligned_cols=33  Identities=33%  Similarity=0.434  Sum_probs=26.4

Q ss_pred             ccCCCceeeEEEECCCCCEEEEEcCCCcEEEEE
Q 015021          303 LKDHPRKGERLTLSPSGSLAAITDSLGRILLLD  335 (414)
Q Consensus       303 l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D  335 (414)
                      +.........+...|.++++++....|.|.++|
T Consensus         8 ~~~~~~~i~~~~~~~~~~~~~~~~~d~~~~~~~   40 (40)
T smart00320        8 LKGHTGPVTSVAFSPDGKYLASASDDGTIKLWD   40 (40)
T ss_pred             EEecCCceeEEEECCCCCEEEEecCCCeEEEcC
Confidence            445556788999999999999988889887765


No 92 
>KOG1063 consensus RNA polymerase II elongator complex, subunit ELP2, WD repeat superfamily [Chromatin structure and dynamics; Transcription]
Probab=75.23  E-value=19  Score=40.56  Aligned_cols=56  Identities=21%  Similarity=0.230  Sum_probs=41.7

Q ss_pred             CCeEEEEEeEEe--CCc-EEEEEeccccEEEEEecC-----CcEE-------eec-ccCccceeEEEEeccc
Q 015021           76 SEYITAIEWLVF--EEM-RALAVGTSRGYFLVYDLK-----GDLV-------HRQ-LIHPGRILKLRVRGSR  131 (414)
Q Consensus        76 ~e~ITs~~~lpl--~dw-~~I~VG~ssG~vrfyte~-----G~LL-------~sQ-~lh~~pV~~ik~r~~~  131 (414)
                      ++.+|++.++|+  ++- -||+||+..|-+.+|+-.     ++..       +.+ +-|.+.|.+|+.|...
T Consensus       664 ~~aVTAv~~~~~~~~e~~~~vavGle~GeI~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~aV~rl~w~p~~  735 (764)
T KOG1063|consen  664 SLAVTAVAYLPVDHNEKGDVVAVGLEKGEIVLWRRKREHRQVTVGTFNLDTRLCATIGPDSAVNRLLWRPTC  735 (764)
T ss_pred             CCceeeEEeeccccccccceEEEEecccEEEEEecccccccccceeeeeccccccccChHHhhheeEecccc
Confidence            688999999999  333 499999999999999932     2221       222 4567899999988653


No 93 
>PF14779 BBS1:  Ciliary BBSome complex subunit 1
Probab=74.96  E-value=6.3  Score=39.34  Aligned_cols=53  Identities=21%  Similarity=0.310  Sum_probs=40.3

Q ss_pred             CeEEEEEeEEe-----CCcEEEEEeccccEEEEEecCCcEEeecccCccceeEEEEec
Q 015021           77 EYITAIEWLVF-----EEMRALAVGTSRGYFLVYDLKGDLVHRQLIHPGRILKLRVRG  129 (414)
Q Consensus        77 e~ITs~~~lpl-----~dw~~I~VG~ssG~vrfyte~G~LL~sQ~lh~~pV~~ik~r~  129 (414)
                      ..||++..|.-     ....|++|||-+|.|.+.+.++--++.|.-=+++-..|.+-.
T Consensus       177 t~ITcm~tikk~~~d~~a~scLViGTE~~~i~iLd~~af~il~~~~lpsvPv~i~~~G  234 (257)
T PF14779_consen  177 TVITCMATIKKSSADEDAVSCLVIGTESGEIYILDPQAFTILKQVQLPSVPVFISVSG  234 (257)
T ss_pred             ceeEEeeeecccccCCCCcceEEEEecCCeEEEECchhheeEEEEecCCCceEEEEEe
Confidence            46888888887     345999999999999999999999998864444333555443


No 94 
>KOG0310 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=74.44  E-value=6.3  Score=42.31  Aligned_cols=78  Identities=10%  Similarity=0.086  Sum_probs=59.8

Q ss_pred             CceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEc
Q 015021          307 PRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHA  386 (414)
Q Consensus       307 ~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIya  386 (414)
                      ++..-+++..-||+|+|+.|.-|-|=++|..+..++|..+++-. -...+....             .    ..-+...+
T Consensus        68 k~~v~s~~fR~DG~LlaaGD~sG~V~vfD~k~r~iLR~~~ah~a-pv~~~~f~~-------------~----d~t~l~s~  129 (487)
T KOG0310|consen   68 KDVVYSVDFRSDGRLLAAGDESGHVKVFDMKSRVILRQLYAHQA-PVHVTKFSP-------------Q----DNTMLVSG  129 (487)
T ss_pred             ccceeEEEeecCCeEEEccCCcCcEEEeccccHHHHHHHhhccC-ceeEEEecc-------------c----CCeEEEec
Confidence            46678889999999999999999999999999999999988753 222222211             1    11355778


Q ss_pred             CCCCeEEEeeCCCCcc
Q 015021          387 PRKGIIEHLAGMADEN  402 (414)
Q Consensus       387 prRg~lEvW~~~~g~~  402 (414)
                      .-++++.+|++-++.-
T Consensus       130 sDd~v~k~~d~s~a~v  145 (487)
T KOG0310|consen  130 SDDKVVKYWDLSTAYV  145 (487)
T ss_pred             CCCceEEEEEcCCcEE
Confidence            8889999999988763


No 95 
>PRK01742 tolB translocation protein TolB; Provisional
Probab=74.02  E-value=10  Score=39.39  Aligned_cols=44  Identities=14%  Similarity=0.106  Sum_probs=32.6

Q ss_pred             eEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeE
Q 015021          311 ERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVF  355 (414)
Q Consensus       311 ~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~w  355 (414)
                      .....||+|++++.+.. .+++++|+.+|...++.+++.+-.+.|
T Consensus       336 ~~~~~SpDG~~ia~~~~-~~i~~~Dl~~g~~~~lt~~~~~~~~~~  379 (429)
T PRK01742        336 YSAQISADGKTLVMING-DNVVKQDLTSGSTEVLSSTFLDESPSI  379 (429)
T ss_pred             CCccCCCCCCEEEEEcC-CCEEEEECCCCCeEEecCCCCCCCceE
Confidence            35778999998877755 568889999998877777765533333


No 96 
>PF00780 CNH:  CNH domain;  InterPro: IPR001180 Based on sequence similarities a domain of homology has been identified in the following proteins []:  Citron and Citron kinase. These two proteins interact with the GTP-bound forms of the small GTPases Rho and Rac but not with Cdc42. Myotonic dystrophy kinase-related Cdc42-binding kinase (MRCKalpha). This serine/threonine kinase interacts with the GTP-bound form of the small GTPase Cdc42 and to a lesser extent with that of Rac. NCK Interacting Kinase (NIK), a serine/threonine protein kinase. ROM-1 and ROM-2, from yeast. These proteins are GDP/GTP exchange proteins (GEPs) for the small GTP binding protein Rho1.  This domain, called the citron homology domain, is often found after cysteine rich and pleckstrin homology (PH) domains at the C-terminal end of the proteins []. It acts as a regulatory domain and could be involved in macromolecular interactions [, ].; GO: 0005083 small GTPase regulator activity
Probab=73.96  E-value=25  Score=33.52  Aligned_cols=68  Identities=25%  Similarity=0.450  Sum_probs=46.9

Q ss_pred             ccCcceeeeeecceEEEEeecCCCCCceeecccCCCCCCCeEEEEEeEEeCCcEEEEEeccccEEEEEecCCcE
Q 015021           38 ALDMHTIALANRYQTVIINWADPEGLVAKIRPELSPIASEYITAIEWLVFEEMRALAVGTSRGYFLVYDLKGDL  111 (414)
Q Consensus        38 sp~~~~la~A~~~~~v~~~w~~~~~~~v~~~g~l~~~~~e~ITs~~~lpl~dw~~I~VG~ssG~vrfyte~G~L  111 (414)
                      ......|++|.+.++.+.+|......-.....++.-  .+.++++.|+    ...|+||+.++|..+=-++|..
T Consensus       102 ~~~~~~L~va~kk~i~i~~~~~~~~~f~~~~ke~~l--p~~~~~i~~~----~~~i~v~~~~~f~~idl~~~~~  169 (275)
T PF00780_consen  102 HEGSRRLCVAVKKKILIYEWNDPRNSFSKLLKEISL--PDPPSSIAFL----GNKICVGTSKGFYLIDLNTGSP  169 (275)
T ss_pred             cccceEEEEEECCEEEEEEEECCcccccceeEEEEc--CCCcEEEEEe----CCEEEEEeCCceEEEecCCCCc
Confidence            566778899999999999998753211022233433  2668888888    4588899999987765565654


No 97 
>KOG0273 consensus Beta-transducin family (WD-40 repeat) protein [Chromatin structure and dynamics]
Probab=73.95  E-value=17  Score=39.16  Aligned_cols=55  Identities=22%  Similarity=0.433  Sum_probs=46.1

Q ss_pred             CCCeEEEEEeEEeCCcEEEEEeccccEEEEEecCCcEEeecccCccceeEEEEeccc
Q 015021           75 ASEYITAIEWLVFEEMRALAVGTSRGYFLVYDLKGDLVHRQLIHPGRILKLRVRGSR  131 (414)
Q Consensus        75 ~~e~ITs~~~lpl~dw~~I~VG~ssG~vrfyte~G~LL~sQ~lh~~pV~~ik~r~~~  131 (414)
                      .+.-||++.|=  .|-+.+|.|+-+|++|+|+.+|.|+.+--+|..||.+||-+...
T Consensus       234 ~nkdVT~L~Wn--~~G~~LatG~~~G~~riw~~~G~l~~tl~~HkgPI~slKWnk~G  288 (524)
T KOG0273|consen  234 SNKDVTSLDWN--NDGTLLATGSEDGEARIWNKDGNLISTLGQHKGPIFSLKWNKKG  288 (524)
T ss_pred             ccCCcceEEec--CCCCeEEEeecCcEEEEEecCchhhhhhhccCCceEEEEEcCCC
Confidence            34778888442  23479999999999999999999999999999999999966543


No 98 
>KOG0265 consensus U5 snRNP-specific protein-like factor and related proteins [RNA processing and modification]
Probab=73.78  E-value=22  Score=36.48  Aligned_cols=86  Identities=19%  Similarity=0.152  Sum_probs=60.7

Q ss_pred             CCccccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcce-eEeeecccccceeeEEEEEecccccccccccCCCCCc
Q 015021          299 PLTCLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALV-VVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSD  377 (414)
Q Consensus       299 pl~~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~-vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~  377 (414)
                      |+--++-++-++..+..+|+|.++|..-.+--|+|+++-..+ =.-..|||-.|=+.=--+.   +           +. 
T Consensus        39 p~m~l~gh~geI~~~~F~P~gs~~aSgG~Dr~I~LWnv~gdceN~~~lkgHsgAVM~l~~~~---d-----------~s-  103 (338)
T KOG0265|consen   39 PIMLLPGHKGEIYTIKFHPDGSCFASGGSDRAIVLWNVYGDCENFWVLKGHSGAVMELHGMR---D-----------GS-  103 (338)
T ss_pred             hhhhcCCCcceEEEEEECCCCCeEeecCCcceEEEEeccccccceeeeccccceeEeeeecc---C-----------CC-
Confidence            333477888999999999999999999888889998854332 2234567777644322211   1           11 


Q ss_pred             ceEEEEEEcCCCCeEEEeeCCCCcce
Q 015021          378 YCLCLAIHAPRKGIIEHLAGMADENW  403 (414)
Q Consensus       378 ~~l~LvIyaprRg~lEvW~~~~g~~v  403 (414)
                          -+.-+.-+-.|-+||+++|+++
T Consensus       104 ----~i~S~gtDk~v~~wD~~tG~~~  125 (338)
T KOG0265|consen  104 ----HILSCGTDKTVRGWDAETGKRI  125 (338)
T ss_pred             ----EEEEecCCceEEEEecccceee
Confidence                3456777889999999999987


No 99 
>KOG1539 consensus WD repeat protein [General function prediction only]
Probab=73.69  E-value=15  Score=41.96  Aligned_cols=58  Identities=17%  Similarity=0.183  Sum_probs=47.6

Q ss_pred             ccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecc
Q 015021          303 LKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNK  362 (414)
Q Consensus       303 l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~  362 (414)
                      |.-+..++...+.||+|+|.++++.++.|=++|+-+|..|-+.  .=|.-|-.+...++.
T Consensus       572 f~gh~nritd~~FS~DgrWlisasmD~tIr~wDlpt~~lID~~--~vd~~~~sls~SPng  629 (910)
T KOG1539|consen  572 FWGHGNRITDMTFSPDGRWLISASMDSTIRTWDLPTGTLIDGL--LVDSPCTSLSFSPNG  629 (910)
T ss_pred             hhccccceeeeEeCCCCcEEEEeecCCcEEEEeccCcceeeeE--ecCCcceeeEECCCC
Confidence            4555677889999999999999999999999999999999887  346666666655443


No 100
>KOG0275 consensus Conserved WD40 repeat-containing protein [General function prediction only]
Probab=73.30  E-value=4.9  Score=41.65  Aligned_cols=87  Identities=16%  Similarity=0.020  Sum_probs=58.9

Q ss_pred             CCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEE
Q 015021          306 HPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIH  385 (414)
Q Consensus       306 ~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIy  385 (414)
                      -+-+.++-..||||+|.+...-+|-|=+||-.+|.+-+=.|=  .||=.|+.+.+....-.-     .+  +.  =.+--
T Consensus       212 ~KSh~EcA~FSPDgqyLvsgSvDGFiEVWny~~GKlrKDLkY--QAqd~fMMmd~aVlci~F-----SR--Ds--EMlAs  280 (508)
T KOG0275|consen  212 QKSHVECARFSPDGQYLVSGSVDGFIEVWNYTTGKLRKDLKY--QAQDNFMMMDDAVLCISF-----SR--DS--EMLAS  280 (508)
T ss_pred             cccchhheeeCCCCceEeeccccceeeeehhccchhhhhhhh--hhhcceeecccceEEEee-----cc--cH--HHhhc
Confidence            355788999999999999999999999999999987665542  355566655422100000     00  00  01123


Q ss_pred             cCCCCeEEEeeCCCCcce
Q 015021          386 APRKGIIEHLAGMADENW  403 (414)
Q Consensus       386 aprRg~lEvW~~~~g~~v  403 (414)
                      ...+|-|+||.+++|.|+
T Consensus       281 GsqDGkIKvWri~tG~Cl  298 (508)
T KOG0275|consen  281 GSQDGKIKVWRIETGQCL  298 (508)
T ss_pred             cCcCCcEEEEEEecchHH
Confidence            456899999999999987


No 101
>KOG0296 consensus Angio-associated migratory cell protein (contains WD40 repeats) [Function unknown]
Probab=72.65  E-value=9.6  Score=39.89  Aligned_cols=53  Identities=21%  Similarity=0.283  Sum_probs=45.0

Q ss_pred             CeEEEEEeEEe-CCcEEEEEeccccEEEEEecCCcEEeecccCccceeEEEEec
Q 015021           77 EYITAIEWLVF-EEMRALAVGTSRGYFLVYDLKGDLVHRQLIHPGRILKLRVRG  129 (414)
Q Consensus        77 e~ITs~~~lpl-~dw~~I~VG~ssG~vrfyte~G~LL~sQ~lh~~pV~~ik~r~  129 (414)
                      +.+-|++.+|+ .....-|+|.-+|-|.+|+-.-..|+.++-|+++|.+|+.-.
T Consensus       284 ~~~esve~~~~ss~lpL~A~G~vdG~i~iyD~a~~~~R~~c~he~~V~~l~w~~  337 (399)
T KOG0296|consen  284 ELDESVESIPSSSKLPLAACGSVDGTIAIYDLAASTLRHICEHEDGVTKLKWLN  337 (399)
T ss_pred             hhhhhhhhcccccccchhhcccccceEEEEecccchhheeccCCCceEEEEEcC
Confidence            55666666676 445678999999999999999999999999999999999655


No 102
>KOG2048 consensus WD40 repeat protein [General function prediction only]
Probab=72.59  E-value=23  Score=39.68  Aligned_cols=80  Identities=14%  Similarity=0.142  Sum_probs=63.4

Q ss_pred             CceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEe-----eecc-cccceeeEEEEEecccccccccccCCCCCcceE
Q 015021          307 PRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVR-----LWKG-YRDASCVFMEMLVNKDAATSSAYYAPVKSDYCL  380 (414)
Q Consensus       307 ~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvr-----mwKG-yRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l  380 (414)
                      +-+..+|.-.|++.-+|..-.+|+|.++|+..+.-+.     +-+- -|+.-|.|-......                  
T Consensus       154 ~sRvLslsw~~~~~~i~~Gs~Dg~Iriwd~~~~~t~~~~~~~~d~l~k~~~~iVWSv~~Lrd------------------  215 (691)
T KOG2048|consen  154 KSRVLSLSWNPTGTKIAGGSIDGVIRIWDVKSGQTLHIITMQLDRLSKREPTIVWSVLFLRD------------------  215 (691)
T ss_pred             cceEEEEEecCCccEEEecccCceEEEEEcCCCceEEEeeecccccccCCceEEEEEEEeec------------------
Confidence            4557788888999988888899999999999999887     4443 348889997665422                  


Q ss_pred             EEEEEcCCCCeEEEeeCCCCccee
Q 015021          381 CLAIHAPRKGIIEHLAGMADENWT  404 (414)
Q Consensus       381 ~LvIyaprRg~lEvW~~~~g~~v~  404 (414)
                      -.+|-.=.+|.|.+||-++|-.+.
T Consensus       216 ~tI~sgDS~G~V~FWd~~~gTLiq  239 (691)
T KOG2048|consen  216 STIASGDSAGTVTFWDSIFGTLIQ  239 (691)
T ss_pred             CcEEEecCCceEEEEcccCcchhh
Confidence            156788889999999999998763


No 103
>KOG0274 consensus Cdc4 and related F-box and WD-40 proteins [General function prediction only]
Probab=72.47  E-value=12  Score=40.80  Aligned_cols=78  Identities=12%  Similarity=0.041  Sum_probs=57.2

Q ss_pred             CCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEE
Q 015021          306 HPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIH  385 (414)
Q Consensus       306 ~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIy  385 (414)
                      +.-.+..+.++  +.+.++.-..|.|.++|+.++..++-.+|+-.    |+.......           .     -.++-
T Consensus       330 h~~~V~~v~~~--~~~lvsgs~d~~v~VW~~~~~~cl~sl~gH~~----~V~sl~~~~-----------~-----~~~~S  387 (537)
T KOG0274|consen  330 HTGPVNCVQLD--EPLLVSGSYDGTVKVWDPRTGKCLKSLSGHTG----RVYSLIVDS-----------E-----NRLLS  387 (537)
T ss_pred             ccccEEEEEec--CCEEEEEecCceEEEEEhhhceeeeeecCCcc----eEEEEEecC-----------c-----ceEEe
Confidence            55556677777  66666655566999999999999999999777    666542211           0     14466


Q ss_pred             cCCCCeEEEeeCCCC-cceee
Q 015021          386 APRKGIIEHLAGMAD-ENWTT  405 (414)
Q Consensus       386 aprRg~lEvW~~~~g-~~v~~  405 (414)
                      ..-++.|++||+.++ +|+.|
T Consensus       388 gs~D~~IkvWdl~~~~~c~~t  408 (537)
T KOG0274|consen  388 GSLDTTIKVWDLRTKRKCIHT  408 (537)
T ss_pred             eeeccceEeecCCchhhhhhh
Confidence            777899999999999 88744


No 104
>KOG1963 consensus WD40 repeat protein [General function prediction only]
Probab=71.38  E-value=12  Score=42.69  Aligned_cols=78  Identities=21%  Similarity=0.099  Sum_probs=45.0

Q ss_pred             eeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEcCCC
Q 015021          310 GERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAPRK  389 (414)
Q Consensus       310 ~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyaprR  389 (414)
                      .+...+||+++++|+.|+.|||+++---.-    -=+-+----+.|.-........         ..+.+  .+..+.+-
T Consensus       208 ~t~~~~spn~~~~Aa~d~dGrI~vw~d~~~----~~~~~t~t~lHWH~~~V~~L~f---------S~~G~--~LlSGG~E  272 (792)
T KOG1963|consen  208 ITCVALSPNERYLAAGDSDGRILVWRDFGS----SDDSETCTLLHWHHDEVNSLSF---------SSDGA--YLLSGGRE  272 (792)
T ss_pred             ceeEEeccccceEEEeccCCcEEEEecccc----ccccccceEEEecccccceeEE---------ecCCc--eEeecccc
Confidence            678999999999999999999876532220    0000001112366211111100         00111  34567778


Q ss_pred             CeEEEeeCCCCcc
Q 015021          390 GIIEHLAGMADEN  402 (414)
Q Consensus       390 g~lEvW~~~~g~~  402 (414)
                      |.|-+|.+.++++
T Consensus       273 ~VLv~Wq~~T~~k  285 (792)
T KOG1963|consen  273 GVLVLWQLETGKK  285 (792)
T ss_pred             eEEEEEeecCCCc
Confidence            9999999988764


No 105
>COG2319 FOG: WD40 repeat [General function prediction only]
Probab=70.66  E-value=36  Score=31.34  Aligned_cols=80  Identities=24%  Similarity=0.255  Sum_probs=57.7

Q ss_pred             ccCCCceeeEEEECCCCC-EEEEEcCCCcEEEEEcCcceeEe-eecccccceeeEEEEEecccccccccccCCCCCcceE
Q 015021          303 LKDHPRKGERLTLSPSGS-LAAITDSLGRILLLDTQALVVVR-LWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCL  380 (414)
Q Consensus       303 l~D~~R~~~~i~~sP~~~-laa~tD~lGRV~L~D~~~~~vvr-mwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l  380 (414)
                      +.........+..+|+++ +++.....|.|.++|...+..++ .+.|+.+-.   +.....            .  .   
T Consensus       194 ~~~~~~~v~~~~~~~~~~~~~~~~~~d~~i~~wd~~~~~~~~~~~~~~~~~~---~~~~~~------------~--~---  253 (466)
T COG2319         194 LAGHTDPVSSLAFSPDGGLLIASGSSDGTIRLWDLSTGKLLRSTLSGHSDSV---VSSFSP------------D--G---  253 (466)
T ss_pred             eccCCCceEEEEEcCCcceEEEEecCCCcEEEEECCCCcEEeeecCCCCcce---eEeECC------------C--C---
Confidence            444678888899999998 44445999999999999999999 788888842   111111            0  0   


Q ss_pred             EEEEEcCCCCeEEEeeCCCCcc
Q 015021          381 CLAIHAPRKGIIEHLAGMADEN  402 (414)
Q Consensus       381 ~LvIyaprRg~lEvW~~~~g~~  402 (414)
                      -+++-....+.+.+|+++....
T Consensus       254 ~~~~~~~~d~~~~~~~~~~~~~  275 (466)
T COG2319         254 SLLASGSSDGTIRLWDLRSSSS  275 (466)
T ss_pred             CEEEEecCCCcEEEeeecCCCc
Confidence            1333778889999999987765


No 106
>KOG4328 consensus WD40 protein [Function unknown]
Probab=69.86  E-value=1.7e+02  Score=31.80  Aligned_cols=80  Identities=19%  Similarity=0.340  Sum_probs=57.2

Q ss_pred             CCeEEEEEeEEeCCcEEEEEeccccEEEEEec-------CCcEEeecccCccceeEEEEecccCCCCCCCCCCeEEEEeC
Q 015021           76 SEYITAIEWLVFEEMRALAVGTSRGYFLVYDL-------KGDLVHRQLIHPGRILKLRVRGSRRDLTQDTAEEEVCVVMP  148 (414)
Q Consensus        76 ~e~ITs~~~lpl~dw~~I~VG~ssG~vrfyte-------~G~LL~sQ~lh~~pV~~ik~r~~~~~~~~~~~~eel~Ily~  148 (414)
                      .++||++.+.|-..-..|++|=..|.|-|..-       .|..+|  .-|-.||-.|+|.-.-.       +.-++--|.
T Consensus       186 ~~Rit~l~fHPt~~~~lva~GdK~G~VG~Wn~~~~~~d~d~v~~f--~~hs~~Vs~l~F~P~n~-------s~i~ssSyD  256 (498)
T KOG4328|consen  186 DRRITSLAFHPTENRKLVAVGDKGGQVGLWNFGTQEKDKDGVYLF--TPHSGPVSGLKFSPANT-------SQIYSSSYD  256 (498)
T ss_pred             ccceEEEEecccCcceEEEEccCCCcEEEEecCCCCCccCceEEe--ccCCccccceEecCCCh-------hheeeeccC
Confidence            58999999999999899999999999999986       344544  35668999999765433       444566666


Q ss_pred             CeE--EEEeChhHHHHHH
Q 015021          149 GVL--ARFDGSEIQKMLQ  164 (414)
Q Consensus       149 ~~i--v~IdG~sL~~~Lr  164 (414)
                      +.|  .-+.+.-+..+|+
T Consensus       257 GtiR~~D~~~~i~e~v~s  274 (498)
T KOG4328|consen  257 GTIRLQDFEGNISEEVLS  274 (498)
T ss_pred             ceeeeeeecchhhHHHhh
Confidence            665  3344444444433


No 107
>KOG0267 consensus Microtubule severing protein katanin p80 subunit B (contains WD40 repeats) [Cell cycle control, cell division, chromosome partitioning]
Probab=69.71  E-value=6.1  Score=44.46  Aligned_cols=83  Identities=18%  Similarity=0.104  Sum_probs=59.3

Q ss_pred             cccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEE
Q 015021          302 CLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLC  381 (414)
Q Consensus       302 ~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~  381 (414)
                      .+.-+.|....+.++|+|+|.|..-..--|=++|+.+|.++.-+|++-. +..=++..                  ..=|
T Consensus       149 ~~~s~~~vv~~l~lsP~Gr~v~~g~ed~tvki~d~~agk~~~ef~~~e~-~v~sle~h------------------p~e~  209 (825)
T KOG0267|consen  149 TYKSHTRVVDVLRLSPDGRWVASGGEDNTVKIWDLTAGKLSKEFKSHEG-KVQSLEFH------------------PLEV  209 (825)
T ss_pred             eecCCcceeEEEeecCCCceeeccCCcceeeeecccccccccccccccc-cccccccC------------------chhh
Confidence            3455778899999999999999965577888899999999988887543 22222211                  1114


Q ss_pred             EEEEcCCCCeEEEeeCCCCcce
Q 015021          382 LAIHAPRKGIIEHLAGMADENW  403 (414)
Q Consensus       382 LvIyaprRg~lEvW~~~~g~~v  403 (414)
                      |.=-...++++.+|++.+.+-|
T Consensus       210 Lla~Gs~d~tv~f~dletfe~I  231 (825)
T KOG0267|consen  210 LLAPGSSDRTVRFWDLETFEVI  231 (825)
T ss_pred             hhccCCCCceeeeeccceeEEe
Confidence            4455567889999999977766


No 108
>KOG0294 consensus WD40 repeat-containing protein [Function unknown]
Probab=69.35  E-value=34  Score=35.48  Aligned_cols=50  Identities=18%  Similarity=0.210  Sum_probs=42.6

Q ss_pred             EEeEEeCCcEEEEEeccccEEEEEecC-CcEEeecccCccceeEEEEeccc
Q 015021           82 IEWLVFEEMRALAVGTSRGYFLVYDLK-GDLVHRQLIHPGRILKLRVRGSR  131 (414)
Q Consensus        82 ~~~lpl~dw~~I~VG~ssG~vrfyte~-G~LL~sQ~lh~~pV~~ik~r~~~  131 (414)
                      +.|+++.+-..++||-+++.+++++.+ +.-++.-.-|+.+|..|..-+.+
T Consensus       211 ~l~~~~l~~~~L~vG~d~~~i~~~D~ds~~~~~~~~AH~~RVK~i~~~~~~  261 (362)
T KOG0294|consen  211 ILCATFLDGSELLVGGDNEWISLKDTDSDTPLTEFLAHENRVKDIASYTNP  261 (362)
T ss_pred             ceeeeecCCceEEEecCCceEEEeccCCCccceeeecchhheeeeEEEecC
Confidence            456777788899999999999999975 88888889999999999966654


No 109
>KOG1517 consensus Guanine nucleotide binding protein MIP1 [Cell cycle control, cell division, chromosome partitioning]
Probab=67.94  E-value=34  Score=40.57  Aligned_cols=57  Identities=23%  Similarity=0.179  Sum_probs=40.4

Q ss_pred             CCCeEEEEEeEEeCCcEEEEEeccccEEEEEec------CCcEEeecccCccceeEEEEecccC
Q 015021           75 ASEYITAIEWLVFEEMRALAVGTSRGYFLVYDL------KGDLVHRQLIHPGRILKLRVRGSRR  132 (414)
Q Consensus        75 ~~e~ITs~~~lpl~dw~~I~VG~ssG~vrfyte------~G~LL~sQ~lh~~pV~~ik~r~~~~  132 (414)
                      .+.+|||+.+=- ..-..||+||.+|.||+|+.      .+....+|.=..+||+.+++|....
T Consensus      1207 s~t~vTaLS~~~-~~gn~i~AGfaDGsvRvyD~R~a~~ds~v~~~R~h~~~~~Iv~~slq~~G~ 1269 (1387)
T KOG1517|consen 1207 SSTLVTALSADL-VHGNIIAAGFADGSVRVYDRRMAPPDSLVCVYREHNDVEPIVHLSLQRQGL 1269 (1387)
T ss_pred             CCccceeecccc-cCCceEEEeecCCceEEeecccCCccccceeecccCCcccceeEEeecCCC
Confidence            356777775522 23579999999999999994      3455555554445699999988664


No 110
>KOG1274 consensus WD40 repeat protein [General function prediction only]
Probab=67.85  E-value=6  Score=45.37  Aligned_cols=46  Identities=22%  Similarity=0.410  Sum_probs=40.4

Q ss_pred             ccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeeccc
Q 015021          303 LKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGY  348 (414)
Q Consensus       303 l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGy  348 (414)
                      |.-++-....|..+|.+.+.|+++-.|-|.+||++.+.+..+|+|-
T Consensus       134 lrgh~apVl~l~~~p~~~fLAvss~dG~v~iw~~~~~~~~~tl~~v  179 (933)
T KOG1274|consen  134 LRGHDAPVLQLSYDPKGNFLAVSSCDGKVQIWDLQDGILSKTLTGV  179 (933)
T ss_pred             ecccCCceeeeeEcCCCCEEEEEecCceEEEEEcccchhhhhcccC
Confidence            3334455678999999999999999999999999999999999985


No 111
>PF12894 Apc4_WD40:  Anaphase-promoting complex subunit 4 WD40 domain
Probab=66.24  E-value=15  Score=27.27  Aligned_cols=30  Identities=17%  Similarity=0.222  Sum_probs=27.0

Q ss_pred             eeEEEECCCCCEEEEEcCCCcEEEEEcCcce
Q 015021          310 GERLTLSPSGSLAAITDSLGRILLLDTQALV  340 (414)
Q Consensus       310 ~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~  340 (414)
                      +..++-||+..|.|++...|.|++..+ +++
T Consensus        14 v~~~~w~P~mdLiA~~t~~g~v~v~Rl-~~q   43 (47)
T PF12894_consen   14 VSCMSWCPTMDLIALGTEDGEVLVYRL-NWQ   43 (47)
T ss_pred             EEEEEECCCCCEEEEEECCCeEEEEEC-CCc
Confidence            678999999999999999999999998 444


No 112
>KOG2695 consensus WD40 repeat protein [General function prediction only]
Probab=66.16  E-value=12  Score=39.16  Aligned_cols=74  Identities=16%  Similarity=0.112  Sum_probs=52.8

Q ss_pred             ECCCCCEEEEEcCCCcEEEEEcCccee---EeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEcCCCCe
Q 015021          315 LSPSGSLAAITDSLGRILLLDTQALVV---VRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAPRKGI  391 (414)
Q Consensus       315 ~sP~~~laa~tD~lGRV~L~D~~~~~v---vrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyaprRg~  391 (414)
                      +..++++..++|-.|.|-|+|+..-.-   |+.+.||=+- -+-+++.-+.+.                -.|.-+..+=.
T Consensus       307 Lq~s~q~LmaS~M~gkikLyD~R~~K~~~~V~qYeGHvN~-~a~l~~~v~~ee----------------g~I~s~GdDcy  369 (425)
T KOG2695|consen  307 LQFSQQKLMASDMTGKIKLYDLRATKCKKSVMQYEGHVNL-SAYLPAHVKEEE----------------GSIFSVGDDCY  369 (425)
T ss_pred             hccccceEeeccCcCceeEeeehhhhcccceeeeeccccc-cccccccccccc----------------ceEEEccCeeE
Confidence            345788999999999999999987777   8888888651 112222111110                14455788888


Q ss_pred             EEEeeCCCCcceee
Q 015021          392 IEHLAGMADENWTT  405 (414)
Q Consensus       392 lEvW~~~~g~~v~~  405 (414)
                      ..||++++|+...|
T Consensus       370 tRiWsl~~ghLl~t  383 (425)
T KOG2695|consen  370 TRIWSLDSGHLLCT  383 (425)
T ss_pred             EEEEecccCceeec
Confidence            99999999999866


No 113
>TIGR02658 TTQ_MADH_Hv methylamine dehydrogenase heavy chain. This family consists of the heavy chain of methylamine dehydrogenase light chain, a periplasmic enzyme. The enzyme contains a tryptophan tryptophylquinone (TTQ) prothetic group derived from two Trp residues in the light subunity. The enzyme forms a complex with the type I blue copper protein amicyanin and a cytochrome. Electron transfer procedes from TQQ to the copper and then to the heme group of the cytochrome.
Probab=64.55  E-value=13  Score=38.57  Aligned_cols=39  Identities=8%  Similarity=0.042  Sum_probs=34.3

Q ss_pred             CceeeEEEECCCCC-EEEEEc-CCCcEEEEEcCcceeEeee
Q 015021          307 PRKGERLTLSPSGS-LAAITD-SLGRILLLDTQALVVVRLW  345 (414)
Q Consensus       307 ~R~~~~i~~sP~~~-laa~tD-~lGRV~L~D~~~~~vvrmw  345 (414)
                      .+....|+++|+|+ ++.+++ .-|.|.+||+.++.+++--
T Consensus       298 G~~~~~iavS~Dgkp~lyvtn~~s~~VsViD~~t~k~i~~i  338 (352)
T TIGR02658       298 GHEIDSINVSQDAKPLLYALSTGDKTLYIFDAETGKELSSV  338 (352)
T ss_pred             CCceeeEEECCCCCeEEEEeCCCCCcEEEEECcCCeEEeee
Confidence            46788999999999 888887 6788999999999999866


No 114
>KOG0264 consensus Nucleosome remodeling factor, subunit CAF1/NURF55/MSI1 [Chromatin structure and dynamics]
Probab=64.15  E-value=25  Score=37.45  Aligned_cols=82  Identities=15%  Similarity=0.180  Sum_probs=57.9

Q ss_pred             CCccccCCCceeeEEEECCCC-CEEEEEcCCCcEEEEEcCccee--------------EeeecccccceeeEEEEEeccc
Q 015021          299 PLTCLKDHPRKGERLTLSPSG-SLAAITDSLGRILLLDTQALVV--------------VRLWKGYRDASCVFMEMLVNKD  363 (414)
Q Consensus       299 pl~~l~D~~R~~~~i~~sP~~-~laa~tD~lGRV~L~D~~~~~v--------------vrmwKGyRdAq~~wi~~~~~~~  363 (414)
                      |++.+...+-+...+.-||.. +..|.+-..||++++|+.+-.-              +-|=-|+|.-=+-|-       
T Consensus       308 ~lh~~e~H~dev~~V~WSPh~etvLASSg~D~rl~vWDls~ig~eq~~eda~dgppEllF~HgGH~~kV~Dfs-------  380 (422)
T KOG0264|consen  308 PLHTFEGHEDEVFQVEWSPHNETVLASSGTDRRLNVWDLSRIGEEQSPEDAEDGPPELLFIHGGHTAKVSDFS-------  380 (422)
T ss_pred             CceeccCCCcceEEEEeCCCCCceeEecccCCcEEEEeccccccccChhhhccCCcceeEEecCccccccccc-------
Confidence            667899999999999999999 7888889999999999976332              234444444210000       


Q ss_pred             ccccccccCCCCCcceEEEEEEcCCCCeEEEeeCC
Q 015021          364 AATSSAYYAPVKSDYCLCLAIHAPRKGIIEHLAGM  398 (414)
Q Consensus       364 ~~~~~~~~~~~k~~~~l~LvIyaprRg~lEvW~~~  398 (414)
                             -.|..  .  ..+-.+.-+++|.||.|-
T Consensus       381 -------Wnp~e--P--W~I~SvaeDN~LqIW~~s  404 (422)
T KOG0264|consen  381 -------WNPNE--P--WTIASVAEDNILQIWQMA  404 (422)
T ss_pred             -------CCCCC--C--eEEEEecCCceEEEeecc
Confidence                   00111  1  467788889999999986


No 115
>KOG1034 consensus Transcriptional repressor EED/ESC/FIE, required for transcriptional silencing, WD repeat superfamily [Transcription]
Probab=63.41  E-value=13  Score=38.66  Aligned_cols=69  Identities=14%  Similarity=0.141  Sum_probs=51.6

Q ss_pred             CCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEcCCCCeEEEeeCC
Q 015021          319 GSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAPRKGIIEHLAGM  398 (414)
Q Consensus       319 ~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyaprRg~lEvW~~~  398 (414)
                      +-+.|++-.+|-|-+||+.++....=..|+=+|    |.  |        .+.-|.    ..-|++-+..+..|.+|++|
T Consensus       105 ~p~la~~G~~GvIrVid~~~~~~~~~~~ghG~s----IN--e--------ik~~p~----~~qlvls~SkD~svRlwnI~  166 (385)
T KOG1034|consen  105 NPFLAAGGYLGVIRVIDVVSGQCSKNYRGHGGS----IN--E--------IKFHPD----RPQLVLSASKDHSVRLWNIQ  166 (385)
T ss_pred             CeeEEeecceeEEEEEecchhhhccceeccCcc----ch--h--------hhcCCC----CCcEEEEecCCceEEEEecc
Confidence            457788889999999999999998777666552    11  0        111112    22499999999999999999


Q ss_pred             CCcceee
Q 015021          399 ADENWTT  405 (414)
Q Consensus       399 ~g~~v~~  405 (414)
                      ++.||+-
T Consensus       167 ~~~Cv~V  173 (385)
T KOG1034|consen  167 TDVCVAV  173 (385)
T ss_pred             CCeEEEE
Confidence            9999964


No 116
>KOG0771 consensus Prolactin regulatory element-binding protein/Protein transport protein SEC12p [Intracellular trafficking, secretion, and vesicular transport]
Probab=62.98  E-value=12  Score=39.42  Aligned_cols=49  Identities=20%  Similarity=0.307  Sum_probs=42.2

Q ss_pred             ceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEE
Q 015021          308 RKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEML  359 (414)
Q Consensus       308 R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~  359 (414)
                      ..+.++.||++|+++|+.-..|-|.++|+.+++.+..-   +.|.-.+|.-.
T Consensus       282 ~siSsl~VS~dGkf~AlGT~dGsVai~~~~~lq~~~~v---k~aH~~~VT~l  330 (398)
T KOG0771|consen  282 KSISSLAVSDDGKFLALGTMDGSVAIYDAKSLQRLQYV---KEAHLGFVTGL  330 (398)
T ss_pred             CcceeEEEcCCCcEEEEeccCCcEEEEEeceeeeeEee---hhhheeeeeeE
Confidence            35789999999999999999999999999999999887   57777777543


No 117
>PF10168 Nup88:  Nuclear pore component;  InterPro: IPR019321  Nup88 can be divided into two structural domains; the N-terminal two-thirds of the protein have no obvious structural motifs. It is, however, where it binds to Nup98; one of the components of the nuclear pore. The C-terminal end is a predicted coiled-coil domain []. Nup88 is over expressed in tumour cells []. 
Probab=62.89  E-value=67  Score=36.56  Aligned_cols=78  Identities=19%  Similarity=0.242  Sum_probs=53.7

Q ss_pred             cCCCeeeeccCcceeeeeecceEEE--E--eecCCC-----CCceeecc------cCCCCCCCeEEEEEeEEe-CCcEEE
Q 015021           30 VNDPNLLCALDMHTIALANRYQTVI--I--NWADPE-----GLVAKIRP------ELSPIASEYITAIEWLVF-EEMRAL   93 (414)
Q Consensus        30 ~~~~~~~~sp~~~~la~A~~~~~v~--~--~w~~~~-----~~~v~~~g------~l~~~~~e~ITs~~~lpl-~dw~~I   93 (414)
                      ++-..|.++|+|++||++-..-++|  +  +|...+     ...+.++-      ......+-.|.-+.|=|. .+...+
T Consensus        85 f~v~~i~~n~~g~~lal~G~~~v~V~~LP~r~g~~~~~~~g~~~i~Crt~~v~~~~~~~~~~~~i~qv~WhP~s~~~~~l  164 (717)
T PF10168_consen   85 FEVHQISLNPTGSLLALVGPRGVVVLELPRRWGKNGEFEDGKKEINCRTVPVDERFFTSNSSLEIKQVRWHPWSESDSHL  164 (717)
T ss_pred             eeEEEEEECCCCCEEEEEcCCcEEEEEeccccCccccccCCCcceeEEEEEechhhccCCCCceEEEEEEcCCCCCCCeE
Confidence            3335788999999999876664444  3  486554     12355542      112212457889999999 356788


Q ss_pred             EEeccccEEEEEec
Q 015021           94 AVGTSRGYFLVYDL  107 (414)
Q Consensus        94 ~VG~ssG~vrfyte  107 (414)
                      +|=|+++.+|+|+.
T Consensus       165 ~vLtsdn~lR~y~~  178 (717)
T PF10168_consen  165 VVLTSDNTLRLYDI  178 (717)
T ss_pred             EEEecCCEEEEEec
Confidence            99999999999997


No 118
>KOG0294 consensus WD40 repeat-containing protein [Function unknown]
Probab=62.80  E-value=37  Score=35.20  Aligned_cols=84  Identities=14%  Similarity=0.118  Sum_probs=65.4

Q ss_pred             CccccCCCceeeEEEECCCCC--EEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCc
Q 015021          300 LTCLKDHPRKGERLTLSPSGS--LAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSD  377 (414)
Q Consensus       300 l~~l~D~~R~~~~i~~sP~~~--laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~  377 (414)
                      +..|.++...+..+..+|+-.  +...+-..|-|+++|+..=..+.-.|++-.+ +..|.+...            +|  
T Consensus        76 lg~ll~HagsitaL~F~~~~S~shLlS~sdDG~i~iw~~~~W~~~~slK~H~~~-Vt~lsiHPS------------~K--  140 (362)
T KOG0294|consen   76 LGILLSHAGSITALKFYPPLSKSHLLSGSDDGHIIIWRVGSWELLKSLKAHKGQ-VTDLSIHPS------------GK--  140 (362)
T ss_pred             hcceeccccceEEEEecCCcchhheeeecCCCcEEEEEcCCeEEeeeecccccc-cceeEecCC------------Cc--
Confidence            345667777888888888873  4444445699999999999999999999876 777776532            22  


Q ss_pred             ceEEEEEEcCCCCeEEEeeCCCCcc
Q 015021          378 YCLCLAIHAPRKGIIEHLAGMADEN  402 (414)
Q Consensus       378 ~~l~LvIyaprRg~lEvW~~~~g~~  402 (414)
                          |++-..+++.|..|++-+|+.
T Consensus       141 ----LALsVg~D~~lr~WNLV~Gr~  161 (362)
T KOG0294|consen  141 ----LALSVGGDQVLRTWNLVRGRV  161 (362)
T ss_pred             ----eEEEEcCCceeeeehhhcCcc
Confidence                778888999999999998875


No 119
>KOG0278 consensus Serine/threonine kinase receptor-associated protein [Lipid transport and metabolism]
Probab=62.80  E-value=64  Score=32.77  Aligned_cols=87  Identities=11%  Similarity=0.024  Sum_probs=57.5

Q ss_pred             eeeeccCcceeeeeecceEEEEeecCCCCCceeecccCCCCCCCeEEEEEeEEeCCcEEEEEeccccEEEEEe-cCCcEE
Q 015021           34 NLLCALDMHTIALANRYQTVIINWADPEGLVAKIRPELSPIASEYITAIEWLVFEEMRALAVGTSRGYFLVYD-LKGDLV  112 (414)
Q Consensus        34 ~~~~sp~~~~la~A~~~~~v~~~w~~~~~~~v~~~g~l~~~~~e~ITs~~~lpl~dw~~I~VG~ssG~vrfyt-e~G~LL  112 (414)
                      ++-+|++|++|.+|+++-+.|.  +...=+.++-. +    -...|+|..-=  |+--+.+.|=.++|++-|+ .+|+-+
T Consensus       189 SlEvs~dG~ilTia~gssV~Fw--daksf~~lKs~-k----~P~nV~SASL~--P~k~~fVaGged~~~~kfDy~TgeEi  259 (334)
T KOG0278|consen  189 SLEVSQDGRILTIAYGSSVKFW--DAKSFGLLKSY-K----MPCNVESASLH--PKKEFFVAGGEDFKVYKFDYNTGEEI  259 (334)
T ss_pred             ceeeccCCCEEEEecCceeEEe--ccccccceeec-c----Ccccccccccc--CCCceEEecCcceEEEEEeccCCcee
Confidence            4558999999999999988772  22110001100 0    01333333111  4557889999999999999 789888


Q ss_pred             eec-ccCccceeEEEEec
Q 015021          113 HRQ-LIHPGRILKLRVRG  129 (414)
Q Consensus       113 ~sQ-~lh~~pV~~ik~r~  129 (414)
                      =+- -=|+.||..+|+-.
T Consensus       260 ~~~nkgh~gpVhcVrFSP  277 (334)
T KOG0278|consen  260 GSYNKGHFGPVHCVRFSP  277 (334)
T ss_pred             eecccCCCCceEEEEECC
Confidence            762 45789999999754


No 120
>PRK03629 tolB translocation protein TolB; Provisional
Probab=61.82  E-value=32  Score=35.89  Aligned_cols=48  Identities=19%  Similarity=0.324  Sum_probs=34.3

Q ss_pred             ceeeEEEECCCCCEEEEE-cCCC--cEEEEEcCcceeEeeeccc-ccceeeE
Q 015021          308 RKGERLTLSPSGSLAAIT-DSLG--RILLLDTQALVVVRLWKGY-RDASCVF  355 (414)
Q Consensus       308 R~~~~i~~sP~~~laa~t-D~lG--RV~L~D~~~~~vvrmwKGy-RdAq~~w  355 (414)
                      +.......||+|+.+|.+ +..|  +|.++|+.++.+.++..+. .+....|
T Consensus       243 ~~~~~~~~SPDG~~La~~~~~~g~~~I~~~d~~tg~~~~lt~~~~~~~~~~w  294 (429)
T PRK03629        243 RHNGAPAFSPDGSKLAFALSKTGSLNLYVMDLASGQIRQVTDGRSNNTEPTW  294 (429)
T ss_pred             CCcCCeEECCCCCEEEEEEcCCCCcEEEEEECCCCCEEEccCCCCCcCceEE
Confidence            334568899999866654 5555  6999999999998888763 3344444


No 121
>PRK04922 tolB translocation protein TolB; Provisional
Probab=61.21  E-value=37  Score=35.27  Aligned_cols=39  Identities=26%  Similarity=0.278  Sum_probs=28.8

Q ss_pred             eeEEEECCCCCEEE-EEcCCCc--EEEEEcCcceeEeee-ccc
Q 015021          310 GERLTLSPSGSLAA-ITDSLGR--ILLLDTQALVVVRLW-KGY  348 (414)
Q Consensus       310 ~~~i~~sP~~~laa-~tD~lGR--V~L~D~~~~~vvrmw-KGy  348 (414)
                      ....+.+|+|++++ ++|..|+  |+++|+.++..-|+- .|.
T Consensus       294 ~~~~~~spDG~~l~f~sd~~g~~~iy~~dl~~g~~~~lt~~g~  336 (433)
T PRK04922        294 DTEPTWAPDGKSIYFTSDRGGRPQIYRVAASGGSAERLTFQGN  336 (433)
T ss_pred             ccceEECCCCCEEEEEECCCCCceEEEEECCCCCeEEeecCCC
Confidence            34578999997555 5677775  999999998887763 443


No 122
>TIGR02276 beta_rpt_yvtn 40-residue YVTN family beta-propeller repeat. This repeat of about 40 amino acids is found in up to 14 copies per protein. Archaea Methanosarcina mazei and Methanosarcina acetivorans each have over 10 genes that encode tandem copies of this repeat, which is also found in other species. PSIPRED predicts with high confidence that each 40-residue repeats contains four beta strands. This model overlaps somewhat with the NHL repeat (Pfam pfam01436) and also shows sequence similarity to the WD domain, G-beta repeat (Pfam pfam00400).
Probab=61.14  E-value=22  Score=24.07  Aligned_cols=29  Identities=21%  Similarity=0.327  Sum_probs=22.0

Q ss_pred             CCCCEEEEEc-CCCcEEEEEcCcceeEeee
Q 015021          317 PSGSLAAITD-SLGRILLLDTQALVVVRLW  345 (414)
Q Consensus       317 P~~~laa~tD-~lGRV~L~D~~~~~vvrmw  345 (414)
                      |+|+.+-+++ ..+.|.++|+.++.+++-.
T Consensus         1 pd~~~lyv~~~~~~~v~~id~~~~~~~~~i   30 (42)
T TIGR02276         1 PDGTKLYVTNSGSNTVSVIDTATNKVIATI   30 (42)
T ss_pred             CCCCEEEEEeCCCCEEEEEECCCCeEEEEE
Confidence            6787555554 5889999999998877654


No 123
>KOG0771 consensus Prolactin regulatory element-binding protein/Protein transport protein SEC12p [Intracellular trafficking, secretion, and vesicular transport]
Probab=60.78  E-value=65  Score=34.18  Aligned_cols=108  Identities=18%  Similarity=0.209  Sum_probs=69.1

Q ss_pred             ccccCCCCCcccCCCeeeeccCcceeeeeecc---eEEE----EeecCCCCCceeecccCCCCCCCeEEEEEeEEeCCcE
Q 015021           19 SDLGAGKEGWLVNDPNLLCALDMHTIALANRY---QTVI----INWADPEGLVAKIRPELSPIASEYITAIEWLVFEEMR   91 (414)
Q Consensus        19 ~~~g~~~~~WL~~~~~~~~sp~~~~la~A~~~---~~v~----~~w~~~~~~~v~~~g~l~~~~~e~ITs~~~lpl~dw~   91 (414)
                      ...++-+.+|+...|.+...++++.|-+|..+   +.|.    .-|..+  .-+.++..+..  .|-|||+. +. +|-.
T Consensus       221 a~~t~~~k~~~~~~cRF~~d~~~~~l~laa~~~~~~~v~~~~~~~w~~~--~~l~~~~~~~~--~~siSsl~-VS-~dGk  294 (398)
T KOG0771|consen  221 ARKTPFSKDEMFSSCRFSVDNAQETLRLAASQFPGGGVRLCDISLWSGS--NFLRLRKKIKR--FKSISSLA-VS-DDGK  294 (398)
T ss_pred             hhcCCcccchhhhhceecccCCCceEEEEEecCCCCceeEEEeeeeccc--cccchhhhhhc--cCcceeEE-Ec-CCCc
Confidence            34444566777777999998888777665544   3333    224332  11334444422  35666652 22 4678


Q ss_pred             EEEEeccccEEEEEec-C-CcEEeecccCccceeEEEEecccC
Q 015021           92 ALAVGTSRGYFLVYDL-K-GDLVHRQLIHPGRILKLRVRGSRR  132 (414)
Q Consensus        92 ~I~VG~ssG~vrfyte-~-G~LL~sQ~lh~~pV~~ik~r~~~~  132 (414)
                      ++++|+.+|-|.+|.. + -.+.+.-..|..-|.+|.+-.-.+
T Consensus       295 f~AlGT~dGsVai~~~~~lq~~~~vk~aH~~~VT~ltF~Pdsr  337 (398)
T KOG0771|consen  295 FLALGTMDGSVAIYDAKSLQRLQYVKEAHLGFVTGLTFSPDSR  337 (398)
T ss_pred             EEEEeccCCcEEEEEeceeeeeEeehhhheeeeeeEEEcCCcC
Confidence            9999999999999993 2 344455577888999999877553


No 124
>KOG0308 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=60.42  E-value=21  Score=40.03  Aligned_cols=98  Identities=11%  Similarity=0.043  Sum_probs=63.9

Q ss_pred             CCCccccCCCceeeEEEE-CCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCC
Q 015021          298 SPLTCLKDHPRKGERLTL-SPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKS  376 (414)
Q Consensus       298 ~pl~~l~D~~R~~~~i~~-sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~  376 (414)
                      ..+.+|-+++-=+.+|+. .+++.++|..-..++|.|+|+.+|.. +.-     |.+-.+++..-. .+.-..-|.-.-.
T Consensus       108 ~c~stir~H~DYVkcla~~ak~~~lvaSgGLD~~IflWDin~~~~-~l~-----~s~n~~t~~sl~-sG~k~siYSLA~N  180 (735)
T KOG0308|consen  108 FCMSTIRTHKDYVKCLAYIAKNNELVASGGLDRKIFLWDINTGTA-TLV-----ASFNNVTVNSLG-SGPKDSIYSLAMN  180 (735)
T ss_pred             hhHhhhhcccchheeeeecccCceeEEecCCCccEEEEEccCcch-hhh-----hhccccccccCC-CCCccceeeeecC
Confidence            455678899888999999 89999999999999999999999965 000     111122211100 0100000000011


Q ss_pred             cceEEEEEEcCCCCeEEEeeCCCCcce
Q 015021          377 DYCLCLAIHAPRKGIIEHLAGMADENW  403 (414)
Q Consensus       377 ~~~l~LvIyaprRg~lEvW~~~~g~~v  403 (414)
                      + -.++++-..--+.|.+|+.++++++
T Consensus       181 ~-t~t~ivsGgtek~lr~wDprt~~ki  206 (735)
T KOG0308|consen  181 Q-TGTIIVSGGTEKDLRLWDPRTCKKI  206 (735)
T ss_pred             C-cceEEEecCcccceEEeccccccce
Confidence            1 2267888888999999999999998


No 125
>PRK01742 tolB translocation protein TolB; Provisional
Probab=59.67  E-value=41  Score=34.95  Aligned_cols=39  Identities=23%  Similarity=0.309  Sum_probs=28.7

Q ss_pred             eeeEEEECCCCCEEEE-EcCCC--cEEEEEcCcceeEeeecc
Q 015021          309 KGERLTLSPSGSLAAI-TDSLG--RILLLDTQALVVVRLWKG  347 (414)
Q Consensus       309 ~~~~i~~sP~~~laa~-tD~lG--RV~L~D~~~~~vvrmwKG  347 (414)
                      ....+..+|+|++++. .+..|  +|+++|+.++.+.++-.+
T Consensus       249 ~~~~~~wSPDG~~La~~~~~~g~~~Iy~~d~~~~~~~~lt~~  290 (429)
T PRK01742        249 HNGAPAFSPDGSRLAFASSKDGVLNIYVMGANGGTPSQLTSG  290 (429)
T ss_pred             ccCceeECCCCCEEEEEEecCCcEEEEEEECCCCCeEeeccC
Confidence            3446899999976665 45556  678899999888777654


No 126
>PF00400 WD40:  WD domain, G-beta repeat;  InterPro: IPR019781 WD-40 repeats (also known as WD or beta-transducin repeats) are short ~40 amino acid motifs, often terminating in a Trp-Asp (W-D) dipeptide. WD40 repeats usually assume a 7-8 bladed beta-propeller fold, but proteins have been found with 4 to 16 repeated units, which also form a circularised beta-propeller structure. WD-repeat proteins are a large family found in all eukaryotes and are implicated in a variety of functions ranging from signal transduction and transcription regulation to cell cycle control and apoptosis. Repeated WD40 motifs act as a site for protein-protein interaction, and proteins containing WD40 repeats are known to serve as platforms for the assembly of protein complexes or mediators of transient interplay among other proteins. The specificity of the proteins is determined by the sequences outside the repeats themselves. Examples of such complexes are G proteins (beta subunit is a beta-propeller), TAFII transcription factor, and E3 ubiquitin ligase [, ]. In Arabidopsis spp., several WD40-containing proteins act as key regulators of plant-specific developmental events.; PDB: 2ZKQ_a 3CFV_B 3CFS_B 1PEV_A 1NR0_A 1VYH_T 3RFH_A 3O2Z_T 3FRX_C 3U5G_g ....
Probab=58.90  E-value=18  Score=23.96  Aligned_cols=29  Identities=38%  Similarity=0.478  Sum_probs=23.7

Q ss_pred             CCeEEEEEeEEeCCcEEEEEeccccEEEEEe
Q 015021           76 SEYITAIEWLVFEEMRALAVGTSRGYFLVYD  106 (414)
Q Consensus        76 ~e~ITs~~~lpl~dw~~I~VG~ssG~vrfyt  106 (414)
                      .+.|+++.|-|-  -..++.|-.+|.|++|+
T Consensus        11 ~~~i~~i~~~~~--~~~~~s~~~D~~i~vwd   39 (39)
T PF00400_consen   11 SSSINSIAWSPD--GNFLASGSSDGTIRVWD   39 (39)
T ss_dssp             SSSEEEEEEETT--SSEEEEEETTSEEEEEE
T ss_pred             CCcEEEEEEecc--cccceeeCCCCEEEEEC
Confidence            466888888765  56889999999999985


No 127
>KOG0315 consensus G-protein beta subunit-like protein (contains WD40 repeats) [General function prediction only]
Probab=58.44  E-value=19  Score=36.27  Aligned_cols=54  Identities=24%  Similarity=0.181  Sum_probs=48.2

Q ss_pred             ccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEE
Q 015021          303 LKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFM  356 (414)
Q Consensus       303 l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi  356 (414)
                      |.-+.|-.=..+.|-+|+|++++.+.+-+=|+|+.++..+|.|-|++-|-+.|-
T Consensus       254 l~gh~rWvWdc~FS~dg~YlvTassd~~~rlW~~~~~k~v~qy~gh~K~~vc~~  307 (311)
T KOG0315|consen  254 LTGHQRWVWDCAFSADGEYLVTASSDHTARLWDLSAGKEVRQYQGHHKAAVCVA  307 (311)
T ss_pred             eecCCceEEeeeeccCccEEEecCCCCceeecccccCceeeecCCcccccEEEE
Confidence            555668888889999999999999999999999999999999999999877664


No 128
>KOG0313 consensus Microtubule binding protein YTM1 (contains WD40 repeats) [Cytoskeleton]
Probab=57.94  E-value=26  Score=36.98  Aligned_cols=76  Identities=14%  Similarity=0.100  Sum_probs=59.9

Q ss_pred             ceeeEEEECCCCCEEEEEcCCCcEEEEEcCcc---eeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEE
Q 015021          308 RKGERLTLSPSGSLAAITDSLGRILLLDTQAL---VVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAI  384 (414)
Q Consensus       308 R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~---~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvI  384 (414)
                      ....+|..+|..+|+|+.-+.--+-|+|-.++   .+..-+-||-.    |++...-.+          .    --|+.+
T Consensus       301 ksl~~i~~~~~~~Ll~~gssdr~irl~DPR~~~gs~v~~s~~gH~n----wVssvkwsp----------~----~~~~~~  362 (423)
T KOG0313|consen  301 KSLNCISYSPLSKLLASGSSDRHIRLWDPRTGDGSVVSQSLIGHKN----WVSSVKWSP----------T----NEFQLV  362 (423)
T ss_pred             cceeEeecccccceeeecCCCCceeecCCCCCCCceeEEeeecchh----hhhheecCC----------C----CceEEE
Confidence            34567899999999999999999999998765   45568889988    998653211          1    117889


Q ss_pred             EcCCCCeEEEeeCCCCc
Q 015021          385 HAPRKGIIEHLAGMADE  401 (414)
Q Consensus       385 yaprRg~lEvW~~~~g~  401 (414)
                      .+..+|.+.+||+|.-.
T Consensus       363 S~S~D~t~klWDvRS~k  379 (423)
T KOG0313|consen  363 SGSYDNTVKLWDVRSTK  379 (423)
T ss_pred             EEecCCeEEEEEeccCC
Confidence            99999999999999755


No 129
>PF06977 SdiA-regulated:  SdiA-regulated;  InterPro: IPR009722 This entry represents a conserved region approximately 100 residues long within a number of hypothetical bacterial proteins that may be regulated by SdiA, a member of the LuxR family of transcriptional regulators []. Some proteins contain the IPR001258 from INTERPRO repeat.; PDB: 3QQZ_A.
Probab=57.74  E-value=82  Score=31.10  Aligned_cols=48  Identities=31%  Similarity=0.369  Sum_probs=34.0

Q ss_pred             cccCCCceeeEEEECCC-CCEEEEEcCCCcEEEEEcCcceeEeee--ccccc
Q 015021          302 CLKDHPRKGERLTLSPS-GSLAAITDSLGRILLLDTQALVVVRLW--KGYRD  350 (414)
Q Consensus       302 ~l~D~~R~~~~i~~sP~-~~laa~tD~lGRV~L~D~~~~~vvrmw--KGyRd  350 (414)
                      .|++..++...|+.+|+ ++|+|++|.-+.|+-+|+. |.++|-+  .|..|
T Consensus        16 ~l~g~~~e~SGLTy~pd~~tLfaV~d~~~~i~els~~-G~vlr~i~l~g~~D   66 (248)
T PF06977_consen   16 PLPGILDELSGLTYNPDTGTLFAVQDEPGEIYELSLD-GKVLRRIPLDGFGD   66 (248)
T ss_dssp             E-TT--S-EEEEEEETTTTEEEEEETTTTEEEEEETT---EEEEEE-SS-SS
T ss_pred             ECCCccCCccccEEcCCCCeEEEEECCCCEEEEEcCC-CCEEEEEeCCCCCC
Confidence            36677778999999997 5899999999999999985 7788755  45444


No 130
>KOG0305 consensus Anaphase promoting complex, Cdc20, Cdh1, and Ama1 subunits [Cell cycle control, cell division, chromosome partitioning; Posttranslational modification, protein turnover, chaperones]
Probab=57.40  E-value=32  Score=37.32  Aligned_cols=86  Identities=14%  Similarity=0.039  Sum_probs=59.2

Q ss_pred             ccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEE
Q 015021          303 LKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCL  382 (414)
Q Consensus       303 l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~L  382 (414)
                      +.-++.+...+..+|++++.|...++.+|.++|...-..+.-..+++.|==+....+=..                . .|
T Consensus       297 ~~~H~qeVCgLkws~d~~~lASGgnDN~~~Iwd~~~~~p~~~~~~H~aAVKA~awcP~q~----------------~-lL  359 (484)
T KOG0305|consen  297 LQGHRQEVCGLKWSPDGNQLASGGNDNVVFIWDGLSPEPKFTFTEHTAAVKALAWCPWQS----------------G-LL  359 (484)
T ss_pred             hhcccceeeeeEECCCCCeeccCCCccceEeccCCCccccEEEeccceeeeEeeeCCCcc----------------C-ce
Confidence            666778888999999999999999999999999966666665666776643332221000                0 12


Q ss_pred             EEEc-CCCCeEEEeeCCCCcceee
Q 015021          383 AIHA-PRKGIIEHLAGMADENWTT  405 (414)
Q Consensus       383 vIya-prRg~lEvW~~~~g~~v~~  405 (414)
                      ++=. -.++.|.+|++.+|.++.+
T Consensus       360 AsGGGs~D~~i~fwn~~~g~~i~~  383 (484)
T KOG0305|consen  360 ATGGGSADRCIKFWNTNTGARIDS  383 (484)
T ss_pred             EEcCCCcccEEEEEEcCCCcEecc
Confidence            2211 2367888999988888743


No 131
>KOG1408 consensus WD40 repeat protein [Function unknown]
Probab=57.34  E-value=49  Score=37.88  Aligned_cols=82  Identities=15%  Similarity=0.193  Sum_probs=60.9

Q ss_pred             eEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEcCCCC
Q 015021          311 ERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAPRKG  390 (414)
Q Consensus       311 ~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyaprRg  390 (414)
                      -.+.++|+-++.++.=-+-.|=++|+.+|.-++.+||-||-+=.-|-+..+               +-+.||+-.-. +.
T Consensus       600 YDm~Vdp~~k~v~t~cQDrnirif~i~sgKq~k~FKgs~~~eG~lIKv~lD---------------PSgiY~atScs-dk  663 (1080)
T KOG1408|consen  600 YDMAVDPTSKLVVTVCQDRNIRIFDIESGKQVKSFKGSRDHEGDLIKVILD---------------PSGIYLATSCS-DK  663 (1080)
T ss_pred             EEeeeCCCcceEEEEecccceEEEeccccceeeeecccccCCCceEEEEEC---------------CCccEEEEeec-CC
Confidence            346899999999998777788899999999999999999954444444322               11335554444 56


Q ss_pred             eEEEeeCCCCcceeeccc
Q 015021          391 IIEHLAGMADENWTTSLN  408 (414)
Q Consensus       391 ~lEvW~~~~g~~v~~~~~  408 (414)
                      .|-++|.-.|+|||+-..
T Consensus       664 tl~~~Df~sgEcvA~m~G  681 (1080)
T KOG1408|consen  664 TLCFVDFVSGECVAQMTG  681 (1080)
T ss_pred             ceEEEEeccchhhhhhcC
Confidence            788899999999987543


No 132
>PRK01029 tolB translocation protein TolB; Provisional
Probab=57.16  E-value=60  Score=34.07  Aligned_cols=46  Identities=20%  Similarity=0.099  Sum_probs=34.0

Q ss_pred             ccCCCceeeEEEECCCCCEEEE-EcCC--CcEEEEEcCcceeEeeeccc
Q 015021          303 LKDHPRKGERLTLSPSGSLAAI-TDSL--GRILLLDTQALVVVRLWKGY  348 (414)
Q Consensus       303 l~D~~R~~~~i~~sP~~~laa~-tD~l--GRV~L~D~~~~~vvrmwKGy  348 (414)
                      +.+..+.......||+|+++|. ++..  .+|.++|+.++..-++..+.
T Consensus       322 lt~~~~~~~~p~wSPDG~~Laf~~~~~g~~~I~v~dl~~g~~~~Lt~~~  370 (428)
T PRK01029        322 LTKKYRNSSCPAWSPDGKKIAFCSVIKGVRQICVYDLATGRDYQLTTSP  370 (428)
T ss_pred             eccCCCCccceeECCCCCEEEEEEcCCCCcEEEEEECCCCCeEEccCCC
Confidence            3334445567889999986665 4443  47999999999998888774


No 133
>KOG0295 consensus WD40 repeat-containing protein [Function unknown]
Probab=56.91  E-value=52  Score=34.68  Aligned_cols=69  Identities=9%  Similarity=0.016  Sum_probs=54.3

Q ss_pred             CCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEcCCCCeEEEeeC
Q 015021          318 SGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAPRKGIIEHLAG  397 (414)
Q Consensus       318 ~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyaprRg~lEvW~~  397 (414)
                      .+++.+....++-|=.+|+.+|..+--.+|+=+    |+.-..=.+         .+|     || +..--++.|.||++
T Consensus       303 ~~~~l~s~SrDktIk~wdv~tg~cL~tL~ghdn----wVr~~af~p---------~Gk-----yi-~ScaDDktlrvwdl  363 (406)
T KOG0295|consen  303 GGQVLGSGSRDKTIKIWDVSTGMCLFTLVGHDN----WVRGVAFSP---------GGK-----YI-LSCADDKTLRVWDL  363 (406)
T ss_pred             CccEEEeecccceEEEEeccCCeEEEEEecccc----eeeeeEEcC---------CCe-----EE-EEEecCCcEEEEEe
Confidence            457888888999999999999999999999999    996443211         122     43 55667999999999


Q ss_pred             CCCcceee
Q 015021          398 MADENWTT  405 (414)
Q Consensus       398 ~~g~~v~~  405 (414)
                      ++++|.-|
T Consensus       364 ~~~~cmk~  371 (406)
T KOG0295|consen  364 KNLQCMKT  371 (406)
T ss_pred             ccceeeec
Confidence            99998744


No 134
>KOG0274 consensus Cdc4 and related F-box and WD-40 proteins [General function prediction only]
Probab=56.58  E-value=84  Score=34.49  Aligned_cols=48  Identities=13%  Similarity=-0.009  Sum_probs=37.6

Q ss_pred             ccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeeccccc
Q 015021          303 LKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRD  350 (414)
Q Consensus       303 l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRd  350 (414)
                      |..+.-..-.+++.-.+.+....-.+..+-++|+.+|..++...|+-+
T Consensus       245 l~GH~g~V~~l~~~~~~~~lvsgS~D~t~rvWd~~sg~C~~~l~gh~s  292 (537)
T KOG0274|consen  245 LVGHFGGVWGLAFPSGGDKLVSGSTDKTERVWDCSTGECTHSLQGHTS  292 (537)
T ss_pred             ccCCCCCceeEEEecCCCEEEEEecCCcEEeEecCCCcEEEEecCCCc
Confidence            555556666777766567777777788999999999999999998766


No 135
>PF02239 Cytochrom_D1:  Cytochrome D1 heme domain; PDB: 1NNO_B 1HZU_A 1N15_B 1N50_A 1GJQ_A 1BL9_B 1NIR_B 1N90_B 1HZV_A 1AOQ_A ....
Probab=55.99  E-value=20  Score=36.99  Aligned_cols=39  Identities=18%  Similarity=0.376  Sum_probs=29.7

Q ss_pred             ceeeEEEECCCCCEEEEEc-CCCcEEEEEcCcceeEeeec
Q 015021          308 RKGERLTLSPSGSLAAITD-SLGRILLLDTQALVVVRLWK  346 (414)
Q Consensus       308 R~~~~i~~sP~~~laa~tD-~lGRV~L~D~~~~~vvrmwK  346 (414)
                      .....|++||+|++++++. .-+.|.++|..++.+++.-.
T Consensus        78 ~~~~~i~~s~DG~~~~v~n~~~~~v~v~D~~tle~v~~I~  117 (369)
T PF02239_consen   78 GNPRGIAVSPDGKYVYVANYEPGTVSVIDAETLEPVKTIP  117 (369)
T ss_dssp             SEEEEEEE--TTTEEEEEEEETTEEEEEETTT--EEEEEE
T ss_pred             CCcceEEEcCCCCEEEEEecCCCceeEeccccccceeecc
Confidence            5678899999999998874 67999999999999998653


No 136
>PF02897 Peptidase_S9_N:  Prolyl oligopeptidase, N-terminal beta-propeller domain;  InterPro: IPR004106 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This entry represents the beta-propeller domain found at the N-terminal of prolyl oligopeptidase, including acylamino-acid-releasing enzyme (also known as acylaminoacyl peptidase), which belong to the MEROPS peptidase family S9 (clan SC), subfamily S9A. The prolyl oligopeptidase family consist of a number of evolutionary related peptidases whose catalytic activity seems to be provided by a charge relay system similar to that of the trypsin family of serine proteases, but which evolved by independent convergent evolution. The N-terminal domain of prolyl oligopeptidases form an unusual 7-bladed beta-propeller consisting of seven 4-stranded beta-sheet motifs.  Prolyl oligopeptidase is a large cytosolic enzyme involved in the maturation and degradation of peptide hormones and neuropeptides, which relate to the induction of amnesia. The enzyme contains a peptidase domain, where its catalytic triad (Ser554, His680, Asp641) is covered by the central tunnel of the N-terminal beta-propeller domain. In this way, large structured peptides are excluded from the active site, thereby protecting larger peptides and proteins from proteolysis in the cytosol []. The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. Mammalian acylaminoacyl peptidase is an exopeptidase that is a member of the same prolyl oligopeptidase family of serine peptidases. This enzyme removes acylated amino acid residues from the N terminus of oligopeptides [].; GO: 0004252 serine-type endopeptidase activity, 0006508 proteolysis; PDB: 2BKL_B 3DDU_A 1YR2_A 2XE4_A 1VZ3_A 3EQ9_A 1O6F_A 3EQ7_A 4AN0_A 1UOP_A ....
Probab=55.95  E-value=27  Score=35.54  Aligned_cols=49  Identities=14%  Similarity=0.118  Sum_probs=36.4

Q ss_pred             ceeeEEEECCCCCEEEEE-cCCCc----EEEEEcCcceeEe-eecccccceeeEE
Q 015021          308 RKGERLTLSPSGSLAAIT-DSLGR----ILLLDTQALVVVR-LWKGYRDASCVFM  356 (414)
Q Consensus       308 R~~~~i~~sP~~~laa~t-D~lGR----V~L~D~~~~~vvr-mwKGyRdAq~~wi  356 (414)
                      .......+||+|+++|.+ |..|.    +.++|+.+|..+. ...+.+-..+.|.
T Consensus       124 ~~~~~~~~Spdg~~la~~~s~~G~e~~~l~v~Dl~tg~~l~d~i~~~~~~~~~W~  178 (414)
T PF02897_consen  124 VSLGGFSVSPDGKRLAYSLSDGGSEWYTLRVFDLETGKFLPDGIENPKFSSVSWS  178 (414)
T ss_dssp             EEEEEEEETTTSSEEEEEEEETTSSEEEEEEEETTTTEEEEEEEEEEESEEEEEC
T ss_pred             EEeeeeeECCCCCEEEEEecCCCCceEEEEEEECCCCcCcCCcccccccceEEEe
Confidence            455678999999988875 77776    9999999998876 4344444346665


No 137
>PRK03629 tolB translocation protein TolB; Provisional
Probab=55.79  E-value=65  Score=33.62  Aligned_cols=47  Identities=15%  Similarity=0.126  Sum_probs=33.5

Q ss_pred             eeeEEEECCCCCEEEEEc---CCCcEEEEEcCcceeEeeecccccceeeE
Q 015021          309 KGERLTLSPSGSLAAITD---SLGRILLLDTQALVVVRLWKGYRDASCVF  355 (414)
Q Consensus       309 ~~~~i~~sP~~~laa~tD---~lGRV~L~D~~~~~vvrmwKGyRdAq~~w  355 (414)
                      .......||+|++++.+.   ....|.++|+.++.+.++-.++.+....|
T Consensus       332 ~~~~~~~SpDG~~Ia~~~~~~g~~~I~~~dl~~g~~~~Lt~~~~~~~p~~  381 (429)
T PRK03629        332 QNQDADVSSDGKFMVMVSSNGGQQHIAKQDLATGGVQVLTDTFLDETPSI  381 (429)
T ss_pred             CccCEEECCCCCEEEEEEccCCCceEEEEECCCCCeEEeCCCCCCCCceE
Confidence            345688999998777642   33569999999998877776665544444


No 138
>KOG0269 consensus WD40 repeat-containing protein [Function unknown]
Probab=55.74  E-value=12  Score=42.39  Aligned_cols=67  Identities=13%  Similarity=0.127  Sum_probs=45.8

Q ss_pred             CCEEEEEcCCCcEEEEEcCc---ceeEeeeccc-ccceeeEEEEEecccccccccccCCCCCcceEEEEEEcCCCCeEEE
Q 015021          319 GSLAAITDSLGRILLLDTQA---LVVVRLWKGY-RDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAPRKGIIEH  394 (414)
Q Consensus       319 ~~laa~tD~lGRV~L~D~~~---~~vvrmwKGy-RdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyaprRg~lEv  394 (414)
                      .+|+|++.+.|-|.++|+..   -..+.+++-| |.|.+-=+..+                   --.++|-+.++|.|++
T Consensus       100 ~NlIAT~s~nG~i~vWdlnk~~rnk~l~~f~EH~Rs~~~ldfh~t-------------------ep~iliSGSQDg~vK~  160 (839)
T KOG0269|consen  100 SNLIATCSTNGVISVWDLNKSIRNKLLTVFNEHERSANKLDFHST-------------------EPNILISGSQDGTVKC  160 (839)
T ss_pred             hhhheeecCCCcEEEEecCccccchhhhHhhhhccceeeeeeccC-------------------CccEEEecCCCceEEE
Confidence            47899999999999999998   3334444444 22221111110                   1158999999999999


Q ss_pred             eeCCCCccee
Q 015021          395 LAGMADENWT  404 (414)
Q Consensus       395 W~~~~g~~v~  404 (414)
                      ||+|--.-+-
T Consensus       161 ~DlR~~~S~~  170 (839)
T KOG0269|consen  161 WDLRSKKSKS  170 (839)
T ss_pred             Eeeecccccc
Confidence            9999866553


No 139
>COG4946 Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system [Function unknown]
Probab=54.94  E-value=17  Score=39.53  Aligned_cols=38  Identities=18%  Similarity=0.198  Sum_probs=32.9

Q ss_pred             eeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeec
Q 015021          309 KGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWK  346 (414)
Q Consensus       309 ~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwK  346 (414)
                      .++++.+||+|++++++..-+-++++|+.+|.+--+=|
T Consensus       403 ~I~av~vs~dGK~~vvaNdr~el~vididngnv~~idk  440 (668)
T COG4946         403 NIEAVKVSPDGKKVVVANDRFELWVIDIDNGNVRLIDK  440 (668)
T ss_pred             ceEEEEEcCCCcEEEEEcCceEEEEEEecCCCeeEecc
Confidence            46789999999999999999999999999998754433


No 140
>KOG2048 consensus WD40 repeat protein [General function prediction only]
Probab=53.67  E-value=16  Score=40.73  Aligned_cols=32  Identities=25%  Similarity=0.387  Sum_probs=29.5

Q ss_pred             eeEEEECCCCCEEEEEcCCCcEEEEEcCccee
Q 015021          310 GERLTLSPSGSLAAITDSLGRILLLDTQALVV  341 (414)
Q Consensus       310 ~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~v  341 (414)
                      +.+|.+||+|+|+|+.+..|-|.++|+.+++-
T Consensus       478 I~~l~~SsdG~yiaa~~t~g~I~v~nl~~~~~  509 (691)
T KOG2048|consen  478 ISRLVVSSDGNYIAAISTRGQIFVYNLETLES  509 (691)
T ss_pred             ceeEEEcCCCCEEEEEeccceEEEEEccccee
Confidence            67899999999999999999999999998864


No 141
>PRK04922 tolB translocation protein TolB; Provisional
Probab=53.39  E-value=69  Score=33.28  Aligned_cols=46  Identities=33%  Similarity=0.406  Sum_probs=33.3

Q ss_pred             eeEEEECCCCCEEEEEcCC-C--cEEEEEcCcceeEeeecccccceeeE
Q 015021          310 GERLTLSPSGSLAAITDSL-G--RILLLDTQALVVVRLWKGYRDASCVF  355 (414)
Q Consensus       310 ~~~i~~sP~~~laa~tD~l-G--RV~L~D~~~~~vvrmwKGyRdAq~~w  355 (414)
                      ...++.||+|+++|.+... |  +|.++|+.++...++-.|..+....|
T Consensus       338 ~~~~~~SpDG~~Ia~~~~~~~~~~I~v~d~~~g~~~~Lt~~~~~~~p~~  386 (433)
T PRK04922        338 NARASVSPDGKKIAMVHGSGGQYRIAVMDLSTGSVRTLTPGSLDESPSF  386 (433)
T ss_pred             ccCEEECCCCCEEEEEECCCCceeEEEEECCCCCeEECCCCCCCCCceE
Confidence            3468999999888775332 2  69999999998877766655544444


No 142
>KOG0641 consensus WD40 repeat protein [General function prediction only]
Probab=53.24  E-value=22  Score=35.48  Aligned_cols=47  Identities=17%  Similarity=0.195  Sum_probs=38.1

Q ss_pred             cccCCC---ceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeeccc
Q 015021          302 CLKDHP---RKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGY  348 (414)
Q Consensus       302 ~l~D~~---R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGy  348 (414)
                      .+.|..   --..+++++|+|+|.|..-..---+|+|+..+.+|.-+--+
T Consensus       223 ~~~~~glessavaav~vdpsgrll~sg~~dssc~lydirg~r~iq~f~ph  272 (350)
T KOG0641|consen  223 DFHDGGLESSAVAAVAVDPSGRLLASGHADSSCMLYDIRGGRMIQRFHPH  272 (350)
T ss_pred             cccCCCcccceeEEEEECCCcceeeeccCCCceEEEEeeCCceeeeeCCC
Confidence            355555   44678999999999999877788999999999998876544


No 143
>KOG0292 consensus Vesicle coat complex COPI, alpha subunit [Intracellular trafficking, secretion, and vesicular transport]
Probab=53.04  E-value=41  Score=39.20  Aligned_cols=86  Identities=9%  Similarity=0.029  Sum_probs=71.6

Q ss_pred             ccccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceE
Q 015021          301 TCLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCL  380 (414)
Q Consensus       301 ~~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l  380 (414)
                      +-|.++..-...|..-|++-+++..-.+=.|=+|+..+..++--.+||=|    +|.+..=               |.-.
T Consensus        45 ~rFdeHdGpVRgv~FH~~qplFVSGGDDykIkVWnYk~rrclftL~GHlD----YVRt~~F---------------Hhey  105 (1202)
T KOG0292|consen   45 DRFDEHDGPVRGVDFHPTQPLFVSGGDDYKIKVWNYKTRRCLFTLLGHLD----YVRTVFF---------------HHEY  105 (1202)
T ss_pred             hhhhccCCccceeeecCCCCeEEecCCccEEEEEecccceehhhhccccc----eeEEeec---------------cCCC
Confidence            34777888888999999999999987788899999999999999999999    8876521               1111


Q ss_pred             EEEEEcCCCCeEEEeeCCCCcceee
Q 015021          381 CLAIHAPRKGIIEHLAGMADENWTT  405 (414)
Q Consensus       381 ~LvIyaprRg~lEvW~~~~g~~v~~  405 (414)
                      =-+|.|.-+-+|.||++|++++|++
T Consensus       106 PWIlSASDDQTIrIWNwqsr~~iav  130 (1202)
T KOG0292|consen  106 PWILSASDDQTIRIWNWQSRKCIAV  130 (1202)
T ss_pred             ceEEEccCCCeEEEEeccCCceEEE
Confidence            2678999999999999999999975


No 144
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=51.85  E-value=84  Score=30.83  Aligned_cols=30  Identities=13%  Similarity=0.061  Sum_probs=23.9

Q ss_pred             eeEEEECCCCCEEEEEcC-CCcEEEEEcCcc
Q 015021          310 GERLTLSPSGSLAAITDS-LGRILLLDTQAL  339 (414)
Q Consensus       310 ~~~i~~sP~~~laa~tD~-lGRV~L~D~~~~  339 (414)
                      ...++++|+|+++.++|. .++|.++|+.+.
T Consensus       128 ~~~~~~~p~g~~l~v~~~~~~~v~v~d~~~~  158 (330)
T PRK11028        128 CHSANIDPDNRTLWVPCLKEDRIRLFTLSDD  158 (330)
T ss_pred             ccEeEeCCCCCEEEEeeCCCCEEEEEEECCC
Confidence            456789999988877654 599999999873


No 145
>KOG3881 consensus Uncharacterized conserved protein [Function unknown]
Probab=51.79  E-value=29  Score=36.66  Aligned_cols=49  Identities=18%  Similarity=0.131  Sum_probs=38.2

Q ss_pred             CeEEEEEeEEeCCcEEEEEeccccEEEEEecCCcEEeecccCccceeEEEEecccCC
Q 015021           77 EYITAIEWLVFEEMRALAVGTSRGYFLVYDLKGDLVHRQLIHPGRILKLRVRGSRRD  133 (414)
Q Consensus        77 e~ITs~~~lpl~dw~~I~VG~ssG~vrfyte~G~LL~sQ~lh~~pV~~ik~r~~~~~  133 (414)
                      =+||++-+||=.+..-+|-+|-.|+||+|+.+     -|.   -||.++.++.+++.
T Consensus       203 vW~tdi~Fl~g~~~~~fat~T~~hqvR~YDt~-----~qR---RPV~~fd~~E~~is  251 (412)
T KOG3881|consen  203 VWITDIRFLEGSPNYKFATITRYHQVRLYDTR-----HQR---RPVAQFDFLENPIS  251 (412)
T ss_pred             eeeccceecCCCCCceEEEEecceeEEEecCc-----ccC---cceeEeccccCcce
Confidence            45666666655557899999999999999973     233   79999999998853


No 146
>KOG0288 consensus WD40 repeat protein TipD [General function prediction only]
Probab=51.57  E-value=15  Score=39.17  Aligned_cols=85  Identities=11%  Similarity=0.017  Sum_probs=61.0

Q ss_pred             cccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEE
Q 015021          302 CLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLC  381 (414)
Q Consensus       302 ~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~  381 (414)
                      +|.-.--.+++|..+++|+....+-+.+-+-+||+..+....-..||-|      ++.+..-+.          .+.+  
T Consensus       214 tLaGs~g~it~~d~d~~~~~~iAas~d~~~r~Wnvd~~r~~~TLsGHtd------kVt~ak~~~----------~~~~--  275 (459)
T KOG0288|consen  214 TLAGSLGNITSIDFDSDNKHVIAASNDKNLRLWNVDSLRLRHTLSGHTD------KVTAAKFKL----------SHSR--  275 (459)
T ss_pred             hhhccCCCcceeeecCCCceEEeecCCCceeeeeccchhhhhhhccccc------ceeeehhhc----------cccc--
Confidence            4666667788999999999888888899999999999999999999998      333322110          1111  


Q ss_pred             EEEEcCCCCeEEEeeCCCCcceee
Q 015021          382 LAIHAPRKGIIEHLAGMADENWTT  405 (414)
Q Consensus       382 LvIyaprRg~lEvW~~~~g~~v~~  405 (414)
                       ||-+.++..|+.|+||.+.|.-|
T Consensus       276 -vVsgs~DRtiK~WDl~k~~C~kt  298 (459)
T KOG0288|consen  276 -VVSGSADRTIKLWDLQKAYCSKT  298 (459)
T ss_pred             -eeeccccchhhhhhhhhhheecc
Confidence             66666666666666666666544


No 147
>KOG0272 consensus U4/U6 small nuclear ribonucleoprotein Prp4 (contains WD40 repeats) [RNA processing and modification]
Probab=51.09  E-value=52  Score=35.21  Aligned_cols=84  Identities=14%  Similarity=0.120  Sum_probs=63.8

Q ss_pred             ccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEE
Q 015021          303 LKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCL  382 (414)
Q Consensus       303 l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~L  382 (414)
                      |.-+-+++.++.-||+|...|+.-+++-+=++|+....-+..--+|++    =+          ++..++|..   . ++
T Consensus       341 L~gH~k~I~~V~fsPNGy~lATgs~Dnt~kVWDLR~r~~ly~ipAH~n----lV----------S~Vk~~p~~---g-~f  402 (459)
T KOG0272|consen  341 LAGHIKEILSVAFSPNGYHLATGSSDNTCKVWDLRMRSELYTIPAHSN----LV----------SQVKYSPQE---G-YF  402 (459)
T ss_pred             ecccccceeeEeECCCceEEeecCCCCcEEEeeecccccceecccccc----hh----------hheEecccC---C-eE
Confidence            555678899999999999999988888888999988888777777776    11          122333311   1 45


Q ss_pred             EEEcCCCCeEEEeeCCCCccee
Q 015021          383 AIHAPRKGIIEHLAGMADENWT  404 (414)
Q Consensus       383 vIyaprRg~lEvW~~~~g~~v~  404 (414)
                      ..-+..++.+.||+.+++.++-
T Consensus       403 L~TasyD~t~kiWs~~~~~~~k  424 (459)
T KOG0272|consen  403 LVTASYDNTVKIWSTRTWSPLK  424 (459)
T ss_pred             EEEcccCcceeeecCCCcccch
Confidence            6788999999999999988763


No 148
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=49.61  E-value=1.1e+02  Score=30.02  Aligned_cols=28  Identities=4%  Similarity=0.016  Sum_probs=22.8

Q ss_pred             eEEEECCCCCEEEEE-cCCCcEEEEEcCc
Q 015021          311 ERLTLSPSGSLAAIT-DSLGRILLLDTQA  338 (414)
Q Consensus       311 ~~i~~sP~~~laa~t-D~lGRV~L~D~~~  338 (414)
                      ..|+++|+|+++.++ ...|+|.++|+.+
T Consensus        83 ~~i~~~~~g~~l~v~~~~~~~v~v~~~~~  111 (330)
T PRK11028         83 THISTDHQGRFLFSASYNANCVSVSPLDK  111 (330)
T ss_pred             eEEEECCCCCEEEEEEcCCCeEEEEEECC
Confidence            479999999866664 5689999999974


No 149
>KOG2096 consensus WD40 repeat protein [General function prediction only]
Probab=49.34  E-value=29  Score=36.17  Aligned_cols=67  Identities=19%  Similarity=0.143  Sum_probs=40.9

Q ss_pred             CCeeeeccCcceeeeeecceEEEEeecCCCCCceeecccCCCCCCCeEEEEEeEEeCCcEEEEEeccccEEEEEe
Q 015021           32 DPNLLCALDMHTIALANRYQTVIINWADPEGLVAKIRPELSPIASEYITAIEWLVFEEMRALAVGTSRGYFLVYD  106 (414)
Q Consensus        32 ~~~~~~sp~~~~la~A~~~~~v~~~w~~~~~~~v~~~g~l~~~~~e~ITs~~~lpl~dw~~I~VG~ssG~vrfyt  106 (414)
                      +|-++|||.|+.||++.++-+=+.+   .+.++.  -+++..+-+++||+|.|-|- +-..+-+|  +-|||++.
T Consensus       334 p~RL~lsP~g~~lA~s~gs~l~~~~---se~g~~--~~~~e~~h~~~Is~is~~~~-g~~~atcG--dr~vrv~~  400 (420)
T KOG2096|consen  334 PVRLELSPSGDSLAVSFGSDLKVFA---SEDGKD--YPELEDIHSTTISSISYSSD-GKYIATCG--DRYVRVIR  400 (420)
T ss_pred             ceEEEeCCCCcEEEeecCCceEEEE---cccCcc--chhHHHhhcCceeeEEecCC-CcEEeeec--ceeeeeec
Confidence            4789999999999999999764432   221111  12333344678888877552 22344444  45666654


No 150
>KOG2919 consensus Guanine nucleotide-binding protein [General function prediction only]
Probab=49.21  E-value=49  Score=34.58  Aligned_cols=97  Identities=18%  Similarity=0.115  Sum_probs=61.9

Q ss_pred             CCCccccCCCceeeEEEECCCC-CEEEEEcCCCcEEEEEcCcc-eeEeeecccccceeeEEEEEecccccccccccCCCC
Q 015021          298 SPLTCLKDHPRKGERLTLSPSG-SLAAITDSLGRILLLDTQAL-VVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVK  375 (414)
Q Consensus       298 ~pl~~l~D~~R~~~~i~~sP~~-~laa~tD~lGRV~L~D~~~~-~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k  375 (414)
                      .|+..|-..+..++.++-+|+| +|...+--.-.|+.+|+..- .+|--..++++-.=+=|...-+.            +
T Consensus       241 ~pl~llggh~gGvThL~~~edGn~lfsGaRk~dkIl~WDiR~~~~pv~~L~rhv~~TNQRI~FDld~------------~  308 (406)
T KOG2919|consen  241 RPLQLLGGHGGGVTHLQWCEDGNKLFSGARKDDKILCWDIRYSRDPVYALERHVGDTNQRILFDLDP------------K  308 (406)
T ss_pred             CceeeecccCCCeeeEEeccCcCeecccccCCCeEEEEeehhccchhhhhhhhccCccceEEEecCC------------C
Confidence            4565566777788999999999 67777888889999999753 33333333333212222221111            1


Q ss_pred             CcceEEEEEEcCCCCeEEEeeCCC-Ccceeecccee
Q 015021          376 SDYCLCLAIHAPRKGIIEHLAGMA-DENWTTSLNYS  410 (414)
Q Consensus       376 ~~~~l~LvIyaprRg~lEvW~~~~-g~~v~~~~~~~  410 (414)
                      ..   +| .-.-.||.|.||+++. |+-+-.-.|||
T Consensus       309 ~~---~L-asG~tdG~V~vwdlk~~gn~~sv~~~~s  340 (406)
T KOG2919|consen  309 GE---IL-ASGDTDGSVRVWDLKDLGNEVSVTGNYS  340 (406)
T ss_pred             Cc---ee-eccCCCccEEEEecCCCCCccccccccc
Confidence            11   22 3345799999999999 88776666665


No 151
>PF10313 DUF2415:  Uncharacterised protein domain (DUF2415);  InterPro: IPR019417  This entry represents a short (30 residues) domain of unknown function found in a family of fungal proteins. It contains a characteristic DLL sequence motif. 
Probab=48.97  E-value=51  Score=24.35  Aligned_cols=29  Identities=28%  Similarity=0.421  Sum_probs=23.8

Q ss_pred             eEEEECCCC---CEEEEEcCCCcEEEEEcCcc
Q 015021          311 ERLTLSPSG---SLAAITDSLGRILLLDTQAL  339 (414)
Q Consensus       311 ~~i~~sP~~---~laa~tD~lGRV~L~D~~~~  339 (414)
                      .++.-||..   .|.|.+.--|||-++|+.++
T Consensus         4 R~~kFsP~~~~~DLL~~~E~~g~vhi~D~R~~   35 (43)
T PF10313_consen    4 RCCKFSPEPGGNDLLAWAEHQGRVHIVDTRSN   35 (43)
T ss_pred             EEEEeCCCCCcccEEEEEccCCeEEEEEcccC
Confidence            456677754   59999999999999999864


No 152
>PF14781 BBS2_N:  Ciliary BBSome complex subunit 2, N-terminal
Probab=48.59  E-value=71  Score=29.14  Aligned_cols=73  Identities=19%  Similarity=0.292  Sum_probs=50.0

Q ss_pred             CCeEEEEEeEEe---CCcEEEEEeccccEEEEEe--cCCcEEeecccCccceeEEEEecccCCCCCCCCCCeEEEEeCCe
Q 015021           76 SEYITAIEWLVF---EEMRALAVGTSRGYFLVYD--LKGDLVHRQLIHPGRILKLRVRGSRRDLTQDTAEEEVCVVMPGV  150 (414)
Q Consensus        76 ~e~ITs~~~lpl---~dw~~I~VG~ssG~vrfyt--e~G~LL~sQ~lh~~pV~~ik~r~~~~~~~~~~~~eel~Ily~~~  150 (414)
                      ++.|||+..=||   .+.-+++||+.+. |..|+  ++-++.++++  ++-|..|-+-+...      ..+.|.|  -+.
T Consensus        47 n~~italaaG~l~~~~~~D~LliGt~t~-llaYDV~~N~d~Fyke~--~DGvn~i~~g~~~~------~~~~l~i--vGG  115 (136)
T PF14781_consen   47 NQEITALAAGRLKPDDGRDCLLIGTQTS-LLAYDVENNSDLFYKEV--PDGVNAIVIGKLGD------IPSPLVI--VGG  115 (136)
T ss_pred             CCceEEEEEEecCCCCCcCEEEEeccce-EEEEEcccCchhhhhhC--ccceeEEEEEecCC------CCCcEEE--ECc
Confidence            688999999999   3557999999885 66787  4777777766  47777777664432      1333444  455


Q ss_pred             EEEEeChhH
Q 015021          151 LARFDGSEI  159 (414)
Q Consensus       151 iv~IdG~sL  159 (414)
                      =|.|-|++-
T Consensus       116 ncsi~Gfd~  124 (136)
T PF14781_consen  116 NCSIQGFDY  124 (136)
T ss_pred             eEEEEEeCC
Confidence            566666653


No 153
>KOG0640 consensus mRNA cleavage stimulating factor complex; subunit 1 [RNA processing and modification]
Probab=48.59  E-value=29  Score=36.11  Aligned_cols=103  Identities=19%  Similarity=0.270  Sum_probs=75.1

Q ss_pred             CC-ccccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeeccccccee----eEEEEEe----ccccc----
Q 015021          299 PL-TCLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASC----VFMEMLV----NKDAA----  365 (414)
Q Consensus       299 pl-~~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~----~wi~~~~----~~~~~----  365 (414)
                      |+ +.|+|+--+...+..-|..+..+...+++-|=|+|...-.+=|-+|=+-|++-    .|...-+    .++..    
T Consensus       163 PvIRTlYDH~devn~l~FHPre~ILiS~srD~tvKlFDfsK~saKrA~K~~qd~~~vrsiSfHPsGefllvgTdHp~~rl  242 (430)
T KOG0640|consen  163 PVIRTLYDHVDEVNDLDFHPRETILISGSRDNTVKLFDFSKTSAKRAFKVFQDTEPVRSISFHPSGEFLLVGTDHPTLRL  242 (430)
T ss_pred             ceEeehhhccCcccceeecchhheEEeccCCCeEEEEecccHHHHHHHHHhhccceeeeEeecCCCceEEEecCCCceeE
Confidence            44 67999999999999999999999999999999999999999999988888652    2221100    00100    


Q ss_pred             -------------ccc--------cccCCCCCcceEEEEEEcCCCCeEEEeeCCCCcceeec
Q 015021          366 -------------TSS--------AYYAPVKSDYCLCLAIHAPRKGIIEHLAGMADENWTTS  406 (414)
Q Consensus       366 -------------~~~--------~~~~~~k~~~~l~LvIyaprRg~lEvW~~~~g~~v~~~  406 (414)
                                   .+.        .++.+.-     -|.|-+.++|.|++|+--.++||.|-
T Consensus       243 Ydv~T~QcfvsanPd~qht~ai~~V~Ys~t~-----~lYvTaSkDG~IklwDGVS~rCv~t~  299 (430)
T KOG0640|consen  243 YDVNTYQCFVSANPDDQHTGAITQVRYSSTG-----SLYVTASKDGAIKLWDGVSNRCVRTI  299 (430)
T ss_pred             EeccceeEeeecCcccccccceeEEEecCCc-----cEEEEeccCCcEEeeccccHHHHHHH
Confidence                         000        1122111     26688999999999999999999763


No 154
>PRK02889 tolB translocation protein TolB; Provisional
Probab=48.04  E-value=72  Score=33.15  Aligned_cols=40  Identities=15%  Similarity=0.027  Sum_probs=29.5

Q ss_pred             CCCceeeEEEECCCCCEEEE-EcCCC--cEEEEEcCcceeEee
Q 015021          305 DHPRKGERLTLSPSGSLAAI-TDSLG--RILLLDTQALVVVRL  344 (414)
Q Consensus       305 D~~R~~~~i~~sP~~~laa~-tD~lG--RV~L~D~~~~~vvrm  344 (414)
                      ..+....+...||+|+.+|. ++..|  .|.++|+.+|...++
T Consensus       193 ~~~~~v~~p~wSPDG~~la~~s~~~~~~~I~~~dl~~g~~~~l  235 (427)
T PRK02889        193 SSPEPIISPAWSPDGTKLAYVSFESKKPVVYVHDLATGRRRVV  235 (427)
T ss_pred             cCCCCcccceEcCCCCEEEEEEccCCCcEEEEEECCCCCEEEe
Confidence            34455668899999976665 44444  599999999987776


No 155
>PF01403 Sema:  Sema domain;  InterPro: IPR001627 The Sema domain occurs in semaphorins, which are a large family of secreted and transmembrane proteins, some of which function as repellent signals during axon guidance. Sema domains also occur in a hepatocyte growth factor receptor, in SEX protein [] and in viral proteins. CD100 (also called SEMA4D) is associated with PTPase and serine kinase activity. CD100 increases PMA, CD3 and CD2 induced T cell proliferation, increases CD45 induced T cell adhesion, induces B cell homotypic adhesion and down-regulates B cell expression of CD23.  The Sema domain is characterised by a conserved set of cysteine residues, which form four disulphide bonds to stabilise the structure. The Sema domain fold is a variation of the beta propeller topology, with seven blades radially arranged around a central axis. Each blade contains a four- stranded (strands A to D) antiparallel beta sheet. The inner strand of each blade (A) lines the channel at the centre of the propeller, with strands B and C of the same repeat radiating outward, and strand D of the next repeat forming the outer edge of the blade. The large size of the Sema domain is not due to a single inserted domain but results from the presence of additional secondary structure elements inserted in most of the blades. The Sema domain uses a 'loop and hook' system to close the circle between the first and the last blades. The blades are constructed sequentially with an N-terminal beta- strand closing the circle by providing the outermost strand (D) of the seventh (C-terminal) blade. The beta-propeller is further stabilised by an extension of the N terminus, providing an additional, fifth beta-strand on the outer edge of blade 6 [, , ]. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0005515 protein binding; PDB: 3NVX_A 3NVQ_A 3OL2_A 1OLZ_B 3OKT_A 3AL9_B 3OKY_A 3AL8_B 3NVN_A 3OKW_A ....
Probab=47.20  E-value=81  Score=32.95  Aligned_cols=52  Identities=15%  Similarity=0.101  Sum_probs=36.2

Q ss_pred             eEEEEEeE--EeCC--cEEEEEeccccEEE-EEe--cCC-------cEEeecccCcc-ceeEEEEec
Q 015021           78 YITAIEWL--VFEE--MRALAVGTSRGYFL-VYD--LKG-------DLVHRQLIHPG-RILKLRVRG  129 (414)
Q Consensus        78 ~ITs~~~l--pl~d--w~~I~VG~ssG~vr-fyt--e~G-------~LL~sQ~lh~~-pV~~ik~r~  129 (414)
                      +.|+|..-  ...|  .+++-+||++|.|. ..-  .++       .+.-.|.+++. ||+.+++..
T Consensus       365 ~~T~i~v~~v~~~~~~~tV~flGT~~G~l~K~v~~~~~~~~~~~~~~iee~~~~~~~~pI~~~~l~~  431 (433)
T PF01403_consen  365 RLTSIAVDRVQVENGSYTVAFLGTDDGRLHKKVVLSNSSSGHYESYIIEEIQVFPDSEPIQSMKLSP  431 (433)
T ss_dssp             -EEEEEEEEEEETTTCEEEEEEEETTSEEEEEEEESSSSTCT-EEEEEEEEE-STSC-EEEEEEEET
T ss_pred             eeeEEEEEEEecCCCcEEEEEEecCCceEEEEEEecCCCCcccccEEEEEEeecCCCCceEEEEecc
Confidence            78999876  6666  89999999999999 332  222       22334577764 999998754


No 156
>PRK04792 tolB translocation protein TolB; Provisional
Probab=47.07  E-value=1.2e+02  Score=31.94  Aligned_cols=47  Identities=17%  Similarity=0.257  Sum_probs=33.1

Q ss_pred             eeeEEEECCCCCEEEE-EcCCC--cEEEEEcCcceeEee-ecccccceeeE
Q 015021          309 KGERLTLSPSGSLAAI-TDSLG--RILLLDTQALVVVRL-WKGYRDASCVF  355 (414)
Q Consensus       309 ~~~~i~~sP~~~laa~-tD~lG--RV~L~D~~~~~vvrm-wKGyRdAq~~w  355 (414)
                      .....+.+|+|++++. +|..|  .|+++|+.++.+.|+ +.|..+....|
T Consensus       307 ~~~~p~wSpDG~~I~f~s~~~g~~~Iy~~dl~~g~~~~Lt~~g~~~~~~~~  357 (448)
T PRK04792        307 IDTEPSWHPDGKSLIFTSERGGKPQIYRVNLASGKVSRLTFEGEQNLGGSI  357 (448)
T ss_pred             CccceEECCCCCEEEEEECCCCCceEEEEECCCCCEEEEecCCCCCcCeeE
Confidence            3456788999976655 56555  588999999998887 56665543333


No 157
>PF04841 Vps16_N:  Vps16, N-terminal region;  InterPro: IPR006926 This protein forms part of the Class C vacuolar protein sorting (Vps) complex. Vps16 is essential for vacuolar protein sorting, which is essential for viability in plants, but not yeast []. The Class C Vps complex is required for SNARE-mediated membrane fusion at the lysosome-like yeast vacuole. It is thought to play essential roles in membrane docking and fusion at the Golgi-to-endosome and endosome-to-vacuole stages of transport []. The role of VPS16 in this complex is not known.; GO: 0006886 intracellular protein transport, 0005737 cytoplasm
Probab=46.98  E-value=2.3e+02  Score=29.67  Aligned_cols=72  Identities=22%  Similarity=0.226  Sum_probs=45.0

Q ss_pred             ccCCCeeeeccCcceeeeeecc-eEEEE--------e-ecCCCC--CceeecccCCCCCCCeEEEEEeEEeCCcEEEEEe
Q 015021           29 LVNDPNLLCALDMHTIALANRY-QTVII--------N-WADPEG--LVAKIRPELSPIASEYITAIEWLVFEEMRALAVG   96 (414)
Q Consensus        29 L~~~~~~~~sp~~~~la~A~~~-~~v~~--------~-w~~~~~--~~v~~~g~l~~~~~e~ITs~~~lpl~dw~~I~VG   96 (414)
                      |.+ ..++++|.|-.||+.+.. |++-.        . ...+++  ..+.|.+       ++|-.+.|..   ---++|=
T Consensus        29 l~~-~~va~a~~gGpIAi~~d~~k~~~~~~~~p~~I~iys~sG~ll~~i~w~~-------~~iv~~~wt~---~e~LvvV   97 (410)
T PF04841_consen   29 LSD-YIVAVAPYGGPIAIIRDESKLVPVGSAKPNSIQIYSSSGKLLSSIPWDS-------GRIVGMGWTD---DEELVVV   97 (410)
T ss_pred             ccc-eeEEEcCCCceEEEEecCcccccccCCCCcEEEEECCCCCEeEEEEECC-------CCEEEEEECC---CCeEEEE
Confidence            445 889999999999999888 44432        0 111110  1133322       3455555533   2234455


Q ss_pred             ccccEEEEEecCCcE
Q 015021           97 TSRGYFLVYDLKGDL  111 (414)
Q Consensus        97 ~ssG~vrfyte~G~L  111 (414)
                      +++|.+++|+..|+.
T Consensus        98 ~~dG~v~vy~~~G~~  112 (410)
T PF04841_consen   98 QSDGTVRVYDLFGEF  112 (410)
T ss_pred             EcCCEEEEEeCCCce
Confidence            799999999999998


No 158
>TIGR02800 propeller_TolB tol-pal system beta propeller repeat protein TolB. The Tol-PAL system is required for bacterial outer membrane integrity. E. coli TolB is involved in the tonB-independent uptake of group A colicins (colicins A, E1, E2, E3 and K), and is necessary for the colicins to reach their respective targets after initial binding to the bacteria. It is also involved in uptake of filamentous DNA. Study of its structure suggest that the TolB protein might be involved in the recycling of peptidoglycan or in its covalent linking with lipoproteins. The Tol-Pal system is also implicated in pathogenesis of E. coli, Haemophilus ducreyi, Salmonella enterica and Vibrio cholerae, but the mechanism(s) is unclear.
Probab=46.65  E-value=1e+02  Score=31.07  Aligned_cols=43  Identities=14%  Similarity=0.030  Sum_probs=31.5

Q ss_pred             cccCCCceeeEEEECCCCCEEEEEcC---CCcEEEEEcCcceeEee
Q 015021          302 CLKDHPRKGERLTLSPSGSLAAITDS---LGRILLLDTQALVVVRL  344 (414)
Q Consensus       302 ~l~D~~R~~~~i~~sP~~~laa~tD~---lGRV~L~D~~~~~vvrm  344 (414)
                      .|.+..+...+...||+|+++|.+..   ..+|.++|+.++...++
T Consensus       184 ~l~~~~~~~~~p~~Spdg~~la~~~~~~~~~~i~v~d~~~g~~~~~  229 (417)
T TIGR02800       184 TITRSREPILSPAWSPDGQKLAYVSFESGKPEIYVQDLATGQREKV  229 (417)
T ss_pred             EeecCCCceecccCCCCCCEEEEEEcCCCCcEEEEEECCCCCEEEe
Confidence            35566666778889999987777533   24799999999866554


No 159
>KOG1407 consensus WD40 repeat protein [Function unknown]
Probab=46.35  E-value=39  Score=34.33  Aligned_cols=83  Identities=10%  Similarity=-0.067  Sum_probs=57.4

Q ss_pred             ccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccc----eeeEEEEEecccccccccccCCCCCcc
Q 015021          303 LKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDA----SCVFMEMLVNKDAATSSAYYAPVKSDY  378 (414)
Q Consensus       303 l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdA----q~~wi~~~~~~~~~~~~~~~~~~k~~~  378 (414)
                      +....-+..+|+-.-+|+..|.....+.|.++++..+..+.-|-+++.-    |..|-+.                  +.
T Consensus        16 ~~~~~~~v~Sv~wn~~g~~lasgs~dktv~v~n~e~~r~~~~~~~~gh~~svdql~w~~~------------------~~   77 (313)
T KOG1407|consen   16 LQGHVQKVHSVAWNCDGTKLASGSFDKTVSVWNLERDRFRKELVYRGHTDSVDQLCWDPK------------------HP   77 (313)
T ss_pred             hhhhhhcceEEEEcccCceeeecccCCceEEEEecchhhhhhhcccCCCcchhhheeCCC------------------CC
Confidence            4455667889999999999999888888888888888555555444432    4444431                  11


Q ss_pred             eEEEEEEcCCCCeEEEeeCCCCcceee
Q 015021          379 CLCLAIHAPRKGIIEHLAGMADENWTT  405 (414)
Q Consensus       379 ~l~LvIyaprRg~lEvW~~~~g~~v~~  405 (414)
                      -||  .-|.-.-.+.+|+.+.|++++.
T Consensus        78 d~~--atas~dk~ir~wd~r~~k~~~~  102 (313)
T KOG1407|consen   78 DLF--ATASGDKTIRIWDIRSGKCTAR  102 (313)
T ss_pred             cce--EEecCCceEEEEEeccCcEEEE
Confidence            223  3444567899999999999865


No 160
>PF04841 Vps16_N:  Vps16, N-terminal region;  InterPro: IPR006926 This protein forms part of the Class C vacuolar protein sorting (Vps) complex. Vps16 is essential for vacuolar protein sorting, which is essential for viability in plants, but not yeast []. The Class C Vps complex is required for SNARE-mediated membrane fusion at the lysosome-like yeast vacuole. It is thought to play essential roles in membrane docking and fusion at the Golgi-to-endosome and endosome-to-vacuole stages of transport []. The role of VPS16 in this complex is not known.; GO: 0006886 intracellular protein transport, 0005737 cytoplasm
Probab=46.28  E-value=65  Score=33.74  Aligned_cols=57  Identities=25%  Similarity=0.403  Sum_probs=39.4

Q ss_pred             CCcEEEEEecccc-EEEEEecCCcEEeecccCccceeEEEEecccCCCCCCCCCCeEEEEeCCeEEEE
Q 015021           88 EEMRALAVGTSRG-YFLVYDLKGDLVHRQLIHPGRILKLRVRGSRRDLTQDTAEEEVCVVMPGVLARF  154 (414)
Q Consensus        88 ~dw~~I~VG~ssG-~vrfyte~G~LL~sQ~lh~~pV~~ik~r~~~~~~~~~~~~eel~Ily~~~iv~I  154 (414)
                      .+..++.+|...- +|++||-+|.+|-+-.+..++|+.+-   +-.       .|+|.||..++.+++
T Consensus        48 d~~k~~~~~~~~p~~I~iys~sG~ll~~i~w~~~~iv~~~---wt~-------~e~LvvV~~dG~v~v  105 (410)
T PF04841_consen   48 DESKLVPVGSAKPNSIQIYSSSGKLLSSIPWDSGRIVGMG---WTD-------DEELVVVQSDGTVRV  105 (410)
T ss_pred             cCcccccccCCCCcEEEEECCCCCEeEEEEECCCCEEEEE---ECC-------CCeEEEEEcCCEEEE
Confidence            4444555555544 69999999999877555555555554   322       788999999987655


No 161
>KOG1538 consensus Uncharacterized conserved protein WDR10, contains WD40 repeats [General function prediction only]
Probab=45.28  E-value=2.2e+02  Score=32.74  Aligned_cols=49  Identities=14%  Similarity=0.077  Sum_probs=37.6

Q ss_pred             eeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEe
Q 015021          310 GERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLV  360 (414)
Q Consensus       310 ~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~  360 (414)
                      +..|.++-.-+..|+.|.-||..+.|+.++.++-.=-|.-.  +.|=...|
T Consensus       459 vrClDINA~R~kLAvVDD~~~c~v~DI~t~elL~QEpg~nS--V~wNT~~E  507 (1081)
T KOG1538|consen  459 VRCLDINASRKKLAVVDDNDTCLVYDIDTKELLFQEPGANS--VAWNTQCE  507 (1081)
T ss_pred             eEEeeccCCcceEEEEccCCeEEEEEccCCceEeecCCCce--EEeecccc
Confidence            45667777778889999999999999999999987666544  55654443


No 162
>KOG0301 consensus Phospholipase A2-activating protein (contains WD40 repeats) [Lipid transport and metabolism]
Probab=43.83  E-value=84  Score=35.61  Aligned_cols=77  Identities=13%  Similarity=0.059  Sum_probs=54.0

Q ss_pred             ccccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceE
Q 015021          301 TCLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCL  380 (414)
Q Consensus       301 ~~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l  380 (414)
                      ..|.-+.-....+++=|++.++ .++|+|-|.++|+ +|.+++-.-|++.    ++-......          ..     
T Consensus       173 ~tf~gHtD~VRgL~vl~~~~fl-ScsNDg~Ir~w~~-~ge~l~~~~ghtn----~vYsis~~~----------~~-----  231 (745)
T KOG0301|consen  173 KTFSGHTDCVRGLAVLDDSHFL-SCSNDGSIRLWDL-DGEVLLEMHGHTN----FVYSISMAL----------SD-----  231 (745)
T ss_pred             hhhccchhheeeeEEecCCCeE-eecCCceEEEEec-cCceeeeeeccce----EEEEEEecC----------CC-----
Confidence            4456666677788888887654 5689999999999 6666666679998    664433111          00     


Q ss_pred             EEEEEcCCCCeEEEeeCC
Q 015021          381 CLAIHAPRKGIIEHLAGM  398 (414)
Q Consensus       381 ~LvIyaprRg~lEvW~~~  398 (414)
                      -+++-..-++.|+||...
T Consensus       232 ~~Ivs~gEDrtlriW~~~  249 (745)
T KOG0301|consen  232 GLIVSTGEDRTLRIWKKD  249 (745)
T ss_pred             CeEEEecCCceEEEeecC
Confidence            267778888888888765


No 163
>KOG0647 consensus mRNA export protein (contains WD40 repeats) [RNA processing and modification]
Probab=43.23  E-value=1.5e+02  Score=30.76  Aligned_cols=47  Identities=19%  Similarity=0.094  Sum_probs=36.0

Q ss_pred             eeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccc---eeeEEEE
Q 015021          310 GERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDA---SCVFMEM  358 (414)
Q Consensus       310 ~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdA---q~~wi~~  358 (414)
                      ...++-+-+|+.+.+.+..|-+-|+|++++++..+=  --||   .|.||+.
T Consensus        75 vL~v~WsddgskVf~g~~Dk~~k~wDL~S~Q~~~v~--~Hd~pvkt~~wv~~  124 (347)
T KOG0647|consen   75 VLDVCWSDDGSKVFSGGCDKQAKLWDLASGQVSQVA--AHDAPVKTCHWVPG  124 (347)
T ss_pred             eEEEEEccCCceEEeeccCCceEEEEccCCCeeeee--ecccceeEEEEecC
Confidence            345777888999999999999999999999887652  2222   4677754


No 164
>PF13464 DUF4115:  Domain of unknown function (DUF4115)
Probab=42.73  E-value=74  Score=25.23  Aligned_cols=55  Identities=13%  Similarity=0.264  Sum_probs=40.2

Q ss_pred             eccccEEEEEecCCcEEeecccCccceeEEEEecccCCCCCCCCCCeEEEEeCCeE-EEEeChhHH
Q 015021           96 GTSRGYFLVYDLKGDLVHRQLIHPGRILKLRVRGSRRDLTQDTAEEEVCVVMPGVL-ARFDGSEIQ  160 (414)
Q Consensus        96 G~ssG~vrfyte~G~LL~sQ~lh~~pV~~ik~r~~~~~~~~~~~~eel~Ily~~~i-v~IdG~sL~  160 (414)
                      ....-+|++.+.+|..+++.++++.-...+....          .=+|.+=.++++ +.+||.++-
T Consensus         5 a~~~sWv~V~d~dG~~~~~~~l~~G~~~~~~~~~----------~~~i~iGna~~v~v~~nG~~~~   60 (77)
T PF13464_consen    5 ATGDSWVEVTDADGKVLFSGTLKAGETKTFEGKE----------PFRIRIGNAGAVEVTVNGKPVD   60 (77)
T ss_pred             EeCCeEEEEEeCCCcEeeeeeeCCCcEEEEeCCC----------CEEEEEeCCCcEEEEECCEECC
Confidence            3456789999999999999999988887774221          222555666665 888888773


No 165
>KOG0284 consensus Polyadenylation factor I complex, subunit PFS2 [RNA processing and modification]
Probab=41.71  E-value=37  Score=36.26  Aligned_cols=92  Identities=14%  Similarity=0.141  Sum_probs=63.5

Q ss_pred             cccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEE
Q 015021          302 CLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLC  381 (414)
Q Consensus       302 ~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~  381 (414)
                      ++..++--+..+..|++|+|+...|..|-|=.++.+ ...|.|.+++-++++-=+-... .|          .|     |
T Consensus       133 ilQaHDs~Vr~m~ws~~g~wmiSgD~gG~iKyWqpn-mnnVk~~~ahh~eaIRdlafSp-nD----------sk-----F  195 (464)
T KOG0284|consen  133 ILQAHDSPVRTMKWSHNGTWMISGDKGGMIKYWQPN-MNNVKIIQAHHAEAIRDLAFSP-ND----------SK-----F  195 (464)
T ss_pred             HhhhhcccceeEEEccCCCEEEEcCCCceEEecccc-hhhhHHhhHhhhhhhheeccCC-CC----------ce-----e
Confidence            456666677889999999999999999999999975 5667788777765543332211 11          11     2


Q ss_pred             EEEEcCCCCeEEEeeCCCCcc--eeeccceeee
Q 015021          382 LAIHAPRKGIIEHLAGMADEN--WTTSLNYSVR  412 (414)
Q Consensus       382 LvIyaprRg~lEvW~~~~g~~--v~~~~~~~~~  412 (414)
                        .-+.-+|+|+||+....+.  +-+.-+|-||
T Consensus       196 --~t~SdDg~ikiWdf~~~kee~vL~GHgwdVk  226 (464)
T KOG0284|consen  196 --LTCSDDGTIKIWDFRMPKEERVLRGHGWDVK  226 (464)
T ss_pred             --EEecCCCeEEEEeccCCchhheeccCCCCcc
Confidence              4567799999999887543  4444445444


No 166
>KOG0289 consensus mRNA splicing factor [General function prediction only]
Probab=41.65  E-value=1.1e+02  Score=33.08  Aligned_cols=78  Identities=18%  Similarity=0.206  Sum_probs=57.9

Q ss_pred             eeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEcCC
Q 015021          309 KGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAPR  388 (414)
Q Consensus       309 ~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyapr  388 (414)
                      ...++++-|||-+++..-..|-|=+||+..+..+-=+-|+- +.+.=|..-|+               .|  ||+. +--
T Consensus       349 ~~ts~~fHpDgLifgtgt~d~~vkiwdlks~~~~a~Fpght-~~vk~i~FsEN---------------GY--~Lat-~ad  409 (506)
T KOG0289|consen  349 EYTSAAFHPDGLIFGTGTPDGVVKIWDLKSQTNVAKFPGHT-GPVKAISFSEN---------------GY--WLAT-AAD  409 (506)
T ss_pred             eeEEeeEcCCceEEeccCCCceEEEEEcCCccccccCCCCC-CceeEEEeccC---------------ce--EEEE-Eec
Confidence            46778899999999999999999999999999888888843 34444443322               12  5555 445


Q ss_pred             CCeEEEeeCCCCcceee
Q 015021          389 KGIIEHLAGMADENWTT  405 (414)
Q Consensus       389 Rg~lEvW~~~~g~~v~~  405 (414)
                      +|.|.+||+|-.+..-|
T Consensus       410 d~~V~lwDLRKl~n~kt  426 (506)
T KOG0289|consen  410 DGSVKLWDLRKLKNFKT  426 (506)
T ss_pred             CCeEEEEEehhhcccce
Confidence            88899999998776544


No 167
>PRK00178 tolB translocation protein TolB; Provisional
Probab=41.48  E-value=1.7e+02  Score=29.99  Aligned_cols=38  Identities=24%  Similarity=0.269  Sum_probs=28.5

Q ss_pred             eeEEEECCCCCEEEE-EcCCC--cEEEEEcCcceeEeeecc
Q 015021          310 GERLTLSPSGSLAAI-TDSLG--RILLLDTQALVVVRLWKG  347 (414)
Q Consensus       310 ~~~i~~sP~~~laa~-tD~lG--RV~L~D~~~~~vvrmwKG  347 (414)
                      ......||+|+.++. .+..|  +|+++|+.++.+.++.++
T Consensus       245 ~~~~~~SpDG~~la~~~~~~g~~~Iy~~d~~~~~~~~lt~~  285 (430)
T PRK00178        245 NGAPAWSPDGSKLAFVLSKDGNPEIYVMDLASRQLSRVTNH  285 (430)
T ss_pred             cCCeEECCCCCEEEEEEccCCCceEEEEECCCCCeEEcccC
Confidence            345788999976664 55555  799999999998887653


No 168
>KOG0310 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=41.44  E-value=1.6e+02  Score=32.00  Aligned_cols=86  Identities=12%  Similarity=0.033  Sum_probs=62.8

Q ss_pred             CccccCCCceeeEEEECCCC-CEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcc
Q 015021          300 LTCLKDHPRKGERLTLSPSG-SLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDY  378 (414)
Q Consensus       300 l~~l~D~~R~~~~i~~sP~~-~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~  378 (414)
                      |+.++-+.-....+--+|++ +.++.+|..+-+-++|+.+..+.--.-|+-|    .+.+-.-.+          .    
T Consensus       103 LR~~~ah~apv~~~~f~~~d~t~l~s~sDd~v~k~~d~s~a~v~~~l~~htD----YVR~g~~~~----------~----  164 (487)
T KOG0310|consen  103 LRQLYAHQAPVHVTKFSPQDNTMLVSGSDDKVVKYWDLSTAYVQAELSGHTD----YVRCGDISP----------A----  164 (487)
T ss_pred             HHHHhhccCceeEEEecccCCeEEEecCCCceEEEEEcCCcEEEEEecCCcc----eeEeecccc----------C----
Confidence            34566677777888899988 5666678888888899999987656778888    555432211          1    


Q ss_pred             eEEEEEEcCCCCeEEEeeCCCC-cce
Q 015021          379 CLCLAIHAPRKGIIEHLAGMAD-ENW  403 (414)
Q Consensus       379 ~l~LvIyaprRg~lEvW~~~~g-~~v  403 (414)
                      +=.+|.-...+|.|.+|++|.- +.+
T Consensus       165 ~~hivvtGsYDg~vrl~DtR~~~~~v  190 (487)
T KOG0310|consen  165 NDHIVVTGSYDGKVRLWDTRSLTSRV  190 (487)
T ss_pred             CCeEEEecCCCceEEEEEeccCCcee
Confidence            1148899999999999999986 555


No 169
>PF04762 IKI3:  IKI3 family;  InterPro: IPR006849 Members of this family are components of the elongator multi-subunit component of a novel RNA polymerase II holoenzyme for transcriptional elongation [].
Probab=41.35  E-value=3.5e+02  Score=31.83  Aligned_cols=165  Identities=16%  Similarity=0.148  Sum_probs=0.0

Q ss_pred             CcccceeeeeeecccccccCC-CCCcccCCCeeeeccCcceeeeeec----ceEEEEeecCCCCCceeecccCCCCCCCe
Q 015021            4 RTHTTEVGSIACTDLSDLGAG-KEGWLVNDPNLLCALDMHTIALANR----YQTVIINWADPEGLVAKIRPELSPIASEY   78 (414)
Q Consensus         4 ~~~~~~~~~~~~~~~~~~g~~-~~~WL~~~~~~~~sp~~~~la~A~~----~~~v~~~w~~~~~~~v~~~g~l~~~~~e~   78 (414)
                      |.+.+| |-+-+..+.--|+. .=.|=         |.|.+||-+++    ..+||-.-+.-.++..+.+-  ++ +.+.
T Consensus       240 RVy~Re-G~L~stSE~v~gLe~~l~Wr---------PsG~lIA~~q~~~~~~~VvFfErNGLrhgeF~l~~--~~-~~~~  306 (928)
T PF04762_consen  240 RVYSRE-GELQSTSEPVDGLEGALSWR---------PSGNLIASSQRLPDRHDVVFFERNGLRHGEFTLRF--DP-EEEK  306 (928)
T ss_pred             EEECCC-ceEEeccccCCCccCCccCC---------CCCCEEEEEEEcCCCcEEEEEecCCcEeeeEecCC--CC-CCce


Q ss_pred             EEEEEeEEeCCcEEEEEeccccEEEEEe-cCCcEEeecccCccceeEEEEecccCCCCCCCCCCeEEEEeC---------
Q 015021           79 ITAIEWLVFEEMRALAVGTSRGYFLVYD-LKGDLVHRQLIHPGRILKLRVRGSRRDLTQDTAEEEVCVVMP---------  148 (414)
Q Consensus        79 ITs~~~lpl~dw~~I~VG~ssG~vrfyt-e~G~LL~sQ~lh~~pV~~ik~r~~~~~~~~~~~~eel~Ily~---------  148 (414)
                      |..+.|=  .|.+++||-+.+. |++|| -|-.-=++|-++...-..+..=.+-+     +..-.|.|+..         
T Consensus       307 v~~l~Wn--~ds~iLAv~~~~~-vqLWt~~NYHWYLKqei~~~~~~~~~~~~Wdp-----e~p~~L~v~t~~g~~~~~~~  378 (928)
T PF04762_consen  307 VIELAWN--SDSEILAVWLEDR-VQLWTRSNYHWYLKQEIRFSSSESVNFVKWDP-----EKPLRLHVLTSNGQYEIYDF  378 (928)
T ss_pred             eeEEEEC--CCCCEEEEEecCC-ceEEEeeCCEEEEEEEEEccCCCCCCceEECC-----CCCCEEEEEecCCcEEEEEE


Q ss_pred             ---------------CeEEEEeChhHHHHHHHHHHhccccccCCCCccCCCcccccccCCccceecccCCCCCceeeEE
Q 015021          149 ---------------GVLARFDGSEIQKMLQRWFQDSNSNFWDQKPKQRDSEDLENSYERLPHQLWNVSKYGPCADAAI  212 (414)
Q Consensus       149 ---------------~~iv~IdG~sL~~~Lr~c~~~~~~~~w~~~~~~~~~~~~~~~~~~L~ykKW~l~~~~~i~Daa~  212 (414)
                                     +.++.|||..|                     .--.-.....|||+.+..-.++.  +|+|++|
T Consensus       379 ~~~v~~s~~~~~~D~g~vaVIDG~~l---------------------llTpf~~a~VPPPMs~~~l~~~~--~v~~vaf  434 (928)
T PF04762_consen  379 AWDVSRSPGSSPNDNGTVAVIDGNKL---------------------LLTPFRRAVVPPPMSSYELELPS--PVNDVAF  434 (928)
T ss_pred             EEEEEecCCCCccCceEEEEEeCCeE---------------------EEecccccCCCchHhceEEcCCC--CcEEEEE


No 170
>KOG4190 consensus Uncharacterized conserved protein [Function unknown]
Probab=41.31  E-value=18  Score=40.04  Aligned_cols=51  Identities=22%  Similarity=0.372  Sum_probs=43.5

Q ss_pred             eeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecc
Q 015021          309 KGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNK  362 (414)
Q Consensus       309 ~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~  362 (414)
                      -...|++.|.|+|.|++=+-|-|..+|..+|.+|.-|   |--+|.-+|.-.+.
T Consensus       878 ~~R~iaVa~~GN~lAa~LSnGci~~LDaR~G~vINsw---rpmecdllqlaaps  928 (1034)
T KOG4190|consen  878 LTRAIAVADKGNKLAAALSNGCIAILDARNGKVINSW---RPMECDLLQLAAPS  928 (1034)
T ss_pred             heeEEEeccCcchhhHHhcCCcEEEEecCCCceeccC---CcccchhhhhcCch
Confidence            3467999999999999999999999999999999988   67778777765544


No 171
>KOG1009 consensus Chromatin assembly complex 1 subunit B/CAC2 (contains WD40 repeats) [Chromatin structure and dynamics; Replication, recombination and repair]
Probab=41.04  E-value=1e+02  Score=32.97  Aligned_cols=99  Identities=17%  Similarity=0.211  Sum_probs=58.1

Q ss_pred             ccccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccc-eee--EEEEEecccccccccccCCCCCc
Q 015021          301 TCLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDA-SCV--FMEMLVNKDAATSSAYYAPVKSD  377 (414)
Q Consensus       301 ~~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdA-q~~--wi~~~~~~~~~~~~~~~~~~k~~  377 (414)
                      ..|.-+.|....|--+|+|.+.|..+..|-|+|+=...   +|..-.-+.| +..  |+......-.. ..+....=..+
T Consensus        59 s~Ls~H~~aVN~vRf~p~gelLASg~D~g~v~lWk~~~---~~~~~~d~e~~~~ke~w~v~k~lr~h~-~diydL~Ws~d  134 (434)
T KOG1009|consen   59 SSLSRHTRAVNVVRFSPDGELLASGGDGGEVFLWKQGD---VRIFDADTEADLNKEKWVVKKVLRGHR-DDIYDLAWSPD  134 (434)
T ss_pred             ecccCCcceeEEEEEcCCcCeeeecCCCceEEEEEecC---cCCccccchhhhCccceEEEEEecccc-cchhhhhccCC
Confidence            46888899999999999999999999999999987663   1211000000 001  22211100000 00000000011


Q ss_pred             ceEEEEEEcCCCCeEEEeeCCCCcceee
Q 015021          378 YCLCLAIHAPRKGIIEHLAGMADENWTT  405 (414)
Q Consensus       378 ~~l~LvIyaprRg~lEvW~~~~g~~v~~  405 (414)
                       . -+++.+.-+..+-+|++..|+.+++
T Consensus       135 -~-~~l~s~s~dns~~l~Dv~~G~l~~~  160 (434)
T KOG1009|consen  135 -S-NFLVSGSVDNSVRLWDVHAGQLLAI  160 (434)
T ss_pred             -C-ceeeeeeccceEEEEEeccceeEee
Confidence             1 2556777799999999999998865


No 172
>PRK02889 tolB translocation protein TolB; Provisional
Probab=40.62  E-value=54  Score=34.06  Aligned_cols=40  Identities=28%  Similarity=0.363  Sum_probs=30.1

Q ss_pred             eEEEECCCCCEEEEEcCCC---cEEEEEcCcceeEeeeccccc
Q 015021          311 ERLTLSPSGSLAAITDSLG---RILLLDTQALVVVRLWKGYRD  350 (414)
Q Consensus       311 ~~i~~sP~~~laa~tD~lG---RV~L~D~~~~~vvrmwKGyRd  350 (414)
                      .....||+|+++|.+...|   +|.++|+.++...++-.+..+
T Consensus       331 ~~~~~SpDG~~Ia~~s~~~g~~~I~v~d~~~g~~~~lt~~~~~  373 (427)
T PRK02889        331 TSPRISPDGKLLAYISRVGGAFKLYVQDLATGQVTALTDTTRD  373 (427)
T ss_pred             CceEECCCCCEEEEEEccCCcEEEEEEECCCCCeEEccCCCCc
Confidence            4578999999888765444   699999999987776555433


No 173
>KOG2445 consensus Nuclear pore complex component (sc Seh1) [Nuclear structure; Intracellular trafficking, secretion, and vesicular transport]
Probab=40.03  E-value=1.2e+02  Score=31.53  Aligned_cols=34  Identities=26%  Similarity=0.432  Sum_probs=29.6

Q ss_pred             CCeEEEEEeEEe--CCcEEEEEeccccEEEEEecCCc
Q 015021           76 SEYITAIEWLVF--EEMRALAVGTSRGYFLVYDLKGD  110 (414)
Q Consensus        76 ~e~ITs~~~lpl--~dw~~I~VG~ssG~vrfyte~G~  110 (414)
                      .+-|++|.|-|-  .+...|||++-+| ||+|...+.
T Consensus       223 ~dpI~di~wAPn~Gr~y~~lAvA~kDg-v~I~~v~~~  258 (361)
T KOG2445|consen  223 TDPIRDISWAPNIGRSYHLLAVATKDG-VRIFKVKVA  258 (361)
T ss_pred             CCcceeeeeccccCCceeeEEEeecCc-EEEEEEeec
Confidence            577999999998  5668999999999 999997653


No 174
>KOG2394 consensus WD40 protein DMR-N9 [General function prediction only]
Probab=39.88  E-value=74  Score=35.20  Aligned_cols=78  Identities=13%  Similarity=0.156  Sum_probs=59.4

Q ss_pred             eeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEcCC
Q 015021          309 KGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAPR  388 (414)
Q Consensus       309 ~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyapr  388 (414)
                      -+..++.||||++.|+..-+|-+=++|-.+..++-+.|.|=.+   .+-+.=..|          +|      ++.-...
T Consensus       292 ~in~f~FS~DG~~LA~VSqDGfLRvF~fdt~eLlg~mkSYFGG---LLCvcWSPD----------GK------yIvtGGE  352 (636)
T KOG2394|consen  292 SINEFAFSPDGKYLATVSQDGFLRIFDFDTQELLGVMKSYFGG---LLCVCWSPD----------GK------YIVTGGE  352 (636)
T ss_pred             cccceeEcCCCceEEEEecCceEEEeeccHHHHHHHHHhhccc---eEEEEEcCC----------cc------EEEecCC
Confidence            6778999999999999988898999999999999999999663   222221112          23      3444667


Q ss_pred             CCeEEEeeCCCCcceee
Q 015021          389 KGIIEHLAGMADENWTT  405 (414)
Q Consensus       389 Rg~lEvW~~~~g~~v~~  405 (414)
                      +-+|-||++--++-||-
T Consensus       353 DDLVtVwSf~erRVVAR  369 (636)
T KOG2394|consen  353 DDLVTVWSFEERRVVAR  369 (636)
T ss_pred             cceEEEEEeccceEEEe
Confidence            89999999998887763


No 175
>PF00780 CNH:  CNH domain;  InterPro: IPR001180 Based on sequence similarities a domain of homology has been identified in the following proteins []:  Citron and Citron kinase. These two proteins interact with the GTP-bound forms of the small GTPases Rho and Rac but not with Cdc42. Myotonic dystrophy kinase-related Cdc42-binding kinase (MRCKalpha). This serine/threonine kinase interacts with the GTP-bound form of the small GTPase Cdc42 and to a lesser extent with that of Rac. NCK Interacting Kinase (NIK), a serine/threonine protein kinase. ROM-1 and ROM-2, from yeast. These proteins are GDP/GTP exchange proteins (GEPs) for the small GTP binding protein Rho1.  This domain, called the citron homology domain, is often found after cysteine rich and pleckstrin homology (PH) domains at the C-terminal end of the proteins []. It acts as a regulatory domain and could be involved in macromolecular interactions [, ].; GO: 0005083 small GTPase regulator activity
Probab=39.73  E-value=74  Score=30.26  Aligned_cols=61  Identities=26%  Similarity=0.352  Sum_probs=43.5

Q ss_pred             EEEEEeccccEEEEEecCCcEEeecccCccceeEEEEecccCCCCCCCCCCeEEEEeCCeEEEEeChhHHH
Q 015021           91 RALAVGTSRGYFLVYDLKGDLVHRQLIHPGRILKLRVRGSRRDLTQDTAEEEVCVVMPGVLARFDGSEIQK  161 (414)
Q Consensus        91 ~~I~VG~ssG~vrfyte~G~LL~sQ~lh~~pV~~ik~r~~~~~~~~~~~~eel~Ily~~~iv~IdG~sL~~  161 (414)
                      .-++||+++| |.+|..+..--..+.++..+|.+|.+-..         -+-|.++..+.+...|-.+|..
T Consensus         8 ~~L~vGt~~G-l~~~~~~~~~~~~~i~~~~~I~ql~vl~~---------~~~llvLsd~~l~~~~L~~l~~   68 (275)
T PF00780_consen    8 DRLLVGTEDG-LYVYDLSDPSKPTRILKLSSITQLSVLPE---------LNLLLVLSDGQLYVYDLDSLEP   68 (275)
T ss_pred             CEEEEEECCC-EEEEEecCCccceeEeecceEEEEEEecc---------cCEEEEEcCCccEEEEchhhcc
Confidence            5789999999 88888755555555666667888885532         3347888887888788777753


No 176
>KOG1645 consensus RING-finger-containing E3 ubiquitin ligase [Posttranslational modification, protein turnover, chaperones]
Probab=39.64  E-value=64  Score=34.54  Aligned_cols=80  Identities=18%  Similarity=0.185  Sum_probs=60.9

Q ss_pred             cccCCCceeeEEEECCCCC-EEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceE
Q 015021          302 CLKDHPRKGERLTLSPSGS-LAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCL  380 (414)
Q Consensus       302 ~l~D~~R~~~~i~~sP~~~-laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l  380 (414)
                      -++-..+-+..|++||.++ |+......-.|=++|+.+..++.++-.|+-   -|--..+..+            .    
T Consensus       188 ~lp~~g~~IrdlafSp~~~GLl~~asl~nkiki~dlet~~~vssy~a~~~---~wSC~wDlde------------~----  248 (463)
T KOG1645|consen  188 ILPGEGSFIRDLAFSPFNEGLLGLASLGNKIKIMDLETSCVVSSYIAYNQ---IWSCCWDLDE------------R----  248 (463)
T ss_pred             cccccchhhhhhccCccccceeeeeccCceEEEEecccceeeeheeccCC---ceeeeeccCC------------c----
Confidence            3677788899999999997 777777777888999999999999988843   3443333221            1    


Q ss_pred             EEEEEcC-CCCeEEEeeCCCCc
Q 015021          381 CLAIHAP-RKGIIEHLAGMADE  401 (414)
Q Consensus       381 ~LvIyap-rRg~lEvW~~~~g~  401 (414)
                       =+|||. .||.|=|+||++-+
T Consensus       249 -h~IYaGl~nG~VlvyD~R~~~  269 (463)
T KOG1645|consen  249 -HVIYAGLQNGMVLVYDMRQPE  269 (463)
T ss_pred             -ceeEEeccCceEEEEEccCCC
Confidence             248886 57999999999854


No 177
>PF11768 DUF3312:  Protein of unknown function (DUF3312);  InterPro: IPR024511 This is a eukaryotic family of uncharacterised proteins that contain WD40 repeats.
Probab=39.33  E-value=1e+02  Score=34.12  Aligned_cols=73  Identities=15%  Similarity=0.094  Sum_probs=55.1

Q ss_pred             ceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEcC
Q 015021          308 RKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAP  387 (414)
Q Consensus       308 R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyap  387 (414)
                      -...+.+.+|+..++++.=.+|-|+|+|..++.....=-..+=..++|..                   ..+  +++-+.
T Consensus       260 s~v~~ca~sp~E~kLvlGC~DgSiiLyD~~~~~t~~~ka~~~P~~iaWHp-------------------~ga--i~~V~s  318 (545)
T PF11768_consen  260 SQVICCARSPSEDKLVLGCEDGSIILYDTTRGVTLLAKAEFIPTLIAWHP-------------------DGA--IFVVGS  318 (545)
T ss_pred             CcceEEecCcccceEEEEecCCeEEEEEcCCCeeeeeeecccceEEEEcC-------------------CCc--EEEEEc
Confidence            35667888999998888888999999999999887664445555555552                   112  567788


Q ss_pred             CCCeEEEeeCCCCc
Q 015021          388 RKGIIEHLAGMADE  401 (414)
Q Consensus       388 rRg~lEvW~~~~g~  401 (414)
                      .||.|-+||+-...
T Consensus       319 ~qGelQ~FD~ALsp  332 (545)
T PF11768_consen  319 EQGELQCFDMALSP  332 (545)
T ss_pred             CCceEEEEEeecCc
Confidence            89999999997654


No 178
>KOG2110 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=39.17  E-value=66  Score=33.91  Aligned_cols=60  Identities=18%  Similarity=0.316  Sum_probs=47.3

Q ss_pred             CCccccCCCceeeEEEECCCCCEEEEEcCCCcEE-EEEcCcceeEeee-cccccceeeEEEE
Q 015021          299 PLTCLKDHPRKGERLTLSPSGSLAAITDSLGRIL-LLDTQALVVVRLW-KGYRDASCVFMEM  358 (414)
Q Consensus       299 pl~~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~-L~D~~~~~vvrmw-KGyRdAq~~wi~~  358 (414)
                      |...+.-++-....|+.+|+|++.|++..-|+|| ++++..|+-+.-+ +|++=+++.=+-.
T Consensus       165 ~v~~I~aH~~~lAalafs~~G~llATASeKGTVIRVf~v~~G~kl~eFRRG~~~~~IySL~F  226 (391)
T KOG2110|consen  165 PVNTINAHKGPLAALAFSPDGTLLATASEKGTVIRVFSVPEGQKLYEFRRGTYPVSIYSLSF  226 (391)
T ss_pred             eeeEEEecCCceeEEEECCCCCEEEEeccCceEEEEEEcCCccEeeeeeCCceeeEEEEEEE
Confidence            3345677777889999999999999998889887 7899999988866 5888666554443


No 179
>KOG0639 consensus Transducin-like enhancer of split protein (contains WD40 repeats) [Chromatin structure and dynamics]
Probab=38.88  E-value=32  Score=37.76  Aligned_cols=49  Identities=18%  Similarity=0.172  Sum_probs=39.2

Q ss_pred             EEEEEeEEeCCcEEEEEeccccEEEEEecCCcEEeecccCccceeEEEEec
Q 015021           79 ITAIEWLVFEEMRALAVGTSRGYFLVYDLKGDLVHRQLIHPGRILKLRVRG  129 (414)
Q Consensus        79 ITs~~~lpl~dw~~I~VG~ssG~vrfyte~G~LL~sQ~lh~~pV~~ik~r~  129 (414)
                      |-|+-+-|-.||  ++||+.+++|-+-+.+|--=+--.+|++=|++||+-.
T Consensus       595 IfSLg~cP~~dW--lavGMens~vevlh~skp~kyqlhlheScVLSlKFa~  643 (705)
T KOG0639|consen  595 IFSLGYCPTGDW--LAVGMENSNVEVLHTSKPEKYQLHLHESCVLSLKFAY  643 (705)
T ss_pred             heecccCCCccc--eeeecccCcEEEEecCCccceeecccccEEEEEEecc
Confidence            333334444778  8999999999999998888777789999999999654


No 180
>KOG2114 consensus Vacuolar assembly/sorting protein PEP5/VPS11 [Intracellular trafficking, secretion, and vesicular transport]
Probab=38.50  E-value=69  Score=37.13  Aligned_cols=55  Identities=15%  Similarity=0.218  Sum_probs=37.1

Q ss_pred             CeEEEEEeEEeCCcEEEEEeccccEEEEEe-----cCCcEEeecccCccceeEEEEecccC
Q 015021           77 EYITAIEWLVFEEMRALAVGTSRGYFLVYD-----LKGDLVHRQLIHPGRILKLRVRGSRR  132 (414)
Q Consensus        77 e~ITs~~~lpl~dw~~I~VG~ssG~vrfyt-----e~G~LL~sQ~lh~~pV~~ik~r~~~~  132 (414)
                      +.++++..+. .|-.+|||||.+|.|.+|-     +.|...--+.--.+||..|-+|..+.
T Consensus       125 ~~p~s~l~Vs-~~l~~Iv~Gf~nG~V~~~~GDi~RDrgsr~~~~~~~~~pITgL~~~~d~~  184 (933)
T KOG2114|consen  125 PSPASSLAVS-EDLKTIVCGFTNGLVICYKGDILRDRGSRQDYSHRGKEPITGLALRSDGK  184 (933)
T ss_pred             CCcceEEEEE-ccccEEEEEecCcEEEEEcCcchhccccceeeeccCCCCceeeEEecCCc
Confidence            3334443333 3578999999999999995     45553333333357999999988764


No 181
>PF06200 tify:  tify domain;  InterPro: IPR010399 The tify domain is a 36-amino acid domain only found among Embryophyta (land plants). It has been named after the most conserved amino acid pattern (TIF[F/Y]XG) it contains, but was previously known as the Zim domain. As the use of uppercase characters (TIFY) might imply that the domain is fully conserved across proteins, a lowercase lettering has been chosen in an attempt to highlight the reality of its natural variability.  Based on the domain architecture, tify domain containing proteins can be classified into two groups. Group I is formed by proteins possessing a CCT (CONSTANS, CO-like, and TOC1) domain and a GATA-type zinc finger in addition to the tify domain. Group II contains proteins characterised by the tify domain but lacking a GATA-type zinc finger. Tify domain containing proteins might be involved in developmental processes and some of them have features that are characteristic for transcription factors: a nuclear localisation and the presence of a putative DNA-binding domain []. Some proteins known to contain a tify domain include:   Arabidopsis thaliana Zinc-finger protein expressed in Inflorescence Meristem (ZIM), a putative transcription factor involved in inflorescence and flower development [, ].  A. thaliana ZIM-like proteins (ZML) [].  A. thaliana PEAPOD1 and PEAPOD2 (PPD1 and PPD2) [].   
Probab=38.23  E-value=57  Score=23.20  Aligned_cols=24  Identities=17%  Similarity=0.302  Sum_probs=19.3

Q ss_pred             CCCCeEEEEeCCeEEEEeChhHHH
Q 015021          138 TAEEEVCVVMPGVLARFDGSEIQK  161 (414)
Q Consensus       138 ~~~eel~Ily~~~iv~IdG~sL~~  161 (414)
                      .+...|+|.|.+.++.+|+..-..
T Consensus         3 ~~~~qLTIfY~G~V~Vfd~v~~~K   26 (36)
T PF06200_consen    3 PETAQLTIFYGGQVCVFDDVPPDK   26 (36)
T ss_pred             CCCCcEEEEECCEEEEeCCCCHHH
Confidence            346679999999999999876543


No 182
>KOG0293 consensus WD40 repeat-containing protein [Function unknown]
Probab=37.77  E-value=1.2e+02  Score=32.76  Aligned_cols=85  Identities=16%  Similarity=0.085  Sum_probs=56.4

Q ss_pred             cccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeec---ccccceeeEEEEEecccccccccccCCCCCcc
Q 015021          302 CLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWK---GYRDASCVFMEMLVNKDAATSSAYYAPVKSDY  378 (414)
Q Consensus       302 ~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwK---GyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~  378 (414)
                      .|.|...++=-++-||+|+|.|.+...-..+++++....=+.+.|   |+-+ -+.+|.-.             |.  ++
T Consensus       219 il~~htdEVWfl~FS~nGkyLAsaSkD~Taiiw~v~~d~~~kl~~tlvgh~~-~V~yi~wS-------------PD--dr  282 (519)
T KOG0293|consen  219 ILQDHTDEVWFLQFSHNGKYLASASKDSTAIIWIVVYDVHFKLKKTLVGHSQ-PVSYIMWS-------------PD--DR  282 (519)
T ss_pred             hHhhCCCcEEEEEEcCCCeeEeeccCCceEEEEEEecCcceeeeeeeecccC-ceEEEEEC-------------CC--CC
Confidence            589999999999999999999998888888888776655454443   3322 12222111             00  11


Q ss_pred             eEEEEEEcCCCCeEEEeeCCCCcceee
Q 015021          379 CLCLAIHAPRKGIIEHLAGMADENWTT  405 (414)
Q Consensus       379 ~l~LvIyaprRg~lEvW~~~~g~~v~~  405 (414)
                        +|+-- |-.-.+..|+.++|.+..+
T Consensus       283 --yLlaC-g~~e~~~lwDv~tgd~~~~  306 (519)
T KOG0293|consen  283 --YLLAC-GFDEVLSLWDVDTGDLRHL  306 (519)
T ss_pred             --eEEec-CchHheeeccCCcchhhhh
Confidence              34333 3355689999999998754


No 183
>PRK04792 tolB translocation protein TolB; Provisional
Probab=37.39  E-value=1.2e+02  Score=31.91  Aligned_cols=37  Identities=22%  Similarity=0.330  Sum_probs=28.6

Q ss_pred             eEEEECCCCCEEEE-EcCCC--cEEEEEcCcceeEeeecc
Q 015021          311 ERLTLSPSGSLAAI-TDSLG--RILLLDTQALVVVRLWKG  347 (414)
Q Consensus       311 ~~i~~sP~~~laa~-tD~lG--RV~L~D~~~~~vvrmwKG  347 (414)
                      .....||+|+.+|. .+..|  +|+++|+.++.+.++-.+
T Consensus       265 ~~~~wSPDG~~La~~~~~~g~~~Iy~~dl~tg~~~~lt~~  304 (448)
T PRK04792        265 GAPRFSPDGKKLALVLSKDGQPEIYVVDIATKALTRITRH  304 (448)
T ss_pred             CCeeECCCCCEEEEEEeCCCCeEEEEEECCCCCeEECccC
Confidence            46789999975554 55556  499999999998887654


No 184
>KOG0650 consensus WD40 repeat nucleolar protein Bop1, involved in ribosome biogenesis [Translation, ribosomal structure and biogenesis]
Probab=37.19  E-value=61  Score=36.26  Aligned_cols=43  Identities=19%  Similarity=0.236  Sum_probs=39.1

Q ss_pred             ccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeee
Q 015021          303 LKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLW  345 (414)
Q Consensus       303 l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmw  345 (414)
                      +.-+.-...+|+++|.|.|+|..-.+|-|=++.+++|..||-|
T Consensus       396 yrGHtg~Vr~iSvdp~G~wlasGsdDGtvriWEi~TgRcvr~~  438 (733)
T KOG0650|consen  396 YRGHTGLVRSISVDPSGEWLASGSDDGTVRIWEIATGRCVRTV  438 (733)
T ss_pred             EeccCCeEEEEEecCCcceeeecCCCCcEEEEEeecceEEEEE
Confidence            5666777889999999999999999999999999999999976


No 185
>KOG1036 consensus Mitotic spindle checkpoint protein BUB3, WD repeat superfamily [Cell cycle control, cell division, chromosome partitioning]
Probab=35.67  E-value=1.3e+02  Score=31.20  Aligned_cols=41  Identities=15%  Similarity=0.102  Sum_probs=37.5

Q ss_pred             eeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeeccccc
Q 015021          310 GERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRD  350 (414)
Q Consensus       310 ~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRd  350 (414)
                      ..+|+.+|-.+.+|+.+++|-|..||+.+..=++.+-+|-+
T Consensus       235 VNai~Fhp~~~tfaTgGsDG~V~~Wd~~~rKrl~q~~~~~~  275 (323)
T KOG1036|consen  235 VNAIAFHPIHGTFATGGSDGIVNIWDLFNRKRLKQLAKYET  275 (323)
T ss_pred             eceeEeccccceEEecCCCceEEEccCcchhhhhhccCCCC
Confidence            46899999999999999999999999999999999999943


No 186
>PRK05137 tolB translocation protein TolB; Provisional
Probab=35.54  E-value=1.9e+02  Score=30.05  Aligned_cols=81  Identities=12%  Similarity=0.033  Sum_probs=0.0

Q ss_pred             ccCCCceeeEEEECCCCCEEEEEc---CCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcce
Q 015021          303 LKDHPRKGERLTLSPSGSLAAITD---SLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYC  379 (414)
Q Consensus       303 l~D~~R~~~~i~~sP~~~laa~tD---~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~  379 (414)
                      +..+.+.......||+|+.+|.+.   ..+.|+++|+.+|.. +....+...-...-..++..                 
T Consensus       197 lt~~~~~v~~p~wSpDG~~lay~s~~~g~~~i~~~dl~~g~~-~~l~~~~g~~~~~~~SPDG~-----------------  258 (435)
T PRK05137        197 LTDGSSLVLTPRFSPNRQEITYMSYANGRPRVYLLDLETGQR-ELVGNFPGMTFAPRFSPDGR-----------------  258 (435)
T ss_pred             EecCCCCeEeeEECCCCCEEEEEEecCCCCEEEEEECCCCcE-EEeecCCCcccCcEECCCCC-----------------


Q ss_pred             EEEEEEcCCCCeEEEe--eCCCCcc
Q 015021          380 LCLAIHAPRKGIIEHL--AGMADEN  402 (414)
Q Consensus       380 l~LvIyaprRg~lEvW--~~~~g~~  402 (414)
                       .|++-..++|..+||  ++.+|+.
T Consensus       259 -~la~~~~~~g~~~Iy~~d~~~~~~  282 (435)
T PRK05137        259 -KVVMSLSQGGNTDIYTMDLRSGTT  282 (435)
T ss_pred             -EEEEEEecCCCceEEEEECCCCce


No 187
>TIGR03300 assembly_YfgL outer membrane assembly lipoprotein YfgL. Members of this protein family are YfgL, a lipoprotein component of a complex that acts protein insertion into the bacterial outer membrane. Other members of this complex are NlpB, YfiO, and YaeT. This protein contains multiple copies of a repeat that, in other contexts, are associated with binding of the coenzyme PQQ.
Probab=34.46  E-value=1.5e+02  Score=29.64  Aligned_cols=64  Identities=17%  Similarity=0.342  Sum_probs=37.2

Q ss_pred             CCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEcCCCCeEEEeeCC
Q 015021          319 GSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAPRKGIIEHLAGM  398 (414)
Q Consensus       319 ~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyaprRg~lEvW~~~  398 (414)
                      +..+.+.+..|+|+.+|..+|.++  |+---+.++.-                .|.-...   .++.....|.|-.|+.+
T Consensus       105 ~~~v~v~~~~g~l~ald~~tG~~~--W~~~~~~~~~~----------------~p~v~~~---~v~v~~~~g~l~a~d~~  163 (377)
T TIGR03300       105 GGLVFVGTEKGEVIALDAEDGKEL--WRAKLSSEVLS----------------PPLVANG---LVVVRTNDGRLTALDAA  163 (377)
T ss_pred             CCEEEEEcCCCEEEEEECCCCcEe--eeeccCceeec----------------CCEEECC---EEEEECCCCeEEEEEcC
Confidence            455556667799999999999984  54222221110                0000011   23344567888888888


Q ss_pred             CCcce
Q 015021          399 ADENW  403 (414)
Q Consensus       399 ~g~~v  403 (414)
                      +|+.+
T Consensus       164 tG~~~  168 (377)
T TIGR03300       164 TGERL  168 (377)
T ss_pred             CCcee
Confidence            88764


No 188
>PF07433 DUF1513:  Protein of unknown function (DUF1513);  InterPro: IPR008311 There are currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=34.44  E-value=72  Score=32.73  Aligned_cols=40  Identities=15%  Similarity=0.116  Sum_probs=33.3

Q ss_pred             CCceeeEEEECCCCCEEEE-EcCCCcEEEEEcCcceeEeee
Q 015021          306 HPRKGERLTLSPSGSLAAI-TDSLGRILLLDTQALVVVRLW  345 (414)
Q Consensus       306 ~~R~~~~i~~sP~~~laa~-tD~lGRV~L~D~~~~~vvrmw  345 (414)
                      .+.-+.+|+.+++|.++|+ +=.-|+|.++|..+|.++..-
T Consensus       215 l~~Y~gSIa~~~~g~~ia~tsPrGg~~~~~d~~tg~~~~~~  255 (305)
T PF07433_consen  215 LNGYIGSIAADRDGRLIAVTSPRGGRVAVWDAATGRLLGSV  255 (305)
T ss_pred             hCCceEEEEEeCCCCEEEEECCCCCEEEEEECCCCCEeecc
Confidence            3466899999999986655 689999999999999987643


No 189
>KOG0321 consensus WD40 repeat-containing protein L2DTL [Function unknown]
Probab=34.27  E-value=81  Score=35.47  Aligned_cols=85  Identities=24%  Similarity=0.242  Sum_probs=52.4

Q ss_pred             EECCCC-CEEEEEcCCCcEEEEEcCcceeEeeecc-cccceeeEEEEEecccccccccccCCCCCcceEEEEEEcCCCCe
Q 015021          314 TLSPSG-SLAAITDSLGRILLLDTQALVVVRLWKG-YRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAPRKGI  391 (414)
Q Consensus       314 ~~sP~~-~laa~tD~lGRV~L~D~~~~~vvrmwKG-yRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyaprRg~  391 (414)
                      +-+|+. .++|++|..|-|.|+|+..-.-- .=|- .++-++.|=-+.+..  +      .+++     -+.|-|.-+..
T Consensus        58 s~~~n~eHiLavadE~G~i~l~dt~~~~fr-~ee~~lk~~~aH~nAifDl~--w------apge-----~~lVsasGDsT  123 (720)
T KOG0321|consen   58 SAAPNKEHILAVADEDGGIILFDTKSIVFR-LEERQLKKPLAHKNAIFDLK--W------APGE-----SLLVSASGDST  123 (720)
T ss_pred             cCCCCccceEEEecCCCceeeecchhhhcc-hhhhhhcccccccceeEeec--c------CCCc-----eeEEEccCCce
Confidence            344555 68899999999999999764432 1000 112122221111110  0      1122     46788888999


Q ss_pred             EEEeeCCCCcceeeccc----eeee
Q 015021          392 IEHLAGMADENWTTSLN----YSVR  412 (414)
Q Consensus       392 lEvW~~~~g~~v~~~~~----~~~~  412 (414)
                      +.+|++.+++++-+++|    |||+
T Consensus       124 ~r~Wdvk~s~l~G~~~~~GH~~Svk  148 (720)
T KOG0321|consen  124 IRPWDVKTSRLVGGRLNLGHTGSVK  148 (720)
T ss_pred             eeeeeeccceeecceeecccccccc
Confidence            99999999999988665    5554


No 190
>KOG3881 consensus Uncharacterized conserved protein [Function unknown]
Probab=34.24  E-value=1.2e+02  Score=32.14  Aligned_cols=76  Identities=12%  Similarity=0.097  Sum_probs=54.8

Q ss_pred             CceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEee-ecccccceeeEEEEEecccccccccccCCCCCcceEEEEEE
Q 015021          307 PRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRL-WKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIH  385 (414)
Q Consensus       307 ~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrm-wKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIy  385 (414)
                      ---..++.+.|+|+++-++|..|-+-.+|+.++..+-. +||+-.+- .=|.+.             |.  +   =++-.
T Consensus       247 E~~is~~~l~p~gn~Iy~gn~~g~l~~FD~r~~kl~g~~~kg~tGsi-rsih~h-------------p~--~---~~las  307 (412)
T KOG3881|consen  247 ENPISSTGLTPSGNFIYTGNTKGQLAKFDLRGGKLLGCGLKGITGSI-RSIHCH-------------PT--H---PVLAS  307 (412)
T ss_pred             cCcceeeeecCCCcEEEEecccchhheecccCceeeccccCCccCCc-ceEEEc-------------CC--C---ceEEe
Confidence            44567899999999999999999999999999998877 88765411 111111             11  1   14445


Q ss_pred             cCCCCeEEEeeCCCCc
Q 015021          386 APRKGIIEHLAGMADE  401 (414)
Q Consensus       386 aprRg~lEvW~~~~g~  401 (414)
                      ...+..|.|+++++-+
T Consensus       308 ~GLDRyvRIhD~ktrk  323 (412)
T KOG3881|consen  308 CGLDRYVRIHDIKTRK  323 (412)
T ss_pred             eccceeEEEeecccch
Confidence            6678889999998833


No 191
>KOG0283 consensus WD40 repeat-containing protein [Function unknown]
Probab=33.59  E-value=1.3e+02  Score=34.34  Aligned_cols=77  Identities=17%  Similarity=0.196  Sum_probs=55.6

Q ss_pred             ceeeEEEECCCC-CEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEc
Q 015021          308 RKGERLTLSPSG-SLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHA  386 (414)
Q Consensus       308 R~~~~i~~sP~~-~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIya  386 (414)
                      |+++.++-.|.. .-+.||-++-||=++|+....++.-+||+|.++=   |..+.-        ..-+|      -+|.+
T Consensus       502 ~rITG~Q~~p~~~~~vLVTSnDSrIRI~d~~~~~lv~KfKG~~n~~S---Q~~Asf--------s~Dgk------~IVs~  564 (712)
T KOG0283|consen  502 KRITGLQFFPGDPDEVLVTSNDSRIRIYDGRDKDLVHKFKGFRNTSS---QISASF--------SSDGK------HIVSA  564 (712)
T ss_pred             ceeeeeEecCCCCCeEEEecCCCceEEEeccchhhhhhhcccccCCc---ceeeeE--------ccCCC------EEEEe
Confidence            467778888776 3577788888999999999999999999999543   111100        01122      35778


Q ss_pred             CCCCeEEEeeCCCCc
Q 015021          387 PRKGIIEHLAGMADE  401 (414)
Q Consensus       387 prRg~lEvW~~~~g~  401 (414)
                      ..+.-|-||+++.-+
T Consensus       565 seDs~VYiW~~~~~~  579 (712)
T KOG0283|consen  565 SEDSWVYIWKNDSFN  579 (712)
T ss_pred             ecCceEEEEeCCCCc
Confidence            899999999986544


No 192
>KOG4497 consensus Uncharacterized conserved protein WDR8, contains WD repeats [General function prediction only]
Probab=33.30  E-value=33  Score=35.96  Aligned_cols=32  Identities=25%  Similarity=0.501  Sum_probs=27.2

Q ss_pred             EEECCCCCEEEEEcCCCcEEEEEcCcceeEeee
Q 015021          313 LTLSPSGSLAAITDSLGRILLLDTQALVVVRLW  345 (414)
Q Consensus       313 i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmw  345 (414)
                      .+.||+|+|+|+...- |+++=|+.+.++++++
T Consensus        14 c~fSp~g~yiAs~~~y-rlviRd~~tlq~~qlf   45 (447)
T KOG4497|consen   14 CSFSPCGNYIASLSRY-RLVIRDSETLQLHQLF   45 (447)
T ss_pred             eeECCCCCeeeeeeee-EEEEeccchhhHHHHH
Confidence            4789999999986444 9999999999998865


No 193
>COG3041 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=32.49  E-value=15  Score=31.34  Aligned_cols=10  Identities=60%  Similarity=1.212  Sum_probs=8.3

Q ss_pred             eeccccccee
Q 015021          344 LWKGYRDASC  353 (414)
Q Consensus       344 mwKGyRdAq~  353 (414)
                      =|||||||.+
T Consensus        53 ~wkg~RdCHi   62 (91)
T COG3041          53 DWKGYRDCHI   62 (91)
T ss_pred             Cccchhhccc
Confidence            3999999765


No 194
>KOG4283 consensus Transcription-coupled repair protein CSA, contains WD40 domain [Transcription; Replication, recombination and repair]
Probab=31.97  E-value=1.6e+02  Score=30.71  Aligned_cols=64  Identities=16%  Similarity=0.053  Sum_probs=46.1

Q ss_pred             CCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEcCCCCeEEEeeCC
Q 015021          319 GSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAPRKGIIEHLAGM  398 (414)
Q Consensus       319 ~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyaprRg~lEvW~~~  398 (414)
                      ..|+|+.-..-.|=|-|++.|..-+...|+||-    +-..+=.+           +..   |..--+.-+|.+..|++|
T Consensus       158 HcLiA~gtr~~~VrLCDi~SGs~sH~LsGHr~~----vlaV~Wsp-----------~~e---~vLatgsaDg~irlWDiR  219 (397)
T KOG4283|consen  158 HCLIAAGTRDVQVRLCDIASGSFSHTLSGHRDG----VLAVEWSP-----------SSE---WVLATGSADGAIRLWDIR  219 (397)
T ss_pred             ceEEEEecCCCcEEEEeccCCcceeeeccccCc----eEEEEecc-----------Cce---eEEEecCCCceEEEEEee
Confidence            457788766778999999999999999999993    22222111           111   444567789999999998


Q ss_pred             CC
Q 015021          399 AD  400 (414)
Q Consensus       399 ~g  400 (414)
                      .-
T Consensus       220 ra  221 (397)
T KOG4283|consen  220 RA  221 (397)
T ss_pred             cc
Confidence            63


No 195
>KOG0300 consensus WD40 repeat-containing protein [Function unknown]
Probab=31.89  E-value=1.5e+02  Score=31.02  Aligned_cols=85  Identities=13%  Similarity=0.112  Sum_probs=60.1

Q ss_pred             CCCCccccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcc-eeEeeecccccceeeEEEEEecccccccccccCCCC
Q 015021          297 ASPLTCLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQAL-VVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVK  375 (414)
Q Consensus       297 a~pl~~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~-~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k  375 (414)
                      ..+++.|.-+.-+-...+--|+.+|++.+..+-..-|+|.... ..+-++.|+-|---.=+-   +.+            
T Consensus       304 ge~v~~LtGHd~ELtHcstHptQrLVvTsSrDtTFRLWDFReaI~sV~VFQGHtdtVTS~vF---~~d------------  368 (481)
T KOG0300|consen  304 GEVVNILTGHDSELTHCSTHPTQRLVVTSSRDTTFRLWDFREAIQSVAVFQGHTDTVTSVVF---NTD------------  368 (481)
T ss_pred             CceeccccCcchhccccccCCcceEEEEeccCceeEeccchhhcceeeeecccccceeEEEE---ecC------------
Confidence            3455566667777888899999999999988888888998643 356788998882211111   001            


Q ss_pred             CcceEEEEEEcCCCCeEEEeeCCCCc
Q 015021          376 SDYCLCLAIHAPRKGIIEHLAGMADE  401 (414)
Q Consensus       376 ~~~~l~LvIyaprRg~lEvW~~~~g~  401 (414)
                           .-|+...-+..|+||+++|-+
T Consensus       369 -----d~vVSgSDDrTvKvWdLrNMR  389 (481)
T KOG0300|consen  369 -----DRVVSGSDDRTVKVWDLRNMR  389 (481)
T ss_pred             -----CceeecCCCceEEEeeecccc
Confidence                 245777778999999999854


No 196
>PLN02919 haloacid dehalogenase-like hydrolase family protein
Probab=31.86  E-value=2.1e+02  Score=33.95  Aligned_cols=82  Identities=16%  Similarity=0.124  Sum_probs=47.8

Q ss_pred             eEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeee----cccccceeeEEEEEecccccccccccCCCCCcceEEEEEEc
Q 015021          311 ERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLW----KGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHA  386 (414)
Q Consensus       311 ~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmw----KGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIya  386 (414)
                      ..|+++|+|++.++--..+||-+||..++.+..+-    .||+|-.    .....-    +.+.......+..  |.|--
T Consensus       807 ~Gvavd~dG~LYVADs~N~rIrviD~~tg~v~tiaG~G~~G~~dG~----~~~a~l----~~P~GIavd~dG~--lyVaD  876 (1057)
T PLN02919        807 LGVLCAKDGQIYVADSYNHKIKKLDPATKRVTTLAGTGKAGFKDGK----ALKAQL----SEPAGLALGENGR--LFVAD  876 (1057)
T ss_pred             ceeeEeCCCcEEEEECCCCEEEEEECCCCeEEEEeccCCcCCCCCc----cccccc----CCceEEEEeCCCC--EEEEE
Confidence            47899999987666556789999999998877654    2343310    000000    0000000000111  44556


Q ss_pred             CCCCeEEEeeCCCCcc
Q 015021          387 PRKGIIEHLAGMADEN  402 (414)
Q Consensus       387 prRg~lEvW~~~~g~~  402 (414)
                      -..+.|.+|++.+++.
T Consensus       877 t~Nn~Irvid~~~~~~  892 (1057)
T PLN02919        877 TNNSLIRYLDLNKGEA  892 (1057)
T ss_pred             CCCCEEEEEECCCCcc
Confidence            6778999999998864


No 197
>KOG0305 consensus Anaphase promoting complex, Cdc20, Cdh1, and Ama1 subunits [Cell cycle control, cell division, chromosome partitioning; Posttranslational modification, protein turnover, chaperones]
Probab=31.83  E-value=1.1e+02  Score=33.35  Aligned_cols=78  Identities=18%  Similarity=0.161  Sum_probs=57.1

Q ss_pred             CceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEc
Q 015021          307 PRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHA  386 (414)
Q Consensus       307 ~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIya  386 (414)
                      .-...++.-+|+|+++|+.=..|.|.|+|..+..-+|-.+|.=.+.++=+.   =             +.    ..++-.
T Consensus       217 ~~~vtSv~ws~~G~~LavG~~~g~v~iwD~~~~k~~~~~~~~h~~rvg~la---W-------------~~----~~lssG  276 (484)
T KOG0305|consen  217 EELVTSVKWSPDGSHLAVGTSDGTVQIWDVKEQKKTRTLRGSHASRVGSLA---W-------------NS----SVLSSG  276 (484)
T ss_pred             CCceEEEEECCCCCEEEEeecCCeEEEEehhhccccccccCCcCceeEEEe---c-------------cC----ceEEEe
Confidence            667899999999999999999999999999999999999884222222111   0             00    245666


Q ss_pred             CCCCeEEEeeCCCCccee
Q 015021          387 PRKGIIEHLAGMADENWT  404 (414)
Q Consensus       387 prRg~lEvW~~~~g~~v~  404 (414)
                      -|.|.|-+++++..+.+.
T Consensus       277 sr~~~I~~~dvR~~~~~~  294 (484)
T KOG0305|consen  277 SRDGKILNHDVRISQHVV  294 (484)
T ss_pred             cCCCcEEEEEEecchhhh
Confidence            677777777777766553


No 198
>KOG0278 consensus Serine/threonine kinase receptor-associated protein [Lipid transport and metabolism]
Probab=31.61  E-value=2.1e+02  Score=29.26  Aligned_cols=80  Identities=13%  Similarity=0.081  Sum_probs=58.0

Q ss_pred             cCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeee-cccccceeeEEEEEecccccccccccCCCCCcceEEE
Q 015021          304 KDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLW-KGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCL  382 (414)
Q Consensus       304 ~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmw-KGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~L  382 (414)
                      .|.+-.+++-.|.|.....++.-.++.+.-+|-.+|.=+-.+ ||+-.+    +.+..-.+++               -+
T Consensus       221 ~k~P~nV~SASL~P~k~~fVaGged~~~~kfDy~TgeEi~~~nkgh~gp----VhcVrFSPdG---------------E~  281 (334)
T KOG0278|consen  221 YKMPCNVESASLHPKKEFFVAGGEDFKVYKFDYNTGEEIGSYNKGHFGP----VHCVRFSPDG---------------EL  281 (334)
T ss_pred             ccCccccccccccCCCceEEecCcceEEEEEeccCCceeeecccCCCCc----eEEEEECCCC---------------ce
Confidence            355677889999999999999999999999999999988886 877652    2111111111               12


Q ss_pred             EEEcCCCCeEEEeeCCCCcc
Q 015021          383 AIHAPRKGIIEHLAGMADEN  402 (414)
Q Consensus       383 vIyaprRg~lEvW~~~~g~~  402 (414)
                      .-....+|.|.+|.+-.|+-
T Consensus       282 yAsGSEDGTirlWQt~~~~~  301 (334)
T KOG0278|consen  282 YASGSEDGTIRLWQTTPGKT  301 (334)
T ss_pred             eeccCCCceEEEEEecCCCc
Confidence            23456789999999877653


No 199
>KOG1273 consensus WD40 repeat protein [General function prediction only]
Probab=31.39  E-value=4.4e+02  Score=27.75  Aligned_cols=41  Identities=22%  Similarity=0.089  Sum_probs=36.0

Q ss_pred             ccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEe
Q 015021          303 LKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVR  343 (414)
Q Consensus       303 l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvr  343 (414)
                      |.-+-|-+.+|+=||+|++..++..+--|.|+|+..|.+++
T Consensus        61 lsaH~~pi~sl~WS~dgr~LltsS~D~si~lwDl~~gs~l~  101 (405)
T KOG1273|consen   61 LSAHVRPITSLCWSRDGRKLLTSSRDWSIKLWDLLKGSPLK  101 (405)
T ss_pred             hhccccceeEEEecCCCCEeeeecCCceeEEEeccCCCcee
Confidence            44556788999999999999999888999999999998876


No 200
>KOG2111 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=30.53  E-value=3.7e+02  Score=28.13  Aligned_cols=79  Identities=19%  Similarity=0.299  Sum_probs=57.2

Q ss_pred             ccCCCceeeEEEECCCCCEEEEEcCCCcEE-EEEcCcceeEeee-cccccceeeEEEEEecccccccccccCCCCCcceE
Q 015021          303 LKDHPRKGERLTLSPSGSLAAITDSLGRIL-LLDTQALVVVRLW-KGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCL  380 (414)
Q Consensus       303 l~D~~R~~~~i~~sP~~~laa~tD~lGRV~-L~D~~~~~vvrmw-KGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l  380 (414)
                      ++-+.-.+..++|.=+|.++|++..-|.+| ++|+.+|..+.-. +|+-.|.+.=|....+                 ..
T Consensus       177 I~AH~s~Iacv~Ln~~Gt~vATaStkGTLIRIFdt~~g~~l~E~RRG~d~A~iy~iaFSp~-----------------~s  239 (346)
T KOG2111|consen  177 INAHDSDIACVALNLQGTLVATASTKGTLIRIFDTEDGTLLQELRRGVDRADIYCIAFSPN-----------------SS  239 (346)
T ss_pred             EEcccCceeEEEEcCCccEEEEeccCcEEEEEEEcCCCcEeeeeecCCchheEEEEEeCCC-----------------cc
Confidence            666667788999999999999999999876 8899999988754 5777777666654211                 11


Q ss_pred             EEEEEcCCCCeEEEeeCCC
Q 015021          381 CLAIHAPRKGIIEHLAGMA  399 (414)
Q Consensus       381 ~LvIyaprRg~lEvW~~~~  399 (414)
                      +|+. +..+|.|-|+.++-
T Consensus       240 ~Lav-sSdKgTlHiF~l~~  257 (346)
T KOG2111|consen  240 WLAV-SSDKGTLHIFSLRD  257 (346)
T ss_pred             EEEE-EcCCCeEEEEEeec
Confidence            3443 44577777777664


No 201
>KOG1188 consensus WD40 repeat protein [General function prediction only]
Probab=30.10  E-value=50  Score=34.55  Aligned_cols=39  Identities=28%  Similarity=0.306  Sum_probs=31.4

Q ss_pred             EEEEEeccccEEEEEe-cCCcEEeecccCccceeEEEEec
Q 015021           91 RALAVGTSRGYFLVYD-LKGDLVHRQLIHPGRILKLRVRG  129 (414)
Q Consensus        91 ~~I~VG~ssG~vrfyt-e~G~LL~sQ~lh~~pV~~ik~r~  129 (414)
                      +.||||+|+|.||+|+ ++|.+|-.=--|+.-+..+|+-+
T Consensus        41 ~~vav~lSngsv~lyd~~tg~~l~~fk~~~~~~N~vrf~~   80 (376)
T KOG1188|consen   41 TAVAVSLSNGSVRLYDKGTGQLLEEFKGPPATTNGVRFIS   80 (376)
T ss_pred             eeEEEEecCCeEEEEeccchhhhheecCCCCcccceEEec
Confidence            7899999999999999 57777765566777777777655


No 202
>KOG0267 consensus Microtubule severing protein katanin p80 subunit B (contains WD40 repeats) [Cell cycle control, cell division, chromosome partitioning]
Probab=29.53  E-value=76  Score=36.22  Aligned_cols=58  Identities=28%  Similarity=0.230  Sum_probs=48.0

Q ss_pred             cccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccc--eeeEEEEE
Q 015021          302 CLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDA--SCVFMEML  359 (414)
Q Consensus       302 ~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdA--q~~wi~~~  359 (414)
                      +|.-+.--+++|.+++...|+|..-..|-|=++|+..+.++|-.-|+|+-  .+.|....
T Consensus        65 S~~~hespIeSl~f~~~E~LlaagsasgtiK~wDleeAk~vrtLtgh~~~~~sv~f~P~~  124 (825)
T KOG0267|consen   65 SLTGHESPIESLTFDTSERLLAAGSASGTIKVWDLEEAKIVRTLTGHLLNITSVDFHPYG  124 (825)
T ss_pred             eeeccCCcceeeecCcchhhhcccccCCceeeeehhhhhhhhhhhccccCcceeeeccce
Confidence            45666667899999999999999999999999999999999999999874  33444443


No 203
>PF06433 Me-amine-dh_H:  Methylamine dehydrogenase heavy chain (MADH);  InterPro: IPR009451 Methylamine dehydrogenase (1.4.99.3 from EC) is a periplasmic quinoprotein found in several methyltrophic bacteria []. It is induced when grown on methylamine as a carbon source MADH and catalyses the oxidative deamination of amines to their corresponding aldehydes. The redox cofactor of this enzyme is tryptophan tryptophylquinone (TTQ). Electrons derived from the oxidation of methylamine are passed to an electron acceptor, which is usually the blue-copper protein amicyanin (IPR002386 from INTERPRO).  RCH2NH2 + H2O + acceptor = RCHO + NH3 + reduced acceptor  MADH is a hetero-tetramer, comprised of two heavy subunits and two light subunits. The heavy subunit forms a seven-bladed beta-propeller like structure [].; GO: 0030058 amine dehydrogenase activity, 0030416 methylamine metabolic process, 0055114 oxidation-reduction process, 0042597 periplasmic space; PDB: 3RN1_F 3SVW_F 3PXT_F 3L4O_F 3L4M_D 3SJL_F 3PXS_D 3ORV_F 3RMZ_F 3RLM_F ....
Probab=29.15  E-value=2.9e+02  Score=28.94  Aligned_cols=57  Identities=9%  Similarity=-0.023  Sum_probs=36.1

Q ss_pred             cEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEcCCCCeEEEeeCCCCccee
Q 015021          330 RILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAPRKGIIEHLAGMADENWT  404 (414)
Q Consensus       330 RV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyaprRg~lEvW~~~~g~~v~  404 (414)
                      -|-++|+.++..|+-|+.=..  .-=|++....            |  .  .|+.-..-.|.|.||+..+|+.+.
T Consensus       270 eVWv~D~~t~krv~Ri~l~~~--~~Si~Vsqd~------------~--P--~L~~~~~~~~~l~v~D~~tGk~~~  326 (342)
T PF06433_consen  270 EVWVYDLKTHKRVARIPLEHP--IDSIAVSQDD------------K--P--LLYALSAGDGTLDVYDAATGKLVR  326 (342)
T ss_dssp             EEEEEETTTTEEEEEEEEEEE--ESEEEEESSS------------S-----EEEEEETTTTEEEEEETTT--EEE
T ss_pred             EEEEEECCCCeEEEEEeCCCc--cceEEEccCC------------C--c--EEEEEcCCCCeEEEEeCcCCcEEe
Confidence            488999999999998864222  1124443211            1  1  344445568999999999999884


No 204
>KOG1334 consensus WD40 repeat protein [General function prediction only]
Probab=28.83  E-value=47  Score=36.22  Aligned_cols=69  Identities=14%  Similarity=0.189  Sum_probs=48.7

Q ss_pred             CCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEcCCCCeEEEe
Q 015021          316 SPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAPRKGIIEHL  395 (414)
Q Consensus       316 sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyaprRg~lEvW  395 (414)
                      -|...|++.....|+|.+||=.++.+||..||=|.    =+.+.++               |..+=+.--..-..-|+||
T Consensus       403 GPrsEyVvSGSDCGhIFiW~K~t~eii~~MegDr~----VVNCLEp---------------HP~~PvLAsSGid~DVKIW  463 (559)
T KOG1334|consen  403 GPRSEYVVSGSDCGHIFIWDKKTGEIIRFMEGDRH----VVNCLEP---------------HPHLPVLASSGIDHDVKIW  463 (559)
T ss_pred             cCccceEEecCccceEEEEecchhHHHHHhhcccc----eEeccCC---------------CCCCchhhccCCccceeee
Confidence            46668999989999999999999999999999998    3444432               1111122233456778888


Q ss_pred             eCCCCcce
Q 015021          396 AGMADENW  403 (414)
Q Consensus       396 ~~~~g~~v  403 (414)
                      +-+.-++.
T Consensus       464 TP~~~er~  471 (559)
T KOG1334|consen  464 TPLTAERA  471 (559)
T ss_pred             cCCccccc
Confidence            87655544


No 205
>KOG0277 consensus Peroxisomal targeting signal type 2 receptor [Intracellular trafficking, secretion, and vesicular transport]
Probab=28.82  E-value=2.2e+02  Score=29.12  Aligned_cols=79  Identities=18%  Similarity=0.189  Sum_probs=50.1

Q ss_pred             ccCCCceeeEEEECCC--CCEEEEEc------CCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCC
Q 015021          303 LKDHPRKGERLTLSPS--GSLAAITD------SLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPV  374 (414)
Q Consensus       303 l~D~~R~~~~i~~sP~--~~laa~tD------~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~  374 (414)
                      +.-..|+|.++.-||=  ++||++|-      .-||+.++|+..+.=|+...-|-=+.+-|=..-.+.            
T Consensus         4 ~~tpgf~GysvqfSPf~~nrLavAt~q~yGl~G~G~L~ile~~~~~gi~e~~s~d~~D~LfdV~Wse~------------   71 (311)
T KOG0277|consen    4 HTTPGFHGYSVQFSPFVENRLAVATAQHYGLAGNGRLFILEVTDPKGIQECQSYDTEDGLFDVAWSEN------------   71 (311)
T ss_pred             eecCCcccceeEecccccchhheeehhhcccccCceEEEEecCCCCCeEEEEeeecccceeEeeecCC------------
Confidence            3345699999999996  57777753      468999999986665665544433322221111110            


Q ss_pred             CCcceEEEEEEcCCCCeEEEeeC
Q 015021          375 KSDYCLCLAIHAPRKGIIEHLAG  397 (414)
Q Consensus       375 k~~~~l~LvIyaprRg~lEvW~~  397 (414)
                          +=-.+|-|--+|.|.+|++
T Consensus        72 ----~e~~~~~a~GDGSLrl~d~   90 (311)
T KOG0277|consen   72 ----HENQVIAASGDGSLRLFDL   90 (311)
T ss_pred             ----CcceEEEEecCceEEEecc
Confidence                0025677888899999985


No 206
>PF10282 Lactonase:  Lactonase, 7-bladed beta-propeller;  InterPro: IPR019405  6-phosphogluconolactonases (6PGL) 3.1.1.31 from EC, which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate is opne of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE IPR022528 from INTERPRO [] and the Pseudomonas aeruginosa DevB IPR005900 from INTERPRO [] types.  This entry contains bacterial 6-phosphogluconolactonases (6PGL) YbhE-type 3.1.1.31 from EC which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase 5.5.1.5 from EC and muconate cycloisomerase 5.5.1.1 from EC, which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold [].; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H.
Probab=28.72  E-value=2.7e+02  Score=27.97  Aligned_cols=80  Identities=9%  Similarity=0.098  Sum_probs=48.2

Q ss_pred             CeeeeccCcceeeeeecc--eEEEEeecCCCCCceeecccCCCCCCCeEEEEEeEEeCCcEEEEEeccccEEEEEe---c
Q 015021           33 PNLLCALDMHTIALANRY--QTVIINWADPEGLVAKIRPELSPIASEYITAIEWLVFEEMRALAVGTSRGYFLVYD---L  107 (414)
Q Consensus        33 ~~~~~sp~~~~la~A~~~--~~v~~~w~~~~~~~v~~~g~l~~~~~e~ITs~~~lpl~dw~~I~VG~ssG~vrfyt---e  107 (414)
                      ..|.+||+|+.|-++++.  .+.+.+.+.+ .+.++..+.+.. +++.-..+..=| .+-.+++.+-.++.|.+|.   +
T Consensus       248 ~~i~ispdg~~lyvsnr~~~sI~vf~~d~~-~g~l~~~~~~~~-~G~~Pr~~~~s~-~g~~l~Va~~~s~~v~vf~~d~~  324 (345)
T PF10282_consen  248 AEIAISPDGRFLYVSNRGSNSISVFDLDPA-TGTLTLVQTVPT-GGKFPRHFAFSP-DGRYLYVANQDSNTVSVFDIDPD  324 (345)
T ss_dssp             EEEEE-TTSSEEEEEECTTTEEEEEEECTT-TTTEEEEEEEEE-SSSSEEEEEE-T-TSSEEEEEETTTTEEEEEEEETT
T ss_pred             eeEEEecCCCEEEEEeccCCEEEEEEEecC-CCceEEEEEEeC-CCCCccEEEEeC-CCCEEEEEecCCCeEEEEEEeCC
Confidence            458899999999988876  5666666443 233444444433 233333333222 2345666778899999996   4


Q ss_pred             CCcEEeec
Q 015021          108 KGDLVHRQ  115 (414)
Q Consensus       108 ~G~LL~sQ  115 (414)
                      +|.|-...
T Consensus       325 tG~l~~~~  332 (345)
T PF10282_consen  325 TGKLTPVG  332 (345)
T ss_dssp             TTEEEEEE
T ss_pred             CCcEEEec
Confidence            77776655


No 207
>TIGR02800 propeller_TolB tol-pal system beta propeller repeat protein TolB. The Tol-PAL system is required for bacterial outer membrane integrity. E. coli TolB is involved in the tonB-independent uptake of group A colicins (colicins A, E1, E2, E3 and K), and is necessary for the colicins to reach their respective targets after initial binding to the bacteria. It is also involved in uptake of filamentous DNA. Study of its structure suggest that the TolB protein might be involved in the recycling of peptidoglycan or in its covalent linking with lipoproteins. The Tol-Pal system is also implicated in pathogenesis of E. coli, Haemophilus ducreyi, Salmonella enterica and Vibrio cholerae, but the mechanism(s) is unclear.
Probab=28.43  E-value=1.2e+02  Score=30.62  Aligned_cols=34  Identities=24%  Similarity=0.262  Sum_probs=26.8

Q ss_pred             ceeeEEEECCCCCEEEEEcCCC---cEEEEEcCccee
Q 015021          308 RKGERLTLSPSGSLAAITDSLG---RILLLDTQALVV  341 (414)
Q Consensus       308 R~~~~i~~sP~~~laa~tD~lG---RV~L~D~~~~~v  341 (414)
                      ........+|+|++++.++..+   +|.++|+.++..
T Consensus       322 ~~~~~~~~spdg~~i~~~~~~~~~~~i~~~d~~~~~~  358 (417)
T TIGR02800       322 GYNASPSWSPDGDLIAFVHREGGGFNIAVMDLDGGGE  358 (417)
T ss_pred             CCccCeEECCCCCEEEEEEccCCceEEEEEeCCCCCe
Confidence            3445778999999888876665   999999988654


No 208
>KOG0322 consensus G-protein beta subunit-like protein GNB1L, contains WD repeats [General function prediction only]
Probab=28.32  E-value=45  Score=33.94  Aligned_cols=40  Identities=30%  Similarity=0.356  Sum_probs=35.2

Q ss_pred             CCCCccccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEc
Q 015021          297 ASPLTCLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDT  336 (414)
Q Consensus       297 a~pl~~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~  336 (414)
                      ..||..|.-++-.+.+++-+|+..+.|.+.-++||-|+++
T Consensus       283 l~pLAVLkyHsagvn~vAfspd~~lmAaaskD~rISLWkL  322 (323)
T KOG0322|consen  283 LNPLAVLKYHSAGVNAVAFSPDCELMAAASKDARISLWKL  322 (323)
T ss_pred             CCchhhhhhhhcceeEEEeCCCCchhhhccCCceEEeeec
Confidence            4567778888888999999999999999999999999986


No 209
>PF13360 PQQ_2:  PQQ-like domain; PDB: 3HXJ_B 1YIQ_A 1KV9_A 3Q54_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A ....
Probab=27.91  E-value=81  Score=28.79  Aligned_cols=28  Identities=36%  Similarity=0.338  Sum_probs=22.0

Q ss_pred             ECCCCCEEEEEcCCCcEEEEEcCcceeE
Q 015021          315 LSPSGSLAAITDSLGRILLLDTQALVVV  342 (414)
Q Consensus       315 ~sP~~~laa~tD~lGRV~L~D~~~~~vv  342 (414)
                      +.+.+..+.+++..|+|+.+|+.+|.++
T Consensus       208 ~~~~~~~l~~~~~~~~l~~~d~~tG~~~  235 (238)
T PF13360_consen  208 PSVDGGTLYVTSSDGRLYALDLKTGKVV  235 (238)
T ss_dssp             EECCCTEEEEEETTTEEEEEETTTTEEE
T ss_pred             ceeeCCEEEEEeCCCEEEEEECCCCCEE
Confidence            3456666666668999999999999875


No 210
>PF14783 BBS2_Mid:  Ciliary BBSome complex subunit 2, middle region
Probab=27.66  E-value=2e+02  Score=25.40  Aligned_cols=46  Identities=17%  Similarity=0.230  Sum_probs=34.3

Q ss_pred             EEEEEeEEeCCc--EEEEEeccccEEEEEecCCcEEeecccCccceeEEE
Q 015021           79 ITAIEWLVFEEM--RALAVGTSRGYFLVYDLKGDLVHRQLIHPGRILKLR  126 (414)
Q Consensus        79 ITs~~~lpl~dw--~~I~VG~ssG~vrfyte~G~LL~sQ~lh~~pV~~ik  126 (414)
                      |||++...+.+.  .=++||..+..+|+|...  .++.++=-.+.|..|.
T Consensus         2 V~al~~~d~d~dg~~eLlvGs~D~~IRvf~~~--e~~~Ei~e~~~v~~L~   49 (111)
T PF14783_consen    2 VTALCLFDFDGDGENELLVGSDDFEIRVFKGD--EIVAEITETDKVTSLC   49 (111)
T ss_pred             eeEEEEEecCCCCcceEEEecCCcEEEEEeCC--cEEEEEecccceEEEE
Confidence            788888888443  889999999999999853  4455544456666665


No 211
>PRK00178 tolB translocation protein TolB; Provisional
Probab=27.27  E-value=3.1e+02  Score=28.15  Aligned_cols=41  Identities=24%  Similarity=0.149  Sum_probs=29.4

Q ss_pred             eEEEECCCCCEEEEEcC-CC--cEEEEEcCcceeEeeecccccc
Q 015021          311 ERLTLSPSGSLAAITDS-LG--RILLLDTQALVVVRLWKGYRDA  351 (414)
Q Consensus       311 ~~i~~sP~~~laa~tD~-lG--RV~L~D~~~~~vvrmwKGyRdA  351 (414)
                      .....||+|++++.+.. .|  +|.++|+.++..-++-.+..|.
T Consensus       334 ~~~~~Spdg~~i~~~~~~~~~~~l~~~dl~tg~~~~lt~~~~~~  377 (430)
T PRK00178        334 ARPRLSADGKTLVMVHRQDGNFHVAAQDLQRGSVRILTDTSLDE  377 (430)
T ss_pred             cceEECCCCCEEEEEEccCCceEEEEEECCCCCEEEccCCCCCC
Confidence            45688999988887643 23  5899999998876665554443


No 212
>KOG2096 consensus WD40 repeat protein [General function prediction only]
Probab=26.82  E-value=1e+02  Score=32.25  Aligned_cols=42  Identities=26%  Similarity=0.356  Sum_probs=35.7

Q ss_pred             CCCC--ccccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCc
Q 015021          297 ASPL--TCLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQA  338 (414)
Q Consensus       297 a~pl--~~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~  338 (414)
                      ..||  ..|.-++..+..++.|-+|++.|+.-.+|-|.|+|+..
T Consensus        74 ~Hpl~~~~LKgH~~~vt~~~FsSdGK~lat~~~Dr~Ir~w~~~D  117 (420)
T KOG2096|consen   74 VHPLNVSVLKGHKKEVTDVAFSSDGKKLATISGDRSIRLWDVRD  117 (420)
T ss_pred             ccchhhhhhhccCCceeeeEEcCCCceeEEEeCCceEEEEecch
Confidence            3455  35889999999999999999999988899999998754


No 213
>PF12234 Rav1p_C:  RAVE protein 1 C terminal;  InterPro: IPR022033  This domain family is found in eukaryotes, and is typically between 621 and 644 amino acids in length. This family is the C-terminal region of the protein RAVE (regulator of the ATPase of vacuolar and endosomal membranes). Rav1p is involved in regulating the glucose dependent assembly and disassembly of vacuolar ATPase V1 and V0 subunits. 
Probab=26.71  E-value=6.9e+02  Score=28.33  Aligned_cols=93  Identities=16%  Similarity=0.209  Sum_probs=57.4

Q ss_pred             eeeeccCcceeeeeecceEEEEeecCCCCCceeecccCCCCCCCeEEEEEeEEeCCc-EEEEEeccccEEEEEec-CCcE
Q 015021           34 NLLCALDMHTIALANRYQTVIINWADPEGLVAKIRPELSPIASEYITAIEWLVFEEM-RALAVGTSRGYFLVYDL-KGDL  111 (414)
Q Consensus        34 ~~~~sp~~~~la~A~~~~~v~~~w~~~~~~~v~~~g~l~~~~~e~ITs~~~lpl~dw-~~I~VG~ssG~vrfyte-~G~L  111 (414)
                      .+..|..+.. |+-.+++--++=|+..+ +.++..-.++  +.+.|..+-|...+|- .+++|||. .+|.+|+. .-+.
T Consensus        34 li~gss~~k~-a~V~~~~~~LtIWD~~~-~~lE~~~~f~--~~~~I~dLDWtst~d~qsiLaVGf~-~~v~l~~Q~R~dy  108 (631)
T PF12234_consen   34 LISGSSIKKI-AVVDSSRSELTIWDTRS-GVLEYEESFS--EDDPIRDLDWTSTPDGQSILAVGFP-HHVLLYTQLRYDY  108 (631)
T ss_pred             eEeecccCcE-EEEECCCCEEEEEEcCC-cEEEEeeeec--CCCceeeceeeecCCCCEEEEEEcC-cEEEEEEccchhh
Confidence            3445554444 55565555555677664 2244444442  3688999999999775 88999996 47777775 2222


Q ss_pred             EeecccCccceeEEEEecccC
Q 015021          112 VHRQLIHPGRILKLRVRGSRR  132 (414)
Q Consensus       112 L~sQ~lh~~pV~~ik~r~~~~  132 (414)
                      +-... .-.||.+|.++...+
T Consensus       109 ~~~~p-~w~~i~~i~i~~~T~  128 (631)
T PF12234_consen  109 TNKGP-SWAPIRKIDISSHTP  128 (631)
T ss_pred             hcCCc-ccceeEEEEeecCCC
Confidence            21111 125899999987553


No 214
>PF01011 PQQ:  PQQ enzyme repeat family.;  InterPro: IPR002372 Pyrrolo-quinoline quinone (PQQ) is a redox coenzyme, which serves as a cofactor for a number of enzymes (quinoproteins) and particularly for some bacterial dehydrogenases [, ]. A number of bacterial quinoproteins belong to this family. Enzymes in this group have repeats of a beta propeller.; PDB: 1H4I_C 1H4J_E 1W6S_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A 1G72_A ....
Probab=26.64  E-value=94  Score=21.30  Aligned_cols=18  Identities=22%  Similarity=0.228  Sum_probs=16.2

Q ss_pred             EcCCCcEEEEEcCcceeE
Q 015021          325 TDSLGRILLLDTQALVVV  342 (414)
Q Consensus       325 tD~lGRV~L~D~~~~~vv  342 (414)
                      ....|+|.-+|..+|.++
T Consensus         6 ~~~~g~l~AlD~~TG~~~   23 (38)
T PF01011_consen    6 GTPDGYLYALDAKTGKVL   23 (38)
T ss_dssp             ETTTSEEEEEETTTTSEE
T ss_pred             eCCCCEEEEEECCCCCEE
Confidence            388999999999999986


No 215
>PF13360 PQQ_2:  PQQ-like domain; PDB: 3HXJ_B 1YIQ_A 1KV9_A 3Q54_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A ....
Probab=26.37  E-value=3.5e+02  Score=24.57  Aligned_cols=85  Identities=18%  Similarity=0.096  Sum_probs=51.0

Q ss_pred             EeecCCCCCceeecccCCCCCCCeEEEEEeEEeCCcEEEEEeccccEEEEEe-cCCcEEeecccCccceeEEEEecccCC
Q 015021           55 INWADPEGLVAKIRPELSPIASEYITAIEWLVFEEMRALAVGTSRGYFLVYD-LKGDLVHRQLIHPGRILKLRVRGSRRD  133 (414)
Q Consensus        55 ~~w~~~~~~~v~~~g~l~~~~~e~ITs~~~lpl~dw~~I~VG~ssG~vrfyt-e~G~LL~sQ~lh~~pV~~ik~r~~~~~  133 (414)
                      ..|+.. .++..|+-.+.+    .++...+.++++.-.+.++..+|.|..|+ .+|.++.+.-+ ++++...     +  
T Consensus         6 ~~~d~~-tG~~~W~~~~~~----~~~~~~~~~~~~~~~v~~~~~~~~l~~~d~~tG~~~W~~~~-~~~~~~~-----~--   72 (238)
T PF13360_consen    6 SALDPR-TGKELWSYDLGP----GIGGPVATAVPDGGRVYVASGDGNLYALDAKTGKVLWRFDL-PGPISGA-----P--   72 (238)
T ss_dssp             EEEETT-TTEEEEEEECSS----SCSSEEETEEEETTEEEEEETTSEEEEEETTTSEEEEEEEC-SSCGGSG-----E--
T ss_pred             EEEECC-CCCEEEEEECCC----CCCCccceEEEeCCEEEEEcCCCEEEEEECCCCCEEEEeec-cccccce-----e--
Confidence            344443 466888887733    14444444554444566668999999999 59999988877 4443322     0  


Q ss_pred             CCCCCCCCeEEEEeCCe-EEEEe
Q 015021          134 LTQDTAEEEVCVVMPGV-LARFD  155 (414)
Q Consensus       134 ~~~~~~~eel~Ily~~~-iv~Id  155 (414)
                         ......++|...+. +..+|
T Consensus        73 ---~~~~~~v~v~~~~~~l~~~d   92 (238)
T PF13360_consen   73 ---VVDGGRVYVGTSDGSLYALD   92 (238)
T ss_dssp             ---EEETTEEEEEETTSEEEEEE
T ss_pred             ---eecccccccccceeeeEecc
Confidence               11145566666543 44444


No 216
>KOG2114 consensus Vacuolar assembly/sorting protein PEP5/VPS11 [Intracellular trafficking, secretion, and vesicular transport]
Probab=26.27  E-value=1.1e+02  Score=35.57  Aligned_cols=46  Identities=22%  Similarity=0.419  Sum_probs=36.9

Q ss_pred             EEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEE
Q 015021          313 LTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEML  359 (414)
Q Consensus       313 i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~  359 (414)
                      =+.||++..+|+.++.|||..+|- +-..+|=||.|-.-++.++...
T Consensus        29 sc~~s~~~~vvigt~~G~V~~Ln~-s~~~~~~fqa~~~siv~~L~~~   74 (933)
T KOG2114|consen   29 SCCSSSTGSVVIGTADGRVVILNS-SFQLIRGFQAYEQSIVQFLYIL   74 (933)
T ss_pred             eEEcCCCceEEEeeccccEEEecc-cceeeehheecchhhhhHhhcc
Confidence            356888899999999999999874 4456699999988777776654


No 217
>KOG0302 consensus Ribosome Assembly protein [General function prediction only]
Probab=26.12  E-value=2.1e+02  Score=30.61  Aligned_cols=74  Identities=18%  Similarity=0.113  Sum_probs=52.8

Q ss_pred             CCceeeEEEECCCC-CEEEEEcCCCcEEEEEcCcc----eeEeeecccccceeeEEEEEecccccccccccCCCCCcceE
Q 015021          306 HPRKGERLTLSPSG-SLAAITDSLGRILLLDTQAL----VVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCL  380 (414)
Q Consensus       306 ~~R~~~~i~~sP~~-~laa~tD~lGRV~L~D~~~~----~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l  380 (414)
                      +.+..+.|+=||+. ..+|.+.-.|-|=++|+..+    ++.+  |.+-.    =+.|..    +         ..+.- 
T Consensus       256 H~~SVEDLqWSptE~~vfaScS~DgsIrIWDiRs~~~~~~~~~--kAh~s----DVNVIS----W---------nr~~~-  315 (440)
T KOG0302|consen  256 HTKSVEDLQWSPTEDGVFASCSCDGSIRIWDIRSGPKKAAVST--KAHNS----DVNVIS----W---------NRREP-  315 (440)
T ss_pred             cccchhhhccCCccCceEEeeecCceEEEEEecCCCccceeEe--eccCC----ceeeEE----c---------cCCcc-
Confidence            77889999999998 68888899999999999988    3333  44332    233321    1         01112 


Q ss_pred             EEEEEcCCCCeEEEeeCCCC
Q 015021          381 CLAIHAPRKGIIEHLAGMAD  400 (414)
Q Consensus       381 ~LvIyaprRg~lEvW~~~~g  400 (414)
                       |.-..--.|.+.||++|+.
T Consensus       316 -lLasG~DdGt~~iwDLR~~  334 (440)
T KOG0302|consen  316 -LLASGGDDGTLSIWDLRQF  334 (440)
T ss_pred             -eeeecCCCceEEEEEhhhc
Confidence             5567788999999999983


No 218
>PF01436 NHL:  NHL repeat;  InterPro: IPR001258 The NHL repeat, named after NCL-1, HT2A and Lin-41, is found largely in a large number of eukaryotic and prokaryotic proteins. For example, the repeat is found in a variety of enzymes of the copper type II, ascorbate-dependent monooxygenase family which catalyse the C terminus alpha-amidation of biological peptides []. In many it occurs in tandem arrays, for example in the ringfinger beta-box, coiled-coil (RBCC) eukaryotic growth regulators []. The 'Brain Tumor' protein (Brat) is one such growth regulator that contains a 6-bladed NHL-repeat beta-propeller [, ].  The NHL repeats are also found in serine/threonine protein kinase (STPK) in diverse range of pathogenic bacteria. These STPK are transmembrane receptors with a intracellular N-terminal kinase domain and extracellular C-terminal sensor domain. In the STPK, PknD, from Mycobacterium tuberculosis, the sensor domain forms a rigid, six-bladed b-propeller composed of NHL repeats with a flexible tether to the transmembrane domain.; GO: 0005515 protein binding; PDB: 3FVZ_A 3FW0_A 1RWL_A 1RWI_A 1Q7F_A.
Probab=26.05  E-value=1.2e+02  Score=19.70  Aligned_cols=20  Identities=30%  Similarity=0.278  Sum_probs=14.2

Q ss_pred             EEEcCCCcEEEEEcCcceeE
Q 015021          323 AITDSLGRILLLDTQALVVV  342 (414)
Q Consensus       323 a~tD~lGRV~L~D~~~~~vv  342 (414)
                      .+.|+.|+|++-|..+..|.
T Consensus         7 vav~~~g~i~VaD~~n~rV~   26 (28)
T PF01436_consen    7 VAVDSDGNIYVADSGNHRVQ   26 (28)
T ss_dssp             EEEETTSEEEEEECCCTEEE
T ss_pred             EEEeCCCCEEEEECCCCEEE
Confidence            34568888888887776654


No 219
>COG3391 Uncharacterized conserved protein [Function unknown]
Probab=25.95  E-value=1.4e+02  Score=30.83  Aligned_cols=45  Identities=22%  Similarity=0.309  Sum_probs=38.3

Q ss_pred             eEEEECCCCCEEEEEc-CCCcEEEEEcCcceeEe------eecccccceeeE
Q 015021          311 ERLTLSPSGSLAAITD-SLGRILLLDTQALVVVR------LWKGYRDASCVF  355 (414)
Q Consensus       311 ~~i~~sP~~~laa~tD-~lGRV~L~D~~~~~vvr------mwKGyRdAq~~w  355 (414)
                      ..+.++|+|+.+.++| ..++|.++|..+..+.|      +=.|.+.+...+
T Consensus       163 ~~~a~~p~g~~vyv~~~~~~~v~vi~~~~~~v~~~~~~~~~~~~~~P~~i~v  214 (381)
T COG3391         163 TGVAVDPDGNKVYVTNSDDNTVSVIDTSGNSVVRGSVGSLVGVGTGPAGIAV  214 (381)
T ss_pred             ceEEECCCCCeEEEEecCCCeEEEEeCCCcceeccccccccccCCCCceEEE
Confidence            7899999999777776 89999999999999997      666777766665


No 220
>PLN02919 haloacid dehalogenase-like hydrolase family protein
Probab=25.93  E-value=2.8e+02  Score=33.03  Aligned_cols=36  Identities=28%  Similarity=0.115  Sum_probs=27.5

Q ss_pred             eeEEEECCCCCEEEEEcCC-CcEEEEEcCcceeEeee
Q 015021          310 GERLTLSPSGSLAAITDSL-GRILLLDTQALVVVRLW  345 (414)
Q Consensus       310 ~~~i~~sP~~~laa~tD~l-GRV~L~D~~~~~vvrmw  345 (414)
                      -..|+++|+|+.+.++|.. +||..+|+.++.+.-+.
T Consensus       742 P~GIavspdG~~LYVADs~n~~Irv~D~~tg~~~~~~  778 (1057)
T PLN02919        742 PSGISLSPDLKELYIADSESSSIRALDLKTGGSRLLA  778 (1057)
T ss_pred             ccEEEEeCCCCEEEEEECCCCeEEEEECCCCcEEEEE
Confidence            3459999999866777765 89999999987654333


No 221
>KOG1064 consensus RAVE (regulator of V-ATPase assembly) complex subunit RAV1/DMX protein, WD repeat superfamily [General function prediction only]
Probab=25.78  E-value=61  Score=40.67  Aligned_cols=67  Identities=12%  Similarity=0.066  Sum_probs=54.6

Q ss_pred             ceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEcC
Q 015021          308 RKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAP  387 (414)
Q Consensus       308 R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyap  387 (414)
                      -..+.|+-.|..++....-..|-|.|+|+.+.+.++-|+.       |. ..                     -...-.+
T Consensus      2337 ~gaT~l~~~P~~qllisggr~G~v~l~D~rqrql~h~~~~-------~~-~~---------------------~~f~~~s 2387 (2439)
T KOG1064|consen 2337 GGATVLAYAPKHQLLISGGRKGEVCLFDIRQRQLRHTFQA-------LD-TR---------------------EYFVTGS 2387 (2439)
T ss_pred             CCceEEEEcCcceEEEecCCcCcEEEeehHHHHHHHHhhh-------hh-hh---------------------heeeccC
Confidence            3467889999999999999999999999999999999977       22 00                     1446678


Q ss_pred             CCCeEEEeeCCCCcce
Q 015021          388 RKGIIEHLAGMADENW  403 (414)
Q Consensus       388 rRg~lEvW~~~~g~~v  403 (414)
                      .||.+.||++-.-..+
T Consensus      2388 s~g~ikIw~~s~~~ll 2403 (2439)
T KOG1064|consen 2388 SEGNIKIWRLSEFGLL 2403 (2439)
T ss_pred             cccceEEEEccccchh
Confidence            8999999999776555


No 222
>PF11396 DUF2874:  Protein of unknown function (DUF2874);  InterPro: IPR021533  This bacterial family of proteins are probable periplasmic proteins with unknown function. There are between one and four copies of this domain per sequence. ; PDB: 3DUE_A 3U1W_B 3DB7_A 4DSD_A 3ELG_A.
Probab=24.87  E-value=2.9e+02  Score=20.40  Aligned_cols=38  Identities=16%  Similarity=0.170  Sum_probs=32.2

Q ss_pred             CCCeEEEEEeEEeCCcEEEEEecc---ccEEEEEecCCcEE
Q 015021           75 ASEYITAIEWLVFEEMRALAVGTS---RGYFLVYDLKGDLV  112 (414)
Q Consensus        75 ~~e~ITs~~~lpl~dw~~I~VG~s---sG~vrfyte~G~LL  112 (414)
                      ++..|+.+.-.-.++....-|-+.   .++-..|+.+|.+|
T Consensus        21 p~~~i~~v~~~~~~~~~~Y~v~l~~~~~~~~v~fd~~G~~l   61 (61)
T PF11396_consen   21 PGAKIKEVEKETDPGGKYYEVELKKGGNEYEVYFDANGNWL   61 (61)
T ss_dssp             TTSEEEEEEEEEETTEEEEEEEETETTTSEEEEEETTS-EE
T ss_pred             CCCeEEEEEEEEcCCCCEEEEEEEEeCCeEEEEEcCCCCCC
Confidence            467888888888888888889999   99999999999986


No 223
>PF02393 US22:  US22 like;  InterPro: IPR003360 Herpesviruses are large and complex DNA viruses, widely found in nature. Human cytomegalovirus (HCMV), an important human pathogen, defines the betaherpesvirus family. Mouse cytomegalovirus (MCMV) and rat cytomegalovirus serve as biological model systems for HCMV. HCMV, MCMV, and rat CMV display the largest genomes among the herpesviruses and are essentially co-linear over the central 180 kb of the 230-kb genomes. Betaherpesviruses, which include the CMVs as well as human herpesviruses 6 and 7, differ from alpha- and gammaherpesviruses by the presence of additional gene families such as the US22 gene family, which are mainly clustered at the ends of the genome. The US22 family was first described in HCMV. This gene family comprises 12 members in both HCMV and MCMV and 11 in rat CMV []. Members of the US22 gene family are characterised by stretches of hydrophobic and charged residues as well as up to four conserved sequence motifs which are specific for betaherpesviruses. Motif I differs between the HCMV US and UL family members []. Motifs I and II have consensus sequences, while motifs III and IV are less well defined but have stretches of non-polar residues [, ]. Members of this gene family are widely divergent in function and their involvement in viral replication []. This entry contains US22 family members from the Cytomegalovirus, Muromegalovirus and the Roseolovirus taxonomic groups.  The name sake of this family US22 is an early nuclear protein that is secreted from cells []. The US22 family may have a role in virus replication and pathogenesis [].
Probab=24.69  E-value=1.2e+02  Score=25.78  Aligned_cols=28  Identities=21%  Similarity=0.286  Sum_probs=23.9

Q ss_pred             CCCCEEEEEcCCCcEEEEEcCcceeEee
Q 015021          317 PSGSLAAITDSLGRILLLDTQALVVVRL  344 (414)
Q Consensus       317 P~~~laa~tD~lGRV~L~D~~~~~vvrm  344 (414)
                      +.-+..+..|..|||+.+|..+..+.|+
T Consensus        79 ~~~~~vvl~~~~G~Vy~yd~~~~~l~~l  106 (125)
T PF02393_consen   79 FRDRLVVLVGESGRVYAYDPEDDRLYRL  106 (125)
T ss_pred             ccceEEEEEeCCCeEEEEEcCCCEEEEE
Confidence            4557888899999999999999887765


No 224
>KOG2139 consensus WD40 repeat protein [General function prediction only]
Probab=24.38  E-value=1.4e+02  Score=31.65  Aligned_cols=72  Identities=18%  Similarity=0.117  Sum_probs=51.0

Q ss_pred             CeeeeccCcceeeeeecc-------eEEEEeecCCCCCc--eeecccCCCCCCCeEEEEEeEEe-CCcEEEEEeccccEE
Q 015021           33 PNLLCALDMHTIALANRY-------QTVIINWADPEGLV--AKIRPELSPIASEYITAIEWLVF-EEMRALAVGTSRGYF  102 (414)
Q Consensus        33 ~~~~~sp~~~~la~A~~~-------~~v~~~w~~~~~~~--v~~~g~l~~~~~e~ITs~~~lpl-~dw~~I~VG~ssG~v  102 (414)
                      -.++..|.|++||+-++.       |.+|++|+...-..  ++-||.+..   |.=..|..=|+ .+-.++-|+-++|++
T Consensus       349 q~lawDpsGeyLav~fKg~~~v~~~k~~i~~fdtr~sp~vels~cg~i~g---e~P~~IsF~pl~n~g~lLsiaWsTGri  425 (445)
T KOG2139|consen  349 QCLAWDPSGEYLAVIFKGQSFVLLCKLHISRFDTRKSPPVELSYCGMIGG---EYPAYISFGPLKNEGRLLSIAWSTGRI  425 (445)
T ss_pred             ceeeECCCCCEEEEEEcCCchhhhhhhhhhhhcccccCceEEEecccccC---CCCceEEeeecccCCcEEEEEeccCce
Confidence            357888999999987765       44557776664333  456788744   33444555555 677899999999999


Q ss_pred             EEEec
Q 015021          103 LVYDL  107 (414)
Q Consensus       103 rfyte  107 (414)
                      .-|..
T Consensus       426 q~ypl  430 (445)
T KOG2139|consen  426 QRYPL  430 (445)
T ss_pred             Eeeee
Confidence            99874


No 225
>KOG4460 consensus Nuclear pore complex, Nup88/rNup84 component [Nuclear structure; Intracellular trafficking, secretion, and vesicular transport]
Probab=24.38  E-value=3e+02  Score=30.86  Aligned_cols=40  Identities=13%  Similarity=0.170  Sum_probs=30.6

Q ss_pred             EECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceee
Q 015021          314 TLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCV  354 (414)
Q Consensus       314 ~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~  354 (414)
                      +++|=-.++...|.+|.-|+.=+..|++|- ||=--++|..
T Consensus       470 ~V~pi~Gf~~L~d~~G~~IV~vLsSGecI~-w~Ll~~~h~~  509 (741)
T KOG4460|consen  470 QVAPIRGFWILPDILGPTIVCILSSGECII-WPLLSTVHPA  509 (741)
T ss_pred             ccccccceeeccccCCceEEEEecCCcEEE-Eeeecccccc
Confidence            788888888899999976666677887775 8776666543


No 226
>TIGR03300 assembly_YfgL outer membrane assembly lipoprotein YfgL. Members of this protein family are YfgL, a lipoprotein component of a complex that acts protein insertion into the bacterial outer membrane. Other members of this complex are NlpB, YfiO, and YaeT. This protein contains multiple copies of a repeat that, in other contexts, are associated with binding of the coenzyme PQQ.
Probab=23.81  E-value=2.7e+02  Score=27.84  Aligned_cols=49  Identities=20%  Similarity=0.188  Sum_probs=30.6

Q ss_pred             CceeecccCCCCCCCeEEEEEeEEeCCcEEEEEeccccEEEEEe-cCCcEEeecccC
Q 015021           63 LVAKIRPELSPIASEYITAIEWLVFEEMRALAVGTSRGYFLVYD-LKGDLVHRQLIH  118 (414)
Q Consensus        63 ~~v~~~g~l~~~~~e~ITs~~~lpl~dw~~I~VG~ssG~vrfyt-e~G~LL~sQ~lh  118 (414)
                      +++.|+-.+..    .+.+   -|..+-.-+.||..+|.|..++ ++|+++.+..+.
T Consensus        85 G~~~W~~~~~~----~~~~---~p~v~~~~v~v~~~~g~l~ald~~tG~~~W~~~~~  134 (377)
T TIGR03300        85 GKRLWRVDLDE----RLSG---GVGADGGLVFVGTEKGEVIALDAEDGKELWRAKLS  134 (377)
T ss_pred             CcEeeeecCCC----Cccc---ceEEcCCEEEEEcCCCEEEEEECCCCcEeeeeccC
Confidence            45777666533    1221   1333344566888899999888 489988776543


No 227
>PF10411 DsbC_N:  Disulfide bond isomerase protein N-terminus;  InterPro: IPR018950  This is the N-terminal domain of the disulphide bond isomerase DsbC. The whole molecule is V-shaped, where each arm is a DsbC monomer of two domains linked by a hinge; and the N-termini of each monomer join to form the dimer interface at the base of the V, so are vital for dimerisation []. DsbC is required for disulphide bond formation and functions as a disulphide bond isomerase during oxidative protein-folding in bacterial periplasm. It also has chaperone activity []. ; PDB: 1EEJ_B 2IYJ_A 1TJD_A 1JZD_B 1JZO_A 1G0T_B 1T3B_A.
Probab=23.34  E-value=1.6e+02  Score=22.33  Aligned_cols=37  Identities=11%  Similarity=0.117  Sum_probs=26.5

Q ss_pred             CCeEEEEEeEEeCCcEEEEEeccccEEEEEecCCcEEee
Q 015021           76 SEYITAIEWLVFEEMRALAVGTSRGYFLVYDLKGDLVHR  114 (414)
Q Consensus        76 ~e~ITs~~~lpl~dw~~I~VG~ssG~vrfyte~G~LL~s  114 (414)
                      +=.|+++...|+++..=|.+  .+|-+.+.|++|.-|+.
T Consensus        11 ~~~v~~v~~spi~GlyeV~~--~~~~i~Y~~~dg~yli~   47 (57)
T PF10411_consen   11 GLKVESVSPSPIPGLYEVVL--KGGGILYVDEDGRYLIQ   47 (57)
T ss_dssp             T-TCEEEEE-SSTTEEEEEE---TTEEEEEETTSSEEEE
T ss_pred             CCceeEEEcCCCCCeEEEEE--CCCeEEEEcCCCCEEEE
Confidence            55799999999999755555  66778778899987764


No 228
>KOG0299 consensus U3 snoRNP-associated protein (contains WD40 repeats) [RNA processing and modification]
Probab=23.09  E-value=3.1e+02  Score=29.86  Aligned_cols=88  Identities=15%  Similarity=0.011  Sum_probs=60.4

Q ss_pred             ccccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeE-eeecc-------cccceeeEEEEEecccccccccccC
Q 015021          301 TCLKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVV-RLWKG-------YRDASCVFMEMLVNKDAATSSAYYA  372 (414)
Q Consensus       301 ~~l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vv-rmwKG-------yRdAq~~wi~~~~~~~~~~~~~~~~  372 (414)
                      ..+--+++...+++++|++++.-.++..|-|+=+|+.+|.-. .+|+-       |+--+..| .              .
T Consensus       136 ~~~~~H~~s~~~vals~d~~~~fsask~g~i~kw~v~tgk~~~~i~~~~ev~k~~~~~~k~~r-~--------------~  200 (479)
T KOG0299|consen  136 RVIGKHQLSVTSVALSPDDKRVFSASKDGTILKWDVLTGKKDRYIIERDEVLKSHGNPLKESR-K--------------G  200 (479)
T ss_pred             eeeccccCcceEEEeeccccceeecCCCcceeeeehhcCcccccccccchhhhhccCCCCccc-c--------------c
Confidence            346677888999999999999999999999999999999988 46653       22212222 0              0


Q ss_pred             CCCCcceEE--------EEEEcCCCCeEEEeeCCCCcceee
Q 015021          373 PVKSDYCLC--------LAIHAPRKGIIEHLAGMADENWTT  405 (414)
Q Consensus       373 ~~k~~~~l~--------LvIyaprRg~lEvW~~~~g~~v~~  405 (414)
                      +.+  .-|.        .+....++..|-||+.++++-|-+
T Consensus       201 h~k--eil~~avS~Dgkylatgg~d~~v~Iw~~~t~ehv~~  239 (479)
T KOG0299|consen  201 HVK--EILTLAVSSDGKYLATGGRDRHVQIWDCDTLEHVKV  239 (479)
T ss_pred             ccc--eeEEEEEcCCCcEEEecCCCceEEEecCcccchhhc
Confidence            000  0112        224567778888999999887743


No 229
>PF14408 Actino_peptide:  Ribosomally synthesized peptide in actinomycetes
Probab=22.65  E-value=97  Score=24.45  Aligned_cols=22  Identities=23%  Similarity=0.382  Sum_probs=19.1

Q ss_pred             EEEECCCCCEEEEEcCCCcEEE
Q 015021          312 RLTLSPSGSLAAITDSLGRILL  333 (414)
Q Consensus       312 ~i~~sP~~~laa~tD~lGRV~L  333 (414)
                      ++.++|.-++++..|+.|+++-
T Consensus         5 ~~~lDP~TQ~~v~~D~~G~~ve   26 (59)
T PF14408_consen    5 RVVLDPDTQTGVYVDRDGPVVE   26 (59)
T ss_pred             eeeECCCceeeEEEcCCCCcee
Confidence            5679999999999999998754


No 230
>KOG0973 consensus Histone transcription regulator HIRA, WD repeat superfamily [Cell cycle control, cell division, chromosome partitioning; Transcription]
Probab=22.55  E-value=1.1e+02  Score=36.02  Aligned_cols=48  Identities=15%  Similarity=0.317  Sum_probs=43.9

Q ss_pred             ccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeeccccc
Q 015021          303 LKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRD  350 (414)
Q Consensus       303 l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRd  350 (414)
                      |--++-.+..++=||++.|.|......+|+++|..+...+++.+||-.
T Consensus       125 l~~H~~DV~Dv~Wsp~~~~lvS~s~DnsViiwn~~tF~~~~vl~~H~s  172 (942)
T KOG0973|consen  125 LRGHDSDVLDVNWSPDDSLLVSVSLDNSVIIWNAKTFELLKVLRGHQS  172 (942)
T ss_pred             EecCCCccceeccCCCccEEEEecccceEEEEccccceeeeeeecccc
Confidence            555778889999999999999999999999999999999999999865


No 231
>KOG0321 consensus WD40 repeat-containing protein L2DTL [Function unknown]
Probab=22.27  E-value=2.3e+02  Score=32.15  Aligned_cols=98  Identities=12%  Similarity=-0.022  Sum_probs=56.0

Q ss_pred             ccCCCceeeEEEECCCCC-EEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEec-cccc--ccccc--cCCCC-
Q 015021          303 LKDHPRKGERLTLSPSGS-LAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVN-KDAA--TSSAY--YAPVK-  375 (414)
Q Consensus       303 l~D~~R~~~~i~~sP~~~-laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~-~~~~--~~~~~--~~~~k-  375 (414)
                      +..+.+..-++|..|++. .++..-.+|-|+|+|+..-.   |=+=+|+..|.-...... .+..  ....+  ..+.+ 
T Consensus       140 ~~GH~~SvkS~cf~~~n~~vF~tGgRDg~illWD~R~n~---~d~~e~~~~~~~~~~n~~ptpskp~~kr~~k~kA~s~t  216 (720)
T KOG0321|consen  140 NLGHTGSVKSECFMPTNPAVFCTGGRDGEILLWDCRCNG---VDALEEFDNRIYGRHNTAPTPSKPLKKRIRKWKAASNT  216 (720)
T ss_pred             ecccccccchhhhccCCCcceeeccCCCcEEEEEEeccc---hhhHHHHhhhhhccccCCCCCCchhhccccccccccCc
Confidence            667788889999999996 55556899999999987665   222344444433332221 0000  00000  00000 


Q ss_pred             --C--cceEE----EEEEcCC-CCeEEEeeCCCCcce
Q 015021          376 --S--DYCLC----LAIHAPR-KGIIEHLAGMADENW  403 (414)
Q Consensus       376 --~--~~~l~----LvIyapr-Rg~lEvW~~~~g~~v  403 (414)
                        +  ..-+|    +++-|.- +|.|+|||++.-.+.
T Consensus       217 i~ssvTvv~fkDe~tlaSaga~D~~iKVWDLRk~~~~  253 (720)
T KOG0321|consen  217 IFSSVTVVLFKDESTLASAGAADSTIKVWDLRKNYTA  253 (720)
T ss_pred             eeeeeEEEEEeccceeeeccCCCcceEEEeecccccc
Confidence              0  01111    3556665 999999999976554


No 232
>smart00564 PQQ beta-propeller repeat. Beta-propeller repeat occurring in enzymes with pyrrolo-quinoline quinone (PQQ) as cofactor, in Ire1p-like Ser/Thr kinases, and in prokaryotic dehydrogenases.
Probab=21.95  E-value=1.5e+02  Score=18.91  Aligned_cols=21  Identities=14%  Similarity=0.254  Sum_probs=16.7

Q ss_pred             EEEEcCCCcEEEEEcCcceeE
Q 015021          322 AAITDSLGRILLLDTQALVVV  342 (414)
Q Consensus       322 aa~tD~lGRV~L~D~~~~~vv  342 (414)
                      +.+++..|++.-+|..+|.++
T Consensus         9 v~~~~~~g~l~a~d~~~G~~~   29 (33)
T smart00564        9 VYVGSTDGTLYALDAKTGEIL   29 (33)
T ss_pred             EEEEcCCCEEEEEEcccCcEE
Confidence            444566799999999999875


No 233
>KOG1523 consensus Actin-related protein Arp2/3 complex, subunit ARPC1/p41-ARC [Cytoskeleton]
Probab=21.90  E-value=2.9e+02  Score=28.95  Aligned_cols=84  Identities=18%  Similarity=0.343  Sum_probs=46.2

Q ss_pred             CCCCcccCCCeeeeccCcceee-eeecceEEEEeecCCCCCceeecccCCCCCCCeEEEEEeEEeCCcEEEEEeccccEE
Q 015021           24 GKEGWLVNDPNLLCALDMHTIA-LANRYQTVIINWADPEGLVAKIRPELSPIASEYITAIEWLVFEEMRALAVGTSRGYF  102 (414)
Q Consensus        24 ~~~~WL~~~~~~~~sp~~~~la-~A~~~~~v~~~w~~~~~~~v~~~g~l~~~~~e~ITs~~~lpl~dw~~I~VG~ssG~v  102 (414)
                      ...+|..+   +..||+|..|| +++.+-.-+..=....+-..+....     ----.+++|+  .+-.+|+.|++.+=+
T Consensus       204 ~~ggwvh~---v~fs~sG~~lawv~Hds~v~~~da~~p~~~v~~~~~~-----~lP~ls~~~i--se~~vv~ag~~c~P~  273 (361)
T KOG1523|consen  204 SSGGWVHG---VLFSPSGNRLAWVGHDSTVSFVDAAGPSERVQSVATA-----QLPLLSVSWI--SENSVVAAGYDCGPV  273 (361)
T ss_pred             cCCCceee---eEeCCCCCEeeEecCCCceEEeecCCCchhccchhhc-----cCCceeeEee--cCCceeecCCCCCce
Confidence            45678877   77788888877 4555554443211110000111110     0112333333  577899999997776


Q ss_pred             EEEe-cCCcEEeeccc
Q 015021          103 LVYD-LKGDLVHRQLI  117 (414)
Q Consensus       103 rfyt-e~G~LL~sQ~l  117 (414)
                      .|-+ +.|-|.|.-.+
T Consensus       274 lf~~~~~~~l~~~~~l  289 (361)
T KOG1523|consen  274 LFVTDEEGGLSFARRL  289 (361)
T ss_pred             EEEeccccceeeehhc
Confidence            6654 77766665443


No 234
>KOG1912 consensus WD40 repeat protein [General function prediction only]
Probab=21.69  E-value=71  Score=36.91  Aligned_cols=40  Identities=28%  Similarity=0.330  Sum_probs=33.9

Q ss_pred             EEEEEeccccEEEEEecCCcEEeec-ccCccceeEEEEecc
Q 015021           91 RALAVGTSRGYFLVYDLKGDLVHRQ-LIHPGRILKLRVRGS  130 (414)
Q Consensus        91 ~~I~VG~ssG~vrfyte~G~LL~sQ-~lh~~pV~~ik~r~~  130 (414)
                      -.+||||++|.|-+|+..-.+++.. ..|.+-|..|+--+.
T Consensus       438 pLvAvGT~sGTV~vvdvst~~v~~~fsvht~~VkgleW~g~  478 (1062)
T KOG1912|consen  438 PLVAVGTNSGTVDVVDVSTNAVAASFSVHTSLVKGLEWLGN  478 (1062)
T ss_pred             eeEEeecCCceEEEEEecchhhhhhhcccccceeeeeeccc
Confidence            4689999999999999988888877 689999988875553


No 235
>KOG0270 consensus WD40 repeat-containing protein [Function unknown]
Probab=21.46  E-value=2.9e+02  Score=29.93  Aligned_cols=54  Identities=22%  Similarity=0.275  Sum_probs=42.1

Q ss_pred             EEEEEeEEeCCcEEEEEeccccEEEEEe--cCCcEEeecccCccceeEEEEecccCC
Q 015021           79 ITAIEWLVFEEMRALAVGTSRGYFLVYD--LKGDLVHRQLIHPGRILKLRVRGSRRD  133 (414)
Q Consensus        79 ITs~~~lpl~dw~~I~VG~ssG~vrfyt--e~G~LL~sQ~lh~~pV~~ik~r~~~~~  133 (414)
                      |--+.|=|.... .++||+.+|.|+-|+  ..|..++.-.-|+++|-.|-+-...++
T Consensus       332 VEkv~w~~~se~-~f~~~tddG~v~~~D~R~~~~~vwt~~AHd~~ISgl~~n~~~p~  387 (463)
T KOG0270|consen  332 VEKVAWDPHSEN-SFFVSTDDGTVYYFDIRNPGKPVWTLKAHDDEISGLSVNIQTPG  387 (463)
T ss_pred             eEEEEecCCCce-eEEEecCCceEEeeecCCCCCceeEEEeccCCcceEEecCCCCc
Confidence            444556665444 678999999999999  478899999999999999987665554


No 236
>PRK04043 tolB translocation protein TolB; Provisional
Probab=21.40  E-value=1.6e+02  Score=30.89  Aligned_cols=43  Identities=14%  Similarity=0.040  Sum_probs=30.9

Q ss_pred             EEECCCCCEEEEEcCC---------CcEEEEEcCcceeEeeecccccceeeE
Q 015021          313 LTLSPSGSLAAITDSL---------GRILLLDTQALVVVRLWKGYRDASCVF  355 (414)
Q Consensus       313 i~~sP~~~laa~tD~l---------GRV~L~D~~~~~vvrmwKGyRdAq~~w  355 (414)
                      ...||+|+++|.+-.-         .+|.++|+.++..-++-.+..+....|
T Consensus       324 ~~~SPDG~~Ia~~~~~~~~~~~~~~~~I~v~d~~~g~~~~LT~~~~~~~p~~  375 (419)
T PRK04043        324 SSVSTYKNYIVYSSRETNNEFGKNTFNLYLISTNSDYIRRLTANGVNQFPRF  375 (419)
T ss_pred             ceECCCCCEEEEEEcCCCcccCCCCcEEEEEECCCCCeEECCCCCCcCCeEE
Confidence            3789999866664221         489999999999888877755544333


No 237
>KOG4378 consensus Nuclear protein COP1 [Signal transduction mechanisms]
Probab=21.22  E-value=4.3e+02  Score=29.44  Aligned_cols=81  Identities=21%  Similarity=0.179  Sum_probs=48.4

Q ss_pred             cccCCCceeeEEEECCCC--CEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccc--cccCCCCCc
Q 015021          302 CLKDHPRKGERLTLSPSG--SLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSS--AYYAPVKSD  377 (414)
Q Consensus       302 ~l~D~~R~~~~i~~sP~~--~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~--~~~~~~k~~  377 (414)
                      .|.|++-.+.  ++.-++  .|+|.....|.|+|..+.++.=---+|=               +.+.+.  ..+.+.|. 
T Consensus       116 ~lkdh~stvt--~v~YN~~DeyiAsvs~gGdiiih~~~t~~~tt~f~~---------------~sgqsvRll~ys~skr-  177 (673)
T KOG4378|consen  116 FLKDHQSTVT--YVDYNNTDEYIASVSDGGDIIIHGTKTKQKTTTFTI---------------DSGQSVRLLRYSPSKR-  177 (673)
T ss_pred             hccCCcceeE--EEEecCCcceeEEeccCCcEEEEecccCccccceec---------------CCCCeEEEeecccccc-
Confidence            4778774444  445444  5777777789999999887643222210               111000  12333342 


Q ss_pred             ceEEEEEEcCCCCeEEEeeCCCCcce
Q 015021          378 YCLCLAIHAPRKGIIEHLAGMADENW  403 (414)
Q Consensus       378 ~~l~LvIyaprRg~lEvW~~~~g~~v  403 (414)
                         ||..-|.-+|+|-+||++--..+
T Consensus       178 ---~lL~~asd~G~VtlwDv~g~sp~  200 (673)
T KOG4378|consen  178 ---FLLSIASDKGAVTLWDVQGMSPI  200 (673)
T ss_pred             ---eeeEeeccCCeEEEEeccCCCcc
Confidence               57777889999999999864433


No 238
>PF11768 DUF3312:  Protein of unknown function (DUF3312);  InterPro: IPR024511 This is a eukaryotic family of uncharacterised proteins that contain WD40 repeats.
Probab=21.22  E-value=3e+02  Score=30.58  Aligned_cols=33  Identities=18%  Similarity=0.343  Sum_probs=27.9

Q ss_pred             CCcEEEEEeccccEEEEEecCCcEEeecccCcc
Q 015021           88 EEMRALAVGTSRGYFLVYDLKGDLVHRQLIHPG  120 (414)
Q Consensus        88 ~dw~~I~VG~ssG~vrfyte~G~LL~sQ~lh~~  120 (414)
                      ||..+++||-..|-+..|+-.=.-+-.|++.++
T Consensus       309 p~gai~~V~s~qGelQ~FD~ALspi~~qLlsEd  341 (545)
T PF11768_consen  309 PDGAIFVVGSEQGELQCFDMALSPIKMQLLSED  341 (545)
T ss_pred             CCCcEEEEEcCCceEEEEEeecCccceeecccc
Confidence            788899999999999999987777777777654


No 239
>KOG1272 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=20.91  E-value=1e+02  Score=33.61  Aligned_cols=36  Identities=14%  Similarity=0.153  Sum_probs=29.9

Q ss_pred             ccCCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCc
Q 015021          303 LKDHPRKGERLTLSPSGSLAAITDSLGRILLLDTQA  338 (414)
Q Consensus       303 l~D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~  338 (414)
                      +.-++--..+|+++|+|+|.|+|-.+-+|=+||+.+
T Consensus       289 iLcH~g~V~siAv~~~G~YMaTtG~Dr~~kIWDlR~  324 (545)
T KOG1272|consen  289 ILCHRGPVSSIAVDRGGRYMATTGLDRKVKIWDLRN  324 (545)
T ss_pred             HHhcCCCcceEEECCCCcEEeecccccceeEeeecc
Confidence            444556678999999999999999999999999743


No 240
>PF07893 DUF1668:  Protein of unknown function (DUF1668);  InterPro: IPR012871 The hypothetical proteins found in this family are expressed by Oryza sativa (Rice) and are of unknown function. 
Probab=20.88  E-value=1.7e+02  Score=29.90  Aligned_cols=38  Identities=29%  Similarity=0.324  Sum_probs=30.4

Q ss_pred             CCCceeeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEee
Q 015021          305 DHPRKGERLTLSPSGSLAAITDSLGRILLLDTQALVVVRL  344 (414)
Q Consensus       305 D~~R~~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrm  344 (414)
                      ...+....+.+  .|+.+.++|..||.+|+|+.+..+..+
T Consensus        64 ~~~~~~~F~al--~gskIv~~d~~~~t~vyDt~t~av~~~  101 (342)
T PF07893_consen   64 RGPWSMDFFAL--HGSKIVAVDQSGRTLVYDTDTRAVATG  101 (342)
T ss_pred             CCCceeEEEEe--cCCeEEEEcCCCCeEEEECCCCeEecc
Confidence            44455566666  788899999999999999999988843


No 241
>KOG1517 consensus Guanine nucleotide binding protein MIP1 [Cell cycle control, cell division, chromosome partitioning]
Probab=20.78  E-value=1e+02  Score=36.87  Aligned_cols=69  Identities=19%  Similarity=0.218  Sum_probs=47.0

Q ss_pred             eeeeccCcceeeeeecceEEEEeecCCCC--CceeecccCCCCCCCeEEEEEeEEe-CCcEEEEEeccccEEEEEec
Q 015021           34 NLLCALDMHTIALANRYQTVIINWADPEG--LVAKIRPELSPIASEYITAIEWLVF-EEMRALAVGTSRGYFLVYDL  107 (414)
Q Consensus        34 ~~~~sp~~~~la~A~~~~~v~~~w~~~~~--~~v~~~g~l~~~~~e~ITs~~~lpl-~dw~~I~VG~ssG~vrfyte  107 (414)
                      .+.+-+.+++||-+..+.+.|  |+.+++  ..+.-.+-.   -+++|.+..||-| |=+--+|+|+.+-+|.+|+.
T Consensus      1309 al~VH~hapiiAsGs~q~ikI--y~~~G~~l~~~k~n~~F---~~q~~gs~scL~FHP~~~llAaG~~Ds~V~iYs~ 1380 (1387)
T KOG1517|consen 1309 ALTVHEHAPIIASGSAQLIKI--YSLSGEQLNIIKYNPGF---MGQRIGSVSCLAFHPHRLLLAAGSADSTVSIYSC 1380 (1387)
T ss_pred             eeeeccCCCeeeecCcceEEE--EecChhhhcccccCccc---ccCcCCCcceeeecchhHhhhhccCCceEEEeec
Confidence            355667777777666666555  333331  112212211   2688999999999 77889999999999999994


No 242
>KOG0647 consensus mRNA export protein (contains WD40 repeats) [RNA processing and modification]
Probab=20.52  E-value=1.4e+02  Score=30.89  Aligned_cols=52  Identities=25%  Similarity=0.414  Sum_probs=40.9

Q ss_pred             CCCCeEEEEEeEEeCCcEEEEEeccccEEEEEecC--CcEEe-ecccCccceeEEE
Q 015021           74 IASEYITAIEWLVFEEMRALAVGTSRGYFLVYDLK--GDLVH-RQLIHPGRILKLR  126 (414)
Q Consensus        74 ~~~e~ITs~~~lpl~dw~~I~VG~ssG~vrfyte~--G~LL~-sQ~lh~~pV~~ik  126 (414)
                      .+++.|++++.=| -.+..++.|-=+|.||.|..+  |.+.= .|.=|+.||+-+.
T Consensus        25 pP~DsIS~l~FSP-~~~~~~~A~SWD~tVR~wevq~~g~~~~ka~~~~~~PvL~v~   79 (347)
T KOG0647|consen   25 PPEDSISALAFSP-QADNLLAAGSWDGTVRIWEVQNSGQLVPKAQQSHDGPVLDVC   79 (347)
T ss_pred             CcccchheeEecc-ccCceEEecccCCceEEEEEecCCcccchhhhccCCCeEEEE
Confidence            3578999999988 577799999999999999953  44442 4566788998876


No 243
>KOG0772 consensus Uncharacterized conserved protein, contains WD40 repeat [Function unknown]
Probab=20.33  E-value=1.2e+02  Score=33.64  Aligned_cols=77  Identities=14%  Similarity=-0.010  Sum_probs=46.4

Q ss_pred             eeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceee--EEEEEecccccccccccCCCCCcceEEEEEEcC
Q 015021          310 GERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCV--FMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAP  387 (414)
Q Consensus       310 ~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~--wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyap  387 (414)
                      .++.+-+|+|+|+|..=..|-|=.||. .+.-+|.===||||.-.  =|....=..+         ++     +| ..-.
T Consensus       320 ~tsC~~nrdg~~iAagc~DGSIQ~W~~-~~~~v~p~~~vk~AH~~g~~Itsi~FS~d---------g~-----~L-lSRg  383 (641)
T KOG0772|consen  320 VTSCAWNRDGKLIAAGCLDGSIQIWDK-GSRTVRPVMKVKDAHLPGQDITSISFSYD---------GN-----YL-LSRG  383 (641)
T ss_pred             ceeeecCCCcchhhhcccCCceeeeec-CCcccccceEeeeccCCCCceeEEEeccc---------cc-----hh-hhcc
Confidence            567788999999998888899999998 44444533335565433  1111100000         01     11 1223


Q ss_pred             CCCeEEEeeCCCCcc
Q 015021          388 RKGIIEHLAGMADEN  402 (414)
Q Consensus       388 rRg~lEvW~~~~g~~  402 (414)
                      -++.|+||++++.++
T Consensus       384 ~D~tLKvWDLrq~kk  398 (641)
T KOG0772|consen  384 FDDTLKVWDLRQFKK  398 (641)
T ss_pred             CCCceeeeecccccc
Confidence            368999999999765


No 244
>KOG0322 consensus G-protein beta subunit-like protein GNB1L, contains WD repeats [General function prediction only]
Probab=20.33  E-value=1.8e+02  Score=29.87  Aligned_cols=69  Identities=16%  Similarity=0.082  Sum_probs=51.5

Q ss_pred             eeEEEECCCCCEEEEEcCCCcEEEEEcCcceeEeeecccccceeeEEEEEecccccccccccCCCCCcceEEEEEEcCCC
Q 015021          310 GERLTLSPSGSLAAITDSLGRILLLDTQALVVVRLWKGYRDASCVFMEMLVNKDAATSSAYYAPVKSDYCLCLAIHAPRK  389 (414)
Q Consensus       310 ~~~i~~sP~~~laa~tD~lGRV~L~D~~~~~vvrmwKGyRdAq~~wi~~~~~~~~~~~~~~~~~~k~~~~l~LvIyaprR  389 (414)
                      ...+.+=||++.+|++-=+|||=++.-.++..+-+.|=|++ -|.=+-...  +.                =|.--|..+
T Consensus       254 v~gvrIRpD~KIlATAGWD~RiRVyswrtl~pLAVLkyHsa-gvn~vAfsp--d~----------------~lmAaaskD  314 (323)
T KOG0322|consen  254 VSGVRIRPDGKILATAGWDHRIRVYSWRTLNPLAVLKYHSA-GVNAVAFSP--DC----------------ELMAAASKD  314 (323)
T ss_pred             ccceEEccCCcEEeecccCCcEEEEEeccCCchhhhhhhhc-ceeEEEeCC--CC----------------chhhhccCC
Confidence            45588899999999999999999999999999999999995 332222111  00                144456778


Q ss_pred             CeEEEeeC
Q 015021          390 GIIEHLAG  397 (414)
Q Consensus       390 g~lEvW~~  397 (414)
                      +.|-+|.+
T Consensus       315 ~rISLWkL  322 (323)
T KOG0322|consen  315 ARISLWKL  322 (323)
T ss_pred             ceEEeeec
Confidence            88888875


Done!