Query         022074
Match_columns 303
No_of_seqs    125 out of 1375
Neff          9.6 
Searched_HMMs 46136
Date          Fri Mar 29 07:50:09 2013
Command       hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/022074.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/022074hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG0271 Notchless-like WD40 re 100.0 4.7E-36   1E-40  254.0  18.4  256   16-284   136-441 (480)
  2 KOG0272 U4/U6 small nuclear ri 100.0 1.5E-36 3.3E-41  259.6  15.1  229    4-280   224-458 (459)
  3 KOG0263 Transcription initiati 100.0 5.6E-36 1.2E-40  273.4  19.3  206   35-285   447-652 (707)
  4 KOG0315 G-protein beta subunit 100.0   2E-34 4.4E-39  231.8  22.7  264    3-284     6-290 (311)
  5 KOG0272 U4/U6 small nuclear ri 100.0 7.7E-35 1.7E-39  249.2  17.6  204   37-284   173-377 (459)
  6 KOG0271 Notchless-like WD40 re 100.0 5.5E-33 1.2E-37  235.4  21.2  228   10-280   172-479 (480)
  7 KOG0286 G-protein beta subunit 100.0 5.6E-32 1.2E-36  222.2  22.2  227   11-280   116-343 (343)
  8 KOG0279 G protein beta subunit 100.0 3.5E-31 7.6E-36  216.3  22.3  230    9-284    30-264 (315)
  9 KOG0282 mRNA splicing factor [ 100.0 1.2E-31 2.5E-36  233.2  17.7  257   11-280   231-503 (503)
 10 KOG0286 G-protein beta subunit 100.0 7.6E-30 1.7E-34  209.7  23.1  222   12-281    72-302 (343)
 11 KOG0279 G protein beta subunit 100.0 7.5E-30 1.6E-34  208.5  21.8  205   34-284    10-224 (315)
 12 KOG0281 Beta-TrCP (transducin  100.0 1.5E-31 3.1E-36  224.5  11.2  220   10-284   210-430 (499)
 13 KOG0263 Transcription initiati 100.0 2.7E-30 5.7E-35  236.3  19.9  200   40-284   379-609 (707)
 14 KOG0266 WD40 repeat-containing 100.0 1.6E-29 3.4E-34  233.5  24.0  261   11-282   175-454 (456)
 15 KOG0295 WD40 repeat-containing 100.0 3.6E-30 7.8E-35  217.1  17.2  249    4-284   115-366 (406)
 16 KOG0285 Pleiotropic regulator  100.0 9.9E-30 2.1E-34  214.1  19.1  203   36-284   148-350 (460)
 17 KOG0284 Polyadenylation factor 100.0 2.1E-30 4.6E-35  221.2  12.5  224    8-282   111-337 (464)
 18 KOG0273 Beta-transducin family 100.0 1.6E-28 3.5E-33  213.1  22.9  221   22-284   259-484 (524)
 19 KOG0265 U5 snRNP-specific prot 100.0 1.3E-28 2.8E-33  203.1  20.4  233   11-278    63-334 (338)
 20 KOG0284 Polyadenylation factor 100.0 1.1E-29 2.5E-34  216.8  13.1  201   37-283    94-295 (464)
 21 KOG0319 WD40-repeat-containing 100.0 2.1E-29 4.5E-34  228.8  15.2  230   12-287   382-624 (775)
 22 KOG0293 WD40 repeat-containing 100.0 2.1E-28 4.6E-33  209.4  19.4  264   11-283   240-514 (519)
 23 KOG0265 U5 snRNP-specific prot 100.0   7E-28 1.5E-32  198.8  21.7  237    5-287    13-251 (338)
 24 KOG0278 Serine/threonine kinas 100.0 3.3E-29 7.2E-34  202.0  10.4  237   36-282    56-297 (334)
 25 KOG0266 WD40 repeat-containing 100.0 3.7E-27   8E-32  217.7  24.6  203   38-284   158-366 (456)
 26 KOG0645 WD40 repeat protein [G 100.0 1.4E-26   3E-31  188.7  24.7  207   35-281    10-224 (312)
 27 PTZ00421 coronin; Provisional  100.0 1.7E-26 3.6E-31  213.5  27.8  207   35-284    71-292 (493)
 28 KOG0291 WD40-repeat-containing 100.0   4E-27 8.6E-32  214.4  23.0  208   35-283   346-613 (893)
 29 PTZ00420 coronin; Provisional  100.0 4.7E-26   1E-30  211.8  30.7  236    6-283    43-294 (568)
 30 KOG0319 WD40-repeat-containing 100.0 2.1E-27 4.6E-32  215.7  20.1  223   32-295   358-589 (775)
 31 PLN00181 protein SPA1-RELATED; 100.0 3.3E-26 7.1E-31  225.4  28.5  231   10-281   548-792 (793)
 32 KOG0294 WD40 repeat-containing 100.0 1.6E-26 3.5E-31  191.9  21.2  237   36-285    40-284 (362)
 33 KOG0315 G-protein beta subunit 100.0 2.2E-26 4.8E-31  185.6  20.9  224   52-284    11-247 (311)
 34 KOG0295 WD40 repeat-containing 100.0 2.1E-26 4.5E-31  194.4  21.4  200   36-280   190-404 (406)
 35 KOG0275 Conserved WD40 repeat- 100.0 1.4E-27 3.1E-32  199.1  13.7  262   12-280   230-507 (508)
 36 KOG0296 Angio-associated migra 100.0 6.3E-26 1.4E-30  191.6  23.5  259   12-282    81-398 (399)
 37 KOG0277 Peroxisomal targeting   99.9 8.4E-27 1.8E-31  188.5  16.7  202   40-284    61-267 (311)
 38 KOG0277 Peroxisomal targeting   99.9 7.6E-27 1.7E-31  188.7  15.1  204   36-281   101-308 (311)
 39 KOG0291 WD40-repeat-containing  99.9 6.6E-26 1.4E-30  206.6  22.1  201   40-284   308-510 (893)
 40 KOG0316 Conserved WD40 repeat-  99.9 2.8E-26 6.1E-31  183.7  17.3  238   36-282    14-257 (307)
 41 KOG0292 Vesicle coat complex C  99.9 2.6E-26 5.7E-31  212.2  19.6  229   37-282    49-280 (1202)
 42 KOG0318 WD40 repeat stress pro  99.9   5E-25 1.1E-29  193.8  26.2  250   36-286   187-521 (603)
 43 KOG0276 Vesicle coat complex C  99.9 2.2E-26 4.7E-31  205.9  18.0  201   40-282    56-257 (794)
 44 KOG0316 Conserved WD40 repeat-  99.9 9.6E-26 2.1E-30  180.6  19.4  250   19-280    41-297 (307)
 45 cd00200 WD40 WD40 domain, foun  99.9 1.3E-24 2.8E-29  186.0  27.7  256   14-280    28-289 (289)
 46 KOG0645 WD40 repeat protein [G  99.9 3.7E-25   8E-30  180.4  22.1  273    5-282    22-311 (312)
 47 KOG1407 WD40 repeat protein [F  99.9 9.7E-26 2.1E-30  183.0  18.3  234   33-277    14-256 (313)
 48 KOG0264 Nucleosome remodeling   99.9 1.1E-25 2.4E-30  194.8  19.5  236    7-282   137-404 (422)
 49 KOG0647 mRNA export protein (c  99.9 1.8E-25 3.9E-30  184.8  19.7  231   35-271    23-312 (347)
 50 KOG0313 Microtubule binding pr  99.9 1.5E-25 3.2E-30  190.2  19.5  248   34-284   100-378 (423)
 51 PLN00181 protein SPA1-RELATED;  99.9   1E-24 2.3E-29  214.8  28.6  226   12-282   500-738 (793)
 52 cd00200 WD40 WD40 domain, foun  99.9 2.4E-24 5.1E-29  184.4  27.3  239   36-283     6-250 (289)
 53 KOG0273 Beta-transducin family  99.9 1.9E-25 4.2E-30  194.1  19.6  236   36-284   175-442 (524)
 54 KOG1446 Histone H3 (Lys4) meth  99.9 3.2E-24 6.9E-29  178.5  25.2  236   38-284    13-264 (311)
 55 KOG0285 Pleiotropic regulator   99.9 5.4E-25 1.2E-29  185.6  20.8  255   11-284   167-441 (460)
 56 KOG0281 Beta-TrCP (transducin   99.9 2.7E-26 5.9E-31  192.9  12.6  193   40-284   198-390 (499)
 57 KOG0288 WD40 repeat protein Ti  99.9 2.8E-25 6.1E-30  190.2  15.9  263    5-280   178-459 (459)
 58 KOG0283 WD40 repeat-containing  99.9 1.1E-24 2.3E-29  201.1  19.9  237   37-285   265-579 (712)
 59 KOG0305 Anaphase promoting com  99.9 3.2E-24 6.9E-29  193.0  22.5  266    7-287   190-466 (484)
 60 KOG0299 U3 snoRNP-associated p  99.9 4.6E-24   1E-28  185.0  19.6  239   12-297   159-425 (479)
 61 KOG0318 WD40 repeat stress pro  99.9 3.8E-23 8.3E-28  182.0  25.1  231    7-284   118-352 (603)
 62 KOG0313 Microtubule binding pr  99.9 5.8E-24 1.3E-28  180.6  19.3  228   11-282   163-418 (423)
 63 KOG0282 mRNA splicing factor [  99.9   4E-25 8.7E-30  192.6  12.0  239   35-284   210-464 (503)
 64 KOG0310 Conserved WD40 repeat-  99.9 3.9E-24 8.5E-29  186.7  17.2  200   37-281    66-267 (487)
 65 KOG0296 Angio-associated migra  99.9 9.8E-23 2.1E-27  172.3  23.6  238   36-284    61-358 (399)
 66 KOG0640 mRNA cleavage stimulat  99.9   6E-24 1.3E-28  176.5  15.9  211   36-286   109-339 (430)
 67 PTZ00420 coronin; Provisional   99.9 1.2E-22 2.7E-27  189.1  26.6  186   57-283    50-249 (568)
 68 KOG1332 Vesicle coat complex C  99.9 5.9E-24 1.3E-28  171.2  15.3  224   34-282     6-241 (299)
 69 KOG0274 Cdc4 and related F-box  99.9   2E-23 4.4E-28  193.6  20.5  218   12-283   223-442 (537)
 70 KOG1446 Histone H3 (Lys4) meth  99.9 1.3E-22 2.8E-27  168.9  23.0  252   18-283    37-304 (311)
 71 PTZ00421 coronin; Provisional   99.9 2.1E-22 4.6E-27  186.3  26.9  204   41-284    22-247 (493)
 72 KOG0646 WD40 repeat protein [G  99.9 2.8E-23 6.1E-28  180.5  19.5  221   18-284    62-309 (476)
 73 KOG0643 Translation initiation  99.9 6.3E-23 1.4E-27  167.2  20.4  211   33-284     4-319 (327)
 74 KOG0300 WD40 repeat-containing  99.9   2E-23 4.4E-28  173.9  17.2  234    9-290   162-436 (481)
 75 KOG0292 Vesicle coat complex C  99.9 1.8E-23 3.9E-28  193.6  18.3  198   40-282    10-236 (1202)
 76 KOG0301 Phospholipase A2-activ  99.9 5.8E-23 1.2E-27  186.0  20.5  218   12-287    76-293 (745)
 77 KOG0276 Vesicle coat complex C  99.9 7.7E-23 1.7E-27  183.3  20.5  205   37-284    11-217 (794)
 78 KOG0973 Histone transcription   99.9   9E-23 1.9E-27  192.9  22.1  268   10-284    28-357 (942)
 79 KOG0641 WD40 repeat protein [G  99.9 4.1E-22 8.8E-27  159.1  22.2  211   37-285    87-306 (350)
 80 KOG0772 Uncharacterized conser  99.9 3.3E-23 7.2E-28  182.0  17.1  207   37-282   266-487 (641)
 81 KOG0289 mRNA splicing factor [  99.9 7.8E-23 1.7E-27  176.3  18.8  200   41-283   221-420 (506)
 82 KOG2445 Nuclear pore complex c  99.9 8.1E-22 1.8E-26  163.6  23.6  243   32-281     6-317 (361)
 83 KOG1407 WD40 repeat protein [F  99.9 3.1E-22 6.7E-27  162.7  20.1  221   10-273    35-293 (313)
 84 KOG0274 Cdc4 and related F-box  99.9 1.2E-22 2.7E-27  188.4  20.2  218   12-284   266-484 (537)
 85 KOG0268 Sof1-like rRNA process  99.9 2.8E-23   6E-28  175.6  13.8  233   36-282    63-345 (433)
 86 KOG1036 Mitotic spindle checkp  99.9   1E-21 2.3E-26  163.0  22.3  229   36-282    10-262 (323)
 87 KOG0310 Conserved WD40 repeat-  99.9 1.3E-21 2.8E-26  171.0  23.7  200   36-280   107-307 (487)
 88 KOG1036 Mitotic spindle checkp  99.9 1.3E-21 2.7E-26  162.5  21.6  226   35-272    50-294 (323)
 89 KOG1332 Vesicle coat complex C  99.9 2.4E-22 5.2E-27  162.0  16.7  236   12-282    28-286 (299)
 90 KOG0289 mRNA splicing factor [  99.9 1.7E-21 3.7E-26  168.1  22.6  226    8-281   276-505 (506)
 91 KOG0301 Phospholipase A2-activ  99.9 2.6E-22 5.6E-27  181.8  18.3  222    7-282    25-249 (745)
 92 KOG0308 Conserved WD40 repeat-  99.9 4.2E-23 9.2E-28  185.7  13.1  216   36-283    18-244 (735)
 93 KOG0264 Nucleosome remodeling   99.9 3.7E-22 8.1E-27  173.0  18.1  210   38-284   123-349 (422)
 94 KOG0300 WD40 repeat-containing  99.9   2E-22 4.4E-27  168.0  15.6  212   36-289   145-393 (481)
 95 KOG0267 Microtubule severing p  99.9 2.4E-23 5.2E-28  189.2   9.0  202   36-282    25-226 (825)
 96 KOG0275 Conserved WD40 repeat-  99.9 1.5E-22 3.2E-27  169.2  12.5  202   37-283   211-424 (508)
 97 KOG0306 WD40-repeat-containing  99.9 1.2E-21 2.7E-26  178.8  18.6  201   36-281   451-663 (888)
 98 KOG0772 Uncharacterized conser  99.9 1.2E-21 2.7E-26  172.2  17.5  219   34-287   162-399 (641)
 99 KOG0308 Conserved WD40 repeat-  99.9 1.2E-21 2.5E-26  176.5  17.0  232   13-284    43-287 (735)
100 KOG0639 Transducin-like enhanc  99.9 1.1E-21 2.5E-26  172.0  14.9  232   39-281   465-703 (705)
101 KOG0302 Ribosome Assembly prot  99.9 6.5E-21 1.4E-25  162.0  18.4  209   37-284   149-380 (440)
102 KOG0321 WD40 repeat-containing  99.9 5.3E-21 1.2E-25  171.9  18.5  246   34-283    95-392 (720)
103 KOG0267 Microtubule severing p  99.9 2.1E-22 4.5E-27  183.1   9.3  195   35-274    66-260 (825)
104 KOG0269 WD40 repeat-containing  99.9 1.7E-21 3.8E-26  178.4  15.2  222   17-279   110-337 (839)
105 KOG0293 WD40 repeat-containing  99.9 1.5E-20 3.3E-25  161.4  19.3  202   36-283   221-426 (519)
106 KOG2096 WD40 repeat protein [G  99.9 2.4E-20 5.3E-25  155.6  19.8  207   36-284    83-310 (420)
107 KOG0306 WD40-repeat-containing  99.9 4.7E-20   1E-24  168.6  22.0  214   33-282   367-580 (888)
108 KOG0269 WD40 repeat-containing  99.9 2.3E-21 5.1E-26  177.5  13.4  208   42-293    90-307 (839)
109 KOG0302 Ribosome Assembly prot  99.9 1.6E-20 3.5E-25  159.6  16.9  205   36-278   208-435 (440)
110 KOG0640 mRNA cleavage stimulat  99.9 1.8E-20 3.8E-25  155.9  16.7  207   36-282   169-426 (430)
111 KOG0288 WD40 repeat protein Ti  99.9 5.7E-21 1.2E-25  164.0  13.6  237   37-284   173-419 (459)
112 KOG1034 Transcriptional repres  99.9 6.9E-20 1.5E-24  153.4  18.7  242   37-282    87-383 (385)
113 KOG1408 WD40 repeat protein [F  99.9   2E-20 4.3E-25  170.2  16.4  240   37-283   457-714 (1080)
114 KOG0643 Translation initiation  99.8 1.1E-19 2.3E-24  148.5  17.6  198   76-283     5-221 (327)
115 KOG0270 WD40 repeat-containing  99.8 4.3E-20 9.4E-25  159.9  16.0  213   36-290   240-457 (463)
116 KOG0642 Cell-cycle nuclear pro  99.8 5.8E-20 1.3E-24  163.3  15.8  247   34-283   289-562 (577)
117 KOG0283 WD40 repeat-containing  99.8 6.2E-20 1.4E-24  169.8  16.7  199   36-241   366-577 (712)
118 KOG1274 WD40 repeat protein [G  99.8 6.2E-19 1.3E-23  164.4  22.7  211   36-283    10-263 (933)
119 KOG0973 Histone transcription   99.8 5.9E-19 1.3E-23  167.3  22.3  247   33-284     7-314 (942)
120 KOG0270 WD40 repeat-containing  99.8 2.3E-19   5E-24  155.5  17.4  200   43-285   177-407 (463)
121 KOG4283 Transcription-coupled   99.8 4.4E-19 9.5E-24  146.8  18.1  220   36-299    40-294 (397)
122 KOG4283 Transcription-coupled   99.8 6.5E-19 1.4E-23  145.7  18.6  240    4-281   108-364 (397)
123 KOG0278 Serine/threonine kinas  99.8 7.2E-20 1.6E-24  148.2  10.7  203   32-282     7-213 (334)
124 KOG2106 Uncharacterized conser  99.8 7.4E-18 1.6E-22  148.3  23.9  231   40-280   247-519 (626)
125 KOG0641 WD40 repeat protein [G  99.8 5.4E-18 1.2E-22  135.6  20.9  203   36-282   133-349 (350)
126 KOG2919 Guanine nucleotide-bin  99.8 7.3E-19 1.6E-23  147.1  14.8  217   16-273   132-361 (406)
127 KOG0299 U3 snoRNP-associated p  99.8 2.1E-18 4.5E-23  150.2  17.7  205   33-284   136-358 (479)
128 KOG0303 Actin-binding protein   99.8 1.9E-17   4E-22  141.9  18.6  204   36-284    78-296 (472)
129 KOG0305 Anaphase promoting com  99.8 1.7E-17 3.7E-22  149.7  19.0  195   42-284   180-378 (484)
130 KOG4328 WD40 protein [Function  99.8 1.3E-17 2.9E-22  145.1  16.3  204   37-282   184-399 (498)
131 TIGR03866 PQQ_ABC_repeats PQQ-  99.8 6.9E-16 1.5E-20  134.5  26.5  257   13-284     7-281 (300)
132 KOG2096 WD40 repeat protein [G  99.8 2.8E-17   6E-22  137.5  16.5  211   36-280   184-400 (420)
133 KOG0322 G-protein beta subunit  99.8   2E-17 4.2E-22  135.1  13.8  243   36-281    11-322 (323)
134 KOG0650 WD40 repeat nucleolar   99.8 7.7E-17 1.7E-21  144.5  18.6  241   35-280   396-678 (733)
135 KOG1273 WD40 repeat protein [G  99.8 1.5E-16 3.4E-21  132.9  19.1  199   42-282    26-226 (405)
136 KOG0646 WD40 repeat protein [G  99.8 3.7E-17   8E-22  142.6  15.9  220   17-265   103-332 (476)
137 KOG1063 RNA polymerase II elon  99.7   2E-16 4.4E-21  143.9  20.9  273    8-284   280-650 (764)
138 KOG4328 WD40 protein [Function  99.7 7.7E-17 1.7E-21  140.4  16.8  216   36-282   231-495 (498)
139 KOG0321 WD40 repeat-containing  99.7 8.7E-17 1.9E-21  145.1  17.6  229   43-284    53-303 (720)
140 KOG1063 RNA polymerase II elon  99.7 9.4E-17   2E-21  146.0  17.5  214   36-282   522-763 (764)
141 KOG0647 mRNA export protein (c  99.7 1.2E-16 2.5E-21  132.8  15.8   79   80-160    26-104 (347)
142 KOG1539 WD repeat protein [Gen  99.7 2.2E-16 4.9E-21  146.2  19.1  198   40-284   449-650 (910)
143 KOG0644 Uncharacterized conser  99.7 9.7E-18 2.1E-22  155.1  10.0  229   36-282   187-426 (1113)
144 KOG0290 Conserved WD40 repeat-  99.7 2.5E-16 5.5E-21  130.3  15.6  212   37-286    94-322 (364)
145 KOG1274 WD40 repeat protein [G  99.7 7.1E-16 1.5E-20  144.3  20.2  202   36-280    93-298 (933)
146 KOG1408 WD40 repeat protein [F  99.7 2.7E-15 5.9E-20  137.1  23.3  239   42-284   327-673 (1080)
147 KOG0268 Sof1-like rRNA process  99.7 3.1E-17 6.6E-22  139.2  10.0  163   41-247   189-352 (433)
148 KOG0771 Prolactin regulatory e  99.7 3.5E-16 7.6E-21  135.2  16.6  200   43-283   148-355 (398)
149 KOG0307 Vesicle coat complex C  99.7 6.9E-17 1.5E-21  154.2  13.2  235   40-284    65-329 (1049)
150 KOG0294 WD40 repeat-containing  99.7 3.4E-15 7.3E-20  124.9  21.4  236   10-260    56-305 (362)
151 KOG1009 Chromatin assembly com  99.7 1.5E-15 3.3E-20  130.6  19.0  265   12-282    31-372 (434)
152 KOG2048 WD40 repeat protein [G  99.7 2.8E-15 6.2E-20  136.2  21.1  230   18-284    48-277 (691)
153 KOG1273 WD40 repeat protein [G  99.7 3.4E-15 7.3E-20  124.9  18.8  241   32-282    58-322 (405)
154 KOG1539 WD repeat protein [Gen  99.7 1.2E-15 2.5E-20  141.5  16.5  198   41-284   397-608 (910)
155 KOG2106 Uncharacterized conser  99.7 4.3E-14 9.3E-19  124.9  25.3  258   16-284   179-479 (626)
156 KOG0639 Transducin-like enhanc  99.7 9.9E-16 2.1E-20  135.0  15.0  238   37-283   417-664 (705)
157 KOG1007 WD repeat protein TSSC  99.7 5.1E-15 1.1E-19  122.5  18.3  223   17-282    93-361 (370)
158 KOG2048 WD40 repeat protein [G  99.7 1.6E-14 3.4E-19  131.5  23.0  197   41-282    27-233 (691)
159 TIGR03866 PQQ_ABC_repeats PQQ-  99.7 2.3E-14   5E-19  124.9  23.6  186   51-284     1-189 (300)
160 COG2319 FOG: WD40 repeat [Gene  99.7   7E-14 1.5E-18  125.5  26.3  225   15-284    85-316 (466)
161 KOG2055 WD40 repeat protein [G  99.7 2.1E-14 4.7E-19  125.4  21.0  202   38-283   212-418 (514)
162 PRK01742 tolB translocation pr  99.6 5.5E-14 1.2E-18  129.5  22.5  222    7-281   174-400 (429)
163 KOG4378 Nuclear protein COP1 [  99.6 6.8E-15 1.5E-19  129.6  14.7  199   42-285    82-283 (673)
164 KOG1587 Cytoplasmic dynein int  99.6 4.2E-14 9.2E-19  131.1  20.7  247   37-284   178-474 (555)
165 KOG1538 Uncharacterized conser  99.6 7.1E-14 1.5E-18  127.2  20.5  239   41-283    14-294 (1081)
166 KOG1310 WD40 repeat protein [G  99.6 1.2E-14 2.6E-19  129.5  14.8  134   26-160    38-180 (758)
167 KOG1188 WD40 repeat protein [G  99.6 3.2E-14 6.9E-19  120.2  16.4  194   52-284    41-244 (376)
168 COG2319 FOG: WD40 repeat [Gene  99.6   1E-12 2.2E-17  117.9  27.5  230   10-285   127-362 (466)
169 KOG1445 Tumor-specific antigen  99.6 7.2E-15 1.6E-19  132.9  12.0  201   40-282   628-844 (1012)
170 KOG0303 Actin-binding protein   99.6 1.6E-14 3.4E-19  124.1  13.5  166   76-282    76-249 (472)
171 KOG1188 WD40 repeat protein [G  99.6 2.3E-14 5.1E-19  121.0  14.0  244   12-284    89-348 (376)
172 KOG4227 WD40 repeat protein [G  99.6 2.2E-13 4.7E-18  117.2  19.9  263   11-281    74-386 (609)
173 KOG4378 Nuclear protein COP1 [  99.6 2.1E-13 4.5E-18  120.4  19.0  185   36-264   118-305 (673)
174 KOG2445 Nuclear pore complex c  99.6   3E-13 6.5E-18  113.0  18.2  200   78-285    10-259 (361)
175 KOG1523 Actin-related protein   99.6 2.2E-13 4.8E-18  114.3  17.5  207   41-285    12-239 (361)
176 KOG1445 Tumor-specific antigen  99.6 1.6E-14 3.5E-19  130.6  10.9  156   47-283   587-751 (1012)
177 KOG2055 WD40 repeat protein [G  99.6 4.9E-13 1.1E-17  117.1  18.6  199   39-282   257-512 (514)
178 PRK11028 6-phosphogluconolacto  99.6 3.3E-12 7.1E-17  113.7  24.7  211   41-283    81-305 (330)
179 KOG2919 Guanine nucleotide-bin  99.6   6E-13 1.3E-17  111.9  18.3  217   42-295    52-294 (406)
180 KOG0650 WD40 repeat nucleolar   99.6 8.9E-14 1.9E-18  125.1  14.2  198   38-280   520-733 (733)
181 KOG1517 Guanine nucleotide bin  99.5 8.9E-13 1.9E-17  125.5  19.8  210   37-284  1062-1289(1387)
182 KOG1334 WD40 repeat protein [G  99.5 4.7E-14   1E-18  124.2  10.5  244   36-283   139-467 (559)
183 KOG0649 WD40 repeat protein [G  99.5 2.8E-12 6.1E-17  104.1  19.5  204   36-284    59-276 (325)
184 KOG0642 Cell-cycle nuclear pro  99.5   4E-13 8.7E-18  120.2  15.6  100   12-112   311-426 (577)
185 KOG1034 Transcriptional repres  99.5 1.4E-12   3E-17  109.9  17.4  161   37-283    36-212 (385)
186 KOG0649 WD40 repeat protein [G  99.5 1.8E-12 3.9E-17  105.3  17.2  196   42-284    13-237 (325)
187 PF08662 eIF2A:  Eukaryotic tra  99.5 4.7E-12   1E-16  103.9  20.0   67  219-287   109-184 (194)
188 PRK11028 6-phosphogluconolacto  99.5 1.6E-11 3.5E-16  109.3  24.3  233   16-284    11-260 (330)
189 KOG2394 WD40 protein DMR-N9 [G  99.5 8.4E-13 1.8E-17  117.6  15.3  173   49-261   183-383 (636)
190 KOG0644 Uncharacterized conser  99.5 1.2E-13 2.6E-18  128.5   9.6  259    6-282   203-468 (1113)
191 KOG0290 Conserved WD40 repeat-  99.5 5.2E-13 1.1E-17  110.8  12.0  125   36-160   147-320 (364)
192 KOG0307 Vesicle coat complex C  99.5 5.1E-13 1.1E-17  128.1  13.0  236   43-283    10-285 (1049)
193 KOG1517 Guanine nucleotide bin  99.5 1.9E-12   4E-17  123.4  16.3  203   43-283  1169-1382(1387)
194 KOG1007 WD repeat protein TSSC  99.4 4.3E-12 9.3E-17  105.4  14.1  188   36-239   167-360 (370)
195 KOG4227 WD40 repeat protein [G  99.4 1.3E-11 2.8E-16  106.4  17.5  237   34-285    51-325 (609)
196 KOG1009 Chromatin assembly com  99.4 2.3E-12 5.1E-17  111.2  12.4  177   82-282    14-195 (434)
197 PRK03629 tolB translocation pr  99.4 2.5E-10 5.3E-15  105.2  25.8  220   14-284   176-408 (429)
198 PRK05137 tolB translocation pr  99.4 1.1E-10 2.5E-15  107.8  23.4  216   16-283   181-413 (435)
199 PRK02889 tolB translocation pr  99.4   8E-11 1.7E-15  108.5  22.3  194   36-274   192-392 (427)
200 KOG1272 WD40-repeat-containing  99.4 8.8E-13 1.9E-17  115.6   8.7  197   39-284   129-325 (545)
201 KOG2394 WD40 protein DMR-N9 [G  99.4 2.1E-11 4.6E-16  108.8  16.4  219   42-283   126-363 (636)
202 PRK04922 tolB translocation pr  99.4 1.3E-10 2.8E-15  107.3  22.1  218   16-283   183-412 (433)
203 KOG3881 Uncharacterized conser  99.4 8.3E-11 1.8E-15  101.4  18.9  207   36-283   102-321 (412)
204 KOG1587 Cytoplasmic dynein int  99.4 6.6E-11 1.4E-15  110.1  19.5  239    7-284   255-518 (555)
205 KOG1524 WD40 repeat-containing  99.4   1E-11 2.3E-16  110.8  13.1  182   36-278   101-282 (737)
206 KOG2111 Uncharacterized conser  99.4 5.9E-10 1.3E-14   93.9  22.6  220   41-284     7-258 (346)
207 KOG1524 WD40 repeat-containing  99.3   2E-11 4.3E-16  109.0  13.8  219   36-277    11-250 (737)
208 KOG1963 WD40 repeat protein [G  99.3 8.6E-10 1.9E-14  103.7  24.7  144    7-159   166-323 (792)
209 KOG2110 Uncharacterized conser  99.3 1.6E-09 3.6E-14   92.9  23.5  195   41-283    48-249 (391)
210 KOG0974 WD-repeat protein WDR6  99.3 1.1E-10 2.4E-15  111.3  17.8  196   41-283    89-289 (967)
211 PRK01742 tolB translocation pr  99.3 8.5E-11 1.8E-15  108.4  16.5  192   18-263   229-426 (429)
212 KOG2139 WD40 repeat protein [G  99.3   3E-10 6.5E-15   97.2  17.5  192   42-277   101-306 (445)
213 KOG2110 Uncharacterized conser  99.3 3.6E-09 7.8E-14   90.8  23.2  202   37-284    85-332 (391)
214 KOG1963 WD40 repeat protein [G  99.3 1.4E-10   3E-15  109.0  15.5  246   36-292    13-292 (792)
215 KOG1538 Uncharacterized conser  99.3 1.8E-10 3.9E-15  105.4  15.7  189   83-285    14-214 (1081)
216 KOG1310 WD40 repeat protein [G  99.3 1.5E-10 3.3E-15  103.7  14.0  216   11-245    66-308 (758)
217 KOG1523 Actin-related protein   99.2 1.1E-09 2.3E-14   92.4  17.5  201   36-262    52-259 (361)
218 KOG2321 WD40 repeat protein [G  99.2 4.6E-10 9.9E-15  101.2  15.9  200   45-284   139-345 (703)
219 KOG1240 Protein kinase contain  99.2 1.9E-09 4.2E-14  104.6  20.9  209   37-284  1046-1275(1431)
220 TIGR02800 propeller_TolB tol-p  99.2 4.3E-09 9.3E-14   96.7  21.8  190   37-275   187-387 (417)
221 KOG0974 WD-repeat protein WDR6  99.2 1.2E-10 2.6E-15  111.1  11.5  131   19-159   157-289 (967)
222 KOG0280 Uncharacterized conser  99.2   1E-09 2.3E-14   91.5  15.5  117   42-160   124-243 (339)
223 KOG0322 G-protein beta subunit  99.2   7E-11 1.5E-15   97.1   7.8  117   36-157   202-322 (323)
224 KOG1354 Serine/threonine prote  99.2 8.1E-10 1.8E-14   94.0  14.2  208   42-283    28-302 (433)
225 KOG2321 WD40 repeat protein [G  99.2 1.7E-09 3.8E-14   97.5  17.0  210   36-283    48-303 (703)
226 KOG4547 WD40 repeat-containing  99.1 6.7E-09 1.5E-13   94.0  19.6  189   49-283     3-221 (541)
227 KOG1240 Protein kinase contain  99.1 1.7E-09 3.6E-14  105.1  16.3  183   70-282  1037-1225(1431)
228 PF08662 eIF2A:  Eukaryotic tra  99.1 2.1E-09 4.5E-14   88.2  14.5  112   37-158    57-179 (194)
229 PRK00178 tolB translocation pr  99.1 2.8E-08 6.1E-13   91.8  23.2  200   36-284   195-408 (430)
230 PRK05137 tolB translocation pr  99.1 3.9E-08 8.5E-13   90.9  23.5  174   61-283   182-367 (435)
231 PRK03629 tolB translocation pr  99.1   8E-09 1.7E-13   95.3  18.3  173   42-262   245-427 (429)
232 KOG0771 Prolactin regulatory e  99.1 1.6E-09 3.5E-14   94.3  12.7  154   85-283   148-312 (398)
233 KOG2111 Uncharacterized conser  99.1 1.2E-07 2.6E-12   80.2  23.1  230    9-282    63-322 (346)
234 KOG3881 Uncharacterized conser  99.1 7.6E-09 1.6E-13   89.5  15.1  182   40-264   149-343 (412)
235 KOG2695 WD40 repeat protein [G  99.1 9.9E-10 2.2E-14   93.6   9.4  178   45-262   217-402 (425)
236 KOG1272 WD40-repeat-containing  99.0 6.7E-10 1.5E-14   97.8   8.0  116   41-161   211-326 (545)
237 PRK04792 tolB translocation pr  99.0   1E-07 2.2E-12   88.3  22.4  196   40-284   218-427 (448)
238 KOG1409 Uncharacterized conser  99.0 2.6E-08 5.5E-13   85.0  15.5  248   36-291    21-279 (404)
239 PRK01029 tolB translocation pr  99.0 2.6E-07 5.6E-12   85.1  23.3  205   40-284   185-405 (428)
240 KOG1064 RAVE (regulator of V-A  99.0 1.1E-09 2.4E-14  109.7   7.9  186    6-249  2217-2407(2439)
241 PF00400 WD40:  WD domain, G-be  99.0 2.1E-09 4.6E-14   63.9   5.7   39  242-280     1-39  (39)
242 PRK02889 tolB translocation pr  98.9 1.5E-07 3.1E-12   86.9  20.3  177   61-284   176-362 (427)
243 KOG1354 Serine/threonine prote  98.9 5.1E-08 1.1E-12   83.3  14.1  215   36-279   161-431 (433)
244 PF02239 Cytochrom_D1:  Cytochr  98.9 9.8E-07 2.1E-11   79.5  23.4  256   16-284    57-349 (369)
245 KOG1064 RAVE (regulator of V-A  98.9 7.7E-09 1.7E-13  103.9  10.5  186   41-283  2210-2399(2439)
246 KOG4497 Uncharacterized conser  98.9 4.2E-07 9.1E-12   77.4  18.7  243   42-285    51-394 (447)
247 PRK04922 tolB translocation pr  98.9 1.9E-07 4.1E-12   86.3  18.4  172   41-262   249-432 (433)
248 KOG0309 Conserved WD40 repeat-  98.8 1.2E-08 2.6E-13   94.8   8.8  215   36-282   111-339 (1081)
249 KOG2139 WD40 repeat protein [G  98.8 1.5E-07 3.2E-12   81.0  14.6  166   83-281   100-267 (445)
250 KOG1409 Uncharacterized conser  98.8 4.4E-08 9.5E-13   83.7  11.2   82   74-159   190-271 (404)
251 COG5170 CDC55 Serine/threonine  98.8 4.6E-08 9.9E-13   82.6  11.0  210   40-283    27-310 (460)
252 KOG4714 Nucleoporin [Nuclear s  98.8 4.8E-08   1E-12   80.5  10.4   63  221-283   191-255 (319)
253 TIGR02800 propeller_TolB tol-p  98.8 1.1E-06 2.5E-11   80.6  20.8  173   62-283   171-355 (417)
254 KOG1275 PAB-dependent poly(A)   98.8 3.8E-07 8.1E-12   86.8  16.4  182   50-280   146-340 (1118)
255 KOG4497 Uncharacterized conser  98.8 1.4E-07   3E-12   80.2  12.2  207   44-278    13-236 (447)
256 KOG4532 WD40-like repeat conta  98.7 1.1E-06 2.4E-11   73.0  16.2  190   50-282    83-282 (344)
257 PLN02919 haloacid dehalogenase  98.7 5.6E-06 1.2E-10   84.1  24.9  222   42-284   626-890 (1057)
258 PF10282 Lactonase:  Lactonase,  98.7 3.3E-05 7.1E-10   69.3  26.6  240   43-283    40-323 (345)
259 PRK04043 tolB translocation pr  98.6 1.4E-05 3.1E-10   73.3  21.9  192   41-283   189-401 (419)
260 KOG0280 Uncharacterized conser  98.6 7.2E-06 1.6E-10   69.0  17.7  187   60-284    45-243 (339)
261 TIGR02658 TTQ_MADH_Hv methylam  98.6 0.00016 3.5E-09   64.4  27.2  256    5-284    15-332 (352)
262 KOG3914 WD repeat protein WDR4  98.6 1.4E-06   3E-11   76.0  13.7  165   39-248    62-231 (390)
263 PRK00178 tolB translocation pr  98.6 2.1E-05 4.6E-10   72.7  22.7  174   62-284   180-365 (430)
264 PF15492 Nbas_N:  Neuroblastoma  98.6 2.8E-05 6.1E-10   65.4  20.5  192   43-284    47-261 (282)
265 PRK04792 tolB translocation pr  98.6 5.7E-06 1.2E-10   76.8  18.2  172   42-262   264-446 (448)
266 KOG4190 Uncharacterized conser  98.6 1.4E-07 3.1E-12   85.2   7.1  203   35-281   731-947 (1034)
267 PF00400 WD40:  WD domain, G-be  98.6 1.7E-07 3.7E-12   55.5   5.2   32   36-67      8-39  (39)
268 PRK01029 tolB translocation pr  98.6 8.4E-06 1.8E-10   75.2  18.8  180   41-264   232-426 (428)
269 PF02239 Cytochrom_D1:  Cytochr  98.5 6.8E-05 1.5E-09   67.7  22.8  191   55-284    10-204 (369)
270 COG2706 3-carboxymuconate cycl  98.5 0.00063 1.4E-08   59.1  26.7  252   36-290    36-329 (346)
271 KOG2695 WD40 repeat protein [G  98.5 3.8E-07 8.3E-12   78.1   7.0  124   36-160   243-378 (425)
272 KOG0309 Conserved WD40 repeat-  98.5 8.2E-07 1.8E-11   82.9   9.6  203   40-284    25-234 (1081)
273 KOG1334 WD40 repeat protein [G  98.5 3.2E-06 6.9E-11   75.5  12.7  209   74-283   135-425 (559)
274 KOG2315 Predicted translation   98.4 9.6E-05 2.1E-09   67.3  20.0  196   38-285   164-393 (566)
275 KOG2041 WD40 repeat protein [G  98.4 1.5E-05 3.3E-10   74.4  15.2  223   40-273    15-279 (1189)
276 COG4946 Uncharacterized protei  98.4 0.00013 2.7E-09   65.5  19.8  187   48-283   275-478 (668)
277 KOG4714 Nucleoporin [Nuclear s  98.3 3.2E-06 6.9E-11   70.0   9.0   93   63-158   161-254 (319)
278 KOG4190 Uncharacterized conser  98.3 1.9E-06 4.2E-11   78.1   8.1  169   76-283   730-907 (1034)
279 KOG3914 WD repeat protein WDR4  98.3 1.7E-06 3.6E-11   75.5   7.2   93   16-113   131-224 (390)
280 PLN02919 haloacid dehalogenase  98.3 0.00028 6.1E-09   72.0  23.9  213   39-284   568-835 (1057)
281 COG4946 Uncharacterized protei  98.2 0.00043 9.2E-09   62.2  20.3  118   36-159   356-478 (668)
282 KOG4547 WD40 repeat-containing  98.2 2.8E-05 6.1E-10   71.0  11.8  117   36-159    99-221 (541)
283 KOG1645 RING-finger-containing  98.1 8.1E-05 1.8E-09   65.3  13.8   77   36-113   190-267 (463)
284 PF04762 IKI3:  IKI3 family;  I  98.1  0.0014 2.9E-08   66.0  24.0  198   38-281    74-332 (928)
285 PF10282 Lactonase:  Lactonase,  98.1   0.002 4.3E-08   57.8  22.7  184   40-260   144-345 (345)
286 COG5170 CDC55 Serine/threonine  98.1   4E-05 8.6E-10   65.2  10.2  124   36-161   169-312 (460)
287 TIGR02658 TTQ_MADH_Hv methylam  98.1  0.0039 8.4E-08   55.6  23.2   96   61-162    27-140 (352)
288 KOG1832 HIV-1 Vpr-binding prot  98.0 5.3E-06 1.1E-10   79.1   4.8  116   37-159  1099-1215(1516)
289 KOG1912 WD40 repeat protein [G  98.0 0.00045 9.9E-09   65.5  16.9  237   40-282    16-304 (1062)
290 KOG1008 Uncharacterized conser  98.0 5.4E-06 1.2E-10   76.5   3.6  203   40-284    57-277 (783)
291 smart00320 WD40 WD40 repeats.   98.0   3E-05 6.5E-10   44.1   5.7   39  242-280     2-40  (40)
292 KOG4532 WD40-like repeat conta  97.9  0.0022 4.8E-08   53.8  17.5  103   53-160   130-235 (344)
293 KOG2315 Predicted translation   97.9  0.0003 6.6E-09   64.2  13.3  111   39-159   270-391 (566)
294 KOG2314 Translation initiation  97.9  0.0011 2.3E-08   60.8  16.5   68  216-284   498-575 (698)
295 COG5354 Uncharacterized protei  97.9  0.0046 9.9E-08   56.2  20.2  197   40-287   174-400 (561)
296 PF08450 SGL:  SMP-30/Gluconola  97.9   0.012 2.6E-07   49.9  25.2  186   42-270    42-244 (246)
297 PF11768 DUF3312:  Protein of u  97.9 7.7E-05 1.7E-09   68.6   9.0   65  218-284   267-331 (545)
298 KOG1920 IkappaB kinase complex  97.9  0.0022 4.7E-08   63.8  19.2  199   40-282    69-322 (1265)
299 KOG1275 PAB-dependent poly(A)   97.8 0.00019 4.1E-09   69.1  11.5  147   49-239   185-341 (1118)
300 PF11768 DUF3312:  Protein of u  97.8 0.00074 1.6E-08   62.3  14.7   90   63-159   238-330 (545)
301 KOG3617 WD40 and TPR repeat-co  97.8 0.00049 1.1E-08   65.9  13.7  108   45-159    21-132 (1416)
302 KOG0882 Cyclophilin-related pe  97.8 7.6E-05 1.7E-09   66.4   7.0  199   79-284     7-233 (558)
303 PF04762 IKI3:  IKI3 family;  I  97.8   0.021 4.6E-07   57.6  25.1  113   38-158   208-333 (928)
304 TIGR03300 assembly_YfgL outer   97.8   0.011 2.4E-07   53.6  21.4  217   50-283    64-298 (377)
305 TIGR03300 assembly_YfgL outer   97.7  0.0053 1.1E-07   55.7  19.2  175   52-279   191-376 (377)
306 COG2706 3-carboxymuconate cycl  97.7  0.0072 1.6E-07   52.7  18.5  194   60-284    15-223 (346)
307 KOG2066 Vacuolar assembly/sort  97.7  0.0038 8.3E-08   59.6  17.7  116   36-159    55-188 (846)
308 PF13360 PQQ_2:  PQQ-like domai  97.7  0.0059 1.3E-07   51.3  17.5  196   49-284    34-232 (238)
309 KOG2041 WD40 repeat protein [G  97.7   0.001 2.2E-08   62.7  13.4  240   33-280    65-335 (1189)
310 KOG2114 Vacuolar assembly/sort  97.7  0.0088 1.9E-07   57.7  19.3  196   46-283    30-244 (933)
311 KOG1832 HIV-1 Vpr-binding prot  97.6 5.6E-05 1.2E-09   72.4   3.9  158   74-284  1094-1257(1516)
312 COG5354 Uncharacterized protei  97.6  0.0096 2.1E-07   54.2  17.3  209   41-284    73-308 (561)
313 PF08450 SGL:  SMP-30/Gluconola  97.5   0.041 8.9E-07   46.6  23.3  192   44-284     4-215 (246)
314 KOG2444 WD40 repeat protein [G  97.5 0.00033 7.1E-09   57.4   6.3   64  222-285   114-180 (238)
315 smart00320 WD40 WD40 repeats.   97.4 0.00028 6.1E-09   39.8   4.1   32   36-67      9-40  (40)
316 KOG2066 Vacuolar assembly/sort  97.4  0.0055 1.2E-07   58.6  14.4  182   41-282    41-233 (846)
317 PRK04043 tolB translocation pr  97.3    0.13 2.7E-06   47.5  23.2  172   62-284   170-359 (419)
318 KOG0882 Cyclophilin-related pe  97.2  0.0025 5.3E-08   57.1   8.9  216   36-284    50-307 (558)
319 PF13360 PQQ_2:  PQQ-like domai  97.1    0.07 1.5E-06   44.7  17.2  147   60-252     2-152 (238)
320 KOG2314 Translation initiation  97.0   0.073 1.6E-06   49.2  16.7  110   42-160   213-336 (698)
321 KOG2444 WD40 repeat protein [G  96.9  0.0029 6.4E-08   51.9   6.5  106   51-160    70-179 (238)
322 PF06977 SdiA-regulated:  SdiA-  96.9    0.24 5.1E-06   42.2  21.0  210   36-277    18-245 (248)
323 PF06433 Me-amine-dh_H:  Methyl  96.7     0.3 6.4E-06   43.2  17.5   51  233-284   270-322 (342)
324 PRK02888 nitrous-oxide reducta  96.7   0.084 1.8E-06   50.3  15.0  109   43-161   238-354 (635)
325 PF14783 BBS2_Mid:  Ciliary BBS  96.7    0.16 3.4E-06   37.2  14.2  101   42-152     2-108 (111)
326 KOG1008 Uncharacterized conser  96.5 0.00072 1.6E-08   62.9   0.4  143    7-159    72-227 (783)
327 KOG1920 IkappaB kinase complex  96.5    0.22 4.7E-06   50.3  16.7  113   43-158   199-322 (1265)
328 PF08553 VID27:  VID27 cytoplas  96.4   0.065 1.4E-06   52.7  12.9  130   58-194   501-638 (794)
329 PF08553 VID27:  VID27 cytoplas  96.4   0.069 1.5E-06   52.5  13.1   59  221-281   587-646 (794)
330 PF12894 Apc4_WD40:  Anaphase-p  96.4   0.019 4.1E-07   35.2   5.9   30   40-69     12-41  (47)
331 PRK11138 outer membrane biogen  96.3    0.83 1.8E-05   41.7  21.1   58  222-281   335-393 (394)
332 COG0823 TolB Periplasmic compo  96.3    0.14 3.1E-06   47.1  14.1  184   40-270   193-387 (425)
333 PF14783 BBS2_Mid:  Ciliary BBS  96.3    0.28   6E-06   36.0  12.4   52  223-277    54-109 (111)
334 KOG1912 WD40 repeat protein [G  96.2   0.066 1.4E-06   51.5  11.4  237   45-284   236-508 (1062)
335 KOG3621 WD40 repeat-containing  96.2   0.062 1.3E-06   51.0  10.9  117   41-159    35-155 (726)
336 KOG1645 RING-finger-containing  96.2   0.017 3.6E-07   51.2   6.7   93   63-160   175-268 (463)
337 KOG4640 Anaphase-promoting com  96.1   0.022 4.8E-07   53.3   7.5   66  218-284    28-94  (665)
338 PF08596 Lgl_C:  Lethal giant l  96.0    0.83 1.8E-05   41.7  17.4  227   40-284     2-292 (395)
339 KOG4649 PQQ (pyrrolo-quinoline  95.9    0.89 1.9E-05   38.5  16.3   62   50-112    62-123 (354)
340 PF12894 Apc4_WD40:  Anaphase-p  95.9   0.033 7.1E-07   34.1   5.3   31  252-282    11-41  (47)
341 PF07433 DUF1513:  Protein of u  95.9     1.1 2.3E-05   39.2  23.6  100   44-147     9-117 (305)
342 KOG2395 Protein involved in va  95.8    0.23 4.9E-06   46.0  12.6   61  221-283   440-501 (644)
343 KOG4640 Anaphase-promoting com  95.7   0.079 1.7E-06   49.7   9.3   71   40-112    21-92  (665)
344 KOG3621 WD40 repeat-containing  95.7   0.068 1.5E-06   50.7   9.0   66  218-283    84-155 (726)
345 PRK02888 nitrous-oxide reducta  95.7    0.96 2.1E-05   43.4  16.5   66  217-283   327-405 (635)
346 PF04053 Coatomer_WDAD:  Coatom  95.6     1.1 2.3E-05   41.7  16.6   56  222-281   117-172 (443)
347 KOG2114 Vacuolar assembly/sort  95.6     1.7 3.8E-05   42.6  17.8  122   36-158    61-201 (933)
348 KOG2395 Protein involved in va  95.1    0.31 6.7E-06   45.1  10.7  151   36-194   329-491 (644)
349 PF15492 Nbas_N:  Neuroblastoma  94.9    0.71 1.5E-05   39.4  11.8   35  127-161   228-262 (282)
350 PF04841 Vps16_N:  Vps16, N-ter  94.9     3.1 6.8E-05   38.3  24.7   49  217-265   223-272 (410)
351 PF03178 CPSF_A:  CPSF A subuni  94.8     2.7 5.8E-05   37.2  18.5  178   62-283     3-203 (321)
352 KOG3617 WD40 and TPR repeat-co  94.8   0.034 7.5E-07   53.9   4.1   65  218-282    67-131 (1416)
353 KOG2079 Vacuolar assembly/sort  94.4    0.17 3.7E-06   50.5   7.9   69   40-111   131-202 (1206)
354 PRK11138 outer membrane biogen  94.1     4.5 9.8E-05   36.9  21.1  103   50-162    68-182 (394)
355 KOG4499 Ca2+-binding protein R  93.9     3.4 7.4E-05   34.6  13.8   51  221-271   222-274 (310)
356 PF07433 DUF1513:  Protein of u  93.9     0.2 4.2E-06   43.7   6.5   55  217-271    57-117 (305)
357 PHA02713 hypothetical protein;  93.6     4.7  0.0001   38.8  16.0   60  221-282   463-533 (557)
358 PRK13616 lipoprotein LpqB; Pro  93.4     1.5 3.2E-05   42.4  12.2   60  215-278   401-472 (591)
359 PF12234 Rav1p_C:  RAVE protein  93.4     2.1 4.5E-05   41.4  12.9  114   42-157    32-155 (631)
360 PF00930 DPPIV_N:  Dipeptidyl p  93.1     6.3 0.00014   35.4  15.5  107   48-158     1-131 (353)
361 PF02897 Peptidase_S9_N:  Proly  93.1       3 6.6E-05   38.2  13.6  117   39-159   123-261 (414)
362 KOG2079 Vacuolar assembly/sort  92.8    0.54 1.2E-05   47.1   8.4  102   50-158    98-203 (1206)
363 PF14727 PHTB1_N:  PTHB1 N-term  92.6     8.2 0.00018   35.5  21.9   57  224-282   302-360 (418)
364 PF03178 CPSF_A:  CPSF A subuni  92.6     6.9 0.00015   34.5  21.3  193   42-279    29-262 (321)
365 PF08596 Lgl_C:  Lethal giant l  92.6     8.2 0.00018   35.3  18.0   72   39-111    86-172 (395)
366 PF10313 DUF2415:  Uncharacteri  92.1    0.58 1.3E-05   27.9   4.8   34  253-286     1-37  (43)
367 KOG4649 PQQ (pyrrolo-quinoline  91.8     7.4 0.00016   33.2  13.5   32  218-249   101-132 (354)
368 KOG4441 Proteins containing BT  91.8     9.1  0.0002   36.9  15.3   96   50-152   332-440 (571)
369 PF10313 DUF2415:  Uncharacteri  91.3    0.72 1.6E-05   27.5   4.6   31   82-112     1-33  (43)
370 COG0823 TolB Periplasmic compo  91.1     1.6 3.5E-05   40.3   9.1  103   42-149   240-346 (425)
371 PF00930 DPPIV_N:  Dipeptidyl p  90.5    0.77 1.7E-05   41.3   6.4   52  231-284    22-73  (353)
372 PF00780 CNH:  CNH domain;  Int  90.3     5.6 0.00012   34.1  11.5  107   49-159     5-123 (275)
373 PF14870 PSII_BNR:  Photosynthe  90.3      12 0.00026   32.9  13.5  127   20-149   125-253 (302)
374 PF07569 Hira:  TUP1-like enhan  90.3     2.1 4.6E-05   35.7   8.4   61  221-282    21-95  (219)
375 KOG4441 Proteins containing BT  90.2      18  0.0004   34.9  15.9   99   48-152   282-393 (571)
376 TIGR02276 beta_rpt_yvtn 40-res  89.9     2.5 5.4E-05   24.4   6.4   40  221-261     2-42  (42)
377 COG3391 Uncharacterized conser  89.7      16 0.00034   33.3  20.3  182   41-263   117-309 (381)
378 PF04841 Vps16_N:  Vps16, N-ter  88.8      19 0.00042   33.1  17.5   30  252-281   216-245 (410)
379 COG3391 Uncharacterized conser  88.7      18  0.0004   32.8  22.7  198   42-283    76-284 (381)
380 PF00780 CNH:  CNH domain;  Int  88.1      16 0.00034   31.2  18.1  115   40-159    36-166 (275)
381 COG3490 Uncharacterized protei  87.7     1.8 3.9E-05   37.3   6.1   61  217-279   120-186 (366)
382 PF12234 Rav1p_C:  RAVE protein  87.6      19 0.00041   35.0  13.6   61  219-281    83-155 (631)
383 PF06977 SdiA-regulated:  SdiA-  87.1      14 0.00031   31.4  11.4  105   37-146   115-239 (248)
384 KOG1897 Damage-specific DNA bi  86.7      39 0.00084   34.3  19.8  113   40-160   775-900 (1096)
385 PF14583 Pectate_lyase22:  Olig  85.8      27 0.00059   31.7  16.5   63  219-282   291-381 (386)
386 PF14727 PHTB1_N:  PTHB1 N-term  85.0      32 0.00069   31.8  17.9   61  222-283   145-205 (418)
387 PHA02713 hypothetical protein;  84.5      17 0.00038   34.9  11.9   23  221-243   512-536 (557)
388 PF14655 RAB3GAP2_N:  Rab3 GTPa  84.5      12 0.00026   34.5  10.2   39   41-79    309-347 (415)
389 PF02897 Peptidase_S9_N:  Proly  84.2     5.9 0.00013   36.3   8.4   67  218-286   131-214 (414)
390 TIGR02276 beta_rpt_yvtn 40-res  83.6     6.3 0.00014   22.6   6.3   30   49-78      1-31  (42)
391 PRK13616 lipoprotein LpqB; Pro  83.4      14 0.00031   35.7  10.8  110   42-157   399-524 (591)
392 KOG2280 Vacuolar assembly/sort  83.4      49  0.0011   32.6  15.0   32  218-249   224-255 (829)
393 KOG2377 Uncharacterized conser  83.2     8.7 0.00019   35.4   8.5  113   39-157    66-184 (657)
394 PF07569 Hira:  TUP1-like enhan  83.1      10 0.00022   31.6   8.6   65   45-111    16-94  (219)
395 PF08728 CRT10:  CRT10;  InterP  82.9      29 0.00062   34.3  12.5   68  214-281   167-245 (717)
396 KOG4499 Ca2+-binding protein R  82.6      10 0.00023   31.8   8.1   54   45-98    217-270 (310)
397 KOG1900 Nuclear pore complex,   82.5      31 0.00067   36.1  12.8   42  249-290   239-280 (1311)
398 KOG1916 Nuclear protein, conta  82.4    0.56 1.2E-05   46.2   0.9   63  220-283   193-266 (1283)
399 PF07676 PD40:  WD40-like Beta   81.4     7.7 0.00017   22.0   5.5   29  251-279     7-38  (39)
400 PF14761 HPS3_N:  Hermansky-Pud  81.1      30 0.00065   28.6  12.9   48   52-101    29-78  (215)
401 PF10168 Nup88:  Nuclear pore c  80.0      18  0.0004   35.9  10.3   75   37-112    82-179 (717)
402 PHA03098 kelch-like protein; P  79.6      58  0.0013   31.0  17.0   60   49-112   293-366 (534)
403 PF08728 CRT10:  CRT10;  InterP  79.3      49  0.0011   32.7  12.7  121   36-157    99-245 (717)
404 KOG4460 Nuclear pore complex,   79.0      33 0.00071   32.4  10.8   32   36-68    100-131 (741)
405 KOG1983 Tomosyn and related SN  78.1      69  0.0015   33.3  14.0   33   36-68     32-64  (993)
406 PF14583 Pectate_lyase22:  Olig  77.4      57  0.0012   29.7  13.2   92   67-161    16-113 (386)
407 PF12657 TFIIIC_delta:  Transcr  77.1      27 0.00059   27.8   9.0   29  130-158    87-121 (173)
408 KOG3630 Nuclear pore complex,   75.7     9.4  0.0002   39.1   6.9  115   40-158   101-228 (1405)
409 PF14655 RAB3GAP2_N:  Rab3 GTPa  75.5      52  0.0011   30.4  11.3   31  130-160   309-339 (415)
410 PF05694 SBP56:  56kDa selenium  71.6      41 0.00089   31.1   9.4  118   43-160   184-344 (461)
411 PF10647 Gmad1:  Lipoprotein Lp  70.7      66  0.0014   27.4  11.8  114   41-158    25-144 (253)
412 COG3386 Gluconolactonase [Carb  69.6      80  0.0017   27.9  18.4   50  221-271   223-275 (307)
413 COG5167 VID27 Protein involved  69.6      85  0.0018   29.8  11.0  118   34-157   461-590 (776)
414 PF11715 Nup160:  Nucleoporin N  69.1      16 0.00035   34.9   6.9   64  221-284   157-250 (547)
415 smart00564 PQQ beta-propeller   68.6      14 0.00031   19.8   4.0   23  224-246     8-30  (33)
416 smart00036 CNH Domain found in  68.5      82  0.0018   27.6  12.4  106   50-157    12-130 (302)
417 PF07995 GSDH:  Glucose / Sorbo  68.2      87  0.0019   27.8  10.9   57  221-277   270-330 (331)
418 PF11715 Nup160:  Nucleoporin N  67.6      15 0.00033   35.1   6.4   35   41-75    216-254 (547)
419 COG3386 Gluconolactonase [Carb  67.4      89  0.0019   27.6  11.6  118   41-158   112-243 (307)
420 cd00216 PQQ_DH Dehydrogenases   67.4 1.1E+02  0.0025   28.8  18.1   62   51-114    61-130 (488)
421 KOG2377 Uncharacterized conser  66.0 1.2E+02  0.0025   28.5  14.5  158   84-282    25-185 (657)
422 KOG3630 Nuclear pore complex,   65.0     5.8 0.00013   40.5   3.0   59  225-283   171-229 (1405)
423 KOG1916 Nuclear protein, conta  65.0       5 0.00011   40.0   2.5   70   41-111   185-264 (1283)
424 PF03088 Str_synth:  Strictosid  64.9      21 0.00044   25.1   5.0   42  228-270    33-74  (89)
425 PF07995 GSDH:  Glucose / Sorbo  64.6      37 0.00079   30.2   7.8   48   42-91      4-58  (331)
426 PF05096 Glu_cyclase_2:  Glutam  64.1      95  0.0021   26.7  12.7   58  222-282   100-157 (264)
427 PF06433 Me-amine-dh_H:  Methyl  63.3 1.1E+02  0.0024   27.3  18.1  137   20-160    67-215 (342)
428 PRK10115 protease 2; Provision  63.2 1.6E+02  0.0036   29.2  21.0  115   40-158   127-255 (686)
429 TIGR02608 delta_60_rpt delta-6  61.9      30 0.00066   21.8   4.9   18   42-59      3-20  (55)
430 TIGR02604 Piru_Ver_Nterm putat  61.5 1.2E+02  0.0027   27.2  16.7   41  232-273   164-204 (367)
431 PF14761 HPS3_N:  Hermansky-Pud  60.8      43 0.00093   27.8   6.8   51  223-274    29-81  (215)
432 KOG4659 Uncharacterized conser  60.8 2.4E+02  0.0052   30.3  13.7   23  127-149   660-682 (1899)
433 COG3204 Uncharacterized protei  60.4 1.2E+02  0.0026   26.6  19.2  122   36-158    82-210 (316)
434 PF10168 Nup88:  Nuclear pore c  60.4 1.5E+02  0.0032   29.7  11.7   33  209-242   149-181 (717)
435 COG5167 VID27 Protein involved  59.8      29 0.00064   32.6   6.2   63  220-284   571-634 (776)
436 PF13570 PQQ_3:  PQQ-like domai  59.6      18  0.0004   20.7   3.4   21   50-70     20-40  (40)
437 PHA02790 Kelch-like protein; P  59.6 1.6E+02  0.0034   27.8  16.2   58  221-283   407-471 (480)
438 PF01436 NHL:  NHL repeat;  Int  59.3      25 0.00054   18.5   3.8   24   43-66      5-28  (28)
439 COG3490 Uncharacterized protei  59.0 1.3E+02  0.0027   26.5  10.2   53   47-100   121-179 (366)
440 cd00216 PQQ_DH Dehydrogenases   57.9 1.7E+02  0.0037   27.6  22.9   30  221-250   405-434 (488)
441 KOG2247 WD40 repeat-containing  56.8     1.6 3.4E-05   40.4  -2.3  114   40-159    35-148 (615)
442 PF01011 PQQ:  PQQ enzyme repea  56.5      35 0.00076   19.3   4.6   25   53-77      2-26  (38)
443 PF15390 DUF4613:  Domain of un  55.7   2E+02  0.0044   27.8  14.6   66   40-112    20-90  (671)
444 PF12768 Rax2:  Cortical protei  55.7   1E+02  0.0022   26.8   8.7   53   59-112    14-72  (281)
445 PF14269 Arylsulfotran_2:  Aryl  54.6 1.3E+02  0.0027   26.5   9.3   70   41-111   145-219 (299)
446 PF04053 Coatomer_WDAD:  Coatom  53.7 1.9E+02  0.0042   27.0  16.5  109   37-158    30-173 (443)
447 PF05096 Glu_cyclase_2:  Glutam  51.4 1.6E+02  0.0035   25.4  22.7  180   44-270    49-249 (264)
448 TIGR03074 PQQ_membr_DH membran  50.7 2.8E+02  0.0061   28.0  22.1   62   51-112   194-278 (764)
449 TIGR03606 non_repeat_PQQ dehyd  49.8 2.3E+02  0.0049   26.7  14.6   56  222-278   369-431 (454)
450 TIGR03118 PEPCTERM_chp_1 conse  47.4 1.5E+02  0.0033   26.2   8.3   69  214-282    26-118 (336)
451 PF15390 DUF4613:  Domain of un  47.0 2.7E+02  0.0059   27.0  10.4   67   44-111   117-185 (671)
452 TIGR02604 Piru_Ver_Nterm putat  46.0 2.3E+02  0.0049   25.6  12.2  103   42-147    16-142 (367)
453 PF01731 Arylesterase:  Arylest  45.0      53  0.0012   22.9   4.4   28   43-70     57-85  (86)
454 PF10647 Gmad1:  Lipoprotein Lp  39.9 2.3E+02  0.0051   24.0  12.0  106   41-148    67-185 (253)
455 PF05694 SBP56:  56kDa selenium  39.6 3.2E+02   0.007   25.5  11.1   97   17-113   222-343 (461)
456 PHA03098 kelch-like protein; P  39.0 2.6E+02  0.0056   26.6   9.6   24   50-73    389-418 (534)
457 PF07250 Glyoxal_oxid_N:  Glyox  38.1 2.5E+02  0.0055   23.8  11.5   86   63-149    48-138 (243)
458 PRK13684 Ycf48-like protein; P  37.8 2.9E+02  0.0064   24.5  13.6  112   37-153   170-283 (334)
459 COG4590 ABC-type uncharacteriz  37.8 1.5E+02  0.0033   27.7   7.1  118   37-156   218-344 (733)
460 COG3823 Glutamine cyclotransfe  36.2 2.6E+02  0.0056   23.4  15.2   49  222-270   186-247 (262)
461 PF14779 BBS1:  Ciliary BBSome   36.0 2.2E+02  0.0048   24.4   7.5   55  223-278   196-254 (257)
462 PF14779 BBS1:  Ciliary BBSome   35.9 2.2E+02  0.0047   24.4   7.5   56   53-108   197-254 (257)
463 PRK13684 Ycf48-like protein; P  35.5 3.2E+02  0.0069   24.3  12.0  112   39-156   214-329 (334)
464 PF14269 Arylsulfotran_2:  Aryl  32.7 1.7E+02  0.0037   25.6   6.7   63  220-282   153-220 (299)
465 PF14870 PSII_BNR:  Photosynthe  32.3 3.5E+02  0.0076   23.8  19.2  105   44-156   108-213 (302)
466 PHA02790 Kelch-like protein; P  31.8 4.4E+02  0.0095   24.8   9.7   97   50-157   362-469 (480)
467 cd01268 Numb Numb Phosphotyros  30.9 2.5E+02  0.0053   21.6   6.9   53   52-107    51-103 (138)
468 KOG3616 Selective LIM binding   30.3 1.2E+02  0.0025   30.4   5.4   31   42-72     17-47  (1636)
469 PF01731 Arylesterase:  Arylest  30.2 1.9E+02  0.0041   20.1   6.7   48  232-282    36-84  (86)
470 PLN00033 photosystem II stabil  30.1 4.4E+02  0.0096   24.2  14.6  129   20-155   260-396 (398)
471 COG4590 ABC-type uncharacteriz  30.0 4.8E+02    0.01   24.6   9.7  105   49-160   278-388 (733)
472 PF12657 TFIIIC_delta:  Transcr  28.5 1.2E+02  0.0025   24.0   4.6   30  254-283    87-122 (173)
473 TIGR03118 PEPCTERM_chp_1 conse  28.5 4.2E+02  0.0092   23.5  22.3  219   42-284    25-281 (336)
474 PF12768 Rax2:  Cortical protei  28.1   4E+02  0.0087   23.2   9.6   75   36-111    33-122 (281)
475 TIGR03606 non_repeat_PQQ dehyd  26.6 3.1E+02  0.0068   25.7   7.5   52   42-93     32-90  (454)
476 PF10584 Proteasome_A_N:  Prote  26.4      18 0.00039   18.4  -0.3    8  259-266     7-14  (23)
477 PF14781 BBS2_N:  Ciliary BBSom  25.5 3.1E+02  0.0067   21.0  11.3  105   46-157     5-124 (136)
478 KOG3616 Selective LIM binding   25.3 1.3E+02  0.0027   30.2   4.7   59  219-282    23-83  (1636)
479 KOG1897 Damage-specific DNA bi  25.2   8E+02   0.017   25.6  21.0  110   36-155   445-563 (1096)
480 COG5308 NUP170 Nuclear pore co  24.9 2.5E+02  0.0055   28.7   6.7   28  129-158   182-209 (1263)
481 KOG1900 Nuclear pore complex,   24.9 1.8E+02  0.0039   30.8   6.0   70   40-113   179-273 (1311)
482 PF08801 Nucleoporin_N:  Nup133  24.4 5.5E+02   0.012   23.5  14.6   30  254-283   191-220 (422)
483 PF06739 SBBP:  Beta-propeller   24.2 1.5E+02  0.0032   16.9   3.3   22  253-274    13-34  (38)
484 KOG4460 Nuclear pore complex,   23.9 3.1E+02  0.0068   26.3   6.8   65  219-284   112-200 (741)
485 TIGR03075 PQQ_enz_alc_DH PQQ-d  23.1 2.7E+02  0.0058   26.7   6.6   54  221-274   471-526 (527)
486 KOG3356 Predicted membrane pro  22.8 1.1E+02  0.0024   22.4   3.0   32   31-62     60-91  (147)
487 COG3204 Uncharacterized protei  22.7 5.3E+02   0.012   22.7  14.8   74   77-159    81-159 (316)
488 PF12341 DUF3639:  Protein of u  22.3 1.4E+02   0.003   15.8   3.7   23   42-66      4-26  (27)
489 TIGR03054 photo_alph_chp1 puta  22.0 3.7E+02  0.0079   20.6   6.5   61  224-284    43-115 (135)
490 PF10214 Rrn6:  RNA polymerase   21.4 8.6E+02   0.019   24.6  15.9  122   37-160    77-234 (765)
491 TIGR02171 Fb_sc_TIGR02171 Fibr  21.3 3.8E+02  0.0081   27.6   7.2   54  230-284   327-387 (912)
492 PF13418 Kelch_4:  Galactose ox  21.3 1.3E+02  0.0029   17.6   2.9   23  221-243    12-40  (49)

No 1  
>KOG0271 consensus Notchless-like WD40 repeat-containing protein [Function unknown]
Probab=100.00  E-value=4.7e-36  Score=253.97  Aligned_cols=256  Identities=21%  Similarity=0.364  Sum_probs=186.4

Q ss_pred             hccccccccccCcCcccccCCCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceE-EEEecccCCeEEEEEccC--
Q 022074           16 ESLANVTEIHDGLDFSAADDGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLS-LRILAHTSDVNTVCFGDE--   92 (303)
Q Consensus        16 ~~~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~-~~~~~h~~~v~~l~~~~~--   92 (303)
                      |....+|+.-+-...  -.--||...|.|++|+|||+.||+|+.||+|++||.++|+.. ..+.+|...|++++|.|-  
T Consensus       136 D~TvR~WD~~TeTp~--~t~KgH~~WVlcvawsPDgk~iASG~~dg~I~lwdpktg~~~g~~l~gH~K~It~Lawep~hl  213 (480)
T KOG0271|consen  136 DTTVRLWDLDTETPL--FTCKGHKNWVLCVAWSPDGKKIASGSKDGSIRLWDPKTGQQIGRALRGHKKWITALAWEPLHL  213 (480)
T ss_pred             CceEEeeccCCCCcc--eeecCCccEEEEEEECCCcchhhccccCCeEEEecCCCCCcccccccCcccceeEEeeccccc
Confidence            445566665222211  133699999999999999999999999999999999998765 457899999999999642  


Q ss_pred             --CCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccc-cC
Q 022074           93 --SGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCN-LG  169 (303)
Q Consensus        93 --~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~-~~  169 (303)
                        ...+|+++++||+|+|||+.    .+.....+.||..+|+|+.+..+| +|++|+.|++|++|+........... +.
T Consensus       214 ~p~~r~las~skDg~vrIWd~~----~~~~~~~lsgHT~~VTCvrwGG~g-liySgS~DrtIkvw~a~dG~~~r~lkGHa  288 (480)
T KOG0271|consen  214 VPPCRRLASSSKDGSVRIWDTK----LGTCVRTLSGHTASVTCVRWGGEG-LIYSGSQDRTIKVWRALDGKLCRELKGHA  288 (480)
T ss_pred             CCCccceecccCCCCEEEEEcc----CceEEEEeccCccceEEEEEcCCc-eEEecCCCceEEEEEccchhHHHhhcccc
Confidence              24589999999999999975    334566788999999999998766 89999999999999865421110000 00


Q ss_pred             c------cceeeeceeeeCCCCCc-------------------------cccCCCC-------------CcceEEecccc
Q 022074          170 F------RSYEWDYRWMDYPPQAR-------------------------DLKHPCD-------------QSVATYKGHSV  205 (303)
Q Consensus       170 ~------~~~~~~~~~~~~~~~~~-------------------------~~~~~~~-------------~~~~~~~~~~~  205 (303)
                      .      .+.++..+.-.|.+...                         .+...++             +++..+.||+.
T Consensus       289 hwvN~lalsTdy~LRtgaf~~t~~~~~~~se~~~~Al~rY~~~~~~~~erlVSgsDd~tlflW~p~~~kkpi~rmtgHq~  368 (480)
T KOG0271|consen  289 HWVNHLALSTDYVLRTGAFDHTGRKPKSFSEEQKKALERYEAVLKDSGERLVSGSDDFTLFLWNPFKSKKPITRMTGHQA  368 (480)
T ss_pred             hheeeeeccchhhhhccccccccccCCChHHHHHHHHHHHHHhhccCcceeEEecCCceEEEecccccccchhhhhchhh
Confidence            0      00011111111111111                         1111111             11222333332


Q ss_pred             eeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEECCCCCeEEEEeCCCCEEEeecCCC
Q 022074          206 LRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFPGN  284 (303)
Q Consensus       206 ~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~~~  284 (303)
                      +.      ....||||++++|+|+-|+.|++||-++|+.+..|.+|-++|..++||.|.++|+|+|.|.+|++|++...
T Consensus       369 lV------n~V~fSPd~r~IASaSFDkSVkLW~g~tGk~lasfRGHv~~VYqvawsaDsRLlVS~SkDsTLKvw~V~tk  441 (480)
T KOG0271|consen  369 LV------NHVSFSPDGRYIASASFDKSVKLWDGRTGKFLASFRGHVAAVYQVAWSADSRLLVSGSKDSTLKVWDVRTK  441 (480)
T ss_pred             he------eeEEECCCccEEEEeecccceeeeeCCCcchhhhhhhccceeEEEEeccCccEEEEcCCCceEEEEEeeee
Confidence            21      23458999999999999999999999999999999999999999999999999999999999999998754


No 2  
>KOG0272 consensus U4/U6 small nuclear ribonucleoprotein Prp4 (contains WD40 repeats) [RNA processing and modification]
Probab=100.00  E-value=1.5e-36  Score=259.64  Aligned_cols=229  Identities=25%  Similarity=0.382  Sum_probs=196.2

Q ss_pred             eEEEEE----ccCchhhccccccccccCcCcc-cccCCCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEe
Q 022074            4 IVHIVD----VGSGTMESLANVTEIHDGLDFS-AADDGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRIL   78 (303)
Q Consensus         4 ~~~~~~----~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~   78 (303)
                      ..||++    +-+++-|.++-+|..   .+.. -++..||...|..++|+|+|++|+++|.|.+-||||+.++.+.....
T Consensus       224 ~fhP~~~~~~lat~s~Dgtvklw~~---~~e~~l~~l~gH~~RVs~VafHPsG~~L~TasfD~tWRlWD~~tk~ElL~QE  300 (459)
T KOG0272|consen  224 VFHPVDSDLNLATASADGTVKLWKL---SQETPLQDLEGHLARVSRVAFHPSGKFLGTASFDSTWRLWDLETKSELLLQE  300 (459)
T ss_pred             EEccCCCccceeeeccCCceeeecc---CCCcchhhhhcchhhheeeeecCCCceeeecccccchhhcccccchhhHhhc
Confidence            357774    667888888888876   3322 34558999999999999999999999999999999999999888888


Q ss_pred             cccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcc
Q 022074           79 AHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIR  158 (303)
Q Consensus        79 ~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~  158 (303)
                      +|..+|.+++|++ ++.+++||+.|..-|+||+|    +++.+-.+.||...|..++|+|+|-.++|||.|++++|||+|
T Consensus       301 GHs~~v~~iaf~~-DGSL~~tGGlD~~~RvWDlR----tgr~im~L~gH~k~I~~V~fsPNGy~lATgs~Dnt~kVWDLR  375 (459)
T KOG0272|consen  301 GHSKGVFSIAFQP-DGSLAATGGLDSLGRVWDLR----TGRCIMFLAGHIKEILSVAFSPNGYHLATGSSDNTCKVWDLR  375 (459)
T ss_pred             ccccccceeEecC-CCceeeccCccchhheeecc----cCcEEEEecccccceeeEeECCCceEEeecCCCCcEEEeeec
Confidence            9999999999965 68999999999999999998    455667789999999999999999999999999999999999


Q ss_pred             cccCCcccccCccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeC-CCeEEEEEeCCCeEEEE
Q 022074          159 KMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYST-GQKYIYTGSHDSCVYVY  237 (303)
Q Consensus       159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~-~~~~latg~~dg~i~iw  237 (303)
                      ...+                                  +.++.+|..+..-++      |+| .|.+|+|++.|++++||
T Consensus       376 ~r~~----------------------------------ly~ipAH~nlVS~Vk------~~p~~g~fL~TasyD~t~kiW  415 (459)
T KOG0272|consen  376 MRSE----------------------------------LYTIPAHSNLVSQVK------YSPQEGYFLVTASYDNTVKIW  415 (459)
T ss_pred             cccc----------------------------------ceecccccchhhheE------ecccCCeEEEEcccCcceeee
Confidence            6432                                  223333433322222      444 68999999999999999


Q ss_pred             ECCCCeEEEEeecCCCCeEEEEECCCCCeEEEEeCCCCEEEee
Q 022074          238 DLVSGEQVAALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWE  280 (303)
Q Consensus       238 d~~~~~~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd  280 (303)
                      ..++.++++.+.+|++.|.+++.+||+.+++|++.|+++++|.
T Consensus       416 s~~~~~~~ksLaGHe~kV~s~Dis~d~~~i~t~s~DRT~KLW~  458 (459)
T KOG0272|consen  416 STRTWSPLKSLAGHEGKVISLDISPDSQAIATSSFDRTIKLWR  458 (459)
T ss_pred             cCCCcccchhhcCCccceEEEEeccCCceEEEeccCceeeecc
Confidence            9999999999999999999999999999999999999999996


No 3  
>KOG0263 consensus Transcription initiation factor TFIID, subunit TAF5 (also component of histone acetyltransferase SAGA) [Transcription]
Probab=100.00  E-value=5.6e-36  Score=273.38  Aligned_cols=206  Identities=25%  Similarity=0.425  Sum_probs=185.4

Q ss_pred             CCCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccc
Q 022074           35 DGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCL  114 (303)
Q Consensus        35 ~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~  114 (303)
                      +.||+.||+.++|+|+.++|+++|.|++||||.+.+........+|..+|..+.|+ +.+.+|+|||.|++.++|...  
T Consensus       447 L~GH~GPVyg~sFsPd~rfLlScSED~svRLWsl~t~s~~V~y~GH~~PVwdV~F~-P~GyYFatas~D~tArLWs~d--  523 (707)
T KOG0263|consen  447 LYGHSGPVYGCSFSPDRRFLLSCSEDSSVRLWSLDTWSCLVIYKGHLAPVWDVQFA-PRGYYFATASHDQTARLWSTD--  523 (707)
T ss_pred             eecCCCceeeeeecccccceeeccCCcceeeeecccceeEEEecCCCcceeeEEec-CCceEEEecCCCceeeeeecc--
Confidence            46999999999999999999999999999999999998887888999999999997 569999999999999999865  


Q ss_pred             cCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCC
Q 022074          115 NVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCD  194 (303)
Q Consensus       115 ~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  194 (303)
                        ...|.+.+.||...|.|+.|+|+.+|++|||.|++||+||....                                  
T Consensus       524 --~~~PlRifaghlsDV~cv~FHPNs~Y~aTGSsD~tVRlWDv~~G----------------------------------  567 (707)
T KOG0263|consen  524 --HNKPLRIFAGHLSDVDCVSFHPNSNYVATGSSDRTVRLWDVSTG----------------------------------  567 (707)
T ss_pred             --cCCchhhhcccccccceEEECCcccccccCCCCceEEEEEcCCC----------------------------------
Confidence              35678899999999999999999999999999999999998642                                  


Q ss_pred             CcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEECCCCCeEEEEeCCC
Q 022074          195 QSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQPMLVSSSWDG  274 (303)
Q Consensus       195 ~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg  274 (303)
                      ..+..+.||......      ..|||+|++|++|++||.|.+||+.+++.+..+.+|++.|.++.||.+|..||++|.|+
T Consensus       568 ~~VRiF~GH~~~V~a------l~~Sp~Gr~LaSg~ed~~I~iWDl~~~~~v~~l~~Ht~ti~SlsFS~dg~vLasgg~Dn  641 (707)
T KOG0263|consen  568 NSVRIFTGHKGPVTA------LAFSPCGRYLASGDEDGLIKIWDLANGSLVKQLKGHTGTIYSLSFSRDGNVLASGGADN  641 (707)
T ss_pred             cEEEEecCCCCceEE------EEEcCCCceEeecccCCcEEEEEcCCCcchhhhhcccCceeEEEEecCCCEEEecCCCC
Confidence            235566777644433      34788999999999999999999999999999999999999999999999999999999


Q ss_pred             CEEEeecCCCC
Q 022074          275 DVVRWEFPGNG  285 (303)
Q Consensus       275 ~i~~Wd~~~~~  285 (303)
                      ++++||+....
T Consensus       642 sV~lWD~~~~~  652 (707)
T KOG0263|consen  642 SVRLWDLTKVI  652 (707)
T ss_pred             eEEEEEchhhc
Confidence            99999986543


No 4  
>KOG0315 consensus G-protein beta subunit-like protein (contains WD40 repeats) [General function prediction only]
Probab=100.00  E-value=2e-34  Score=231.83  Aligned_cols=264  Identities=22%  Similarity=0.347  Sum_probs=196.6

Q ss_pred             ceEEEEEccCchhhccccccccccCcCcccccCCCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCc--eEEEEecc
Q 022074            3 PIVHIVDVGSGTMESLANVTEIHDGLDFSAADDGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANK--LSLRILAH   80 (303)
Q Consensus         3 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~--~~~~~~~h   80 (303)
                      |-.++|+.-+|+.|+.|..|+.++|.=.-++-  =-+.-|..+...|+++.|+++++-. |||||+.++.  ....+.+|
T Consensus         6 ~~d~~viLvsA~YDhTIRfWqa~tG~C~rTiq--h~dsqVNrLeiTpdk~~LAaa~~qh-vRlyD~~S~np~Pv~t~e~h   82 (311)
T KOG0315|consen    6 PTDDPVILVSAGYDHTIRFWQALTGICSRTIQ--HPDSQVNRLEITPDKKDLAAAGNQH-VRLYDLNSNNPNPVATFEGH   82 (311)
T ss_pred             CCCCceEEEeccCcceeeeeehhcCeEEEEEe--cCccceeeEEEcCCcchhhhccCCe-eEEEEccCCCCCceeEEecc
Confidence            55689999999999999999999998221111  1123499999999999999998775 9999998875  45678899


Q ss_pred             cCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccc
Q 022074           81 TSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKM  160 (303)
Q Consensus        81 ~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~  160 (303)
                      +..|..+.|. .+++.+.||++||++|+||+|...     ......|..+|+.+..+|+...|++|..+|.|++||++..
T Consensus        83 ~kNVtaVgF~-~dgrWMyTgseDgt~kIWdlR~~~-----~qR~~~~~spVn~vvlhpnQteLis~dqsg~irvWDl~~~  156 (311)
T KOG0315|consen   83 TKNVTAVGFQ-CDGRWMYTGSEDGTVKIWDLRSLS-----CQRNYQHNSPVNTVVLHPNQTELISGDQSGNIRVWDLGEN  156 (311)
T ss_pred             CCceEEEEEe-ecCeEEEecCCCceEEEEeccCcc-----cchhccCCCCcceEEecCCcceEEeecCCCcEEEEEccCC
Confidence            9999999995 568999999999999999998422     2234468899999999999999999999999999999853


Q ss_pred             cCCcccccCccceeeeceeeeCCCCCccccCCCC------------------CcceEEecccceeeeEEEeeeeeeeCCC
Q 022074          161 SSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCD------------------QSVATYKGHSVLRTLIRCHFSPVYSTGQ  222 (303)
Q Consensus       161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------------~~~~~~~~~~~~~~~~~~~~~~~~s~~~  222 (303)
                      ...  +.+ .+...-.++.+...+++..+.....                  .++..++.|.  ..+.+|.    +|||+
T Consensus       157 ~c~--~~l-iPe~~~~i~sl~v~~dgsml~a~nnkG~cyvW~l~~~~~~s~l~P~~k~~ah~--~~il~C~----lSPd~  227 (311)
T KOG0315|consen  157 SCT--HEL-IPEDDTSIQSLTVMPDGSMLAAANNKGNCYVWRLLNHQTASELEPVHKFQAHN--GHILRCL----LSPDV  227 (311)
T ss_pred             ccc--ccc-CCCCCcceeeEEEcCCCcEEEEecCCccEEEEEccCCCccccceEhhheeccc--ceEEEEE----ECCCC
Confidence            211  111 0011111222223333332221111                  0111122222  1233443    67899


Q ss_pred             eEEEEEeCCCeEEEEECCCC-eEEEEeecCCCCeEEEEECCCCCeEEEEeCCCCEEEeecCCC
Q 022074          223 KYIYTGSHDSCVYVYDLVSG-EQVAALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFPGN  284 (303)
Q Consensus       223 ~~latg~~dg~i~iwd~~~~-~~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~~~  284 (303)
                      ++||++|.|.+++||+.++. +....+++|+.++++++||.|++||+||+.|+..++|+++..
T Consensus       228 k~lat~ssdktv~iwn~~~~~kle~~l~gh~rWvWdc~FS~dg~YlvTassd~~~rlW~~~~~  290 (311)
T KOG0315|consen  228 KYLATCSSDKTVKIWNTDDFFKLELVLTGHQRWVWDCAFSADGEYLVTASSDHTARLWDLSAG  290 (311)
T ss_pred             cEEEeecCCceEEEEecCCceeeEEEeecCCceEEeeeeccCccEEEecCCCCceeecccccC
Confidence            99999999999999999987 556678999999999999999999999999999999998754


No 5  
>KOG0272 consensus U4/U6 small nuclear ribonucleoprotein Prp4 (contains WD40 repeats) [RNA processing and modification]
Probab=100.00  E-value=7.7e-35  Score=249.24  Aligned_cols=204  Identities=25%  Similarity=0.367  Sum_probs=179.2

Q ss_pred             CcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccC-CCcEEEEecCCCeEEEEcCcccc
Q 022074           37 GYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDE-SGHLIYSGSDDNLCKVWDRRCLN  115 (303)
Q Consensus        37 ~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~-~~~~l~s~s~dg~v~lWd~~~~~  115 (303)
                      |=+.||..+.|++|++.|+|||++|.++||+.++.....++.+|+..|.++.|+|. ++..++||+.||+|++|++.   
T Consensus       173 gd~rPis~~~fS~ds~~laT~swsG~~kvW~~~~~~~~~~l~gH~~~v~~~~fhP~~~~~~lat~s~Dgtvklw~~~---  249 (459)
T KOG0272|consen  173 GDTRPISGCSFSRDSKHLATGSWSGLVKVWSVPQCNLLQTLRGHTSRVGAAVFHPVDSDLNLATASADGTVKLWKLS---  249 (459)
T ss_pred             cCCCcceeeEeecCCCeEEEeecCCceeEeecCCcceeEEEeccccceeeEEEccCCCccceeeeccCCceeeeccC---
Confidence            56678999999999999999999999999999999888999999999999999987 47789999999999999975   


Q ss_pred             CCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCC
Q 022074          116 VKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQ  195 (303)
Q Consensus       116 ~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  195 (303)
                       ...++..+.||...|..++|+|+|++|+|++.|.+-|+||++.......                              
T Consensus       250 -~e~~l~~l~gH~~RVs~VafHPsG~~L~TasfD~tWRlWD~~tk~ElL~------------------------------  298 (459)
T KOG0272|consen  250 -QETPLQDLEGHLARVSRVAFHPSGKFLGTASFDSTWRLWDLETKSELLL------------------------------  298 (459)
T ss_pred             -CCcchhhhhcchhhheeeeecCCCceeeecccccchhhcccccchhhHh------------------------------
Confidence             3367888999999999999999999999999999999999975332111                              


Q ss_pred             cceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEECCCCCeEEEEeCCCC
Q 022074          196 SVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQPMLVSSSWDGD  275 (303)
Q Consensus       196 ~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~  275 (303)
                          ..||.      +..++.+|.+||.+++|||.|..-+|||+++|.++..+.+|..+|.+|+|||+|-.|||||.|++
T Consensus       299 ----QEGHs------~~v~~iaf~~DGSL~~tGGlD~~~RvWDlRtgr~im~L~gH~k~I~~V~fsPNGy~lATgs~Dnt  368 (459)
T KOG0272|consen  299 ----QEGHS------KGVFSIAFQPDGSLAATGGLDSLGRVWDLRTGRCIMFLAGHIKEILSVAFSPNGYHLATGSSDNT  368 (459)
T ss_pred             ----hcccc------cccceeEecCCCceeeccCccchhheeecccCcEEEEecccccceeeEeECCCceEEeecCCCCc
Confidence                11222      11234457789999999999999999999999999999999999999999999999999999999


Q ss_pred             EEEeecCCC
Q 022074          276 VVRWEFPGN  284 (303)
Q Consensus       276 i~~Wd~~~~  284 (303)
                      +++||+..-
T Consensus       369 ~kVWDLR~r  377 (459)
T KOG0272|consen  369 CKVWDLRMR  377 (459)
T ss_pred             EEEeeeccc
Confidence            999998743


No 6  
>KOG0271 consensus Notchless-like WD40 repeat-containing protein [Function unknown]
Probab=100.00  E-value=5.5e-33  Score=235.37  Aligned_cols=228  Identities=26%  Similarity=0.407  Sum_probs=190.4

Q ss_pred             ccCchhhccccccccccCcCcccccCCCcccceEEEEEcC-----CCCEEEEeeCCCeEEEEECCCCceEEEEecccCCe
Q 022074           10 VGSGTMESLANVTEIHDGLDFSAADDGGYSFGIFSLKFST-----DGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDV   84 (303)
Q Consensus        10 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~s~-----~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v   84 (303)
                      |-||+|+..|.+|+--+|.+. .--..||+-.|.+++|.|     ..++++++|.||+|+|||+..++.+..+.+|+..|
T Consensus       172 iASG~~dg~I~lwdpktg~~~-g~~l~gH~K~It~Lawep~hl~p~~r~las~skDg~vrIWd~~~~~~~~~lsgHT~~V  250 (480)
T KOG0271|consen  172 IASGSKDGSIRLWDPKTGQQI-GRALRGHKKWITALAWEPLHLVPPCRRLASSSKDGSVRIWDTKLGTCVRTLSGHTASV  250 (480)
T ss_pred             hhccccCCeEEEecCCCCCcc-cccccCcccceeEEeecccccCCCccceecccCCCCEEEEEccCceEEEEeccCccce
Confidence            679999999999998777655 445589999999999986     57789999999999999999999998999999999


Q ss_pred             EEEEEccCCCcEEEEecCCCeEEEEcCccc-----------------------------cC-------------------
Q 022074           85 NTVCFGDESGHLIYSGSDDNLCKVWDRRCL-----------------------------NV-------------------  116 (303)
Q Consensus        85 ~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~-----------------------------~~-------------------  116 (303)
                      +|++|.-  ..++.|||.|++|++|+...+                             ..                   
T Consensus       251 TCvrwGG--~gliySgS~DrtIkvw~a~dG~~~r~lkGHahwvN~lalsTdy~LRtgaf~~t~~~~~~~se~~~~Al~rY  328 (480)
T KOG0271|consen  251 TCVRWGG--EGLIYSGSQDRTIKVWRALDGKLCRELKGHAHWVNHLALSTDYVLRTGAFDHTGRKPKSFSEEQKKALERY  328 (480)
T ss_pred             EEEEEcC--CceEEecCCCceEEEEEccchhHHHhhcccchheeeeeccchhhhhccccccccccCCChHHHHHHHHHHH
Confidence            9999953  358999999999999974210                             00                   


Q ss_pred             ---------------------------CCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccC
Q 022074          117 ---------------------------KGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLG  169 (303)
Q Consensus       117 ---------------------------~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~  169 (303)
                                                 ..+++....||..-|+.+.|+||+.++++++.|++||+||.+..+        
T Consensus       329 ~~~~~~~~erlVSgsDd~tlflW~p~~~kkpi~rmtgHq~lVn~V~fSPd~r~IASaSFDkSVkLW~g~tGk--------  400 (480)
T KOG0271|consen  329 EAVLKDSGERLVSGSDDFTLFLWNPFKSKKPITRMTGHQALVNHVSFSPDGRYIASASFDKSVKLWDGRTGK--------  400 (480)
T ss_pred             HHhhccCcceeEEecCCceEEEecccccccchhhhhchhhheeeEEECCCccEEEEeecccceeeeeCCCcc--------
Confidence                                       011333456899999999999999999999999999999987533        


Q ss_pred             ccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEee
Q 022074          170 FRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALK  249 (303)
Q Consensus       170 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~  249 (303)
                                                .+.++.||-...+      ...++.|.++|++|+.|.++++||+++.++...+.
T Consensus       401 --------------------------~lasfRGHv~~VY------qvawsaDsRLlVS~SkDsTLKvw~V~tkKl~~DLp  448 (480)
T KOG0271|consen  401 --------------------------FLASFRGHVAAVY------QVAWSADSRLLVSGSKDSTLKVWDVRTKKLKQDLP  448 (480)
T ss_pred             --------------------------hhhhhhhccceeE------EEEeccCccEEEEcCCCceEEEEEeeeeeecccCC
Confidence                                      2344444432221      22367789999999999999999999999888999


Q ss_pred             cCCCCeEEEEECCCCCeEEEEeCCCCEEEee
Q 022074          250 YHTSPVRDCSWHPSQPMLVSSSWDGDVVRWE  280 (303)
Q Consensus       250 ~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd  280 (303)
                      +|.+.|.+++|+|||..+++|+.|..+++|.
T Consensus       449 Gh~DEVf~vDwspDG~rV~sggkdkv~~lw~  479 (480)
T KOG0271|consen  449 GHADEVFAVDWSPDGQRVASGGKDKVLRLWR  479 (480)
T ss_pred             CCCceEEEEEecCCCceeecCCCceEEEeec
Confidence            9999999999999999999999999999995


No 7  
>KOG0286 consensus G-protein beta subunit [General function prediction only]
Probab=100.00  E-value=5.6e-32  Score=222.21  Aligned_cols=227  Identities=23%  Similarity=0.348  Sum_probs=188.3

Q ss_pred             cCchhhcccccc-ccccCcCcccccCCCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEE
Q 022074           11 GSGTMESLANVT-EIHDGLDFSAADDGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCF   89 (303)
Q Consensus        11 ~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~   89 (303)
                      |+..+|+.-.+. +--+|...-.-...||+..+.|+.|.+| ..|+++|.|.+.-+||+++++....+.+|.+.|-++.+
T Consensus       116 GLdN~Csiy~ls~~d~~g~~~v~r~l~gHtgylScC~f~dD-~~ilT~SGD~TCalWDie~g~~~~~f~GH~gDV~slsl  194 (343)
T KOG0286|consen  116 GLDNKCSIYPLSTRDAEGNVRVSRELAGHTGYLSCCRFLDD-NHILTGSGDMTCALWDIETGQQTQVFHGHTGDVMSLSL  194 (343)
T ss_pred             CcCceeEEEecccccccccceeeeeecCccceeEEEEEcCC-CceEecCCCceEEEEEcccceEEEEecCCcccEEEEec
Confidence            666677665555 1112222223345899999999999985 56999999999999999999999999999999999999


Q ss_pred             ccCCCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccC
Q 022074           90 GDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLG  169 (303)
Q Consensus        90 ~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~  169 (303)
                      .|.+++.|+||+-|++.+|||.|.    +...+.|.||...|+++.|.|+|.-|+||+.|+++|+||+|...+...++. 
T Consensus       195 ~p~~~ntFvSg~cD~~aklWD~R~----~~c~qtF~ghesDINsv~ffP~G~afatGSDD~tcRlyDlRaD~~~a~ys~-  269 (343)
T KOG0286|consen  195 SPSDGNTFVSGGCDKSAKLWDVRS----GQCVQTFEGHESDINSVRFFPSGDAFATGSDDATCRLYDLRADQELAVYSH-  269 (343)
T ss_pred             CCCCCCeEEecccccceeeeeccC----cceeEeecccccccceEEEccCCCeeeecCCCceeEEEeecCCcEEeeecc-
Confidence            776899999999999999999983    456778999999999999999999999999999999999997543322211 


Q ss_pred             ccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEee
Q 022074          170 FRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALK  249 (303)
Q Consensus       170 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~  249 (303)
                                                       ..    ......+..||..|++|.+|..|.++.+||.-.++.+..+.
T Consensus       270 ---------------------------------~~----~~~gitSv~FS~SGRlLfagy~d~~c~vWDtlk~e~vg~L~  312 (343)
T KOG0286|consen  270 ---------------------------------DS----IICGITSVAFSKSGRLLFAGYDDFTCNVWDTLKGERVGVLA  312 (343)
T ss_pred             ---------------------------------Cc----ccCCceeEEEcccccEEEeeecCCceeEeeccccceEEEee
Confidence                                             00    00111234578889999999999999999999999999999


Q ss_pred             cCCCCeEEEEECCCCCeEEEEeCCCCEEEee
Q 022074          250 YHTSPVRDCSWHPSQPMLVSSSWDGDVVRWE  280 (303)
Q Consensus       250 ~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd  280 (303)
                      +|+.+|.++..+|||..|+|||+|.++++|.
T Consensus       313 GHeNRvScl~~s~DG~av~TgSWDs~lriW~  343 (343)
T KOG0286|consen  313 GHENRVSCLGVSPDGMAVATGSWDSTLRIWA  343 (343)
T ss_pred             ccCCeeEEEEECCCCcEEEecchhHheeecC
Confidence            9999999999999999999999999999994


No 8  
>KOG0279 consensus G protein beta subunit-like protein [Signal transduction mechanisms]
Probab=100.00  E-value=3.5e-31  Score=216.27  Aligned_cols=230  Identities=21%  Similarity=0.293  Sum_probs=179.2

Q ss_pred             EccCchhhccccccccccCcCcccc---cCCCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeE
Q 022074            9 DVGSGTMESLANVTEIHDGLDFSAA---DDGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVN   85 (303)
Q Consensus         9 ~~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~   85 (303)
                      ++=+++||-.+=+|....-+..-+-   -..||+.-|..+..++||++++++|+|+++|+||+.+++...++.+|...|.
T Consensus        30 ~l~sasrDk~ii~W~L~~dd~~~G~~~r~~~GHsH~v~dv~~s~dg~~alS~swD~~lrlWDl~~g~~t~~f~GH~~dVl  109 (315)
T KOG0279|consen   30 ILVSASRDKTIIVWKLTSDDIKYGVPVRRLTGHSHFVSDVVLSSDGNFALSASWDGTLRLWDLATGESTRRFVGHTKDVL  109 (315)
T ss_pred             eEEEcccceEEEEEEeccCccccCceeeeeeccceEecceEEccCCceEEeccccceEEEEEecCCcEEEEEEecCCceE
Confidence            4558899988877776443222111   1259999999999999999999999999999999999988889999999999


Q ss_pred             EEEEccCCCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCC--CCEEEEEeCCCcEEEEEcccccCC
Q 022074           86 TVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGD--GRYLISNGKDQAIKLWDIRKMSSN  163 (303)
Q Consensus        86 ~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~--~~~l~s~~~D~~v~lWdl~~~~~~  163 (303)
                      +++|+++ +.+++||+.|.++++|+....   ......-.+|.+-|.++.|+|+  ..+|+++|.|++||+||++..+..
T Consensus       110 sva~s~d-n~qivSGSrDkTiklwnt~g~---ck~t~~~~~~~~WVscvrfsP~~~~p~Ivs~s~DktvKvWnl~~~~l~  185 (315)
T KOG0279|consen  110 SVAFSTD-NRQIVSGSRDKTIKLWNTLGV---CKYTIHEDSHREWVSCVRFSPNESNPIIVSASWDKTVKVWNLRNCQLR  185 (315)
T ss_pred             EEEecCC-CceeecCCCcceeeeeeeccc---EEEEEecCCCcCcEEEEEEcCCCCCcEEEEccCCceEEEEccCCcchh
Confidence            9999764 678999999999999996421   1111111223667999999997  789999999999999999853321


Q ss_pred             cccccCccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCe
Q 022074          164 ASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGE  243 (303)
Q Consensus       164 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~  243 (303)
                                                        ..+-||......      ..+||||.+.++|+.||.+.+||+..++
T Consensus       186 ----------------------------------~~~~gh~~~v~t------~~vSpDGslcasGgkdg~~~LwdL~~~k  225 (315)
T KOG0279|consen  186 ----------------------------------TTFIGHSGYVNT------VTVSPDGSLCASGGKDGEAMLWDLNEGK  225 (315)
T ss_pred             ----------------------------------hccccccccEEE------EEECCCCCEEecCCCCceEEEEEccCCc
Confidence                                              112223222211      2368999999999999999999999999


Q ss_pred             EEEEeecCCCCeEEEEECCCCCeEEEEeCCCCEEEeecCCC
Q 022074          244 QVAALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFPGN  284 (303)
Q Consensus       244 ~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~~~  284 (303)
                      .++.+ .|..+|.+++|+|+..+|+.+- +..|++||..+.
T Consensus       226 ~lysl-~a~~~v~sl~fspnrywL~~at-~~sIkIwdl~~~  264 (315)
T KOG0279|consen  226 NLYSL-EAFDIVNSLCFSPNRYWLCAAT-ATSIKIWDLESK  264 (315)
T ss_pred             eeEec-cCCCeEeeEEecCCceeEeecc-CCceEEEeccch
Confidence            88887 4888999999999988777665 556999998754


No 9  
>KOG0282 consensus mRNA splicing factor [Function unknown]
Probab=99.98  E-value=1.2e-31  Score=233.20  Aligned_cols=257  Identities=23%  Similarity=0.377  Sum_probs=195.3

Q ss_pred             cCchhhccccccccccCcCcccccCCCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEc
Q 022074           11 GSGTMESLANVTEIHDGLDFSAADDGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFG   90 (303)
Q Consensus        11 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~   90 (303)
                      =|++||.++-||+.++-..- -.+..||+.+|.+++|+.+|..++++|.|+.+++||+++|+....+. -...+.|+.|+
T Consensus       231 LS~gmD~~vklW~vy~~~~~-lrtf~gH~k~Vrd~~~s~~g~~fLS~sfD~~lKlwDtETG~~~~~f~-~~~~~~cvkf~  308 (503)
T KOG0282|consen  231 LSGGMDGLVKLWNVYDDRRC-LRTFKGHRKPVRDASFNNCGTSFLSASFDRFLKLWDTETGQVLSRFH-LDKVPTCVKFH  308 (503)
T ss_pred             EecCCCceEEEEEEecCcce-ehhhhcchhhhhhhhccccCCeeeeeecceeeeeeccccceEEEEEe-cCCCceeeecC
Confidence            37899999999999873333 34668999999999999999999999999999999999998765443 33467899998


Q ss_pred             cCCCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCc
Q 022074           91 DESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGF  170 (303)
Q Consensus        91 ~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~  170 (303)
                      |++.+.|++|+.|+.|+.||+|.    +..+..+..|..+|..+.|-+++.++++.+.|+++++|+.+........   .
T Consensus       309 pd~~n~fl~G~sd~ki~~wDiRs----~kvvqeYd~hLg~i~~i~F~~~g~rFissSDdks~riWe~~~~v~ik~i---~  381 (503)
T KOG0282|consen  309 PDNQNIFLVGGSDKKIRQWDIRS----GKVVQEYDRHLGAILDITFVDEGRRFISSSDDKSVRIWENRIPVPIKNI---A  381 (503)
T ss_pred             CCCCcEEEEecCCCcEEEEeccc----hHHHHHHHhhhhheeeeEEccCCceEeeeccCccEEEEEcCCCccchhh---c
Confidence            87778999999999999999984    3456667789999999999999999999999999999998753221100   0


Q ss_pred             cceeeeceeeeCCCCCccccCC-CCCcceE--------------EecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEE
Q 022074          171 RSYEWDYRWMDYPPQARDLKHP-CDQSVAT--------------YKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVY  235 (303)
Q Consensus       171 ~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~--------------~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~  235 (303)
                      ..-......+...|..+.+... .++.+..              +.||..    ..+.....|||||++|++|+.||.+.
T Consensus       382 ~~~~hsmP~~~~~P~~~~~~aQs~dN~i~ifs~~~~~r~nkkK~feGh~v----aGys~~v~fSpDG~~l~SGdsdG~v~  457 (503)
T KOG0282|consen  382 DPEMHTMPCLTLHPNGKWFAAQSMDNYIAIFSTVPPFRLNKKKRFEGHSV----AGYSCQVDFSPDGRTLCSGDSDGKVN  457 (503)
T ss_pred             chhhccCcceecCCCCCeehhhccCceEEEEecccccccCHhhhhcceec----cCceeeEEEcCCCCeEEeecCCccEE
Confidence            0000011112222222222111 1222222              233321    22233456889999999999999999


Q ss_pred             EEECCCCeEEEEeecCCCCeEEEEECCCCC-eEEEEeCCCCEEEee
Q 022074          236 VYDLVSGEQVAALKYHTSPVRDCSWHPSQP-MLVSSSWDGDVVRWE  280 (303)
Q Consensus       236 iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~-~las~s~Dg~i~~Wd  280 (303)
                      +||.++-+++..+++|..++..+.|+|..+ .+||++.||.|++|+
T Consensus       458 ~wdwkt~kl~~~lkah~~~ci~v~wHP~e~Skvat~~w~G~Ikiwd  503 (503)
T KOG0282|consen  458 FWDWKTTKLVSKLKAHDQPCIGVDWHPVEPSKVATCGWDGLIKIWD  503 (503)
T ss_pred             EeechhhhhhhccccCCcceEEEEecCCCcceeEecccCceeEecC
Confidence            999999999999999999999999999875 799999999999996


No 10 
>KOG0286 consensus G-protein beta subunit [General function prediction only]
Probab=99.97  E-value=7.6e-30  Score=209.70  Aligned_cols=222  Identities=23%  Similarity=0.364  Sum_probs=187.8

Q ss_pred             CchhhccccccccccCcCcccccCCCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCC------ceEEEEecccCCeE
Q 022074           12 SGTMESLANVTEIHDGLDFSAADDGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEAN------KLSLRILAHTSDVN   85 (303)
Q Consensus        12 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~------~~~~~~~~h~~~v~   85 (303)
                      |+|-|.-.=||+-++.++.-++.  -....|.+++|+|.|+.+|+|+-|+...||++.+.      +...++.+|++-+.
T Consensus        72 SaSqDGklIvWDs~TtnK~haip--l~s~WVMtCA~sPSg~~VAcGGLdN~Csiy~ls~~d~~g~~~v~r~l~gHtgylS  149 (343)
T KOG0286|consen   72 SASQDGKLIVWDSFTTNKVHAIP--LPSSWVMTCAYSPSGNFVACGGLDNKCSIYPLSTRDAEGNVRVSRELAGHTGYLS  149 (343)
T ss_pred             eeccCCeEEEEEcccccceeEEe--cCceeEEEEEECCCCCeEEecCcCceeEEEecccccccccceeeeeecCccceeE
Confidence            67777777899999999765433  33678999999999999999999999999999865      23456889999999


Q ss_pred             EEEEccCCCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCC-CCCEEEEEeCCCcEEEEEcccccCCc
Q 022074           86 TVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRG-DGRYLISNGKDQAIKLWDIRKMSSNA  164 (303)
Q Consensus        86 ~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~-~~~~l~s~~~D~~v~lWdl~~~~~~~  164 (303)
                      |+.|.+  +..++|+|.|.++.+||++    .+.....|.||.+.|.+++++| +++.+++|+.|+..+|||+|...   
T Consensus       150 cC~f~d--D~~ilT~SGD~TCalWDie----~g~~~~~f~GH~gDV~slsl~p~~~ntFvSg~cD~~aklWD~R~~~---  220 (343)
T KOG0286|consen  150 CCRFLD--DNHILTGSGDMTCALWDIE----TGQQTQVFHGHTGDVMSLSLSPSDGNTFVSGGCDKSAKLWDVRSGQ---  220 (343)
T ss_pred             EEEEcC--CCceEecCCCceEEEEEcc----cceEEEEecCCcccEEEEecCCCCCCeEEecccccceeeeeccCcc---
Confidence            999954  4678899999999999986    5567788999999999999999 89999999999999999998532   


Q ss_pred             ccccCccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeE
Q 022074          165 SCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQ  244 (303)
Q Consensus       165 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~  244 (303)
                                                     .++++.||...+..+      .|.|+|.-+++|++|+++++||++..+.
T Consensus       221 -------------------------------c~qtF~ghesDINsv------~ffP~G~afatGSDD~tcRlyDlRaD~~  263 (343)
T KOG0286|consen  221 -------------------------------CVQTFEGHESDINSV------RFFPSGDAFATGSDDATCRLYDLRADQE  263 (343)
T ss_pred             -------------------------------eeEeecccccccceE------EEccCCCeeeecCCCceeEEEeecCCcE
Confidence                                           345666665544333      3667899999999999999999999888


Q ss_pred             EEEeecC--CCCeEEEEECCCCCeEEEEeCCCCEEEeec
Q 022074          245 VAALKYH--TSPVRDCSWHPSQPMLVSSSWDGDVVRWEF  281 (303)
Q Consensus       245 ~~~~~~h--~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~  281 (303)
                      +..++..  ..+|++++||..|++|.+|..|.++.+||.
T Consensus       264 ~a~ys~~~~~~gitSv~FS~SGRlLfagy~d~~c~vWDt  302 (343)
T KOG0286|consen  264 LAVYSHDSIICGITSVAFSKSGRLLFAGYDDFTCNVWDT  302 (343)
T ss_pred             EeeeccCcccCCceeEEEcccccEEEeeecCCceeEeec
Confidence            8887632  368999999999999999999999999995


No 11 
>KOG0279 consensus G protein beta subunit-like protein [Signal transduction mechanisms]
Probab=99.97  E-value=7.5e-30  Score=208.51  Aligned_cols=205  Identities=22%  Similarity=0.368  Sum_probs=171.1

Q ss_pred             cCCCcccceEEEEEcCC-CCEEEEeeCCCeEEEEECCC-----CceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEE
Q 022074           34 DDGGYSFGIFSLKFSTD-GRELVAGSSDDCIYVYDLEA-----NKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCK  107 (303)
Q Consensus        34 ~~~~~~~~v~~l~~s~~-g~~l~sgs~Dg~v~lwd~~~-----~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~  107 (303)
                      ...||+..|.+++..+. -+.+++++.|.++.+|++..     |..+..+.+|...|+.+..++ ++++++|+++|+++|
T Consensus        10 tl~gh~d~Vt~la~~~~~~~~l~sasrDk~ii~W~L~~dd~~~G~~~r~~~GHsH~v~dv~~s~-dg~~alS~swD~~lr   88 (315)
T KOG0279|consen   10 TLEGHTDWVTALAIKIKNSDILVSASRDKTIIVWKLTSDDIKYGVPVRRLTGHSHFVSDVVLSS-DGNFALSASWDGTLR   88 (315)
T ss_pred             eecCCCceEEEEEeecCCCceEEEcccceEEEEEEeccCccccCceeeeeeccceEecceEEcc-CCceEEeccccceEE
Confidence            45799999999999986 55788899999999998865     445678899999999999864 588999999999999


Q ss_pred             EEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCc
Q 022074          108 VWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQAR  187 (303)
Q Consensus       108 lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  187 (303)
                      +||+.    .+++.+.|.||...|.++++++|.++++||++|+++++|++-..     |                     
T Consensus        89 lWDl~----~g~~t~~f~GH~~dVlsva~s~dn~qivSGSrDkTiklwnt~g~-----c---------------------  138 (315)
T KOG0279|consen   89 LWDLA----TGESTRRFVGHTKDVLSVAFSTDNRQIVSGSRDKTIKLWNTLGV-----C---------------------  138 (315)
T ss_pred             EEEec----CCcEEEEEEecCCceEEEEecCCCceeecCCCcceeeeeeeccc-----E---------------------
Confidence            99985    34567789999999999999999999999999999999997421     1                     


Q ss_pred             cccCCCCCcceEEecc--cceeeeEEEeeeeeeeCC--CeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEECCC
Q 022074          188 DLKHPCDQSVATYKGH--SVLRTLIRCHFSPVYSTG--QKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSWHPS  263 (303)
Q Consensus       188 ~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~s~~--~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~  263 (303)
                               ..+...+  .....++  .    |+|+  ..+++++|.|+++++||+.+.+....+-+|+..++.|++|||
T Consensus       139 ---------k~t~~~~~~~~WVscv--r----fsP~~~~p~Ivs~s~DktvKvWnl~~~~l~~~~~gh~~~v~t~~vSpD  203 (315)
T KOG0279|consen  139 ---------KYTIHEDSHREWVSCV--R----FSPNESNPIIVSASWDKTVKVWNLRNCQLRTTFIGHSGYVNTVTVSPD  203 (315)
T ss_pred             ---------EEEEecCCCcCcEEEE--E----EcCCCCCcEEEEccCCceEEEEccCCcchhhccccccccEEEEEECCC
Confidence                     1111111  1111222  2    4444  789999999999999999999988899999999999999999


Q ss_pred             CCeEEEEeCCCCEEEeecCCC
Q 022074          264 QPMLVSSSWDGDVVRWEFPGN  284 (303)
Q Consensus       264 ~~~las~s~Dg~i~~Wd~~~~  284 (303)
                      |...++|+.||++.+||+...
T Consensus       204 GslcasGgkdg~~~LwdL~~~  224 (315)
T KOG0279|consen  204 GSLCASGGKDGEAMLWDLNEG  224 (315)
T ss_pred             CCEEecCCCCceEEEEEccCC
Confidence            999999999999999999755


No 12 
>KOG0281 consensus Beta-TrCP (transducin repeats containing)/Slimb proteins [Function unknown]
Probab=99.97  E-value=1.5e-31  Score=224.49  Aligned_cols=220  Identities=22%  Similarity=0.391  Sum_probs=180.3

Q ss_pred             ccCchhhccccccccccCcCcc-cccCCCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEE
Q 022074           10 VGSGTMESLANVTEIHDGLDFS-AADDGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVC   88 (303)
Q Consensus        10 ~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~   88 (303)
                      |=||.-|..|.||+.   +..+ --..+||++.|.|+.|..  +.+++||.|.+|++||.+++.....+..|.+.|-.+.
T Consensus       210 iVSGlrDnTikiWD~---n~~~c~~~L~GHtGSVLCLqyd~--rviisGSSDsTvrvWDv~tge~l~tlihHceaVLhlr  284 (499)
T KOG0281|consen  210 IVSGLRDNTIKIWDK---NSLECLKILTGHTGSVLCLQYDE--RVIVSGSSDSTVRVWDVNTGEPLNTLIHHCEAVLHLR  284 (499)
T ss_pred             hhcccccCceEEecc---ccHHHHHhhhcCCCcEEeeeccc--eEEEecCCCceEEEEeccCCchhhHHhhhcceeEEEE
Confidence            447888888888888   5443 344589999999999974  5899999999999999999999999999999999999


Q ss_pred             EccCCCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCccccc
Q 022074           89 FGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNL  168 (303)
Q Consensus        89 ~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~  168 (303)
                      |+   ..+++|+|+|.++++||..... .......+.||..+|+.++|+  ++|+++++.|++|++|++....       
T Consensus       285 f~---ng~mvtcSkDrsiaVWdm~sps-~it~rrVLvGHrAaVNvVdfd--~kyIVsASgDRTikvW~~st~e-------  351 (499)
T KOG0281|consen  285 FS---NGYMVTCSKDRSIAVWDMASPT-DITLRRVLVGHRAAVNVVDFD--DKYIVSASGDRTIKVWSTSTCE-------  351 (499)
T ss_pred             Ee---CCEEEEecCCceeEEEeccCch-HHHHHHHHhhhhhheeeeccc--cceEEEecCCceEEEEecccee-------
Confidence            95   3489999999999999986432 223345678999999999984  5699999999999999986421       


Q ss_pred             CccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEe
Q 022074          169 GFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAAL  248 (303)
Q Consensus       169 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~  248 (303)
                                                 .+.++.||..-+.++      .  ..++++++|++|.+|++||+..|.++..+
T Consensus       352 ---------------------------fvRtl~gHkRGIACl------Q--Yr~rlvVSGSSDntIRlwdi~~G~cLRvL  396 (499)
T KOG0281|consen  352 ---------------------------FVRTLNGHKRGIACL------Q--YRDRLVVSGSSDNTIRLWDIECGACLRVL  396 (499)
T ss_pred             ---------------------------eehhhhcccccceeh------h--ccCeEEEecCCCceEEEEeccccHHHHHH
Confidence                                       133344443222221      1  35799999999999999999999999999


Q ss_pred             ecCCCCeEEEEECCCCCeEEEEeCCCCEEEeecCCC
Q 022074          249 KYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFPGN  284 (303)
Q Consensus       249 ~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~~~  284 (303)
                      ++|+.-|.++.|  |.+.++||+.||+|++||+...
T Consensus       397 eGHEeLvRciRF--d~krIVSGaYDGkikvWdl~aa  430 (499)
T KOG0281|consen  397 EGHEELVRCIRF--DNKRIVSGAYDGKIKVWDLQAA  430 (499)
T ss_pred             hchHHhhhheee--cCceeeeccccceEEEEecccc
Confidence            999999999999  5678999999999999998754


No 13 
>KOG0263 consensus Transcription initiation factor TFIID, subunit TAF5 (also component of histone acetyltransferase SAGA) [Transcription]
Probab=99.97  E-value=2.7e-30  Score=236.34  Aligned_cols=200  Identities=24%  Similarity=0.425  Sum_probs=169.0

Q ss_pred             cceEEEEEcCCCCEEEEeeCCCeEEEEECCCCc-------------------------------eEEEEecccCCeEEEE
Q 022074           40 FGIFSLKFSTDGRELVAGSSDDCIYVYDLEANK-------------------------------LSLRILAHTSDVNTVC   88 (303)
Q Consensus        40 ~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~-------------------------------~~~~~~~h~~~v~~l~   88 (303)
                      .++.|..|++|++.+|.|-.|..|++|.+...+                               ...++.+|.++|..+.
T Consensus       379 ~~v~ca~fSddssmlA~Gf~dS~i~~~Sl~p~kl~~lk~~~~l~~~d~~sad~~~~~~D~~~~~~~~~L~GH~GPVyg~s  458 (707)
T KOG0263|consen  379 QGVTCAEFSDDSSMLACGFVDSSVRVWSLTPKKLKKLKDASDLSNIDTESADVDVDMLDDDSSGTSRTLYGHSGPVYGCS  458 (707)
T ss_pred             CcceeEeecCCcchhhccccccEEEEEecchhhhccccchhhhccccccccchhhhhccccCCceeEEeecCCCceeeee
Confidence            369999999999999999999999999987421                               2245789999999999


Q ss_pred             EccCCCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCccccc
Q 022074           89 FGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNL  168 (303)
Q Consensus        89 ~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~  168 (303)
                      |+|+ .++|+++|+|++||+|.++.    ......+.||..+|+.+.|+|.|-||||+|.|++.++|......       
T Consensus       459 FsPd-~rfLlScSED~svRLWsl~t----~s~~V~y~GH~~PVwdV~F~P~GyYFatas~D~tArLWs~d~~~-------  526 (707)
T KOG0263|consen  459 FSPD-RRFLLSCSEDSSVRLWSLDT----WSCLVIYKGHLAPVWDVQFAPRGYYFATASHDQTARLWSTDHNK-------  526 (707)
T ss_pred             eccc-ccceeeccCCcceeeeeccc----ceeEEEecCCCcceeeEEecCCceEEEecCCCceeeeeecccCC-------
Confidence            9864 78999999999999999863    34455688999999999999999999999999999999865311       


Q ss_pred             CccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEe
Q 022074          169 GFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAAL  248 (303)
Q Consensus       169 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~  248 (303)
                                                 +...+.||-....+.      .|+|+..|++|||.|.++|+||+.+|..+..|
T Consensus       527 ---------------------------PlRifaghlsDV~cv------~FHPNs~Y~aTGSsD~tVRlWDv~~G~~VRiF  573 (707)
T KOG0263|consen  527 ---------------------------PLRIFAGHLSDVDCV------SFHPNSNYVATGSSDRTVRLWDVSTGNSVRIF  573 (707)
T ss_pred             ---------------------------chhhhcccccccceE------EECCcccccccCCCCceEEEEEcCCCcEEEEe
Confidence                                       122333333222222      37788999999999999999999999999999


Q ss_pred             ecCCCCeEEEEECCCCCeEEEEeCCCCEEEeecCCC
Q 022074          249 KYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFPGN  284 (303)
Q Consensus       249 ~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~~~  284 (303)
                      .+|.++|++++|||+|++||||++||.|++||++..
T Consensus       574 ~GH~~~V~al~~Sp~Gr~LaSg~ed~~I~iWDl~~~  609 (707)
T KOG0263|consen  574 TGHKGPVTALAFSPCGRYLASGDEDGLIKIWDLANG  609 (707)
T ss_pred             cCCCCceEEEEEcCCCceEeecccCCcEEEEEcCCC
Confidence            999999999999999999999999999999999864


No 14 
>KOG0266 consensus WD40 repeat-containing protein [General function prediction only]
Probab=99.97  E-value=1.6e-29  Score=233.47  Aligned_cols=261  Identities=33%  Similarity=0.548  Sum_probs=202.5

Q ss_pred             cCchhhccccccccccCcC-cccccCCCcccceEEEEEcCCCCEEEEeeCCCeEEEEEC-CCCceEEEEecccCCeEEEE
Q 022074           11 GSGTMESLANVTEIHDGLD-FSAADDGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDL-EANKLSLRILAHTSDVNTVC   88 (303)
Q Consensus        11 ~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~-~~~~~~~~~~~h~~~v~~l~   88 (303)
                      .+++.+-++.+|..-++.. . .....||...|.+++|+|+|+++++++.|++++|||+ ..+.....+.+|...|++++
T Consensus       175 ~~~~~~~~i~~~~~~~~~~~~-~~~l~~h~~~v~~~~fs~d~~~l~s~s~D~tiriwd~~~~~~~~~~l~gH~~~v~~~~  253 (456)
T KOG0266|consen  175 AAASSDGLIRIWKLEGIKSNL-LRELSGHTRGVSDVAFSPDGSYLLSGSDDKTLRIWDLKDDGRNLKTLKGHSTYVTSVA  253 (456)
T ss_pred             EEccCCCcEEEeecccccchh-hccccccccceeeeEECCCCcEEEEecCCceEEEeeccCCCeEEEEecCCCCceEEEE
Confidence            3455666677777633331 2 2334899999999999999999999999999999999 44577788899999999999


Q ss_pred             EccCCCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCC--ccc
Q 022074           89 FGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSN--ASC  166 (303)
Q Consensus        89 ~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~--~~~  166 (303)
                      |+++ ++++++|+.|++|++||++.    ++..+.+.+|.+.|++++|++++++|++++.|+.|++||+......  ...
T Consensus       254 f~p~-g~~i~Sgs~D~tvriWd~~~----~~~~~~l~~hs~~is~~~f~~d~~~l~s~s~d~~i~vwd~~~~~~~~~~~~  328 (456)
T KOG0266|consen  254 FSPD-GNLLVSGSDDGTVRIWDVRT----GECVRKLKGHSDGISGLAFSPDGNLLVSASYDGTIRVWDLETGSKLCLKLL  328 (456)
T ss_pred             ecCC-CCEEEEecCCCcEEEEeccC----CeEEEeeeccCCceEEEEECCCCCEEEEcCCCccEEEEECCCCceeeeecc
Confidence            9876 59999999999999999873    5677889999999999999999999999999999999999875511  111


Q ss_pred             ccCccceeeeceeeeCCCCCccccCCCC------------CcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeE
Q 022074          167 NLGFRSYEWDYRWMDYPPQARDLKHPCD------------QSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCV  234 (303)
Q Consensus       167 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~------------~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i  234 (303)
                      .... ... ......+.++...+.....            .....+.+|...   ..|.+.+.++++++++++|+.|+.|
T Consensus       329 ~~~~-~~~-~~~~~~fsp~~~~ll~~~~d~~~~~w~l~~~~~~~~~~~~~~~---~~~~~~~~~~~~~~~i~sg~~d~~v  403 (456)
T KOG0266|consen  329 SGAE-NSA-PVTSVQFSPNGKYLLSASLDRTLKLWDLRSGKSVGTYTGHSNL---VRCIFSPTLSTGGKLIYSGSEDGSV  403 (456)
T ss_pred             cCCC-CCC-ceeEEEECCCCcEEEEecCCCeEEEEEccCCcceeeecccCCc---ceeEecccccCCCCeEEEEeCCceE
Confidence            0000 000 1233344444444433222            223334444432   2566777778899999999999999


Q ss_pred             EEEECCCCeEEEEeecC-CCCeEEEEECCCCCeEEEEe--CCCCEEEeecC
Q 022074          235 YVYDLVSGEQVAALKYH-TSPVRDCSWHPSQPMLVSSS--WDGDVVRWEFP  282 (303)
Q Consensus       235 ~iwd~~~~~~~~~~~~h-~~~I~~v~~sp~~~~las~s--~Dg~i~~Wd~~  282 (303)
                      ++||..++..+..+.+| ...+..++|+|..+++++++  .|+.+++|..+
T Consensus       404 ~~~~~~s~~~~~~l~~h~~~~~~~~~~~~~~~~~~s~s~~~d~~~~~w~~~  454 (456)
T KOG0266|consen  404 YVWDSSSGGILQRLEGHSKAAVSDLSSHPTENLIASSSFEGDGLIRLWKYD  454 (456)
T ss_pred             EEEeCCccchhhhhcCCCCCceeccccCCCcCeeeecCcCCCceEEEecCC
Confidence            99999999989999999 89999999999999999999  78999999854


No 15 
>KOG0295 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.97  E-value=3.6e-30  Score=217.06  Aligned_cols=249  Identities=23%  Similarity=0.375  Sum_probs=197.7

Q ss_pred             eEEEE--EccCchhhccccccccccCcCcccccCCCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCC-ceEEEEecc
Q 022074            4 IVHIV--DVGSGTMESLANVTEIHDGLDFSAADDGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEAN-KLSLRILAH   80 (303)
Q Consensus         4 ~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~-~~~~~~~~h   80 (303)
                      |.||.  ++-++|-|+.|-||+.-+|.=  .....||+.+|.+|+|+..|+++++++.|-.+++||..+. +....+.+|
T Consensus       115 ~~hp~~~~v~~as~d~tikv~D~~tg~~--e~~LrGHt~sv~di~~~a~Gk~l~tcSsDl~~~LWd~~~~~~c~ks~~gh  192 (406)
T KOG0295|consen  115 IFHPSEALVVSASEDATIKVFDTETGEL--ERSLRGHTDSVFDISFDASGKYLATCSSDLSAKLWDFDTFFRCIKSLIGH  192 (406)
T ss_pred             eeccCceEEEEecCCceEEEEEccchhh--hhhhhccccceeEEEEecCccEEEecCCccchhheeHHHHHHHHHHhcCc
Confidence            44554  456777799999999866643  5577999999999999999999999999999999999773 334557789


Q ss_pred             cCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccc
Q 022074           81 TSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKM  160 (303)
Q Consensus        81 ~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~  160 (303)
                      +..|.+++|.|. ++.++|++.|.+|+.|+..    ++..+..+.+|.+-|..+....||..+++++.|.++++|-+...
T Consensus       193 ~h~vS~V~f~P~-gd~ilS~srD~tik~We~~----tg~cv~t~~~h~ewvr~v~v~~DGti~As~s~dqtl~vW~~~t~  267 (406)
T KOG0295|consen  193 EHGVSSVFFLPL-GDHILSCSRDNTIKAWECD----TGYCVKTFPGHSEWVRMVRVNQDGTIIASCSNDQTLRVWVVATK  267 (406)
T ss_pred             ccceeeEEEEec-CCeeeecccccceeEEecc----cceeEEeccCchHhEEEEEecCCeeEEEecCCCceEEEEEeccc
Confidence            999999999765 7899999999999999975    45567789999999999999999999999999999999987543


Q ss_pred             cCCcccccCccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECC
Q 022074          161 SSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLV  240 (303)
Q Consensus       161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~  240 (303)
                      .    |...++..+..++...+.|....-      .+....+.               ...++++.+++.|++|++||+.
T Consensus       268 ~----~k~~lR~hEh~vEci~wap~~~~~------~i~~at~~---------------~~~~~~l~s~SrDktIk~wdv~  322 (406)
T KOG0295|consen  268 Q----CKAELREHEHPVECIAWAPESSYP------SISEATGS---------------TNGGQVLGSGSRDKTIKIWDVS  322 (406)
T ss_pred             h----hhhhhhccccceEEEEecccccCc------chhhccCC---------------CCCccEEEeecccceEEEEecc
Confidence            2    223344444444433332221100      00000000               0136789999999999999999


Q ss_pred             CCeEEEEeecCCCCeEEEEECCCCCeEEEEeCCCCEEEeecCCC
Q 022074          241 SGEQVAALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFPGN  284 (303)
Q Consensus       241 ~~~~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~~~  284 (303)
                      ++.++.++.+|..+|.+++|||.|+||+|+.+|+++++||++..
T Consensus       323 tg~cL~tL~ghdnwVr~~af~p~Gkyi~ScaDDktlrvwdl~~~  366 (406)
T KOG0295|consen  323 TGMCLFTLVGHDNWVRGVAFSPGGKYILSCADDKTLRVWDLKNL  366 (406)
T ss_pred             CCeEEEEEecccceeeeeEEcCCCeEEEEEecCCcEEEEEeccc
Confidence            99999999999999999999999999999999999999998643


No 16 
>KOG0285 consensus Pleiotropic regulator 1 [RNA processing and modification]
Probab=99.97  E-value=9.9e-30  Score=214.10  Aligned_cols=203  Identities=29%  Similarity=0.505  Sum_probs=176.5

Q ss_pred             CCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCcccc
Q 022074           36 GGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLN  115 (303)
Q Consensus        36 ~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~  115 (303)
                      .||...|.|+++.|-.+++++|+.|+++.|||+.+|++...+.+|-.-|..+++++- -.++++++.|++|+-||+.   
T Consensus       148 ~gHlgWVr~vavdP~n~wf~tgs~DrtikIwDlatg~LkltltGhi~~vr~vavS~r-HpYlFs~gedk~VKCwDLe---  223 (460)
T KOG0285|consen  148 SGHLGWVRSVAVDPGNEWFATGSADRTIKIWDLATGQLKLTLTGHIETVRGVAVSKR-HPYLFSAGEDKQVKCWDLE---  223 (460)
T ss_pred             hhccceEEEEeeCCCceeEEecCCCceeEEEEcccCeEEEeecchhheeeeeeeccc-CceEEEecCCCeeEEEech---
Confidence            699999999999999999999999999999999999999999999999999999754 4588999999999999986   


Q ss_pred             CCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCC
Q 022074          116 VKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQ  195 (303)
Q Consensus       116 ~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  195 (303)
                       ..+.++.+.||..+|.+++..|.-..|+|||+|.++|+||+|...                                  
T Consensus       224 -~nkvIR~YhGHlS~V~~L~lhPTldvl~t~grDst~RvWDiRtr~----------------------------------  268 (460)
T KOG0285|consen  224 -YNKVIRHYHGHLSGVYCLDLHPTLDVLVTGGRDSTIRVWDIRTRA----------------------------------  268 (460)
T ss_pred             -hhhhHHHhccccceeEEEeccccceeEEecCCcceEEEeeecccc----------------------------------
Confidence             346788899999999999999988899999999999999998522                                  


Q ss_pred             cceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEECCCCCeEEEEeCCCC
Q 022074          196 SVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQPMLVSSSWDGD  275 (303)
Q Consensus       196 ~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~  275 (303)
                      .+..+.||......+.|  .    +-...+++|+.|++|++||+..|+.+.++..|...|.+++.+|....+||+|.| +
T Consensus       269 ~V~~l~GH~~~V~~V~~--~----~~dpqvit~S~D~tvrlWDl~agkt~~tlt~hkksvral~lhP~e~~fASas~d-n  341 (460)
T KOG0285|consen  269 SVHVLSGHTNPVASVMC--Q----PTDPQVITGSHDSTVRLWDLRAGKTMITLTHHKKSVRALCLHPKENLFASASPD-N  341 (460)
T ss_pred             eEEEecCCCCcceeEEe--e----cCCCceEEecCCceEEEeeeccCceeEeeecccceeeEEecCCchhhhhccCCc-c
Confidence            24456666654433322  1    223458999999999999999999999999999999999999999999999987 7


Q ss_pred             EEEeecCCC
Q 022074          276 VVRWEFPGN  284 (303)
Q Consensus       276 i~~Wd~~~~  284 (303)
                      |+-|+++..
T Consensus       342 ik~w~~p~g  350 (460)
T KOG0285|consen  342 IKQWKLPEG  350 (460)
T ss_pred             ceeccCCcc
Confidence            899998743


No 17 
>KOG0284 consensus Polyadenylation factor I complex, subunit PFS2 [RNA processing and modification]
Probab=99.97  E-value=2.1e-30  Score=221.21  Aligned_cols=224  Identities=26%  Similarity=0.425  Sum_probs=180.7

Q ss_pred             EEccCchhh-ccccccccccCcCcccccCCCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEec-ccCCeE
Q 022074            8 VDVGSGTME-SLANVTEIHDGLDFSAADDGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILA-HTSDVN   85 (303)
Q Consensus         8 ~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~-h~~~v~   85 (303)
                      |.+|+.|=+ .||+-    +.-+|++ ..++|+.+|.++.|+++|.++++|+.+|.|+.|+....... .+.+ |...|.
T Consensus       111 Lltgs~SGEFtLWNg----~~fnFEt-ilQaHDs~Vr~m~ws~~g~wmiSgD~gG~iKyWqpnmnnVk-~~~ahh~eaIR  184 (464)
T KOG0284|consen  111 LLTGSQSGEFTLWNG----TSFNFET-ILQAHDSPVRTMKWSHNGTWMISGDKGGMIKYWQPNMNNVK-IIQAHHAEAIR  184 (464)
T ss_pred             eEeecccccEEEecC----ceeeHHH-HhhhhcccceeEEEccCCCEEEEcCCCceEEecccchhhhH-HhhHhhhhhhh
Confidence            456665555 34433    2224433 34899999999999999999999999999999988766543 3444 448999


Q ss_pred             EEEEccCCCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcc
Q 022074           86 TVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNAS  165 (303)
Q Consensus        86 ~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~  165 (303)
                      +++|+| ++..|+|+|.||+|+|||....    +....+.||.-.|.+++|+|.-.+++++|.|..|++||.|...    
T Consensus       185 dlafSp-nDskF~t~SdDg~ikiWdf~~~----kee~vL~GHgwdVksvdWHP~kgLiasgskDnlVKlWDprSg~----  255 (464)
T KOG0284|consen  185 DLAFSP-NDSKFLTCSDDGTIKIWDFRMP----KEERVLRGHGWDVKSVDWHPTKGLIASGSKDNLVKLWDPRSGS----  255 (464)
T ss_pred             eeccCC-CCceeEEecCCCeEEEEeccCC----chhheeccCCCCcceeccCCccceeEEccCCceeEeecCCCcc----
Confidence            999987 5778999999999999997632    3455678999999999999999999999999999999988543    


Q ss_pred             cccCccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEE
Q 022074          166 CNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQV  245 (303)
Q Consensus       166 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~  245 (303)
                                                    +++++.+|....  +    ...|++++.+|+|+|.|..++++|+++.+.+
T Consensus       256 ------------------------------cl~tlh~HKntV--l----~~~f~~n~N~Llt~skD~~~kv~DiR~mkEl  299 (464)
T KOG0284|consen  256 ------------------------------CLATLHGHKNTV--L----AVKFNPNGNWLLTGSKDQSCKVFDIRTMKEL  299 (464)
T ss_pred             ------------------------------hhhhhhhccceE--E----EEEEcCCCCeeEEccCCceEEEEehhHhHHH
Confidence                                          334444444322  2    2336678899999999999999999998999


Q ss_pred             EEeecCCCCeEEEEECCCC-CeEEEEeCCCCEEEeecC
Q 022074          246 AALKYHTSPVRDCSWHPSQ-PMLVSSSWDGDVVRWEFP  282 (303)
Q Consensus       246 ~~~~~h~~~I~~v~~sp~~-~~las~s~Dg~i~~Wd~~  282 (303)
                      .++.+|+..+++++|+|-. .+|.+|+.||.+..|.+.
T Consensus       300 ~~~r~Hkkdv~~~~WhP~~~~lftsgg~Dgsvvh~~v~  337 (464)
T KOG0284|consen  300 FTYRGHKKDVTSLTWHPLNESLFTSGGSDGSVVHWVVG  337 (464)
T ss_pred             HHhhcchhhheeeccccccccceeeccCCCceEEEecc
Confidence            9999999999999999965 589999999999999987


No 18 
>KOG0273 consensus Beta-transducin family (WD-40 repeat) protein [Chromatin structure and dynamics]
Probab=99.97  E-value=1.6e-28  Score=213.11  Aligned_cols=221  Identities=22%  Similarity=0.336  Sum_probs=176.8

Q ss_pred             cccccCcCcccccCCCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEec
Q 022074           22 TEIHDGLDFSAADDGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGS  101 (303)
Q Consensus        22 ~~~~~~~~~~~~~~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s  101 (303)
                      .+||+=.-......+-|+.||+++.|+.+|.+|++++.|+++.|||..++.....+.-|..+--.+.|..  .+-|++++
T Consensus       259 ~riw~~~G~l~~tl~~HkgPI~slKWnk~G~yilS~~vD~ttilwd~~~g~~~q~f~~~s~~~lDVdW~~--~~~F~ts~  336 (524)
T KOG0273|consen  259 ARIWNKDGNLISTLGQHKGPIFSLKWNKKGTYILSGGVDGTTILWDAHTGTVKQQFEFHSAPALDVDWQS--NDEFATSS  336 (524)
T ss_pred             EEEEecCchhhhhhhccCCceEEEEEcCCCCEEEeccCCccEEEEeccCceEEEeeeeccCCccceEEec--CceEeecC
Confidence            3444333333456789999999999999999999999999999999999987777777877767788953  45799999


Q ss_pred             CCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeee
Q 022074          102 DDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMD  181 (303)
Q Consensus       102 ~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~  181 (303)
                      .|+.++++.+.    ..+|...+.||...|.++.|.|.+.+|+|++.|++++||.+......                  
T Consensus       337 td~~i~V~kv~----~~~P~~t~~GH~g~V~alk~n~tg~LLaS~SdD~TlkiWs~~~~~~~------------------  394 (524)
T KOG0273|consen  337 TDGCIHVCKVG----EDRPVKTFIGHHGEVNALKWNPTGSLLASCSDDGTLKIWSMGQSNSV------------------  394 (524)
T ss_pred             CCceEEEEEec----CCCcceeeecccCceEEEEECCCCceEEEecCCCeeEeeecCCCcch------------------
Confidence            99999999875    45688899999999999999999999999999999999986532211                  


Q ss_pred             CCCCCccccCCCCCcceEEecccceeeeEEEeeeee-----eeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeE
Q 022074          182 YPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPV-----YSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVR  256 (303)
Q Consensus       182 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~  256 (303)
                                      ..+.+|..  .+....++|.     .+..+..+++++.|+++++||+..+.++..|..|+.||.
T Consensus       395 ----------------~~l~~Hsk--ei~t~~wsp~g~v~~n~~~~~~l~sas~dstV~lwdv~~gv~i~~f~kH~~pVy  456 (524)
T KOG0273|consen  395 ----------------HDLQAHSK--EIYTIKWSPTGPVTSNPNMNLMLASASFDSTVKLWDVESGVPIHTLMKHQEPVY  456 (524)
T ss_pred             ----------------hhhhhhcc--ceeeEeecCCCCccCCCcCCceEEEeecCCeEEEEEccCCceeEeeccCCCceE
Confidence                            11111111  1111122221     123467799999999999999999999999999999999


Q ss_pred             EEEECCCCCeEEEEeCCCCEEEeecCCC
Q 022074          257 DCSWHPSQPMLVSSSWDGDVVRWEFPGN  284 (303)
Q Consensus       257 ~v~~sp~~~~las~s~Dg~i~~Wd~~~~  284 (303)
                      +++|||+++++|+|+.||.+.+|+.+..
T Consensus       457 svafS~~g~ylAsGs~dg~V~iws~~~~  484 (524)
T KOG0273|consen  457 SVAFSPNGRYLASGSLDGCVHIWSTKTG  484 (524)
T ss_pred             EEEecCCCcEEEecCCCCeeEeccccch
Confidence            9999999999999999999999997643


No 19 
>KOG0265 consensus U5 snRNP-specific protein-like factor and related proteins [RNA processing and modification]
Probab=99.97  E-value=1.3e-28  Score=203.11  Aligned_cols=233  Identities=28%  Similarity=0.414  Sum_probs=177.0

Q ss_pred             cCchhhccccccccccCcCcccccCCCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEc
Q 022074           11 GSGTMESLANVTEIHDGLDFSAADDGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFG   90 (303)
Q Consensus        11 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~   90 (303)
                      -+|.||+.|-.|.. .|...-+-...||+.+|..+.|.+|++.++++|.|.+|+.||+++|+...++..|...|+.+...
T Consensus        63 aSgG~Dr~I~LWnv-~gdceN~~~lkgHsgAVM~l~~~~d~s~i~S~gtDk~v~~wD~~tG~~~rk~k~h~~~vNs~~p~  141 (338)
T KOG0265|consen   63 ASGGSDRAIVLWNV-YGDCENFWVLKGHSGAVMELHGMRDGSHILSCGTDKTVRGWDAETGKRIRKHKGHTSFVNSLDPS  141 (338)
T ss_pred             eecCCcceEEEEec-cccccceeeeccccceeEeeeeccCCCEEEEecCCceEEEEecccceeeehhccccceeeecCcc
Confidence            46788887777765 23322244557999999999999999999999999999999999999999999999999999865


Q ss_pred             cCCCcEEEEecCCCeEEEEcCccccCCC-------------------------------------ccceeecccccCeEE
Q 022074           91 DESGHLIYSGSDDNLCKVWDRRCLNVKG-------------------------------------KPAGVLMGHLEGITF  133 (303)
Q Consensus        91 ~~~~~~l~s~s~dg~v~lWd~~~~~~~~-------------------------------------~~~~~~~~h~~~v~~  133 (303)
                      .-...++.|++.|++++|||.|......                                     ...-.+.||.+.|+.
T Consensus       142 rrg~~lv~SgsdD~t~kl~D~R~k~~~~t~~~kyqltAv~f~d~s~qv~sggIdn~ikvWd~r~~d~~~~lsGh~DtIt~  221 (338)
T KOG0265|consen  142 RRGPQLVCSGSDDGTLKLWDIRKKEAIKTFENKYQLTAVGFKDTSDQVISGGIDNDIKVWDLRKNDGLYTLSGHADTITG  221 (338)
T ss_pred             ccCCeEEEecCCCceEEEEeecccchhhccccceeEEEEEecccccceeeccccCceeeeccccCcceEEeecccCceee
Confidence            4456678899999999999987211000                                     011123456666666


Q ss_pred             EEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCcceEEecc--cceeeeEE
Q 022074          134 IDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGH--SVLRTLIR  211 (303)
Q Consensus       134 ~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~  211 (303)
                      ++.+++|.++++.+-|.++++||+|..                              ++..+++..+.|+  .+...+++
T Consensus       222 lsls~~gs~llsnsMd~tvrvwd~rp~------------------------------~p~~R~v~if~g~~hnfeknlL~  271 (338)
T KOG0265|consen  222 LSLSRYGSFLLSNSMDNTVRVWDVRPF------------------------------APSQRCVKIFQGHIHNFEKNLLK  271 (338)
T ss_pred             EEeccCCCccccccccceEEEEEeccc------------------------------CCCCceEEEeecchhhhhhhcce
Confidence            666666666666666666666665531                              1223345555554  33345566


Q ss_pred             EeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEECCCCCeEEEEeCCCCEEE
Q 022074          212 CHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVR  278 (303)
Q Consensus       212 ~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~  278 (303)
                      |.+    +|+++.+..|+.|+.+++||...+..++.+.+|.+.|++++|+|..++|.+++.|.+|.+
T Consensus       272 csw----sp~~~~i~ags~dr~vyvwd~~~r~~lyklpGh~gsvn~~~Fhp~e~iils~~sdk~i~l  334 (338)
T KOG0265|consen  272 CSW----SPNGTKITAGSADRFVYVWDTTSRRILYKLPGHYGSVNEVDFHPTEPIILSCSSDKTIYL  334 (338)
T ss_pred             eec----cCCCCccccccccceEEEeecccccEEEEcCCcceeEEEeeecCCCcEEEEeccCceeEe
Confidence            664    457888999999999999999999999999999999999999999999999999999986


No 20 
>KOG0284 consensus Polyadenylation factor I complex, subunit PFS2 [RNA processing and modification]
Probab=99.96  E-value=1.1e-29  Score=216.77  Aligned_cols=201  Identities=21%  Similarity=0.353  Sum_probs=167.4

Q ss_pred             CcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccC
Q 022074           37 GYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNV  116 (303)
Q Consensus        37 ~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~  116 (303)
                      --+.+|..+.|.|+|+.|++|+..|-+.||+..+-.....+..|+..|.++.|++ ++.+++||+.+|.|++|+..    
T Consensus        94 Kvkc~V~~v~WtPeGRRLltgs~SGEFtLWNg~~fnFEtilQaHDs~Vr~m~ws~-~g~wmiSgD~gG~iKyWqpn----  168 (464)
T KOG0284|consen   94 KVKCPVNVVRWTPEGRRLLTGSQSGEFTLWNGTSFNFETILQAHDSPVRTMKWSH-NGTWMISGDKGGMIKYWQPN----  168 (464)
T ss_pred             ccccceeeEEEcCCCceeEeecccccEEEecCceeeHHHHhhhhcccceeEEEcc-CCCEEEEcCCCceEEecccc----
Confidence            3457899999999999999999999999998855544445678999999999975 58899999999999999853    


Q ss_pred             CCccceeecccc-cCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCC
Q 022074          117 KGKPAGVLMGHL-EGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQ  195 (303)
Q Consensus       117 ~~~~~~~~~~h~-~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  195 (303)
                       ...+..++.|. ++|++++|+|.+..|+|++.|++|+|||.+..++..                               
T Consensus       169 -mnnVk~~~ahh~eaIRdlafSpnDskF~t~SdDg~ikiWdf~~~kee~-------------------------------  216 (464)
T KOG0284|consen  169 -MNNVKIIQAHHAEAIRDLAFSPNDSKFLTCSDDGTIKIWDFRMPKEER-------------------------------  216 (464)
T ss_pred             -hhhhHHhhHhhhhhhheeccCCCCceeEEecCCCeEEEEeccCCchhh-------------------------------
Confidence             12244455555 899999999999999999999999999987543211                               


Q ss_pred             cceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEECCCCCeEEEEeCCCC
Q 022074          196 SVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQPMLVSSSWDGD  275 (303)
Q Consensus       196 ~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~  275 (303)
                         .+.||.-..+.+      +++|...++|+||.|..|++||.++++++.++-+|+..|..+.|+|++++|+|+|.|..
T Consensus       217 ---vL~GHgwdVksv------dWHP~kgLiasgskDnlVKlWDprSg~cl~tlh~HKntVl~~~f~~n~N~Llt~skD~~  287 (464)
T KOG0284|consen  217 ---VLRGHGWDVKSV------DWHPTKGLIASGSKDNLVKLWDPRSGSCLATLHGHKNTVLAVKFNPNGNWLLTGSKDQS  287 (464)
T ss_pred             ---eeccCCCCccee------ccCCccceeEEccCCceeEeecCCCcchhhhhhhccceEEEEEEcCCCCeeEEccCCce
Confidence               123333222222      35567789999999999999999999999999999999999999999999999999999


Q ss_pred             EEEeecCC
Q 022074          276 VVRWEFPG  283 (303)
Q Consensus       276 i~~Wd~~~  283 (303)
                      ++++|+..
T Consensus       288 ~kv~DiR~  295 (464)
T KOG0284|consen  288 CKVFDIRT  295 (464)
T ss_pred             EEEEehhH
Confidence            99999873


No 21 
>KOG0319 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=99.96  E-value=2.1e-29  Score=228.77  Aligned_cols=230  Identities=23%  Similarity=0.378  Sum_probs=187.6

Q ss_pred             CchhhccccccccccCcCc---ccccCCCcccceEEEEEcCCC-CEEEEeeCCCeEEEEECCCCce-----E----EEEe
Q 022074           12 SGTMESLANVTEIHDGLDF---SAADDGGYSFGIFSLKFSTDG-RELVAGSSDDCIYVYDLEANKL-----S----LRIL   78 (303)
Q Consensus        12 ~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~v~~l~~s~~g-~~l~sgs~Dg~v~lwd~~~~~~-----~----~~~~   78 (303)
                      +++-|-.+-+|+. .++..   .-.-.+||+..|.+++++..+ .+++++|.|.++++|++...+.     .    ....
T Consensus       382 t~sKD~svilWr~-~~~~~~~~~~a~~~gH~~svgava~~~~~asffvsvS~D~tlK~W~l~~s~~~~~~~~~~~~~t~~  460 (775)
T KOG0319|consen  382 TGSKDKSVILWRL-NNNCSKSLCVAQANGHTNSVGAVAGSKLGASFFVSVSQDCTLKLWDLPKSKETAFPIVLTCRYTER  460 (775)
T ss_pred             EecCCceEEEEEe-cCCcchhhhhhhhcccccccceeeecccCccEEEEecCCceEEEecCCCcccccccceehhhHHHH
Confidence            5677778888888 22211   122238999999999998654 4899999999999999977321     1    1235


Q ss_pred             cccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcc
Q 022074           79 AHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIR  158 (303)
Q Consensus        79 ~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~  158 (303)
                      .|+..|++++++| ++++++|||.|+++++|++.    ......++.||..+|+++.|++.++.++|+|.|++|+||.+.
T Consensus       461 aHdKdIN~Vaia~-ndkLiAT~SqDktaKiW~le----~~~l~~vLsGH~RGvw~V~Fs~~dq~laT~SgD~TvKIW~is  535 (775)
T KOG0319|consen  461 AHDKDINCVAIAP-NDKLIATGSQDKTAKIWDLE----QLRLLGVLSGHTRGVWCVSFSKNDQLLATCSGDKTVKIWSIS  535 (775)
T ss_pred             hhcccccceEecC-CCceEEecccccceeeeccc----CceEEEEeeCCccceEEEEeccccceeEeccCCceEEEEEec
Confidence            7999999999975 68899999999999999985    456788999999999999999999999999999999999986


Q ss_pred             cccCCcccccCccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEE
Q 022074          159 KMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYD  238 (303)
Q Consensus       159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd  238 (303)
                      ...                                  ++.++.||...  +.++.    |-.++++|++++.||.|++|+
T Consensus       536 ~fS----------------------------------ClkT~eGH~~a--Vlra~----F~~~~~qliS~~adGliKlWn  575 (775)
T KOG0319|consen  536 TFS----------------------------------CLKTFEGHTSA--VLRAS----FIRNGKQLISAGADGLIKLWN  575 (775)
T ss_pred             cce----------------------------------eeeeecCccce--eEeee----eeeCCcEEEeccCCCcEEEEe
Confidence            421                                  35566777532  33333    445789999999999999999


Q ss_pred             CCCCeEEEEeecCCCCeEEEEECCCCCeEEEEeCCCCEEEeecCCCCcc
Q 022074          239 LVSGEQVAALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFPGNGEA  287 (303)
Q Consensus       239 ~~~~~~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~~~~~~  287 (303)
                      +++.+++.++.+|++.||+++-+|...+++||+.||.|.+|.-.+..++
T Consensus       576 ikt~eC~~tlD~H~DrvWaL~~~~~~~~~~tgg~Dg~i~~wkD~Te~~~  624 (775)
T KOG0319|consen  576 IKTNECEMTLDAHNDRVWALSVSPLLDMFVTGGGDGRIIFWKDVTEEEQ  624 (775)
T ss_pred             ccchhhhhhhhhccceeEEEeecCccceeEecCCCeEEEEeecCcHHHH
Confidence            9999999999999999999999999999999999999999985544333


No 22 
>KOG0293 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.96  E-value=2.1e-28  Score=209.36  Aligned_cols=264  Identities=23%  Similarity=0.324  Sum_probs=193.6

Q ss_pred             cCchhhccccccccccCcCcc-cccCCCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEe-cccCCeEEEE
Q 022074           11 GSGTMESLANVTEIHDGLDFS-AADDGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRIL-AHTSDVNTVC   88 (303)
Q Consensus        11 ~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~-~h~~~v~~l~   88 (303)
                      .+||.|..+-+|++..-.++. --..-||..+|.-+.||||.+++++|+.|..+++||+.+|.....+. +|...+.+++
T Consensus       240 AsaSkD~Taiiw~v~~d~~~kl~~tlvgh~~~V~yi~wSPDdryLlaCg~~e~~~lwDv~tgd~~~~y~~~~~~S~~sc~  319 (519)
T KOG0293|consen  240 ASASKDSTAIIWIVVYDVHFKLKKTLVGHSQPVSYIMWSPDDRYLLACGFDEVLSLWDVDTGDLRHLYPSGLGFSVSSCA  319 (519)
T ss_pred             eeccCCceEEEEEEecCcceeeeeeeecccCceEEEEECCCCCeEEecCchHheeeccCCcchhhhhcccCcCCCcceeE
Confidence            578889999999997777643 22335999999999999999999999999999999999998654433 2346788999


Q ss_pred             EccCCCcEEEEecCCCeEEEEcCccccCCCccceeecccc-cCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccc
Q 022074           89 FGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHL-EGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCN  167 (303)
Q Consensus        89 ~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~-~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~  167 (303)
                      |.| ++..+++|+.|+++..||+..     .......+-. -.|..++..+||+++++...|..+++++..........+
T Consensus       320 W~p-Dg~~~V~Gs~dr~i~~wdlDg-----n~~~~W~gvr~~~v~dlait~Dgk~vl~v~~d~~i~l~~~e~~~dr~lis  393 (519)
T KOG0293|consen  320 WCP-DGFRFVTGSPDRTIIMWDLDG-----NILGNWEGVRDPKVHDLAITYDGKYVLLVTVDKKIRLYNREARVDRGLIS  393 (519)
T ss_pred             Ecc-CCceeEecCCCCcEEEecCCc-----chhhcccccccceeEEEEEcCCCcEEEEEecccceeeechhhhhhhcccc
Confidence            976 588999999999999999752     2233333433 348899999999999999999999999875432211111


Q ss_pred             cCccceeeec------eeeeCCCCCccccCCCC-CcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECC
Q 022074          168 LGFRSYEWDY------RWMDYPPQARDLKHPCD-QSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLV  240 (303)
Q Consensus       168 ~~~~~~~~~~------~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~  240 (303)
                      .......+.+      ......++...+....+ ..+..+.||..-..+++.+|.-   .+.+++++|++|+.|+||+-.
T Consensus       394 e~~~its~~iS~d~k~~LvnL~~qei~LWDl~e~~lv~kY~Ghkq~~fiIrSCFgg---~~~~fiaSGSED~kvyIWhr~  470 (519)
T KOG0293|consen  394 EEQPITSFSISKDGKLALVNLQDQEIHLWDLEENKLVRKYFGHKQGHFIIRSCFGG---GNDKFIASGSEDSKVYIWHRI  470 (519)
T ss_pred             ccCceeEEEEcCCCcEEEEEcccCeeEEeecchhhHHHHhhcccccceEEEeccCC---CCcceEEecCCCceEEEEEcc
Confidence            1000001110      01111122222221112 2345567777666677766543   355899999999999999999


Q ss_pred             CCeEEEEeecCCCCeEEEEECCCCC-eEEEEeCCCCEEEeecCC
Q 022074          241 SGEQVAALKYHTSPVRDCSWHPSQP-MLVSSSWDGDVVRWEFPG  283 (303)
Q Consensus       241 ~~~~~~~~~~h~~~I~~v~~sp~~~-~las~s~Dg~i~~Wd~~~  283 (303)
                      +|+++.++.+|...|++|+|+|..+ ++||||+||+|++|.+..
T Consensus       471 sgkll~~LsGHs~~vNcVswNP~~p~m~ASasDDgtIRIWg~~~  514 (519)
T KOG0293|consen  471 SGKLLAVLSGHSKTVNCVSWNPADPEMFASASDDGTIRIWGPSD  514 (519)
T ss_pred             CCceeEeecCCcceeeEEecCCCCHHHhhccCCCCeEEEecCCc
Confidence            9999999999999999999999876 899999999999998753


No 23 
>KOG0265 consensus U5 snRNP-specific protein-like factor and related proteins [RNA processing and modification]
Probab=99.96  E-value=7e-28  Score=198.79  Aligned_cols=237  Identities=23%  Similarity=0.313  Sum_probs=182.2

Q ss_pred             EEEEEccCchhhccccccccccCcCcccccCCCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCce-EEEEecccCC
Q 022074            5 VHIVDVGSGTMESLANVTEIHDGLDFSAADDGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKL-SLRILAHTSD   83 (303)
Q Consensus         5 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~-~~~~~~h~~~   83 (303)
                      |++.+-|-.+.-.++....+...++.--....||+..|+.+.|+|+|..+++|+.|..|.||++..... ...+.+|.+.
T Consensus        13 v~~a~~~~~q~s~~~~~~~rts~l~ap~m~l~gh~geI~~~~F~P~gs~~aSgG~Dr~I~LWnv~gdceN~~~lkgHsgA   92 (338)
T KOG0265|consen   13 VYPAKRGRSQISALALGKQRTSSLQAPIMLLPGHKGEIYTIKFHPDGSCFASGGSDRAIVLWNVYGDCENFWVLKGHSGA   92 (338)
T ss_pred             eEecccccccchhhhhcccccccccchhhhcCCCcceEEEEEECCCCCeEeecCCcceEEEEeccccccceeeeccccce
Confidence            445555544444555444444444332333479999999999999999999999999999999765432 3457799999


Q ss_pred             eEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCE-EEEEeCCCcEEEEEcccccC
Q 022074           84 VNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRY-LISNGKDQAIKLWDIRKMSS  162 (303)
Q Consensus        84 v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~-l~s~~~D~~v~lWdl~~~~~  162 (303)
                      |..+.|+. +++.++|++.|.+|+.||.+    +++.+..+.+|..-|+++....-|.. +.|++.|+++|+||+|+...
T Consensus        93 VM~l~~~~-d~s~i~S~gtDk~v~~wD~~----tG~~~rk~k~h~~~vNs~~p~rrg~~lv~SgsdD~t~kl~D~R~k~~  167 (338)
T KOG0265|consen   93 VMELHGMR-DGSHILSCGTDKTVRGWDAE----TGKRIRKHKGHTSFVNSLDPSRRGPQLVCSGSDDGTLKLWDIRKKEA  167 (338)
T ss_pred             eEeeeecc-CCCEEEEecCCceEEEEecc----cceeeehhccccceeeecCccccCCeEEEecCCCceEEEEeecccch
Confidence            99999975 57899999999999999976    56677788899999999886655655 56788999999999996432


Q ss_pred             CcccccCccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCC
Q 022074          163 NASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSG  242 (303)
Q Consensus       163 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~  242 (303)
                      ......                         .     +       .+.    ...|..++..+.+|+-|+.|++||++..
T Consensus       168 ~~t~~~-------------------------k-----y-------qlt----Av~f~d~s~qv~sggIdn~ikvWd~r~~  206 (338)
T KOG0265|consen  168 IKTFEN-------------------------K-----Y-------QLT----AVGFKDTSDQVISGGIDNDIKVWDLRKN  206 (338)
T ss_pred             hhcccc-------------------------c-----e-------eEE----EEEecccccceeeccccCceeeeccccC
Confidence            211100                         0     0       000    1113345677899999999999999999


Q ss_pred             eEEEEeecCCCCeEEEEECCCCCeEEEEeCCCCEEEeecCCCCcc
Q 022074          243 EQVAALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFPGNGEA  287 (303)
Q Consensus       243 ~~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~~~~~~  287 (303)
                      +.+..+++|.++|+.+..||+|.++.|-++|+++++||++.-+.+
T Consensus       207 d~~~~lsGh~DtIt~lsls~~gs~llsnsMd~tvrvwd~rp~~p~  251 (338)
T KOG0265|consen  207 DGLYTLSGHADTITGLSLSRYGSFLLSNSMDNTVRVWDVRPFAPS  251 (338)
T ss_pred             cceEEeecccCceeeEEeccCCCccccccccceEEEEEecccCCC
Confidence            999999999999999999999999999999999999998754433


No 24 
>KOG0278 consensus Serine/threonine kinase receptor-associated protein [Lipid transport and metabolism]
Probab=99.96  E-value=3.3e-29  Score=202.01  Aligned_cols=237  Identities=20%  Similarity=0.324  Sum_probs=181.1

Q ss_pred             CCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCcccc
Q 022074           36 GGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLN  115 (303)
Q Consensus        36 ~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~  115 (303)
                      -||..+|++..++.+....++++.|-+.+|||.-+|.... ...|+.-|.+++|+ .+.+.|++|+.+.-+|+||+.   
T Consensus        56 eghkgavw~~~l~~na~~aasaaadftakvw~a~tgdelh-sf~hkhivk~~af~-~ds~~lltgg~ekllrvfdln---  130 (334)
T KOG0278|consen   56 EGHKGAVWSATLNKNATRAASAAADFTAKVWDAVTGDELH-SFEHKHIVKAVAFS-QDSNYLLTGGQEKLLRVFDLN---  130 (334)
T ss_pred             eccCcceeeeecCchhhhhhhhcccchhhhhhhhhhhhhh-hhhhhheeeeEEec-ccchhhhccchHHHhhhhhcc---
Confidence            3999999999999999999999999999999999998765 34688899999996 567899999999999999975   


Q ss_pred             CCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCC
Q 022074          116 VKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQ  195 (303)
Q Consensus       116 ~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  195 (303)
                      ....+...+.+|..+|..+-|...++.|++++.|++||+||.|......+..+...     +..+++.+++..+......
T Consensus       131 ~p~App~E~~ghtg~Ir~v~wc~eD~~iLSSadd~tVRLWD~rTgt~v~sL~~~s~-----VtSlEvs~dG~ilTia~gs  205 (334)
T KOG0278|consen  131 RPKAPPKEISGHTGGIRTVLWCHEDKCILSSADDKTVRLWDHRTGTEVQSLEFNSP-----VTSLEVSQDGRILTIAYGS  205 (334)
T ss_pred             CCCCCchhhcCCCCcceeEEEeccCceEEeeccCCceEEEEeccCcEEEEEecCCC-----CcceeeccCCCEEEEecCc
Confidence            23345567889999999999999999999999999999999998765544332211     1122333333333322222


Q ss_pred             cceEEecccce-ee---eEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEe-ecCCCCeEEEEECCCCCeEEEE
Q 022074          196 SVATYKGHSVL-RT---LIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAAL-KYHTSPVRDCSWHPSQPMLVSS  270 (303)
Q Consensus       196 ~~~~~~~~~~~-~~---~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~-~~h~~~I~~v~~sp~~~~las~  270 (303)
                      .+.-++...+- .+   +..-..+...+|+...+++|++|..++.||..+++.+..+ ++|.+||.++.|||||...|+|
T Consensus       206 sV~Fwdaksf~~lKs~k~P~nV~SASL~P~k~~fVaGged~~~~kfDy~TgeEi~~~nkgh~gpVhcVrFSPdGE~yAsG  285 (334)
T KOG0278|consen  206 SVKFWDAKSFGLLKSYKMPCNVESASLHPKKEFFVAGGEDFKVYKFDYNTGEEIGSYNKGHFGPVHCVRFSPDGELYASG  285 (334)
T ss_pred             eeEEeccccccceeeccCccccccccccCCCceEEecCcceEEEEEeccCCceeeecccCCCCceEEEEECCCCceeecc
Confidence            22222222110 00   0001112345677889999999999999999999998886 8999999999999999999999


Q ss_pred             eCCCCEEEeecC
Q 022074          271 SWDGDVVRWEFP  282 (303)
Q Consensus       271 s~Dg~i~~Wd~~  282 (303)
                      |+||+|++|..-
T Consensus       286 SEDGTirlWQt~  297 (334)
T KOG0278|consen  286 SEDGTIRLWQTT  297 (334)
T ss_pred             CCCceEEEEEec
Confidence            999999999864


No 25 
>KOG0266 consensus WD40 repeat-containing protein [General function prediction only]
Probab=99.96  E-value=3.7e-27  Score=217.74  Aligned_cols=203  Identities=31%  Similarity=0.505  Sum_probs=170.8

Q ss_pred             cccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCc--eEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCcccc
Q 022074           38 YSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANK--LSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLN  115 (303)
Q Consensus        38 ~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~--~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~  115 (303)
                      |..+|.++.|+++|+.+++++.|+.+++|++.+..  +...+.+|...|+.++|++ +++++++++.|+++++||+.   
T Consensus       158 ~~~sv~~~~fs~~g~~l~~~~~~~~i~~~~~~~~~~~~~~~l~~h~~~v~~~~fs~-d~~~l~s~s~D~tiriwd~~---  233 (456)
T KOG0266|consen  158 ECPSVTCVDFSPDGRALAAASSDGLIRIWKLEGIKSNLLRELSGHTRGVSDVAFSP-DGSYLLSGSDDKTLRIWDLK---  233 (456)
T ss_pred             ccCceEEEEEcCCCCeEEEccCCCcEEEeecccccchhhccccccccceeeeEECC-CCcEEEEecCCceEEEeecc---
Confidence            37899999999999999999999999999997776  5556678999999999975 57899999999999999973   


Q ss_pred             CCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCC
Q 022074          116 VKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQ  195 (303)
Q Consensus       116 ~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  195 (303)
                      .......++.+|...|++++|+++++++++|+.|++||+||++...                                  
T Consensus       234 ~~~~~~~~l~gH~~~v~~~~f~p~g~~i~Sgs~D~tvriWd~~~~~----------------------------------  279 (456)
T KOG0266|consen  234 DDGRNLKTLKGHSTYVTSVAFSPDGNLLVSGSDDGTVRIWDVRTGE----------------------------------  279 (456)
T ss_pred             CCCeEEEEecCCCCceEEEEecCCCCEEEEecCCCcEEEEeccCCe----------------------------------
Confidence            2345677889999999999999999999999999999999987522                                  


Q ss_pred             cceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCe--EEEEeecCCC--CeEEEEECCCCCeEEEEe
Q 022074          196 SVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGE--QVAALKYHTS--PVRDCSWHPSQPMLVSSS  271 (303)
Q Consensus       196 ~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~--~~~~~~~h~~--~I~~v~~sp~~~~las~s  271 (303)
                      .+..+.+|.....      ...|++++.+|++++.|+.|++||+.+++  .+..+..+..  +++.+.|+|++.+++++.
T Consensus       280 ~~~~l~~hs~~is------~~~f~~d~~~l~s~s~d~~i~vwd~~~~~~~~~~~~~~~~~~~~~~~~~fsp~~~~ll~~~  353 (456)
T KOG0266|consen  280 CVRKLKGHSDGIS------GLAFSPDGNLLVSASYDGTIRVWDLETGSKLCLKLLSGAENSAPVTSVQFSPNGKYLLSAS  353 (456)
T ss_pred             EEEeeeccCCceE------EEEECCCCCEEEEcCCCccEEEEECCCCceeeeecccCCCCCCceeEEEECCCCcEEEEec
Confidence            2233444442221      12477899999999999999999999998  4566766655  499999999999999999


Q ss_pred             CCCCEEEeecCCC
Q 022074          272 WDGDVVRWEFPGN  284 (303)
Q Consensus       272 ~Dg~i~~Wd~~~~  284 (303)
                      .|+.+++||+...
T Consensus       354 ~d~~~~~w~l~~~  366 (456)
T KOG0266|consen  354 LDRTLKLWDLRSG  366 (456)
T ss_pred             CCCeEEEEEccCC
Confidence            9999999998743


No 26 
>KOG0645 consensus WD40 repeat protein [General function prediction only]
Probab=99.96  E-value=1.4e-26  Score=188.73  Aligned_cols=207  Identities=20%  Similarity=0.343  Sum_probs=168.9

Q ss_pred             CCCcccceEEEEEcCC-CCEEEEeeCCCeEEEEECCCCc-eEEE--E-ecccCCeEEEEEccCCCcEEEEecCCCeEEEE
Q 022074           35 DGGYSFGIFSLKFSTD-GRELVAGSSDDCIYVYDLEANK-LSLR--I-LAHTSDVNTVCFGDESGHLIYSGSDDNLCKVW  109 (303)
Q Consensus        35 ~~~~~~~v~~l~~s~~-g~~l~sgs~Dg~v~lwd~~~~~-~~~~--~-~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lW  109 (303)
                      ..||..++..++|+|- |..|++||.|..||+|++..+. ...+  + .+|+..|..++|+| .+++|++||.|.++.||
T Consensus        10 ~~gh~~r~W~~awhp~~g~ilAscg~Dk~vriw~~~~~~s~~ck~vld~~hkrsVRsvAwsp-~g~~La~aSFD~t~~Iw   88 (312)
T KOG0645|consen   10 LSGHKDRVWSVAWHPGKGVILASCGTDKAVRIWSTSSGDSWTCKTVLDDGHKRSVRSVAWSP-HGRYLASASFDATVVIW   88 (312)
T ss_pred             ecCCCCcEEEEEeccCCceEEEeecCCceEEEEecCCCCcEEEEEeccccchheeeeeeecC-CCcEEEEeeccceEEEe
Confidence            3799999999999998 8899999999999999998432 2222  1 26899999999964 68999999999999999


Q ss_pred             cCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccc
Q 022074          110 DRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDL  189 (303)
Q Consensus       110 d~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  189 (303)
                      .....+  -+....+.||...|.+++|+++|++|||+++|++|=+|.......            +              
T Consensus        89 ~k~~~e--fecv~~lEGHEnEVK~Vaws~sG~~LATCSRDKSVWiWe~deddE------------f--------------  140 (312)
T KOG0645|consen   89 KKEDGE--FECVATLEGHENEVKCVAWSASGNYLATCSRDKSVWIWEIDEDDE------------F--------------  140 (312)
T ss_pred             ecCCCc--eeEEeeeeccccceeEEEEcCCCCEEEEeeCCCeEEEEEecCCCc------------E--------------
Confidence            754222  234667889999999999999999999999999999998763221            1              


Q ss_pred             cCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCC---CeEEEEeecCCCCeEEEEECCCCCe
Q 022074          190 KHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVS---GEQVAALKYHTSPVRDCSWHPSQPM  266 (303)
Q Consensus       190 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~---~~~~~~~~~h~~~I~~v~~sp~~~~  266 (303)
                           .++..+++|....+...      ++|...+|++++.|.+|++|+-..   -+++.++.+|+..|++++|+|.|..
T Consensus       141 -----ec~aVL~~HtqDVK~V~------WHPt~dlL~S~SYDnTIk~~~~~~dddW~c~~tl~g~~~TVW~~~F~~~G~r  209 (312)
T KOG0645|consen  141 -----ECIAVLQEHTQDVKHVI------WHPTEDLLFSCSYDNTIKVYRDEDDDDWECVQTLDGHENTVWSLAFDNIGSR  209 (312)
T ss_pred             -----EEEeeeccccccccEEE------EcCCcceeEEeccCCeEEEEeecCCCCeeEEEEecCccceEEEEEecCCCce
Confidence                 13455666665444433      456678999999999999998762   2578899999999999999999999


Q ss_pred             EEEEeCCCCEEEeec
Q 022074          267 LVSSSWDGDVVRWEF  281 (303)
Q Consensus       267 las~s~Dg~i~~Wd~  281 (303)
                      |++++.|+++++|..
T Consensus       210 l~s~sdD~tv~Iw~~  224 (312)
T KOG0645|consen  210 LVSCSDDGTVSIWRL  224 (312)
T ss_pred             EEEecCCcceEeeee
Confidence            999999999999983


No 27 
>PTZ00421 coronin; Provisional
Probab=99.96  E-value=1.7e-26  Score=213.48  Aligned_cols=207  Identities=21%  Similarity=0.316  Sum_probs=157.6

Q ss_pred             CCCcccceEEEEEcC-CCCEEEEeeCCCeEEEEECCCCc-------eEEEEecccCCeEEEEEccCCCcEEEEecCCCeE
Q 022074           35 DGGYSFGIFSLKFST-DGRELVAGSSDDCIYVYDLEANK-------LSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLC  106 (303)
Q Consensus        35 ~~~~~~~v~~l~~s~-~g~~l~sgs~Dg~v~lwd~~~~~-------~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v  106 (303)
                      ..||+.+|.+++|+| +++.|++|+.|++|+|||+.++.       ....+.+|...|.+++|+|..+++|++++.|++|
T Consensus        71 l~GH~~~V~~v~fsP~d~~~LaSgS~DgtIkIWdi~~~~~~~~~~~~l~~L~gH~~~V~~l~f~P~~~~iLaSgs~DgtV  150 (493)
T PTZ00421         71 LLGQEGPIIDVAFNPFDPQKLFTASEDGTIMGWGIPEEGLTQNISDPIVHLQGHTKKVGIVSFHPSAMNVLASAGADMVV  150 (493)
T ss_pred             EeCCCCCEEEEEEcCCCCCEEEEEeCCCEEEEEecCCCccccccCcceEEecCCCCcEEEEEeCcCCCCEEEEEeCCCEE
Confidence            469999999999999 88999999999999999997653       3456788999999999987656799999999999


Q ss_pred             EEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCC
Q 022074          107 KVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQA  186 (303)
Q Consensus       107 ~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  186 (303)
                      ++||++    .+.....+.+|.+.|.+++|++++.+|++++.|++|++||++....                        
T Consensus       151 rIWDl~----tg~~~~~l~~h~~~V~sla~spdG~lLatgs~Dg~IrIwD~rsg~~------------------------  202 (493)
T PTZ00421        151 NVWDVE----RGKAVEVIKCHSDQITSLEWNLDGSLLCTTSKDKKLNIIDPRDGTI------------------------  202 (493)
T ss_pred             EEEECC----CCeEEEEEcCCCCceEEEEEECCCCEEEEecCCCEEEEEECCCCcE------------------------
Confidence            999986    3345566788999999999999999999999999999999985321                        


Q ss_pred             ccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEe----CCCeEEEEECCCCeE-EEEeecC-CCCeEEEEE
Q 022074          187 RDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGS----HDSCVYVYDLVSGEQ-VAALKYH-TSPVRDCSW  260 (303)
Q Consensus       187 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~----~dg~i~iwd~~~~~~-~~~~~~h-~~~I~~v~~  260 (303)
                                +..+.+|.... ..++.    +.+++..+++++    .|+.|++||+++.+. +.....+ ...+....|
T Consensus       203 ----------v~tl~~H~~~~-~~~~~----w~~~~~~ivt~G~s~s~Dr~VklWDlr~~~~p~~~~~~d~~~~~~~~~~  267 (493)
T PTZ00421        203 ----------VSSVEAHASAK-SQRCL----WAKRKDLIITLGCSKSQQRQIMLWDTRKMASPYSTVDLDQSSALFIPFF  267 (493)
T ss_pred             ----------EEEEecCCCCc-ceEEE----EcCCCCeEEEEecCCCCCCeEEEEeCCCCCCceeEeccCCCCceEEEEE
Confidence                      11122221100 01122    334444555543    589999999987653 4433333 345667789


Q ss_pred             CCCCCeEEEEe-CCCCEEEeecCCC
Q 022074          261 HPSQPMLVSSS-WDGDVVRWEFPGN  284 (303)
Q Consensus       261 sp~~~~las~s-~Dg~i~~Wd~~~~  284 (303)
                      ++++++|++++ .|++|++||+...
T Consensus       268 d~d~~~L~lggkgDg~Iriwdl~~~  292 (493)
T PTZ00421        268 DEDTNLLYIGSKGEGNIRCFELMNE  292 (493)
T ss_pred             cCCCCEEEEEEeCCCeEEEEEeeCC
Confidence            99999988887 5999999998743


No 28 
>KOG0291 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=99.96  E-value=4e-27  Score=214.44  Aligned_cols=208  Identities=21%  Similarity=0.336  Sum_probs=168.3

Q ss_pred             CCCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccc
Q 022074           35 DGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCL  114 (303)
Q Consensus        35 ~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~  114 (303)
                      ++||-..+.+++++|||+.+++|+.||.|+|||...+-...++..|+..|+.+.|+ ..++.++|++.||+||.||+.-.
T Consensus       346 QQgH~~~i~~l~YSpDgq~iaTG~eDgKVKvWn~~SgfC~vTFteHts~Vt~v~f~-~~g~~llssSLDGtVRAwDlkRY  424 (893)
T KOG0291|consen  346 QQGHSDRITSLAYSPDGQLIATGAEDGKVKVWNTQSGFCFVTFTEHTSGVTAVQFT-ARGNVLLSSSLDGTVRAWDLKRY  424 (893)
T ss_pred             ccccccceeeEEECCCCcEEEeccCCCcEEEEeccCceEEEEeccCCCceEEEEEE-ecCCEEEEeecCCeEEeeeeccc
Confidence            47999999999999999999999999999999999998888999999999999996 56889999999999999996310


Q ss_pred             ----------------------------------------cCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEE
Q 022074          115 ----------------------------------------NVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKL  154 (303)
Q Consensus       115 ----------------------------------------~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~l  154 (303)
                                                              -++++....+.||.++|.+++|++++..|++++.|++||+
T Consensus       425 rNfRTft~P~p~QfscvavD~sGelV~AG~~d~F~IfvWS~qTGqllDiLsGHEgPVs~l~f~~~~~~LaS~SWDkTVRi  504 (893)
T KOG0291|consen  425 RNFRTFTSPEPIQFSCVAVDPSGELVCAGAQDSFEIFVWSVQTGQLLDILSGHEGPVSGLSFSPDGSLLASGSWDKTVRI  504 (893)
T ss_pred             ceeeeecCCCceeeeEEEEcCCCCEEEeeccceEEEEEEEeecCeeeehhcCCCCcceeeEEccccCeEEeccccceEEE
Confidence                                                    0123344567899999999999999999999999999999


Q ss_pred             EEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeE
Q 022074          155 WDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCV  234 (303)
Q Consensus       155 Wdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i  234 (303)
                      ||+-....                                 .+.++.-       ..-.....|+|+|+.+|++..||.|
T Consensus       505 W~if~s~~---------------------------------~vEtl~i-------~sdvl~vsfrPdG~elaVaTldgqI  544 (893)
T KOG0291|consen  505 WDIFSSSG---------------------------------TVETLEI-------RSDVLAVSFRPDGKELAVATLDGQI  544 (893)
T ss_pred             EEeeccCc---------------------------------eeeeEee-------ccceeEEEEcCCCCeEEEEEecceE
Confidence            99732110                                 0011100       0001123478899999999999999


Q ss_pred             EEEECCCCeEEEEeec--------------------CCCCeEEEEECCCCCeEEEEeCCCCEEEeecCC
Q 022074          235 YVYDLVSGEQVAALKY--------------------HTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFPG  283 (303)
Q Consensus       235 ~iwd~~~~~~~~~~~~--------------------h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~~  283 (303)
                      .+||.+.+..+..+++                    ...+.+.+++|+||..+++||+...|.+++++.
T Consensus       545 tf~d~~~~~q~~~IdgrkD~~~gR~~~D~~ta~~sa~~K~Ftti~ySaDG~~IlAgG~sn~iCiY~v~~  613 (893)
T KOG0291|consen  545 TFFDIKEAVQVGSIDGRKDLSGGRKETDRITAENSAKGKTFTTICYSADGKCILAGGESNSICIYDVPE  613 (893)
T ss_pred             EEEEhhhceeeccccchhhccccccccceeehhhcccCCceEEEEEcCCCCEEEecCCcccEEEEECch
Confidence            9999987765544432                    235799999999999999999999999999863


No 29 
>PTZ00420 coronin; Provisional
Probab=99.96  E-value=4.7e-26  Score=211.85  Aligned_cols=236  Identities=14%  Similarity=0.178  Sum_probs=167.6

Q ss_pred             EEEEccCchhhccccccccccCcCcccccCCCcccceEEEEEcCC-CCEEEEeeCCCeEEEEECCCCc--------eEEE
Q 022074            6 HIVDVGSGTMESLANVTEIHDGLDFSAADDGGYSFGIFSLKFSTD-GRELVAGSSDDCIYVYDLEANK--------LSLR   76 (303)
Q Consensus         6 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~s~~-g~~l~sgs~Dg~v~lwd~~~~~--------~~~~   76 (303)
                      .+.+++.|.+...+.+|..-+.  .......||..+|.+++|+|+ ++.|++|+.|++|+|||+.++.        ....
T Consensus        43 ~~w~~~gGG~~gvI~L~~~~r~--~~v~~L~gH~~~V~~lafsP~~~~lLASgS~DgtIrIWDi~t~~~~~~~i~~p~~~  120 (568)
T PTZ00420         43 VPWEVEGGGLIGAIRLENQMRK--PPVIKLKGHTSSILDLQFNPCFSEILASGSEDLTIRVWEIPHNDESVKEIKDPQCI  120 (568)
T ss_pred             EEEEcCCCCceeEEEeeecCCC--ceEEEEcCCCCCEEEEEEcCCCCCEEEEEeCCCeEEEEECCCCCccccccccceEE
Confidence            3555666666666666654322  212334799999999999996 7899999999999999998642        1234


Q ss_pred             EecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEE
Q 022074           77 ILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWD  156 (303)
Q Consensus        77 ~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWd  156 (303)
                      +.+|...|.+++|+|....+|++++.|++|++||++..    .....+ .|...|.+++|+++|.+|++++.|+.|++||
T Consensus       121 L~gH~~~V~sVaf~P~g~~iLaSgS~DgtIrIWDl~tg----~~~~~i-~~~~~V~SlswspdG~lLat~s~D~~IrIwD  195 (568)
T PTZ00420        121 LKGHKKKISIIDWNPMNYYIMCSSGFDSFVNIWDIENE----KRAFQI-NMPKKLSSLKWNIKGNLLSGTCVGKHMHIID  195 (568)
T ss_pred             eecCCCcEEEEEECCCCCeEEEEEeCCCeEEEEECCCC----cEEEEE-ecCCcEEEEEECCCCCEEEEEecCCEEEEEE
Confidence            67899999999998765556789999999999998632    222233 2567899999999999999999999999999


Q ss_pred             cccccCCcccccCccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCC----
Q 022074          157 IRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDS----  232 (303)
Q Consensus       157 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg----  232 (303)
                      +|....                                  +..+.+|..... .++.+...|++++.+++++|.|+    
T Consensus       196 ~Rsg~~----------------------------------i~tl~gH~g~~~-s~~v~~~~fs~d~~~IlTtG~d~~~~R  240 (568)
T PTZ00420        196 PRKQEI----------------------------------ASSFHIHDGGKN-TKNIWIDGLGGDDNYILSTGFSKNNMR  240 (568)
T ss_pred             CCCCcE----------------------------------EEEEecccCCce-eEEEEeeeEcCCCCEEEEEEcCCCCcc
Confidence            985321                                  112223321111 11112223567788889887764    


Q ss_pred             eEEEEECCC-CeEEEEeec--CCCCeEEEEECCCCCeEEEEeCCCCEEEeecCC
Q 022074          233 CVYVYDLVS-GEQVAALKY--HTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFPG  283 (303)
Q Consensus       233 ~i~iwd~~~-~~~~~~~~~--h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~~  283 (303)
                      .|+|||+++ .+.+..+..  +.+.+.-...++++.++++|+.|++|++|++..
T Consensus       241 ~VkLWDlr~~~~pl~~~~ld~~~~~L~p~~D~~tg~l~lsGkGD~tIr~~e~~~  294 (568)
T PTZ00420        241 EMKLWDLKNTTSALVTMSIDNASAPLIPHYDESTGLIYLIGKGDGNCRYYQHSL  294 (568)
T ss_pred             EEEEEECCCCCCceEEEEecCCccceEEeeeCCCCCEEEEEECCCeEEEEEccC
Confidence            799999985 455555433  334444555566789999999999999999854


No 30 
>KOG0319 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=99.96  E-value=2.1e-27  Score=215.74  Aligned_cols=223  Identities=22%  Similarity=0.316  Sum_probs=182.9

Q ss_pred             cccCCCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCce----EEEEecccCCeEEEEEccCCCcEEEEecCCCeEE
Q 022074           32 AADDGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKL----SLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCK  107 (303)
Q Consensus        32 ~~~~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~----~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~  107 (303)
                      .+...||+-.|.+++-..+|..|++||.|.++++|.++.+..    .....+|+..|.+++.+......|+++|.|++++
T Consensus       358 c~ii~GH~e~vlSL~~~~~g~llat~sKD~svilWr~~~~~~~~~~~a~~~gH~~svgava~~~~~asffvsvS~D~tlK  437 (775)
T KOG0319|consen  358 CQIIPGHTEAVLSLDVWSSGDLLATGSKDKSVILWRLNNNCSKSLCVAQANGHTNSVGAVAGSKLGASFFVSVSQDCTLK  437 (775)
T ss_pred             eEEEeCchhheeeeeecccCcEEEEecCCceEEEEEecCCcchhhhhhhhcccccccceeeecccCccEEEEecCCceEE
Confidence            335689999999999667889999999999999999855542    2345689999999999776788999999999999


Q ss_pred             EEcCccccCCCccce-----eecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeC
Q 022074          108 VWDRRCLNVKGKPAG-----VLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDY  182 (303)
Q Consensus       108 lWd~~~~~~~~~~~~-----~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~  182 (303)
                      +|++..-..+..+..     ....|...|++++++|++.+++|||.|++.+||++...                      
T Consensus       438 ~W~l~~s~~~~~~~~~~~~~t~~aHdKdIN~Vaia~ndkLiAT~SqDktaKiW~le~~----------------------  495 (775)
T KOG0319|consen  438 LWDLPKSKETAFPIVLTCRYTERAHDKDINCVAIAPNDKLIATGSQDKTAKIWDLEQL----------------------  495 (775)
T ss_pred             EecCCCcccccccceehhhHHHHhhcccccceEecCCCceEEecccccceeeecccCc----------------------
Confidence            999863111111111     12358888999999999999999999999999998521                      


Q ss_pred             CCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEECC
Q 022074          183 PPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSWHP  262 (303)
Q Consensus       183 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp  262 (303)
                                  +...++.||.      +..|+..|++..+.++|+|.|++|+||.+.++.++++|++|+..|..+.|-.
T Consensus       496 ------------~l~~vLsGH~------RGvw~V~Fs~~dq~laT~SgD~TvKIW~is~fSClkT~eGH~~aVlra~F~~  557 (775)
T KOG0319|consen  496 ------------RLLGVLSGHT------RGVWCVSFSKNDQLLATCSGDKTVKIWSISTFSCLKTFEGHTSAVLRASFIR  557 (775)
T ss_pred             ------------eEEEEeeCCc------cceEEEEeccccceeEeccCCceEEEEEeccceeeeeecCccceeEeeeeee
Confidence                        1244666765      3445667888899999999999999999999999999999999999999999


Q ss_pred             CCCeEEEEeCCCCEEEeecCCCCccCCCCcccc
Q 022074          263 SQPMLVSSSWDGDVVRWEFPGNGEAAPPLNKKR  295 (303)
Q Consensus       263 ~~~~las~s~Dg~i~~Wd~~~~~~~~~~~~~~~  295 (303)
                      ++.+|+|++.||.+++|+++.+ ++...++.++
T Consensus       558 ~~~qliS~~adGliKlWnikt~-eC~~tlD~H~  589 (775)
T KOG0319|consen  558 NGKQLISAGADGLIKLWNIKTN-ECEMTLDAHN  589 (775)
T ss_pred             CCcEEEeccCCCcEEEEeccch-hhhhhhhhcc
Confidence            9999999999999999999765 6666666553


No 31 
>PLN00181 protein SPA1-RELATED; Provisional
Probab=99.95  E-value=3.3e-26  Score=225.40  Aligned_cols=231  Identities=22%  Similarity=0.362  Sum_probs=178.0

Q ss_pred             ccCchhhccccccccccCcCcccccCCCcccceEEEEEcC-CCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEE
Q 022074           10 VGSGTMESLANVTEIHDGLDFSAADDGGYSFGIFSLKFST-DGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVC   88 (303)
Q Consensus        10 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~s~-~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~   88 (303)
                      +.++++|..+.||++-+|...  ....+|+.+|.+++|+| ++..|++|+.|++|++||+.++.....+..+ ..+.++.
T Consensus       548 las~~~Dg~v~lWd~~~~~~~--~~~~~H~~~V~~l~~~p~~~~~L~Sgs~Dg~v~iWd~~~~~~~~~~~~~-~~v~~v~  624 (793)
T PLN00181        548 VASSNFEGVVQVWDVARSQLV--TEMKEHEKRVWSIDYSSADPTLLASGSDDGSVKLWSINQGVSIGTIKTK-ANICCVQ  624 (793)
T ss_pred             EEEEeCCCeEEEEECCCCeEE--EEecCCCCCEEEEEEcCCCCCEEEEEcCCCEEEEEECCCCcEEEEEecC-CCeEEEE
Confidence            446778888999988655432  23478999999999996 7899999999999999999988776666544 6789999


Q ss_pred             EccCCCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCccccc
Q 022074           89 FGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNL  168 (303)
Q Consensus        89 ~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~  168 (303)
                      |.+++++.|++|+.|++|++||++..   ..+...+.+|...|.++.|. ++.+|++++.|++|++||++......    
T Consensus       625 ~~~~~g~~latgs~dg~I~iwD~~~~---~~~~~~~~~h~~~V~~v~f~-~~~~lvs~s~D~~ikiWd~~~~~~~~----  696 (793)
T PLN00181        625 FPSESGRSLAFGSADHKVYYYDLRNP---KLPLCTMIGHSKTVSYVRFV-DSSTLVSSSTDNTLKLWDLSMSISGI----  696 (793)
T ss_pred             EeCCCCCEEEEEeCCCeEEEEECCCC---CccceEecCCCCCEEEEEEe-CCCEEEEEECCCEEEEEeCCCCcccc----
Confidence            97777899999999999999998632   22455677899999999996 67899999999999999986421000    


Q ss_pred             CccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEe
Q 022074          169 GFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAAL  248 (303)
Q Consensus       169 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~  248 (303)
                                              ....+..+.+|......      ..+++++.+|++|+.|+.|++|+......+..+
T Consensus       697 ------------------------~~~~l~~~~gh~~~i~~------v~~s~~~~~lasgs~D~~v~iw~~~~~~~~~s~  746 (793)
T PLN00181        697 ------------------------NETPLHSFMGHTNVKNF------VGLSVSDGYIATGSETNEVFVYHKAFPMPVLSY  746 (793)
T ss_pred             ------------------------CCcceEEEcCCCCCeeE------EEEcCCCCEEEEEeCCCEEEEEECCCCCceEEE
Confidence                                    01123344555432221      236678899999999999999998765433221


Q ss_pred             -------------ecCCCCeEEEEECCCCCeEEEEeCCCCEEEeec
Q 022074          249 -------------KYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEF  281 (303)
Q Consensus       249 -------------~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~  281 (303)
                                   ..|...|.+++|+|++++|++|+.||+|++|++
T Consensus       747 ~~~~~~~~~~~~~~~~~~~V~~v~ws~~~~~lva~~~dG~I~i~~~  792 (793)
T PLN00181        747 KFKTIDPVSGLEVDDASQFISSVCWRGQSSTLVAANSTGNIKILEM  792 (793)
T ss_pred             ecccCCcccccccCCCCcEEEEEEEcCCCCeEEEecCCCcEEEEec
Confidence                         234567999999999999999999999999985


No 32 
>KOG0294 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.95  E-value=1.6e-26  Score=191.90  Aligned_cols=237  Identities=22%  Similarity=0.323  Sum_probs=178.1

Q ss_pred             CCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCC-CcEEEEecCCCeEEEEcCccc
Q 022074           36 GGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDES-GHLIYSGSDDNLCKVWDRRCL  114 (303)
Q Consensus        36 ~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~-~~~l~s~s~dg~v~lWd~~~~  114 (303)
                      .+|..+|.+++.+  |.++++||.|.+|+|||++.......+..|.+.++++.|.++. .+.|++|+.||.|.+|+... 
T Consensus        40 ~aH~~sitavAVs--~~~~aSGssDetI~IYDm~k~~qlg~ll~HagsitaL~F~~~~S~shLlS~sdDG~i~iw~~~~-  116 (362)
T KOG0294|consen   40 SAHAGSITALAVS--GPYVASGSSDETIHIYDMRKRKQLGILLSHAGSITALKFYPPLSKSHLLSGSDDGHIIIWRVGS-  116 (362)
T ss_pred             cccccceeEEEec--ceeEeccCCCCcEEEEeccchhhhcceeccccceEEEEecCCcchhheeeecCCCcEEEEEcCC-
Confidence            6899999999987  7899999999999999999988888888999999999996653 34788999999999999763 


Q ss_pred             cCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCC
Q 022074          115 NVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCD  194 (303)
Q Consensus       115 ~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  194 (303)
                         ......+.+|...|+.++++|.+.+.++.|.|+.+|+|||-..+..+.+++.....     .+.+.+.+..+.....
T Consensus       117 ---W~~~~slK~H~~~Vt~lsiHPS~KLALsVg~D~~lr~WNLV~Gr~a~v~~L~~~at-----~v~w~~~Gd~F~v~~~  188 (362)
T KOG0294|consen  117 ---WELLKSLKAHKGQVTDLSIHPSGKLALSVGGDQVLRTWNLVRGRVAFVLNLKNKAT-----LVSWSPQGDHFVVSGR  188 (362)
T ss_pred             ---eEEeeeecccccccceeEecCCCceEEEEcCCceeeeehhhcCccceeeccCCcce-----eeEEcCCCCEEEEEec
Confidence               34567788999999999999999999999999999999998765544443321110     1223333332222222


Q ss_pred             CcceEEeccc--cee---eeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEE--CCCCCeE
Q 022074          195 QSVATYKGHS--VLR---TLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSW--HPSQPML  267 (303)
Q Consensus       195 ~~~~~~~~~~--~~~---~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~--sp~~~~l  267 (303)
                      +.+..++-..  ...   .-.+.+.. .| -++.+|++|+.|+.|++||......+..+.+|+.+|.++.+  .|.+.+|
T Consensus       189 ~~i~i~q~d~A~v~~~i~~~~r~l~~-~~-l~~~~L~vG~d~~~i~~~D~ds~~~~~~~~AH~~RVK~i~~~~~~~~~~l  266 (362)
T KOG0294|consen  189 NKIDIYQLDNASVFREIENPKRILCA-TF-LDGSELLVGGDNEWISLKDTDSDTPLTEFLAHENRVKDIASYTNPEHEYL  266 (362)
T ss_pred             cEEEEEecccHhHhhhhhccccceee-ee-cCCceEEEecCCceEEEeccCCCccceeeecchhheeeeEEEecCCceEE
Confidence            2222222111  000   00011111 11 25678999999999999999998899999999999999995  5678899


Q ss_pred             EEEeCCCCEEEeecCCCC
Q 022074          268 VSSSWDGDVVRWEFPGNG  285 (303)
Q Consensus       268 as~s~Dg~i~~Wd~~~~~  285 (303)
                      +|+|.||.|++||+....
T Consensus       267 vTaSSDG~I~vWd~~~~~  284 (362)
T KOG0294|consen  267 VTASSDGFIKVWDIDMET  284 (362)
T ss_pred             EEeccCceEEEEEccccc
Confidence            999999999999998663


No 33 
>KOG0315 consensus G-protein beta subunit-like protein (contains WD40 repeats) [General function prediction only]
Probab=99.95  E-value=2.2e-26  Score=185.58  Aligned_cols=224  Identities=18%  Similarity=0.258  Sum_probs=162.3

Q ss_pred             CEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCccceeecccccCe
Q 022074           52 RELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGI  131 (303)
Q Consensus        52 ~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v  131 (303)
                      -+|+++|.|-+||+|.+.+|....++...+..|+.+...|+ .+.|++++ ...||+||+++.++  .|+..+.+|...|
T Consensus        11 viLvsA~YDhTIRfWqa~tG~C~rTiqh~dsqVNrLeiTpd-k~~LAaa~-~qhvRlyD~~S~np--~Pv~t~e~h~kNV   86 (311)
T KOG0315|consen   11 VILVSAGYDHTIRFWQALTGICSRTIQHPDSQVNRLEITPD-KKDLAAAG-NQHVRLYDLNSNNP--NPVATFEGHTKNV   86 (311)
T ss_pred             eEEEeccCcceeeeeehhcCeEEEEEecCccceeeEEEcCC-cchhhhcc-CCeeEEEEccCCCC--CceeEEeccCCce
Confidence            47999999999999999999988777777789999999764 55666555 56899999986543  4788999999999


Q ss_pred             EEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceee-eCCCCCccccCCCCCcceEEecccc--eee
Q 022074          132 TFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWM-DYPPQARDLKHPCDQSVATYKGHSV--LRT  208 (303)
Q Consensus       132 ~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~--~~~  208 (303)
                      +++.|..+|+.++|||.||++||||+|.+...-.+...     ..+..+ ..|.++..+......-+..++-...  ...
T Consensus        87 taVgF~~dgrWMyTgseDgt~kIWdlR~~~~qR~~~~~-----spVn~vvlhpnQteLis~dqsg~irvWDl~~~~c~~~  161 (311)
T KOG0315|consen   87 TAVGFQCDGRWMYTGSEDGTVKIWDLRSLSCQRNYQHN-----SPVNTVVLHPNQTELISGDQSGNIRVWDLGENSCTHE  161 (311)
T ss_pred             EEEEEeecCeEEEecCCCceEEEEeccCcccchhccCC-----CCcceEEecCCcceEEeecCCCcEEEEEccCCccccc
Confidence            99999999999999999999999999974332211111     011111 1122222222222222333321110  000


Q ss_pred             e----EEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCe------EEEEeecCCCCeEEEEECCCCCeEEEEeCCCCEEE
Q 022074          209 L----IRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGE------QVAALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVR  278 (303)
Q Consensus       209 ~----~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~------~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~  278 (303)
                      .    .....+....+||++++.+...|.+++|++-+.+      ++..+++|++-|..+-+|||+++||++|.|.++++
T Consensus       162 liPe~~~~i~sl~v~~dgsml~a~nnkG~cyvW~l~~~~~~s~l~P~~k~~ah~~~il~C~lSPd~k~lat~ssdktv~i  241 (311)
T KOG0315|consen  162 LIPEDDTSIQSLTVMPDGSMLAAANNKGNCYVWRLLNHQTASELEPVHKFQAHNGHILRCLLSPDVKYLATCSSDKTVKI  241 (311)
T ss_pred             cCCCCCcceeeEEEcCCCcEEEEecCCccEEEEEccCCCccccceEhhheecccceEEEEEECCCCcEEEeecCCceEEE
Confidence            0    0111223356899999999999999999987653      45567899999999999999999999999999999


Q ss_pred             eecCCC
Q 022074          279 WEFPGN  284 (303)
Q Consensus       279 Wd~~~~  284 (303)
                      |+....
T Consensus       242 wn~~~~  247 (311)
T KOG0315|consen  242 WNTDDF  247 (311)
T ss_pred             EecCCc
Confidence            998765


No 34 
>KOG0295 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.95  E-value=2.1e-26  Score=194.40  Aligned_cols=200  Identities=26%  Similarity=0.375  Sum_probs=173.2

Q ss_pred             CCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCcccc
Q 022074           36 GGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLN  115 (303)
Q Consensus        36 ~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~  115 (303)
                      -||+..|.++.|-|.|.++++++.|.+|+.||..++-...++.+|..-|..+..+ .++.++++++.|.+|++|-..   
T Consensus       190 ~gh~h~vS~V~f~P~gd~ilS~srD~tik~We~~tg~cv~t~~~h~ewvr~v~v~-~DGti~As~s~dqtl~vW~~~---  265 (406)
T KOG0295|consen  190 IGHEHGVSSVFFLPLGDHILSCSRDNTIKAWECDTGYCVKTFPGHSEWVRMVRVN-QDGTIIASCSNDQTLRVWVVA---  265 (406)
T ss_pred             cCcccceeeEEEEecCCeeeecccccceeEEecccceeEEeccCchHhEEEEEec-CCeeEEEecCCCceEEEEEec---
Confidence            5999999999999999999999999999999999999999999999999999986 568999999999999999864   


Q ss_pred             CCCccceeecccccCeEEEEeCCC---------------CCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceee
Q 022074          116 VKGKPAGVLMGHLEGITFIDSRGD---------------GRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWM  180 (303)
Q Consensus       116 ~~~~~~~~~~~h~~~v~~~~~~~~---------------~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~  180 (303)
                       +......+.+|...|.+++|.|.               +.++.+++.|++||+||+...                    
T Consensus       266 -t~~~k~~lR~hEh~vEci~wap~~~~~~i~~at~~~~~~~~l~s~SrDktIk~wdv~tg--------------------  324 (406)
T KOG0295|consen  266 -TKQCKAELREHEHPVECIAWAPESSYPSISEATGSTNGGQVLGSGSRDKTIKIWDVSTG--------------------  324 (406)
T ss_pred             -cchhhhhhhccccceEEEEecccccCcchhhccCCCCCccEEEeecccceEEEEeccCC--------------------
Confidence             22234456778888888877432               358999999999999998532                    


Q ss_pred             eCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEE
Q 022074          181 DYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSW  260 (303)
Q Consensus       181 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~  260 (303)
                                    .++.++.||....      ....|+|.|+||+++.+|+++++||++++++++.++.|+.-+++++|
T Consensus       325 --------------~cL~tL~ghdnwV------r~~af~p~Gkyi~ScaDDktlrvwdl~~~~cmk~~~ah~hfvt~lDf  384 (406)
T KOG0295|consen  325 --------------MCLFTLVGHDNWV------RGVAFSPGGKYILSCADDKTLRVWDLKNLQCMKTLEAHEHFVTSLDF  384 (406)
T ss_pred             --------------eEEEEEeccccee------eeeEEcCCCeEEEEEecCCcEEEEEeccceeeeccCCCcceeEEEec
Confidence                          2455666665432      23458899999999999999999999999999999999999999999


Q ss_pred             CCCCCeEEEEeCCCCEEEee
Q 022074          261 HPSQPMLVSSSWDGDVVRWE  280 (303)
Q Consensus       261 sp~~~~las~s~Dg~i~~Wd  280 (303)
                      +.+.++++||+-|.++++|.
T Consensus       385 h~~~p~VvTGsVdqt~KvwE  404 (406)
T KOG0295|consen  385 HKTAPYVVTGSVDQTVKVWE  404 (406)
T ss_pred             CCCCceEEeccccceeeeee
Confidence            99999999999999999997


No 35 
>KOG0275 consensus Conserved WD40 repeat-containing protein [General function prediction only]
Probab=99.95  E-value=1.4e-27  Score=199.06  Aligned_cols=262  Identities=23%  Similarity=0.328  Sum_probs=196.9

Q ss_pred             CchhhccccccccccCc-----CcccccC-CCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEe-cccCCe
Q 022074           12 SGTMESLANVTEIHDGL-----DFSAADD-GGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRIL-AHTSDV   84 (303)
Q Consensus        12 ~~~~~~~~~~~~~~~~~-----~~~~~~~-~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~-~h~~~v   84 (303)
                      +||-|..|.||.-.+|+     +.+++|. -=|+.+|.|++||.|.+.+++|+.||.|++|.+.+|+...++. +|+.+|
T Consensus       230 sgSvDGFiEVWny~~GKlrKDLkYQAqd~fMMmd~aVlci~FSRDsEMlAsGsqDGkIKvWri~tG~ClRrFdrAHtkGv  309 (508)
T KOG0275|consen  230 SGSVDGFIEVWNYTTGKLRKDLKYQAQDNFMMMDDAVLCISFSRDSEMLASGSQDGKIKVWRIETGQCLRRFDRAHTKGV  309 (508)
T ss_pred             eccccceeeeehhccchhhhhhhhhhhcceeecccceEEEeecccHHHhhccCcCCcEEEEEEecchHHHHhhhhhccCe
Confidence            67889999999999997     4445453 3678899999999999999999999999999999998777766 799999


Q ss_pred             EEEEEccCCCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCc
Q 022074           85 NTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNA  164 (303)
Q Consensus        85 ~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~  164 (303)
                      +|+.|+. ++..+++++.|.++|+--++    .++....+.||+.-|+-..|.++|+.+++++.|++|++|+.....+..
T Consensus       310 t~l~FSr-D~SqiLS~sfD~tvRiHGlK----SGK~LKEfrGHsSyvn~a~ft~dG~~iisaSsDgtvkvW~~KtteC~~  384 (508)
T KOG0275|consen  310 TCLSFSR-DNSQILSASFDQTVRIHGLK----SGKCLKEFRGHSSYVNEATFTDDGHHIISASSDGTVKVWHGKTTECLS  384 (508)
T ss_pred             eEEEEcc-CcchhhcccccceEEEeccc----cchhHHHhcCccccccceEEcCCCCeEEEecCCccEEEecCcchhhhh
Confidence            9999975 46688899999999998775    455677889999999999999999999999999999999987654433


Q ss_pred             ccccCccceeeeceeee-CCCCCccccCC-CCCcc--eEEecccceeee----EE-EeeeeeeeCCCeEEEEEeCCCeEE
Q 022074          165 SCNLGFRSYEWDYRWMD-YPPQARDLKHP-CDQSV--ATYKGHSVLRTL----IR-CHFSPVYSTGQKYIYTGSHDSCVY  235 (303)
Q Consensus       165 ~~~~~~~~~~~~~~~~~-~~~~~~~~~~~-~~~~~--~~~~~~~~~~~~----~~-~~~~~~~s~~~~~latg~~dg~i~  235 (303)
                      .+.-.  +-+..+.... +|-+...+..+ ..+.+  ..++|.......    .. ...+...||.|.++.+.++|+.++
T Consensus       385 Tfk~~--~~d~~vnsv~~~PKnpeh~iVCNrsntv~imn~qGQvVrsfsSGkREgGdFi~~~lSpkGewiYcigED~vlY  462 (508)
T KOG0275|consen  385 TFKPL--GTDYPVNSVILLPKNPEHFIVCNRSNTVYIMNMQGQVVRSFSSGKREGGDFINAILSPKGEWIYCIGEDGVLY  462 (508)
T ss_pred             hccCC--CCcccceeEEEcCCCCceEEEEcCCCeEEEEeccceEEeeeccCCccCCceEEEEecCCCcEEEEEccCcEEE
Confidence            32211  1111111111 12111111111 11111  122221110000    00 011233578899999999999999


Q ss_pred             EEECCCCeEEEEeecCCCCeEEEEECCCCCeEEEEeCCCCEEEee
Q 022074          236 VYDLVSGEQVAALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWE  280 (303)
Q Consensus       236 iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd  280 (303)
                      .+...+|++..++..|+..+-.++-+|..+.|||=++||.+++|.
T Consensus       463 CF~~~sG~LE~tl~VhEkdvIGl~HHPHqNllAsYsEDgllKLWk  507 (508)
T KOG0275|consen  463 CFSVLSGKLERTLPVHEKDVIGLTHHPHQNLLASYSEDGLLKLWK  507 (508)
T ss_pred             EEEeecCceeeeeecccccccccccCcccchhhhhcccchhhhcC
Confidence            999999999999999999999999999999999999999999996


No 36 
>KOG0296 consensus Angio-associated migratory cell protein (contains WD40 repeats) [Function unknown]
Probab=99.95  E-value=6.3e-26  Score=191.56  Aligned_cols=259  Identities=19%  Similarity=0.267  Sum_probs=187.6

Q ss_pred             CchhhccccccccccCcCcccccCCCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEcc
Q 022074           12 SGTMESLANVTEIHDGLDFSAADDGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGD   91 (303)
Q Consensus        12 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~   91 (303)
                      .|.=|-++.||++.+|..  +-...||+-.|.++.||.+|.+||+|+.+|.|+||+..++.....+...-..+.=+.|+|
T Consensus        81 TGGgDD~AflW~~~~ge~--~~eltgHKDSVt~~~FshdgtlLATGdmsG~v~v~~~stg~~~~~~~~e~~dieWl~WHp  158 (399)
T KOG0296|consen   81 TGGGDDLAFLWDISTGEF--AGELTGHKDSVTCCSFSHDGTLLATGDMSGKVLVFKVSTGGEQWKLDQEVEDIEWLKWHP  158 (399)
T ss_pred             ecCCCceEEEEEccCCcc--eeEecCCCCceEEEEEccCceEEEecCCCccEEEEEcccCceEEEeecccCceEEEEecc
Confidence            344568999999999883  335599999999999999999999999999999999999988877765556677778876


Q ss_pred             CCCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCccccc---
Q 022074           92 ESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNL---  168 (303)
Q Consensus        92 ~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~---  168 (303)
                       -+..|+.|+.||.+-+|.+.    +......+.||..++++=.|.|+|+.++++..|++|++||+....+....+.   
T Consensus       159 -~a~illAG~~DGsvWmw~ip----~~~~~kv~~Gh~~~ct~G~f~pdGKr~~tgy~dgti~~Wn~ktg~p~~~~~~~e~  233 (399)
T KOG0296|consen  159 -RAHILLAGSTDGSVWMWQIP----SQALCKVMSGHNSPCTCGEFIPDGKRILTGYDDGTIIVWNPKTGQPLHKITQAEG  233 (399)
T ss_pred             -cccEEEeecCCCcEEEEECC----CcceeeEecCCCCCcccccccCCCceEEEEecCceEEEEecCCCceeEEeccccc
Confidence             68899999999999999874    2245678999999999999999999999999999999999875322111110   


Q ss_pred             -Cccceeee------------------------ceeeeCC--CC--------------------CccccCC-CCCcceEE
Q 022074          169 -GFRSYEWD------------------------YRWMDYP--PQ--------------------ARDLKHP-CDQSVATY  200 (303)
Q Consensus       169 -~~~~~~~~------------------------~~~~~~~--~~--------------------~~~~~~~-~~~~~~~~  200 (303)
                       ........                        +.....+  |.                    .+..+.. .+..+..+
T Consensus       234 ~~~~~~~~~~~~~~~~~g~~e~~~~~~~~~sgKVv~~~n~~~~~l~~~~e~~~esve~~~~ss~lpL~A~G~vdG~i~iy  313 (399)
T KOG0296|consen  234 LELPCISLNLAGSTLTKGNSEGVACGVNNGSGKVVNCNNGTVPELKPSQEELDESVESIPSSSKLPLAACGSVDGTIAIY  313 (399)
T ss_pred             CcCCccccccccceeEeccCCccEEEEccccceEEEecCCCCccccccchhhhhhhhhcccccccchhhcccccceEEEE
Confidence             00000000                        0000000  00                    0000000 01111111


Q ss_pred             eccc-cee-------eeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEECCCCCeEEEEeC
Q 022074          201 KGHS-VLR-------TLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQPMLVSSSW  272 (303)
Q Consensus       201 ~~~~-~~~-------~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~las~s~  272 (303)
                      +-.. ..+       .+.++.    |-+ ..+|++++.+|.|+.||.++|+++.++.+|..+|.+++.+|++++++|+|.
T Consensus       314 D~a~~~~R~~c~he~~V~~l~----w~~-t~~l~t~c~~g~v~~wDaRtG~l~~~y~GH~~~Il~f~ls~~~~~vvT~s~  388 (399)
T KOG0296|consen  314 DLAASTLRHICEHEDGVTKLK----WLN-TDYLLTACANGKVRQWDARTGQLKFTYTGHQMGILDFALSPQKRLVVTVSD  388 (399)
T ss_pred             ecccchhheeccCCCceEEEE----EcC-cchheeeccCceEEeeeccccceEEEEecCchheeEEEEcCCCcEEEEecC
Confidence            1100 000       011111    223 468999999999999999999999999999999999999999999999999


Q ss_pred             CCCEEEeecC
Q 022074          273 DGDVVRWEFP  282 (303)
Q Consensus       273 Dg~i~~Wd~~  282 (303)
                      |++.++|+++
T Consensus       389 D~~a~VF~v~  398 (399)
T KOG0296|consen  389 DNTALVFEVP  398 (399)
T ss_pred             CCeEEEEecC
Confidence            9999999975


No 37 
>KOG0277 consensus Peroxisomal targeting signal type 2 receptor [Intracellular trafficking, secretion, and vesicular transport]
Probab=99.95  E-value=8.4e-27  Score=188.51  Aligned_cols=202  Identities=22%  Similarity=0.381  Sum_probs=160.4

Q ss_pred             cceEEEEEcCC-CCEEEEeeCCCeEEEEECCCC-ceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCC
Q 022074           40 FGIFSLKFSTD-GRELVAGSSDDCIYVYDLEAN-KLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVK  117 (303)
Q Consensus        40 ~~v~~l~~s~~-g~~l~sgs~Dg~v~lwd~~~~-~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~  117 (303)
                      .+++.++|+++ .+.+++++.||+++|||+... ..+..++.|...|.++-|++.....++++|+|++|+|||..    .
T Consensus        61 D~LfdV~Wse~~e~~~~~a~GDGSLrl~d~~~~s~Pi~~~kEH~~EV~Svdwn~~~r~~~ltsSWD~TiKLW~~~----r  136 (311)
T KOG0277|consen   61 DGLFDVAWSENHENQVIAASGDGSLRLFDLTMPSKPIHKFKEHKREVYSVDWNTVRRRIFLTSSWDGTIKLWDPN----R  136 (311)
T ss_pred             cceeEeeecCCCcceEEEEecCceEEEeccCCCCcchhHHHhhhhheEEeccccccceeEEeeccCCceEeecCC----C
Confidence            46999999975 568999999999999997543 34456788999999999988878889999999999999953    3


Q ss_pred             CccceeecccccCeEEEEeCC-CCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCc
Q 022074          118 GKPAGVLMGHLEGITFIDSRG-DGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQS  196 (303)
Q Consensus       118 ~~~~~~~~~h~~~v~~~~~~~-~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  196 (303)
                      ...+.++.||...|....|+| ..+++++++.|+++++||+|..-.   .                              
T Consensus       137 ~~Sv~Tf~gh~~~Iy~a~~sp~~~nlfas~Sgd~~l~lwdvr~~gk---~------------------------------  183 (311)
T KOG0277|consen  137 PNSVQTFNGHNSCIYQAAFSPHIPNLFASASGDGTLRLWDVRSPGK---F------------------------------  183 (311)
T ss_pred             CcceEeecCCccEEEEEecCCCCCCeEEEccCCceEEEEEecCCCc---e------------------------------
Confidence            345678999999999999988 578999999999999999875311   0                              


Q ss_pred             ceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCe-EEEEeecCCCCeEEEEECCCC-CeEEEEeCCC
Q 022074          197 VATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGE-QVAALKYHTSPVRDCSWHPSQ-PMLVSSSWDG  274 (303)
Q Consensus       197 ~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~-~~~~~~~h~~~I~~v~~sp~~-~~las~s~Dg  274 (303)
                       ..+..|..  .++.|-++-   .+...|+||+.|+.|++||+++.+ ++.++.+|.-.|..++|||.. .+|||++.|.
T Consensus       184 -~~i~ah~~--Eil~cdw~k---y~~~vl~Tg~vd~~vr~wDir~~r~pl~eL~gh~~AVRkvk~Sph~~~lLaSasYDm  257 (311)
T KOG0277|consen  184 -MSIEAHNS--EILCCDWSK---YNHNVLATGGVDNLVRGWDIRNLRTPLFELNGHGLAVRKVKFSPHHASLLASASYDM  257 (311)
T ss_pred             -eEEEeccc--eeEeecccc---cCCcEEEecCCCceEEEEehhhccccceeecCCceEEEEEecCcchhhHhhhccccc
Confidence             00111211  122232221   245789999999999999998754 588899999999999999986 5899999999


Q ss_pred             CEEEeecCCC
Q 022074          275 DVVRWEFPGN  284 (303)
Q Consensus       275 ~i~~Wd~~~~  284 (303)
                      ++++||....
T Consensus       258 T~riw~~~~~  267 (311)
T KOG0277|consen  258 TVRIWDPERQ  267 (311)
T ss_pred             eEEecccccc
Confidence            9999998644


No 38 
>KOG0277 consensus Peroxisomal targeting signal type 2 receptor [Intracellular trafficking, secretion, and vesicular transport]
Probab=99.95  E-value=7.6e-27  Score=188.74  Aligned_cols=204  Identities=23%  Similarity=0.361  Sum_probs=167.0

Q ss_pred             CCcccceEEEEEcC-CCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccc
Q 022074           36 GGYSFGIFSLKFST-DGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCL  114 (303)
Q Consensus        36 ~~~~~~v~~l~~s~-~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~  114 (303)
                      .-|+..|+++.|++ +++.++++|+|++|+||+....+.+.++.+|...|....|+|..+++|+++|.|+++++||++..
T Consensus       101 kEH~~EV~Svdwn~~~r~~~ltsSWD~TiKLW~~~r~~Sv~Tf~gh~~~Iy~a~~sp~~~nlfas~Sgd~~l~lwdvr~~  180 (311)
T KOG0277|consen  101 KEHKREVYSVDWNTVRRRIFLTSSWDGTIKLWDPNRPNSVQTFNGHNSCIYQAAFSPHIPNLFASASGDGTLRLWDVRSP  180 (311)
T ss_pred             HhhhhheEEeccccccceeEEeeccCCceEeecCCCCcceEeecCCccEEEEEecCCCCCCeEEEccCCceEEEEEecCC
Confidence            47999999999997 56678899999999999999998888999999999999999988999999999999999998743


Q ss_pred             cCCCccceeecccccCeEEEEeCC-CCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCC
Q 022074          115 NVKGKPAGVLMGHLEGITFIDSRG-DGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPC  193 (303)
Q Consensus       115 ~~~~~~~~~~~~h~~~v~~~~~~~-~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  193 (303)
                         ++..- +..|...+.+++|+. +.+.++||+.|+.||.||+|..+.                               
T Consensus       181 ---gk~~~-i~ah~~Eil~cdw~ky~~~vl~Tg~vd~~vr~wDir~~r~-------------------------------  225 (311)
T KOG0277|consen  181 ---GKFMS-IEAHNSEILCCDWSKYNHNVLATGGVDNLVRGWDIRNLRT-------------------------------  225 (311)
T ss_pred             ---CceeE-EEeccceeEeecccccCCcEEEecCCCceEEEEehhhccc-------------------------------
Confidence               34343 567888899999886 556789999999999999997532                               


Q ss_pred             CCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCe-EEEEeecCCCCeEEEEECCCC-CeEEEEe
Q 022074          194 DQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGE-QVAALKYHTSPVRDCSWHPSQ-PMLVSSS  271 (303)
Q Consensus       194 ~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~-~~~~~~~h~~~I~~v~~sp~~-~~las~s  271 (303)
                        .+..+.||....-  +..++|-   ...+||+++.|-+++|||...++ .+.+.+.|+.-+..++||+.. .++|+.+
T Consensus       226 --pl~eL~gh~~AVR--kvk~Sph---~~~lLaSasYDmT~riw~~~~~ds~~e~~~~HtEFv~g~Dws~~~~~~vAs~g  298 (311)
T KOG0277|consen  226 --PLFELNGHGLAVR--KVKFSPH---HASLLASASYDMTVRIWDPERQDSAIETVDHHTEFVCGLDWSLFDPGQVASTG  298 (311)
T ss_pred             --cceeecCCceEEE--EEecCcc---hhhHhhhccccceEEecccccchhhhhhhhccceEEeccccccccCceeeecc
Confidence              2445556654322  2233331   24679999999999999998654 356678899999999999964 5899999


Q ss_pred             CCCCEEEeec
Q 022074          272 WDGDVVRWEF  281 (303)
Q Consensus       272 ~Dg~i~~Wd~  281 (303)
                      .|..+.+|+.
T Consensus       299 WDe~l~Vw~p  308 (311)
T KOG0277|consen  299 WDELLYVWNP  308 (311)
T ss_pred             cccceeeecc
Confidence            9999999995


No 39 
>KOG0291 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=99.95  E-value=6.6e-26  Score=206.56  Aligned_cols=201  Identities=23%  Similarity=0.519  Sum_probs=164.9

Q ss_pred             cceEEEEEcCCCCEEEEeeC-CCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCC
Q 022074           40 FGIFSLKFSTDGRELVAGSS-DDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKG  118 (303)
Q Consensus        40 ~~v~~l~~s~~g~~l~sgs~-Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~  118 (303)
                      ++|.+++|+..|++|+.|+. -|.+-||+.....-+.+.++|...+++++++| +++++++|+.||.|++||..    .+
T Consensus       308 ~~I~t~~~N~tGDWiA~g~~klgQLlVweWqsEsYVlKQQgH~~~i~~l~YSp-Dgq~iaTG~eDgKVKvWn~~----Sg  382 (893)
T KOG0291|consen  308 QKILTVSFNSTGDWIAFGCSKLGQLLVWEWQSESYVLKQQGHSDRITSLAYSP-DGQLIATGAEDGKVKVWNTQ----SG  382 (893)
T ss_pred             ceeeEEEecccCCEEEEcCCccceEEEEEeeccceeeeccccccceeeEEECC-CCcEEEeccCCCcEEEEecc----Cc
Confidence            46999999999999999875 48999999999888888899999999999975 58999999999999999964    44


Q ss_pred             ccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCcce
Q 022074          119 KPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVA  198 (303)
Q Consensus       119 ~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  198 (303)
                      ....+|..|+.+|+.+.|...++.+++.+-||+||.||+.+-..       ++.+.       .| .          .+ 
T Consensus       383 fC~vTFteHts~Vt~v~f~~~g~~llssSLDGtVRAwDlkRYrN-------fRTft-------~P-~----------p~-  436 (893)
T KOG0291|consen  383 FCFVTFTEHTSGVTAVQFTARGNVLLSSSLDGTVRAWDLKRYRN-------FRTFT-------SP-E----------PI-  436 (893)
T ss_pred             eEEEEeccCCCceEEEEEEecCCEEEEeecCCeEEeeeecccce-------eeeec-------CC-C----------ce-
Confidence            55678999999999999999999999999999999999864221       11100       00 0          00 


Q ss_pred             EEecccceeeeEEEeeeeeeeCCCeEEEEEeCCC-eEEEEECCCCeEEEEeecCCCCeEEEEECCCCCeEEEEeCCCCEE
Q 022074          199 TYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDS-CVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQPMLVSSSWDGDVV  277 (303)
Q Consensus       199 ~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg-~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~  277 (303)
                                ..   .+....|.|.++..|+.|. .|++|+.++|+++-.+++|++||.+++|+|++..|||+|+|.+++
T Consensus       437 ----------Qf---scvavD~sGelV~AG~~d~F~IfvWS~qTGqllDiLsGHEgPVs~l~f~~~~~~LaS~SWDkTVR  503 (893)
T KOG0291|consen  437 ----------QF---SCVAVDPSGELVCAGAQDSFEIFVWSVQTGQLLDILSGHEGPVSGLSFSPDGSLLASGSWDKTVR  503 (893)
T ss_pred             ----------ee---eEEEEcCCCCEEEeeccceEEEEEEEeecCeeeehhcCCCCcceeeEEccccCeEEeccccceEE
Confidence                      00   0111235577777777665 599999999999999999999999999999999999999999999


Q ss_pred             EeecCCC
Q 022074          278 RWEFPGN  284 (303)
Q Consensus       278 ~Wd~~~~  284 (303)
                      +||+-..
T Consensus       504 iW~if~s  510 (893)
T KOG0291|consen  504 IWDIFSS  510 (893)
T ss_pred             EEEeecc
Confidence            9997544


No 40 
>KOG0316 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=99.95  E-value=2.8e-26  Score=183.66  Aligned_cols=238  Identities=20%  Similarity=0.242  Sum_probs=178.5

Q ss_pred             CCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCcccc
Q 022074           36 GGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLN  115 (303)
Q Consensus        36 ~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~  115 (303)
                      .+|..+|.++.|+.||++.++++.|.+|+||+...+.++++..+|...|..++.+.+ +..|++|+.|..|.+||..   
T Consensus        14 ~~~qgaV~avryN~dGnY~ltcGsdrtvrLWNp~rg~liktYsghG~EVlD~~~s~D-nskf~s~GgDk~v~vwDV~---   89 (307)
T KOG0316|consen   14 DCAQGAVRAVRYNVDGNYCLTCGSDRTVRLWNPLRGALIKTYSGHGHEVLDAALSSD-NSKFASCGGDKAVQVWDVN---   89 (307)
T ss_pred             cccccceEEEEEccCCCEEEEcCCCceEEeecccccceeeeecCCCceeeecccccc-ccccccCCCCceEEEEEcc---
Confidence            477889999999999999999999999999999999999999999999999988654 5678899999999999985   


Q ss_pred             CCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCC-ccccCCCC
Q 022074          116 VKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQA-RDLKHPCD  194 (303)
Q Consensus       116 ~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~  194 (303)
                       +++..+.+.+|...|+.+.|+.+...+++|+.|.++|+||-|.........+.....  .+.  ...... ..+....+
T Consensus        90 -TGkv~Rr~rgH~aqVNtV~fNeesSVv~SgsfD~s~r~wDCRS~s~ePiQildea~D--~V~--Si~v~~heIvaGS~D  164 (307)
T KOG0316|consen   90 -TGKVDRRFRGHLAQVNTVRFNEESSVVASGSFDSSVRLWDCRSRSFEPIQILDEAKD--GVS--SIDVAEHEIVAGSVD  164 (307)
T ss_pred             -cCeeeeecccccceeeEEEecCcceEEEeccccceeEEEEcccCCCCccchhhhhcC--cee--EEEecccEEEeeccC
Confidence             667888999999999999999999999999999999999998654322221110000  000  000000 11111122


Q ss_pred             CcceEEecccc---eeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCC--CeEEEEECCCCCeEEE
Q 022074          195 QSVATYKGHSV---LRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTS--PVRDCSWHPSQPMLVS  269 (303)
Q Consensus       195 ~~~~~~~~~~~---~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~--~I~~v~~sp~~~~las  269 (303)
                      ..+.+++-...   ..+.-.-..+..|+++++.++.++.|+++++.|-++|++++.+++|..  .=.+++++.....+++
T Consensus       165 GtvRtydiR~G~l~sDy~g~pit~vs~s~d~nc~La~~l~stlrLlDk~tGklL~sYkGhkn~eykldc~l~qsdthV~s  244 (307)
T KOG0316|consen  165 GTVRTYDIRKGTLSSDYFGHPITSVSFSKDGNCSLASSLDSTLRLLDKETGKLLKSYKGHKNMEYKLDCCLNQSDTHVFS  244 (307)
T ss_pred             CcEEEEEeecceeehhhcCCcceeEEecCCCCEEEEeeccceeeecccchhHHHHHhcccccceeeeeeeecccceeEEe
Confidence            22222221100   000000012345889999999999999999999999999999999975  3457788888889999


Q ss_pred             EeCCCCEEEeecC
Q 022074          270 SSWDGDVVRWEFP  282 (303)
Q Consensus       270 ~s~Dg~i~~Wd~~  282 (303)
                      ||+||.+.+||+-
T Consensus       245 gSEDG~Vy~wdLv  257 (307)
T KOG0316|consen  245 GSEDGKVYFWDLV  257 (307)
T ss_pred             ccCCceEEEEEec
Confidence            9999999999975


No 41 
>KOG0292 consensus Vesicle coat complex COPI, alpha subunit [Intracellular trafficking, secretion, and vesicular transport]
Probab=99.95  E-value=2.6e-26  Score=212.18  Aligned_cols=229  Identities=23%  Similarity=0.326  Sum_probs=178.0

Q ss_pred             CcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccC
Q 022074           37 GYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNV  116 (303)
Q Consensus        37 ~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~  116 (303)
                      .|++||..+.|+|++..+++|+.|-+|++|+.++.+...++.+|.+-|..+.|++. -.+++|+|.|.+||+|+..    
T Consensus        49 eHdGpVRgv~FH~~qplFVSGGDDykIkVWnYk~rrclftL~GHlDYVRt~~FHhe-yPWIlSASDDQTIrIWNwq----  123 (1202)
T KOG0292|consen   49 EHDGPVRGVDFHPTQPLFVSGGDDYKIKVWNYKTRRCLFTLLGHLDYVRTVFFHHE-YPWILSASDDQTIRIWNWQ----  123 (1202)
T ss_pred             ccCCccceeeecCCCCeEEecCCccEEEEEecccceehhhhccccceeEEeeccCC-CceEEEccCCCeEEEEecc----
Confidence            79999999999999999999999999999999999998999999999999999876 4589999999999999974    


Q ss_pred             CCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCC-
Q 022074          117 KGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQ-  195 (303)
Q Consensus       117 ~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-  195 (303)
                      ....+.++.||..-|.|..|+|...+++|+|-|.+||+||+..++.......   +.+-..+.   .+....+....+. 
T Consensus       124 sr~~iavltGHnHYVMcAqFhptEDlIVSaSLDQTVRVWDisGLRkk~~~pg---~~e~~~~~---~~~~~dLfg~~DaV  197 (1202)
T KOG0292|consen  124 SRKCIAVLTGHNHYVMCAQFHPTEDLIVSASLDQTVRVWDISGLRKKNKAPG---SLEDQMRG---QQGNSDLFGQTDAV  197 (1202)
T ss_pred             CCceEEEEecCceEEEeeccCCccceEEEecccceEEEEeecchhccCCCCC---Cchhhhhc---cccchhhcCCcCee
Confidence            4566888999999999999999888999999999999999875433221111   11100000   0000111111111 


Q ss_pred             cceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCe--EEEEeecCCCCeEEEEECCCCCeEEEEeCC
Q 022074          196 SVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGE--QVAALKYHTSPVRDCSWHPSQPMLVSSSWD  273 (303)
Q Consensus       196 ~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~--~~~~~~~h~~~I~~v~~sp~~~~las~s~D  273 (303)
                      ....+.||..      .....+|+|.-.++++|+.|..|++|....-+  ++-+..+|..+|.++-|+|....++|.|+|
T Consensus       198 VK~VLEGHDR------GVNwaAfhpTlpliVSG~DDRqVKlWrmnetKaWEvDtcrgH~nnVssvlfhp~q~lIlSnsED  271 (1202)
T KOG0292|consen  198 VKHVLEGHDR------GVNWAAFHPTLPLIVSGADDRQVKLWRMNETKAWEVDTCRGHYNNVSSVLFHPHQDLILSNSED  271 (1202)
T ss_pred             eeeeeccccc------ccceEEecCCcceEEecCCcceeeEEEeccccceeehhhhcccCCcceEEecCccceeEecCCC
Confidence            1233455542      22334577777899999999999999985433  355667999999999999999999999999


Q ss_pred             CCEEEeecC
Q 022074          274 GDVVRWEFP  282 (303)
Q Consensus       274 g~i~~Wd~~  282 (303)
                      ++|++||..
T Consensus       272 ksirVwDm~  280 (1202)
T KOG0292|consen  272 KSIRVWDMT  280 (1202)
T ss_pred             ccEEEEecc
Confidence            999999974


No 42 
>KOG0318 consensus WD40 repeat stress protein/actin interacting protein [Cytoskeleton]
Probab=99.95  E-value=5e-25  Score=193.81  Aligned_cols=250  Identities=22%  Similarity=0.324  Sum_probs=176.5

Q ss_pred             CCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEe---cccCCeEEEEEccCCCcEEEEecCCCeEEEEcCc
Q 022074           36 GGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRIL---AHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRR  112 (303)
Q Consensus        36 ~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~---~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~  112 (303)
                      .-|.-=|+|+.|+|||+++++.+.||++.+||-+++.....+.   +|++.|.++.|+|+ .+.|+|++.|.++++||..
T Consensus       187 r~HskFV~~VRysPDG~~Fat~gsDgki~iyDGktge~vg~l~~~~aHkGsIfalsWsPD-s~~~~T~SaDkt~KIWdVs  265 (603)
T KOG0318|consen  187 REHSKFVNCVRYSPDGSRFATAGSDGKIYIYDGKTGEKVGELEDSDAHKGSIFALSWSPD-STQFLTVSADKTIKIWDVS  265 (603)
T ss_pred             cccccceeeEEECCCCCeEEEecCCccEEEEcCCCccEEEEecCCCCccccEEEEEECCC-CceEEEecCCceEEEEEee
Confidence            4677789999999999999999999999999999999888877   89999999999764 7899999999999999964


Q ss_pred             cccC---------------------------------------CCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEE
Q 022074          113 CLNV---------------------------------------KGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIK  153 (303)
Q Consensus       113 ~~~~---------------------------------------~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~  153 (303)
                      ....                                       ...+...+.||..+|+++..++++.+|++|+.||.|.
T Consensus       266 ~~slv~t~~~~~~v~dqqvG~lWqkd~lItVSl~G~in~ln~~d~~~~~~i~GHnK~ITaLtv~~d~~~i~SgsyDG~I~  345 (603)
T KOG0318|consen  266 TNSLVSTWPMGSTVEDQQVGCLWQKDHLITVSLSGTINYLNPSDPSVLKVISGHNKSITALTVSPDGKTIYSGSYDGHIN  345 (603)
T ss_pred             ccceEEEeecCCchhceEEEEEEeCCeEEEEEcCcEEEEecccCCChhheecccccceeEEEEcCCCCEEEeeccCceEE
Confidence            2111                                       0123445679999999999999999999999999999


Q ss_pred             EEEcccccCCccc-----c----------cCccceeee--ceee-------------eCCCCCccccCCCC---------
Q 022074          154 LWDIRKMSSNASC-----N----------LGFRSYEWD--YRWM-------------DYPPQARDLKHPCD---------  194 (303)
Q Consensus       154 lWdl~~~~~~~~~-----~----------~~~~~~~~~--~~~~-------------~~~~~~~~~~~~~~---------  194 (303)
                      -||..........     +          .......|+  ++..             +++.+.+.+....+         
T Consensus       346 ~W~~~~g~~~~~~g~~h~nqI~~~~~~~~~~~~t~g~Dd~l~~~~~~~~~~t~~~~~~lg~QP~~lav~~d~~~avv~~~  425 (603)
T KOG0318|consen  346 SWDSGSGTSDRLAGKGHTNQIKGMAASESGELFTIGWDDTLRVISLKDNGYTKSEVVKLGSQPKGLAVLSDGGTAVVACI  425 (603)
T ss_pred             EEecCCccccccccccccceEEEEeecCCCcEEEEecCCeEEEEecccCcccccceeecCCCceeEEEcCCCCEEEEEec
Confidence            9998764321100     0          001111122  0111             11111111111111         


Q ss_pred             CcceEEecccceeee-EEEe-eeeeeeCCCeEEEEEeCCCeEEEEECCCCeE--EEEeecCCCCeEEEEECCCCCeEEEE
Q 022074          195 QSVATYKGHSVLRTL-IRCH-FSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQ--VAALKYHTSPVRDCSWHPSQPMLVSS  270 (303)
Q Consensus       195 ~~~~~~~~~~~~~~~-~~~~-~~~~~s~~~~~latg~~dg~i~iwd~~~~~~--~~~~~~h~~~I~~v~~sp~~~~las~  270 (303)
                      ..+..++........ +... ...+++|+++++|.|++|+.+++|.+..++.  ...+..|.++|++++||||+.+||++
T Consensus       426 ~~iv~l~~~~~~~~~~~~y~~s~vAv~~~~~~vaVGG~Dgkvhvysl~g~~l~ee~~~~~h~a~iT~vaySpd~~yla~~  505 (603)
T KOG0318|consen  426 SDIVLLQDQTKVSSIPIGYESSAVAVSPDGSEVAVGGQDGKVHVYSLSGDELKEEAKLLEHRAAITDVAYSPDGAYLAAG  505 (603)
T ss_pred             CcEEEEecCCcceeeccccccceEEEcCCCCEEEEecccceEEEEEecCCcccceeeeecccCCceEEEECCCCcEEEEe
Confidence            111111111111110 0111 1234789999999999999999999975442  33456799999999999999999999


Q ss_pred             eCCCCEEEeecCCCCc
Q 022074          271 SWDGDVVRWEFPGNGE  286 (303)
Q Consensus       271 s~Dg~i~~Wd~~~~~~  286 (303)
                      +..+.+.+||++...+
T Consensus       506 Da~rkvv~yd~~s~~~  521 (603)
T KOG0318|consen  506 DASRKVVLYDVASREV  521 (603)
T ss_pred             ccCCcEEEEEcccCce
Confidence            9999999999886544


No 43 
>KOG0276 consensus Vesicle coat complex COPI, beta' subunit [Intracellular trafficking, secretion, and vesicular transport]
Probab=99.95  E-value=2.2e-26  Score=205.88  Aligned_cols=201  Identities=23%  Similarity=0.335  Sum_probs=169.0

Q ss_pred             cceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCc
Q 022074           40 FGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGK  119 (303)
Q Consensus        40 ~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~  119 (303)
                      .||.++.|-+..+++++|+.|..||||+..++..+..+.+|.+-+.+++.+|.. .+++|+|.|-+|++||..   ....
T Consensus        56 ~PvRa~kfiaRknWiv~GsDD~~IrVfnynt~ekV~~FeAH~DyIR~iavHPt~-P~vLtsSDDm~iKlW~we---~~wa  131 (794)
T KOG0276|consen   56 VPVRAAKFIARKNWIVTGSDDMQIRVFNYNTGEKVKTFEAHSDYIRSIAVHPTL-PYVLTSSDDMTIKLWDWE---NEWA  131 (794)
T ss_pred             cchhhheeeeccceEEEecCCceEEEEecccceeeEEeeccccceeeeeecCCC-CeEEecCCccEEEEeecc---Ccee
Confidence            479999999999999999999999999999999999999999999999998754 488899999999999974   3445


Q ss_pred             cceeecccccCeEEEEeCC-CCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCcce
Q 022074          120 PAGVLMGHLEGITFIDSRG-DGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVA  198 (303)
Q Consensus       120 ~~~~~~~h~~~v~~~~~~~-~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  198 (303)
                      ....+.||..-|..++|+| |.+.+++++-|++|++|.+....++                                  .
T Consensus       132 ~~qtfeGH~HyVMqv~fnPkD~ntFaS~sLDrTVKVWslgs~~~n----------------------------------f  177 (794)
T KOG0276|consen  132 CEQTFEGHEHYVMQVAFNPKDPNTFASASLDRTVKVWSLGSPHPN----------------------------------F  177 (794)
T ss_pred             eeeEEcCcceEEEEEEecCCCccceeeeeccccEEEEEcCCCCCc----------------------------------e
Confidence            5678999999999999998 4578999999999999998653332                                  2


Q ss_pred             EEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEECCCCCeEEEEeCCCCEEE
Q 022074          199 TYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVR  278 (303)
Q Consensus       199 ~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~  278 (303)
                      +++||..-..++....    -.|..+|++|+.|.+|+|||..+..++.++++|...|..+.|+|.-++++|||+||++++
T Consensus       178 Tl~gHekGVN~Vdyy~----~gdkpylIsgaDD~tiKvWDyQtk~CV~TLeGHt~Nvs~v~fhp~lpiiisgsEDGTvri  253 (794)
T KOG0276|consen  178 TLEGHEKGVNCVDYYT----GGDKPYLISGADDLTIKVWDYQTKSCVQTLEGHTNNVSFVFFHPELPIIISGSEDGTVRI  253 (794)
T ss_pred             eeeccccCcceEEecc----CCCcceEEecCCCceEEEeecchHHHHHHhhcccccceEEEecCCCcEEEEecCCccEEE
Confidence            3344432222222111    134569999999999999999999999999999999999999999999999999999999


Q ss_pred             eecC
Q 022074          279 WEFP  282 (303)
Q Consensus       279 Wd~~  282 (303)
                      |.-.
T Consensus       254 Whs~  257 (794)
T KOG0276|consen  254 WNSK  257 (794)
T ss_pred             ecCc
Confidence            9843


No 44 
>KOG0316 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=99.94  E-value=9.6e-26  Score=180.63  Aligned_cols=250  Identities=23%  Similarity=0.379  Sum_probs=187.9

Q ss_pred             ccccccccCcCcccccCCCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEE
Q 022074           19 ANVTEIHDGLDFSAADDGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIY   98 (303)
Q Consensus        19 ~~~~~~~~~~~~~~~~~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~   98 (303)
                      +..|.-++|-=.  -.+.||...|..++.+.|...+++|+.|..|.+||+.+|+...++.+|.+.|+.++|+ +....++
T Consensus        41 vrLWNp~rg~li--ktYsghG~EVlD~~~s~Dnskf~s~GgDk~v~vwDV~TGkv~Rr~rgH~aqVNtV~fN-eesSVv~  117 (307)
T KOG0316|consen   41 VRLWNPLRGALI--KTYSGHGHEVLDAALSSDNSKFASCGGDKAVQVWDVNTGKVDRRFRGHLAQVNTVRFN-EESSVVA  117 (307)
T ss_pred             EEeeccccccee--eeecCCCceeeeccccccccccccCCCCceEEEEEcccCeeeeecccccceeeEEEec-CcceEEE
Confidence            344545444322  3568999999999999999999999999999999999999999999999999999996 5678999


Q ss_pred             EecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccc-cCccceeeec
Q 022074           99 SGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCN-LGFRSYEWDY  177 (303)
Q Consensus        99 s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~-~~~~~~~~~~  177 (303)
                      ||+.|.++++||-|+.  ..+|++.+....+.|.++.+.  ++.|++|+.||++|.||+|.......+. .+..+..+..
T Consensus       118 SgsfD~s~r~wDCRS~--s~ePiQildea~D~V~Si~v~--~heIvaGS~DGtvRtydiR~G~l~sDy~g~pit~vs~s~  193 (307)
T KOG0316|consen  118 SGSFDSSVRLWDCRSR--SFEPIQILDEAKDGVSSIDVA--EHEIVAGSVDGTVRTYDIRKGTLSSDYFGHPITSVSFSK  193 (307)
T ss_pred             eccccceeEEEEcccC--CCCccchhhhhcCceeEEEec--ccEEEeeccCCcEEEEEeecceeehhhcCCcceeEEecC
Confidence            9999999999998754  446788888888999999874  5689999999999999999764432211 0011111110


Q ss_pred             e-ee----eCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCC
Q 022074          178 R-WM----DYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHT  252 (303)
Q Consensus       178 ~-~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~  252 (303)
                      . .+    ......+.+--.....+..++||....+-..|.+..    ....+++|++||.+++||+.....+..+..|.
T Consensus       194 d~nc~La~~l~stlrLlDk~tGklL~sYkGhkn~eykldc~l~q----sdthV~sgSEDG~Vy~wdLvd~~~~sk~~~~~  269 (307)
T KOG0316|consen  194 DGNCSLASSLDSTLRLLDKETGKLLKSYKGHKNMEYKLDCCLNQ----SDTHVFSGSEDGKVYFWDLVDETQISKLSVVS  269 (307)
T ss_pred             CCCEEEEeeccceeeecccchhHHHHHhcccccceeeeeeeecc----cceeEEeccCCceEEEEEeccceeeeeeccCC
Confidence            0 00    000111111112233466788898888777777654    35679999999999999999999998898888


Q ss_pred             CC-eEEEEECCCCCeEEEEeCCCCEEEee
Q 022074          253 SP-VRDCSWHPSQPMLVSSSWDGDVVRWE  280 (303)
Q Consensus       253 ~~-I~~v~~sp~~~~las~s~Dg~i~~Wd  280 (303)
                      .. |.+++++|.-.-|.++.. +....|.
T Consensus       270 ~v~v~dl~~hp~~~~f~~A~~-~~~~~~~  297 (307)
T KOG0316|consen  270 TVIVTDLSCHPTMDDFITATG-HGDLFWY  297 (307)
T ss_pred             ceeEEeeecccCccceeEecC-Cceecee
Confidence            87 999999999887777764 4555665


No 45 
>cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto
Probab=99.94  E-value=1.3e-24  Score=186.03  Aligned_cols=256  Identities=28%  Similarity=0.410  Sum_probs=187.0

Q ss_pred             hhhccccccccccCcCcccccCCCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCC
Q 022074           14 TMESLANVTEIHDGLDFSAADDGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDES   93 (303)
Q Consensus        14 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~   93 (303)
                      +.+..+.+|+.-++...  ....+|..++.++.|+++++.+++++.||.|++||+.+++....+..|...+.++.|.++ 
T Consensus        28 ~~~g~i~i~~~~~~~~~--~~~~~~~~~i~~~~~~~~~~~l~~~~~~~~i~i~~~~~~~~~~~~~~~~~~i~~~~~~~~-  104 (289)
T cd00200          28 SGDGTIKVWDLETGELL--RTLKGHTGPVRDVAASADGTYLASGSSDKTIRLWDLETGECVRTLTGHTSYVSSVAFSPD-  104 (289)
T ss_pred             ecCcEEEEEEeeCCCcE--EEEecCCcceeEEEECCCCCEEEEEcCCCeEEEEEcCcccceEEEeccCCcEEEEEEcCC-
Confidence            34677888887555422  234688999999999999999999999999999999988777778889889999999764 


Q ss_pred             CcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccce
Q 022074           94 GHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSY  173 (303)
Q Consensus        94 ~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~  173 (303)
                      ++++++++.|+.+++||++    ..+....+..|...+.++.+++++.++++++.|+.+++||++..........    .
T Consensus       105 ~~~~~~~~~~~~i~~~~~~----~~~~~~~~~~~~~~i~~~~~~~~~~~l~~~~~~~~i~i~d~~~~~~~~~~~~----~  176 (289)
T cd00200         105 GRILSSSSRDKTIKVWDVE----TGKCLTTLRGHTDWVNSVAFSPDGTFVASSSQDGTIKLWDLRTGKCVATLTG----H  176 (289)
T ss_pred             CCEEEEecCCCeEEEEECC----CcEEEEEeccCCCcEEEEEEcCcCCEEEEEcCCCcEEEEEccccccceeEec----C
Confidence            6788888889999999975    2344556667888999999999988888888899999999975433222111    1


Q ss_pred             eeeceeeeCCCCCccccC-CCCCcceEEecccc-eeeeE----EEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEE
Q 022074          174 EWDYRWMDYPPQARDLKH-PCDQSVATYKGHSV-LRTLI----RCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAA  247 (303)
Q Consensus       174 ~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~-~~~~~----~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~  247 (303)
                      ...+..+.+.+....+.. ..+..+..++-... .....    .......+++++.++++++.||.|++||..+++.+..
T Consensus       177 ~~~i~~~~~~~~~~~l~~~~~~~~i~i~d~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~i~i~~~~~~~~~~~  256 (289)
T cd00200         177 TGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDGYLLASGSEDGTIRVWDLRTGECVQT  256 (289)
T ss_pred             ccccceEEECCCcCEEEEecCCCcEEEEECCCCceecchhhcCCceEEEEEcCCCcEEEEEcCCCcEEEEEcCCceeEEE
Confidence            111222333333322211 11223333322110 00000    0112344677888888888899999999999888888


Q ss_pred             eecCCCCeEEEEECCCCCeEEEEeCCCCEEEee
Q 022074          248 LKYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWE  280 (303)
Q Consensus       248 ~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd  280 (303)
                      +..|..+|.+++|+|++++|++++.|+.+++|+
T Consensus       257 ~~~~~~~i~~~~~~~~~~~l~~~~~d~~i~iw~  289 (289)
T cd00200         257 LSGHTNSVTSLAWSPDGKRLASGSADGTIRIWD  289 (289)
T ss_pred             ccccCCcEEEEEECCCCCEEEEecCCCeEEecC
Confidence            889999999999999999999999999999996


No 46 
>KOG0645 consensus WD40 repeat protein [General function prediction only]
Probab=99.94  E-value=3.7e-25  Score=180.44  Aligned_cols=273  Identities=18%  Similarity=0.237  Sum_probs=186.0

Q ss_pred             EEEE---EccCchhhccccccccccCcCcc--cccCCCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCc--eEEEE
Q 022074            5 VHIV---DVGSGTMESLANVTEIHDGLDFS--AADDGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANK--LSLRI   77 (303)
Q Consensus         5 ~~~~---~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~--~~~~~   77 (303)
                      .||.   ++-+++-|-.+.+|+.--|....  ..-+.||+..|.+++|+|.|++|+++|.|.++.||.-..+.  ....+
T Consensus        22 whp~~g~ilAscg~Dk~vriw~~~~~~s~~ck~vld~~hkrsVRsvAwsp~g~~La~aSFD~t~~Iw~k~~~efecv~~l  101 (312)
T KOG0645|consen   22 WHPGKGVILASCGTDKAVRIWSTSSGDSWTCKTVLDDGHKRSVRSVAWSPHGRYLASASFDATVVIWKKEDGEFECVATL  101 (312)
T ss_pred             eccCCceEEEeecCCceEEEEecCCCCcEEEEEeccccchheeeeeeecCCCcEEEEeeccceEEEeecCCCceeEEeee
Confidence            4555   67778888888888875444443  23335999999999999999999999999999999776554  34568


Q ss_pred             ecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEc
Q 022074           78 LAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDI  157 (303)
Q Consensus        78 ~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl  157 (303)
                      .+|...|.+++|+ .++++|++++.|++|-+|..... .+-.....++.|.-.|..+.|+|...+|+++|.|.+|++|+-
T Consensus       102 EGHEnEVK~Vaws-~sG~~LATCSRDKSVWiWe~ded-dEfec~aVL~~HtqDVK~V~WHPt~dlL~S~SYDnTIk~~~~  179 (312)
T KOG0645|consen  102 EGHENEVKCVAWS-ASGNYLATCSRDKSVWIWEIDED-DEFECIAVLQEHTQDVKHVIWHPTEDLLFSCSYDNTIKVYRD  179 (312)
T ss_pred             eccccceeEEEEc-CCCCEEEEeeCCCeEEEEEecCC-CcEEEEeeeccccccccEEEEcCCcceeEEeccCCeEEEEee
Confidence            8999999999996 56999999999999999987521 223456678999999999999999899999999999999975


Q ss_pred             ccccCCcccccCccceeeeceeeeCCCCCccccCCC-CCcceEEecccceee-eEEEeeeeeeeCCCeEEEEEeCCCeEE
Q 022074          158 RKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPC-DQSVATYKGHSVLRT-LIRCHFSPVYSTGQKYIYTGSHDSCVY  235 (303)
Q Consensus       158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~-~~~~~~~~~~s~~~~~latg~~dg~i~  235 (303)
                      ... ....+...+...+.++-...|.+.+..+.... +..+..+.....+.. ..+-.+...+.  ...+++++.|+.|+
T Consensus       180 ~~d-ddW~c~~tl~g~~~TVW~~~F~~~G~rl~s~sdD~tv~Iw~~~~~~~~~~sr~~Y~v~W~--~~~IaS~ggD~~i~  256 (312)
T KOG0645|consen  180 EDD-DDWECVQTLDGHENTVWSLAFDNIGSRLVSCSDDGTVSIWRLYTDLSGMHSRALYDVPWD--NGVIASGGGDDAIR  256 (312)
T ss_pred             cCC-CCeeEEEEecCccceEEEEEecCCCceEEEecCCcceEeeeeccCcchhcccceEeeeec--ccceEeccCCCEEE
Confidence            421 11111000111111222234444443333222 222222221110000 00111111122  34689999999999


Q ss_pred             EEECCCC------eEE-EEeecCCCCeEEEEECCC-CCeEEEEeCCCCEEEeecC
Q 022074          236 VYDLVSG------EQV-AALKYHTSPVRDCSWHPS-QPMLVSSSWDGDVVRWEFP  282 (303)
Q Consensus       236 iwd~~~~------~~~-~~~~~h~~~I~~v~~sp~-~~~las~s~Dg~i~~Wd~~  282 (303)
                      ++.....      +.+ +.-..|...|++++|+|. .+.|++++.||.+++|.+.
T Consensus       257 lf~~s~~~d~p~~~l~~~~~~aHe~dVNsV~w~p~~~~~L~s~~DDG~v~~W~l~  311 (312)
T KOG0645|consen  257 LFKESDSPDEPSWNLLAKKEGAHEVDVNSVQWNPKVSNRLASGGDDGIVNFWELE  311 (312)
T ss_pred             EEEecCCCCCchHHHHHhhhcccccccceEEEcCCCCCceeecCCCceEEEEEec
Confidence            9976532      111 123478899999999996 6799999999999999874


No 47 
>KOG1407 consensus WD40 repeat protein [Function unknown]
Probab=99.94  E-value=9.7e-26  Score=182.99  Aligned_cols=234  Identities=24%  Similarity=0.334  Sum_probs=173.6

Q ss_pred             ccCCCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEE--EEecccCCeEEEEEccCCCcEEEEecCCCeEEEEc
Q 022074           33 ADDGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSL--RILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWD  110 (303)
Q Consensus        33 ~~~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~--~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd  110 (303)
                      -+..||.+.|.+++|+.+|..+++|+.|+++++|+++......  ...+|.+.|..++|.|+++++|++++.|.+|++||
T Consensus        14 r~~~~~~~~v~Sv~wn~~g~~lasgs~dktv~v~n~e~~r~~~~~~~~gh~~svdql~w~~~~~d~~atas~dk~ir~wd   93 (313)
T KOG1407|consen   14 RELQGHVQKVHSVAWNCDGTKLASGSFDKTVSVWNLERDRFRKELVYRGHTDSVDQLCWDPKHPDLFATASGDKTIRIWD   93 (313)
T ss_pred             HHhhhhhhcceEEEEcccCceeeecccCCceEEEEecchhhhhhhcccCCCcchhhheeCCCCCcceEEecCCceEEEEE
Confidence            3557999999999999999999999999999999998875443  34679999999999999999999999999999999


Q ss_pred             CccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccc--eeeece----eeeCC-
Q 022074          111 RRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRS--YEWDYR----WMDYP-  183 (303)
Q Consensus       111 ~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~--~~~~~~----~~~~~-  183 (303)
                      .+..    ++........+.+ .+.++|+|++++.++.|..|.+.|.|..+........+..  ..|...    .+... 
T Consensus        94 ~r~~----k~~~~i~~~~eni-~i~wsp~g~~~~~~~kdD~it~id~r~~~~~~~~~~~~e~ne~~w~~~nd~Fflt~Gl  168 (313)
T KOG1407|consen   94 IRSG----KCTARIETKGENI-NITWSPDGEYIAVGNKDDRITFIDARTYKIVNEEQFKFEVNEISWNNSNDLFFLTNGL  168 (313)
T ss_pred             eccC----cEEEEeeccCcce-EEEEcCCCCEEEEecCcccEEEEEecccceeehhcccceeeeeeecCCCCEEEEecCC
Confidence            8733    3333333333444 4568999999999999999999999874432221111111  111100    00000 


Q ss_pred             CCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEECCC
Q 022074          184 PQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSWHPS  263 (303)
Q Consensus       184 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~  263 (303)
                      .....+..+....+.+++.|...      ++...|+|+|++||+|+.|..+.+||+...-++..+.-+.-||..+.||.|
T Consensus       169 G~v~ILsypsLkpv~si~AH~sn------CicI~f~p~GryfA~GsADAlvSLWD~~ELiC~R~isRldwpVRTlSFS~d  242 (313)
T KOG1407|consen  169 GCVEILSYPSLKPVQSIKAHPSN------CICIEFDPDGRYFATGSADALVSLWDVDELICERCISRLDWPVRTLSFSHD  242 (313)
T ss_pred             ceEEEEeccccccccccccCCcc------eEEEEECCCCceEeeccccceeeccChhHhhhheeeccccCceEEEEeccC
Confidence            01122333344455556655521      223458899999999999999999999887778888889999999999999


Q ss_pred             CCeEEEEeCCCCEE
Q 022074          264 QPMLVSSSWDGDVV  277 (303)
Q Consensus       264 ~~~las~s~Dg~i~  277 (303)
                      |++||+||+|.-|-
T Consensus       243 g~~lASaSEDh~ID  256 (313)
T KOG1407|consen  243 GRMLASASEDHFID  256 (313)
T ss_pred             cceeeccCccceEE
Confidence            99999999998774


No 48 
>KOG0264 consensus Nucleosome remodeling factor, subunit CAF1/NURF55/MSI1 [Chromatin structure and dynamics]
Probab=99.94  E-value=1.1e-25  Score=194.82  Aligned_cols=236  Identities=22%  Similarity=0.344  Sum_probs=180.2

Q ss_pred             EEEccCchhhccccccccccCcCcc-c-----c--cCCCcccceEEEEEcCCCC-EEEEeeCCCeEEEEECCCCce----
Q 022074            7 IVDVGSGTMESLANVTEIHDGLDFS-A-----A--DDGGYSFGIFSLKFSTDGR-ELVAGSSDDCIYVYDLEANKL----   73 (303)
Q Consensus         7 ~~~~~~~~~~~~~~~~~~~~~~~~~-~-----~--~~~~~~~~v~~l~~s~~g~-~l~sgs~Dg~v~lwd~~~~~~----   73 (303)
                      |.+|..++...-+-|++...=..-. +     +  ...||+..=++++|++... .+++|+.|++|.+||+.....    
T Consensus       137 p~iVAt~t~~~dv~Vfd~tk~~s~~~~~~~~~Pdl~L~gH~~eg~glsWn~~~~g~Lls~~~d~~i~lwdi~~~~~~~~~  216 (422)
T KOG0264|consen  137 PNIVATKTSSGDVYVFDYTKHPSKPKASGECRPDLRLKGHEKEGYGLSWNRQQEGTLLSGSDDHTICLWDINAESKEDKV  216 (422)
T ss_pred             CcEEEecCCCCCEEEEEeccCCCcccccccCCCceEEEeecccccccccccccceeEeeccCCCcEEEEeccccccCCcc
Confidence            3456666666666666654322111 1     1  2258998788899998644 799999999999999976543    


Q ss_pred             ---EEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCC-CCEEEEEeCC
Q 022074           74 ---SLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGD-GRYLISNGKD  149 (303)
Q Consensus        74 ---~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~-~~~l~s~~~D  149 (303)
                         ...+.+|...|+.++|++...++|++++.|+.+.|||+|..  +.++.....+|...|++++|+|- +..|||||.|
T Consensus       217 ~~p~~~~~~h~~~VeDV~~h~~h~~lF~sv~dd~~L~iwD~R~~--~~~~~~~~~ah~~~vn~~~fnp~~~~ilAT~S~D  294 (422)
T KOG0264|consen  217 VDPKTIFSGHEDVVEDVAWHPLHEDLFGSVGDDGKLMIWDTRSN--TSKPSHSVKAHSAEVNCVAFNPFNEFILATGSAD  294 (422)
T ss_pred             ccceEEeecCCcceehhhccccchhhheeecCCCeEEEEEcCCC--CCCCcccccccCCceeEEEeCCCCCceEEeccCC
Confidence               23467899999999999888889999999999999999962  44566677889999999999984 5668999999


Q ss_pred             CcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEe
Q 022074          150 QAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGS  229 (303)
Q Consensus       150 ~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~  229 (303)
                      ++|+|||+|.+..                                 ++.++.+|..  .+....|+|.   ....||+.+
T Consensus       295 ~tV~LwDlRnL~~---------------------------------~lh~~e~H~d--ev~~V~WSPh---~etvLASSg  336 (422)
T KOG0264|consen  295 KTVALWDLRNLNK---------------------------------PLHTFEGHED--EVFQVEWSPH---NETVLASSG  336 (422)
T ss_pred             CcEEEeechhccc---------------------------------CceeccCCCc--ceEEEEeCCC---CCceeEecc
Confidence            9999999997643                                 1223333332  2333344442   356899999


Q ss_pred             CCCeEEEEECCCC--------------eEEEEeecCCCCeEEEEECCCCC-eEEEEeCCCCEEEeecC
Q 022074          230 HDSCVYVYDLVSG--------------EQVAALKYHTSPVRDCSWHPSQP-MLVSSSWDGDVVRWEFP  282 (303)
Q Consensus       230 ~dg~i~iwd~~~~--------------~~~~~~~~h~~~I~~v~~sp~~~-~las~s~Dg~i~~Wd~~  282 (303)
                      .|+.+.+||+..-              +++....+|+..|.+++|+|..+ .++|+++|+.+++|+..
T Consensus       337 ~D~rl~vWDls~ig~eq~~eda~dgppEllF~HgGH~~kV~DfsWnp~ePW~I~SvaeDN~LqIW~~s  404 (422)
T KOG0264|consen  337 TDRRLNVWDLSRIGEEQSPEDAEDGPPELLFIHGGHTAKVSDFSWNPNEPWTIASVAEDNILQIWQMA  404 (422)
T ss_pred             cCCcEEEEeccccccccChhhhccCCcceeEEecCcccccccccCCCCCCeEEEEecCCceEEEeecc
Confidence            9999999998531              34567789999999999999998 58999999999999976


No 49 
>KOG0647 consensus mRNA export protein (contains WD40 repeats) [RNA processing and modification]
Probab=99.94  E-value=1.8e-25  Score=184.75  Aligned_cols=231  Identities=21%  Similarity=0.360  Sum_probs=163.4

Q ss_pred             CCCcccceEEEEEcCC-CCEEEEeeCCCeEEEEECCC-CceE-EEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcC
Q 022074           35 DGGYSFGIFSLKFSTD-GRELVAGSSDDCIYVYDLEA-NKLS-LRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDR  111 (303)
Q Consensus        35 ~~~~~~~v~~l~~s~~-g~~l~sgs~Dg~v~lwd~~~-~~~~-~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~  111 (303)
                      .+..+..|.+|+|||. ...+++||+||+||+|++.. |... +....|.++|.+++|+ ++++.+++|+.|+++++||+
T Consensus        23 ~~pP~DsIS~l~FSP~~~~~~~A~SWD~tVR~wevq~~g~~~~ka~~~~~~PvL~v~Ws-ddgskVf~g~~Dk~~k~wDL  101 (347)
T KOG0647|consen   23 PNPPEDSISALAFSPQADNLLAAGSWDGTVRIWEVQNSGQLVPKAQQSHDGPVLDVCWS-DDGSKVFSGGCDKQAKLWDL  101 (347)
T ss_pred             CCCcccchheeEeccccCceEEecccCCceEEEEEecCCcccchhhhccCCCeEEEEEc-cCCceEEeeccCCceEEEEc
Confidence            3555666999999995 44566899999999999987 3433 3456799999999996 56788999999999999998


Q ss_pred             ccccCCCccceeecccccCeEEEEeCCCCC--EEEEEeCCCcEEEEEcccccCCcccccCccceeeecee----------
Q 022074          112 RCLNVKGKPAGVLMGHLEGITFIDSRGDGR--YLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRW----------  179 (303)
Q Consensus       112 ~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~--~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~----------  179 (303)
                      .+.     +...+..|.++|..+.|-+...  .|+|||.|++||.||+|...+.....+..+.+..+...          
T Consensus       102 ~S~-----Q~~~v~~Hd~pvkt~~wv~~~~~~cl~TGSWDKTlKfWD~R~~~pv~t~~LPeRvYa~Dv~~pm~vVata~r  176 (347)
T KOG0647|consen  102 ASG-----QVSQVAAHDAPVKTCHWVPGMNYQCLVTGSWDKTLKFWDTRSSNPVATLQLPERVYAADVLYPMAVVATAER  176 (347)
T ss_pred             cCC-----CeeeeeecccceeEEEEecCCCcceeEecccccceeecccCCCCeeeeeeccceeeehhccCceeEEEecCC
Confidence            632     3445667999999888866554  79999999999999999876666665555554433210          


Q ss_pred             ------eeCCCC-CccccCCC---CCcceE-----------Eecc----------cceeeeEEEeee-------------
Q 022074          180 ------MDYPPQ-ARDLKHPC---DQSVAT-----------YKGH----------SVLRTLIRCHFS-------------  215 (303)
Q Consensus       180 ------~~~~~~-~~~~~~~~---~~~~~~-----------~~~~----------~~~~~~~~~~~~-------------  215 (303)
                            +.-++. -+.+..+-   -++++.           ..|.          .......+||.+             
T Consensus       177 ~i~vynL~n~~te~k~~~SpLk~Q~R~va~f~d~~~~alGsiEGrv~iq~id~~~~~~nFtFkCHR~~~~~~~~VYaVNs  256 (347)
T KOG0647|consen  177 HIAVYNLENPPTEFKRIESPLKWQTRCVACFQDKDGFALGSIEGRVAIQYIDDPNPKDNFTFKCHRSTNSVNDDVYAVNS  256 (347)
T ss_pred             cEEEEEcCCCcchhhhhcCcccceeeEEEEEecCCceEeeeecceEEEEecCCCCccCceeEEEeccCCCCCCceEEecc
Confidence                  001111 00000000   001111           1110          011123455552             


Q ss_pred             eeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEECCCCCeEEEEe
Q 022074          216 PVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQPMLVSSS  271 (303)
Q Consensus       216 ~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~las~s  271 (303)
                      ..|+|....|+|+|.||++.+||-..+.++++.+.|..||++++|+.+|.++|-+-
T Consensus       257 i~FhP~hgtlvTaGsDGtf~FWDkdar~kLk~s~~~~qpItcc~fn~~G~ifaYA~  312 (347)
T KOG0647|consen  257 IAFHPVHGTLVTAGSDGTFSFWDKDARTKLKTSETHPQPITCCSFNRNGSIFAYAL  312 (347)
T ss_pred             eEeecccceEEEecCCceEEEecchhhhhhhccCcCCCccceeEecCCCCEEEEEe
Confidence            34777778899999999999999999999999999999999999999999888664


No 50 
>KOG0313 consensus Microtubule binding protein YTM1 (contains WD40 repeats) [Cytoskeleton]
Probab=99.94  E-value=1.5e-25  Score=190.19  Aligned_cols=248  Identities=21%  Similarity=0.258  Sum_probs=168.9

Q ss_pred             cCCCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCC--CcEEEEecCCCeEEEEcC
Q 022074           34 DDGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDES--GHLIYSGSDDNLCKVWDR  111 (303)
Q Consensus        34 ~~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~--~~~l~s~s~dg~v~lWd~  111 (303)
                      ..--|...|.++...  +++|++|++||++|+||... +...++.+|.+.+..++|..++  ...|++++.|.++++|..
T Consensus       100 ~~~~hdDWVSsv~~~--~~~IltgsYDg~~riWd~~G-k~~~~~~Ght~~ik~v~~v~~n~~~~~fvsas~Dqtl~Lw~~  176 (423)
T KOG0313|consen  100 QCFLHDDWVSSVKGA--SKWILTGSYDGTSRIWDLKG-KSIKTIVGHTGPIKSVAWVIKNSSSCLFVSASMDQTLRLWKW  176 (423)
T ss_pred             ccccchhhhhhhccc--CceEEEeecCCeeEEEecCC-ceEEEEecCCcceeeeEEEecCCccceEEEecCCceEEEEEe
Confidence            334677788888777  78999999999999999854 4567889999999999884332  346999999999999987


Q ss_pred             ccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEccccc--CCccccc------------Ccc------
Q 022074          112 RCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMS--SNASCNL------------GFR------  171 (303)
Q Consensus       112 ~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~--~~~~~~~------------~~~------  171 (303)
                      ............-.||..+|-+++..+++..+++|+.|..+++|+.....  .....+.            ..+      
T Consensus       177 ~~~~~~~~~~~~~~GHk~~V~sVsv~~sgtr~~SgS~D~~lkiWs~~~~~~~~~E~~s~~rrk~~~~~~~~~~r~P~vtl  256 (423)
T KOG0313|consen  177 NVGENKVKALKVCRGHKRSVDSVSVDSSGTRFCSGSWDTMLKIWSVETDEEDELESSSNRRRKKQKREKEGGTRTPLVTL  256 (423)
T ss_pred             cCchhhhhHHhHhcccccceeEEEecCCCCeEEeecccceeeecccCCCccccccccchhhhhhhhhhhcccccCceEEe
Confidence            53322222223334999999999999999999999999999999932211  0000000            000      


Q ss_pred             -ceeeeceeeeCCCCCccccCCCCCcceEEecccc----eeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCe---
Q 022074          172 -SYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSV----LRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGE---  243 (303)
Q Consensus       172 -~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~---  243 (303)
                       .+...+....+++....+....+-.+..++-...    ..+.-+..++..+++...+|++|+.|..|++||.+++.   
T Consensus       257 ~GHt~~Vs~V~w~d~~v~yS~SwDHTIk~WDletg~~~~~~~~~ksl~~i~~~~~~~Ll~~gssdr~irl~DPR~~~gs~  336 (423)
T KOG0313|consen  257 EGHTEPVSSVVWSDATVIYSVSWDHTIKVWDLETGGLKSTLTTNKSLNCISYSPLSKLLASGSSDRHIRLWDPRTGDGSV  336 (423)
T ss_pred             cccccceeeEEEcCCCceEeecccceEEEEEeecccceeeeecCcceeEeecccccceeeecCCCCceeecCCCCCCCce
Confidence             0001111122222222222222222333321110    11111233445577888999999999999999998774   


Q ss_pred             EEEEeecCCCCeEEEEECCCCC-eEEEEeCCCCEEEeecCCC
Q 022074          244 QVAALKYHTSPVRDCSWHPSQP-MLVSSSWDGDVVRWEFPGN  284 (303)
Q Consensus       244 ~~~~~~~h~~~I~~v~~sp~~~-~las~s~Dg~i~~Wd~~~~  284 (303)
                      ....|.+|+..|.++.|||... +|+|++.|+++++||+...
T Consensus       337 v~~s~~gH~nwVssvkwsp~~~~~~~S~S~D~t~klWDvRS~  378 (423)
T KOG0313|consen  337 VSQSLIGHKNWVSSVKWSPTNEFQLVSGSYDNTVKLWDVRST  378 (423)
T ss_pred             eEEeeecchhhhhheecCCCCceEEEEEecCCeEEEEEeccC
Confidence            2457789999999999999875 6999999999999999864


No 51 
>PLN00181 protein SPA1-RELATED; Provisional
Probab=99.94  E-value=1e-24  Score=214.77  Aligned_cols=226  Identities=17%  Similarity=0.261  Sum_probs=171.1

Q ss_pred             Cchhhcccccccccc----CcCcc-cccCCCcccceEEEEEcC-CCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeE
Q 022074           12 SGTMESLANVTEIHD----GLDFS-AADDGGYSFGIFSLKFST-DGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVN   85 (303)
Q Consensus        12 ~~~~~~~~~~~~~~~----~~~~~-~~~~~~~~~~v~~l~~s~-~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~   85 (303)
                      +|+.|..+.||+.-.    +.... +.....+...|.+++|++ ++.+|++++.||+|+|||+.+++....+.+|.+.|.
T Consensus       500 tgg~D~~I~iwd~~~~~~~~~~~~~~~~~~~~~~~v~~l~~~~~~~~~las~~~Dg~v~lWd~~~~~~~~~~~~H~~~V~  579 (793)
T PLN00181        500 TAGVNKKIKIFECESIIKDGRDIHYPVVELASRSKLSGICWNSYIKSQVASSNFEGVVQVWDVARSQLVTEMKEHEKRVW  579 (793)
T ss_pred             EEeCCCEEEEEECCcccccccccccceEEecccCceeeEEeccCCCCEEEEEeCCCeEEEEECCCCeEEEEecCCCCCEE
Confidence            456677888887522    11110 111123446799999997 478999999999999999999988888889999999


Q ss_pred             EEEEccCCCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeC-CCCCEEEEEeCCCcEEEEEcccccCCc
Q 022074           86 TVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSR-GDGRYLISNGKDQAIKLWDIRKMSSNA  164 (303)
Q Consensus        86 ~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~-~~~~~l~s~~~D~~v~lWdl~~~~~~~  164 (303)
                      +++|++.++++|+||+.|++|++||++.    ......+..+ ..+.++.+. +++.+|++|+.|+.|++||++....  
T Consensus       580 ~l~~~p~~~~~L~Sgs~Dg~v~iWd~~~----~~~~~~~~~~-~~v~~v~~~~~~g~~latgs~dg~I~iwD~~~~~~--  652 (793)
T PLN00181        580 SIDYSSADPTLLASGSDDGSVKLWSINQ----GVSIGTIKTK-ANICCVQFPSESGRSLAFGSADHKVYYYDLRNPKL--  652 (793)
T ss_pred             EEEEcCCCCCEEEEEcCCCEEEEEECCC----CcEEEEEecC-CCeEEEEEeCCCCCEEEEEeCCCeEEEEECCCCCc--
Confidence            9999876788999999999999999863    2334444433 568888884 5789999999999999999874221  


Q ss_pred             ccccCccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCC--
Q 022074          165 SCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSG--  242 (303)
Q Consensus       165 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~--  242 (303)
                                                     .+..+.+|......+      .|. ++.++++++.|++|++||+..+  
T Consensus       653 -------------------------------~~~~~~~h~~~V~~v------~f~-~~~~lvs~s~D~~ikiWd~~~~~~  694 (793)
T PLN00181        653 -------------------------------PLCTMIGHSKTVSYV------RFV-DSSTLVSSSTDNTLKLWDLSMSIS  694 (793)
T ss_pred             -------------------------------cceEecCCCCCEEEE------EEe-CCCEEEEEECCCEEEEEeCCCCcc
Confidence                                           011222232211111      233 4678999999999999999743  


Q ss_pred             ----eEEEEeecCCCCeEEEEECCCCCeEEEEeCCCCEEEeecC
Q 022074          243 ----EQVAALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFP  282 (303)
Q Consensus       243 ----~~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~  282 (303)
                          +.+..+.+|...++.++|+|++++|++|+.|+.+++|+..
T Consensus       695 ~~~~~~l~~~~gh~~~i~~v~~s~~~~~lasgs~D~~v~iw~~~  738 (793)
T PLN00181        695 GINETPLHSFMGHTNVKNFVGLSVSDGYIATGSETNEVFVYHKA  738 (793)
T ss_pred             ccCCcceEEEcCCCCCeeEEEEcCCCCEEEEEeCCCEEEEEECC
Confidence                5677889999999999999999999999999999999964


No 52 
>cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto
Probab=99.94  E-value=2.4e-24  Score=184.38  Aligned_cols=239  Identities=25%  Similarity=0.408  Sum_probs=178.0

Q ss_pred             CCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCcccc
Q 022074           36 GGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLN  115 (303)
Q Consensus        36 ~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~  115 (303)
                      .+|+.+|.+++|+|+++.+++++.||.+++|++.++.....+..|...+..+.|.++ ++.+++++.|+.|++||...  
T Consensus         6 ~~h~~~i~~~~~~~~~~~l~~~~~~g~i~i~~~~~~~~~~~~~~~~~~i~~~~~~~~-~~~l~~~~~~~~i~i~~~~~--   82 (289)
T cd00200           6 KGHTGGVTCVAFSPDGKLLATGSGDGTIKVWDLETGELLRTLKGHTGPVRDVAASAD-GTYLASGSSDKTIRLWDLET--   82 (289)
T ss_pred             cccCCCEEEEEEcCCCCEEEEeecCcEEEEEEeeCCCcEEEEecCCcceeEEEECCC-CCEEEEEcCCCeEEEEEcCc--
Confidence            489999999999999999999999999999999988877778889889989999754 57899999999999999752  


Q ss_pred             CCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCC-C
Q 022074          116 VKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPC-D  194 (303)
Q Consensus       116 ~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~  194 (303)
                        ......+..|...+.++.+.+++.++++++.|+.+++||++.........    .....+..+.+.+....+.... +
T Consensus        83 --~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~----~~~~~i~~~~~~~~~~~l~~~~~~  156 (289)
T cd00200          83 --GECVRTLTGHTSYVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLR----GHTDWVNSVAFSPDGTFVASSSQD  156 (289)
T ss_pred             --ccceEEEeccCCcEEEEEEcCCCCEEEEecCCCeEEEEECCCcEEEEEec----cCCCcEEEEEEcCcCCEEEEEcCC
Confidence              23455667888899999999998888888889999999997432221111    0111122233333333332222 3


Q ss_pred             CcceEEecccc-eeeeEE----EeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEECCCCCeEEE
Q 022074          195 QSVATYKGHSV-LRTLIR----CHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQPMLVS  269 (303)
Q Consensus       195 ~~~~~~~~~~~-~~~~~~----~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~las  269 (303)
                      ..+..++.... ......    ......++++++.+++++.|+.|++||..+++.+..+..|..++.+++|+|++.++++
T Consensus       157 ~~i~i~d~~~~~~~~~~~~~~~~i~~~~~~~~~~~l~~~~~~~~i~i~d~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~  236 (289)
T cd00200         157 GTIKLWDLRTGKCVATLTGHTGEVNSVAFSPDGEKLLSSSSDGTIKLWDLSTGKCLGTLRGHENGVNSVAFSPDGYLLAS  236 (289)
T ss_pred             CcEEEEEccccccceeEecCccccceEEECCCcCEEEEecCCCcEEEEECCCCceecchhhcCCceEEEEEcCCCcEEEE
Confidence            33333322110 000111    1123446788888999999999999999998888888889999999999999999999


Q ss_pred             EeCCCCEEEeecCC
Q 022074          270 SSWDGDVVRWEFPG  283 (303)
Q Consensus       270 ~s~Dg~i~~Wd~~~  283 (303)
                      ++.|+.+++|++..
T Consensus       237 ~~~~~~i~i~~~~~  250 (289)
T cd00200         237 GSEDGTIRVWDLRT  250 (289)
T ss_pred             EcCCCcEEEEEcCC
Confidence            98899999999864


No 53 
>KOG0273 consensus Beta-transducin family (WD-40 repeat) protein [Chromatin structure and dynamics]
Probab=99.94  E-value=1.9e-25  Score=194.05  Aligned_cols=236  Identities=22%  Similarity=0.390  Sum_probs=178.9

Q ss_pred             CCcccceEEEEEcCCCC-EEEEeeCCCeEEEEECCCCc--eEE------------EEecccCCeEEEEEccCCCcEEEEe
Q 022074           36 GGYSFGIFSLKFSTDGR-ELVAGSSDDCIYVYDLEANK--LSL------------RILAHTSDVNTVCFGDESGHLIYSG  100 (303)
Q Consensus        36 ~~~~~~v~~l~~s~~g~-~l~sgs~Dg~v~lwd~~~~~--~~~------------~~~~h~~~v~~l~~~~~~~~~l~s~  100 (303)
                      -+|+.+|.+++|+|-.. .+++|+.|-+.|+|++....  ...            +-...+..|++++|+. +++.|++|
T Consensus       175 l~~~~~V~~~~WnP~~~~llasg~~~s~ari~~l~e~~~~~~~q~~lrh~~~~~~~s~~~nkdVT~L~Wn~-~G~~LatG  253 (524)
T KOG0273|consen  175 LRHESEVFICAWNPLRDGLLASGSGDSTARIWNLLENSNIGSTQLVLRHCIREGGKSVPSNKDVTSLDWNN-DGTLLATG  253 (524)
T ss_pred             ccCCCceEEEecCchhhhhhhccCCccceeeeeehhhccccchhhhhhhhhhhhcccCCccCCcceEEecC-CCCeEEEe
Confidence            35999999999999666 89999999999999997511  100            1112346899999975 58999999


Q ss_pred             cCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceee
Q 022074          101 SDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWM  180 (303)
Q Consensus       101 s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~  180 (303)
                      +.||.+|+|+.     .+..+..+..|.++|.++.++..|+||++++.|+++.+||...........+.... ..++.|+
T Consensus       254 ~~~G~~riw~~-----~G~l~~tl~~HkgPI~slKWnk~G~yilS~~vD~ttilwd~~~g~~~q~f~~~s~~-~lDVdW~  327 (524)
T KOG0273|consen  254 SEDGEARIWNK-----DGNLISTLGQHKGPIFSLKWNKKGTYILSGGVDGTTILWDAHTGTVKQQFEFHSAP-ALDVDWQ  327 (524)
T ss_pred             ecCcEEEEEec-----CchhhhhhhccCCceEEEEEcCCCCEEEeccCCccEEEEeccCceEEEeeeeccCC-ccceEEe
Confidence            99999999995     45567788889999999999999999999999999999998654322222211111 1222222


Q ss_pred             eC------CCC--CccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCC
Q 022074          181 DY------PPQ--ARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHT  252 (303)
Q Consensus       181 ~~------~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~  252 (303)
                      ..      .++  ..-.+...++++.++.||......++      |.|.+++|++++.|++++||..........+.+|+
T Consensus       328 ~~~~F~ts~td~~i~V~kv~~~~P~~t~~GH~g~V~alk------~n~tg~LLaS~SdD~TlkiWs~~~~~~~~~l~~Hs  401 (524)
T KOG0273|consen  328 SNDEFATSSTDGCIHVCKVGEDRPVKTFIGHHGEVNALK------WNPTGSLLASCSDDGTLKIWSMGQSNSVHDLQAHS  401 (524)
T ss_pred             cCceEeecCCCceEEEEEecCCCcceeeecccCceEEEE------ECCCCceEEEecCCCeeEeeecCCCcchhhhhhhc
Confidence            21      111  11123344667778888876554443      56779999999999999999998888888899999


Q ss_pred             CCeEEEEECCCCC---------eEEEEeCCCCEEEeecCCC
Q 022074          253 SPVRDCSWHPSQP---------MLVSSSWDGDVVRWEFPGN  284 (303)
Q Consensus       253 ~~I~~v~~sp~~~---------~las~s~Dg~i~~Wd~~~~  284 (303)
                      ..|..+.|||+++         .+++++.|+++++||+...
T Consensus       402 kei~t~~wsp~g~v~~n~~~~~~l~sas~dstV~lwdv~~g  442 (524)
T KOG0273|consen  402 KEIYTIKWSPTGPVTSNPNMNLMLASASFDSTVKLWDVESG  442 (524)
T ss_pred             cceeeEeecCCCCccCCCcCCceEEEeecCCeEEEEEccCC
Confidence            9999999999864         8999999999999998643


No 54 
>KOG1446 consensus Histone H3 (Lys4) methyltransferase complex and RNA cleavage factor II complex, subunit SWD2 [RNA processing and modification; Chromatin structure and dynamics; Posttranslational modification, protein turnover, chaperones]
Probab=99.94  E-value=3.2e-24  Score=178.50  Aligned_cols=236  Identities=21%  Similarity=0.314  Sum_probs=176.4

Q ss_pred             cccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecC--CCeEEEEcCcccc
Q 022074           38 YSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSD--DNLCKVWDRRCLN  115 (303)
Q Consensus        38 ~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~--dg~v~lWd~~~~~  115 (303)
                      ....|.++.|+++|..+++++.|.+++|||..+++....+..++.+|..++|.+... .++.++.  |.+||+-++.   
T Consensus        13 ~~~~i~sl~fs~~G~~litss~dDsl~LYd~~~g~~~~ti~skkyG~~~~~Fth~~~-~~i~sStk~d~tIryLsl~---   88 (311)
T KOG1446|consen   13 TNGKINSLDFSDDGLLLITSSEDDSLRLYDSLSGKQVKTINSKKYGVDLACFTHHSN-TVIHSSTKEDDTIRYLSLH---   88 (311)
T ss_pred             CCCceeEEEecCCCCEEEEecCCCeEEEEEcCCCceeeEeecccccccEEEEecCCc-eEEEccCCCCCceEEEEee---
Confidence            456799999999999999999999999999999999988888888999999976544 4444544  8899988864   


Q ss_pred             CCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCC
Q 022074          116 VKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQ  195 (303)
Q Consensus       116 ~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  195 (303)
                       .++.++.|.||...|+.+..+|-+..+++++.|++||+||+|..++..-.....+.      ..++.|++-.++..+..
T Consensus        89 -dNkylRYF~GH~~~V~sL~~sP~~d~FlS~S~D~tvrLWDlR~~~cqg~l~~~~~p------i~AfDp~GLifA~~~~~  161 (311)
T KOG1446|consen   89 -DNKYLRYFPGHKKRVNSLSVSPKDDTFLSSSLDKTVRLWDLRVKKCQGLLNLSGRP------IAAFDPEGLIFALANGS  161 (311)
T ss_pred             -cCceEEEcCCCCceEEEEEecCCCCeEEecccCCeEEeeEecCCCCceEEecCCCc------ceeECCCCcEEEEecCC
Confidence             45678899999999999999999999999999999999999965443322221111      12233333333222222


Q ss_pred             -cceEE-----ecccceeeeE----EEee-eeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCe---EEEEEC
Q 022074          196 -SVATY-----KGHSVLRTLI----RCHF-SPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPV---RDCSWH  261 (303)
Q Consensus       196 -~~~~~-----~~~~~~~~~~----~~~~-~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I---~~v~~s  261 (303)
                       .+..+     +...+....+    .+.+ ...|||+|++++.....+.+++.|.-.|..+..+..+...-   .+..|+
T Consensus       162 ~~IkLyD~Rs~dkgPF~tf~i~~~~~~ew~~l~FS~dGK~iLlsT~~s~~~~lDAf~G~~~~tfs~~~~~~~~~~~a~ft  241 (311)
T KOG1446|consen  162 ELIKLYDLRSFDKGPFTTFSITDNDEAEWTDLEFSPDGKSILLSTNASFIYLLDAFDGTVKSTFSGYPNAGNLPLSATFT  241 (311)
T ss_pred             CeEEEEEecccCCCCceeEccCCCCccceeeeEEcCCCCEEEEEeCCCcEEEEEccCCcEeeeEeeccCCCCcceeEEEC
Confidence             22211     1111111111    1111 23589999999999999999999999999988888775433   688999


Q ss_pred             CCCCeEEEEeCCCCEEEeecCCC
Q 022074          262 PSQPMLVSSSWDGDVVRWEFPGN  284 (303)
Q Consensus       262 p~~~~las~s~Dg~i~~Wd~~~~  284 (303)
                      ||++++.+++.||+|.+|++...
T Consensus       242 Pds~Fvl~gs~dg~i~vw~~~tg  264 (311)
T KOG1446|consen  242 PDSKFVLSGSDDGTIHVWNLETG  264 (311)
T ss_pred             CCCcEEEEecCCCcEEEEEcCCC
Confidence            99999999999999999998644


No 55 
>KOG0285 consensus Pleiotropic regulator 1 [RNA processing and modification]
Probab=99.94  E-value=5.4e-25  Score=185.63  Aligned_cols=255  Identities=16%  Similarity=0.261  Sum_probs=192.0

Q ss_pred             cCchhhccccccccccCc-CcccccCCCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEE
Q 022074           11 GSGTMESLANVTEIHDGL-DFSAADDGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCF   89 (303)
Q Consensus        11 ~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~   89 (303)
                      -+|+||-.|.||++-+|. +.   ..+||-.-|..+++|+--.++.+++.|+.|+-||+...+.+..+.+|-..|.|+..
T Consensus       167 ~tgs~DrtikIwDlatg~Lkl---tltGhi~~vr~vavS~rHpYlFs~gedk~VKCwDLe~nkvIR~YhGHlS~V~~L~l  243 (460)
T KOG0285|consen  167 ATGSADRTIKIWDLATGQLKL---TLTGHIETVRGVAVSKRHPYLFSAGEDKQVKCWDLEYNKVIRHYHGHLSGVYCLDL  243 (460)
T ss_pred             EecCCCceeEEEEcccCeEEE---eecchhheeeeeeecccCceEEEecCCCeeEEEechhhhhHHHhccccceeEEEec
Confidence            368899999999998887 44   56899999999999999999999999999999999999988888999999999999


Q ss_pred             ccCCCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccC
Q 022074           90 GDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLG  169 (303)
Q Consensus        90 ~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~  169 (303)
                      +| .-++++||+.|.++|+||.|    +...+..+.||...|..+.+.+-+.+++||+-|++||+||++..+.-..... 
T Consensus       244 hP-Tldvl~t~grDst~RvWDiR----tr~~V~~l~GH~~~V~~V~~~~~dpqvit~S~D~tvrlWDl~agkt~~tlt~-  317 (460)
T KOG0285|consen  244 HP-TLDVLVTGGRDSTIRVWDIR----TRASVHVLSGHTNPVASVMCQPTDPQVITGSHDSTVRLWDLRAGKTMITLTH-  317 (460)
T ss_pred             cc-cceeEEecCCcceEEEeeec----ccceEEEecCCCCcceeEEeecCCCceEEecCCceEEEeeeccCceeEeeec-
Confidence            75 46799999999999999998    3445778999999999999988888999999999999999986543211111 


Q ss_pred             ccceeeeceeeeCCCCCcccc-----------CCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEE
Q 022074          170 FRSYEWDYRWMDYPPQARDLK-----------HPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYD  238 (303)
Q Consensus       170 ~~~~~~~~~~~~~~~~~~~~~-----------~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd  238 (303)
                         ..-.++.+...|....+.           .+...-+..+.+|..+...+.      .. +..++++|++.|.+.+||
T Consensus       318 ---hkksvral~lhP~e~~fASas~dnik~w~~p~g~f~~nlsgh~~iintl~------~n-sD~v~~~G~dng~~~fwd  387 (460)
T KOG0285|consen  318 ---HKKSVRALCLHPKENLFASASPDNIKQWKLPEGEFLQNLSGHNAIINTLS------VN-SDGVLVSGGDNGSIMFWD  387 (460)
T ss_pred             ---ccceeeEEecCCchhhhhccCCccceeccCCccchhhccccccceeeeee------ec-cCceEEEcCCceEEEEEe
Confidence               011112222222222121           122222333445543332221      12 235789999999999999


Q ss_pred             CCCCeEEEEe---e-----cCCCCeEEEEECCCCCeEEEEeCCCCEEEeecCCC
Q 022074          239 LVSGEQVAAL---K-----YHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFPGN  284 (303)
Q Consensus       239 ~~~~~~~~~~---~-----~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~~~  284 (303)
                      .++|-....+   .     ..+..|.+.+|...+..|+||..|.+|++|.-...
T Consensus       388 wksg~nyQ~~~t~vqpGSl~sEagI~as~fDktg~rlit~eadKtIk~~keDe~  441 (460)
T KOG0285|consen  388 WKSGHNYQRGQTIVQPGSLESEAGIFASCFDKTGSRLITGEADKTIKMYKEDEH  441 (460)
T ss_pred             cCcCcccccccccccCCccccccceeEEeecccCceEEeccCCcceEEEecccc
Confidence            9988533222   1     12457999999999999999999999999986544


No 56 
>KOG0281 consensus Beta-TrCP (transducin repeats containing)/Slimb proteins [Function unknown]
Probab=99.94  E-value=2.7e-26  Score=192.87  Aligned_cols=193  Identities=24%  Similarity=0.440  Sum_probs=158.5

Q ss_pred             cceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCc
Q 022074           40 FGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGK  119 (303)
Q Consensus        40 ~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~  119 (303)
                      -+|+|+.+.  .+.+++|..|++|+|||.++-.....+.+|++.|-|+.|.   ..+++|||.|.+|++||..    +++
T Consensus       198 kgVYClQYD--D~kiVSGlrDnTikiWD~n~~~c~~~L~GHtGSVLCLqyd---~rviisGSSDsTvrvWDv~----tge  268 (499)
T KOG0281|consen  198 KGVYCLQYD--DEKIVSGLRDNTIKIWDKNSLECLKILTGHTGSVLCLQYD---ERVIVSGSSDSTVRVWDVN----TGE  268 (499)
T ss_pred             CceEEEEec--chhhhcccccCceEEeccccHHHHHhhhcCCCcEEeeecc---ceEEEecCCCceEEEEecc----CCc
Confidence            379999987  3469999999999999999888778899999999999993   4599999999999999975    667


Q ss_pred             cceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCcceE
Q 022074          120 PAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVAT  199 (303)
Q Consensus       120 ~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  199 (303)
                      +..++.+|.++|..+.|+.  .+++|++.|+++.+||+...... .                              ....
T Consensus       269 ~l~tlihHceaVLhlrf~n--g~mvtcSkDrsiaVWdm~sps~i-t------------------------------~rrV  315 (499)
T KOG0281|consen  269 PLNTLIHHCEAVLHLRFSN--GYMVTCSKDRSIAVWDMASPTDI-T------------------------------LRRV  315 (499)
T ss_pred             hhhHHhhhcceeEEEEEeC--CEEEEecCCceeEEEeccCchHH-H------------------------------HHHH
Confidence            7888889999999998864  49999999999999998643210 0                              1112


Q ss_pred             EecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEECCCCCeEEEEeCCCCEEEe
Q 022074          200 YKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVRW  279 (303)
Q Consensus       200 ~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~W  279 (303)
                      +.||......      .  ..+.+|+++++.|.+|++|++.+++.+.++.+|+..|.++.+  .+++++|||.|.+|++|
T Consensus       316 LvGHrAaVNv------V--dfd~kyIVsASgDRTikvW~~st~efvRtl~gHkRGIAClQY--r~rlvVSGSSDntIRlw  385 (499)
T KOG0281|consen  316 LVGHRAAVNV------V--DFDDKYIVSASGDRTIKVWSTSTCEFVRTLNGHKRGIACLQY--RDRLVVSGSSDNTIRLW  385 (499)
T ss_pred             Hhhhhhheee------e--ccccceEEEecCCceEEEEeccceeeehhhhcccccceehhc--cCeEEEecCCCceEEEE
Confidence            2333211111      1  125679999999999999999999999999999999998877  78999999999999999


Q ss_pred             ecCCC
Q 022074          280 EFPGN  284 (303)
Q Consensus       280 d~~~~  284 (303)
                      |+...
T Consensus       386 di~~G  390 (499)
T KOG0281|consen  386 DIECG  390 (499)
T ss_pred             ecccc
Confidence            98754


No 57 
>KOG0288 consensus WD40 repeat protein TipD [General function prediction only]
Probab=99.93  E-value=2.8e-25  Score=190.25  Aligned_cols=263  Identities=24%  Similarity=0.341  Sum_probs=182.4

Q ss_pred             EEEEEccCc-------hhhccccccccccCcCcccccCCCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEE
Q 022074            5 VHIVDVGSG-------TMESLANVTEIHDGLDFSAADDGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRI   77 (303)
Q Consensus         5 ~~~~~~~~~-------~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~   77 (303)
                      ||.+--+.+       -||..|.+|+...++-.--....|-.++|.++.|.++++.+++++.|+.+++|++...++..++
T Consensus       178 v~~v~~l~~sdtlatgg~Dr~Ik~W~v~~~k~~~~~tLaGs~g~it~~d~d~~~~~~iAas~d~~~r~Wnvd~~r~~~TL  257 (459)
T KOG0288|consen  178 VHDVEFLRNSDTLATGGSDRIIKLWNVLGEKSELISTLAGSLGNITSIDFDSDNKHVIAASNDKNLRLWNVDSLRLRHTL  257 (459)
T ss_pred             cceeEEccCcchhhhcchhhhhhhhhcccchhhhhhhhhccCCCcceeeecCCCceEEeecCCCceeeeeccchhhhhhh
Confidence            455555544       5889999999977772213455788889999999999999999999999999999999998999


Q ss_pred             ecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEc
Q 022074           78 LAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDI  157 (303)
Q Consensus        78 ~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl  157 (303)
                      .+|++.|+++.|.. ....+++|+.|.++++||+.......   ..+  ....+..+..+  ...+++|-.|++||+||.
T Consensus       258 sGHtdkVt~ak~~~-~~~~vVsgs~DRtiK~WDl~k~~C~k---t~l--~~S~cnDI~~~--~~~~~SgH~DkkvRfwD~  329 (459)
T KOG0288|consen  258 SGHTDKVTAAKFKL-SHSRVVSGSADRTIKLWDLQKAYCSK---TVL--PGSQCNDIVCS--ISDVISGHFDKKVRFWDI  329 (459)
T ss_pred             cccccceeeehhhc-cccceeeccccchhhhhhhhhhheec---ccc--ccccccceEec--ceeeeecccccceEEEec
Confidence            99999999999953 34458999999999999985211111   112  22333344333  446889999999999999


Q ss_pred             ccccCCcccccCccceeeeceeeeCCCCCccc-cCCCCCcceEEeccccee------eeEEEe--e-eeeeeCCCeEEEE
Q 022074          158 RKMSSNASCNLGFRSYEWDYRWMDYPPQARDL-KHPCDQSVATYKGHSVLR------TLIRCH--F-SPVYSTGQKYIYT  227 (303)
Q Consensus       158 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~------~~~~~~--~-~~~~s~~~~~lat  227 (303)
                      |..........+.+..     .+........+ ....+..+..++.....+      ...++.  + ..+|||++.|+|+
T Consensus       330 Rs~~~~~sv~~gg~vt-----Sl~ls~~g~~lLsssRDdtl~viDlRt~eI~~~~sA~g~k~asDwtrvvfSpd~~YvaA  404 (459)
T KOG0288|consen  330 RSADKTRSVPLGGRVT-----SLDLSMDGLELLSSSRDDTLKVIDLRTKEIRQTFSAEGFKCASDWTRVVFSPDGSYVAA  404 (459)
T ss_pred             cCCceeeEeecCccee-----eEeeccCCeEEeeecCCCceeeeecccccEEEEeeccccccccccceeEECCCCceeee
Confidence            8765443333221111     11111111111 111122222222211100      001111  1 2458999999999


Q ss_pred             EeCCCeEEEEECCCCeEEEEeecCCCC--eEEEEECCCCCeEEEEeCCCCEEEee
Q 022074          228 GSHDSCVYVYDLVSGEQVAALKYHTSP--VRDCSWHPSQPMLVSSSWDGDVVRWE  280 (303)
Q Consensus       228 g~~dg~i~iwd~~~~~~~~~~~~h~~~--I~~v~~sp~~~~las~s~Dg~i~~Wd  280 (303)
                      ||.||.|+||++.++++.+.++....+  |++++|+|.|..|++++-++.+.+|.
T Consensus       405 GS~dgsv~iW~v~tgKlE~~l~~s~s~~aI~s~~W~~sG~~Llsadk~~~v~lW~  459 (459)
T KOG0288|consen  405 GSADGSVYIWSVFTGKLEKVLSLSTSNAAITSLSWNPSGSGLLSADKQKAVTLWT  459 (459)
T ss_pred             ccCCCcEEEEEccCceEEEEeccCCCCcceEEEEEcCCCchhhcccCCcceEecC
Confidence            999999999999999998888755544  99999999999999999999999994


No 58 
>KOG0283 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.93  E-value=1.1e-24  Score=201.12  Aligned_cols=237  Identities=21%  Similarity=0.284  Sum_probs=160.3

Q ss_pred             CcccceEEEEEcCCCCEEEEeeCCCeEEEEECCC--------------------------------Cc------------
Q 022074           37 GYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEA--------------------------------NK------------   72 (303)
Q Consensus        37 ~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~--------------------------------~~------------   72 (303)
                      +|+.+|.++.||+||++||+||.|+.|+||.+..                                ..            
T Consensus       265 ah~gaIw~mKFS~DGKyLAsaGeD~virVWkVie~e~~~~~~~~~~~~~~~~~~~s~~~p~~s~~~~~~~~~s~~~~~~~  344 (712)
T KOG0283|consen  265 AHKGAIWAMKFSHDGKYLASAGEDGVIRVWKVIESERMRVAEGDSSCMYFEYNANSQIEPSTSSEEKISSRTSSSRKGSQ  344 (712)
T ss_pred             ccCCcEEEEEeCCCCceeeecCCCceEEEEEEeccchhcccccccchhhhhhhhccccCccccccccccccccccccccC
Confidence            9999999999999999999999999999997755                                00            


Q ss_pred             ----------------eEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEe
Q 022074           73 ----------------LSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDS  136 (303)
Q Consensus        73 ----------------~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~  136 (303)
                                      ....+.+|.+.|-.+.|+ + .++|+|++.|.|||||.+..    ....++| .|.+-|+|++|
T Consensus       345 s~~~~~p~~~f~f~ekP~~ef~GHt~DILDlSWS-K-n~fLLSSSMDKTVRLWh~~~----~~CL~~F-~HndfVTcVaF  417 (712)
T KOG0283|consen  345 SPCVLLPLKAFVFSEKPFCEFKGHTADILDLSWS-K-NNFLLSSSMDKTVRLWHPGR----KECLKVF-SHNDFVTCVAF  417 (712)
T ss_pred             CccccCCCccccccccchhhhhccchhheecccc-c-CCeeEeccccccEEeecCCC----cceeeEE-ecCCeeEEEEe
Confidence                            112357899999999996 3 46899999999999999752    2345555 59999999999


Q ss_pred             CC-CCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCc-ceEEe--cccceee-eEE
Q 022074          137 RG-DGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQS-VATYK--GHSVLRT-LIR  211 (303)
Q Consensus       137 ~~-~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~--~~~~~~~-~~~  211 (303)
                      +| |++||++|+-|+.||||++.......     ...+..-+..+.|.|+++.....+-.. +..+.  +..+... .+.
T Consensus       418 nPvDDryFiSGSLD~KvRiWsI~d~~Vv~-----W~Dl~~lITAvcy~PdGk~avIGt~~G~C~fY~t~~lk~~~~~~I~  492 (712)
T KOG0283|consen  418 NPVDDRYFISGSLDGKVRLWSISDKKVVD-----WNDLRDLITAVCYSPDGKGAVIGTFNGYCRFYDTEGLKLVSDFHIR  492 (712)
T ss_pred             cccCCCcEeecccccceEEeecCcCeeEe-----ehhhhhhheeEEeccCCceEEEEEeccEEEEEEccCCeEEEeeeEe
Confidence            98 78999999999999999875422100     000000112233444444332221111 00000  0000000 000


Q ss_pred             --------Ee--eeeeeeC-CCeEEEEEeCCCeEEEEECCCCeEEEEeecCCC--CeEEEEECCCCCeEEEEeCCCCEEE
Q 022074          212 --------CH--FSPVYST-GQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTS--PVRDCSWHPSQPMLVSSSWDGDVVR  278 (303)
Q Consensus       212 --------~~--~~~~~s~-~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~--~I~~v~~sp~~~~las~s~Dg~i~~  278 (303)
                              +.  ....|.| +...++..+.|..|||+|.++.+++..|+++..  .=....|+.||++|+++++|..+++
T Consensus       493 ~~~~Kk~~~~rITG~Q~~p~~~~~vLVTSnDSrIRI~d~~~~~lv~KfKG~~n~~SQ~~Asfs~Dgk~IVs~seDs~VYi  572 (712)
T KOG0283|consen  493 LHNKKKKQGKRITGLQFFPGDPDEVLVTSNDSRIRIYDGRDKDLVHKFKGFRNTSSQISASFSSDGKHIVSASEDSWVYI  572 (712)
T ss_pred             eccCccccCceeeeeEecCCCCCeEEEecCCCceEEEeccchhhhhhhcccccCCcceeeeEccCCCEEEEeecCceEEE
Confidence                    00  0011222 223467778999999999988888888886542  3457789999999999999999999


Q ss_pred             eecCCCC
Q 022074          279 WEFPGNG  285 (303)
Q Consensus       279 Wd~~~~~  285 (303)
                      |+.+...
T Consensus       573 W~~~~~~  579 (712)
T KOG0283|consen  573 WKNDSFN  579 (712)
T ss_pred             EeCCCCc
Confidence            9986543


No 59 
>KOG0305 consensus Anaphase promoting complex, Cdc20, Cdh1, and Ama1 subunits [Cell cycle control, cell division, chromosome partitioning; Posttranslational modification, protein turnover, chaperones]
Probab=99.93  E-value=3.2e-24  Score=192.99  Aligned_cols=266  Identities=21%  Similarity=0.280  Sum_probs=191.7

Q ss_pred             EEEccCchhhccccccccccCcCcccccCCCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEec-ccCCeE
Q 022074            7 IVDVGSGTMESLANVTEIHDGLDFSAADDGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILA-HTSDVN   85 (303)
Q Consensus         7 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~-h~~~v~   85 (303)
                      .|-||+|.-=|||+-   .+|. -....+.+ +..|.++.|+++|.+|++|..+|.|.|||..+.+....+.. |...|-
T Consensus       190 ~laValg~~vylW~~---~s~~-v~~l~~~~-~~~vtSv~ws~~G~~LavG~~~g~v~iwD~~~~k~~~~~~~~h~~rvg  264 (484)
T KOG0305|consen  190 VLAVALGQSVYLWSA---SSGS-VTELCSFG-EELVTSVKWSPDGSHLAVGTSDGTVQIWDVKEQKKTRTLRGSHASRVG  264 (484)
T ss_pred             eEEEEecceEEEEec---CCCc-eEEeEecC-CCceEEEEECCCCCEEEEeecCCeEEEEehhhccccccccCCcCceeE
Confidence            467788877777732   1222 00111112 56699999999999999999999999999998877777777 999999


Q ss_pred             EEEEccCCCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcc
Q 022074           86 TVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNAS  165 (303)
Q Consensus        86 ~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~  165 (303)
                      +++|+   ...+.+|+.|+.|..+|++....   ....+.+|...|..+.+++++.++++||.|+.+.|||.........
T Consensus       265 ~laW~---~~~lssGsr~~~I~~~dvR~~~~---~~~~~~~H~qeVCgLkws~d~~~lASGgnDN~~~Iwd~~~~~p~~~  338 (484)
T KOG0305|consen  265 SLAWN---SSVLSSGSRDGKILNHDVRISQH---VVSTLQGHRQEVCGLKWSPDGNQLASGGNDNVVFIWDGLSPEPKFT  338 (484)
T ss_pred             EEecc---CceEEEecCCCcEEEEEEecchh---hhhhhhcccceeeeeEECCCCCeeccCCCccceEeccCCCccccEE
Confidence            99996   56899999999999999985432   2224778999999999999999999999999999999854322221


Q ss_pred             cccCccceeeeceeeeCCCCCccccCC----CCCcceEEecccc--eeee--EEEeeeeeeeCCCeEEEE--EeCCCeEE
Q 022074          166 CNLGFRSYEWDYRWMDYPPQARDLKHP----CDQSVATYKGHSV--LRTL--IRCHFSPVYSTGQKYIYT--GSHDSCVY  235 (303)
Q Consensus       166 ~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~--~~~~--~~~~~~~~~s~~~~~lat--g~~dg~i~  235 (303)
                          +..+..+++.+.++|....+...    .++.+.-++-...  +..+  -....+..+++..+.|++  |..+..|.
T Consensus       339 ----~~~H~aAVKA~awcP~q~~lLAsGGGs~D~~i~fwn~~~g~~i~~vdtgsQVcsL~Wsk~~kEi~sthG~s~n~i~  414 (484)
T KOG0305|consen  339 ----FTEHTAAVKALAWCPWQSGLLATGGGSADRCIKFWNTNTGARIDSVDTGSQVCSLIWSKKYKELLSTHGYSENQIT  414 (484)
T ss_pred             ----EeccceeeeEeeeCCCccCceEEcCCCcccEEEEEEcCCCcEecccccCCceeeEEEcCCCCEEEEecCCCCCcEE
Confidence                22333445556665544333221    1233333221111  0000  001223456666655555  45788999


Q ss_pred             EEECCCCeEEEEeecCCCCeEEEEECCCCCeEEEEeCCCCEEEeecCCCCcc
Q 022074          236 VYDLVSGEQVAALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFPGNGEA  287 (303)
Q Consensus       236 iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~~~~~~  287 (303)
                      ||+..+.+++..+.+|..+|..+++||||..+++|+.|.++++|++-...+.
T Consensus       415 lw~~ps~~~~~~l~gH~~RVl~la~SPdg~~i~t~a~DETlrfw~~f~~~~~  466 (484)
T KOG0305|consen  415 LWKYPSMKLVAELLGHTSRVLYLALSPDGETIVTGAADETLRFWNLFDERPK  466 (484)
T ss_pred             EEeccccceeeeecCCcceeEEEEECCCCCEEEEecccCcEEeccccCCCCc
Confidence            9999999999999999999999999999999999999999999998765333


No 60 
>KOG0299 consensus U3 snoRNP-associated protein (contains WD40 repeats) [RNA processing and modification]
Probab=99.93  E-value=4.6e-24  Score=185.03  Aligned_cols=239  Identities=21%  Similarity=0.354  Sum_probs=183.2

Q ss_pred             CchhhccccccccccCcCc---cc-----------ccCC--CcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEE
Q 022074           12 SGTMESLANVTEIHDGLDF---SA-----------ADDG--GYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSL   75 (303)
Q Consensus        12 ~~~~~~~~~~~~~~~~~~~---~~-----------~~~~--~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~   75 (303)
                      +++-++.|.=|.+.+|++-   ++           .-+.  +|..-+.+++.|+||++|++|+.|..|.||+..+...+.
T Consensus       159 sask~g~i~kw~v~tgk~~~~i~~~~ev~k~~~~~~k~~r~~h~keil~~avS~Dgkylatgg~d~~v~Iw~~~t~ehv~  238 (479)
T KOG0299|consen  159 SASKDGTILKWDVLTGKKDRYIIERDEVLKSHGNPLKESRKGHVKEILTLAVSSDGKYLATGGRDRHVQIWDCDTLEHVK  238 (479)
T ss_pred             ecCCCcceeeeehhcCcccccccccchhhhhccCCCCcccccccceeEEEEEcCCCcEEEecCCCceEEEecCcccchhh
Confidence            5666777877888888732   11           1122  899999999999999999999999999999999999888


Q ss_pred             EEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEE
Q 022074           76 RILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLW  155 (303)
Q Consensus        76 ~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lW  155 (303)
                      .+.+|-+.|.+++| ....+.+++++.|++|++|++..    ...+.++.||.+.|..++...-++.+-.|+.|+++++|
T Consensus       239 ~~~ghr~~V~~L~f-r~gt~~lys~s~Drsvkvw~~~~----~s~vetlyGHqd~v~~IdaL~reR~vtVGgrDrT~rlw  313 (479)
T KOG0299|consen  239 VFKGHRGAVSSLAF-RKGTSELYSASADRSVKVWSIDQ----LSYVETLYGHQDGVLGIDALSRERCVTVGGRDRTVRLW  313 (479)
T ss_pred             cccccccceeeeee-ecCccceeeeecCCceEEEehhH----hHHHHHHhCCccceeeechhcccceEEeccccceeEEE
Confidence            88999999999999 45677899999999999999752    33466788999999999887777766677799999999


Q ss_pred             EcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEE
Q 022074          156 DIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVY  235 (303)
Q Consensus       156 dl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~  235 (303)
                      ++....     .+                              .+.++.....++.      |- +...+++|+.+|.|.
T Consensus       314 Ki~ees-----ql------------------------------ifrg~~~sidcv~------~I-n~~HfvsGSdnG~Ia  351 (479)
T KOG0299|consen  314 KIPEES-----QL------------------------------IFRGGEGSIDCVA------FI-NDEHFVSGSDNGSIA  351 (479)
T ss_pred             eccccc-----ee------------------------------eeeCCCCCeeeEE------Ee-cccceeeccCCceEE
Confidence            983211     00                              1111110011111      11 345799999999999


Q ss_pred             EEECCCCeEEEEee-cC-----------CCCeEEEEECCCCCeEEEEeCCCCEEEeecCCCCccCCCCcccccc
Q 022074          236 VYDLVSGEQVAALK-YH-----------TSPVRDCSWHPSQPMLVSSSWDGDVVRWEFPGNGEAAPPLNKKRIR  297 (303)
Q Consensus       236 iwd~~~~~~~~~~~-~h-----------~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~~~~~~~~~~~~~~~~  297 (303)
                      +|++-+.+++.+.. .|           ..+|++++..|...++|||+.+|.+++|.+..+.-+..+++...++
T Consensus       352 LWs~~KKkplf~~~~AHgv~~~~~~~~~~~Witsla~i~~sdL~asGS~~G~vrLW~i~~g~r~i~~l~~ls~~  425 (479)
T KOG0299|consen  352 LWSLLKKKPLFTSRLAHGVIPELDPVNGNFWITSLAVIPGSDLLASGSWSGCVRLWKIEDGLRAINLLYSLSLV  425 (479)
T ss_pred             EeeecccCceeEeeccccccCCccccccccceeeeEecccCceEEecCCCCceEEEEecCCccccceeeecccc
Confidence            99998888776653 23           1289999999999999999999999999998776666666554443


No 61 
>KOG0318 consensus WD40 repeat stress protein/actin interacting protein [Cytoskeleton]
Probab=99.92  E-value=3.8e-23  Score=181.98  Aligned_cols=231  Identities=20%  Similarity=0.330  Sum_probs=180.3

Q ss_pred             EEEccCchhhccccccccccCcCcccccCCCcccceEEEEEcCCCC-EEEEeeCCCeEEEEECCCCceEEEEecccCCeE
Q 022074            7 IVDVGSGTMESLANVTEIHDGLDFSAADDGGYSFGIFSLKFSTDGR-ELVAGSSDDCIYVYDLEANKLSLRILAHTSDVN   85 (303)
Q Consensus         7 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~s~~g~-~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~   85 (303)
                      |+-||-|--- +-+|.+-=+|..- + +-.||...|++++|-|.-. ++++||.|++|.+|+-+.-+....+..|...|+
T Consensus       118 I~avGEGrer-fg~~F~~DSG~Sv-G-ei~GhSr~ins~~~KpsRPfRi~T~sdDn~v~ffeGPPFKFk~s~r~HskFV~  194 (603)
T KOG0318|consen  118 IAAVGEGRER-FGHVFLWDSGNSV-G-EITGHSRRINSVDFKPSRPFRIATGSDDNTVAFFEGPPFKFKSSFREHSKFVN  194 (603)
T ss_pred             EEEEecCccc-eeEEEEecCCCcc-c-eeeccceeEeeeeccCCCceEEEeccCCCeEEEeeCCCeeeeeccccccccee
Confidence            3444444322 4455544444433 2 2279999999999998755 699999999999998888777777888999999


Q ss_pred             EEEEccCCCcEEEEecCCCeEEEEcCccccCCCccceeec---ccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccC
Q 022074           86 TVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLM---GHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSS  162 (303)
Q Consensus        86 ~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~---~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~  162 (303)
                      ++.|+|+ +++|+|++.||++.+||.+    ++...+.+.   +|.++|.+++|+||+..++|++.|+++||||+...+.
T Consensus       195 ~VRysPD-G~~Fat~gsDgki~iyDGk----tge~vg~l~~~~aHkGsIfalsWsPDs~~~~T~SaDkt~KIWdVs~~sl  269 (603)
T KOG0318|consen  195 CVRYSPD-GSRFATAGSDGKIYIYDGK----TGEKVGELEDSDAHKGSIFALSWSPDSTQFLTVSADKTIKIWDVSTNSL  269 (603)
T ss_pred             eEEECCC-CCeEEEecCCccEEEEcCC----CccEEEEecCCCCccccEEEEEECCCCceEEEecCCceEEEEEeeccce
Confidence            9999865 8999999999999999976    444455555   7999999999999999999999999999999875432


Q ss_pred             CcccccCccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCC
Q 022074          163 NASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSG  242 (303)
Q Consensus       163 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~  242 (303)
                      ......+                                 .......+.|.+.      ...|++-+.+|+|.+++....
T Consensus       270 v~t~~~~---------------------------------~~v~dqqvG~lWq------kd~lItVSl~G~in~ln~~d~  310 (603)
T KOG0318|consen  270 VSTWPMG---------------------------------STVEDQQVGCLWQ------KDHLITVSLSGTINYLNPSDP  310 (603)
T ss_pred             EEEeecC---------------------------------CchhceEEEEEEe------CCeEEEEEcCcEEEEecccCC
Confidence            2111000                                 0011112333332      457999999999999999999


Q ss_pred             eEEEEeecCCCCeEEEEECCCCCeEEEEeCCCCEEEeecCCC
Q 022074          243 EQVAALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFPGN  284 (303)
Q Consensus       243 ~~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~~~  284 (303)
                      +.+..+.+|...|++++.+|++.+|.||+.||.|.-|+....
T Consensus       311 ~~~~~i~GHnK~ITaLtv~~d~~~i~SgsyDG~I~~W~~~~g  352 (603)
T KOG0318|consen  311 SVLKVISGHNKSITALTVSPDGKTIYSGSYDGHINSWDSGSG  352 (603)
T ss_pred             ChhheecccccceeEEEEcCCCCEEEeeccCceEEEEecCCc
Confidence            999999999999999999999999999999999999997543


No 62 
>KOG0313 consensus Microtubule binding protein YTM1 (contains WD40 repeats) [Cytoskeleton]
Probab=99.92  E-value=5.8e-24  Score=180.55  Aligned_cols=228  Identities=23%  Similarity=0.326  Sum_probs=173.4

Q ss_pred             cCchhhccccccccccCcCcccccC--CCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCC-----------------
Q 022074           11 GSGTMESLANVTEIHDGLDFSAADD--GGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEAN-----------------   71 (303)
Q Consensus        11 ~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~-----------------   71 (303)
                      =+|+||-++-.|.+=.|..--....  .||+.+|-+++-.++|..+++||+|.++.||+..+.                 
T Consensus       163 vsas~Dqtl~Lw~~~~~~~~~~~~~~~~GHk~~V~sVsv~~sgtr~~SgS~D~~lkiWs~~~~~~~~~E~~s~~rrk~~~  242 (423)
T KOG0313|consen  163 VSASMDQTLRLWKWNVGENKVKALKVCRGHKRSVDSVSVDSSGTRFCSGSWDTMLKIWSVETDEEDELESSSNRRRKKQK  242 (423)
T ss_pred             EEecCCceEEEEEecCchhhhhHHhHhcccccceeEEEecCCCCeEEeecccceeeecccCCCccccccccchhhhhhhh
Confidence            3678888888887744432211111  499999999999999999999999999999983221                 


Q ss_pred             --------ceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEE
Q 022074           72 --------KLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYL  143 (303)
Q Consensus        72 --------~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l  143 (303)
                              .....+.+|.++|.++.|.+  ...+.|+++|.+|+.||+...    ..+..+. -..++.+++.++..++|
T Consensus       243 ~~~~~~~r~P~vtl~GHt~~Vs~V~w~d--~~v~yS~SwDHTIk~WDletg----~~~~~~~-~~ksl~~i~~~~~~~Ll  315 (423)
T KOG0313|consen  243 REKEGGTRTPLVTLEGHTEPVSSVVWSD--ATVIYSVSWDHTIKVWDLETG----GLKSTLT-TNKSLNCISYSPLSKLL  315 (423)
T ss_pred             hhhcccccCceEEecccccceeeEEEcC--CCceEeecccceEEEEEeecc----cceeeee-cCcceeEeeccccccee
Confidence                    02235679999999999954  678899999999999998632    2222222 23568899999999999


Q ss_pred             EEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCe
Q 022074          144 ISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQK  223 (303)
Q Consensus       144 ~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~  223 (303)
                      ++|+.|+.+|+||.|....                               .-....+.||......+.  ++|   .+..
T Consensus       316 ~~gssdr~irl~DPR~~~g-------------------------------s~v~~s~~gH~nwVssvk--wsp---~~~~  359 (423)
T KOG0313|consen  316 ASGSSDRHIRLWDPRTGDG-------------------------------SVVSQSLIGHKNWVSSVK--WSP---TNEF  359 (423)
T ss_pred             eecCCCCceeecCCCCCCC-------------------------------ceeEEeeecchhhhhhee--cCC---CCce
Confidence            9999999999999885321                               012345566665433222  232   2456


Q ss_pred             EEEEEeCCCeEEEEECCCCe-EEEEeecCCCCeEEEEECCCCCeEEEEeCCCCEEEeecC
Q 022074          224 YIYTGSHDSCVYVYDLVSGE-QVAALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFP  282 (303)
Q Consensus       224 ~latg~~dg~i~iwd~~~~~-~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~  282 (303)
                      +|++|+.|+++++||+++-+ .++.+.+|.+.|.++.|+. +..++|||.|++|+++.-.
T Consensus       360 ~~~S~S~D~t~klWDvRS~k~plydI~~h~DKvl~vdW~~-~~~IvSGGaD~~l~i~~~~  418 (423)
T KOG0313|consen  360 QLVSGSYDNTVKLWDVRSTKAPLYDIAGHNDKVLSVDWNE-GGLIVSGGADNKLRIFKGS  418 (423)
T ss_pred             EEEEEecCCeEEEEEeccCCCcceeeccCCceEEEEeccC-CceEEeccCcceEEEeccc
Confidence            79999999999999999887 7999999999999999965 5689999999999998743


No 63 
>KOG0282 consensus mRNA splicing factor [Function unknown]
Probab=99.92  E-value=4e-25  Score=192.63  Aligned_cols=239  Identities=19%  Similarity=0.351  Sum_probs=170.3

Q ss_pred             CCCcccceEEEEEcC-CCCEEEEeeCCCeEEEEECCC-CceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCc
Q 022074           35 DGGYSFGIFSLKFST-DGRELVAGSSDDCIYVYDLEA-NKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRR  112 (303)
Q Consensus        35 ~~~~~~~v~~l~~s~-~g~~l~sgs~Dg~v~lwd~~~-~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~  112 (303)
                      ..||+-+|.++.|.| .+..|++++.|+.|+||++-. +..+.++.+|..+|..++|+ +++..|+|++.|+++++||++
T Consensus       210 ~~gH~kgvsai~~fp~~~hLlLS~gmD~~vklW~vy~~~~~lrtf~gH~k~Vrd~~~s-~~g~~fLS~sfD~~lKlwDtE  288 (503)
T KOG0282|consen  210 LSGHTKGVSAIQWFPKKGHLLLSGGMDGLVKLWNVYDDRRCLRTFKGHRKPVRDASFN-NCGTSFLSASFDRFLKLWDTE  288 (503)
T ss_pred             ccCCccccchhhhccceeeEEEecCCCceEEEEEEecCcceehhhhcchhhhhhhhcc-ccCCeeeeeecceeeeeeccc
Confidence            369999999999999 899999999999999999977 66777899999999999996 568899999999999999975


Q ss_pred             cccCCCccceeecccccC-eEEEEeCCCC-CEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCcccc
Q 022074          113 CLNVKGKPAGVLMGHLEG-ITFIDSRGDG-RYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLK  190 (303)
Q Consensus       113 ~~~~~~~~~~~~~~h~~~-v~~~~~~~~~-~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  190 (303)
                          +++....+  |.+. +.++.+.|++ +.+++|+.|+.|+.||+|..+....+......    +....+-+......
T Consensus       289 ----TG~~~~~f--~~~~~~~cvkf~pd~~n~fl~G~sd~ki~~wDiRs~kvvqeYd~hLg~----i~~i~F~~~g~rFi  358 (503)
T KOG0282|consen  289 ----TGQVLSRF--HLDKVPTCVKFHPDNQNIFLVGGSDKKIRQWDIRSGKVVQEYDRHLGA----ILDITFVDEGRRFI  358 (503)
T ss_pred             ----cceEEEEE--ecCCCceeeecCCCCCcEEEEecCCCcEEEEeccchHHHHHHHhhhhh----eeeeEEccCCceEe
Confidence                44444444  5554 6788999988 88999999999999999975432222111111    11112222222222


Q ss_pred             C-CCCCcceEEecccc--eeeeE--EEeeee--eeeCCCeEEEEEeCCCeEEEEECCCC---eEEEEeecCC--CCeEEE
Q 022074          191 H-PCDQSVATYKGHSV--LRTLI--RCHFSP--VYSTGQKYIYTGSHDSCVYVYDLVSG---EQVAALKYHT--SPVRDC  258 (303)
Q Consensus       191 ~-~~~~~~~~~~~~~~--~~~~~--~~~~~~--~~s~~~~~latg~~dg~i~iwd~~~~---~~~~~~~~h~--~~I~~v  258 (303)
                      . ..+..+..+.-...  +..+.  ..|..|  ..+|++.++++-+.|.+|.++.+...   .+.+.+++|.  +.-..|
T Consensus       359 ssSDdks~riWe~~~~v~ik~i~~~~~hsmP~~~~~P~~~~~~aQs~dN~i~ifs~~~~~r~nkkK~feGh~vaGys~~v  438 (503)
T KOG0282|consen  359 SSSDDKSVRIWENRIPVPIKNIADPEMHTMPCLTLHPNGKWFAAQSMDNYIAIFSTVPPFRLNKKKRFEGHSVAGYSCQV  438 (503)
T ss_pred             eeccCccEEEEEcCCCccchhhcchhhccCcceecCCCCCeehhhccCceEEEEecccccccCHhhhhcceeccCceeeE
Confidence            2 22223333322111  11110  112222  24688999999999999999987543   2345678886  456678


Q ss_pred             EECCCCCeEEEEeCCCCEEEeecCCC
Q 022074          259 SWHPSQPMLVSSSWDGDVVRWEFPGN  284 (303)
Q Consensus       259 ~~sp~~~~las~s~Dg~i~~Wd~~~~  284 (303)
                      .|||||++|++|+.||.+.+||..+.
T Consensus       439 ~fSpDG~~l~SGdsdG~v~~wdwkt~  464 (503)
T KOG0282|consen  439 DFSPDGRTLCSGDSDGKVNFWDWKTT  464 (503)
T ss_pred             EEcCCCCeEEeecCCccEEEeechhh
Confidence            99999999999999999999998754


No 64 
>KOG0310 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=99.92  E-value=3.9e-24  Score=186.68  Aligned_cols=200  Identities=23%  Similarity=0.397  Sum_probs=159.7

Q ss_pred             CcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccC
Q 022074           37 GYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNV  116 (303)
Q Consensus        37 ~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~  116 (303)
                      .|.-.|++++|..||+.+++|...|.|+|||.++......+.+|..+|..+.|++.+++.|++|+.|+.+++||+..   
T Consensus        66 rFk~~v~s~~fR~DG~LlaaGD~sG~V~vfD~k~r~iLR~~~ah~apv~~~~f~~~d~t~l~s~sDd~v~k~~d~s~---  142 (487)
T KOG0310|consen   66 RFKDVVYSVDFRSDGRLLAAGDESGHVKVFDMKSRVILRQLYAHQAPVHVTKFSPQDNTMLVSGSDDKVVKYWDLST---  142 (487)
T ss_pred             hhccceeEEEeecCCeEEEccCCcCcEEEeccccHHHHHHHhhccCceeEEEecccCCeEEEecCCCceEEEEEcCC---
Confidence            44556999999999999999999999999998776666668899999999999988889999999999999999862   


Q ss_pred             CCccceeecccccCeEEEEeCCC-CCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCC
Q 022074          117 KGKPAGVLMGHLEGITFIDSRGD-GRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQ  195 (303)
Q Consensus       117 ~~~~~~~~~~h~~~v~~~~~~~~-~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  195 (303)
                       ......+.+|++.|.+.+++|. ++.++|||.||.||+||+|....                                 
T Consensus       143 -a~v~~~l~~htDYVR~g~~~~~~~hivvtGsYDg~vrl~DtR~~~~---------------------------------  188 (487)
T KOG0310|consen  143 -AYVQAELSGHTDYVRCGDISPANDHIVVTGSYDGKVRLWDTRSLTS---------------------------------  188 (487)
T ss_pred             -cEEEEEecCCcceeEeeccccCCCeEEEecCCCceEEEEEeccCCc---------------------------------
Confidence             2335578899999999999885 56789999999999999986421                                 


Q ss_pred             cceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCC-eEEEEeecCCCCeEEEEECCCCCeEEEEeCCC
Q 022074          196 SVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSG-EQVAALKYHTSPVRDCSWHPSQPMLVSSSWDG  274 (303)
Q Consensus       196 ~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~-~~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg  274 (303)
                      .+.+++....+..+       .+-|.|+.+|+++. ..+++||+.+| +++..+..|...|+|+.+..++..|+|||-|+
T Consensus       189 ~v~elnhg~pVe~v-------l~lpsgs~iasAgG-n~vkVWDl~~G~qll~~~~~H~KtVTcL~l~s~~~rLlS~sLD~  260 (487)
T KOG0310|consen  189 RVVELNHGCPVESV-------LALPSGSLIASAGG-NSVKVWDLTTGGQLLTSMFNHNKTVTCLRLASDSTRLLSGSLDR  260 (487)
T ss_pred             eeEEecCCCceeeE-------EEcCCCCEEEEcCC-CeEEEEEecCCceehhhhhcccceEEEEEeecCCceEeeccccc
Confidence            11111111111111       13356788888864 46999999865 55666666999999999999999999999999


Q ss_pred             CEEEeec
Q 022074          275 DVVRWEF  281 (303)
Q Consensus       275 ~i~~Wd~  281 (303)
                      .+++||.
T Consensus       261 ~VKVfd~  267 (487)
T KOG0310|consen  261 HVKVFDT  267 (487)
T ss_pred             ceEEEEc
Confidence            9999983


No 65 
>KOG0296 consensus Angio-associated migratory cell protein (contains WD40 repeats) [Function unknown]
Probab=99.92  E-value=9.8e-23  Score=172.34  Aligned_cols=238  Identities=16%  Similarity=0.346  Sum_probs=170.5

Q ss_pred             CCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCcccc
Q 022074           36 GGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLN  115 (303)
Q Consensus        36 ~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~  115 (303)
                      .+|+.+|++++.+|+.+.+++|+.|...+||+..++....++.+|++.|.++.|+. ++.+|+||+.+|.|++|+..   
T Consensus        61 ~~H~~svFavsl~P~~~l~aTGGgDD~AflW~~~~ge~~~eltgHKDSVt~~~Fsh-dgtlLATGdmsG~v~v~~~s---  136 (399)
T KOG0296|consen   61 DKHTDSVFAVSLHPNNNLVATGGGDDLAFLWDISTGEFAGELTGHKDSVTCCSFSH-DGTLLATGDMSGKVLVFKVS---  136 (399)
T ss_pred             hhcCCceEEEEeCCCCceEEecCCCceEEEEEccCCcceeEecCCCCceEEEEEcc-CceEEEecCCCccEEEEEcc---
Confidence            59999999999999999999999999999999999998889999999999999976 48899999999999999965   


Q ss_pred             CCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCC-C
Q 022074          116 VKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPC-D  194 (303)
Q Consensus       116 ~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~  194 (303)
                       ++.....+.+..+.+.++.|+|.++.|+.|+.||.+-+|.+.....   ++. +.........-.+.|.++.+.... +
T Consensus       137 -tg~~~~~~~~e~~dieWl~WHp~a~illAG~~DGsvWmw~ip~~~~---~kv-~~Gh~~~ct~G~f~pdGKr~~tgy~d  211 (399)
T KOG0296|consen  137 -TGGEQWKLDQEVEDIEWLKWHPRAHILLAGSTDGSVWMWQIPSQAL---CKV-MSGHNSPCTCGEFIPDGKRILTGYDD  211 (399)
T ss_pred             -cCceEEEeecccCceEEEEecccccEEEeecCCCcEEEEECCCcce---eeE-ecCCCCCcccccccCCCceEEEEecC
Confidence             3333444545667799999999999999999999999999865211   110 111111111122334444433222 2


Q ss_pred             CcceEEe---cccceeee------EEEe-------------------------------eee------------------
Q 022074          195 QSVATYK---GHSVLRTL------IRCH-------------------------------FSP------------------  216 (303)
Q Consensus       195 ~~~~~~~---~~~~~~~~------~~~~-------------------------------~~~------------------  216 (303)
                      ..+..++   ++...+..      ..|.                               +.+                  
T Consensus       212 gti~~Wn~ktg~p~~~~~~~e~~~~~~~~~~~~~~~~~~g~~e~~~~~~~~~sgKVv~~~n~~~~~l~~~~e~~~esve~  291 (399)
T KOG0296|consen  212 GTIIVWNPKTGQPLHKITQAEGLELPCISLNLAGSTLTKGNSEGVACGVNNGSGKVVNCNNGTVPELKPSQEELDESVES  291 (399)
T ss_pred             ceEEEEecCCCceeEEecccccCcCCccccccccceeEeccCCccEEEEccccceEEEecCCCCccccccchhhhhhhhh
Confidence            2333332   22222111      1100                               000                  


Q ss_pred             -eeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEECCCCCeEEEEeCCCCEEEeecCCC
Q 022074          217 -VYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFPGN  284 (303)
Q Consensus       217 -~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~~~  284 (303)
                       .++..-.+.|+|+.||+|.|||+....+.. .-.|+.+|..+.|-+ ..+|++++.||+++.||....
T Consensus       292 ~~~ss~lpL~A~G~vdG~i~iyD~a~~~~R~-~c~he~~V~~l~w~~-t~~l~t~c~~g~v~~wDaRtG  358 (399)
T KOG0296|consen  292 IPSSSKLPLAACGSVDGTIAIYDLAASTLRH-ICEHEDGVTKLKWLN-TDYLLTACANGKVRQWDARTG  358 (399)
T ss_pred             cccccccchhhcccccceEEEEecccchhhe-eccCCCceEEEEEcC-cchheeeccCceEEeeecccc
Confidence             011222578899999999999997765443 446899999999998 889999999999999998754


No 66 
>KOG0640 consensus mRNA cleavage stimulating factor complex; subunit 1 [RNA processing and modification]
Probab=99.92  E-value=6e-24  Score=176.46  Aligned_cols=211  Identities=24%  Similarity=0.319  Sum_probs=163.1

Q ss_pred             CCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCC------------C------ceEEEEecccCCeEEEEEccCCCcEE
Q 022074           36 GGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEA------------N------KLSLRILAHTSDVNTVCFGDESGHLI   97 (303)
Q Consensus        36 ~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~------------~------~~~~~~~~h~~~v~~l~~~~~~~~~l   97 (303)
                      +-|+.++.+.+|++||..+++||.|..|+|+|++.            +      ..+.++..|.+.|+++.|+| ..+.|
T Consensus       109 t~HK~~cR~aafs~DG~lvATGsaD~SIKildvermlaks~~~em~~~~~qa~hPvIRTlYDH~devn~l~FHP-re~IL  187 (430)
T KOG0640|consen  109 TSHKSPCRAAAFSPDGSLVATGSADASIKILDVERMLAKSKPKEMISGDTQARHPVIRTLYDHVDEVNDLDFHP-RETIL  187 (430)
T ss_pred             eecccceeeeeeCCCCcEEEccCCcceEEEeehhhhhhhcchhhhccCCcccCCceEeehhhccCcccceeecc-hhheE
Confidence            47889999999999999999999999999999872            1      12345667899999999976 47899


Q ss_pred             EEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeec
Q 022074           98 YSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDY  177 (303)
Q Consensus        98 ~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~  177 (303)
                      ++|+.|++|++||..-.. ..+..+.+ .....|.+++|+|.|.+++.|..--++|+||+..-.+..+++          
T Consensus       188 iS~srD~tvKlFDfsK~s-aKrA~K~~-qd~~~vrsiSfHPsGefllvgTdHp~~rlYdv~T~Qcfvsan----------  255 (430)
T KOG0640|consen  188 ISGSRDNTVKLFDFSKTS-AKRAFKVF-QDTEPVRSISFHPSGEFLLVGTDHPTLRLYDVNTYQCFVSAN----------  255 (430)
T ss_pred             EeccCCCeEEEEecccHH-HHHHHHHh-hccceeeeEeecCCCceEEEecCCCceeEEeccceeEeeecC----------
Confidence            999999999999974111 11222233 345689999999999999999999999999986422111100          


Q ss_pred             eeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEee-cCC-CCe
Q 022074          178 RWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALK-YHT-SPV  255 (303)
Q Consensus       178 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~-~h~-~~I  255 (303)
                            |        .       ++|....      ....||+.+++-+||+.||.|++||-.++.++.++. .|. ..|
T Consensus       256 ------P--------d-------~qht~ai------~~V~Ys~t~~lYvTaSkDG~IklwDGVS~rCv~t~~~AH~gsev  308 (430)
T KOG0640|consen  256 ------P--------D-------DQHTGAI------TQVRYSSTGSLYVTASKDGAIKLWDGVSNRCVRTIGNAHGGSEV  308 (430)
T ss_pred             ------c--------c-------cccccce------eEEEecCCccEEEEeccCCcEEeeccccHHHHHHHHhhcCCcee
Confidence                  0        0       1111111      123367889999999999999999998888887774 564 589


Q ss_pred             EEEEECCCCCeEEEEeCCCCEEEeecCCCCc
Q 022074          256 RDCSWHPSQPMLVSSSWDGDVVRWEFPGNGE  286 (303)
Q Consensus       256 ~~v~~sp~~~~las~s~Dg~i~~Wd~~~~~~  286 (303)
                      .+..|+.++++++|.+.|..+++|++...++
T Consensus       309 cSa~Ftkn~kyiLsSG~DS~vkLWEi~t~R~  339 (430)
T KOG0640|consen  309 CSAVFTKNGKYILSSGKDSTVKLWEISTGRM  339 (430)
T ss_pred             eeEEEccCCeEEeecCCcceeeeeeecCCce
Confidence            9999999999999999999999999987754


No 67 
>PTZ00420 coronin; Provisional
Probab=99.92  E-value=1.2e-22  Score=189.12  Aligned_cols=186  Identities=17%  Similarity=0.248  Sum_probs=145.1

Q ss_pred             eeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCC----CccceeecccccCeE
Q 022074           57 GSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVK----GKPAGVLMGHLEGIT  132 (303)
Q Consensus        57 gs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~----~~~~~~~~~h~~~v~  132 (303)
                      |+.++.|+||+.........+.+|.+.|.+++|+|..+++|+||+.|++|++||+......    ..+...+.+|...|.
T Consensus        50 GG~~gvI~L~~~~r~~~v~~L~gH~~~V~~lafsP~~~~lLASgS~DgtIrIWDi~t~~~~~~~i~~p~~~L~gH~~~V~  129 (568)
T PTZ00420         50 GGLIGAIRLENQMRKPPVIKLKGHTSSILDLQFNPCFSEILASGSEDLTIRVWEIPHNDESVKEIKDPQCILKGHKKKIS  129 (568)
T ss_pred             CCceeEEEeeecCCCceEEEEcCCCCCEEEEEEcCCCCCEEEEEeCCCeEEEEECCCCCccccccccceEEeecCCCcEE
Confidence            6678899999987776677788999999999998766789999999999999998632110    123446788999999


Q ss_pred             EEEeCCCCCE-EEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEE
Q 022074          133 FIDSRGDGRY-LISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIR  211 (303)
Q Consensus       133 ~~~~~~~~~~-l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  211 (303)
                      ++.|+|++.. |++++.|++|++||++.....                                  ..+..+.       
T Consensus       130 sVaf~P~g~~iLaSgS~DgtIrIWDl~tg~~~----------------------------------~~i~~~~-------  168 (568)
T PTZ00420        130 IIDWNPMNYYIMCSSGFDSFVNIWDIENEKRA----------------------------------FQINMPK-------  168 (568)
T ss_pred             EEEECCCCCeEEEEEeCCCeEEEEECCCCcEE----------------------------------EEEecCC-------
Confidence            9999998876 579999999999999753211                                  0011000       


Q ss_pred             EeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeE-----EEEECCCCCeEEEEeCCC----CEEEeecC
Q 022074          212 CHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVR-----DCSWHPSQPMLVSSSWDG----DVVRWEFP  282 (303)
Q Consensus       212 ~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~-----~v~~sp~~~~las~s~Dg----~i~~Wd~~  282 (303)
                      ...+..|+++|++|++++.|+.|+|||+++++.+..+.+|.+.+.     ...|++++.+|+|++.|+    ++++||+.
T Consensus       169 ~V~SlswspdG~lLat~s~D~~IrIwD~Rsg~~i~tl~gH~g~~~s~~v~~~~fs~d~~~IlTtG~d~~~~R~VkLWDlr  248 (568)
T PTZ00420        169 KLSSLKWNIKGNLLSGTCVGKHMHIIDPRKQEIASSFHIHDGGKNTKNIWIDGLGGDDNYILSTGFSKNNMREMKLWDLK  248 (568)
T ss_pred             cEEEEEECCCCCEEEEEecCCEEEEEECCCCcEEEEEecccCCceeEEEEeeeEcCCCCEEEEEEcCCCCccEEEEEECC
Confidence            012334778999999999999999999999999989999987643     345678999999988774    79999987


Q ss_pred             C
Q 022074          283 G  283 (303)
Q Consensus       283 ~  283 (303)
                      .
T Consensus       249 ~  249 (568)
T PTZ00420        249 N  249 (568)
T ss_pred             C
Confidence            4


No 68 
>KOG1332 consensus Vesicle coat complex COPII, subunit SEC13 [Intracellular trafficking, secretion, and vesicular transport]
Probab=99.92  E-value=5.9e-24  Score=171.18  Aligned_cols=224  Identities=20%  Similarity=0.268  Sum_probs=161.1

Q ss_pred             cCCCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCc---eEEEEecccCCeEEEEE-ccCCCcEEEEecCCCeEEEE
Q 022074           34 DDGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANK---LSLRILAHTSDVNTVCF-GDESGHLIYSGSDDNLCKVW  109 (303)
Q Consensus        34 ~~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~---~~~~~~~h~~~v~~l~~-~~~~~~~l~s~s~dg~v~lW  109 (303)
                      .+++|+.-|..+...--|++|++++.|++|+||+.....   +..++.+|.++|..++| +|+.++.|+|++.||.|.||
T Consensus         6 idt~H~D~IHda~lDyygkrlATcsSD~tVkIf~v~~n~~s~ll~~L~Gh~GPVwqv~wahPk~G~iLAScsYDgkVIiW   85 (299)
T KOG1332|consen    6 IDTQHEDMIHDAQLDYYGKRLATCSSDGTVKIFEVRNNGQSKLLAELTGHSGPVWKVAWAHPKFGTILASCSYDGKVIIW   85 (299)
T ss_pred             hhhhhhhhhhHhhhhhhcceeeeecCCccEEEEEEcCCCCceeeeEecCCCCCeeEEeecccccCcEeeEeecCceEEEE
Confidence            348899999998888889999999999999999997653   56678999999999999 55679999999999999999


Q ss_pred             cCccccCCCccceeecccccCeEEEEeCCC--CCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCc
Q 022074          110 DRRCLNVKGKPAGVLMGHLEGITFIDSRGD--GRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQAR  187 (303)
Q Consensus       110 d~~~~~~~~~~~~~~~~h~~~v~~~~~~~~--~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  187 (303)
                      .-.  +........+..|..+|+++++.|.  |-.|++++.||.|.+.+.+..- ....+.........+....+.|...
T Consensus        86 ke~--~g~w~k~~e~~~h~~SVNsV~wapheygl~LacasSDG~vsvl~~~~~g-~w~t~ki~~aH~~GvnsVswapa~~  162 (299)
T KOG1332|consen   86 KEE--NGRWTKAYEHAAHSASVNSVAWAPHEYGLLLACASSDGKVSVLTYDSSG-GWTTSKIVFAHEIGVNSVSWAPASA  162 (299)
T ss_pred             ecC--CCchhhhhhhhhhcccceeecccccccceEEEEeeCCCcEEEEEEcCCC-CccchhhhhccccccceeeecCcCC
Confidence            843  1233344566789999999998875  5778999999999999877531 0000000011111111111111100


Q ss_pred             cccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCe--EEEEeecCCCCeEEEEECCCC-
Q 022074          188 DLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGE--QVAALKYHTSPVRDCSWHPSQ-  264 (303)
Q Consensus       188 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~--~~~~~~~h~~~I~~v~~sp~~-  264 (303)
                      .         ..+-.+.           +  ...-+.|++||.|..|+||+..+++  +..+|++|.+.|.+++|.|.- 
T Consensus       163 ~---------g~~~~~~-----------~--~~~~krlvSgGcDn~VkiW~~~~~~w~~e~~l~~H~dwVRDVAwaP~~g  220 (299)
T KOG1332|consen  163 P---------GSLVDQG-----------P--AAKVKRLVSGGCDNLVKIWKFDSDSWKLERTLEGHKDWVRDVAWAPSVG  220 (299)
T ss_pred             C---------ccccccC-----------c--ccccceeeccCCccceeeeecCCcchhhhhhhhhcchhhhhhhhccccC
Confidence            0         0000000           0  0012569999999999999998763  345689999999999999974 


Q ss_pred             ---CeEEEEeCCCCEEEeecC
Q 022074          265 ---PMLVSSSWDGDVVRWEFP  282 (303)
Q Consensus       265 ---~~las~s~Dg~i~~Wd~~  282 (303)
                         .+|||+|+||++.+|...
T Consensus       221 l~~s~iAS~SqDg~viIwt~~  241 (299)
T KOG1332|consen  221 LPKSTIASCSQDGTVIIWTKD  241 (299)
T ss_pred             CCceeeEEecCCCcEEEEEec
Confidence               389999999999999965


No 69 
>KOG0274 consensus Cdc4 and related F-box and WD-40 proteins [General function prediction only]
Probab=99.92  E-value=2e-23  Score=193.58  Aligned_cols=218  Identities=28%  Similarity=0.430  Sum_probs=180.3

Q ss_pred             CchhhccccccccccCcCcccccCCCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEcc
Q 022074           12 SGTMESLANVTEIHDGLDFSAADDGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGD   91 (303)
Q Consensus        12 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~   91 (303)
                      +|+-|.-+++|+.-+|+.. .....||..+|.++.+..-+..+++|+.|.++++||..+|+....+.+|..-|.++... 
T Consensus       223 ~~s~~~tl~~~~~~~~~~i-~~~l~GH~g~V~~l~~~~~~~~lvsgS~D~t~rvWd~~sg~C~~~l~gh~stv~~~~~~-  300 (537)
T KOG0274|consen  223 SGSDDSTLHLWDLNNGYLI-LTRLVGHFGGVWGLAFPSGGDKLVSGSTDKTERVWDCSTGECTHSLQGHTSSVRCLTID-  300 (537)
T ss_pred             ecCCCceeEEeecccceEE-EeeccCCCCCceeEEEecCCCEEEEEecCCcEEeEecCCCcEEEEecCCCceEEEEEcc-
Confidence            5666777799999888755 33357999999999999878899999999999999999999999999999999998764 


Q ss_pred             CCCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCcc
Q 022074           92 ESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFR  171 (303)
Q Consensus        92 ~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~  171 (303)
                        +..+++|+.|.+|++|++.    ++.....+.+|.++|.++...  +.++++|+.|++|++||.+..           
T Consensus       301 --~~~~~sgs~D~tVkVW~v~----n~~~l~l~~~h~~~V~~v~~~--~~~lvsgs~d~~v~VW~~~~~-----------  361 (537)
T KOG0274|consen  301 --PFLLVSGSRDNTVKVWDVT----NGACLNLLRGHTGPVNCVQLD--EPLLVSGSYDGTVKVWDPRTG-----------  361 (537)
T ss_pred             --CceEeeccCCceEEEEecc----CcceEEEeccccccEEEEEec--CCEEEEEecCceEEEEEhhhc-----------
Confidence              4578889999999999975    445566677799999999886  779999999999999998632           


Q ss_pred             ceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCC-eEEEEEeCCCeEEEEECCCC-eEEEEee
Q 022074          172 SYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQ-KYIYTGSHDSCVYVYDLVSG-EQVAALK  249 (303)
Q Consensus       172 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~-~~latg~~dg~i~iwd~~~~-~~~~~~~  249 (303)
                                             .++.++.||....+.+.        .++ ..+++|+.|++|++||+.+. +++.++.
T Consensus       362 -----------------------~cl~sl~gH~~~V~sl~--------~~~~~~~~Sgs~D~~IkvWdl~~~~~c~~tl~  410 (537)
T KOG0274|consen  362 -----------------------KCLKSLSGHTGRVYSLI--------VDSENRLLSGSLDTTIKVWDLRTKRKCIHTLQ  410 (537)
T ss_pred             -----------------------eeeeeecCCcceEEEEE--------ecCcceEEeeeeccceEeecCCchhhhhhhhc
Confidence                                   24556677765443321        234 78999999999999999999 8899999


Q ss_pred             cCCCCeEEEEECCCCCeEEEEeCCCCEEEeecCC
Q 022074          250 YHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFPG  283 (303)
Q Consensus       250 ~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~~  283 (303)
                      +|..-+.++.+  .+++|++++.|++|++||...
T Consensus       411 ~h~~~v~~l~~--~~~~Lvs~~aD~~Ik~WD~~~  442 (537)
T KOG0274|consen  411 GHTSLVSSLLL--RDNFLVSSSADGTIKLWDAEE  442 (537)
T ss_pred             CCccccccccc--ccceeEeccccccEEEeeccc
Confidence            99988865554  678999999999999999753


No 70 
>KOG1446 consensus Histone H3 (Lys4) methyltransferase complex and RNA cleavage factor II complex, subunit SWD2 [RNA processing and modification; Chromatin structure and dynamics; Posttranslational modification, protein turnover, chaperones]
Probab=99.92  E-value=1.3e-22  Score=168.93  Aligned_cols=252  Identities=21%  Similarity=0.372  Sum_probs=183.0

Q ss_pred             cccccccccCcCcccccCCCcccceEEEEEcCCCCEEEEeeC--CCeEEEEECCCCceEEEEecccCCeEEEEEccCCCc
Q 022074           18 LANVTEIHDGLDFSAADDGGYSFGIFSLKFSTDGRELVAGSS--DDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGH   95 (303)
Q Consensus        18 ~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~s~~g~~l~sgs~--Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~   95 (303)
                      .+.+-+..+|++.-  ..+-++.++..+.|......++.++.  |.+||..++.+.+-+..+.+|...|+.++.+|. ++
T Consensus        37 sl~LYd~~~g~~~~--ti~skkyG~~~~~Fth~~~~~i~sStk~d~tIryLsl~dNkylRYF~GH~~~V~sL~~sP~-~d  113 (311)
T KOG1446|consen   37 SLRLYDSLSGKQVK--TINSKKYGVDLACFTHHSNTVIHSSTKEDDTIRYLSLHDNKYLRYFPGHKKRVNSLSVSPK-DD  113 (311)
T ss_pred             eEEEEEcCCCceee--EeecccccccEEEEecCCceEEEccCCCCCceEEEEeecCceEEEcCCCCceEEEEEecCC-CC
Confidence            44566666666442  12567778999999988888888887  889999999999988889999999999999875 58


Q ss_pred             EEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCcc-cee
Q 022074           96 LIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFR-SYE  174 (303)
Q Consensus        96 ~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~-~~~  174 (303)
                      .|+|++.|++||+||+|..+    ..+.+  +...-..+++.|+|-++|.+.....|+|||+|............. ...
T Consensus       114 ~FlS~S~D~tvrLWDlR~~~----cqg~l--~~~~~pi~AfDp~GLifA~~~~~~~IkLyD~Rs~dkgPF~tf~i~~~~~  187 (311)
T KOG1446|consen  114 TFLSSSLDKTVRLWDLRVKK----CQGLL--NLSGRPIAAFDPEGLIFALANGSELIKLYDLRSFDKGPFTTFSITDNDE  187 (311)
T ss_pred             eEEecccCCeEEeeEecCCC----CceEE--ecCCCcceeECCCCcEEEEecCCCeEEEEEecccCCCCceeEccCCCCc
Confidence            99999999999999998432    22233  333344567899999999888877999999998643222211111 112


Q ss_pred             eeceeeeCCCCCccccCCCCCc-c---eEEecc--------cceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCC
Q 022074          175 WDYRWMDYPPQARDLKHPCDQS-V---ATYKGH--------SVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSG  242 (303)
Q Consensus       175 ~~~~~~~~~~~~~~~~~~~~~~-~---~~~~~~--------~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~  242 (303)
                      -++..++++++++.+...+... +   ..++|.        ....   .......|+||++++++|+.||+|++|+++++
T Consensus       188 ~ew~~l~FS~dGK~iLlsT~~s~~~~lDAf~G~~~~tfs~~~~~~---~~~~~a~ftPds~Fvl~gs~dg~i~vw~~~tg  264 (311)
T KOG1446|consen  188 AEWTDLEFSPDGKSILLSTNASFIYLLDAFDGTVKSTFSGYPNAG---NLPLSATFTPDSKFVLSGSDDGTIHVWNLETG  264 (311)
T ss_pred             cceeeeEEcCCCCEEEEEeCCCcEEEEEccCCcEeeeEeeccCCC---CcceeEEECCCCcEEEEecCCCcEEEEEcCCC
Confidence            2234567788877665443221 1   222222        1110   01123458899999999999999999999999


Q ss_pred             eEEEEeec-CCCCeEEEEECCCCCeEEEEeCCCCEEEeecCC
Q 022074          243 EQVAALKY-HTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFPG  283 (303)
Q Consensus       243 ~~~~~~~~-h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~~  283 (303)
                      +++..+.+ +..++.++.|+|.-.+++|++  .++.+|=...
T Consensus       265 ~~v~~~~~~~~~~~~~~~fnP~~~mf~sa~--s~l~fw~p~~  304 (311)
T KOG1446|consen  265 KKVAVLRGPNGGPVSCVRFNPRYAMFVSAS--SNLVFWLPDE  304 (311)
T ss_pred             cEeeEecCCCCCCccccccCCceeeeeecC--ceEEEEeccc
Confidence            99999887 789999999999999999996  5788887543


No 71 
>PTZ00421 coronin; Provisional
Probab=99.92  E-value=2.1e-22  Score=186.25  Aligned_cols=204  Identities=17%  Similarity=0.260  Sum_probs=154.1

Q ss_pred             ceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceE-------------EEEecccCCeEEEEEccCCCcEEEEecCCCeEE
Q 022074           41 GIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLS-------------LRILAHTSDVNTVCFGDESGHLIYSGSDDNLCK  107 (303)
Q Consensus        41 ~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~-------------~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~  107 (303)
                      .|.....++++..+++++.+..+..|+...+...             ..+.+|.+.|.+++|+|.++++|++|+.|++|+
T Consensus        22 ~i~~~~~~~d~~~~~~~n~~~~a~~w~~~gg~~v~~~~~~G~~~~~~~~l~GH~~~V~~v~fsP~d~~~LaSgS~DgtIk  101 (493)
T PTZ00421         22 NVTPSTALWDCSNTIACNDRFIAVPWQQLGSTAVLKHTDYGKLASNPPILLGQEGPIIDVAFNPFDPQKLFTASEDGTIM  101 (493)
T ss_pred             ccccccccCCCCCcEeECCceEEEEEecCCceEEeeccccccCCCCCceEeCCCCCEEEEEEcCCCCCEEEEEeCCCEEE
Confidence            4555666677777777777777777876554322             136689999999999875678999999999999


Q ss_pred             EEcCccccC---CCccceeecccccCeEEEEeCCCC-CEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCC
Q 022074          108 VWDRRCLNV---KGKPAGVLMGHLEGITFIDSRGDG-RYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYP  183 (303)
Q Consensus       108 lWd~~~~~~---~~~~~~~~~~h~~~v~~~~~~~~~-~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~  183 (303)
                      +||+.....   ...+...+.+|...|.++.|+|++ ++|++++.|++|++||++....                     
T Consensus       102 IWdi~~~~~~~~~~~~l~~L~gH~~~V~~l~f~P~~~~iLaSgs~DgtVrIWDl~tg~~---------------------  160 (493)
T PTZ00421        102 GWGIPEEGLTQNISDPIVHLQGHTKKVGIVSFHPSAMNVLASAGADMVVNVWDVERGKA---------------------  160 (493)
T ss_pred             EEecCCCccccccCcceEEecCCCCcEEEEEeCcCCCCEEEEEeCCCEEEEEECCCCeE---------------------
Confidence            999853211   123456788999999999999975 6899999999999999864211                     


Q ss_pred             CCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCC-eEEEEECC
Q 022074          184 PQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSP-VRDCSWHP  262 (303)
Q Consensus       184 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~-I~~v~~sp  262 (303)
                                   +..+.+|....      .+..|++++.+|++++.|+.|++||+++++.+..+.+|... +..+.|++
T Consensus       161 -------------~~~l~~h~~~V------~sla~spdG~lLatgs~Dg~IrIwD~rsg~~v~tl~~H~~~~~~~~~w~~  221 (493)
T PTZ00421        161 -------------VEVIKCHSDQI------TSLEWNLDGSLLCTTSKDKKLNIIDPRDGTIVSSVEAHASAKSQRCLWAK  221 (493)
T ss_pred             -------------EEEEcCCCCce------EEEEEECCCCEEEEecCCCEEEEEECCCCcEEEEEecCCCCcceEEEEcC
Confidence                         11222222111      12346788999999999999999999999998888889764 45788999


Q ss_pred             CCCeEEEEe----CCCCEEEeecCCC
Q 022074          263 SQPMLVSSS----WDGDVVRWEFPGN  284 (303)
Q Consensus       263 ~~~~las~s----~Dg~i~~Wd~~~~  284 (303)
                      ++..+++++    .|+++++||+...
T Consensus       222 ~~~~ivt~G~s~s~Dr~VklWDlr~~  247 (493)
T PTZ00421        222 RKDLIITLGCSKSQQRQIMLWDTRKM  247 (493)
T ss_pred             CCCeEEEEecCCCCCCeEEEEeCCCC
Confidence            988877765    4799999998643


No 72 
>KOG0646 consensus WD40 repeat protein [General function prediction only]
Probab=99.92  E-value=2.8e-23  Score=180.50  Aligned_cols=221  Identities=22%  Similarity=0.304  Sum_probs=170.9

Q ss_pred             cccccccccCcCcccccCCCcc--cceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCc
Q 022074           18 LANVTEIHDGLDFSAADDGGYS--FGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGH   95 (303)
Q Consensus        18 ~~~~~~~~~~~~~~~~~~~~~~--~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~   95 (303)
                      +++||+++.-.++.    .-..  .+|.++.-+|+|.+++.|+-.|.+++|.+.+|.+...+.+|-..|+|+.|+ +++.
T Consensus        62 ~l~vw~i~k~~~~~----q~~v~Pg~v~al~s~n~G~~l~ag~i~g~lYlWelssG~LL~v~~aHYQ~ITcL~fs-~dgs  136 (476)
T KOG0646|consen   62 LLHVWEILKKDQVV----QYIVLPGPVHALASSNLGYFLLAGTISGNLYLWELSSGILLNVLSAHYQSITCLKFS-DDGS  136 (476)
T ss_pred             cccccccCchhhhh----hhcccccceeeeecCCCceEEEeecccCcEEEEEeccccHHHHHHhhccceeEEEEe-CCCc
Confidence            67999996555442    1122  359999999999999999899999999999999988889999999999996 5689


Q ss_pred             EEEEecCCCeEEEEcCcc-----ccCCCccceeecccccCeEEEEeCC--CCCEEEEEeCCCcEEEEEcccccCCccccc
Q 022074           96 LIYSGSDDNLCKVWDRRC-----LNVKGKPAGVLMGHLEGITFIDSRG--DGRYLISNGKDQAIKLWDIRKMSSNASCNL  168 (303)
Q Consensus        96 ~l~s~s~dg~v~lWd~~~-----~~~~~~~~~~~~~h~~~v~~~~~~~--~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~  168 (303)
                      +|+||++||.|.+|.+..     ......+...+..|.-+|+.+.+..  ...+++|+|.|+++|+||+.......+..+
T Consensus       137 ~iiTgskDg~V~vW~l~~lv~a~~~~~~~p~~~f~~HtlsITDl~ig~Gg~~~rl~TaS~D~t~k~wdlS~g~LLlti~f  216 (476)
T KOG0646|consen  137 HIITGSKDGAVLVWLLTDLVSADNDHSVKPLHIFSDHTLSITDLQIGSGGTNARLYTASEDRTIKLWDLSLGVLLLTITF  216 (476)
T ss_pred             EEEecCCCccEEEEEEEeecccccCCCccceeeeccCcceeEEEEecCCCccceEEEecCCceEEEEEeccceeeEEEec
Confidence            999999999999997631     1123457788999999999887654  346899999999999999976432211111


Q ss_pred             CccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCC------
Q 022074          169 GFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSG------  242 (303)
Q Consensus       169 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~------  242 (303)
                                    |..                       +.    +....|..+.+..|+++|.|.+.++.+-      
T Consensus       217 --------------p~s-----------------------i~----av~lDpae~~~yiGt~~G~I~~~~~~~~~~~~~~  255 (476)
T KOG0646|consen  217 --------------PSS-----------------------IK----AVALDPAERVVYIGTEEGKIFQNLLFKLSGQSAG  255 (476)
T ss_pred             --------------CCc-----------------------ce----eEEEcccccEEEecCCcceEEeeehhcCCccccc
Confidence                          000                       00    0113455778889999999999886432      


Q ss_pred             ----------eEEEEeecCCC--CeEEEEECCCCCeEEEEeCCCCEEEeecCCC
Q 022074          243 ----------EQVAALKYHTS--PVRDCSWHPSQPMLVSSSWDGDVVRWEFPGN  284 (303)
Q Consensus       243 ----------~~~~~~~~h~~--~I~~v~~sp~~~~las~s~Dg~i~~Wd~~~~  284 (303)
                                .....+.+|++  +|++++.|-||.+|++|+.||++.+||+.+.
T Consensus       256 v~~k~~~~~~t~~~~~~Gh~~~~~ITcLais~DgtlLlSGd~dg~VcvWdi~S~  309 (476)
T KOG0646|consen  256 VNQKGRHEENTQINVLVGHENESAITCLAISTDGTLLLSGDEDGKVCVWDIYSK  309 (476)
T ss_pred             ccccccccccceeeeeccccCCcceeEEEEecCccEEEeeCCCCCEEEEecchH
Confidence                      23456678988  9999999999999999999999999998654


No 73 
>KOG0643 consensus Translation initiation factor 3, subunit i (eIF-3i)/TGF-beta receptor-interacting protein (TRIP-1) [Translation, ribosomal structure and biogenesis; Signal transduction mechanisms]
Probab=99.92  E-value=6.3e-23  Score=167.19  Aligned_cols=211  Identities=21%  Similarity=0.202  Sum_probs=152.6

Q ss_pred             ccCCCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCc
Q 022074           33 ADDGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRR  112 (303)
Q Consensus        33 ~~~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~  112 (303)
                      +-..||+.|+..|.|+.+|..|.+++.|.++.||-..+|...-++.+|.+.|.++... .+.+.++||+.|.+++|||..
T Consensus         4 i~l~GHERplTqiKyN~eGDLlFscaKD~~~~vw~s~nGerlGty~GHtGavW~~Did-~~s~~liTGSAD~t~kLWDv~   82 (327)
T KOG0643|consen    4 ILLQGHERPLTQIKYNREGDLLFSCAKDSTPTVWYSLNGERLGTYDGHTGAVWCCDID-WDSKHLITGSADQTAKLWDVE   82 (327)
T ss_pred             cccccCccccceEEecCCCcEEEEecCCCCceEEEecCCceeeeecCCCceEEEEEec-CCcceeeeccccceeEEEEcC
Confidence            4558999999999999999999999999999999888888888999999999999984 457789999999999999975


Q ss_pred             cccCC-------------------------------------------------CccceeecccccCeEEEEeCCCCCEE
Q 022074          113 CLNVK-------------------------------------------------GKPAGVLMGHLEGITFIDSRGDGRYL  143 (303)
Q Consensus       113 ~~~~~-------------------------------------------------~~~~~~~~~h~~~v~~~~~~~~~~~l  143 (303)
                      .+.+.                                                 ..|...+..+...++..-|.+.+++|
T Consensus        83 tGk~la~~k~~~~Vk~~~F~~~gn~~l~~tD~~mg~~~~v~~fdi~~~~~~~~s~ep~~kI~t~~skit~a~Wg~l~~~i  162 (327)
T KOG0643|consen   83 TGKQLATWKTNSPVKRVDFSFGGNLILASTDKQMGYTCFVSVFDIRDDSSDIDSEEPYLKIPTPDSKITSALWGPLGETI  162 (327)
T ss_pred             CCcEEEEeecCCeeEEEeeccCCcEEEEEehhhcCcceEEEEEEccCChhhhcccCceEEecCCccceeeeeecccCCEE
Confidence            32110                                                 01112223344566677788889999


Q ss_pred             EEEeCCCcEEEEEcccccCC-cccccCccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCC
Q 022074          144 ISNGKDQAIKLWDIRKMSSN-ASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQ  222 (303)
Q Consensus       144 ~s~~~D~~v~lWdl~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~  222 (303)
                      ++|..||.|..||.+..... .+...    +.-.+.                                    ...++++.
T Consensus       163 i~Ghe~G~is~~da~~g~~~v~s~~~----h~~~In------------------------------------d~q~s~d~  202 (327)
T KOG0643|consen  163 IAGHEDGSISIYDARTGKELVDSDEE----HSSKIN------------------------------------DLQFSRDR  202 (327)
T ss_pred             EEecCCCcEEEEEcccCceeeechhh----hccccc------------------------------------cccccCCc
Confidence            99999999999998863211 11000    000001                                    11233444


Q ss_pred             eEEEEEeCCCeEEEEECCCC-------------------------------------------------------eEEEE
Q 022074          223 KYIYTGSHDSCVYVYDLVSG-------------------------------------------------------EQVAA  247 (303)
Q Consensus       223 ~~latg~~dg~i~iwd~~~~-------------------------------------------------------~~~~~  247 (303)
                      .+++|++.|.+.++||..+.                                                       +++..
T Consensus       203 T~FiT~s~Dttakl~D~~tl~v~Kty~te~PvN~aaisP~~d~VilgGGqeA~dVTTT~~r~GKFEArFyh~i~eEEigr  282 (327)
T KOG0643|consen  203 TYFITGSKDTTAKLVDVRTLEVLKTYTTERPVNTAAISPLLDHVILGGGQEAMDVTTTSTRAGKFEARFYHLIFEEEIGR  282 (327)
T ss_pred             ceEEecccCccceeeeccceeeEEEeeecccccceecccccceEEecCCceeeeeeeecccccchhhhHHHHHHHHHhcc
Confidence            44444444444444443321                                                       24455


Q ss_pred             eecCCCCeEEEEECCCCCeEEEEeCCCCEEEeecCCC
Q 022074          248 LKYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFPGN  284 (303)
Q Consensus       248 ~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~~~  284 (303)
                      +++|-+||++++|||+|+-.+||++||.+++--+..+
T Consensus       283 vkGHFGPINsvAfhPdGksYsSGGEDG~VR~h~Fd~~  319 (327)
T KOG0643|consen  283 VKGHFGPINSVAFHPDGKSYSSGGEDGYVRLHHFDSN  319 (327)
T ss_pred             ccccccCcceeEECCCCcccccCCCCceEEEEEeccc
Confidence            6779999999999999999999999999999887654


No 74 
>KOG0300 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.91  E-value=2e-23  Score=173.87  Aligned_cols=234  Identities=24%  Similarity=0.413  Sum_probs=190.9

Q ss_pred             EccCchhhccccccccccCcCcccccCCCcccceEEEEEcCCCCEEEEeeCCCeEEEEE------CCCCc----------
Q 022074            9 DVGSGTMESLANVTEIHDGLDFSAADDGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYD------LEANK----------   72 (303)
Q Consensus         9 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd------~~~~~----------   72 (303)
                      ++|.||-|+.+-+|-+-.|.=.-  .+.||.+.|.+|.|++.+..++++|.|++..||.      ++...          
T Consensus       162 i~gtASADhTA~iWs~Esg~CL~--~Y~GH~GSVNsikfh~s~~L~lTaSGD~taHIW~~av~~~vP~~~a~~~hSsEeE  239 (481)
T KOG0300|consen  162 ICGTASADHTARIWSLESGACLA--TYTGHTGSVNSIKFHNSGLLLLTASGDETAHIWKAAVNWEVPSNNAPSDHSSEEE  239 (481)
T ss_pred             ceeecccccceeEEeecccccee--eecccccceeeEEeccccceEEEccCCcchHHHHHhhcCcCCCCCCCCCCCchhh
Confidence            68999999999999997777553  6799999999999999999999999999999996      22100          


Q ss_pred             ------------------------eEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCccceeecccc
Q 022074           73 ------------------------LSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHL  128 (303)
Q Consensus        73 ------------------------~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~  128 (303)
                                              ....+.+|...|.+..|. ..++.++++++|.+..+||.+    ++.+...+.||.
T Consensus       240 ~e~sDe~~~d~d~~~~sD~~tiRvPl~~ltgH~~vV~a~dWL-~gg~Q~vTaSWDRTAnlwDVE----tge~v~~LtGHd  314 (481)
T KOG0300|consen  240 EEHSDEHNRDTDSSEKSDGHTIRVPLMRLTGHRAVVSACDWL-AGGQQMVTASWDRTANLWDVE----TGEVVNILTGHD  314 (481)
T ss_pred             hhcccccccccccccccCCceeeeeeeeeeccccceEehhhh-cCcceeeeeeccccceeeeec----cCceeccccCcc
Confidence                                    124567788888888885 468899999999999999986    566778899999


Q ss_pred             cCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCcceEEecccceee
Q 022074          129 EGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRT  208 (303)
Q Consensus       129 ~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  208 (303)
                      ...+.++.+|..++.+|++.|.+.|+||.|..                                 -..+..|+||....+
T Consensus       315 ~ELtHcstHptQrLVvTsSrDtTFRLWDFRea---------------------------------I~sV~VFQGHtdtVT  361 (481)
T KOG0300|consen  315 SELTHCSTHPTQRLVVTSSRDTTFRLWDFREA---------------------------------IQSVAVFQGHTDTVT  361 (481)
T ss_pred             hhccccccCCcceEEEEeccCceeEeccchhh---------------------------------cceeeeeccccccee
Confidence            99999989999999999999999999998731                                 113556777764433


Q ss_pred             eEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCe-EEEEeecCCCCeEEEEECCCCCeEEEEeCCCCEEEeecCCCCcc
Q 022074          209 LIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGE-QVAALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFPGNGEA  287 (303)
Q Consensus       209 ~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~-~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~~~~~~  287 (303)
                      .      .+|..+. .+++|+.|.+|++||++++. .+.++. ...+++.++.+..++.+|---+++.+++||+.+.+-+
T Consensus       362 S------~vF~~dd-~vVSgSDDrTvKvWdLrNMRsplATIR-tdS~~NRvavs~g~~iIAiPhDNRqvRlfDlnG~Rla  433 (481)
T KOG0300|consen  362 S------VVFNTDD-RVVSGSDDRTVKVWDLRNMRSPLATIR-TDSPANRVAVSKGHPIIAIPHDNRQVRLFDLNGNRLA  433 (481)
T ss_pred             E------EEEecCC-ceeecCCCceEEEeeeccccCcceeee-cCCccceeEeecCCceEEeccCCceEEEEecCCCccc
Confidence            2      2355544 48999999999999998764 577775 4578999999999999999999999999999887654


Q ss_pred             CCC
Q 022074          288 APP  290 (303)
Q Consensus       288 ~~~  290 (303)
                      +-|
T Consensus       434 RlP  436 (481)
T KOG0300|consen  434 RLP  436 (481)
T ss_pred             cCC
Confidence            333


No 75 
>KOG0292 consensus Vesicle coat complex COPI, alpha subunit [Intracellular trafficking, secretion, and vesicular transport]
Probab=99.91  E-value=1.8e-23  Score=193.59  Aligned_cols=198  Identities=24%  Similarity=0.393  Sum_probs=169.7

Q ss_pred             cceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCc
Q 022074           40 FGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGK  119 (303)
Q Consensus        40 ~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~  119 (303)
                      .+|..++|+|...+++++--.|.|++||-+.+.+..++..|+++|..++|+|. ..+|+||+.|-.|++|+.+    +.+
T Consensus        10 sRvKglsFHP~rPwILtslHsG~IQlWDYRM~tli~rFdeHdGpVRgv~FH~~-qplFVSGGDDykIkVWnYk----~rr   84 (1202)
T KOG0292|consen   10 SRVKGLSFHPKRPWILTSLHSGVIQLWDYRMGTLIDRFDEHDGPVRGVDFHPT-QPLFVSGGDDYKIKVWNYK----TRR   84 (1202)
T ss_pred             ccccceecCCCCCEEEEeecCceeeeehhhhhhHHhhhhccCCccceeeecCC-CCeEEecCCccEEEEEecc----cce
Confidence            46899999999999999999999999999999999999999999999999765 5699999999999999975    334


Q ss_pred             cceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCcceE
Q 022074          120 PAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVAT  199 (303)
Q Consensus       120 ~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  199 (303)
                      ..-++.||.+-|..+.|++.-..|+|+|.|.+||||+...                                  ..+++.
T Consensus        85 clftL~GHlDYVRt~~FHheyPWIlSASDDQTIrIWNwqs----------------------------------r~~iav  130 (1202)
T KOG0292|consen   85 CLFTLLGHLDYVRTVFFHHEYPWILSASDDQTIRIWNWQS----------------------------------RKCIAV  130 (1202)
T ss_pred             ehhhhccccceeEEeeccCCCceEEEccCCCeEEEEeccC----------------------------------CceEEE
Confidence            4557889999999999999999999999999999999753                                  235778


Q ss_pred             EecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCC--------C-------------------e--EEEEeec
Q 022074          200 YKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVS--------G-------------------E--QVAALKY  250 (303)
Q Consensus       200 ~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~--------~-------------------~--~~~~~~~  250 (303)
                      ++||.....+      ..|+|...++++||-|.+||+||+.-        +                   .  .-..+++
T Consensus       131 ltGHnHYVMc------AqFhptEDlIVSaSLDQTVRVWDisGLRkk~~~pg~~e~~~~~~~~~~dLfg~~DaVVK~VLEG  204 (1202)
T KOG0292|consen  131 LTGHNHYVMC------AQFHPTEDLIVSASLDQTVRVWDISGLRKKNKAPGSLEDQMRGQQGNSDLFGQTDAVVKHVLEG  204 (1202)
T ss_pred             EecCceEEEe------eccCCccceEEEecccceEEEEeecchhccCCCCCCchhhhhccccchhhcCCcCeeeeeeecc
Confidence            8888754433      34667778999999999999999742        1                   0  1134679


Q ss_pred             CCCCeEEEEECCCCCeEEEEeCCCCEEEeecC
Q 022074          251 HTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFP  282 (303)
Q Consensus       251 h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~  282 (303)
                      |...|+-++|+|+-++++||++|+.+++|...
T Consensus       205 HDRGVNwaAfhpTlpliVSG~DDRqVKlWrmn  236 (1202)
T KOG0292|consen  205 HDRGVNWAAFHPTLPLIVSGADDRQVKLWRMN  236 (1202)
T ss_pred             cccccceEEecCCcceEEecCCcceeeEEEec
Confidence            99999999999999999999999999999864


No 76 
>KOG0301 consensus Phospholipase A2-activating protein (contains WD40 repeats) [Lipid transport and metabolism]
Probab=99.91  E-value=5.8e-23  Score=185.98  Aligned_cols=218  Identities=22%  Similarity=0.313  Sum_probs=171.9

Q ss_pred             CchhhccccccccccCcCcccccCCCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEcc
Q 022074           12 SGTMESLANVTEIHDGLDFSAADDGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGD   91 (303)
Q Consensus        12 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~   91 (303)
                      .|.||.-|.|+..-. .++ .....||+..|.|++..-++. +++||+|.|+++|..  ++....+.+|...|.+++.-|
T Consensus        76 ~g~~D~~i~v~~~~~-~~P-~~~LkgH~snVC~ls~~~~~~-~iSgSWD~TakvW~~--~~l~~~l~gH~asVWAv~~l~  150 (745)
T KOG0301|consen   76 VGGMDTTIIVFKLSQ-AEP-LYTLKGHKSNVCSLSIGEDGT-LISGSWDSTAKVWRI--GELVYSLQGHTASVWAVASLP  150 (745)
T ss_pred             eecccceEEEEecCC-CCc-hhhhhccccceeeeecCCcCc-eEecccccceEEecc--hhhhcccCCcchheeeeeecC
Confidence            478898898887722 222 234479999999999887776 999999999999955  666667899999999998866


Q ss_pred             CCCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCcc
Q 022074           92 ESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFR  171 (303)
Q Consensus        92 ~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~  171 (303)
                      ++  .++||+.|.+|++|..      ++...+|.||.+.|..+++-+++ .|++++.||.||+|++..            
T Consensus       151 e~--~~vTgsaDKtIklWk~------~~~l~tf~gHtD~VRgL~vl~~~-~flScsNDg~Ir~w~~~g------------  209 (745)
T KOG0301|consen  151 EN--TYVTGSADKTIKLWKG------GTLLKTFSGHTDCVRGLAVLDDS-HFLSCSNDGSIRLWDLDG------------  209 (745)
T ss_pred             CC--cEEeccCcceeeeccC------CchhhhhccchhheeeeEEecCC-CeEeecCCceEEEEeccC------------
Confidence            53  7889999999999973      35677899999999999987765 599999999999999842            


Q ss_pred             ceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecC
Q 022074          172 SYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYH  251 (303)
Q Consensus       172 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h  251 (303)
                                             ..+..+.||....+.+.      ...+++.++++|+|++++||+..  +++..+...
T Consensus       210 -----------------------e~l~~~~ghtn~vYsis------~~~~~~~Ivs~gEDrtlriW~~~--e~~q~I~lP  258 (745)
T KOG0301|consen  210 -----------------------EVLLEMHGHTNFVYSIS------MALSDGLIVSTGEDRTLRIWKKD--ECVQVITLP  258 (745)
T ss_pred             -----------------------ceeeeeeccceEEEEEE------ecCCCCeEEEecCCceEEEeecC--ceEEEEecC
Confidence                                   12445555654443332      23457889999999999999976  667777766


Q ss_pred             CCCeEEEEECCCCCeEEEEeCCCCEEEeecCCCCcc
Q 022074          252 TSPVRDCSWHPSQPMLVSSSWDGDVVRWEFPGNGEA  287 (303)
Q Consensus       252 ~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~~~~~~  287 (303)
                      ...||++.+-++|. +++|+.||.+++|.....|.+
T Consensus       259 ttsiWsa~~L~NgD-Ivvg~SDG~VrVfT~~k~R~A  293 (745)
T KOG0301|consen  259 TTSIWSAKVLLNGD-IVVGGSDGRVRVFTVDKDRKA  293 (745)
T ss_pred             ccceEEEEEeeCCC-EEEeccCceEEEEEecccccC
Confidence            67899999999888 566777999999998755444


No 77 
>KOG0276 consensus Vesicle coat complex COPI, beta' subunit [Intracellular trafficking, secretion, and vesicular transport]
Probab=99.91  E-value=7.7e-23  Score=183.31  Aligned_cols=205  Identities=21%  Similarity=0.332  Sum_probs=175.5

Q ss_pred             CcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccC
Q 022074           37 GYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNV  116 (303)
Q Consensus        37 ~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~  116 (303)
                      .|+-.|.++.|+|...+++++-.+|.|.||+-++...+..+...+-+|.+..|-. -.+++++|+.|..||+|+..    
T Consensus        11 ~rSdRVKsVd~HPtePw~la~LynG~V~IWnyetqtmVksfeV~~~PvRa~kfia-RknWiv~GsDD~~IrVfnyn----   85 (794)
T KOG0276|consen   11 SRSDRVKSVDFHPTEPWILAALYNGDVQIWNYETQTMVKSFEVSEVPVRAAKFIA-RKNWIVTGSDDMQIRVFNYN----   85 (794)
T ss_pred             ccCCceeeeecCCCCceEEEeeecCeeEEEecccceeeeeeeecccchhhheeee-ccceEEEecCCceEEEEecc----
Confidence            3677899999999999999999999999999999998888887788899888854 35799999999999999974    


Q ss_pred             CCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCc
Q 022074          117 KGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQS  196 (303)
Q Consensus       117 ~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  196 (303)
                      +...+..|..|.+-+.+++.+|..++++|+|.|.+|++||-...              |                   .+
T Consensus        86 t~ekV~~FeAH~DyIR~iavHPt~P~vLtsSDDm~iKlW~we~~--------------w-------------------a~  132 (794)
T KOG0276|consen   86 TGEKVKTFEAHSDYIRSIAVHPTLPYVLTSSDDMTIKLWDWENE--------------W-------------------AC  132 (794)
T ss_pred             cceeeEEeeccccceeeeeecCCCCeEEecCCccEEEEeeccCc--------------e-------------------ee
Confidence            45567789999999999999999999999999999999997531              1                   13


Q ss_pred             ceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEECCCC--CeEEEEeCCC
Q 022074          197 VATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQ--PMLVSSSWDG  274 (303)
Q Consensus       197 ~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~--~~las~s~Dg  274 (303)
                      ..++.||..  .++...+.|   .|...+|+++-|++|++|.+.+..+..++++|+..|+++.|-+.|  ++|+||++|.
T Consensus       133 ~qtfeGH~H--yVMqv~fnP---kD~ntFaS~sLDrTVKVWslgs~~~nfTl~gHekGVN~Vdyy~~gdkpylIsgaDD~  207 (794)
T KOG0276|consen  133 EQTFEGHEH--YVMQVAFNP---KDPNTFASASLDRTVKVWSLGSPHPNFTLEGHEKGVNCVDYYTGGDKPYLISGADDL  207 (794)
T ss_pred             eeEEcCcce--EEEEEEecC---CCccceeeeeccccEEEEEcCCCCCceeeeccccCcceEEeccCCCcceEEecCCCc
Confidence            456777763  345555555   366789999999999999999988899999999999999998754  7999999999


Q ss_pred             CEEEeecCCC
Q 022074          275 DVVRWEFPGN  284 (303)
Q Consensus       275 ~i~~Wd~~~~  284 (303)
                      ++++||.+..
T Consensus       208 tiKvWDyQtk  217 (794)
T KOG0276|consen  208 TIKVWDYQTK  217 (794)
T ss_pred             eEEEeecchH
Confidence            9999998753


No 78 
>KOG0973 consensus Histone transcription regulator HIRA, WD repeat superfamily [Cell cycle control, cell division, chromosome partitioning; Transcription]
Probab=99.91  E-value=9e-23  Score=192.93  Aligned_cols=268  Identities=22%  Similarity=0.347  Sum_probs=187.7

Q ss_pred             ccCch--hhccccccccccCcCc-ccccC---------CCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCC-------
Q 022074           10 VGSGT--MESLANVTEIHDGLDF-SAADD---------GGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEA-------   70 (303)
Q Consensus        10 ~~~~~--~~~~~~~~~~~~~~~~-~~~~~---------~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~-------   70 (303)
                      +++|.  +|.-+-||.+-.=++. ...++         --|.+.|.|+.|++||++||+||.|+.|.||+...       
T Consensus        28 ~aTgGq~~d~~~~iW~~~~vl~~~~~~~~~l~k~l~~m~~h~~sv~CVR~S~dG~~lAsGSDD~~v~iW~~~~~~~~~~f  107 (942)
T KOG0973|consen   28 FATGGQVLDGGIVIWSQDPVLDEKEEKNENLPKHLCTMDDHDGSVNCVRFSPDGSYLASGSDDRLVMIWERAEIGSGTVF  107 (942)
T ss_pred             EecCCccccccceeeccccccchhhhhhcccchhheeeccccCceeEEEECCCCCeEeeccCcceEEEeeecccCCcccc
Confidence            34555  7766666655332211 12222         27889999999999999999999999999998873       


Q ss_pred             ---C--------ceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCC
Q 022074           71 ---N--------KLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGD  139 (303)
Q Consensus        71 ---~--------~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~  139 (303)
                         |        +....+.+|+..|..++|+| ++.+|++++.|.+|.+|+.+.+    +...++.+|...|..+.|.|-
T Consensus       108 gs~g~~~~vE~wk~~~~l~~H~~DV~Dv~Wsp-~~~~lvS~s~DnsViiwn~~tF----~~~~vl~~H~s~VKGvs~DP~  182 (942)
T KOG0973|consen  108 GSTGGAKNVESWKVVSILRGHDSDVLDVNWSP-DDSLLVSVSLDNSVIIWNAKTF----ELLKVLRGHQSLVKGVSWDPI  182 (942)
T ss_pred             cccccccccceeeEEEEEecCCCccceeccCC-CccEEEEecccceEEEEccccc----eeeeeeecccccccceEECCc
Confidence               0        13356789999999999986 6889999999999999997633    557788999999999999999


Q ss_pred             CCEEEEEeCCCcEEEEEcccccCCcccccCcccee--eeceeeeCCCCCccccCCCC----------------CcceEEe
Q 022074          140 GRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYE--WDYRWMDYPPQARDLKHPCD----------------QSVATYK  201 (303)
Q Consensus       140 ~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~----------------~~~~~~~  201 (303)
                      |+||+|-+.|++|++|++......-..+..|....  --...+.++|+++.+..+..                ..-..+-
T Consensus       183 Gky~ASqsdDrtikvwrt~dw~i~k~It~pf~~~~~~T~f~RlSWSPDG~~las~nA~n~~~~~~~IieR~tWk~~~~Lv  262 (942)
T KOG0973|consen  183 GKYFASQSDDRTLKVWRTSDWGIEKSITKPFEESPLTTFFLRLSWSPDGHHLASPNAVNGGKSTIAIIERGTWKVDKDLV  262 (942)
T ss_pred             cCeeeeecCCceEEEEEcccceeeEeeccchhhCCCcceeeecccCCCcCeecchhhccCCcceeEEEecCCceeeeeee
Confidence            99999999999999999654211111111111000  01223556777776654321                0112344


Q ss_pred             cccceeeeEEEeeeee-ee--------CCC----eEEEEEeCCCeEEEEECCCCeEEEEe-ecCCCCeEEEEECCCCCeE
Q 022074          202 GHSVLRTLIRCHFSPV-YS--------TGQ----KYIYTGSHDSCVYVYDLVSGEQVAAL-KYHTSPVRDCSWHPSQPML  267 (303)
Q Consensus       202 ~~~~~~~~~~~~~~~~-~s--------~~~----~~latg~~dg~i~iwd~~~~~~~~~~-~~h~~~I~~v~~sp~~~~l  267 (303)
                      ||....++++  |+|. |.        ...    ..+|+|+.|++|.||.....+++... +--...|.+++|||||..|
T Consensus       263 GH~~p~evvr--FnP~lfe~~~~ng~~~~~~~~y~i~AvgSqDrSlSVW~T~~~RPl~vi~~lf~~SI~DmsWspdG~~L  340 (942)
T KOG0973|consen  263 GHSAPVEVVR--FNPKLFERNNKNGTSTQPNCYYCIAAVGSQDRSLSVWNTALPRPLFVIHNLFNKSIVDMSWSPDGFSL  340 (942)
T ss_pred             cCCCceEEEE--eChHHhccccccCCccCCCcceEEEEEecCCccEEEEecCCCCchhhhhhhhcCceeeeeEcCCCCeE
Confidence            5654444443  3332 11        111    26889999999999998776765432 2234689999999999999


Q ss_pred             EEEeCCCCEEEeecCCC
Q 022074          268 VSSSWDGDVVRWEFPGN  284 (303)
Q Consensus       268 as~s~Dg~i~~Wd~~~~  284 (303)
                      ..+|.||++.+..+...
T Consensus       341 facS~DGtV~~i~Fee~  357 (942)
T KOG0973|consen  341 FACSLDGTVALIHFEEK  357 (942)
T ss_pred             EEEecCCeEEEEEcchH
Confidence            99999999999998643


No 79 
>KOG0641 consensus WD40 repeat protein [General function prediction only]
Probab=99.91  E-value=4.1e-22  Score=159.15  Aligned_cols=211  Identities=22%  Similarity=0.374  Sum_probs=158.9

Q ss_pred             CcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCc-----eEEEEecccCCeEEEEEccC--C-CcEEEEec-CCCeEE
Q 022074           37 GYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANK-----LSLRILAHTSDVNTVCFGDE--S-GHLIYSGS-DDNLCK  107 (303)
Q Consensus        37 ~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~-----~~~~~~~h~~~v~~l~~~~~--~-~~~l~s~s-~dg~v~  107 (303)
                      -|+..|+|.+|+|+|+.+++||+|.+|++...+...     ...++.-|++-|..++|..+  . +..|++++ .|..|+
T Consensus        87 hhkgsiyc~~ws~~geliatgsndk~ik~l~fn~dt~~~~g~dle~nmhdgtirdl~fld~~~s~~~il~s~gagdc~iy  166 (350)
T KOG0641|consen   87 HHKGSIYCTAWSPCGELIATGSNDKTIKVLPFNADTCNATGHDLEFNMHDGTIRDLAFLDDPESGGAILASAGAGDCKIY  166 (350)
T ss_pred             ccCccEEEEEecCccCeEEecCCCceEEEEecccccccccCcceeeeecCCceeeeEEecCCCcCceEEEecCCCcceEE
Confidence            688999999999999999999999999998665432     12456678999999999532  2 44566654 344555


Q ss_pred             EEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCc
Q 022074          108 VWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQAR  187 (303)
Q Consensus       108 lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  187 (303)
                      +-|  +  ..+.....+.||.+.+.++ ++.++-.+++|+.|++||+||+|...........+                 
T Consensus       167 ~td--c--~~g~~~~a~sghtghilal-yswn~~m~~sgsqdktirfwdlrv~~~v~~l~~~~-----------------  224 (350)
T KOG0641|consen  167 ITD--C--GRGQGFHALSGHTGHILAL-YSWNGAMFASGSQDKTIRFWDLRVNSCVNTLDNDF-----------------  224 (350)
T ss_pred             Eee--c--CCCCcceeecCCcccEEEE-EEecCcEEEccCCCceEEEEeeeccceeeeccCcc-----------------
Confidence            444  3  3566778899999999887 45567799999999999999998643221111000                 


Q ss_pred             cccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEECCCCCeE
Q 022074          188 DLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQPML  267 (303)
Q Consensus       188 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~l  267 (303)
                                   .+.......+   ......|.|++|++|-+|..+.+||++-+.++..+-.|...|.++.|||..-+|
T Consensus       225 -------------~~~glessav---aav~vdpsgrll~sg~~dssc~lydirg~r~iq~f~phsadir~vrfsp~a~yl  288 (350)
T KOG0641|consen  225 -------------HDGGLESSAV---AAVAVDPSGRLLASGHADSSCMLYDIRGGRMIQRFHPHSADIRCVRFSPGAHYL  288 (350)
T ss_pred             -------------cCCCccccee---EEEEECCCcceeeeccCCCceEEEEeeCCceeeeeCCCccceeEEEeCCCceEE
Confidence                         0000000000   112245789999999999999999999999999999999999999999999999


Q ss_pred             EEEeCCCCEEEeecCCCC
Q 022074          268 VSSSWDGDVVRWEFPGNG  285 (303)
Q Consensus       268 as~s~Dg~i~~Wd~~~~~  285 (303)
                      .|++.|..|++=|+++..
T Consensus       289 lt~syd~~ikltdlqgdl  306 (350)
T KOG0641|consen  289 LTCSYDMKIKLTDLQGDL  306 (350)
T ss_pred             EEecccceEEEeecccch
Confidence            999999999999998763


No 80 
>KOG0772 consensus Uncharacterized conserved protein, contains WD40 repeat [Function unknown]
Probab=99.91  E-value=3.3e-23  Score=182.04  Aligned_cols=207  Identities=26%  Similarity=0.473  Sum_probs=159.4

Q ss_pred             CcccceEEEEEcCCCC-EEEEeeCCCeEEEEECCCCceEEEEeccc------CCeEEEEEccCCCcEEEEecCCCeEEEE
Q 022074           37 GYSFGIFSLKFSTDGR-ELVAGSSDDCIYVYDLEANKLSLRILAHT------SDVNTVCFGDESGHLIYSGSDDNLCKVW  109 (303)
Q Consensus        37 ~~~~~v~~l~~s~~g~-~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~------~~v~~l~~~~~~~~~l~s~s~dg~v~lW  109 (303)
                      ||...+.|.+|+|+.+ .+++++.||++||||+...+...++..|.      -.+..++|++ +++++++|..||+|.+|
T Consensus       266 GHia~lt~g~whP~~k~~FlT~s~DgtlRiWdv~~~k~q~qVik~k~~~g~Rv~~tsC~~nr-dg~~iAagc~DGSIQ~W  344 (641)
T KOG0772|consen  266 GHIAELTCGCWHPDNKEEFLTCSYDGTLRIWDVNNTKSQLQVIKTKPAGGKRVPVTSCAWNR-DGKLIAAGCLDGSIQIW  344 (641)
T ss_pred             CceeeeeccccccCcccceEEecCCCcEEEEecCCchhheeEEeeccCCCcccCceeeecCC-CcchhhhcccCCceeee
Confidence            9999999999999654 79999999999999998876554444432      2577889975 58899999999999999


Q ss_pred             cCccccCCCccceeeccccc--CeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCc
Q 022074          110 DRRCLNVKGKPAGVLMGHLE--GITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQAR  187 (303)
Q Consensus       110 d~~~~~~~~~~~~~~~~h~~--~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  187 (303)
                      +.+.... ......-..|..  .++++.|+++|++|++=|.|.++++||||..+.......++..               
T Consensus       345 ~~~~~~v-~p~~~vk~AH~~g~~Itsi~FS~dg~~LlSRg~D~tLKvWDLrq~kkpL~~~tgL~t---------------  408 (641)
T KOG0772|consen  345 DKGSRTV-RPVMKVKDAHLPGQDITSISFSYDGNYLLSRGFDDTLKVWDLRQFKKPLNVRTGLPT---------------  408 (641)
T ss_pred             ecCCccc-ccceEeeeccCCCCceeEEEeccccchhhhccCCCceeeeeccccccchhhhcCCCc---------------
Confidence            9753321 112333456877  7999999999999999999999999999975432211111000               


Q ss_pred             cccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeC------CCeEEEEECCCCeEEEEeecCCCCeEEEEEC
Q 022074          188 DLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSH------DSCVYVYDLVSGEQVAALKYHTSPVRDCSWH  261 (303)
Q Consensus       188 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~------dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~s  261 (303)
                                 .+.       .-.|    .|||+.++++||..      .|.+.+||..+.+.++.+......|-.+.||
T Consensus       409 -----------~~~-------~tdc----~FSPd~kli~TGtS~~~~~~~g~L~f~d~~t~d~v~ki~i~~aSvv~~~Wh  466 (641)
T KOG0772|consen  409 -----------PFP-------GTDC----CFSPDDKLILTGTSAPNGMTAGTLFFFDRMTLDTVYKIDISTASVVRCLWH  466 (641)
T ss_pred             -----------cCC-------CCcc----ccCCCceEEEecccccCCCCCceEEEEeccceeeEEEecCCCceEEEEeec
Confidence                       000       0012    27788999999763      6789999999999998888788899999999


Q ss_pred             CCCCeEEEEeCCCCEEEeecC
Q 022074          262 PSQPMLVSSSWDGDVVRWEFP  282 (303)
Q Consensus       262 p~~~~las~s~Dg~i~~Wd~~  282 (303)
                      |.-++|..++.||+++++--+
T Consensus       467 pkLNQi~~gsgdG~~~vyYdp  487 (641)
T KOG0772|consen  467 PKLNQIFAGSGDGTAHVYYDP  487 (641)
T ss_pred             chhhheeeecCCCceEEEECc
Confidence            999999999999999987643


No 81 
>KOG0289 consensus mRNA splicing factor [General function prediction only]
Probab=99.91  E-value=7.8e-23  Score=176.34  Aligned_cols=200  Identities=23%  Similarity=0.332  Sum_probs=166.7

Q ss_pred             ceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCcc
Q 022074           41 GIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKP  120 (303)
Q Consensus        41 ~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~  120 (303)
                      ++.++...+....+++|+.|.++.++|-..++....+.+|...|+.+.+++ +...+++++.|-.+++|....    ...
T Consensus       221 gi~ald~~~s~~~ilTGG~d~~av~~d~~s~q~l~~~~Gh~kki~~v~~~~-~~~~v~~aSad~~i~vws~~~----~s~  295 (506)
T KOG0289|consen  221 GITALDIIPSSSKILTGGEDKTAVLFDKPSNQILATLKGHTKKITSVKFHK-DLDTVITASADEIIRVWSVPL----SSE  295 (506)
T ss_pred             CeeEEeecCCCCcceecCCCCceEEEecchhhhhhhccCcceEEEEEEecc-chhheeecCCcceEEeecccc----ccC
Confidence            389999998878999999999999999999999989999999999999975 456788999999999998631    122


Q ss_pred             ceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCcceEE
Q 022074          121 AGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATY  200 (303)
Q Consensus       121 ~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  200 (303)
                      ......|.++|+.+..++.|.||++++.|++.-+.|++.........                                -
T Consensus       296 ~~~~~~h~~~V~~ls~h~tgeYllsAs~d~~w~Fsd~~~g~~lt~vs--------------------------------~  343 (506)
T KOG0289|consen  296 PTSSRPHEEPVTGLSLHPTGEYLLSASNDGTWAFSDISSGSQLTVVS--------------------------------D  343 (506)
T ss_pred             ccccccccccceeeeeccCCcEEEEecCCceEEEEEccCCcEEEEEe--------------------------------e
Confidence            33455699999999999999999999999999999987533211000                                0


Q ss_pred             ecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEECCCCCeEEEEeCCCCEEEee
Q 022074          201 KGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWE  280 (303)
Q Consensus       201 ~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd  280 (303)
                      .+..      ....+..|+|||-+|++|..|+.++|||++++..+..|.+|+++|..++|+.+|=+||++++|+.+++||
T Consensus       344 ~~s~------v~~ts~~fHpDgLifgtgt~d~~vkiwdlks~~~~a~Fpght~~vk~i~FsENGY~Lat~add~~V~lwD  417 (506)
T KOG0289|consen  344 ETSD------VEYTSAAFHPDGLIFGTGTPDGVVKIWDLKSQTNVAKFPGHTGPVKAISFSENGYWLATAADDGSVKLWD  417 (506)
T ss_pred             cccc------ceeEEeeEcCCceEEeccCCCceEEEEEcCCccccccCCCCCCceeEEEeccCceEEEEEecCCeEEEEE
Confidence            0000      0122456899999999999999999999999998999999999999999999999999999999999999


Q ss_pred             cCC
Q 022074          281 FPG  283 (303)
Q Consensus       281 ~~~  283 (303)
                      +..
T Consensus       418 LRK  420 (506)
T KOG0289|consen  418 LRK  420 (506)
T ss_pred             ehh
Confidence            864


No 82 
>KOG2445 consensus Nuclear pore complex component (sc Seh1) [Nuclear structure; Intracellular trafficking, secretion, and vesicular transport]
Probab=99.91  E-value=8.1e-22  Score=163.61  Aligned_cols=243  Identities=18%  Similarity=0.273  Sum_probs=163.8

Q ss_pred             cccCCCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCc----eEEEEecccCCeEEEEEc-cCCCcEEEEecCCCeE
Q 022074           32 AADDGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANK----LSLRILAHTSDVNTVCFG-DESGHLIYSGSDDNLC  106 (303)
Q Consensus        32 ~~~~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~----~~~~~~~h~~~v~~l~~~-~~~~~~l~s~s~dg~v  106 (303)
                      ++.++||..=|.+++|..-|+++++|+.|++|.|||.+.+.    ....+..|.+.|..+.|. |+.|+.+++++.|+++
T Consensus         6 ~pi~s~h~DlihdVs~D~~GRRmAtCSsDq~vkI~d~~~~s~~W~~Ts~Wrah~~Si~rV~WAhPEfGqvvA~cS~Drtv   85 (361)
T KOG2445|consen    6 APIDSGHKDLIHDVSFDFYGRRMATCSSDQTVKIWDSTSDSGTWSCTSSWRAHDGSIWRVVWAHPEFGQVVATCSYDRTV   85 (361)
T ss_pred             cccccCCcceeeeeeecccCceeeeccCCCcEEEEeccCCCCceEEeeeEEecCCcEEEEEecCccccceEEEEecCCce
Confidence            44568999999999999999999999999999999964432    234578899999999994 5569999999999999


Q ss_pred             EEEcCc--cccC---CCccceeecccccCeEEEEeCC--CCCEEEEEeCCCcEEEEEcccccC----------------C
Q 022074          107 KVWDRR--CLNV---KGKPAGVLMGHLEGITFIDSRG--DGRYLISNGKDQAIKLWDIRKMSS----------------N  163 (303)
Q Consensus       107 ~lWd~~--~~~~---~~~~~~~~~~h~~~v~~~~~~~--~~~~l~s~~~D~~v~lWdl~~~~~----------------~  163 (303)
                      ++|.=.  ..+.   .....+.+......|+.+.|.|  -|-.|++++.||.+|||+.-....                .
T Consensus        86 ~iWEE~~~~~~~~~~~Wv~~ttl~DsrssV~DV~FaP~hlGLklA~~~aDG~lRIYEA~dp~nLs~W~Lq~Ei~~~~~pp  165 (361)
T KOG2445|consen   86 SIWEEQEKSEEAHGRRWVRRTTLVDSRSSVTDVKFAPKHLGLKLAAASADGILRIYEAPDPMNLSQWTLQHEIQNVIDPP  165 (361)
T ss_pred             eeeeecccccccccceeEEEEEeecCCcceeEEEecchhcceEEEEeccCcEEEEEecCCccccccchhhhhhhhccCCc
Confidence            999731  1111   1112334556677899998887  477899999999999997532110                0


Q ss_pred             cccccCccceeeeceeeeCCCCCccccCCCCC------c--ceEE-------------ecccceeeeEEEeeeeeeeCCC
Q 022074          164 ASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQ------S--VATY-------------KGHSVLRTLIRCHFSPVYSTGQ  222 (303)
Q Consensus       164 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~--~~~~-------------~~~~~~~~~~~~~~~~~~s~~~  222 (303)
                      ..+...-.++.|....    ...+.+...++.      .  +..+             .++...  +....|.|..-...
T Consensus       166 ~~~~~~~~CvsWn~sr----~~~p~iAvgs~e~a~~~~~~~Iye~~e~~rKw~kva~L~d~~dp--I~di~wAPn~Gr~y  239 (361)
T KOG2445|consen  166 GKNKQPCFCVSWNPSR----MHEPLIAVGSDEDAPHLNKVKIYEYNENGRKWLKVAELPDHTDP--IRDISWAPNIGRSY  239 (361)
T ss_pred             ccccCcceEEeecccc----ccCceEEEEcccCCccccceEEEEecCCcceeeeehhcCCCCCc--ceeeeeccccCCce
Confidence            0011111122332110    111111111111      1  1111             112111  11233444433445


Q ss_pred             eEEEEEeCCCeEEEEECCCC--------------------eEEEEeecCCCCeEEEEECCCCCeEEEEeCCCCEEEeec
Q 022074          223 KYIYTGSHDSCVYVYDLVSG--------------------EQVAALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEF  281 (303)
Q Consensus       223 ~~latg~~dg~i~iwd~~~~--------------------~~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~  281 (303)
                      .+||+|+.|| |+||.++..                    +++..+..|+++|+.+.|+-.|.+|+|.|.||.+++|..
T Consensus       240 ~~lAvA~kDg-v~I~~v~~~~s~i~~ee~~~~~~~~~l~v~~vs~~~~H~~~VWrv~wNmtGtiLsStGdDG~VRLWka  317 (361)
T KOG2445|consen  240 HLLAVATKDG-VRIFKVKVARSAIEEEEVLAPDLMTDLPVEKVSELDDHNGEVWRVRWNMTGTILSSTGDDGCVRLWKA  317 (361)
T ss_pred             eeEEEeecCc-EEEEEEeeccchhhhhcccCCCCccccceEEeeeccCCCCceEEEEEeeeeeEEeecCCCceeeehhh
Confidence            7899999999 999998731                    345667899999999999999999999999999999973


No 83 
>KOG1407 consensus WD40 repeat protein [Function unknown]
Probab=99.90  E-value=3.1e-22  Score=162.70  Aligned_cols=221  Identities=20%  Similarity=0.381  Sum_probs=169.6

Q ss_pred             ccCchhhccccccccccCcCcccccCCCcccceEEEEEcC-CCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEE
Q 022074           10 VGSGTMESLANVTEIHDGLDFSAADDGGYSFGIFSLKFST-DGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVC   88 (303)
Q Consensus        10 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~s~-~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~   88 (303)
                      .-+|++|....||-+-++.-.-.....||...|-.+.|+| +...+++++.|.+|++||...++...++....+. ..+.
T Consensus        35 lasgs~dktv~v~n~e~~r~~~~~~~~gh~~svdql~w~~~~~d~~atas~dk~ir~wd~r~~k~~~~i~~~~en-i~i~  113 (313)
T KOG1407|consen   35 LASGSFDKTVSVWNLERDRFRKELVYRGHTDSVDQLCWDPKHPDLFATASGDKTIRIWDIRSGKCTARIETKGEN-INIT  113 (313)
T ss_pred             eeecccCCceEEEEecchhhhhhhcccCCCcchhhheeCCCCCcceEEecCCceEEEEEeccCcEEEEeeccCcc-eEEE
Confidence            4578999999999886653222445589999999999997 5668999999999999999999877665544444 3455


Q ss_pred             EccCCCcEEEEecCCCeEEEEcCccccC-------------------------------------CCccceeecccccCe
Q 022074           89 FGDESGHLIYSGSDDNLCKVWDRRCLNV-------------------------------------KGKPAGVLMGHLEGI  131 (303)
Q Consensus        89 ~~~~~~~~l~s~s~dg~v~lWd~~~~~~-------------------------------------~~~~~~~~~~h~~~v  131 (303)
                      |+| ++++++.+++|..|...|.+....                                     ..+++..+..|....
T Consensus       114 wsp-~g~~~~~~~kdD~it~id~r~~~~~~~~~~~~e~ne~~w~~~nd~Fflt~GlG~v~ILsypsLkpv~si~AH~snC  192 (313)
T KOG1407|consen  114 WSP-DGEYIAVGNKDDRITFIDARTYKIVNEEQFKFEVNEISWNNSNDLFFLTNGLGCVEILSYPSLKPVQSIKAHPSNC  192 (313)
T ss_pred             EcC-CCCEEEEecCcccEEEEEecccceeehhcccceeeeeeecCCCCEEEEecCCceEEEEeccccccccccccCCcce
Confidence            755 477888888888888877642100                                     113445567799888


Q ss_pred             EEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEE
Q 022074          132 TFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIR  211 (303)
Q Consensus       132 ~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  211 (303)
                      .++.|+|+|++||+|+.|..+.|||+..+.+.-.    +..++|.++                                 
T Consensus       193 icI~f~p~GryfA~GsADAlvSLWD~~ELiC~R~----isRldwpVR---------------------------------  235 (313)
T KOG1407|consen  193 ICIEFDPDGRYFATGSADALVSLWDVDELICERC----ISRLDWPVR---------------------------------  235 (313)
T ss_pred             EEEEECCCCceEeeccccceeeccChhHhhhhee----eccccCceE---------------------------------
Confidence            9999999999999999999999999875432110    111122111                                 


Q ss_pred             EeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEECCCCCeEEEEeCC
Q 022074          212 CHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQPMLVSSSWD  273 (303)
Q Consensus       212 ~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~las~s~D  273 (303)
                         ...||.+|++||+|++|..|-|=++++|+.+..+. +++|...|+|+|..++||-+++|
T Consensus       236 ---TlSFS~dg~~lASaSEDh~IDIA~vetGd~~~eI~-~~~~t~tVAWHPk~~LLAyA~dd  293 (313)
T KOG1407|consen  236 ---TLSFSHDGRMLASASEDHFIDIAEVETGDRVWEIP-CEGPTFTVAWHPKRPLLAYACDD  293 (313)
T ss_pred             ---EEEeccCcceeeccCccceEEeEecccCCeEEEee-ccCCceeEEecCCCceeeEEecC
Confidence               23488899999999999999999999999998874 88999999999999999988876


No 84 
>KOG0274 consensus Cdc4 and related F-box and WD-40 proteins [General function prediction only]
Probab=99.90  E-value=1.2e-22  Score=188.37  Aligned_cols=218  Identities=27%  Similarity=0.396  Sum_probs=179.4

Q ss_pred             CchhhccccccccccCcCcccccCCCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEcc
Q 022074           12 SGTMESLANVTEIHDGLDFSAADDGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGD   91 (303)
Q Consensus        12 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~   91 (303)
                      +||.|-.+.||+..+|.--.  ...||...|.++...+  ..+++||.|.+|++|+++++.....+.+|.+.|+++..+ 
T Consensus       266 sgS~D~t~rvWd~~sg~C~~--~l~gh~stv~~~~~~~--~~~~sgs~D~tVkVW~v~n~~~l~l~~~h~~~V~~v~~~-  340 (537)
T KOG0274|consen  266 SGSTDKTERVWDCSTGECTH--SLQGHTSSVRCLTIDP--FLLVSGSRDNTVKVWDVTNGACLNLLRGHTGPVNCVQLD-  340 (537)
T ss_pred             EEecCCcEEeEecCCCcEEE--EecCCCceEEEEEccC--ceEeeccCCceEEEEeccCcceEEEeccccccEEEEEec-
Confidence            57788889999987777332  4469999999998774  468889999999999999999888888899999999985 


Q ss_pred             CCCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCcc
Q 022074           92 ESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFR  171 (303)
Q Consensus        92 ~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~  171 (303)
                        ..++++|+.|++|++||..    ..+....+.||...|.++.+.+. ..+++|+.|++|++||++...          
T Consensus       341 --~~~lvsgs~d~~v~VW~~~----~~~cl~sl~gH~~~V~sl~~~~~-~~~~Sgs~D~~IkvWdl~~~~----------  403 (537)
T KOG0274|consen  341 --EPLLVSGSYDGTVKVWDPR----TGKCLKSLSGHTGRVYSLIVDSE-NRLLSGSLDTTIKVWDLRTKR----------  403 (537)
T ss_pred             --CCEEEEEecCceEEEEEhh----hceeeeeecCCcceEEEEEecCc-ceEEeeeeccceEeecCCchh----------
Confidence              5699999999999999986    45667889999999999977654 789999999999999997641          


Q ss_pred             ceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeec-
Q 022074          172 SYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKY-  250 (303)
Q Consensus       172 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~-  250 (303)
                                             +++.++.+|......+        ...+++|++++.|++|++||..+++.+..+++ 
T Consensus       404 -----------------------~c~~tl~~h~~~v~~l--------~~~~~~Lvs~~aD~~Ik~WD~~~~~~~~~~~~~  452 (537)
T KOG0274|consen  404 -----------------------KCIHTLQGHTSLVSSL--------LLRDNFLVSSSADGTIKLWDAEEGECLRTLEGR  452 (537)
T ss_pred             -----------------------hhhhhhcCCccccccc--------ccccceeEeccccccEEEeecccCceeeeeccC
Confidence                                   1233444444332111        12467899999999999999999999999988 


Q ss_pred             CCCCeEEEEECCCCCeEEEEeCCCCEEEeecCCC
Q 022074          251 HTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFPGN  284 (303)
Q Consensus       251 h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~~~  284 (303)
                      |...|+.+++.  ...+++++.||++++||++..
T Consensus       453 ~~~~v~~l~~~--~~~il~s~~~~~~~l~dl~~~  484 (537)
T KOG0274|consen  453 HVGGVSALALG--KEEILCSSDDGSVKLWDLRSG  484 (537)
T ss_pred             CcccEEEeecC--cceEEEEecCCeeEEEecccC
Confidence            67899999987  678999999999999998765


No 85 
>KOG0268 consensus Sof1-like rRNA processing protein (contains WD40 repeats) [RNA processing and modification]
Probab=99.90  E-value=2.8e-23  Score=175.59  Aligned_cols=233  Identities=19%  Similarity=0.281  Sum_probs=163.1

Q ss_pred             CCcccceEEEEEcCCC-CEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccc
Q 022074           36 GGYSFGIFSLKFSTDG-RELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCL  114 (303)
Q Consensus        36 ~~~~~~v~~l~~s~~g-~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~  114 (303)
                      .||.-+|.|++=+|.. ..+++|+.||.|+|||+........+..|.+.|..+++..   ..+++++.|.+|+.|.....
T Consensus        63 ~gHrdGV~~lakhp~~ls~~aSGs~DG~VkiWnlsqR~~~~~f~AH~G~V~Gi~v~~---~~~~tvgdDKtvK~wk~~~~  139 (433)
T KOG0268|consen   63 DGHRDGVSCLAKHPNKLSTVASGSCDGEVKIWNLSQRECIRTFKAHEGLVRGICVTQ---TSFFTVGDDKTVKQWKIDGP  139 (433)
T ss_pred             cccccccchhhcCcchhhhhhccccCceEEEEehhhhhhhheeecccCceeeEEecc---cceEEecCCcceeeeeccCC
Confidence            7999999999999987 7899999999999999999888888999999999999953   57889999999999973210


Q ss_pred             -----------------------cC-----------CCccceeecccccCeEEEEeCCCC-CEEEEEeCCCcEEEEEccc
Q 022074          115 -----------------------NV-----------KGKPAGVLMGHLEGITFIDSRGDG-RYLISNGKDQAIKLWDIRK  159 (303)
Q Consensus       115 -----------------------~~-----------~~~~~~~~~~h~~~v~~~~~~~~~-~~l~s~~~D~~v~lWdl~~  159 (303)
                                             ..           ...|+..+.--.+.+.++.|+|-. ..|++++.|++|.|||+|.
T Consensus       140 p~~tilg~s~~~gIdh~~~~~~FaTcGe~i~IWD~~R~~Pv~smswG~Dti~svkfNpvETsILas~~sDrsIvLyD~R~  219 (433)
T KOG0268|consen  140 PLHTILGKSVYLGIDHHRKNSVFATCGEQIDIWDEQRDNPVSSMSWGADSISSVKFNPVETSILASCASDRSIVLYDLRQ  219 (433)
T ss_pred             cceeeeccccccccccccccccccccCceeeecccccCCccceeecCCCceeEEecCCCcchheeeeccCCceEEEeccc
Confidence                                   00           011233333334667888888854 4577888999999999997


Q ss_pred             ccCCcccccCccceeeeceeeeCCCCCccccCCC-CCcceE------------EecccceeeeEEEeeeeeeeCCCeEEE
Q 022074          160 MSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPC-DQSVAT------------YKGHSVLRTLIRCHFSPVYSTGQKYIY  226 (303)
Q Consensus       160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~------------~~~~~~~~~~~~~~~~~~~s~~~~~la  226 (303)
                      ..+.-+..+.-+.     ..+.+.|..-.+.... +..+..            +.+|.      ......+|||.|+.++
T Consensus       220 ~~Pl~KVi~~mRT-----N~IswnPeafnF~~a~ED~nlY~~DmR~l~~p~~v~~dhv------sAV~dVdfsptG~Efv  288 (433)
T KOG0268|consen  220 ASPLKKVILTMRT-----NTICWNPEAFNFVAANEDHNLYTYDMRNLSRPLNVHKDHV------SAVMDVDFSPTGQEFV  288 (433)
T ss_pred             CCccceeeeeccc-----cceecCccccceeeccccccceehhhhhhcccchhhcccc------eeEEEeccCCCcchhc
Confidence            6543332221111     1122222222221111 112222            22222      1223456889999999


Q ss_pred             EEeCCCeEEEEECCCCeEEEEe-ecCCCCeEEEEECCCCCeEEEEeCCCCEEEeecC
Q 022074          227 TGSHDSCVYVYDLVSGEQVAAL-KYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFP  282 (303)
Q Consensus       227 tg~~dg~i~iwd~~~~~~~~~~-~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~  282 (303)
                      +||.|++|+||....+...-.+ ..-...|.++.||.|.++++|||+|+++++|...
T Consensus       289 sgsyDksIRIf~~~~~~SRdiYhtkRMq~V~~Vk~S~Dskyi~SGSdd~nvRlWka~  345 (433)
T KOG0268|consen  289 SGSYDKSIRIFPVNHGHSRDIYHTKRMQHVFCVKYSMDSKYIISGSDDGNVRLWKAK  345 (433)
T ss_pred             cccccceEEEeecCCCcchhhhhHhhhheeeEEEEeccccEEEecCCCcceeeeecc
Confidence            9999999999999876532221 1122469999999999999999999999999964


No 86 
>KOG1036 consensus Mitotic spindle checkpoint protein BUB3, WD repeat superfamily [Cell cycle control, cell division, chromosome partitioning]
Probab=99.90  E-value=1e-21  Score=162.97  Aligned_cols=229  Identities=26%  Similarity=0.322  Sum_probs=160.5

Q ss_pred             CCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCcccc
Q 022074           36 GGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLN  115 (303)
Q Consensus        36 ~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~  115 (303)
                      +..+..|.++.|+|.++.|++++|||++++||.....+. ....|..++.+++|.+  ...+++|+.||.|+++|+... 
T Consensus        10 npP~d~IS~v~f~~~~~~LLvssWDgslrlYdv~~~~l~-~~~~~~~plL~c~F~d--~~~~~~G~~dg~vr~~Dln~~-   85 (323)
T KOG1036|consen   10 NPPEDGISSVKFSPSSSDLLVSSWDGSLRLYDVPANSLK-LKFKHGAPLLDCAFAD--ESTIVTGGLDGQVRRYDLNTG-   85 (323)
T ss_pred             CCChhceeeEEEcCcCCcEEEEeccCcEEEEeccchhhh-hheecCCceeeeeccC--CceEEEeccCceEEEEEecCC-
Confidence            444556999999999999999999999999999887543 3457889999999964  457889999999999997532 


Q ss_pred             CCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccC-CCC
Q 022074          116 VKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKH-PCD  194 (303)
Q Consensus       116 ~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~  194 (303)
                          ....+..|.+++.++...+....+++||.|++|++||.|..........+...+..+       .....+.. ..+
T Consensus        86 ----~~~~igth~~~i~ci~~~~~~~~vIsgsWD~~ik~wD~R~~~~~~~~d~~kkVy~~~-------v~g~~LvVg~~~  154 (323)
T KOG1036|consen   86 ----NEDQIGTHDEGIRCIEYSYEVGCVISGSWDKTIKFWDPRNKVVVGTFDQGKKVYCMD-------VSGNRLVVGTSD  154 (323)
T ss_pred             ----cceeeccCCCceEEEEeeccCCeEEEcccCccEEEEeccccccccccccCceEEEEe-------ccCCEEEEeecC
Confidence                233455799999999999888899999999999999999633333222211111111       11111111 112


Q ss_pred             CcceEEecc----------cceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCC----eEEEEeecCC--------
Q 022074          195 QSVATYKGH----------SVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSG----EQVAALKYHT--------  252 (303)
Q Consensus       195 ~~~~~~~~~----------~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~----~~~~~~~~h~--------  252 (303)
                      +.+..++-.          .......||...   -|++.=.+.++-||+|.+=..+..    ++-..|+.|.        
T Consensus       155 r~v~iyDLRn~~~~~q~reS~lkyqtR~v~~---~pn~eGy~~sSieGRVavE~~d~s~~~~skkyaFkCHr~~~~~~~~  231 (323)
T KOG1036|consen  155 RKVLIYDLRNLDEPFQRRESSLKYQTRCVAL---VPNGEGYVVSSIEGRVAVEYFDDSEEAQSKKYAFKCHRLSEKDTEI  231 (323)
T ss_pred             ceEEEEEcccccchhhhccccceeEEEEEEE---ecCCCceEEEeecceEEEEccCCchHHhhhceeEEeeecccCCceE
Confidence            222222111          111223333322   234444788999999999887765    3445677774        


Q ss_pred             -CCeEEEEECCCCCeEEEEeCCCCEEEeecC
Q 022074          253 -SPVRDCSWHPSQPMLVSSSWDGDVVRWEFP  282 (303)
Q Consensus       253 -~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~  282 (303)
                       -||++++|||-...||||+.||-+.+||+.
T Consensus       232 ~yPVNai~Fhp~~~tfaTgGsDG~V~~Wd~~  262 (323)
T KOG1036|consen  232 IYPVNAIAFHPIHGTFATGGSDGIVNIWDLF  262 (323)
T ss_pred             EEEeceeEeccccceEEecCCCceEEEccCc
Confidence             389999999999999999999999999965


No 87 
>KOG0310 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=99.90  E-value=1.3e-21  Score=170.99  Aligned_cols=200  Identities=23%  Similarity=0.317  Sum_probs=159.9

Q ss_pred             CCcccceEEEEEcCCCC-EEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccc
Q 022074           36 GGYSFGIFSLKFSTDGR-ELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCL  114 (303)
Q Consensus        36 ~~~~~~v~~l~~s~~g~-~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~  114 (303)
                      ++|+.+|..+.|+|+++ .+++|+.|+.+++||+.++..+..+.+|++-|.|..++|.+++.++||+.||+||+||+|..
T Consensus       107 ~ah~apv~~~~f~~~d~t~l~s~sDd~v~k~~d~s~a~v~~~l~~htDYVR~g~~~~~~~hivvtGsYDg~vrl~DtR~~  186 (487)
T KOG0310|consen  107 YAHQAPVHVTKFSPQDNTMLVSGSDDKVVKYWDLSTAYVQAELSGHTDYVRCGDISPANDHIVVTGSYDGKVRLWDTRSL  186 (487)
T ss_pred             hhccCceeEEEecccCCeEEEecCCCceEEEEEcCCcEEEEEecCCcceeEeeccccCCCeEEEecCCCceEEEEEeccC
Confidence            68999999999999765 57778889999999999998777889999999999999888899999999999999999843


Q ss_pred             cCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCC
Q 022074          115 NVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCD  194 (303)
Q Consensus       115 ~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  194 (303)
                      .   ..+ ....|..+|..+-+-|.|.+++++|. ..||+||+.....                                
T Consensus       187 ~---~~v-~elnhg~pVe~vl~lpsgs~iasAgG-n~vkVWDl~~G~q--------------------------------  229 (487)
T KOG0310|consen  187 T---SRV-VELNHGCPVESVLALPSGSLIASAGG-NSVKVWDLTTGGQ--------------------------------  229 (487)
T ss_pred             C---cee-EEecCCCceeeEEEcCCCCEEEEcCC-CeEEEEEecCCce--------------------------------
Confidence            2   223 33468899999988999999999875 8999999863210                                


Q ss_pred             CcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEECCCCCeEEEEeCCC
Q 022074          195 QSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQPMLVSSSWDG  274 (303)
Q Consensus       195 ~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg  274 (303)
                       .+.....|.-..+++      .+..+++.|++|+-|+.+++||+.+-+.+..++ -.+||.+++.||+++.++.|..||
T Consensus       230 -ll~~~~~H~KtVTcL------~l~s~~~rLlS~sLD~~VKVfd~t~~Kvv~s~~-~~~pvLsiavs~dd~t~viGmsnG  301 (487)
T KOG0310|consen  230 -LLTSMFNHNKTVTCL------RLASDSTRLLSGSLDRHVKVFDTTNYKVVHSWK-YPGPVLSIAVSPDDQTVVIGMSNG  301 (487)
T ss_pred             -ehhhhhcccceEEEE------EeecCCceEeecccccceEEEEccceEEEEeee-cccceeeEEecCCCceEEEecccc
Confidence             011111122122222      244567889999999999999998888888775 457999999999999999999999


Q ss_pred             CEEEee
Q 022074          275 DVVRWE  280 (303)
Q Consensus       275 ~i~~Wd  280 (303)
                      .+-.=+
T Consensus       302 lv~~rr  307 (487)
T KOG0310|consen  302 LVSIRR  307 (487)
T ss_pred             eeeeeh
Confidence            887654


No 88 
>KOG1036 consensus Mitotic spindle checkpoint protein BUB3, WD repeat superfamily [Cell cycle control, cell division, chromosome partitioning]
Probab=99.90  E-value=1.3e-21  Score=162.48  Aligned_cols=226  Identities=18%  Similarity=0.253  Sum_probs=160.9

Q ss_pred             CCCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccc
Q 022074           35 DGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCL  114 (303)
Q Consensus        35 ~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~  114 (303)
                      .--|+.|+.+++|.+ ...+++|+-||.|+++|+.++.. .++..|..++.|+.+... ...+++|++|++|++||.|. 
T Consensus        50 ~~~~~~plL~c~F~d-~~~~~~G~~dg~vr~~Dln~~~~-~~igth~~~i~ci~~~~~-~~~vIsgsWD~~ik~wD~R~-  125 (323)
T KOG1036|consen   50 KFKHGAPLLDCAFAD-ESTIVTGGLDGQVRRYDLNTGNE-DQIGTHDEGIRCIEYSYE-VGCVISGSWDKTIKFWDPRN-  125 (323)
T ss_pred             heecCCceeeeeccC-CceEEEeccCceEEEEEecCCcc-eeeccCCCceEEEEeecc-CCeEEEcccCccEEEEeccc-
Confidence            358999999999996 56799999999999999999875 457789999999999754 55788999999999999873 


Q ss_pred             cCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccC-CC
Q 022074          115 NVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKH-PC  193 (303)
Q Consensus       115 ~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~  193 (303)
                         ......+. ....|.+++.  .++.|+.|..|..+.+||+|.+...++.  ....+.+..+.+..-|....... ..
T Consensus       126 ---~~~~~~~d-~~kkVy~~~v--~g~~LvVg~~~r~v~iyDLRn~~~~~q~--reS~lkyqtR~v~~~pn~eGy~~sSi  197 (323)
T KOG1036|consen  126 ---KVVVGTFD-QGKKVYCMDV--SGNRLVVGTSDRKVLIYDLRNLDEPFQR--RESSLKYQTRCVALVPNGEGYVVSSI  197 (323)
T ss_pred             ---cccccccc-cCceEEEEec--cCCEEEEeecCceEEEEEcccccchhhh--ccccceeEEEEEEEecCCCceEEEee
Confidence               11122221 2346777765  4668999999999999999987654422  12233444444433332111111 00


Q ss_pred             ---------------CCcceEEeccccee---eeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCe
Q 022074          194 ---------------DQSVATYKGHSVLR---TLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPV  255 (303)
Q Consensus       194 ---------------~~~~~~~~~~~~~~---~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I  255 (303)
                                     ......++.|....   .++-......|+|-.+.|||||.||.|-+||+.+++.++.+......|
T Consensus       198 eGRVavE~~d~s~~~~skkyaFkCHr~~~~~~~~~yPVNai~Fhp~~~tfaTgGsDG~V~~Wd~~~rKrl~q~~~~~~SI  277 (323)
T KOG1036|consen  198 EGRVAVEYFDDSEEAQSKKYAFKCHRLSEKDTEIIYPVNAIAFHPIHGTFATGGSDGIVNIWDLFNRKRLKQLAKYETSI  277 (323)
T ss_pred             cceEEEEccCCchHHhhhceeEEeeecccCCceEEEEeceeEeccccceEEecCCCceEEEccCcchhhhhhccCCCCce
Confidence                           11223344443221   111122334577777889999999999999999999998887777789


Q ss_pred             EEEEECCCCCeEEEEeC
Q 022074          256 RDCSWHPSQPMLVSSSW  272 (303)
Q Consensus       256 ~~v~~sp~~~~las~s~  272 (303)
                      .+++|+.||..||.|+.
T Consensus       278 ~slsfs~dG~~LAia~s  294 (323)
T KOG1036|consen  278 SSLSFSMDGSLLAIASS  294 (323)
T ss_pred             EEEEeccCCCeEEEEec
Confidence            99999999999999986


No 89 
>KOG1332 consensus Vesicle coat complex COPII, subunit SEC13 [Intracellular trafficking, secretion, and vesicular transport]
Probab=99.90  E-value=2.4e-22  Score=161.96  Aligned_cols=236  Identities=21%  Similarity=0.305  Sum_probs=171.5

Q ss_pred             CchhhccccccccccCcCcc-cccCCCcccceEEEEEcC--CCCEEEEeeCCCeEEEEECCCCceE--EEEecccCCeEE
Q 022074           12 SGTMESLANVTEIHDGLDFS-AADDGGYSFGIFSLKFST--DGRELVAGSSDDCIYVYDLEANKLS--LRILAHTSDVNT   86 (303)
Q Consensus        12 ~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~v~~l~~s~--~g~~l~sgs~Dg~v~lwd~~~~~~~--~~~~~h~~~v~~   86 (303)
                      ..++|.++.|.++=.+.+.. ..+..||++||..++|..  .|.+||+++.||.|.||.-.+++..  .....|...|++
T Consensus        28 TcsSD~tVkIf~v~~n~~s~ll~~L~Gh~GPVwqv~wahPk~G~iLAScsYDgkVIiWke~~g~w~k~~e~~~h~~SVNs  107 (299)
T KOG1332|consen   28 TCSSDGTVKIFEVRNNGQSKLLAELTGHSGPVWKVAWAHPKFGTILASCSYDGKVIIWKEENGRWTKAYEHAAHSASVNS  107 (299)
T ss_pred             eecCCccEEEEEEcCCCCceeeeEecCCCCCeeEEeecccccCcEeeEeecCceEEEEecCCCchhhhhhhhhhccccee
Confidence            45778888999986666533 445689999999999996  7999999999999999998888533  235578899999


Q ss_pred             EEEccC-CCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCC---C-----------CEEEEEeCCCc
Q 022074           87 VCFGDE-SGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGD---G-----------RYLISNGKDQA  151 (303)
Q Consensus        87 l~~~~~-~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~---~-----------~~l~s~~~D~~  151 (303)
                      ++|.|. .+-.|++++.||.|.+.+.+.... .........|.-+|+++++.|.   |           ..|++||.|..
T Consensus       108 V~wapheygl~LacasSDG~vsvl~~~~~g~-w~t~ki~~aH~~GvnsVswapa~~~g~~~~~~~~~~~krlvSgGcDn~  186 (299)
T KOG1332|consen  108 VAWAPHEYGLLLACASSDGKVSVLTYDSSGG-WTTSKIVFAHEIGVNSVSWAPASAPGSLVDQGPAAKVKRLVSGGCDNL  186 (299)
T ss_pred             ecccccccceEEEEeeCCCcEEEEEEcCCCC-ccchhhhhccccccceeeecCcCCCccccccCcccccceeeccCCccc
Confidence            999763 366899999999999988763311 1223456679999999988775   4           56999999999


Q ss_pred             EEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCC
Q 022074          152 IKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHD  231 (303)
Q Consensus       152 v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~d  231 (303)
                      |+||+.....             |..                   -..+.+|...  +-...+.|...--..++|++++|
T Consensus       187 VkiW~~~~~~-------------w~~-------------------e~~l~~H~dw--VRDVAwaP~~gl~~s~iAS~SqD  232 (299)
T KOG1332|consen  187 VKIWKFDSDS-------------WKL-------------------ERTLEGHKDW--VRDVAWAPSVGLPKSTIASCSQD  232 (299)
T ss_pred             eeeeecCCcc-------------hhh-------------------hhhhhhcchh--hhhhhhccccCCCceeeEEecCC
Confidence            9999875421             000                   0012222210  11122334333335689999999


Q ss_pred             CeEEEEECCCC-e--EEEEeecCCCCeEEEEECCCCCeEEEEeCCCCEEEeecC
Q 022074          232 SCVYVYDLVSG-E--QVAALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFP  282 (303)
Q Consensus       232 g~i~iwd~~~~-~--~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~  282 (303)
                      |++-||-.... +  +...++.-..+++.+.||+.|++|+.++.|+.+.+|.-.
T Consensus       233 g~viIwt~~~e~e~wk~tll~~f~~~~w~vSWS~sGn~LaVs~GdNkvtlwke~  286 (299)
T KOG1332|consen  233 GTVIIWTKDEEYEPWKKTLLEEFPDVVWRVSWSLSGNILAVSGGDNKVTLWKEN  286 (299)
T ss_pred             CcEEEEEecCccCcccccccccCCcceEEEEEeccccEEEEecCCcEEEEEEeC
Confidence            99999987522 1  122334455789999999999999999999999999854


No 90 
>KOG0289 consensus mRNA splicing factor [General function prediction only]
Probab=99.90  E-value=1.7e-21  Score=168.10  Aligned_cols=226  Identities=19%  Similarity=0.299  Sum_probs=174.0

Q ss_pred             EEccCchhhccccccccccCcCcccccCCCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEeccc--CCeE
Q 022074            8 VDVGSGTMESLANVTEIHDGLDFSAADDGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHT--SDVN   85 (303)
Q Consensus         8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~--~~v~   85 (303)
                      ++.+|+  |.-|-||-.  +++........|+.+|..+..+|+|+|+++++.|++....|..++.........+  -.++
T Consensus       276 v~~aSa--d~~i~vws~--~~~s~~~~~~~h~~~V~~ls~h~tgeYllsAs~d~~w~Fsd~~~g~~lt~vs~~~s~v~~t  351 (506)
T KOG0289|consen  276 VITASA--DEIIRVWSV--PLSSEPTSSRPHEEPVTGLSLHPTGEYLLSASNDGTWAFSDISSGSQLTVVSDETSDVEYT  351 (506)
T ss_pred             eeecCC--cceEEeecc--ccccCccccccccccceeeeeccCCcEEEEecCCceEEEEEccCCcEEEEEeeccccceeE
Confidence            344444  445566655  2333344457999999999999999999999999999999999998665443322  2478


Q ss_pred             EEEEccCCCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcc
Q 022074           86 TVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNAS  165 (303)
Q Consensus        86 ~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~  165 (303)
                      +.+|+| ++..|.+|..||.|++||++..+    ....|.+|.++|..++|+.+|-+|+++..|++|++||||+++...+
T Consensus       352 s~~fHp-DgLifgtgt~d~~vkiwdlks~~----~~a~Fpght~~vk~i~FsENGY~Lat~add~~V~lwDLRKl~n~kt  426 (506)
T KOG0289|consen  352 SAAFHP-DGLIFGTGTPDGVVKIWDLKSQT----NVAKFPGHTGPVKAISFSENGYWLATAADDGSVKLWDLRKLKNFKT  426 (506)
T ss_pred             EeeEcC-CceEEeccCCCceEEEEEcCCcc----ccccCCCCCCceeEEEeccCceEEEEEecCCeEEEEEehhhcccce
Confidence            889965 58899999999999999997432    4567889999999999999999999999999999999998763222


Q ss_pred             cccCccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECC--CCe
Q 022074          166 CNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLV--SGE  243 (303)
Q Consensus       166 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~--~~~  243 (303)
                      ..+.                       ....+                .+..|...|++|+.+|+|=+|++++-.  +..
T Consensus       427 ~~l~-----------------------~~~~v----------------~s~~fD~SGt~L~~~g~~l~Vy~~~k~~k~W~  467 (506)
T KOG0289|consen  427 IQLD-----------------------EKKEV----------------NSLSFDQSGTYLGIAGSDLQVYICKKKTKSWT  467 (506)
T ss_pred             eecc-----------------------ccccc----------------eeEEEcCCCCeEEeecceeEEEEEecccccce
Confidence            1110                       00000                012355678999999999888888854  445


Q ss_pred             EEEEeecCCCCeEEEEECCCCCeEEEEeCCCCEEEeec
Q 022074          244 QVAALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEF  281 (303)
Q Consensus       244 ~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~  281 (303)
                      ++..+..|.+..+.+.|....++++++|.|..+++.-+
T Consensus       468 ~~~~~~~~sg~st~v~Fg~~aq~l~s~smd~~l~~~a~  505 (506)
T KOG0289|consen  468 EIKELADHSGLSTGVRFGEHAQYLASTSMDAILRLYAL  505 (506)
T ss_pred             eeehhhhcccccceeeecccceEEeeccchhheEEeec
Confidence            67788889999999999999999999999999888653


No 91 
>KOG0301 consensus Phospholipase A2-activating protein (contains WD40 repeats) [Lipid transport and metabolism]
Probab=99.90  E-value=2.6e-22  Score=181.78  Aligned_cols=222  Identities=25%  Similarity=0.324  Sum_probs=171.4

Q ss_pred             EEEccCchhhccccccccccCcCcc-cccCCCcccceEE-EEEcC-CCCEEEEeeCCCeEEEEECCCCceEEEEecccCC
Q 022074            7 IVDVGSGTMESLANVTEIHDGLDFS-AADDGGYSFGIFS-LKFST-DGRELVAGSSDDCIYVYDLEANKLSLRILAHTSD   83 (303)
Q Consensus         7 ~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~v~~-l~~s~-~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~   83 (303)
                      .+-|++++-|....||+...+.-.. ..- .||..-|.. ++|-+ ++..+++|+.|.++.+|.+.+......+.+|...
T Consensus        25 ~~~i~s~sRd~t~~vw~~~~~~~l~~~~~-~~~~g~i~~~i~y~e~~~~~l~~g~~D~~i~v~~~~~~~P~~~LkgH~sn  103 (745)
T KOG0301|consen   25 GVCIISGSRDGTVKVWAKKGKQYLETHAF-EGPKGFIANSICYAESDKGRLVVGGMDTTIIVFKLSQAEPLYTLKGHKSN  103 (745)
T ss_pred             CeEEeecCCCCceeeeeccCcccccceec-ccCcceeeccceeccccCcceEeecccceEEEEecCCCCchhhhhccccc
Confidence            3457888889889999874433221 223 334433444 77775 5556999999999999999998888889999999


Q ss_pred             eEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCC
Q 022074           84 VNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSN  163 (303)
Q Consensus        84 v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~  163 (303)
                      |+++... .++. ++|||+|.++++|...      ...-.+.+|..+|+++.+-|++ .++||+.|++||+|.=.     
T Consensus       104 VC~ls~~-~~~~-~iSgSWD~TakvW~~~------~l~~~l~gH~asVWAv~~l~e~-~~vTgsaDKtIklWk~~-----  169 (745)
T KOG0301|consen  104 VCSLSIG-EDGT-LISGSWDSTAKVWRIG------ELVYSLQGHTASVWAVASLPEN-TYVTGSADKTIKLWKGG-----  169 (745)
T ss_pred             eeeeecC-CcCc-eEecccccceEEecch------hhhcccCCcchheeeeeecCCC-cEEeccCcceeeeccCC-----
Confidence            9999874 3344 8899999999999753      2334588999999999988887 79999999999999632     


Q ss_pred             cccccCccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCe
Q 022074          164 ASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGE  243 (303)
Q Consensus       164 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~  243 (303)
                                                     ..+.++.||..   ++|..   .+-+ ...+++++.||.|+.|++ +|+
T Consensus       170 -------------------------------~~l~tf~gHtD---~VRgL---~vl~-~~~flScsNDg~Ir~w~~-~ge  210 (745)
T KOG0301|consen  170 -------------------------------TLLKTFSGHTD---CVRGL---AVLD-DSHFLSCSNDGSIRLWDL-DGE  210 (745)
T ss_pred             -------------------------------chhhhhccchh---heeee---EEec-CCCeEeecCCceEEEEec-cCc
Confidence                                           12344555532   22211   1112 345899999999999999 788


Q ss_pred             EEEEeecCCCCeEEEEECCCCCeEEEEeCCCCEEEeecC
Q 022074          244 QVAALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFP  282 (303)
Q Consensus       244 ~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~  282 (303)
                      .+.++.+|+.-|.+++..+++..++|+++|+++++|+..
T Consensus       211 ~l~~~~ghtn~vYsis~~~~~~~Ivs~gEDrtlriW~~~  249 (745)
T KOG0301|consen  211 VLLEMHGHTNFVYSISMALSDGLIVSTGEDRTLRIWKKD  249 (745)
T ss_pred             eeeeeeccceEEEEEEecCCCCeEEEecCCceEEEeecC
Confidence            888999999999999988999999999999999999864


No 92 
>KOG0308 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=99.90  E-value=4.2e-23  Score=185.67  Aligned_cols=216  Identities=21%  Similarity=0.308  Sum_probs=159.8

Q ss_pred             CCcccceE---EEEEc-CCCCEEEEeeCCCeEEEEECCCCce------EEEEecccCCeEEEEEccCCCcEEEEecCCCe
Q 022074           36 GGYSFGIF---SLKFS-TDGRELVAGSSDDCIYVYDLEANKL------SLRILAHTSDVNTVCFGDESGHLIYSGSDDNL  105 (303)
Q Consensus        36 ~~~~~~v~---~l~~s-~~g~~l~sgs~Dg~v~lwd~~~~~~------~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~  105 (303)
                      ..|..+|.   ++..+ |++++|++||.||.|++|+......      ...+..|.+-|+.++... +++.|+|+|.|-+
T Consensus        18 ~qn~~~v~~~~~Lq~da~~~ryLfTgGRDg~i~~W~~~~d~~~~s~~~~asme~HsDWVNDiiL~~-~~~tlIS~SsDtT   96 (735)
T KOG0308|consen   18 KQNRNGVNITKALQLDAPNGRYLFTGGRDGIIRLWSVTQDSNEPSTPYIASMEHHSDWVNDIILCG-NGKTLISASSDTT   96 (735)
T ss_pred             hhccccccchhhccccCCCCceEEecCCCceEEEeccccccCCcccchhhhhhhhHhHHhhHHhhc-CCCceEEecCCce
Confidence            35555555   56666 5677899999999999998866432      345678999999998854 4778999999999


Q ss_pred             EEEEcCccccCCCccceeecccccCeEEEEe-CCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCC
Q 022074          106 CKVWDRRCLNVKGKPAGVLMGHLEGITFIDS-RGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPP  184 (303)
Q Consensus       106 v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~-~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~  184 (303)
                      |++|+.....  .....++..|.+.|.+++. .++..++||||-|+.|.+||+.........+.+..             
T Consensus        97 VK~W~~~~~~--~~c~stir~H~DYVkcla~~ak~~~lvaSgGLD~~IflWDin~~~~~l~~s~n~~-------------  161 (735)
T KOG0308|consen   97 VKVWNAHKDN--TFCMSTIRTHKDYVKCLAYIAKNNELVASGGLDRKIFLWDINTGTATLVASFNNV-------------  161 (735)
T ss_pred             EEEeecccCc--chhHhhhhcccchheeeeecccCceeEEecCCCccEEEEEccCcchhhhhhcccc-------------
Confidence            9999964221  1234456679999999998 77888999999999999999975422100000000             


Q ss_pred             CCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEECCCC
Q 022074          185 QARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQ  264 (303)
Q Consensus       185 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~  264 (303)
                      ....+..          |+..      ..++.+..+.+..+++||.++.+++||.++++++..+.+|+..|..+-.++||
T Consensus       162 t~~sl~s----------G~k~------siYSLA~N~t~t~ivsGgtek~lr~wDprt~~kimkLrGHTdNVr~ll~~dDG  225 (735)
T KOG0308|consen  162 TVNSLGS----------GPKD------SIYSLAMNQTGTIIVSGGTEKDLRLWDPRTCKKIMKLRGHTDNVRVLLVNDDG  225 (735)
T ss_pred             ccccCCC----------CCcc------ceeeeecCCcceEEEecCcccceEEeccccccceeeeeccccceEEEEEcCCC
Confidence            0000000          1110      11122334567889999999999999999999999999999999999999999


Q ss_pred             CeEEEEeCCCCEEEeecCC
Q 022074          265 PMLVSSSWDGDVVRWEFPG  283 (303)
Q Consensus       265 ~~las~s~Dg~i~~Wd~~~  283 (303)
                      +.++|+|.||+|++||+..
T Consensus       226 t~~ls~sSDgtIrlWdLgq  244 (735)
T KOG0308|consen  226 TRLLSASSDGTIRLWDLGQ  244 (735)
T ss_pred             CeEeecCCCceEEeeeccc
Confidence            9999999999999999853


No 93 
>KOG0264 consensus Nucleosome remodeling factor, subunit CAF1/NURF55/MSI1 [Chromatin structure and dynamics]
Probab=99.89  E-value=3.7e-22  Score=173.02  Aligned_cols=210  Identities=21%  Similarity=0.342  Sum_probs=158.1

Q ss_pred             cccceEEEEEcCCCC-EEEEeeCCCeEEEEECCCCce----------EEEEecccCCeEEEEEccCCCcEEEEecCCCeE
Q 022074           38 YSFGIFSLKFSTDGR-ELVAGSSDDCIYVYDLEANKL----------SLRILAHTSDVNTVCFGDESGHLIYSGSDDNLC  106 (303)
Q Consensus        38 ~~~~v~~l~~s~~g~-~l~sgs~Dg~v~lwd~~~~~~----------~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v  106 (303)
                      |...|..+.+-|+.. .+++++..+.|.|||.....-          -.++.+|.+.-..++|++...-.|++++.|++|
T Consensus       123 h~gEVnRaRymPQnp~iVAt~t~~~dv~Vfd~tk~~s~~~~~~~~~Pdl~L~gH~~eg~glsWn~~~~g~Lls~~~d~~i  202 (422)
T KOG0264|consen  123 HDGEVNRARYMPQNPNIVATKTSSGDVYVFDYTKHPSKPKASGECRPDLRLKGHEKEGYGLSWNRQQEGTLLSGSDDHTI  202 (422)
T ss_pred             CCccchhhhhCCCCCcEEEecCCCCCEEEEEeccCCCcccccccCCCceEEEeecccccccccccccceeEeeccCCCcE
Confidence            556677777777544 677788899999999865321          136889988778899987666688899999999


Q ss_pred             EEEcCccccCC---CccceeecccccCeEEEEeCC-CCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeC
Q 022074          107 KVWDRRCLNVK---GKPAGVLMGHLEGITFIDSRG-DGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDY  182 (303)
Q Consensus       107 ~lWd~~~~~~~---~~~~~~~~~h~~~v~~~~~~~-~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~  182 (303)
                      ++||+......   ..+...+.+|.+.|..++|++ +..+|++++.|+.+.|||+|..  .....               
T Consensus       203 ~lwdi~~~~~~~~~~~p~~~~~~h~~~VeDV~~h~~h~~lF~sv~dd~~L~iwD~R~~--~~~~~---------------  265 (422)
T KOG0264|consen  203 CLWDINAESKEDKVVDPKTIFSGHEDVVEDVAWHPLHEDLFGSVGDDGKLMIWDTRSN--TSKPS---------------  265 (422)
T ss_pred             EEEeccccccCCccccceEEeecCCcceehhhccccchhhheeecCCCeEEEEEcCCC--CCCCc---------------
Confidence            99998754432   345667889999999999987 4567899999999999999952  11110               


Q ss_pred             CCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCe-EEEEeecCCCCeEEEEEC
Q 022074          183 PPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGE-QVAALKYHTSPVRDCSWH  261 (303)
Q Consensus       183 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~-~~~~~~~h~~~I~~v~~s  261 (303)
                                     ....+|.....+  +.|+|   .++..|||||.|++|++||+++.+ ++..+++|+..|..|.||
T Consensus       266 ---------------~~~~ah~~~vn~--~~fnp---~~~~ilAT~S~D~tV~LwDlRnL~~~lh~~e~H~dev~~V~WS  325 (422)
T KOG0264|consen  266 ---------------HSVKAHSAEVNC--VAFNP---FNEFILATGSADKTVALWDLRNLNKPLHTFEGHEDEVFQVEWS  325 (422)
T ss_pred             ---------------ccccccCCceeE--EEeCC---CCCceEEeccCCCcEEEeechhcccCceeccCCCcceEEEEeC
Confidence                           011112111111  12222   246789999999999999999875 488999999999999999


Q ss_pred             CCC-CeEEEEeCCCCEEEeecCCC
Q 022074          262 PSQ-PMLVSSSWDGDVVRWEFPGN  284 (303)
Q Consensus       262 p~~-~~las~s~Dg~i~~Wd~~~~  284 (303)
                      |.. ..|||++.|+.+.+||+...
T Consensus       326 Ph~etvLASSg~D~rl~vWDls~i  349 (422)
T KOG0264|consen  326 PHNETVLASSGTDRRLNVWDLSRI  349 (422)
T ss_pred             CCCCceeEecccCCcEEEEecccc
Confidence            985 58999999999999998654


No 94 
>KOG0300 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.89  E-value=2e-22  Score=167.97  Aligned_cols=212  Identities=23%  Similarity=0.354  Sum_probs=171.0

Q ss_pred             CCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCcc--
Q 022074           36 GGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRC--  113 (303)
Q Consensus        36 ~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~--  113 (303)
                      .||+.+|+.++-......+.++|.|.+.+||.++++....++.+|.+.|+++.|+ +.+.++++++.|++.++|....  
T Consensus       145 ~GHkDGiW~Vaa~~tqpi~gtASADhTA~iWs~Esg~CL~~Y~GH~GSVNsikfh-~s~~L~lTaSGD~taHIW~~av~~  223 (481)
T KOG0300|consen  145 EGHKDGIWHVAADSTQPICGTASADHTARIWSLESGACLATYTGHTGSVNSIKFH-NSGLLLLTASGDETAHIWKAAVNW  223 (481)
T ss_pred             cccccceeeehhhcCCcceeecccccceeEEeeccccceeeecccccceeeEEec-cccceEEEccCCcchHHHHHhhcC
Confidence            6999999999998877899999999999999999999999999999999999996 4688999999999999996210  


Q ss_pred             --cc----------------------------C----CCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEccc
Q 022074          114 --LN----------------------------V----KGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRK  159 (303)
Q Consensus       114 --~~----------------------------~----~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~  159 (303)
                        ..                            .    ...|...+.||...|.+.+|-..|.+++|++.|++..+||+..
T Consensus       224 ~vP~~~a~~~hSsEeE~e~sDe~~~d~d~~~~sD~~tiRvPl~~ltgH~~vV~a~dWL~gg~Q~vTaSWDRTAnlwDVEt  303 (481)
T KOG0300|consen  224 EVPSNNAPSDHSSEEEEEHSDEHNRDTDSSEKSDGHTIRVPLMRLTGHRAVVSACDWLAGGQQMVTASWDRTANLWDVET  303 (481)
T ss_pred             cCCCCCCCCCCCchhhhhcccccccccccccccCCceeeeeeeeeeccccceEehhhhcCcceeeeeeccccceeeeecc
Confidence              00                            0    0124456789999999999988999999999999999999875


Q ss_pred             ccCCcccccCccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEEC
Q 022074          160 MSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDL  239 (303)
Q Consensus       160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~  239 (303)
                      ...                                  +..+.||...-  ..|.    -+|.++++++.+-|.+.++||.
T Consensus       304 ge~----------------------------------v~~LtGHd~EL--tHcs----tHptQrLVvTsSrDtTFRLWDF  343 (481)
T KOG0300|consen  304 GEV----------------------------------VNILTGHDSEL--THCS----THPTQRLVVTSSRDTTFRLWDF  343 (481)
T ss_pred             Cce----------------------------------eccccCcchhc--cccc----cCCcceEEEEeccCceeEeccc
Confidence            332                                  22334443211  1111    2467899999999999999999


Q ss_pred             CCC-eEEEEeecCCCCeEEEEECCCCCeEEEEeCCCCEEEeecCCCCccCC
Q 022074          240 VSG-EQVAALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFPGNGEAAP  289 (303)
Q Consensus       240 ~~~-~~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~~~~~~~~  289 (303)
                      +.. ..+..|.+|++.|+++.|..+.+ +++|++|.++++||+..++....
T Consensus       344 ReaI~sV~VFQGHtdtVTS~vF~~dd~-vVSgSDDrTvKvWdLrNMRsplA  393 (481)
T KOG0300|consen  344 REAIQSVAVFQGHTDTVTSVVFNTDDR-VVSGSDDRTVKVWDLRNMRSPLA  393 (481)
T ss_pred             hhhcceeeeecccccceeEEEEecCCc-eeecCCCceEEEeeeccccCcce
Confidence            743 34788999999999999988765 78999999999999998866543


No 95 
>KOG0267 consensus Microtubule severing protein katanin p80 subunit B (contains WD40 repeats) [Cell cycle control, cell division, chromosome partitioning]
Probab=99.89  E-value=2.4e-23  Score=189.22  Aligned_cols=202  Identities=28%  Similarity=0.480  Sum_probs=173.6

Q ss_pred             CCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCcccc
Q 022074           36 GGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLN  115 (303)
Q Consensus        36 ~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~  115 (303)
                      ..|...|.++..-..++.+++|++|..+.||....-.....+.+|..+|.++.|+.+ ..++++|+.+|+|++||+.   
T Consensus        25 ~~hsaav~~lk~~~s~r~~~~Gg~~~k~~L~~i~kp~~i~S~~~hespIeSl~f~~~-E~LlaagsasgtiK~wDle---  100 (825)
T KOG0267|consen   25 VAHSAAVGCLKIRKSSRSLVTGGEDEKVNLWAIGKPNAITSLTGHESPIESLTFDTS-ERLLAAGSASGTIKVWDLE---  100 (825)
T ss_pred             hhhhhhhceeeeeccceeeccCCCceeeccccccCCchhheeeccCCcceeeecCcc-hhhhcccccCCceeeeehh---
Confidence            688899999888778889999999999999988665555668899999999999654 5688899999999999986   


Q ss_pred             CCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCC
Q 022074          116 VKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQ  195 (303)
Q Consensus       116 ~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  195 (303)
                       ..+..+.+.||...+..++|+|-+.++++|+.|.-+++||.|+.-                                  
T Consensus       101 -eAk~vrtLtgh~~~~~sv~f~P~~~~~a~gStdtd~~iwD~Rk~G----------------------------------  145 (825)
T KOG0267|consen  101 -EAKIVRTLTGHLLNITSVDFHPYGEFFASGSTDTDLKIWDIRKKG----------------------------------  145 (825)
T ss_pred             -hhhhhhhhhccccCcceeeeccceEEeccccccccceehhhhccC----------------------------------
Confidence             455677899999999999999999999999999999999998532                                  


Q ss_pred             cceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEECCCCCeEEEEeCCCC
Q 022074          196 SVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQPMLVSSSWDGD  275 (303)
Q Consensus       196 ~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~  275 (303)
                      +...+++|.....+      ..|+|+|++++.|++|..++|||...|+.+.+|+.|++++.++.|+|..-++++||.|++
T Consensus       146 c~~~~~s~~~vv~~------l~lsP~Gr~v~~g~ed~tvki~d~~agk~~~ef~~~e~~v~sle~hp~e~Lla~Gs~d~t  219 (825)
T KOG0267|consen  146 CSHTYKSHTRVVDV------LRLSPDGRWVASGGEDNTVKIWDLTAGKLSKEFKSHEGKVQSLEFHPLEVLLAPGSSDRT  219 (825)
T ss_pred             ceeeecCCcceeEE------EeecCCCceeeccCCcceeeeecccccccccccccccccccccccCchhhhhccCCCCce
Confidence            23334443322222      237899999999999999999999999999999999999999999999999999999999


Q ss_pred             EEEeecC
Q 022074          276 VVRWEFP  282 (303)
Q Consensus       276 i~~Wd~~  282 (303)
                      +++||+.
T Consensus       220 v~f~dle  226 (825)
T KOG0267|consen  220 VRFWDLE  226 (825)
T ss_pred             eeeeccc
Confidence            9999986


No 96 
>KOG0275 consensus Conserved WD40 repeat-containing protein [General function prediction only]
Probab=99.89  E-value=1.5e-22  Score=169.22  Aligned_cols=202  Identities=23%  Similarity=0.378  Sum_probs=162.4

Q ss_pred             CcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceE--------EEEecccCCeEEEEEccCCCcEEEEecCCCeEEE
Q 022074           37 GYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLS--------LRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKV  108 (303)
Q Consensus        37 ~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~--------~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~l  108 (303)
                      |-..-+-|..|||||+++++||.||.|.+|+..+|++.        ..+.-+++.|.|+.|+. +..++++|+.||.|++
T Consensus       211 g~KSh~EcA~FSPDgqyLvsgSvDGFiEVWny~~GKlrKDLkYQAqd~fMMmd~aVlci~FSR-DsEMlAsGsqDGkIKv  289 (508)
T KOG0275|consen  211 GQKSHVECARFSPDGQYLVSGSVDGFIEVWNYTTGKLRKDLKYQAQDNFMMMDDAVLCISFSR-DSEMLASGSQDGKIKV  289 (508)
T ss_pred             ccccchhheeeCCCCceEeeccccceeeeehhccchhhhhhhhhhhcceeecccceEEEeecc-cHHHhhccCcCCcEEE
Confidence            55567889999999999999999999999999998754        23556788999999975 5789999999999999


Q ss_pred             EcCccccCCCccceeec-ccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCc
Q 022074          109 WDRRCLNVKGKPAGVLM-GHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQAR  187 (303)
Q Consensus       109 Wd~~~~~~~~~~~~~~~-~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  187 (303)
                      |.++    ++...+.|. .|..+|+++.|+.|+..+++++.|.++|+--+...+                          
T Consensus       290 Wri~----tG~ClRrFdrAHtkGvt~l~FSrD~SqiLS~sfD~tvRiHGlKSGK--------------------------  339 (508)
T KOG0275|consen  290 WRIE----TGQCLRRFDRAHTKGVTCLSFSRDNSQILSASFDQTVRIHGLKSGK--------------------------  339 (508)
T ss_pred             EEEe----cchHHHHhhhhhccCeeEEEEccCcchhhcccccceEEEeccccch--------------------------
Confidence            9976    344455565 699999999999999999999999999999876432                          


Q ss_pred             cccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEee--cCCCCeEEEEECCCCC
Q 022074          188 DLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALK--YHTSPVRDCSWHPSQP  265 (303)
Q Consensus       188 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~--~h~~~I~~v~~sp~~~  265 (303)
                              ++..+.||.....      ...|+++|..+++++.||+|++|+.++.+++.+++  +...+|+++..-|..+
T Consensus       340 --------~LKEfrGHsSyvn------~a~ft~dG~~iisaSsDgtvkvW~~KtteC~~Tfk~~~~d~~vnsv~~~PKnp  405 (508)
T KOG0275|consen  340 --------CLKEFRGHSSYVN------EATFTDDGHHIISASSDGTVKVWHGKTTECLSTFKPLGTDYPVNSVILLPKNP  405 (508)
T ss_pred             --------hHHHhcCcccccc------ceEEcCCCCeEEEecCCccEEEecCcchhhhhhccCCCCcccceeEEEcCCCC
Confidence                    2334445542221      23477899999999999999999999999988886  3456899999888664


Q ss_pred             -eEEEEeCCCCEEEeecCC
Q 022074          266 -MLVSSSWDGDVVRWEFPG  283 (303)
Q Consensus       266 -~las~s~Dg~i~~Wd~~~  283 (303)
                       .++.+...+++.+-++++
T Consensus       406 eh~iVCNrsntv~imn~qG  424 (508)
T KOG0275|consen  406 EHFIVCNRSNTVYIMNMQG  424 (508)
T ss_pred             ceEEEEcCCCeEEEEeccc
Confidence             677777778888877653


No 97 
>KOG0306 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=99.88  E-value=1.2e-21  Score=178.80  Aligned_cols=201  Identities=24%  Similarity=0.321  Sum_probs=160.3

Q ss_pred             CCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCC-----CceEE-------EEecccCCeEEEEEccCCCcEEEEecCC
Q 022074           36 GGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEA-----NKLSL-------RILAHTSDVNTVCFGDESGHLIYSGSDD  103 (303)
Q Consensus        36 ~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~-----~~~~~-------~~~~h~~~v~~l~~~~~~~~~l~s~s~d  103 (303)
                      .+|+.+|.+++.+||++.+++||.|.+|++||..-     +....       +...-...|.|+.++| ++++|+.+-.|
T Consensus       451 ~AHdgaIWsi~~~pD~~g~vT~saDktVkfWdf~l~~~~~gt~~k~lsl~~~rtLel~ddvL~v~~Sp-dgk~LaVsLLd  529 (888)
T KOG0306|consen  451 RAHDGAIWSISLSPDNKGFVTGSADKTVKFWDFKLVVSVPGTQKKVLSLKHTRTLELEDDVLCVSVSP-DGKLLAVSLLD  529 (888)
T ss_pred             hccccceeeeeecCCCCceEEecCCcEEEEEeEEEEeccCcccceeeeeccceEEeccccEEEEEEcC-CCcEEEEEecc
Confidence            38999999999999999999999999999997642     21111       1222346799999975 58899999999


Q ss_pred             CeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCC
Q 022074          104 NLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYP  183 (303)
Q Consensus       104 g~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~  183 (303)
                      .+|++|=+...    +..-.+.||.=+|.++++++|+.+++|||.|+.|++|-+.-..+.    .               
T Consensus       530 nTVkVyflDtl----KFflsLYGHkLPV~smDIS~DSklivTgSADKnVKiWGLdFGDCH----K---------------  586 (888)
T KOG0306|consen  530 NTVKVYFLDTL----KFFLSLYGHKLPVLSMDISPDSKLIVTGSADKNVKIWGLDFGDCH----K---------------  586 (888)
T ss_pred             CeEEEEEecce----eeeeeecccccceeEEeccCCcCeEEeccCCCceEEeccccchhh----h---------------
Confidence            99999976532    334467799999999999999999999999999999977532211    1               


Q ss_pred             CCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEECCC
Q 022074          184 PQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSWHPS  263 (303)
Q Consensus       184 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~  263 (303)
                                     ++-+|+..  ++    +..|-|...++.++|.|+.|+-||-+..+++..+.+|...|++++.+|+
T Consensus       587 ---------------S~fAHdDS--vm----~V~F~P~~~~FFt~gKD~kvKqWDg~kFe~iq~L~~H~~ev~cLav~~~  645 (888)
T KOG0306|consen  587 ---------------SFFAHDDS--VM----SVQFLPKTHLFFTCGKDGKVKQWDGEKFEEIQKLDGHHSEVWCLAVSPN  645 (888)
T ss_pred             ---------------hhhcccCc--ee----EEEEcccceeEEEecCcceEEeechhhhhhheeeccchheeeeeEEcCC
Confidence                           11111111  11    1224566788999999999999999999999999999999999999999


Q ss_pred             CCeEEEEeCCCCEEEeec
Q 022074          264 QPMLVSSSWDGDVVRWEF  281 (303)
Q Consensus       264 ~~~las~s~Dg~i~~Wd~  281 (303)
                      |.+++|+|.|.+|++|.-
T Consensus       646 G~~vvs~shD~sIRlwE~  663 (888)
T KOG0306|consen  646 GSFVVSSSHDKSIRLWER  663 (888)
T ss_pred             CCeEEeccCCceeEeeec
Confidence            999999999999999984


No 98 
>KOG0772 consensus Uncharacterized conserved protein, contains WD40 repeat [Function unknown]
Probab=99.88  E-value=1.2e-21  Score=172.22  Aligned_cols=219  Identities=18%  Similarity=0.283  Sum_probs=157.1

Q ss_pred             cCCCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEE----EEec-ccCCeEEEEEccCCCcEEEEecCCCeEEE
Q 022074           34 DDGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSL----RILA-HTSDVNTVCFGDESGHLIYSGSDDNLCKV  108 (303)
Q Consensus        34 ~~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~----~~~~-h~~~v~~l~~~~~~~~~l~s~s~dg~v~l  108 (303)
                      -..+|+..|.++++.|.|-++++||.|-+|++||...-....    ++.. ....|+.+.|++ .++.|++.+...+.+|
T Consensus       162 ~l~hgtk~Vsal~~Dp~GaR~~sGs~Dy~v~~wDf~gMdas~~~fr~l~P~E~h~i~sl~ys~-Tg~~iLvvsg~aqakl  240 (641)
T KOG0772|consen  162 QLKHGTKIVSALAVDPSGARFVSGSLDYTVKFWDFQGMDASMRSFRQLQPCETHQINSLQYSV-TGDQILVVSGSAQAKL  240 (641)
T ss_pred             eccCCceEEEEeeecCCCceeeeccccceEEEEecccccccchhhhccCcccccccceeeecC-CCCeEEEEecCcceeE
Confidence            336899999999999999999999999999999997543221    1222 234689999965 5778888888889999


Q ss_pred             EcCccccCCC-----cc---ceeecccccCeEEEEeCCCC-CEEEEEeCCCcEEEEEcccccCCcccccCccceeeecee
Q 022074          109 WDRRCLNVKG-----KP---AGVLMGHLEGITFIDSRGDG-RYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRW  179 (303)
Q Consensus       109 Wd~~~~~~~~-----~~---~~~~~~h~~~v~~~~~~~~~-~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~  179 (303)
                      +|........     +.   .....||...+++..|+|.. +.|+|++.|+++|+||+...+.....             
T Consensus       241 ~DRdG~~~~e~~KGDQYI~Dm~nTKGHia~lt~g~whP~~k~~FlT~s~DgtlRiWdv~~~k~q~qV-------------  307 (641)
T KOG0772|consen  241 LDRDGFEIVEFSKGDQYIRDMYNTKGHIAELTCGCWHPDNKEEFLTCSYDGTLRIWDVNNTKSQLQV-------------  307 (641)
T ss_pred             EccCCceeeeeeccchhhhhhhccCCceeeeeccccccCcccceEEecCCCcEEEEecCCchhheeE-------------
Confidence            9964321100     01   11235899999999999864 56999999999999998754321110             


Q ss_pred             eeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCe---EEEEeecCCC--C
Q 022074          180 MDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGE---QVAALKYHTS--P  254 (303)
Q Consensus       180 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~---~~~~~~~h~~--~  254 (303)
                               +++...+      +..  .....|.    |+++++++|+|..||.|.+||..+..   ..+.-++|..  .
T Consensus       308 ---------ik~k~~~------g~R--v~~tsC~----~nrdg~~iAagc~DGSIQ~W~~~~~~v~p~~~vk~AH~~g~~  366 (641)
T KOG0772|consen  308 ---------IKTKPAG------GKR--VPVTSCA----WNRDGKLIAAGCLDGSIQIWDKGSRTVRPVMKVKDAHLPGQD  366 (641)
T ss_pred             ---------EeeccCC------Ccc--cCceeee----cCCCcchhhhcccCCceeeeecCCcccccceEeeeccCCCCc
Confidence                     0000000      000  0111232    66789999999999999999975442   1334467876  8


Q ss_pred             eEEEEECCCCCeEEEEeCCCCEEEeecCCCCcc
Q 022074          255 VRDCSWHPSQPMLVSSSWDGDVVRWEFPGNGEA  287 (303)
Q Consensus       255 I~~v~~sp~~~~las~s~Dg~i~~Wd~~~~~~~  287 (303)
                      |++++||+||++|+|-|.|+++++||+....+.
T Consensus       367 Itsi~FS~dg~~LlSRg~D~tLKvWDLrq~kkp  399 (641)
T KOG0772|consen  367 ITSISFSYDGNYLLSRGFDDTLKVWDLRQFKKP  399 (641)
T ss_pred             eeEEEeccccchhhhccCCCceeeeeccccccc
Confidence            999999999999999999999999999865433


No 99 
>KOG0308 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=99.88  E-value=1.2e-21  Score=176.49  Aligned_cols=232  Identities=19%  Similarity=0.287  Sum_probs=181.5

Q ss_pred             chhhccccccccccCcCc-cccc---CCCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCc--eEEEEecccCCeEE
Q 022074           13 GTMESLANVTEIHDGLDF-SAAD---DGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANK--LSLRILAHTSDVNT   86 (303)
Q Consensus        13 ~~~~~~~~~~~~~~~~~~-~~~~---~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~--~~~~~~~h~~~v~~   86 (303)
                      |.-|..|-+|.+-.-.++ +++.   -..|+..|..+....+|+.++++|.|-+|++|+...+.  ....+..|++-|.|
T Consensus        43 gGRDg~i~~W~~~~d~~~~s~~~~asme~HsDWVNDiiL~~~~~tlIS~SsDtTVK~W~~~~~~~~c~stir~H~DYVkc  122 (735)
T KOG0308|consen   43 GGRDGIIRLWSVTQDSNEPSTPYIASMEHHSDWVNDIILCGNGKTLISASSDTTVKVWNAHKDNTFCMSTIRTHKDYVKC  122 (735)
T ss_pred             cCCCceEEEeccccccCCcccchhhhhhhhHhHHhhHHhhcCCCceEEecCCceEEEeecccCcchhHhhhhcccchhee
Confidence            445666666666333322 2111   14799999999999999999999999999999998774  23456789999999


Q ss_pred             EEEccCCCcEEEEecCCCeEEEEcCcccc------CCCccceeec-ccccCeEEEEeCCCCCEEEEEeCCCcEEEEEccc
Q 022074           87 VCFGDESGHLIYSGSDDNLCKVWDRRCLN------VKGKPAGVLM-GHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRK  159 (303)
Q Consensus        87 l~~~~~~~~~l~s~s~dg~v~lWd~~~~~------~~~~~~~~~~-~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~  159 (303)
                      +++--++..+++||+.|+.|.+||+....      .+..+...+. |+.++|.+++-++.|..|++||.++.+|+||.|.
T Consensus       123 la~~ak~~~lvaSgGLD~~IflWDin~~~~~l~~s~n~~t~~sl~sG~k~siYSLA~N~t~t~ivsGgtek~lr~wDprt  202 (735)
T KOG0308|consen  123 LAYIAKNNELVASGGLDRKIFLWDINTGTATLVASFNNVTVNSLGSGPKDSIYSLAMNQTGTIIVSGGTEKDLRLWDPRT  202 (735)
T ss_pred             eeecccCceeEEecCCCccEEEEEccCcchhhhhhccccccccCCCCCccceeeeecCCcceEEEecCcccceEEecccc
Confidence            99833567799999999999999986331      1112222333 8999999999999999999999999999999874


Q ss_pred             ccCCcccccCccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEEC
Q 022074          160 MSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDL  239 (303)
Q Consensus       160 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~  239 (303)
                      .+                                  .+..+.||......+      ..+.||..+++|++||+|++||+
T Consensus       203 ~~----------------------------------kimkLrGHTdNVr~l------l~~dDGt~~ls~sSDgtIrlWdL  242 (735)
T KOG0308|consen  203 CK----------------------------------KIMKLRGHTDNVRVL------LVNDDGTRLLSASSDGTIRLWDL  242 (735)
T ss_pred             cc----------------------------------ceeeeeccccceEEE------EEcCCCCeEeecCCCceEEeeec
Confidence            22                                  233444554322222      24678999999999999999999


Q ss_pred             CCCeEEEEeecCCCCeEEEEECCCCCeEEEEeCCCCEEEeecCCC
Q 022074          240 VSGEQVAALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFPGN  284 (303)
Q Consensus       240 ~~~~~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~~~  284 (303)
                      ....++.++..|+..||++.-+|+-..+.+|+.||.|..=|+...
T Consensus       243 gqQrCl~T~~vH~e~VWaL~~~~sf~~vYsG~rd~~i~~Tdl~n~  287 (735)
T KOG0308|consen  243 GQQRCLATYIVHKEGVWALQSSPSFTHVYSGGRDGNIYRTDLRNP  287 (735)
T ss_pred             cccceeeeEEeccCceEEEeeCCCcceEEecCCCCcEEecccCCc
Confidence            999999999999999999999999999999999999999888754


No 100
>KOG0639 consensus Transducin-like enhancer of split protein (contains WD40 repeats) [Chromatin structure and dynamics]
Probab=99.88  E-value=1.1e-21  Score=172.00  Aligned_cols=232  Identities=21%  Similarity=0.324  Sum_probs=171.5

Q ss_pred             ccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEE--EEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccC
Q 022074           39 SFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSL--RILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNV  116 (303)
Q Consensus        39 ~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~--~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~  116 (303)
                      .--|.++.+.|||+.|++|+.-.++.|||+..-....  ++......+.+++.++ +.++++++..||.|++||+.    
T Consensus       465 dnyiRSckL~pdgrtLivGGeastlsiWDLAapTprikaeltssapaCyALa~sp-DakvcFsccsdGnI~vwDLh----  539 (705)
T KOG0639|consen  465 DNYIRSCKLLPDGRTLIVGGEASTLSIWDLAAPTPRIKAELTSSAPACYALAISP-DAKVCFSCCSDGNIAVWDLH----  539 (705)
T ss_pred             ccceeeeEecCCCceEEeccccceeeeeeccCCCcchhhhcCCcchhhhhhhcCC-ccceeeeeccCCcEEEEEcc----
Confidence            3458899999999999999999999999998765332  2222234566777765 57899999999999999986    


Q ss_pred             CCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCc
Q 022074          117 KGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQS  196 (303)
Q Consensus       117 ~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  196 (303)
                      +...++.|+||.+++.|++++++|..|-|||-|.+||.||+|........  .|.+   ++..+.+.|+...+...+...
T Consensus       540 nq~~VrqfqGhtDGascIdis~dGtklWTGGlDntvRcWDlregrqlqqh--dF~S---QIfSLg~cP~~dWlavGMens  614 (705)
T KOG0639|consen  540 NQTLVRQFQGHTDGASCIDISKDGTKLWTGGLDNTVRCWDLREGRQLQQH--DFSS---QIFSLGYCPTGDWLAVGMENS  614 (705)
T ss_pred             cceeeecccCCCCCceeEEecCCCceeecCCCccceeehhhhhhhhhhhh--hhhh---hheecccCCCccceeeecccC
Confidence            34567889999999999999999999999999999999999975443222  1111   112233445555444443322


Q ss_pred             -ceEEecccceee----eEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEECCCCCeEEEEe
Q 022074          197 -VATYKGHSVLRT----LIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQPMLVSSS  271 (303)
Q Consensus       197 -~~~~~~~~~~~~----~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~las~s  271 (303)
                       +..+.-....++    ...|..+..|++-|+++++-|.|.-+-.|..--|..+...+ -..+|.+|+.|.|.++++|||
T Consensus       615 ~vevlh~skp~kyqlhlheScVLSlKFa~cGkwfvStGkDnlLnawrtPyGasiFqsk-E~SsVlsCDIS~ddkyIVTGS  693 (705)
T KOG0639|consen  615 NVEVLHTSKPEKYQLHLHESCVLSLKFAYCGKWFVSTGKDNLLNAWRTPYGASIFQSK-ESSSVLSCDISFDDKYIVTGS  693 (705)
T ss_pred             cEEEEecCCccceeecccccEEEEEEecccCceeeecCchhhhhhccCccccceeecc-ccCcceeeeeccCceEEEecC
Confidence             111110000011    12355677788899999999999999999988888777665 346899999999999999999


Q ss_pred             CCCCEEEeec
Q 022074          272 WDGDVVRWEF  281 (303)
Q Consensus       272 ~Dg~i~~Wd~  281 (303)
                      .|....++.+
T Consensus       694 GdkkATVYeV  703 (705)
T KOG0639|consen  694 GDKKATVYEV  703 (705)
T ss_pred             CCcceEEEEE
Confidence            9999988875


No 101
>KOG0302 consensus Ribosome Assembly protein [General function prediction only]
Probab=99.87  E-value=6.5e-21  Score=162.05  Aligned_cols=209  Identities=21%  Similarity=0.366  Sum_probs=153.7

Q ss_pred             CcccceEEEEEcCCCC--EEEEeeCCCeEEEEECCCC----------------ceEEEEecccCCeEEEEEccCCCcEEE
Q 022074           37 GYSFGIFSLKFSTDGR--ELVAGSSDDCIYVYDLEAN----------------KLSLRILAHTSDVNTVCFGDESGHLIY   98 (303)
Q Consensus        37 ~~~~~v~~l~~s~~g~--~l~sgs~Dg~v~lwd~~~~----------------~~~~~~~~h~~~v~~l~~~~~~~~~l~   98 (303)
                      +|...+.-+.-++-|+  ..++=+..|.|.||++...                +.+.++.+|.+.-..++|+|-..-.|+
T Consensus       149 ~h~g~~NRvr~~~~~~~~~~aswse~G~V~Vw~l~~~l~~l~~~~~~~~~s~~~Pl~t~~ghk~EGy~LdWSp~~~g~Ll  228 (440)
T KOG0302|consen  149 PHYGGINRVRVSRLGNEVLCASWSENGRVQVWDLAPHLNALSEPGLEVKDSEFRPLFTFNGHKGEGYGLDWSPIKTGRLL  228 (440)
T ss_pred             ccccccceeeecccCCcceeeeecccCcEEEEEchhhhhhhcCccccccccccCceEEecccCccceeeecccccccccc
Confidence            7777888877776554  4555567899999998642                123456788888899999875555788


Q ss_pred             EecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCC-CCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeec
Q 022074           99 SGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRG-DGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDY  177 (303)
Q Consensus        99 s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~-~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~  177 (303)
                      ||..-+.|++|......-. .-...+.+|..+|-.+.++| ....|+|||.|++|||||+|......             
T Consensus       229 sGDc~~~I~lw~~~~g~W~-vd~~Pf~gH~~SVEDLqWSptE~~vfaScS~DgsIrIWDiRs~~~~~-------------  294 (440)
T KOG0302|consen  229 SGDCVKGIHLWEPSTGSWK-VDQRPFTGHTKSVEDLQWSPTEDGVFASCSCDGSIRIWDIRSGPKKA-------------  294 (440)
T ss_pred             cCccccceEeeeeccCcee-ecCccccccccchhhhccCCccCceEEeeecCceEEEEEecCCCccc-------------
Confidence            9999999999976432110 11235778999999999988 45789999999999999998642111             


Q ss_pred             eeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCC---CeEEEEeecCCCC
Q 022074          178 RWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVS---GEQVAALKYHTSP  254 (303)
Q Consensus       178 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~---~~~~~~~~~h~~~  254 (303)
                                        ++. .+.|.....++      .++....+||+|+.||+++|||+++   ++.+..|+.|..|
T Consensus       295 ------------------~~~-~kAh~sDVNVI------SWnr~~~lLasG~DdGt~~iwDLR~~~~~~pVA~fk~Hk~p  349 (440)
T KOG0302|consen  295 ------------------AVS-TKAHNSDVNVI------SWNRREPLLASGGDDGTLSIWDLRQFKSGQPVATFKYHKAP  349 (440)
T ss_pred             ------------------eeE-eeccCCceeeE------EccCCcceeeecCCCceEEEEEhhhccCCCcceeEEeccCC
Confidence                              011 12222222222      1333445899999999999999975   4578899999999


Q ss_pred             eEEEEECCCC-CeEEEEeCCCCEEEeecCCC
Q 022074          255 VRDCSWHPSQ-PMLVSSSWDGDVVRWEFPGN  284 (303)
Q Consensus       255 I~~v~~sp~~-~~las~s~Dg~i~~Wd~~~~  284 (303)
                      |+++.|+|.. ..|+++|+|..|.+||+...
T Consensus       350 ItsieW~p~e~s~iaasg~D~QitiWDlsvE  380 (440)
T KOG0302|consen  350 ITSIEWHPHEDSVIAASGEDNQITIWDLSVE  380 (440)
T ss_pred             eeEEEeccccCceEEeccCCCcEEEEEeecc
Confidence            9999999975 57899999999999998654


No 102
>KOG0321 consensus WD40 repeat-containing protein L2DTL [Function unknown]
Probab=99.87  E-value=5.3e-21  Score=171.93  Aligned_cols=246  Identities=25%  Similarity=0.321  Sum_probs=167.1

Q ss_pred             cCCCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEE--EecccCCeEEEEEccCCCcEEEEecCCCeEEEEcC
Q 022074           34 DDGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLR--ILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDR  111 (303)
Q Consensus        34 ~~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~--~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~  111 (303)
                      ....|.-+|+.+.|-|....|++++.|.++++||+++.++...  ..+|..-|..+||.+.+...|++|+.||.+.|||.
T Consensus        95 ~~~aH~nAifDl~wapge~~lVsasGDsT~r~Wdvk~s~l~G~~~~~GH~~SvkS~cf~~~n~~vF~tGgRDg~illWD~  174 (720)
T KOG0321|consen   95 KPLAHKNAIFDLKWAPGESLLVSASGDSTIRPWDVKTSRLVGGRLNLGHTGSVKSECFMPTNPAVFCTGGRDGEILLWDC  174 (720)
T ss_pred             ccccccceeEeeccCCCceeEEEccCCceeeeeeeccceeecceeecccccccchhhhccCCCcceeeccCCCcEEEEEE
Confidence            4468999999999999777899999999999999999887655  78999999999999888899999999999999998


Q ss_pred             ccccCC--------------C--ccc-------eeecccccCeEE---EEeCCCCCEEEEEeC-CCcEEEEEcccccCCc
Q 022074          112 RCLNVK--------------G--KPA-------GVLMGHLEGITF---IDSRGDGRYLISNGK-DQAIKLWDIRKMSSNA  164 (303)
Q Consensus       112 ~~~~~~--------------~--~~~-------~~~~~h~~~v~~---~~~~~~~~~l~s~~~-D~~v~lWdl~~~~~~~  164 (303)
                      ++....              .  .+.       .....|...+..   +.+..|...|+++|. |+.|++||+|+.....
T Consensus       175 R~n~~d~~e~~~~~~~~~~n~~ptpskp~~kr~~k~kA~s~ti~ssvTvv~fkDe~tlaSaga~D~~iKVWDLRk~~~~~  254 (720)
T KOG0321|consen  175 RCNGVDALEEFDNRIYGRHNTAPTPSKPLKKRIRKWKAASNTIFSSVTVVLFKDESTLASAGAADSTIKVWDLRKNYTAY  254 (720)
T ss_pred             eccchhhHHHHhhhhhccccCCCCCCchhhccccccccccCceeeeeEEEEEeccceeeeccCCCcceEEEeeccccccc
Confidence            753210              0  000       011123333333   335567788998887 9999999999753321


Q ss_pred             ccc----cCccce---eeeceeeeCCCCCccc-cCCCCCcceEEe-------------cccceeeeEEEeeeeeeeCCCe
Q 022074          165 SCN----LGFRSY---EWDYRWMDYPPQARDL-KHPCDQSVATYK-------------GHSVLRTLIRCHFSPVYSTGQK  223 (303)
Q Consensus       165 ~~~----~~~~~~---~~~~~~~~~~~~~~~~-~~~~~~~~~~~~-------------~~~~~~~~~~~~~~~~~s~~~~  223 (303)
                      ...    ..+...   ...+..+.....+..+ +.+.+..|..++             |+.......+    -..++++.
T Consensus       255 r~ep~~~~~~~t~skrs~G~~nL~lDssGt~L~AsCtD~sIy~ynm~s~s~sP~~~~sg~~~~sf~vk----s~lSpd~~  330 (720)
T KOG0321|consen  255 RQEPRGSDKYPTHSKRSVGQVNLILDSSGTYLFASCTDNSIYFYNMRSLSISPVAEFSGKLNSSFYVK----SELSPDDC  330 (720)
T ss_pred             ccCCCcccCccCcccceeeeEEEEecCCCCeEEEEecCCcEEEEeccccCcCchhhccCcccceeeee----eecCCCCc
Confidence            110    000000   0011111111112222 222233343332             2221111111    12468999


Q ss_pred             EEEEEeCCCeEEEEECCCCeE-EEEeecCCCCeEEEEECCCCC-eEEEEeCCCCEEEeecCC
Q 022074          224 YIYTGSHDSCVYVYDLVSGEQ-VAALKYHTSPVRDCSWHPSQP-MLVSSSWDGDVVRWEFPG  283 (303)
Q Consensus       224 ~latg~~dg~i~iwd~~~~~~-~~~~~~h~~~I~~v~~sp~~~-~las~s~Dg~i~~Wd~~~  283 (303)
                      ++++|+.|...++|.+.+-+. ...+.+|...|++++|.|..- -++|+++|..+++|++..
T Consensus       331 ~l~SgSsd~~ayiw~vs~~e~~~~~l~Ght~eVt~V~w~pS~~t~v~TcSdD~~~kiW~l~~  392 (720)
T KOG0321|consen  331 SLLSGSSDEQAYIWVVSSPEAPPALLLGHTREVTTVRWLPSATTPVATCSDDFRVKIWRLSN  392 (720)
T ss_pred             eEeccCCCcceeeeeecCccCChhhhhCcceEEEEEeeccccCCCceeeccCcceEEEeccC
Confidence            999999999999999988765 566789999999999998653 477779999999999843


No 103
>KOG0267 consensus Microtubule severing protein katanin p80 subunit B (contains WD40 repeats) [Cell cycle control, cell division, chromosome partitioning]
Probab=99.87  E-value=2.1e-22  Score=183.12  Aligned_cols=195  Identities=30%  Similarity=0.414  Sum_probs=161.9

Q ss_pred             CCCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccc
Q 022074           35 DGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCL  114 (303)
Q Consensus        35 ~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~  114 (303)
                      ..||+.+|.++.|++....|++|+.+|+|++||+..++....+.+|...+..+.|+| .+.++++|+.|+.+++||.+  
T Consensus        66 ~~~hespIeSl~f~~~E~LlaagsasgtiK~wDleeAk~vrtLtgh~~~~~sv~f~P-~~~~~a~gStdtd~~iwD~R--  142 (825)
T KOG0267|consen   66 LTGHESPIESLTFDTSERLLAAGSASGTIKVWDLEEAKIVRTLTGHLLNITSVDFHP-YGEFFASGSTDTDLKIWDIR--  142 (825)
T ss_pred             eeccCCcceeeecCcchhhhcccccCCceeeeehhhhhhhhhhhccccCcceeeecc-ceEEeccccccccceehhhh--
Confidence            389999999999999999999999999999999999998889999999999999964 68899999999999999987  


Q ss_pred             cCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCC
Q 022074          115 NVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCD  194 (303)
Q Consensus       115 ~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  194 (303)
                        .......+.+|...|..+.++|+|.++++++.|.++++||++..+....    |...+                    
T Consensus       143 --k~Gc~~~~~s~~~vv~~l~lsP~Gr~v~~g~ed~tvki~d~~agk~~~e----f~~~e--------------------  196 (825)
T KOG0267|consen  143 --KKGCSHTYKSHTRVVDVLRLSPDGRWVASGGEDNTVKIWDLTAGKLSKE----FKSHE--------------------  196 (825)
T ss_pred             --ccCceeeecCCcceeEEEeecCCCceeeccCCcceeeeecccccccccc----ccccc--------------------
Confidence              2334667888999999999999999999999999999999975432111    10000                    


Q ss_pred             CcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEECCCCCeEEEEeCCC
Q 022074          195 QSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQPMLVSSSWDG  274 (303)
Q Consensus       195 ~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg  274 (303)
                      ..+.                .+.|+|..-++++||.|+++++||+++.+.+...+.-...|.+.+|+|++..+++|..+.
T Consensus       197 ~~v~----------------sle~hp~e~Lla~Gs~d~tv~f~dletfe~I~s~~~~~~~v~~~~fn~~~~~~~~G~q~s  260 (825)
T KOG0267|consen  197 GKVQ----------------SLEFHPLEVLLAPGSSDRTVRFWDLETFEVISSGKPETDGVRSLAFNPDGKIVLSGEQIS  260 (825)
T ss_pred             cccc----------------ccccCchhhhhccCCCCceeeeeccceeEEeeccCCccCCceeeeecCCceeeecCchhh
Confidence            0111                122445567899999999999999999998888777778999999999999999887653


No 104
>KOG0269 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.87  E-value=1.7e-21  Score=178.36  Aligned_cols=222  Identities=21%  Similarity=0.294  Sum_probs=166.9

Q ss_pred             cccccccccc-CcCcccccCCCcccceEEEEEcCC-CCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCC
Q 022074           17 SLANVTEIHD-GLDFSAADDGGYSFGIFSLKFSTD-GRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESG   94 (303)
Q Consensus        17 ~~~~~~~~~~-~~~~~~~~~~~~~~~v~~l~~s~~-g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~   94 (303)
                      ..|+||++=. +.+.+-.+.+-|+..+.+++|++. ..+|++||+||+|++||++..+-...+.+....|..|.|+|..+
T Consensus       110 G~i~vWdlnk~~rnk~l~~f~EH~Rs~~~ldfh~tep~iliSGSQDg~vK~~DlR~~~S~~t~~~nSESiRDV~fsp~~~  189 (839)
T KOG0269|consen  110 GVISVWDLNKSIRNKLLTVFNEHERSANKLDFHSTEPNILISGSQDGTVKCWDLRSKKSKSTFRSNSESIRDVKFSPGYG  189 (839)
T ss_pred             CcEEEEecCccccchhhhHhhhhccceeeeeeccCCccEEEecCCCceEEEEeeecccccccccccchhhhceeeccCCC
Confidence            3457888722 112222355789999999999975 55899999999999999998876667777778899999998888


Q ss_pred             cEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCcccee
Q 022074           95 HLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYE  174 (303)
Q Consensus        95 ~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~  174 (303)
                      +.|+++..+|.+.+||+|   +..+....+..|.+.|.++.++|++.+|||||+|+.|+|||+.........        
T Consensus       190 ~~F~s~~dsG~lqlWDlR---qp~r~~~k~~AH~GpV~c~nwhPnr~~lATGGRDK~vkiWd~t~~~~~~~~--------  258 (839)
T KOG0269|consen  190 NKFASIHDSGYLQLWDLR---QPDRCEKKLTAHNGPVLCLNWHPNREWLATGGRDKMVKIWDMTDSRAKPKH--------  258 (839)
T ss_pred             ceEEEecCCceEEEeecc---CchhHHHHhhcccCceEEEeecCCCceeeecCCCccEEEEeccCCCcccee--------
Confidence            999999999999999998   344555567889999999999999999999999999999998642211100        


Q ss_pred             eeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEe--CCCeEEEEECCCCe-EEEEeecC
Q 022074          175 WDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGS--HDSCVYVYDLVSGE-QVAALKYH  251 (303)
Q Consensus       175 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~--~dg~i~iwd~~~~~-~~~~~~~h  251 (303)
                                           .+.  +.    ..+-+..|.|..+   .+||+++  .|..|+|||++-.- +..++..|
T Consensus       259 ---------------------tIn--Ti----apv~rVkWRP~~~---~hLAtcsmv~dtsV~VWDvrRPYIP~~t~~eH  308 (839)
T KOG0269|consen  259 ---------------------TIN--TI----APVGRVKWRPARS---YHLATCSMVVDTSVHVWDVRRPYIPYATFLEH  308 (839)
T ss_pred             ---------------------EEe--ec----ceeeeeeeccCcc---chhhhhhccccceEEEEeeccccccceeeecc
Confidence                                 000  00    1122334445432   4577775  58899999996432 36678899


Q ss_pred             CCCeEEEEECCCC-CeEEEEeCCCCEEEe
Q 022074          252 TSPVRDCSWHPSQ-PMLVSSSWDGDVVRW  279 (303)
Q Consensus       252 ~~~I~~v~~sp~~-~~las~s~Dg~i~~W  279 (303)
                      ..-++.++|-... -.|.+++-|+++..-
T Consensus       309 ~~~vt~i~W~~~d~~~l~s~sKD~tv~qh  337 (839)
T KOG0269|consen  309 TDSVTGIAWDSGDRINLWSCSKDGTVLQH  337 (839)
T ss_pred             CccccceeccCCCceeeEeecCccHHHHh
Confidence            9999999997643 478899999887644


No 105
>KOG0293 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.87  E-value=1.5e-20  Score=161.41  Aligned_cols=202  Identities=14%  Similarity=0.247  Sum_probs=160.0

Q ss_pred             CCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCce---EEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCc
Q 022074           36 GGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKL---SLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRR  112 (303)
Q Consensus        36 ~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~---~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~  112 (303)
                      ..|+-.|.-+.||++|++||++|.|.+..+|++.....   ..++.+|..+|..+.|+|+ +++|++|+.|..+++||..
T Consensus       221 ~~htdEVWfl~FS~nGkyLAsaSkD~Taiiw~v~~d~~~kl~~tlvgh~~~V~yi~wSPD-dryLlaCg~~e~~~lwDv~  299 (519)
T KOG0293|consen  221 QDHTDEVWFLQFSHNGKYLASASKDSTAIIWIVVYDVHFKLKKTLVGHSQPVSYIMWSPD-DRYLLACGFDEVLSLWDVD  299 (519)
T ss_pred             hhCCCcEEEEEEcCCCeeEeeccCCceEEEEEEecCcceeeeeeeecccCceEEEEECCC-CCeEEecCchHheeeccCC
Confidence            58999999999999999999999999999998876554   5678899999999999875 6788889988899999986


Q ss_pred             cccCCCccceee-cccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccC
Q 022074          113 CLNVKGKPAGVL-MGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKH  191 (303)
Q Consensus       113 ~~~~~~~~~~~~-~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  191 (303)
                      .+.    ....+ .+|..++.+++|.|||..+++|+.|+++..||+..-.                              
T Consensus       300 tgd----~~~~y~~~~~~S~~sc~W~pDg~~~V~Gs~dr~i~~wdlDgn~------------------------------  345 (519)
T KOG0293|consen  300 TGD----LRHLYPSGLGFSVSSCAWCPDGFRFVTGSPDRTIIMWDLDGNI------------------------------  345 (519)
T ss_pred             cch----hhhhcccCcCCCcceeEEccCCceeEecCCCCcEEEecCCcch------------------------------
Confidence            332    12122 2356789999999999999999999999999986311                              


Q ss_pred             CCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEECCCCCeEEEEe
Q 022074          192 PCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQPMLVSSS  271 (303)
Q Consensus       192 ~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~las~s  271 (303)
                           ...+.|...     .+....+.++||+++++.+.|..|++++.++....+... -+.+|++...|.|++++...=
T Consensus       346 -----~~~W~gvr~-----~~v~dlait~Dgk~vl~v~~d~~i~l~~~e~~~dr~lis-e~~~its~~iS~d~k~~LvnL  414 (519)
T KOG0293|consen  346 -----LGNWEGVRD-----PKVHDLAITYDGKYVLLVTVDKKIRLYNREARVDRGLIS-EEQPITSFSISKDGKLALVNL  414 (519)
T ss_pred             -----hhccccccc-----ceeEEEEEcCCCcEEEEEecccceeeechhhhhhhcccc-ccCceeEEEEcCCCcEEEEEc
Confidence                 111111110     111223456899999999999999999998876665444 345899999999999999999


Q ss_pred             CCCCEEEeecCC
Q 022074          272 WDGDVVRWEFPG  283 (303)
Q Consensus       272 ~Dg~i~~Wd~~~  283 (303)
                      .+.++++||++.
T Consensus       415 ~~qei~LWDl~e  426 (519)
T KOG0293|consen  415 QDQEIHLWDLEE  426 (519)
T ss_pred             ccCeeEEeecch
Confidence            999999999873


No 106
>KOG2096 consensus WD40 repeat protein [General function prediction only]
Probab=99.87  E-value=2.4e-20  Score=155.61  Aligned_cols=207  Identities=23%  Similarity=0.309  Sum_probs=149.3

Q ss_pred             CCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceE----EEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcC
Q 022074           36 GGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLS----LRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDR  111 (303)
Q Consensus        36 ~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~----~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~  111 (303)
                      -||...|.+++|+.||++|++++.|++||||+++.-...    .+..-.-+.-+.+.|.|++...+++.....++++|..
T Consensus        83 KgH~~~vt~~~FsSdGK~lat~~~Dr~Ir~w~~~DF~~~eHr~~R~nve~dhpT~V~FapDc~s~vv~~~~g~~l~vyk~  162 (420)
T KOG2096|consen   83 KGHKKEVTDVAFSSDGKKLATISGDRSIRLWDVRDFENKEHRCIRQNVEYDHPTRVVFAPDCKSVVVSVKRGNKLCVYKL  162 (420)
T ss_pred             hccCCceeeeEEcCCCceeEEEeCCceEEEEecchhhhhhhhHhhccccCCCceEEEECCCcceEEEEEccCCEEEEEEe
Confidence            499999999999999999999999999999999774321    0111111346788999888888888888889999965


Q ss_pred             ccccCCCc------cce---eecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeC
Q 022074          112 RCLNVKGK------PAG---VLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDY  182 (303)
Q Consensus       112 ~~~~~~~~------~~~---~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~  182 (303)
                      .-.+ .+.      +..   ...-|.-.+-.+-+...+.+|++++.|..|.||+++... ..+.                
T Consensus       163 ~K~~-dG~~~~~~v~~D~~~f~~kh~v~~i~iGiA~~~k~imsas~dt~i~lw~lkGq~-L~~i----------------  224 (420)
T KOG2096|consen  163 VKKT-DGSGSHHFVHIDNLEFERKHQVDIINIGIAGNAKYIMSASLDTKICLWDLKGQL-LQSI----------------  224 (420)
T ss_pred             eecc-cCCCCcccccccccccchhcccceEEEeecCCceEEEEecCCCcEEEEecCCce-eeee----------------
Confidence            2110 110      111   111244445556667788999999999999999997321 1100                


Q ss_pred             CCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECC---CC-----eEEEEeecCCCC
Q 022074          183 PPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLV---SG-----EQVAALKYHTSP  254 (303)
Q Consensus       183 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~---~~-----~~~~~~~~h~~~  254 (303)
                                     .+   .+.      .......||+|+++++++-.-.+++|.+-   .|     +.+..+++|+..
T Consensus       225 ---------------dt---nq~------~n~~aavSP~GRFia~~gFTpDVkVwE~~f~kdG~fqev~rvf~LkGH~sa  280 (420)
T KOG2096|consen  225 ---------------DT---NQS------SNYDAAVSPDGRFIAVSGFTPDVKVWEPIFTKDGTFQEVKRVFSLKGHQSA  280 (420)
T ss_pred             ---------------cc---ccc------cccceeeCCCCcEEEEecCCCCceEEEEEeccCcchhhhhhhheeccchhh
Confidence                           00   000      00112368999999999999999999863   33     245678999999


Q ss_pred             eEEEEECCCCCeEEEEeCCCCEEEeecCCC
Q 022074          255 VRDCSWHPSQPMLVSSSWDGDVVRWEFPGN  284 (303)
Q Consensus       255 I~~v~~sp~~~~las~s~Dg~i~~Wd~~~~  284 (303)
                      |..++||++.+.++|+|-||+.++||..--
T Consensus       281 V~~~aFsn~S~r~vtvSkDG~wriwdtdVr  310 (420)
T KOG2096|consen  281 VLAAAFSNSSTRAVTVSKDGKWRIWDTDVR  310 (420)
T ss_pred             eeeeeeCCCcceeEEEecCCcEEEeeccce
Confidence            999999999999999999999999997644


No 107
>KOG0306 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=99.86  E-value=4.7e-20  Score=168.58  Aligned_cols=214  Identities=21%  Similarity=0.249  Sum_probs=162.6

Q ss_pred             ccCCCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCc
Q 022074           33 ADDGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRR  112 (303)
Q Consensus        33 ~~~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~  112 (303)
                      ++-+||+.-|.++++|.+...+++|+. +.+++|+.++.+...++..  +-+-+..|.| .++++++|..+|.+-+||+.
T Consensus       367 i~~~GHR~dVRsl~vS~d~~~~~Sga~-~SikiWn~~t~kciRTi~~--~y~l~~~Fvp-gd~~Iv~G~k~Gel~vfdla  442 (888)
T KOG0306|consen  367 IEIGGHRSDVRSLCVSSDSILLASGAG-ESIKIWNRDTLKCIRTITC--GYILASKFVP-GDRYIVLGTKNGELQVFDLA  442 (888)
T ss_pred             eeeccchhheeEEEeecCceeeeecCC-CcEEEEEccCcceeEEecc--ccEEEEEecC-CCceEEEeccCCceEEEEee
Confidence            455899999999999988877777654 4699999999887665543  2566777864 57899999999999999986


Q ss_pred             cccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCC
Q 022074          113 CLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHP  192 (303)
Q Consensus       113 ~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  192 (303)
                      +.    ........|.++++.++..||+..++|||.|++|++||.........                    +      
T Consensus       443 S~----~l~Eti~AHdgaIWsi~~~pD~~g~vT~saDktVkfWdf~l~~~~~g--------------------t------  492 (888)
T KOG0306|consen  443 SA----SLVETIRAHDGAIWSISLSPDNKGFVTGSADKTVKFWDFKLVVSVPG--------------------T------  492 (888)
T ss_pred             hh----hhhhhhhccccceeeeeecCCCCceEEecCCcEEEEEeEEEEeccCc--------------------c------
Confidence            32    22334557999999999999999999999999999999764321000                    0      


Q ss_pred             CCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEECCCCCeEEEEeC
Q 022074          193 CDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQPMLVSSSW  272 (303)
Q Consensus       193 ~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~las~s~  272 (303)
                       ..++..+..... ..+..-..+..+|||+++||.+--|.+++||-+++.+..-.+=+|.-||.++..|||.++++|||.
T Consensus       493 -~~k~lsl~~~rt-Lel~ddvL~v~~Spdgk~LaVsLLdnTVkVyflDtlKFflsLYGHkLPV~smDIS~DSklivTgSA  570 (888)
T KOG0306|consen  493 -QKKVLSLKHTRT-LELEDDVLCVSVSPDGKLLAVSLLDNTVKVYFLDTLKFFLSLYGHKLPVLSMDISPDSKLIVTGSA  570 (888)
T ss_pred             -cceeeeeccceE-EeccccEEEEEEcCCCcEEEEEeccCeEEEEEecceeeeeeecccccceeEEeccCCcCeEEeccC
Confidence             000001100000 001111123457899999999999999999999999887777799999999999999999999999


Q ss_pred             CCCEEEeecC
Q 022074          273 DGDVVRWEFP  282 (303)
Q Consensus       273 Dg~i~~Wd~~  282 (303)
                      |.++++|-++
T Consensus       571 DKnVKiWGLd  580 (888)
T KOG0306|consen  571 DKNVKIWGLD  580 (888)
T ss_pred             CCceEEeccc
Confidence            9999999764


No 108
>KOG0269 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.86  E-value=2.3e-21  Score=177.51  Aligned_cols=208  Identities=21%  Similarity=0.346  Sum_probs=155.5

Q ss_pred             eEEEEEc-CCCCEEEEeeCCCeEEEEECCC---CceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCC
Q 022074           42 IFSLKFS-TDGRELVAGSSDDCIYVYDLEA---NKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVK  117 (303)
Q Consensus        42 v~~l~~s-~~g~~l~sgs~Dg~v~lwd~~~---~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~  117 (303)
                      ...+.|+ -+.+.|++++..|.|.+||+..   .++...+..|...++++.|++-.+++|+|||.||+|++||+|..   
T Consensus        90 ~~DVkW~~~~~NlIAT~s~nG~i~vWdlnk~~rnk~l~~f~EH~Rs~~~ldfh~tep~iliSGSQDg~vK~~DlR~~---  166 (839)
T KOG0269|consen   90 AADVKWGQLYSNLIATCSTNGVISVWDLNKSIRNKLLTVFNEHERSANKLDFHSTEPNILISGSQDGTVKCWDLRSK---  166 (839)
T ss_pred             hhhcccccchhhhheeecCCCcEEEEecCccccchhhhHhhhhccceeeeeeccCCccEEEecCCCceEEEEeeecc---
Confidence            3446666 3577899999999999999977   34444567899999999998878899999999999999999832   


Q ss_pred             CccceeecccccCeEEEEeCC-CCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCc
Q 022074          118 GKPAGVLMGHLEGITFIDSRG-DGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQS  196 (303)
Q Consensus       118 ~~~~~~~~~h~~~v~~~~~~~-~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  196 (303)
                       ....++.+..++|..+.|+| .+.+|+++...|.+++||+|....                                 +
T Consensus       167 -~S~~t~~~nSESiRDV~fsp~~~~~F~s~~dsG~lqlWDlRqp~r---------------------------------~  212 (839)
T KOG0269|consen  167 -KSKSTFRSNSESIRDVKFSPGYGNKFASIHDSGYLQLWDLRQPDR---------------------------------C  212 (839)
T ss_pred             -cccccccccchhhhceeeccCCCceEEEecCCceEEEeeccCchh---------------------------------H
Confidence             33445667888999999987 477899999999999999996432                                 1


Q ss_pred             ceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeE--EEEeecCCCCeEEEEECCCCC-eEEEEeC-
Q 022074          197 VATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQ--VAALKYHTSPVRDCSWHPSQP-MLVSSSW-  272 (303)
Q Consensus       197 ~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~--~~~~~~h~~~I~~v~~sp~~~-~las~s~-  272 (303)
                      ...+..|....      ++..++|++.+|||||-|+.|+|||..+.+.  +.++ ....||..|+|=|..+ .|||++. 
T Consensus       213 ~~k~~AH~GpV------~c~nwhPnr~~lATGGRDK~vkiWd~t~~~~~~~~tI-nTiapv~rVkWRP~~~~hLAtcsmv  285 (839)
T KOG0269|consen  213 EKKLTAHNGPV------LCLNWHPNREWLATGGRDKMVKIWDMTDSRAKPKHTI-NTIAPVGRVKWRPARSYHLATCSMV  285 (839)
T ss_pred             HHHhhcccCce------EEEeecCCCceeeecCCCccEEEEeccCCCccceeEE-eecceeeeeeeccCccchhhhhhcc
Confidence            11122222111      1223678899999999999999999986543  3333 2457999999999877 4777664 


Q ss_pred             -CCCEEEeecCCCCccCCCCcc
Q 022074          273 -DGDVVRWEFPGNGEAAPPLNK  293 (303)
Q Consensus       273 -Dg~i~~Wd~~~~~~~~~~~~~  293 (303)
                       |-.|++||+.-++-.-.-+.+
T Consensus       286 ~dtsV~VWDvrRPYIP~~t~~e  307 (839)
T KOG0269|consen  286 VDTSVHVWDVRRPYIPYATFLE  307 (839)
T ss_pred             ccceEEEEeeccccccceeeec
Confidence             889999998765444333333


No 109
>KOG0302 consensus Ribosome Assembly protein [General function prediction only]
Probab=99.86  E-value=1.6e-20  Score=159.63  Aligned_cols=205  Identities=25%  Similarity=0.356  Sum_probs=157.1

Q ss_pred             CCcccceEEEEEcCC-CCEEEEeeCCCeEEEEECCCCceE---EEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcC
Q 022074           36 GGYSFGIFSLKFSTD-GRELVAGSSDDCIYVYDLEANKLS---LRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDR  111 (303)
Q Consensus        36 ~~~~~~v~~l~~s~~-g~~l~sgs~Dg~v~lwd~~~~~~~---~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~  111 (303)
                      +||...=++|+|||- -..+++|..-+.|++|...++...   ..+.+|+..|..++|+|....+|+|||-||+|+|||+
T Consensus       208 ~ghk~EGy~LdWSp~~~g~LlsGDc~~~I~lw~~~~g~W~vd~~Pf~gH~~SVEDLqWSptE~~vfaScS~DgsIrIWDi  287 (440)
T KOG0302|consen  208 NGHKGEGYGLDWSPIKTGRLLSGDCVKGIHLWEPSTGSWKVDQRPFTGHTKSVEDLQWSPTEDGVFASCSCDGSIRIWDI  287 (440)
T ss_pred             cccCccceeeecccccccccccCccccceEeeeeccCceeecCccccccccchhhhccCCccCceEEeeecCceEEEEEe
Confidence            599999999999983 225888888889999999887643   3467899999999999888889999999999999999


Q ss_pred             ccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccC
Q 022074          112 RCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKH  191 (303)
Q Consensus       112 ~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  191 (303)
                      |...  .++.-....|...|+.++|+..-.+|++|+.||+++|||||..+.                             
T Consensus       288 Rs~~--~~~~~~~kAh~sDVNVISWnr~~~lLasG~DdGt~~iwDLR~~~~-----------------------------  336 (440)
T KOG0302|consen  288 RSGP--KKAAVSTKAHNSDVNVISWNRREPLLASGGDDGTLSIWDLRQFKS-----------------------------  336 (440)
T ss_pred             cCCC--ccceeEeeccCCceeeEEccCCcceeeecCCCceEEEEEhhhccC-----------------------------
Confidence            8542  223334467989999999999888999999999999999996432                             


Q ss_pred             CCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCe-------E-EE--------EeecC--CC
Q 022074          192 PCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGE-------Q-VA--------ALKYH--TS  253 (303)
Q Consensus       192 ~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~-------~-~~--------~~~~h--~~  253 (303)
                        .+.++.++-|...++.+.++.     .+...|+++|+|..|.+||+....       . ..        -+-.|  +.
T Consensus       337 --~~pVA~fk~Hk~pItsieW~p-----~e~s~iaasg~D~QitiWDlsvE~D~ee~~~~a~~~L~dlPpQLLFVHqGQk  409 (440)
T KOG0302|consen  337 --GQPVATFKYHKAPITSIEWHP-----HEDSVIAASGEDNQITIWDLSVEADEEEIDQEAAEGLQDLPPQLLFVHQGQK  409 (440)
T ss_pred             --CCcceeEEeccCCeeEEEecc-----ccCceEEeccCCCcEEEEEeeccCChhhhccccccchhcCCceeEEEecchh
Confidence              235667777766666555432     245678999999999999985321       0 00        11234  35


Q ss_pred             CeEEEEECCCCC-eEEEEeCCCCEEE
Q 022074          254 PVRDCSWHPSQP-MLVSSSWDGDVVR  278 (303)
Q Consensus       254 ~I~~v~~sp~~~-~las~s~Dg~i~~  278 (303)
                      .+..+.|+++-+ +|+|.+.||--.+
T Consensus       410 e~KevhWH~QiPG~lvsTa~dGfnVf  435 (440)
T KOG0302|consen  410 EVKEVHWHRQIPGLLVSTAIDGFNVF  435 (440)
T ss_pred             HhhhheeccCCCCeEEEecccceeEE
Confidence            799999999976 8888888875443


No 110
>KOG0640 consensus mRNA cleavage stimulating factor complex; subunit 1 [RNA processing and modification]
Probab=99.86  E-value=1.8e-20  Score=155.94  Aligned_cols=207  Identities=22%  Similarity=0.321  Sum_probs=148.1

Q ss_pred             CCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCce--EEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCcc
Q 022074           36 GGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKL--SLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRC  113 (303)
Q Consensus        36 ~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~--~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~  113 (303)
                      +-|.-.|.++.|+|...+|++|+.|++|++||......  ..+......+|.++.|+| .++.++.|..-.++++||+..
T Consensus       169 YDH~devn~l~FHPre~ILiS~srD~tvKlFDfsK~saKrA~K~~qd~~~vrsiSfHP-sGefllvgTdHp~~rlYdv~T  247 (430)
T KOG0640|consen  169 YDHVDEVNDLDFHPRETILISGSRDNTVKLFDFSKTSAKRAFKVFQDTEPVRSISFHP-SGEFLLVGTDHPTLRLYDVNT  247 (430)
T ss_pred             hhccCcccceeecchhheEEeccCCCeEEEEecccHHHHHHHHHhhccceeeeEeecC-CCceEEEecCCCceeEEeccc
Confidence            46788999999999999999999999999999865322  223344556899999964 689999999999999999753


Q ss_pred             ccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCC
Q 022074          114 LNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPC  193 (303)
Q Consensus       114 ~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  193 (303)
                      ...-. ..-.-.+|.++|+++..++.+++.+|++.||.|+|||=-.-++..+....                        
T Consensus       248 ~Qcfv-sanPd~qht~ai~~V~Ys~t~~lYvTaSkDG~IklwDGVS~rCv~t~~~A------------------------  302 (430)
T KOG0640|consen  248 YQCFV-SANPDDQHTGAITQVRYSSTGSLYVTASKDGAIKLWDGVSNRCVRTIGNA------------------------  302 (430)
T ss_pred             eeEee-ecCcccccccceeEEEecCCccEEEEeccCCcEEeeccccHHHHHHHHhh------------------------
Confidence            21100 01123479999999999999999999999999999994322111111000                        


Q ss_pred             CCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEe-------------------------
Q 022074          194 DQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAAL-------------------------  248 (303)
Q Consensus       194 ~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~-------------------------  248 (303)
                               |...     ...+..|+.+++|+++.|.|..+++|.+.++.++.++                         
T Consensus       303 ---------H~gs-----evcSa~Ftkn~kyiLsSG~DS~vkLWEi~t~R~l~~YtGAg~tgrq~~rtqAvFNhtEdyVl  368 (430)
T KOG0640|consen  303 ---------HGGS-----EVCSAVFTKNGKYILSSGKDSTVKLWEISTGRMLKEYTGAGTTGRQKHRTQAVFNHTEDYVL  368 (430)
T ss_pred             ---------cCCc-----eeeeEEEccCCeEEeecCCcceeeeeeecCCceEEEEecCCcccchhhhhhhhhcCccceEE
Confidence                     0000     0112345556666666666666666666555333222                         


Q ss_pred             ------------------------ecCCCCeEEEEECCCCCeEEEEeCCCCEEEeecC
Q 022074          249 ------------------------KYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFP  282 (303)
Q Consensus       249 ------------------------~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~  282 (303)
                                              -+|++++..+.-||.++-+.|+|.|..+++|--.
T Consensus       369 ~pDEas~slcsWdaRtadr~~l~slgHn~a~R~i~HSP~~p~FmTcsdD~raRFWyrr  426 (430)
T KOG0640|consen  369 FPDEASNSLCSWDARTADRVALLSLGHNGAVRWIVHSPVEPAFMTCSDDFRARFWYRR  426 (430)
T ss_pred             ccccccCceeeccccchhhhhhcccCCCCCceEEEeCCCCCceeeecccceeeeeeec
Confidence                                    1488999999999999999999999999999743


No 111
>KOG0288 consensus WD40 repeat protein TipD [General function prediction only]
Probab=99.86  E-value=5.7e-21  Score=163.96  Aligned_cols=237  Identities=19%  Similarity=0.234  Sum_probs=167.6

Q ss_pred             CcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCc--eEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccc
Q 022074           37 GYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANK--LSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCL  114 (303)
Q Consensus        37 ~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~--~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~  114 (303)
                      .|+..+..+.|-++...|++|+.|..|.+|+....+  ....+.+..+.++.+.|.+ .++.+++++.|+.+++|+..  
T Consensus       173 ~h~gev~~v~~l~~sdtlatgg~Dr~Ik~W~v~~~k~~~~~tLaGs~g~it~~d~d~-~~~~~iAas~d~~~r~Wnvd--  249 (459)
T KOG0288|consen  173 AHEGEVHDVEFLRNSDTLATGGSDRIIKLWNVLGEKSELISTLAGSLGNITSIDFDS-DNKHVIAASNDKNLRLWNVD--  249 (459)
T ss_pred             ccccccceeEEccCcchhhhcchhhhhhhhhcccchhhhhhhhhccCCCcceeeecC-CCceEEeecCCCceeeeecc--
Confidence            799999999999999999999999999999998776  3445667778899999965 47788899999999999975  


Q ss_pred             cCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCC
Q 022074          115 NVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCD  194 (303)
Q Consensus       115 ~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  194 (303)
                        ..+....+.||.+.|+++.+......+++|+.|+++++||+.+..+..+..  +.+...++..-    ....+..-.+
T Consensus       250 --~~r~~~TLsGHtdkVt~ak~~~~~~~vVsgs~DRtiK~WDl~k~~C~kt~l--~~S~cnDI~~~----~~~~~SgH~D  321 (459)
T KOG0288|consen  250 --SLRLRHTLSGHTDKVTAAKFKLSHSRVVSGSADRTIKLWDLQKAYCSKTVL--PGSQCNDIVCS----ISDVISGHFD  321 (459)
T ss_pred             --chhhhhhhcccccceeeehhhccccceeeccccchhhhhhhhhhheecccc--ccccccceEec----ceeeeecccc
Confidence              334566789999999999998776669999999999999998743221110  00000000000    0000000012


Q ss_pred             CcceEEeccccee--ee--EEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecC----CCCeEEEEECCCCCe
Q 022074          195 QSVATYKGHSVLR--TL--IRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYH----TSPVRDCSWHPSQPM  266 (303)
Q Consensus       195 ~~~~~~~~~~~~~--~~--~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h----~~~I~~v~~sp~~~~  266 (303)
                      .++..++......  .+  -.-..+...++++..+.+.+-|.++.+.|.++.+....+.+.    ....+.+.|||++.|
T Consensus       322 kkvRfwD~Rs~~~~~sv~~gg~vtSl~ls~~g~~lLsssRDdtl~viDlRt~eI~~~~sA~g~k~asDwtrvvfSpd~~Y  401 (459)
T KOG0288|consen  322 KKVRFWDIRSADKTRSVPLGGRVTSLDLSMDGLELLSSSRDDTLKVIDLRTKEIRQTFSAEGFKCASDWTRVVFSPDGSY  401 (459)
T ss_pred             cceEEEeccCCceeeEeecCcceeeEeeccCCeEEeeecCCCceeeeecccccEEEEeeccccccccccceeEECCCCce
Confidence            2233332111100  00  001123446788889999999999999999988776666432    134899999999999


Q ss_pred             EEEEeCCCCEEEeecCCC
Q 022074          267 LVSSSWDGDVVRWEFPGN  284 (303)
Q Consensus       267 las~s~Dg~i~~Wd~~~~  284 (303)
                      +|+||.||.+++|++.+.
T Consensus       402 vaAGS~dgsv~iW~v~tg  419 (459)
T KOG0288|consen  402 VAAGSADGSVYIWSVFTG  419 (459)
T ss_pred             eeeccCCCcEEEEEccCc
Confidence            999999999999998754


No 112
>KOG1034 consensus Transcriptional repressor EED/ESC/FIE, required for transcriptional silencing, WD repeat superfamily [Transcription]
Probab=99.85  E-value=6.9e-20  Score=153.41  Aligned_cols=242  Identities=23%  Similarity=0.378  Sum_probs=162.9

Q ss_pred             CcccceEEEEEcCC----CCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCc
Q 022074           37 GYSFGIFSLKFSTD----GRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRR  112 (303)
Q Consensus        37 ~~~~~v~~l~~s~~----g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~  112 (303)
                      -|+-..+.++|+-+    .-++|+|+.-|.|||.|+..++....+.+|...|+.+.+.|..+++++++|+|.+||+|+++
T Consensus        87 d~~Esfytcsw~yd~~~~~p~la~~G~~GvIrVid~~~~~~~~~~~ghG~sINeik~~p~~~qlvls~SkD~svRlwnI~  166 (385)
T KOG1034|consen   87 DHDESFYTCSWSYDSNTGNPFLAAGGYLGVIRVIDVVSGQCSKNYRGHGGSINEIKFHPDRPQLVLSASKDHSVRLWNIQ  166 (385)
T ss_pred             CCCcceEEEEEEecCCCCCeeEEeecceeEEEEEecchhhhccceeccCccchhhhcCCCCCcEEEEecCCceEEEEecc
Confidence            35556999999964    23788999999999999999999999999999999999999888999999999999999986


Q ss_pred             cccCCCcccee---ecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcc--cccCc----cceeeeceeeeCC
Q 022074          113 CLNVKGKPAGV---LMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNAS--CNLGF----RSYEWDYRWMDYP  183 (303)
Q Consensus       113 ~~~~~~~~~~~---~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~--~~~~~----~~~~~~~~~~~~~  183 (303)
                      .    ...+..   +.||.+.|.+++|+.+|.+|+++|.|.++++|++....-...  +...+    ....+......+|
T Consensus       167 ~----~~Cv~VfGG~egHrdeVLSvD~~~~gd~i~ScGmDhslk~W~l~~~~f~~~lE~s~~~~~~~t~~pfpt~~~~fp  242 (385)
T KOG1034|consen  167 T----DVCVAVFGGVEGHRDEVLSVDFSLDGDRIASCGMDHSLKLWRLNVKEFKNKLELSITYSPNKTTRPFPTPKTHFP  242 (385)
T ss_pred             C----CeEEEEecccccccCcEEEEEEcCCCCeeeccCCcceEEEEecChhHHhhhhhhhcccCCCCccCcCCccccccc
Confidence            2    222322   467999999999999999999999999999999873110000  00000    0000000000111


Q ss_pred             CC-Cc----------------cccCCCCCcceEEecccc------------eeeeEE------E-eeee--eeeCCCeEE
Q 022074          184 PQ-AR----------------DLKHPCDQSVATYKGHSV------------LRTLIR------C-HFSP--VYSTGQKYI  225 (303)
Q Consensus       184 ~~-~~----------------~~~~~~~~~~~~~~~~~~------------~~~~~~------~-~~~~--~~s~~~~~l  225 (303)
                      .- +.                .+.-.|++.+..+.....            ..+++.      | .|-.  .|.+-++.|
T Consensus       243 ~fst~diHrnyVDCvrw~gd~ilSkscenaI~~w~pgkl~e~~~~vkp~es~~Ti~~~~~~~~c~iWfirf~~d~~~~~l  322 (385)
T KOG1034|consen  243 DFSTTDIHRNYVDCVRWFGDFILSKSCENAIVCWKPGKLEESIHNVKPPESATTILGEFDYPMCDIWFIRFAFDPWQKML  322 (385)
T ss_pred             cccccccccchHHHHHHHhhheeecccCceEEEEecchhhhhhhccCCCccceeeeeEeccCccceEEEEEeecHHHHHH
Confidence            00 00                001112222222221000            000000      0 0111  234557889


Q ss_pred             EEEeCCCeEEEEECCCCeEE--EEee--cCCCCeEEEEECCCCCeEEEEeCCCCEEEeecC
Q 022074          226 YTGSHDSCVYVYDLVSGEQV--AALK--YHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFP  282 (303)
Q Consensus       226 atg~~dg~i~iwd~~~~~~~--~~~~--~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~  282 (303)
                      |.|.+.|.+++||+++.++.  .++.  .-...|...+||.|+.+|+..++|+++..||.-
T Consensus       323 a~gnq~g~v~vwdL~~~ep~~~ttl~~s~~~~tVRQ~sfS~dgs~lv~vcdd~~Vwrwdrv  383 (385)
T KOG1034|consen  323 ALGNQSGKVYVWDLDNNEPPKCTTLTHSKSGSTVRQTSFSRDGSILVLVCDDGTVWRWDRV  383 (385)
T ss_pred             hhccCCCcEEEEECCCCCCccCceEEeccccceeeeeeecccCcEEEEEeCCCcEEEEEee
Confidence            99999999999999877652  2222  123579999999999999999999999999953


No 113
>KOG1408 consensus WD40 repeat protein [Function unknown]
Probab=99.85  E-value=2e-20  Score=170.25  Aligned_cols=240  Identities=27%  Similarity=0.359  Sum_probs=174.1

Q ss_pred             CcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCC--CcEEEEecCCCeEEEEcCccc
Q 022074           37 GYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDES--GHLIYSGSDDNLCKVWDRRCL  114 (303)
Q Consensus        37 ~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~--~~~l~s~s~dg~v~lWd~~~~  114 (303)
                      +-+.+|.+++.+|+|++|++|..-|+++|||+..-.....+..|...|-|+.|+.+.  .++|++++.|..|+++|..  
T Consensus       457 d~r~G~R~~~vSp~gqhLAsGDr~GnlrVy~Lq~l~~~~~~eAHesEilcLeyS~p~~~~kLLASasrdRlIHV~Dv~--  534 (1080)
T KOG1408|consen  457 DSRFGFRALAVSPDGQHLASGDRGGNLRVYDLQELEYTCFMEAHESEILCLEYSFPVLTNKLLASASRDRLIHVYDVK--  534 (1080)
T ss_pred             CcccceEEEEECCCcceecccCccCceEEEEehhhhhhhheecccceeEEEeecCchhhhHhhhhccCCceEEEEecc--
Confidence            556799999999999999999999999999998887777888999999999996543  5789999999999999974  


Q ss_pred             cCCCccceeecccccCeEEEEeCCCC--CEEEEEeCCCcEEEEEcccccCCcccccCcccee-eeceeeeCCCCCccccC
Q 022074          115 NVKGKPAGVLMGHLEGITFIDSRGDG--RYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYE-WDYRWMDYPPQARDLKH  191 (303)
Q Consensus       115 ~~~~~~~~~~~~h~~~v~~~~~~~~~--~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~  191 (303)
                       .+--+...+.+|..+|+++.|.-.|  ..++++|.|+.+.+=--++......+........ -.+--++..|..+.+..
T Consensus       535 -rny~l~qtld~HSssITsvKFa~~gln~~MiscGADksimFr~~qk~~~g~~f~r~t~t~~ktTlYDm~Vdp~~k~v~t  613 (1080)
T KOG1408|consen  535 -RNYDLVQTLDGHSSSITSVKFACNGLNRKMISCGADKSIMFRVNQKASSGRLFPRHTQTLSKTTLYDMAVDPTSKLVVT  613 (1080)
T ss_pred             -cccchhhhhcccccceeEEEEeecCCceEEEeccCchhhheehhccccCceeccccccccccceEEEeeeCCCcceEEE
Confidence             2234567788999999999987766  6789999998875432221110000000000000 00001233333333322


Q ss_pred             CC-CCcc-----------eEEecccce-eeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEE
Q 022074          192 PC-DQSV-----------ATYKGHSVL-RTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDC  258 (303)
Q Consensus       192 ~~-~~~~-----------~~~~~~~~~-~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v  258 (303)
                      .| ++.+           ..++|.... -..++..    ..|.|-|+||...|+++.++|..+|+++..+-+|...|+.+
T Consensus       614 ~cQDrnirif~i~sgKq~k~FKgs~~~eG~lIKv~----lDPSgiY~atScsdktl~~~Df~sgEcvA~m~GHsE~VTG~  689 (1080)
T KOG1408|consen  614 VCQDRNIRIFDIESGKQVKSFKGSRDHEGDLIKVI----LDPSGIYLATSCSDKTLCFVDFVSGECVAQMTGHSEAVTGV  689 (1080)
T ss_pred             EecccceEEEeccccceeeeecccccCCCceEEEE----ECCCccEEEEeecCCceEEEEeccchhhhhhcCcchheeee
Confidence            22 2222           233333221 1222222    34678999999999999999999999999999999999999


Q ss_pred             EECCCCCeEEEEeCCCCEEEeecCC
Q 022074          259 SWHPSQPMLVSSSWDGDVVRWEFPG  283 (303)
Q Consensus       259 ~~sp~~~~las~s~Dg~i~~Wd~~~  283 (303)
                      .|++|.+.|++++.||-|-+|.++.
T Consensus       690 kF~nDCkHlISvsgDgCIFvW~lp~  714 (1080)
T KOG1408|consen  690 KFLNDCKHLISVSGDGCIFVWKLPL  714 (1080)
T ss_pred             eecccchhheeecCCceEEEEECch
Confidence            9999999999999999999999875


No 114
>KOG0643 consensus Translation initiation factor 3, subunit i (eIF-3i)/TGF-beta receptor-interacting protein (TRIP-1) [Translation, ribosomal structure and biogenesis; Signal transduction mechanisms]
Probab=99.85  E-value=1.1e-19  Score=148.48  Aligned_cols=198  Identities=19%  Similarity=0.317  Sum_probs=142.0

Q ss_pred             EEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEE
Q 022074           76 RILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLW  155 (303)
Q Consensus        76 ~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lW  155 (303)
                      .+++|..+++.+.|+. ++++|+|+++|.+..+|=    ..+++.++.+.||.++|+++++..+..+++||+.|.+++||
T Consensus         5 ~l~GHERplTqiKyN~-eGDLlFscaKD~~~~vw~----s~nGerlGty~GHtGavW~~Did~~s~~liTGSAD~t~kLW   79 (327)
T KOG0643|consen    5 LLQGHERPLTQIKYNR-EGDLLFSCAKDSTPTVWY----SLNGERLGTYDGHTGAVWCCDIDWDSKHLITGSADQTAKLW   79 (327)
T ss_pred             ccccCccccceEEecC-CCcEEEEecCCCCceEEE----ecCCceeeeecCCCceEEEEEecCCcceeeeccccceeEEE
Confidence            4678999999999975 589999999999999995    24677899999999999999999999999999999999999


Q ss_pred             EcccccCCcccccCccceeeeceeeeCCCCCccccCCCCC------cceEEe---------cccceeeeE---EEeeeee
Q 022074          156 DIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQ------SVATYK---------GHSVLRTLI---RCHFSPV  217 (303)
Q Consensus       156 dl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~---------~~~~~~~~~---~~~~~~~  217 (303)
                      |+...+..+....+..     ++.+.+...........+.      .+..++         .......+.   .-.....
T Consensus        80 Dv~tGk~la~~k~~~~-----Vk~~~F~~~gn~~l~~tD~~mg~~~~v~~fdi~~~~~~~~s~ep~~kI~t~~skit~a~  154 (327)
T KOG0643|consen   80 DVETGKQLATWKTNSP-----VKRVDFSFGGNLILASTDKQMGYTCFVSVFDIRDDSSDIDSEEPYLKIPTPDSKITSAL  154 (327)
T ss_pred             EcCCCcEEEEeecCCe-----eEEEeeccCCcEEEEEehhhcCcceEEEEEEccCChhhhcccCceEEecCCccceeeee
Confidence            9987665444322211     1112222222211111100      000000         000000000   0011233


Q ss_pred             eeCCCeEEEEEeCCCeEEEEECCCCeE-EEEeecCCCCeEEEEECCCCCeEEEEeCCCCEEEeecCC
Q 022074          218 YSTGQKYIYTGSHDSCVYVYDLVSGEQ-VAALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFPG  283 (303)
Q Consensus       218 ~s~~~~~latg~~dg~i~iwd~~~~~~-~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~~  283 (303)
                      ++|-++++++|.+||.|.+||.++|++ +..-+.|...|+++.||||..+++|+|.|.+.++||+..
T Consensus       155 Wg~l~~~ii~Ghe~G~is~~da~~g~~~v~s~~~h~~~Ind~q~s~d~T~FiT~s~Dttakl~D~~t  221 (327)
T KOG0643|consen  155 WGPLGETIIAGHEDGSISIYDARTGKELVDSDEEHSSKINDLQFSRDRTYFITGSKDTTAKLVDVRT  221 (327)
T ss_pred             ecccCCEEEEecCCCcEEEEEcccCceeeechhhhccccccccccCCcceEEecccCccceeeeccc
Confidence            567789999999999999999999866 445578999999999999999999999999999999753


No 115
>KOG0270 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.85  E-value=4.3e-20  Score=159.89  Aligned_cols=213  Identities=17%  Similarity=0.212  Sum_probs=152.6

Q ss_pred             CCcccceEEEEEcCC-CCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccc
Q 022074           36 GGYSFGIFSLKFSTD-GRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCL  114 (303)
Q Consensus        36 ~~~~~~v~~l~~s~~-g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~  114 (303)
                      .||+.+|.+++|+.+ .+.|++||.|.+|.+||+.+++....+..|.+.|.++.|++..+..|++|+.|++|++.|.|..
T Consensus       240 ~gHTdavl~Ls~n~~~~nVLaSgsaD~TV~lWD~~~g~p~~s~~~~~k~Vq~l~wh~~~p~~LLsGs~D~~V~l~D~R~~  319 (463)
T KOG0270|consen  240 SGHTDAVLALSWNRNFRNVLASGSADKTVKLWDVDTGKPKSSITHHGKKVQTLEWHPYEPSVLLSGSYDGTVALKDCRDP  319 (463)
T ss_pred             ccchHHHHHHHhccccceeEEecCCCceEEEEEcCCCCcceehhhcCCceeEEEecCCCceEEEeccccceEEeeeccCc
Confidence            489999999999976 4478999999999999999999887777899999999999888999999999999999998843


Q ss_pred             cCCCccceeecccccCeEEEEeCCCC-CEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCC
Q 022074          115 NVKGKPAGVLMGHLEGITFIDSRGDG-RYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPC  193 (303)
Q Consensus       115 ~~~~~~~~~~~~h~~~v~~~~~~~~~-~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  193 (303)
                      ...+...    ...+.|-.+.|.+.. ..++++..||+|+-+|+|...                                
T Consensus       320 ~~s~~~w----k~~g~VEkv~w~~~se~~f~~~tddG~v~~~D~R~~~--------------------------------  363 (463)
T KOG0270|consen  320 SNSGKEW----KFDGEVEKVAWDPHSENSFFVSTDDGTVYYFDIRNPG--------------------------------  363 (463)
T ss_pred             cccCceE----EeccceEEEEecCCCceeEEEecCCceEEeeecCCCC--------------------------------
Confidence            2222111    123445566666543 468888899999999998642                                


Q ss_pred             CCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeE--EEEeecCCCCeEEEEECCCCC-eEEEE
Q 022074          194 DQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQ--VAALKYHTSPVRDCSWHPSQP-MLVSS  270 (303)
Q Consensus       194 ~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~--~~~~~~h~~~I~~v~~sp~~~-~las~  270 (303)
                       .++.+.+.|...+..+..  ++   ..-.+++|++.|+.+++|++..-+.  ++....-.+...|.++.|+-. +||.|
T Consensus       364 -~~vwt~~AHd~~ISgl~~--n~---~~p~~l~t~s~d~~Vklw~~~~~~~~~v~~~~~~~~rl~c~~~~~~~a~~la~G  437 (463)
T KOG0270|consen  364 -KPVWTLKAHDDEISGLSV--NI---QTPGLLSTASTDKVVKLWKFDVDSPKSVKEHSFKLGRLHCFALDPDVAFTLAFG  437 (463)
T ss_pred             -CceeEEEeccCCcceEEe--cC---CCCcceeeccccceEEEEeecCCCCcccccccccccceeecccCCCcceEEEec
Confidence             123333334322211111  10   1124799999999999999864332  222222224577888888876 68999


Q ss_pred             eCCCCEEEeecCCCCccCCC
Q 022074          271 SWDGDVVRWEFPGNGEAAPP  290 (303)
Q Consensus       271 s~Dg~i~~Wd~~~~~~~~~~  290 (303)
                      +..+.+++||.....+-.+.
T Consensus       438 G~k~~~~vwd~~~~~~V~ka  457 (463)
T KOG0270|consen  438 GEKAVLRVWDIFTNSPVRKA  457 (463)
T ss_pred             CccceEEEeecccChhHHHh
Confidence            99999999998765444333


No 116
>KOG0642 consensus Cell-cycle nuclear protein, contains WD-40 repeats [Cell cycle control, cell division, chromosome partitioning]
Probab=99.84  E-value=5.8e-20  Score=163.34  Aligned_cols=247  Identities=19%  Similarity=0.268  Sum_probs=168.7

Q ss_pred             cCCCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCC--------ceEEEEecccCCeEEEEEccCCCcEEEEecCCCe
Q 022074           34 DDGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEAN--------KLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNL  105 (303)
Q Consensus        34 ~~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~--------~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~  105 (303)
                      +..-|...|..+.|.+....+++++.||++.+|.+...        ....++.+|.++|-|++. +.+++.+++|+-||+
T Consensus       289 tl~s~~d~ir~l~~~~sep~lit~sed~~lk~WnLqk~~~s~~~~~epi~tfraH~gPVl~v~v-~~n~~~~ysgg~Dg~  367 (577)
T KOG0642|consen  289 TLRSHDDCIRALAFHPSEPVLITASEDGTLKLWNLQKAKKSAEKDVEPILTFRAHEGPVLCVVV-PSNGEHCYSGGIDGT  367 (577)
T ss_pred             eeecchhhhhhhhcCCCCCeEEEeccccchhhhhhcccCCccccceeeeEEEecccCceEEEEe-cCCceEEEeeccCce
Confidence            33567788999999999999999999999999999321        134678899999999999 567899999999999


Q ss_pred             EEEEcCcc------ccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccC--C-cccccCccceeee
Q 022074          106 CKVWDRRC------LNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSS--N-ASCNLGFRSYEWD  176 (303)
Q Consensus       106 v~lWd~~~------~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~--~-~~~~~~~~~~~~~  176 (303)
                      |+.|++..      ..........+.||.++|+.+++++....|++++.||++|+|+......  . ..+..++ ...++
T Consensus       368 I~~w~~p~n~dp~ds~dp~vl~~~l~Ghtdavw~l~~s~~~~~Llscs~DgTvr~w~~~~~~~~~f~~~~e~g~-Plsvd  446 (577)
T KOG0642|consen  368 IRCWNLPPNQDPDDSYDPSVLSGTLLGHTDAVWLLALSSTKDRLLSCSSDGTVRLWEPTEESPCTFGEPKEHGY-PLSVD  446 (577)
T ss_pred             eeeeccCCCCCcccccCcchhccceeccccceeeeeecccccceeeecCCceEEeeccCCcCccccCCccccCC-cceEe
Confidence            99996531      1111123457889999999999998888899999999999998765433  0 0011111 11111


Q ss_pred             ceeee--CCCCCccccC--C----CCCcceEEeccc--ceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEE
Q 022074          177 YRWMD--YPPQARDLKH--P----CDQSVATYKGHS--VLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVA  246 (303)
Q Consensus       177 ~~~~~--~~~~~~~~~~--~----~~~~~~~~~~~~--~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~  246 (303)
                      .....  ..........  .    ....+..+....  ........ ...+-+|.+.+.+++.+|+.|+++|..+++.+.
T Consensus       447 ~~ss~~a~~~~s~~~~~~~~~~~ev~s~~~~~~s~~~~~~~~~~~i-n~vVs~~~~~~~~~~hed~~Ir~~dn~~~~~l~  525 (577)
T KOG0642|consen  447 RTSSRPAHSLASFRFGYTSIDDMEVVSDLLIFESSASPGPRRYPQI-NKVVSHPTADITFTAHEDRSIRFFDNKTGKILH  525 (577)
T ss_pred             eccchhHhhhhhcccccccchhhhhhhheeeccccCCCcccccCcc-ceEEecCCCCeeEecccCCceecccccccccch
Confidence            00000  0000000000  0    000001110000  00000000 001124567789999999999999999999999


Q ss_pred             EeecCCCCeEEEEECCCCCeEEEEeCCCCEEEeecCC
Q 022074          247 ALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFPG  283 (303)
Q Consensus       247 ~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~~  283 (303)
                      ....|...++++++-|+|.+|.+++.||.+++|....
T Consensus       526 s~~a~~~svtslai~~ng~~l~s~s~d~sv~l~kld~  562 (577)
T KOG0642|consen  526 SMVAHKDSVTSLAIDPNGPYLMSGSHDGSVRLWKLDV  562 (577)
T ss_pred             heeeccceecceeecCCCceEEeecCCceeehhhccc
Confidence            9999999999999999999999999999999999753


No 117
>KOG0283 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.84  E-value=6.2e-20  Score=169.83  Aligned_cols=199  Identities=19%  Similarity=0.290  Sum_probs=139.9

Q ss_pred             CCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCcccc
Q 022074           36 GGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLN  115 (303)
Q Consensus        36 ~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~  115 (303)
                      .||..-|+.|+||.+ ++|+++|.|.|||||.+.... ....+.|.+.|+|++|+|.++++|+||+.||.||||++.   
T Consensus       366 ~GHt~DILDlSWSKn-~fLLSSSMDKTVRLWh~~~~~-CL~~F~HndfVTcVaFnPvDDryFiSGSLD~KvRiWsI~---  440 (712)
T KOG0283|consen  366 KGHTADILDLSWSKN-NFLLSSSMDKTVRLWHPGRKE-CLKVFSHNDFVTCVAFNPVDDRYFISGSLDGKVRLWSIS---  440 (712)
T ss_pred             hccchhheecccccC-CeeEeccccccEEeecCCCcc-eeeEEecCCeeEEEEecccCCCcEeecccccceEEeecC---
Confidence            399999999999964 689999999999999998665 446789999999999999999999999999999999974   


Q ss_pred             CCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccc---e-eeeceeeeC-CCCC-ccc
Q 022074          116 VKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRS---Y-EWDYRWMDY-PPQA-RDL  189 (303)
Q Consensus       116 ~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~---~-~~~~~~~~~-~~~~-~~~  189 (303)
                       ..+ +.....-.+-|+++.+.|+|++.+.|+.+|.+++|+++..+...+.......   . .-.+..+.+ +... +.+
T Consensus       441 -d~~-Vv~W~Dl~~lITAvcy~PdGk~avIGt~~G~C~fY~t~~lk~~~~~~I~~~~~Kk~~~~rITG~Q~~p~~~~~vL  518 (712)
T KOG0283|consen  441 -DKK-VVDWNDLRDLITAVCYSPDGKGAVIGTFNGYCRFYDTEGLKLVSDFHIRLHNKKKKQGKRITGLQFFPGDPDEVL  518 (712)
T ss_pred             -cCe-eEeehhhhhhheeEEeccCCceEEEEEeccEEEEEEccCCeEEEeeeEeeccCccccCceeeeeEecCCCCCeEE
Confidence             222 2222223367999999999999999999999999998765432221110000   0 000111122 2222 234


Q ss_pred             cCCCCCcceEEec--ccceeeeEE-----EeeeeeeeCCCeEEEEEeCCCeEEEEECCC
Q 022074          190 KHPCDQSVATYKG--HSVLRTLIR-----CHFSPVYSTGQKYIYTGSHDSCVYVYDLVS  241 (303)
Q Consensus       190 ~~~~~~~~~~~~~--~~~~~~~~~-----~~~~~~~s~~~~~latg~~dg~i~iwd~~~  241 (303)
                      ....+..+..+++  ...+.+...     ......|+.||+++++|++|..|++|+...
T Consensus       519 VTSnDSrIRI~d~~~~~lv~KfKG~~n~~SQ~~Asfs~Dgk~IVs~seDs~VYiW~~~~  577 (712)
T KOG0283|consen  519 VTSNDSRIRIYDGRDKDLVHKFKGFRNTSSQISASFSSDGKHIVSASEDSWVYIWKNDS  577 (712)
T ss_pred             EecCCCceEEEeccchhhhhhhcccccCCcceeeeEccCCCEEEEeecCceEEEEeCCC
Confidence            4445556666666  332222111     112345788999999999999999999743


No 118
>KOG1274 consensus WD40 repeat protein [General function prediction only]
Probab=99.84  E-value=6.2e-19  Score=164.45  Aligned_cols=211  Identities=19%  Similarity=0.276  Sum_probs=149.9

Q ss_pred             CCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCC-ceEEEEecccCC-------------------------------
Q 022074           36 GGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEAN-KLSLRILAHTSD-------------------------------   83 (303)
Q Consensus        36 ~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~-~~~~~~~~h~~~-------------------------------   83 (303)
                      ++|+.+...|+|.|+|++|++++.||.|++|+.... .....+..+...                               
T Consensus        10 yaht~G~t~i~~d~~gefi~tcgsdg~ir~~~~~sd~e~P~ti~~~g~~v~~ia~~s~~f~~~s~~~tv~~y~fps~~~~   89 (933)
T KOG1274|consen   10 YAHTGGLTLICYDPDGEFICTCGSDGDIRKWKTNSDEEEPETIDISGELVSSIACYSNHFLTGSEQNTVLRYKFPSGEED   89 (933)
T ss_pred             hhccCceEEEEEcCCCCEEEEecCCCceEEeecCCcccCCchhhccCceeEEEeecccceEEeeccceEEEeeCCCCCcc
Confidence            589999999999999999999999999999977554 222112113333                               


Q ss_pred             ---------eEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEE
Q 022074           84 ---------VNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKL  154 (303)
Q Consensus        84 ---------v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~l  154 (303)
                               +++++++ .++++++.||.|=.|++-+..    .......+.+|.+.|.+++++|.+++||+.+.||.|++
T Consensus        90 ~iL~Rftlp~r~~~v~-g~g~~iaagsdD~~vK~~~~~----D~s~~~~lrgh~apVl~l~~~p~~~fLAvss~dG~v~i  164 (933)
T KOG1274|consen   90 TILARFTLPIRDLAVS-GSGKMIAAGSDDTAVKLLNLD----DSSQEKVLRGHDAPVLQLSYDPKGNFLAVSSCDGKVQI  164 (933)
T ss_pred             ceeeeeeccceEEEEe-cCCcEEEeecCceeEEEEecc----ccchheeecccCCceeeeeEcCCCCEEEEEecCceEEE
Confidence                     3444442 234455555555555554432    22345567789999999999999999999999999999


Q ss_pred             EEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeE
Q 022074          155 WDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCV  234 (303)
Q Consensus       155 Wdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i  234 (303)
                      ||+................+                              .  ...+.+..+.|+|++..|+..+.|+.|
T Consensus       165 w~~~~~~~~~tl~~v~k~n~------------------------------~--~~s~i~~~~aW~Pk~g~la~~~~d~~V  212 (933)
T KOG1274|consen  165 WDLQDGILSKTLTGVDKDNE------------------------------F--ILSRICTRLAWHPKGGTLAVPPVDNTV  212 (933)
T ss_pred             EEcccchhhhhcccCCcccc------------------------------c--cccceeeeeeecCCCCeEEeeccCCeE
Confidence            99975322111100000000                              0  002233346688888888999999999


Q ss_pred             EEEECCCCeEEEEee--cCCCCeEEEEECCCCCeEEEEeCCCCEEEeecCC
Q 022074          235 YVYDLVSGEQVAALK--YHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFPG  283 (303)
Q Consensus       235 ~iwd~~~~~~~~~~~--~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~~  283 (303)
                      .+|+....+..+.+.  .+..-+.++.|||.|.|||+++.||.|.+||++.
T Consensus       213 kvy~r~~we~~f~Lr~~~~ss~~~~~~wsPnG~YiAAs~~~g~I~vWnv~t  263 (933)
T KOG1274|consen  213 KVYSRKGWELQFKLRDKLSSSKFSDLQWSPNGKYIAASTLDGQILVWNVDT  263 (933)
T ss_pred             EEEccCCceeheeecccccccceEEEEEcCCCcEEeeeccCCcEEEEeccc
Confidence            999999988877664  3445599999999999999999999999999986


No 119
>KOG0973 consensus Histone transcription regulator HIRA, WD repeat superfamily [Cell cycle control, cell division, chromosome partitioning; Transcription]
Probab=99.84  E-value=5.9e-19  Score=167.32  Aligned_cols=247  Identities=19%  Similarity=0.288  Sum_probs=176.5

Q ss_pred             ccCCCcccceEEEEEcCCCCEEEEee--CCCeEEEEECCCC------------ceEEEEecccCCeEEEEEccCCCcEEE
Q 022074           33 ADDGGYSFGIFSLKFSTDGRELVAGS--SDDCIYVYDLEAN------------KLSLRILAHTSDVNTVCFGDESGHLIY   98 (303)
Q Consensus        33 ~~~~~~~~~v~~l~~s~~g~~l~sgs--~Dg~v~lwd~~~~------------~~~~~~~~h~~~v~~l~~~~~~~~~l~   98 (303)
                      .+..=++..|++|+.+|||..+++|+  .|+.++||+.+.=            +...++..|.+.|+|+.|++ ++++|+
T Consensus         7 ~wv~H~~~~IfSIdv~pdg~~~aTgGq~~d~~~~iW~~~~vl~~~~~~~~~l~k~l~~m~~h~~sv~CVR~S~-dG~~lA   85 (942)
T KOG0973|consen    7 TWVNHNEKSIFSIDVHPDGVKFATGGQVLDGGIVIWSQDPVLDEKEEKNENLPKHLCTMDDHDGSVNCVRFSP-DGSYLA   85 (942)
T ss_pred             cccccCCeeEEEEEecCCceeEecCCccccccceeeccccccchhhhhhcccchhheeeccccCceeEEEECC-CCCeEe
Confidence            44444556799999999999999999  8999999976431            12345678999999999974 699999


Q ss_pred             EecCCCeEEEEcCcc------cc--------CCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCc
Q 022074           99 SGSDDNLCKVWDRRC------LN--------VKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNA  164 (303)
Q Consensus        99 s~s~dg~v~lWd~~~------~~--------~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~  164 (303)
                      +|+.|+.|-+|+...      ..        ...+....+.+|...|..+.|+|++.+|+++|.|++|.+||.+......
T Consensus        86 sGSDD~~v~iW~~~~~~~~~~fgs~g~~~~vE~wk~~~~l~~H~~DV~Dv~Wsp~~~~lvS~s~DnsViiwn~~tF~~~~  165 (942)
T KOG0973|consen   86 SGSDDRLVMIWERAEIGSGTVFGSTGGAKNVESWKVVSILRGHDSDVLDVNWSPDDSLLVSVSLDNSVIIWNAKTFELLK  165 (942)
T ss_pred             eccCcceEEEeeecccCCcccccccccccccceeeEEEEEecCCCccceeccCCCccEEEEecccceEEEEccccceeee
Confidence            999999999998651      00        0112455778999999999999999999999999999999988753221


Q ss_pred             ccccCccceeeeceeeeCCCCCccccCCC-CCcceEEeccc-ceeeeEE----------EeeeeeeeCCCeEEEEEe---
Q 022074          165 SCNLGFRSYEWDYRWMDYPPQARDLKHPC-DQSVATYKGHS-VLRTLIR----------CHFSPVYSTGQKYIYTGS---  229 (303)
Q Consensus       165 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~-~~~~~~~----------~~~~~~~s~~~~~latg~---  229 (303)
                      .    +..+..-+....+.|.++.++... ++.+..+.-.. ...+.+.          .+..+.+||||++|++..   
T Consensus       166 v----l~~H~s~VKGvs~DP~Gky~ASqsdDrtikvwrt~dw~i~k~It~pf~~~~~~T~f~RlSWSPDG~~las~nA~n  241 (942)
T KOG0973|consen  166 V----LRGHQSLVKGVSWDPIGKYFASQSDDRTLKVWRTSDWGIEKSITKPFEESPLTTFFLRLSWSPDGHHLASPNAVN  241 (942)
T ss_pred             e----eecccccccceEECCccCeeeeecCCceEEEEEcccceeeEeeccchhhCCCcceeeecccCCCcCeecchhhcc
Confidence            1    122222233455566666665543 34444444111 1122222          122355789999998863   


Q ss_pred             -CCCeEEEEECCCCeEEEEeecCCCCeEEEEECCC------C-------C----eEEEEeCCCCEEEeecCCC
Q 022074          230 -HDSCVYVYDLVSGEQVAALKYHTSPVRDCSWHPS------Q-------P----MLVSSSWDGDVVRWEFPGN  284 (303)
Q Consensus       230 -~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~------~-------~----~las~s~Dg~i~~Wd~~~~  284 (303)
                       .-.++.|.+-.+.+.-..|-+|.+|+++++|+|.      .       .    .+|+||.|++|.+|....+
T Consensus       242 ~~~~~~~IieR~tWk~~~~LvGH~~p~evvrFnP~lfe~~~~ng~~~~~~~~y~i~AvgSqDrSlSVW~T~~~  314 (942)
T KOG0973|consen  242 GGKSTIAIIERGTWKVDKDLVGHSAPVEVVRFNPKLFERNNKNGTSTQPNCYYCIAAVGSQDRSLSVWNTALP  314 (942)
T ss_pred             CCcceeEEEecCCceeeeeeecCCCceEEEEeChHHhccccccCCccCCCcceEEEEEecCCccEEEEecCCC
Confidence             2445777776666667778899999999999982      1       1    6899999999999997543


No 120
>KOG0270 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.83  E-value=2.3e-19  Score=155.45  Aligned_cols=200  Identities=24%  Similarity=0.418  Sum_probs=151.6

Q ss_pred             EEEEEcC-------CCCEEEEeeCCCeEEEEECCCCceE---E------------------EEecccCCeEEEEEccCCC
Q 022074           43 FSLKFST-------DGRELVAGSSDDCIYVYDLEANKLS---L------------------RILAHTSDVNTVCFGDESG   94 (303)
Q Consensus        43 ~~l~~s~-------~g~~l~sgs~Dg~v~lwd~~~~~~~---~------------------~~~~h~~~v~~l~~~~~~~   94 (303)
                      .|+.|.-       .|+++|.|+.|..|-|||+.--...   .                  ...+|++.|..+.|+....
T Consensus       177 LC~ewld~~~~~~~~gNyvAiGtmdp~IeIWDLDI~d~v~P~~~LGs~~sk~~~k~~k~~~~~~gHTdavl~Ls~n~~~~  256 (463)
T KOG0270|consen  177 LCIEWLDHGSKSGGAGNYVAIGTMDPEIEIWDLDIVDAVLPCVTLGSKASKKKKKKGKRSNSASGHTDAVLALSWNRNFR  256 (463)
T ss_pred             hhhhhhhcCCCCCCCcceEEEeccCceeEEeccccccccccceeechhhhhhhhhhcccccccccchHHHHHHHhccccc
Confidence            4666652       3789999999999999998632211   0                  0235888899999987778


Q ss_pred             cEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCC-CCEEEEEeCCCcEEEEEcccccCCcccccCccce
Q 022074           95 HLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGD-GRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSY  173 (303)
Q Consensus        95 ~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~-~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~  173 (303)
                      +.|+|||.|.+|++||+.    ++++...+.-|...|.++.|++. ..+|++|+.|++|++.|.|.....        +.
T Consensus       257 nVLaSgsaD~TV~lWD~~----~g~p~~s~~~~~k~Vq~l~wh~~~p~~LLsGs~D~~V~l~D~R~~~~s--------~~  324 (463)
T KOG0270|consen  257 NVLASGSADKTVKLWDVD----TGKPKSSITHHGKKVQTLEWHPYEPSVLLSGSYDGTVALKDCRDPSNS--------GK  324 (463)
T ss_pred             eeEEecCCCceEEEEEcC----CCCcceehhhcCCceeEEEecCCCceEEEeccccceEEeeeccCcccc--------Cc
Confidence            899999999999999986    55667777778999999999874 577999999999999999852110        11


Q ss_pred             eeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCC-eEEEEeecCC
Q 022074          174 EWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSG-EQVAALKYHT  252 (303)
Q Consensus       174 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~-~~~~~~~~h~  252 (303)
                      .|.                       ++|.     +.+..+.+ +  ....++++..||+++-+|+++. +++.+++.|.
T Consensus       325 ~wk-----------------------~~g~-----VEkv~w~~-~--se~~f~~~tddG~v~~~D~R~~~~~vwt~~AHd  373 (463)
T KOG0270|consen  325 EWK-----------------------FDGE-----VEKVAWDP-H--SENSFFVSTDDGTVYYFDIRNPGKPVWTLKAHD  373 (463)
T ss_pred             eEE-----------------------eccc-----eEEEEecC-C--CceeEEEecCCceEEeeecCCCCCceeEEEecc
Confidence            111                       1111     11111221 1  1245778899999999999875 7799999999


Q ss_pred             CCeEEEEECCCCC-eEEEEeCCCCEEEeecCCCC
Q 022074          253 SPVRDCSWHPSQP-MLVSSSWDGDVVRWEFPGNG  285 (303)
Q Consensus       253 ~~I~~v~~sp~~~-~las~s~Dg~i~~Wd~~~~~  285 (303)
                      ++|.++++++.-+ +|+|++.|+++++|++....
T Consensus       374 ~~ISgl~~n~~~p~~l~t~s~d~~Vklw~~~~~~  407 (463)
T KOG0270|consen  374 DEISGLSVNIQTPGLLSTASTDKVVKLWKFDVDS  407 (463)
T ss_pred             CCcceEEecCCCCcceeeccccceEEEEeecCCC
Confidence            9999999999865 79999999999999997653


No 121
>KOG4283 consensus Transcription-coupled repair protein CSA, contains WD40 domain [Transcription; Replication, recombination and repair]
Probab=99.83  E-value=4.4e-19  Score=146.77  Aligned_cols=220  Identities=20%  Similarity=0.257  Sum_probs=153.0

Q ss_pred             CCcccceEEEEEcC-CCCEEEEeeCCCeEEEEECCCCce------E----EE-----EecccCCeEEEEEccCCCcEEEE
Q 022074           36 GGYSFGIFSLKFST-DGRELVAGSSDDCIYVYDLEANKL------S----LR-----ILAHTSDVNTVCFGDESGHLIYS   99 (303)
Q Consensus        36 ~~~~~~v~~l~~s~-~g~~l~sgs~Dg~v~lwd~~~~~~------~----~~-----~~~h~~~v~~l~~~~~~~~~l~s   99 (303)
                      ..|.++|.++...+ .|+++++|+.||.|.+||++....      .    ..     -.+|...|..+.|-|-+.-+|.+
T Consensus        40 r~HgGsvNsL~id~tegrymlSGgadgsi~v~Dl~n~t~~e~s~li~k~~c~v~~~h~~~Hky~iss~~WyP~DtGmFts  119 (397)
T KOG4283|consen   40 RPHGGSVNSLQIDLTEGRYMLSGGADGSIAVFDLQNATDYEASGLIAKHKCIVAKQHENGHKYAISSAIWYPIDTGMFTS  119 (397)
T ss_pred             ccCCCccceeeeccccceEEeecCCCccEEEEEeccccchhhccceeheeeeccccCCccceeeeeeeEEeeecCceeec
Confidence            78999999999997 588999999999999999987541      1    10     12467789999998766668889


Q ss_pred             ecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCC---CCCEEEEEeCCCcEEEEEcccccCCcccccCccceeee
Q 022074          100 GSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRG---DGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWD  176 (303)
Q Consensus       100 ~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~---~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~  176 (303)
                      ++.|.++++||....    +..-.| .-.+.|..-+.+|   .-.++++|.+|-+||+.|+....    +          
T Consensus       120 sSFDhtlKVWDtnTl----Q~a~~F-~me~~VYshamSp~a~sHcLiA~gtr~~~VrLCDi~SGs----~----------  180 (397)
T KOG4283|consen  120 SSFDHTLKVWDTNTL----QEAVDF-KMEGKVYSHAMSPMAMSHCLIAAGTRDVQVRLCDIASGS----F----------  180 (397)
T ss_pred             ccccceEEEeecccc----eeeEEe-ecCceeehhhcChhhhcceEEEEecCCCcEEEEeccCCc----c----------
Confidence            999999999996421    111122 1223333333332   23478889999999999986422    1          


Q ss_pred             ceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCC-eEEEEe-------
Q 022074          177 YRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSG-EQVAAL-------  248 (303)
Q Consensus       177 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~-~~~~~~-------  248 (303)
                                          --++.||..  .++...|+|.   ..-.|++|+.||.|++||++.- -+...+       
T Consensus       181 --------------------sH~LsGHr~--~vlaV~Wsp~---~e~vLatgsaDg~irlWDiRrasgcf~~lD~hn~k~  235 (397)
T KOG4283|consen  181 --------------------SHTLSGHRD--GVLAVEWSPS---SEWVLATGSADGAIRLWDIRRASGCFRVLDQHNTKR  235 (397)
T ss_pred             --------------------eeeeccccC--ceEEEEeccC---ceeEEEecCCCceEEEEEeecccceeEEeecccCcc
Confidence                                123444542  2333344442   2456899999999999998643 112222       


Q ss_pred             -------ecCCCCeEEEEECCCCCeEEEEeCCCCEEEeecCCCCccCCCCccc-ccccc
Q 022074          249 -------KYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFPGNGEAAPPLNKK-RIRRR  299 (303)
Q Consensus       249 -------~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~~~~~~~~~~~~~-~~~~~  299 (303)
                             ..|.+.++.++|+.++.++++.+.|..+++|+....+....++.+. +.++.
T Consensus       236 ~p~~~~n~ah~gkvngla~tSd~~~l~~~gtd~r~r~wn~~~G~ntl~~~g~~~~n~~~  294 (397)
T KOG4283|consen  236 PPILKTNTAHYGKVNGLAWTSDARYLASCGTDDRIRVWNMESGRNTLREFGPIIHNQTT  294 (397)
T ss_pred             CccccccccccceeeeeeecccchhhhhccCccceEEeecccCcccccccccccccccc
Confidence                   2567889999999999999999999999999987776666665443 33333


No 122
>KOG4283 consensus Transcription-coupled repair protein CSA, contains WD40 domain [Transcription; Replication, recombination and repair]
Probab=99.83  E-value=6.5e-19  Score=145.75  Aligned_cols=240  Identities=20%  Similarity=0.323  Sum_probs=181.1

Q ss_pred             eEEEEEcc---CchhhccccccccccCcCcccccCCCcccceEEEEEcCCCC---EEEEeeCCCeEEEEECCCCceEEEE
Q 022074            4 IVHIVDVG---SGTMESLANVTEIHDGLDFSAADDGGYSFGIFSLKFSTDGR---ELVAGSSDDCIYVYDLEANKLSLRI   77 (303)
Q Consensus         4 ~~~~~~~~---~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~s~~g~---~l~sgs~Dg~v~lwd~~~~~~~~~~   77 (303)
                      |-.|.|-|   ++|||+...||+.   +-.++.++--.+.-|++-++||-..   .+|+|..|-.|+|=|+..|.....+
T Consensus       108 ~WyP~DtGmFtssSFDhtlKVWDt---nTlQ~a~~F~me~~VYshamSp~a~sHcLiA~gtr~~~VrLCDi~SGs~sH~L  184 (397)
T KOG4283|consen  108 IWYPIDTGMFTSSSFDHTLKVWDT---NTLQEAVDFKMEGKVYSHAMSPMAMSHCLIAAGTRDVQVRLCDIASGSFSHTL  184 (397)
T ss_pred             EEeeecCceeecccccceEEEeec---ccceeeEEeecCceeehhhcChhhhcceEEEEecCCCcEEEEeccCCcceeee
Confidence            45677777   7999999999998   7677677777778899999998543   5777888889999999999999999


Q ss_pred             ecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccc-------cC-C-Ccc--ceeecccccCeEEEEeCCCCCEEEEE
Q 022074           78 LAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCL-------NV-K-GKP--AGVLMGHLEGITFIDSRGDGRYLISN  146 (303)
Q Consensus        78 ~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~-------~~-~-~~~--~~~~~~h~~~v~~~~~~~~~~~l~s~  146 (303)
                      .+|.++|-++.|+|...-.|++|+.||.||+||+|-.       .+ + .++  ...-..|.+.|..+++..++.+++++
T Consensus       185 sGHr~~vlaV~Wsp~~e~vLatgsaDg~irlWDiRrasgcf~~lD~hn~k~~p~~~~n~ah~gkvngla~tSd~~~l~~~  264 (397)
T KOG4283|consen  185 SGHRDGVLAVEWSPSSEWVLATGSADGAIRLWDIRRASGCFRVLDQHNTKRPPILKTNTAHYGKVNGLAWTSDARYLASC  264 (397)
T ss_pred             ccccCceEEEEeccCceeEEEecCCCceEEEEEeecccceeEEeecccCccCccccccccccceeeeeeecccchhhhhc
Confidence            9999999999999887888999999999999998621       00 1 111  11234588899999999999999999


Q ss_pred             eCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEE
Q 022074          147 GKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIY  226 (303)
Q Consensus       147 ~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~la  226 (303)
                      +.|..+++|++.........   +            .+.       +.+....+.      ..++       +.+...++
T Consensus       265 gtd~r~r~wn~~~G~ntl~~---~------------g~~-------~~n~~~~~~------~~~~-------~~~s~vfv  309 (397)
T KOG4283|consen  265 GTDDRIRVWNMESGRNTLRE---F------------GPI-------IHNQTTSFA------VHIQ-------SMDSDVFV  309 (397)
T ss_pred             cCccceEEeecccCcccccc---c------------ccc-------cccccccce------EEEe-------ecccceEE
Confidence            99999999998764322110   0            000       000000000      0000       11222333


Q ss_pred             EEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEECCCCCeEEEEeCCCCEEEeec
Q 022074          227 TGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEF  281 (303)
Q Consensus       227 tg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~  281 (303)
                      --=.++.+.++++-.++.+..++.|-..|.+.++-|+-+...+++.|+++..|-+
T Consensus       310 ~~p~~~~lall~~~sgs~ir~l~~h~k~i~c~~~~~~fq~~~tg~~d~ni~~w~p  364 (397)
T KOG4283|consen  310 LFPNDGSLALLNLLEGSFVRRLSTHLKRINCAAYRPDFEQCFTGDMNGNIYMWSP  364 (397)
T ss_pred             EEecCCeEEEEEccCceEEEeeecccceeeEEeecCchhhhhccccCCccccccc
Confidence            3334588999999999999999999999999999999999999999999999987


No 123
>KOG0278 consensus Serine/threonine kinase receptor-associated protein [Lipid transport and metabolism]
Probab=99.82  E-value=7.2e-20  Score=148.21  Aligned_cols=203  Identities=21%  Similarity=0.342  Sum_probs=160.4

Q ss_pred             cccCCCcccceEEEEEcC---CCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEE
Q 022074           32 AADDGGYSFGIFSLKFST---DGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKV  108 (303)
Q Consensus        32 ~~~~~~~~~~v~~l~~s~---~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~l  108 (303)
                      +..-.||+.||..++|||   +|-+|++++.||.--|=.-++|..+-++.+|++.|...+.+ .+..+.++++.|=+.++
T Consensus         7 pl~c~ghtrpvvdl~~s~itp~g~flisa~kd~~pmlr~g~tgdwigtfeghkgavw~~~l~-~na~~aasaaadftakv   85 (334)
T KOG0278|consen    7 PLTCHGHTRPVVDLAFSPITPDGYFLISASKDGKPMLRNGDTGDWIGTFEGHKGAVWSATLN-KNATRAASAAADFTAKV   85 (334)
T ss_pred             ceEEcCCCcceeEEeccCCCCCceEEEEeccCCCchhccCCCCCcEEeeeccCcceeeeecC-chhhhhhhhcccchhhh
Confidence            334479999999999994   89999999999998887788899999999999999999885 56778889999999999


Q ss_pred             EcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCcc
Q 022074          109 WDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARD  188 (303)
Q Consensus       109 Wd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  188 (303)
                      ||.-    ++-.+..| .|..-|..++|+.|.++|+|||.++.+|+||+.+.+...                        
T Consensus        86 w~a~----tgdelhsf-~hkhivk~~af~~ds~~lltgg~ekllrvfdln~p~App------------------------  136 (334)
T KOG0278|consen   86 WDAV----TGDELHSF-EHKHIVKAVAFSQDSNYLLTGGQEKLLRVFDLNRPKAPP------------------------  136 (334)
T ss_pred             hhhh----hhhhhhhh-hhhheeeeEEecccchhhhccchHHHhhhhhccCCCCCc------------------------
Confidence            9953    33333344 477889999999999999999999999999997643211                        


Q ss_pred             ccCCCCCcceEEecccc-eeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEECCCCCeE
Q 022074          189 LKHPCDQSVATYKGHSV-LRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQPML  267 (303)
Q Consensus       189 ~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~l  267 (303)
                               ..+.||.. +++++       |....+.+++...|++||+||.+++..+..+. ...+|+++..|++|++|
T Consensus       137 ---------~E~~ghtg~Ir~v~-------wc~eD~~iLSSadd~tVRLWD~rTgt~v~sL~-~~s~VtSlEvs~dG~il  199 (334)
T KOG0278|consen  137 ---------KEISGHTGGIRTVL-------WCHEDKCILSSADDKTVRLWDHRTGTEVQSLE-FNSPVTSLEVSQDGRIL  199 (334)
T ss_pred             ---------hhhcCCCCcceeEE-------EeccCceEEeeccCCceEEEEeccCcEEEEEe-cCCCCcceeeccCCCEE
Confidence                     11122221 12222       22234667777999999999999999998886 45689999999999988


Q ss_pred             EEEeCCCCEEEeecC
Q 022074          268 VSSSWDGDVVRWEFP  282 (303)
Q Consensus       268 as~s~Dg~i~~Wd~~  282 (303)
                      .++. .+.+.+||..
T Consensus       200 Tia~-gssV~Fwdak  213 (334)
T KOG0278|consen  200 TIAY-GSSVKFWDAK  213 (334)
T ss_pred             EEec-CceeEEeccc
Confidence            7774 6899999965


No 124
>KOG2106 consensus Uncharacterized conserved protein, contains HELP and WD40 domains [Function unknown]
Probab=99.82  E-value=7.4e-18  Score=148.29  Aligned_cols=231  Identities=21%  Similarity=0.366  Sum_probs=156.3

Q ss_pred             cceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCcccc----
Q 022074           40 FGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLN----  115 (303)
Q Consensus        40 ~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~----  115 (303)
                      --|.|++|.++|+ +++|..+|+|.||+..+.+...+...|+++|.+++.. .++. |+||++|..|.+||-....    
T Consensus       247 k~Vl~v~F~engd-viTgDS~G~i~Iw~~~~~~~~k~~~aH~ggv~~L~~l-r~Gt-llSGgKDRki~~Wd~~y~k~r~~  323 (626)
T KOG2106|consen  247 KFVLCVTFLENGD-VITGDSGGNILIWSKGTNRISKQVHAHDGGVFSLCML-RDGT-LLSGGKDRKIILWDDNYRKLRET  323 (626)
T ss_pred             eEEEEEEEcCCCC-EEeecCCceEEEEeCCCceEEeEeeecCCceEEEEEe-cCcc-EeecCccceEEeccccccccccc
Confidence            4699999999886 8899999999999998888888888999999999985 4464 5579999999999832100    


Q ss_pred             ----CC----------------------------CccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCC
Q 022074          116 ----VK----------------------------GKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSN  163 (303)
Q Consensus       116 ----~~----------------------------~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~  163 (303)
                          +.                            .......++|.+..+.++..|+.++++|++.|+.+++|+  ..+..
T Consensus       324 elPe~~G~iRtv~e~~~di~vGTtrN~iL~Gt~~~~f~~~v~gh~delwgla~hps~~q~~T~gqdk~v~lW~--~~k~~  401 (626)
T KOG2106|consen  324 ELPEQFGPIRTVAEGKGDILVGTTRNFILQGTLENGFTLTVQGHGDELWGLATHPSKNQLLTCGQDKHVRLWN--DHKLE  401 (626)
T ss_pred             cCchhcCCeeEEecCCCcEEEeeccceEEEeeecCCceEEEEecccceeeEEcCCChhheeeccCcceEEEcc--CCcee
Confidence                00                            001112357888999999999999999999999999998  22222


Q ss_pred             cccccCccceeeeceeeeCCCCCccccCCCCCcceEEeccc-ceeeeEEE---eeeeeeeCCCeEEEEEeCCCeEEEEEC
Q 022074          164 ASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHS-VLRTLIRC---HFSPVYSTGQKYIYTGSHDSCVYVYDL  239 (303)
Q Consensus       164 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~---~~~~~~s~~~~~latg~~dg~i~iwd~  239 (303)
                      .+...     +.+.....+.|...............++... ....+-.+   .....|+|+|.+||.|+.|+.|++|-+
T Consensus       402 wt~~~-----~d~~~~~~fhpsg~va~Gt~~G~w~V~d~e~~~lv~~~~d~~~ls~v~ysp~G~~lAvgs~d~~iyiy~V  476 (626)
T KOG2106|consen  402 WTKII-----EDPAECADFHPSGVVAVGTATGRWFVLDTETQDLVTIHTDNEQLSVVRYSPDGAFLAVGSHDNHIYIYRV  476 (626)
T ss_pred             EEEEe-----cCceeEeeccCcceEEEeeccceEEEEecccceeEEEEecCCceEEEEEcCCCCEEEEecCCCeEEEEEE
Confidence            22211     1112223333333111111111111111111 00000001   112348899999999999999999998


Q ss_pred             CC-CeEEEEe-ecCCCCeEEEEECCCCCeEEEEeCCCCEEEee
Q 022074          240 VS-GEQVAAL-KYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWE  280 (303)
Q Consensus       240 ~~-~~~~~~~-~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd  280 (303)
                      .. +.++... +.|..+|+.++||+|+++|.+-+.|-.|-.|.
T Consensus       477 s~~g~~y~r~~k~~gs~ithLDwS~Ds~~~~~~S~d~eiLyW~  519 (626)
T KOG2106|consen  477 SANGRKYSRVGKCSGSPITHLDWSSDSQFLVSNSGDYEILYWK  519 (626)
T ss_pred             CCCCcEEEEeeeecCceeEEeeecCCCceEEeccCceEEEEEc
Confidence            64 4444443 33448999999999999999999999999994


No 125
>KOG0641 consensus WD40 repeat protein [General function prediction only]
Probab=99.82  E-value=5.4e-18  Score=135.63  Aligned_cols=203  Identities=25%  Similarity=0.373  Sum_probs=152.9

Q ss_pred             CCcccceEEEEEcCC----CCEEEEee-CCCeEEEEECCCCceEEEEecccCCeEEEE-EccCCCcEEEEecCCCeEEEE
Q 022074           36 GGYSFGIFSLKFSTD----GRELVAGS-SDDCIYVYDLEANKLSLRILAHTSDVNTVC-FGDESGHLIYSGSDDNLCKVW  109 (303)
Q Consensus        36 ~~~~~~v~~l~~s~~----g~~l~sgs-~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~-~~~~~~~~l~s~s~dg~v~lW  109 (303)
                      +=|...|..++|-.+    |..|++++ .|-.|++-|-.+|+-...+.+|++.+.++. |+   +-+|++|+.|.+||.|
T Consensus       133 nmhdgtirdl~fld~~~s~~~il~s~gagdc~iy~tdc~~g~~~~a~sghtghilalyswn---~~m~~sgsqdktirfw  209 (350)
T KOG0641|consen  133 NMHDGTIRDLAFLDDPESGGAILASAGAGDCKIYITDCGRGQGFHALSGHTGHILALYSWN---GAMFASGSQDKTIRFW  209 (350)
T ss_pred             eecCCceeeeEEecCCCcCceEEEecCCCcceEEEeecCCCCcceeecCCcccEEEEEEec---CcEEEccCCCceEEEE
Confidence            567788999999853    45666655 466788889889988888999999998884 53   5799999999999999


Q ss_pred             cCccccCCCccceeecc---cccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCC
Q 022074          110 DRRCLNVKGKPAGVLMG---HLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQA  186 (303)
Q Consensus       110 d~~~~~~~~~~~~~~~~---h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  186 (303)
                      |+|........-..+.+   ...+|.+++..|.|++|++|-.|.++-+||+|....                        
T Consensus       210 dlrv~~~v~~l~~~~~~~glessavaav~vdpsgrll~sg~~dssc~lydirg~r~------------------------  265 (350)
T KOG0641|consen  210 DLRVNSCVNTLDNDFHDGGLESSAVAAVAVDPSGRLLASGHADSSCMLYDIRGGRM------------------------  265 (350)
T ss_pred             eeeccceeeeccCcccCCCcccceeEEEEECCCcceeeeccCCCceEEEEeeCCce------------------------
Confidence            99843221111111211   235689999999999999999999999999985332                        


Q ss_pred             ccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeE-----EEEeecCCCCeEEEEEC
Q 022074          187 RDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQ-----VAALKYHTSPVRDCSWH  261 (303)
Q Consensus       187 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~-----~~~~~~h~~~I~~v~~s  261 (303)
                                +..+..|....   +   +..|||.-.|+++++.|..|++=|+. |.+     ......|++.+-.+.|+
T Consensus       266 ----------iq~f~phsadi---r---~vrfsp~a~yllt~syd~~ikltdlq-gdla~el~~~vv~ehkdk~i~~rwh  328 (350)
T KOG0641|consen  266 ----------IQRFHPHSADI---R---CVRFSPGAHYLLTCSYDMKIKLTDLQ-GDLAHELPIMVVAEHKDKAIQCRWH  328 (350)
T ss_pred             ----------eeeeCCCccce---e---EEEeCCCceEEEEecccceEEEeecc-cchhhcCceEEEEeccCceEEEEec
Confidence                      22222232211   1   22378888999999999999999985 332     33446799999999999


Q ss_pred             CCCCeEEEEeCCCCEEEeecC
Q 022074          262 PSQPMLVSSSWDGDVVRWEFP  282 (303)
Q Consensus       262 p~~~~las~s~Dg~i~~Wd~~  282 (303)
                      |+.--+++.+.|.++.+|-.+
T Consensus       329 ~~d~sfisssadkt~tlwa~~  349 (350)
T KOG0641|consen  329 PQDFSFISSSADKTATLWALN  349 (350)
T ss_pred             CccceeeeccCcceEEEeccC
Confidence            999999999999999999864


No 126
>KOG2919 consensus Guanine nucleotide-binding protein [General function prediction only]
Probab=99.81  E-value=7.3e-19  Score=147.09  Aligned_cols=217  Identities=26%  Similarity=0.376  Sum_probs=158.9

Q ss_pred             hccccccccccCc-Ccc--cccCCCcccceEEEEEcCCCCEEEEeeCCCeEEEEEC-CCCceE--EEEe-----cccCCe
Q 022074           16 ESLANVTEIHDGL-DFS--AADDGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDL-EANKLS--LRIL-----AHTSDV   84 (303)
Q Consensus        16 ~~~~~~~~~~~~~-~~~--~~~~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~-~~~~~~--~~~~-----~h~~~v   84 (303)
                      +--||+|+..||. ..|  +.|-.---.+-.++.|+|||++|++| ..++|++||+ +.|+..  ....     +..+.+
T Consensus       132 ~~PIh~wdaftG~lraSy~~ydh~de~taAhsL~Fs~DGeqlfaG-ykrcirvFdt~RpGr~c~vy~t~~~~k~gq~gii  210 (406)
T KOG2919|consen  132 DQPIHLWDAFTGKLRASYRAYDHQDEYTAAHSLQFSPDGEQLFAG-YKRCIRVFDTSRPGRDCPVYTTVTKGKFGQKGII  210 (406)
T ss_pred             cCceeeeeccccccccchhhhhhHHhhhhheeEEecCCCCeEeec-ccceEEEeeccCCCCCCcchhhhhccccccccee
Confidence            3457899999998 222  21111011467899999999999976 6778999999 555432  1122     335678


Q ss_pred             EEEEEccCCCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeC-CCcEEEEEcccccCC
Q 022074           85 NTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGK-DQAIKLWDIRKMSSN  163 (303)
Q Consensus        85 ~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~-D~~v~lWdl~~~~~~  163 (303)
                      .+++|+|.+.+.++.++....+-|+.-.    ..++...+-||.++|+.+.+.++|+.|.+|++ |-.|..||+|.... 
T Consensus       211 sc~a~sP~~~~~~a~gsY~q~~giy~~~----~~~pl~llggh~gGvThL~~~edGn~lfsGaRk~dkIl~WDiR~~~~-  285 (406)
T KOG2919|consen  211 SCFAFSPMDSKTLAVGSYGQRVGIYNDD----GRRPLQLLGGHGGGVTHLQWCEDGNKLFSGARKDDKILCWDIRYSRD-  285 (406)
T ss_pred             eeeeccCCCCcceeeecccceeeeEecC----CCCceeeecccCCCeeeEEeccCcCeecccccCCCeEEEEeehhccc-
Confidence            9999998888899999988887776522    34667778899999999999999999999985 89999999985321 


Q ss_pred             cccccCccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCC-C
Q 022074          164 ASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVS-G  242 (303)
Q Consensus       164 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~-~  242 (303)
                                                      ++..+.+|.. .+--|+.|.  ..|.+++|++|+.||.|++||++. +
T Consensus       286 --------------------------------pv~~L~rhv~-~TNQRI~FD--ld~~~~~LasG~tdG~V~vwdlk~~g  330 (406)
T KOG2919|consen  286 --------------------------------PVYALERHVG-DTNQRILFD--LDPKGEILASGDTDGSVRVWDLKDLG  330 (406)
T ss_pred             --------------------------------hhhhhhhhcc-CccceEEEe--cCCCCceeeccCCCccEEEEecCCCC
Confidence                                            1111111110 011122232  236789999999999999999987 7


Q ss_pred             eEEEEeecCCCCeEEEEECCCCCeEEEEeCC
Q 022074          243 EQVAALKYHTSPVRDCSWHPSQPMLVSSSWD  273 (303)
Q Consensus       243 ~~~~~~~~h~~~I~~v~~sp~~~~las~s~D  273 (303)
                      +.+..+..|..-++.++++|--+++||++..
T Consensus       331 n~~sv~~~~sd~vNgvslnP~mpilatssGq  361 (406)
T KOG2919|consen  331 NEVSVTGNYSDTVNGVSLNPIMPILATSSGQ  361 (406)
T ss_pred             CcccccccccccccceecCcccceeeeccCc
Confidence            7777788899999999999999999999865


No 127
>KOG0299 consensus U3 snoRNP-associated protein (contains WD40 repeats) [RNA processing and modification]
Probab=99.81  E-value=2.1e-18  Score=150.16  Aligned_cols=205  Identities=20%  Similarity=0.327  Sum_probs=157.6

Q ss_pred             ccCCCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceE----EE------------E--ecccCCeEEEEEccCCC
Q 022074           33 ADDGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLS----LR------------I--LAHTSDVNTVCFGDESG   94 (303)
Q Consensus        33 ~~~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~----~~------------~--~~h~~~v~~l~~~~~~~   94 (303)
                      .....|..++.++.++|++++.++++.|++|.=|++.+++..    .+            .  ..|...+.+++.++ ++
T Consensus       136 ~~~~~H~~s~~~vals~d~~~~fsask~g~i~kw~v~tgk~~~~i~~~~ev~k~~~~~~k~~r~~h~keil~~avS~-Dg  214 (479)
T KOG0299|consen  136 RVIGKHQLSVTSVALSPDDKRVFSASKDGTILKWDVLTGKKDRYIIERDEVLKSHGNPLKESRKGHVKEILTLAVSS-DG  214 (479)
T ss_pred             eeeccccCcceEEEeeccccceeecCCCcceeeeehhcCcccccccccchhhhhccCCCCcccccccceeEEEEEcC-CC
Confidence            445799999999999999999999999999999999887622    00            1  26778899999975 58


Q ss_pred             cEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCcccee
Q 022074           95 HLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYE  174 (303)
Q Consensus        95 ~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~  174 (303)
                      ++|++|+.|..|.|||.+    +..++..+.+|.+.|.+++|....+.+++++.|++|++|++..+..            
T Consensus       215 kylatgg~d~~v~Iw~~~----t~ehv~~~~ghr~~V~~L~fr~gt~~lys~s~Drsvkvw~~~~~s~------------  278 (479)
T KOG0299|consen  215 KYLATGGRDRHVQIWDCD----TLEHVKVFKGHRGAVSSLAFRKGTSELYSASADRSVKVWSIDQLSY------------  278 (479)
T ss_pred             cEEEecCCCceEEEecCc----ccchhhcccccccceeeeeeecCccceeeeecCCceEEEehhHhHH------------
Confidence            999999999999999976    4456677899999999999988888899999999999999864321            


Q ss_pred             eeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCC
Q 022074          175 WDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSP  254 (303)
Q Consensus       175 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~  254 (303)
                                            +.++-||+.....+..      ..-++.+-.|+-|+++++|++.... ...+.+|.+.
T Consensus       279 ----------------------vetlyGHqd~v~~Ida------L~reR~vtVGgrDrT~rlwKi~ees-qlifrg~~~s  329 (479)
T KOG0299|consen  279 ----------------------VETLYGHQDGVLGIDA------LSRERCVTVGGRDRTVRLWKIPEES-QLIFRGGEGS  329 (479)
T ss_pred             ----------------------HHHHhCCccceeeech------hcccceEEeccccceeEEEeccccc-eeeeeCCCCC
Confidence                                  2223333322111110      1124555556799999999995443 3456789999


Q ss_pred             eEEEEECCCCCeEEEEeCCCCEEEeecCCC
Q 022074          255 VRDCSWHPSQPMLVSSSWDGDVVRWEFPGN  284 (303)
Q Consensus       255 I~~v~~sp~~~~las~s~Dg~i~~Wd~~~~  284 (303)
                      +.+++|-. ...++|||+||+|.+|.+..+
T Consensus       330 idcv~~In-~~HfvsGSdnG~IaLWs~~KK  358 (479)
T KOG0299|consen  330 IDCVAFIN-DEHFVSGSDNGSIALWSLLKK  358 (479)
T ss_pred             eeeEEEec-ccceeeccCCceEEEeeeccc
Confidence            99999954 456899999999999998643


No 128
>KOG0303 consensus Actin-binding protein Coronin, contains WD40 repeats [Cytoskeleton]
Probab=99.78  E-value=1.9e-17  Score=141.92  Aligned_cols=204  Identities=18%  Similarity=0.311  Sum_probs=149.2

Q ss_pred             CCcccceEEEEEcC-CCCEEEEeeCCCeEEEEECCCCce-------EEEEecccCCeEEEEEccCCCcEEEEecCCCeEE
Q 022074           36 GGYSFGIFSLKFST-DGRELVAGSSDDCIYVYDLEANKL-------SLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCK  107 (303)
Q Consensus        36 ~~~~~~v~~l~~s~-~g~~l~sgs~Dg~v~lwd~~~~~~-------~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~  107 (303)
                      .||+.+|..++|+| +...||+||.|-+|.||.+..+.+       ...+.+|...|..++|+|.-.+.|+|++.|.+|.
T Consensus        78 ~GHt~~vLDi~w~PfnD~vIASgSeD~~v~vW~IPe~~l~~~ltepvv~L~gH~rrVg~V~wHPtA~NVLlsag~Dn~v~  157 (472)
T KOG0303|consen   78 CGHTAPVLDIDWCPFNDCVIASGSEDTKVMVWQIPENGLTRDLTEPVVELYGHQRRVGLVQWHPTAPNVLLSAGSDNTVS  157 (472)
T ss_pred             cCccccccccccCccCCceeecCCCCceEEEEECCCcccccCcccceEEEeecceeEEEEeecccchhhHhhccCCceEE
Confidence            59999999999998 556799999999999999987643       3567899999999999887788999999999999


Q ss_pred             EEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCc
Q 022074          108 VWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQAR  187 (303)
Q Consensus       108 lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  187 (303)
                      +|++.    ++...-.+ .|.+-|.+++|+.+|.+|+|...|+.||+||.|........                     
T Consensus       158 iWnv~----tgeali~l-~hpd~i~S~sfn~dGs~l~TtckDKkvRv~dpr~~~~v~e~---------------------  211 (472)
T KOG0303|consen  158 IWNVG----TGEALITL-DHPDMVYSMSFNRDGSLLCTTCKDKKVRVIDPRRGTVVSEG---------------------  211 (472)
T ss_pred             EEecc----CCceeeec-CCCCeEEEEEeccCCceeeeecccceeEEEcCCCCcEeeec---------------------
Confidence            99975    33333334 39999999999999999999999999999999865422110                     


Q ss_pred             cccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEe---CCCeEEEEECCCCeE---EEEeecCCCCeEEEEEC
Q 022074          188 DLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGS---HDSCVYVYDLVSGEQ---VAALKYHTSPVRDCSWH  261 (303)
Q Consensus       188 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~---~dg~i~iwd~~~~~~---~~~~~~h~~~I~~v~~s  261 (303)
                       .         ...|....+    .    .|-.++.++-||-   ++..+.+||..+.+.   +.++.. ...|.==-|.
T Consensus       212 -~---------~heG~k~~R----a----ifl~~g~i~tTGfsr~seRq~aLwdp~nl~eP~~~~elDt-SnGvl~PFyD  272 (472)
T KOG0303|consen  212 -V---------AHEGAKPAR----A----IFLASGKIFTTGFSRMSERQIALWDPNNLEEPIALQELDT-SNGVLLPFYD  272 (472)
T ss_pred             -c---------cccCCCcce----e----EEeccCceeeeccccccccceeccCcccccCcceeEEecc-CCceEEeeec
Confidence             0         001111111    1    1223455444442   688999999987653   233332 2334444567


Q ss_pred             CCCC-eEEEEeCCCCEEEeecCCC
Q 022074          262 PSQP-MLVSSSWDGDVVRWEFPGN  284 (303)
Q Consensus       262 p~~~-~las~s~Dg~i~~Wd~~~~  284 (303)
                      ||.. +.+.|-.|++|+.+++...
T Consensus       273 ~dt~ivYl~GKGD~~IRYyEit~d  296 (472)
T KOG0303|consen  273 PDTSIVYLCGKGDSSIRYFEITNE  296 (472)
T ss_pred             CCCCEEEEEecCCcceEEEEecCC
Confidence            7776 4678889999999998643


No 129
>KOG0305 consensus Anaphase promoting complex, Cdc20, Cdh1, and Ama1 subunits [Cell cycle control, cell division, chromosome partitioning; Posttranslational modification, protein turnover, chaperones]
Probab=99.78  E-value=1.7e-17  Score=149.73  Aligned_cols=195  Identities=21%  Similarity=0.347  Sum_probs=149.7

Q ss_pred             eEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCccc
Q 022074           42 IFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPA  121 (303)
Q Consensus        42 v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~  121 (303)
                      ..=++|+. .+.+++|... .|++|+..++.......-+...|+++.|++ .+..|+.|..+|+|.|||..    +.+..
T Consensus       180 ~nlldWss-~n~laValg~-~vylW~~~s~~v~~l~~~~~~~vtSv~ws~-~G~~LavG~~~g~v~iwD~~----~~k~~  252 (484)
T KOG0305|consen  180 LNLLDWSS-ANVLAVALGQ-SVYLWSASSGSVTELCSFGEELVTSVKWSP-DGSHLAVGTSDGTVQIWDVK----EQKKT  252 (484)
T ss_pred             hhHhhccc-CCeEEEEecc-eEEEEecCCCceEEeEecCCCceEEEEECC-CCCEEEEeecCCeEEEEehh----hcccc
Confidence            45578884 4567776544 599999999985543333478999999964 68999999999999999975    23345


Q ss_pred             eeecc-cccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCcceEE
Q 022074          122 GVLMG-HLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATY  200 (303)
Q Consensus       122 ~~~~~-h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  200 (303)
                      +.+.+ |...|.++++.  +..+.+|+.|+.|..+|+|.......                                 .+
T Consensus       253 ~~~~~~h~~rvg~laW~--~~~lssGsr~~~I~~~dvR~~~~~~~---------------------------------~~  297 (484)
T KOG0305|consen  253 RTLRGSHASRVGSLAWN--SSVLSSGSRDGKILNHDVRISQHVVS---------------------------------TL  297 (484)
T ss_pred             ccccCCcCceeEEEecc--CceEEEecCCCcEEEEEEecchhhhh---------------------------------hh
Confidence            55666 88899999987  66899999999999999986432111                                 12


Q ss_pred             ecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEECCC-CCeEEEEeC--CCCEE
Q 022074          201 KGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSWHPS-QPMLVSSSW--DGDVV  277 (303)
Q Consensus       201 ~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~-~~~las~s~--Dg~i~  277 (303)
                      .+|....    |  ...+++++.++|+|+.|+.+.|||....+.+..+..|...|..++|+|- ..+||+|+.  |+.|+
T Consensus       298 ~~H~qeV----C--gLkws~d~~~lASGgnDN~~~Iwd~~~~~p~~~~~~H~aAVKA~awcP~q~~lLAsGGGs~D~~i~  371 (484)
T KOG0305|consen  298 QGHRQEV----C--GLKWSPDGNQLASGGNDNVVFIWDGLSPEPKFTFTEHTAAVKALAWCPWQSGLLATGGGSADRCIK  371 (484)
T ss_pred             hccccee----e--eeEECCCCCeeccCCCccceEeccCCCccccEEEeccceeeeEeeeCCCccCceEEcCCCcccEEE
Confidence            2232111    1  1236789999999999999999999777888888999999999999995 568888764  99999


Q ss_pred             EeecCCC
Q 022074          278 RWEFPGN  284 (303)
Q Consensus       278 ~Wd~~~~  284 (303)
                      +||....
T Consensus       372 fwn~~~g  378 (484)
T KOG0305|consen  372 FWNTNTG  378 (484)
T ss_pred             EEEcCCC
Confidence            9998644


No 130
>KOG4328 consensus WD40 protein [Function unknown]
Probab=99.77  E-value=1.3e-17  Score=145.06  Aligned_cols=204  Identities=17%  Similarity=0.208  Sum_probs=147.1

Q ss_pred             CcccceEEEEEcCCCC--EEEEeeCCCeEEEEECCCCc----eEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEc
Q 022074           37 GYSFGIFSLKFSTDGR--ELVAGSSDDCIYVYDLEANK----LSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWD  110 (303)
Q Consensus        37 ~~~~~v~~l~~s~~g~--~l~sgs~Dg~v~lwd~~~~~----~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd  110 (303)
                      -|..+|.+++|+|..+  .+++|..-|+|-+||+.+..    -...+..|.+.|+++.|+|.+...+++.|.||++|+-|
T Consensus       184 v~~~Rit~l~fHPt~~~~lva~GdK~G~VG~Wn~~~~~~d~d~v~~f~~hs~~Vs~l~F~P~n~s~i~ssSyDGtiR~~D  263 (498)
T KOG4328|consen  184 VTDRRITSLAFHPTENRKLVAVGDKGGQVGLWNFGTQEKDKDGVYLFTPHSGPVSGLKFSPANTSQIYSSSYDGTIRLQD  263 (498)
T ss_pred             ecccceEEEEecccCcceEEEEccCCCcEEEEecCCCCCccCceEEeccCCccccceEecCCChhheeeeccCceeeeee
Confidence            4567999999999655  78889999999999995322    23456789999999999998889999999999999999


Q ss_pred             CccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCC-cccccCccceeeeceeeeCCCCCccc
Q 022074          111 RRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSN-ASCNLGFRSYEWDYRWMDYPPQARDL  189 (303)
Q Consensus       111 ~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~  189 (303)
                      ++..  ....+-.+..-...+..++++.+...++.+..=|...+||+|..... ....+                     
T Consensus       264 ~~~~--i~e~v~s~~~d~~~fs~~d~~~e~~~vl~~~~~G~f~~iD~R~~~s~~~~~~l---------------------  320 (498)
T KOG4328|consen  264 FEGN--ISEEVLSLDTDNIWFSSLDFSAESRSVLFGDNVGNFNVIDLRTDGSEYENLRL---------------------  320 (498)
T ss_pred             ecch--hhHHHhhcCccceeeeeccccCCCccEEEeecccceEEEEeecCCccchhhhh---------------------
Confidence            7521  11111111112234567778888778888887789999999864321 00000                     


Q ss_pred             cCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeE-----EEEeecCCCCeEEEEECCCC
Q 022074          190 KHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQ-----VAALKYHTSPVRDCSWHPSQ  264 (303)
Q Consensus       190 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~-----~~~~~~h~~~I~~v~~sp~~  264 (303)
                         .++++.            ..+++|.   ...+|||+|.|++++|||+++...     +..+ .|..+|.++.|||++
T Consensus       321 ---h~kKI~------------sv~~NP~---~p~~laT~s~D~T~kIWD~R~l~~K~sp~lst~-~HrrsV~sAyFSPs~  381 (498)
T KOG4328|consen  321 ---HKKKIT------------SVALNPV---CPWFLATASLDQTAKIWDLRQLRGKASPFLSTL-PHRRSVNSAYFSPSG  381 (498)
T ss_pred             ---hhcccc------------eeecCCC---CchheeecccCcceeeeehhhhcCCCCcceecc-cccceeeeeEEcCCC
Confidence               011111            1122232   346899999999999999975432     3333 699999999999998


Q ss_pred             CeEEEEeCCCCEEEeecC
Q 022074          265 PMLVSSSWDGDVVRWEFP  282 (303)
Q Consensus       265 ~~las~s~Dg~i~~Wd~~  282 (303)
                      -.|+|.+.|+.|++||..
T Consensus       382 gtl~TT~~D~~IRv~dss  399 (498)
T KOG4328|consen  382 GTLLTTCQDNEIRVFDSS  399 (498)
T ss_pred             CceEeeccCCceEEeecc
Confidence            889999999999999974


No 131
>TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol.
Probab=99.77  E-value=6.9e-16  Score=134.49  Aligned_cols=257  Identities=15%  Similarity=0.111  Sum_probs=155.1

Q ss_pred             chhhccccccccccCcCcccccCCCcccceEEEEEcCCCCEE-EEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEcc
Q 022074           13 GTMESLANVTEIHDGLDFSAADDGGYSFGIFSLKFSTDGREL-VAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGD   91 (303)
Q Consensus        13 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~s~~g~~l-~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~   91 (303)
                      +..|..+.+|+.-+|..... . .+|. .+.++.|+|+|+.+ ++++.|+.|++||..+++....+..+. .+..+++++
T Consensus         7 ~~~d~~v~~~d~~t~~~~~~-~-~~~~-~~~~l~~~~dg~~l~~~~~~~~~v~~~d~~~~~~~~~~~~~~-~~~~~~~~~   82 (300)
T TIGR03866         7 NEKDNTISVIDTATLEVTRT-F-PVGQ-RPRGITLSKDGKLLYVCASDSDTIQVIDLATGEVIGTLPSGP-DPELFALHP   82 (300)
T ss_pred             ecCCCEEEEEECCCCceEEE-E-ECCC-CCCceEECCCCCEEEEEECCCCeEEEEECCCCcEEEeccCCC-CccEEEECC
Confidence            44566778887766553221 1 2232 35789999999976 567789999999999887665554443 356778876


Q ss_pred             CCCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCC-cEEEEEcccccCCcccccCc
Q 022074           92 ESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQ-AIKLWDIRKMSSNASCNLGF  170 (303)
Q Consensus        92 ~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~-~v~lWdl~~~~~~~~~~~~~  170 (303)
                      +...++++++.++.+++||++.    ......+. +...+..+.+++++.++++++.++ .+.+||.+............
T Consensus        83 ~g~~l~~~~~~~~~l~~~d~~~----~~~~~~~~-~~~~~~~~~~~~dg~~l~~~~~~~~~~~~~d~~~~~~~~~~~~~~  157 (300)
T TIGR03866        83 NGKILYIANEDDNLVTVIDIET----RKVLAEIP-VGVEPEGMAVSPDGKIVVNTSETTNMAHFIDTKTYEIVDNVLVDQ  157 (300)
T ss_pred             CCCEEEEEcCCCCeEEEEECCC----CeEEeEee-CCCCcceEEECCCCCEEEEEecCCCeEEEEeCCCCeEEEEEEcCC
Confidence            5333445566789999999863    22233332 223356788999999999888775 46677876432211110000


Q ss_pred             cceeeeceeeeCCCCCccccCC--CCCcceEEecccce-eeeEEE-----------eeeeeeeCCCeEEEE-EeCCCeEE
Q 022074          171 RSYEWDYRWMDYPPQARDLKHP--CDQSVATYKGHSVL-RTLIRC-----------HFSPVYSTGQKYIYT-GSHDSCVY  235 (303)
Q Consensus       171 ~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~-~~~~~~-----------~~~~~~s~~~~~lat-g~~dg~i~  235 (303)
                           ....+.+.++...+...  .+..+..++-.... ...+..           .....+++++++++. .+.++.+.
T Consensus       158 -----~~~~~~~s~dg~~l~~~~~~~~~v~i~d~~~~~~~~~~~~~~~~~~~~~~~~~~i~~s~dg~~~~~~~~~~~~i~  232 (300)
T TIGR03866       158 -----RPRFAEFTADGKELWVSSEIGGTVSVIDVATRKVIKKITFEIPGVHPEAVQPVGIKLTKDGKTAFVALGPANRVA  232 (300)
T ss_pred             -----CccEEEECCCCCEEEEEcCCCCEEEEEEcCcceeeeeeeecccccccccCCccceEECCCCCEEEEEcCCCCeEE
Confidence                 01112233333322111  12222222211100 000000           012346788887544 45677899


Q ss_pred             EEECCCCeEEEEeecCCCCeEEEEECCCCCeEEEE-eCCCCEEEeecCCC
Q 022074          236 VYDLVSGEQVAALKYHTSPVRDCSWHPSQPMLVSS-SWDGDVVRWEFPGN  284 (303)
Q Consensus       236 iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~las~-s~Dg~i~~Wd~~~~  284 (303)
                      +||..+++.+..+. +...+.+++|+|++++|+++ +.++.|++||+...
T Consensus       233 v~d~~~~~~~~~~~-~~~~~~~~~~~~~g~~l~~~~~~~~~i~v~d~~~~  281 (300)
T TIGR03866       233 VVDAKTYEVLDYLL-VGQRVWQLAFTPDEKYLLTTNGVSNDVSVIDVAAL  281 (300)
T ss_pred             EEECCCCcEEEEEE-eCCCcceEEECCCCCEEEEEcCCCCeEEEEECCCC
Confidence            99999888766553 44579999999999998876 56899999998753


No 132
>KOG2096 consensus WD40 repeat protein [General function prediction only]
Probab=99.77  E-value=2.8e-17  Score=137.48  Aligned_cols=211  Identities=16%  Similarity=0.220  Sum_probs=145.9

Q ss_pred             CCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCcccc
Q 022074           36 GGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLN  115 (303)
Q Consensus        36 ~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~  115 (303)
                      .-|...|..+-...++++|++++.|.+|.||+++ |+....+..-...-...+.+ +++..+++++...-|++|..- +.
T Consensus       184 ~kh~v~~i~iGiA~~~k~imsas~dt~i~lw~lk-Gq~L~~idtnq~~n~~aavS-P~GRFia~~gFTpDVkVwE~~-f~  260 (420)
T KOG2096|consen  184 RKHQVDIINIGIAGNAKYIMSASLDTKICLWDLK-GQLLQSIDTNQSSNYDAAVS-PDGRFIAVSGFTPDVKVWEPI-FT  260 (420)
T ss_pred             hhcccceEEEeecCCceEEEEecCCCcEEEEecC-CceeeeeccccccccceeeC-CCCcEEEEecCCCCceEEEEE-ec
Confidence            4677889999999999999999999999999998 55544333222222344554 579999999999999999852 11


Q ss_pred             CCC-----ccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCcccc
Q 022074          116 VKG-----KPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLK  190 (303)
Q Consensus       116 ~~~-----~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  190 (303)
                      ..+     ...-.+.||..+|.+++|+++.+.++|.+.||++|+||+...-..                   ..+.+.++
T Consensus       261 kdG~fqev~rvf~LkGH~saV~~~aFsn~S~r~vtvSkDG~wriwdtdVrY~~-------------------~qDpk~Lk  321 (420)
T KOG2096|consen  261 KDGTFQEVKRVFSLKGHQSAVLAAAFSNSSTRAVTVSKDGKWRIWDTDVRYEA-------------------GQDPKILK  321 (420)
T ss_pred             cCcchhhhhhhheeccchhheeeeeeCCCcceeEEEecCCcEEEeeccceEec-------------------CCCchHhh
Confidence            111     122346799999999999999999999999999999997531100                   00111111


Q ss_pred             CCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEee-cCCCCeEEEEECCCCCeEEE
Q 022074          191 HPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALK-YHTSPVRDCSWHPSQPMLVS  269 (303)
Q Consensus       191 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~-~h~~~I~~v~~sp~~~~las  269 (303)
                      ... .......+..     .    ....+|+++.||.. ....++++..++|+.+-+++ .|...|.+++|+++|++++|
T Consensus       322 ~g~-~pl~aag~~p-----~----RL~lsP~g~~lA~s-~gs~l~~~~se~g~~~~~~e~~h~~~Is~is~~~~g~~~at  390 (420)
T KOG2096|consen  322 EGS-APLHAAGSEP-----V----RLELSPSGDSLAVS-FGSDLKVFASEDGKDYPELEDIHSTTISSISYSSDGKYIAT  390 (420)
T ss_pred             cCC-cchhhcCCCc-----e----EEEeCCCCcEEEee-cCCceEEEEcccCccchhHHHhhcCceeeEEecCCCcEEee
Confidence            000 0000000000     0    12356788876654 45679999999998776664 79999999999999999999


Q ss_pred             EeCCCCEEEee
Q 022074          270 SSWDGDVVRWE  280 (303)
Q Consensus       270 ~s~Dg~i~~Wd  280 (303)
                      ++ |+-+++..
T Consensus       391 cG-dr~vrv~~  400 (420)
T KOG2096|consen  391 CG-DRYVRVIR  400 (420)
T ss_pred             ec-ceeeeeec
Confidence            98 56666665


No 133
>KOG0322 consensus G-protein beta subunit-like protein GNB1L, contains WD repeats [General function prediction only]
Probab=99.76  E-value=2e-17  Score=135.10  Aligned_cols=243  Identities=17%  Similarity=0.208  Sum_probs=152.3

Q ss_pred             CCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEe-cccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccc
Q 022074           36 GGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRIL-AHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCL  114 (303)
Q Consensus        36 ~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~-~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~  114 (303)
                      .+|...|+++.|..+++ |++|..-|.|.+|++.+......+. .|...|+.+.-.| + ..+.+-+.|+.+.+|++...
T Consensus        11 Rp~~~~v~s~~fqa~~r-L~sg~~~G~V~~w~lqt~r~~~~~r~~g~~~it~lq~~p-~-d~l~tqgRd~~L~lw~ia~s   87 (323)
T KOG0322|consen   11 RPHSSSVTSVLFQANER-LMSGLSVGIVKMWVLQTERDLPLIRLFGRLFITNLQSIP-N-DSLDTQGRDPLLILWTIAYS   87 (323)
T ss_pred             ccccchheehhhccchh-hhcccccceEEEEEeecCccchhhhhhccceeeceeecC-C-cchhhcCCCceEEEEEccCc
Confidence            58999999999998775 9999999999999999987766666 4556777776644 2 57789999999999986420


Q ss_pred             cC--------------------CCccc-------------------------e----eecccccCeEEEEeCC-CCC--E
Q 022074          115 NV--------------------KGKPA-------------------------G----VLMGHLEGITFIDSRG-DGR--Y  142 (303)
Q Consensus       115 ~~--------------------~~~~~-------------------------~----~~~~h~~~v~~~~~~~-~~~--~  142 (303)
                      ..                    ..++.                         +    ...+..+.+.+.++.. .+.  +
T Consensus        88 ~~i~i~Si~~nslgFCrfSl~~~~k~~eqll~yp~rgsde~h~~D~g~~tqv~i~dd~~~~Klgsvmc~~~~~~c~s~~l  167 (323)
T KOG0322|consen   88 AFISIHSIVVNSLGFCRFSLVKKPKNSEQLLEYPSRGSDETHKQDGGDTTQVQIADDSERSKLGSVMCQDKDHACGSTFL  167 (323)
T ss_pred             ceEEEeeeeccccccccceeccCCCcchhheecCCcccchhhhhccCccceeEccCchhccccCceeeeeccccccceEE
Confidence            00                    00000                         0    0011234455555322 232  3


Q ss_pred             EEEEeCCCcEEEEEcccccCCcc------cccCccceeeeceeeeCCCCC-ccccCCCCCcce--EEecc---cceeeeE
Q 022074          143 LISNGKDQAIKLWDIRKMSSNAS------CNLGFRSYEWDYRWMDYPPQA-RDLKHPCDQSVA--TYKGH---SVLRTLI  210 (303)
Q Consensus       143 l~s~~~D~~v~lWdl~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~--~~~~~---~~~~~~~  210 (303)
                      ++.|..+|.+.+||+........      ......+..-+...+++.+.. ..+.......+.  .++..   -.++...
T Consensus       168 llaGyEsghvv~wd~S~~~~~~~~~~~~kv~~~~ash~qpvlsldyas~~~rGisgga~dkl~~~Sl~~s~gslq~~~e~  247 (323)
T KOG0322|consen  168 LLAGYESGHVVIWDLSTGDKIIQLPQSSKVESPNASHKQPVLSLDYASSCDRGISGGADDKLVMYSLNHSTGSLQIRKEI  247 (323)
T ss_pred             EEEeccCCeEEEEEccCCceeeccccccccccchhhccCcceeeeechhhcCCcCCCccccceeeeeccccCcccccceE
Confidence            56788899999999976421111      101111111111112221110 001000011111  11111   0000000


Q ss_pred             E----EeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEECCCCCeEEEEeCCCCEEEeec
Q 022074          211 R----CHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEF  281 (303)
Q Consensus       211 ~----~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~  281 (303)
                      .    .......-||++.+||+|-|++||||..+++..+..++.|.+.|++++|||+.+++|+||.|+.|.+|++
T Consensus       248 ~lknpGv~gvrIRpD~KIlATAGWD~RiRVyswrtl~pLAVLkyHsagvn~vAfspd~~lmAaaskD~rISLWkL  322 (323)
T KOG0322|consen  248 TLKNPGVSGVRIRPDGKILATAGWDHRIRVYSWRTLNPLAVLKYHSAGVNAVAFSPDCELMAAASKDARISLWKL  322 (323)
T ss_pred             EecCCCccceEEccCCcEEeecccCCcEEEEEeccCCchhhhhhhhcceeEEEeCCCCchhhhccCCceEEeeec
Confidence            0    0011124589999999999999999999999999999999999999999999999999999999999985


No 134
>KOG0650 consensus WD40 repeat nucleolar protein Bop1, involved in ribosome biogenesis [Translation, ribosomal structure and biogenesis]
Probab=99.75  E-value=7.7e-17  Score=144.52  Aligned_cols=241  Identities=17%  Similarity=0.255  Sum_probs=164.8

Q ss_pred             CCCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccc
Q 022074           35 DGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCL  114 (303)
Q Consensus        35 ~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~  114 (303)
                      +.||+.-|.+|+..|.|.+|++|+.||+||||++.+|..+.++ ...+.|.+++|+|....-++.++....+.+-+...+
T Consensus       396 yrGHtg~Vr~iSvdp~G~wlasGsdDGtvriWEi~TgRcvr~~-~~d~~I~~vaw~P~~~~~vLAvA~~~~~~ivnp~~G  474 (733)
T KOG0650|consen  396 YRGHTGLVRSISVDPSGEWLASGSDDGTVRIWEIATGRCVRTV-QFDSEIRSVAWNPLSDLCVLAVAVGECVLIVNPIFG  474 (733)
T ss_pred             EeccCCeEEEEEecCCcceeeecCCCCcEEEEEeecceEEEEE-eecceeEEEEecCCCCceeEEEEecCceEEeCcccc
Confidence            3599999999999999999999999999999999999876543 455689999998754444444444444554432111


Q ss_pred             c---------------CCCc------------------cceeecccccCeEEEEeCCCCCEEEEEeC---CCcEEEEEcc
Q 022074          115 N---------------VKGK------------------PAGVLMGHLEGITFIDSRGDGRYLISNGK---DQAIKLWDIR  158 (303)
Q Consensus       115 ~---------------~~~~------------------~~~~~~~h~~~v~~~~~~~~~~~l~s~~~---D~~v~lWdl~  158 (303)
                      .               ....                  -++....|...|..+.|+..|.||++...   .+.|.|.+|.
T Consensus       475 ~~~e~~~t~ell~~~~~~~~p~~~~~~W~~~~~~e~~~~v~~~I~~~k~i~~vtWHrkGDYlatV~~~~~~~~VliHQLS  554 (733)
T KOG0650|consen  475 DRLEVGPTKELLASAPNESEPDAAVVTWSRASLDELEKGVCIVIKHPKSIRQVTWHRKGDYLATVMPDSGNKSVLIHQLS  554 (733)
T ss_pred             chhhhcchhhhhhcCCCccCCcccceeechhhhhhhccceEEEEecCCccceeeeecCCceEEEeccCCCcceEEEEecc
Confidence            0               0000                  01233346778889999999999998654   4788999998


Q ss_pred             cccCCcccccCccceeeeceeeeCCCCCccccCCCCCcceEEecc--cceeee---EEEeeeeeeeCCCeEEEEEeCCCe
Q 022074          159 KMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGH--SVLRTL---IRCHFSPVYSTGQKYIYTGSHDSC  233 (303)
Q Consensus       159 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~---~~~~~~~~~s~~~~~latg~~dg~  233 (303)
                      +......    |+.....+....+.|....+...+...+..++-.  ....++   .++..+...++.|..|+.|+.|+.
T Consensus       555 K~~sQ~P----F~kskG~vq~v~FHPs~p~lfVaTq~~vRiYdL~kqelvKkL~tg~kwiS~msihp~GDnli~gs~d~k  630 (733)
T KOG0650|consen  555 KRKSQSP----FRKSKGLVQRVKFHPSKPYLFVATQRSVRIYDLSKQELVKKLLTGSKWISSMSIHPNGDNLILGSYDKK  630 (733)
T ss_pred             cccccCc----hhhcCCceeEEEecCCCceEEEEeccceEEEehhHHHHHHHHhcCCeeeeeeeecCCCCeEEEecCCCe
Confidence            7543322    2111112233444444444444444443333211  111111   223334456788899999999999


Q ss_pred             EEEEECCCC-eEEEEeecCCCCeEEEEECCCCCeEEEEeCCCCEEEee
Q 022074          234 VYVYDLVSG-EQVAALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWE  280 (303)
Q Consensus       234 i~iwd~~~~-~~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd  280 (303)
                      +..+|+.-. +..+++..|...+++|+||+.-++++||+.||++.++-
T Consensus       631 ~~WfDldlsskPyk~lr~H~~avr~Va~H~ryPLfas~sdDgtv~Vfh  678 (733)
T KOG0650|consen  631 MCWFDLDLSSKPYKTLRLHEKAVRSVAFHKRYPLFASGSDDGTVIVFH  678 (733)
T ss_pred             eEEEEcccCcchhHHhhhhhhhhhhhhhccccceeeeecCCCcEEEEe
Confidence            999999865 45778899999999999999999999999999999985


No 135
>KOG1273 consensus WD40 repeat protein [General function prediction only]
Probab=99.75  E-value=1.5e-16  Score=132.85  Aligned_cols=199  Identities=18%  Similarity=0.251  Sum_probs=145.7

Q ss_pred             eEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCccc
Q 022074           42 IFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPA  121 (303)
Q Consensus        42 v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~  121 (303)
                      -.|+.|++.|.+||+|+.||.|.|||+.|......+.+|..+|++++|++ +++.|+|+|.|..|++||+..+.    +.
T Consensus        26 a~~~~Fs~~G~~lAvGc~nG~vvI~D~~T~~iar~lsaH~~pi~sl~WS~-dgr~LltsS~D~si~lwDl~~gs----~l  100 (405)
T KOG1273|consen   26 AECCQFSRWGDYLAVGCANGRVVIYDFDTFRIARMLSAHVRPITSLCWSR-DGRKLLTSSRDWSIKLWDLLKGS----PL  100 (405)
T ss_pred             cceEEeccCcceeeeeccCCcEEEEEccccchhhhhhccccceeEEEecC-CCCEeeeecCCceeEEEeccCCC----ce
Confidence            67999999999999999999999999999887777889999999999975 58899999999999999986432    23


Q ss_pred             eeecccccCeEEEEeCCCC-CEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCcceEE
Q 022074          122 GVLMGHLEGITFIDSRGDG-RYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATY  200 (303)
Q Consensus       122 ~~~~~h~~~v~~~~~~~~~-~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  200 (303)
                      ..+ ....+|+.+.++|.. +.++..-.+..-.+-++.....                        ..+....+..... 
T Consensus       101 ~ri-rf~spv~~~q~hp~k~n~~va~~~~~sp~vi~~s~~~h------------------------~~Lp~d~d~dln~-  154 (405)
T KOG1273|consen  101 KRI-RFDSPVWGAQWHPRKRNKCVATIMEESPVVIDFSDPKH------------------------SVLPKDDDGDLNS-  154 (405)
T ss_pred             eEE-EccCccceeeeccccCCeEEEEEecCCcEEEEecCCce------------------------eeccCCCcccccc-
Confidence            222 245678888887743 3333333333333333321000                        0000000000000 


Q ss_pred             ecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCC-CCeEEEEECCCCCeEEEEeCCCCEEEe
Q 022074          201 KGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHT-SPVRDCSWHPSQPMLVSSSWDGDVVRW  279 (303)
Q Consensus       201 ~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~-~~I~~v~~sp~~~~las~s~Dg~i~~W  279 (303)
                              ...   ...|.+.|+++++|...|.+.++|..+.+++..++... ..|..+.|+..|++|+.-+.|++|+.+
T Consensus       155 --------sas---~~~fdr~g~yIitGtsKGkllv~~a~t~e~vas~rits~~~IK~I~~s~~g~~liiNtsDRvIR~y  223 (405)
T KOG1273|consen  155 --------SAS---HGVFDRRGKYIITGTSKGKLLVYDAETLECVASFRITSVQAIKQIIVSRKGRFLIINTSDRVIRTY  223 (405)
T ss_pred             --------ccc---cccccCCCCEEEEecCcceEEEEecchheeeeeeeechheeeeEEEEeccCcEEEEecCCceEEEE
Confidence                    000   01356779999999999999999999999998887666 789999999999999999999999999


Q ss_pred             ecC
Q 022074          280 EFP  282 (303)
Q Consensus       280 d~~  282 (303)
                      +..
T Consensus       224 e~~  226 (405)
T KOG1273|consen  224 EIS  226 (405)
T ss_pred             ehh
Confidence            976


No 136
>KOG0646 consensus WD40 repeat protein [General function prediction only]
Probab=99.75  E-value=3.7e-17  Score=142.64  Aligned_cols=220  Identities=20%  Similarity=0.261  Sum_probs=158.5

Q ss_pred             ccccccccccCcCcccccCCCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCC---------CceEEEEecccCCeEEE
Q 022074           17 SLANVTEIHDGLDFSAADDGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEA---------NKLSLRILAHTSDVNTV   87 (303)
Q Consensus        17 ~~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~---------~~~~~~~~~h~~~v~~l   87 (303)
                      .-+.+||.=+|.=..  .-.+|=++|.|+.|+.||+.+++||.||.|.+|++.+         -+....+..|+-.|+.+
T Consensus       103 g~lYlWelssG~LL~--v~~aHYQ~ITcL~fs~dgs~iiTgskDg~V~vW~l~~lv~a~~~~~~~p~~~f~~HtlsITDl  180 (476)
T KOG0646|consen  103 GNLYLWELSSGILLN--VLSAHYQSITCLKFSDDGSHIITGSKDGAVLVWLLTDLVSADNDHSVKPLHIFSDHTLSITDL  180 (476)
T ss_pred             CcEEEEEeccccHHH--HHHhhccceeEEEEeCCCcEEEecCCCccEEEEEEEeecccccCCCccceeeeccCcceeEEE
Confidence            345778887776432  1178999999999999999999999999999997642         12345677899999999


Q ss_pred             EEccC-CCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCccc
Q 022074           88 CFGDE-SGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASC  166 (303)
Q Consensus        88 ~~~~~-~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~  166 (303)
                      ...+. ...+++|+|.|.++++||+...    .....+ ....++.++...|.++.+..|+.+|.|.+.++-...... .
T Consensus       181 ~ig~Gg~~~rl~TaS~D~t~k~wdlS~g----~LLlti-~fp~si~av~lDpae~~~yiGt~~G~I~~~~~~~~~~~~-~  254 (476)
T KOG0646|consen  181 QIGSGGTNARLYTASEDRTIKLWDLSLG----VLLLTI-TFPSSIKAVALDPAERVVYIGTEEGKIFQNLLFKLSGQS-A  254 (476)
T ss_pred             EecCCCccceEEEecCCceEEEEEeccc----eeeEEE-ecCCcceeEEEcccccEEEecCCcceEEeeehhcCCccc-c
Confidence            87543 3568999999999999998632    222222 234678899999999999999999999999876432100 0


Q ss_pred             ccCccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEE
Q 022074          167 NLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVA  246 (303)
Q Consensus       167 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~  246 (303)
                      ...                 +...+.....+..+.||..... +   .+..++.||.+|++|++||.++|||+.+.+.++
T Consensus       255 ~v~-----------------~k~~~~~~t~~~~~~Gh~~~~~-I---TcLais~DgtlLlSGd~dg~VcvWdi~S~Q~iR  313 (476)
T KOG0646|consen  255 GVN-----------------QKGRHEENTQINVLVGHENESA-I---TCLAISTDGTLLLSGDEDGKVCVWDIYSKQCIR  313 (476)
T ss_pred             ccc-----------------ccccccccceeeeeccccCCcc-e---eEEEEecCccEEEeeCCCCCEEEEecchHHHHH
Confidence            000                 0111122234556666655211 1   123467899999999999999999999999888


Q ss_pred             EeecCCCCeEEEEECCCCC
Q 022074          247 ALKYHTSPVRDCSWHPSQP  265 (303)
Q Consensus       247 ~~~~h~~~I~~v~~sp~~~  265 (303)
                      ++....++|+-+.+.|=.+
T Consensus       314 tl~~~kgpVtnL~i~~~~~  332 (476)
T KOG0646|consen  314 TLQTSKGPVTNLQINPLER  332 (476)
T ss_pred             HHhhhccccceeEeecccc
Confidence            8866778999999976544


No 137
>KOG1063 consensus RNA polymerase II elongator complex, subunit ELP2, WD repeat superfamily [Chromatin structure and dynamics; Transcription]
Probab=99.75  E-value=2e-16  Score=143.86  Aligned_cols=273  Identities=20%  Similarity=0.277  Sum_probs=177.5

Q ss_pred             EEccCchhhccccccccccCcCcccccC--------CCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCce---EEE
Q 022074            8 VDVGSGTMESLANVTEIHDGLDFSAADD--------GGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKL---SLR   76 (303)
Q Consensus         8 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~--------~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~---~~~   76 (303)
                      +++=|+|||..+=||+-   .+-++++.        +|-.++++++-|+|+++.+++-+.-|..++|..+....   ...
T Consensus       280 ~~LLSASaDksmiiW~p---d~~tGiWv~~vRlGe~gg~a~GF~g~lw~~n~~~ii~~g~~Gg~hlWkt~d~~~w~~~~~  356 (764)
T KOG1063|consen  280 LDLLSASADKSMIIWKP---DENTGIWVDVVRLGEVGGSAGGFWGGLWSPNSNVIIAHGRTGGFHLWKTKDKTFWTQEPV  356 (764)
T ss_pred             hhheecccCcceEEEec---CCccceEEEEEEeecccccccceeeEEEcCCCCEEEEecccCcEEEEeccCccceeeccc
Confidence            45668999988878775   33323332        46667899999999999999999999999998433322   223


Q ss_pred             EecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEE
Q 022074           77 ILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWD  156 (303)
Q Consensus        77 ~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWd  156 (303)
                      +.+|.++|..+.|.| .+++|+|++.|.+-|++-.-..+.+...+.+.+-|.-...++++-+....|++|+.++.+|+|+
T Consensus       357 iSGH~~~V~dv~W~p-sGeflLsvs~DQTTRlFa~wg~q~~wHEiaRPQiHGyDl~c~~~vn~~~~FVSgAdEKVlRvF~  435 (764)
T KOG1063|consen  357 ISGHVDGVKDVDWDP-SGEFLLSVSLDQTTRLFARWGRQQEWHEIARPQIHGYDLTCLSFVNEDLQFVSGADEKVLRVFE  435 (764)
T ss_pred             cccccccceeeeecC-CCCEEEEeccccceeeecccccccceeeecccccccccceeeehccCCceeeecccceeeeeec
Confidence            568999999999964 6889999999999999864322223344555667888889999888778899999999999998


Q ss_pred             cccc-----cC-Ccccc--------------cC------ccc----eeeeceeee-----------CCCCCccc-cCCCC
Q 022074          157 IRKM-----SS-NASCN--------------LG------FRS----YEWDYRWMD-----------YPPQARDL-KHPCD  194 (303)
Q Consensus       157 l~~~-----~~-~~~~~--------------~~------~~~----~~~~~~~~~-----------~~~~~~~~-~~~~~  194 (303)
                      ..+.     .. ...+.              ++      +..    -.....+..           -||....+ .+..-
T Consensus       436 aPk~fv~~l~~i~g~~~~~~~~~p~gA~VpaLGLSnKa~~~~e~~~G~~~~~~~et~~~~~p~~L~ePP~EdqLq~~tLw  515 (764)
T KOG1063|consen  436 APKSFVKSLMAICGKCFKGSDELPDGANVPALGLSNKAFFPGETNTGGEAAVCAETPLAAAPCELTEPPTEDQLQQNTLW  515 (764)
T ss_pred             CcHHHHHHHHHHhCccccCchhcccccccccccccCCCCcccccccccccceeeecccccCchhccCCChHHHHHHhccc
Confidence            5420     00 00000              00      000    000000000           01110000 00000


Q ss_pred             CcceEEecccceeee-------------------------------------EEEe----eeeeeeCCCeEEEEEeCCCe
Q 022074          195 QSVATYKGHSVLRTL-------------------------------------IRCH----FSPVYSTGQKYIYTGSHDSC  233 (303)
Q Consensus       195 ~~~~~~~~~~~~~~~-------------------------------------~~~~----~~~~~s~~~~~latg~~dg~  233 (303)
                      ..+..+.||.+..+.                                     +..|    ....||||+++|++.+-|++
T Consensus       516 PEv~KLYGHGyEv~~l~~s~~gnliASaCKS~~~ehAvI~lw~t~~W~~~~~L~~HsLTVT~l~FSpdg~~LLsvsRDRt  595 (764)
T KOG1063|consen  516 PEVHKLYGHGYEVYALAISPTGNLIASACKSSLKEHAVIRLWNTANWLQVQELEGHSLTVTRLAFSPDGRYLLSVSRDRT  595 (764)
T ss_pred             hhhHHhccCceeEEEEEecCCCCEEeehhhhCCccceEEEEEeccchhhhheecccceEEEEEEECCCCcEEEEeecCce
Confidence            001111122111100                                     0000    12458999999999999999


Q ss_pred             EEEEECCCCeE----EEEeecCCCCeEEEEECCCCCeEEEEeCCCCEEEeecCCC
Q 022074          234 VYVYDLVSGEQ----VAALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFPGN  284 (303)
Q Consensus       234 i~iwd~~~~~~----~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~~~  284 (303)
                      +.+|.......    ....+.|+.-|++++|+|+..++||+|.|.++++|..+..
T Consensus       596 ~sl~~~~~~~~~e~~fa~~k~HtRIIWdcsW~pde~~FaTaSRDK~VkVW~~~~~  650 (764)
T KOG1063|consen  596 VSLYEVQEDIKDEFRFACLKAHTRIIWDCSWSPDEKYFATASRDKKVKVWEEPDL  650 (764)
T ss_pred             EEeeeeecccchhhhhccccccceEEEEcccCcccceeEEecCCceEEEEeccCc
Confidence            99998754322    2236799999999999999999999999999999997654


No 138
>KOG4328 consensus WD40 protein [Function unknown]
Probab=99.75  E-value=7.7e-17  Score=140.39  Aligned_cols=216  Identities=21%  Similarity=0.271  Sum_probs=152.6

Q ss_pred             CCcccceEEEEEcCC-CCEEEEeeCCCeEEEEECCCCce-----------------------------------------
Q 022074           36 GGYSFGIFSLKFSTD-GRELVAGSSDDCIYVYDLEANKL-----------------------------------------   73 (303)
Q Consensus        36 ~~~~~~v~~l~~s~~-g~~l~sgs~Dg~v~lwd~~~~~~-----------------------------------------   73 (303)
                      .+|.++|.+|.|+|. -..+++.|.||+||+-|+++...                                         
T Consensus       231 ~~hs~~Vs~l~F~P~n~s~i~ssSyDGtiR~~D~~~~i~e~v~s~~~d~~~fs~~d~~~e~~~vl~~~~~G~f~~iD~R~  310 (498)
T KOG4328|consen  231 TPHSGPVSGLKFSPANTSQIYSSSYDGTIRLQDFEGNISEEVLSLDTDNIWFSSLDFSAESRSVLFGDNVGNFNVIDLRT  310 (498)
T ss_pred             ccCCccccceEecCCChhheeeeccCceeeeeeecchhhHHHhhcCccceeeeeccccCCCccEEEeecccceEEEEeec
Confidence            699999999999985 44799999999999999876531                                         


Q ss_pred             ----EEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCC
Q 022074           74 ----SLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKD  149 (303)
Q Consensus        74 ----~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D  149 (303)
                          ...+.-|+..|+.++++|.++.+|+|++.|++++|||+|.......|.-....|...|.+..|+|.+..|+|.+.|
T Consensus       311 ~~s~~~~~~lh~kKI~sv~~NP~~p~~laT~s~D~T~kIWD~R~l~~K~sp~lst~~HrrsV~sAyFSPs~gtl~TT~~D  390 (498)
T KOG4328|consen  311 DGSEYENLRLHKKKITSVALNPVCPWFLATASLDQTAKIWDLRQLRGKASPFLSTLPHRRSVNSAYFSPSGGTLLTTCQD  390 (498)
T ss_pred             CCccchhhhhhhcccceeecCCCCchheeecccCcceeeeehhhhcCCCCcceecccccceeeeeEEcCCCCceEeeccC
Confidence                0012346678999999998999999999999999999985432222333445799999999999998889999999


Q ss_pred             CcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEe
Q 022074          150 QAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGS  229 (303)
Q Consensus       150 ~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~  229 (303)
                      ..|||||.......                  .+|. ..+.|.+..          -+.+  -.|.+.|.|+..++++|-
T Consensus       391 ~~IRv~dss~~sa~------------------~~p~-~~I~Hn~~t----------~Rwl--T~fKA~W~P~~~li~vg~  439 (498)
T KOG4328|consen  391 NEIRVFDSSCISAK------------------DEPL-GTIPHNNRT----------GRWL--TPFKAAWDPDYNLIVVGR  439 (498)
T ss_pred             CceEEeeccccccc------------------CCcc-ceeeccCcc----------cccc--cchhheeCCCccEEEEec
Confidence            99999997421100                  0000 001111110          0000  012234567888999999


Q ss_pred             CCCeEEEEECCCCeEEEEeecCCC-CeEE-EEECCCCC-eEEEEeCCCCEEEeecC
Q 022074          230 HDSCVYVYDLVSGEQVAALKYHTS-PVRD-CSWHPSQP-MLVSSSWDGDVVRWEFP  282 (303)
Q Consensus       230 ~dg~i~iwd~~~~~~~~~~~~h~~-~I~~-v~~sp~~~-~las~s~Dg~i~~Wd~~  282 (303)
                      .-..|-|+|...++.+..+-.... .|.+ .+|+|.+. ++|.++.-|.|.+|.-+
T Consensus       440 ~~r~IDv~~~~~~q~v~el~~P~~~tI~~vn~~HP~~~~~~aG~~s~Gki~vft~k  495 (498)
T KOG4328|consen  440 YPRPIDVFDGNGGQMVCELHDPESSTIPSVNEFHPMRDTLAAGGNSSGKIYVFTNK  495 (498)
T ss_pred             cCcceeEEcCCCCEEeeeccCccccccccceeecccccceeccCCccceEEEEecC
Confidence            999999999988887766533222 3443 46999988 66666677889988744


No 139
>KOG0321 consensus WD40 repeat-containing protein L2DTL [Function unknown]
Probab=99.74  E-value=8.7e-17  Score=145.07  Aligned_cols=229  Identities=20%  Similarity=0.245  Sum_probs=146.5

Q ss_pred             EEEEEcC---CCCEEEEeeCCCeEEEEECCCCceE------EEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCcc
Q 022074           43 FSLKFST---DGRELVAGSSDDCIYVYDLEANKLS------LRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRC  113 (303)
Q Consensus        43 ~~l~~s~---~g~~l~sgs~Dg~v~lwd~~~~~~~------~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~  113 (303)
                      ++..|++   ....|+.+..||.|.++|.......      .....|...|..+.|.| ...+|++++.|.++++||++.
T Consensus        53 f~~sFs~~~n~eHiLavadE~G~i~l~dt~~~~fr~ee~~lk~~~aH~nAifDl~wap-ge~~lVsasGDsT~r~Wdvk~  131 (720)
T KOG0321|consen   53 FADSFSAAPNKEHILAVADEDGGIILFDTKSIVFRLEERQLKKPLAHKNAIFDLKWAP-GESLLVSASGDSTIRPWDVKT  131 (720)
T ss_pred             ccccccCCCCccceEEEecCCCceeeecchhhhcchhhhhhcccccccceeEeeccCC-CceeEEEccCCceeeeeeecc
Confidence            5577875   3457888999999999998765433      34668999999999976 677899999999999999864


Q ss_pred             ccCCCccceeecccccCeEEEEeCCCC-CEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCC
Q 022074          114 LNVKGKPAGVLMGHLEGITFIDSRGDG-RYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHP  192 (303)
Q Consensus       114 ~~~~~~~~~~~~~h~~~v~~~~~~~~~-~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  192 (303)
                      ....+.  ..+.||...|.+++|.+.+ ..|++|++|+.|.|||+|.-.......  + ......+....+...+.+   
T Consensus       132 s~l~G~--~~~~GH~~SvkS~cf~~~n~~vF~tGgRDg~illWD~R~n~~d~~e~--~-~~~~~~~~n~~ptpskp~---  203 (720)
T KOG0321|consen  132 SRLVGG--RLNLGHTGSVKSECFMPTNPAVFCTGGRDGEILLWDCRCNGVDALEE--F-DNRIYGRHNTAPTPSKPL---  203 (720)
T ss_pred             ceeecc--eeecccccccchhhhccCCCcceeeccCCCcEEEEEEeccchhhHHH--H-hhhhhccccCCCCCCchh---
Confidence            333222  2478999999999998854 568999999999999998532100000  0 000000000000000000   


Q ss_pred             CCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeC-CCeEEEEECCCCeEEEE------ee--cC---CCCeEEEEE
Q 022074          193 CDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSH-DSCVYVYDLVSGEQVAA------LK--YH---TSPVRDCSW  260 (303)
Q Consensus       193 ~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~-dg~i~iwd~~~~~~~~~------~~--~h---~~~I~~v~~  260 (303)
                       .+.+.....+....   .......+.-|...||++|. |+.|+|||++.....+.      .+  .|   .-.+.++..
T Consensus       204 -~kr~~k~kA~s~ti---~ssvTvv~fkDe~tlaSaga~D~~iKVWDLRk~~~~~r~ep~~~~~~~t~skrs~G~~nL~l  279 (720)
T KOG0321|consen  204 -KKRIRKWKAASNTI---FSSVTVVLFKDESTLASAGAADSTIKVWDLRKNYTAYRQEPRGSDKYPTHSKRSVGQVNLIL  279 (720)
T ss_pred             -hccccccccccCce---eeeeEEEEEeccceeeeccCCCcceEEEeecccccccccCCCcccCccCcccceeeeEEEEe
Confidence             00011111111100   00111234457788999887 99999999986543222      11  23   234677777


Q ss_pred             CCCCCeEEEEeCCCCEEEeecCCC
Q 022074          261 HPSQPMLVSSSWDGDVVRWEFPGN  284 (303)
Q Consensus       261 sp~~~~las~s~Dg~i~~Wd~~~~  284 (303)
                      ...|.+|...+.|++|.+|++...
T Consensus       280 DssGt~L~AsCtD~sIy~ynm~s~  303 (720)
T KOG0321|consen  280 DSSGTYLFASCTDNSIYFYNMRSL  303 (720)
T ss_pred             cCCCCeEEEEecCCcEEEEecccc
Confidence            778899888888999999998764


No 140
>KOG1063 consensus RNA polymerase II elongator complex, subunit ELP2, WD repeat superfamily [Chromatin structure and dynamics; Transcription]
Probab=99.74  E-value=9.4e-17  Score=146.04  Aligned_cols=214  Identities=15%  Similarity=0.247  Sum_probs=150.3

Q ss_pred             CCcccceEEEEEcCCCCEEEEeeCCC-----eEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEc
Q 022074           36 GGYSFGIFSLKFSTDGRELVAGSSDD-----CIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWD  110 (303)
Q Consensus        36 ~~~~~~v~~l~~s~~g~~l~sgs~Dg-----~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd  110 (303)
                      +||.+.|++++.+|+|+.+|+++...     .|+||+..+-.....+..|.-.|+.++|+| ++++|++++.|.++.+|.
T Consensus       522 YGHGyEv~~l~~s~~gnliASaCKS~~~ehAvI~lw~t~~W~~~~~L~~HsLTVT~l~FSp-dg~~LLsvsRDRt~sl~~  600 (764)
T KOG1063|consen  522 YGHGYEVYALAISPTGNLIASACKSSLKEHAVIRLWNTANWLQVQELEGHSLTVTRLAFSP-DGRYLLSVSRDRTVSLYE  600 (764)
T ss_pred             ccCceeEEEEEecCCCCEEeehhhhCCccceEEEEEeccchhhhheecccceEEEEEEECC-CCcEEEEeecCceEEeee
Confidence            69999999999999999999987544     489999988766667899999999999975 589999999999999998


Q ss_pred             CccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCcccc
Q 022074          111 RRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLK  190 (303)
Q Consensus       111 ~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  190 (303)
                      ................|..-|+.++|+|++.+|+|+|+|++|++|........  +...+..       ..+        
T Consensus       601 ~~~~~~~e~~fa~~k~HtRIIWdcsW~pde~~FaTaSRDK~VkVW~~~~~~d~--~i~~~a~-------~~~--------  663 (764)
T KOG1063|consen  601 VQEDIKDEFRFACLKAHTRIIWDCSWSPDEKYFATASRDKKVKVWEEPDLRDK--YISRFAC-------LKF--------  663 (764)
T ss_pred             eecccchhhhhccccccceEEEEcccCcccceeEEecCCceEEEEeccCchhh--hhhhhch-------hcc--------
Confidence            53111111112235678888999999999999999999999999976543100  0000000       000        


Q ss_pred             CCCCCcceEEecccceeeeEEEeeeeeeeCCC-eEEEEEeCCCeEEEEECC-------CCeE-----EEEeecCCCCeEE
Q 022074          191 HPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQ-KYIYTGSHDSCVYVYDLV-------SGEQ-----VAALKYHTSPVRD  257 (303)
Q Consensus       191 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~-~~latg~~dg~i~iwd~~-------~~~~-----~~~~~~h~~~I~~  257 (303)
                         ...            .....+.+.+.++. ..++.|-+.|.|.+|...       .+..     +.....|...|+.
T Consensus       664 ---~~a------------VTAv~~~~~~~~e~~~~vavGle~GeI~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~aV~r  728 (764)
T KOG1063|consen  664 ---SLA------------VTAVAYLPVDHNEKGDVVAVGLEKGEIVLWRRKREHRQVTVGTFNLDTRLCATIGPDSAVNR  728 (764)
T ss_pred             ---CCc------------eeeEEeeccccccccceEEEEecccEEEEEecccccccccceeeeeccccccccChHHhhhe
Confidence               000            00111223333333 367788899999999954       1111     1122356778999


Q ss_pred             EEECCC--------CC--eEEEEeCCCCEEEeecC
Q 022074          258 CSWHPS--------QP--MLVSSSWDGDVVRWEFP  282 (303)
Q Consensus       258 v~~sp~--------~~--~las~s~Dg~i~~Wd~~  282 (303)
                      +.|+|.        .+  .|++|++|..++++++.
T Consensus       729 l~w~p~~~~~~~~~~~~l~la~~g~D~~vri~nv~  763 (764)
T KOG1063|consen  729 LLWRPTCSDDWVEDKEWLNLAVGGDDESVRIFNVD  763 (764)
T ss_pred             eEeccccccccccccceeEEeeecccceeEEeecc
Confidence            999986        22  57999999999998864


No 141
>KOG0647 consensus mRNA export protein (contains WD40 repeats) [RNA processing and modification]
Probab=99.74  E-value=1.2e-16  Score=132.81  Aligned_cols=79  Identities=20%  Similarity=0.350  Sum_probs=64.5

Q ss_pred             ccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEccc
Q 022074           80 HTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRK  159 (303)
Q Consensus        80 h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~  159 (303)
                      .++-|.+++|+|....+++.+|+|++||+|++.....  ...+....|..+|.++.|+.+|..+++|+.|+++++|||..
T Consensus        26 P~DsIS~l~FSP~~~~~~~A~SWD~tVR~wevq~~g~--~~~ka~~~~~~PvL~v~WsddgskVf~g~~Dk~~k~wDL~S  103 (347)
T KOG0647|consen   26 PEDSISALAFSPQADNLLAAGSWDGTVRIWEVQNSGQ--LVPKAQQSHDGPVLDVCWSDDGSKVFSGGCDKQAKLWDLAS  103 (347)
T ss_pred             cccchheeEeccccCceEEecccCCceEEEEEecCCc--ccchhhhccCCCeEEEEEccCCceEEeeccCCceEEEEccC
Confidence            4567999999986677788999999999999752111  11244567889999999999999999999999999999976


Q ss_pred             c
Q 022074          160 M  160 (303)
Q Consensus       160 ~  160 (303)
                      .
T Consensus       104 ~  104 (347)
T KOG0647|consen  104 G  104 (347)
T ss_pred             C
Confidence            4


No 142
>KOG1539 consensus WD repeat protein [General function prediction only]
Probab=99.73  E-value=2.2e-16  Score=146.21  Aligned_cols=198  Identities=21%  Similarity=0.262  Sum_probs=156.2

Q ss_pred             cceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEE---ecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccC
Q 022074           40 FGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRI---LAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNV  116 (303)
Q Consensus        40 ~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~---~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~  116 (303)
                      ..+.+++.++.|++.+.|...|+|-+|++..|-....+   ..|++.|..++. ..-++.++|++.+|-++.||...   
T Consensus       449 ~~~~av~vs~CGNF~~IG~S~G~Id~fNmQSGi~r~sf~~~~ah~~~V~gla~-D~~n~~~vsa~~~Gilkfw~f~~---  524 (910)
T KOG1539|consen  449 INATAVCVSFCGNFVFIGYSKGTIDRFNMQSGIHRKSFGDSPAHKGEVTGLAV-DGTNRLLVSAGADGILKFWDFKK---  524 (910)
T ss_pred             cceEEEEEeccCceEEEeccCCeEEEEEcccCeeecccccCccccCceeEEEe-cCCCceEEEccCcceEEEEecCC---
Confidence            46889999999999999999999999999999777666   479999999998 34467899999999999999752   


Q ss_pred             CCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCc
Q 022074          117 KGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQS  196 (303)
Q Consensus       117 ~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  196 (303)
                       ......+. -..++.++..+.....++.+..|-.|+++|....+                                  .
T Consensus       525 -k~l~~~l~-l~~~~~~iv~hr~s~l~a~~~ddf~I~vvD~~t~k----------------------------------v  568 (910)
T KOG1539|consen  525 -KVLKKSLR-LGSSITGIVYHRVSDLLAIALDDFSIRVVDVVTRK----------------------------------V  568 (910)
T ss_pred             -cceeeeec-cCCCcceeeeeehhhhhhhhcCceeEEEEEchhhh----------------------------------h
Confidence             11222221 23456666667777789999999999999975321                                  2


Q ss_pred             ceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEECCCCCeEEEEeCC-CC
Q 022074          197 VATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQPMLVSSSWD-GD  275 (303)
Q Consensus       197 ~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~las~s~D-g~  275 (303)
                      +..+.||....+-      ..|||||++|++++.|++|++||+.++.++-.+. -..+++++.|||+|.+|||+..| .-
T Consensus       569 vR~f~gh~nritd------~~FS~DgrWlisasmD~tIr~wDlpt~~lID~~~-vd~~~~sls~SPngD~LAT~Hvd~~g  641 (910)
T KOG1539|consen  569 VREFWGHGNRITD------MTFSPDGRWLISASMDSTIRTWDLPTGTLIDGLL-VDSPCTSLSFSPNGDFLATVHVDQNG  641 (910)
T ss_pred             hHHhhccccceee------eEeCCCCcEEEEeecCCcEEEEeccCcceeeeEe-cCCcceeeEECCCCCEEEEEEecCce
Confidence            3334444433322      2488999999999999999999999999876663 45799999999999999999999 88


Q ss_pred             EEEeecCCC
Q 022074          276 VVRWEFPGN  284 (303)
Q Consensus       276 i~~Wd~~~~  284 (303)
                      |.+|-....
T Consensus       642 IylWsNksl  650 (910)
T KOG1539|consen  642 IYLWSNKSL  650 (910)
T ss_pred             EEEEEchhH
Confidence            999976544


No 143
>KOG0644 consensus Uncharacterized conserved protein, contains WD40 repeat and BROMO domains [General function prediction only]
Probab=99.73  E-value=9.7e-18  Score=155.14  Aligned_cols=229  Identities=21%  Similarity=0.361  Sum_probs=161.5

Q ss_pred             CCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCcccc
Q 022074           36 GGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLN  115 (303)
Q Consensus        36 ~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~  115 (303)
                      -||..+|+|+.|...|.++++|+.|..|+||.++++.......+|.+.++.++.+. +..+++++|.|..|++|.+.   
T Consensus       187 lgH~naVyca~fDrtg~~Iitgsdd~lvKiwS~et~~~lAs~rGhs~ditdlavs~-~n~~iaaaS~D~vIrvWrl~---  262 (1113)
T KOG0644|consen  187 LGHRNAVYCAIFDRTGRYIITGSDDRLVKIWSMETARCLASCRGHSGDITDLAVSS-NNTMIAAASNDKVIRVWRLP---  262 (1113)
T ss_pred             HhhhhheeeeeeccccceEeecCccceeeeeeccchhhhccCCCCccccchhccch-hhhhhhhcccCceEEEEecC---
Confidence            39999999999999999999999999999999999988888899999999999854 46688899999999999975   


Q ss_pred             CCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCccc---ccC----ccceeeeceeeeCCCCCcc
Q 022074          116 VKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASC---NLG----FRSYEWDYRWMDYPPQARD  188 (303)
Q Consensus       116 ~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~---~~~----~~~~~~~~~~~~~~~~~~~  188 (303)
                       .+.++.++.||+++|++++|+|-    .+.+.||++++||-|........   ...    +.++.+.-.+..+-.... 
T Consensus       263 -~~~pvsvLrghtgavtaiafsP~----~sss~dgt~~~wd~r~~~~~y~prp~~~~~~~~~~s~~~~~~~~~f~Tgs~-  336 (1113)
T KOG0644|consen  263 -DGAPVSVLRGHTGAVTAIAFSPR----ASSSDDGTCRIWDARLEPRIYVPRPLKFTEKDLVDSILFENNGDRFLTGSR-  336 (1113)
T ss_pred             -CCchHHHHhccccceeeeccCcc----ccCCCCCceEeccccccccccCCCCCCcccccceeeeeccccccccccccC-
Confidence             56778889999999999999984    48889999999998832211110   000    000000000000000000 


Q ss_pred             ccCCCCCcceEEeccccee---eeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEECCCCC
Q 022074          189 LKHPCDQSVATYKGHSVLR---TLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQP  265 (303)
Q Consensus       189 ~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~  265 (303)
                           +....   .|....   ......|...-+.-..+.+++-.+-.+.+|++-+|.++..+.+|..++..+.++|-.+
T Consensus       337 -----d~ea~---n~e~~~l~~~~~~lif~t~ssd~~~~~~~ar~~~~~~vwnl~~g~l~H~l~ghsd~~yvLd~Hpfn~  408 (1113)
T KOG0644|consen  337 -----DGEAR---NHEFEQLAWRSNLLIFVTRSSDLSSIVVTARNDHRLCVWNLYTGQLLHNLMGHSDEVYVLDVHPFNP  408 (1113)
T ss_pred             -----Ccccc---cchhhHhhhhccceEEEeccccccccceeeeeeeEeeeeecccchhhhhhcccccceeeeeecCCCc
Confidence                 00000   000000   0000000000011125677788888999999999999999999999999999999665


Q ss_pred             -eEEEEeCCCCEEEeecC
Q 022074          266 -MLVSSSWDGDVVRWEFP  282 (303)
Q Consensus       266 -~las~s~Dg~i~~Wd~~  282 (303)
                       ...+++.||...+||+.
T Consensus       409 ri~msag~dgst~iwdi~  426 (1113)
T KOG0644|consen  409 RIAMSAGYDGSTIIWDIW  426 (1113)
T ss_pred             HhhhhccCCCceEeeecc
Confidence             67799999999999975


No 144
>KOG0290 consensus Conserved WD40 repeat-containing protein AN11 [Function unknown]
Probab=99.72  E-value=2.5e-16  Score=130.27  Aligned_cols=212  Identities=19%  Similarity=0.369  Sum_probs=149.9

Q ss_pred             CcccceEEEEEcCCCC----EEEEeeCCCeEEEEECCC--CceEE-------EEecccCCeEEEEEccCCCcEEEEecCC
Q 022074           37 GYSFGIFSLKFSTDGR----ELVAGSSDDCIYVYDLEA--NKLSL-------RILAHTSDVNTVCFGDESGHLIYSGSDD  103 (303)
Q Consensus        37 ~~~~~v~~l~~s~~g~----~l~sgs~Dg~v~lwd~~~--~~~~~-------~~~~h~~~v~~l~~~~~~~~~l~s~s~d  103 (303)
                      -|..|+..+.|.|+.+    .+++.+.| .+|||.+..  .+...       +-..+..+++..-|+.-+.+++.++|-|
T Consensus        94 d~~YP~tK~~wiPd~~g~~pdlLATs~D-~LRlWri~~ee~~~~~~~~L~~~kns~~~aPlTSFDWne~dp~~igtSSiD  172 (364)
T KOG0290|consen   94 DHPYPVTKLMWIPDSKGVYPDLLATSSD-FLRLWRIGDEESRVELQSVLNNNKNSEFCAPLTSFDWNEVDPNLIGTSSID  172 (364)
T ss_pred             CCCCCccceEecCCccccCcchhhcccC-eEEEEeccCcCCceehhhhhccCcccccCCcccccccccCCcceeEeeccc
Confidence            6889999999999863    24443444 599998874  22211       1123446888889987778999999999


Q ss_pred             CeEEEEcCccccCCCccceeecccccCeEEEEeCCCC-CEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeC
Q 022074          104 NLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDG-RYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDY  182 (303)
Q Consensus       104 g~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~-~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~  182 (303)
                      -++.+||+... ..+.....+-.|...|..++|...+ ..|++.|.||+||+||||.+...        .+.++      
T Consensus       173 TTCTiWdie~~-~~~~vkTQLIAHDKEV~DIaf~~~s~~~FASvgaDGSvRmFDLR~leHS--------TIIYE------  237 (364)
T KOG0290|consen  173 TTCTIWDIETG-VSGTVKTQLIAHDKEVYDIAFLKGSRDVFASVGADGSVRMFDLRSLEHS--------TIIYE------  237 (364)
T ss_pred             CeEEEEEEeec-cccceeeEEEecCcceeEEEeccCccceEEEecCCCcEEEEEecccccc--------eEEec------
Confidence            99999998632 2334455677899999999998754 56899999999999999975421        11110      


Q ss_pred             CCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeC-CCeEEEEECCCC-eEEEEeecCCCCeEEEEE
Q 022074          183 PPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSH-DSCVYVYDLVSG-EQVAALKYHTSPVRDCSW  260 (303)
Q Consensus       183 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~-dg~i~iwd~~~~-~~~~~~~~h~~~I~~v~~  260 (303)
                      +|+.                   ...++|-.++.   .|-.++||-.+ ...|-|-|++.- ..+.++..|++.|+.++|
T Consensus       238 ~p~~-------------------~~pLlRLswnk---qDpnymATf~~dS~~V~iLDiR~P~tpva~L~~H~a~VNgIaW  295 (364)
T KOG0290|consen  238 DPSP-------------------STPLLRLSWNK---QDPNYMATFAMDSNKVVILDIRVPCTPVARLRNHQASVNGIAW  295 (364)
T ss_pred             CCCC-------------------CCcceeeccCc---CCchHHhhhhcCCceEEEEEecCCCcceehhhcCcccccceEe
Confidence            0110                   00112222221   13456666443 346889999865 458899999999999999


Q ss_pred             CCCCC-eEEEEeCCCCEEEeecCCCCc
Q 022074          261 HPSQP-MLVSSSWDGDVVRWEFPGNGE  286 (303)
Q Consensus       261 sp~~~-~las~s~Dg~i~~Wd~~~~~~  286 (303)
                      .|... .|.|+++|+.+-+||++.+..
T Consensus       296 aPhS~~hictaGDD~qaliWDl~q~~~  322 (364)
T KOG0290|consen  296 APHSSSHICTAGDDCQALIWDLQQMPR  322 (364)
T ss_pred             cCCCCceeeecCCcceEEEEecccccc
Confidence            99864 899999999999999987644


No 145
>KOG1274 consensus WD40 repeat protein [General function prediction only]
Probab=99.72  E-value=7.1e-16  Score=144.30  Aligned_cols=202  Identities=17%  Similarity=0.198  Sum_probs=149.8

Q ss_pred             CCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCcccc
Q 022074           36 GGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLN  115 (303)
Q Consensus        36 ~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~  115 (303)
                      ..++.++.+++|+.+|+.++.||.|-.|++.++.+......+.+|+++|.++.|.| +++.|++.+-||.|++||+....
T Consensus        93 ~Rftlp~r~~~v~g~g~~iaagsdD~~vK~~~~~D~s~~~~lrgh~apVl~l~~~p-~~~fLAvss~dG~v~iw~~~~~~  171 (933)
T KOG1274|consen   93 ARFTLPIRDLAVSGSGKMIAAGSDDTAVKLLNLDDSSQEKVLRGHDAPVLQLSYDP-KGNFLAVSSCDGKVQIWDLQDGI  171 (933)
T ss_pred             eeeeccceEEEEecCCcEEEeecCceeEEEEeccccchheeecccCCceeeeeEcC-CCCEEEEEecCceEEEEEcccch
Confidence            68999999999999999999999999999999999988888999999999999965 58899999999999999986432


Q ss_pred             CCCcccee---eccc-ccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccC
Q 022074          116 VKGKPAGV---LMGH-LEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKH  191 (303)
Q Consensus       116 ~~~~~~~~---~~~h-~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  191 (303)
                      ........   ...- ...+.-++|+|++..|+..+.|+.|++|+...-...+....                       
T Consensus       172 ~~~tl~~v~k~n~~~~s~i~~~~aW~Pk~g~la~~~~d~~Vkvy~r~~we~~f~Lr~-----------------------  228 (933)
T KOG1274|consen  172 LSKTLTGVDKDNEFILSRICTRLAWHPKGGTLAVPPVDNTVKVYSRKGWELQFKLRD-----------------------  228 (933)
T ss_pred             hhhhcccCCccccccccceeeeeeecCCCCeEEeeccCCeEEEEccCCceeheeecc-----------------------
Confidence            21111111   1111 33456789999988999999999999998643221111100                       


Q ss_pred             CCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEECCCCCeEEEEe
Q 022074          192 PCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQPMLVSSS  271 (303)
Q Consensus       192 ~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~las~s  271 (303)
                                 ......    .....|+|.|+|||+++.||.|.|||.++-+.    ..-...|.+++|.|+.+-+---.
T Consensus       229 -----------~~~ss~----~~~~~wsPnG~YiAAs~~~g~I~vWnv~t~~~----~~~~~~Vc~~aw~p~~n~it~~~  289 (933)
T KOG1274|consen  229 -----------KLSSSK----FSDLQWSPNGKYIAASTLDGQILVWNVDTHER----HEFKRAVCCEAWKPNANAITLIT  289 (933)
T ss_pred             -----------cccccc----eEEEEEcCCCcEEeeeccCCcEEEEecccchh----ccccceeEEEecCCCCCeeEEEe
Confidence                       000000    11234788999999999999999999988222    11235799999999998666555


Q ss_pred             CCCCEEEee
Q 022074          272 WDGDVVRWE  280 (303)
Q Consensus       272 ~Dg~i~~Wd  280 (303)
                      ..|..-+|.
T Consensus       290 ~~g~~~~~~  298 (933)
T KOG1274|consen  290 ALGTLGVSP  298 (933)
T ss_pred             eccccccCh
Confidence            566766665


No 146
>KOG1408 consensus WD40 repeat protein [Function unknown]
Probab=99.72  E-value=2.7e-15  Score=137.14  Aligned_cols=239  Identities=20%  Similarity=0.279  Sum_probs=153.8

Q ss_pred             eEEEEEcCCCCEEEEeeCCCeEEEEECCCCce---EEEEecccCCeEEEEEcc-----C-----CCcEEEEecCCCeEEE
Q 022074           42 IFSLKFSTDGRELVAGSSDDCIYVYDLEANKL---SLRILAHTSDVNTVCFGD-----E-----SGHLIYSGSDDNLCKV  108 (303)
Q Consensus        42 v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~---~~~~~~h~~~v~~l~~~~-----~-----~~~~l~s~s~dg~v~l  108 (303)
                      -.++.|++...++.+.-.|..++|||++.-..   ...+..|...|..+.--|     +     -...|.|++.|++||+
T Consensus       327 ~IA~~Fdet~~klscVYndhSlYvWDvrD~~kvgk~~s~lyHS~ciW~Ve~~p~nv~~~~~aclp~~cF~TCSsD~TIRl  406 (1080)
T KOG1408|consen  327 AIACQFDETTDKLSCVYNDHSLYVWDVRDVNKVGKCSSMLYHSACIWDVENLPCNVHSPTAACLPRGCFTTCSSDGTIRL  406 (1080)
T ss_pred             eeEEEecCCCceEEEEEcCceEEEEeccccccccceeeeeeccceeeeeccccccccCcccccCCccceeEecCCCcEEE
Confidence            67899999999999999999999999976432   234567887777664322     0     1236889999999999


Q ss_pred             EcCccccCCCc---------------------------------cceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEE
Q 022074          109 WDRRCLNVKGK---------------------------------PAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLW  155 (303)
Q Consensus       109 Wd~~~~~~~~~---------------------------------~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lW  155 (303)
                      ||+........                                 ......+...++.+++++|+|.+|++|...|.+|+|
T Consensus       407 W~l~~ctnn~vyrRNils~~l~ki~y~d~~~q~~~d~~~~~fdka~~s~~d~r~G~R~~~vSp~gqhLAsGDr~GnlrVy  486 (1080)
T KOG1408|consen  407 WDLAFCTNNQVYRRNILSANLSKIPYEDSTQQIMHDASAGIFDKALVSTCDSRFGFRALAVSPDGQHLASGDRGGNLRVY  486 (1080)
T ss_pred             eecccccccceeecccchhhhhcCccccCchhhhhhccCCcccccchhhcCcccceEEEEECCCcceecccCccCceEEE
Confidence            99853110000                                 000112234578899999999999999999999999


Q ss_pred             EcccccCCcccccCccceeeeceeeeCC-CC--CccccCCC-C------------CcceEEecccceeeeEE--------
Q 022074          156 DIRKMSSNASCNLGFRSYEWDYRWMDYP-PQ--ARDLKHPC-D------------QSVATYKGHSVLRTLIR--------  211 (303)
Q Consensus       156 dl~~~~~~~~~~~~~~~~~~~~~~~~~~-~~--~~~~~~~~-~------------~~~~~~~~~~~~~~~~~--------  211 (303)
                      ||..+.......    ..+.++..++|+ |.  .+.+.... +            ..+.++++|...++.++        
T Consensus       487 ~Lq~l~~~~~~e----AHesEilcLeyS~p~~~~kLLASasrdRlIHV~Dv~rny~l~qtld~HSssITsvKFa~~gln~  562 (1080)
T KOG1408|consen  487 DLQELEYTCFME----AHESEILCLEYSFPVLTNKLLASASRDRLIHVYDVKRNYDLVQTLDGHSSSITSVKFACNGLNR  562 (1080)
T ss_pred             Eehhhhhhhhee----cccceeEEEeecCchhhhHhhhhccCCceEEEEecccccchhhhhcccccceeEEEEeecCCce
Confidence            987643221111    111111111111 00  00000000 0            00111222211111100        


Q ss_pred             ----E-------------------------------eeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeec---CCC
Q 022074          212 ----C-------------------------------HFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKY---HTS  253 (303)
Q Consensus       212 ----~-------------------------------~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~---h~~  253 (303)
                          |                               .+.....|.-++++++++|+.|+|||+.+|+..+.|++   |++
T Consensus       563 ~MiscGADksimFr~~qk~~~g~~f~r~t~t~~ktTlYDm~Vdp~~k~v~t~cQDrnirif~i~sgKq~k~FKgs~~~eG  642 (1080)
T KOG1408|consen  563 KMISCGADKSIMFRVNQKASSGRLFPRHTQTLSKTTLYDMAVDPTSKLVVTVCQDRNIRIFDIESGKQVKSFKGSRDHEG  642 (1080)
T ss_pred             EEEeccCchhhheehhccccCceeccccccccccceEEEeeeCCCcceEEEEecccceEEEeccccceeeeecccccCCC
Confidence                0                               00111245678999999999999999999999999974   667


Q ss_pred             CeEEEEECCCCCeEEEEeCCCCEEEeecCCC
Q 022074          254 PVRDCSWHPSQPMLVSSSWDGDVVRWEFPGN  284 (303)
Q Consensus       254 ~I~~v~~sp~~~~las~s~Dg~i~~Wd~~~~  284 (303)
                      ..-.+...|.|.||||.+.|.+|.++|+-++
T Consensus       643 ~lIKv~lDPSgiY~atScsdktl~~~Df~sg  673 (1080)
T KOG1408|consen  643 DLIKVILDPSGIYLATSCSDKTLCFVDFVSG  673 (1080)
T ss_pred             ceEEEEECCCccEEEEeecCCceEEEEeccc
Confidence            8889999999999999999999999998654


No 147
>KOG0268 consensus Sof1-like rRNA processing protein (contains WD40 repeats) [RNA processing and modification]
Probab=99.72  E-value=3.1e-17  Score=139.22  Aligned_cols=163  Identities=24%  Similarity=0.383  Sum_probs=125.7

Q ss_pred             ceEEEEEcCCCC-EEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCc
Q 022074           41 GIFSLKFSTDGR-ELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGK  119 (303)
Q Consensus        41 ~v~~l~~s~~g~-~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~  119 (303)
                      .|.++.|+|... .|++|..|+.|.|||+.++....++. -+...+.++|+| .+-.|++|++|..++.+|.+..   .+
T Consensus       189 ti~svkfNpvETsILas~~sDrsIvLyD~R~~~Pl~KVi-~~mRTN~IswnP-eafnF~~a~ED~nlY~~DmR~l---~~  263 (433)
T KOG0268|consen  189 SISSVKFNPVETSILASCASDRSIVLYDLRQASPLKKVI-LTMRTNTICWNP-EAFNFVAANEDHNLYTYDMRNL---SR  263 (433)
T ss_pred             ceeEEecCCCcchheeeeccCCceEEEecccCCccceee-eeccccceecCc-cccceeeccccccceehhhhhh---cc
Confidence            378888888766 56667799999999999998765443 234678999988 6888999999999999998743   45


Q ss_pred             cceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCcceE
Q 022074          120 PAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVAT  199 (303)
Q Consensus       120 ~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  199 (303)
                      +...+.+|..+|..++|+|.|..|++||.|++||||..+.......+                                 
T Consensus       264 p~~v~~dhvsAV~dVdfsptG~EfvsgsyDksIRIf~~~~~~SRdiY---------------------------------  310 (433)
T KOG0268|consen  264 PLNVHKDHVSAVMDVDFSPTGQEFVSGSYDKSIRIFPVNHGHSRDIY---------------------------------  310 (433)
T ss_pred             cchhhcccceeEEEeccCCCcchhccccccceEEEeecCCCcchhhh---------------------------------
Confidence            77888999999999999999999999999999999987653221100                                 


Q ss_pred             EecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEE
Q 022074          200 YKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAA  247 (303)
Q Consensus       200 ~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~  247 (303)
                            ..+.+.-.++..||.|.+|+++|+.|+.|++|.....+++..
T Consensus       311 ------htkRMq~V~~Vk~S~Dskyi~SGSdd~nvRlWka~Aseklgv  352 (433)
T KOG0268|consen  311 ------HTKRMQHVFCVKYSMDSKYIISGSDDGNVRLWKAKASEKLGV  352 (433)
T ss_pred             ------hHhhhheeeEEEEeccccEEEecCCCcceeeeecchhhhcCC
Confidence                  000111223445788999999999999999999765554443


No 148
>KOG0771 consensus Prolactin regulatory element-binding protein/Protein transport protein SEC12p [Intracellular trafficking, secretion, and vesicular transport]
Probab=99.72  E-value=3.5e-16  Score=135.16  Aligned_cols=200  Identities=21%  Similarity=0.256  Sum_probs=146.1

Q ss_pred             EEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCccce
Q 022074           43 FSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAG  122 (303)
Q Consensus        43 ~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~  122 (303)
                      .+++|+.+|..+++|+.||++|+|+.+..........|.+.|.++.|++ +++.|++.+.| ..++|+....    ....
T Consensus       148 k~vaf~~~gs~latgg~dg~lRv~~~Ps~~t~l~e~~~~~eV~DL~FS~-dgk~lasig~d-~~~VW~~~~g----~~~a  221 (398)
T KOG0771|consen  148 KVVAFNGDGSKLATGGTDGTLRVWEWPSMLTILEEIAHHAEVKDLDFSP-DGKFLASIGAD-SARVWSVNTG----AALA  221 (398)
T ss_pred             eEEEEcCCCCEeeeccccceEEEEecCcchhhhhhHhhcCccccceeCC-CCcEEEEecCC-ceEEEEeccC----chhh
Confidence            7899999999999999999999999888777777888999999999975 68899999999 9999997633    1221


Q ss_pred             eec--ccccCeEEEEeCCCC-----CEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCC
Q 022074          123 VLM--GHLEGITFIDSRGDG-----RYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQ  195 (303)
Q Consensus       123 ~~~--~h~~~v~~~~~~~~~-----~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  195 (303)
                      ...  +.......+.|+.++     ..++....-+.|+.||+...+......                         ..+
T Consensus       222 ~~t~~~k~~~~~~cRF~~d~~~~~l~laa~~~~~~~v~~~~~~~w~~~~~l~-------------------------~~~  276 (398)
T KOG0771|consen  222 RKTPFSKDEMFSSCRFSVDNAQETLRLAASQFPGGGVRLCDISLWSGSNFLR-------------------------LRK  276 (398)
T ss_pred             hcCCcccchhhhhceecccCCCceEEEEEecCCCCceeEEEeeeeccccccc-------------------------hhh
Confidence            111  122234455566555     233445566778888765321110000                         000


Q ss_pred             cceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEe-ecCCCCeEEEEECCCCCeEEEEeCCC
Q 022074          196 SVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAAL-KYHTSPVRDCSWHPSQPMLVSSSWDG  274 (303)
Q Consensus       196 ~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~-~~h~~~I~~v~~sp~~~~las~s~Dg  274 (303)
                      .+..+          ....+.+.|.+|+++|.|+.||.|.|++..+.+.+..+ +.|..-|+++.|+||.+++++.+.|.
T Consensus       277 ~~~~~----------~siSsl~VS~dGkf~AlGT~dGsVai~~~~~lq~~~~vk~aH~~~VT~ltF~Pdsr~~~svSs~~  346 (398)
T KOG0771|consen  277 KIKRF----------KSISSLAVSDDGKFLALGTMDGSVAIYDAKSLQRLQYVKEAHLGFVTGLTFSPDSRYLASVSSDN  346 (398)
T ss_pred             hhhcc----------CcceeEEEcCCCcEEEEeccCCcEEEEEeceeeeeEeehhhheeeeeeEEEcCCcCcccccccCC
Confidence            01100          01123346789999999999999999999998876655 58999999999999999999999999


Q ss_pred             CEEEeecCC
Q 022074          275 DVVRWEFPG  283 (303)
Q Consensus       275 ~i~~Wd~~~  283 (303)
                      ++.+-.++.
T Consensus       347 ~~~v~~l~v  355 (398)
T KOG0771|consen  347 EAAVTKLAV  355 (398)
T ss_pred             ceeEEEEee
Confidence            999999875


No 149
>KOG0307 consensus Vesicle coat complex COPII, subunit SEC31 [Intracellular trafficking, secretion, and vesicular transport]
Probab=99.72  E-value=6.9e-17  Score=154.15  Aligned_cols=235  Identities=20%  Similarity=0.336  Sum_probs=164.9

Q ss_pred             cceEEEEEcCCCCE----EEEeeCCCeEEEEECCC---Cc---eEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEE
Q 022074           40 FGIFSLKFSTDGRE----LVAGSSDDCIYVYDLEA---NK---LSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVW  109 (303)
Q Consensus        40 ~~v~~l~~s~~g~~----l~sgs~Dg~v~lwd~~~---~~---~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lW  109 (303)
                      .+++.++|.+.|..    |+.|..||.|.+||...   +.   .+.+...|++.|..+.|++..+++|++|+.||.|.+|
T Consensus        65 ~rF~kL~W~~~g~~~~GlIaGG~edG~I~ly~p~~~~~~~~~~~la~~~~h~G~V~gLDfN~~q~nlLASGa~~geI~iW  144 (1049)
T KOG0307|consen   65 NRFNKLAWGSYGSHSHGLIAGGLEDGNIVLYDPASIIANASEEVLATKSKHTGPVLGLDFNPFQGNLLASGADDGEILIW  144 (1049)
T ss_pred             ccceeeeecccCCCccceeeccccCCceEEecchhhccCcchHHHhhhcccCCceeeeeccccCCceeeccCCCCcEEEe
Confidence            47899999998887    88888999999999865   22   2234567999999999999889999999999999999


Q ss_pred             cCccccCCCcccee-ecccccCeEEEEeCCC-CCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCC-C
Q 022074          110 DRRCLNVKGKPAGV-LMGHLEGITFIDSRGD-GRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQ-A  186 (303)
Q Consensus       110 d~~~~~~~~~~~~~-~~~h~~~v~~~~~~~~-~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~  186 (303)
                      |+...+   .+... -....+.|.+++|+.. .+.|++++.++.+-|||+|..+.+-........  ..+..+.+.|+ .
T Consensus       145 Dlnn~~---tP~~~~~~~~~~eI~~lsWNrkvqhILAS~s~sg~~~iWDlr~~~pii~ls~~~~~--~~~S~l~WhP~~a  219 (1049)
T KOG0307|consen  145 DLNKPE---TPFTPGSQAPPSEIKCLSWNRKVSHILASGSPSGRAVIWDLRKKKPIIKLSDTPGR--MHCSVLAWHPDHA  219 (1049)
T ss_pred             ccCCcC---CCCCCCCCCCcccceEeccchhhhHHhhccCCCCCceeccccCCCcccccccCCCc--cceeeeeeCCCCc
Confidence            986322   22211 1224567999999875 456789999999999999986544333221110  11111222222 1


Q ss_pred             ccc-cCCCC---------------CcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeec
Q 022074          187 RDL-KHPCD---------------QSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKY  250 (303)
Q Consensus       187 ~~~-~~~~~---------------~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~  250 (303)
                      ..+ ....+               ..+..+.+|..-  ++...|++   .|..+|++++.|+.|.+|+..+++.+..+..
T Consensus       220 Tql~~As~dd~~PviqlWDlR~assP~k~~~~H~~G--ilslsWc~---~D~~lllSsgkD~~ii~wN~~tgEvl~~~p~  294 (1049)
T KOG0307|consen  220 TQLLVASGDDSAPVIQLWDLRFASSPLKILEGHQRG--ILSLSWCP---QDPRLLLSSGKDNRIICWNPNTGEVLGELPA  294 (1049)
T ss_pred             eeeeeecCCCCCceeEeecccccCCchhhhcccccc--eeeeccCC---CCchhhhcccCCCCeeEecCCCceEeeecCC
Confidence            111 11111               112222344311  12222222   2558999999999999999999999999988


Q ss_pred             CCCCeEEEEECCCCC-eEEEEeCCCCEEEeecCCC
Q 022074          251 HTSPVRDCSWHPSQP-MLVSSSWDGDVVRWEFPGN  284 (303)
Q Consensus       251 h~~~I~~v~~sp~~~-~las~s~Dg~i~~Wd~~~~  284 (303)
                      ...++.++.|.|..+ +++.++-||.|.++.+.+.
T Consensus       295 ~~nW~fdv~w~pr~P~~~A~asfdgkI~I~sl~~~  329 (1049)
T KOG0307|consen  295 QGNWCFDVQWCPRNPSVMAAASFDGKISIYSLQGT  329 (1049)
T ss_pred             CCcceeeeeecCCCcchhhhheeccceeeeeeecC
Confidence            889999999999987 8999999999999998654


No 150
>KOG0294 consensus WD40 repeat-containing protein [Function unknown]
Probab=99.71  E-value=3.4e-15  Score=124.86  Aligned_cols=236  Identities=19%  Similarity=0.225  Sum_probs=151.0

Q ss_pred             ccCchhhccccccccccCcCcccccCCCcccceEEEEEcCCCC--EEEEeeCCCeEEEEECCCCceEEEEecccCCeEEE
Q 022074           10 VGSGTMESLANVTEIHDGLDFSAADDGGYSFGIFSLKFSTDGR--ELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTV   87 (303)
Q Consensus        10 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~s~~g~--~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l   87 (303)
                      +-||+.|--|+|.+.-...+..  +.--|...|.++.|.++-.  .|++|+.||.|.+|+...-.+...+.+|.+.|+.+
T Consensus        56 ~aSGssDetI~IYDm~k~~qlg--~ll~HagsitaL~F~~~~S~shLlS~sdDG~i~iw~~~~W~~~~slK~H~~~Vt~l  133 (362)
T KOG0294|consen   56 VASGSSDETIHIYDMRKRKQLG--ILLSHAGSITALKFYPPLSKSHLLSGSDDGHIIIWRVGSWELLKSLKAHKGQVTDL  133 (362)
T ss_pred             EeccCCCCcEEEEeccchhhhc--ceeccccceEEEEecCCcchhheeeecCCCcEEEEEcCCeEEeeeeccccccccee
Confidence            4688999999999886666552  3467899999999998765  89999999999999998888888899999999999


Q ss_pred             EEccCCCcEEEEecCCCeEEEEcCccccCCCccceee-cccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCccc
Q 022074           88 CFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVL-MGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASC  166 (303)
Q Consensus        88 ~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~-~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~  166 (303)
                      +.+| .+++.++.+.|+.+++|++-    +++....+ ..+....  +.|++.|.+|+.++. ..|-+|.+..-......
T Consensus       134 siHP-S~KLALsVg~D~~lr~WNLV----~Gr~a~v~~L~~~at~--v~w~~~Gd~F~v~~~-~~i~i~q~d~A~v~~~i  205 (362)
T KOG0294|consen  134 SIHP-SGKLALSVGGDQVLRTWNLV----RGRVAFVLNLKNKATL--VSWSPQGDHFVVSGR-NKIDIYQLDNASVFREI  205 (362)
T ss_pred             EecC-CCceEEEEcCCceeeeehhh----cCccceeeccCCccee--eEEcCCCCEEEEEec-cEEEEEecccHhHhhhh
Confidence            9964 68899999999999999973    22211111 1232222  667888988888877 46788876542211000


Q ss_pred             ccCccceeeece---eeeCCCC--CccccCCC-CCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECC
Q 022074          167 NLGFRSYEWDYR---WMDYPPQ--ARDLKHPC-DQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLV  240 (303)
Q Consensus       167 ~~~~~~~~~~~~---~~~~~~~--~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~  240 (303)
                      ....+.+.....   .+....+  ...+.... ..+...+.+|....+-+...    -.+++.+|+|+|+||.|++||++
T Consensus       206 ~~~~r~l~~~~l~~~~L~vG~d~~~i~~~D~ds~~~~~~~~AH~~RVK~i~~~----~~~~~~~lvTaSSDG~I~vWd~~  281 (362)
T KOG0294|consen  206 ENPKRILCATFLDGSELLVGGDNEWISLKDTDSDTPLTEFLAHENRVKDIASY----TNPEHEYLVTASSDGFIKVWDID  281 (362)
T ss_pred             hccccceeeeecCCceEEEecCCceEEEeccCCCccceeeecchhheeeeEEE----ecCCceEEEEeccCceEEEEEcc
Confidence            000000000000   0000000  00001111 22344556665544433222    23567899999999999999998


Q ss_pred             CC-----eEEEEeecCCCCeEEEEE
Q 022074          241 SG-----EQVAALKYHTSPVRDCSW  260 (303)
Q Consensus       241 ~~-----~~~~~~~~h~~~I~~v~~  260 (303)
                      ..     +.+..+.. ..+++|+..
T Consensus       282 ~~~k~~~~~l~e~n~-~~RltCl~~  305 (362)
T KOG0294|consen  282 METKKRPTLLAELNT-NVRLTCLRV  305 (362)
T ss_pred             ccccCCcceeEEeec-CCccceeee
Confidence            65     34555543 445666554


No 151
>KOG1009 consensus Chromatin assembly complex 1 subunit B/CAC2 (contains WD40 repeats) [Chromatin structure and dynamics; Replication, recombination and repair]
Probab=99.71  E-value=1.5e-15  Score=130.64  Aligned_cols=265  Identities=18%  Similarity=0.299  Sum_probs=176.9

Q ss_pred             CchhhccccccccccCcCcc-------cccCCCcccceEEEEEcCCCCEEEEeeCCCeEEEEECC--------C-----C
Q 022074           12 SGTMESLANVTEIHDGLDFS-------AADDGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLE--------A-----N   71 (303)
Q Consensus        12 ~~~~~~~~~~~~~~~~~~~~-------~~~~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~--------~-----~   71 (303)
                      +|.-|+-|-+|.+.++....       -...++|..+|+++.|+|+|+.+++|+.+|.|.+|-..        +     .
T Consensus        31 T~G~D~~iriW~v~r~~~~~~~~~V~y~s~Ls~H~~aVN~vRf~p~gelLASg~D~g~v~lWk~~~~~~~~~d~e~~~~k  110 (434)
T KOG1009|consen   31 TAGGDKDIRIWKVNRSEPGGGDMKVEYLSSLSRHTRAVNVVRFSPDGELLASGGDGGEVFLWKQGDVRIFDADTEADLNK  110 (434)
T ss_pred             cccCccceeeeeeeecCCCCCceeEEEeecccCCcceeEEEEEcCCcCeeeecCCCceEEEEEecCcCCccccchhhhCc
Confidence            45557777888887766442       23457999999999999999999999999999999665        2     1


Q ss_pred             ---ceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeC
Q 022074           72 ---KLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGK  148 (303)
Q Consensus        72 ---~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~  148 (303)
                         .....+.+|...+..++|++ ++..+++++.|.++++||+.    .+.....+.+|..-|..+++.|.+.++++-+.
T Consensus       111 e~w~v~k~lr~h~~diydL~Ws~-d~~~l~s~s~dns~~l~Dv~----~G~l~~~~~dh~~yvqgvawDpl~qyv~s~s~  185 (434)
T KOG1009|consen  111 EKWVVKKVLRGHRDDIYDLAWSP-DSNFLVSGSVDNSVRLWDVH----AGQLLAILDDHEHYVQGVAWDPLNQYVASKSS  185 (434)
T ss_pred             cceEEEEEecccccchhhhhccC-CCceeeeeeccceEEEEEec----cceeEeeccccccccceeecchhhhhhhhhcc
Confidence               12244667999999999975 57899999999999999986    45566677889999999999999999999999


Q ss_pred             CCcEEEEEcccccCCcccc-----------cCcccee-ee-------ceeeeCCCCCccccCCCC----------CcceE
Q 022074          149 DQAIKLWDIRKMSSNASCN-----------LGFRSYE-WD-------YRWMDYPPQARDLKHPCD----------QSVAT  199 (303)
Q Consensus       149 D~~v~lWdl~~~~~~~~~~-----------~~~~~~~-~~-------~~~~~~~~~~~~~~~~~~----------~~~~~  199 (303)
                      |+..+.+.+.......-+.           ...+... +.       .+...+.|++..+..+..          +....
T Consensus       186 dr~~~~~~~~~~~~~~~~~~~~m~~~~~~~~e~~s~rLfhDeTlksFFrRlsfTPdG~llvtPag~~~~g~~~~~n~tYv  265 (434)
T KOG1009|consen  186 DRHPEGFSAKLKQVIKRHGLDIMPAKAFNEREGKSTRLFHDETLKSFFRRLSFTPDGSLLVTPAGLFKVGGGVFRNTSYV  265 (434)
T ss_pred             CcccceeeeeeeeeeeeeeeeEeeecccCCCCcceeeeeecCchhhhhhhcccCCCCcEEEcccceeeeCCceeeceeEe
Confidence            9987877654321111000           0000000 00       112233344433332221          11122


Q ss_pred             Eeccccee----------eeEEEeeeeee-------------e-CCCeEEEEEeCCCeEEEEECCCCeEEEEe-ecCCCC
Q 022074          200 YKGHSVLR----------TLIRCHFSPVY-------------S-TGQKYIYTGSHDSCVYVYDLVSGEQVAAL-KYHTSP  254 (303)
Q Consensus       200 ~~~~~~~~----------~~~~~~~~~~~-------------s-~~~~~latg~~dg~i~iwd~~~~~~~~~~-~~h~~~  254 (303)
                      ++++..-+          ..+...++|++             + |.+-.+|.+. ...+++||.++-+++... ..|=.+
T Consensus       266 fsrk~l~rP~~~lp~~~k~~lavr~~pVy~elrp~~~~~~~~~lpyrlvfaiAt-~~svyvydtq~~~P~~~v~nihy~~  344 (434)
T KOG1009|consen  266 FSRKDLKRPAARLPSPKKPALAVRFSPVYYELRPLSSEKFLFVLPYRLVFAIAT-KNSVYVYDTQTLEPLAVVDNIHYSA  344 (434)
T ss_pred             eccccccCceeecCCCCcceEEEEeeeeEEEeccccccccccccccceEEEEee-cceEEEeccccccceEEEeeeeeee
Confidence            22221111          11112223321             1 2334456665 557999999988876655 467789


Q ss_pred             eEEEEECCCCCeEEEEeCCCCEEEeecC
Q 022074          255 VRDCSWHPSQPMLVSSSWDGDVVRWEFP  282 (303)
Q Consensus       255 I~~v~~sp~~~~las~s~Dg~i~~Wd~~  282 (303)
                      |++++||+||.+|+..|.||-+.+=.+.
T Consensus       345 iTDiaws~dg~~l~vSS~DGyCS~vtfe  372 (434)
T KOG1009|consen  345 ITDIAWSDDGSVLLVSSTDGFCSLVTFE  372 (434)
T ss_pred             ecceeecCCCcEEEEeccCCceEEEEEc
Confidence            9999999999999999999988876654


No 152
>KOG2048 consensus WD40 repeat protein [General function prediction only]
Probab=99.70  E-value=2.8e-15  Score=136.20  Aligned_cols=230  Identities=15%  Similarity=0.201  Sum_probs=166.5

Q ss_pred             cccccccccCcCcccccCCCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEE
Q 022074           18 LANVTEIHDGLDFSAADDGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLI   97 (303)
Q Consensus        18 ~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l   97 (303)
                      .|.+|-...+.-.+....++-...|-+++|+ +|.+|.+.+-+|.|.=||+.+.+....+..-.+.+..++.+|. .+.+
T Consensus        48 ~IEiwN~~~~w~~~~vi~g~~drsIE~L~W~-e~~RLFS~g~sg~i~EwDl~~lk~~~~~d~~gg~IWsiai~p~-~~~l  125 (691)
T KOG2048|consen   48 NIEIWNLSNNWFLEPVIHGPEDRSIESLAWA-EGGRLFSSGLSGSITEWDLHTLKQKYNIDSNGGAIWSIAINPE-NTIL  125 (691)
T ss_pred             cEEEEccCCCceeeEEEecCCCCceeeEEEc-cCCeEEeecCCceEEEEecccCceeEEecCCCcceeEEEeCCc-cceE
Confidence            3444444444433333335566789999999 5667999999999999999999887777777788999999765 5788


Q ss_pred             EEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeec
Q 022074           98 YSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDY  177 (303)
Q Consensus        98 ~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~  177 (303)
                      +.|+.||.+..++...  ..-.....+....+.+.+++|++++..+++|+.||.||+||.......-       ..... 
T Consensus       126 ~IgcddGvl~~~s~~p--~~I~~~r~l~rq~sRvLslsw~~~~~~i~~Gs~Dg~Iriwd~~~~~t~~-------~~~~~-  195 (691)
T KOG2048|consen  126 AIGCDDGVLYDFSIGP--DKITYKRSLMRQKSRVLSLSWNPTGTKIAGGSIDGVIRIWDVKSGQTLH-------IITMQ-  195 (691)
T ss_pred             EeecCCceEEEEecCC--ceEEEEeecccccceEEEEEecCCccEEEecccCceEEEEEcCCCceEE-------Eeeec-
Confidence            8899999666666431  1112233455556789999999999999999999999999986532110       00000 


Q ss_pred             eeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEE
Q 022074          178 RWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRD  257 (303)
Q Consensus       178 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~  257 (303)
                                         +..+.     +....+.|+..|=.+ ..+++|.+-|+|.+||...+.++..++.|...|.+
T Consensus       196 -------------------~d~l~-----k~~~~iVWSv~~Lrd-~tI~sgDS~G~V~FWd~~~gTLiqS~~~h~adVl~  250 (691)
T KOG2048|consen  196 -------------------LDRLS-----KREPTIVWSVLFLRD-STIASGDSAGTVTFWDSIFGTLIQSHSCHDADVLA  250 (691)
T ss_pred             -------------------ccccc-----cCCceEEEEEEEeec-CcEEEecCCceEEEEcccCcchhhhhhhhhcceeE
Confidence                               00000     000112233333333 46899999999999999999999999999999999


Q ss_pred             EEECCCCCeEEEEeCCCCEEEeecCCC
Q 022074          258 CSWHPSQPMLVSSSWDGDVVRWEFPGN  284 (303)
Q Consensus       258 v~~sp~~~~las~s~Dg~i~~Wd~~~~  284 (303)
                      ++-++++..+++++.|+.+.-+...++
T Consensus       251 Lav~~~~d~vfsaGvd~~ii~~~~~~~  277 (691)
T KOG2048|consen  251 LAVADNEDRVFSAGVDPKIIQYSLTTN  277 (691)
T ss_pred             EEEcCCCCeEEEccCCCceEEEEecCC
Confidence            999999999999999999998876554


No 153
>KOG1273 consensus WD40 repeat protein [General function prediction only]
Probab=99.69  E-value=3.4e-15  Score=124.92  Aligned_cols=241  Identities=21%  Similarity=0.249  Sum_probs=155.6

Q ss_pred             cccCCCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcC
Q 022074           32 AADDGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDR  111 (303)
Q Consensus        32 ~~~~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~  111 (303)
                      +-..+||..||.+++||+||+.|+++|.|..|.+||+..|....++ ..+.+|..+.|+|.+.+.++..-.+..-.+-+.
T Consensus        58 ar~lsaH~~pi~sl~WS~dgr~LltsS~D~si~lwDl~~gs~l~ri-rf~spv~~~q~hp~k~n~~va~~~~~sp~vi~~  136 (405)
T KOG1273|consen   58 ARMLSAHVRPITSLCWSRDGRKLLTSSRDWSIKLWDLLKGSPLKRI-RFDSPVWGAQWHPRKRNKCVATIMEESPVVIDF  136 (405)
T ss_pred             hhhhhccccceeEEEecCCCCEeeeecCCceeEEEeccCCCceeEE-EccCccceeeeccccCCeEEEEEecCCcEEEEe
Confidence            4455899999999999999999999999999999999999865544 466789999998766555544433332333332


Q ss_pred             ccccCCCccceeeccc----cc-CeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCC
Q 022074          112 RCLNVKGKPAGVLMGH----LE-GITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQA  186 (303)
Q Consensus       112 ~~~~~~~~~~~~~~~h----~~-~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  186 (303)
                      ...     ....+...    .+ .-.+..|.+.|+++++|..-|.+.++|....+...+++...   .-.++.+.++..+
T Consensus       137 s~~-----~h~~Lp~d~d~dln~sas~~~fdr~g~yIitGtsKGkllv~~a~t~e~vas~rits---~~~IK~I~~s~~g  208 (405)
T KOG1273|consen  137 SDP-----KHSVLPKDDDGDLNSSASHGVFDRRGKYIITGTSKGKLLVYDAETLECVASFRITS---VQAIKQIIVSRKG  208 (405)
T ss_pred             cCC-----ceeeccCCCccccccccccccccCCCCEEEEecCcceEEEEecchheeeeeeeech---heeeeEEEEeccC
Confidence            210     01111111    01 11122478899999999999999999988776665554321   0111122222222


Q ss_pred             ccccC-CCCCcceEEecccceee---------------eEEEee-eeeeeCCCeEEEEEe-CCCeEEEEECCCCeEEEEe
Q 022074          187 RDLKH-PCDQSVATYKGHSVLRT---------------LIRCHF-SPVYSTGQKYIYTGS-HDSCVYVYDLVSGEQVAAL  248 (303)
Q Consensus       187 ~~~~~-~~~~~~~~~~~~~~~~~---------------~~~~~~-~~~~s~~~~~latg~-~dg~i~iwd~~~~~~~~~~  248 (303)
                      ..+.. ..++.+.++........               +-+-.| +-.||.+|.|++.|+ ....++||.-..|.+++.+
T Consensus       209 ~~liiNtsDRvIR~ye~~di~~~~r~~e~e~~~K~qDvVNk~~Wk~ccfs~dgeYv~a~s~~aHaLYIWE~~~GsLVKIL  288 (405)
T KOG1273|consen  209 RFLIINTSDRVIRTYEISDIDDEGRDGEVEPEHKLQDVVNKLQWKKCCFSGDGEYVCAGSARAHALYIWEKSIGSLVKIL  288 (405)
T ss_pred             cEEEEecCCceEEEEehhhhcccCccCCcChhHHHHHHHhhhhhhheeecCCccEEEeccccceeEEEEecCCcceeeee
Confidence            22211 22334444332211100               000001 112677899888776 4667999999999999999


Q ss_pred             ecCC-CCeEEEEECCCCCeEEEEeCCCCEEEeecC
Q 022074          249 KYHT-SPVRDCSWHPSQPMLVSSSWDGDVVRWEFP  282 (303)
Q Consensus       249 ~~h~-~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~  282 (303)
                      .+.+ ....++.|+|-.+.+++- ..|++++|...
T Consensus       289 hG~kgE~l~DV~whp~rp~i~si-~sg~v~iw~~~  322 (405)
T KOG1273|consen  289 HGTKGEELLDVNWHPVRPIIASI-ASGVVYIWAVV  322 (405)
T ss_pred             cCCchhheeecccccceeeeeec-cCCceEEEEee
Confidence            8887 578899999999999999 67999999854


No 154
>KOG1539 consensus WD repeat protein [General function prediction only]
Probab=99.68  E-value=1.2e-15  Score=141.53  Aligned_cols=198  Identities=22%  Similarity=0.384  Sum_probs=146.7

Q ss_pred             ceEEEEEcCC-----CCEEEEeeCCCeEEEEECCCCceEEEEeccc------CCeEEEEEccCCCcEEEEecCCCeEEEE
Q 022074           41 GIFSLKFSTD-----GRELVAGSSDDCIYVYDLEANKLSLRILAHT------SDVNTVCFGDESGHLIYSGSDDNLCKVW  109 (303)
Q Consensus        41 ~v~~l~~s~~-----g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~------~~v~~l~~~~~~~~~l~s~s~dg~v~lW  109 (303)
                      +|..+++.-.     .+.+++.-.+..++.|+......-.-...++      ..+.+++.+ .+|+..+.|...|+|-+|
T Consensus       397 ~i~~fa~~~~RE~~W~Nv~~~h~~~~~~~tW~~~n~~~G~~~L~~~~~~~~~~~~~av~vs-~CGNF~~IG~S~G~Id~f  475 (910)
T KOG1539|consen  397 PIVEFAFENAREKEWDNVITAHKGKRSAYTWNFRNKTSGRHVLDPKRFKKDDINATAVCVS-FCGNFVFIGYSKGTIDRF  475 (910)
T ss_pred             cceeeecccchhhhhcceeEEecCcceEEEEeccCcccccEEecCccccccCcceEEEEEe-ccCceEEEeccCCeEEEE
Confidence            4555555532     2344445566779999997765422222333      577888885 589999999999999999


Q ss_pred             cCccccCCCccceee---cccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCC
Q 022074          110 DRRCLNVKGKPAGVL---MGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQA  186 (303)
Q Consensus       110 d~~~~~~~~~~~~~~---~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  186 (303)
                      +..+    +.....+   ..|.++|+.++...-++.+++++.||.+++||........+..++                 
T Consensus       476 NmQS----Gi~r~sf~~~~ah~~~V~gla~D~~n~~~vsa~~~Gilkfw~f~~k~l~~~l~l~-----------------  534 (910)
T KOG1539|consen  476 NMQS----GIHRKSFGDSPAHKGEVTGLAVDGTNRLLVSAGADGILKFWDFKKKVLKKSLRLG-----------------  534 (910)
T ss_pred             Eccc----CeeecccccCccccCceeEEEecCCCceEEEccCcceEEEEecCCcceeeeeccC-----------------
Confidence            9753    3233344   468999999999888899999999999999998653221111100                 


Q ss_pred             ccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEECCCCCe
Q 022074          187 RDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQPM  266 (303)
Q Consensus       187 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~  266 (303)
                            +  .          ...      +.++.....+|.+..|-.|+++|..+.+.+..|.+|.+.|++++|||||++
T Consensus       535 ------~--~----------~~~------iv~hr~s~l~a~~~ddf~I~vvD~~t~kvvR~f~gh~nritd~~FS~DgrW  590 (910)
T KOG1539|consen  535 ------S--S----------ITG------IVYHRVSDLLAIALDDFSIRVVDVVTRKVVREFWGHGNRITDMTFSPDGRW  590 (910)
T ss_pred             ------C--C----------cce------eeeeehhhhhhhhcCceeEEEEEchhhhhhHHhhccccceeeeEeCCCCcE
Confidence                  0  0          001      122333567889999999999999999999999999999999999999999


Q ss_pred             EEEEeCCCCEEEeecCCC
Q 022074          267 LVSSSWDGDVVRWEFPGN  284 (303)
Q Consensus       267 las~s~Dg~i~~Wd~~~~  284 (303)
                      |++++.|++|++||++..
T Consensus       591 lisasmD~tIr~wDlpt~  608 (910)
T KOG1539|consen  591 LISASMDSTIRTWDLPTG  608 (910)
T ss_pred             EEEeecCCcEEEEeccCc
Confidence            999999999999999875


No 155
>KOG2106 consensus Uncharacterized conserved protein, contains HELP and WD40 domains [Function unknown]
Probab=99.68  E-value=4.3e-14  Score=124.85  Aligned_cols=258  Identities=18%  Similarity=0.251  Sum_probs=161.8

Q ss_pred             hccccccccccCcCcccccCCCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEE---Eecc-cCCeEEEEEcc
Q 022074           16 ESLANVTEIHDGLDFSAADDGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLR---ILAH-TSDVNTVCFGD   91 (303)
Q Consensus        16 ~~~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~---~~~h-~~~v~~l~~~~   91 (303)
                      +|+.+||+--.|....  +...-+-.|..+.|+|.+..++.-...|.+..|++.++.+.++   +..+ ...|.|++|. 
T Consensus       179 ~h~lSVWdWqk~~~~~--~vk~sne~v~~a~FHPtd~nliit~Gk~H~~Fw~~~~~~l~k~~~~fek~ekk~Vl~v~F~-  255 (626)
T KOG2106|consen  179 PHMLSVWDWQKKAKLG--PVKTSNEVVFLATFHPTDPNLIITCGKGHLYFWTLRGGSLVKRQGIFEKREKKFVLCVTFL-  255 (626)
T ss_pred             ccccchhhchhhhccC--cceeccceEEEEEeccCCCcEEEEeCCceEEEEEccCCceEEEeeccccccceEEEEEEEc-
Confidence            4666788754444332  2223334589999999777666656677899999999876544   2232 2579999996 


Q ss_pred             CCCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEccc--ccCCccc-cc
Q 022074           92 ESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRK--MSSNASC-NL  168 (303)
Q Consensus        92 ~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~--~~~~~~~-~~  168 (303)
                      ++++ ++||..+|.+.+|+.+.    .+.......|..+|.++....+|. |++|+.|+.|..||-..  .....-. ..
T Consensus       256 engd-viTgDS~G~i~Iw~~~~----~~~~k~~~aH~ggv~~L~~lr~Gt-llSGgKDRki~~Wd~~y~k~r~~elPe~~  329 (626)
T KOG2106|consen  256 ENGD-VITGDSGGNILIWSKGT----NRISKQVHAHDGGVFSLCMLRDGT-LLSGGKDRKIILWDDNYRKLRETELPEQF  329 (626)
T ss_pred             CCCC-EEeecCCceEEEEeCCC----ceEEeEeeecCCceEEEEEecCcc-EeecCccceEEeccccccccccccCchhc
Confidence            4454 56999999999999752    222222337999999998888885 66699999999998321  1110000 00


Q ss_pred             C-ccc--------eeeecee------------------------eeC-CCCCccccCCCCCcceEEecccceeee--EEE
Q 022074          169 G-FRS--------YEWDYRW------------------------MDY-PPQARDLKHPCDQSVATYKGHSVLRTL--IRC  212 (303)
Q Consensus       169 ~-~~~--------~~~~~~~------------------------~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~  212 (303)
                      + .+.        +....++                        +.. |.....+....+..+..++.|...=+.  ..-
T Consensus       330 G~iRtv~e~~~di~vGTtrN~iL~Gt~~~~f~~~v~gh~delwgla~hps~~q~~T~gqdk~v~lW~~~k~~wt~~~~d~  409 (626)
T KOG2106|consen  330 GPIRTVAEGKGDILVGTTRNFILQGTLENGFTLTVQGHGDELWGLATHPSKNQLLTCGQDKHVRLWNDHKLEWTKIIEDP  409 (626)
T ss_pred             CCeeEEecCCCcEEEeeccceEEEeeecCCceEEEEecccceeeEEcCCChhheeeccCcceEEEccCCceeEEEEecCc
Confidence            0 000        0000000                        000 111111111112223334433321111  011


Q ss_pred             eeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEECCCCCeEEEEeCCCCEEEeecCCC
Q 022074          213 HFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFPGN  284 (303)
Q Consensus       213 ~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~~~  284 (303)
                      .-+..|+|.| .+|.|...|.-.+.|.++.+.+..... ..+++.++|||+|.+||.|+.|+.|.++.+..+
T Consensus       410 ~~~~~fhpsg-~va~Gt~~G~w~V~d~e~~~lv~~~~d-~~~ls~v~ysp~G~~lAvgs~d~~iyiy~Vs~~  479 (626)
T KOG2106|consen  410 AECADFHPSG-VVAVGTATGRWFVLDTETQDLVTIHTD-NEQLSVVRYSPDGAFLAVGSHDNHIYIYRVSAN  479 (626)
T ss_pred             eeEeeccCcc-eEEEeeccceEEEEecccceeEEEEec-CCceEEEEEcCCCCEEEEecCCCeEEEEEECCC
Confidence            1234577878 889999999999999988776655444 789999999999999999999999999998755


No 156
>KOG0639 consensus Transducin-like enhancer of split protein (contains WD40 repeats) [Chromatin structure and dynamics]
Probab=99.68  E-value=9.9e-16  Score=135.02  Aligned_cols=238  Identities=16%  Similarity=0.202  Sum_probs=147.7

Q ss_pred             CcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCce---EEEE--ecccCCeEEEEEccCCCcEEEEecCCCeEEEEcC
Q 022074           37 GYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKL---SLRI--LAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDR  111 (303)
Q Consensus        37 ~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~---~~~~--~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~  111 (303)
                      .|.--|.++.+|...+++++|+. |.|+|||+.....   +..+  ..-+.-+..+...+ +++.|++|++-.++.|||+
T Consensus       417 ~HGEvVcAvtIS~~trhVyTgGk-gcVKVWdis~pg~k~PvsqLdcl~rdnyiRSckL~p-dgrtLivGGeastlsiWDL  494 (705)
T KOG0639|consen  417 AHGEVVCAVTISNPTRHVYTGGK-GCVKVWDISQPGNKSPVSQLDCLNRDNYIRSCKLLP-DGRTLIVGGEASTLSIWDL  494 (705)
T ss_pred             ccCcEEEEEEecCCcceeEecCC-CeEEEeeccCCCCCCccccccccCcccceeeeEecC-CCceEEeccccceeeeeec
Confidence            56677889999998999999875 5699999965421   1111  12234566666654 5788889999999999998


Q ss_pred             ccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccC
Q 022074          112 RCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKH  191 (303)
Q Consensus       112 ~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  191 (303)
                      ....  .+....+....-+..+++.++|.+..+++-.||.|+|||++.......+.    .+.-....+....++..+..
T Consensus       495 AapT--prikaeltssapaCyALa~spDakvcFsccsdGnI~vwDLhnq~~Vrqfq----GhtDGascIdis~dGtklWT  568 (705)
T KOG0639|consen  495 AAPT--PRIKAELTSSAPACYALAISPDAKVCFSCCSDGNIAVWDLHNQTLVRQFQ----GHTDGASCIDISKDGTKLWT  568 (705)
T ss_pred             cCCC--cchhhhcCCcchhhhhhhcCCccceeeeeccCCcEEEEEcccceeeeccc----CCCCCceeEEecCCCceeec
Confidence            6432  22222333333456678899999999999999999999998643322211    11000111122222222211


Q ss_pred             C-CCCcceEEecccceee----eEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEECCCCCe
Q 022074          192 P-CDQSVATYKGHSVLRT----LIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQPM  266 (303)
Q Consensus       192 ~-~~~~~~~~~~~~~~~~----~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~  266 (303)
                      . -+..+..|+-......    -..-.|+.-++|++.+|+.|=+.+.+.+-.....+ .+.+..|+.-|.++.|++.|++
T Consensus       569 GGlDntvRcWDlregrqlqqhdF~SQIfSLg~cP~~dWlavGMens~vevlh~skp~-kyqlhlheScVLSlKFa~cGkw  647 (705)
T KOG0639|consen  569 GGLDNTVRCWDLREGRQLQQHDFSSQIFSLGYCPTGDWLAVGMENSNVEVLHTSKPE-KYQLHLHESCVLSLKFAYCGKW  647 (705)
T ss_pred             CCCccceeehhhhhhhhhhhhhhhhhheecccCCCccceeeecccCcEEEEecCCcc-ceeecccccEEEEEEecccCce
Confidence            1 1222222221110000    00112334455667777777777777666654333 3455678899999999999999


Q ss_pred             EEEEeCCCCEEEeecCC
Q 022074          267 LVSSSWDGDVVRWEFPG  283 (303)
Q Consensus       267 las~s~Dg~i~~Wd~~~  283 (303)
                      ++|.+.|..+..|..+-
T Consensus       648 fvStGkDnlLnawrtPy  664 (705)
T KOG0639|consen  648 FVSTGKDNLLNAWRTPY  664 (705)
T ss_pred             eeecCchhhhhhccCcc
Confidence            99999999999998653


No 157
>KOG1007 consensus WD repeat protein TSSC1, WD repeat superfamily [Function unknown]
Probab=99.68  E-value=5.1e-15  Score=122.53  Aligned_cols=223  Identities=23%  Similarity=0.349  Sum_probs=155.3

Q ss_pred             ccccccccccCcCcc--------cccCCCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCce-EEEEe-----cccC
Q 022074           17 SLANVTEIHDGLDFS--------AADDGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKL-SLRIL-----AHTS   82 (303)
Q Consensus        17 ~~~~~~~~~~~~~~~--------~~~~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~-~~~~~-----~h~~   82 (303)
                      +-++||.+=-.+..+        +.-++.+-+.|.|+.|.|+++.+++-. |..|.+|++..+.. ...+.     .|..
T Consensus        93 ~~aaiw~ipe~~~~S~~~tlE~v~~Ldteavg~i~cvew~Pns~klasm~-dn~i~l~~l~ess~~vaev~ss~s~e~~~  171 (370)
T KOG1007|consen   93 TGAAIWQIPEPLGQSNSSTLECVASLDTEAVGKINCVEWEPNSDKLASMD-DNNIVLWSLDESSKIVAEVLSSESAEMRH  171 (370)
T ss_pred             eeEEEEecccccCccccchhhHhhcCCHHHhCceeeEEEcCCCCeeEEec-cCceEEEEcccCcchheeecccccccccc
Confidence            445777774444332        111235556999999999999999875 77899999988764 22221     2334


Q ss_pred             CeEEEEEcc-CCCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCC-CEEEEEeCCCcEEEEEcccc
Q 022074           83 DVNTVCFGD-ESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDG-RYLISNGKDQAIKLWDIRKM  160 (303)
Q Consensus        83 ~v~~l~~~~-~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~-~~l~s~~~D~~v~lWdl~~~  160 (303)
                      ..+.-+|+| -+++.+++.+ |+++..||+|...   +.-.....|.-.|..++|+|+- .+|+|+|.|+.||+||+|+.
T Consensus       172 ~ftsg~WspHHdgnqv~tt~-d~tl~~~D~RT~~---~~~sI~dAHgq~vrdlDfNpnkq~~lvt~gDdgyvriWD~R~t  247 (370)
T KOG1007|consen  172 SFTSGAWSPHHDGNQVATTS-DSTLQFWDLRTMK---KNNSIEDAHGQRVRDLDFNPNKQHILVTCGDDGYVRIWDTRKT  247 (370)
T ss_pred             eecccccCCCCccceEEEeC-CCcEEEEEccchh---hhcchhhhhcceeeeccCCCCceEEEEEcCCCccEEEEeccCC
Confidence            556678876 4577776655 7899999998432   2233445687889999999874 55899999999999999864


Q ss_pred             cCCcccccCccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECC
Q 022074          161 SSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLV  240 (303)
Q Consensus       161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~  240 (303)
                      +.                                 ++..+.+|......+|++  |.   ..+++++||.|..+-+|...
T Consensus       248 k~---------------------------------pv~el~~HsHWvW~VRfn--~~---hdqLiLs~~SDs~V~Lsca~  289 (370)
T KOG1007|consen  248 KF---------------------------------PVQELPGHSHWVWAVRFN--PE---HDQLILSGGSDSAVNLSCAS  289 (370)
T ss_pred             Cc---------------------------------cccccCCCceEEEEEEec--Cc---cceEEEecCCCceeEEEecc
Confidence            32                                 233444555444444432  21   24789999999999999753


Q ss_pred             CC-----------------------------eEEEEeecCCCCeEEEEECCCCCe-EEEEeCCCCEEEeecC
Q 022074          241 SG-----------------------------EQVAALKYHTSPVRDCSWHPSQPM-LVSSSWDGDVVRWEFP  282 (303)
Q Consensus       241 ~~-----------------------------~~~~~~~~h~~~I~~v~~sp~~~~-las~s~Dg~i~~Wd~~  282 (303)
                      .-                             ..+.++..|++.|.+++||.-.++ +||-|.||.+.+=.++
T Consensus       290 svSSE~qi~~~~dese~e~~dseer~kpL~dg~l~tydehEDSVY~~aWSsadPWiFASLSYDGRviIs~V~  361 (370)
T KOG1007|consen  290 SVSSEQQIEFEDDESESEDEDSEERVKPLQDGQLETYDEHEDSVYALAWSSADPWIFASLSYDGRVIISSVP  361 (370)
T ss_pred             ccccccccccccccccCcchhhHHhcccccccccccccccccceEEEeeccCCCeeEEEeccCceEEeecCC
Confidence            20                             123466789999999999998885 8899999999886554


No 158
>KOG2048 consensus WD40 repeat protein [General function prediction only]
Probab=99.68  E-value=1.6e-14  Score=131.45  Aligned_cols=197  Identities=17%  Similarity=0.214  Sum_probs=147.0

Q ss_pred             ceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEE-EEecc-cCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCC
Q 022074           41 GIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSL-RILAH-TSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKG  118 (303)
Q Consensus        41 ~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~-~~~~h-~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~  118 (303)
                      +|.+++|+.+.+.||++-.||.|-||++..+-... .+.++ +..|..++|. + +..|+|.+.+|.|.-||+-    +.
T Consensus        27 ~I~slA~s~kS~~lAvsRt~g~IEiwN~~~~w~~~~vi~g~~drsIE~L~W~-e-~~RLFS~g~sg~i~EwDl~----~l  100 (691)
T KOG2048|consen   27 EIVSLAYSHKSNQLAVSRTDGNIEIWNLSNNWFLEPVIHGPEDRSIESLAWA-E-GGRLFSSGLSGSITEWDLH----TL  100 (691)
T ss_pred             ceEEEEEeccCCceeeeccCCcEEEEccCCCceeeEEEecCCCCceeeEEEc-c-CCeEEeecCCceEEEEecc----cC
Confidence            69999999999999999999999999998875443 34444 4689999996 3 5578899999999999974    44


Q ss_pred             ccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCcce
Q 022074          119 KPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVA  198 (303)
Q Consensus       119 ~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  198 (303)
                      ++.........++++++.+|.+..++.|..||.+..++...........                               
T Consensus       101 k~~~~~d~~gg~IWsiai~p~~~~l~IgcddGvl~~~s~~p~~I~~~r~-------------------------------  149 (691)
T KOG2048|consen  101 KQKYNIDSNGGAIWSIAINPENTILAIGCDDGVLYDFSIGPDKITYKRS-------------------------------  149 (691)
T ss_pred             ceeEEecCCCcceeEEEeCCccceEEeecCCceEEEEecCCceEEEEee-------------------------------
Confidence            5555666667889999999999999999999977777654321110000                               


Q ss_pred             EEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeec--------CCCCeEEEEECCCCCeEEEE
Q 022074          199 TYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKY--------HTSPVRDCSWHPSQPMLVSS  270 (303)
Q Consensus       199 ~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~--------h~~~I~~v~~sp~~~~las~  270 (303)
                       +....      .-.++..|++++..+++|+.||.|++||.++++.+.....        -..-||++.|- ....||+|
T Consensus       150 -l~rq~------sRvLslsw~~~~~~i~~Gs~Dg~Iriwd~~~~~t~~~~~~~~d~l~k~~~~iVWSv~~L-rd~tI~sg  221 (691)
T KOG2048|consen  150 -LMRQK------SRVLSLSWNPTGTKIAGGSIDGVIRIWDVKSGQTLHIITMQLDRLSKREPTIVWSVLFL-RDSTIASG  221 (691)
T ss_pred             -ccccc------ceEEEEEecCCccEEEecccCceEEEEEcCCCceEEEeeecccccccCCceEEEEEEEe-ecCcEEEe
Confidence             00000      0012345778888899999999999999999987663221        12347888887 45579999


Q ss_pred             eCCCCEEEeecC
Q 022074          271 SWDGDVVRWEFP  282 (303)
Q Consensus       271 s~Dg~i~~Wd~~  282 (303)
                      +.-|++++||..
T Consensus       222 DS~G~V~FWd~~  233 (691)
T KOG2048|consen  222 DSAGTVTFWDSI  233 (691)
T ss_pred             cCCceEEEEccc
Confidence            999999999964


No 159
>TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol.
Probab=99.68  E-value=2.3e-14  Score=124.86  Aligned_cols=186  Identities=17%  Similarity=0.121  Sum_probs=126.7

Q ss_pred             CCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCccceeecccccC
Q 022074           51 GRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEG  130 (303)
Q Consensus        51 g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~  130 (303)
                      ++.+++++.|+.+++||+.+++....+..+. .+..++|+++...++++++.++.|++||.+..    +....+..+.. 
T Consensus         1 ~~~~~s~~~d~~v~~~d~~t~~~~~~~~~~~-~~~~l~~~~dg~~l~~~~~~~~~v~~~d~~~~----~~~~~~~~~~~-   74 (300)
T TIGR03866         1 EKAYVSNEKDNTISVIDTATLEVTRTFPVGQ-RPRGITLSKDGKLLYVCASDSDTIQVIDLATG----EVIGTLPSGPD-   74 (300)
T ss_pred             CcEEEEecCCCEEEEEECCCCceEEEEECCC-CCCceEECCCCCEEEEEECCCCeEEEEECCCC----cEEEeccCCCC-
Confidence            3568889999999999999988766665553 46778997653334467788999999997532    22333333333 


Q ss_pred             eEEEEeCCCCCEEEE-EeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCcceEEecccceeee
Q 022074          131 ITFIDSRGDGRYLIS-NGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTL  209 (303)
Q Consensus       131 v~~~~~~~~~~~l~s-~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  209 (303)
                      +..+.++++++.+++ ++.|+.+++||++......                                  .+.....    
T Consensus        75 ~~~~~~~~~g~~l~~~~~~~~~l~~~d~~~~~~~~----------------------------------~~~~~~~----  116 (300)
T TIGR03866        75 PELFALHPNGKILYIANEDDNLVTVIDIETRKVLA----------------------------------EIPVGVE----  116 (300)
T ss_pred             ccEEEECCCCCEEEEEcCCCCeEEEEECCCCeEEe----------------------------------EeeCCCC----
Confidence            456788999987754 5568999999986421100                                  0000000    


Q ss_pred             EEEeeeeeeeCCCeEEEEEeCCC-eEEEEECCCCeEEEEeecCCCCeEEEEECCCCCeEE-EEeCCCCEEEeecCCC
Q 022074          210 IRCHFSPVYSTGQKYIYTGSHDS-CVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQPMLV-SSSWDGDVVRWEFPGN  284 (303)
Q Consensus       210 ~~~~~~~~~s~~~~~latg~~dg-~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~la-s~s~Dg~i~~Wd~~~~  284 (303)
                         .....++|+++++++++.++ .+++||..+++.+..+... ..+..++|+|++++|+ ++..++.+++||....
T Consensus       117 ---~~~~~~~~dg~~l~~~~~~~~~~~~~d~~~~~~~~~~~~~-~~~~~~~~s~dg~~l~~~~~~~~~v~i~d~~~~  189 (300)
T TIGR03866       117 ---PEGMAVSPDGKIVVNTSETTNMAHFIDTKTYEIVDNVLVD-QRPRFAEFTADGKELWVSSEIGGTVSVIDVATR  189 (300)
T ss_pred             ---cceEEECCCCCEEEEEecCCCeEEEEeCCCCeEEEEEEcC-CCccEEEECCCCCEEEEEcCCCCEEEEEEcCcc
Confidence               01124678899999888765 5778899888776554433 3467899999999775 5556999999998753


No 160
>COG2319 FOG: WD40 repeat [General function prediction only]
Probab=99.67  E-value=7e-14  Score=125.46  Aligned_cols=225  Identities=29%  Similarity=0.486  Sum_probs=162.7

Q ss_pred             hhccccccccccCcCcccccCCCcccceEEEEE-cCCCC-EEEEeeC-CCeEEEEECCC-CceEEEEecccCCeEEEEEc
Q 022074           15 MESLANVTEIHDGLDFSAADDGGYSFGIFSLKF-STDGR-ELVAGSS-DDCIYVYDLEA-NKLSLRILAHTSDVNTVCFG   90 (303)
Q Consensus        15 ~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~-s~~g~-~l~sgs~-Dg~v~lwd~~~-~~~~~~~~~h~~~v~~l~~~   90 (303)
                      .+..+.+|+...+..........+...+..+.+ ++++. .++..+. |+.+.+|+... ......+..|...|..+.|+
T Consensus        85 ~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~  164 (466)
T COG2319          85 SDGTIKLWDLDNGEKLIKSLEGLHDSSVSKLALSSPDGNSILLASSSLDGTVKLWDLSTPGKLIRTLEGHSESVTSLAFS  164 (466)
T ss_pred             CCCcEEEEEcCCCceeEEEEeccCCCceeeEEEECCCcceEEeccCCCCccEEEEEecCCCeEEEEEecCcccEEEEEEC
Confidence            566777777765541211121223246777777 88887 5555455 99999999988 66777788999999999997


Q ss_pred             cCCCcEEEEecC-CCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCC-EEEEEeCCCcEEEEEcccccCCccccc
Q 022074           91 DESGHLIYSGSD-DNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGR-YLISNGKDQAIKLWDIRKMSSNASCNL  168 (303)
Q Consensus        91 ~~~~~~l~s~s~-dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~-~l~s~~~D~~v~lWdl~~~~~~~~~~~  168 (303)
                      +. +..+++++. |+.+++|+...    ......+.+|...|..+++.+++. .+++++.|+.+++||.+.......   
T Consensus       165 ~~-~~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~d~~i~~wd~~~~~~~~~---  236 (466)
T COG2319         165 PD-GKLLASGSSLDGTIKLWDLRT----GKPLSTLAGHTDPVSSLAFSPDGGLLIASGSSDGTIRLWDLSTGKLLRS---  236 (466)
T ss_pred             CC-CCEEEecCCCCCceEEEEcCC----CceEEeeccCCCceEEEEEcCCcceEEEEecCCCcEEEEECCCCcEEee---
Confidence            64 557778875 99999999752    345666777999999999999887 566669999999997652110000   


Q ss_pred             CccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeE-EEE
Q 022074          169 GFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQ-VAA  247 (303)
Q Consensus       169 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~-~~~  247 (303)
                                                    .+.+|.... ..      .|++++.++++++.|+.+++||...... +..
T Consensus       237 ------------------------------~~~~~~~~~-~~------~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~  279 (466)
T COG2319         237 ------------------------------TLSGHSDSV-VS------SFSPDGSLLASGSSDGTIRLWDLRSSSSLLRT  279 (466)
T ss_pred             ------------------------------ecCCCCcce-eE------eECCCCCEEEEecCCCcEEEeeecCCCcEEEE
Confidence                                          111111110 00      3667778888999999999999986654 444


Q ss_pred             eecCCCCeEEEEECCCCCeEEEEeCCCCEEEeecCCC
Q 022074          248 LKYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFPGN  284 (303)
Q Consensus       248 ~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~~~  284 (303)
                      +..|..++.++.|+|++..+++++.|+.+.+|+....
T Consensus       280 ~~~~~~~v~~~~~~~~~~~~~~~~~d~~~~~~~~~~~  316 (466)
T COG2319         280 LSGHSSSVLSVAFSPDGKLLASGSSDGTVRLWDLETG  316 (466)
T ss_pred             EecCCccEEEEEECCCCCEEEEeeCCCcEEEEEcCCC
Confidence            4678899999999999998888999999999987644


No 161
>KOG2055 consensus WD40 repeat protein [General function prediction only]
Probab=99.66  E-value=2.1e-14  Score=125.43  Aligned_cols=202  Identities=16%  Similarity=0.189  Sum_probs=149.0

Q ss_pred             cccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCc--eEEEEecccCCeEEEEEccCCCc-EEEEecCCCeEEEEcCccc
Q 022074           38 YSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANK--LSLRILAHTSDVNTVCFGDESGH-LIYSGSDDNLCKVWDRRCL  114 (303)
Q Consensus        38 ~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~--~~~~~~~h~~~v~~l~~~~~~~~-~l~s~s~dg~v~lWd~~~~  114 (303)
                      -..+|.++.|+|....+++++.||+++||-++...  .+..+.--.-++.+.+|.|+ ++ .+++++.....+.||+...
T Consensus       212 s~~~I~sv~FHp~~plllvaG~d~~lrifqvDGk~N~~lqS~~l~~fPi~~a~f~p~-G~~~i~~s~rrky~ysyDle~a  290 (514)
T KOG2055|consen  212 SHGGITSVQFHPTAPLLLVAGLDGTLRIFQVDGKVNPKLQSIHLEKFPIQKAEFAPN-GHSVIFTSGRRKYLYSYDLETA  290 (514)
T ss_pred             CcCCceEEEecCCCceEEEecCCCcEEEEEecCccChhheeeeeccCccceeeecCC-CceEEEecccceEEEEeecccc
Confidence            35689999999999999999999999999886543  22333334468889999764 55 8999999999999998633


Q ss_pred             cCCCccceeeccccc-CeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCC
Q 022074          115 NVKGKPAGVLMGHLE-GITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPC  193 (303)
Q Consensus       115 ~~~~~~~~~~~~h~~-~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  193 (303)
                      ..  .++..+.|+.. .+....+++++++|+..|..|.|.+--........++                           
T Consensus       291 k~--~k~~~~~g~e~~~~e~FeVShd~~fia~~G~~G~I~lLhakT~eli~s~---------------------------  341 (514)
T KOG2055|consen  291 KV--TKLKPPYGVEEKSMERFEVSHDSNFIAIAGNNGHIHLLHAKTKELITSF---------------------------  341 (514)
T ss_pred             cc--ccccCCCCcccchhheeEecCCCCeEEEcccCceEEeehhhhhhhhhee---------------------------
Confidence            22  23344455553 4667788999999999999999999865432221111                           


Q ss_pred             CCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCC-CCeEEEEECCCCCeEEEEeC
Q 022074          194 DQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHT-SPVRDCSWHPSQPMLVSSSW  272 (303)
Q Consensus       194 ~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~-~~I~~v~~sp~~~~las~s~  272 (303)
                             +-...+.       ...|+.+++.|+..+.+|.|++||+.+...+..+.... -.-++++.|+++.+||+||.
T Consensus       342 -------KieG~v~-------~~~fsSdsk~l~~~~~~GeV~v~nl~~~~~~~rf~D~G~v~gts~~~S~ng~ylA~GS~  407 (514)
T KOG2055|consen  342 -------KIEGVVS-------DFTFSSDSKELLASGGTGEVYVWNLRQNSCLHRFVDDGSVHGTSLCISLNGSYLATGSD  407 (514)
T ss_pred             -------eeccEEe-------eEEEecCCcEEEEEcCCceEEEEecCCcceEEEEeecCccceeeeeecCCCceEEeccC
Confidence                   1111101       11256788899999999999999999998888775221 23478888999999999999


Q ss_pred             CCCEEEeecCC
Q 022074          273 DGDVVRWEFPG  283 (303)
Q Consensus       273 Dg~i~~Wd~~~  283 (303)
                      .|.+.++|...
T Consensus       408 ~GiVNIYd~~s  418 (514)
T KOG2055|consen  408 SGIVNIYDGNS  418 (514)
T ss_pred             cceEEEeccch
Confidence            99999999553


No 162
>PRK01742 tolB translocation protein TolB; Provisional
Probab=99.64  E-value=5.5e-14  Score=129.47  Aligned_cols=222  Identities=18%  Similarity=0.168  Sum_probs=144.7

Q ss_pred             EEEccCchhhccccccccccCcCcccccCCCcccceEEEEEcCCCCEEEEeeCC---CeEEEEECCCCceEEEEecccCC
Q 022074            7 IVDVGSGTMESLANVTEIHDGLDFSAADDGGYSFGIFSLKFSTDGRELVAGSSD---DCIYVYDLEANKLSLRILAHTSD   83 (303)
Q Consensus         7 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~s~~g~~l~sgs~D---g~v~lwd~~~~~~~~~~~~h~~~   83 (303)
                      ...-+.++-++-+.||+. +|.+..  ...+|+..+.+..|||||++|+..+.+   ..|++||+.++.... +....+.
T Consensus       174 v~~~~~~~~~~~i~i~d~-dg~~~~--~lt~~~~~v~~p~wSPDG~~la~~s~~~~~~~i~i~dl~tg~~~~-l~~~~g~  249 (429)
T PRK01742        174 VVQKNGGSQPYEVRVADY-DGFNQF--IVNRSSQPLMSPAWSPDGSKLAYVSFENKKSQLVVHDLRSGARKV-VASFRGH  249 (429)
T ss_pred             EEEEcCCCceEEEEEECC-CCCCce--EeccCCCccccceEcCCCCEEEEEEecCCCcEEEEEeCCCCceEE-EecCCCc
Confidence            333333344567777775 565542  235778889999999999999987754   369999998875432 2222233


Q ss_pred             eEEEEEccCCCcEEEEe-cCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEe-CCCcEEEEEccccc
Q 022074           84 VNTVCFGDESGHLIYSG-SDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNG-KDQAIKLWDIRKMS  161 (303)
Q Consensus        84 v~~l~~~~~~~~~l~s~-s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~-~D~~v~lWdl~~~~  161 (303)
                      ...++|+|+ ++.|+.+ +.+|.+.||....   .......+..+...+....|+|+|+.|+..+ .++..+||++....
T Consensus       250 ~~~~~wSPD-G~~La~~~~~~g~~~Iy~~d~---~~~~~~~lt~~~~~~~~~~wSpDG~~i~f~s~~~g~~~I~~~~~~~  325 (429)
T PRK01742        250 NGAPAFSPD-GSRLAFASSKDGVLNIYVMGA---NGGTPSQLTSGAGNNTEPSWSPDGQSILFTSDRSGSPQVYRMSASG  325 (429)
T ss_pred             cCceeECCC-CCEEEEEEecCCcEEEEEEEC---CCCCeEeeccCCCCcCCEEECCCCCEEEEEECCCCCceEEEEECCC
Confidence            456889765 6666554 5788777764321   1122334555666677889999999876554 67899999875311


Q ss_pred             CCcccccCccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCC
Q 022074          162 SNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVS  241 (303)
Q Consensus       162 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~  241 (303)
                      ..                                 ..... +.       . ..+.++|++++++..+.++ +.+||+.+
T Consensus       326 ~~---------------------------------~~~l~-~~-------~-~~~~~SpDG~~ia~~~~~~-i~~~Dl~~  362 (429)
T PRK01742        326 GG---------------------------------ASLVG-GR-------G-YSAQISADGKTLVMINGDN-VVKQDLTS  362 (429)
T ss_pred             CC---------------------------------eEEec-CC-------C-CCccCCCCCCEEEEEcCCC-EEEEECCC
Confidence            00                                 00000 00       0 1245788999998887765 55699988


Q ss_pred             CeEEEEeecCCCCeEEEEECCCCCeEEEEeCCCCEEEeec
Q 022074          242 GEQVAALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEF  281 (303)
Q Consensus       242 ~~~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~  281 (303)
                      ++.......+  ...++.|+||+++|+.++.++...+|++
T Consensus       363 g~~~~lt~~~--~~~~~~~sPdG~~i~~~s~~g~~~~l~~  400 (429)
T PRK01742        363 GSTEVLSSTF--LDESPSISPNGIMIIYSSTQGLGKVLQL  400 (429)
T ss_pred             CCeEEecCCC--CCCCceECCCCCEEEEEEcCCCceEEEE
Confidence            8754322222  3467889999999999999999998875


No 163
>KOG4378 consensus Nuclear protein COP1 [Signal transduction mechanisms]
Probab=99.63  E-value=6.8e-15  Score=129.64  Aligned_cols=199  Identities=18%  Similarity=0.310  Sum_probs=144.5

Q ss_pred             eEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCccc
Q 022074           42 IFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPA  121 (303)
Q Consensus        42 v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~  121 (303)
                      -.|++......++++|+..++|+|||++.......+..|..-|+++.++. .+.++++++..|.|.+-.+..    +...
T Consensus        82 ~~Cv~~~s~S~y~~sgG~~~~Vkiwdl~~kl~hr~lkdh~stvt~v~YN~-~DeyiAsvs~gGdiiih~~~t----~~~t  156 (673)
T KOG4378|consen   82 AFCVACASQSLYEISGGQSGCVKIWDLRAKLIHRFLKDHQSTVTYVDYNN-TDEYIASVSDGGDIIIHGTKT----KQKT  156 (673)
T ss_pred             HHHHhhhhcceeeeccCcCceeeehhhHHHHHhhhccCCcceeEEEEecC-CcceeEEeccCCcEEEEeccc----Cccc
Confidence            34555556668999999999999999996555556778999999999964 478999999999999987642    2222


Q ss_pred             eeecccc--cCeEEEEeCCCCC-EEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCcce
Q 022074          122 GVLMGHL--EGITFIDSRGDGR-YLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVA  198 (303)
Q Consensus       122 ~~~~~h~--~~v~~~~~~~~~~-~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  198 (303)
                      ..| +|.  ..|.-+.+++..+ +|.+++.+|.|.+||+..+.+.......                             
T Consensus       157 t~f-~~~sgqsvRll~ys~skr~lL~~asd~G~VtlwDv~g~sp~~~~~~~-----------------------------  206 (673)
T KOG4378|consen  157 TTF-TIDSGQSVRLLRYSPSKRFLLSIASDKGAVTLWDVQGMSPIFHASEA-----------------------------  206 (673)
T ss_pred             cce-ecCCCCeEEEeecccccceeeEeeccCCeEEEEeccCCCcccchhhh-----------------------------
Confidence            233 343  3466788888654 4678999999999998754432211000                             


Q ss_pred             EEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEECCCCCeEEEEeCCCCEEE
Q 022074          199 TYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVR  278 (303)
Q Consensus       199 ~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~  278 (303)
                          |.  ..+..++|+|   .+..+|++.|.|..|.+||.........+. ...|...++|+++|.+|+.|...|.|..
T Consensus       207 ----Hs--AP~~gicfsp---sne~l~vsVG~Dkki~~yD~~s~~s~~~l~-y~~Plstvaf~~~G~~L~aG~s~G~~i~  276 (673)
T KOG4378|consen  207 ----HS--APCRGICFSP---SNEALLVSVGYDKKINIYDIRSQASTDRLT-YSHPLSTVAFSECGTYLCAGNSKGELIA  276 (673)
T ss_pred             ----cc--CCcCcceecC---CccceEEEecccceEEEeecccccccceee-ecCCcceeeecCCceEEEeecCCceEEE
Confidence                00  0011122333   246789999999999999998766655554 4458999999999999999999999999


Q ss_pred             eecCCCC
Q 022074          279 WEFPGNG  285 (303)
Q Consensus       279 Wd~~~~~  285 (303)
                      +|+.+..
T Consensus       277 YD~R~~k  283 (673)
T KOG4378|consen  277 YDMRSTK  283 (673)
T ss_pred             EecccCC
Confidence            9987653


No 164
>KOG1587 consensus Cytoplasmic dynein intermediate chain [Cytoskeleton]
Probab=99.63  E-value=4.2e-14  Score=131.09  Aligned_cols=247  Identities=19%  Similarity=0.246  Sum_probs=149.5

Q ss_pred             CcccceEEEEEcCCCC--EEEEe------------------eCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcE
Q 022074           37 GYSFGIFSLKFSTDGR--ELVAG------------------SSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHL   96 (303)
Q Consensus        37 ~~~~~v~~l~~s~~g~--~l~sg------------------s~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~   96 (303)
                      -+...+.++.|.+++.  ...+.                  ..++.+.||+++..........-...|.+++|+|.++++
T Consensus       178 ~~~~~~~~~~w~~~~~~~~~~s~~~~~~~~~~~~~~~~~~~~~~~~~~vW~~~~p~~Pe~~~~~~s~v~~~~f~p~~p~l  257 (555)
T KOG1587|consen  178 SPKRQVTDESWHPTGSVLIAVSVAYSELDFDRYAFNKPLLSEPDGVLLVWSLKNPNTPELVLESPSEVTCLKFCPFDPNL  257 (555)
T ss_pred             chhcceeeeeeccCCCcceEEEEeecccccccccccccccccCCceEEEEecCCCCCceEEEecCCceeEEEeccCCcce
Confidence            4556677777777665  11111                  023468999998875444455566789999999989999


Q ss_pred             EEEecCCCeEEEEcCccccCC--CccceeecccccCeEEEEeCCC--CCEEEEEeCCCcEEEEEcccccCCcccc-cCcc
Q 022074           97 IYSGSDDNLCKVWDRRCLNVK--GKPAGVLMGHLEGITFIDSRGD--GRYLISNGKDQAIKLWDIRKMSSNASCN-LGFR  171 (303)
Q Consensus        97 l~s~s~dg~v~lWd~~~~~~~--~~~~~~~~~h~~~v~~~~~~~~--~~~l~s~~~D~~v~lWdl~~~~~~~~~~-~~~~  171 (303)
                      ++.|..+|+|.+||++.....  .........|.++++.+.+-.+  +.-|++++.||+|..|+++......... ....
T Consensus       258 l~gG~y~GqV~lWD~~~~~~~~~s~ls~~~~sh~~~v~~vvW~~~~~~~~f~s~ssDG~i~~W~~~~l~~P~e~~~~~~~  337 (555)
T KOG1587|consen  258 LAGGCYNGQVVLWDLRKGSDTPPSGLSALEVSHSEPVTAVVWLQNEHNTEFFSLSSDGSICSWDTDMLSLPVEGLLLESK  337 (555)
T ss_pred             EEeeccCceEEEEEccCCCCCCCcccccccccCCcCeEEEEEeccCCCCceEEEecCCcEeeeeccccccchhhcccccc
Confidence            999999999999999744321  1112233468888888776543  3459999999999999988754311100 0000


Q ss_pred             -------ceeeeceeeeCCCCCc-cc-cCCCCCcceE-------------EecccceeeeEEEeeeeeeeC-CCeEEEEE
Q 022074          172 -------SYEWDYRWMDYPPQAR-DL-KHPCDQSVAT-------------YKGHSVLRTLIRCHFSPVYST-GQKYIYTG  228 (303)
Q Consensus       172 -------~~~~~~~~~~~~~~~~-~~-~~~~~~~~~~-------------~~~~~~~~~~~~~~~~~~~s~-~~~~latg  228 (303)
                             ........+++++... .+ .....+.+..             ++++.........+....++| ..+.+.++
T Consensus       338 ~~~~~~~~~~~~~t~~~F~~~~p~~FiVGTe~G~v~~~~r~g~~~~~~~~~~~~~~~~~h~g~v~~v~~nPF~~k~fls~  417 (555)
T KOG1587|consen  338 KHKGQQSSKAVGATSLKFEPTDPNHFIVGTEEGKVYKGCRKGYTPAPEVSYKGHSTFITHIGPVYAVSRNPFYPKNFLSV  417 (555)
T ss_pred             cccccccccccceeeEeeccCCCceEEEEcCCcEEEEEeccCCcccccccccccccccccCcceEeeecCCCccceeeee
Confidence                   0000111222222111 11 1111111111             111111110001111111122 12456666


Q ss_pred             eCCCeEEEEECC-CCeEEEEeecCCCCeEEEEECCCCC-eEEEEeCCCCEEEeecCCC
Q 022074          229 SHDSCVYVYDLV-SGEQVAALKYHTSPVRDCSWHPSQP-MLVSSSWDGDVVRWEFPGN  284 (303)
Q Consensus       229 ~~dg~i~iwd~~-~~~~~~~~~~h~~~I~~v~~sp~~~-~las~s~Dg~i~~Wd~~~~  284 (303)
                      + |.+++||... .-.++..++.+.+.|++++|||..+ +++++..||.|.+||+...
T Consensus       418 g-DW~vriWs~~~~~~Pl~~~~~~~~~v~~vaWSptrpavF~~~d~~G~l~iWDLl~~  474 (555)
T KOG1587|consen  418 G-DWTVRIWSEDVIASPLLSLDSSPDYVTDVAWSPTRPAVFATVDGDGNLDIWDLLQD  474 (555)
T ss_pred             c-cceeEeccccCCCCcchhhhhccceeeeeEEcCcCceEEEEEcCCCceehhhhhcc
Confidence            6 9999999988 5567888888888999999999987 7899999999999998643


No 165
>KOG1538 consensus Uncharacterized conserved protein WDR10, contains WD40 repeats [General function prediction only]
Probab=99.62  E-value=7.1e-14  Score=127.21  Aligned_cols=239  Identities=19%  Similarity=0.268  Sum_probs=154.5

Q ss_pred             ceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccC----
Q 022074           41 GIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNV----  116 (303)
Q Consensus        41 ~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~----  116 (303)
                      .|+.++|-|||..++.+..| .+.|||...|.+..++.+|++-|.|++|+ .++++|+||+.|+.|.+|.-+....    
T Consensus        14 ci~d~afkPDGsqL~lAAg~-rlliyD~ndG~llqtLKgHKDtVycVAys-~dGkrFASG~aDK~VI~W~~klEG~LkYS   91 (1081)
T KOG1538|consen   14 CINDIAFKPDGTQLILAAGS-RLLVYDTSDGTLLQPLKGHKDTVYCVAYA-KDGKRFASGSADKSVIIWTSKLEGILKYS   91 (1081)
T ss_pred             chheeEECCCCceEEEecCC-EEEEEeCCCcccccccccccceEEEEEEc-cCCceeccCCCceeEEEecccccceeeec
Confidence            69999999999998887655 59999999999999999999999999996 4699999999999999997431100    


Q ss_pred             -CC-------ccc------e-------------eecccc--cCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccc
Q 022074          117 -KG-------KPA------G-------------VLMGHL--EGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCN  167 (303)
Q Consensus       117 -~~-------~~~------~-------------~~~~h~--~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~  167 (303)
                       ..       .|.      +             ....|.  ..+.+++|..||.+|+-|-.||+|.+=+-......-..+
T Consensus        92 H~D~IQCMsFNP~~h~LasCsLsdFglWS~~qK~V~K~kss~R~~~CsWtnDGqylalG~~nGTIsiRNk~gEek~~I~R  171 (1081)
T KOG1538|consen   92 HNDAIQCMSFNPITHQLASCSLSDFGLWSPEQKSVSKHKSSSRIICCSWTNDGQYLALGMFNGTISIRNKNGEEKVKIER  171 (1081)
T ss_pred             cCCeeeEeecCchHHHhhhcchhhccccChhhhhHHhhhhheeEEEeeecCCCcEEEEeccCceEEeecCCCCcceEEeC
Confidence             00       000      0             011122  346677888999999999999998886532211100000


Q ss_pred             -cCccceeeeceeeeCCCC--CccccCCCC-Cc--ceEEecccceeeeEEEeee---eeeeCCCeEEEEEeCCCeEEEEE
Q 022074          168 -LGFRSYEWDYRWMDYPPQ--ARDLKHPCD-QS--VATYKGHSVLRTLIRCHFS---PVYSTGQKYIYTGSHDSCVYVYD  238 (303)
Q Consensus       168 -~~~~~~~~~~~~~~~~~~--~~~~~~~~~-~~--~~~~~~~~~~~~~~~~~~~---~~~s~~~~~latg~~dg~i~iwd  238 (303)
                       .+.-+..|.+.+......  ...+..... +.  -..++|...- +.-.-.|.   ..|-++|.+++.||.|+.+++|-
T Consensus       172 pgg~Nspiwsi~~~p~sg~G~~di~aV~DW~qTLSFy~LsG~~Ig-k~r~L~FdP~CisYf~NGEy~LiGGsdk~L~~fT  250 (1081)
T KOG1538|consen  172 PGGSNSPIWSICWNPSSGEGRNDILAVADWGQTLSFYQLSGKQIG-KDRALNFDPCCISYFTNGEYILLGGSDKQLSLFT  250 (1081)
T ss_pred             CCCCCCCceEEEecCCCCCCccceEEEEeccceeEEEEecceeec-ccccCCCCchhheeccCCcEEEEccCCCceEEEe
Confidence             000111222221111100  000110000 00  0111221110 00001111   23567899999999999999996


Q ss_pred             CCCCeEEEEeecCCCCeEEEEECCCCCeEEEEeCCCCEEEeecCC
Q 022074          239 LVSGEQVAALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFPG  283 (303)
Q Consensus       239 ~~~~~~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~~  283 (303)
                       +.|-.+.++.....+||.++..|+++.++.|+.||+|-.+++..
T Consensus       251 -R~GvrLGTvg~~D~WIWtV~~~PNsQ~v~~GCqDGTiACyNl~f  294 (1081)
T KOG1538|consen  251 -RDGVRLGTVGEQDSWIWTVQAKPNSQYVVVGCQDGTIACYNLIF  294 (1081)
T ss_pred             -ecCeEEeeccccceeEEEEEEccCCceEEEEEccCeeehhhhHH
Confidence             56788888877778999999999999999999999999998653


No 166
>KOG1310 consensus WD40 repeat protein [General function prediction only]
Probab=99.62  E-value=1.2e-14  Score=129.53  Aligned_cols=134  Identities=26%  Similarity=0.367  Sum_probs=110.4

Q ss_pred             cCcCcccccCCCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEE-ecccCCeEEEEEccCC-CcEEEEecCC
Q 022074           26 DGLDFSAADDGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRI-LAHTSDVNTVCFGDES-GHLIYSGSDD  103 (303)
Q Consensus        26 ~~~~~~~~~~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~-~~h~~~v~~l~~~~~~-~~~l~s~s~d  103 (303)
                      ++++. ++...||++=|.|+.|+.+|..|++||.|-.+.|||.-..++...+ .+|...|.++.|.|.. +.+++||+.|
T Consensus        38 rrL~l-E~eL~GH~GCVN~LeWn~dG~lL~SGSDD~r~ivWd~~~~KllhsI~TgHtaNIFsvKFvP~tnnriv~sgAgD  116 (758)
T KOG1310|consen   38 RRLDL-EAELTGHTGCVNCLEWNADGELLASGSDDTRLIVWDPFEYKLLHSISTGHTANIFSVKFVPYTNNRIVLSGAGD  116 (758)
T ss_pred             hhcch-hhhhccccceecceeecCCCCEEeecCCcceEEeecchhcceeeeeecccccceeEEeeeccCCCeEEEeccCc
Confidence            44444 5567999999999999999999999999999999999877766554 4799999999997753 5678899999


Q ss_pred             CeEEEEcCccccC------CCccceeecccccCeEEEEeCCCC-CEEEEEeCCCcEEEEEcccc
Q 022074          104 NLCKVWDRRCLNV------KGKPAGVLMGHLEGITFIDSRGDG-RYLISNGKDQAIKLWDIRKM  160 (303)
Q Consensus       104 g~v~lWd~~~~~~------~~~~~~~~~~h~~~v~~~~~~~~~-~~l~s~~~D~~v~lWdl~~~  160 (303)
                      ..|+++|+.....      ...+...+..|.+.|.-++..|++ +.+.+++.||++|=+|+|..
T Consensus       117 k~i~lfdl~~~~~~~~d~~~~~~~~~~~cht~rVKria~~p~~PhtfwsasEDGtirQyDiREp  180 (758)
T KOG1310|consen  117 KLIKLFDLDSSKEGGMDHGMEETTRCWSCHTDRVKRIATAPNGPHTFWSASEDGTIRQYDIREP  180 (758)
T ss_pred             ceEEEEecccccccccccCccchhhhhhhhhhhhhheecCCCCCceEEEecCCcceeeecccCC
Confidence            9999999863111      123445667799999999988888 78999999999999999863


No 167
>KOG1188 consensus WD40 repeat protein [General function prediction only]
Probab=99.62  E-value=3.2e-14  Score=120.21  Aligned_cols=194  Identities=19%  Similarity=0.272  Sum_probs=140.4

Q ss_pred             CEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccC-CCcEEEEecCCCeEEEEcCccccCCCccceeecccc-c
Q 022074           52 RELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDE-SGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHL-E  129 (303)
Q Consensus        52 ~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~-~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~-~  129 (303)
                      ..+|++-..|+|++||..+++....+.++...++.++|... .++.+.+|+.||+||+||+|....  .+...+.++. .
T Consensus        41 ~~vav~lSngsv~lyd~~tg~~l~~fk~~~~~~N~vrf~~~ds~h~v~s~ssDG~Vr~wD~Rs~~e--~a~~~~~~~~~~  118 (376)
T KOG1188|consen   41 TAVAVSLSNGSVRLYDKGTGQLLEEFKGPPATTNGVRFISCDSPHGVISCSSDGTVRLWDIRSQAE--SARISWTQQSGT  118 (376)
T ss_pred             eeEEEEecCCeEEEEeccchhhhheecCCCCcccceEEecCCCCCeeEEeccCCeEEEEEeecchh--hhheeccCCCCC
Confidence            56888889999999999999988889999999999999543 678999999999999999985432  2333445555 4


Q ss_pred             CeEEEEeCCCCCEEEEEe----CCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCcceEEecccc
Q 022074          130 GITFIDSRGDGRYLISNG----KDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSV  205 (303)
Q Consensus       130 ~v~~~~~~~~~~~l~s~~----~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  205 (303)
                      +..+++..-.++.+++|.    .|-.|.+||.|......      +                          .-...|.-
T Consensus       119 ~f~~ld~nck~~ii~~GtE~~~s~A~v~lwDvR~~qq~l------~--------------------------~~~eSH~D  166 (376)
T KOG1188|consen  119 PFICLDLNCKKNIIACGTELTRSDASVVLWDVRSEQQLL------R--------------------------QLNESHND  166 (376)
T ss_pred             cceEeeccCcCCeEEeccccccCceEEEEEEeccccchh------h--------------------------hhhhhccC
Confidence            567777765667777765    37899999998643210      0                          00111222


Q ss_pred             eeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeE---EEEeecCCCCeEEEEECCCC-CeEEEEeCCCCEEEeec
Q 022074          206 LRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQ---VAALKYHTSPVRDCSWHPSQ-PMLVSSSWDGDVVRWEF  281 (303)
Q Consensus       206 ~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~---~~~~~~h~~~I~~v~~sp~~-~~las~s~Dg~i~~Wd~  281 (303)
                      ..+.+++|  |   .+..+|++|+.||.+.++|++....   +...--|...|..+.|+.++ +.+.+-+...+..+|++
T Consensus       167 DVT~lrFH--P---~~pnlLlSGSvDGLvnlfD~~~d~EeDaL~~viN~~sSI~~igw~~~~ykrI~clTH~Etf~~~el  241 (376)
T KOG1188|consen  167 DVTQLRFH--P---SDPNLLLSGSVDGLVNLFDTKKDNEEDALLHVINHGSSIHLIGWLSKKYKRIMCLTHMETFAIYEL  241 (376)
T ss_pred             cceeEEec--C---CCCCeEEeecccceEEeeecCCCcchhhHHHhhcccceeeeeeeecCCcceEEEEEccCceeEEEc
Confidence            23444433  2   2457899999999999999975532   22223467789999999988 35888888999999998


Q ss_pred             CCC
Q 022074          282 PGN  284 (303)
Q Consensus       282 ~~~  284 (303)
                      .-.
T Consensus       242 e~~  244 (376)
T KOG1188|consen  242 EDG  244 (376)
T ss_pred             cCC
Confidence            643


No 168
>COG2319 FOG: WD40 repeat [General function prediction only]
Probab=99.61  E-value=1e-12  Score=117.90  Aligned_cols=230  Identities=32%  Similarity=0.522  Sum_probs=163.6

Q ss_pred             ccCchhhccccccccccCcCcccccCCCcccceEEEEEcCCCCEEEEeeC-CCeEEEEECCCCceEEEEecccCCeEEEE
Q 022074           10 VGSGTMESLANVTEIHDGLDFSAADDGGYSFGIFSLKFSTDGRELVAGSS-DDCIYVYDLEANKLSLRILAHTSDVNTVC   88 (303)
Q Consensus        10 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~s~~g~~l~sgs~-Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~   88 (303)
                      ....+.+..+.+|+.-..... -....+|...|.+++|+|+++.+++++. |+.+++|++..+.....+..|...|.++.
T Consensus       127 ~~~~~~d~~~~~~~~~~~~~~-~~~~~~~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~  205 (466)
T COG2319         127 LASSSLDGTVKLWDLSTPGKL-IRTLEGHSESVTSLAFSPDGKLLASGSSLDGTIKLWDLRTGKPLSTLAGHTDPVSSLA  205 (466)
T ss_pred             eccCCCCccEEEEEecCCCeE-EEEEecCcccEEEEEECCCCCEEEecCCCCCceEEEEcCCCceEEeeccCCCceEEEE
Confidence            344455556677766331111 2233789999999999999998888885 99999999998777777888999999999


Q ss_pred             EccCCCc-EEEEecCCCeEEEEcCccccCCCccce-eecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCccc
Q 022074           89 FGDESGH-LIYSGSDDNLCKVWDRRCLNVKGKPAG-VLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASC  166 (303)
Q Consensus        89 ~~~~~~~-~l~s~s~dg~v~lWd~~~~~~~~~~~~-~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~  166 (303)
                      |++ .+. .+++++.|+++++||..    ...... .+.+|...+ ...+++++.++++++.|+.+++||++....    
T Consensus       206 ~~~-~~~~~~~~~~~d~~i~~wd~~----~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~----  275 (466)
T COG2319         206 FSP-DGGLLIASGSSDGTIRLWDLS----TGKLLRSTLSGHSDSV-VSSFSPDGSLLASGSSDGTIRLWDLRSSSS----  275 (466)
T ss_pred             EcC-CcceEEEEecCCCcEEEEECC----CCcEEeeecCCCCcce-eEeECCCCCEEEEecCCCcEEEeeecCCCc----
Confidence            984 454 66666999999999864    233343 577787775 447888888899999999999999875321    


Q ss_pred             ccCccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEE
Q 022074          167 NLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVA  246 (303)
Q Consensus       167 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~  246 (303)
                                                   ......+|..  .+..    ..++|++..+++++.|+.+.+||..+.....
T Consensus       276 -----------------------------~~~~~~~~~~--~v~~----~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~  320 (466)
T COG2319         276 -----------------------------LLRTLSGHSS--SVLS----VAFSPDGKLLASGSSDGTVRLWDLETGKLLS  320 (466)
T ss_pred             -----------------------------EEEEEecCCc--cEEE----EEECCCCCEEEEeeCCCcEEEEEcCCCceEE
Confidence                                         0000011110  0111    1345566777779889999999998887666


Q ss_pred             Eee--cCCCCeEEEEECCCCCeEEEE-eCCCCEEEeecCCCC
Q 022074          247 ALK--YHTSPVRDCSWHPSQPMLVSS-SWDGDVVRWEFPGNG  285 (303)
Q Consensus       247 ~~~--~h~~~I~~v~~sp~~~~las~-s~Dg~i~~Wd~~~~~  285 (303)
                      ...  .|...+..+.|++++..++.+ ..|+.+.+|+.....
T Consensus       321 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~  362 (466)
T COG2319         321 SLTLKGHEGPVSSLSFSPDGSLLVSGGSDDGTIRLWDLRTGK  362 (466)
T ss_pred             EeeecccCCceEEEEECCCCCEEEEeecCCCcEEeeecCCCc
Confidence            655  788889999994342455555 688999999987553


No 169
>KOG1445 consensus Tumor-specific antigen (contains WD repeats) [Cytoskeleton]
Probab=99.60  E-value=7.2e-15  Score=132.88  Aligned_cols=201  Identities=18%  Similarity=0.227  Sum_probs=140.3

Q ss_pred             cceEEEEEcC-CCCEEEEeeCCCeEEEEECCCCce-------EEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcC
Q 022074           40 FGIFSLKFST-DGRELVAGSSDDCIYVYDLEANKL-------SLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDR  111 (303)
Q Consensus        40 ~~v~~l~~s~-~g~~l~sgs~Dg~v~lwd~~~~~~-------~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~  111 (303)
                      ..|..+.|.| |.++|++++.||.|+||.+..+.+       ...+..|...|+.+.|+|--.++|++++.|-+|++||+
T Consensus       628 t~vtDl~WdPFD~~rLAVa~ddg~i~lWr~~a~gl~e~~~tPe~~lt~h~eKI~slRfHPLAadvLa~asyd~Ti~lWDl  707 (1012)
T KOG1445|consen  628 TLVTDLHWDPFDDERLAVATDDGQINLWRLTANGLPENEMTPEKILTIHGEKITSLRFHPLAADVLAVASYDSTIELWDL  707 (1012)
T ss_pred             ceeeecccCCCChHHeeecccCceEEEEEeccCCCCcccCCcceeeecccceEEEEEecchhhhHhhhhhccceeeeeeh
Confidence            4688899998 777899999999999999987643       34577899999999998877889999999999999998


Q ss_pred             ccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccC
Q 022074          112 RCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKH  191 (303)
Q Consensus       112 ~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  191 (303)
                      +    +......+.||.+.|..++|+++|++++|.+.|+++|+|..|+......        +     -.          
T Consensus       708 ~----~~~~~~~l~gHtdqIf~~AWSpdGr~~AtVcKDg~~rVy~Prs~e~pv~--------E-----g~----------  760 (1012)
T KOG1445|consen  708 A----NAKLYSRLVGHTDQIFGIAWSPDGRRIATVCKDGTLRVYEPRSREQPVY--------E-----GK----------  760 (1012)
T ss_pred             h----hhhhhheeccCcCceeEEEECCCCcceeeeecCceEEEeCCCCCCCccc--------c-----CC----------
Confidence            6    3344567899999999999999999999999999999999876432100        0     00          


Q ss_pred             CCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeC----CCeEEEEECCCCe--EEEEeecCCC-CeEEEEECCCC
Q 022074          192 PCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSH----DSCVYVYDLVSGE--QVAALKYHTS-PVRDCSWHPSQ  264 (303)
Q Consensus       192 ~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~----dg~i~iwd~~~~~--~~~~~~~h~~-~I~~v~~sp~~  264 (303)
                         ..+.    ..    ..++.    |.-+|+++++.|.    +..|.+||..+-.  .+++..-... .+.-=.+.+|.
T Consensus       761 ---gpvg----tR----gARi~----wacdgr~viv~Gfdk~SeRQv~~Y~Aq~l~~~pl~t~~lDvaps~LvP~YD~Ds  825 (1012)
T KOG1445|consen  761 ---GPVG----TR----GARIL----WACDGRIVIVVGFDKSSERQVQMYDAQTLDLRPLYTQVLDVAPSPLVPHYDYDS  825 (1012)
T ss_pred             ---CCcc----Cc----ceeEE----EEecCcEEEEecccccchhhhhhhhhhhccCCcceeeeecccCccccccccCCC
Confidence               0000    00    01111    2235677666654    4558888876533  2322211111 11111234454


Q ss_pred             C-eEEEEeCCCCEEEeecC
Q 022074          265 P-MLVSSSWDGDVVRWEFP  282 (303)
Q Consensus       265 ~-~las~s~Dg~i~~Wd~~  282 (303)
                      + +++||-.|..+.++++-
T Consensus       826 ~~lfltGKGD~~v~~yEv~  844 (1012)
T KOG1445|consen  826 NVLFLTGKGDRFVNMYEVI  844 (1012)
T ss_pred             ceEEEecCCCceEEEEEec
Confidence            4 68899999999999863


No 170
>KOG0303 consensus Actin-binding protein Coronin, contains WD40 repeats [Cytoskeleton]
Probab=99.60  E-value=1.6e-14  Score=124.13  Aligned_cols=166  Identities=19%  Similarity=0.297  Sum_probs=126.8

Q ss_pred             EEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccC---CCccceeecccccCeEEEEeCCC-CCEEEEEeCCCc
Q 022074           76 RILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNV---KGKPAGVLMGHLEGITFIDSRGD-GRYLISNGKDQA  151 (303)
Q Consensus        76 ~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~---~~~~~~~~~~h~~~v~~~~~~~~-~~~l~s~~~D~~  151 (303)
                      .+.+|++.|..+.|+|-+++.++|||+|.+|.+|.+-....   ...+...+.||...|--+.++|. .+.|+|+|.|.+
T Consensus        76 ~v~GHt~~vLDi~w~PfnD~vIASgSeD~~v~vW~IPe~~l~~~ltepvv~L~gH~rrVg~V~wHPtA~NVLlsag~Dn~  155 (472)
T KOG0303|consen   76 LVCGHTAPVLDIDWCPFNDCVIASGSEDTKVMVWQIPENGLTRDLTEPVVELYGHQRRVGLVQWHPTAPNVLLSAGSDNT  155 (472)
T ss_pred             CccCccccccccccCccCCceeecCCCCceEEEEECCCcccccCcccceEEEeecceeEEEEeecccchhhHhhccCCce
Confidence            46789999999999998899999999999999998742211   12456778899999999999884 577999999999


Q ss_pred             EEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCC
Q 022074          152 IKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHD  231 (303)
Q Consensus       152 v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~d  231 (303)
                      |.+||+.....                                  +.+++ |....      .+..|+.+|.+|+|.+.|
T Consensus       156 v~iWnv~tgea----------------------------------li~l~-hpd~i------~S~sfn~dGs~l~TtckD  194 (472)
T KOG0303|consen  156 VSIWNVGTGEA----------------------------------LITLD-HPDMV------YSMSFNRDGSLLCTTCKD  194 (472)
T ss_pred             EEEEeccCCce----------------------------------eeecC-CCCeE------EEEEeccCCceeeeeccc
Confidence            99999864321                                  11111 22111      234577899999999999


Q ss_pred             CeEEEEECCCCeEEEEeecCCC-CeEEEEECCCCCeEEEE---eCCCCEEEeecC
Q 022074          232 SCVYVYDLVSGEQVAALKYHTS-PVRDCSWHPSQPMLVSS---SWDGDVVRWEFP  282 (303)
Q Consensus       232 g~i~iwd~~~~~~~~~~~~h~~-~I~~v~~sp~~~~las~---s~Dg~i~~Wd~~  282 (303)
                      +.|||||.++++.+.+-.+|++ .-..+-|-.++.++-||   ..++.+-+||..
T Consensus       195 KkvRv~dpr~~~~v~e~~~heG~k~~Raifl~~g~i~tTGfsr~seRq~aLwdp~  249 (472)
T KOG0303|consen  195 KKVRVIDPRRGTVVSEGVAHEGAKPARAIFLASGKIFTTGFSRMSERQIALWDPN  249 (472)
T ss_pred             ceeEEEcCCCCcEeeecccccCCCcceeEEeccCceeeeccccccccceeccCcc
Confidence            9999999999999888777874 44566777888844333   347899999964


No 171
>KOG1188 consensus WD40 repeat protein [General function prediction only]
Probab=99.60  E-value=2.3e-14  Score=121.01  Aligned_cols=244  Identities=16%  Similarity=0.193  Sum_probs=158.6

Q ss_pred             CchhhccccccccccCcCcccccCCCcc-cceEEEEEcCCCCEEEEee----CCCeEEEEECCCCce-EEE-EecccCCe
Q 022074           12 SGTMESLANVTEIHDGLDFSAADDGGYS-FGIFSLKFSTDGRELVAGS----SDDCIYVYDLEANKL-SLR-ILAHTSDV   84 (303)
Q Consensus        12 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~v~~l~~s~~g~~l~sgs----~Dg~v~lwd~~~~~~-~~~-~~~h~~~v   84 (303)
                      +++.|.-+.+|++-...+-.-+...+|. -+..|++.+.+++.+++|+    .|..|.+||.+..+. ... ...|.+.|
T Consensus        89 s~ssDG~Vr~wD~Rs~~e~a~~~~~~~~~~~f~~ld~nck~~ii~~GtE~~~s~A~v~lwDvR~~qq~l~~~~eSH~DDV  168 (376)
T KOG1188|consen   89 SCSSDGTVRLWDIRSQAESARISWTQQSGTPFICLDLNCKKNIIACGTELTRSDASVVLWDVRSEQQLLRQLNESHNDDV  168 (376)
T ss_pred             EeccCCeEEEEEeecchhhhheeccCCCCCcceEeeccCcCCeEEeccccccCceEEEEEEeccccchhhhhhhhccCcc
Confidence            5778888899988554444333445565 5788888888888888886    477899999988664 222 34699999


Q ss_pred             EEEEEccCCCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCC-CEEEEEeCCCcEEEEEcccccCC
Q 022074           85 NTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDG-RYLISNGKDQAIKLWDIRKMSSN  163 (303)
Q Consensus        85 ~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~-~~l~s~~~D~~v~lWdl~~~~~~  163 (303)
                      ++++|+|.++++|+|||-||.|.++|++..+... +......|..+|..+.|..++ ..|.+-+-+.+..+|++......
T Consensus       169 T~lrFHP~~pnlLlSGSvDGLvnlfD~~~d~EeD-aL~~viN~~sSI~~igw~~~~ykrI~clTH~Etf~~~ele~~~~~  247 (376)
T KOG1188|consen  169 TQLRFHPSDPNLLLSGSVDGLVNLFDTKKDNEED-ALLHVINHGSSIHLIGWLSKKYKRIMCLTHMETFAIYELEDGSEE  247 (376)
T ss_pred             eeEEecCCCCCeEEeecccceEEeeecCCCcchh-hHHHhhcccceeeeeeeecCCcceEEEEEccCceeEEEccCCChh
Confidence            9999999999999999999999999986332222 222233577789999998776 45888899999999998765432


Q ss_pred             cccccCccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeC-CCeEEEEEC---
Q 022074          164 ASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSH-DSCVYVYDL---  239 (303)
Q Consensus       164 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~-dg~i~iwd~---  239 (303)
                      ...+..  .....                ..+...      ...+++.++.    ..+...++.++. -+...++-.   
T Consensus       248 ~~~~~~--~~~~~----------------d~r~~~------~~dY~I~~~~----~~~~~~~~l~g~~~n~~~~~~~~~~  299 (376)
T KOG1188|consen  248 TWLENP--DVSAD----------------DLRKED------NCDYVINEHS----PGDKDTCALAGTDSNKGTIFPLVDT  299 (376)
T ss_pred             hcccCc--cchhh----------------hHHhhh------hhhheeeccc----CCCcceEEEeccccCceeEEEeeec
Confidence            211110  00000                000000      0001111111    113334444443 444444432   


Q ss_pred             CCCe---EEEEeec-CCCCeEEEEECCCCCeEEEEeCCCCEEEeecCCC
Q 022074          240 VSGE---QVAALKY-HTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFPGN  284 (303)
Q Consensus       240 ~~~~---~~~~~~~-h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~~~  284 (303)
                      .++.   .+..+.+ |..-|.++.|.-.+.++.|||+||.+.+|..+..
T Consensus       300 ~s~~~~~~~a~l~g~~~eiVR~i~~~~~~~~l~TGGEDG~l~~Wk~~da  348 (376)
T KOG1188|consen  300 SSGSLLTEPAILQGGHEEIVRDILFDVKNDVLYTGGEDGLLQAWKVEDA  348 (376)
T ss_pred             ccccccCccccccCCcHHHHHHHhhhcccceeeccCCCceEEEEecCCc
Confidence            3333   3445554 6677899999988999999999999999996544


No 172
>KOG4227 consensus WD40 repeat protein [General function prediction only]
Probab=99.60  E-value=2.2e-13  Score=117.18  Aligned_cols=263  Identities=16%  Similarity=0.175  Sum_probs=169.5

Q ss_pred             cCchhh-ccccccccccCc--CcccccCCCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccC---Ce
Q 022074           11 GSGTME-SLANVTEIHDGL--DFSAADDGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTS---DV   84 (303)
Q Consensus        11 ~~~~~~-~~~~~~~~~~~~--~~~~~~~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~---~v   84 (303)
                      |--+|- -+|||-+..--.  +++.--.-.|+..|+|++|.....++++|+.+++|.+-|+.+.+.+. +..|+.   .|
T Consensus        74 GGDD~~~~~W~~de~~~~k~~KPI~~~~~~H~SNIF~L~F~~~N~~~~SG~~~~~VI~HDiEt~qsi~-V~~~~~~~~~V  152 (609)
T KOG4227|consen   74 GGDDMHGRVWNVDELMVRKTPKPIGVMEHPHRSNIFSLEFDLENRFLYSGERWGTVIKHDIETKQSIY-VANENNNRGDV  152 (609)
T ss_pred             cCCcceeeeechHHHHhhcCCCCceeccCccccceEEEEEccCCeeEecCCCcceeEeeecccceeee-eecccCcccce
Confidence            444444 466776654433  34433335788899999999999999999999999999999987664 445654   78


Q ss_pred             EEEEEccCCCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCC-CEEEEEeCCCcEEEEEcccccCC
Q 022074           85 NTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDG-RYLISNGKDQAIKLWDIRKMSSN  163 (303)
Q Consensus        85 ~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~-~~l~s~~~D~~v~lWdl~~~~~~  163 (303)
                      ..+..+|. ++.|++.+.++.|.+||.+.......+. ++.....+...+-|+|.. .+|++++.-+.+.+||.|+....
T Consensus       153 Y~m~~~P~-DN~~~~~t~~~~V~~~D~Rd~~~~~~~~-~~AN~~~~F~t~~F~P~~P~Li~~~~~~~G~~~~D~R~~~~~  230 (609)
T KOG4227|consen  153 YHMDQHPT-DNTLIVVTRAKLVSFIDNRDRQNPISLV-LPANSGKNFYTAEFHPETPALILVNSETGGPNVFDRRMQARP  230 (609)
T ss_pred             eecccCCC-CceEEEEecCceEEEEeccCCCCCCcee-eecCCCccceeeeecCCCceeEEeccccCCCCceeeccccch
Confidence            88888665 7899999999999999987433222222 223344567777788754 56789999999999999874321


Q ss_pred             cccccCccceee-ec--eeeeCCCCCccccCC----C-------CC--cceEEe----cccceeeeEEEeeeeeeeCCCe
Q 022074          164 ASCNLGFRSYEW-DY--RWMDYPPQARDLKHP----C-------DQ--SVATYK----GHSVLRTLIRCHFSPVYSTGQK  223 (303)
Q Consensus       164 ~~~~~~~~~~~~-~~--~~~~~~~~~~~~~~~----~-------~~--~~~~~~----~~~~~~~~~~~~~~~~~s~~~~  223 (303)
                      .-....+.++.- ..  ....+.+.+..+...    |       .+  .+..++    |.....++..|.|.     +..
T Consensus       231 ~~~~~~~~~L~~~~~~~M~~~~~~~G~Q~msiRR~~~P~~~D~~S~R~~V~k~D~N~~GY~N~~T~KS~~F~-----~D~  305 (609)
T KOG4227|consen  231 VYQRSMFKGLPQENTEWMGSLWSPSGNQFMSIRRGKCPLYFDFISQRCFVLKSDHNPNGYCNIKTIKSMTFI-----DDY  305 (609)
T ss_pred             HHhhhccccCcccchhhhheeeCCCCCeehhhhccCCCEEeeeecccceeEeccCCCCcceeeeeeeeeeee-----cce
Confidence            111111111111 11  111222322211110    0       00  111111    22223333333332     234


Q ss_pred             EEEEEeCCCeEEEEECCCC-----------------------eEEEEeecCCCCeEEEEECCCCCeEEEEeCCCCEEEee
Q 022074          224 YIYTGSHDSCVYVYDLVSG-----------------------EQVAALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWE  280 (303)
Q Consensus       224 ~latg~~dg~i~iwd~~~~-----------------------~~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd  280 (303)
                      .+++|+.+-.|++|.+...                       +.+..+++|..-++.|.|+|...+|++.+-...+++|.
T Consensus       306 ~v~tGSD~~~i~~WklP~~~ds~G~~~IG~~~~~~~~~~~i~~~~~VLrGHRSv~NQVRF~~H~~~l~SSGVE~~~KlWS  385 (609)
T KOG4227|consen  306 TVATGSDHWGIHIWKLPRANDSYGFTQIGHDEEEMPSEIFIEKELTVLRGHRSVPNQVRFSQHNNLLVSSGVENSFKLWS  385 (609)
T ss_pred             eeeccCcccceEEEecCCCccccCccccCcchhhCchhheecceeEEEecccccccceeecCCcceEeccchhhheeccc
Confidence            5999999999999986321                       23456789999999999999999999999999999996


Q ss_pred             c
Q 022074          281 F  281 (303)
Q Consensus       281 ~  281 (303)
                      .
T Consensus       386 ~  386 (609)
T KOG4227|consen  386 D  386 (609)
T ss_pred             c
Confidence            3


No 173
>KOG4378 consensus Nuclear protein COP1 [Signal transduction mechanisms]
Probab=99.58  E-value=2.1e-13  Score=120.42  Aligned_cols=185  Identities=18%  Similarity=0.253  Sum_probs=142.0

Q ss_pred             CCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecc-cCCeEEEEEccCCCcEEEEecCCCeEEEEcCccc
Q 022074           36 GGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAH-TSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCL  114 (303)
Q Consensus        36 ~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h-~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~  114 (303)
                      -+|..-|.++.++-...+||+++..|.|.|-.+.++.....+... .+.|.-+.|++....+|.+++.+|.|.+||....
T Consensus       118 kdh~stvt~v~YN~~DeyiAsvs~gGdiiih~~~t~~~tt~f~~~sgqsvRll~ys~skr~lL~~asd~G~VtlwDv~g~  197 (673)
T KOG4378|consen  118 KDHQSTVTYVDYNNTDEYIASVSDGGDIIIHGTKTKQKTTTFTIDSGQSVRLLRYSPSKRFLLSIASDKGAVTLWDVQGM  197 (673)
T ss_pred             cCCcceeEEEEecCCcceeEEeccCCcEEEEecccCccccceecCCCCeEEEeecccccceeeEeeccCCeEEEEeccCC
Confidence            588899999999999999999999999999999998765555433 3456688998877778889999999999997532


Q ss_pred             cCCCccceeecccccCeEEEEeCCCC-CEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCC
Q 022074          115 NVKGKPAGVLMGHLEGITFIDSRGDG-RYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPC  193 (303)
Q Consensus       115 ~~~~~~~~~~~~h~~~v~~~~~~~~~-~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  193 (303)
                      .   ........|..+...+.|+|.. .+|++.|.|+.|.+||.+........              .+           
T Consensus       198 s---p~~~~~~~HsAP~~gicfspsne~l~vsVG~Dkki~~yD~~s~~s~~~l--------------~y-----------  249 (673)
T KOG4378|consen  198 S---PIFHASEAHSAPCRGICFSPSNEALLVSVGYDKKINIYDIRSQASTDRL--------------TY-----------  249 (673)
T ss_pred             C---cccchhhhccCCcCcceecCCccceEEEecccceEEEeeccccccccee--------------ee-----------
Confidence            1   1122345688888888898865 45789999999999998743211000              00           


Q ss_pred             CCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCC-eEEEEeecCCCCeEEEEECCCC
Q 022074          194 DQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSG-EQVAALKYHTSPVRDCSWHPSQ  264 (303)
Q Consensus       194 ~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~-~~~~~~~~h~~~I~~v~~sp~~  264 (303)
                              .+.        ....+|+++|.+|+.|...|.|..||++.- .++..+..|...|++++|-|..
T Consensus       250 --------~~P--------lstvaf~~~G~~L~aG~s~G~~i~YD~R~~k~Pv~v~sah~~sVt~vafq~s~  305 (673)
T KOG4378|consen  250 --------SHP--------LSTVAFSECGTYLCAGNSKGELIAYDMRSTKAPVAVRSAHDASVTRVAFQPSP  305 (673)
T ss_pred             --------cCC--------cceeeecCCceEEEeecCCceEEEEecccCCCCceEeeecccceeEEEeeecc
Confidence                    000        012347889999999999999999999854 4588899999999999998764


No 174
>KOG2445 consensus Nuclear pore complex component (sc Seh1) [Nuclear structure; Intracellular trafficking, secretion, and vesicular transport]
Probab=99.57  E-value=3e-13  Score=112.99  Aligned_cols=200  Identities=23%  Similarity=0.348  Sum_probs=122.3

Q ss_pred             ecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCC--CCEEEEEeCCCcEEEE
Q 022074           78 LAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGD--GRYLISNGKDQAIKLW  155 (303)
Q Consensus        78 ~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~--~~~l~s~~~D~~v~lW  155 (303)
                      .+|.+-|.++.| ...|+++++|+.|++|++||.+...-+.........|.++|..+.+.+.  |+.+++++.|++++||
T Consensus        10 s~h~DlihdVs~-D~~GRRmAtCSsDq~vkI~d~~~~s~~W~~Ts~Wrah~~Si~rV~WAhPEfGqvvA~cS~Drtv~iW   88 (361)
T KOG2445|consen   10 SGHKDLIHDVSF-DFYGRRMATCSSDQTVKIWDSTSDSGTWSCTSSWRAHDGSIWRVVWAHPEFGQVVATCSYDRTVSIW   88 (361)
T ss_pred             cCCcceeeeeee-cccCceeeeccCCCcEEEEeccCCCCceEEeeeEEecCCcEEEEEecCccccceEEEEecCCceeee
Confidence            468888999999 4679999999999999999965333333444456679999999888653  7889999999999999


Q ss_pred             EcccccCCcccccCccceeeece-----------eeeCCCCCccc---cCCCCCcceEEeccccee---ee----EE---
Q 022074          156 DIRKMSSNASCNLGFRSYEWDYR-----------WMDYPPQARDL---KHPCDQSVATYKGHSVLR---TL----IR---  211 (303)
Q Consensus       156 dl~~~~~~~~~~~~~~~~~~~~~-----------~~~~~~~~~~~---~~~~~~~~~~~~~~~~~~---~~----~~---  211 (303)
                      .=.......      ....|..+           -..|.|.-..+   ....+..+..+..-....   ..    +.   
T Consensus        89 EE~~~~~~~------~~~~Wv~~ttl~DsrssV~DV~FaP~hlGLklA~~~aDG~lRIYEA~dp~nLs~W~Lq~Ei~~~~  162 (361)
T KOG2445|consen   89 EEQEKSEEA------HGRRWVRRTTLVDSRSSVTDVKFAPKHLGLKLAAASADGILRIYEAPDPMNLSQWTLQHEIQNVI  162 (361)
T ss_pred             eeccccccc------ccceeEEEEEeecCCcceeEEEecchhcceEEEEeccCcEEEEEecCCccccccchhhhhhhhcc
Confidence            642211111      11112111           11222221111   111222333332111000   00    00   


Q ss_pred             --------EeeeeeeeC---CCeEEEEEeCC-----CeEEEEECCCC----eEEEEeecCCCCeEEEEECCCC-C---eE
Q 022074          212 --------CHFSPVYST---GQKYIYTGSHD-----SCVYVYDLVSG----EQVAALKYHTSPVRDCSWHPSQ-P---ML  267 (303)
Q Consensus       212 --------~~~~~~~s~---~~~~latg~~d-----g~i~iwd~~~~----~~~~~~~~h~~~I~~v~~sp~~-~---~l  267 (303)
                              -.++..+++   ...+||.|+.+     +.+.||.....    ..+.++..|.+||++++|.|.- +   +|
T Consensus       163 ~pp~~~~~~~~CvsWn~sr~~~p~iAvgs~e~a~~~~~~~Iye~~e~~rKw~kva~L~d~~dpI~di~wAPn~Gr~y~~l  242 (361)
T KOG2445|consen  163 DPPGKNKQPCFCVSWNPSRMHEPLIAVGSDEDAPHLNKVKIYEYNENGRKWLKVAELPDHTDPIRDISWAPNIGRSYHLL  242 (361)
T ss_pred             CCcccccCcceEEeeccccccCceEEEEcccCCccccceEEEEecCCcceeeeehhcCCCCCcceeeeeccccCCceeeE
Confidence                    001111121   23567777766     47888876433    2456778999999999999973 3   89


Q ss_pred             EEEeCCCCEEEeecCCCC
Q 022074          268 VSSSWDGDVVRWEFPGNG  285 (303)
Q Consensus       268 as~s~Dg~i~~Wd~~~~~  285 (303)
                      |+|+.|| +++|++...+
T Consensus       243 AvA~kDg-v~I~~v~~~~  259 (361)
T KOG2445|consen  243 AVATKDG-VRIFKVKVAR  259 (361)
T ss_pred             EEeecCc-EEEEEEeecc
Confidence            9999999 9999998544


No 175
>KOG1523 consensus Actin-related protein Arp2/3 complex, subunit ARPC1/p41-ARC [Cytoskeleton]
Probab=99.57  E-value=2.2e-13  Score=114.33  Aligned_cols=207  Identities=17%  Similarity=0.249  Sum_probs=156.0

Q ss_pred             ceEEEEEcCCCCEEEEeeCCCeEEEEECCCCce---EEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCC
Q 022074           41 GIFSLKFSTDGRELVAGSSDDCIYVYDLEANKL---SLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVK  117 (303)
Q Consensus        41 ~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~---~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~  117 (303)
                      +|+|.+|++|+..+|++-+...|.||.....++   ..++..|+..|+.+.|++. .+.+++++.|...++|..... -+
T Consensus        12 pitchAwn~drt~iAv~~~~~evhiy~~~~~~~w~~~htls~Hd~~vtgvdWap~-snrIvtcs~drnayVw~~~~~-~~   89 (361)
T KOG1523|consen   12 PITCHAWNSDRTQIAVSPNNHEVHIYSMLGADLWEPAHTLSEHDKIVTGVDWAPK-SNRIVTCSHDRNAYVWTQPSG-GT   89 (361)
T ss_pred             ceeeeeecCCCceEEeccCCceEEEEEecCCCCceeceehhhhCcceeEEeecCC-CCceeEccCCCCccccccCCC-Ce
Confidence            599999999999999999999999999987763   3567789999999999764 678999999999999986322 23


Q ss_pred             CccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCcc
Q 022074          118 GKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSV  197 (303)
Q Consensus       118 ~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  197 (303)
                      .++.-.+..+..+.+++.++|..+.|++||.-+.|.+|=......            |   |.     .+.++-+     
T Consensus        90 WkptlvLlRiNrAAt~V~WsP~enkFAVgSgar~isVcy~E~ENd------------W---WV-----sKhikkP-----  144 (361)
T KOG1523|consen   90 WKPTLVLLRINRAATCVKWSPKENKFAVGSGARLISVCYYEQEND------------W---WV-----SKHIKKP-----  144 (361)
T ss_pred             eccceeEEEeccceeeEeecCcCceEEeccCccEEEEEEEecccc------------e---eh-----hhhhCCc-----
Confidence            456667778999999999999999999999999999996543210            0   00     0000000     


Q ss_pred             eEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECC-----C-------------CeEEEEeecCCCCeEEEE
Q 022074          198 ATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLV-----S-------------GEQVAALKYHTSPVRDCS  259 (303)
Q Consensus       198 ~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~-----~-------------~~~~~~~~~h~~~I~~v~  259 (303)
                             ...++.    +..++|++-+|++|+.|+.+|++..-     .             |+++.++....+.|..+.
T Consensus       145 -------irStv~----sldWhpnnVLlaaGs~D~k~rVfSayIK~Vdekpap~pWgsk~PFG~lm~E~~~~ggwvh~v~  213 (361)
T KOG1523|consen  145 -------IRSTVT----SLDWHPNNVLLAAGSTDGKCRVFSAYIKGVDEKPAPTPWGSKMPFGQLMSEASSSGGWVHGVL  213 (361)
T ss_pred             -------ccccee----eeeccCCcceecccccCcceeEEEEeeeccccCCCCCCCccCCcHHHHHHhhccCCCceeeeE
Confidence                   001111    23356778899999999999999741     1             123344444567899999


Q ss_pred             ECCCCCeEEEEeCCCCEEEeecCCCC
Q 022074          260 WHPSQPMLVSSSWDGDVVRWEFPGNG  285 (303)
Q Consensus       260 ~sp~~~~las~s~Dg~i~~Wd~~~~~  285 (303)
                      |+|+|+.|+-.+.|..+.+=|..++.
T Consensus       214 fs~sG~~lawv~Hds~v~~~da~~p~  239 (361)
T KOG1523|consen  214 FSPSGNRLAWVGHDSTVSFVDAAGPS  239 (361)
T ss_pred             eCCCCCEeeEecCCCceEEeecCCCc
Confidence            99999999999999999998876654


No 176
>KOG1445 consensus Tumor-specific antigen (contains WD repeats) [Cytoskeleton]
Probab=99.57  E-value=1.6e-14  Score=130.61  Aligned_cols=156  Identities=19%  Similarity=0.289  Sum_probs=117.9

Q ss_pred             EcCCCCEEEE--eeCCCeEEEEECCC-CceEEEE---ecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCcccc---CC
Q 022074           47 FSTDGRELVA--GSSDDCIYVYDLEA-NKLSLRI---LAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLN---VK  117 (303)
Q Consensus        47 ~s~~g~~l~s--gs~Dg~v~lwd~~~-~~~~~~~---~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~---~~  117 (303)
                      |+.+.+++++  .+.-|.|-||++.. |++..-.   ......|..+.|.|-+.++|+.++.||.|++|.+....   ..
T Consensus       587 fcan~~rvAVPL~g~gG~iai~el~~PGrLPDgv~p~l~Ngt~vtDl~WdPFD~~rLAVa~ddg~i~lWr~~a~gl~e~~  666 (1012)
T KOG1445|consen  587 FCANNKRVAVPLAGSGGVIAIYELNEPGRLPDGVMPGLFNGTLVTDLHWDPFDDERLAVATDDGQINLWRLTANGLPENE  666 (1012)
T ss_pred             eeeccceEEEEecCCCceEEEEEcCCCCCCCcccccccccCceeeecccCCCChHHeeecccCceEEEEEeccCCCCccc
Confidence            4445666665  45678999999965 3332211   11235789999988888999999999999999875322   22


Q ss_pred             CccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCcc
Q 022074          118 GKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSV  197 (303)
Q Consensus       118 ~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  197 (303)
                      ..+.+.+..|.+.|+++.|+|-.                                                         
T Consensus       667 ~tPe~~lt~h~eKI~slRfHPLA---------------------------------------------------------  689 (1012)
T KOG1445|consen  667 MTPEKILTIHGEKITSLRFHPLA---------------------------------------------------------  689 (1012)
T ss_pred             CCcceeeecccceEEEEEecchh---------------------------------------------------------
Confidence            34556666777777776665410                                                         


Q ss_pred             eEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEECCCCCeEEEEeCCCCEE
Q 022074          198 ATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQPMLVSSSWDGDVV  277 (303)
Q Consensus       198 ~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~  277 (303)
                                              ...|++++.|-+|++||+.+++....+.+|++.|.+++|||||+.+||.+-||+++
T Consensus       690 ------------------------advLa~asyd~Ti~lWDl~~~~~~~~l~gHtdqIf~~AWSpdGr~~AtVcKDg~~r  745 (1012)
T KOG1445|consen  690 ------------------------ADVLAVASYDSTIELWDLANAKLYSRLVGHTDQIFGIAWSPDGRRIATVCKDGTLR  745 (1012)
T ss_pred             ------------------------hhHhhhhhccceeeeeehhhhhhhheeccCcCceeEEEECCCCcceeeeecCceEE
Confidence                                    13477888888899999988888888999999999999999999999999999999


Q ss_pred             EeecCC
Q 022074          278 RWEFPG  283 (303)
Q Consensus       278 ~Wd~~~  283 (303)
                      +++...
T Consensus       746 Vy~Prs  751 (1012)
T KOG1445|consen  746 VYEPRS  751 (1012)
T ss_pred             EeCCCC
Confidence            999764


No 177
>KOG2055 consensus WD40 repeat protein [General function prediction only]
Probab=99.56  E-value=4.9e-13  Score=117.05  Aligned_cols=199  Identities=18%  Similarity=0.270  Sum_probs=135.0

Q ss_pred             ccceEEEEEcCCCC-EEEEeeCCCeEEEEECCCCceEE--EEeccc-CCeEEEEEccCCCcEEEEecCCCeEEEEcCccc
Q 022074           39 SFGIFSLKFSTDGR-ELVAGSSDDCIYVYDLEANKLSL--RILAHT-SDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCL  114 (303)
Q Consensus        39 ~~~v~~l~~s~~g~-~l~sgs~Dg~v~lwd~~~~~~~~--~~~~h~-~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~  114 (303)
                      ++||.++.|.|+|. .+++++...-.+.||+.+++...  ...++. ..+.....++ +++.++..+..|.|.|--..  
T Consensus       257 ~fPi~~a~f~p~G~~~i~~s~rrky~ysyDle~ak~~k~~~~~g~e~~~~e~FeVSh-d~~fia~~G~~G~I~lLhak--  333 (514)
T KOG2055|consen  257 KFPIQKAEFAPNGHSVIFTSGRRKYLYSYDLETAKVTKLKPPYGVEEKSMERFEVSH-DSNFIAIAGNNGHIHLLHAK--  333 (514)
T ss_pred             cCccceeeecCCCceEEEecccceEEEEeeccccccccccCCCCcccchhheeEecC-CCCeEEEcccCceEEeehhh--
Confidence            47899999999999 89999999999999999987543  222333 3455666665 46799999999999986543  


Q ss_pred             cCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCC
Q 022074          115 NVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCD  194 (303)
Q Consensus       115 ~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  194 (303)
                        ++..+..+. -.+.|..+.|+.+++.|+.++.+|.|.+||++.....    .     .|.    +            +
T Consensus       334 --T~eli~s~K-ieG~v~~~~fsSdsk~l~~~~~~GeV~v~nl~~~~~~----~-----rf~----D------------~  385 (514)
T KOG2055|consen  334 --TKELITSFK-IEGVVSDFTFSSDSKELLASGGTGEVYVWNLRQNSCL----H-----RFV----D------------D  385 (514)
T ss_pred             --hhhhhheee-eccEEeeEEEecCCcEEEEEcCCceEEEEecCCcceE----E-----EEe----e------------c
Confidence              333333332 2356788889999999999999999999999864211    0     000    0            0


Q ss_pred             CcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCC------eEEE----------------------
Q 022074          195 QSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSG------EQVA----------------------  246 (303)
Q Consensus       195 ~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~------~~~~----------------------  246 (303)
                      ..+              +-.+...|.+++|||+|+..|.|-|||..+-      ++++                      
T Consensus       386 G~v--------------~gts~~~S~ng~ylA~GS~~GiVNIYd~~s~~~s~~PkPik~~dNLtt~Itsl~Fn~d~qiLA  451 (514)
T KOG2055|consen  386 GSV--------------HGTSLCISLNGSYLATGSDSGIVNIYDGNSCFASTNPKPIKTVDNLTTAITSLQFNHDAQILA  451 (514)
T ss_pred             Ccc--------------ceeeeeecCCCceEEeccCcceEEEeccchhhccCCCCchhhhhhhheeeeeeeeCcchhhhh
Confidence            000              0001123456777777777777777774321      0000                      


Q ss_pred             -------------------Ee---e---cCCCCeEEEEECCCCCeEEEEeCCCCEEEeecC
Q 022074          247 -------------------AL---K---YHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFP  282 (303)
Q Consensus       247 -------------------~~---~---~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~  282 (303)
                                         .|   .   ..-+.|+|++|||.+.+||.|.++|.+.+|.+.
T Consensus       452 iaS~~~knalrLVHvPS~TVFsNfP~~n~~vg~vtc~aFSP~sG~lAvGNe~grv~l~kL~  512 (514)
T KOG2055|consen  452 IASRVKKNALRLVHVPSCTVFSNFPTSNTKVGHVTCMAFSPNSGYLAVGNEAGRVHLFKLH  512 (514)
T ss_pred             hhhhccccceEEEeccceeeeccCCCCCCcccceEEEEecCCCceEEeecCCCceeeEeec
Confidence                               00   0   112468999999999999999999999999863


No 178
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=99.56  E-value=3.3e-12  Score=113.74  Aligned_cols=211  Identities=10%  Similarity=0.165  Sum_probs=126.5

Q ss_pred             ceEEEEEcCCCCEEEEeeC-CCeEEEEECCCCc-eE--EEEecccCCeEEEEEccCCCcEE-EEecCCCeEEEEcCcccc
Q 022074           41 GIFSLKFSTDGRELVAGSS-DDCIYVYDLEANK-LS--LRILAHTSDVNTVCFGDESGHLI-YSGSDDNLCKVWDRRCLN  115 (303)
Q Consensus        41 ~v~~l~~s~~g~~l~sgs~-Dg~v~lwd~~~~~-~~--~~~~~h~~~v~~l~~~~~~~~~l-~s~s~dg~v~lWd~~~~~  115 (303)
                      ....+.|+|+|+++++++. ++.|.+|++++.. ..  .....+......++++|+ ++.+ ++...++.|.+||+....
T Consensus        81 ~p~~i~~~~~g~~l~v~~~~~~~v~v~~~~~~g~~~~~~~~~~~~~~~~~~~~~p~-g~~l~v~~~~~~~v~v~d~~~~g  159 (330)
T PRK11028         81 SPTHISTDHQGRFLFSASYNANCVSVSPLDKDGIPVAPIQIIEGLEGCHSANIDPD-NRTLWVPCLKEDRIRLFTLSDDG  159 (330)
T ss_pred             CceEEEECCCCCEEEEEEcCCCeEEEEEECCCCCCCCceeeccCCCcccEeEeCCC-CCEEEEeeCCCCEEEEEEECCCC
Confidence            4567999999999988774 8889999997432 11  111223345667788764 5555 555677999999985311


Q ss_pred             CCC-ccceeec-ccccCeEEEEeCCCCCEEEEEeC-CCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCC
Q 022074          116 VKG-KPAGVLM-GHLEGITFIDSRGDGRYLISNGK-DQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHP  192 (303)
Q Consensus       116 ~~~-~~~~~~~-~h~~~v~~~~~~~~~~~l~s~~~-D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  192 (303)
                      ... ....... ........+.+++++++++++.. +++|.+||+.........       ...+.  ..+...      
T Consensus       160 ~l~~~~~~~~~~~~g~~p~~~~~~pdg~~lyv~~~~~~~v~v~~~~~~~~~~~~-------~~~~~--~~p~~~------  224 (330)
T PRK11028        160 HLVAQEPAEVTTVEGAGPRHMVFHPNQQYAYCVNELNSSVDVWQLKDPHGEIEC-------VQTLD--MMPADF------  224 (330)
T ss_pred             cccccCCCceecCCCCCCceEEECCCCCEEEEEecCCCEEEEEEEeCCCCCEEE-------EEEEe--cCCCcC------
Confidence            000 0000000 11234567889999999877765 999999998632110000       00000  000000      


Q ss_pred             CCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEe-CCCeEEEEECCCCe-E---EEEeecCCCCeEEEEECCCCCeE
Q 022074          193 CDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGS-HDSCVYVYDLVSGE-Q---VAALKYHTSPVRDCSWHPSQPML  267 (303)
Q Consensus       193 ~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~-~dg~i~iwd~~~~~-~---~~~~~~h~~~I~~v~~sp~~~~l  267 (303)
                              .+.   ...    ....++|++++++++. .++.|.+|++.+.. .   +..... ......++|+|+|++|
T Consensus       225 --------~~~---~~~----~~i~~~pdg~~lyv~~~~~~~I~v~~i~~~~~~~~~~~~~~~-~~~p~~~~~~~dg~~l  288 (330)
T PRK11028        225 --------SDT---RWA----ADIHITPDGRHLYACDRTASLISVFSVSEDGSVLSFEGHQPT-ETQPRGFNIDHSGKYL  288 (330)
T ss_pred             --------CCC---ccc----eeEEECCCCCEEEEecCCCCeEEEEEEeCCCCeEEEeEEEec-cccCCceEECCCCCEE
Confidence                    000   000    0123678899888885 47899999986432 1   222221 1245689999999988


Q ss_pred             EEEeC-CCCEEEeecCC
Q 022074          268 VSSSW-DGDVVRWEFPG  283 (303)
Q Consensus       268 as~s~-Dg~i~~Wd~~~  283 (303)
                      +++.. ++++.+|++..
T Consensus       289 ~va~~~~~~v~v~~~~~  305 (330)
T PRK11028        289 IAAGQKSHHISVYEIDG  305 (330)
T ss_pred             EEEEccCCcEEEEEEcC
Confidence            87775 89999999864


No 179
>KOG2919 consensus Guanine nucleotide-binding protein [General function prediction only]
Probab=99.55  E-value=6e-13  Score=111.94  Aligned_cols=217  Identities=21%  Similarity=0.321  Sum_probs=146.4

Q ss_pred             eEEEEEcCCCCEEEEeeCCCeEEEEECCCCceE------------EEEecc-cCCeEEEEEc------cCCCcEEEEecC
Q 022074           42 IFSLKFSTDGRELVAGSSDDCIYVYDLEANKLS------------LRILAH-TSDVNTVCFG------DESGHLIYSGSD  102 (303)
Q Consensus        42 v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~------------~~~~~h-~~~v~~l~~~------~~~~~~l~s~s~  102 (303)
                      ...+.|+|||..|++-+.|..+++|+++.....            ..+.-. ..-|...+|-      .++.+++++.+.
T Consensus        52 ~kgckWSPDGSciL~~sedn~l~~~nlP~dlys~~~~~~~~~~~~~~~r~~eg~tvydy~wYs~M~s~qP~t~l~a~ssr  131 (406)
T KOG2919|consen   52 LKGCKWSPDGSCILSLSEDNCLNCWNLPFDLYSKKADGPLNFSKHLSYRYQEGETVYDYCWYSRMKSDQPSTNLFAVSSR  131 (406)
T ss_pred             hccceeCCCCceEEeecccCeeeEEecChhhcccCCCCccccccceeEEeccCCEEEEEEeeeccccCCCccceeeeccc
Confidence            567899999999999999999999988643210            111111 2345555662      245678999999


Q ss_pred             CCeEEEEcCccccCCCccceeec--cccc---CeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeec
Q 022074          103 DNLCKVWDRRCLNVKGKPAGVLM--GHLE---GITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDY  177 (303)
Q Consensus       103 dg~v~lWd~~~~~~~~~~~~~~~--~h~~---~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~  177 (303)
                      +.-|++||..    +++....+.  .|.+   +-.++.|+|||++|.+| ..++||+||+.+.-..  |.          
T Consensus       132 ~~PIh~wdaf----tG~lraSy~~ydh~de~taAhsL~Fs~DGeqlfaG-ykrcirvFdt~RpGr~--c~----------  194 (406)
T KOG2919|consen  132 DQPIHLWDAF----TGKLRASYRAYDHQDEYTAAHSLQFSPDGEQLFAG-YKRCIRVFDTSRPGRD--CP----------  194 (406)
T ss_pred             cCceeeeecc----ccccccchhhhhhHHhhhhheeEEecCCCCeEeec-ccceEEEeeccCCCCC--Cc----------
Confidence            9999999975    333333332  2444   34578999999998877 5699999998432110  00          


Q ss_pred             eeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeC-CCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeE
Q 022074          178 RWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYST-GQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVR  256 (303)
Q Consensus       178 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~-~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~  256 (303)
                                       ......++.....-++   ....|+| +.+.++.|+.-.++-|+.-..+.++..+.+|.+-|+
T Consensus       195 -----------------vy~t~~~~k~gq~gii---sc~a~sP~~~~~~a~gsY~q~~giy~~~~~~pl~llggh~gGvT  254 (406)
T KOG2919|consen  195 -----------------VYTTVTKGKFGQKGII---SCFAFSPMDSKTLAVGSYGQRVGIYNDDGRRPLQLLGGHGGGVT  254 (406)
T ss_pred             -----------------chhhhhccccccccee---eeeeccCCCCcceeeecccceeeeEecCCCCceeeecccCCCee
Confidence                             0000000000001111   1223444 456899999999999999989999999999999999


Q ss_pred             EEEECCCCCeEEEEeC-CCCEEEeecCCCCccCCCCcccc
Q 022074          257 DCSWHPSQPMLVSSSW-DGDVVRWEFPGNGEAAPPLNKKR  295 (303)
Q Consensus       257 ~v~~sp~~~~las~s~-Dg~i~~Wd~~~~~~~~~~~~~~~  295 (303)
                      .+.|.++|+.|.+|+. |-.|..||+...+...=.+.+++
T Consensus       255 hL~~~edGn~lfsGaRk~dkIl~WDiR~~~~pv~~L~rhv  294 (406)
T KOG2919|consen  255 HLQWCEDGNKLFSGARKDDKILCWDIRYSRDPVYALERHV  294 (406)
T ss_pred             eEEeccCcCeecccccCCCeEEEEeehhccchhhhhhhhc
Confidence            9999999998888876 67899999876554444444443


No 180
>KOG0650 consensus WD40 repeat nucleolar protein Bop1, involved in ribosome biogenesis [Translation, ribosomal structure and biogenesis]
Probab=99.55  E-value=8.9e-14  Score=125.09  Aligned_cols=198  Identities=20%  Similarity=0.253  Sum_probs=142.4

Q ss_pred             cccceEEEEEcCCCCEEEEeeCCC---eEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccc
Q 022074           38 YSFGIFSLKFSTDGRELVAGSSDD---CIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCL  114 (303)
Q Consensus        38 ~~~~v~~l~~s~~g~~l~sgs~Dg---~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~  114 (303)
                      |--+|..+.|+..|.+|++...++   .|.|..+..+..+..+..-.+-|-++.|+|.. .+|+.++. ..|++||+.. 
T Consensus       520 ~~k~i~~vtWHrkGDYlatV~~~~~~~~VliHQLSK~~sQ~PF~kskG~vq~v~FHPs~-p~lfVaTq-~~vRiYdL~k-  596 (733)
T KOG0650|consen  520 HPKSIRQVTWHRKGDYLATVMPDSGNKSVLIHQLSKRKSQSPFRKSKGLVQRVKFHPSK-PYLFVATQ-RSVRIYDLSK-  596 (733)
T ss_pred             cCCccceeeeecCCceEEEeccCCCcceEEEEecccccccCchhhcCCceeEEEecCCC-ceEEEEec-cceEEEehhH-
Confidence            556899999999999999976543   58899998877666666666788999997654 45555554 5899999742 


Q ss_pred             cCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCC
Q 022074          115 NVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCD  194 (303)
Q Consensus       115 ~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  194 (303)
                         ......+.....-|..+++++.|.-|+.++.|+.+..+|+......                               
T Consensus       597 ---qelvKkL~tg~kwiS~msihp~GDnli~gs~d~k~~WfDldlsskP-------------------------------  642 (733)
T KOG0650|consen  597 ---QELVKKLLTGSKWISSMSIHPNGDNLILGSYDKKMCWFDLDLSSKP-------------------------------  642 (733)
T ss_pred             ---HHHHHHHhcCCeeeeeeeecCCCCeEEEecCCCeeEEEEcccCcch-------------------------------
Confidence               2223333333455788899999999999999999999998642110                               


Q ss_pred             CcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCC-------C--eEEEEeecCCCC----eEEEEEC
Q 022074          195 QSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVS-------G--EQVAALKYHTSP----VRDCSWH  261 (303)
Q Consensus       195 ~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~-------~--~~~~~~~~h~~~----I~~v~~s  261 (303)
                        ..++.-|..      ...+.+|++.-.++++|+.||++.|+--.-       -  ..++.+.+|...    |.++.||
T Consensus       643 --yk~lr~H~~------avr~Va~H~ryPLfas~sdDgtv~Vfhg~VY~Dl~qnpliVPlK~L~gH~~~~~~gVLd~~wH  714 (733)
T KOG0650|consen  643 --YKTLRLHEK------AVRSVAFHKRYPLFASGSDDGTVIVFHGMVYNDLLQNPLIVPLKRLRGHEKTNDLGVLDTIWH  714 (733)
T ss_pred             --hHHhhhhhh------hhhhhhhccccceeeeecCCCcEEEEeeeeehhhhcCCceEeeeeccCceeecccceEeeccc
Confidence              001111110      011223555567899999999999995321       1  346778888765    9999999


Q ss_pred             CCCCeEEEEeCCCCEEEee
Q 022074          262 PSQPMLVSSSWDGDVVRWE  280 (303)
Q Consensus       262 p~~~~las~s~Dg~i~~Wd  280 (303)
                      |..++|+|++.||+|++|.
T Consensus       715 P~qpWLfsAGAd~tirlfT  733 (733)
T KOG0650|consen  715 PRQPWLFSAGADGTIRLFT  733 (733)
T ss_pred             CCCceEEecCCCceEEeeC
Confidence            9999999999999999994


No 181
>KOG1517 consensus Guanine nucleotide binding protein MIP1 [Cell cycle control, cell division, chromosome partitioning]
Probab=99.53  E-value=8.9e-13  Score=125.48  Aligned_cols=210  Identities=20%  Similarity=0.252  Sum_probs=144.4

Q ss_pred             CcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEeccc---CCeEEEEE-ccCCCcEEEEecCCCeEEEEcCc
Q 022074           37 GYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHT---SDVNTVCF-GDESGHLIYSGSDDNLCKVWDRR  112 (303)
Q Consensus        37 ~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~---~~v~~l~~-~~~~~~~l~s~s~dg~v~lWd~~  112 (303)
                      |-..+-..+.|+|=...++++.....|++||-+.++....+..+.   ..|+.+++ +..+..++++|+.||.||+|+-.
T Consensus      1062 ~n~~~pk~~~~hpf~p~i~~ad~r~~i~vwd~e~~~~l~~F~n~~~~~t~Vs~l~liNe~D~aLlLtas~dGvIRIwk~y 1141 (1387)
T KOG1517|consen 1062 GNNQPPKTLKFHPFEPQIAAADDRERIRVWDWEKGRLLNGFDNGAFPDTRVSDLELINEQDDALLLTASSDGVIRIWKDY 1141 (1387)
T ss_pred             cCCCCCceeeecCCCceeEEcCCcceEEEEecccCceeccccCCCCCCCccceeeeecccchhheeeeccCceEEEeccc
Confidence            333457788999988899998877789999999988766555443   47888887 33456789999999999999743


Q ss_pred             ccc-CCCcccee---eccc----ccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCC
Q 022074          113 CLN-VKGKPAGV---LMGH----LEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPP  184 (303)
Q Consensus       113 ~~~-~~~~~~~~---~~~h----~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~  184 (303)
                      ... ...+.+..   +.++    .+.-.-++|.+...+|+++|.-+.|||||..+.........                
T Consensus      1142 ~~~~~~~eLVTaw~~Ls~~~~~~r~~~~v~dWqQ~~G~Ll~tGd~r~IRIWDa~~E~~~~diP~---------------- 1205 (1387)
T KOG1517|consen 1142 ADKWKKPELVTAWSSLSDQLPGARGTGLVVDWQQQSGHLLVTGDVRSIRIWDAHKEQVVADIPY---------------- 1205 (1387)
T ss_pred             ccccCCceeEEeeccccccCccCCCCCeeeehhhhCCeEEecCCeeEEEEEecccceeEeeccc----------------
Confidence            111 11111111   1121    12223456777667788888889999999876432211100                


Q ss_pred             CCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCe---EEEEeecCCCC--eEEEE
Q 022074          185 QARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGE---QVAALKYHTSP--VRDCS  259 (303)
Q Consensus       185 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~---~~~~~~~h~~~--I~~v~  259 (303)
                             .....+..+.+.               ...|..++.|..||.+++||.+...   .+...+.|+++  |..+.
T Consensus      1206 -------~s~t~vTaLS~~---------------~~~gn~i~AGfaDGsvRvyD~R~a~~ds~v~~~R~h~~~~~Iv~~s 1263 (1387)
T KOG1517|consen 1206 -------GSSTLVTALSAD---------------LVHGNIIAAGFADGSVRVYDRRMAPPDSLVCVYREHNDVEPIVHLS 1263 (1387)
T ss_pred             -------CCCccceeeccc---------------ccCCceEEEeecCCceEEeecccCCccccceeecccCCcccceeEE
Confidence                   011122222211               1246789999999999999987543   46677889887  99999


Q ss_pred             ECCCCC-eEEEEeCCCCEEEeecCCC
Q 022074          260 WHPSQP-MLVSSSWDGDVVRWEFPGN  284 (303)
Q Consensus       260 ~sp~~~-~las~s~Dg~i~~Wd~~~~  284 (303)
                      +.++|- -|++|+.||.|++||+..+
T Consensus      1264 lq~~G~~elvSgs~~G~I~~~DlR~~ 1289 (1387)
T KOG1517|consen 1264 LQRQGLGELVSGSQDGDIQLLDLRMS 1289 (1387)
T ss_pred             eecCCCcceeeeccCCeEEEEecccC
Confidence            999876 4999999999999999874


No 182
>KOG1334 consensus WD40 repeat protein [General function prediction only]
Probab=99.53  E-value=4.7e-14  Score=124.16  Aligned_cols=244  Identities=22%  Similarity=0.282  Sum_probs=164.2

Q ss_pred             CCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEE----------------------------------------
Q 022074           36 GGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSL----------------------------------------   75 (303)
Q Consensus        36 ~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~----------------------------------------   75 (303)
                      ++|.+-|..+.|+..|..+++||.|..|.+||-..+....                                        
T Consensus       139 ~~H~GcVntV~FN~~Gd~l~SgSDD~~vv~WdW~~~~~~l~f~SGH~~NvfQaKFiP~s~d~ti~~~s~dgqvr~s~i~~  218 (559)
T KOG1334|consen  139 NKHKGCVNTVHFNQRGDVLASGSDDLQVVVWDWVSGSPKLSFESGHCNNVFQAKFIPFSGDRTIVTSSRDGQVRVSEILE  218 (559)
T ss_pred             cCCCCccceeeecccCceeeccCccceEEeehhhccCcccccccccccchhhhhccCCCCCcCceeccccCceeeeeecc
Confidence            6999999999999999999999999999999987654211                                        


Q ss_pred             --------EEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCccceeeccccc---CeEEEEeCCCCC-EE
Q 022074           76 --------RILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLE---GITFIDSRGDGR-YL  143 (303)
Q Consensus        76 --------~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~---~v~~~~~~~~~~-~l  143 (303)
                              .+..|.+.|..++.-|..++-|.|++.|+.|.-.|+++........+.. .+..   ....++..|... ++
T Consensus       219 t~~~e~t~rl~~h~g~vhklav~p~sp~~f~S~geD~~v~~~Dlr~~~pa~~~~cr~-~~~~~~v~L~~Ia~~P~nt~~f  297 (559)
T KOG1334|consen  219 TGYVENTKRLAPHEGPVHKLAVEPDSPKPFLSCGEDAVVFHIDLRQDVPAEKFVCRE-ADEKERVGLYTIAVDPRNTNEF  297 (559)
T ss_pred             ccceecceecccccCccceeeecCCCCCcccccccccceeeeeeccCCccceeeeec-cCCccceeeeeEecCCCCcccc
Confidence                    1234566677777766667788888888888888887543332222221 2222   345677777554 79


Q ss_pred             EEEeCCCcEEEEEcccccCCcccc-------cCccc-eeeeceeeeCCCCCcccc------------------------C
Q 022074          144 ISNGKDQAIKLWDIRKMSSNASCN-------LGFRS-YEWDYRWMDYPPQARDLK------------------------H  191 (303)
Q Consensus       144 ~s~~~D~~v~lWdl~~~~~~~~~~-------~~~~~-~~~~~~~~~~~~~~~~~~------------------------~  191 (303)
                      ++++.|.-+|+||.|+......+.       ..... ..-.+..+.|......+.                        .
T Consensus       298 aVgG~dqf~RvYD~R~~~~e~~n~~~~~f~p~hl~~d~~v~ITgl~Ysh~~sElLaSYnDe~IYLF~~~~~~G~~p~~~s  377 (559)
T KOG1334|consen  298 AVGGSDQFARVYDQRRIDKEENNGVLDKFCPHHLVEDDPVNITGLVYSHDGSELLASYNDEDIYLFNKSMGDGSEPDPSS  377 (559)
T ss_pred             ccCChhhhhhhhcccchhhccccchhhhcCCccccccCcccceeEEecCCccceeeeecccceEEeccccccCCCCCCCc
Confidence            999999999999998743221111       00000 000000111111100000                        0


Q ss_pred             CCCCc-ceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEECCCCCeEEEE
Q 022074          192 PCDQS-VATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQPMLVSS  270 (303)
Q Consensus       192 ~~~~~-~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~las~  270 (303)
                      +..+. ...++||...+++....|   |-|...|+++|+.=|.|.||+-.+++.+..+++...=|+|+.=+|--++|||+
T Consensus       378 ~~~~~~k~vYKGHrN~~TVKgVNF---fGPrsEyVvSGSDCGhIFiW~K~t~eii~~MegDr~VVNCLEpHP~~PvLAsS  454 (559)
T KOG1334|consen  378 PREQYVKRVYKGHRNSRTVKGVNF---FGPRSEYVVSGSDCGHIFIWDKKTGEIIRFMEGDRHVVNCLEPHPHLPVLASS  454 (559)
T ss_pred             chhhccchhhcccccccccceeee---ccCccceEEecCccceEEEEecchhHHHHHhhcccceEeccCCCCCCchhhcc
Confidence            00111 223778877666433332   44667899999999999999999999887787766689999999999999999


Q ss_pred             eCCCCEEEeecCC
Q 022074          271 SWDGDVVRWEFPG  283 (303)
Q Consensus       271 s~Dg~i~~Wd~~~  283 (303)
                      |-|.-|++|...+
T Consensus       455 Gid~DVKIWTP~~  467 (559)
T KOG1334|consen  455 GIDHDVKIWTPLT  467 (559)
T ss_pred             CCccceeeecCCc
Confidence            9999999999743


No 183
>KOG0649 consensus WD40 repeat protein [General function prediction only]
Probab=99.53  E-value=2.8e-12  Score=104.11  Aligned_cols=204  Identities=14%  Similarity=0.258  Sum_probs=135.9

Q ss_pred             CCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCce-------EE-EEeccc-----CCeEEEEEccCCCcEEEEecC
Q 022074           36 GGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKL-------SL-RILAHT-----SDVNTVCFGDESGHLIYSGSD  102 (303)
Q Consensus        36 ~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~-------~~-~~~~h~-----~~v~~l~~~~~~~~~l~s~s~  102 (303)
                      .+|+.+|+.+.|..  ..|++|+ ||.|+=|.-+.-..       .. +...|.     ..|+++-..|..+..| .++.
T Consensus        59 qahdgpiy~~~f~d--~~Lls~g-dG~V~gw~W~E~~es~~~K~lwe~~~P~~~~~~evPeINam~ldP~enSi~-~AgG  134 (325)
T KOG0649|consen   59 QAHDGPIYYLAFHD--DFLLSGG-DGLVYGWEWNEEEESLATKRLWEVKIPMQVDAVEVPEINAMWLDPSENSIL-FAGG  134 (325)
T ss_pred             cccCCCeeeeeeeh--hheeecc-CceEEEeeehhhhhhccchhhhhhcCccccCcccCCccceeEeccCCCcEE-EecC
Confidence            79999999999993  4666665 59999886543221       11 111222     3577777655545455 5558


Q ss_pred             CCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeC
Q 022074          103 DNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDY  182 (303)
Q Consensus       103 dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~  182 (303)
                      |+.++-||++    .++....+.||++.+.++.......+++||+.||++|+||++..+.......              
T Consensus       135 D~~~y~~dlE----~G~i~r~~rGHtDYvH~vv~R~~~~qilsG~EDGtvRvWd~kt~k~v~~ie~--------------  196 (325)
T KOG0649|consen  135 DGVIYQVDLE----DGRIQREYRGHTDYVHSVVGRNANGQILSGAEDGTVRVWDTKTQKHVSMIEP--------------  196 (325)
T ss_pred             CeEEEEEEec----CCEEEEEEcCCcceeeeeeecccCcceeecCCCccEEEEeccccceeEEecc--------------
Confidence            9999999986    5566778999999999998866677899999999999999987544322100              


Q ss_pred             CCCCccccCCC-CCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEEC
Q 022074          183 PPQARDLKHPC-DQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSWH  261 (303)
Q Consensus       183 ~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~s  261 (303)
                       ...+.+..+. ..-+.                  ....+..+|++|+ ...+.+|.++..+....+.- ..++..+.| 
T Consensus       197 -yk~~~~lRp~~g~wig------------------ala~~edWlvCGg-Gp~lslwhLrsse~t~vfpi-pa~v~~v~F-  254 (325)
T KOG0649|consen  197 -YKNPNLLRPDWGKWIG------------------ALAVNEDWLVCGG-GPKLSLWHLRSSESTCVFPI-PARVHLVDF-  254 (325)
T ss_pred             -ccChhhcCcccCceeE------------------EEeccCceEEecC-CCceeEEeccCCCceEEEec-ccceeEeee-
Confidence             0000000000 00000                  1123456777765 45699999999888777753 357888998 


Q ss_pred             CCCCeEEEEeCCCCEEEeecCCC
Q 022074          262 PSQPMLVSSSWDGDVVRWEFPGN  284 (303)
Q Consensus       262 p~~~~las~s~Dg~i~~Wd~~~~  284 (303)
                       ....++++++.+-+.-|.+.+.
T Consensus       255 -~~d~vl~~G~g~~v~~~~l~Gv  276 (325)
T KOG0649|consen  255 -VDDCVLIGGEGNHVQSYTLNGV  276 (325)
T ss_pred             -ecceEEEeccccceeeeeeccE
Confidence             4456788887778888876543


No 184
>KOG0642 consensus Cell-cycle nuclear protein, contains WD-40 repeats [Cell cycle control, cell division, chromosome partitioning]
Probab=99.52  E-value=4e-13  Score=120.19  Aligned_cols=100  Identities=17%  Similarity=0.287  Sum_probs=79.5

Q ss_pred             Cchhhcccccccc---c--cCcCcccccC-CCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCc----------eEE
Q 022074           12 SGTMESLANVTEI---H--DGLDFSAADD-GGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANK----------LSL   75 (303)
Q Consensus        12 ~~~~~~~~~~~~~---~--~~~~~~~~~~-~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~----------~~~   75 (303)
                      +++.++.+.+|..   .  .+++.+++.. .||++||.|+.+.++++.+++|+-||+|+.|++....          +..
T Consensus       311 t~sed~~lk~WnLqk~~~s~~~~~epi~tfraH~gPVl~v~v~~n~~~~ysgg~Dg~I~~w~~p~n~dp~ds~dp~vl~~  390 (577)
T KOG0642|consen  311 TASEDGTLKLWNLQKAKKSAEKDVEPILTFRAHEGPVLCVVVPSNGEHCYSGGIDGTIRCWNLPPNQDPDDSYDPSVLSG  390 (577)
T ss_pred             EeccccchhhhhhcccCCccccceeeeEEEecccCceEEEEecCCceEEEeeccCceeeeeccCCCCCcccccCcchhcc
Confidence            6888999888876   2  2223322222 6999999999999999999999999999999665322          234


Q ss_pred             EEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCc
Q 022074           76 RILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRR  112 (303)
Q Consensus        76 ~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~  112 (303)
                      .+.+|++.|..++++. ..+.|++++.||+||+|+..
T Consensus       391 ~l~Ghtdavw~l~~s~-~~~~Llscs~DgTvr~w~~~  426 (577)
T KOG0642|consen  391 TLLGHTDAVWLLALSS-TKDRLLSCSSDGTVRLWEPT  426 (577)
T ss_pred             ceeccccceeeeeecc-cccceeeecCCceEEeeccC
Confidence            5789999999999964 46789999999999999854


No 185
>KOG1034 consensus Transcriptional repressor EED/ESC/FIE, required for transcriptional silencing, WD repeat superfamily [Transcription]
Probab=99.51  E-value=1.4e-12  Score=109.92  Aligned_cols=161  Identities=16%  Similarity=0.228  Sum_probs=103.7

Q ss_pred             CcccceEEEEEcC-----CCCEEEEeeCCCeEEEEECCCCceEEEEe-----cccCCeEEEEEccC---CCcEEEEecCC
Q 022074           37 GYSFGIFSLKFST-----DGRELVAGSSDDCIYVYDLEANKLSLRIL-----AHTSDVNTVCFGDE---SGHLIYSGSDD  103 (303)
Q Consensus        37 ~~~~~v~~l~~s~-----~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~-----~h~~~v~~l~~~~~---~~~~l~s~s~d  103 (303)
                      .|+.+|+.++|++     +-..+++.+.+ .+.||+.........++     .|...-..++|.-+   ...+++.|+.-
T Consensus        36 d~~~~I~gv~fN~~~~~~e~~vfatvG~~-rvtiy~c~~d~~ir~lq~y~D~d~~Esfytcsw~yd~~~~~p~la~~G~~  114 (385)
T KOG1034|consen   36 DHNKPIFGVAFNSFLGCDEPQVFATVGGN-RVTIYECPGDGGIRLLQSYADEDHDESFYTCSWSYDSNTGNPFLAAGGYL  114 (385)
T ss_pred             cCCCccceeeeehhcCCCCCceEEEeCCc-EEEEEEECCccceeeeeeccCCCCCcceEEEEEEecCCCCCeeEEeecce
Confidence            7888999999994     23345555544 58899887654222221     13344445555221   12355555666


Q ss_pred             CeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCC
Q 022074          104 NLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYP  183 (303)
Q Consensus       104 g~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~  183 (303)
                      |-||+.|..    ..+....+.+|..+|+.+.+.|+.                                           
T Consensus       115 GvIrVid~~----~~~~~~~~~ghG~sINeik~~p~~-------------------------------------------  147 (385)
T KOG1034|consen  115 GVIRVIDVV----SGQCSKNYRGHGGSINEIKFHPDR-------------------------------------------  147 (385)
T ss_pred             eEEEEEecc----hhhhccceeccCccchhhhcCCCC-------------------------------------------
Confidence            666666643    222334455555555555444422                                           


Q ss_pred             CCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEe---ecCCCCeEEEEE
Q 022074          184 PQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAAL---KYHTSPVRDCSW  260 (303)
Q Consensus       184 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~---~~h~~~I~~v~~  260 (303)
                                                            .+++++|+.|..||+|+++++.++..+   ++|.+.|.+++|
T Consensus       148 --------------------------------------~qlvls~SkD~svRlwnI~~~~Cv~VfGG~egHrdeVLSvD~  189 (385)
T KOG1034|consen  148 --------------------------------------PQLVLSASKDHSVRLWNIQTDVCVAVFGGVEGHRDEVLSVDF  189 (385)
T ss_pred             --------------------------------------CcEEEEecCCceEEEEeccCCeEEEEecccccccCcEEEEEE
Confidence                                                  245666666677777777666666554   589999999999


Q ss_pred             CCCCCeEEEEeCCCCEEEeecCC
Q 022074          261 HPSQPMLVSSSWDGDVVRWEFPG  283 (303)
Q Consensus       261 sp~~~~las~s~Dg~i~~Wd~~~  283 (303)
                      |++|.+++|+|.|.+|++|+++.
T Consensus       190 ~~~gd~i~ScGmDhslk~W~l~~  212 (385)
T KOG1034|consen  190 SLDGDRIASCGMDHSLKLWRLNV  212 (385)
T ss_pred             cCCCCeeeccCCcceEEEEecCh
Confidence            99999999999999999999873


No 186
>KOG0649 consensus WD40 repeat protein [General function prediction only]
Probab=99.51  E-value=1.8e-12  Score=105.26  Aligned_cols=196  Identities=18%  Similarity=0.220  Sum_probs=133.3

Q ss_pred             eEEEEEcCCCCEEEEeeCCCeEEEEECCCC---------c-eEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcC
Q 022074           42 IFSLKFSTDGRELVAGSSDDCIYVYDLEAN---------K-LSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDR  111 (303)
Q Consensus        42 v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~---------~-~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~  111 (303)
                      |++-+++|.++++++|+.+|+|.++.++.-         + ......+|++++..++|.   +..|++|+ ||.|+-|..
T Consensus        13 vf~qa~sp~~~~l~agn~~G~iav~sl~sl~s~sa~~~gk~~iv~eqahdgpiy~~~f~---d~~Lls~g-dG~V~gw~W   88 (325)
T KOG0649|consen   13 VFAQAISPSKQYLFAGNLFGDIAVLSLKSLDSGSAEPPGKLKIVPEQAHDGPIYYLAFH---DDFLLSGG-DGLVYGWEW   88 (325)
T ss_pred             HHHHhhCCcceEEEEecCCCeEEEEEehhhhccccCCCCCcceeeccccCCCeeeeeee---hhheeecc-CceEEEeee
Confidence            666789999999999999999999988642         1 223457899999999996   34666776 599998876


Q ss_pred             ccccCCCcccee----eccc-----ccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeC
Q 022074          112 RCLNVKGKPAGV----LMGH-----LEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDY  182 (303)
Q Consensus       112 ~~~~~~~~~~~~----~~~h-----~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~  182 (303)
                      +...........    ...|     ...|+++...|..+.++.++.|+.+.-||+...+                     
T Consensus        89 ~E~~es~~~K~lwe~~~P~~~~~~evPeINam~ldP~enSi~~AgGD~~~y~~dlE~G~---------------------  147 (325)
T KOG0649|consen   89 NEEEESLATKRLWEVKIPMQVDAVEVPEINAMWLDPSENSILFAGGDGVIYQVDLEDGR---------------------  147 (325)
T ss_pred             hhhhhhccchhhhhhcCccccCcccCCccceeEeccCCCcEEEecCCeEEEEEEecCCE---------------------
Confidence            432211110000    0112     2347788888887888888899999999986422                     


Q ss_pred             CCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecC----------C
Q 022074          183 PPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYH----------T  252 (303)
Q Consensus       183 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h----------~  252 (303)
                                   ...++.||......+...     + ....+++|++||++|+||.++++.+..++..          .
T Consensus       148 -------------i~r~~rGHtDYvH~vv~R-----~-~~~qilsG~EDGtvRvWd~kt~k~v~~ie~yk~~~~lRp~~g  208 (325)
T KOG0649|consen  148 -------------IQREYRGHTDYVHSVVGR-----N-ANGQILSGAEDGTVRVWDTKTQKHVSMIEPYKNPNLLRPDWG  208 (325)
T ss_pred             -------------EEEEEcCCcceeeeeeec-----c-cCcceeecCCCccEEEEeccccceeEEeccccChhhcCcccC
Confidence                         233455554322211110     1 1345899999999999999999988776532          1


Q ss_pred             CCeEEEEECCCCCeEEEEeCCCCEEEeecCCC
Q 022074          253 SPVRDCSWHPSQPMLVSSSWDGDVVRWEFPGN  284 (303)
Q Consensus       253 ~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~~~  284 (303)
                      .+|.+++-  +..+|++|+ .-.+.+|.++.+
T Consensus       209 ~wigala~--~edWlvCGg-Gp~lslwhLrss  237 (325)
T KOG0649|consen  209 KWIGALAV--NEDWLVCGG-GPKLSLWHLRSS  237 (325)
T ss_pred             ceeEEEec--cCceEEecC-CCceeEEeccCC
Confidence            34555554  556888886 468999999865


No 187
>PF08662 eIF2A:  Eukaryotic translation initiation factor eIF2A;  InterPro: IPR013979  This entry contains beta propeller domains found in eukaryotic translation initiation factors and TolB domain-containing proteins.
Probab=99.51  E-value=4.7e-12  Score=103.86  Aligned_cols=67  Identities=16%  Similarity=0.408  Sum_probs=48.5

Q ss_pred             eCCCeEEEEEe---CCCeEEEEECCCCeEEEEeecCCCCeEEEEECCCCCeEEEEeC------CCCEEEeecCCCCcc
Q 022074          219 STGQKYIYTGS---HDSCVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQPMLVSSSW------DGDVVRWEFPGNGEA  287 (303)
Q Consensus       219 s~~~~~latg~---~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~las~s~------Dg~i~~Wd~~~~~~~  287 (303)
                      ||+|++|++|+   ..|.|.+||..+.+.+...+ |. .++.++|||||++|+++..      |+.+++|++.+....
T Consensus       109 sP~G~~l~~~g~~n~~G~l~~wd~~~~~~i~~~~-~~-~~t~~~WsPdGr~~~ta~t~~r~~~dng~~Iw~~~G~~l~  184 (194)
T PF08662_consen  109 SPDGRFLVLAGFGNLNGDLEFWDVRKKKKISTFE-HS-DATDVEWSPDGRYLATATTSPRLRVDNGFKIWSFQGRLLY  184 (194)
T ss_pred             CCCCCEEEEEEccCCCcEEEEEECCCCEEeeccc-cC-cEEEEEEcCCCCEEEEEEeccceeccccEEEEEecCeEeE
Confidence            33444444443   23668889988888776664 33 4789999999999998875      799999999876433


No 188
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=99.50  E-value=1.6e-11  Score=109.26  Aligned_cols=233  Identities=18%  Similarity=0.201  Sum_probs=137.1

Q ss_pred             hccccccccccCcCcccccCCCcccceEEEEEcCCCCEEEEee-CCCeEEEEECCC-CceEE-EEecccCCeEEEEEccC
Q 022074           16 ESLANVTEIHDGLDFSAADDGGYSFGIFSLKFSTDGRELVAGS-SDDCIYVYDLEA-NKLSL-RILAHTSDVNTVCFGDE   92 (303)
Q Consensus        16 ~~~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~s~~g~~l~sgs-~Dg~v~lwd~~~-~~~~~-~~~~h~~~v~~l~~~~~   92 (303)
                      +..|.+|++.++.+......-.+......++++|++++|++++ .++.|.+|+++. +.+.. ......+....+++++ 
T Consensus        11 ~~~I~~~~~~~~g~l~~~~~~~~~~~~~~l~~spd~~~lyv~~~~~~~i~~~~~~~~g~l~~~~~~~~~~~p~~i~~~~-   89 (330)
T PRK11028         11 SQQIHVWNLNHEGALTLLQVVDVPGQVQPMVISPDKRHLYVGVRPEFRVLSYRIADDGALTFAAESPLPGSPTHISTDH-   89 (330)
T ss_pred             CCCEEEEEECCCCceeeeeEEecCCCCccEEECCCCCEEEEEECCCCcEEEEEECCCCceEEeeeecCCCCceEEEECC-
Confidence            4567777775433321111111223466789999999987765 478899999973 43321 1112334567888975 


Q ss_pred             CCcEEEEec-CCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEE-EEeCCCcEEEEEcccccCCcccccCc
Q 022074           93 SGHLIYSGS-DDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLI-SNGKDQAIKLWDIRKMSSNASCNLGF  170 (303)
Q Consensus        93 ~~~~l~s~s-~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~-s~~~D~~v~lWdl~~~~~~~~~~~~~  170 (303)
                      +++.+++++ .++.|.+||+............+. +......+.++|++++++ +...++.|.+||+....... .....
T Consensus        90 ~g~~l~v~~~~~~~v~v~~~~~~g~~~~~~~~~~-~~~~~~~~~~~p~g~~l~v~~~~~~~v~v~d~~~~g~l~-~~~~~  167 (330)
T PRK11028         90 QGRFLFSASYNANCVSVSPLDKDGIPVAPIQIIE-GLEGCHSANIDPDNRTLWVPCLKEDRIRLFTLSDDGHLV-AQEPA  167 (330)
T ss_pred             CCCEEEEEEcCCCeEEEEEECCCCCCCCceeecc-CCCcccEeEeCCCCCEEEEeeCCCCEEEEEEECCCCccc-ccCCC
Confidence            466666655 488999999752111111222222 223456677899998885 45567999999986421100 00000


Q ss_pred             cceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeC-CCeEEEEECCC--Ce--EE
Q 022074          171 RSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSH-DSCVYVYDLVS--GE--QV  245 (303)
Q Consensus       171 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~-dg~i~iwd~~~--~~--~~  245 (303)
                                               .+....+..        .....|+|++++++++.+ ++.|.+||+..  ++  .+
T Consensus       168 -------------------------~~~~~~g~~--------p~~~~~~pdg~~lyv~~~~~~~v~v~~~~~~~~~~~~~  214 (330)
T PRK11028        168 -------------------------EVTTVEGAG--------PRHMVFHPNQQYAYCVNELNSSVDVWQLKDPHGEIECV  214 (330)
T ss_pred             -------------------------ceecCCCCC--------CceEEECCCCCEEEEEecCCCEEEEEEEeCCCCCEEEE
Confidence                                     000000000        012347789999988876 99999999973  32  23


Q ss_pred             EEeecC------CCCeEEEEECCCCCeEEEEeC-CCCEEEeecCCC
Q 022074          246 AALKYH------TSPVRDCSWHPSQPMLVSSSW-DGDVVRWEFPGN  284 (303)
Q Consensus       246 ~~~~~h------~~~I~~v~~sp~~~~las~s~-Dg~i~~Wd~~~~  284 (303)
                      ..+..+      ......+.|+|++++|+++.. ++.|.+|++...
T Consensus       215 ~~~~~~p~~~~~~~~~~~i~~~pdg~~lyv~~~~~~~I~v~~i~~~  260 (330)
T PRK11028        215 QTLDMMPADFSDTRWAADIHITPDGRHLYACDRTASLISVFSVSED  260 (330)
T ss_pred             EEEecCCCcCCCCccceeEEECCCCCEEEEecCCCCeEEEEEEeCC
Confidence            333221      122346899999998888754 789999998543


No 189
>KOG2394 consensus WD40 protein DMR-N9 [General function prediction only]
Probab=99.49  E-value=8.4e-13  Score=117.63  Aligned_cols=173  Identities=20%  Similarity=0.229  Sum_probs=121.8

Q ss_pred             CCCCEEEEeeCCCeEEEEECCCCceEEEE----ecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCC-------
Q 022074           49 TDGRELVAGSSDDCIYVYDLEANKLSLRI----LAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVK-------  117 (303)
Q Consensus        49 ~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~----~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~-------  117 (303)
                      +.+-.++.|=.-|.|.+.|.........+    .--+..|+|+.|-+.+...|+.+-.+|.+.++|.......       
T Consensus       183 ~~g~dllIGf~tGqvq~idp~~~~~sklfne~r~i~ktsvT~ikWvpg~~~~Fl~a~~sGnlyly~~~~~~~~t~p~~~~  262 (636)
T KOG2394|consen  183 PKGLDLLIGFTTGQVQLIDPINFEVSKLFNEERLINKSSVTCIKWVPGSDSLFLVAHASGNLYLYDKEIVCGATAPSYQA  262 (636)
T ss_pred             CCCcceEEeeccCceEEecchhhHHHHhhhhcccccccceEEEEEEeCCCceEEEEEecCceEEeeccccccCCCCcccc
Confidence            45667888888888888877653221111    0122579999998877888999999999999975311000       


Q ss_pred             -----------------CccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceee
Q 022074          118 -----------------GKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWM  180 (303)
Q Consensus       118 -----------------~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~  180 (303)
                                       ..|...+.--..+++.++|++||.+||+.+.||.+||||...+......              
T Consensus       263 ~k~~~~f~i~t~ksk~~rNPv~~w~~~~g~in~f~FS~DG~~LA~VSqDGfLRvF~fdt~eLlg~m--------------  328 (636)
T KOG2394|consen  263 LKDGDQFAILTSKSKKTRNPVARWHIGEGSINEFAFSPDGKYLATVSQDGFLRIFDFDTQELLGVM--------------  328 (636)
T ss_pred             cCCCCeeEEeeeeccccCCccceeEeccccccceeEcCCCceEEEEecCceEEEeeccHHHHHHHH--------------
Confidence                             0111112112346778889999999999999999999997654321111              


Q ss_pred             eCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEE
Q 022074          181 DYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSW  260 (303)
Q Consensus       181 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~  260 (303)
                                       ..+         .....+..+||||+|+++||+|.-|.||....++.+..=.+|+.+|..|+|
T Consensus       329 -----------------kSY---------FGGLLCvcWSPDGKyIvtGGEDDLVtVwSf~erRVVARGqGHkSWVs~VaF  382 (636)
T KOG2394|consen  329 -----------------KSY---------FGGLLCVCWSPDGKYIVTGGEDDLVTVWSFEERRVVARGQGHKSWVSVVAF  382 (636)
T ss_pred             -----------------Hhh---------ccceEEEEEcCCccEEEecCCcceEEEEEeccceEEEeccccccceeeEee
Confidence                             000         001112336789999999999999999999999999998999999999999


Q ss_pred             C
Q 022074          261 H  261 (303)
Q Consensus       261 s  261 (303)
                      .
T Consensus       383 D  383 (636)
T KOG2394|consen  383 D  383 (636)
T ss_pred             c
Confidence            8


No 190
>KOG0644 consensus Uncharacterized conserved protein, contains WD40 repeat and BROMO domains [General function prediction only]
Probab=99.48  E-value=1.2e-13  Score=128.48  Aligned_cols=259  Identities=18%  Similarity=0.207  Sum_probs=170.8

Q ss_pred             EEEEccCchhhccccccccccCcCcccccCCCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeE
Q 022074            6 HIVDVGSGTMESLANVTEIHDGLDFSAADDGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVN   85 (303)
Q Consensus         6 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~   85 (303)
                      |-++-||.+|  |..||--.++....  +-.||+..|..++.+.+.-.++++|.|..|++|.+.++..+..+.+|++.|+
T Consensus       203 ~~Iitgsdd~--lvKiwS~et~~~lA--s~rGhs~ditdlavs~~n~~iaaaS~D~vIrvWrl~~~~pvsvLrghtgavt  278 (1113)
T KOG0644|consen  203 RYIITGSDDR--LVKIWSMETARCLA--SCRGHSGDITDLAVSSNNTMIAAASNDKVIRVWRLPDGAPVSVLRGHTGAVT  278 (1113)
T ss_pred             ceEeecCccc--eeeeeeccchhhhc--cCCCCccccchhccchhhhhhhhcccCceEEEEecCCCchHHHHhcccccee
Confidence            4466677665  77888877777663  6699999999999999999999999999999999999998888999999999


Q ss_pred             EEEEccCCCcEEEEecCCCeEEEEcCccccCCCccceeecc-cccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCc
Q 022074           86 TVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMG-HLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNA  164 (303)
Q Consensus        86 ~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~-h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~  164 (303)
                      +++|+|-     .+.+.||++++||.+- .....+...+.- -.+-+..+-+-..+..++|++.|+.-+.|.+.......
T Consensus       279 aiafsP~-----~sss~dgt~~~wd~r~-~~~~y~prp~~~~~~~~~~s~~~~~~~~~f~Tgs~d~ea~n~e~~~l~~~~  352 (1113)
T KOG0644|consen  279 AIAFSPR-----ASSSDDGTCRIWDARL-EPRIYVPRPLKFTEKDLVDSILFENNGDRFLTGSRDGEARNHEFEQLAWRS  352 (1113)
T ss_pred             eeccCcc-----ccCCCCCceEeccccc-cccccCCCCCCcccccceeeeeccccccccccccCCcccccchhhHhhhhc
Confidence            9999763     2778899999999871 111111111111 12345566677778889999999999999765421110


Q ss_pred             ccccCccceeeeceeeeCCCCCccccCC------CCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEE
Q 022074          165 SCNLGFRSYEWDYRWMDYPPQARDLKHP------CDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYD  238 (303)
Q Consensus       165 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd  238 (303)
                      .. +.+.....+..  .+-+....-...      .....-...+|.....+++.|.     -+.+...+++.||...|||
T Consensus       353 ~~-lif~t~ssd~~--~~~~~ar~~~~~~vwnl~~g~l~H~l~ghsd~~yvLd~Hp-----fn~ri~msag~dgst~iwd  424 (1113)
T KOG0644|consen  353 NL-LIFVTRSSDLS--SIVVTARNDHRLCVWNLYTGQLLHNLMGHSDEVYVLDVHP-----FNPRIAMSAGYDGSTIIWD  424 (1113)
T ss_pred             cc-eEEEecccccc--ccceeeeeeeEeeeeecccchhhhhhcccccceeeeeecC-----CCcHhhhhccCCCceEeee
Confidence            00 00000000000  000000000000      0011112233333333333331     1345667899999999999


Q ss_pred             CCCCeEEEEeecCCCCeEEEEECCCCCeEEEEeCCCCEEEeecC
Q 022074          239 LVSGEQVAALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFP  282 (303)
Q Consensus       239 ~~~~~~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~  282 (303)
                      +-.|-+++.+......+-+.+||+||..++....-|.+.+...-
T Consensus       425 i~eg~pik~y~~gh~kl~d~kFSqdgts~~lsd~hgql~i~g~g  468 (1113)
T KOG0644|consen  425 IWEGIPIKHYFIGHGKLVDGKFSQDGTSIALSDDHGQLYILGTG  468 (1113)
T ss_pred             cccCCcceeeecccceeeccccCCCCceEecCCCCCceEEeccC
Confidence            99887766554335678899999999999999999999988754


No 191
>KOG0290 consensus Conserved WD40 repeat-containing protein AN11 [Function unknown]
Probab=99.48  E-value=5.2e-13  Score=110.81  Aligned_cols=125  Identities=20%  Similarity=0.341  Sum_probs=104.5

Q ss_pred             CCcccceEEEEEcC-CCCEEEEeeCCCeEEEEECCCCc---eEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcC
Q 022074           36 GGYSFGIFSLKFST-DGRELVAGSSDDCIYVYDLEANK---LSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDR  111 (303)
Q Consensus        36 ~~~~~~v~~l~~s~-~g~~l~sgs~Dg~v~lwd~~~~~---~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~  111 (303)
                      .-|..++.+..|+. +-+++.++|-|-|..|||++++.   ...++.+|+..|..++|.....++|+|.+.||+||+||+
T Consensus       147 s~~~aPlTSFDWne~dp~~igtSSiDTTCTiWdie~~~~~~vkTQLIAHDKEV~DIaf~~~s~~~FASvgaDGSvRmFDL  226 (364)
T KOG0290|consen  147 SEFCAPLTSFDWNEVDPNLIGTSSIDTTCTIWDIETGVSGTVKTQLIAHDKEVYDIAFLKGSRDVFASVGADGSVRMFDL  226 (364)
T ss_pred             cccCCcccccccccCCcceeEeecccCeEEEEEEeeccccceeeEEEecCcceeEEEeccCccceEEEecCCCcEEEEEe
Confidence            36668999999994 67789999999999999999863   356788999999999998766789999999999999998


Q ss_pred             ccccC--------------------------------------------CCccceeecccccCeEEEEeCCC-CCEEEEE
Q 022074          112 RCLNV--------------------------------------------KGKPAGVLMGHLEGITFIDSRGD-GRYLISN  146 (303)
Q Consensus       112 ~~~~~--------------------------------------------~~~~~~~~~~h~~~v~~~~~~~~-~~~l~s~  146 (303)
                      |..+.                                            ...+...+.+|.+.|+.++|.|. ...|+|+
T Consensus       227 R~leHSTIIYE~p~~~~pLlRLswnkqDpnymATf~~dS~~V~iLDiR~P~tpva~L~~H~a~VNgIaWaPhS~~hicta  306 (364)
T KOG0290|consen  227 RSLEHSTIIYEDPSPSTPLLRLSWNKQDPNYMATFAMDSNKVVILDIRVPCTPVARLRNHQASVNGIAWAPHSSSHICTA  306 (364)
T ss_pred             cccccceEEecCCCCCCcceeeccCcCCchHHhhhhcCCceEEEEEecCCCcceehhhcCcccccceEecCCCCceeeec
Confidence            74211                                            12244567789999999999885 5679999


Q ss_pred             eCCCcEEEEEcccc
Q 022074          147 GKDQAIKLWDIRKM  160 (303)
Q Consensus       147 ~~D~~v~lWdl~~~  160 (303)
                      |.|.++.+||+..+
T Consensus       307 GDD~qaliWDl~q~  320 (364)
T KOG0290|consen  307 GDDCQALIWDLQQM  320 (364)
T ss_pred             CCcceEEEEecccc
Confidence            99999999999754


No 192
>KOG0307 consensus Vesicle coat complex COPII, subunit SEC31 [Intracellular trafficking, secretion, and vesicular transport]
Probab=99.47  E-value=5.1e-13  Score=128.10  Aligned_cols=236  Identities=19%  Similarity=0.305  Sum_probs=142.5

Q ss_pred             EEEEEcCCCC-EEEEee----------CCCeEEEEECCCCceE---EE--EecccCCeEEEEEccCCCc---EEEEecCC
Q 022074           43 FSLKFSTDGR-ELVAGS----------SDDCIYVYDLEANKLS---LR--ILAHTSDVNTVCFGDESGH---LIYSGSDD  103 (303)
Q Consensus        43 ~~l~~s~~g~-~l~sgs----------~Dg~v~lwd~~~~~~~---~~--~~~h~~~v~~l~~~~~~~~---~l~s~s~d  103 (303)
                      -.++|+|.+. ++++|.          .+.++-||.+......   ..  ....+..-+.++|.+....   +++.|.+|
T Consensus        10 a~~awSp~~~~~laagt~aq~~D~sfst~~slEifeld~~~~~~dlk~~~s~~s~~rF~kL~W~~~g~~~~GlIaGG~ed   89 (1049)
T KOG0307|consen   10 ATFAWSPASPPLLAAGTAAQQFDASFSTSASLEIFELDFSDESSDLKPVGSLQSSNRFNKLAWGSYGSHSHGLIAGGLED   89 (1049)
T ss_pred             ceEEecCCCchhhHHHhhhhccccccccccccceeeecccCccccccccccccccccceeeeecccCCCccceeeccccC
Confidence            4578898886 455443          3455667765433211   11  1122346688999654333   58888999


Q ss_pred             CeEEEEcCccc--cCCCccceeecccccCeEEEEeCCCCC-EEEEEeCCCcEEEEEcccccCCcccc-----cCccceee
Q 022074          104 NLCKVWDRRCL--NVKGKPAGVLMGHLEGITFIDSRGDGR-YLISNGKDQAIKLWDIRKMSSNASCN-----LGFRSYEW  175 (303)
Q Consensus       104 g~v~lWd~~~~--~~~~~~~~~~~~h~~~v~~~~~~~~~~-~l~s~~~D~~v~lWdl~~~~~~~~~~-----~~~~~~~~  175 (303)
                      |.|.+||....  +.....+.....|.+.|..++|++.+. +|++|+.||.|.|||+.+.....+..     .....+.|
T Consensus        90 G~I~ly~p~~~~~~~~~~~la~~~~h~G~V~gLDfN~~q~nlLASGa~~geI~iWDlnn~~tP~~~~~~~~~~eI~~lsW  169 (1049)
T KOG0307|consen   90 GNIVLYDPASIIANASEEVLATKSKHTGPVLGLDFNPFQGNLLASGADDGEILIWDLNKPETPFTPGSQAPPSEIKCLSW  169 (1049)
T ss_pred             CceEEecchhhccCcchHHHhhhcccCCceeeeeccccCCceeeccCCCCcEEEeccCCcCCCCCCCCCCCcccceEecc
Confidence            99999997532  222234556677999999999999755 99999999999999998754332221     11122233


Q ss_pred             eceeee----CCCCCccc-c-CCCCCcceEEecccceeeeEEEee-eeeeeCC-CeEEEEEeCCC---eEEEEECCCC-e
Q 022074          176 DYRWMD----YPPQARDL-K-HPCDQSVATYKGHSVLRTLIRCHF-SPVYSTG-QKYIYTGSHDS---CVYVYDLVSG-E  243 (303)
Q Consensus       176 ~~~~~~----~~~~~~~~-~-~~~~~~~~~~~~~~~~~~~~~~~~-~~~~s~~-~~~latg~~dg---~i~iwd~~~~-~  243 (303)
                      ......    ..+.++.. . ......+..+..+..     ++++ ...++|+ -..++++++|.   .|.+||++.- .
T Consensus       170 NrkvqhILAS~s~sg~~~iWDlr~~~pii~ls~~~~-----~~~~S~l~WhP~~aTql~~As~dd~~PviqlWDlR~ass  244 (1049)
T KOG0307|consen  170 NRKVSHILASGSPSGRAVIWDLRKKKPIIKLSDTPG-----RMHCSVLAWHPDHATQLLVASGDDSAPVIQLWDLRFASS  244 (1049)
T ss_pred             chhhhHHhhccCCCCCceeccccCCCcccccccCCC-----ccceeeeeeCCCCceeeeeecCCCCCceeEeecccccCC
Confidence            311100    00010000 0 000011111111111     1111 1224443 34566666544   5999998754 4


Q ss_pred             EEEEeecCCCCeEEEEECCCC-CeEEEEeCCCCEEEeecCC
Q 022074          244 QVAALKYHTSPVRDCSWHPSQ-PMLVSSSWDGDVVRWEFPG  283 (303)
Q Consensus       244 ~~~~~~~h~~~I~~v~~sp~~-~~las~s~Dg~i~~Wd~~~  283 (303)
                      .++++.+|+..|.++.|.+.+ .+|+|++.|+.+..|+...
T Consensus       245 P~k~~~~H~~GilslsWc~~D~~lllSsgkD~~ii~wN~~t  285 (1049)
T KOG0307|consen  245 PLKILEGHQRGILSLSWCPQDPRLLLSSGKDNRIICWNPNT  285 (1049)
T ss_pred             chhhhcccccceeeeccCCCCchhhhcccCCCCeeEecCCC
Confidence            577889999999999999987 7999999999999999765


No 193
>KOG1517 consensus Guanine nucleotide binding protein MIP1 [Cell cycle control, cell division, chromosome partitioning]
Probab=99.47  E-value=1.9e-12  Score=123.36  Aligned_cols=203  Identities=23%  Similarity=0.355  Sum_probs=142.0

Q ss_pred             EEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEec-ccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCccc
Q 022074           43 FSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILA-HTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPA  121 (303)
Q Consensus        43 ~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~-h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~  121 (303)
                      .-++|.-+..+|+++|.-..|||||++......-+.. .+..|+++.-....+++++.|-.||.||+||.|.... ...+
T Consensus      1169 ~v~dWqQ~~G~Ll~tGd~r~IRIWDa~~E~~~~diP~~s~t~vTaLS~~~~~gn~i~AGfaDGsvRvyD~R~a~~-ds~v 1247 (1387)
T KOG1517|consen 1169 LVVDWQQQSGHLLVTGDVRSIRIWDAHKEQVVADIPYGSSTLVTALSADLVHGNIIAAGFADGSVRVYDRRMAPP-DSLV 1247 (1387)
T ss_pred             eeeehhhhCCeEEecCCeeEEEEEecccceeEeecccCCCccceeecccccCCceEEEeecCCceEEeecccCCc-cccc
Confidence            5577886666777777788999999988776554432 3446677765445578999999999999999885443 2456


Q ss_pred             eeecccccC--eEEEEeCCCCC-EEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCcce
Q 022074          122 GVLMGHLEG--ITFIDSRGDGR-YLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVA  198 (303)
Q Consensus       122 ~~~~~h~~~--v~~~~~~~~~~-~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  198 (303)
                      .....|.+.  |..+.+.+.|- .|++|+.||.|++||+|..........   ...|++                     
T Consensus      1248 ~~~R~h~~~~~Iv~~slq~~G~~elvSgs~~G~I~~~DlR~~~~e~~~~i---v~~~~y--------------------- 1303 (1387)
T KOG1517|consen 1248 CVYREHNDVEPIVHLSLQRQGLGELVSGSQDGDIQLLDLRMSSKETFLTI---VAHWEY--------------------- 1303 (1387)
T ss_pred             eeecccCCcccceeEEeecCCCcceeeeccCCeEEEEecccCccccccee---eecccc---------------------
Confidence            667778876  88888887653 499999999999999997311110000   000000                     


Q ss_pred             EEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecC-------CCCeEEEEECCCCCeEEEEe
Q 022074          199 TYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYH-------TSPVRDCSWHPSQPMLVSSS  271 (303)
Q Consensus       199 ~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h-------~~~I~~v~~sp~~~~las~s  271 (303)
                         |..  -+.+.      .+.....+|+|+. +.|+||++. |+.+..++.+       .+.+.+++|||-..+||.|+
T Consensus      1304 ---Gs~--lTal~------VH~hapiiAsGs~-q~ikIy~~~-G~~l~~~k~n~~F~~q~~gs~scL~FHP~~~llAaG~ 1370 (1387)
T KOG1517|consen 1304 ---GSA--LTALT------VHEHAPIIASGSA-QLIKIYSLS-GEQLNIIKYNPGFMGQRIGSVSCLAFHPHRLLLAAGS 1370 (1387)
T ss_pred             ---Ccc--ceeee------eccCCCeeeecCc-ceEEEEecC-hhhhcccccCcccccCcCCCcceeeecchhHhhhhcc
Confidence               000  01121      2234567899988 999999985 5655555433       35789999999999999999


Q ss_pred             CCCCEEEeecCC
Q 022074          272 WDGDVVRWEFPG  283 (303)
Q Consensus       272 ~Dg~i~~Wd~~~  283 (303)
                      .|..+.++....
T Consensus      1371 ~Ds~V~iYs~~k 1382 (1387)
T KOG1517|consen 1371 ADSTVSIYSCEK 1382 (1387)
T ss_pred             CCceEEEeecCC
Confidence            999999998643


No 194
>KOG1007 consensus WD repeat protein TSSC1, WD repeat superfamily [Function unknown]
Probab=99.43  E-value=4.3e-12  Score=105.37  Aligned_cols=188  Identities=22%  Similarity=0.261  Sum_probs=128.7

Q ss_pred             CCcccceEEEEEcC--CCCEEEEeeCCCeEEEEECCCCceEEEE-ecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCc
Q 022074           36 GGYSFGIFSLKFST--DGRELVAGSSDDCIYVYDLEANKLSLRI-LAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRR  112 (303)
Q Consensus        36 ~~~~~~v~~l~~s~--~g~~l~sgs~Dg~v~lwd~~~~~~~~~~-~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~  112 (303)
                      .+|+....+-.|+|  ||+.+++. .|++++-||+++......+ .+|...|..+-|+|+.-.+|+||+.||.||+||.|
T Consensus       167 ~e~~~~ftsg~WspHHdgnqv~tt-~d~tl~~~D~RT~~~~~sI~dAHgq~vrdlDfNpnkq~~lvt~gDdgyvriWD~R  245 (370)
T KOG1007|consen  167 AEMRHSFTSGAWSPHHDGNQVATT-SDSTLQFWDLRTMKKNNSIEDAHGQRVRDLDFNPNKQHILVTCGDDGYVRIWDTR  245 (370)
T ss_pred             ccccceecccccCCCCccceEEEe-CCCcEEEEEccchhhhcchhhhhcceeeeccCCCCceEEEEEcCCCccEEEEecc
Confidence            56888999999998  78877775 6889999999987655444 36888999999998777889999999999999987


Q ss_pred             cccCCCccceeecccccCeEEEEeCCC-CCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCC--Cccc
Q 022074          113 CLNVKGKPAGVLMGHLEGITFIDSRGD-GRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQ--ARDL  189 (303)
Q Consensus       113 ~~~~~~~~~~~~~~h~~~v~~~~~~~~-~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~  189 (303)
                      .   +..++..+.+|..-|+++.|+|. +.+++|+|.|..|.+|.............+.       ..-..+..  ....
T Consensus       246 ~---tk~pv~el~~HsHWvW~VRfn~~hdqLiLs~~SDs~V~Lsca~svSSE~qi~~~~-------dese~e~~dseer~  315 (370)
T KOG1007|consen  246 K---TKFPVQELPGHSHWVWAVRFNPEHDQLILSGGSDSAVNLSCASSVSSEQQIEFED-------DESESEDEDSEERV  315 (370)
T ss_pred             C---CCccccccCCCceEEEEEEecCccceEEEecCCCceeEEEecccccccccccccc-------ccccCcchhhHHhc
Confidence            3   45677888899999999999884 5678999999999999865432221111110       00000000  0111


Q ss_pred             cCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEEC
Q 022074          190 KHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDL  239 (303)
Q Consensus       190 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~  239 (303)
                      +...++.+.+++.|....+.  +.++   +.+.-.+|+-+.||++-|=.+
T Consensus       316 kpL~dg~l~tydehEDSVY~--~aWS---sadPWiFASLSYDGRviIs~V  360 (370)
T KOG1007|consen  316 KPLQDGQLETYDEHEDSVYA--LAWS---SADPWIFASLSYDGRVIISSV  360 (370)
T ss_pred             ccccccccccccccccceEE--Eeec---cCCCeeEEEeccCceEEeecC
Confidence            11223345555555432222  2222   235567888899999877544


No 195
>KOG4227 consensus WD40 repeat protein [General function prediction only]
Probab=99.43  E-value=1.3e-11  Score=106.39  Aligned_cols=237  Identities=17%  Similarity=0.199  Sum_probs=146.9

Q ss_pred             cCCCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCC------ceEE-EEecccCCeEEEEEccCCCcEEEEecCCCeE
Q 022074           34 DDGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEAN------KLSL-RILAHTSDVNTVCFGDESGHLIYSGSDDNLC  106 (303)
Q Consensus        34 ~~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~------~~~~-~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v  106 (303)
                      |..||.+-|.+|.|+.++++|++|+.|..++||.++..      +.+. .-..|...|.|++|... ...+++|..+++|
T Consensus        51 D~~~H~GCiNAlqFS~N~~~L~SGGDD~~~~~W~~de~~~~k~~KPI~~~~~~H~SNIF~L~F~~~-N~~~~SG~~~~~V  129 (609)
T KOG4227|consen   51 DVREHTGCINALQFSHNDRFLASGGDDMHGRVWNVDELMVRKTPKPIGVMEHPHRSNIFSLEFDLE-NRFLYSGERWGTV  129 (609)
T ss_pred             hhhhhccccceeeeccCCeEEeecCCcceeeeechHHHHhhcCCCCceeccCccccceEEEEEccC-CeeEecCCCccee
Confidence            55699999999999999999999999999999988542      2221 12235579999999654 5678899999999


Q ss_pred             EEEcCccccCCCccceeeccccc---CeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcc------cccCccceeeec
Q 022074          107 KVWDRRCLNVKGKPAGVLMGHLE---GITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNAS------CNLGFRSYEWDY  177 (303)
Q Consensus       107 ~lWd~~~~~~~~~~~~~~~~h~~---~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~------~~~~~~~~~~~~  177 (303)
                      .+-|+..    .+.+-++ .|.+   .|..++.+|-++.|++.+.++.|.+||.|.......      ....|.+..   
T Consensus       130 I~HDiEt----~qsi~V~-~~~~~~~~VY~m~~~P~DN~~~~~t~~~~V~~~D~Rd~~~~~~~~~~AN~~~~F~t~~---  201 (609)
T KOG4227|consen  130 IKHDIET----KQSIYVA-NENNNRGDVYHMDQHPTDNTLIVVTRAKLVSFIDNRDRQNPISLVLPANSGKNFYTAE---  201 (609)
T ss_pred             Eeeeccc----ceeeeee-cccCcccceeecccCCCCceEEEEecCceEEEEeccCCCCCCceeeecCCCccceeee---
Confidence            9999752    2223222 3444   799999999999999999999999999986442111      111222222   


Q ss_pred             eeeeCCCCCcccc-CCC-CCcceEE------------ecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCe
Q 022074          178 RWMDYPPQARDLK-HPC-DQSVATY------------KGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGE  243 (303)
Q Consensus       178 ~~~~~~~~~~~~~-~~~-~~~~~~~------------~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~  243 (303)
                          +.|....+. ... ......+            .+...+.....--....|++.|..|.+----..-.+||+.+..
T Consensus       202 ----F~P~~P~Li~~~~~~~G~~~~D~R~~~~~~~~~~~~~~L~~~~~~~M~~~~~~~G~Q~msiRR~~~P~~~D~~S~R  277 (609)
T KOG4227|consen  202 ----FHPETPALILVNSETGGPNVFDRRMQARPVYQRSMFKGLPQENTEWMGSLWSPSGNQFMSIRRGKCPLYFDFISQR  277 (609)
T ss_pred             ----ecCCCceeEEeccccCCCCceeeccccchHHhhhccccCcccchhhhheeeCCCCCeehhhhccCCCEEeeeeccc
Confidence                222222111 100 0001111            1110000000000112356667666665445556677876632


Q ss_pred             -EEEEeecCC-------CCeEEEEECCCCCeEEEEeCCCCEEEeecCCCC
Q 022074          244 -QVAALKYHT-------SPVRDCSWHPSQPMLVSSSWDGDVVRWEFPGNG  285 (303)
Q Consensus       244 -~~~~~~~h~-------~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~~~~  285 (303)
                       .+..+. |.       ..|.+|+|--|- .++||+.+=.|++|.++...
T Consensus       278 ~~V~k~D-~N~~GY~N~~T~KS~~F~~D~-~v~tGSD~~~i~~WklP~~~  325 (609)
T KOG4227|consen  278 CFVLKSD-HNPNGYCNIKTIKSMTFIDDY-TVATGSDHWGIHIWKLPRAN  325 (609)
T ss_pred             ceeEecc-CCCCcceeeeeeeeeeeecce-eeeccCcccceEEEecCCCc
Confidence             343333 22       357788886554 49999999999999987543


No 196
>KOG1009 consensus Chromatin assembly complex 1 subunit B/CAC2 (contains WD40 repeats) [Chromatin structure and dynamics; Replication, recombination and repair]
Probab=99.42  E-value=2.3e-12  Score=111.22  Aligned_cols=177  Identities=21%  Similarity=0.313  Sum_probs=125.2

Q ss_pred             CCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCC-----ccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEE
Q 022074           82 SDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKG-----KPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWD  156 (303)
Q Consensus        82 ~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~-----~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWd  156 (303)
                      .+|..+.|.....+.++||+.|..|++|-+......+     .....+..|..+|+.+.|+++|++|+||+.++.+.+|.
T Consensus        14 ~pv~s~dfq~n~~~~laT~G~D~~iriW~v~r~~~~~~~~~V~y~s~Ls~H~~aVN~vRf~p~gelLASg~D~g~v~lWk   93 (434)
T KOG1009|consen   14 EPVYSVDFQKNSLNKLATAGGDKDIRIWKVNRSEPGGGDMKVEYLSSLSRHTRAVNVVRFSPDGELLASGGDGGEVFLWK   93 (434)
T ss_pred             CceEEEEeccCcccceecccCccceeeeeeeecCCCCCceeEEEeecccCCcceeEEEEEcCCcCeeeecCCCceEEEEE
Confidence            4788888865555599999999999999764222111     22345778999999999999999999999999999997


Q ss_pred             cccccCCcccccCccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEE
Q 022074          157 IRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYV  236 (303)
Q Consensus       157 l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~i  236 (303)
                      ...-. ...+..     +.+            +..........+.+|...      .....+++++.++++|+.|..+++
T Consensus        94 ~~~~~-~~~~d~-----e~~------------~~ke~w~v~k~lr~h~~d------iydL~Ws~d~~~l~s~s~dns~~l  149 (434)
T KOG1009|consen   94 QGDVR-IFDADT-----EAD------------LNKEKWVVKKVLRGHRDD------IYDLAWSPDSNFLVSGSVDNSVRL  149 (434)
T ss_pred             ecCcC-Cccccc-----hhh------------hCccceEEEEEecccccc------hhhhhccCCCceeeeeeccceEEE
Confidence            54200 000000     000            000000001111122111      112346789999999999999999


Q ss_pred             EECCCCeEEEEeecCCCCeEEEEECCCCCeEEEEeCCCCEEEeecC
Q 022074          237 YDLVSGEQVAALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFP  282 (303)
Q Consensus       237 wd~~~~~~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~  282 (303)
                      ||+..|+.+..+..|...+..++|.|-.+++++-+.|...+...+.
T Consensus       150 ~Dv~~G~l~~~~~dh~~yvqgvawDpl~qyv~s~s~dr~~~~~~~~  195 (434)
T KOG1009|consen  150 WDVHAGQLLAILDDHEHYVQGVAWDPLNQYVASKSSDRHPEGFSAK  195 (434)
T ss_pred             EEeccceeEeeccccccccceeecchhhhhhhhhccCcccceeeee
Confidence            9999999999999999999999999999999999999877766643


No 197
>PRK03629 tolB translocation protein TolB; Provisional
Probab=99.41  E-value=2.5e-10  Score=105.24  Aligned_cols=220  Identities=15%  Similarity=0.113  Sum_probs=133.0

Q ss_pred             hhhccccccccccCcCcccccCCCcccceEEEEEcCCCCEEEEeeC---CCeEEEEECCCCceEEEEecccCCeEEEEEc
Q 022074           14 TMESLANVTEIHDGLDFSAADDGGYSFGIFSLKFSTDGRELVAGSS---DDCIYVYDLEANKLSLRILAHTSDVNTVCFG   90 (303)
Q Consensus        14 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~s~~g~~l~sgs~---Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~   90 (303)
                      ...+-+.|++. +|.+....  ..+...+.+..|||||+.|+..+.   +..+++|++.+++.. .+....+.+....|+
T Consensus       176 ~~~~~l~~~d~-dg~~~~~l--t~~~~~~~~p~wSPDG~~la~~s~~~g~~~i~i~dl~~G~~~-~l~~~~~~~~~~~~S  251 (429)
T PRK03629        176 QFPYELRVSDY-DGYNQFVV--HRSPQPLMSPAWSPDGSKLAYVTFESGRSALVIQTLANGAVR-QVASFPRHNGAPAFS  251 (429)
T ss_pred             CcceeEEEEcC-CCCCCEEe--ecCCCceeeeEEcCCCCEEEEEEecCCCcEEEEEECCCCCeE-EccCCCCCcCCeEEC
Confidence            33445555554 34433222  234567899999999999886542   457999999887643 333344455678998


Q ss_pred             cCCCcEEE-EecCCC--eEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeC-CCcEEEEEcccccCCccc
Q 022074           91 DESGHLIY-SGSDDN--LCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGK-DQAIKLWDIRKMSSNASC  166 (303)
Q Consensus        91 ~~~~~~l~-s~s~dg--~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~-D~~v~lWdl~~~~~~~~~  166 (303)
                      |+ ++.|+ +.+.+|  .|++||+...    . ...+..+...+....|+|+|+.|+..+. ++...+|.+......   
T Consensus       252 PD-G~~La~~~~~~g~~~I~~~d~~tg----~-~~~lt~~~~~~~~~~wSPDG~~I~f~s~~~g~~~Iy~~d~~~g~---  322 (429)
T PRK03629        252 PD-GSKLAFALSKTGSLNLYVMDLASG----Q-IRQVTDGRSNNTEPTWFPDSQNLAYTSDQAGRPQVYKVNINGGA---  322 (429)
T ss_pred             CC-CCEEEEEEcCCCCcEEEEEECCCC----C-EEEccCCCCCcCceEECCCCCEEEEEeCCCCCceEEEEECCCCC---
Confidence            75 55544 545555  5888887522    2 2233334445677889999998876664 456667654321000   


Q ss_pred             ccCccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCC---CeEEEEECCCCe
Q 022074          167 NLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHD---SCVYVYDLVSGE  243 (303)
Q Consensus       167 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~d---g~i~iwd~~~~~  243 (303)
                         ..                        .+ +..+.        ....+.++|+|++++..+.+   ..|++||+.+++
T Consensus       323 ---~~------------------------~l-t~~~~--------~~~~~~~SpDG~~Ia~~~~~~g~~~I~~~dl~~g~  366 (429)
T PRK03629        323 ---PQ------------------------RI-TWEGS--------QNQDADVSSDGKFMVMVSSNGGQQHIAKQDLATGG  366 (429)
T ss_pred             ---eE------------------------Ee-ecCCC--------CccCEEECCCCCEEEEEEccCCCceEEEEECCCCC
Confidence               00                        00 00000        01135578899988876543   468999998876


Q ss_pred             EEEEeecCCCCeEEEEECCCCCeEEEEeCCCC---EEEeecCCC
Q 022074          244 QVAALKYHTSPVRDCSWHPSQPMLVSSSWDGD---VVRWEFPGN  284 (303)
Q Consensus       244 ~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~---i~~Wd~~~~  284 (303)
                      .. .+... .......|||||++|+.++.++.   +.++++.+.
T Consensus       367 ~~-~Lt~~-~~~~~p~~SpDG~~i~~~s~~~~~~~l~~~~~~G~  408 (429)
T PRK03629        367 VQ-VLTDT-FLDETPSIAPNGTMVIYSSSQGMGSVLNLVSTDGR  408 (429)
T ss_pred             eE-EeCCC-CCCCCceECCCCCEEEEEEcCCCceEEEEEECCCC
Confidence            43 33221 23456789999999999988875   667776543


No 198
>PRK05137 tolB translocation protein TolB; Provisional
Probab=99.40  E-value=1.1e-10  Score=107.77  Aligned_cols=216  Identities=14%  Similarity=0.090  Sum_probs=132.9

Q ss_pred             hccccccccccCcCcccccCCCcccceEEEEEcCCCCEEEEeeC---CCeEEEEECCCCceEEEEecccCCeEEEEEccC
Q 022074           16 ESLANVTEIHDGLDFSAADDGGYSFGIFSLKFSTDGRELVAGSS---DDCIYVYDLEANKLSLRILAHTSDVNTVCFGDE   92 (303)
Q Consensus        16 ~~~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~s~~g~~l~sgs~---Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~   92 (303)
                      .+-+.|++. +|.+..  -...|...+.+.+|+|||+.|+..+.   +..|++||+.++... .+..+.+.+....|+|+
T Consensus       181 ~~~l~~~d~-dg~~~~--~lt~~~~~v~~p~wSpDG~~lay~s~~~g~~~i~~~dl~~g~~~-~l~~~~g~~~~~~~SPD  256 (435)
T PRK05137        181 IKRLAIMDQ-DGANVR--YLTDGSSLVLTPRFSPNRQEITYMSYANGRPRVYLLDLETGQRE-LVGNFPGMTFAPRFSPD  256 (435)
T ss_pred             ceEEEEECC-CCCCcE--EEecCCCCeEeeEECCCCCEEEEEEecCCCCEEEEEECCCCcEE-EeecCCCcccCcEECCC
Confidence            344555554 455442  12467778999999999999888764   468999999888643 45556667778899875


Q ss_pred             CCcEEEEecCCCe--EEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEe-CCC--cEEEEEcccccCCcccc
Q 022074           93 SGHLIYSGSDDNL--CKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNG-KDQ--AIKLWDIRKMSSNASCN  167 (303)
Q Consensus        93 ~~~~l~s~s~dg~--v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~-~D~--~v~lWdl~~~~~~~~~~  167 (303)
                      ...++++.+.++.  |.+||+...     ....+..+........|+|+|+.|+..+ .++  .|.++|+.....     
T Consensus       257 G~~la~~~~~~g~~~Iy~~d~~~~-----~~~~Lt~~~~~~~~~~~spDG~~i~f~s~~~g~~~Iy~~d~~g~~~-----  326 (435)
T PRK05137        257 GRKVVMSLSQGGNTDIYTMDLRSG-----TTTRLTDSPAIDTSPSYSPDGSQIVFESDRSGSPQLYVMNADGSNP-----  326 (435)
T ss_pred             CCEEEEEEecCCCceEEEEECCCC-----ceEEccCCCCccCceeEcCCCCEEEEEECCCCCCeEEEEECCCCCe-----
Confidence            4444557676665  666676421     2333444554556678999999887666 344  455555432100     


Q ss_pred             cCccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCC---CeEEEEECCCCeE
Q 022074          168 LGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHD---SCVYVYDLVSGEQ  244 (303)
Q Consensus       168 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~d---g~i~iwd~~~~~~  244 (303)
                                                    ..+....      .....+.++|++++|+....+   ..|.+||...+..
T Consensus       327 ------------------------------~~lt~~~------~~~~~~~~SpdG~~ia~~~~~~~~~~i~~~d~~~~~~  370 (435)
T PRK05137        327 ------------------------------RRISFGG------GRYSTPVWSPRGDLIAFTKQGGGQFSIGVMKPDGSGE  370 (435)
T ss_pred             ------------------------------EEeecCC------CcccCeEECCCCCEEEEEEcCCCceEEEEEECCCCce
Confidence                                          0000000      001235578899988876543   4688999865543


Q ss_pred             EEEeecCCCCeEEEEECCCCCeEEEEeCC------CCEEEeecCC
Q 022074          245 VAALKYHTSPVRDCSWHPSQPMLVSSSWD------GDVVRWEFPG  283 (303)
Q Consensus       245 ~~~~~~h~~~I~~v~~sp~~~~las~s~D------g~i~~Wd~~~  283 (303)
                       ..+. ....+.+..|+|||++|+..+.+      ..|.+.++.+
T Consensus       371 -~~lt-~~~~~~~p~~spDG~~i~~~~~~~~~~~~~~L~~~dl~g  413 (435)
T PRK05137        371 -RILT-SGFLVEGPTWAPNGRVIMFFRQTPGSGGAPKLYTVDLTG  413 (435)
T ss_pred             -Eecc-CCCCCCCCeECCCCCEEEEEEccCCCCCcceEEEEECCC
Confidence             2332 22357788999999987765543      2466666654


No 199
>PRK02889 tolB translocation protein TolB; Provisional
Probab=99.40  E-value=8e-11  Score=108.46  Aligned_cols=194  Identities=20%  Similarity=0.199  Sum_probs=124.2

Q ss_pred             CCcccceEEEEEcCCCCEEEEeeCC---CeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCc
Q 022074           36 GGYSFGIFSLKFSTDGRELVAGSSD---DCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRR  112 (303)
Q Consensus        36 ~~~~~~v~~l~~s~~g~~l~sgs~D---g~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~  112 (303)
                      ..+...+.+.+|+|||+.|+..+.+   ..|++||+.+++.. .+....+......|+|+...++++.+.++...+|...
T Consensus       192 ~~~~~~v~~p~wSPDG~~la~~s~~~~~~~I~~~dl~~g~~~-~l~~~~g~~~~~~~SPDG~~la~~~~~~g~~~Iy~~d  270 (427)
T PRK02889        192 LSSPEPIISPAWSPDGTKLAYVSFESKKPVVYVHDLATGRRR-VVANFKGSNSAPAWSPDGRTLAVALSRDGNSQIYTVN  270 (427)
T ss_pred             ccCCCCcccceEcCCCCEEEEEEccCCCcEEEEEECCCCCEE-EeecCCCCccceEECCCCCEEEEEEccCCCceEEEEE
Confidence            3566789999999999998887643   35999999988653 3444445567889987544445577888887887653


Q ss_pred             cccCCCccceeecccccCeEEEEeCCCCCEEEEEe-CCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccC
Q 022074          113 CLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNG-KDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKH  191 (303)
Q Consensus       113 ~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~-~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  191 (303)
                      .   .......+..+........|+|||+.|+..+ .++...+|.+......  .                         
T Consensus       271 ~---~~~~~~~lt~~~~~~~~~~wSpDG~~l~f~s~~~g~~~Iy~~~~~~g~--~-------------------------  320 (427)
T PRK02889        271 A---DGSGLRRLTQSSGIDTEPFFSPDGRSIYFTSDRGGAPQIYRMPASGGA--A-------------------------  320 (427)
T ss_pred             C---CCCCcEECCCCCCCCcCeEEcCCCCEEEEEecCCCCcEEEEEECCCCc--e-------------------------
Confidence            2   1122334444444455677999999887554 4577788876421100  0                         


Q ss_pred             CCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCC---eEEEEECCCCeEEEEeecCCCCeEEEEECCCCCeEE
Q 022074          192 PCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDS---CVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQPMLV  268 (303)
Q Consensus       192 ~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg---~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~la  268 (303)
                          ...++.+.        ....+.+||+|++++..+.++   .|++||+.+++.. .+... .......|+||+++|+
T Consensus       321 ----~~lt~~g~--------~~~~~~~SpDG~~Ia~~s~~~g~~~I~v~d~~~g~~~-~lt~~-~~~~~p~~spdg~~l~  386 (427)
T PRK02889        321 ----QRVTFTGS--------YNTSPRISPDGKLLAYISRVGGAFKLYVQDLATGQVT-ALTDT-TRDESPSFAPNGRYIL  386 (427)
T ss_pred             ----EEEecCCC--------CcCceEECCCCCEEEEEEccCCcEEEEEEECCCCCeE-EccCC-CCccCceECCCCCEEE
Confidence                00000110        012356889999988776554   6999999887643 33222 2346789999999877


Q ss_pred             EEeCCC
Q 022074          269 SSSWDG  274 (303)
Q Consensus       269 s~s~Dg  274 (303)
                      .++.++
T Consensus       387 ~~~~~~  392 (427)
T PRK02889        387 YATQQG  392 (427)
T ss_pred             EEEecC
Confidence            766544


No 200
>KOG1272 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=99.40  E-value=8.8e-13  Score=115.63  Aligned_cols=197  Identities=18%  Similarity=0.282  Sum_probs=145.6

Q ss_pred             ccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCC
Q 022074           39 SFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKG  118 (303)
Q Consensus        39 ~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~  118 (303)
                      +++-+.+.++.+|+.++.|+..|.+-.+|..++++...+. -...|..+.|.+ +.++| .......+++||-.     +
T Consensus       129 eFGPY~~~ytrnGrhlllgGrKGHlAa~Dw~t~~L~~Ei~-v~Etv~Dv~~LH-neq~~-AVAQK~y~yvYD~~-----G  200 (545)
T KOG1272|consen  129 EFGPYHLDYTRNGRHLLLGGRKGHLAAFDWVTKKLHFEIN-VMETVRDVTFLH-NEQFF-AVAQKKYVYVYDNN-----G  200 (545)
T ss_pred             ccCCeeeeecCCccEEEecCCccceeeeecccceeeeeee-hhhhhhhhhhhc-chHHH-HhhhhceEEEecCC-----C
Confidence            4567899999999999999999999999999998776553 335688888864 34444 55566799999943     3


Q ss_pred             ccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCcce
Q 022074          119 KPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVA  198 (303)
Q Consensus       119 ~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  198 (303)
                      ....++..| ..|..+.|-|--=+|++++.-|-++--|+......++...+...                        + 
T Consensus       201 tElHClk~~-~~v~rLeFLPyHfLL~~~~~~G~L~Y~DVS~GklVa~~~t~~G~------------------------~-  254 (545)
T KOG1272|consen  201 TELHCLKRH-IRVARLEFLPYHFLLVAASEAGFLKYQDVSTGKLVASIRTGAGR------------------------T-  254 (545)
T ss_pred             cEEeehhhc-CchhhhcccchhheeeecccCCceEEEeechhhhhHHHHccCCc------------------------c-
Confidence            334444433 34666777775556778888888888888765444332221100                        0 


Q ss_pred             EEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEECCCCCeEEEEeCCCCEEE
Q 022074          199 TYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVR  278 (303)
Q Consensus       199 ~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~  278 (303)
                               .++.      -+|-+-.+-+|...|+|.+|.....+.+..+-.|.++|.++++.++|+++||++.|..+++
T Consensus       255 ---------~vm~------qNP~NaVih~GhsnGtVSlWSP~skePLvKiLcH~g~V~siAv~~~G~YMaTtG~Dr~~kI  319 (545)
T KOG1272|consen  255 ---------DVMK------QNPYNAVIHLGHSNGTVSLWSPNSKEPLVKILCHRGPVSSIAVDRGGRYMATTGLDRKVKI  319 (545)
T ss_pred             ---------chhh------cCCccceEEEcCCCceEEecCCCCcchHHHHHhcCCCcceEEECCCCcEEeecccccceeE
Confidence                     0000      1234566788999999999999998887777789999999999999999999999999999


Q ss_pred             eecCCC
Q 022074          279 WEFPGN  284 (303)
Q Consensus       279 Wd~~~~  284 (303)
                      ||+..-
T Consensus       320 WDlR~~  325 (545)
T KOG1272|consen  320 WDLRNF  325 (545)
T ss_pred             eeeccc
Confidence            998754


No 201
>KOG2394 consensus WD40 protein DMR-N9 [General function prediction only]
Probab=99.38  E-value=2.1e-11  Score=108.80  Aligned_cols=219  Identities=16%  Similarity=0.194  Sum_probs=129.3

Q ss_pred             eEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEE----EEecccCCeEEEEEcc----CCCcEEEEecCCCeEEEEcCcc
Q 022074           42 IFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSL----RILAHTSDVNTVCFGD----ESGHLIYSGSDDNLCKVWDRRC  113 (303)
Q Consensus        42 v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~----~~~~h~~~v~~l~~~~----~~~~~l~s~s~dg~v~lWd~~~  113 (303)
                      +....++..|++|+--- ...+++|+.+.+....    +..-.....+|-.|+.    +.+--++.|-.-|.|.+.|...
T Consensus       126 ~~~~~~~~~gd~lcFnv-g~~lyv~~~~g~~~~~~pi~k~~y~gt~P~cHdfn~~~a~~~g~dllIGf~tGqvq~idp~~  204 (636)
T KOG2394|consen  126 VTNTNQSGKGDRLCFNV-GRELYVYSYRGAADLSKPIDKREYKGTSPTCHDFNSFTATPKGLDLLIGFTTGQVQLIDPIN  204 (636)
T ss_pred             eeeccccCCCCEEEEec-CCeEEEEEccCcchhccchhhhcccCCCCceecccccccCCCCcceEEeeccCceEEecchh
Confidence            44445555677665433 3358889887543221    1111111223333421    2344566788888998887531


Q ss_pred             ccCCCccceeec--c--cccCeEEEEeCC-CCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCcc
Q 022074          114 LNVKGKPAGVLM--G--HLEGITFIDSRG-DGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARD  188 (303)
Q Consensus       114 ~~~~~~~~~~~~--~--h~~~v~~~~~~~-~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  188 (303)
                          ....+.+.  .  ....|+++.|-+ +..+++.+-.+|.+.++|...... .+      ...+           ..
T Consensus       205 ----~~~sklfne~r~i~ktsvT~ikWvpg~~~~Fl~a~~sGnlyly~~~~~~~-~t------~p~~-----------~~  262 (636)
T KOG2394|consen  205 ----FEVSKLFNEERLINKSSVTCIKWVPGSDSLFLVAHASGNLYLYDKEIVCG-AT------APSY-----------QA  262 (636)
T ss_pred             ----hHHHHhhhhcccccccceEEEEEEeCCCceEEEEEecCceEEeecccccc-CC------CCcc-----------cc
Confidence                11111111  1  235688888876 456788888999999999732110 00      0000           00


Q ss_pred             ccCCCCCcceEEecccceeeeEEE------eeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEECC
Q 022074          189 LKHPCDQSVATYKGHSVLRTLIRC------HFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSWHP  262 (303)
Q Consensus       189 ~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp  262 (303)
                      +.....-.+.+.+.+.....+.++      ...-.|++||++||+.++||.+||+|..+.+++..++..-+...||+|||
T Consensus       263 ~k~~~~f~i~t~ksk~~rNPv~~w~~~~g~in~f~FS~DG~~LA~VSqDGfLRvF~fdt~eLlg~mkSYFGGLLCvcWSP  342 (636)
T KOG2394|consen  263 LKDGDQFAILTSKSKKTRNPVARWHIGEGSINEFAFSPDGKYLATVSQDGFLRIFDFDTQELLGVMKSYFGGLLCVCWSP  342 (636)
T ss_pred             cCCCCeeEEeeeeccccCCccceeEeccccccceeEcCCCceEEEEecCceEEEeeccHHHHHHHHHhhccceEEEEEcC
Confidence            000000001111111100111111      12234889999999999999999999999988888877778899999999


Q ss_pred             CCCeEEEEeCCCCEEEeecCC
Q 022074          263 SQPMLVSSSWDGDVVRWEFPG  283 (303)
Q Consensus       263 ~~~~las~s~Dg~i~~Wd~~~  283 (303)
                      ||+++++|++|-.+.+|.+..
T Consensus       343 DGKyIvtGGEDDLVtVwSf~e  363 (636)
T KOG2394|consen  343 DGKYIVTGGEDDLVTVWSFEE  363 (636)
T ss_pred             CccEEEecCCcceEEEEEecc
Confidence            999999999999999999753


No 202
>PRK04922 tolB translocation protein TolB; Provisional
Probab=99.38  E-value=1.3e-10  Score=107.35  Aligned_cols=218  Identities=19%  Similarity=0.179  Sum_probs=133.0

Q ss_pred             hccccccccccCcCcccccCCCcccceEEEEEcCCCCEEEEeeCC---CeEEEEECCCCceEEEEecccCCeEEEEEccC
Q 022074           16 ESLANVTEIHDGLDFSAADDGGYSFGIFSLKFSTDGRELVAGSSD---DCIYVYDLEANKLSLRILAHTSDVNTVCFGDE   92 (303)
Q Consensus        16 ~~~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~s~~g~~l~sgs~D---g~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~   92 (303)
                      .+-+.|++. +|.+...  ...|..++.+..|+|||+.|+..+.+   ..|++||+.++... .+....+......|+|+
T Consensus       183 ~~~l~i~D~-~g~~~~~--lt~~~~~v~~p~wSpDg~~la~~s~~~~~~~l~~~dl~~g~~~-~l~~~~g~~~~~~~SpD  258 (433)
T PRK04922        183 RYALQVADS-DGYNPQT--ILRSAEPILSPAWSPDGKKLAYVSFERGRSAIYVQDLATGQRE-LVASFRGINGAPSFSPD  258 (433)
T ss_pred             eEEEEEECC-CCCCceE--eecCCCccccccCCCCCCEEEEEecCCCCcEEEEEECCCCCEE-EeccCCCCccCceECCC
Confidence            344556654 4544322  23456679999999999999887743   46999999887643 34444445567889875


Q ss_pred             CCcEEEEecCCC--eEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEe-CCCcEEEEEcccccCCcccccC
Q 022074           93 SGHLIYSGSDDN--LCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNG-KDQAIKLWDIRKMSSNASCNLG  169 (303)
Q Consensus        93 ~~~~l~s~s~dg--~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~-~D~~v~lWdl~~~~~~~~~~~~  169 (303)
                      ...++++.+.+|  .|++||+...     ....+..+.......+|+++|++|+..+ .++...+|.+......  .   
T Consensus       259 G~~l~~~~s~~g~~~Iy~~d~~~g-----~~~~lt~~~~~~~~~~~spDG~~l~f~sd~~g~~~iy~~dl~~g~--~---  328 (433)
T PRK04922        259 GRRLALTLSRDGNPEIYVMDLGSR-----QLTRLTNHFGIDTEPTWAPDGKSIYFTSDRGGRPQIYRVAASGGS--A---  328 (433)
T ss_pred             CCEEEEEEeCCCCceEEEEECCCC-----CeEECccCCCCccceEECCCCCEEEEEECCCCCceEEEEECCCCC--e---
Confidence            444555666655  5888887522     1233444444455678999999887665 4555455543211000  0   


Q ss_pred             ccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCC---eEEEEECCCCeEEE
Q 022074          170 FRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDS---CVYVYDLVSGEQVA  246 (303)
Q Consensus       170 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg---~i~iwd~~~~~~~~  246 (303)
                                                ...+..+.        ....+.+||+|++++..+.++   .|++||+.+++.. 
T Consensus       329 --------------------------~~lt~~g~--------~~~~~~~SpDG~~Ia~~~~~~~~~~I~v~d~~~g~~~-  373 (433)
T PRK04922        329 --------------------------ERLTFQGN--------YNARASVSPDGKKIAMVHGSGGQYRIAVMDLSTGSVR-  373 (433)
T ss_pred             --------------------------EEeecCCC--------CccCEEECCCCCEEEEEECCCCceeEEEEECCCCCeE-
Confidence                                      00000010        012356889999988765433   6999999887654 


Q ss_pred             EeecCCCCeEEEEECCCCCeEEEEeCC---CCEEEeecCC
Q 022074          247 ALKYHTSPVRDCSWHPSQPMLVSSSWD---GDVVRWEFPG  283 (303)
Q Consensus       247 ~~~~h~~~I~~v~~sp~~~~las~s~D---g~i~~Wd~~~  283 (303)
                      .+. +........|+|||++|+..+.+   ..|.+++..+
T Consensus       374 ~Lt-~~~~~~~p~~spdG~~i~~~s~~~g~~~L~~~~~~g  412 (433)
T PRK04922        374 TLT-PGSLDESPSFAPNGSMVLYATREGGRGVLAAVSTDG  412 (433)
T ss_pred             ECC-CCCCCCCceECCCCCEEEEEEecCCceEEEEEECCC
Confidence            333 32345677999999987766653   3577777654


No 203
>KOG3881 consensus Uncharacterized conserved protein [Function unknown]
Probab=99.37  E-value=8.3e-11  Score=101.38  Aligned_cols=207  Identities=22%  Similarity=0.289  Sum_probs=139.6

Q ss_pred             CCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCce---EEEEecccCCeEEEEEccCCCcEEEEecCC--CeEEEEc
Q 022074           36 GGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKL---SLRILAHTSDVNTVCFGDESGHLIYSGSDD--NLCKVWD  110 (303)
Q Consensus        36 ~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~---~~~~~~h~~~v~~l~~~~~~~~~l~s~s~d--g~v~lWd  110 (303)
                      .+.+.+|..+... | .+|++|-.+|.+++|..+.+..   ......-..++..+.-.+.....+++|++.  ..+++||
T Consensus       102 ~l~~~~I~gl~~~-d-g~Litc~~sG~l~~~~~k~~d~hss~l~~la~g~g~~~~r~~~~~p~Iva~GGke~~n~lkiwd  179 (412)
T KOG3881|consen  102 SLGTKSIKGLKLA-D-GTLITCVSSGNLQVRHDKSGDLHSSKLIKLATGPGLYDVRQTDTDPYIVATGGKENINELKIWD  179 (412)
T ss_pred             ccccccccchhhc-C-CEEEEEecCCcEEEEeccCCccccccceeeecCCceeeeccCCCCCceEecCchhcccceeeee
Confidence            3444556555554 3 3688888999999998884431   111223336788888777777888899998  8999999


Q ss_pred             CccccCCCccceee---cccccCe--EEEEeCCC--CCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCC
Q 022074          111 RRCLNVKGKPAGVL---MGHLEGI--TFIDSRGD--GRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYP  183 (303)
Q Consensus       111 ~~~~~~~~~~~~~~---~~h~~~v--~~~~~~~~--~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~  183 (303)
                      +....+........   .+-.-+|  +.+.|-+.  ...|+++..-+.+|+||.+.......        .+++.     
T Consensus       180 le~~~qiw~aKNvpnD~L~LrVPvW~tdi~Fl~g~~~~~fat~T~~hqvR~YDt~~qRRPV~--------~fd~~-----  246 (412)
T KOG3881|consen  180 LEQSKQIWSAKNVPNDRLGLRVPVWITDIRFLEGSPNYKFATITRYHQVRLYDTRHQRRPVA--------QFDFL-----  246 (412)
T ss_pred             cccceeeeeccCCCCccccceeeeeeccceecCCCCCceEEEEecceeEEEecCcccCccee--------Eeccc-----
Confidence            85221111101000   0111122  23445554  56799999999999999985321110        00000     


Q ss_pred             CCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEE-eecCCCCeEEEEECC
Q 022074          184 PQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAA-LKYHTSPVRDCSWHP  262 (303)
Q Consensus       184 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~-~~~h~~~I~~v~~sp  262 (303)
                                ...+.                +...-|++.++++|..-|.+..+|.+.++.+.. +++-.+.|.++.-+|
T Consensus       247 ----------E~~is----------------~~~l~p~gn~Iy~gn~~g~l~~FD~r~~kl~g~~~kg~tGsirsih~hp  300 (412)
T KOG3881|consen  247 ----------ENPIS----------------STGLTPSGNFIYTGNTKGQLAKFDLRGGKLLGCGLKGITGSIRSIHCHP  300 (412)
T ss_pred             ----------cCcce----------------eeeecCCCcEEEEecccchhheecccCceeeccccCCccCCcceEEEcC
Confidence                      00000                001235788999999999999999999998776 888899999999999


Q ss_pred             CCCeEEEEeCCCCEEEeecCC
Q 022074          263 SQPMLVSSSWDGDVVRWEFPG  283 (303)
Q Consensus       263 ~~~~las~s~Dg~i~~Wd~~~  283 (303)
                      ++++||++|-|+.++++|.+.
T Consensus       301 ~~~~las~GLDRyvRIhD~kt  321 (412)
T KOG3881|consen  301 THPVLASCGLDRYVRIHDIKT  321 (412)
T ss_pred             CCceEEeeccceeEEEeeccc
Confidence            999999999999999999876


No 204
>KOG1587 consensus Cytoplasmic dynein intermediate chain [Cytoskeleton]
Probab=99.37  E-value=6.6e-11  Score=110.05  Aligned_cols=239  Identities=15%  Similarity=0.182  Sum_probs=158.8

Q ss_pred             EEEccCchhhccccccccccCcC--cc--cccCCCcccceEEEEEcCCCCE--EEEeeCCCeEEEEECCCCceE-----E
Q 022074            7 IVDVGSGTMESLANVTEIHDGLD--FS--AADDGGYSFGIFSLKFSTDGRE--LVAGSSDDCIYVYDLEANKLS-----L   75 (303)
Q Consensus         7 ~~~~~~~~~~~~~~~~~~~~~~~--~~--~~~~~~~~~~v~~l~~s~~g~~--l~sgs~Dg~v~lwd~~~~~~~-----~   75 (303)
                      |-.+-.|.-..-+-+|+.--|..  .+  -.-...|..++..+.|..+..-  ++++|.||.|..|+++.-...     .
T Consensus       255 p~ll~gG~y~GqV~lWD~~~~~~~~~s~ls~~~~sh~~~v~~vvW~~~~~~~~f~s~ssDG~i~~W~~~~l~~P~e~~~~  334 (555)
T KOG1587|consen  255 PNLLAGGCYNGQVVLWDLRKGSDTPPSGLSALEVSHSEPVTAVVWLQNEHNTEFFSLSSDGSICSWDTDMLSLPVEGLLL  334 (555)
T ss_pred             cceEEeeccCceEEEEEccCCCCCCCcccccccccCCcCeEEEEEeccCCCCceEEEecCCcEeeeeccccccchhhccc
Confidence            33333455566778888765554  22  2233789999999999975444  999999999999987654321     1


Q ss_pred             EEecc-------cCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCC----ccceeecccccCeEEEEeCCCCCEEE
Q 022074           76 RILAH-------TSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKG----KPAGVLMGHLEGITFIDSRGDGRYLI  144 (303)
Q Consensus        76 ~~~~h-------~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~----~~~~~~~~h~~~v~~~~~~~~~~~l~  144 (303)
                      ....|       ..+++++.|.+.+.+.|+.|+.+|.|.-=+........    +....+..|.+.|.++.++|=...++
T Consensus       335 ~~~~~~~~~~~~~~~~t~~~F~~~~p~~FiVGTe~G~v~~~~r~g~~~~~~~~~~~~~~~~~h~g~v~~v~~nPF~~k~f  414 (555)
T KOG1587|consen  335 ESKKHKGQQSSKAVGATSLKFEPTDPNHFIVGTEEGKVYKGCRKGYTPAPEVSYKGHSTFITHIGPVYAVSRNPFYPKNF  414 (555)
T ss_pred             ccccccccccccccceeeEeeccCCCceEEEEcCCcEEEEEeccCCcccccccccccccccccCcceEeeecCCCcccee
Confidence            11111       13678899977778899999999999763221111111    22334566888999999988766655


Q ss_pred             EEeCCCcEEEEEcccc-cCCcccccCccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCe
Q 022074          145 SNGKDQAIKLWDIRKM-SSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQK  223 (303)
Q Consensus       145 s~~~D~~v~lWdl~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~  223 (303)
                      ..+.|-+|+||..... .+..                                  .+..+.  ..+....|+|.   ...
T Consensus       415 ls~gDW~vriWs~~~~~~Pl~----------------------------------~~~~~~--~~v~~vaWSpt---rpa  455 (555)
T KOG1587|consen  415 LSVGDWTVRIWSEDVIASPLL----------------------------------SLDSSP--DYVTDVAWSPT---RPA  455 (555)
T ss_pred             eeeccceeEeccccCCCCcch----------------------------------hhhhcc--ceeeeeEEcCc---Cce
Confidence            4444999999976521 1100                                  000000  00122334432   235


Q ss_pred             EEEEEeCCCeEEEEECCCC--eEEEEeecCCCCeEEEEECCCCCeEEEEeCCCCEEEeecCCC
Q 022074          224 YIYTGSHDSCVYVYDLVSG--EQVAALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFPGN  284 (303)
Q Consensus       224 ~latg~~dg~i~iwd~~~~--~~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~~~  284 (303)
                      .|+++..||.|.+||+...  +++...+.+....+.+.|++.|++|+.|+..|+++++++..+
T Consensus       456 vF~~~d~~G~l~iWDLl~~~~~Pv~s~~~~~~~l~~~~~s~~g~~lavGd~~G~~~~~~l~~~  518 (555)
T KOG1587|consen  456 VFATVDGDGNLDIWDLLQDDEEPVLSQKVCSPALTRVRWSPNGKLLAVGDANGTTHILKLSES  518 (555)
T ss_pred             EEEEEcCCCceehhhhhccccCCcccccccccccceeecCCCCcEEEEecCCCcEEEEEcCch
Confidence            7899999999999999654  345555566677889999999999999999999999999643


No 205
>KOG1524 consensus WD40 repeat-containing protein CHE-2 [General function prediction only]
Probab=99.36  E-value=1e-11  Score=110.76  Aligned_cols=182  Identities=15%  Similarity=0.180  Sum_probs=125.9

Q ss_pred             CCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCcccc
Q 022074           36 GGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLN  115 (303)
Q Consensus        36 ~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~  115 (303)
                      .+|..++.|-.|+|||.-|++++.||.|++|.- +|-+...+......|.|++|.|++.+.+++.+..-.|+-  +.   
T Consensus       101 ~AH~~A~~~gRW~~dGtgLlt~GEDG~iKiWSr-sGMLRStl~Q~~~~v~c~~W~p~S~~vl~c~g~h~~IKp--L~---  174 (737)
T KOG1524|consen  101 SAHAAAISSGRWSPDGAGLLTAGEDGVIKIWSR-SGMLRSTVVQNEESIRCARWAPNSNSIVFCQGGHISIKP--LA---  174 (737)
T ss_pred             hhhhhhhhhcccCCCCceeeeecCCceEEEEec-cchHHHHHhhcCceeEEEEECCCCCceEEecCCeEEEee--cc---
Confidence            589999999999999999999999999999965 333333344556789999998887888887776444432  21   


Q ss_pred             CCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCC
Q 022074          116 VKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQ  195 (303)
Q Consensus       116 ~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  195 (303)
                      .+.+ +-....|.+-|.+++|++..+++++||.|-..++||--......+                              
T Consensus       175 ~n~k-~i~WkAHDGiiL~~~W~~~s~lI~sgGED~kfKvWD~~G~~Lf~S------------------------------  223 (737)
T KOG1524|consen  175 ANSK-IIRWRAHDGLVLSLSWSTQSNIIASGGEDFRFKIWDAQGANLFTS------------------------------  223 (737)
T ss_pred             cccc-eeEEeccCcEEEEeecCccccceeecCCceeEEeecccCcccccC------------------------------
Confidence            2222 334567888899999999999999999999999999532111100                              


Q ss_pred             cceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEECCCCCeEEEEeCCCC
Q 022074          196 SVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQPMLVSSSWDGD  275 (303)
Q Consensus       196 ~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~  275 (303)
                           ..|...++      +..|.|+ +.++.++. .++|           --+...+.|..++||+||.+++.|...|.
T Consensus       224 -----~~~ey~IT------Sva~npd-~~~~v~S~-nt~R-----------~~~p~~GSifnlsWS~DGTQ~a~gt~~G~  279 (737)
T KOG1524|consen  224 -----AAEEYAIT------SVAFNPE-KDYLLWSY-NTAR-----------FSSPRVGSIFNLSWSADGTQATCGTSTGQ  279 (737)
T ss_pred             -----Chhcccee------eeeeccc-cceeeeee-eeee-----------ecCCCccceEEEEEcCCCceeeccccCce
Confidence                 00111111      2235566 44444432 2233           11344578999999999999999998887


Q ss_pred             EEE
Q 022074          276 VVR  278 (303)
Q Consensus       276 i~~  278 (303)
                      +.+
T Consensus       280 v~~  282 (737)
T KOG1524|consen  280 LIV  282 (737)
T ss_pred             EEE
Confidence            764


No 206
>KOG2111 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=99.36  E-value=5.9e-10  Score=93.92  Aligned_cols=220  Identities=15%  Similarity=0.190  Sum_probs=128.7

Q ss_pred             ceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCe--EEEEEccCCCcEEEEecCC------CeEEEEcCc
Q 022074           41 GIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDV--NTVCFGDESGHLIYSGSDD------NLCKVWDRR  112 (303)
Q Consensus        41 ~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v--~~l~~~~~~~~~l~s~s~d------g~v~lWd~~  112 (303)
                      ...+++|+-|..-+++|..+| .+||+.+.-++......+.++.  ..+-|-  ..-+.+.|+.+      ..|.+||- 
T Consensus         7 ~~lsvs~NQD~ScFava~~~G-friyn~~P~ke~~~r~~~~~G~~~veMLfR--~N~laLVGGg~~pky~pNkviIWDD-   82 (346)
T KOG2111|consen    7 KTLSVSFNQDHSCFAVATDTG-FRIYNCDPFKESASRQFIDGGFKIVEMLFR--SNYLALVGGGSRPKYPPNKVIIWDD-   82 (346)
T ss_pred             ceeEEEEccCCceEEEEecCc-eEEEecCchhhhhhhccccCchhhhhHhhh--hceEEEecCCCCCCCCCceEEEEec-
Confidence            366799999999999998888 9999887644322222222221  111221  12233344433      36889992 


Q ss_pred             cccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEccc-cc---CCcccccCccceeeeceeeeCCCC--C
Q 022074          113 CLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRK-MS---SNASCNLGFRSYEWDYRWMDYPPQ--A  186 (303)
Q Consensus       113 ~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~-~~---~~~~~~~~~~~~~~~~~~~~~~~~--~  186 (303)
                         ...+++..+ .....|.++.+.++  .|+.. .++.|.+|.... ++   .......+ ..      .+.+.+.  .
T Consensus        83 ---~k~~~i~el-~f~~~I~~V~l~r~--riVvv-l~~~I~VytF~~n~k~l~~~et~~NP-kG------lC~~~~~~~k  148 (346)
T KOG2111|consen   83 ---LKERCIIEL-SFNSEIKAVKLRRD--RIVVV-LENKIYVYTFPDNPKLLHVIETRSNP-KG------LCSLCPTSNK  148 (346)
T ss_pred             ---ccCcEEEEE-EeccceeeEEEcCC--eEEEE-ecCeEEEEEcCCChhheeeeecccCC-Cc------eEeecCCCCc
Confidence               223333333 35667888887665  45544 358899997542 11   10000000 00      0111110  0


Q ss_pred             ccccCCCCC--cc-------------eEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCe-EEEEECCCCeEEEEee-
Q 022074          187 RDLKHPCDQ--SV-------------ATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSC-VYVYDLVSGEQVAALK-  249 (303)
Q Consensus       187 ~~~~~~~~~--~~-------------~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~-i~iwd~~~~~~~~~~~-  249 (303)
                      ..+..|...  .+             .....|...+.+      ...+-+|..+||+|+.|+ |||||..+|+++.++. 
T Consensus       149 ~~LafPg~k~GqvQi~dL~~~~~~~p~~I~AH~s~Iac------v~Ln~~Gt~vATaStkGTLIRIFdt~~g~~l~E~RR  222 (346)
T KOG2111|consen  149 SLLAFPGFKTGQVQIVDLASTKPNAPSIINAHDSDIAC------VALNLQGTLVATASTKGTLIRIFDTEDGTLLQELRR  222 (346)
T ss_pred             eEEEcCCCccceEEEEEhhhcCcCCceEEEcccCceeE------EEEcCCccEEEEeccCcEEEEEEEcCCCcEeeeeec
Confidence            111111100  00             112222222211      124567999999999998 8999999999998885 


Q ss_pred             -cCCCCeEEEEECCCCCeEEEEeCCCCEEEeecCCC
Q 022074          250 -YHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFPGN  284 (303)
Q Consensus       250 -~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~~~  284 (303)
                       .....|.+++|||+..+||.+|+.|+++++.+...
T Consensus       223 G~d~A~iy~iaFSp~~s~LavsSdKgTlHiF~l~~~  258 (346)
T KOG2111|consen  223 GVDRADIYCIAFSPNSSWLAVSSDKGTLHIFSLRDT  258 (346)
T ss_pred             CCchheEEEEEeCCCccEEEEEcCCCeEEEEEeecC
Confidence             33468999999999999999999999999998754


No 207
>KOG1524 consensus WD40 repeat-containing protein CHE-2 [General function prediction only]
Probab=99.35  E-value=2e-11  Score=108.99  Aligned_cols=219  Identities=16%  Similarity=0.244  Sum_probs=136.9

Q ss_pred             CCcccceEEEEEcCCCCEEEEeeCCCeEEEE-ECCCCceEEEEecccCCeEEEEE----c---cCCCcEEEEecCCCeEE
Q 022074           36 GGYSFGIFSLKFSTDGRELVAGSSDDCIYVY-DLEANKLSLRILAHTSDVNTVCF----G---DESGHLIYSGSDDNLCK  107 (303)
Q Consensus        36 ~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lw-d~~~~~~~~~~~~h~~~v~~l~~----~---~~~~~~l~s~s~dg~v~  107 (303)
                      +.|.--|.|+.|+.+...+.+++ |..+.+| |+.+.. .....-....+....+    +   ....+.|+.++.||.+.
T Consensus        11 ~r~~e~vc~v~w~~~eei~~~~d-Dh~~~~~~~~~~~s-~~~~~~p~df~pt~~h~~~rs~~~g~~~d~~~i~s~DGkf~   88 (737)
T KOG1524|consen   11 NRNSEKVCCVDWSSNEEIYFVSD-DHQIFKWSDVSRDS-VEVAKLPDDFVPTDMHLGGRSSGGGKGSDTLLICSNDGRFV   88 (737)
T ss_pred             cccceeEEeecccccceEEEecc-CceEEEeecccchh-hhhhhCCcccCCccccccccccCCCCCcceEEEEcCCceEE
Confidence            46666788999998887666655 5555555 444332 2111111122221111    0   11345788899999998


Q ss_pred             EEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccC--CcccccCccceeeeceeeeCCCC
Q 022074          108 VWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSS--NASCNLGFRSYEWDYRWMDYPPQ  185 (303)
Q Consensus       108 lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~--~~~~~~~~~~~~~~~~~~~~~~~  185 (303)
                      +-+.     .++.......|.+++.+-.|+++|.-|+|+|.||.|++|.-..+..  ..+.....++..|       .|+
T Consensus        89 il~k-----~~rVE~sv~AH~~A~~~gRW~~dGtgLlt~GEDG~iKiWSrsGMLRStl~Q~~~~v~c~~W-------~p~  156 (737)
T KOG1524|consen   89 ILNK-----SARVERSISAHAAAISSGRWSPDGAGLLTAGEDGVIKIWSRSGMLRSTVVQNEESIRCARW-------APN  156 (737)
T ss_pred             Eecc-----cchhhhhhhhhhhhhhhcccCCCCceeeeecCCceEEEEeccchHHHHHhhcCceeEEEEE-------CCC
Confidence            8763     3455566778999999999999999999999999999997443211  1111122233333       333


Q ss_pred             CccccCCC-----------CCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCC
Q 022074          186 ARDLKHPC-----------DQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSP  254 (303)
Q Consensus       186 ~~~~~~~~-----------~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~  254 (303)
                      ...+..+.           ..++..+..|..+.  +    +..+++...++++||+|=..++||. .|..+.....|+.|
T Consensus       157 S~~vl~c~g~h~~IKpL~~n~k~i~WkAHDGii--L----~~~W~~~s~lI~sgGED~kfKvWD~-~G~~Lf~S~~~ey~  229 (737)
T KOG1524|consen  157 SNSIVFCQGGHISIKPLAANSKIIRWRAHDGLV--L----SLSWSTQSNIIASGGEDFRFKIWDA-QGANLFTSAAEEYA  229 (737)
T ss_pred             CCceEEecCCeEEEeecccccceeEEeccCcEE--E----EeecCccccceeecCCceeEEeecc-cCcccccCChhccc
Confidence            22221111           11223344444322  2    2335667889999999999999997 57778888899999


Q ss_pred             eEEEEECCCCCeEEEEeCCCCEE
Q 022074          255 VRDCSWHPSQPMLVSSSWDGDVV  277 (303)
Q Consensus       255 I~~v~~sp~~~~las~s~Dg~i~  277 (303)
                      |++++|+|+ ..++-+|. .+++
T Consensus       230 ITSva~npd-~~~~v~S~-nt~R  250 (737)
T KOG1524|consen  230 ITSVAFNPE-KDYLLWSY-NTAR  250 (737)
T ss_pred             eeeeeeccc-cceeeeee-eeee
Confidence            999999999 55665654 3555


No 208
>KOG1963 consensus WD40 repeat protein [General function prediction only]
Probab=99.34  E-value=8.6e-10  Score=103.71  Aligned_cols=144  Identities=24%  Similarity=0.250  Sum_probs=104.4

Q ss_pred             EEEccCchhhc--------cccccc--cccCcCcccccCCCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCC---Cce
Q 022074            7 IVDVGSGTMES--------LANVTE--IHDGLDFSAADDGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEA---NKL   73 (303)
Q Consensus         7 ~~~~~~~~~~~--------~~~~~~--~~~~~~~~~~~~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~---~~~   73 (303)
                      +++.++|.+-+        .+.|..  .+++..-   ..--|...+.+.++||+++++++|..||.|.+|.--.   ...
T Consensus       166 I~~~~~ge~~~i~~~~~~~~~~v~~~~~~~~~~~---~~~~Htf~~t~~~~spn~~~~Aa~d~dGrI~vw~d~~~~~~~~  242 (792)
T KOG1963|consen  166 IVDNNSGEFKGIVHMCKIHIYFVPKHTKHTSSRD---ITVHHTFNITCVALSPNERYLAAGDSDGRILVWRDFGSSDDSE  242 (792)
T ss_pred             EEEcCCceEEEEEEeeeEEEEEecccceeeccch---hhhhhcccceeEEeccccceEEEeccCCcEEEEeccccccccc
Confidence            56667766653        334444  2222222   2246888899999999999999999999999995533   122


Q ss_pred             -EEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcE
Q 022074           74 -SLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAI  152 (303)
Q Consensus        74 -~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v  152 (303)
                       ...+.=|...|+++.|++ ++.+|+||+..|.+.+|-+...    + ..-+..-...|..+.++||+.+.+..-.|.+|
T Consensus       243 t~t~lHWH~~~V~~L~fS~-~G~~LlSGG~E~VLv~Wq~~T~----~-kqfLPRLgs~I~~i~vS~ds~~~sl~~~DNqI  316 (792)
T KOG1963|consen  243 TCTLLHWHHDEVNSLSFSS-DGAYLLSGGREGVLVLWQLETG----K-KQFLPRLGSPILHIVVSPDSDLYSLVLEDNQI  316 (792)
T ss_pred             cceEEEecccccceeEEec-CCceEeecccceEEEEEeecCC----C-cccccccCCeeEEEEEcCCCCeEEEEecCceE
Confidence             233445778999999975 5789999999999999987522    2 22233335678999999999999888899999


Q ss_pred             EEEEccc
Q 022074          153 KLWDIRK  159 (303)
Q Consensus       153 ~lWdl~~  159 (303)
                      .+-....
T Consensus       317 ~li~~~d  323 (792)
T KOG1963|consen  317 HLIKASD  323 (792)
T ss_pred             EEEeccc
Confidence            9987643


No 209
>KOG2110 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=99.32  E-value=1.6e-09  Score=92.89  Aligned_cols=195  Identities=14%  Similarity=0.213  Sum_probs=131.8

Q ss_pred             ceEEEEEcCCCCEEEEeeCC--CeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCC
Q 022074           41 GIFSLKFSTDGRELVAGSSD--DCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKG  118 (303)
Q Consensus        41 ~v~~l~~s~~g~~l~sgs~D--g~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~  118 (303)
                      .|.-+-|+..  .+|..+.+  +.+++++.+.+..++.+.- ...|-++..+   .++|+.+=++ .|++||++.-..- 
T Consensus        48 ~IvEmLFSSS--LvaiV~~~qpr~Lkv~~~Kk~~~ICe~~f-pt~IL~VrmN---r~RLvV~Lee-~IyIydI~~MklL-  119 (391)
T KOG2110|consen   48 SIVEMLFSSS--LVAIVSIKQPRKLKVVHFKKKTTICEIFF-PTSILAVRMN---RKRLVVCLEE-SIYIYDIKDMKLL-  119 (391)
T ss_pred             EEEEeecccc--eeEEEecCCCceEEEEEcccCceEEEEec-CCceEEEEEc---cceEEEEEcc-cEEEEecccceee-
Confidence            4555666643  34443333  3588888888877665432 2357777775   3566666555 4999998743211 


Q ss_pred             ccceeecccccCeEEEEeCCCCCEEEEEe--CCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCc
Q 022074          119 KPAGVLMGHLEGITFIDSRGDGRYLISNG--KDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQS  196 (303)
Q Consensus       119 ~~~~~~~~h~~~v~~~~~~~~~~~l~s~~--~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  196 (303)
                      ..+.....+..++.++++++++.|++--+  .-|.|.+||+-....                                  
T Consensus       120 hTI~t~~~n~~gl~AlS~n~~n~ylAyp~s~t~GdV~l~d~~nl~~----------------------------------  165 (391)
T KOG2110|consen  120 HTIETTPPNPKGLCALSPNNANCYLAYPGSTTSGDVVLFDTINLQP----------------------------------  165 (391)
T ss_pred             hhhhccCCCccceEeeccCCCCceEEecCCCCCceEEEEEccccee----------------------------------
Confidence            11212223556688888888888887432  368999999754322                                  


Q ss_pred             ceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCe-EEEEECCCCeEEEEeecC--CCCeEEEEECCCCCeEEEEeCC
Q 022074          197 VATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSC-VYVYDLVSGEQVAALKYH--TSPVRDCSWHPSQPMLVSSSWD  273 (303)
Q Consensus       197 ~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~-i~iwd~~~~~~~~~~~~h--~~~I~~v~~sp~~~~las~s~D  273 (303)
                      +..+..|.....      ..+|+++|.+||||++.|+ |||+.+.+|+++++|.--  ...|.+++|||+.++|++.|..
T Consensus       166 v~~I~aH~~~lA------alafs~~G~llATASeKGTVIRVf~v~~G~kl~eFRRG~~~~~IySL~Fs~ds~~L~~sS~T  239 (391)
T KOG2110|consen  166 VNTINAHKGPLA------ALAFSPDGTLLATASEKGTVIRVFSVPEGQKLYEFRRGTYPVSIYSLSFSPDSQFLAASSNT  239 (391)
T ss_pred             eeEEEecCCcee------EEEECCCCCEEEEeccCceEEEEEEcCCccEeeeeeCCceeeEEEEEEECCCCCeEEEecCC
Confidence            112222221111      2358899999999999997 899999999999988522  3468899999999999999999


Q ss_pred             CCEEEeecCC
Q 022074          274 GDVVRWEFPG  283 (303)
Q Consensus       274 g~i~~Wd~~~  283 (303)
                      +++++|.+..
T Consensus       240 eTVHiFKL~~  249 (391)
T KOG2110|consen  240 ETVHIFKLEK  249 (391)
T ss_pred             CeEEEEEecc
Confidence            9999999864


No 210
>KOG0974 consensus WD-repeat protein WDR6, WD repeat superfamily [General function prediction only]
Probab=99.32  E-value=1.1e-10  Score=111.33  Aligned_cols=196  Identities=23%  Similarity=0.328  Sum_probs=141.6

Q ss_pred             ceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeE---EEEE-ccCCCcEEEEecCCCeEEEEcCccccC
Q 022074           41 GIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVN---TVCF-GDESGHLIYSGSDDNLCKVWDRRCLNV  116 (303)
Q Consensus        41 ~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~---~l~~-~~~~~~~l~s~s~dg~v~lWd~~~~~~  116 (303)
                      .|....+.-+.+.++..+.++.+.+||...+....++. +...+.   ++-+ ...+.-++++|+--+.+.+|+..   .
T Consensus        89 wi~g~~l~~e~k~i~l~~~~ns~~i~d~~~~~~~~~i~-~~er~~l~~~~~~g~s~~~~~i~~gsv~~~iivW~~~---~  164 (967)
T KOG0974|consen   89 WIFGAKLFEENKKIALVTSRNSLLIRDSKNSSVLSKIQ-SDERCTLYSSLIIGDSAEELYIASGSVFGEIIVWKPH---E  164 (967)
T ss_pred             cccccchhhhcceEEEEEcCceEEEEecccCceehhcC-CCceEEEEeEEEEeccCcEEEEEeccccccEEEEecc---c
Confidence            34445556667788889999999999999887654433 333222   1112 12334578899999999999975   2


Q ss_pred             CCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCc
Q 022074          117 KGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQS  196 (303)
Q Consensus       117 ~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  196 (303)
                      ...+. .+.||.+.+..+.++.+|+++++.|.|+++|+|++...+...                                
T Consensus       165 dn~p~-~l~GHeG~iF~i~~s~dg~~i~s~SdDRsiRlW~i~s~~~~~--------------------------------  211 (967)
T KOG0974|consen  165 DNKPI-RLKGHEGSIFSIVTSLDGRYIASVSDDRSIRLWPIDSREVLG--------------------------------  211 (967)
T ss_pred             cCCcc-eecccCCceEEEEEccCCcEEEEEecCcceeeeecccccccC--------------------------------
Confidence            22333 678999999999999999999999999999999987543211                                


Q ss_pred             ceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCC-CCeEEEEECCCCCeEEEEeCCCC
Q 022074          197 VATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHT-SPVRDCSWHPSQPMLVSSSWDGD  275 (303)
Q Consensus       197 ~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~-~~I~~v~~sp~~~~las~s~Dg~  275 (303)
                       ...-||....  ..+.    |.++  .++|+++|=++++|+. +++.+..+++|. .-|+.++.+++...++|++.|+.
T Consensus       212 -~~~fgHsaRv--w~~~----~~~n--~i~t~gedctcrvW~~-~~~~l~~y~~h~g~~iw~~~~~~~~~~~vT~g~Ds~  281 (967)
T KOG0974|consen  212 -CTGFGHSARV--WACC----FLPN--RIITVGEDCTCRVWGV-NGTQLEVYDEHSGKGIWKIAVPIGVIIKVTGGNDST  281 (967)
T ss_pred             -ccccccccee--EEEE----eccc--eeEEeccceEEEEEec-ccceehhhhhhhhcceeEEEEcCCceEEEeeccCcc
Confidence             0111122111  1222    3344  7999999999999976 456666888886 57999999999999999999999


Q ss_pred             EEEeecCC
Q 022074          276 VVRWEFPG  283 (303)
Q Consensus       276 i~~Wd~~~  283 (303)
                      +++|+..+
T Consensus       282 lk~~~l~~  289 (967)
T KOG0974|consen  282 LKLWDLNG  289 (967)
T ss_pred             hhhhhhhc
Confidence            99999653


No 211
>PRK01742 tolB translocation protein TolB; Provisional
Probab=99.31  E-value=8.5e-11  Score=108.37  Aligned_cols=192  Identities=19%  Similarity=0.222  Sum_probs=122.4

Q ss_pred             cccccccccCcCcccccCCCcccceEEEEEcCCCCEEEEee-CCCeEEEE--ECCCCceEEEEecccCCeEEEEEccCCC
Q 022074           18 LANVTEIHDGLDFSAADDGGYSFGIFSLKFSTDGRELVAGS-SDDCIYVY--DLEANKLSLRILAHTSDVNTVCFGDESG   94 (303)
Q Consensus        18 ~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~s~~g~~l~sgs-~Dg~v~lw--d~~~~~~~~~~~~h~~~v~~l~~~~~~~   94 (303)
                      -+.+|++-+|....-....||.   .+++|+|||+.|+.++ .+|.+.||  |+.++.. .++..+...+....|+|+ +
T Consensus       229 ~i~i~dl~tg~~~~l~~~~g~~---~~~~wSPDG~~La~~~~~~g~~~Iy~~d~~~~~~-~~lt~~~~~~~~~~wSpD-G  303 (429)
T PRK01742        229 QLVVHDLRSGARKVVASFRGHN---GAPAFSPDGSRLAFASSKDGVLNIYVMGANGGTP-SQLTSGAGNNTEPSWSPD-G  303 (429)
T ss_pred             EEEEEeCCCCceEEEecCCCcc---CceeECCCCCEEEEEEecCCcEEEEEEECCCCCe-EeeccCCCCcCCEEECCC-C
Confidence            3455665444321111123443   3689999999888765 67765555  6666653 456666667788999865 5


Q ss_pred             cE-EEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccce
Q 022074           95 HL-IYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSY  173 (303)
Q Consensus        95 ~~-l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~  173 (303)
                      +. ++++..++...+|+....   ......+ ++..  ....++|+|++|+..+.++ +.+||+......          
T Consensus       304 ~~i~f~s~~~g~~~I~~~~~~---~~~~~~l-~~~~--~~~~~SpDG~~ia~~~~~~-i~~~Dl~~g~~~----------  366 (429)
T PRK01742        304 QSILFTSDRSGSPQVYRMSAS---GGGASLV-GGRG--YSAQISADGKTLVMINGDN-VVKQDLTSGSTE----------  366 (429)
T ss_pred             CEEEEEECCCCCceEEEEECC---CCCeEEe-cCCC--CCccCCCCCCEEEEEcCCC-EEEEECCCCCeE----------
Confidence            54 555567888899976421   1112222 3333  3467899999998887765 455887532110          


Q ss_pred             eeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEEC--CCCeEEEEeecC
Q 022074          174 EWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDL--VSGEQVAALKYH  251 (303)
Q Consensus       174 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~--~~~~~~~~~~~h  251 (303)
                                               .+....       ....+.|+|++++|+.++.++.+.+|++  .+|+....+..|
T Consensus       367 -------------------------~lt~~~-------~~~~~~~sPdG~~i~~~s~~g~~~~l~~~~~~G~~~~~l~~~  414 (429)
T PRK01742        367 -------------------------VLSSTF-------LDESPSISPNGIMIIYSSTQGLGKVLQLVSADGRFKARLPGS  414 (429)
T ss_pred             -------------------------EecCCC-------CCCCceECCCCCEEEEEEcCCCceEEEEEECCCCceEEccCC
Confidence                                     000000       0023568899999999999998888875  357777888888


Q ss_pred             CCCeEEEEECCC
Q 022074          252 TSPVRDCSWHPS  263 (303)
Q Consensus       252 ~~~I~~v~~sp~  263 (303)
                      .+.+.+.+|||-
T Consensus       415 ~g~~~~p~wsp~  426 (429)
T PRK01742        415 DGQVKFPAWSPY  426 (429)
T ss_pred             CCCCCCcccCCC
Confidence            889999999984


No 212
>KOG2139 consensus WD40 repeat protein [General function prediction only]
Probab=99.30  E-value=3e-10  Score=97.23  Aligned_cols=192  Identities=14%  Similarity=0.239  Sum_probs=127.3

Q ss_pred             eEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEec-ccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCcc
Q 022074           42 IFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILA-HTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKP  120 (303)
Q Consensus        42 v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~-h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~  120 (303)
                      +..++|++-=..++++..|.+||+||-... ....+.. -...|.+++|-|-..+.|+.|.. +-|.+|.........++
T Consensus       101 lr~~aWhqH~~~fava~nddvVriy~ksst-~pt~Lks~sQrnvtclawRPlsaselavgCr-~gIciW~~s~tln~~r~  178 (445)
T KOG2139|consen  101 LRGVAWHQHIIAFAVATNDDVVRIYDKSST-CPTKLKSVSQRNVTCLAWRPLSASELAVGCR-AGICIWSDSRTLNANRN  178 (445)
T ss_pred             eeeEeechhhhhhhhhccCcEEEEeccCCC-CCceecchhhcceeEEEeccCCcceeeeeec-ceeEEEEcCcccccccc
Confidence            778899986666888999999999988773 3333332 23589999998777777777775 46889976422111121


Q ss_pred             --------ceee--cccccCeEEEEeCCCCCEEEEEe-CCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccc
Q 022074          121 --------AGVL--MGHLEGITFIDSRGDGRYLISNG-KDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDL  189 (303)
Q Consensus       121 --------~~~~--~~h~~~v~~~~~~~~~~~l~s~~-~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  189 (303)
                              ...+  .|| ..|+++.+.+||..+++++ .|..|.|||........-...                     
T Consensus       179 ~~~~s~~~~qvl~~pgh-~pVtsmqwn~dgt~l~tAS~gsssi~iWdpdtg~~~pL~~~---------------------  236 (445)
T KOG2139|consen  179 IRMMSTHHLQVLQDPGH-NPVTSMQWNEDGTILVTASFGSSSIMIWDPDTGQKIPLIPK---------------------  236 (445)
T ss_pred             cccccccchhheeCCCC-ceeeEEEEcCCCCEEeecccCcceEEEEcCCCCCccccccc---------------------
Confidence                    1112  233 5699999999999999988 478999999864321100000                     


Q ss_pred             cCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCC-eEEEEeecCCCCeEEEEECCCCCe-E
Q 022074          190 KHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSG-EQVAALKYHTSPVRDCSWHPSQPM-L  267 (303)
Q Consensus       190 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~-~~~~~~~~h~~~I~~v~~sp~~~~-l  267 (303)
                               ...|          .....+|||+.+|.++.-|+..++|+..+. ....-. .-.+.|....|+|+|++ |
T Consensus       237 ---------glgg----------~slLkwSPdgd~lfaAt~davfrlw~e~q~wt~erw~-lgsgrvqtacWspcGsfLL  296 (445)
T KOG2139|consen  237 ---------GLGG----------FSLLKWSPDGDVLFAATCDAVFRLWQENQSWTKERWI-LGSGRVQTACWSPCGSFLL  296 (445)
T ss_pred             ---------CCCc----------eeeEEEcCCCCEEEEecccceeeeehhcccceeccee-ccCCceeeeeecCCCCEEE
Confidence                     0000          012347899999999999999999965433 233322 23358999999999986 4


Q ss_pred             EEEeCCCCEE
Q 022074          268 VSSSWDGDVV  277 (303)
Q Consensus       268 as~s~Dg~i~  277 (303)
                      .+.+..-.+.
T Consensus       297 f~~sgsp~ly  306 (445)
T KOG2139|consen  297 FACSGSPRLY  306 (445)
T ss_pred             EEEcCCceEE
Confidence            4554444443


No 213
>KOG2110 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=99.28  E-value=3.6e-09  Score=90.83  Aligned_cols=202  Identities=17%  Similarity=0.315  Sum_probs=135.7

Q ss_pred             CcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEe---cccCCeEEEEEccCCCcEEE--EecCCCeEEEEcC
Q 022074           37 GYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRIL---AHTSDVNTVCFGDESGHLIY--SGSDDNLCKVWDR  111 (303)
Q Consensus        37 ~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~---~h~~~v~~l~~~~~~~~~l~--s~s~dg~v~lWd~  111 (303)
                      -+--+|.++.++.  ++|+++-.+. |+|||+++-++..++.   .+..++.++.+++. +.+++  .....|.|.+||+
T Consensus        85 ~fpt~IL~VrmNr--~RLvV~Lee~-IyIydI~~MklLhTI~t~~~n~~gl~AlS~n~~-n~ylAyp~s~t~GdV~l~d~  160 (391)
T KOG2110|consen   85 FFPTSILAVRMNR--KRLVVCLEES-IYIYDIKDMKLLHTIETTPPNPKGLCALSPNNA-NCYLAYPGSTTSGDVVLFDT  160 (391)
T ss_pred             ecCCceEEEEEcc--ceEEEEEccc-EEEEecccceeehhhhccCCCccceEeeccCCC-CceEEecCCCCCceEEEEEc
Confidence            3445788888875  4677776665 9999999988776554   34456777777543 33544  2334789999997


Q ss_pred             ccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCc-EEEEEcccccCCcccccCccceeeeceeeeCCCCCcccc
Q 022074          112 RCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQA-IKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLK  190 (303)
Q Consensus       112 ~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~-v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  190 (303)
                      .    +.++...+..|.+++-+++|+++|.+|||++.-|+ ||++.+.....                            
T Consensus       161 ~----nl~~v~~I~aH~~~lAalafs~~G~llATASeKGTVIRVf~v~~G~k----------------------------  208 (391)
T KOG2110|consen  161 I----NLQPVNTINAHKGPLAALAFSPDGTLLATASEKGTVIRVFSVPEGQK----------------------------  208 (391)
T ss_pred             c----cceeeeEEEecCCceeEEEECCCCCEEEEeccCceEEEEEEcCCccE----------------------------
Confidence            4    44566777889999999999999999999997665 57887643211                            


Q ss_pred             CCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCe---------------------------
Q 022074          191 HPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGE---------------------------  243 (303)
Q Consensus       191 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~---------------------------  243 (303)
                            +..+....    ......+..|++++++|++.|..++|+++.++...                           
T Consensus       209 ------l~eFRRG~----~~~~IySL~Fs~ds~~L~~sS~TeTVHiFKL~~~~~~~~~~p~~~~~~~~~~sk~~~sylps  278 (391)
T KOG2110|consen  209 ------LYEFRRGT----YPVSIYSLSFSPDSQFLAASSNTETVHIFKLEKVSNNPPESPTAGTSWFGKVSKAATSYLPS  278 (391)
T ss_pred             ------eeeeeCCc----eeeEEEEEEECCCCCeEEEecCCCeEEEEEecccccCCCCCCCCCCcccchhhhhhhhhcch
Confidence                  11111111    01122456789999999999999999999875421                           


Q ss_pred             EE----------EEeecCCCCe-EEEEECC--CCCeEEEEeCCCCEEEeecCCC
Q 022074          244 QV----------AALKYHTSPV-RDCSWHP--SQPMLVSSSWDGDVVRWEFPGN  284 (303)
Q Consensus       244 ~~----------~~~~~h~~~I-~~v~~sp--~~~~las~s~Dg~i~~Wd~~~~  284 (303)
                      .+          ...+...... ..+.+.+  ..+.+..++.||.+..+.++..
T Consensus       279 ~V~~~~~~~R~FAt~~l~~s~~~~~~~l~~~~~~~~v~vas~dG~~y~y~l~~~  332 (391)
T KOG2110|consen  279 QVSSVLDQSRKFATAKLPESGRKNICSLSSIQKIPRVLVASYDGHLYSYRLPPK  332 (391)
T ss_pred             hhhhhhhhccceeEEEccCCCccceEEeeccCCCCEEEEEEcCCeEEEEEcCCC
Confidence            00          0001111111 3444553  5678889999999999998764


No 214
>KOG1963 consensus WD40 repeat protein [General function prediction only]
Probab=99.27  E-value=1.4e-10  Score=108.95  Aligned_cols=246  Identities=17%  Similarity=0.222  Sum_probs=135.5

Q ss_pred             CCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCC--cEEEEecCCCeEEEEcCcc
Q 022074           36 GGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESG--HLIYSGSDDNLCKVWDRRC  113 (303)
Q Consensus        36 ~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~--~~l~s~s~dg~v~lWd~~~  113 (303)
                      +|.+.....-.|++|+++++... +.+|.||...++.....+..|...+..+.+.+...  .++.+++.||+|++||.+.
T Consensus        13 gg~n~~~~~avfSnD~k~l~~~~-~~~V~VyS~~Tg~~i~~l~~~~a~l~s~~~~~~~~~~~~~~~~sl~G~I~vwd~~~   91 (792)
T KOG1963|consen   13 GGRNGNKSPAVFSNDAKFLFLCT-GNFVKVYSTATGECITSLEDHTAPLTSVIVLPSSENANYLIVCSLDGTIRVWDWSD   91 (792)
T ss_pred             ccccceecccccccCCcEEEEee-CCEEEEEecchHhhhhhcccccCccceeeecCCCccceEEEEEecCccEEEecCCC
Confidence            45555566677999999888765 45799999999988888889999999998866554  5677999999999999753


Q ss_pred             ccCCCccceeecccccCeEEEEeCC---CCCEEEEEeC-C------------CcEEEEEcccccCCcccccCccceeeec
Q 022074          114 LNVKGKPAGVLMGHLEGITFIDSRG---DGRYLISNGK-D------------QAIKLWDIRKMSSNASCNLGFRSYEWDY  177 (303)
Q Consensus       114 ~~~~~~~~~~~~~h~~~v~~~~~~~---~~~~l~s~~~-D------------~~v~lWdl~~~~~~~~~~~~~~~~~~~~  177 (303)
                      .    ...+++..+ ..+..+.+.+   +-...+..+. |            ++++-+.+.+....   ...+..-.-..
T Consensus        92 ~----~Llkt~~~~-~~v~~~~~~~~~a~~s~~~~~s~~~~~~~~~~s~~~~~q~~~~~~~t~~~~---~~d~~~~~~~~  163 (792)
T KOG1963|consen   92 G----ELLKTFDNN-LPVHALVYKPAQADISANVYVSVEDYSILTTFSKKLSKQSSRFVLATFDSA---KGDFLKEHQEP  163 (792)
T ss_pred             c----EEEEEEecC-CceeEEEechhHhCccceeEeecccceeeeecccccccceeeeEeeecccc---chhhhhhhcCC
Confidence            2    223322211 1111111100   0001111111 1            11111111110000   00000000000


Q ss_pred             eeeeCCCCC--ccccCCCCCcceEEeccc-----ceeeeEEEe----eeeeeeCCCeEEEEEeCCCeEEEEECCC--Ce-
Q 022074          178 RWMDYPPQA--RDLKHPCDQSVATYKGHS-----VLRTLIRCH----FSPVYSTGQKYIYTGSHDSCVYVYDLVS--GE-  243 (303)
Q Consensus       178 ~~~~~~~~~--~~~~~~~~~~~~~~~~~~-----~~~~~~~~~----~~~~~s~~~~~latg~~dg~i~iwd~~~--~~-  243 (303)
                      +.+.+.+.+  ..+.+.|  .+..+.-+.     .......-|    ....+||.++++|+|..||.|.+|.--.  .+ 
T Consensus       164 ~~I~~~~~ge~~~i~~~~--~~~~~~v~~~~~~~~~~~~~~~Htf~~t~~~~spn~~~~Aa~d~dGrI~vw~d~~~~~~~  241 (792)
T KOG1963|consen  164 KSIVDNNSGEFKGIVHMC--KIHIYFVPKHTKHTSSRDITVHHTFNITCVALSPNERYLAAGDSDGRILVWRDFGSSDDS  241 (792)
T ss_pred             ccEEEcCCceEEEEEEee--eEEEEEecccceeeccchhhhhhcccceeEEeccccceEEEeccCCcEEEEecccccccc
Confidence            000000000  0001111  011110000     000000001    1234789999999999999999996432  22 


Q ss_pred             -EEEEeecCCCCeEEEEECCCCCeEEEEeCCCCEEEeecCCC-CccCCCCc
Q 022074          244 -QVAALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFPGN-GEAAPPLN  292 (303)
Q Consensus       244 -~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~~~-~~~~~~~~  292 (303)
                       ....+.-|..+|++++||+||.+|.|||.+|.+.+|....+ +.-.|++.
T Consensus       242 ~t~t~lHWH~~~V~~L~fS~~G~~LlSGG~E~VLv~Wq~~T~~kqfLPRLg  292 (792)
T KOG1963|consen  242 ETCTLLHWHHDEVNSLSFSSDGAYLLSGGREGVLVLWQLETGKKQFLPRLG  292 (792)
T ss_pred             ccceEEEecccccceeEEecCCceEeecccceEEEEEeecCCCcccccccC
Confidence             23456789999999999999999999999999999998765 33355554


No 215
>KOG1538 consensus Uncharacterized conserved protein WDR10, contains WD40 repeats [General function prediction only]
Probab=99.27  E-value=1.8e-10  Score=105.39  Aligned_cols=189  Identities=16%  Similarity=0.331  Sum_probs=125.8

Q ss_pred             CeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccC
Q 022074           83 DVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSS  162 (303)
Q Consensus        83 ~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~  162 (303)
                      .|..++|.|+ +..|+.+.. ..+.+||..    .+.....+.+|.+.|.+++++.+|+.+++|+.|+.|.+|.-... -
T Consensus        14 ci~d~afkPD-GsqL~lAAg-~rlliyD~n----dG~llqtLKgHKDtVycVAys~dGkrFASG~aDK~VI~W~~klE-G   86 (1081)
T KOG1538|consen   14 CINDIAFKPD-GTQLILAAG-SRLLVYDTS----DGTLLQPLKGHKDTVYCVAYAKDGKRFASGSADKSVIIWTSKLE-G   86 (1081)
T ss_pred             chheeEECCC-CceEEEecC-CEEEEEeCC----CcccccccccccceEEEEEEccCCceeccCCCceeEEEeccccc-c
Confidence            7889999765 555544443 378899975    45667788999999999999999999999999999999975421 1


Q ss_pred             CcccccCccceeeeceeeeCCCCCccccCCCCCcceEEeccc-c---eeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEE
Q 022074          163 NASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHS-V---LRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYD  238 (303)
Q Consensus       163 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~---~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd  238 (303)
                      ...+     +..-.+..+.+.|..+.+..+....-.-+...+ .   .+...++..+ .+..||++++.|-.+|+|.+-+
T Consensus        87 ~LkY-----SH~D~IQCMsFNP~~h~LasCsLsdFglWS~~qK~V~K~kss~R~~~C-sWtnDGqylalG~~nGTIsiRN  160 (1081)
T KOG1538|consen   87 ILKY-----SHNDAIQCMSFNPITHQLASCSLSDFGLWSPEQKSVSKHKSSSRIICC-SWTNDGQYLALGMFNGTISIRN  160 (1081)
T ss_pred             eeee-----ccCCeeeEeecCchHHHhhhcchhhccccChhhhhHHhhhhheeEEEe-eecCCCcEEEEeccCceEEeec
Confidence            1111     112223445565655555443322111111100 0   0111222211 2557899999999999999986


Q ss_pred             CCCCeEEEEe---ecCCCCeEEEEECCCC-----CeEEEEeCCCCEEEeecCCCC
Q 022074          239 LVSGEQVAAL---KYHTSPVRDCSWHPSQ-----PMLVSSSWDGDVVRWEFPGNG  285 (303)
Q Consensus       239 ~~~~~~~~~~---~~h~~~I~~v~~sp~~-----~~las~s~Dg~i~~Wd~~~~~  285 (303)
                      . ++++-..+   .+...||++++|+|..     ..++..++..++.++.+.+..
T Consensus       161 k-~gEek~~I~Rpgg~Nspiwsi~~~p~sg~G~~di~aV~DW~qTLSFy~LsG~~  214 (1081)
T KOG1538|consen  161 K-NGEEKVKIERPGGSNSPIWSICWNPSSGEGRNDILAVADWGQTLSFYQLSGKQ  214 (1081)
T ss_pred             C-CCCcceEEeCCCCCCCCceEEEecCCCCCCccceEEEEeccceeEEEEeccee
Confidence            4 45543333   3577899999999963     389999999999999987653


No 216
>KOG1310 consensus WD40 repeat protein [General function prediction only]
Probab=99.25  E-value=1.5e-10  Score=103.74  Aligned_cols=216  Identities=21%  Similarity=0.303  Sum_probs=138.1

Q ss_pred             cCchhhccccccccccCcCcccccCCCcccceEEEEEcC--CCCEEEEeeCCCeEEEEECCCCc----------eEEEEe
Q 022074           11 GSGTMESLANVTEIHDGLDFSAADDGGYSFGIFSLKFST--DGRELVAGSSDDCIYVYDLEANK----------LSLRIL   78 (303)
Q Consensus        11 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~s~--~g~~l~sgs~Dg~v~lwd~~~~~----------~~~~~~   78 (303)
                      -|||=|.-+.||+-..-+ .--...+||...|+++.|-|  +.+.+++|..|..|+|||+...+          ...-+.
T Consensus        66 ~SGSDD~r~ivWd~~~~K-llhsI~TgHtaNIFsvKFvP~tnnriv~sgAgDk~i~lfdl~~~~~~~~d~~~~~~~~~~~  144 (758)
T KOG1310|consen   66 ASGSDDTRLIVWDPFEYK-LLHSISTGHTANIFSVKFVPYTNNRIVLSGAGDKLIKLFDLDSSKEGGMDHGMEETTRCWS  144 (758)
T ss_pred             eecCCcceEEeecchhcc-eeeeeecccccceeEEeeeccCCCeEEEeccCcceEEEEecccccccccccCccchhhhhh
Confidence            367778888999987433 33456699999999999998  46689999999999999998522          112244


Q ss_pred             cccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCccce-------eecccccCeEEEEeCCCC-CEEEEEeCCC
Q 022074           79 AHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAG-------VLMGHLEGITFIDSRGDG-RYLISNGKDQ  150 (303)
Q Consensus        79 ~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~-------~~~~h~~~v~~~~~~~~~-~~l~s~~~D~  150 (303)
                      -|...|..++..|..++.|.++++||+++-+|+|... ...+..       .+....-...++.++|.. .+|+.|+.|-
T Consensus       145 cht~rVKria~~p~~PhtfwsasEDGtirQyDiREph-~c~p~~~~~~~l~ny~~~lielk~ltisp~rp~~laVGgsdp  223 (758)
T KOG1310|consen  145 CHTDRVKRIATAPNGPHTFWSASEDGTIRQYDIREPH-VCNPDEDCPSILVNYNPQLIELKCLTISPSRPYYLAVGGSDP  223 (758)
T ss_pred             hhhhhhhheecCCCCCceEEEecCCcceeeecccCCc-cCCccccccHHHHHhchhhheeeeeeecCCCCceEEecCCCc
Confidence            5888888888877777999999999999999998421 111111       111112235577888754 6789999999


Q ss_pred             cEEEEEcccccCCc-ccccCccceeeeceeeeCCCCCccccCCCCCcceEEe-cc-----cceeeeEEEeeeeeeeCCCe
Q 022074          151 AIKLWDIRKMSSNA-SCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYK-GH-----SVLRTLIRCHFSPVYSTGQK  223 (303)
Q Consensus       151 ~v~lWdl~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~-----~~~~~~~~~~~~~~~s~~~~  223 (303)
                      -.|+||.|+..... +...           +..+|..      ..+++.-+. +|     ........|..-..|+|+|.
T Consensus       224 farLYD~Rr~lks~~s~~~-----------~~~~pp~------~~~cv~yf~p~hlkn~~gn~~~~~~~~t~vtfnpNGt  286 (758)
T KOG1310|consen  224 FARLYDRRRVLKSFRSDGT-----------MNTCPPK------DCRCVRYFSPGHLKNSQGNLDRYITCCTYVTFNPNGT  286 (758)
T ss_pred             hhhhhhhhhhccCCCCCcc-----------ccCCCCc------ccchhheecCccccCcccccccceeeeEEEEECCCCc
Confidence            99999976533211 1000           0011100      000111110 11     01112233333445889988


Q ss_pred             EEEEEeCCCeEEEEECCCCeEE
Q 022074          224 YIYTGSHDSCVYVYDLVSGEQV  245 (303)
Q Consensus       224 ~latg~~dg~i~iwd~~~~~~~  245 (303)
                      .|+..-....|+++|+..++..
T Consensus       287 ElLvs~~gEhVYlfdvn~~~~~  308 (758)
T KOG1310|consen  287 ELLVSWGGEHVYLFDVNEDKSP  308 (758)
T ss_pred             EEEEeeCCeEEEEEeecCCCCc
Confidence            7777666678999999877654


No 217
>KOG1523 consensus Actin-related protein Arp2/3 complex, subunit ARPC1/p41-ARC [Cytoskeleton]
Probab=99.24  E-value=1.1e-09  Score=92.43  Aligned_cols=201  Identities=15%  Similarity=0.136  Sum_probs=132.4

Q ss_pred             CCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCce---EEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCc
Q 022074           36 GGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKL---SLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRR  112 (303)
Q Consensus        36 ~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~---~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~  112 (303)
                      .-|...|.+|+|+|..+.|++|+.|...+||....+..   ...+..++...+++.|+| +.+.|++||.-..|.+|=.+
T Consensus        52 s~Hd~~vtgvdWap~snrIvtcs~drnayVw~~~~~~~WkptlvLlRiNrAAt~V~WsP-~enkFAVgSgar~isVcy~E  130 (361)
T KOG1523|consen   52 SEHDKIVTGVDWAPKSNRIVTCSHDRNAYVWTQPSGGTWKPTLVLLRINRAATCVKWSP-KENKFAVGSGARLISVCYYE  130 (361)
T ss_pred             hhhCcceeEEeecCCCCceeEccCCCCccccccCCCCeeccceeEEEeccceeeEeecC-cCceEEeccCccEEEEEEEe
Confidence            45667999999999999999999999999999954432   244678999999999976 57899999999999888543


Q ss_pred             cccCCC---ccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccc
Q 022074          113 CLNVKG---KPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDL  189 (303)
Q Consensus       113 ~~~~~~---~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  189 (303)
                      ..+ ..   +.++  .-+...|+++++++++-+|+.|+.|+.+|++..--.           .++.  +. .-+|-...+
T Consensus       131 ~EN-dWWVsKhik--kPirStv~sldWhpnnVLlaaGs~D~k~rVfSayIK-----------~Vde--kp-ap~pWgsk~  193 (361)
T KOG1523|consen  131 QEN-DWWVSKHIK--KPIRSTVTSLDWHPNNVLLAAGSTDGKCRVFSAYIK-----------GVDE--KP-APTPWGSKM  193 (361)
T ss_pred             ccc-ceehhhhhC--CccccceeeeeccCCcceecccccCcceeEEEEeee-----------cccc--CC-CCCCCccCC
Confidence            211 11   1111  125678999999999999999999999999963110           0000  00 000001111


Q ss_pred             cCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCe-EEEEeecCCCCeEEEEECC
Q 022074          190 KHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGE-QVAALKYHTSPVRDCSWHP  262 (303)
Q Consensus       190 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~-~~~~~~~h~~~I~~v~~sp  262 (303)
                      ..  .+.+..+....      .......|+++|..|+=.+.|..+.+=|..... .+..+..-.-|..++.|-.
T Consensus       194 PF--G~lm~E~~~~g------gwvh~v~fs~sG~~lawv~Hds~v~~~da~~p~~~v~~~~~~~lP~ls~~~is  259 (361)
T KOG1523|consen  194 PF--GQLMSEASSSG------GWVHGVLFSPSGNRLAWVGHDSTVSFVDAAGPSERVQSVATAQLPLLSVSWIS  259 (361)
T ss_pred             cH--HHHHHhhccCC------CceeeeEeCCCCCEeeEecCCCceEEeecCCCchhccchhhccCCceeeEeec
Confidence            00  11111111000      111223478889999999999999999987654 3444444447888888844


No 218
>KOG2321 consensus WD40 repeat protein [General function prediction only]
Probab=99.23  E-value=4.6e-10  Score=101.18  Aligned_cols=200  Identities=18%  Similarity=0.317  Sum_probs=137.7

Q ss_pred             EEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCcc--ce
Q 022074           45 LKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKP--AG  122 (303)
Q Consensus        45 l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~--~~  122 (303)
                      +.++.-.-.|++++....|+-++++.|+....+....++++++..++. ..+|++|+.+|.|-.||.|+..-.+..  ..
T Consensus       139 m~y~~~scDly~~gsg~evYRlNLEqGrfL~P~~~~~~~lN~v~in~~-hgLla~Gt~~g~VEfwDpR~ksrv~~l~~~~  217 (703)
T KOG2321|consen  139 MKYHKPSCDLYLVGSGSEVYRLNLEQGRFLNPFETDSGELNVVSINEE-HGLLACGTEDGVVEFWDPRDKSRVGTLDAAS  217 (703)
T ss_pred             ccccCCCccEEEeecCcceEEEEccccccccccccccccceeeeecCc-cceEEecccCceEEEecchhhhhheeeeccc
Confidence            455532333555555556999999999988777777789999999754 558889999999999998743211110  00


Q ss_pred             eecccc-----cCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCcc
Q 022074          123 VLMGHL-----EGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSV  197 (303)
Q Consensus       123 ~~~~h~-----~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  197 (303)
                      .+..|.     ..|+++.|+.+|-.++.|..+|.+.|||||...+.....++.                           
T Consensus       218 ~v~s~pg~~~~~svTal~F~d~gL~~aVGts~G~v~iyDLRa~~pl~~kdh~~---------------------------  270 (703)
T KOG2321|consen  218 SVNSHPGGDAAPSVTALKFRDDGLHVAVGTSTGSVLIYDLRASKPLLVKDHGY---------------------------  270 (703)
T ss_pred             ccCCCccccccCcceEEEecCCceeEEeeccCCcEEEEEcccCCceeecccCC---------------------------
Confidence            111232     349999999999999999999999999999755432222111                           


Q ss_pred             eEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEECCCCCeEEEEeCCCCEE
Q 022074          198 ATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQPMLVSSSWDGDVV  277 (303)
Q Consensus       198 ~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~  277 (303)
                              ...+....|.+.  .++..+++. ....++|||-.+|+....++ .+..++++++-|++.++++|-++..+.
T Consensus       271 --------e~pi~~l~~~~~--~~q~~v~S~-Dk~~~kiWd~~~Gk~~asiE-pt~~lND~C~~p~sGm~f~Ane~~~m~  338 (703)
T KOG2321|consen  271 --------ELPIKKLDWQDT--DQQNKVVSM-DKRILKIWDECTGKPMASIE-PTSDLNDFCFVPGSGMFFTANESSKMH  338 (703)
T ss_pred             --------ccceeeeccccc--CCCceEEec-chHHhhhcccccCCceeecc-ccCCcCceeeecCCceEEEecCCCcce
Confidence                    001111122211  223344443 45679999999999887776 445699999999999999999999998


Q ss_pred             EeecCCC
Q 022074          278 RWEFPGN  284 (303)
Q Consensus       278 ~Wd~~~~  284 (303)
                      -+-++..
T Consensus       339 ~yyiP~L  345 (703)
T KOG2321|consen  339 TYYIPSL  345 (703)
T ss_pred             eEEcccc
Confidence            8877654


No 219
>KOG1240 consensus Protein kinase containing WD40 repeats [Signal transduction mechanisms]
Probab=99.22  E-value=1.9e-09  Score=104.63  Aligned_cols=209  Identities=18%  Similarity=0.223  Sum_probs=134.9

Q ss_pred             CcccceEEEEEcCC-CCEEEEeeCCCeEEEEECCCC--c-----eEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEE
Q 022074           37 GYSFGIFSLKFSTD-GRELVAGSSDDCIYVYDLEAN--K-----LSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKV  108 (303)
Q Consensus        37 ~~~~~v~~l~~s~~-g~~l~sgs~Dg~v~lwd~~~~--~-----~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~l  108 (303)
                      =|+..|..++.++. +.++++||.||+|++|++..-  .     ...++......+.++... .+++.++.++.||.|++
T Consensus      1046 Ehs~~v~k~a~s~~~~s~FvsgS~DGtVKvW~~~k~~~~~~s~rS~ltys~~~sr~~~vt~~-~~~~~~Av~t~DG~v~~ 1124 (1431)
T KOG1240|consen 1046 EHSSAVIKLAVSSEHTSLFVSGSDDGTVKVWNLRKLEGEGGSARSELTYSPEGSRVEKVTMC-GNGDQFAVSTKDGSVRV 1124 (1431)
T ss_pred             hccccccceeecCCCCceEEEecCCceEEEeeehhhhcCcceeeeeEEEeccCCceEEEEec-cCCCeEEEEcCCCeEEE
Confidence            47778888888865 489999999999999988541  1     112333345677777774 46789999999999999


Q ss_pred             EcCccccCCC---ccceeeccccc-CeEEE-EeCC-CC-CEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeee
Q 022074          109 WDRRCLNVKG---KPAGVLMGHLE-GITFI-DSRG-DG-RYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMD  181 (303)
Q Consensus       109 Wd~~~~~~~~---~~~~~~~~h~~-~v~~~-~~~~-~~-~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~  181 (303)
                      .++...+...   ...+....+.+ .+..+ ++.. .+ ..++-+..-+.+-.||+|........+...           
T Consensus      1125 ~~id~~~~~~~~~~~~ri~n~~~~g~vv~m~a~~~~~~S~~lvy~T~~~~iv~~D~r~~~~~w~lk~~~----------- 1193 (1431)
T KOG1240|consen 1125 LRIDHYNVSKRVATQVRIPNLKKDGVVVSMHAFTAIVQSHVLVYATDLSRIVSWDTRMRHDAWRLKNQL----------- 1193 (1431)
T ss_pred             EEccccccccceeeeeecccccCCCceEEeecccccccceeEEEEEeccceEEecchhhhhHHhhhcCc-----------
Confidence            9876432211   11112222332 23332 2322 22 367788888999999998753221111100           


Q ss_pred             CCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEee-cCCCCeEEEEE
Q 022074          182 YPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALK-YHTSPVRDCSW  260 (303)
Q Consensus       182 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~-~h~~~I~~v~~  260 (303)
                                          .|...       .+.+.++.+.++++|...|.+.+||++=+.++...+ .+..+|+.+..
T Consensus      1194 --------------------~hG~v-------TSi~idp~~~WlviGts~G~l~lWDLRF~~~i~sw~~P~~~~i~~v~~ 1246 (1431)
T KOG1240|consen 1194 --------------------RHGLV-------TSIVIDPWCNWLVIGTSRGQLVLWDLRFRVPILSWEHPARAPIRHVWL 1246 (1431)
T ss_pred             --------------------cccce-------eEEEecCCceEEEEecCCceEEEEEeecCceeecccCcccCCcceEEe
Confidence                                00100       122345678899999999999999999887776654 34578888887


Q ss_pred             CCCCC---eEE-EEe-CCCCEEEeecCCC
Q 022074          261 HPSQP---MLV-SSS-WDGDVVRWEFPGN  284 (303)
Q Consensus       261 sp~~~---~la-s~s-~Dg~i~~Wd~~~~  284 (303)
                      +|.-+   ..+ +++ ..+.+.+|++...
T Consensus      1247 ~~~~~~~S~~vs~~~~~~nevs~wn~~~g 1275 (1431)
T KOG1240|consen 1247 CPTYPQESVSVSAGSSSNNEVSTWNMETG 1275 (1431)
T ss_pred             eccCCCCceEEEecccCCCceeeeecccC
Confidence            77644   444 444 5889999997543


No 220
>TIGR02800 propeller_TolB tol-pal system beta propeller repeat protein TolB. The Tol-Pal system is required for bacterial outer membrane integrity. E. coli TolB is involved in the tonB-independent uptake of group A colicins (colicins A, E1, E2, E3 and K), and is necessary for the colicins to reach their respective targets after initial binding to the bacteria. It is also involved in uptake of filamentous DNA. Study of its structure suggests that the TolB protein might be involved in the recycling of peptidoglycan or in its covalent linking with lipoproteins. The Tol-Pal system is also implicated in pathogenesis of E. coli, Haemophilus ducreyi, Salmonella enterica and Vibrio cholerae, but the mechanism(s) is unclear.
Probab=99.20  E-value=4.3e-09  Score=96.69  Aligned_cols=190  Identities=20%  Similarity=0.146  Sum_probs=117.4

Q ss_pred             CcccceEEEEEcCCCCEEEEeeCC---CeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCC--eEEEEcC
Q 022074           37 GYSFGIFSLKFSTDGRELVAGSSD---DCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDN--LCKVWDR  111 (303)
Q Consensus        37 ~~~~~v~~l~~s~~g~~l~sgs~D---g~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg--~v~lWd~  111 (303)
                      .+...+.+..|+|||++|+..+.+   ..|++||+.+++.. .+..+.+.+...+|+|+...++++.+.++  .|++||+
T Consensus       187 ~~~~~~~~p~~Spdg~~la~~~~~~~~~~i~v~d~~~g~~~-~~~~~~~~~~~~~~spDg~~l~~~~~~~~~~~i~~~d~  265 (417)
T TIGR02800       187 RSREPILSPAWSPDGQKLAYVSFESGKPEIYVQDLATGQRE-KVASFPGMNGAPAFSPDGSKLAVSLSKDGNPDIYVMDL  265 (417)
T ss_pred             cCCCceecccCCCCCCEEEEEEcCCCCcEEEEEECCCCCEE-EeecCCCCccceEECCCCCEEEEEECCCCCccEEEEEC
Confidence            455578899999999999887654   47999999988643 34445566677889765333445655555  5778886


Q ss_pred             ccccCCCccceeecccccCeEEEEeCCCCCEEEEEeC-CCc--EEEEEcccccCCcccccCccceeeeceeeeCCCCCcc
Q 022074          112 RCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGK-DQA--IKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARD  188 (303)
Q Consensus       112 ~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~-D~~--v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  188 (303)
                      ...     ....+..+........|+++++.|+..+. ++.  |.++|+.....                          
T Consensus       266 ~~~-----~~~~l~~~~~~~~~~~~s~dg~~l~~~s~~~g~~~iy~~d~~~~~~--------------------------  314 (417)
T TIGR02800       266 DGK-----QLTRLTNGPGIDTEPSWSPDGKSIAFTSDRGGSPQIYMMDADGGEV--------------------------  314 (417)
T ss_pred             CCC-----CEEECCCCCCCCCCEEECCCCCEEEEEECCCCCceEEEEECCCCCE--------------------------
Confidence            421     12233334333445578899988876553 443  44455432100                          


Q ss_pred             ccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCC---eEEEEECCCCeEEEEeecCCCCeEEEEECCCCC
Q 022074          189 LKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDS---CVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQP  265 (303)
Q Consensus       189 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg---~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~  265 (303)
                               ..+..+.      .....+.++|++++++..+.++   .|.+||+.++.. ..+... .......|+||++
T Consensus       315 ---------~~l~~~~------~~~~~~~~spdg~~i~~~~~~~~~~~i~~~d~~~~~~-~~l~~~-~~~~~p~~spdg~  377 (417)
T TIGR02800       315 ---------RRLTFRG------GYNASPSWSPDGDLIAFVHREGGGFNIAVMDLDGGGE-RVLTDT-GLDESPSFAPNGR  377 (417)
T ss_pred             ---------EEeecCC------CCccCeEECCCCCEEEEEEccCCceEEEEEeCCCCCe-EEccCC-CCCCCceECCCCC
Confidence                     0000000      0112345788999988888776   899999987654 233222 2345668999999


Q ss_pred             eEEEEeCCCC
Q 022074          266 MLVSSSWDGD  275 (303)
Q Consensus       266 ~las~s~Dg~  275 (303)
                      +|+.++.++.
T Consensus       378 ~l~~~~~~~~  387 (417)
T TIGR02800       378 MILYATTRGG  387 (417)
T ss_pred             EEEEEEeCCC
Confidence            8877777653


No 221
>KOG0974 consensus WD-repeat protein WDR6, WD repeat superfamily [General function prediction only]
Probab=99.20  E-value=1.2e-10  Score=111.08  Aligned_cols=131  Identities=26%  Similarity=0.353  Sum_probs=107.4

Q ss_pred             ccccccccCcCcccccCCCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEE-EEecccCCeEEEEEccCCCcEE
Q 022074           19 ANVTEIHDGLDFSAADDGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSL-RILAHTSDVNTVCFGDESGHLI   97 (303)
Q Consensus        19 ~~~~~~~~~~~~~~~~~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~-~~~~h~~~v~~l~~~~~~~~~l   97 (303)
                      +=||.-. +++. +.-..||+..|+++.|+.||.++++.|.|.++|+|+++++.... ...+|...|..+++.+ +  .+
T Consensus       157 iivW~~~-~dn~-p~~l~GHeG~iF~i~~s~dg~~i~s~SdDRsiRlW~i~s~~~~~~~~fgHsaRvw~~~~~~-n--~i  231 (967)
T KOG0974|consen  157 IIVWKPH-EDNK-PIRLKGHEGSIFSIVTSLDGRYIASVSDDRSIRLWPIDSREVLGCTGFGHSARVWACCFLP-N--RI  231 (967)
T ss_pred             EEEEecc-ccCC-cceecccCCceEEEEEccCCcEEEEEecCcceeeeecccccccCcccccccceeEEEEecc-c--ee
Confidence            3466665 2222 23347999999999999999999999999999999999987554 6789999999999964 3  89


Q ss_pred             EEecCCCeEEEEcCccccCCCccceeeccccc-CeEEEEeCCCCCEEEEEeCCCcEEEEEccc
Q 022074           98 YSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLE-GITFIDSRGDGRYLISNGKDQAIKLWDIRK  159 (303)
Q Consensus        98 ~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~-~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~  159 (303)
                      +|++.|.+.++|+.     ++.....+.+|.. .+..++...+...++|++.|+.+++||+..
T Consensus       232 ~t~gedctcrvW~~-----~~~~l~~y~~h~g~~iw~~~~~~~~~~~vT~g~Ds~lk~~~l~~  289 (967)
T KOG0974|consen  232 ITVGEDCTCRVWGV-----NGTQLEVYDEHSGKGIWKIAVPIGVIIKVTGGNDSTLKLWDLNG  289 (967)
T ss_pred             EEeccceEEEEEec-----ccceehhhhhhhhcceeEEEEcCCceEEEeeccCcchhhhhhhc
Confidence            99999999999964     3344447778875 589999998888999999999999999753


No 222
>KOG0280 consensus Uncharacterized conserved protein [Amino acid transport and metabolism]
Probab=99.20  E-value=1e-09  Score=91.54  Aligned_cols=117  Identities=26%  Similarity=0.307  Sum_probs=93.0

Q ss_pred             eEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEE--EEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCc
Q 022074           42 IFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSL--RILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGK  119 (303)
Q Consensus        42 v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~--~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~  119 (303)
                      -.++.|++.|..++++-.+|.+.+-+.....+..  .+..|.-......|+..+++++.+|+.|+.+.-||+|..+  ..
T Consensus       124 ~lslD~~~~~~~i~vs~s~G~~~~v~~t~~~le~vq~wk~He~E~Wta~f~~~~pnlvytGgDD~~l~~~D~R~p~--~~  201 (339)
T KOG0280|consen  124 ALSLDISTSGTKIFVSDSRGSISGVYETEMVLEKVQTWKVHEFEAWTAKFSDKEPNLVYTGGDDGSLSCWDIRIPK--TF  201 (339)
T ss_pred             eeEEEeeccCceEEEEcCCCcEEEEecceeeeeecccccccceeeeeeecccCCCceEEecCCCceEEEEEecCCc--ce
Confidence            4578888889999999999999866665555544  6788999999999987788999999999999999998321  11


Q ss_pred             cceeecccccCeEEEEeCC-CCCEEEEEeCCCcEEEEEcccc
Q 022074          120 PAGVLMGHLEGITFIDSRG-DGRYLISNGKDQAIKLWDIRKM  160 (303)
Q Consensus       120 ~~~~~~~h~~~v~~~~~~~-~~~~l~s~~~D~~v~lWdl~~~  160 (303)
                      ....-.-|..+|.++..+| .+.+++||+.|-.|++||+|.+
T Consensus       202 i~~n~kvH~~GV~SI~ss~~~~~~I~TGsYDe~i~~~DtRnm  243 (339)
T KOG0280|consen  202 IWHNSKVHTSGVVSIYSSPPKPTYIATGSYDECIRVLDTRNM  243 (339)
T ss_pred             eeecceeeecceEEEecCCCCCceEEEeccccceeeeehhcc
Confidence            1122234888999987665 5789999999999999999964


No 223
>KOG0322 consensus G-protein beta subunit-like protein GNB1L, contains WD repeats [General function prediction only]
Probab=99.18  E-value=7e-11  Score=97.09  Aligned_cols=117  Identities=22%  Similarity=0.379  Sum_probs=96.9

Q ss_pred             CCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCC--ceE--EEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcC
Q 022074           36 GGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEAN--KLS--LRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDR  111 (303)
Q Consensus        36 ~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~--~~~--~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~  111 (303)
                      .-|+++|.++.|.+.-..=++|+.+..+..|+++..  .+.  ....-.+.+|..+.+- ++++.++|+++|+.+|+|..
T Consensus       202 ash~qpvlsldyas~~~rGisgga~dkl~~~Sl~~s~gslq~~~e~~lknpGv~gvrIR-pD~KIlATAGWD~RiRVysw  280 (323)
T KOG0322|consen  202 ASHKQPVLSLDYASSCDRGISGGADDKLVMYSLNHSTGSLQIRKEITLKNPGVSGVRIR-PDGKILATAGWDHRIRVYSW  280 (323)
T ss_pred             hhccCcceeeeechhhcCCcCCCccccceeeeeccccCcccccceEEecCCCccceEEc-cCCcEEeecccCCcEEEEEe
Confidence            579999999999987667788888888999988654  221  1222344678888884 56899999999999999987


Q ss_pred             ccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEc
Q 022074          112 RCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDI  157 (303)
Q Consensus       112 ~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl  157 (303)
                      +    +..+...+.-|.++|++++|+|+.++++.+|.|+.|.+|++
T Consensus       281 r----tl~pLAVLkyHsagvn~vAfspd~~lmAaaskD~rISLWkL  322 (323)
T KOG0322|consen  281 R----TLNPLAVLKYHSAGVNAVAFSPDCELMAAASKDARISLWKL  322 (323)
T ss_pred             c----cCCchhhhhhhhcceeEEEeCCCCchhhhccCCceEEeeec
Confidence            6    45677788889999999999999999999999999999986


No 224
>KOG1354 consensus Serine/threonine protein phosphatase 2A, regulatory subunit [Signal transduction mechanisms]
Probab=99.18  E-value=8.1e-10  Score=94.05  Aligned_cols=208  Identities=18%  Similarity=0.294  Sum_probs=130.2

Q ss_pred             eEEEEEcCCCCEEEEeeCCCeEEEEECCCCc-----eEEEEeccc------------CCeEEEEEccC-CCcEEEEecCC
Q 022074           42 IFSLKFSTDGRELVAGSSDDCIYVYDLEANK-----LSLRILAHT------------SDVNTVCFGDE-SGHLIYSGSDD  103 (303)
Q Consensus        42 v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~-----~~~~~~~h~------------~~v~~l~~~~~-~~~~l~s~s~d  103 (303)
                      |.++.|...|++|++|..+|.|-+|.-....     ....++.|.            ..|+.+.|.++ +...|+....|
T Consensus        28 is~vef~~~Ge~LatGdkgGRVv~f~r~~~~~~ey~~~t~fqshepEFDYLkSleieEKinkIrw~~~~n~a~FLlstNd  107 (433)
T KOG1354|consen   28 ISAVEFDHYGERLATGDKGGRVVLFEREKLYKGEYNFQTEFQSHEPEFDYLKSLEIEEKINKIRWLDDGNLAEFLLSTND  107 (433)
T ss_pred             eeeEEeecccceEeecCCCCeEEEeecccccccceeeeeeeeccCcccchhhhhhhhhhhhhceecCCCCccEEEEecCC
Confidence            7889999999999999999999999654322     222344443            36788888654 35577788889


Q ss_pred             CeEEEEcCccccCCC-------------------------------ccceee-cccccCeEEEEeCCCCCEEEEEeCCCc
Q 022074          104 NLCKVWDRRCLNVKG-------------------------------KPAGVL-MGHLEGITFIDSRGDGRYLISNGKDQA  151 (303)
Q Consensus       104 g~v~lWd~~~~~~~~-------------------------------~~~~~~-~~h~~~v~~~~~~~~~~~l~s~~~D~~  151 (303)
                      .++++|.++......                               .+.+.+ ..|.--+++++++.|++.++++ .|=.
T Consensus       108 ktiKlWKi~er~~k~~~~~~~~~~~~~~~~~lr~p~~~~~~~~vea~prRv~aNaHtyhiNSIS~NsD~Et~lSA-DdLR  186 (433)
T KOG1354|consen  108 KTIKLWKIRERGSKKEGYNLPEEGPPGTITSLRLPVEGRHDLEVEASPRRVYANAHTYHINSISVNSDKETFLSA-DDLR  186 (433)
T ss_pred             cceeeeeeeccccccccccccccCCCCccceeeceeeccccceeeeeeeeeccccceeEeeeeeecCccceEeec-ccee
Confidence            999999864211110                               000111 2356668889999998878776 5788


Q ss_pred             EEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeC-CCeEEEEEeC
Q 022074          152 IKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYST-GQKYIYTGSH  230 (303)
Q Consensus       152 v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~-~~~~latg~~  230 (303)
                      |.+|++......+..             .+.-|.       .   +..+   ..+++      +..|+| ....++-.++
T Consensus       187 INLWnlei~d~sFnI-------------VDIKP~-------n---mEeL---teVIT------saEFhp~~cn~f~YSSS  234 (433)
T KOG1354|consen  187 INLWNLEIIDQSFNI-------------VDIKPA-------N---MEEL---TEVIT------SAEFHPHHCNVFVYSSS  234 (433)
T ss_pred             eeeccccccCCceeE-------------EEcccc-------C---HHHH---HHHHh------hhccCHhHccEEEEecC
Confidence            999998643221110             000000       0   0000   00011      112333 2456777788


Q ss_pred             CCeEEEEECCCCeEE----EEeecC------------CCCeEEEEECCCCCeEEEEeCCCCEEEeecCC
Q 022074          231 DSCVYVYDLVSGEQV----AALKYH------------TSPVRDCSWHPSQPMLVSSSWDGDVVRWEFPG  283 (303)
Q Consensus       231 dg~i~iwd~~~~~~~----~~~~~h------------~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~~  283 (303)
                      .|+|+++|++..-+.    +.++..            -..|.++.||+.|++++|-+. -++++||+..
T Consensus       235 KGtIrLcDmR~~aLCd~hsKlfEepedp~~rsffseiIsSISDvKFs~sGryilsRDy-ltvk~wD~nm  302 (433)
T KOG1354|consen  235 KGTIRLCDMRQSALCDAHSKLFEEPEDPSSRSFFSEIISSISDVKFSHSGRYILSRDY-LTVKLWDLNM  302 (433)
T ss_pred             CCcEEEeechhhhhhcchhhhhccccCCcchhhHHHHhhhhhceEEccCCcEEEEecc-ceeEEEeccc
Confidence            999999999853211    111111            146899999999999999864 7999999863


No 225
>KOG2321 consensus WD40 repeat protein [General function prediction only]
Probab=99.18  E-value=1.7e-09  Score=97.51  Aligned_cols=210  Identities=19%  Similarity=0.248  Sum_probs=131.7

Q ss_pred             CCcccceEEEEEcCCCCEE-EEeeCCCeEEEEECCCCceEEEEecccC--------------------------------
Q 022074           36 GGYSFGIFSLKFSTDGREL-VAGSSDDCIYVYDLEANKLSLRILAHTS--------------------------------   82 (303)
Q Consensus        36 ~~~~~~v~~l~~s~~g~~l-~sgs~Dg~v~lwd~~~~~~~~~~~~h~~--------------------------------   82 (303)
                      -+|...-..|..+|||+++ ++|...-.|++||+..-.  .++..|.+                                
T Consensus        48 fe~p~ast~ik~s~DGqY~lAtG~YKP~ikvydlanLS--LKFERhlDae~V~feiLsDD~SK~v~L~~DR~IefHak~G  125 (703)
T KOG2321|consen   48 FEMPTASTRIKVSPDGQYLLATGTYKPQIKVYDLANLS--LKFERHLDAEVVDFEILSDDYSKSVFLQNDRTIEFHAKYG  125 (703)
T ss_pred             cCCccccceeEecCCCcEEEEecccCCceEEEEcccce--eeeeecccccceeEEEeccchhhheEeecCceeeehhhcC
Confidence            4566677889999999985 556678899999996532  22222211                                


Q ss_pred             ---------CeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEE
Q 022074           83 ---------DVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIK  153 (303)
Q Consensus        83 ---------~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~  153 (303)
                               ....++++.++-++++.|+ ...|+-.++.    .++-...+..-.+.++++.+++...+|++|+.||.|-
T Consensus       126 ~hy~~RIP~~GRDm~y~~~scDly~~gs-g~evYRlNLE----qGrfL~P~~~~~~~lN~v~in~~hgLla~Gt~~g~VE  200 (703)
T KOG2321|consen  126 RHYRTRIPKFGRDMKYHKPSCDLYLVGS-GSEVYRLNLE----QGRFLNPFETDSGELNVVSINEEHGLLACGTEDGVVE  200 (703)
T ss_pred             eeeeeecCcCCccccccCCCccEEEeec-CcceEEEEcc----ccccccccccccccceeeeecCccceEEecccCceEE
Confidence                     1123333333333443333 3345444543    2333444444557899999999888999999999999


Q ss_pred             EEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCe
Q 022074          154 LWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSC  233 (303)
Q Consensus       154 lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~  233 (303)
                      .||.|..........+.              .           +....+..    ......+..|+.+|-.+++|..+|.
T Consensus       201 fwDpR~ksrv~~l~~~~--------------~-----------v~s~pg~~----~~~svTal~F~d~gL~~aVGts~G~  251 (703)
T KOG2321|consen  201 FWDPRDKSRVGTLDAAS--------------S-----------VNSHPGGD----AAPSVTALKFRDDGLHVAVGTSTGS  251 (703)
T ss_pred             Eecchhhhhheeeeccc--------------c-----------cCCCcccc----ccCcceEEEecCCceeEEeeccCCc
Confidence            99998643322211100              0           00000000    0001112347777889999999999


Q ss_pred             EEEEECCCCeEEEEeecC--CCCeEEEEECCC--CCeEEEEeCCCCEEEeecCC
Q 022074          234 VYVYDLVSGEQVAALKYH--TSPVRDCSWHPS--QPMLVSSSWDGDVVRWEFPG  283 (303)
Q Consensus       234 i~iwd~~~~~~~~~~~~h--~~~I~~v~~sp~--~~~las~s~Dg~i~~Wd~~~  283 (303)
                      +.|||+++.+++.. +-|  ..||..++|.+.  ++.++|.. ...+++||-..
T Consensus       252 v~iyDLRa~~pl~~-kdh~~e~pi~~l~~~~~~~q~~v~S~D-k~~~kiWd~~~  303 (703)
T KOG2321|consen  252 VLIYDLRASKPLLV-KDHGYELPIKKLDWQDTDQQNKVVSMD-KRILKIWDECT  303 (703)
T ss_pred             EEEEEcccCCceee-cccCCccceeeecccccCCCceEEecc-hHHhhhccccc
Confidence            99999999887644 344  469999999887  45677764 57899999653


No 226
>KOG4547 consensus WD40 repeat-containing protein [General function prediction only]
Probab=99.15  E-value=6.7e-09  Score=94.02  Aligned_cols=189  Identities=17%  Similarity=0.168  Sum_probs=136.7

Q ss_pred             CCCCEEEEeeCCCeEEEEECCCCceEEEEec--c-cCCeEEEEEc------c-------------CCCcEEEEecCCCeE
Q 022074           49 TDGRELVAGSSDDCIYVYDLEANKLSLRILA--H-TSDVNTVCFG------D-------------ESGHLIYSGSDDNLC  106 (303)
Q Consensus        49 ~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~--h-~~~v~~l~~~------~-------------~~~~~l~s~s~dg~v  106 (303)
                      |-..++|....||.+|+||...+++...+..  | .+-+.+..|.      |             .+...++-|...|.|
T Consensus         3 ~~~~~~A~~~~~g~l~iw~t~~~~~~~e~~p~~~~s~t~~~~~w~L~~~~s~~k~~~~~~~~~~s~~t~~lvlgt~~g~v   82 (541)
T KOG4547|consen    3 PALDYFALSTGDGRLRIWDTAKNQLQQEFAPIASLSGTCTYTKWGLSADYSPMKWLSLEKAKKASLDTSMLVLGTPQGSV   82 (541)
T ss_pred             chhheEeecCCCCeEEEEEccCceeeeeeccchhccCcceeEEEEEEeccchHHHHhHHHHhhccCCceEEEeecCCccE
Confidence            4567899999999999999999987655543  2 2334444552      1             022367778888999


Q ss_pred             EEEcCccccCCCccceee--cccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCC
Q 022074          107 KVWDRRCLNVKGKPAGVL--MGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPP  184 (303)
Q Consensus       107 ~lWd~~~~~~~~~~~~~~--~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~  184 (303)
                      -+++.-.+    +....+  .+|.+.|+++..+.+-..|.|++.|..+-.|+......                      
T Consensus        83 ~~ys~~~g----~it~~~st~~h~~~v~~~~~~~~~~ciyS~~ad~~v~~~~~~~~~~----------------------  136 (541)
T KOG4547|consen   83 LLYSVAGG----EITAKLSTDKHYGNVNEILDAQRLGCIYSVGADLKVVYILEKEKVI----------------------  136 (541)
T ss_pred             EEEEecCC----eEEEEEecCCCCCcceeeecccccCceEecCCceeEEEEeccccee----------------------
Confidence            99986422    222222  35888999998888878899999999999999754211                      


Q ss_pred             CCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEECCC-
Q 022074          185 QARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSWHPS-  263 (303)
Q Consensus       185 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~-  263 (303)
                                  ++.+++....      ..+..++||++.+++|+  +.|++||+++++.+..|.+|.++|.+++|--+ 
T Consensus       137 ------------~~~~~~~~~~------~~sl~is~D~~~l~~as--~~ik~~~~~~kevv~~ftgh~s~v~t~~f~~~~  196 (541)
T KOG4547|consen  137 ------------IRIWKEQKPL------VSSLCISPDGKILLTAS--RQIKVLDIETKEVVITFTGHGSPVRTLSFTTLI  196 (541)
T ss_pred             ------------eeeeccCCCc------cceEEEcCCCCEEEecc--ceEEEEEccCceEEEEecCCCcceEEEEEEEec
Confidence                        1112211111      01234678999999887  77999999999999999999999999999877 


Q ss_pred             ----C-CeEEEEeCCCCEEEeecCC
Q 022074          264 ----Q-PMLVSSSWDGDVVRWEFPG  283 (303)
Q Consensus       264 ----~-~~las~s~Dg~i~~Wd~~~  283 (303)
                          | .+|.++..+.-+.+|-+..
T Consensus       197 ~g~~G~~vLssa~~~r~i~~w~v~~  221 (541)
T KOG4547|consen  197 DGIIGKYVLSSAAAERGITVWVVEK  221 (541)
T ss_pred             cccccceeeeccccccceeEEEEEc
Confidence                3 4788888899999998754


No 227
>KOG1240 consensus Protein kinase containing WD40 repeats [Signal transduction mechanisms]
Probab=99.14  E-value=1.7e-09  Score=105.07  Aligned_cols=183  Identities=21%  Similarity=0.243  Sum_probs=120.9

Q ss_pred             CCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCc---cceeecccccCeEEEEeCCCCCEEEEE
Q 022074           70 ANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGK---PAGVLMGHLEGITFIDSRGDGRYLISN  146 (303)
Q Consensus        70 ~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~---~~~~~~~h~~~v~~~~~~~~~~~l~s~  146 (303)
                      .|.++..+..|...|..++.+++.+.+|+|||.||+||+|+.+.......   ...++.--...+..+...+.++++|.+
T Consensus      1037 ~G~lVAhL~Ehs~~v~k~a~s~~~~s~FvsgS~DGtVKvW~~~k~~~~~~s~rS~ltys~~~sr~~~vt~~~~~~~~Av~ 1116 (1431)
T KOG1240|consen 1037 RGILVAHLHEHSSAVIKLAVSSEHTSLFVSGSDDGTVKVWNLRKLEGEGGSARSELTYSPEGSRVEKVTMCGNGDQFAVS 1116 (1431)
T ss_pred             cceEeehhhhccccccceeecCCCCceEEEecCCceEEEeeehhhhcCcceeeeeEEEeccCCceEEEEeccCCCeEEEE
Confidence            45567778889999999999888778999999999999999863222211   112222234567788888889999999


Q ss_pred             eCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCe-EE
Q 022074          147 GKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQK-YI  225 (303)
Q Consensus       147 ~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~-~l  225 (303)
                      +.||.|++.++..-.  .......              +........+..            +..++...  ...+. .+
T Consensus      1117 t~DG~v~~~~id~~~--~~~~~~~--------------~~ri~n~~~~g~------------vv~m~a~~--~~~~S~~l 1166 (1431)
T KOG1240|consen 1117 TKDGSVRVLRIDHYN--VSKRVAT--------------QVRIPNLKKDGV------------VVSMHAFT--AIVQSHVL 1166 (1431)
T ss_pred             cCCCeEEEEEccccc--cccceee--------------eeecccccCCCc------------eEEeeccc--ccccceeE
Confidence            999999999876420  0000000              000000000001            11111000  01233 67


Q ss_pred             EEEeCCCeEEEEECCCCeEEEEee--cCCCCeEEEEECCCCCeEEEEeCCCCEEEeecC
Q 022074          226 YTGSHDSCVYVYDLVSGEQVAALK--YHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFP  282 (303)
Q Consensus       226 atg~~dg~i~iwd~~~~~~~~~~~--~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~  282 (303)
                      +.+..-+.|..||+.+..-+-+++  ...+-|++++.+|.+++++.|..-|.+.+||+.
T Consensus      1167 vy~T~~~~iv~~D~r~~~~~w~lk~~~~hG~vTSi~idp~~~WlviGts~G~l~lWDLR 1225 (1431)
T KOG1240|consen 1167 VYATDLSRIVSWDTRMRHDAWRLKNQLRHGLVTSIVIDPWCNWLVIGTSRGQLVLWDLR 1225 (1431)
T ss_pred             EEEEeccceEEecchhhhhHHhhhcCccccceeEEEecCCceEEEEecCCceEEEEEee
Confidence            778888999999998775443332  334789999999999999999999999999974


No 228
>PF08662 eIF2A:  Eukaryotic translation initiation factor eIF2A;  InterPro: IPR013979  This entry contains beta propeller domains found in eukaryotic translation initiation factors and TolB domain-containing proteins.
Probab=99.13  E-value=2.1e-09  Score=88.20  Aligned_cols=112  Identities=25%  Similarity=0.434  Sum_probs=81.8

Q ss_pred             CcccceEEEEEcCCCCEEEE--eeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecC---CCeEEEEcC
Q 022074           37 GYSFGIFSLKFSTDGRELVA--GSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSD---DNLCKVWDR  111 (303)
Q Consensus        37 ~~~~~v~~l~~s~~g~~l~s--gs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~---dg~v~lWd~  111 (303)
                      -.+.+|.+++|+|+|+.+++  |..++.|.|||++ ++....+  +...++.+.|+| ++++++.++.   .|.+.+||.
T Consensus        57 ~~~~~I~~~~WsP~g~~favi~g~~~~~v~lyd~~-~~~i~~~--~~~~~n~i~wsP-~G~~l~~~g~~n~~G~l~~wd~  132 (194)
T PF08662_consen   57 KKEGPIHDVAWSPNGNEFAVIYGSMPAKVTLYDVK-GKKIFSF--GTQPRNTISWSP-DGRFLVLAGFGNLNGDLEFWDV  132 (194)
T ss_pred             cCCCceEEEEECcCCCEEEEEEccCCcccEEEcCc-ccEeEee--cCCCceEEEECC-CCCEEEEEEccCCCcEEEEEEC
Confidence            34456999999999998655  4467799999997 3333333  345788999976 5888888763   577999997


Q ss_pred             ccccCCCccceeecccccCeEEEEeCCCCCEEEEEeC------CCcEEEEEcc
Q 022074          112 RCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGK------DQAIKLWDIR  158 (303)
Q Consensus       112 ~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~------D~~v~lWdl~  158 (303)
                      +    +...+... .| ..++.+.|+|+|++|+++..      |..++||+..
T Consensus       133 ~----~~~~i~~~-~~-~~~t~~~WsPdGr~~~ta~t~~r~~~dng~~Iw~~~  179 (194)
T PF08662_consen  133 R----KKKKISTF-EH-SDATDVEWSPDGRYLATATTSPRLRVDNGFKIWSFQ  179 (194)
T ss_pred             C----CCEEeecc-cc-CcEEEEEEcCCCCEEEEEEeccceeccccEEEEEec
Confidence            5    22223222 23 34778999999999998874      8899999874


No 229
>PRK00178 tolB translocation protein TolB; Provisional
Probab=99.12  E-value=2.8e-08  Score=91.78  Aligned_cols=200  Identities=20%  Similarity=0.176  Sum_probs=117.9

Q ss_pred             CCcccceEEEEEcCCCCEEEEeeCC---CeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCC--eEEEEc
Q 022074           36 GGYSFGIFSLKFSTDGRELVAGSSD---DCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDN--LCKVWD  110 (303)
Q Consensus        36 ~~~~~~v~~l~~s~~g~~l~sgs~D---g~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg--~v~lWd  110 (303)
                      ..++..+....|+|||+.|+..+.+   ..|.+||+.++... .+....+.+....|+|+...++++.+.++  .|++||
T Consensus       195 ~~~~~~~~~p~wSpDG~~la~~s~~~~~~~l~~~~l~~g~~~-~l~~~~g~~~~~~~SpDG~~la~~~~~~g~~~Iy~~d  273 (430)
T PRK00178        195 LQSREPILSPRWSPDGKRIAYVSFEQKRPRIFVQNLDTGRRE-QITNFEGLNGAPAWSPDGSKLAFVLSKDGNPEIYVMD  273 (430)
T ss_pred             ecCCCceeeeeECCCCCEEEEEEcCCCCCEEEEEECCCCCEE-EccCCCCCcCCeEECCCCCEEEEEEccCCCceEEEEE
Confidence            3456678999999999998876644   36899999888643 33333444557889765333444666665  577778


Q ss_pred             CccccCCCccceeecccccCeEEEEeCCCCCEEEEEe-CCCcEEE--EEcccccCCcccccCccceeeeceeeeCCCCCc
Q 022074          111 RRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNG-KDQAIKL--WDIRKMSSNASCNLGFRSYEWDYRWMDYPPQAR  187 (303)
Q Consensus       111 ~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~-~D~~v~l--Wdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  187 (303)
                      +...     ....+..+........|+++|+.++..+ .++...+  +|+.....               .         
T Consensus       274 ~~~~-----~~~~lt~~~~~~~~~~~spDg~~i~f~s~~~g~~~iy~~d~~~g~~---------------~---------  324 (430)
T PRK00178        274 LASR-----QLSRVTNHPAIDTEPFWGKDGRTLYFTSDRGGKPQIYKVNVNGGRA---------------E---------  324 (430)
T ss_pred             CCCC-----CeEEcccCCCCcCCeEECCCCCEEEEEECCCCCceEEEEECCCCCE---------------E---------
Confidence            6522     1223444444455667899998876554 3444444  44321100               0         


Q ss_pred             cccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCC-C--eEEEEECCCCeEEEEeecCCCCeEEEEECCCC
Q 022074          188 DLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHD-S--CVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQ  264 (303)
Q Consensus       188 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~d-g--~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~  264 (303)
                      .+         ...+.        ....+.++|++++++..+.+ +  .|++||+.+++. ..+. +........|||||
T Consensus       325 ~l---------t~~~~--------~~~~~~~Spdg~~i~~~~~~~~~~~l~~~dl~tg~~-~~lt-~~~~~~~p~~spdg  385 (430)
T PRK00178        325 RV---------TFVGN--------YNARPRLSADGKTLVMVHRQDGNFHVAAQDLQRGSV-RILT-DTSLDESPSVAPNG  385 (430)
T ss_pred             Ee---------ecCCC--------CccceEECCCCCEEEEEEccCCceEEEEEECCCCCE-EEcc-CCCCCCCceECCCC
Confidence            00         00000        01134578889888776643 3  588999988764 2232 12223356899999


Q ss_pred             CeEEEEeCC-C--CEEEeecCCC
Q 022074          265 PMLVSSSWD-G--DVVRWEFPGN  284 (303)
Q Consensus       265 ~~las~s~D-g--~i~~Wd~~~~  284 (303)
                      ++++-++.+ +  .|.+.+..+.
T Consensus       386 ~~i~~~~~~~g~~~l~~~~~~g~  408 (430)
T PRK00178        386 TMLIYATRQQGRGVLMLVSINGR  408 (430)
T ss_pred             CEEEEEEecCCceEEEEEECCCC
Confidence            987766543 3  3556666543


No 230
>PRK05137 tolB translocation protein TolB; Provisional
Probab=99.11  E-value=3.9e-08  Score=90.95  Aligned_cols=174  Identities=15%  Similarity=0.106  Sum_probs=110.3

Q ss_pred             CeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEec---CCCeEEEEcCccccCCCccceeecccccCeEEEEeC
Q 022074           61 DCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGS---DDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSR  137 (303)
Q Consensus        61 g~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s---~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~  137 (303)
                      ..|.++|.++.. ...+..|...+....|+|+ ++.|+..+   .+..|.+||+...    . ...+..+...+....|+
T Consensus       182 ~~l~~~d~dg~~-~~~lt~~~~~v~~p~wSpD-G~~lay~s~~~g~~~i~~~dl~~g----~-~~~l~~~~g~~~~~~~S  254 (435)
T PRK05137        182 KRLAIMDQDGAN-VRYLTDGSSLVLTPRFSPN-RQEITYMSYANGRPRVYLLDLETG----Q-RELVGNFPGMTFAPRFS  254 (435)
T ss_pred             eEEEEECCCCCC-cEEEecCCCCeEeeEECCC-CCEEEEEEecCCCCEEEEEECCCC----c-EEEeecCCCcccCcEEC
Confidence            368888886554 3456678888999999865 66666554   3568999997532    1 22344455566677899


Q ss_pred             CCCCEEE-EEeCCCc--EEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEee
Q 022074          138 GDGRYLI-SNGKDQA--IKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHF  214 (303)
Q Consensus       138 ~~~~~l~-s~~~D~~--v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  214 (303)
                      |+|+.|+ +.+.++.  |.+||+.....                                   ..+..+..      ...
T Consensus       255 PDG~~la~~~~~~g~~~Iy~~d~~~~~~-----------------------------------~~Lt~~~~------~~~  293 (435)
T PRK05137        255 PDGRKVVMSLSQGGNTDIYTMDLRSGTT-----------------------------------TRLTDSPA------IDT  293 (435)
T ss_pred             CCCCEEEEEEecCCCceEEEEECCCCce-----------------------------------EEccCCCC------ccC
Confidence            9998765 6666666  44456542110                                   00000000      012


Q ss_pred             eeeeeCCCeEEEEEeC-C--CeEEEEECCCCeEEEEeecCCCCeEEEEECCCCCeEEEEeCC---CCEEEeecCC
Q 022074          215 SPVYSTGQKYIYTGSH-D--SCVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQPMLVSSSWD---GDVVRWEFPG  283 (303)
Q Consensus       215 ~~~~s~~~~~latg~~-d--g~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~las~s~D---g~i~~Wd~~~  283 (303)
                      .+.|+||++.++..+. +  ..|+++|...++. ..+..+...+....|||||+.|+..+.+   ..|.+||..+
T Consensus       294 ~~~~spDG~~i~f~s~~~g~~~Iy~~d~~g~~~-~~lt~~~~~~~~~~~SpdG~~ia~~~~~~~~~~i~~~d~~~  367 (435)
T PRK05137        294 SPSYSPDGSQIVFESDRSGSPQLYVMNADGSNP-RRISFGGGRYSTPVWSPRGDLIAFTKQGGGQFSIGVMKPDG  367 (435)
T ss_pred             ceeEcCCCCEEEEEECCCCCCeEEEEECCCCCe-EEeecCCCcccCeEECCCCCEEEEEEcCCCceEEEEEECCC
Confidence            3568899998887663 2  3689999876654 3343344567778999999998876654   3577788644


No 231
>PRK03629 tolB translocation protein TolB; Provisional
Probab=99.10  E-value=8e-09  Score=95.25  Aligned_cols=173  Identities=16%  Similarity=0.189  Sum_probs=108.0

Q ss_pred             eEEEEEcCCCCEEEEe-eCCC--eEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEE-EecCCCeEEEEcCccccCC
Q 022074           42 IFSLKFSTDGRELVAG-SSDD--CIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIY-SGSDDNLCKVWDRRCLNVK  117 (303)
Q Consensus        42 v~~l~~s~~g~~l~sg-s~Dg--~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~-s~s~dg~v~lWd~~~~~~~  117 (303)
                      +.+..|+|||+.|+.. +.+|  .|++||++++... ++..+...+....|+|+ ++.++ +...++...+|....   .
T Consensus       245 ~~~~~~SPDG~~La~~~~~~g~~~I~~~d~~tg~~~-~lt~~~~~~~~~~wSPD-G~~I~f~s~~~g~~~Iy~~d~---~  319 (429)
T PRK03629        245 NGAPAFSPDGSKLAFALSKTGSLNLYVMDLASGQIR-QVTDGRSNNTEPTWFPD-SQNLAYTSDQAGRPQVYKVNI---N  319 (429)
T ss_pred             cCCeEECCCCCEEEEEEcCCCCcEEEEEECCCCCEE-EccCCCCCcCceEECCC-CCEEEEEeCCCCCceEEEEEC---C
Confidence            4467999999988764 4454  5889999888654 44445556778899765 56554 444455556664321   1


Q ss_pred             CccceeecccccCeEEEEeCCCCCEEEEEeCC---CcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCC
Q 022074          118 GKPAGVLMGHLEGITFIDSRGDGRYLISNGKD---QAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCD  194 (303)
Q Consensus       118 ~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D---~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  194 (303)
                      ......+..+........++|+|++|+..+.+   ..+.+||+.....                                
T Consensus       320 ~g~~~~lt~~~~~~~~~~~SpDG~~Ia~~~~~~g~~~I~~~dl~~g~~--------------------------------  367 (429)
T PRK03629        320 GGAPQRITWEGSQNQDADVSSDGKFMVMVSSNGGQQHIAKQDLATGGV--------------------------------  367 (429)
T ss_pred             CCCeEEeecCCCCccCEEECCCCCEEEEEEccCCCceEEEEECCCCCe--------------------------------
Confidence            11122233333344567789999998776543   3466677643110                                


Q ss_pred             CcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCe---EEEEECCCCeEEEEeecCCCCeEEEEECC
Q 022074          195 QSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSC---VYVYDLVSGEQVAALKYHTSPVRDCSWHP  262 (303)
Q Consensus       195 ~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~---i~iwd~~~~~~~~~~~~h~~~I~~v~~sp  262 (303)
                         ..+....       ....|.|+|||++|+.++.++.   ++++++. |.....+..|.+.+...+|||
T Consensus       368 ---~~Lt~~~-------~~~~p~~SpDG~~i~~~s~~~~~~~l~~~~~~-G~~~~~l~~~~~~~~~p~Wsp  427 (429)
T PRK03629        368 ---QVLTDTF-------LDETPSIAPNGTMVIYSSSQGMGSVLNLVSTD-GRFKARLPATDGQVKFPAWSP  427 (429)
T ss_pred             ---EEeCCCC-------CCCCceECCCCCEEEEEEcCCCceEEEEEECC-CCCeEECccCCCCcCCcccCC
Confidence               0000000       0124668899999999887765   7777874 555566778888999999998


No 232
>KOG0771 consensus Prolactin regulatory element-binding protein/Protein transport protein SEC12p [Intracellular trafficking, secretion, and vesicular transport]
Probab=99.09  E-value=1.6e-09  Score=94.28  Aligned_cols=154  Identities=18%  Similarity=0.275  Sum_probs=99.4

Q ss_pred             EEEEEccCCCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCc
Q 022074           85 NTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNA  164 (303)
Q Consensus        85 ~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~  164 (303)
                      .+++|+. ++..+++++.||++|+|+..    ....+..+..|...|..++|++||..|++-+.| ..++|+++.....+
T Consensus       148 k~vaf~~-~gs~latgg~dg~lRv~~~P----s~~t~l~e~~~~~eV~DL~FS~dgk~lasig~d-~~~VW~~~~g~~~a  221 (398)
T KOG0771|consen  148 KVVAFNG-DGSKLATGGTDGTLRVWEWP----SMLTILEEIAHHAEVKDLDFSPDGKFLASIGAD-SARVWSVNTGAALA  221 (398)
T ss_pred             eEEEEcC-CCCEeeeccccceEEEEecC----cchhhhhhHhhcCccccceeCCCCcEEEEecCC-ceEEEEeccCchhh
Confidence            5778864 57899999999999999953    223344566788999999999999999999999 99999987642211


Q ss_pred             ccccCccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCC-----eEEEEEeCCCeEEEEEC
Q 022074          165 SCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQ-----KYIYTGSHDSCVYVYDL  239 (303)
Q Consensus       165 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~-----~~latg~~dg~i~iwd~  239 (303)
                      ...  .....+.                                ...|.|    +.++     .+++....-+.|+.||+
T Consensus       222 ~~t--~~~k~~~--------------------------------~~~cRF----~~d~~~~~l~laa~~~~~~~v~~~~~  263 (398)
T KOG0771|consen  222 RKT--PFSKDEM--------------------------------FSSCRF----SVDNAQETLRLAASQFPGGGVRLCDI  263 (398)
T ss_pred             hcC--Ccccchh--------------------------------hhhcee----cccCCCceEEEEEecCCCCceeEEEe
Confidence            110  0000000                                011111    1111     22222334445555554


Q ss_pred             CCCeE---E--E-EeecCCCCeEEEEECCCCCeEEEEeCCCCEEEeecCC
Q 022074          240 VSGEQ---V--A-ALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFPG  283 (303)
Q Consensus       240 ~~~~~---~--~-~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~~  283 (303)
                      ...+.   +  . .+..+ ..|.+++.|++|+++|-|+.||.+-+.+...
T Consensus       264 ~~w~~~~~l~~~~~~~~~-~siSsl~VS~dGkf~AlGT~dGsVai~~~~~  312 (398)
T KOG0771|consen  264 SLWSGSNFLRLRKKIKRF-KSISSLAVSDDGKFLALGTMDGSVAIYDAKS  312 (398)
T ss_pred             eeeccccccchhhhhhcc-CcceeEEEcCCCcEEEEeccCCcEEEEEece
Confidence            32211   1  1 11222 3699999999999999999999999998654


No 233
>KOG2111 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=99.09  E-value=1.2e-07  Score=80.24  Aligned_cols=230  Identities=16%  Similarity=0.229  Sum_probs=147.9

Q ss_pred             EccCch-hhccccccccccCcCcccccCCCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCC-ceEEEEec--ccCCe
Q 022074            9 DVGSGT-MESLANVTEIHDGLDFSAADDGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEAN-KLSLRILA--HTSDV   84 (303)
Q Consensus         9 ~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~-~~~~~~~~--h~~~v   84 (303)
                      .||-|+ -.|--|=-+|||..+.-...+--+..+|.++.++++  .|++. -.+.|+||...+. +....+..  --.++
T Consensus        63 LVGGg~~pky~pNkviIWDD~k~~~i~el~f~~~I~~V~l~r~--riVvv-l~~~I~VytF~~n~k~l~~~et~~NPkGl  139 (346)
T KOG2111|consen   63 LVGGGSRPKYPPNKVIIWDDLKERCIIELSFNSEIKAVKLRRD--RIVVV-LENKIYVYTFPDNPKLLHVIETRSNPKGL  139 (346)
T ss_pred             EecCCCCCCCCCceEEEEecccCcEEEEEEeccceeeEEEcCC--eEEEE-ecCeEEEEEcCCChhheeeeecccCCCce
Confidence            455555 455556667888776667777788899999999975  45554 4567999988643 22222221  12346


Q ss_pred             EEEEEccCCCcEEE-EecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCc-EEEEEcccccC
Q 022074           85 NTVCFGDESGHLIY-SGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQA-IKLWDIRKMSS  162 (303)
Q Consensus        85 ~~l~~~~~~~~~l~-s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~-v~lWdl~~~~~  162 (303)
                      ++++-+. +..+|+ =|-.-|+|.+-|+.....  .+......|...|.+++.+.+|.++||+|..|+ |||||.+....
T Consensus       140 C~~~~~~-~k~~LafPg~k~GqvQi~dL~~~~~--~~p~~I~AH~s~Iacv~Ln~~Gt~vATaStkGTLIRIFdt~~g~~  216 (346)
T KOG2111|consen  140 CSLCPTS-NKSLLAFPGFKTGQVQIVDLASTKP--NAPSIINAHDSDIACVALNLQGTLVATASTKGTLIRIFDTEDGTL  216 (346)
T ss_pred             EeecCCC-CceEEEcCCCccceEEEEEhhhcCc--CCceEEEcccCceeEEEEcCCccEEEEeccCcEEEEEEEcCCCcE
Confidence            6665432 233443 344678999999863221  134466789999999999999999999997665 79999865322


Q ss_pred             CcccccCccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCC
Q 022074          163 NASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSG  242 (303)
Q Consensus       163 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~  242 (303)
                      .                                  ..+.......    -.++.+|||+.++||++|..|+++|+.++..
T Consensus       217 l----------------------------------~E~RRG~d~A----~iy~iaFSp~~s~LavsSdKgTlHiF~l~~~  258 (346)
T KOG2111|consen  217 L----------------------------------QELRRGVDRA----DIYCIAFSPNSSWLAVSSDKGTLHIFSLRDT  258 (346)
T ss_pred             e----------------------------------eeeecCCchh----eEEEEEeCCCccEEEEEcCCCeEEEEEeecC
Confidence            1                                  1111000000    1123468999999999999999999987532


Q ss_pred             e---E---------------------EEEeecCCCCeEEEEECCCCCeEEEEeCCCCEEEeecC
Q 022074          243 E---Q---------------------VAALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFP  282 (303)
Q Consensus       243 ~---~---------------------~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~  282 (303)
                      .   .                     ...+.-.+++..-++|-.+.+.++..+.||+-+-+.+.
T Consensus       259 ~~~~~~~SSl~~~~~~lpky~~S~wS~~~f~l~~~~~~~~~fg~~~nsvi~i~~Dgsy~k~~f~  322 (346)
T KOG2111|consen  259 ENTEDESSSLSFKRLVLPKYFSSEWSFAKFQLPQGTQCIIAFGSETNTVIAICADGSYYKFKFD  322 (346)
T ss_pred             CCCccccccccccccccchhcccceeEEEEEccCCCcEEEEecCCCCeEEEEEeCCcEEEEEec
Confidence            1   1                     01111224556667777776777777788888777654


No 234
>KOG3881 consensus Uncharacterized conserved protein [Function unknown]
Probab=99.06  E-value=7.6e-09  Score=89.47  Aligned_cols=182  Identities=13%  Similarity=0.150  Sum_probs=123.4

Q ss_pred             cceEEEEEcCCCC-EEEEeeCC--CeEEEEECCCCceEEEEecc-cC--------CeEEEEEccCC-CcEEEEecCCCeE
Q 022074           40 FGIFSLKFSTDGR-ELVAGSSD--DCIYVYDLEANKLSLRILAH-TS--------DVNTVCFGDES-GHLIYSGSDDNLC  106 (303)
Q Consensus        40 ~~v~~l~~s~~g~-~l~sgs~D--g~v~lwd~~~~~~~~~~~~h-~~--------~v~~l~~~~~~-~~~l~s~s~dg~v  106 (303)
                      .++..+.-++.-. ++++|+..  ..+.|||+.+.+.+.+-..- ++        -++.+.|.++. ...|++++.-++|
T Consensus       149 ~g~~~~r~~~~~p~Iva~GGke~~n~lkiwdle~~~qiw~aKNvpnD~L~LrVPvW~tdi~Fl~g~~~~~fat~T~~hqv  228 (412)
T KOG3881|consen  149 PGLYDVRQTDTDPYIVATGGKENINELKIWDLEQSKQIWSAKNVPNDRLGLRVPVWITDIRFLEGSPNYKFATITRYHQV  228 (412)
T ss_pred             CceeeeccCCCCCceEecCchhcccceeeeecccceeeeeccCCCCccccceeeeeeccceecCCCCCceEEEEecceeE
Confidence            4577777776544 55668888  77999999888433221111 11        23566775432 5689999999999


Q ss_pred             EEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCC
Q 022074          107 KVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQA  186 (303)
Q Consensus       107 ~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  186 (303)
                      |+||.+.   ..+|+..+.--..+++++...|+++++++|..-+.+..||+|.......                     
T Consensus       229 R~YDt~~---qRRPV~~fd~~E~~is~~~l~p~gn~Iy~gn~~g~l~~FD~r~~kl~g~---------------------  284 (412)
T KOG3881|consen  229 RLYDTRH---QRRPVAQFDFLENPISSTGLTPSGNFIYTGNTKGQLAKFDLRGGKLLGC---------------------  284 (412)
T ss_pred             EEecCcc---cCcceeEeccccCcceeeeecCCCcEEEEecccchhheecccCceeecc---------------------
Confidence            9999862   3466766666677899999999999999999999999999987543211                     


Q ss_pred             ccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEECCCC
Q 022074          187 RDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQ  264 (303)
Q Consensus       187 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~  264 (303)
                                  .++|.......+      ..+|..+++|++|-|.++||+|+++++++... .-...++.+-+.++-
T Consensus       285 ------------~~kg~tGsirsi------h~hp~~~~las~GLDRyvRIhD~ktrkll~kv-YvKs~lt~il~~~~~  343 (412)
T KOG3881|consen  285 ------------GLKGITGSIRSI------HCHPTHPVLASCGLDRYVRIHDIKTRKLLHKV-YVKSRLTFILLRDDV  343 (412)
T ss_pred             ------------ccCCccCCcceE------EEcCCCceEEeeccceeEEEeecccchhhhhh-hhhccccEEEecCCc
Confidence                        011111101111      13466789999999999999999997765432 223456777776543


No 235
>KOG2695 consensus WD40 repeat protein [General function prediction only]
Probab=99.05  E-value=9.9e-10  Score=93.56  Aligned_cols=178  Identities=21%  Similarity=0.340  Sum_probs=125.2

Q ss_pred             EEEcCC--CCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCC-Cccc
Q 022074           45 LKFSTD--GRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVK-GKPA  121 (303)
Q Consensus        45 l~~s~~--g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~-~~~~  121 (303)
                      ++|+-+  |-. ++.+.+-.|-|-++.+|...  ....++.|.++.|. ..+++++.|..+|.|...|+|+.++- +.+.
T Consensus       217 CawSlni~gyh-fs~G~sqqv~L~nvetg~~q--sf~sksDVfAlQf~-~s~nLv~~GcRngeI~~iDLR~rnqG~~~~a  292 (425)
T KOG2695|consen  217 CAWSLNIMGYH-FSVGLSQQVLLTNVETGHQQ--SFQSKSDVFALQFA-GSDNLVFNGCRNGEIFVIDLRCRNQGNGWCA  292 (425)
T ss_pred             hhhhhccceee-ecccccceeEEEEeeccccc--ccccchhHHHHHhc-ccCCeeEecccCCcEEEEEeeecccCCCcce
Confidence            466643  434 44455666889999998643  33466789999995 45789999999999999999976432 1222


Q ss_pred             eeecccccCeEEEEeCC-CCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCcceEE
Q 022074          122 GVLMGHLEGITFIDSRG-DGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATY  200 (303)
Q Consensus       122 ~~~~~h~~~v~~~~~~~-~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  200 (303)
                      ..+ -|..+|+++..-. ++.+|++.+-+|+|++||+|..+.                               ...+..+
T Consensus       293 ~rl-yh~Ssvtslq~Lq~s~q~LmaS~M~gkikLyD~R~~K~-------------------------------~~~V~qY  340 (425)
T KOG2695|consen  293 QRL-YHDSSVTSLQILQFSQQKLMASDMTGKIKLYDLRATKC-------------------------------KKSVMQY  340 (425)
T ss_pred             EEE-EcCcchhhhhhhccccceEeeccCcCceeEeeehhhhc-------------------------------ccceeee
Confidence            222 3888999987766 788999999999999999985432                               2235556


Q ss_pred             ecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecC----CCCeEEEEECC
Q 022074          201 KGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYH----TSPVRDCSWHP  262 (303)
Q Consensus       201 ~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h----~~~I~~v~~sp  262 (303)
                      .||......+..+.    .+....++++|+|=+.|||.++.|.++.+++-.    +..+++++|..
T Consensus       341 eGHvN~~a~l~~~v----~~eeg~I~s~GdDcytRiWsl~~ghLl~tipf~~s~~e~d~~sv~~~s  402 (425)
T KOG2695|consen  341 EGHVNLSAYLPAHV----KEEEGSIFSVGDDCYTRIWSLDSGHLLCTIPFPYSASEVDIPSVAFDS  402 (425)
T ss_pred             eccccccccccccc----ccccceEEEccCeeEEEEEecccCceeeccCCCCccccccccceehhc
Confidence            66655443333332    234567888999999999999999998887532    33567777754


No 236
>KOG1272 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=99.04  E-value=6.7e-10  Score=97.81  Aligned_cols=116  Identities=22%  Similarity=0.356  Sum_probs=97.7

Q ss_pred             ceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCcc
Q 022074           41 GIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKP  120 (303)
Q Consensus        41 ~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~  120 (303)
                      +|..+.|.|.-=.|++++..|.++--|+.+|+++..+..-.+.+..++.+|- +-.+-.|..+|+|.+|...    ...+
T Consensus       211 ~v~rLeFLPyHfLL~~~~~~G~L~Y~DVS~GklVa~~~t~~G~~~vm~qNP~-NaVih~GhsnGtVSlWSP~----skeP  285 (545)
T KOG1272|consen  211 RVARLEFLPYHFLLVAASEAGFLKYQDVSTGKLVASIRTGAGRTDVMKQNPY-NAVIHLGHSNGTVSLWSPN----SKEP  285 (545)
T ss_pred             chhhhcccchhheeeecccCCceEEEeechhhhhHHHHccCCccchhhcCCc-cceEEEcCCCceEEecCCC----Ccch
Confidence            6888899998777888899999999999999988777666677888888764 4578899999999999864    2344


Q ss_pred             ceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEccccc
Q 022074          121 AGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMS  161 (303)
Q Consensus       121 ~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~  161 (303)
                      ...+..|.++|.++++.++|+|++|.|.|+.++|||+|...
T Consensus       286 LvKiLcH~g~V~siAv~~~G~YMaTtG~Dr~~kIWDlR~~~  326 (545)
T KOG1272|consen  286 LVKILCHRGPVSSIAVDRGGRYMATTGLDRKVKIWDLRNFY  326 (545)
T ss_pred             HHHHHhcCCCcceEEECCCCcEEeecccccceeEeeecccc
Confidence            55566799999999999999999999999999999999744


No 237
>PRK04792 tolB translocation protein TolB; Provisional
Probab=99.02  E-value=1e-07  Score=88.35  Aligned_cols=196  Identities=17%  Similarity=0.165  Sum_probs=112.5

Q ss_pred             cceEEEEEcCCCCEEEEeeCC-C--eEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCe--EEEEcCccc
Q 022074           40 FGIFSLKFSTDGRELVAGSSD-D--CIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNL--CKVWDRRCL  114 (303)
Q Consensus        40 ~~v~~l~~s~~g~~l~sgs~D-g--~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~--v~lWd~~~~  114 (303)
                      ..+.+..|+|||+.|+..+.+ +  .|++||+.+++.. .+....+......|+|+...++++.+.++.  |.++|+...
T Consensus       218 ~~~~~p~wSPDG~~La~~s~~~g~~~L~~~dl~tg~~~-~lt~~~g~~~~~~wSPDG~~La~~~~~~g~~~Iy~~dl~tg  296 (448)
T PRK04792        218 EPLMSPAWSPDGRKLAYVSFENRKAEIFVQDIYTQVRE-KVTSFPGINGAPRFSPDGKKLALVLSKDGQPEIYVVDIATK  296 (448)
T ss_pred             CcccCceECCCCCEEEEEEecCCCcEEEEEECCCCCeE-EecCCCCCcCCeeECCCCCEEEEEEeCCCCeEEEEEECCCC
Confidence            456789999999988876543 2  5888899887643 233223344577897654434456667775  666675421


Q ss_pred             cCCCccceeecccccCeEEEEeCCCCCEEEEEe-CCCcEEEE--EcccccCCcccccCccceeeeceeeeCCCCCccccC
Q 022074          115 NVKGKPAGVLMGHLEGITFIDSRGDGRYLISNG-KDQAIKLW--DIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKH  191 (303)
Q Consensus       115 ~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~-~D~~v~lW--dl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  191 (303)
                           ....+..+........|+++|+.|+..+ .++...+|  |+.....        .                    
T Consensus       297 -----~~~~lt~~~~~~~~p~wSpDG~~I~f~s~~~g~~~Iy~~dl~~g~~--------~--------------------  343 (448)
T PRK04792        297 -----ALTRITRHRAIDTEPSWHPDGKSLIFTSERGGKPQIYRVNLASGKV--------S--------------------  343 (448)
T ss_pred             -----CeEECccCCCCccceEECCCCCEEEEEECCCCCceEEEEECCCCCE--------E--------------------
Confidence                 2233334444456678999998876544 44555555  3321100        0                    


Q ss_pred             CCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeC-CC--eEEEEECCCCeEEEEeecCCCCeEEEEECCCCCeEE
Q 022074          192 PCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSH-DS--CVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQPMLV  268 (303)
Q Consensus       192 ~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~-dg--~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~la  268 (303)
                           ..++.+..        ...+.++|++++++..+. ++  .|.++|+.+++.. .+... .......|+|||++|+
T Consensus       344 -----~Lt~~g~~--------~~~~~~SpDG~~l~~~~~~~g~~~I~~~dl~~g~~~-~lt~~-~~d~~ps~spdG~~I~  408 (448)
T PRK04792        344 -----RLTFEGEQ--------NLGGSITPDGRSMIMVNRTNGKFNIARQDLETGAMQ-VLTST-RLDESPSVAPNGTMVI  408 (448)
T ss_pred             -----EEecCCCC--------CcCeeECCCCCEEEEEEecCCceEEEEEECCCCCeE-EccCC-CCCCCceECCCCCEEE
Confidence                 00001100        113457889988877654 33  5677888877642 23222 1223458999999766


Q ss_pred             EEeC-CCC--EEEeecCCC
Q 022074          269 SSSW-DGD--VVRWEFPGN  284 (303)
Q Consensus       269 s~s~-Dg~--i~~Wd~~~~  284 (303)
                      -++. ++.  +.+++..+.
T Consensus       409 ~~~~~~g~~~l~~~~~~G~  427 (448)
T PRK04792        409 YSTTYQGKQVLAAVSIDGR  427 (448)
T ss_pred             EEEecCCceEEEEEECCCC
Confidence            5554 443  566666543


No 238
>KOG1409 consensus Uncharacterized conserved protein, contains WD40 repeats and FYVE domains [Function unknown]
Probab=98.99  E-value=2.6e-08  Score=85.05  Aligned_cols=248  Identities=17%  Similarity=0.226  Sum_probs=159.3

Q ss_pred             CCcccceEEEEEcCCCCEEEEeeCCCeEEEEEC-CCCceEEEEec-ccCCeEEEEEccCCCcEEEEecCCCeEEEEcCcc
Q 022074           36 GGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDL-EANKLSLRILA-HTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRC  113 (303)
Q Consensus        36 ~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~-~~~~~~~~~~~-h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~  113 (303)
                      -||-..|++...-|..+-+.+.+.|.++|||-- +.++-...+.. -..+++++.+.++ ...|+.|-.+|++.-+.+..
T Consensus        21 eG~~d~vn~~~l~~~e~gv~~~s~drtvrv~lkrds~q~wpsI~~~mP~~~~~~~y~~e-~~~L~vg~~ngtvtefs~se   99 (404)
T KOG1409|consen   21 EGSQDDVNAAILIPKEEGVISVSEDRTVRVWLKRDSGQYWPSIYHYMPSPCSAMEYVSE-SRRLYVGQDNGTVTEFALSE   99 (404)
T ss_pred             cCchhhhhhheeccCCCCeEEccccceeeeEEeccccccCchhhhhCCCCceEeeeecc-ceEEEEEEecceEEEEEhhh
Confidence            477777888888888888999999999999933 33443222221 1257888888654 56788899999999886532


Q ss_pred             ccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCc--cceeeeceeeeC----CCCCc
Q 022074          114 LNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGF--RSYEWDYRWMDY----PPQAR  187 (303)
Q Consensus       114 ~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~--~~~~~~~~~~~~----~~~~~  187 (303)
                      .-......+....|...+..+-|+..-+++++.+.|+.+.---.+.......+.+.-  ....++.. ..+    .....
T Consensus       100 dfnkm~~~r~~~~h~~~v~~~if~~~~e~V~s~~~dk~~~~hc~e~~~~lg~Y~~~~~~t~~~~d~~-~~fvGd~~gqvt  178 (404)
T KOG1409|consen  100 DFNKMTFLKDYLAHQARVSAIVFSLTHEWVLSTGKDKQFAWHCTESGNRLGGYNFETPASALQFDAL-YAFVGDHSGQIT  178 (404)
T ss_pred             hhhhcchhhhhhhhhcceeeEEecCCceeEEEeccccceEEEeeccCCcccceEeeccCCCCceeeE-EEEecccccceE
Confidence            111223455677899999999888888899999999886544333222111111000  00001100 000    00000


Q ss_pred             c--ccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeE-EEEeecCCCCeEEEEECCCC
Q 022074          188 D--LKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQ-VAALKYHTSPVRDCSWHPSQ  264 (303)
Q Consensus       188 ~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~-~~~~~~h~~~I~~v~~sp~~  264 (303)
                      .  +....-+.+..+.+|..-..      ...+.+..+.|.+|..|..+-+||+.-+.. ..++.+|...|..+..-+..
T Consensus       179 ~lr~~~~~~~~i~~~~~h~~~~~------~l~Wd~~~~~LfSg~~d~~vi~wdigg~~g~~~el~gh~~kV~~l~~~~~t  252 (404)
T KOG1409|consen  179 MLKLEQNGCQLITTFNGHTGEVT------CLKWDPGQRLLFSGASDHSVIMWDIGGRKGTAYELQGHNDKVQALSYAQHT  252 (404)
T ss_pred             EEEEeecCCceEEEEcCcccceE------EEEEcCCCcEEEeccccCceEEEeccCCcceeeeeccchhhhhhhhhhhhh
Confidence            0  00001122333444432222      223456678899999999999999975543 45677899999999999999


Q ss_pred             CeEEEEeCCCCEEEeecCCCCccCCCC
Q 022074          265 PMLVSSSWDGDVVRWEFPGNGEAAPPL  291 (303)
Q Consensus       265 ~~las~s~Dg~i~~Wd~~~~~~~~~~~  291 (303)
                      +.|.|+++||.|.+|+....+.+.+..
T Consensus       253 ~~l~S~~edg~i~~w~mn~~r~etpew  279 (404)
T KOG1409|consen  253 RQLISCGEDGGIVVWNMNVKRVETPEW  279 (404)
T ss_pred             eeeeeccCCCeEEEEeccceeecCccc
Confidence            999999999999999998887776654


No 239
>PRK01029 tolB translocation protein TolB; Provisional
Probab=98.98  E-value=2.6e-07  Score=85.13  Aligned_cols=205  Identities=17%  Similarity=0.133  Sum_probs=115.5

Q ss_pred             cceEEEEEcCCCCEE---EEeeCCC--eEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCC----eEEEEc
Q 022074           40 FGIFSLKFSTDGREL---VAGSSDD--CIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDN----LCKVWD  110 (303)
Q Consensus        40 ~~v~~l~~s~~g~~l---~sgs~Dg--~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg----~v~lWd  110 (303)
                      ....+=.|||||+.+   ++...+|  .|++.++.++... ++....+......|+|+...++++.+.+|    .+.+|+
T Consensus       185 ~~~~sP~wSPDG~~~~~~y~S~~~g~~~I~~~~l~~g~~~-~lt~~~g~~~~p~wSPDG~~Laf~s~~~g~~di~~~~~~  263 (428)
T PRK01029        185 SLSITPTWMHIGSGFPYLYVSYKLGVPKIFLGSLENPAGK-KILALQGNQLMPTFSPRKKLLAFISDRYGNPDLFIQSFS  263 (428)
T ss_pred             CCcccceEccCCCceEEEEEEccCCCceEEEEECCCCCce-EeecCCCCccceEECCCCCEEEEEECCCCCcceeEEEee
Confidence            345667899999852   2443343  5788899877643 34444455567789865334444443333    344466


Q ss_pred             CccccCCCccceeecccccCeEEEEeCCCCCEEEEEe-CCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccc
Q 022074          111 RRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNG-KDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDL  189 (303)
Q Consensus       111 ~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~-~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  189 (303)
                      +.... .+.+.....++........|+|||+.|+..+ .++...+|.+.......                         
T Consensus       264 ~~~g~-~g~~~~lt~~~~~~~~~p~wSPDG~~Laf~s~~~g~~~ly~~~~~~~g~-------------------------  317 (428)
T PRK01029        264 LETGA-IGKPRRLLNEAFGTQGNPSFSPDGTRLVFVSNKDGRPRIYIMQIDPEGQ-------------------------  317 (428)
T ss_pred             cccCC-CCcceEeecCCCCCcCCeEECCCCCEEEEEECCCCCceEEEEECccccc-------------------------
Confidence            54210 1122222222223345568999999877655 56777777543210000                         


Q ss_pred             cCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCC---CeEEEEECCCCeEEEEeecCCCCeEEEEECCCCCe
Q 022074          190 KHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHD---SCVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQPM  266 (303)
Q Consensus       190 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~d---g~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~  266 (303)
                            ....+....      .....|.+||||+.|+..+.+   ..|++||+.+++.. .+......+....|+|||++
T Consensus       318 ------~~~~lt~~~------~~~~~p~wSPDG~~Laf~~~~~g~~~I~v~dl~~g~~~-~Lt~~~~~~~~p~wSpDG~~  384 (428)
T PRK01029        318 ------SPRLLTKKY------RNSSCPAWSPDGKKIAFCSVIKGVRQICVYDLATGRDY-QLTTSPENKESPSWAIDSLH  384 (428)
T ss_pred             ------ceEEeccCC------CCccceeECCCCCEEEEEEcCCCCcEEEEEECCCCCeE-EccCCCCCccceEECCCCCE
Confidence                  000000000      011245688999988876543   47999999888753 33323345778999999997


Q ss_pred             EEE-EeC--CCCEEEeecCCC
Q 022074          267 LVS-SSW--DGDVVRWEFPGN  284 (303)
Q Consensus       267 las-~s~--Dg~i~~Wd~~~~  284 (303)
                      |+- +..  ...|.+|++.+.
T Consensus       385 L~f~~~~~g~~~L~~vdl~~g  405 (428)
T PRK01029        385 LVYSAGNSNESELYLISLITK  405 (428)
T ss_pred             EEEEECCCCCceEEEEECCCC
Confidence            764 332  356777887653


No 240
>KOG1064 consensus RAVE (regulator of V-ATPase assembly) complex subunit RAV1/DMX protein, WD repeat superfamily [General function prediction only]
Probab=98.98  E-value=1.1e-09  Score=109.70  Aligned_cols=186  Identities=18%  Similarity=0.257  Sum_probs=135.5

Q ss_pred             EEEEcc--CchhhccccccccccCcCcccccCCCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCC
Q 022074            6 HIVDVG--SGTMESLANVTEIHDGLDFSAADDGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSD   83 (303)
Q Consensus         6 ~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~   83 (303)
                      ||.+.+  +|+-|.++.+||--.|.+---....|- ..|..+.|+.+|+....+..||.+.+|... .+.....+.|+..
T Consensus      2217 Hp~~~~Yltgs~dgsv~~~~w~~~~~v~~~rt~g~-s~vtr~~f~~qGnk~~i~d~dg~l~l~q~~-pk~~~s~qchnk~ 2294 (2439)
T KOG1064|consen 2217 HPSDPYYLTGSQDGSVRMFEWGHGQQVVCFRTAGN-SRVTRSRFNHQGNKFGIVDGDGDLSLWQAS-PKPYTSWQCHNKA 2294 (2439)
T ss_pred             CCCCceEEecCCCceEEEEeccCCCeEEEeeccCc-chhhhhhhcccCCceeeeccCCceeecccC-CcceeccccCCcc
Confidence            555554  566677888888744444433333455 788999999999999999999999999876 3344556779988


Q ss_pred             eEEEEEccCCCcEEEEec---CCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccc
Q 022074           84 VNTVCFGDESGHLIYSGS---DDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKM  160 (303)
Q Consensus        84 v~~l~~~~~~~~~l~s~s---~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~  160 (303)
                      ...+.|-.   ..+++++   .++.+.+||.-   ...........|..+++++++-|...+|++||++|.|++||+|..
T Consensus      2295 ~~Df~Fi~---s~~~tag~s~d~~n~~lwDtl---~~~~~s~v~~~H~~gaT~l~~~P~~qllisggr~G~v~l~D~rqr 2368 (2439)
T KOG1064|consen 2295 LSDFRFIG---SLLATAGRSSDNRNVCLWDTL---LPPMNSLVHTCHDGGATVLAYAPKHQLLISGGRKGEVCLFDIRQR 2368 (2439)
T ss_pred             ccceeeee---hhhhccccCCCCCcccchhcc---cCcccceeeeecCCCceEEEEcCcceEEEecCCcCcEEEeehHHH
Confidence            89998842   5677664   47899999953   122222334679999999999999999999999999999999853


Q ss_pred             cCCcccccCccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECC
Q 022074          161 SSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLV  240 (303)
Q Consensus       161 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~  240 (303)
                      +..    +.+                              +               ... ...++++|+..|+|+||++.
T Consensus      2369 ql~----h~~------------------------------~---------------~~~-~~~~f~~~ss~g~ikIw~~s 2398 (2439)
T KOG1064|consen 2369 QLR----HTF------------------------------Q---------------ALD-TREYFVTGSSEGNIKIWRLS 2398 (2439)
T ss_pred             HHH----HHh------------------------------h---------------hhh-hhheeeccCcccceEEEEcc
Confidence            211    000                              0               001 24679999999999999998


Q ss_pred             CCeEEEEee
Q 022074          241 SGEQVAALK  249 (303)
Q Consensus       241 ~~~~~~~~~  249 (303)
                      .-..++++.
T Consensus      2399 ~~~ll~~~p 2407 (2439)
T KOG1064|consen 2399 EFGLLHTFP 2407 (2439)
T ss_pred             ccchhhcCc
Confidence            877676654


No 241
>PF00400 WD40:  WD domain, G-beta repeat;  InterPro: IPR019781 WD-40 repeats (also known as WD or beta-transducin repeats) are short ~40 amino acid motifs, often terminating in a Trp-Asp (W-D) dipeptide. WD40 repeats usually assume a 7-8 bladed beta-propeller fold, but proteins have been found with 4 to 16 repeated units, which also form a circularised beta-propeller structure. WD-repeat proteins are a large family found in all eukaryotes and are implicated in a variety of functions ranging from signal transduction and transcription regulation to cell cycle control and apoptosis. Repeated WD40 motifs act as a site for protein-protein interaction, and proteins containing WD40 repeats are known to serve as platforms for the assembly of protein complexes or mediators of transient interplay among other proteins. The specificity of the proteins is determined by the sequences outside the repeats themselves. Examples of such complexes are G proteins (beta subunit is a beta-propeller), TAFII transcription factor, and E3 ubiquitin ligase. In Arabidopsis spp., several WD40-containing proteins act as key regulators of plant-specific developmental events.; PDB: 2ZKQ_a 3CFV_B 3CFS_B 1PEV_A 1NR0_A 1VYH_T 3RFH_A 3O2Z_T 3FRX_C 3U5G_g ....
Probab=98.96  E-value=2.1e-09  Score=63.88  Aligned_cols=39  Identities=31%  Similarity=0.669  Sum_probs=36.9

Q ss_pred             CeEEEEeecCCCCeEEEEECCCCCeEEEEeCCCCEEEee
Q 022074          242 GEQVAALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWE  280 (303)
Q Consensus       242 ~~~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd  280 (303)
                      ++++.++.+|..+|++++|+|++++|+|++.|+.|++||
T Consensus         1 g~~~~~~~~h~~~i~~i~~~~~~~~~~s~~~D~~i~vwd   39 (39)
T PF00400_consen    1 GKCVRTFRGHSSSINSIAWSPDGNFLASGSSDGTIRVWD   39 (39)
T ss_dssp             EEEEEEEESSSSSEEEEEEETTSSEEEEEETTSEEEEEE
T ss_pred             CeEEEEEcCCCCcEEEEEEecccccceeeCCCCEEEEEC
Confidence            467889999999999999999999999999999999997


No 242
>PRK02889 tolB translocation protein TolB; Provisional
Probab=98.95  E-value=1.5e-07  Score=86.92  Aligned_cols=177  Identities=15%  Similarity=0.146  Sum_probs=105.3

Q ss_pred             CeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecC---CCeEEEEcCccccCCCccceeecccccCeEEEEeC
Q 022074           61 DCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSD---DNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSR  137 (303)
Q Consensus        61 g~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~---dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~  137 (303)
                      ..|.++|.+.. ....+..+...+...+|+|+ ++.++..+.   ...|.+||+...    . ...+......+....|+
T Consensus       176 ~~L~~~D~dG~-~~~~l~~~~~~v~~p~wSPD-G~~la~~s~~~~~~~I~~~dl~~g----~-~~~l~~~~g~~~~~~~S  248 (427)
T PRK02889        176 YQLQISDADGQ-NAQSALSSPEPIISPAWSPD-GTKLAYVSFESKKPVVYVHDLATG----R-RRVVANFKGSNSAPAWS  248 (427)
T ss_pred             cEEEEECCCCC-CceEeccCCCCcccceEcCC-CCEEEEEEccCCCcEEEEEECCCC----C-EEEeecCCCCccceEEC
Confidence            35777776443 33445667778889999865 666665543   346999997532    1 12232233445577899


Q ss_pred             CCCCEEE-EEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeee
Q 022074          138 GDGRYLI-SNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSP  216 (303)
Q Consensus       138 ~~~~~l~-s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  216 (303)
                      |||+.|+ +.+.++...||.+......                                 ...+..+..      ....+
T Consensus       249 PDG~~la~~~~~~g~~~Iy~~d~~~~~---------------------------------~~~lt~~~~------~~~~~  289 (427)
T PRK02889        249 PDGRTLAVALSRDGNSQIYTVNADGSG---------------------------------LRRLTQSSG------IDTEP  289 (427)
T ss_pred             CCCCEEEEEEccCCCceEEEEECCCCC---------------------------------cEECCCCCC------CCcCe
Confidence            9998876 6778888888875421000                                 000000000      01134


Q ss_pred             eeeCCCeEEEEEeC-CCeEEEE--ECCCCeEEEEeecCCCCeEEEEECCCCCeEEEEeCCC---CEEEeecCCC
Q 022074          217 VYSTGQKYIYTGSH-DSCVYVY--DLVSGEQVAALKYHTSPVRDCSWHPSQPMLVSSSWDG---DVVRWEFPGN  284 (303)
Q Consensus       217 ~~s~~~~~latg~~-dg~i~iw--d~~~~~~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg---~i~~Wd~~~~  284 (303)
                      .|+|||+.|+..+. +|...+|  +..+++. ..+..+.......+|||||++|+..+.++   .|.+||+...
T Consensus       290 ~wSpDG~~l~f~s~~~g~~~Iy~~~~~~g~~-~~lt~~g~~~~~~~~SpDG~~Ia~~s~~~g~~~I~v~d~~~g  362 (427)
T PRK02889        290 FFSPDGRSIYFTSDRGGAPQIYRMPASGGAA-QRVTFTGSYNTSPRISPDGKLLAYISRVGGAFKLYVQDLATG  362 (427)
T ss_pred             EEcCCCCEEEEEecCCCCcEEEEEECCCCce-EEEecCCCCcCceEECCCCCEEEEEEccCCcEEEEEEECCCC
Confidence            58899998876654 4555555  5445443 22222223345679999999998777654   5999998654


No 243
>KOG1354 consensus Serine/threonine protein phosphatase 2A, regulatory subunit [Signal transduction mechanisms]
Probab=98.90  E-value=5.1e-08  Score=83.26  Aligned_cols=215  Identities=17%  Similarity=0.304  Sum_probs=130.7

Q ss_pred             CCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEE---Eeccc-----CCeEEEEEccCCCcEEEEecCCCeEE
Q 022074           36 GGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLR---ILAHT-----SDVNTVCFGDESGHLIYSGSDDNLCK  107 (303)
Q Consensus        36 ~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~---~~~h~-----~~v~~l~~~~~~~~~l~s~s~dg~v~  107 (303)
                      ++|..-|.+|+++.|++.++++ .|=.|.||.+.--.....   +.+++     ..|++..|+|..-++|+-.+..|+||
T Consensus       161 NaHtyhiNSIS~NsD~Et~lSA-DdLRINLWnlei~d~sFnIVDIKP~nmEeLteVITsaEFhp~~cn~f~YSSSKGtIr  239 (433)
T KOG1354|consen  161 NAHTYHINSISVNSDKETFLSA-DDLRINLWNLEIIDQSFNIVDIKPANMEELTEVITSAEFHPHHCNVFVYSSSKGTIR  239 (433)
T ss_pred             ccceeEeeeeeecCccceEeec-cceeeeeccccccCCceeEEEccccCHHHHHHHHhhhccCHhHccEEEEecCCCcEE
Confidence            5999999999999999999987 455699998864332222   23333     35778889887677888888899999


Q ss_pred             EEcCccccCCCcccee------------ecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEccccc-CCcccccCcccee
Q 022074          108 VWDRRCLNVKGKPAGV------------LMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMS-SNASCNLGFRSYE  174 (303)
Q Consensus       108 lWd~~~~~~~~~~~~~------------~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~-~~~~~~~~~~~~~  174 (303)
                      |-|.|....-......            +.+-..+|..+.|+++|+|+++=.. -+|++||+.... +......      
T Consensus       240 LcDmR~~aLCd~hsKlfEepedp~~rsffseiIsSISDvKFs~sGryilsRDy-ltvk~wD~nme~~pv~t~~v------  312 (433)
T KOG1354|consen  240 LCDMRQSALCDAHSKLFEEPEDPSSRSFFSEIISSISDVKFSHSGRYILSRDY-LTVKLWDLNMEAKPVETYPV------  312 (433)
T ss_pred             EeechhhhhhcchhhhhccccCCcchhhHHHHhhhhhceEEccCCcEEEEecc-ceeEEEeccccCCcceEEee------
Confidence            9998732111111111            1222346778899999999998754 799999985421 1111000      


Q ss_pred             eeceeeeCCCCCccccCCCC-CcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeE-EEEee---
Q 022074          175 WDYRWMDYPPQARDLKHPCD-QSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQ-VAALK---  249 (303)
Q Consensus       175 ~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~-~~~~~---  249 (303)
                                      +... ..++.+-...    ++-..|.-.++.+++++.||+.....++++...|.. ..+++   
T Consensus       313 ----------------h~~lr~kLc~lYEnD----~IfdKFec~~sg~~~~v~TGsy~n~frvf~~~~gsk~d~tl~asr  372 (433)
T KOG1354|consen  313 ----------------HEYLRSKLCSLYEND----AIFDKFECSWSGNDSYVMTGSYNNVFRVFNLARGSKEDFTLEASR  372 (433)
T ss_pred             ----------------hHhHHHHHHHHhhcc----chhheeEEEEcCCcceEecccccceEEEecCCCCcceeecccccc
Confidence                            0000 0011100000    111112223566788999999999999999654421 11110   


Q ss_pred             -----------------cC-------------CCCeEEEEECCCCCeEEEEeCCCCEEEe
Q 022074          250 -----------------YH-------------TSPVRDCSWHPSQPMLVSSSWDGDVVRW  279 (303)
Q Consensus       250 -----------------~h-------------~~~I~~v~~sp~~~~las~s~Dg~i~~W  279 (303)
                                       +-             ...|...+|+|..+.+|.|..+ .+.++
T Consensus       373 ~~~~~~~~~k~~~V~~~g~r~~~~~~vd~ldf~kkilh~aWhp~en~ia~aatn-nlyif  431 (433)
T KOG1354|consen  373 KNMKPRKVLKLRLVSSSGKRKRDEISVDALDFRKKILHTAWHPKENSIAVAATN-NLYIF  431 (433)
T ss_pred             cCCcccccccceeeecCCCccccccccchhhhhhHHHhhccCCccceeeeeecC-ceEEe
Confidence                             00             1235567799999988888764 44443


No 244
>PF02239 Cytochrom_D1:  Cytochrome D1 heme domain; PDB: 1NNO_B 1HZU_A 1N15_B 1N50_A 1GJQ_A 1BL9_B 1NIR_B 1N90_B 1HZV_A 1AOQ_A ....
Probab=98.90  E-value=9.8e-07  Score=79.55  Aligned_cols=256  Identities=16%  Similarity=0.124  Sum_probs=130.7

Q ss_pred             hccccccccccCcCcccccCCCcccceEEEEEcCCCCEEEEee-CCCeEEEEECCCCceEEEEecc-------cCCeEEE
Q 022074           16 ESLANVTEIHDGLDFSAADDGGYSFGIFSLKFSTDGRELVAGS-SDDCIYVYDLEANKLSLRILAH-------TSDVNTV   87 (303)
Q Consensus        16 ~~~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~s~~g~~l~sgs-~Dg~v~lwd~~~~~~~~~~~~h-------~~~v~~l   87 (303)
                      |..++|++..++.-- .....|-  .-.++++|+||++++++. .++++.++|.++.+....+...       ...+..+
T Consensus        57 dg~vsviD~~~~~~v-~~i~~G~--~~~~i~~s~DG~~~~v~n~~~~~v~v~D~~tle~v~~I~~~~~~~~~~~~Rv~aI  133 (369)
T PF02239_consen   57 DGTVSVIDLATGKVV-ATIKVGG--NPRGIAVSPDGKYVYVANYEPGTVSVIDAETLEPVKTIPTGGMPVDGPESRVAAI  133 (369)
T ss_dssp             TSEEEEEETTSSSEE-EEEE-SS--EEEEEEE--TTTEEEEEEEETTEEEEEETTT--EEEEEE--EE-TTTS---EEEE
T ss_pred             CCeEEEEECCcccEE-EEEecCC--CcceEEEcCCCCEEEEEecCCCceeEeccccccceeecccccccccccCCCceeE
Confidence            455677777666522 2222322  357899999999998876 5889999999998877665432       2356777


Q ss_pred             EEccCCCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEE-EeCCCcEEEEEcccccCCccc
Q 022074           88 CFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLIS-NGKDQAIKLWDIRKMSSNASC  166 (303)
Q Consensus        88 ~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s-~~~D~~v~lWdl~~~~~~~~~  166 (303)
                      ..++....++++--..+.|-+-|....   ..................+.++++|++. ......+-++|+...+.....
T Consensus       134 v~s~~~~~fVv~lkd~~~I~vVdy~d~---~~~~~~~i~~g~~~~D~~~dpdgry~~va~~~sn~i~viD~~~~k~v~~i  210 (369)
T PF02239_consen  134 VASPGRPEFVVNLKDTGEIWVVDYSDP---KNLKVTTIKVGRFPHDGGFDPDGRYFLVAANGSNKIAVIDTKTGKLVALI  210 (369)
T ss_dssp             EE-SSSSEEEEEETTTTEEEEEETTTS---SCEEEEEEE--TTEEEEEE-TTSSEEEEEEGGGTEEEEEETTTTEEEEEE
T ss_pred             EecCCCCEEEEEEccCCeEEEEEeccc---cccceeeecccccccccccCcccceeeecccccceeEEEeeccceEEEEe
Confidence            666654445555555577776675421   1111112222234556778999998765 456778889998764332221


Q ss_pred             ccCccce--------------eeeceeeeCC---CCCc-ccc-CC--CCCcceEEecccceeeeEEEeeeeeeeCCCeEE
Q 022074          167 NLGFRSY--------------EWDYRWMDYP---PQAR-DLK-HP--CDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYI  225 (303)
Q Consensus       167 ~~~~~~~--------------~~~~~~~~~~---~~~~-~~~-~~--~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~l  225 (303)
                      ..+....              .|........   .-.. ... +.  ....+..+.....       -....-+|+++++
T Consensus       211 ~~g~~p~~~~~~~~php~~g~vw~~~~~~~~~~~~ig~~~v~v~d~~~wkvv~~I~~~G~-------glFi~thP~s~~v  283 (369)
T PF02239_consen  211 DTGKKPHPGPGANFPHPGFGPVWATSGLGYFAIPLIGTDPVSVHDDYAWKVVKTIPTQGG-------GLFIKTHPDSRYV  283 (369)
T ss_dssp             E-SSSBEETTEEEEEETTTEEEEEEEBSSSSEEEEEE--TTT-STTTBTSEEEEEE-SSS-------S--EE--TT-SEE
T ss_pred             eccccccccccccccCCCcceEEeeccccceecccccCCccccchhhcCeEEEEEECCCC-------cceeecCCCCccE
Confidence            1111000              0111100000   0000 000 00  0001111111000       0011237889998


Q ss_pred             EEE----eCCCeEEEEECCCCeEEEEeecC-CCCeEEEEECCCCCeEEEEeCC--CCEEEeecCCC
Q 022074          226 YTG----SHDSCVYVYDLVSGEQVAALKYH-TSPVRDCSWHPSQPMLVSSSWD--GDVVRWEFPGN  284 (303)
Q Consensus       226 atg----~~dg~i~iwd~~~~~~~~~~~~h-~~~I~~v~~sp~~~~las~s~D--g~i~~Wd~~~~  284 (303)
                      ...    ..+++|.++|.++.+.+..+... ..++..+.|++||+++-.+..+  +.|.++|.+.-
T Consensus       284 wvd~~~~~~~~~v~viD~~tl~~~~~i~~~~~~~~~h~ef~~dG~~v~vS~~~~~~~i~v~D~~Tl  349 (369)
T PF02239_consen  284 WVDTFLNPDADTVQVIDKKTLKVVKTITPGPGKRVVHMEFNPDGKEVWVSVWDGNGAIVVYDAKTL  349 (369)
T ss_dssp             EEE-TT-SSHT-EEEEECCGTEEEE-HHHHHT--EEEEEE-TTSSEEEEEEE--TTEEEEEETTTT
T ss_pred             EeeccCCCCCceEEEEECcCcceeEEEeccCCCcEeccEECCCCCEEEEEEecCCCEEEEEECCCc
Confidence            888    45689999999999887777532 2369999999999954444333  36999997643


No 245
>KOG1064 consensus RAVE (regulator of V-ATPase assembly) complex subunit RAV1/DMX protein, WD repeat superfamily [General function prediction only]
Probab=98.90  E-value=7.7e-09  Score=103.87  Aligned_cols=186  Identities=16%  Similarity=0.270  Sum_probs=133.4

Q ss_pred             ceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEec-ccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCc
Q 022074           41 GIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILA-HTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGK  119 (303)
Q Consensus        41 ~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~-h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~  119 (303)
                      .|.++.=+|.-.+.++|+.||.|++|.-..++.+..+.. -...|+.+.|+. +|+.+..+..||.+.+|-..     .+
T Consensus      2210 ~v~r~~sHp~~~~Yltgs~dgsv~~~~w~~~~~v~~~rt~g~s~vtr~~f~~-qGnk~~i~d~dg~l~l~q~~-----pk 2283 (2439)
T KOG1064|consen 2210 NVRRMTSHPSDPYYLTGSQDGSVRMFEWGHGQQVVCFRTAGNSRVTRSRFNH-QGNKFGIVDGDGDLSLWQAS-----PK 2283 (2439)
T ss_pred             ceeeecCCCCCceEEecCCCceEEEEeccCCCeEEEeeccCcchhhhhhhcc-cCCceeeeccCCceeecccC-----Cc
Confidence            678888888888999999999999999877766544332 237888889975 47788889999999999753     34


Q ss_pred             cceeecccccCeEEEEeCCCCCEEEEEe---CCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCc
Q 022074          120 PAGVLMGHLEGITFIDSRGDGRYLISNG---KDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQS  196 (303)
Q Consensus       120 ~~~~~~~h~~~v~~~~~~~~~~~l~s~~---~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  196 (303)
                      +....+.|..+.+.+.|-.  ..+++++   .++.+.+||.-.....                        .       .
T Consensus      2284 ~~~s~qchnk~~~Df~Fi~--s~~~tag~s~d~~n~~lwDtl~~~~~------------------------s-------~ 2330 (2439)
T KOG1064|consen 2284 PYTSWQCHNKALSDFRFIG--SLLATAGRSSDNRNVCLWDTLLPPMN------------------------S-------L 2330 (2439)
T ss_pred             ceeccccCCccccceeeee--hhhhccccCCCCCcccchhcccCccc------------------------c-------e
Confidence            4445566777666655543  4677765   4789999996421100                        0       0


Q ss_pred             ceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEECCCCCeEEEEeCCCCE
Q 022074          197 VATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQPMLVSSSWDGDV  276 (303)
Q Consensus       197 ~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i  276 (303)
                      +-  +.|..-.+      ...|-|..++|++||-+|.|++||++.+++..++..         +. ...++++++..|.+
T Consensus      2331 v~--~~H~~gaT------~l~~~P~~qllisggr~G~v~l~D~rqrql~h~~~~---------~~-~~~~f~~~ss~g~i 2392 (2439)
T KOG1064|consen 2331 VH--TCHDGGAT------VLAYAPKHQLLISGGRKGEVCLFDIRQRQLRHTFQA---------LD-TREYFVTGSSEGNI 2392 (2439)
T ss_pred             ee--eecCCCce------EEEEcCcceEEEecCCcCcEEEeehHHHHHHHHhhh---------hh-hhheeeccCcccce
Confidence            10  11111111      123567889999999999999999999887766543         44 56789999999999


Q ss_pred             EEeecCC
Q 022074          277 VRWEFPG  283 (303)
Q Consensus       277 ~~Wd~~~  283 (303)
                      ++|++..
T Consensus      2393 kIw~~s~ 2399 (2439)
T KOG1064|consen 2393 KIWRLSE 2399 (2439)
T ss_pred             EEEEccc
Confidence            9999764


No 246
>KOG4497 consensus Uncharacterized conserved protein WDR8, contains WD repeats [General function prediction only]
Probab=98.88  E-value=4.2e-07  Score=77.40  Aligned_cols=243  Identities=15%  Similarity=0.234  Sum_probs=141.1

Q ss_pred             eEEEEEcCCCCEEEEe-eCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCcccc-----
Q 022074           42 IFSLKFSTDGRELVAG-SSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLN-----  115 (303)
Q Consensus        42 v~~l~~s~~g~~l~sg-s~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~-----  115 (303)
                      |.-|.|..|..+++++ ..|+.|.+|++.......++..-..++...+|+|+..+.|.+...+-.+.+|.+....     
T Consensus        51 i~yieW~ads~~ilC~~yk~~~vqvwsl~Qpew~ckIdeg~agls~~~WSPdgrhiL~tseF~lriTVWSL~t~~~~~~~  130 (447)
T KOG4497|consen   51 IVYIEWKADSCHILCVAYKDPKVQVWSLVQPEWYCKIDEGQAGLSSISWSPDGRHILLTSEFDLRITVWSLNTQKGYLLP  130 (447)
T ss_pred             hhheeeeccceeeeeeeeccceEEEEEeecceeEEEeccCCCcceeeeECCCcceEeeeecceeEEEEEEeccceeEEec
Confidence            5567888888776664 6788999999988877777877778999999988766788888889999999753100     


Q ss_pred             ------------CCCc---------------------------------------------cceee---------cccc-
Q 022074          116 ------------VKGK---------------------------------------------PAGVL---------MGHL-  128 (303)
Q Consensus       116 ------------~~~~---------------------------------------------~~~~~---------~~h~-  128 (303)
                                  ..++                                             .....         .=|. 
T Consensus       131 ~pK~~~kg~~f~~dg~f~ai~sRrDCkdyv~i~~c~~W~ll~~f~~dT~DltgieWsPdg~~laVwd~~Leykv~aYe~~  210 (447)
T KOG4497|consen  131 HPKTNVKGYAFHPDGQFCAILSRRDCKDYVQISSCKAWILLKEFKLDTIDLTGIEWSPDGNWLAVWDNVLEYKVYAYERG  210 (447)
T ss_pred             ccccCceeEEECCCCceeeeeecccHHHHHHHHhhHHHHHHHhcCCCcccccCceECCCCcEEEEecchhhheeeeeeec
Confidence                        0000                                             00000         0011 


Q ss_pred             cCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCccc----c-----------cCccceeeeceeeeCCCCCccccC-C
Q 022074          129 EGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASC----N-----------LGFRSYEWDYRWMDYPPQARDLKH-P  192 (303)
Q Consensus       129 ~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~----~-----------~~~~~~~~~~~~~~~~~~~~~~~~-~  192 (303)
                      -++..+.++|.+++|+.|+.|+.+|+-+.-.-+....+    .           ..+.........+.++|..-.... .
T Consensus       211 lG~k~v~wsP~~qflavGsyD~~lrvlnh~tWk~f~eflhl~s~~dp~~~~~~ke~~~~~ql~~~cLsf~p~~~~a~~~~  290 (447)
T KOG4497|consen  211 LGLKFVEWSPCNQFLAVGSYDQMLRVLNHFTWKPFGEFLHLCSYHDPTLHLLEKETFSIVQLLHHCLSFTPTDLEAHIWE  290 (447)
T ss_pred             cceeEEEeccccceEEeeccchhhhhhceeeeeehhhhccchhccCchhhhhhhhhcchhhhcccccccCCCccccCccc
Confidence            24566778888889999999999988653221111000    0           000000000001111111000000 0


Q ss_pred             CC----------CcceEEecccceeeeEEEeeeeeeeCCCeEEEEEe--CCCeEEEEECCCCeEEEEeecCCCCeEEEEE
Q 022074          193 CD----------QSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGS--HDSCVYVYDLVSGEQVAALKYHTSPVRDCSW  260 (303)
Q Consensus       193 ~~----------~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~--~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~  260 (303)
                      ..          ..+..++.-.....-....-...||+|..+++|-.  .-+.+.+||+.+.+.-..+ ....||....|
T Consensus       291 ~se~~YE~~~~pv~~~~lkp~tD~pnPk~g~g~lafs~Ds~y~aTrnd~~PnalW~Wdlq~l~l~avL-iQk~piraf~W  369 (447)
T KOG4497|consen  291 ESETIYEQQMTPVKVHKLKPPTDFPNPKCGAGKLAFSCDSTYAATRNDKYPNALWLWDLQNLKLHAVL-IQKHPIRAFEW  369 (447)
T ss_pred             cchhhhhhhhcceeeecccCCCCCCCcccccceeeecCCceEEeeecCCCCceEEEEechhhhhhhhh-hhccceeEEEe
Confidence            00          00000000000000000111235889999999864  4578999999886654444 35569999999


Q ss_pred             CCCCCeEEEEeCCCCEEEeecCCCC
Q 022074          261 HPSQPMLVSSSWDGDVVRWEFPGNG  285 (303)
Q Consensus       261 sp~~~~las~s~Dg~i~~Wd~~~~~  285 (303)
                      +|..+.|+.....-.+.+|-+.+.+
T Consensus       370 dP~~prL~vctg~srLY~W~psg~~  394 (447)
T KOG4497|consen  370 DPGRPRLVVCTGKSRLYFWAPSGPR  394 (447)
T ss_pred             CCCCceEEEEcCCceEEEEcCCCce
Confidence            9999877777777789999987653


No 247
>PRK04922 tolB translocation protein TolB; Provisional
Probab=98.88  E-value=1.9e-07  Score=86.35  Aligned_cols=172  Identities=21%  Similarity=0.198  Sum_probs=104.0

Q ss_pred             ceEEEEEcCCCCEEE-EeeCCC--eEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEE-EecCCCe--EEEEcCccc
Q 022074           41 GIFSLKFSTDGRELV-AGSSDD--CIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIY-SGSDDNL--CKVWDRRCL  114 (303)
Q Consensus        41 ~v~~l~~s~~g~~l~-sgs~Dg--~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~-s~s~dg~--v~lWd~~~~  114 (303)
                      ...+..|+|||+.++ +.+.+|  .|++||+.++.. .++..+........|+++ ++.++ +...+|.  +.++|+.. 
T Consensus       249 ~~~~~~~SpDG~~l~~~~s~~g~~~Iy~~d~~~g~~-~~lt~~~~~~~~~~~spD-G~~l~f~sd~~g~~~iy~~dl~~-  325 (433)
T PRK04922        249 INGAPSFSPDGRRLALTLSRDGNPEIYVMDLGSRQL-TRLTNHFGIDTEPTWAPD-GKSIYFTSDRGGRPQIYRVAASG-  325 (433)
T ss_pred             CccCceECCCCCEEEEEEeCCCCceEEEEECCCCCe-EECccCCCCccceEECCC-CCEEEEEECCCCCceEEEEECCC-
Confidence            345789999999775 445555  599999988864 345555555567889765 55555 4445555  55555432 


Q ss_pred             cCCCccceeecccccCeEEEEeCCCCCEEEEEeCCC---cEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccC
Q 022074          115 NVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQ---AIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKH  191 (303)
Q Consensus       115 ~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~---~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  191 (303)
                         +. ...+..+.......+++|+|++++..+.++   .|.+||+......                            
T Consensus       326 ---g~-~~~lt~~g~~~~~~~~SpDG~~Ia~~~~~~~~~~I~v~d~~~g~~~----------------------------  373 (433)
T PRK04922        326 ---GS-AERLTFQGNYNARASVSPDGKKIAMVHGSGGQYRIAVMDLSTGSVR----------------------------  373 (433)
T ss_pred             ---CC-eEEeecCCCCccCEEECCCCCEEEEEECCCCceeEEEEECCCCCeE----------------------------
Confidence               11 222222223344578999999987665433   5888887532110                            


Q ss_pred             CCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeC---CCeEEEEECCCCeEEEEeecCCCCeEEEEECC
Q 022074          192 PCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSH---DSCVYVYDLVSGEQVAALKYHTSPVRDCSWHP  262 (303)
Q Consensus       192 ~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~---dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp  262 (303)
                             .+....       ....|.|+|++++++..+.   .+.|++++... .....+..+.+.+...+|||
T Consensus       374 -------~Lt~~~-------~~~~p~~spdG~~i~~~s~~~g~~~L~~~~~~g-~~~~~l~~~~g~~~~p~wsp  432 (433)
T PRK04922        374 -------TLTPGS-------LDESPSFAPNGSMVLYATREGGRGVLAAVSTDG-RVRQRLVSADGEVREPAWSP  432 (433)
T ss_pred             -------ECCCCC-------CCCCceECCCCCEEEEEEecCCceEEEEEECCC-CceEEcccCCCCCCCCccCC
Confidence                   000000       0013457889988777665   34688888854 44555666667788889987


No 248
>KOG0309 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=98.84  E-value=1.2e-08  Score=94.76  Aligned_cols=215  Identities=20%  Similarity=0.274  Sum_probs=134.5

Q ss_pred             CCcccceEEEEEcCCCC-EEEEeeCCCeEEEEECCCCceEE-EEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCcc
Q 022074           36 GGYSFGIFSLKFSTDGR-ELVAGSSDDCIYVYDLEANKLSL-RILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRC  113 (303)
Q Consensus        36 ~~~~~~v~~l~~s~~g~-~l~sgs~Dg~v~lwd~~~~~~~~-~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~  113 (303)
                      -||+.+|..+.|+|+.. .+++++.|-.+..||+..-.... .+..-..+-..|+|+-.+++.+++ +....|++||.+.
T Consensus       111 hghsraitd~n~~~q~pdVlatcsvdt~vh~wd~rSp~~p~ys~~~w~s~asqVkwnyk~p~vlas-shg~~i~vwd~r~  189 (1081)
T KOG0309|consen  111 HGHSRAITDINFNPQHPDVLATCSVDTYVHAWDMRSPHRPFYSTSSWRSAASQVKWNYKDPNVLAS-SHGNDIFVWDLRK  189 (1081)
T ss_pred             ecCccceeccccCCCCCcceeeccccccceeeeccCCCcceeeeecccccCceeeecccCcchhhh-ccCCceEEEeccC
Confidence            49999999999998654 78899999999999998765432 222222455788897677777654 4456899999874


Q ss_pred             ccCCCccceeecccccCeEEEEeCCC-CCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCC
Q 022074          114 LNVKGKPAGVLMGHLEGITFIDSRGD-GRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHP  192 (303)
Q Consensus       114 ~~~~~~~~~~~~~h~~~v~~~~~~~~-~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  192 (303)
                      .   ..+...+.+|...|..++|..- -..+.+.+.|++|+.||-.+..........-....|--+.+   |..      
T Consensus       190 g---s~pl~s~K~~vs~vn~~~fnr~~~s~~~s~~~d~tvkfw~y~kSt~e~~~~vtt~~piw~~r~~---Pfg------  257 (1081)
T KOG0309|consen  190 G---STPLCSLKGHVSSVNSIDFNRFKYSEIMSSSNDGTVKFWDYSKSTTESKRTVTTNFPIWRGRYL---PFG------  257 (1081)
T ss_pred             C---CcceEEecccceeeehHHHhhhhhhhhcccCCCCceeeecccccccccceeccccCcceecccc---ccC------
Confidence            3   3567788889999998887652 34689999999999999765432211110000000000000   000      


Q ss_pred             CCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCC-eEEEEeecCCCCeEEEEECCCCC------
Q 022074          193 CDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSG-EQVAALKYHTSPVRDCSWHPSQP------  265 (303)
Q Consensus       193 ~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~-~~~~~~~~h~~~I~~v~~sp~~~------  265 (303)
                      ...++..                   ..++..+..---+..-..|+..++ ..+.+|.+|.+.|.+.-|-..+.      
T Consensus       258 ~g~~~mp-------------------~~G~n~v~~~~c~n~d~e~n~~~~~~pVh~F~GH~D~V~eFlWR~r~e~~~d~d  318 (1081)
T KOG0309|consen  258 EGYCIMP-------------------MVGGNMVPQLRCENSDLEWNVFDLNTPVHTFVGHDDVVLEFLWRKRKECDGDYD  318 (1081)
T ss_pred             ceeEecc-------------------ccCCeeeeeccccchhhhhccccCCcceeeecCcchHHHHHhhhhcccccCCCC
Confidence            0000000                   001111111111222345555544 46889999999888777754322      


Q ss_pred             ----eEEEEeCCCCEEEeecC
Q 022074          266 ----MLVSSSWDGDVVRWEFP  282 (303)
Q Consensus       266 ----~las~s~Dg~i~~Wd~~  282 (303)
                          +|+|-|-|..+++|.+.
T Consensus       319 ~rdfQLVTWSkD~~lrlWpI~  339 (1081)
T KOG0309|consen  319 SRDFQLVTWSKDQTLRLWPID  339 (1081)
T ss_pred             ccceeEEEeecCCceEeeecc
Confidence                89999999999999875


No 249
>KOG2139 consensus WD40 repeat protein [General function prediction only]
Probab=98.84  E-value=1.5e-07  Score=81.03  Aligned_cols=166  Identities=16%  Similarity=0.264  Sum_probs=105.1

Q ss_pred             CeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCccceeecc-cccCeEEEEeCCCCCEEEEEeCCCcEEEEEccccc
Q 022074           83 DVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMG-HLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMS  161 (303)
Q Consensus        83 ~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~-h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~  161 (303)
                      .+..++|++ .-..|+++..|..|++||...     .....+.. -...|++++|.|.+..-++.+--+.|.+|......
T Consensus       100 dlr~~aWhq-H~~~fava~nddvVriy~kss-----t~pt~Lks~sQrnvtclawRPlsaselavgCr~gIciW~~s~tl  173 (445)
T KOG2139|consen  100 DLRGVAWHQ-HIIAFAVATNDDVVRIYDKSS-----TCPTKLKSVSQRNVTCLAWRPLSASELAVGCRAGICIWSDSRTL  173 (445)
T ss_pred             ceeeEeech-hhhhhhhhccCcEEEEeccCC-----CCCceecchhhcceeEEEeccCCcceeeeeecceeEEEEcCccc
Confidence            567888964 455688999999999999653     11222221 23579999999865444444445789999765321


Q ss_pred             CCcccccCccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEe-CCCeEEEEECC
Q 022074          162 SNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGS-HDSCVYVYDLV  240 (303)
Q Consensus       162 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~-~dg~i~iwd~~  240 (303)
                      ...      +.+    +          +....-..+....||.. .+.+      .+.+||..+++++ .|..|.|||..
T Consensus       174 n~~------r~~----~----------~~s~~~~qvl~~pgh~p-Vtsm------qwn~dgt~l~tAS~gsssi~iWdpd  226 (445)
T KOG2139|consen  174 NAN------RNI----R----------MMSTHHLQVLQDPGHNP-VTSM------QWNEDGTILVTASFGSSSIMIWDPD  226 (445)
T ss_pred             ccc------ccc----c----------cccccchhheeCCCCce-eeEE------EEcCCCCEEeecccCcceEEEEcCC
Confidence            100      000    0          00000001222334421 2222      2456788899987 67889999999


Q ss_pred             CCeEEEEeecCCCCeEEEEECCCCCeEEEEeCCCCEEEeec
Q 022074          241 SGEQVAALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEF  281 (303)
Q Consensus       241 ~~~~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~  281 (303)
                      ++.++--..---+.+.-+.||||+.+|+++.-|++.++|+.
T Consensus       227 tg~~~pL~~~glgg~slLkwSPdgd~lfaAt~davfrlw~e  267 (445)
T KOG2139|consen  227 TGQKIPLIPKGLGGFSLLKWSPDGDVLFAATCDAVFRLWQE  267 (445)
T ss_pred             CCCcccccccCCCceeeEEEcCCCCEEEEecccceeeeehh
Confidence            98764333223356889999999999999999999999954


No 250
>KOG1409 consensus Uncharacterized conserved protein, contains WD40 repeats and FYVE domains [Function unknown]
Probab=98.83  E-value=4.4e-08  Score=83.65  Aligned_cols=82  Identities=26%  Similarity=0.419  Sum_probs=66.4

Q ss_pred             EEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEE
Q 022074           74 SLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIK  153 (303)
Q Consensus        74 ~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~  153 (303)
                      ..++.+|.+.+.+++|. +...+|++|..|..|.+||+...   ......+.+|.+.|..+..-+--..+.+++.|+.|-
T Consensus       190 i~~~~~h~~~~~~l~Wd-~~~~~LfSg~~d~~vi~wdigg~---~g~~~el~gh~~kV~~l~~~~~t~~l~S~~edg~i~  265 (404)
T KOG1409|consen  190 ITTFNGHTGEVTCLKWD-PGQRLLFSGASDHSVIMWDIGGR---KGTAYELQGHNDKVQALSYAQHTRQLISCGEDGGIV  265 (404)
T ss_pred             EEEEcCcccceEEEEEc-CCCcEEEeccccCceEEEeccCC---cceeeeeccchhhhhhhhhhhhheeeeeccCCCeEE
Confidence            34567899999999995 45678999999999999997522   223456789999998877666677899999999999


Q ss_pred             EEEccc
Q 022074          154 LWDIRK  159 (303)
Q Consensus       154 lWdl~~  159 (303)
                      +||+..
T Consensus       266 ~w~mn~  271 (404)
T KOG1409|consen  266 VWNMNV  271 (404)
T ss_pred             EEeccc
Confidence            999864


No 251
>COG5170 CDC55 Serine/threonine protein phosphatase 2A, regulatory subunit [Signal transduction mechanisms]
Probab=98.82  E-value=4.6e-08  Score=82.61  Aligned_cols=210  Identities=19%  Similarity=0.288  Sum_probs=127.3

Q ss_pred             cceEEEEEcCCCCEEEEeeCCCeEEEEECCCCc--e---EEEEeccc------------CCeEEEEEccC-CCcEEEEec
Q 022074           40 FGIFSLKFSTDGRELVAGSSDDCIYVYDLEANK--L---SLRILAHT------------SDVNTVCFGDE-SGHLIYSGS  101 (303)
Q Consensus        40 ~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~--~---~~~~~~h~------------~~v~~l~~~~~-~~~~l~s~s  101 (303)
                      --|.++.|...|.++++|...|.|.+|+-....  .   ...++.|.            ..|+.+.|..+ ....|+-.+
T Consensus        27 d~ItaVefd~tg~YlatGDkgGRVvlfer~~s~~ceykf~teFQshe~EFDYLkSleieEKin~I~w~~~t~r~hFLlst  106 (460)
T COG5170          27 DKITAVEFDETGLYLATGDKGGRVVLFEREKSYGCEYKFFTEFQSHELEFDYLKSLEIEEKINAIEWFDDTGRNHFLLST  106 (460)
T ss_pred             ceeeEEEeccccceEeecCCCceEEEeecccccccchhhhhhhcccccchhhhhhccHHHHhhheeeecCCCcceEEEec
Confidence            348899999999999999999999999765432  1   12244553            35788888543 345677788


Q ss_pred             CCCeEEEEcCccccC---------------CCc-----------------------cceee-cccccCeEEEEeCCCCCE
Q 022074          102 DDNLCKVWDRRCLNV---------------KGK-----------------------PAGVL-MGHLEGITFIDSRGDGRY  142 (303)
Q Consensus       102 ~dg~v~lWd~~~~~~---------------~~~-----------------------~~~~~-~~h~~~v~~~~~~~~~~~  142 (303)
                      .|.++++|.++..+.               .+.                       +.+.. ..|.--+.++++..|.+.
T Consensus       107 NdktiKlWKiyeknlk~va~nnls~~~~~~~~g~~~s~~~l~lprls~hd~iiaa~p~rvyaNaH~yhiNSiS~NsD~et  186 (460)
T COG5170         107 NDKTIKLWKIYEKNLKVVAENNLSDSFHSPMGGPLTSTKELLLPRLSEHDEIIAAKPCRVYANAHPYHINSISFNSDKET  186 (460)
T ss_pred             CCceeeeeeeecccchhhhccccccccccccCCCcCCHHHhhcccccccceEEEeccceeccccceeEeeeeeecCchhe
Confidence            899999998642100               000                       11111 235555778888887776


Q ss_pred             EEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCC-
Q 022074          143 LISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTG-  221 (303)
Q Consensus       143 l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~-  221 (303)
                      ++++ .|=.|.+|++......+..             .+.                  +.+.. ..+.....+..|+|. 
T Consensus       187 ~lSa-DdLrINLWnl~i~D~sFnI-------------VDi------------------KP~nm-eeLteVItSaeFhp~~  233 (460)
T COG5170         187 LLSA-DDLRINLWNLEIIDGSFNI-------------VDI------------------KPHNM-EELTEVITSAEFHPEM  233 (460)
T ss_pred             eeec-cceeeeeccccccCCceEE-------------Eec------------------cCccH-HHHHHHHhhcccCHhH
Confidence            6665 6788999998653321110             000                  00000 000001112223332 


Q ss_pred             CeEEEEEeCCCeEEEEECCCCe------EEEEe----------ecCCCCeEEEEECCCCCeEEEEeCCCCEEEeecCC
Q 022074          222 QKYIYTGSHDSCVYVYDLVSGE------QVAAL----------KYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFPG  283 (303)
Q Consensus       222 ~~~latg~~dg~i~iwd~~~~~------~~~~~----------~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~~  283 (303)
                      ...+.-.++.|.|++.|+++..      ++...          ++-...|.++.|+++|+++++-+. -++++||+..
T Consensus       234 cn~fmYSsSkG~Ikl~DlRq~alcdn~~klfe~~~D~v~~~ff~eivsSISD~kFs~ngryIlsRdy-ltvkiwDvnm  310 (460)
T COG5170         234 CNVFMYSSSKGEIKLNDLRQSALCDNSKKLFELTIDGVDVDFFEEIVSSISDFKFSDNGRYILSRDY-LTVKIWDVNM  310 (460)
T ss_pred             cceEEEecCCCcEEehhhhhhhhccCchhhhhhccCcccchhHHHHhhhhcceEEcCCCcEEEEecc-ceEEEEeccc
Confidence            2344456678999999987432      11111          112357899999999999999875 6999999863


No 252
>KOG4714 consensus Nucleoporin [Nuclear structure]
Probab=98.81  E-value=4.8e-08  Score=80.51  Aligned_cols=63  Identities=27%  Similarity=0.491  Sum_probs=54.5

Q ss_pred             CCeEEEEEeCCCeEEEEECCCCeE-EEEeecCCCCeEEEEECCCCC-eEEEEeCCCCEEEeecCC
Q 022074          221 GQKYIYTGSHDSCVYVYDLVSGEQ-VAALKYHTSPVRDCSWHPSQP-MLVSSSWDGDVVRWEFPG  283 (303)
Q Consensus       221 ~~~~latg~~dg~i~iwd~~~~~~-~~~~~~h~~~I~~v~~sp~~~-~las~s~Dg~i~~Wd~~~  283 (303)
                      ++.++++|++||.+.+||.++... ...++.|+.+|+.+.|+|..+ .|+++++||.+..||..+
T Consensus       191 qq~~v~cgt~dg~~~l~d~rn~~~p~S~l~ahk~~i~eV~FHpk~p~~Lft~sedGslw~wdas~  255 (319)
T KOG4714|consen  191 QQHLVCCGTDDGIVGLWDARNVAMPVSLLKAHKAEIWEVHFHPKNPEHLFTCSEDGSLWHWDAST  255 (319)
T ss_pred             cccEEEEecCCCeEEEEEcccccchHHHHHHhhhhhhheeccCCCchheeEecCCCcEEEEcCCC
Confidence            456788999999999999998754 445689999999999999764 799999999999999764


No 253
>TIGR02800 propeller_TolB tol-pal system beta propeller repeat protein TolB. The Tol-PAL system is required for bacterial outer membrane integrity. E. coli TolB is involved in the tonB-independent uptake of group A colicins (colicins A, E1, E2, E3 and K), and is necessary for the colicins to reach their respective targets after initial binding to the bacteria. It is also involved in uptake of filamentous DNA. Studies of its structure suggest that the TolB protein might be involved in the recycling of peptidoglycan or in its covalent linking with lipoproteins. The Tol-Pal system is also implicated in pathogenesis of E. coli, Haemophilus ducreyi, Salmonella enterica and Vibrio cholerae, but the mechanism(s) are unclear.
Probab=98.80  E-value=1.1e-06  Score=80.61  Aligned_cols=173  Identities=17%  Similarity=0.164  Sum_probs=107.8

Q ss_pred             eEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCC---CeEEEEcCccccCCCccceeecccccCeEEEEeCC
Q 022074           62 CIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDD---NLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRG  138 (303)
Q Consensus        62 ~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~d---g~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~  138 (303)
                      .|.++|...+. ...+..+...+....|+|+ ++.++.++.+   ..|++||+...    . ...+..+...+....|+|
T Consensus       171 ~l~~~d~~g~~-~~~l~~~~~~~~~p~~Spd-g~~la~~~~~~~~~~i~v~d~~~g----~-~~~~~~~~~~~~~~~~sp  243 (417)
T TIGR02800       171 ELQVADYDGAN-PQTITRSREPILSPAWSPD-GQKLAYVSFESGKPEIYVQDLATG----Q-REKVASFPGMNGAPAFSP  243 (417)
T ss_pred             eEEEEcCCCCC-CEEeecCCCceecccCCCC-CCEEEEEEcCCCCcEEEEEECCCC----C-EEEeecCCCCccceEECC
Confidence            57788876544 3446666667888899764 6666655432   47999997522    1 122333444556678999


Q ss_pred             CCCEEE-EEeCCC--cEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeee
Q 022074          139 DGRYLI-SNGKDQ--AIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFS  215 (303)
Q Consensus       139 ~~~~l~-s~~~D~--~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  215 (303)
                      +++.|+ +.+.++  .|.+||+......                                   .+..+.      .....
T Consensus       244 Dg~~l~~~~~~~~~~~i~~~d~~~~~~~-----------------------------------~l~~~~------~~~~~  282 (417)
T TIGR02800       244 DGSKLAVSLSKDGNPDIYVMDLDGKQLT-----------------------------------RLTNGP------GIDTE  282 (417)
T ss_pred             CCCEEEEEECCCCCccEEEEECCCCCEE-----------------------------------ECCCCC------CCCCC
Confidence            998775 444444  4777776432100                                   000000      00113


Q ss_pred             eeeeCCCeEEEEEeCC---CeEEEEECCCCeEEEEeecCCCCeEEEEECCCCCeEEEEeCCC---CEEEeecCC
Q 022074          216 PVYSTGQKYIYTGSHD---SCVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQPMLVSSSWDG---DVVRWEFPG  283 (303)
Q Consensus       216 ~~~s~~~~~latg~~d---g~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg---~i~~Wd~~~  283 (303)
                      +.|+++++.|+..+..   ..|+++|..+++. ..+..+...+....|+|++++|+.++.++   .|.+||+..
T Consensus       283 ~~~s~dg~~l~~~s~~~g~~~iy~~d~~~~~~-~~l~~~~~~~~~~~~spdg~~i~~~~~~~~~~~i~~~d~~~  355 (417)
T TIGR02800       283 PSWSPDGKSIAFTSDRGGSPQIYMMDADGGEV-RRLTFRGGYNASPSWSPDGDLIAFVHREGGGFNIAVMDLDG  355 (417)
T ss_pred             EEECCCCCEEEEEECCCCCceEEEEECCCCCE-EEeecCCCCccCeEECCCCCEEEEEEccCCceEEEEEeCCC
Confidence            4567888887766542   2688889887664 34444556778899999999988888776   788888764


No 254
>KOG1275 consensus PAB-dependent poly(A) ribonuclease, subunit PAN2 [Replication, recombination and repair]
Probab=98.77  E-value=3.8e-07  Score=86.83  Aligned_cols=182  Identities=18%  Similarity=0.272  Sum_probs=126.5

Q ss_pred             CCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCccceeeccccc
Q 022074           50 DGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLE  129 (303)
Q Consensus        50 ~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~  129 (303)
                      ++..++.|+--..+..+|+.+.++........++|.-++.+   ++.+++|...|+|.+=|.+    ..+++..+..|.+
T Consensus       146 ~~~~~i~Gg~Q~~li~~Dl~~~~e~r~~~v~a~~v~imR~N---nr~lf~G~t~G~V~LrD~~----s~~~iht~~aHs~  218 (1118)
T KOG1275|consen  146 GPSTLIMGGLQEKLIHIDLNTEKETRTTNVSASGVTIMRYN---NRNLFCGDTRGTVFLRDPN----SFETIHTFDAHSG  218 (1118)
T ss_pred             CCcceeecchhhheeeeecccceeeeeeeccCCceEEEEec---CcEEEeecccceEEeecCC----cCceeeeeecccc
Confidence            35567777766678888998887654444344467777763   5789999999999999876    4456778899999


Q ss_pred             CeEEEEeCCCCCEEEEEeC---------CCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCcceEE
Q 022074          130 GITFIDSRGDGRYLISNGK---------DQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATY  200 (303)
Q Consensus       130 ~v~~~~~~~~~~~l~s~~~---------D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  200 (303)
                      .+..++.  .|++|+|+|.         |.=|++||||.++.........            .|                
T Consensus       219 siSDfDv--~GNlLitCG~S~R~~~l~~D~FvkVYDLRmmral~PI~~~~------------~P----------------  268 (1118)
T KOG1275|consen  219 SISDFDV--QGNLLITCGYSMRRYNLAMDPFVKVYDLRMMRALSPIQFPY------------GP----------------  268 (1118)
T ss_pred             ceeeeec--cCCeEEEeecccccccccccchhhhhhhhhhhccCCccccc------------Cc----------------
Confidence            9988765  6889999886         5667899999765432211100            00                


Q ss_pred             ecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCC-CeE---EEEeecCCCCeEEEEECCCCCeEEEEeCCCCE
Q 022074          201 KGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVS-GEQ---VAALKYHTSPVRDCSWHPSQPMLVSSSWDGDV  276 (303)
Q Consensus       201 ~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~-~~~---~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i  276 (303)
                         +    .++.  .|.+   ...++..+..|...+.|..+ .+.   +..+......+..+++|++++.||-+..+|.+
T Consensus       269 ---~----flrf--~Psl---~t~~~V~S~sGq~q~vd~~~lsNP~~~~~~v~p~~s~i~~fDiSsn~~alafgd~~g~v  336 (1118)
T KOG1275|consen  269 ---Q----FLRF--HPSL---TTRLAVTSQSGQFQFVDTATLSNPPAGVKMVNPNGSGISAFDISSNGDALAFGDHEGHV  336 (1118)
T ss_pred             ---h----hhhh--cccc---cceEEEEecccceeeccccccCCCccceeEEccCCCcceeEEecCCCceEEEecccCcE
Confidence               0    1111  1211   24578889999999999432 222   22223333459999999999999999999999


Q ss_pred             EEee
Q 022074          277 VRWE  280 (303)
Q Consensus       277 ~~Wd  280 (303)
                      .+|-
T Consensus       337 ~~wa  340 (1118)
T KOG1275|consen  337 NLWA  340 (1118)
T ss_pred             eeec
Confidence            9997


No 255
>KOG4497 consensus Uncharacterized conserved protein WDR8, contains WD repeats [General function prediction only]
Probab=98.77  E-value=1.4e-07  Score=80.24  Aligned_cols=207  Identities=18%  Similarity=0.232  Sum_probs=118.0

Q ss_pred             EEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCcccee
Q 022074           44 SLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGV  123 (303)
Q Consensus        44 ~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~  123 (303)
                      -++|||+|+++|+.+.- .+.|=|..+-+.. ++..--+.|.-+.|..++-..+-....++.|.+|++...    .--+.
T Consensus        13 ~c~fSp~g~yiAs~~~y-rlviRd~~tlq~~-qlf~cldki~yieW~ads~~ilC~~yk~~~vqvwsl~Qp----ew~ck   86 (447)
T KOG4497|consen   13 FCSFSPCGNYIASLSRY-RLVIRDSETLQLH-QLFLCLDKIVYIEWKADSCHILCVAYKDPKVQVWSLVQP----EWYCK   86 (447)
T ss_pred             ceeECCCCCeeeeeeee-EEEEeccchhhHH-HHHHHHHHhhheeeeccceeeeeeeeccceEEEEEeecc----eeEEE
Confidence            46899999999999877 4666666554421 112223567778886554445555678999999997521    12233


Q ss_pred             ecccccCeEEEEeCCCCC-EEEEEeCCCcEEEEEcccccCCc--ccccCccceeeeceeeeCCCCCccccC----CCCCc
Q 022074          124 LMGHLEGITFIDSRGDGR-YLISNGKDQAIKLWDIRKMSSNA--SCNLGFRSYEWDYRWMDYPPQARDLKH----PCDQS  196 (303)
Q Consensus       124 ~~~h~~~v~~~~~~~~~~-~l~s~~~D~~v~lWdl~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~  196 (303)
                      ...-.+++.++.++|+|+ .|.+...|-.|.+|.+...+..-  -.+.+.       +...+.++++....    .|.+.
T Consensus        87 Ideg~agls~~~WSPdgrhiL~tseF~lriTVWSL~t~~~~~~~~pK~~~-------kg~~f~~dg~f~ai~sRrDCkdy  159 (447)
T KOG4497|consen   87 IDEGQAGLSSISWSPDGRHILLTSEFDLRITVWSLNTQKGYLLPHPKTNV-------KGYAFHPDGQFCAILSRRDCKDY  159 (447)
T ss_pred             eccCCCcceeeeECCCcceEeeeecceeEEEEEEeccceeEEecccccCc-------eeEEECCCCceeeeeecccHHHH
Confidence            445567889999999994 45677789999999875422100  000011       11122222221111    11111


Q ss_pred             ceEE-------ecccce--eeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCC-CCeEEEEECCCCCe
Q 022074          197 VATY-------KGHSVL--RTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHT-SPVRDCSWHPSQPM  266 (303)
Q Consensus       197 ~~~~-------~~~~~~--~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~-~~I~~v~~sp~~~~  266 (303)
                      +..+       -++-..  ......    .++|||..         +.+||.--.-++.  ..|. -.+..++|||.+++
T Consensus       160 v~i~~c~~W~ll~~f~~dT~Dltgi----eWsPdg~~---------laVwd~~Leykv~--aYe~~lG~k~v~wsP~~qf  224 (447)
T KOG4497|consen  160 VQISSCKAWILLKEFKLDTIDLTGI----EWSPDGNW---------LAVWDNVLEYKVY--AYERGLGLKFVEWSPCNQF  224 (447)
T ss_pred             HHHHhhHHHHHHHhcCCCcccccCc----eECCCCcE---------EEEecchhhheee--eeeeccceeEEEeccccce
Confidence            0000       000000  011112    24566655         5678754332332  2232 46889999999999


Q ss_pred             EEEEeCCCCEEE
Q 022074          267 LVSSSWDGDVVR  278 (303)
Q Consensus       267 las~s~Dg~i~~  278 (303)
                      |+.|+.|+.+++
T Consensus       225 lavGsyD~~lrv  236 (447)
T KOG4497|consen  225 LAVGSYDQMLRV  236 (447)
T ss_pred             EEeeccchhhhh
Confidence            999999998875


No 256
>KOG4532 consensus WD40-like repeat containing protein [General function prediction only]
Probab=98.74  E-value=1.1e-06  Score=72.99  Aligned_cols=190  Identities=14%  Similarity=0.055  Sum_probs=121.4

Q ss_pred             CCCEEEEeeCCCeEEEEECCCCce-EEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCccceeecccc
Q 022074           50 DGRELVAGSSDDCIYVYDLEANKL-SLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHL  128 (303)
Q Consensus        50 ~g~~l~sgs~Dg~v~lwd~~~~~~-~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~  128 (303)
                      .-.+|+.|+.-|...+|...+.+. ...-..|..+|+-+.-..+..-.+..++.|.++++.++.... .+     ...|.
T Consensus        83 kc~~la~gG~~g~fd~~~~~tn~~h~~~cd~snn~v~~~~r~cd~~~~~~i~sndht~k~~~~~~~s-~~-----~~~h~  156 (344)
T KOG4532|consen   83 KCVTLADGGASGQFDLFACNTNDGHLYQCDVSNNDVTLVKRYCDLKFPLNIASNDHTGKTMVVSGDS-NK-----FAVHN  156 (344)
T ss_pred             cccEEEeccccceeeeecccCcccceeeecccccchhhhhhhcccccceeeccCCcceeEEEEecCc-cc-----ceeec
Confidence            345799999999999999986543 333344555555442212233356678999999998865221 11     12244


Q ss_pred             cC--eEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCcceEEecccce
Q 022074          129 EG--ITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVL  206 (303)
Q Consensus       129 ~~--v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  206 (303)
                      ..  +..+++++++.++++.|.-..|.+|.+.....--     ...               .++...             
T Consensus       157 ~~~~~ns~~~snd~~~~~~Vgds~~Vf~y~id~~sey~-----~~~---------------~~a~t~-------------  203 (344)
T KOG4532|consen  157 QNLTQNSLHYSNDPSWGSSVGDSRRVFRYAIDDESEYI-----ENI---------------YEAPTS-------------  203 (344)
T ss_pred             cccceeeeEEcCCCceEEEecCCCcceEEEeCCcccee-----eee---------------EecccC-------------
Confidence            43  7788999999999999999999999875421100     000               000000             


Q ss_pred             eeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeE-EE----EeecCCCCeEEEEECCCCC--eEEEEeCCCCEEEe
Q 022074          207 RTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQ-VA----ALKYHTSPVRDCSWHPSQP--MLVSSSWDGDVVRW  279 (303)
Q Consensus       207 ~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~-~~----~~~~h~~~I~~v~~sp~~~--~las~s~Dg~i~~W  279 (303)
                          ...|...|+.....+|++.+||++.|||++.... +.    +-..|.+.+..|.|+|-|.  +|+-.-.-+.+.+-
T Consensus       204 ----D~gF~~S~s~~~~~FAv~~Qdg~~~I~DVR~~~tpm~~~sstrp~hnGa~R~c~Fsl~g~lDLLf~sEhfs~~hv~  279 (344)
T KOG4532|consen  204 ----DHGFYNSFSENDLQFAVVFQDGTCAIYDVRNMATPMAEISSTRPHHNGAFRVCRFSLYGLLDLLFISEHFSRVHVV  279 (344)
T ss_pred             ----CCceeeeeccCcceEEEEecCCcEEEEEecccccchhhhcccCCCCCCceEEEEecCCCcceEEEEecCcceEEEE
Confidence                0112334566677899999999999999986532 21    2236889999999999765  44444445666777


Q ss_pred             ecC
Q 022074          280 EFP  282 (303)
Q Consensus       280 d~~  282 (303)
                      |..
T Consensus       280 D~R  282 (344)
T KOG4532|consen  280 DTR  282 (344)
T ss_pred             Ecc
Confidence            654


No 257
>PLN02919 haloacid dehalogenase-like hydrolase family protein
Probab=98.73  E-value=5.6e-06  Score=84.12  Aligned_cols=222  Identities=10%  Similarity=0.092  Sum_probs=129.8

Q ss_pred             eEEEEEcCCCCEEEEee-CCCeEEEEECCCCceEEEEecc-----------------cCCeEEEEEccCCCcEEEEecCC
Q 022074           42 IFSLKFSTDGRELVAGS-SDDCIYVYDLEANKLSLRILAH-----------------TSDVNTVCFGDESGHLIYSGSDD  103 (303)
Q Consensus        42 v~~l~~s~~g~~l~sgs-~Dg~v~lwd~~~~~~~~~~~~h-----------------~~~v~~l~~~~~~~~~l~s~s~d  103 (303)
                      -..|+++++++.|+++. ..+.|+++|..++... ++.+-                 -..-..+++.+.++.++++.+.+
T Consensus       626 P~GIavd~~gn~LYVaDt~n~~Ir~id~~~~~V~-tlag~G~~g~~~~gg~~~~~~~ln~P~gVa~dp~~g~LyVad~~~  704 (1057)
T PLN02919        626 PQGLAYNAKKNLLYVADTENHALREIDFVNETVR-TLAGNGTKGSDYQGGKKGTSQVLNSPWDVCFEPVNEKVYIAMAGQ  704 (1057)
T ss_pred             CcEEEEeCCCCEEEEEeCCCceEEEEecCCCEEE-EEeccCcccCCCCCChhhhHhhcCCCeEEEEecCCCeEEEEECCC
Confidence            57889999888766654 4567999998876532 22110                 01234678876567788888889


Q ss_pred             CeEEEEcCccccCC-----Cccceeeccc------ccCeEEEEeCCCCC-EEEEEeCCCcEEEEEcccccCCcccccCcc
Q 022074          104 NLCKVWDRRCLNVK-----GKPAGVLMGH------LEGITFIDSRGDGR-YLISNGKDQAIKLWDIRKMSSNASCNLGFR  171 (303)
Q Consensus       104 g~v~lWd~~~~~~~-----~~~~~~~~~h------~~~v~~~~~~~~~~-~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~  171 (303)
                      +.|++||.......     +. .....++      -.....+++++++. ++++.+.++.|++||+......... .+..
T Consensus       705 ~~I~v~d~~~g~v~~~~G~G~-~~~~~g~~~~~~~~~~P~GIavspdG~~LYVADs~n~~Irv~D~~tg~~~~~~-gg~~  782 (1057)
T PLN02919        705 HQIWEYNISDGVTRVFSGDGY-ERNLNGSSGTSTSFAQPSGISLSPDLKELYIADSESSSIRALDLKTGGSRLLA-GGDP  782 (1057)
T ss_pred             CeEEEEECCCCeEEEEecCCc-cccCCCCccccccccCccEEEEeCCCCEEEEEECCCCeEEEEECCCCcEEEEE-eccc
Confidence            99999996421100     00 0000111      12345688899987 5567777899999998642110000 0000


Q ss_pred             ceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEee--
Q 022074          172 SYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALK--  249 (303)
Q Consensus       172 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~--  249 (303)
                               ..+....  .....      ++... ...........++++|.++++...+++|++||..++.......  
T Consensus       783 ---------~~~~~l~--~fG~~------dG~g~-~~~l~~P~Gvavd~dG~LYVADs~N~rIrviD~~tg~v~tiaG~G  844 (1057)
T PLN02919        783 ---------TFSDNLF--KFGDH------DGVGS-EVLLQHPLGVLCAKDGQIYVADSYNHKIKKLDPATKRVTTLAGTG  844 (1057)
T ss_pred             ---------ccCcccc--cccCC------CCchh-hhhccCCceeeEeCCCcEEEEECCCCEEEEEECCCCeEEEEeccC
Confidence                     0000000  00000      00000 0000001122356788888888999999999998876542221  


Q ss_pred             -----------cCCCCeEEEEECCCCCeEEEEeCCCCEEEeecCCC
Q 022074          250 -----------YHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFPGN  284 (303)
Q Consensus       250 -----------~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~~~  284 (303)
                                 +.-.....++++++|+++++-+.++.|++||+...
T Consensus       845 ~~G~~dG~~~~a~l~~P~GIavd~dG~lyVaDt~Nn~Irvid~~~~  890 (1057)
T PLN02919        845 KAGFKDGKALKAQLSEPAGLALGENGRLFVADTNNSLIRYLDLNKG  890 (1057)
T ss_pred             CcCCCCCcccccccCCceEEEEeCCCCEEEEECCCCEEEEEECCCC
Confidence                       11235789999999999999999999999998654


No 258
>PF10282 Lactonase:  Lactonase, 7-bladed beta-propeller;  InterPro: IPR019405  6-phosphogluconolactonase (6PGL, EC 3.1.1.31), which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate, is one of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE type (InterPro IPR022528) and the Pseudomonas aeruginosa DevB type (InterPro IPR005900).  This entry contains bacterial YbhE-type 6-phosphogluconolactonases (EC 3.1.1.31), which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase (EC 5.5.1.5) and muconate cycloisomerase (EC 5.5.1.1), which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold.; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H.
Probab=98.71  E-value=3.3e-05  Score=69.29  Aligned_cols=240  Identities=14%  Similarity=0.191  Sum_probs=129.5

Q ss_pred             EEEEEcCCCCEEEEeeC----CCeEEEEECCCC--ceEE--EEecccCCeEEEEEccCCCcEEEEec-CCCeEEEEcCcc
Q 022074           43 FSLKFSTDGRELVAGSS----DDCIYVYDLEAN--KLSL--RILAHTSDVNTVCFGDESGHLIYSGS-DDNLCKVWDRRC  113 (303)
Q Consensus        43 ~~l~~s~~g~~l~sgs~----Dg~v~lwd~~~~--~~~~--~~~~h~~~v~~l~~~~~~~~~l~s~s-~dg~v~lWd~~~  113 (303)
                      .-++++|++++|++...    ++.|..|++...  ++..  +........+.+++.+ ++++|+++. .+|+|.++++..
T Consensus        40 s~l~~~~~~~~LY~~~e~~~~~g~v~~~~i~~~~g~L~~~~~~~~~g~~p~~i~~~~-~g~~l~vany~~g~v~v~~l~~  118 (345)
T PF10282_consen   40 SWLAVSPDGRRLYVVNEGSGDSGGVSSYRIDPDTGTLTLLNSVPSGGSSPCHIAVDP-DGRFLYVANYGGGSVSVFPLDD  118 (345)
T ss_dssp             CCEEE-TTSSEEEEEETTSSTTTEEEEEEEETTTTEEEEEEEEEESSSCEEEEEECT-TSSEEEEEETTTTEEEEEEECT
T ss_pred             ceEEEEeCCCEEEEEEccccCCCCEEEEEECCCcceeEEeeeeccCCCCcEEEEEec-CCCEEEEEEccCCeEEEEEccC
Confidence            44778999999999876    568989988764  3322  2222334556778865 466766665 589999998753


Q ss_pred             ccCCCccceee--c--------ccccCeEEEEeCCCCCEEEEEe-CCCcEEEEEcccccCCcccccCc-cceeeeceeee
Q 022074          114 LNVKGKPAGVL--M--------GHLEGITFIDSRGDGRYLISNG-KDQAIKLWDIRKMSSNASCNLGF-RSYEWDYRWMD  181 (303)
Q Consensus       114 ~~~~~~~~~~~--~--------~h~~~v~~~~~~~~~~~l~s~~-~D~~v~lWdl~~~~~~~~~~~~~-~~~~~~~~~~~  181 (303)
                      ..........+  .        .-....+++.++|++++++... ....|.+|++............. .......+.+.
T Consensus       119 ~g~l~~~~~~~~~~g~g~~~~rq~~~h~H~v~~~pdg~~v~v~dlG~D~v~~~~~~~~~~~l~~~~~~~~~~G~GPRh~~  198 (345)
T PF10282_consen  119 DGSLGEVVQTVRHEGSGPNPDRQEGPHPHQVVFSPDGRFVYVPDLGADRVYVYDIDDDTGKLTPVDSIKVPPGSGPRHLA  198 (345)
T ss_dssp             TSEEEEEEEEEESEEEESSTTTTSSTCEEEEEE-TTSSEEEEEETTTTEEEEEEE-TTS-TEEEEEEEECSTTSSEEEEE
T ss_pred             CcccceeeeecccCCCCCcccccccccceeEEECCCCCEEEEEecCCCEEEEEEEeCCCceEEEeeccccccCCCCcEEE
Confidence            11111111111  0        1123467889999999876654 45688889886533111000000 00001123344


Q ss_pred             CCCCCccccCCC--CCcceEEecc--c-ceeeeEE------------EeeeeeeeCCCeEEEEEe-CCCeEEEEECC--C
Q 022074          182 YPPQARDLKHPC--DQSVATYKGH--S-VLRTLIR------------CHFSPVYSTGQKYIYTGS-HDSCVYVYDLV--S  241 (303)
Q Consensus       182 ~~~~~~~~~~~~--~~~~~~~~~~--~-~~~~~~~------------~~~~~~~s~~~~~latg~-~dg~i~iwd~~--~  241 (303)
                      +.++.+.+-..+  ...+..+.-.  . .......            ......++||+++|.+.. .+..|.+|++.  +
T Consensus       199 f~pdg~~~Yv~~e~s~~v~v~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~i~ispdg~~lyvsnr~~~sI~vf~~d~~~  278 (345)
T PF10282_consen  199 FSPDGKYAYVVNELSNTVSVFDYDPSDGSLTEIQTISTLPEGFTGENAPAEIAISPDGRFLYVSNRGSNSISVFDLDPAT  278 (345)
T ss_dssp             E-TTSSEEEEEETTTTEEEEEEEETTTTEEEEEEEEESCETTSCSSSSEEEEEE-TTSSEEEEEECTTTEEEEEEECTTT
T ss_pred             EcCCcCEEEEecCCCCcEEEEeecccCCceeEEEEeeeccccccccCCceeEEEecCCCEEEEEeccCCEEEEEEEecCC
Confidence            444433321111  1122222111  0 0000000            011234789999887765 67889999983  3


Q ss_pred             Ce--EEEEeecCCCCeEEEEECCCCCeEEEEe-CCCCEEEeecCC
Q 022074          242 GE--QVAALKYHTSPVRDCSWHPSQPMLVSSS-WDGDVVRWEFPG  283 (303)
Q Consensus       242 ~~--~~~~~~~h~~~I~~v~~sp~~~~las~s-~Dg~i~~Wd~~~  283 (303)
                      ++  .+..+.........++++|+|++|+.+. .++.|.+|++..
T Consensus       279 g~l~~~~~~~~~G~~Pr~~~~s~~g~~l~Va~~~s~~v~vf~~d~  323 (345)
T PF10282_consen  279 GTLTLVQTVPTGGKFPRHFAFSPDGRYLYVANQDSNTVSVFDIDP  323 (345)
T ss_dssp             TTEEEEEEEEESSSSEEEEEE-TTSSEEEEEETTTTEEEEEEEET
T ss_pred             CceEEEEEEeCCCCCccEEEEeCCCCEEEEEecCCCeEEEEEEeC
Confidence            43  3444444445589999999999888776 567899998753


No 259
>PRK04043 tolB translocation protein TolB; Provisional
Probab=98.62  E-value=1.4e-05  Score=73.31  Aligned_cols=192  Identities=10%  Similarity=0.055  Sum_probs=107.6

Q ss_pred             ceEEEEEcCCCCE-EEEeeCC---CeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCC--CeEEEEcCccc
Q 022074           41 GIFSLKFSTDGRE-LVAGSSD---DCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDD--NLCKVWDRRCL  114 (303)
Q Consensus        41 ~v~~l~~s~~g~~-l~sgs~D---g~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~d--g~v~lWd~~~~  114 (303)
                      ....-.|+|||+. ++..+.+   ..|+++|+.+++.. .+....+......|+|+...++++.+.+  ..|.++|+...
T Consensus       189 ~~~~p~wSpDG~~~i~y~s~~~~~~~Iyv~dl~tg~~~-~lt~~~g~~~~~~~SPDG~~la~~~~~~g~~~Iy~~dl~~g  267 (419)
T PRK04043        189 LNIFPKWANKEQTAFYYTSYGERKPTLYKYNLYTGKKE-KIASSQGMLVVSDVSKDGSKLLLTMAPKGQPDIYLYDTNTK  267 (419)
T ss_pred             CeEeEEECCCCCcEEEEEEccCCCCEEEEEECCCCcEE-EEecCCCcEEeeEECCCCCEEEEEEccCCCcEEEEEECCCC
Confidence            5678999999985 5544443   46899999888654 3434445566678987655566666555  45666675421


Q ss_pred             cCCCccceeecccccCeEEEEeCCCCCEEEEEe-CCCcEEEE--EcccccCCcccccCccceeeeceeeeCCCCCccccC
Q 022074          115 NVKGKPAGVLMGHLEGITFIDSRGDGRYLISNG-KDQAIKLW--DIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKH  191 (303)
Q Consensus       115 ~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~-~D~~v~lW--dl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  191 (303)
                           ....+..+........|+|||+.|+-.+ ..+.-.||  |+.....                        +.+  
T Consensus       268 -----~~~~LT~~~~~d~~p~~SPDG~~I~F~Sdr~g~~~Iy~~dl~~g~~------------------------~rl--  316 (419)
T PRK04043        268 -----TLTQITNYPGIDVNGNFVEDDKRIVFVSDRLGYPNIFMKKLNSGSV------------------------EQV--  316 (419)
T ss_pred             -----cEEEcccCCCccCccEECCCCCEEEEEECCCCCceEEEEECCCCCe------------------------EeC--
Confidence                 1222333332223446899998765444 44443444  3321100                        000  


Q ss_pred             CCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCC---------CeEEEEECCCCeEEEEeecCCCCeEEEEECC
Q 022074          192 PCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHD---------SCVYVYDLVSGEQVAALKYHTSPVRDCSWHP  262 (303)
Q Consensus       192 ~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~d---------g~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp  262 (303)
                             +..+.          ..+.+||+|++++.....         ..|++.|+.+++. ..+... .......|||
T Consensus       317 -------t~~g~----------~~~~~SPDG~~Ia~~~~~~~~~~~~~~~~I~v~d~~~g~~-~~LT~~-~~~~~p~~SP  377 (419)
T PRK04043        317 -------VFHGK----------NNSSVSTYKNYIVYSSRETNNEFGKNTFNLYLISTNSDYI-RRLTAN-GVNQFPRFSS  377 (419)
T ss_pred             -------ccCCC----------cCceECCCCCEEEEEEcCCCcccCCCCcEEEEEECCCCCe-EECCCC-CCcCCeEECC
Confidence                   00010          012467888877766543         3788989888764 333322 2334688999


Q ss_pred             CCCeEEEEeC-CCC--EEEeecCC
Q 022074          263 SQPMLVSSSW-DGD--VVRWEFPG  283 (303)
Q Consensus       263 ~~~~las~s~-Dg~--i~~Wd~~~  283 (303)
                      ||++|+-.+. .+.  |.+.++.+
T Consensus       378 DG~~I~f~~~~~~~~~L~~~~l~g  401 (419)
T PRK04043        378 DGGSIMFIKYLGNQSALGIIRLNY  401 (419)
T ss_pred             CCCEEEEEEccCCcEEEEEEecCC
Confidence            9996554443 344  34445444


No 260
>KOG0280 consensus Uncharacterized conserved protein [Amino acid transport and metabolism]
Probab=98.61  E-value=7.2e-06  Score=69.01  Aligned_cols=187  Identities=18%  Similarity=0.146  Sum_probs=113.2

Q ss_pred             CCeEEEEECCCCceEE---EEecccCCeEEEEEcc--CCCc-EEEEecCCCeEEEEcCccccCCCccceeecccccC---
Q 022074           60 DDCIYVYDLEANKLSL---RILAHTSDVNTVCFGD--ESGH-LIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEG---  130 (303)
Q Consensus        60 Dg~v~lwd~~~~~~~~---~~~~h~~~v~~l~~~~--~~~~-~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~---  130 (303)
                      .|.+.+|++...+...   .....+..++.+.|..  .+++ .++-+..+|.|.++...... .......+.+..-.   
T Consensus        45 ~Gkl~Lys~~d~~~~~l~~~q~~dts~~~dm~w~~~~~~g~~~l~~a~a~G~i~~~r~~~~~-ss~~L~~ls~~ki~~~~  123 (339)
T KOG0280|consen   45 SGKLHLYSLEDMKLSPLDTLQCTDTSTEFDMLWRIRETDGDFNLLDAHARGQIQLYRNDEDE-SSVHLRGLSSKKISVVE  123 (339)
T ss_pred             ccceEEEeecccccCccceeeeecccccceeeeeeccCCccceeeeccccceEEEEeeccce-eeeeecccchhhhhhee
Confidence            4678888887655332   1222335567777732  2344 56677888999998643110 00001111111111   


Q ss_pred             eEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeE
Q 022074          131 ITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLI  210 (303)
Q Consensus       131 v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  210 (303)
                      -.++++++.+..++++-.+|.+.+-+-....                                ...++.+++|.+.....
T Consensus       124 ~lslD~~~~~~~i~vs~s~G~~~~v~~t~~~--------------------------------le~vq~wk~He~E~Wta  171 (339)
T KOG0280|consen  124 ALSLDISTSGTKIFVSDSRGSISGVYETEMV--------------------------------LEKVQTWKVHEFEAWTA  171 (339)
T ss_pred             eeEEEeeccCceEEEEcCCCcEEEEecceee--------------------------------eeecccccccceeeeee
Confidence            2356777778788888777777743322110                                11234555555433222


Q ss_pred             EEeeeeeeeCCCeEEEEEeCCCeEEEEECC-CCeEEEE-eecCCCCeEEEEECCC-CCeEEEEeCCCCEEEeecCCC
Q 022074          211 RCHFSPVYSTGQKYIYTGSHDSCVYVYDLV-SGEQVAA-LKYHTSPVRDCSWHPS-QPMLVSSSWDGDVVRWEFPGN  284 (303)
Q Consensus       211 ~~~~~~~~s~~~~~latg~~dg~i~iwd~~-~~~~~~~-~~~h~~~I~~v~~sp~-~~~las~s~Dg~i~~Wd~~~~  284 (303)
                        +|+-   .+..++.+||.|+.+..||++ .++.+.. .+.|...|.++.=||- ..+++||+.|-.|++||...+
T Consensus       172 --~f~~---~~pnlvytGgDD~~l~~~D~R~p~~~i~~n~kvH~~GV~SI~ss~~~~~~I~TGsYDe~i~~~DtRnm  243 (339)
T KOG0280|consen  172 --KFSD---KEPNLVYTGGDDGSLSCWDIRIPKTFIWHNSKVHTSGVVSIYSSPPKPTYIATGSYDECIRVLDTRNM  243 (339)
T ss_pred             --eccc---CCCceEEecCCCceEEEEEecCCcceeeecceeeecceEEEecCCCCCceEEEeccccceeeeehhcc
Confidence              2221   133689999999999999998 3444443 4678889999998875 558999999999999998754


No 261
>TIGR02658 TTQ_MADH_Hv methylamine dehydrogenase heavy chain. This family consists of the heavy chain of methylamine dehydrogenase, a periplasmic enzyme. The enzyme contains a tryptophan tryptophylquinone (TTQ) prosthetic group derived from two Trp residues in the light subunit. The enzyme forms a complex with the type I blue copper protein amicyanin and a cytochrome. Electron transfer proceeds from TTQ to the copper and then to the heme group of the cytochrome.
Probab=98.60  E-value=0.00016  Score=64.36  Aligned_cols=256  Identities=12%  Similarity=0.085  Sum_probs=135.7

Q ss_pred             EEEEEccCchhhccccccccccCcCcccccCCCcccceEEEEEcCCCCEEEEeeC----------CCeEEEEECCCCceE
Q 022074            5 VHIVDVGSGTMESLANVTEIHDGLDFSAADDGGYSFGIFSLKFSTDGRELVAGSS----------DDCIYVYDLEANKLS   74 (303)
Q Consensus         5 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~s~~g~~l~sgs~----------Dg~v~lwd~~~~~~~   74 (303)
                      +-+.|.+...|..-+.|.+.=++. ..+..+.|..-..  + +||||+.++++..          +..|.+||+.+.+..
T Consensus        15 v~V~d~~~~~~~~~v~ViD~~~~~-v~g~i~~G~~P~~--~-~spDg~~lyva~~~~~R~~~G~~~d~V~v~D~~t~~~~   90 (352)
T TIGR02658        15 VYVLDPGHFAATTQVYTIDGEAGR-VLGMTDGGFLPNP--V-VASDGSFFAHASTVYSRIARGKRTDYVEVIDPQTHLPI   90 (352)
T ss_pred             EEEECCcccccCceEEEEECCCCE-EEEEEEccCCCce--e-ECCCCCEEEEEeccccccccCCCCCEEEEEECccCcEE
Confidence            556666766666777777764433 2233445544443  4 9999999888766          778999999999887


Q ss_pred             EEEeccc-------CCeEEEEEccCCCcEEEEec-C-CCeEEEEcCccccCCCccceeecccccCeEE--------EEeC
Q 022074           75 LRILAHT-------SDVNTVCFGDESGHLIYSGS-D-DNLCKVWDRRCLNVKGKPAGVLMGHLEGITF--------IDSR  137 (303)
Q Consensus        75 ~~~~~h~-------~~v~~l~~~~~~~~~l~s~s-~-dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~--------~~~~  137 (303)
                      .++.--.       .....+++++ ++++++... . +..|.+.|+....    ......- .++...        ...+
T Consensus        91 ~~i~~p~~p~~~~~~~~~~~~ls~-dgk~l~V~n~~p~~~V~VvD~~~~k----vv~ei~v-p~~~~vy~t~e~~~~~~~  164 (352)
T TIGR02658        91 ADIELPEGPRFLVGTYPWMTSLTP-DNKTLLFYQFSPSPAVGVVDLEGKA----FVRMMDV-PDCYHIFPTANDTFFMHC  164 (352)
T ss_pred             eEEccCCCchhhccCccceEEECC-CCCEEEEecCCCCCEEEEEECCCCc----EEEEEeC-CCCcEEEEecCCccEEEe
Confidence            6655311       1223566765 577777665 3 6899999976322    2221111 111111        1122


Q ss_pred             CCCCEE-EEEeCCCcEE-----EEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCcceEEec--ccc--ee
Q 022074          138 GDGRYL-ISNGKDQAIK-----LWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKG--HSV--LR  207 (303)
Q Consensus       138 ~~~~~l-~s~~~D~~v~-----lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~--~~  207 (303)
                      .||.++ ++...+|...     +++-... ..+     .+. .      -.+.+.+.+.......+..++-  ...  ..
T Consensus       165 ~Dg~~~~v~~d~~g~~~~~~~~vf~~~~~-~v~-----~rP-~------~~~~dg~~~~vs~eG~V~~id~~~~~~~~~~  231 (352)
T TIGR02658       165 RDGSLAKVGYGTKGNPKIKPTEVFHPEDE-YLI-----NHP-A------YSNKSGRLVWPTYTGKIFQIDLSSGDAKFLP  231 (352)
T ss_pred             ecCceEEEEecCCCceEEeeeeeecCCcc-ccc-----cCC-c------eEcCCCcEEEEecCCeEEEEecCCCcceecc
Confidence            333332 2233333322     1111000 000     000 0      0011222222222222222220  000  00


Q ss_pred             --eeEEE-----ee-----e-eeeeCCCeEEEEEe----------CCCeEEEEECCCCeEEEEeecCCCCeEEEEECCCC
Q 022074          208 --TLIRC-----HF-----S-PVYSTGQKYIYTGS----------HDSCVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQ  264 (303)
Q Consensus       208 --~~~~~-----~~-----~-~~~s~~~~~latg~----------~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~  264 (303)
                        .....     .+     . ..++++++.+....          ..+.|.++|..+++.+..+.. ..+++.+++|||+
T Consensus       232 ~~~~~~~~~~~~~wrP~g~q~ia~~~dg~~lyV~~~~~~~~thk~~~~~V~ViD~~t~kvi~~i~v-G~~~~~iavS~Dg  310 (352)
T TIGR02658       232 AIEAFTEAEKADGWRPGGWQQVAYHRARDRIYLLADQRAKWTHKTASRFLFVVDAKTGKRLRKIEL-GHEIDSINVSQDA  310 (352)
T ss_pred             eeeeccccccccccCCCcceeEEEcCCCCEEEEEecCCccccccCCCCEEEEEECCCCeEEEEEeC-CCceeeEEECCCC
Confidence              00000     00     0 23667777766632          235899999999999888763 3589999999999


Q ss_pred             C-eEEEEe-CCCCEEEeecCCC
Q 022074          265 P-MLVSSS-WDGDVVRWEFPGN  284 (303)
Q Consensus       265 ~-~las~s-~Dg~i~~Wd~~~~  284 (303)
                      + .|.+.. .++.+.+.|.+..
T Consensus       311 kp~lyvtn~~s~~VsViD~~t~  332 (352)
T TIGR02658       311 KPLLYALSTGDKTLYIFDAETG  332 (352)
T ss_pred             CeEEEEeCCCCCcEEEEECcCC
Confidence            9 777666 5788999997643


No 262
>KOG3914 consensus WD repeat protein WDR4 [Function unknown]
Probab=98.60  E-value=1.4e-06  Score=75.98  Aligned_cols=165  Identities=17%  Similarity=0.244  Sum_probs=96.2

Q ss_pred             ccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEE---ecccCCeEEEEEccCCCcEE--EEecCCCeEEEEcCcc
Q 022074           39 SFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRI---LAHTSDVNTVCFGDESGHLI--YSGSDDNLCKVWDRRC  113 (303)
Q Consensus        39 ~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~---~~h~~~v~~l~~~~~~~~~l--~s~s~dg~v~lWd~~~  113 (303)
                      ..+...+..++.++.+|++..+....+++........++   ..-...-+++.+...+...+  ..++....+.+|... 
T Consensus        62 ~~a~~~~~~s~~~~llAv~~~~K~~~~f~~~~~~~~~kl~~~~~v~~~~~ai~~~~~~~sv~v~dkagD~~~~di~s~~-  140 (390)
T KOG3914|consen   62 SLAPALVLTSDSGRLVAVATSSKQRAVFDYRENPKGAKLLDVSCVPKRPTAISFIREDTSVLVADKAGDVYSFDILSAD-  140 (390)
T ss_pred             hccccccccCCCceEEEEEeCCCceEEEEEecCCCcceeeeEeecccCcceeeeeeccceEEEEeecCCceeeeeeccc-
Confidence            345666778888999998887777667766544321111   11112223444422222222  122333344444422 


Q ss_pred             ccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCC
Q 022074          114 LNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPC  193 (303)
Q Consensus       114 ~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  193 (303)
                          ........||..-+..+++++|++.++|+.+|..||+-.......+.+.                           
T Consensus       141 ----~~~~~~~lGhvSml~dVavS~D~~~IitaDRDEkIRvs~ypa~f~Iesf---------------------------  189 (390)
T KOG3914|consen  141 ----SGRCEPILGHVSMLLDVAVSPDDQFIITADRDEKIRVSRYPATFVIESF---------------------------  189 (390)
T ss_pred             ----ccCcchhhhhhhhhheeeecCCCCEEEEecCCceEEEEecCcccchhhh---------------------------
Confidence                1234456799999999999999999999999999999754321100000                           


Q ss_pred             CCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEe
Q 022074          194 DQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAAL  248 (303)
Q Consensus       194 ~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~  248 (303)
                            +-||......+.      . .+++.|++||.|+++++||+++|+.+.++
T Consensus       190 ------clGH~eFVS~is------l-~~~~~LlS~sGD~tlr~Wd~~sgk~L~t~  231 (390)
T KOG3914|consen  190 ------CLGHKEFVSTIS------L-TDNYLLLSGSGDKTLRLWDITSGKLLDTC  231 (390)
T ss_pred             ------ccccHhheeeee------e-ccCceeeecCCCCcEEEEecccCCccccc
Confidence                  112221111111      1 23566899999999999999999876554


No 263
>PRK00178 tolB translocation protein TolB; Provisional
Probab=98.60  E-value=2.1e-05  Score=72.68  Aligned_cols=174  Identities=13%  Similarity=0.105  Sum_probs=100.8

Q ss_pred             eEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecC-C--CeEEEEcCccccCCCccceeecccccCeEEEEeCC
Q 022074           62 CIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSD-D--NLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRG  138 (303)
Q Consensus        62 ~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~-d--g~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~  138 (303)
                      .|.++|.+.+.. ..+..+...+....|+|+ ++.|+..+. +  ..|.+||+...    . ...+......+....|+|
T Consensus       180 ~l~~~d~~g~~~-~~l~~~~~~~~~p~wSpD-G~~la~~s~~~~~~~l~~~~l~~g----~-~~~l~~~~g~~~~~~~Sp  252 (430)
T PRK00178        180 TLQRSDYDGARA-VTLLQSREPILSPRWSPD-GKRIAYVSFEQKRPRIFVQNLDTG----R-REQITNFEGLNGAPAWSP  252 (430)
T ss_pred             EEEEECCCCCCc-eEEecCCCceeeeeECCC-CCEEEEEEcCCCCCEEEEEECCCC----C-EEEccCCCCCcCCeEECC
Confidence            477778876543 445567778889999865 666655443 2  46888887532    1 112222223344577999


Q ss_pred             CCCEEE-EEeCCC--cEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeee
Q 022074          139 DGRYLI-SNGKDQ--AIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFS  215 (303)
Q Consensus       139 ~~~~l~-s~~~D~--~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  215 (303)
                      +|+.|+ +.+.++  .|.+||+......                                   .+..+..      ....
T Consensus       253 DG~~la~~~~~~g~~~Iy~~d~~~~~~~-----------------------------------~lt~~~~------~~~~  291 (430)
T PRK00178        253 DGSKLAFVLSKDGNPEIYVMDLASRQLS-----------------------------------RVTNHPA------IDTE  291 (430)
T ss_pred             CCCEEEEEEccCCCceEEEEECCCCCeE-----------------------------------EcccCCC------CcCC
Confidence            998876 555555  4666676431100                                   0000000      0123


Q ss_pred             eeeeCCCeEEEEEeC-C--CeEEEEECCCCeEEEEeecCCCCeEEEEECCCCCeEEEEeCC-C--CEEEeecCCC
Q 022074          216 PVYSTGQKYIYTGSH-D--SCVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQPMLVSSSWD-G--DVVRWEFPGN  284 (303)
Q Consensus       216 ~~~s~~~~~latg~~-d--g~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~las~s~D-g--~i~~Wd~~~~  284 (303)
                      +.|+||++.++..+. +  ..|+++|+.+++.. .+..........+||||++.|+..+.+ +  .|.+||+.+.
T Consensus       292 ~~~spDg~~i~f~s~~~g~~~iy~~d~~~g~~~-~lt~~~~~~~~~~~Spdg~~i~~~~~~~~~~~l~~~dl~tg  365 (430)
T PRK00178        292 PFWGKDGRTLYFTSDRGGKPQIYKVNVNGGRAE-RVTFVGNYNARPRLSADGKTLVMVHRQDGNFHVAAQDLQRG  365 (430)
T ss_pred             eEECCCCCEEEEEECCCCCceEEEEECCCCCEE-EeecCCCCccceEECCCCCEEEEEEccCCceEEEEEECCCC
Confidence            557888887766553 2  36888888777642 222122234567899999988776643 3  4777887653


No 264
>PF15492 Nbas_N:  Neuroblastoma-amplified sequence, N terminal
Probab=98.58  E-value=2.8e-05  Score=65.39  Aligned_cols=192  Identities=17%  Similarity=0.137  Sum_probs=106.9

Q ss_pred             EEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEec-c------cCCeEEEEEccCC-----CcEEEEecCCCeEEEEc
Q 022074           43 FSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILA-H------TSDVNTVCFGDES-----GHLIYSGSDDNLCKVWD  110 (303)
Q Consensus        43 ~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~-h------~~~v~~l~~~~~~-----~~~l~s~s~dg~v~lWd  110 (303)
                      ..++||||+..||.+...|+|++||+....+. .+.. +      ...|..+.|....     ...|+.-..+|.++-|=
T Consensus        47 Rkl~WSpD~tlLa~a~S~G~i~vfdl~g~~lf-~I~p~~~~~~d~~~Aiagl~Fl~~~~s~~ws~ELlvi~Y~G~L~Sy~  125 (282)
T PF15492_consen   47 RKLAWSPDCTLLAYAESTGTIRVFDLMGSELF-VIPPAMSFPGDLSDAIAGLIFLEYKKSAQWSYELLVINYRGQLRSYL  125 (282)
T ss_pred             eEEEECCCCcEEEEEcCCCeEEEEecccceeE-EcCcccccCCccccceeeeEeeccccccccceeEEEEeccceeeeEE
Confidence            56899999999999999999999999765432 2221 1      2345556663221     22456667777777554


Q ss_pred             Ccccc-CCCcccee--ecc-cccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCC
Q 022074          111 RRCLN-VKGKPAGV--LMG-HLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQA  186 (303)
Q Consensus       111 ~~~~~-~~~~~~~~--~~~-h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  186 (303)
                      +.... +.-+....  +.. +..+|.++.+++.-++|+.||....      ......+                      
T Consensus       126 vs~gt~q~y~e~hsfsf~~~yp~Gi~~~vy~p~h~LLlVgG~~~~------~~~~s~a----------------------  177 (282)
T PF15492_consen  126 VSVGTNQGYQENHSFSFSSHYPHGINSAVYHPKHRLLLVGGCEQN------QDGMSKA----------------------  177 (282)
T ss_pred             EEcccCCcceeeEEEEecccCCCceeEEEEcCCCCEEEEeccCCC------CCccccc----------------------
Confidence            32111 11111112  222 3568999999998888888775432      0000000                      


Q ss_pred             ccccCCCCC-cceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCC------eEEEEECCCCeEEEEeecCCCCeEEEE
Q 022074          187 RDLKHPCDQ-SVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDS------CVYVYDLVSGEQVAALKYHTSPVRDCS  259 (303)
Q Consensus       187 ~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg------~i~iwd~~~~~~~~~~~~h~~~I~~v~  259 (303)
                          ..+.- ....+++.                |..+. ++..+|+      +-.+|.+.+.+.-.....-.+.|..|.
T Consensus       178 ----~~~GLtaWRiL~~~----------------Pyyk~-v~~~~~~~~~~~~~~~~~~~~~~~~fs~~~~~~d~i~kmS  236 (282)
T PF15492_consen  178 ----SSCGLTAWRILSDS----------------PYYKQ-VTSSEDDITASSKRRGLLRIPSFKFFSRQGQEQDGIFKMS  236 (282)
T ss_pred             ----cccCceEEEEcCCC----------------CcEEE-ccccCccccccccccceeeccceeeeeccccCCCceEEEE
Confidence                00000 00011111                11111 1122221      123444333332222223457899999


Q ss_pred             ECCCCCeEEEEeCCCCEEEeecCCC
Q 022074          260 WHPSQPMLVSSSWDGDVVRWEFPGN  284 (303)
Q Consensus       260 ~sp~~~~las~s~Dg~i~~Wd~~~~  284 (303)
                      .||||..||+...+|.|.+|+++.-
T Consensus       237 lSPdg~~La~ih~sG~lsLW~iPsL  261 (282)
T PF15492_consen  237 LSPDGSLLACIHFSGSLSLWEIPSL  261 (282)
T ss_pred             ECCCCCEEEEEEcCCeEEEEecCcc
Confidence            9999999999999999999999864


No 265
>PRK04792 tolB translocation protein TolB; Provisional
Probab=98.57  E-value=5.7e-06  Score=76.81  Aligned_cols=172  Identities=20%  Similarity=0.229  Sum_probs=98.5

Q ss_pred             eEEEEEcCCCCEEEE-eeCCCe--EEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCe--EEEEcCccccC
Q 022074           42 IFSLKFSTDGRELVA-GSSDDC--IYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNL--CKVWDRRCLNV  116 (303)
Q Consensus        42 v~~l~~s~~g~~l~s-gs~Dg~--v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~--v~lWd~~~~~~  116 (303)
                      .....|+|||+.|+. .+.+|.  |+++|+.+++.. ++..+........|+|+...++++...++.  |.++|+..   
T Consensus       264 ~~~~~wSPDG~~La~~~~~~g~~~Iy~~dl~tg~~~-~lt~~~~~~~~p~wSpDG~~I~f~s~~~g~~~Iy~~dl~~---  339 (448)
T PRK04792        264 NGAPRFSPDGKKLALVLSKDGQPEIYVVDIATKALT-RITRHRAIDTEPSWHPDGKSLIFTSERGGKPQIYRVNLAS---  339 (448)
T ss_pred             cCCeeECCCCCEEEEEEeCCCCeEEEEEECCCCCeE-ECccCCCCccceEECCCCCEEEEEECCCCCceEEEEECCC---
Confidence            346799999998765 456664  778898887643 455555556678897654444455555554  55555532   


Q ss_pred             CCccceeecccccCeEEEEeCCCCCEEEEEeC-CCcEEEE--EcccccCCcccccCccceeeeceeeeCCCCCccccCCC
Q 022074          117 KGKPAGVLMGHLEGITFIDSRGDGRYLISNGK-DQAIKLW--DIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPC  193 (303)
Q Consensus       117 ~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~-D~~v~lW--dl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  193 (303)
                       +. ...+..........+++|+|++|+..+. ++...||  |+.....                               
T Consensus       340 -g~-~~~Lt~~g~~~~~~~~SpDG~~l~~~~~~~g~~~I~~~dl~~g~~-------------------------------  386 (448)
T PRK04792        340 -GK-VSRLTFEGEQNLGGSITPDGRSMIMVNRTNGKFNIARQDLETGAM-------------------------------  386 (448)
T ss_pred             -CC-EEEEecCCCCCcCeeECCCCCEEEEEEecCCceEEEEEECCCCCe-------------------------------
Confidence             11 1222111122234578999998876554 4545555  3321100                               


Q ss_pred             CCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeC-CC--eEEEEECCCCeEEEEeecCCCCeEEEEECC
Q 022074          194 DQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSH-DS--CVYVYDLVSGEQVAALKYHTSPVRDCSWHP  262 (303)
Q Consensus       194 ~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~-dg--~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp  262 (303)
                          ..+....       ....|.++|+++.++.... ++  .+++++. +|.....+..+.+.+...+|||
T Consensus       387 ----~~lt~~~-------~d~~ps~spdG~~I~~~~~~~g~~~l~~~~~-~G~~~~~l~~~~g~~~~p~Wsp  446 (448)
T PRK04792        387 ----QVLTSTR-------LDESPSVAPNGTMVIYSTTYQGKQVLAAVSI-DGRFKARLPAGQGEVKSPAWSP  446 (448)
T ss_pred             ----EEccCCC-------CCCCceECCCCCEEEEEEecCCceEEEEEEC-CCCceEECcCCCCCcCCCccCC
Confidence                0000000       0013457888888776553 33  3778887 4555666666667788889987


No 266
>KOG4190 consensus Uncharacterized conserved protein [Function unknown]
Probab=98.57  E-value=1.4e-07  Score=85.21  Aligned_cols=203  Identities=18%  Similarity=0.289  Sum_probs=126.5

Q ss_pred             CCCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCc-------eEEEEecccCCeEEEEEccCCCcEEEEecCCCeEE
Q 022074           35 DGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANK-------LSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCK  107 (303)
Q Consensus        35 ~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~-------~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~  107 (303)
                      ..||.-.|.++.--.+.+-+++++.|.+|++|.++...       .+.++..|+..|..+.|.. +...+  ++.||-++
T Consensus       731 f~GH~~~iRai~AidNENSFiSASkDKTVKLWSik~EgD~~~tsaCQfTY~aHkk~i~~igfL~-~lr~i--~ScD~giH  807 (1034)
T KOG4190|consen  731 FTGHQEKIRAIAAIDNENSFISASKDKTVKLWSIKPEGDEIGTSACQFTYQAHKKPIHDIGFLA-DLRSI--ASCDGGIH  807 (1034)
T ss_pred             ccCcHHHhHHHHhcccccceeeccCCceEEEEEeccccCccccceeeeEhhhccCcccceeeee-cccee--eeccCcce
Confidence            36999888888777788889999999999999886531       3345678999999999953 33444  45688999


Q ss_pred             EEcCccccCCCcccee-ec----ccccCeEEEEeCCCCCEEE-EEeCCCcEEEEEcccccCCcccccCccceeeeceeee
Q 022074          108 VWDRRCLNVKGKPAGV-LM----GHLEGITFIDSRGDGRYLI-SNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMD  181 (303)
Q Consensus       108 lWd~~~~~~~~~~~~~-~~----~h~~~v~~~~~~~~~~~l~-s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~  181 (303)
                      +||..    .+++... +.    +....|.++. +-+...+. -++...+|+++|-|..             +|...+. 
T Consensus       808 lWDPF----igr~Laq~~dapk~~a~~~ikcl~-nv~~~iliAgcsaeSTVKl~DaRsc-------------e~~~E~k-  868 (1034)
T KOG4190|consen  808 LWDPF----IGRLLAQMEDAPKEGAGGNIKCLE-NVDRHILIAGCSAESTVKLFDARSC-------------EWTCELK-  868 (1034)
T ss_pred             eeccc----ccchhHhhhcCcccCCCceeEecc-cCcchheeeeccchhhheeeecccc-------------cceeeEE-
Confidence            99953    2222211 11    1122344442 22344444 3478999999998742             2221110 


Q ss_pred             CCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEEC
Q 022074          182 YPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSWH  261 (303)
Q Consensus       182 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~s  261 (303)
                                     +....+-   ....++.   ...+.|.+++.|-..|+|.+.|.++|+.+.....-+.....++ .
T Consensus       869 ---------------Vcna~~P---na~~R~i---aVa~~GN~lAa~LSnGci~~LDaR~G~vINswrpmecdllqla-a  926 (1034)
T KOG4190|consen  869 ---------------VCNAPGP---NALTRAI---AVADKGNKLAAALSNGCIAILDARNGKVINSWRPMECDLLQLA-A  926 (1034)
T ss_pred             ---------------eccCCCC---chheeEE---EeccCcchhhHHhcCCcEEEEecCCCceeccCCcccchhhhhc-C
Confidence                           0000000   0011111   1235678899999999999999999998776654333333333 2


Q ss_pred             CCCCeEEEEeCCCCEEE-eec
Q 022074          262 PSQPMLVSSSWDGDVVR-WEF  281 (303)
Q Consensus       262 p~~~~las~s~Dg~i~~-Wd~  281 (303)
                      |..+.|+....|.++.+ |-.
T Consensus       927 psdq~L~~saldHslaVnWha  947 (1034)
T KOG4190|consen  927 PSDQALAQSALDHSLAVNWHA  947 (1034)
T ss_pred             chhHHHHhhcccceeEeeehh
Confidence            55667777777888877 753


No 267
>PF00400 WD40:  WD domain, G-beta repeat;  InterPro: IPR019781 WD-40 repeats (also known as WD or beta-transducin repeats) are short ~40 amino acid motifs, often terminating in a Trp-Asp (W-D) dipeptide. WD40 repeats usually assume a 7-8 bladed beta-propeller fold, but proteins have been found with 4 to 16 repeated units, which also form a circularised beta-propeller structure. WD-repeat proteins are a large family found in all eukaryotes and are implicated in a variety of functions ranging from signal transduction and transcription regulation to cell cycle control and apoptosis. Repeated WD40 motifs act as a site for protein-protein interaction, and proteins containing WD40 repeats are known to serve as platforms for the assembly of protein complexes or mediators of transient interplay among other proteins. The specificity of the proteins is determined by the sequences outside the repeats themselves. Examples of such complexes are G proteins (beta subunit is a beta-propeller), TAFII transcription factor, and E3 ubiquitin ligase. In Arabidopsis spp., several WD40-containing proteins act as key regulators of plant-specific developmental events.; PDB: 2ZKQ_a 3CFV_B 3CFS_B 1PEV_A 1NR0_A 1VYH_T 3RFH_A 3O2Z_T 3FRX_C 3U5G_g ....
Probab=98.57  E-value=1.7e-07  Score=55.47  Aligned_cols=32  Identities=47%  Similarity=0.596  Sum_probs=31.0

Q ss_pred             CCcccceEEEEEcCCCCEEEEeeCCCeEEEEE
Q 022074           36 GGYSFGIFSLKFSTDGRELVAGSSDDCIYVYD   67 (303)
Q Consensus        36 ~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd   67 (303)
                      .||+.+|.+|+|+|+++.+++|+.|++|++||
T Consensus         8 ~~h~~~i~~i~~~~~~~~~~s~~~D~~i~vwd   39 (39)
T PF00400_consen    8 RGHSSSINSIAWSPDGNFLASGSSDGTIRVWD   39 (39)
T ss_dssp             ESSSSSEEEEEEETTSSEEEEEETTSEEEEEE
T ss_pred             cCCCCcEEEEEEecccccceeeCCCCEEEEEC
Confidence            68999999999999999999999999999997


No 268
>PRK01029 tolB translocation protein TolB; Provisional
Probab=98.56  E-value=8.4e-06  Score=75.18  Aligned_cols=180  Identities=14%  Similarity=0.177  Sum_probs=99.7

Q ss_pred             ceEEEEEcCCCCEEEEeeC-CC----eEEEEECCCC--ceEEEEecc-cCCeEEEEEccCCCcE-EEEecCCCeEEEEcC
Q 022074           41 GIFSLKFSTDGRELVAGSS-DD----CIYVYDLEAN--KLSLRILAH-TSDVNTVCFGDESGHL-IYSGSDDNLCKVWDR  111 (303)
Q Consensus        41 ~v~~l~~s~~g~~l~sgs~-Dg----~v~lwd~~~~--~~~~~~~~h-~~~v~~l~~~~~~~~~-l~s~s~dg~v~lWd~  111 (303)
                      .....+|||||+.|+..+. +|    .+.+|++..+  ....++... .......+|+|+ ++. +++...+|...+|..
T Consensus       232 ~~~~p~wSPDG~~Laf~s~~~g~~di~~~~~~~~~g~~g~~~~lt~~~~~~~~~p~wSPD-G~~Laf~s~~~g~~~ly~~  310 (428)
T PRK01029        232 NQLMPTFSPRKKLLAFISDRYGNPDLFIQSFSLETGAIGKPRRLLNEAFGTQGNPSFSPD-GTRLVFVSNKDGRPRIYIM  310 (428)
T ss_pred             CccceEECCCCCEEEEEECCCCCcceeEEEeecccCCCCcceEeecCCCCCcCCeEECCC-CCEEEEEECCCCCceEEEE
Confidence            3456789999998886553 23    2344677653  122233332 233456789765 554 445556776666643


Q ss_pred             ccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCC---CcEEEEEcccccCCcccccCccceeeeceeeeCCCCCcc
Q 022074          112 RCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKD---QAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARD  188 (303)
Q Consensus       112 ~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D---~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  188 (303)
                      ... ..+.....+..+...+....++|+|+.|+..+.+   ..|.+||+......                         
T Consensus       311 ~~~-~~g~~~~~lt~~~~~~~~p~wSPDG~~Laf~~~~~g~~~I~v~dl~~g~~~-------------------------  364 (428)
T PRK01029        311 QID-PEGQSPRLLTKKYRNSSCPAWSPDGKKIAFCSVIKGVRQICVYDLATGRDY-------------------------  364 (428)
T ss_pred             ECc-ccccceEEeccCCCCccceeECCCCCEEEEEEcCCCCcEEEEEECCCCCeE-------------------------
Confidence            211 0111223343444456677899999988766543   35777776432110                         


Q ss_pred             ccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEe---CCCeEEEEECCCCeEEEEeecCCCCeEEEEECCCC
Q 022074          189 LKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGS---HDSCVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQ  264 (303)
Q Consensus       189 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~---~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~  264 (303)
                                .+....      .....+.++||++.|+...   .+..|+++|+..++..... ...+.+...+|||-.
T Consensus       365 ----------~Lt~~~------~~~~~p~wSpDG~~L~f~~~~~g~~~L~~vdl~~g~~~~Lt-~~~g~~~~p~Ws~~~  426 (428)
T PRK01029        365 ----------QLTTSP------ENKESPSWAIDSLHLVYSAGNSNESELYLISLITKKTRKIV-IGSGEKRFPSWGAFP  426 (428)
T ss_pred             ----------EccCCC------CCccceEECCCCCEEEEEECCCCCceEEEEECCCCCEEEee-cCCCcccCceecCCC
Confidence                      000000      0012355788888776533   2467999999877653333 344567788888753


No 269
>PF02239 Cytochrom_D1:  Cytochrome D1 heme domain; PDB: 1NNO_B 1HZU_A 1N15_B 1N50_A 1GJQ_A 1BL9_B 1NIR_B 1N90_B 1HZV_A 1AOQ_A ....
Probab=98.51  E-value=6.8e-05  Score=67.71  Aligned_cols=191  Identities=17%  Similarity=0.212  Sum_probs=111.1

Q ss_pred             EEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEE
Q 022074           55 VAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFI  134 (303)
Q Consensus        55 ~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~  134 (303)
                      ++-..+|.|.+.|..+.+...++......-..+.+++ ++++++.++.||.|.++|+..    .+.+..+.. ...-..+
T Consensus        10 V~~~~~~~v~viD~~t~~~~~~i~~~~~~h~~~~~s~-Dgr~~yv~~rdg~vsviD~~~----~~~v~~i~~-G~~~~~i   83 (369)
T PF02239_consen   10 VVERGSGSVAVIDGATNKVVARIPTGGAPHAGLKFSP-DGRYLYVANRDGTVSVIDLAT----GKVVATIKV-GGNPRGI   83 (369)
T ss_dssp             EEEGGGTEEEEEETTT-SEEEEEE-STTEEEEEE-TT--SSEEEEEETTSEEEEEETTS----SSEEEEEE--SSEEEEE
T ss_pred             EEecCCCEEEEEECCCCeEEEEEcCCCCceeEEEecC-CCCEEEEEcCCCeEEEEECCc----ccEEEEEec-CCCcceE
Confidence            4556789999999999988877765433323456754 577888888999999999863    333444322 2335678


Q ss_pred             EeCCCCCEEEEEe-CCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEe
Q 022074          135 DSRGDGRYLISNG-KDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCH  213 (303)
Q Consensus       135 ~~~~~~~~l~s~~-~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  213 (303)
                      +++++|+++++++ ..+.+.++|.+.++.........           .+...     ...            + ..   
T Consensus        84 ~~s~DG~~~~v~n~~~~~v~v~D~~tle~v~~I~~~~-----------~~~~~-----~~~------------R-v~---  131 (369)
T PF02239_consen   84 AVSPDGKYVYVANYEPGTVSVIDAETLEPVKTIPTGG-----------MPVDG-----PES------------R-VA---  131 (369)
T ss_dssp             EE--TTTEEEEEEEETTEEEEEETTT--EEEEEE--E-----------E-TTT-----S----------------EE---
T ss_pred             EEcCCCCEEEEEecCCCceeEeccccccceeeccccc-----------ccccc-----cCC------------C-ce---
Confidence            8999999987665 68999999987644322111000           00000     000            0 00   


Q ss_pred             eeeeeeCCCe-EEEEEeCCCeEEEEECCCCeEEE-EeecCCCCeEEEEECCCCCeE-EEEeCCCCEEEeecCCC
Q 022074          214 FSPVYSTGQK-YIYTGSHDSCVYVYDLVSGEQVA-ALKYHTSPVRDCSWHPSQPML-VSSSWDGDVVRWEFPGN  284 (303)
Q Consensus       214 ~~~~~s~~~~-~latg~~dg~i~iwd~~~~~~~~-~~~~h~~~I~~v~~sp~~~~l-as~s~Dg~i~~Wd~~~~  284 (303)
                       ....++... ++++--+.+.|.+-|....+.+. ....-.....+..|+|+++++ +++-....+-++|.+..
T Consensus       132 -aIv~s~~~~~fVv~lkd~~~I~vVdy~d~~~~~~~~i~~g~~~~D~~~dpdgry~~va~~~sn~i~viD~~~~  204 (369)
T PF02239_consen  132 -AIVASPGRPEFVVNLKDTGEIWVVDYSDPKNLKVTTIKVGRFPHDGGFDPDGRYFLVAANGSNKIAVIDTKTG  204 (369)
T ss_dssp             -EEEE-SSSSEEEEEETTTTEEEEEETTTSSCEEEEEEE--TTEEEEEE-TTSSEEEEEEGGGTEEEEEETTTT
T ss_pred             -eEEecCCCCEEEEEEccCCeEEEEEeccccccceeeecccccccccccCcccceeeecccccceeEEEeeccc
Confidence             011234445 44555556899999987654332 222344578899999999975 44566778888886643


No 270
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=98.48  E-value=0.00063  Score=59.10  Aligned_cols=252  Identities=14%  Similarity=0.153  Sum_probs=138.9

Q ss_pred             CCcccceEEEEEcCCCCEEEEeeCC---CeEEEEECCC--CceEEE--EecccCCeEEEEEccCCCcEEEEecC-CCeEE
Q 022074           36 GGYSFGIFSLKFSTDGRELVAGSSD---DCIYVYDLEA--NKLSLR--ILAHTSDVNTVCFGDESGHLIYSGSD-DNLCK  107 (303)
Q Consensus        36 ~~~~~~v~~l~~s~~g~~l~sgs~D---g~v~lwd~~~--~~~~~~--~~~h~~~v~~l~~~~~~~~~l~s~s~-dg~v~  107 (303)
                      -.+.....=|+|++++++|.++-.+   |.|--|..+.  |.+...  ......+-+.++.. ++++.++++.. .|.|.
T Consensus        36 v~~~~nptyl~~~~~~~~LY~v~~~~~~ggvaay~iD~~~G~Lt~ln~~~~~g~~p~yvsvd-~~g~~vf~AnY~~g~v~  114 (346)
T COG2706          36 VAELGNPTYLAVNPDQRHLYVVNEPGEEGGVAAYRIDPDDGRLTFLNRQTLPGSPPCYVSVD-EDGRFVFVANYHSGSVS  114 (346)
T ss_pred             ccccCCCceEEECCCCCEEEEEEecCCcCcEEEEEEcCCCCeEEEeeccccCCCCCeEEEEC-CCCCEEEEEEccCceEE
Confidence            3455567789999999999998654   6677776654  554321  11122334777884 56778888864 67999


Q ss_pred             EEcCccccCCCccceeecccccC----------eEEEEeCCCCCEEEEEe-CCCcEEEEEcccccCCcccccCccceeee
Q 022074          108 VWDRRCLNVKGKPAGVLMGHLEG----------ITFIDSRGDGRYLISNG-KDQAIKLWDIRKMSSNASCNLGFRSYEWD  176 (303)
Q Consensus       108 lWd~~~~~~~~~~~~~~~~h~~~----------v~~~~~~~~~~~l~s~~-~D~~v~lWdl~~~~~~~~~~~~~~~~~~~  176 (303)
                      ++.++.......++. +..|.+.          +.+..+.|+++++++.. .--.|.+|++............. .-..-
T Consensus       115 v~p~~~dG~l~~~v~-~~~h~g~~p~~rQ~~~h~H~a~~tP~~~~l~v~DLG~Dri~~y~~~dg~L~~~~~~~v-~~G~G  192 (346)
T COG2706         115 VYPLQADGSLQPVVQ-VVKHTGSGPHERQESPHVHSANFTPDGRYLVVPDLGTDRIFLYDLDDGKLTPADPAEV-KPGAG  192 (346)
T ss_pred             EEEcccCCcccccee-eeecCCCCCCccccCCccceeeeCCCCCEEEEeecCCceEEEEEcccCcccccccccc-CCCCC
Confidence            998753211111211 1224333          78888999999988766 34678899987433222111111 11111


Q ss_pred             ceeeeCCCCCccccCCC--CCcceEE--ecc-cceeeeEE------------EeeeeeeeCCCeEEEEEe-CCCeEEEEE
Q 022074          177 YRWMDYPPQARDLKHPC--DQSVATY--KGH-SVLRTLIR------------CHFSPVYSTGQKYIYTGS-HDSCVYVYD  238 (303)
Q Consensus       177 ~~~~~~~~~~~~~~~~~--~~~~~~~--~~~-~~~~~~~~------------~~~~~~~s~~~~~latg~-~dg~i~iwd  238 (303)
                      .+.+.+.|+.+..-..+  ...+..+  +.. .....+-.            .......+++|++|.+.. ....|.++.
T Consensus       193 PRHi~FHpn~k~aY~v~EL~stV~v~~y~~~~g~~~~lQ~i~tlP~dF~g~~~~aaIhis~dGrFLYasNRg~dsI~~f~  272 (346)
T COG2706         193 PRHIVFHPNGKYAYLVNELNSTVDVLEYNPAVGKFEELQTIDTLPEDFTGTNWAAAIHISPDGRFLYASNRGHDSIAVFS  272 (346)
T ss_pred             cceEEEcCCCcEEEEEeccCCEEEEEEEcCCCceEEEeeeeccCccccCCCCceeEEEECCCCCEEEEecCCCCeEEEEE
Confidence            23334444333211100  1111111  110 00000000            001122578999998874 334788887


Q ss_pred             CCC--CeE--EEEeecCCCCeEEEEECCCCCeEEEEeCC-CCEEEeecCCCCccCCC
Q 022074          239 LVS--GEQ--VAALKYHTSPVRDCSWHPSQPMLVSSSWD-GDVVRWEFPGNGEAAPP  290 (303)
Q Consensus       239 ~~~--~~~--~~~~~~h~~~I~~v~~sp~~~~las~s~D-g~i~~Wd~~~~~~~~~~  290 (303)
                      +..  +++  +.....+.....+..|+|++++|+.+.+| .++.++.....-++...
T Consensus       273 V~~~~g~L~~~~~~~teg~~PR~F~i~~~g~~Liaa~q~sd~i~vf~~d~~TG~L~~  329 (346)
T COG2706         273 VDPDGGKLELVGITPTEGQFPRDFNINPSGRFLIAANQKSDNITVFERDKETGRLTL  329 (346)
T ss_pred             EcCCCCEEEEEEEeccCCcCCccceeCCCCCEEEEEccCCCcEEEEEEcCCCceEEe
Confidence            653  332  22223344457899999999998888875 57889987655444433


No 271
>KOG2695 consensus WD40 repeat protein [General function prediction only]
Probab=98.48  E-value=3.8e-07  Score=78.09  Aligned_cols=124  Identities=19%  Similarity=0.271  Sum_probs=95.5

Q ss_pred             CCccc------ceEEEEEcCCCCEEEEeeCCCeEEEEECCCC----ceEEEEecccCCeEEEEEccCCCcEEEEecCCCe
Q 022074           36 GGYSF------GIFSLKFSTDGRELVAGSSDDCIYVYDLEAN----KLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNL  105 (303)
Q Consensus        36 ~~~~~------~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~----~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~  105 (303)
                      +||.+      -|+++.|...++.+..|...|.|...|++.+    .......-|...|+++....-+...|.+.+.+|+
T Consensus       243 tg~~qsf~sksDVfAlQf~~s~nLv~~GcRngeI~~iDLR~rnqG~~~~a~rlyh~Ssvtslq~Lq~s~q~LmaS~M~gk  322 (425)
T KOG2695|consen  243 TGHQQSFQSKSDVFALQFAGSDNLVFNGCRNGEIFVIDLRCRNQGNGWCAQRLYHDSSVTSLQILQFSQQKLMASDMTGK  322 (425)
T ss_pred             cccccccccchhHHHHHhcccCCeeEecccCCcEEEEEeeecccCCCcceEEEEcCcchhhhhhhccccceEeeccCcCc
Confidence            56654      4777888888999999999999999999876    3344556788999999775435678888999999


Q ss_pred             EEEEcCccccCCCccceeecccccCeE--EEEeCCCCCEEEEEeCCCcEEEEEcccc
Q 022074          106 CKVWDRRCLNVKGKPAGVLMGHLEGIT--FIDSRGDGRYLISNGKDQAIKLWDIRKM  160 (303)
Q Consensus       106 v~lWd~~~~~~~~~~~~~~~~h~~~v~--~~~~~~~~~~l~s~~~D~~v~lWdl~~~  160 (303)
                      |++||.|... .+.-+..+.||...-.  -+.+.++...++++|.|-..|||.++..
T Consensus       323 ikLyD~R~~K-~~~~V~qYeGHvN~~a~l~~~v~~eeg~I~s~GdDcytRiWsl~~g  378 (425)
T KOG2695|consen  323 IKLYDLRATK-CKKSVMQYEGHVNLSAYLPAHVKEEEGSIFSVGDDCYTRIWSLDSG  378 (425)
T ss_pred             eeEeeehhhh-cccceeeeecccccccccccccccccceEEEccCeeEEEEEecccC
Confidence            9999998432 2234567888876433  3345677778999999999999998853


No 272
>KOG0309 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=98.47  E-value=8.2e-07  Score=82.87  Aligned_cols=203  Identities=24%  Similarity=0.373  Sum_probs=133.8

Q ss_pred             cceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccC--CeEEEEEccC--CCcEEEEecCCCeEEEEcCcccc
Q 022074           40 FGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTS--DVNTVCFGDE--SGHLIYSGSDDNLCKVWDRRCLN  115 (303)
Q Consensus        40 ~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~--~v~~l~~~~~--~~~~l~s~s~dg~v~lWd~~~~~  115 (303)
                      +++.+++.+|.|+-++.++.-| +.+.|+...-...++..|..  .|-.+.|++.  .+..+++-+.. .-.+|.+....
T Consensus        25 ~~~~a~si~p~grdi~lAsr~g-l~i~dld~p~~ppr~l~h~tpw~vad~qws~h~a~~~wiVsts~q-kaiiwnlA~ss  102 (1081)
T KOG0309|consen   25 GGFNAVSINPSGRDIVLASRQG-LYIIDLDDPFTPPRWLHHITPWQVADVQWSPHPAKPYWIVSTSNQ-KAIIWNLAKSS  102 (1081)
T ss_pred             CcccceeeccccchhhhhhhcC-eEEEeccCCCCCceeeeccCcchhcceecccCCCCceeEEecCcc-hhhhhhhhcCC
Confidence            4578899999999999999888 67788876544444555543  5677788642  34456555544 44578864211


Q ss_pred             CCCccceeecccccCeEEEEeCCCC-CEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCC
Q 022074          116 VKGKPAGVLMGHLEGITFIDSRGDG-RYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCD  194 (303)
Q Consensus       116 ~~~~~~~~~~~h~~~v~~~~~~~~~-~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  194 (303)
                       .....-.+.||..+++.+.|.+.. ..+++++.|..+-.||+|.....        .+.+...                
T Consensus       103 -~~aIef~lhghsraitd~n~~~q~pdVlatcsvdt~vh~wd~rSp~~p--------~ys~~~w----------------  157 (1081)
T KOG0309|consen  103 -SNAIEFVLHGHSRAITDINFNPQHPDVLATCSVDTYVHAWDMRSPHRP--------FYSTSSW----------------  157 (1081)
T ss_pred             -ccceEEEEecCccceeccccCCCCCcceeeccccccceeeeccCCCcc--------eeeeecc----------------
Confidence             112223456889999999998754 46899999999999999864321        1111100                


Q ss_pred             CcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCe-EEEEeecCCCCeEEEEECCCC-CeEEEEeC
Q 022074          195 QSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGE-QVAALKYHTSPVRDCSWHPSQ-PMLVSSSW  272 (303)
Q Consensus       195 ~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~-~~~~~~~h~~~I~~v~~sp~~-~~las~s~  272 (303)
                            ...   ...++      ++.....+.+.+..+.|++||.+.|. .+..+++|...|+.++|+.-. ..+.+.+.
T Consensus       158 ------~s~---asqVk------wnyk~p~vlasshg~~i~vwd~r~gs~pl~s~K~~vs~vn~~~fnr~~~s~~~s~~~  222 (1081)
T KOG0309|consen  158 ------RSA---ASQVK------WNYKDPNVLASSHGNDIFVWDLRKGSTPLCSLKGHVSSVNSIDFNRFKYSEIMSSSN  222 (1081)
T ss_pred             ------ccc---Cceee------ecccCcchhhhccCCceEEEeccCCCcceEEecccceeeehHHHhhhhhhhhcccCC
Confidence                  000   00011      11111224445567789999998774 588899999999999997643 46888999


Q ss_pred             CCCEEEeecCCC
Q 022074          273 DGDVVRWEFPGN  284 (303)
Q Consensus       273 Dg~i~~Wd~~~~  284 (303)
                      |++++.|+-...
T Consensus       223 d~tvkfw~y~kS  234 (1081)
T KOG0309|consen  223 DGTVKFWDYSKS  234 (1081)
T ss_pred             CCceeeeccccc
Confidence            999999997543


No 273
>KOG1334 consensus WD40 repeat protein [General function prediction only]
Probab=98.47  E-value=3.2e-06  Score=75.50  Aligned_cols=209  Identities=22%  Similarity=0.295  Sum_probs=120.8

Q ss_pred             EEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCC-------------------------------C----
Q 022074           74 SLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVK-------------------------------G----  118 (303)
Q Consensus        74 ~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~-------------------------------~----  118 (303)
                      ..++..|.+.|+.|.|+. .++.++|++.|..|.+||.-.....                               +    
T Consensus       135 ~~kL~~H~GcVntV~FN~-~Gd~l~SgSDD~~vv~WdW~~~~~~l~f~SGH~~NvfQaKFiP~s~d~ti~~~s~dgqvr~  213 (559)
T KOG1334|consen  135 QKKLNKHKGCVNTVHFNQ-RGDVLASGSDDLQVVVWDWVSGSPKLSFESGHCNNVFQAKFIPFSGDRTIVTSSRDGQVRV  213 (559)
T ss_pred             hhcccCCCCccceeeecc-cCceeeccCccceEEeehhhccCcccccccccccchhhhhccCCCCCcCceeccccCceee
Confidence            346778999999999974 5889999999999999985211000                               0    


Q ss_pred             ---------ccceeecccccCeEEEEeCCCC-CEEEEEeCCCcEEEEEcccccCCcc--cccCccceeeeceeeeCCCCC
Q 022074          119 ---------KPAGVLMGHLEGITFIDSRGDG-RYLISNGKDQAIKLWDIRKMSSNAS--CNLGFRSYEWDYRWMDYPPQA  186 (303)
Q Consensus       119 ---------~~~~~~~~h~~~v~~~~~~~~~-~~l~s~~~D~~v~lWdl~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~  186 (303)
                               .....+..|.+.|..++.-|+. ..|+|+|.|+.++-+|+|...+...  |...........-.+...|..
T Consensus       214 s~i~~t~~~e~t~rl~~h~g~vhklav~p~sp~~f~S~geD~~v~~~Dlr~~~pa~~~~cr~~~~~~~v~L~~Ia~~P~n  293 (559)
T KOG1334|consen  214 SEILETGYVENTKRLAPHEGPVHKLAVEPDSPKPFLSCGEDAVVFHIDLRQDVPAEKFVCREADEKERVGLYTIAVDPRN  293 (559)
T ss_pred             eeeccccceecceecccccCccceeeecCCCCCcccccccccceeeeeeccCCccceeeeeccCCccceeeeeEecCCCC
Confidence                     0012234477778777777755 4488999999999999886533222  211111100000011111111


Q ss_pred             c-cccCC-CCCcceE-----------------EecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCC--C---
Q 022074          187 R-DLKHP-CDQSVAT-----------------YKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVS--G---  242 (303)
Q Consensus       187 ~-~~~~~-~~~~~~~-----------------~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~--~---  242 (303)
                      . .+... .++....                 +-.+.....-.......+|+.++..|.+...|-.|+++...-  |   
T Consensus       294 t~~faVgG~dqf~RvYD~R~~~~e~~n~~~~~f~p~hl~~d~~v~ITgl~Ysh~~sElLaSYnDe~IYLF~~~~~~G~~p  373 (559)
T KOG1334|consen  294 TNEFAVGGSDQFARVYDQRRIDKEENNGVLDKFCPHHLVEDDPVNITGLVYSHDGSELLASYNDEDIYLFNKSMGDGSEP  373 (559)
T ss_pred             ccccccCChhhhhhhhcccchhhccccchhhhcCCccccccCcccceeEEecCCccceeeeecccceEEeccccccCCCC
Confidence            1 11111 1111111                 111111111111122346887777788888888999995432  2   


Q ss_pred             -------eEEEE-eecCCC--CeEEEE-ECCCCCeEEEEeCCCCEEEeecCC
Q 022074          243 -------EQVAA-LKYHTS--PVRDCS-WHPSQPMLVSSSWDGDVVRWEFPG  283 (303)
Q Consensus       243 -------~~~~~-~~~h~~--~I~~v~-~sp~~~~las~s~Dg~i~~Wd~~~  283 (303)
                             ..+.. +++|..  .|..+- |-|...+++|||+-|.|-+|+-.+
T Consensus       374 ~~~s~~~~~~k~vYKGHrN~~TVKgVNFfGPrsEyVvSGSDCGhIFiW~K~t  425 (559)
T KOG1334|consen  374 DPSSPREQYVKRVYKGHRNSRTVKGVNFFGPRSEYVVSGSDCGHIFIWDKKT  425 (559)
T ss_pred             CCCcchhhccchhhcccccccccceeeeccCccceEEecCccceEEEEecch
Confidence                   22333 788864  466665 568889999999999999999654


No 274
>KOG2315 consensus Predicted translation initiation factor related to eIF-3a [Translation, ribosomal structure and biogenesis]
Probab=98.38  E-value=9.6e-05  Score=67.30  Aligned_cols=196  Identities=13%  Similarity=0.214  Sum_probs=117.5

Q ss_pred             cccceEEEEEcCCCC--EEEEe-----eCCCeEEEEECCCCceE-----EEEecccCCeEEEEEccCCCcEEEEecC---
Q 022074           38 YSFGIFSLKFSTDGR--ELVAG-----SSDDCIYVYDLEANKLS-----LRILAHTSDVNTVCFGDESGHLIYSGSD---  102 (303)
Q Consensus        38 ~~~~v~~l~~s~~g~--~l~sg-----s~Dg~v~lwd~~~~~~~-----~~~~~h~~~v~~l~~~~~~~~~l~s~s~---  102 (303)
                      |..+|....+||.+.  .+++-     |.=+.||||.......-     ..+...  .=..+.|++...-+|+-++.   
T Consensus       164 ~~~~i~~f~lSpgp~~~~vAvyvPe~kGaPa~vri~~~~~~~~~~~~a~ksFFka--dkvqm~WN~~gt~LLvLastdVD  241 (566)
T KOG2315|consen  164 SVSGITMLSLSPGPEPPFVAVYVPEKKGAPASVRIYKYPEEGQHQPVANKSFFKA--DKVQMKWNKLGTALLVLASTDVD  241 (566)
T ss_pred             eccceeeEEecCCCCCceEEEEccCCCCCCcEEEEeccccccccchhhhcccccc--ceeEEEeccCCceEEEEEEEeec
Confidence            456788888887533  44442     34447999977632211     111111  12244565433223332221   


Q ss_pred             --------CCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEe--CCCcEEEEEcccccCCcccccCccc
Q 022074          103 --------DNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNG--KDQAIKLWDIRKMSSNASCNLGFRS  172 (303)
Q Consensus       103 --------dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~--~D~~v~lWdl~~~~~~~~~~~~~~~  172 (303)
                              +.++++.++.     +.....-....++|.++.|+++++.|+.+.  .=.++-|+|++....          
T Consensus       242 ktn~SYYGEq~Lyll~t~-----g~s~~V~L~k~GPVhdv~W~~s~~EF~VvyGfMPAkvtifnlr~~~v----------  306 (566)
T KOG2315|consen  242 KTNASYYGEQTLYLLATQ-----GESVSVPLLKEGPVHDVTWSPSGREFAVVYGFMPAKVTIFNLRGKPV----------  306 (566)
T ss_pred             CCCccccccceEEEEEec-----CceEEEecCCCCCceEEEECCCCCEEEEEEecccceEEEEcCCCCEe----------
Confidence                    2355665542     111112222467899999999998886544  346778888863210          


Q ss_pred             eeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEe---CCCeEEEEECCCCeEEEEee
Q 022074          173 YEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGS---HDSCVYVYDLVSGEQVAALK  249 (303)
Q Consensus       173 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~---~dg~i~iwd~~~~~~~~~~~  249 (303)
                             .+++...+                          ...-|+|.|.+++.+|   -.|.+-|||..+.+++..++
T Consensus       307 -------~df~egpR--------------------------N~~~fnp~g~ii~lAGFGNL~G~mEvwDv~n~K~i~~~~  353 (566)
T KOG2315|consen  307 -------FDFPEGPR--------------------------NTAFFNPHGNIILLAGFGNLPGDMEVWDVPNRKLIAKFK  353 (566)
T ss_pred             -------EeCCCCCc--------------------------cceEECCCCCEEEEeecCCCCCceEEEeccchhhccccc
Confidence                   01111100                          0122677788888766   57889999999998888887


Q ss_pred             cCCCCeEEEEECCCCCeEEEEeC------CCCEEEeecCCCC
Q 022074          250 YHTSPVRDCSWHPSQPMLVSSSW------DGDVVRWEFPGNG  285 (303)
Q Consensus       250 ~h~~~I~~v~~sp~~~~las~s~------Dg~i~~Wd~~~~~  285 (303)
                      .-  .-+-++|+|||++++|+..      |+.+++|+..+..
T Consensus       354 a~--~tt~~eW~PdGe~flTATTaPRlrvdNg~KiwhytG~~  393 (566)
T KOG2315|consen  354 AA--NTTVFEWSPDGEYFLTATTAPRLRVDNGIKIWHYTGSL  393 (566)
T ss_pred             cC--CceEEEEcCCCcEEEEEeccccEEecCCeEEEEecCce
Confidence            54  4577899999999988876      6889999987653


No 275
>KOG2041 consensus WD40 repeat protein [General function prediction only]
Probab=98.38  E-value=1.5e-05  Score=74.44  Aligned_cols=223  Identities=10%  Similarity=0.121  Sum_probs=129.1

Q ss_pred             cceEEEEEcCCCCEEEEeeCCCeEEEEECCCCce---------------EEEEecccCCeEEEEEccCCCcEEEEecCCC
Q 022074           40 FGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKL---------------SLRILAHTSDVNTVCFGDESGHLIYSGSDDN  104 (303)
Q Consensus        40 ~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~---------------~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg  104 (303)
                      ....|++|+.+..++++|+.||.+++..+.+...               -.++.+|++.|..+.|+ ++.+.|-|...+|
T Consensus        15 vkL~c~~WNke~gyIAcgG~dGlLKVlKl~t~t~d~~~~glaa~snLsmNQtLeGH~~sV~vvTWN-e~~QKLTtSDt~G   93 (1189)
T KOG2041|consen   15 VKLHCAEWNKESGYIACGGADGLLKVLKLGTDTTDLNKSGLAAASNLSMNQTLEGHNASVMVVTWN-ENNQKLTTSDTSG   93 (1189)
T ss_pred             ceEEEEEEcccCCeEEeccccceeEEEEccccCCcccccccccccccchhhhhccCcceEEEEEec-cccccccccCCCc
Confidence            4588999999999999999999999998765431               12467899999999996 5567788889999


Q ss_pred             eEEEEcCccccCCCccceee--cccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeC
Q 022074          105 LCKVWDRRCLNVKGKPAGVL--MGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDY  182 (303)
Q Consensus       105 ~v~lWd~~~~~~~~~~~~~~--~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~  182 (303)
                      .|.+|=+.-    +.-....  .....-|.+++|..+|..+.....||.|.+=.+..-. +....+  ....  ...+.+
T Consensus        94 lIiVWmlyk----gsW~EEMiNnRnKSvV~SmsWn~dG~kIcIvYeDGavIVGsvdGNR-IwgKeL--kg~~--l~hv~w  164 (1189)
T KOG2041|consen   94 LIIVWMLYK----GSWCEEMINNRNKSVVVSMSWNLDGTKICIVYEDGAVIVGSVDGNR-IWGKEL--KGQL--LAHVLW  164 (1189)
T ss_pred             eEEEEeeec----ccHHHHHhhCcCccEEEEEEEcCCCcEEEEEEccCCEEEEeeccce-ecchhc--chhe--ccceee
Confidence            999997631    1111111  1123447788898899888877777776553322100 000000  0000  000111


Q ss_pred             CCCCccccCCC-------------------CCcceEEecc-c-ceeeeEEEeee--e--eeeCCCeEEEEEeCCCeEEEE
Q 022074          183 PPQARDLKHPC-------------------DQSVATYKGH-S-VLRTLIRCHFS--P--VYSTGQKYIYTGSHDSCVYVY  237 (303)
Q Consensus       183 ~~~~~~~~~~~-------------------~~~~~~~~~~-~-~~~~~~~~~~~--~--~~s~~~~~latg~~dg~i~iw  237 (303)
                      +++.+.+....                   ..+....+|. . ....+...++.  +  ...|+...||.+-..|.+.|.
T Consensus       165 s~D~~~~Lf~~ange~hlydnqgnF~~Kl~~~c~Vn~tg~~s~~~~kia~i~w~~g~~~~v~pdrP~lavcy~nGr~QiM  244 (1189)
T KOG2041|consen  165 SEDLEQALFKKANGETHLYDNQGNFERKLEKDCEVNGTGIFSNFPTKIAEIEWNTGPYQPVPPDRPRLAVCYANGRMQIM  244 (1189)
T ss_pred             cccHHHHHhhhcCCcEEEecccccHHHhhhhceEEeeeeeecCCCccccceeeccCccccCCCCCCEEEEEEcCceehhh
Confidence            11111100000                   0000000000 0 00011111111  1  124688899999999999988


Q ss_pred             ECCCCeEEEEeecCCCCeEEEEECCCCCeEEEEeCC
Q 022074          238 DLVSGEQVAALKYHTSPVRDCSWHPSQPMLVSSSWD  273 (303)
Q Consensus       238 d~~~~~~~~~~~~h~~~I~~v~~sp~~~~las~s~D  273 (303)
                      .-.+...-..+. ....|....|+|+|.+||.++.|
T Consensus       245 R~eND~~Pvv~d-tgm~~vgakWnh~G~vLAvcG~~  279 (1189)
T KOG2041|consen  245 RSENDPEPVVVD-TGMKIVGAKWNHNGAVLAVCGND  279 (1189)
T ss_pred             hhcCCCCCeEEe-cccEeecceecCCCcEEEEccCc
Confidence            765544322333 22678999999999999998865


No 276
>COG4946 Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system [Function unknown]
Probab=98.36  E-value=0.00013  Score=65.50  Aligned_cols=187  Identities=17%  Similarity=0.231  Sum_probs=121.5

Q ss_pred             cCCCCEEEEeeCCCeEEEEECCCCceEEEEec------cc--CCe---EEEE-EccCCCcEEEEecCCCeEEEEcCcccc
Q 022074           48 STDGRELVAGSSDDCIYVYDLEANKLSLRILA------HT--SDV---NTVC-FGDESGHLIYSGSDDNLCKVWDRRCLN  115 (303)
Q Consensus        48 s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~------h~--~~v---~~l~-~~~~~~~~l~s~s~dg~v~lWd~~~~~  115 (303)
                      +.||++++- +.-|.|.+||..+..+...-.+      .+  ..+   .-+. |+..++++++..|. |.+.+.+.-   
T Consensus       275 nsDGkrIvF-q~~GdIylydP~td~lekldI~lpl~rk~k~~k~~~pskyledfa~~~Gd~ia~VSR-GkaFi~~~~---  349 (668)
T COG4946         275 NSDGKRIVF-QNAGDIYLYDPETDSLEKLDIGLPLDRKKKQPKFVNPSKYLEDFAVVNGDYIALVSR-GKAFIMRPW---  349 (668)
T ss_pred             CCCCcEEEE-ecCCcEEEeCCCcCcceeeecCCccccccccccccCHHHhhhhhccCCCcEEEEEec-CcEEEECCC---
Confidence            346877664 4667899999988765432111      01  111   1111 44456788887774 566665421   


Q ss_pred             CCCccceeecccccCeEEEEeCCCCCEEEEEeCCC-cEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCC
Q 022074          116 VKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQ-AIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCD  194 (303)
Q Consensus       116 ~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~-~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  194 (303)
                       .+  -..-.+|...|....+..+++-++.|..|+ .+-++|.+.....                               
T Consensus       350 -~~--~~iqv~~~~~VrY~r~~~~~e~~vigt~dgD~l~iyd~~~~e~k-------------------------------  395 (668)
T COG4946         350 -DG--YSIQVGKKGGVRYRRIQVDPEGDVIGTNDGDKLGIYDKDGGEVK-------------------------------  395 (668)
T ss_pred             -CC--eeEEcCCCCceEEEEEccCCcceEEeccCCceEEEEecCCceEE-------------------------------
Confidence             11  112346777898888888888899999999 9999997642210                               


Q ss_pred             CcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEECCCCCeEEEEeCCC
Q 022074          195 QSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQPMLVSSSWDG  274 (303)
Q Consensus       195 ~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg  274 (303)
                          .+.+.      +.-.+....+++|++++.+.....|.+.|+++|+....=+.-.+-|++.+|||+++++|-+=-+|
T Consensus       396 ----r~e~~------lg~I~av~vs~dGK~~vvaNdr~el~vididngnv~~idkS~~~lItdf~~~~nsr~iAYafP~g  465 (668)
T COG4946         396 ----RIEKD------LGNIEAVKVSPDGKKVVVANDRFELWVIDIDNGNVRLIDKSEYGLITDFDWHPNSRWIAYAFPEG  465 (668)
T ss_pred             ----EeeCC------ccceEEEEEcCCCcEEEEEcCceEEEEEEecCCCeeEecccccceeEEEEEcCCceeEEEecCcc
Confidence                00000      00011223467899999999999999999999985322234457899999999999999776654


Q ss_pred             ----CEEEeecCC
Q 022074          275 ----DVVRWEFPG  283 (303)
Q Consensus       275 ----~i~~Wd~~~  283 (303)
                          .|+++|..+
T Consensus       466 y~tq~Iklydm~~  478 (668)
T COG4946         466 YYTQSIKLYDMDG  478 (668)
T ss_pred             eeeeeEEEEecCC
Confidence                678888765


No 277
>KOG4714 consensus Nucleoporin [Nuclear structure]
Probab=98.34  E-value=3.2e-06  Score=69.99  Aligned_cols=93  Identities=23%  Similarity=0.401  Sum_probs=70.9

Q ss_pred             EEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCC-CC
Q 022074           63 IYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGD-GR  141 (303)
Q Consensus        63 v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~-~~  141 (303)
                      .+.|+++..+...........|.+++-+|...+++++|+.||.+-+||.|..   ..+...+..|...++-+.|+|. +.
T Consensus       161 ~~a~~~~p~~t~~~~~~~~~~v~~l~~hp~qq~~v~cgt~dg~~~l~d~rn~---~~p~S~l~ahk~~i~eV~FHpk~p~  237 (319)
T KOG4714|consen  161 FYANTLDPIKTLIPSKKALDAVTALCSHPAQQHLVCCGTDDGIVGLWDARNV---AMPVSLLKAHKAEIWEVHFHPKNPE  237 (319)
T ss_pred             eeeecccccccccccccccccchhhhCCcccccEEEEecCCCeEEEEEcccc---cchHHHHHHhhhhhhheeccCCCch
Confidence            4556555443322222233458999988877889999999999999998843   4566778889999999999884 56


Q ss_pred             EEEEEeCCCcEEEEEcc
Q 022074          142 YLISNGKDQAIKLWDIR  158 (303)
Q Consensus       142 ~l~s~~~D~~v~lWdl~  158 (303)
                      .|++++.||.+--||-.
T Consensus       238 ~Lft~sedGslw~wdas  254 (319)
T KOG4714|consen  238 HLFTCSEDGSLWHWDAS  254 (319)
T ss_pred             heeEecCCCcEEEEcCC
Confidence            79999999999999965


No 278
>KOG4190 consensus Uncharacterized conserved protein [Function unknown]
Probab=98.33  E-value=1.9e-06  Score=78.06  Aligned_cols=169  Identities=20%  Similarity=0.189  Sum_probs=110.9

Q ss_pred             EEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCC-Cc--cceeecccccCeEEEEeCCCCCEEEEEeCCCcE
Q 022074           76 RILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVK-GK--PAGVLMGHLEGITFIDSRGDGRYLISNGKDQAI  152 (303)
Q Consensus        76 ~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~-~~--~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v  152 (303)
                      .+.+|...|..++.- ++.+-|+++++|++|++|.++..... +.  ..-+++.|..+|..+.|-.+-++++++  |+.+
T Consensus       730 nf~GH~~~iRai~Ai-dNENSFiSASkDKTVKLWSik~EgD~~~tsaCQfTY~aHkk~i~~igfL~~lr~i~Sc--D~gi  806 (1034)
T KOG4190|consen  730 NFTGHQEKIRAIAAI-DNENSFISASKDKTVKLWSIKPEGDEIGTSACQFTYQAHKKPIHDIGFLADLRSIASC--DGGI  806 (1034)
T ss_pred             cccCcHHHhHHHHhc-ccccceeeccCCceEEEEEeccccCccccceeeeEhhhccCcccceeeeeccceeeec--cCcc
Confidence            456788888888663 45678999999999999998742211 11  222467899999999887776677655  8999


Q ss_pred             EEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEE-EeCC
Q 022074          153 KLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYT-GSHD  231 (303)
Q Consensus       153 ~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~lat-g~~d  231 (303)
                      .+||.=......+.       +++.+               .       +..   ..++|.-    +.+...+.. ++.+
T Consensus       807 HlWDPFigr~Laq~-------~dapk---------------~-------~a~---~~ikcl~----nv~~~iliAgcsae  850 (1034)
T KOG4190|consen  807 HLWDPFIGRLLAQM-------EDAPK---------------E-------GAG---GNIKCLE----NVDRHILIAGCSAE  850 (1034)
T ss_pred             eeecccccchhHhh-------hcCcc---------------c-------CCC---ceeEecc----cCcchheeeeccch
Confidence            99995221111000       00000               0       000   0111111    112334444 4789


Q ss_pred             CeEEEEECCCCeEEEEee-----cCCCCeEEEEECCCCCeEEEEeCCCCEEEeecCC
Q 022074          232 SCVYVYDLVSGEQVAALK-----YHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFPG  283 (303)
Q Consensus       232 g~i~iwd~~~~~~~~~~~-----~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~~  283 (303)
                      .+++++|.+..+-..+++     +...-+.+++.-+.|+++|.+=.+|.|..-|...
T Consensus       851 STVKl~DaRsce~~~E~kVcna~~Pna~~R~iaVa~~GN~lAa~LSnGci~~LDaR~  907 (1034)
T KOG4190|consen  851 STVKLFDARSCEWTCELKVCNAPGPNALTRAIAVADKGNKLAAALSNGCIAILDARN  907 (1034)
T ss_pred             hhheeeecccccceeeEEeccCCCCchheeEEEeccCcchhhHHhcCCcEEEEecCC
Confidence            999999999887665554     3445688999999999999999999999888653


No 279
>KOG3914 consensus WD repeat protein WDR4 [Function unknown]
Probab=98.32  E-value=1.7e-06  Score=75.49  Aligned_cols=93  Identities=20%  Similarity=0.234  Sum_probs=73.9

Q ss_pred             hccccccccccCcCcccccCCCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEE-ecccCCeEEEEEccCCC
Q 022074           16 ESLANVTEIHDGLDFSAADDGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRI-LAHTSDVNTVCFGDESG   94 (303)
Q Consensus        16 ~~~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~-~~h~~~v~~l~~~~~~~   94 (303)
                      +|+..|+++.-   +...-.-||-.-++.++++||+++|+++..|+.||+-..+.-.....+ .+|+..|..++..+  +
T Consensus       131 ~~~~di~s~~~---~~~~~~lGhvSml~dVavS~D~~~IitaDRDEkIRvs~ypa~f~IesfclGH~eFVS~isl~~--~  205 (390)
T KOG3914|consen  131 VYSFDILSADS---GRCEPILGHVSMLLDVAVSPDDQFIITADRDEKIRVSRYPATFVIESFCLGHKEFVSTISLTD--N  205 (390)
T ss_pred             ceeeeeecccc---cCcchhhhhhhhhheeeecCCCCEEEEecCCceEEEEecCcccchhhhccccHhheeeeeecc--C
Confidence            46667777633   222233799999999999999999999999999999877665444443 47999999999854  4


Q ss_pred             cEEEEecCCCeEEEEcCcc
Q 022074           95 HLIYSGSDDNLCKVWDRRC  113 (303)
Q Consensus        95 ~~l~s~s~dg~v~lWd~~~  113 (303)
                      ..|+|+|.|++|++||++.
T Consensus       206 ~~LlS~sGD~tlr~Wd~~s  224 (390)
T KOG3914|consen  206 YLLLSGSGDKTLRLWDITS  224 (390)
T ss_pred             ceeeecCCCCcEEEEeccc
Confidence            5689999999999999863


No 280
>PLN02919 haloacid dehalogenase-like hydrolase family protein
Probab=98.30  E-value=0.00028  Score=71.99  Aligned_cols=213  Identities=14%  Similarity=0.113  Sum_probs=118.6

Q ss_pred             ccceEEEEEcCC-CCEEEEeeCCCeEEEEECCCCceEEEEec-cc-------------CCeEEEEEccCCCcEEEEecCC
Q 022074           39 SFGIFSLKFSTD-GRELVAGSSDDCIYVYDLEANKLSLRILA-HT-------------SDVNTVCFGDESGHLIYSGSDD  103 (303)
Q Consensus        39 ~~~v~~l~~s~~-g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~-h~-------------~~v~~l~~~~~~~~~l~s~s~d  103 (303)
                      .++ ..++++++ |+.+++-+..+.|+++|.... ....+.. ..             ..-..+++.++.+.++++-..+
T Consensus       568 ~~P-~gvavd~~~g~lyVaDs~n~rI~v~d~~G~-~i~~ig~~g~~G~~dG~~~~a~f~~P~GIavd~~gn~LYVaDt~n  645 (1057)
T PLN02919        568 KFP-GKLAIDLLNNRLFISDSNHNRIVVTDLDGN-FIVQIGSTGEEGLRDGSFEDATFNRPQGLAYNAKKNLLYVADTEN  645 (1057)
T ss_pred             CCC-ceEEEECCCCeEEEEECCCCeEEEEeCCCC-EEEEEccCCCcCCCCCchhccccCCCcEEEEeCCCCEEEEEeCCC
Confidence            444 46888874 566777777888999998654 3322222 10             1235677754433344444556


Q ss_pred             CeEEEEcCccccCCCccceeec----------cc-------ccCeEEEEeCC-CCCEEEEEeCCCcEEEEEcccccCCcc
Q 022074          104 NLCKVWDRRCLNVKGKPAGVLM----------GH-------LEGITFIDSRG-DGRYLISNGKDQAIKLWDIRKMSSNAS  165 (303)
Q Consensus       104 g~v~lWd~~~~~~~~~~~~~~~----------~h-------~~~v~~~~~~~-~~~~l~s~~~D~~v~lWdl~~~~~~~~  165 (303)
                      +.|+.+|....     .+..+.          +.       -..-..+++++ ++.++++.+.++.|++||.......  
T Consensus       646 ~~Ir~id~~~~-----~V~tlag~G~~g~~~~gg~~~~~~~ln~P~gVa~dp~~g~LyVad~~~~~I~v~d~~~g~v~--  718 (1057)
T PLN02919        646 HALREIDFVNE-----TVRTLAGNGTKGSDYQGGKKGTSQVLNSPWDVCFEPVNEKVYIAMAGQHQIWEYNISDGVTR--  718 (1057)
T ss_pred             ceEEEEecCCC-----EEEEEeccCcccCCCCCChhhhHhhcCCCeEEEEecCCCeEEEEECCCCeEEEEECCCCeEE--
Confidence            78998886421     111111          00       01224677888 5667788888999999997532100  


Q ss_pred             cccCccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCe-EEEEEeCCCeEEEEECCCCeE
Q 022074          166 CNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQK-YIYTGSHDSCVYVYDLVSGEQ  244 (303)
Q Consensus       166 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~-~latg~~dg~i~iwd~~~~~~  244 (303)
                              .+.       ......         ...+...............++++++ ++++.+.++.|++||+.++..
T Consensus       719 --------~~~-------G~G~~~---------~~~g~~~~~~~~~~P~GIavspdG~~LYVADs~n~~Irv~D~~tg~~  774 (1057)
T PLN02919        719 --------VFS-------GDGYER---------NLNGSSGTSTSFAQPSGISLSPDLKELYIADSESSSIRALDLKTGGS  774 (1057)
T ss_pred             --------EEe-------cCCccc---------cCCCCccccccccCccEEEEeCCCCEEEEEECCCCeEEEEECCCCcE
Confidence                    000       000000         0000000000000111234667776 555667789999999987653


Q ss_pred             EEEee-------------c--------CCCCeEEEEECCCCCeEEEEeCCCCEEEeecCCC
Q 022074          245 VAALK-------------Y--------HTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFPGN  284 (303)
Q Consensus       245 ~~~~~-------------~--------h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~~~  284 (303)
                      .....             .        .-.....++++++|+++++-..++.|++||..+.
T Consensus       775 ~~~~gg~~~~~~~l~~fG~~dG~g~~~~l~~P~Gvavd~dG~LYVADs~N~rIrviD~~tg  835 (1057)
T PLN02919        775 RLLAGGDPTFSDNLFKFGDHDGVGSEVLLQHPLGVLCAKDGQIYVADSYNHKIKKLDPATK  835 (1057)
T ss_pred             EEEEecccccCcccccccCCCCchhhhhccCCceeeEeCCCcEEEEECCCCEEEEEECCCC
Confidence            21110             0        0112468999999999999999999999998643


No 281
>COG4946 Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system [Function unknown]
Probab=98.23  E-value=0.00043  Score=62.22  Aligned_cols=118  Identities=21%  Similarity=0.206  Sum_probs=91.1

Q ss_pred             CCcccceEEEEEcCCCCEEEEeeCCC-eEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccc
Q 022074           36 GGYSFGIFSLKFSTDGRELVAGSSDD-CIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCL  114 (303)
Q Consensus        36 ~~~~~~v~~l~~s~~g~~l~sgs~Dg-~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~  114 (303)
                      -||.++|.=..+..+++-++.|..|| .+-|+|..++. .+++...-+.|.++..+ ++++.++.+.....+-+.|+.. 
T Consensus       356 v~~~~~VrY~r~~~~~e~~vigt~dgD~l~iyd~~~~e-~kr~e~~lg~I~av~vs-~dGK~~vvaNdr~el~vididn-  432 (668)
T COG4946         356 VGKKGGVRYRRIQVDPEGDVIGTNDGDKLGIYDKDGGE-VKRIEKDLGNIEAVKVS-PDGKKVVVANDRFELWVIDIDN-  432 (668)
T ss_pred             cCCCCceEEEEEccCCcceEEeccCCceEEEEecCCce-EEEeeCCccceEEEEEc-CCCcEEEEEcCceEEEEEEecC-
Confidence            58999999999999988999999999 89999998886 34566667889999986 4688888888777887778753 


Q ss_pred             cCCCccceeecccccCeEEEEeCCCCCEEEE----EeCCCcEEEEEccc
Q 022074          115 NVKGKPAGVLMGHLEGITFIDSRGDGRYLIS----NGKDQAIKLWDIRK  159 (303)
Q Consensus       115 ~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s----~~~D~~v~lWdl~~  159 (303)
                         +.+.-.-....+-++.++++++++.+|-    |-....|+++|+..
T Consensus       433 ---gnv~~idkS~~~lItdf~~~~nsr~iAYafP~gy~tq~Iklydm~~  478 (668)
T COG4946         433 ---GNVRLIDKSEYGLITDFDWHPNSRWIAYAFPEGYYTQSIKLYDMDG  478 (668)
T ss_pred             ---CCeeEecccccceeEEEEEcCCceeEEEecCcceeeeeEEEEecCC
Confidence               2222222334456889999999998875    44567899999865


No 282
>KOG4547 consensus WD40 repeat-containing protein [General function prediction only]
Probab=98.16  E-value=2.8e-05  Score=70.99  Aligned_cols=117  Identities=14%  Similarity=0.165  Sum_probs=99.7

Q ss_pred             CCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCcccc
Q 022074           36 GGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLN  115 (303)
Q Consensus        36 ~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~  115 (303)
                      .+|-.+|.++.++-+-..|.+++.|..+-.|+.+..+....+......+..++++++ +..+++|+.  +|++||++   
T Consensus        99 ~~h~~~v~~~~~~~~~~ciyS~~ad~~v~~~~~~~~~~~~~~~~~~~~~~sl~is~D-~~~l~~as~--~ik~~~~~---  172 (541)
T KOG4547|consen   99 DKHYGNVNEILDAQRLGCIYSVGADLKVVYILEKEKVIIRIWKEQKPLVSSLCISPD-GKILLTASR--QIKVLDIE---  172 (541)
T ss_pred             CCCCCcceeeecccccCceEecCCceeEEEEecccceeeeeeccCCCccceEEEcCC-CCEEEeccc--eEEEEEcc---
Confidence            578899999999999999999999999999999999998888888889999999765 788888885  89999986   


Q ss_pred             CCCccceeecccccCeEEEEeCCC-----CCEEEE-EeCCCcEEEEEccc
Q 022074          116 VKGKPAGVLMGHLEGITFIDSRGD-----GRYLIS-NGKDQAIKLWDIRK  159 (303)
Q Consensus       116 ~~~~~~~~~~~h~~~v~~~~~~~~-----~~~l~s-~~~D~~v~lWdl~~  159 (303)
                       +.+....|.||...|.++.|-.+     |.++++ ...+.-+.+|-++.
T Consensus       173 -~kevv~~ftgh~s~v~t~~f~~~~~g~~G~~vLssa~~~r~i~~w~v~~  221 (541)
T KOG4547|consen  173 -TKEVVITFTGHGSPVRTLSFTTLIDGIIGKYVLSSAAAERGITVWVVEK  221 (541)
T ss_pred             -CceEEEEecCCCcceEEEEEEEeccccccceeeeccccccceeEEEEEc
Confidence             55677889999999999888655     666655 44678888887664


No 283
>KOG1645 consensus RING-finger-containing E3 ubiquitin ligase [Posttranslational modification, protein turnover, chaperones]
Probab=98.15  E-value=8.1e-05  Score=65.32  Aligned_cols=77  Identities=26%  Similarity=0.333  Sum_probs=67.8

Q ss_pred             CCcccceEEEEEcCCCC-EEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCcc
Q 022074           36 GGYSFGIFSLKFSTDGR-ELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRC  113 (303)
Q Consensus        36 ~~~~~~v~~l~~s~~g~-~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~  113 (303)
                      .+|..-|..++|||..+ .+..++-+.+|.|.|+++.........+ ..+.+++|.-++.+.+..|..+|.|.+||.|.
T Consensus       190 p~~g~~IrdlafSp~~~GLl~~asl~nkiki~dlet~~~vssy~a~-~~~wSC~wDlde~h~IYaGl~nG~VlvyD~R~  267 (463)
T KOG1645|consen  190 PGEGSFIRDLAFSPFNEGLLGLASLGNKIKIMDLETSCVVSSYIAY-NQIWSCCWDLDERHVIYAGLQNGMVLVYDMRQ  267 (463)
T ss_pred             cccchhhhhhccCccccceeeeeccCceEEEEecccceeeeheecc-CCceeeeeccCCcceeEEeccCceEEEEEccC
Confidence            57888999999999776 7888999999999999999877777777 67899999777788999999999999999873


No 284
>PF04762 IKI3:  IKI3 family;  InterPro: IPR006849 Members of this family are components of the elongator multi-subunit component of a novel RNA polymerase II holoenzyme for transcriptional elongation [].
Probab=98.13  E-value=0.0014  Score=66.04  Aligned_cols=198  Identities=16%  Similarity=0.236  Sum_probs=123.3

Q ss_pred             cccceEEEEEcCCCCEEEEeeCCCeEEEE----ECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCc-
Q 022074           38 YSFGIFSLKFSTDGRELVAGSSDDCIYVY----DLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRR-  112 (303)
Q Consensus        38 ~~~~v~~l~~s~~g~~l~sgs~Dg~v~lw----d~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~-  112 (303)
                      -...|.++.+-++...++.+..+|.|.+.    +..+.... ..-.-+.+|.+.+|+|+ .++|+.++.++++.+-... 
T Consensus        74 ~~~~ivs~~yl~d~~~l~~~~~~Gdi~~~~~~~~~~~~~~E-~VG~vd~GI~a~~WSPD-~Ella~vT~~~~l~~mt~~f  151 (928)
T PF04762_consen   74 PNDKIVSFQYLADSESLCIALASGDIILVREDPDPDEDEIE-IVGSVDSGILAASWSPD-EELLALVTGEGNLLLMTRDF  151 (928)
T ss_pred             CCCcEEEEEeccCCCcEEEEECCceEEEEEccCCCCCceeE-EEEEEcCcEEEEEECCC-cCEEEEEeCCCEEEEEeccc
Confidence            34679999999999999999999999999    55444322 23345679999999865 6788888888888764311 


Q ss_pred             -----------ccc---------------CC---Ccc--------------ceeecccccCeEEEEeCCCCCEEEEEeC-
Q 022074          113 -----------CLN---------------VK---GKP--------------AGVLMGHLEGITFIDSRGDGRYLISNGK-  148 (303)
Q Consensus       113 -----------~~~---------------~~---~~~--------------~~~~~~h~~~v~~~~~~~~~~~l~s~~~-  148 (303)
                                 ...               ..   ++.              ...+. +.+.-..++|..||.++|+.+. 
T Consensus       152 d~i~E~~l~~~~~~~~~~VsVGWGkKeTQF~Gs~gK~aa~~~~~p~~~~~d~~~~s-~dd~~~~ISWRGDG~yFAVss~~  230 (928)
T PF04762_consen  152 DPISEVPLDSDDFGESKHVSVGWGKKETQFHGSAGKAAARQLRDPTVPKVDEGKLS-WDDGRVRISWRGDGEYFAVSSVE  230 (928)
T ss_pred             eEEEEeecCccccCCCceeeeccCcccCccCcchhhhhhhhccCCCCCccccCccc-cCCCceEEEECCCCcEEEEEEEE
Confidence                       000               00   000              00111 2334557889999999988775 


Q ss_pred             ---C--CcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCe
Q 022074          149 ---D--QAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQK  223 (303)
Q Consensus       149 ---D--~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~  223 (303)
                         +  +.+|+|+-.. ......                           + .+..+            .-...+.|.|.
T Consensus       231 ~~~~~~R~iRVy~ReG-~L~stS---------------------------E-~v~gL------------e~~l~WrPsG~  269 (928)
T PF04762_consen  231 PETGSRRVIRVYSREG-ELQSTS---------------------------E-PVDGL------------EGALSWRPSGN  269 (928)
T ss_pred             cCCCceeEEEEECCCc-eEEecc---------------------------c-cCCCc------------cCCccCCCCCC
Confidence               2  5788887421 100000                           0 00000            00122456777


Q ss_pred             EEEEEeC---CCeEEEEECCCCeEEEEee----cCCCCeEEEEECCCCCeEEEEeCCCCEEEeec
Q 022074          224 YIYTGSH---DSCVYVYDLVSGEQVAALK----YHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEF  281 (303)
Q Consensus       224 ~latg~~---dg~i~iwd~~~~~~~~~~~----~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~  281 (303)
                      ++|+.-.   ...|.+|. ++|-.-..|.    .....|..++||+|+..||..-.|. +++|..
T Consensus       270 lIA~~q~~~~~~~VvFfE-rNGLrhgeF~l~~~~~~~~v~~l~Wn~ds~iLAv~~~~~-vqLWt~  332 (928)
T PF04762_consen  270 LIASSQRLPDRHDVVFFE-RNGLRHGEFTLRFDPEEEKVIELAWNSDSEILAVWLEDR-VQLWTR  332 (928)
T ss_pred             EEEEEEEcCCCcEEEEEe-cCCcEeeeEecCCCCCCceeeEEEECCCCCEEEEEecCC-ceEEEe
Confidence            7777542   34455554 5665544443    3356899999999999999987665 999974


No 285
>PF10282 Lactonase:  Lactonase, 7-bladed beta-propeller;  InterPro: IPR019405  6-phosphogluconolactonases (6PGL) 3.1.1.31 from EC, which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate is one of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE IPR022528 from INTERPRO [] and the Pseudomonas aeruginosa DevB IPR005900 from INTERPRO [] types.  This entry contains bacterial 6-phosphogluconolactonases (6PGL) YbhE-type 3.1.1.31 from EC which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase 5.5.1.5 from EC and muconate cycloisomerase 5.5.1.1 from EC, which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold [].; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H.
Probab=98.10  E-value=0.002  Score=57.78  Aligned_cols=184  Identities=19%  Similarity=0.265  Sum_probs=103.1

Q ss_pred             cceEEEEEcCCCCEEEEee-CCCeEEEEECCCCc--eEE--EE-ecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCcc
Q 022074           40 FGIFSLKFSTDGRELVAGS-SDDCIYVYDLEANK--LSL--RI-LAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRC  113 (303)
Q Consensus        40 ~~v~~l~~s~~g~~l~sgs-~Dg~v~lwd~~~~~--~~~--~~-~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~  113 (303)
                      ....++.|+|+|+++++.. ....|++|+++...  +..  .+ .....+-..+.|+++.....+....+++|.++++..
T Consensus       144 ~h~H~v~~~pdg~~v~v~dlG~D~v~~~~~~~~~~~l~~~~~~~~~~G~GPRh~~f~pdg~~~Yv~~e~s~~v~v~~~~~  223 (345)
T PF10282_consen  144 PHPHQVVFSPDGRFVYVPDLGADRVYVYDIDDDTGKLTPVDSIKVPPGSGPRHLAFSPDGKYAYVVNELSNTVSVFDYDP  223 (345)
T ss_dssp             TCEEEEEE-TTSSEEEEEETTTTEEEEEEE-TTS-TEEEEEEEECSTTSSEEEEEE-TTSSEEEEEETTTTEEEEEEEET
T ss_pred             ccceeEEECCCCCEEEEEecCCCEEEEEEEeCCCceEEEeeccccccCCCCcEEEEcCCcCEEEEecCCCCcEEEEeecc
Confidence            3468899999999888864 23469999997765  322  12 234457789999765444455667788999998651


Q ss_pred             ccCCCccceeec----ccc--cCeEEEEeCCCCCEEEEEe-CCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCC
Q 022074          114 LNVKGKPAGVLM----GHL--EGITFIDSRGDGRYLISNG-KDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQA  186 (303)
Q Consensus       114 ~~~~~~~~~~~~----~h~--~~v~~~~~~~~~~~l~s~~-~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  186 (303)
                      ............    +..  .....+.++|+|++|..+. ...+|-+|++........                     
T Consensus       224 ~~g~~~~~~~~~~~~~~~~~~~~~~~i~ispdg~~lyvsnr~~~sI~vf~~d~~~g~l~---------------------  282 (345)
T PF10282_consen  224 SDGSLTEIQTISTLPEGFTGENAPAEIAISPDGRFLYVSNRGSNSISVFDLDPATGTLT---------------------  282 (345)
T ss_dssp             TTTEEEEEEEEESCETTSCSSSSEEEEEE-TTSSEEEEEECTTTEEEEEEECTTTTTEE---------------------
T ss_pred             cCCceeEEEEeeeccccccccCCceeEEEecCCCEEEEEeccCCEEEEEEEecCCCceE---------------------
Confidence            111111111111    111  2466788999999876655 577899998843110000                     


Q ss_pred             ccccCCCCCcceEEe-cccceeeeEEEeeeeeeeCCCeEEEEEe-CCCeEEEEEC--CCCeEEEEee-cCCCCeEEEEE
Q 022074          187 RDLKHPCDQSVATYK-GHSVLRTLIRCHFSPVYSTGQKYIYTGS-HDSCVYVYDL--VSGEQVAALK-YHTSPVRDCSW  260 (303)
Q Consensus       187 ~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~s~~~~~latg~-~dg~i~iwd~--~~~~~~~~~~-~h~~~I~~v~~  260 (303)
                               .+.... +...       .....+++++++|++++ .++.|.+|++  ++|.+...-. ..-....||.|
T Consensus       283 ---------~~~~~~~~G~~-------Pr~~~~s~~g~~l~Va~~~s~~v~vf~~d~~tG~l~~~~~~~~~~~p~ci~f  345 (345)
T PF10282_consen  283 ---------LVQTVPTGGKF-------PRHFAFSPDGRYLYVANQDSNTVSVFDIDPDTGKLTPVGSSVPIPSPVCIVF  345 (345)
T ss_dssp             ---------EEEEEEESSSS-------EEEEEE-TTSSEEEEEETTTTEEEEEEEETTTTEEEEEEEEEESSSEEEEEE
T ss_pred             ---------EEEEEeCCCCC-------ccEEEEeCCCCEEEEEecCCCeEEEEEEeCCCCcEEEecccccCCCCEEEeC
Confidence                     000000 0000       11223678999988876 6778999976  5776533321 22345666665


No 286
>COG5170 CDC55 Serine/threonine protein phosphatase 2A, regulatory subunit [Signal transduction mechanisms]
Probab=98.07  E-value=4e-05  Score=65.20  Aligned_cols=124  Identities=21%  Similarity=0.317  Sum_probs=87.0

Q ss_pred             CCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceE---EEEeccc-----CCeEEEEEccCCCcEEEEecCCCeEE
Q 022074           36 GGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLS---LRILAHT-----SDVNTVCFGDESGHLIYSGSDDNLCK  107 (303)
Q Consensus        36 ~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~---~~~~~h~-----~~v~~l~~~~~~~~~l~s~s~dg~v~  107 (303)
                      ++|..-|.+++|+.|.+.++++ .|=.|.+|.+.--...   .-+.+|+     ..++...|+|..-++|.-.+..|+|+
T Consensus       169 NaH~yhiNSiS~NsD~et~lSa-DdLrINLWnl~i~D~sFnIVDiKP~nmeeLteVItSaeFhp~~cn~fmYSsSkG~Ik  247 (460)
T COG5170         169 NAHPYHINSISFNSDKETLLSA-DDLRINLWNLEIIDGSFNIVDIKPHNMEELTEVITSAEFHPEMCNVFMYSSSKGEIK  247 (460)
T ss_pred             ccceeEeeeeeecCchheeeec-cceeeeeccccccCCceEEEeccCccHHHHHHHHhhcccCHhHcceEEEecCCCcEE
Confidence            6899999999999999988876 4667999987643222   2234454     35677789887677787888899999


Q ss_pred             EEcCccccCCCcccee------------ecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEccccc
Q 022074          108 VWDRRCLNVKGKPAGV------------LMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMS  161 (303)
Q Consensus       108 lWd~~~~~~~~~~~~~------------~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~  161 (303)
                      +-|+|....-..+...            +.+-..++..+.|+++|+|+++-.. -+|++||++..+
T Consensus       248 l~DlRq~alcdn~~klfe~~~D~v~~~ff~eivsSISD~kFs~ngryIlsRdy-ltvkiwDvnm~k  312 (460)
T COG5170         248 LNDLRQSALCDNSKKLFELTIDGVDVDFFEEIVSSISDFKFSDNGRYILSRDY-LTVKIWDVNMAK  312 (460)
T ss_pred             ehhhhhhhhccCchhhhhhccCcccchhHHHHhhhhcceEEcCCCcEEEEecc-ceEEEEeccccc
Confidence            9998732111111111            1223456777889999999988765 789999998643


No 287
>TIGR02658 TTQ_MADH_Hv methylamine dehydrogenase heavy chain. This family consists of the heavy chain of methylamine dehydrogenase, a periplasmic enzyme. The enzyme contains a tryptophan tryptophylquinone (TTQ) prosthetic group derived from two Trp residues in the light subunit. The enzyme forms a complex with the type I blue copper protein amicyanin and a cytochrome. Electron transfer proceeds from TTQ to the copper and then to the heme group of the cytochrome.
Probab=98.06  E-value=0.0039  Score=55.63  Aligned_cols=96  Identities=13%  Similarity=0.036  Sum_probs=61.7

Q ss_pred             CeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEec---------CCCeEEEEcCccccCCCccceeecc-----
Q 022074           61 DCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGS---------DDNLCKVWDRRCLNVKGKPAGVLMG-----  126 (303)
Q Consensus        61 g~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s---------~dg~v~lWd~~~~~~~~~~~~~~~~-----  126 (303)
                      ++|.+.|..+++....+..-..+- .+ ++++...+.++.+         .+..|.+||.....    ....+.-     
T Consensus        27 ~~v~ViD~~~~~v~g~i~~G~~P~-~~-~spDg~~lyva~~~~~R~~~G~~~d~V~v~D~~t~~----~~~~i~~p~~p~  100 (352)
T TIGR02658        27 TQVYTIDGEAGRVLGMTDGGFLPN-PV-VASDGSFFAHASTVYSRIARGKRTDYVEVIDPQTHL----PIADIELPEGPR  100 (352)
T ss_pred             ceEEEEECCCCEEEEEEEccCCCc-ee-ECCCCCEEEEEeccccccccCCCCCEEEEEECccCc----EEeEEccCCCch
Confidence            789999999988766555322222 23 6665444444555         58899999986432    2222211     


Q ss_pred             --cccCeEEEEeCCCCCEEEEEe-C-CCcEEEEEcccccC
Q 022074          127 --HLEGITFIDSRGDGRYLISNG-K-DQAIKLWDIRKMSS  162 (303)
Q Consensus       127 --h~~~v~~~~~~~~~~~l~s~~-~-D~~v~lWdl~~~~~  162 (303)
                        ....-..++++++|++|+... . +..|-+.|+...+.
T Consensus       101 ~~~~~~~~~~~ls~dgk~l~V~n~~p~~~V~VvD~~~~kv  140 (352)
T TIGR02658       101 FLVGTYPWMTSLTPDNKTLLFYQFSPSPAVGVVDLEGKAF  140 (352)
T ss_pred             hhccCccceEEECCCCCEEEEecCCCCCEEEEEECCCCcE
Confidence              112233567899999988766 4 79999999986543


No 288
>KOG1832 consensus HIV-1 Vpr-binding protein [Cell cycle control, cell division, chromosome partitioning]
Probab=98.04  E-value=5.3e-06  Score=79.15  Aligned_cols=116  Identities=16%  Similarity=0.252  Sum_probs=80.6

Q ss_pred             CcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCC-eEEEEcCcccc
Q 022074           37 GYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDN-LCKVWDRRCLN  115 (303)
Q Consensus        37 ~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg-~v~lWd~~~~~  115 (303)
                      -|+....|++|+.+.++|++|+..|.|++|++.+|........|...|+.+..+.+...+|.++++.. -..+|+...  
T Consensus      1099 d~~~~fTc~afs~~~~hL~vG~~~Geik~~nv~sG~~e~s~ncH~SavT~vePs~dgs~~Ltsss~S~PlsaLW~~~s-- 1176 (1516)
T KOG1832|consen 1099 DETALFTCIAFSGGTNHLAVGSHAGEIKIFNVSSGSMEESVNCHQSAVTLVEPSVDGSTQLTSSSSSSPLSALWDASS-- 1176 (1516)
T ss_pred             ccccceeeEEeecCCceEEeeeccceEEEEEccCccccccccccccccccccccCCcceeeeeccccCchHHHhcccc--
Confidence            45578999999999999999999999999999999988888899999999987544333444444444 567999753  


Q ss_pred             CCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEccc
Q 022074          116 VKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRK  159 (303)
Q Consensus       116 ~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~  159 (303)
                       ...+...|    ..-.++.|+..-..-+.|..-....+||+..
T Consensus      1177 -~~~~~Hsf----~ed~~vkFsn~~q~r~~gt~~d~a~~YDvqT 1215 (1516)
T KOG1832|consen 1177 -TGGPRHSF----DEDKAVKFSNSLQFRALGTEADDALLYDVQT 1215 (1516)
T ss_pred             -ccCccccc----cccceeehhhhHHHHHhcccccceEEEeccc
Confidence             22233233    2234555655432233344446788999865


No 289
>KOG1912 consensus WD40 repeat protein [General function prediction only]
Probab=98.02  E-value=0.00045  Score=65.48  Aligned_cols=237  Identities=17%  Similarity=0.140  Sum_probs=136.2

Q ss_pred             cceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCC-----------CcEEEEecCCCeEEE
Q 022074           40 FGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDES-----------GHLIYSGSDDNLCKV  108 (303)
Q Consensus        40 ~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~-----------~~~l~s~s~dg~v~l  108 (303)
                      ....+++|+|.|- ++-| .-..|.+.|..+-+.+..+..|...|+.+.|.|..           --+++++.-.|.|.+
T Consensus        16 sN~~A~Dw~~~GL-iAyg-shslV~VVDs~s~q~iqsie~h~s~V~~VrWap~~~p~~llS~~~~~lliAsaD~~GrIil   93 (1062)
T KOG1912|consen   16 SNRNAADWSPSGL-IAYG-SHSLVSVVDSRSLQLIQSIELHQSAVTSVRWAPAPSPRDLLSPSSSQLLIASADISGRIIL   93 (1062)
T ss_pred             ccccccccCccce-EEEe-cCceEEEEehhhhhhhhccccCccceeEEEeccCCCchhccCccccceeEEeccccCcEEE
Confidence            3467889999873 3334 44569999999988888888899999999996531           125778888999999


Q ss_pred             EcCccccCCCccceeecccccCeEEEEe---CCCC-CEEEEEeCCCcEEEEEcccccCCcccccC------ccceeeece
Q 022074          109 WDRRCLNVKGKPAGVLMGHLEGITFIDS---RGDG-RYLISNGKDQAIKLWDIRKMSSNASCNLG------FRSYEWDYR  178 (303)
Q Consensus       109 Wd~~~~~~~~~~~~~~~~h~~~v~~~~~---~~~~-~~l~s~~~D~~v~lWdl~~~~~~~~~~~~------~~~~~~~~~  178 (303)
                      ||..    ....+..+..|.+++-.+.|   .++. ..|+.-..-..+-+|+........+....      ++...|+.+
T Consensus        94 ~d~~----~~s~~~~l~~~~~~~qdl~W~~~rd~Srd~LlaIh~ss~lvLwntdtG~k~Wk~~ys~~iLs~f~~DPfd~r  169 (1062)
T KOG1912|consen   94 VDFV----LASVINWLSHSNDSVQDLCWVPARDDSRDVLLAIHGSSTLVLWNTDTGEKFWKYDYSHEILSCFRVDPFDSR  169 (1062)
T ss_pred             EEeh----hhhhhhhhcCCCcchhheeeeeccCcchheeEEecCCcEEEEEEccCCceeeccccCCcceeeeeeCCCCcc
Confidence            9975    22334445556666544433   2333 45667677788899977654433332211      111111111


Q ss_pred             eeeC----------------CCCC--ccc--cCCCCC----cceEEecccceee-----eEEEeeeeeeeCCCeEEEEEe
Q 022074          179 WMDY----------------PPQA--RDL--KHPCDQ----SVATYKGHSVLRT-----LIRCHFSPVYSTGQKYIYTGS  229 (303)
Q Consensus       179 ~~~~----------------~~~~--~~~--~~~~~~----~~~~~~~~~~~~~-----~~~~~~~~~~s~~~~~latg~  229 (303)
                      .+.+                +|..  +.+  ...+..    ...+..|......     .+.....-.|+|.-+.++-..
T Consensus       170 h~~~l~s~g~vl~~~~l~~sep~~pgk~~qI~sd~Sdl~~lere~at~ns~ts~~~sa~fity~a~faf~p~~rn~lfi~  249 (1062)
T KOG1912|consen  170 HFCVLGSKGFVLSCKDLGLSEPDVPGKEFQITSDHSDLAHLERETATGNSTTSTPASAYFITYCAQFAFSPHWRNILFIT  249 (1062)
T ss_pred             eEEEEccCceEEEEeccCCCCCCCCceeEEEecCccchhhhhhhhhccccccCCCcchhHHHHHHhhhcChhhhceEEEE
Confidence            1100                0100  000  000000    0000001000000     000000112444444444445


Q ss_pred             CCCeEEEEECCCCeEEEEeecCCCCeEEEEECCCCC--eEEEEeCCCCEEEeecC
Q 022074          230 HDSCVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQP--MLVSSSWDGDVVRWEFP  282 (303)
Q Consensus       230 ~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~--~las~s~Dg~i~~Wd~~  282 (303)
                      --..+.++|++-...+....-..+.+.=+.|-|+++  .|.+.=.||.+.+|--+
T Consensus       250 ~prellv~dle~~~~l~vvpier~~akfv~vlP~~~rd~LfclH~nG~ltirvrk  304 (1062)
T KOG1912|consen  250 FPRELLVFDLEYECCLAVVPIERGGAKFVDVLPDPRRDALFCLHSNGRLTIRVRK  304 (1062)
T ss_pred             eccceEEEcchhhceeEEEEeccCCcceeEeccCCCcceEEEEecCCeEEEEEee
Confidence            567799999988888888776666667778888875  79999999999999754


No 290
>KOG1008 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=97.97  E-value=5.4e-06  Score=76.49  Aligned_cols=203  Identities=13%  Similarity=0.221  Sum_probs=122.4

Q ss_pred             cceEEEEEcCCCC--EEEEeeCCCeEEEEECCCCceE--EEEecccCCeEEEEEccCCCcEEEEec----CCCeEEEEcC
Q 022074           40 FGIFSLKFSTDGR--ELVAGSSDDCIYVYDLEANKLS--LRILAHTSDVNTVCFGDESGHLIYSGS----DDNLCKVWDR  111 (303)
Q Consensus        40 ~~v~~l~~s~~g~--~l~sgs~Dg~v~lwd~~~~~~~--~~~~~h~~~v~~l~~~~~~~~~l~s~s----~dg~v~lWd~  111 (303)
                      +.+.|+++.-+.+  .+++|..+|.|-+-........  ....+|...+++++|++-+.+.|+.|-    .|..+.+||+
T Consensus        57 qy~kcva~~y~~d~cIlavG~atG~I~l~s~r~~hdSs~E~tp~~ar~Ct~lAwneLDtn~LAagldkhrnds~~~Iwdi  136 (783)
T KOG1008|consen   57 QYVKCVASFYGNDRCILAVGSATGNISLLSVRHPHDSSAEVTPGYARPCTSLAWNELDTNHLAAGLDKHRNDSSLKIWDI  136 (783)
T ss_pred             CCceeehhhcCCchhhhhhccccCceEEeecCCcccccceecccccccccccccccccHHHHHhhhhhhcccCCccceec
Confidence            4578888775443  7889999999999877554322  234567788999999876667777663    3678899998


Q ss_pred             ccccCCCccceeecc-cccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCcccc
Q 022074          112 RCLNVKGKPAGVLMG-HLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLK  190 (303)
Q Consensus       112 ~~~~~~~~~~~~~~~-h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  190 (303)
                      ...-...+....+.+ -.++..++.+..+.+++++|..-+.+.++|+|.....  ++. +              +++   
T Consensus       137 ~s~ltvPke~~~fs~~~l~gqns~cwlrd~klvlaGm~sr~~~ifdlRqs~~~--~~s-v--------------nTk---  196 (783)
T KOG1008|consen  137 NSLLTVPKESPLFSSSTLDGQNSVCWLRDTKLVLAGMTSRSVHIFDLRQSLDS--VSS-V--------------NTK---  196 (783)
T ss_pred             ccccCCCccccccccccccCccccccccCcchhhcccccchhhhhhhhhhhhh--hhh-h--------------hhh---
Confidence            532101011112222 3345556667778888999999999999999842110  000 0              000   


Q ss_pred             CCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEE-CCCCeE-EEEeecCC-----CCeEEEEECCC
Q 022074          191 HPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYD-LVSGEQ-VAALKYHT-----SPVRDCSWHPS  263 (303)
Q Consensus       191 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd-~~~~~~-~~~~~~h~-----~~I~~v~~sp~  263 (303)
                              ...|..    +     .| |+  ..|+++ ..||.|.+|| ..+-+. +..+ .|.     ..+..++|.|.
T Consensus       197 --------~vqG~t----V-----dp-~~--~nY~cs-~~dg~iAiwD~~rnienpl~~i-~~~~N~~~~~l~~~aycPt  254 (783)
T KOG1008|consen  197 --------YVQGIT----V-----DP-FS--PNYFCS-NSDGDIAIWDTYRNIENPLQII-LRNENKKPKQLFALAYCPT  254 (783)
T ss_pred             --------hcccce----e-----cC-CC--CCceec-cccCceeeccchhhhccHHHHH-hhCCCCcccceeeEEeccC
Confidence                    000000    0     01 22  335554 4599999999 333322 2111 222     24899999998


Q ss_pred             CC-eEEEEeC-CCCEEEeecCCC
Q 022074          264 QP-MLVSSSW-DGDVVRWEFPGN  284 (303)
Q Consensus       264 ~~-~las~s~-Dg~i~~Wd~~~~  284 (303)
                      .+ ++++... .++|++.|+...
T Consensus       255 rtglla~l~RdS~tIrlydi~~v  277 (783)
T KOG1008|consen  255 RTGLLAVLSRDSITIRLYDICVV  277 (783)
T ss_pred             CcchhhhhccCcceEEEeccccc
Confidence            65 4555444 478999997643


No 291
>smart00320 WD40 WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Probab=97.97  E-value=3e-05  Score=44.11  Aligned_cols=39  Identities=38%  Similarity=0.655  Sum_probs=34.6

Q ss_pred             CeEEEEeecCCCCeEEEEECCCCCeEEEEeCCCCEEEee
Q 022074          242 GEQVAALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWE  280 (303)
Q Consensus       242 ~~~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd  280 (303)
                      ++.+..+..|...|+++.|+++++++++++.|+.+++|+
T Consensus         2 ~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~d~~~~~~~   40 (40)
T smart00320        2 GELLKTLKGHTGPVTSVAFSPDGKYLASASDDGTIKLWD   40 (40)
T ss_pred             cEEEEEEEecCCceeEEEECCCCCEEEEecCCCeEEEcC
Confidence            345667778999999999999999999999999999996


No 292
>KOG4532 consensus WD40-like repeat containing protein [General function prediction only]
Probab=97.92  E-value=0.0022  Score=53.78  Aligned_cols=103  Identities=11%  Similarity=0.020  Sum_probs=71.9

Q ss_pred             EEEEeeCCCeEEEEECCCCceEEEEecccCC--eEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCccce-eeccccc
Q 022074           53 ELVAGSSDDCIYVYDLEANKLSLRILAHTSD--VNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAG-VLMGHLE  129 (303)
Q Consensus        53 ~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~--v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~-~~~~h~~  129 (303)
                      .+.-++.|.+++++++..+...  ...|...  ++.+.+++ +++++++.+....|..|.+.....  .... .+..-++
T Consensus       130 ~~~i~sndht~k~~~~~~~s~~--~~~h~~~~~~ns~~~sn-d~~~~~~Vgds~~Vf~y~id~~se--y~~~~~~a~t~D  204 (344)
T KOG4532|consen  130 PLNIASNDHTGKTMVVSGDSNK--FAVHNQNLTQNSLHYSN-DPSWGSSVGDSRRVFRYAIDDESE--YIENIYEAPTSD  204 (344)
T ss_pred             ceeeccCCcceeEEEEecCccc--ceeeccccceeeeEEcC-CCceEEEecCCCcceEEEeCCccc--eeeeeEecccCC
Confidence            3666788888988888766443  3334443  77888864 588998999888999997752111  1111 2222334


Q ss_pred             CeEEEEeCCCCCEEEEEeCCCcEEEEEcccc
Q 022074          130 GITFIDSRGDGRYLISNGKDQAIKLWDIRKM  160 (303)
Q Consensus       130 ~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~  160 (303)
                      .=-+.+|+....++|++..||++.|||+|.+
T Consensus       205 ~gF~~S~s~~~~~FAv~~Qdg~~~I~DVR~~  235 (344)
T KOG4532|consen  205 HGFYNSFSENDLQFAVVFQDGTCAIYDVRNM  235 (344)
T ss_pred             CceeeeeccCcceEEEEecCCcEEEEEeccc
Confidence            4456778888999999999999999999864


No 293
>KOG2315 consensus Predicted translation initiation factor related to eIF-3a [Translation, ribosomal structure and biogenesis]
Probab=97.90  E-value=0.0003  Score=64.15  Aligned_cols=111  Identities=20%  Similarity=0.322  Sum_probs=78.1

Q ss_pred             ccceEEEEEcCCCCEEEEe--eCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecC---CCeEEEEcCcc
Q 022074           39 SFGIFSLKFSTDGRELVAG--SSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSD---DNLCKVWDRRC  113 (303)
Q Consensus        39 ~~~v~~l~~s~~g~~l~sg--s~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~---dg~v~lWd~~~  113 (303)
                      ++||+++.|+++|+.++++  =.=.++.|||++..- +.  .--.+.=+++.|+| .+++++-++.   .|.+-+||...
T Consensus       270 ~GPVhdv~W~~s~~EF~VvyGfMPAkvtifnlr~~~-v~--df~egpRN~~~fnp-~g~ii~lAGFGNL~G~mEvwDv~n  345 (566)
T KOG2315|consen  270 EGPVHDVTWSPSGREFAVVYGFMPAKVTIFNLRGKP-VF--DFPEGPRNTAFFNP-HGNIILLAGFGNLPGDMEVWDVPN  345 (566)
T ss_pred             CCCceEEEECCCCCEEEEEEecccceEEEEcCCCCE-eE--eCCCCCccceEECC-CCCEEEEeecCCCCCceEEEeccc
Confidence            5799999999999865553  345689999996653 22  22335567888975 6788776654   68999999752


Q ss_pred             ccCCCccceeecccccCeEEEEeCCCCCEEEEEeC------CCcEEEEEccc
Q 022074          114 LNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGK------DQAIKLWDIRK  159 (303)
Q Consensus       114 ~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~------D~~v~lWdl~~  159 (303)
                          .+.+..+  ....-+-+.|+|||++|+|+..      |..++||+...
T Consensus       346 ----~K~i~~~--~a~~tt~~eW~PdGe~flTATTaPRlrvdNg~KiwhytG  391 (566)
T KOG2315|consen  346 ----RKLIAKF--KAANTTVFEWSPDGEYFLTATTAPRLRVDNGIKIWHYTG  391 (566)
T ss_pred             ----hhhcccc--ccCCceEEEEcCCCcEEEEEeccccEEecCCeEEEEecC
Confidence                2223333  1233456689999999998775      89999998753


No 294
>KOG2314 consensus Translation initiation factor 3, subunit b (eIF-3b) [Translation, ribosomal structure and biogenesis]
Probab=97.89  E-value=0.0011  Score=60.81  Aligned_cols=68  Identities=21%  Similarity=0.330  Sum_probs=50.1

Q ss_pred             eeeeCCCeEEEEE---eCCCeEEEEECCCCeE-EEEeecCCCCeEEEEECCCCCeEEEEeC------CCCEEEeecCCC
Q 022074          216 PVYSTGQKYIYTG---SHDSCVYVYDLVSGEQ-VAALKYHTSPVRDCSWHPSQPMLVSSSW------DGDVVRWEFPGN  284 (303)
Q Consensus       216 ~~~s~~~~~latg---~~dg~i~iwd~~~~~~-~~~~~~h~~~I~~v~~sp~~~~las~s~------Dg~i~~Wd~~~~  284 (303)
                      .-++|.|++++.+   |..|.+.++|....+. ......| ...+.+.|.|+|++++|++.      |.--++|++++.
T Consensus       498 vfwsPkG~fvvva~l~s~~g~l~F~D~~~a~~k~~~~~eh-~~at~veWDPtGRYvvT~ss~wrhk~d~GYri~tfqGr  575 (698)
T KOG2314|consen  498 VFWSPKGRFVVVAALVSRRGDLEFYDTDYADLKDTASPEH-FAATEVEWDPTGRYVVTSSSSWRHKVDNGYRIFTFQGR  575 (698)
T ss_pred             EEEcCCCcEEEEEEecccccceEEEecchhhhhhccCccc-cccccceECCCCCEEEEeeehhhhccccceEEEEeecH
Confidence            4478899998876   4678999999874332 1112234 35689999999999999886      566778998876


No 295
>COG5354 Uncharacterized protein, contains Trp-Asp (WD) repeat [General function prediction only]
Probab=97.89  E-value=0.0046  Score=56.16  Aligned_cols=197  Identities=14%  Similarity=0.202  Sum_probs=118.7

Q ss_pred             cceEEEEEcCCCC--EEEE-----eeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEe------c-----
Q 022074           40 FGIFSLKFSTDGR--ELVA-----GSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSG------S-----  101 (303)
Q Consensus        40 ~~v~~l~~s~~g~--~l~s-----gs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~------s-----  101 (303)
                      .+|..-+|+|.|+  .|+.     .+.++.++||.+..+....+-.-..-.-..+.|++ .++.++.=      +     
T Consensus       174 ~gi~dFsisP~~n~~~la~~tPEk~~kpa~~~i~sIp~~s~l~tk~lfk~~~~qLkW~~-~g~~ll~l~~t~~ksnKsyf  252 (561)
T COG5354         174 VGILDFSISPEGNHDELAYWTPEKLNKPAMVRILSIPKNSVLVTKNLFKVSGVQLKWQV-LGKYLLVLVMTHTKSNKSYF  252 (561)
T ss_pred             cceeeEEecCCCCCceEEEEccccCCCCcEEEEEEccCCCeeeeeeeEeecccEEEEec-CCceEEEEEEEeeeccccee
Confidence            4577788888643  3333     35688899999986654322111111224556654 34433211      1     


Q ss_pred             CCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEe--CCCcEEEEEcccccCCcccccCccceeeecee
Q 022074          102 DDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNG--KDQAIKLWDIRKMSSNASCNLGFRSYEWDYRW  179 (303)
Q Consensus       102 ~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~--~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~  179 (303)
                      ....+.|++++     .+-+....+-.+.|..+.|.|.++.+++.+  .+.++-++|++.-     .       .+    
T Consensus       253 gesnLyl~~~~-----e~~i~V~~~~~~pVhdf~W~p~S~~F~vi~g~~pa~~s~~~lr~N-----l-------~~----  311 (561)
T COG5354         253 GESNLYLLRIT-----ERSIPVEKDLKDPVHDFTWEPLSSRFAVISGYMPASVSVFDLRGN-----L-------RF----  311 (561)
T ss_pred             ccceEEEEeec-----ccccceeccccccceeeeecccCCceeEEecccccceeecccccc-----e-------EE----
Confidence            12456677654     222333334567899999999888886655  6778888887631     0       00    


Q ss_pred             eeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEe---CCCeEEEEECCCCeEE-EEeecCCCCe
Q 022074          180 MDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGS---HDSCVYVYDLVSGEQV-AALKYHTSPV  255 (303)
Q Consensus       180 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~---~dg~i~iwd~~~~~~~-~~~~~h~~~I  255 (303)
                       .+|++.+                          ..+.|||.+++++.++   ..|.|-+||...+-++ ..+.+.  ..
T Consensus       312 -~~Pe~~r--------------------------NT~~fsp~~r~il~agF~nl~gni~i~~~~~rf~~~~~~~~~--n~  362 (561)
T COG5354         312 -YFPEQKR--------------------------NTIFFSPHERYILFAGFDNLQGNIEIFDPAGRFKVAGAFNGL--NT  362 (561)
T ss_pred             -ecCCccc--------------------------ccccccCcccEEEEecCCccccceEEeccCCceEEEEEeecC--Cc
Confidence             0111110                          1234777888888866   4678999998766543 366554  35


Q ss_pred             EEEEECCCCCeEEEEeC------CCCEEEeecCCCCcc
Q 022074          256 RDCSWHPSQPMLVSSSW------DGDVVRWEFPGNGEA  287 (303)
Q Consensus       256 ~~v~~sp~~~~las~s~------Dg~i~~Wd~~~~~~~  287 (303)
                      .-+.||||++++-++-.      |..+++||+.+....
T Consensus       363 s~~~wspd~qF~~~~~ts~k~~~Dn~i~l~~v~g~~~f  400 (561)
T COG5354         363 SYCDWSPDGQFYDTDTTSEKLRVDNSIKLWDVYGAKVF  400 (561)
T ss_pred             eEeeccCCceEEEecCCCcccccCcceEEEEecCchhh
Confidence            67889999997666533      788999998776433


No 296
>PF08450 SGL:  SMP-30/Gluconolaconase/LRE-like region;  InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE. ; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=97.88  E-value=0.012  Score=49.92  Aligned_cols=186  Identities=18%  Similarity=0.222  Sum_probs=107.7

Q ss_pred             eEEEEEc-CCCCEEEEeeCCCeEEEEECCCCceEEEEec-----ccCCeEEEEEccCCCcEEEEecCC--------CeEE
Q 022074           42 IFSLKFS-TDGRELVAGSSDDCIYVYDLEANKLSLRILA-----HTSDVNTVCFGDESGHLIYSGSDD--------NLCK  107 (303)
Q Consensus        42 v~~l~~s-~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~-----h~~~v~~l~~~~~~~~~l~s~s~d--------g~v~  107 (303)
                      ...+.+. ++ ..++.+..++ +.++|..+++.......     .....+.+++.+ ++++.++....        |.|.
T Consensus        42 ~~G~~~~~~~-g~l~v~~~~~-~~~~d~~~g~~~~~~~~~~~~~~~~~~ND~~vd~-~G~ly~t~~~~~~~~~~~~g~v~  118 (246)
T PF08450_consen   42 PNGMAFDRPD-GRLYVADSGG-IAVVDPDTGKVTVLADLPDGGVPFNRPNDVAVDP-DGNLYVTDSGGGGASGIDPGSVY  118 (246)
T ss_dssp             EEEEEEECTT-SEEEEEETTC-EEEEETTTTEEEEEEEEETTCSCTEEEEEEEE-T-TS-EEEEEECCBCTTCGGSEEEE
T ss_pred             CceEEEEccC-CEEEEEEcCc-eEEEecCCCcEEEEeeccCCCcccCCCceEEEcC-CCCEEEEecCCCccccccccceE
Confidence            6677777 55 5556666666 56669988865433222     234678888864 57777766543        4566


Q ss_pred             EEcCccccCCCccceeecccccCeEEEEeCCCCCEE-EEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCC
Q 022074          108 VWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYL-ISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQA  186 (303)
Q Consensus       108 lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l-~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  186 (303)
                      .++..     ++ ...........+.++++++++.| ++-+..+.|..+++...........         .....+.  
T Consensus       119 ~~~~~-----~~-~~~~~~~~~~pNGi~~s~dg~~lyv~ds~~~~i~~~~~~~~~~~~~~~~---------~~~~~~~--  181 (246)
T PF08450_consen  119 RIDPD-----GK-VTVVADGLGFPNGIAFSPDGKTLYVADSFNGRIWRFDLDADGGELSNRR---------VFIDFPG--  181 (246)
T ss_dssp             EEETT-----SE-EEEEEEEESSEEEEEEETTSSEEEEEETTTTEEEEEEEETTTCCEEEEE---------EEEE-SS--
T ss_pred             EECCC-----Ce-EEEEecCcccccceEECCcchheeecccccceeEEEeccccccceeeee---------eEEEcCC--
Confidence            66632     11 22222234456788999999866 5677788888888753221000000         0000000  


Q ss_pred             ccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEEC-CCCC
Q 022074          187 RDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSWH-PSQP  265 (303)
Q Consensus       187 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~s-p~~~  265 (303)
                             . . ...+             ...+..+|.+.++....+.|.++|.+ |+.+..+......+++++|. |+.+
T Consensus       182 -------~-~-g~pD-------------G~~vD~~G~l~va~~~~~~I~~~~p~-G~~~~~i~~p~~~~t~~~fgg~~~~  238 (246)
T PF08450_consen  182 -------G-P-GYPD-------------GLAVDSDGNLWVADWGGGRIVVFDPD-GKLLREIELPVPRPTNCAFGGPDGK  238 (246)
T ss_dssp             -------S-S-CEEE-------------EEEEBTTS-EEEEEETTTEEEEEETT-SCEEEEEE-SSSSEEEEEEESTTSS
T ss_pred             -------C-C-cCCC-------------cceEcCCCCEEEEEcCCCEEEEECCC-ccEEEEEcCCCCCEEEEEEECCCCC
Confidence                   0 0 0000             01234578888887889999999987 88888887665689999994 5655


Q ss_pred             -eEEEE
Q 022074          266 -MLVSS  270 (303)
Q Consensus       266 -~las~  270 (303)
                       +++|.
T Consensus       239 ~L~vTt  244 (246)
T PF08450_consen  239 TLYVTT  244 (246)
T ss_dssp             EEEEEE
T ss_pred             EEEEEe
Confidence             44443


No 297
>PF11768 DUF3312:  Protein of unknown function (DUF3312);  InterPro: IPR024511 This is a eukaryotic family of uncharacterised proteins that contain WD40 repeats.
Probab=97.86  E-value=7.7e-05  Score=68.61  Aligned_cols=65  Identities=20%  Similarity=0.362  Sum_probs=55.0

Q ss_pred             eeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEECCCCCeEEEEeCCCCEEEeecCCC
Q 022074          218 YSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFPGN  284 (303)
Q Consensus       218 ~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~~~  284 (303)
                      ++|+...|+.|++||.|.+||...+...  +....-.++.++|||+|.+++.|++.|.+.+||.+-+
T Consensus       267 ~sp~E~kLvlGC~DgSiiLyD~~~~~t~--~~ka~~~P~~iaWHp~gai~~V~s~qGelQ~FD~ALs  331 (545)
T PF11768_consen  267 RSPSEDKLVLGCEDGSIILYDTTRGVTL--LAKAEFIPTLIAWHPDGAIFVVGSEQGELQCFDMALS  331 (545)
T ss_pred             cCcccceEEEEecCCeEEEEEcCCCeee--eeeecccceEEEEcCCCcEEEEEcCCceEEEEEeecC
Confidence            5678899999999999999998766432  3345567899999999999999999999999997654


No 298
>KOG1920 consensus IkappaB kinase complex, IKAP component [Transcription]
Probab=97.86  E-value=0.0022  Score=63.82  Aligned_cols=199  Identities=17%  Similarity=0.213  Sum_probs=120.5

Q ss_pred             cceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCc----ccc
Q 022074           40 FGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRR----CLN  115 (303)
Q Consensus        40 ~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~----~~~  115 (303)
                      ..|.++.|..+.+.++.+..+|.|.+-|..+..... .-.-..+|.+.+|+++ .+.++-.+.++++.+-...    ...
T Consensus        69 ~~i~s~~fl~d~~~i~v~~~~G~iilvd~et~~~ei-vg~vd~GI~aaswS~D-ee~l~liT~~~tll~mT~~f~~i~E~  146 (1265)
T KOG1920|consen   69 DEIVSVQFLADTNSICVITALGDIILVDPETLELEI-VGNVDNGISAASWSPD-EELLALITGRQTLLFMTKDFEPIAEK  146 (1265)
T ss_pred             cceEEEEEecccceEEEEecCCcEEEEcccccceee-eeeccCceEEEeecCC-CcEEEEEeCCcEEEEEeccccchhcc
Confidence            579999999999999999999999999888776432 3345678999999764 6788888877888653210    000


Q ss_pred             C-------CC--------ccceeecc------------c---------ccCeEEEEeCCCCCEEEEEe----CC-CcEEE
Q 022074          116 V-------KG--------KPAGVLMG------------H---------LEGITFIDSRGDGRYLISNG----KD-QAIKL  154 (303)
Q Consensus       116 ~-------~~--------~~~~~~~~------------h---------~~~v~~~~~~~~~~~l~s~~----~D-~~v~l  154 (303)
                      .       ..        +....|.|            +         .+.-+.+.|..||.++++..    .+ +.+++
T Consensus       147 ~L~~d~~~~sk~v~VGwGrkeTqfrgs~gr~~~~~~~~~ek~~~~~~~~~~~~~IsWRgDg~~fAVs~~~~~~~~RkirV  226 (1265)
T KOG1920|consen  147 PLDADDERKSKFVNVGWGRKETQFRGSEGRQAARQKIEKEKALEQIEQDDHKTSISWRGDGEYFAVSFVESETGTRKIRV  226 (1265)
T ss_pred             ccccccccccccceecccccceeeecchhhhcccccccccccccchhhccCCceEEEccCCcEEEEEEEeccCCceeEEE
Confidence            0       00        00001111            0         11123477889999998732    24 89999


Q ss_pred             EEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEE---eCC
Q 022074          155 WDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTG---SHD  231 (303)
Q Consensus       155 Wdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg---~~d  231 (303)
                      ||-. ...+....    ..+.                  .+.                  +..+-|.|.++++-   .+|
T Consensus       227 ~drE-g~Lns~se----~~~~------------------l~~------------------~LsWkPsgs~iA~iq~~~sd  265 (1265)
T KOG1920|consen  227 YDRE-GALNSTSE----PVEG------------------LQH------------------SLSWKPSGSLIAAIQCKTSD  265 (1265)
T ss_pred             eccc-chhhcccC----cccc------------------ccc------------------ceeecCCCCeEeeeeecCCC
Confidence            9865 11110000    0000                  000                  01122445555552   345


Q ss_pred             CeEEEEECCCCeEEE----EeecCCCCeEEEEECCCCCeEEE---EeCCCCEEEeecC
Q 022074          232 SCVYVYDLVSGEQVA----ALKYHTSPVRDCSWHPSQPMLVS---SSWDGDVVRWEFP  282 (303)
Q Consensus       232 g~i~iwd~~~~~~~~----~~~~h~~~I~~v~~sp~~~~las---~s~Dg~i~~Wd~~  282 (303)
                      +.|.++. ++|-.-.    .+.....+|..++|+.++..||.   ......+++|...
T Consensus       266 ~~IvffE-rNGL~hg~f~l~~p~de~~ve~L~Wns~sdiLAv~~~~~e~~~v~lwt~~  322 (1265)
T KOG1920|consen  266 SDIVFFE-RNGLRHGEFVLPFPLDEKEVEELAWNSNSDILAVVTSNLENSLVQLWTTG  322 (1265)
T ss_pred             CcEEEEe-cCCccccccccCCcccccchheeeecCCCCceeeeecccccceEEEEEec
Confidence            6788886 3453322    22344556999999999999888   6666669999754


No 299
>KOG1275 consensus PAB-dependent poly(A) ribonuclease, subunit PAN2 [Replication, recombination and repair]
Probab=97.85  E-value=0.00019  Score=69.06  Aligned_cols=147  Identities=16%  Similarity=0.276  Sum_probs=96.0

Q ss_pred             CCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecC---------CCeEEEEcCccccCCCc
Q 022074           49 TDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSD---------DNLCKVWDRRCLNVKGK  119 (303)
Q Consensus        49 ~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~---------dg~v~lWd~~~~~~~~~  119 (303)
                      .+++.+.+|...|+|.|-|.++-+.++++..|.+.+.++..   .++.|++++.         |..|++||+|-.    +
T Consensus       185 ~Nnr~lf~G~t~G~V~LrD~~s~~~iht~~aHs~siSDfDv---~GNlLitCG~S~R~~~l~~D~FvkVYDLRmm----r  257 (1118)
T KOG1275|consen  185 YNNRNLFCGDTRGTVFLRDPNSFETIHTFDAHSGSISDFDV---QGNLLITCGYSMRRYNLAMDPFVKVYDLRMM----R  257 (1118)
T ss_pred             ecCcEEEeecccceEEeecCCcCceeeeeeccccceeeeec---cCCeEEEeecccccccccccchhhhhhhhhh----h
Confidence            46789999999999999999999999999999998888655   3678888874         667889998732    1


Q ss_pred             cceeecccccCeEEEEeCCC-CCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCcce
Q 022074          120 PAGVLMGHLEGITFIDSRGD-GRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVA  198 (303)
Q Consensus       120 ~~~~~~~h~~~v~~~~~~~~-~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  198 (303)
                      .+..+.-+.++ ..+.|.|. ...++.++.-|.+.+-|......                    |+....+..++...+.
T Consensus       258 al~PI~~~~~P-~flrf~Psl~t~~~V~S~sGq~q~vd~~~lsN--------------------P~~~~~~v~p~~s~i~  316 (1118)
T KOG1275|consen  258 ALSPIQFPYGP-QFLRFHPSLTTRLAVTSQSGQFQFVDTATLSN--------------------PPAGVKMVNPNGSGIS  316 (1118)
T ss_pred             ccCCcccccCc-hhhhhcccccceEEEEecccceeeccccccCC--------------------CccceeEEccCCCcce
Confidence            22222223232 23334442 34567777778888887432111                    1111111112211111


Q ss_pred             EEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEEC
Q 022074          199 TYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDL  239 (303)
Q Consensus       199 ~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~  239 (303)
                                      .-.+|+++..+|.|..+|.|.+|--
T Consensus       317 ----------------~fDiSsn~~alafgd~~g~v~~wa~  341 (1118)
T KOG1275|consen  317 ----------------AFDISSNGDALAFGDHEGHVNLWAD  341 (1118)
T ss_pred             ----------------eEEecCCCceEEEecccCcEeeecC
Confidence                            1235778999999999999999973


No 300
>PF11768 DUF3312:  Protein of unknown function (DUF3312);  InterPro: IPR024511 This is a eukaryotic family of uncharacterised proteins that contain WD40 repeats.
Probab=97.83  E-value=0.00074  Score=62.32  Aligned_cols=90  Identities=18%  Similarity=0.194  Sum_probs=67.0

Q ss_pred             EEEEECCCCceEE---EEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCC
Q 022074           63 IYVYDLEANKLSL---RILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGD  139 (303)
Q Consensus        63 v~lwd~~~~~~~~---~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~  139 (303)
                      -.+|+...++...   +.......|.+.+++| +.+.|+.|..||+|.+||...      ....+....-..+.++|+|+
T Consensus       238 ~ciYE~~r~klqrvsvtsipL~s~v~~ca~sp-~E~kLvlGC~DgSiiLyD~~~------~~t~~~ka~~~P~~iaWHp~  310 (545)
T PF11768_consen  238 SCIYECSRNKLQRVSVTSIPLPSQVICCARSP-SEDKLVLGCEDGSIILYDTTR------GVTLLAKAEFIPTLIAWHPD  310 (545)
T ss_pred             EEEEEeecCceeEEEEEEEecCCcceEEecCc-ccceEEEEecCCeEEEEEcCC------CeeeeeeecccceEEEEcCC
Confidence            3677777665432   2335667899999976 467888999999999999642      12223334445678899999


Q ss_pred             CCEEEEEeCCCcEEEEEccc
Q 022074          140 GRYLISNGKDQAIKLWDIRK  159 (303)
Q Consensus       140 ~~~l~s~~~D~~v~lWdl~~  159 (303)
                      |..++.|+.-|.+.+||+..
T Consensus       311 gai~~V~s~qGelQ~FD~AL  330 (545)
T PF11768_consen  311 GAIFVVGSEQGELQCFDMAL  330 (545)
T ss_pred             CcEEEEEcCCceEEEEEeec
Confidence            99999999999999999863


No 301
>KOG3617 consensus WD40 and TPR repeat-containing protein [General function prediction only]
Probab=97.82  E-value=0.00049  Score=65.89  Aligned_cols=108  Identities=13%  Similarity=0.247  Sum_probs=77.4

Q ss_pred             EEEcCCCCEEEEee----CCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCcc
Q 022074           45 LKFSTDGRELVAGS----SDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKP  120 (303)
Q Consensus        45 l~~s~~g~~l~sgs----~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~  120 (303)
                      .+|+|....+++++    ..|.|.|| +++|....-+ ...-.+.++||+|. .-+|+.|=.-|.+.+|...    ..+.
T Consensus        21 ~SWHPsePlfAVA~fS~er~GSVtIf-adtGEPqr~V-t~P~hatSLCWHpe-~~vLa~gwe~g~~~v~~~~----~~e~   93 (1416)
T KOG3617|consen   21 SSWHPSEPLFAVASFSPERGGSVTIF-ADTGEPQRDV-TYPVHATSLCWHPE-EFVLAQGWEMGVSDVQKTN----TTET   93 (1416)
T ss_pred             cccCCCCceeEEEEecCCCCceEEEE-ecCCCCCccc-ccceehhhhccChH-HHHHhhccccceeEEEecC----Ccee
Confidence            57888888888876    46789888 3445422111 11113567999764 4467777778899999853    2233


Q ss_pred             ceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEccc
Q 022074          121 AGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRK  159 (303)
Q Consensus       121 ~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~  159 (303)
                      ......|..++..+.|+++|+.++|+..-|.|.+|....
T Consensus        94 htv~~th~a~i~~l~wS~~G~~l~t~d~~g~v~lwr~d~  132 (1416)
T KOG3617|consen   94 HTVVETHPAPIQGLDWSHDGTVLMTLDNPGSVHLWRYDV  132 (1416)
T ss_pred             eeeccCCCCCceeEEecCCCCeEEEcCCCceeEEEEeee
Confidence            344556999999999999999999999999999997653


No 302
>KOG0882 consensus Cyclophilin-related peptidyl-prolyl cis-trans isomerase [Posttranslational modification, protein turnover, chaperones]
Probab=97.77  E-value=7.6e-05  Score=66.38  Aligned_cols=199  Identities=16%  Similarity=0.141  Sum_probs=116.5

Q ss_pred             cccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeC-CCcEEEEEc
Q 022074           79 AHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGK-DQAIKLWDI  157 (303)
Q Consensus        79 ~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~-D~~v~lWdl  157 (303)
                      -|.+.|+.+..  ...+.+.+++.||.++.|....... ...+..+..|...+.++..+-++.++.|.+. |+.+|++|+
T Consensus         7 mhrd~i~hv~~--tka~fiiqASlDGh~KFWkKs~isG-vEfVKhFraHL~~I~sl~~S~dg~L~~Sv~d~Dhs~KvfDv   83 (558)
T KOG0882|consen    7 MHRDVITHVFP--TKAKFIIQASLDGHKKFWKKSRISG-VEFVKHFRAHLGVILSLAVSYDGWLFRSVEDPDHSVKVFDV   83 (558)
T ss_pred             cccceeeeEee--ehhheEEeeecchhhhhcCCCCccc-eeehhhhHHHHHHHHhhhccccceeEeeccCcccceeEEEe
Confidence            36566666653  3467999999999999997431110 1123345567777888888889999999777 999999998


Q ss_pred             ccccCCcccccCccc--eeeeceeeeCCCCCc-ccc--CCCCCcceEEecccc---eeeeEEEeeeee----eeCCCeEE
Q 022074          158 RKMSSNASCNLGFRS--YEWDYRWMDYPPQAR-DLK--HPCDQSVATYKGHSV---LRTLIRCHFSPV----YSTGQKYI  225 (303)
Q Consensus       158 ~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~-~~~--~~~~~~~~~~~~~~~---~~~~~~~~~~~~----~s~~~~~l  225 (303)
                      ..........+.+.+  .+|..    .+.+.. .+.  ......+...++...   ...-...|++|+    +.+-+..+
T Consensus        84 En~DminmiKL~~lPg~a~wv~----skGd~~s~IAVs~~~sg~i~VvD~~~d~~q~~~fkklH~sPV~~i~y~qa~Ds~  159 (558)
T KOG0882|consen   84 ENFDMINMIKLVDLPGFAEWVT----SKGDKISLIAVSLFKSGKIFVVDGFGDFCQDGYFKKLHFSPVKKIRYNQAGDSA  159 (558)
T ss_pred             eccchhhhcccccCCCceEEec----CCCCeeeeEEeecccCCCcEEECCcCCcCccceecccccCceEEEEeeccccce
Confidence            764433222222111  11111    000000 000  001111222211110   001112233332    34556677


Q ss_pred             EEEeCCCeEEEEECCC-Ce-----E---------EEEeecCCCCeEEEEECCCCCeEEEEeCCCCEEEeecCCC
Q 022074          226 YTGSHDSCVYVYDLVS-GE-----Q---------VAALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFPGN  284 (303)
Q Consensus       226 atg~~dg~i~iwd~~~-~~-----~---------~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~~~  284 (303)
                      ++....|-|.-|..+- .+     .         +..+........++.|+|++..+++-+.|+.++++.+.+.
T Consensus       160 vSiD~~gmVEyWs~e~~~qfPr~~l~~~~K~eTdLy~f~K~Kt~pts~Efsp~g~qistl~~DrkVR~F~~KtG  233 (558)
T KOG0882|consen  160 VSIDISGMVEYWSAEGPFQFPRTNLNFELKHETDLYGFPKAKTEPTSFEFSPDGAQISTLNPDRKVRGFVFKTG  233 (558)
T ss_pred             eeccccceeEeecCCCcccCccccccccccccchhhcccccccCccceEEccccCcccccCcccEEEEEEeccc
Confidence            8888889999998762 11     1         1122233457899999999999999999999999998765


No 303
>PF04762 IKI3:  IKI3 family;  InterPro: IPR006849 Members of this family are components of the elongator multi-subunit component of a novel RNA polymerase II holoenzyme for transcriptional elongation.
Probab=97.76  E-value=0.021  Score=57.65  Aligned_cols=113  Identities=20%  Similarity=0.271  Sum_probs=76.1

Q ss_pred             cccceEEEEEcCCCCEEEEeeC---C---CeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecC---CCeEEE
Q 022074           38 YSFGIFSLKFSTDGRELVAGSS---D---DCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSD---DNLCKV  108 (303)
Q Consensus        38 ~~~~v~~l~~s~~g~~l~sgs~---D---g~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~---dg~v~l  108 (303)
                      +...-..|+|..||+++|+.+.   +   ..+|||+-+ |.+......-.+--.+++|. |+++++++...   ...|..
T Consensus       208 ~dd~~~~ISWRGDG~yFAVss~~~~~~~~R~iRVy~Re-G~L~stSE~v~gLe~~l~Wr-PsG~lIA~~q~~~~~~~VvF  285 (928)
T PF04762_consen  208 WDDGRVRISWRGDGEYFAVSSVEPETGSRRVIRVYSRE-GELQSTSEPVDGLEGALSWR-PSGNLIASSQRLPDRHDVVF  285 (928)
T ss_pred             cCCCceEEEECCCCcEEEEEEEEcCCCceeEEEEECCC-ceEEeccccCCCccCCccCC-CCCCEEEEEEEcCCCcEEEE
Confidence            4456788999999999999874   3   478999876 54443333333334577895 56889988764   456677


Q ss_pred             EcCccccCCCccceeec----ccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcc
Q 022074          109 WDRRCLNVKGKPAGVLM----GHLEGITFIDSRGDGRYLISNGKDQAIKLWDIR  158 (303)
Q Consensus       109 Wd~~~~~~~~~~~~~~~----~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~  158 (303)
                      |...     +-..+.|.    .....|..+.|+.++..|+.--.|. |.+|-..
T Consensus       286 fErN-----GLrhgeF~l~~~~~~~~v~~l~Wn~ds~iLAv~~~~~-vqLWt~~  333 (928)
T PF04762_consen  286 FERN-----GLRHGEFTLRFDPEEEKVIELAWNSDSEILAVWLEDR-VQLWTRS  333 (928)
T ss_pred             EecC-----CcEeeeEecCCCCCCceeeEEEECCCCCEEEEEecCC-ceEEEee
Confidence            7632     22222221    2345688999999999888866555 9999764


No 304
>TIGR03300 assembly_YfgL outer membrane assembly lipoprotein YfgL. Members of this protein family are YfgL, a lipoprotein component of a complex that acts in protein insertion into the bacterial outer membrane. Other members of this complex are NlpB, YfiO, and YaeT. This protein contains multiple copies of a repeat that, in other contexts, is associated with binding of the coenzyme PQQ.
Probab=97.76  E-value=0.011  Score=53.59  Aligned_cols=217  Identities=16%  Similarity=0.094  Sum_probs=107.5

Q ss_pred             CCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCccceeeccccc
Q 022074           50 DGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLE  129 (303)
Q Consensus        50 ~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~  129 (303)
                      .+..+++++.+|.+.-+|..+|+..-+..-..........   .+..++.++.++.+..+|....+...+.  .+   .+
T Consensus        64 ~~~~v~v~~~~g~v~a~d~~tG~~~W~~~~~~~~~~~p~v---~~~~v~v~~~~g~l~ald~~tG~~~W~~--~~---~~  135 (377)
T TIGR03300        64 AGGKVYAADADGTVVALDAETGKRLWRVDLDERLSGGVGA---DGGLVFVGTEKGEVIALDAEDGKELWRA--KL---SS  135 (377)
T ss_pred             ECCEEEEECCCCeEEEEEccCCcEeeeecCCCCcccceEE---cCCEEEEEcCCCEEEEEECCCCcEeeee--cc---Cc
Confidence            3678999999999999999999865443322221222222   2456777888999999997533221110  01   01


Q ss_pred             CeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCC--CCccccCCCCCcceEEe---ccc
Q 022074          130 GITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPP--QARDLKHPCDQSVATYK---GHS  204 (303)
Q Consensus       130 ~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~---~~~  204 (303)
                      .+.+.... .+..++.+..++.+..||.+..+..............  .....|.  ....+.......+..++   |..
T Consensus       136 ~~~~~p~v-~~~~v~v~~~~g~l~a~d~~tG~~~W~~~~~~~~~~~--~~~~sp~~~~~~v~~~~~~g~v~ald~~tG~~  212 (377)
T TIGR03300       136 EVLSPPLV-ANGLVVVRTNDGRLTALDAATGERLWTYSRVTPALTL--RGSASPVIADGGVLVGFAGGKLVALDLQTGQP  212 (377)
T ss_pred             eeecCCEE-ECCEEEEECCCCeEEEEEcCCCceeeEEccCCCceee--cCCCCCEEECCEEEEECCCCEEEEEEccCCCE
Confidence            11110001 2346667778899999998754433222111000000  0000000  00000000111111111   110


Q ss_pred             ceeee-------------EEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEECCCCCeEEEEe
Q 022074          205 VLRTL-------------IRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQPMLVSSS  271 (303)
Q Consensus       205 ~~~~~-------------~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~las~s  271 (303)
                      .....             .....+|..  .+..+..++.+|.++.+|.++|+.+-..+. .. ...  ....+..+..++
T Consensus       213 ~W~~~~~~~~g~~~~~~~~~~~~~p~~--~~~~vy~~~~~g~l~a~d~~tG~~~W~~~~-~~-~~~--p~~~~~~vyv~~  286 (377)
T TIGR03300       213 LWEQRVALPKGRTELERLVDVDGDPVV--DGGQVYAVSYQGRVAALDLRSGRVLWKRDA-SS-YQG--PAVDDNRLYVTD  286 (377)
T ss_pred             eeeeccccCCCCCchhhhhccCCccEE--ECCEEEEEEcCCEEEEEECCCCcEEEeecc-CC-ccC--ceEeCCEEEEEC
Confidence            00000             000112222  234677788899999999999987655431 11 111  122456777777


Q ss_pred             CCCCEEEeecCC
Q 022074          272 WDGDVVRWEFPG  283 (303)
Q Consensus       272 ~Dg~i~~Wd~~~  283 (303)
                      .||.+..+|...
T Consensus       287 ~~G~l~~~d~~t  298 (377)
T TIGR03300       287 ADGVVVALDRRS  298 (377)
T ss_pred             CCCeEEEEECCC
Confidence            899999988754


No 305
>TIGR03300 assembly_YfgL outer membrane assembly lipoprotein YfgL. Members of this protein family are YfgL, a lipoprotein component of a complex that acts in protein insertion into the bacterial outer membrane. Other members of this complex are NlpB, YfiO, and YaeT. This protein contains multiple copies of a repeat that, in other contexts, is associated with binding of the coenzyme PQQ.
Probab=97.75  E-value=0.0053  Score=55.67  Aligned_cols=175  Identities=18%  Similarity=0.135  Sum_probs=100.7

Q ss_pred             CEEEEeeCCCeEEEEECCCCceEEEEec-ccC---C------e-EEEEEccCCCcEEEEecCCCeEEEEcCccccCCCcc
Q 022074           52 RELVAGSSDDCIYVYDLEANKLSLRILA-HTS---D------V-NTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKP  120 (303)
Q Consensus        52 ~~l~sgs~Dg~v~lwd~~~~~~~~~~~~-h~~---~------v-~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~  120 (303)
                      ..++.+..+|.+.-+|+.+|+..-+... ...   .      + ....+   .+..++.++.+|.++.+|.+..+.    
T Consensus       191 ~~v~~~~~~g~v~ald~~tG~~~W~~~~~~~~g~~~~~~~~~~~~~p~~---~~~~vy~~~~~g~l~a~d~~tG~~----  263 (377)
T TIGR03300       191 GGVLVGFAGGKLVALDLQTGQPLWEQRVALPKGRTELERLVDVDGDPVV---DGGQVYAVSYQGRVAALDLRSGRV----  263 (377)
T ss_pred             CEEEEECCCCEEEEEEccCCCEeeeeccccCCCCCchhhhhccCCccEE---ECCEEEEEEcCCEEEEEECCCCcE----
Confidence            4678888899999999999875432211 000   0      0 01111   134677788899999999753221    


Q ss_pred             ceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCcceEE
Q 022074          121 AGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATY  200 (303)
Q Consensus       121 ~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  200 (303)
                      .-... . ......  ...+..++.++.|+.+..+|....+..           |....                    +
T Consensus       264 ~W~~~-~-~~~~~p--~~~~~~vyv~~~~G~l~~~d~~tG~~~-----------W~~~~--------------------~  308 (377)
T TIGR03300       264 LWKRD-A-SSYQGP--AVDDNRLYVTDADGVVVALDRRSGSEL-----------WKNDE--------------------L  308 (377)
T ss_pred             EEeec-c-CCccCc--eEeCCEEEEECCCCeEEEEECCCCcEE-----------Ecccc--------------------c
Confidence            11110 0 111111  124567888888999999997543211           11000                    0


Q ss_pred             ecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEECCCCCeEEEEeCCCCEEEe
Q 022074          201 KGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVRW  279 (303)
Q Consensus       201 ~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~W  279 (303)
                      .+        ....+|..  .+..+++++.+|.++++|..+|+.+..++.+..++..--.-.++ .|..++.||.|..|
T Consensus       309 ~~--------~~~ssp~i--~g~~l~~~~~~G~l~~~d~~tG~~~~~~~~~~~~~~~sp~~~~~-~l~v~~~dG~l~~~  376 (377)
T TIGR03300       309 KY--------RQLTAPAV--VGGYLVVGDFEGYLHWLSREDGSFVARLKTDGSGIASPPVVVGD-GLLVQTRDGDLYAF  376 (377)
T ss_pred             cC--------CccccCEE--ECCEEEEEeCCCEEEEEECCCCCEEEEEEcCCCccccCCEEECC-EEEEEeCCceEEEe
Confidence            00        00011221  24578889999999999999999988887665443322222233 47788889998865


No 306
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=97.74  E-value=0.0072  Score=52.66  Aligned_cols=194  Identities=14%  Similarity=0.247  Sum_probs=108.9

Q ss_pred             CCeEEEEECCCC--ceE-EEEecccCCeEEEEEccCCCcEEEEecC---CCeEEEEcCccccCCCcc--ceeecccccCe
Q 022074           60 DDCIYVYDLEAN--KLS-LRILAHTSDVNTVCFGDESGHLIYSGSD---DNLCKVWDRRCLNVKGKP--AGVLMGHLEGI  131 (303)
Q Consensus        60 Dg~v~lwd~~~~--~~~-~~~~~h~~~v~~l~~~~~~~~~l~s~s~---dg~v~lWd~~~~~~~~~~--~~~~~~h~~~v  131 (303)
                      +.-|++|++.+.  ++. .+...+.+..+-++|+++ .+.|.++.+   +|.|.-|.+...  .++.  +.....-..+-
T Consensus        15 s~gI~v~~ld~~~g~l~~~~~v~~~~nptyl~~~~~-~~~LY~v~~~~~~ggvaay~iD~~--~G~Lt~ln~~~~~g~~p   91 (346)
T COG2706          15 SQGIYVFNLDTKTGELSLLQLVAELGNPTYLAVNPD-QRHLYVVNEPGEEGGVAAYRIDPD--DGRLTFLNRQTLPGSPP   91 (346)
T ss_pred             CCceEEEEEeCcccccchhhhccccCCCceEEECCC-CCEEEEEEecCCcCcEEEEEEcCC--CCeEEEeeccccCCCCC
Confidence            345999988743  221 234456678889999754 556666654   466666654311  1111  11111112233


Q ss_pred             EEEEeCCCCCEEEEEeC-CCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeE
Q 022074          132 TFIDSRGDGRYLISNGK-DQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLI  210 (303)
Q Consensus       132 ~~~~~~~~~~~l~s~~~-D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  210 (303)
                      +.++++++++++++++. -+.|.++-++..-.....       .            ..+.+....      .| ......
T Consensus        92 ~yvsvd~~g~~vf~AnY~~g~v~v~p~~~dG~l~~~-------v------------~~~~h~g~~------p~-~rQ~~~  145 (346)
T COG2706          92 CYVSVDEDGRFVFVANYHSGSVSVYPLQADGSLQPV-------V------------QVVKHTGSG------PH-ERQESP  145 (346)
T ss_pred             eEEEECCCCCEEEEEEccCceEEEEEcccCCccccc-------e------------eeeecCCCC------CC-ccccCC
Confidence            77889999999888775 578888876542110000       0            000000000      00 000111


Q ss_pred             EEeeeeeeeCCCeEEEEEe-CCCeEEEEECCCCeEEE----EeecCCCCeEEEEECCCCCeEEEEe-CCCCEEEeecCCC
Q 022074          211 RCHFSPVYSTGQKYIYTGS-HDSCVYVYDLVSGEQVA----ALKYHTSPVRDCSWHPSQPMLVSSS-WDGDVVRWEFPGN  284 (303)
Q Consensus       211 ~~~~~~~~s~~~~~latg~-~dg~i~iwd~~~~~~~~----~~~~h~~~I~~v~~sp~~~~las~s-~Dg~i~~Wd~~~~  284 (303)
                      .+|+ ..+.|++++|++.. ..-+|.+|++..|++..    .+ ....-...+.|+|++++.-... -++++.+|+....
T Consensus       146 h~H~-a~~tP~~~~l~v~DLG~Dri~~y~~~dg~L~~~~~~~v-~~G~GPRHi~FHpn~k~aY~v~EL~stV~v~~y~~~  223 (346)
T COG2706         146 HVHS-ANFTPDGRYLVVPDLGTDRIFLYDLDDGKLTPADPAEV-KPGAGPRHIVFHPNGKYAYLVNELNSTVDVLEYNPA  223 (346)
T ss_pred             ccce-eeeCCCCCEEEEeecCCceEEEEEcccCcccccccccc-CCCCCcceEEEcCCCcEEEEEeccCCEEEEEEEcCC
Confidence            1222 24678998888864 34469999999776422    22 2234568999999999755544 4899999987654


No 307
>KOG2066 consensus Vacuolar assembly/sorting protein VPS41 [Intracellular trafficking, secretion, and vesicular transport]
Probab=97.71  E-value=0.0038  Score=59.61  Aligned_cols=116  Identities=16%  Similarity=0.219  Sum_probs=83.9

Q ss_pred             CCcccceEEEEEcCC-------------CCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccC----CCcEEE
Q 022074           36 GGYSFGIFSLKFSTD-------------GRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDE----SGHLIY   98 (303)
Q Consensus        36 ~~~~~~v~~l~~s~~-------------g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~----~~~~l~   98 (303)
                      +.|++.|+-..+.-+             |+++++||.||+|.|-.+.+++...++ ....++..++++|+    ..++++
T Consensus        55 GtH~g~v~~~~~~~~~~~~~~~s~~~~~Gey~asCS~DGkv~I~sl~~~~~~~~~-df~rpiksial~Pd~~~~~sk~fv  133 (846)
T KOG2066|consen   55 GTHRGAVYLTTCQGNPKTNFDHSSSILEGEYVASCSDDGKVVIGSLFTDDEITQY-DFKRPIKSIALHPDFSRQQSKQFV  133 (846)
T ss_pred             ccccceEEEEecCCcccccccccccccCCceEEEecCCCcEEEeeccCCccceeE-ecCCcceeEEeccchhhhhhhhee
Confidence            678888888777766             999999999999999999888765433 44568899999765    356899


Q ss_pred             EecCCCeEEEEcCccccCCCccce-eecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEccc
Q 022074           99 SGSDDNLCKVWDRRCLNVKGKPAG-VLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRK  159 (303)
Q Consensus        99 s~s~dg~v~lWd~~~~~~~~~~~~-~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~  159 (303)
                      +|+.-| +.++..+-.   +.... .+..-.++|.++.|.  |+++|=++ |-.|++||+..
T Consensus       134 ~GG~ag-lvL~er~wl---gnk~~v~l~~~eG~I~~i~W~--g~lIAWan-d~Gv~vyd~~~  188 (846)
T KOG2066|consen  134 SGGMAG-LVLSERNWL---GNKDSVVLSEGEGPIHSIKWR--GNLIAWAN-DDGVKVYDTPT  188 (846)
T ss_pred             ecCcce-EEEehhhhh---cCccceeeecCccceEEEEec--CcEEEEec-CCCcEEEeccc
Confidence            999988 888764311   11111 233344678888874  66777665 45579999864


No 308
>PF13360 PQQ_2:  PQQ-like domain; PDB: 3HXJ_B 1YIQ_A 1KV9_A 3Q54_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A ....
Probab=97.69  E-value=0.0059  Score=51.26  Aligned_cols=196  Identities=18%  Similarity=0.180  Sum_probs=103.6

Q ss_pred             CCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCc-cceeeccc
Q 022074           49 TDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGK-PAGVLMGH  127 (303)
Q Consensus        49 ~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~-~~~~~~~h  127 (303)
                      ++++.+++++.++.++.||..+|+..-+.... +.+..... .. +..++.++.++.++.+|.+..+...+ ........
T Consensus        34 ~~~~~v~~~~~~~~l~~~d~~tG~~~W~~~~~-~~~~~~~~-~~-~~~v~v~~~~~~l~~~d~~tG~~~W~~~~~~~~~~  110 (238)
T PF13360_consen   34 PDGGRVYVASGDGNLYALDAKTGKVLWRFDLP-GPISGAPV-VD-GGRVYVGTSDGSLYALDAKTGKVLWSIYLTSSPPA  110 (238)
T ss_dssp             EETTEEEEEETTSEEEEEETTTSEEEEEEECS-SCGGSGEE-EE-TTEEEEEETTSEEEEEETTTSCEEEEEEE-SSCTC
T ss_pred             EeCCEEEEEcCCCEEEEEECCCCCEEEEeecc-ccccceee-ec-ccccccccceeeeEecccCCcceeeeecccccccc
Confidence            35778888899999999999999876554432 22111112 12 34556666778999998653322111 00000000


Q ss_pred             -ccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCcceEEecccce
Q 022074          128 -LEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVL  206 (303)
Q Consensus       128 -~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  206 (303)
                       ........+  .+..++.+..++.+..+|++..+.......            ..+....        .+..+      
T Consensus       111 ~~~~~~~~~~--~~~~~~~~~~~g~l~~~d~~tG~~~w~~~~------------~~~~~~~--------~~~~~------  162 (238)
T PF13360_consen  111 GVRSSSSPAV--DGDRLYVGTSSGKLVALDPKTGKLLWKYPV------------GEPRGSS--------PISSF------  162 (238)
T ss_dssp             STB--SEEEE--ETTEEEEEETCSEEEEEETTTTEEEEEEES------------STT-SS----------EEEE------
T ss_pred             ccccccCceE--ecCEEEEEeccCcEEEEecCCCcEEEEeec------------CCCCCCc--------ceeee------
Confidence             011111222  266788888899999999875433221111            0000000        00000      


Q ss_pred             eeeEEEeeeeeeeCCCeEEEEEeCCCe-EEEEECCCCeEEEEeecCCCCeEEEEECCCCCeEEEEeCCCCEEEeecCCC
Q 022074          207 RTLIRCHFSPVYSTGQKYIYTGSHDSC-VYVYDLVSGEQVAALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFPGN  284 (303)
Q Consensus       207 ~~~~~~~~~~~~s~~~~~latg~~dg~-i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~~~  284 (303)
                         ......+.+ .++ .+..++.++. +.+ |..+++.+-...  ...+..+ ..+++..|+.++.++.+..||+...
T Consensus       163 ---~~~~~~~~~-~~~-~v~~~~~~g~~~~~-d~~tg~~~w~~~--~~~~~~~-~~~~~~~l~~~~~~~~l~~~d~~tG  232 (238)
T PF13360_consen  163 ---SDINGSPVI-SDG-RVYVSSGDGRVVAV-DLATGEKLWSKP--ISGIYSL-PSVDGGTLYVTSSDGRLYALDLKTG  232 (238)
T ss_dssp             ---TTEEEEEEC-CTT-EEEEECCTSSEEEE-ETTTTEEEEEEC--SS-ECEC-EECCCTEEEEEETTTEEEEEETTTT
T ss_pred             ---cccccceEE-ECC-EEEEEcCCCeEEEE-ECCCCCEEEEec--CCCccCC-ceeeCCEEEEEeCCCEEEEEECCCC
Confidence               000011222 234 5566666664 566 999998663222  2222221 4567777777779999999998754


No 309
>KOG2041 consensus WD40 repeat protein [General function prediction only]
Probab=97.69  E-value=0.001  Score=62.69  Aligned_cols=240  Identities=15%  Similarity=0.157  Sum_probs=135.6

Q ss_pred             ccCCCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEec--ccCCeEEEEEccCCCcEEEEecCCCeEEEEc
Q 022074           33 ADDGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILA--HTSDVNTVCFGDESGHLIYSGSDDNLCKVWD  110 (303)
Q Consensus        33 ~~~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~--h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd  110 (303)
                      +..-||+..|.-+.|+.+.+.|-++..+|.|.+|=+-.|.....+..  .++-|.+++|+. +++.+...-.||.|.+=.
T Consensus        65 QtLeGH~~sV~vvTWNe~~QKLTtSDt~GlIiVWmlykgsW~EEMiNnRnKSvV~SmsWn~-dG~kIcIvYeDGavIVGs  143 (1189)
T KOG2041|consen   65 QTLEGHNASVMVVTWNENNQKLTTSDTSGLIIVWMLYKGSWCEEMINNRNKSVVVSMSWNL-DGTKICIVYEDGAVIVGS  143 (1189)
T ss_pred             hhhccCcceEEEEEeccccccccccCCCceEEEEeeecccHHHHHhhCcCccEEEEEEEcC-CCcEEEEEEccCCEEEEe
Confidence            34459999999999999999999999999999998887765433322  345678889965 477777777788776532


Q ss_pred             CccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccc---cCCccccc---------C--ccceeee
Q 022074          111 RRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKM---SSNASCNL---------G--FRSYEWD  176 (303)
Q Consensus       111 ~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~---~~~~~~~~---------~--~~~~~~~  176 (303)
                      ++....-+   ..+.|.  -...+.+++|.++++-+-..|.+.++|....   +...+|..         +  ...+.|.
T Consensus       144 vdGNRIwg---KeLkg~--~l~hv~ws~D~~~~Lf~~ange~hlydnqgnF~~Kl~~~c~Vn~tg~~s~~~~kia~i~w~  218 (1189)
T KOG2041|consen  144 VDGNRIWG---KELKGQ--LLAHVLWSEDLEQALFKKANGETHLYDNQGNFERKLEKDCEVNGTGIFSNFPTKIAEIEWN  218 (1189)
T ss_pred             eccceecc---hhcchh--eccceeecccHHHHHhhhcCCcEEEecccccHHHhhhhceEEeeeeeecCCCccccceeec
Confidence            21100000   011110  0124557888888888888899999986531   11111100         0  1111121


Q ss_pred             ce-eeeCCCCCccccCCCCCcceEEe-----cccceeeeEEEeeeeeeeCCCeEEEEEeCCC---------eEEEEECCC
Q 022074          177 YR-WMDYPPQARDLKHPCDQSVATYK-----GHSVLRTLIRCHFSPVYSTGQKYIYTGSHDS---------CVYVYDLVS  241 (303)
Q Consensus       177 ~~-~~~~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~s~~~~~latg~~dg---------~i~iwd~~~  241 (303)
                      .. ....+|+.+.+...-.+...-+.     ....+...---.....++++|..||.+|.|.         .|.++.. -
T Consensus       219 ~g~~~~v~pdrP~lavcy~nGr~QiMR~eND~~Pvv~dtgm~~vgakWnh~G~vLAvcG~~~da~~~~d~n~v~Fysp-~  297 (1189)
T KOG2041|consen  219 TGPYQPVPPDRPRLAVCYANGRMQIMRSENDPEPVVVDTGMKIVGAKWNHNGAVLAVCGNDSDADEPTDSNKVHFYSP-Y  297 (1189)
T ss_pred             cCccccCCCCCCEEEEEEcCceehhhhhcCCCCCeEEecccEeecceecCCCcEEEEccCcccccCccccceEEEecc-c
Confidence            11 01112222222111111100000     0000000000011234678899999988643         4666654 4


Q ss_pred             CeEEEEeecCCCCeEEEEECCCCCeEEEEeCCCCEEEee
Q 022074          242 GEQVAALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWE  280 (303)
Q Consensus       242 ~~~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd  280 (303)
                      |+.+.+++.....|++++|-..|-.+|-+- |+.|.+=+
T Consensus       298 G~i~gtlkvpg~~It~lsWEg~gLriA~Av-dsfiyfan  335 (1189)
T KOG2041|consen  298 GHIVGTLKVPGSCITGLSWEGTGLRIAIAV-DSFIYFAN  335 (1189)
T ss_pred             hhheEEEecCCceeeeeEEcCCceEEEEEe-cceEEEEe
Confidence            677888888888999999988887666664 55555433


No 310
>KOG2114 consensus Vacuolar assembly/sorting protein PEP5/VPS11 [Intracellular trafficking, secretion, and vesicular transport]
Probab=97.66  E-value=0.0088  Score=57.67  Aligned_cols=196  Identities=13%  Similarity=0.178  Sum_probs=115.2

Q ss_pred             EEcCCCCEEEEeeCCCeEEEEECCCCceE-EEEecccCC-eEEEEEccCCCcEEEEecCCC-----eEEEEcCccccCCC
Q 022074           46 KFSTDGRELVAGSSDDCIYVYDLEANKLS-LRILAHTSD-VNTVCFGDESGHLIYSGSDDN-----LCKVWDRRCLNVKG  118 (303)
Q Consensus        46 ~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~-~~~~~h~~~-v~~l~~~~~~~~~l~s~s~dg-----~v~lWd~~~~~~~~  118 (303)
                      +|++++..++.|+.||.|.+++-  +... .-+..+... |..+.. .+..+.|++.++|+     .+++||+.-...+.
T Consensus        30 c~~s~~~~vvigt~~G~V~~Ln~--s~~~~~~fqa~~~siv~~L~~-~~~~~~L~sv~Ed~~~np~llkiw~lek~~~n~  106 (933)
T KOG2114|consen   30 CCSSSTGSVVIGTADGRVVILNS--SFQLIRGFQAYEQSIVQFLYI-LNKQNFLFSVGEDEQGNPVLLKIWDLEKVDKNN  106 (933)
T ss_pred             EEcCCCceEEEeeccccEEEecc--cceeeehheecchhhhhHhhc-ccCceEEEEEeecCCCCceEEEEecccccCCCC
Confidence            35678889999999999877743  3322 345556555 444433 34446788777665     48999986332222


Q ss_pred             cccee----eccc-----ccCeEEEEeCCCCCEEEEEeCCCcEEEEE--cccccCCcccccCccceeeeceeeeCCCCCc
Q 022074          119 KPAGV----LMGH-----LEGITFIDSRGDGRYLISNGKDQAIKLWD--IRKMSSNASCNLGFRSYEWDYRWMDYPPQAR  187 (303)
Q Consensus       119 ~~~~~----~~~h-----~~~v~~~~~~~~~~~l~s~~~D~~v~lWd--l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  187 (303)
                      .|...    ...|     ..++.+++++.+-..+|.|=.||.|.++.  +.+.... ..              .+     
T Consensus       107 sP~c~~~~ri~~~~np~~~~p~s~l~Vs~~l~~Iv~Gf~nG~V~~~~GDi~RDrgs-r~--------------~~-----  166 (933)
T KOG2114|consen  107 SPQCLYEHRIFTIKNPTNPSPASSLAVSEDLKTIVCGFTNGLVICYKGDILRDRGS-RQ--------------DY-----  166 (933)
T ss_pred             CcceeeeeeeeccCCCCCCCcceEEEEEccccEEEEEecCcEEEEEcCcchhcccc-ce--------------ee-----
Confidence            13222    1222     33577888888888889999999999983  2111100 00              00     


Q ss_pred             cccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeE-EEEeecCCCCeEEEEECCCCCe
Q 022074          188 DLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQ-VAALKYHTSPVRDCSWHPSQPM  266 (303)
Q Consensus       188 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~-~~~~~~h~~~I~~v~~sp~~~~  266 (303)
                        .+....++.          -      ..+..+++.++-+.....|.+|.+....+ +..+..|..+++|.+|++..+.
T Consensus       167 --~~~~~~pIT----------g------L~~~~d~~s~lFv~Tt~~V~~y~l~gr~p~~~~ld~~G~~lnCss~~~~t~q  228 (933)
T KOG2114|consen  167 --SHRGKEPIT----------G------LALRSDGKSVLFVATTEQVMLYSLSGRTPSLKVLDNNGISLNCSSFSDGTYQ  228 (933)
T ss_pred             --eccCCCCce----------e------eEEecCCceeEEEEecceeEEEEecCCCcceeeeccCCccceeeecCCCCcc
Confidence              000000110          0      11222344433344456789999875442 5557788899999999998875


Q ss_pred             EEEEeCCCCEEEeecCC
Q 022074          267 LVSSSWDGDVVRWEFPG  283 (303)
Q Consensus       267 las~s~Dg~i~~Wd~~~  283 (303)
                      ++.|+. .-+.+++.++
T Consensus       229 fIca~~-e~l~fY~sd~  244 (933)
T KOG2114|consen  229 FICAGS-EFLYFYDSDG  244 (933)
T ss_pred             EEEecC-ceEEEEcCCC
Confidence            666553 4677887653


No 311
>KOG1832 consensus HIV-1 Vpr-binding protein [Cell cycle control, cell division, chromosome partitioning]
Probab=97.59  E-value=5.6e-05  Score=72.44  Aligned_cols=158  Identities=12%  Similarity=0.245  Sum_probs=101.4

Q ss_pred             EEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCC--c
Q 022074           74 SLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQ--A  151 (303)
Q Consensus        74 ~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~--~  151 (303)
                      +..+..|+..-+|++|+- ..+.|+.|+..|.|++++..++    .-.....+|..+|+-+..+.+|..+++.+.-.  -
T Consensus      1094 w~~frd~~~~fTc~afs~-~~~hL~vG~~~Geik~~nv~sG----~~e~s~ncH~SavT~vePs~dgs~~Ltsss~S~Pl 1168 (1516)
T KOG1832|consen 1094 WRSFRDETALFTCIAFSG-GTNHLAVGSHAGEIKIFNVSSG----SMEESVNCHQSAVTLVEPSVDGSTQLTSSSSSSPL 1168 (1516)
T ss_pred             chhhhccccceeeEEeec-CCceEEeeeccceEEEEEccCc----cccccccccccccccccccCCcceeeeeccccCch
Confidence            345667888889999964 5678899999999999997643    33456778999999998888998877765432  4


Q ss_pred             EEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCC
Q 022074          152 IKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHD  231 (303)
Q Consensus       152 v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~d  231 (303)
                      .-+|++..   ...                              ...++++..    .+      .|+..-++-+.|..-
T Consensus      1169 saLW~~~s---~~~------------------------------~~Hsf~ed~----~v------kFsn~~q~r~~gt~~ 1205 (1516)
T KOG1832|consen 1169 SALWDASS---TGG------------------------------PRHSFDEDK----AV------KFSNSLQFRALGTEA 1205 (1516)
T ss_pred             HHHhcccc---ccC------------------------------ccccccccc----ee------ehhhhHHHHHhcccc
Confidence            66787653   111                              111111111    01      122222222334444


Q ss_pred             CeEEEEECCCCeEEEEe-e---cCCCCeEEEEECCCCCeEEEEeCCCCEEEeecCCC
Q 022074          232 SCVYVYDLVSGEQVAAL-K---YHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFPGN  284 (303)
Q Consensus       232 g~i~iwd~~~~~~~~~~-~---~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~~~  284 (303)
                      ....+||+.++.++.++ .   +.+..=+.+.|||+..+++--|     .+||+..+
T Consensus      1206 d~a~~YDvqT~~~l~tylt~~~~~~y~~n~a~FsP~D~LIlndG-----vLWDvR~~ 1257 (1516)
T KOG1832|consen 1206 DDALLYDVQTCSPLQTYLTDTVTSSYSNNLAHFSPCDTLILNDG-----VLWDVRIP 1257 (1516)
T ss_pred             cceEEEecccCcHHHHhcCcchhhhhhccccccCCCcceEeeCc-----eeeeeccH
Confidence            56899999998765542 2   2233447888999998877544     57998754


No 312
>COG5354 Uncharacterized protein, contains Trp-Asp (WD) repeat [General function prediction only]
Probab=97.57  E-value=0.0096  Score=54.17  Aligned_cols=209  Identities=17%  Similarity=0.238  Sum_probs=112.2

Q ss_pred             ceEEEEEcCCCCEEEEeeCCC---------------eEEEEECCCCceEEEEecccCC--eE-EEEEccCCCcEEEEecC
Q 022074           41 GIFSLKFSTDGRELVAGSSDD---------------CIYVYDLEANKLSLRILAHTSD--VN-TVCFGDESGHLIYSGSD  102 (303)
Q Consensus        41 ~v~~l~~s~~g~~l~sgs~Dg---------------~v~lwd~~~~~~~~~~~~h~~~--v~-~l~~~~~~~~~l~s~s~  102 (303)
                      -|+-+.|+|++++|.+=+.-.               .+.+||..++.++..+......  .. -+.|+.+ +++++=- -
T Consensus        73 ~V~~~~fSP~~kYL~tw~~~pi~~pe~e~sp~~~~n~~~vwd~~sg~iv~sf~~~~q~~~~Wp~~k~s~~-D~y~ARv-v  150 (561)
T COG5354          73 DVKYLDFSPNEKYLVTWSREPIIEPEIEISPFTSKNNVFVWDIASGMIVFSFNGISQPYLGWPVLKFSID-DKYVARV-V  150 (561)
T ss_pred             CceecccCcccceeeeeccCCccChhhccCCccccCceeEEeccCceeEeeccccCCcccccceeeeeec-chhhhhh-c
Confidence            388899999999999876433               4899999999887665544433  33 5566543 3444322 2


Q ss_pred             CCeEEEEcCccccCCCccceeecccccCeEEEEeCCCC--CEEE-----EEeCCCcEEEEEcccccCCcccccCccceee
Q 022074          103 DNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDG--RYLI-----SNGKDQAIKLWDIRKMSSNASCNLGFRSYEW  175 (303)
Q Consensus       103 dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~--~~l~-----s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~  175 (303)
                      ...++++++. .+....+...+  ...++....++|.+  ..|+     ..+.+..+++|.+..-....+.++ +     
T Consensus       151 ~~sl~i~e~t-~n~~~~p~~~l--r~~gi~dFsisP~~n~~~la~~tPEk~~kpa~~~i~sIp~~s~l~tk~l-f-----  221 (561)
T COG5354         151 GSSLYIHEIT-DNIEEHPFKNL--RPVGILDFSISPEGNHDELAYWTPEKLNKPAMVRILSIPKNSVLVTKNL-F-----  221 (561)
T ss_pred             cCeEEEEecC-CccccCchhhc--cccceeeEEecCCCCCceEEEEccccCCCCcEEEEEEccCCCeeeeeee-E-----
Confidence            3468888852 22222233222  13556666777753  2233     255677777776642111111100 0     


Q ss_pred             eceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCe
Q 022074          176 DYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPV  255 (303)
Q Consensus       176 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I  255 (303)
                      ..                +.+...++...  . .+-+.....+..+..++    .+..+++++++....--. ....++|
T Consensus       222 k~----------------~~~qLkW~~~g--~-~ll~l~~t~~ksnKsyf----gesnLyl~~~~e~~i~V~-~~~~~pV  277 (561)
T COG5354         222 KV----------------SGVQLKWQVLG--K-YLLVLVMTHTKSNKSYF----GESNLYLLRITERSIPVE-KDLKDPV  277 (561)
T ss_pred             ee----------------cccEEEEecCC--c-eEEEEEEEeeeccccee----ccceEEEEeeccccccee-ccccccc
Confidence            00                00000000000  0 00000000111122222    256789999875553222 2567899


Q ss_pred             EEEEECCCCCeEE--EEeCCCCEEEeecCCC
Q 022074          256 RDCSWHPSQPMLV--SSSWDGDVVRWEFPGN  284 (303)
Q Consensus       256 ~~v~~sp~~~~la--s~s~Dg~i~~Wd~~~~  284 (303)
                      .+.+|+|+++.++  +|-.+-++.++|++++
T Consensus       278 hdf~W~p~S~~F~vi~g~~pa~~s~~~lr~N  308 (561)
T COG5354         278 HDFTWEPLSSRFAVISGYMPASVSVFDLRGN  308 (561)
T ss_pred             eeeeecccCCceeEEecccccceeecccccc
Confidence            9999999987554  4457888888888766


No 313
>PF08450 SGL:  SMP-30/Gluconolactonase/LRE-like region;  InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE.; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=97.55  E-value=0.041  Score=46.65  Aligned_cols=192  Identities=15%  Similarity=0.187  Sum_probs=107.2

Q ss_pred             EEEEcC-CCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCccce
Q 022074           44 SLKFST-DGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAG  122 (303)
Q Consensus        44 ~l~~s~-~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~  122 (303)
                      +..|.+ +|..+.+--..+.|..|+..++....  ..... ...+++..+++ .++.+..++. .++|......  ....
T Consensus         4 gp~~d~~~g~l~~~D~~~~~i~~~~~~~~~~~~--~~~~~-~~G~~~~~~~g-~l~v~~~~~~-~~~d~~~g~~--~~~~   76 (246)
T PF08450_consen    4 GPVWDPRDGRLYWVDIPGGRIYRVDPDTGEVEV--IDLPG-PNGMAFDRPDG-RLYVADSGGI-AVVDPDTGKV--TVLA   76 (246)
T ss_dssp             EEEEETTTTEEEEEETTTTEEEEEETTTTEEEE--EESSS-EEEEEEECTTS-EEEEEETTCE-EEEETTTTEE--EEEE
T ss_pred             ceEEECCCCEEEEEEcCCCEEEEEECCCCeEEE--EecCC-CceEEEEccCC-EEEEEEcCce-EEEecCCCcE--EEEe
Confidence            467887 66666666678899999998886532  22222 56666653445 4445555444 4457642211  1111


Q ss_pred             eec--c-cccCeEEEEeCCCCCEEEEEeCCC--------cEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccC
Q 022074          123 VLM--G-HLEGITFIDSRGDGRYLISNGKDQ--------AIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKH  191 (303)
Q Consensus       123 ~~~--~-h~~~v~~~~~~~~~~~l~s~~~D~--------~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  191 (303)
                      ...  . .....+.+.+.++|++.++.....        .|..++.. .                               
T Consensus        77 ~~~~~~~~~~~~ND~~vd~~G~ly~t~~~~~~~~~~~~g~v~~~~~~-~-------------------------------  124 (246)
T PF08450_consen   77 DLPDGGVPFNRPNDVAVDPDGNLYVTDSGGGGASGIDPGSVYRIDPD-G-------------------------------  124 (246)
T ss_dssp             EEETTCSCTEEEEEEEE-TTS-EEEEEECCBCTTCGGSEEEEEEETT-S-------------------------------
T ss_pred             eccCCCcccCCCceEEEcCCCCEEEEecCCCccccccccceEEECCC-C-------------------------------
Confidence            111  1 223467788889998777765432        22222221 0                               


Q ss_pred             CCCCcceEEecccceeeeEEEeeeeeeeCCCeEEE-EEeCCCeEEEEECCCC-e-E-----EEEeecCCCCeEEEEECCC
Q 022074          192 PCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIY-TGSHDSCVYVYDLVSG-E-Q-----VAALKYHTSPVRDCSWHPS  263 (303)
Q Consensus       192 ~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~la-tg~~dg~i~iwd~~~~-~-~-----~~~~~~h~~~I~~v~~sp~  263 (303)
                          .+...      ..-+.......++++++.|+ +-+..+.|+.+++... . .     ...+..-.+..-.+++..+
T Consensus       125 ----~~~~~------~~~~~~pNGi~~s~dg~~lyv~ds~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~pDG~~vD~~  194 (246)
T PF08450_consen  125 ----KVTVV------ADGLGFPNGIAFSPDGKTLYVADSFNGRIWRFDLDADGGELSNRRVFIDFPGGPGYPDGLAVDSD  194 (246)
T ss_dssp             ----EEEEE------EEEESSEEEEEEETTSSEEEEEETTTTEEEEEEEETTTCCEEEEEEEEE-SSSSCEEEEEEEBTT
T ss_pred             ----eEEEE------ecCcccccceEECCcchheeecccccceeEEEeccccccceeeeeeEEEcCCCCcCCCcceEcCC
Confidence                00000      00011112345788887664 6678899999998532 2 1     1122222234788999999


Q ss_pred             CCeEEEEeCCCCEEEeecCCC
Q 022074          264 QPMLVSSSWDGDVVRWEFPGN  284 (303)
Q Consensus       264 ~~~las~s~Dg~i~~Wd~~~~  284 (303)
                      |++.++.-..+.|.+++..+.
T Consensus       195 G~l~va~~~~~~I~~~~p~G~  215 (246)
T PF08450_consen  195 GNLWVADWGGGRIVVFDPDGK  215 (246)
T ss_dssp             S-EEEEEETTTEEEEEETTSC
T ss_pred             CCEEEEEcCCCEEEEECCCcc
Confidence            999888888899999997643


No 314
>KOG2444 consensus WD40 repeat protein [General function prediction only]
Probab=97.47  E-value=0.00033  Score=57.36  Aligned_cols=64  Identities=13%  Similarity=0.152  Sum_probs=57.2

Q ss_pred             CeEEEEEeCCCeEEEEECCCCeEEEEeecCC-CCeEEEEECCCCCeEEEE--eCCCCEEEeecCCCC
Q 022074          222 QKYIYTGSHDSCVYVYDLVSGEQVAALKYHT-SPVRDCSWHPSQPMLVSS--SWDGDVVRWEFPGNG  285 (303)
Q Consensus       222 ~~~latg~~dg~i~iwd~~~~~~~~~~~~h~-~~I~~v~~sp~~~~las~--s~Dg~i~~Wd~~~~~  285 (303)
                      +.+..++++||.||-|+++-.+.+....+|+ .++.....+..+++++.+  |.|..++.|++....
T Consensus       114 ~~~~c~~~~dg~ir~~n~~p~k~~g~~g~h~~~~~e~~ivv~sd~~i~~a~~S~d~~~k~W~ve~~~  180 (238)
T KOG2444|consen  114 SSLGCVGAQDGRIRACNIKPNKVLGYVGQHNFESGEELIVVGSDEFLKIADTSHDRVLKKWNVEKIK  180 (238)
T ss_pred             cceeEEeccCCceeeeccccCceeeeeccccCCCcceeEEecCCceEEeeccccchhhhhcchhhhh
Confidence            4578889999999999999888888888888 899999999999999999  999999999987654


No 315
>smart00320 WD40 WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Probab=97.42  E-value=0.00028  Score=39.84  Aligned_cols=32  Identities=34%  Similarity=0.504  Sum_probs=29.3

Q ss_pred             CCcccceEEEEEcCCCCEEEEeeCCCeEEEEE
Q 022074           36 GGYSFGIFSLKFSTDGRELVAGSSDDCIYVYD   67 (303)
Q Consensus        36 ~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd   67 (303)
                      .+|...|.++.|+++++.+++++.|+.+++|+
T Consensus         9 ~~~~~~i~~~~~~~~~~~~~~~~~d~~~~~~~   40 (40)
T smart00320        9 KGHTGPVTSVAFSPDGKYLASASDDGTIKLWD   40 (40)
T ss_pred             EecCCceeEEEECCCCCEEEEecCCCeEEEcC
Confidence            46788899999999999999999999999995


No 316
>KOG2066 consensus Vacuolar assembly/sorting protein VPS41 [Intracellular trafficking, secretion, and vesicular transport]
Probab=97.41  E-value=0.0055  Score=58.58  Aligned_cols=182  Identities=14%  Similarity=0.166  Sum_probs=112.3

Q ss_pred             ceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCcc
Q 022074           41 GIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKP  120 (303)
Q Consensus        41 ~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~  120 (303)
                      .+.|++++  ++.++-|+-+|.|++++....-  .+...|...       ...+.+++|||.||+|.+-.+..    ...
T Consensus        41 ~is~~av~--~~~~~~GtH~g~v~~~~~~~~~--~~~~~~s~~-------~~~Gey~asCS~DGkv~I~sl~~----~~~  105 (846)
T KOG2066|consen   41 AISCCAVH--DKFFALGTHRGAVYLTTCQGNP--KTNFDHSSS-------ILEGEYVASCSDDGKVVIGSLFT----DDE  105 (846)
T ss_pred             HHHHHHhh--cceeeeccccceEEEEecCCcc--ccccccccc-------ccCCceEEEecCCCcEEEeeccC----Ccc
Confidence            46666666  6789999999999999875542  334444332       34589999999999999876532    111


Q ss_pred             ceeecccccCeEEEEeCCC-----CCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCC
Q 022074          121 AGVLMGHLEGITFIDSRGD-----GRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQ  195 (303)
Q Consensus       121 ~~~~~~h~~~v~~~~~~~~-----~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  195 (303)
                      ...+ .....+.+++++|+     .+.+++||.-| +.++.-+.+....+.                      ..+...+
T Consensus       106 ~~~~-df~rpiksial~Pd~~~~~sk~fv~GG~ag-lvL~er~wlgnk~~v----------------------~l~~~eG  161 (846)
T KOG2066|consen  106 ITQY-DFKRPIKSIALHPDFSRQQSKQFVSGGMAG-LVLSERNWLGNKDSV----------------------VLSEGEG  161 (846)
T ss_pred             ceeE-ecCCcceeEEeccchhhhhhhheeecCcce-EEEehhhhhcCccce----------------------eeecCcc
Confidence            2222 23356777788776     56789998877 777754322110000                      0000000


Q ss_pred             cceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCC------CeEEEEECCCCCeEEE
Q 022074          196 SVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTS------PVRDCSWHPSQPMLVS  269 (303)
Q Consensus       196 ~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~------~I~~v~~sp~~~~las  269 (303)
                      .            +..+.|      .|.++|=++.+| |++||..+++.+..++-...      .-..+.|.+..++++.
T Consensus       162 ~------------I~~i~W------~g~lIAWand~G-v~vyd~~~~~~l~~i~~p~~~~R~e~fpphl~W~~~~~LVIG  222 (846)
T KOG2066|consen  162 P------------IHSIKW------RGNLIAWANDDG-VKVYDTPTRQRLTNIPPPSQSVRPELFPPHLHWQDEDRLVIG  222 (846)
T ss_pred             c------------eEEEEe------cCcEEEEecCCC-cEEEeccccceeeccCCCCCCCCcccCCCceEecCCCeEEEe
Confidence            1            111222      367888888777 89999999988877653322      2346778777765554


Q ss_pred             EeCCCCEEEeecC
Q 022074          270 SSWDGDVVRWEFP  282 (303)
Q Consensus       270 ~s~Dg~i~~Wd~~  282 (303)
                      -+  -+|++..++
T Consensus       223 W~--d~v~i~~I~  233 (846)
T KOG2066|consen  223 WG--DSVKICSIK  233 (846)
T ss_pred             cC--CeEEEEEEe
Confidence            33  478887776


No 317
>PRK04043 tolB translocation protein TolB; Provisional
Probab=97.34  E-value=0.13  Score=47.54  Aligned_cols=172  Identities=15%  Similarity=0.094  Sum_probs=88.5

Q ss_pred             eEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecC---CCeEEEEcCccccCCCccceeecccccCeEEEEeCC
Q 022074           62 CIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSD---DNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRG  138 (303)
Q Consensus        62 ~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~---dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~  138 (303)
                      .|.+-|.+... ...+... +......|+|+..+.++..+.   +..|.++|+..    ++ ...+....+......|+|
T Consensus       170 ~l~~~d~dg~~-~~~~~~~-~~~~~p~wSpDG~~~i~y~s~~~~~~~Iyv~dl~t----g~-~~~lt~~~g~~~~~~~SP  242 (419)
T PRK04043        170 NIVLADYTLTY-QKVIVKG-GLNIFPKWANKEQTAFYYTSYGERKPTLYKYNLYT----GK-KEKIASSQGMLVVSDVSK  242 (419)
T ss_pred             eEEEECCCCCc-eeEEccC-CCeEeEEECCCCCcEEEEEEccCCCCEEEEEECCC----Cc-EEEEecCCCcEEeeEECC
Confidence            34444444333 2223333 355677897653333443332   45788888642    22 222323344455667999


Q ss_pred             CCCEEE-EEeCCC--cEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeee
Q 022074          139 DGRYLI-SNGKDQ--AIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFS  215 (303)
Q Consensus       139 ~~~~l~-s~~~D~--~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  215 (303)
                      ||+.++ +.+.++  .|.++|+.....                                   ..++....      ....
T Consensus       243 DG~~la~~~~~~g~~~Iy~~dl~~g~~-----------------------------------~~LT~~~~------~d~~  281 (419)
T PRK04043        243 DGSKLLLTMAPKGQPDIYLYDTNTKTL-----------------------------------TQITNYPG------IDVN  281 (419)
T ss_pred             CCCEEEEEEccCCCcEEEEEECCCCcE-----------------------------------EEcccCCC------ccCc
Confidence            997764 444444  444445432110                                   00000000      0123


Q ss_pred             eeeeCCCeEEEEEeC-CC--eEEEEECCCCeEEEEeecCCCCeEEEEECCCCCeEEEEeCC-------C--CEEEeecCC
Q 022074          216 PVYSTGQKYIYTGSH-DS--CVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQPMLVSSSWD-------G--DVVRWEFPG  283 (303)
Q Consensus       216 ~~~s~~~~~latg~~-dg--~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~las~s~D-------g--~i~~Wd~~~  283 (303)
                      |.|+|||+.|+-.+. .+  .|++.|+.+++...... ...  ....|||||+.|+-.+..       +  .|.+-|+.+
T Consensus       282 p~~SPDG~~I~F~Sdr~g~~~Iy~~dl~~g~~~rlt~-~g~--~~~~~SPDG~~Ia~~~~~~~~~~~~~~~~I~v~d~~~  358 (419)
T PRK04043        282 GNFVEDDKRIVFVSDRLGYPNIFMKKLNSGSVEQVVF-HGK--NNSSVSTYKNYIVYSSRETNNEFGKNTFNLYLISTNS  358 (419)
T ss_pred             cEECCCCCEEEEEECCCCCceEEEEECCCCCeEeCcc-CCC--cCceECCCCCEEEEEEcCCCcccCCCCcEEEEEECCC
Confidence            558888877666553 22  68888998776533221 111  124899999987766554       2  456666654


Q ss_pred             C
Q 022074          284 N  284 (303)
Q Consensus       284 ~  284 (303)
                      .
T Consensus       359 g  359 (419)
T PRK04043        359 D  359 (419)
T ss_pred             C
Confidence            3


No 318
>KOG0882 consensus Cyclophilin-related peptidyl-prolyl cis-trans isomerase [Posttranslational modification, protein turnover, chaperones]
Probab=97.18  E-value=0.0025  Score=57.08  Aligned_cols=216  Identities=16%  Similarity=0.099  Sum_probs=130.6

Q ss_pred             CCcccceEEEEEcCCCCEEEEeeC-CCeEEEEECCCCceEEEEecccCCeEEEEEccCCC----cEEEEecCCCeEEEEc
Q 022074           36 GGYSFGIFSLKFSTDGRELVAGSS-DDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESG----HLIYSGSDDNLCKVWD  110 (303)
Q Consensus        36 ~~~~~~v~~l~~s~~g~~l~sgs~-Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~----~~l~s~s~dg~v~lWd  110 (303)
                      ..|-..|.+++.+-+|-.+.+.+. |..++++|+.+-....-+.-. ..-..+.|....+    .+.++.-.++.+.++|
T Consensus        50 raHL~~I~sl~~S~dg~L~~Sv~d~Dhs~KvfDvEn~DminmiKL~-~lPg~a~wv~skGd~~s~IAVs~~~sg~i~VvD  128 (558)
T KOG0882|consen   50 RAHLGVILSLAVSYDGWLFRSVEDPDHSVKVFDVENFDMINMIKLV-DLPGFAEWVTSKGDKISLIAVSLFKSGKIFVVD  128 (558)
T ss_pred             HHHHHHHHhhhccccceeEeeccCcccceeEEEeeccchhhhcccc-cCCCceEEecCCCCeeeeEEeecccCCCcEEEC
Confidence            477788999999999999999777 999999999876544212111 1111222311122    3344455789999999


Q ss_pred             CccccCCCccceeec-ccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccc
Q 022074          111 RRCLNVKGKPAGVLM-GHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDL  189 (303)
Q Consensus       111 ~~~~~~~~~~~~~~~-~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  189 (303)
                      .+...   .+...+. -|..+|..+...+.+..+++....|-|.-|.....     .......+.|.+.           
T Consensus       129 ~~~d~---~q~~~fkklH~sPV~~i~y~qa~Ds~vSiD~~gmVEyWs~e~~-----~qfPr~~l~~~~K-----------  189 (558)
T KOG0882|consen  129 GFGDF---CQDGYFKKLHFSPVKKIRYNQAGDSAVSIDISGMVEYWSAEGP-----FQFPRTNLNFELK-----------  189 (558)
T ss_pred             CcCCc---CccceecccccCceEEEEeeccccceeeccccceeEeecCCCc-----ccCcccccccccc-----------
Confidence            76322   1222222 38889999999999999999999999999986531     0000001111111           


Q ss_pred             cCCCCCcceEEecccceeeeEEEe---eeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEee-----------------
Q 022074          190 KHPCDQSVATYKGHSVLRTLIRCH---FSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALK-----------------  249 (303)
Q Consensus       190 ~~~~~~~~~~~~~~~~~~~~~~~~---~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~-----------------  249 (303)
                       +.           ..+....++.   .+..|+|++..+++-+.|..|++++.++|+.+..+.                 
T Consensus       190 -~e-----------TdLy~f~K~Kt~pts~Efsp~g~qistl~~DrkVR~F~~KtGklvqeiDE~~t~~~~q~ks~y~l~  257 (558)
T KOG0882|consen  190 -HE-----------TDLYGFPKAKTEPTSFEFSPDGAQISTLNPDRKVRGFVFKTGKLVQEIDEVLTDAQYQPKSPYGLM  257 (558)
T ss_pred             -cc-----------chhhcccccccCccceEEccccCcccccCcccEEEEEEeccchhhhhhhccchhhhhccccccccc
Confidence             00           0000001111   122367889999999999999999999986443332                 


Q ss_pred             ---------------cCC-CCeEEEEECCCCCeEEEEeCCCCEEEeecCCC
Q 022074          250 ---------------YHT-SPVRDCSWHPSQPMLVSSSWDGDVVRWEFPGN  284 (303)
Q Consensus       250 ---------------~h~-~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~~~  284 (303)
                                     .|. .+-+-+.|...+++|+-++.= -|++.++..+
T Consensus       258 ~VelgRRmaverelek~~~~~~~~~~fdes~~flly~t~~-gikvin~~tn  307 (558)
T KOG0882|consen  258 HVELGRRMAVERELEKHGSTVGTNAVFDESGNFLLYGTIL-GIKVINLDTN  307 (558)
T ss_pred             eeehhhhhhHHhhHhhhcCcccceeEEcCCCCEEEeecce-eEEEEEeecC
Confidence                           111 234456677788888877653 3556665543


No 319
>PF13360 PQQ_2:  PQQ-like domain; PDB: 3HXJ_B 1YIQ_A 1KV9_A 3Q54_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A ....
Probab=97.11  E-value=0.07  Score=44.66  Aligned_cols=147  Identities=15%  Similarity=0.154  Sum_probs=79.3

Q ss_pred             CCeEEEEECCCCceEEEEecc--cCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCccceeec--ccccCeEEEE
Q 022074           60 DDCIYVYDLEANKLSLRILAH--TSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLM--GHLEGITFID  135 (303)
Q Consensus        60 Dg~v~lwd~~~~~~~~~~~~h--~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~--~h~~~v~~~~  135 (303)
                      +|+|..||+.+|+..-+..--  .....+... .. +..+++++.++.+..||....+    ..-...  +......   
T Consensus         2 ~g~l~~~d~~tG~~~W~~~~~~~~~~~~~~~~-~~-~~~v~~~~~~~~l~~~d~~tG~----~~W~~~~~~~~~~~~---   72 (238)
T PF13360_consen    2 DGTLSALDPRTGKELWSYDLGPGIGGPVATAV-PD-GGRVYVASGDGNLYALDAKTGK----VLWRFDLPGPISGAP---   72 (238)
T ss_dssp             TSEEEEEETTTTEEEEEEECSSSCSSEEETEE-EE-TTEEEEEETTSEEEEEETTTSE----EEEEEECSSCGGSGE---
T ss_pred             CCEEEEEECCCCCEEEEEECCCCCCCccceEE-Ee-CCEEEEEcCCCEEEEEECCCCC----EEEEeecccccccee---
Confidence            689999999999876554321  111121122 22 4466677889999999975332    221111  1111111   


Q ss_pred             eCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeee
Q 022074          136 SRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFS  215 (303)
Q Consensus       136 ~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  215 (303)
                       ...+..++.+..|+.++.+|.+..+.           .|.......++..  .                       ...
T Consensus        73 -~~~~~~v~v~~~~~~l~~~d~~tG~~-----------~W~~~~~~~~~~~--~-----------------------~~~  115 (238)
T PF13360_consen   73 -VVDGGRVYVGTSDGSLYALDAKTGKV-----------LWSIYLTSSPPAG--V-----------------------RSS  115 (238)
T ss_dssp             -EEETTEEEEEETTSEEEEEETTTSCE-----------EEEEEE-SSCTCS--T-----------------------B--
T ss_pred             -eecccccccccceeeeEecccCCcce-----------eeeeccccccccc--c-----------------------ccc
Confidence             11344666666888999999765332           1210000000000  0                       000


Q ss_pred             eeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCC
Q 022074          216 PVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHT  252 (303)
Q Consensus       216 ~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~  252 (303)
                      .....++..++.+..++.|..+|.++|+.+-......
T Consensus       116 ~~~~~~~~~~~~~~~~g~l~~~d~~tG~~~w~~~~~~  152 (238)
T PF13360_consen  116 SSPAVDGDRLYVGTSSGKLVALDPKTGKLLWKYPVGE  152 (238)
T ss_dssp             SEEEEETTEEEEEETCSEEEEEETTTTEEEEEEESST
T ss_pred             cCceEecCEEEEEeccCcEEEEecCCCcEEEEeecCC
Confidence            0011125567888889999999999999877766543


No 320
>KOG2314 consensus Translation initiation factor 3, subunit b (eIF-3b) [Translation, ribosomal structure and biogenesis]
Probab=97.01  E-value=0.073  Score=49.24  Aligned_cols=110  Identities=15%  Similarity=0.141  Sum_probs=73.3

Q ss_pred             eEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecC-----------CCeEEEEc
Q 022074           42 IFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSD-----------DNLCKVWD  110 (303)
Q Consensus        42 v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~-----------dg~v~lWd  110 (303)
                      -.=+.|||-|.+|++-=.-| |.+|--..-....+ ..| ..|..+.|+| +.++|+|=+.           ...+++||
T Consensus       213 etyv~wSP~GTYL~t~Hk~G-I~lWGG~~f~r~~R-F~H-p~Vq~idfSP-~EkYLVT~s~~p~~~~~~d~e~~~l~IWD  288 (698)
T KOG2314|consen  213 ETYVRWSPKGTYLVTFHKQG-IALWGGESFDRIQR-FYH-PGVQFIDFSP-NEKYLVTYSPEPIIVEEDDNEGQQLIIWD  288 (698)
T ss_pred             eeeEEecCCceEEEEEeccc-eeeecCccHHHHHh-ccC-CCceeeecCC-ccceEEEecCCccccCcccCCCceEEEEE
Confidence            44589999999999998888 88994433322222 234 4688888964 6778887553           25889999


Q ss_pred             CccccCCCccceeeccccc--C-eEEEEeCCCCCEEEEEeCCCcEEEEEcccc
Q 022074          111 RRCLNVKGKPAGVLMGHLE--G-ITFIDSRGDGRYLISNGKDQAIKLWDIRKM  160 (303)
Q Consensus       111 ~~~~~~~~~~~~~~~~h~~--~-v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~  160 (303)
                      ++.+    ...+.|....+  . -..+.|+.|+.|+|.-.. .+|.|++..+.
T Consensus       289 I~tG----~lkrsF~~~~~~~~~WP~frWS~DdKy~Arm~~-~sisIyEtpsf  336 (698)
T KOG2314|consen  289 IATG----LLKRSFPVIKSPYLKWPIFRWSHDDKYFARMTG-NSISIYETPSF  336 (698)
T ss_pred             cccc----chhcceeccCCCccccceEEeccCCceeEEecc-ceEEEEecCce
Confidence            8743    33333322112  2 234578999999987665 67888886653


No 321
>KOG2444 consensus WD40 repeat protein [General function prediction only]
Probab=96.92  E-value=0.0029  Score=51.91  Aligned_cols=106  Identities=20%  Similarity=0.325  Sum_probs=63.7

Q ss_pred             CCEEEEeeCCCeEEEEECCCC-ceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCccceeecccc-
Q 022074           51 GRELVAGSSDDCIYVYDLEAN-KLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHL-  128 (303)
Q Consensus        51 g~~l~sgs~Dg~v~lwd~~~~-~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~-  128 (303)
                      +..+++|+.||.|.+|....- ........-...+.+....-..+.+..++++||.+|.|...    ..+..+...+|. 
T Consensus        70 ~~~~~vG~~dg~v~~~n~n~~g~~~d~~~s~~e~i~~~Ip~~~~~~~~c~~~~dg~ir~~n~~----p~k~~g~~g~h~~  145 (238)
T KOG2444|consen   70 SAKLMVGTSDGAVYVFNWNLEGAHSDRVCSGEESIDLGIPNGRDSSLGCVGAQDGRIRACNIK----PNKVLGYVGQHNF  145 (238)
T ss_pred             CceEEeecccceEEEecCCccchHHHhhhcccccceeccccccccceeEEeccCCceeeeccc----cCceeeeeccccC
Confidence            557999999999999977621 11111111112222322223345577789999999999864    334455555566 


Q ss_pred             cCeEEEEeCCCCCEEEEE--eCCCcEEEEEcccc
Q 022074          129 EGITFIDSRGDGRYLISN--GKDQAIKLWDIRKM  160 (303)
Q Consensus       129 ~~v~~~~~~~~~~~l~s~--~~D~~v~lWdl~~~  160 (303)
                      .++........++.++..  |.|..++.|++...
T Consensus       146 ~~~e~~ivv~sd~~i~~a~~S~d~~~k~W~ve~~  179 (238)
T KOG2444|consen  146 ESGEELIVVGSDEFLKIADTSHDRVLKKWNVEKI  179 (238)
T ss_pred             CCcceeEEecCCceEEeeccccchhhhhcchhhh
Confidence            445444445555666666  67777777776643


No 322
>PF06977 SdiA-regulated:  SdiA-regulated;  InterPro: IPR009722 This entry represents a conserved region approximately 100 residues long within a number of hypothetical bacterial proteins that may be regulated by SdiA, a member of the LuxR family of transcriptional regulators. Some proteins contain the IPR001258 from INTERPRO repeat.; PDB: 3QQZ_A.
Probab=96.87  E-value=0.24  Score=42.16  Aligned_cols=210  Identities=12%  Similarity=0.138  Sum_probs=105.4

Q ss_pred             CCcccceEEEEEcCCCC-EEEEeeCCCeEEEEECCCCceEEEEecc-cCCeEEEEEccCCCcEEEEecCCCeEEEEcCcc
Q 022074           36 GGYSFGIFSLKFSTDGR-ELVAGSSDDCIYVYDLEANKLSLRILAH-TSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRC  113 (303)
Q Consensus        36 ~~~~~~v~~l~~s~~g~-~l~sgs~Dg~v~lwd~~~~~~~~~~~~h-~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~  113 (303)
                      .|-...+..|+|+|+.+ .+++....+.|.-++++ |+...++.-. .+....+++. .++.++++.-.++.+.++++..
T Consensus        18 ~g~~~e~SGLTy~pd~~tLfaV~d~~~~i~els~~-G~vlr~i~l~g~~D~EgI~y~-g~~~~vl~~Er~~~L~~~~~~~   95 (248)
T PF06977_consen   18 PGILDELSGLTYNPDTGTLFAVQDEPGEIYELSLD-GKVLRRIPLDGFGDYEGITYL-GNGRYVLSEERDQRLYIFTIDD   95 (248)
T ss_dssp             TT--S-EEEEEEETTTTEEEEEETTTTEEEEEETT---EEEEEE-SS-SSEEEEEE--STTEEEEEETTTTEEEEEEE--
T ss_pred             CCccCCccccEEcCCCCeEEEEECCCCEEEEEcCC-CCEEEEEeCCCCCCceeEEEE-CCCEEEEEEcCCCcEEEEEEec
Confidence            34444599999999866 45556667778777764 5555544322 3567888885 4466666666689998887632


Q ss_pred             ccCCCcc--ceee-----cccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCC
Q 022074          114 LNVKGKP--AGVL-----MGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQA  186 (303)
Q Consensus       114 ~~~~~~~--~~~~-----~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  186 (303)
                      .......  ...+     ..+..++-.+++.+.++.|+.+-.....+++.++.........      ....  .      
T Consensus        96 ~~~~~~~~~~~~~~l~~~~~~N~G~EGla~D~~~~~L~v~kE~~P~~l~~~~~~~~~~~~~------~~~~--~------  161 (248)
T PF06977_consen   96 DTTSLDRADVQKISLGFPNKGNKGFEGLAYDPKTNRLFVAKERKPKRLYEVNGFPGGFDLF------VSDD--Q------  161 (248)
T ss_dssp             --TT--EEEEEEEE---S---SS--EEEEEETTTTEEEEEEESSSEEEEEEESTT-SS--E------EEE---H------
T ss_pred             cccccchhhceEEecccccCCCcceEEEEEcCCCCEEEEEeCCCChhhEEEccccCcccee------eccc--c------
Confidence            2111100  0111     1234458889999887777777777777787765421000000      0000  0      


Q ss_pred             ccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCC---------CCeEE
Q 022074          187 RDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHT---------SPVRD  257 (303)
Q Consensus       187 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~---------~~I~~  257 (303)
                               ....  ...........++.|   ..+.+++-..++..|-..| .+|+.+..+.-..         .....
T Consensus       162 ---------~~~~--~~~~~~d~S~l~~~p---~t~~lliLS~es~~l~~~d-~~G~~~~~~~L~~g~~gl~~~~~QpEG  226 (248)
T PF06977_consen  162 ---------DLDD--DKLFVRDLSGLSYDP---RTGHLLILSDESRLLLELD-RQGRVVSSLSLDRGFHGLSKDIPQPEG  226 (248)
T ss_dssp             ---------HHH---HT--SS---EEEEET---TTTEEEEEETTTTEEEEE--TT--EEEEEE-STTGGG-SS---SEEE
T ss_pred             ---------cccc--ccceeccccceEEcC---CCCeEEEEECCCCeEEEEC-CCCCEEEEEEeCCcccCcccccCCccE
Confidence                     0000  000000111111111   2467788888999999999 5677666554221         35789


Q ss_pred             EEECCCCCeEEEEeCCCCEE
Q 022074          258 CSWHPSQPMLVSSSWDGDVV  277 (303)
Q Consensus       258 v~~sp~~~~las~s~Dg~i~  277 (303)
                      |+|.++|++.+++ +-+...
T Consensus       227 Ia~d~~G~LYIvs-EpNlfy  245 (248)
T PF06977_consen  227 IAFDPDGNLYIVS-EPNLFY  245 (248)
T ss_dssp             EEE-TT--EEEEE-TTTEEE
T ss_pred             EEECCCCCEEEEc-CCceEE
Confidence            9999999876665 433333


No 323
>PF06433 Me-amine-dh_H:  Methylamine dehydrogenase heavy chain (MADH);  InterPro: IPR009451 Methylamine dehydrogenase (1.4.99.3 from EC) is a periplasmic quinoprotein found in several methylotrophic bacteria. It is induced when the bacteria are grown on methylamine as a carbon source. MADH catalyses the oxidative deamination of amines to their corresponding aldehydes. The redox cofactor of this enzyme is tryptophan tryptophylquinone (TTQ). Electrons derived from the oxidation of methylamine are passed to an electron acceptor, which is usually the blue-copper protein amicyanin (IPR002386 from INTERPRO). The reaction is: RCH2NH2 + H2O + acceptor = RCHO + NH3 + reduced acceptor. MADH is a hetero-tetramer, composed of two heavy subunits and two light subunits. The heavy subunit forms a seven-bladed beta-propeller-like structure.; GO: 0030058 amine dehydrogenase activity, 0030416 methylamine metabolic process, 0055114 oxidation-reduction process, 0042597 periplasmic space; PDB: 3RN1_F 3SVW_F 3PXT_F 3L4O_F 3L4M_D 3SJL_F 3PXS_D 3ORV_F 3RMZ_F 3RLM_F ....
Probab=96.69  E-value=0.3  Score=43.17  Aligned_cols=51  Identities=22%  Similarity=0.363  Sum_probs=39.8

Q ss_pred             eEEEEECCCCeEEEEeecCCCCeEEEEECCCCC-eEEEE-eCCCCEEEeecCCC
Q 022074          233 CVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQP-MLVSS-SWDGDVVRWEFPGN  284 (303)
Q Consensus       233 ~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~-~las~-s~Dg~i~~Wd~~~~  284 (303)
                      .|+++|+++++.+.++.. ..++.+++.+.+.+ +|.+. ..++.|.+||....
T Consensus       270 eVWv~D~~t~krv~Ri~l-~~~~~Si~Vsqd~~P~L~~~~~~~~~l~v~D~~tG  322 (342)
T PF06433_consen  270 EVWVYDLKTHKRVARIPL-EHPIDSIAVSQDDKPLLYALSAGDGTLDVYDAATG  322 (342)
T ss_dssp             EEEEEETTTTEEEEEEEE-EEEESEEEEESSSS-EEEEEETTTTEEEEEETTT-
T ss_pred             EEEEEECCCCeEEEEEeC-CCccceEEEccCCCcEEEEEcCCCCeEEEEeCcCC
Confidence            388999999999988863 24688999999865 66554 45799999998754


No 324
>PRK02888 nitrous-oxide reductase; Validated
Probab=96.67  E-value=0.084  Score=50.26  Aligned_cols=109  Identities=19%  Similarity=0.254  Sum_probs=58.7

Q ss_pred             EEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEcc-------CCCcEEEEecCCCeEEEEcCcccc
Q 022074           43 FSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGD-------ESGHLIYSGSDDNLCKVWDRRCLN  115 (303)
Q Consensus        43 ~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~-------~~~~~l~s~s~dg~v~lWd~~~~~  115 (303)
                      .-+.++++|+++.+.+.+.       +.+.....+..-+.. ..+.|+.       ++++....  .++.|.+.|.+...
T Consensus       238 d~v~~spdGk~afvTsyNs-------E~G~tl~em~a~e~d-~~vvfni~~iea~vkdGK~~~V--~gn~V~VID~~t~~  307 (635)
T PRK02888        238 DNVDTDYDGKYAFSTCYNS-------EEGVTLAEMMAAERD-WVVVFNIARIEEAVKAGKFKTI--GGSKVPVVDGRKAA  307 (635)
T ss_pred             ccceECCCCCEEEEeccCc-------ccCcceeeeccccCc-eEEEEchHHHHHhhhCCCEEEE--CCCEEEEEECCccc
Confidence            4567888888888776322       112211111111111 2222221       23544443  25689999975310


Q ss_pred             CCCccceeecccccCeEEEEeCCCCCEEEEEe-CCCcEEEEEccccc
Q 022074          116 VKGKPAGVLMGHLEGITFIDSRGDGRYLISNG-KDQAIKLWDIRKMS  161 (303)
Q Consensus       116 ~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~-~D~~v~lWdl~~~~  161 (303)
                      ..+.....+..-....+.++++|||++++.++ .+.+|-+.|+.+.+
T Consensus       308 ~~~~~v~~yIPVGKsPHGV~vSPDGkylyVanklS~tVSVIDv~k~k  354 (635)
T PRK02888        308 NAGSALTRYVPVPKNPHGVNTSPDGKYFIANGKLSPTVTVIDVRKLD  354 (635)
T ss_pred             cCCcceEEEEECCCCccceEECCCCCEEEEeCCCCCcEEEEEChhhh
Confidence            01111222222334567888999999986555 69999999998754


No 325
>PF14783 BBS2_Mid:  Ciliary BBSome complex subunit 2, middle region
Probab=96.65  E-value=0.16  Score=37.21  Aligned_cols=101  Identities=24%  Similarity=0.323  Sum_probs=60.1

Q ss_pred             eEEEEEc---CCC-CEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCC
Q 022074           42 IFSLKFS---TDG-RELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVK  117 (303)
Q Consensus        42 v~~l~~s---~~g-~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~  117 (303)
                      |.++++.   .|| +.|++||.|..||+|+-  .....++..+ +.|..++...  ...|+.+-.+|+|-+|+..     
T Consensus         2 V~al~~~d~d~dg~~eLlvGs~D~~IRvf~~--~e~~~Ei~e~-~~v~~L~~~~--~~~F~Y~l~NGTVGvY~~~-----   71 (111)
T PF14783_consen    2 VTALCLFDFDGDGENELLVGSDDFEIRVFKG--DEIVAEITET-DKVTSLCSLG--GGRFAYALANGTVGVYDRS-----   71 (111)
T ss_pred             eeEEEEEecCCCCcceEEEecCCcEEEEEeC--CcEEEEEecc-cceEEEEEcC--CCEEEEEecCCEEEEEeCc-----
Confidence            4555555   343 47999999999999955  3444444433 5677777643  3689999999999998742     


Q ss_pred             Cccceeecccc-cCeEEEEeCCCC-CEEEEEeCCCcE
Q 022074          118 GKPAGVLMGHL-EGITFIDSRGDG-RYLISNGKDQAI  152 (303)
Q Consensus       118 ~~~~~~~~~h~-~~v~~~~~~~~~-~~l~s~~~D~~v  152 (303)
                      .+.-+.=..|. -++...++..+| ..|++|=.+|.|
T Consensus        72 ~RlWRiKSK~~~~~~~~~D~~gdG~~eLI~GwsnGkv  108 (111)
T PF14783_consen   72 QRLWRIKSKNQVTSMAFYDINGDGVPELIVGWSNGKV  108 (111)
T ss_pred             ceeeeeccCCCeEEEEEEcCCCCCceEEEEEecCCeE
Confidence            11111111111 122233344444 347777777765


No 326
>KOG1008 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=96.52  E-value=0.00072  Score=62.92  Aligned_cols=143  Identities=24%  Similarity=0.321  Sum_probs=90.8

Q ss_pred             EEEccCchhhccccccccccCcCc-ccccCCCcccceEEEEEcC-CCCEEEEee----CCCeEEEEECCCCce----EEE
Q 022074            7 IVDVGSGTMESLANVTEIHDGLDF-SAADDGGYSFGIFSLKFST-DGRELVAGS----SDDCIYVYDLEANKL----SLR   76 (303)
Q Consensus         7 ~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~v~~l~~s~-~g~~l~sgs----~Dg~v~lwd~~~~~~----~~~   76 (303)
                      |+-||+++  .+|.+--. +..+. +.+.-+||-.+..+++|++ |.+.||+|=    .|..+.|||+.++-.    ...
T Consensus        72 IlavG~at--G~I~l~s~-r~~hdSs~E~tp~~ar~Ct~lAwneLDtn~LAagldkhrnds~~~Iwdi~s~ltvPke~~~  148 (783)
T KOG1008|consen   72 ILAVGSAT--GNISLLSV-RHPHDSSAEVTPGYARPCTSLAWNELDTNHLAAGLDKHRNDSSLKIWDINSLLTVPKESPL  148 (783)
T ss_pred             hhhhcccc--CceEEeec-CCcccccceecccccccccccccccccHHHHHhhhhhhcccCCccceecccccCCCccccc
Confidence            45567666  55543322 11223 3555688999999999997 556666663    466799999988721    112


Q ss_pred             Eec-ccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCC-CCCEEEEEeCCCcEEE
Q 022074           77 ILA-HTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRG-DGRYLISNGKDQAIKL  154 (303)
Q Consensus        77 ~~~-h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~-~~~~l~s~~~D~~v~l  154 (303)
                      +.+ ...+.+.+||. .+.+++++|.....++++|+|.......   .+  .+..+..+.+.| ..+|+++.. |+.|-+
T Consensus       149 fs~~~l~gqns~cwl-rd~klvlaGm~sr~~~ifdlRqs~~~~~---sv--nTk~vqG~tVdp~~~nY~cs~~-dg~iAi  221 (783)
T KOG1008|consen  149 FSSSTLDGQNSVCWL-RDTKLVLAGMTSRSVHIFDLRQSLDSVS---SV--NTKYVQGITVDPFSPNYFCSNS-DGDIAI  221 (783)
T ss_pred             cccccccCccccccc-cCcchhhcccccchhhhhhhhhhhhhhh---hh--hhhhcccceecCCCCCceeccc-cCceee
Confidence            222 34567788996 5567888999999999999872111111   11  122344455566 566777766 999999


Q ss_pred             EE-ccc
Q 022074          155 WD-IRK  159 (303)
Q Consensus       155 Wd-l~~  159 (303)
                      || .+.
T Consensus       222 wD~~rn  227 (783)
T KOG1008|consen  222 WDTYRN  227 (783)
T ss_pred             ccchhh
Confidence            99 443


No 327
>KOG1920 consensus IkappaB kinase complex, IKAP component [Transcription]
Probab=96.46  E-value=0.22  Score=50.30  Aligned_cols=113  Identities=19%  Similarity=0.212  Sum_probs=70.6

Q ss_pred             EEEEEcCCCCEEEEee----CC-CeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEec---CCCeEEEEcCccc
Q 022074           43 FSLKFSTDGRELVAGS----SD-DCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGS---DDNLCKVWDRRCL  114 (303)
Q Consensus        43 ~~l~~s~~g~~l~sgs----~D-g~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s---~dg~v~lWd~~~~  114 (303)
                      .+|+|..||+++++..    .+ .+|++||-+ |.+...-....+--.+++|-| .+..+++..   .|+.|.++.....
T Consensus       199 ~~IsWRgDg~~fAVs~~~~~~~~RkirV~drE-g~Lns~se~~~~l~~~LsWkP-sgs~iA~iq~~~sd~~IvffErNGL  276 (1265)
T KOG1920|consen  199 TSISWRGDGEYFAVSFVESETGTRKIRVYDRE-GALNSTSEPVEGLQHSLSWKP-SGSLIAAIQCKTSDSDIVFFERNGL  276 (1265)
T ss_pred             ceEEEccCCcEEEEEEEeccCCceeEEEeccc-chhhcccCcccccccceeecC-CCCeEeeeeecCCCCcEEEEecCCc
Confidence            4689999999999843    23 789999987 544322222233345889965 677877663   4667888874311


Q ss_pred             cCCCccceeecccccCeEEEEeCCCCCEEEE---EeCCCcEEEEEcc
Q 022074          115 NVKGKPAGVLMGHLEGITFIDSRGDGRYLIS---NGKDQAIKLWDIR  158 (303)
Q Consensus       115 ~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s---~~~D~~v~lWdl~  158 (303)
                      . .+.....+......+..+.|+.++..|+.   ......|++|-+.
T Consensus       277 ~-hg~f~l~~p~de~~ve~L~Wns~sdiLAv~~~~~e~~~v~lwt~~  322 (1265)
T KOG1920|consen  277 R-HGEFVLPFPLDEKEVEELAWNSNSDILAVVTSNLENSLVQLWTTG  322 (1265)
T ss_pred             c-ccccccCCcccccchheeeecCCCCceeeeecccccceEEEEEec
Confidence            1 01001011111223788899999888876   5555669999764


No 328
>PF08553 VID27:  VID27 cytoplasmic protein;  InterPro: IPR013863  This entry represents fungal and plant proteins and contains many hypothetical proteins. Vid27p is a cytoplasmic protein of unknown function, possibly regulates import of fructose-1,6-bisphosphatase into Vacuolar Import and Degradation (Vid) vesicles and is not essential for proteasome-dependent degradation of fructose-1,6-bisphosphatase (FBPase).
Probab=96.42  E-value=0.065  Score=52.69  Aligned_cols=130  Identities=16%  Similarity=0.189  Sum_probs=77.8

Q ss_pred             eCCCeEEEEECCCCceEEEEecccCC-eEEEEEccC----CCcEEEEecCCCeEEEEcCccccCCCccce-eec--cccc
Q 022074           58 SSDDCIYVYDLEANKLSLRILAHTSD-VNTVCFGDE----SGHLIYSGSDDNLCKVWDRRCLNVKGKPAG-VLM--GHLE  129 (303)
Q Consensus        58 s~Dg~v~lwd~~~~~~~~~~~~h~~~-v~~l~~~~~----~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~-~~~--~h~~  129 (303)
                      .....++-.|+.+|+.+..+..|... |..++-..+    .+...+.|-.+..+..||.|...  .+.+. ...  ....
T Consensus       501 ~~~~~ly~mDLe~GKVV~eW~~~~~~~v~~~~p~~K~aqlt~e~tflGls~n~lfriDpR~~~--~k~v~~~~k~Y~~~~  578 (794)
T PF08553_consen  501 NNPNKLYKMDLERGKVVEEWKVHDDIPVVDIAPDSKFAQLTNEQTFLGLSDNSLFRIDPRLSG--NKLVDSQSKQYSSKN  578 (794)
T ss_pred             CCCCceEEEecCCCcEEEEeecCCCcceeEecccccccccCCCceEEEECCCceEEeccCCCC--CceeeccccccccCC
Confidence            34577999999999999988887754 666653211    12344567667788899988422  11111 010  1223


Q ss_pred             CeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCC
Q 022074          130 GITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCD  194 (303)
Q Consensus       130 ~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  194 (303)
                      ...|++...+| +||.|+.+|.||+||- ... .  ....++++.-++..++...+++.+...|.
T Consensus       579 ~Fs~~aTt~~G-~iavgs~~G~IRLyd~-~g~-~--AKT~lp~lG~pI~~iDvt~DGkwilaTc~  638 (794)
T PF08553_consen  579 NFSCFATTEDG-YIAVGSNKGDIRLYDR-LGK-R--AKTALPGLGDPIIGIDVTADGKWILATCK  638 (794)
T ss_pred             CceEEEecCCc-eEEEEeCCCcEEeecc-cch-h--hhhcCCCCCCCeeEEEecCCCcEEEEeec
Confidence            57788777777 7999999999999983 211 1  11223333334444444455555544444


No 329
>PF08553 VID27:  VID27 cytoplasmic protein;  InterPro: IPR013863  This entry represents fungal and plant proteins and contains many hypothetical proteins. Vid27p is a cytoplasmic protein of unknown function, possibly regulates import of fructose-1,6-bisphosphatase into Vacuolar Import and Degradation (Vid) vesicles and is not essential for proteasome-dependent degradation of fructose-1,6-bisphosphatase (FBPase).
Probab=96.42  E-value=0.069  Score=52.51  Aligned_cols=59  Identities=17%  Similarity=0.245  Sum_probs=45.5

Q ss_pred             CCeEEEEEeCCCeEEEEECCCCeEEE-EeecCCCCeEEEEECCCCCeEEEEeCCCCEEEeec
Q 022074          221 GQKYIYTGSHDSCVYVYDLVSGEQVA-ALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEF  281 (303)
Q Consensus       221 ~~~~latg~~dg~i~iwd~~~~~~~~-~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~  281 (303)
                      ...++|.|+.+|.||+||- .|...+ .+.+-..||..|+.+.||++|++.+ +.-|.+.+.
T Consensus       587 ~~G~iavgs~~G~IRLyd~-~g~~AKT~lp~lG~pI~~iDvt~DGkwilaTc-~tyLlLi~t  646 (794)
T PF08553_consen  587 EDGYIAVGSNKGDIRLYDR-LGKRAKTALPGLGDPIIGIDVTADGKWILATC-KTYLLLIDT  646 (794)
T ss_pred             CCceEEEEeCCCcEEeecc-cchhhhhcCCCCCCCeeEEEecCCCcEEEEee-cceEEEEEE
Confidence            3457999999999999994 444333 3556678999999999999987776 456666764


No 330
>PF12894 Apc4_WD40:  Anaphase-promoting complex subunit 4 WD40 domain
Probab=96.35  E-value=0.019  Score=35.16  Aligned_cols=30  Identities=17%  Similarity=0.306  Sum_probs=27.7

Q ss_pred             cceEEEEEcCCCCEEEEeeCCCeEEEEECC
Q 022074           40 FGIFSLKFSTDGRELVAGSSDDCIYVYDLE   69 (303)
Q Consensus        40 ~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~   69 (303)
                      ..|.+++|+|..+.+|.|+.||.|.|+.++
T Consensus        12 ~~v~~~~w~P~mdLiA~~t~~g~v~v~Rl~   41 (47)
T PF12894_consen   12 SRVSCMSWCPTMDLIALGTEDGEVLVYRLN   41 (47)
T ss_pred             CcEEEEEECCCCCEEEEEECCCeEEEEECC
Confidence            459999999999999999999999999983


No 331
>PRK11138 outer membrane biogenesis protein BamB; Provisional
Probab=96.32  E-value=0.83  Score=41.70  Aligned_cols=58  Identities=19%  Similarity=0.173  Sum_probs=41.2

Q ss_pred             CeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEE-EEECCCCCeEEEEeCCCCEEEeec
Q 022074          222 QKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRD-CSWHPSQPMLVSSSWDGDVVRWEF  281 (303)
Q Consensus       222 ~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~-v~~sp~~~~las~s~Dg~i~~Wd~  281 (303)
                      +..++.++.||.+++.|..+|+.+...+.....+.+ -.+  .+..|..++.||.+..++.
T Consensus       335 ~g~l~v~~~~G~l~~ld~~tG~~~~~~~~~~~~~~s~P~~--~~~~l~v~t~~G~l~~~~~  393 (394)
T PRK11138        335 NGYLVVGDSEGYLHWINREDGRFVAQQKVDSSGFLSEPVV--ADDKLLIQARDGTVYAITR  393 (394)
T ss_pred             CCEEEEEeCCCEEEEEECCCCCEEEEEEcCCCcceeCCEE--ECCEEEEEeCCceEEEEeC
Confidence            345778899999999999999988776644333322 111  2446777788999998775


No 332
>COG0823 TolB Periplasmic component of the Tol biopolymer transport system [Intracellular trafficking and secretion]
Probab=96.32  E-value=0.14  Score=47.13  Aligned_cols=184  Identities=23%  Similarity=0.279  Sum_probs=101.9

Q ss_pred             cceEEEEEcCCCCEEEEee---CC-CeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEE--cCcc
Q 022074           40 FGIFSLKFSTDGRELVAGS---SD-DCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVW--DRRC  113 (303)
Q Consensus        40 ~~v~~l~~s~~g~~l~sgs---~D-g~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lW--d~~~  113 (303)
                      ..+..=.|+++++.++-.+   .. ..++++|+++++....+ ...+.-..-.|+|+..+++++...||...+|  |+..
T Consensus       193 ~~~~~p~ws~~~~~~~y~~f~~~~~~~i~~~~l~~g~~~~i~-~~~g~~~~P~fspDG~~l~f~~~rdg~~~iy~~dl~~  271 (425)
T COG0823         193 SLILTPAWSPDGKKLAYVSFELGGCPRIYYLDLNTGKRPVIL-NFNGNNGAPAFSPDGSKLAFSSSRDGSPDIYLMDLDG  271 (425)
T ss_pred             cceeccccCcCCCceEEEEEecCCCceEEEEeccCCccceee-ccCCccCCccCCCCCCEEEEEECCCCCccEEEEcCCC
Confidence            3566678999988755543   22 35899999988754332 2333344557887766777888888877666  5432


Q ss_pred             ccCCCccceeecccccCe-EEEEeCCCCCEEE-EEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccC
Q 022074          114 LNVKGKPAGVLMGHLEGI-TFIDSRGDGRYLI-SNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKH  191 (303)
Q Consensus       114 ~~~~~~~~~~~~~h~~~v-~~~~~~~~~~~l~-s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  191 (303)
                      .     ....+. +..++ ..=+++|+|++++ +.++.|.-.||-+.......                           
T Consensus       272 ~-----~~~~Lt-~~~gi~~~Ps~spdG~~ivf~Sdr~G~p~I~~~~~~g~~~---------------------------  318 (425)
T COG0823         272 K-----NLPRLT-NGFGINTSPSWSPDGSKIVFTSDRGGRPQIYLYDLEGSQV---------------------------  318 (425)
T ss_pred             C-----cceecc-cCCccccCccCCCCCCEEEEEeCCCCCcceEEECCCCCce---------------------------
Confidence            1     122222 22222 2445889999876 44456666666443211000                           


Q ss_pred             CCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeC-CCe--EEEEECCCCeEEEEeecCCCCeEEEEECCCCCeEE
Q 022074          192 PCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSH-DSC--VYVYDLVSGEQVAALKYHTSPVRDCSWHPSQPMLV  268 (303)
Q Consensus       192 ~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~-dg~--i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~la  268 (303)
                          ...+..+        .....|..||||++++..+. +|.  |.+.|+.++..+..+. +..-...-.|.|+++.+.
T Consensus       319 ----~riT~~~--------~~~~~p~~SpdG~~i~~~~~~~g~~~i~~~~~~~~~~~~~lt-~~~~~e~ps~~~ng~~i~  385 (425)
T COG0823         319 ----TRLTFSG--------GGNSNPVWSPDGDKIVFESSSGGQWDIDKNDLASGGKIRILT-STYLNESPSWAPNGRMIM  385 (425)
T ss_pred             ----eEeeccC--------CCCcCccCCCCCCEEEEEeccCCceeeEEeccCCCCcEEEcc-ccccCCCCCcCCCCceEE
Confidence                0001111        11124667899999888774 344  6666666655433332 222334456777776544


Q ss_pred             EE
Q 022074          269 SS  270 (303)
Q Consensus       269 s~  270 (303)
                      ..
T Consensus       386 ~~  387 (425)
T COG0823         386 FS  387 (425)
T ss_pred             Ee
Confidence            33


No 333
>PF14783 BBS2_Mid:  Ciliary BBSome complex subunit 2, middle region
Probab=96.26  E-value=0.28  Score=35.95  Aligned_cols=52  Identities=19%  Similarity=0.178  Sum_probs=32.5

Q ss_pred             eEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEEC---CCC-CeEEEEeCCCCEE
Q 022074          223 KYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSWH---PSQ-PMLVSSSWDGDVV  277 (303)
Q Consensus       223 ~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~s---p~~-~~las~s~Dg~i~  277 (303)
                      ..++.|-++|+|-+|+-..+  +=..+.- ..+.++++.   .|| +-|++|-.+|.+-
T Consensus        54 ~~F~Y~l~NGTVGvY~~~~R--lWRiKSK-~~~~~~~~~D~~gdG~~eLI~GwsnGkve  109 (111)
T PF14783_consen   54 GRFAYALANGTVGVYDRSQR--LWRIKSK-NQVTSMAFYDINGDGVPELIVGWSNGKVE  109 (111)
T ss_pred             CEEEEEecCCEEEEEeCcce--eeeeccC-CCeEEEEEEcCCCCCceEEEEEecCCeEE
Confidence            55888999999999986432  2223322 224554433   333 3699998888775


No 334
>KOG1912 consensus WD40 repeat protein [General function prediction only]
Probab=96.24  E-value=0.066  Score=51.47  Aligned_cols=237  Identities=15%  Similarity=0.140  Sum_probs=117.8

Q ss_pred             EEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccC-CCcEEEEecCCCeEEEEcCccccCC-Cccce
Q 022074           45 LKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDE-SGHLIYSGSDDNLCKVWDRRCLNVK-GKPAG  122 (303)
Q Consensus        45 l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~-~~~~l~s~s~dg~v~lWd~~~~~~~-~~~~~  122 (303)
                      .+|+|.-+.++-...-.-+.|+|++-.+....+.-..+++.-+.+.|+ ..+.|++.-.||.+.+|-.+..... ..+..
T Consensus       236 faf~p~~rn~lfi~~prellv~dle~~~~l~vvpier~~akfv~vlP~~~rd~LfclH~nG~ltirvrk~~~~~f~~~~~  315 (1062)
T KOG1912|consen  236 FAFSPHWRNILFITFPRELLVFDLEYECCLAVVPIERGGAKFVDVLPDPRRDALFCLHSNGRLTIRVRKEEPTEFKKPNA  315 (1062)
T ss_pred             hhcChhhhceEEEEeccceEEEcchhhceeEEEEeccCCcceeEeccCCCcceEEEEecCCeEEEEEeeccCccccccch
Confidence            566776665555556667999999887766554444455666666553 3567889999999999976532111 11111


Q ss_pred             eecc-cccCeEEEE-----------eCCC-CCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccc
Q 022074          123 VLMG-HLEGITFID-----------SRGD-GRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDL  189 (303)
Q Consensus       123 ~~~~-h~~~v~~~~-----------~~~~-~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  189 (303)
                      .+.- -.+.+.++.           ..|. ...++.--.++.+-+|.++..+.......+........   .+......+
T Consensus       316 ~l~~dl~~Q~~~vr~m~~~rp~~~~~cPs~~sa~avl~s~g~~~~w~l~~~ri~~~~~s~~iel~~pf---~f~~~~~~v  392 (1062)
T KOG1912|consen  316 SLSMDLGEQVHVVRPMEEFRPVIGASCPSTPSALAVLYSSGDSTFWQLSNGRIHLDYRSSSIELVLPF---DFNLSTKLV  392 (1062)
T ss_pred             hhccccccceEEEeechhcccceeecCCCChhhhhhhhhcchhHHHhhhcCCcCcccccccccccccc---cccCceeeh
Confidence            1110 011111111           1222 23344444577888999874322111111110000000   000000000


Q ss_pred             cCCCCCcceEEecccceeeeEEEeeeeee-----eC-------CCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEE
Q 022074          190 KHPCDQSVATYKGHSVLRTLIRCHFSPVY-----ST-------GQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRD  257 (303)
Q Consensus       190 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-----s~-------~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~  257 (303)
                      .-...-.+....+|..-..+.+.+..|.+     .|       ...++|.|.+.|+|.++|+.++...+.+..|...|.+
T Consensus       393 ~k~~l~~LS~dg~h~sGs~~~~~~p~p~~t~~~~~p~~n~~~~~~pLvAvGT~sGTV~vvdvst~~v~~~fsvht~~Vkg  472 (1062)
T KOG1912|consen  393 GKTSLISLSDDGSHSSGSTCVRMRPMPELTKVENDPGGNTPAGTVPLVAVGTNSGTVDVVDVSTNAVAASFSVHTSLVKG  472 (1062)
T ss_pred             hhccccchhhcCCCCCCceeeecccCcccceeecCCCCCccceeeeeEEeecCCceEEEEEecchhhhhhhcccccceee
Confidence            00000000000011111111111111111     01       1347888999999999999999988899999999999


Q ss_pred             EEECCCCCeEE---------EEeCCCCEEEeecCCC
Q 022074          258 CSWHPSQPMLV---------SSSWDGDVVRWEFPGN  284 (303)
Q Consensus       258 v~~sp~~~~la---------s~s~Dg~i~~Wd~~~~  284 (303)
                      +.|-...+++-         +++--+.+.+=|+++.
T Consensus       473 leW~g~sslvSfsys~~n~~sg~vrN~l~vtdLrtG  508 (1062)
T KOG1912|consen  473 LEWLGNSSLVSFSYSHVNSASGGVRNDLVVTDLRTG  508 (1062)
T ss_pred             eeeccceeEEEeeeccccccccceeeeEEEEEcccc
Confidence            99965444322         2222345556666544


No 335
>KOG3621 consensus WD40 repeat-containing protein [General function prediction only]
Probab=96.19  E-value=0.062  Score=50.99  Aligned_cols=117  Identities=14%  Similarity=0.128  Sum_probs=81.0

Q ss_pred             ceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEe-cccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCc
Q 022074           41 GIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRIL-AHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGK  119 (303)
Q Consensus        41 ~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~-~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~  119 (303)
                      .|.=-+++..+++++.|+.-|.+++|+-.++.....-. +..+.+.....+ ++..+++.|+..|.|.++-+.. .+...
T Consensus        35 ~v~lTc~dst~~~l~~GsS~G~lyl~~R~~~~~~~~~~~~~~~~~~~~~vs-~~e~lvAagt~~g~V~v~ql~~-~~p~~  112 (726)
T KOG3621|consen   35 RVKLTCVDATEEYLAMGSSAGSVYLYNRHTGEMRKLKNEGATGITCVRSVS-SVEYLVAAGTASGRVSVFQLNK-ELPRD  112 (726)
T ss_pred             eEEEEEeecCCceEEEecccceEEEEecCchhhhcccccCccceEEEEEec-chhHhhhhhcCCceEEeehhhc-cCCCc
Confidence            34445566789999999999999999998886543222 122333444554 4577888888899998886432 11111


Q ss_pred             c--ce-eecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEccc
Q 022074          120 P--AG-VLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRK  159 (303)
Q Consensus       120 ~--~~-~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~  159 (303)
                      .  .. .-..|...|+++.|++++..+.+|..-|+|-+-.|..
T Consensus       113 ~~~~t~~d~~~~~rVTal~Ws~~~~k~ysGD~~Gkv~~~~L~s  155 (726)
T KOG3621|consen  113 LDYVTPCDKSHKCRVTALEWSKNGMKLYSGDSQGKVVLTELDS  155 (726)
T ss_pred             ceeeccccccCCceEEEEEecccccEEeecCCCceEEEEEech
Confidence            1  11 1123778899999999999999999999999876654


No 336
>KOG1645 consensus RING-finger-containing E3 ubiquitin ligase [Posttranslational modification, protein turnover, chaperones]
Probab=96.15  E-value=0.017  Score=51.22  Aligned_cols=93  Identities=14%  Similarity=0.045  Sum_probs=65.8

Q ss_pred             EEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCC-CC
Q 022074           63 IYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGD-GR  141 (303)
Q Consensus        63 v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~-~~  141 (303)
                      ++.++..+-+....+..|...|..++|+|.+.-++..++.+..+++.|++...    ....+..| ..+++++|..+ .+
T Consensus       175 v~~l~~~~fkssq~lp~~g~~IrdlafSp~~~GLl~~asl~nkiki~dlet~~----~vssy~a~-~~~wSC~wDlde~h  249 (463)
T KOG1645|consen  175 VQKLESHDFKSSQILPGEGSFIRDLAFSPFNEGLLGLASLGNKIKIMDLETSC----VVSSYIAY-NQIWSCCWDLDERH  249 (463)
T ss_pred             eEEeccCCcchhhcccccchhhhhhccCccccceeeeeccCceEEEEecccce----eeeheecc-CCceeeeeccCCcc
Confidence            44444444333334556777899999987554478899999999999987332    23334445 67888888765 46


Q ss_pred             EEEEEeCCCcEEEEEcccc
Q 022074          142 YLISNGKDQAIKLWDIRKM  160 (303)
Q Consensus       142 ~l~s~~~D~~v~lWdl~~~  160 (303)
                      +|..|-..|.|.+||+|..
T Consensus       250 ~IYaGl~nG~VlvyD~R~~  268 (463)
T KOG1645|consen  250 VIYAGLQNGMVLVYDMRQP  268 (463)
T ss_pred             eeEEeccCceEEEEEccCC
Confidence            7888889999999999963


No 337
>KOG4640 consensus Anaphase-promoting complex (APC), subunit 4 [Cell cycle control, cell division, chromosome partitioning; Posttranslational modification, protein turnover, chaperones]
Probab=96.09  E-value=0.022  Score=53.25  Aligned_cols=66  Identities=14%  Similarity=0.214  Sum_probs=57.3

Q ss_pred             eeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeE-EEEECCCCCeEEEEeCCCCEEEeecCCC
Q 022074          218 YSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVR-DCSWHPSQPMLVSSSWDGDVVRWEFPGN  284 (303)
Q Consensus       218 ~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~-~v~~sp~~~~las~s~Dg~i~~Wd~~~~  284 (303)
                      ++|.-.++|++.++|.+-++... .+.+-++..|..+++ +++|.|||+.||.|=.||+|++-|+...
T Consensus        28 wnP~~dLiA~~t~~gelli~R~n-~qRlwtip~p~~~v~~sL~W~~DGkllaVg~kdG~I~L~Dve~~   94 (665)
T KOG4640|consen   28 WNPKMDLIATRTEKGELLIHRLN-WQRLWTIPIPGENVTASLCWRPDGKLLAVGFKDGTIRLHDVEKG   94 (665)
T ss_pred             EcCccchhheeccCCcEEEEEec-cceeEeccCCCCccceeeeecCCCCEEEEEecCCeEEEEEccCC
Confidence            44556789999999999999886 677888887888888 9999999999999999999999998643


No 338
>PF08596 Lgl_C:  Lethal giant larvae(Lgl) like, C-terminal;  InterPro: IPR013905  The Lethal giant larvae (Lgl) tumour suppressor protein is conserved from yeast to mammals. The Lgl protein functions in cell polarity, at least in part, by regulating SNARE-mediated membrane delivery events at the cell surface. The N-terminal half of Lgl members contains WD40 repeats (see IPR001680 from INTERPRO), while the C-terminal half appears specific to the protein.; PDB: 2OAJ_A.
Probab=96.04  E-value=0.83  Score=41.75  Aligned_cols=227  Identities=15%  Similarity=0.113  Sum_probs=108.1

Q ss_pred             cceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceE-------------------------------------------EE
Q 022074           40 FGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLS-------------------------------------------LR   76 (303)
Q Consensus        40 ~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~-------------------------------------------~~   76 (303)
                      ..|..++|+++...+++|...|.|-||.....+..                                           ..
T Consensus         2 ~~v~~vs~a~~t~Elav~~~~GeVv~~k~~~n~~~~~~~~~~~~~~~~~~~~~~~~~~~l~di~~r~~~~~~~gf~P~~l   81 (395)
T PF08596_consen    2 VSVTHVSFAPETLELAVGLESGEVVLFKFGKNQNYGNREQPPDLDYNFRRFSLNNSPGKLTDISDRAPPSLKEGFLPLTL   81 (395)
T ss_dssp             --EEEEEEETTTTEEEEEETTS-EEEEEEEE------------------S--GGGSS-SEEE-GGG--TT-SEEEEEEEE
T ss_pred             ceEEEEEecCCCceEEEEccCCcEEEEEcccCCCCCccCCCcccCcccccccccCCCcceEEehhhCCcccccccCchhh
Confidence            46899999999889999999999988844221100                                           01


Q ss_pred             EecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCc-ccee--ec-ccccCeEEEEeC-----CCC---CEEE
Q 022074           77 ILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGK-PAGV--LM-GHLEGITFIDSR-----GDG---RYLI  144 (303)
Q Consensus        77 ~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~-~~~~--~~-~h~~~v~~~~~~-----~~~---~~l~  144 (303)
                      +....+.|++++-+ +-+ .++.|.++|.+.+.|+|.....-. .+..  .. ...+.++++.|.     .|+   -.++
T Consensus        82 ~~~~~g~vtal~~S-~iG-Fvaigy~~G~l~viD~RGPavI~~~~i~~~~~~~~~~~~vt~ieF~vm~~~~D~ySSi~L~  159 (395)
T PF08596_consen   82 LDAKQGPVTALKNS-DIG-FVAIGYESGSLVVIDLRGPAVIYNENIRESFLSKSSSSYVTSIEFSVMTLGGDGYSSICLL  159 (395)
T ss_dssp             E---S-SEEEEEE--BTS-EEEEEETTSEEEEEETTTTEEEEEEEGGG--T-SS----EEEEEEEEEE-TTSSSEEEEEE
T ss_pred             eeccCCcEeEEecC-CCc-EEEEEecCCcEEEEECCCCeEEeeccccccccccccccCeeEEEEEEEecCCCcccceEEE
Confidence            12224678888874 334 788899999999999974321100 0000  00 122345555543     333   5678


Q ss_pred             EEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEE----eeeeeeeC
Q 022074          145 SNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRC----HFSPVYST  220 (303)
Q Consensus       145 s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~----~~~~~~s~  220 (303)
                      .|...|.+.+|.+-... .....     ..+.  ........+      -..+..++...........    ....-.. 
T Consensus       160 vGTn~G~v~~fkIlp~~-~g~f~-----v~~~--~~~~~~~~~------i~~I~~i~~~~G~~a~At~~~~~~l~~g~~-  224 (395)
T PF08596_consen  160 VGTNSGNVLTFKILPSS-NGRFS-----VQFA--GATTNHDSP------ILSIIPINADTGESALATISAMQGLSKGIS-  224 (395)
T ss_dssp             EEETTSEEEEEEEEE-G-GG-EE-----EEEE--EEE--SS----------EEEEEETTT--B-B-BHHHHHGGGGT---
T ss_pred             EEeCCCCEEEEEEecCC-CCceE-----EEEe--eccccCCCc------eEEEEEEECCCCCcccCchhHhhccccCCC-
Confidence            89999999999874211 01000     0000  000000000      0011111111000000000    0000000 


Q ss_pred             CCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEE-----CCCCCeEEEEeCCCCEEEeecCCC
Q 022074          221 GQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSW-----HPSQPMLVSSSWDGDVVRWEFPGN  284 (303)
Q Consensus       221 ~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~-----sp~~~~las~s~Dg~i~~Wd~~~~  284 (303)
                      ...+++ ...+..|+++...+.+..............+.+     ...+..|++-..+|.++++-++.-
T Consensus       225 i~g~vV-vvSe~~irv~~~~~~k~~~K~~~~~~~~~~~~vv~~~~~~~~~~Lv~l~~~G~i~i~SLP~L  292 (395)
T PF08596_consen  225 IPGYVV-VVSESDIRVFKPPKSKGAHKSFDDPFLCSSASVVPTISRNGGYCLVCLFNNGSIRIYSLPSL  292 (395)
T ss_dssp             --EEEE-EE-SSEEEEE-TT---EEEEE-SS-EEEEEEEEEEEE-EEEEEEEEEEETTSEEEEEETTT-
T ss_pred             cCcEEE-EEcccceEEEeCCCCcccceeeccccccceEEEEeecccCCceEEEEEECCCcEEEEECCCc
Confidence            112344 344778999999887765443322223334555     235668999999999999999854


No 339
>KOG4649 consensus PQQ (pyrrolo-quinoline quinone) repeat protein [Secondary metabolites biosynthesis, transport and catabolism]
Probab=95.93  E-value=0.89  Score=38.54  Aligned_cols=62  Identities=23%  Similarity=0.269  Sum_probs=44.6

Q ss_pred             CCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCc
Q 022074           50 DGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRR  112 (303)
Q Consensus        50 ~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~  112 (303)
                      -|++++.|...|.+++.+.++|.....+..- +.|.+-+....++.++..++.|++.+.-|.+
T Consensus        62 vgdfVV~GCy~g~lYfl~~~tGs~~w~f~~~-~~vk~~a~~d~~~glIycgshd~~~yalD~~  123 (354)
T KOG4649|consen   62 VGDFVVLGCYSGGLYFLCVKTGSQIWNFVIL-ETVKVRAQCDFDGGLIYCGSHDGNFYALDPK  123 (354)
T ss_pred             ECCEEEEEEccCcEEEEEecchhheeeeeeh-hhhccceEEcCCCceEEEecCCCcEEEeccc
Confidence            4778999999999999999999654333321 2333333334457788999999999887765


No 340
>PF12894 Apc4_WD40:  Anaphase-promoting complex subunit 4 WD40 domain
Probab=95.91  E-value=0.033  Score=34.12  Aligned_cols=31  Identities=26%  Similarity=0.506  Sum_probs=28.4

Q ss_pred             CCCeEEEEECCCCCeEEEEeCCCCEEEeecC
Q 022074          252 TSPVRDCSWHPSQPMLVSSSWDGDVVRWEFP  282 (303)
Q Consensus       252 ~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~  282 (303)
                      ..+|..++|+|+..+||.+..||.+.++.+.
T Consensus        11 ~~~v~~~~w~P~mdLiA~~t~~g~v~v~Rl~   41 (47)
T PF12894_consen   11 PSRVSCMSWCPTMDLIALGTEDGEVLVYRLN   41 (47)
T ss_pred             CCcEEEEEECCCCCEEEEEECCCeEEEEECC
Confidence            3579999999999999999999999999974


No 341
>PF07433 DUF1513:  Protein of unknown function (DUF1513);  InterPro: IPR008311 There are currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=95.89  E-value=1.1  Score=39.19  Aligned_cols=100  Identities=17%  Similarity=0.192  Sum_probs=62.0

Q ss_pred             EEEEcC-CCCEEEEeeCCCe-EEEEECCCCceEEEEecccCCeE--EEEEccCCCcEEEEec-----CCCeEEEEcCccc
Q 022074           44 SLKFST-DGRELVAGSSDDC-IYVYDLEANKLSLRILAHTSDVN--TVCFGDESGHLIYSGS-----DDNLCKVWDRRCL  114 (303)
Q Consensus        44 ~l~~s~-~g~~l~sgs~Dg~-v~lwd~~~~~~~~~~~~h~~~v~--~l~~~~~~~~~l~s~s-----~dg~v~lWd~~~~  114 (303)
                      .++.+| .+..++.+-.-|+ ..+||..+++....+....+.-.  --+|++ ++++|++.-     ..|.|-+||... 
T Consensus         9 ~~a~~p~~~~avafaRRPG~~~~v~D~~~g~~~~~~~a~~gRHFyGHg~fs~-dG~~LytTEnd~~~g~G~IgVyd~~~-   86 (305)
T PF07433_consen    9 GVAAHPTRPEAVAFARRPGTFALVFDCRTGQLLQRLWAPPGRHFYGHGVFSP-DGRLLYTTENDYETGRGVIGVYDAAR-   86 (305)
T ss_pred             ceeeCCCCCeEEEEEeCCCcEEEEEEcCCCceeeEEcCCCCCEEecCEEEcC-CCCEEEEeccccCCCcEEEEEEECcC-
Confidence            456677 4555666666664 57889999987665544332211  235754 577777653     367899999751 


Q ss_pred             cCCCccceeecccccCeEEEEeCCCCCEEEEEe
Q 022074          115 NVKGKPAGVLMGHLEGITFIDSRGDGRYLISNG  147 (303)
Q Consensus       115 ~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~  147 (303)
                        .-..+..+..|.-.-.-+.+.+||+.|+.+.
T Consensus        87 --~~~ri~E~~s~GIGPHel~l~pDG~tLvVAN  117 (305)
T PF07433_consen   87 --GYRRIGEFPSHGIGPHELLLMPDGETLVVAN  117 (305)
T ss_pred             --CcEEEeEecCCCcChhhEEEcCCCCEEEEEc
Confidence              2234556666655555666788887776654


No 342
>KOG2395 consensus Protein involved in vacuole import and degradation [Intracellular trafficking, secretion, and vesicular transport]
Probab=95.85  E-value=0.23  Score=45.98  Aligned_cols=61  Identities=15%  Similarity=0.183  Sum_probs=47.1

Q ss_pred             CCeEEEEEeCCCeEEEEECCCCeEE-EEeecCCCCeEEEEECCCCCeEEEEeCCCCEEEeecCC
Q 022074          221 GQKYIYTGSHDSCVYVYDLVSGEQV-AALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFPG  283 (303)
Q Consensus       221 ~~~~latg~~dg~i~iwd~~~~~~~-~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~~  283 (303)
                      ...++|.||.+|.||+||- .+... ..+.+-..+|..+..+.+|.+++..+ +..|.+-+...
T Consensus       440 ~sG~IvvgS~~GdIRLYdr-i~~~AKTAlPgLG~~I~hVdvtadGKwil~Tc-~tyLlLi~t~~  501 (644)
T KOG2395|consen  440 ESGYIVVGSLKGDIRLYDR-IGRRAKTALPGLGDAIKHVDVTADGKWILATC-KTYLLLIDTLI  501 (644)
T ss_pred             CCceEEEeecCCcEEeehh-hhhhhhhcccccCCceeeEEeeccCcEEEEec-ccEEEEEEEec
Confidence            3457999999999999997 45443 34677888999999999999887776 45666666543


No 343
>KOG4640 consensus Anaphase-promoting complex (APC), subunit 4 [Cell cycle control, cell division, chromosome partitioning; Posttranslational modification, protein turnover, chaperones]
Probab=95.71  E-value=0.079  Score=49.73  Aligned_cols=71  Identities=14%  Similarity=0.255  Sum_probs=58.8

Q ss_pred             cceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeE-EEEEccCCCcEEEEecCCCeEEEEcCc
Q 022074           40 FGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVN-TVCFGDESGHLIYSGSDDNLCKVWDRR  112 (303)
Q Consensus        40 ~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~-~l~~~~~~~~~l~s~s~dg~v~lWd~~  112 (303)
                      ..+.-+.|+|.-..+|.+..+|.|.+..+. .+..-.+.-|+..++ +++|.| +++.++.|-+||+|++-|..
T Consensus        21 ~~i~~~ewnP~~dLiA~~t~~gelli~R~n-~qRlwtip~p~~~v~~sL~W~~-DGkllaVg~kdG~I~L~Dve   92 (665)
T KOG4640|consen   21 INIKRIEWNPKMDLIATRTEKGELLIHRLN-WQRLWTIPIPGENVTASLCWRP-DGKLLAVGFKDGTIRLHDVE   92 (665)
T ss_pred             cceEEEEEcCccchhheeccCCcEEEEEec-cceeEeccCCCCccceeeeecC-CCCEEEEEecCCeEEEEEcc
Confidence            358889999999999999999999998887 333444554777777 999975 49999999999999999975


No 344
>KOG3621 consensus WD40 repeat-containing protein [General function prediction only]
Probab=95.71  E-value=0.068  Score=50.72  Aligned_cols=66  Identities=23%  Similarity=0.238  Sum_probs=52.5

Q ss_pred             eeCCCeEEEEEeCCCeEEEEECCCCeE---E--EEe-ecCCCCeEEEEECCCCCeEEEEeCCCCEEEeecCC
Q 022074          218 YSTGQKYIYTGSHDSCVYVYDLVSGEQ---V--AAL-KYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFPG  283 (303)
Q Consensus       218 ~s~~~~~latg~~dg~i~iwd~~~~~~---~--~~~-~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~~  283 (303)
                      .|++..++|.|++.|.|.++-+..+.+   +  ... +.|...|++++|++++..|.+|..-|++.+-.+..
T Consensus        84 vs~~e~lvAagt~~g~V~v~ql~~~~p~~~~~~t~~d~~~~~rVTal~Ws~~~~k~ysGD~~Gkv~~~~L~s  155 (726)
T KOG3621|consen   84 VSSVEYLVAAGTASGRVSVFQLNKELPRDLDYVTPCDKSHKCRVTALEWSKNGMKLYSGDSQGKVVLTELDS  155 (726)
T ss_pred             ecchhHhhhhhcCCceEEeehhhccCCCcceeeccccccCCceEEEEEecccccEEeecCCCceEEEEEech
Confidence            355677889999999999998866432   1  111 24778999999999999999999999999887764


No 345
>PRK02888 nitrous-oxide reductase; Validated
Probab=95.68  E-value=0.96  Score=43.36  Aligned_cols=66  Identities=17%  Similarity=0.218  Sum_probs=51.3

Q ss_pred             eeeCCCeEEEEEe-CCCeEEEEECCCCeE------------EEEeecCCCCeEEEEECCCCCeEEEEeCCCCEEEeecCC
Q 022074          217 VYSTGQKYIYTGS-HDSCVYVYDLVSGEQ------------VAALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFPG  283 (303)
Q Consensus       217 ~~s~~~~~latg~-~dg~i~iwd~~~~~~------------~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~~  283 (303)
                      .+||||+++++++ .+.++.|.|+.+.+.            +.+.+.-.+ ....+|.++|+...|--.|..+..|++..
T Consensus       327 ~vSPDGkylyVanklS~tVSVIDv~k~k~~~~~~~~~~~~vvaevevGlG-PLHTaFDg~G~aytslf~dsqv~kwn~~~  405 (635)
T PRK02888        327 NTSPDGKYFIANGKLSPTVTVIDVRKLDDLFDGKIKPRDAVVAEPELGLG-PLHTAFDGRGNAYTTLFLDSQIVKWNIEA  405 (635)
T ss_pred             EECCCCCEEEEeCCCCCcEEEEEChhhhhhhhccCCccceEEEeeccCCC-cceEEECCCCCEEEeEeecceeEEEehHH
Confidence            4688999877765 699999999987653            344443233 45789999999999999999999999864


No 346
>PF04053 Coatomer_WDAD:  Coatomer WD associated region ;  InterPro: IPR006692 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits.  This entry represents the WD-associated region found in coatomer subunits alpha, beta and beta' subunits. 
The alpha-subunit (RET1P) of the coatomer complex in Saccharomyces cerevisiae (Baker's yeast), participates in membrane transport between the endoplasmic reticulum and Golgi apparatus. The protein contains six WD-40 repeat motifs in its N-terminal region []. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030117 membrane coat; PDB: 3MKQ_B.
Probab=95.64  E-value=1.1  Score=41.68  Aligned_cols=56  Identities=14%  Similarity=0.309  Sum_probs=32.4

Q ss_pred             CeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEECCCCCeEEEEeCCCCEEEeec
Q 022074          222 QKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEF  281 (303)
Q Consensus       222 ~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~  281 (303)
                      |.+|...+.+ .|.+||+.+++.+..+...  +|..+.||+++.++|-.+.+ ++.+.+.
T Consensus       117 G~LL~~~~~~-~i~~yDw~~~~~i~~i~v~--~vk~V~Ws~~g~~val~t~~-~i~il~~  172 (443)
T PF04053_consen  117 GNLLGVKSSD-FICFYDWETGKLIRRIDVS--AVKYVIWSDDGELVALVTKD-SIYILKY  172 (443)
T ss_dssp             SSSEEEEETT-EEEEE-TTT--EEEEESS---E-EEEEE-TTSSEEEEE-S--SEEEEEE
T ss_pred             CcEEEEECCC-CEEEEEhhHcceeeEEecC--CCcEEEEECCCCEEEEEeCC-eEEEEEe
Confidence            4444444433 6888888888877777532  47888888888887777644 6666654


No 347
>KOG2114 consensus Vacuolar assembly/sorting protein PEP5/VPS11 [Intracellular trafficking, secretion, and vesicular transport]
Probab=95.56  E-value=1.7  Score=42.62  Aligned_cols=122  Identities=19%  Similarity=0.266  Sum_probs=74.6

Q ss_pred             CCcccceEEEEEcCCC-CEEEEeeCCC-----eEEEEECCCCc-----eE---EEEecc-----cCCeEEEEEccCCCcE
Q 022074           36 GGYSFGIFSLKFSTDG-RELVAGSSDD-----CIYVYDLEANK-----LS---LRILAH-----TSDVNTVCFGDESGHL   96 (303)
Q Consensus        36 ~~~~~~v~~l~~s~~g-~~l~sgs~Dg-----~v~lwd~~~~~-----~~---~~~~~h-----~~~v~~l~~~~~~~~~   96 (303)
                      .+|..++...-+..++ ++|++-+.|+     .++||+++.-+     ..   .++..|     ..+++.++.+ .+-+.
T Consensus        61 qa~~~siv~~L~~~~~~~~L~sv~Ed~~~np~llkiw~lek~~~n~sP~c~~~~ri~~~~np~~~~p~s~l~Vs-~~l~~  139 (933)
T KOG2114|consen   61 QAYEQSIVQFLYILNKQNFLFSVGEDEQGNPVLLKIWDLEKVDKNNSPQCLYEHRIFTIKNPTNPSPASSLAVS-EDLKT  139 (933)
T ss_pred             eecchhhhhHhhcccCceEEEEEeecCCCCceEEEEecccccCCCCCcceeeeeeeeccCCCCCCCcceEEEEE-ccccE
Confidence            4566664444455555 5777766655     48999986531     11   133332     2467778885 44778


Q ss_pred             EEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcc
Q 022074           97 IYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIR  158 (303)
Q Consensus        97 l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~  158 (303)
                      +++|-.||.|..+..+.....+........-.++|+.+.+..++...+-...-..|.+|.+.
T Consensus       140 Iv~Gf~nG~V~~~~GDi~RDrgsr~~~~~~~~~pITgL~~~~d~~s~lFv~Tt~~V~~y~l~  201 (933)
T KOG2114|consen  140 IVCGFTNGLVICYKGDILRDRGSRQDYSHRGKEPITGLALRSDGKSVLFVATTEQVMLYSLS  201 (933)
T ss_pred             EEEEecCcEEEEEcCcchhccccceeeeccCCCCceeeEEecCCceeEEEEecceeEEEEec
Confidence            89999999999986432111122122222234689999998888774444445778888775


No 348
>KOG2395 consensus Protein involved in vacuole import and degradation [Intracellular trafficking, secretion, and vesicular transport]
Probab=95.07  E-value=0.31  Score=45.15  Aligned_cols=151  Identities=17%  Similarity=0.232  Sum_probs=85.0

Q ss_pred             CCcccceEE-EEEcCCCCE-EEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCC------CcEEEEecCCCeEE
Q 022074           36 GGYSFGIFS-LKFSTDGRE-LVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDES------GHLIYSGSDDNLCK  107 (303)
Q Consensus        36 ~~~~~~v~~-l~~s~~g~~-l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~------~~~l~s~s~dg~v~  107 (303)
                      .||+..-.. +....+.+. +.++..-..++=.|+++|+.+..+.-+.. |+-+.+.|+.      +..-+.|-.|..|.
T Consensus       329 ~g~S~~P~K~mL~~~dsnlil~~~~~~~~l~klDIE~GKIVeEWk~~~d-i~mv~~t~d~K~~Ql~~e~TlvGLs~n~vf  407 (644)
T KOG2395|consen  329 DGKSIDPHKAMLHRADSNLILMDGGEQDKLYKLDIERGKIVEEWKFEDD-INMVDITPDFKFAQLTSEQTLVGLSDNSVF  407 (644)
T ss_pred             CccccCcchhhhhccccceEeeCCCCcCcceeeecccceeeeEeeccCC-cceeeccCCcchhcccccccEEeecCCceE
Confidence            455533332 333334443 44455555688889999999888877766 6666665431      11223455577888


Q ss_pred             EEcCccccCCCccceeecccc----cCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCC
Q 022074          108 VWDRRCLNVKGKPAGVLMGHL----EGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYP  183 (303)
Q Consensus       108 lWd~~~~~~~~~~~~~~~~h~----~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~  183 (303)
                      -||.|.....  .+...++|.    ....|.+...+| +++.||.+|.|||||-- +. .  ....++.+.-++..+...
T Consensus       408 riDpRv~~~~--kl~~~q~kqy~~k~nFsc~aTT~sG-~IvvgS~~GdIRLYdri-~~-~--AKTAlPgLG~~I~hVdvt  480 (644)
T KOG2395|consen  408 RIDPRVQGKN--KLAVVQSKQYSTKNNFSCFATTESG-YIVVGSLKGDIRLYDRI-GR-R--AKTALPGLGDAIKHVDVT  480 (644)
T ss_pred             EecccccCcc--eeeeeeccccccccccceeeecCCc-eEEEeecCCcEEeehhh-hh-h--hhhcccccCCceeeEEee
Confidence            8998843321  233333442    235566655555 89999999999999852 11 1  112233333344444444


Q ss_pred             CCCccccCCCC
Q 022074          184 PQARDLKHPCD  194 (303)
Q Consensus       184 ~~~~~~~~~~~  194 (303)
                      .+++.+...|+
T Consensus       481 adGKwil~Tc~  491 (644)
T KOG2395|consen  481 ADGKWILATCK  491 (644)
T ss_pred             ccCcEEEEecc
Confidence            45555544444


No 349
>PF15492 Nbas_N:  Neuroblastoma-amplified sequence, N terminal
Probab=94.90  E-value=0.71  Score=39.40  Aligned_cols=35  Identities=23%  Similarity=0.409  Sum_probs=30.7

Q ss_pred             cccCeEEEEeCCCCCEEEEEeCCCcEEEEEccccc
Q 022074          127 HLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMS  161 (303)
Q Consensus       127 h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~  161 (303)
                      ..+.|..+..+|||+.|++...+|.|.+|++-.++
T Consensus       228 ~~d~i~kmSlSPdg~~La~ih~sG~lsLW~iPsL~  262 (282)
T PF15492_consen  228 EQDGIFKMSLSPDGSLLACIHFSGSLSLWEIPSLR  262 (282)
T ss_pred             CCCceEEEEECCCCCEEEEEEcCCeEEEEecCcch
Confidence            35678899999999999999999999999986544


No 350
>PF04841 Vps16_N:  Vps16, N-terminal region;  InterPro: IPR006926 This protein forms part of the Class C vacuolar protein sorting (Vps) complex. Vps16 is essential for vacuolar protein sorting, which is essential for viability in plants, but not yeast. The Class C Vps complex is required for SNARE-mediated membrane fusion at the lysosome-like yeast vacuole. It is thought to play essential roles in membrane docking and fusion at the Golgi-to-endosome and endosome-to-vacuole stages of transport. The role of VPS16 in this complex is not known.; GO: 0006886 intracellular protein transport, 0005737 cytoplasm
Probab=94.87  E-value=3.1  Score=38.27  Aligned_cols=49  Identities=16%  Similarity=0.276  Sum_probs=36.0

Q ss_pred             eeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecC-CCCeEEEEECCCCC
Q 022074          217 VYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYH-TSPVRDCSWHPSQP  265 (303)
Q Consensus       217 ~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h-~~~I~~v~~sp~~~  265 (303)
                      ..||+++++|.-..+|.+.+.+.+-.+.+.++... ..+...+.|--+..
T Consensus       223 avSpng~~iAl~t~~g~l~v~ssDf~~~~~e~~~~~~~~p~~~~WCG~da  272 (410)
T PF04841_consen  223 AVSPNGKFIALFTDSGNLWVVSSDFSEKLCEFDTDSKSPPKQMAWCGNDA  272 (410)
T ss_pred             EECCCCCEEEEEECCCCEEEEECcccceeEEeecCcCCCCcEEEEECCCc
Confidence            46889999999999999999987766666666544 34556777755543


No 351
>PF03178 CPSF_A:  CPSF A subunit region;  InterPro: IPR004871 This family includes a region that lies towards the C terminus of the cleavage and polyadenylation specificity factor (CPSF) A (160 kDa) subunit. CPSF is involved in mRNA polyadenylation and binds the AAUAAA conserved sequence in pre-mRNA. CPSF has also been found to be necessary for splicing of single-intron pre-mRNAs. The function of the aligned region is unknown but may be involved in RNA/DNA binding.; GO: 0003676 nucleic acid binding, 0005634 nucleus; PDB: 2B5M_A 4A0K_C 4A0B_C 3I7L_A 3I8E_A 4A09_A 4A0A_A 3EI4_C 2B5L_A 3I7O_A ....
Probab=94.82  E-value=2.7  Score=37.18  Aligned_cols=178  Identities=13%  Similarity=0.169  Sum_probs=95.0

Q ss_pred             eEEEEECCCCceEEEEe-cccCCeEEEE---EccC---CCcEEEEecC----------CCeEEEEcCccccCCCccceee
Q 022074           62 CIYVYDLEANKLSLRIL-AHTSDVNTVC---FGDE---SGHLIYSGSD----------DNLCKVWDRRCLNVKGKPAGVL  124 (303)
Q Consensus        62 ~v~lwd~~~~~~~~~~~-~h~~~v~~l~---~~~~---~~~~l~s~s~----------dg~v~lWd~~~~~~~~~~~~~~  124 (303)
                      .|+|.|..+......+. ..+..+.+++   +..+   ..++++.|..          .|.+.++++.............
T Consensus         3 ~i~l~d~~~~~~~~~~~l~~~E~~~s~~~~~l~~~~~~~~~~ivVGT~~~~~~~~~~~~Gri~v~~i~~~~~~~~~l~~i   82 (321)
T PF03178_consen    3 SIRLVDPTTFEVLDSFELEPNEHVTSLCSVKLKGDSTGKKEYIVVGTAFNYGEDPEPSSGRILVFEISESPENNFKLKLI   82 (321)
T ss_dssp             EEEEEETTTSSEEEEEEEETTEEEEEEEEEEETTS---SSEEEEEEEEE--TTSSS-S-EEEEEEEECSS-----EEEEE
T ss_pred             EEEEEeCCCCeEEEEEECCCCceEEEEEEEEEcCccccccCEEEEEecccccccccccCcEEEEEEEEcccccceEEEEE
Confidence            57888887776554322 2233445443   3211   1456665542          2889999875320001112211


Q ss_pred             --cccccCeEEEEeCCCCCEEEEEeCCCcEEEEEccccc-CCcccccCccceeeeceeeeCCCCCccccCCCCCcceEEe
Q 022074          125 --MGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMS-SNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYK  201 (303)
Q Consensus       125 --~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~-~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  201 (303)
                        ....++|.++... .+ +|+.+ .++.|.+|++.... ..                                ..+.+.
T Consensus        83 ~~~~~~g~V~ai~~~-~~-~lv~~-~g~~l~v~~l~~~~~l~--------------------------------~~~~~~  127 (321)
T PF03178_consen   83 HSTEVKGPVTAICSF-NG-RLVVA-VGNKLYVYDLDNSKTLL--------------------------------KKAFYD  127 (321)
T ss_dssp             EEEEESS-EEEEEEE-TT-EEEEE-ETTEEEEEEEETTSSEE--------------------------------EEEEE-
T ss_pred             EEEeecCcceEhhhh-CC-EEEEe-ecCEEEEEEccCcccch--------------------------------hhheec
Confidence              1235678887655 34 34333 34899999987533 00                                011111


Q ss_pred             cccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCC-CeEEEEee--cCCCCeEEEEECCCCCeEEEEeCCCCEEE
Q 022074          202 GHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVS-GEQVAALK--YHTSPVRDCSWHPSQPMLVSSSWDGDVVR  278 (303)
Q Consensus       202 ~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~-~~~~~~~~--~h~~~I~~v~~sp~~~~las~s~Dg~i~~  278 (303)
                      ...... .+        ...+.+++.|..-+.+.++..+. .+.+..+.  ....+++++.|-++++.++.++.+|++.+
T Consensus       128 ~~~~i~-sl--------~~~~~~I~vgD~~~sv~~~~~~~~~~~l~~va~d~~~~~v~~~~~l~d~~~~i~~D~~gnl~~  198 (321)
T PF03178_consen  128 SPFYIT-SL--------SVFKNYILVGDAMKSVSLLRYDEENNKLILVARDYQPRWVTAAEFLVDEDTIIVGDKDGNLFV  198 (321)
T ss_dssp             BSSSEE-EE--------EEETTEEEEEESSSSEEEEEEETTTE-EEEEEEESS-BEEEEEEEE-SSSEEEEEETTSEEEE
T ss_pred             ceEEEE-EE--------eccccEEEEEEcccCEEEEEEEccCCEEEEEEecCCCccEEEEEEecCCcEEEEEcCCCeEEE
Confidence            111000 01        11145788898888888875443 33333332  23456899999877789999999999999


Q ss_pred             eecCC
Q 022074          279 WEFPG  283 (303)
Q Consensus       279 Wd~~~  283 (303)
                      +..+.
T Consensus       199 l~~~~  203 (321)
T PF03178_consen  199 LRYNP  203 (321)
T ss_dssp             EEE-S
T ss_pred             EEECC
Confidence            99864


No 352
>KOG3617 consensus WD40 and TPR repeat-containing protein [General function prediction only]
Probab=94.79  E-value=0.034  Score=53.86  Aligned_cols=65  Identities=15%  Similarity=0.309  Sum_probs=57.2

Q ss_pred             eeCCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEECCCCCeEEEEeCCCCEEEeecC
Q 022074          218 YSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFP  282 (303)
Q Consensus       218 ~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~  282 (303)
                      ++|..-.|+.|-+-|.+.+|...+.+.-.....|..+|..+.|||+|..|.|+..=|.+.+|...
T Consensus        67 WHpe~~vLa~gwe~g~~~v~~~~~~e~htv~~th~a~i~~l~wS~~G~~l~t~d~~g~v~lwr~d  131 (1416)
T KOG3617|consen   67 WHPEEFVLAQGWEMGVSDVQKTNTTETHTVVETHPAPIQGLDWSHDGTVLMTLDNPGSVHLWRYD  131 (1416)
T ss_pred             cChHHHHHhhccccceeEEEecCCceeeeeccCCCCCceeEEecCCCCeEEEcCCCceeEEEEee
Confidence            45666678889999999999988877766667899999999999999999999999999999865


No 353
>KOG2079 consensus Vacuolar assembly/sorting protein VPS8 [Intracellular trafficking, secretion, and vesicular transport]
Probab=94.40  E-value=0.17  Score=50.45  Aligned_cols=69  Identities=23%  Similarity=0.334  Sum_probs=53.9

Q ss_pred             cceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEE---EEccCCCcEEEEecCCCeEEEEcC
Q 022074           40 FGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTV---CFGDESGHLIYSGSDDNLCKVWDR  111 (303)
Q Consensus        40 ~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l---~~~~~~~~~l~s~s~dg~v~lWd~  111 (303)
                      +||.+++|+.+|+.++.|=.+|.|.+||+.+++....+..|..++..+   .+.. .+..++++...|.  +|.+
T Consensus       131 ~~Vtsvafn~dg~~l~~G~~~G~V~v~D~~~~k~l~~i~e~~ap~t~vi~v~~t~-~nS~llt~D~~Gs--f~~l  202 (1206)
T KOG2079|consen  131 GPVTSVAFNQDGSLLLAGLGDGHVTVWDMHRAKILKVITEHGAPVTGVIFVGRTS-QNSKLLTSDTGGS--FWKL  202 (1206)
T ss_pred             CcceeeEecCCCceeccccCCCcEEEEEccCCcceeeeeecCCccceEEEEEEeC-CCcEEEEccCCCc--eEEE
Confidence            589999999999999999999999999999998877766666554444   3433 3457888888886  4653


No 354
>PRK11138 outer membrane biogenesis protein BamB; Provisional
Probab=94.10  E-value=4.5  Score=36.86  Aligned_cols=103  Identities=10%  Similarity=0.063  Sum_probs=56.7

Q ss_pred             CCCEEEEeeCCCeEEEEECCCCceEEEEeccc-C---------Ce-EEEEEccCCCcEEEEecCCCeEEEEcCccccCCC
Q 022074           50 DGRELVAGSSDDCIYVYDLEANKLSLRILAHT-S---------DV-NTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKG  118 (303)
Q Consensus        50 ~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~-~---------~v-~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~  118 (303)
                      .+..+++++.+|.+.-+|.++|+..-+..-.. .         .+ .....   .+..++.++.++.+.-+|.+..+...
T Consensus        68 ~~~~vy~~~~~g~l~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~v---~~~~v~v~~~~g~l~ald~~tG~~~W  144 (394)
T PRK11138         68 AYNKVYAADRAGLVKALDADTGKEIWSVDLSEKDGWFSKNKSALLSGGVTV---AGGKVYIGSEKGQVYALNAEDGEVAW  144 (394)
T ss_pred             ECCEEEEECCCCeEEEEECCCCcEeeEEcCCCcccccccccccccccccEE---ECCEEEEEcCCCEEEEEECCCCCCcc
Confidence            36678888889999999999987653322111 0         00 01111   13456677788999989876443322


Q ss_pred             ccceeecccccCeEE-EEeCCCCCEEEEEeCCCcEEEEEcccccC
Q 022074          119 KPAGVLMGHLEGITF-IDSRGDGRYLISNGKDQAIKLWDIRKMSS  162 (303)
Q Consensus       119 ~~~~~~~~h~~~v~~-~~~~~~~~~l~s~~~D~~v~lWdl~~~~~  162 (303)
                      +..  +.+   .+.+ ..+  .+..++.+..++.+..+|....+.
T Consensus       145 ~~~--~~~---~~~ssP~v--~~~~v~v~~~~g~l~ald~~tG~~  182 (394)
T PRK11138        145 QTK--VAG---EALSRPVV--SDGLVLVHTSNGMLQALNESDGAV  182 (394)
T ss_pred             ccc--CCC---ceecCCEE--ECCEEEEECCCCEEEEEEccCCCE
Confidence            111  111   1110 001  133566667778888888765443


No 355
>KOG4499 consensus Ca2+-binding protein Regucalcin/SMP30 [Inorganic ion transport and metabolism; Signal transduction mechanisms]
Probab=93.87  E-value=3.4  Score=34.63  Aligned_cols=51  Identities=14%  Similarity=0.239  Sum_probs=40.8

Q ss_pred             CCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEECCC--CCeEEEEe
Q 022074          221 GQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSWHPS--QPMLVSSS  271 (303)
Q Consensus       221 ~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~--~~~las~s  271 (303)
                      +|.+.++.-..++|...|..+|+.+.+++-....|++++|--.  .-+.+|+.
T Consensus       222 eG~L~Va~~ng~~V~~~dp~tGK~L~eiklPt~qitsccFgGkn~d~~yvT~a  274 (310)
T KOG4499|consen  222 EGNLYVATFNGGTVQKVDPTTGKILLEIKLPTPQITSCCFGGKNLDILYVTTA  274 (310)
T ss_pred             CCcEEEEEecCcEEEEECCCCCcEEEEEEcCCCceEEEEecCCCccEEEEEeh
Confidence            4677777888899999999999999999989999999999543  22444444


No 356
>PF07433 DUF1513:  Protein of unknown function (DUF1513);  InterPro: IPR008311 There are currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=93.87  E-value=0.2  Score=43.68  Aligned_cols=55  Identities=20%  Similarity=0.316  Sum_probs=46.1

Q ss_pred             eeeCCCeEEEEE-----eCCCeEEEEECC-CCeEEEEeecCCCCeEEEEECCCCCeEEEEe
Q 022074          217 VYSTGQKYIYTG-----SHDSCVYVYDLV-SGEQVAALKYHTSPVRDCSWHPSQPMLVSSS  271 (303)
Q Consensus       217 ~~s~~~~~latg-----~~dg~i~iwd~~-~~~~~~~~~~h~~~I~~v~~sp~~~~las~s  271 (303)
                      +||+||++|++-     ...|.|-|||.. +.+.+.++..|..-..++.+.||++.|+.+-
T Consensus        57 ~fs~dG~~LytTEnd~~~g~G~IgVyd~~~~~~ri~E~~s~GIGPHel~l~pDG~tLvVAN  117 (305)
T PF07433_consen   57 VFSPDGRLLYTTENDYETGRGVIGVYDAARGYRRIGEFPSHGIGPHELLLMPDGETLVVAN  117 (305)
T ss_pred             EEcCCCCEEEEeccccCCCcEEEEEEECcCCcEEEeEecCCCcChhhEEEcCCCCEEEEEc
Confidence            488999988884     457899999998 6677889998888889999999998777763


No 357
>PHA02713 hypothetical protein; Provisional
Probab=93.59  E-value=4.7  Score=38.76  Aligned_cols=60  Identities=7%  Similarity=0.184  Sum_probs=36.8

Q ss_pred             CCeEEEEEeCC------CeEEEEECCC-C--eEEEEeecCCCCeEEEEECCCCCeEEEEeCCC--CEEEeecC
Q 022074          221 GQKYIYTGSHD------SCVYVYDLVS-G--EQVAALKYHTSPVRDCSWHPSQPMLVSSSWDG--DVVRWEFP  282 (303)
Q Consensus       221 ~~~~latg~~d------g~i~iwd~~~-~--~~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg--~i~~Wd~~  282 (303)
                      +++..+.||.+      ..+..||..+ .  +.+..+.........+.+  ++.+.+.||.|+  .+..+|+.
T Consensus       463 ~~~IYv~GG~~~~~~~~~~ve~Ydp~~~~~W~~~~~m~~~r~~~~~~~~--~~~iyv~Gg~~~~~~~e~yd~~  533 (557)
T PHA02713        463 KDDIYVVCDIKDEKNVKTCIFRYNTNTYNGWELITTTESRLSALHTILH--DNTIMMLHCYESYMLQDTFNVY  533 (557)
T ss_pred             CCEEEEEeCCCCCCccceeEEEecCCCCCCeeEccccCcccccceeEEE--CCEEEEEeeecceeehhhcCcc
Confidence            45666667643      2477899886 3  344444433333333334  788899999888  56666654


No 358
>PRK13616 lipoprotein LpqB; Provisional
Probab=93.37  E-value=1.5  Score=42.40  Aligned_cols=60  Identities=8%  Similarity=0.058  Sum_probs=36.7

Q ss_pred             eeeeeCCCeEEEEEeC------------CCeEEEEECCCCeEEEEeecCCCCeEEEEECCCCCeEEEEeCCCCEEE
Q 022074          215 SPVYSTGQKYIYTGSH------------DSCVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVR  278 (303)
Q Consensus       215 ~~~~s~~~~~latg~~------------dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~  278 (303)
                      .|.|+|+|+.+++...            .+.+++.++..++...   .....|.++.|||||..+|-.. ++.+.+
T Consensus       401 ~PsWspDG~~lw~v~dg~~~~~v~~~~~~gql~~~~vd~ge~~~---~~~g~Issl~wSpDG~RiA~i~-~g~v~V  472 (591)
T PRK13616        401 RPSWSLDADAVWVVVDGNTVVRVIRDPATGQLARTPVDASAVAS---RVPGPISELQLSRDGVRAAMII-GGKVYL  472 (591)
T ss_pred             CceECCCCCceEEEecCcceEEEeccCCCceEEEEeccCchhhh---ccCCCcCeEEECCCCCEEEEEE-CCEEEE
Confidence            4567777666655532            2344444554444322   2345799999999999777655 466655


No 359
>PF12234 Rav1p_C:  RAVE protein 1 C terminal;  InterPro: IPR022033  This domain family is found in eukaryotes, and is typically between 621 and 644 amino acids in length. This family is the C-terminal region of the protein RAVE (regulator of the ATPase of vacuolar and endosomal membranes). Rav1p is involved in regulating the glucose dependent assembly and disassembly of vacuolar ATPase V1 and V0 subunits. 
Probab=93.36  E-value=2.1  Score=41.39  Aligned_cols=114  Identities=13%  Similarity=0.112  Sum_probs=73.3

Q ss_pred             eEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEE-ecccCCeEEEEEc-cCCCcEEEEecCCCeEEEEcCc-----cc
Q 022074           42 IFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRI-LAHTSDVNTVCFG-DESGHLIYSGSDDNLCKVWDRR-----CL  114 (303)
Q Consensus        42 v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~-~~h~~~v~~l~~~-~~~~~~l~s~s~dg~v~lWd~~-----~~  114 (303)
                      ..-+.-|.-++..++-+...++.|||.+.+.+..+. ....+.|.++.|. .++++.+++.+....|.++.-.     ..
T Consensus        32 ~~li~gss~~k~a~V~~~~~~LtIWD~~~~~lE~~~~f~~~~~I~dLDWtst~d~qsiLaVGf~~~v~l~~Q~R~dy~~~  111 (631)
T PF12234_consen   32 PSLISGSSIKKIAVVDSSRSELTIWDTRSGVLEYEESFSEDDPIRDLDWTSTPDGQSILAVGFPHHVLLYTQLRYDYTNK  111 (631)
T ss_pred             cceEeecccCcEEEEECCCCEEEEEEcCCcEEEEeeeecCCCceeeceeeecCCCCEEEEEEcCcEEEEEEccchhhhcC
Confidence            444555666776666666667999999998765432 2456789999994 3568888899999999998531     01


Q ss_pred             cCCCcccee--ecccc-cCeEEEEeCCCCCEEEEEeCCCcEEEEEc
Q 022074          115 NVKGKPAGV--LMGHL-EGITFIDSRGDGRYLISNGKDQAIKLWDI  157 (303)
Q Consensus       115 ~~~~~~~~~--~~~h~-~~v~~~~~~~~~~~l~s~~~D~~v~lWdl  157 (303)
                      .+...++..  +..|+ .+|....|.++|.+++.+|  ..+.++|-
T Consensus       112 ~p~w~~i~~i~i~~~T~h~Igds~Wl~~G~LvV~sG--Nqlfv~dk  155 (631)
T PF12234_consen  112 GPSWAPIRKIDISSHTPHPIGDSIWLKDGTLVVGSG--NQLFVFDK  155 (631)
T ss_pred             CcccceeEEEEeecCCCCCccceeEecCCeEEEEeC--CEEEEECC
Confidence            111112221  23344 4577777888886555443  67888864


No 360
>PF00930 DPPIV_N:  Dipeptidyl peptidase IV (DPP IV) N-terminal region;  InterPro: IPR002469 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes.
They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases. Notwithstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base. The geometric orientations of the catalytic residues are similar between families, despite different protein folds. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example, the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC). This domain defines serine peptidases belonging to MEROPS peptidase family S9 (clan SC), subfamily S9B (dipeptidyl-peptidase IV). The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. This domain is an alignment of the region to the N-terminal side of the active site, which is found in IPR001375 from INTERPRO. CD26 (3.4.14.5 from EC) is also called adenosine deaminase-binding protein (ADA-binding protein) or dipeptidylpeptidase IV (DPP IV ectoenzyme). The exopeptidase cleaves off N-terminal X-Pro or X-Ala dipeptides from polypeptides (dipeptidyl peptidase IV activity).
CD26 serves as the costimulatory molecule in T cell activation and is an associated marker of autoimmune diseases, adenosine deaminase-deficiency and HIV pathogenesis. Dipeptidyl peptidase IV (DPP IV) is responsible for the removal of N-terminal dipeptides sequentially from polypeptides having unsubstituted N termini, provided that the penultimate residue is proline. The enzyme catalyses the reaction: Dipeptidyl-Polypeptide + H(2)O = Dipeptide + Polypeptide. It is a type II membrane protein that forms a homodimer. CD molecules are leucocyte antigens on cell surfaces. CD antigen nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/).; GO: 0006508 proteolysis, 0016020 membrane; PDB: 2RIP_A 3Q8W_B 2AJL_I 1TKR_B 1TK3_B 3C45_A 2G5P_A 3G0C_D 1R9M_C 1RWQ_A ....
Probab=93.12  E-value=6.3  Score=35.39  Aligned_cols=107  Identities=17%  Similarity=0.255  Sum_probs=60.2

Q ss_pred             cCCCCEEEEee---------CCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccC--
Q 022074           48 STDGRELVAGS---------SDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNV--  116 (303)
Q Consensus        48 s~~g~~l~sgs---------~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~--  116 (303)
                      |||+++++...         ..+.+.|+|+.+++... +......+....|+|+ ++.++-.. ++.|.+++......  
T Consensus         1 S~d~~~~l~~~~~~~~~r~s~~~~y~i~d~~~~~~~~-l~~~~~~~~~~~~sP~-g~~~~~v~-~~nly~~~~~~~~~~~   77 (353)
T PF00930_consen    1 SPDGKFVLFATNYTKQWRHSFKGDYYIYDIETGEITP-LTPPPPKLQDAKWSPD-GKYIAFVR-DNNLYLRDLATGQETQ   77 (353)
T ss_dssp             -TTSSEEEEEEEEEEESSSEEEEEEEEEETTTTEEEE-SS-EETTBSEEEE-SS-STEEEEEE-TTEEEEESSTTSEEEE
T ss_pred             CCCCCeEEEEECcEEeeeeccceeEEEEecCCCceEE-CcCCccccccceeecC-CCeeEEEe-cCceEEEECCCCCeEE
Confidence            57888777742         34578999999986543 3333567888899865 66766554 56888887532100  


Q ss_pred             ---CCccceeecccc---------cCeEEEEeCCCCCEEEEEe-CCCcEEEEEcc
Q 022074          117 ---KGKPAGVLMGHL---------EGITFIDSRGDGRYLISNG-KDQAIKLWDIR  158 (303)
Q Consensus       117 ---~~~~~~~~~~h~---------~~v~~~~~~~~~~~l~s~~-~D~~v~lWdl~  158 (303)
                         .+ ....+.|-.         +.-..+-|+||+++|+... .+..|+.+.+-
T Consensus        78 lT~dg-~~~i~nG~~dwvyeEEv~~~~~~~~WSpd~~~la~~~~d~~~v~~~~~~  131 (353)
T PF00930_consen   78 LTTDG-EPGIYNGVPDWVYEEEVFDRRSAVWWSPDSKYLAFLRFDEREVPEYPLP  131 (353)
T ss_dssp             SES---TTTEEESB--HHHHHHTSSSSBSEEE-TTSSEEEEEEEE-TTS-EEEEE
T ss_pred             ecccc-ceeEEcCccceeccccccccccceEECCCCCEEEEEEECCcCCceEEee
Confidence               01 001111111         1223566899999988765 45667776553


No 361
>PF02897 Peptidase_S9_N:  Prolyl oligopeptidase, N-terminal beta-propeller domain;  InterPro: IPR004106 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes.
They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases. Notwithstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base. The geometric orientations of the catalytic residues are similar between families, despite different protein folds. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example, the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC). This entry represents the beta-propeller domain found at the N terminus of prolyl oligopeptidase, including acylamino-acid-releasing enzyme (also known as acylaminoacyl peptidase), which belong to the MEROPS peptidase family S9 (clan SC), subfamily S9A. The prolyl oligopeptidase family consists of a number of evolutionarily related peptidases whose catalytic activity seems to be provided by a charge relay system similar to that of the trypsin family of serine proteases, but which evolved by independent convergent evolution. The N-terminal domain of prolyl oligopeptidases forms an unusual 7-bladed beta-propeller consisting of seven 4-stranded beta-sheet motifs.
Prolyl oligopeptidase is a large cytosolic enzyme involved in the maturation and degradation of peptide hormones and neuropeptides, which relate to the induction of amnesia. The enzyme contains a peptidase domain, where its catalytic triad (Ser554, His680, Asp641) is covered by the central tunnel of the N-terminal beta-propeller domain. In this way, large structured peptides are excluded from the active site, thereby protecting larger peptides and proteins from proteolysis in the cytosol. The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. Mammalian acylaminoacyl peptidase is an exopeptidase that is a member of the same prolyl oligopeptidase family of serine peptidases. This enzyme removes acylated amino acid residues from the N terminus of oligopeptides.; GO: 0004252 serine-type endopeptidase activity, 0006508 proteolysis; PDB: 2BKL_B 3DDU_A 1YR2_A 2XE4_A 1VZ3_A 3EQ9_A 1O6F_A 3EQ7_A 4AN0_A 1UOP_A ....
Probab=93.11  E-value=3  Score=38.20  Aligned_cols=117  Identities=20%  Similarity=0.254  Sum_probs=64.5

Q ss_pred             ccceEEEEEcCCCCEEEEe-eCCC----eEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCC----------
Q 022074           39 SFGIFSLKFSTDGRELVAG-SSDD----CIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDD----------  103 (303)
Q Consensus        39 ~~~v~~l~~s~~g~~l~sg-s~Dg----~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~d----------  103 (303)
                      ...+....+||||+++|-+ +..|    +++++|+++++........... ..+.|.++ ++.|+-...+          
T Consensus       123 ~~~~~~~~~Spdg~~la~~~s~~G~e~~~l~v~Dl~tg~~l~d~i~~~~~-~~~~W~~d-~~~~~y~~~~~~~~~~~~~~  200 (414)
T PF02897_consen  123 YVSLGGFSVSPDGKRLAYSLSDGGSEWYTLRVFDLETGKFLPDGIENPKF-SSVSWSDD-GKGFFYTRFDEDQRTSDSGY  200 (414)
T ss_dssp             -EEEEEEEETTTSSEEEEEEEETTSSEEEEEEEETTTTEEEEEEEEEEES-EEEEECTT-SSEEEEEECSTTTSS-CCGC
T ss_pred             eEEeeeeeECCCCCEEEEEecCCCCceEEEEEEECCCCcCcCCccccccc-ceEEEeCC-CCEEEEEEeCcccccccCCC
Confidence            4445578999999988765 4444    4999999999765432222112 23899754 4554444322          


Q ss_pred             -CeEEEEcCccccCCCccceeecccccC--eEEEEeCCCCCEEEE-EeCCC---cEEEEEccc
Q 022074          104 -NLCKVWDRRCLNVKGKPAGVLMGHLEG--ITFIDSRGDGRYLIS-NGKDQ---AIKLWDIRK  159 (303)
Q Consensus       104 -g~v~lWd~~~~~~~~~~~~~~~~h~~~--v~~~~~~~~~~~l~s-~~~D~---~v~lWdl~~  159 (303)
                       ..|+.|++...  ...-...+.+....  ...+..+.++++++. .+...   .+.+-|+..
T Consensus       201 ~~~v~~~~~gt~--~~~d~lvfe~~~~~~~~~~~~~s~d~~~l~i~~~~~~~~s~v~~~d~~~  261 (414)
T PF02897_consen  201 PRQVYRHKLGTP--QSEDELVFEEPDEPFWFVSVSRSKDGRYLFISSSSGTSESEVYLLDLDD  261 (414)
T ss_dssp             CEEEEEEETTS---GGG-EEEEC-TTCTTSEEEEEE-TTSSEEEEEEESSSSEEEEEEEECCC
T ss_pred             CcEEEEEECCCC--hHhCeeEEeecCCCcEEEEEEecCcccEEEEEEEccccCCeEEEEeccc
Confidence             23677776421  11112334443333  456778899998764 33333   355556654


No 362
>KOG2079 consensus Vacuolar assembly/sorting protein VPS8 [Intracellular trafficking, secretion, and vesicular transport]
Probab=92.85  E-value=0.54  Score=47.13  Aligned_cols=102  Identities=21%  Similarity=0.372  Sum_probs=67.5

Q ss_pred             CCCEEEEeeCCCeEEEEECCCCceE-EEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCccceeecccc
Q 022074           50 DGRELVAGSSDDCIYVYDLEANKLS-LRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHL  128 (303)
Q Consensus        50 ~g~~l~sgs~Dg~v~lwd~~~~~~~-~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~  128 (303)
                      .+..++.|+.-|.+-..|+.+.--. ..-..-.++|++++|+ .++..++.|-.+|-|.+||..    .+.+.+.+..|.
T Consensus        98 ~~~~ivi~Ts~ghvl~~d~~~nL~~~~~ne~v~~~Vtsvafn-~dg~~l~~G~~~G~V~v~D~~----~~k~l~~i~e~~  172 (1206)
T KOG2079|consen   98 VVVPIVIGTSHGHVLLSDMTGNLGPLHQNERVQGPVTSVAFN-QDGSLLLAGLGDGHVTVWDMH----RAKILKVITEHG  172 (1206)
T ss_pred             eeeeEEEEcCchhhhhhhhhcccchhhcCCccCCcceeeEec-CCCceeccccCCCcEEEEEcc----CCcceeeeeecC
Confidence            4567888888888988888654111 1111234689999995 568888889899999999964    334444444444


Q ss_pred             cC---eEEEEeCCCCCEEEEEeCCCcEEEEEcc
Q 022074          129 EG---ITFIDSRGDGRYLISNGKDQAIKLWDIR  158 (303)
Q Consensus       129 ~~---v~~~~~~~~~~~l~s~~~D~~v~lWdl~  158 (303)
                      .+   |..+....++..++++..-|+  +|.+.
T Consensus       173 ap~t~vi~v~~t~~nS~llt~D~~Gs--f~~lv  203 (1206)
T KOG2079|consen  173 APVTGVIFVGRTSQNSKLLTSDTGGS--FWKLV  203 (1206)
T ss_pred             CccceEEEEEEeCCCcEEEEccCCCc--eEEEE
Confidence            33   444444555667888777676  77654


No 363
>PF14727 PHTB1_N:  PTHB1 N-terminus
Probab=92.65  E-value=8.2  Score=35.54  Aligned_cols=57  Identities=19%  Similarity=0.268  Sum_probs=36.3

Q ss_pred             EEEEEeCCCeEEEEECCCCeEEEEeecCCCCe--EEEEECCCCCeEEEEeCCCCEEEeecC
Q 022074          224 YIYTGSHDSCVYVYDLVSGEQVAALKYHTSPV--RDCSWHPSQPMLVSSSWDGDVVRWEFP  282 (303)
Q Consensus       224 ~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I--~~v~~sp~~~~las~s~Dg~i~~Wd~~  282 (303)
                      .++.++.++.+.||.-.+  ++=.-+....||  .-..|.....+|++-+.+|.+.+--+-
T Consensus       302 ~llV~t~t~~LlVy~d~~--L~WsA~l~~~PVal~v~~~~~~~G~IV~Ls~~G~L~v~YLG  360 (418)
T PF14727_consen  302 NLLVGTHTGTLLVYEDTT--LVWSAQLPHVPVALSVANFNGLKGLIVSLSDEGQLSVSYLG  360 (418)
T ss_pred             EEEEEecCCeEEEEeCCe--EEEecCCCCCCEEEEecccCCCCceEEEEcCCCcEEEEEeC
Confidence            488899999999996432  211111122333  233344456689999999999998763


No 364
>PF03178 CPSF_A:  CPSF A subunit region;  InterPro: IPR004871 This family includes a region that lies towards the C terminus of the cleavage and polyadenylation specificity factor (CPSF) A (160 kDa) subunit. CPSF is involved in mRNA polyadenylation and binds the AAUAAA conserved sequence in pre-mRNA. CPSF has also been found to be necessary for splicing of single-intron pre-mRNAs []. The function of the aligned region is unknown but may be involved in RNA/DNA binding.; GO: 0003676 nucleic acid binding, 0005634 nucleus; PDB: 2B5M_A 4A0K_C 4A0B_C 3I7L_A 3I8E_A 4A09_A 4A0A_A 3EI4_C 2B5L_A 3I7O_A ....
Probab=92.60  E-value=6.9  Score=34.53  Aligned_cols=193  Identities=17%  Similarity=0.241  Sum_probs=99.6

Q ss_pred             eEEEEEcCC----CCEEEEeeC----------CCeEEEEECCCC-----ceEE-EEecccCCeEEEEEccCCCcEEEEec
Q 022074           42 IFSLKFSTD----GRELVAGSS----------DDCIYVYDLEAN-----KLSL-RILAHTSDVNTVCFGDESGHLIYSGS  101 (303)
Q Consensus        42 v~~l~~s~~----g~~l~sgs~----------Dg~v~lwd~~~~-----~~~~-~~~~h~~~v~~l~~~~~~~~~l~s~s  101 (303)
                      +..+.+..+    ..++++|+.          .|.|.+|++...     ++.. ......++|.+++-.  ++. ++.+.
T Consensus        29 ~~~~~l~~~~~~~~~~ivVGT~~~~~~~~~~~~Gri~v~~i~~~~~~~~~l~~i~~~~~~g~V~ai~~~--~~~-lv~~~  105 (321)
T PF03178_consen   29 LCSVKLKGDSTGKKEYIVVGTAFNYGEDPEPSSGRILVFEISESPENNFKLKLIHSTEVKGPVTAICSF--NGR-LVVAV  105 (321)
T ss_dssp             EEEEEETTS---SSEEEEEEEEE--TTSSS-S-EEEEEEEECSS-----EEEEEEEEEESS-EEEEEEE--TTE-EEEEE
T ss_pred             EEEEEEcCccccccCEEEEEecccccccccccCcEEEEEEEEcccccceEEEEEEEEeecCcceEhhhh--CCE-EEEee
Confidence            444555543    467887763          288999999884     2221 123456789998764  344 43333


Q ss_pred             CCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeee
Q 022074          102 DDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMD  181 (303)
Q Consensus       102 ~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~  181 (303)
                       ++.|.+|++.... .-.+. .+......+.++...  +++++.|...+.+.++..+....    .+             
T Consensus       106 -g~~l~v~~l~~~~-~l~~~-~~~~~~~~i~sl~~~--~~~I~vgD~~~sv~~~~~~~~~~----~l-------------  163 (321)
T PF03178_consen  106 -GNKLYVYDLDNSK-TLLKK-AFYDSPFYITSLSVF--KNYILVGDAMKSVSLLRYDEENN----KL-------------  163 (321)
T ss_dssp             -TTEEEEEEEETTS-SEEEE-EEE-BSSSEEEEEEE--TTEEEEEESSSSEEEEEEETTTE-----E-------------
T ss_pred             -cCEEEEEEccCcc-cchhh-heecceEEEEEEecc--ccEEEEEEcccCEEEEEEEccCC----EE-------------
Confidence             4699999875322 01111 121122245555443  55999999999999885543100    00             


Q ss_pred             CCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCC-------Ce-EEE-Eeec-C
Q 022074          182 YPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVS-------GE-QVA-ALKY-H  251 (303)
Q Consensus       182 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~-------~~-~~~-~~~~-h  251 (303)
                                      ...........+..+    .+-.++..++.+..+|.+.++....       ++ .+. ...- .
T Consensus       164 ----------------~~va~d~~~~~v~~~----~~l~d~~~~i~~D~~gnl~~l~~~~~~~~~~~~~~~L~~~~~f~l  223 (321)
T PF03178_consen  164 ----------------ILVARDYQPRWVTAA----EFLVDEDTIIVGDKDGNLFVLRYNPEIPNSRDGDPKLERISSFHL  223 (321)
T ss_dssp             ----------------EEEEEESS-BEEEEE----EEE-SSSEEEEEETTSEEEEEEE-SS-SSTTTTTTBEEEEEEEE-
T ss_pred             ----------------EEEEecCCCccEEEE----EEecCCcEEEEEcCCCeEEEEEECCCCcccccccccceeEEEEEC
Confidence                            000000000011111    1222335789999999999998752       22 222 2222 2


Q ss_pred             CCCeEEE---EECCC--C------CeEEEEeCCCCEEEe
Q 022074          252 TSPVRDC---SWHPS--Q------PMLVSSSWDGDVVRW  279 (303)
Q Consensus       252 ~~~I~~v---~~sp~--~------~~las~s~Dg~i~~W  279 (303)
                      ...|+++   .+.|.  +      +.++-++.+|.|-.-
T Consensus       224 g~~v~~~~~~~l~~~~~~~~~~~~~~i~~~T~~G~Ig~l  262 (321)
T PF03178_consen  224 GDIVNSFRRGSLIPRSGSSESPNRPQILYGTVDGSIGVL  262 (321)
T ss_dssp             SS-EEEEEE--SS--SSSS-TTEEEEEEEEETTS-EEEE
T ss_pred             CCccceEEEEEeeecCCCCcccccceEEEEecCCEEEEE
Confidence            4578887   55562  2      248888889998843


No 365
>PF08596 Lgl_C:  Lethal giant larvae(Lgl) like, C-terminal;  InterPro: IPR013905  The Lethal giant larvae (Lgl) tumour suppressor protein is conserved from yeast to mammals. The Lgl protein functions in cell polarity, at least in part, by regulating SNARE-mediated membrane delivery events at the cell surface []. The N-terminal half of Lgl members contains WD40 repeats (see IPR001680 from INTERPRO), while the C-terminal half appears specific to the protein []. ; PDB: 2OAJ_A.
Probab=92.57  E-value=8.2  Score=35.34  Aligned_cols=72  Identities=18%  Similarity=0.163  Sum_probs=50.1

Q ss_pred             ccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEE--ec------ccCCeEEEEEcc----CCC---cEEEEecCC
Q 022074           39 SFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRI--LA------HTSDVNTVCFGD----ESG---HLIYSGSDD  103 (303)
Q Consensus        39 ~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~--~~------h~~~v~~l~~~~----~~~---~~l~s~s~d  103 (303)
                      ..+|.+++.| |=.+++.|..+|++.|.|++....+...  ..      ....++++.|..    +++   -.+++|...
T Consensus        86 ~g~vtal~~S-~iGFvaigy~~G~l~viD~RGPavI~~~~i~~~~~~~~~~~~vt~ieF~vm~~~~D~ySSi~L~vGTn~  164 (395)
T PF08596_consen   86 QGPVTALKNS-DIGFVAIGYESGSLVVIDLRGPAVIYNENIRESFLSKSSSSYVTSIEFSVMTLGGDGYSSICLLVGTNS  164 (395)
T ss_dssp             S-SEEEEEE--BTSEEEEEETTSEEEEEETTTTEEEEEEEGGG--T-SS----EEEEEEEEEE-TTSSSEEEEEEEEETT
T ss_pred             CCcEeEEecC-CCcEEEEEecCCcEEEEECCCCeEEeeccccccccccccccCeeEEEEEEEecCCCcccceEEEEEeCC
Confidence            6789999998 5669999999999999999777655431  12      123567777741    222   367889999


Q ss_pred             CeEEEEcC
Q 022074          104 NLCKVWDR  111 (303)
Q Consensus       104 g~v~lWd~  111 (303)
                      |.+.+|.+
T Consensus       165 G~v~~fkI  172 (395)
T PF08596_consen  165 GNVLTFKI  172 (395)
T ss_dssp             SEEEEEEE
T ss_pred             CCEEEEEE
Confidence            99999976


No 366
>PF10313 DUF2415:  Uncharacterised protein domain (DUF2415);  InterPro: IPR019417  This entry represents a short (30 residues) domain of unknown function found in a family of fungal proteins. It contains a characteristic DLL sequence motif. 
Probab=92.14  E-value=0.58  Score=27.90  Aligned_cols=34  Identities=24%  Similarity=0.313  Sum_probs=27.5

Q ss_pred             CCeEEEEECCCC---CeEEEEeCCCCEEEeecCCCCc
Q 022074          253 SPVRDCSWHPSQ---PMLVSSSWDGDVVRWEFPGNGE  286 (303)
Q Consensus       253 ~~I~~v~~sp~~---~~las~s~Dg~i~~Wd~~~~~~  286 (303)
                      +.|.+|+|||..   .+|+-+-.-+.+.++|...+.+
T Consensus         1 GAvR~~kFsP~~~~~DLL~~~E~~g~vhi~D~R~~f~   37 (43)
T PF10313_consen    1 GAVRCCKFSPEPGGNDLLAWAEHQGRVHIVDTRSNFM   37 (43)
T ss_pred             CCeEEEEeCCCCCcccEEEEEccCCeEEEEEcccCcc
Confidence            368899999854   4899888889999999886443


No 367
>KOG4649 consensus PQQ (pyrrolo-quinoline quinone) repeat protein [Secondary metabolites biosynthesis, transport and catabolism]
Probab=91.83  E-value=7.4  Score=33.19  Aligned_cols=32  Identities=31%  Similarity=0.315  Sum_probs=26.7

Q ss_pred             eeCCCeEEEEEeCCCeEEEEECCCCeEEEEee
Q 022074          218 YSTGQKYIYTGSHDSCVYVYDLVSGEQVAALK  249 (303)
Q Consensus       218 ~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~  249 (303)
                      ..+++.++.+|+.|+..+..|.++...+...+
T Consensus       101 ~d~~~glIycgshd~~~yalD~~~~~cVyksk  132 (354)
T KOG4649|consen  101 CDFDGGLIYCGSHDGNFYALDPKTYGCVYKSK  132 (354)
T ss_pred             EcCCCceEEEecCCCcEEEecccccceEEecc
Confidence            44678899999999999999999877776654


No 368
>KOG4441 consensus Proteins containing BTB/POZ and Kelch domains, involved in regulatory/signal transduction processes [Signal transduction mechanisms; General function prediction only]
Probab=91.81  E-value=9.1  Score=36.93  Aligned_cols=96  Identities=21%  Similarity=0.276  Sum_probs=51.2

Q ss_pred             CCCEEEEeeCC------CeEEEEECCCCceEEEEecccCCe--EEEEEccCCCcEEEEecCCCe-----EEEEcCccccC
Q 022074           50 DGRELVAGSSD------DCIYVYDLEANKLSLRILAHTSDV--NTVCFGDESGHLIYSGSDDNL-----CKVWDRRCLNV  116 (303)
Q Consensus        50 ~g~~l~sgs~D------g~v~lwd~~~~~~~~~~~~h~~~v--~~l~~~~~~~~~l~s~s~dg~-----v~lWd~~~~~~  116 (303)
                      ++...++||.|      .++..||..+++.. .+..-...-  ..++.  -++.+.+.|+.||.     |-.||.+..  
T Consensus       332 ~~~lYv~GG~~~~~~~l~~ve~YD~~~~~W~-~~a~M~~~R~~~~v~~--l~g~iYavGG~dg~~~l~svE~YDp~~~--  406 (571)
T KOG4441|consen  332 NGKLYVVGGYDSGSDRLSSVERYDPRTNQWT-PVAPMNTKRSDFGVAV--LDGKLYAVGGFDGEKSLNSVECYDPVTN--  406 (571)
T ss_pred             CCEEEEEccccCCCcccceEEEecCCCCcee-ccCCccCccccceeEE--ECCEEEEEeccccccccccEEEecCCCC--
Confidence            56788889988      36788999888743 222211111  12222  23678888998875     445665422  


Q ss_pred             CCccceeecccccCeEEEEeCCCCCEEEEEeCCCcE
Q 022074          117 KGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAI  152 (303)
Q Consensus       117 ~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v  152 (303)
                      .................+.+  +|...+.||.|+.-
T Consensus       407 ~W~~va~m~~~r~~~gv~~~--~g~iYi~GG~~~~~  440 (571)
T KOG4441|consen  407 KWTPVAPMLTRRSGHGVAVL--GGKLYIIGGGDGSS  440 (571)
T ss_pred             cccccCCCCcceeeeEEEEE--CCEEEEEcCcCCCc
Confidence            22222222211222222222  56777888876654


No 369
>PF10313 DUF2415:  Uncharacterised protein domain (DUF2415);  InterPro: IPR019417  This entry represents a short (30 residues) domain of unknown function found in a family of fungal proteins. It contains a characteristic DLL sequence motif. 
Probab=91.33  E-value=0.72  Score=27.49  Aligned_cols=31  Identities=23%  Similarity=0.324  Sum_probs=26.0

Q ss_pred             CCeEEEEEccCCC--cEEEEecCCCeEEEEcCc
Q 022074           82 SDVNTVCFGDESG--HLIYSGSDDNLCKVWDRR  112 (303)
Q Consensus        82 ~~v~~l~~~~~~~--~~l~s~s~dg~v~lWd~~  112 (303)
                      +.+.++.|+|...  ++|+-+-..|.|.++|+|
T Consensus         1 GAvR~~kFsP~~~~~DLL~~~E~~g~vhi~D~R   33 (43)
T PF10313_consen    1 GAVRCCKFSPEPGGNDLLAWAEHQGRVHIVDTR   33 (43)
T ss_pred             CCeEEEEeCCCCCcccEEEEEccCCeEEEEEcc
Confidence            3578999987655  788888889999999987


No 370
>COG0823 TolB Periplasmic component of the Tol biopolymer transport system [Intracellular trafficking and secretion]
Probab=91.07  E-value=1.6  Score=40.27  Aligned_cols=103  Identities=17%  Similarity=0.252  Sum_probs=58.0

Q ss_pred             eEEEEEcCCCCEEEEe-eCCCe--EEEEECCCCceEEEEecccCCeE-EEEEccCCCcEEEEecCCCeEEEEcCccccCC
Q 022074           42 IFSLKFSTDGRELVAG-SSDDC--IYVYDLEANKLSLRILAHTSDVN-TVCFGDESGHLIYSGSDDNLCKVWDRRCLNVK  117 (303)
Q Consensus        42 v~~l~~s~~g~~l~sg-s~Dg~--v~lwd~~~~~~~~~~~~h~~~v~-~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~  117 (303)
                      -..-+|+|||++|+-. ..||.  |++.|+.++.... + .+..++. .=.|+|+....+++.+..|.=.+|-...   .
T Consensus       240 ~~~P~fspDG~~l~f~~~rdg~~~iy~~dl~~~~~~~-L-t~~~gi~~~Ps~spdG~~ivf~Sdr~G~p~I~~~~~---~  314 (425)
T COG0823         240 NGAPAFSPDGSKLAFSSSRDGSPDIYLMDLDGKNLPR-L-TNGFGINTSPSWSPDGSKIVFTSDRGGRPQIYLYDL---E  314 (425)
T ss_pred             cCCccCCCCCCEEEEEECCCCCccEEEEcCCCCccee-c-ccCCccccCccCCCCCCEEEEEeCCCCCcceEEECC---C
Confidence            3445799999976665 45665  5666887776432 2 3333333 3356666555666777777555553221   1


Q ss_pred             CccceeecccccCeEEEEeCCCCCEEEEEeCC
Q 022074          118 GKPAGVLMGHLEGITFIDSRGDGRYLISNGKD  149 (303)
Q Consensus       118 ~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D  149 (303)
                      +.....+.-....-..-.++|+|.+|+-.+..
T Consensus       315 g~~~~riT~~~~~~~~p~~SpdG~~i~~~~~~  346 (425)
T COG0823         315 GSQVTRLTFSGGGNSNPVWSPDGDKIVFESSS  346 (425)
T ss_pred             CCceeEeeccCCCCcCccCCCCCCEEEEEecc
Confidence            22222332222222245689999999877643


No 371
>PF00930 DPPIV_N:  Dipeptidyl peptidase IV (DPP IV) N-terminal region;  InterPro: IPR002469 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. 
They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Notwithstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example, the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This domain defines serine peptidases belonging to MEROPS peptidase family S9 (clan SC), subfamily S9B (dipeptidyl-peptidase IV). The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. This domain is an alignment of the region to the N-terminal side of the active site, which is found in IPR001375 from INTERPRO. CD26 (3.4.14.5 from EC) is also called adenosine deaminase-binding protein (ADA-binding protein) or dipeptidylpeptidase IV (DPP IV ectoenzyme). The exopeptidase cleaves off N-terminal X-Pro or X-Ala dipeptides from polypeptides (dipeptidyl peptidase IV activity). 
CD26 serves as the costimulatory molecule in T cell activation and is an associated marker of autoimmune diseases, adenosine deaminase-deficiency and HIV pathogenesis.  Dipeptidyl peptidase IV (DPP IV) is responsible for the removal of N-terminal dipeptides sequentially from polypeptides having unsubstituted N termini, provided that the penultimate residue is proline. The enzyme catalyses the reaction: Dipeptidyl-Polypeptide + H(2)O = Dipeptide + Polypeptide  It is a type II membrane protein that forms a homodimer.  CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0006508 proteolysis, 0016020 membrane; PDB: 2RIP_A 3Q8W_B 2AJL_I 1TKR_B 1TK3_B 3C45_A 2G5P_A 3G0C_D 1R9M_C 1RWQ_A ....
Probab=90.52  E-value=0.77  Score=41.28  Aligned_cols=52  Identities=19%  Similarity=0.431  Sum_probs=39.3

Q ss_pred             CCeEEEEECCCCeEEEEeecCCCCeEEEEECCCCCeEEEEeCCCCEEEeecCCC
Q 022074          231 DSCVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFPGN  284 (303)
Q Consensus       231 dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~~~  284 (303)
                      .+.+.++|+.+++.. .+......+....|||+|+.+|-.. ++.|.+++....
T Consensus        22 ~~~y~i~d~~~~~~~-~l~~~~~~~~~~~~sP~g~~~~~v~-~~nly~~~~~~~   73 (353)
T PF00930_consen   22 KGDYYIYDIETGEIT-PLTPPPPKLQDAKWSPDGKYIAFVR-DNNLYLRDLATG   73 (353)
T ss_dssp             EEEEEEEETTTTEEE-ESS-EETTBSEEEE-SSSTEEEEEE-TTEEEEESSTTS
T ss_pred             ceeEEEEecCCCceE-ECcCCccccccceeecCCCeeEEEe-cCceEEEECCCC
Confidence            457999999997643 3333367899999999999988886 689999987654


No 372
>PF00780 CNH:  CNH domain;  InterPro: IPR001180 Based on sequence similarities a domain of homology has been identified in the following proteins []:  Citron and Citron kinase. These two proteins interact with the GTP-bound forms of the small GTPases Rho and Rac but not with Cdc42. Myotonic dystrophy kinase-related Cdc42-binding kinase (MRCKalpha). This serine/threonine kinase interacts with the GTP-bound form of the small GTPase Cdc42 and to a lesser extent with that of Rac. NCK Interacting Kinase (NIK), a serine/threonine protein kinase. ROM-1 and ROM-2, from yeast. These proteins are GDP/GTP exchange proteins (GEPs) for the small GTP binding protein Rho1.  This domain, called the citron homology domain, is often found after cysteine rich and pleckstrin homology (PH) domains at the C-terminal end of the proteins []. It acts as a regulatory domain and could be involved in macromolecular interactions [, ].; GO: 0005083 small GTPase regulator activity
Probab=90.30  E-value=5.6  Score=34.06  Aligned_cols=107  Identities=23%  Similarity=0.366  Sum_probs=61.6

Q ss_pred             CCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCccc--e----
Q 022074           49 TDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPA--G----  122 (303)
Q Consensus        49 ~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~--~----  122 (303)
                      ..++.|+.|+.+| +++++........++. +...|..+...++-+ .|+.-+ |++++++++..........  .    
T Consensus         5 ~~~~~L~vGt~~G-l~~~~~~~~~~~~~i~-~~~~I~ql~vl~~~~-~llvLs-d~~l~~~~L~~l~~~~~~~~~~~~~~   80 (275)
T PF00780_consen    5 SWGDRLLVGTEDG-LYVYDLSDPSKPTRIL-KLSSITQLSVLPELN-LLLVLS-DGQLYVYDLDSLEPVSTSAPLAFPKS   80 (275)
T ss_pred             cCCCEEEEEECCC-EEEEEecCCccceeEe-ecceEEEEEEecccC-EEEEEc-CCccEEEEchhhcccccccccccccc
Confidence            3578999999999 9999993333222222 223388888765544 444444 4999999986432221100  0    


Q ss_pred             ----eecccccCeEEEE--eCCCCCEEEEEeCCCcEEEEEccc
Q 022074          123 ----VLMGHLEGITFID--SRGDGRYLISNGKDQAIKLWDIRK  159 (303)
Q Consensus       123 ----~~~~h~~~v~~~~--~~~~~~~l~s~~~D~~v~lWdl~~  159 (303)
                          .-......+..++  -...+...+.....++|.+|....
T Consensus        81 ~~~~~~~~~~~~v~~f~~~~~~~~~~~L~va~kk~i~i~~~~~  123 (275)
T PF00780_consen   81 RSLPTKLPETKGVSFFAVNGGHEGSRRLCVAVKKKILIYEWND  123 (275)
T ss_pred             ccccccccccCCeeEEeeccccccceEEEEEECCEEEEEEEEC
Confidence                0111223454444  233445556666667999998765


No 373
>PF14870 PSII_BNR:  Photosynthesis system II assembly factor YCF48; PDB: 2XBG_A.
Probab=90.26  E-value=12  Score=32.86  Aligned_cols=127  Identities=16%  Similarity=0.098  Sum_probs=60.7

Q ss_pred             cccccccCcCc-ccccCCCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEE
Q 022074           20 NVTEIHDGLDF-SAADDGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIY   98 (303)
Q Consensus        20 ~~~~~~~~~~~-~~~~~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~   98 (303)
                      +|.+=.+|.+- .++. .+-...+..+..++||+++++++.-....-||.-.......-..-...|..+.|.++ +.+.+
T Consensus       125 ~iy~T~DgG~tW~~~~-~~~~gs~~~~~r~~dG~~vavs~~G~~~~s~~~G~~~w~~~~r~~~~riq~~gf~~~-~~lw~  202 (302)
T PF14870_consen  125 AIYRTTDGGKTWQAVV-SETSGSINDITRSSDGRYVAVSSRGNFYSSWDPGQTTWQPHNRNSSRRIQSMGFSPD-GNLWM  202 (302)
T ss_dssp             -EEEESSTTSSEEEEE--S----EEEEEE-TTS-EEEEETTSSEEEEE-TT-SS-EEEE--SSS-EEEEEE-TT-S-EEE
T ss_pred             cEEEeCCCCCCeeEcc-cCCcceeEeEEECCCCcEEEEECcccEEEEecCCCccceEEccCccceehhceecCC-CCEEE
Confidence            45555555522 1222 344467899999999999988866665667765433222222334578999999754 55544


Q ss_pred             EecCCCeEEEEcCccccCCC-ccceeecccccCeEEEEeCCCCCEEEEEeCC
Q 022074           99 SGSDDNLCKVWDRRCLNVKG-KPAGVLMGHLEGITFIDSRGDGRYLISNGKD  149 (303)
Q Consensus        99 s~s~dg~v~lWd~~~~~~~~-~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D  149 (303)
                       ....|.++.=+......+. ++......-.-.+..+++.+++...++|+..
T Consensus       203 -~~~Gg~~~~s~~~~~~~~w~~~~~~~~~~~~~~ld~a~~~~~~~wa~gg~G  253 (302)
T PF14870_consen  203 -LARGGQIQFSDDPDDGETWSEPIIPIKTNGYGILDLAYRPPNEIWAVGGSG  253 (302)
T ss_dssp             -EETTTEEEEEE-TTEEEEE---B-TTSS--S-EEEEEESSSS-EEEEESTT
T ss_pred             -EeCCcEEEEccCCCCccccccccCCcccCceeeEEEEecCCCCEEEEeCCc
Confidence             4478888876511000000 0111111112346788898888777776654


No 374
>PF07569 Hira:  TUP1-like enhancer of split;  InterPro: IPR011494 The Hira proteins are found in a range of eukaryotes and are implicated in the assembly of repressive chromatin. These proteins also contain IPR001680 from INTERPRO.; GO: 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=90.26  E-value=2.1  Score=35.67  Aligned_cols=61  Identities=21%  Similarity=0.402  Sum_probs=46.4

Q ss_pred             CCeEEEEEeCCCeEEEEECCCCeEEEE-------ee-------cCCCCeEEEEECCCCCeEEEEeCCCCEEEeecC
Q 022074          221 GQKYIYTGSHDSCVYVYDLVSGEQVAA-------LK-------YHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFP  282 (303)
Q Consensus       221 ~~~~latg~~dg~i~iwd~~~~~~~~~-------~~-------~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~  282 (303)
                      ++.+|++-..+|.+++||+.+++.+..       +.       .....|..+..+.+|.-+++-+ +|....|+..
T Consensus        21 ~~~~Ll~iT~~G~l~vWnl~~~k~~~~~~Si~pll~~~~~~~~~~~~~i~~~~lt~~G~PiV~ls-ng~~y~y~~~   95 (219)
T PF07569_consen   21 NGSYLLAITSSGLLYVWNLKKGKAVLPPVSIAPLLNSSPVSDKSSSPNITSCSLTSNGVPIVTLS-NGDSYSYSPD   95 (219)
T ss_pred             CCCEEEEEeCCCeEEEEECCCCeeccCCccHHHHhcccccccCCCCCcEEEEEEcCCCCEEEEEe-CCCEEEeccc
Confidence            577899999999999999998875321       11       2446799999999998777665 5778888753


No 375
>KOG4441 consensus Proteins containing BTB/POZ and Kelch domains, involved in regulatory/signal transduction processes [Signal transduction mechanisms; General function prediction only]
Probab=90.19  E-value=18  Score=34.86  Aligned_cols=99  Identities=19%  Similarity=0.221  Sum_probs=54.4

Q ss_pred             cCCCCEEEEeeCCC------eEEEEECCCCceEEEE-ecccCCeEEEEEccCCCcEEEEecCC------CeEEEEcCccc
Q 022074           48 STDGRELVAGSSDD------CIYVYDLEANKLSLRI-LAHTSDVNTVCFGDESGHLIYSGSDD------NLCKVWDRRCL  114 (303)
Q Consensus        48 s~~g~~l~sgs~Dg------~v~lwd~~~~~~~~~~-~~h~~~v~~l~~~~~~~~~l~s~s~d------g~v~lWd~~~~  114 (303)
                      +..+..++.||.++      .+..||..++...... ..+...-.+++..  ++.+.++|+.|      .++..||.+..
T Consensus       282 ~~~~~l~~vGG~~~~~~~~~~ve~yd~~~~~w~~~a~m~~~r~~~~~~~~--~~~lYv~GG~~~~~~~l~~ve~YD~~~~  359 (571)
T KOG4441|consen  282 SVSGKLVAVGGYNRQGQSLRSVECYDPKTNEWSSLAPMPSPRCRVGVAVL--NGKLYVVGGYDSGSDRLSSVERYDPRTN  359 (571)
T ss_pred             CCCCeEEEECCCCCCCcccceeEEecCCcCcEeecCCCCcccccccEEEE--CCEEEEEccccCCCcccceEEEecCCCC
Confidence            34466778888774      6888999888533211 1222233444443  35788899988      35556776532


Q ss_pred             cCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcE
Q 022074          115 NVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAI  152 (303)
Q Consensus       115 ~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v  152 (303)
                        .......+...........+  +|...+.||.|+.-
T Consensus       360 --~W~~~a~M~~~R~~~~v~~l--~g~iYavGG~dg~~  393 (571)
T KOG4441|consen  360 --QWTPVAPMNTKRSDFGVAVL--DGKLYAVGGFDGEK  393 (571)
T ss_pred             --ceeccCCccCccccceeEEE--CCEEEEEecccccc
Confidence              22223223222222222222  57778899998553


No 376
>TIGR02276 beta_rpt_yvtn 40-residue YVTN family beta-propeller repeat. This repeat of about 40 amino acids is found in up to 14 copies per protein. Archaea Methanosarcina mazei and Methanosarcina acetivorans each have over 10 genes that encode tandem copies of this repeat, which is also found in other species. PSIPRED predicts with high confidence that each 40-residue repeats contains four beta strands. This model overlaps somewhat with the NHL repeat (Pfam pfam01436) and also shows sequence similarity to the WD domain, G-beta repeat (Pfam pfam00400).
Probab=89.93  E-value=2.5  Score=24.45  Aligned_cols=40  Identities=15%  Similarity=0.132  Sum_probs=26.7

Q ss_pred             CCeEEEE-EeCCCeEEEEECCCCeEEEEeecCCCCeEEEEEC
Q 022074          221 GQKYIYT-GSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSWH  261 (303)
Q Consensus       221 ~~~~lat-g~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~s  261 (303)
                      ++++|.+ .-.+++|.++|..+++.+..+..- .....+.|+
T Consensus         2 d~~~lyv~~~~~~~v~~id~~~~~~~~~i~vg-~~P~~i~~~   42 (42)
T TIGR02276         2 DGTKLYVTNSGSNTVSVIDTATNKVIATIPVG-GYPFGVAVS   42 (42)
T ss_pred             CCCEEEEEeCCCCEEEEEECCCCeEEEEEECC-CCCceEEeC
Confidence            4454444 456899999999999888877653 333455553


No 377
>COG3391 Uncharacterized conserved protein [Function unknown]
Probab=89.73  E-value=16  Score=33.32  Aligned_cols=182  Identities=19%  Similarity=0.198  Sum_probs=102.5

Q ss_pred             ceEEEEEcCCCCEEEEeeC---CCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCC
Q 022074           41 GIFSLKFSTDGRELVAGSS---DDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVK  117 (303)
Q Consensus        41 ~v~~l~~s~~g~~l~sgs~---Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~  117 (303)
                      .-..++++++++.+.++..   ++++.+.|..+.+.......-..+ ..+++.+......++-..++.|.+.|.......
T Consensus       117 ~P~~~~~~~~~~~vYV~n~~~~~~~vsvid~~t~~~~~~~~vG~~P-~~~a~~p~g~~vyv~~~~~~~v~vi~~~~~~v~  195 (381)
T COG3391         117 GPVGLAVDPDGKYVYVANAGNGNNTVSVIDAATNKVTATIPVGNTP-TGVAVDPDGNKVYVTNSDDNTVSVIDTSGNSVV  195 (381)
T ss_pred             CCceEEECCCCCEEEEEecccCCceEEEEeCCCCeEEEEEecCCCc-ceEEECCCCCeEEEEecCCCeEEEEeCCCccee
Confidence            4567889999988888765   688999999988877664433334 778886553335555567889999986422211


Q ss_pred             -CccceeecccccCeEEEEeCCCCCEEEEEeC-C--CcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCC
Q 022074          118 -GKPAGVLMGHLEGITFIDSRGDGRYLISNGK-D--QAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPC  193 (303)
Q Consensus       118 -~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~-D--~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  193 (303)
                       ....... .-...-..+.+.+++.++..... +  +.+...|.........               ..+..        
T Consensus       196 ~~~~~~~~-~~~~~P~~i~v~~~g~~~yV~~~~~~~~~v~~id~~~~~v~~~---------------~~~~~--------  251 (381)
T COG3391         196 RGSVGSLV-GVGTGPAGIAVDPDGNRVYVANDGSGSNNVLKIDTATGNVTAT---------------DLPVG--------  251 (381)
T ss_pred             cccccccc-ccCCCCceEEECCCCCEEEEEeccCCCceEEEEeCCCceEEEe---------------ccccc--------
Confidence             1100011 11122235567888886544333 2  4777777653211000               00000        


Q ss_pred             CCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEe-CCCeEEEEECCCCeEEEEeecCC---CCeEEEEECCC
Q 022074          194 DQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGS-HDSCVYVYDLVSGEQVAALKYHT---SPVRDCSWHPS  263 (303)
Q Consensus       194 ~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~-~dg~i~iwd~~~~~~~~~~~~h~---~~I~~v~~sp~  263 (303)
                           .          . .......+|+++++.... ..+.+.+.|..+...........   ..+..+++.+.
T Consensus       252 -----~----------~-~~~~v~~~p~g~~~yv~~~~~~~V~vid~~~~~v~~~~~~~~~~~~~~~~~~~~~~  309 (381)
T COG3391         252 -----S----------G-APRGVAVDPAGKAAYVANSQGGTVSVIDGATDRVVKTGPTGNEALGEPVSIAISPL  309 (381)
T ss_pred             -----c----------C-CCCceeECCCCCEEEEEecCCCeEEEEeCCCCceeeeecccccccccceeccceee
Confidence                 0          0 000112456777777763 45889999988877665544332   24566666554


No 378
>PF04841 Vps16_N:  Vps16, N-terminal region;  InterPro: IPR006926 This protein forms part of the Class C vacuolar protein sorting (Vps) complex. Vps16 is essential for vacuolar protein sorting, which is essential for viability in plants, but not yeast []. The Class C Vps complex is required for SNARE-mediated membrane fusion at the lysosome-like yeast vacuole. It is thought to play essential roles in membrane docking and fusion at the Golgi-to-endosome and endosome-to-vacuole stages of transport []. The role of VPS16 in this complex is not known.; GO: 0006886 intracellular protein transport, 0005737 cytoplasm
Probab=88.79  E-value=19  Score=33.13  Aligned_cols=30  Identities=10%  Similarity=0.119  Sum_probs=26.4

Q ss_pred             CCCeEEEEECCCCCeEEEEeCCCCEEEeec
Q 022074          252 TSPVRDCSWHPSQPMLVSSSWDGDVVRWEF  281 (303)
Q Consensus       252 ~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~  281 (303)
                      .+++..++.||+++++|--..+|.+.+..-
T Consensus       216 ~~~i~~iavSpng~~iAl~t~~g~l~v~ss  245 (410)
T PF04841_consen  216 DGPIIKIAVSPNGKFIALFTDSGNLWVVSS  245 (410)
T ss_pred             CCCeEEEEECCCCCEEEEEECCCCEEEEEC
Confidence            368999999999999999999999988763


No 379
>COG3391 Uncharacterized conserved protein [Function unknown]
Probab=88.72  E-value=18  Score=32.83  Aligned_cols=198  Identities=15%  Similarity=0.127  Sum_probs=111.7

Q ss_pred             eEEEEEcCCCCEEEEee-CCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEec--CCCeEEEEcCccccCCC
Q 022074           42 IFSLKFSTDGRELVAGS-SDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGS--DDNLCKVWDRRCLNVKG  118 (303)
Q Consensus        42 v~~l~~s~~g~~l~sgs-~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s--~dg~v~lWd~~~~~~~~  118 (303)
                      -..++.++.++.+++.. .++.|.+.|..+.+.......- .....+++.++.....++-.  .++++.+.|..    ..
T Consensus        76 p~~i~v~~~~~~vyv~~~~~~~v~vid~~~~~~~~~~~vG-~~P~~~~~~~~~~~vYV~n~~~~~~~vsvid~~----t~  150 (381)
T COG3391          76 PAGVAVNPAGNKVYVTTGDSNTVSVIDTATNTVLGSIPVG-LGPVGLAVDPDGKYVYVANAGNGNNTVSVIDAA----TN  150 (381)
T ss_pred             ccceeeCCCCCeEEEecCCCCeEEEEcCcccceeeEeeec-cCCceEEECCCCCEEEEEecccCCceEEEEeCC----CC
Confidence            34577888888655544 4578999997777655433221 24567778665444544444  36788777743    22


Q ss_pred             cccee-ecccccCeEEEEeCCCCCEEE-EEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCc
Q 022074          119 KPAGV-LMGHLEGITFIDSRGDGRYLI-SNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQS  196 (303)
Q Consensus       119 ~~~~~-~~~h~~~v~~~~~~~~~~~l~-s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  196 (303)
                      ..... ..|- .+ ..+++.++|+.++ +...++.+.+.|........ ..               +          ...
T Consensus       151 ~~~~~~~vG~-~P-~~~a~~p~g~~vyv~~~~~~~v~vi~~~~~~v~~-~~---------------~----------~~~  202 (381)
T COG3391         151 KVTATIPVGN-TP-TGVAVDPDGNKVYVTNSDDNTVSVIDTSGNSVVR-GS---------------V----------GSL  202 (381)
T ss_pred             eEEEEEecCC-Cc-ceEEECCCCCeEEEEecCCCeEEEEeCCCcceec-cc---------------c----------ccc
Confidence            22222 2222 22 6778899998654 45578999999854321110 00               0          000


Q ss_pred             ceEEecccceeeeEEEeeeeeeeCCCeEEEEEeC---CCeEEEEECCCCeEEEE-ee-cCCCCeEEEEECCCCCeEEEE-
Q 022074          197 VATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSH---DSCVYVYDLVSGEQVAA-LK-YHTSPVRDCSWHPSQPMLVSS-  270 (303)
Q Consensus       197 ~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~---dg~i~iwd~~~~~~~~~-~~-~h~~~I~~v~~sp~~~~las~-  270 (303)
                      +.          .....+...+++++.++.....   ++.+...|..++..... .. +-. ....+..+|++.++-.. 
T Consensus       203 ~~----------~~~~P~~i~v~~~g~~~yV~~~~~~~~~v~~id~~~~~v~~~~~~~~~~-~~~~v~~~p~g~~~yv~~  271 (381)
T COG3391         203 VG----------VGTGPAGIAVDPDGNRVYVANDGSGSNNVLKIDTATGNVTATDLPVGSG-APRGVAVDPAGKAAYVAN  271 (381)
T ss_pred             cc----------cCCCCceEEECCCCCEEEEEeccCCCceEEEEeCCCceEEEeccccccC-CCCceeECCCCCEEEEEe
Confidence            00          0000111224566665444433   36899999988776544 21 222 45778999999976666 


Q ss_pred             eCCCCEEEeecCC
Q 022074          271 SWDGDVVRWEFPG  283 (303)
Q Consensus       271 s~Dg~i~~Wd~~~  283 (303)
                      +..+.+.+-|...
T Consensus       272 ~~~~~V~vid~~~  284 (381)
T COG3391         272 SQGGTVSVIDGAT  284 (381)
T ss_pred             cCCCeEEEEeCCC
Confidence            3346777766543


No 380
>PF00780 CNH:  CNH domain;  InterPro: IPR001180 Based on sequence similarities a domain of homology has been identified in the following proteins []:  Citron and Citron kinase. These two proteins interact with the GTP-bound forms of the small GTPases Rho and Rac but not with Cdc42. Myotonic dystrophy kinase-related Cdc42-binding kinase (MRCKalpha). This serine/threonine kinase interacts with the GTP-bound form of the small GTPase Cdc42 and to a lesser extent with that of Rac. NCK Interacting Kinase (NIK), a serine/threonine protein kinase. ROM-1 and ROM-2, from yeast. These proteins are GDP/GTP exchange proteins (GEPs) for the small GTP binding protein Rho1.  This domain, called the citron homology domain, is often found after cysteine rich and pleckstrin homology (PH) domains at the C-terminal end of the proteins []. It acts as a regulatory domain and could be involved in macromolecular interactions [, ].; GO: 0005083 small GTPase regulator activity
Probab=88.06  E-value=16  Score=31.23  Aligned_cols=115  Identities=17%  Similarity=0.111  Sum_probs=64.6

Q ss_pred             cceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEE--------------EecccCCeEEEE-EccCCCcEEEEecCCC
Q 022074           40 FGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLR--------------ILAHTSDVNTVC-FGDESGHLIYSGSDDN  104 (303)
Q Consensus        40 ~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~--------------~~~h~~~v~~l~-~~~~~~~~l~s~s~dg  104 (303)
                      .+|..|...++-+.+++= .|+.++++++..-.....              ......++...+ -....+...+......
T Consensus        36 ~~I~ql~vl~~~~~llvL-sd~~l~~~~L~~l~~~~~~~~~~~~~~~~~~~~~~~~~~v~~f~~~~~~~~~~~L~va~kk  114 (275)
T PF00780_consen   36 SSITQLSVLPELNLLLVL-SDGQLYVYDLDSLEPVSTSAPLAFPKSRSLPTKLPETKGVSFFAVNGGHEGSRRLCVAVKK  114 (275)
T ss_pred             ceEEEEEEecccCEEEEE-cCCccEEEEchhhccccccccccccccccccccccccCCeeEEeeccccccceEEEEEECC
Confidence            349999999887766654 459999999876432221              122334566655 1122333444444455


Q ss_pred             eEEEEcCccccCCC-ccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEccc
Q 022074          105 LCKVWDRRCLNVKG-KPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRK  159 (303)
Q Consensus       105 ~v~lWd~~~~~~~~-~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~  159 (303)
                      .+.+|......... ...+.+. -.+.+..+.+.  ++.++.|.. +...+.|+..
T Consensus       115 ~i~i~~~~~~~~~f~~~~ke~~-lp~~~~~i~~~--~~~i~v~~~-~~f~~idl~~  166 (275)
T PF00780_consen  115 KILIYEWNDPRNSFSKLLKEIS-LPDPPSSIAFL--GNKICVGTS-KGFYLIDLNT  166 (275)
T ss_pred             EEEEEEEECCcccccceeEEEE-cCCCcEEEEEe--CCEEEEEeC-CceEEEecCC
Confidence            88888764321111 2222332 23566677776  445666654 5577888774


No 381
>COG3490 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=87.71  E-value=1.8  Score=37.29  Aligned_cols=61  Identities=20%  Similarity=0.363  Sum_probs=48.4

Q ss_pred             eeeCCCeEEEEEeC-----CCeEEEEECCCC-eEEEEeecCCCCeEEEEECCCCCeEEEEeCCCCEEEe
Q 022074          217 VYSTGQKYIYTGSH-----DSCVYVYDLVSG-EQVAALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVRW  279 (303)
Q Consensus       217 ~~s~~~~~latg~~-----dg~i~iwd~~~~-~~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~W  279 (303)
                      +||+||.+|..--.     -|.|-|||.+.+ +.+.++..|..-..++.|.+||+.|+.+.  |-|..-
T Consensus       120 vfs~dG~~LYATEndfd~~rGViGvYd~r~~fqrvgE~~t~GiGpHev~lm~DGrtlvvan--GGIeth  186 (366)
T COG3490         120 VFSPDGRLLYATENDFDPNRGVIGVYDAREGFQRVGEFSTHGIGPHEVTLMADGRTLVVAN--GGIETH  186 (366)
T ss_pred             ccCCCCcEEEeecCCCCCCCceEEEEecccccceecccccCCcCcceeEEecCCcEEEEeC--Cceecc
Confidence            58899998876432     467999999855 45788889988889999999999998884  666655


No 382
>PF12234 Rav1p_C:  RAVE protein 1 C terminal;  InterPro: IPR022033  This domain family is found in eukaryotes, and is typically between 621 and 644 amino acids in length. This family is the C-terminal region of the protein RAVE (regulator of the ATPase of vacuolar and endosomal membranes). Rav1p is involved in regulating the glucose dependent assembly and disassembly of vacuolar ATPase V1 and V0 subunits. 
Probab=87.63  E-value=19  Score=35.04  Aligned_cols=61  Identities=16%  Similarity=0.346  Sum_probs=42.1

Q ss_pred             eCCCeEEEEEeCCCeEEEEECC-----CC----eEEEEe--ecCC-CCeEEEEECCCCCeEEEEeCCCCEEEeec
Q 022074          219 STGQKYIYTGSHDSCVYVYDLV-----SG----EQVAAL--KYHT-SPVRDCSWHPSQPMLVSSSWDGDVVRWEF  281 (303)
Q Consensus       219 s~~~~~latg~~dg~i~iwd~~-----~~----~~~~~~--~~h~-~~I~~v~~sp~~~~las~s~Dg~i~~Wd~  281 (303)
                      .|+++.+++-|-...|.++-..     +.    ..+..+  ..|+ .+|.+..|-++|.+++.+|  +.+.+++-
T Consensus        83 t~d~qsiLaVGf~~~v~l~~Q~R~dy~~~~p~w~~i~~i~i~~~T~h~Igds~Wl~~G~LvV~sG--Nqlfv~dk  155 (631)
T PF12234_consen   83 TPDGQSILAVGFPHHVLLYTQLRYDYTNKGPSWAPIRKIDISSHTPHPIGDSIWLKDGTLVVGSG--NQLFVFDK  155 (631)
T ss_pred             cCCCCEEEEEEcCcEEEEEEccchhhhcCCcccceeEEEEeecCCCCCccceeEecCCeEEEEeC--CEEEEECC
Confidence            4677888888888888888542     11    122222  3344 5899999999999877775  67888874


No 383
>PF06977 SdiA-regulated:  SdiA-regulated;  InterPro: IPR009722 This entry represents a conserved region approximately 100 residues long within a number of hypothetical bacterial proteins that may be regulated by SdiA, a member of the LuxR family of transcriptional regulators []. Some proteins contain the IPR001258 from INTERPRO repeat.; PDB: 3QQZ_A.
Probab=87.06  E-value=14  Score=31.38  Aligned_cols=105  Identities=18%  Similarity=0.242  Sum_probs=59.0

Q ss_pred             CcccceEEEEEcCCCCEEEEeeCCCeEEEEECCC--C-ceE--EEEe------cccCCeEEEEEccCCCcEEEEecCCCe
Q 022074           37 GYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEA--N-KLS--LRIL------AHTSDVNTVCFGDESGHLIYSGSDDNL  105 (303)
Q Consensus        37 ~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~--~-~~~--~~~~------~h~~~v~~l~~~~~~~~~l~s~s~dg~  105 (303)
                      .++.++-.++|++.++.++++-...-..||.++.  + ...  ....      .....+..++++|..+++++-++.+..
T Consensus       115 ~~N~G~EGla~D~~~~~L~v~kE~~P~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~S~l~~~p~t~~lliLS~es~~  194 (248)
T PF06977_consen  115 KGNKGFEGLAYDPKTNRLFVAKERKPKRLYEVNGFPGGFDLFVSDDQDLDDDKLFVRDLSGLSYDPRTGHLLILSDESRL  194 (248)
T ss_dssp             --SS--EEEEEETTTTEEEEEEESSSEEEEEEESTT-SS--EEEE-HHHH-HT--SS---EEEEETTTTEEEEEETTTTE
T ss_pred             CCCcceEEEEEcCCCCEEEEEeCCCChhhEEEccccCccceeeccccccccccceeccccceEEcCCCCeEEEEECCCCe
Confidence            4667899999999988888887777777887754  2 111  1111      123467899998888899999999999


Q ss_pred             EEEEcCccccCCCccceeec---c-c-----ccCeEEEEeCCCCCEEEEE
Q 022074          106 CKVWDRRCLNVKGKPAGVLM---G-H-----LEGITFIDSRGDGRYLISN  146 (303)
Q Consensus       106 v~lWd~~~~~~~~~~~~~~~---~-h-----~~~v~~~~~~~~~~~l~s~  146 (303)
                      +...|..     +++...+.   + |     -..--.+++.++|++.+++
T Consensus       195 l~~~d~~-----G~~~~~~~L~~g~~gl~~~~~QpEGIa~d~~G~LYIvs  239 (248)
T PF06977_consen  195 LLELDRQ-----GRVVSSLSLDRGFHGLSKDIPQPEGIAFDPDGNLYIVS  239 (248)
T ss_dssp             EEEE-TT-------EEEEEE-STTGGG-SS---SEEEEEE-TT--EEEEE
T ss_pred             EEEECCC-----CCEEEEEEeCCcccCcccccCCccEEEECCCCCEEEEc
Confidence            9998853     23222221   1 1     0134567788888655554


No 384
>KOG1897 consensus Damage-specific DNA binding complex, subunit DDB1 [Replication, recombination and repair]
Probab=86.73  E-value=39  Score=34.35  Aligned_cols=113  Identities=16%  Similarity=0.209  Sum_probs=73.6

Q ss_pred             cceEEEEEcCC-CCEEEEee----------CCCeEEEEECCCCceEEEEecc--cCCeEEEEEccCCCcEEEEecCCCeE
Q 022074           40 FGIFSLKFSTD-GRELVAGS----------SDDCIYVYDLEANKLSLRILAH--TSDVNTVCFGDESGHLIYSGSDDNLC  106 (303)
Q Consensus        40 ~~v~~l~~s~~-g~~l~sgs----------~Dg~v~lwd~~~~~~~~~~~~h--~~~v~~l~~~~~~~~~l~s~s~dg~v  106 (303)
                      ..+.++.|..| +.++++|.          ..|.|.||....++....+..+  .+.|.++..  -+++++  ++-+.+|
T Consensus       775 ~Si~s~~~~~d~~t~~vVGT~~v~Pde~ep~~GRIivfe~~e~~~L~~v~e~~v~Gav~aL~~--fngkll--A~In~~v  850 (1096)
T KOG1897|consen  775 LSIISCKFTDDPNTYYVVGTGLVYPDENEPVNGRIIVFEFEELNSLELVAETVVKGAVYALVE--FNGKLL--AGINQSV  850 (1096)
T ss_pred             eeeeeeeecCCCceEEEEEEEeeccCCCCcccceEEEEEEecCCceeeeeeeeeccceeehhh--hCCeEE--EecCcEE
Confidence            36777778877 66788875          2456777776663222222222  244555543  235565  4445699


Q ss_pred             EEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccc
Q 022074          107 KVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKM  160 (303)
Q Consensus       107 ~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~  160 (303)
                      ++|+..    ..+..+....|...+..+...-.|..++.|...+++.+-..+.+
T Consensus       851 rLye~t----~~~eLr~e~~~~~~~~aL~l~v~gdeI~VgDlm~Sitll~y~~~  900 (1096)
T KOG1897|consen  851 RLYEWT----TERELRIECNISNPIIALDLQVKGDEIAVGDLMRSITLLQYKGD  900 (1096)
T ss_pred             EEEEcc----ccceehhhhcccCCeEEEEEEecCcEEEEeeccceEEEEEEecc
Confidence            999975    22344455667788888888888889999999999888766543


No 385
>PF14583 Pectate_lyase22:  Oligogalacturonate lyase; PDB: 3C5M_C 3PE7_A.
Probab=85.81  E-value=27  Score=31.70  Aligned_cols=63  Identities=21%  Similarity=0.300  Sum_probs=33.1

Q ss_pred             eCCCeEEEEEeCC----------------CeEEEEECCCCeEEEEeecCC---------CC--eEEEEECCCCCe-EEEE
Q 022074          219 STGQKYIYTGSHD----------------SCVYVYDLVSGEQVAALKYHT---------SP--VRDCSWHPSQPM-LVSS  270 (303)
Q Consensus       219 s~~~~~latg~~d----------------g~i~iwd~~~~~~~~~~~~h~---------~~--I~~v~~sp~~~~-las~  270 (303)
                      ++|+++++.-|.|                -.|++++.+.++.. .+..|.         ..  =--..||||+++ |.++
T Consensus       291 s~Dg~L~vGDG~d~p~~v~~~~~~~~~~~p~i~~~~~~~~~~~-~l~~h~~sw~v~~~~~q~~hPhp~FSPDgk~VlF~S  369 (386)
T PF14583_consen  291 SPDGKLFVGDGGDAPVDVADAGGYKIENDPWIYLFDVEAGRFR-KLARHDTSWKVLDGDRQVTHPHPSFSPDGKWVLFRS  369 (386)
T ss_dssp             -TTSSEEEEEE-------------------EEEEEETTTTEEE-EEEE-------BTTBSSTT----EE-TTSSEEEEEE
T ss_pred             cCCCCEEEecCCCCCccccccccceecCCcEEEEeccccCcee-eeeeccCcceeecCCCccCCCCCccCCCCCEEEEEC
Confidence            4677766654443                26778888877642 122221         11  135689999985 7788


Q ss_pred             eCCCCEEEeecC
Q 022074          271 SWDGDVVRWEFP  282 (303)
Q Consensus       271 s~Dg~i~~Wd~~  282 (303)
                      ...|...++=++
T Consensus       370 d~~G~~~vY~v~  381 (386)
T PF14583_consen  370 DMEGPPAVYLVE  381 (386)
T ss_dssp             -TTSS-EEEEEE
T ss_pred             CCCCCccEEEEe
Confidence            888888777543


No 386
>PF14727 PHTB1_N:  PTHB1 N-terminus
Probab=84.98  E-value=32  Score=31.77  Aligned_cols=61  Identities=13%  Similarity=0.106  Sum_probs=44.7

Q ss_pred             CeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEECCCCCeEEEEeCCCCEEEeecCC
Q 022074          222 QKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFPG  283 (303)
Q Consensus       222 ~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~~  283 (303)
                      ..++..-+.||.+.+++-+.--....+.. ---..-+.|.+....+++++.+..+.-+..+.
T Consensus       145 ~~~IcVQS~DG~L~~feqe~~~f~~~lp~-~llPgPl~Y~~~tDsfvt~sss~~l~~Yky~~  205 (418)
T PF14727_consen  145 RDFICVQSMDGSLSFFEQESFAFSRFLPD-FLLPGPLCYCPRTDSFVTASSSWTLECYKYQD  205 (418)
T ss_pred             ceEEEEEecCceEEEEeCCcEEEEEEcCC-CCCCcCeEEeecCCEEEEecCceeEEEecHHH
Confidence            45778889999999999776443334433 22234577888889999999999999888643


No 387
>PHA02713 hypothetical protein; Provisional
Probab=84.55  E-value=17  Score=34.93  Aligned_cols=23  Identities=9%  Similarity=0.088  Sum_probs=18.1

Q ss_pred             CCeEEEEEeCCC--eEEEEECCCCe
Q 022074          221 GQKYIYTGSHDS--CVYVYDLVSGE  243 (303)
Q Consensus       221 ~~~~latg~~dg--~i~iwd~~~~~  243 (303)
                      ++++.++||.|+  .+..||+.+.+
T Consensus       512 ~~~iyv~Gg~~~~~~~e~yd~~~~~  536 (557)
T PHA02713        512 DNTIMMLHCYESYMLQDTFNVYTYE  536 (557)
T ss_pred             CCEEEEEeeecceeehhhcCccccc
Confidence            578888999888  77888887654


No 388
>PF14655 RAB3GAP2_N:  Rab3 GTPase-activating protein regulatory subunit N-terminus
Probab=84.48  E-value=12  Score=34.49  Aligned_cols=39  Identities=15%  Similarity=0.026  Sum_probs=32.3

Q ss_pred             ceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEec
Q 022074           41 GIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILA   79 (303)
Q Consensus        41 ~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~   79 (303)
                      .+.+|..+|+++++++...=|.|.|+|+.++.....+++
T Consensus       309 ~~~~i~~sP~~~laA~tDslGRV~LiD~~~~~vvrmWKG  347 (415)
T PF14655_consen  309 EGESICLSPSGRLAAVTDSLGRVLLIDVARGIVVRMWKG  347 (415)
T ss_pred             eEEEEEECCCCCEEEEEcCCCcEEEEECCCChhhhhhcc
Confidence            588899999999999888888999999999876544444


No 389
>PF02897 Peptidase_S9_N:  Prolyl oligopeptidase, N-terminal beta-propeller domain;  InterPro: IPR004106 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. 
They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Notwithstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This entry represents the beta-propeller domain found at the N-terminal of prolyl oligopeptidase, including acylamino-acid-releasing enzyme (also known as acylaminoacyl peptidase), which belong to the MEROPS peptidase family S9 (clan SC), subfamily S9A. The prolyl oligopeptidase family consists of a number of evolutionarily related peptidases whose catalytic activity seems to be provided by a charge relay system similar to that of the trypsin family of serine proteases, but which evolved by independent convergent evolution. The N-terminal domain of prolyl oligopeptidases forms an unusual 7-bladed beta-propeller consisting of seven 4-stranded beta-sheet motifs.  
Prolyl oligopeptidase is a large cytosolic enzyme involved in the maturation and degradation of peptide hormones and neuropeptides, which relate to the induction of amnesia. The enzyme contains a peptidase domain, where its catalytic triad (Ser554, His680, Asp641) is covered by the central tunnel of the N-terminal beta-propeller domain. In this way, large structured peptides are excluded from the active site, thereby protecting larger peptides and proteins from proteolysis in the cytosol []. The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. Mammalian acylaminoacyl peptidase is an exopeptidase that is a member of the same prolyl oligopeptidase family of serine peptidases. This enzyme removes acylated amino acid residues from the N terminus of oligopeptides [].; GO: 0004252 serine-type endopeptidase activity, 0006508 proteolysis; PDB: 2BKL_B 3DDU_A 1YR2_A 2XE4_A 1VZ3_A 3EQ9_A 1O6F_A 3EQ7_A 4AN0_A 1UOP_A ....
Probab=84.23  E-value=5.9  Score=36.30  Aligned_cols=67  Identities=18%  Similarity=0.260  Sum_probs=43.4

Q ss_pred             eeCCCeEEEEE-eCC----CeEEEEECCCCeEEEE-eecCCCCeEEEEECCCCCeEEEEeCCC-----------CEEEee
Q 022074          218 YSTGQKYIYTG-SHD----SCVYVYDLVSGEQVAA-LKYHTSPVRDCSWHPSQPMLVSSSWDG-----------DVVRWE  280 (303)
Q Consensus       218 ~s~~~~~latg-~~d----g~i~iwd~~~~~~~~~-~~~h~~~I~~v~~sp~~~~las~s~Dg-----------~i~~Wd  280 (303)
                      +||+++++|-+ +..    -.+++.|+.+|+.+.. +...  ....+.|.+|+..|+-...|.           .+++|+
T Consensus       131 ~Spdg~~la~~~s~~G~e~~~l~v~Dl~tg~~l~d~i~~~--~~~~~~W~~d~~~~~y~~~~~~~~~~~~~~~~~v~~~~  208 (414)
T PF02897_consen  131 VSPDGKRLAYSLSDGGSEWYTLRVFDLETGKFLPDGIENP--KFSSVSWSDDGKGFFYTRFDEDQRTSDSGYPRQVYRHK  208 (414)
T ss_dssp             ETTTSSEEEEEEEETTSSEEEEEEEETTTTEEEEEEEEEE--ESEEEEECTTSSEEEEEECSTTTSS-CCGCCEEEEEEE
T ss_pred             ECCCCCEEEEEecCCCCceEEEEEEECCCCcCcCCccccc--ccceEEEeCCCCEEEEEEeCcccccccCCCCcEEEEEE
Confidence            67888887754 333    4499999999987643 2221  123499999988766555443           267777


Q ss_pred             cCCCCc
Q 022074          281 FPGNGE  286 (303)
Q Consensus       281 ~~~~~~  286 (303)
                      +....+
T Consensus       209 ~gt~~~  214 (414)
T PF02897_consen  209 LGTPQS  214 (414)
T ss_dssp             TTS-GG
T ss_pred             CCCChH
Confidence            765533


No 390
>TIGR02276 beta_rpt_yvtn 40-residue YVTN family beta-propeller repeat. This repeat of about 40 amino acids is found in up to 14 copies per protein. Archaea Methanosarcina mazei and Methanosarcina acetivorans each have over 10 genes that encode tandem copies of this repeat, which is also found in other species. PSIPRED predicts with high confidence that each 40-residue repeat contains four beta strands. This model overlaps somewhat with the NHL repeat (Pfam pfam01436) and also shows sequence similarity to the WD domain, G-beta repeat (Pfam pfam00400).
Probab=83.64  E-value=6.3  Score=22.61  Aligned_cols=30  Identities=27%  Similarity=0.368  Sum_probs=22.2

Q ss_pred             CCCCEEEEee-CCCeEEEEECCCCceEEEEe
Q 022074           49 TDGRELVAGS-SDDCIYVYDLEANKLSLRIL   78 (303)
Q Consensus        49 ~~g~~l~sgs-~Dg~v~lwd~~~~~~~~~~~   78 (303)
                      |++++|+++. .+++|.++|..+++....+.
T Consensus         1 pd~~~lyv~~~~~~~v~~id~~~~~~~~~i~   31 (42)
T TIGR02276         1 PDGTKLYVTNSGSNTVSVIDTATNKVIATIP   31 (42)
T ss_pred             CCCCEEEEEeCCCCEEEEEECCCCeEEEEEE
Confidence            5778777765 47889999998887665544


No 391
>PRK13616 lipoprotein LpqB; Provisional
Probab=83.44  E-value=14  Score=35.74  Aligned_cols=110  Identities=10%  Similarity=0.045  Sum_probs=56.8

Q ss_pred             eEEEEEcCCCCEEEEeeCC------------CeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEE
Q 022074           42 IFSLKFSTDGRELVAGSSD------------DCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVW  109 (303)
Q Consensus        42 v~~l~~s~~g~~l~sgs~D------------g~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lW  109 (303)
                      ...=+|+|||+.+.+.+..            +.+.+.+++.+....   ...+.|..+.|+++ +..++-.. ++.|.+=
T Consensus       399 ~t~PsWspDG~~lw~v~dg~~~~~v~~~~~~gql~~~~vd~ge~~~---~~~g~Issl~wSpD-G~RiA~i~-~g~v~Va  473 (591)
T PRK13616        399 LTRPSWSLDADAVWVVVDGNTVVRVIRDPATGQLARTPVDASAVAS---RVPGPISELQLSRD-GVRAAMII-GGKVYLA  473 (591)
T ss_pred             CCCceECCCCCceEEEecCcceEEEeccCCCceEEEEeccCchhhh---ccCCCcCeEEECCC-CCEEEEEE-CCEEEEE
Confidence            6677899998877776532            223333443332211   33467999999865 66555443 4666552


Q ss_pred             ---cCccccCCC-ccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEc
Q 022074          110 ---DRRCLNVKG-KPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDI  157 (303)
Q Consensus       110 ---d~~~~~~~~-~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl  157 (303)
                         ......... .+.....+-.+.+..++|..++.+ +.+..+....+|.+
T Consensus       474 ~Vvr~~~G~~~l~~~~~l~~~l~~~~~~l~W~~~~~L-~V~~~~~~~~v~~v  524 (591)
T PRK13616        474 VVEQTEDGQYALTNPREVGPGLGDTAVSLDWRTGDSL-VVGRSDPEHPVWYV  524 (591)
T ss_pred             EEEeCCCCceeecccEEeecccCCccccceEecCCEE-EEEecCCCCceEEE
Confidence               111110000 011111122233567888888874 45555555556654


No 392
>KOG2280 consensus Vacuolar assembly/sorting protein VPS16 [Intracellular trafficking, secretion, and vesicular transport]
Probab=83.37  E-value=49  Score=32.64  Aligned_cols=32  Identities=6%  Similarity=0.198  Sum_probs=27.4

Q ss_pred             eeCCCeEEEEEeCCCeEEEEECCCCeEEEEee
Q 022074          218 YSTGQKYIYTGSHDSCVYVYDLVSGEQVAALK  249 (303)
Q Consensus       218 ~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~  249 (303)
                      .||+.++|+--.++|.|.+-+.+..+++.+++
T Consensus       224 VS~n~~~laLyt~~G~i~~vs~D~~~~lce~~  255 (829)
T KOG2280|consen  224 VSPNRRFLALYTETGKIWVVSIDLSQILCEFN  255 (829)
T ss_pred             EcCCcceEEEEecCCcEEEEecchhhhhhccC
Confidence            57788999999999999999988877777765


No 393
>KOG2377 consensus Uncharacterized conserved protein [Function unknown]
Probab=83.20  E-value=8.7  Score=35.45  Aligned_cols=113  Identities=11%  Similarity=0.151  Sum_probs=70.2

Q ss_pred             ccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEE-Eec---ccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccc
Q 022074           39 SFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLR-ILA---HTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCL  114 (303)
Q Consensus        39 ~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~-~~~---h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~  114 (303)
                      +++|.+|.||+|.+.+|+--.|.+|.+++...++.... ...   .+..+-..+|.+.  .-++-.+.. -+-+|-..  
T Consensus        66 ~G~I~SIkFSlDnkilAVQR~~~~v~f~nf~~d~~~l~~~~~ck~k~~~IlGF~W~~s--~e~A~i~~~-G~e~y~v~--  140 (657)
T KOG2377|consen   66 KGEIKSIKFSLDNKILAVQRTSKTVDFCNFIPDNSQLEYTQECKTKNANILGFCWTSS--TEIAFITDQ-GIEFYQVL--  140 (657)
T ss_pred             CCceeEEEeccCcceEEEEecCceEEEEecCCCchhhHHHHHhccCcceeEEEEEecC--eeEEEEecC-CeEEEEEc--
Confidence            34899999999999999999999999998744432211 111   2234777788643  334434433 34555422  


Q ss_pred             cCCCccceeecccccCeEEEEeCCCCCEEEEE-e-CCCcEEEEEc
Q 022074          115 NVKGKPAGVLMGHLEGITFIDSRGDGRYLISN-G-KDQAIKLWDI  157 (303)
Q Consensus       115 ~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~-~-~D~~v~lWdl  157 (303)
                       ...+..+....|.-+|.+..+.++.+.++-+ + ..+++.=+.+
T Consensus       141 -pekrslRlVks~~~nvnWy~yc~et~v~LL~t~~~~n~lnpf~~  184 (657)
T KOG2377|consen  141 -PEKRSLRLVKSHNLNVNWYMYCPETAVILLSTTVLENVLNPFHF  184 (657)
T ss_pred             -hhhhhhhhhhhcccCccEEEEccccceEeeeccccccccccEEE
Confidence             2334455666788899998888887765433 3 3444443443


No 394
>PF07569 Hira:  TUP1-like enhancer of split;  InterPro: IPR011494 The Hira proteins are found in a range of eukaryotes and are implicated in the assembly of repressive chromatin. These proteins also contain IPR001680 from INTERPRO.; GO: 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=83.12  E-value=10  Score=31.61  Aligned_cols=65  Identities=20%  Similarity=0.287  Sum_probs=42.3

Q ss_pred             EEEcCCCCEEEEeeCCCeEEEEECCCCceEEEE------e--------cccCCeEEEEEccCCCcEEEEecCCCeEEEEc
Q 022074           45 LKFSTDGRELVAGSSDDCIYVYDLEANKLSLRI------L--------AHTSDVNTVCFGDESGHLIYSGSDDNLCKVWD  110 (303)
Q Consensus        45 l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~------~--------~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd  110 (303)
                      +.+..+++++++-+.+|.+++||+.+++....-      .        .....|..+..+ ++|.-+++-+ +|..+.|+
T Consensus        16 ~~l~~~~~~Ll~iT~~G~l~vWnl~~~k~~~~~~Si~pll~~~~~~~~~~~~~i~~~~lt-~~G~PiV~ls-ng~~y~y~   93 (219)
T PF07569_consen   16 SFLECNGSYLLAITSSGLLYVWNLKKGKAVLPPVSIAPLLNSSPVSDKSSSPNITSCSLT-SNGVPIVTLS-NGDSYSYS   93 (219)
T ss_pred             EEEEeCCCEEEEEeCCCeEEEEECCCCeeccCCccHHHHhcccccccCCCCCcEEEEEEc-CCCCEEEEEe-CCCEEEec
Confidence            335567899999999999999999998754211      1        233456666664 4455454443 46667776


Q ss_pred             C
Q 022074          111 R  111 (303)
Q Consensus       111 ~  111 (303)
                      .
T Consensus        94 ~   94 (219)
T PF07569_consen   94 P   94 (219)
T ss_pred             c
Confidence            4


No 395
>PF08728 CRT10:  CRT10;  InterPro: IPR014839 CRT10 is a transcriptional regulator of ribonucleotide reductase (RNR) genes []. RNR catalyses the rate limiting step in dNTP synthesis. Mutations in CRT10 have been shown to enhance hydroxyurea resistance []. 
Probab=82.86  E-value=29  Score=34.26  Aligned_cols=68  Identities=13%  Similarity=0.234  Sum_probs=46.7

Q ss_pred             eeeeee--CCCeEEEEEeCCCeEEEEECCCCe-EEEEe--ecCCCCeEEEEECCCC---C---eEEEEeCCCCEEEeec
Q 022074          214 FSPVYS--TGQKYIYTGSHDSCVYVYDLVSGE-QVAAL--KYHTSPVRDCSWHPSQ---P---MLVSSSWDGDVVRWEF  281 (303)
Q Consensus       214 ~~~~~s--~~~~~latg~~dg~i~iwd~~~~~-~~~~~--~~h~~~I~~v~~sp~~---~---~las~s~Dg~i~~Wd~  281 (303)
                      |..+++  ...+++|.++....|.||-....+ .....  ..|...|-+|+|-++.   .   .|++++-.|++.+|++
T Consensus       167 WGLdIh~~~~~rlIAVSsNs~~VTVFaf~l~~~r~~~~~s~~~~hNIP~VSFl~~~~d~~G~v~v~a~dI~G~v~~~~I  245 (717)
T PF08728_consen  167 WGLDIHDYKKSRLIAVSSNSQEVTVFAFALVDERFYHVPSHQHSHNIPNVSFLDDDLDPNGHVKVVATDISGEVWTFKI  245 (717)
T ss_pred             eEEEEEecCcceEEEEecCCceEEEEEEeccccccccccccccccCCCeeEeecCCCCCccceEEEEEeccCcEEEEEE
Confidence            444444  556788888888889988654321 11111  1244568899997754   2   7999999999999998


No 396
>KOG4499 consensus Ca2+-binding protein Regucalcin/SMP30 [Inorganic ion transport and metabolism; Signal transduction mechanisms]
Probab=82.63  E-value=10  Score=31.83  Aligned_cols=54  Identities=24%  Similarity=0.443  Sum_probs=42.0

Q ss_pred             EEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEE
Q 022074           45 LKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIY   98 (303)
Q Consensus        45 l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~   98 (303)
                      ++...+|+..++.-+.++|..+|..+|+...++.-.+..++++||.-++-+.|.
T Consensus       217 m~ID~eG~L~Va~~ng~~V~~~dp~tGK~L~eiklPt~qitsccFgGkn~d~~y  270 (310)
T KOG4499|consen  217 MTIDTEGNLYVATFNGGTVQKVDPTTGKILLEIKLPTPQITSCCFGGKNLDILY  270 (310)
T ss_pred             ceEccCCcEEEEEecCcEEEEECCCCCcEEEEEEcCCCceEEEEecCCCccEEE
Confidence            334557877777778889999999999998888877889999999655444444


No 397
>KOG1900 consensus Nuclear pore complex, Nup155 component (D Nup154, sc Nup157/Nup170) [Nuclear structure; Intracellular trafficking, secretion, and vesicular transport]
Probab=82.49  E-value=31  Score=36.09  Aligned_cols=42  Identities=26%  Similarity=0.412  Sum_probs=34.3

Q ss_pred             ecCCCCeEEEEECCCCCeEEEEeCCCCEEEeecCCCCccCCC
Q 022074          249 KYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFPGNGEAAPP  290 (303)
Q Consensus       249 ~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~~~~~~~~~  290 (303)
                      ..+.+||..++.....+.|-+-++.|++..|++.++.+...+
T Consensus       239 ~~~~dpI~qi~ID~SR~IlY~lsek~~v~~Y~i~~~G~~~~r  280 (1311)
T KOG1900|consen  239 GSSKDPIRQITIDNSRNILYVLSEKGTVSAYDIGGNGLGGPR  280 (1311)
T ss_pred             CCCCCcceeeEeccccceeeeeccCceEEEEEccCCCcccee
Confidence            356789999999888889999999999999999776444443


No 398
>KOG1916 consensus Nuclear protein, contains WD40 repeats [General function prediction only]
Probab=82.41  E-value=0.56  Score=46.24  Aligned_cols=63  Identities=16%  Similarity=0.241  Sum_probs=46.6

Q ss_pred             CCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEE-----------ECCCCCeEEEEeCCCCEEEeecCC
Q 022074          220 TGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCS-----------WHPSQPMLVSSSWDGDVVRWEFPG  283 (303)
Q Consensus       220 ~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~-----------~sp~~~~las~s~Dg~i~~Wd~~~  283 (303)
                      ++..++..+-.++.|++..+..... ..+..|..++.+++           .||||..+|+++.||.++.|.+..
T Consensus       193 ~~~~~ic~~~~~~~i~lL~~~ra~~-~l~rsHs~~~~d~a~~~~g~~~l~~lSpDGtv~a~a~~dG~v~f~Qiyi  266 (1283)
T KOG1916|consen  193 VNKVYICYGLKGGEIRLLNINRALR-SLFRSHSQRVTDMAFFAEGVLKLASLSPDGTVFAWAISDGSVGFYQIYI  266 (1283)
T ss_pred             cccceeeeccCCCceeEeeechHHH-HHHHhcCCCcccHHHHhhchhhheeeCCCCcEEEEeecCCccceeeeee
Confidence            3446677777888888887754332 33455877666665           599999999999999999998653


No 399
>PF07676 PD40:  WD40-like Beta Propeller Repeat;  InterPro: IPR011659 WD-40 repeats (also known as WD or beta-transducin repeats) are short ~40 amino acid motifs, often terminating in a Trp-Asp (W-D) dipeptide. WD40 repeats usually assume a 7-8 bladed beta-propeller fold, but proteins have been found with 4 to 16 repeated units, which also form a circularised beta-propeller structure. WD-repeat proteins are a large family found in all eukaryotes and are implicated in a variety of functions ranging from signal transduction and transcription regulation to cell cycle control and apoptosis. Repeated WD40 motifs act as a site for protein-protein interaction, and proteins containing WD40 repeats are known to serve as platforms for the assembly of protein complexes or mediators of transient interplay among other proteins. The specificity of the proteins is determined by the sequences outside the repeats themselves. Examples of such complexes are G proteins (beta subunit is a beta-propeller), TAFII transcription factor, and E3 ubiquitin ligase [, ]. In Arabidopsis spp., several WD40-containing proteins act as key regulators of plant-specific developmental events. This region appears to be related to the IPR001680 from INTERPRO repeat. This model is likely to miss copies within a sequence.; PDB: 2HQS_D 1C5K_A 2IVZ_A 2W8B_D 3IAX_A 1CRZ_A 1N6F_D 1N6D_C 1N6E_C 1K32_A ....
Probab=81.42  E-value=7.7  Score=22.04  Aligned_cols=29  Identities=14%  Similarity=0.316  Sum_probs=18.7

Q ss_pred             CCCCeEEEEECCCCCeEE-EEeCC--CCEEEe
Q 022074          251 HTSPVRDCSWHPSQPMLV-SSSWD--GDVVRW  279 (303)
Q Consensus       251 h~~~I~~v~~sp~~~~la-s~s~D--g~i~~W  279 (303)
                      ....-....|||||+.|+ ++..+  |.-.+|
T Consensus         7 ~~~~~~~p~~SpDGk~i~f~s~~~~~g~~diy   38 (39)
T PF07676_consen    7 SPGDDGSPAWSPDGKYIYFTSNRNDRGSFDIY   38 (39)
T ss_dssp             SSSSEEEEEE-TTSSEEEEEEECT--SSEEEE
T ss_pred             CCccccCEEEecCCCEEEEEecCCCCCCcCEE
Confidence            445677889999998655 44455  565555


No 400
>PF14761 HPS3_N:  Hermansky-Pudlak syndrome 3
Probab=81.09  E-value=30  Score=28.65  Aligned_cols=48  Identities=23%  Similarity=0.323  Sum_probs=31.4

Q ss_pred             CEEEEeeCCCeEEEEECCCC--ceEEEEecccCCeEEEEEccCCCcEEEEec
Q 022074           52 RELVAGSSDDCIYVYDLEAN--KLSLRILAHTSDVNTVCFGDESGHLIYSGS  101 (303)
Q Consensus        52 ~~l~sgs~Dg~v~lwd~~~~--~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s  101 (303)
                      +.|..+.....|.+|++.+.  +...++ ..-+.|..+.++ ..|++++|--
T Consensus        29 d~Lfva~~g~~Vev~~l~~~~~~~~~~F-~Tv~~V~~l~y~-~~GDYlvTlE   78 (215)
T PF14761_consen   29 DALFVAASGCKVEVYDLEQEECPLLCTF-STVGRVLQLVYS-EAGDYLVTLE   78 (215)
T ss_pred             ceEEEEcCCCEEEEEEcccCCCceeEEE-cchhheeEEEec-cccceEEEEE
Confidence            44544455667999999832  233333 334788899996 4588988874


No 401
>PF10168 Nup88:  Nuclear pore component;  InterPro: IPR019321  Nup88 can be divided into two structural domains; the N-terminal two-thirds of the protein have no obvious structural motifs. It is, however, where it binds to Nup98; one of the components of the nuclear pore. The C-terminal end is a predicted coiled-coil domain []. Nup88 is over expressed in tumour cells []. 
Probab=80.03  E-value=18  Score=35.87  Aligned_cols=75  Identities=16%  Similarity=0.239  Sum_probs=53.5

Q ss_pred             CcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCC----------c--eEEE-E--------ecccCCeEEEEEccC--C
Q 022074           37 GYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEAN----------K--LSLR-I--------LAHTSDVNTVCFGDE--S   93 (303)
Q Consensus        37 ~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~----------~--~~~~-~--------~~h~~~v~~l~~~~~--~   93 (303)
                      .-.+.|..|..|++|+.++..|..| |.|..+...          +  ..++ +        ..+...|..+.|+|.  +
T Consensus        82 ~~~f~v~~i~~n~~g~~lal~G~~~-v~V~~LP~r~g~~~~~~~g~~~i~Crt~~v~~~~~~~~~~~~i~qv~WhP~s~~  160 (717)
T PF10168_consen   82 PPLFEVHQISLNPTGSLLALVGPRG-VVVLELPRRWGKNGEFEDGKKEINCRTVPVDERFFTSNSSLEIKQVRWHPWSES  160 (717)
T ss_pred             CCceeEEEEEECCCCCEEEEEcCCc-EEEEEeccccCccccccCCCcceeEEEEEechhhccCCCCceEEEEEEcCCCCC
Confidence            4557899999999999999998887 666666431          1  1111 1        123346888999875  3


Q ss_pred             CcEEEEecCCCeEEEEcCc
Q 022074           94 GHLIYSGSDDNLCKVWDRR  112 (303)
Q Consensus        94 ~~~l~s~s~dg~v~lWd~~  112 (303)
                      +..|+.=..|+++|+||+.
T Consensus       161 ~~~l~vLtsdn~lR~y~~~  179 (717)
T PF10168_consen  161 DSHLVVLTSDNTLRLYDIS  179 (717)
T ss_pred             CCeEEEEecCCEEEEEecC
Confidence            5677777889999999985


No 402
>PHA03098 kelch-like protein; Provisional
Probab=79.58  E-value=58  Score=30.98  Aligned_cols=60  Identities=12%  Similarity=0.256  Sum_probs=31.8

Q ss_pred             CCCCEEEEeeCCC------eEEEEECCCCceEEEEecc---cCCeEEEEEccCCCcEEEEecCC-----CeEEEEcCc
Q 022074           49 TDGRELVAGSSDD------CIYVYDLEANKLSLRILAH---TSDVNTVCFGDESGHLIYSGSDD-----NLCKVWDRR  112 (303)
Q Consensus        49 ~~g~~l~sgs~Dg------~v~lwd~~~~~~~~~~~~h---~~~v~~l~~~~~~~~~l~s~s~d-----g~v~lWd~~  112 (303)
                      .++..++.||.++      .+..||..+.+.. .+..-   -.....+..   ++++++.|+.+     ..+..||..
T Consensus       293 ~~~~lyv~GG~~~~~~~~~~v~~yd~~~~~W~-~~~~~~~~R~~~~~~~~---~~~lyv~GG~~~~~~~~~v~~yd~~  366 (534)
T PHA03098        293 LNNVIYFIGGMNKNNLSVNSVVSYDTKTKSWN-KVPELIYPRKNPGVTVF---NNRIYVIGGIYNSISLNTVESWKPG  366 (534)
T ss_pred             ECCEEEEECCCcCCCCeeccEEEEeCCCCeee-ECCCCCcccccceEEEE---CCEEEEEeCCCCCEecceEEEEcCC
Confidence            3456677776543      4778888877643 22211   111122222   35677777765     245567754


No 403
>PF08728 CRT10:  CRT10;  InterPro: IPR014839 CRT10 is a transcriptional regulator of ribonucleotide reductase (RNR) genes []. RNR catalyses the rate limiting step in dNTP synthesis. Mutations in CRT10 have been shown to enhance hydroxyurea resistance []. 
Probab=79.28  E-value=49  Score=32.73  Aligned_cols=121  Identities=12%  Similarity=0.029  Sum_probs=71.4

Q ss_pred             CCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCC-------C----ce-E-------EEEecccCCeEEEEEc-cCCCc
Q 022074           36 GGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEA-------N----KL-S-------LRILAHTSDVNTVCFG-DESGH   95 (303)
Q Consensus        36 ~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~-------~----~~-~-------~~~~~h~~~v~~l~~~-~~~~~   95 (303)
                      ..|.-....+..-.+.+.|+++..||.|.+|.+++       .    .. .       .-......-+..++++ .+..+
T Consensus        99 ~PHtIN~i~v~~lg~~EVLl~c~DdG~V~~Yyt~~I~~~i~~~~~~~~~~~~r~~i~P~f~~~v~~SaWGLdIh~~~~~r  178 (717)
T PF08728_consen   99 FPHTINFIKVGDLGGEEVLLLCTDDGDVLAYYTETIIEAIERFSEDNDSGFSRLKIKPFFHLRVGASAWGLDIHDYKKSR  178 (717)
T ss_pred             CCceeeEEEecccCCeeEEEEEecCCeEEEEEHHHHHHHHHhhccccccccccccCCCCeEeecCCceeEEEEEecCcce
Confidence            45665555555556778899999999999996521       0    00 0       0011122345666764 13467


Q ss_pred             EEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCC-----CC-EEEEEeCCCcEEEEEc
Q 022074           96 LIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGD-----GR-YLISNGKDQAIKLWDI  157 (303)
Q Consensus        96 ~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~-----~~-~l~s~~~D~~v~lWdl  157 (303)
                      ++|.++....|.||-..... .......-..|...|-+++|-++     |. .+++++=.|.+-+|++
T Consensus       179 lIAVSsNs~~VTVFaf~l~~-~r~~~~~s~~~~hNIP~VSFl~~~~d~~G~v~v~a~dI~G~v~~~~I  245 (717)
T PF08728_consen  179 LIAVSSNSQEVTVFAFALVD-ERFYHVPSHQHSHNIPNVSFLDDDLDPNGHVKVVATDISGEVWTFKI  245 (717)
T ss_pred             EEEEecCCceEEEEEEeccc-cccccccccccccCCCeeEeecCCCCCccceEEEEEeccCcEEEEEE
Confidence            88888888889888643211 11111111124555666665432     32 6778888999999887


No 404
>KOG4460 consensus Nuclear pore complex, Nup88/rNup84 component [Nuclear structure; Intracellular trafficking, secretion, and vesicular transport]
Probab=78.96  E-value=33  Score=32.43  Aligned_cols=32  Identities=16%  Similarity=0.286  Sum_probs=26.2

Q ss_pred             CCcccceEEEEEcCCCCEEEEeeCCCeEEEEEC
Q 022074           36 GGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDL   68 (303)
Q Consensus        36 ~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~   68 (303)
                      ..-.+.|..+..++.|..++-.|.+|.+ |..+
T Consensus       100 ~~V~feV~~vl~s~~GS~VaL~G~~Gi~-vMeL  131 (741)
T KOG4460|consen  100 NPVLFEVYQVLLSPTGSHVALIGIKGLM-VMEL  131 (741)
T ss_pred             CcceEEEEEEEecCCCceEEEecCCeeE-EEEc
Confidence            3566889999999999999999999954 4445


No 405
>KOG1983 consensus Tomosyn and related SNARE-interacting proteins [Intracellular trafficking, secretion, and vesicular transport]
Probab=78.08  E-value=69  Score=33.27  Aligned_cols=33  Identities=21%  Similarity=0.325  Sum_probs=27.9

Q ss_pred             CCcccceEEEEEcCCCCEEEEeeCCCeEEEEEC
Q 022074           36 GGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDL   68 (303)
Q Consensus        36 ~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~   68 (303)
                      .|+...-..++|+|....++.|+.+|.|.++-.
T Consensus        32 ~G~~~~~~~~afD~~q~llai~t~tg~i~~yg~   64 (993)
T KOG1983|consen   32 HGFPSTPSALAFDPTQGLLAIGTRTGAIKIYGQ   64 (993)
T ss_pred             cCCCCCCcceeeccccceEEEEEecccEEEecc
Confidence            355557788999999999999999999999944


No 406
>PF14583 Pectate_lyase22:  Oligogalacturonate lyase; PDB: 3C5M_C 3PE7_A.
Probab=77.40  E-value=57  Score=29.70  Aligned_cols=92  Identities=13%  Similarity=0.127  Sum_probs=39.7

Q ss_pred             ECCCCceEEEEecccCCeEEEEE-----ccCCCcEEEEecCCCeEEEEcCccccCCCccceeecccc-cCeEEEEeCCCC
Q 022074           67 DLEANKLSLRILAHTSDVNTVCF-----GDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHL-EGITFIDSRGDG  140 (303)
Q Consensus        67 d~~~~~~~~~~~~h~~~v~~l~~-----~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~-~~v~~~~~~~~~  140 (303)
                      |..||..+.++.........+.|     ..+..++|+++..||.-.+|-+...  ++ .+..+.... +.......++++
T Consensus        16 D~~TG~~VtrLT~~~~~~h~~YF~~~~ft~dG~kllF~s~~dg~~nly~lDL~--t~-~i~QLTdg~g~~~~g~~~s~~~   92 (386)
T PF14583_consen   16 DPDTGHRVTRLTPPDGHSHRLYFYQNCFTDDGRKLLFASDFDGNRNLYLLDLA--TG-EITQLTDGPGDNTFGGFLSPDD   92 (386)
T ss_dssp             -TTT--EEEE-S-TTS-EE---TTS--B-TTS-EEEEEE-TTSS-EEEEEETT--T--EEEE---SS-B-TTT-EE-TTS
T ss_pred             CCCCCceEEEecCCCCcccceeecCCCcCCCCCEEEEEeccCCCcceEEEEcc--cC-EEEECccCCCCCccceEEecCC
Confidence            77788777777666555555544     3344467777777776666643211  11 222232221 222123345667


Q ss_pred             CEEEEEeCCCcEEEEEccccc
Q 022074          141 RYLISNGKDQAIKLWDIRKMS  161 (303)
Q Consensus       141 ~~l~s~~~D~~v~lWdl~~~~  161 (303)
                      +.++-.-.++.|+--|++.++
T Consensus        93 ~~~~Yv~~~~~l~~vdL~T~e  113 (386)
T PF14583_consen   93 RALYYVKNGRSLRRVDLDTLE  113 (386)
T ss_dssp             SEEEEEETTTEEEEEETTT--
T ss_pred             CeEEEEECCCeEEEEECCcCc
Confidence            776555555778888887654


No 407
>PF12657 TFIIIC_delta:  Transcription factor IIIC subunit delta N-term;  InterPro: IPR024761  This entry represents a domain found towards the N terminus of the 90 kDa subunit of transcription factor IIIC (also known as subunit 9 in yeast []). The whole subunit is involved in RNA polymerase III-mediated transcription. It is possible that this N-terminal domain interacts with TFIIIC subunit 8 [].
Probab=77.14  E-value=27  Score=27.76  Aligned_cols=29  Identities=10%  Similarity=0.053  Sum_probs=23.6

Q ss_pred             CeEEEEeCCCC------CEEEEEeCCCcEEEEEcc
Q 022074          130 GITFIDSRGDG------RYLISNGKDQAIKLWDIR  158 (303)
Q Consensus       130 ~v~~~~~~~~~------~~l~s~~~D~~v~lWdl~  158 (303)
                      .+..++|+|.|      .+|+....++.|.||.-.
T Consensus        87 ~vv~~aWSP~Gl~~~~rClLavLTs~~~l~l~~~~  121 (173)
T PF12657_consen   87 QVVSAAWSPSGLGPNGRCLLAVLTSNGRLSLYGPP  121 (173)
T ss_pred             cEEEEEECCCCCCCCCceEEEEEcCCCeEEEEecC
Confidence            68888898854      678899999999999743


No 408
>KOG3630 consensus Nuclear pore complex, Nup214/CAN component [Nuclear structure; Intracellular trafficking, secretion, and vesicular transport]
Probab=75.69  E-value=9.4  Score=39.12  Aligned_cols=115  Identities=13%  Similarity=0.033  Sum_probs=74.2

Q ss_pred             cceEEEEEcCCCCEEEEe--eCCCeEEEEECCCCceEE-----EEecc------cCCeEEEEEccCCCcEEEEecCCCeE
Q 022074           40 FGIFSLKFSTDGRELVAG--SSDDCIYVYDLEANKLSL-----RILAH------TSDVNTVCFGDESGHLIYSGSDDNLC  106 (303)
Q Consensus        40 ~~v~~l~~s~~g~~l~sg--s~Dg~v~lwd~~~~~~~~-----~~~~h------~~~v~~l~~~~~~~~~l~s~s~dg~v  106 (303)
                      .++.-+..++|+...++.  +.+..|..||+++-....     .+..|      -..+.++.|+|......+.+..|+.|
T Consensus       101 ~pi~~~v~~~D~t~s~v~~tsng~~v~~fD~~~fs~s~~~~~~pl~~s~ts~ek~vf~~~~~wnP~vp~n~av~l~dlsl  180 (1405)
T KOG3630|consen  101 IPIVIFVCFHDATDSVVVSTSNGEAVYSFDLEEFSESRYETTVPLKNSATSFEKPVFQLKNVWNPLVPLNSAVDLSDLSL  180 (1405)
T ss_pred             ccceEEEeccCCceEEEEEecCCceEEEEehHhhhhhhhhhccccccccchhccccccccccccCCccchhhhhccccch
Confidence            456677777787765543  344478999997643211     11112      23456778887666667778889999


Q ss_pred             EEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcc
Q 022074          107 KVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIR  158 (303)
Q Consensus       107 ~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~  158 (303)
                      ++.-+....   .....+ --...++++.|++.|.+++.|-..|++.=|...
T Consensus       181 ~V~~~~~~~---~~v~s~-p~t~~~Tav~WSprGKQl~iG~nnGt~vQy~P~  228 (1405)
T KOG3630|consen  181 RVKSTKQLA---QNVTSF-PVTNSQTAVLWSPRGKQLFIGRNNGTEVQYEPS  228 (1405)
T ss_pred             hhhhhhhhh---hhhccc-CcccceeeEEeccccceeeEecCCCeEEEeecc
Confidence            887654211   111111 123568899999999999999999999888643


No 409
>PF14655 RAB3GAP2_N:  Rab3 GTPase-activating protein regulatory subunit N-terminus
Probab=75.54  E-value=52  Score=30.37  Aligned_cols=31  Identities=16%  Similarity=0.026  Sum_probs=26.1

Q ss_pred             CeEEEEeCCCCCEEEEEeCCCcEEEEEcccc
Q 022074          130 GITFIDSRGDGRYLISNGKDQAIKLWDIRKM  160 (303)
Q Consensus       130 ~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~  160 (303)
                      .+..+..+|.+++.++...-|.|.|+|+...
T Consensus       309 ~~~~i~~sP~~~laA~tDslGRV~LiD~~~~  339 (415)
T PF14655_consen  309 EGESICLSPSGRLAAVTDSLGRVLLIDVARG  339 (415)
T ss_pred             eEEEEEECCCCCEEEEEcCCCcEEEEECCCC
Confidence            4667888999999888888899999998754


No 410
>PF05694 SBP56:  56kDa selenium binding protein (SBP56);  InterPro: IPR008826 This family consists of several eukaryotic selenium binding proteins as well as three sequences from archaea. The exact function of this protein is unknown although it is thought that SBP56 participates in late stages of intra-Golgi protein transport []. The Lotus japonicus homologue of SBP56, LjSBP is thought to have more than one physiological role and can be implicated in controlling the oxidation/reduction status of target proteins in vesicular Golgi transport [].; GO: 0008430 selenium binding; PDB: 2ECE_A.
Probab=71.60  E-value=41  Score=31.12  Aligned_cols=118  Identities=14%  Similarity=0.069  Sum_probs=59.2

Q ss_pred             EEEEEcCCCCEEEEeeC--------------------CCeEEEEECCCCceEEEEeccc--CCeEEEEEcc--CCCcEEE
Q 022074           43 FSLKFSTDGRELVAGSS--------------------DDCIYVYDLEANKLSLRILAHT--SDVNTVCFGD--ESGHLIY   98 (303)
Q Consensus        43 ~~l~~s~~g~~l~sgs~--------------------Dg~v~lwd~~~~~~~~~~~~h~--~~v~~l~~~~--~~~~~l~   98 (303)
                      +..-|.|.-+.++|..+                    -.++.+||+.+.+....+.--.  ...-.+.|.+  ....-|+
T Consensus       184 YDfw~qpr~nvMiSSeWg~P~~~~~Gf~~~d~~~~~yG~~l~vWD~~~r~~~Q~idLg~~g~~pLEvRflH~P~~~~gFv  263 (461)
T PF05694_consen  184 YDFWYQPRHNVMISSEWGAPSMFEKGFNPEDLEAGKYGHSLHVWDWSTRKLLQTIDLGEEGQMPLEVRFLHDPDANYGFV  263 (461)
T ss_dssp             --EEEETTTTEEEE-B---HHHHTT---TTTHHHH-S--EEEEEETTTTEEEEEEES-TTEEEEEEEEE-SSTT--EEEE
T ss_pred             CCeEEcCCCCEEEEeccCChhhcccCCChhHhhcccccCeEEEEECCCCcEeeEEecCCCCCceEEEEecCCCCccceEE
Confidence            44556676667776653                    3479999999998876655332  2345667733  3445566


Q ss_pred             EecCCCeEEEEcC-ccccCC-Cccceee----ccc------------ccCeEEEEeCCCCCEEEEEe-CCCcEEEEEccc
Q 022074           99 SGSDDNLCKVWDR-RCLNVK-GKPAGVL----MGH------------LEGITFIDSRGDGRYLISNG-KDQAIKLWDIRK  159 (303)
Q Consensus        99 s~s~dg~v~lWd~-~~~~~~-~~~~~~~----~~h------------~~~v~~~~~~~~~~~l~s~~-~D~~v~lWdl~~  159 (303)
                      .+...++|.+|=. +...-. .+.+...    .+.            ..-++.+.++.|+++|..+. .+|.+|-||+..
T Consensus       264 g~aLss~i~~~~k~~~g~W~a~kVi~ip~~~v~~~~lp~ml~~~~~~P~LitDI~iSlDDrfLYvs~W~~GdvrqYDISD  343 (461)
T PF05694_consen  264 GCALSSSIWRFYKDDDGEWAAEKVIDIPAKKVEGWILPEMLKPFGAVPPLITDILISLDDRFLYVSNWLHGDVRQYDISD  343 (461)
T ss_dssp             EEE--EEEEEEEE-ETTEEEEEEEEEE--EE--SS---GGGGGG-EE------EEE-TTS-EEEEEETTTTEEEEEE-SS
T ss_pred             EEeccceEEEEEEcCCCCeeeeEEEECCCcccCcccccccccccccCCCceEeEEEccCCCEEEEEcccCCcEEEEecCC
Confidence            6677777776632 100000 0011110    000            13367888999999987666 589999999875


Q ss_pred             c
Q 022074          160 M  160 (303)
Q Consensus       160 ~  160 (303)
                      .
T Consensus       344 P  344 (461)
T PF05694_consen  344 P  344 (461)
T ss_dssp             T
T ss_pred             C
Confidence            3


No 411
>PF10647 Gmad1:  Lipoprotein LpqB beta-propeller domain;  InterPro: IPR018910  The Gmad1 domain is found associated with IPR019606 from INTERPRO, in bacterial spore formation. It is predicted to have a beta-propeller fold and to have a passive binding role rather than a catalytic function owing to the low number of conserved hydrophilic residues. 
Probab=70.68  E-value=66  Score=27.36  Aligned_cols=114  Identities=18%  Similarity=0.130  Sum_probs=62.3

Q ss_pred             ceEEEEEcCCCCEEEEee-CCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEc-CccccCCC
Q 022074           41 GIFSLKFSTDGRELVAGS-SDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWD-RRCLNVKG  118 (303)
Q Consensus        41 ~v~~l~~s~~g~~l~sgs-~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd-~~~~~~~~  118 (303)
                      .+.+.+++++|+.++.-. .++.-.||-...+....... ....+..-.|.+. +.+.+....+...+++. ....  ..
T Consensus        25 ~~~s~AvS~dg~~~A~v~~~~~~~~L~~~~~~~~~~~~~-~g~~l~~PS~d~~-g~~W~v~~~~~~~~~~~~~~~g--~~  100 (253)
T PF10647_consen   25 DVTSPAVSPDGSRVAAVSEGDGGRSLYVGPAGGPVRPVL-TGGSLTRPSWDPD-GWVWTVDDGSGGVRVVRDSASG--TG  100 (253)
T ss_pred             cccceEECCCCCeEEEEEEcCCCCEEEEEcCCCcceeec-cCCccccccccCC-CCEEEEEcCCCceEEEEecCCC--cc
Confidence            688999999999776655 22223344333333332222 2235666678654 66666666666666663 1111  11


Q ss_pred             ccceeeccccc-CeEEEEeCCCCCEEEEEe---CCCcEEEEEcc
Q 022074          119 KPAGVLMGHLE-GITFIDSRGDGRYLISNG---KDQAIKLWDIR  158 (303)
Q Consensus       119 ~~~~~~~~h~~-~v~~~~~~~~~~~l~s~~---~D~~v~lWdl~  158 (303)
                      .+...-..... .|..+.+++||..++-..   .++.|.+=-+.
T Consensus       101 ~~~~v~~~~~~~~I~~l~vSpDG~RvA~v~~~~~~~~v~va~V~  144 (253)
T PF10647_consen  101 EPVEVDWPGLRGRITALRVSPDGTRVAVVVEDGGGGRVYVAGVV  144 (253)
T ss_pred             eeEEecccccCCceEEEEECCCCcEEEEEEecCCCCeEEEEEEE
Confidence            11111111112 799999999998876544   34555555443


No 412
>COG3386 Gluconolactonase [Carbohydrate transport and metabolism]
Probab=69.58  E-value=80  Score=27.86  Aligned_cols=50  Identities=8%  Similarity=0.116  Sum_probs=34.9

Q ss_pred             CCeEEEEEeCCC-eEEEEECCCCeEEEEeecCCCCeEEEEECC-CCC-eEEEEe
Q 022074          221 GQKYIYTGSHDS-CVYVYDLVSGEQVAALKYHTSPVRDCSWHP-SQP-MLVSSS  271 (303)
Q Consensus       221 ~~~~latg~~dg-~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp-~~~-~las~s  271 (303)
                      +|.+.+++..+| .|.+|+.+ |+++..++.+...+++++|=- +.+ +++|+.
T Consensus       223 dG~lw~~a~~~g~~v~~~~pd-G~l~~~i~lP~~~~t~~~FgG~~~~~L~iTs~  275 (307)
T COG3386         223 DGNLWVAAVWGGGRVVRFNPD-GKLLGEIKLPVKRPTNPAFGGPDLNTLYITSA  275 (307)
T ss_pred             CCCEEEecccCCceEEEECCC-CcEEEEEECCCCCCccceEeCCCcCEEEEEec
Confidence            456554544444 89999998 999988888877888999854 444 344443


No 413
>COG5167 VID27 Protein involved in vacuole import and degradation [Intracellular trafficking and secretion]
Probab=69.57  E-value=85  Score=29.77  Aligned_cols=118  Identities=19%  Similarity=0.211  Sum_probs=65.8

Q ss_pred             cCCCcccceEEEEEcC-CCCEE-EEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCC------CcEEEEecCCCe
Q 022074           34 DDGGYSFGIFSLKFST-DGREL-VAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDES------GHLIYSGSDDNL  105 (303)
Q Consensus        34 ~~~~~~~~v~~l~~s~-~g~~l-~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~------~~~l~s~s~dg~  105 (303)
                      +++|-+.....+-.+. +.+.| ..|+.-..++=.|+..|+.+..+..|...  -+.|+|..      +..-+.|-.+..
T Consensus       461 ~~~GKSidp~K~mlh~~dssli~~dg~~~~kLykmDIErGkvveeW~~~ddv--vVqy~p~~kf~qmt~eqtlvGlS~~s  538 (776)
T COG5167         461 DDGGKSIDPEKIMLHDNDSSLIYLDGGERDKLYKMDIERGKVVEEWDLKDDV--VVQYNPYFKFQQMTDEQTLVGLSDYS  538 (776)
T ss_pred             CCCCCcCChhhceeecCCcceEEecCCCcccceeeecccceeeeEeecCCcc--eeecCCchhHHhcCccceEEeecccc
Confidence            3455555555555554 34443 34555666777799999988888877665  45555421      223334544555


Q ss_pred             EEEEcCccccCCCccceeec--cc--ccCeEEEEeCCCCCEEEEEeCCCcEEEEEc
Q 022074          106 CKVWDRRCLNVKGKPAGVLM--GH--LEGITFIDSRGDGRYLISNGKDQAIKLWDI  157 (303)
Q Consensus       106 v~lWd~~~~~~~~~~~~~~~--~h--~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl  157 (303)
                      |.--|.|...   ..+....  ..  .....+... ....+++.+|.-|-||+||-
T Consensus       539 vFrIDPR~~g---NKi~v~esKdY~tKn~Fss~~t-TesGyIa~as~kGDirLyDR  590 (776)
T COG5167         539 VFRIDPRARG---NKIKVVESKDYKTKNKFSSGMT-TESGYIAAASRKGDIRLYDR  590 (776)
T ss_pred             eEEecccccC---Cceeeeeehhcccccccccccc-ccCceEEEecCCCceeeehh
Confidence            5545765322   1121111  11  112223222 23459999999999999983


No 414
>PF11715 Nup160:  Nucleoporin Nup120/160;  InterPro: IPR021717  Nup120 is conserved from fungi to plants to humans, and is homologous with the Nup160 of vertebrates. The nuclear pore complex, or NPC, mediates macromolecular transport across the nuclear envelope. Deletion of the NUP120 gene causes clustering of NPCs at one side of the nuclear envelope, moderate nucleolar fragmentation and slower cell growth []. The vertebrate NPC is estimated to contain between 30 and 60 different proteins, most of which are not known. Two important ones in creating the nucleoporin basket are Nup98 and Nup153, and Nup120, in conjunction with Nup133, interacts with these two and itself plays a role in mRNA export []. Nup160, Nup133, Nup96, and Nup107 are all targets of phosphorylation. The phosphorylation sites are clustered mainly at the N-terminal regions of these proteins, which are predicted to be natively disordered. The entire Nup107-160 subcomplex is stable throughout the cell cycle, thus it seems unlikely that phosphorylation affects interactions within the Nup107-160 subcomplex, but rather that it regulates the association of the subcomplex with the NPC and other proteins []. ; PDB: 3F7F_D 3H7N_D 3HXR_A.
Probab=69.07  E-value=16  Score=34.91  Aligned_cols=64  Identities=13%  Similarity=0.157  Sum_probs=39.0

Q ss_pred             CCeEEEEEeCCCeEEEEECCC----CeEEEE--eecC--------------------CCCeEEEEECC----CCCeEEEE
Q 022074          221 GQKYIYTGSHDSCVYVYDLVS----GEQVAA--LKYH--------------------TSPVRDCSWHP----SQPMLVSS  270 (303)
Q Consensus       221 ~~~~latg~~dg~i~iwd~~~----~~~~~~--~~~h--------------------~~~I~~v~~sp----~~~~las~  270 (303)
                      +...++.+..||.+...+...    +.....  +..+                    .....+++.++    +..+|++.
T Consensus       157 ~~~~l~v~~~dG~ll~l~~~~~~~~~~~~~~~~~~~~~~~~~l~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~tl  236 (547)
T PF11715_consen  157 SEANLVVSLQDGGLLRLKRSSGDSDGSVWSEELFNDSSWLRSLSGLFPWSYRGDNSSSSVAASLAVSSSEINDDTFLFTL  236 (547)
T ss_dssp             SSSBEEEEESSS-EEEEEES----SSS-EE----STHHHHHCCTTTS-TT---SSSS---EEEEEE-----ETTTEEEEE
T ss_pred             CCCEEEEEECCCCeEEEECCcccCCCCeeEEEEeCCCchhhhhhCcCCcccccCCCCCCccceEEEecceeCCCCEEEEE
Confidence            345677778888888877654    221111  1111                    23466777777    77899999


Q ss_pred             eCCCCEEEeecCCC
Q 022074          271 SWDGDVVRWEFPGN  284 (303)
Q Consensus       271 s~Dg~i~~Wd~~~~  284 (303)
                      +.|++||+||+...
T Consensus       237 ~~D~~LRiW~l~t~  250 (547)
T PF11715_consen  237 SRDHTLRIWSLETG  250 (547)
T ss_dssp             ETTSEEEEEETTTT
T ss_pred             eCCCeEEEEECCCC
Confidence            99999999998755


No 415
>smart00564 PQQ beta-propeller repeat. Beta-propeller repeat occurring in enzymes with pyrrolo-quinoline quinone (PQQ) as cofactor, in Ire1p-like Ser/Thr kinases, and in prokaryotic dehydrogenases.
Probab=68.59  E-value=14  Score=19.79  Aligned_cols=23  Identities=35%  Similarity=0.603  Sum_probs=19.3

Q ss_pred             EEEEEeCCCeEEEEECCCCeEEE
Q 022074          224 YIYTGSHDSCVYVYDLVSGEQVA  246 (303)
Q Consensus       224 ~latg~~dg~i~iwd~~~~~~~~  246 (303)
                      .++.++.+|.++.+|.++|+.+-
T Consensus         8 ~v~~~~~~g~l~a~d~~~G~~~W   30 (33)
T smart00564        8 TVYVGSTDGTLYALDAKTGEILW   30 (33)
T ss_pred             EEEEEcCCCEEEEEEcccCcEEE
Confidence            47778899999999999988753


No 416
>smart00036 CNH Domain found in NIK1-like kinases, mouse citron and yeast ROM1, ROM2. Unpublished observations.
Probab=68.50  E-value=82  Score=27.60  Aligned_cols=106  Identities=14%  Similarity=0.198  Sum_probs=57.7

Q ss_pred             CCCEEEEeeCCCeEEEEECCCC-ceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCC------C----
Q 022074           50 DGRELVAGSSDDCIYVYDLEAN-KLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVK------G----  118 (303)
Q Consensus        50 ~g~~l~sgs~Dg~v~lwd~~~~-~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~------~----  118 (303)
                      ++++++.|+.+| +.+.++... ....++ .+...|.++......+-+++-+++...++++++......      .    
T Consensus        12 ~~~~lL~GTe~G-ly~~~~~~~~~~~~kl-~~~~~v~q~~v~~~~~lLi~Lsgk~~~L~~~~L~~L~~~~~~~~~~~~~~   89 (302)
T smart00036       12 DGKWLLVGTEEG-LYVLNISDQPGTLEKL-IGRRSVTQIWVLEENNVLLMISGKKPQLYSHPLSALVEKKEALGSARLVI   89 (302)
T ss_pred             CCcEEEEEeCCc-eEEEEcccCCCCeEEe-cCcCceEEEEEEhhhCEEEEEeCCcceEEEEEHHHhhhhhhccCCccccc
Confidence            346899999999 777776542 222223 344578888876554444444555566999987533210      0    


Q ss_pred             -ccceeecccccCeEEEEeC-CCCCEEEEEeCCCcEEEEEc
Q 022074          119 -KPAGVLMGHLEGITFIDSR-GDGRYLISNGKDQAIKLWDI  157 (303)
Q Consensus       119 -~~~~~~~~h~~~v~~~~~~-~~~~~l~s~~~D~~v~lWdl  157 (303)
                       +....-.+|..+....... .....+++++.-.+|.++..
T Consensus        90 ~~~~~~~~~~tkGc~~~~v~~~~~~~~l~~A~~~~i~l~~~  130 (302)
T smart00036       90 RKNVLTKIPDTKGCHLCAVVNGKRSLFLCVALQSSVVLLQW  130 (302)
T ss_pred             cccceEeCCcCCceEEEEEEcCCCcEEEEEEcCCeEEEEEc
Confidence             0011122344433322222 22334566666777777643


No 417
>PF07995 GSDH:  Glucose / Sorbosone dehydrogenase;  InterPro: IPR012938 Proteins containing this domain are thought to be glucose/sorbosone dehydrogenases. The best characterised of these proteins is soluble glucose dehydrogenase (P13650 from SWISSPROT) from Acinetobacter calcoaceticus, which oxidises glucose to gluconolactone. The enzyme is a calcium-dependent homodimer which uses PQQ as a cofactor [].; GO: 0016901 oxidoreductase activity, acting on the CH-OH group of donors, quinone or similar compound as acceptor, 0048038 quinone binding, 0005975 carbohydrate metabolic process; PDB: 2ISM_A 2WG3_D 3HO5_A 3HO4_A 3HO3_A 2WFT_A 2WG4_B 2WFX_B 1CRU_A 1CQ1_B ....
Probab=68.15  E-value=87  Score=27.84  Aligned_cols=57  Identities=19%  Similarity=0.223  Sum_probs=41.1

Q ss_pred             CCeEEEEEeCCCeEEEEECCCCeEEE---E-eecCCCCeEEEEECCCCCeEEEEeCCCCEE
Q 022074          221 GQKYIYTGSHDSCVYVYDLVSGEQVA---A-LKYHTSPVRDCSWHPSQPMLVSSSWDGDVV  277 (303)
Q Consensus       221 ~~~~latg~~dg~i~iwd~~~~~~~~---~-~~~h~~~I~~v~~sp~~~~las~s~Dg~i~  277 (303)
                      .++++++.-..+.|....++.+..+.   . +.....++.++++.|||.++++.+.+|+|.
T Consensus       270 ~g~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~~~~r~~~v~~~pDG~Lyv~~d~~G~iy  330 (331)
T PF07995_consen  270 RGDLFVADYGGGRIWRLDLDEDGSVTEEEEFLGGFGGRPRDVAQGPDGALYVSDDSDGKIY  330 (331)
T ss_dssp             TTEEEEEETTTTEEEEEEEETTEEEEEEEEECTTSSS-EEEEEEETTSEEEEEE-TTTTEE
T ss_pred             cCcEEEecCCCCEEEEEeeecCCCccceEEccccCCCCceEEEEcCCCeEEEEECCCCeEe
Confidence            56777777777888888887553322   2 223445899999999999999998999885


No 418
>PF11715 Nup160:  Nucleoporin Nup120/160;  InterPro: IPR021717  Nup120 is conserved from fungi to plants to humans, and is homologous with the Nup160 of vertebrates. The nuclear pore complex, or NPC, mediates macromolecular transport across the nuclear envelope. Deletion of the NUP120 gene causes clustering of NPCs at one side of the nuclear envelope, moderate nucleolar fragmentation and slower cell growth []. The vertebrate NPC is estimated to contain between 30 and 60 different proteins, most of which are not known. Two important ones in creating the nucleoporin basket are Nup98 and Nup153, and Nup120, in conjunction with Nup133, interacts with these two and itself plays a role in mRNA export []. Nup160, Nup133, Nup96, and Nup107 are all targets of phosphorylation. The phosphorylation sites are clustered mainly at the N-terminal regions of these proteins, which are predicted to be natively disordered. The entire Nup107-160 subcomplex is stable throughout the cell cycle, thus it seems unlikely that phosphorylation affects interactions within the Nup107-160 subcomplex, but rather that it regulates the association of the subcomplex with the NPC and other proteins []. ; PDB: 3F7F_D 3H7N_D 3HXR_A.
Probab=67.62  E-value=15  Score=35.11  Aligned_cols=35  Identities=26%  Similarity=0.238  Sum_probs=26.4

Q ss_pred             ceEEEEEcC----CCCEEEEeeCCCeEEEEECCCCceEE
Q 022074           41 GIFSLKFST----DGRELVAGSSDDCIYVYDLEANKLSL   75 (303)
Q Consensus        41 ~v~~l~~s~----~g~~l~sgs~Dg~v~lwd~~~~~~~~   75 (303)
                      ...+++.+.    +..++++-+.|+++|+||+.+++...
T Consensus       216 ~~~~~~~~~~~~~~~~~l~tl~~D~~LRiW~l~t~~~~~  254 (547)
T PF11715_consen  216 VAASLAVSSSEINDDTFLFTLSRDHTLRIWSLETGQCLA  254 (547)
T ss_dssp             -EEEEEE-----ETTTEEEEEETTSEEEEEETTTTCEEE
T ss_pred             ccceEEEecceeCCCCEEEEEeCCCeEEEEECCCCeEEE
Confidence            345555555    67789999999999999999998743


No 419
>COG3386 Gluconolactonase [Carbohydrate transport and metabolism]
Probab=67.42  E-value=89  Score=27.58  Aligned_cols=118  Identities=11%  Similarity=0.035  Sum_probs=62.5

Q ss_pred             ceEEEEEcCCCCEEEEeeC------C---CeEEEEECC-CCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEc
Q 022074           41 GIFSLKFSTDGRELVAGSS------D---DCIYVYDLE-ANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWD  110 (303)
Q Consensus        41 ~v~~l~~s~~g~~l~sgs~------D---g~v~lwd~~-~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd  110 (303)
                      ..+.+...|+|.+-++--.      +   ..-+||.+. .+.....+..+-..-+.++|+|+...+.++=+..+.+.-|+
T Consensus       112 r~ND~~v~pdG~~wfgt~~~~~~~~~~~~~~G~lyr~~p~g~~~~l~~~~~~~~NGla~SpDg~tly~aDT~~~~i~r~~  191 (307)
T COG3386         112 RPNDGVVDPDGRIWFGDMGYFDLGKSEERPTGSLYRVDPDGGVVRLLDDDLTIPNGLAFSPDGKTLYVADTPANRIHRYD  191 (307)
T ss_pred             CCCceeEcCCCCEEEeCCCccccCccccCCcceEEEEcCCCCEEEeecCcEEecCceEECCCCCEEEEEeCCCCeEEEEe
Confidence            4556777888875554322      0   011345444 45555555555556689999876444444445567787776


Q ss_pred             Ccc--ccCCCccceeec-ccccCeEEEEeCCCCCEEEEEeCCC-cEEEEEcc
Q 022074          111 RRC--LNVKGKPAGVLM-GHLEGITFIDSRGDGRYLISNGKDQ-AIKLWDIR  158 (303)
Q Consensus       111 ~~~--~~~~~~~~~~~~-~h~~~v~~~~~~~~~~~l~s~~~D~-~v~lWdl~  158 (303)
                      +..  .....+...... .....--.++...+|++.+++..++ .|..|+..
T Consensus       192 ~d~~~g~~~~~~~~~~~~~~~G~PDG~~vDadG~lw~~a~~~g~~v~~~~pd  243 (307)
T COG3386         192 LDPATGPIGGRRGFVDFDEEPGLPDGMAVDADGNLWVAAVWGGGRVVRFNPD  243 (307)
T ss_pred             cCcccCccCCcceEEEccCCCCCCCceEEeCCCCEEEecccCCceEEEECCC
Confidence            542  111111111111 1122333455667787775444443 78888765


No 420
>cd00216 PQQ_DH Dehydrogenases with pyrrolo-quinoline quinone (PQQ) as cofactor, like ethanol, methanol, and membrane bound glucose dehydrogenases. The alignment model contains an 8-bladed beta-propeller.
Probab=67.35  E-value=1.1e+02  Score=28.78  Aligned_cols=62  Identities=18%  Similarity=0.220  Sum_probs=38.8

Q ss_pred             CCEEEEeeCCCeEEEEECCCCceEEEEecccC------Ce-E-EEEEccCCCcEEEEecCCCeEEEEcCccc
Q 022074           51 GRELVAGSSDDCIYVYDLEANKLSLRILAHTS------DV-N-TVCFGDESGHLIYSGSDDNLCKVWDRRCL  114 (303)
Q Consensus        51 g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~------~v-~-~l~~~~~~~~~l~s~s~dg~v~lWd~~~~  114 (303)
                      +..++.++.++.++-+|.++|+..-+......      .+ . .+..  ..+..++.++.++.|+-+|.+..
T Consensus        61 ~g~vy~~~~~g~l~AlD~~tG~~~W~~~~~~~~~~~~~~~~~~g~~~--~~~~~V~v~~~~g~v~AlD~~TG  130 (488)
T cd00216          61 DGDMYFTTSHSALFALDAATGKVLWRYDPKLPADRGCCDVVNRGVAY--WDPRKVFFGTFDGRLVALDAETG  130 (488)
T ss_pred             CCEEEEeCCCCcEEEEECCCChhhceeCCCCCccccccccccCCcEE--ccCCeEEEecCCCeEEEEECCCC
Confidence            55677888899999999999976544322211      00 0 0111  11246677888999998887543


No 421
>KOG2377 consensus Uncharacterized conserved protein [Function unknown]
Probab=66.02  E-value=1.2e+02  Score=28.45  Aligned_cols=158  Identities=13%  Similarity=0.174  Sum_probs=84.0

Q ss_pred             eEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCC
Q 022074           84 VNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSN  163 (303)
Q Consensus        84 v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~  163 (303)
                      .+.+-| .+.++++ -+-..|.+.=|..+..+ ...++..-..-.+++.++.|++|...++.--.|.+|.+++.......
T Consensus        25 sngvFf-DDaNkql-favrSggatgvvvkgpn-dDVpiSfdm~d~G~I~SIkFSlDnkilAVQR~~~~v~f~nf~~d~~~  101 (657)
T KOG2377|consen   25 SNGVFF-DDANKQL-FAVRSGGATGVVVKGPN-DDVPISFDMDDKGEIKSIKFSLDNKILAVQRTSKTVDFCNFIPDNSQ  101 (657)
T ss_pred             ccceee-ccCcceE-EEEecCCeeEEEEeCCC-CCCCceeeecCCCceeEEEeccCcceEEEEecCceEEEEecCCCchh
Confidence            345555 3333343 34445567777765433 22333333334568999999999999999999999999976322111


Q ss_pred             cccccCccceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCe
Q 022074          164 ASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGE  243 (303)
Q Consensus       164 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~  243 (303)
                      ...                       .+.|..+-         ..+..+.|.     +...+|.-...| +.+|-....+
T Consensus       102 l~~-----------------------~~~ck~k~---------~~IlGF~W~-----~s~e~A~i~~~G-~e~y~v~pek  143 (657)
T KOG2377|consen  102 LEY-----------------------TQECKTKN---------ANILGFCWT-----SSTEIAFITDQG-IEFYQVLPEK  143 (657)
T ss_pred             hHH-----------------------HHHhccCc---------ceeEEEEEe-----cCeeEEEEecCC-eEEEEEchhh
Confidence            000                       00011000         011122221     123344444333 4455443322


Q ss_pred             -EEEEeecCCCCeEEEEECCCCCe--EEEEeCCCCEEEeecC
Q 022074          244 -QVAALKYHTSPVRDCSWHPSQPM--LVSSSWDGDVVRWEFP  282 (303)
Q Consensus       244 -~~~~~~~h~~~I~~v~~sp~~~~--las~s~Dg~i~~Wd~~  282 (303)
                       .+...+.|+..|+-..|.|+.+.  |+|+-..+++.-+.+.
T Consensus       144 rslRlVks~~~nvnWy~yc~et~v~LL~t~~~~n~lnpf~~~  185 (657)
T KOG2377|consen  144 RSLRLVKSHNLNVNWYMYCPETAVILLSTTVLENVLNPFHFR  185 (657)
T ss_pred             hhhhhhhhcccCccEEEEccccceEeeeccccccccccEEEe
Confidence             24455778888999999999884  3444355555555443


No 422
>KOG3630 consensus Nuclear pore complex, Nup214/CAN component [Nuclear structure; Intracellular trafficking, secretion, and vesicular transport]
Probab=65.04  E-value=5.8  Score=40.53  Aligned_cols=59  Identities=17%  Similarity=0.064  Sum_probs=43.6

Q ss_pred             EEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEECCCCCeEEEEeCCCCEEEeecCC
Q 022074          225 IYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFPG  283 (303)
Q Consensus       225 latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~~  283 (303)
                      .+....|+.|++..+........--.-....++++|||.|.+++.|-.+|++.=+.+..
T Consensus       171 ~av~l~dlsl~V~~~~~~~~~v~s~p~t~~~Tav~WSprGKQl~iG~nnGt~vQy~P~l  229 (1405)
T KOG3630|consen  171 SAVDLSDLSLRVKSTKQLAQNVTSFPVTNSQTAVLWSPRGKQLFIGRNNGTEVQYEPSL  229 (1405)
T ss_pred             hhhhccccchhhhhhhhhhhhhcccCcccceeeEEeccccceeeEecCCCeEEEeeccc
Confidence            56677888899887754433222112345789999999999999999999999887653


No 423
>KOG1916 consensus Nuclear protein, contains WD40 repeats [General function prediction only]
Probab=64.99  E-value=5  Score=40.01  Aligned_cols=70  Identities=10%  Similarity=0.076  Sum_probs=47.0

Q ss_pred             ceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEE----------ccCCCcEEEEecCCCeEEEEc
Q 022074           41 GIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCF----------GDESGHLIYSGSDDNLCKVWD  110 (303)
Q Consensus        41 ~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~----------~~~~~~~l~s~s~dg~v~lWd  110 (303)
                      -|.-+-|-++..++..+-.+++++|....+... ..+.+|...+..++|          ..++|+.|+.+..||.|+.|-
T Consensus       185 ~V~wcp~~~~~~~ic~~~~~~~i~lL~~~ra~~-~l~rsHs~~~~d~a~~~~g~~~l~~lSpDGtv~a~a~~dG~v~f~Q  263 (1283)
T KOG1916|consen  185 LVSWCPIAVNKVYICYGLKGGEIRLLNINRALR-SLFRSHSQRVTDMAFFAEGVLKLASLSPDGTVFAWAISDGSVGFYQ  263 (1283)
T ss_pred             eeeecccccccceeeeccCCCceeEeeechHHH-HHHHhcCCCcccHHHHhhchhhheeeCCCCcEEEEeecCCccceee
Confidence            344444445777788888899999987766532 234557665555433          134688999999999998885


Q ss_pred             C
Q 022074          111 R  111 (303)
Q Consensus       111 ~  111 (303)
                      +
T Consensus       264 i  264 (1283)
T KOG1916|consen  264 I  264 (1283)
T ss_pred             e
Confidence            4


No 424
>PF03088 Str_synth:  Strictosidine synthase;  InterPro: IPR018119 This entry represents a conserved region found in strictosidine synthase (4.3.3.2 from EC), a key enzyme in alkaloid biosynthesis. It catalyses the Pictet-Spengler stereospecific condensation of tryptamine with secologanin to form strictosidine []. The structure of the native enzyme from the Indian medicinal plant Rauvolfia serpentina (Serpentwood) (Devilpepper) represents the first example of a six-bladed four-stranded beta-propeller fold from the plant kingdom [].; GO: 0016844 strictosidine synthase activity, 0009058 biosynthetic process; PDB: 2FPB_A 2V91_B 2FP8_A 3V1S_B 2FPC_A 2VAQ_A 2FP9_B.
Probab=64.85  E-value=21  Score=25.13  Aligned_cols=42  Identities=14%  Similarity=0.186  Sum_probs=26.9

Q ss_pred             EeCCCeEEEEECCCCeEEEEeecCCCCeEEEEECCCCCeEEEE
Q 022074          228 GSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQPMLVSS  270 (303)
Q Consensus       228 g~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~las~  270 (303)
                      +..+|.+.-||+.+++..-.+.+ -.-.+.|+.|||+.+|+.+
T Consensus        33 ~~~~GRll~ydp~t~~~~vl~~~-L~fpNGVals~d~~~vlv~   74 (89)
T PF03088_consen   33 GRPTGRLLRYDPSTKETTVLLDG-LYFPNGVALSPDESFVLVA   74 (89)
T ss_dssp             T---EEEEEEETTTTEEEEEEEE-ESSEEEEEE-TTSSEEEEE
T ss_pred             CCCCcCEEEEECCCCeEEEehhC-CCccCeEEEcCCCCEEEEE
Confidence            44577899999999874323333 2367999999999965544


No 425
>PF07995 GSDH:  Glucose / Sorbosone dehydrogenase;  InterPro: IPR012938 Proteins containing this domain are thought to be glucose/sorbosone dehydrogenases. The best characterised of these proteins is soluble glucose dehydrogenase (P13650 from SWISSPROT) from Acinetobacter calcoaceticus, which oxidises glucose to gluconolactone. The enzyme is a calcium-dependent homodimer which uses PQQ as a cofactor [].; GO: 0016901 oxidoreductase activity, acting on the CH-OH group of donors, quinone or similar compound as acceptor, 0048038 quinone binding, 0005975 carbohydrate metabolic process; PDB: 2ISM_A 2WG3_D 3HO5_A 3HO4_A 3HO3_A 2WFT_A 2WG4_B 2WFX_B 1CRU_A 1CQ1_B ....
Probab=64.56  E-value=37  Score=30.24  Aligned_cols=48  Identities=25%  Similarity=0.329  Sum_probs=28.8

Q ss_pred             eEEEEEcCCCCEEEEeeCCCeEEEEECCCCce---EEEE----ecccCCeEEEEEcc
Q 022074           42 IFSLKFSTDGRELVAGSSDDCIYVYDLEANKL---SLRI----LAHTSDVNTVCFGD   91 (303)
Q Consensus        42 v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~---~~~~----~~h~~~v~~l~~~~   91 (303)
                      -.+|+|.|||+.++ +...|.|++++ ..+..   ...+    .........++++|
T Consensus         4 P~~~a~~pdG~l~v-~e~~G~i~~~~-~~g~~~~~v~~~~~v~~~~~~gllgia~~p   58 (331)
T PF07995_consen    4 PRSMAFLPDGRLLV-AERSGRIWVVD-KDGSLKTPVADLPEVFADGERGLLGIAFHP   58 (331)
T ss_dssp             EEEEEEETTSCEEE-EETTTEEEEEE-TTTEECEEEEE-TTTBTSTTBSEEEEEE-T
T ss_pred             ceEEEEeCCCcEEE-EeCCceEEEEe-CCCcCcceecccccccccccCCcccceecc
Confidence            36788999986555 45688898888 34433   1111    11234667888866


No 426
>PF05096 Glu_cyclase_2:  Glutamine cyclotransferase;  InterPro: IPR007788 This family of enzymes (2.3.2.5 from EC) catalyse the cyclization of free L-glutamine and N-terminal glutaminyl residues in proteins to pyroglutamate (5-oxoproline) and pyroglutamyl residues respectively []. This family includes plant and bacterial enzymes and seems unrelated to the mammalian enzymes.; PDB: 3NOK_B 2FAW_A 2IWA_A 3NOM_A 3NOL_A 3MBR_X.
Probab=64.05  E-value=95  Score=26.72  Aligned_cols=58  Identities=12%  Similarity=0.109  Sum_probs=40.8

Q ss_pred             CeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEECCCCCeEEEEeCCCCEEEeecC
Q 022074          222 QKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFP  282 (303)
Q Consensus       222 ~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~  282 (303)
                      .++..---.++...+||..+.+++.++.. ...-|.++  .|+..|+.++....|+.+|++
T Consensus       100 d~l~qLTWk~~~~f~yd~~tl~~~~~~~y-~~EGWGLt--~dg~~Li~SDGS~~L~~~dP~  157 (264)
T PF05096_consen  100 DKLYQLTWKEGTGFVYDPNTLKKIGTFPY-PGEGWGLT--SDGKRLIMSDGSSRLYFLDPE  157 (264)
T ss_dssp             TEEEEEESSSSEEEEEETTTTEEEEEEE--SSS--EEE--ECSSCEEEE-SSSEEEEE-TT
T ss_pred             CEEEEEEecCCeEEEEccccceEEEEEec-CCcceEEE--cCCCEEEEECCccceEEECCc
Confidence            34444456788999999999999988864 35678888  577778888777888888865


No 427
>PF06433 Me-amine-dh_H:  Methylamine dehydrogenase heavy chain (MADH);  InterPro: IPR009451 Methylamine dehydrogenase (1.4.99.3 from EC) is a periplasmic quinoprotein found in several methylotrophic bacteria []. It is induced when cells are grown on methylamine as a carbon source. MADH catalyses the oxidative deamination of amines to their corresponding aldehydes. The redox cofactor of this enzyme is tryptophan tryptophylquinone (TTQ). Electrons derived from the oxidation of methylamine are passed to an electron acceptor, which is usually the blue-copper protein amicyanin (IPR002386 from INTERPRO).  RCH2NH2 + H2O + acceptor = RCHO + NH3 + reduced acceptor  MADH is a hetero-tetramer, comprised of two heavy subunits and two light subunits. The heavy subunit forms a seven-bladed beta-propeller-like structure [].; GO: 0030058 amine dehydrogenase activity, 0030416 methylamine metabolic process, 0055114 oxidation-reduction process, 0042597 periplasmic space; PDB: 3RN1_F 3SVW_F 3PXT_F 3L4O_F 3L4M_D 3SJL_F 3PXS_D 3ORV_F 3RMZ_F 3RLM_F ....
Probab=63.33  E-value=1.1e+02  Score=27.34  Aligned_cols=137  Identities=15%  Similarity=0.031  Sum_probs=73.0

Q ss_pred             cccccccCcCccc----ccCCCcccc----eEEEEEcCCCCEEEEee--CCCeEEEEECCCCceEEEEecccCCeEEEEE
Q 022074           20 NVTEIHDGLDFSA----ADDGGYSFG----IFSLKFSTDGRELVAGS--SDDCIYVYDLEANKLSLRILAHTSDVNTVCF   89 (303)
Q Consensus        20 ~~~~~~~~~~~~~----~~~~~~~~~----v~~l~~s~~g~~l~sgs--~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~   89 (303)
                      .|.++|+...+++    +.+.+|++-    ....+++.||+++++..  ---+|.|.|+..++...++.-  .++..+.-
T Consensus        67 Dvv~~~D~~TL~~~~EI~iP~k~R~~~~~~~~~~~ls~dgk~~~V~N~TPa~SVtVVDl~~~kvv~ei~~--PGC~~iyP  144 (342)
T PF06433_consen   67 DVVEIWDTQTLSPTGEIEIPPKPRAQVVPYKNMFALSADGKFLYVQNFTPATSVTVVDLAAKKVVGEIDT--PGCWLIYP  144 (342)
T ss_dssp             EEEEEEETTTTEEEEEEEETTS-B--BS--GGGEEE-TTSSEEEEEEESSSEEEEEEETTTTEEEEEEEG--TSEEEEEE
T ss_pred             eEEEEEecCcCcccceEecCCcchheecccccceEEccCCcEEEEEccCCCCeEEEEECCCCceeeeecC--CCEEEEEe
Confidence            5666776665541    122444542    23368889999988864  345799999999987655432  34444433


Q ss_pred             ccCCCcEEEEecCCCeEEEEcCccccCCCc-cceeecccccCe-EEEEeCCCCCEEEEEeCCCcEEEEEcccc
Q 022074           90 GDESGHLIYSGSDDNLCKVWDRRCLNVKGK-PAGVLMGHLEGI-TFIDSRGDGRYLISNGKDQAIKLWDIRKM  160 (303)
Q Consensus        90 ~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~-~~~~~~~h~~~v-~~~~~~~~~~~l~s~~~D~~v~lWdl~~~  160 (303)
                      ..  +..|.+-+.||++--..+........ ....|..-.+.+ ..-.+...+..++=-+.+|.|+--|+...
T Consensus       145 ~~--~~~F~~lC~DGsl~~v~Ld~~Gk~~~~~t~~F~~~~dp~f~~~~~~~~~~~~~F~Sy~G~v~~~dlsg~  215 (342)
T PF06433_consen  145 SG--NRGFSMLCGDGSLLTVTLDADGKEAQKSTKVFDPDDDPLFEHPAYSRDGGRLYFVSYEGNVYSADLSGD  215 (342)
T ss_dssp             EE--TTEEEEEETTSCEEEEEETSTSSEEEEEEEESSTTTS-B-S--EEETTTTEEEEEBTTSEEEEEEETTS
T ss_pred             cC--CCceEEEecCCceEEEEECCCCCEeEeeccccCCCCcccccccceECCCCeEEEEecCCEEEEEeccCC
Confidence            32  35688888899888776642211111 111121111221 11112233444554677788888877653


No 428
>PRK10115 protease 2; Provisional
Probab=63.17  E-value=1.6e+02  Score=29.18  Aligned_cols=115  Identities=10%  Similarity=0.093  Sum_probs=60.4

Q ss_pred             cceEEEEEcCCCCEEEEeeC-CC----eEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCC-----CeEEEE
Q 022074           40 FGIFSLKFSTDGRELVAGSS-DD----CIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDD-----NLCKVW  109 (303)
Q Consensus        40 ~~v~~l~~s~~g~~l~sgs~-Dg----~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~d-----g~v~lW  109 (303)
                      ..+..+.++|||++|+-+.. +|    ++++.|+.++...........  ..++|.++...++.+...+     ..|+++
T Consensus       127 ~~l~~~~~Spdg~~la~~~d~~G~E~~~l~v~d~~tg~~l~~~i~~~~--~~~~w~~D~~~~~y~~~~~~~~~~~~v~~h  204 (686)
T PRK10115        127 YTLGGMAITPDNTIMALAEDFLSRRQYGIRFRNLETGNWYPELLDNVE--PSFVWANDSWTFYYVRKHPVTLLPYQVWRH  204 (686)
T ss_pred             EEEeEEEECCCCCEEEEEecCCCcEEEEEEEEECCCCCCCCccccCcc--eEEEEeeCCCEEEEEEecCCCCCCCEEEEE
Confidence            66778999999998777543 23    588899988863322222111  4588975544444444322     355666


Q ss_pred             cCccccCCCccceeecccccCeE-EEEeCCCCCEEEEEe---CCCcEEEEEcc
Q 022074          110 DRRCLNVKGKPAGVLMGHLEGIT-FIDSRGDGRYLISNG---KDQAIKLWDIR  158 (303)
Q Consensus       110 d~~~~~~~~~~~~~~~~h~~~v~-~~~~~~~~~~l~s~~---~D~~v~lWdl~  158 (303)
                      ++...  ...-...+.+...... ....+.++.+++..+   .++.+.+++..
T Consensus       205 ~lgt~--~~~d~lv~~e~~~~~~~~~~~s~d~~~l~i~~~~~~~~~~~l~~~~  255 (686)
T PRK10115        205 TIGTP--ASQDELVYEEKDDTFYVSLHKTTSKHYVVIHLASATTSEVLLLDAE  255 (686)
T ss_pred             ECCCC--hhHCeEEEeeCCCCEEEEEEEcCCCCEEEEEEECCccccEEEEECc
Confidence            65421  1111223332222222 223344666654333   34567777743


No 429
>TIGR02608 delta_60_rpt delta-60 repeat domain. This domain occurs in tandem repeats, as many as 13, in proteins from Bdellovibrio bacteriovorus, Azotobacter vinelandii, Geobacter sulfurreducens, Pirellula sp. 1, Myxococcus xanthus, and others, many of which are Deltaproteobacteria. The periodicity of the repeat ranges from about 57 to 61 amino acids, and a core region of about 54 is represented by this model and seed alignment.
Probab=61.94  E-value=30  Score=21.82  Aligned_cols=18  Identities=33%  Similarity=0.619  Sum_probs=15.5

Q ss_pred             eEEEEEcCCCCEEEEeeC
Q 022074           42 IFSLKFSTDGRELVAGSS   59 (303)
Q Consensus        42 v~~l~~s~~g~~l~sgs~   59 (303)
                      ++++...|||+++++|..
T Consensus         3 ~~~~~~q~DGkIlv~G~~   20 (55)
T TIGR02608         3 AYAVAVQSDGKILVAGYV   20 (55)
T ss_pred             eEEEEECCCCcEEEEEEe
Confidence            678899999999999864


No 430
>TIGR02604 Piru_Ver_Nterm putative membrane-bound dehydrogenase domain. All proteins that score above the trusted cutoff score of 45 to this model are large proteins of either Pirellula sp. 1 or Verrucomicrobium spinosum. These proteins all contain, in addition to this domain, several hundred residues of highly variable sequence, and then a well-conserved C-terminal domain (TIGR02603) that features a putative cytochrome c-type heme binding motif CXXCH. The membrane-bound L-sorbosone dehydrogenase from Acetobacter liquefaciens (Gluconacetobacter liquefaciens) is homologous to this domain but lacks additional sequence regions shared by members of this family and belongs to a different clade of the larger family of homologs. It and its closely related homologs are excluded from this model by scoring between the trusted (45) and noise (18) cutoffs.
Probab=61.50  E-value=1.2e+02  Score=27.23  Aligned_cols=41  Identities=10%  Similarity=0.038  Sum_probs=26.5

Q ss_pred             CeEEEEECCCCeEEEEeecCCCCeEEEEECCCCCeEEEEeCC
Q 022074          232 SCVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQPMLVSSSWD  273 (303)
Q Consensus       232 g~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~las~s~D  273 (303)
                      |.|.-+|...++. ..+.......+.++|+|+|+++++-..+
T Consensus       164 g~i~r~~pdg~~~-e~~a~G~rnp~Gl~~d~~G~l~~tdn~~  204 (367)
T TIGR02604       164 GGLFRYNPDGGKL-RVVAHGFQNPYGHSVDSWGDVFFCDNDD  204 (367)
T ss_pred             ceEEEEecCCCeE-EEEecCcCCCccceECCCCCEEEEccCC
Confidence            5677777766553 2332222346899999999988775543


No 431
>PF14761 HPS3_N:  Hermansky-Pudlak syndrome 3
Probab=60.84  E-value=43  Score=27.77  Aligned_cols=51  Identities=16%  Similarity=0.197  Sum_probs=37.3

Q ss_pred             eEEEEEeCCCeEEEEECCC--CeEEEEeecCCCCeEEEEECCCCCeEEEEeCCC
Q 022074          223 KYIYTGSHDSCVYVYDLVS--GEQVAALKYHTSPVRDCSWHPSQPMLVSSSWDG  274 (303)
Q Consensus       223 ~~latg~~dg~i~iwd~~~--~~~~~~~~~h~~~I~~v~~sp~~~~las~s~Dg  274 (303)
                      ..|..+.....|.+|++.+  .+.+.+|. .-++|..+.++.-|.+|+|-=++.
T Consensus        29 d~Lfva~~g~~Vev~~l~~~~~~~~~~F~-Tv~~V~~l~y~~~GDYlvTlE~k~   81 (215)
T PF14761_consen   29 DALFVAASGCKVEVYDLEQEECPLLCTFS-TVGRVLQLVYSEAGDYLVTLEEKN   81 (215)
T ss_pred             ceEEEEcCCCEEEEEEcccCCCceeEEEc-chhheeEEEeccccceEEEEEeec
Confidence            3444445667899999873  34566775 348999999999999999875543


No 432
>KOG4659 consensus Uncharacterized conserved protein (Rhs family) [Function unknown]
Probab=60.80  E-value=2.4e+02  Score=30.27  Aligned_cols=23  Identities=17%  Similarity=0.296  Sum_probs=14.7

Q ss_pred             cccCeEEEEeCCCCCEEEEEeCC
Q 022074          127 HLEGITFIDSRGDGRYLISNGKD  149 (303)
Q Consensus       127 h~~~v~~~~~~~~~~~l~s~~~D  149 (303)
                      |-.+..+++.+|+|..++.-..+
T Consensus       660 ~lnsp~alaVsPdg~v~IAD~gN  682 (1899)
T KOG4659|consen  660 KLNSPYALAVSPDGDVIIADSGN  682 (1899)
T ss_pred             ccCCcceEEECCCCcEEEecCCc
Confidence            44566677778887766554433


No 433
>COG3204 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=60.45  E-value=1.2e+02  Score=26.62  Aligned_cols=122  Identities=16%  Similarity=0.139  Sum_probs=77.1

Q ss_pred             CCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEec-ccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccc
Q 022074           36 GGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILA-HTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCL  114 (303)
Q Consensus        36 ~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~-h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~  114 (303)
                      .|-+..+.++.|+|+.+.|.+-.+...-.|+=...|++..++.- --..-..+.|. .++.+.++--.+.++.++-+...
T Consensus        82 ~g~~~nvS~LTynp~~rtLFav~n~p~~iVElt~~GdlirtiPL~g~~DpE~Ieyi-g~n~fvi~dER~~~l~~~~vd~~  160 (316)
T COG3204          82 LGETANVSSLTYNPDTRTLFAVTNKPAAIVELTKEGDLIRTIPLTGFSDPETIEYI-GGNQFVIVDERDRALYLFTVDAD  160 (316)
T ss_pred             ccccccccceeeCCCcceEEEecCCCceEEEEecCCceEEEecccccCChhHeEEe-cCCEEEEEehhcceEEEEEEcCC
Confidence            56667799999999999999888877666665556766544321 11233566663 33555555566777777654311


Q ss_pred             cCC------CccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcc
Q 022074          115 NVK------GKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIR  158 (303)
Q Consensus       115 ~~~------~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~  158 (303)
                      ...      .-+.........+.-.+++.+.++.|..+-.-+.++||...
T Consensus       161 t~~~~~~~~~i~L~~~~k~N~GfEGlA~d~~~~~l~~aKEr~P~~I~~~~  210 (316)
T COG3204         161 TTVISAKVQKIPLGTTNKKNKGFEGLAWDPVDHRLFVAKERNPIGIFEVT  210 (316)
T ss_pred             ccEEeccceEEeccccCCCCcCceeeecCCCCceEEEEEccCCcEEEEEe
Confidence            000      00111111225568889999998888888887888888765


No 434
>PF10168 Nup88:  Nuclear pore component;  InterPro: IPR019321  Nup88 can be divided into two structural domains; the N-terminal two-thirds of the protein have no obvious structural motifs. It is, however, where it binds to Nup98, one of the components of the nuclear pore. The C-terminal end is a predicted coiled-coil domain []. Nup88 is overexpressed in tumour cells [].
Probab=60.44  E-value=1.5e+02  Score=29.67  Aligned_cols=33  Identities=15%  Similarity=0.384  Sum_probs=24.6

Q ss_pred             eEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCC
Q 022074          209 LIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSG  242 (303)
Q Consensus       209 ~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~  242 (303)
                      +....|.| .+.++..|+.=.+|+++|+||+...
T Consensus       149 i~qv~WhP-~s~~~~~l~vLtsdn~lR~y~~~~~  181 (717)
T PF10168_consen  149 IKQVRWHP-WSESDSHLVVLTSDNTLRLYDISDP  181 (717)
T ss_pred             EEEEEEcC-CCCCCCeEEEEecCCEEEEEecCCC
Confidence            34455555 3556788999999999999999653


No 435
>COG5167 VID27 Protein involved in vacuole import and degradation [Intracellular trafficking and secretion]
Probab=59.78  E-value=29  Score=32.64  Aligned_cols=63  Identities=16%  Similarity=0.240  Sum_probs=47.4

Q ss_pred             CCCeEEEEEeCCCeEEEEECCCCeEE-EEeecCCCCeEEEEECCCCCeEEEEeCCCCEEEeecCCC
Q 022074          220 TGQKYIYTGSHDSCVYVYDLVSGEQV-AALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFPGN  284 (303)
Q Consensus       220 ~~~~~latg~~dg~i~iwd~~~~~~~-~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~~~  284 (303)
                      ....|+|.||..|.|++||-. |... ..+.+-...|..+..+.+|.++++.+. ..|.+-|++..
T Consensus       571 TesGyIa~as~kGDirLyDRi-g~rAKtalP~lG~aIk~idvta~Gk~ilaTCk-~yllL~d~~ik  634 (776)
T COG5167         571 TESGYIAAASRKGDIRLYDRI-GKRAKTALPGLGDAIKHIDVTANGKHILATCK-NYLLLTDVPIK  634 (776)
T ss_pred             ccCceEEEecCCCceeeehhh-cchhhhcCcccccceeeeEeecCCcEEEEeec-ceEEEEecccc
Confidence            345689999999999999953 3332 335666778999999999998776664 57777887654


No 436
>PF13570 PQQ_3:  PQQ-like domain; PDB: 3HXJ_B 3Q54_A.
Probab=59.62  E-value=18  Score=20.67  Aligned_cols=21  Identities=24%  Similarity=0.521  Sum_probs=16.2

Q ss_pred             CCCEEEEeeCCCeEEEEECCC
Q 022074           50 DGRELVAGSSDDCIYVYDLEA   70 (303)
Q Consensus        50 ~g~~l~sgs~Dg~v~lwd~~~   70 (303)
                      .+..++.++.||.++-+|.++
T Consensus        20 ~~g~vyv~~~dg~l~ald~~t   40 (40)
T PF13570_consen   20 AGGRVYVGTGDGNLYALDAAT   40 (40)
T ss_dssp             CTSEEEEE-TTSEEEEEETT-
T ss_pred             ECCEEEEEcCCCEEEEEeCCC
Confidence            466899999999999998764


No 437
>PHA02790 Kelch-like protein; Provisional
Probab=59.61  E-value=1.6e+02  Score=27.78  Aligned_cols=58  Identities=10%  Similarity=0.105  Sum_probs=31.2

Q ss_pred             CCeEEEEEeCCCeEEEEECCCCe--EEEEeecCCCCeEEEEECCCCCeEEEEeCC-----CCEEEeecCC
Q 022074          221 GQKYIYTGSHDSCVYVYDLVSGE--QVAALKYHTSPVRDCSWHPSQPMLVSSSWD-----GDVVRWEFPG  283 (303)
Q Consensus       221 ~~~~latg~~dg~i~iwd~~~~~--~~~~~~~h~~~I~~v~~sp~~~~las~s~D-----g~i~~Wd~~~  283 (303)
                      +++..+.|+   .+.+||.++.+  .+..+.........+.  -++++.+.||.+     .++..+|+..
T Consensus       407 ~~~IYv~GG---~~e~ydp~~~~W~~~~~m~~~r~~~~~~v--~~~~IYviGG~~~~~~~~~ve~Yd~~~  471 (480)
T PHA02790        407 GRRLFLVGR---NAEFYCESSNTWTLIDDPIYPRDNPELII--VDNKLLLIGGFYRGSYIDTIEVYNNRT  471 (480)
T ss_pred             CCEEEEECC---ceEEecCCCCcEeEcCCCCCCccccEEEE--ECCEEEEECCcCCCcccceEEEEECCC
Confidence            466666664   46788887653  2332322222222222  367778887754     3466666543


No 438
>PF01436 NHL:  NHL repeat;  InterPro: IPR001258 The NHL repeat, named after NCL-1, HT2A and Lin-41, is found in a large number of eukaryotic and prokaryotic proteins. For example, the repeat is found in a variety of enzymes of the copper type II, ascorbate-dependent monooxygenase family which catalyse the C terminus alpha-amidation of biological peptides []. In many it occurs in tandem arrays, for example in the ringfinger beta-box, coiled-coil (RBCC) eukaryotic growth regulators []. The 'Brain Tumor' protein (Brat) is one such growth regulator that contains a 6-bladed NHL-repeat beta-propeller [, ].  The NHL repeats are also found in serine/threonine protein kinase (STPK) in a diverse range of pathogenic bacteria. These STPK are transmembrane receptors with an intracellular N-terminal kinase domain and extracellular C-terminal sensor domain. In the STPK, PknD, from Mycobacterium tuberculosis, the sensor domain forms a rigid, six-bladed beta-propeller composed of NHL repeats with a flexible tether to the transmembrane domain.; GO: 0005515 protein binding; PDB: 3FVZ_A 3FW0_A 1RWL_A 1RWI_A 1Q7F_A.
Probab=59.26  E-value=25  Score=18.47  Aligned_cols=24  Identities=25%  Similarity=0.439  Sum_probs=14.5

Q ss_pred             EEEEEcCCCCEEEEeeCCCeEEEE
Q 022074           43 FSLKFSTDGRELVAGSSDDCIYVY   66 (303)
Q Consensus        43 ~~l~~s~~g~~l~sgs~Dg~v~lw   66 (303)
                      ..++.+++|+.+++-+....|++|
T Consensus         5 ~gvav~~~g~i~VaD~~n~rV~vf   28 (28)
T PF01436_consen    5 HGVAVDSDGNIYVADSGNHRVQVF   28 (28)
T ss_dssp             EEEEEETTSEEEEEECCCTEEEEE
T ss_pred             cEEEEeCCCCEEEEECCCCEEEEC
Confidence            456666677666666555555543


No 439
>COG3490 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=59.03  E-value=1.3e+02  Score=26.45  Aligned_cols=53  Identities=28%  Similarity=0.349  Sum_probs=35.8

Q ss_pred             EcCCCCEEEEeeCC-----CeEEEEECCCC-ceEEEEecccCCeEEEEEccCCCcEEEEe
Q 022074           47 FSTDGRELVAGSSD-----DCIYVYDLEAN-KLSLRILAHTSDVNTVCFGDESGHLIYSG  100 (303)
Q Consensus        47 ~s~~g~~l~sgs~D-----g~v~lwd~~~~-~~~~~~~~h~~~v~~l~~~~~~~~~l~s~  100 (303)
                      ||+||.+|++.-+|     |.|-|||...+ +.+.++..|.-+-..+.+.++ +..++.+
T Consensus       121 fs~dG~~LYATEndfd~~rGViGvYd~r~~fqrvgE~~t~GiGpHev~lm~D-Grtlvva  179 (366)
T COG3490         121 FSPDGRLLYATENDFDPNRGVIGVYDAREGFQRVGEFSTHGIGPHEVTLMAD-GRTLVVA  179 (366)
T ss_pred             cCCCCcEEEeecCCCCCCCceEEEEecccccceecccccCCcCcceeEEecC-CcEEEEe
Confidence            89999998876433     56899998754 233456666666677888654 6665544


No 440
>cd00216 PQQ_DH Dehydrogenases with pyrrolo-quinoline quinone (PQQ) as cofactor, like ethanol, methanol, and membrane bound glucose dehydrogenases. The alignment model contains an 8-bladed beta-propeller.
Probab=57.91  E-value=1.7e+02  Score=27.62  Aligned_cols=30  Identities=13%  Similarity=0.437  Sum_probs=25.2

Q ss_pred             CCeEEEEEeCCCeEEEEECCCCeEEEEeec
Q 022074          221 GQKYIYTGSHDSCVYVYDLVSGEQVAALKY  250 (303)
Q Consensus       221 ~~~~latg~~dg~i~iwd~~~~~~~~~~~~  250 (303)
                      .+..++.++.||.++.+|.++|+.+-.++.
T Consensus       405 ~g~~v~~g~~dG~l~ald~~tG~~lW~~~~  434 (488)
T cd00216         405 AGNLVFAGAADGYFRAFDATTGKELWKFRT  434 (488)
T ss_pred             cCCeEEEECCCCeEEEEECCCCceeeEEEC
Confidence            456788889999999999999998877653


No 441
>KOG2247 consensus WD40 repeat-containing protein [General function prediction only]
Probab=56.77  E-value=1.6  Score=40.40  Aligned_cols=114  Identities=12%  Similarity=0.217  Sum_probs=72.5

Q ss_pred             cceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCc
Q 022074           40 FGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGK  119 (303)
Q Consensus        40 ~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~  119 (303)
                      .+-....|.+.+..++.++.+..+..||-..... .+. ...+..-.++|..+.+..++-+.+.+.+.+||+...    .
T Consensus        35 v~pi~~~w~~e~~nlavaca~tiv~~YD~agq~~-le~-n~tg~aldm~wDkegdvlavlAek~~piylwd~n~e----y  108 (615)
T KOG2247|consen   35 VGPIIHRWRPEGHNLAVACANTIVIYYDKAGQVI-LEL-NPTGKALDMAWDKEGDVLAVLAEKTGPIYLWDVNSE----Y  108 (615)
T ss_pred             cccceeeEecCCCceehhhhhhHHHhhhhhccee-ccc-CCchhHhhhhhccccchhhhhhhcCCCeeechhhhh----h
Confidence            3445567888888899999999999998755432 222 233445566775444455667788999999997421    1


Q ss_pred             cceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEccc
Q 022074          120 PAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRK  159 (303)
Q Consensus       120 ~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~  159 (303)
                      ......+-...-.-+.+++....++.+...+.+++++.+.
T Consensus       109 tqqLE~gg~~s~sll~wsKg~~el~ig~~~gn~viynhgt  148 (615)
T KOG2247|consen  109 TQQLESGGTSSKSLLAWSKGTPELVIGNNAGNIVIYNHGT  148 (615)
T ss_pred             HHHHhccCcchHHHHhhccCCccccccccccceEEEeccc
Confidence            1111122122222245667677778888889999998764


No 442
>PF01011 PQQ:  PQQ enzyme repeat family.;  InterPro: IPR002372 Pyrrolo-quinoline quinone (PQQ) is a redox coenzyme, which serves as a cofactor for a number of enzymes (quinoproteins) and particularly for some bacterial dehydrogenases [, ]. A number of bacterial quinoproteins belong to this family. Enzymes in this group have repeats of a beta propeller.; PDB: 1H4I_C 1H4J_E 1W6S_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A 1G72_A ....
Probab=56.51  E-value=35  Score=19.30  Aligned_cols=25  Identities=20%  Similarity=0.485  Sum_probs=20.0

Q ss_pred             EEEEeeCCCeEEEEECCCCceEEEE
Q 022074           53 ELVAGSSDDCIYVYDLEANKLSLRI   77 (303)
Q Consensus        53 ~l~sgs~Dg~v~lwd~~~~~~~~~~   77 (303)
                      .+.+++.||.++-+|.++|+..-+.
T Consensus         2 ~v~~~~~~g~l~AlD~~TG~~~W~~   26 (38)
T PF01011_consen    2 RVYVGTPDGYLYALDAKTGKVLWKF   26 (38)
T ss_dssp             EEEEETTTSEEEEEETTTTSEEEEE
T ss_pred             EEEEeCCCCEEEEEECCCCCEEEee
Confidence            4666799999999999999876444


No 443
>PF15390 DUF4613:  Domain of unknown function (DUF4613)
Probab=55.75  E-value=2e+02  Score=27.84  Aligned_cols=66  Identities=11%  Similarity=0.257  Sum_probs=38.0

Q ss_pred             cceEEEEEcCCCCEEEEeeCCCeEEEEEC--CCCceEEEEecccCCeEEEEEccCC---CcEEEEecCCCeEEEEcCc
Q 022074           40 FGIFSLKFSTDGRELVAGSSDDCIYVYDL--EANKLSLRILAHTSDVNTVCFGDES---GHLIYSGSDDNLCKVWDRR  112 (303)
Q Consensus        40 ~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~--~~~~~~~~~~~h~~~v~~l~~~~~~---~~~l~s~s~dg~v~lWd~~  112 (303)
                      .|+..++|. ||+.++--    .+++-+-  +-|.  ....++-..|..++|.|..   ...++......-|.+|-+.
T Consensus        20 HPvhGlaWT-DGkqVvLT----~L~l~~gE~kfGd--s~viGqFEhV~GlsW~P~~~~~~paLLAVQHkkhVtVWqL~   90 (671)
T PF15390_consen   20 HPVHGLAWT-DGKQVVLT----DLQLHNGEPKFGD--SKVIGQFEHVHGLSWAPPCTADTPALLAVQHKKHVTVWQLC   90 (671)
T ss_pred             ccccceEec-CCCEEEEE----eeeeeCCccccCC--ccEeeccceeeeeeecCcccCCCCceEEEeccceEEEEEec
Confidence            478999998 66654432    1222211  1111  1234555679999997752   2244556666789999763


No 444
>PF12768 Rax2:  Cortical protein marker for cell polarity
Probab=55.68  E-value=1e+02  Score=26.80  Aligned_cols=53  Identities=13%  Similarity=0.231  Sum_probs=36.4

Q ss_pred             CCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEec------CCCeEEEEcCc
Q 022074           59 SDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGS------DDNLCKVWDRR  112 (303)
Q Consensus        59 ~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s------~dg~v~lWd~~  112 (303)
                      ....|.+||..+.+....-.+-.+.|..+.|. ++.++++.|.      ....+..||..
T Consensus        14 ~C~~lC~yd~~~~qW~~~g~~i~G~V~~l~~~-~~~~Llv~G~ft~~~~~~~~la~yd~~   72 (281)
T PF12768_consen   14 PCPGLCLYDTDNSQWSSPGNGISGTVTDLQWA-SNNQLLVGGNFTLNGTNSSNLATYDFK   72 (281)
T ss_pred             CCCEEEEEECCCCEeecCCCCceEEEEEEEEe-cCCEEEEEEeeEECCCCceeEEEEecC
Confidence            35569999998887554333445789999996 3466777664      34567788865


No 445
>PF14269 Arylsulfotran_2:  Arylsulfotransferase (ASST)
Probab=54.65  E-value=1.3e+02  Score=26.49  Aligned_cols=70  Identities=20%  Similarity=0.352  Sum_probs=49.1

Q ss_pred             ceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecccC-C----eEEEEEccCCCcEEEEecCCCeEEEEcC
Q 022074           41 GIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAHTS-D----VNTVCFGDESGHLIYSGSDDNLCKVWDR  111 (303)
Q Consensus        41 ~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~-~----v~~l~~~~~~~~~l~s~s~dg~v~lWd~  111 (303)
                      =+.+|....+|.+|++.-.-.+|.+.|..+|+..-++.+... .    -...++.+ +.+.+-.+..+++|.++|-
T Consensus       145 HiNsV~~~~~G~yLiS~R~~~~i~~I~~~tG~I~W~lgG~~~~df~~~~~~f~~QH-dar~~~~~~~~~~IslFDN  219 (299)
T PF14269_consen  145 HINSVDKDDDGDYLISSRNTSTIYKIDPSTGKIIWRLGGKRNSDFTLPATNFSWQH-DARFLNESNDDGTISLFDN  219 (299)
T ss_pred             EeeeeeecCCccEEEEecccCEEEEEECCCCcEEEEeCCCCCCcccccCCcEeecc-CCEEeccCCCCCEEEEEcC
Confidence            478889999999999998888999999999987766654411 1    11234433 2444445567889999983


No 446
>PF04053 Coatomer_WDAD:  Coatomer WD associated region ;  InterPro: IPR006692 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated with ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least alpha, beta, beta', gamma, delta, epsilon and zeta subunits.  This entry represents the WD-associated region found in the coatomer alpha, beta and beta' subunits.
The alpha-subunit (RET1P) of the coatomer complex in Saccharomyces cerevisiae (Baker's yeast) participates in membrane transport between the endoplasmic reticulum and Golgi apparatus. The protein contains six WD-40 repeat motifs in its N-terminal region. More information about these proteins can be found at Protein of the Month: Clathrin.; GO: 0005198 structural molecule activity, 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030117 membrane coat; PDB: 3MKQ_B.
Probab=53.73  E-value=1.9e+02  Score=27.00  Aligned_cols=109  Identities=18%  Similarity=0.244  Sum_probs=54.7

Q ss_pred             CcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEE------------------------EE-eccc-C--------
Q 022074           37 GYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSL------------------------RI-LAHT-S--------   82 (303)
Q Consensus        37 ~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~------------------------~~-~~h~-~--------   82 (303)
                      ........++++|+|+.++++ .||.-.|+.....+...                        .+ ..-+ .        
T Consensus        30 ~~~~~p~~ls~npngr~v~V~-g~geY~iyt~~~~r~k~~G~g~~~vw~~~n~yAv~~~~~~I~I~kn~~~~~~k~i~~~  108 (443)
T PF04053_consen   30 SCEIYPQSLSHNPNGRFVLVC-GDGEYEIYTALAWRNKAFGSGLSFVWSSRNRYAVLESSSTIKIYKNFKNEVVKSIKLP  108 (443)
T ss_dssp             E-SS--SEEEE-TTSSEEEEE-ETTEEEEEETTTTEEEEEEE-SEEEE-TSSEEEEE-TTS-EEEEETTEE-TT-----S
T ss_pred             CCCcCCeeEEECCCCCEEEEE-cCCEEEEEEccCCcccccCceeEEEEecCccEEEEECCCeEEEEEcCccccceEEcCC
Confidence            344457889999999988884 56667777643222110                        00 0000 0        


Q ss_pred             -CeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcc
Q 022074           83 -DVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIR  158 (303)
Q Consensus        83 -~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~  158 (303)
                       .+..+ | .  |.+|+..+ ++.|.+||..    +++.++.+.  ..+|..+.|++++++++-.+. ..+.+++..
T Consensus       109 ~~~~~I-f-~--G~LL~~~~-~~~i~~yDw~----~~~~i~~i~--v~~vk~V~Ws~~g~~val~t~-~~i~il~~~  173 (443)
T PF04053_consen  109 FSVEKI-F-G--GNLLGVKS-SDFICFYDWE----TGKLIRRID--VSAVKYVIWSDDGELVALVTK-DSIYILKYN  173 (443)
T ss_dssp             S-EEEE-E----SSSEEEEE-TTEEEEE-TT----T--EEEEES--S-E-EEEEE-TTSSEEEEE-S--SEEEEEE-
T ss_pred             cccceE-E-c--CcEEEEEC-CCCEEEEEhh----HcceeeEEe--cCCCcEEEEECCCCEEEEEeC-CeEEEEEec
Confidence             01111 1 1  45555554 4489999975    334444442  234788889999998887765 577777643


No 447
>PF05096 Glu_cyclase_2:  Glutamine cyclotransferase;  InterPro: IPR007788 This family of enzymes (EC 2.3.2.5) catalyses the cyclization of free L-glutamine and N-terminal glutaminyl residues in proteins to pyroglutamate (5-oxoproline) and pyroglutamyl residues respectively. This family includes plant and bacterial enzymes and seems unrelated to the mammalian enzymes.; PDB: 3NOK_B 2FAW_A 2IWA_A 3NOM_A 3NOL_A 3MBR_X.
Probab=51.38  E-value=1.6e+02  Score=25.37  Aligned_cols=180  Identities=14%  Similarity=0.108  Sum_probs=93.9

Q ss_pred             EEEEcCCCCEEEEeeCCC--eEEEEECCCCceEEEEe-cccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCcc
Q 022074           44 SLKFSTDGRELVAGSSDD--CIYVYDLEANKLSLRIL-AHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKP  120 (303)
Q Consensus        44 ~l~~s~~g~~l~sgs~Dg--~v~lwd~~~~~~~~~~~-~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~  120 (303)
                      .+.|..+|..+-+.+.-|  .|+.+|+.+++...+.. ...-.-..++...  +++..-.=.++...+||....    +.
T Consensus        49 GL~~~~~g~LyESTG~yG~S~l~~~d~~tg~~~~~~~l~~~~FgEGit~~~--d~l~qLTWk~~~~f~yd~~tl----~~  122 (264)
T PF05096_consen   49 GLEFLDDGTLYESTGLYGQSSLRKVDLETGKVLQSVPLPPRYFGEGITILG--DKLYQLTWKEGTGFVYDPNTL----KK  122 (264)
T ss_dssp             EEEEEETTEEEEEECSTTEEEEEEEETTTSSEEEEEE-TTT--EEEEEEET--TEEEEEESSSSEEEEEETTTT----EE
T ss_pred             cEEecCCCEEEEeCCCCCcEEEEEEECCCCcEEEEEECCccccceeEEEEC--CEEEEEEecCCeEEEEccccc----eE
Confidence            356766787777777766  68999999998654332 1222223344322  233333445778889997532    23


Q ss_pred             ceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccccCCcccccCccceeeeceeeeCCCCCccccCCCCCcceEE
Q 022074          121 AGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKMSSNASCNLGFRSYEWDYRWMDYPPQARDLKHPCDQSVATY  200 (303)
Q Consensus       121 ~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  200 (303)
                      ...+.- ...=+.++  .++..|+.......+.++|..........                               ...
T Consensus       123 ~~~~~y-~~EGWGLt--~dg~~Li~SDGS~~L~~~dP~~f~~~~~i-------------------------------~V~  168 (264)
T PF05096_consen  123 IGTFPY-PGEGWGLT--SDGKRLIMSDGSSRLYFLDPETFKEVRTI-------------------------------QVT  168 (264)
T ss_dssp             EEEEE--SSS--EEE--ECSSCEEEE-SSSEEEEE-TTT-SEEEEE-------------------------------E-E
T ss_pred             EEEEec-CCcceEEE--cCCCEEEEECCccceEEECCcccceEEEE-------------------------------EEE
Confidence            333321 22233443  34555655554577777776532211100                               000


Q ss_pred             ecccceee--eEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEeec------------C---CCCeEEEEECCC
Q 022074          201 KGHSVLRT--LIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALKY------------H---TSPVRDCSWHPS  263 (303)
Q Consensus       201 ~~~~~~~~--~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~~------------h---~~~I~~v~~sp~  263 (303)
                      .+......  .+.+.       +|...|=--....|...|..+|+.+..+..            +   .+=.+.+||.|.
T Consensus       169 ~~g~pv~~LNELE~i-------~G~IyANVW~td~I~~Idp~tG~V~~~iDls~L~~~~~~~~~~~~~~dVLNGIAyd~~  241 (264)
T PF05096_consen  169 DNGRPVSNLNELEYI-------NGKIYANVWQTDRIVRIDPETGKVVGWIDLSGLRPEVGRDKSRQPDDDVLNGIAYDPE  241 (264)
T ss_dssp             ETTEE---EEEEEEE-------TTEEEEEETTSSEEEEEETTT-BEEEEEE-HHHHHHHTSTTST--TTS-EEEEEEETT
T ss_pred             ECCEECCCcEeEEEE-------cCEEEEEeCCCCeEEEEeCCCCeEEEEEEhhHhhhcccccccccccCCeeEeEeEeCC
Confidence            00000000  11111       466666666778899999999988776631            0   234889999887


Q ss_pred             CC-eEEEE
Q 022074          264 QP-MLVSS  270 (303)
Q Consensus       264 ~~-~las~  270 (303)
                      .+ +++||
T Consensus       242 ~~~l~vTG  249 (264)
T PF05096_consen  242 TDRLFVTG  249 (264)
T ss_dssp             TTEEEEEE
T ss_pred             CCEEEEEe
Confidence            65 67777


No 448
>TIGR03074 PQQ_membr_DH membrane-bound PQQ-dependent dehydrogenase, glucose/quinate/shikimate family. This protein family has a phylogenetic distribution very similar to that of coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Members of this family have several predicted transmembrane helices in the N-terminal region, and include the quinoprotein glucose dehydrogenase (EC 1.1.5.2) of Escherichia coli and the quinate/shikimate dehydrogenase of Acinetobacter sp. ADP1 (EC 1.1.99.25). Sequences closely related except for the absence of the N-terminal hydrophobic region, scoring in the gray zone between the trusted and noise cutoffs, include PQQ-dependent glycerol (EC 1.1.99.22) and other polyol (sugar alcohol) dehydrogenases.
Probab=50.65  E-value=2.8e+02  Score=28.01  Aligned_cols=62  Identities=15%  Similarity=0.149  Sum_probs=39.4

Q ss_pred             CCEEEEeeCCCeEEEEECCCCceEEEEecccC--------CeEEEEEcc---------------CCCcEEEEecCCCeEE
Q 022074           51 GRELVAGSSDDCIYVYDLEANKLSLRILAHTS--------DVNTVCFGD---------------ESGHLIYSGSDDNLCK  107 (303)
Q Consensus        51 g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~--------~v~~l~~~~---------------~~~~~l~s~s~dg~v~  107 (303)
                      +..++.++.++.|.=+|.++|++.-++.....        .+..+.+..               .++..++.++.|+.+.
T Consensus       194 gg~lYv~t~~~~V~ALDa~TGk~lW~~d~~~~~~~~~~~~~cRGvay~~~p~~~~~~~~~~~p~~~~~rV~~~T~Dg~Li  273 (764)
T TIGR03074       194 GDTLYLCTPHNKVIALDAATGKEKWKFDPKLKTEAGRQHQTCRGVSYYDAPAAAAGPAAPAAPADCARRIILPTSDARLI  273 (764)
T ss_pred             CCEEEEECCCCeEEEEECCCCcEEEEEcCCCCcccccccccccceEEecCCcccccccccccccccCCEEEEecCCCeEE
Confidence            66788888899999999999987644432211        112233311               1234677788888887


Q ss_pred             EEcCc
Q 022074          108 VWDRR  112 (303)
Q Consensus       108 lWd~~  112 (303)
                      -.|.+
T Consensus       274 ALDA~  278 (764)
T TIGR03074       274 ALDAD  278 (764)
T ss_pred             EEECC
Confidence            77754


No 449
>TIGR03606 non_repeat_PQQ dehydrogenase, PQQ-dependent, s-GDH family. PQQ, or pyrroloquinoline-quinone, serves as a cofactor for a number of sugar and alcohol dehydrogenases in a limited number of bacterial species. Most characterized PQQ-dependent enzymes have multiple repeats of a sequence region described by pfam01011 (PQQ enzyme repeat), but this protein family is unusual in lacking that repeat. Below the noise cutoff are related proteins mostly from species that lack PQQ biosynthesis.
Probab=49.82  E-value=2.3e+02  Score=26.65  Aligned_cols=56  Identities=18%  Similarity=0.233  Sum_probs=37.1

Q ss_pred             CeEEEEEeCCCeEEEEECCCCe------EEEEeecCCCCeEEEEECCCC-CeEEEEeCCCCEEE
Q 022074          222 QKYIYTGSHDSCVYVYDLVSGE------QVAALKYHTSPVRDCSWHPSQ-PMLVSSSWDGDVVR  278 (303)
Q Consensus       222 ~~~latg~~dg~i~iwd~~~~~------~~~~~~~h~~~I~~v~~sp~~-~~las~s~Dg~i~~  278 (303)
                      +.+|+++-..+.|+...+....      ....+.. ..+|.+|+-+||| .+.+..+.+|.+.-
T Consensus       369 g~llv~~L~~~~l~r~~l~~~~~~v~~~~~~~~~~-~~RiRdv~~~pDg~~iy~~td~~g~~~~  431 (454)
T TIGR03606       369 NSLLIPSLKRGVIYRIKLDPDYSTVYGDAVPMFKT-NNRYRDVIASPDGNVLYVATDNFGNVQK  431 (454)
T ss_pred             CCEEEEEcCCCeEEEEEecCCcceecceeEEeecC-CCeeEEEEECCCCCEEEEEEcCCCcccc
Confidence            5667777677778777775331      1222333 5799999999997 66666667777653


No 450
>TIGR03118 PEPCTERM_chp_1 conserved hypothetical protein TIGR03118. This model describes an uncharacterized conserved hypothetical protein. Members are found with the C-terminal putative exosortase interaction domain, PEP-CTERM, in Nitrosospira multiformis, Rhodoferax ferrireducens, Solibacter usitatus Ellin6076, and Acidobacteria bacterium Ellin345. It is found without the PEP-CTERM domain in several other species, including Burkholderia ambifaria, Gloeobacter violaceus PCC 7421, and in three copies in the Acanthamoeba polyphaga mimivirus.
Probab=47.39  E-value=1.5e+02  Score=26.17  Aligned_cols=69  Identities=12%  Similarity=0.197  Sum_probs=48.2

Q ss_pred             eeeeeeCCCeEEEEEeCCCeEEEEECC------CCeE-EEEeec-----CCCCeEEEEECCCCCe------------EEE
Q 022074          214 FSPVYSTGQKYIYTGSHDSCVYVYDLV------SGEQ-VAALKY-----HTSPVRDCSWHPSQPM------------LVS  269 (303)
Q Consensus       214 ~~~~~s~~~~~latg~~dg~i~iwd~~------~~~~-~~~~~~-----h~~~I~~v~~sp~~~~------------las  269 (303)
                      |-..++|.+.+.++....+...+||..      ..+. +-.+..     -....+.+.|+....+            ++.
T Consensus        26 WGia~~p~~~~WVadngT~~~TlYdg~~~~~~g~~~~L~vtiP~~~~~~~~~~PTGiVfN~~~~F~vt~~g~~~~a~Fif  105 (336)
T TIGR03118        26 WGLSYRPGGPFWVANTGTGTATLYVGNPDTQPLVQDPLVVVIPAPPPLAAEGTPTGQVFNGSDTFVVSGEGITGPSRFLF  105 (336)
T ss_pred             ceeEecCCCCEEEecCCcceEEeecCCcccccCCccceEEEecCCCCCCCCCCccEEEEeCCCceEEcCCCcccceeEEE
Confidence            445577888888988899999999985      2222 223321     2346788888864333            677


Q ss_pred             EeCCCCEEEeecC
Q 022074          270 SSWDGDVVRWEFP  282 (303)
Q Consensus       270 ~s~Dg~i~~Wd~~  282 (303)
                      +++||+|.-|...
T Consensus       106 ~tEdGTisaW~p~  118 (336)
T TIGR03118       106 VTEDGTLSGWAPA  118 (336)
T ss_pred             EeCCceEEeecCc
Confidence            8899999999954


No 451
>PF15390 DUF4613:  Domain of unknown function (DUF4613)
Probab=47.01  E-value=2.7e+02  Score=27.04  Aligned_cols=67  Identities=9%  Similarity=0.242  Sum_probs=41.3

Q ss_pred             EEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEE-ecccCCeEEEEEccCCCcEEEEe-cCCCeEEEEcC
Q 022074           44 SLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRI-LAHTSDVNTVCFGDESGHLIYSG-SDDNLCKVWDR  111 (303)
Q Consensus        44 ~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~-~~h~~~v~~l~~~~~~~~~l~s~-s~dg~v~lWd~  111 (303)
                      .+-|+|....|++=.....-.++++..+....+. ....+.|.|.||.. ++++|+.+ +..=--++||-
T Consensus       117 GCVWHPk~~iL~VLT~~dvSV~~sV~~d~srVkaDi~~~G~IhCACWT~-DG~RLVVAvGSsLHSyiWd~  185 (671)
T PF15390_consen  117 GCVWHPKKAILTVLTARDVSVLPSVHCDSSRVKADIKTSGLIHCACWTK-DGQRLVVAVGSSLHSYIWDS  185 (671)
T ss_pred             cccccCCCceEEEEecCceeEeeeeeeCCceEEEeccCCceEEEEEecC-cCCEEEEEeCCeEEEEEecC
Confidence            4789998887777665554456666554322221 23457799999975 56666555 33223468984


No 452
>TIGR02604 Piru_Ver_Nterm putative membrane-bound dehydrogenase domain. All proteins that score above the trusted cutoff score of 45 to this model are large proteins of either Pirellula sp. 1 or Verrucomicrobium spinosum. These proteins all contain, in addition to this domain, several hundred residues of highly variable sequence, and then a well-conserved C-terminal domain (TIGR02603) that features a putative cytochrome c-type heme binding motif CXXCH. The membrane-bound L-sorbosone dehydrogenase from Acetobacter liquefaciens (Gluconacetobacter liquefaciens) is homologous to this domain but lacks additional sequence regions shared by members of this family and belongs to a different clade of the larger family of homologs. It and its closely related homologs are excluded from this model by scoring between the trusted (45) and noise (18) cutoffs.
Probab=45.98  E-value=2.3e+02  Score=25.55  Aligned_cols=103  Identities=17%  Similarity=0.193  Sum_probs=52.4

Q ss_pred             eEEEEEcCCCCEEEEee-----------CCC-eEEEEECCC--Cce--EEEEecccCCeEEEEEccCCCcEEEEecCCCe
Q 022074           42 IFSLKFSTDGRELVAGS-----------SDD-CIYVYDLEA--NKL--SLRILAHTSDVNTVCFGDESGHLIYSGSDDNL  105 (303)
Q Consensus        42 v~~l~~s~~g~~l~sgs-----------~Dg-~v~lwd~~~--~~~--~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~  105 (303)
                      ..+|+|.++|+..++-.           ..+ .|.+++..+  |+.  ...+.........+++.+ ++ +++ ++...-
T Consensus        16 P~~ia~d~~G~l~V~e~~~y~~~~~~~~~~~~rI~~l~d~dgdG~~d~~~vfa~~l~~p~Gi~~~~-~G-lyV-~~~~~i   92 (367)
T TIGR02604        16 PIAVCFDERGRLWVAEGITYSRPAGRQGPLGDRILILEDADGDGKYDKSNVFAEELSMVTGLAVAV-GG-VYV-ATPPDI   92 (367)
T ss_pred             CceeeECCCCCEEEEeCCcCCCCCCCCCCCCCEEEEEEcCCCCCCcceeEEeecCCCCccceeEec-CC-EEE-eCCCeE
Confidence            56789999999766642           223 677776543  221  122332334457777754 45 554 444433


Q ss_pred             EEEEcCccccCCC-ccceeecc-------cccCeEEEEeCCCCCEEEEEe
Q 022074          106 CKVWDRRCLNVKG-KPAGVLMG-------HLEGITFIDSRGDGRYLISNG  147 (303)
Q Consensus       106 v~lWd~~~~~~~~-~~~~~~~~-------h~~~v~~~~~~~~~~~l~s~~  147 (303)
                      .++.|........ +....+.+       +......+.+.++|.+.++.+
T Consensus        93 ~~~~d~~gdg~ad~~~~~l~~~~~~~~~~~~~~~~~l~~gpDG~LYv~~G  142 (367)
T TIGR02604        93 LFLRDKDGDDKADGEREVLLSGFGGQINNHHHSLNSLAWGPDGWLYFNHG  142 (367)
T ss_pred             EEEeCCCCCCCCCCccEEEEEccCCCCCcccccccCceECCCCCEEEecc
Confidence            3343542111111 11111111       123356778889988777655


No 453
>PF01731 Arylesterase:  Arylesterase;  InterPro: IPR002640  The serum paraoxonases/arylesterases are enzymes that catalyse the hydrolysis of the toxic metabolites of a variety of organophosphorus insecticides. The enzymes hydrolyse a broad spectrum of organophosphate substrates, including paraoxon and a number of aromatic carboxylic acid esters (e.g., phenyl acetate), and hence confer resistance to organophosphate toxicity. Mammals have 3 distinct paraoxonase types, termed PON1-3. In mice and humans, the PON genes are found on the same chromosome in close proximity. PON activity has been found in a variety of tissues, with highest levels in liver and serum - the source of serum PON is thought to be the liver. Unlike mammals, fish and avian species lack paraoxonase activity. Human and rabbit PONs appear to have two distinct Ca2+ binding sites, one required for stability and one required for catalytic activity. The Ca2+ dependency of PONs suggests a mechanism of hydrolysis where Ca2+ acts as the electrophilic catalyst, like that proposed for phospholipase A2. The paraoxonase enzymes, PON1 and PON3, are high density lipoprotein (HDL)-associated proteins capable of preventing oxidative modification of low density lipoproteins (LDL). Although PON2 has oxidative properties, the enzyme does not associate with HDL. Within a given species, PON1, PON2 and PON3 share ~60% amino acid sequence identity, whereas between mammalian species particular PONs (1, 2 or 3) share 79-90% identity at the amino acid level. Human PON1 and PON3 share numerous conserved phosphorylation and N-glycosylation sites; however, it is not known whether the PON proteins are modified at these sites, or whether modification at these sites is required for activity in vivo.  This family consists of arylesterases (also known as serum paraoxonases; EC 3.1.1.2). These enzymes hydrolyse organophosphorus esters such as paraoxon and are found in the liver and blood.
They confer resistance to organophosphate toxicity. Human arylesterase (PON1; SWISSPROT P27169) is associated with HDL and may protect against LDL oxidation.; GO: 0004064 arylesterase activity
Probab=45.01  E-value=53  Score=22.86  Aligned_cols=28  Identities=25%  Similarity=0.260  Sum_probs=21.8

Q ss_pred             EEEEEcCCCCEEEEee-CCCeEEEEECCC
Q 022074           43 FSLKFSTDGRELVAGS-SDDCIYVYDLEA   70 (303)
Q Consensus        43 ~~l~~s~~g~~l~sgs-~Dg~v~lwd~~~   70 (303)
                      ..|.++|++++|.+++ ..+.|++|+.+.
T Consensus        57 NGI~~s~~~k~lyVa~~~~~~I~vy~~~~   85 (86)
T PF01731_consen   57 NGIAISPDKKYLYVASSLAHSIHVYKRHK   85 (86)
T ss_pred             ceEEEcCCCCEEEEEeccCCeEEEEEecC
Confidence            5788999999887765 567899987653


No 454
>PF10647 Gmad1:  Lipoprotein LpqB beta-propeller domain;  InterPro: IPR018910  The Gmad1 domain is found associated with IPR019606 from INTERPRO, in bacterial spore formation. It is predicted to have a beta-propeller fold and to have a passive binding role rather than a catalytic function owing to the low number of conserved hydrophilic residues. 
Probab=39.92  E-value=2.3e+02  Score=23.98  Aligned_cols=106  Identities=10%  Similarity=0.049  Sum_probs=58.9

Q ss_pred             ceEEEEEcCCCCEEEEeeCCCeEEEEE-CCCCceEE-EEecc-c-CCeEEEEEccCCCcEEEEec---CCCeEEEEcCcc
Q 022074           41 GIFSLKFSTDGRELVAGSSDDCIYVYD-LEANKLSL-RILAH-T-SDVNTVCFGDESGHLIYSGS---DDNLCKVWDRRC  113 (303)
Q Consensus        41 ~v~~l~~s~~g~~l~sgs~Dg~v~lwd-~~~~~~~~-~~~~h-~-~~v~~l~~~~~~~~~l~s~s---~dg~v~lWd~~~  113 (303)
                      .+..-+|+++|...++...+...+++. ..++.... .+... . ..|..+.++++ +.+++-..   .++.|.+=-+. 
T Consensus        67 ~l~~PS~d~~g~~W~v~~~~~~~~~~~~~~~g~~~~~~v~~~~~~~~I~~l~vSpD-G~RvA~v~~~~~~~~v~va~V~-  144 (253)
T PF10647_consen   67 SLTRPSWDPDGWVWTVDDGSGGVRVVRDSASGTGEPVEVDWPGLRGRITALRVSPD-GTRVAVVVEDGGGGRVYVAGVV-  144 (253)
T ss_pred             ccccccccCCCCEEEEEcCCCceEEEEecCCCcceeEEecccccCCceEEEEECCC-CcEEEEEEecCCCCeEEEEEEE-
Confidence            577778999988777766666677773 33333221 12111 1 27999999875 55554333   34566553221 


Q ss_pred             ccCCC------ccceeecccccCeEEEEeCCCCCEEEEEeC
Q 022074          114 LNVKG------KPAGVLMGHLEGITFIDSRGDGRYLISNGK  148 (303)
Q Consensus       114 ~~~~~------~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~  148 (303)
                      ....+      .+..........+..++|.+++.+++.+..
T Consensus       145 r~~~g~~~~l~~~~~~~~~~~~~v~~v~W~~~~~L~V~~~~  185 (253)
T PF10647_consen  145 RDGDGVPRRLTGPRRVAPPLLSDVTDVAWSDDSTLVVLGRS  185 (253)
T ss_pred             eCCCCCcceeccceEecccccCcceeeeecCCCEEEEEeCC
Confidence            00111      111222223457889999998877665544


No 455
>PF05694 SBP56:  56kDa selenium binding protein (SBP56);  InterPro: IPR008826 This family consists of several eukaryotic selenium binding proteins as well as three sequences from archaea. The exact function of this protein is unknown, although it is thought that SBP56 participates in late stages of intra-Golgi protein transport. The Lotus japonicus homologue of SBP56, LjSBP, is thought to have more than one physiological role and can be implicated in controlling the oxidation/reduction status of target proteins in vesicular Golgi transport.; GO: 0008430 selenium binding; PDB: 2ECE_A.
Probab=39.58  E-value=3.2e+02  Score=25.52  Aligned_cols=97  Identities=12%  Similarity=0.080  Sum_probs=50.1

Q ss_pred             ccccccccccCcCcccccCCCcccceEEEEEcCC--CCEEEEeeC-CCeEEEE-ECCCCceEE----EEecc--------
Q 022074           17 SLANVTEIHDGLDFSAADDGGYSFGIFSLKFSTD--GRELVAGSS-DDCIYVY-DLEANKLSL----RILAH--------   80 (303)
Q Consensus        17 ~~~~~~~~~~~~~~~~~~~~~~~~~v~~l~~s~~--g~~l~sgs~-Dg~v~lw-d~~~~~~~~----~~~~h--------   80 (303)
                      +.++||+..+....+.+|.+--.+...-|.|..+  ..+-.+|+. .++|.+| ..+.+....    .+..-        
T Consensus       222 ~~l~vWD~~~r~~~Q~idLg~~g~~pLEvRflH~P~~~~gFvg~aLss~i~~~~k~~~g~W~a~kVi~ip~~~v~~~~lp  301 (461)
T PF05694_consen  222 HSLHVWDWSTRKLLQTIDLGEEGQMPLEVRFLHDPDANYGFVGCALSSSIWRFYKDDDGEWAAEKVIDIPAKKVEGWILP  301 (461)
T ss_dssp             -EEEEEETTTTEEEEEEES-TTEEEEEEEEE-SSTT--EEEEEEE--EEEEEEEE-ETTEEEEEEEEEE--EE--SS---
T ss_pred             CeEEEEECCCCcEeeEEecCCCCCceEEEEecCCCCccceEEEEeccceEEEEEEcCCCCeeeeEEEECCCcccCccccc
Confidence            6679999988888878887654556778888754  555444443 3345444 434443221    11110        


Q ss_pred             ---------cCCeEEEEEccCCCcEEEEecCCCeEEEEcCcc
Q 022074           81 ---------TSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRC  113 (303)
Q Consensus        81 ---------~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~  113 (303)
                               ..-++.+..+.++.-+.+++=..|.||.||+..
T Consensus       302 ~ml~~~~~~P~LitDI~iSlDDrfLYvs~W~~GdvrqYDISD  343 (461)
T PF05694_consen  302 EMLKPFGAVPPLITDILISLDDRFLYVSNWLHGDVRQYDISD  343 (461)
T ss_dssp             GGGGGG-EE------EEE-TTS-EEEEEETTTTEEEEEE-SS
T ss_pred             ccccccccCCCceEeEEEccCCCEEEEEcccCCcEEEEecCC
Confidence                     123577777665444455666699999999864


No 456
>PHA03098 kelch-like protein; Provisional
Probab=39.00  E-value=2.6e+02  Score=26.57  Aligned_cols=24  Identities=21%  Similarity=0.332  Sum_probs=16.2

Q ss_pred             CCCEEEEeeCC------CeEEEEECCCCce
Q 022074           50 DGRELVAGSSD------DCIYVYDLEANKL   73 (303)
Q Consensus        50 ~g~~l~sgs~D------g~v~lwd~~~~~~   73 (303)
                      +++..+.||.+      ..+..||..+++.
T Consensus       389 ~~~iYv~GG~~~~~~~~~~v~~yd~~t~~W  418 (534)
T PHA03098        389 NNLIYVIGGISKNDELLKTVECFSLNTNKW  418 (534)
T ss_pred             CCEEEEECCcCCCCcccceEEEEeCCCCee
Confidence            56666777632      3578899887754


No 457
>PF07250 Glyoxal_oxid_N:  Glyoxal oxidase N-terminus;  InterPro: IPR009880 This entry represents the N terminus (approximately 300 residues) of a number of plant and fungal glyoxal oxidase enzymes. Glyoxal oxidase catalyses the oxidation of aldehydes to carboxylic acids, coupled with reduction of dioxygen to hydrogen peroxide. It is an essential component of the extracellular lignin degradation pathways of the wood-rot fungus Phanerochaete chrysosporium.
Probab=38.08  E-value=2.5e+02  Score=23.83  Aligned_cols=86  Identities=19%  Similarity=0.178  Sum_probs=43.6

Q ss_pred             EEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecC-C--CeEEEEcCccccCCCcccee--ecccccCeEEEEeC
Q 022074           63 IYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSD-D--NLCKVWDRRCLNVKGKPAGV--LMGHLEGITFIDSR  137 (303)
Q Consensus        63 v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~-d--g~v~lWd~~~~~~~~~~~~~--~~~h~~~v~~~~~~  137 (303)
                      -.+||+.+++....-...+-.+..=.+. ++++++.+|+. +  ..+|+++.............  ......---....-
T Consensus        48 s~~yD~~tn~~rpl~v~td~FCSgg~~L-~dG~ll~tGG~~~G~~~ir~~~p~~~~~~~~w~e~~~~m~~~RWYpT~~~L  126 (243)
T PF07250_consen   48 SVEYDPNTNTFRPLTVQTDTFCSGGAFL-PDGRLLQTGGDNDGNKAIRIFTPCTSDGTCDWTESPNDMQSGRWYPTATTL  126 (243)
T ss_pred             EEEEecCCCcEEeccCCCCCcccCcCCC-CCCCEEEeCCCCccccceEEEecCCCCCCCCceECcccccCCCccccceEC
Confidence            4689999887543222233333333454 46889988875 3  35777774210000000000  00111111122345


Q ss_pred             CCCCEEEEEeCC
Q 022074          138 GDGRYLISNGKD  149 (303)
Q Consensus       138 ~~~~~l~s~~~D  149 (303)
                      +||+.|+.||.+
T Consensus       127 ~DG~vlIvGG~~  138 (243)
T PF07250_consen  127 PDGRVLIVGGSN  138 (243)
T ss_pred             CCCCEEEEeCcC
Confidence            789999998876


No 458
>PRK13684 Ycf48-like protein; Provisional
Probab=37.77  E-value=2.9e+02  Score=24.51  Aligned_cols=112  Identities=13%  Similarity=0.020  Sum_probs=55.6

Q ss_pred             CcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEe-cccCCeEEEEEccCCCcEEEEecCCCeEEEEcCcccc
Q 022074           37 GYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRIL-AHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLN  115 (303)
Q Consensus        37 ~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~-~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~  115 (303)
                      +-...++++.+.+++.+++++ ..|.+..-....++....+. .-...+..+.+.+ ++..++. +..|.+++=... ..
T Consensus       170 ~~~g~~~~i~~~~~g~~v~~g-~~G~i~~s~~~gg~tW~~~~~~~~~~l~~i~~~~-~g~~~~v-g~~G~~~~~s~d-~G  245 (334)
T PRK13684        170 DAAGVVRNLRRSPDGKYVAVS-SRGNFYSTWEPGQTAWTPHQRNSSRRLQSMGFQP-DGNLWML-ARGGQIRFNDPD-DL  245 (334)
T ss_pred             CCcceEEEEEECCCCeEEEEe-CCceEEEEcCCCCCeEEEeeCCCcccceeeeEcC-CCCEEEE-ecCCEEEEccCC-CC
Confidence            334578999999988766554 55655332112233232222 2335677888865 3555554 456776542111 11


Q ss_pred             CCCccceeecc-cccCeEEEEeCCCCCEEEEEeCCCcEE
Q 022074          116 VKGKPAGVLMG-HLEGITFIDSRGDGRYLISNGKDQAIK  153 (303)
Q Consensus       116 ~~~~~~~~~~~-h~~~v~~~~~~~~~~~l~s~~~D~~v~  153 (303)
                      .+.+....... -...+..+.+.++++.++ ++.++.+.
T Consensus       246 ~sW~~~~~~~~~~~~~l~~v~~~~~~~~~~-~G~~G~v~  283 (334)
T PRK13684        246 ESWSKPIIPEITNGYGYLDLAYRTPGEIWA-GGGNGTLL  283 (334)
T ss_pred             CccccccCCccccccceeeEEEcCCCCEEE-EcCCCeEE
Confidence            11111111100 112466677777776554 44556544


No 459
>COG4590 ABC-type uncharacterized transport system, permease component [General function prediction only]
Probab=37.76  E-value=1.5e+02  Score=27.69  Aligned_cols=118  Identities=14%  Similarity=0.170  Sum_probs=61.2

Q ss_pred             CcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEE-Eecc-cC----CeE-EEEEccCCCcEEEEecCCCeEEEE
Q 022074           37 GYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLR-ILAH-TS----DVN-TVCFGDESGHLIYSGSDDNLCKVW  109 (303)
Q Consensus        37 ~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~-~~~h-~~----~v~-~l~~~~~~~~~l~s~s~dg~v~lW  109 (303)
                      ..-..|..+-..|||+.+++-+. .++.++++.......+ +... .+    .|+ .+.. -..+.-++.++.||-|.-|
T Consensus       218 ~~~~~v~qllL~Pdg~~LYv~~g-~~~~v~~L~~r~l~~rkl~~dspg~~~~~Vte~l~l-L~Gg~SLLv~~~dG~vsQW  295 (733)
T COG4590         218 VPFSDVSQLLLTPDGKTLYVRTG-SELVVALLDKRSLQIRKLVDDSPGDSRHQVTEQLYL-LSGGFSLLVVHEDGLVSQW  295 (733)
T ss_pred             CCccchHhhEECCCCCEEEEecC-CeEEEEeecccccchhhhhhcCCCchHHHHHHHHHH-HhCceeEEEEcCCCceeee
Confidence            33446788889999998887655 5788998877654322 1111 11    122 1111 1235567788999999887


Q ss_pred             -cCccccCC-CccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEE
Q 022074          110 -DRRCLNVK-GKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWD  156 (303)
Q Consensus       110 -d~~~~~~~-~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWd  156 (303)
                       |.+..... -..++.+.-....+..+.-..+.+-+++-+..|++.++-
T Consensus       296 Fdvr~~~~p~l~h~R~f~l~pa~~~~l~pe~~rkgF~~l~~~G~L~~f~  344 (733)
T COG4590         296 FDVRRDGQPHLNHIRNFKLAPAEVQFLLPETNRKGFYSLYRNGTLQSFY  344 (733)
T ss_pred             eeeecCCCCcceeeeccccCcccceeeccccccceEEEEcCCCceeeee
Confidence             54311110 011111211122333333233334456666666666554


No 460
>COG3823 Glutamine cyclotransferase [Posttranslational modification, protein turnover, chaperones]
Probab=36.19  E-value=2.6e+02  Score=23.40  Aligned_cols=49  Identities=16%  Similarity=0.158  Sum_probs=30.7

Q ss_pred             CeEEEEEeCCCeEEEEECCCCeEEEEee------------cCCCCeEEEEECCCC-CeEEEE
Q 022074          222 QKYIYTGSHDSCVYVYDLVSGEQVAALK------------YHTSPVRDCSWHPSQ-PMLVSS  270 (303)
Q Consensus       222 ~~~latg~~dg~i~iwd~~~~~~~~~~~------------~h~~~I~~v~~sp~~-~~las~  270 (303)
                      |...|---.+..|-..|..+|+.+..++            .|..-.+.+++-|++ ++++||
T Consensus       186 G~lyANVw~t~~I~rI~p~sGrV~~widlS~L~~~~~~~~~~~nvlNGIA~~~~~~r~~iTG  247 (262)
T COG3823         186 GELYANVWQTTRIARIDPDSGRVVAWIDLSGLLKELNLDKSNDNVLNGIAHDPQQDRFLITG  247 (262)
T ss_pred             cEEEEeeeeecceEEEcCCCCcEEEEEEccCCchhcCccccccccccceeecCcCCeEEEec
Confidence            4444444455556666666666544432            344567899999987 678887


No 461
>PF14779 BBS1:  Ciliary BBSome complex subunit 1
Probab=36.00  E-value=2.2e+02  Score=24.41  Aligned_cols=55  Identities=15%  Similarity=0.251  Sum_probs=38.1

Q ss_pred             eEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEE---EC-CCCCeEEEEeCCCCEEE
Q 022074          223 KYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCS---WH-PSQPMLVSSSWDGDVVR  278 (303)
Q Consensus       223 ~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~---~s-p~~~~las~s~Dg~i~~  278 (303)
                      ..|+.|.++|.|+|.|......+..++-..-|+.=.+   |. -|. .++.++.||.|.+
T Consensus       196 scLViGTE~~~i~iLd~~af~il~~~~lpsvPv~i~~~G~~devdy-RI~Va~Rdg~iy~  254 (257)
T PF14779_consen  196 SCLVIGTESGEIYILDPQAFTILKQVQLPSVPVFISVSGQYDEVDY-RIVVACRDGKIYT  254 (257)
T ss_pred             ceEEEEecCCeEEEECchhheeEEEEecCCCceEEEEEeeeeccce-EEEEEeCCCEEEE
Confidence            5799999999999999988887777765555553222   22 222 3666667888765


No 462
>PF14779 BBS1:  Ciliary BBSome complex subunit 1
Probab=35.87  E-value=2.2e+02  Score=24.44  Aligned_cols=56  Identities=23%  Similarity=0.207  Sum_probs=37.8

Q ss_pred             EEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEc--cCCCcEEEEecCCCeEEE
Q 022074           53 ELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFG--DESGHLIYSGSDDNLCKV  108 (303)
Q Consensus        53 ~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~--~~~~~~l~s~s~dg~v~l  108 (303)
                      .++.|..+|.|.|.|...-....++.-..-++.-.+..  .+.+.+++.++.||.|++
T Consensus       197 cLViGTE~~~i~iLd~~af~il~~~~lpsvPv~i~~~G~~devdyRI~Va~Rdg~iy~  254 (257)
T PF14779_consen  197 CLVIGTESGEIYILDPQAFTILKQVQLPSVPVFISVSGQYDEVDYRIVVACRDGKIYT  254 (257)
T ss_pred             eEEEEecCCeEEEECchhheeEEEEecCCCceEEEEEeeeeccceEEEEEeCCCEEEE
Confidence            68999999999999998877665554443344322221  113457788888988875


No 463
>PRK13684 Ycf48-like protein; Provisional
Probab=35.47  E-value=3.2e+02  Score=24.27  Aligned_cols=112  Identities=12%  Similarity=0.041  Sum_probs=58.0

Q ss_pred             ccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEEEEecc----cCCeEEEEEccCCCcEEEEecCCCeEEEEcCccc
Q 022074           39 SFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSLRILAH----TSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCL  114 (303)
Q Consensus        39 ~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h----~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~  114 (303)
                      ...++++.+.++++.+++| ..|.+++=..+.+........+    ...+..+.+.+ .++.+ .++.+|.+.. ... .
T Consensus       214 ~~~l~~i~~~~~g~~~~vg-~~G~~~~~s~d~G~sW~~~~~~~~~~~~~l~~v~~~~-~~~~~-~~G~~G~v~~-S~d-~  288 (334)
T PRK13684        214 SRRLQSMGFQPDGNLWMLA-RGGQIRFNDPDDLESWSKPIIPEITNGYGYLDLAYRT-PGEIW-AGGGNGTLLV-SKD-G  288 (334)
T ss_pred             cccceeeeEcCCCCEEEEe-cCCEEEEccCCCCCccccccCCccccccceeeEEEcC-CCCEE-EEcCCCeEEE-eCC-C
Confidence            4568889999988876665 5676543234555433322211    13467777865 34454 4556776654 211 1


Q ss_pred             cCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEEE
Q 022074          115 NVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWD  156 (303)
Q Consensus       115 ~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWd  156 (303)
                      ..+........+-......+.+..+++.++ .|..|.|.-|+
T Consensus       289 G~tW~~~~~~~~~~~~~~~~~~~~~~~~~~-~G~~G~il~~~  329 (334)
T PRK13684        289 GKTWEKDPVGEEVPSNFYKIVFLDPEKGFV-LGQRGVLLRYV  329 (334)
T ss_pred             CCCCeECCcCCCCCcceEEEEEeCCCceEE-ECCCceEEEec
Confidence            112221111011123455555555555544 55668877775


No 464
>PF14269 Arylsulfotran_2:  Arylsulfotransferase (ASST)
Probab=32.71  E-value=1.7e+02  Score=25.64  Aligned_cols=63  Identities=21%  Similarity=0.380  Sum_probs=48.4

Q ss_pred             CCCeEEEEEeCCCeEEEEECCCCeEEEEeecCCC-----CeEEEEECCCCCeEEEEeCCCCEEEeecC
Q 022074          220 TGQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTS-----PVRDCSWHPSQPMLVSSSWDGDVVRWEFP  282 (303)
Q Consensus       220 ~~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~-----~I~~v~~sp~~~~las~s~Dg~i~~Wd~~  282 (303)
                      .+|.+|++.-.-..|.+.|.++|+.+=.+.+...     +-...+|-.|.+++-.+..+++|.++|-.
T Consensus       153 ~~G~yLiS~R~~~~i~~I~~~tG~I~W~lgG~~~~df~~~~~~f~~QHdar~~~~~~~~~~IslFDN~  220 (299)
T PF14269_consen  153 DDGDYLISSRNTSTIYKIDPSTGKIIWRLGGKRNSDFTLPATNFSWQHDARFLNESNDDGTISLFDNA  220 (299)
T ss_pred             CCccEEEEecccCEEEEEECCCCcEEEEeCCCCCCcccccCCcEeeccCCEEeccCCCCCEEEEEcCC
Confidence            5678999999999999999999988767654411     12235666777788788899999999974


No 465
>PF14870 PSII_BNR:  Photosynthesis system II assembly factor YCF48; PDB: 2XBG_A.
Probab=32.31  E-value=3.5e+02  Score=23.80  Aligned_cols=105  Identities=16%  Similarity=0.239  Sum_probs=48.8

Q ss_pred             EEEEcCCCCEEEEeeCCCeEEEEECCCCceEEE-EecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCccce
Q 022074           44 SLKFSTDGRELVAGSSDDCIYVYDLEANKLSLR-ILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAG  122 (303)
Q Consensus        44 ~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~~-~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~  122 (303)
                      .+....++. +...+..|.|+. ..+.|+.... .....+.+..+.-. +++++++.++.-....-||...  ....+..
T Consensus       108 ~i~~l~~~~-~~l~~~~G~iy~-T~DgG~tW~~~~~~~~gs~~~~~r~-~dG~~vavs~~G~~~~s~~~G~--~~w~~~~  182 (302)
T PF14870_consen  108 GITALGDGS-AELAGDRGAIYR-TTDGGKTWQAVVSETSGSINDITRS-SDGRYVAVSSRGNFYSSWDPGQ--TTWQPHN  182 (302)
T ss_dssp             EEEEEETTE-EEEEETT--EEE-ESSTTSSEEEEE-S----EEEEEE--TTS-EEEEETTSSEEEEE-TT---SS-EEEE
T ss_pred             EEEEcCCCc-EEEEcCCCcEEE-eCCCCCCeeEcccCCcceeEeEEEC-CCCcEEEEECcccEEEEecCCC--ccceEEc
Confidence            333333443 333445565422 2334444333 33344567777664 5677887776655566787320  0111111


Q ss_pred             eecccccCeEEEEeCCCCCEEEEEeCCCcEEEEE
Q 022074          123 VLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWD  156 (303)
Q Consensus       123 ~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWd  156 (303)
                      +  .-...+..+.|.+++.+.+.+ +.+.+++=+
T Consensus       183 r--~~~~riq~~gf~~~~~lw~~~-~Gg~~~~s~  213 (302)
T PF14870_consen  183 R--NSSRRIQSMGFSPDGNLWMLA-RGGQIQFSD  213 (302)
T ss_dssp             ----SSS-EEEEEE-TTS-EEEEE-TTTEEEEEE
T ss_pred             c--CccceehhceecCCCCEEEEe-CCcEEEEcc
Confidence            1  123568899999998776644 888888776


No 466
>PHA02790 Kelch-like protein; Provisional
Probab=31.80  E-value=4.4e+02  Score=24.78  Aligned_cols=97  Identities=10%  Similarity=0.071  Sum_probs=45.5

Q ss_pred             CCCEEEEeeCCC---eEEEEECCCCceEEEEec--cc-CCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCcccee
Q 022074           50 DGRELVAGSSDD---CIYVYDLEANKLSLRILA--HT-SDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGV  123 (303)
Q Consensus        50 ~g~~l~sgs~Dg---~v~lwd~~~~~~~~~~~~--h~-~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~  123 (303)
                      +|+..+.||.++   ++..||..+++... ...  .. .....+..   ++++.+.|+   .+..||.+..  .......
T Consensus       362 ~g~IYviGG~~~~~~~ve~ydp~~~~W~~-~~~m~~~r~~~~~~~~---~~~IYv~GG---~~e~ydp~~~--~W~~~~~  432 (480)
T PHA02790        362 NNVIYVIGGHSETDTTTEYLLPNHDQWQF-GPSTYYPHYKSCALVF---GRRLFLVGR---NAEFYCESSN--TWTLIDD  432 (480)
T ss_pred             CCEEEEecCcCCCCccEEEEeCCCCEEEe-CCCCCCccccceEEEE---CCEEEEECC---ceEEecCCCC--cEeEcCC
Confidence            577777777654   46788887765432 111  11 11122222   255666664   4667776422  2222221


Q ss_pred             ecccccCeEEEEeCCCCCEEEEEeCC-----CcEEEEEc
Q 022074          124 LMGHLEGITFIDSRGDGRYLISNGKD-----QAIKLWDI  157 (303)
Q Consensus       124 ~~~h~~~v~~~~~~~~~~~l~s~~~D-----~~v~lWdl  157 (303)
                      +.........+..  ++...+.||.+     .++..||.
T Consensus       433 m~~~r~~~~~~v~--~~~IYviGG~~~~~~~~~ve~Yd~  469 (480)
T PHA02790        433 PIYPRDNPELIIV--DNKLLLIGGFYRGSYIDTIEVYNN  469 (480)
T ss_pred             CCCCccccEEEEE--CCEEEEECCcCCCcccceEEEEEC
Confidence            2111112222222  56677888765     23445554


No 467
>cd01268 Numb Numb Phosphotyrosine-binding (PTB) domain. Numb Phosphotyrosine-binding (PTB) domain. Numb is a membrane associated adaptor protein, which is a determinant of asymmetric cell division.  Numb has an N-terminal PTB domain.  PTB domains have a PH-like fold and are found in various eukaryotic signaling molecules. They were initially identified based upon their ability to recognize phosphorylated tyrosine residues. In contrast to SH2 domains, which recognize phosphotyrosine and adjacent carboxy-terminal residues, PTB-domain binding specificity is conferred by residues amino-terminal to the phosphotyrosine. More recent studies have found that some types of PTB domains can bind to peptides which are not tyrosine phosphorylated or lack tyrosine residues altogether.
Probab=30.89  E-value=2.5e+02  Score=21.57  Aligned_cols=53  Identities=13%  Similarity=0.038  Sum_probs=32.8

Q ss_pred             CEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEE
Q 022074           52 RELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCK  107 (303)
Q Consensus        52 ~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~  107 (303)
                      +.++.-|.|| |+|.|..++.+.....  -..|+.++..+.+.+.|+-...|+.-.
T Consensus        51 kv~L~VS~~G-i~vvd~~Tk~~i~~~~--i~~ISfca~D~~d~r~FayIakd~~~~  103 (138)
T cd01268          51 KAVLWVSGDG-LRVVDEKTKGLIVDQT--IEKVSFCAPDRNFDRGFSYICRDGTTR  103 (138)
T ss_pred             EEEEEEecCc-EEEEecCCCcEEEEEe--EEEEEEEecCCCCCcEEEEEecCCCcc
Confidence            3567778888 9999998887654321  123444444445566777666666543


No 468
>KOG3616 consensus Selective LIM binding factor [Transcription]
Probab=30.29  E-value=1.2e+02  Score=30.38  Aligned_cols=31  Identities=13%  Similarity=0.353  Sum_probs=26.8

Q ss_pred             eEEEEEcCCCCEEEEeeCCCeEEEEECCCCc
Q 022074           42 IFSLKFSTDGRELVAGSSDDCIYVYDLEANK   72 (303)
Q Consensus        42 v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~   72 (303)
                      +.++.-+|.|+-++.+..||+|++|+.-..+
T Consensus        17 ~~aiqshp~~~s~v~~~~d~si~lfn~~~r~   47 (1636)
T KOG3616|consen   17 TTAIQSHPGGQSFVLAHQDGSIILFNFIPRR   47 (1636)
T ss_pred             eeeeeecCCCceEEEEecCCcEEEEeecccc
Confidence            6778888999999999999999999876554


No 469
>PF01731 Arylesterase:  Arylesterase;  InterPro: IPR002640  The serum paraoxonases/arylesterases are enzymes that catalyse the hydrolysis of the toxic metabolites of a variety of organophosphorus insecticides. The enzymes hydrolyse a broad spectrum of organophosphate substrates, including paraoxon and a number of aromatic carboxylic acid esters (e.g., phenyl acetate), and hence confer resistance to organophosphate toxicity [].   Mammals have 3 distinct paraoxonase types, termed PON1-3 [, ]. In mice and humans, the PON genes are found on the same chromosome in close proximity. PON activity has been found in variety of tissues, with highest levels in liver and serum - the source of serum PON is thought to be the liver. Unlike mammals, fish and avian species lack paraoxonase activity.   Human and rabbit PONs appear to have two distinct Ca2+ binding sites, one required for stability and one required for catalytic activity. The Ca2+ dependency of PONs suggests a mechanism of hydrolysis where Ca2+ acts as the electrophillic catalyst, like that proposed for phospholipase A2. The paraoxonase enzymes, PON1 and PON3, are high density lipoprotein (HDL)- associated proteins capable of preventing oxidative modification of low density lipoproteins (LPL) []. Although PON2 has oxidative properties, the enzyme does not associate with HDL.   Within a given species, PON1, PON2 and PON3 share ~60% amino acid sequence identity, whereas between mammalian species particular PONs (1,2 or 3) share 79-90% identity at the amino acid level. Human PON1 and PON3 share numerous conserved phosphorylation and N-glycosylation sites; however, it is not known whether the PON proteins are modified at these sites, or whether modification at these sites is required for activity in vivo [].  This family consists of arylesterases (Also known as serum paraoxonase) 3.1.1.2 from EC. These enzymes hydrolyse organophosphorus esters such as paraoxon and are found in the liver and blood. 
They confer resistance to organophosphate toxicity []. Human arylesterase (PON1) P27169 from SWISSPROT is associated with HDL and may protect against LDL oxidation [].; GO: 0004064 arylesterase activity
Probab=30.24  E-value=1.9e+02  Score=20.09  Aligned_cols=48  Identities=19%  Similarity=0.128  Sum_probs=30.5

Q ss_pred             CeEEEEECCCCeEEEEeecCCCCeEEEEECCCCCe-EEEEeCCCCEEEeecC
Q 022074          232 SCVYVYDLVSGEQVAALKYHTSPVRDCSWHPSQPM-LVSSSWDGDVVRWEFP  282 (303)
Q Consensus       232 g~i~iwd~~~~~~~~~~~~h~~~I~~v~~sp~~~~-las~s~Dg~i~~Wd~~  282 (303)
                      +.|..||.++   ......--...+.+..+|++++ .++....++|++++..
T Consensus        36 ~~Vvyyd~~~---~~~va~g~~~aNGI~~s~~~k~lyVa~~~~~~I~vy~~~   84 (86)
T PF01731_consen   36 GNVVYYDGKE---VKVVASGFSFANGIAISPDKKYLYVASSLAHSIHVYKRH   84 (86)
T ss_pred             ceEEEEeCCE---eEEeeccCCCCceEEEcCCCCEEEEEeccCCeEEEEEec
Confidence            4466676543   2222222245689999999885 4555567899998864


No 470
>PLN00033 photosystem II stability/assembly factor; Provisional
Probab=30.06  E-value=4.4e+02  Score=24.23  Aligned_cols=129  Identities=6%  Similarity=-0.066  Sum_probs=0.0

Q ss_pred             cccccccCcCcc-cccCCCcccceEEEEEcCCCCEEEEeeCCCeEEEEECCCCc-----eEEEEecccCC--eEEEEEcc
Q 022074           20 NVTEIHDGLDFS-AADDGGYSFGIFSLKFSTDGRELVAGSSDDCIYVYDLEANK-----LSLRILAHTSD--VNTVCFGD   91 (303)
Q Consensus        20 ~~~~~~~~~~~~-~~~~~~~~~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~-----~~~~~~~h~~~--v~~l~~~~   91 (303)
                      ++..-++..+.. .+...+-...+.++.|.++|..++++ .+|.+ ++....+.     ...........  +..+.+.+
T Consensus       260 ~~~~s~d~G~~~W~~~~~~~~~~l~~v~~~~dg~l~l~g-~~G~l-~~S~d~G~~~~~~~f~~~~~~~~~~~l~~v~~~~  337 (398)
T PLN00033        260 NFYLTWEPGQPYWQPHNRASARRIQNMGWRADGGLWLLT-RGGGL-YVSKGTGLTEEDFDFEEADIKSRGFGILDVGYRS  337 (398)
T ss_pred             cEEEecCCCCcceEEecCCCccceeeeeEcCCCCEEEEe-CCceE-EEecCCCCcccccceeecccCCCCcceEEEEEcC


Q ss_pred             CCCcEEEEecCCCeEEEEcCccccCCCccceeecccccCeEEEEeCCCCCEEEEEeCCCcEEEE
Q 022074           92 ESGHLIYSGSDDNLCKVWDRRCLNVKGKPAGVLMGHLEGITFIDSRGDGRYLISNGKDQAIKLW  155 (303)
Q Consensus        92 ~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lW  155 (303)
                        +..++.++.+|.+....  ....+......-..-......+.|.++++-+++| .+|.|.-|
T Consensus       338 --d~~~~a~G~~G~v~~s~--D~G~tW~~~~~~~~~~~~ly~v~f~~~~~g~~~G-~~G~il~~  396 (398)
T PLN00033        338 --KKEAWAAGGSGILLRST--DGGKSWKRDKGADNIAANLYSVKFFDDKKGFVLG-NDGVLLRY  396 (398)
T ss_pred             --CCcEEEEECCCcEEEeC--CCCcceeEccccCCCCcceeEEEEcCCCceEEEe-CCcEEEEe


No 471
>COG4590 ABC-type uncharacterized transport system, permease component [General function prediction only]
Probab=30.02  E-value=4.8e+02  Score=24.63  Aligned_cols=105  Identities=12%  Similarity=0.162  Sum_probs=61.3

Q ss_pred             CCCCEEEEeeCCCeEEEE-ECCCCceE--EEEec---ccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccccCCCccce
Q 022074           49 TDGRELVAGSSDDCIYVY-DLEANKLS--LRILA---HTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCLNVKGKPAG  122 (303)
Q Consensus        49 ~~g~~l~sgs~Dg~v~lw-d~~~~~~~--~~~~~---h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~~~~~~~~~  122 (303)
                      ..|.-+++++.||-|.-| |+..+...  ..+..   ....+..+.- ..+.+.|++-+.+|++.++-..     .++.-
T Consensus       278 ~Gg~SLLv~~~dG~vsQWFdvr~~~~p~l~h~R~f~l~pa~~~~l~p-e~~rkgF~~l~~~G~L~~f~st-----~~~~l  351 (733)
T COG4590         278 SGGFSLLVVHEDGLVSQWFDVRRDGQPHLNHIRNFKLAPAEVQFLLP-ETNRKGFYSLYRNGTLQSFYST-----SEKLL  351 (733)
T ss_pred             hCceeEEEEcCCCceeeeeeeecCCCCcceeeeccccCcccceeecc-ccccceEEEEcCCCceeeeecc-----cCcce
Confidence            456678889999998777 55443211  11111   1123333332 1235567777888888876521     11122


Q ss_pred             eecccccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccc
Q 022074          123 VLMGHLEGITFIDSRGDGRYLISNGKDQAIKLWDIRKM  160 (303)
Q Consensus       123 ~~~~h~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~  160 (303)
                      .+..-.++..-++++|.+.++++-. .+.++++.+...
T Consensus       352 L~~~~~~~~~~~~~Sp~~~~Ll~e~-~gki~~~~l~Nr  388 (733)
T COG4590         352 LFERAYQAPQLVAMSPNQAYLLSED-QGKIRLAQLENR  388 (733)
T ss_pred             ehhhhhcCcceeeeCcccchheeec-CCceEEEEecCC
Confidence            2222334556678899998888875 488999988754


No 472
>PF12657 TFIIIC_delta:  Transcription factor IIIC subunit delta N-term;  InterPro: IPR024761  This entry represents a domain found towards the N terminus of the 90 kDa subunit of transcription factor IIIC (also known as subunit 9 in yeast []). The whole subunit is involved in RNA polymerase III-mediated transcription. It is possible that this N-terminal domain interacts with TFIIIC subunit 8 [].
Probab=28.51  E-value=1.2e+02  Score=24.05  Aligned_cols=30  Identities=27%  Similarity=0.519  Sum_probs=25.9

Q ss_pred             CeEEEEECCCCC------eEEEEeCCCCEEEeecCC
Q 022074          254 PVRDCSWHPSQP------MLVSSSWDGDVVRWEFPG  283 (303)
Q Consensus       254 ~I~~v~~sp~~~------~las~s~Dg~i~~Wd~~~  283 (303)
                      .+.+++|||.|-      +||.-..++.+.+|....
T Consensus        87 ~vv~~aWSP~Gl~~~~rClLavLTs~~~l~l~~~~~  122 (173)
T PF12657_consen   87 QVVSAAWSPSGLGPNGRCLLAVLTSNGRLSLYGPPG  122 (173)
T ss_pred             cEEEEEECCCCCCCCCceEEEEEcCCCeEEEEecCC
Confidence            789999999652      899999999999999764


No 473
>TIGR03118 PEPCTERM_chp_1 conserved hypothetical protein TIGR03118. This model describes an uncharacterized conserved hypothetical protein. Members are found with the C-terminal putative exosortase interaction domain, PEP-CTERM, in Nitrosospira multiformis, Rhodoferax ferrireducens, Solibacter usitatus Ellin6076, and Acidobacteria bacterium Ellin345. It is found without the PEP-CTERM domain in several other species, including Burkholderia ambifaria, Gloeobacter violaceus PCC 7421, and three copies in the Acanthamoeba polyphaga mimivirus.
Probab=28.51  E-value=4.2e+02  Score=23.52  Aligned_cols=219  Identities=15%  Similarity=0.168  Sum_probs=112.7

Q ss_pred             eEEEEEcCCCCEEEEeeCCCeEEEEECCCC-------ceEEEEec-----ccCCeEEEEEccC-----------CCcEEE
Q 022074           42 IFSLKFSTDGRELVAGSSDDCIYVYDLEAN-------KLSLRILA-----HTSDVNTVCFGDE-----------SGHLIY   98 (303)
Q Consensus        42 v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~-------~~~~~~~~-----h~~~v~~l~~~~~-----------~~~~l~   98 (303)
                      -+.|+++|.+..-++....++..+||....       .+...+..     .....+-+.|+..           ....|+
T Consensus        25 ~WGia~~p~~~~WVadngT~~~TlYdg~~~~~~g~~~~L~vtiP~~~~~~~~~~PTGiVfN~~~~F~vt~~g~~~~a~Fi  104 (336)
T TIGR03118        25 AWGLSYRPGGPFWVANTGTGTATLYVGNPDTQPLVQDPLVVVIPAPPPLAAEGTPTGQVFNGSDTFVVSGEGITGPSRFL  104 (336)
T ss_pred             cceeEecCCCCEEEecCCcceEEeecCCcccccCCccceEEEecCCCCCCCCCCccEEEEeCCCceEEcCCCcccceeEE
Confidence            467999999988888778889999999721       12222221     1123445555321           123577


Q ss_pred             EecCCCeEEEEcCccccCCC--ccceeec-ccccCe-EEEEeC--CCCCEEEEEe-CCCcEEEEEcccccCCcccccCcc
Q 022074           99 SGSDDNLCKVWDRRCLNVKG--KPAGVLM-GHLEGI-TFIDSR--GDGRYLISNG-KDQAIKLWDIRKMSSNASCNLGFR  171 (303)
Q Consensus        99 s~s~dg~v~lWd~~~~~~~~--~~~~~~~-~h~~~v-~~~~~~--~~~~~l~s~~-~D~~v~lWdl~~~~~~~~~~~~~~  171 (303)
                      .+++||+|.-|..... .+.  .....+. +...+| ..+++.  ..+.+|..+. ..++|.++|-.-.+...  ...+.
T Consensus       105 f~tEdGTisaW~p~v~-~t~~~~~~~~~d~s~~gavYkGLAi~~~~~~~~LYaadF~~g~IDVFd~~f~~~~~--~g~F~  181 (336)
T TIGR03118       105 FVTEDGTLSGWAPALG-TTRMTRAEIVVDASQQGNVYKGLAVGPTGGGDYLYAANFRQGRIDVFKGSFRPPPL--PGSFI  181 (336)
T ss_pred             EEeCCceEEeecCcCC-cccccccEEEEccCCCcceeeeeEEeecCCCceEEEeccCCCceEEecCccccccC--CCCcc
Confidence            8999999999984311 110  0111121 111233 223332  2344554433 57888888754221100  00000


Q ss_pred             ceeeeceeeeCCCCCccccCCCCCcceEEecccceeeeEEEeeeeeeeCCCeEEEEEeCCCeEEEEECCCCeEEEEee--
Q 022074          172 SYEWDYRWMDYPPQARDLKHPCDQSVATYKGHSVLRTLIRCHFSPVYSTGQKYIYTGSHDSCVYVYDLVSGEQVAALK--  249 (303)
Q Consensus       172 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~s~~~~~latg~~dg~i~iwd~~~~~~~~~~~--  249 (303)
                                 .|....--.+  --+..+.+.-.+.+..+       .++++.=+.|-.-|.|-++|. .|+++.++.  
T Consensus       182 -----------DP~iPagyAP--FnIqnig~~lyVtYA~q-------d~~~~d~v~G~G~G~VdvFd~-~G~l~~r~as~  240 (336)
T TIGR03118       182 -----------DPALPAGYAP--FNVQNLGGTLYVTYAQQ-------DADRNDEVAGAGLGYVNVFTL-NGQLLRRVASS  240 (336)
T ss_pred             -----------CCCCCCCCCC--cceEEECCeEEEEEEec-------CCcccccccCCCcceEEEEcC-CCcEEEEeccC
Confidence                       0000000000  01122222111111110       112222344556789999997 578887773  


Q ss_pred             cCCCCeEEEEECC------CCCeEEEEeCCCCEEEeecCCC
Q 022074          250 YHTSPVRDCSWHP------SQPMLVSSSWDGDVVRWEFPGN  284 (303)
Q Consensus       250 ~h~~~I~~v~~sp------~~~~las~s~Dg~i~~Wd~~~~  284 (303)
                      +.-...|.|+..|      .+.+|+.-=.||+|..+|....
T Consensus       241 g~LNaPWG~a~APa~FG~~sg~lLVGNFGDG~InaFD~~sG  281 (336)
T TIGR03118       241 GRLNAPWGLAIAPESFGSLSGALLVGNFGDGTINAYDPQSG  281 (336)
T ss_pred             CcccCCceeeeChhhhCCCCCCeEEeecCCceeEEecCCCC
Confidence            3345668888866      4678888888999999997543


No 474
>PF12768 Rax2:  Cortical protein marker for cell polarity
Probab=28.14  E-value=4e+02  Score=23.16  Aligned_cols=75  Identities=19%  Similarity=0.348  Sum_probs=45.4

Q ss_pred             CCcccceEEEEEcCCCCEEEEee------CCCeEEEEECCCCceEEEEecc-----cCCeEEEEEccCCC-cEEEEec-C
Q 022074           36 GGYSFGIFSLKFSTDGRELVAGS------SDDCIYVYDLEANKLSLRILAH-----TSDVNTVCFGDESG-HLIYSGS-D  102 (303)
Q Consensus        36 ~~~~~~v~~l~~s~~g~~l~sgs------~Dg~v~lwd~~~~~~~~~~~~h-----~~~v~~l~~~~~~~-~~l~s~s-~  102 (303)
                      ++-+..|.++.|..+.+.+++|.      ....+..||.++.... .+..-     .++|..+.+...+. +..+.|. .
T Consensus        33 ~~i~G~V~~l~~~~~~~Llv~G~ft~~~~~~~~la~yd~~~~~w~-~~~~~~s~~ipgpv~a~~~~~~d~~~~~~aG~~~  111 (281)
T PF12768_consen   33 NGISGTVTDLQWASNNQLLVGGNFTLNGTNSSNLATYDFKNQTWS-SLGGGSSNSIPGPVTALTFISNDGSNFWVAGRSA  111 (281)
T ss_pred             CCceEEEEEEEEecCCEEEEEEeeEECCCCceeEEEEecCCCeee-ecCCcccccCCCcEEEEEeeccCCceEEEeceec
Confidence            45677899999996555555554      3456888999887542 23331     26788887744333 3444443 2


Q ss_pred             --CCeEEEEcC
Q 022074          103 --DNLCKVWDR  111 (303)
Q Consensus       103 --dg~v~lWd~  111 (303)
                        +..+.-||-
T Consensus       112 ~g~~~l~~~dG  122 (281)
T PF12768_consen  112 NGSTFLMKYDG  122 (281)
T ss_pred             CCCceEEEEcC
Confidence              335666774


No 475
>TIGR03606 non_repeat_PQQ dehydrogenase, PQQ-dependent, s-GDH family. PQQ, or pyrroloquinoline-quinone, serves as a cofactor for a number of sugar and alcohol dehydrogenases in a limited number of bacterial species. Most characterized PQQ-dependent enzymes have multiple repeats of a sequence region described by pfam01011 (PQQ enzyme repeat), but this protein family is unusual in lacking that repeat. Below the noise cutoff are related proteins mostly from species that lack PQQ biosynthesis.
Probab=26.59  E-value=3.1e+02  Score=25.74  Aligned_cols=52  Identities=12%  Similarity=0.163  Sum_probs=33.8

Q ss_pred             eEEEEEcCCCCEEEEeeCCCeEEEEECCCCceEE--E---E-ec-ccCCeEEEEEccCC
Q 022074           42 IFSLKFSTDGRELVAGSSDDCIYVYDLEANKLSL--R---I-LA-HTSDVNTVCFGDES   93 (303)
Q Consensus        42 v~~l~~s~~g~~l~sgs~Dg~v~lwd~~~~~~~~--~---~-~~-h~~~v~~l~~~~~~   93 (303)
                      -..|+|.|||+.+++--..|.|++++..++....  .   + .. -.++...++++|+.
T Consensus        32 Pw~maflPDG~llVtER~~G~I~~v~~~~~~~~~~~~l~~v~~~~ge~GLlglal~PdF   90 (454)
T TIGR03606        32 PWALLWGPDNQLWVTERATGKILRVNPETGEVKVVFTLPEIVNDAQHNGLLGLALHPDF   90 (454)
T ss_pred             ceEEEEcCCCeEEEEEecCCEEEEEeCCCCceeeeecCCceeccCCCCceeeEEECCCc
Confidence            5688999999776665446899999765543211  1   1 11 24677888987653


No 476
>PF10584 Proteasome_A_N:  Proteasome subunit A N-terminal signature;  InterPro: IPR000426 The proteasome (or macropain) (3.4.25.1 from EC) [, , , , ] is a eukaryotic and archaeal multicatalytic proteinase complex that seems to be involved in an ATP/ubiquitin-dependent nonlysosomal proteolytic pathway. In eukaryotes the proteasome is composed of about 28 distinct subunits which form a highly ordered ring-shaped structure (20S ring) of about 700 kDa. Most proteasome subunits can be classified, on the basis of sequence similarities into two groups, alpha (A) and beta (B). This family contains the alpha subunit sequences which range from 210 to 290 amino acids. These sequences are classified as non-peptidase homologues in MEROPS peptidase family T1 (clan PB(T)). ; GO: 0004175 endopeptidase activity, 0006511 ubiquitin-dependent protein catabolic process, 0019773 proteasome core complex, alpha-subunit complex; PDB: 3H4P_M 1IRU_O 3UN4_U 1FNT_A 3OEV_G 3OEU_U 3SDK_U 3DY3_G 3MG7_G 3L5Q_C ....
Probab=26.42  E-value=18  Score=18.37  Aligned_cols=8  Identities=13%  Similarity=0.580  Sum_probs=5.4

Q ss_pred             EECCCCCe
Q 022074          259 SWHPSQPM  266 (303)
Q Consensus       259 ~~sp~~~~  266 (303)
                      .|||+|++
T Consensus         7 ~FSp~Grl   14 (23)
T PF10584_consen    7 TFSPDGRL   14 (23)
T ss_dssp             SBBTTSSB
T ss_pred             eECCCCeE
Confidence            47787764


No 477
>PF14781 BBS2_N:  Ciliary BBSome complex subunit 2, N-terminal
Probab=25.55  E-value=3.1e+02  Score=20.98  Aligned_cols=105  Identities=12%  Similarity=0.193  Sum_probs=58.6

Q ss_pred             EEcCCCCEEEEeeCCCeEEEEECCCCce-------EEEEecccCCeEEEEEcc----CCCcEEEEecCCCeEEEEcCccc
Q 022074           46 KFSTDGRELVAGSSDDCIYVYDLEANKL-------SLRILAHTSDVNTVCFGD----ESGHLIYSGSDDNLCKVWDRRCL  114 (303)
Q Consensus        46 ~~s~~g~~l~sgs~Dg~v~lwd~~~~~~-------~~~~~~h~~~v~~l~~~~----~~~~~l~s~s~dg~v~lWd~~~~  114 (303)
                      .|......|++++.-|+|.|.+......       ..++..-+..|++++-.+    +..+.|+.|+.. .+-.||....
T Consensus         5 kfDG~~pcL~~aT~~gKV~IH~ph~~~~~~~~~~~~i~~LNin~~italaaG~l~~~~~~D~LliGt~t-~llaYDV~~N   83 (136)
T PF14781_consen    5 KFDGVHPCLACATTGGKVFIHNPHERGQRTGRQDSDISFLNINQEITALAAGRLKPDDGRDCLLIGTQT-SLLAYDVENN   83 (136)
T ss_pred             EeCCCceeEEEEecCCEEEEECCCccccccccccCceeEEECCCceEEEEEEecCCCCCcCEEEEeccc-eEEEEEcccC
Confidence            3455555788888999999998764421       123455667788886532    235567777654 7778997421


Q ss_pred             cCCCccceeecccccCeEEEEeC---C-CCCEEEEEeCCCcEEEEEc
Q 022074          115 NVKGKPAGVLMGHLEGITFIDSR---G-DGRYLISNGKDQAIKLWDI  157 (303)
Q Consensus       115 ~~~~~~~~~~~~h~~~v~~~~~~---~-~~~~l~s~~~D~~v~lWdl  157 (303)
                         ....  +..-.++|.++.+.   . +.++++.|| +-.|.=||.
T Consensus        84 ---~d~F--yke~~DGvn~i~~g~~~~~~~~l~ivGG-ncsi~Gfd~  124 (136)
T PF14781_consen   84 ---SDLF--YKEVPDGVNAIVIGKLGDIPSPLVIVGG-NCSIQGFDY  124 (136)
T ss_pred             ---chhh--hhhCccceeEEEEEecCCCCCcEEEECc-eEEEEEeCC
Confidence               1111  11233566666542   2 334444443 344444443


No 478
>KOG3616 consensus Selective LIM binding factor [Transcription]
Probab=25.27  E-value=1.3e+02  Score=30.16  Aligned_cols=59  Identities=14%  Similarity=0.351  Sum_probs=38.7

Q ss_pred             eCCCeEEEEEeCCCeEEEEECCCCeE--EEEeecCCCCeEEEEECCCCCeEEEEeCCCCEEEeecC
Q 022074          219 STGQKYIYTGSHDSCVYVYDLVSGEQ--VAALKYHTSPVRDCSWHPSQPMLVSSSWDGDVVRWEFP  282 (303)
Q Consensus       219 s~~~~~latg~~dg~i~iwd~~~~~~--~~~~~~h~~~I~~v~~sp~~~~las~s~Dg~i~~Wd~~  282 (303)
                      +|.++-++.+..||.|.+|+...+..  +.+.   ..|-..+.|...|  |+++..|+...-|.-.
T Consensus        23 hp~~~s~v~~~~d~si~lfn~~~r~qski~~~---~~p~~nlv~tnhg--l~~~tsdrr~la~~~d   83 (1636)
T KOG3616|consen   23 HPGGQSFVLAHQDGSIILFNFIPRRQSKICEE---AKPKENLVFTNHG--LVTATSDRRALAWKED   83 (1636)
T ss_pred             cCCCceEEEEecCCcEEEEeecccchhhhhhh---cCCccceeeeccc--eEEEeccchhheeecc
Confidence            46788899999999999999876543  3222   2244445554444  5555567777777643


No 479
>KOG1897 consensus Damage-specific DNA binding complex, subunit DDB1 [Replication, recombination and repair]
Probab=25.15  E-value=8e+02  Score=25.61  Aligned_cols=110  Identities=14%  Similarity=0.009  Sum_probs=55.2

Q ss_pred             CCcccceEEEEEcC-CCCEEEEeeCCCeEEEEECCCCceEEEEecccCCeEEEEEccCCCcEEEEecCCCeEEEEcCccc
Q 022074           36 GGYSFGIFSLKFST-DGRELVAGSSDDCIYVYDLEANKLSLRILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVWDRRCL  114 (303)
Q Consensus        36 ~~~~~~v~~l~~s~-~g~~l~sgs~Dg~v~lwd~~~~~~~~~~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lWd~~~~  114 (303)
                      .|+..-..++..|. .|+.|+=...+ .||+++-.  .....+....+.....++.  +...++.++.++.+...++...
T Consensus       445 ~gf~~~~~Tif~S~i~g~~lvQvTs~-~iRl~ss~--~~~~~W~~p~~~ti~~~~~--n~sqVvvA~~~~~l~y~~i~~~  519 (1096)
T KOG1897|consen  445 PGFSTDEQTIFCSTINGNQLVQVTSN-SIRLVSSA--GLRSEWRPPGKITIGVVSA--NASQVVVAGGGLALFYLEIEDG  519 (1096)
T ss_pred             ccccccCceEEEEccCCceEEEEecc-cEEEEcch--hhhhcccCCCceEEEEEee--cceEEEEecCccEEEEEEeecc
Confidence            45554445554442 34443333333 48888765  2233444444444444442  2446666776666666665321


Q ss_pred             cCCCccceeec--ccccCeEEEEeCCCC------CEEEEEeCCCcEEEE
Q 022074          115 NVKGKPAGVLM--GHLEGITFIDSRGDG------RYLISNGKDQAIKLW  155 (303)
Q Consensus       115 ~~~~~~~~~~~--~h~~~v~~~~~~~~~------~~l~s~~~D~~v~lW  155 (303)
                      .     .....  .-...|.|++++|-|      ++++.|-.+..+.+-
T Consensus       520 ~-----l~e~~~~~~e~evaCLDisp~~d~~~~s~~~aVG~Ws~~~~~l  563 (1096)
T KOG1897|consen  520 G-----LREVSHKEFEYEVACLDISPLGDAPNKSRLLAVGLWSDISMIL  563 (1096)
T ss_pred             c-----eeeeeeheecceeEEEecccCCCCCCcceEEEEEeecceEEEE
Confidence            1     11111  123468899888642      256666655554444


No 480
>COG5308 NUP170 Nuclear pore complex subunit [Intracellular trafficking and secretion]
Probab=24.93  E-value=2.5e+02  Score=28.75  Aligned_cols=28  Identities=21%  Similarity=0.407  Sum_probs=22.5

Q ss_pred             cCeEEEEeCCCCCEEEEEeCCCcEEEEEcc
Q 022074          129 EGITFIDSRGDGRYLISNGKDQAIKLWDIR  158 (303)
Q Consensus       129 ~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~  158 (303)
                      -.|.++....+|+.+.+|..|  +.+|.+.
T Consensus       182 inV~civs~e~GrIFf~g~~d--~nvyEl~  209 (1263)
T COG5308         182 INVRCIVSEEDGRIFFGGEND--PNVYELV  209 (1263)
T ss_pred             ceeEEEEeccCCcEEEecCCC--CCeEEEE
Confidence            457788777789988888887  8899865


No 481
>KOG1900 consensus Nuclear pore complex, Nup155 component (D Nup154, sc Nup157/Nup170) [Nuclear structure; Intracellular trafficking, secretion, and vesicular transport]
Probab=24.93  E-value=1.8e+02  Score=30.82  Aligned_cols=70  Identities=19%  Similarity=0.394  Sum_probs=48.1

Q ss_pred             cceEEEEEcCCCCEEEEeeCCCeEEEEECC----CCce--------------------EEEEe-cccCCeEEEEEccCCC
Q 022074           40 FGIFSLKFSTDGRELVAGSSDDCIYVYDLE----ANKL--------------------SLRIL-AHTSDVNTVCFGDESG   94 (303)
Q Consensus        40 ~~v~~l~~s~~g~~l~sgs~Dg~v~lwd~~----~~~~--------------------~~~~~-~h~~~v~~l~~~~~~~   94 (303)
                      ..|.|+....+|+.+++| .||  .||++.    .+-.                    ..++. .+.++|..+.. .++.
T Consensus       179 ~~V~~I~~t~nGRIF~~G-~dg--~lyEl~Yq~~~gWf~~rc~Kiclt~s~ls~lvPs~~~~~~~~~dpI~qi~I-D~SR  254 (1311)
T KOG1900|consen  179 VSVNCITYTENGRIFFAG-RDG--NLYELVYQAEDGWFGSRCRKICLTKSVLSSLVPSLLSVPGSSKDPIRQITI-DNSR  254 (1311)
T ss_pred             ceEEEEEeccCCcEEEee-cCC--CEEEEEEeccCchhhcccccccCchhHHHHhhhhhhcCCCCCCCcceeeEe-cccc
Confidence            458899988899887766 555  345542    2210                    01223 45678888888 4567


Q ss_pred             cEEEEecCCCeEEEEcCcc
Q 022074           95 HLIYSGSDDNLCKVWDRRC  113 (303)
Q Consensus        95 ~~l~s~s~dg~v~lWd~~~  113 (303)
                      ..+.+=++.|+|..||+..
T Consensus       255 ~IlY~lsek~~v~~Y~i~~  273 (1311)
T KOG1900|consen  255 NILYVLSEKGTVSAYDIGG  273 (1311)
T ss_pred             ceeeeeccCceEEEEEccC
Confidence            7888999999999999853


No 482
>PF08801 Nucleoporin_N:  Nup133 N terminal like;  InterPro: IPR014908 Nucleoporins are the main components of the nuclear pore complex (NPC) in eukaryotic cells, and mediate bidirectional nucleocytoplasmic transport, especially of mRNA and proteins. RNA undergoing nuclear export first encounters the basket of the nuclear pore and many nucleoporins are accessible on the basket side of the pore.  This entry represents the N-terminal of Nucleoprotein which forms a seven-bladed beta propeller structure. ; PDB: 1XKS_A.
Probab=24.39  E-value=5.5e+02  Score=23.48  Aligned_cols=30  Identities=20%  Similarity=0.458  Sum_probs=24.9

Q ss_pred             CeEEEEECCCCCeEEEEeCCCCEEEeecCC
Q 022074          254 PVRDCSWHPSQPMLVSSSWDGDVVRWEFPG  283 (303)
Q Consensus       254 ~I~~v~~sp~~~~las~s~Dg~i~~Wd~~~  283 (303)
                      .|.+++..+..+.|.+...+|.|++|++..
T Consensus       191 ~I~~v~~d~~r~~ly~l~~~~~Iq~w~l~~  220 (422)
T PF08801_consen  191 KIVQVAVDPSRRLLYTLTSDGSIQVWDLGP  220 (422)
T ss_dssp             -EEEEEEETTTTEEEEEESSE-EEEEEE-S
T ss_pred             ceeeEEecCCcCEEEEEeCCCcEEEEEEeC
Confidence            488888988889999999999999999964


No 483
>PF06739 SBBP:  Beta-propeller repeat;  InterPro: IPR010620 This family is related to IPR001680 from INTERPRO and is likely to also form a beta-propeller. SBBP stands for Seven Bladed Beta Propeller.
Probab=24.16  E-value=1.5e+02  Score=16.85  Aligned_cols=22  Identities=9%  Similarity=0.046  Sum_probs=18.9

Q ss_pred             CCeEEEEECCCCCeEEEEeCCC
Q 022074          253 SPVRDCSWHPSQPMLVSSSWDG  274 (303)
Q Consensus       253 ~~I~~v~~sp~~~~las~s~Dg  274 (303)
                      ....++++.++|+..++|..++
T Consensus        13 ~~~~~IavD~~GNiYv~G~T~~   34 (38)
T PF06739_consen   13 DYGNGIAVDSNGNIYVTGYTNG   34 (38)
T ss_pred             eeEEEEEECCCCCEEEEEeecC
Confidence            4578999999999999998776


No 484
>KOG4460 consensus Nuclear pore complex, Nup88/rNup84 component [Nuclear structure; Intracellular trafficking, secretion, and vesicular transport]
Probab=23.91  E-value=3.1e+02  Score=26.28  Aligned_cols=65  Identities=15%  Similarity=0.198  Sum_probs=40.7

Q ss_pred             eCCCeEEEEEeCCCeEEEEEC---------CCCe------------EEEEeecCCCCeEEEEECCCC---CeEEEEeCCC
Q 022074          219 STGQKYIYTGSHDSCVYVYDL---------VSGE------------QVAALKYHTSPVRDCSWHPSQ---PMLVSSSWDG  274 (303)
Q Consensus       219 s~~~~~latg~~dg~i~iwd~---------~~~~------------~~~~~~~h~~~I~~v~~sp~~---~~las~s~Dg  274 (303)
                      ++.|+.++-.|.+|.+-++=.         +.|+            .+.+-. ..-.+..++|+|+.   ..|.--+.|.
T Consensus       112 s~~GS~VaL~G~~Gi~vMeLp~rwG~~s~~eDgk~~v~CRt~~i~~~~ftss-~~ltl~Qa~WHP~S~~D~hL~iL~sdn  190 (741)
T KOG4460|consen  112 SPTGSHVALIGIKGLMVMELPKRWGKNSEFEDGKSTVNCRTTPVAERFFTSS-TSLTLKQAAWHPSSILDPHLVLLTSDN  190 (741)
T ss_pred             cCCCceEEEecCCeeEEEEchhhcCccceecCCCceEEEEeecccceeeccC-CceeeeeccccCCccCCceEEEEecCc
Confidence            556777777777776544421         1221            111111 23467789999985   5677777799


Q ss_pred             CEEEeecCCC
Q 022074          275 DVVRWEFPGN  284 (303)
Q Consensus       275 ~i~~Wd~~~~  284 (303)
                      ++++++....
T Consensus       191 viRiy~lS~~  200 (741)
T KOG4460|consen  191 VIRIYSLSEP  200 (741)
T ss_pred             EEEEEecCCc
Confidence            9999997644


No 485
>TIGR03075 PQQ_enz_alc_DH PQQ-dependent dehydrogenase, methanol/ethanol family. This protein family has a phylogenetic distribution very similar to that of coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Genes in this family often are found adjacent to the PQQ biosynthesis genes themselves. An unusual, strained disulfide bond between adjacent Cys residues contributes to PQQ-binding, as does a Trp residue that is part of a PQQ enzyme repeat (see pfam01011). Characterized members include the dehydrogenase subunit of a membrane-anchored, three subunit alcohol (ethanol) dehydrogenase of Gluconobacter suboxydans, a homodimeric ethanol dehydrogenase in Pseudomonas aeruginosa, and the large subunit of an alpha2/beta2 heterotetrameric methanol dehydrogenase in Methylobacterium extorquens.
Probab=23.15  E-value=2.7e+02  Score=26.71  Aligned_cols=54  Identities=15%  Similarity=0.169  Sum_probs=0.0

Q ss_pred             CCeEEEEEeCCCeEEEEECCCCeEEEEeecCCCCeEEEEE--CCCCCeEEEEeCCC
Q 022074          221 GQKYIYTGSHDSCVYVYDLVSGEQVAALKYHTSPVRDCSW--HPSQPMLVSSSWDG  274 (303)
Q Consensus       221 ~~~~latg~~dg~i~iwd~~~~~~~~~~~~h~~~I~~v~~--sp~~~~las~s~Dg  274 (303)
                      .+.+++.+..+|.++.+|.++|+.+-.++....-.-+-.-  ....+|++.++.-|
T Consensus       471 ~g~lvf~g~~~G~l~a~D~~TGe~lw~~~~g~~~~a~P~ty~~~G~qYv~~~~G~g  526 (527)
T TIGR03075       471 AGDLVFYGTLEGYFKAFDAKTGEELWKFKTGSGIVGPPVTYEQDGKQYVAVLSGWG  526 (527)
T ss_pred             CCcEEEEECCCCeEEEEECCCCCEeEEEeCCCCceecCEEEEeCCEEEEEEEeccC


No 486
>KOG3356 consensus Predicted membrane protein [Function unknown]
Probab=22.76  E-value=1.1e+02  Score=22.37  Aligned_cols=32  Identities=19%  Similarity=0.230  Sum_probs=26.3

Q ss_pred             ccccCCCcccceEEEEEcCCCCEEEEeeCCCe
Q 022074           31 SAADDGGYSFGIFSLKFSTDGRELVAGSSDDC   62 (303)
Q Consensus        31 ~~~~~~~~~~~v~~l~~s~~g~~l~sgs~Dg~   62 (303)
                      |..|+.||-.+|.-++..-+|+++.-|-..+.
T Consensus        60 s~~d~~g~~rpv~fla~rvngqyimeglas~f   91 (147)
T KOG3356|consen   60 SMTDEHGHQRPVAFLAGRVNGQYIMEGLASSF   91 (147)
T ss_pred             cccccCCcCcceEEEeccccceeeehhhcccc
Confidence            36678999999999999999999887765553


No 487
>COG3204 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=22.73  E-value=5.3e+02  Score=22.72  Aligned_cols=74  Identities=22%  Similarity=0.418  Sum_probs=42.6

Q ss_pred             EecccCCeEEEEEccCCCcEEEEecCCCeEEEE-cCccccCCCcccee--eccc--ccCeEEEEeCCCCCEEEEEeCCCc
Q 022074           77 ILAHTSDVNTVCFGDESGHLIYSGSDDNLCKVW-DRRCLNVKGKPAGV--LMGH--LEGITFIDSRGDGRYLISNGKDQA  151 (303)
Q Consensus        77 ~~~h~~~v~~l~~~~~~~~~l~s~s~dg~v~lW-d~~~~~~~~~~~~~--~~~h--~~~v~~~~~~~~~~~l~s~~~D~~  151 (303)
                      +.+-+..++++.|+|+...+|++..+ +.-.+| +.     ++..+++  +.+-  .++|..   -.++.+.++--+++.
T Consensus        81 i~g~~~nvS~LTynp~~rtLFav~n~-p~~iVElt~-----~GdlirtiPL~g~~DpE~Iey---ig~n~fvi~dER~~~  151 (316)
T COG3204          81 ILGETANVSSLTYNPDTRTLFAVTNK-PAAIVELTK-----EGDLIRTIPLTGFSDPETIEY---IGGNQFVIVDERDRA  151 (316)
T ss_pred             cccccccccceeeCCCcceEEEecCC-CceEEEEec-----CCceEEEecccccCChhHeEE---ecCCEEEEEehhcce
Confidence            45555679999998775555555554 444444 32     2333322  2222  233444   345667777778888


Q ss_pred             EEEEEccc
Q 022074          152 IKLWDIRK  159 (303)
Q Consensus       152 v~lWdl~~  159 (303)
                      +.++.+..
T Consensus       152 l~~~~vd~  159 (316)
T COG3204         152 LYLFTVDA  159 (316)
T ss_pred             EEEEEEcC
Confidence            88887654


No 488
>PF12341 DUF3639:  Protein of unknown function (DUF3639) ;  InterPro: IPR022100  This domain family is found in eukaryotes, and is approximately 30 amino acids in length. The family is found in association with PF00400 from PFAM. There are two completely conserved residues (E and R) that may be functionally important. 
Probab=22.30  E-value=1.4e+02  Score=15.81  Aligned_cols=23  Identities=9%  Similarity=0.174  Sum_probs=17.9

Q ss_pred             eEEEEEcCCCCEEEEeeCCCeEEEE
Q 022074           42 IFSLKFSTDGRELVAGSSDDCIYVY   66 (303)
Q Consensus        42 v~~l~~s~~g~~l~sgs~Dg~v~lw   66 (303)
                      |.+++.++  ++++++..-+-+|+|
T Consensus         4 i~aia~g~--~~vavaTS~~~lRif   26 (27)
T PF12341_consen    4 IEAIAAGD--SWVAVATSAGYLRIF   26 (27)
T ss_pred             EEEEEccC--CEEEEEeCCCeEEec
Confidence            66777664  588888888889987


No 489
>TIGR03054 photo_alph_chp1 putative photosynthetic complex assembly protein. In twenty or so anoxygenic photosynthetic alpha-Proteobacteria known so far, a gene for a member of this protein family is present and is found in the vicinity of puhA, which encodes a component of the photosynthetic reaction center, and other genes associated with photosynthesis. This protein family is suggested, consequently, as a probable assembly factor for the photosynthetic reaction center, but it seems its actual function has not yet been demonstrated.
Probab=22.04  E-value=3.7e+02  Score=20.57  Aligned_cols=61  Identities=18%  Similarity=0.174  Sum_probs=45.2

Q ss_pred             EEEEEeCCCeEEEEECCCCeEEEEeecCC-CCeEEE-----------EECCCCCeEEEEeCCCCEEEeecCCC
Q 022074          224 YIYTGSHDSCVYVYDLVSGEQVAALKYHT-SPVRDC-----------SWHPSQPMLVSSSWDGDVVRWEFPGN  284 (303)
Q Consensus       224 ~latg~~dg~i~iwd~~~~~~~~~~~~h~-~~I~~v-----------~~sp~~~~las~s~Dg~i~~Wd~~~~  284 (303)
                      +.+.+..||.+.+++..+|+.+..+...+ +-|..+           ....+.++-++--+||.+.+-|..+.
T Consensus        43 l~f~d~~~G~v~V~~~~~G~~va~~~~g~~GFvrgvlR~l~R~R~~~gv~~~~Pf~L~r~~dGrltL~Dp~Tg  115 (135)
T TIGR03054        43 LVFEDRPDGAVAVVETPDGRLVAILEPGQNGFVRVMLRGLARARARAGVAAEPPFRLTRYDNGRLTLTDPATG  115 (135)
T ss_pred             EEEecCCCCeEEEEECCCCCEEEEecCCCCchhhHhHHHHHHHHHHcCCCCCCCEEEEEEeCCcEEEEcCCCC
Confidence            45667789999999999999998885332 222211           14567789999999999999996654


No 490
>PF10214 Rrn6:  RNA polymerase I-specific transcription-initiation factor;  InterPro: IPR019350  RNA polymerase I-specific transcription-initiation factor Rrn6 and Rrn7 represent components of a multisubunit transcription factor essential for the initiation of rDNA transcription by Pol I. These proteins are found in fungi.
Probab=21.35  E-value=8.6e+02  Score=24.59  Aligned_cols=122  Identities=16%  Similarity=0.142  Sum_probs=69.7

Q ss_pred             CcccceEEEEEc---C----CCCEEEEeeCCCeEEEEECCCCc---------------eEEEEecc---cCCeEEEEEcc
Q 022074           37 GYSFGIFSLKFS---T----DGRELVAGSSDDCIYVYDLEANK---------------LSLRILAH---TSDVNTVCFGD   91 (303)
Q Consensus        37 ~~~~~v~~l~~s---~----~g~~l~sgs~Dg~v~lwd~~~~~---------------~~~~~~~h---~~~v~~l~~~~   91 (303)
                      ....||..|.|.   .    ..++|++= ....+.|+...-.+               ....+..+   ......++|+|
T Consensus        77 ~~~~PI~qI~fa~~~~~~~~~~~~l~Vr-t~~st~I~~p~~~~~~~~~~~~~s~i~~~~l~~i~~~~tgg~~~aDv~FnP  155 (765)
T PF10214_consen   77 DDGSPIKQIKFATLSESFDEKSRWLAVR-TETSTTILRPEYHRVISSIRSRPSRIDPNPLLTISSSDTGGFPHADVAFNP  155 (765)
T ss_pred             CCCCCeeEEEecccccccCCcCcEEEEE-cCCEEEEEEcccccccccccCCccccccceeEEechhhcCCCccceEEecc
Confidence            566899999999   2    12355554 44567788722111               11222211   12456889998


Q ss_pred             CCCcEEEEecCCCeEEEEcCccccCCC-cccee---eccc-------ccCeEEEEeCCCCCEEEEEeCCCcEEEEEcccc
Q 022074           92 ESGHLIYSGSDDNLCKVWDRRCLNVKG-KPAGV---LMGH-------LEGITFIDSRGDGRYLISNGKDQAIKLWDIRKM  160 (303)
Q Consensus        92 ~~~~~l~s~s~dg~v~lWd~~~~~~~~-~~~~~---~~~h-------~~~v~~~~~~~~~~~l~s~~~D~~v~lWdl~~~  160 (303)
                      .+...||.....|...+||+....... .....   ..|+       .+....+.|..+-+.|+.+++ ..+.++|++..
T Consensus       156 ~~~~q~AiVD~~G~Wsvw~i~~~~~~~~~~~~~~~~~~gsi~~d~~e~s~w~rI~W~~~~~~lLv~~r-~~l~~~d~~~~  234 (765)
T PF10214_consen  156 WDQRQFAIVDEKGNWSVWDIKGRPKRKSSNLRLSRNISGSIIFDPEELSNWKRILWVSDSNRLLVCNR-SKLMLIDFESN  234 (765)
T ss_pred             CccceEEEEeccCcEEEEEeccccccCCcceeeccCCCccccCCCcccCcceeeEecCCCCEEEEEcC-CceEEEECCCC
Confidence            777899999999999999982111111 01111   1111       122334556666666776665 66777777653


No 491
>TIGR02171 Fb_sc_TIGR02171 Fibrobacter succinogenes paralogous family TIGR02171. This model describes a paralogous family of the rumen bacterium Fibrobacter succinogenes. Eleven members are found in Fibrobacter succinogenes S85, averaging over 900 amino acids in length. More than half are predicted lipoproteins. The function is unknown.
Probab=21.34  E-value=3.8e+02  Score=27.58  Aligned_cols=54  Identities=15%  Similarity=0.136  Sum_probs=38.3

Q ss_pred             CCCeEEEEECCCCeEEEE-eecCCCCeEEEEECCCCCeEEE-EeCCC-----CEEEeecCCC
Q 022074          230 HDSCVYVYDLVSGEQVAA-LKYHTSPVRDCSWHPSQPMLVS-SSWDG-----DVVRWEFPGN  284 (303)
Q Consensus       230 ~dg~i~iwd~~~~~~~~~-~~~h~~~I~~v~~sp~~~~las-~s~Dg-----~i~~Wd~~~~  284 (303)
                      ..+.|.+-|......... + .+..+|.+=+|||||+.||= .+.++     .|.+=++...
T Consensus       327 ~~~~L~~~D~dG~n~~~ve~-~~~~~i~sP~~SPDG~~vAY~ts~e~~~g~s~vYv~~L~t~  387 (912)
T TIGR02171       327 VTGNLAYIDYTKGASRAVEI-EDTISVYHPDISPDGKKVAFCTGIEGLPGKSSVYVRNLNAS  387 (912)
T ss_pred             CCCeEEEEecCCCCceEEEe-cCCCceecCcCCCCCCEEEEEEeecCCCCCceEEEEehhcc
Confidence            345888889876554322 3 46789999999999999887 56555     4666676544


No 492
>PF13418 Kelch_4:  Galactose oxidase, central domain; PDB: 2UVK_B.
Probab=21.31  E-value=1.3e+02  Score=17.63  Aligned_cols=23  Identities=17%  Similarity=0.421  Sum_probs=12.8

Q ss_pred             CCeEEEEEeCCC------eEEEEECCCCe
Q 022074          221 GQKYIYTGSHDS------CVYVYDLVSGE  243 (303)
Q Consensus       221 ~~~~latg~~dg------~i~iwd~~~~~  243 (303)
                      ++++++.||.+.      .+.+||+.+++
T Consensus        12 ~~~i~v~GG~~~~~~~~~d~~~~d~~~~~   40 (49)
T PF13418_consen   12 DNSIYVFGGRDSSGSPLNDLWIFDIETNT   40 (49)
T ss_dssp             TTEEEEE--EEE-TEE---EEEEETTTTE
T ss_pred             CCeEEEECCCCCCCcccCCEEEEECCCCE
Confidence            456666666433      47788887764


Done!