Query         045821
Match_columns 352
No_of_seqs    190 out of 2431
Neff          10.2
Searched_HMMs 46136
Date          Fri Mar 29 05:50:20 2013
Command       hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/045821.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/045821hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG4441 Proteins containing BT 100.0 1.4E-42   3E-47  327.5  31.0  258   66-330   282-555 (571)
  2 PHA02713 hypothetical protein; 100.0 1.9E-41 4.1E-46  321.2  29.8  255   69-330   258-542 (557)
  3 KOG4441 Proteins containing BT 100.0 5.5E-39 1.2E-43  303.1  25.8  254   36-300   301-566 (571)
  4 PLN02153 epithiospecifier prot 100.0 6.5E-37 1.4E-41  275.9  28.6  264   47-313     8-324 (341)
  5 PHA02713 hypothetical protein; 100.0 1.1E-37 2.5E-42  295.4  22.3  236   44-288   280-543 (557)
  6 PLN02193 nitrile-specifier pro 100.0 2.6E-35 5.7E-40  274.9  35.6  261   68-330   120-419 (470)
  7 PLN02193 nitrile-specifier pro 100.0 6.2E-36 1.3E-40  279.1  30.3  264   46-313   151-453 (470)
  8 TIGR03548 mutarot_permut cycli 100.0 1.2E-35 2.7E-40  265.7  29.2  240   67-314    12-314 (323)
  9 PHA03098 kelch-like protein; P 100.0 6.6E-36 1.4E-40  285.4  29.1  253   69-330   251-520 (534)
 10 TIGR03547 muta_rot_YjhT mutatr 100.0   2E-35 4.4E-40  267.1  27.5  240   67-313    16-331 (346)
 11 PLN02153 epithiospecifier prot 100.0 6.9E-35 1.5E-39  262.8  30.4  235   93-327     5-290 (341)
 12 PRK14131 N-acetylneuraminic ac 100.0 1.2E-34 2.6E-39  263.9  28.1  239   68-313    38-353 (376)
 13 PHA02790 Kelch-like protein; P 100.0 2.1E-34 4.6E-39  269.5  27.6  203  115-326   267-475 (480)
 14 PHA03098 kelch-like protein; P 100.0 4.8E-34   1E-38  272.6  26.2  217   68-290   294-523 (534)
 15 PHA02790 Kelch-like protein; P 100.0   1E-33 2.2E-38  264.9  26.5  203   67-286   270-478 (480)
 16 KOG4693 Uncharacterized conser 100.0 1.4E-33   3E-38  228.9  21.7  251   61-313    14-312 (392)
 17 TIGR03548 mutarot_permut cycli 100.0 6.1E-32 1.3E-36  241.9  27.4  222  109-330     3-288 (323)
 18 TIGR03547 muta_rot_YjhT mutatr 100.0 8.8E-32 1.9E-36  243.4  26.8  223  101-331     2-308 (346)
 19 PRK14131 N-acetylneuraminic ac 100.0 2.2E-30 4.8E-35  236.0  26.3  230   94-331    16-330 (376)
 20 KOG4693 Uncharacterized conser 100.0 3.7E-28 8.1E-33  197.4  21.8  229   96-327     3-282 (392)
 21 KOG0379 Kelch repeat-containin  99.9 4.7E-25   1E-29  205.5  23.3  234   54-289    54-312 (482)
 22 KOG0379 Kelch repeat-containin  99.9 7.4E-24 1.6E-28  197.5  25.1  225  106-332    57-312 (482)
 23 KOG1230 Protein containing rep  99.9 7.7E-23 1.7E-27  175.9  18.8  223  108-330    65-349 (521)
 24 KOG4152 Host cell transcriptio  99.9 6.6E-23 1.4E-27  180.6  16.4  264   47-313    18-342 (830)
 25 KOG1230 Protein containing rep  99.9 1.9E-22 4.2E-27  173.4  17.6  257   18-286    33-348 (521)
 26 KOG4152 Host cell transcriptio  99.8 3.1E-18 6.7E-23  151.4  18.5  218   93-312    15-273 (830)
 27 COG3055 Uncharacterized protei  99.8 1.2E-17 2.7E-22  142.1  20.6  237   68-313    46-359 (381)
 28 COG3055 Uncharacterized protei  99.7 3.6E-15 7.8E-20  127.2  18.6  207   99-313    29-295 (381)
 29 KOG2437 Muskelin [Signal trans  99.5 1.3E-14 2.8E-19  128.4   7.4  234   93-327   237-540 (723)
 30 PF13964 Kelch_6:  Kelch motif   99.3 1.4E-11 3.1E-16   77.7   6.6   50  109-158     1-50  (50)
 31 KOG2437 Muskelin [Signal trans  99.2 5.9E-12 1.3E-16  111.8   4.9  239   46-286   239-542 (723)
 32 PF13964 Kelch_6:  Kelch motif   99.1 1.1E-10 2.5E-15   73.5   5.9   46  157-202     1-50  (50)
 33 TIGR01640 F_box_assoc_1 F-box   99.1 8.4E-08 1.8E-12   81.7  22.7  188  136-323    14-230 (230)
 34 PF13415 Kelch_3:  Galactose ox  99.0 8.4E-10 1.8E-14   69.1   5.8   48  119-166     1-49  (49)
 35 PF01344 Kelch_1:  Kelch motif;  99.0 4.8E-10   1E-14   69.7   4.0   47  109-155     1-47  (47)
 36 PF01344 Kelch_1:  Kelch motif;  99.0 7.4E-10 1.6E-14   68.8   4.0   43  157-199     1-47  (47)
 37 PF07646 Kelch_2:  Kelch motif;  98.9 1.3E-08 2.7E-13   63.7   6.7   47  109-155     1-49  (49)
 38 PF13418 Kelch_4:  Galactose ox  98.8 3.6E-09 7.8E-14   66.3   3.8   47  109-155     1-48  (49)
 39 PF07646 Kelch_2:  Kelch motif;  98.8 1.2E-08 2.7E-13   63.8   5.7   43  157-199     1-49  (49)
 40 TIGR01640 F_box_assoc_1 F-box   98.8 5.7E-06 1.2E-10   70.4  23.8  194   83-280    14-230 (230)
 41 smart00612 Kelch Kelch domain.  98.7 1.9E-08 4.2E-13   62.3   4.9   47  121-168     1-47  (47)
 42 PF13418 Kelch_4:  Galactose ox  98.7 1.3E-08 2.8E-13   63.8   3.9   43  157-199     1-48  (49)
 43 PF07250 Glyoxal_oxid_N:  Glyox  98.7 3.5E-06 7.7E-11   71.0  17.7  149  136-291    46-211 (243)
 44 PF13415 Kelch_3:  Galactose ox  98.6 1.4E-07   3E-12   58.9   5.7   43  167-209     1-48  (49)
 45 smart00612 Kelch Kelch domain.  98.5 1.5E-07 3.3E-12   58.2   4.8   44  169-212     1-47  (47)
 46 PLN02772 guanylate kinase       98.5 1.6E-06 3.5E-11   77.6  10.7   83  108-190    23-110 (398)
 47 PF07250 Glyoxal_oxid_N:  Glyox  98.4 3.9E-05 8.5E-10   64.7  16.1  141  183-327    49-203 (243)
 48 PLN03215 ascorbic acid mannose  98.3  0.0016 3.4E-08   58.5  25.1   38   16-53      2-40  (373)
 49 PF13854 Kelch_5:  Kelch motif   98.2 3.1E-06 6.8E-11   50.8   5.3   38  108-145     3-41  (42)
 50 PLN02772 guanylate kinase       98.2 9.8E-06 2.1E-10   72.7  10.1   80  156-237    23-110 (398)
 51 PRK11138 outer membrane biogen  98.1  0.0069 1.5E-07   56.0  28.2  228   67-327    68-320 (394)
 52 PF12937 F-box-like:  F-box-lik  98.1 9.2E-07   2E-11   54.7   1.3   40   18-57      1-40  (47)
 53 PF13854 Kelch_5:  Kelch motif   98.1 8.1E-06 1.8E-10   48.9   5.2   36  154-189     1-41  (42)
 54 PF07893 DUF1668:  Protein of u  98.0  0.0066 1.4E-07   54.7  24.7  119  114-244    70-216 (342)
 55 PRK11138 outer membrane biogen  98.0   0.012 2.6E-07   54.4  27.8  206   84-319   171-392 (394)
 56 PF00646 F-box:  F-box domain;   98.0 1.1E-06 2.4E-11   54.6   0.0   43   17-59      2-44  (48)
 57 smart00256 FBOX A Receptor for  97.9 6.2E-06 1.4E-10   49.2   1.5   37   21-57      1-37  (41)
 58 PF13360 PQQ_2:  PQQ-like domai  97.9   0.011 2.4E-07   50.3  21.9  173  136-324     3-194 (238)
 59 KOG0281 Beta-TrCP (transducin   97.8 0.00087 1.9E-08   58.0  14.1  279   18-322    75-389 (499)
 60 TIGR03300 assembly_YfgL outer   97.8    0.03 6.6E-07   51.3  27.7  173  116-317   187-375 (377)
 61 PF13360 PQQ_2:  PQQ-like domai  97.5   0.041   9E-07   46.7  27.0  182  116-324    33-233 (238)
 62 TIGR03300 assembly_YfgL outer   97.5   0.069 1.5E-06   49.0  28.8  221   68-327    65-305 (377)
 63 PF03089 RAG2:  Recombination a  97.5   0.004 8.7E-08   52.7  13.1  101  121-221    39-174 (337)
 64 PF08450 SGL:  SMP-30/Gluconola  97.4    0.02 4.4E-07   49.1  17.7  186  119-322    11-214 (246)
 65 PF14870 PSII_BNR:  Photosynthe  97.4   0.066 1.4E-06   47.1  20.7  236   68-328    27-268 (302)
 66 PF12768 Rax2:  Cortical protei  97.3   0.018 3.8E-07   50.1  15.5  105  180-287    16-130 (281)
 67 PF07893 DUF1668:  Protein of u  97.2   0.018   4E-07   51.9  15.2  119  166-288    75-217 (342)
 68 PRK13684 Ycf48-like protein; P  97.1    0.21 4.5E-06   45.0  20.8  206   93-321   117-331 (334)
 69 PF08450 SGL:  SMP-30/Gluconola  96.9     0.1 2.2E-06   44.7  16.6  190   68-286    11-221 (246)
 70 TIGR03866 PQQ_ABC_repeats PQQ-  96.9    0.27 5.8E-06   42.9  19.9  182  121-324     2-190 (300)
 71 PF05096 Glu_cyclase_2:  Glutam  96.8    0.14   3E-06   43.8  16.1  156  160-327    47-209 (264)
 72 PRK04792 tolB translocation pr  96.8    0.45 9.7E-06   44.8  26.3  148  135-287   241-391 (448)
 73 TIGR03866 PQQ_ABC_repeats PQQ-  96.7    0.35 7.6E-06   42.2  24.6  224   70-322     2-238 (300)
 74 KOG2055 WD40 repeat protein [G  96.6   0.058 1.3E-06   48.8  12.7  148  167-324   224-377 (514)
 75 PF12768 Rax2:  Cortical protei  96.6    0.19 4.1E-06   43.8  15.6  106  134-244    14-130 (281)
 76 KOG2055 WD40 repeat protein [G  96.6    0.12 2.6E-06   46.9  14.4  188   68-279   224-418 (514)
 77 KOG0274 Cdc4 and related F-box  96.6    0.37 8.1E-06   46.2  18.8  285   14-323   104-402 (537)
 78 cd00200 WD40 WD40 domain, foun  96.5    0.42 9.1E-06   40.6  18.2  144  167-324    62-210 (289)
 79 PRK13684 Ycf48-like protein; P  96.5    0.57 1.2E-05   42.2  21.7  212   93-327    74-294 (334)
 80 cd00200 WD40 WD40 domain, foun  96.5    0.43 9.3E-06   40.6  22.6  183  119-322    62-250 (289)
 81 PF09910 DUF2139:  Uncharacteri  96.5    0.48   1E-05   40.9  21.6  169  100-279    28-231 (339)
 82 KOG0291 WD40-repeat-containing  96.5    0.95 2.1E-05   43.9  21.0  154  117-288   359-517 (893)
 83 PF03089 RAG2:  Recombination a  96.3    0.59 1.3E-05   40.1  16.3   70  108-177    86-174 (337)
 84 cd00094 HX Hemopexin-like repe  96.3     0.5 1.1E-05   38.9  17.6  145  115-281    12-178 (194)
 85 PLN02919 haloacid dehalogenase  96.3     1.9 4.1E-05   45.3  27.9  192  119-324   635-891 (1057)
 86 PRK00178 tolB translocation pr  96.2     1.1 2.3E-05   42.0  22.6  182   82-280   222-408 (430)
 87 PRK04792 tolB translocation pr  96.2     1.1 2.4E-05   42.2  23.5  183   81-280   240-427 (448)
 88 PRK11028 6-phosphogluconolacto  96.2    0.88 1.9E-05   40.8  19.4  189  121-322     3-206 (330)
 89 PF05096 Glu_cyclase_2:  Glutam  96.1    0.31 6.7E-06   41.7  13.9  115  210-332    54-168 (264)
 90 PRK11028 6-phosphogluconolacto  96.1       1 2.2E-05   40.4  22.7  191  119-323    46-260 (330)
 91 PRK04922 tolB translocation pr  96.0     1.4   3E-05   41.4  24.9  180  135-322   227-412 (433)
 92 KOG0310 Conserved WD40 repeat-  96.0    0.44 9.4E-06   43.6  14.8  177  135-323    47-227 (487)
 93 PF10282 Lactonase:  Lactonase,  96.0     1.2 2.5E-05   40.4  22.1  238   68-322    48-323 (345)
 94 PRK00178 tolB translocation pr  95.9     1.5 3.2E-05   41.0  26.7  146  135-287   222-372 (430)
 95 PRK05137 tolB translocation pr  95.9     1.6 3.4E-05   41.0  23.4  188   82-286   225-420 (435)
 96 smart00284 OLF Olfactomedin-li  95.9       1 2.2E-05   38.6  20.5  193  119-321    34-252 (255)
 97 PF10282 Lactonase:  Lactonase,  95.8     1.4   3E-05   40.0  22.0  234   74-321     4-275 (345)
 98 TIGR02800 propeller_TolB tol-p  95.8     1.6 3.6E-05   40.4  23.5  182   82-279   213-398 (417)
 99 COG4257 Vgb Streptogramin lyas  95.7     1.1 2.5E-05   38.3  18.3  218   86-325    86-309 (353)
100 KOG0310 Conserved WD40 repeat-  95.7     1.3 2.8E-05   40.7  16.6  223   68-319    79-306 (487)
101 TIGR02800 propeller_TolB tol-p  95.7     1.8 3.8E-05   40.2  26.3  146  136-287   214-363 (417)
102 PF03178 CPSF_A:  CPSF A subuni  95.6    0.83 1.8E-05   40.9  15.6  133  136-277    62-203 (321)
103 TIGR03075 PQQ_enz_alc_DH PQQ-d  95.6     1.4 3.1E-05   42.3  18.0  115  161-285    63-198 (527)
104 cd00216 PQQ_DH Dehydrogenases   95.5     2.4 5.2E-05   40.5  24.4  116  115-241    57-191 (488)
105 PRK04922 tolB translocation pr  95.5     2.2 4.8E-05   40.0  23.7  187   82-285   227-418 (433)
106 PF02191 OLF:  Olfactomedin-lik  95.5     1.5 3.1E-05   37.7  19.4  191  119-321    30-247 (250)
107 COG4257 Vgb Streptogramin lyas  95.4     1.5 3.3E-05   37.5  17.0  184   84-287   125-314 (353)
108 cd00094 HX Hemopexin-like repe  95.3     1.3 2.9E-05   36.3  17.9  145  162-324    11-178 (194)
109 PF08268 FBA_3:  F-box associat  95.2    0.69 1.5E-05   35.2  11.9   79  164-242     2-87  (129)
110 PF08268 FBA_3:  F-box associat  95.2    0.42 9.2E-06   36.4  10.6   79  208-287     2-89  (129)
111 PRK04043 tolB translocation pr  95.1     2.9 6.3E-05   39.0  20.4  177  136-323   213-402 (419)
112 KOG2120 SCF ubiquitin ligase,   95.0   0.016 3.6E-07   49.8   2.5   38   18-55     98-135 (419)
113 PLN00181 protein SPA1-RELATED;  95.0     4.8  0.0001   41.0  23.6  172  136-322   555-739 (793)
114 TIGR03075 PQQ_enz_alc_DH PQQ-d  94.7     2.5 5.5E-05   40.7  16.7  117  115-241    65-197 (527)
115 PRK05137 tolB translocation pr  94.7     3.9 8.4E-05   38.4  27.6  181  135-323   225-414 (435)
116 PF12217 End_beta_propel:  Cata  94.6     2.5 5.4E-05   35.9  14.0  175   93-267    53-258 (367)
117 PF02191 OLF:  Olfactomedin-lik  94.5     2.7 5.9E-05   36.0  18.6  192   68-278    30-247 (250)
118 PRK03629 tolB translocation pr  94.3     4.7  0.0001   37.7  26.9  148  135-287   222-372 (429)
119 PRK03629 tolB translocation pr  94.2     4.8 0.00011   37.7  23.6  188   83-286   223-414 (429)
120 PRK04043 tolB translocation pr  94.1     5.1 0.00011   37.4  23.0  190   82-286   212-408 (419)
121 KOG0316 Conserved WD40 repeat-  94.1     3.1 6.7E-05   34.9  17.3  176  135-325    80-261 (307)
122 KOG0286 G-protein beta subunit  94.0     3.7   8E-05   35.5  19.7  204  119-348   108-321 (343)
123 PRK02889 tolB translocation pr  93.8     5.8 0.00013   37.1  24.8  146  135-286   219-368 (427)
124 KOG0266 WD40 repeat-containing  93.6     6.8 0.00015   37.0  17.5  178  135-322   224-410 (456)
125 PF03178 CPSF_A:  CPSF A subuni  93.5       3 6.5E-05   37.3  13.9  122  180-310    62-190 (321)
126 PLN02919 haloacid dehalogenase  93.2      13 0.00028   39.2  22.9  150  119-282   694-892 (1057)
127 PLN00033 photosystem II stabil  93.2     7.1 0.00015   36.1  22.8  216   93-328   118-363 (398)
128 KOG0646 WD40 repeat protein [G  93.1     3.8 8.1E-05   37.6  13.2  199  113-330    85-316 (476)
129 smart00284 OLF Olfactomedin-li  93.0     5.4 0.00012   34.2  20.2  192   68-278    34-252 (255)
130 PTZ00421 coronin; Provisional   92.9       9  0.0002   36.6  20.3  147  168-326   138-295 (493)
131 KOG0649 WD40 repeat protein [G  92.9     5.2 0.00011   33.7  16.0  185  147-352   100-303 (325)
132 COG2706 3-carboxymuconate cycl  92.6       7 0.00015   34.7  25.4  245   68-324    51-324 (346)
133 KOG0316 Conserved WD40 repeat-  92.6     5.6 0.00012   33.4  17.2  189   68-282    70-261 (307)
134 PLN00033 photosystem II stabil  92.5     8.8 0.00019   35.5  21.9  199   93-312   164-390 (398)
135 KOG2997 F-box protein FBX9 [Ge  92.3   0.058 1.3E-06   46.7   1.0   43   18-60    107-154 (366)
136 PF07734 FBA_1:  F-box associat  92.1     5.1 0.00011   31.8  13.3   80  208-287     2-92  (164)
137 PLN03215 ascorbic acid mannose  92.0     8.3 0.00018   35.1  14.2  132  145-285   189-353 (373)
138 PF14870 PSII_BNR:  Photosynthe  91.7       9  0.0002   33.9  24.9  217   68-312    71-295 (302)
139 PTZ00421 coronin; Provisional   91.7      13 0.00028   35.6  22.1  152  120-285   138-297 (493)
140 KOG0296 Angio-associated migra  91.4      10 0.00022   33.9  14.9  141  167-324    75-223 (399)
141 COG4946 Uncharacterized protei  91.3      12 0.00027   34.7  20.4  164   55-243   264-439 (668)
142 KOG0315 G-protein beta subunit  91.2     8.4 0.00018   32.6  16.6  165  167-347    51-221 (311)
143 PF07734 FBA_1:  F-box associat  91.2     6.5 0.00014   31.2  11.9   78  164-242     2-90  (164)
144 PRK02889 tolB translocation pr  91.1      13 0.00029   34.7  23.4  183   81-279   218-404 (427)
145 PTZ00420 coronin; Provisional   90.9      16 0.00036   35.4  19.8  141  169-323   139-295 (568)
146 PF02897 Peptidase_S9_N:  Proly  90.8      14 0.00029   34.4  17.3  161   68-242   238-411 (414)
147 COG1520 FOG: WD40-like repeat   90.8      13 0.00028   34.0  20.2  156  163-331    64-229 (370)
148 KOG0296 Angio-associated migra  90.8      12 0.00025   33.5  21.5  147  119-281    75-223 (399)
149 KOG1036 Mitotic spindle checkp  90.6     7.6 0.00017   33.8  11.8  132  180-324    35-166 (323)
150 COG4447 Uncharacterized protei  90.5     6.9 0.00015   33.8  11.4  244   68-339    54-307 (339)
151 TIGR03074 PQQ_membr_DH membran  90.0      24 0.00051   35.8  19.3   35  160-197   187-223 (764)
152 COG1520 FOG: WD40-like repeat   89.8      16 0.00034   33.4  25.2  153  116-285    65-226 (370)
153 KOG0315 G-protein beta subunit  89.3      12 0.00027   31.7  16.4  177  136-323    61-247 (311)
154 KOG1332 Vesicle coat complex C  89.2      13 0.00027   31.6  14.1  103  169-288   176-297 (299)
155 TIGR03074 PQQ_membr_DH membran  89.0      28  0.0006   35.3  18.5   35  113-154   188-224 (764)
156 KOG0266 WD40 repeat-containing  88.8      22 0.00047   33.7  18.5  146  119-279   257-410 (456)
157 PF02897 Peptidase_S9_N:  Proly  88.4      21 0.00046   33.1  19.9  147  135-286   251-412 (414)
158 KOG1446 Histone H3 (Lys4) meth  88.1      17 0.00037   31.7  22.5  101   84-195    37-137 (311)
159 cd00216 PQQ_DH Dehydrogenases   88.0      25 0.00055   33.5  25.9  162  135-309   255-454 (488)
160 PRK01742 tolB translocation pr  87.2      26 0.00056   32.8  22.1  180   82-285   227-411 (429)
161 KOG2321 WD40 repeat protein [G  86.7      17 0.00038   34.6  12.2   62  212-281   146-208 (703)
162 COG4946 Uncharacterized protei  86.7      27 0.00059   32.5  16.9  184  134-327   105-301 (668)
163 KOG4649 PQQ (pyrrolo-quinoline  85.1      24 0.00051   30.3  12.7   90  180-281    33-126 (354)
164 KOG2106 Uncharacterized conser  84.8      35 0.00076   32.1  18.4  185   84-289   223-409 (626)
165 KOG0289 mRNA splicing factor [  84.4      34 0.00073   31.5  17.8  119  160-290   350-474 (506)
166 KOG0278 Serine/threonine kinas  84.2      25 0.00054   29.9  11.9  125  134-275   163-294 (334)
167 KOG1332 Vesicle coat complex C  84.2      25 0.00054   29.9  14.1   74  156-244   221-296 (299)
168 PF07433 DUF1513:  Protein of u  83.4      31 0.00068   30.5  21.6  237   68-321    16-285 (305)
169 KOG0274 Cdc4 and related F-box  83.2      46   0.001   32.2  20.0  177  134-327   309-488 (537)
170 PRK01742 tolB translocation pr  82.7      42 0.00091   31.4  25.4  139  135-286   227-369 (429)
171 PF06433 Me-amine-dh_H:  Methyl  82.7      36 0.00078   30.6  12.2   69  211-282   249-324 (342)
172 KOG0291 WD40-repeat-containing  82.4      55  0.0012   32.5  23.1  157  115-285   312-473 (893)
173 PF15525 DUF4652:  Domain of un  82.2      25 0.00055   28.5  11.2   86  127-212    79-172 (200)
174 PRK10115 protease 2; Provision  82.2      59  0.0013   32.7  25.4  149  135-286   198-354 (686)
175 PLN00181 protein SPA1-RELATED;  82.0      64  0.0014   33.0  22.2  143  119-280   587-740 (793)
176 KOG3881 Uncharacterized conser  81.5      38 0.00082   30.7  11.6  151  168-326   161-325 (412)
177 COG2706 3-carboxymuconate cycl  81.3      40 0.00086   30.2  24.8  210  120-336    52-289 (346)
178 TIGR02658 TTQ_MADH_Hv methylam  80.3      46   0.001   30.2  23.5  120   68-194    12-142 (352)
179 COG3386 Gluconolactonase [Carb  77.6      51  0.0011   29.3  18.2  183  136-332    47-253 (307)
180 KOG0282 mRNA splicing factor [  77.6      29 0.00062   32.3   9.9   62  167-238   269-333 (503)
181 KOG2111 Uncharacterized conser  75.5      58  0.0013   28.8  17.6  145  119-282    58-216 (346)
182 TIGR02658 TTQ_MADH_Hv methylam  74.9      67  0.0014   29.2  22.4   68  258-327   259-336 (352)
183 PF14583 Pectate_lyase22:  Olig  73.8      41 0.00089   30.8  10.0  223   68-309    46-299 (386)
184 PRK10115 protease 2; Provision  73.8   1E+02  0.0022   30.9  21.8  150  135-288   246-404 (686)
185 KOG0263 Transcription initiati  71.7      43 0.00093   33.0  10.0   63  210-279   587-650 (707)
186 KOG0643 Translation initiation  71.6      67  0.0014   27.8  20.2  171  133-313    71-253 (327)
187 COG0823 TolB Periplasmic compo  70.4      95  0.0021   29.1  16.9  108  134-244   260-368 (425)
188 KOG0647 mRNA export protein (c  69.9      78  0.0017   27.9  16.2  129  135-275    93-225 (347)
189 KOG1036 Mitotic spindle checkp  69.9      77  0.0017   27.8  15.6  132  136-281    35-166 (323)
190 KOG0263 Transcription initiati  69.4      60  0.0013   32.1  10.4   59  167-234   588-648 (707)
191 KOG0305 Anaphase promoting com  69.2 1.1E+02  0.0023   29.2  12.4  173  137-323   198-378 (484)
192 KOG2321 WD40 repeat protein [G  69.0      65  0.0014   31.0  10.2  105  167-281   145-261 (703)
193 PF12217 End_beta_propel:  Cata  68.6      77  0.0017   27.3  13.1  194  116-310    22-256 (367)
194 COG4447 Uncharacterized protei  68.0      53  0.0011   28.6   8.7  113  163-286    50-164 (339)
195 PF14583 Pectate_lyase22:  Olig  68.0   1E+02  0.0022   28.4  11.7  129   85-221   170-303 (386)
196 KOG0286 G-protein beta subunit  67.5      86  0.0019   27.5  19.7  134  135-277   165-302 (343)
197 KOG0281 Beta-TrCP (transducin   67.1      30 0.00065   30.9   7.3  102  166-279   328-429 (499)
198 KOG1539 WD repeat protein [Gen  66.7 1.5E+02  0.0033   30.0  18.6   71  113-194    39-109 (910)
199 KOG0285 Pleiotropic regulator   66.5   1E+02  0.0022   27.9  15.7  204  119-349   203-416 (460)
200 COG0823 TolB Periplasmic compo  65.7 1.2E+02  0.0026   28.5  14.2  103  180-286   262-367 (425)
201 KOG0318 WD40 repeat stress pro  65.2 1.3E+02  0.0028   28.7  20.6   86  256-344   329-416 (603)
202 PF02239 Cytochrom_D1:  Cytochr  65.0 1.1E+02  0.0024   28.0  11.2  149  119-283    48-207 (369)
203 KOG0649 WD40 repeat protein [G  63.4      95  0.0021   26.5  16.5  129   68-218   126-263 (325)
204 COG4880 Secreted protein conta  62.6 1.3E+02  0.0029   27.9  13.8  184   83-288   406-600 (603)
205 KOG0647 mRNA export protein (c  62.0 1.1E+02  0.0025   26.9  10.4   63  211-279    83-146 (347)
206 PTZ00420 coronin; Provisional   61.2 1.7E+02  0.0037   28.7  18.0  148  121-286   139-301 (568)
207 KOG3545 Olfactomedin and relat  60.4 1.1E+02  0.0023   26.2  16.9  190   68-278    30-246 (249)
208 KOG0295 WD40 repeat-containing  59.8 1.4E+02  0.0029   27.1  15.6   53  257-313   344-396 (406)
209 PF13088 BNR_2:  BNR repeat-lik  59.7 1.1E+02  0.0025   26.2  16.2  214   93-308    28-275 (275)
210 KOG0640 mRNA cleavage stimulat  59.6      79  0.0017   27.9   8.4  104  168-282   184-295 (430)
211 PF02239 Cytochrom_D1:  Cytochr  59.4 1.4E+02  0.0031   27.3  13.0  134  135-280    15-160 (369)
212 PF06433 Me-amine-dh_H:  Methyl  58.5      42  0.0009   30.2   6.8   70  119-192   249-324 (342)
213 COG3391 Uncharacterized conser  58.2 1.5E+02  0.0033   27.2  18.8  150  167-324    85-242 (381)
214 KOG4378 Nuclear protein COP1 [  57.4 1.8E+02  0.0038   27.7  12.4   90  180-279   187-281 (673)
215 KOG0640 mRNA cleavage stimulat  55.5 1.2E+02  0.0026   26.9   8.7  108  167-282   227-339 (430)
216 KOG1520 Predicted alkaloid syn  55.4 1.7E+02  0.0036   26.8  12.7   69  211-285   126-214 (376)
217 KOG0289 mRNA splicing factor [  54.7 1.8E+02   0.004   27.0  17.7  119  112-246   350-473 (506)
218 PF14298 DUF4374:  Domain of un  54.4 1.9E+02  0.0041   27.1  11.9   59  180-239   367-428 (435)
219 KOG1274 WD40 repeat protein [G  54.0 2.6E+02  0.0057   28.6  17.6  149  170-330    68-225 (933)
220 KOG4378 Nuclear protein COP1 [  53.1 2.1E+02  0.0045   27.2  14.9   61  257-322   219-281 (673)
221 PF14781 BBS2_N:  Ciliary BBSom  52.5   1E+02  0.0023   23.6   7.5   63  168-242    64-134 (136)
222 KOG3881 Uncharacterized conser  51.8 1.9E+02  0.0042   26.4  12.2  148  120-280   161-322 (412)
223 KOG0282 mRNA splicing factor [  51.3      70  0.0015   29.9   7.1   63  257-322   268-331 (503)
224 PF07433 DUF1513:  Protein of u  51.2 1.8E+02  0.0039   25.9  17.7  214  113-334     8-260 (305)
225 PF13859 BNR_3:  BNR repeat-lik  51.0 1.8E+02  0.0039   25.9  12.7  166  161-327     2-211 (310)
226 KOG0772 Uncharacterized conser  50.7 2.3E+02   0.005   27.1  15.8  154  156-324   315-491 (641)
227 KOG1517 Guanine nucleotide bin  50.4 3.3E+02  0.0072   28.8  13.1  163  148-324  1154-1336(1387)
228 PF11768 DUF3312:  Protein of u  50.1 1.8E+02  0.0038   28.1   9.8  107  211-324   218-332 (545)
229 KOG0321 WD40 repeat-containing  49.1      90  0.0019   30.4   7.7   97  228-324    75-178 (720)
230 PF08662 eIF2A:  Eukaryotic tra  48.8 1.5E+02  0.0032   24.2  10.2   64  167-236    71-134 (194)
231 PF13570 PQQ_3:  PQQ-like domai  48.5      42 0.00091   19.1   3.7   25  298-322    16-40  (40)
232 KOG2048 WD40 repeat protein [G  47.6 2.9E+02  0.0063   27.3  21.9  121  195-324   421-551 (691)
233 PRK01029 tolB translocation pr  47.2 2.5E+02  0.0054   26.3  21.8   59  227-286   351-411 (428)
234 PF11134 Phage_stabilise:  Phag  46.8 2.5E+02  0.0054   26.3  12.0  112  160-279   234-364 (469)
235 PF14781 BBS2_N:  Ciliary BBSom  44.1      46 0.00099   25.5   4.1   64  258-323    63-127 (136)
236 KOG1897 Damage-specific DNA bi  43.7   4E+02  0.0088   27.8  19.4  148  120-278   787-944 (1096)
237 KOG1428 Inhibitor of type V ad  43.3 3.8E+02  0.0081   29.8  11.3   49  210-267   496-544 (3738)
238 KOG0294 WD40 repeat-containing  43.2 2.4E+02  0.0053   25.1  14.1   44  146-191    29-74  (362)
239 KOG2048 WD40 repeat protein [G  42.5 3.5E+02  0.0076   26.7  21.4  147  163-318   159-315 (691)
240 KOG0272 U4/U6 small nuclear ri  42.3 2.8E+02  0.0062   25.7  11.1  111  166-289   313-427 (459)
241 KOG0650 WD40 repeat nucleolar   40.7   2E+02  0.0043   28.1   8.4  135  168-312   534-680 (733)
242 KOG1538 Uncharacterized conser  40.4 3.9E+02  0.0084   26.7  12.5  170  147-323     2-253 (1081)
243 COG4101 Predicted mannose-6-ph  40.3      98  0.0021   23.0   5.1   48  302-350    67-119 (142)
244 COG3391 Uncharacterized conser  39.8   3E+02  0.0066   25.3  19.9  153  118-285    84-246 (381)
245 PRK01029 tolB translocation pr  39.7 3.2E+02   0.007   25.6  25.8  185  135-325   210-407 (428)
246 KOG1446 Histone H3 (Lys4) meth  39.6 2.7E+02  0.0059   24.6  15.9  147  180-336    36-183 (311)
247 PF13088 BNR_2:  BNR repeat-lik  39.1 2.5E+02  0.0054   24.0  12.6  143   68-216   118-275 (275)
248 KOG1963 WD40 repeat protein [G  38.6 4.4E+02  0.0095   26.8  16.5   95  228-322   433-539 (792)
249 PF13013 F-box-like_2:  F-box-l  37.8      32 0.00069   25.3   2.4   29   18-46     22-50  (109)
250 KOG0639 Transducin-like enhanc  37.3 3.4E+02  0.0073   26.0   9.2   34  256-290   474-507 (705)
251 COG3823 Glutamine cyclotransfe  36.9 2.5E+02  0.0055   23.5   9.7  116  207-331    51-168 (262)
252 KOG0322 G-protein beta subunit  36.8 2.9E+02  0.0062   24.1   9.2  113  167-289   163-287 (323)
253 KOG0639 Transducin-like enhanc  36.6 1.1E+02  0.0024   28.9   6.1  101  167-278   476-581 (705)
254 KOG4649 PQQ (pyrrolo-quinoline  35.5   3E+02  0.0065   24.0  19.5   71  115-196    16-91  (354)
255 COG3386 Gluconolactonase [Carb  35.2 3.3E+02  0.0071   24.3  22.8  205   82-312    46-276 (307)
256 PF08662 eIF2A:  Eukaryotic tra  34.7 2.5E+02  0.0054   22.8  11.7   68  119-196    71-139 (194)
257 PF09910 DUF2139:  Uncharacteri  34.5 3.3E+02  0.0072   24.2  16.0  112   68-195   117-238 (339)
258 KOG2106 Uncharacterized conser  34.1 4.3E+02  0.0093   25.3  13.1   98  211-319   212-314 (626)
259 TIGR03032 conserved hypothetic  33.8 3.5E+02  0.0076   24.3  22.1   33  257-290   211-243 (335)
260 KOG0299 U3 snoRNP-associated p  32.6 4.3E+02  0.0092   24.9  14.9  141  116-281   210-359 (479)
261 KOG0308 Conserved WD40 repeat-  32.0 3.2E+02   0.007   26.9   8.5   95  180-281    95-204 (735)
262 PF06977 SdiA-regulated:  SdiA-  31.8 2.1E+02  0.0046   24.5   6.9   59  258-318   182-247 (248)
263 KOG0265 U5 snRNP-specific prot  31.5 3.7E+02   0.008   23.8  10.3   96  167-280    58-165 (338)
264 PF00780 CNH:  CNH domain;  Int  30.9 3.4E+02  0.0074   23.2  18.7   83  256-341   146-236 (275)
265 KOG3545 Olfactomedin and relat  30.4 3.5E+02  0.0076   23.2  19.6  190  119-320    30-245 (249)
266 KOG0306 WD40-repeat-containing  30.4 5.9E+02   0.013   25.8  15.4  169  135-320    42-218 (888)
267 KOG2315 Predicted translation   29.8 5.2E+02   0.011   25.0  13.3   64  119-190   282-346 (566)
268 KOG1188 WD40 repeat protein [G  29.7 4.2E+02  0.0092   23.9  12.4  176  135-324    49-245 (376)
269 KOG1274 WD40 repeat protein [G  29.7 6.5E+02   0.014   26.0  21.0   60  258-321   200-262 (933)
270 PF11768 DUF3312:  Protein of u  28.3 5.6E+02   0.012   24.9  10.6   96   82-191   235-332 (545)
271 KOG4341 F-box protein containi  28.1      32  0.0007   31.7   1.4   38   16-53     70-107 (483)
272 smart00564 PQQ beta-propeller   27.9   1E+02  0.0022   16.2   3.4   24  256-281     4-27  (33)
273 KOG0293 WD40 repeat-containing  27.5 5.1E+02   0.011   24.1  16.5  130  137-279   292-426 (519)
274 PRK02888 nitrous-oxide reducta  27.4 6.3E+02   0.014   25.1  12.7   54  271-324   297-354 (635)
275 KOG0308 Conserved WD40 repeat-  27.1 6.3E+02   0.014   25.1  12.9   66  118-191   128-204 (735)
276 KOG0306 WD40-repeat-containing  27.1 4.9E+02   0.011   26.3   8.9   99  166-276   382-482 (888)
277 KOG2314 Translation initiation  26.5 4.8E+02    0.01   25.4   8.5   70   68-145   457-526 (698)
278 KOG0301 Phospholipase A2-activ  26.2 6.7E+02   0.014   25.1  13.6   66  256-326   228-293 (745)
279 COG3490 Uncharacterized protei  26.1 4.6E+02    0.01   23.2  15.9   56  258-318   290-345 (366)
280 KOG0646 WD40 repeat protein [G  25.4 5.8E+02   0.012   24.1  11.0   59  160-222    84-145 (476)
281 KOG1898 Splicing factor 3b, su  25.1 8.5E+02   0.018   25.9  13.3  160  134-302   851-1028(1205)
282 KOG0270 WD40 repeat-containing  24.7 5.8E+02   0.013   23.9  14.5   28  297-324   424-452 (463)
283 PF14298 DUF4374:  Domain of un  24.2 6.1E+02   0.013   23.9  12.8   61  270-330   367-433 (435)
284 PF15525 DUF4652:  Domain of un  23.8 4.1E+02  0.0089   21.8  10.5   69   84-156    89-160 (200)
285 KOG3669 Uncharacterized conser  23.7   7E+02   0.015   24.4  12.2   54  141-200   213-273 (705)
286 KOG0279 G protein beta subunit  23.4 5.1E+02   0.011   22.7  18.4  186  119-327    74-268 (315)
287 KOG0319 WD40-repeat-containing  22.9 7.9E+02   0.017   24.8  16.1  169  138-324    86-271 (775)
288 smart00120 HX Hemopexin-like r  22.6 1.7E+02  0.0036   16.8   4.7   19  303-322     9-27  (45)
289 COG3823 Glutamine cyclotransfe  22.5 4.7E+02    0.01   22.0  16.2  186  113-313    49-250 (262)
290 KOG1963 WD40 repeat protein [G  22.4 8.5E+02   0.018   24.9  10.4   59  254-313   258-323 (792)
291 KOG0294 WD40 repeat-containing  21.9 5.8E+02   0.013   22.8  19.1   27  115-147    48-74  (362)
292 KOG0303 Actin-binding protein   21.4 6.6E+02   0.014   23.3  11.6  135  180-324   154-297 (472)
293 KOG0265 U5 snRNP-specific prot  21.1 5.9E+02   0.013   22.6  14.8  148   68-239    58-208 (338)

No 1  
>KOG4441 consensus Proteins containing BTB/POZ and Kelch domains, involved in regulatory/signal transduction processes [Signal transduction mechanisms; General function prediction only]
Probab=100.00  E-value=1.4e-42  Score=327.48  Aligned_cols=258  Identities=24%  Similarity=0.386  Sum_probs=230.8

Q ss_pred             CCCCEEEEEEecCC--CCceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEEC
Q 045821           66 LDETWIYAFCRDNK--LERVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDA  143 (352)
Q Consensus        66 ~~~~~l~~~gg~~~--~~~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~  143 (352)
                      ...+.|+++||...  .....+..|||  .++.|..+++|+   .+|..+++++++|.||++||.+.....++++++||+
T Consensus       282 ~~~~~l~~vGG~~~~~~~~~~ve~yd~--~~~~w~~~a~m~---~~r~~~~~~~~~~~lYv~GG~~~~~~~l~~ve~YD~  356 (571)
T KOG4441|consen  282 SVSGKLVAVGGYNRQGQSLRSVECYDP--KTNEWSSLAPMP---SPRCRVGVAVLNGKLYVVGGYDSGSDRLSSVERYDP  356 (571)
T ss_pred             CCCCeEEEECCCCCCCcccceeEEecC--CcCcEeecCCCC---cccccccEEEECCEEEEEccccCCCcccceEEEecC
Confidence            34688999999864  33456889999  999999999999   778899999999999999999843377899999999


Q ss_pred             CCCeeEecCCCCCcceeeeeEEECCEEEEEeCCCCC---CceEEEeCCCCcEEEecCCCCcccceeEEEECCEEEEEccC
Q 045821          144 SMNTWTDAAPMSTARCYFPCGVLNQKIYCIGGLGDT---HSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSA  220 (352)
Q Consensus       144 ~t~~W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~---~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~  220 (352)
                      .+++|..+++|+.+|..+++++++|.||++||.+..   +++|+|||.+++|..+++|+..+..+++++++|+||++||.
T Consensus       357 ~~~~W~~~a~M~~~R~~~~v~~l~g~iYavGG~dg~~~l~svE~YDp~~~~W~~va~m~~~r~~~gv~~~~g~iYi~GG~  436 (571)
T KOG4441|consen  357 RTNQWTPVAPMNTKRSDFGVAVLDGKLYAVGGFDGEKSLNSVECYDPVTNKWTPVAPMLTRRSGHGVAVLGGKLYIIGGG  436 (571)
T ss_pred             CCCceeccCCccCccccceeEEECCEEEEEeccccccccccEEEecCCCCcccccCCCCcceeeeEEEEECCEEEEEcCc
Confidence            999999999999999999999999999999999865   99999999999999999999988899999999999999998


Q ss_pred             CCCC-CceEEEEEeCCCCceeecCCCCCCCCCCcEEEECCEEEEEecCCC----CeEEEEeCCCCceEEccCCCCCcCCC
Q 045821          221 SAAT-SHVCALVYEPSTDSWLHADANMASGWRGPAVVVDDALYVLDQSSG----TKLMMWQKESREWSPVGRLSTLLTRP  295 (352)
Q Consensus       221 ~~~~-~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~----~~v~~yd~~~~~W~~~~~~~~~~~~~  295 (352)
                      .... ....+++|||.+++|+.++++..++....+++.+++||++||+++    ..+.+|||.+++|+.++.++.  .+.
T Consensus       437 ~~~~~~l~sve~YDP~t~~W~~~~~M~~~R~~~g~a~~~~~iYvvGG~~~~~~~~~VE~ydp~~~~W~~v~~m~~--~rs  514 (571)
T KOG4441|consen  437 DGSSNCLNSVECYDPETNTWTLIAPMNTRRSGFGVAVLNGKIYVVGGFDGTSALSSVERYDPETNQWTMVAPMTS--PRS  514 (571)
T ss_pred             CCCccccceEEEEcCCCCceeecCCcccccccceEEEECCEEEEECCccCCCccceEEEEcCCCCceeEcccCcc--ccc
Confidence            8766 667999999999999999987777777778899999999999876    468999999999999988876  588


Q ss_pred             CceEEEeCCEEEEEcCCc------eEEEEEcCCcccccceE
Q 045821          296 PCKLVAIGKTIFVIGKGC------SAVVIDVGNIGNIGGIM  330 (352)
Q Consensus       296 ~~~~~~~~~~i~v~gG~~------~~~~~d~~~~~w~~~~~  330 (352)
                      ..+++..+++||++||.+      .+..||+++++|.....
T Consensus       515 ~~g~~~~~~~ly~vGG~~~~~~l~~ve~ydp~~d~W~~~~~  555 (571)
T KOG4441|consen  515 AVGVVVLGGKLYAVGGFDGNNNLNTVECYDPETDTWTEVTE  555 (571)
T ss_pred             cccEEEECCEEEEEecccCccccceeEEcCCCCCceeeCCC
Confidence            889999999999999966      35579999999987544


No 2  
>PHA02713 hypothetical protein; Provisional
Probab=100.00  E-value=1.9e-41  Score=321.20  Aligned_cols=255  Identities=17%  Similarity=0.265  Sum_probs=215.0

Q ss_pred             CEEEEEEecCCCCceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEECCCCee
Q 045821           69 TWIYAFCRDNKLERVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDASMNTW  148 (352)
Q Consensus        69 ~~l~~~gg~~~~~~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W  148 (352)
                      ..+++.||.........+.||+  .+++|..++++|   .++.+++++++++.||++||.+......+++++||+.+++|
T Consensus       258 ~~l~~~~g~~~~~~~~v~~yd~--~~~~W~~l~~mp---~~r~~~~~a~l~~~IYviGG~~~~~~~~~~v~~Yd~~~n~W  332 (557)
T PHA02713        258 LCLVCHDTKYNVCNPCILVYNI--NTMEYSVISTIP---NHIINYASAIVDNEIIIAGGYNFNNPSLNKVYKINIENKIH  332 (557)
T ss_pred             eEEEEecCccccCCCCEEEEeC--CCCeEEECCCCC---ccccceEEEEECCEEEEEcCCCCCCCccceEEEEECCCCeE
Confidence            3455544421111235678999  999999999998   56788899999999999999754334578999999999999


Q ss_pred             EecCCCCCcceeeeeEEECCEEEEEeCCCCC---CceEEEeCCCCcEEEecCCCCcccceeEEEECCEEEEEccCCCCC-
Q 045821          149 TDAAPMSTARCYFPCGVLNQKIYCIGGLGDT---HSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSASAAT-  224 (352)
Q Consensus       149 ~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~---~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~-  224 (352)
                      ..+++|+.+|..++++.++++||++||.+..   +++++||+.+++|+.+++|+.++..+++++++|+||++||..... 
T Consensus       333 ~~~~~m~~~R~~~~~~~~~g~IYviGG~~~~~~~~sve~Ydp~~~~W~~~~~mp~~r~~~~~~~~~g~IYviGG~~~~~~  412 (557)
T PHA02713        333 VELPPMIKNRCRFSLAVIDDTIYAIGGQNGTNVERTIECYTMGDDKWKMLPDMPIALSSYGMCVLDQYIYIIGGRTEHID  412 (557)
T ss_pred             eeCCCCcchhhceeEEEECCEEEEECCcCCCCCCceEEEEECCCCeEEECCCCCcccccccEEEECCEEEEEeCCCcccc
Confidence            9999999999999999999999999998643   789999999999999999999999999999999999999976421 


Q ss_pred             -----------------CceEEEEEeCCCCceeecCCCCCCCCCCcEEEECCEEEEEecCCC-----CeEEEEeCCC-Cc
Q 045821          225 -----------------SHVCALVYEPSTDSWLHADANMASGWRGPAVVVDDALYVLDQSSG-----TKLMMWQKES-RE  281 (352)
Q Consensus       225 -----------------~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~-----~~v~~yd~~~-~~  281 (352)
                                       ....+++|||.+++|+.+++.+..+....+++++|+||++||..+     +.+.+|||++ ++
T Consensus       413 ~~~~~~~~~~~~~~~~~~~~~ve~YDP~td~W~~v~~m~~~r~~~~~~~~~~~IYv~GG~~~~~~~~~~ve~Ydp~~~~~  492 (557)
T PHA02713        413 YTSVHHMNSIDMEEDTHSSNKVIRYDTVNNIWETLPNFWTGTIRPGVVSHKDDIYVVCDIKDEKNVKTCIFRYNTNTYNG  492 (557)
T ss_pred             cccccccccccccccccccceEEEECCCCCeEeecCCCCcccccCcEEEECCEEEEEeCCCCCCccceeEEEecCCCCCC
Confidence                             135799999999999999876666656667889999999999753     3578999999 89


Q ss_pred             eEEccCCCCCcCCCCceEEEeCCEEEEEcCCce---EEEEEcCCcccccceE
Q 045821          282 WSPVGRLSTLLTRPPCKLVAIGKTIFVIGKGCS---AVVIDVGNIGNIGGIM  330 (352)
Q Consensus       282 W~~~~~~~~~~~~~~~~~~~~~~~i~v~gG~~~---~~~~d~~~~~w~~~~~  330 (352)
                      |+.++.+|.  .|..+++++++++||++||.+.   +..||+++++|...++
T Consensus       493 W~~~~~m~~--~r~~~~~~~~~~~iyv~Gg~~~~~~~e~yd~~~~~W~~~~~  542 (557)
T PHA02713        493 WELITTTES--RLSALHTILHDNTIMMLHCYESYMLQDTFNVYTYEWNHICH  542 (557)
T ss_pred             eeEccccCc--ccccceeEEECCEEEEEeeecceeehhhcCcccccccchhh
Confidence            999999987  5888999999999999999775   5579999999986444


No 3  
>KOG4441 consensus Proteins containing BTB/POZ and Kelch domains, involved in regulatory/signal transduction processes [Signal transduction mechanisms; General function prediction only]
Probab=100.00  E-value=5.5e-39  Score=303.11  Aligned_cols=254  Identities=24%  Similarity=0.400  Sum_probs=216.2

Q ss_pred             ccchhhhhhhhhhcccccChhhHHHHhhCCCC--CCEEEEEEecCC-CC-ceeEEEecCCCCCCCeeEeccCCCcccCCc
Q 045821           36 RYHAVLKCVSSRWRGLLHSEEWCAYRRKHNLD--ETWIYAFCRDNK-LE-RVCCYVLDPNSTRRSWKMIHELPARTLRRK  111 (352)
Q Consensus        36 ~~~~~l~~v~k~W~~l~~s~~~~~~~~~~~~~--~~~l~~~gg~~~-~~-~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~  111 (352)
                      .....+....+.|..+...|   ..|.+++..  ++.||++||.+. .. ....+.|||  .+++|..+++|.   .+|.
T Consensus       301 ~~ve~yd~~~~~w~~~a~m~---~~r~~~~~~~~~~~lYv~GG~~~~~~~l~~ve~YD~--~~~~W~~~a~M~---~~R~  372 (571)
T KOG4441|consen  301 RSVECYDPKTNEWSSLAPMP---SPRCRVGVAVLNGKLYVVGGYDSGSDRLSSVERYDP--RTNQWTPVAPMN---TKRS  372 (571)
T ss_pred             ceeEEecCCcCcEeecCCCC---cccccccEEEECCEEEEEccccCCCcccceEEEecC--CCCceeccCCcc---Cccc
Confidence            33444455556798886555   445555544  999999999873 33 456899999  999999999999   7899


Q ss_pred             ceEEEEECCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcceeeeeEEECCEEEEEeCCCCC----CceEEEeC
Q 045821          112 GMGFEVLGKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCYFPCGVLNQKIYCIGGLGDT----HSWDVYDP  187 (352)
Q Consensus       112 ~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~----~~v~~yd~  187 (352)
                      .+++++++|.||++||.++. ...+++++|||.+++|..+++|+.+|+.|++++++++||++||.+..    +++++|||
T Consensus       373 ~~~v~~l~g~iYavGG~dg~-~~l~svE~YDp~~~~W~~va~m~~~r~~~gv~~~~g~iYi~GG~~~~~~~l~sve~YDP  451 (571)
T KOG4441|consen  373 DFGVAVLDGKLYAVGGFDGE-KSLNSVECYDPVTNKWTPVAPMLTRRSGHGVAVLGGKLYIIGGGDGSSNCLNSVECYDP  451 (571)
T ss_pred             cceeEEECCEEEEEeccccc-cccccEEEecCCCCcccccCCCCcceeeeEEEEECCEEEEEcCcCCCccccceEEEEcC
Confidence            99999999999999999975 77899999999999999999999999999999999999999997654    89999999


Q ss_pred             CCCcEEEecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEEECCEEEEEecC
Q 045821          188 RTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVVVDDALYVLDQS  267 (352)
Q Consensus       188 ~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~l~~~gg~  267 (352)
                      .|++|+.+++|+.++..+++++++++||++||.++......+++||+.+++|+.+.++...+.....++.+++||++||+
T Consensus       452 ~t~~W~~~~~M~~~R~~~g~a~~~~~iYvvGG~~~~~~~~~VE~ydp~~~~W~~v~~m~~~rs~~g~~~~~~~ly~vGG~  531 (571)
T KOG4441|consen  452 ETNTWTLIAPMNTRRSGFGVAVLNGKIYVVGGFDGTSALSSVERYDPETNQWTMVAPMTSPRSAVGVVVLGGKLYAVGGF  531 (571)
T ss_pred             CCCceeecCCcccccccceEEEECCEEEEECCccCCCccceEEEEcCCCCceeEcccCccccccccEEEECCEEEEEecc
Confidence            99999999999999999999999999999999998555568999999999999997666666666678899999999998


Q ss_pred             CC----CeEEEEeCCCCceEEccCCCCCcCCCCceEE
Q 045821          268 SG----TKLMMWQKESREWSPVGRLSTLLTRPPCKLV  300 (352)
Q Consensus       268 ~~----~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~  300 (352)
                      .+    +.+..|||++++|+.... +.. .+...+++
T Consensus       532 ~~~~~l~~ve~ydp~~d~W~~~~~-~~~-~~~~~~~~  566 (571)
T KOG4441|consen  532 DGNNNLNTVECYDPETDTWTEVTE-PES-GRGGAGVA  566 (571)
T ss_pred             cCccccceeEEcCCCCCceeeCCC-ccc-cccCcceE
Confidence            77    689999999999999998 442 34444443


No 4  
>PLN02153 epithiospecifier protein
Probab=100.00  E-value=6.5e-37  Score=275.92  Aligned_cols=264  Identities=17%  Similarity=0.304  Sum_probs=198.4

Q ss_pred             hhcccccCh-hhHHHHhhCCCC--CCEEEEEEecCCC-C--ceeEEEecCCCCCCCeeEeccCCCccc-CCcceEEEEEC
Q 045821           47 RWRGLLHSE-EWCAYRRKHNLD--ETWIYAFCRDNKL-E--RVCCYVLDPNSTRRSWKMIHELPARTL-RRKGMGFEVLG  119 (352)
Q Consensus        47 ~W~~l~~s~-~~~~~~~~~~~~--~~~l~~~gg~~~~-~--~~~~~~~d~~~~~~~w~~~~~~~~~~~-~~~~~~~~~~~  119 (352)
                      .|..+.... ..+.+|..|++.  ++.||++||.... .  ...++.||+  .+++|..++++...+. .+.++++++++
T Consensus         8 ~W~~~~~~~~~~P~pR~~h~~~~~~~~iyv~GG~~~~~~~~~~~~~~yd~--~~~~W~~~~~~~~~p~~~~~~~~~~~~~   85 (341)
T PLN02153          8 GWIKVEQKGGKGPGPRCSHGIAVVGDKLYSFGGELKPNEHIDKDLYVFDF--NTHTWSIAPANGDVPRISCLGVRMVAVG   85 (341)
T ss_pred             eEEEecCCCCCCCCCCCcceEEEECCEEEEECCccCCCCceeCcEEEEEC--CCCEEEEcCccCCCCCCccCceEEEEEC
Confidence            488776532 234456666544  8999999997432 2  236899999  9999999987653212 24478889999


Q ss_pred             CEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCC-----CCcceeeeeEEECCEEEEEeCCCCC---------CceEEE
Q 045821          120 KNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPM-----STARCYFPCGVLNQKIYCIGGLGDT---------HSWDVY  185 (352)
Q Consensus       120 ~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~-----~~~r~~~~~~~~~~~iyv~gG~~~~---------~~v~~y  185 (352)
                      ++||++||.... ...+++++||+.+++|+.++++     |.+|..|+++.++++|||+||....         +++++|
T Consensus        86 ~~iyv~GG~~~~-~~~~~v~~yd~~t~~W~~~~~~~~~~~p~~R~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~v~~y  164 (341)
T PLN02153         86 TKLYIFGGRDEK-REFSDFYSYDTVKNEWTFLTKLDEEGGPEARTFHSMASDENHVYVFGGVSKGGLMKTPERFRTIEAY  164 (341)
T ss_pred             CEEEEECCCCCC-CccCcEEEEECCCCEEEEeccCCCCCCCCCceeeEEEEECCEEEEECCccCCCccCCCcccceEEEE
Confidence            999999998654 4578999999999999999877     7889999999999999999997531         578999


Q ss_pred             eCCCCcEEEecCCC---CcccceeEEEECCEEEEEccCCCC--------CCceEEEEEeCCCCceeecCC---CCCCCCC
Q 045821          186 DPRTNNWKLHTEPN---IFTEIEDSFVMDGKIYIRCSASAA--------TSHVCALVYEPSTDSWLHADA---NMASGWR  251 (352)
Q Consensus       186 d~~t~~W~~~~~~~---~~~~~~~~~~~~g~iyv~GG~~~~--------~~~~~i~~yd~~~~~W~~~~~---~~~~~~~  251 (352)
                      |+.+++|+.++++.   .++..+.+++++++||++||....        ...+.+++||+.+++|++++.   .+..+..
T Consensus       165 d~~~~~W~~l~~~~~~~~~r~~~~~~~~~~~iyv~GG~~~~~~~gG~~~~~~~~v~~yd~~~~~W~~~~~~g~~P~~r~~  244 (341)
T PLN02153        165 NIADGKWVQLPDPGENFEKRGGAGFAVVQGKIWVVYGFATSILPGGKSDYESNAVQFFDPASGKWTEVETTGAKPSARSV  244 (341)
T ss_pred             ECCCCeEeeCCCCCCCCCCCCcceEEEECCeEEEEeccccccccCCccceecCceEEEEcCCCcEEeccccCCCCCCcce
Confidence            99999999998653   566667788899999999886421        113589999999999999864   2333334


Q ss_pred             CcEEEECCEEEEEecCC----------C---CeEEEEeCCCCceEEccCC---CCCcCCCC-ceEEEe-CCEEEEEcCCc
Q 045821          252 GPAVVVDDALYVLDQSS----------G---TKLMMWQKESREWSPVGRL---STLLTRPP-CKLVAI-GKTIFVIGKGC  313 (352)
Q Consensus       252 ~~~~~~~~~l~~~gg~~----------~---~~v~~yd~~~~~W~~~~~~---~~~~~~~~-~~~~~~-~~~i~v~gG~~  313 (352)
                      +.+++++++||++||..          +   +++++||+++++|+.+...   +.+..+.. +++++. +++||++||..
T Consensus       245 ~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~n~v~~~d~~~~~W~~~~~~~~~~~pr~~~~~~~~~v~~~~~~~~~gG~~  324 (341)
T PLN02153        245 FAHAVVGKYIIIFGGEVWPDLKGHLGPGTLSNEGYALDTETLVWEKLGECGEPAMPRGWTAYTTATVYGKNGLLMHGGKL  324 (341)
T ss_pred             eeeEEECCEEEEECcccCCccccccccccccccEEEEEcCccEEEeccCCCCCCCCCccccccccccCCcceEEEEcCcC
Confidence            45678899999999962          1   4899999999999998632   22212322 233333 45899999974


No 5  
>PHA02713 hypothetical protein; Provisional
Probab=100.00  E-value=1.1e-37  Score=295.41  Aligned_cols=236  Identities=16%  Similarity=0.212  Sum_probs=198.1

Q ss_pred             hhhhhcccccChhhHHHHhhCC--CCCCEEEEEEecCC--CCceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEEC
Q 045821           44 VSSRWRGLLHSEEWCAYRRKHN--LDETWIYAFCRDNK--LERVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLG  119 (352)
Q Consensus        44 v~k~W~~l~~s~~~~~~~~~~~--~~~~~l~~~gg~~~--~~~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~  119 (352)
                      ..+.|..+.+.|.   .+..++  ..++.||++||.+.  ......+.|||  .++.|..+++|+   .+|..+++++++
T Consensus       280 ~~~~W~~l~~mp~---~r~~~~~a~l~~~IYviGG~~~~~~~~~~v~~Yd~--~~n~W~~~~~m~---~~R~~~~~~~~~  351 (557)
T PHA02713        280 NTMEYSVISTIPN---HIINYASAIVDNEIIIAGGYNFNNPSLNKVYKINI--ENKIHVELPPMI---KNRCRFSLAVID  351 (557)
T ss_pred             CCCeEEECCCCCc---cccceEEEEECCEEEEEcCCCCCCCccceEEEEEC--CCCeEeeCCCCc---chhhceeEEEEC
Confidence            3457888755443   333333  33999999999642  22346889999  999999999998   689999999999


Q ss_pred             CEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcceeeeeEEECCEEEEEeCCCC---------------------
Q 045821          120 KNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCYFPCGVLNQKIYCIGGLGD---------------------  178 (352)
Q Consensus       120 ~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~---------------------  178 (352)
                      ++||++||.++. ...+++++|||.+++|..+++||.+|..+++++++++||++||...                     
T Consensus       352 g~IYviGG~~~~-~~~~sve~Ydp~~~~W~~~~~mp~~r~~~~~~~~~g~IYviGG~~~~~~~~~~~~~~~~~~~~~~~~  430 (557)
T PHA02713        352 DTIYAIGGQNGT-NVERTIECYTMGDDKWKMLPDMPIALSSYGMCVLDQYIYIIGGRTEHIDYTSVHHMNSIDMEEDTHS  430 (557)
T ss_pred             CEEEEECCcCCC-CCCceEEEEECCCCeEEECCCCCcccccccEEEECCEEEEEeCCCcccccccccccccccccccccc
Confidence            999999998654 4578899999999999999999999999999999999999999752                     


Q ss_pred             CCceEEEeCCCCcEEEecCCCCcccceeEEEECCEEEEEccCCCCCCc-eEEEEEeCCC-CceeecCCCCCCCCCCcEEE
Q 045821          179 THSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATSH-VCALVYEPST-DSWLHADANMASGWRGPAVV  256 (352)
Q Consensus       179 ~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~-~~i~~yd~~~-~~W~~~~~~~~~~~~~~~~~  256 (352)
                      .+++++|||.+++|+.+++|+.++..+++++++|+||++||..+.... ..+++||+.+ ++|+.+++.+.++....+++
T Consensus       431 ~~~ve~YDP~td~W~~v~~m~~~r~~~~~~~~~~~IYv~GG~~~~~~~~~~ve~Ydp~~~~~W~~~~~m~~~r~~~~~~~  510 (557)
T PHA02713        431 SNKVIRYDTVNNIWETLPNFWTGTIRPGVVSHKDDIYVVCDIKDEKNVKTCIFRYNTNTYNGWELITTTESRLSALHTIL  510 (557)
T ss_pred             cceEEEECCCCCeEeecCCCCcccccCcEEEECCEEEEEeCCCCCCccceeEEEecCCCCCCeeEccccCcccccceeEE
Confidence            157999999999999999999999989999999999999987643333 3689999999 89999988666666667788


Q ss_pred             ECCEEEEEecCCC-CeEEEEeCCCCceEEccCC
Q 045821          257 VDDALYVLDQSSG-TKLMMWQKESREWSPVGRL  288 (352)
Q Consensus       257 ~~~~l~~~gg~~~-~~v~~yd~~~~~W~~~~~~  288 (352)
                      ++|+||++||+.+ ..+.+||+.+++|+.++..
T Consensus       511 ~~~~iyv~Gg~~~~~~~e~yd~~~~~W~~~~~~  543 (557)
T PHA02713        511 HDNTIMMLHCYESYMLQDTFNVYTYEWNHICHQ  543 (557)
T ss_pred             ECCEEEEEeeecceeehhhcCcccccccchhhh
Confidence            9999999999876 5689999999999988744


No 6  
>PLN02193 nitrile-specifier protein
Probab=100.00  E-value=2.6e-35  Score=274.88  Aligned_cols=261  Identities=17%  Similarity=0.264  Sum_probs=207.1

Q ss_pred             CCEEEEEEecCCCC--ceeEEEecCCCCC----CCeeEeccCCCcccCCcceEEEEECCEEEEEeCCCCCCC-CCCeEEE
Q 045821           68 ETWIYAFCRDNKLE--RVCCYVLDPNSTR----RSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWSED-ATSEVYC  140 (352)
Q Consensus        68 ~~~l~~~gg~~~~~--~~~~~~~d~~~~~----~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~-~~~~~~~  140 (352)
                      ++.|+.++|.....  ..-.+.+++  .+    ++|.++.++...|.+|..|+++.+++.||++||...... ..+++++
T Consensus       120 ~~~ivgf~G~~~~~~~~ig~y~~~~--~~~~~~~~W~~~~~~~~~P~pR~~h~~~~~~~~iyv~GG~~~~~~~~~~~v~~  197 (470)
T PLN02193        120 GGKIVGFHGRSTDVLHSLGAYISLP--STPKLLGKWIKVEQKGEGPGLRCSHGIAQVGNKIYSFGGEFTPNQPIDKHLYV  197 (470)
T ss_pred             CCeEEEEeccCCCcEEeeEEEEecC--CChhhhceEEEcccCCCCCCCccccEEEEECCEEEEECCcCCCCCCeeCcEEE
Confidence            77888888864333  233566676  44    899999875433478999999999999999999754322 3468999


Q ss_pred             EECCCCeeEecCCC---CC-cceeeeeEEECCEEEEEeCCCCC---CceEEEeCCCCcEEEecCC---CCcccceeEEEE
Q 045821          141 YDASMNTWTDAAPM---ST-ARCYFPCGVLNQKIYCIGGLGDT---HSWDVYDPRTNNWKLHTEP---NIFTEIEDSFVM  210 (352)
Q Consensus       141 ~d~~t~~W~~~~~~---~~-~r~~~~~~~~~~~iyv~gG~~~~---~~v~~yd~~t~~W~~~~~~---~~~~~~~~~~~~  210 (352)
                      ||+.+++|..++.+   |. +|..++++.++++||++||....   +++++||+.+++|++++++   |.++..++++.+
T Consensus       198 yD~~~~~W~~~~~~g~~P~~~~~~~~~v~~~~~lYvfGG~~~~~~~ndv~~yD~~t~~W~~l~~~~~~P~~R~~h~~~~~  277 (470)
T PLN02193        198 FDLETRTWSISPATGDVPHLSCLGVRMVSIGSTLYVFGGRDASRQYNGFYSFDTTTNEWKLLTPVEEGPTPRSFHSMAAD  277 (470)
T ss_pred             EECCCCEEEeCCCCCCCCCCcccceEEEEECCEEEEECCCCCCCCCccEEEEECCCCEEEEcCcCCCCCCCccceEEEEE
Confidence            99999999988653   33 25678888999999999998654   8999999999999999876   677888888889


Q ss_pred             CCEEEEEccCCCCCCceEEEEEeCCCCceeecCCC---CCCCCCCcEEEECCEEEEEecCCC---CeEEEEeCCCCceEE
Q 045821          211 DGKIYIRCSASAATSHVCALVYEPSTDSWLHADAN---MASGWRGPAVVVDDALYVLDQSSG---TKLMMWQKESREWSP  284 (352)
Q Consensus       211 ~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~---~~~~~~~~~~~~~~~l~~~gg~~~---~~v~~yd~~~~~W~~  284 (352)
                      +++||++||.........+++||+.+++|+.++.+   +..+..+.+++++++||++||..+   +++++||+++++|++
T Consensus       278 ~~~iYv~GG~~~~~~~~~~~~yd~~t~~W~~~~~~~~~~~~R~~~~~~~~~gkiyviGG~~g~~~~dv~~yD~~t~~W~~  357 (470)
T PLN02193        278 EENVYVFGGVSATARLKTLDSYNIVDKKWFHCSTPGDSFSIRGGAGLEVVQGKVWVVYGFNGCEVDDVHYYDPVQDKWTQ  357 (470)
T ss_pred             CCEEEEECCCCCCCCcceEEEEECCCCEEEeCCCCCCCCCCCCCcEEEEECCcEEEEECCCCCccCceEEEECCCCEEEE
Confidence            99999999987766667899999999999998652   223333445678999999999764   689999999999999


Q ss_pred             ccCCC-CCcCCCCceEEEeCCEEEEEcCCc---------------eEEEEEcCCcccccceE
Q 045821          285 VGRLS-TLLTRPPCKLVAIGKTIFVIGKGC---------------SAVVIDVGNIGNIGGIM  330 (352)
Q Consensus       285 ~~~~~-~~~~~~~~~~~~~~~~i~v~gG~~---------------~~~~~d~~~~~w~~~~~  330 (352)
                      +..+. .+..|..++++.++++|||+||..               ++++||+++++|.+...
T Consensus       358 ~~~~g~~P~~R~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~ndv~~~D~~t~~W~~~~~  419 (470)
T PLN02193        358 VETFGVRPSERSVFASAAVGKHIVIFGGEIAMDPLAHVGPGQLTDGTFALDTETLQWERLDK  419 (470)
T ss_pred             eccCCCCCCCcceeEEEEECCEEEEECCccCCccccccCccceeccEEEEEcCcCEEEEccc
Confidence            97652 223577888889999999999953               36789999999986443


No 7  
>PLN02193 nitrile-specifier protein
Probab=100.00  E-value=6.2e-36  Score=279.11  Aligned_cols=264  Identities=17%  Similarity=0.312  Sum_probs=204.1

Q ss_pred             hhhcccccChhhHHHHhhCCCC--CCEEEEEEecCCC-C-c-eeEEEecCCCCCCCeeEeccCCCccc-CCcceEEEEEC
Q 045821           46 SRWRGLLHSEEWCAYRRKHNLD--ETWIYAFCRDNKL-E-R-VCCYVLDPNSTRRSWKMIHELPARTL-RRKGMGFEVLG  119 (352)
Q Consensus        46 k~W~~l~~s~~~~~~~~~~~~~--~~~l~~~gg~~~~-~-~-~~~~~~d~~~~~~~w~~~~~~~~~~~-~~~~~~~~~~~  119 (352)
                      ..|..+....+.+.+|..|+..  ++.||++||.... . . ..++.||+  .+++|..++.+...|. .+..+++++++
T Consensus       151 ~~W~~~~~~~~~P~pR~~h~~~~~~~~iyv~GG~~~~~~~~~~~v~~yD~--~~~~W~~~~~~g~~P~~~~~~~~~v~~~  228 (470)
T PLN02193        151 GKWIKVEQKGEGPGLRCSHGIAQVGNKIYSFGGEFTPNQPIDKHLYVFDL--ETRTWSISPATGDVPHLSCLGVRMVSIG  228 (470)
T ss_pred             ceEEEcccCCCCCCCccccEEEEECCEEEEECCcCCCCCCeeCcEEEEEC--CCCEEEeCCCCCCCCCCcccceEEEEEC
Confidence            5798876554555567767654  8999999996422 2 2 35899999  9999998876532222 35678899999


Q ss_pred             CEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCC---CCcceeeeeEEECCEEEEEeCCCCC---CceEEEeCCCCcEE
Q 045821          120 KNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPM---STARCYFPCGVLNQKIYCIGGLGDT---HSWDVYDPRTNNWK  193 (352)
Q Consensus       120 ~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~---~~~r~~~~~~~~~~~iyv~gG~~~~---~~v~~yd~~t~~W~  193 (352)
                      ++||++||.+.. ...+++++||+.+++|+.++++   |.||..|+++.++++|||+||....   +++++||+.+++|+
T Consensus       229 ~~lYvfGG~~~~-~~~ndv~~yD~~t~~W~~l~~~~~~P~~R~~h~~~~~~~~iYv~GG~~~~~~~~~~~~yd~~t~~W~  307 (470)
T PLN02193        229 STLYVFGGRDAS-RQYNGFYSFDTTTNEWKLLTPVEEGPTPRSFHSMAADEENVYVFGGVSATARLKTLDSYNIVDKKWF  307 (470)
T ss_pred             CEEEEECCCCCC-CCCccEEEEECCCCEEEEcCcCCCCCCCccceEEEEECCEEEEECCCCCCCCcceEEEEECCCCEEE
Confidence            999999998754 5678999999999999999887   7899999999999999999998654   78999999999999


Q ss_pred             EecC---CCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCceeecCCC---CCCCCCCcEEEECCEEEEEecC
Q 045821          194 LHTE---PNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWLHADAN---MASGWRGPAVVVDDALYVLDQS  267 (352)
Q Consensus       194 ~~~~---~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~---~~~~~~~~~~~~~~~l~~~gg~  267 (352)
                      .++.   ++.++..+.+++++++||++||..+ ...+++++||+.+++|++++..   +..+..+.+++++++||++||.
T Consensus       308 ~~~~~~~~~~~R~~~~~~~~~gkiyviGG~~g-~~~~dv~~yD~~t~~W~~~~~~g~~P~~R~~~~~~~~~~~iyv~GG~  386 (470)
T PLN02193        308 HCSTPGDSFSIRGGAGLEVVQGKVWVVYGFNG-CEVDDVHYYDPVQDKWTQVETFGVRPSERSVFASAAVGKHIVIFGGE  386 (470)
T ss_pred             eCCCCCCCCCCCCCcEEEEECCcEEEEECCCC-CccCceEEEECCCCEEEEeccCCCCCCCcceeEEEEECCEEEEECCc
Confidence            9874   4456667778889999999998764 2356899999999999998652   3344445567889999999996


Q ss_pred             CC-------------CeEEEEeCCCCceEEccCCCC----CcCCCCce--EEEe--CCEEEEEcCCc
Q 045821          268 SG-------------TKLMMWQKESREWSPVGRLST----LLTRPPCK--LVAI--GKTIFVIGKGC  313 (352)
Q Consensus       268 ~~-------------~~v~~yd~~~~~W~~~~~~~~----~~~~~~~~--~~~~--~~~i~v~gG~~  313 (352)
                      ..             +++++||+.+++|+.+..++.    +..|..++  ....  ++.|+++||..
T Consensus       387 ~~~~~~~~~~~~~~~ndv~~~D~~t~~W~~~~~~~~~~~~P~~R~~~~~~~~~~~~~~~~~~fGG~~  453 (470)
T PLN02193        387 IAMDPLAHVGPGQLTDGTFALDTETLQWERLDKFGEEEETPSSRGWTASTTGTIDGKKGLVMHGGKA  453 (470)
T ss_pred             cCCccccccCccceeccEEEEEcCcCEEEEcccCCCCCCCCCCCccccceeeEEcCCceEEEEcCCC
Confidence            31             479999999999999975431    22444442  2222  34599999974


No 8  
>TIGR03548 mutarot_permut cyclically-permuted mutatrotase family protein. Members of this protein family show essentially full-length homology, cyclically permuted, to YjhT from Escherichia coli. YjhT was shown to act as a mutarotase for sialic acid, and by this ability to be able to act as a virulence factor. Members of the YjhT family (TIGR03547) and this cyclically-permuted family have multiple repeats of the beta-propeller-forming Kelch repeat.
Probab=100.00  E-value=1.2e-35  Score=265.72  Aligned_cols=240  Identities=14%  Similarity=0.188  Sum_probs=192.3

Q ss_pred             CCCEEEEEEecCCCCc-----------eeEEEec-CCCC-CCCeeEeccCCCcccCCcceEEEEECCEEEEEeCCCCCCC
Q 045821           67 DETWIYAFCRDNKLER-----------VCCYVLD-PNST-RRSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWSED  133 (352)
Q Consensus        67 ~~~~l~~~gg~~~~~~-----------~~~~~~d-~~~~-~~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~  133 (352)
                      .++.||++||.+..+.           ..++.|+ +  . +.+|..+++||   .+|..++++++++.||++||.+.. .
T Consensus        12 ~~~~l~v~GG~~~~~~~~~~~g~~~~~~~v~~~~~~--~~~~~W~~~~~lp---~~r~~~~~~~~~~~lyviGG~~~~-~   85 (323)
T TIGR03548        12 IGDYILVAGGCNFPEDPLAEGGKKKNYKGIYIAKDE--NSNLKWVKDGQLP---YEAAYGASVSVENGIYYIGGSNSS-E   85 (323)
T ss_pred             ECCEEEEeeccCCCCCchhhCCcEEeeeeeEEEecC--CCceeEEEcccCC---ccccceEEEEECCEEEEEcCCCCC-C
Confidence            4899999999864321           2466664 3  3 34799999998   678778888999999999998654 4


Q ss_pred             CCCeEEEEECCCCee----EecCCCCCcceeeeeEEECCEEEEEeCCCCC---CceEEEeCCCCcEEEecCCCC-cccce
Q 045821          134 ATSEVYCYDASMNTW----TDAAPMSTARCYFPCGVLNQKIYCIGGLGDT---HSWDVYDPRTNNWKLHTEPNI-FTEIE  205 (352)
Q Consensus       134 ~~~~~~~~d~~t~~W----~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~---~~v~~yd~~t~~W~~~~~~~~-~~~~~  205 (352)
                      ..+++++||+.+++|    ..++++|.+|..|++++++++||++||....   +++++||+.+++|+.+++++. ++..+
T Consensus        86 ~~~~v~~~d~~~~~w~~~~~~~~~lp~~~~~~~~~~~~~~iYv~GG~~~~~~~~~v~~yd~~~~~W~~~~~~p~~~r~~~  165 (323)
T TIGR03548        86 RFSSVYRITLDESKEELICETIGNLPFTFENGSACYKDGTLYVGGGNRNGKPSNKSYLFNLETQEWFELPDFPGEPRVQP  165 (323)
T ss_pred             CceeEEEEEEcCCceeeeeeEcCCCCcCccCceEEEECCEEEEEeCcCCCccCceEEEEcCCCCCeeECCCCCCCCCCcc
Confidence            578999999999987    7889999999999999999999999997433   889999999999999987764 56666


Q ss_pred             eEEEECCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCC----CCCCC-CcE-EEECCEEEEEecCCC----------
Q 045821          206 DSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWLHADANM----ASGWR-GPA-VVVDDALYVLDQSSG----------  269 (352)
Q Consensus       206 ~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~----~~~~~-~~~-~~~~~~l~~~gg~~~----------  269 (352)
                      .+++++++||++||.... ...++++||+.+++|+.++++.    +.... ..+ +..+++||++||.++          
T Consensus       166 ~~~~~~~~iYv~GG~~~~-~~~~~~~yd~~~~~W~~~~~~~~~~~p~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~  244 (323)
T TIGR03548       166 VCVKLQNELYVFGGGSNI-AYTDGYKYSPKKNQWQKVADPTTDSEPISLLGAASIKINESLLLCIGGFNKDVYNDAVIDL  244 (323)
T ss_pred             eEEEECCEEEEEcCCCCc-cccceEEEecCCCeeEECCCCCCCCCceeccceeEEEECCCEEEEECCcCHHHHHHHHhhh
Confidence            677899999999987642 2346899999999999997632    22222 223 345799999999752          


Q ss_pred             --------------------------CeEEEEeCCCCceEEccCCCCCcCCCCceEEEeCCEEEEEcCCce
Q 045821          270 --------------------------TKLMMWQKESREWSPVGRLSTLLTRPPCKLVAIGKTIFVIGKGCS  314 (352)
Q Consensus       270 --------------------------~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~~~i~v~gG~~~  314 (352)
                                                +.+++||+.+++|+.++.+|. ..|..++++.++++||++||..+
T Consensus       245 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~yd~~~~~W~~~~~~p~-~~r~~~~~~~~~~~iyv~GG~~~  314 (323)
T TIGR03548       245 ATMKDESLKGYKKEYFLKPPEWYNWNRKILIYNVRTGKWKSIGNSPF-FARCGAALLLTGNNIFSINGELK  314 (323)
T ss_pred             hhccchhhhhhHHHHhCCCccccCcCceEEEEECCCCeeeEcccccc-cccCchheEEECCEEEEEecccc
Confidence                                      369999999999999997764 25778889999999999999643


No 9  
>PHA03098 kelch-like protein; Provisional
Probab=100.00  E-value=6.6e-36  Score=285.42  Aligned_cols=253  Identities=17%  Similarity=0.235  Sum_probs=207.2

Q ss_pred             CEEEEEEecCCCCceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEECCCCee
Q 045821           69 TWIYAFCRDNKLERVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDASMNTW  148 (352)
Q Consensus        69 ~~l~~~gg~~~~~~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W  148 (352)
                      ..+++.||.. ........|++  .++.|..++..+    .+..++++++++.||++||........+++++||+.+++|
T Consensus       251 ~~~~~~~g~~-~~~~~~~~~~~--~~~~~~~~~~~~----~~~~~~~~~~~~~lyv~GG~~~~~~~~~~v~~yd~~~~~W  323 (534)
T PHA03098        251 SIIYIHITMS-IFTYNYITNYS--PLSEINTIIDIH----YVYCFGSVVLNNVIYFIGGMNKNNLSVNSVVSYDTKTKSW  323 (534)
T ss_pred             cceEeecccc-hhhceeeecch--hhhhcccccCcc----ccccceEEEECCEEEEECCCcCCCCeeccEEEEeCCCCee
Confidence            4455545543 22334556788  888898887654    2344688999999999999876555667999999999999


Q ss_pred             EecCCCCCcceeeeeEEECCEEEEEeCCCCC---CceEEEeCCCCcEEEecCCCCcccceeEEEECCEEEEEccCCCCC-
Q 045821          149 TDAAPMSTARCYFPCGVLNQKIYCIGGLGDT---HSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSASAAT-  224 (352)
Q Consensus       149 ~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~---~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~-  224 (352)
                      ..+++|+.+|..++++.++++||++||....   +++++||+.+++|+.++++|.++..+++++++|+||++||..... 
T Consensus       324 ~~~~~~~~~R~~~~~~~~~~~lyv~GG~~~~~~~~~v~~yd~~~~~W~~~~~lp~~r~~~~~~~~~~~iYv~GG~~~~~~  403 (534)
T PHA03098        324 NKVPELIYPRKNPGVTVFNNRIYVIGGIYNSISLNTVESWKPGESKWREEPPLIFPRYNPCVVNVNNLIYVIGGISKNDE  403 (534)
T ss_pred             eECCCCCcccccceEEEECCEEEEEeCCCCCEecceEEEEcCCCCceeeCCCcCcCCccceEEEECCEEEEECCcCCCCc
Confidence            9999999999999999999999999998744   889999999999999999999888888899999999999965432 


Q ss_pred             CceEEEEEeCCCCceeecCCCCCCCCCCcEEEECCEEEEEecCCC-------CeEEEEeCCCCceEEccCCCCCcCCCCc
Q 045821          225 SHVCALVYEPSTDSWLHADANMASGWRGPAVVVDDALYVLDQSSG-------TKLMMWQKESREWSPVGRLSTLLTRPPC  297 (352)
Q Consensus       225 ~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~-------~~v~~yd~~~~~W~~~~~~~~~~~~~~~  297 (352)
                      ....+++||+.+++|+.+++.+..+..+.++..+++||++||...       +.+++||+++++|+.++.++.  .+..+
T Consensus       404 ~~~~v~~yd~~t~~W~~~~~~p~~r~~~~~~~~~~~iyv~GG~~~~~~~~~~~~v~~yd~~~~~W~~~~~~~~--~r~~~  481 (534)
T PHA03098        404 LLKTVECFSLNTNKWSKGSPLPISHYGGCAIYHDGKIYVIGGISYIDNIKVYNIVESYNPVTNKWTELSSLNF--PRINA  481 (534)
T ss_pred             ccceEEEEeCCCCeeeecCCCCccccCceEEEECCEEEEECCccCCCCCcccceEEEecCCCCceeeCCCCCc--ccccc
Confidence            346899999999999999875555555566788999999999653       359999999999999998775  46777


Q ss_pred             eEEEeCCEEEEEcCCc------eEEEEEcCCcccccceE
Q 045821          298 KLVAIGKTIFVIGKGC------SAVVIDVGNIGNIGGIM  330 (352)
Q Consensus       298 ~~~~~~~~i~v~gG~~------~~~~~d~~~~~w~~~~~  330 (352)
                      +++.++++||++||..      .+++||+++++|.....
T Consensus       482 ~~~~~~~~iyv~GG~~~~~~~~~v~~yd~~~~~W~~~~~  520 (534)
T PHA03098        482 SLCIFNNKIYVVGGDKYEYYINEIEVYDDKTNTWTLFCK  520 (534)
T ss_pred             eEEEECCEEEEEcCCcCCcccceeEEEeCCCCEEEecCC
Confidence            7888899999999954      56789999999965433


No 10 
>TIGR03547 muta_rot_YjhT mutatrotase, YjhT family. Members of this protein family contain multiple copies of the beta-propeller-forming Kelch repeat. All are full-length homologs to YjhT of Escherichia coli, which has been identified as a mutarotase for sialic acid. This protein improves bacterial ability to obtain host sialic acid, and thus serves as a virulence factor. Some bacteria carry what appears to be a cyclically permuted homolog of this protein.
Probab=100.00  E-value=2e-35  Score=267.11  Aligned_cols=240  Identities=19%  Similarity=0.248  Sum_probs=186.3

Q ss_pred             CCCEEEEEEecCCCCceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECCEEEEEeCCCCCC-----CCCCeEEEE
Q 045821           67 DETWIYAFCRDNKLERVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWSE-----DATSEVYCY  141 (352)
Q Consensus        67 ~~~~l~~~gg~~~~~~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~-----~~~~~~~~~  141 (352)
                      .++.||++||..   ...++.||++..+++|.++++||.  .+|..+++++++++||++||.....     ...+++++|
T Consensus        16 ~~~~vyv~GG~~---~~~~~~~d~~~~~~~W~~l~~~p~--~~R~~~~~~~~~~~iYv~GG~~~~~~~~~~~~~~~v~~Y   90 (346)
T TIGR03547        16 IGDKVYVGLGSA---GTSWYKLDLKKPSKGWQKIADFPG--GPRNQAVAAAIDGKLYVFGGIGKANSEGSPQVFDDVYRY   90 (346)
T ss_pred             ECCEEEEEcccc---CCeeEEEECCCCCCCceECCCCCC--CCcccceEEEECCEEEEEeCCCCCCCCCcceecccEEEE
Confidence            389999999973   245778885336789999999873  3688899999999999999975421     246789999


Q ss_pred             ECCCCeeEecC-CCCCcceeeeeE-EECCEEEEEeCCCCC-------------------------------------Cce
Q 045821          142 DASMNTWTDAA-PMSTARCYFPCG-VLNQKIYCIGGLGDT-------------------------------------HSW  182 (352)
Q Consensus       142 d~~t~~W~~~~-~~~~~r~~~~~~-~~~~~iyv~gG~~~~-------------------------------------~~v  182 (352)
                      |+.+++|+.++ .+|.+|..++++ .++++||++||....                                     +++
T Consensus        91 d~~~~~W~~~~~~~p~~~~~~~~~~~~~g~IYviGG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v  170 (346)
T TIGR03547        91 DPKKNSWQKLDTRSPVGLLGASGFSLHNGQAYFTGGVNKNIFDGYFADLSAADKDSEPKDKLIAAYFSQPPEDYFWNKNV  170 (346)
T ss_pred             ECCCCEEecCCCCCCCcccceeEEEEeCCEEEEEcCcChHHHHHHHhhHhhcCccchhhhhhHHHHhCCChhHcCccceE
Confidence            99999999997 455666666666 689999999997521                                     579


Q ss_pred             EEEeCCCCcEEEecCCCC-cccceeEEEECCEEEEEccCCCCCC-ceEEEEE--eCCCCceeecCCCCCCCC------CC
Q 045821          183 DVYDPRTNNWKLHTEPNI-FTEIEDSFVMDGKIYIRCSASAATS-HVCALVY--EPSTDSWLHADANMASGW------RG  252 (352)
Q Consensus       183 ~~yd~~t~~W~~~~~~~~-~~~~~~~~~~~g~iyv~GG~~~~~~-~~~i~~y--d~~~~~W~~~~~~~~~~~------~~  252 (352)
                      ++||+.+++|+.+++||. ++..+++++++++||++||...... ...++.|  |+.+++|+.+++.+.++.      .+
T Consensus       171 ~~YDp~t~~W~~~~~~p~~~r~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~y~~~~~~~~W~~~~~m~~~r~~~~~~~~~  250 (346)
T TIGR03547       171 LSYDPSTNQWRNLGENPFLGTAGSAIVHKGNKLLLINGEIKPGLRTAEVKQYLFTGGKLEWNKLPPLPPPKSSSQEGLAG  250 (346)
T ss_pred             EEEECCCCceeECccCCCCcCCCceEEEECCEEEEEeeeeCCCccchheEEEEecCCCceeeecCCCCCCCCCccccccE
Confidence            999999999999998876 4566677889999999999754332 2245555  457789999977543321      11


Q ss_pred             -cEEEECCEEEEEecCCC---------------------CeEEEEeCCCCceEEccCCCCCcCCCCceEEEeCCEEEEEc
Q 045821          253 -PAVVVDDALYVLDQSSG---------------------TKLMMWQKESREWSPVGRLSTLLTRPPCKLVAIGKTIFVIG  310 (352)
Q Consensus       253 -~~~~~~~~l~~~gg~~~---------------------~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~~~i~v~g  310 (352)
                       .+++++++||++||...                     ..+.+||+++++|+.++.+|.  .+..+++++++++|||+|
T Consensus       251 ~~a~~~~~~Iyv~GG~~~~~~~~~~~~~~~~~~~~~~~~~~~e~yd~~~~~W~~~~~lp~--~~~~~~~~~~~~~iyv~G  328 (346)
T TIGR03547       251 AFAGISNGVLLVAGGANFPGAQENYKNGKLYAHEGLIKAWSSEVYALDNGKWSKVGKLPQ--GLAYGVSVSWNNGVLLIG  328 (346)
T ss_pred             EeeeEECCEEEEeecCCCCCchhhhhcCCccccCCCCceeEeeEEEecCCcccccCCCCC--CceeeEEEEcCCEEEEEe
Confidence             24678999999999641                     257899999999999999987  466677788999999999


Q ss_pred             CCc
Q 045821          311 KGC  313 (352)
Q Consensus       311 G~~  313 (352)
                      |..
T Consensus       329 G~~  331 (346)
T TIGR03547       329 GEN  331 (346)
T ss_pred             ccC
Confidence            975


No 11 
>PLN02153 epithiospecifier protein
Probab=100.00  E-value=6.9e-35  Score=262.75  Aligned_cols=235  Identities=17%  Similarity=0.282  Sum_probs=187.3

Q ss_pred             CCCCeeEeccCCC-cccCCcceEEEEECCEEEEEeCCCCC-CCCCCeEEEEECCCCeeEecCCCC-Ccc---eeeeeEEE
Q 045821           93 TRRSWKMIHELPA-RTLRRKGMGFEVLGKNAYLLGGCGWS-EDATSEVYCYDASMNTWTDAAPMS-TAR---CYFPCGVL  166 (352)
Q Consensus        93 ~~~~w~~~~~~~~-~~~~~~~~~~~~~~~~iyv~GG~~~~-~~~~~~~~~~d~~t~~W~~~~~~~-~~r---~~~~~~~~  166 (352)
                      ....|.++..... .|.+|..|++++++++||++||.... ....+++++||+.+++|..+++++ .||   ..|+++.+
T Consensus         5 ~~~~W~~~~~~~~~~P~pR~~h~~~~~~~~iyv~GG~~~~~~~~~~~~~~yd~~~~~W~~~~~~~~~p~~~~~~~~~~~~   84 (341)
T PLN02153          5 LQGGWIKVEQKGGKGPGPRCSHGIAVVGDKLYSFGGELKPNEHIDKDLYVFDFNTHTWSIAPANGDVPRISCLGVRMVAV   84 (341)
T ss_pred             cCCeEEEecCCCCCCCCCCCcceEEEECCEEEEECCccCCCCceeCcEEEEECCCCEEEEcCccCCCCCCccCceEEEEE
Confidence            6788999987321 23689999999999999999998543 234579999999999999988764 344   36788899


Q ss_pred             CCEEEEEeCCCCC---CceEEEeCCCCcEEEecCC-----CCcccceeEEEECCEEEEEccCCCCC------CceEEEEE
Q 045821          167 NQKIYCIGGLGDT---HSWDVYDPRTNNWKLHTEP-----NIFTEIEDSFVMDGKIYIRCSASAAT------SHVCALVY  232 (352)
Q Consensus       167 ~~~iyv~gG~~~~---~~v~~yd~~t~~W~~~~~~-----~~~~~~~~~~~~~g~iyv~GG~~~~~------~~~~i~~y  232 (352)
                      +++||++||....   +++++||+.+++|+.++++     |.++..+++++++++|||+||.....      ...++++|
T Consensus        85 ~~~iyv~GG~~~~~~~~~v~~yd~~t~~W~~~~~~~~~~~p~~R~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~v~~y  164 (341)
T PLN02153         85 GTKLYIFGGRDEKREFSDFYSYDTVKNEWTFLTKLDEEGGPEARTFHSMASDENHVYVFGGVSKGGLMKTPERFRTIEAY  164 (341)
T ss_pred             CCEEEEECCCCCCCccCcEEEEECCCCEEEEeccCCCCCCCCCceeeEEEEECCEEEEECCccCCCccCCCcccceEEEE
Confidence            9999999997654   7899999999999998866     66777888889999999999986432      23479999


Q ss_pred             eCCCCceeecCCCC---CCCCCCcEEEECCEEEEEecCC------------CCeEEEEeCCCCceEEccCCC-CCcCCCC
Q 045821          233 EPSTDSWLHADANM---ASGWRGPAVVVDDALYVLDQSS------------GTKLMMWQKESREWSPVGRLS-TLLTRPP  296 (352)
Q Consensus       233 d~~~~~W~~~~~~~---~~~~~~~~~~~~~~l~~~gg~~------------~~~v~~yd~~~~~W~~~~~~~-~~~~~~~  296 (352)
                      |+.+++|+.++...   ..+..+.+++++++||++||..            .+++++||+++++|+++.... .+..|..
T Consensus       165 d~~~~~W~~l~~~~~~~~~r~~~~~~~~~~~iyv~GG~~~~~~~gG~~~~~~~~v~~yd~~~~~W~~~~~~g~~P~~r~~  244 (341)
T PLN02153        165 NIADGKWVQLPDPGENFEKRGGAGFAVVQGKIWVVYGFATSILPGGKSDYESNAVQFFDPASGKWTEVETTGAKPSARSV  244 (341)
T ss_pred             ECCCCeEeeCCCCCCCCCCCCcceEEEECCeEEEEeccccccccCCccceecCceEEEEcCCCcEEeccccCCCCCCcce
Confidence            99999999987632   3333444577899999998853            147999999999999997532 1225777


Q ss_pred             ceEEEeCCEEEEEcCCc---------------eEEEEEcCCccccc
Q 045821          297 CKLVAIGKTIFVIGKGC---------------SAVVIDVGNIGNIG  327 (352)
Q Consensus       297 ~~~~~~~~~i~v~gG~~---------------~~~~~d~~~~~w~~  327 (352)
                      ++++.++++|||+||..               +++.||+++++|.+
T Consensus       245 ~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~n~v~~~d~~~~~W~~  290 (341)
T PLN02153        245 FAHAVVGKYIIIFGGEVWPDLKGHLGPGTLSNEGYALDTETLVWEK  290 (341)
T ss_pred             eeeEEECCEEEEECcccCCccccccccccccccEEEEEcCccEEEe
Confidence            88889999999999952               56789999999975


No 12 
>PRK14131 N-acetylneuraminic acid mutarotase; Provisional
Probab=100.00  E-value=1.2e-34  Score=263.95  Aligned_cols=239  Identities=19%  Similarity=0.262  Sum_probs=184.6

Q ss_pred             CCEEEEEEecCCCCceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECCEEEEEeCCCC-C----CCCCCeEEEEE
Q 045821           68 ETWIYAFCRDNKLERVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGW-S----EDATSEVYCYD  142 (352)
Q Consensus        68 ~~~l~~~gg~~~~~~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~-~----~~~~~~~~~~d  142 (352)
                      ++.||++||...   ..++.||.+..++.|.+++++|.  .+|..++++.++++||++||... .    ....+++++||
T Consensus        38 ~~~iyv~gG~~~---~~~~~~d~~~~~~~W~~l~~~p~--~~r~~~~~v~~~~~IYV~GG~~~~~~~~~~~~~~~v~~YD  112 (376)
T PRK14131         38 NNTVYVGLGSAG---TSWYKLDLNAPSKGWTKIAAFPG--GPREQAVAAFIDGKLYVFGGIGKTNSEGSPQVFDDVYKYD  112 (376)
T ss_pred             CCEEEEEeCCCC---CeEEEEECCCCCCCeEECCcCCC--CCcccceEEEECCEEEEEcCCCCCCCCCceeEcccEEEEe
Confidence            899999998632   34778887224589999998873  36888899999999999999764 1    12367899999


Q ss_pred             CCCCeeEecCC-CCCcceeeeeEE-ECCEEEEEeCCCC-------------------------------------CCceE
Q 045821          143 ASMNTWTDAAP-MSTARCYFPCGV-LNQKIYCIGGLGD-------------------------------------THSWD  183 (352)
Q Consensus       143 ~~t~~W~~~~~-~~~~r~~~~~~~-~~~~iyv~gG~~~-------------------------------------~~~v~  183 (352)
                      +.+++|+.+++ +|.++..|+++. .+++||++||...                                     .++++
T Consensus       113 ~~~n~W~~~~~~~p~~~~~~~~~~~~~~~IYv~GG~~~~~~~~~~~d~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~v~  192 (376)
T PRK14131        113 PKTNSWQKLDTRSPVGLAGHVAVSLHNGKAYITGGVNKNIFDGYFEDLAAAGKDKTPKDKINDAYFDKKPEDYFFNKEVL  192 (376)
T ss_pred             CCCCEEEeCCCCCCCcccceEEEEeeCCEEEEECCCCHHHHHHHHhhhhhcccchhhhhhhHHHHhcCChhhcCcCceEE
Confidence            99999999985 456667777766 7999999999752                                     15799


Q ss_pred             EEeCCCCcEEEecCCCC-cccceeEEEECCEEEEEccCCCCC-Cce--EEEEEeCCCCceeecCCCCCCCC-------CC
Q 045821          184 VYDPRTNNWKLHTEPNI-FTEIEDSFVMDGKIYIRCSASAAT-SHV--CALVYEPSTDSWLHADANMASGW-------RG  252 (352)
Q Consensus       184 ~yd~~t~~W~~~~~~~~-~~~~~~~~~~~g~iyv~GG~~~~~-~~~--~i~~yd~~~~~W~~~~~~~~~~~-------~~  252 (352)
                      +||+.+++|+.++++|. ++..++++.++++||++||....+ ...  ..+.||+.+++|+.+++.+.++.       .+
T Consensus       193 ~YD~~t~~W~~~~~~p~~~~~~~a~v~~~~~iYv~GG~~~~~~~~~~~~~~~~~~~~~~W~~~~~~p~~~~~~~~~~~~~  272 (376)
T PRK14131        193 SYDPSTNQWKNAGESPFLGTAGSAVVIKGNKLWLINGEIKPGLRTDAVKQGKFTGNNLKWQKLPDLPPAPGGSSQEGVAG  272 (376)
T ss_pred             EEECCCCeeeECCcCCCCCCCcceEEEECCEEEEEeeeECCCcCChhheEEEecCCCcceeecCCCCCCCcCCcCCccce
Confidence            99999999999988876 455666778899999999865332 222  34456788999999976543321       11


Q ss_pred             c-EEEECCEEEEEecCCC---------------------CeEEEEeCCCCceEEccCCCCCcCCCCceEEEeCCEEEEEc
Q 045821          253 P-AVVVDDALYVLDQSSG---------------------TKLMMWQKESREWSPVGRLSTLLTRPPCKLVAIGKTIFVIG  310 (352)
Q Consensus       253 ~-~~~~~~~l~~~gg~~~---------------------~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~~~i~v~g  310 (352)
                      . +++.+++||++||...                     ..+.+||+++++|+.++.+|.  .+..++++.++++|||+|
T Consensus       273 ~~a~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~~~~~~~~~~e~yd~~~~~W~~~~~lp~--~r~~~~av~~~~~iyv~G  350 (376)
T PRK14131        273 AFAGYSNGVLLVAGGANFPGARENYQNGKLYAHEGLKKSWSDEIYALVNGKWQKVGELPQ--GLAYGVSVSWNNGVLLIG  350 (376)
T ss_pred             EeceeECCEEEEeeccCCCCChhhhhcCCcccccCCcceeehheEEecCCcccccCcCCC--CccceEEEEeCCEEEEEc
Confidence            1 4568999999999641                     135689999999999998887  466677888999999999


Q ss_pred             CCc
Q 045821          311 KGC  313 (352)
Q Consensus       311 G~~  313 (352)
                      |..
T Consensus       351 G~~  353 (376)
T PRK14131        351 GET  353 (376)
T ss_pred             CCC
Confidence            964


No 13 
>PHA02790 Kelch-like protein; Provisional
Probab=100.00  E-value=2.1e-34  Score=269.51  Aligned_cols=203  Identities=17%  Similarity=0.276  Sum_probs=178.1

Q ss_pred             EEEECCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcceeeeeEEECCEEEEEeCCCCCCceEEEeCCCCcEEE
Q 045821          115 FEVLGKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTNNWKL  194 (352)
Q Consensus       115 ~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~W~~  194 (352)
                      .+..++.||++||.+.. ...+++++|||.+++|..+++|+.+|..++++.++++||++||.+...++++||+.+++|+.
T Consensus       267 ~~~~~~~lyviGG~~~~-~~~~~v~~Ydp~~~~W~~~~~m~~~r~~~~~v~~~~~iYviGG~~~~~sve~ydp~~n~W~~  345 (480)
T PHA02790        267 STHVGEVVYLIGGWMNN-EIHNNAIAVNYISNNWIPIPPMNSPRLYASGVPANNKLYVVGGLPNPTSVERWFHGDAAWVN  345 (480)
T ss_pred             eEEECCEEEEEcCCCCC-CcCCeEEEEECCCCEEEECCCCCchhhcceEEEECCEEEEECCcCCCCceEEEECCCCeEEE
Confidence            35589999999998653 45788999999999999999999999999999999999999997654789999999999999


Q ss_pred             ecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEEECCEEEEEecCCCCeEEE
Q 045821          195 HTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVVVDDALYVLDQSSGTKLMM  274 (352)
Q Consensus       195 ~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~  274 (352)
                      +++|+.++..+++++++|+||++||....  ...+++||+.+++|+.+++++.++..+.+++++|+||++||    .+.+
T Consensus       346 ~~~l~~~r~~~~~~~~~g~IYviGG~~~~--~~~ve~ydp~~~~W~~~~~m~~~r~~~~~~~~~~~IYv~GG----~~e~  419 (480)
T PHA02790        346 MPSLLKPRCNPAVASINNVIYVIGGHSET--DTTTEYLLPNHDQWQFGPSTYYPHYKSCALVFGRRLFLVGR----NAEF  419 (480)
T ss_pred             CCCCCCCCcccEEEEECCEEEEecCcCCC--CccEEEEeCCCCEEEeCCCCCCccccceEEEECCEEEEECC----ceEE
Confidence            99999999888899999999999997532  35789999999999999876655555566789999999998    5789


Q ss_pred             EeCCCCceEEccCCCCCcCCCCceEEEeCCEEEEEcCCc------eEEEEEcCCcccc
Q 045821          275 WQKESREWSPVGRLSTLLTRPPCKLVAIGKTIFVIGKGC------SAVVIDVGNIGNI  326 (352)
Q Consensus       275 yd~~~~~W~~~~~~~~~~~~~~~~~~~~~~~i~v~gG~~------~~~~~d~~~~~w~  326 (352)
                      ||+++++|+.++.++.  .|..+++++++++||++||.+      .+..||+++++|.
T Consensus       420 ydp~~~~W~~~~~m~~--~r~~~~~~v~~~~IYviGG~~~~~~~~~ve~Yd~~~~~W~  475 (480)
T PHA02790        420 YCESSNTWTLIDDPIY--PRDNPELIIVDNKLLLIGGFYRGSYIDTIEVYNNRTYSWN  475 (480)
T ss_pred             ecCCCCcEeEcCCCCC--CccccEEEEECCEEEEECCcCCCcccceEEEEECCCCeEE
Confidence            9999999999998876  588889999999999999953      4668999999995


No 14 
>PHA03098 kelch-like protein; Provisional
Probab=100.00  E-value=4.8e-34  Score=272.60  Aligned_cols=217  Identities=20%  Similarity=0.340  Sum_probs=188.2

Q ss_pred             CCEEEEEEecCCCCc--eeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEECCC
Q 045821           68 ETWIYAFCRDNKLER--VCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDASM  145 (352)
Q Consensus        68 ~~~l~~~gg~~~~~~--~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t  145 (352)
                      ++.||++||.+....  ...+.||+  .+++|..+++++   .+|.++++++++++||++||.+.. ...+++++||+.+
T Consensus       294 ~~~lyv~GG~~~~~~~~~~v~~yd~--~~~~W~~~~~~~---~~R~~~~~~~~~~~lyv~GG~~~~-~~~~~v~~yd~~~  367 (534)
T PHA03098        294 NNVIYFIGGMNKNNLSVNSVVSYDT--KTKSWNKVPELI---YPRKNPGVTVFNNRIYVIGGIYNS-ISLNTVESWKPGE  367 (534)
T ss_pred             CCEEEEECCCcCCCCeeccEEEEeC--CCCeeeECCCCC---cccccceEEEECCEEEEEeCCCCC-EecceEEEEcCCC
Confidence            899999999754332  36889999  999999999988   678999999999999999998743 5578899999999


Q ss_pred             CeeEecCCCCCcceeeeeEEECCEEEEEeCCCCC----CceEEEeCCCCcEEEecCCCCcccceeEEEECCEEEEEccCC
Q 045821          146 NTWTDAAPMSTARCYFPCGVLNQKIYCIGGLGDT----HSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSAS  221 (352)
Q Consensus       146 ~~W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~----~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~  221 (352)
                      ++|+.++++|.||..++++.++++||++||....    +++++||+.+++|+.++++|.++..+++++++++||++||..
T Consensus       368 ~~W~~~~~lp~~r~~~~~~~~~~~iYv~GG~~~~~~~~~~v~~yd~~t~~W~~~~~~p~~r~~~~~~~~~~~iyv~GG~~  447 (534)
T PHA03098        368 SKWREEPPLIFPRYNPCVVNVNNLIYVIGGISKNDELLKTVECFSLNTNKWSKGSPLPISHYGGCAIYHDGKIYVIGGIS  447 (534)
T ss_pred             CceeeCCCcCcCCccceEEEECCEEEEECCcCCCCcccceEEEEeCCCCeeeecCCCCccccCceEEEECCEEEEECCcc
Confidence            9999999999999999999999999999996432    789999999999999999988888888899999999999876


Q ss_pred             CCCC---ceEEEEEeCCCCceeecCCCCCCCCCCcEEEECCEEEEEecCCC----CeEEEEeCCCCceEEccCCCC
Q 045821          222 AATS---HVCALVYEPSTDSWLHADANMASGWRGPAVVVDDALYVLDQSSG----TKLMMWQKESREWSPVGRLST  290 (352)
Q Consensus       222 ~~~~---~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~----~~v~~yd~~~~~W~~~~~~~~  290 (352)
                      ....   ...+++||+.+++|+.++..+.++....++..+++||++||..+    +.+++||+++++|+.++.+|+
T Consensus       448 ~~~~~~~~~~v~~yd~~~~~W~~~~~~~~~r~~~~~~~~~~~iyv~GG~~~~~~~~~v~~yd~~~~~W~~~~~~p~  523 (534)
T PHA03098        448 YIDNIKVYNIVESYNPVTNKWTELSSLNFPRINASLCIFNNKIYVVGGDKYEYYINEIEVYDDKTNTWTLFCKFPK  523 (534)
T ss_pred             CCCCCcccceEEEecCCCCceeeCCCCCcccccceEEEECCEEEEEcCCcCCcccceeEEEeCCCCEEEecCCCcc
Confidence            4332   34799999999999999875555555556778999999999763    589999999999999998776


No 15 
>PHA02790 Kelch-like protein; Provisional
Probab=100.00  E-value=1e-33  Score=264.92  Aligned_cols=203  Identities=15%  Similarity=0.251  Sum_probs=177.3

Q ss_pred             CCCEEEEEEecCCCC-ceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEECCC
Q 045821           67 DETWIYAFCRDNKLE-RVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDASM  145 (352)
Q Consensus        67 ~~~~l~~~gg~~~~~-~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t  145 (352)
                      .++.||++||.+... ....+.|||  .+++|..+++|+   .+|..++++.++++||++||.+.    .+++++||+.+
T Consensus       270 ~~~~lyviGG~~~~~~~~~v~~Ydp--~~~~W~~~~~m~---~~r~~~~~v~~~~~iYviGG~~~----~~sve~ydp~~  340 (480)
T PHA02790        270 VGEVVYLIGGWMNNEIHNNAIAVNY--ISNNWIPIPPMN---SPRLYASGVPANNKLYVVGGLPN----PTSVERWFHGD  340 (480)
T ss_pred             ECCEEEEEcCCCCCCcCCeEEEEEC--CCCEEEECCCCC---chhhcceEEEECCEEEEECCcCC----CCceEEEECCC
Confidence            488999999975433 345788999  999999999998   67888999999999999999753    25689999999


Q ss_pred             CeeEecCCCCCcceeeeeEEECCEEEEEeCCCCC-CceEEEeCCCCcEEEecCCCCcccceeEEEECCEEEEEccCCCCC
Q 045821          146 NTWTDAAPMSTARCYFPCGVLNQKIYCIGGLGDT-HSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSASAAT  224 (352)
Q Consensus       146 ~~W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~-~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~  224 (352)
                      ++|..+++||.+|..+++++++++||++||.... +.+++|||.+++|+.+++|+.++..+++++++|+||++||.    
T Consensus       341 n~W~~~~~l~~~r~~~~~~~~~g~IYviGG~~~~~~~ve~ydp~~~~W~~~~~m~~~r~~~~~~~~~~~IYv~GG~----  416 (480)
T PHA02790        341 AAWVNMPSLLKPRCNPAVASINNVIYVIGGHSETDTTTEYLLPNHDQWQFGPSTYYPHYKSCALVFGRRLFLVGRN----  416 (480)
T ss_pred             CeEEECCCCCCCCcccEEEEECCEEEEecCcCCCCccEEEEeCCCCEEEeCCCCCCccccceEEEECCEEEEECCc----
Confidence            9999999999999999999999999999998654 78999999999999999999988888889999999999863    


Q ss_pred             CceEEEEEeCCCCceeecCCCCCCCCCCcEEEECCEEEEEecCCC----CeEEEEeCCCCceEEcc
Q 045821          225 SHVCALVYEPSTDSWLHADANMASGWRGPAVVVDDALYVLDQSSG----TKLMMWQKESREWSPVG  286 (352)
Q Consensus       225 ~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~----~~v~~yd~~~~~W~~~~  286 (352)
                          +++||+.+++|+.+++.+.++....+++++|+||++||..+    +.+.+||+++++|+...
T Consensus       417 ----~e~ydp~~~~W~~~~~m~~~r~~~~~~v~~~~IYviGG~~~~~~~~~ve~Yd~~~~~W~~~~  478 (480)
T PHA02790        417 ----AEFYCESSNTWTLIDDPIYPRDNPELIIVDNKLLLIGGFYRGSYIDTIEVYNNRTYSWNIWD  478 (480)
T ss_pred             ----eEEecCCCCcEeEcCCCCCCccccEEEEECCEEEEECCcCCCcccceEEEEECCCCeEEecC
Confidence                57899999999999876555656667889999999999753    47999999999998654


No 16 
>KOG4693 consensus Uncharacterized conserved protein, contains kelch repeat [General function prediction only]
Probab=100.00  E-value=1.4e-33  Score=228.91  Aligned_cols=251  Identities=20%  Similarity=0.328  Sum_probs=202.7

Q ss_pred             HhhCCCC--CCEEEEEEec------CCCCceeEEEecCCCCCCCeeEeccCC----------CcccCCcceEEEEECCEE
Q 045821           61 RRKHNLD--ETWIYAFCRD------NKLERVCCYVLDPNSTRRSWKMIHELP----------ARTLRRKGMGFEVLGKNA  122 (352)
Q Consensus        61 ~~~~~~~--~~~l~~~gg~------~~~~~~~~~~~d~~~~~~~w~~~~~~~----------~~~~~~~~~~~~~~~~~i  122 (352)
                      |..|+..  +++||-+||.      .........++|.  .+-+|.++++--          .-|-.|++|+++...+++
T Consensus        14 RVNHAavaVG~riYSFGGYCsGedy~~~~piDVH~lNa--~~~RWtk~pp~~~ka~i~~~yp~VPyqRYGHtvV~y~d~~   91 (392)
T KOG4693|consen   14 RVNHAAVAVGSRIYSFGGYCSGEDYDAKDPIDVHVLNA--ENYRWTKMPPGITKATIESPYPAVPYQRYGHTVVEYQDKA   91 (392)
T ss_pred             cccceeeeecceEEecCCcccccccccCCcceeEEeec--cceeEEecCcccccccccCCCCccchhhcCceEEEEcceE
Confidence            4444433  8999999985      1233567889999  999999998711          112458999999999999


Q ss_pred             EEEeCCCCCCCCCCeEEEEECCCCeeEecC---CCCCcceeeeeEEECCEEEEEeCCCCC-----CceEEEeCCCCcEEE
Q 045821          123 YLLGGCGWSEDATSEVYCYDASMNTWTDAA---PMSTARCYFPCGVLNQKIYCIGGLGDT-----HSWDVYDPRTNNWKL  194 (352)
Q Consensus       123 yv~GG~~~~~~~~~~~~~~d~~t~~W~~~~---~~~~~r~~~~~~~~~~~iyv~gG~~~~-----~~v~~yd~~t~~W~~  194 (352)
                      |+.||.+.....-+.++.|||.+++|....   .+|.+|-+|+++++++.+|++||+..+     +++..+|..|.+|+.
T Consensus        92 yvWGGRND~egaCN~Ly~fDp~t~~W~~p~v~G~vPgaRDGHsAcV~gn~MyiFGGye~~a~~FS~d~h~ld~~TmtWr~  171 (392)
T KOG4693|consen   92 YVWGGRNDDEGACNLLYEFDPETNVWKKPEVEGFVPGARDGHSACVWGNQMYIFGGYEEDAQRFSQDTHVLDFATMTWRE  171 (392)
T ss_pred             EEEcCccCcccccceeeeeccccccccccceeeecCCccCCceeeEECcEEEEecChHHHHHhhhccceeEeccceeeee
Confidence            999999887677788999999999998653   468899999999999999999998765     899999999999999


Q ss_pred             ecC---CCCcccceeEEEECCEEEEEccCCCCCCc---------eEEEEEeCCCCceeecCC---CCCCCCCCcEEEECC
Q 045821          195 HTE---PNIFTEIEDSFVMDGKIYIRCSASAATSH---------VCALVYEPSTDSWLHADA---NMASGWRGPAVVVDD  259 (352)
Q Consensus       195 ~~~---~~~~~~~~~~~~~~g~iyv~GG~~~~~~~---------~~i~~yd~~~~~W~~~~~---~~~~~~~~~~~~~~~  259 (352)
                      +..   +|.=+..++++++++.+|++||.......         ..|..+|..++.|...+.   .+..++++++.+.++
T Consensus       172 ~~Tkg~PprwRDFH~a~~~~~~MYiFGGR~D~~gpfHs~~e~Yc~~i~~ld~~T~aW~r~p~~~~~P~GRRSHS~fvYng  251 (392)
T KOG4693|consen  172 MHTKGDPPRWRDFHTASVIDGMMYIFGGRSDESGPFHSIHEQYCDTIMALDLATGAWTRTPENTMKPGGRRSHSTFVYNG  251 (392)
T ss_pred             hhccCCCchhhhhhhhhhccceEEEeccccccCCCccchhhhhcceeEEEeccccccccCCCCCcCCCcccccceEEEcc
Confidence            863   55667778889999999999987643221         278899999999998754   344566777789999


Q ss_pred             EEEEEecCCC------CeEEEEeCCCCceEEccCCC-CCcCCCCceEEEeCCEEEEEcCCc
Q 045821          260 ALYVLDQSSG------TKLMMWQKESREWSPVGRLS-TLLTRPPCKLVAIGKTIFVIGKGC  313 (352)
Q Consensus       260 ~l~~~gg~~~------~~v~~yd~~~~~W~~~~~~~-~~~~~~~~~~~~~~~~i~v~gG~~  313 (352)
                      ++|++||+++      ++++.|||.+..|+.+..-. .+..|+..+.++.++|+|+|||..
T Consensus       252 ~~Y~FGGYng~ln~HfndLy~FdP~t~~W~~I~~~Gk~P~aRRRqC~~v~g~kv~LFGGTs  312 (392)
T KOG4693|consen  252 KMYMFGGYNGTLNVHFNDLYCFDPKTSMWSVISVRGKYPSARRRQCSVVSGGKVYLFGGTS  312 (392)
T ss_pred             eEEEecccchhhhhhhcceeecccccchheeeeccCCCCCcccceeEEEECCEEEEecCCC
Confidence            9999999987      79999999999999886321 112466667777899999999975


No 17 
>TIGR03548 mutarot_permut cyclically-permuted mutatrotase family protein. Members of this protein family show essentially full-length homology, cyclically permuted, to YjhT from Escherichia coli. YjhT was shown to act as a mutarotase for sialic acid, and by this ability to be able to act as a virulence factor. Members of the YjhT family (TIGR03547) and this cyclically-permuted family have multiple repeats of the beta-propeller-forming Kelch repeat.
Probab=100.00  E-value=6.1e-32  Score=241.90  Aligned_cols=222  Identities=15%  Similarity=0.205  Sum_probs=176.5

Q ss_pred             CCcceEEEEECCEEEEEeCCCCCC---------CCCCeEEEEE-CCC-CeeEecCCCCCcceeeeeEEECCEEEEEeCCC
Q 045821          109 RRKGMGFEVLGKNAYLLGGCGWSE---------DATSEVYCYD-ASM-NTWTDAAPMSTARCYFPCGVLNQKIYCIGGLG  177 (352)
Q Consensus       109 ~~~~~~~~~~~~~iyv~GG~~~~~---------~~~~~~~~~d-~~t-~~W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~  177 (352)
                      .++++.++++++.||++||.+...         ...+++++|+ +.. .+|..+++||.+|..++++.++++||++||..
T Consensus         3 ~~~g~~~~~~~~~l~v~GG~~~~~~~~~~~g~~~~~~~v~~~~~~~~~~~W~~~~~lp~~r~~~~~~~~~~~lyviGG~~   82 (323)
T TIGR03548         3 GVAGCYAGIIGDYILVAGGCNFPEDPLAEGGKKKNYKGIYIAKDENSNLKWVKDGQLPYEAAYGASVSVENGIYYIGGSN   82 (323)
T ss_pred             ceeeEeeeEECCEEEEeeccCCCCCchhhCCcEEeeeeeEEEecCCCceeEEEcccCCccccceEEEEECCEEEEEcCCC
Confidence            466788899999999999986542         1345788885 332 37999999999999888899999999999976


Q ss_pred             CC---CceEEEeCCCCcE----EEecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCCC-CC
Q 045821          178 DT---HSWDVYDPRTNNW----KLHTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWLHADANMA-SG  249 (352)
Q Consensus       178 ~~---~~v~~yd~~t~~W----~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~-~~  249 (352)
                      ..   +++++||+.+++|    +.++++|.++..+++++++++||++||.......+++++||+.+++|+++++.+. .+
T Consensus        83 ~~~~~~~v~~~d~~~~~w~~~~~~~~~lp~~~~~~~~~~~~~~iYv~GG~~~~~~~~~v~~yd~~~~~W~~~~~~p~~~r  162 (323)
T TIGR03548        83 SSERFSSVYRITLDESKEELICETIGNLPFTFENGSACYKDGTLYVGGGNRNGKPSNKSYLFNLETQEWFELPDFPGEPR  162 (323)
T ss_pred             CCCCceeEEEEEEcCCceeeeeeEcCCCCcCccCceEEEECCEEEEEeCcCCCccCceEEEEcCCCCCeeECCCCCCCCC
Confidence            54   8899999999988    7788888888888889999999999997655556799999999999999976443 33


Q ss_pred             CCCcEEEECCEEEEEecCCC---CeEEEEeCCCCceEEccCCCC---CcCC-CCceEEEeCCEEEEEcCCc---------
Q 045821          250 WRGPAVVVDDALYVLDQSSG---TKLMMWQKESREWSPVGRLST---LLTR-PPCKLVAIGKTIFVIGKGC---------  313 (352)
Q Consensus       250 ~~~~~~~~~~~l~~~gg~~~---~~v~~yd~~~~~W~~~~~~~~---~~~~-~~~~~~~~~~~i~v~gG~~---------  313 (352)
                      ..+.+++.+++||++||..+   .++++||+++++|+.++.++.   +..+ ..++++..+++||++||.+         
T Consensus       163 ~~~~~~~~~~~iYv~GG~~~~~~~~~~~yd~~~~~W~~~~~~~~~~~p~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~  242 (323)
T TIGR03548       163 VQPVCVKLQNELYVFGGGSNIAYTDGYKYSPKKNQWQKVADPTTDSEPISLLGAASIKINESLLLCIGGFNKDVYNDAVI  242 (323)
T ss_pred             CcceEEEECCEEEEEcCCCCccccceEEEecCCCeeEECCCCCCCCCceeccceeEEEECCCEEEEECCcCHHHHHHHHh
Confidence            34445678999999999764   468999999999999987642   1122 2334455689999999964         


Q ss_pred             -----------------------------eEEEEEcCCcccccceE
Q 045821          314 -----------------------------SAVVIDVGNIGNIGGIM  330 (352)
Q Consensus       314 -----------------------------~~~~~d~~~~~w~~~~~  330 (352)
                                                   ++++||+++++|.....
T Consensus       243 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~yd~~~~~W~~~~~  288 (323)
T TIGR03548       243 DLATMKDESLKGYKKEYFLKPPEWYNWNRKILIYNVRTGKWKSIGN  288 (323)
T ss_pred             hhhhccchhhhhhHHHHhCCCccccCcCceEEEEECCCCeeeEccc
Confidence                                         46789999999977543


No 18 
>TIGR03547 muta_rot_YjhT mutatrotase, YjhT family. Members of this protein family contain multiple copies of the beta-propeller-forming Kelch repeat. All are full-length homologs to YjhT of Escherichia coli, which has been identified as a mutarotase for sialic acid. This protein improves bacterial ability to obtain host sialic acid, and thus serves as a virulence factor. Some bacteria carry what appears to be a cyclically permuted homolog of this protein.
Probab=100.00  E-value=8.8e-32  Score=243.37  Aligned_cols=223  Identities=18%  Similarity=0.264  Sum_probs=171.0

Q ss_pred             ccCCCcccCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEEC--CCCeeEecCCCC-CcceeeeeEEECCEEEEEeCCC
Q 045821          101 HELPARTLRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDA--SMNTWTDAAPMS-TARCYFPCGVLNQKIYCIGGLG  177 (352)
Q Consensus       101 ~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~--~t~~W~~~~~~~-~~r~~~~~~~~~~~iyv~gG~~  177 (352)
                      ++||   .++...++++++++|||+||..     .+++++||+  .+++|..+++|| .+|..++++.++++|||+||..
T Consensus         2 ~~lp---~~~~~~~~~~~~~~vyv~GG~~-----~~~~~~~d~~~~~~~W~~l~~~p~~~R~~~~~~~~~~~iYv~GG~~   73 (346)
T TIGR03547         2 PDLP---VGFKNGTGAIIGDKVYVGLGSA-----GTSWYKLDLKKPSKGWQKIADFPGGPRNQAVAAAIDGKLYVFGGIG   73 (346)
T ss_pred             CCCC---ccccCceEEEECCEEEEEcccc-----CCeeEEEECCCCCCCceECCCCCCCCcccceEEEECCEEEEEeCCC
Confidence            5566   5677777889999999999963     267899996  578899999998 5899999999999999999975


Q ss_pred             C--------C-CceEEEeCCCCcEEEecC-CCCcccceeEE-EECCEEEEEccCCCCC----------------------
Q 045821          178 D--------T-HSWDVYDPRTNNWKLHTE-PNIFTEIEDSF-VMDGKIYIRCSASAAT----------------------  224 (352)
Q Consensus       178 ~--------~-~~v~~yd~~t~~W~~~~~-~~~~~~~~~~~-~~~g~iyv~GG~~~~~----------------------  224 (352)
                      .        . +++++||+.+++|++++. ++..+..+.++ +++++||++||.....                      
T Consensus        74 ~~~~~~~~~~~~~v~~Yd~~~~~W~~~~~~~p~~~~~~~~~~~~~g~IYviGG~~~~~~~~~~~~~~~~~~~~~~~~~~~  153 (346)
T TIGR03547        74 KANSEGSPQVFDDVYRYDPKKNSWQKLDTRSPVGLLGASGFSLHNGQAYFTGGVNKNIFDGYFADLSAADKDSEPKDKLI  153 (346)
T ss_pred             CCCCCCcceecccEEEEECCCCEEecCCCCCCCcccceeEEEEeCCEEEEEcCcChHHHHHHHhhHhhcCccchhhhhhH
Confidence            2        1 689999999999999974 34444445455 6899999999976321                      


Q ss_pred             ------------CceEEEEEeCCCCceeecCCCCCC-CCCCcEEEECCEEEEEecCCC-----CeEEEEe--CCCCceEE
Q 045821          225 ------------SHVCALVYEPSTDSWLHADANMAS-GWRGPAVVVDDALYVLDQSSG-----TKLMMWQ--KESREWSP  284 (352)
Q Consensus       225 ------------~~~~i~~yd~~~~~W~~~~~~~~~-~~~~~~~~~~~~l~~~gg~~~-----~~v~~yd--~~~~~W~~  284 (352)
                                  ....+++||+.+++|+.+++.+.. +....++..+++||++||...     ..++.||  +++++|++
T Consensus       154 ~~~~~~~~~~~~~~~~v~~YDp~t~~W~~~~~~p~~~r~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~y~~~~~~~~W~~  233 (346)
T TIGR03547       154 AAYFSQPPEDYFWNKNVLSYDPSTNQWRNLGENPFLGTAGSAIVHKGNKLLLINGEIKPGLRTAEVKQYLFTGGKLEWNK  233 (346)
T ss_pred             HHHhCCChhHcCccceEEEEECCCCceeECccCCCCcCCCceEEEECCEEEEEeeeeCCCccchheEEEEecCCCceeee
Confidence                        125799999999999999875543 334445678999999999643     2355554  57789999


Q ss_pred             ccCCCCCcC-----CCCceEEEeCCEEEEEcCCc-----------------------eEEEEEcCCcccccceEE
Q 045821          285 VGRLSTLLT-----RPPCKLVAIGKTIFVIGKGC-----------------------SAVVIDVGNIGNIGGIMV  331 (352)
Q Consensus       285 ~~~~~~~~~-----~~~~~~~~~~~~i~v~gG~~-----------------------~~~~~d~~~~~w~~~~~~  331 (352)
                      ++.+|....     +..+.++.++++||++||..                       .+.+||+++++|.....|
T Consensus       234 ~~~m~~~r~~~~~~~~~~~a~~~~~~Iyv~GG~~~~~~~~~~~~~~~~~~~~~~~~~~~e~yd~~~~~W~~~~~l  308 (346)
T TIGR03547       234 LPPLPPPKSSSQEGLAGAFAGISNGVLLVAGGANFPGAQENYKNGKLYAHEGLIKAWSSEVYALDNGKWSKVGKL  308 (346)
T ss_pred             cCCCCCCCCCccccccEEeeeEECCEEEEeecCCCCCchhhhhcCCccccCCCCceeEeeEEEecCCcccccCCC
Confidence            998876421     12334677899999999963                       356899999999876544


No 19 
>PRK14131 N-acetylneuraminic acid mutarotase; Provisional
Probab=99.98  E-value=2.2e-30  Score=236.00  Aligned_cols=230  Identities=18%  Similarity=0.243  Sum_probs=173.5

Q ss_pred             CCCeeEeccCCCcccCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEECC--CCeeEecCCCC-CcceeeeeEEECCEE
Q 045821           94 RRSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDAS--MNTWTDAAPMS-TARCYFPCGVLNQKI  170 (352)
Q Consensus        94 ~~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~--t~~W~~~~~~~-~~r~~~~~~~~~~~i  170 (352)
                      .-.++.+++||   .++..++++.++++||++||...     +.+++||+.  +++|..++++| .+|..++++.++++|
T Consensus        16 ~~~~~~l~~lP---~~~~~~~~~~~~~~iyv~gG~~~-----~~~~~~d~~~~~~~W~~l~~~p~~~r~~~~~v~~~~~I   87 (376)
T PRK14131         16 AANAEQLPDLP---VPFKNGTGAIDNNTVYVGLGSAG-----TSWYKLDLNAPSKGWTKIAAFPGGPREQAVAAFIDGKL   87 (376)
T ss_pred             ceecccCCCCC---cCccCCeEEEECCEEEEEeCCCC-----CeEEEEECCCCCCCeEECCcCCCCCcccceEEEECCEE
Confidence            34566778888   56666678889999999999633     458999986  47899999987 589999999999999


Q ss_pred             EEEeCCCC-------C--CceEEEeCCCCcEEEecCC-CCcccceeEEE-ECCEEEEEccCCCCC---------------
Q 045821          171 YCIGGLGD-------T--HSWDVYDPRTNNWKLHTEP-NIFTEIEDSFV-MDGKIYIRCSASAAT---------------  224 (352)
Q Consensus       171 yv~gG~~~-------~--~~v~~yd~~t~~W~~~~~~-~~~~~~~~~~~-~~g~iyv~GG~~~~~---------------  224 (352)
                      ||+||...       .  +++++||+.+++|+.++++ +..+..+++++ .+++||++||.....               
T Consensus        88 YV~GG~~~~~~~~~~~~~~~v~~YD~~~n~W~~~~~~~p~~~~~~~~~~~~~~~IYv~GG~~~~~~~~~~~d~~~~~~~~  167 (376)
T PRK14131         88 YVFGGIGKTNSEGSPQVFDDVYKYDPKTNSWQKLDTRSPVGLAGHVAVSLHNGKAYITGGVNKNIFDGYFEDLAAAGKDK  167 (376)
T ss_pred             EEEcCCCCCCCCCceeEcccEEEEeCCCCEEEeCCCCCCCcccceEEEEeeCCEEEEECCCCHHHHHHHHhhhhhcccch
Confidence            99999754       1  6899999999999999853 44444555555 799999999975310               


Q ss_pred             -------------------CceEEEEEeCCCCceeecCCCCCCCC-CCcEEEECCEEEEEecCCC-----CeE--EEEeC
Q 045821          225 -------------------SHVCALVYEPSTDSWLHADANMASGW-RGPAVVVDDALYVLDQSSG-----TKL--MMWQK  277 (352)
Q Consensus       225 -------------------~~~~i~~yd~~~~~W~~~~~~~~~~~-~~~~~~~~~~l~~~gg~~~-----~~v--~~yd~  277 (352)
                                         ....+++||+.+++|+.+++.+..++ ..+++..+++||++||...     ..+  +.||+
T Consensus       168 ~~~~~i~~~~~~~~~~~~~~~~~v~~YD~~t~~W~~~~~~p~~~~~~~a~v~~~~~iYv~GG~~~~~~~~~~~~~~~~~~  247 (376)
T PRK14131        168 TPKDKINDAYFDKKPEDYFFNKEVLSYDPSTNQWKNAGESPFLGTAGSAVVIKGNKLWLINGEIKPGLRTDAVKQGKFTG  247 (376)
T ss_pred             hhhhhhHHHHhcCChhhcCcCceEEEEECCCCeeeECCcCCCCCCCcceEEEECCEEEEEeeeECCCcCChhheEEEecC
Confidence                               13579999999999999987554333 4445678999999999532     233  45688


Q ss_pred             CCCceEEccCCCCCcC----C--CCceEEEeCCEEEEEcCCc-----------------------eEEEEEcCCcccccc
Q 045821          278 ESREWSPVGRLSTLLT----R--PPCKLVAIGKTIFVIGKGC-----------------------SAVVIDVGNIGNIGG  328 (352)
Q Consensus       278 ~~~~W~~~~~~~~~~~----~--~~~~~~~~~~~i~v~gG~~-----------------------~~~~~d~~~~~w~~~  328 (352)
                      ++++|+.++.+|....    +  ..+..+..+++||++||..                       .+.+||+++++|...
T Consensus       248 ~~~~W~~~~~~p~~~~~~~~~~~~~~~a~~~~~~iyv~GG~~~~~~~~~~~~~~~~~~~~~~~~~~~e~yd~~~~~W~~~  327 (376)
T PRK14131        248 NNLKWQKLPDLPPAPGGSSQEGVAGAFAGYSNGVLLVAGGANFPGARENYQNGKLYAHEGLKKSWSDEIYALVNGKWQKV  327 (376)
T ss_pred             CCcceeecCCCCCCCcCCcCCccceEeceeECCEEEEeeccCCCCChhhhhcCCcccccCCcceeehheEEecCCccccc
Confidence            9999999998876321    1  1122467899999999953                       234799999999875


Q ss_pred             eEE
Q 045821          329 IMV  331 (352)
Q Consensus       329 ~~~  331 (352)
                      ..|
T Consensus       328 ~~l  330 (376)
T PRK14131        328 GEL  330 (376)
T ss_pred             CcC
Confidence            444


No 20 
>KOG4693 consensus Uncharacterized conserved protein, contains kelch repeat [General function prediction only]
Probab=99.96  E-value=3.7e-28  Score=197.42  Aligned_cols=229  Identities=18%  Similarity=0.202  Sum_probs=183.2

Q ss_pred             CeeEeccCCCcccCCcceEEEEECCEEEEEeCCCCC----CCCCCeEEEEECCCCeeEecCC-------------CCCcc
Q 045821           96 SWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWS----EDATSEVYCYDASMNTWTDAAP-------------MSTAR  158 (352)
Q Consensus        96 ~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~----~~~~~~~~~~d~~t~~W~~~~~-------------~~~~r  158 (352)
                      .|+..-.-.   +.|-+|+++.++..||-|||+-..    ....-++.++|..+-+|..+++             .|..|
T Consensus         3 ~WTVHLeGG---PrRVNHAavaVG~riYSFGGYCsGedy~~~~piDVH~lNa~~~RWtk~pp~~~ka~i~~~yp~VPyqR   79 (392)
T KOG4693|consen    3 TWTVHLEGG---PRRVNHAAVAVGSRIYSFGGYCSGEDYDAKDPIDVHVLNAENYRWTKMPPGITKATIESPYPAVPYQR   79 (392)
T ss_pred             eEEEEecCC---cccccceeeeecceEEecCCcccccccccCCcceeEEeeccceeEEecCcccccccccCCCCccchhh
Confidence            355443333   468899999999999999996432    1335589999999999999886             23459


Q ss_pred             eeeeeEEECCEEEEEeCCCCC----CceEEEeCCCCcEEEec---CCCCcccceeEEEECCEEEEEccCCCCC--CceEE
Q 045821          159 CYFPCGVLNQKIYCIGGLGDT----HSWDVYDPRTNNWKLHT---EPNIFTEIEDSFVMDGKIYIRCSASAAT--SHVCA  229 (352)
Q Consensus       159 ~~~~~~~~~~~iyv~gG~~~~----~~v~~yd~~t~~W~~~~---~~~~~~~~~~~~~~~g~iyv~GG~~~~~--~~~~i  229 (352)
                      ++|+.+.+++++|+.||.++.    +.++.||++|+.|.+..   -.|.++.++++|+.++.+|++||.....  ..+++
T Consensus        80 YGHtvV~y~d~~yvWGGRND~egaCN~Ly~fDp~t~~W~~p~v~G~vPgaRDGHsAcV~gn~MyiFGGye~~a~~FS~d~  159 (392)
T KOG4693|consen   80 YGHTVVEYQDKAYVWGGRNDDEGACNLLYEFDPETNVWKKPEVEGFVPGARDGHSACVWGNQMYIFGGYEEDAQRFSQDT  159 (392)
T ss_pred             cCceEEEEcceEEEEcCccCcccccceeeeeccccccccccceeeecCCccCCceeeEECcEEEEecChHHHHHhhhccc
Confidence            999999999999999998875    88899999999998764   3688899999999999999999987543  34589


Q ss_pred             EEEeCCCCceeecCC---CCCCCCCCcEEEECCEEEEEecCCC-------------CeEEEEeCCCCceEEccCCCC-Cc
Q 045821          230 LVYEPSTDSWLHADA---NMASGWRGPAVVVDDALYVLDQSSG-------------TKLMMWQKESREWSPVGRLST-LL  292 (352)
Q Consensus       230 ~~yd~~~~~W~~~~~---~~~~~~~~~~~~~~~~l~~~gg~~~-------------~~v~~yd~~~~~W~~~~~~~~-~~  292 (352)
                      .++|..+.+|+++..   ++.-+..+.+.++++.+|++||...             +.+..+|..++.|..-+..+. +.
T Consensus       160 h~ld~~TmtWr~~~Tkg~PprwRDFH~a~~~~~~MYiFGGR~D~~gpfHs~~e~Yc~~i~~ld~~T~aW~r~p~~~~~P~  239 (392)
T KOG4693|consen  160 HVLDFATMTWREMHTKGDPPRWRDFHTASVIDGMMYIFGGRSDESGPFHSIHEQYCDTIMALDLATGAWTRTPENTMKPG  239 (392)
T ss_pred             eeEeccceeeeehhccCCCchhhhhhhhhhccceEEEeccccccCCCccchhhhhcceeEEEeccccccccCCCCCcCCC
Confidence            999999999999854   2222334566788999999999543             578899999999998764332 22


Q ss_pred             CCCCceEEEeCCEEEEEcCCc--------eEEEEEcCCccccc
Q 045821          293 TRPPCKLVAIGKTIFVIGKGC--------SAVVIDVGNIGNIG  327 (352)
Q Consensus       293 ~~~~~~~~~~~~~i~v~gG~~--------~~~~~d~~~~~w~~  327 (352)
                      .|+.++..++++++|+|||++        +.+.||+.+..|..
T Consensus       240 GRRSHS~fvYng~~Y~FGGYng~ln~HfndLy~FdP~t~~W~~  282 (392)
T KOG4693|consen  240 GRRSHSTFVYNGKMYMFGGYNGTLNVHFNDLYCFDPKTSMWSV  282 (392)
T ss_pred             cccccceEEEcceEEEecccchhhhhhhcceeecccccchhee
Confidence            688889999999999999976        46679999999954


No 21 
>KOG0379 consensus Kelch repeat-containing proteins [General function prediction only]
Probab=99.94  E-value=4.7e-25  Score=205.53  Aligned_cols=234  Identities=20%  Similarity=0.246  Sum_probs=191.6

Q ss_pred             ChhhHHHHhhCCCC--CCEEEEEEecCCCC---ceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECCEEEEEeCC
Q 045821           54 SEEWCAYRRKHNLD--ETWIYAFCRDNKLE---RVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGC  128 (352)
Q Consensus        54 s~~~~~~~~~~~~~--~~~l~~~gg~~~~~---~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~  128 (352)
                      ....+..|..|.+.  ++.+|++||.....   ...+++||.  .+..|.........+.++.+++++.++++||+|||.
T Consensus        54 ~~~~p~~R~~hs~~~~~~~~~vfGG~~~~~~~~~~dl~~~d~--~~~~w~~~~~~g~~p~~r~g~~~~~~~~~l~lfGG~  131 (482)
T KOG0379|consen   54 LGVGPIPRAGHSAVLIGNKLYVFGGYGSGDRLTDLDLYVLDL--ESQLWTKPAATGDEPSPRYGHSLSAVGDKLYLFGGT  131 (482)
T ss_pred             CCCCcchhhccceeEECCEEEEECCCCCCCccccceeEEeec--CCcccccccccCCCCCcccceeEEEECCeEEEEccc
Confidence            34556677777655  89999999964222   225999999  999998887766555889999999999999999998


Q ss_pred             CCCCCCCCeEEEEECCCCeeEecCC---CCCcceeeeeEEECCEEEEEeCCCCC----CceEEEeCCCCcEEEec---CC
Q 045821          129 GWSEDATSEVYCYDASMNTWTDAAP---MSTARCYFPCGVLNQKIYCIGGLGDT----HSWDVYDPRTNNWKLHT---EP  198 (352)
Q Consensus       129 ~~~~~~~~~~~~~d~~t~~W~~~~~---~~~~r~~~~~~~~~~~iyv~gG~~~~----~~v~~yd~~t~~W~~~~---~~  198 (352)
                      +......++++.||+.|++|+.+..   +|.+|.+|+++.++.++||+||....    +++++||+.+.+|.++.   ..
T Consensus       132 ~~~~~~~~~l~~~d~~t~~W~~l~~~~~~P~~r~~Hs~~~~g~~l~vfGG~~~~~~~~ndl~i~d~~~~~W~~~~~~g~~  211 (482)
T KOG0379|consen  132 DKKYRNLNELHSLDLSTRTWSLLSPTGDPPPPRAGHSATVVGTKLVVFGGIGGTGDSLNDLHIYDLETSTWSELDTQGEA  211 (482)
T ss_pred             cCCCCChhheEeccCCCCcEEEecCcCCCCCCcccceEEEECCEEEEECCccCcccceeeeeeeccccccceecccCCCC
Confidence            8644668899999999999998754   57889999999999999999998754    99999999999999986   35


Q ss_pred             CCcccceeEEEECCEEEEEccCC-CCCCceEEEEEeCCCCceeecCC---CCCCCCCCcEEEECCEEEEEecCCC-----
Q 045821          199 NIFTEIEDSFVMDGKIYIRCSAS-AATSHVCALVYEPSTDSWLHADA---NMASGWRGPAVVVDDALYVLDQSSG-----  269 (352)
Q Consensus       199 ~~~~~~~~~~~~~g~iyv~GG~~-~~~~~~~i~~yd~~~~~W~~~~~---~~~~~~~~~~~~~~~~l~~~gg~~~-----  269 (352)
                      |.++..|++++++++++++||.. .....++++.+|+.+.+|.++..   .+.++..+..+..+..++++||...     
T Consensus       212 P~pR~gH~~~~~~~~~~v~gG~~~~~~~l~D~~~ldl~~~~W~~~~~~g~~p~~R~~h~~~~~~~~~~l~gG~~~~~~~~  291 (482)
T KOG0379|consen  212 PSPRYGHAMVVVGNKLLVFGGGDDGDVYLNDVHILDLSTWEWKLLPTGGDLPSPRSGHSLTVSGDHLLLFGGGTDPKQEP  291 (482)
T ss_pred             CCCCCCceEEEECCeEEEEeccccCCceecceEeeecccceeeeccccCCCCCCcceeeeEEECCEEEEEcCCccccccc
Confidence            66788889999999999999988 44556699999999999997653   2333444445688999999998544     


Q ss_pred             -CeEEEEeCCCCceEEccCCC
Q 045821          270 -TKLMMWQKESREWSPVGRLS  289 (352)
Q Consensus       270 -~~v~~yd~~~~~W~~~~~~~  289 (352)
                       .+++.||.+++.|+++....
T Consensus       292 l~~~~~l~~~~~~w~~~~~~~  312 (482)
T KOG0379|consen  292 LGDLYGLDLETLVWSKVESVG  312 (482)
T ss_pred             ccccccccccccceeeeeccc
Confidence             46889999999999997554


No 22 
>KOG0379 consensus Kelch repeat-containing proteins [General function prediction only]
Probab=99.93  E-value=7.4e-24  Score=197.52  Aligned_cols=225  Identities=20%  Similarity=0.357  Sum_probs=184.4

Q ss_pred             cccCCcceEEEEECCEEEEEeCCCCCCCCCC-eEEEEECCCCeeEecC---CCCCcceeeeeEEECCEEEEEeCCCCC--
Q 045821          106 RTLRRKGMGFEVLGKNAYLLGGCGWSEDATS-EVYCYDASMNTWTDAA---PMSTARCYFPCGVLNQKIYCIGGLGDT--  179 (352)
Q Consensus       106 ~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~-~~~~~d~~t~~W~~~~---~~~~~r~~~~~~~~~~~iyv~gG~~~~--  179 (352)
                      .+.+|..|+++.+++++|+|||......... +++++|..+..|....   ..|.+|++|.++.++++||++||....  
T Consensus        57 ~p~~R~~hs~~~~~~~~~vfGG~~~~~~~~~~dl~~~d~~~~~w~~~~~~g~~p~~r~g~~~~~~~~~l~lfGG~~~~~~  136 (482)
T KOG0379|consen   57 GPIPRAGHSAVLIGNKLYVFGGYGSGDRLTDLDLYVLDLESQLWTKPAATGDEPSPRYGHSLSAVGDKLYLFGGTDKKYR  136 (482)
T ss_pred             CcchhhccceeEECCEEEEECCCCCCCccccceeEEeecCCcccccccccCCCCCcccceeEEEECCeEEEEccccCCCC
Confidence            3378999999999999999999765522222 6999999999997654   456889999999999999999998843  


Q ss_pred             --CceEEEeCCCCcEEEec---CCCCcccceeEEEECCEEEEEccCCCCC-CceEEEEEeCCCCceeecCC---CCCCCC
Q 045821          180 --HSWDVYDPRTNNWKLHT---EPNIFTEIEDSFVMDGKIYIRCSASAAT-SHVCALVYEPSTDSWLHADA---NMASGW  250 (352)
Q Consensus       180 --~~v~~yd~~t~~W~~~~---~~~~~~~~~~~~~~~g~iyv~GG~~~~~-~~~~i~~yd~~~~~W~~~~~---~~~~~~  250 (352)
                        ++++.||+.|++|+.+.   .+|+++..|+++++++++||+||..... ..+++++||+.+.+|.++..   .+.++.
T Consensus       137 ~~~~l~~~d~~t~~W~~l~~~~~~P~~r~~Hs~~~~g~~l~vfGG~~~~~~~~ndl~i~d~~~~~W~~~~~~g~~P~pR~  216 (482)
T KOG0379|consen  137 NLNELHSLDLSTRTWSLLSPTGDPPPPRAGHSATVVGTKLVVFGGIGGTGDSLNDLHIYDLETSTWSELDTQGEAPSPRY  216 (482)
T ss_pred             ChhheEeccCCCCcEEEecCcCCCCCCcccceEEEECCEEEEECCccCcccceeeeeeeccccccceecccCCCCCCCCC
Confidence              79999999999999886   3688889999999999999999988766 67799999999999998854   233344


Q ss_pred             CCcEEEECCEEEEEecCC-C----CeEEEEeCCCCceEEcc---CCCCCcCCCCceEEEeCCEEEEEcCCce--------
Q 045821          251 RGPAVVVDDALYVLDQSS-G----TKLMMWQKESREWSPVG---RLSTLLTRPPCKLVAIGKTIFVIGKGCS--------  314 (352)
Q Consensus       251 ~~~~~~~~~~l~~~gg~~-~----~~v~~yd~~~~~W~~~~---~~~~~~~~~~~~~~~~~~~i~v~gG~~~--------  314 (352)
                      .+..++.+++++++||.. +    ++++.+|+.+.+|..+.   ..|.  +|..+..+..+++++++||...        
T Consensus       217 gH~~~~~~~~~~v~gG~~~~~~~l~D~~~ldl~~~~W~~~~~~g~~p~--~R~~h~~~~~~~~~~l~gG~~~~~~~~l~~  294 (482)
T KOG0379|consen  217 GHAMVVVGNKLLVFGGGDDGDVYLNDVHILDLSTWEWKLLPTGGDLPS--PRSGHSLTVSGDHLLLFGGGTDPKQEPLGD  294 (482)
T ss_pred             CceEEEECCeEEEEeccccCCceecceEeeecccceeeeccccCCCCC--CcceeeeEEECCEEEEEcCCcccccccccc
Confidence            455578899999998876 3    79999999999999665   3343  6888888899999999998653        


Q ss_pred             EEEEEcCCcccccceEEe
Q 045821          315 AVVIDVGNIGNIGGIMVS  332 (352)
Q Consensus       315 ~~~~d~~~~~w~~~~~~~  332 (352)
                      .+.+|.+...|.+.....
T Consensus       295 ~~~l~~~~~~w~~~~~~~  312 (482)
T KOG0379|consen  295 LYGLDLETLVWSKVESVG  312 (482)
T ss_pred             cccccccccceeeeeccc
Confidence            345888888898755554


No 23 
>KOG1230 consensus Protein containing repeated kelch motifs [General function prediction only]
Probab=99.91  E-value=7.7e-23  Score=175.86  Aligned_cols=223  Identities=13%  Similarity=0.191  Sum_probs=174.2

Q ss_pred             cCCcceEEEEE--CCEEEEEeCC--CCC-CCCCCeEEEEECCCCeeEec--CCCCCcceeeeeEEE-CCEEEEEeCCCCC
Q 045821          108 LRRKGMGFEVL--GKNAYLLGGC--GWS-EDATSEVYCYDASMNTWTDA--APMSTARCYFPCGVL-NQKIYCIGGLGDT  179 (352)
Q Consensus       108 ~~~~~~~~~~~--~~~iyv~GG~--~~~-~~~~~~~~~~d~~t~~W~~~--~~~~~~r~~~~~~~~-~~~iyv~gG~~~~  179 (352)
                      .+|.++++.+.  .+.+++|||.  ++. ....++++.||..+++|+.+  |+.|.||+.|.++++ .+.+|++||.-..
T Consensus        65 spRsn~sl~~nPekeELilfGGEf~ngqkT~vYndLy~Yn~k~~eWkk~~spn~P~pRsshq~va~~s~~l~~fGGEfaS  144 (521)
T KOG1230|consen   65 SPRSNPSLFANPEKEELILFGGEFYNGQKTHVYNDLYSYNTKKNEWKKVVSPNAPPPRSSHQAVAVPSNILWLFGGEFAS  144 (521)
T ss_pred             CCCCCcceeeccCcceeEEecceeecceeEEEeeeeeEEeccccceeEeccCCCcCCCccceeEEeccCeEEEeccccCC
Confidence            67888777666  5689999994  322 24578999999999999976  456789999988888 4899999995432


Q ss_pred             ---------CceEEEeCCCCcEEEecC--CCCcccceeEEEECCEEEEEccCCCCC----CceEEEEEeCCCCceeecCC
Q 045821          180 ---------HSWDVYDPRTNNWKLHTE--PNIFTEIEDSFVMDGKIYIRCSASAAT----SHVCALVYEPSTDSWLHADA  244 (352)
Q Consensus       180 ---------~~v~~yd~~t~~W~~~~~--~~~~~~~~~~~~~~g~iyv~GG~~~~~----~~~~i~~yd~~~~~W~~~~~  244 (352)
                               .++|.||..|++|+++..  -|.+|.+|-+++...+|++|||.....    +.+++++||+.+-+|+++.+
T Consensus       145 Pnq~qF~HYkD~W~fd~~trkweql~~~g~PS~RSGHRMvawK~~lilFGGFhd~nr~y~YyNDvy~FdLdtykW~Klep  224 (521)
T KOG1230|consen  145 PNQEQFHHYKDLWLFDLKTRKWEQLEFGGGPSPRSGHRMVAWKRQLILFGGFHDSNRDYIYYNDVYAFDLDTYKWSKLEP  224 (521)
T ss_pred             cchhhhhhhhheeeeeeccchheeeccCCCCCCCccceeEEeeeeEEEEcceecCCCceEEeeeeEEEeccceeeeeccC
Confidence                     999999999999999964  466788888999999999999988643    34599999999999999966


Q ss_pred             CC--CCCCCCc--EEEECCEEEEEecCCC-------------CeEEEEeCCC-----CceEEccCCC-CCcCCCCceEEE
Q 045821          245 NM--ASGWRGP--AVVVDDALYVLDQSSG-------------TKLMMWQKES-----REWSPVGRLS-TLLTRPPCKLVA  301 (352)
Q Consensus       245 ~~--~~~~~~~--~~~~~~~l~~~gg~~~-------------~~v~~yd~~~-----~~W~~~~~~~-~~~~~~~~~~~~  301 (352)
                      .-  +.++++.  ++.-+|.||+.||+..             ++++.++++.     =+|+.+.... .+.+|.++++++
T Consensus       225 sga~PtpRSGcq~~vtpqg~i~vyGGYsK~~~kK~~dKG~~hsDmf~L~p~~~~~dKw~W~kvkp~g~kPspRsgfsv~v  304 (521)
T KOG1230|consen  225 SGAGPTPRSGCQFSVTPQGGIVVYGGYSKQRVKKDVDKGTRHSDMFLLKPEDGREDKWVWTKVKPSGVKPSPRSGFSVAV  304 (521)
T ss_pred             CCCCCCCCCcceEEecCCCcEEEEcchhHhhhhhhhhcCceeeeeeeecCCcCCCcceeEeeccCCCCCCCCCCceeEEE
Confidence            32  3444444  3455999999999764             6899999988     4588886432 223688888777


Q ss_pred             e-CCEEEEEcCCc---------------eEEEEEcCCcccccceE
Q 045821          302 I-GKTIFVIGKGC---------------SAVVIDVGNIGNIGGIM  330 (352)
Q Consensus       302 ~-~~~i~v~gG~~---------------~~~~~d~~~~~w~~~~~  330 (352)
                      . +++-|.|||..               +.+.||...++|.++..
T Consensus       305 a~n~kal~FGGV~D~eeeeEsl~g~F~NDLy~fdlt~nrW~~~ql  349 (521)
T KOG1230|consen  305 AKNHKALFFGGVCDLEEEEESLSGEFFNDLYFFDLTRNRWSEGQL  349 (521)
T ss_pred             ecCCceEEecceecccccchhhhhhhhhhhhheecccchhhHhhh
Confidence            6 55899999955               36679999999987633


No 24 
>KOG4152 consensus Host cell transcription factor HCFC1 [Cell cycle control, cell division, chromosome partitioning; Transcription]
Probab=99.90  E-value=6.6e-23  Score=180.60  Aligned_cols=264  Identities=17%  Similarity=0.199  Sum_probs=196.9

Q ss_pred             hhcccccC-hhhHHHHhhCCCC--CCEEEEEEecCCCCceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECCEEE
Q 045821           47 RWRGLLHS-EEWCAYRRKHNLD--ETWIYAFCRDNKLERVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGKNAY  123 (352)
Q Consensus        47 ~W~~l~~s-~~~~~~~~~~~~~--~~~l~~~gg~~~~~~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~iy  123 (352)
                      +|+.+..+ ...+++|+.|...  ...|.+|||.|..-..++++||.  .++.|..-......+.+.+.|..+..+.+||
T Consensus        18 rWrrV~~~tGPvPrpRHGHRAVaikELiviFGGGNEGiiDELHvYNT--atnqWf~PavrGDiPpgcAA~GfvcdGtril   95 (830)
T KOG4152|consen   18 RWRRVQQSTGPVPRPRHGHRAVAIKELIVIFGGGNEGIIDELHVYNT--ATNQWFAPAVRGDIPPGCAAFGFVCDGTRIL   95 (830)
T ss_pred             ceEEEecccCCCCCccccchheeeeeeEEEecCCcccchhhhhhhcc--ccceeecchhcCCCCCchhhcceEecCceEE
Confidence            57766543 2345556555433  88899999986655677899999  9999976544333336677888888999999


Q ss_pred             EEeCCCCCCCCCCeEEEEECCCCeeEecCC-------CCCcceeeeeEEECCEEEEEeCCCCC------------CceEE
Q 045821          124 LLGGCGWSEDATSEVYCYDASMNTWTDAAP-------MSTARCYFPCGVLNQKIYCIGGLGDT------------HSWDV  184 (352)
Q Consensus       124 v~GG~~~~~~~~~~~~~~d~~t~~W~~~~~-------~~~~r~~~~~~~~~~~iyv~gG~~~~------------~~v~~  184 (352)
                      +|||..+....+++++-.....-.|+++.+       +|-||.+|+...++++-|+|||...+            +++++
T Consensus        96 vFGGMvEYGkYsNdLYELQasRWeWkrlkp~~p~nG~pPCPRlGHSFsl~gnKcYlFGGLaNdseDpknNvPrYLnDlY~  175 (830)
T KOG4152|consen   96 VFGGMVEYGKYSNDLYELQASRWEWKRLKPKTPKNGPPPCPRLGHSFSLVGNKCYLFGGLANDSEDPKNNVPRYLNDLYI  175 (830)
T ss_pred             EEccEeeeccccchHHHhhhhhhhHhhcCCCCCCCCCCCCCccCceeEEeccEeEEeccccccccCcccccchhhcceEE
Confidence            999988777888888766666666777753       35689999999999999999996432            77777


Q ss_pred             EeCCCC----cEEEec---CCCCcccceeEEEE------CCEEEEEccCCCCCCceEEEEEeCCCCceeecCC---CCCC
Q 045821          185 YDPRTN----NWKLHT---EPNIFTEIEDSFVM------DGKIYIRCSASAATSHVCALVYEPSTDSWLHADA---NMAS  248 (352)
Q Consensus       185 yd~~t~----~W~~~~---~~~~~~~~~~~~~~------~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~---~~~~  248 (352)
                      .+..-+    .|...-   ..|.++..|.++++      ..++||+||+.+ -...++|..|+.+-+|++..-   .+-+
T Consensus       176 leL~~Gsgvv~W~ip~t~Gv~P~pRESHTAViY~eKDs~~skmvvyGGM~G-~RLgDLW~Ldl~Tl~W~kp~~~G~~PlP  254 (830)
T KOG4152|consen  176 LELRPGSGVVAWDIPITYGVLPPPRESHTAVIYTEKDSKKSKMVVYGGMSG-CRLGDLWTLDLDTLTWNKPSLSGVAPLP  254 (830)
T ss_pred             EEeccCCceEEEecccccCCCCCCcccceeEEEEeccCCcceEEEEccccc-ccccceeEEecceeecccccccCCCCCC
Confidence            777644    486543   36777888888777      348999998874 344589999999999998743   2333


Q ss_pred             CCCCcEEEECCEEEEEecCCC------------------CeEEEEeCCCCceEEcc-----CCCCCcCCCCceEEEeCCE
Q 045821          249 GWRGPAVVVDDALYVLDQSSG------------------TKLMMWQKESREWSPVG-----RLSTLLTRPPCKLVAIGKT  305 (352)
Q Consensus       249 ~~~~~~~~~~~~l~~~gg~~~------------------~~v~~yd~~~~~W~~~~-----~~~~~~~~~~~~~~~~~~~  305 (352)
                      +.-+.+..+++|+|++||.--                  +.+-++|++++.|..+-     +-..+..|.+++.+.++.+
T Consensus       255 RSLHsa~~IGnKMyvfGGWVPl~~~~~~~~~hekEWkCTssl~clNldt~~W~tl~~d~~ed~tiPR~RAGHCAvAigtR  334 (830)
T KOG4152|consen  255 RSLHSATTIGNKMYVFGGWVPLVMDDVKVATHEKEWKCTSSLACLNLDTMAWETLLMDTLEDNTIPRARAGHCAVAIGTR  334 (830)
T ss_pred             cccccceeecceeEEecceeeeeccccccccccceeeeccceeeeeecchheeeeeeccccccccccccccceeEEeccE
Confidence            334556788999999999311                  57889999999999874     1223346888999999999


Q ss_pred             EEEEcCCc
Q 045821          306 IFVIGKGC  313 (352)
Q Consensus       306 i~v~gG~~  313 (352)
                      +|+-.|++
T Consensus       335 lYiWSGRD  342 (830)
T KOG4152|consen  335 LYIWSGRD  342 (830)
T ss_pred             EEEEeccc
Confidence            99999876


No 25 
>KOG1230 consensus Protein containing repeated kelch motifs [General function prediction only]
Probab=99.90  E-value=1.9e-22  Score=173.43  Aligned_cols=257  Identities=14%  Similarity=0.147  Sum_probs=183.7

Q ss_pred             cCCCcHHHHHHhhccCccccchhhhhhhhhhcccccChhhHHHHhhCC----CCCCEEEEEEecCCCCc-----eeEEEe
Q 045821           18 ISGLPDDIALFCLARVPRRYHAVLKCVSSRWRGLLHSEEWCAYRRKHN----LDETWIYAFCRDNKLER-----VCCYVL   88 (352)
Q Consensus        18 ~~~lp~dl~~~il~r~p~~~~~~l~~v~k~W~~l~~s~~~~~~~~~~~----~~~~~l~~~gg~~~~~~-----~~~~~~   88 (352)
                      ...|-++.+.+|+..+.++......-.+    .-.++     +|....    ...+.|+++||.-..+.     ..+|.|
T Consensus        33 ~~e~de~~i~~~iq~~eaK~~e~~~e~~----~~~Ps-----pRsn~sl~~nPekeELilfGGEf~ngqkT~vYndLy~Y  103 (521)
T KOG1230|consen   33 NEELDEADIAEIIQSLEAKQIEHVVETS----VPPPS-----PRSNPSLFANPEKEELILFGGEFYNGQKTHVYNDLYSY  103 (521)
T ss_pred             CcccchHHHHHHHHhhhhhccceeeecc----CCCCC-----CCCCcceeeccCcceeEEecceeecceeEEEeeeeeEE
Confidence            3445577888888888776542211000    00111     122111    11557999998522221     247899


Q ss_pred             cCCCCCCCeeEeccCCCcccCCcceEEEEEC-CEEEEEeCCCCCC-----CCCCeEEEEECCCCeeEecCC--CCCccee
Q 045821           89 DPNSTRRSWKMIHELPARTLRRKGMGFEVLG-KNAYLLGGCGWSE-----DATSEVYCYDASMNTWTDAAP--MSTARCY  160 (352)
Q Consensus        89 d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~-~~iyv~GG~~~~~-----~~~~~~~~~d~~t~~W~~~~~--~~~~r~~  160 (352)
                      |.  .+++|+++..... |.+|..|.++++- |.+|+|||.-...     ...+++|.||..+++|..+..  -|.||++
T Consensus       104 n~--k~~eWkk~~spn~-P~pRsshq~va~~s~~l~~fGGEfaSPnq~qF~HYkD~W~fd~~trkweql~~~g~PS~RSG  180 (521)
T KOG1230|consen  104 NT--KKNEWKKVVSPNA-PPPRSSHQAVAVPSNILWLFGGEFASPNQEQFHHYKDLWLFDLKTRKWEQLEFGGGPSPRSG  180 (521)
T ss_pred             ec--cccceeEeccCCC-cCCCccceeEEeccCeEEEeccccCCcchhhhhhhhheeeeeeccchheeeccCCCCCCCcc
Confidence            99  9999999875433 3678888887774 8999999964321     246799999999999998854  5789999


Q ss_pred             eeeEEECCEEEEEeCCCCC-------CceEEEeCCCCcEEEecCC---CCcccceeEEEE-CCEEEEEccCCC-------
Q 045821          161 FPCGVLNQKIYCIGGLGDT-------HSWDVYDPRTNNWKLHTEP---NIFTEIEDSFVM-DGKIYIRCSASA-------  222 (352)
Q Consensus       161 ~~~~~~~~~iyv~gG~~~~-------~~v~~yd~~t~~W~~~~~~---~~~~~~~~~~~~-~g~iyv~GG~~~-------  222 (352)
                      |.++....++++|||..+.       +++++||+.|-+|.++.+.   |.+|.+++..+. +|.|||.||+..       
T Consensus       181 HRMvawK~~lilFGGFhd~nr~y~YyNDvy~FdLdtykW~Klepsga~PtpRSGcq~~vtpqg~i~vyGGYsK~~~kK~~  260 (521)
T KOG1230|consen  181 HRMVAWKRQLILFGGFHDSNRDYIYYNDVYAFDLDTYKWSKLEPSGAGPTPRSGCQFSVTPQGGIVVYGGYSKQRVKKDV  260 (521)
T ss_pred             ceeEEeeeeEEEEcceecCCCceEEeeeeEEEeccceeeeeccCCCCCCCCCCcceEEecCCCcEEEEcchhHhhhhhhh
Confidence            9999999999999997654       9999999999999999753   456667766666 999999999763       


Q ss_pred             --CCCceEEEEEeCCC-----CceeecCC--CCCCCCCCcE--EEECCEEEEEecCCC-------------CeEEEEeCC
Q 045821          223 --ATSHVCALVYEPST-----DSWLHADA--NMASGWRGPA--VVVDDALYVLDQSSG-------------TKLMMWQKE  278 (352)
Q Consensus       223 --~~~~~~i~~yd~~~-----~~W~~~~~--~~~~~~~~~~--~~~~~~l~~~gg~~~-------------~~v~~yd~~  278 (352)
                        +...++++..++.+     -.|+.+.+  ..+.++++.+  +.-+++-+.+||.-.             ++++.||..
T Consensus       261 dKG~~hsDmf~L~p~~~~~dKw~W~kvkp~g~kPspRsgfsv~va~n~kal~FGGV~D~eeeeEsl~g~F~NDLy~fdlt  340 (521)
T KOG1230|consen  261 DKGTRHSDMFLLKPEDGREDKWVWTKVKPSGVKPSPRSGFSVAVAKNHKALFFGGVCDLEEEEESLSGEFFNDLYFFDLT  340 (521)
T ss_pred             hcCceeeeeeeecCCcCCCcceeEeeccCCCCCCCCCCceeEEEecCCceEEecceecccccchhhhhhhhhhhhheecc
Confidence              22344899999877     56888865  2334444444  345779999999432             789999999


Q ss_pred             CCceEEcc
Q 045821          279 SREWSPVG  286 (352)
Q Consensus       279 ~~~W~~~~  286 (352)
                      .++|+...
T Consensus       341 ~nrW~~~q  348 (521)
T KOG1230|consen  341 RNRWSEGQ  348 (521)
T ss_pred             cchhhHhh
Confidence            99998663


No 26 
>KOG4152 consensus Host cell transcription factor HCFC1 [Cell cycle control, cell division, chromosome partitioning; Transcription]
Probab=99.80  E-value=3.1e-18  Score=151.37  Aligned_cols=218  Identities=15%  Similarity=0.201  Sum_probs=157.9

Q ss_pred             CCCCeeEeccCCC-cccCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEECCCCeeEec---CCCCCcceeeeeEEECC
Q 045821           93 TRRSWKMIHELPA-RTLRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDASMNTWTDA---APMSTARCYFPCGVLNQ  168 (352)
Q Consensus        93 ~~~~w~~~~~~~~-~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~---~~~~~~r~~~~~~~~~~  168 (352)
                      ....|+.+..... -+.+|++|.++++...|.+|||-++  ...+++++||..+++|..-   .+.|.+.+.|..+..+.
T Consensus        15 ~~~rWrrV~~~tGPvPrpRHGHRAVaikELiviFGGGNE--GiiDELHvYNTatnqWf~PavrGDiPpgcAA~GfvcdGt   92 (830)
T KOG4152|consen   15 NVVRWRRVQQSTGPVPRPRHGHRAVAIKELIVIFGGGNE--GIIDELHVYNTATNQWFAPAVRGDIPPGCAAFGFVCDGT   92 (830)
T ss_pred             cccceEEEecccCCCCCccccchheeeeeeEEEecCCcc--cchhhhhhhccccceeecchhcCCCCCchhhcceEecCc
Confidence            5567887765432 2368999999999999999999765  4578899999999999753   46788888888888899


Q ss_pred             EEEEEeCCCCC----CceEEEeCCCCcEEEecC-------CCCcccceeEEEECCEEEEEccCCCCC---------CceE
Q 045821          169 KIYCIGGLGDT----HSWDVYDPRTNNWKLHTE-------PNIFTEIEDSFVMDGKIYIRCSASAAT---------SHVC  228 (352)
Q Consensus       169 ~iyv~gG~~~~----~~v~~yd~~t~~W~~~~~-------~~~~~~~~~~~~~~g~iyv~GG~~~~~---------~~~~  228 (352)
                      +||+|||.-..    ++++......-.|+++.+       +|-++-+|+...++++-|+|||.....         +.++
T Consensus        93 rilvFGGMvEYGkYsNdLYELQasRWeWkrlkp~~p~nG~pPCPRlGHSFsl~gnKcYlFGGLaNdseDpknNvPrYLnD  172 (830)
T KOG4152|consen   93 RILVFGGMVEYGKYSNDLYELQASRWEWKRLKPKTPKNGPPPCPRLGHSFSLVGNKCYLFGGLANDSEDPKNNVPRYLND  172 (830)
T ss_pred             eEEEEccEeeeccccchHHHhhhhhhhHhhcCCCCCCCCCCCCCccCceeEEeccEeEEeccccccccCcccccchhhcc
Confidence            99999995432    555444444456777753       233455666677799999999975422         2347


Q ss_pred             EEEEeCCCC----ceeecC---CCCCCCCCCcEEEE------CCEEEEEecCCC---CeEEEEeCCCCceEEccCC-CCC
Q 045821          229 ALVYEPSTD----SWLHAD---ANMASGWRGPAVVV------DDALYVLDQSSG---TKLMMWQKESREWSPVGRL-STL  291 (352)
Q Consensus       229 i~~yd~~~~----~W~~~~---~~~~~~~~~~~~~~------~~~l~~~gg~~~---~~v~~yd~~~~~W~~~~~~-~~~  291 (352)
                      ++..++..+    .|...-   ..++.+.++.++..      ..++|++||..+   .++|.+|+++.+|++...- -.+
T Consensus       173 lY~leL~~Gsgvv~W~ip~t~Gv~P~pRESHTAViY~eKDs~~skmvvyGGM~G~RLgDLW~Ldl~Tl~W~kp~~~G~~P  252 (830)
T KOG4152|consen  173 LYILELRPGSGVVAWDIPITYGVLPPPRESHTAVIYTEKDSKKSKMVVYGGMSGCRLGDLWTLDLDTLTWNKPSLSGVAP  252 (830)
T ss_pred             eEEEEeccCCceEEEecccccCCCCCCcccceeEEEEeccCCcceEEEEcccccccccceeEEecceeecccccccCCCC
Confidence            777776543    386542   23344444455443      347999999877   7999999999999977521 122


Q ss_pred             cCCCCceEEEeCCEEEEEcCC
Q 045821          292 LTRPPCKLVAIGKTIFVIGKG  312 (352)
Q Consensus       292 ~~~~~~~~~~~~~~i~v~gG~  312 (352)
                      ++|..+....+++|+|||||-
T Consensus       253 lPRSLHsa~~IGnKMyvfGGW  273 (830)
T KOG4152|consen  253 LPRSLHSATTIGNKMYVFGGW  273 (830)
T ss_pred             CCcccccceeecceeEEecce
Confidence            468888999999999999983


No 27 
>COG3055 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=99.80  E-value=1.2e-17  Score=142.05  Aligned_cols=237  Identities=21%  Similarity=0.323  Sum_probs=172.9

Q ss_pred             CCEEEEEEecCCCCceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECCEEEEEeCCCCC----CCCCCeEEEEEC
Q 045821           68 ETWIYAFCRDNKLERVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWS----EDATSEVYCYDA  143 (352)
Q Consensus        68 ~~~l~~~gg~~~~~~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~----~~~~~~~~~~d~  143 (352)
                      ++.+|+--|.   ....+|.+|.++....|+++...|.  .+|.+...++++++||+|||....    ....+++++|||
T Consensus        46 g~~~YVGLGs---~G~afy~ldL~~~~k~W~~~a~FpG--~~rnqa~~a~~~~kLyvFgG~Gk~~~~~~~~~nd~Y~y~p  120 (381)
T COG3055          46 GDTVYVGLGS---AGTAFYVLDLKKPGKGWTKIADFPG--GARNQAVAAVIGGKLYVFGGYGKSVSSSPQVFNDAYRYDP  120 (381)
T ss_pred             cceEEEEecc---CCccceehhhhcCCCCceEcccCCC--cccccchheeeCCeEEEeeccccCCCCCceEeeeeEEecC
Confidence            6788986553   3457888888667789999999986  578899999999999999996432    345789999999


Q ss_pred             CCCeeEecCC-CCCcceeeeeEEECC-EEEEEeCCCCC-------------------------------------CceEE
Q 045821          144 SMNTWTDAAP-MSTARCYFPCGVLNQ-KIYCIGGLGDT-------------------------------------HSWDV  184 (352)
Q Consensus       144 ~t~~W~~~~~-~~~~r~~~~~~~~~~-~iyv~gG~~~~-------------------------------------~~v~~  184 (352)
                      .+++|..+.. .|+....+..+.+++ +||++||+...                                     ..+..
T Consensus       121 ~~nsW~kl~t~sP~gl~G~~~~~~~~~~i~f~GGvn~~if~~yf~dv~~a~~d~~~~~~i~~~yf~~~~~dy~~n~ev~s  200 (381)
T COG3055         121 STNSWHKLDTRSPTGLVGASTFSLNGTKIYFFGGVNQNIFNGYFEDVGAAGKDKEAVDKIIAHYFDKKAEDYFFNKEVLS  200 (381)
T ss_pred             CCChhheeccccccccccceeEecCCceEEEEccccHHhhhhhHHhhhhhcccHHHHHHHHHHHhCCCHHHhcccccccc
Confidence            9999998865 355566777888876 99999997421                                     78899


Q ss_pred             EeCCCCcEEEecCCCCcccceeEEEE-CCEEEEEccCCCCC-CceEEEEEeC--CCCceeecCCCCCCCCCC---cE---
Q 045821          185 YDPRTNNWKLHTEPNIFTEIEDSFVM-DGKIYIRCSASAAT-SHVCALVYEP--STDSWLHADANMASGWRG---PA---  254 (352)
Q Consensus       185 yd~~t~~W~~~~~~~~~~~~~~~~~~-~g~iyv~GG~~~~~-~~~~i~~yd~--~~~~W~~~~~~~~~~~~~---~~---  254 (352)
                      |||++++|+.+...|......++++. ++++.++-|....+ ....+..++.  ...+|..+++.+......   .+   
T Consensus       201 y~p~~n~W~~~G~~pf~~~aGsa~~~~~n~~~lInGEiKpGLRt~~~k~~~~~~~~~~w~~l~~lp~~~~~~~eGvAGaf  280 (381)
T COG3055         201 YDPSTNQWRNLGENPFYGNAGSAVVIKGNKLTLINGEIKPGLRTAEVKQADFGGDNLKWLKLSDLPAPIGSNKEGVAGAF  280 (381)
T ss_pred             cccccchhhhcCcCcccCccCcceeecCCeEEEEcceecCCccccceeEEEeccCceeeeeccCCCCCCCCCccccceec
Confidence            99999999998866655544455554 56577775654333 2335556655  456899987643332221   11   


Q ss_pred             -EEECCEEEEEecCCC-----------------------CeEEEEeCCCCceEEccCCCCCcCCCCceEEEeCCEEEEEc
Q 045821          255 -VVVDDALYVLDQSSG-----------------------TKLMMWQKESREWSPVGRLSTLLTRPPCKLVAIGKTIFVIG  310 (352)
Q Consensus       255 -~~~~~~l~~~gg~~~-----------------------~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~~~i~v~g  310 (352)
                       -..++.+.+.||..-                       ++|+.+|  +++|+.++.+|.  .......+..+++||+||
T Consensus       281 ~G~s~~~~lv~GGAnF~Ga~~~y~~Gk~~AH~Gl~K~w~~~Vy~~d--~g~Wk~~GeLp~--~l~YG~s~~~nn~vl~IG  356 (381)
T COG3055         281 SGKSNGEVLVAGGANFPGALKAYKNGKFYAHEGLSKSWNSEVYIFD--NGSWKIVGELPQ--GLAYGVSLSYNNKVLLIG  356 (381)
T ss_pred             cceeCCeEEEecCCCChhHHHHHHhcccccccchhhhhhceEEEEc--CCceeeecccCC--CccceEEEecCCcEEEEc
Confidence             134677777776321                       4788888  899999999998  455566677899999999


Q ss_pred             CCc
Q 045821          311 KGC  313 (352)
Q Consensus       311 G~~  313 (352)
                      |..
T Consensus       357 GE~  359 (381)
T COG3055         357 GET  359 (381)
T ss_pred             ccc
Confidence            965


No 28 
>COG3055 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=99.69  E-value=3.6e-15  Score=127.21  Aligned_cols=207  Identities=23%  Similarity=0.357  Sum_probs=148.3

Q ss_pred             EeccCCCcccCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEECCC--CeeEecCCCC-CcceeeeeEEECCEEEEEeC
Q 045821           99 MIHELPARTLRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDASM--NTWTDAAPMS-TARCYFPCGVLNQKIYCIGG  175 (352)
Q Consensus        99 ~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t--~~W~~~~~~~-~~r~~~~~~~~~~~iyv~gG  175 (352)
                      .+|.+|   .+..+-+.+.+++.+||-=|..+     .+.+..|...  ..|+.++..| .+|.....++++++||++||
T Consensus        29 ~lPdlP---vg~KnG~Ga~ig~~~YVGLGs~G-----~afy~ldL~~~~k~W~~~a~FpG~~rnqa~~a~~~~kLyvFgG  100 (381)
T COG3055          29 QLPDLP---VGFKNGAGALIGDTVYVGLGSAG-----TAFYVLDLKKPGKGWTKIADFPGGARNQAVAAVIGGKLYVFGG  100 (381)
T ss_pred             cCCCCC---ccccccccceecceEEEEeccCC-----ccceehhhhcCCCCceEcccCCCcccccchheeeCCeEEEeec
Confidence            345555   34444467788889999866333     4567777763  5699999988 57888899999999999999


Q ss_pred             CCCC--------CceEEEeCCCCcEEEecCC-CCcccceeEEEECC-EEEEEccCCCCC---------------------
Q 045821          176 LGDT--------HSWDVYDPRTNNWKLHTEP-NIFTEIEDSFVMDG-KIYIRCSASAAT---------------------  224 (352)
Q Consensus       176 ~~~~--------~~v~~yd~~t~~W~~~~~~-~~~~~~~~~~~~~g-~iyv~GG~~~~~---------------------  224 (352)
                      ....        +++++|||.+++|.++... |..-..+.++.+++ +||++||.+..-                     
T Consensus       101 ~Gk~~~~~~~~~nd~Y~y~p~~nsW~kl~t~sP~gl~G~~~~~~~~~~i~f~GGvn~~if~~yf~dv~~a~~d~~~~~~i  180 (381)
T COG3055         101 YGKSVSSSPQVFNDAYRYDPSTNSWHKLDTRSPTGLVGASTFSLNGTKIYFFGGVNQNIFNGYFEDVGAAGKDKEAVDKI  180 (381)
T ss_pred             cccCCCCCceEeeeeEEecCCCChhheeccccccccccceeEecCCceEEEEccccHHhhhhhHHhhhhhcccHHHHHHH
Confidence            7643        8999999999999998753 33344456677777 999999977311                     


Q ss_pred             -------------CceEEEEEeCCCCceeecCCCCCCCCCCcEEEE-CCEEEEEecCCC-----CeEEEEeCC--CCceE
Q 045821          225 -------------SHVCALVYEPSTDSWLHADANMASGWRGPAVVV-DDALYVLDQSSG-----TKLMMWQKE--SREWS  283 (352)
Q Consensus       225 -------------~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~-~~~l~~~gg~~~-----~~v~~yd~~--~~~W~  283 (352)
                                   ....++.||+++++|+.+...+-...++++++. ++++.++.|+-.     ..+.+++..  ..+|.
T Consensus       181 ~~~yf~~~~~dy~~n~ev~sy~p~~n~W~~~G~~pf~~~aGsa~~~~~n~~~lInGEiKpGLRt~~~k~~~~~~~~~~w~  260 (381)
T COG3055         181 IAHYFDKKAEDYFFNKEVLSYDPSTNQWRNLGENPFYGNAGSAVVIKGNKLTLINGEIKPGLRTAEVKQADFGGDNLKWL  260 (381)
T ss_pred             HHHHhCCCHHHhcccccccccccccchhhhcCcCcccCccCcceeecCCeEEEEcceecCCccccceeEEEeccCceeee
Confidence                         112789999999999999876767777777654 666888887543     345666555  56799


Q ss_pred             EccCCCCCcCCCCceEE-----EeCCEEEEEcCCc
Q 045821          284 PVGRLSTLLTRPPCKLV-----AIGKTIFVIGKGC  313 (352)
Q Consensus       284 ~~~~~~~~~~~~~~~~~-----~~~~~i~v~gG~~  313 (352)
                      .++++|.+.....-.++     ..++++.+.||.+
T Consensus       261 ~l~~lp~~~~~~~eGvAGaf~G~s~~~~lv~GGAn  295 (381)
T COG3055         261 KLSDLPAPIGSNKEGVAGAFSGKSNGEVLVAGGAN  295 (381)
T ss_pred             eccCCCCCCCCCccccceeccceeCCeEEEecCCC
Confidence            99988865433322222     2467788888755


No 29 
>KOG2437 consensus Muskelin [Signal transduction mechanisms]
Probab=99.54  E-value=1.3e-14  Score=128.44  Aligned_cols=234  Identities=16%  Similarity=0.172  Sum_probs=155.0

Q ss_pred             CCCCeeEeccCC-------CcccCCcceEEEEECC--EEEEEeCCCCCCCCCCeEEEEECCCCeeEecCC---CCCccee
Q 045821           93 TRRSWKMIHELP-------ARTLRRKGMGFEVLGK--NAYLLGGCGWSEDATSEVYCYDASMNTWTDAAP---MSTARCY  160 (352)
Q Consensus        93 ~~~~w~~~~~~~-------~~~~~~~~~~~~~~~~--~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~---~~~~r~~  160 (352)
                      -+..|.++++..       ..++.|.+|..+...+  .||++||+++. +.+.++|+|+...+.|..+..   .|..|..
T Consensus       237 y~~~W~~i~~~~~~~~~~~~~p~~RgGHQMV~~~~~~CiYLYGGWdG~-~~l~DFW~Y~v~e~~W~~iN~~t~~PG~RsC  315 (723)
T KOG2437|consen  237 YKPRWSQIIPKSTKGDGEDNRPGMRGGHQMVIDVQTECVYLYGGWDGT-QDLADFWAYSVKENQWTCINRDTEGPGARSC  315 (723)
T ss_pred             ccccccccCchhhcccccccCccccCcceEEEeCCCcEEEEecCcccc-hhHHHHHhhcCCcceeEEeecCCCCCcchhh
Confidence            567788777654       3457788998888865  89999999987 778999999999999998753   5778999


Q ss_pred             eeeEEECC--EEEEEeCCCCC---------CceEEEeCCCCcEEEecC------CCCcccceeEEEECCE--EEEEccCC
Q 045821          161 FPCGVLNQ--KIYCIGGLGDT---------HSWDVYDPRTNNWKLHTE------PNIFTEIEDSFVMDGK--IYIRCSAS  221 (352)
Q Consensus       161 ~~~~~~~~--~iyv~gG~~~~---------~~v~~yd~~t~~W~~~~~------~~~~~~~~~~~~~~g~--iyv~GG~~  221 (352)
                      |.++..-.  ++|++|-+-..         +++|+||..++.|.-++.      -|.....|..++.+.+  |||+||..
T Consensus       316 HRMVid~S~~KLYLlG~Y~~sS~r~~~s~RsDfW~FDi~~~~W~~ls~dt~~dGGP~~vfDHqM~Vd~~k~~iyVfGGr~  395 (723)
T KOG2437|consen  316 HRMVIDISRRKLYLLGRYLDSSVRNSKSLRSDFWRFDIDTNTWMLLSEDTAADGGPKLVFDHQMCVDSEKHMIYVFGGRI  395 (723)
T ss_pred             hhhhhhhhHhHHhhhhhccccccccccccccceEEEecCCceeEEecccccccCCcceeecceeeEecCcceEEEecCee
Confidence            98887644  89999986432         899999999999998862      1223344566666655  99999976


Q ss_pred             CCCC---ceEEEEEeCCCCceeecCCCCCC----------CCCC-c-EEEECCEEEEEecCCC----CeEEEEeCCCCce
Q 045821          222 AATS---HVCALVYEPSTDSWLHADANMAS----------GWRG-P-AVVVDDALYVLDQSSG----TKLMMWQKESREW  282 (352)
Q Consensus       222 ~~~~---~~~i~~yd~~~~~W~~~~~~~~~----------~~~~-~-~~~~~~~l~~~gg~~~----~~v~~yd~~~~~W  282 (352)
                      -...   ...++.||.....|..+......          +..+ + .+..++.+|++||...    .-.+.||+..+.=
T Consensus       396 ~~~~e~~f~GLYaf~~~~~~w~~l~e~~~~~~~vvE~~~sR~ghcmE~~~~n~~ly~fggq~s~~El~L~f~y~I~~E~~  475 (723)
T KOG2437|consen  396 LTCNEPQFSGLYAFNCQCQTWKLLREDSCNAGPVVEDIQSRIGHCMEFHSKNRCLYVFGGQRSKTELNLFFSYDIDSEHV  475 (723)
T ss_pred             ccCCCccccceEEEecCCccHHHHHHHHhhcCcchhHHHHHHHHHHHhcCCCCeEEeccCcccceEEeehhcceeccccc
Confidence            4333   34899999999999877431111          0000 1 2345778999998665    2235565554443


Q ss_pred             EEccCC--------CCCcCCCCceEEEeCCEEEEEcC------------CceEEEEEcCCccccc
Q 045821          283 SPVGRL--------STLLTRPPCKLVAIGKTIFVIGK------------GCSAVVIDVGNIGNIG  327 (352)
Q Consensus       283 ~~~~~~--------~~~~~~~~~~~~~~~~~i~v~gG------------~~~~~~~d~~~~~w~~  327 (352)
                      ..+...        |.........+-.-...|.+.-|            +++++||++.++.|..
T Consensus       476 ~~~s~~~k~dsS~~pS~~f~qRs~~dp~~~~i~~~~G~~~~~~~~e~~~rns~wi~~i~~~~w~c  540 (723)
T KOG2437|consen  476 DIISDGTKKDSSMVPSTGFTQRATIDPELNEIHVLSGLSKDKEKREENVRNSFWIYDIVRNSWSC  540 (723)
T ss_pred             hhhhccCcCccccCCCcchhhhcccCCCCcchhhhcccchhccCccccccCcEEEEEecccchhh
Confidence            333211        11001111111122344554433            3368889999998863


No 30 
>PF13964 Kelch_6:  Kelch motif
Probab=99.28  E-value=1.4e-11  Score=77.67  Aligned_cols=50  Identities=36%  Similarity=0.605  Sum_probs=45.6

Q ss_pred             CCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcc
Q 045821          109 RRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTAR  158 (352)
Q Consensus       109 ~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r  158 (352)
                      +|.++++++++++||++||........+++++||+.+++|+.+++||.||
T Consensus         1 pR~~~s~v~~~~~iyv~GG~~~~~~~~~~v~~yd~~t~~W~~~~~mp~pR   50 (50)
T PF13964_consen    1 PRYGHSAVVVGGKIYVFGGYDNSGKYSNDVERYDPETNTWEQLPPMPTPR   50 (50)
T ss_pred             CCccCEEEEECCEEEEECCCCCCCCccccEEEEcCCCCcEEECCCCCCCC
Confidence            47889999999999999998775577899999999999999999999887


No 31 
>KOG2437 consensus Muskelin [Signal transduction mechanisms]
Probab=99.25  E-value=5.9e-12  Score=111.83  Aligned_cols=239  Identities=16%  Similarity=0.174  Sum_probs=156.9

Q ss_pred             hhhcccccCh-------hhHHHHhhCCCC----CCEEEEEEecCCCCce-eEEEecCCCCCCCeeEeccCCCcccCCcce
Q 045821           46 SRWRGLLHSE-------EWCAYRRKHNLD----ETWIYAFCRDNKLERV-CCYVLDPNSTRRSWKMIHELPARTLRRKGM  113 (352)
Q Consensus        46 k~W~~l~~s~-------~~~~~~~~~~~~----~~~l~~~gg~~~~~~~-~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~  113 (352)
                      -+|..++...       ..+..|..|.+.    ++-||+.||.+..... .+|.|+-  ..+.|..+..-...|..|.+|
T Consensus       239 ~~W~~i~~~~~~~~~~~~~p~~RgGHQMV~~~~~~CiYLYGGWdG~~~l~DFW~Y~v--~e~~W~~iN~~t~~PG~RsCH  316 (723)
T KOG2437|consen  239 PRWSQIIPKSTKGDGEDNRPGMRGGHQMVIDVQTECVYLYGGWDGTQDLADFWAYSV--KENQWTCINRDTEGPGARSCH  316 (723)
T ss_pred             ccccccCchhhcccccccCccccCcceEEEeCCCcEEEEecCcccchhHHHHHhhcC--CcceeEEeecCCCCCcchhhh
Confidence            3577665433       234456666655    5689999999766654 5889988  889999886544334778888


Q ss_pred             EEEEECC--EEEEEeCCCCC-----CCCCCeEEEEECCCCeeEecCC------CCCcceeeeeEEECCE--EEEEeCCCC
Q 045821          114 GFEVLGK--NAYLLGGCGWS-----EDATSEVYCYDASMNTWTDAAP------MSTARCYFPCGVLNQK--IYCIGGLGD  178 (352)
Q Consensus       114 ~~~~~~~--~iyv~GG~~~~-----~~~~~~~~~~d~~t~~W~~~~~------~~~~r~~~~~~~~~~~--iyv~gG~~~  178 (352)
                      ..+....  ++|+.|-+-+.     ....+++|.||..++.|..+.-      -|...+.|.+++.+.+  |||+||+.-
T Consensus       317 RMVid~S~~KLYLlG~Y~~sS~r~~~s~RsDfW~FDi~~~~W~~ls~dt~~dGGP~~vfDHqM~Vd~~k~~iyVfGGr~~  396 (723)
T KOG2437|consen  317 RMVIDISRRKLYLLGRYLDSSVRNSKSLRSDFWRFDIDTNTWMLLSEDTAADGGPKLVFDHQMCVDSEKHMIYVFGGRIL  396 (723)
T ss_pred             hhhhhhhHhHHhhhhhccccccccccccccceEEEecCCceeEEecccccccCCcceeecceeeEecCcceEEEecCeec
Confidence            8887765  89999976432     2456799999999999998852      2344667888888766  999999753


Q ss_pred             C------CceEEEeCCCCcEEEecCC----------CCcccce--eEEEECCEEEEEccCCCCCCceEEEEEeCCCCcee
Q 045821          179 T------HSWDVYDPRTNNWKLHTEP----------NIFTEIE--DSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWL  240 (352)
Q Consensus       179 ~------~~v~~yd~~t~~W~~~~~~----------~~~~~~~--~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~  240 (352)
                      .      ...+.||.....|..+..-          ...+-++  -++.-+..+|++||.......+-...||+....=.
T Consensus       397 ~~~e~~f~GLYaf~~~~~~w~~l~e~~~~~~~vvE~~~sR~ghcmE~~~~n~~ly~fggq~s~~El~L~f~y~I~~E~~~  476 (723)
T KOG2437|consen  397 TCNEPQFSGLYAFNCQCQTWKLLREDSCNAGPVVEDIQSRIGHCMEFHSKNRCLYVFGGQRSKTELNLFFSYDIDSEHVD  476 (723)
T ss_pred             cCCCccccceEEEecCCccHHHHHHHHhhcCcchhHHHHHHHHHHHhcCCCCeEEeccCcccceEEeehhcceeccccch
Confidence            3      7889999999999876521          0111111  13445778999988776666666677766444332


Q ss_pred             ecCC-------CCCCCCCC-cEE--EECCEEEEEecCCC----------CeEEEEeCCCCceEEcc
Q 045821          241 HADA-------NMASGWRG-PAV--VVDDALYVLDQSSG----------TKLMMWQKESREWSPVG  286 (352)
Q Consensus       241 ~~~~-------~~~~~~~~-~~~--~~~~~l~~~gg~~~----------~~v~~yd~~~~~W~~~~  286 (352)
                      .+..       ..+.++.. -++  .-...|...-|...          +..|+|+..++.|..+.
T Consensus       477 ~~s~~~k~dsS~~pS~~f~qRs~~dp~~~~i~~~~G~~~~~~~~e~~~rns~wi~~i~~~~w~cI~  542 (723)
T KOG2437|consen  477 IISDGTKKDSSMVPSTGFTQRATIDPELNEIHVLSGLSKDKEKREENVRNSFWIYDIVRNSWSCIY  542 (723)
T ss_pred             hhhccCcCccccCCCcchhhhcccCCCCcchhhhcccchhccCccccccCcEEEEEecccchhhHh
Confidence            2211       01111110 011  12345555555322          57899999999998774


No 32 
>PF13964 Kelch_6:  Kelch motif
Probab=99.15  E-value=1.1e-10  Score=73.51  Aligned_cols=46  Identities=28%  Similarity=0.463  Sum_probs=41.7

Q ss_pred             cceeeeeEEECCEEEEEeCCCC-C---CceEEEeCCCCcEEEecCCCCcc
Q 045821          157 ARCYFPCGVLNQKIYCIGGLGD-T---HSWDVYDPRTNNWKLHTEPNIFT  202 (352)
Q Consensus       157 ~r~~~~~~~~~~~iyv~gG~~~-~---~~v~~yd~~t~~W~~~~~~~~~~  202 (352)
                      ||..|+++.++++|||+||... .   +++++||++|++|+++++||.+|
T Consensus         1 pR~~~s~v~~~~~iyv~GG~~~~~~~~~~v~~yd~~t~~W~~~~~mp~pR   50 (50)
T PF13964_consen    1 PRYGHSAVVVGGKIYVFGGYDNSGKYSNDVERYDPETNTWEQLPPMPTPR   50 (50)
T ss_pred             CCccCEEEEECCEEEEECCCCCCCCccccEEEEcCCCCcEEECCCCCCCC
Confidence            6889999999999999999877 2   99999999999999999888654


No 33 
>TIGR01640 F_box_assoc_1 F-box protein interaction domain. This model describes a large family of plant domains, with several hundred members in Arabidopsis thaliana. Most examples are found C-terminal to an F-box (pfam00646), a 60 amino acid motif involved in ubiquitination of target proteins to mark them for degradation. Two-hybid experiments support the idea that most members are interchangeable F-box subunits of SCF E3 complexes. Some members have two copies of this domain.
Probab=99.08  E-value=8.4e-08  Score=81.68  Aligned_cols=188  Identities=12%  Similarity=0.109  Sum_probs=113.5

Q ss_pred             CeEEEEECCCCeeEecCCCCCcce---e-eeeEEEC-----CEEEEEeCCC---CCCceEEEeCCCCcEEEecCCCCcc-
Q 045821          136 SEVYCYDASMNTWTDAAPMSTARC---Y-FPCGVLN-----QKIYCIGGLG---DTHSWDVYDPRTNNWKLHTEPNIFT-  202 (352)
Q Consensus       136 ~~~~~~d~~t~~W~~~~~~~~~r~---~-~~~~~~~-----~~iyv~gG~~---~~~~v~~yd~~t~~W~~~~~~~~~~-  202 (352)
                      ..+.++||.|++|..+|+.+.++.   . .....++     -+|..+....   ....+++|+..+++|+.+...+... 
T Consensus        14 ~~~~V~NP~T~~~~~LP~~~~~~~~~~~~~~~~G~d~~~~~YKVv~~~~~~~~~~~~~~~Vys~~~~~Wr~~~~~~~~~~   93 (230)
T TIGR01640        14 KRLVVWNPSTGQSRWLPTPKSRRSNKESDTYFLGYDPIEKQYKVLCFSDRSGNRNQSEHQVYTLGSNSWRTIECSPPHHP   93 (230)
T ss_pred             CcEEEECCCCCCEEecCCCCCcccccccceEEEeecccCCcEEEEEEEeecCCCCCccEEEEEeCCCCccccccCCCCcc
Confidence            568999999999999986543211   1 0111122     2555554321   1168899999999999987433221 


Q ss_pred             cceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCceee-cCCCCCCCC---CCcEEEECCEEEEEecCCC---CeEEEE
Q 045821          203 EIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWLH-ADANMASGW---RGPAVVVDDALYVLDQSSG---TKLMMW  275 (352)
Q Consensus       203 ~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~-~~~~~~~~~---~~~~~~~~~~l~~~gg~~~---~~v~~y  275 (352)
                      .....+.++|.||-+...........|.+||..+++|.+ ++.+.....   ....+..+|+|.++.....   -+||+.
T Consensus        94 ~~~~~v~~~G~lyw~~~~~~~~~~~~IvsFDl~~E~f~~~i~~P~~~~~~~~~~~L~~~~G~L~~v~~~~~~~~~~IWvl  173 (230)
T TIGR01640        94 LKSRGVCINGVLYYLAYTLKTNPDYFIVSFDVSSERFKEFIPLPCGNSDSVDYLSLINYKGKLAVLKQKKDTNNFDLWVL  173 (230)
T ss_pred             ccCCeEEECCEEEEEEEECCCCCcEEEEEEEcccceEeeeeecCccccccccceEEEEECCEEEEEEecCCCCcEEEEEE
Confidence            122367889999998643321222379999999999995 654332211   1223567899998865432   256666


Q ss_pred             e-CCCCceEEccCCCCC-c--CCC--CceEEEeCCEEEEEcCC--ce-EEEEEcCCc
Q 045821          276 Q-KESREWSPVGRLSTL-L--TRP--PCKLVAIGKTIFVIGKG--CS-AVVIDVGNI  323 (352)
Q Consensus       276 d-~~~~~W~~~~~~~~~-~--~~~--~~~~~~~~~~i~v~gG~--~~-~~~~d~~~~  323 (352)
                      + .+.+.|+++-.++.. .  ...  ....+..+++|++....  .. ..+||++++
T Consensus       174 ~d~~~~~W~k~~~i~~~~~~~~~~~~~~~~~~~~g~I~~~~~~~~~~~~~~y~~~~~  230 (230)
T TIGR01640       174 NDAGKQEWSKLFTVPIPPLPDLVDDNFLSGFTDKGEIVLCCEDENPFYIFYYNVGEN  230 (230)
T ss_pred             CCCCCCceeEEEEEcCcchhhhhhheeEeEEeeCCEEEEEeCCCCceEEEEEeccCC
Confidence            4 446679976544421 1  111  12345567888887653  23 667999874


No 34 
>PF13415 Kelch_3:  Galactose oxidase, central domain
Probab=99.02  E-value=8.4e-10  Score=69.12  Aligned_cols=48  Identities=25%  Similarity=0.523  Sum_probs=42.9

Q ss_pred             CCEEEEEeCCC-CCCCCCCeEEEEECCCCeeEecCCCCCcceeeeeEEE
Q 045821          119 GKNAYLLGGCG-WSEDATSEVYCYDASMNTWTDAAPMSTARCYFPCGVL  166 (352)
Q Consensus       119 ~~~iyv~GG~~-~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~  166 (352)
                      ++++||+||.+ ......+++|+||+.+++|+.++++|.||..|+++++
T Consensus         1 g~~~~vfGG~~~~~~~~~nd~~~~~~~~~~W~~~~~~P~~R~~h~~~~i   49 (49)
T PF13415_consen    1 GNKLYVFGGYDDDGGTRLNDVWVFDLDTNTWTRIGDLPPPRSGHTATVI   49 (49)
T ss_pred             CCEEEEECCcCCCCCCEecCEEEEECCCCEEEECCCCCCCccceEEEEC
Confidence            57899999988 4457789999999999999999999999999998864


No 35 
>PF01344 Kelch_1:  Kelch motif;  InterPro: IPR006652 Kelch is a 50-residue motif, named after the Drosophila mutant in which it was first identified []. This sequence motif represents one beta-sheet blade, and several of these repeats can associate to form a beta-propeller. For instance, the motif appears 6 times in Drosophila egg-chamber regulatory protein, creating a 6-bladed beta-propeller. The motif is also found in mouse protein MIPP [] and in a number of poxviruses. In addition, kelch repeats have been recognised in alpha- and beta-scruin [, ], and in galactose oxidase from the fungus Dactylium dendroides [, ]. The structure of galactose oxidase reveals that the repeated sequence corresponds to a 4-stranded anti-parallel beta-sheet motif that forms the repeat unit in a super-barrel structural fold []. The known functions of kelch-containing proteins are diverse: scruin is an actin cross-linking protein; galactose oxidase catalyses the oxidation of the hydroxyl group at the C6 position in D-galactose; neuraminidase hydrolyses sialic acid residues from glycoproteins; and kelch may have a cytoskeletal function, as it is localised to the actin-rich ring canals that connect the 15 nurse cells to the developing oocyte in Drosophila []. Nevertheless, based on the location of the kelch pattern in the catalytic unit in galactose oxidase, functionally important residues have been predicted in glyoxal oxidase []. This entry represents a type of kelch sequence motif that comprises one beta-sheet blade.; GO: 0005515 protein binding; PDB: 2XN4_A 2WOZ_A 3II7_A 4ASC_A 1U6D_X 1ZGK_A 2FLU_X 2VPJ_A 2DYH_A 1X2R_A ....
Probab=99.00  E-value=4.8e-10  Score=69.71  Aligned_cols=47  Identities=34%  Similarity=0.663  Sum_probs=42.6

Q ss_pred             CCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCC
Q 045821          109 RRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMS  155 (352)
Q Consensus       109 ~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~  155 (352)
                      +|.++++++++++||++||.+......+++++||+.+++|+.+++||
T Consensus         1 pR~~~~~~~~~~~iyv~GG~~~~~~~~~~v~~yd~~~~~W~~~~~mp   47 (47)
T PF01344_consen    1 PRSGHAAVVVGNKIYVIGGYDGNNQPTNSVEVYDPETNTWEELPPMP   47 (47)
T ss_dssp             -BBSEEEEEETTEEEEEEEBESTSSBEEEEEEEETTTTEEEEEEEES
T ss_pred             CCccCEEEEECCEEEEEeeecccCceeeeEEEEeCCCCEEEEcCCCC
Confidence            47889999999999999999886688999999999999999999876


No 36 
>PF01344 Kelch_1:  Kelch motif;  InterPro: IPR006652 Kelch is a 50-residue motif, named after the Drosophila mutant in which it was first identified []. This sequence motif represents one beta-sheet blade, and several of these repeats can associate to form a beta-propeller. For instance, the motif appears 6 times in Drosophila egg-chamber regulatory protein, creating a 6-bladed beta-propeller. The motif is also found in mouse protein MIPP [] and in a number of poxviruses. In addition, kelch repeats have been recognised in alpha- and beta-scruin [, ], and in galactose oxidase from the fungus Dactylium dendroides [, ]. The structure of galactose oxidase reveals that the repeated sequence corresponds to a 4-stranded anti-parallel beta-sheet motif that forms the repeat unit in a super-barrel structural fold []. The known functions of kelch-containing proteins are diverse: scruin is an actin cross-linking protein; galactose oxidase catalyses the oxidation of the hydroxyl group at the C6 position in D-galactose; neuraminidase hydrolyses sialic acid residues from glycoproteins; and kelch may have a cytoskeletal function, as it is localised to the actin-rich ring canals that connect the 15 nurse cells to the developing oocyte in Drosophila []. Nevertheless, based on the location of the kelch pattern in the catalytic unit in galactose oxidase, functionally important residues have been predicted in glyoxal oxidase []. This entry represents a type of kelch sequence motif that comprises one beta-sheet blade.; GO: 0005515 protein binding; PDB: 2XN4_A 2WOZ_A 3II7_A 4ASC_A 1U6D_X 1ZGK_A 2FLU_X 2VPJ_A 2DYH_A 1X2R_A ....
Probab=98.96  E-value=7.4e-10  Score=68.84  Aligned_cols=43  Identities=37%  Similarity=0.612  Sum_probs=38.8

Q ss_pred             cceeeeeEEECCEEEEEeCCCCC----CceEEEeCCCCcEEEecCCC
Q 045821          157 ARCYFPCGVLNQKIYCIGGLGDT----HSWDVYDPRTNNWKLHTEPN  199 (352)
Q Consensus       157 ~r~~~~~~~~~~~iyv~gG~~~~----~~v~~yd~~t~~W~~~~~~~  199 (352)
                      ||..|++++++++||++||....    +++++||+.+++|+.+++||
T Consensus         1 pR~~~~~~~~~~~iyv~GG~~~~~~~~~~v~~yd~~~~~W~~~~~mp   47 (47)
T PF01344_consen    1 PRSGHAAVVVGNKIYVIGGYDGNNQPTNSVEVYDPETNTWEELPPMP   47 (47)
T ss_dssp             -BBSEEEEEETTEEEEEEEBESTSSBEEEEEEEETTTTEEEEEEEES
T ss_pred             CCccCEEEEECCEEEEEeeecccCceeeeEEEEeCCCCEEEEcCCCC
Confidence            68899999999999999998873    99999999999999998765


No 37 
>PF07646 Kelch_2:  Kelch motif;  InterPro: IPR011498 Kelch is a 50-residue motif, named after the Drosophila mutant in which it was first identified []. This sequence motif represents one beta-sheet blade, and several of these repeats can associate to form a beta-propeller. For instance, the motif appears 6 times in Drosophila egg-chamber regulatory protein, creating a 6-bladed beta-propeller. The motif is also found in mouse protein MIPP [] and in a number of poxviruses. In addition, kelch repeats have been recognised in alpha- and beta-scruin [, ], and in galactose oxidase from the fungus Dactylium dendroides [, ]. The structure of galactose oxidase reveals that the repeated sequence corresponds to a 4-stranded anti-parallel beta-sheet motif that forms the repeat unit in a super-barrel structural fold []. The known functions of kelch-containing proteins are diverse: scruin is an actin cross-linking protein; galactose oxidase catalyses the oxidation of the hydroxyl group at the C6 position in D-galactose; neuraminidase hydrolyses sialic acid residues from glycoproteins; and kelch may have a cytoskeletal function, as it is localised to the actin-rich ring canals that connect the 15 nurse cells to the developing oocyte in Drosophila []. Nevertheless, based on the location of the kelch pattern in the catalytic unit in galactose oxidase, functionally important residues have been predicted in glyoxal oxidase []. This entry represents a type of kelch sequence motif that comprises one beta-sheet blade.; GO: 0005515 protein binding
Probab=98.85  E-value=1.3e-08  Score=63.73  Aligned_cols=47  Identities=30%  Similarity=0.636  Sum_probs=41.1

Q ss_pred             CCcceEEEEECCEEEEEeCC--CCCCCCCCeEEEEECCCCeeEecCCCC
Q 045821          109 RRKGMGFEVLGKNAYLLGGC--GWSEDATSEVYCYDASMNTWTDAAPMS  155 (352)
Q Consensus       109 ~~~~~~~~~~~~~iyv~GG~--~~~~~~~~~~~~~d~~t~~W~~~~~~~  155 (352)
                      +|.+|++++++++||++||.  .......+++++||+.+++|+.+++++
T Consensus         1 ~r~~hs~~~~~~kiyv~GG~~~~~~~~~~~~v~~~d~~t~~W~~~~~~g   49 (49)
T PF07646_consen    1 PRYGHSAVVLDGKIYVFGGYGTDNGGSSSNDVWVFDTETNQWTELSPMG   49 (49)
T ss_pred             CccceEEEEECCEEEEECCcccCCCCcccceeEEEECCCCEEeecCCCC
Confidence            47889999999999999999  444467899999999999999998875


No 38 
>PF13418 Kelch_4:  Galactose oxidase, central domain; PDB: 2UVK_B.
Probab=98.83  E-value=3.6e-09  Score=66.32  Aligned_cols=47  Identities=26%  Similarity=0.499  Sum_probs=31.5

Q ss_pred             CCcceEEEEE-CCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCC
Q 045821          109 RRKGMGFEVL-GKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMS  155 (352)
Q Consensus       109 ~~~~~~~~~~-~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~  155 (352)
                      +|.+|+++.+ ++.|||+||.+......+++|+||+.+++|+.++++|
T Consensus         1 pR~~h~~~~~~~~~i~v~GG~~~~~~~~~d~~~~d~~~~~W~~~~~~P   48 (49)
T PF13418_consen    1 PRYGHSAVSIGDNSIYVFGGRDSSGSPLNDLWIFDIETNTWTRLPSMP   48 (49)
T ss_dssp             --BS-EEEEE-TTEEEEE--EEE-TEE---EEEEETTTTEEEE--SS-
T ss_pred             CcceEEEEEEeCCeEEEECCCCCCCcccCCEEEEECCCCEEEECCCCC
Confidence            4788999988 5899999998876568899999999999999998876


No 39 
>PF07646 Kelch_2:  Kelch motif;  InterPro: IPR011498 Kelch is a 50-residue motif, named after the Drosophila mutant in which it was first identified []. This sequence motif represents one beta-sheet blade, and several of these repeats can associate to form a beta-propeller. For instance, the motif appears 6 times in Drosophila egg-chamber regulatory protein, creating a 6-bladed beta-propeller. The motif is also found in mouse protein MIPP [] and in a number of poxviruses. In addition, kelch repeats have been recognised in alpha- and beta-scruin [, ], and in galactose oxidase from the fungus Dactylium dendroides [, ]. The structure of galactose oxidase reveals that the repeated sequence corresponds to a 4-stranded anti-parallel beta-sheet motif that forms the repeat unit in a super-barrel structural fold []. The known functions of kelch-containing proteins are diverse: scruin is an actin cross-linking protein; galactose oxidase catalyses the oxidation of the hydroxyl group at the C6 position in D-galactose; neuraminidase hydrolyses sialic acid residues from glycoproteins; and kelch may have a cytoskeletal function, as it is localised to the actin-rich ring canals that connect the 15 nurse cells to the developing oocyte in Drosophila []. Nevertheless, based on the location of the kelch pattern in the catalytic unit in galactose oxidase, functionally important residues have been predicted in glyoxal oxidase []. This entry represents a type of kelch sequence motif that comprises one beta-sheet blade.; GO: 0005515 protein binding
Probab=98.81  E-value=1.2e-08  Score=63.76  Aligned_cols=43  Identities=30%  Similarity=0.468  Sum_probs=37.7

Q ss_pred             cceeeeeEEECCEEEEEeCC--CCC----CceEEEeCCCCcEEEecCCC
Q 045821          157 ARCYFPCGVLNQKIYCIGGL--GDT----HSWDVYDPRTNNWKLHTEPN  199 (352)
Q Consensus       157 ~r~~~~~~~~~~~iyv~gG~--~~~----~~v~~yd~~t~~W~~~~~~~  199 (352)
                      ||..|++++++++|||+||+  ...    +++++||+++++|+.+++++
T Consensus         1 ~r~~hs~~~~~~kiyv~GG~~~~~~~~~~~~v~~~d~~t~~W~~~~~~g   49 (49)
T PF07646_consen    1 PRYGHSAVVLDGKIYVFGGYGTDNGGSSSNDVWVFDTETNQWTELSPMG   49 (49)
T ss_pred             CccceEEEEECCEEEEECCcccCCCCcccceeEEEECCCCEEeecCCCC
Confidence            68899999999999999999  222    99999999999999998654


No 40 
>TIGR01640 F_box_assoc_1 F-box protein interaction domain. This model describes a large family of plant domains, with several hundred members in Arabidopsis thaliana. Most examples are found C-terminal to an F-box (pfam00646), a 60 amino acid motif involved in ubiquitination of target proteins to mark them for degradation. Two-hybid experiments support the idea that most members are interchangeable F-box subunits of SCF E3 complexes. Some members have two copies of this domain.
Probab=98.80  E-value=5.7e-06  Score=70.39  Aligned_cols=194  Identities=9%  Similarity=0.057  Sum_probs=114.4

Q ss_pred             eeEEEecCCCCCCCeeEeccCCCcccCCcc--eEEEE--E-CC-EEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCC
Q 045821           83 VCCYVLDPNSTRRSWKMIHELPARTLRRKG--MGFEV--L-GK-NAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMST  156 (352)
Q Consensus        83 ~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~--~~~~~--~-~~-~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~  156 (352)
                      ...+++||  .|+.|..+|+.+........  .+...  . +. +|..+...... .....+++|+..+++|+.+...+.
T Consensus        14 ~~~~V~NP--~T~~~~~LP~~~~~~~~~~~~~~~~G~d~~~~~YKVv~~~~~~~~-~~~~~~~Vys~~~~~Wr~~~~~~~   90 (230)
T TIGR01640        14 KRLVVWNP--STGQSRWLPTPKSRRSNKESDTYFLGYDPIEKQYKVLCFSDRSGN-RNQSEHQVYTLGSNSWRTIECSPP   90 (230)
T ss_pred             CcEEEECC--CCCCEEecCCCCCcccccccceEEEeecccCCcEEEEEEEeecCC-CCCccEEEEEeCCCCccccccCCC
Confidence            35778899  99999999865421010111  11111  1 11 44444332111 234578999999999999874332


Q ss_pred             c-ceeeeeEEECCEEEEEeCCCC-C--CceEEEeCCCCcEEE-ecCCCCcc---cceeEEEECCEEEEEccCCCCCCceE
Q 045821          157 A-RCYFPCGVLNQKIYCIGGLGD-T--HSWDVYDPRTNNWKL-HTEPNIFT---EIEDSFVMDGKIYIRCSASAATSHVC  228 (352)
Q Consensus       157 ~-r~~~~~~~~~~~iyv~gG~~~-~--~~v~~yd~~t~~W~~-~~~~~~~~---~~~~~~~~~g~iyv~GG~~~~~~~~~  228 (352)
                      . ......+.++|.+|.+..... .  ..+..||..+++|+. ++.|....   .....+.++|+|.++.... ......
T Consensus        91 ~~~~~~~~v~~~G~lyw~~~~~~~~~~~~IvsFDl~~E~f~~~i~~P~~~~~~~~~~~L~~~~G~L~~v~~~~-~~~~~~  169 (230)
T TIGR01640        91 HHPLKSRGVCINGVLYYLAYTLKTNPDYFIVSFDVSSERFKEFIPLPCGNSDSVDYLSLINYKGKLAVLKQKK-DTNNFD  169 (230)
T ss_pred             CccccCCeEEECCEEEEEEEECCCCCcEEEEEEEcccceEeeeeecCccccccccceEEEEECCEEEEEEecC-CCCcEE
Confidence            2 112236778999999875432 1  268999999999995 66433221   1234677799999885432 223357


Q ss_pred             EEEEe-CCCCceeec---CCCCCCCCC----CcEEEECCEEEEEecCC-CCeEEEEeCCCC
Q 045821          229 ALVYE-PSTDSWLHA---DANMASGWR----GPAVVVDDALYVLDQSS-GTKLMMWQKESR  280 (352)
Q Consensus       229 i~~yd-~~~~~W~~~---~~~~~~~~~----~~~~~~~~~l~~~gg~~-~~~v~~yd~~~~  280 (352)
                      ||+.+ .....|+++   +.+......    ...+..+++|++..... +..+..||++++
T Consensus       170 IWvl~d~~~~~W~k~~~i~~~~~~~~~~~~~~~~~~~~g~I~~~~~~~~~~~~~~y~~~~~  230 (230)
T TIGR01640       170 LWVLNDAGKQEWSKLFTVPIPPLPDLVDDNFLSGFTDKGEIVLCCEDENPFYIFYYNVGEN  230 (230)
T ss_pred             EEEECCCCCCceeEEEEEcCcchhhhhhheeEeEEeeCCEEEEEeCCCCceEEEEEeccCC
Confidence            88875 345679864   211111111    12356688888876532 224999999875


No 41 
>smart00612 Kelch Kelch domain.
Probab=98.74  E-value=1.9e-08  Score=62.32  Aligned_cols=47  Identities=30%  Similarity=0.485  Sum_probs=41.1

Q ss_pred             EEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcceeeeeEEECC
Q 045821          121 NAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCYFPCGVLNQ  168 (352)
Q Consensus       121 ~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~~~  168 (352)
                      +||++||.... ...+++++||+.+++|+.+++|+.+|..++++.+++
T Consensus         1 ~iyv~GG~~~~-~~~~~v~~yd~~~~~W~~~~~~~~~r~~~~~~~~~g   47 (47)
T smart00612        1 KIYVVGGFDGG-QRLKSVEVYDPETNKWTPLPSMPTPRSGHGVAVING   47 (47)
T ss_pred             CEEEEeCCCCC-ceeeeEEEECCCCCeEccCCCCCCccccceEEEeCC
Confidence            48999998653 567899999999999999999999999999888764


No 42 
>PF13418 Kelch_4:  Galactose oxidase, central domain; PDB: 2UVK_B.
Probab=98.74  E-value=1.3e-08  Score=63.80  Aligned_cols=43  Identities=21%  Similarity=0.383  Sum_probs=29.3

Q ss_pred             cceeeeeEEE-CCEEEEEeCCCCC----CceEEEeCCCCcEEEecCCC
Q 045821          157 ARCYFPCGVL-NQKIYCIGGLGDT----HSWDVYDPRTNNWKLHTEPN  199 (352)
Q Consensus       157 ~r~~~~~~~~-~~~iyv~gG~~~~----~~v~~yd~~t~~W~~~~~~~  199 (352)
                      ||..|+++.+ +++||++||....    +++++||+++++|++++++|
T Consensus         1 pR~~h~~~~~~~~~i~v~GG~~~~~~~~~d~~~~d~~~~~W~~~~~~P   48 (49)
T PF13418_consen    1 PRYGHSAVSIGDNSIYVFGGRDSSGSPLNDLWIFDIETNTWTRLPSMP   48 (49)
T ss_dssp             --BS-EEEEE-TTEEEEE--EEE-TEE---EEEEETTTTEEEE--SS-
T ss_pred             CcceEEEEEEeCCeEEEECCCCCCCcccCCEEEEECCCCEEEECCCCC
Confidence            6899999998 5899999998764    89999999999999997665


No 43 
>PF07250 Glyoxal_oxid_N:  Glyoxal oxidase N-terminus;  InterPro: IPR009880 This entry represents the N terminus (approximately 300 residues) of a number of plant and fungal glyoxal oxidase enzymes. Glyoxal oxidase catalyses the oxidation of aldehydes to carboxylic acids, coupled with reduction of dioxygen to hydrogen peroxide. It is an essential component of the extracellular lignin degradation pathways of the wood-rot fungus Phanerochaete chrysosporium [].
Probab=98.66  E-value=3.5e-06  Score=70.98  Aligned_cols=149  Identities=12%  Similarity=0.195  Sum_probs=97.2

Q ss_pred             CeEEEEECCCCeeEecCCCCCcceeeeeEEECCEEEEEeCCCCC-CceEEEeCCC----CcEEEec-CCCCcccceeEEE
Q 045821          136 SEVYCYDASMNTWTDAAPMSTARCYFPCGVLNQKIYCIGGLGDT-HSWDVYDPRT----NNWKLHT-EPNIFTEIEDSFV  209 (352)
Q Consensus       136 ~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~-~~v~~yd~~t----~~W~~~~-~~~~~~~~~~~~~  209 (352)
                      .....||+.+++++.+.-....-+...+..-||++.+.||.... ..+-.|++.+    ..|.+.+ .|...|.+..+..
T Consensus        46 a~s~~yD~~tn~~rpl~v~td~FCSgg~~L~dG~ll~tGG~~~G~~~ir~~~p~~~~~~~~w~e~~~~m~~~RWYpT~~~  125 (243)
T PF07250_consen   46 AHSVEYDPNTNTFRPLTVQTDTFCSGGAFLPDGRLLQTGGDNDGNKAIRIFTPCTSDGTCDWTESPNDMQSGRWYPTATT  125 (243)
T ss_pred             EEEEEEecCCCcEEeccCCCCCcccCcCCCCCCCEEEeCCCCccccceEEEecCCCCCCCCceECcccccCCCccccceE
Confidence            34567999999999886432222222222238999999998665 7788898876    5798776 4777888887776


Q ss_pred             E-CCEEEEEccCCCCCCceEEEEEeCCC-----CceeecCC---CCCCCCCCcE-EEECCEEEEEecCCCCeEEEEeCCC
Q 045821          210 M-DGKIYIRCSASAATSHVCALVYEPST-----DSWLHADA---NMASGWRGPA-VVVDDALYVLDQSSGTKLMMWQKES  279 (352)
Q Consensus       210 ~-~g~iyv~GG~~~~~~~~~i~~yd~~~-----~~W~~~~~---~~~~~~~~~~-~~~~~~l~~~gg~~~~~v~~yd~~~  279 (352)
                      + ||+++|+||....    ..+.+....     ..|..+..   ..+...+... +.-+|+||+++..   .-.+||+.+
T Consensus       126 L~DG~vlIvGG~~~~----t~E~~P~~~~~~~~~~~~~l~~~~~~~~~nlYP~~~llPdG~lFi~an~---~s~i~d~~~  198 (243)
T PF07250_consen  126 LPDGRVLIVGGSNNP----TYEFWPPKGPGPGPVTLPFLSQTSDTLPNNLYPFVHLLPDGNLFIFANR---GSIIYDYKT  198 (243)
T ss_pred             CCCCCEEEEeCcCCC----cccccCCccCCCCceeeecchhhhccCccccCceEEEcCCCCEEEEEcC---CcEEEeCCC
Confidence            6 8999999987721    223333211     11222211   1112222222 4569999999986   678889999


Q ss_pred             Cce-EEccCCCCC
Q 045821          280 REW-SPVGRLSTL  291 (352)
Q Consensus       280 ~~W-~~~~~~~~~  291 (352)
                      +++ .+++.+|..
T Consensus       199 n~v~~~lP~lPg~  211 (243)
T PF07250_consen  199 NTVVRTLPDLPGG  211 (243)
T ss_pred             CeEEeeCCCCCCC
Confidence            987 788888864


No 44 
>PF13415 Kelch_3:  Galactose oxidase, central domain
Probab=98.60  E-value=1.4e-07  Score=58.94  Aligned_cols=43  Identities=23%  Similarity=0.309  Sum_probs=37.8

Q ss_pred             CCEEEEEeCCC--CC---CceEEEeCCCCcEEEecCCCCcccceeEEE
Q 045821          167 NQKIYCIGGLG--DT---HSWDVYDPRTNNWKLHTEPNIFTEIEDSFV  209 (352)
Q Consensus       167 ~~~iyv~gG~~--~~---~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~  209 (352)
                      +++|||+||..  ..   +++++||+.+++|++++++|.+|..|++++
T Consensus         1 g~~~~vfGG~~~~~~~~~nd~~~~~~~~~~W~~~~~~P~~R~~h~~~~   48 (49)
T PF13415_consen    1 GNKLYVFGGYDDDGGTRLNDVWVFDLDTNTWTRIGDLPPPRSGHTATV   48 (49)
T ss_pred             CCEEEEECCcCCCCCCEecCEEEEECCCCEEEECCCCCCCccceEEEE
Confidence            57899999988  22   999999999999999998999888887765


No 45 
>smart00612 Kelch Kelch domain.
Probab=98.54  E-value=1.5e-07  Score=58.19  Aligned_cols=44  Identities=34%  Similarity=0.579  Sum_probs=38.0

Q ss_pred             EEEEEeCCCCC---CceEEEeCCCCcEEEecCCCCcccceeEEEECC
Q 045821          169 KIYCIGGLGDT---HSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDG  212 (352)
Q Consensus       169 ~iyv~gG~~~~---~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g  212 (352)
                      +||++||....   +++++||+.+++|+.+++|+.++..+++++++|
T Consensus         1 ~iyv~GG~~~~~~~~~v~~yd~~~~~W~~~~~~~~~r~~~~~~~~~g   47 (47)
T smart00612        1 KIYVVGGFDGGQRLKSVEVYDPETNKWTPLPSMPTPRSGHGVAVING   47 (47)
T ss_pred             CEEEEeCCCCCceeeeEEEECCCCCeEccCCCCCCccccceEEEeCC
Confidence            48999998653   889999999999999999999888888777764


No 46 
>PLN02772 guanylate kinase
Probab=98.46  E-value=1.6e-06  Score=77.63  Aligned_cols=83  Identities=17%  Similarity=0.177  Sum_probs=68.8

Q ss_pred             cCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecC---CCCCcceeeeeEEE-CCEEEEEeCCCCC-Cce
Q 045821          108 LRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAA---PMSTARCYFPCGVL-NQKIYCIGGLGDT-HSW  182 (352)
Q Consensus       108 ~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~---~~~~~r~~~~~~~~-~~~iyv~gG~~~~-~~v  182 (352)
                      .++..++++.+++++||+||.+......+.+++||..|++|..-.   ..|.||.+|+++++ +++|+|+++.... .++
T Consensus        23 ~~~~~~tav~igdk~yv~GG~~d~~~~~~~v~i~D~~t~~W~~P~V~G~~P~~r~GhSa~v~~~~rilv~~~~~~~~~~~  102 (398)
T PLN02772         23 KPKNRETSVTIGDKTYVIGGNHEGNTLSIGVQILDKITNNWVSPIVLGTGPKPCKGYSAVVLNKDRILVIKKGSAPDDSI  102 (398)
T ss_pred             CCCCcceeEEECCEEEEEcccCCCccccceEEEEECCCCcEecccccCCCCCCCCcceEEEECCceEEEEeCCCCCccce
Confidence            367889999999999999998775446789999999999998654   56789999999999 5799999976654 777


Q ss_pred             EEEeCCCC
Q 045821          183 DVYDPRTN  190 (352)
Q Consensus       183 ~~yd~~t~  190 (352)
                      |.....|.
T Consensus       103 w~l~~~t~  110 (398)
T PLN02772        103 WFLEVDTP  110 (398)
T ss_pred             EEEEcCCH
Confidence            77776654


No 47 
>PF07250 Glyoxal_oxid_N:  Glyoxal oxidase N-terminus;  InterPro: IPR009880 This entry represents the N terminus (approximately 300 residues) of a number of plant and fungal glyoxal oxidase enzymes. Glyoxal oxidase catalyses the oxidation of aldehydes to carboxylic acids, coupled with reduction of dioxygen to hydrogen peroxide. It is an essential component of the extracellular lignin degradation pathways of the wood-rot fungus Phanerochaete chrysosporium [].
Probab=98.35  E-value=3.9e-05  Score=64.72  Aligned_cols=141  Identities=14%  Similarity=0.213  Sum_probs=91.1

Q ss_pred             EEEeCCCCcEEEecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeCCC----CceeecCC-CCCCCCCCcEE-E
Q 045821          183 DVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPST----DSWLHADA-NMASGWRGPAV-V  256 (352)
Q Consensus       183 ~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~----~~W~~~~~-~~~~~~~~~~~-~  256 (352)
                      ..||+.|++++.+......-+..++..-||++.+.||...  ....+-.|++.+    ..|.+... ....+|+..+. .
T Consensus        49 ~~yD~~tn~~rpl~v~td~FCSgg~~L~dG~ll~tGG~~~--G~~~ir~~~p~~~~~~~~w~e~~~~m~~~RWYpT~~~L  126 (243)
T PF07250_consen   49 VEYDPNTNTFRPLTVQTDTFCSGGAFLPDGRLLQTGGDND--GNKAIRIFTPCTSDGTCDWTESPNDMQSGRWYPTATTL  126 (243)
T ss_pred             EEEecCCCcEEeccCCCCCcccCcCCCCCCCEEEeCCCCc--cccceEEEecCCCCCCCCceECcccccCCCccccceEC
Confidence            5799999999988643333333344555899999988764  334566787765    67988865 44556666654 4


Q ss_pred             ECCEEEEEecCCCCeEEEEeCC-C--C---ceEEccCCCC--CcCCCCceEEEeCCEEEEEcCCceEEEEEcCCccccc
Q 045821          257 VDDALYVLDQSSGTKLMMWQKE-S--R---EWSPVGRLST--LLTRPPCKLVAIGKTIFVIGKGCSAVVIDVGNIGNIG  327 (352)
Q Consensus       257 ~~~~l~~~gg~~~~~v~~yd~~-~--~---~W~~~~~~~~--~~~~~~~~~~~~~~~i~v~gG~~~~~~~d~~~~~w~~  327 (352)
                      -+|+++++||..+ ..+.|-|. .  .   .|..+.....  ....+-+..+.-+++|++++.. +..+||...+.+.+
T Consensus       127 ~DG~vlIvGG~~~-~t~E~~P~~~~~~~~~~~~~l~~~~~~~~~nlYP~~~llPdG~lFi~an~-~s~i~d~~~n~v~~  203 (243)
T PF07250_consen  127 PDGRVLIVGGSNN-PTYEFWPPKGPGPGPVTLPFLSQTSDTLPNNLYPFVHLLPDGNLFIFANR-GSIIYDYKTNTVVR  203 (243)
T ss_pred             CCCCEEEEeCcCC-CcccccCCccCCCCceeeecchhhhccCccccCceEEEcCCCCEEEEEcC-CcEEEeCCCCeEEe
Confidence            5899999999876 33444443 2  1   1222222111  1123334555679999999976 57789999986533


No 48 
>PLN03215 ascorbic acid mannose pathway regulator 1; Provisional
Probab=98.27  E-value=0.0016  Score=58.49  Aligned_cols=38  Identities=16%  Similarity=0.287  Sum_probs=32.9

Q ss_pred             CccCCCcHHHHHHhhccCc-cccchhhhhhhhhhccccc
Q 045821           16 PLISGLPDDIALFCLARVP-RRYHAVLKCVSSRWRGLLH   53 (352)
Q Consensus        16 ~~~~~lp~dl~~~il~r~p-~~~~~~l~~v~k~W~~l~~   53 (352)
                      ..|++||+||+..|..|+| ..++.++++||+.|++-..
T Consensus         2 ~~Ws~Lp~dll~~i~~~l~~~~d~~~~~~vC~sWr~a~~   40 (373)
T PLN03215          2 ADWSTLPEELLHMIAGRLFSNVELKRFRSICRSWRSSVS   40 (373)
T ss_pred             CChhhCCHHHHHHHHhhCCcHHHHHHHHhhhhhHHHhcc
Confidence            3689999999999999995 5679999999999996544


No 49 
>PF13854 Kelch_5:  Kelch motif
Probab=98.25  E-value=3.1e-06  Score=50.76  Aligned_cols=38  Identities=26%  Similarity=0.489  Sum_probs=33.9

Q ss_pred             cCCcceEEEEECCEEEEEeCCCC-CCCCCCeEEEEECCC
Q 045821          108 LRRKGMGFEVLGKNAYLLGGCGW-SEDATSEVYCYDASM  145 (352)
Q Consensus       108 ~~~~~~~~~~~~~~iyv~GG~~~-~~~~~~~~~~~d~~t  145 (352)
                      .+|..|+++.++++||++||... .....+++|++|..+
T Consensus         3 ~~R~~hs~~~~~~~iyi~GG~~~~~~~~~~d~~~l~l~s   41 (42)
T PF13854_consen    3 SPRYGHSAVVVGNNIYIFGGYSGNNNSYSNDLYVLDLPS   41 (42)
T ss_pred             CCccceEEEEECCEEEEEcCccCCCCCEECcEEEEECCC
Confidence            68999999999999999999984 557789999999876


No 50 
>PLN02772 guanylate kinase
Probab=98.22  E-value=9.8e-06  Score=72.68  Aligned_cols=80  Identities=20%  Similarity=0.191  Sum_probs=62.7

Q ss_pred             CcceeeeeEEECCEEEEEeCCCCC----CceEEEeCCCCcEEEec---CCCCcccceeEEEE-CCEEEEEccCCCCCCce
Q 045821          156 TARCYFPCGVLNQKIYCIGGLGDT----HSWDVYDPRTNNWKLHT---EPNIFTEIEDSFVM-DGKIYIRCSASAATSHV  227 (352)
Q Consensus       156 ~~r~~~~~~~~~~~iyv~gG~~~~----~~v~~yd~~t~~W~~~~---~~~~~~~~~~~~~~-~g~iyv~GG~~~~~~~~  227 (352)
                      .|+..++++.+++++||+||..+.    +.+++||..|++|....   ..|.++.+|+++++ +++|+|++++...  ..
T Consensus        23 ~~~~~~tav~igdk~yv~GG~~d~~~~~~~v~i~D~~t~~W~~P~V~G~~P~~r~GhSa~v~~~~rilv~~~~~~~--~~  100 (398)
T PLN02772         23 KPKNRETSVTIGDKTYVIGGNHEGNTLSIGVQILDKITNNWVSPIVLGTGPKPCKGYSAVVLNKDRILVIKKGSAP--DD  100 (398)
T ss_pred             CCCCcceeEEECCEEEEEcccCCCccccceEEEEECCCCcEecccccCCCCCCCCcceEEEECCceEEEEeCCCCC--cc
Confidence            478889999999999999997654    78999999999998764   46777888888877 6899999755432  24


Q ss_pred             EEEEEeCCCC
Q 045821          228 CALVYEPSTD  237 (352)
Q Consensus       228 ~i~~yd~~~~  237 (352)
                      ++|.....|.
T Consensus       101 ~~w~l~~~t~  110 (398)
T PLN02772        101 SIWFLEVDTP  110 (398)
T ss_pred             ceEEEEcCCH
Confidence            6776665443


No 51 
>PRK11138 outer membrane biogenesis protein BamB; Provisional
Probab=98.14  E-value=0.0069  Score=55.97  Aligned_cols=228  Identities=13%  Similarity=0.150  Sum_probs=130.7

Q ss_pred             CCCEEEEEEecCCCCceeEEEecCCCCCCCeeEeccCCCcc-----cCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEE
Q 045821           67 DETWIYAFCRDNKLERVCCYVLDPNSTRRSWKMIHELPART-----LRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCY  141 (352)
Q Consensus        67 ~~~~l~~~gg~~~~~~~~~~~~d~~~~~~~w~~~~~~~~~~-----~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~  141 (352)
                      .++.+|+....     -.++.||..+-...|+.-..-....     ......+.+..++.||+.+.       ...++.+
T Consensus        68 ~~~~vy~~~~~-----g~l~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~v~~~~v~v~~~-------~g~l~al  135 (394)
T PRK11138         68 AYNKVYAADRA-----GLVKALDADTGKEIWSVDLSEKDGWFSKNKSALLSGGVTVAGGKVYIGSE-------KGQVYAL  135 (394)
T ss_pred             ECCEEEEECCC-----CeEEEEECCCCcEeeEEcCCCcccccccccccccccccEEECCEEEEEcC-------CCEEEEE
Confidence            38888887432     2467778733345576432211000     01122334566888887532       3568999


Q ss_pred             ECCCCe--eEecCCCCCcceeeeeEEECCEEEEEeCCCCCCceEEEeCCCCc--EEEecCCCCc--ccceeEEEECCEEE
Q 045821          142 DASMNT--WTDAAPMSTARCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTNN--WKLHTEPNIF--TEIEDSFVMDGKIY  215 (352)
Q Consensus       142 d~~t~~--W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~--W~~~~~~~~~--~~~~~~~~~~g~iy  215 (352)
                      |..+++  |+.-..  .+ ...+.++.++.+|+..+.   ..++.+|+.+++  |+.-...+..  +....-++.++.+|
T Consensus       136 d~~tG~~~W~~~~~--~~-~~ssP~v~~~~v~v~~~~---g~l~ald~~tG~~~W~~~~~~~~~~~~~~~sP~v~~~~v~  209 (394)
T PRK11138        136 NAEDGEVAWQTKVA--GE-ALSRPVVSDGLVLVHTSN---GMLQALNESDGAVKWTVNLDVPSLTLRGESAPATAFGGAI  209 (394)
T ss_pred             ECCCCCCcccccCC--Cc-eecCCEEECCEEEEECCC---CEEEEEEccCCCEeeeecCCCCcccccCCCCCEEECCEEE
Confidence            998876  765322  11 223445568888886542   568999998875  8764332211  11123345577777


Q ss_pred             EEccCCCCCCceEEEEEeCCCC--ceeec-CCCCCC-------CCCCcEEEECCEEEEEecCCCCeEEEEeCCCCc--eE
Q 045821          216 IRCSASAATSHVCALVYEPSTD--SWLHA-DANMAS-------GWRGPAVVVDDALYVLDQSSGTKLMMWQKESRE--WS  283 (352)
Q Consensus       216 v~GG~~~~~~~~~i~~yd~~~~--~W~~~-~~~~~~-------~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~--W~  283 (352)
                      +..+ .     ..+..+|..++  .|+.- ..+...       ......++.++.+|+.+. .+ .++.+|+.+++  |+
T Consensus       210 ~~~~-~-----g~v~a~d~~~G~~~W~~~~~~~~~~~~~~~~~~~~~sP~v~~~~vy~~~~-~g-~l~ald~~tG~~~W~  281 (394)
T PRK11138        210 VGGD-N-----GRVSAVLMEQGQLIWQQRISQPTGATEIDRLVDVDTTPVVVGGVVYALAY-NG-NLVALDLRSGQIVWK  281 (394)
T ss_pred             EEcC-C-----CEEEEEEccCChhhheeccccCCCccchhcccccCCCcEEECCEEEEEEc-CC-eEEEEECCCCCEEEe
Confidence            7532 2     24777887765  48642 111100       011223566888887663 33 89999998765  86


Q ss_pred             EccCCCCCcCCCCceEEEeCCEEEEEcCCceEEEEEcCCc--cccc
Q 045821          284 PVGRLSTLLTRPPCKLVAIGKTIFVIGKGCSAVVIDVGNI--GNIG  327 (352)
Q Consensus       284 ~~~~~~~~~~~~~~~~~~~~~~i~v~gG~~~~~~~d~~~~--~w~~  327 (352)
                      .-  ...     ...++..++.||+.......+.+|+++.  .|..
T Consensus       282 ~~--~~~-----~~~~~~~~~~vy~~~~~g~l~ald~~tG~~~W~~  320 (394)
T PRK11138        282 RE--YGS-----VNDFAVDGGRIYLVDQNDRVYALDTRGGVELWSQ  320 (394)
T ss_pred             ec--CCC-----ccCcEEECCEEEEEcCCCeEEEEECCCCcEEEcc
Confidence            52  111     1134567899999887767778888643  3643


No 52 
>PF12937 F-box-like:  F-box-like; PDB: 1P22_A 2OVP_B 2OVR_B 2OVQ_B 1FS1_A 1FS2_C 1FQV_I 1LDK_E 2AST_B 2ASS_B.
Probab=98.13  E-value=9.2e-07  Score=54.65  Aligned_cols=40  Identities=33%  Similarity=0.483  Sum_probs=34.4

Q ss_pred             cCCCcHHHHHHhhccCccccchhhhhhhhhhcccccChhh
Q 045821           18 ISGLPDDIALFCLARVPRRYHAVLKCVSSRWRGLLHSEEW   57 (352)
Q Consensus        18 ~~~lp~dl~~~il~r~p~~~~~~l~~v~k~W~~l~~s~~~   57 (352)
                      |..||+|++.+|++.++..++.++..|||+|+.++.++.+
T Consensus         1 i~~LP~Eil~~If~~L~~~dl~~~~~vcr~w~~~~~~~~l   40 (47)
T PF12937_consen    1 ISSLPDEILLEIFSYLDPRDLLRLSLVCRRWRRIANDNSL   40 (47)
T ss_dssp             CCCS-HHHHHHHHTTS-HHHHHHHTTSSHHHHHHHTCCCH
T ss_pred             ChHhHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHCChhh
Confidence            5689999999999999999999999999999999876644


No 53 
>PF13854 Kelch_5:  Kelch motif
Probab=98.12  E-value=8.1e-06  Score=48.93  Aligned_cols=36  Identities=22%  Similarity=0.372  Sum_probs=31.9

Q ss_pred             CCCcceeeeeEEECCEEEEEeCCCC--C---CceEEEeCCC
Q 045821          154 MSTARCYFPCGVLNQKIYCIGGLGD--T---HSWDVYDPRT  189 (352)
Q Consensus       154 ~~~~r~~~~~~~~~~~iyv~gG~~~--~---~~v~~yd~~t  189 (352)
                      +|.+|..|+++.++++||++||...  .   +++++||+.+
T Consensus         1 ~P~~R~~hs~~~~~~~iyi~GG~~~~~~~~~~d~~~l~l~s   41 (42)
T PF13854_consen    1 IPSPRYGHSAVVVGNNIYIFGGYSGNNNSYSNDLYVLDLPS   41 (42)
T ss_pred             CCCCccceEEEEECCEEEEEcCccCCCCCEECcEEEEECCC
Confidence            4789999999999999999999883  2   9999999876


No 54 
>PF07893 DUF1668:  Protein of unknown function (DUF1668);  InterPro: IPR012871 The hypothetical proteins found in this family are expressed by Oryza sativa (Rice) and are of unknown function. 
Probab=98.04  E-value=0.0066  Score=54.75  Aligned_cols=119  Identities=11%  Similarity=0.253  Sum_probs=82.9

Q ss_pred             EEEEE-CCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcceeeeeEEECCEEEEEeCCCCC--C------ceEE
Q 045821          114 GFEVL-GKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCYFPCGVLNQKIYCIGGLGDT--H------SWDV  184 (352)
Q Consensus       114 ~~~~~-~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~--~------~v~~  184 (352)
                      ....+ +++|+.++..       ....+||..+..-...|.+..+.....++.++++||++......  .      .+++
T Consensus        70 ~F~al~gskIv~~d~~-------~~t~vyDt~t~av~~~P~l~~pk~~pisv~VG~~LY~m~~~~~~~~~~~~~~~~FE~  142 (342)
T PF07893_consen   70 DFFALHGSKIVAVDQS-------GRTLVYDTDTRAVATGPRLHSPKRCPISVSVGDKLYAMDRSPFPEPAGRPDFPCFEA  142 (342)
T ss_pred             EEEEecCCeEEEEcCC-------CCeEEEECCCCeEeccCCCCCCCcceEEEEeCCeEEEeeccCccccccCccceeEEE
Confidence            34444 8889888653       44789999999988888888887777788889999999886443  1      5555


Q ss_pred             E--eC--------CCCcEEEecCCCCcccc-------eeEEEE-CCEEEEE-ccCCCCCCceEEEEEeCCCCceeecCC
Q 045821          185 Y--DP--------RTNNWKLHTEPNIFTEI-------EDSFVM-DGKIYIR-CSASAATSHVCALVYEPSTDSWLHADA  244 (352)
Q Consensus       185 y--d~--------~t~~W~~~~~~~~~~~~-------~~~~~~-~g~iyv~-GG~~~~~~~~~i~~yd~~~~~W~~~~~  244 (352)
                      +  ++        ..-.|+.++++|.....       .+-+++ +..|+|. -+..     ...++||+.+.+|+...+
T Consensus       143 l~~~~~~~~~~~~~~w~W~~LP~PPf~~~~~~~~~~i~sYavv~g~~I~vS~~~~~-----~GTysfDt~~~~W~~~Gd  216 (342)
T PF07893_consen  143 LVYRPPPDDPSPEESWSWRSLPPPPFVRDRRYSDYRITSYAVVDGRTIFVSVNGRR-----WGTYSFDTESHEWRKHGD  216 (342)
T ss_pred             eccccccccccCCCcceEEcCCCCCccccCCcccceEEEEEEecCCeEEEEecCCc-----eEEEEEEcCCcceeeccc
Confidence            5  42        23368888876543332       233455 5688884 2211     248999999999999976


No 55 
>PRK11138 outer membrane biogenesis protein BamB; Provisional
Probab=98.02  E-value=0.012  Score=54.41  Aligned_cols=206  Identities=15%  Similarity=0.178  Sum_probs=112.9

Q ss_pred             eEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEECCCCe--eEecCCCCCc----
Q 045821           84 CCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDASMNT--WTDAAPMSTA----  157 (352)
Q Consensus        84 ~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~--W~~~~~~~~~----  157 (352)
                      .++.||+.+-...|+.-...+.. ..+...+-+..++.+|+..+       ...++.+|+.+++  |+.-...+..    
T Consensus       171 ~l~ald~~tG~~~W~~~~~~~~~-~~~~~~sP~v~~~~v~~~~~-------~g~v~a~d~~~G~~~W~~~~~~~~~~~~~  242 (394)
T PRK11138        171 MLQALNESDGAVKWTVNLDVPSL-TLRGESAPATAFGGAIVGGD-------NGRVSAVLMEQGQLIWQQRISQPTGATEI  242 (394)
T ss_pred             EEEEEEccCCCEeeeecCCCCcc-cccCCCCCEEECCEEEEEcC-------CCEEEEEEccCChhhheeccccCCCccch
Confidence            35677772223346654332210 01111223344666666432       2457888888775  7642111110    


Q ss_pred             ----ceeeeeEEECCEEEEEeCCCCCCceEEEeCCCCc--EEEecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEE
Q 045821          158 ----RCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTNN--WKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALV  231 (352)
Q Consensus       158 ----r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~--W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~  231 (352)
                          ....+.++.++.+|+.+..   ..+.++|+.+++  |+.--  ..   ....+..+++||+....      ..+++
T Consensus       243 ~~~~~~~~sP~v~~~~vy~~~~~---g~l~ald~~tG~~~W~~~~--~~---~~~~~~~~~~vy~~~~~------g~l~a  308 (394)
T PRK11138        243 DRLVDVDTTPVVVGGVVYALAYN---GNLVALDLRSGQIVWKREY--GS---VNDFAVDGGRIYLVDQN------DRVYA  308 (394)
T ss_pred             hcccccCCCcEEECCEEEEEEcC---CeEEEEECCCCCEEEeecC--CC---ccCcEEECCEEEEEcCC------CeEEE
Confidence                1123445668999987642   568999998874  87521  11   11245678999997422      25888


Q ss_pred             EeCCCC--ceeecCCCCCCCCCCcEEEECCEEEEEecCCCCeEEEEeCCCCc--eEEccCCCCCcCCCCceEEEeCCEEE
Q 045821          232 YEPSTD--SWLHADANMASGWRGPAVVVDDALYVLDQSSGTKLMMWQKESRE--WSPVGRLSTLLTRPPCKLVAIGKTIF  307 (352)
Q Consensus       232 yd~~~~--~W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~--W~~~~~~~~~~~~~~~~~~~~~~~i~  307 (352)
                      +|..++  .|+.-..  ........+..++.||+... ++ .++.+|..+++  |+.  .++.  .......+..+++||
T Consensus       309 ld~~tG~~~W~~~~~--~~~~~~sp~v~~g~l~v~~~-~G-~l~~ld~~tG~~~~~~--~~~~--~~~~s~P~~~~~~l~  380 (394)
T PRK11138        309 LDTRGGVELWSQSDL--LHRLLTAPVLYNGYLVVGDS-EG-YLHWINREDGRFVAQQ--KVDS--SGFLSEPVVADDKLL  380 (394)
T ss_pred             EECCCCcEEEccccc--CCCcccCCEEECCEEEEEeC-CC-EEEEEECCCCCEEEEE--EcCC--CcceeCCEEECCEEE
Confidence            998765  4864321  11122233557888886543 34 79999998876  443  1211  112234455788888


Q ss_pred             EEcCCceEEEEE
Q 045821          308 VIGKGCSAVVID  319 (352)
Q Consensus       308 v~gG~~~~~~~d  319 (352)
                      +-...-..+.++
T Consensus       381 v~t~~G~l~~~~  392 (394)
T PRK11138        381 IQARDGTVYAIT  392 (394)
T ss_pred             EEeCCceEEEEe
Confidence            876544454444


No 56 
>PF00646 F-box:  F-box domain;  InterPro: IPR001810 The F-box domain was first described as a sequence motif found in cyclin-F that interacts with the protein SKP1 [, ]. This relatively conserved structural motif is present in numerous proteins and serves as a link between a target protein and a ubiquitin-conjugating enzyme. The SCF complex (e.g., Skp1-Cullin-F-box) plays a similar role as an E3 ligase in the ubiquitin protein degradation pathway [, ]. Different F-box proteins as a part of SCF complex recruit particular substrates for ubiquitination through specific protein-protein interaction domains.  Many mammalian F-box domains contain leucine-rich or WD-40 repeats (IPR001680 from INTERPRO). However, several F-box proteins either have other previously described domains such as Sec7 domain found in FBS protein or do not contain defined protein-protein interaction domains or motifs.; GO: 0005515 protein binding; PDB: 2E32_A 2E31_A 3V7D_B 1NEX_B 3MKS_D 3L2O_B.
Probab=98.02  E-value=1.1e-06  Score=54.56  Aligned_cols=43  Identities=33%  Similarity=0.426  Sum_probs=35.8

Q ss_pred             ccCCCcHHHHHHhhccCccccchhhhhhhhhhcccccChhhHH
Q 045821           17 LISGLPDDIALFCLARVPRRYHAVLKCVSSRWRGLLHSEEWCA   59 (352)
Q Consensus        17 ~~~~lp~dl~~~il~r~p~~~~~~l~~v~k~W~~l~~s~~~~~   59 (352)
                      .+.+||+|++.+|+.+++..++..++.|||+|+.++.++.+..
T Consensus         2 ~~~~LP~~il~~Il~~l~~~~~~~l~~vsk~~~~~~~~~~~~~   44 (48)
T PF00646_consen    2 PLSDLPDEILQEILSYLDPKDLLRLSLVSKRWRSLVDSPRLWK   44 (48)
T ss_dssp             HHHHS-HHHHHHHHHTS-HHHHHHHCTT-HHHHHHHTTHHHHH
T ss_pred             CHHHCCHHHHHHHHHHCcHHHHHHHHHHhhHHHHHHcCCCccH
Confidence            4568999999999999999999999999999999998876643


No 57 
>smart00256 FBOX A Receptor for Ubiquitination Targets.
Probab=97.87  E-value=6.2e-06  Score=49.18  Aligned_cols=37  Identities=38%  Similarity=0.671  Sum_probs=34.3

Q ss_pred             CcHHHHHHhhccCccccchhhhhhhhhhcccccChhh
Q 045821           21 LPDDIALFCLARVPRRYHAVLKCVSSRWRGLLHSEEW   57 (352)
Q Consensus        21 lp~dl~~~il~r~p~~~~~~l~~v~k~W~~l~~s~~~   57 (352)
                      ||+|++.+|+++++..++.+++.|||+|+.++..+.+
T Consensus         1 lP~~ll~~I~~~l~~~d~~~~~~vc~~~~~~~~~~~~   37 (41)
T smart00256        1 LPDEILEEILSKLPPKDLLRLRKVSRRWRSLIDSHDF   37 (41)
T ss_pred             CCHHHHHHHHHcCCHHHHHHHHHHHHHHHHHhcChhh
Confidence            7999999999999999999999999999999877654


No 58 
>PF13360 PQQ_2:  PQQ-like domain; PDB: 3HXJ_B 1YIQ_A 1KV9_A 3Q54_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A ....
Probab=97.85  E-value=0.011  Score=50.26  Aligned_cols=173  Identities=16%  Similarity=0.175  Sum_probs=100.7

Q ss_pred             CeEEEEECCCCe--eEecCCCCCcceeee--eEEECCEEEEEeCCCCCCceEEEeCCCCc--EEEecCCCCcccceeEEE
Q 045821          136 SEVYCYDASMNT--WTDAAPMSTARCYFP--CGVLNQKIYCIGGLGDTHSWDVYDPRTNN--WKLHTEPNIFTEIEDSFV  209 (352)
Q Consensus       136 ~~~~~~d~~t~~--W~~~~~~~~~r~~~~--~~~~~~~iyv~gG~~~~~~v~~yd~~t~~--W~~~~~~~~~~~~~~~~~  209 (352)
                      ..+..+|+.+++  |+.-.  ..+.....  .+..++.+|+..+.   ..+.++|..+++  |+.-.  +..... ....
T Consensus         3 g~l~~~d~~tG~~~W~~~~--~~~~~~~~~~~~~~~~~v~~~~~~---~~l~~~d~~tG~~~W~~~~--~~~~~~-~~~~   74 (238)
T PF13360_consen    3 GTLSALDPRTGKELWSYDL--GPGIGGPVATAVPDGGRVYVASGD---GNLYALDAKTGKVLWRFDL--PGPISG-APVV   74 (238)
T ss_dssp             SEEEEEETTTTEEEEEEEC--SSSCSSEEETEEEETTEEEEEETT---SEEEEEETTTSEEEEEEEC--SSCGGS-GEEE
T ss_pred             CEEEEEECCCCCEEEEEEC--CCCCCCccceEEEeCCEEEEEcCC---CEEEEEECCCCCEEEEeec--cccccc-eeee
Confidence            457788888776  76622  11122222  34468899998432   779999998774  77643  222222 2477


Q ss_pred             ECCEEEEEccCCCCCCceEEEEEeCCCC--cee-ecCCCCC-CCCCCcEE-EECCEEEEEecCCCCeEEEEeCCCCc--e
Q 045821          210 MDGKIYIRCSASAATSHVCALVYEPSTD--SWL-HADANMA-SGWRGPAV-VVDDALYVLDQSSGTKLMMWQKESRE--W  282 (352)
Q Consensus       210 ~~g~iyv~GG~~~~~~~~~i~~yd~~~~--~W~-~~~~~~~-~~~~~~~~-~~~~~l~~~gg~~~~~v~~yd~~~~~--W  282 (352)
                      .++.+|+....      ..+.++|..++  .|+ .....+. ........ ..++.+|+... .+ .++.+|+++++  |
T Consensus        75 ~~~~v~v~~~~------~~l~~~d~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~g-~l~~~d~~tG~~~w  146 (238)
T PF13360_consen   75 DGGRVYVGTSD------GSLYALDAKTGKVLWSIYLTSSPPAGVRSSSSPAVDGDRLYVGTS-SG-KLVALDPKTGKLLW  146 (238)
T ss_dssp             ETTEEEEEETT------SEEEEEETTTSCEEEEEEE-SSCTCSTB--SEEEEETTEEEEEET-CS-EEEEEETTTTEEEE
T ss_pred             cccccccccce------eeeEecccCCcceeeeeccccccccccccccCceEecCEEEEEec-cC-cEEEEecCCCcEEE
Confidence            89999988421      16899997665  598 4433222 22222333 34666665543 33 89999999765  6


Q ss_pred             EEccCCCCCcC------CCCceEEEeCCEEEEEcCCceEEEEEcCCcc
Q 045821          283 SPVGRLSTLLT------RPPCKLVAIGKTIFVIGKGCSAVVIDVGNIG  324 (352)
Q Consensus       283 ~~~~~~~~~~~------~~~~~~~~~~~~i~v~gG~~~~~~~d~~~~~  324 (352)
                      +.-...+....      ......+..++.||+..+......+|.++..
T Consensus       147 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~~~~~g~~~~~d~~tg~  194 (238)
T PF13360_consen  147 KYPVGEPRGSSPISSFSDINGSPVISDGRVYVSSGDGRVVAVDLATGE  194 (238)
T ss_dssp             EEESSTT-SS--EEEETTEEEEEECCTTEEEEECCTSSEEEEETTTTE
T ss_pred             EeecCCCCCCcceeeecccccceEEECCEEEEEcCCCeEEEEECCCCC
Confidence            65433332110      1123444456799998877655545888776


No 59 
>KOG0281 consensus Beta-TrCP (transducin repeats containing)/Slimb proteins [Function unknown]
Probab=97.83  E-value=0.00087  Score=58.02  Aligned_cols=279  Identities=15%  Similarity=0.163  Sum_probs=133.2

Q ss_pred             cCCCc----HHHHHHhhccCccccchhhhhhhhhhcccccChhhH-HH-----------HhhCCCCCCEEEEEEecCCCC
Q 045821           18 ISGLP----DDIALFCLARVPRRYHAVLKCVSSRWRGLLHSEEWC-AY-----------RRKHNLDETWIYAFCRDNKLE   81 (352)
Q Consensus        18 ~~~lp----~dl~~~il~r~p~~~~~~l~~v~k~W~~l~~s~~~~-~~-----------~~~~~~~~~~l~~~gg~~~~~   81 (352)
                      +..||    +++++.||+.|...+++....|||+|..++..+..- +.           |.-+...+-..|++--.....
T Consensus        75 i~~lP~~gl~hi~e~ilsyld~~sLc~celv~k~W~r~l~dg~~WKkLie~~vr~dslWrgl~e~rqw~~~lf~~r~~~~  154 (499)
T KOG0281|consen   75 ITALPEQGLDHIAENILSYLDALSLCACELVCKEWKRVLSDGMLWKKLIERMVRTDSLWRGLSERRQWDQYLFKNRPNDG  154 (499)
T ss_pred             HHhcccccHHHHHHHHHHhcchhhhhHHHHHHHHHHHHhccchHHHHHHHHhcchHHHHhhhhhccCcchhhccCCCCcC
Confidence            45788    999999999999999999999999999988777431 11           111111122333331110000


Q ss_pred             -----ceeEEEe-----cCCCCCCCeeEeccCCCcc---c-CCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEECCCCe
Q 045821           82 -----RVCCYVL-----DPNSTRRSWKMIHELPART---L-RRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDASMNT  147 (352)
Q Consensus        82 -----~~~~~~~-----d~~~~~~~w~~~~~~~~~~---~-~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~  147 (352)
                           .+....|     |.....+.|..-..+-...   + ...+.-+...++...|-|-      ..+++-++|..+..
T Consensus       155 ~~~~n~f~~~l~pki~~di~~idsNWr~Gr~~~~rinc~Se~skgVYClQYDD~kiVSGl------rDnTikiWD~n~~~  228 (499)
T KOG0281|consen  155 GFPPNSFYRLLYPKIIQDIETIESNWRCGRHLLQRINCRSENSKGVYCLQYDDEKIVSGL------RDNTIKIWDKNSLE  228 (499)
T ss_pred             CcCCCcchhhhhHHHHHHHhhhhcchhccceeeeeecCCcccCCceEEEEecchhhhccc------ccCceEEeccccHH
Confidence                 0000001     1111334453222111000   0 0111112233444444332      23456666765543


Q ss_pred             eEecCCCCCcceeeee-EEECCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccce-eEEEECCEEEEEccCCCCCC
Q 045821          148 WTDAAPMSTARCYFPC-GVLNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIE-DSFVMDGKIYIRCSASAATS  225 (352)
Q Consensus       148 W~~~~~~~~~r~~~~~-~~~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~-~~~~~~g~iyv~GG~~~~~~  225 (352)
                      -..+   -..+.+... -.+++++.+.|..+  +.+-++|.+|++--+.   ...+... ....+++.+.|....+   .
T Consensus       229 c~~~---L~GHtGSVLCLqyd~rviisGSSD--sTvrvWDv~tge~l~t---lihHceaVLhlrf~ng~mvtcSkD---r  297 (499)
T KOG0281|consen  229 CLKI---LTGHTGSVLCLQYDERVIVSGSSD--STVRVWDVNTGEPLNT---LIHHCEAVLHLRFSNGYMVTCSKD---R  297 (499)
T ss_pred             HHHh---hhcCCCcEEeeeccceEEEecCCC--ceEEEEeccCCchhhH---HhhhcceeEEEEEeCCEEEEecCC---c
Confidence            2211   112222222 22477877777655  7888999988753211   1112211 1122333333432222   1


Q ss_pred             ceEEEEEeCCCCceeecCCCCCCCCCCc---EEEECCEEEEEecCCCCeEEEEeCCCCceEEccCCCCCcCCCCceEEEe
Q 045821          226 HVCALVYEPSTDSWLHADANMASGWRGP---AVVVDDALYVLDQSSGTKLMMWQKESREWSPVGRLSTLLTRPPCKLVAI  302 (352)
Q Consensus       226 ~~~i~~yd~~~~~W~~~~~~~~~~~~~~---~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~  302 (352)
                        .+-++|...-+  .+.-..--..+.+   ++-++++ |++.......+-+|++.+..+...  +..  -+.+-++.-+
T Consensus       298 --siaVWdm~sps--~it~rrVLvGHrAaVNvVdfd~k-yIVsASgDRTikvW~~st~efvRt--l~g--HkRGIAClQY  368 (499)
T KOG0281|consen  298 --SIAVWDMASPT--DITLRRVLVGHRAAVNVVDFDDK-YIVSASGDRTIKVWSTSTCEFVRT--LNG--HKRGIACLQY  368 (499)
T ss_pred             --eeEEEeccCch--HHHHHHHHhhhhhheeeeccccc-eEEEecCCceEEEEeccceeeehh--hhc--ccccceehhc
Confidence              23333332211  1100000011222   2446777 666655555899999887666433  322  2455667778


Q ss_pred             CCEEEEEcCCceEE-EEEcCC
Q 045821          303 GKTIFVIGKGCSAV-VIDVGN  322 (352)
Q Consensus       303 ~~~i~v~gG~~~~~-~~d~~~  322 (352)
                      .+++.|-|..+... ++|++.
T Consensus       369 r~rlvVSGSSDntIRlwdi~~  389 (499)
T KOG0281|consen  369 RDRLVVSGSSDNTIRLWDIEC  389 (499)
T ss_pred             cCeEEEecCCCceEEEEeccc
Confidence            88888888665322 455543


No 60 
>TIGR03300 assembly_YfgL outer membrane assembly lipoprotein YfgL. Members of this protein family are YfgL, a lipoprotein component of a complex that acts protein insertion into the bacterial outer membrane. Other members of this complex are NlpB, YfiO, and YaeT. This protein contains multiple copies of a repeat that, in other contexts, are associated with binding of the coenzyme PQQ.
Probab=97.77  E-value=0.03  Score=51.34  Aligned_cols=173  Identities=14%  Similarity=0.187  Sum_probs=96.0

Q ss_pred             EEECCEEEEEeCCCCCCCCCCeEEEEECCCCe--eEecCCCCCc--------ceeeeeEEECCEEEEEeCCCCCCceEEE
Q 045821          116 EVLGKNAYLLGGCGWSEDATSEVYCYDASMNT--WTDAAPMSTA--------RCYFPCGVLNQKIYCIGGLGDTHSWDVY  185 (352)
Q Consensus       116 ~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~--W~~~~~~~~~--------r~~~~~~~~~~~iyv~gG~~~~~~v~~y  185 (352)
                      +..++.+|+ |..      ...+..+|+.+++  |+.-...+..        ......+..++.+|+....   ..+.+|
T Consensus       187 ~~~~~~v~~-~~~------~g~v~ald~~tG~~~W~~~~~~~~g~~~~~~~~~~~~~p~~~~~~vy~~~~~---g~l~a~  256 (377)
T TIGR03300       187 VIADGGVLV-GFA------GGKLVALDLQTGQPLWEQRVALPKGRTELERLVDVDGDPVVDGGQVYAVSYQ---GRVAAL  256 (377)
T ss_pred             EEECCEEEE-ECC------CCEEEEEEccCCCEeeeeccccCCCCCchhhhhccCCccEEECCEEEEEEcC---CEEEEE
Confidence            445665554 321      2468889988775  7542211111        1122344557888886542   568999


Q ss_pred             eCCCCc--EEEecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeCCCC--ceeecCCCCCCCCCCcEEEECCEE
Q 045821          186 DPRTNN--WKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTD--SWLHADANMASGWRGPAVVVDDAL  261 (352)
Q Consensus       186 d~~t~~--W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~--~W~~~~~~~~~~~~~~~~~~~~~l  261 (352)
                      |+.+++  |+.-.     ......+..+++||+....      ..+.++|..++  .|+.-..  ........+..++.|
T Consensus       257 d~~tG~~~W~~~~-----~~~~~p~~~~~~vyv~~~~------G~l~~~d~~tG~~~W~~~~~--~~~~~ssp~i~g~~l  323 (377)
T TIGR03300       257 DLRSGRVLWKRDA-----SSYQGPAVDDNRLYVTDAD------GVVVALDRRSGSELWKNDEL--KYRQLTAPAVVGGYL  323 (377)
T ss_pred             ECCCCcEEEeecc-----CCccCceEeCCEEEEECCC------CeEEEEECCCCcEEEccccc--cCCccccCEEECCEE
Confidence            998764  76531     1122334568899987321      25888888765  4865321  111122224467777


Q ss_pred             EEEecCCCCeEEEEeCCCCc--eEEccCCCCCcCCCCceEEEeCCEEEEEcCCceEEE
Q 045821          262 YVLDQSSGTKLMMWQKESRE--WSPVGRLSTLLTRPPCKLVAIGKTIFVIGKGCSAVV  317 (352)
Q Consensus       262 ~~~gg~~~~~v~~yd~~~~~--W~~~~~~~~~~~~~~~~~~~~~~~i~v~gG~~~~~~  317 (352)
                      |+.+ .++ .++.+|..+++  |+ .. ++.  .......+..+++||+.+..-.++.
T Consensus       324 ~~~~-~~G-~l~~~d~~tG~~~~~-~~-~~~--~~~~~sp~~~~~~l~v~~~dG~l~~  375 (377)
T TIGR03300       324 VVGD-FEG-YLHWLSREDGSFVAR-LK-TDG--SGIASPPVVVGDGLLVQTRDGDLYA  375 (377)
T ss_pred             EEEe-CCC-EEEEEECCCCCEEEE-EE-cCC--CccccCCEEECCEEEEEeCCceEEE
Confidence            7654 344 89999987665  42 21 111  1123445667888887765434443


No 61 
>PF13360 PQQ_2:  PQQ-like domain; PDB: 3HXJ_B 1YIQ_A 1KV9_A 3Q54_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A ....
Probab=97.54  E-value=0.041  Score=46.65  Aligned_cols=182  Identities=18%  Similarity=0.228  Sum_probs=108.4

Q ss_pred             EEECCEEEEEeCCCCCCCCCCeEEEEECCCCe--eEecCCCCCcceeeeeEEECCEEEEEeCCCCCCceEEEeCCCC--c
Q 045821          116 EVLGKNAYLLGGCGWSEDATSEVYCYDASMNT--WTDAAPMSTARCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTN--N  191 (352)
Q Consensus       116 ~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~--W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~--~  191 (352)
                      +..++.+|+..+       ...++++|..+++  |+.-.  +.+ .....+..++.+|+..+.   ..+..+|..+.  .
T Consensus        33 ~~~~~~v~~~~~-------~~~l~~~d~~tG~~~W~~~~--~~~-~~~~~~~~~~~v~v~~~~---~~l~~~d~~tG~~~   99 (238)
T PF13360_consen   33 VPDGGRVYVASG-------DGNLYALDAKTGKVLWRFDL--PGP-ISGAPVVDGGRVYVGTSD---GSLYALDAKTGKVL   99 (238)
T ss_dssp             EEETTEEEEEET-------TSEEEEEETTTSEEEEEEEC--SSC-GGSGEEEETTEEEEEETT---SEEEEEETTTSCEE
T ss_pred             EEeCCEEEEEcC-------CCEEEEEECCCCCEEEEeec--ccc-ccceeeecccccccccce---eeeEecccCCccee
Confidence            447888998843       4788999998887  65443  222 112246778999988742   47899998877  4


Q ss_pred             EE-EecCCCCcc--cceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCc--eeecCCCCCCCC--------CCcEEEEC
Q 045821          192 WK-LHTEPNIFT--EIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDS--WLHADANMASGW--------RGPAVVVD  258 (352)
Q Consensus       192 W~-~~~~~~~~~--~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~--W~~~~~~~~~~~--------~~~~~~~~  258 (352)
                      |+ .....+...  ......+.++.+|+...      ...+.++|+.+++  |+.-...+....        ....+..+
T Consensus       100 W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~------~g~l~~~d~~tG~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~  173 (238)
T PF13360_consen  100 WSIYLTSSPPAGVRSSSSPAVDGDRLYVGTS------SGKLVALDPKTGKLLWKYPVGEPRGSSPISSFSDINGSPVISD  173 (238)
T ss_dssp             EEEEE-SSCTCSTB--SEEEEETTEEEEEET------CSEEEEEETTTTEEEEEEESSTT-SS--EEEETTEEEEEECCT
T ss_pred             eeeccccccccccccccCceEecCEEEEEec------cCcEEEEecCCCcEEEEeecCCCCCCcceeeecccccceEEEC
Confidence            98 443322222  22234445777777642      1268999988764  776432211111        11123346


Q ss_pred             CEEEEEecCCCCeEEEEeCCCCc--eEEccCCCCCcCCCCceEEEeCCEEEEEcCCceEEEEEcCCcc
Q 045821          259 DALYVLDQSSGTKLMMWQKESRE--WSPVGRLSTLLTRPPCKLVAIGKTIFVIGKGCSAVVIDVGNIG  324 (352)
Q Consensus       259 ~~l~~~gg~~~~~v~~yd~~~~~--W~~~~~~~~~~~~~~~~~~~~~~~i~v~gG~~~~~~~d~~~~~  324 (352)
                      +.+|+..+..  .+..+|..++.  |+..  ...    ........++.||+......++.+|+++..
T Consensus       174 ~~v~~~~~~g--~~~~~d~~tg~~~w~~~--~~~----~~~~~~~~~~~l~~~~~~~~l~~~d~~tG~  233 (238)
T PF13360_consen  174 GRVYVSSGDG--RVVAVDLATGEKLWSKP--ISG----IYSLPSVDGGTLYVTSSDGRLYALDLKTGK  233 (238)
T ss_dssp             TEEEEECCTS--SEEEEETTTTEEEEEEC--SS-----ECECEECCCTEEEEEETTTEEEEEETTTTE
T ss_pred             CEEEEEcCCC--eEEEEECCCCCEEEEec--CCC----ccCCceeeCCEEEEEeCCCEEEEEECCCCC
Confidence            7788776553  36666999987  8333  211    112245668888888855567789988654


No 62 
>TIGR03300 assembly_YfgL outer membrane assembly lipoprotein YfgL. Members of this protein family are YfgL, a lipoprotein component of a complex that acts protein insertion into the bacterial outer membrane. Other members of this complex are NlpB, YfiO, and YaeT. This protein contains multiple copies of a repeat that, in other contexts, are associated with binding of the coenzyme PQQ.
Probab=97.53  E-value=0.069  Score=48.97  Aligned_cols=221  Identities=16%  Similarity=0.175  Sum_probs=122.4

Q ss_pred             CCEEEEEEecCCCCceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEECCCCe
Q 045821           68 ETWIYAFCRDNKLERVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDASMNT  147 (352)
Q Consensus        68 ~~~l~~~gg~~~~~~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~  147 (352)
                      ++.+|+.+..     -.++.||+.+....|+.-..-.      ...+.+..++.+|+.+.       ...++.+|..+++
T Consensus        65 ~~~v~v~~~~-----g~v~a~d~~tG~~~W~~~~~~~------~~~~p~v~~~~v~v~~~-------~g~l~ald~~tG~  126 (377)
T TIGR03300        65 GGKVYAADAD-----GTVVALDAETGKRLWRVDLDER------LSGGVGADGGLVFVGTE-------KGEVIALDAEDGK  126 (377)
T ss_pred             CCEEEEECCC-----CeEEEEEccCCcEeeeecCCCC------cccceEEcCCEEEEEcC-------CCEEEEEECCCCc
Confidence            7777765322     2477778722334575433221      11123444677776432       3578999998775


Q ss_pred             --eEecCCCCCcceeeeeEEECCEEEEEeCCCCCCceEEEeCCCC--cEEEecCCCCc--ccceeEEEECCEEEEEccCC
Q 045821          148 --WTDAAPMSTARCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTN--NWKLHTEPNIF--TEIEDSFVMDGKIYIRCSAS  221 (352)
Q Consensus       148 --W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~--~W~~~~~~~~~--~~~~~~~~~~g~iyv~GG~~  221 (352)
                        |+.-..  .+ .....+..++.+|+..+.   ..+..+|+.++  .|+.-...+..  ......+..++.+|+. ...
T Consensus       127 ~~W~~~~~--~~-~~~~p~v~~~~v~v~~~~---g~l~a~d~~tG~~~W~~~~~~~~~~~~~~~sp~~~~~~v~~~-~~~  199 (377)
T TIGR03300       127 ELWRAKLS--SE-VLSPPLVANGLVVVRTND---GRLTALDAATGERLWTYSRVTPALTLRGSASPVIADGGVLVG-FAG  199 (377)
T ss_pred             EeeeeccC--ce-eecCCEEECCEEEEECCC---CeEEEEEcCCCceeeEEccCCCceeecCCCCCEEECCEEEEE-CCC
Confidence              764322  11 223344567888886442   56899999876  48754322211  1122334567766653 222


Q ss_pred             CCCCceEEEEEeCCCC--ceeec-CCCCCC-------CCCCcEEEECCEEEEEecCCCCeEEEEeCCCCc--eEEccCCC
Q 045821          222 AATSHVCALVYEPSTD--SWLHA-DANMAS-------GWRGPAVVVDDALYVLDQSSGTKLMMWQKESRE--WSPVGRLS  289 (352)
Q Consensus       222 ~~~~~~~i~~yd~~~~--~W~~~-~~~~~~-------~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~--W~~~~~~~  289 (352)
                           ..+..+|+.++  .|+.- ..+...       ......+..++.+|+.+. .+ .++.||+++++  |+.-  .+
T Consensus       200 -----g~v~ald~~tG~~~W~~~~~~~~g~~~~~~~~~~~~~p~~~~~~vy~~~~-~g-~l~a~d~~tG~~~W~~~--~~  270 (377)
T TIGR03300       200 -----GKLVALDLQTGQPLWEQRVALPKGRTELERLVDVDGDPVVDGGQVYAVSY-QG-RVAALDLRSGRVLWKRD--AS  270 (377)
T ss_pred             -----CEEEEEEccCCCEeeeeccccCCCCCchhhhhccCCccEEECCEEEEEEc-CC-EEEEEECCCCcEEEeec--cC
Confidence                 25888888665  48642 111100       011223456788887653 33 79999998754  7543  11


Q ss_pred             CCcCCCCceEEEeCCEEEEEcCCceEEEEEcCCc--cccc
Q 045821          290 TLLTRPPCKLVAIGKTIFVIGKGCSAVVIDVGNI--GNIG  327 (352)
Q Consensus       290 ~~~~~~~~~~~~~~~~i~v~gG~~~~~~~d~~~~--~w~~  327 (352)
                           .....+..+++||+......++.+|....  .|+.
T Consensus       271 -----~~~~p~~~~~~vyv~~~~G~l~~~d~~tG~~~W~~  305 (377)
T TIGR03300       271 -----SYQGPAVDDNRLYVTDADGVVVALDRRSGSELWKN  305 (377)
T ss_pred             -----CccCceEeCCEEEEECCCCeEEEEECCCCcEEEcc
Confidence                 12234567899999876656777887543  3654


No 63 
>PF03089 RAG2:  Recombination activating protein 2;  InterPro: IPR004321 The variable portion of the genes encoding immunoglobulins and T cell receptors are assembled from component V, D, and J DNA segments by a site-specific recombination reaction termed V(D)J recombination. V(D)J recombination is targeted to specific sites on the chromosome by recombination signal sequences (RSSs) that flank antigen receptor gene segments. The RSS consists of a conserved heptamer (consensus, 5'-CACAGTG-3') and nonamer (consensus, 5'-ACAAAAACC-3') separated by a spacer of either 12 or 23 bp. Efficient recombination occurs between a 12-RSS and a 23-RSS, a restriction known as the 12/23 rule. V(D)J recombination can be divided into two phases, DNA cleavage and DNA joining. DNA cleavage requires two lymphocyte-specific factors, the products of the recombination activating genes, RAG1 and RAG2, which together recognise the RSSs and create double strand breaks at the RSS-coding segment junctions []. RAG-mediated DNA cleavage occurs in a synaptic complex termed the paired complex, which is constituted from two distinct RSS-RAG complexes, a 12-SC and a 23-SC (where SC stands for signal complex). The DNA cleavage reaction involves two distinct enzymatic steps, initial nicking that creates a 3'-OH between a coding segment and its RSS, followed by hairpin formation in which the newly created 3'-OH attacks a phosphodiester bond on the opposite DNA strand. This generates a blunt, 5' phosphorylated signal end containing all of the RSS elements, and a covalently sealed hairpin coding end.  The second phase of V(D)J recombination, in which broken DNA fragments are processed and joined, is less well characterised. Signal ends are typically joined precisely to form a signal joint, whereas joining of the coding ends requires the hairpin structure to be opened and typically involves nucleotide addition and deletion before formation of the coding joint. The factors involved in these processes include ubiquitously expressed proteins involved in the repair of DNA double strand breaks by nonhomologous end joining, terminal deoxynucleotidyl transferase, and Artemis protein. In addition to their critical roles in RSS recognition and DNA cleavage, the RAG proteins may perform two distinct types of functions in the postcleavage phase of V(D)J. A structural function has been inferred from the finding that, after DNA cleavage in vitro, the DNA ends remain associated with the RAG proteins in a "four end" complex known as the cleaved signal complex. After release of the coding ends in vitro, and after coding joint formation in vivo, the RAG proteins remain in a stable signal end complex (SEC) containing the two signal ends. These postcleavage complexes may serve as essential scaffolds for the second phase of the reaction, with the RAG proteins acting to organise the DNA processing and joining events.  The second type of RAG protein-mediated postcleavage activity is the catalysis of phosphodiester bond hydrolysis and strand transfer reactions. The RAG proteins are capable of opening hairpin coding ends in vitro. The RAG proteins also show 3' flap endonuclease activity that may contribute to coding end processing/joining and can utilise the 3' OH group on the signal ends to attack hairpin coding ends (forming hybrid or open/shut joints) or virtually any DNA duplex (forming a transposition product).; GO: 0003677 DNA binding, 0006310 DNA recombination, 0005634 nucleus
Probab=97.49  E-value=0.004  Score=52.68  Aligned_cols=101  Identities=17%  Similarity=0.126  Sum_probs=66.4

Q ss_pred             EEEEE-eCCCCCCCCCCeEEEEECCCCe--------e---EecCCCCCcceeeeeEEE--CC--EEEEEeCCCCC-----
Q 045821          121 NAYLL-GGCGWSEDATSEVYCYDASMNT--------W---TDAAPMSTARCYFPCGVL--NQ--KIYCIGGLGDT-----  179 (352)
Q Consensus       121 ~iyv~-GG~~~~~~~~~~~~~~d~~t~~--------W---~~~~~~~~~r~~~~~~~~--~~--~iyv~gG~~~~-----  179 (352)
                      ..|++ ||.+.++.-+..+++....++.        .   ..+.+.|.+|++|++.++  .|  ...+|||...-     
T Consensus        39 ~~YlIHGGrTPNNElS~~LY~ls~~s~~cNkK~tl~C~EKeLvGdvP~aRYGHt~~vV~SrGKta~VlFGGRSY~P~~qR  118 (337)
T PF03089_consen   39 EQYLIHGGRTPNNELSSSLYILSVDSRGCNKKVTLCCQEKELVGDVPEARYGHTINVVHSRGKTACVLFGGRSYMPPGQR  118 (337)
T ss_pred             eeEEecCCcCCCcccccceEEEEeecCCCCceeEEEEecceecCCCCcccccceEEEEEECCcEEEEEECCcccCCcccc
Confidence            45665 6777766677788887665432        1   124578999999987776  23  46777885421     


Q ss_pred             ------------CceEEEeCCCCcEEE--ecCCCCcccceeEEEECCEEEEEccCC
Q 045821          180 ------------HSWDVYDPRTNNWKL--HTEPNIFTEIEDSFVMDGKIYIRCSAS  221 (352)
Q Consensus       180 ------------~~v~~yd~~t~~W~~--~~~~~~~~~~~~~~~~~g~iyv~GG~~  221 (352)
                                  ..|+..|++-+-.+.  ++.+.....+|.+..-++.+|++||..
T Consensus       119 TTenWNsVvDC~P~VfLiDleFGC~tah~lpEl~dG~SFHvslar~D~VYilGGHs  174 (337)
T PF03089_consen  119 TTENWNSVVDCPPQVFLIDLEFGCCTAHTLPELQDGQSFHVSLARNDCVYILGGHS  174 (337)
T ss_pred             chhhcceeccCCCeEEEEeccccccccccchhhcCCeEEEEEEecCceEEEEccEE
Confidence                        556666766655442  333444556667777899999999875


No 64 
>PF08450 SGL:  SMP-30/Gluconolaconase/LRE-like region;  InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE. ; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=97.45  E-value=0.02  Score=49.08  Aligned_cols=186  Identities=16%  Similarity=0.152  Sum_probs=104.2

Q ss_pred             CCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcceeeeeEEE--CCEEEEEeCCCCCCceEEEeCCCCcEEEec
Q 045821          119 GKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCYFPCGVL--NQKIYCIGGLGDTHSWDVYDPRTNNWKLHT  196 (352)
Q Consensus       119 ~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~--~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~  196 (352)
                      ++.+|...      .....++++|+.+++-..+....    ..+++..  ++.+|+...    ....++|+.+++++.+.
T Consensus        11 ~g~l~~~D------~~~~~i~~~~~~~~~~~~~~~~~----~~G~~~~~~~g~l~v~~~----~~~~~~d~~~g~~~~~~   76 (246)
T PF08450_consen   11 DGRLYWVD------IPGGRIYRVDPDTGEVEVIDLPG----PNGMAFDRPDGRLYVADS----GGIAVVDPDTGKVTVLA   76 (246)
T ss_dssp             TTEEEEEE------TTTTEEEEEETTTTEEEEEESSS----EEEEEEECTTSEEEEEET----TCEEEEETTTTEEEEEE
T ss_pred             CCEEEEEE------cCCCEEEEEECCCCeEEEEecCC----CceEEEEccCCEEEEEEc----CceEEEecCCCcEEEEe
Confidence            57788774      23468999999998766543222    2233333  689999887    55678899999998886


Q ss_pred             CCC-----CcccceeEEEECCEEEEEccCCCC-CCc--eEEEEEeCCCCceeecCCCCCCCCCCcEEEECC-EEEEEecC
Q 045821          197 EPN-----IFTEIEDSFVMDGKIYIRCSASAA-TSH--VCALVYEPSTDSWLHADANMASGWRGPAVVVDD-ALYVLDQS  267 (352)
Q Consensus       197 ~~~-----~~~~~~~~~~~~g~iyv~GG~~~~-~~~--~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~-~l~~~gg~  267 (352)
                      ..+     ..+....++.-+|+||+-.-.... ...  ..++++++. .+.+.+...... -.+.++..++ .||+....
T Consensus        77 ~~~~~~~~~~~~ND~~vd~~G~ly~t~~~~~~~~~~~~g~v~~~~~~-~~~~~~~~~~~~-pNGi~~s~dg~~lyv~ds~  154 (246)
T PF08450_consen   77 DLPDGGVPFNRPNDVAVDPDGNLYVTDSGGGGASGIDPGSVYRIDPD-GKVTVVADGLGF-PNGIAFSPDGKTLYVADSF  154 (246)
T ss_dssp             EEETTCSCTEEEEEEEE-TTS-EEEEEECCBCTTCGGSEEEEEEETT-SEEEEEEEEESS-EEEEEEETTSSEEEEEETT
T ss_pred             eccCCCcccCCCceEEEcCCCCEEEEecCCCccccccccceEEECCC-CeEEEEecCccc-ccceEECCcchheeecccc
Confidence            542     111222334447899986322211 111  479999998 666555332111 1122334455 46665544


Q ss_pred             CCCeEEEEeCCCCc--eEEc---cCCCCCcCCCCceEEE-eCCEEEEEc-CCceEEEEEcCC
Q 045821          268 SGTKLMMWQKESRE--WSPV---GRLSTLLTRPPCKLVA-IGKTIFVIG-KGCSAVVIDVGN  322 (352)
Q Consensus       268 ~~~~v~~yd~~~~~--W~~~---~~~~~~~~~~~~~~~~-~~~~i~v~g-G~~~~~~~d~~~  322 (352)
                      .+ .|++||+....  +...   ..++... ...-++++ -++.||+.. +...+.+||++.
T Consensus       155 ~~-~i~~~~~~~~~~~~~~~~~~~~~~~~~-g~pDG~~vD~~G~l~va~~~~~~I~~~~p~G  214 (246)
T PF08450_consen  155 NG-RIWRFDLDADGGELSNRRVFIDFPGGP-GYPDGLAVDSDGNLWVADWGGGRIVVFDPDG  214 (246)
T ss_dssp             TT-EEEEEEEETTTCCEEEEEEEEE-SSSS-CEEEEEEEBTTS-EEEEEETTTEEEEEETTS
T ss_pred             cc-eeEEEeccccccceeeeeeEEEcCCCC-cCCCcceEcCCCCEEEEEcCCCEEEEECCCc
Confidence            44 89999886433  4322   2233211 01123333 367888863 455788999884


No 65 
>PF14870 PSII_BNR:  Photosynthesis system II assembly factor YCF48; PDB: 2XBG_A.
Probab=97.44  E-value=0.066  Score=47.12  Aligned_cols=236  Identities=11%  Similarity=0.113  Sum_probs=106.4

Q ss_pred             CCEEEEEEecCCCCceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEECCCCe
Q 045821           68 ETWIYAFCRDNKLERVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDASMNT  147 (352)
Q Consensus        68 ~~~l~~~gg~~~~~~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~  147 (352)
                      ++.-|++|-.    . .++.=+-  ...+|+....-...+......++...++..|++|-.       .-++.-.=.-.+
T Consensus        27 ~~~G~~VG~~----g-~il~T~D--GG~tW~~~~~~~~~~~~~~l~~I~f~~~~g~ivG~~-------g~ll~T~DgG~t   92 (302)
T PF14870_consen   27 PNHGWAVGAY----G-TILKTTD--GGKTWQPVSLDLDNPFDYHLNSISFDGNEGWIVGEP-------GLLLHTTDGGKT   92 (302)
T ss_dssp             SS-EEEEETT----T-EEEEESS--TTSS-EE-----S-----EEEEEEEETTEEEEEEET-------TEEEEESSTTSS
T ss_pred             CCEEEEEecC----C-EEEEECC--CCccccccccCCCccceeeEEEEEecCCceEEEcCC-------ceEEEecCCCCC
Confidence            5667777542    1 1222222  578999886432211122233445557889988731       122222223457


Q ss_pred             eEecCC-CCCcceeeeeEEE-CCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECCEEEEEccCCCCCC
Q 045821          148 WTDAAP-MSTARCYFPCGVL-NQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATS  225 (352)
Q Consensus       148 W~~~~~-~~~~r~~~~~~~~-~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~  225 (352)
                      |+.++. .+.|...+....+ ++.+.+++..   ..+++=.-.-.+|+.+..............-+|++++++ ..+   
T Consensus        93 W~~v~l~~~lpgs~~~i~~l~~~~~~l~~~~---G~iy~T~DgG~tW~~~~~~~~gs~~~~~r~~dG~~vavs-~~G---  165 (302)
T PF14870_consen   93 WERVPLSSKLPGSPFGITALGDGSAELAGDR---GAIYRTTDGGKTWQAVVSETSGSINDITRSSDGRYVAVS-SRG---  165 (302)
T ss_dssp             -EE----TT-SS-EEEEEEEETTEEEEEETT-----EEEESSTTSSEEEEE-S----EEEEEE-TTS-EEEEE-TTS---
T ss_pred             cEEeecCCCCCCCeeEEEEcCCCcEEEEcCC---CcEEEeCCCCCCeeEcccCCcceeEeEEECCCCcEEEEE-Ccc---
Confidence            999862 2344444444444 5677776653   334444444568998764333222222233467766664 221   


Q ss_pred             ceEEEEEeCCCCceeecCCCCCCCCCCcEEEECCEEEEEecCCCCeEEEEe--CCCCceEEccCCCCCcCCCC-ceEEE-
Q 045821          226 HVCALVYEPSTDSWLHADANMASGWRGPAVVVDDALYVLDQSSGTKLMMWQ--KESREWSPVGRLSTLLTRPP-CKLVA-  301 (352)
Q Consensus       226 ~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd--~~~~~W~~~~~~~~~~~~~~-~~~~~-  301 (352)
                       .-....|+....|+.......++...+...-++.|++++  .+..+..=|  -..++|.+-. .|....... ..++. 
T Consensus       166 -~~~~s~~~G~~~w~~~~r~~~~riq~~gf~~~~~lw~~~--~Gg~~~~s~~~~~~~~w~~~~-~~~~~~~~~~ld~a~~  241 (302)
T PF14870_consen  166 -NFYSSWDPGQTTWQPHNRNSSRRIQSMGFSPDGNLWMLA--RGGQIQFSDDPDDGETWSEPI-IPIKTNGYGILDLAYR  241 (302)
T ss_dssp             -SEEEEE-TT-SS-EEEE--SSS-EEEEEE-TTS-EEEEE--TTTEEEEEE-TTEEEEE---B--TTSS--S-EEEEEES
T ss_pred             -cEEEEecCCCccceEEccCccceehhceecCCCCEEEEe--CCcEEEEccCCCCcccccccc-CCcccCceeeEEEEec
Confidence             234567888889998876555555555566678888876  222666666  4457787732 222111111 22222 


Q ss_pred             eCCEEEEEcCCceEEEEEcCCcccccc
Q 045821          302 IGKTIFVIGKGCSAVVIDVGNIGNIGG  328 (352)
Q Consensus       302 ~~~~i~v~gG~~~~~~~d~~~~~w~~~  328 (352)
                      .++.+++.||.-..+.-.-.-.+|++.
T Consensus       242 ~~~~~wa~gg~G~l~~S~DgGktW~~~  268 (302)
T PF14870_consen  242 PPNEIWAVGGSGTLLVSTDGGKTWQKD  268 (302)
T ss_dssp             SSS-EEEEESTT-EEEESSTTSS-EE-
T ss_pred             CCCCEEEEeCCccEEEeCCCCccceEC
Confidence            368899999987666656667789874


No 66 
>PF12768 Rax2:  Cortical protein marker for cell polarity
Probab=97.32  E-value=0.018  Score=50.14  Aligned_cols=105  Identities=14%  Similarity=0.278  Sum_probs=65.3

Q ss_pred             CceEEEeCCCCcEEEecCCCCcccceeEEE-ECCEEEEEccCCCCC-CceEEEEEeCCCCceeecCCCC----CCCCCCc
Q 045821          180 HSWDVYDPRTNNWKLHTEPNIFTEIEDSFV-MDGKIYIRCSASAAT-SHVCALVYEPSTDSWLHADANM----ASGWRGP  253 (352)
Q Consensus       180 ~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~-~~g~iyv~GG~~~~~-~~~~i~~yd~~~~~W~~~~~~~----~~~~~~~  253 (352)
                      ..+-.||..+.+|..+..- ....-..... -++++|+.|-..-.+ ....+..||..+.+|+.++...    +.+....
T Consensus        16 ~~lC~yd~~~~qW~~~g~~-i~G~V~~l~~~~~~~Llv~G~ft~~~~~~~~la~yd~~~~~w~~~~~~~s~~ipgpv~a~   94 (281)
T PF12768_consen   16 PGLCLYDTDNSQWSSPGNG-ISGTVTDLQWASNNQLLVGGNFTLNGTNSSNLATYDFKNQTWSSLGGGSSNSIPGPVTAL   94 (281)
T ss_pred             CEEEEEECCCCEeecCCCC-ceEEEEEEEEecCCEEEEEEeeEECCCCceeEEEEecCCCeeeecCCcccccCCCcEEEE
Confidence            7788999999999987643 2222222333 377888886544434 3458899999999999886622    2221111


Q ss_pred             EEE--ECCEEEEEecCCC--CeEEEEeCCCCceEEccC
Q 045821          254 AVV--VDDALYVLDQSSG--TKLMMWQKESREWSPVGR  287 (352)
Q Consensus       254 ~~~--~~~~l~~~gg~~~--~~v~~yd~~~~~W~~~~~  287 (352)
                      .+.  ...++++.|....  ..+..||  ..+|..+..
T Consensus        95 ~~~~~d~~~~~~aG~~~~g~~~l~~~d--Gs~W~~i~~  130 (281)
T PF12768_consen   95 TFISNDGSNFWVAGRSANGSTFLMKYD--GSSWSSIGS  130 (281)
T ss_pred             EeeccCCceEEEeceecCCCceEEEEc--CCceEeccc
Confidence            111  2345666665322  4577775  778999975


No 67 
>PF07893 DUF1668:  Protein of unknown function (DUF1668);  InterPro: IPR012871 The hypothetical proteins found in this family are expressed by Oryza sativa (Rice) and are of unknown function. 
Probab=97.22  E-value=0.018  Score=51.89  Aligned_cols=119  Identities=21%  Similarity=0.199  Sum_probs=77.6

Q ss_pred             ECCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECCEEEEEccCCCCCC-------ceEEEEEeC----
Q 045821          166 LNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATS-------HVCALVYEP----  234 (352)
Q Consensus       166 ~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~-------~~~i~~yd~----  234 (352)
                      .+.+|+.++..   ....+||..|..-...|.+..+.....++.++++||++........       ...+..|+.    
T Consensus        75 ~gskIv~~d~~---~~t~vyDt~t~av~~~P~l~~pk~~pisv~VG~~LY~m~~~~~~~~~~~~~~~~FE~l~~~~~~~~  151 (342)
T PF07893_consen   75 HGSKIVAVDQS---GRTLVYDTDTRAVATGPRLHSPKRCPISVSVGDKLYAMDRSPFPEPAGRPDFPCFEALVYRPPPDD  151 (342)
T ss_pred             cCCeEEEEcCC---CCeEEEECCCCeEeccCCCCCCCcceEEEEeCCeEEEeeccCccccccCccceeEEEecccccccc
Confidence            47888888654   4478999999988777766666666677788999999966543322       123444442    


Q ss_pred             ----CCCceeecCCCCCCCCCC-----c--EEEE-CCEEEE-EecCCCCeEEEEeCCCCceEEccCC
Q 045821          235 ----STDSWLHADANMASGWRG-----P--AVVV-DDALYV-LDQSSGTKLMMWQKESREWSPVGRL  288 (352)
Q Consensus       235 ----~~~~W~~~~~~~~~~~~~-----~--~~~~-~~~l~~-~gg~~~~~v~~yd~~~~~W~~~~~~  288 (352)
                          ..-.|+.+++++-.....     .  -+++ +..|+| +.+.. ...+.||+++.+|+.+++-
T Consensus       152 ~~~~~~w~W~~LP~PPf~~~~~~~~~~i~sYavv~g~~I~vS~~~~~-~GTysfDt~~~~W~~~GdW  217 (342)
T PF07893_consen  152 PSPEESWSWRSLPPPPFVRDRRYSDYRITSYAVVDGRTIFVSVNGRR-WGTYSFDTESHEWRKHGDW  217 (342)
T ss_pred             ccCCCcceEEcCCCCCccccCCcccceEEEEEEecCCeEEEEecCCc-eEEEEEEcCCcceeeccce
Confidence                233688887644322221     1  1344 567777 43321 1489999999999999853


No 68 
>PRK13684 Ycf48-like protein; Provisional
Probab=97.08  E-value=0.21  Score=45.03  Aligned_cols=206  Identities=13%  Similarity=0.190  Sum_probs=102.0

Q ss_pred             CCCCeeEeccCCCcccCCcceEEEEEC-CEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcceeeeeEEECCEEE
Q 045821           93 TRRSWKMIHELPARTLRRKGMGFEVLG-KNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCYFPCGVLNQKIY  171 (352)
Q Consensus        93 ~~~~w~~~~~~~~~~~~~~~~~~~~~~-~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~~~~iy  171 (352)
                      ...+|+++.....  .+...+.+..++ +.+++.|.       ...+++=+-.-.+|+.+.... .-..+.+....+..+
T Consensus       117 gG~tW~~~~~~~~--~~~~~~~i~~~~~~~~~~~g~-------~G~i~~S~DgG~tW~~~~~~~-~g~~~~i~~~~~g~~  186 (334)
T PRK13684        117 GGKNWTRIPLSEK--LPGSPYLITALGPGTAEMATN-------VGAIYRTTDGGKNWEALVEDA-AGVVRNLRRSPDGKY  186 (334)
T ss_pred             CCCCCeEccCCcC--CCCCceEEEEECCCcceeeec-------cceEEEECCCCCCceeCcCCC-cceEEEEEECCCCeE
Confidence            4678988863211  111222333343 34555543       123444333456799886533 223344444444344


Q ss_pred             EEeCCCCCCceEE-EeCCCCcEEEecCCCCccccee-EEEECCEEEEEccCCCCCCceEEEEE--eCCCCceeecCCCCC
Q 045821          172 CIGGLGDTHSWDV-YDPRTNNWKLHTEPNIFTEIED-SFVMDGKIYIRCSASAATSHVCALVY--EPSTDSWLHADANMA  247 (352)
Q Consensus       172 v~gG~~~~~~v~~-yd~~t~~W~~~~~~~~~~~~~~-~~~~~g~iyv~GG~~~~~~~~~i~~y--d~~~~~W~~~~~~~~  247 (352)
                      ++.|..  ..++. .|....+|+.+... ....... +..-++.++++|...       ...+  +-...+|+.+..+..
T Consensus       187 v~~g~~--G~i~~s~~~gg~tW~~~~~~-~~~~l~~i~~~~~g~~~~vg~~G-------~~~~~s~d~G~sW~~~~~~~~  256 (334)
T PRK13684        187 VAVSSR--GNFYSTWEPGQTAWTPHQRN-SSRRLQSMGFQPDGNLWMLARGG-------QIRFNDPDDLESWSKPIIPEI  256 (334)
T ss_pred             EEEeCC--ceEEEEcCCCCCeEEEeeCC-CcccceeeeEcCCCCEEEEecCC-------EEEEccCCCCCccccccCCcc
Confidence            443322  12222 24445679988643 2233233 333478898885332       2334  233458998654322


Q ss_pred             -CCCCCcE--EEECCEEEEEecCCCCeEEEEeCCCCceEEccCCCCCcCCCCceEEE-eCCEEEEEcCCceEEEEEcC
Q 045821          248 -SGWRGPA--VVVDDALYVLDQSSGTKLMMWQKESREWSPVGRLSTLLTRPPCKLVA-IGKTIFVIGKGCSAVVIDVG  321 (352)
Q Consensus       248 -~~~~~~~--~~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~-~~~~i~v~gG~~~~~~~d~~  321 (352)
                       ......+  +..++.++++|.. + .++.-.....+|+.+...+. .......++. .++++|+.|..-.+..++..
T Consensus       257 ~~~~~l~~v~~~~~~~~~~~G~~-G-~v~~S~d~G~tW~~~~~~~~-~~~~~~~~~~~~~~~~~~~G~~G~il~~~~~  331 (334)
T PRK13684        257 TNGYGYLDLAYRTPGEIWAGGGN-G-TLLVSKDGGKTWEKDPVGEE-VPSNFYKIVFLDPEKGFVLGQRGVLLRYVGS  331 (334)
T ss_pred             ccccceeeEEEcCCCCEEEEcCC-C-eEEEeCCCCCCCeECCcCCC-CCcceEEEEEeCCCceEEECCCceEEEecCC
Confidence             1122222  2336678877754 3 45555556789999863222 1223344443 46778888765444445543


No 69 
>PF08450 SGL:  SMP-30/Gluconolaconase/LRE-like region;  InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE. ; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=96.91  E-value=0.1  Score=44.69  Aligned_cols=190  Identities=16%  Similarity=0.155  Sum_probs=101.5

Q ss_pred             CCEEEEEEecCCCCceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEE--CCEEEEEeCCCCCCCCCCeEEEEECCC
Q 045821           68 ETWIYAFCRDNKLERVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVL--GKNAYLLGGCGWSEDATSEVYCYDASM  145 (352)
Q Consensus        68 ~~~l~~~gg~~~~~~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~--~~~iyv~GG~~~~~~~~~~~~~~d~~t  145 (352)
                      ++.||.+.    ...-.++.+|+  .++.-..+. .+   .   ..+++..  ++.+|+...        ....++|+.+
T Consensus        11 ~g~l~~~D----~~~~~i~~~~~--~~~~~~~~~-~~---~---~~G~~~~~~~g~l~v~~~--------~~~~~~d~~~   69 (246)
T PF08450_consen   11 DGRLYWVD----IPGGRIYRVDP--DTGEVEVID-LP---G---PNGMAFDRPDGRLYVADS--------GGIAVVDPDT   69 (246)
T ss_dssp             TTEEEEEE----TTTTEEEEEET--TTTEEEEEE-SS---S---EEEEEEECTTSEEEEEET--------TCEEEEETTT
T ss_pred             CCEEEEEE----cCCCEEEEEEC--CCCeEEEEe-cC---C---CceEEEEccCCEEEEEEc--------CceEEEecCC
Confidence            56788772    23346788898  776554433 22   1   1333443  678888764        3346669999


Q ss_pred             CeeEecCCCC--C-cceee--eeEEECCEEEEEeCCCCC------CceEEEeCCCCcEEEecCCCCcccceeEEEECC-E
Q 045821          146 NTWTDAAPMS--T-ARCYF--PCGVLNQKIYCIGGLGDT------HSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDG-K  213 (352)
Q Consensus       146 ~~W~~~~~~~--~-~r~~~--~~~~~~~~iyv~gG~~~~------~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g-~  213 (352)
                      ++++.+...+  . +....  .++--+|.+|+..-....      ..++++++. .+.+.+... ......-+..-++ .
T Consensus        70 g~~~~~~~~~~~~~~~~~~ND~~vd~~G~ly~t~~~~~~~~~~~~g~v~~~~~~-~~~~~~~~~-~~~pNGi~~s~dg~~  147 (246)
T PF08450_consen   70 GKVTVLADLPDGGVPFNRPNDVAVDPDGNLYVTDSGGGGASGIDPGSVYRIDPD-GKVTVVADG-LGFPNGIAFSPDGKT  147 (246)
T ss_dssp             TEEEEEEEEETTCSCTEEEEEEEE-TTS-EEEEEECCBCTTCGGSEEEEEEETT-SEEEEEEEE-ESSEEEEEEETTSSE
T ss_pred             CcEEEEeeccCCCcccCCCceEEEcCCCCEEEEecCCCccccccccceEEECCC-CeEEEEecC-cccccceEECCcchh
Confidence            9998876653  1 22333  333336888886432221      358899998 665554321 1111222233355 5


Q ss_pred             EEEEccCCCCCCceEEEEEeCCCCc--eeecC---CCCCCCCC--CcEEEECCEEEEEecCCCCeEEEEeCCCCceEEcc
Q 045821          214 IYIRCSASAATSHVCALVYEPSTDS--WLHAD---ANMASGWR--GPAVVVDDALYVLDQSSGTKLMMWQKESREWSPVG  286 (352)
Q Consensus       214 iyv~GG~~~~~~~~~i~~yd~~~~~--W~~~~---~~~~~~~~--~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~  286 (352)
                      ||+.-.     ....|++|+.....  +....   ........  +.++-.+|.||+..- .+..|++||++...-..+.
T Consensus       148 lyv~ds-----~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~pDG~~vD~~G~l~va~~-~~~~I~~~~p~G~~~~~i~  221 (246)
T PF08450_consen  148 LYVADS-----FNGRIWRFDLDADGGELSNRRVFIDFPGGPGYPDGLAVDSDGNLWVADW-GGGRIVVFDPDGKLLREIE  221 (246)
T ss_dssp             EEEEET-----TTTEEEEEEEETTTCCEEEEEEEEE-SSSSCEEEEEEEBTTS-EEEEEE-TTTEEEEEETTSCEEEEEE
T ss_pred             eeeccc-----ccceeEEEeccccccceeeeeeEEEcCCCCcCCCcceEcCCCCEEEEEc-CCCEEEEECCCccEEEEEc
Confidence            777632     22368888875433  33221   11111111  122334788888643 2339999999966666564


No 70 
>TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol.
Probab=96.87  E-value=0.27  Score=42.92  Aligned_cols=182  Identities=15%  Similarity=0.079  Sum_probs=87.3

Q ss_pred             EEEEEeCCCCCCCCCCeEEEEECCCCeeEe-cCCCCCcceeeeeEEE-CC-EEEEEeCCCCCCceEEEeCCCCcEEEecC
Q 045821          121 NAYLLGGCGWSEDATSEVYCYDASMNTWTD-AAPMSTARCYFPCGVL-NQ-KIYCIGGLGDTHSWDVYDPRTNNWKLHTE  197 (352)
Q Consensus       121 ~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~-~~~~~~~r~~~~~~~~-~~-~iyv~gG~~~~~~v~~yd~~t~~W~~~~~  197 (352)
                      .+|+.++.      ...+.+||+.+++-.. +.....++   .++.. ++ .+|+.++.+  ..+.+||..+.+....-.
T Consensus         2 ~~~~s~~~------d~~v~~~d~~t~~~~~~~~~~~~~~---~l~~~~dg~~l~~~~~~~--~~v~~~d~~~~~~~~~~~   70 (300)
T TIGR03866         2 KAYVSNEK------DNTISVIDTATLEVTRTFPVGQRPR---GITLSKDGKLLYVCASDS--DTIQVIDLATGEVIGTLP   70 (300)
T ss_pred             cEEEEecC------CCEEEEEECCCCceEEEEECCCCCC---ceEECCCCCEEEEEECCC--CeEEEEECCCCcEEEecc
Confidence            45555543      3468888888775322 22111122   12222 33 577776543  678899998876543211


Q ss_pred             CCCcccceeEEEECC-EEEEEccCCCCCCceEEEEEeCCCCce-eecCCCCCCCCCCcEEEECCEEEEEecCCCCeEEEE
Q 045821          198 PNIFTEIEDSFVMDG-KIYIRCSASAATSHVCALVYEPSTDSW-LHADANMASGWRGPAVVVDDALYVLDQSSGTKLMMW  275 (352)
Q Consensus       198 ~~~~~~~~~~~~~~g-~iyv~GG~~~~~~~~~i~~yd~~~~~W-~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~y  275 (352)
                      ..... ...++.-++ .+|+.++..     ..+..||+.+.+- ..++..  .......+..++++++++...+..+..|
T Consensus        71 ~~~~~-~~~~~~~~g~~l~~~~~~~-----~~l~~~d~~~~~~~~~~~~~--~~~~~~~~~~dg~~l~~~~~~~~~~~~~  142 (300)
T TIGR03866        71 SGPDP-ELFALHPNGKILYIANEDD-----NLVTVIDIETRKVLAEIPVG--VEPEGMAVSPDGKIVVNTSETTNMAHFI  142 (300)
T ss_pred             CCCCc-cEEEECCCCCEEEEEcCCC-----CeEEEEECCCCeEEeEeeCC--CCcceEEECCCCCEEEEEecCCCeEEEE
Confidence            11111 111222244 566664322     2588888876542 222211  1112223344677777766554467778


Q ss_pred             eCCCCceEEccCCCCCcCCCCceEEEeCCE-EEEEcC-CceEEEEEcCCcc
Q 045821          276 QKESREWSPVGRLSTLLTRPPCKLVAIGKT-IFVIGK-GCSAVVIDVGNIG  324 (352)
Q Consensus       276 d~~~~~W~~~~~~~~~~~~~~~~~~~~~~~-i~v~gG-~~~~~~~d~~~~~  324 (352)
                      |..+..-...-....   +..+.....+++ +++-+. ...+.+||.++..
T Consensus       143 d~~~~~~~~~~~~~~---~~~~~~~s~dg~~l~~~~~~~~~v~i~d~~~~~  190 (300)
T TIGR03866       143 DTKTYEIVDNVLVDQ---RPRFAEFTADGKELWVSSEIGGTVSVIDVATRK  190 (300)
T ss_pred             eCCCCeEEEEEEcCC---CccEEEECCCCCEEEEEcCCCCEEEEEEcCcce
Confidence            887654322111111   111122223444 444432 2346678887654


No 71 
>PF05096 Glu_cyclase_2:  Glutamine cyclotransferase;  InterPro: IPR007788 This family of enzymes 2.3.2.5 from EC catalyse the cyclization of free L-glutamine and N-terminal glutaminyl residues in proteins to pyroglutamate (5-oxoproline) and pyroglutamyl residues respectively []. This family includes plant and bacterial enzymes and seems unrelated to the mammalian enzymes.; PDB: 3NOK_B 2FAW_A 2IWA_A 3NOM_A 3NOL_A 3MBR_X.
Probab=96.83  E-value=0.14  Score=43.79  Aligned_cols=156  Identities=18%  Similarity=0.081  Sum_probs=93.1

Q ss_pred             eeeeEE-ECCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCc
Q 045821          160 YFPCGV-LNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDS  238 (352)
Q Consensus       160 ~~~~~~-~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~  238 (352)
                      ..+... .++.+|.-.|....+.+..||+.|++-....+++......+++.++++||.+.     -.....+.||..+  
T Consensus        47 TQGL~~~~~g~LyESTG~yG~S~l~~~d~~tg~~~~~~~l~~~~FgEGit~~~d~l~qLT-----Wk~~~~f~yd~~t--  119 (264)
T PF05096_consen   47 TQGLEFLDDGTLYESTGLYGQSSLRKVDLETGKVLQSVPLPPRYFGEGITILGDKLYQLT-----WKEGTGFVYDPNT--  119 (264)
T ss_dssp             EEEEEEEETTEEEEEECSTTEEEEEEEETTTSSEEEEEE-TTT--EEEEEEETTEEEEEE-----SSSSEEEEEETTT--
T ss_pred             CccEEecCCCEEEEeCCCCCcEEEEEEECCCCcEEEEEECCccccceeEEEECCEEEEEE-----ecCCeEEEEcccc--
Confidence            344555 57899999887776899999999998766666777777778899999999993     1123688999975  


Q ss_pred             eeecCCCCCCCCCCcEE-EECCEEEEEecCCCCeEEEEeCCCCceEEccCCCCCc----CCCCceEEEeCCEEEE-EcCC
Q 045821          239 WLHADANMASGWRGPAV-VVDDALYVLDQSSGTKLMMWQKESREWSPVGRLSTLL----TRPPCKLVAIGKTIFV-IGKG  312 (352)
Q Consensus       239 W~~~~~~~~~~~~~~~~-~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~~~----~~~~~~~~~~~~~i~v-~gG~  312 (352)
                      .+.+... .....+-.. ..+..|++-.|.+  .++.+||++-+  .+..+....    ....--+-..+++||. +--.
T Consensus       120 l~~~~~~-~y~~EGWGLt~dg~~Li~SDGS~--~L~~~dP~~f~--~~~~i~V~~~g~pv~~LNELE~i~G~IyANVW~t  194 (264)
T PF05096_consen  120 LKKIGTF-PYPGEGWGLTSDGKRLIMSDGSS--RLYFLDPETFK--EVRTIQVTDNGRPVSNLNELEYINGKIYANVWQT  194 (264)
T ss_dssp             TEEEEEE-E-SSS--EEEECSSCEEEE-SSS--EEEEE-TTT-S--EEEEEE-EETTEE---EEEEEEETTEEEEEETTS
T ss_pred             ceEEEEE-ecCCcceEEEcCCCEEEEECCcc--ceEEECCcccc--eEEEEEEEECCEECCCcEeEEEEcCEEEEEeCCC
Confidence            3333321 111233344 4455677777754  89999998633  222221100    0111123345788885 5566


Q ss_pred             ceEEEEEcCCccccc
Q 045821          313 CSAVVIDVGNIGNIG  327 (352)
Q Consensus       313 ~~~~~~d~~~~~w~~  327 (352)
                      +.+..+|+++..-..
T Consensus       195 d~I~~Idp~tG~V~~  209 (264)
T PF05096_consen  195 DRIVRIDPETGKVVG  209 (264)
T ss_dssp             SEEEEEETTT-BEEE
T ss_pred             CeEEEEeCCCCeEEE
Confidence            677789999876433


No 72 
>PRK04792 tolB translocation protein TolB; Provisional
Probab=96.83  E-value=0.45  Score=44.82  Aligned_cols=148  Identities=11%  Similarity=0.065  Sum_probs=82.3

Q ss_pred             CCeEEEEECCCCeeEecCCCCCcceeeeeEEECCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECCE-
Q 045821          135 TSEVYCYDASMNTWTDAAPMSTARCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGK-  213 (352)
Q Consensus       135 ~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~-  213 (352)
                      ...++++|..+++-+.+...+..........-+..|++....+....++.+|..+++.+++..... ........-+|+ 
T Consensus       241 ~~~L~~~dl~tg~~~~lt~~~g~~~~~~wSPDG~~La~~~~~~g~~~Iy~~dl~tg~~~~lt~~~~-~~~~p~wSpDG~~  319 (448)
T PRK04792        241 KAEIFVQDIYTQVREKVTSFPGINGAPRFSPDGKKLALVLSKDGQPEIYVVDIATKALTRITRHRA-IDTEPSWHPDGKS  319 (448)
T ss_pred             CcEEEEEECCCCCeEEecCCCCCcCCeeECCCCCEEEEEEeCCCCeEEEEEECCCCCeEECccCCC-CccceEECCCCCE
Confidence            367999999988776665544221111111113356555433333679999999998887754211 111122333554 


Q ss_pred             EEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEEECC-EEEEEecCCC-CeEEEEeCCCCceEEccC
Q 045821          214 IYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVVVDD-ALYVLDQSSG-TKLMMWQKESREWSPVGR  287 (352)
Q Consensus       214 iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~-~l~~~gg~~~-~~v~~yd~~~~~W~~~~~  287 (352)
                      |++. ...  .....++.+|..+++++.+..... ........-++ .|++.+...+ ..++++|+.++....+..
T Consensus       320 I~f~-s~~--~g~~~Iy~~dl~~g~~~~Lt~~g~-~~~~~~~SpDG~~l~~~~~~~g~~~I~~~dl~~g~~~~lt~  391 (448)
T PRK04792        320 LIFT-SER--GGKPQIYRVNLASGKVSRLTFEGE-QNLGGSITPDGRSMIMVNRTNGKFNIARQDLETGAMQVLTS  391 (448)
T ss_pred             EEEE-ECC--CCCceEEEEECCCCCEEEEecCCC-CCcCeeECCCCCEEEEEEecCCceEEEEEECCCCCeEEccC
Confidence            4443 221  122478999999888887742111 11122333344 5555543322 478999999988877653


No 73 
>TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol.
Probab=96.72  E-value=0.35  Score=42.16  Aligned_cols=224  Identities=15%  Similarity=0.076  Sum_probs=102.9

Q ss_pred             EEEEEEecCCCCceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECC-EEEEEeCCCCCCCCCCeEEEEECCCCee
Q 045821           70 WIYAFCRDNKLERVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGK-NAYLLGGCGWSEDATSEVYCYDASMNTW  148 (352)
Q Consensus        70 ~l~~~gg~~~~~~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~-~iyv~GG~~~~~~~~~~~~~~d~~t~~W  148 (352)
                      .+|+.++.+    -.+.++|+  .++.-...-....  ..+  ......++ .+|+.++.      ...+.+||..+++.
T Consensus         2 ~~~~s~~~d----~~v~~~d~--~t~~~~~~~~~~~--~~~--~l~~~~dg~~l~~~~~~------~~~v~~~d~~~~~~   65 (300)
T TIGR03866         2 KAYVSNEKD----NTISVIDT--ATLEVTRTFPVGQ--RPR--GITLSKDGKLLYVCASD------SDTIQVIDLATGEV   65 (300)
T ss_pred             cEEEEecCC----CEEEEEEC--CCCceEEEEECCC--CCC--ceEECCCCCEEEEEECC------CCeEEEEECCCCcE
Confidence            466665531    25556687  6655332222211  111  11122233 56776642      35688999988765


Q ss_pred             Ee-cCCCCCcceeeeeEEE-C-CEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCccc-ceeEEEECCEEEEEccCCCCC
Q 045821          149 TD-AAPMSTARCYFPCGVL-N-QKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTE-IEDSFVMDGKIYIRCSASAAT  224 (352)
Q Consensus       149 ~~-~~~~~~~r~~~~~~~~-~-~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~-~~~~~~~~g~iyv~GG~~~~~  224 (352)
                      .. ++....+.   ..+.. + +.+|+.++.+  ..+.+||..+.+-  +...+.... ...+..-+|++++++....  
T Consensus        66 ~~~~~~~~~~~---~~~~~~~g~~l~~~~~~~--~~l~~~d~~~~~~--~~~~~~~~~~~~~~~~~dg~~l~~~~~~~--  136 (300)
T TIGR03866        66 IGTLPSGPDPE---LFALHPNGKILYIANEDD--NLVTVIDIETRKV--LAEIPVGVEPEGMAVSPDGKIVVNTSETT--  136 (300)
T ss_pred             EEeccCCCCcc---EEEECCCCCEEEEEcCCC--CeEEEEECCCCeE--EeEeeCCCCcceEEECCCCCEEEEEecCC--
Confidence            43 32211221   12222 3 4566665433  5789999987642  211111111 1122333677766643321  


Q ss_pred             CceEEEEEeCCCCceeecCCCCCCCCCCcEEEECCEEEEEecCCCCeEEEEeCCCCceE-Ecc----CCCCCcCCCCceE
Q 045821          225 SHVCALVYEPSTDSWLHADANMASGWRGPAVVVDDALYVLDQSSGTKLMMWQKESREWS-PVG----RLSTLLTRPPCKL  299 (352)
Q Consensus       225 ~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~-~~~----~~~~~~~~~~~~~  299 (352)
                        ..+..||..+.+-..... .........+..+++.+++++..+..+.+||.++.+.. .+.    ..+.. ......+
T Consensus       137 --~~~~~~d~~~~~~~~~~~-~~~~~~~~~~s~dg~~l~~~~~~~~~v~i~d~~~~~~~~~~~~~~~~~~~~-~~~~~~i  212 (300)
T TIGR03866       137 --NMAHFIDTKTYEIVDNVL-VDQRPRFAEFTADGKELWVSSEIGGTVSVIDVATRKVIKKITFEIPGVHPE-AVQPVGI  212 (300)
T ss_pred             --CeEEEEeCCCCeEEEEEE-cCCCccEEEECCCCCEEEEEcCCCCEEEEEEcCcceeeeeeeecccccccc-cCCccce
Confidence              235556776543322100 11111112233456555555443348999999876532 221    11110 0111122


Q ss_pred             --EEeCCEEEEE-cCCceEEEEEcCC
Q 045821          300 --VAIGKTIFVI-GKGCSAVVIDVGN  322 (352)
Q Consensus       300 --~~~~~~i~v~-gG~~~~~~~d~~~  322 (352)
                        ..-+..+|+. ++.+...++|.++
T Consensus       213 ~~s~dg~~~~~~~~~~~~i~v~d~~~  238 (300)
T TIGR03866       213 KLTKDGKTAFVALGPANRVAVVDAKT  238 (300)
T ss_pred             EECCCCCEEEEEcCCCCeEEEEECCC
Confidence              2224455664 4455677898864


No 74 
>KOG2055 consensus WD40 repeat protein [General function prediction only]
Probab=96.61  E-value=0.058  Score=48.78  Aligned_cols=148  Identities=11%  Similarity=0.144  Sum_probs=91.6

Q ss_pred             CCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCC--cccceeEEEECCE-EEEEccCCCCCCceEEEEEeCCCCceeecC
Q 045821          167 NQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNI--FTEIEDSFVMDGK-IYIRCSASAATSHVCALVYEPSTDSWLHAD  243 (352)
Q Consensus       167 ~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~--~~~~~~~~~~~g~-iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~  243 (352)
                      ...+.+++|.+..-.++.-|-+++.  .+..+..  .+-..+...-+|. ..+.+|..     ..++.||..+.+-+++.
T Consensus       224 ~~plllvaG~d~~lrifqvDGk~N~--~lqS~~l~~fPi~~a~f~p~G~~~i~~s~rr-----ky~ysyDle~ak~~k~~  296 (514)
T KOG2055|consen  224 TAPLLLVAGLDGTLRIFQVDGKVNP--KLQSIHLEKFPIQKAEFAPNGHSVIFTSGRR-----KYLYSYDLETAKVTKLK  296 (514)
T ss_pred             CCceEEEecCCCcEEEEEecCccCh--hheeeeeccCccceeeecCCCceEEEecccc-----eEEEEeecccccccccc
Confidence            3568888888754555555666665  3333222  1222222223555 44454444     36899999999998886


Q ss_pred             CCCCC---CCCCcEEEECCEEEEEecCCCCeEEEEeCCCCceEEccCCCCCcCCCCceEEEeCCEEEEEcCCceEEEEEc
Q 045821          244 ANMAS---GWRGPAVVVDDALYVLDQSSGTKLMMWQKESREWSPVGRLSTLLTRPPCKLVAIGKTIFVIGKGCSAVVIDV  320 (352)
Q Consensus       244 ~~~~~---~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~~~i~v~gG~~~~~~~d~  320 (352)
                      ++...   ......+..++...++.|..+ .|+..-..+++|..--.++.  ....+++...+.+|++.||.-.++++|+
T Consensus       297 ~~~g~e~~~~e~FeVShd~~fia~~G~~G-~I~lLhakT~eli~s~KieG--~v~~~~fsSdsk~l~~~~~~GeV~v~nl  373 (514)
T KOG2055|consen  297 PPYGVEEKSMERFEVSHDSNFIAIAGNNG-HIHLLHAKTKELITSFKIEG--VVSDFTFSSDSKELLASGGTGEVYVWNL  373 (514)
T ss_pred             CCCCcccchhheeEecCCCCeEEEcccCc-eEEeehhhhhhhhheeeecc--EEeeEEEecCCcEEEEEcCCceEEEEec
Confidence            54222   222334556777777777777 88888888888853322332  1233444455678999999889999999


Q ss_pred             CCcc
Q 045821          321 GNIG  324 (352)
Q Consensus       321 ~~~~  324 (352)
                      .++.
T Consensus       374 ~~~~  377 (514)
T KOG2055|consen  374 RQNS  377 (514)
T ss_pred             CCcc
Confidence            9875


No 75 
>PF12768 Rax2:  Cortical protein marker for cell polarity
Probab=96.58  E-value=0.19  Score=43.79  Aligned_cols=106  Identities=11%  Similarity=0.234  Sum_probs=65.1

Q ss_pred             CCCeEEEEECCCCeeEecCCCCCcceeeeeEEE-CCEEEEEeCCCC----CCceEEEeCCCCcEEEecCCC--Cccccee
Q 045821          134 ATSEVYCYDASMNTWTDAAPMSTARCYFPCGVL-NQKIYCIGGLGD----THSWDVYDPRTNNWKLHTEPN--IFTEIED  206 (352)
Q Consensus       134 ~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~-~~~iyv~gG~~~----~~~v~~yd~~t~~W~~~~~~~--~~~~~~~  206 (352)
                      .-..+..||..+.+|..+..--.... .++... ++++|+.|-...    ...+-.||..+.+|..++.-.  .-.....
T Consensus        14 ~C~~lC~yd~~~~qW~~~g~~i~G~V-~~l~~~~~~~Llv~G~ft~~~~~~~~la~yd~~~~~w~~~~~~~s~~ipgpv~   92 (281)
T PF12768_consen   14 PCPGLCLYDTDNSQWSSPGNGISGTV-TDLQWASNNQLLVGGNFTLNGTNSSNLATYDFKNQTWSSLGGGSSNSIPGPVT   92 (281)
T ss_pred             CCCEEEEEECCCCEeecCCCCceEEE-EEEEEecCCEEEEEEeeEECCCCceeEEEEecCCCeeeecCCcccccCCCcEE
Confidence            35678899999999999875422221 233333 677888775322    167889999999998887521  1111112


Q ss_pred             EE-E---ECCEEEEEccCCCCCCceEEEEEeCCCCceeecCC
Q 045821          207 SF-V---MDGKIYIRCSASAATSHVCALVYEPSTDSWLHADA  244 (352)
Q Consensus       207 ~~-~---~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~  244 (352)
                      +. +   -.+++++.|..  ......+..||  ..+|+.+..
T Consensus        93 a~~~~~~d~~~~~~aG~~--~~g~~~l~~~d--Gs~W~~i~~  130 (281)
T PF12768_consen   93 ALTFISNDGSNFWVAGRS--ANGSTFLMKYD--GSSWSSIGS  130 (281)
T ss_pred             EEEeeccCCceEEEecee--cCCCceEEEEc--CCceEeccc
Confidence            22 2   13468887654  23344677774  458988865


No 76 
>KOG2055 consensus WD40 repeat protein [General function prediction only]
Probab=96.57  E-value=0.12  Score=46.86  Aligned_cols=188  Identities=12%  Similarity=0.141  Sum_probs=100.4

Q ss_pred             CCEEEEEEecCCCCceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEECCCCe
Q 045821           68 ETWIYAFCRDNKLERVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDASMNT  147 (352)
Q Consensus        68 ~~~l~~~gg~~~~~~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~  147 (352)
                      ..-+++++|  ..+...+|..|-  .++.  .+..+.-.-.|......+..+....+++|.      ..-++.||..+.+
T Consensus       224 ~~plllvaG--~d~~lrifqvDG--k~N~--~lqS~~l~~fPi~~a~f~p~G~~~i~~s~r------rky~ysyDle~ak  291 (514)
T KOG2055|consen  224 TAPLLLVAG--LDGTLRIFQVDG--KVNP--KLQSIHLEKFPIQKAEFAPNGHSVIFTSGR------RKYLYSYDLETAK  291 (514)
T ss_pred             CCceEEEec--CCCcEEEEEecC--ccCh--hheeeeeccCccceeeecCCCceEEEeccc------ceEEEEeeccccc
Confidence            345666666  345677888887  6665  333332110222333333334436666654      3457899999998


Q ss_pred             eEecCCCCC-c---ceeeeeEEECCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECC-EEEEEccCCC
Q 045821          148 WTDAAPMST-A---RCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDG-KIYIRCSASA  222 (352)
Q Consensus       148 W~~~~~~~~-~---r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g-~iyv~GG~~~  222 (352)
                      -..+.++.. +   ...|.+. .++.+.++-|..  .-+......|+.|-.--.++..... .....++ .|++.||.. 
T Consensus       292 ~~k~~~~~g~e~~~~e~FeVS-hd~~fia~~G~~--G~I~lLhakT~eli~s~KieG~v~~-~~fsSdsk~l~~~~~~G-  366 (514)
T KOG2055|consen  292 VTKLKPPYGVEEKSMERFEVS-HDSNFIAIAGNN--GHIHLLHAKTKELITSFKIEGVVSD-FTFSSDSKELLASGGTG-  366 (514)
T ss_pred             cccccCCCCcccchhheeEec-CCCCeEEEcccC--ceEEeehhhhhhhhheeeeccEEee-EEEecCCcEEEEEcCCc-
Confidence            888876531 1   1223333 344556666644  3455666677777432222222111 2223344 567775544 


Q ss_pred             CCCceEEEEEeCCCCceeecCCCCCCCCCCcEE--EECCEEEEEecCCCCeEEEEeCCC
Q 045821          223 ATSHVCALVYEPSTDSWLHADANMASGWRGPAV--VVDDALYVLDQSSGTKLMMWQKES  279 (352)
Q Consensus       223 ~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~--~~~~~l~~~gg~~~~~v~~yd~~~  279 (352)
                           .||++|..++.-...-. -....++...  ..++.++.+|...| -|-+||.++
T Consensus       367 -----eV~v~nl~~~~~~~rf~-D~G~v~gts~~~S~ng~ylA~GS~~G-iVNIYd~~s  418 (514)
T KOG2055|consen  367 -----EVYVWNLRQNSCLHRFV-DDGSVHGTSLCISLNGSYLATGSDSG-IVNIYDGNS  418 (514)
T ss_pred             -----eEEEEecCCcceEEEEe-ecCccceeeeeecCCCceEEeccCcc-eEEEeccch
Confidence                 59999998875332211 1112233332  36778777776665 677887554


No 77 
>KOG0274 consensus Cdc4 and related F-box and WD-40 proteins [General function prediction only]
Probab=96.57  E-value=0.37  Score=46.19  Aligned_cols=285  Identities=14%  Similarity=0.101  Sum_probs=137.3

Q ss_pred             CCCccCCCcHHHHHHhhccCccccchhhhhhhhhhcccccChhh-HHH-HhhCCCC----CCEEEEEEecCCCCceeEEE
Q 045821           14 VVPLISGLPDDIALFCLARVPRRYHAVLKCVSSRWRGLLHSEEW-CAY-RRKHNLD----ETWIYAFCRDNKLERVCCYV   87 (352)
Q Consensus        14 ~~~~~~~lp~dl~~~il~r~p~~~~~~l~~v~k~W~~l~~s~~~-~~~-~~~~~~~----~~~l~~~gg~~~~~~~~~~~   87 (352)
                      ....+..||.|+...||..++.+.+..++.+|+.|+.+...... .+. +......    +..+-...+.........+.
T Consensus       104 ~~dfi~~lp~el~~~il~~Ld~~~l~~~~~v~~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ks~~~  183 (537)
T KOG0274|consen  104 QRDFLSLLPSELSLHILSFLDGRDLLAVRQVCRNWNKLLDDDKVWWRMCRELIGRLPPKCEKGLPLKSGFKGRPWKSFYR  183 (537)
T ss_pred             ccchhhcccchhcccccccCCHHHhhhhhhhcchhhhhhhccchhhhhhhhhcccCCcccCcccccccccccchhhhhhh
Confidence            35567889999999999999999999999999999988754432 211 1111111    11111111100000000000


Q ss_pred             -ecCCCCCCCeeEeccCCCc-ccC--CcceEEEEECCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcceeeee
Q 045821           88 -LDPNSTRRSWKMIHELPAR-TLR--RKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCYFPC  163 (352)
Q Consensus        88 -~d~~~~~~~w~~~~~~~~~-~~~--~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~  163 (352)
                       ..-  ....|......... ..+  ........+.+.. +.-|     .....+.+||..+..-...+.....-.-.+.
T Consensus       184 ~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~q~~~~~-~~~~-----s~~~tl~~~~~~~~~~i~~~l~GH~g~V~~l  255 (537)
T KOG0274|consen  184 RRFR--LSKNWRKLFRRGYKVLLGTDDHVVLCLQLHDGF-FKSG-----SDDSTLHLWDLNNGYLILTRLVGHFGGVWGL  255 (537)
T ss_pred             hhhh--ccccccccccccceeecccCcchhhhheeecCe-EEec-----CCCceeEEeecccceEEEeeccCCCCCceeE
Confidence             011  23344443322100 000  0111111111111 1122     2334556777777654433211111111222


Q ss_pred             EEE-CCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEE-CCEEEEEccCCCCCCceEEEEEeCCCCceee
Q 045821          164 GVL-NQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVM-DGKIYIRCSASAATSHVCALVYEPSTDSWLH  241 (352)
Q Consensus       164 ~~~-~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~-~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~  241 (352)
                      ... ++.+++.|..+  ..+-+.|..++....+-  . .+.....+.. .+...+.|+.+     +.+.++|..+.+-..
T Consensus       256 ~~~~~~~~lvsgS~D--~t~rvWd~~sg~C~~~l--~-gh~stv~~~~~~~~~~~sgs~D-----~tVkVW~v~n~~~l~  325 (537)
T KOG0274|consen  256 AFPSGGDKLVSGSTD--KTERVWDCSTGECTHSL--Q-GHTSSVRCLTIDPFLLVSGSRD-----NTVKVWDVTNGACLN  325 (537)
T ss_pred             EEecCCCEEEEEecC--CcEEeEecCCCcEEEEe--c-CCCceEEEEEccCceEeeccCC-----ceEEEEeccCcceEE
Confidence            222 35666666655  67778888888776542  2 2222222333 33333333333     367788887666554


Q ss_pred             cCCCCCCCCCCcEEEECCEEEEEecCCCCeEEEEeCCCCceEEccCCCCCcCCCCceEEEeCC-EEEEEcCCce-EEEEE
Q 045821          242 ADANMASGWRGPAVVVDDALYVLDQSSGTKLMMWQKESREWSPVGRLSTLLTRPPCKLVAIGK-TIFVIGKGCS-AVVID  319 (352)
Q Consensus       242 ~~~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~~-~i~v~gG~~~-~~~~d  319 (352)
                      +-..  +...--.+..++.+.+.|.+++ .|-+||+.+.+.  +..+.. ....-.++ .+++ ..++=|+.++ +.+.|
T Consensus       326 l~~~--h~~~V~~v~~~~~~lvsgs~d~-~v~VW~~~~~~c--l~sl~g-H~~~V~sl-~~~~~~~~~Sgs~D~~IkvWd  398 (537)
T KOG0274|consen  326 LLRG--HTGPVNCVQLDEPLLVSGSYDG-TVKVWDPRTGKC--LKSLSG-HTGRVYSL-IVDSENRLLSGSLDTTIKVWD  398 (537)
T ss_pred             Eecc--ccccEEEEEecCCEEEEEecCc-eEEEEEhhhcee--eeeecC-CcceEEEE-EecCcceEEeeeeccceEeec
Confidence            4221  1111112456788888888887 899999985444  333332 12222233 3444 4444444443 44678


Q ss_pred             cCCc
Q 045821          320 VGNI  323 (352)
Q Consensus       320 ~~~~  323 (352)
                      +++.
T Consensus       399 l~~~  402 (537)
T KOG0274|consen  399 LRTK  402 (537)
T ss_pred             CCch
Confidence            7776


No 78 
>cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto
Probab=96.53  E-value=0.42  Score=40.65  Aligned_cols=144  Identities=11%  Similarity=0.107  Sum_probs=64.5

Q ss_pred             CCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEE-CCEEEEEccCCCCCCceEEEEEeCCCCcee-ecCC
Q 045821          167 NQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVM-DGKIYIRCSASAATSHVCALVYEPSTDSWL-HADA  244 (352)
Q Consensus       167 ~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~-~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~-~~~~  244 (352)
                      ++..+++++.+  ..+.+||..++.-...-..... ........ ++++++.++..     ..+..||..+.+-. .+..
T Consensus        62 ~~~~l~~~~~~--~~i~i~~~~~~~~~~~~~~~~~-~i~~~~~~~~~~~~~~~~~~-----~~i~~~~~~~~~~~~~~~~  133 (289)
T cd00200          62 DGTYLASGSSD--KTIRLWDLETGECVRTLTGHTS-YVSSVAFSPDGRILSSSSRD-----KTIKVWDVETGKCLTTLRG  133 (289)
T ss_pred             CCCEEEEEcCC--CeEEEEEcCcccceEEEeccCC-cEEEEEEcCCCCEEEEecCC-----CeEEEEECCCcEEEEEecc
Confidence            34455555544  6788999887532211111111 11112222 34666665423     25778888744332 2221


Q ss_pred             CCCCCCCCcEEEECCEEEEEecCCCCeEEEEeCCCCceE-EccCCCCCcCCCCceEEE-eCC-EEEEEcCCceEEEEEcC
Q 045821          245 NMASGWRGPAVVVDDALYVLDQSSGTKLMMWQKESREWS-PVGRLSTLLTRPPCKLVA-IGK-TIFVIGKGCSAVVIDVG  321 (352)
Q Consensus       245 ~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~-~~~~~~~~~~~~~~~~~~-~~~-~i~v~gG~~~~~~~d~~  321 (352)
                       ............++.+++.+...+ .+.+||..+.+-. .+.. .   .....++.. .++ .+++.+....+.+||..
T Consensus       134 -~~~~i~~~~~~~~~~~l~~~~~~~-~i~i~d~~~~~~~~~~~~-~---~~~i~~~~~~~~~~~l~~~~~~~~i~i~d~~  207 (289)
T cd00200         134 -HTDWVNSVAFSPDGTFVASSSQDG-TIKLWDLRTGKCVATLTG-H---TGEVNSVAFSPDGEKLLSSSSDGTIKLWDLS  207 (289)
T ss_pred             -CCCcEEEEEEcCcCCEEEEEcCCC-cEEEEEccccccceeEec-C---ccccceEEECCCcCEEEEecCCCcEEEEECC
Confidence             111111111222345555554344 8999998754321 2211 1   111122222 233 45554444456688887


Q ss_pred             Ccc
Q 045821          322 NIG  324 (352)
Q Consensus       322 ~~~  324 (352)
                      ...
T Consensus       208 ~~~  210 (289)
T cd00200         208 TGK  210 (289)
T ss_pred             CCc
Confidence            533


No 79 
>PRK13684 Ycf48-like protein; Provisional
Probab=96.53  E-value=0.57  Score=42.20  Aligned_cols=212  Identities=15%  Similarity=0.162  Sum_probs=104.1

Q ss_pred             CCCCeeEecc-CCCcccCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCC-CCcceeeeeEEE-CCE
Q 045821           93 TRRSWKMIHE-LPARTLRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPM-STARCYFPCGVL-NQK  169 (352)
Q Consensus        93 ~~~~w~~~~~-~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~-~~~r~~~~~~~~-~~~  169 (352)
                      ...+|++... ++.  ....-..+...++..|+.|..       ..++.=+=.-.+|+.+... ..+........+ ++.
T Consensus        74 gG~tW~~~~~~~~~--~~~~l~~v~~~~~~~~~~G~~-------g~i~~S~DgG~tW~~~~~~~~~~~~~~~i~~~~~~~  144 (334)
T PRK13684         74 GGETWEERSLDLPE--ENFRLISISFKGDEGWIVGQP-------SLLLHTTDGGKNWTRIPLSEKLPGSPYLITALGPGT  144 (334)
T ss_pred             CCCCceECccCCcc--cccceeeeEEcCCcEEEeCCC-------ceEEEECCCCCCCeEccCCcCCCCCceEEEEECCCc
Confidence            4678998753 221  111222333345556665421       2233311123479888532 122222333333 345


Q ss_pred             EEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEE-CCEEEEEccCCCCCCceEEEE-EeCCCCceeecCCCCC
Q 045821          170 IYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVM-DGKIYIRCSASAATSHVCALV-YEPSTDSWLHADANMA  247 (352)
Q Consensus       170 iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~-~g~iyv~GG~~~~~~~~~i~~-yd~~~~~W~~~~~~~~  247 (352)
                      +|+.|..   ..+++=+-.-.+|+.+...... ....+... ++.++++ |..+     .++. .|....+|+.+.....
T Consensus       145 ~~~~g~~---G~i~~S~DgG~tW~~~~~~~~g-~~~~i~~~~~g~~v~~-g~~G-----~i~~s~~~gg~tW~~~~~~~~  214 (334)
T PRK13684        145 AEMATNV---GAIYRTTDGGKNWEALVEDAAG-VVRNLRRSPDGKYVAV-SSRG-----NFYSTWEPGQTAWTPHQRNSS  214 (334)
T ss_pred             ceeeecc---ceEEEECCCCCCceeCcCCCcc-eEEEEEECCCCeEEEE-eCCc-----eEEEEcCCCCCeEEEeeCCCc
Confidence            6666643   2344444446789988653322 22233333 4444444 3332     2333 2445568998865433


Q ss_pred             CCCCCcEEEECCEEEEEecCCCCeEEEE--eCCCCceEEccCCCCC-cCCCCceEEEe-CCEEEEEcCCceEEEEEcCCc
Q 045821          248 SGWRGPAVVVDDALYVLDQSSGTKLMMW--QKESREWSPVGRLSTL-LTRPPCKLVAI-GKTIFVIGKGCSAVVIDVGNI  323 (352)
Q Consensus       248 ~~~~~~~~~~~~~l~~~gg~~~~~v~~y--d~~~~~W~~~~~~~~~-~~~~~~~~~~~-~~~i~v~gG~~~~~~~d~~~~  323 (352)
                      ......+...++.++++|.. + .+ ++  +-...+|+.+.. |.. .....+++... ++.+++.|..-.++.-...-.
T Consensus       215 ~~l~~i~~~~~g~~~~vg~~-G-~~-~~~s~d~G~sW~~~~~-~~~~~~~~l~~v~~~~~~~~~~~G~~G~v~~S~d~G~  290 (334)
T PRK13684        215 RRLQSMGFQPDGNLWMLARG-G-QI-RFNDPDDLESWSKPII-PEITNGYGYLDLAYRTPGEIWAGGGNGTLLVSKDGGK  290 (334)
T ss_pred             ccceeeeEcCCCCEEEEecC-C-EE-EEccCCCCCccccccC-CccccccceeeEEEcCCCCEEEEcCCCeEEEeCCCCC
Confidence            33344444557888888754 3 33 34  344568997642 211 11222333333 667888876544444344455


Q ss_pred             cccc
Q 045821          324 GNIG  327 (352)
Q Consensus       324 ~w~~  327 (352)
                      +|..
T Consensus       291 tW~~  294 (334)
T PRK13684        291 TWEK  294 (334)
T ss_pred             CCeE
Confidence            7865


No 80 
>cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto
Probab=96.51  E-value=0.43  Score=40.57  Aligned_cols=183  Identities=10%  Similarity=0.083  Sum_probs=82.7

Q ss_pred             CCEEEEEeCCCCCCCCCCeEEEEECCCCeeE-ecCCCCCcceeeeeEEE-CCEEEEEeCCCCCCceEEEeCCCCcEEEec
Q 045821          119 GKNAYLLGGCGWSEDATSEVYCYDASMNTWT-DAAPMSTARCYFPCGVL-NQKIYCIGGLGDTHSWDVYDPRTNNWKLHT  196 (352)
Q Consensus       119 ~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~-~~~~~~~~r~~~~~~~~-~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~  196 (352)
                      ++..+++++.      ...+.+||..+++.. .+.....  .-..+... ++.+++.++.+  ..+.+||..+..-...-
T Consensus        62 ~~~~l~~~~~------~~~i~i~~~~~~~~~~~~~~~~~--~i~~~~~~~~~~~~~~~~~~--~~i~~~~~~~~~~~~~~  131 (289)
T cd00200          62 DGTYLASGSS------DKTIRLWDLETGECVRTLTGHTS--YVSSVAFSPDGRILSSSSRD--KTIKVWDVETGKCLTTL  131 (289)
T ss_pred             CCCEEEEEcC------CCeEEEEEcCcccceEEEeccCC--cEEEEEEcCCCCEEEEecCC--CeEEEEECCCcEEEEEe
Confidence            3345555553      356788888775321 1211111  11112222 34566666533  67889999865432221


Q ss_pred             CCCCcccceeEEEE-CCEEEEEccCCCCCCceEEEEEeCCCCce-eecCCCCCCCCCCcEEEECCEEEEEecCCCCeEEE
Q 045821          197 EPNIFTEIEDSFVM-DGKIYIRCSASAATSHVCALVYEPSTDSW-LHADANMASGWRGPAVVVDDALYVLDQSSGTKLMM  274 (352)
Q Consensus       197 ~~~~~~~~~~~~~~-~g~iyv~GG~~~~~~~~~i~~yd~~~~~W-~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~  274 (352)
                      .. ........... ++.+++.|+..     ..+..||..+.+- ..+.. ...........-+++.+++++..+ .+.+
T Consensus       132 ~~-~~~~i~~~~~~~~~~~l~~~~~~-----~~i~i~d~~~~~~~~~~~~-~~~~i~~~~~~~~~~~l~~~~~~~-~i~i  203 (289)
T cd00200         132 RG-HTDWVNSVAFSPDGTFVASSSQD-----GTIKLWDLRTGKCVATLTG-HTGEVNSVAFSPDGEKLLSSSSDG-TIKL  203 (289)
T ss_pred             cc-CCCcEEEEEEcCcCCEEEEEcCC-----CcEEEEEccccccceeEec-CccccceEEECCCcCEEEEecCCC-cEEE
Confidence            11 11111122223 34555553312     2477788764332 22211 111111112223444555665544 8999


Q ss_pred             EeCCCCceEEccCCCCCcCCCCceEEEe-CCEEEEEcC-CceEEEEEcCC
Q 045821          275 WQKESREWSPVGRLSTLLTRPPCKLVAI-GKTIFVIGK-GCSAVVIDVGN  322 (352)
Q Consensus       275 yd~~~~~W~~~~~~~~~~~~~~~~~~~~-~~~i~v~gG-~~~~~~~d~~~  322 (352)
                      ||..+.+....  +.. ......++... ++.+++.++ ...+.+||..+
T Consensus       204 ~d~~~~~~~~~--~~~-~~~~i~~~~~~~~~~~~~~~~~~~~i~i~~~~~  250 (289)
T cd00200         204 WDLSTGKCLGT--LRG-HENGVNSVAFSPDGYLLASGSEDGTIRVWDLRT  250 (289)
T ss_pred             EECCCCceecc--hhh-cCCceEEEEEcCCCcEEEEEcCCCcEEEEEcCC
Confidence            99976443222  211 01112223332 355666665 44567888875


No 81 
>PF09910 DUF2139:  Uncharacterized protein conserved in archaea (DUF2139);  InterPro: IPR016675 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=96.49  E-value=0.48  Score=40.89  Aligned_cols=169  Identities=12%  Similarity=0.119  Sum_probs=95.9

Q ss_pred             eccCCCcccCCcceEEEEECCEEEEEeCCCCC----------------CCCCCeEEEEECCCCe----eEecCCCCCcce
Q 045821          100 IHELPARTLRRKGMGFEVLGKNAYLLGGCGWS----------------EDATSEVYCYDASMNT----WTDAAPMSTARC  159 (352)
Q Consensus       100 ~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~----------------~~~~~~~~~~d~~t~~----W~~~~~~~~~r~  159 (352)
                      +.+.|.. ....+-++..+++.|| |||+--.                ....+.+..||..+++    |++--.-+...+
T Consensus        28 vG~~P~S-GGDTYNAV~~vDd~Iy-FGGWVHAPa~y~gk~~g~~~IdF~NKYSHVH~yd~e~~~VrLLWkesih~~~~Wa  105 (339)
T PF09910_consen   28 VGPPPTS-GGDTYNAVEWVDDFIY-FGGWVHAPAVYEGKGDGRATIDFRNKYSHVHEYDTENDSVRLLWKESIHDKTKWA  105 (339)
T ss_pred             ccCCCCC-CCccceeeeeecceEE-EeeeecCCceeeeccCCceEEEEeeccceEEEEEcCCCeEEEEEecccCCccccc
Confidence            4444432 3455566777777777 5775211                1235678999988887    654433332222


Q ss_pred             eeeeEE----ECCEEEEEeCCCCC-CceEEEeCCCCcEEEecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeC
Q 045821          160 YFPCGV----LNQKIYCIGGLGDT-HSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEP  234 (352)
Q Consensus       160 ~~~~~~----~~~~iyv~gG~~~~-~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~  234 (352)
                      +-.+-.    +++++++.=+-... -.++..|..++.=+++..-|..   -.+.++|...|-+  .+.......+.+||+
T Consensus       106 GEVSdIlYdP~~D~LLlAR~DGh~nLGvy~ldr~~g~~~~L~~~ps~---KG~~~~D~a~F~i--~~~~~g~~~i~~~Dl  180 (339)
T PF09910_consen  106 GEVSDILYDPYEDRLLLARADGHANLGVYSLDRRTGKAEKLSSNPSL---KGTLVHDYACFGI--NNFHKGVSGIHCLDL  180 (339)
T ss_pred             cchhheeeCCCcCEEEEEecCCcceeeeEEEcccCCceeeccCCCCc---CceEeeeeEEEec--cccccCCceEEEEEc
Confidence            211111    25677776443322 6778888888887777643332   2233344444433  333344558999999


Q ss_pred             CCCce--eecCCC-------CCCCCCCcEEEECCEEEEE-ecCCCCeEEEEeCCC
Q 045821          235 STDSW--LHADAN-------MASGWRGPAVVVDDALYVL-DQSSGTKLMMWQKES  279 (352)
Q Consensus       235 ~~~~W--~~~~~~-------~~~~~~~~~~~~~~~l~~~-gg~~~~~v~~yd~~~  279 (352)
                      .+++|  ...+..       ...+..+.++...+++|.+ +|    -+.+.||..
T Consensus       181 i~~~~~~e~f~~~~s~Dg~~~~~~~~G~~~s~ynR~faF~rG----Gi~vgnP~~  231 (339)
T PF09910_consen  181 ISGKWVIESFDVSLSVDGGPVIRPELGAMASAYNRLFAFVRG----GIFVGNPYN  231 (339)
T ss_pred             cCCeEEEEecccccCCCCCceEeeccccEEEEeeeEEEEEec----cEEEeCCCC
Confidence            99999  444321       1123334456677777755 44    588888883


No 82 
>KOG0291 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=96.46  E-value=0.95  Score=43.92  Aligned_cols=154  Identities=12%  Similarity=0.112  Sum_probs=86.9

Q ss_pred             EECCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcceeeeeEEE--CCEEEEEeCCCCCCceEEEeCCCCc-EE
Q 045821          117 VLGKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCYFPCGVL--NQKIYCIGGLGDTHSWDVYDPRTNN-WK  193 (352)
Q Consensus       117 ~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~--~~~iyv~gG~~~~~~v~~yd~~t~~-W~  193 (352)
                      ..++.+.+.|+.++      .+-+||..++-....  +...-..++++.+  .++..+....+  ..|-.+|...-. ++
T Consensus       359 SpDgq~iaTG~eDg------KVKvWn~~SgfC~vT--FteHts~Vt~v~f~~~g~~llssSLD--GtVRAwDlkRYrNfR  428 (893)
T KOG0291|consen  359 SPDGQLIATGAEDG------KVKVWNTQSGFCFVT--FTEHTSGVTAVQFTARGNVLLSSSLD--GTVRAWDLKRYRNFR  428 (893)
T ss_pred             CCCCcEEEeccCCC------cEEEEeccCceEEEE--eccCCCceEEEEEEecCCEEEEeecC--CeEEeeeecccceee
Confidence            33667777776544      477788876642221  1122223333332  55666655554  567777776543 55


Q ss_pred             EecCCCCcccceeEEEEC--CEEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEEECCEEEEEecCCCCe
Q 045821          194 LHTEPNIFTEIEDSFVMD--GKIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVVVDDALYVLDQSSGTK  271 (352)
Q Consensus       194 ~~~~~~~~~~~~~~~~~~--g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~~~  271 (352)
                      ....|.+...  .++.+|  |.|.+.|+.+    ...|++++.++++--.+=.....+-++.++...+.+++-|..+. .
T Consensus       429 Tft~P~p~Qf--scvavD~sGelV~AG~~d----~F~IfvWS~qTGqllDiLsGHEgPVs~l~f~~~~~~LaS~SWDk-T  501 (893)
T KOG0291|consen  429 TFTSPEPIQF--SCVAVDPSGELVCAGAQD----SFEIFVWSVQTGQLLDILSGHEGPVSGLSFSPDGSLLASGSWDK-T  501 (893)
T ss_pred             eecCCCceee--eEEEEcCCCCEEEeeccc----eEEEEEEEeecCeeeehhcCCCCcceeeEEccccCeEEeccccc-e
Confidence            5543333222  334444  8999987554    24688888888877666443444444444555666777776665 7


Q ss_pred             EEEEeCCCCceEEccCC
Q 045821          272 LMMWQKESREWSPVGRL  288 (352)
Q Consensus       272 v~~yd~~~~~W~~~~~~  288 (352)
                      |-+||.-. +|.++..+
T Consensus       502 VRiW~if~-s~~~vEtl  517 (893)
T KOG0291|consen  502 VRIWDIFS-SSGTVETL  517 (893)
T ss_pred             EEEEEeec-cCceeeeE
Confidence            88888764 34444433


No 83 
>PF03089 RAG2:  Recombination activating protein 2;  InterPro: IPR004321 The variable portion of the genes encoding immunoglobulins and T cell receptors are assembled from component V, D, and J DNA segments by a site-specific recombination reaction termed V(D)J recombination. V(D)J recombination is targeted to specific sites on the chromosome by recombination signal sequences (RSSs) that flank antigen receptor gene segments. The RSS consists of a conserved heptamer (consensus, 5'-CACAGTG-3') and nonamer (consensus, 5'-ACAAAAACC-3') separated by a spacer of either 12 or 23 bp. Efficient recombination occurs between a 12-RSS and a 23-RSS, a restriction known as the 12/23 rule. V(D)J recombination can be divided into two phases, DNA cleavage and DNA joining. DNA cleavage requires two lymphocyte-specific factors, the products of the recombination activating genes, RAG1 and RAG2, which together recognise the RSSs and create double strand breaks at the RSS-coding segment junctions []. RAG-mediated DNA cleavage occurs in a synaptic complex termed the paired complex, which is constituted from two distinct RSS-RAG complexes, a 12-SC and a 23-SC (where SC stands for signal complex). The DNA cleavage reaction involves two distinct enzymatic steps, initial nicking that creates a 3'-OH between a coding segment and its RSS, followed by hairpin formation in which the newly created 3'-OH attacks a phosphodiester bond on the opposite DNA strand. This generates a blunt, 5' phosphorylated signal end containing all of the RSS elements, and a covalently sealed hairpin coding end.  The second phase of V(D)J recombination, in which broken DNA fragments are processed and joined, is less well characterised. Signal ends are typically joined precisely to form a signal joint, whereas joining of the coding ends requires the hairpin structure to be opened and typically involves nucleotide addition and deletion before formation of the coding joint. The factors involved in these processes include ubiquitously expressed proteins involved in the repair of DNA double strand breaks by nonhomologous end joining, terminal deoxynucleotidyl transferase, and Artemis protein. In addition to their critical roles in RSS recognition and DNA cleavage, the RAG proteins may perform two distinct types of functions in the postcleavage phase of V(D)J. A structural function has been inferred from the finding that, after DNA cleavage in vitro, the DNA ends remain associated with the RAG proteins in a "four end" complex known as the cleaved signal complex. After release of the coding ends in vitro, and after coding joint formation in vivo, the RAG proteins remain in a stable signal end complex (SEC) containing the two signal ends. These postcleavage complexes may serve as essential scaffolds for the second phase of the reaction, with the RAG proteins acting to organise the DNA processing and joining events.  The second type of RAG protein-mediated postcleavage activity is the catalysis of phosphodiester bond hydrolysis and strand transfer reactions. The RAG proteins are capable of opening hairpin coding ends in vitro. The RAG proteins also show 3' flap endonuclease activity that may contribute to coding end processing/joining and can utilise the 3' OH group on the signal ends to attack hairpin coding ends (forming hybrid or open/shut joints) or virtually any DNA duplex (forming a transposition product).; GO: 0003677 DNA binding, 0006310 DNA recombination, 0005634 nucleus
Probab=96.32  E-value=0.59  Score=40.05  Aligned_cols=70  Identities=20%  Similarity=0.346  Sum_probs=51.3

Q ss_pred             cCCcceEEEEEC--C--EEEEEeCCCCC-------------CCCCCeEEEEECCCCeeE--ecCCCCCcceeeeeEEECC
Q 045821          108 LRRKGMGFEVLG--K--NAYLLGGCGWS-------------EDATSEVYCYDASMNTWT--DAAPMSTARCYFPCGVLNQ  168 (352)
Q Consensus       108 ~~~~~~~~~~~~--~--~iyv~GG~~~~-------------~~~~~~~~~~d~~t~~W~--~~~~~~~~r~~~~~~~~~~  168 (352)
                      .+|++|++.++.  |  .+.+|||+.-.             ......++.+|+.-+..+  .++.+....++|.+..-++
T Consensus        86 ~aRYGHt~~vV~SrGKta~VlFGGRSY~P~~qRTTenWNsVvDC~P~VfLiDleFGC~tah~lpEl~dG~SFHvslar~D  165 (337)
T PF03089_consen   86 EARYGHTINVVHSRGKTACVLFGGRSYMPPGQRTTENWNSVVDCPPQVFLIDLEFGCCTAHTLPELQDGQSFHVSLARND  165 (337)
T ss_pred             cccccceEEEEEECCcEEEEEECCcccCCccccchhhcceeccCCCeEEEEeccccccccccchhhcCCeEEEEEEecCc
Confidence            789999988872  2  47888995321             124457788888877544  4566677788888888899


Q ss_pred             EEEEEeCCC
Q 045821          169 KIYCIGGLG  177 (352)
Q Consensus       169 ~iyv~gG~~  177 (352)
                      .+|++||..
T Consensus       166 ~VYilGGHs  174 (337)
T PF03089_consen  166 CVYILGGHS  174 (337)
T ss_pred             eEEEEccEE
Confidence            999999954


No 84 
>cd00094 HX Hemopexin-like repeats.; Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metalloproteinases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metalloproteinases (TIMPs). This CD contains 4 instances of the repeat.
Probab=96.29  E-value=0.5  Score=38.89  Aligned_cols=145  Identities=13%  Similarity=0.112  Sum_probs=77.9

Q ss_pred             EEEECCEEEEEeCCCCCCCCCCeEEEEECCCCe--eEecCC-CC-CcceeeeeEEE--CCEEEEEeCCCCCCceEEEeCC
Q 045821          115 FEVLGKNAYLLGGCGWSEDATSEVYCYDASMNT--WTDAAP-MS-TARCYFPCGVL--NQKIYCIGGLGDTHSWDVYDPR  188 (352)
Q Consensus       115 ~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~--W~~~~~-~~-~~r~~~~~~~~--~~~iyv~gG~~~~~~v~~yd~~  188 (352)
                      +....+.+|+|-|        ..+|+++.....  -+.+.. .+ .|..--++...  ++++|++-|    +.+++|+..
T Consensus        12 ~~~~~g~~y~FkG--------~~~w~~~~~~~~~~p~~I~~~w~~~p~~IDAa~~~~~~~~~yfFkg----~~yw~~~~~   79 (194)
T cd00094          12 VTTLRGELYFFKG--------RYFWRLSPGKPPGSPFLISSFWPSLPSPVDAAFERPDTGKIYFFKG----DKYWVYTGK   79 (194)
T ss_pred             EEEeCCEEEEEeC--------CEEEEEeCCCCCCCCeEhhhhCCCCCCCccEEEEECCCCEEEEECC----CEEEEEcCc
Confidence            3444689999976        466777764211  111211 11 11111122222  389999988    778999877


Q ss_pred             CCcEE---EecCCCCc---ccceeEEEE--CCEEEEEccCCCCCCceEEEEEeCCCCceeec-CCCCCCCC------CCc
Q 045821          189 TNNWK---LHTEPNIF---TEIEDSFVM--DGKIYIRCSASAATSHVCALVYEPSTDSWLHA-DANMASGW------RGP  253 (352)
Q Consensus       189 t~~W~---~~~~~~~~---~~~~~~~~~--~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~-~~~~~~~~------~~~  253 (352)
                      +....   .+......   ....++...  ++++|++-|..       .++||...++...- +......+      .-+
T Consensus        80 ~~~~~~Pk~i~~~~~~~~~~~iDAA~~~~~~~~~yfFkg~~-------y~ry~~~~~~v~~~yP~~i~~~w~g~p~~ida  152 (194)
T cd00094          80 NLEPGYPKPISDLGFPPTVKQIDAALRWPDNGKTYFFKGDK-------YWRYDEKTQKMDPGYPKLIETDFPGVPDKVDA  152 (194)
T ss_pred             ccccCCCcchhhcCCCCCCCCccEEEEEcCCCEEEEEeCCE-------EEEEeCCCccccCCCCcchhhcCCCcCCCcce
Confidence            52221   12111111   111233333  68999997654       78888765543211 11111011      122


Q ss_pred             EEEEC-CEEEEEecCCCCeEEEEeCCCCc
Q 045821          254 AVVVD-DALYVLDQSSGTKLMMWQKESRE  281 (352)
Q Consensus       254 ~~~~~-~~l~~~gg~~~~~v~~yd~~~~~  281 (352)
                      +.... +++|++.|.   ..++||..+++
T Consensus       153 a~~~~~~~~yfF~g~---~y~~~d~~~~~  178 (194)
T cd00094         153 AFRWLDGYYYFFKGD---QYWRFDPRSKE  178 (194)
T ss_pred             eEEeCCCcEEEEECC---EEEEEeCccce
Confidence            33344 899999886   89999998766


No 85 
>PLN02919 haloacid dehalogenase-like hydrolase family protein
Probab=96.26  E-value=1.9  Score=45.26  Aligned_cols=192  Identities=14%  Similarity=0.065  Sum_probs=103.8

Q ss_pred             CCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCC-------C------c--ceeeeeEEE--CCEEEEEeCCCCCCc
Q 045821          119 GKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMS-------T------A--RCYFPCGVL--NQKIYCIGGLGDTHS  181 (352)
Q Consensus       119 ~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~-------~------~--r~~~~~~~~--~~~iyv~gG~~~~~~  181 (352)
                      ++.||+....      ...+.++|+.++.-+.+..-.       .      .  ...+.+++-  ++.+||.....  ..
T Consensus       635 gn~LYVaDt~------n~~Ir~id~~~~~V~tlag~G~~g~~~~gg~~~~~~~ln~P~gVa~dp~~g~LyVad~~~--~~  706 (1057)
T PLN02919        635 KNLLYVADTE------NHALREIDFVNETVRTLAGNGTKGSDYQGGKKGTSQVLNSPWDVCFEPVNEKVYIAMAGQ--HQ  706 (1057)
T ss_pred             CCEEEEEeCC------CceEEEEecCCCEEEEEeccCcccCCCCCChhhhHhhcCCCeEEEEecCCCeEEEEECCC--Ce
Confidence            4678987542      245778888877655442210       0      0  011223333  57899876543  67


Q ss_pred             eEEEeCCCCcEEEecCCC----------C--cccce-eEEEE-C-CEEEEEccCCCCCCceEEEEEeCCCCceeecC--C
Q 045821          182 WDVYDPRTNNWKLHTEPN----------I--FTEIE-DSFVM-D-GKIYIRCSASAATSHVCALVYEPSTDSWLHAD--A  244 (352)
Q Consensus       182 v~~yd~~t~~W~~~~~~~----------~--~~~~~-~~~~~-~-g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~--~  244 (352)
                      +++||+.++....+..-.          .  ..... +.++. + +.||+....+     ..|.+||+.++....+.  .
T Consensus       707 I~v~d~~~g~v~~~~G~G~~~~~~g~~~~~~~~~~P~GIavspdG~~LYVADs~n-----~~Irv~D~~tg~~~~~~gg~  781 (1057)
T PLN02919        707 IWEYNISDGVTRVFSGDGYERNLNGSSGTSTSFAQPSGISLSPDLKELYIADSES-----SSIRALDLKTGGSRLLAGGD  781 (1057)
T ss_pred             EEEEECCCCeEEEEecCCccccCCCCccccccccCccEEEEeCCCCEEEEEECCC-----CeEEEEECCCCcEEEEEecc
Confidence            899998877654432100          0  00011 12222 3 4599985332     36889998766533211  0


Q ss_pred             C-----C-----------CCCC---CCcEEEECCEEEEEecCCCCeEEEEeCCCCceEEccCCCC----------CcCCC
Q 045821          245 N-----M-----------ASGW---RGPAVVVDDALYVLDQSSGTKLMMWQKESREWSPVGRLST----------LLTRP  295 (352)
Q Consensus       245 ~-----~-----------~~~~---~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~----------~~~~~  295 (352)
                      +     .           ....   .+.++.-+|.||+....++ .|.+||++++....+.....          .....
T Consensus       782 ~~~~~~l~~fG~~dG~g~~~~l~~P~Gvavd~dG~LYVADs~N~-rIrviD~~tg~v~tiaG~G~~G~~dG~~~~a~l~~  860 (1057)
T PLN02919        782 PTFSDNLFKFGDHDGVGSEVLLQHPLGVLCAKDGQIYVADSYNH-KIKKLDPATKRVTTLAGTGKAGFKDGKALKAQLSE  860 (1057)
T ss_pred             cccCcccccccCCCCchhhhhccCCceeeEeCCCcEEEEECCCC-EEEEEECCCCeEEEEeccCCcCCCCCcccccccCC
Confidence            0     0           0000   1122344678888876554 99999999988876653211          00111


Q ss_pred             CceEE-EeCCEEEEEcCCc-eEEEEEcCCcc
Q 045821          296 PCKLV-AIGKTIFVIGKGC-SAVVIDVGNIG  324 (352)
Q Consensus       296 ~~~~~-~~~~~i~v~gG~~-~~~~~d~~~~~  324 (352)
                      -.+++ ..+++|||....+ .+.++|+++..
T Consensus       861 P~GIavd~dG~lyVaDt~Nn~Irvid~~~~~  891 (1057)
T PLN02919        861 PAGLALGENGRLFVADTNNSLIRYLDLNKGE  891 (1057)
T ss_pred             ceEEEEeCCCCEEEEECCCCEEEEEECCCCc
Confidence            22333 3467899886544 56689988754


No 86 
>PRK00178 tolB translocation protein TolB; Provisional
Probab=96.22  E-value=1.1  Score=42.02  Aligned_cols=182  Identities=10%  Similarity=0.040  Sum_probs=95.3

Q ss_pred             ceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECC-EEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCccee
Q 045821           82 RVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGK-NAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCY  160 (352)
Q Consensus        82 ~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~-~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~  160 (352)
                      ...++++|.  .++.-+.+...+.    .........++ +|++....+    ...+++++|..+++.+.+......-..
T Consensus       222 ~~~l~~~~l--~~g~~~~l~~~~g----~~~~~~~SpDG~~la~~~~~~----g~~~Iy~~d~~~~~~~~lt~~~~~~~~  291 (430)
T PRK00178        222 RPRIFVQNL--DTGRREQITNFEG----LNGAPAWSPDGSKLAFVLSKD----GNPEIYVMDLASRQLSRVTNHPAIDTE  291 (430)
T ss_pred             CCEEEEEEC--CCCCEEEccCCCC----CcCCeEECCCCCEEEEEEccC----CCceEEEEECCCCCeEEcccCCCCcCC
Confidence            456788888  7777666654431    11111222233 454433221    136899999999988777643221111


Q ss_pred             eeeEEEC-CEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECC-EEEEEccCCCCCCceEEEEEeCCCCc
Q 045821          161 FPCGVLN-QKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDG-KIYIRCSASAATSHVCALVYEPSTDS  238 (352)
Q Consensus       161 ~~~~~~~-~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g-~iyv~GG~~~~~~~~~i~~yd~~~~~  238 (352)
                      +. ..-+ ..|++.........++.+|..++.++++.... .........-+| .|+.. ....  ....++.+|+.++.
T Consensus       292 ~~-~spDg~~i~f~s~~~g~~~iy~~d~~~g~~~~lt~~~-~~~~~~~~Spdg~~i~~~-~~~~--~~~~l~~~dl~tg~  366 (430)
T PRK00178        292 PF-WGKDGRTLYFTSDRGGKPQIYKVNVNGGRAERVTFVG-NYNARPRLSADGKTLVMV-HRQD--GNFHVAAQDLQRGS  366 (430)
T ss_pred             eE-ECCCCCEEEEEECCCCCceEEEEECCCCCEEEeecCC-CCccceEECCCCCEEEEE-EccC--CceEEEEEECCCCC
Confidence            11 1113 35655543322367899999988888775221 111111222344 45444 3221  23468999999988


Q ss_pred             eeecCCCCCCCCCCcEEEECCEEEEEecCC-C-CeEEEEeCCCC
Q 045821          239 WLHADANMASGWRGPAVVVDDALYVLDQSS-G-TKLMMWQKESR  280 (352)
Q Consensus       239 W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~-~-~~v~~yd~~~~  280 (352)
                      .+.+....  ........-+|+.+++.... + ..++..+...+
T Consensus       367 ~~~lt~~~--~~~~p~~spdg~~i~~~~~~~g~~~l~~~~~~g~  408 (430)
T PRK00178        367 VRILTDTS--LDESPSVAPNGTMLIYATRQQGRGVLMLVSINGR  408 (430)
T ss_pred             EEEccCCC--CCCCceECCCCCEEEEEEecCCceEEEEEECCCC
Confidence            88775421  11223444466555554322 2 35777777544


No 87 
>PRK04792 tolB translocation protein TolB; Provisional
Probab=96.22  E-value=1.1  Score=42.20  Aligned_cols=183  Identities=11%  Similarity=0.036  Sum_probs=95.8

Q ss_pred             CceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEEC-CEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcce
Q 045821           81 ERVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLG-KNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARC  159 (352)
Q Consensus        81 ~~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~-~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~  159 (352)
                      +...++.+|.  .++.-..+...+.   ... ......+ ..|++....+    ...+++.+|..+++.+.+........
T Consensus       240 g~~~L~~~dl--~tg~~~~lt~~~g---~~~-~~~wSPDG~~La~~~~~~----g~~~Iy~~dl~tg~~~~lt~~~~~~~  309 (448)
T PRK04792        240 RKAEIFVQDI--YTQVREKVTSFPG---ING-APRFSPDGKKLALVLSKD----GQPEIYVVDIATKALTRITRHRAIDT  309 (448)
T ss_pred             CCcEEEEEEC--CCCCeEEecCCCC---CcC-CeeECCCCCEEEEEEeCC----CCeEEEEEECCCCCeEECccCCCCcc
Confidence            3456888888  7777666655441   111 1122223 4465544322    23679999999998877764321111


Q ss_pred             eeeeEEECC-EEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECC-EEEEEccCCCCCCceEEEEEeCCCC
Q 045821          160 YFPCGVLNQ-KIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDG-KIYIRCSASAATSHVCALVYEPSTD  237 (352)
Q Consensus       160 ~~~~~~~~~-~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g-~iyv~GG~~~~~~~~~i~~yd~~~~  237 (352)
                      ... ..-++ .|++.........++.+|..+++++++.... .........-+| .|+.. +..  .....++.+|+.++
T Consensus       310 ~p~-wSpDG~~I~f~s~~~g~~~Iy~~dl~~g~~~~Lt~~g-~~~~~~~~SpDG~~l~~~-~~~--~g~~~I~~~dl~~g  384 (448)
T PRK04792        310 EPS-WHPDGKSLIFTSERGGKPQIYRVNLASGKVSRLTFEG-EQNLGGSITPDGRSMIMV-NRT--NGKFNIARQDLETG  384 (448)
T ss_pred             ceE-ECCCCCEEEEEECCCCCceEEEEECCCCCEEEEecCC-CCCcCeeECCCCCEEEEE-Eec--CCceEEEEEECCCC
Confidence            111 11244 4555443332378899999999988875211 111112333355 44444 332  22347889999988


Q ss_pred             ceeecCCCCCCCCCCcEEEECCE-EEEEecCCC-CeEEEEeCCCC
Q 045821          238 SWLHADANMASGWRGPAVVVDDA-LYVLDQSSG-TKLMMWQKESR  280 (352)
Q Consensus       238 ~W~~~~~~~~~~~~~~~~~~~~~-l~~~gg~~~-~~v~~yd~~~~  280 (352)
                      ..+.+.... . -......-+++ |++.....+ ..++.+|.+..
T Consensus       385 ~~~~lt~~~-~-d~~ps~spdG~~I~~~~~~~g~~~l~~~~~~G~  427 (448)
T PRK04792        385 AMQVLTSTR-L-DESPSVAPNGTMVIYSTTYQGKQVLAAVSIDGR  427 (448)
T ss_pred             CeEEccCCC-C-CCCceECCCCCEEEEEEecCCceEEEEEECCCC
Confidence            887765421 1 11223444454 444333333 35777787433


No 88 
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=96.19  E-value=0.88  Score=40.79  Aligned_cols=189  Identities=7%  Similarity=0.017  Sum_probs=88.1

Q ss_pred             EEEEEeCCCCCCCCCCeEEEEECCC-CeeEecCCCCCcceeeeeEEE-C-CEEEEEeCCCCCCceEEEeCC-CCcEEEec
Q 045821          121 NAYLLGGCGWSEDATSEVYCYDASM-NTWTDAAPMSTARCYFPCGVL-N-QKIYCIGGLGDTHSWDVYDPR-TNNWKLHT  196 (352)
Q Consensus       121 ~iyv~GG~~~~~~~~~~~~~~d~~t-~~W~~~~~~~~~r~~~~~~~~-~-~~iyv~gG~~~~~~v~~yd~~-t~~W~~~~  196 (352)
                      .+|+..+.      .+.+.+||..+ ++++.+...+.......++.- + ..+|+.+...  ..+..|+.. ++.++.+.
T Consensus         3 ~~y~~~~~------~~~I~~~~~~~~g~l~~~~~~~~~~~~~~l~~spd~~~lyv~~~~~--~~i~~~~~~~~g~l~~~~   74 (330)
T PRK11028          3 IVYIASPE------SQQIHVWNLNHEGALTLLQVVDVPGQVQPMVISPDKRHLYVGVRPE--FRVLSYRIADDGALTFAA   74 (330)
T ss_pred             EEEEEcCC------CCCEEEEEECCCCceeeeeEEecCCCCccEEECCCCCEEEEEECCC--CcEEEEEECCCCceEEee
Confidence            46666432      35677788754 466665544432222222222 3 4577755422  667777775 45666554


Q ss_pred             CCCCcccceeEEEE-CC-EEEEEccCCCCCCceEEEEEeCCCC-ce-eecCCCCC-CCCCCcEEEECCEEEEEecCCCCe
Q 045821          197 EPNIFTEIEDSFVM-DG-KIYIRCSASAATSHVCALVYEPSTD-SW-LHADANMA-SGWRGPAVVVDDALYVLDQSSGTK  271 (352)
Q Consensus       197 ~~~~~~~~~~~~~~-~g-~iyv~GG~~~~~~~~~i~~yd~~~~-~W-~~~~~~~~-~~~~~~~~~~~~~l~~~gg~~~~~  271 (352)
                      ..+.....++.+.. ++ .+|+.. ..    ...+.+||..++ .- ..+..... ..-+..++.-+++.+++.....+.
T Consensus        75 ~~~~~~~p~~i~~~~~g~~l~v~~-~~----~~~v~v~~~~~~g~~~~~~~~~~~~~~~~~~~~~p~g~~l~v~~~~~~~  149 (330)
T PRK11028         75 ESPLPGSPTHISTDHQGRFLFSAS-YN----ANCVSVSPLDKDGIPVAPIQIIEGLEGCHSANIDPDNRTLWVPCLKEDR  149 (330)
T ss_pred             eecCCCCceEEEECCCCCEEEEEE-cC----CCeEEEEEECCCCCCCCceeeccCCCcccEeEeCCCCCEEEEeeCCCCE
Confidence            33222222233333 44 566653 21    135666776532 11 11211111 111222233355444444444458


Q ss_pred             EEEEeCCCC-ceEEcc----CCCCCcCCCCceEEEeCCEEEEEcC-CceEEEEEcCC
Q 045821          272 LMMWQKESR-EWSPVG----RLSTLLTRPPCKLVAIGKTIFVIGK-GCSAVVIDVGN  322 (352)
Q Consensus       272 v~~yd~~~~-~W~~~~----~~~~~~~~~~~~~~~~~~~i~v~gG-~~~~~~~d~~~  322 (352)
                      |.+||..++ ......    ..+....-+...+..-+..+|+... ...+.+||+..
T Consensus       150 v~v~d~~~~g~l~~~~~~~~~~~~g~~p~~~~~~pdg~~lyv~~~~~~~v~v~~~~~  206 (330)
T PRK11028        150 IRLFTLSDDGHLVAQEPAEVTTVEGAGPRHMVFHPNQQYAYCVNELNSSVDVWQLKD  206 (330)
T ss_pred             EEEEEECCCCcccccCCCceecCCCCCCceEEECCCCCEEEEEecCCCEEEEEEEeC
Confidence            999998763 232110    1111111111222233456888764 55677888863


No 89 
>PF05096 Glu_cyclase_2:  Glutamine cyclotransferase;  InterPro: IPR007788 This family of enzymes 2.3.2.5 from EC catalyse the cyclization of free L-glutamine and N-terminal glutaminyl residues in proteins to pyroglutamate (5-oxoproline) and pyroglutamyl residues respectively []. This family includes plant and bacterial enzymes and seems unrelated to the mammalian enzymes.; PDB: 3NOK_B 2FAW_A 2IWA_A 3NOM_A 3NOL_A 3MBR_X.
Probab=96.13  E-value=0.31  Score=41.74  Aligned_cols=115  Identities=15%  Similarity=0.193  Sum_probs=77.8

Q ss_pred             ECCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEEECCEEEEEecCCCCeEEEEeCCCCceEEccCCC
Q 045821          210 MDGKIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVVVDDALYVLDQSSGTKLMMWQKESREWSPVGRLS  289 (352)
Q Consensus       210 ~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~  289 (352)
                      .+|.+|...|..+   .+.+..||+.+++-......+...+.-..+..+++||.+-=.++ ..++||+++  .+.+...+
T Consensus        54 ~~g~LyESTG~yG---~S~l~~~d~~tg~~~~~~~l~~~~FgEGit~~~d~l~qLTWk~~-~~f~yd~~t--l~~~~~~~  127 (264)
T PF05096_consen   54 DDGTLYESTGLYG---QSSLRKVDLETGKVLQSVPLPPRYFGEGITILGDKLYQLTWKEG-TGFVYDPNT--LKKIGTFP  127 (264)
T ss_dssp             ETTEEEEEECSTT---EEEEEEEETTTSSEEEEEE-TTT--EEEEEEETTEEEEEESSSS-EEEEEETTT--TEEEEEEE
T ss_pred             CCCEEEEeCCCCC---cEEEEEEECCCCcEEEEEECCccccceeEEEECCEEEEEEecCC-eEEEEcccc--ceEEEEEe
Confidence            5789999877654   35788999999886654443444444445678999999975444 899999975  66666555


Q ss_pred             CCcCCCCceEEEeCCEEEEEcCCceEEEEEcCCcccccceEEe
Q 045821          290 TLLTRPPCKLVAIGKTIFVIGKGCSAVVIDVGNIGNIGGIMVS  332 (352)
Q Consensus       290 ~~~~~~~~~~~~~~~~i~v~gG~~~~~~~d~~~~~w~~~~~~~  332 (352)
                      .  ..-+-+++..++.+++--|+...+..|++...-.+...++
T Consensus       128 y--~~EGWGLt~dg~~Li~SDGS~~L~~~dP~~f~~~~~i~V~  168 (264)
T PF05096_consen  128 Y--PGEGWGLTSDGKRLIMSDGSSRLYFLDPETFKEVRTIQVT  168 (264)
T ss_dssp             ---SSS--EEEECSSCEEEE-SSSEEEEE-TTT-SEEEEEE-E
T ss_pred             c--CCcceEEEcCCCEEEEECCccceEEECCcccceEEEEEEE
Confidence            4  2456678877888888889999999999987755544443


No 90 
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=96.06  E-value=1  Score=40.36  Aligned_cols=191  Identities=12%  Similarity=0.098  Sum_probs=85.6

Q ss_pred             CCEEEEEeCCCCCCCCCCeEEEEECC-CCeeEecCCCCCcceeeeeEEE-CC-EEEEEeCCCCCCceEEEeCCCCc--EE
Q 045821          119 GKNAYLLGGCGWSEDATSEVYCYDAS-MNTWTDAAPMSTARCYFPCGVL-NQ-KIYCIGGLGDTHSWDVYDPRTNN--WK  193 (352)
Q Consensus       119 ~~~iyv~GG~~~~~~~~~~~~~~d~~-t~~W~~~~~~~~~r~~~~~~~~-~~-~iyv~gG~~~~~~v~~yd~~t~~--W~  193 (352)
                      +..+|+.+.      ....+..|+.. ++++..+...+.+.....++.. ++ .+|+..-.+  ..+.+||..++.  .+
T Consensus        46 ~~~lyv~~~------~~~~i~~~~~~~~g~l~~~~~~~~~~~p~~i~~~~~g~~l~v~~~~~--~~v~v~~~~~~g~~~~  117 (330)
T PRK11028         46 KRHLYVGVR------PEFRVLSYRIADDGALTFAAESPLPGSPTHISTDHQGRFLFSASYNA--NCVSVSPLDKDGIPVA  117 (330)
T ss_pred             CCEEEEEEC------CCCcEEEEEECCCCceEEeeeecCCCCceEEEECCCCCEEEEEEcCC--CeEEEEEECCCCCCCC
Confidence            345676543      12556667765 4556554433322111222222 34 566654322  677788876431  11


Q ss_pred             EecCCCCcccceeEEEE-CC-EEEEEccCCCCCCceEEEEEeCCCC-ceeec-----CCCCCCCCCCcEEEECC-EEEEE
Q 045821          194 LHTEPNIFTEIEDSFVM-DG-KIYIRCSASAATSHVCALVYEPSTD-SWLHA-----DANMASGWRGPAVVVDD-ALYVL  264 (352)
Q Consensus       194 ~~~~~~~~~~~~~~~~~-~g-~iyv~GG~~~~~~~~~i~~yd~~~~-~W~~~-----~~~~~~~~~~~~~~~~~-~l~~~  264 (352)
                      .+...+.....+.++.. ++ .+|+..-     ....+..||..++ .....     ..+....-++.++.-++ .+|+.
T Consensus       118 ~~~~~~~~~~~~~~~~~p~g~~l~v~~~-----~~~~v~v~d~~~~g~l~~~~~~~~~~~~g~~p~~~~~~pdg~~lyv~  192 (330)
T PRK11028        118 PIQIIEGLEGCHSANIDPDNRTLWVPCL-----KEDRIRLFTLSDDGHLVAQEPAEVTTVEGAGPRHMVFHPNQQYAYCV  192 (330)
T ss_pred             ceeeccCCCcccEeEeCCCCCEEEEeeC-----CCCEEEEEEECCCCcccccCCCceecCCCCCCceEEECCCCCEEEEE
Confidence            22211111111222222 34 6777632     2236888887653 22211     00111111112233344 56666


Q ss_pred             ecCCCCeEEEEeCC--CCceEEc---cCCCCCc--CCCCceEE--EeCCEEEEEcC-CceEEEEEcCCc
Q 045821          265 DQSSGTKLMMWQKE--SREWSPV---GRLSTLL--TRPPCKLV--AIGKTIFVIGK-GCSAVVIDVGNI  323 (352)
Q Consensus       265 gg~~~~~v~~yd~~--~~~W~~~---~~~~~~~--~~~~~~~~--~~~~~i~v~gG-~~~~~~~d~~~~  323 (352)
                      .... +.+.+||.+  +++.+.+   ..+|...  .+....+.  .-+..+|+... .....+|+++.+
T Consensus       193 ~~~~-~~v~v~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~i~~~pdg~~lyv~~~~~~~I~v~~i~~~  260 (330)
T PRK11028        193 NELN-SSVDVWQLKDPHGEIECVQTLDMMPADFSDTRWAADIHITPDGRHLYACDRTASLISVFSVSED  260 (330)
T ss_pred             ecCC-CEEEEEEEeCCCCCEEEEEEEecCCCcCCCCccceeEEECCCCCEEEEecCCCCeEEEEEEeCC
Confidence            5433 388888776  4454433   3333321  12222222  33446888653 445667887654


No 91 
>PRK04922 tolB translocation protein TolB; Provisional
Probab=96.01  E-value=1.4  Score=41.37  Aligned_cols=180  Identities=12%  Similarity=0.007  Sum_probs=93.4

Q ss_pred             CCeEEEEECCCCeeEecCCCCCcceeeeeEEECC-EEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECCE
Q 045821          135 TSEVYCYDASMNTWTDAAPMSTARCYFPCGVLNQ-KIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGK  213 (352)
Q Consensus       135 ~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~~~-~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~  213 (352)
                      ...++++|..+++-..+...+...... ...-++ +|++....+....++++|..++.-+++..-.. ........-+|+
T Consensus       227 ~~~l~~~dl~~g~~~~l~~~~g~~~~~-~~SpDG~~l~~~~s~~g~~~Iy~~d~~~g~~~~lt~~~~-~~~~~~~spDG~  304 (433)
T PRK04922        227 RSAIYVQDLATGQRELVASFRGINGAP-SFSPDGRRLALTLSRDGNPEIYVMDLGSRQLTRLTNHFG-IDTEPTWAPDGK  304 (433)
T ss_pred             CcEEEEEECCCCCEEEeccCCCCccCc-eECCCCCEEEEEEeCCCCceEEEEECCCCCeEECccCCC-CccceEECCCCC
Confidence            467899999988877776544211111 111134 56554333323689999999888766643211 111122333555


Q ss_pred             EEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEEEC-CEEEEEecCCC-CeEEEEeCCCCceEEccCCCCC
Q 045821          214 IYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVVVD-DALYVLDQSSG-TKLMMWQKESREWSPVGRLSTL  291 (352)
Q Consensus       214 iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~-~~l~~~gg~~~-~~v~~yd~~~~~W~~~~~~~~~  291 (352)
                      -.++....  .....++.+|..+++.+.+.... .........-+ ..|++..+..+ ..++++|+.++....+..-+. 
T Consensus       305 ~l~f~sd~--~g~~~iy~~dl~~g~~~~lt~~g-~~~~~~~~SpDG~~Ia~~~~~~~~~~I~v~d~~~g~~~~Lt~~~~-  380 (433)
T PRK04922        305 SIYFTSDR--GGRPQIYRVAASGGSAERLTFQG-NYNARASVSPDGKKIAMVHGSGGQYRIAVMDLSTGSVRTLTPGSL-  380 (433)
T ss_pred             EEEEEECC--CCCceEEEEECCCCCeEEeecCC-CCccCEEECCCCCEEEEEECCCCceeEEEEECCCCCeEECCCCCC-
Confidence            33332222  11236888898888887764211 11112233334 45665544332 479999999888877653221 


Q ss_pred             cCCCCceEEEeCCEEEEE---cCCceEEEEEcCC
Q 045821          292 LTRPPCKLVAIGKTIFVI---GKGCSAVVIDVGN  322 (352)
Q Consensus       292 ~~~~~~~~~~~~~~i~v~---gG~~~~~~~d~~~  322 (352)
                        .........+..|+..   +|....+++|...
T Consensus       381 --~~~p~~spdG~~i~~~s~~~g~~~L~~~~~~g  412 (433)
T PRK04922        381 --DESPSFAPNGSMVLYATREGGRGVLAAVSTDG  412 (433)
T ss_pred             --CCCceECCCCCEEEEEEecCCceEEEEEECCC
Confidence              1122333333344433   2344566677753


No 92 
>KOG0310 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=95.99  E-value=0.44  Score=43.59  Aligned_cols=177  Identities=15%  Similarity=0.119  Sum_probs=91.5

Q ss_pred             CCeEEEEECCCCeeEe-cCCCCCcceeeeeEEECCEEEEEeCCCCCCceEEEeCCCCc-EEEecCCCCcccceeEEEECC
Q 045821          135 TSEVYCYDASMNTWTD-AAPMSTARCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTNN-WKLHTEPNIFTEIEDSFVMDG  212 (352)
Q Consensus       135 ~~~~~~~d~~t~~W~~-~~~~~~~r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~-W~~~~~~~~~~~~~~~~~~~g  212 (352)
                      +..+..|+..+..-.. +..+.. -.+......||++..+|+..  .-|.+||..+.. -+.+..-..+.........++
T Consensus        47 S~rvqly~~~~~~~~k~~srFk~-~v~s~~fR~DG~LlaaGD~s--G~V~vfD~k~r~iLR~~~ah~apv~~~~f~~~d~  123 (487)
T KOG0310|consen   47 SVRVQLYSSVTRSVRKTFSRFKD-VVYSVDFRSDGRLLAAGDES--GHVKVFDMKSRVILRQLYAHQAPVHVTKFSPQDN  123 (487)
T ss_pred             ccEEEEEecchhhhhhhHHhhcc-ceeEEEeecCCeEEEccCCc--CcEEEeccccHHHHHHHhhccCceeEEEecccCC
Confidence            3567778776653221 221111 11222223379999999876  578889955521 111111111111112334688


Q ss_pred             EEEEEccCCCCCCceEEEEEeCCCCcee-ecCCCCCCCCCCcEEEECCEEEEEecCCCCeEEEEeCCCCceEEccCCCCC
Q 045821          213 KIYIRCSASAATSHVCALVYEPSTDSWL-HADANMASGWRGPAVVVDDALYVLDQSSGTKLMMWQKESREWSPVGRLSTL  291 (352)
Q Consensus       213 ~iyv~GG~~~~~~~~~i~~yd~~~~~W~-~~~~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~~  291 (352)
                      .+++.|+-+.     .+..+|.++..-+ ++......-+++.....++.|++.||+++ .|-.||..+.+ ..+-.+...
T Consensus       124 t~l~s~sDd~-----v~k~~d~s~a~v~~~l~~htDYVR~g~~~~~~~hivvtGsYDg-~vrl~DtR~~~-~~v~elnhg  196 (487)
T KOG0310|consen  124 TMLVSGSDDK-----VVKYWDLSTAYVQAELSGHTDYVRCGDISPANDHIVVTGSYDG-KVRLWDTRSLT-SRVVELNHG  196 (487)
T ss_pred             eEEEecCCCc-----eEEEEEcCCcEEEEEecCCcceeEeeccccCCCeEEEecCCCc-eEEEEEeccCC-ceeEEecCC
Confidence            9999975442     3444555555422 22222222233344456888999999998 99999999874 333333221


Q ss_pred             cCCCCceEEE-eCCEEEEEcCCceEEEEEcCCc
Q 045821          292 LTRPPCKLVA-IGKTIFVIGKGCSAVVIDVGNI  323 (352)
Q Consensus       292 ~~~~~~~~~~-~~~~i~v~gG~~~~~~~d~~~~  323 (352)
                        ..--.++. .++.+.+..|.+.+-++|.-+.
T Consensus       197 --~pVe~vl~lpsgs~iasAgGn~vkVWDl~~G  227 (487)
T KOG0310|consen  197 --CPVESVLALPSGSLIASAGGNSVKVWDLTTG  227 (487)
T ss_pred             --CceeeEEEcCCCCEEEEcCCCeEEEEEecCC
Confidence              11112233 3445555444446667776643


No 93 
>PF10282 Lactonase:  Lactonase, 7-bladed beta-propeller;  InterPro: IPR019405  6-phosphogluconolactonases (6PGL) 3.1.1.31 from EC, which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate is opne of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE IPR022528 from INTERPRO [] and the Pseudomonas aeruginosa DevB IPR005900 from INTERPRO [] types.  This entry contains bacterial 6-phosphogluconolactonases (6PGL) YbhE-type 3.1.1.31 from EC which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase 5.5.1.5 from EC and muconate cycloisomerase 5.5.1.1 from EC, which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold [].; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H.
Probab=95.99  E-value=1.2  Score=40.40  Aligned_cols=238  Identities=14%  Similarity=0.202  Sum_probs=117.9

Q ss_pred             CCEEEEEEecC-CCCceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEE---CCEEEEEeCCCCCCCCCCeEEEEEC
Q 045821           68 ETWIYAFCRDN-KLERVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVL---GKNAYLLGGCGWSEDATSEVYCYDA  143 (352)
Q Consensus        68 ~~~l~~~gg~~-~~~~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~---~~~iyv~GG~~~~~~~~~~~~~~d~  143 (352)
                      ++.||+..... .......|..++  .+++.+.+...+.   .-..-+...+   +..+|+.--      ....+.+|+.
T Consensus        48 ~~~LY~~~e~~~~~g~v~~~~i~~--~~g~L~~~~~~~~---~g~~p~~i~~~~~g~~l~vany------~~g~v~v~~l  116 (345)
T PF10282_consen   48 GRRLYVVNEGSGDSGGVSSYRIDP--DTGTLTLLNSVPS---GGSSPCHIAVDPDGRFLYVANY------GGGSVSVFPL  116 (345)
T ss_dssp             SSEEEEEETTSSTTTEEEEEEEET--TTTEEEEEEEEEE---SSSCEEEEEECTTSSEEEEEET------TTTEEEEEEE
T ss_pred             CCEEEEEEccccCCCCEEEEEECC--CcceeEEeeeecc---CCCCcEEEEEecCCCEEEEEEc------cCCeEEEEEc
Confidence            78899886543 445667777777  7678877766541   1112222333   445666531      1356777777


Q ss_pred             CCC-eeEecC---------CC---CCcceeeeeEEE-C-CEEEEEeCCCCCCceEEEeCCCCc--EEEecC--CCCcccc
Q 045821          144 SMN-TWTDAA---------PM---STARCYFPCGVL-N-QKIYCIGGLGDTHSWDVYDPRTNN--WKLHTE--PNIFTEI  204 (352)
Q Consensus       144 ~t~-~W~~~~---------~~---~~~r~~~~~~~~-~-~~iyv~gG~~~~~~v~~yd~~t~~--W~~~~~--~~~~~~~  204 (352)
                      ..+ +-....         +-   ...-..|.+... + ..+|+..= .. ..+.+|+...+.  ......  .+....-
T Consensus       117 ~~~g~l~~~~~~~~~~g~g~~~~rq~~~h~H~v~~~pdg~~v~v~dl-G~-D~v~~~~~~~~~~~l~~~~~~~~~~G~GP  194 (345)
T PF10282_consen  117 DDDGSLGEVVQTVRHEGSGPNPDRQEGPHPHQVVFSPDGRFVYVPDL-GA-DRVYVYDIDDDTGKLTPVDSIKVPPGSGP  194 (345)
T ss_dssp             CTTSEEEEEEEEEESEEEESSTTTTSSTCEEEEEE-TTSSEEEEEET-TT-TEEEEEEE-TTS-TEEEEEEEECSTTSSE
T ss_pred             cCCcccceeeeecccCCCCCcccccccccceeEEECCCCCEEEEEec-CC-CEEEEEEEeCCCceEEEeeccccccCCCC
Confidence            653 222211         00   111223444444 3 46777531 11 678888887665  544221  2222222


Q ss_pred             eeEEEE-C-CEEEEEccCCCCCCceEEEEEeCCCCceeecCC--CCCCCC----CCcEEE--ECC-EEEEEecCCCCeEE
Q 045821          205 EDSFVM-D-GKIYIRCSASAATSHVCALVYEPSTDSWLHADA--NMASGW----RGPAVV--VDD-ALYVLDQSSGTKLM  273 (352)
Q Consensus       205 ~~~~~~-~-g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~--~~~~~~----~~~~~~--~~~-~l~~~gg~~~~~v~  273 (352)
                      .++++. + ..+|++...   .....++.++..+..++.+..  ..+...    ..+.+.  -++ .||+.... .+.|.
T Consensus       195 Rh~~f~pdg~~~Yv~~e~---s~~v~v~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~i~ispdg~~lyvsnr~-~~sI~  270 (345)
T PF10282_consen  195 RHLAFSPDGKYAYVVNEL---SNTVSVFDYDPSDGSLTEIQTISTLPEGFTGENAPAEIAISPDGRFLYVSNRG-SNSIS  270 (345)
T ss_dssp             EEEEE-TTSSEEEEEETT---TTEEEEEEEETTTTEEEEEEEEESCETTSCSSSSEEEEEE-TTSSEEEEEECT-TTEEE
T ss_pred             cEEEEcCCcCEEEEecCC---CCcEEEEeecccCCceeEEEEeeeccccccccCCceeEEEecCCCEEEEEecc-CCEEE
Confidence            244444 3 478998532   222345556655777765532  111111    122232  245 45655433 34888


Q ss_pred             EEeC--CCCceEEccCCCCC-cCCCCceEEEeCCEEEEEc-CCceEEEEEcCC
Q 045821          274 MWQK--ESREWSPVGRLSTL-LTRPPCKLVAIGKTIFVIG-KGCSAVVIDVGN  322 (352)
Q Consensus       274 ~yd~--~~~~W~~~~~~~~~-~~~~~~~~~~~~~~i~v~g-G~~~~~~~d~~~  322 (352)
                      +|+.  .+++.+.+...+.. ..-+.+.+..-++.||+.+ +.+.+.+|++..
T Consensus       271 vf~~d~~~g~l~~~~~~~~~G~~Pr~~~~s~~g~~l~Va~~~s~~v~vf~~d~  323 (345)
T PF10282_consen  271 VFDLDPATGTLTLVQTVPTGGKFPRHFAFSPDGRYLYVANQDSNTVSVFDIDP  323 (345)
T ss_dssp             EEEECTTTTTEEEEEEEEESSSSEEEEEE-TTSSEEEEEETTTTEEEEEEEET
T ss_pred             EEEEecCCCceEEEEEEeCCCCCccEEEEeCCCCEEEEEecCCCeEEEEEEeC
Confidence            8876  45677766644431 1112233333566777776 344566787653


No 94 
>PRK00178 tolB translocation protein TolB; Provisional
Probab=95.91  E-value=1.5  Score=41.01  Aligned_cols=146  Identities=8%  Similarity=0.016  Sum_probs=81.4

Q ss_pred             CCeEEEEECCCCeeEecCCCCCcceeeeeEEE-CC-EEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECC
Q 045821          135 TSEVYCYDASMNTWTDAAPMSTARCYFPCGVL-NQ-KIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDG  212 (352)
Q Consensus       135 ~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~-~~-~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g  212 (352)
                      ...++++|..+++-+.+...+..-.  ..... ++ +|++....+....++++|..++..+++..... ........-+|
T Consensus       222 ~~~l~~~~l~~g~~~~l~~~~g~~~--~~~~SpDG~~la~~~~~~g~~~Iy~~d~~~~~~~~lt~~~~-~~~~~~~spDg  298 (430)
T PRK00178        222 RPRIFVQNLDTGRREQITNFEGLNG--APAWSPDGSKLAFVLSKDGNPEIYVMDLASRQLSRVTNHPA-IDTEPFWGKDG  298 (430)
T ss_pred             CCEEEEEECCCCCEEEccCCCCCcC--CeEECCCCCEEEEEEccCCCceEEEEECCCCCeEEcccCCC-CcCCeEECCCC
Confidence            3579999999988777765432111  11111 33 45544332222689999999998887754221 11111223354


Q ss_pred             -EEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEEE-CCEEEEEecCCC-CeEEEEeCCCCceEEccC
Q 045821          213 -KIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVVV-DDALYVLDQSSG-TKLMMWQKESREWSPVGR  287 (352)
Q Consensus       213 -~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~-~~~l~~~gg~~~-~~v~~yd~~~~~W~~~~~  287 (352)
                       +|+.. ...  .....++.+|..+++++.+.... .........- ++.|++.....+ ..++++|..++..+.+..
T Consensus       299 ~~i~f~-s~~--~g~~~iy~~d~~~g~~~~lt~~~-~~~~~~~~Spdg~~i~~~~~~~~~~~l~~~dl~tg~~~~lt~  372 (430)
T PRK00178        299 RTLYFT-SDR--GGKPQIYKVNVNGGRAERVTFVG-NYNARPRLSADGKTLVMVHRQDGNFHVAAQDLQRGSVRILTD  372 (430)
T ss_pred             CEEEEE-ECC--CCCceEEEEECCCCCEEEeecCC-CCccceEECCCCCEEEEEEccCCceEEEEEECCCCCEEEccC
Confidence             45544 221  12236888999888887764311 1111122233 445555543322 468999999988887764


No 95 
>PRK05137 tolB translocation protein TolB; Provisional
Probab=95.88  E-value=1.6  Score=40.99  Aligned_cols=188  Identities=13%  Similarity=0.065  Sum_probs=97.5

Q ss_pred             ceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECC-EEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCccee
Q 045821           82 RVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGK-NAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCY  160 (352)
Q Consensus        82 ~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~-~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~  160 (352)
                      ...++.+|.  .++....+...+.   ... ......++ .|++....+    ...+++.+|..+++.+.+...+.....
T Consensus       225 ~~~i~~~dl--~~g~~~~l~~~~g---~~~-~~~~SPDG~~la~~~~~~----g~~~Iy~~d~~~~~~~~Lt~~~~~~~~  294 (435)
T PRK05137        225 RPRVYLLDL--ETGQRELVGNFPG---MTF-APRFSPDGRKVVMSLSQG----GNTDIYTMDLRSGTTTRLTDSPAIDTS  294 (435)
T ss_pred             CCEEEEEEC--CCCcEEEeecCCC---ccc-CcEECCCCCEEEEEEecC----CCceEEEEECCCCceEEccCCCCccCc
Confidence            356788888  8877777665442   111 11222344 454443322    246799999998887777543321111


Q ss_pred             eeeEEECC-EEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECC-EEEEEccCCCCCCceEEEEEeCCCCc
Q 045821          161 FPCGVLNQ-KIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDG-KIYIRCSASAATSHVCALVYEPSTDS  238 (352)
Q Consensus       161 ~~~~~~~~-~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g-~iyv~GG~~~~~~~~~i~~yd~~~~~  238 (352)
                      . ...-++ .|+..........++++|..+...+++.... .........-+| .|++. ...  .....++.+|+.+..
T Consensus       295 ~-~~spDG~~i~f~s~~~g~~~Iy~~d~~g~~~~~lt~~~-~~~~~~~~SpdG~~ia~~-~~~--~~~~~i~~~d~~~~~  369 (435)
T PRK05137        295 P-SYSPDGSQIVFESDRSGSPQLYVMNADGSNPRRISFGG-GRYSTPVWSPRGDLIAFT-KQG--GGQFSIGVMKPDGSG  369 (435)
T ss_pred             e-eEcCCCCEEEEEECCCCCCeEEEEECCCCCeEEeecCC-CcccCeEECCCCCEEEEE-EcC--CCceEEEEEECCCCc
Confidence            1 111244 4444332222267899998888777765321 111112233355 44443 222  123468888987766


Q ss_pred             eeecCCCCCCCCCCcEEEECCE-EEEEecCCC----CeEEEEeCCCCceEEcc
Q 045821          239 WLHADANMASGWRGPAVVVDDA-LYVLDQSSG----TKLMMWQKESREWSPVG  286 (352)
Q Consensus       239 W~~~~~~~~~~~~~~~~~~~~~-l~~~gg~~~----~~v~~yd~~~~~W~~~~  286 (352)
                      ...+....  ........-+|+ |++.....+    ..++.+|..+..-..+.
T Consensus       370 ~~~lt~~~--~~~~p~~spDG~~i~~~~~~~~~~~~~~L~~~dl~g~~~~~l~  420 (435)
T PRK05137        370 ERILTSGF--LVEGPTWAPNGRVIMFFRQTPGSGGAPKLYTVDLTGRNEREVP  420 (435)
T ss_pred             eEeccCCC--CCCCCeECCCCCEEEEEEccCCCCCcceEEEEECCCCceEEcc
Confidence            66554321  222334444554 444433222    37899998876655554


No 96 
>smart00284 OLF Olfactomedin-like domains.
Probab=95.86  E-value=1  Score=38.56  Aligned_cols=193  Identities=13%  Similarity=0.131  Sum_probs=105.7

Q ss_pred             CCEEEEEeCCCCCCCCCCeEEEEEC----CCCeeEecCCCCCcceeeeeEEECCEEEEEeCCCCCCceEEEeCCCCcEEE
Q 045821          119 GKNAYLLGGCGWSEDATSEVYCYDA----SMNTWTDAAPMSTARCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTNNWKL  194 (352)
Q Consensus       119 ~~~iyv~GG~~~~~~~~~~~~~~d~----~t~~W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~W~~  194 (352)
                      ++++|++.+...   ....+..|.-    ..++....-.+|.+-.+-+.++++|.+|.--...  ..+..||..+++-..
T Consensus        34 ~~~~wv~~~~~~---~~~~v~ey~~~~~f~~~~~~~~~~Lp~~~~GtG~VVYngslYY~~~~s--~~iiKydL~t~~v~~  108 (255)
T smart00284       34 KSLYWYMPLNTR---VLRSVREYSSMSDFQMGKNPTDHPLPHAGQGTGVVVYNGSLYFNKFNS--HDICRFDLTTETYQK  108 (255)
T ss_pred             CceEEEEccccC---CCcEEEEecCHHHHhccCCceEEECCCccccccEEEECceEEEEecCC--ccEEEEECCCCcEEE
Confidence            467787765421   1234445532    2333322223566666777888899999865433  789999999997643


Q ss_pred             ecCCCCcccc------------eeEEEECCEEEEEccCCCCCCceEEEEEeCCC----CceeecCCCCCCCCCCcEEEEC
Q 045821          195 HTEPNIFTEI------------EDSFVMDGKIYIRCSASAATSHVCALVYEPST----DSWLHADANMASGWRGPAVVVD  258 (352)
Q Consensus       195 ~~~~~~~~~~------------~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~----~~W~~~~~~~~~~~~~~~~~~~  258 (352)
                      ...+|.+...            --.++-++-|+++=..........+-..|+.+    .+|..   ..+.+..+.++++-
T Consensus       109 ~~~Lp~a~y~~~~~Y~~~~~sdiDlAvDE~GLWvIYat~~~~g~ivvSkLnp~tL~ve~tW~T---~~~k~sa~naFmvC  185 (255)
T smart00284      109 EPLLNGAGYNNRFPYAWGGFSDIDLAVDENGLWVIYATEQNAGKIVISKLNPATLTIENTWIT---TYNKRSASNAFMIC  185 (255)
T ss_pred             EEecCccccccccccccCCCccEEEEEcCCceEEEEeccCCCCCEEEEeeCcccceEEEEEEc---CCCcccccccEEEe
Confidence            3333332111            11233445566653333323334566777765    34654   23444455567777


Q ss_pred             CEEEEEecCC--CCe-EEEEeCCCCceEEccCCCCCcCCCCceEEE---eCCEEEEEcCCceEEEEEcC
Q 045821          259 DALYVLDQSS--GTK-LMMWQKESREWSPVGRLSTLLTRPPCKLVA---IGKTIFVIGKGCSAVVIDVG  321 (352)
Q Consensus       259 ~~l~~~gg~~--~~~-v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~---~~~~i~v~gG~~~~~~~d~~  321 (352)
                      |.||++....  ... .+.||+.+++=.. ..+|-.......+++.   -+.+||+.--. ....||+.
T Consensus       186 GvLY~~~s~~~~~~~I~yayDt~t~~~~~-~~i~f~n~y~~~s~l~YNP~d~~LY~wdng-~~l~Y~v~  252 (255)
T smart00284      186 GILYVTRSLGSKGEKVFYAYDTNTGKEGH-LDIPFENMYEYISMLDYNPNDRKLYAWNNG-HLVHYDIA  252 (255)
T ss_pred             eEEEEEccCCCCCcEEEEEEECCCCccce-eeeeeccccccceeceeCCCCCeEEEEeCC-eEEEEEEE
Confidence            9999996422  234 4899999876332 2233221222333333   36789988643 35556653


No 97 
>PF10282 Lactonase:  Lactonase, 7-bladed beta-propeller;  InterPro: IPR019405  6-phosphogluconolactonases (6PGL) 3.1.1.31 from EC, which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate is opne of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE IPR022528 from INTERPRO [] and the Pseudomonas aeruginosa DevB IPR005900 from INTERPRO [] types.  This entry contains bacterial 6-phosphogluconolactonases (6PGL) YbhE-type 3.1.1.31 from EC which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase 5.5.1.5 from EC and muconate cycloisomerase 5.5.1.1 from EC, which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold [].; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H.
Probab=95.84  E-value=1.4  Score=39.97  Aligned_cols=234  Identities=15%  Similarity=0.194  Sum_probs=116.0

Q ss_pred             EEecC--CCCceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEE--ECCCCeeE
Q 045821           74 FCRDN--KLERVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCY--DASMNTWT  149 (352)
Q Consensus        74 ~gg~~--~~~~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~--d~~t~~W~  149 (352)
                      +|+..  ..+....+.||.  .++++..+...... ......+...-++.||+.....   .....+..|  +..+++.+
T Consensus         4 vgsy~~~~~~gI~~~~~d~--~~g~l~~~~~~~~~-~~Ps~l~~~~~~~~LY~~~e~~---~~~g~v~~~~i~~~~g~L~   77 (345)
T PF10282_consen    4 VGSYTNGKGGGIYVFRFDE--ETGTLTLVQTVAEG-ENPSWLAVSPDGRRLYVVNEGS---GDSGGVSSYRIDPDTGTLT   77 (345)
T ss_dssp             EEECCSSSSTEEEEEEEET--TTTEEEEEEEEEES-SSECCEEE-TTSSEEEEEETTS---STTTEEEEEEEETTTTEEE
T ss_pred             EEcCCCCCCCcEEEEEEcC--CCCCceEeeeecCC-CCCceEEEEeCCCEEEEEEccc---cCCCCEEEEEECCCcceeE
Confidence            45443  234567788888  99999877753311 1112222222356788886432   123445444  45556777


Q ss_pred             ecCCCCC-cceeeeeEEE--CCEEEEEeCCCCCCceEEEeCCCC-cEEEec----------CC-CCcccceeEEEE--C-
Q 045821          150 DAAPMST-ARCYFPCGVL--NQKIYCIGGLGDTHSWDVYDPRTN-NWKLHT----------EP-NIFTEIEDSFVM--D-  211 (352)
Q Consensus       150 ~~~~~~~-~r~~~~~~~~--~~~iyv~gG~~~~~~v~~yd~~t~-~W~~~~----------~~-~~~~~~~~~~~~--~-  211 (352)
                      .+...+. +...+.++.-  +..+|+.- +.. ..+.+|+...+ .=....          .+ .......+.+.+  + 
T Consensus        78 ~~~~~~~~g~~p~~i~~~~~g~~l~van-y~~-g~v~v~~l~~~g~l~~~~~~~~~~g~g~~~~rq~~~h~H~v~~~pdg  155 (345)
T PF10282_consen   78 LLNSVPSGGSSPCHIAVDPDGRFLYVAN-YGG-GSVSVFPLDDDGSLGEVVQTVRHEGSGPNPDRQEGPHPHQVVFSPDG  155 (345)
T ss_dssp             EEEEEEESSSCEEEEEECTTSSEEEEEE-TTT-TEEEEEEECTTSEEEEEEEEEESEEEESSTTTTSSTCEEEEEE-TTS
T ss_pred             EeeeeccCCCCcEEEEEecCCCEEEEEE-ccC-CeEEEEEccCCcccceeeeecccCCCCCcccccccccceeEEECCCC
Confidence            7766553 2222222222  34566653 221 67888888764 211111          00 111222233333  3 


Q ss_pred             CEEEEEccCCCCCCceEEEEEeCCCCc--eeecCCCCCCCCCC---cEEEE-CCEEEEEecCCCCeEEEEeCC--CCceE
Q 045821          212 GKIYIRCSASAATSHVCALVYEPSTDS--WLHADANMASGWRG---PAVVV-DDALYVLDQSSGTKLMMWQKE--SREWS  283 (352)
Q Consensus       212 g~iyv~GG~~~~~~~~~i~~yd~~~~~--W~~~~~~~~~~~~~---~~~~~-~~~l~~~gg~~~~~v~~yd~~--~~~W~  283 (352)
                      ..+|+..     .....|+.|+...+.  .............+   .++.- +..+|++....+ .|.+|+..  +..++
T Consensus       156 ~~v~v~d-----lG~D~v~~~~~~~~~~~l~~~~~~~~~~G~GPRh~~f~pdg~~~Yv~~e~s~-~v~v~~~~~~~g~~~  229 (345)
T PF10282_consen  156 RFVYVPD-----LGADRVYVYDIDDDTGKLTPVDSIKVPPGSGPRHLAFSPDGKYAYVVNELSN-TVSVFDYDPSDGSLT  229 (345)
T ss_dssp             SEEEEEE-----TTTTEEEEEEE-TTS-TEEEEEEEECSTTSSEEEEEE-TTSSEEEEEETTTT-EEEEEEEETTTTEEE
T ss_pred             CEEEEEe-----cCCCEEEEEEEeCCCceEEEeeccccccCCCCcEEEEcCCcCEEEEecCCCC-cEEEEeecccCCcee
Confidence            3577763     122367777776554  54422111111111   12222 346888877655 77666555  66776


Q ss_pred             Ecc---CCCCCcCC--CCceEEEe--CCEEEEEc-CCceEEEEEcC
Q 045821          284 PVG---RLSTLLTR--PPCKLVAI--GKTIFVIG-KGCSAVVIDVG  321 (352)
Q Consensus       284 ~~~---~~~~~~~~--~~~~~~~~--~~~i~v~g-G~~~~~~~d~~  321 (352)
                      .+.   .+|.....  ....+..-  +..||+-. |.+.+.+|++.
T Consensus       230 ~~~~~~~~~~~~~~~~~~~~i~ispdg~~lyvsnr~~~sI~vf~~d  275 (345)
T PF10282_consen  230 EIQTISTLPEGFTGENAPAEIAISPDGRFLYVSNRGSNSISVFDLD  275 (345)
T ss_dssp             EEEEEESCETTSCSSSSEEEEEE-TTSSEEEEEECTTTEEEEEEEC
T ss_pred             EEEEeeeccccccccCCceeEEEecCCCEEEEEeccCCEEEEEEEe
Confidence            654   44432211  22333333  55688865 56667789984


No 98 
>TIGR02800 propeller_TolB tol-pal system beta propeller repeat protein TolB. The Tol-PAL system is required for bacterial outer membrane integrity. E. coli TolB is involved in the tonB-independent uptake of group A colicins (colicins A, E1, E2, E3 and K), and is necessary for the colicins to reach their respective targets after initial binding to the bacteria. It is also involved in uptake of filamentous DNA. Study of its structure suggest that the TolB protein might be involved in the recycling of peptidoglycan or in its covalent linking with lipoproteins. The Tol-Pal system is also implicated in pathogenesis of E. coli, Haemophilus ducreyi, Salmonella enterica and Vibrio cholerae, but the mechanism(s) is unclear.
Probab=95.78  E-value=1.6  Score=40.43  Aligned_cols=182  Identities=10%  Similarity=-0.007  Sum_probs=93.1

Q ss_pred             ceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECC-EEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCccee
Q 045821           82 RVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGK-NAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCY  160 (352)
Q Consensus        82 ~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~-~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~  160 (352)
                      ...++++|.  .++....+.....   .... .....++ .+++.....    ...+++.+|..++..+.+.........
T Consensus       213 ~~~i~v~d~--~~g~~~~~~~~~~---~~~~-~~~spDg~~l~~~~~~~----~~~~i~~~d~~~~~~~~l~~~~~~~~~  282 (417)
T TIGR02800       213 KPEIYVQDL--ATGQREKVASFPG---MNGA-PAFSPDGSKLAVSLSKD----GNPDIYVMDLDGKQLTRLTNGPGIDTE  282 (417)
T ss_pred             CcEEEEEEC--CCCCEEEeecCCC---Cccc-eEECCCCCEEEEEECCC----CCccEEEEECCCCCEEECCCCCCCCCC
Confidence            356788888  7776665554431   1111 1222244 465543322    236789999998877776543321111


Q ss_pred             eeeEEECC-EEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCce
Q 045821          161 FPCGVLNQ-KIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSW  239 (352)
Q Consensus       161 ~~~~~~~~-~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W  239 (352)
                      .. ..-++ +|++.........++.+|..+..++++.... .........-+|+.+++.....  ....++.+|+.+..+
T Consensus       283 ~~-~s~dg~~l~~~s~~~g~~~iy~~d~~~~~~~~l~~~~-~~~~~~~~spdg~~i~~~~~~~--~~~~i~~~d~~~~~~  358 (417)
T TIGR02800       283 PS-WSPDGKSIAFTSDRGGSPQIYMMDADGGEVRRLTFRG-GYNASPSWSPDGDLIAFVHREG--GGFNIAVMDLDGGGE  358 (417)
T ss_pred             EE-ECCCCCEEEEEECCCCCceEEEEECCCCCEEEeecCC-CCccCeEECCCCCEEEEEEccC--CceEEEEEeCCCCCe
Confidence            11 11144 4554443322267899999988887765221 1111122333566555544332  234789999988777


Q ss_pred             eecCCCCCCCCCCcEEEECCE-EEEEecCCC-CeEEEEeCCC
Q 045821          240 LHADANMASGWRGPAVVVDDA-LYVLDQSSG-TKLMMWQKES  279 (352)
Q Consensus       240 ~~~~~~~~~~~~~~~~~~~~~-l~~~gg~~~-~~v~~yd~~~  279 (352)
                      +.+....  .........+++ |++.....+ ..+++++.+.
T Consensus       359 ~~l~~~~--~~~~p~~spdg~~l~~~~~~~~~~~l~~~~~~g  398 (417)
T TIGR02800       359 RVLTDTG--LDESPSFAPNGRMILYATTRGGRGVLGLVSTDG  398 (417)
T ss_pred             EEccCCC--CCCCceECCCCCEEEEEEeCCCcEEEEEEECCC
Confidence            7665321  112233444444 444433322 3456666544


No 99 
>COG4257 Vgb Streptogramin lyase [Defense mechanisms]
Probab=95.73  E-value=1.1  Score=38.30  Aligned_cols=218  Identities=12%  Similarity=0.046  Sum_probs=120.5

Q ss_pred             EEecCCCCCCCeeEeccCCCcccCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCC-CCCcceeeeeE
Q 045821           86 YVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAP-MSTARCYFPCG  164 (352)
Q Consensus        86 ~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~-~~~~r~~~~~~  164 (352)
                      =.+||  .+++-...+--.   ..+.+..+.--++..++..+       ...+-++|+.+..-++.+- ...+-.....+
T Consensus        86 GhLdP--~tGev~~ypLg~---Ga~Phgiv~gpdg~~Witd~-------~~aI~R~dpkt~evt~f~lp~~~a~~nlet~  153 (353)
T COG4257          86 GHLDP--ATGEVETYPLGS---GASPHGIVVGPDGSAWITDT-------GLAIGRLDPKTLEVTRFPLPLEHADANLETA  153 (353)
T ss_pred             eecCC--CCCceEEEecCC---CCCCceEEECCCCCeeEecC-------cceeEEecCcccceEEeecccccCCCcccce
Confidence            35688  888887776433   12222222222455665532       2367889998887766642 12233445556


Q ss_pred             EEC--CEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCceeec
Q 045821          165 VLN--QKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWLHA  242 (352)
Q Consensus       165 ~~~--~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~  242 (352)
                      ++|  |.++..|-...   --+.||.++.-+..+.+....-+..++.-+|.+|+.- .    .-+.|-..|+.+..=+.+
T Consensus       154 vfD~~G~lWFt~q~G~---yGrLdPa~~~i~vfpaPqG~gpyGi~atpdGsvwyas-l----agnaiaridp~~~~aev~  225 (353)
T COG4257         154 VFDPWGNLWFTGQIGA---YGRLDPARNVISVFPAPQGGGPYGICATPDGSVWYAS-L----AGNAIARIDPFAGHAEVV  225 (353)
T ss_pred             eeCCCccEEEeecccc---ceecCcccCceeeeccCCCCCCcceEECCCCcEEEEe-c----cccceEEcccccCCccee
Confidence            664  56666654221   1267888776665554433333444566688888872 1    123577788887765555


Q ss_pred             CCCCC--CCCCCcEEEECCEEEEEecCCCCeEEEEeCCCCceEEccCCCCCcCCCCceEEEeCCEEEEE-cCCceEEEEE
Q 045821          243 DANMA--SGWRGPAVVVDDALYVLDQSSGTKLMMWQKESREWSPVGRLSTLLTRPPCKLVAIGKTIFVI-GKGCSAVVID  319 (352)
Q Consensus       243 ~~~~~--~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~~~i~v~-gG~~~~~~~d  319 (352)
                      +.+-.  .........-.+.+.+..-.. ..+++|||.+.+|.+-+ ||....+...-.+--.+.+++- -+.....-||
T Consensus       226 p~P~~~~~gsRriwsdpig~~wittwg~-g~l~rfdPs~~sW~eyp-LPgs~arpys~rVD~~grVW~sea~agai~rfd  303 (353)
T COG4257         226 PQPNALKAGSRRIWSDPIGRAWITTWGT-GSLHRFDPSVTSWIEYP-LPGSKARPYSMRVDRHGRVWLSEADAGAIGRFD  303 (353)
T ss_pred             cCCCcccccccccccCccCcEEEeccCC-ceeeEeCcccccceeee-CCCCCCCcceeeeccCCcEEeeccccCceeecC
Confidence            44222  111122233456777663222 38999999999999876 4443334444344445666652 2232333477


Q ss_pred             cCCccc
Q 045821          320 VGNIGN  325 (352)
Q Consensus       320 ~~~~~w  325 (352)
                      +++.+.
T Consensus       304 peta~f  309 (353)
T COG4257         304 PETARF  309 (353)
T ss_pred             cccceE
Confidence            776554


No 100
>KOG0310 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=95.73  E-value=1.3  Score=40.70  Aligned_cols=223  Identities=13%  Similarity=0.090  Sum_probs=107.6

Q ss_pred             CCEEEEEEecCCCCceeEEEecCCCCCCC-eeEeccCCCcccCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEECCCC
Q 045821           68 ETWIYAFCRDNKLERVCCYVLDPNSTRRS-WKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDASMN  146 (352)
Q Consensus        68 ~~~l~~~gg~~~~~~~~~~~~d~~~~~~~-w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~  146 (352)
                      ++.|++.|+.  .+.  ..+||.  .+.. ...+...+   .+-+.--....++.+++.|+-+      ..+..+|..+.
T Consensus        79 DG~LlaaGD~--sG~--V~vfD~--k~r~iLR~~~ah~---apv~~~~f~~~d~t~l~s~sDd------~v~k~~d~s~a  143 (487)
T KOG0310|consen   79 DGRLLAAGDE--SGH--VKVFDM--KSRVILRQLYAHQ---APVHVTKFSPQDNTMLVSGSDD------KVVKYWDLSTA  143 (487)
T ss_pred             CCeEEEccCC--cCc--EEEecc--ccHHHHHHHhhcc---CceeEEEecccCCeEEEecCCC------ceEEEEEcCCc
Confidence            7888888753  333  344565  4421 11121111   1222222344578888887622      23344555555


Q ss_pred             eeE-ecCCCCCcceeeeeEEECCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEE-CCEEEEEccCCCCC
Q 045821          147 TWT-DAAPMSTARCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVM-DGKIYIRCSASAAT  224 (352)
Q Consensus       147 ~W~-~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~-~g~iyv~GG~~~~~  224 (352)
                      .-. .+..-...-...+....++.|.+.|||+  ..+-.||..+.+ ..+-.+.....-..++.+ .|.+.+..|++   
T Consensus       144 ~v~~~l~~htDYVR~g~~~~~~~hivvtGsYD--g~vrl~DtR~~~-~~v~elnhg~pVe~vl~lpsgs~iasAgGn---  217 (487)
T KOG0310|consen  144 YVQAELSGHTDYVRCGDISPANDHIVVTGSYD--GKVRLWDTRSLT-SRVVELNHGCPVESVLALPSGSLIASAGGN---  217 (487)
T ss_pred             EEEEEecCCcceeEeeccccCCCeEEEecCCC--ceEEEEEeccCC-ceeEEecCCCceeeEEEcCCCCEEEEcCCC---
Confidence            421 1111111111223333478899999998  678889988774 223223322222233444 33555554444   


Q ss_pred             CceEEEEEeCCCCceeecCCCCCCCCCCcE--EEECCEEEEEecCCCCeEEEEeCCCCceEEccCCCCCcCCCCceEEEe
Q 045821          225 SHVCALVYEPSTDSWLHADANMASGWRGPA--VVVDDALYVLDQSSGTKLMMWQKESREWSPVGRLSTLLTRPPCKLVAI  302 (352)
Q Consensus       225 ~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~--~~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~  302 (352)
                         .+-++|..++.=. +.....+...-..  ..-++.=++-||.++ .+-+||  +..|..+-.+..+-+-- ...+..
T Consensus       218 ---~vkVWDl~~G~ql-l~~~~~H~KtVTcL~l~s~~~rLlS~sLD~-~VKVfd--~t~~Kvv~s~~~~~pvL-siavs~  289 (487)
T KOG0310|consen  218 ---SVKVWDLTTGGQL-LTSMFNHNKTVTCLRLASDSTRLLSGSLDR-HVKVFD--TTNYKVVHSWKYPGPVL-SIAVSP  289 (487)
T ss_pred             ---eEEEEEecCCcee-hhhhhcccceEEEEEeecCCceEeeccccc-ceEEEE--ccceEEEEeeeccccee-eEEecC
Confidence               3667776533211 1111111100001  112445566677766 899999  44566665432211111 122234


Q ss_pred             CCEEEEEcCCceEEEEE
Q 045821          303 GKTIFVIGKGCSAVVID  319 (352)
Q Consensus       303 ~~~i~v~gG~~~~~~~d  319 (352)
                      ++.-.++|..+....+.
T Consensus       290 dd~t~viGmsnGlv~~r  306 (487)
T KOG0310|consen  290 DDQTVVIGMSNGLVSIR  306 (487)
T ss_pred             CCceEEEecccceeeee
Confidence            77888888877666554


No 101
>TIGR02800 propeller_TolB tol-pal system beta propeller repeat protein TolB. The Tol-PAL system is required for bacterial outer membrane integrity. E. coli TolB is involved in the tonB-independent uptake of group A colicins (colicins A, E1, E2, E3 and K), and is necessary for the colicins to reach their respective targets after initial binding to the bacteria. It is also involved in uptake of filamentous DNA. Study of its structure suggest that the TolB protein might be involved in the recycling of peptidoglycan or in its covalent linking with lipoproteins. The Tol-Pal system is also implicated in pathogenesis of E. coli, Haemophilus ducreyi, Salmonella enterica and Vibrio cholerae, but the mechanism(s) is unclear.
Probab=95.70  E-value=1.8  Score=40.23  Aligned_cols=146  Identities=13%  Similarity=0.029  Sum_probs=79.5

Q ss_pred             CeEEEEECCCCeeEecCCCCCcceeeeeEEECC-EEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECCE-
Q 045821          136 SEVYCYDASMNTWTDAAPMSTARCYFPCGVLNQ-KIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGK-  213 (352)
Q Consensus       136 ~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~~~-~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~-  213 (352)
                      ..++++|..+++-..+........... ..-++ .+++.........++.+|..++..+.+....... ......-+|+ 
T Consensus       214 ~~i~v~d~~~g~~~~~~~~~~~~~~~~-~spDg~~l~~~~~~~~~~~i~~~d~~~~~~~~l~~~~~~~-~~~~~s~dg~~  291 (417)
T TIGR02800       214 PEIYVQDLATGQREKVASFPGMNGAPA-FSPDGSKLAVSLSKDGNPDIYVMDLDGKQLTRLTNGPGID-TEPSWSPDGKS  291 (417)
T ss_pred             cEEEEEECCCCCEEEeecCCCCccceE-ECCCCCEEEEEECCCCCccEEEEECCCCCEEECCCCCCCC-CCEEECCCCCE
Confidence            678999999887666554332222111 11244 5665544333367999999988877765322111 1112223554 


Q ss_pred             EEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEEECCEEEEEecCCC--CeEEEEeCCCCceEEccC
Q 045821          214 IYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVVVDDALYVLDQSSG--TKLMMWQKESREWSPVGR  287 (352)
Q Consensus       214 iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~--~~v~~yd~~~~~W~~~~~  287 (352)
                      |+.. ...  .....++.+|..+..+..+.... .........-+++.+++....+  ..++.+|+.++.+..+..
T Consensus       292 l~~~-s~~--~g~~~iy~~d~~~~~~~~l~~~~-~~~~~~~~spdg~~i~~~~~~~~~~~i~~~d~~~~~~~~l~~  363 (417)
T TIGR02800       292 IAFT-SDR--GGSPQIYMMDADGGEVRRLTFRG-GYNASPSWSPDGDLIAFVHREGGGFNIAVMDLDGGGERVLTD  363 (417)
T ss_pred             EEEE-ECC--CCCceEEEEECCCCCEEEeecCC-CCccCeEECCCCCEEEEEEccCCceEEEEEeCCCCCeEEccC
Confidence            4443 222  11236888999888887664321 1111223334555444443322  479999999877776653


No 102
>PF03178 CPSF_A:  CPSF A subunit region;  InterPro: IPR004871 This family includes a region that lies towards the C terminus of the cleavage and polyadenylation specificity factor (CPSF) A (160 kDa) subunit. CPSF is involved in mRNA polyadenylation and binds the AAUAAA conserved sequence in pre-mRNA. CPSF has also been found to be necessary for splicing of single-intron pre-mRNAs []. The function of the aligned region is unknown but may be involved in RNA/DNA binding.; GO: 0003676 nucleic acid binding, 0005634 nucleus; PDB: 2B5M_A 4A0K_C 4A0B_C 3I7L_A 3I8E_A 4A09_A 4A0A_A 3EI4_C 2B5L_A 3I7O_A ....
Probab=95.62  E-value=0.83  Score=40.88  Aligned_cols=133  Identities=13%  Similarity=0.169  Sum_probs=81.6

Q ss_pred             CeEEEEECCCC-----eeEecCCCCCcceeeeeEEECCEEEEEeCCCCCCceEEEeCCCCc-EEEecCCCCcccceeEEE
Q 045821          136 SEVYCYDASMN-----TWTDAAPMSTARCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTNN-WKLHTEPNIFTEIEDSFV  209 (352)
Q Consensus       136 ~~~~~~d~~t~-----~W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~-W~~~~~~~~~~~~~~~~~  209 (352)
                      ..+.+|+....     +.+.+.....+-.-.+.+.+++++.+..|    +.+.+|+...+. +.+.+............+
T Consensus        62 Gri~v~~i~~~~~~~~~l~~i~~~~~~g~V~ai~~~~~~lv~~~g----~~l~v~~l~~~~~l~~~~~~~~~~~i~sl~~  137 (321)
T PF03178_consen   62 GRILVFEISESPENNFKLKLIHSTEVKGPVTAICSFNGRLVVAVG----NKLYVYDLDNSKTLLKKAFYDSPFYITSLSV  137 (321)
T ss_dssp             EEEEEEEECSS-----EEEEEEEEEESS-EEEEEEETTEEEEEET----TEEEEEEEETTSSEEEEEEE-BSSSEEEEEE
T ss_pred             cEEEEEEEEcccccceEEEEEEEEeecCcceEhhhhCCEEEEeec----CEEEEEEccCcccchhhheecceEEEEEEec
Confidence            67889998884     56666544444445677778999888777    789999999888 887775444444445667


Q ss_pred             ECCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEEE-CCEEEEEecCCCCeE--EEEeC
Q 045821          210 MDGKIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVVV-DDALYVLDQSSGTKL--MMWQK  277 (352)
Q Consensus       210 ~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~-~~~l~~~gg~~~~~v--~~yd~  277 (352)
                      .++.|++..-.    ....+..|+....+-..+.......+..++..+ ++..++.+...+ .+  ..|++
T Consensus       138 ~~~~I~vgD~~----~sv~~~~~~~~~~~l~~va~d~~~~~v~~~~~l~d~~~~i~~D~~g-nl~~l~~~~  203 (321)
T PF03178_consen  138 FKNYILVGDAM----KSVSLLRYDEENNKLILVARDYQPRWVTAAEFLVDEDTIIVGDKDG-NLFVLRYNP  203 (321)
T ss_dssp             ETTEEEEEESS----SSEEEEEEETTTE-EEEEEEESS-BEEEEEEEE-SSSEEEEEETTS-EEEEEEE-S
T ss_pred             cccEEEEEEcc----cCEEEEEEEccCCEEEEEEecCCCccEEEEEEecCCcEEEEEcCCC-eEEEEEECC
Confidence            78877764222    223566788867777777654444443344444 555444444444 54  45554


No 103
>TIGR03075 PQQ_enz_alc_DH PQQ-dependent dehydrogenase, methanol/ethanol family. This protein family has a phylogenetic distribution very similar to that coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Genes in this family often are found adjacent to the PQQ biosynthesis genes themselves. An unusual, strained disulfide bond between adjacent Cys residues contributes to PQQ-binding, as does a Trp residue that is part of a PQQ enzyme repeat (see pfam01011). Characterized members include the dehydrogenase subunit of a membrane-anchored, three subunit alcohol (ethanol) dehydrogenase of Gluconobacter suboxydans, a homodimeric ethanol dehydrogenase in Pseudomonas aeruginosa, and the large subunit of an alpha2/beta2 heterotetrameric methanol dehydrogenase in Methylobacterium extorquens.
Probab=95.62  E-value=1.4  Score=42.32  Aligned_cols=115  Identities=12%  Similarity=0.193  Sum_probs=68.4

Q ss_pred             eeeEEECCEEEEEeCCCCCCceEEEeCCCC--cEEEecCCCCcc--------cceeEEEECCEEEEEccCCCCCCceEEE
Q 045821          161 FPCGVLNQKIYCIGGLGDTHSWDVYDPRTN--NWKLHTEPNIFT--------EIEDSFVMDGKIYIRCSASAATSHVCAL  230 (352)
Q Consensus       161 ~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~--~W~~~~~~~~~~--------~~~~~~~~~g~iyv~GG~~~~~~~~~i~  230 (352)
                      ...++.++.||+....   ..++.+|..|.  .|+.-...+...        .....++.+++||+.. ..     ..+.
T Consensus        63 stPvv~~g~vyv~s~~---g~v~AlDa~TGk~lW~~~~~~~~~~~~~~~~~~~~rg~av~~~~v~v~t-~d-----g~l~  133 (527)
T TIGR03075        63 SQPLVVDGVMYVTTSY---SRVYALDAKTGKELWKYDPKLPDDVIPVMCCDVVNRGVALYDGKVFFGT-LD-----ARLV  133 (527)
T ss_pred             cCCEEECCEEEEECCC---CcEEEEECCCCceeeEecCCCCcccccccccccccccceEECCEEEEEc-CC-----CEEE
Confidence            3456679999996653   46899999876  487643222111        1123456788888753 22     2589


Q ss_pred             EEeCCCCc--eeecCCCCCC--CCCCcEEEECCEEEEEec-----CCCCeEEEEeCCCCc--eEEc
Q 045821          231 VYEPSTDS--WLHADANMAS--GWRGPAVVVDDALYVLDQ-----SSGTKLMMWQKESRE--WSPV  285 (352)
Q Consensus       231 ~yd~~~~~--W~~~~~~~~~--~~~~~~~~~~~~l~~~gg-----~~~~~v~~yd~~~~~--W~~~  285 (352)
                      ++|..+++  |+.-......  ....+.++.++.+|+-..     .. ..|..||.++++  |+.-
T Consensus       134 ALDa~TGk~~W~~~~~~~~~~~~~tssP~v~~g~Vivg~~~~~~~~~-G~v~AlD~~TG~~lW~~~  198 (527)
T TIGR03075       134 ALDAKTGKVVWSKKNGDYKAGYTITAAPLVVKGKVITGISGGEFGVR-GYVTAYDAKTGKLVWRRY  198 (527)
T ss_pred             EEECCCCCEEeecccccccccccccCCcEEECCEEEEeecccccCCC-cEEEEEECCCCceeEecc
Confidence            99987764  8754221211  122233567888776532     12 379999998865  7533


No 104
>cd00216 PQQ_DH Dehydrogenases with pyrrolo-quinoline quinone (PQQ) as cofactor, like ethanol, methanol, and membrane bound glucose dehydrogenases. The alignment model contains an 8-bladed beta-propeller.
Probab=95.51  E-value=2.4  Score=40.46  Aligned_cols=116  Identities=15%  Similarity=0.203  Sum_probs=65.7

Q ss_pred             EEEECCEEEEEeCCCCCCCCCCeEEEEECCCCe--eEecCCCC----Ccc-eeeeeEEEC-CEEEEEeCCCCCCceEEEe
Q 045821          115 FEVLGKNAYLLGGCGWSEDATSEVYCYDASMNT--WTDAAPMS----TAR-CYFPCGVLN-QKIYCIGGLGDTHSWDVYD  186 (352)
Q Consensus       115 ~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~--W~~~~~~~----~~r-~~~~~~~~~-~~iyv~gG~~~~~~v~~yd  186 (352)
                      -++.++.+|+...       ...++.+|..+++  |+.-...+    .+. .....+..+ ++||+....   ..+..+|
T Consensus        57 Pvv~~g~vy~~~~-------~g~l~AlD~~tG~~~W~~~~~~~~~~~~~~~~~~g~~~~~~~~V~v~~~~---g~v~AlD  126 (488)
T cd00216          57 PLVVDGDMYFTTS-------HSALFALDAATGKVLWRYDPKLPADRGCCDVVNRGVAYWDPRKVFFGTFD---GRLVALD  126 (488)
T ss_pred             CEEECCEEEEeCC-------CCcEEEEECCCChhhceeCCCCCccccccccccCCcEEccCCeEEEecCC---CeEEEEE
Confidence            3566899998643       2568899998775  87543222    111 111234446 888875432   5789999


Q ss_pred             CCCC--cEEEecCCCC-cc--cceeEEEECCEEEEEccCCCC----CCceEEEEEeCCCC--ceee
Q 045821          187 PRTN--NWKLHTEPNI-FT--EIEDSFVMDGKIYIRCSASAA----TSHVCALVYEPSTD--SWLH  241 (352)
Q Consensus       187 ~~t~--~W~~~~~~~~-~~--~~~~~~~~~g~iyv~GG~~~~----~~~~~i~~yd~~~~--~W~~  241 (352)
                      ..|.  .|+.-..... ..  .....++.++.+|+- .....    .....++++|..++  .|+.
T Consensus       127 ~~TG~~~W~~~~~~~~~~~~~i~ssP~v~~~~v~vg-~~~~~~~~~~~~g~v~alD~~TG~~~W~~  191 (488)
T cd00216         127 AETGKQVWKFGNNDQVPPGYTMTGAPTIVKKLVIIG-SSGAEFFACGVRGALRAYDVETGKLLWRF  191 (488)
T ss_pred             CCCCCEeeeecCCCCcCcceEecCCCEEECCEEEEe-ccccccccCCCCcEEEEEECCCCceeeEe
Confidence            9876  4876432221 01  122335566776653 22111    11236889998765  4875


No 105
>PRK04922 tolB translocation protein TolB; Provisional
Probab=95.49  E-value=2.2  Score=39.97  Aligned_cols=187  Identities=15%  Similarity=0.059  Sum_probs=96.1

Q ss_pred             ceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECC-EEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCccee
Q 045821           82 RVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGK-NAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCY  160 (352)
Q Consensus        82 ~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~-~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~  160 (352)
                      ...++.+|.  .++.-..+...+.    .........++ +|++....+    ...+++++|..+++.+.+.........
T Consensus       227 ~~~l~~~dl--~~g~~~~l~~~~g----~~~~~~~SpDG~~l~~~~s~~----g~~~Iy~~d~~~g~~~~lt~~~~~~~~  296 (433)
T PRK04922        227 RSAIYVQDL--ATGQRELVASFRG----INGAPSFSPDGRRLALTLSRD----GNPEIYVMDLGSRQLTRLTNHFGIDTE  296 (433)
T ss_pred             CcEEEEEEC--CCCCEEEeccCCC----CccCceECCCCCEEEEEEeCC----CCceEEEEECCCCCeEECccCCCCccc
Confidence            456778888  7777666655442    11111222234 455443222    236799999998887666543211111


Q ss_pred             eeeEEECC-EEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECC-EEEEEccCCCCCCceEEEEEeCCCCc
Q 045821          161 FPCGVLNQ-KIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDG-KIYIRCSASAATSHVCALVYEPSTDS  238 (352)
Q Consensus       161 ~~~~~~~~-~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g-~iyv~GG~~~~~~~~~i~~yd~~~~~  238 (352)
                      .+ ..-++ .|+..........++.+|..++..+++.... .........-+| .|++..+ .  .....++.+|+.++.
T Consensus       297 ~~-~spDG~~l~f~sd~~g~~~iy~~dl~~g~~~~lt~~g-~~~~~~~~SpDG~~Ia~~~~-~--~~~~~I~v~d~~~g~  371 (433)
T PRK04922        297 PT-WAPDGKSIYFTSDRGGRPQIYRVAASGGSAERLTFQG-NYNARASVSPDGKKIAMVHG-S--GGQYRIAVMDLSTGS  371 (433)
T ss_pred             eE-ECCCCCEEEEEECCCCCceEEEEECCCCCeEEeecCC-CCccCEEECCCCCEEEEEEC-C--CCceeEEEEECCCCC
Confidence            11 11144 4544433222257888999888887775211 111112233355 4544422 2  222378899998888


Q ss_pred             eeecCCCCCCCCCCcEEEECCEEEEEec-CCC-CeEEEEeCCCCceEEc
Q 045821          239 WLHADANMASGWRGPAVVVDDALYVLDQ-SSG-TKLMMWQKESREWSPV  285 (352)
Q Consensus       239 W~~~~~~~~~~~~~~~~~~~~~l~~~gg-~~~-~~v~~yd~~~~~W~~~  285 (352)
                      .+.+....  .........+++.+++.. ..+ ..++.+|.+...=..+
T Consensus       372 ~~~Lt~~~--~~~~p~~spdG~~i~~~s~~~g~~~L~~~~~~g~~~~~l  418 (433)
T PRK04922        372 VRTLTPGS--LDESPSFAPNGSMVLYATREGGRGVLAAVSTDGRVRQRL  418 (433)
T ss_pred             eEECCCCC--CCCCceECCCCCEEEEEEecCCceEEEEEECCCCceEEc
Confidence            88775421  112234444555444433 222 4688888865443334


No 106
>PF02191 OLF:  Olfactomedin-like domain;  InterPro: IPR003112 The olfactomedin-domain was first identified in olfactomedin, an extracellular matrix protein of the olfactory neuroepithelium []. Members of this extracellular domain-family have since been shown to be present in several metazoan proteins, such as latrophilins, myocilins, optimedins and noelins, the latter being involved in the generation of neural crest cells. Myocilin is of considerable interest, as mutations in its olfactomedin-domain can lead to glaucoma []. The olfactomedin-domains in myocilin and optimedin are essential for the interaction between these two proteins [].; GO: 0005515 protein binding
Probab=95.47  E-value=1.5  Score=37.70  Aligned_cols=191  Identities=16%  Similarity=0.149  Sum_probs=106.7

Q ss_pred             CCEEEEEeCCCCCCCCCCeEEEEECC-----CCeeEecCCCCCcceeeeeEEECCEEEEEeCCCCCCceEEEeCCCCcEE
Q 045821          119 GKNAYLLGGCGWSEDATSEVYCYDAS-----MNTWTDAAPMSTARCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTNNWK  193 (352)
Q Consensus       119 ~~~iyv~GG~~~~~~~~~~~~~~d~~-----t~~W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~W~  193 (352)
                      .+++|++.+..+.     .+..|.-.     .++....-.+|.+-.+-+-++++|.+|.--...  ..+..||+.++.-.
T Consensus        30 ~~~iy~~~~~~~~-----~v~ey~~~~~f~~~~~~~~~~~Lp~~~~GtG~vVYngslYY~~~~s--~~IvkydL~t~~v~  102 (250)
T PF02191_consen   30 SEKIYVTSGFSGN-----TVYEYRNYEDFLRNGRSSRTYKLPYPWQGTGHVVYNGSLYYNKYNS--RNIVKYDLTTRSVV  102 (250)
T ss_pred             CCCEEEECccCCC-----EEEEEcCHhHHhhcCCCceEEEEeceeccCCeEEECCcEEEEecCC--ceEEEEECcCCcEE
Confidence            4567888774332     44444322     222222223455555666777888887765422  78999999998644


Q ss_pred             EecCCCCcccc------------eeEEEECCEEEEEccCCCCCCceEEEEEeCCC----CceeecCCCCCCCCCCcEEEE
Q 045821          194 LHTEPNIFTEI------------EDSFVMDGKIYIRCSASAATSHVCALVYEPST----DSWLHADANMASGWRGPAVVV  257 (352)
Q Consensus       194 ~~~~~~~~~~~------------~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~----~~W~~~~~~~~~~~~~~~~~~  257 (352)
                      .-..+|.+...            --.++-++-|+|+-..........+-..|+.+    .+|..   ....+..+.++++
T Consensus       103 ~~~~L~~A~~~n~~~y~~~~~t~iD~AvDE~GLWvIYat~~~~g~ivvskld~~tL~v~~tw~T---~~~k~~~~naFmv  179 (250)
T PF02191_consen  103 ARRELPGAGYNNRFPYYWSGYTDIDFAVDENGLWVIYATEDNNGNIVVSKLDPETLSVEQTWNT---SYPKRSAGNAFMV  179 (250)
T ss_pred             EEEECCccccccccceecCCCceEEEEEcCCCEEEEEecCCCCCcEEEEeeCcccCceEEEEEe---ccCchhhcceeeE
Confidence            11112222211            11234455677764444333334666777754    45653   2344555566777


Q ss_pred             CCEEEEEecCCC--CeE-EEEeCCCCceEEccCCCCCcCCCCceEEE---eCCEEEEEcCCceEEEEEcC
Q 045821          258 DDALYVLDQSSG--TKL-MMWQKESREWSPVGRLSTLLTRPPCKLVA---IGKTIFVIGKGCSAVVIDVG  321 (352)
Q Consensus       258 ~~~l~~~gg~~~--~~v-~~yd~~~~~W~~~~~~~~~~~~~~~~~~~---~~~~i~v~gG~~~~~~~d~~  321 (352)
                      -|.||++.....  ..| +.||..+++=..+. ++-........++.   .+.+||+.-.. ..+.|++.
T Consensus       180 CGvLY~~~s~~~~~~~I~yafDt~t~~~~~~~-i~f~~~~~~~~~l~YNP~dk~LY~wd~G-~~v~Y~v~  247 (250)
T PF02191_consen  180 CGVLYATDSYDTRDTEIFYAFDTYTGKEEDVS-IPFPNPYGNISMLSYNPRDKKLYAWDNG-YQVTYDVR  247 (250)
T ss_pred             eeEEEEEEECCCCCcEEEEEEECCCCceecee-eeeccccCceEeeeECCCCCeEEEEECC-eEEEEEEE
Confidence            899999987653  234 89999987765443 32211222333443   36789998644 35556653


No 107
>COG4257 Vgb Streptogramin lyase [Defense mechanisms]
Probab=95.40  E-value=1.5  Score=37.55  Aligned_cols=184  Identities=13%  Similarity=0.112  Sum_probs=99.5

Q ss_pred             eEEEecCCCCCCCeeEeccCCCcccCCcceEEEEEC--CEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcce-e
Q 045821           84 CCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLG--KNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARC-Y  160 (352)
Q Consensus        84 ~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~--~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~-~  160 (352)
                      .+-.+|+  ++...++.+-...  ....+.-.++++  +.++..|-....       =..||.++.-+..+.. .... +
T Consensus       125 aI~R~dp--kt~evt~f~lp~~--~a~~nlet~vfD~~G~lWFt~q~G~y-------GrLdPa~~~i~vfpaP-qG~gpy  192 (353)
T COG4257         125 AIGRLDP--KTLEVTRFPLPLE--HADANLETAVFDPWGNLWFTGQIGAY-------GRLDPARNVISVFPAP-QGGGPY  192 (353)
T ss_pred             eeEEecC--cccceEEeecccc--cCCCcccceeeCCCccEEEeeccccc-------eecCcccCceeeeccC-CCCCCc
Confidence            3445666  6666655542211  233344445553  456666532111       1567777765554432 2222 2


Q ss_pred             eeeEEECCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccce--eEEEECCEEEEEccCCCCCCceEEEEEeCCCCc
Q 045821          161 FPCGVLNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIE--DSFVMDGKIYIRCSASAATSHVCALVYEPSTDS  238 (352)
Q Consensus       161 ~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~--~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~  238 (352)
                      -.++.-+|.+|+..=..  +-+-..|+.+..=+.++.+.......  ..+--.|++++..     ...-.+.+||+++.+
T Consensus       193 Gi~atpdGsvwyaslag--naiaridp~~~~aev~p~P~~~~~gsRriwsdpig~~witt-----wg~g~l~rfdPs~~s  265 (353)
T COG4257         193 GICATPDGSVWYASLAG--NAIARIDPFAGHAEVVPQPNALKAGSRRIWSDPIGRAWITT-----WGTGSLHRFDPSVTS  265 (353)
T ss_pred             ceEECCCCcEEEEeccc--cceEEcccccCCcceecCCCcccccccccccCccCcEEEec-----cCCceeeEeCccccc
Confidence            22333478888762111  66778888887655565433312111  1233357777762     112268999999999


Q ss_pred             eeecCCCCCCCCCCcE-EEECCEEEEEecCCCCeEEEEeCCCCceEEccC
Q 045821          239 WLHADANMASGWRGPA-VVVDDALYVLDQSSGTKLMMWQKESREWSPVGR  287 (352)
Q Consensus       239 W~~~~~~~~~~~~~~~-~~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~  287 (352)
                      |.+-+-+-...+-... |-..+++++-. ...+.+.+||+++.+++.++.
T Consensus       266 W~eypLPgs~arpys~rVD~~grVW~se-a~agai~rfdpeta~ftv~p~  314 (353)
T COG4257         266 WIEYPLPGSKARPYSMRVDRHGRVWLSE-ADAGAIGRFDPETARFTVLPI  314 (353)
T ss_pred             ceeeeCCCCCCCcceeeeccCCcEEeec-cccCceeecCcccceEEEecC
Confidence            9987543222222222 33345555533 222389999999999998863


No 108
>cd00094 HX Hemopexin-like repeats.; Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix metalloproteinases family (matrixins). The HX repeats of some matrixins bind tissue inhibitor of metalloproteinases (TIMPs). This CD contains 4 instances of the repeat.
Probab=95.31  E-value=1.3  Score=36.34  Aligned_cols=145  Identities=12%  Similarity=0.116  Sum_probs=78.3

Q ss_pred             eeEEECCEEEEEeCCCCCCceEEEeCCCC--cEEEecC-CCC-cccceeEEEE--CCEEEEEccCCCCCCceEEEEEeCC
Q 045821          162 PCGVLNQKIYCIGGLGDTHSWDVYDPRTN--NWKLHTE-PNI-FTEIEDSFVM--DGKIYIRCSASAATSHVCALVYEPS  235 (352)
Q Consensus       162 ~~~~~~~~iyv~gG~~~~~~v~~yd~~t~--~W~~~~~-~~~-~~~~~~~~~~--~g~iyv~GG~~~~~~~~~i~~yd~~  235 (352)
                      +++..++++|++-|    ..+++++....  .-+.|.. .+. +..-.++...  +|++|++-|..       .|+|+..
T Consensus        11 A~~~~~g~~y~FkG----~~~w~~~~~~~~~~p~~I~~~w~~~p~~IDAa~~~~~~~~~yfFkg~~-------yw~~~~~   79 (194)
T cd00094          11 AVTTLRGELYFFKG----RYFWRLSPGKPPGSPFLISSFWPSLPSPVDAAFERPDTGKIYFFKGDK-------YWVYTGK   79 (194)
T ss_pred             eEEEeCCEEEEEeC----CEEEEEeCCCCCCCCeEhhhhCCCCCCCccEEEEECCCCEEEEECCCE-------EEEEcCc
Confidence            34445689999988    67888876522  1222222 111 1122233333  38999995543       7888866


Q ss_pred             CCcee---ecCC-CCCC--CCCCcEEEE--CCEEEEEecCCCCeEEEEeCCCCceEE-----c----cCCCCCcCCCCce
Q 045821          236 TDSWL---HADA-NMAS--GWRGPAVVV--DDALYVLDQSSGTKLMMWQKESREWSP-----V----GRLSTLLTRPPCK  298 (352)
Q Consensus       236 ~~~W~---~~~~-~~~~--~~~~~~~~~--~~~l~~~gg~~~~~v~~yd~~~~~W~~-----~----~~~~~~~~~~~~~  298 (352)
                      +..+.   .+.. ..+.  ..--++...  ++++|++.|.   ..++||...++...     +    ..+|.   .-.++
T Consensus        80 ~~~~~~Pk~i~~~~~~~~~~~iDAA~~~~~~~~~yfFkg~---~y~ry~~~~~~v~~~yP~~i~~~w~g~p~---~idaa  153 (194)
T cd00094          80 NLEPGYPKPISDLGFPPTVKQIDAALRWPDNGKTYFFKGD---KYWRYDEKTQKMDPGYPKLIETDFPGVPD---KVDAA  153 (194)
T ss_pred             ccccCCCcchhhcCCCCCCCCccEEEEEcCCCEEEEEeCC---EEEEEeCCCccccCCCCcchhhcCCCcCC---Cccee
Confidence            42221   1111 0010  111123333  6899999886   89999987655421     1    11221   12233


Q ss_pred             EEEeCCEEEEEcCCceEEEEEcCCcc
Q 045821          299 LVAIGKTIFVIGKGCSAVVIDVGNIG  324 (352)
Q Consensus       299 ~~~~~~~i~v~gG~~~~~~~d~~~~~  324 (352)
                      +...++++|+|-|. ..+.||..+.+
T Consensus       154 ~~~~~~~~yfF~g~-~y~~~d~~~~~  178 (194)
T cd00094         154 FRWLDGYYYFFKGD-QYWRFDPRSKE  178 (194)
T ss_pred             EEeCCCcEEEEECC-EEEEEeCccce
Confidence            33344899999876 56678887654


No 109
>PF08268 FBA_3:  F-box associated domain;  InterPro: IPR013187 This domain occurs in a diverse superfamily of genes in plants. Most examples are found C-terminal to an F-box (IPR001810 from INTERPRO), a 60 amino acid motif involved in ubiquitination of target proteins to mark them for degradation. Two-hybid experiments support the idea that most members are interchangeable F-box subunits of SCF E3 complexes []. Some members have two copies of this domain.
Probab=95.22  E-value=0.69  Score=35.18  Aligned_cols=79  Identities=13%  Similarity=0.187  Sum_probs=54.3

Q ss_pred             EEECCEEEEEeCCCC--CCceEEEeCCCCcEEEecCC---CCcccceeEEEECCEEEEEccCCCCC-CceEEEEE-eCCC
Q 045821          164 GVLNQKIYCIGGLGD--THSWDVYDPRTNNWKLHTEP---NIFTEIEDSFVMDGKIYIRCSASAAT-SHVCALVY-EPST  236 (352)
Q Consensus       164 ~~~~~~iyv~gG~~~--~~~v~~yd~~t~~W~~~~~~---~~~~~~~~~~~~~g~iyv~GG~~~~~-~~~~i~~y-d~~~  236 (352)
                      +.+||-+|.......  ...+..||..+++|+.+..|   .........+.++|+|-++.-..... ....+|+. |..+
T Consensus         2 icinGvly~~a~~~~~~~~~IvsFDv~~E~f~~i~~P~~~~~~~~~~~L~~~~G~L~~v~~~~~~~~~~~~iWvLeD~~k   81 (129)
T PF08268_consen    2 ICINGVLYWLAWSEDSDNNVIVSFDVRSEKFRFIKLPEDPYSSDCSSTLIEYKGKLALVSYNDQGEPDSIDIWVLEDYEK   81 (129)
T ss_pred             EEECcEEEeEEEECCCCCcEEEEEEcCCceEEEEEeeeeeccccCccEEEEeCCeEEEEEecCCCCcceEEEEEeecccc
Confidence            457888888876522  28889999999999998754   12233345678899998884333222 23578877 4567


Q ss_pred             Cceeec
Q 045821          237 DSWLHA  242 (352)
Q Consensus       237 ~~W~~~  242 (352)
                      .+|.+.
T Consensus        82 ~~Wsk~   87 (129)
T PF08268_consen   82 QEWSKK   87 (129)
T ss_pred             ceEEEE
Confidence            789866


No 110
>PF08268 FBA_3:  F-box associated domain;  InterPro: IPR013187 This domain occurs in a diverse superfamily of genes in plants. Most examples are found C-terminal to an F-box (IPR001810 from INTERPRO), a 60 amino acid motif involved in ubiquitination of target proteins to mark them for degradation. Two-hybid experiments support the idea that most members are interchangeable F-box subunits of SCF E3 complexes []. Some members have two copies of this domain.
Probab=95.20  E-value=0.42  Score=36.36  Aligned_cols=79  Identities=10%  Similarity=0.181  Sum_probs=53.6

Q ss_pred             EEECCEEEEEccCCCCCCceEEEEEeCCCCceeecCCC--CCCCC-CCcEEEECCEEEEEecCCC-----CeEEEE-eCC
Q 045821          208 FVMDGKIYIRCSASAATSHVCALVYEPSTDSWLHADAN--MASGW-RGPAVVVDDALYVLDQSSG-----TKLMMW-QKE  278 (352)
Q Consensus       208 ~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~--~~~~~-~~~~~~~~~~l~~~gg~~~-----~~v~~y-d~~  278 (352)
                      ..+||.+|.+.-. .......|.+||..+++|+.+..|  ..... ....+..+|+|.++.-...     -++|+. |.+
T Consensus         2 icinGvly~~a~~-~~~~~~~IvsFDv~~E~f~~i~~P~~~~~~~~~~~L~~~~G~L~~v~~~~~~~~~~~~iWvLeD~~   80 (129)
T PF08268_consen    2 ICINGVLYWLAWS-EDSDNNVIVSFDVRSEKFRFIKLPEDPYSSDCSSTLIEYKGKLALVSYNDQGEPDSIDIWVLEDYE   80 (129)
T ss_pred             EEECcEEEeEEEE-CCCCCcEEEEEEcCCceEEEEEeeeeeccccCccEEEEeCCeEEEEEecCCCCcceEEEEEeeccc
Confidence            3578999888544 334455899999999999998764  22222 2234678999998865432     256666 666


Q ss_pred             CCceEEccC
Q 045821          279 SREWSPVGR  287 (352)
Q Consensus       279 ~~~W~~~~~  287 (352)
                      .++|++...
T Consensus        81 k~~Wsk~~~   89 (129)
T PF08268_consen   81 KQEWSKKHI   89 (129)
T ss_pred             cceEEEEEE
Confidence            789997753


No 111
>PRK04043 tolB translocation protein TolB; Provisional
Probab=95.09  E-value=2.9  Score=38.99  Aligned_cols=177  Identities=11%  Similarity=0.036  Sum_probs=96.8

Q ss_pred             CeEEEEECCCCeeEecCCCCCcceeeeeEEECC-EEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECC-E
Q 045821          136 SEVYCYDASMNTWTDAAPMSTARCYFPCGVLNQ-KIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDG-K  213 (352)
Q Consensus       136 ~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~~~-~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g-~  213 (352)
                      .++|++|..+++=+.+...+....... ..-|+ ++.+........+++.+|..++.++++...+. ........-+| +
T Consensus       213 ~~Iyv~dl~tg~~~~lt~~~g~~~~~~-~SPDG~~la~~~~~~g~~~Iy~~dl~~g~~~~LT~~~~-~d~~p~~SPDG~~  290 (419)
T PRK04043        213 PTLYKYNLYTGKKEKIASSQGMLVVSD-VSKDGSKLLLTMAPKGQPDIYLYDTNTKTLTQITNYPG-IDVNGNFVEDDKR  290 (419)
T ss_pred             CEEEEEECCCCcEEEEecCCCcEEeeE-ECCCCCEEEEEEccCCCcEEEEEECCCCcEEEcccCCC-ccCccEECCCCCE
Confidence            478888888887666654322111111 12244 55554433333789999999999998864432 11112233345 5


Q ss_pred             EEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEEECC-EEEEEecCC------C-CeEEEEeCCCCceEEc
Q 045821          214 IYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVVVDD-ALYVLDQSS------G-TKLMMWQKESREWSPV  285 (352)
Q Consensus       214 iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~-~l~~~gg~~------~-~~v~~yd~~~~~W~~~  285 (352)
                      ||..-..   .....++.+|..+++.+++.....  . .....-+| .|.+.....      + .+++++|.+++.++.+
T Consensus       291 I~F~Sdr---~g~~~Iy~~dl~~g~~~rlt~~g~--~-~~~~SPDG~~Ia~~~~~~~~~~~~~~~~I~v~d~~~g~~~~L  364 (419)
T PRK04043        291 IVFVSDR---LGYPNIFMKKLNSGSVEQVVFHGK--N-NSSVSTYKNYIVYSSRETNNEFGKNTFNLYLISTNSDYIRRL  364 (419)
T ss_pred             EEEEECC---CCCceEEEEECCCCCeEeCccCCC--c-CceECCCCCEEEEEEcCCCcccCCCCcEEEEEECCCCCeEEC
Confidence            6655322   223479999999888877643211  1 22333344 454444322      2 4799999999999888


Q ss_pred             cCCCCCcCCCCceEEEeCCEEEEEc---CCceEEEEEcCCc
Q 045821          286 GRLSTLLTRPPCKLVAIGKTIFVIG---KGCSAVVIDVGNI  323 (352)
Q Consensus       286 ~~~~~~~~~~~~~~~~~~~~i~v~g---G~~~~~~~d~~~~  323 (352)
                      .....   .........+..|+...   |.....++++..+
T Consensus       365 T~~~~---~~~p~~SPDG~~I~f~~~~~~~~~L~~~~l~g~  402 (419)
T PRK04043        365 TANGV---NQFPRFSSDGGSIMFIKYLGNQSALGIIRLNYN  402 (419)
T ss_pred             CCCCC---cCCeEECCCCCEEEEEEccCCcEEEEEEecCCC
Confidence            65321   11233333344455443   3334556666553


No 112
>KOG2120 consensus SCF ubiquitin ligase, Skp2 component [Posttranslational modification, protein turnover, chaperones]
Probab=95.04  E-value=0.016  Score=49.83  Aligned_cols=38  Identities=24%  Similarity=0.281  Sum_probs=34.8

Q ss_pred             cCCCcHHHHHHhhccCccccchhhhhhhhhhcccccCh
Q 045821           18 ISGLPDDIALFCLARVPRRYHAVLKCVSSRWRGLLHSE   55 (352)
Q Consensus        18 ~~~lp~dl~~~il~r~p~~~~~~l~~v~k~W~~l~~s~   55 (352)
                      |.+||||+++.|++-++.+.+.+...|||+|..+....
T Consensus        98 ~~slpDEill~IFs~L~kk~LL~~~~VC~Rfyr~~~de  135 (419)
T KOG2120|consen   98 WDSLPDEILLGIFSCLCKKELLKVSGVCKRFYRLASDE  135 (419)
T ss_pred             cccCCHHHHHHHHHhccHHHHHHHHHHHHHHhhccccc
Confidence            67899999999999999999999999999999886543


No 113
>PLN00181 protein SPA1-RELATED; Provisional
Probab=95.00  E-value=4.8  Score=41.04  Aligned_cols=172  Identities=10%  Similarity=0.047  Sum_probs=81.9

Q ss_pred             CeEEEEECCCCeeEecCCCCCcceeeeeEE--ECCEEEEEeCCCCCCceEEEeCCCCcEE-EecCCCCcccceeEEE--E
Q 045821          136 SEVYCYDASMNTWTDAAPMSTARCYFPCGV--LNQKIYCIGGLGDTHSWDVYDPRTNNWK-LHTEPNIFTEIEDSFV--M  210 (352)
Q Consensus       136 ~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~--~~~~iyv~gG~~~~~~v~~yd~~t~~W~-~~~~~~~~~~~~~~~~--~  210 (352)
                      ..+.+||..+++-...-..... .-.+++.  .++.+++.|+.+  ..+.+||..+..-. .+.   .......+..  -
T Consensus       555 g~v~lWd~~~~~~~~~~~~H~~-~V~~l~~~p~~~~~L~Sgs~D--g~v~iWd~~~~~~~~~~~---~~~~v~~v~~~~~  628 (793)
T PLN00181        555 GVVQVWDVARSQLVTEMKEHEK-RVWSIDYSSADPTLLASGSDD--GSVKLWSINQGVSIGTIK---TKANICCVQFPSE  628 (793)
T ss_pred             CeEEEEECCCCeEEEEecCCCC-CEEEEEEcCCCCCEEEEEcCC--CEEEEEECCCCcEEEEEe---cCCCeEEEEEeCC
Confidence            4678889877653221111111 1122222  246677777765  67889998765422 221   1111111211  2


Q ss_pred             CCEEEEEccCCCCCCceEEEEEeCCCCc--eeecCCCCCCCCCCcEE-EECCEEEEEecCCCCeEEEEeCCCC----ceE
Q 045821          211 DGKIYIRCSASAATSHVCALVYEPSTDS--WLHADANMASGWRGPAV-VVDDALYVLDQSSGTKLMMWQKESR----EWS  283 (352)
Q Consensus       211 ~g~iyv~GG~~~~~~~~~i~~yd~~~~~--W~~~~~~~~~~~~~~~~-~~~~~l~~~gg~~~~~v~~yd~~~~----~W~  283 (352)
                      ++.+++.|+.++     .+..||..+..  ...+..   +...-..+ ..++..++.++.++ .+.+||....    .|.
T Consensus       629 ~g~~latgs~dg-----~I~iwD~~~~~~~~~~~~~---h~~~V~~v~f~~~~~lvs~s~D~-~ikiWd~~~~~~~~~~~  699 (793)
T PLN00181        629 SGRSLAFGSADH-----KVYYYDLRNPKLPLCTMIG---HSKTVSYVRFVDSSTLVSSSTDN-TLKLWDLSMSISGINET  699 (793)
T ss_pred             CCCEEEEEeCCC-----eEEEEECCCCCccceEecC---CCCCEEEEEEeCCCEEEEEECCC-EEEEEeCCCCccccCCc
Confidence            567777765542     68888886542  112111   11011112 23566666776665 7999998643    233


Q ss_pred             EccCCCCCcCCCCceEEEeCCEEEEEcCCce-EEEEEcCC
Q 045821          284 PVGRLSTLLTRPPCKLVAIGKTIFVIGKGCS-AVVIDVGN  322 (352)
Q Consensus       284 ~~~~~~~~~~~~~~~~~~~~~~i~v~gG~~~-~~~~d~~~  322 (352)
                      .+..+........+.....++.+++.|+.++ +.+|+...
T Consensus       700 ~l~~~~gh~~~i~~v~~s~~~~~lasgs~D~~v~iw~~~~  739 (793)
T PLN00181        700 PLHSFMGHTNVKNFVGLSVSDGYIATGSETNEVFVYHKAF  739 (793)
T ss_pred             ceEEEcCCCCCeeEEEEcCCCCEEEEEeCCCEEEEEECCC
Confidence            3322211001111122223455666665544 55677653


No 114
>TIGR03075 PQQ_enz_alc_DH PQQ-dependent dehydrogenase, methanol/ethanol family. This protein family has a phylogenetic distribution very similar to that coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Genes in this family often are found adjacent to the PQQ biosynthesis genes themselves. An unusual, strained disulfide bond between adjacent Cys residues contributes to PQQ-binding, as does a Trp residue that is part of a PQQ enzyme repeat (see pfam01011). Characterized members include the dehydrogenase subunit of a membrane-anchored, three subunit alcohol (ethanol) dehydrogenase of Gluconobacter suboxydans, a homodimeric ethanol dehydrogenase in Pseudomonas aeruginosa, and the large subunit of an alpha2/beta2 heterotetrameric methanol dehydrogenase in Methylobacterium extorquens.
Probab=94.72  E-value=2.5  Score=40.70  Aligned_cols=117  Identities=17%  Similarity=0.239  Sum_probs=68.2

Q ss_pred             EEEECCEEEEEeCCCCCCCCCCeEEEEECCCCe--eEecCCCCCc--------ceeeeeEEECCEEEEEeCCCCCCceEE
Q 045821          115 FEVLGKNAYLLGGCGWSEDATSEVYCYDASMNT--WTDAAPMSTA--------RCYFPCGVLNQKIYCIGGLGDTHSWDV  184 (352)
Q Consensus       115 ~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~--W~~~~~~~~~--------r~~~~~~~~~~~iyv~gG~~~~~~v~~  184 (352)
                      -++.++.||+...       ...++.+|..|++  |+.-...+..        ......++.+++||+... +  ..+..
T Consensus        65 Pvv~~g~vyv~s~-------~g~v~AlDa~TGk~lW~~~~~~~~~~~~~~~~~~~~rg~av~~~~v~v~t~-d--g~l~A  134 (527)
T TIGR03075        65 PLVVDGVMYVTTS-------YSRVYALDAKTGKELWKYDPKLPDDVIPVMCCDVVNRGVALYDGKVFFGTL-D--ARLVA  134 (527)
T ss_pred             CEEECCEEEEECC-------CCcEEEEECCCCceeeEecCCCCcccccccccccccccceEECCEEEEEcC-C--CEEEE
Confidence            3566899998643       2468899998875  8754432211        111234566888887543 2  56899


Q ss_pred             EeCCCCc--EEEec-CCCCc-ccceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCc--eee
Q 045821          185 YDPRTNN--WKLHT-EPNIF-TEIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDS--WLH  241 (352)
Q Consensus       185 yd~~t~~--W~~~~-~~~~~-~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~--W~~  241 (352)
                      +|..|.+  |+.-. .+... .....-++.+++||+-....+......+..||..+++  |+.
T Consensus       135 LDa~TGk~~W~~~~~~~~~~~~~tssP~v~~g~Vivg~~~~~~~~~G~v~AlD~~TG~~lW~~  197 (527)
T TIGR03075       135 LDAKTGKVVWSKKNGDYKAGYTITAAPLVVKGKVITGISGGEFGVRGYVTAYDAKTGKLVWRR  197 (527)
T ss_pred             EECCCCCEEeecccccccccccccCCcEEECCEEEEeecccccCCCcEEEEEECCCCceeEec
Confidence            9998875  86532 11111 1122345678888875322211222368888887764  763


No 115
>PRK05137 tolB translocation protein TolB; Provisional
Probab=94.66  E-value=3.9  Score=38.37  Aligned_cols=181  Identities=10%  Similarity=0.028  Sum_probs=93.0

Q ss_pred             CCeEEEEECCCCeeEecCCCCCcceeeeeEEECC-EEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECCE
Q 045821          135 TSEVYCYDASMNTWTDAAPMSTARCYFPCGVLNQ-KIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGK  213 (352)
Q Consensus       135 ~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~~~-~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~  213 (352)
                      ...++++|+.+++.+.+...+......... -++ +|++....+....++.+|..+..-+++...+. ........-+|+
T Consensus       225 ~~~i~~~dl~~g~~~~l~~~~g~~~~~~~S-PDG~~la~~~~~~g~~~Iy~~d~~~~~~~~Lt~~~~-~~~~~~~spDG~  302 (435)
T PRK05137        225 RPRVYLLDLETGQRELVGNFPGMTFAPRFS-PDGRKVVMSLSQGGNTDIYTMDLRSGTTTRLTDSPA-IDTSPSYSPDGS  302 (435)
T ss_pred             CCEEEEEECCCCcEEEeecCCCcccCcEEC-CCCCEEEEEEecCCCceEEEEECCCCceEEccCCCC-ccCceeEcCCCC
Confidence            368999999998887776544322211111 144 55444433333778999999888777653221 111122333554


Q ss_pred             EEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEEECC-EEEEEecCCC-CeEEEEeCCCCceEEccCCCCC
Q 045821          214 IYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVVVDD-ALYVLDQSSG-TKLMMWQKESREWSPVGRLSTL  291 (352)
Q Consensus       214 iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~-~l~~~gg~~~-~~v~~yd~~~~~W~~~~~~~~~  291 (352)
                      -.++....  .....++.+|..+...+.+... ..........-+| .|++.....+ ..++.+|+.++....+....  
T Consensus       303 ~i~f~s~~--~g~~~Iy~~d~~g~~~~~lt~~-~~~~~~~~~SpdG~~ia~~~~~~~~~~i~~~d~~~~~~~~lt~~~--  377 (435)
T PRK05137        303 QIVFESDR--SGSPQLYVMNADGSNPRRISFG-GGRYSTPVWSPRGDLIAFTKQGGGQFSIGVMKPDGSGERILTSGF--  377 (435)
T ss_pred             EEEEEECC--CCCCeEEEEECCCCCeEEeecC-CCcccCeEECCCCCEEEEEEcCCCceEEEEEECCCCceEeccCCC--
Confidence            33332221  1123688899887777766432 1111222233344 5555443222 47899998877665554211  


Q ss_pred             cCCCCceEEEeCCEEEEEc---CC---ceEEEEEcCCc
Q 045821          292 LTRPPCKLVAIGKTIFVIG---KG---CSAVVIDVGNI  323 (352)
Q Consensus       292 ~~~~~~~~~~~~~~i~v~g---G~---~~~~~~d~~~~  323 (352)
                       ..........+..|+...   |.   ...+++|....
T Consensus       378 -~~~~p~~spDG~~i~~~~~~~~~~~~~~L~~~dl~g~  414 (435)
T PRK05137        378 -LVEGPTWAPNGRVIMFFRQTPGSGGAPKLYTVDLTGR  414 (435)
T ss_pred             -CCCCCeECCCCCEEEEEEccCCCCCcceEEEEECCCC
Confidence             111223333344455443   22   24566777543


No 116
>PF12217 End_beta_propel:  Catalytic beta propeller domain of bacteriophage endosialidase;  InterPro: IPR024428 This entry represents the beta propeller domain of endosialidases, which consists of catalytically active part of the enzymes. This core domain forms stable SDS-resistant trimers. There is a nested beta barrel domain in this domain. This domain is typically between 443 and 460 amino acids in length [].; PDB: 1V0E_B 1V0F_E 3JU4_A 3GVL_A 3GVK_B 3GVJ_A.
Probab=94.61  E-value=2.5  Score=35.89  Aligned_cols=175  Identities=10%  Similarity=0.160  Sum_probs=80.8

Q ss_pred             CCCCeeEec---cCCC--cccCCcceEEEEECCEEEEEeC-CCCCCCCCCeEEEEE---CCCCeeEe--cCCCCC-----
Q 045821           93 TRRSWKMIH---ELPA--RTLRRKGMGFEVLGKNAYLLGG-CGWSEDATSEVYCYD---ASMNTWTD--AAPMST-----  156 (352)
Q Consensus        93 ~~~~w~~~~---~~~~--~~~~~~~~~~~~~~~~iyv~GG-~~~~~~~~~~~~~~d---~~t~~W~~--~~~~~~-----  156 (352)
                      ..++|+.-.   ++..  +....++++..++++++|.+=- ..-........+.||   ...+.|+.  ++..+.     
T Consensus        53 dG~TWttPEwLtd~H~~yptvnyHCmSMGv~~NRLfa~iEtR~~a~~km~~~~Lw~RpMF~~spW~~teL~~~~~~~~a~  132 (367)
T PF12217_consen   53 DGQTWTTPEWLTDLHPDYPTVNYHCMSMGVVGNRLFAVIETRTVASNKMVRAELWSRPMFHDSPWRITELGTIASFTSAG  132 (367)
T ss_dssp             TTSS----EESS---TTTTTEEEE-B-EEEETTEEEEEEEEEETTT--EEEEEEEEEE-STTS--EEEEEES-TT-----
T ss_pred             CCCcccCchhhhhcCCCCCccceeeeeeeeecceeeEEEeehhhhhhhhhhhhhhcccccccCCceeeeccccccccccc
Confidence            578886432   2211  1134567788899999986632 222223344455555   33566764  443333     


Q ss_pred             --cceeeeeEEECCEEEEEeCCCCC---Cc--eEEEeCC----CCcE-EEecC-CCCcccceeEEEECCEEEEEccCCCC
Q 045821          157 --ARCYFPCGVLNQKIYCIGGLGDT---HS--WDVYDPR----TNNW-KLHTE-PNIFTEIEDSFVMDGKIYIRCSASAA  223 (352)
Q Consensus       157 --~r~~~~~~~~~~~iyv~gG~~~~---~~--v~~yd~~----t~~W-~~~~~-~~~~~~~~~~~~~~g~iyv~GG~~~~  223 (352)
                        .-.-|+.+.+++.-|.+|=...+   ..  +..|...    ...- +.++. ........+.-.++|.||+..-....
T Consensus       133 ~~vTe~HSFa~i~~~~fA~GyHnGD~sPRe~G~~yfs~~~~sp~~~vrr~i~sey~~~AsEPCvkyY~g~LyLtTRgt~~  212 (367)
T PF12217_consen  133 VAVTELHSFATIDDNQFAVGYHNGDVSPRELGFLYFSDAFASPGVFVRRIIPSEYERNASEPCVKYYDGVLYLTTRGTLP  212 (367)
T ss_dssp             ---SEEEEEEE-SSS-EEEEEEE-SSSS-EEEEEEETTTTT-TT--EEEE--GGG-TTEEEEEEEEETTEEEEEEEES-T
T ss_pred             ceeeeeeeeeEecCCceeEEeccCCCCcceeeEEEecccccCCcceeeeechhhhccccccchhhhhCCEEEEEEcCcCC
Confidence              23467888888877777743332   22  2223221    1111 22221 11222334556789999998533322


Q ss_pred             -CCceEEEEEeCCCCceeecCCCCCCCCCC-cEEEECCEEEEEecC
Q 045821          224 -TSHVCALVYEPSTDSWLHADANMASGWRG-PAVVVDDALYVLDQS  267 (352)
Q Consensus       224 -~~~~~i~~yd~~~~~W~~~~~~~~~~~~~-~~~~~~~~l~~~gg~  267 (352)
                       .....+.+-+.....|+.+.-+....... +....++.||++|..
T Consensus       213 ~~~GS~L~rs~d~G~~w~slrfp~nvHhtnlPFakvgD~l~mFgsE  258 (367)
T PF12217_consen  213 TNPGSSLHRSDDNGQNWSSLRFPNNVHHTNLPFAKVGDVLYMFGSE  258 (367)
T ss_dssp             TS---EEEEESSTTSS-EEEE-TT---SS---EEEETTEEEEEEE-
T ss_pred             CCCcceeeeecccCCchhhccccccccccCCCceeeCCEEEEEecc
Confidence             33346778888889999886542222221 235679999999863


No 117
>PF02191 OLF:  Olfactomedin-like domain;  InterPro: IPR003112 The olfactomedin-domain was first identified in olfactomedin, an extracellular matrix protein of the olfactory neuroepithelium []. Members of this extracellular domain-family have since been shown to be present in several metazoan proteins, such as latrophilins, myocilins, optimedins and noelins, the latter being involved in the generation of neural crest cells. Myocilin is of considerable interest, as mutations in its olfactomedin-domain can lead to glaucoma []. The olfactomedin-domains in myocilin and optimedin are essential for the interaction between these two proteins [].; GO: 0005515 protein binding
Probab=94.54  E-value=2.7  Score=36.05  Aligned_cols=192  Identities=14%  Similarity=0.160  Sum_probs=105.6

Q ss_pred             CCEEEEEEecCCCCceeEEEecCCC---CCCCeeEeccCCCcccCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEECC
Q 045821           68 ETWIYAFCRDNKLERVCCYVLDPNS---TRRSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDAS  144 (352)
Q Consensus        68 ~~~l~~~gg~~~~~~~~~~~~d~~~---~~~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~  144 (352)
                      .+.+|++.+....   .++.|.-..   ..+.....-.+|   .+-.+.+.++.+|.+|.--      .....+.+||..
T Consensus        30 ~~~iy~~~~~~~~---~v~ey~~~~~f~~~~~~~~~~~Lp---~~~~GtG~vVYngslYY~~------~~s~~IvkydL~   97 (250)
T PF02191_consen   30 SEKIYVTSGFSGN---TVYEYRNYEDFLRNGRSSRTYKLP---YPWQGTGHVVYNGSLYYNK------YNSRNIVKYDLT   97 (250)
T ss_pred             CCCEEEECccCCC---EEEEEcCHhHHhhcCCCceEEEEe---ceeccCCeEEECCcEEEEe------cCCceEEEEECc
Confidence            6778888665222   444442200   112222222333   3455566677788877653      346889999999


Q ss_pred             CCeeE---ecCCCC---------CcceeeeeEEECCEEEEEeCCCCC---CceEEEeCCC----CcEEEecCCCCcccce
Q 045821          145 MNTWT---DAAPMS---------TARCYFPCGVLNQKIYCIGGLGDT---HSWDVYDPRT----NNWKLHTEPNIFTEIE  205 (352)
Q Consensus       145 t~~W~---~~~~~~---------~~r~~~~~~~~~~~iyv~gG~~~~---~~v~~yd~~t----~~W~~~~~~~~~~~~~  205 (352)
                      +++-.   .++...         .+......++-++-++|+-.....   -.+-..|+.+    .+|..-  . ..+...
T Consensus        98 t~~v~~~~~L~~A~~~n~~~y~~~~~t~iD~AvDE~GLWvIYat~~~~g~ivvskld~~tL~v~~tw~T~--~-~k~~~~  174 (250)
T PF02191_consen   98 TRSVVARRELPGAGYNNRFPYYWSGYTDIDFAVDENGLWVIYATEDNNGNIVVSKLDPETLSVEQTWNTS--Y-PKRSAG  174 (250)
T ss_pred             CCcEEEEEECCccccccccceecCCCceEEEEEcCCCEEEEEecCCCCCcEEEEeeCcccCceEEEEEec--c-Cchhhc
Confidence            98744   444321         112223444445556666443322   3334556654    357642  2 223333


Q ss_pred             eEEEECCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcE-E---EECCEEEEEecCCCCeEEEEeCC
Q 045821          206 DSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPA-V---VVDDALYVLDQSSGTKLMMWQKE  278 (352)
Q Consensus       206 ~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~-~---~~~~~l~~~gg~~~~~v~~yd~~  278 (352)
                      .++.+=|.||++....... ..-.++||+.+++=..+..+...+....+ +   ..+.+||+...-   .+..|+..
T Consensus       175 naFmvCGvLY~~~s~~~~~-~~I~yafDt~t~~~~~~~i~f~~~~~~~~~l~YNP~dk~LY~wd~G---~~v~Y~v~  247 (250)
T PF02191_consen  175 NAFMVCGVLYATDSYDTRD-TEIFYAFDTYTGKEEDVSIPFPNPYGNISMLSYNPRDKKLYAWDNG---YQVTYDVR  247 (250)
T ss_pred             ceeeEeeEEEEEEECCCCC-cEEEEEEECCCCceeceeeeeccccCceEeeeECCCCCeEEEEECC---eEEEEEEE
Confidence            4677789999996554433 44567899998876655444444433333 2   247889988643   67777653


No 118
>PRK03629 tolB translocation protein TolB; Provisional
Probab=94.28  E-value=4.7  Score=37.75  Aligned_cols=148  Identities=9%  Similarity=-0.030  Sum_probs=80.0

Q ss_pred             CCeEEEEECCCCeeEecCCCCCcceeeeeEEECC-EEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECCE
Q 045821          135 TSEVYCYDASMNTWTDAAPMSTARCYFPCGVLNQ-KIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGK  213 (352)
Q Consensus       135 ~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~~~-~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~  213 (352)
                      ...++++|..+++-+.+...+..-.... ..-++ +|++.........++.+|..++..+++..... ........-+|+
T Consensus       222 ~~~i~i~dl~~G~~~~l~~~~~~~~~~~-~SPDG~~La~~~~~~g~~~I~~~d~~tg~~~~lt~~~~-~~~~~~wSPDG~  299 (429)
T PRK03629        222 RSALVIQTLANGAVRQVASFPRHNGAPA-FSPDGSKLAFALSKTGSLNLYVMDLASGQIRQVTDGRS-NNTEPTWFPDSQ  299 (429)
T ss_pred             CcEEEEEECCCCCeEEccCCCCCcCCeE-ECCCCCEEEEEEcCCCCcEEEEEECCCCCEEEccCCCC-CcCceEECCCCC
Confidence            3578999998887766665433211111 11144 56554333322578999999988877653321 111122233555


Q ss_pred             EEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEEECC-EEEEEecCCC-CeEEEEeCCCCceEEccC
Q 045821          214 IYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVVVDD-ALYVLDQSSG-TKLMMWQKESREWSPVGR  287 (352)
Q Consensus       214 iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~-~l~~~gg~~~-~~v~~yd~~~~~W~~~~~  287 (352)
                      ..++....  .....++.+|..+..-+.+.... .........-+| .|++.+...+ ..++++|++++.+..+..
T Consensus       300 ~I~f~s~~--~g~~~Iy~~d~~~g~~~~lt~~~-~~~~~~~~SpDG~~Ia~~~~~~g~~~I~~~dl~~g~~~~Lt~  372 (429)
T PRK03629        300 NLAYTSDQ--AGRPQVYKVNINGGAPQRITWEG-SQNQDADVSSDGKFMVMVSSNGGQQHIAKQDLATGGVQVLTD  372 (429)
T ss_pred             EEEEEeCC--CCCceEEEEECCCCCeEEeecCC-CCccCEEECCCCCEEEEEEccCCCceEEEEECCCCCeEEeCC
Confidence            33332222  11236888888877666653211 111122233344 4545444332 478999999999887763


No 119
>PRK03629 tolB translocation protein TolB; Provisional
Probab=94.23  E-value=4.8  Score=37.67  Aligned_cols=188  Identities=8%  Similarity=0.008  Sum_probs=94.9

Q ss_pred             eeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECC-EEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcceee
Q 045821           83 VCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGK-NAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCYF  161 (352)
Q Consensus        83 ~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~-~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~~  161 (352)
                      ..++.+|.  .++.-..+...+.    .........++ .|++.....    ...+++.+|..+++.+.+..........
T Consensus       223 ~~i~i~dl--~~G~~~~l~~~~~----~~~~~~~SPDG~~La~~~~~~----g~~~I~~~d~~tg~~~~lt~~~~~~~~~  292 (429)
T PRK03629        223 SALVIQTL--ANGAVRQVASFPR----HNGAPAFSPDGSKLAFALSKT----GSLNLYVMDLASGQIRQVTDGRSNNTEP  292 (429)
T ss_pred             cEEEEEEC--CCCCeEEccCCCC----CcCCeEECCCCCEEEEEEcCC----CCcEEEEEECCCCCEEEccCCCCCcCce
Confidence            45666677  6666555554431    11111222244 455543321    2346999999988877765433211111


Q ss_pred             eeEEECCE-EEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCcee
Q 045821          162 PCGVLNQK-IYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWL  240 (352)
Q Consensus       162 ~~~~~~~~-iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~  240 (352)
                       ...-+++ |+..........++.+|..+..-+++.... .........-+|+..++.+...  ....++.+|+.++.++
T Consensus       293 -~wSPDG~~I~f~s~~~g~~~Iy~~d~~~g~~~~lt~~~-~~~~~~~~SpDG~~Ia~~~~~~--g~~~I~~~dl~~g~~~  368 (429)
T PRK03629        293 -TWFPDSQNLAYTSDQAGRPQVYKVNINGGAPQRITWEG-SQNQDADVSSDGKFMVMVSSNG--GQQHIAKQDLATGGVQ  368 (429)
T ss_pred             -EECCCCCEEEEEeCCCCCceEEEEECCCCCeEEeecCC-CCccCEEECCCCCEEEEEEccC--CCceEEEEECCCCCeE
Confidence             1111444 443332222257888898887766664221 1111222333554444433322  1236888999999888


Q ss_pred             ecCCCCCCCCCCcEEEECCEEEEEecCCC--CeEEEEeCCCCceEEcc
Q 045821          241 HADANMASGWRGPAVVVDDALYVLDQSSG--TKLMMWQKESREWSPVG  286 (352)
Q Consensus       241 ~~~~~~~~~~~~~~~~~~~~l~~~gg~~~--~~v~~yd~~~~~W~~~~  286 (352)
                      .+....  ........-+|+.+++.+..+  ..++..+.+...=..+.
T Consensus       369 ~Lt~~~--~~~~p~~SpDG~~i~~~s~~~~~~~l~~~~~~G~~~~~l~  414 (429)
T PRK03629        369 VLTDTF--LDETPSIAPNGTMVIYSSSQGMGSVLNLVSTDGRFKARLP  414 (429)
T ss_pred             EeCCCC--CCCCceECCCCCEEEEEEcCCCceEEEEEECCCCCeEECc
Confidence            876421  112334555666555544333  34667777655444443


No 120
>PRK04043 tolB translocation protein TolB; Provisional
Probab=94.09  E-value=5.1  Score=37.40  Aligned_cols=190  Identities=9%  Similarity=-0.037  Sum_probs=106.8

Q ss_pred             ceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECC-EEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCccee
Q 045821           82 RVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGK-NAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCY  160 (352)
Q Consensus        82 ~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~-~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~  160 (352)
                      ...+|.+|.  .++.=+.+...+.    .........++ +|.+.....    ...++|.+|..+++++.+...+..-..
T Consensus       212 ~~~Iyv~dl--~tg~~~~lt~~~g----~~~~~~~SPDG~~la~~~~~~----g~~~Iy~~dl~~g~~~~LT~~~~~d~~  281 (419)
T PRK04043        212 KPTLYKYNL--YTGKKEKIASSQG----MLVVSDVSKDGSKLLLTMAPK----GQPDIYLYDTNTKTLTQITNYPGIDVN  281 (419)
T ss_pred             CCEEEEEEC--CCCcEEEEecCCC----cEEeeEECCCCCEEEEEEccC----CCcEEEEEECCCCcEEEcccCCCccCc
Confidence            458899999  8887777765431    11111223344 555544322    247899999999999888754431111


Q ss_pred             eeeEEECCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECCEEEEEccCCCCC----CceEEEEEeCCC
Q 045821          161 FPCGVLNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSASAAT----SHVCALVYEPST  236 (352)
Q Consensus       161 ~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~----~~~~i~~yd~~~  236 (352)
                      .....-+.+|++.........++++|..+++.+++..-..  . .....-+|+..++-......    ....++.+|+.+
T Consensus       282 p~~SPDG~~I~F~Sdr~g~~~Iy~~dl~~g~~~rlt~~g~--~-~~~~SPDG~~Ia~~~~~~~~~~~~~~~~I~v~d~~~  358 (419)
T PRK04043        282 GNFVEDDKRIVFVSDRLGYPNIFMKKLNSGSVEQVVFHGK--N-NSSVSTYKNYIVYSSRETNNEFGKNTFNLYLISTNS  358 (419)
T ss_pred             cEECCCCCEEEEEECCCCCceEEEEECCCCCeEeCccCCC--c-CceECCCCCEEEEEEcCCCcccCCCCcEEEEEECCC
Confidence            1111113467776654433789999999998877653211  1 12333355433332322211    224789999999


Q ss_pred             CceeecCCCCCCCCCCcEEEECCE-EEEEecCCC-CeEEEEeCCCCceEEcc
Q 045821          237 DSWLHADANMASGWRGPAVVVDDA-LYVLDQSSG-TKLMMWQKESREWSPVG  286 (352)
Q Consensus       237 ~~W~~~~~~~~~~~~~~~~~~~~~-l~~~gg~~~-~~v~~yd~~~~~W~~~~  286 (352)
                      +.++.+....  ........-+|+ |++.....+ ..++.++.+.+.=..++
T Consensus       359 g~~~~LT~~~--~~~~p~~SPDG~~I~f~~~~~~~~~L~~~~l~g~~~~~l~  408 (419)
T PRK04043        359 DYIRRLTANG--VNQFPRFSSDGGSIMFIKYLGNQSALGIIRLNYNKSFLFP  408 (419)
T ss_pred             CCeEECCCCC--CcCCeEECCCCCEEEEEEccCCcEEEEEEecCCCeeEEee
Confidence            9998886531  222334444555 544443322 35788888765544444


No 121
>KOG0316 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=94.09  E-value=3.1  Score=34.87  Aligned_cols=176  Identities=10%  Similarity=0.037  Sum_probs=101.2

Q ss_pred             CCeEEEEECCCCeeEecCCCCCcceeeeeEEEC--CEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECC
Q 045821          135 TSEVYCYDASMNTWTDAAPMSTARCYFPCGVLN--QKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDG  212 (352)
Q Consensus       135 ~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~~--~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g  212 (352)
                      .+.+.+||..|++-.+--.  .--..-..+.+|  ..+.+.|+.+  .++.++|..+++.+++..+..+.....++.+.+
T Consensus        80 Dk~v~vwDV~TGkv~Rr~r--gH~aqVNtV~fNeesSVv~SgsfD--~s~r~wDCRS~s~ePiQildea~D~V~Si~v~~  155 (307)
T KOG0316|consen   80 DKAVQVWDVNTGKVDRRFR--GHLAQVNTVRFNEESSVVASGSFD--SSVRLWDCRSRSFEPIQILDEAKDGVSSIDVAE  155 (307)
T ss_pred             CceEEEEEcccCeeeeecc--cccceeeEEEecCcceEEEecccc--ceeEEEEcccCCCCccchhhhhcCceeEEEecc
Confidence            4678999999986322110  000111223343  3566666666  789999999999988887777777777888888


Q ss_pred             EEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEEECCEEEEEecCCCCeEEEEeCCCCceEEc--cCCCC
Q 045821          213 KIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVVVDDALYVLDQSSGTKLMMWQKESREWSPV--GRLST  290 (352)
Q Consensus       213 ~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~--~~~~~  290 (352)
                      ...+.|..++     .+-.||...++-..  +-+..+.....+..++...++|..++ .+...|-+++.--+.  +....
T Consensus       156 heIvaGS~DG-----tvRtydiR~G~l~s--Dy~g~pit~vs~s~d~nc~La~~l~s-tlrLlDk~tGklL~sYkGhkn~  227 (307)
T KOG0316|consen  156 HEIVAGSVDG-----TVRTYDIRKGTLSS--DYFGHPITSVSFSKDGNCSLASSLDS-TLRLLDKETGKLLKSYKGHKNM  227 (307)
T ss_pred             cEEEeeccCC-----cEEEEEeecceeeh--hhcCCcceeEEecCCCCEEEEeeccc-eeeecccchhHHHHHhcccccc
Confidence            8888865553     46678876554321  11111222222344666666665544 788888877653211  11111


Q ss_pred             CcCCCCceEEEeCCEEEEEcCCc--eEEEEEcCCccc
Q 045821          291 LLTRPPCKLVAIGKTIFVIGKGC--SAVVIDVGNIGN  325 (352)
Q Consensus       291 ~~~~~~~~~~~~~~~i~v~gG~~--~~~~~d~~~~~w  325 (352)
                      . .+  ..+...+..-.|++|..  .++++|.....-
T Consensus       228 e-yk--ldc~l~qsdthV~sgSEDG~Vy~wdLvd~~~  261 (307)
T KOG0316|consen  228 E-YK--LDCCLNQSDTHVFSGSEDGKVYFWDLVDETQ  261 (307)
T ss_pred             e-ee--eeeeecccceeEEeccCCceEEEEEecccee
Confidence            0 11  12223344456677765  366788776553


No 122
>KOG0286 consensus G-protein beta subunit [General function prediction only]
Probab=94.00  E-value=3.7  Score=35.48  Aligned_cols=204  Identities=12%  Similarity=0.141  Sum_probs=100.4

Q ss_pred             CCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCC----CCCcceeeeeEEE-C-CEEEEEeCCCCCCceEEEeCCCCcE
Q 045821          119 GKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAP----MSTARCYFPCGVL-N-QKIYCIGGLGDTHSWDVYDPRTNNW  192 (352)
Q Consensus       119 ~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~----~~~~r~~~~~~~~-~-~~iyv~gG~~~~~~v~~yd~~t~~W  192 (352)
                      .+.....||.+      +.+-+|+..++.=+....    ++.-..+.+++.+ + ++|.--.| +  ..+-.+|+++.+=
T Consensus       108 Sg~~VAcGGLd------N~Csiy~ls~~d~~g~~~v~r~l~gHtgylScC~f~dD~~ilT~SG-D--~TCalWDie~g~~  178 (343)
T KOG0286|consen  108 SGNFVACGGLD------NKCSIYPLSTRDAEGNVRVSRELAGHTGYLSCCRFLDDNHILTGSG-D--MTCALWDIETGQQ  178 (343)
T ss_pred             CCCeEEecCcC------ceeEEEecccccccccceeeeeecCccceeEEEEEcCCCceEecCC-C--ceEEEEEcccceE
Confidence            56666778764      346788887653332222    2222334444444 3 34433333 2  6778899998753


Q ss_pred             EEecCCCCcccceeEEE---ECCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEEECCEEEEEecCCC
Q 045821          193 KLHTEPNIFTEIEDSFV---MDGKIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVVVDDALYVLDQSSG  269 (352)
Q Consensus       193 ~~~~~~~~~~~~~~~~~---~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~  269 (352)
                      ..  ..........+..   .+++.||.||.+.     ....+|.....-.+.=.-...........-+|.-+..|..++
T Consensus       179 ~~--~f~GH~gDV~slsl~p~~~ntFvSg~cD~-----~aklWD~R~~~c~qtF~ghesDINsv~ffP~G~afatGSDD~  251 (343)
T KOG0286|consen  179 TQ--VFHGHTGDVMSLSLSPSDGNTFVSGGCDK-----SAKLWDVRSGQCVQTFEGHESDINSVRFFPSGDAFATGSDDA  251 (343)
T ss_pred             EE--EecCCcccEEEEecCCCCCCeEEeccccc-----ceeeeeccCcceeEeecccccccceEEEccCCCeeeecCCCc
Confidence            22  1211111111111   2789999988774     234445544432221000000001111233566666665554


Q ss_pred             CeEEEEeCCCCceEEccCCCCCcCCCCceEEEeCCEEEEEcCCc-eEEEEEcCCcccccceEEeccCCCCCCCCcceeeE
Q 045821          270 TKLMMWQKESREWSPVGRLSTLLTRPPCKLVAIGKTIFVIGKGC-SAVVIDVGNIGNIGGIMVSSSIPKLNDNDDIISCK  348 (352)
Q Consensus       270 ~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~~~i~v~gG~~-~~~~~d~~~~~w~~~~~~~~~~~~~~~~~~~~~~~  348 (352)
                       ..-.||+..++=-.+=.-+..........+...|+++..|..+ ++.+.|.-+         -.++-.|..|+|-++|-
T Consensus       252 -tcRlyDlRaD~~~a~ys~~~~~~gitSv~FS~SGRlLfagy~d~~c~vWDtlk---------~e~vg~L~GHeNRvScl  321 (343)
T KOG0286|consen  252 -TCRLYDLRADQELAVYSHDSIICGITSVAFSKSGRLLFAGYDDFTCNVWDTLK---------GERVGVLAGHENRVSCL  321 (343)
T ss_pred             -eeEEEeecCCcEEeeeccCcccCCceeEEEcccccEEEeeecCCceeEeeccc---------cceEEEeeccCCeeEEE
Confidence             7788888876533222222111122223344578888877543 344455443         33455667888888763


No 123
>PRK02889 tolB translocation protein TolB; Provisional
Probab=93.82  E-value=5.8  Score=37.12  Aligned_cols=146  Identities=12%  Similarity=0.027  Sum_probs=77.0

Q ss_pred             CCeEEEEECCCCeeEecCCCCCcceeeeeEE-ECC-EEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECC
Q 045821          135 TSEVYCYDASMNTWTDAAPMSTARCYFPCGV-LNQ-KIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDG  212 (352)
Q Consensus       135 ~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~-~~~-~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g  212 (352)
                      ...++++|..+++-..+...+...  .+.+. -++ +|++....+....++.+|..+...+++.... .........-+|
T Consensus       219 ~~~I~~~dl~~g~~~~l~~~~g~~--~~~~~SPDG~~la~~~~~~g~~~Iy~~d~~~~~~~~lt~~~-~~~~~~~wSpDG  295 (427)
T PRK02889        219 KPVVYVHDLATGRRRVVANFKGSN--SAPAWSPDGRTLAVALSRDGNSQIYTVNADGSGLRRLTQSS-GIDTEPFFSPDG  295 (427)
T ss_pred             CcEEEEEECCCCCEEEeecCCCCc--cceEECCCCCEEEEEEccCCCceEEEEECCCCCcEECCCCC-CCCcCeEEcCCC
Confidence            357999999988766655433211  11111 144 5554444333367888898877766664321 111122233356


Q ss_pred             EEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEEECC-EEEEEecCCC-CeEEEEeCCCCceEEcc
Q 045821          213 KIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVVVDD-ALYVLDQSSG-TKLMMWQKESREWSPVG  286 (352)
Q Consensus       213 ~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~-~l~~~gg~~~-~~v~~yd~~~~~W~~~~  286 (352)
                      +..++....  .....++.++..+...+.+.... .........-+| .|++.....+ ..++++|..++....+.
T Consensus       296 ~~l~f~s~~--~g~~~Iy~~~~~~g~~~~lt~~g-~~~~~~~~SpDG~~Ia~~s~~~g~~~I~v~d~~~g~~~~lt  368 (427)
T PRK02889        296 RSIYFTSDR--GGAPQIYRMPASGGAAQRVTFTG-SYNTSPRISPDGKLLAYISRVGGAFKLYVQDLATGQVTALT  368 (427)
T ss_pred             CEEEEEecC--CCCcEEEEEECCCCceEEEecCC-CCcCceEECCCCCEEEEEEccCCcEEEEEEECCCCCeEEcc
Confidence            533332222  12246888888777776664211 111122333344 4554443322 36899999988877764


No 124
>KOG0266 consensus WD40 repeat-containing protein [General function prediction only]
Probab=93.55  E-value=6.8  Score=37.04  Aligned_cols=178  Identities=17%  Similarity=0.119  Sum_probs=89.4

Q ss_pred             CCeEEEEECCCC-e-eEecCCCCCcceeeeeEEE-CCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEEC
Q 045821          135 TSEVYCYDASMN-T-WTDAAPMSTARCYFPCGVL-NQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMD  211 (352)
Q Consensus       135 ~~~~~~~d~~t~-~-W~~~~~~~~~r~~~~~~~~-~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~  211 (352)
                      ...+.+||...+ + -+.+.......  ++++.. .+++++.|+.+  ..|.++|..+.+-.+.-......-...++.-+
T Consensus       224 D~tiriwd~~~~~~~~~~l~gH~~~v--~~~~f~p~g~~i~Sgs~D--~tvriWd~~~~~~~~~l~~hs~~is~~~f~~d  299 (456)
T KOG0266|consen  224 DKTLRIWDLKDDGRNLKTLKGHSTYV--TSVAFSPDGNLLVSGSDD--GTVRIWDVRTGECVRKLKGHSDGISGLAFSPD  299 (456)
T ss_pred             CceEEEeeccCCCeEEEEecCCCCce--EEEEecCCCCEEEEecCC--CcEEEEeccCCeEEEeeeccCCceEEEEECCC
Confidence            356788888433 2 34444434333  232222 45888888877  78999999986544332111111112233347


Q ss_pred             CEEEEEccCCCCCCceEEEEEeCCCCcee---ecCCCCCCCCCCcEEEECCEEEEEecCCCCeEEEEeCCCCceE-EccC
Q 045821          212 GKIYIRCSASAATSHVCALVYEPSTDSWL---HADANMASGWRGPAVVVDDALYVLDQSSGTKLMMWQKESREWS-PVGR  287 (352)
Q Consensus       212 g~iyv~GG~~~~~~~~~i~~yd~~~~~W~---~~~~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~-~~~~  287 (352)
                      +.+++.+..+     ..+..||..+..-.   .+................+..|++.+..+..+-.||.....-- +...
T Consensus       300 ~~~l~s~s~d-----~~i~vwd~~~~~~~~~~~~~~~~~~~~~~~~~fsp~~~~ll~~~~d~~~~~w~l~~~~~~~~~~~  374 (456)
T KOG0266|consen  300 GNLLVSASYD-----GTIRVWDLETGSKLCLKLLSGAENSAPVTSVQFSPNGKYLLSASLDRTLKLWDLRSGKSVGTYTG  374 (456)
T ss_pred             CCEEEEcCCC-----ccEEEEECCCCceeeeecccCCCCCCceeEEEECCCCcEEEEecCCCeEEEEEccCCcceeeecc
Confidence            7888876443     25778888887743   2222111111111222244455555555557888888754322 1111


Q ss_pred             CCCCcCCCCceEE-EeCCEEEEEcCCc-eEEEEEcCC
Q 045821          288 LSTLLTRPPCKLV-AIGKTIFVIGKGC-SAVVIDVGN  322 (352)
Q Consensus       288 ~~~~~~~~~~~~~-~~~~~i~v~gG~~-~~~~~d~~~  322 (352)
                      .... .+..+..+ ..+++..+.|+.+ .+.++|+..
T Consensus       375 ~~~~-~~~~~~~~~~~~~~~i~sg~~d~~v~~~~~~s  410 (456)
T KOG0266|consen  375 HSNL-VRCIFSPTLSTGGKLIYSGSEDGSVYVWDSSS  410 (456)
T ss_pred             cCCc-ceeEecccccCCCCeEEEEeCCceEEEEeCCc
Confidence            1111 11222222 2355555555444 355788775


No 125
>PF03178 CPSF_A:  CPSF A subunit region;  InterPro: IPR004871 This family includes a region that lies towards the C terminus of the cleavage and polyadenylation specificity factor (CPSF) A (160 kDa) subunit. CPSF is involved in mRNA polyadenylation and binds the AAUAAA conserved sequence in pre-mRNA. CPSF has also been found to be necessary for splicing of single-intron pre-mRNAs []. The function of the aligned region is unknown but may be involved in RNA/DNA binding.; GO: 0003676 nucleic acid binding, 0005634 nucleus; PDB: 2B5M_A 4A0K_C 4A0B_C 3I7L_A 3I8E_A 4A09_A 4A0A_A 3EI4_C 2B5L_A 3I7O_A ....
Probab=93.53  E-value=3  Score=37.27  Aligned_cols=122  Identities=13%  Similarity=0.158  Sum_probs=69.0

Q ss_pred             CceEEEeCCCC-----cEEEecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCc-eeecCCCCCCCCCCc
Q 045821          180 HSWDVYDPRTN-----NWKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDS-WLHADANMASGWRGP  253 (352)
Q Consensus       180 ~~v~~yd~~t~-----~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~-W~~~~~~~~~~~~~~  253 (352)
                      ..+.+|+....     +.+.+........-.+.+.++|+|.+..|.       .+..|+...++ +..............
T Consensus        62 Gri~v~~i~~~~~~~~~l~~i~~~~~~g~V~ai~~~~~~lv~~~g~-------~l~v~~l~~~~~l~~~~~~~~~~~i~s  134 (321)
T PF03178_consen   62 GRILVFEISESPENNFKLKLIHSTEVKGPVTAICSFNGRLVVAVGN-------KLYVYDLDNSKTLLKKAFYDSPFYITS  134 (321)
T ss_dssp             EEEEEEEECSS-----EEEEEEEEEESS-EEEEEEETTEEEEEETT-------EEEEEEEETTSSEEEEEEE-BSSSEEE
T ss_pred             cEEEEEEEEcccccceEEEEEEEEeecCcceEhhhhCCEEEEeecC-------EEEEEEccCcccchhhheecceEEEEE
Confidence            45788888874     555554332223334566779996665442       47788877777 766643222222222


Q ss_pred             EEEECCEEEEEecCCCCeEEEEeCCCCceEEccCCCCCcCCCCceEEEe-CCEEEEEc
Q 045821          254 AVVVDDALYVLDQSSGTKLMMWQKESREWSPVGRLSTLLTRPPCKLVAI-GKTIFVIG  310 (352)
Q Consensus       254 ~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~-~~~i~v~g  310 (352)
                      ..+.++.|++-.-..+-.++.|+.+..+...++.-..  .+...++..+ ++..++.+
T Consensus       135 l~~~~~~I~vgD~~~sv~~~~~~~~~~~l~~va~d~~--~~~v~~~~~l~d~~~~i~~  190 (321)
T PF03178_consen  135 LSVFKNYILVGDAMKSVSLLRYDEENNKLILVARDYQ--PRWVTAAEFLVDEDTIIVG  190 (321)
T ss_dssp             EEEETTEEEEEESSSSEEEEEEETTTE-EEEEEEESS---BEEEEEEEE-SSSEEEEE
T ss_pred             EeccccEEEEEEcccCEEEEEEEccCCEEEEEEecCC--CccEEEEEEecCCcEEEEE
Confidence            3456887776655555567788987777888875433  2334455555 54444444


No 126
>PLN02919 haloacid dehalogenase-like hydrolase family protein
Probab=93.20  E-value=13  Score=39.24  Aligned_cols=150  Identities=15%  Similarity=0.149  Sum_probs=83.4

Q ss_pred             CCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCC----------CC-c--ceeeeeEEE-C-CEEEEEeCCCCCCceE
Q 045821          119 GKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPM----------ST-A--RCYFPCGVL-N-QKIYCIGGLGDTHSWD  183 (352)
Q Consensus       119 ~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~----------~~-~--r~~~~~~~~-~-~~iyv~gG~~~~~~v~  183 (352)
                      ++.+|+...      ....+++||+.++....+..-          .. .  .....++.. + +.|||....+  +.+.
T Consensus       694 ~g~LyVad~------~~~~I~v~d~~~g~v~~~~G~G~~~~~~g~~~~~~~~~~P~GIavspdG~~LYVADs~n--~~Ir  765 (1057)
T PLN02919        694 NEKVYIAMA------GQHQIWEYNISDGVTRVFSGDGYERNLNGSSGTSTSFAQPSGISLSPDLKELYIADSES--SSIR  765 (1057)
T ss_pred             CCeEEEEEC------CCCeEEEEECCCCeEEEEecCCccccCCCCccccccccCccEEEEeCCCCEEEEEECCC--CeEE
Confidence            578888753      245688888887765433210          00 0  111123332 3 4699987654  7899


Q ss_pred             EEeCCCCcEEEec--CC--CC--------------cc-cc--eeEEEECCEEEEEccCCCCCCceEEEEEeCCCCceeec
Q 045821          184 VYDPRTNNWKLHT--EP--NI--------------FT-EI--EDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWLHA  242 (352)
Q Consensus       184 ~yd~~t~~W~~~~--~~--~~--------------~~-~~--~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~  242 (352)
                      +||+.++.-+.+.  ..  +.              .. ..  ..++.-+|.|||....+     ..|.+||+.++....+
T Consensus       766 v~D~~tg~~~~~~gg~~~~~~~l~~fG~~dG~g~~~~l~~P~Gvavd~dG~LYVADs~N-----~rIrviD~~tg~v~ti  840 (1057)
T PLN02919        766 ALDLKTGGSRLLAGGDPTFSDNLFKFGDHDGVGSEVLLQHPLGVLCAKDGQIYVADSYN-----HKIKKLDPATKRVTTL  840 (1057)
T ss_pred             EEECCCCcEEEEEecccccCcccccccCCCCchhhhhccCCceeeEeCCCcEEEEECCC-----CEEEEEECCCCeEEEE
Confidence            9999876543221  00  00              00 01  11233467899985332     3689999988877655


Q ss_pred             CCCC----------CCCCCCc---EEEECCEEEEEecCCCCeEEEEeCCCCce
Q 045821          243 DANM----------ASGWRGP---AVVVDDALYVLDQSSGTKLMMWQKESREW  282 (352)
Q Consensus       243 ~~~~----------~~~~~~~---~~~~~~~l~~~gg~~~~~v~~yd~~~~~W  282 (352)
                      ....          ......+   ++.-+|++|+....++ .|.++|..+++-
T Consensus       841 aG~G~~G~~dG~~~~a~l~~P~GIavd~dG~lyVaDt~Nn-~Irvid~~~~~~  892 (1057)
T PLN02919        841 AGTGKAGFKDGKALKAQLSEPAGLALGENGRLFVADTNNS-LIRYLDLNKGEA  892 (1057)
T ss_pred             eccCCcCCCCCcccccccCCceEEEEeCCCCEEEEECCCC-EEEEEECCCCcc
Confidence            3211          0111111   2334688988876554 899999988753


No 127
>PLN00033 photosystem II stability/assembly factor; Provisional
Probab=93.15  E-value=7.1  Score=36.09  Aligned_cols=216  Identities=12%  Similarity=0.074  Sum_probs=106.4

Q ss_pred             CCCCeeEecc-CCCcc-cCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCC-CcceeeeeEEE-CC
Q 045821           93 TRRSWKMIHE-LPART-LRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMS-TARCYFPCGVL-NQ  168 (352)
Q Consensus        93 ~~~~w~~~~~-~~~~~-~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~-~~r~~~~~~~~-~~  168 (352)
                      ...+|.+... ..... ...+...+...++..|++|-.       ..++.=.-.-.+|+.++..+ .+......... ++
T Consensus       118 GG~tW~~~~~~~~~~~~~~~~l~~v~f~~~~g~~vG~~-------G~il~T~DgG~tW~~~~~~~~~p~~~~~i~~~~~~  190 (398)
T PLN00033        118 GGKTWVPRSIPSAEDEDFNYRFNSISFKGKEGWIIGKP-------AILLHTSDGGETWERIPLSPKLPGEPVLIKATGPK  190 (398)
T ss_pred             CCCCceECccCcccccccccceeeeEEECCEEEEEcCc-------eEEEEEcCCCCCceECccccCCCCCceEEEEECCC
Confidence            5788988642 11100 112234445557778887531       12222122235799876432 12112223334 45


Q ss_pred             EEEEEeCCCCCCceEEEeCCCCcEEEecCCCCc------cc--------c----ee-EEEECCEEEEEccCCCCCCceEE
Q 045821          169 KIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIF------TE--------I----ED-SFVMDGKIYIRCSASAATSHVCA  229 (352)
Q Consensus       169 ~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~------~~--------~----~~-~~~~~g~iyv~GG~~~~~~~~~i  229 (352)
                      .++++|..   ..+++=+-.-.+|+.+...+..      ..        +    .. ...-++.++++|-..      .+
T Consensus       191 ~~~ivg~~---G~v~~S~D~G~tW~~~~~~t~~~~l~~~~~s~~~g~~~y~Gsf~~v~~~~dG~~~~vg~~G------~~  261 (398)
T PLN00033        191 SAEMVTDE---GAIYVTSNAGRNWKAAVEETVSATLNRTVSSGISGASYYTGTFSTVNRSPDGDYVAVSSRG------NF  261 (398)
T ss_pred             ceEEEecc---ceEEEECCCCCCceEcccccccccccccccccccccceeccceeeEEEcCCCCEEEEECCc------cE
Confidence            67777743   3344444445689987322110      00        0    00 112245566663211      23


Q ss_pred             E-EEeCCCCceeecCCCCCCCCCCcEEEECCEEEEEecCCCCeEEEEeCCCCce-----EEccCCCCCcCCCCceEEE-e
Q 045821          230 L-VYEPSTDSWLHADANMASGWRGPAVVVDDALYVLDQSSGTKLMMWQKESREW-----SPVGRLSTLLTRPPCKLVA-I  302 (352)
Q Consensus       230 ~-~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~W-----~~~~~~~~~~~~~~~~~~~-~  302 (352)
                      + ..|.....|+.+..+............++.+++++...  .+..-+.....|     .++.....  .....++.. -
T Consensus       262 ~~s~d~G~~~W~~~~~~~~~~l~~v~~~~dg~l~l~g~~G--~l~~S~d~G~~~~~~~f~~~~~~~~--~~~l~~v~~~~  337 (398)
T PLN00033        262 YLTWEPGQPYWQPHNRASARRIQNMGWRADGGLWLLTRGG--GLYVSKGTGLTEEDFDFEEADIKSR--GFGILDVGYRS  337 (398)
T ss_pred             EEecCCCCcceEEecCCCccceeeeeEcCCCCEEEEeCCc--eEEEecCCCCcccccceeecccCCC--CcceEEEEEcC
Confidence            3 34444445999877655555555556688888887543  455555555545     44432211  111223332 3


Q ss_pred             CCEEEEEcCCceEEEEEcCCcccccc
Q 045821          303 GKTIFVIGKGCSAVVIDVGNIGNIGG  328 (352)
Q Consensus       303 ~~~i~v~gG~~~~~~~d~~~~~w~~~  328 (352)
                      ++.+++.|..--.+.-...-.+|.+.
T Consensus       338 d~~~~a~G~~G~v~~s~D~G~tW~~~  363 (398)
T PLN00033        338 KKEAWAAGGSGILLRSTDGGKSWKRD  363 (398)
T ss_pred             CCcEEEEECCCcEEEeCCCCcceeEc
Confidence            66788888654444445556678774


No 128
>KOG0646 consensus WD40 repeat protein [General function prediction only]
Probab=93.05  E-value=3.8  Score=37.58  Aligned_cols=199  Identities=12%  Similarity=0.087  Sum_probs=92.6

Q ss_pred             eEEEEECCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcceeeeeEE----ECCEEEEEeCCCCCCceEEEeCC
Q 045821          113 MGFEVLGKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCYFPCGV----LNQKIYCIGGLGDTHSWDVYDPR  188 (352)
Q Consensus       113 ~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~----~~~~iyv~gG~~~~~~v~~yd~~  188 (352)
                      ++++..+...|++||.     ...+++++...++.--.+-   .+++ ...+.    -|+..++.||.+  ..|.+|+..
T Consensus        85 ~al~s~n~G~~l~ag~-----i~g~lYlWelssG~LL~v~---~aHY-Q~ITcL~fs~dgs~iiTgskD--g~V~vW~l~  153 (476)
T KOG0646|consen   85 HALASSNLGYFLLAGT-----ISGNLYLWELSSGILLNVL---SAHY-QSITCLKFSDDGSHIITGSKD--GAVLVWLLT  153 (476)
T ss_pred             eeeecCCCceEEEeec-----ccCcEEEEEeccccHHHHH---Hhhc-cceeEEEEeCCCcEEEecCCC--ccEEEEEEE
Confidence            5566677778888872     3456888888887522211   2222 22222    267777777766  445554432


Q ss_pred             CC--cEEEecCCCCcccceeEEE----------ECCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEE
Q 045821          189 TN--NWKLHTEPNIFTEIEDSFV----------MDGKIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVV  256 (352)
Q Consensus       189 t~--~W~~~~~~~~~~~~~~~~~----------~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~  256 (352)
                      +-  .-..-.+.|......+...          .+.++|-.+ .+     ..+-+||.+.+.--.-- ..+......++-
T Consensus       154 ~lv~a~~~~~~~p~~~f~~HtlsITDl~ig~Gg~~~rl~TaS-~D-----~t~k~wdlS~g~LLlti-~fp~si~av~lD  226 (476)
T KOG0646|consen  154 DLVSADNDHSVKPLHIFSDHTLSITDLQIGSGGTNARLYTAS-ED-----RTIKLWDLSLGVLLLTI-TFPSSIKAVALD  226 (476)
T ss_pred             eecccccCCCccceeeeccCcceeEEEEecCCCccceEEEec-CC-----ceEEEEEeccceeeEEE-ecCCcceeEEEc
Confidence            10  0000011122222222211          133455542 11     25667787776432110 112222222222


Q ss_pred             ECCEEEEEecCCCCeEEEEeCCCCceEE--------------ccCCCCCcC-CCCceE-EEeCCEEEEEcCCceE-EEEE
Q 045821          257 VDDALYVLDQSSGTKLMMWQKESREWSP--------------VGRLSTLLT-RPPCKL-VAIGKTIFVIGKGCSA-VVID  319 (352)
Q Consensus       257 ~~~~l~~~gg~~~~~v~~yd~~~~~W~~--------------~~~~~~~~~-~~~~~~-~~~~~~i~v~gG~~~~-~~~d  319 (352)
                      ..+..+++|+..| .++..+..+-.=..              +..+-.... ..-.++ +..++.+++.|+.+.. .+.|
T Consensus       227 pae~~~yiGt~~G-~I~~~~~~~~~~~~~~v~~k~~~~~~t~~~~~~Gh~~~~~ITcLais~DgtlLlSGd~dg~VcvWd  305 (476)
T KOG0646|consen  227 PAERVVYIGTEEG-KIFQNLLFKLSGQSAGVNQKGRHEENTQINVLVGHENESAITCLAISTDGTLLLSGDEDGKVCVWD  305 (476)
T ss_pred             ccccEEEecCCcc-eEEeeehhcCCcccccccccccccccceeeeeccccCCcceeEEEEecCccEEEeeCCCCCEEEEe
Confidence            3445555666555 55555443211000              111111001 111222 3458999999988764 4799


Q ss_pred             cCCcccccceE
Q 045821          320 VGNIGNIGGIM  330 (352)
Q Consensus       320 ~~~~~w~~~~~  330 (352)
                      +...+-.+...
T Consensus       306 i~S~Q~iRtl~  316 (476)
T KOG0646|consen  306 IYSKQCIRTLQ  316 (476)
T ss_pred             cchHHHHHHHh
Confidence            98887666433


No 129
>smart00284 OLF Olfactomedin-like domains.
Probab=92.95  E-value=5.4  Score=34.19  Aligned_cols=192  Identities=16%  Similarity=0.200  Sum_probs=103.7

Q ss_pred             CCEEEEEEecCCCCceeEEEe----cCCCCCCCeeEeccCCCcccCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEEC
Q 045821           68 ETWIYAFCRDNKLERVCCYVL----DPNSTRRSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDA  143 (352)
Q Consensus        68 ~~~l~~~gg~~~~~~~~~~~~----d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~  143 (352)
                      ++.+|++.+.. .....++.|    |.  ......+.-.+|   .+..+.+.++.+|.+|.--      .....+.+||.
T Consensus        34 ~~~~wv~~~~~-~~~~~v~ey~~~~~f--~~~~~~~~~~Lp---~~~~GtG~VVYngslYY~~------~~s~~iiKydL  101 (255)
T smart00284       34 KSLYWYMPLNT-RVLRSVREYSSMSDF--QMGKNPTDHPLP---HAGQGTGVVVYNGSLYFNK------FNSHDICRFDL  101 (255)
T ss_pred             CceEEEEcccc-CCCcEEEEecCHHHH--hccCCceEEECC---CccccccEEEECceEEEEe------cCCccEEEEEC
Confidence            46788875532 122334444    22  333443333455   4566777888899998853      23577999999


Q ss_pred             CCCeeEecCCCCCc------------ceeeeeEEECCEEEEEeCCCCC---CceEEEeCCC----CcEEEecCCCCcccc
Q 045821          144 SMNTWTDAAPMSTA------------RCYFPCGVLNQKIYCIGGLGDT---HSWDVYDPRT----NNWKLHTEPNIFTEI  204 (352)
Q Consensus       144 ~t~~W~~~~~~~~~------------r~~~~~~~~~~~iyv~gG~~~~---~~v~~yd~~t----~~W~~~~~~~~~~~~  204 (352)
                      .+++-.....+|.+            ......++-++-++|+=.....   -.+-..|+.|    .+|..-  .+. +..
T Consensus       102 ~t~~v~~~~~Lp~a~y~~~~~Y~~~~~sdiDlAvDE~GLWvIYat~~~~g~ivvSkLnp~tL~ve~tW~T~--~~k-~sa  178 (255)
T smart00284      102 TTETYQKEPLLNGAGYNNRFPYAWGGFSDIDLAVDENGLWVIYATEQNAGKIVISKLNPATLTIENTWITT--YNK-RSA  178 (255)
T ss_pred             CCCcEEEEEecCccccccccccccCCCccEEEEEcCCceEEEEeccCCCCCEEEEeeCcccceEEEEEEcC--CCc-ccc
Confidence            99986433333322            1223344444445555332211   2234566654    357652  222 222


Q ss_pred             eeEEEECCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcE-E---EECCEEEEEecCCCCeEEEEeCC
Q 045821          205 EDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPA-V---VVDDALYVLDQSSGTKLMMWQKE  278 (352)
Q Consensus       205 ~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~-~---~~~~~l~~~gg~~~~~v~~yd~~  278 (352)
                      ..++.+=|.||++-... .....-.+.||+.+++=..+..+.+.+....+ +   ..+.+||+....   .+..||..
T Consensus       179 ~naFmvCGvLY~~~s~~-~~~~~I~yayDt~t~~~~~~~i~f~n~y~~~s~l~YNP~d~~LY~wdng---~~l~Y~v~  252 (255)
T smart00284      179 SNAFMICGILYVTRSLG-SKGEKVFYAYDTNTGKEGHLDIPFENMYEYISMLDYNPNDRKLYAWNNG---HLVHYDIA  252 (255)
T ss_pred             cccEEEeeEEEEEccCC-CCCcEEEEEEECCCCccceeeeeeccccccceeceeCCCCCeEEEEeCC---eEEEEEEE
Confidence            24566678999995322 12223567899988764333333333333322 2   247889988543   67777753


No 130
>PTZ00421 coronin; Provisional
Probab=92.89  E-value=9  Score=36.59  Aligned_cols=147  Identities=14%  Similarity=0.154  Sum_probs=68.8

Q ss_pred             CEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcc--cceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCcee-ecCC
Q 045821          168 QKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFT--EIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWL-HADA  244 (352)
Q Consensus       168 ~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~--~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~-~~~~  244 (352)
                      +.+++.||.+  ..+.+||..+.+-...  +....  .......-++.+.+.|+.++     .+..||+.+++-. .+..
T Consensus       138 ~~iLaSgs~D--gtVrIWDl~tg~~~~~--l~~h~~~V~sla~spdG~lLatgs~Dg-----~IrIwD~rsg~~v~tl~~  208 (493)
T PTZ00421        138 MNVLASAGAD--MVVNVWDVERGKAVEV--IKCHSDQITSLEWNLDGSLLCTTSKDK-----KLNIIDPRDGTIVSSVEA  208 (493)
T ss_pred             CCEEEEEeCC--CEEEEEECCCCeEEEE--EcCCCCceEEEEEECCCCEEEEecCCC-----EEEEEECCCCcEEEEEec
Confidence            3567777765  6788999987753221  11111  11122233677777766553     5788998766422 2211


Q ss_pred             CCCCCCCCcEE-EECCEEEEEecCC---CCeEEEEeCCCCc--eEEccCCCCCcCCCCceEEEeCCEEEEEcC--CceEE
Q 045821          245 NMASGWRGPAV-VVDDALYVLDQSS---GTKLMMWQKESRE--WSPVGRLSTLLTRPPCKLVAIGKTIFVIGK--GCSAV  316 (352)
Q Consensus       245 ~~~~~~~~~~~-~~~~~l~~~gg~~---~~~v~~yd~~~~~--W~~~~~~~~~~~~~~~~~~~~~~~i~v~gG--~~~~~  316 (352)
                       .........+ ..++..++..|..   ...+.+||..+..  .... ..... ..........++.++++||  ...+.
T Consensus       209 -H~~~~~~~~~w~~~~~~ivt~G~s~s~Dr~VklWDlr~~~~p~~~~-~~d~~-~~~~~~~~d~d~~~L~lggkgDg~Ir  285 (493)
T PTZ00421        209 -HASAKSQRCLWAKRKDLIITLGCSKSQQRQIMLWDTRKMASPYSTV-DLDQS-SALFIPFFDEDTNLLYIGSKGEGNIR  285 (493)
T ss_pred             -CCCCcceEEEEcCCCCeEEEEecCCCCCCeEEEEeCCCCCCceeEe-ccCCC-CceEEEEEcCCCCEEEEEEeCCCeEE
Confidence             1111111111 2233344444432   2479999987533  1111 11110 0000111122445555544  33566


Q ss_pred             EEEcCCcccc
Q 045821          317 VIDVGNIGNI  326 (352)
Q Consensus       317 ~~d~~~~~w~  326 (352)
                      +||+......
T Consensus       286 iwdl~~~~~~  295 (493)
T PTZ00421        286 CFELMNERLT  295 (493)
T ss_pred             EEEeeCCceE
Confidence            8998876543


No 131
>KOG0649 consensus WD40 repeat protein [General function prediction only]
Probab=92.87  E-value=5.2  Score=33.75  Aligned_cols=185  Identities=17%  Similarity=0.204  Sum_probs=95.1

Q ss_pred             eeEecCCCCC-----ccee-eeeEEECCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEE---CCEEEEE
Q 045821          147 TWTDAAPMST-----ARCY-FPCGVLNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVM---DGKIYIR  217 (352)
Q Consensus       147 ~W~~~~~~~~-----~r~~-~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~---~g~iyv~  217 (352)
                      .|+..+++..     |-.. +....-.+.|+..||.   ..++..|.++++.+..-  .....+-++++.   +++|+- 
T Consensus       100 lwe~~~P~~~~~~evPeINam~ldP~enSi~~AgGD---~~~y~~dlE~G~i~r~~--rGHtDYvH~vv~R~~~~qils-  173 (325)
T KOG0649|consen  100 LWEVKIPMQVDAVEVPEINAMWLDPSENSILFAGGD---GVIYQVDLEDGRIQREY--RGHTDYVHSVVGRNANGQILS-  173 (325)
T ss_pred             hhhhcCccccCcccCCccceeEeccCCCcEEEecCC---eEEEEEEecCCEEEEEE--cCCcceeeeeeecccCcceee-
Confidence            4777766543     3222 2233345788888874   56788999999876542  222333344433   345443 


Q ss_pred             ccCCCCCCceEEEEEeCCCCceeec-CC----CCCCC-CCC--cEEEECCEEEEEecCCCCeEEEEeCCCCceEEccCCC
Q 045821          218 CSASAATSHVCALVYEPSTDSWLHA-DA----NMASG-WRG--PAVVVDDALYVLDQSSGTKLMMWQKESREWSPVGRLS  289 (352)
Q Consensus       218 GG~~~~~~~~~i~~yd~~~~~W~~~-~~----~~~~~-~~~--~~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~  289 (352)
                      |+.++     .+-.+|..+.+-.++ .+    ...++ +..  .+...+...++.||-.  .+..|.....+-+.+-..|
T Consensus       174 G~EDG-----tvRvWd~kt~k~v~~ie~yk~~~~lRp~~g~wigala~~edWlvCGgGp--~lslwhLrsse~t~vfpip  246 (325)
T KOG0649|consen  174 GAEDG-----TVRVWDTKTQKHVSMIEPYKNPNLLRPDWGKWIGALAVNEDWLVCGGGP--KLSLWHLRSSESTCVFPIP  246 (325)
T ss_pred             cCCCc-----cEEEEeccccceeEEeccccChhhcCcccCceeEEEeccCceEEecCCC--ceeEEeccCCCceEEEecc
Confidence            33332     466788888776554 22    11111 222  2345566777777654  5777777776666665444


Q ss_pred             CCcCCCCceEEEeCCEEEEEcC-CceEEEEEcCCcccccceEEeccCCCCCCCC-cceeeEEeeC
Q 045821          290 TLLTRPPCKLVAIGKTIFVIGK-GCSAVVIDVGNIGNIGGIMVSSSIPKLNDND-DIISCKCLAI  352 (352)
Q Consensus       290 ~~~~~~~~~~~~~~~~i~v~gG-~~~~~~~d~~~~~w~~~~~~~~~~~~~~~~~-~~~~~~~~~~  352 (352)
                      ..     ...+.+.++..+++| .+.+.-|.....   -.+-++.+-|...... +.-+||+|.+
T Consensus       247 a~-----v~~v~F~~d~vl~~G~g~~v~~~~l~Gv---l~a~ip~~s~~c~s~s~~~~p~k~~s~  303 (325)
T KOG0649|consen  247 AR-----VHLVDFVDDCVLIGGEGNHVQSYTLNGV---LQANIPVESTACYSASWQTSPIKFISI  303 (325)
T ss_pred             cc-----eeEeeeecceEEEeccccceeeeeeccE---EEEeccCCccceeeecccCCceEEEEe
Confidence            31     233444455555554 333322322210   1222333334444443 5556787764


No 132
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=92.64  E-value=7  Score=34.72  Aligned_cols=245  Identities=11%  Similarity=0.097  Sum_probs=119.3

Q ss_pred             CCEEEEEEecCCCCceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEECCC-C
Q 045821           68 ETWIYAFCRDNKLERVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDASM-N  146 (352)
Q Consensus        68 ~~~l~~~gg~~~~~~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t-~  146 (352)
                      +..||++......+....|..|+  ..++.+.+...... ..-..+..+.-++...+..-+.     ...+-+|-..+ +
T Consensus        51 ~~~LY~v~~~~~~ggvaay~iD~--~~G~Lt~ln~~~~~-g~~p~yvsvd~~g~~vf~AnY~-----~g~v~v~p~~~dG  122 (346)
T COG2706          51 QRHLYVVNEPGEEGGVAAYRIDP--DDGRLTFLNRQTLP-GSPPCYVSVDEDGRFVFVANYH-----SGSVSVYPLQADG  122 (346)
T ss_pred             CCEEEEEEecCCcCcEEEEEEcC--CCCeEEEeeccccC-CCCCeEEEECCCCCEEEEEEcc-----CceEEEEEcccCC
Confidence            55799887665566788888999  88888877654421 1111332333333322222221     23444444433 1


Q ss_pred             e-eEec------CCCCCcc-----eeeeeEEECCEEEEEeCCCCCCceEEEeCCCCcEEEecC--CCCcccceeEEEE-C
Q 045821          147 T-WTDA------APMSTAR-----CYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTE--PNIFTEIEDSFVM-D  211 (352)
Q Consensus       147 ~-W~~~------~~~~~~r-----~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~--~~~~~~~~~~~~~-~  211 (352)
                      . |..+      ..-|.+|     ..++-..-++++.+.--... ..+..|+...+.-....+  .++...-.+.+++ +
T Consensus       123 ~l~~~v~~~~h~g~~p~~rQ~~~h~H~a~~tP~~~~l~v~DLG~-Dri~~y~~~dg~L~~~~~~~v~~G~GPRHi~FHpn  201 (346)
T COG2706         123 SLQPVVQVVKHTGSGPHERQESPHVHSANFTPDGRYLVVPDLGT-DRIFLYDLDDGKLTPADPAEVKPGAGPRHIVFHPN  201 (346)
T ss_pred             ccccceeeeecCCCCCCccccCCccceeeeCCCCCEEEEeecCC-ceEEEEEcccCccccccccccCCCCCcceEEEcCC
Confidence            1 2211      1112233     21222222443333221111 678888888665443322  2222222245555 4


Q ss_pred             C-EEEEEccCCCCCCceEEEEEeCCCCceeecCCC--CCCCCC----CcE--EEECCEEEEEecCCCCeEEEE--eCCCC
Q 045821          212 G-KIYIRCSASAATSHVCALVYEPSTDSWLHADAN--MASGWR----GPA--VVVDDALYVLDQSSGTKLMMW--QKESR  280 (352)
Q Consensus       212 g-~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~--~~~~~~----~~~--~~~~~~l~~~gg~~~~~v~~y--d~~~~  280 (352)
                      + ..|++..   -.....++.||....+.++++..  ++..+.    .++  +..+|+.+.......+.|.+|  |+.++
T Consensus       202 ~k~aY~v~E---L~stV~v~~y~~~~g~~~~lQ~i~tlP~dF~g~~~~aaIhis~dGrFLYasNRg~dsI~~f~V~~~~g  278 (346)
T COG2706         202 GKYAYLVNE---LNSTVDVLEYNPAVGKFEELQTIDTLPEDFTGTNWAAAIHISPDGRFLYASNRGHDSIAVFSVDPDGG  278 (346)
T ss_pred             CcEEEEEec---cCCEEEEEEEcCCCceEEEeeeeccCccccCCCCceeEEEECCCCCEEEEecCCCCeEEEEEEcCCCC
Confidence            4 5688842   23344788888888888877542  222222    222  233555443433222366555  77777


Q ss_pred             ceEEccCCCC-CcCCCCceEEEeCCEEEEEcCC-ceEEEEEcCCcc
Q 045821          281 EWSPVGRLST-LLTRPPCKLVAIGKTIFVIGKG-CSAVVIDVGNIG  324 (352)
Q Consensus       281 ~W~~~~~~~~-~~~~~~~~~~~~~~~i~v~gG~-~~~~~~d~~~~~  324 (352)
                      .-..+..-+. ...-+.|.+..-++.|++.+-. +.+.+|..++.+
T Consensus       279 ~L~~~~~~~teg~~PR~F~i~~~g~~Liaa~q~sd~i~vf~~d~~T  324 (346)
T COG2706         279 KLELVGITPTEGQFPRDFNINPSGRFLIAANQKSDNITVFERDKET  324 (346)
T ss_pred             EEEEEEEeccCCcCCccceeCCCCCEEEEEccCCCcEEEEEEcCCC
Confidence            6555543221 1122345555556666666643 345567766655


No 133
>KOG0316 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=92.56  E-value=5.6  Score=33.38  Aligned_cols=189  Identities=13%  Similarity=0.093  Sum_probs=103.5

Q ss_pred             CCEEEEEEecCCCCceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECC--EEEEEeCCCCCCCCCCeEEEEECCC
Q 045821           68 ETWIYAFCRDNKLERVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGK--NAYLLGGCGWSEDATSEVYCYDASM  145 (352)
Q Consensus        68 ~~~l~~~gg~~~~~~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~--~iyv~GG~~~~~~~~~~~~~~d~~t  145 (352)
                      ++.-++.||.    ...++++|-  ++++-.+.  +.   ........+..++  .+.+-|+.      ..++..||..+
T Consensus        70 Dnskf~s~Gg----Dk~v~vwDV--~TGkv~Rr--~r---gH~aqVNtV~fNeesSVv~Sgsf------D~s~r~wDCRS  132 (307)
T KOG0316|consen   70 DNSKFASCGG----DKAVQVWDV--NTGKVDRR--FR---GHLAQVNTVRFNEESSVVASGSF------DSSVRLWDCRS  132 (307)
T ss_pred             cccccccCCC----CceEEEEEc--ccCeeeee--cc---cccceeeEEEecCcceEEEeccc------cceeEEEEccc
Confidence            4444555543    234666688  77653222  11   1122333444543  35544543      45688899999


Q ss_pred             CeeEecCCCCCcceeeeeEEECCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECCEEEEEccCCCCCC
Q 045821          146 NTWTDAAPMSTARCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATS  225 (352)
Q Consensus       146 ~~W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~  225 (352)
                      ++.+.++.+..++-.-.++.+.+..++.|..+  ..+-.||+..++-  ..+-...+-.+....-+++-.++|..++   
T Consensus       133 ~s~ePiQildea~D~V~Si~v~~heIvaGS~D--GtvRtydiR~G~l--~sDy~g~pit~vs~s~d~nc~La~~l~s---  205 (307)
T KOG0316|consen  133 RSFEPIQILDEAKDGVSSIDVAEHEIVAGSVD--GTVRTYDIRKGTL--SSDYFGHPITSVSFSKDGNCSLASSLDS---  205 (307)
T ss_pred             CCCCccchhhhhcCceeEEEecccEEEeeccC--CcEEEEEeeccee--ehhhcCCcceeEEecCCCCEEEEeeccc---
Confidence            99888888888888777888888888877766  6788899987642  2221112222223334555555543332   


Q ss_pred             ceEEEEEeCCCCceeecCCCCCCCCCC-cEEEECCEEEEEecCCCCeEEEEeCCCCce
Q 045821          226 HVCALVYEPSTDSWLHADANMASGWRG-PAVVVDDALYVLDQSSGTKLMMWQKESREW  282 (352)
Q Consensus       226 ~~~i~~yd~~~~~W~~~~~~~~~~~~~-~~~~~~~~l~~~gg~~~~~v~~yd~~~~~W  282 (352)
                        .+...|..+++--..-.......+. -.+..+..-.+++|.....++.||....+-
T Consensus       206 --tlrLlDk~tGklL~sYkGhkn~eykldc~l~qsdthV~sgSEDG~Vy~wdLvd~~~  261 (307)
T KOG0316|consen  206 --TLRLLDKETGKLLKSYKGHKNMEYKLDCCLNQSDTHVFSGSEDGKVYFWDLVDETQ  261 (307)
T ss_pred             --eeeecccchhHHHHHhcccccceeeeeeeecccceeEEeccCCceEEEEEecccee
Confidence              3444455554432211101111111 123344455667776555899999987654


No 134
>PLN00033 photosystem II stability/assembly factor; Provisional
Probab=92.48  E-value=8.8  Score=35.47  Aligned_cols=199  Identities=13%  Similarity=0.188  Sum_probs=98.5

Q ss_pred             CCCCeeEeccCCCcccCCcceEEEEEC-CEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCC----Cc----------
Q 045821           93 TRRSWKMIHELPARTLRRKGMGFEVLG-KNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMS----TA----------  157 (352)
Q Consensus        93 ~~~~w~~~~~~~~~~~~~~~~~~~~~~-~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~----~~----------  157 (352)
                      ...+|+.++..+..  +.....+..++ +.+++.|..       ..+++-+-.-.+|+.+...+    ..          
T Consensus       164 gG~tW~~~~~~~~~--p~~~~~i~~~~~~~~~ivg~~-------G~v~~S~D~G~tW~~~~~~t~~~~l~~~~~s~~~g~  234 (398)
T PLN00033        164 GGETWERIPLSPKL--PGEPVLIKATGPKSAEMVTDE-------GAIYVTSNAGRNWKAAVEETVSATLNRTVSSGISGA  234 (398)
T ss_pred             CCCCceECccccCC--CCCceEEEEECCCceEEEecc-------ceEEEECCCCCCceEccccccccccccccccccccc
Confidence            57899888653211  11223334443 457777631       23444443445798762111    00          


Q ss_pred             ----ceeeeeEEE-CCEEEEEeCCCCCCceEE-EeCCCCcEEEecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEE
Q 045821          158 ----RCYFPCGVL-NQKIYCIGGLGDTHSWDV-YDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALV  231 (352)
Q Consensus       158 ----r~~~~~~~~-~~~iyv~gG~~~~~~v~~-yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~  231 (352)
                          -..+.+... ++.++++|-..   .+++ .|.....|+.+...............++.++++|...      .+..
T Consensus       235 ~~y~Gsf~~v~~~~dG~~~~vg~~G---~~~~s~d~G~~~W~~~~~~~~~~l~~v~~~~dg~l~l~g~~G------~l~~  305 (398)
T PLN00033        235 SYYTGTFSTVNRSPDGDYVAVSSRG---NFYLTWEPGQPYWQPHNRASARRIQNMGWRADGGLWLLTRGG------GLYV  305 (398)
T ss_pred             ceeccceeeEEEcCCCCEEEEECCc---cEEEecCCCCcceEEecCCCccceeeeeEcCCCCEEEEeCCc------eEEE
Confidence                011122222 45566665432   2222 3333345998875444443333445688899885332      2333


Q ss_pred             EeCCCC-----ceeecCCCCCCC-CCCcEEEECCEEEEEecCCCCeEEEEeCCCCceEEccCCCCCcCCCCceEEE-eCC
Q 045821          232 YEPSTD-----SWLHADANMASG-WRGPAVVVDDALYVLDQSSGTKLMMWQKESREWSPVGRLSTLLTRPPCKLVA-IGK  304 (352)
Q Consensus       232 yd~~~~-----~W~~~~~~~~~~-~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~-~~~  304 (352)
                      -+....     +|.+++...... ........++.+++.|...  .+.+-....++|+.....+. .....+.+.. .++
T Consensus       306 S~d~G~~~~~~~f~~~~~~~~~~~l~~v~~~~d~~~~a~G~~G--~v~~s~D~G~tW~~~~~~~~-~~~~ly~v~f~~~~  382 (398)
T PLN00033        306 SKGTGLTEEDFDFEEADIKSRGFGILDVGYRSKKEAWAAGGSG--ILLRSTDGGKSWKRDKGADN-IAANLYSVKFFDDK  382 (398)
T ss_pred             ecCCCCcccccceeecccCCCCcceEEEEEcCCCcEEEEECCC--cEEEeCCCCcceeEccccCC-CCcceeEEEEcCCC
Confidence            333334     455543321111 1111223466788777643  56777778899999763221 1223345553 457


Q ss_pred             EEEEEcCC
Q 045821          305 TIFVIGKG  312 (352)
Q Consensus       305 ~i~v~gG~  312 (352)
                      +.|+.|-.
T Consensus       383 ~g~~~G~~  390 (398)
T PLN00033        383 KGFVLGND  390 (398)
T ss_pred             ceEEEeCC
Confidence            89998844


No 135
>KOG2997 consensus F-box protein FBX9 [General function prediction only]
Probab=92.33  E-value=0.058  Score=46.68  Aligned_cols=43  Identities=23%  Similarity=0.216  Sum_probs=37.6

Q ss_pred             cCCCcHHHHHHhhccCcc-----ccchhhhhhhhhhcccccChhhHHH
Q 045821           18 ISGLPDDIALFCLARVPR-----RYHAVLKCVSSRWRGLLHSEEWCAY   60 (352)
Q Consensus        18 ~~~lp~dl~~~il~r~p~-----~~~~~l~~v~k~W~~l~~s~~~~~~   60 (352)
                      +..||+|++.+|+.++=.     +++.++.+||+.|.-....|++.+.
T Consensus       107 ~~~LPdEvLm~I~~~vv~~~~d~rsL~~~s~vCr~F~~~~R~~~lwR~  154 (366)
T KOG2997|consen  107 ISVLPDEVLMRIFRWVVSSLLDLRSLEQLSLVCRGFYKCARDPELWRL  154 (366)
T ss_pred             hhhCCHHHHHHHHHHHHhhhcchhhHHHhHhhHHHHHHHHcChHHHHH
Confidence            468999999999998775     7899999999999999888887654


No 136
>PF07734 FBA_1:  F-box associated;  InterPro: IPR006527 This domain occurs in a diverse superfamily of genes in plants. Most examples are found C-terminal to an F-box (IPR001810 from INTERPRO), a 60 amino acid motif involved in ubiquitination of target proteins to mark them for degradation. Two-hybid experiments support the idea that most members are interchangeable F-box subunits of SCF E3 complexes []. Some members have two copies of this domain.
Probab=92.06  E-value=5.1  Score=31.82  Aligned_cols=80  Identities=15%  Similarity=0.196  Sum_probs=49.3

Q ss_pred             EEECCEEEEEccCCCCCCceEEEEEeCCCCce-eecCCCCCCCCCCcE----EEECCEEEEEecCC-C--CeEEEEe---
Q 045821          208 FVMDGKIYIRCSASAATSHVCALVYEPSTDSW-LHADANMASGWRGPA----VVVDDALYVLDQSS-G--TKLMMWQ---  276 (352)
Q Consensus       208 ~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W-~~~~~~~~~~~~~~~----~~~~~~l~~~gg~~-~--~~v~~yd---  276 (352)
                      +.++|.+|-++..........|.+||..+.+. ..++.|.........    ++.++.|.++-... .  -+||+-+   
T Consensus         2 V~vnG~~hW~~~~~~~~~~~~IlsFDl~~E~F~~~~~lP~~~~~~~~~~~L~~v~~~~L~~~~~~~~~~~~~IWvm~~~~   81 (164)
T PF07734_consen    2 VFVNGALHWLAYDENNDEKDFILSFDLSTEKFGRSLPLPFCNDDDDDSVSLSVVRGDCLCVLYQCDETSKIEIWVMKKYG   81 (164)
T ss_pred             EEECCEEEeeEEecCCCCceEEEEEeccccccCCEECCCCccCccCCEEEEEEecCCEEEEEEeccCCccEEEEEEeeec
Confidence            57899999986554433333799999999999 666554333311121    23477888774322 1  3566554   


Q ss_pred             CCCCceEEccC
Q 045821          277 KESREWSPVGR  287 (352)
Q Consensus       277 ~~~~~W~~~~~  287 (352)
                      ....+|+++-.
T Consensus        82 ~~~~SWtK~~~   92 (164)
T PF07734_consen   82 YGKESWTKLFT   92 (164)
T ss_pred             cCcceEEEEEE
Confidence            23678998853


No 137
>PLN03215 ascorbic acid mannose pathway regulator 1; Provisional
Probab=91.97  E-value=8.3  Score=35.11  Aligned_cols=132  Identities=8%  Similarity=0.080  Sum_probs=73.9

Q ss_pred             CCeeEecCCCCCcceeeeeEEECCEEEEEeCCCCCCceEEEeCCCCcEEEecC-----CCCc--ccceeEEEECCEEEEE
Q 045821          145 MNTWTDAAPMSTARCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTE-----PNIF--TEIEDSFVMDGKIYIR  217 (352)
Q Consensus       145 t~~W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~-----~~~~--~~~~~~~~~~g~iyv~  217 (352)
                      -++|+.+..  .....-.++.++|++|++.-.   ..++.++.+-. =.++.+     +...  ......+...|.|+++
T Consensus       189 ~~~Wt~l~~--~~~~~~DIi~~kGkfYAvD~~---G~l~~i~~~l~-i~~v~~~i~~~~~~g~~~~~~yLVEs~GdLLmV  262 (373)
T PLN03215        189 GNVLKALKQ--MGYHFSDIIVHKGQTYALDSI---GIVYWINSDLE-FSRFGTSLDENITDGCWTGDRRFVECCGELYIV  262 (373)
T ss_pred             CCeeeEccC--CCceeeEEEEECCEEEEEcCC---CeEEEEecCCc-eeeecceecccccCCcccCceeEEEECCEEEEE
Confidence            389999974  223355678889999998432   34666664321 112221     1001  1223456677889988


Q ss_pred             ccCCCCC--------------CceEEEEEeCCCCceeecCCCCCCCC-----CCcEE-------EECCEEEEEecCCCCe
Q 045821          218 CSASAAT--------------SHVCALVYEPSTDSWLHADANMASGW-----RGPAV-------VVDDALYVLDQSSGTK  271 (352)
Q Consensus       218 GG~~~~~--------------~~~~i~~yd~~~~~W~~~~~~~~~~~-----~~~~~-------~~~~~l~~~gg~~~~~  271 (352)
                      .......              ....++..|....+|.++........     ....+       ..+|.||++...   .
T Consensus       263 ~R~~~~~~~~~~~~~~~~~~t~~f~VfklD~~~~~WveV~sLgd~aLFlG~~~s~sv~a~e~pG~k~NcIYFtdd~---~  339 (373)
T PLN03215        263 ERLPKESTWKRKADGFEYSRTVGFKVYKFDDELAKWMEVKTLGDNAFVMATDTCFSVLAHEFYGCLPNSIYFTEDT---M  339 (373)
T ss_pred             EEEccCcccccccccccccceeEEEEEEEcCCCCcEEEecccCCeEEEEECCccEEEecCCCCCccCCEEEEECCC---c
Confidence            6532110              11266677888999999876322110     00001       125677877554   5


Q ss_pred             EEEEeCCCCceEEc
Q 045821          272 LMMWQKESREWSPV  285 (352)
Q Consensus       272 v~~yd~~~~~W~~~  285 (352)
                      ..+||++.++-+.+
T Consensus       340 ~~v~~~~dg~~~~~  353 (373)
T PLN03215        340 PKVFKLDNGNGSSI  353 (373)
T ss_pred             ceEEECCCCCccce
Confidence            67888887764444


No 138
>PF14870 PSII_BNR:  Photosynthesis system II assembly factor YCF48; PDB: 2XBG_A.
Probab=91.70  E-value=9  Score=33.90  Aligned_cols=217  Identities=12%  Similarity=0.221  Sum_probs=97.3

Q ss_pred             CCEEEEEEecCCCCceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEE-CCEEEEEeCCCCCCCCCCeEEEEECCCC
Q 045821           68 ETWIYAFCRDNKLERVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVL-GKNAYLLGGCGWSEDATSEVYCYDASMN  146 (352)
Q Consensus        68 ~~~l~~~gg~~~~~~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~-~~~iyv~GG~~~~~~~~~~~~~~d~~t~  146 (352)
                      ++..+++|..    .. ++.-.-  ...+|++++....  .+-..+.+..+ ++.+.+++..       ..+++=.=.-.
T Consensus        71 ~~~g~ivG~~----g~-ll~T~D--gG~tW~~v~l~~~--lpgs~~~i~~l~~~~~~l~~~~-------G~iy~T~DgG~  134 (302)
T PF14870_consen   71 GNEGWIVGEP----GL-LLHTTD--GGKTWERVPLSSK--LPGSPFGITALGDGSAELAGDR-------GAIYRTTDGGK  134 (302)
T ss_dssp             TTEEEEEEET----TE-EEEESS--TTSS-EE----TT---SS-EEEEEEEETTEEEEEETT---------EEEESSTTS
T ss_pred             CCceEEEcCC----ce-EEEecC--CCCCcEEeecCCC--CCCCeeEEEEcCCCcEEEEcCC-------CcEEEeCCCCC
Confidence            5567776532    12 222222  6889999863211  12223344444 5667777642       33444333345


Q ss_pred             eeEecCCCCCcceeeeeEEE-CCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECCEEEEEccCCCCCC
Q 045821          147 TWTDAAPMSTARCYFPCGVL-NQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATS  225 (352)
Q Consensus       147 ~W~~~~~~~~~r~~~~~~~~-~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~  225 (352)
                      +|+.+..-... .-..+... ++++.+++...  +-....|+....|+........+-......-++.|+++. ..+   
T Consensus       135 tW~~~~~~~~g-s~~~~~r~~dG~~vavs~~G--~~~~s~~~G~~~w~~~~r~~~~riq~~gf~~~~~lw~~~-~Gg---  207 (302)
T PF14870_consen  135 TWQAVVSETSG-SINDITRSSDGRYVAVSSRG--NFYSSWDPGQTTWQPHNRNSSRRIQSMGFSPDGNLWMLA-RGG---  207 (302)
T ss_dssp             SEEEEE-S-----EEEEEE-TTS-EEEEETTS--SEEEEE-TT-SS-EEEE--SSS-EEEEEE-TTS-EEEEE-TTT---
T ss_pred             CeeEcccCCcc-eeEeEEECCCCcEEEEECcc--cEEEEecCCCccceEEccCccceehhceecCCCCEEEEe-CCc---
Confidence            79887543222 22222333 56766666443  334466888888998875434333334444578888874 221   


Q ss_pred             ceEEEEEe--CCCCceeecCCCCCCCCCC-cEE--EECCEEEEEecCCCCeEEEEeCCCCceEEccCCCCCcCCCCceEE
Q 045821          226 HVCALVYE--PSTDSWLHADANMASGWRG-PAV--VVDDALYVLDQSSGTKLMMWQKESREWSPVGRLSTLLTRPPCKLV  300 (352)
Q Consensus       226 ~~~i~~yd--~~~~~W~~~~~~~~~~~~~-~~~--~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~  300 (352)
                        .+..-+  ....+|++-..+......+ ..+  .-++.+++.||..  .+++=.-..++|++....... +...+.++
T Consensus       208 --~~~~s~~~~~~~~w~~~~~~~~~~~~~~ld~a~~~~~~~wa~gg~G--~l~~S~DgGktW~~~~~~~~~-~~n~~~i~  282 (302)
T PF14870_consen  208 --QIQFSDDPDDGETWSEPIIPIKTNGYGILDLAYRPPNEIWAVGGSG--TLLVSTDGGKTWQKDRVGENV-PSNLYRIV  282 (302)
T ss_dssp             --EEEEEE-TTEEEEE---B-TTSS--S-EEEEEESSSS-EEEEESTT---EEEESSTTSS-EE-GGGTTS-SS---EEE
T ss_pred             --EEEEccCCCCccccccccCCcccCceeeEEEEecCCCCEEEEeCCc--cEEEeCCCCccceECccccCC-CCceEEEE
Confidence              233333  3456788733323222222 222  2368899988864  577777778999998754321 23334444


Q ss_pred             E-eCCEEEEEcCC
Q 045821          301 A-IGKTIFVIGKG  312 (352)
Q Consensus       301 ~-~~~~i~v~gG~  312 (352)
                      . ..++-+++|..
T Consensus       283 f~~~~~gf~lG~~  295 (302)
T PF14870_consen  283 FVNPDKGFVLGQD  295 (302)
T ss_dssp             EEETTEEEEE-ST
T ss_pred             EcCCCceEEECCC
Confidence            4 45799999854


No 139
>PTZ00421 coronin; Provisional
Probab=91.69  E-value=13  Score=35.59  Aligned_cols=152  Identities=10%  Similarity=0.036  Sum_probs=71.8

Q ss_pred             CEEEEEeCCCCCCCCCCeEEEEECCCCeeE-ecCCCCCcceeeeeEEECCEEEEEeCCCCCCceEEEeCCCCcEE-EecC
Q 045821          120 KNAYLLGGCGWSEDATSEVYCYDASMNTWT-DAAPMSTARCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTNNWK-LHTE  197 (352)
Q Consensus       120 ~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~-~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~W~-~~~~  197 (352)
                      +.+++.||.+      ..+.+||..+++-. .+.....+ .......-++.+++.|+.+  ..+.+||+.+..-. .+..
T Consensus       138 ~~iLaSgs~D------gtVrIWDl~tg~~~~~l~~h~~~-V~sla~spdG~lLatgs~D--g~IrIwD~rsg~~v~tl~~  208 (493)
T PTZ00421        138 MNVLASAGAD------MVVNVWDVERGKAVEVIKCHSDQ-ITSLEWNLDGSLLCTTSKD--KKLNIIDPRDGTIVSSVEA  208 (493)
T ss_pred             CCEEEEEeCC------CEEEEEECCCCeEEEEEcCCCCc-eEEEEEECCCCEEEEecCC--CEEEEEECCCCcEEEEEec
Confidence            3466666543      45788898876532 22211111 1111112267777777765  67889999876521 1111


Q ss_pred             CCCcccceeEEEEC-CEEEEEccCCCCCCceEEEEEeCCCCc--eeecCCCCCCCCCCcE-E--EECCEEEEEecCCCCe
Q 045821          198 PNIFTEIEDSFVMD-GKIYIRCSASAATSHVCALVYEPSTDS--WLHADANMASGWRGPA-V--VVDDALYVLDQSSGTK  271 (352)
Q Consensus       198 ~~~~~~~~~~~~~~-g~iyv~GG~~~~~~~~~i~~yd~~~~~--W~~~~~~~~~~~~~~~-~--~~~~~l~~~gg~~~~~  271 (352)
                      .............+ +.|... |... .....+..||..+..  ......  . ...... .  ..++.++++||.....
T Consensus       209 H~~~~~~~~~w~~~~~~ivt~-G~s~-s~Dr~VklWDlr~~~~p~~~~~~--d-~~~~~~~~~~d~d~~~L~lggkgDg~  283 (493)
T PTZ00421        209 HASAKSQRCLWAKRKDLIITL-GCSK-SQQRQIMLWDTRKMASPYSTVDL--D-QSSALFIPFFDEDTNLLYIGSKGEGN  283 (493)
T ss_pred             CCCCcceEEEEcCCCCeEEEE-ecCC-CCCCeEEEEeCCCCCCceeEecc--C-CCCceEEEEEcCCCCEEEEEEeCCCe
Confidence            11111111111223 344444 4321 112257777875432  111110  0 001111 1  2256666666643348


Q ss_pred             EEEEeCCCCceEEc
Q 045821          272 LMMWQKESREWSPV  285 (352)
Q Consensus       272 v~~yd~~~~~W~~~  285 (352)
                      |..||..++.....
T Consensus       284 Iriwdl~~~~~~~~  297 (493)
T PTZ00421        284 IRCFELMNERLTFC  297 (493)
T ss_pred             EEEEEeeCCceEEE
Confidence            99999988776544


No 140
>KOG0296 consensus Angio-associated migratory cell protein (contains WD40 repeats) [Function unknown]
Probab=91.40  E-value=10  Score=33.88  Aligned_cols=141  Identities=17%  Similarity=0.191  Sum_probs=78.4

Q ss_pred             CCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccc--eeEEEECCEEEEEccCCCCCCceEEEEEeC--CCCceeec
Q 045821          167 NQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEI--EDSFVMDGKIYIRCSASAATSHVCALVYEP--STDSWLHA  242 (352)
Q Consensus       167 ~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~--~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~--~~~~W~~~  242 (352)
                      +..+.+.||.+  ...+.++..+..|---  ++.....  +..+.++|.+..-|+..+     .+.++..  ...+|...
T Consensus        75 ~~~l~aTGGgD--D~AflW~~~~ge~~~e--ltgHKDSVt~~~FshdgtlLATGdmsG-----~v~v~~~stg~~~~~~~  145 (399)
T KOG0296|consen   75 NNNLVATGGGD--DLAFLWDISTGEFAGE--LTGHKDSVTCCSFSHDGTLLATGDMSG-----KVLVFKVSTGGEQWKLD  145 (399)
T ss_pred             CCceEEecCCC--ceEEEEEccCCcceeE--ecCCCCceEEEEEccCceEEEecCCCc-----cEEEEEcccCceEEEee
Confidence            56777778866  6778899999886432  2222222  244667899888887664     3555544  44567654


Q ss_pred             CCCC---CCCCCCcEEEECCEEEEEecCCCCeEEEEeCCCCceEEccCCCCCcCCCCceEEEeCCEEEEEcCCce-EEEE
Q 045821          243 DANM---ASGWRGPAVVVDDALYVLDQSSGTKLMMWQKESREWSPVGRLSTLLTRPPCKLVAIGKTIFVIGKGCS-AVVI  318 (352)
Q Consensus       243 ~~~~---~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~~~i~v~gG~~~-~~~~  318 (352)
                      ...-   --.||.     ...+++.|..++ .+|.|...++.-.++-  +.......++-+..+||-.+.|-.+. +.++
T Consensus       146 ~e~~dieWl~WHp-----~a~illAG~~DG-svWmw~ip~~~~~kv~--~Gh~~~ct~G~f~pdGKr~~tgy~dgti~~W  217 (399)
T KOG0296|consen  146 QEVEDIEWLKWHP-----RAHILLAGSTDG-SVWMWQIPSQALCKVM--SGHNSPCTCGEFIPDGKRILTGYDDGTIIVW  217 (399)
T ss_pred             cccCceEEEEecc-----cccEEEeecCCC-cEEEEECCCcceeeEe--cCCCCCcccccccCCCceEEEEecCceEEEE
Confidence            2110   011111     345677777666 8999988875433332  11112333344444566555554443 3346


Q ss_pred             EcCCcc
Q 045821          319 DVGNIG  324 (352)
Q Consensus       319 d~~~~~  324 (352)
                      ++++-+
T Consensus       218 n~ktg~  223 (399)
T KOG0296|consen  218 NPKTGQ  223 (399)
T ss_pred             ecCCCc
Confidence            666543


No 141
>COG4946 Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system [Function unknown]
Probab=91.29  E-value=12  Score=34.69  Aligned_cols=164  Identities=13%  Similarity=0.098  Sum_probs=85.6

Q ss_pred             hhhHHHHhhCCCC-CCEEEEEEecCCCCceeEEEecCCCCCCCeeEecc-CCCc-ccCCcceE--------EEEECC-EE
Q 045821           55 EEWCAYRRKHNLD-ETWIYAFCRDNKLERVCCYVLDPNSTRRSWKMIHE-LPAR-TLRRKGMG--------FEVLGK-NA  122 (352)
Q Consensus        55 ~~~~~~~~~~~~~-~~~l~~~gg~~~~~~~~~~~~d~~~~~~~w~~~~~-~~~~-~~~~~~~~--------~~~~~~-~i  122 (352)
                      ..|...-.++... +.+|...-|.      .+|.|||  .+.+.+++.- +|.. ......+.        .+..+| .|
T Consensus       264 TnFtdYY~R~~nsDGkrIvFq~~G------dIylydP--~td~lekldI~lpl~rk~k~~k~~~pskyledfa~~~Gd~i  335 (668)
T COG4946         264 TNFTDYYPRNANSDGKRIVFQNAG------DIYLYDP--ETDSLEKLDIGLPLDRKKKQPKFVNPSKYLEDFAVVNGDYI  335 (668)
T ss_pred             CCchhccccccCCCCcEEEEecCC------cEEEeCC--CcCcceeeecCCccccccccccccCHHHhhhhhccCCCcEE
Confidence            3444444455544 4444443222      6889999  8888877752 2311 00000000        122333 33


Q ss_pred             EEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcceeeeeEEECCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcc
Q 045821          123 YLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFT  202 (352)
Q Consensus       123 yv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~  202 (352)
                      -++.        ...++++++..+--..++.-..-|+.+  ...++.-.++|-.+. ..+.+||..+..-+++.. +...
T Consensus       336 a~VS--------RGkaFi~~~~~~~~iqv~~~~~VrY~r--~~~~~e~~vigt~dg-D~l~iyd~~~~e~kr~e~-~lg~  403 (668)
T COG4946         336 ALVS--------RGKAFIMRPWDGYSIQVGKKGGVRYRR--IQVDPEGDVIGTNDG-DKLGIYDKDGGEVKRIEK-DLGN  403 (668)
T ss_pred             EEEe--------cCcEEEECCCCCeeEEcCCCCceEEEE--EccCCcceEEeccCC-ceEEEEecCCceEEEeeC-Cccc
Confidence            3321        234566666554333343322223333  223445667776554 688999999998777652 3333


Q ss_pred             cceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCceeecC
Q 045821          203 EIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWLHAD  243 (352)
Q Consensus       203 ~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~  243 (352)
                      -....+.-+|+..+++     +....+|.+|..++.=+.+.
T Consensus       404 I~av~vs~dGK~~vva-----Ndr~el~vididngnv~~id  439 (668)
T COG4946         404 IEAVKVSPDGKKVVVA-----NDRFELWVIDIDNGNVRLID  439 (668)
T ss_pred             eEEEEEcCCCcEEEEE-----cCceEEEEEEecCCCeeEec
Confidence            3333344466755553     23346888898888777664


No 142
>KOG0315 consensus G-protein beta subunit-like protein (contains WD40 repeats) [General function prediction only]
Probab=91.21  E-value=8.4  Score=32.63  Aligned_cols=165  Identities=12%  Similarity=0.107  Sum_probs=78.0

Q ss_pred             CCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeE--EEECCEEEEEccCCCCCCceEEEEEeCCCCceeecCC
Q 045821          167 NQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDS--FVMDGKIYIRCSASAATSHVCALVYEPSTDSWLHADA  244 (352)
Q Consensus       167 ~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~--~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~  244 (352)
                      +++..+.+|.   ..|-.||..++.=..+...-.......+  +-.+|+-...||.++     .+-++|...-.-++.  
T Consensus        51 dk~~LAaa~~---qhvRlyD~~S~np~Pv~t~e~h~kNVtaVgF~~dgrWMyTgseDg-----t~kIWdlR~~~~qR~--  120 (311)
T KOG0315|consen   51 DKKDLAAAGN---QHVRLYDLNSNNPNPVATFEGHTKNVTAVGFQCDGRWMYTGSEDG-----TVKIWDLRSLSCQRN--  120 (311)
T ss_pred             CcchhhhccC---CeeEEEEccCCCCCceeEEeccCCceEEEEEeecCeEEEecCCCc-----eEEEEeccCcccchh--
Confidence            4444555554   5688999998742111111111111222  234788777765543     344555544222111  


Q ss_pred             CCCCCC-CCcEEEECCEEEEEecCCCCeEEEEeCCCCceEEccCCCCCcCCCCceEE-EeCCEEEEE-cCCceEEEEEcC
Q 045821          245 NMASGW-RGPAVVVDDALYVLDQSSGTKLMMWQKESREWSPVGRLSTLLTRPPCKLV-AIGKTIFVI-GKGCSAVVIDVG  321 (352)
Q Consensus       245 ~~~~~~-~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~-~~~~~i~v~-gG~~~~~~~d~~  321 (352)
                       ..... ....+..-++--++.|..+..|+++|..++..+...-....  ....++. ..+++.++. ...-++++.+.-
T Consensus       121 -~~~~spVn~vvlhpnQteLis~dqsg~irvWDl~~~~c~~~liPe~~--~~i~sl~v~~dgsml~a~nnkG~cyvW~l~  197 (311)
T KOG0315|consen  121 -YQHNSPVNTVVLHPNQTELISGDQSGNIRVWDLGENSCTHELIPEDD--TSIQSLTVMPDGSMLAAANNKGNCYVWRLL  197 (311)
T ss_pred             -ccCCCCcceEEecCCcceEEeecCCCcEEEEEccCCccccccCCCCC--cceeeEEEcCCCcEEEEecCCccEEEEEcc
Confidence             11110 01112222333333333333899999999988765422221  2222233 334544433 323357777776


Q ss_pred             CcccccceEEeccCCCCCCCC-cceee
Q 045821          322 NIGNIGGIMVSSSIPKLNDND-DIISC  347 (352)
Q Consensus       322 ~~~w~~~~~~~~~~~~~~~~~-~~~~~  347 (352)
                      .   ....+...-+.++.++. +|-.|
T Consensus       198 ~---~~~~s~l~P~~k~~ah~~~il~C  221 (311)
T KOG0315|consen  198 N---HQTASELEPVHKFQAHNGHILRC  221 (311)
T ss_pred             C---CCccccceEhhheecccceEEEE
Confidence            6   23344455566677666 44443


No 143
>PF07734 FBA_1:  F-box associated;  InterPro: IPR006527 This domain occurs in a diverse superfamily of genes in plants. Most examples are found C-terminal to an F-box (IPR001810 from INTERPRO), a 60 amino acid motif involved in ubiquitination of target proteins to mark them for degradation. Two-hybid experiments support the idea that most members are interchangeable F-box subunits of SCF E3 complexes []. Some members have two copies of this domain.
Probab=91.17  E-value=6.5  Score=31.24  Aligned_cols=78  Identities=13%  Similarity=0.108  Sum_probs=48.8

Q ss_pred             EEECCEEEEEeCCCCC-C--ceEEEeCCCCcE-EEecCCCCcccce---eE-EEECCEEEEEccCCCCCCceEEEEEe--
Q 045821          164 GVLNQKIYCIGGLGDT-H--SWDVYDPRTNNW-KLHTEPNIFTEIE---DS-FVMDGKIYIRCSASAATSHVCALVYE--  233 (352)
Q Consensus       164 ~~~~~~iyv~gG~~~~-~--~v~~yd~~t~~W-~~~~~~~~~~~~~---~~-~~~~g~iyv~GG~~~~~~~~~i~~yd--  233 (352)
                      +.++|.+|.++..... .  .+..||..+++. +.++.|.......   .. ++.++.|-++-- ........||+-+  
T Consensus         2 V~vnG~~hW~~~~~~~~~~~~IlsFDl~~E~F~~~~~lP~~~~~~~~~~~L~~v~~~~L~~~~~-~~~~~~~~IWvm~~~   80 (164)
T PF07734_consen    2 VFVNGALHWLAYDENNDEKDFILSFDLSTEKFGRSLPLPFCNDDDDDSVSLSVVRGDCLCVLYQ-CDETSKIEIWVMKKY   80 (164)
T ss_pred             EEECCEEEeeEEecCCCCceEEEEEeccccccCCEECCCCccCccCCEEEEEEecCCEEEEEEe-ccCCccEEEEEEeee
Confidence            5679999998876554 2  689999999999 6665433333111   22 333778888732 2222234677655  


Q ss_pred             -CCCCceeec
Q 045821          234 -PSTDSWLHA  242 (352)
Q Consensus       234 -~~~~~W~~~  242 (352)
                       ....+|+++
T Consensus        81 ~~~~~SWtK~   90 (164)
T PF07734_consen   81 GYGKESWTKL   90 (164)
T ss_pred             ccCcceEEEE
Confidence             247899876


No 144
>PRK02889 tolB translocation protein TolB; Provisional
Probab=91.14  E-value=13  Score=34.73  Aligned_cols=183  Identities=11%  Similarity=0.019  Sum_probs=90.5

Q ss_pred             CceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECC-EEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcce
Q 045821           81 ERVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGK-NAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARC  159 (352)
Q Consensus        81 ~~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~-~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~  159 (352)
                      +...++.+|.  .++.-..+...+.    .........++ .|++....+    ...++|.+|..++..+.+..-.....
T Consensus       218 ~~~~I~~~dl--~~g~~~~l~~~~g----~~~~~~~SPDG~~la~~~~~~----g~~~Iy~~d~~~~~~~~lt~~~~~~~  287 (427)
T PRK02889        218 KKPVVYVHDL--ATGRRRVVANFKG----SNSAPAWSPDGRTLAVALSRD----GNSQIYTVNADGSGLRRLTQSSGIDT  287 (427)
T ss_pred             CCcEEEEEEC--CCCCEEEeecCCC----CccceEECCCCCEEEEEEccC----CCceEEEEECCCCCcEECCCCCCCCc
Confidence            3456888888  7776555554431    11112222244 455443322    24688999988777666643221111


Q ss_pred             eeeeEEECC-EEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCc
Q 045821          160 YFPCGVLNQ-KIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDS  238 (352)
Q Consensus       160 ~~~~~~~~~-~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~  238 (352)
                      .. ...-++ .|+..+.......++.+|..++..+.+... ..........-+|+..++.....  ....++.+|..+++
T Consensus       288 ~~-~wSpDG~~l~f~s~~~g~~~Iy~~~~~~g~~~~lt~~-g~~~~~~~~SpDG~~Ia~~s~~~--g~~~I~v~d~~~g~  363 (427)
T PRK02889        288 EP-FFSPDGRSIYFTSDRGGAPQIYRMPASGGAAQRVTFT-GSYNTSPRISPDGKLLAYISRVG--GAFKLYVQDLATGQ  363 (427)
T ss_pred             Ce-EEcCCCCEEEEEecCCCCcEEEEEECCCCceEEEecC-CCCcCceEECCCCCEEEEEEccC--CcEEEEEEECCCCC
Confidence            11 111244 455443322226788888887777776521 11111122333554333323221  22368889998888


Q ss_pred             eeecCCCCCCCCCCcEEEECCEEEEEec-CCC-CeEEEEeCCC
Q 045821          239 WLHADANMASGWRGPAVVVDDALYVLDQ-SSG-TKLMMWQKES  279 (352)
Q Consensus       239 W~~~~~~~~~~~~~~~~~~~~~l~~~gg-~~~-~~v~~yd~~~  279 (352)
                      .+.+....  ........-+++.+++.. ..+ ..++.++...
T Consensus       364 ~~~lt~~~--~~~~p~~spdg~~l~~~~~~~g~~~l~~~~~~g  404 (427)
T PRK02889        364 VTALTDTT--RDESPSFAPNGRYILYATQQGGRSVLAAVSSDG  404 (427)
T ss_pred             eEEccCCC--CccCceECCCCCEEEEEEecCCCEEEEEEECCC
Confidence            77764321  112233444555444433 222 3567777754


No 145
>PTZ00420 coronin; Provisional
Probab=90.90  E-value=16  Score=35.45  Aligned_cols=141  Identities=12%  Similarity=0.144  Sum_probs=68.0

Q ss_pred             EEEEEeCCCCCCceEEEeCCCCcEE-EecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCcee-ecCCCC
Q 045821          169 KIYCIGGLGDTHSWDVYDPRTNNWK-LHTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWL-HADANM  246 (352)
Q Consensus       169 ~iyv~gG~~~~~~v~~yd~~t~~W~-~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~-~~~~~~  246 (352)
                      .+++.+|.+  ..+.+||..+.+=. .+. .+ ......+...+|.+++.++.++     .+..||+.+.+=. .+.. .
T Consensus       139 ~iLaSgS~D--gtIrIWDl~tg~~~~~i~-~~-~~V~SlswspdG~lLat~s~D~-----~IrIwD~Rsg~~i~tl~g-H  208 (568)
T PTZ00420        139 YIMCSSGFD--SFVNIWDIENEKRAFQIN-MP-KKLSSLKWNIKGNLLSGTCVGK-----HMHIIDPRKQEIASSFHI-H  208 (568)
T ss_pred             eEEEEEeCC--CeEEEEECCCCcEEEEEe-cC-CcEEEEEECCCCCEEEEEecCC-----EEEEEECCCCcEEEEEec-c
Confidence            455566655  67889999876511 111 11 1111122234778777755432     5888898765422 1211 1


Q ss_pred             CCCCCCcEEE-----ECCEEEEEecCCC---CeEEEEeCCC-CceEEccCCCCCcCCCCceEE----EeCCEEEEEcCCc
Q 045821          247 ASGWRGPAVV-----VDDALYVLDQSSG---TKLMMWQKES-REWSPVGRLSTLLTRPPCKLV----AIGKTIFVIGKGC  313 (352)
Q Consensus       247 ~~~~~~~~~~-----~~~~l~~~gg~~~---~~v~~yd~~~-~~W~~~~~~~~~~~~~~~~~~----~~~~~i~v~gG~~  313 (352)
                      ........+.     .++..++.+|.++   ..+.+||..+ ..-...-.+..    ....+.    ...+.+|+.|..+
T Consensus       209 ~g~~~s~~v~~~~fs~d~~~IlTtG~d~~~~R~VkLWDlr~~~~pl~~~~ld~----~~~~L~p~~D~~tg~l~lsGkGD  284 (568)
T PTZ00420        209 DGGKNTKNIWIDGLGGDDNYILSTGFSKNNMREMKLWDLKNTTSALVTMSIDN----ASAPLIPHYDESTGLIYLIGKGD  284 (568)
T ss_pred             cCCceeEEEEeeeEcCCCCEEEEEEcCCCCccEEEEEECCCCCCceEEEEecC----CccceEEeeeCCCCCEEEEEECC
Confidence            1100011111     2445555556543   3699999874 22111111111    111111    2247788888444


Q ss_pred             -eEEEEEcCCc
Q 045821          314 -SAVVIDVGNI  323 (352)
Q Consensus       314 -~~~~~d~~~~  323 (352)
                       .+.+|++...
T Consensus       285 ~tIr~~e~~~~  295 (568)
T PTZ00420        285 GNCRYYQHSLG  295 (568)
T ss_pred             CeEEEEEccCC
Confidence             4668887654


No 146
>PF02897 Peptidase_S9_N:  Prolyl oligopeptidase, N-terminal beta-propeller domain;  InterPro: IPR004106 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This entry represents the beta-propeller domain found at the N-terminal of prolyl oligopeptidase, including acylamino-acid-releasing enzyme (also known as acylaminoacyl peptidase), which belong to the MEROPS peptidase family S9 (clan SC), subfamily S9A. The prolyl oligopeptidase family consist of a number of evolutionary related peptidases whose catalytic activity seems to be provided by a charge relay system similar to that of the trypsin family of serine proteases, but which evolved by independent convergent evolution. The N-terminal domain of prolyl oligopeptidases form an unusual 7-bladed beta-propeller consisting of seven 4-stranded beta-sheet motifs.  Prolyl oligopeptidase is a large cytosolic enzyme involved in the maturation and degradation of peptide hormones and neuropeptides, which relate to the induction of amnesia. The enzyme contains a peptidase domain, where its catalytic triad (Ser554, His680, Asp641) is covered by the central tunnel of the N-terminal beta-propeller domain. In this way, large structured peptides are excluded from the active site, thereby protecting larger peptides and proteins from proteolysis in the cytosol []. The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. Mammalian acylaminoacyl peptidase is an exopeptidase that is a member of the same prolyl oligopeptidase family of serine peptidases. This enzyme removes acylated amino acid residues from the N terminus of oligopeptides [].; GO: 0004252 serine-type endopeptidase activity, 0006508 proteolysis; PDB: 2BKL_B 3DDU_A 1YR2_A 2XE4_A 1VZ3_A 3EQ9_A 1O6F_A 3EQ7_A 4AN0_A 1UOP_A ....
Probab=90.84  E-value=14  Score=34.36  Aligned_cols=161  Identities=11%  Similarity=0.082  Sum_probs=86.2

Q ss_pred             CCEEEEEEecCCCCceeEEEecCCCCC-----CCeeEeccCCCcccCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEE
Q 045821           68 ETWIYAFCRDNKLERVCCYVLDPNSTR-----RSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYD  142 (352)
Q Consensus        68 ~~~l~~~gg~~~~~~~~~~~~d~~~~~-----~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d  142 (352)
                      +.+|++.-..... ...++.+|.  ..     ..|..+.+-.    .-..+.+...++.+|+....+   .....+..++
T Consensus       238 ~~~l~i~~~~~~~-~s~v~~~d~--~~~~~~~~~~~~l~~~~----~~~~~~v~~~~~~~yi~Tn~~---a~~~~l~~~~  307 (414)
T PF02897_consen  238 GRYLFISSSSGTS-ESEVYLLDL--DDGGSPDAKPKLLSPRE----DGVEYYVDHHGDRLYILTNDD---APNGRLVAVD  307 (414)
T ss_dssp             SSEEEEEEESSSS-EEEEEEEEC--CCTTTSS-SEEEEEESS----SS-EEEEEEETTEEEEEE-TT----TT-EEEEEE
T ss_pred             ccEEEEEEEcccc-CCeEEEEec--cccCCCcCCcEEEeCCC----CceEEEEEccCCEEEEeeCCC---CCCcEEEEec
Confidence            4555554333222 466777887  54     3777776532    122334455588999887633   3356788888


Q ss_pred             CCCCe---eE-ecCCCCCcceeeeeEEECCEEEEEeCCCCCCceEEEeCC-CCcEEEecCCCCcccceeEEE--E-CCEE
Q 045821          143 ASMNT---WT-DAAPMSTARCYFPCGVLNQKIYCIGGLGDTHSWDVYDPR-TNNWKLHTEPNIFTEIEDSFV--M-DGKI  214 (352)
Q Consensus       143 ~~t~~---W~-~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~-t~~W~~~~~~~~~~~~~~~~~--~-~g~i  214 (352)
                      .....   |. .+.+-.....--.+...++.+++..-.+....+.+||.. +..-..++ ++.... .....  . .+.+
T Consensus       308 l~~~~~~~~~~~l~~~~~~~~l~~~~~~~~~Lvl~~~~~~~~~l~v~~~~~~~~~~~~~-~p~~g~-v~~~~~~~~~~~~  385 (414)
T PF02897_consen  308 LADPSPAEWWTVLIPEDEDVSLEDVSLFKDYLVLSYRENGSSRLRVYDLDDGKESREIP-LPEAGS-VSGVSGDFDSDEL  385 (414)
T ss_dssp             TTSTSGGGEEEEEE--SSSEEEEEEEEETTEEEEEEEETTEEEEEEEETT-TEEEEEEE-SSSSSE-EEEEES-TT-SEE
T ss_pred             ccccccccceeEEcCCCCceeEEEEEEECCEEEEEEEECCccEEEEEECCCCcEEeeec-CCcceE-EeccCCCCCCCEE
Confidence            87665   65 444322223444555568887776544333788999998 44434444 222221 11111  1 3344


Q ss_pred             EEEccCCCCCCceEEEEEeCCCCceeec
Q 045821          215 YIRCSASAATSHVCALVYEPSTDSWLHA  242 (352)
Q Consensus       215 yv~GG~~~~~~~~~i~~yd~~~~~W~~~  242 (352)
                      ++.-.  +-.....++.||+.+++.+.+
T Consensus       386 ~~~~s--s~~~P~~~y~~d~~t~~~~~~  411 (414)
T PF02897_consen  386 RFSYS--SFTTPPTVYRYDLATGELTLL  411 (414)
T ss_dssp             EEEEE--ETTEEEEEEEEETTTTCEEEE
T ss_pred             EEEEe--CCCCCCEEEEEECCCCCEEEE
Confidence            44311  112233899999999987654


No 147
>COG1520 FOG: WD40-like repeat [Function unknown]
Probab=90.78  E-value=13  Score=33.99  Aligned_cols=156  Identities=16%  Similarity=0.134  Sum_probs=87.4

Q ss_pred             eEEECCEEEEEeCCCCCCceEEEeCCCCc--EEEecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeCC--CCc
Q 045821          163 CGVLNQKIYCIGGLGDTHSWDVYDPRTNN--WKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPS--TDS  238 (352)
Q Consensus       163 ~~~~~~~iyv~gG~~~~~~v~~yd~~t~~--W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~--~~~  238 (352)
                      .+..++++|+....   ..+..+|+.+..  |+.................+|+||+- ...+     .+++||..  +..
T Consensus        64 ~~~~dg~v~~~~~~---G~i~A~d~~~g~~~W~~~~~~~~~~~~~~~~~~~G~i~~g-~~~g-----~~y~ld~~~G~~~  134 (370)
T COG1520          64 PADGDGTVYVGTRD---GNIFALNPDTGLVKWSYPLLGAVAQLSGPILGSDGKIYVG-SWDG-----KLYALDASTGTLV  134 (370)
T ss_pred             cEeeCCeEEEecCC---CcEEEEeCCCCcEEecccCcCcceeccCceEEeCCeEEEe-cccc-----eEEEEECCCCcEE
Confidence            36678999997221   468999999886  97654220112222334448998885 3322     68899984  456


Q ss_pred             eeecCCCCCCCCCCcEEEECCEEEEEecCCCCeEEEEeCCCCc--eEEccCCCCCcCCCCceEEEeCCEEEEEcC--Cce
Q 045821          239 WLHADANMASGWRGPAVVVDDALYVLDQSSGTKLMMWQKESRE--WSPVGRLSTLLTRPPCKLVAIGKTIFVIGK--GCS  314 (352)
Q Consensus       239 W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~--W~~~~~~~~~~~~~~~~~~~~~~~i~v~gG--~~~  314 (352)
                      |+.-.... ..+....+..++.+|+.. ..+ .++.+|.++.+  |+.-...+ ...+.......-++.+|+-..  ...
T Consensus       135 W~~~~~~~-~~~~~~~v~~~~~v~~~s-~~g-~~~al~~~tG~~~W~~~~~~~-~~~~~~~~~~~~~~~vy~~~~~~~~~  210 (370)
T COG1520         135 WSRNVGGS-PYYASPPVVGDGTVYVGT-DDG-HLYALNADTGTLKWTYETPAP-LSLSIYGSPAIASGTVYVGSDGYDGI  210 (370)
T ss_pred             EEEecCCC-eEEecCcEEcCcEEEEec-CCC-eEEEEEccCCcEEEEEecCCc-cccccccCceeecceEEEecCCCcce
Confidence            88654331 233344455666766654 222 78888887544  76332221 111222223355677777644  234


Q ss_pred             EEEEEcCCcc--cccceEE
Q 045821          315 AVVIDVGNIG--NIGGIMV  331 (352)
Q Consensus       315 ~~~~d~~~~~--w~~~~~~  331 (352)
                      .+.+|++..+  |+.....
T Consensus       211 ~~a~~~~~G~~~w~~~~~~  229 (370)
T COG1520         211 LYALNAEDGTLKWSQKVSQ  229 (370)
T ss_pred             EEEEEccCCcEeeeeeeec
Confidence            5668876544  6643333


No 148
>KOG0296 consensus Angio-associated migratory cell protein (contains WD40 repeats) [Function unknown]
Probab=90.77  E-value=12  Score=33.50  Aligned_cols=147  Identities=12%  Similarity=0.170  Sum_probs=75.8

Q ss_pred             CCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcceeeeeEEECCEEEEEeCCCCCCceEEEeCCCC--cEEEec
Q 045821          119 GKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTN--NWKLHT  196 (352)
Q Consensus       119 ~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~--~W~~~~  196 (352)
                      ++.+.+.||.+.      ..+.++..++.|-..-.-...-.......+++.+.+.|+.+  ..+.++...++  +|....
T Consensus        75 ~~~l~aTGGgDD------~AflW~~~~ge~~~eltgHKDSVt~~~FshdgtlLATGdms--G~v~v~~~stg~~~~~~~~  146 (399)
T KOG0296|consen   75 NNNLVATGGGDD------LAFLWDISTGEFAGELTGHKDSVTCCSFSHDGTLLATGDMS--GKVLVFKVSTGGEQWKLDQ  146 (399)
T ss_pred             CCceEEecCCCc------eEEEEEccCCcceeEecCCCCceEEEEEccCceEEEecCCC--ccEEEEEcccCceEEEeec
Confidence            566777787543      46788888877432211111112223334577777777766  56666666655  465431


Q ss_pred             CCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEEECCEEEEEecCCCCeEEEEe
Q 045821          197 EPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVVVDDALYVLDQSSGTKLMMWQ  276 (352)
Q Consensus       197 ~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd  276 (352)
                      ....-..  ..-.-.+.|+++|..++     .+|.|...+..=.++-.-...+-...-+.-+||..+.|-.++ .+.+||
T Consensus       147 e~~dieW--l~WHp~a~illAG~~DG-----svWmw~ip~~~~~kv~~Gh~~~ct~G~f~pdGKr~~tgy~dg-ti~~Wn  218 (399)
T KOG0296|consen  147 EVEDIEW--LKWHPRAHILLAGSTDG-----SVWMWQIPSQALCKVMSGHNSPCTCGEFIPDGKRILTGYDDG-TIIVWN  218 (399)
T ss_pred             ccCceEE--EEecccccEEEeecCCC-----cEEEEECCCcceeeEecCCCCCcccccccCCCceEEEEecCc-eEEEEe
Confidence            1111000  00001345677754332     578887766433332111111111111344677777765555 899999


Q ss_pred             CCCCc
Q 045821          277 KESRE  281 (352)
Q Consensus       277 ~~~~~  281 (352)
                      +++.+
T Consensus       219 ~ktg~  223 (399)
T KOG0296|consen  219 PKTGQ  223 (399)
T ss_pred             cCCCc
Confidence            99875


No 149
>KOG1036 consensus Mitotic spindle checkpoint protein BUB3, WD repeat superfamily [Cell cycle control, cell division, chromosome partitioning]
Probab=90.61  E-value=7.6  Score=33.78  Aligned_cols=132  Identities=11%  Similarity=0.048  Sum_probs=69.1

Q ss_pred             CceEEEeCCCCcEEEecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEEECC
Q 045821          180 HSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVVVDD  259 (352)
Q Consensus       180 ~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~  259 (352)
                      .++-.||..++.=+..  ......--.+++.+..-.+.||.++     .+-.||..+..=..+...-..-+. ..-....
T Consensus        35 gslrlYdv~~~~l~~~--~~~~~plL~c~F~d~~~~~~G~~dg-----~vr~~Dln~~~~~~igth~~~i~c-i~~~~~~  106 (323)
T KOG1036|consen   35 GSLRLYDVPANSLKLK--FKHGAPLLDCAFADESTIVTGGLDG-----QVRRYDLNTGNEDQIGTHDEGIRC-IEYSYEV  106 (323)
T ss_pred             CcEEEEeccchhhhhh--eecCCceeeeeccCCceEEEeccCc-----eEEEEEecCCcceeeccCCCceEE-EEeeccC
Confidence            5677888887732111  1111111123344444344455553     588999988877666542111100 0011234


Q ss_pred             EEEEEecCCCCeEEEEeCCCCceEEccCCCCCcCCCCceEEEeCCEEEEEcCCceEEEEEcCCcc
Q 045821          260 ALYVLDQSSGTKLMMWQKESREWSPVGRLSTLLTRPPCKLVAIGKTIFVIGKGCSAVVIDVGNIG  324 (352)
Q Consensus       260 ~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~~~i~v~gG~~~~~~~d~~~~~  324 (352)
                      ..++.||.++ .+-.+|+...+    ..........-+++.+.++.|+|-+-..++.+||..+..
T Consensus       107 ~~vIsgsWD~-~ik~wD~R~~~----~~~~~d~~kkVy~~~v~g~~LvVg~~~r~v~iyDLRn~~  166 (323)
T KOG1036|consen  107 GCVISGSWDK-TIKFWDPRNKV----VVGTFDQGKKVYCMDVSGNRLVVGTSDRKVLIYDLRNLD  166 (323)
T ss_pred             CeEEEcccCc-cEEEEeccccc----cccccccCceEEEEeccCCEEEEeecCceEEEEEccccc
Confidence            4566777776 89999998622    111111133456666666666654444467789976543


No 150
>COG4447 Uncharacterized protein related to plant photosystem II stability/assembly factor [General function prediction only]
Probab=90.54  E-value=6.9  Score=33.76  Aligned_cols=244  Identities=14%  Similarity=0.146  Sum_probs=123.9

Q ss_pred             CCEEEEEEecCCCCceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEECCCCe
Q 045821           68 ETWIYAFCRDNKLERVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDASMNT  147 (352)
Q Consensus        68 ~~~l~~~gg~~~~~~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~  147 (352)
                      +.+.+++|+.   .  .+..-|-  ...+|.+.....   .....++....+.+-++.|       ..+.++.=+-.-.+
T Consensus        54 g~~gwlVg~r---g--tiletdd--~g~tw~qal~~~---gr~~f~sv~f~~~egw~vG-------e~sqll~T~DgGqs  116 (339)
T COG4447          54 GSHGWLVGGR---G--TILETDD--GGITWAQALDFL---GRHAFHSVSFLGMEGWIVG-------EPSQLLHTTDGGQS  116 (339)
T ss_pred             CcceEEEcCc---c--eEEEecC--Ccccchhhhchh---hhhheeeeeeecccccccC-------CcceEEEecCCCcc
Confidence            6778888764   1  2232344  678898775443   1222333333343333333       12344444444557


Q ss_pred             eEecCCC-CCcceeeeeEEECC-EEEEEeCCCCCCceEEEeCCCCcEEEecCCCCc--ccceeEEEECCEEEEEccCCCC
Q 045821          148 WTDAAPM-STARCYFPCGVLNQ-KIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIF--TEIEDSFVMDGKIYIRCSASAA  223 (352)
Q Consensus       148 W~~~~~~-~~~r~~~~~~~~~~-~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~--~~~~~~~~~~g~iyv~GG~~~~  223 (352)
                      |.+++.- ..+-...+...++. +-|++|-+.   .++.=+-.-+.|+.+.+-...  .-...+.+.+|...++|-..  
T Consensus       117 WARi~~~e~~eg~~~sI~f~d~q~g~m~gd~G---ail~T~DgGk~Wk~l~e~~v~~~~~n~ia~s~dng~vaVg~rG--  191 (339)
T COG4447         117 WARIPLSEKLEGFPDSITFLDDQRGEMLGDQG---AILKTTDGGKNWKALVEKAVGLAVPNEIARSADNGYVAVGARG--  191 (339)
T ss_pred             hhhchhhcCCCCCcceeEEecchhhhhhcccc---eEEEecCCcccHhHhcccccchhhhhhhhhhccCCeEEEecCc--
Confidence            9888743 23333344555543 445555432   233333334579887642222  22333455667766664222  


Q ss_pred             CCceEEEEEeCCCCceeecCCCCCCCCCCcEEEECC--EEEEEecCCCCeEEEEeCCCCceEEccCCCCCcCCCCceEE-
Q 045821          224 TSHVCALVYEPSTDSWLHADANMASGWRGPAVVVDD--ALYVLDQSSGTKLMMWQKESREWSPVGRLSTLLTRPPCKLV-  300 (352)
Q Consensus       224 ~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~--~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~-  300 (352)
                         +..-..+.....|..-.....++...+....++  -++..|+..  ..+..+...+.|+.++.... ..+...++. 
T Consensus       192 ---s~f~T~~aGqt~~~~~g~~s~~~letmg~adag~~g~la~g~qg--~~f~~~~~gD~wsd~~~~~~-~g~~~~Gl~d  265 (339)
T COG4447         192 ---SFFSTWGAGQTVWLPHGRNSSRRLETMGLADAGSKGLLARGGQG--DQFSWVCGGDEWSDQGEPVN-LGRRSWGLLD  265 (339)
T ss_pred             ---ceEecCCCCccEEeccCCCccchhcccccccCCccceEEEcccc--ceeecCCCcccccccccchh-cccCCCcccc
Confidence               111123333444444333222333333344444  355556543  57778888999998863322 223333222 


Q ss_pred             ---EeCCEEEEEcCCceEEEEEcCCcccccceEEeccCCCCC
Q 045821          301 ---AIGKTIFVIGKGCSAVVIDVGNIGNIGGIMVSSSIPKLN  339 (352)
Q Consensus       301 ---~~~~~i~v~gG~~~~~~~d~~~~~w~~~~~~~~~~~~~~  339 (352)
                         ...+.+||.|..-+++.---.-.+|.++.-+..++-.++
T Consensus       266 ~a~~a~~~v~v~G~gGnvl~StdgG~t~skd~g~~er~s~l~  307 (339)
T COG4447         266 FAPRAPPEVWVSGIGGNVLASTDGGTTWSKDGGVEERVSNLY  307 (339)
T ss_pred             ccccCCCCeEEeccCccEEEecCCCeeEeccCChhhhhhhhh
Confidence               236789998876667765566667887665554444333


No 151
>TIGR03074 PQQ_membr_DH membrane-bound PQQ-dependent dehydrogenase, glucose/quinate/shikimate family. This protein family has a phylogenetic distribution very similar to that coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Members of this family have several predicted transmembrane helices in the N-terminal region, and include the quinoprotein glucose dehydrogenase (EC 1.1.5.2) of Escherichia coli and the quinate/shikimate dehydrogenase of Acinetobacter sp. ADP1 (EC 1.1.99.25). Sequences closely related except for the absense of the N-terminal hydrophobic region, scoring in the gray zone between the trusted and noise cutoffs, include PQQ-dependent glycerol (EC 1.1.99.22) and and other polyol (sugar alcohol) dehydrogenases.
Probab=89.97  E-value=24  Score=35.77  Aligned_cols=35  Identities=14%  Similarity=0.171  Sum_probs=25.3

Q ss_pred             eeeeEEECCEEEEEeCCCCCCceEEEeCCCC--cEEEecC
Q 045821          160 YFPCGVLNQKIYCIGGLGDTHSWDVYDPRTN--NWKLHTE  197 (352)
Q Consensus       160 ~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~--~W~~~~~  197 (352)
                      ....+++++.+|+....   ..++.+|..|+  .|+.-+.
T Consensus       187 e~TPlvvgg~lYv~t~~---~~V~ALDa~TGk~lW~~d~~  223 (764)
T TIGR03074       187 QATPLKVGDTLYLCTPH---NKVIALDAATGKEKWKFDPK  223 (764)
T ss_pred             ccCCEEECCEEEEECCC---CeEEEEECCCCcEEEEEcCC
Confidence            35567789999998654   56888888876  4876543


No 152
>COG1520 FOG: WD40-like repeat [Function unknown]
Probab=89.76  E-value=16  Score=33.41  Aligned_cols=153  Identities=15%  Similarity=0.170  Sum_probs=85.3

Q ss_pred             EEECCEEEEEeCCCCCCCCCCeEEEEECCCCe--eEecCCCCCcceeeeeEEECCEEEEEeCCCCCCceEEEeCCCC--c
Q 045821          116 EVLGKNAYLLGGCGWSEDATSEVYCYDASMNT--WTDAAPMSTARCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTN--N  191 (352)
Q Consensus       116 ~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~--W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~--~  191 (352)
                      +..++.+|+...       ...++.+|+.+.+  |+................-+|+||+-...   ..+++||..+.  .
T Consensus        65 ~~~dg~v~~~~~-------~G~i~A~d~~~g~~~W~~~~~~~~~~~~~~~~~~~G~i~~g~~~---g~~y~ld~~~G~~~  134 (370)
T COG1520          65 ADGDGTVYVGTR-------DGNIFALNPDTGLVKWSYPLLGAVAQLSGPILGSDGKIYVGSWD---GKLYALDASTGTLV  134 (370)
T ss_pred             EeeCCeEEEecC-------CCcEEEEeCCCCcEEecccCcCcceeccCceEEeCCeEEEeccc---ceEEEEECCCCcEE
Confidence            666888998611       2278999999887  86543320111111112227887775542   25889999644  5


Q ss_pred             EEEecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeCCC--CceeecCCC-CCCCCCCcEEEECCEEEEEecCC
Q 045821          192 WKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPST--DSWLHADAN-MASGWRGPAVVVDDALYVLDQSS  268 (352)
Q Consensus       192 W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~--~~W~~~~~~-~~~~~~~~~~~~~~~l~~~gg~~  268 (352)
                      |+.-..- ........++.++.+|+..      ....++++|..+  ..|+.-... ......+.....++.+|+.....
T Consensus       135 W~~~~~~-~~~~~~~~v~~~~~v~~~s------~~g~~~al~~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~vy~~~~~~  207 (370)
T COG1520         135 WSRNVGG-SPYYASPPVVGDGTVYVGT------DDGHLYALNADTGTLKWTYETPAPLSLSIYGSPAIASGTVYVGSDGY  207 (370)
T ss_pred             EEEecCC-CeEEecCcEEcCcEEEEec------CCCeEEEEEccCCcEEEEEecCCccccccccCceeecceEEEecCCC
Confidence            8765433 1222234556677888762      112577777764  458744322 22233333335566666653310


Q ss_pred             CCeEEEEeCCCCc--eEEc
Q 045821          269 GTKLMMWQKESRE--WSPV  285 (352)
Q Consensus       269 ~~~v~~yd~~~~~--W~~~  285 (352)
                      ...++.+|+++++  |++.
T Consensus       208 ~~~~~a~~~~~G~~~w~~~  226 (370)
T COG1520         208 DGILYALNAEDGTLKWSQK  226 (370)
T ss_pred             cceEEEEEccCCcEeeeee
Confidence            2269999997654  7753


No 153
>KOG0315 consensus G-protein beta subunit-like protein (contains WD40 repeats) [General function prediction only]
Probab=89.33  E-value=12  Score=31.66  Aligned_cols=177  Identities=10%  Similarity=0.109  Sum_probs=89.7

Q ss_pred             CeEEEEECCCCeeEecCCCCCcceeeeeEE--ECCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEE--C
Q 045821          136 SEVYCYDASMNTWTDAAPMSTARCYFPCGV--LNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVM--D  211 (352)
Q Consensus       136 ~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~--~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~--~  211 (352)
                      ..+-.||..++.=..+...-..+..-.++.  .+|+....||.+  ..+-++|...-.-.+.-..+. . -..++..  +
T Consensus        61 qhvRlyD~~S~np~Pv~t~e~h~kNVtaVgF~~dgrWMyTgseD--gt~kIWdlR~~~~qR~~~~~s-p-Vn~vvlhpnQ  136 (311)
T KOG0315|consen   61 QHVRLYDLNSNNPNPVATFEGHTKNVTAVGFQCDGRWMYTGSED--GTVKIWDLRSLSCQRNYQHNS-P-VNTVVLHPNQ  136 (311)
T ss_pred             CeeEEEEccCCCCCceeEEeccCCceEEEEEeecCeEEEecCCC--ceEEEEeccCcccchhccCCC-C-cceEEecCCc
Confidence            567889998874222222222223333333  378888888866  566777777633332211111 1 1123333  3


Q ss_pred             CEEEEEccCCCCCCceEEEEEeCCCCceeecCCCC-CCCCCCcEEEECCEEEEEecCCCCeEEEEeCCCCce-EEccCCC
Q 045821          212 GKIYIRCSASAATSHVCALVYEPSTDSWLHADANM-ASGWRGPAVVVDDALYVLDQSSGTKLMMWQKESREW-SPVGRLS  289 (352)
Q Consensus       212 g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~-~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~W-~~~~~~~  289 (352)
                      +.|++  |.+    ...|+++|+.++.......+- .......++.-+|+.++.+...| ..++++.-+.+- +++..+.
T Consensus       137 teLis--~dq----sg~irvWDl~~~~c~~~liPe~~~~i~sl~v~~dgsml~a~nnkG-~cyvW~l~~~~~~s~l~P~~  209 (311)
T KOG0315|consen  137 TELIS--GDQ----SGNIRVWDLGENSCTHELIPEDDTSIQSLTVMPDGSMLAAANNKG-NCYVWRLLNHQTASELEPVH  209 (311)
T ss_pred             ceEEe--ecC----CCcEEEEEccCCccccccCCCCCcceeeEEEcCCCcEEEEecCCc-cEEEEEccCCCccccceEhh
Confidence            34443  222    235999999999876553322 22223334555777555544444 778887765432 2222222


Q ss_pred             CCcCCCCc---eEEEeCCEEEEEcCCce-EEEEEcCCc
Q 045821          290 TLLTRPPC---KLVAIGKTIFVIGKGCS-AVVIDVGNI  323 (352)
Q Consensus       290 ~~~~~~~~---~~~~~~~~i~v~gG~~~-~~~~d~~~~  323 (352)
                      ....+..+   +....++|.++..+.++ +.|+..+.+
T Consensus       210 k~~ah~~~il~C~lSPd~k~lat~ssdktv~iwn~~~~  247 (311)
T KOG0315|consen  210 KFQAHNGHILRCLLSPDVKYLATCSSDKTVKIWNTDDF  247 (311)
T ss_pred             heecccceEEEEEECCCCcEEEeecCCceEEEEecCCc
Confidence            21122232   23344677666666554 446666655


No 154
>KOG1332 consensus Vesicle coat complex COPII, subunit SEC13 [Intracellular trafficking, secretion, and vesicular transport]
Probab=89.22  E-value=13  Score=31.55  Aligned_cols=103  Identities=19%  Similarity=0.402  Sum_probs=52.5

Q ss_pred             EEEEEeCCCCCCceEEEeCCCCcEEEecC----------------CCCcccceeEEEECCEEEEEccCCCCCCceEEEEE
Q 045821          169 KIYCIGGLGDTHSWDVYDPRTNNWKLHTE----------------PNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVY  232 (352)
Q Consensus       169 ~iyv~gG~~~~~~v~~yd~~t~~W~~~~~----------------~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~y  232 (352)
                      +-++.||++  +.|-+++...++|..-..                ...++...+++.-++++++.             .-
T Consensus       176 krlvSgGcD--n~VkiW~~~~~~w~~e~~l~~H~dwVRDVAwaP~~gl~~s~iAS~SqDg~viIw-------------t~  240 (299)
T KOG1332|consen  176 KRLVSGGCD--NLVKIWKFDSDSWKLERTLEGHKDWVRDVAWAPSVGLPKSTIASCSQDGTVIIW-------------TK  240 (299)
T ss_pred             ceeeccCCc--cceeeeecCCcchhhhhhhhhcchhhhhhhhccccCCCceeeEEecCCCcEEEE-------------Ee
Confidence            457888888  455555555557753221                11122222344445555444             44


Q ss_pred             eCCCCceeec--CCCCCCCCCCcEEEECCEEEEEecCCCCeEEEEeCC-CCceEEccCC
Q 045821          233 EPSTDSWLHA--DANMASGWRGPAVVVDDALYVLDQSSGTKLMMWQKE-SREWSPVGRL  288 (352)
Q Consensus       233 d~~~~~W~~~--~~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~-~~~W~~~~~~  288 (352)
                      +...+.|+..  ...+...++..--..++.|-+-||.+  .+.+|-.. .++|.+++..
T Consensus       241 ~~e~e~wk~tll~~f~~~~w~vSWS~sGn~LaVs~GdN--kvtlwke~~~Gkw~~v~~~  297 (299)
T KOG1332|consen  241 DEEYEPWKKTLLEEFPDVVWRVSWSLSGNILAVSGGDN--KVTLWKENVDGKWEEVGEV  297 (299)
T ss_pred             cCccCcccccccccCCcceEEEEEeccccEEEEecCCc--EEEEEEeCCCCcEEEcccc
Confidence            4556778643  33233333322223344555555544  66666555 4599998754


No 155
>TIGR03074 PQQ_membr_DH membrane-bound PQQ-dependent dehydrogenase, glucose/quinate/shikimate family. This protein family has a phylogenetic distribution very similar to that coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Members of this family have several predicted transmembrane helices in the N-terminal region, and include the quinoprotein glucose dehydrogenase (EC 1.1.5.2) of Escherichia coli and the quinate/shikimate dehydrogenase of Acinetobacter sp. ADP1 (EC 1.1.99.25). Sequences closely related except for the absense of the N-terminal hydrophobic region, scoring in the gray zone between the trusted and noise cutoffs, include PQQ-dependent glycerol (EC 1.1.99.22) and and other polyol (sugar alcohol) dehydrogenases.
Probab=89.04  E-value=28  Score=35.28  Aligned_cols=35  Identities=20%  Similarity=0.295  Sum_probs=24.4

Q ss_pred             eEEEEECCEEEEEeCCCCCCCCCCeEEEEECCCCe--eEecCCC
Q 045821          113 MGFEVLGKNAYLLGGCGWSEDATSEVYCYDASMNT--WTDAAPM  154 (352)
Q Consensus       113 ~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~--W~~~~~~  154 (352)
                      .+-.++++.+|+...       .+.++.+|..|++  |+.-+..
T Consensus       188 ~TPlvvgg~lYv~t~-------~~~V~ALDa~TGk~lW~~d~~~  224 (764)
T TIGR03074       188 ATPLKVGDTLYLCTP-------HNKVIALDAATGKEKWKFDPKL  224 (764)
T ss_pred             cCCEEECCEEEEECC-------CCeEEEEECCCCcEEEEEcCCC
Confidence            445667999999843       3568888888775  7765543


No 156
>KOG0266 consensus WD40 repeat-containing protein [General function prediction only]
Probab=88.78  E-value=22  Score=33.68  Aligned_cols=146  Identities=11%  Similarity=0.100  Sum_probs=72.1

Q ss_pred             CCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcceeeeeEEECCEEEEEeCCCCCCceEEEeCCCCcEE---Ee
Q 045821          119 GKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTNNWK---LH  195 (352)
Q Consensus       119 ~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~W~---~~  195 (352)
                      .+.+++.|+.+      ..+.++|..+.+-...-.............-++..++.+.++  ..+.+||..+..-+   .+
T Consensus       257 ~g~~i~Sgs~D------~tvriWd~~~~~~~~~l~~hs~~is~~~f~~d~~~l~s~s~d--~~i~vwd~~~~~~~~~~~~  328 (456)
T KOG0266|consen  257 DGNLLVSGSDD------GTVRIWDVRTGECVRKLKGHSDGISGLAFSPDGNLLVSASYD--GTIRVWDLETGSKLCLKLL  328 (456)
T ss_pred             CCCEEEEecCC------CcEEEEeccCCeEEEeeeccCCceEEEEECCCCCEEEEcCCC--ccEEEEECCCCceeeeecc
Confidence            44677777653      458888998855333222222222222333367777777664  78899999988743   22


Q ss_pred             cCCCCcccceeEEEE-CCEEEEEccCCCCCCceEEEEEeCCCCc----eeecCCCCCCCCCCcEEEECCEEEEEecCCCC
Q 045821          196 TEPNIFTEIEDSFVM-DGKIYIRCSASAATSHVCALVYEPSTDS----WLHADANMASGWRGPAVVVDDALYVLDQSSGT  270 (352)
Q Consensus       196 ~~~~~~~~~~~~~~~-~g~iyv~GG~~~~~~~~~i~~yd~~~~~----W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~~  270 (352)
                      ...........+... +++..+.+..+     ..+-.+|.....    |...... ...........++++.+.|...+ 
T Consensus       329 ~~~~~~~~~~~~~fsp~~~~ll~~~~d-----~~~~~w~l~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~i~sg~~d~-  401 (456)
T KOG0266|consen  329 SGAENSAPVTSVQFSPNGKYLLSASLD-----RTLKLWDLRSGKSVGTYTGHSNL-VRCIFSPTLSTGGKLIYSGSEDG-  401 (456)
T ss_pred             cCCCCCCceeEEEECCCCcEEEEecCC-----CeEEEEEccCCcceeeecccCCc-ceeEecccccCCCCeEEEEeCCc-
Confidence            211111111222332 44444443222     124444444332    2222111 00111111234666666666555 


Q ss_pred             eEEEEeCCC
Q 045821          271 KLMMWQKES  279 (352)
Q Consensus       271 ~v~~yd~~~  279 (352)
                      .|..||+.+
T Consensus       402 ~v~~~~~~s  410 (456)
T KOG0266|consen  402 SVYVWDSSS  410 (456)
T ss_pred             eEEEEeCCc
Confidence            899999987


No 157
>PF02897 Peptidase_S9_N:  Prolyl oligopeptidase, N-terminal beta-propeller domain;  InterPro: IPR004106 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This entry represents the beta-propeller domain found at the N-terminal of prolyl oligopeptidase, including acylamino-acid-releasing enzyme (also known as acylaminoacyl peptidase), which belong to the MEROPS peptidase family S9 (clan SC), subfamily S9A. The prolyl oligopeptidase family consist of a number of evolutionary related peptidases whose catalytic activity seems to be provided by a charge relay system similar to that of the trypsin family of serine proteases, but which evolved by independent convergent evolution. The N-terminal domain of prolyl oligopeptidases form an unusual 7-bladed beta-propeller consisting of seven 4-stranded beta-sheet motifs.  Prolyl oligopeptidase is a large cytosolic enzyme involved in the maturation and degradation of peptide hormones and neuropeptides, which relate to the induction of amnesia. The enzyme contains a peptidase domain, where its catalytic triad (Ser554, His680, Asp641) is covered by the central tunnel of the N-terminal beta-propeller domain. In this way, large structured peptides are excluded from the active site, thereby protecting larger peptides and proteins from proteolysis in the cytosol []. The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. Mammalian acylaminoacyl peptidase is an exopeptidase that is a member of the same prolyl oligopeptidase family of serine peptidases. This enzyme removes acylated amino acid residues from the N terminus of oligopeptides [].; GO: 0004252 serine-type endopeptidase activity, 0006508 proteolysis; PDB: 2BKL_B 3DDU_A 1YR2_A 2XE4_A 1VZ3_A 3EQ9_A 1O6F_A 3EQ7_A 4AN0_A 1UOP_A ....
Probab=88.38  E-value=21  Score=33.08  Aligned_cols=147  Identities=12%  Similarity=0.104  Sum_probs=83.0

Q ss_pred             CCeEEEEECCCC-----eeEecCCCCCcceeeeeEEECCEEEEEeCCCCC-CceEEEeCCCCc---EE-EecCCCCcccc
Q 045821          135 TSEVYCYDASMN-----TWTDAAPMSTARCYFPCGVLNQKIYCIGGLGDT-HSWDVYDPRTNN---WK-LHTEPNIFTEI  204 (352)
Q Consensus       135 ~~~~~~~d~~t~-----~W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~-~~v~~yd~~t~~---W~-~~~~~~~~~~~  204 (352)
                      .++++..|....     .|..+..-. .-....+...++.+|+....... ..+..++..+..   |. .+.+.......
T Consensus       251 ~s~v~~~d~~~~~~~~~~~~~l~~~~-~~~~~~v~~~~~~~yi~Tn~~a~~~~l~~~~l~~~~~~~~~~~l~~~~~~~~l  329 (414)
T PF02897_consen  251 ESEVYLLDLDDGGSPDAKPKLLSPRE-DGVEYYVDHHGDRLYILTNDDAPNGRLVAVDLADPSPAEWWTVLIPEDEDVSL  329 (414)
T ss_dssp             EEEEEEEECCCTTTSS-SEEEEEESS-SS-EEEEEEETTEEEEEE-TT-TT-EEEEEETTSTSGGGEEEEEE--SSSEEE
T ss_pred             CCeEEEEeccccCCCcCCcEEEeCCC-CceEEEEEccCCEEEEeeCCCCCCcEEEEecccccccccceeEEcCCCCceeE
Confidence            378999999875     677775422 22333344458899998875544 788888888665   66 44433333344


Q ss_pred             eeEEEECCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEE---ECCEEE-EEecCCC-CeEEEEeCCC
Q 045821          205 EDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVV---VDDALY-VLDQSSG-TKLMMWQKES  279 (352)
Q Consensus       205 ~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~---~~~~l~-~~gg~~~-~~v~~yd~~~  279 (352)
                      ..+...++.|++.. ..  +....+.++|+. ..|.....+.+.........   .++.++ .+.+... ..++.||+.+
T Consensus       330 ~~~~~~~~~Lvl~~-~~--~~~~~l~v~~~~-~~~~~~~~~~p~~g~v~~~~~~~~~~~~~~~~ss~~~P~~~y~~d~~t  405 (414)
T PF02897_consen  330 EDVSLFKDYLVLSY-RE--NGSSRLRVYDLD-DGKESREIPLPEAGSVSGVSGDFDSDELRFSYSSFTTPPTVYRYDLAT  405 (414)
T ss_dssp             EEEEEETTEEEEEE-EE--TTEEEEEEEETT--TEEEEEEESSSSSEEEEEES-TT-SEEEEEEEETTEEEEEEEEETTT
T ss_pred             EEEEEECCEEEEEE-EE--CCccEEEEEECC-CCcEEeeecCCcceEEeccCCCCCCCEEEEEEeCCCCCCEEEEEECCC
Confidence            55666788888873 22  334478889988 33433322222211111111   133443 4455443 5899999999


Q ss_pred             CceEEcc
Q 045821          280 REWSPVG  286 (352)
Q Consensus       280 ~~W~~~~  286 (352)
                      ++.+.+.
T Consensus       406 ~~~~~~k  412 (414)
T PF02897_consen  406 GELTLLK  412 (414)
T ss_dssp             TCEEEEE
T ss_pred             CCEEEEE
Confidence            9877653


No 158
>KOG1446 consensus Histone H3 (Lys4) methyltransferase complex and RNA cleavage factor II complex, subunit SWD2 [RNA processing and modification; Chromatin structure and dynamics; Posttranslational modification, protein turnover, chaperones]
Probab=88.08  E-value=17  Score=31.72  Aligned_cols=101  Identities=8%  Similarity=-0.012  Sum_probs=53.5

Q ss_pred             eEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcceeeee
Q 045821           84 CCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCYFPC  163 (352)
Q Consensus        84 ~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~  163 (352)
                      .+..||.  ..++-.+.-...   ..-.....-.......+.+-.    ....++...+..+|+.-+.-+-...+...-.
T Consensus        37 sl~LYd~--~~g~~~~ti~sk---kyG~~~~~Fth~~~~~i~sSt----k~d~tIryLsl~dNkylRYF~GH~~~V~sL~  107 (311)
T KOG1446|consen   37 SLRLYDS--LSGKQVKTINSK---KYGVDLACFTHHSNTVIHSST----KEDDTIRYLSLHDNKYLRYFPGHKKRVNSLS  107 (311)
T ss_pred             eEEEEEc--CCCceeeEeecc---cccccEEEEecCCceEEEccC----CCCCceEEEEeecCceEEEcCCCCceEEEEE
Confidence            4566788  776655444333   111112222223333333322    2346778888888887554332333332222


Q ss_pred             EEECCEEEEEeCCCCCCceEEEeCCCCcEEEe
Q 045821          164 GVLNQKIYCIGGLGDTHSWDVYDPRTNNWKLH  195 (352)
Q Consensus       164 ~~~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~  195 (352)
                      +.=.+..|+-++.+  ..+..+|..+...+-+
T Consensus       108 ~sP~~d~FlS~S~D--~tvrLWDlR~~~cqg~  137 (311)
T KOG1446|consen  108 VSPKDDTFLSSSLD--KTVRLWDLRVKKCQGL  137 (311)
T ss_pred             ecCCCCeEEecccC--CeEEeeEecCCCCceE
Confidence            22234778888876  6788888887665543


No 159
>cd00216 PQQ_DH Dehydrogenases with pyrrolo-quinoline quinone (PQQ) as cofactor, like ethanol, methanol, and membrane bound glucose dehydrogenases. The alignment model contains an 8-bladed beta-propeller.
Probab=87.98  E-value=25  Score=33.54  Aligned_cols=162  Identities=12%  Similarity=0.152  Sum_probs=79.7

Q ss_pred             CCeEEEEECCCCe--eEecCCCCC---cceee--eeE---EECCE---EEEEeCCCCCCceEEEeCCCCc--EEEecCCC
Q 045821          135 TSEVYCYDASMNT--WTDAAPMST---ARCYF--PCG---VLNQK---IYCIGGLGDTHSWDVYDPRTNN--WKLHTEPN  199 (352)
Q Consensus       135 ~~~~~~~d~~t~~--W~~~~~~~~---~r~~~--~~~---~~~~~---iyv~gG~~~~~~v~~yd~~t~~--W~~~~~~~  199 (352)
                      .+.++.+|..+++  |+.-...+.   -+...  ...   .+++.   +.++|..+  ..++.+|..+.+  |+.-... 
T Consensus       255 ~~~l~Ald~~tG~~~W~~~~~~~~~~~~~~~s~p~~~~~~~~~g~~~~~V~~g~~~--G~l~ald~~tG~~~W~~~~~~-  331 (488)
T cd00216         255 TDSIVALDADTGKVKWFYQTTPHDLWDYDGPNQPSLADIKPKDGKPVPAIVHAPKN--GFFYVLDRTTGKLISARPEVE-  331 (488)
T ss_pred             eeeEEEEcCCCCCEEEEeeCCCCCCcccccCCCCeEEeccccCCCeeEEEEEECCC--ceEEEEECCCCcEeeEeEeec-
Confidence            4579999999886  874321110   01111  111   12332   33334333  568999999875  8753210 


Q ss_pred             CcccceeEEEECCEEEEEccCCC------------CCCceEEEEEeCCCC--ceeecCCCC-------CCCCCCcEEEEC
Q 045821          200 IFTEIEDSFVMDGKIYIRCSASA------------ATSHVCALVYEPSTD--SWLHADANM-------ASGWRGPAVVVD  258 (352)
Q Consensus       200 ~~~~~~~~~~~~g~iyv~GG~~~------------~~~~~~i~~yd~~~~--~W~~~~~~~-------~~~~~~~~~~~~  258 (352)
                           .....-.+.+|+-.....            ......+.++|..+.  .|+.-....       ........+..+
T Consensus       332 -----~~~~~~~~~vyv~~~~~~~~~~~~~~~~~~~~~~G~l~AlD~~tG~~~W~~~~~~~~~~~~~g~~~~~~~~~~~g  406 (488)
T cd00216         332 -----QPMAYDPGLVYLGAFHIPLGLPPQKKKRCKKPGKGGLAALDPKTGKVVWEKREGTIRDSWNIGFPHWGGSLATAG  406 (488)
T ss_pred             -----cccccCCceEEEccccccccCcccccCCCCCCCceEEEEEeCCCCcEeeEeeCCccccccccCCcccCcceEecC
Confidence                 001111266666421100            011236888888765  487642200       011122233445


Q ss_pred             CEEEEEecCCCCeEEEEeCCCCc--eEEccCCCCCcCCCCceEEEeCCEEEEE
Q 045821          259 DALYVLDQSSGTKLMMWQKESRE--WSPVGRLSTLLTRPPCKLVAIGKTIFVI  309 (352)
Q Consensus       259 ~~l~~~gg~~~~~v~~yd~~~~~--W~~~~~~~~~~~~~~~~~~~~~~~i~v~  309 (352)
                      +.|| +|..++ .++.+|.++++  |+.  .++.. ..........++++||.
T Consensus       407 ~~v~-~g~~dG-~l~ald~~tG~~lW~~--~~~~~-~~a~P~~~~~~g~~yv~  454 (488)
T cd00216         407 NLVF-AGAADG-YFRAFDATTGKELWKF--RTPSG-IQATPMTYEVNGKQYVG  454 (488)
T ss_pred             CeEE-EECCCC-eEEEEECCCCceeeEE--ECCCC-ceEcCEEEEeCCEEEEE
Confidence            5544 454455 89999998765  763  23321 12222333558999986


No 160
>PRK01742 tolB translocation protein TolB; Provisional
Probab=87.23  E-value=26  Score=32.81  Aligned_cols=180  Identities=9%  Similarity=0.003  Sum_probs=86.1

Q ss_pred             ceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECCE-EEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCccee
Q 045821           82 RVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGKN-AYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCY  160 (352)
Q Consensus        82 ~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~-iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~  160 (352)
                      ...++.+|.  .++.-..+...+.    .........+++ |++....++    ..++|.+|..+++.+.+..-...-..
T Consensus       227 ~~~i~i~dl--~tg~~~~l~~~~g----~~~~~~wSPDG~~La~~~~~~g----~~~Iy~~d~~~~~~~~lt~~~~~~~~  296 (429)
T PRK01742        227 KSQLVVHDL--RSGARKVVASFRG----HNGAPAFSPDGSRLAFASSKDG----VLNIYVMGANGGTPSQLTSGAGNNTE  296 (429)
T ss_pred             CcEEEEEeC--CCCceEEEecCCC----ccCceeECCCCCEEEEEEecCC----cEEEEEEECCCCCeEeeccCCCCcCC
Confidence            456777888  7766555554431    111122223444 444332222    24688899988876666542211111


Q ss_pred             eeeEEECC-EEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECCE-EEEEccCCCCCCceEEEEEeCCCCc
Q 045821          161 FPCGVLNQ-KIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGK-IYIRCSASAATSHVCALVYEPSTDS  238 (352)
Q Consensus       161 ~~~~~~~~-~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~-iyv~GG~~~~~~~~~i~~yd~~~~~  238 (352)
                      . ...-++ .|+..........++.+|..+..-+.+. .  .. ......-+|+ |++.++       ..+..+|..+++
T Consensus       297 ~-~wSpDG~~i~f~s~~~g~~~I~~~~~~~~~~~~l~-~--~~-~~~~~SpDG~~ia~~~~-------~~i~~~Dl~~g~  364 (429)
T PRK01742        297 P-SWSPDGQSILFTSDRSGSPQVYRMSASGGGASLVG-G--RG-YSAQISADGKTLVMING-------DNVVKQDLTSGS  364 (429)
T ss_pred             E-EECCCCCEEEEEECCCCCceEEEEECCCCCeEEec-C--CC-CCccCCCCCCEEEEEcC-------CCEEEEECCCCC
Confidence            1 111244 4554443332256777777655433332 1  11 1222223554 444432       236778999888


Q ss_pred             eeecCCCCCCCCCCcEEEECCEEEEEecCCC-C-eEEEEeCCCCceEEc
Q 045821          239 WLHADANMASGWRGPAVVVDDALYVLDQSSG-T-KLMMWQKESREWSPV  285 (352)
Q Consensus       239 W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~-~-~v~~yd~~~~~W~~~  285 (352)
                      +..+....  ........-+++.++++...+ . .++..+.....=.++
T Consensus       365 ~~~lt~~~--~~~~~~~sPdG~~i~~~s~~g~~~~l~~~~~~G~~~~~l  411 (429)
T PRK01742        365 TEVLSSTF--LDESPSISPNGIMIIYSSTQGLGKVLQLVSADGRFKARL  411 (429)
T ss_pred             eEEecCCC--CCCCceECCCCCEEEEEEcCCCceEEEEEECCCCceEEc
Confidence            87764321  112334455676666655433 2 233445444433334


No 161
>KOG2321 consensus WD40 repeat protein [General function prediction only]
Probab=86.72  E-value=17  Score=34.58  Aligned_cols=62  Identities=11%  Similarity=0.247  Sum_probs=38.2

Q ss_pred             CEEEEEccCCCCCCceEEEEEeCCCCceeec-CCCCCCCCCCcEEEECCEEEEEecCCCCeEEEEeCCCCc
Q 045821          212 GKIYIRCSASAATSHVCALVYEPSTDSWLHA-DANMASGWRGPAVVVDDALYVLDQSSGTKLMMWQKESRE  281 (352)
Q Consensus       212 g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~-~~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~  281 (352)
                      --||+.|...      .++++|+..+.|-.- .... .......+.--..|+.+|+..+ .|..||+.+.+
T Consensus       146 cDly~~gsg~------evYRlNLEqGrfL~P~~~~~-~~lN~v~in~~hgLla~Gt~~g-~VEfwDpR~ks  208 (703)
T KOG2321|consen  146 CDLYLVGSGS------EVYRLNLEQGRFLNPFETDS-GELNVVSINEEHGLLACGTEDG-VVEFWDPRDKS  208 (703)
T ss_pred             ccEEEeecCc------ceEEEEcccccccccccccc-ccceeeeecCccceEEecccCc-eEEEecchhhh
Confidence            3577775333      699999999988542 1111 1111111223456788888777 89999998765


No 162
>COG4946 Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system [Function unknown]
Probab=86.69  E-value=27  Score=32.55  Aligned_cols=184  Identities=10%  Similarity=0.072  Sum_probs=99.1

Q ss_pred             CCCeEEEEECCCCeeEecCCCCCcceeeeeEEECCEEEEEeCCCCC--CceEEEeCCCCcEEEecCCCCcccceeEEEEC
Q 045821          134 ATSEVYCYDASMNTWTDAAPMSTARCYFPCGVLNQKIYCIGGLGDT--HSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMD  211 (352)
Q Consensus       134 ~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~--~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~  211 (352)
                      ...++++++...++-+++.-+...-..-+.-.-++.|.|..-...+  +-.+.|-...+-=.. .+++.... .+.+.-+
T Consensus       105 ~taDly~v~~e~Ge~kRiTyfGr~fT~VaG~~~dg~iiV~TD~~tPF~q~~~lYkv~~dg~~~-e~LnlGpa-thiv~~d  182 (668)
T COG4946         105 QTADLYVVPSEDGEAKRITYFGRRFTRVAGWIPDGEIIVSTDFHTPFSQWTELYKVNVDGIKT-EPLNLGPA-THIVIKD  182 (668)
T ss_pred             ccccEEEEeCCCCcEEEEEEeccccceeeccCCCCCEEEEeccCCCcccceeeeEEccCCcee-eeccCCce-eeEEEeC
Confidence            3567888888888877776553221222222236777777654433  233444444332111 11222222 2345557


Q ss_pred             CEEEEEccCCCC----------CCceEEEEEeCCCCceeecCCCCCCCCCCcEEEECCEEEEEecCCC-CeEEEEeCCCC
Q 045821          212 GKIYIRCSASAA----------TSHVCALVYEPSTDSWLHADANMASGWRGPAVVVDDALYVLDQSSG-TKLMMWQKESR  280 (352)
Q Consensus       212 g~iyv~GG~~~~----------~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~-~~v~~yd~~~~  280 (352)
                      |.+++  |.+..          +..-.+|.-.....++.++-+.. ... ..-+.+++++|.+....+ .+++.-|.+.+
T Consensus       183 g~ivi--gRntydLP~WK~YkGGtrGklWis~d~g~tFeK~vdl~-~~v-S~PmIV~~RvYFlsD~eG~GnlYSvdldGk  258 (668)
T COG4946         183 GIIVI--GRNTYDLPHWKGYKGGTRGKLWISSDGGKTFEKFVDLD-GNV-SSPMIVGERVYFLSDHEGVGNLYSVDLDGK  258 (668)
T ss_pred             CEEEE--ccCcccCcccccccCCccceEEEEecCCcceeeeeecC-CCc-CCceEEcceEEEEecccCccceEEeccCCc
Confidence            75554  33311          11125665544444555553321 111 223567999999987766 57888888887


Q ss_pred             ceEEccCCCCCcCCCCceEEEeCCEEEEEcCCceEEEEEcCCccccc
Q 045821          281 EWSPVGRLSTLLTRPPCKLVAIGKTIFVIGKGCSAVVIDVGNIGNIG  327 (352)
Q Consensus       281 ~W~~~~~~~~~~~~~~~~~~~~~~~i~v~gG~~~~~~~d~~~~~w~~  327 (352)
                      --.+..+.....+|+    +.-+++-.||.-.-+.++|||+++.-++
T Consensus       259 DlrrHTnFtdYY~R~----~nsDGkrIvFq~~GdIylydP~td~lek  301 (668)
T COG4946         259 DLRRHTNFTDYYPRN----ANSDGKRIVFQNAGDIYLYDPETDSLEK  301 (668)
T ss_pred             hhhhcCCchhccccc----cCCCCcEEEEecCCcEEEeCCCcCccee
Confidence            766665554322222    2235665555444479999999988665


No 163
>KOG4649 consensus PQQ (pyrrolo-quinoline quinone) repeat protein [Secondary metabolites biosynthesis, transport and catabolism]
Probab=85.06  E-value=24  Score=30.33  Aligned_cols=90  Identities=11%  Similarity=0.027  Sum_probs=55.5

Q ss_pred             CceEEEeCCCC--cEEEecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeCCC--CceeecCCCCCCCCCCcEE
Q 045821          180 HSWDVYDPRTN--NWKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPST--DSWLHADANMASGWRGPAV  255 (352)
Q Consensus       180 ~~v~~yd~~t~--~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~--~~W~~~~~~~~~~~~~~~~  255 (352)
                      ..+..-|+.++  .|+.+-   ..|..+.+.++++. .|+|.+.+     .++..+.++  .-|..+..  ..-...+.+
T Consensus        33 ~~~~avd~~sG~~~We~il---g~RiE~sa~vvgdf-VV~GCy~g-----~lYfl~~~tGs~~w~f~~~--~~vk~~a~~  101 (354)
T KOG4649|consen   33 GIVIAVDPQSGNLIWEAIL---GVRIECSAIVVGDF-VVLGCYSG-----GLYFLCVKTGSQIWNFVIL--ETVKVRAQC  101 (354)
T ss_pred             ceEEEecCCCCcEEeehhh---CceeeeeeEEECCE-EEEEEccC-----cEEEEEecchhheeeeeeh--hhhccceEE
Confidence            56778888877  498864   45556667777777 56655543     244445444  46776532  112233345


Q ss_pred             EECCEEEEEecCCCCeEEEEeCCCCc
Q 045821          256 VVDDALYVLDQSSGTKLMMWQKESRE  281 (352)
Q Consensus       256 ~~~~~l~~~gg~~~~~v~~yd~~~~~  281 (352)
                      ..++.+...|..++ .++..|+.+..
T Consensus       102 d~~~glIycgshd~-~~yalD~~~~~  126 (354)
T KOG4649|consen  102 DFDGGLIYCGSHDG-NFYALDPKTYG  126 (354)
T ss_pred             cCCCceEEEecCCC-cEEEecccccc
Confidence            55677666776666 88888888754


No 164
>KOG2106 consensus Uncharacterized conserved protein, contains HELP and WD40 domains [Function unknown]
Probab=84.81  E-value=35  Score=32.09  Aligned_cols=185  Identities=10%  Similarity=0.090  Sum_probs=91.3

Q ss_pred             eEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcceeeee
Q 045821           84 CCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCYFPC  163 (352)
Q Consensus        84 ~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~  163 (352)
                      .++.+++  .++...+...+-+......-.+++..+|.-.+.|-      ....+.+|++.+++...-.. -.+..-++.
T Consensus       223 H~~Fw~~--~~~~l~k~~~~fek~ekk~Vl~v~F~engdviTgD------S~G~i~Iw~~~~~~~~k~~~-aH~ggv~~L  293 (626)
T KOG2106|consen  223 HLYFWTL--RGGSLVKRQGIFEKREKKFVLCVTFLENGDVITGD------SGGNILIWSKGTNRISKQVH-AHDGGVFSL  293 (626)
T ss_pred             eEEEEEc--cCCceEEEeeccccccceEEEEEEEcCCCCEEeec------CCceEEEEeCCCceEEeEee-ecCCceEEE
Confidence            4666699  99888777665433233222333333444444443      33468899998887655433 233344666


Q ss_pred             EEEC-CEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccc-eeEEEECCEEEEEccCCCCCCceEEEEEeCCCCceee
Q 045821          164 GVLN-QKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEI-EDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWLH  241 (352)
Q Consensus       164 ~~~~-~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~-~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~  241 (352)
                      +.+. |.+.- ||.+  ..+..+|-.-++=+++. +|..... ..++.-.+.||| | ..    .+.+..=..++. .+.
T Consensus       294 ~~lr~GtllS-GgKD--Rki~~Wd~~y~k~r~~e-lPe~~G~iRtv~e~~~di~v-G-Tt----rN~iL~Gt~~~~-f~~  362 (626)
T KOG2106|consen  294 CMLRDGTLLS-GGKD--RKIILWDDNYRKLRETE-LPEQFGPIRTVAEGKGDILV-G-TT----RNFILQGTLENG-FTL  362 (626)
T ss_pred             EEecCccEee-cCcc--ceEEecccccccccccc-CchhcCCeeEEecCCCcEEE-e-ec----cceEEEeeecCC-ceE
Confidence            6664 55555 8876  56677773322222222 2222111 122222334555 2 21    122333222222 221


Q ss_pred             cCCCCCCCCCCcEEEECCEEEEEecCCCCeEEEEeCCCCceEEccCCC
Q 045821          242 ADANMASGWRGPAVVVDDALYVLDQSSGTKLMMWQKESREWSPVGRLS  289 (352)
Q Consensus       242 ~~~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~  289 (352)
                      +........-+.+......+|+..|..+ .+..|+-..-.|+++-.-|
T Consensus       363 ~v~gh~delwgla~hps~~q~~T~gqdk-~v~lW~~~k~~wt~~~~d~  409 (626)
T KOG2106|consen  363 TVQGHGDELWGLATHPSKNQLLTCGQDK-HVRLWNDHKLEWTKIIEDP  409 (626)
T ss_pred             EEEecccceeeEEcCCChhheeeccCcc-eEEEccCCceeEEEEecCc
Confidence            1110111111222334566777777766 7888886677899886444


No 165
>KOG0289 consensus mRNA splicing factor [General function prediction only]
Probab=84.40  E-value=34  Score=31.53  Aligned_cols=119  Identities=9%  Similarity=0.116  Sum_probs=65.5

Q ss_pred             eeeeEEE-CCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccce-eEEEE-CCEEEEEccCCCCCCceEEEEEeCCC
Q 045821          160 YFPCGVL-NQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIE-DSFVM-DGKIYIRCSASAATSHVCALVYEPST  236 (352)
Q Consensus       160 ~~~~~~~-~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~-~~~~~-~g~iyv~GG~~~~~~~~~i~~yd~~~  236 (352)
                      ..+++.. ||.||..|-.+  ..+.+||..+..  .+...|....-- ...+. ||+..+. +.+.    ..+..+|+..
T Consensus       350 ~ts~~fHpDgLifgtgt~d--~~vkiwdlks~~--~~a~Fpght~~vk~i~FsENGY~Lat-~add----~~V~lwDLRK  420 (506)
T KOG0289|consen  350 YTSAAFHPDGLIFGTGTPD--GVVKIWDLKSQT--NVAKFPGHTGPVKAISFSENGYWLAT-AADD----GSVKLWDLRK  420 (506)
T ss_pred             eEEeeEcCCceEEeccCCC--ceEEEEEcCCcc--ccccCCCCCCceeEEEeccCceEEEE-EecC----CeEEEEEehh
Confidence            4444444 78888877655  678889998776  444444422211 22333 4554444 3332    1377788755


Q ss_pred             Cc-eeecCCCCCCCCCCcEEEE--CCEEEEEecCCCCeEEEEeCCCCceEEccCCCC
Q 045821          237 DS-WLHADANMASGWRGPAVVV--DDALYVLDQSSGTKLMMWQKESREWSPVGRLST  290 (352)
Q Consensus       237 ~~-W~~~~~~~~~~~~~~~~~~--~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~  290 (352)
                      .+ ...+.  .+......++.+  .|..+.++|..= .|+.|+-.+.+|+++..++.
T Consensus       421 l~n~kt~~--l~~~~~v~s~~fD~SGt~L~~~g~~l-~Vy~~~k~~k~W~~~~~~~~  474 (506)
T KOG0289|consen  421 LKNFKTIQ--LDEKKEVNSLSFDQSGTYLGIAGSDL-QVYICKKKTKSWTEIKELAD  474 (506)
T ss_pred             hcccceee--ccccccceeEEEcCCCCeEEeeccee-EEEEEecccccceeeehhhh
Confidence            43 22221  111111122333  456666665432 56777888999999987765


No 166
>KOG0278 consensus Serine/threonine kinase receptor-associated protein [Lipid transport and metabolism]
Probab=84.24  E-value=25  Score=29.94  Aligned_cols=125  Identities=15%  Similarity=0.241  Sum_probs=64.7

Q ss_pred             CCCeEEEEECCCCeeEecCCCCCcceeeeeEEECCEEEE-EeCCCCCCceEEEeCCCCcEEEec--CCCCcccceeEEEE
Q 045821          134 ATSEVYCYDASMNTWTDAAPMSTARCYFPCGVLNQKIYC-IGGLGDTHSWDVYDPRTNNWKLHT--EPNIFTEIEDSFVM  210 (352)
Q Consensus       134 ~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~~~~iyv-~gG~~~~~~v~~yd~~t~~W~~~~--~~~~~~~~~~~~~~  210 (352)
                      ...++-.+|..|++-..--.++.+-...-.. .+|+|.. .-|    ++|-.+|+.+-.  .+.  .||-... .++..-
T Consensus       163 dd~tVRLWD~rTgt~v~sL~~~s~VtSlEvs-~dG~ilTia~g----ssV~Fwdaksf~--~lKs~k~P~nV~-SASL~P  234 (334)
T KOG0278|consen  163 DDKTVRLWDHRTGTEVQSLEFNSPVTSLEVS-QDGRILTIAYG----SSVKFWDAKSFG--LLKSYKMPCNVE-SASLHP  234 (334)
T ss_pred             cCCceEEEEeccCcEEEEEecCCCCcceeec-cCCCEEEEecC----ceeEEecccccc--ceeeccCccccc-cccccC
Confidence            3456777888877643222233332222222 2444443 334    567777776532  332  2222111 112223


Q ss_pred             CCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcE----EEECCEEEEEecCCCCeEEEE
Q 045821          211 DGKIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPA----VVVDDALYVLDQSSGTKLMMW  275 (352)
Q Consensus       211 ~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~----~~~~~~l~~~gg~~~~~v~~y  275 (352)
                      +..+||.||..     ..++.||..++.=...   .....+++.    ..-+|.+|..|..++ .+..|
T Consensus       235 ~k~~fVaGged-----~~~~kfDy~TgeEi~~---~nkgh~gpVhcVrFSPdGE~yAsGSEDG-TirlW  294 (334)
T KOG0278|consen  235 KKEFFVAGGED-----FKVYKFDYNTGEEIGS---YNKGHFGPVHCVRFSPDGELYASGSEDG-TIRLW  294 (334)
T ss_pred             CCceEEecCcc-----eEEEEEeccCCceeee---cccCCCCceEEEEECCCCceeeccCCCc-eEEEE
Confidence            56899998665     3678888877642221   112222222    345999999998877 55554


No 167
>KOG1332 consensus Vesicle coat complex COPII, subunit SEC13 [Intracellular trafficking, secretion, and vesicular transport]
Probab=84.23  E-value=25  Score=29.87  Aligned_cols=74  Identities=9%  Similarity=0.109  Sum_probs=43.5

Q ss_pred             CcceeeeeEEECCEEEEEeCCCCCCceEEEeCCCCcEEEecC-CCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeC
Q 045821          156 TARCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTE-PNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEP  234 (352)
Q Consensus       156 ~~r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~-~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~  234 (352)
                      .++...+++..++++++.          .-+.+.++|+...- .-+....+.+-.+-|.+..++|.+     +.+..+-.
T Consensus       221 l~~s~iAS~SqDg~viIw----------t~~~e~e~wk~tll~~f~~~~w~vSWS~sGn~LaVs~Gd-----Nkvtlwke  285 (299)
T KOG1332|consen  221 LPKSTIASCSQDGTVIIW----------TKDEEYEPWKKTLLEEFPDVVWRVSWSLSGNILAVSGGD-----NKVTLWKE  285 (299)
T ss_pred             CCceeeEEecCCCcEEEE----------EecCccCcccccccccCCcceEEEEEeccccEEEEecCC-----cEEEEEEe
Confidence            466777777777776663          34456678875431 112233445556667766665544     24556655


Q ss_pred             C-CCceeecCC
Q 045821          235 S-TDSWLHADA  244 (352)
Q Consensus       235 ~-~~~W~~~~~  244 (352)
                      + .++|.+++.
T Consensus       286 ~~~Gkw~~v~~  296 (299)
T KOG1332|consen  286 NVDGKWEEVGE  296 (299)
T ss_pred             CCCCcEEEccc
Confidence            4 559998854


No 168
>PF07433 DUF1513:  Protein of unknown function (DUF1513);  InterPro: IPR008311 There are currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=83.43  E-value=31  Score=30.45  Aligned_cols=237  Identities=12%  Similarity=0.060  Sum_probs=120.3

Q ss_pred             CCEEEEEEecCCCCceeEEEecCCCCCCCeeEeccCCCcccCCcc--eEEEEECC-EEEEEeCCCCCCCCCCeEEEEECC
Q 045821           68 ETWIYAFCRDNKLERVCCYVLDPNSTRRSWKMIHELPARTLRRKG--MGFEVLGK-NAYLLGGCGWSEDATSEVYCYDAS  144 (352)
Q Consensus        68 ~~~l~~~gg~~~~~~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~--~~~~~~~~-~iyv~GG~~~~~~~~~~~~~~d~~  144 (352)
                      ...+.+|+..   ......+||+  .+..-...-..+   ..|+.  |++..-++ .+|..-+..  .....-+-+||..
T Consensus        16 ~~~avafaRR---PG~~~~v~D~--~~g~~~~~~~a~---~gRHFyGHg~fs~dG~~LytTEnd~--~~g~G~IgVyd~~   85 (305)
T PF07433_consen   16 RPEAVAFARR---PGTFALVFDC--RTGQLLQRLWAP---PGRHFYGHGVFSPDGRLLYTTENDY--ETGRGVIGVYDAA   85 (305)
T ss_pred             CCeEEEEEeC---CCcEEEEEEc--CCCceeeEEcCC---CCCEEecCEEEcCCCCEEEEecccc--CCCcEEEEEEECc
Confidence            4566667554   3345667798  777654333333   34544  44444444 566654422  2345667889998


Q ss_pred             CCeeEecCCCCCc-ceeeeeEEE-CC-EEEEE-eCCCCC--------------CceEEEeCCCCcEEEecCCCCc----c
Q 045821          145 MNTWTDAAPMSTA-RCYFPCGVL-NQ-KIYCI-GGLGDT--------------HSWDVYDPRTNNWKLHTEPNIF----T  202 (352)
Q Consensus       145 t~~W~~~~~~~~~-r~~~~~~~~-~~-~iyv~-gG~~~~--------------~~v~~yd~~t~~W~~~~~~~~~----~  202 (352)
                       +...++...+.. .-.|-+... ++ .+.|. ||...+              .++-..|..+.+=..--.+++.    .
T Consensus        86 -~~~~ri~E~~s~GIGPHel~l~pDG~tLvVANGGI~Thpd~GR~kLNl~tM~psL~~ld~~sG~ll~q~~Lp~~~~~lS  164 (305)
T PF07433_consen   86 -RGYRRIGEFPSHGIGPHELLLMPDGETLVVANGGIETHPDSGRAKLNLDTMQPSLVYLDARSGALLEQVELPPDLHQLS  164 (305)
T ss_pred             -CCcEEEeEecCCCcChhhEEEcCCCCEEEEEcCCCccCcccCceecChhhcCCceEEEecCCCceeeeeecCccccccc
Confidence             667777766643 334555554 44 34443 553211              5666677887753222122221    1


Q ss_pred             cceeEEEECCEEEEEccCCCCCCce--EEEEEeCCCCceeecCCCC--CCC--CCCcEEEE--CCEEEEEecCCCCeEEE
Q 045821          203 EIEDSFVMDGKIYIRCSASAATSHV--CALVYEPSTDSWLHADANM--ASG--WRGPAVVV--DDALYVLDQSSGTKLMM  274 (352)
Q Consensus       203 ~~~~~~~~~g~iyv~GG~~~~~~~~--~i~~yd~~~~~W~~~~~~~--~~~--~~~~~~~~--~~~l~~~gg~~~~~v~~  274 (352)
                      -.|.++.-+|.+.+-.-..+.....  -+..++.... -+.+..+.  ...  -+..++.+  ++.++.+..=.++.+.+
T Consensus       165 iRHLa~~~~G~V~~a~Q~qg~~~~~~PLva~~~~g~~-~~~~~~p~~~~~~l~~Y~gSIa~~~~g~~ia~tsPrGg~~~~  243 (305)
T PF07433_consen  165 IRHLAVDGDGTVAFAMQYQGDPGDAPPLVALHRRGGA-LRLLPAPEEQWRRLNGYIGSIAADRDGRLIAVTSPRGGRVAV  243 (305)
T ss_pred             eeeEEecCCCcEEEEEecCCCCCccCCeEEEEcCCCc-ceeccCChHHHHhhCCceEEEEEeCCCCEEEEECCCCCEEEE
Confidence            2233333455555432222221111  3333333221 22222111  111  12223333  44455555444558999


Q ss_pred             EeCCCCceEEccCCCCCcCCCCceEEEeCCEEEEEcCCceEEEEEcC
Q 045821          275 WQKESREWSPVGRLSTLLTRPPCKLVAIGKTIFVIGKGCSAVVIDVG  321 (352)
Q Consensus       275 yd~~~~~W~~~~~~~~~~~~~~~~~~~~~~~i~v~gG~~~~~~~d~~  321 (352)
                      ||..++.|.....++.     .|+++..++...+-.|.-.+...+..
T Consensus       244 ~d~~tg~~~~~~~l~D-----~cGva~~~~~f~~ssG~G~~~~~~~~  285 (305)
T PF07433_consen  244 WDAATGRLLGSVPLPD-----ACGVAPTDDGFLVSSGQGQLIRLSPD  285 (305)
T ss_pred             EECCCCCEeeccccCc-----eeeeeecCCceEEeCCCccEEEccCc
Confidence            9999999988877765     36777777775555565444444433


No 169
>KOG0274 consensus Cdc4 and related F-box and WD-40 proteins [General function prediction only]
Probab=83.23  E-value=46  Score=32.22  Aligned_cols=177  Identities=16%  Similarity=0.159  Sum_probs=94.2

Q ss_pred             CCCeEEEEECCCCeeEecCCCCCccee-eeeEEECCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECC
Q 045821          134 ATSEVYCYDASMNTWTDAAPMSTARCY-FPCGVLNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDG  212 (352)
Q Consensus       134 ~~~~~~~~d~~t~~W~~~~~~~~~r~~-~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g  212 (352)
                      ..+.+.++|..+++--.+-.   +... -..+.+++.+.|.|.++  ..|-+||+.+.+.-..  +.......-+..+++
T Consensus       309 ~D~tVkVW~v~n~~~l~l~~---~h~~~V~~v~~~~~~lvsgs~d--~~v~VW~~~~~~cl~s--l~gH~~~V~sl~~~~  381 (537)
T KOG0274|consen  309 RDNTVKVWDVTNGACLNLLR---GHTGPVNCVQLDEPLLVSGSYD--GTVKVWDPRTGKCLKS--LSGHTGRVYSLIVDS  381 (537)
T ss_pred             CCceEEEEeccCcceEEEec---cccccEEEEEecCCEEEEEecC--ceEEEEEhhhceeeee--ecCCcceEEEEEecC
Confidence            34678888888665333221   1111 12333466777777766  5788999997665332  222111112345666


Q ss_pred             -EEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEEECCEEEEEecCCCCeEEEEeCCCCceEEccCCCCC
Q 045821          213 -KIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVVVDDALYVLDQSSGTKLMMWQKESREWSPVGRLSTL  291 (352)
Q Consensus       213 -~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~~  291 (352)
                       ..++-|+.+     ..|.++|+.+.. ..+.....+...-....+.+++++-+..++ .+..||.+++.-.++-.-+. 
T Consensus       382 ~~~~~Sgs~D-----~~IkvWdl~~~~-~c~~tl~~h~~~v~~l~~~~~~Lvs~~aD~-~Ik~WD~~~~~~~~~~~~~~-  453 (537)
T KOG0274|consen  382 ENRLLSGSLD-----TTIKVWDLRTKR-KCIHTLQGHTSLVSSLLLRDNFLVSSSADG-TIKLWDAEEGECLRTLEGRH-  453 (537)
T ss_pred             cceEEeeeec-----cceEeecCCchh-hhhhhhcCCcccccccccccceeEeccccc-cEEEeecccCceeeeeccCC-
Confidence             666666555     246777776664 211111111111122345666676666665 89999998887765543321 


Q ss_pred             cCCCCceEEEeCCEEEEEcCCc-eEEEEEcCCccccc
Q 045821          292 LTRPPCKLVAIGKTIFVIGKGC-SAVVIDVGNIGNIG  327 (352)
Q Consensus       292 ~~~~~~~~~~~~~~i~v~gG~~-~~~~~d~~~~~w~~  327 (352)
                        ...-.++..+...++..+.+ ...++|.....-.+
T Consensus       454 --~~~v~~l~~~~~~il~s~~~~~~~l~dl~~~~~~~  488 (537)
T KOG0274|consen  454 --VGGVSALALGKEEILCSSDDGSVKLWDLRSGTLIR  488 (537)
T ss_pred             --cccEEEeecCcceEEEEecCCeeEEEecccCchhh
Confidence              12222333333455555544 45578888776554


No 170
>PRK01742 tolB translocation protein TolB; Provisional
Probab=82.72  E-value=42  Score=31.39  Aligned_cols=139  Identities=12%  Similarity=0.029  Sum_probs=71.3

Q ss_pred             CCeEEEEECCCCeeEecCCCCCcceeeeeEE-ECCE-EEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECC
Q 045821          135 TSEVYCYDASMNTWTDAAPMSTARCYFPCGV-LNQK-IYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDG  212 (352)
Q Consensus       135 ~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~-~~~~-iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g  212 (352)
                      ...++++|..+++-+.+...+.. . ...+. -+++ |++....+...+++.+|..++..+++..... ........-+|
T Consensus       227 ~~~i~i~dl~tg~~~~l~~~~g~-~-~~~~wSPDG~~La~~~~~~g~~~Iy~~d~~~~~~~~lt~~~~-~~~~~~wSpDG  303 (429)
T PRK01742        227 KSQLVVHDLRSGARKVVASFRGH-N-GAPAFSPDGSRLAFASSKDGVLNIYVMGANGGTPSQLTSGAG-NNTEPSWSPDG  303 (429)
T ss_pred             CcEEEEEeCCCCceEEEecCCCc-c-CceeECCCCCEEEEEEecCCcEEEEEEECCCCCeEeeccCCC-CcCCEEECCCC
Confidence            35789999988766555544321 1 11121 2454 4443333322567888988887776653211 11122333456


Q ss_pred             E-EEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEEECC-EEEEEecCCCCeEEEEeCCCCceEEcc
Q 045821          213 K-IYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVVVDD-ALYVLDQSSGTKLMMWQKESREWSPVG  286 (352)
Q Consensus       213 ~-iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~-~l~~~gg~~~~~v~~yd~~~~~W~~~~  286 (352)
                      + |+.. ...  .....++.++..+..-+.+..  . . ......-++ .|++.++   ..++.+|..++.+..+.
T Consensus       304 ~~i~f~-s~~--~g~~~I~~~~~~~~~~~~l~~--~-~-~~~~~SpDG~~ia~~~~---~~i~~~Dl~~g~~~~lt  369 (429)
T PRK01742        304 QSILFT-SDR--SGSPQVYRMSASGGGASLVGG--R-G-YSAQISADGKTLVMING---DNVVKQDLTSGSTEVLS  369 (429)
T ss_pred             CEEEEE-ECC--CCCceEEEEECCCCCeEEecC--C-C-CCccCCCCCCEEEEEcC---CCEEEEECCCCCeEEec
Confidence            5 4443 221  112367777776554333321  1 1 122233344 4555544   26888999998887664


No 171
>PF06433 Me-amine-dh_H:  Methylamine dehydrogenase heavy chain (MADH);  InterPro: IPR009451 Methylamine dehydrogenase (1.4.99.3 from EC) is a periplasmic quinoprotein found in several methyltrophic bacteria []. It is induced when grown on methylamine as a carbon source MADH and catalyses the oxidative deamination of amines to their corresponding aldehydes. The redox cofactor of this enzyme is tryptophan tryptophylquinone (TTQ). Electrons derived from the oxidation of methylamine are passed to an electron acceptor, which is usually the blue-copper protein amicyanin (IPR002386 from INTERPRO).  RCH2NH2 + H2O + acceptor = RCHO + NH3 + reduced acceptor  MADH is a hetero-tetramer, comprised of two heavy subunits and two light subunits. The heavy subunit forms a seven-bladed beta-propeller like structure [].; GO: 0030058 amine dehydrogenase activity, 0030416 methylamine metabolic process, 0055114 oxidation-reduction process, 0042597 periplasmic space; PDB: 3RN1_F 3SVW_F 3PXT_F 3L4O_F 3L4M_D 3SJL_F 3PXS_D 3ORV_F 3RMZ_F 3RLM_F ....
Probab=82.68  E-value=36  Score=30.58  Aligned_cols=69  Identities=20%  Similarity=0.327  Sum_probs=37.0

Q ss_pred             CCEEEEEcc---CCCC-CCceEEEEEeCCCCcee-ecCCCCCCCCCCcEEEECCE--EEEEecCCCCeEEEEeCCCCce
Q 045821          211 DGKIYIRCS---ASAA-TSHVCALVYEPSTDSWL-HADANMASGWRGPAVVVDDA--LYVLDQSSGTKLMMWQKESREW  282 (352)
Q Consensus       211 ~g~iyv~GG---~~~~-~~~~~i~~yd~~~~~W~-~~~~~~~~~~~~~~~~~~~~--l~~~gg~~~~~v~~yd~~~~~W  282 (352)
                      .++|||+-=   ...+ .....||+||+.+.+=. +++-  ....-+..+..+.+  ||.+.+..+ .+.+||..+++-
T Consensus       249 ~~rlyvLMh~g~~gsHKdpgteVWv~D~~t~krv~Ri~l--~~~~~Si~Vsqd~~P~L~~~~~~~~-~l~v~D~~tGk~  324 (342)
T PF06433_consen  249 SGRLYVLMHQGGEGSHKDPGTEVWVYDLKTHKRVARIPL--EHPIDSIAVSQDDKPLLYALSAGDG-TLDVYDAATGKL  324 (342)
T ss_dssp             TTEEEEEEEE--TT-TTS-EEEEEEEETTTTEEEEEEEE--EEEESEEEEESSSS-EEEEEETTTT-EEEEEETTT--E
T ss_pred             cCeEEEEecCCCCCCccCCceEEEEEECCCCeEEEEEeC--CCccceEEEccCCCcEEEEEcCCCC-eEEEEeCcCCcE
Confidence            678998741   1111 12238999999988643 2221  11111233444443  555554333 899999998754


No 172
>KOG0291 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=82.37  E-value=55  Score=32.50  Aligned_cols=157  Identities=8%  Similarity=0.083  Sum_probs=87.1

Q ss_pred             EEEEC--CEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcceeeeeEEECCEEEEEeCCCCCCceEEEeCCCCcE
Q 045821          115 FEVLG--KNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTNNW  192 (352)
Q Consensus       115 ~~~~~--~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~W  192 (352)
                      .+.++  |.-..+|+.     .+..+.+|+-.++++..-.+-..++.....-.-||.+.+.|+.+  ..|-+||..+.-.
T Consensus       312 t~~~N~tGDWiA~g~~-----klgQLlVweWqsEsYVlKQQgH~~~i~~l~YSpDgq~iaTG~eD--gKVKvWn~~SgfC  384 (893)
T KOG0291|consen  312 TVSFNSTGDWIAFGCS-----KLGQLLVWEWQSESYVLKQQGHSDRITSLAYSPDGQLIATGAED--GKVKVWNTQSGFC  384 (893)
T ss_pred             EEEecccCCEEEEcCC-----ccceEEEEEeeccceeeeccccccceeeEEECCCCcEEEeccCC--CcEEEEeccCceE
Confidence            34444  555556652     24577888777776654444333333222222378888888866  7888999888754


Q ss_pred             EEecCCCCcccceeE--EEECCEEEEEccCCCCCCceEEEEEeCCCC-ceeecCCCCCCCCCCcEEEECCEEEEEecCCC
Q 045821          193 KLHTEPNIFTEIEDS--FVMDGKIYIRCSASAATSHVCALVYEPSTD-SWLHADANMASGWRGPAVVVDDALYVLDQSSG  269 (352)
Q Consensus       193 ~~~~~~~~~~~~~~~--~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~-~W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~  269 (352)
                      ...  .........+  +...|+..+.-..++     .+-.+|...- ..+....|-+..+...++-..|.|.+.|+.+.
T Consensus       385 ~vT--FteHts~Vt~v~f~~~g~~llssSLDG-----tVRAwDlkRYrNfRTft~P~p~QfscvavD~sGelV~AG~~d~  457 (893)
T KOG0291|consen  385 FVT--FTEHTSGVTAVQFTARGNVLLSSSLDG-----TVRAWDLKRYRNFRTFTSPEPIQFSCVAVDPSGELVCAGAQDS  457 (893)
T ss_pred             EEE--eccCCCceEEEEEEecCCEEEEeecCC-----eEEeeeecccceeeeecCCCceeeeEEEEcCCCCEEEeeccce
Confidence            322  1222222222  223455444423332     3555665433 24444433333333223333499999999887


Q ss_pred             CeEEEEeCCCCceEEc
Q 045821          270 TKLMMWQKESREWSPV  285 (352)
Q Consensus       270 ~~v~~yd~~~~~W~~~  285 (352)
                      -.|++|+.++++--.+
T Consensus       458 F~IfvWS~qTGqllDi  473 (893)
T KOG0291|consen  458 FEIFVWSVQTGQLLDI  473 (893)
T ss_pred             EEEEEEEeecCeeeeh
Confidence            7899999998886544


No 173
>PF15525 DUF4652:  Domain of unknown function (DUF4652)
Probab=82.18  E-value=25  Score=28.46  Aligned_cols=86  Identities=15%  Similarity=0.078  Sum_probs=48.5

Q ss_pred             CCCCCCCCCCeEEEEECCCCeeEecCCCCCc-ceeeeeEE-E-CCEEEEEeCCCCC-----CceEEEeCCCCcEEEecCC
Q 045821          127 GCGWSEDATSEVYCYDASMNTWTDAAPMSTA-RCYFPCGV-L-NQKIYCIGGLGDT-----HSWDVYDPRTNNWKLHTEP  198 (352)
Q Consensus       127 G~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~-r~~~~~~~-~-~~~iyv~gG~~~~-----~~v~~yd~~t~~W~~~~~~  198 (352)
                      |.++.......+|++|..++.|..+..-+.. .+.+--+. + +..|.|+-|....     ..+++|++.|+.=..+-+.
T Consensus        79 g~~a~eEgiGkIYIkn~~~~~~~~L~i~~~~~k~sPK~i~WiDD~~L~vIIG~a~GTvS~GGnLy~~nl~tg~~~~ly~~  158 (200)
T PF15525_consen   79 GPEAEEEGIGKIYIKNLNNNNWWSLQIDQNEEKYSPKYIEWIDDNNLAVIIGYAHGTVSKGGNLYKYNLNTGNLTELYEW  158 (200)
T ss_pred             CCccccccceeEEEEecCCCceEEEEecCcccccCCceeEEecCCcEEEEEccccceEccCCeEEEEEccCCceeEeeec
Confidence            4444456688999999999987655322211 22222222 2 3444444443221     8899999999987777654


Q ss_pred             CCcccceeEEEECC
Q 045821          199 NIFTEIEDSFVMDG  212 (352)
Q Consensus       199 ~~~~~~~~~~~~~g  212 (352)
                      ..-......+...+
T Consensus       159 ~dkkqQVis~e~~g  172 (200)
T PF15525_consen  159 KDKKQQVISAEKNG  172 (200)
T ss_pred             cccceeEEEEEEeC
Confidence            33333333333333


No 174
>PRK10115 protease 2; Provisional
Probab=82.17  E-value=59  Score=32.67  Aligned_cols=149  Identities=11%  Similarity=0.012  Sum_probs=77.4

Q ss_pred             CCeEEEEECCCCe--eEecCCCCCcceeeeeEEE-CCEEEEEeCCCCC-CceEEEeC--CCCcEEEecCCCCcccceeEE
Q 045821          135 TSEVYCYDASMNT--WTDAAPMSTARCYFPCGVL-NQKIYCIGGLGDT-HSWDVYDP--RTNNWKLHTEPNIFTEIEDSF  208 (352)
Q Consensus       135 ~~~~~~~d~~t~~--W~~~~~~~~~r~~~~~~~~-~~~iyv~gG~~~~-~~v~~yd~--~t~~W~~~~~~~~~~~~~~~~  208 (352)
                      ..++++++..|..  =..+-.-+........... +++..++...... +.++.|+.  .+..|..+.+.+..... ...
T Consensus       198 ~~~v~~h~lgt~~~~d~lv~~e~~~~~~~~~~~s~d~~~l~i~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~-~~~  276 (686)
T PRK10115        198 PYQVWRHTIGTPASQDELVYEEKDDTFYVSLHKTTSKHYVVIHLASATTSEVLLLDAELADAEPFVFLPRRKDHEY-SLD  276 (686)
T ss_pred             CCEEEEEECCCChhHCeEEEeeCCCCEEEEEEEcCCCCEEEEEEECCccccEEEEECcCCCCCceEEEECCCCCEE-EEE
Confidence            4789999999883  2223221222222222223 4443334443333 77888883  23444433322222222 233


Q ss_pred             EECCEEEEEccCCCCCCceEEEEEeCC-CCceeecCCCCCCCCCCcEEEECCEEEEEecCCC-CeEEEEeCCCCceEEcc
Q 045821          209 VMDGKIYIRCSASAATSHVCALVYEPS-TDSWLHADANMASGWRGPAVVVDDALYVLDQSSG-TKLMMWQKESREWSPVG  286 (352)
Q Consensus       209 ~~~g~iyv~GG~~~~~~~~~i~~yd~~-~~~W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~-~~v~~yd~~~~~W~~~~  286 (352)
                      ..++.+|+.--..  .....+...+.. ..+|+.+-+......-......++.|++..-..+ ..++++|..++....+.
T Consensus       277 ~~~~~ly~~tn~~--~~~~~l~~~~~~~~~~~~~l~~~~~~~~i~~~~~~~~~l~~~~~~~g~~~l~~~~~~~~~~~~l~  354 (686)
T PRK10115        277 HYQHRFYLRSNRH--GKNFGLYRTRVRDEQQWEELIPPRENIMLEGFTLFTDWLVVEERQRGLTSLRQINRKTREVIGIA  354 (686)
T ss_pred             eCCCEEEEEEcCC--CCCceEEEecCCCcccCeEEECCCCCCEEEEEEEECCEEEEEEEeCCEEEEEEEcCCCCceEEec
Confidence            4567888874222  223356777776 5789887543221111222345777777765444 46888887766655554


No 175
>PLN00181 protein SPA1-RELATED; Provisional
Probab=81.95  E-value=64  Score=32.99  Aligned_cols=143  Identities=10%  Similarity=0.051  Sum_probs=69.0

Q ss_pred             CCEEEEEeCCCCCCCCCCeEEEEECCCCee-EecCCCCCcceeeeeEE--ECCEEEEEeCCCCCCceEEEeCCCCc--EE
Q 045821          119 GKNAYLLGGCGWSEDATSEVYCYDASMNTW-TDAAPMSTARCYFPCGV--LNQKIYCIGGLGDTHSWDVYDPRTNN--WK  193 (352)
Q Consensus       119 ~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W-~~~~~~~~~r~~~~~~~--~~~~iyv~gG~~~~~~v~~yd~~t~~--W~  193 (352)
                      ++.+++.|+.+      ..+.+||..+.+- ..+.. ...  -..+..  -++..++.|+.+  ..+.+||..+..  ..
T Consensus       587 ~~~~L~Sgs~D------g~v~iWd~~~~~~~~~~~~-~~~--v~~v~~~~~~g~~latgs~d--g~I~iwD~~~~~~~~~  655 (793)
T PLN00181        587 DPTLLASGSDD------GSVKLWSINQGVSIGTIKT-KAN--ICCVQFPSESGRSLAFGSAD--HKVYYYDLRNPKLPLC  655 (793)
T ss_pred             CCCEEEEEcCC------CEEEEEECCCCcEEEEEec-CCC--eEEEEEeCCCCCEEEEEeCC--CeEEEEECCCCCccce
Confidence            45666667643      4578888876532 12211 111  111111  146777777755  688999987643  11


Q ss_pred             EecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeCCCC----ceeecCCCCCCCC--CCcEEEECCEEEEEecC
Q 045821          194 LHTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTD----SWLHADANMASGW--RGPAVVVDDALYVLDQS  267 (352)
Q Consensus       194 ~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~----~W~~~~~~~~~~~--~~~~~~~~~~l~~~gg~  267 (352)
                      .+..  ....-..+...++..++.|+.++     .+..||....    .|..+.....+..  ....+..++.+++.|+.
T Consensus       656 ~~~~--h~~~V~~v~f~~~~~lvs~s~D~-----~ikiWd~~~~~~~~~~~~l~~~~gh~~~i~~v~~s~~~~~lasgs~  728 (793)
T PLN00181        656 TMIG--HSKTVSYVRFVDSSTLVSSSTDN-----TLKLWDLSMSISGINETPLHSFMGHTNVKNFVGLSVSDGYIATGSE  728 (793)
T ss_pred             EecC--CCCCEEEEEEeCCCEEEEEECCC-----EEEEEeCCCCccccCCcceEEEcCCCCCeeEEEEcCCCCEEEEEeC
Confidence            2110  11111123334666666655442     4666776432    2322211111110  11122345666777776


Q ss_pred             CCCeEEEEeCCCC
Q 045821          268 SGTKLMMWQKESR  280 (352)
Q Consensus       268 ~~~~v~~yd~~~~  280 (352)
                      ++ .+.+|+....
T Consensus       729 D~-~v~iw~~~~~  740 (793)
T PLN00181        729 TN-EVFVYHKAFP  740 (793)
T ss_pred             CC-EEEEEECCCC
Confidence            65 8888887643


No 176
>KOG3881 consensus Uncharacterized conserved protein [Function unknown]
Probab=81.53  E-value=38  Score=30.71  Aligned_cols=151  Identities=13%  Similarity=0.132  Sum_probs=79.8

Q ss_pred             CEEEEEeCCCCCCceEEEeCCCC--cEEEecCCCC------cccceeEEEECC---EEEEEccCCCCCCceEEEEEeCCC
Q 045821          168 QKIYCIGGLGDTHSWDVYDPRTN--NWKLHTEPNI------FTEIEDSFVMDG---KIYIRCSASAATSHVCALVYEPST  236 (352)
Q Consensus       168 ~~iyv~gG~~~~~~v~~yd~~t~--~W~~~~~~~~------~~~~~~~~~~~g---~iyv~GG~~~~~~~~~i~~yd~~~  236 (352)
                      -.|+.+||....+.+.+||.++.  .|+.-..++.      +.....+.++.+   +-++-+-.     .-.+-.||+..
T Consensus       161 p~Iva~GGke~~n~lkiwdle~~~qiw~aKNvpnD~L~LrVPvW~tdi~Fl~g~~~~~fat~T~-----~hqvR~YDt~~  235 (412)
T KOG3881|consen  161 PYIVATGGKENINELKIWDLEQSKQIWSAKNVPNDRLGLRVPVWITDIRFLEGSPNYKFATITR-----YHQVRLYDTRH  235 (412)
T ss_pred             CceEecCchhcccceeeeecccceeeeeccCCCCccccceeeeeeccceecCCCCCceEEEEec-----ceeEEEecCcc
Confidence            36777788654466778887754  5776543222      222334445544   33333211     12577899875


Q ss_pred             CceeecCC-CCCCCCCCcEEEECCEEEEEecCCCCeEEEEeCCCCceEEccCCCCCcCCCCceEEEeCC-EEEEEcCCce
Q 045821          237 DSWLHADA-NMASGWRGPAVVVDDALYVLDQSSGTKLMMWQKESREWSPVGRLSTLLTRPPCKLVAIGK-TIFVIGKGCS  314 (352)
Q Consensus       237 ~~W~~~~~-~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~~-~i~v~gG~~~  314 (352)
                      +.=-...- .......+.+...++...++|...+ .+..||..++.---.. + .........+..+.+ .+...+|-+.
T Consensus       236 qRRPV~~fd~~E~~is~~~l~p~gn~Iy~gn~~g-~l~~FD~r~~kl~g~~-~-kg~tGsirsih~hp~~~~las~GLDR  312 (412)
T KOG3881|consen  236 QRRPVAQFDFLENPISSTGLTPSGNFIYTGNTKG-QLAKFDLRGGKLLGCG-L-KGITGSIRSIHCHPTHPVLASCGLDR  312 (412)
T ss_pred             cCcceeEeccccCcceeeeecCCCcEEEEecccc-hhheecccCceeeccc-c-CCccCCcceEEEcCCCceEEeeccce
Confidence            43211111 1122222333344566666666655 8999998876542221 0 111122234555555 5667777776


Q ss_pred             EE-EEEcCCcccc
Q 045821          315 AV-VIDVGNIGNI  326 (352)
Q Consensus       315 ~~-~~d~~~~~w~  326 (352)
                      .+ |||++++.-.
T Consensus       313 yvRIhD~ktrkll  325 (412)
T KOG3881|consen  313 YVRIHDIKTRKLL  325 (412)
T ss_pred             eEEEeecccchhh
Confidence            55 8999986544


No 177
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=81.30  E-value=40  Score=30.16  Aligned_cols=210  Identities=10%  Similarity=0.062  Sum_probs=105.1

Q ss_pred             CEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcceeeeeEEE--CCEEEEEeCCCCCCceEEEeCCCC--cEEEe
Q 045821          120 KNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCYFPCGVL--NQKIYCIGGLGDTHSWDVYDPRTN--NWKLH  195 (352)
Q Consensus       120 ~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~--~~~iyv~gG~~~~~~v~~yd~~t~--~W~~~  195 (352)
                      ..+|++..-+.  ..--..+.+|+..++-+.+...+.+-..++-+.+  +++..+..-+.. ..+-+|-..++  -|..+
T Consensus        52 ~~LY~v~~~~~--~ggvaay~iD~~~G~Lt~ln~~~~~g~~p~yvsvd~~g~~vf~AnY~~-g~v~v~p~~~dG~l~~~v  128 (346)
T COG2706          52 RHLYVVNEPGE--EGGVAAYRIDPDDGRLTFLNRQTLPGSPPCYVSVDEDGRFVFVANYHS-GSVSVYPLQADGSLQPVV  128 (346)
T ss_pred             CEEEEEEecCC--cCcEEEEEEcCCCCeEEEeeccccCCCCCeEEEECCCCCEEEEEEccC-ceEEEEEcccCCccccce
Confidence            35887754322  2234556778877887777654433333232333  343333333332 66677766554  13222


Q ss_pred             c------CCCCcc---cceeEEE--ECCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCC---CcEEEECCE-
Q 045821          196 T------EPNIFT---EIEDSFV--MDGKIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWR---GPAVVVDDA-  260 (352)
Q Consensus       196 ~------~~~~~~---~~~~~~~--~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~---~~~~~~~~~-  260 (352)
                      .      .-|..+   ...+.+.  -++++.+...    -....+..|+...+.-+......-....   +.++--+++ 
T Consensus       129 ~~~~h~g~~p~~rQ~~~h~H~a~~tP~~~~l~v~D----LG~Dri~~y~~~dg~L~~~~~~~v~~G~GPRHi~FHpn~k~  204 (346)
T COG2706         129 QVVKHTGSGPHERQESPHVHSANFTPDGRYLVVPD----LGTDRIFLYDLDDGKLTPADPAEVKPGAGPRHIVFHPNGKY  204 (346)
T ss_pred             eeeecCCCCCCccccCCccceeeeCCCCCEEEEee----cCCceEEEEEcccCccccccccccCCCCCcceEEEcCCCcE
Confidence            1      111111   1122333  3554444321    2234688888887766554332111112   222333444 


Q ss_pred             EEEEecCCC-CeEEEEeCCCCceEEcc---CCCCCcCCCCc--eEEE--eCCEEEEEc-CCceEEEEEcCCcccccceEE
Q 045821          261 LYVLDQSSG-TKLMMWQKESREWSPVG---RLSTLLTRPPC--KLVA--IGKTIFVIG-KGCSAVVIDVGNIGNIGGIMV  331 (352)
Q Consensus       261 l~~~gg~~~-~~v~~yd~~~~~W~~~~---~~~~~~~~~~~--~~~~--~~~~i~v~g-G~~~~~~~d~~~~~w~~~~~~  331 (352)
                      .|++...++ -.++.||....+.+++.   .+|..+.....  ++..  .+.-+|+-. |.+...+|.+..++=.-....
T Consensus       205 aY~v~EL~stV~v~~y~~~~g~~~~lQ~i~tlP~dF~g~~~~aaIhis~dGrFLYasNRg~dsI~~f~V~~~~g~L~~~~  284 (346)
T COG2706         205 AYLVNELNSTVDVLEYNPAVGKFEELQTIDTLPEDFTGTNWAAAIHISPDGRFLYASNRGHDSIAVFSVDPDGGKLELVG  284 (346)
T ss_pred             EEEEeccCCEEEEEEEcCCCceEEEeeeeccCccccCCCCceeEEEECCCCCEEEEecCCCCeEEEEEEcCCCCEEEEEE
Confidence            688887766 34567777777777775   56665533322  2222  244577776 455677788887764333444


Q ss_pred             eccCC
Q 045821          332 SSSIP  336 (352)
Q Consensus       332 ~~~~~  336 (352)
                      +....
T Consensus       285 ~~~te  289 (346)
T COG2706         285 ITPTE  289 (346)
T ss_pred             EeccC
Confidence            44433


No 178
>TIGR02658 TTQ_MADH_Hv methylamine dehydrogenase heavy chain. This family consists of the heavy chain of methylamine dehydrogenase light chain, a periplasmic enzyme. The enzyme contains a tryptophan tryptophylquinone (TTQ) prothetic group derived from two Trp residues in the light subunity. The enzyme forms a complex with the type I blue copper protein amicyanin and a cytochrome. Electron transfer procedes from TQQ to the copper and then to the heme group of the cytochrome.
Probab=80.28  E-value=46  Score=30.24  Aligned_cols=120  Identities=9%  Similarity=-0.081  Sum_probs=65.4

Q ss_pred             CCEEEEEEecCCCCceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECCEEEEEeCCC---CCCCCCCeEEEEECC
Q 045821           68 ETWIYAFCRDNKLERVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCG---WSEDATSEVYCYDAS  144 (352)
Q Consensus        68 ~~~l~~~gg~~~~~~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~---~~~~~~~~~~~~d~~  144 (352)
                      ..++|+.-.......-..+++|.  .+.+-  +...+.  ..+....+..-+..+|+...+.   ........+.+||+.
T Consensus        12 ~~~v~V~d~~~~~~~~~v~ViD~--~~~~v--~g~i~~--G~~P~~~~spDg~~lyva~~~~~R~~~G~~~d~V~v~D~~   85 (352)
T TIGR02658        12 ARRVYVLDPGHFAATTQVYTIDG--EAGRV--LGMTDG--GFLPNPVVASDGSFFAHASTVYSRIARGKRTDYVEVIDPQ   85 (352)
T ss_pred             CCEEEEECCcccccCceEEEEEC--CCCEE--EEEEEc--cCCCceeECCCCCEEEEEeccccccccCCCCCEEEEEECc
Confidence            56688874331111245677788  66443  333332  1222232333355799887632   222346789999999


Q ss_pred             CCeeE-ecCCCCCccee------eeeEEECC-EEEEEeCCCCCCceEEEeCCCCcEEE
Q 045821          145 MNTWT-DAAPMSTARCY------FPCGVLNQ-KIYCIGGLGDTHSWDVYDPRTNNWKL  194 (352)
Q Consensus       145 t~~W~-~~~~~~~~r~~------~~~~~~~~-~iyv~gG~~~~~~v~~yd~~t~~W~~  194 (352)
                      |.+-. +++..+.||..      .....-+| .+||.--. ..+.|-+.|..+++--.
T Consensus        86 t~~~~~~i~~p~~p~~~~~~~~~~~~ls~dgk~l~V~n~~-p~~~V~VvD~~~~kvv~  142 (352)
T TIGR02658        86 THLPIADIELPEGPRFLVGTYPWMTSLTPDNKTLLFYQFS-PSPAVGVVDLEGKAFVR  142 (352)
T ss_pred             cCcEEeEEccCCCchhhccCccceEEECCCCCEEEEecCC-CCCEEEEEECCCCcEEE
Confidence            98743 44433444421      11222244 57776432 12889999999987543


No 179
>COG3386 Gluconolactonase [Carbohydrate transport and metabolism]
Probab=77.64  E-value=51  Score=29.31  Aligned_cols=183  Identities=12%  Similarity=0.138  Sum_probs=90.2

Q ss_pred             CeEEEEECCCCeeEecCCCCCcceeeeeEEE--CCEEEEEeCCCCCCceEEEeCCCCc-EEEecCC----CCcccceeEE
Q 045821          136 SEVYCYDASMNTWTDAAPMSTARCYFPCGVL--NQKIYCIGGLGDTHSWDVYDPRTNN-WKLHTEP----NIFTEIEDSF  208 (352)
Q Consensus       136 ~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~--~~~iyv~gG~~~~~~v~~yd~~t~~-W~~~~~~----~~~~~~~~~~  208 (352)
                      ..++.||+.+++-+.... |...  .+...+  ++.|.++.     ..+..++++++. |+.+.+.    +..+..-..+
T Consensus        47 ~~i~r~~~~~g~~~~~~~-p~~~--~~~~~~d~~g~Lv~~~-----~g~~~~~~~~~~~~t~~~~~~~~~~~~r~ND~~v  118 (307)
T COG3386          47 GRIHRLDPETGKKRVFPS-PGGF--SSGALIDAGGRLIACE-----HGVRLLDPDTGGKITLLAEPEDGLPLNRPNDGVV  118 (307)
T ss_pred             CeEEEecCCcCceEEEEC-CCCc--ccceeecCCCeEEEEc-----cccEEEeccCCceeEEeccccCCCCcCCCCceeE
Confidence            567888887665443322 2111  222333  34555543     446777776443 3555532    2223333445


Q ss_pred             EECCEEEEEccCC-----CC--CCceEEEEEeCCCCceeecCCCCCCCCCCcEEEECCEEEEEecCCCCeEEEEeCCC--
Q 045821          209 VMDGKIYIRCSAS-----AA--TSHVCALVYEPSTDSWLHADANMASGWRGPAVVVDDALYVLDQSSGTKLMMWQKES--  279 (352)
Q Consensus       209 ~~~g~iyv~GG~~-----~~--~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~--  279 (352)
                      .-+|.+|+- ...     ..  ...-.++++|+ .....++-........+.+..-+++.+++.....+.+++|+...  
T Consensus       119 ~pdG~~wfg-t~~~~~~~~~~~~~~G~lyr~~p-~g~~~~l~~~~~~~~NGla~SpDg~tly~aDT~~~~i~r~~~d~~~  196 (307)
T COG3386         119 DPDGRIWFG-DMGYFDLGKSEERPTGSLYRVDP-DGGVVRLLDDDLTIPNGLAFSPDGKTLYVADTPANRIHRYDLDPAT  196 (307)
T ss_pred             cCCCCEEEe-CCCccccCccccCCcceEEEEcC-CCCEEEeecCcEEecCceEECCCCCEEEEEeCCCCeEEEEecCccc
Confidence            557888774 322     11  11127999998 45565553321222233344556654444444444899997763  


Q ss_pred             ----Cc--eEEccCCCCCcCCCCceEEEeCCEEEEEcC--CceEEEEEcCCcccccceEEe
Q 045821          280 ----RE--WSPVGRLSTLLTRPPCKLVAIGKTIFVIGK--GCSAVVIDVGNIGNIGGIMVS  332 (352)
Q Consensus       280 ----~~--W~~~~~~~~~~~~~~~~~~~~~~~i~v~gG--~~~~~~~d~~~~~w~~~~~~~  332 (352)
                          +.  +..... ..  ...-...+--++.||+...  ...+..|+++ ..-.+...++
T Consensus       197 g~~~~~~~~~~~~~-~~--G~PDG~~vDadG~lw~~a~~~g~~v~~~~pd-G~l~~~i~lP  253 (307)
T COG3386         197 GPIGGRRGFVDFDE-EP--GLPDGMAVDADGNLWVAAVWGGGRVVRFNPD-GKLLGEIKLP  253 (307)
T ss_pred             CccCCcceEEEccC-CC--CCCCceEEeCCCCEEEecccCCceEEEECCC-CcEEEEEECC
Confidence                11  111111 00  1111223334677885432  2367789998 4444555555


No 180
>KOG0282 consensus mRNA splicing factor [Function unknown]
Probab=77.56  E-value=29  Score=32.31  Aligned_cols=62  Identities=6%  Similarity=0.134  Sum_probs=36.2

Q ss_pred             CCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEE--C-CEEEEEccCCCCCCceEEEEEeCCCCc
Q 045821          167 NQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVM--D-GKIYIRCSASAATSHVCALVYEPSTDS  238 (352)
Q Consensus       167 ~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~--~-g~iyv~GG~~~~~~~~~i~~yd~~~~~  238 (352)
                      ++.-|+.+|++  ..+-.+|.+|++-..--  .. ..-..++.+  + .+++++||.++     .|..+|..+++
T Consensus       269 ~g~~fLS~sfD--~~lKlwDtETG~~~~~f--~~-~~~~~cvkf~pd~~n~fl~G~sd~-----ki~~wDiRs~k  333 (503)
T KOG0282|consen  269 CGTSFLSASFD--RFLKLWDTETGQVLSRF--HL-DKVPTCVKFHPDNQNIFLVGGSDK-----KIRQWDIRSGK  333 (503)
T ss_pred             cCCeeeeeecc--eeeeeeccccceEEEEE--ec-CCCceeeecCCCCCcEEEEecCCC-----cEEEEeccchH
Confidence            34556666665  56677899998754321  11 111112222  3 48999987764     57778877665


No 181
>KOG2111 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=75.46  E-value=58  Score=28.79  Aligned_cols=145  Identities=12%  Similarity=0.094  Sum_probs=72.6

Q ss_pred             CCEEEEEeCCCCCCCCCCeEEEEECCCCee-EecCCCCCc------ceeeeeEEECCEEEEEeCCCCCCceEEEeCCCCc
Q 045821          119 GKNAYLLGGCGWSEDATSEVYCYDASMNTW-TDAAPMSTA------RCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTNN  191 (352)
Q Consensus       119 ~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W-~~~~~~~~~------r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~  191 (352)
                      .+.+-++||.+.+..+.+.+.+||-...+- .++. +..+      |..+-+++..++|||            |...++-
T Consensus        58 ~N~laLVGGg~~pky~pNkviIWDD~k~~~i~el~-f~~~I~~V~l~r~riVvvl~~~I~V------------ytF~~n~  124 (346)
T KOG2111|consen   58 SNYLALVGGGSRPKYPPNKVIIWDDLKERCIIELS-FNSEIKAVKLRRDRIVVVLENKIYV------------YTFPDNP  124 (346)
T ss_pred             hceEEEecCCCCCCCCCceEEEEecccCcEEEEEE-eccceeeEEEcCCeEEEEecCeEEE------------EEcCCCh
Confidence            466677787765557789999999444332 2221 1111      233445555666655            4333321


Q ss_pred             --EEEecCCCCcccceeEEEE-CCEEEEEccCCCCCCceEEEEEeCCCCceeecCC-CCCCCCCCc---EEEECCEEEEE
Q 045821          192 --WKLHTEPNIFTEIEDSFVM-DGKIYIRCSASAATSHVCALVYEPSTDSWLHADA-NMASGWRGP---AVVVDDALYVL  264 (352)
Q Consensus       192 --W~~~~~~~~~~~~~~~~~~-~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~-~~~~~~~~~---~~~~~~~l~~~  264 (352)
                        =..+....-+..-+..+.. +..+.++-|...+    .+..-|+...+=.  ++ .........   +...+|.+...
T Consensus       125 k~l~~~et~~NPkGlC~~~~~~~k~~LafPg~k~G----qvQi~dL~~~~~~--~p~~I~AH~s~Iacv~Ln~~Gt~vAT  198 (346)
T KOG2111|consen  125 KLLHVIETRSNPKGLCSLCPTSNKSLLAFPGFKTG----QVQIVDLASTKPN--APSIINAHDSDIACVALNLQGTLVAT  198 (346)
T ss_pred             hheeeeecccCCCceEeecCCCCceEEEcCCCccc----eEEEEEhhhcCcC--CceEEEcccCceeEEEEcCCccEEEE
Confidence              1111111111112222222 4456666555431    3555555433321  11 111111222   23468888888


Q ss_pred             ecCCCCeEEEEeCCCCce
Q 045821          265 DQSSGTKLMMWQKESREW  282 (352)
Q Consensus       265 gg~~~~~v~~yd~~~~~W  282 (352)
                      +...|.-|-+||..+++=
T Consensus       199 aStkGTLIRIFdt~~g~~  216 (346)
T KOG2111|consen  199 ASTKGTLIRIFDTEDGTL  216 (346)
T ss_pred             eccCcEEEEEEEcCCCcE
Confidence            888887788999998763


No 182
>TIGR02658 TTQ_MADH_Hv methylamine dehydrogenase heavy chain. This family consists of the heavy chain of methylamine dehydrogenase light chain, a periplasmic enzyme. The enzyme contains a tryptophan tryptophylquinone (TTQ) prothetic group derived from two Trp residues in the light subunity. The enzyme forms a complex with the type I blue copper protein amicyanin and a cytochrome. Electron transfer procedes from TQQ to the copper and then to the heme group of the cytochrome.
Probab=74.87  E-value=67  Score=29.23  Aligned_cols=68  Identities=9%  Similarity=0.166  Sum_probs=38.4

Q ss_pred             CCEEEEEe--cC------CCCeEEEEeCCCCceEEccCCCCCcCCCCceEEEeCC-EEEEEcC-CceEEEEEcCCccccc
Q 045821          258 DDALYVLD--QS------SGTKLMMWQKESREWSPVGRLSTLLTRPPCKLVAIGK-TIFVIGK-GCSAVVIDVGNIGNIG  327 (352)
Q Consensus       258 ~~~l~~~g--g~------~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~~-~i~v~gG-~~~~~~~d~~~~~w~~  327 (352)
                      ++++|+..  +-      .++.++++|..+  ++.+..++....-...++..-+. .+|+..+ ...+.++|+++..-.+
T Consensus       259 g~~lyV~~~~~~~~thk~~~~~V~ViD~~t--~kvi~~i~vG~~~~~iavS~Dgkp~lyvtn~~s~~VsViD~~t~k~i~  336 (352)
T TIGR02658       259 RDRIYLLADQRAKWTHKTASRFLFVVDAKT--GKRLRKIELGHEIDSINVSQDAKPLLYALSTGDKTLYIFDAETGKELS  336 (352)
T ss_pred             CCEEEEEecCCccccccCCCCEEEEEECCC--CeEEEEEeCCCceeeEEECCCCCeEEEEeCCCCCcEEEEECcCCeEEe
Confidence            57788742  11      115899999765  44454444321122223333345 6787775 4457799988765433


No 183
>PF14583 Pectate_lyase22:  Oligogalacturonate lyase; PDB: 3C5M_C 3PE7_A.
Probab=73.82  E-value=41  Score=30.79  Aligned_cols=223  Identities=11%  Similarity=0.005  Sum_probs=92.5

Q ss_pred             CCEEEEEEecCCCCceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEECCCCe
Q 045821           68 ETWIYAFCRDNKLERVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDASMNT  147 (352)
Q Consensus        68 ~~~l~~~gg~~~~~~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~  147 (352)
                      ++.-+++.+. ..+....|.+|.  .++.-.+|...+..  ...+..++.-+..+|.+-.       ...++..|..|.+
T Consensus        46 dG~kllF~s~-~dg~~nly~lDL--~t~~i~QLTdg~g~--~~~g~~~s~~~~~~~Yv~~-------~~~l~~vdL~T~e  113 (386)
T PF14583_consen   46 DGRKLLFASD-FDGNRNLYLLDL--ATGEITQLTDGPGD--NTFGGFLSPDDRALYYVKN-------GRSLRRVDLDTLE  113 (386)
T ss_dssp             TS-EEEEEE--TTSS-EEEEEET--TT-EEEE---SS-B---TTT-EE-TTSSEEEEEET-------TTEEEEEETTT--
T ss_pred             CCCEEEEEec-cCCCcceEEEEc--ccCEEEECccCCCC--CccceEEecCCCeEEEEEC-------CCeEEEEECCcCc
Confidence            4545556554 566778899999  99999999886631  1223333333556654421       2578889999887


Q ss_pred             eEecCCCCCcceeeeeEEE--CCEEEEEeCC---C----CC-------------CceEEEeCCCCcEEEecCCCCcccce
Q 045821          148 WTDAAPMSTARCYFPCGVL--NQKIYCIGGL---G----DT-------------HSWDVYDPRTNNWKLHTEPNIFTEIE  205 (352)
Q Consensus       148 W~~~~~~~~~r~~~~~~~~--~~~iyv~gG~---~----~~-------------~~v~~yd~~t~~W~~~~~~~~~~~~~  205 (352)
                      =+.+-..|..-......+.  ++..++.--.   +    ..             ..+..-|..|++.+.+-.  ......
T Consensus       114 ~~~vy~~p~~~~g~gt~v~n~d~t~~~g~e~~~~d~~~l~~~~~f~e~~~a~p~~~i~~idl~tG~~~~v~~--~~~wlg  191 (386)
T PF14583_consen  114 ERVVYEVPDDWKGYGTWVANSDCTKLVGIEISREDWKPLTKWKGFREFYEARPHCRIFTIDLKTGERKVVFE--DTDWLG  191 (386)
T ss_dssp             EEEEEE--TTEEEEEEEEE-TTSSEEEEEEEEGGG-----SHHHHHHHHHC---EEEEEEETTT--EEEEEE--ESS-EE
T ss_pred             EEEEEECCcccccccceeeCCCccEEEEEEEeehhccCccccHHHHHHHhhCCCceEEEEECCCCceeEEEe--cCcccc
Confidence            6666666654443333322  2322221100   0    00             556667777777665532  111111


Q ss_pred             eE--EEECCEEEEEccCCCC-CC-ceEEEEEeCCCCceeecCCCCCCCCCCc-EEEEC-CEEEEEecCCC---CeEEEEe
Q 045821          206 DS--FVMDGKIYIRCSASAA-TS-HVCALVYEPSTDSWLHADANMASGWRGP-AVVVD-DALYVLDQSSG---TKLMMWQ  276 (352)
Q Consensus       206 ~~--~~~~g~iyv~GG~~~~-~~-~~~i~~yd~~~~~W~~~~~~~~~~~~~~-~~~~~-~~l~~~gg~~~---~~v~~yd  276 (352)
                      +.  .-.+..+.+++ ..+. .. ...||..|........+....+.-..+. --..+ ..|+..+...+   ..+..||
T Consensus       192 H~~fsP~dp~li~fC-HEGpw~~Vd~RiW~i~~dg~~~~~v~~~~~~e~~gHEfw~~DG~~i~y~~~~~~~~~~~i~~~d  270 (386)
T PF14583_consen  192 HVQFSPTDPTLIMFC-HEGPWDLVDQRIWTINTDGSNVKKVHRRMEGESVGHEFWVPDGSTIWYDSYTPGGQDFWIAGYD  270 (386)
T ss_dssp             EEEEETTEEEEEEEE-E-S-TTTSS-SEEEEETTS---EESS---TTEEEEEEEE-TTSS-EEEEEEETTT--EEEEEE-
T ss_pred             CcccCCCCCCEEEEe-ccCCcceeceEEEEEEcCCCcceeeecCCCCcccccccccCCCCEEEEEeecCCCCceEEEeeC
Confidence            11  11244454542 1111 11 1267888876554444433221111111 11223 34544443232   3578889


Q ss_pred             CCCCceEEccCCCCCcCCCCceEEEeCCEEEEE
Q 045821          277 KESREWSPVGRLSTLLTRPPCKLVAIGKTIFVI  309 (352)
Q Consensus       277 ~~~~~W~~~~~~~~~~~~~~~~~~~~~~~i~v~  309 (352)
                      +++..=+.+..+|.    ..+-....++++++=
T Consensus       271 ~~t~~~~~~~~~p~----~~H~~ss~Dg~L~vG  299 (386)
T PF14583_consen  271 PDTGERRRLMEMPW----CSHFMSSPDGKLFVG  299 (386)
T ss_dssp             TTT--EEEEEEE-S----EEEEEE-TTSSEEEE
T ss_pred             CCCCCceEEEeCCc----eeeeEEcCCCCEEEe
Confidence            99865444444432    122333445666554


No 184
>PRK10115 protease 2; Provisional
Probab=73.78  E-value=1e+02  Score=30.95  Aligned_cols=150  Identities=11%  Similarity=0.017  Sum_probs=79.5

Q ss_pred             CCeEEEEEC--CCCeeEecCCCCCcceeeeeEEECCEEEEEeCCCCC-CceEEEeCC-CCcEEEecCCCCcccceeEEEE
Q 045821          135 TSEVYCYDA--SMNTWTDAAPMSTARCYFPCGVLNQKIYCIGGLGDT-HSWDVYDPR-TNNWKLHTEPNIFTEIEDSFVM  210 (352)
Q Consensus       135 ~~~~~~~d~--~t~~W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~-~~v~~yd~~-t~~W~~~~~~~~~~~~~~~~~~  210 (352)
                      .+.++.++.  .+..|..+...+.. ........++.+|+..-.+.. ..+...+.. ++.|+.+-+......-......
T Consensus       246 ~~~~~l~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~ly~~tn~~~~~~~l~~~~~~~~~~~~~l~~~~~~~~i~~~~~~  324 (686)
T PRK10115        246 TSEVLLLDAELADAEPFVFLPRRKD-HEYSLDHYQHRFYLRSNRHGKNFGLYRTRVRDEQQWEELIPPRENIMLEGFTLF  324 (686)
T ss_pred             cccEEEEECcCCCCCceEEEECCCC-CEEEEEeCCCEEEEEEcCCCCCceEEEecCCCcccCeEEECCCCCCEEEEEEEE
Confidence            456777773  23444333222222 222333456788888755433 556666766 5789888644232333344555


Q ss_pred             CCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEE--EC-CEEEE-EecCCC-CeEEEEeCCCCceEEc
Q 045821          211 DGKIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVV--VD-DALYV-LDQSSG-TKLMMWQKESREWSPV  285 (352)
Q Consensus       211 ~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~--~~-~~l~~-~gg~~~-~~v~~yd~~~~~W~~~  285 (352)
                      ++.|++.. ..  .....++.+|..+.....+.-..+.........  .+ +.+++ +.+... ..++.||+.+++|+.+
T Consensus       325 ~~~l~~~~-~~--~g~~~l~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~ss~~~P~~~y~~d~~~~~~~~l  401 (686)
T PRK10115        325 TDWLVVEE-RQ--RGLTSLRQINRKTREVIGIAFDDPAYVTWIAYNPEPETSRLRYGYSSMTTPDTLFELDMDTGERRVL  401 (686)
T ss_pred             CCEEEEEE-Ee--CCEEEEEEEcCCCCceEEecCCCCceEeeecccCCCCCceEEEEEecCCCCCEEEEEECCCCcEEEE
Confidence            78887773 22  233357788876655555531111110000011  12 33433 333332 6899999999998877


Q ss_pred             cCC
Q 045821          286 GRL  288 (352)
Q Consensus       286 ~~~  288 (352)
                      ...
T Consensus       402 ~~~  404 (686)
T PRK10115        402 KQT  404 (686)
T ss_pred             Eec
Confidence            643


No 185
>KOG0263 consensus Transcription initiation factor TFIID, subunit TAF5 (also component of histone acetyltransferase SAGA) [Transcription]
Probab=71.66  E-value=43  Score=33.04  Aligned_cols=63  Identities=6%  Similarity=0.139  Sum_probs=36.9

Q ss_pred             ECCEEEEEccCCCCCCceEEEEEeCCCCcee-ecCCCCCCCCCCcEEEECCEEEEEecCCCCeEEEEeCCC
Q 045821          210 MDGKIYIRCSASAATSHVCALVYEPSTDSWL-HADANMASGWRGPAVVVDDALYVLDQSSGTKLMMWQKES  279 (352)
Q Consensus       210 ~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~-~~~~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~  279 (352)
                      -.|+..+.|+..+     .|..||..+.+=. ++... ........+..+|.+++.||-++ +|-.||...
T Consensus       587 p~Gr~LaSg~ed~-----~I~iWDl~~~~~v~~l~~H-t~ti~SlsFS~dg~vLasgg~Dn-sV~lWD~~~  650 (707)
T KOG0263|consen  587 PCGRYLASGDEDG-----LIKIWDLANGSLVKQLKGH-TGTIYSLSFSRDGNVLASGGADN-SVRLWDLTK  650 (707)
T ss_pred             CCCceEeecccCC-----cEEEEEcCCCcchhhhhcc-cCceeEEEEecCCCEEEecCCCC-eEEEEEchh
Confidence            3677666654442     5777888765322 22111 11111223567999999998766 888888764


No 186
>KOG0643 consensus Translation initiation factor 3, subunit i (eIF-3i)/TGF-beta receptor-interacting protein (TRIP-1) [Translation, ribosomal structure and biogenesis; Signal transduction mechanisms]
Probab=71.60  E-value=67  Score=27.78  Aligned_cols=171  Identities=9%  Similarity=0.005  Sum_probs=80.2

Q ss_pred             CCCCeEEEEECCCCeeEecCCCCCcceeeeeEEECCEEEEEeCC-CCC--CceEEEeCCCC-------c-EEEecCCCCc
Q 045821          133 DATSEVYCYDASMNTWTDAAPMSTARCYFPCGVLNQKIYCIGGL-GDT--HSWDVYDPRTN-------N-WKLHTEPNIF  201 (352)
Q Consensus       133 ~~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~~~~iyv~gG~-~~~--~~v~~yd~~t~-------~-W~~~~~~~~~  201 (352)
                      ....++..+|..+++-...-..+.+-... -...++.+.++.-. ...  ..+.+||....       + ..+++ .+..
T Consensus        71 SAD~t~kLWDv~tGk~la~~k~~~~Vk~~-~F~~~gn~~l~~tD~~mg~~~~v~~fdi~~~~~~~~s~ep~~kI~-t~~s  148 (327)
T KOG0643|consen   71 SADQTAKLWDVETGKQLATWKTNSPVKRV-DFSFGGNLILASTDKQMGYTCFVSVFDIRDDSSDIDSEEPYLKIP-TPDS  148 (327)
T ss_pred             cccceeEEEEcCCCcEEEEeecCCeeEEE-eeccCCcEEEEEehhhcCcceEEEEEEccCChhhhcccCceEEec-CCcc
Confidence            34566788899998743333333332111 11234444444321 111  67788888733       2 44444 2221


Q ss_pred             ccceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCc-eeecCCCCCCCCCCcEEEECCEEEEEecCCCCeEEEEeCCCC
Q 045821          202 TEIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDS-WLHADANMASGWRGPAVVVDDALYVLDQSSGTKLMMWQKESR  280 (352)
Q Consensus       202 ~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~-W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~  280 (352)
                      ....+.-.--++..+.|...+     .|-.||..++. -.+-...........-..-+.-.|+.|..+. .--.+|..+-
T Consensus       149 kit~a~Wg~l~~~ii~Ghe~G-----~is~~da~~g~~~v~s~~~h~~~Ind~q~s~d~T~FiT~s~Dt-takl~D~~tl  222 (327)
T KOG0643|consen  149 KITSALWGPLGETIIAGHEDG-----SISIYDARTGKELVDSDEEHSSKINDLQFSRDRTYFITGSKDT-TAKLVDVRTL  222 (327)
T ss_pred             ceeeeeecccCCEEEEecCCC-----cEEEEEcccCceeeechhhhccccccccccCCcceEEecccCc-cceeeeccce
Confidence            111111112345555543332     58889988753 2221111111101111223445555555443 5566666553


Q ss_pred             ceEEccCCCCCcCCCCceEEEeCCEEEEEcCCc
Q 045821          281 EWSPVGRLSTLLTRPPCKLVAIGKTIFVIGKGC  313 (352)
Q Consensus       281 ~W~~~~~~~~~~~~~~~~~~~~~~~i~v~gG~~  313 (352)
                      +-.+-  .....+....++..+.+++.+-||.+
T Consensus       223 ~v~Kt--y~te~PvN~aaisP~~d~VilgGGqe  253 (327)
T KOG0643|consen  223 EVLKT--YTTERPVNTAAISPLLDHVILGGGQE  253 (327)
T ss_pred             eeEEE--eeecccccceecccccceEEecCCce
Confidence            33221  11112344567777888888888865


No 187
>COG0823 TolB Periplasmic component of the Tol biopolymer transport system [Intracellular trafficking and secretion]
Probab=70.44  E-value=95  Score=29.10  Aligned_cols=108  Identities=11%  Similarity=-0.032  Sum_probs=60.6

Q ss_pred             CCCeEEEEECCCCeeEecCCCCCcceeeeeEEECCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECCE
Q 045821          134 ATSEVYCYDASMNTWTDAAPMSTARCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGK  213 (352)
Q Consensus       134 ~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~  213 (352)
                      ...+++.+|..+++-..+.....--...+...-+.+|+..........+++||++..+=+++..-..... .....-+|+
T Consensus       260 g~~~iy~~dl~~~~~~~Lt~~~gi~~~Ps~spdG~~ivf~Sdr~G~p~I~~~~~~g~~~~riT~~~~~~~-~p~~SpdG~  338 (425)
T COG0823         260 GSPDIYLMDLDGKNLPRLTNGFGINTSPSWSPDGSKIVFTSDRGGRPQIYLYDLEGSQVTRLTFSGGGNS-NPVWSPDGD  338 (425)
T ss_pred             CCccEEEEcCCCCcceecccCCccccCccCCCCCCEEEEEeCCCCCcceEEECCCCCceeEeeccCCCCc-CccCCCCCC
Confidence            3578999999988744443332211222222223445544443333899999999887666653222221 223334565


Q ss_pred             EEEEccCCCCCCceEEEEEeCCCCc-eeecCC
Q 045821          214 IYIRCSASAATSHVCALVYEPSTDS-WLHADA  244 (352)
Q Consensus       214 iyv~GG~~~~~~~~~i~~yd~~~~~-W~~~~~  244 (352)
                      .+++-+... +. ..+..+|+.++. |+.+..
T Consensus       339 ~i~~~~~~~-g~-~~i~~~~~~~~~~~~~lt~  368 (425)
T COG0823         339 KIVFESSSG-GQ-WDIDKNDLASGGKIRILTS  368 (425)
T ss_pred             EEEEEeccC-Cc-eeeEEeccCCCCcEEEccc
Confidence            555544321 11 578889987776 887754


No 188
>KOG0647 consensus mRNA export protein (contains WD40 repeats) [RNA processing and modification]
Probab=69.90  E-value=78  Score=27.86  Aligned_cols=129  Identities=12%  Similarity=0.099  Sum_probs=67.4

Q ss_pred             CCeEEEEECCCCeeEecCCCCCc-ceeeeeEEECCEEEEEeCCCCCCceEEEeCCCCcEEEecCC-CCcccceeEEEECC
Q 045821          135 TSEVYCYDASMNTWTDAAPMSTA-RCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEP-NIFTEIEDSFVMDG  212 (352)
Q Consensus       135 ~~~~~~~d~~t~~W~~~~~~~~~-r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~-~~~~~~~~~~~~~g  212 (352)
                      .+.+-.||+.+++-..++.--.| |..|-.-..+..+.+.|..+  ..+-.+|+....  .+..+ .+.+.++ +-+...
T Consensus        93 Dk~~k~wDL~S~Q~~~v~~Hd~pvkt~~wv~~~~~~cl~TGSWD--KTlKfWD~R~~~--pv~t~~LPeRvYa-~Dv~~p  167 (347)
T KOG0647|consen   93 DKQAKLWDLASGQVSQVAAHDAPVKTCHWVPGMNYQCLVTGSWD--KTLKFWDTRSSN--PVATLQLPERVYA-ADVLYP  167 (347)
T ss_pred             CCceEEEEccCCCeeeeeecccceeEEEEecCCCcceeEecccc--cceeecccCCCC--eeeeeeccceeee-hhccCc
Confidence            45678899999987777643332 33232222233455555554  556666666332  12111 1222222 222233


Q ss_pred             EEEEEccCCCCCCceEEEEEeCCCCc--eeecCCCCCCCCCCcEEEECCEEEEEecCCCCeEEEE
Q 045821          213 KIYIRCSASAATSHVCALVYEPSTDS--WLHADANMASGWRGPAVVVDDALYVLDQSSGTKLMMW  275 (352)
Q Consensus       213 ~iyv~GG~~~~~~~~~i~~yd~~~~~--W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~y  275 (352)
                      .+.|..+.      ..|.+|++.+..  ...+..+........++..+++.|.+|+.+| .+.+.
T Consensus       168 m~vVata~------r~i~vynL~n~~te~k~~~SpLk~Q~R~va~f~d~~~~alGsiEG-rv~iq  225 (347)
T KOG0647|consen  168 MAVVATAE------RHIAVYNLENPPTEFKRIESPLKWQTRCVACFQDKDGFALGSIEG-RVAIQ  225 (347)
T ss_pred             eeEEEecC------CcEEEEEcCCCcchhhhhcCcccceeeEEEEEecCCceEeeeecc-eEEEE
Confidence            33333221      248888886553  3444444444444456777888999999988 44443


No 189
>KOG1036 consensus Mitotic spindle checkpoint protein BUB3, WD repeat superfamily [Cell cycle control, cell division, chromosome partitioning]
Probab=69.90  E-value=77  Score=27.85  Aligned_cols=132  Identities=14%  Similarity=0.119  Sum_probs=63.4

Q ss_pred             CeEEEEECCCCeeEecCCCCCcceeeeeEEECCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECCEEE
Q 045821          136 SEVYCYDASMNTWTDAAPMSTARCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIY  215 (352)
Q Consensus       136 ~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iy  215 (352)
                      .++-.||..++.-+..-....|  --.++..+..=.+.||.+  ..+-+||..+..=.++..-..+-..-.-. ......
T Consensus        35 gslrlYdv~~~~l~~~~~~~~p--lL~c~F~d~~~~~~G~~d--g~vr~~Dln~~~~~~igth~~~i~ci~~~-~~~~~v  109 (323)
T KOG1036|consen   35 GSLRLYDVPANSLKLKFKHGAP--LLDCAFADESTIVTGGLD--GQVRRYDLNTGNEDQIGTHDEGIRCIEYS-YEVGCV  109 (323)
T ss_pred             CcEEEEeccchhhhhheecCCc--eeeeeccCCceEEEeccC--ceEEEEEecCCcceeeccCCCceEEEEee-ccCCeE
Confidence            3466777776632211111111  123333444444556666  67889999988755554222211111111 223334


Q ss_pred             EEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEEECCEEEEEecCCCCeEEEEeCCCCc
Q 045821          216 IRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVVVDDALYVLDQSSGTKLMMWQKESRE  281 (352)
Q Consensus       216 v~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~  281 (352)
                      |.||-+     ..|..+|+.+..=..  ...... .-.+..+.+..+++|+... .+..||..+..
T Consensus       110 IsgsWD-----~~ik~wD~R~~~~~~--~~d~~k-kVy~~~v~g~~LvVg~~~r-~v~iyDLRn~~  166 (323)
T KOG1036|consen  110 ISGSWD-----KTIKFWDPRNKVVVG--TFDQGK-KVYCMDVSGNRLVVGTSDR-KVLIYDLRNLD  166 (323)
T ss_pred             EEcccC-----ccEEEEecccccccc--ccccCc-eEEEEeccCCEEEEeecCc-eEEEEEccccc
Confidence            555444     356777775511110  101111 1112345566666766444 89999987643


No 190
>KOG0263 consensus Transcription initiation factor TFIID, subunit TAF5 (also component of histone acetyltransferase SAGA) [Transcription]
Probab=69.42  E-value=60  Score=32.06  Aligned_cols=59  Identities=15%  Similarity=0.154  Sum_probs=34.1

Q ss_pred             CCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcc--cceeEEEECCEEEEEccCCCCCCceEEEEEeC
Q 045821          167 NQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFT--EIEDSFVMDGKIYIRCSASAATSHVCALVYEP  234 (352)
Q Consensus       167 ~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~--~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~  234 (352)
                      +|+-.+.|+.+  ..+..||..+.+  .+..+....  -....+..+|.+.+.||.+.     .+..+|.
T Consensus       588 ~Gr~LaSg~ed--~~I~iWDl~~~~--~v~~l~~Ht~ti~SlsFS~dg~vLasgg~Dn-----sV~lWD~  648 (707)
T KOG0263|consen  588 CGRYLASGDED--GLIKIWDLANGS--LVKQLKGHTGTIYSLSFSRDGNVLASGGADN-----SVRLWDL  648 (707)
T ss_pred             CCceEeecccC--CcEEEEEcCCCc--chhhhhcccCceeEEEEecCCCEEEecCCCC-----eEEEEEc
Confidence            45555555444  678889988764  222222221  11234566999999987763     4555555


No 191
>KOG0305 consensus Anaphase promoting complex, Cdc20, Cdh1, and Ama1 subunits [Cell cycle control, cell division, chromosome partitioning; Posttranslational modification, protein turnover, chaperones]
Probab=69.18  E-value=1.1e+02  Score=29.19  Aligned_cols=173  Identities=10%  Similarity=0.058  Sum_probs=81.4

Q ss_pred             eEEEEECCCCeeEecCCCCCcceeeeeEEE--CCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcc-cceeEEEECCE
Q 045821          137 EVYCYDASMNTWTDAAPMSTARCYFPCGVL--NQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFT-EIEDSFVMDGK  213 (352)
Q Consensus       137 ~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~--~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~-~~~~~~~~~g~  213 (352)
                      .++.++-.+..-..+.++.  +...+++..  +|...++|-.+  ..|++||..+..  .+..+...+ ...++..-++.
T Consensus       198 ~vylW~~~s~~v~~l~~~~--~~~vtSv~ws~~G~~LavG~~~--g~v~iwD~~~~k--~~~~~~~~h~~rvg~laW~~~  271 (484)
T KOG0305|consen  198 SVYLWSASSGSVTELCSFG--EELVTSVKWSPDGSHLAVGTSD--GTVQIWDVKEQK--KTRTLRGSHASRVGSLAWNSS  271 (484)
T ss_pred             eEEEEecCCCceEEeEecC--CCceEEEEECCCCCEEEEeecC--CeEEEEehhhcc--ccccccCCcCceeEEEeccCc
Confidence            4455555555544444443  233333333  46666666544  678899988663  444444422 22334455777


Q ss_pred             EEEEccCCCCCCceEEEEEeCCCCcee-ecCCCCCCCCCCcEEEECCEEEEEecCCCCeEEEEeCCCCceEE-ccCCCCC
Q 045821          214 IYIRCSASAATSHVCALVYEPSTDSWL-HADANMASGWRGPAVVVDDALYVLDQSSGTKLMMWQKESREWSP-VGRLSTL  291 (352)
Q Consensus       214 iyv~GG~~~~~~~~~i~~yd~~~~~W~-~~~~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~~-~~~~~~~  291 (352)
                      +...|....     .|..+|....+=. ..-.......++.....++..+.-||.++ .+.+||.....+.. +..... 
T Consensus       272 ~lssGsr~~-----~I~~~dvR~~~~~~~~~~~H~qeVCgLkws~d~~~lASGgnDN-~~~Iwd~~~~~p~~~~~~H~a-  344 (484)
T KOG0305|consen  272 VLSSGSRDG-----KILNHDVRISQHVVSTLQGHRQEVCGLKWSPDGNQLASGGNDN-VVFIWDGLSPEPKFTFTEHTA-  344 (484)
T ss_pred             eEEEecCCC-----cEEEEEEecchhhhhhhhcccceeeeeEECCCCCeeccCCCcc-ceEeccCCCccccEEEeccce-
Confidence            788765553     4556655322110 00000000011111233555555566555 88999883322211 100000 


Q ss_pred             cCCCCceEEEeCCEEEEEcCCce---EEEEEcCCc
Q 045821          292 LTRPPCKLVAIGKTIFVIGKGCS---AVVIDVGNI  323 (352)
Q Consensus       292 ~~~~~~~~~~~~~~i~v~gG~~~---~~~~d~~~~  323 (352)
                       .....+.+.....|+..||...   .-++|+.+.
T Consensus       345 -AVKA~awcP~q~~lLAsGGGs~D~~i~fwn~~~g  378 (484)
T KOG0305|consen  345 -AVKALAWCPWQSGLLATGGGSADRCIKFWNTNTG  378 (484)
T ss_pred             -eeeEeeeCCCccCceEEcCCCcccEEEEEEcCCC
Confidence             1112233445667777776542   224555543


No 192
>KOG2321 consensus WD40 repeat protein [General function prediction only]
Probab=68.98  E-value=65  Score=30.98  Aligned_cols=105  Identities=11%  Similarity=0.091  Sum_probs=56.0

Q ss_pred             CCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEE--CCEEEEEccCCCCCCceEEEEEeCCCCce-eecC
Q 045821          167 NQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVM--DGKIYIRCSASAATSHVCALVYEPSTDSW-LHAD  243 (352)
Q Consensus       167 ~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~--~g~iyv~GG~~~~~~~~~i~~yd~~~~~W-~~~~  243 (352)
                      +-.+|+.|..   .+|+++|.+.+.|-.  +.......--++.+  -+.|..+|+..+     .++.+|+.+.+- ..+.
T Consensus       145 scDly~~gsg---~evYRlNLEqGrfL~--P~~~~~~~lN~v~in~~hgLla~Gt~~g-----~VEfwDpR~ksrv~~l~  214 (703)
T KOG2321|consen  145 SCDLYLVGSG---SEVYRLNLEQGRFLN--PFETDSGELNVVSINEEHGLLACGTEDG-----VVEFWDPRDKSRVGTLD  214 (703)
T ss_pred             CccEEEeecC---cceEEEEcccccccc--ccccccccceeeeecCccceEEecccCc-----eEEEecchhhhhheeee
Confidence            3457776643   789999999999843  22222111122333  346777765453     688888866542 2221


Q ss_pred             C------CCCCC--CCCcEEEE-CCEEEEEecCCCCeEEEEeCCCCc
Q 045821          244 A------NMASG--WRGPAVVV-DDALYVLDQSSGTKLMMWQKESRE  281 (352)
Q Consensus       244 ~------~~~~~--~~~~~~~~-~~~l~~~gg~~~~~v~~yd~~~~~  281 (352)
                      .      .+...  ..-.++.+ ++.|-+--|.....+++||+.+.+
T Consensus       215 ~~~~v~s~pg~~~~~svTal~F~d~gL~~aVGts~G~v~iyDLRa~~  261 (703)
T KOG2321|consen  215 AASSVNSHPGGDAAPSVTALKFRDDGLHVAVGTSTGSVLIYDLRASK  261 (703)
T ss_pred             cccccCCCccccccCcceEEEecCCceeEEeeccCCcEEEEEcccCC
Confidence            1      11111  11123334 435655545444489999987654


No 193
>PF12217 End_beta_propel:  Catalytic beta propeller domain of bacteriophage endosialidase;  InterPro: IPR024428 This entry represents the beta propeller domain of endosialidases, which consists of catalytically active part of the enzymes. This core domain forms stable SDS-resistant trimers. There is a nested beta barrel domain in this domain. This domain is typically between 443 and 460 amino acids in length [].; PDB: 1V0E_B 1V0F_E 3JU4_A 3GVL_A 3GVK_B 3GVJ_A.
Probab=68.55  E-value=77  Score=27.29  Aligned_cols=194  Identities=13%  Similarity=0.257  Sum_probs=86.6

Q ss_pred             EEECCEEEE--EeCCCCCCCCCCeEEEEECCCC-eeEe---cCCC----CC-cceeeeeEEECCEEEEEeCCCCC-----
Q 045821          116 EVLGKNAYL--LGGCGWSEDATSEVYCYDASMN-TWTD---AAPM----ST-ARCYFPCGVLNQKIYCIGGLGDT-----  179 (352)
Q Consensus       116 ~~~~~~iyv--~GG~~~~~~~~~~~~~~d~~t~-~W~~---~~~~----~~-~r~~~~~~~~~~~iyv~gG~~~~-----  179 (352)
                      .+.++-||.  ++|....-..+.-.|+=....+ +|..   +.++    |. .-..+++.++++++|.+=....-     
T Consensus        22 FVy~~VIYAPfM~~~RHGv~~LhvaWVkSgDdG~TWttPEwLtd~H~~yptvnyHCmSMGv~~NRLfa~iEtR~~a~~km  101 (367)
T PF12217_consen   22 FVYDNVIYAPFMAGDRHGVDNLHVAWVKSGDDGQTWTTPEWLTDLHPDYPTVNYHCMSMGVVGNRLFAVIETRTVASNKM  101 (367)
T ss_dssp             EEETTEEEEEEEEESSSSSTT-EEEEEEESSTTSS----EESS---TTTTTEEEE-B-EEEETTEEEEEEEEEETTT--E
T ss_pred             eeecCeeecccccccccCccceEEEEEEecCCCCcccCchhhhhcCCCCCccceeeeeeeeecceeeEEEeehhhhhhhh
Confidence            445666652  3333222233445566555543 4743   2222    22 23356777889999887553211     


Q ss_pred             CceEEEe---CCCCcEEEec--CCCC-------cccceeEEEECCEEEEEccCCCCCC--ceEEEEEeCCC---Cce--e
Q 045821          180 HSWDVYD---PRTNNWKLHT--EPNI-------FTEIEDSFVMDGKIYIRCSASAATS--HVCALVYEPST---DSW--L  240 (352)
Q Consensus       180 ~~v~~yd---~~t~~W~~~~--~~~~-------~~~~~~~~~~~g~iyv~GG~~~~~~--~~~i~~yd~~~---~~W--~  240 (352)
                      ...+.||   .....|+...  ..+.       ....|+.+.+++.-|.+|=.++.-.  ...+..|...-   ...  +
T Consensus       102 ~~~~Lw~RpMF~~spW~~teL~~~~~~~~a~~~vTe~HSFa~i~~~~fA~GyHnGD~sPRe~G~~yfs~~~~sp~~~vrr  181 (367)
T PF12217_consen  102 VRAELWSRPMFHDSPWRITELGTIASFTSAGVAVTELHSFATIDDNQFAVGYHNGDVSPRELGFLYFSDAFASPGVFVRR  181 (367)
T ss_dssp             EEEEEEEEE-STTS--EEEEEES-TT--------SEEEEEEE-SSS-EEEEEEE-SSSS-EEEEEEETTTTT-TT--EEE
T ss_pred             hhhhhhcccccccCCceeeecccccccccccceeeeeeeeeEecCCceeEEeccCCCCcceeeEEEecccccCCcceeee
Confidence            3334444   4577887643  2222       1223455777887788854333222  22444442211   111  1


Q ss_pred             ecCCCCCCCCCCcE-EEECCEEEEEe-cC----CCCeEEEEeCCCCceEEccCCCCCcCCCCceEEEeCCEEEEEc
Q 045821          241 HADANMASGWRGPA-VVVDDALYVLD-QS----SGTKLMMWQKESREWSPVGRLSTLLTRPPCKLVAIGKTIFVIG  310 (352)
Q Consensus       241 ~~~~~~~~~~~~~~-~~~~~~l~~~g-g~----~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~~~i~v~g  310 (352)
                      .++.......+.+. -..+|+||++- |.    -++.+.+-+.....|+.+.- |...-....-++..++.||+||
T Consensus       182 ~i~sey~~~AsEPCvkyY~g~LyLtTRgt~~~~~GS~L~rs~d~G~~w~slrf-p~nvHhtnlPFakvgD~l~mFg  256 (367)
T PF12217_consen  182 IIPSEYERNASEPCVKYYDGVLYLTTRGTLPTNPGSSLHRSDDNGQNWSSLRF-PNNVHHTNLPFAKVGDVLYMFG  256 (367)
T ss_dssp             E--GGG-TTEEEEEEEEETTEEEEEEEES-TTS---EEEEESSTTSS-EEEE--TT---SS---EEEETTEEEEEE
T ss_pred             echhhhccccccchhhhhCCEEEEEEcCcCCCCCcceeeeecccCCchhhccc-cccccccCCCceeeCCEEEEEe
Confidence            22211111111111 24699999884 22    23678888888999998863 3333333445667899999998


No 194
>COG4447 Uncharacterized protein related to plant photosystem II stability/assembly factor [General function prediction only]
Probab=67.98  E-value=53  Score=28.59  Aligned_cols=113  Identities=11%  Similarity=0.056  Sum_probs=66.9

Q ss_pred             eEEECCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCceeec
Q 045821          163 CGVLNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWLHA  242 (352)
Q Consensus       163 ~~~~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~  242 (352)
                      .+.-+.+.+++|+..   .+..=|-.-++|.+...+...+..+...+.+.+=++.| ..     ..++.-+-...+|.++
T Consensus        50 ~~~~g~~gwlVg~rg---tiletdd~g~tw~qal~~~gr~~f~sv~f~~~egw~vG-e~-----sqll~T~DgGqsWARi  120 (339)
T COG4447          50 FTESGSHGWLVGGRG---TILETDDGGITWAQALDFLGRHAFHSVSFLGMEGWIVG-EP-----SQLLHTTDGGQSWARI  120 (339)
T ss_pred             EeecCcceEEEcCcc---eEEEecCCcccchhhhchhhhhheeeeeeecccccccC-Cc-----ceEEEecCCCcchhhc
Confidence            333467899999863   34455666778998765544444554444454455552 22     3466666778899998


Q ss_pred             CCCCCCCCCCc--EEEECCEEEEEecCCCCeEEEEeCCCCceEEcc
Q 045821          243 DANMASGWRGP--AVVVDDALYVLDQSSGTKLMMWQKESREWSPVG  286 (352)
Q Consensus       243 ~~~~~~~~~~~--~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~  286 (352)
                      +..........  .+..+..-+++|.+.  .|+.-+-..+.|+.+.
T Consensus       121 ~~~e~~eg~~~sI~f~d~q~g~m~gd~G--ail~T~DgGk~Wk~l~  164 (339)
T COG4447         121 PLSEKLEGFPDSITFLDDQRGEMLGDQG--AILKTTDGGKNWKALV  164 (339)
T ss_pred             hhhcCCCCCcceeEEecchhhhhhcccc--eEEEecCCcccHhHhc
Confidence            65332222222  233344456666643  5666666778898775


No 195
>PF14583 Pectate_lyase22:  Oligogalacturonate lyase; PDB: 3C5M_C 3PE7_A.
Probab=67.95  E-value=1e+02  Score=28.39  Aligned_cols=129  Identities=12%  Similarity=0.007  Sum_probs=57.7

Q ss_pred             EEEecCCCCCCCeeEeccCCCcccCCcceEEEEE-CCEEEEEeCCCCCCCCCCeEEEEECCCCe-eEecCCCCCcceeee
Q 045821           85 CYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVL-GKNAYLLGGCGWSEDATSEVYCYDASMNT-WTDAAPMSTARCYFP  162 (352)
Q Consensus        85 ~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~-~~~iyv~GG~~~~~~~~~~~~~~d~~t~~-W~~~~~~~~~r~~~~  162 (352)
                      ++..|.  .++..+.+-.-.    ...+|..... +..+.+|--...-......+|..|..... |.....++.....|-
T Consensus       170 i~~idl--~tG~~~~v~~~~----~wlgH~~fsP~dp~li~fCHEGpw~~Vd~RiW~i~~dg~~~~~v~~~~~~e~~gHE  243 (386)
T PF14583_consen  170 IFTIDL--KTGERKVVFEDT----DWLGHVQFSPTDPTLIMFCHEGPWDLVDQRIWTINTDGSNVKKVHRRMEGESVGHE  243 (386)
T ss_dssp             EEEEET--TT--EEEEEEES----S-EEEEEEETTEEEEEEEEE-S-TTTSS-SEEEEETTS---EESS---TTEEEEEE
T ss_pred             EEEEEC--CCCceeEEEecC----ccccCcccCCCCCCEEEEeccCCcceeceEEEEEEcCCCcceeeecCCCCcccccc
Confidence            334455  666655554322    1223333333 33445553222222234589999987555 444444444555665


Q ss_pred             eEEECCE-EEEEeCCCCC--CceEEEeCCCCcEEEecCCCCcccceeEEEECCEEEEEccCC
Q 045821          163 CGVLNQK-IYCIGGLGDT--HSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSAS  221 (352)
Q Consensus       163 ~~~~~~~-iyv~gG~~~~--~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~  221 (352)
                      --..+|. |+..+.....  .-+..||+.|..=+.+..+|  ...|.....+|++++--|.+
T Consensus       244 fw~~DG~~i~y~~~~~~~~~~~i~~~d~~t~~~~~~~~~p--~~~H~~ss~Dg~L~vGDG~d  303 (386)
T PF14583_consen  244 FWVPDGSTIWYDSYTPGGQDFWIAGYDPDTGERRRLMEMP--WCSHFMSSPDGKLFVGDGGD  303 (386)
T ss_dssp             EE-TTSS-EEEEEEETTT--EEEEEE-TTT--EEEEEEE---SEEEEEE-TTSSEEEEEE--
T ss_pred             cccCCCCEEEEEeecCCCCceEEEeeCCCCCCceEEEeCC--ceeeeEEcCCCCEEEecCCC
Confidence            5555553 3332322222  55677999987544443343  35566667788988866654


No 196
>KOG0286 consensus G-protein beta subunit [General function prediction only]
Probab=67.45  E-value=86  Score=27.47  Aligned_cols=134  Identities=14%  Similarity=0.152  Sum_probs=69.0

Q ss_pred             CCeEEEEECCCCeeEecCCCCCc-ceeeeeEEECCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEE-CC
Q 045821          135 TSEVYCYDASMNTWTDAAPMSTA-RCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVM-DG  212 (352)
Q Consensus       135 ~~~~~~~d~~t~~W~~~~~~~~~-r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~-~g  212 (352)
                      ..++-.+|..+++-...-.-... ....+...-+.+.||.||++  .....+|.....-.+.-.-. .....++.++ +|
T Consensus       165 D~TCalWDie~g~~~~~f~GH~gDV~slsl~p~~~ntFvSg~cD--~~aklWD~R~~~c~qtF~gh-esDINsv~ffP~G  241 (343)
T KOG0286|consen  165 DMTCALWDIETGQQTQVFHGHTGDVMSLSLSPSDGNTFVSGGCD--KSAKLWDVRSGQCVQTFEGH-ESDINSVRFFPSG  241 (343)
T ss_pred             CceEEEEEcccceEEEEecCCcccEEEEecCCCCCCeEEecccc--cceeeeeccCcceeEeeccc-ccccceEEEccCC
Confidence            45677788888764332110111 11111111267899999988  56677888877554432111 1112233333 67


Q ss_pred             EEEEEccCCCCCCceEEEEEeCCCCceeecC--CCCCCCCCCcEEEECCEEEEEecCCCCeEEEEeC
Q 045821          213 KIYIRCSASAATSHVCALVYEPSTDSWLHAD--ANMASGWRGPAVVVDDALYVLDQSSGTKLMMWQK  277 (352)
Q Consensus       213 ~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~--~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~  277 (352)
                      .-++.|.-+     ...-.||+..++=-.+-  .....+..+.++...|+|++.|..+. ...+||.
T Consensus       242 ~afatGSDD-----~tcRlyDlRaD~~~a~ys~~~~~~gitSv~FS~SGRlLfagy~d~-~c~vWDt  302 (343)
T KOG0286|consen  242 DAFATGSDD-----ATCRLYDLRADQELAVYSHDSIICGITSVAFSKSGRLLFAGYDDF-TCNVWDT  302 (343)
T ss_pred             CeeeecCCC-----ceeEEEeecCCcEEeeeccCcccCCceeEEEcccccEEEeeecCC-ceeEeec
Confidence            777775333     24667887766433331  11222223334556888887775433 5566553


No 197
>KOG0281 consensus Beta-TrCP (transducin repeats containing)/Slimb proteins [Function unknown]
Probab=67.09  E-value=30  Score=30.88  Aligned_cols=102  Identities=14%  Similarity=0.214  Sum_probs=64.3

Q ss_pred             ECCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCceeecCCC
Q 045821          166 LNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWLHADAN  245 (352)
Q Consensus       166 ~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~  245 (352)
                      ++++ |++....+ ..+-+++..|..+-+  .+....++-++.-++|++.|.|..+     +.|-.+|...+..-.+-..
T Consensus       328 fd~k-yIVsASgD-RTikvW~~st~efvR--tl~gHkRGIAClQYr~rlvVSGSSD-----ntIRlwdi~~G~cLRvLeG  398 (499)
T KOG0281|consen  328 FDDK-YIVSASGD-RTIKVWSTSTCEFVR--TLNGHKRGIACLQYRDRLVVSGSSD-----NTIRLWDIECGACLRVLEG  398 (499)
T ss_pred             cccc-eEEEecCC-ceEEEEeccceeeeh--hhhcccccceehhccCeEEEecCCC-----ceEEEEeccccHHHHHHhc
Confidence            4666 44333222 677888888876543  3455556666777899999997443     4677888877665443110


Q ss_pred             CCCCCCCcEEEECCEEEEEecCCCCeEEEEeCCC
Q 045821          246 MASGWRGPAVVVDDALYVLDQSSGTKLMMWQKES  279 (352)
Q Consensus       246 ~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~  279 (352)
                      ...  ---.+-++++=.+-|+++| .+-+||..+
T Consensus       399 HEe--LvRciRFd~krIVSGaYDG-kikvWdl~a  429 (499)
T KOG0281|consen  399 HEE--LVRCIRFDNKRIVSGAYDG-KIKVWDLQA  429 (499)
T ss_pred             hHH--hhhheeecCceeeeccccc-eEEEEeccc
Confidence            000  0112567888888999988 777777654


No 198
>KOG1539 consensus WD repeat protein [General function prediction only]
Probab=66.75  E-value=1.5e+02  Score=29.98  Aligned_cols=71  Identities=11%  Similarity=0.080  Sum_probs=40.5

Q ss_pred             eEEEEECCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcceeeeeEEECCEEEEEeCCCCCCceEEEeCCCCcE
Q 045821          113 MGFEVLGKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTNNW  192 (352)
Q Consensus       113 ~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~W  192 (352)
                      +.....+...||.-.      .-+.+.+||..+-.--.+. -+.|..-.+++.....+|+..|    +.+.+|--....|
T Consensus        39 ~~~~~~~~~~~vtt~------vgksfqvYd~~kl~ll~vs-~~lp~~I~alas~~~~vy~A~g----~~i~~~~rgk~i~  107 (910)
T KOG1539|consen   39 FRVVALGSTFYVTTC------VGKSFQVYDVNKLNLLFVS-KPLPDKITALASDKDYVYVASG----NKIYAYARGKHIR  107 (910)
T ss_pred             eeeeecCceEEEEEe------cCceEEEEeccceEEEEec-CCCCCceEEEEecCceEEEecC----cEEEEEEccceEE
Confidence            344555666666632      2356788888765544444 2333344555666667777777    5666665554444


Q ss_pred             EE
Q 045821          193 KL  194 (352)
Q Consensus       193 ~~  194 (352)
                      -.
T Consensus       108 ~~  109 (910)
T KOG1539|consen  108 HT  109 (910)
T ss_pred             EE
Confidence            33


No 199
>KOG0285 consensus Pleiotropic regulator 1 [RNA processing and modification]
Probab=66.46  E-value=1e+02  Score=27.88  Aligned_cols=204  Identities=15%  Similarity=0.137  Sum_probs=101.1

Q ss_pred             CCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcceeeeeEEE------CCEEEEEeCCCCCCceEEEeCCCCcE
Q 045821          119 GKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCYFPCGVL------NQKIYCIGGLGDTHSWDVYDPRTNNW  192 (352)
Q Consensus       119 ~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~------~~~iyv~gG~~~~~~v~~yd~~t~~W  192 (352)
                      ...=|+|...     ..+.+-+||...|+..+      ..++|-++++      --.+.+.||.+  ..+-++|+.|.. 
T Consensus       203 ~rHpYlFs~g-----edk~VKCwDLe~nkvIR------~YhGHlS~V~~L~lhPTldvl~t~grD--st~RvWDiRtr~-  268 (460)
T KOG0285|consen  203 KRHPYLFSAG-----EDKQVKCWDLEYNKVIR------HYHGHLSGVYCLDLHPTLDVLVTGGRD--STIRVWDIRTRA-  268 (460)
T ss_pred             ccCceEEEec-----CCCeeEEEechhhhhHH------HhccccceeEEEeccccceeEEecCCc--ceEEEeeecccc-
Confidence            4445666542     23568889998775322      1122322222      13456667766  667788888764 


Q ss_pred             EEecCCCCcccceeEEE---ECCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEEECCEEEEEecCCC
Q 045821          193 KLHTEPNIFTEIEDSFV---MDGKIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVVVDDALYVLDQSSG  269 (352)
Q Consensus       193 ~~~~~~~~~~~~~~~~~---~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~  269 (352)
                       .+..+.....-...+.   .+.+||-- ..++     .+-.+|...++-...-  ..+.....+.+..-+.+++.....
T Consensus       269 -~V~~l~GH~~~V~~V~~~~~dpqvit~-S~D~-----tvrlWDl~agkt~~tl--t~hkksvral~lhP~e~~fASas~  339 (460)
T KOG0285|consen  269 -SVHVLSGHTNPVASVMCQPTDPQVITG-SHDS-----TVRLWDLRAGKTMITL--THHKKSVRALCLHPKENLFASASP  339 (460)
T ss_pred             -eEEEecCCCCcceeEEeecCCCceEEe-cCCc-----eEEEeeeccCceeEee--ecccceeeEEecCCchhhhhccCC
Confidence             2221222111112222   25666552 3331     3445555544322211  111222233455666666654333


Q ss_pred             CeEEEEeCCCCceEEccCCCCCcCCCCceEEEeCCEEEEEcCCceEEE-EEcCCcccccceEEeccCCCCCCCCcceeeE
Q 045821          270 TKLMMWQKESREWSPVGRLSTLLTRPPCKLVAIGKTIFVIGKGCSAVV-IDVGNIGNIGGIMVSSSIPKLNDNDDIISCK  348 (352)
Q Consensus       270 ~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~~~i~v~gG~~~~~~-~d~~~~~w~~~~~~~~~~~~~~~~~~~~~~~  348 (352)
                      +.+-.++.-.+....  ++.. .....-++....+-+|+.||.+..++ +|-+..- .=....+.-.|++...+..+.|.
T Consensus       340 dnik~w~~p~g~f~~--nlsg-h~~iintl~~nsD~v~~~G~dng~~~fwdwksg~-nyQ~~~t~vqpGSl~sEagI~as  415 (460)
T KOG0285|consen  340 DNIKQWKLPEGEFLQ--NLSG-HNAIINTLSVNSDGVLVSGGDNGSIMFWDWKSGH-NYQRGQTIVQPGSLESEAGIFAS  415 (460)
T ss_pred             ccceeccCCccchhh--cccc-ccceeeeeeeccCceEEEcCCceEEEEEecCcCc-ccccccccccCCccccccceeEE
Confidence            356666655544322  1221 11222356667788999998776554 6655422 22233344558888888666666


Q ss_pred             E
Q 045821          349 C  349 (352)
Q Consensus       349 ~  349 (352)
                      |
T Consensus       416 ~  416 (460)
T KOG0285|consen  416 C  416 (460)
T ss_pred             e
Confidence            5


No 200
>COG0823 TolB Periplasmic component of the Tol biopolymer transport system [Intracellular trafficking and secretion]
Probab=65.71  E-value=1.2e+02  Score=28.46  Aligned_cols=103  Identities=15%  Similarity=0.023  Sum_probs=56.7

Q ss_pred             CceEEEeCCCCcEEEecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEEECC
Q 045821          180 HSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVVVDD  259 (352)
Q Consensus       180 ~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~  259 (352)
                      .+++.+|..+.+=.++..... ...+-+..-+|+-+++....  .....|+.+|....+=+++........ .....-+|
T Consensus       262 ~~iy~~dl~~~~~~~Lt~~~g-i~~~Ps~spdG~~ivf~Sdr--~G~p~I~~~~~~g~~~~riT~~~~~~~-~p~~SpdG  337 (425)
T COG0823         262 PDIYLMDLDGKNLPRLTNGFG-INTSPSWSPDGSKIVFTSDR--GGRPQIYLYDLEGSQVTRLTFSGGGNS-NPVWSPDG  337 (425)
T ss_pred             ccEEEEcCCCCcceecccCCc-cccCccCCCCCCEEEEEeCC--CCCcceEEECCCCCceeEeeccCCCCc-CccCCCCC
Confidence            889999999876333332211 11122333455544442211  222378999998877676654332222 33344455


Q ss_pred             EEEEEecCC-C-CeEEEEeCCCCc-eEEcc
Q 045821          260 ALYVLDQSS-G-TKLMMWQKESRE-WSPVG  286 (352)
Q Consensus       260 ~l~~~gg~~-~-~~v~~yd~~~~~-W~~~~  286 (352)
                      +.+++-+.. + ..+..+|+.++. |+.+.
T Consensus       338 ~~i~~~~~~~g~~~i~~~~~~~~~~~~~lt  367 (425)
T COG0823         338 DKIVFESSSGGQWDIDKNDLASGGKIRILT  367 (425)
T ss_pred             CEEEEEeccCCceeeEEeccCCCCcEEEcc
Confidence            544443322 2 368899998777 87775


No 201
>KOG0318 consensus WD40 repeat stress protein/actin interacting protein [Cytoskeleton]
Probab=65.24  E-value=1.3e+02  Score=28.68  Aligned_cols=86  Identities=17%  Similarity=0.095  Sum_probs=52.4

Q ss_pred             EECCEEEEEecCCCCeEEEEeCCCCceEEccCCCCCcCCCCceEEEeC-CEEEEEcCCceEEEEEcCCcccccce-EEec
Q 045821          256 VVDDALYVLDQSSGTKLMMWQKESREWSPVGRLSTLLTRPPCKLVAIG-KTIFVIGKGCSAVVIDVGNIGNIGGI-MVSS  333 (352)
Q Consensus       256 ~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~-~~i~v~gG~~~~~~~d~~~~~w~~~~-~~~~  333 (352)
                      .-+++-++-|.++| .|..||..++.=.++..-+.  ...-.++...+ +.++.+|=-+...+++.+.+...+.. .-..
T Consensus       329 ~~d~~~i~SgsyDG-~I~~W~~~~g~~~~~~g~~h--~nqI~~~~~~~~~~~~t~g~Dd~l~~~~~~~~~~t~~~~~~lg  405 (603)
T KOG0318|consen  329 SPDGKTIYSGSYDG-HINSWDSGSGTSDRLAGKGH--TNQIKGMAASESGELFTIGWDDTLRVISLKDNGYTKSEVVKLG  405 (603)
T ss_pred             cCCCCEEEeeccCc-eEEEEecCCccccccccccc--cceEEEEeecCCCcEEEEecCCeEEEEecccCcccccceeecC
Confidence            34565555555666 88888888877665542221  22233455555 77888876666667888777766543 3455


Q ss_pred             cCCCCCCCCcc
Q 045821          334 SIPKLNDNDDI  344 (352)
Q Consensus       334 ~~~~~~~~~~~  344 (352)
                      +.|.-.++.+.
T Consensus       406 ~QP~~lav~~d  416 (603)
T KOG0318|consen  406 SQPKGLAVLSD  416 (603)
T ss_pred             CCceeEEEcCC
Confidence            66766666543


No 202
>PF02239 Cytochrom_D1:  Cytochrome D1 heme domain; PDB: 1NNO_B 1HZU_A 1N15_B 1N50_A 1GJQ_A 1BL9_B 1NIR_B 1N90_B 1HZV_A 1AOQ_A ....
Probab=65.00  E-value=1.1e+02  Score=27.96  Aligned_cols=149  Identities=10%  Similarity=0.018  Sum_probs=70.2

Q ss_pred             CCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcceeeeeEE-ECCEEEEEeCCCCCCceEEEeCCCCcE-EEec
Q 045821          119 GKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCYFPCGV-LNQKIYCIGGLGDTHSWDVYDPRTNNW-KLHT  196 (352)
Q Consensus       119 ~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~-~~~~iyv~gG~~~~~~v~~yd~~t~~W-~~~~  196 (352)
                      +..+|+.+.       ...+.++|+.+++  .+...+........+. -+|+..+++.+.. ..+.++|.+|.+= +.++
T Consensus        48 gr~~yv~~r-------dg~vsviD~~~~~--~v~~i~~G~~~~~i~~s~DG~~~~v~n~~~-~~v~v~D~~tle~v~~I~  117 (369)
T PF02239_consen   48 GRYLYVANR-------DGTVSVIDLATGK--VVATIKVGGNPRGIAVSPDGKYVYVANYEP-GTVSVIDAETLEPVKTIP  117 (369)
T ss_dssp             SSEEEEEET-------TSEEEEEETTSSS--EEEEEE-SSEEEEEEE--TTTEEEEEEEET-TEEEEEETTT--EEEEEE
T ss_pred             CCEEEEEcC-------CCeEEEEECCccc--EEEEEecCCCcceEEEcCCCCEEEEEecCC-CceeEeccccccceeecc
Confidence            457898853       2468899999887  3333333333333333 3554333333322 7888899887642 2333


Q ss_pred             C--CCC---cccceeE-EEECCEEEEEccCCCCCCceEEEEEeCCCCc--e-eecCCCCCCCCCCcEEEECCEEEEEecC
Q 045821          197 E--PNI---FTEIEDS-FVMDGKIYIRCSASAATSHVCALVYEPSTDS--W-LHADANMASGWRGPAVVVDDALYVLDQS  267 (352)
Q Consensus       197 ~--~~~---~~~~~~~-~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~--W-~~~~~~~~~~~~~~~~~~~~~l~~~gg~  267 (352)
                      .  ++.   ..+..+. ....+..|++.-.+    ...++..|....+  . +.+..  ....+.....-+++-|+++..
T Consensus       118 ~~~~~~~~~~~Rv~aIv~s~~~~~fVv~lkd----~~~I~vVdy~d~~~~~~~~i~~--g~~~~D~~~dpdgry~~va~~  191 (369)
T PF02239_consen  118 TGGMPVDGPESRVAAIVASPGRPEFVVNLKD----TGEIWVVDYSDPKNLKVTTIKV--GRFPHDGGFDPDGRYFLVAAN  191 (369)
T ss_dssp             --EE-TTTS---EEEEEE-SSSSEEEEEETT----TTEEEEEETTTSSCEEEEEEE----TTEEEEEE-TTSSEEEEEEG
T ss_pred             cccccccccCCCceeEEecCCCCEEEEEEcc----CCeEEEEEeccccccceeeecc--cccccccccCcccceeeeccc
Confidence            1  111   1111222 22344545552222    2257777765432  1 11111  111122233345565666555


Q ss_pred             CCCeEEEEeCCCCceE
Q 045821          268 SGTKLMMWQKESREWS  283 (352)
Q Consensus       268 ~~~~v~~yd~~~~~W~  283 (352)
                      ..+.+.+.|.+++.-.
T Consensus       192 ~sn~i~viD~~~~k~v  207 (369)
T PF02239_consen  192 GSNKIAVIDTKTGKLV  207 (369)
T ss_dssp             GGTEEEEEETTTTEEE
T ss_pred             ccceeEEEeeccceEE
Confidence            5568888888876544


No 203
>KOG0649 consensus WD40 repeat protein [General function prediction only]
Probab=63.36  E-value=95  Score=26.50  Aligned_cols=129  Identities=13%  Similarity=0.041  Sum_probs=69.5

Q ss_pred             CCEEEEEEecCCCCceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEEC-CEEEEEeCCCCCCCCCCeEEEEECCCC
Q 045821           68 ETWIYAFCRDNKLERVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLG-KNAYLLGGCGWSEDATSEVYCYDASMN  146 (352)
Q Consensus        68 ~~~l~~~gg~~~~~~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~-~~iyv~GG~~~~~~~~~~~~~~d~~t~  146 (352)
                      ++.|+..||.     -..|..|.  ++++.+..-.-.    ..+-|+++.-+ +.-.+.|+.+      ..+-++|..|.
T Consensus       126 enSi~~AgGD-----~~~y~~dl--E~G~i~r~~rGH----tDYvH~vv~R~~~~qilsG~ED------GtvRvWd~kt~  188 (325)
T KOG0649|consen  126 ENSILFAGGD-----GVIYQVDL--EDGRIQREYRGH----TDYVHSVVGRNANGQILSGAED------GTVRVWDTKTQ  188 (325)
T ss_pred             CCcEEEecCC-----eEEEEEEe--cCCEEEEEEcCC----cceeeeeeecccCcceeecCCC------ccEEEEecccc
Confidence            6777777764     34666788  888776554332    12334444422 2223345443      34677888887


Q ss_pred             eeEec-CC-----CCCccee-e-eeEEECCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECCEEEEEc
Q 045821          147 TWTDA-AP-----MSTARCY-F-PCGVLNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRC  218 (352)
Q Consensus       147 ~W~~~-~~-----~~~~r~~-~-~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~G  218 (352)
                      +-..+ .+     ...|..+ + .+...+....|+||-   ...-.++..+.+-..+  .|.+...+.+.++++.+.+.|
T Consensus       189 k~v~~ie~yk~~~~lRp~~g~wigala~~edWlvCGgG---p~lslwhLrsse~t~v--fpipa~v~~v~F~~d~vl~~G  263 (325)
T KOG0649|consen  189 KHVSMIEPYKNPNLLRPDWGKWIGALAVNEDWLVCGGG---PKLSLWHLRSSESTCV--FPIPARVHLVDFVDDCVLIGG  263 (325)
T ss_pred             ceeEEeccccChhhcCcccCceeEEEeccCceEEecCC---CceeEEeccCCCceEE--EecccceeEeeeecceEEEec
Confidence            64332 22     1222222 2 233335566677664   4455666666554444  344455555666777777665


No 204
>COG4880 Secreted protein containing C-terminal beta-propeller domain distantly related to WD-40 repeats [General function prediction only]
Probab=62.58  E-value=1.3e+02  Score=27.93  Aligned_cols=184  Identities=13%  Similarity=0.105  Sum_probs=88.5

Q ss_pred             eeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEECCCCee-EecCCCCCcceee
Q 045821           83 VCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDASMNTW-TDAAPMSTARCYF  161 (352)
Q Consensus        83 ~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W-~~~~~~~~~r~~~  161 (352)
                      ...+++|.  .-+.--++..+..  ..| -+++-..++..|++-=     ...+-+++.|...-+= +.+..+.-|-+.-
T Consensus       406 N~vYilDe--~lnvvGkltGl~~--gER-IYAvRf~gdv~yiVTf-----rqtDPlfviDlsNPenPkvlGeLKIPGfS~  475 (603)
T COG4880         406 NAVYILDE--NLNVVGKLTGLAP--GER-IYAVRFVGDVLYIVTF-----RQTDPLFVIDLSNPENPKVLGELKIPGFSE  475 (603)
T ss_pred             ceeEEEcC--CCcEEEEEeccCC--Cce-EEEEEEeCceEEEEEE-----eccCceEEEEcCCCCCCceeEEEecCCchh
Confidence            45788887  7777777776651  123 2445556777777642     2234466666653221 1122222222221


Q ss_pred             eeEEEC-CEEEEEeCCCCCCceEEEeCCCCc-EEEec------CCCCcccceeEEEECC--EEEEEccCCCCCCceEEEE
Q 045821          162 PCGVLN-QKIYCIGGLGDTHSWDVYDPRTNN-WKLHT------EPNIFTEIEDSFVMDG--KIYIRCSASAATSHVCALV  231 (352)
Q Consensus       162 ~~~~~~-~~iyv~gG~~~~~~v~~yd~~t~~-W~~~~------~~~~~~~~~~~~~~~g--~iyv~GG~~~~~~~~~i~~  231 (352)
                      -...++ +++.=+|.+++.-.+-.||.+.-. =+.+.      -..+.-+.|+++..|.  .|+.+-.+.+    ..++.
T Consensus       476 YLHpigen~~lGvG~~~g~vKiSLFdiSdl~~PkEv~~y~l~~~wspvf~dhHAFl~d~~~~ifFlPay~~----gyif~  551 (603)
T COG4880         476 YLHPIGENRLLGVGAYQGGVKISLFDISDLAAPKEVSNYTLSNAWSPVFYDHHAFLYDPEAEIFFLPAYLG----GYIFF  551 (603)
T ss_pred             hccccCCCcEEEeecccCCceEEEEeccCCCCchhhhheehhhhcchhhhccceeecCCcccEEEecccCc----cEEEE
Confidence            222233 455555555544555666664311 00010      0112234456666653  4554422111    02333


Q ss_pred             EeCCCCceeecCCCCCCCCCCcEEEECCEEEEEecCCCCeEEEEeCCCCceEEccCC
Q 045821          232 YEPSTDSWLHADANMASGWRGPAVVVDDALYVLDQSSGTKLMMWQKESREWSPVGRL  288 (352)
Q Consensus       232 yd~~~~~W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~  288 (352)
                      +..  + -..+...-...-.-.+...++-+|++||.   .+|+||  .|+|+.++..
T Consensus       552 ied--g-~kl~k~~e~k~na~RA~fi~dylY~vg~~---ev~~ld--enswe~Vge~  600 (603)
T COG4880         552 IED--G-SKLRKRAERKLNADRAFFIKDYLYLVGGN---EVWKLD--ENSWEVVGEA  600 (603)
T ss_pred             Eec--C-ceeeehhhhcccceeeEEecceEEEeccc---eeEEec--cchHhhhhhe
Confidence            322  2 11111100000112246789999999997   899988  5779887643


No 205
>KOG0647 consensus mRNA export protein (contains WD40 repeats) [RNA processing and modification]
Probab=61.95  E-value=1.1e+02  Score=26.90  Aligned_cols=63  Identities=16%  Similarity=0.096  Sum_probs=34.2

Q ss_pred             CCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEEE-CCEEEEEecCCCCeEEEEeCCC
Q 045821          211 DGKIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVVV-DDALYVLDQSSGTKLMMWQKES  279 (352)
Q Consensus       211 ~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~-~~~l~~~gg~~~~~v~~yd~~~  279 (352)
                      +|.....||.+.     .+-.+|+.+++=.++...-.+-+...-+.. +-.+++.|..+. .+-.||+..
T Consensus        83 dgskVf~g~~Dk-----~~k~wDL~S~Q~~~v~~Hd~pvkt~~wv~~~~~~cl~TGSWDK-TlKfWD~R~  146 (347)
T KOG0647|consen   83 DGSKVFSGGCDK-----QAKLWDLASGQVSQVAAHDAPVKTCHWVPGMNYQCLVTGSWDK-TLKFWDTRS  146 (347)
T ss_pred             CCceEEeeccCC-----ceEEEEccCCCeeeeeecccceeEEEEecCCCcceeEeccccc-ceeecccCC
Confidence            454444445553     477899999988887652111111111111 123556666655 777778774


No 206
>PTZ00420 coronin; Provisional
Probab=61.18  E-value=1.7e+02  Score=28.67  Aligned_cols=148  Identities=12%  Similarity=0.128  Sum_probs=70.0

Q ss_pred             EEEEEeCCCCCCCCCCeEEEEECCCCeeE-ecCCCCCcceeeeeE-EECCEEEEEeCCCCCCceEEEeCCCCcEEEecCC
Q 045821          121 NAYLLGGCGWSEDATSEVYCYDASMNTWT-DAAPMSTARCYFPCG-VLNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEP  198 (352)
Q Consensus       121 ~iyv~GG~~~~~~~~~~~~~~d~~t~~W~-~~~~~~~~r~~~~~~-~~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~  198 (352)
                      .+++.||.+      ..+.+||..+.+-. .+. .+.  .-.++. ..++.+++.++.+  ..+.+||+.+.+=  +...
T Consensus       139 ~iLaSgS~D------gtIrIWDl~tg~~~~~i~-~~~--~V~SlswspdG~lLat~s~D--~~IrIwD~Rsg~~--i~tl  205 (568)
T PTZ00420        139 YIMCSSGFD------SFVNIWDIENEKRAFQIN-MPK--KLSSLKWNIKGNLLSGTCVG--KHMHIIDPRKQEI--ASSF  205 (568)
T ss_pred             eEEEEEeCC------CeEEEEECCCCcEEEEEe-cCC--cEEEEEECCCCCEEEEEecC--CEEEEEECCCCcE--EEEE
Confidence            445555543      45778898876521 121 111  111121 1267777776644  6789999987642  2111


Q ss_pred             CCcccc---eeEEE-----ECCEEEEEccCCCCCCceEEEEEeCCC-CceeecCCCCCCCCCCcEE-E---ECCEEEEEe
Q 045821          199 NIFTEI---EDSFV-----MDGKIYIRCSASAATSHVCALVYEPST-DSWLHADANMASGWRGPAV-V---VDDALYVLD  265 (352)
Q Consensus       199 ~~~~~~---~~~~~-----~~g~iyv~GG~~~~~~~~~i~~yd~~~-~~W~~~~~~~~~~~~~~~~-~---~~~~l~~~g  265 (352)
                      . .+..   ..++.     -++...+.+|.... ....+..||..+ ..-..... .. ...+... .   ..+.+|+.|
T Consensus       206 ~-gH~g~~~s~~v~~~~fs~d~~~IlTtG~d~~-~~R~VkLWDlr~~~~pl~~~~-ld-~~~~~L~p~~D~~tg~l~lsG  281 (568)
T PTZ00420        206 H-IHDGGKNTKNIWIDGLGGDDNYILSTGFSKN-NMREMKLWDLKNTTSALVTMS-ID-NASAPLIPHYDESTGLIYLIG  281 (568)
T ss_pred             e-cccCCceeEEEEeeeEcCCCCEEEEEEcCCC-CccEEEEEECCCCCCceEEEE-ec-CCccceEEeeeCCCCCEEEEE
Confidence            1 1111   11111     24455555455432 222467777753 22111100 00 0111111 1   146677776


Q ss_pred             cCCCCeEEEEeCCCCceEEcc
Q 045821          266 QSSGTKLMMWQKESREWSPVG  286 (352)
Q Consensus       266 g~~~~~v~~yd~~~~~W~~~~  286 (352)
                      ..++ .+..|+...+.-..+.
T Consensus       282 kGD~-tIr~~e~~~~~~~~l~  301 (568)
T PTZ00420        282 KGDG-NCRYYQHSLGSIRKVN  301 (568)
T ss_pred             ECCC-eEEEEEccCCcEEeec
Confidence            4444 8999998777655554


No 207
>KOG3545 consensus Olfactomedin and related extracellular matrix glycoproteins [Extracellular structures]
Probab=60.42  E-value=1.1e+02  Score=26.17  Aligned_cols=190  Identities=16%  Similarity=0.221  Sum_probs=106.7

Q ss_pred             CCEEEEEEecCCCCceeEEEe----cCCCCCCCeeEeccCCCcccCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEEC
Q 045821           68 ETWIYAFCRDNKLERVCCYVL----DPNSTRRSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDA  143 (352)
Q Consensus        68 ~~~l~~~gg~~~~~~~~~~~~----d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~  143 (352)
                      ++++|++.   .......+.|    |.  ....+...-.+|   .+..+-..++.+|.+|.-.      .....+.+||.
T Consensus        30 ~~r~~~~~---~~~~~~l~E~~~~~~~--~~~~~~~~~~lp---~~~~gTg~VVynGs~yynk------~~t~~ivky~l   95 (249)
T KOG3545|consen   30 DDRIYVMN---YFDGLMLTEYTNLEDF--KRGRKAEKYRLP---YSWDGTGHVVYNGSLYYNK------AGTRNIIKYDL   95 (249)
T ss_pred             cCceEEec---cccCceEEEeccHHHh--hccCcceEEeCC---CCccccceEEEcceEEeec------cCCcceEEEEe
Confidence            56788883   2233344434    23  445555555566   4455566678888888764      33567889999


Q ss_pred             CCCe---eEecCCCCC---------cceeeeeEEECCEEEEEeCCCCC--Cce-EEEeCCC----CcEEEecCCCCcccc
Q 045821          144 SMNT---WTDAAPMST---------ARCYFPCGVLNQKIYCIGGLGDT--HSW-DVYDPRT----NNWKLHTEPNIFTEI  204 (352)
Q Consensus       144 ~t~~---W~~~~~~~~---------~r~~~~~~~~~~~iyv~gG~~~~--~~v-~~yd~~t----~~W~~~~~~~~~~~~  204 (352)
                      .++.   +..++.+..         +......++...-++++--....  ..| -..|+.|    .+|..--   ..+..
T Consensus        96 ~~~~~~~~~~lp~a~y~~~~~y~~~g~sdiD~avDE~GLWviYat~~~~g~iv~skLdp~tl~~e~tW~T~~---~k~~~  172 (249)
T KOG3545|consen   96 ETRTVAGSAALPYAGYHNPSPYYWGGHSDIDLAVDENGLWVIYATPENAGTIVLSKLDPETLEVERTWNTTL---PKRSA  172 (249)
T ss_pred             ecceeeeeeeccccccCCCcccccCCCccccceecccceeEEecccccCCcEEeeccCHHHhheeeeecccc---CCCCc
Confidence            8854   555554321         12233455554555555443322  333 5667743    3575422   22333


Q ss_pred             eeEEEECCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEE----EECCEEEEEecCCCCeEEEEeCC
Q 045821          205 EDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAV----VVDDALYVLDQSSGTKLMMWQKE  278 (352)
Q Consensus       205 ~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~----~~~~~l~~~gg~~~~~v~~yd~~  278 (352)
                      ..++.+=|.||++-....... .--++||..+++=..+.-+.+......+.    ..+.+||+....   .+..|++.
T Consensus       173 ~~aF~iCGvLY~v~S~~~~~~-~i~yaydt~~~~~~~~~ipf~N~y~~~~~idYNP~D~~LY~wdng---~~l~y~l~  246 (249)
T KOG3545|consen  173 GNAFMICGVLYVVHSYNCTHT-QISYAYDTTTGTQERIDLPFPNPYSYATMIDYNPRDRRLYAWDNG---HQLTYNLT  246 (249)
T ss_pred             CceEEEeeeeEEEeccccCCc-eEEEEEEcCCCceecccccccchhhhhhccCCCcccceeeEecCC---cEEEEEeE
Confidence            345666788899854443221 12378999988776665555554444432    347889988543   56777653


No 208
>KOG0295 consensus WD40 repeat-containing protein [Function unknown]
Probab=59.84  E-value=1.4e+02  Score=27.11  Aligned_cols=53  Identities=11%  Similarity=0.134  Sum_probs=34.2

Q ss_pred             ECCEEEEEecCCCCeEEEEeCCCCceEEccCCCCCcCCCCceEEEeCCEEEEEcCCc
Q 045821          257 VDDALYVLDQSSGTKLMMWQKESREWSPVGRLSTLLTRPPCKLVAIGKTIFVIGKGC  313 (352)
Q Consensus       257 ~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~~~i~v~gG~~  313 (352)
                      -+|+ |++..-++..+.+||..+.+....-..+.   ..-.++....++.||+.|+-
T Consensus       344 p~Gk-yi~ScaDDktlrvwdl~~~~cmk~~~ah~---hfvt~lDfh~~~p~VvTGsV  396 (406)
T KOG0295|consen  344 PGGK-YILSCADDKTLRVWDLKNLQCMKTLEAHE---HFVTSLDFHKTAPYVVTGSV  396 (406)
T ss_pred             CCCe-EEEEEecCCcEEEEEeccceeeeccCCCc---ceeEEEecCCCCceEEeccc
Confidence            3455 55554444589999999988766544332   33334555677889988764


No 209
>PF13088 BNR_2:  BNR repeat-like domain; PDB: 2F11_A 2F0Z_A 1VCU_B 2F25_B 1SO7_A 2F29_A 1SNT_A 2F13_A 2F28_A 2F27_A ....
Probab=59.67  E-value=1.1e+02  Score=26.17  Aligned_cols=214  Identities=12%  Similarity=0.158  Sum_probs=98.2

Q ss_pred             CCCCeeEeccCCCcc--cCCcceEEEE--ECCEEEEEe--CCCCC-CCCCCeEEEEECC-CCeeEecCCCCCc------c
Q 045821           93 TRRSWKMIHELPART--LRRKGMGFEV--LGKNAYLLG--GCGWS-EDATSEVYCYDAS-MNTWTDAAPMSTA------R  158 (352)
Q Consensus        93 ~~~~w~~~~~~~~~~--~~~~~~~~~~--~~~~iyv~G--G~~~~-~~~~~~~~~~d~~-t~~W~~~~~~~~~------r  158 (352)
                      ...+|.....+....  .......+..  .++.|+++-  +.... .....-.+..... -.+|.....++..      .
T Consensus        28 ~G~tWs~~~~v~~~~~~~~~~~~p~~~~~~~g~l~l~~~~~~~~~~~~~~~~~~~~S~D~G~TWs~~~~l~~~~~~~~~~  107 (275)
T PF13088_consen   28 GGKTWSEPRIVADGPKPGRRYGNPSLVVDPDGRLWLFYSAGSSGGGWSGSRIYYSRSTDGGKTWSEPTDLPPGWFGNFSG  107 (275)
T ss_dssp             CTTEEEEEEEEETSTBTTCEEEEEEEEEETTSEEEEEEEEEETTESCCTCEEEEEEESSTTSS-EEEEEEHHHCCCSCEE
T ss_pred             CCCeeCCCEEEeeccccCCcccCcEEEEeCCCCEEEEEEEccCCCCCCceeEEEEEECCCCCCCCCccccccccccceec
Confidence            567898765433221  1122222222  278888885  22111 1111112355555 3468876543311      1


Q ss_pred             --eeeeeEEECCEEEEEeCCC-C-C-CceEEEeCC-CCcEEEecCCCCc-ccce-eEEE-ECCEEEEEccCCCCCCceEE
Q 045821          159 --CYFPCGVLNQKIYCIGGLG-D-T-HSWDVYDPR-TNNWKLHTEPNIF-TEIE-DSFV-MDGKIYIRCSASAATSHVCA  229 (352)
Q Consensus       159 --~~~~~~~~~~~iyv~gG~~-~-~-~~v~~yd~~-t~~W~~~~~~~~~-~~~~-~~~~-~~g~iyv~GG~~~~~~~~~i  229 (352)
                        .......-++++++..-.. . . ..+..|... -.+|+....+... .... ..+. -+|+|+++.-.. ... ...
T Consensus       108 ~~~~~~i~~~~G~l~~~~~~~~~~~~~~~~~~S~D~G~tW~~~~~~~~~~~~~e~~~~~~~dG~l~~~~R~~-~~~-~~~  185 (275)
T PF13088_consen  108 PGRGPPIQLPDGRLIAPYYHESGGSFSAFVYYSDDGGKTWSSGSPIPDGQGECEPSIVELPDGRLLAVFRTE-GND-DIY  185 (275)
T ss_dssp             CSEEEEEEECTTEEEEEEEEESSCEEEEEEEEESSTTSSEEEEEECECSEEEEEEEEEEETTSEEEEEEEEC-SST-EEE
T ss_pred             cceeeeeEecCCCEEEEEeeccccCcceEEEEeCCCCceeeccccccccCCcceeEEEECCCCcEEEEEEcc-CCC-cEE
Confidence              1111233378887763111 1 1 334444444 4579887654332 2222 2222 377898884332 111 333


Q ss_pred             EEEeC-CCCceeecCC-CCCCCCCCcE-EE-ECCEEEEEecCC--CCeE--EEEeCCCCceEEccCCCCCc--CCCCceE
Q 045821          230 LVYEP-STDSWLHADA-NMASGWRGPA-VV-VDDALYVLDQSS--GTKL--MMWQKESREWSPVGRLSTLL--TRPPCKL  299 (352)
Q Consensus       230 ~~yd~-~~~~W~~~~~-~~~~~~~~~~-~~-~~~~l~~~gg~~--~~~v--~~yd~~~~~W~~~~~~~~~~--~~~~~~~  299 (352)
                      ..+.. ...+|+.... ..+....... +. -+++++++....  ...+  ..-..+..+|.....+....  .....++
T Consensus       186 ~~~S~D~G~TWs~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~r~~l~l~~S~D~g~tW~~~~~i~~~~~~~~~Y~~~  265 (275)
T PF13088_consen  186 ISRSTDGGRTWSPPQPTNLPNPNSSISLVRLSDGRLLLVYNNPDGRSNLSLYVSEDGGKTWSRPKTIDDGPNGDSGYPSL  265 (275)
T ss_dssp             EEEESSTTSS-EEEEEEECSSCCEEEEEEECTTSEEEEEEECSSTSEEEEEEEECTTCEEEEEEEEEEEEE-CCEEEEEE
T ss_pred             EEEECCCCCcCCCceecccCcccCCceEEEcCCCCEEEEEECCCCCCceEEEEEeCCCCcCCccEEEeCCCCCcEECCee
Confidence            33333 3568987532 1222222222 22 367888777631  1233  33344478999776554422  1223344


Q ss_pred             EEe-CCEEEE
Q 045821          300 VAI-GKTIFV  308 (352)
Q Consensus       300 ~~~-~~~i~v  308 (352)
                      +.. +++|||
T Consensus       266 ~~~~dg~l~i  275 (275)
T PF13088_consen  266 TQLPDGKLYI  275 (275)
T ss_dssp             EEEETTEEEE
T ss_pred             EEeCCCcCCC
Confidence            444 568886


No 210
>KOG0640 consensus mRNA cleavage stimulating factor complex; subunit 1 [RNA processing and modification]
Probab=59.60  E-value=79  Score=27.94  Aligned_cols=104  Identities=16%  Similarity=0.251  Sum_probs=55.6

Q ss_pred             CEEEEEeCCCCCCceEEEeCCCCcE----EEecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCceeecC
Q 045821          168 QKIYCIGGLGDTHSWDVYDPRTNNW----KLHTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWLHAD  243 (352)
Q Consensus       168 ~~iyv~gG~~~~~~v~~yd~~t~~W----~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~  243 (352)
                      ..|.+.|..+  +.+-.||...-+=    +.+.+--+  ..+-++.-.|...++|-     ....+..||..+-+.-.-.
T Consensus       184 e~ILiS~srD--~tvKlFDfsK~saKrA~K~~qd~~~--vrsiSfHPsGefllvgT-----dHp~~rlYdv~T~Qcfvsa  254 (430)
T KOG0640|consen  184 ETILISGSRD--NTVKLFDFSKTSAKRAFKVFQDTEP--VRSISFHPSGEFLLVGT-----DHPTLRLYDVNTYQCFVSA  254 (430)
T ss_pred             hheEEeccCC--CeEEEEecccHHHHHHHHHhhccce--eeeEeecCCCceEEEec-----CCCceeEEeccceeEeeec
Confidence            3455555444  7788888864321    11111000  01112222566666652     2345778888765543222


Q ss_pred             CCCCCCCCCcEE----EECCEEEEEecCCCCeEEEEeCCCCce
Q 045821          244 ANMASGWRGPAV----VVDDALYVLDQSSGTKLMMWQKESREW  282 (352)
Q Consensus       244 ~~~~~~~~~~~~----~~~~~l~~~gg~~~~~v~~yd~~~~~W  282 (352)
                      . |......+..    .-.++||+.|..+| .|-.||-.++..
T Consensus       255 n-Pd~qht~ai~~V~Ys~t~~lYvTaSkDG-~IklwDGVS~rC  295 (430)
T KOG0640|consen  255 N-PDDQHTGAITQVRYSSTGSLYVTASKDG-AIKLWDGVSNRC  295 (430)
T ss_pred             C-cccccccceeEEEecCCccEEEEeccCC-cEEeeccccHHH
Confidence            2 2222223322    23789999999888 888988877665


No 211
>PF02239 Cytochrom_D1:  Cytochrome D1 heme domain; PDB: 1NNO_B 1HZU_A 1N15_B 1N50_A 1GJQ_A 1BL9_B 1NIR_B 1N90_B 1HZV_A 1AOQ_A ....
Probab=59.37  E-value=1.4e+02  Score=27.27  Aligned_cols=134  Identities=10%  Similarity=-0.000  Sum_probs=67.4

Q ss_pred             CCeEEEEECCCCe-eEecCCCCCcceeeeeEEE---CCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccce-eEEE
Q 045821          135 TSEVYCYDASMNT-WTDAAPMSTARCYFPCGVL---NQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIE-DSFV  209 (352)
Q Consensus       135 ~~~~~~~d~~t~~-W~~~~~~~~~r~~~~~~~~---~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~-~~~~  209 (352)
                      .+++.+.|..+++ -..++....   .|....+   +..+|+.+. +  ..+-++|+.+.+  .+...+...... .+..
T Consensus        15 ~~~v~viD~~t~~~~~~i~~~~~---~h~~~~~s~Dgr~~yv~~r-d--g~vsviD~~~~~--~v~~i~~G~~~~~i~~s   86 (369)
T PF02239_consen   15 SGSVAVIDGATNKVVARIPTGGA---PHAGLKFSPDGRYLYVANR-D--GTVSVIDLATGK--VVATIKVGGNPRGIAVS   86 (369)
T ss_dssp             GTEEEEEETTT-SEEEEEE-STT---EEEEEE-TT-SSEEEEEET-T--SEEEEEETTSSS--EEEEEE-SSEEEEEEE-
T ss_pred             CCEEEEEECCCCeEEEEEcCCCC---ceeEEEecCCCCEEEEEcC-C--CeEEEEECCccc--EEEEEecCCCcceEEEc
Confidence            5678888988875 344443222   2444433   347899863 3  678999999887  232222222222 2344


Q ss_pred             ECCEEEEEccCCCCCCceEEEEEeCCCCce-eecCCC-----CCCCCCCcE-EEECCEEEEEecCCCCeEEEEeCCCC
Q 045821          210 MDGKIYIRCSASAATSHVCALVYEPSTDSW-LHADAN-----MASGWRGPA-VVVDDALYVLDQSSGTKLMMWQKESR  280 (352)
Q Consensus       210 ~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W-~~~~~~-----~~~~~~~~~-~~~~~~l~~~gg~~~~~v~~yd~~~~  280 (352)
                      -+|+..+++...    ...+..+|..+.+= ..++..     ....+..+. ....+..|++.-.+...++..|....
T Consensus        87 ~DG~~~~v~n~~----~~~v~v~D~~tle~v~~I~~~~~~~~~~~~Rv~aIv~s~~~~~fVv~lkd~~~I~vVdy~d~  160 (369)
T PF02239_consen   87 PDGKYVYVANYE----PGTVSVIDAETLEPVKTIPTGGMPVDGPESRVAAIVASPGRPEFVVNLKDTGEIWVVDYSDP  160 (369)
T ss_dssp             -TTTEEEEEEEE----TTEEEEEETTT--EEEEEE--EE-TTTS---EEEEEE-SSSSEEEEEETTTTEEEEEETTTS
T ss_pred             CCCCEEEEEecC----CCceeEeccccccceeecccccccccccCCCceeEEecCCCCEEEEEEccCCeEEEEEeccc
Confidence            567654443322    12678889877643 233221     111111111 22355667776554458999887654


No 212
>PF06433 Me-amine-dh_H:  Methylamine dehydrogenase heavy chain (MADH);  InterPro: IPR009451 Methylamine dehydrogenase (1.4.99.3 from EC) is a periplasmic quinoprotein found in several methyltrophic bacteria []. It is induced when grown on methylamine as a carbon source MADH and catalyses the oxidative deamination of amines to their corresponding aldehydes. The redox cofactor of this enzyme is tryptophan tryptophylquinone (TTQ). Electrons derived from the oxidation of methylamine are passed to an electron acceptor, which is usually the blue-copper protein amicyanin (IPR002386 from INTERPRO).  RCH2NH2 + H2O + acceptor = RCHO + NH3 + reduced acceptor  MADH is a hetero-tetramer, comprised of two heavy subunits and two light subunits. The heavy subunit forms a seven-bladed beta-propeller like structure [].; GO: 0030058 amine dehydrogenase activity, 0030416 methylamine metabolic process, 0055114 oxidation-reduction process, 0042597 periplasmic space; PDB: 3RN1_F 3SVW_F 3PXT_F 3L4O_F 3L4M_D 3SJL_F 3PXS_D 3ORV_F 3RMZ_F 3RLM_F ....
Probab=58.48  E-value=42  Score=30.19  Aligned_cols=70  Identities=23%  Similarity=0.356  Sum_probs=39.0

Q ss_pred             CCEEEEEeC---CCCCCCCCCeEEEEECCCCeeEecCCCCCcceeeeeEEE-CC--EEEEEeCCCCCCceEEEeCCCCcE
Q 045821          119 GKNAYLLGG---CGWSEDATSEVYCYDASMNTWTDAAPMSTARCYFPCGVL-NQ--KIYCIGGLGDTHSWDVYDPRTNNW  192 (352)
Q Consensus       119 ~~~iyv~GG---~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~-~~--~iyv~gG~~~~~~v~~yd~~t~~W  192 (352)
                      .+.+|+..-   ......+..++|.||+.+++--.--++..+  -.+..+- +.  .+|.+.+.+  ..+.+||..|++-
T Consensus       249 ~~rlyvLMh~g~~gsHKdpgteVWv~D~~t~krv~Ri~l~~~--~~Si~Vsqd~~P~L~~~~~~~--~~l~v~D~~tGk~  324 (342)
T PF06433_consen  249 SGRLYVLMHQGGEGSHKDPGTEVWVYDLKTHKRVARIPLEHP--IDSIAVSQDDKPLLYALSAGD--GTLDVYDAATGKL  324 (342)
T ss_dssp             TTEEEEEEEE--TT-TTS-EEEEEEEETTTTEEEEEEEEEEE--ESEEEEESSSS-EEEEEETTT--TEEEEEETTT--E
T ss_pred             cCeEEEEecCCCCCCccCCceEEEEEECCCCeEEEEEeCCCc--cceEEEccCCCcEEEEEcCCC--CeEEEEeCcCCcE
Confidence            678998742   222235678999999999873222122222  2123333 33  566654422  5789999999853


No 213
>COG3391 Uncharacterized conserved protein [Function unknown]
Probab=58.17  E-value=1.5e+02  Score=27.19  Aligned_cols=150  Identities=14%  Similarity=0.110  Sum_probs=82.1

Q ss_pred             CCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCC
Q 045821          167 NQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWLHADANM  246 (352)
Q Consensus       167 ~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~  246 (352)
                      +.++|+..+..  +.+.++|..+++-............-.....++.+|+.....   ....+...|..+++=.....--
T Consensus        85 ~~~vyv~~~~~--~~v~vid~~~~~~~~~~~vG~~P~~~~~~~~~~~vYV~n~~~---~~~~vsvid~~t~~~~~~~~vG  159 (381)
T COG3391          85 GNKVYVTTGDS--NTVSVIDTATNTVLGSIPVGLGPVGLAVDPDGKYVYVANAGN---GNNTVSVIDAATNKVTATIPVG  159 (381)
T ss_pred             CCeEEEecCCC--CeEEEEcCcccceeeEeeeccCCceEEECCCCCEEEEEeccc---CCceEEEEeCCCCeEEEEEecC
Confidence            56899998754  788899977765433222222222212223356999984321   2347888999888755441111


Q ss_pred             CCCCCCcEEE-ECCEEEEEecCCCCeEEEEeCCCCceEEccC----CCCCcCCCCceEEEeCCEEEEEcCCc---eEEEE
Q 045821          247 ASGWRGPAVV-VDDALYVLDQSSGTKLMMWQKESREWSPVGR----LSTLLTRPPCKLVAIGKTIFVIGKGC---SAVVI  318 (352)
Q Consensus       247 ~~~~~~~~~~-~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~----~~~~~~~~~~~~~~~~~~i~v~gG~~---~~~~~  318 (352)
                      ..+ .+.++. .++++|+..... +.+..+|+.++.-.+ +.    .+.......-.+..-+.++|+.-...   ....+
T Consensus       160 ~~P-~~~a~~p~g~~vyv~~~~~-~~v~vi~~~~~~v~~-~~~~~~~~~~~~P~~i~v~~~g~~~yV~~~~~~~~~v~~i  236 (381)
T COG3391         160 NTP-TGVAVDPDGNKVYVTNSDD-NTVSVIDTSGNSVVR-GSVGSLVGVGTGPAGIAVDPDGNRVYVANDGSGSNNVLKI  236 (381)
T ss_pred             CCc-ceEEECCCCCeEEEEecCC-CeEEEEeCCCcceec-cccccccccCCCCceEEECCCCCEEEEEeccCCCceEEEE
Confidence            111 333443 356688887333 489999988776554 32    22211111122223466788876544   45566


Q ss_pred             EcCCcc
Q 045821          319 DVGNIG  324 (352)
Q Consensus       319 d~~~~~  324 (352)
                      |...+.
T Consensus       237 d~~~~~  242 (381)
T COG3391         237 DTATGN  242 (381)
T ss_pred             eCCCce
Confidence            666543


No 214
>KOG4378 consensus Nuclear protein COP1 [Signal transduction mechanisms]
Probab=57.44  E-value=1.8e+02  Score=27.68  Aligned_cols=90  Identities=8%  Similarity=-0.015  Sum_probs=46.7

Q ss_pred             CceEEEeCCCCc----EEEecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCceee-cCCCCCCCCCCcE
Q 045821          180 HSWDVYDPRTNN----WKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWLH-ADANMASGWRGPA  254 (352)
Q Consensus       180 ~~v~~yd~~t~~----W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~-~~~~~~~~~~~~~  254 (352)
                      ..|..||.....    |.+...  .+.++-+.+..+..|++.-|.+.     .|..||.....-+. +.  ...+....+
T Consensus       187 G~VtlwDv~g~sp~~~~~~~Hs--AP~~gicfspsne~l~vsVG~Dk-----ki~~yD~~s~~s~~~l~--y~~Plstva  257 (673)
T KOG4378|consen  187 GAVTLWDVQGMSPIFHASEAHS--APCRGICFSPSNEALLVSVGYDK-----KINIYDIRSQASTDRLT--YSHPLSTVA  257 (673)
T ss_pred             CeEEEEeccCCCcccchhhhcc--CCcCcceecCCccceEEEecccc-----eEEEeecccccccceee--ecCCcceee
Confidence            456667776542    433321  11122223345677777666663     68889987544322 11  122222223


Q ss_pred             EEECCEEEEEecCCCCeEEEEeCCC
Q 045821          255 VVVDDALYVLDQSSGTKLMMWQKES  279 (352)
Q Consensus       255 ~~~~~~l~~~gg~~~~~v~~yd~~~  279 (352)
                      +.-+|.+++.|...| .++.||+..
T Consensus       258 f~~~G~~L~aG~s~G-~~i~YD~R~  281 (673)
T KOG4378|consen  258 FSECGTYLCAGNSKG-ELIAYDMRS  281 (673)
T ss_pred             ecCCceEEEeecCCc-eEEEEeccc
Confidence            333555555555555 899999874


No 215
>KOG0640 consensus mRNA cleavage stimulating factor complex; subunit 1 [RNA processing and modification]
Probab=55.47  E-value=1.2e+02  Score=26.92  Aligned_cols=108  Identities=9%  Similarity=0.159  Sum_probs=55.6

Q ss_pred             CCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCccccee-EEEE--CCEEEEEccCCCCCCceEEEEEeCCCCcee-ec
Q 045821          167 NQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIED-SFVM--DGKIYIRCSASAATSHVCALVYEPSTDSWL-HA  242 (352)
Q Consensus       167 ~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~-~~~~--~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~-~~  242 (352)
                      .|...++|...  ..+..||..|-+.-.-..|...+.... .+-+  .++||+-|..++     .|-.+|--.++.. .+
T Consensus       227 sGefllvgTdH--p~~rlYdv~T~QcfvsanPd~qht~ai~~V~Ys~t~~lYvTaSkDG-----~IklwDGVS~rCv~t~  299 (430)
T KOG0640|consen  227 SGEFLLVGTDH--PTLRLYDVNTYQCFVSANPDDQHTGAITQVRYSSTGSLYVTASKDG-----AIKLWDGVSNRCVRTI  299 (430)
T ss_pred             CCceEEEecCC--CceeEEeccceeEeeecCcccccccceeEEEecCCccEEEEeccCC-----cEEeeccccHHHHHHH
Confidence            34444544322  778899999887654333222222221 1222  689999976654     4666666555432 23


Q ss_pred             CCCCCCCCCCcE-EEECCEEEEEecCCCCeEEEEeCCCCce
Q 045821          243 DANMASGWRGPA-VVVDDALYVLDQSSGTKLMMWQKESREW  282 (352)
Q Consensus       243 ~~~~~~~~~~~~-~~~~~~l~~~gg~~~~~v~~yd~~~~~W  282 (352)
                      ...........+ +.-+++-.+-.|.+. .+..|.+.++.-
T Consensus       300 ~~AH~gsevcSa~Ftkn~kyiLsSG~DS-~vkLWEi~t~R~  339 (430)
T KOG0640|consen  300 GNAHGGSEVCSAVFTKNGKYILSSGKDS-TVKLWEISTGRM  339 (430)
T ss_pred             HhhcCCceeeeEEEccCCeEEeecCCcc-eeeeeeecCCce
Confidence            221211111122 234666555556544 566666766654


No 216
>KOG1520 consensus Predicted alkaloid synthase/Surface mucin Hemomucin [General function prediction only]
Probab=55.45  E-value=1.7e+02  Score=26.80  Aligned_cols=69  Identities=13%  Similarity=0.161  Sum_probs=39.6

Q ss_pred             CCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCC---CCCCc-EEEECCEEEEEecCC----------------CC
Q 045821          211 DGKIYIRCSASAATSHVCALVYEPSTDSWLHADANMAS---GWRGP-AVVVDDALYVLDQSS----------------GT  270 (352)
Q Consensus       211 ~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~---~~~~~-~~~~~~~l~~~gg~~----------------~~  270 (352)
                      +|.+||.-++-      .++.-++..+.=+.+.+....   .+... .+.-+|.+|......                ..
T Consensus       126 ggdL~VaDAYl------GL~~V~p~g~~a~~l~~~~~G~~~kf~N~ldI~~~g~vyFTDSSsk~~~rd~~~a~l~g~~~G  199 (376)
T KOG1520|consen  126 GGDLYVADAYL------GLLKVGPEGGLAELLADEAEGKPFKFLNDLDIDPEGVVYFTDSSSKYDRRDFVFAALEGDPTG  199 (376)
T ss_pred             CCeEEEEecce------eeEEECCCCCcceeccccccCeeeeecCceeEcCCCeEEEeccccccchhheEEeeecCCCcc
Confidence            55899984433      578888887775554432211   11111 122355666554322                13


Q ss_pred             eEEEEeCCCCceEEc
Q 045821          271 KLMMWQKESREWSPV  285 (352)
Q Consensus       271 ~v~~yd~~~~~W~~~  285 (352)
                      .+.+||+.++.=+.+
T Consensus       200 Rl~~YD~~tK~~~VL  214 (376)
T KOG1520|consen  200 RLFRYDPSTKVTKVL  214 (376)
T ss_pred             ceEEecCcccchhhh
Confidence            799999999876555


No 217
>KOG0289 consensus mRNA splicing factor [General function prediction only]
Probab=54.70  E-value=1.8e+02  Score=27.02  Aligned_cols=119  Identities=13%  Similarity=0.099  Sum_probs=63.2

Q ss_pred             ceEEEEE-CCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcceeeeeEEE-CCEEEEEeCCCCCCceEEEeCCC
Q 045821          112 GMGFEVL-GKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCYFPCGVL-NQKIYCIGGLGDTHSWDVYDPRT  189 (352)
Q Consensus       112 ~~~~~~~-~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~-~~~iyv~gG~~~~~~v~~yd~~t  189 (352)
                      ..+++.+ +|.||..|      .....+-+||..+..  .++.+|.--..-....+ ++--|++-+.++ .+|..+|...
T Consensus       350 ~ts~~fHpDgLifgtg------t~d~~vkiwdlks~~--~~a~Fpght~~vk~i~FsENGY~Lat~add-~~V~lwDLRK  420 (506)
T KOG0289|consen  350 YTSAAFHPDGLIFGTG------TPDGVVKIWDLKSQT--NVAKFPGHTGPVKAISFSENGYWLATAADD-GSVKLWDLRK  420 (506)
T ss_pred             eEEeeEcCCceEEecc------CCCceEEEEEcCCcc--ccccCCCCCCceeEEEeccCceEEEEEecC-CeEEEEEehh
Confidence            4444444 44555544      334567788988775  55555541111222333 233333344343 4589999986


Q ss_pred             Cc-EEEecCCCCcccceeEEEE--CCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCC
Q 045821          190 NN-WKLHTEPNIFTEIEDSFVM--DGKIYIRCSASAATSHVCALVYEPSTDSWLHADANM  246 (352)
Q Consensus       190 ~~-W~~~~~~~~~~~~~~~~~~--~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~  246 (352)
                      .. ...+. ++. .....+..+  .|+..+++|..     ..++.|+-.+..|+++....
T Consensus       421 l~n~kt~~-l~~-~~~v~s~~fD~SGt~L~~~g~~-----l~Vy~~~k~~k~W~~~~~~~  473 (506)
T KOG0289|consen  421 LKNFKTIQ-LDE-KKEVNSLSFDQSGTYLGIAGSD-----LQVYICKKKTKSWTEIKELA  473 (506)
T ss_pred             hcccceee-ccc-cccceeEEEcCCCCeEEeecce-----eEEEEEecccccceeeehhh
Confidence            54 33332 111 112233334  46666665332     46788888999999986543


No 218
>PF14298 DUF4374:  Domain of unknown function (DUF4374)
Probab=54.40  E-value=1.9e+02  Score=27.11  Aligned_cols=59  Identities=17%  Similarity=0.210  Sum_probs=38.3

Q ss_pred             CceEEEeCCCCcEEEecCCCCc---ccceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCce
Q 045821          180 HSWDVYDPRTNNWKLHTEPNIF---TEIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSW  239 (352)
Q Consensus       180 ~~v~~yd~~t~~W~~~~~~~~~---~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W  239 (352)
                      ..+-+||..+.+...+..+|..   .......+-+|++|+-=. ...+....|+.+|+.+.+=
T Consensus       367 ~~laI~d~~~kt~t~V~glP~~~is~~~~~~~ve~G~aYi~Vt-t~~g~~~~IY~iDp~TatA  428 (435)
T PF14298_consen  367 KKLAIFDVSNKTFTWVTGLPADLISGFGNAPYVENGKAYIPVT-TEDGSDPYIYKIDPATATA  428 (435)
T ss_pred             ceEEEEEccCceeEEeccCChhhccccccceEeeCCEEEEEEe-ecCCCceeEEEEcCccccc
Confidence            7778899999887777766654   111234556888888422 1222245799999988753


No 219
>KOG1274 consensus WD40 repeat protein [General function prediction only]
Probab=53.97  E-value=2.6e+02  Score=28.64  Aligned_cols=149  Identities=11%  Similarity=0.037  Sum_probs=64.6

Q ss_pred             EEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeC--CCCceeecCCCCC
Q 045821          170 IYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEP--STDSWLHADANMA  247 (352)
Q Consensus       170 iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~--~~~~W~~~~~~~~  247 (352)
                      -++.|..+  +.|.+|...+.+-..+-..-..+..+.++..+|+..++||.+.     .|-+.+.  .+..-...+.  .
T Consensus        68 ~f~~~s~~--~tv~~y~fps~~~~~iL~Rftlp~r~~~v~g~g~~iaagsdD~-----~vK~~~~~D~s~~~~lrgh--~  138 (933)
T KOG1274|consen   68 HFLTGSEQ--NTVLRYKFPSGEEDTILARFTLPIRDLAVSGSGKMIAAGSDDT-----AVKLLNLDDSSQEKVLRGH--D  138 (933)
T ss_pred             ceEEeecc--ceEEEeeCCCCCccceeeeeeccceEEEEecCCcEEEeecCce-----eEEEEeccccchheeeccc--C
Confidence            44444333  7888887776654433211112223345556777777765441     2222222  2222222111  1


Q ss_pred             CCCCCcEEEECCEEEEEecCCCCeEEEEeCCCCc----eEEccCCCCCc-CC--CCceEEEeCCEEEEEcCCceEEEEEc
Q 045821          248 SGWRGPAVVVDDALYVLDQSSGTKLMMWQKESRE----WSPVGRLSTLL-TR--PPCKLVAIGKTIFVIGKGCSAVVIDV  320 (352)
Q Consensus       248 ~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~----W~~~~~~~~~~-~~--~~~~~~~~~~~i~v~gG~~~~~~~d~  320 (352)
                      ..-.+....-++.++.+...+| .|.+||+.+..    |..+..-.... .+  ...+...-++++.+.+-.+.+.+|+.
T Consensus       139 apVl~l~~~p~~~fLAvss~dG-~v~iw~~~~~~~~~tl~~v~k~n~~~~s~i~~~~aW~Pk~g~la~~~~d~~Vkvy~r  217 (933)
T KOG1274|consen  139 APVLQLSYDPKGNFLAVSSCDG-KVQIWDLQDGILSKTLTGVDKDNEFILSRICTRLAWHPKGGTLAVPPVDNTVKVYSR  217 (933)
T ss_pred             CceeeeeEcCCCCEEEEEecCc-eEEEEEcccchhhhhcccCCccccccccceeeeeeecCCCCeEEeeccCCeEEEEcc
Confidence            1111111223555555544444 78888877654    33332111100 01  11111223466666665555556654


Q ss_pred             CCcccccceE
Q 045821          321 GNIGNIGGIM  330 (352)
Q Consensus       321 ~~~~w~~~~~  330 (352)
                      +  .|...+.
T Consensus       218 ~--~we~~f~  225 (933)
T KOG1274|consen  218 K--GWELQFK  225 (933)
T ss_pred             C--Cceehee
Confidence            4  3554433


No 220
>KOG4378 consensus Nuclear protein COP1 [Signal transduction mechanisms]
Probab=53.09  E-value=2.1e+02  Score=27.22  Aligned_cols=61  Identities=18%  Similarity=0.248  Sum_probs=34.1

Q ss_pred             ECCEEEEEecCCCCeEEEEeCCCCceEEccCCCCCcCCCCceEE-EeCCEEEEEcCCce-EEEEEcCC
Q 045821          257 VDDALYVLDQSSGTKLMMWQKESREWSPVGRLSTLLTRPPCKLV-AIGKTIFVIGKGCS-AVVIDVGN  322 (352)
Q Consensus       257 ~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~-~~~~~i~v~gG~~~-~~~~d~~~  322 (352)
                      .+..|++-=|++. .+..||.....-..  .+..  .....+++ ..+|.++++|-... ++.||+..
T Consensus       219 sne~l~vsVG~Dk-ki~~yD~~s~~s~~--~l~y--~~Plstvaf~~~G~~L~aG~s~G~~i~YD~R~  281 (673)
T KOG4378|consen  219 SNEALLVSVGYDK-KINIYDIRSQASTD--RLTY--SHPLSTVAFSECGTYLCAGNSKGELIAYDMRS  281 (673)
T ss_pred             CccceEEEecccc-eEEEeecccccccc--eeee--cCCcceeeecCCceEEEeecCCceEEEEeccc
Confidence            4677777777776 89999998543321  1111  11112232 23566777775554 44588653


No 221
>PF14781 BBS2_N:  Ciliary BBSome complex subunit 2, N-terminal
Probab=52.48  E-value=1e+02  Score=23.57  Aligned_cols=63  Identities=14%  Similarity=0.071  Sum_probs=35.4

Q ss_pred             CEEEEEeCCCCCCceEEEeCCCCc---EEEecCCCCcccceeEEE---ECCEEEEEccCCCCCCceEEEEEeCCCC--ce
Q 045821          168 QKIYCIGGLGDTHSWDVYDPRTNN---WKLHTEPNIFTEIEDSFV---MDGKIYIRCSASAATSHVCALVYEPSTD--SW  239 (352)
Q Consensus       168 ~~iyv~gG~~~~~~v~~yd~~t~~---W~~~~~~~~~~~~~~~~~---~~g~iyv~GG~~~~~~~~~i~~yd~~~~--~W  239 (352)
                      ..+.++|..   +.+.+||...|.   ++.++   ..........   ....+.++||..      .+.-||-..+  -|
T Consensus        64 ~D~LliGt~---t~llaYDV~~N~d~Fyke~~---DGvn~i~~g~~~~~~~~l~ivGGnc------si~Gfd~~G~e~fW  131 (136)
T PF14781_consen   64 RDCLLIGTQ---TSLLAYDVENNSDLFYKEVP---DGVNAIVIGKLGDIPSPLVIVGGNC------SIQGFDYEGNEIFW  131 (136)
T ss_pred             cCEEEEecc---ceEEEEEcccCchhhhhhCc---cceeEEEEEecCCCCCcEEEECceE------EEEEeCCCCcEEEE
Confidence            345666654   679999999885   44443   2221111112   245677777654      5777776543  36


Q ss_pred             eec
Q 045821          240 LHA  242 (352)
Q Consensus       240 ~~~  242 (352)
                      +..
T Consensus       132 tVt  134 (136)
T PF14781_consen  132 TVT  134 (136)
T ss_pred             Eec
Confidence            643


No 222
>KOG3881 consensus Uncharacterized conserved protein [Function unknown]
Probab=51.81  E-value=1.9e+02  Score=26.42  Aligned_cols=148  Identities=9%  Similarity=0.071  Sum_probs=72.0

Q ss_pred             CEEEEEeCCCCCCCCCCeEEEEECCCC--eeEecCC------CCCcceeeeeEEECC---EEEEEeCCCCCCceEEEeCC
Q 045821          120 KNAYLLGGCGWSEDATSEVYCYDASMN--TWTDAAP------MSTARCYFPCGVLNQ---KIYCIGGLGDTHSWDVYDPR  188 (352)
Q Consensus       120 ~~iyv~GG~~~~~~~~~~~~~~d~~t~--~W~~~~~------~~~~r~~~~~~~~~~---~iyv~gG~~~~~~v~~yd~~  188 (352)
                      ..|+.+||...    .+.+.+||....  .|+.-..      |..|-....+..+.+   .-|+.+...  ..+-.||+.
T Consensus       161 p~Iva~GGke~----~n~lkiwdle~~~qiw~aKNvpnD~L~LrVPvW~tdi~Fl~g~~~~~fat~T~~--hqvR~YDt~  234 (412)
T KOG3881|consen  161 PYIVATGGKEN----INELKIWDLEQSKQIWSAKNVPNDRLGLRVPVWITDIRFLEGSPNYKFATITRY--HQVRLYDTR  234 (412)
T ss_pred             CceEecCchhc----ccceeeeecccceeeeeccCCCCccccceeeeeeccceecCCCCCceEEEEecc--eeEEEecCc
Confidence            45777788532    466677777654  3764321      223433344444433   233322211  678899998


Q ss_pred             CCcEEEecCCCCccc--ceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEEE-CCEEEEEe
Q 045821          189 TNNWKLHTEPNIFTE--IEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVVV-DDALYVLD  265 (352)
Q Consensus       189 t~~W~~~~~~~~~~~--~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~-~~~l~~~g  265 (352)
                      .++ +.+........  .+.+-.-+++..++|     .....+..||..+.+---..-.-.........+. +..+...+
T Consensus       235 ~qR-RPV~~fd~~E~~is~~~l~p~gn~Iy~g-----n~~g~l~~FD~r~~kl~g~~~kg~tGsirsih~hp~~~~las~  308 (412)
T KOG3881|consen  235 HQR-RPVAQFDFLENPISSTGLTPSGNFIYTG-----NTKGQLAKFDLRGGKLLGCGLKGITGSIRSIHCHPTHPVLASC  308 (412)
T ss_pred             ccC-cceeEeccccCcceeeeecCCCcEEEEe-----cccchhheecccCceeeccccCCccCCcceEEEcCCCceEEee
Confidence            654 34432222211  112222344444443     2223577888877653322110111111111222 34677777


Q ss_pred             cCCCCeEEEEeCCCC
Q 045821          266 QSSGTKLMMWQKESR  280 (352)
Q Consensus       266 g~~~~~v~~yd~~~~  280 (352)
                      |.+- .+-+||.++.
T Consensus       309 GLDR-yvRIhD~ktr  322 (412)
T KOG3881|consen  309 GLDR-YVRIHDIKTR  322 (412)
T ss_pred             ccce-eEEEeecccc
Confidence            7654 7889998873


No 223
>KOG0282 consensus mRNA splicing factor [Function unknown]
Probab=51.32  E-value=70  Score=29.91  Aligned_cols=63  Identities=14%  Similarity=0.199  Sum_probs=36.7

Q ss_pred             ECCEEEEEecCCCCeEEEEeCCCCceEEccCCCCCcCCCCceEEEeCCEEEEEcCCceEE-EEEcCC
Q 045821          257 VDDALYVLDQSSGTKLMMWQKESREWSPVGRLSTLLTRPPCKLVAIGKTIFVIGKGCSAV-VIDVGN  322 (352)
Q Consensus       257 ~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~~~i~v~gG~~~~~-~~d~~~  322 (352)
                      .++.=++.++++. .+-.||.++++-..--.+...  ......-.-+..+++.||.++-. .+|+..
T Consensus       268 ~~g~~fLS~sfD~-~lKlwDtETG~~~~~f~~~~~--~~cvkf~pd~~n~fl~G~sd~ki~~wDiRs  331 (503)
T KOG0282|consen  268 NCGTSFLSASFDR-FLKLWDTETGQVLSRFHLDKV--PTCVKFHPDNQNIFLVGGSDKKIRQWDIRS  331 (503)
T ss_pred             ccCCeeeeeecce-eeeeeccccceEEEEEecCCC--ceeeecCCCCCcEEEEecCCCcEEEEeccc
Confidence            3555667777776 788999999987655332210  00011112245899999887533 355543


No 224
>PF07433 DUF1513:  Protein of unknown function (DUF1513);  InterPro: IPR008311 There are currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=51.21  E-value=1.8e+02  Score=25.87  Aligned_cols=214  Identities=12%  Similarity=0.043  Sum_probs=103.2

Q ss_pred             eEEEEE--CCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcce--eeeeEEECCEEEEEeCCCCC---CceEEE
Q 045821          113 MGFEVL--GKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARC--YFPCGVLNQKIYCIGGLGDT---HSWDVY  185 (352)
Q Consensus       113 ~~~~~~--~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~--~~~~~~~~~~iyv~gG~~~~---~~v~~y  185 (352)
                      |.++..  ...+.+|+-     .+-.-+.+||+.+++-...-..+..|.  +|++..-+|+.......+..   ..+-+|
T Consensus         8 H~~a~~p~~~~avafaR-----RPG~~~~v~D~~~g~~~~~~~a~~gRHFyGHg~fs~dG~~LytTEnd~~~g~G~IgVy   82 (305)
T PF07433_consen    8 HGVAAHPTRPEAVAFAR-----RPGTFALVFDCRTGQLLQRLWAPPGRHFYGHGVFSPDGRLLYTTENDYETGRGVIGVY   82 (305)
T ss_pred             cceeeCCCCCeEEEEEe-----CCCcEEEEEEcCCCceeeEEcCCCCCEEecCEEEcCCCCEEEEeccccCCCcEEEEEE
Confidence            444444  456677764     334567899999987543333355554  46666666654333332222   677899


Q ss_pred             eCCCCcEEEecCCCCccccee-EEEE-CC-EEEEE-ccCCCCCCc-----e------EEEEEeCCCCceeec---C-CCC
Q 045821          186 DPRTNNWKLHTEPNIFTEIED-SFVM-DG-KIYIR-CSASAATSH-----V------CALVYEPSTDSWLHA---D-ANM  246 (352)
Q Consensus       186 d~~t~~W~~~~~~~~~~~~~~-~~~~-~g-~iyv~-GG~~~~~~~-----~------~i~~yd~~~~~W~~~---~-~~~  246 (352)
                      |.. +..+++...+...-..+ ...+ +| .|.|. ||...+...     +      ++...|..+++-.+.   + ...
T Consensus        83 d~~-~~~~ri~E~~s~GIGPHel~l~pDG~tLvVANGGI~Thpd~GR~kLNl~tM~psL~~ld~~sG~ll~q~~Lp~~~~  161 (305)
T PF07433_consen   83 DAA-RGYRRIGEFPSHGIGPHELLLMPDGETLVVANGGIETHPDSGRAKLNLDTMQPSLVYLDARSGALLEQVELPPDLH  161 (305)
T ss_pred             ECc-CCcEEEeEecCCCcChhhEEEcCCCCEEEEEcCCCccCcccCceecChhhcCCceEEEecCCCceeeeeecCcccc
Confidence            998 67888776554433332 3333 55 45554 444433111     1      555666776654322   1 111


Q ss_pred             CCCCCCcEEEECCEEEEEecCCC------CeEEEEeCCCCceEEccCCCC----CcCCCCceEEEeC-C-EEEEEcCCc-
Q 045821          247 ASGWRGPAVVVDDALYVLDQSSG------TKLMMWQKESREWSPVGRLST----LLTRPPCKLVAIG-K-TIFVIGKGC-  313 (352)
Q Consensus       247 ~~~~~~~~~~~~~~l~~~gg~~~------~~v~~yd~~~~~W~~~~~~~~----~~~~~~~~~~~~~-~-~i~v~gG~~-  313 (352)
                      .....+.++.-+|.+++-.-+.+      --+..++.... -..+ .+|.    .+..+..+++... + .|.+-.-.- 
T Consensus       162 ~lSiRHLa~~~~G~V~~a~Q~qg~~~~~~PLva~~~~g~~-~~~~-~~p~~~~~~l~~Y~gSIa~~~~g~~ia~tsPrGg  239 (305)
T PF07433_consen  162 QLSIRHLAVDGDGTVAFAMQYQGDPGDAPPLVALHRRGGA-LRLL-PAPEEQWRRLNGYIGSIAADRDGRLIAVTSPRGG  239 (305)
T ss_pred             ccceeeEEecCCCcEEEEEecCCCCCccCCeEEEEcCCCc-ceec-cCChHHHHhhCCceEEEEEeCCCCEEEEECCCCC
Confidence            11222233333444433322322      12334443331 2111 1211    1112223344432 2 344544332 


Q ss_pred             eEEEEEcCCcccccceEEecc
Q 045821          314 SAVVIDVGNIGNIGGIMVSSS  334 (352)
Q Consensus       314 ~~~~~d~~~~~w~~~~~~~~~  334 (352)
                      .+.+||..+..|.....++..
T Consensus       240 ~~~~~d~~tg~~~~~~~l~D~  260 (305)
T PF07433_consen  240 RVAVWDAATGRLLGSVPLPDA  260 (305)
T ss_pred             EEEEEECCCCCEeeccccCce
Confidence            455789999888776655443


No 225
>PF13859 BNR_3:  BNR repeat-like domain; PDB: 3B69_A.
Probab=51.04  E-value=1.8e+02  Score=25.92  Aligned_cols=166  Identities=10%  Similarity=0.115  Sum_probs=74.8

Q ss_pred             eeeEEECCEEEEEeCCCCC--------CceE-EEeCCCCcEEEecCCC---C--c---ccceeEEEECCEEEEEccCCCC
Q 045821          161 FPCGVLNQKIYCIGGLGDT--------HSWD-VYDPRTNNWKLHTEPN---I--F---TEIEDSFVMDGKIYIRCSASAA  223 (352)
Q Consensus       161 ~~~~~~~~~iyv~gG~~~~--------~~v~-~yd~~t~~W~~~~~~~---~--~---~~~~~~~~~~g~iyv~GG~~~~  223 (352)
                      ++++.+++.|+++......        ..+. .......+|+......   .  .   ......++-++.||++-|....
T Consensus         2 PSLV~vgGvv~AvAEa~~~~~~~~~~~~ias~~~~~~g~tw~~~~~~~~~~~~~~~v~v~rPTtvvkgn~IymLvG~y~~   81 (310)
T PF13859_consen    2 PSLVEVGGVVFAVAEAQCKKSNDSGFTDIASEYSTDNGETWKAEVAVLNDDGSKKRVDVSRPTTVVKGNKIYMLVGSYSR   81 (310)
T ss_dssp             EEEEEETTEEEEEEEEESS-S-SSS-EEEEEEEESSSSSS-EEEEEE----SS-TT-EEEEEEEEEETTEEEEEEEEESS
T ss_pred             CCEEEECCEEEEEEEEEEccCCCCCceeEEEeEeeccccccccceeeecccccccccccceeeeeecceeEEEEEEEEec
Confidence            4567778998888764311        1122 2334455797654211   1  1   1123456679999998766542


Q ss_pred             C---CceEEEEE--eCCCCceeecCCCCCCCCC---------CcEEEE-CCEEEEE--ecCCC----CeEEEEeCC-CCc
Q 045821          224 T---SHVCALVY--EPSTDSWLHADANMASGWR---------GPAVVV-DDALYVL--DQSSG----TKLMMWQKE-SRE  281 (352)
Q Consensus       224 ~---~~~~i~~y--d~~~~~W~~~~~~~~~~~~---------~~~~~~-~~~l~~~--gg~~~----~~v~~yd~~-~~~  281 (352)
                      .   ....+..+  +....+|......+.....         +..+.. +|-|.+-  +-..+    ..+.+|..+ ..+
T Consensus        82 ~~~~~~~~llLvks~~~g~~W~~~~~l~~~~~~~~~~figgGGSGV~m~dGTLVFPv~a~~~~~~~~~SlIiYS~d~g~~  161 (310)
T PF13859_consen   82 SAGADDWGLLLVKSTDGGIKWGDTKSLPSTSFQSWKQFIGGGGSGVVMEDGTLVFPVQATKKNGDGTVSLIIYSTDDGKT  161 (310)
T ss_dssp             --SSTTEEEEEEEEESSSSEE---EE-GGGS-EEEEEEEE-SEE-EE-TTS-EEEEEEEEETT---EEEEEEEESSTTSS
T ss_pred             cccccccceeeeeccCCcceeeecccCCchhccccceeecCCCCceEEcCCCEEEEEeeeccCccceEEEEEEECCCccc
Confidence            1   11233333  3344469776432221111         111332 3333211  21111    256788777 788


Q ss_pred             eEEccCCCCCcCCCCceEEEe-CCEEEEEc----CCceEEEEEcCCccccc
Q 045821          282 WSPVGRLSTLLTRPPCKLVAI-GKTIFVIG----KGCSAVVIDVGNIGNIG  327 (352)
Q Consensus       282 W~~~~~~~~~~~~~~~~~~~~-~~~i~v~g----G~~~~~~~d~~~~~w~~  327 (352)
                      |.--..+++. ......++.. +++|+++.    |+.+++.-.-.-.+|.+
T Consensus       162 W~lskg~s~~-gC~~psv~EWe~gkLlM~~~c~~g~rrVYeS~DmG~tWte  211 (310)
T PF13859_consen  162 WKLSKGMSPA-GCSDPSVVEWEDGKLLMMTACDDGRRRVYESGDMGTTWTE  211 (310)
T ss_dssp             -EE-S----T-T-EEEEEEEE-TTEEEEEEE-TTS---EEEESSTTSS-EE
T ss_pred             eEeccccCCC-CcceEEEEeccCCeeEEEEecccceEEEEEEcccceehhh
Confidence            9876666542 3344567777 78999885    44445544344557865


No 226
>KOG0772 consensus Uncharacterized conserved protein, contains WD40 repeat [Function unknown]
Probab=50.70  E-value=2.3e+02  Score=27.06  Aligned_cols=154  Identities=14%  Similarity=0.108  Sum_probs=76.3

Q ss_pred             CcceeeeeEEE--CCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCC--cc-----cceeEEEECCEEEEEccCCCCCCc
Q 045821          156 TARCYFPCGVL--NQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNI--FT-----EIEDSFVMDGKIYIRCSASAATSH  226 (352)
Q Consensus       156 ~~r~~~~~~~~--~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~--~~-----~~~~~~~~~g~iyv~GG~~~~~~~  226 (352)
                      ..|.....+.+  ++++ +.+|+.+ .++..++.  ..|..-+.+..  ++     -.+.++..+|++.+.-|.+.    
T Consensus       315 g~Rv~~tsC~~nrdg~~-iAagc~D-GSIQ~W~~--~~~~v~p~~~vk~AH~~g~~Itsi~FS~dg~~LlSRg~D~----  386 (641)
T KOG0772|consen  315 GKRVPVTSCAWNRDGKL-IAAGCLD-GSIQIWDK--GSRTVRPVMKVKDAHLPGQDITSISFSYDGNYLLSRGFDD----  386 (641)
T ss_pred             CcccCceeeecCCCcch-hhhcccC-Cceeeeec--CCcccccceEeeeccCCCCceeEEEeccccchhhhccCCC----
Confidence            34666666666  4566 5556554 56666665  33433332211  11     11234556888887766553    


Q ss_pred             eEEEEEeCC-----CCceeecCCCCCCCCCCcEEEECCEEEEEecCCC-----CeEEEEeCCCCceEEccCCCCCcCCCC
Q 045821          227 VCALVYEPS-----TDSWLHADANMASGWRGPAVVVDDALYVLDQSSG-----TKLMMWQKESREWSPVGRLSTLLTRPP  296 (352)
Q Consensus       227 ~~i~~yd~~-----~~~W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~-----~~v~~yd~~~~~W~~~~~~~~~~~~~~  296 (352)
                       .+-.+|+.     -+.|+-++...+..  -..+.-+.+|.+.|-..-     ..++.||..  +...+..++..  ...
T Consensus       387 -tLKvWDLrq~kkpL~~~tgL~t~~~~t--dc~FSPd~kli~TGtS~~~~~~~g~L~f~d~~--t~d~v~ki~i~--~aS  459 (641)
T KOG0772|consen  387 -TLKVWDLRQFKKPLNVRTGLPTPFPGT--DCCFSPDDKLILTGTSAPNGMTAGTLFFFDRM--TLDTVYKIDIS--TAS  459 (641)
T ss_pred             -ceeeeeccccccchhhhcCCCccCCCC--ccccCCCceEEEecccccCCCCCceEEEEecc--ceeeEEEecCC--Cce
Confidence             23333432     23455554422211  112344778887775221     368888864  45555444331  111


Q ss_pred             ceEEEe---CCEEEEEcCCceEEE-EEcCCcc
Q 045821          297 CKLVAI---GKTIFVIGKGCSAVV-IDVGNIG  324 (352)
Q Consensus       297 ~~~~~~---~~~i~v~gG~~~~~~-~d~~~~~  324 (352)
                      ..-+..   -++|++-.|.-...+ ||+.+..
T Consensus       460 vv~~~WhpkLNQi~~gsgdG~~~vyYdp~~S~  491 (641)
T KOG0772|consen  460 VVRCLWHPKLNQIFAGSGDGTAHVYYDPNESI  491 (641)
T ss_pred             EEEEeecchhhheeeecCCCceEEEECccccc
Confidence            111122   356776555545554 7776654


No 227
>KOG1517 consensus Guanine nucleotide binding protein MIP1 [Cell cycle control, cell division, chromosome partitioning]
Probab=50.39  E-value=3.3e+02  Score=28.78  Aligned_cols=163  Identities=10%  Similarity=0.062  Sum_probs=82.1

Q ss_pred             eEecCCC-CCcceeeeeEEE---CCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccce-eE---EEECCEEEEEcc
Q 045821          148 WTDAAPM-STARCYFPCGVL---NQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIE-DS---FVMDGKIYIRCS  219 (352)
Q Consensus       148 W~~~~~~-~~~r~~~~~~~~---~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~-~~---~~~~g~iyv~GG  219 (352)
                      |..+.++ +.+|..-.++-.   .|++++.|+.   ..+-++|..+++  .+.++|...... .+   --.+|.+.+.|-
T Consensus      1154 w~~Ls~~~~~~r~~~~v~dWqQ~~G~Ll~tGd~---r~IRIWDa~~E~--~~~diP~~s~t~vTaLS~~~~~gn~i~AGf 1228 (1387)
T KOG1517|consen 1154 WSSLSDQLPGARGTGLVVDWQQQSGHLLVTGDV---RSIRIWDAHKEQ--VVADIPYGSSTLVTALSADLVHGNIIAAGF 1228 (1387)
T ss_pred             eccccccCccCCCCCeeeehhhhCCeEEecCCe---eEEEEEecccce--eEeecccCCCccceeecccccCCceEEEee
Confidence            6666554 455554433333   5889998874   567889988775  344454442222 11   123578888854


Q ss_pred             CCCCCCceEEEEEeCCC-------CceeecCCCCCCCCCCcEEEECCEEEEEecCCCCeEEEEeCCCCceEEccCCCCCc
Q 045821          220 ASAATSHVCALVYEPST-------DSWLHADANMASGWRGPAVVVDDALYVLDQSSGTKLMMWQKESREWSPVGRLSTLL  292 (352)
Q Consensus       220 ~~~~~~~~~i~~yd~~~-------~~W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~~~  292 (352)
                      .++     .+-.||...       +.|++-....+  ..+..+.-+|.--++.|..+.+|..+|+...  ....-+....
T Consensus      1229 aDG-----svRvyD~R~a~~ds~v~~~R~h~~~~~--Iv~~slq~~G~~elvSgs~~G~I~~~DlR~~--~~e~~~~iv~ 1299 (1387)
T KOG1517|consen 1229 ADG-----SVRVYDRRMAPPDSLVCVYREHNDVEP--IVHLSLQRQGLGELVSGSQDGDIQLLDLRMS--SKETFLTIVA 1299 (1387)
T ss_pred             cCC-----ceEEeecccCCccccceeecccCCccc--ceeEEeecCCCcceeeeccCCeEEEEecccC--cccccceeee
Confidence            443     466777522       23443322111  1111122223223444433338999998875  2222111111


Q ss_pred             CC----CCceEEEeCC-EEEEEcCCceEEEEEcCCcc
Q 045821          293 TR----PPCKLVAIGK-TIFVIGKGCSAVVIDVGNIG  324 (352)
Q Consensus       293 ~~----~~~~~~~~~~-~i~v~gG~~~~~~~d~~~~~  324 (352)
                      .+    ...++.++.. -|+.-|+.....||+...+.
T Consensus      1300 ~~~yGs~lTal~VH~hapiiAsGs~q~ikIy~~~G~~ 1336 (1387)
T KOG1517|consen 1300 HWEYGSALTALTVHEHAPIIASGSAQLIKIYSLSGEQ 1336 (1387)
T ss_pred             ccccCccceeeeeccCCCeeeecCcceEEEEecChhh
Confidence            11    1234555543 46666666556678876554


No 228
>PF11768 DUF3312:  Protein of unknown function (DUF3312);  InterPro: IPR024511 This is a eukaryotic family of uncharacterised proteins that contain WD40 repeats.
Probab=50.14  E-value=1.8e+02  Score=28.09  Aligned_cols=107  Identities=14%  Similarity=0.076  Sum_probs=56.7

Q ss_pred             CCEEEEEccCCCC--CCceEEEEEeCCCCceeecCC---CCCCCCCCcEEEECCEEEEEecCCCCeEEEEeCCCCceE--
Q 045821          211 DGKIYIRCSASAA--TSHVCALVYEPSTDSWLHADA---NMASGWRGPAVVVDDALYVLDQSSGTKLMMWQKESREWS--  283 (352)
Q Consensus       211 ~g~iyv~GG~~~~--~~~~~i~~yd~~~~~W~~~~~---~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~--  283 (352)
                      +.+|+.+.-.-+.  ....+.-.|+...++-+.+.-   +.+......+...+.+.+++|..++ .|..||...++=.  
T Consensus       218 ~~qi~tVE~s~s~~g~~~~d~ciYE~~r~klqrvsvtsipL~s~v~~ca~sp~E~kLvlGC~Dg-SiiLyD~~~~~t~~~  296 (545)
T PF11768_consen  218 PYQIHTVEQSISVKGEPSADSCIYECSRNKLQRVSVTSIPLPSQVICCARSPSEDKLVLGCEDG-SIILYDTTRGVTLLA  296 (545)
T ss_pred             CcEEEEEEEecCCCCCceeEEEEEEeecCceeEEEEEEEecCCcceEEecCcccceEEEEecCC-eEEEEEcCCCeeeee
Confidence            4456666322211  122255567777666554422   1221111122234667788888877 9999998876433  


Q ss_pred             EccCCCCCcCCCCceEEEeCCEEEEEcCCc-eEEEEEcCCcc
Q 045821          284 PVGRLSTLLTRPPCKLVAIGKTIFVIGKGC-SAVVIDVGNIG  324 (352)
Q Consensus       284 ~~~~~~~~~~~~~~~~~~~~~~i~v~gG~~-~~~~~d~~~~~  324 (352)
                      +.+-.|.      +..-..++.|+++|+.. ...+||..-+.
T Consensus       297 ka~~~P~------~iaWHp~gai~~V~s~qGelQ~FD~ALsp  332 (545)
T PF11768_consen  297 KAEFIPT------LIAWHPDGAIFVVGSEQGELQCFDMALSP  332 (545)
T ss_pred             eecccce------EEEEcCCCcEEEEEcCCceEEEEEeecCc
Confidence            2222222      11122377788777654 45578876544


No 229
>KOG0321 consensus WD40 repeat-containing protein L2DTL [Function unknown]
Probab=49.12  E-value=90  Score=30.42  Aligned_cols=97  Identities=8%  Similarity=0.085  Sum_probs=52.2

Q ss_pred             EEEEEeCCCCceeecCC-CCCCCCCCcEEE----ECCEEEEEecCCCCeEEEEeCCCCceEEccC-CCCCcCCCCceEEE
Q 045821          228 CALVYEPSTDSWLHADA-NMASGWRGPAVV----VDDALYVLDQSSGTKLMMWQKESREWSPVGR-LSTLLTRPPCKLVA  301 (352)
Q Consensus       228 ~i~~yd~~~~~W~~~~~-~~~~~~~~~~~~----~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~-~~~~~~~~~~~~~~  301 (352)
                      .+-.||......+.... ......|..++.    +.++..++....+..+..||.++..-.-... +...-.....++..
T Consensus        75 ~i~l~dt~~~~fr~ee~~lk~~~aH~nAifDl~wapge~~lVsasGDsT~r~Wdvk~s~l~G~~~~~GH~~SvkS~cf~~  154 (720)
T KOG0321|consen   75 GIILFDTKSIVFRLEERQLKKPLAHKNAIFDLKWAPGESLLVSASGDSTIRPWDVKTSRLVGGRLNLGHTGSVKSECFMP  154 (720)
T ss_pred             ceeeecchhhhcchhhhhhcccccccceeEeeccCCCceeEEEccCCceeeeeeeccceeecceeecccccccchhhhcc
Confidence            46677776666651111 111111222221    2355555543333489999999877655431 11111233445666


Q ss_pred             eCCEEEEEcCCce-EEEEEcCCcc
Q 045821          302 IGKTIFVIGKGCS-AVVIDVGNIG  324 (352)
Q Consensus       302 ~~~~i~v~gG~~~-~~~~d~~~~~  324 (352)
                      .+..+++.||++. +.+.|...+.
T Consensus       155 ~n~~vF~tGgRDg~illWD~R~n~  178 (720)
T KOG0321|consen  155 TNPAVFCTGGRDGEILLWDCRCNG  178 (720)
T ss_pred             CCCcceeeccCCCcEEEEEEeccc
Confidence            7888999999886 4467766554


No 230
>PF08662 eIF2A:  Eukaryotic translation initiation factor eIF2A;  InterPro: IPR013979  This entry contains beta propellor domains found in eukaryotic translation initiation factors and TolB domain-containing proteins. 
Probab=48.82  E-value=1.5e+02  Score=24.23  Aligned_cols=64  Identities=11%  Similarity=0.031  Sum_probs=36.2

Q ss_pred             CCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeCCC
Q 045821          167 NQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPST  236 (352)
Q Consensus       167 ~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~  236 (352)
                      +.++.++-|... ..+..||.+..   .+...+..........-+|++.++||....  ...+..||..+
T Consensus        71 g~~favi~g~~~-~~v~lyd~~~~---~i~~~~~~~~n~i~wsP~G~~l~~~g~~n~--~G~l~~wd~~~  134 (194)
T PF08662_consen   71 GNEFAVIYGSMP-AKVTLYDVKGK---KIFSFGTQPRNTISWSPDGRFLVLAGFGNL--NGDLEFWDVRK  134 (194)
T ss_pred             CCEEEEEEccCC-cccEEEcCccc---EeEeecCCCceEEEECCCCCEEEEEEccCC--CcEEEEEECCC
Confidence            456666656333 68899999633   333333323332233347888888776422  23577888873


No 231
>PF13570 PQQ_3:  PQQ-like domain; PDB: 3HXJ_B 3Q54_A.
Probab=48.50  E-value=42  Score=19.09  Aligned_cols=25  Identities=12%  Similarity=0.173  Sum_probs=17.1

Q ss_pred             eEEEeCCEEEEEcCCceEEEEEcCC
Q 045821          298 KLVAIGKTIFVIGKGCSAVVIDVGN  322 (352)
Q Consensus       298 ~~~~~~~~i~v~gG~~~~~~~d~~~  322 (352)
                      +.+..++.+|+.+....++.+|+++
T Consensus        16 ~~~v~~g~vyv~~~dg~l~ald~~t   40 (40)
T PF13570_consen   16 SPAVAGGRVYVGTGDGNLYALDAAT   40 (40)
T ss_dssp             --EECTSEEEEE-TTSEEEEEETT-
T ss_pred             CCEEECCEEEEEcCCCEEEEEeCCC
Confidence            4466789999988877788888763


No 232
>KOG2048 consensus WD40 repeat protein [General function prediction only]
Probab=47.55  E-value=2.9e+02  Score=27.26  Aligned_cols=121  Identities=11%  Similarity=0.129  Sum_probs=68.2

Q ss_pred             ecCCCCccccee--EEEE-CCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCC----CCCCcEE-EECCEEEEEec
Q 045821          195 HTEPNIFTEIED--SFVM-DGKIYIRCSASAATSHVCALVYEPSTDSWLHADANMAS----GWRGPAV-VVDDALYVLDQ  266 (352)
Q Consensus       195 ~~~~~~~~~~~~--~~~~-~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~----~~~~~~~-~~~~~l~~~gg  266 (352)
                      +.+++.......  +... ++++++.-     .....++.++..+.+..++....+.    ..+..++ ..++-|.++++
T Consensus       421 v~~~~~~~~~a~~i~ftid~~k~~~~s-----~~~~~le~~el~~ps~kel~~~~~~~~~~~I~~l~~SsdG~yiaa~~t  495 (691)
T KOG2048|consen  421 VDDVPLALLDASAISFTIDKNKLFLVS-----KNIFSLEEFELETPSFKELKSIQSQAKCPSISRLVVSSDGNYIAAIST  495 (691)
T ss_pred             eccchhhhccceeeEEEecCceEEEEe-----cccceeEEEEecCcchhhhhccccccCCCcceeEEEcCCCCEEEEEec
Confidence            344555443332  2334 45555552     1223678888888887776442222    1222223 24566666664


Q ss_pred             CCCCeEEEEeCCCCceEEcc-CCCCCcCCCCceEE-EeCCEEEEEcCCceEEEEEcCCcc
Q 045821          267 SSGTKLMMWQKESREWSPVG-RLSTLLTRPPCKLV-AIGKTIFVIGKGCSAVVIDVGNIG  324 (352)
Q Consensus       267 ~~~~~v~~yd~~~~~W~~~~-~~~~~~~~~~~~~~-~~~~~i~v~gG~~~~~~~d~~~~~  324 (352)
                      ..  .|++||.++.+-..+. .++.  .....+.. ...+.|.+.-..+.++.||++.+.
T Consensus       496 ~g--~I~v~nl~~~~~~~l~~rln~--~vTa~~~~~~~~~~lvvats~nQv~efdi~~~~  551 (691)
T KOG2048|consen  496 RG--QIFVYNLETLESHLLKVRLNI--DVTAAAFSPFVRNRLVVATSNNQVFEFDIEARN  551 (691)
T ss_pred             cc--eEEEEEcccceeecchhccCc--ceeeeeccccccCcEEEEecCCeEEEEecchhh
Confidence            43  8999999999887776 3432  12222222 235677777767678889996544


No 233
>PRK01029 tolB translocation protein TolB; Provisional
Probab=47.15  E-value=2.5e+02  Score=26.34  Aligned_cols=59  Identities=14%  Similarity=0.111  Sum_probs=37.2

Q ss_pred             eEEEEEeCCCCceeecCCCCCCCCCCcEEEECCE-EEEEecCCC-CeEEEEeCCCCceEEcc
Q 045821          227 VCALVYEPSTDSWLHADANMASGWRGPAVVVDDA-LYVLDQSSG-TKLMMWQKESREWSPVG  286 (352)
Q Consensus       227 ~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~-l~~~gg~~~-~~v~~yd~~~~~W~~~~  286 (352)
                      ..++.||+.+++.+.+.... .........-+++ |++.....+ ..++.+|..++....+.
T Consensus       351 ~~I~v~dl~~g~~~~Lt~~~-~~~~~p~wSpDG~~L~f~~~~~g~~~L~~vdl~~g~~~~Lt  411 (428)
T PRK01029        351 RQICVYDLATGRDYQLTTSP-ENKESPSWAIDSLHLVYSAGNSNESELYLISLITKKTRKIV  411 (428)
T ss_pred             cEEEEEECCCCCeEEccCCC-CCccceEECCCCCEEEEEECCCCCceEEEEECCCCCEEEee
Confidence            47899999999888875431 1112233333554 554443322 57999999988887775


No 234
>PF11134 Phage_stabilise:  Phage stabilisation protein;  InterPro: IPR021098 This entry represents the Bacteriophage P22, Gp10, DNA-stabilising protein. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. Members of this family are phage proteins involved with stabilising the head assembly unit and condensed DNA within the capsid [].
Probab=46.75  E-value=2.5e+02  Score=26.29  Aligned_cols=112  Identities=9%  Similarity=0.051  Sum_probs=57.3

Q ss_pred             eeeeEEECCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcc---cc----eeE-----EEE-CCEEEEEccCCCCCCc
Q 045821          160 YFPCGVLNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFT---EI----EDS-----FVM-DGKIYIRCSASAATSH  226 (352)
Q Consensus       160 ~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~---~~----~~~-----~~~-~g~iyv~GG~~~~~~~  226 (352)
                      .++.+.+++.++.+|..... .-.+|-....+=.+|+..+...   .+    -+.     ... +..+|++      .-.
T Consensus       234 ~~s~~~~~~t~~wlg~~~~G-~~sVy~~~gyq~~RIST~~IE~~l~~ya~~ela~af~et~~f~~h~~l~i------hlp  306 (469)
T PF11134_consen  234 KHSKTKFGNTVAWLGHDATG-APSVYRINGYQASRISTHAIEKALRSYAHDELAIAFMETYQFDGHEFLLI------HLP  306 (469)
T ss_pred             cceeeecCCEEEEeccCCCC-CceEEEecCCceeeeccHHHHHHHHhhccHHHHHHHHHHhhcCceEEEEE------EcC
Confidence            35566678888888874432 2234444444444555322211   11    011     122 3467777      222


Q ss_pred             eEEEEEeCCCCce----eecCCCCCC--CCCCcEEEECCEEEEEecCCCCeEEEEeCCC
Q 045821          227 VCALVYEPSTDSW----LHADANMAS--GWRGPAVVVDDALYVLDQSSGTKLMMWQKES  279 (352)
Q Consensus       227 ~~i~~yd~~~~~W----~~~~~~~~~--~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~  279 (352)
                      ...++||..+++|    ..+......  .+....+..++++. +|...+..+..+++..
T Consensus       307 ~~tlcyD~at~~~~~qw~~l~tg~~~~~~R~~~~~f~~~q~~-VGD~~~g~lg~L~~n~  364 (469)
T PF11134_consen  307 RKTLCYDAATSQWGEQWFILKTGFYDEPYRAIDFMFFDNQIT-VGDKQNGLLGALDFNA  364 (469)
T ss_pred             CceEEEEcccCCcccceEEEeccccCCcceeeeeEEeCCeeE-ecccccceeEEecccc
Confidence            4789999998855    444443322  22233455566654 4554444666666654


No 235
>PF14781 BBS2_N:  Ciliary BBSome complex subunit 2, N-terminal
Probab=44.15  E-value=46  Score=25.46  Aligned_cols=64  Identities=14%  Similarity=0.162  Sum_probs=37.8

Q ss_pred             CCEEEEEecCCCCeEEEEeCCCCceEEccCCCCCcCCCCc-eEEEeCCEEEEEcCCceEEEEEcCCc
Q 045821          258 DDALYVLDQSSGTKLMMWQKESREWSPVGRLSTLLTRPPC-KLVAIGKTIFVIGKGCSAVVIDVGNI  323 (352)
Q Consensus       258 ~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~-~~~~~~~~i~v~gG~~~~~~~d~~~~  323 (352)
                      +..++++|-..  .+..||.+.|+=..-.+++.....-.. .+....+.+.++||.....=||-+-+
T Consensus        63 ~~D~LliGt~t--~llaYDV~~N~d~Fyke~~DGvn~i~~g~~~~~~~~l~ivGGncsi~Gfd~~G~  127 (136)
T PF14781_consen   63 GRDCLLIGTQT--SLLAYDVENNSDLFYKEVPDGVNAIVIGKLGDIPSPLVIVGGNCSIQGFDYEGN  127 (136)
T ss_pred             CcCEEEEeccc--eEEEEEcccCchhhhhhCccceeEEEEEecCCCCCcEEEECceEEEEEeCCCCc
Confidence            45677887655  799999998873222333331100000 11112467999999877777887644


No 236
>KOG1897 consensus Damage-specific DNA binding complex, subunit DDB1 [Replication, recombination and repair]
Probab=43.72  E-value=4e+02  Score=27.83  Aligned_cols=148  Identities=14%  Similarity=0.170  Sum_probs=91.9

Q ss_pred             CEEEEEeC-CCCC---CCCCCeEEEEECCC-CeeEecCCCCCcceeeeeEEECCEEEEEeCCCCCCceEEEeCCCCcEEE
Q 045821          120 KNAYLLGG-CGWS---EDATSEVYCYDASM-NTWTDAAPMSTARCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTNNWKL  194 (352)
Q Consensus       120 ~~iyv~GG-~~~~---~~~~~~~~~~d~~t-~~W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~W~~  194 (352)
                      +..|++|- ...+   ......+.+|.... ++-+.+..+...-+-.+.+.++|++.+.=|    ..+-.|+-.+++  .
T Consensus       787 ~t~~vVGT~~v~Pde~ep~~GRIivfe~~e~~~L~~v~e~~v~Gav~aL~~fngkllA~In----~~vrLye~t~~~--e  860 (1096)
T KOG1897|consen  787 NTYYVVGTGLVYPDENEPVNGRIIVFEFEELNSLELVAETVVKGAVYALVEFNGKLLAGIN----QSVRLYEWTTER--E  860 (1096)
T ss_pred             ceEEEEEEEeeccCCCCcccceEEEEEEecCCceeeeeeeeeccceeehhhhCCeEEEecC----cEEEEEEccccc--e
Confidence            45777764 2111   11234455555554 667777777666666778888998877656    778889777662  2


Q ss_pred             ecCCCCcccce----eEEEECCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEEECCEEEEEecCCCC
Q 045821          195 HTEPNIFTEIE----DSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVVVDDALYVLDQSSGT  270 (352)
Q Consensus       195 ~~~~~~~~~~~----~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~~  270 (352)
                      +. +--....+    ..-+.++.|+|.--    .....+..|+...+...+++....+.|-.++-.+++..|+.+...++
T Consensus       861 Lr-~e~~~~~~~~aL~l~v~gdeI~VgDl----m~Sitll~y~~~eg~f~evArD~~p~Wmtaveil~~d~ylgae~~gN  935 (1096)
T KOG1897|consen  861 LR-IECNISNPIIALDLQVKGDEIAVGDL----MRSITLLQYKGDEGNFEEVARDYNPNWMTAVEILDDDTYLGAENSGN  935 (1096)
T ss_pred             eh-hhhcccCCeEEEEEEecCcEEEEeec----cceEEEEEEeccCCceEEeehhhCccceeeEEEecCceEEeeccccc
Confidence            11 00111112    12345778887521    22347888999999999998888888877777788888877765552


Q ss_pred             -eEEEEeCC
Q 045821          271 -KLMMWQKE  278 (352)
Q Consensus       271 -~v~~yd~~  278 (352)
                       .+.++|-.
T Consensus       936 lf~v~~d~~  944 (1096)
T KOG1897|consen  936 LFTVRKDSD  944 (1096)
T ss_pred             EEEEEecCC
Confidence             23455543


No 237
>KOG1428 consensus Inhibitor of type V adenylyl cyclases/Neuronal presynaptic protein Highwire/PAM/RPM-1 [Signal transduction mechanisms]
Probab=43.31  E-value=3.8e+02  Score=29.80  Aligned_cols=49  Identities=14%  Similarity=0.264  Sum_probs=27.0

Q ss_pred             ECCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEEECCEEEEEecC
Q 045821          210 MDGKIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVVVDDALYVLDQS  267 (352)
Q Consensus       210 ~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~l~~~gg~  267 (352)
                      -+|++|..|-+.      .+=.|. ..+.|.++.-+  .+....++-.++..+..|..
T Consensus       496 ~sGKvYYaGn~t------~~Gl~e-~G~nWmEL~l~--~~IVq~SVG~D~~~~~~~A~  544 (3738)
T KOG1428|consen  496 RSGKVYYAGNGT------RFGLFE-TGNNWMELCLP--EPIVQISVGIDTIMFRSGAG  544 (3738)
T ss_pred             cCccEEEecCcc------EEeEEc-cCCceEEecCC--CceEEEEeccchhheeeccC
Confidence            478999996433      122332 35889998653  22233334445555555544


No 238
>KOG0294 consensus WD40 repeat-containing protein [Function unknown]
Probab=43.16  E-value=2.4e+02  Score=25.11  Aligned_cols=44  Identities=14%  Similarity=0.041  Sum_probs=24.5

Q ss_pred             CeeEecCCCCC-cceee-eeEEECCEEEEEeCCCCCCceEEEeCCCCc
Q 045821          146 NTWTDAAPMST-ARCYF-PCGVLNQKIYCIGGLGDTHSWDVYDPRTNN  191 (352)
Q Consensus       146 ~~W~~~~~~~~-~r~~~-~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~  191 (352)
                      .+|...|.+.. ++... .++.+++...+.||.+  ..+.+||..++.
T Consensus        29 ~~~~l~~lF~~~aH~~sitavAVs~~~~aSGssD--etI~IYDm~k~~   74 (362)
T KOG0294|consen   29 VKPTLKPLFAFSAHAGSITALAVSGPYVASGSSD--ETIHIYDMRKRK   74 (362)
T ss_pred             cceeeeccccccccccceeEEEecceeEeccCCC--CcEEEEeccchh
Confidence            34655554432 33322 3444455544445544  788999998764


No 239
>KOG2048 consensus WD40 repeat protein [General function prediction only]
Probab=42.49  E-value=3.5e+02  Score=26.74  Aligned_cols=147  Identities=10%  Similarity=0.061  Sum_probs=66.5

Q ss_pred             eEEECCEEE-EEeCCCCCCceEEEeCCCCcEEEecCCCCc---ccce----eEEEECCEEEEEccCCCCCCceEEEEEeC
Q 045821          163 CGVLNQKIY-CIGGLGDTHSWDVYDPRTNNWKLHTEPNIF---TEIE----DSFVMDGKIYIRCSASAATSHVCALVYEP  234 (352)
Q Consensus       163 ~~~~~~~iy-v~gG~~~~~~v~~yd~~t~~W~~~~~~~~~---~~~~----~~~~~~g~iyv~GG~~~~~~~~~i~~yd~  234 (352)
                      ....+.... ++||..+ ..+-++|..+++=-.+-.|...   .+..    ++.++.+...+.|-     ..-.+..+|.
T Consensus       159 slsw~~~~~~i~~Gs~D-g~Iriwd~~~~~t~~~~~~~~d~l~k~~~~iVWSv~~Lrd~tI~sgD-----S~G~V~FWd~  232 (691)
T KOG2048|consen  159 SLSWNPTGTKIAGGSID-GVIRIWDVKSGQTLHIITMQLDRLSKREPTIVWSVLFLRDSTIASGD-----SAGTVTFWDS  232 (691)
T ss_pred             EEEecCCccEEEecccC-ceEEEEEcCCCceEEEeeecccccccCCceEEEEEEEeecCcEEEec-----CCceEEEEcc
Confidence            334444444 6666654 3456677776643222222221   1111    23344444445542     2225666776


Q ss_pred             CCCceeecCCCCCCCCCCcEEEECCEEEEEecCCCCeEEEEeCC--CCceEEccCCCCCcCCCCceEEEeCCEEEEEcCC
Q 045821          235 STDSWLHADANMASGWRGPAVVVDDALYVLDQSSGTKLMMWQKE--SREWSPVGRLSTLLTRPPCKLVAIGKTIFVIGKG  312 (352)
Q Consensus       235 ~~~~W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~--~~~W~~~~~~~~~~~~~~~~~~~~~~~i~v~gG~  312 (352)
                      ...+=.+--..........++..++.-++.+|.+. .+..|-..  .++|.....-.. ..+..-+++++++ +.+-||.
T Consensus       233 ~~gTLiqS~~~h~adVl~Lav~~~~d~vfsaGvd~-~ii~~~~~~~~~~wv~~~~r~~-h~hdvrs~av~~~-~l~sgG~  309 (691)
T KOG2048|consen  233 IFGTLIQSHSCHDADVLALAVADNEDRVFSAGVDP-KIIQYSLTTNKSEWVINSRRDL-HAHDVRSMAVIEN-ALISGGR  309 (691)
T ss_pred             cCcchhhhhhhhhcceeEEEEcCCCCeEEEccCCC-ceEEEEecCCccceeeeccccC-Ccccceeeeeecc-eEEecce
Confidence            55543221001111111122333434555566665 67666444  446876653211 1233346677777 5555666


Q ss_pred             ceEEEE
Q 045821          313 CSAVVI  318 (352)
Q Consensus       313 ~~~~~~  318 (352)
                      +....+
T Consensus       310 d~~l~i  315 (691)
T KOG2048|consen  310 DFTLAI  315 (691)
T ss_pred             eeEEEE
Confidence            544433


No 240
>KOG0272 consensus U4/U6 small nuclear ribonucleoprotein Prp4 (contains WD40 repeats) [RNA processing and modification]
Probab=42.33  E-value=2.8e+02  Score=25.68  Aligned_cols=111  Identities=19%  Similarity=0.251  Sum_probs=55.1

Q ss_pred             ECCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCceeecCCC
Q 045821          166 LNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWLHADAN  245 (352)
Q Consensus       166 ~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~  245 (352)
                      .+|.+...||.+.  -.-++|..|..-.-+-.-....-......-+|+...-||.+     +.+-++|+...+=  +-. 
T Consensus       313 ~DGSL~~tGGlD~--~~RvWDlRtgr~im~L~gH~k~I~~V~fsPNGy~lATgs~D-----nt~kVWDLR~r~~--ly~-  382 (459)
T KOG0272|consen  313 PDGSLAATGGLDS--LGRVWDLRTGRCIMFLAGHIKEILSVAFSPNGYHLATGSSD-----NTCKVWDLRMRSE--LYT-  382 (459)
T ss_pred             CCCceeeccCccc--hhheeecccCcEEEEecccccceeeEeECCCceEEeecCCC-----CcEEEeeeccccc--cee-
Confidence            4899999999763  33457777776433211111111111223377777776554     2344444432221  111 


Q ss_pred             CCCCCCCcE-EE---ECCEEEEEecCCCCeEEEEeCCCCceEEccCCC
Q 045821          246 MASGWRGPA-VV---VDDALYVLDQSSGTKLMMWQKESREWSPVGRLS  289 (352)
Q Consensus       246 ~~~~~~~~~-~~---~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~  289 (352)
                      ++....-.+ |.   ..|..++.+++++ .+-+|.  +..|+.+..|.
T Consensus       383 ipAH~nlVS~Vk~~p~~g~fL~TasyD~-t~kiWs--~~~~~~~ksLa  427 (459)
T KOG0272|consen  383 IPAHSNLVSQVKYSPQEGYFLVTASYDN-TVKIWS--TRTWSPLKSLA  427 (459)
T ss_pred             cccccchhhheEecccCCeEEEEcccCc-ceeeec--CCCcccchhhc
Confidence            111111111 22   2577777788766 555554  45577776553


No 241
>KOG0650 consensus WD40 repeat nucleolar protein Bop1, involved in ribosome biogenesis [Translation, ribosomal structure and biogenesis]
Probab=40.66  E-value=2e+02  Score=28.06  Aligned_cols=135  Identities=6%  Similarity=-0.023  Sum_probs=68.4

Q ss_pred             CEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEEC---CEEEEEccCCCCCCceEEEEEeCCCCceeecCC
Q 045821          168 QKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMD---GKIYIRCSASAATSHVCALVYEPSTDSWLHADA  244 (352)
Q Consensus       168 ~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~---g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~  244 (352)
                      +.+-++.+.+....|.+++.....=+  .+..........+.+.   -.++|..-       ..+-.||+....-.+---
T Consensus       534 DYlatV~~~~~~~~VliHQLSK~~sQ--~PF~kskG~vq~v~FHPs~p~lfVaTq-------~~vRiYdL~kqelvKkL~  604 (733)
T KOG0650|consen  534 DYLATVMPDSGNKSVLIHQLSKRKSQ--SPFRKSKGLVQRVKFHPSKPYLFVATQ-------RSVRIYDLSKQELVKKLL  604 (733)
T ss_pred             ceEEEeccCCCcceEEEEeccccccc--CchhhcCCceeEEEecCCCceEEEEec-------cceEEEehhHHHHHHHHh
Confidence            34444444444488889988765421  2222222233334443   24555421       258889987643322111


Q ss_pred             CCCCCCCCcEEEECCEEEEEecCCCCeEEEEeCCCC--ceEEccCCC-------CCcCCCCceEEEeCCEEEEEcCC
Q 045821          245 NMASGWRGPAVVVDDALYVLDQSSGTKLMMWQKESR--EWSPVGRLS-------TLLTRPPCKLVAIGKTIFVIGKG  312 (352)
Q Consensus       245 ~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~--~W~~~~~~~-------~~~~~~~~~~~~~~~~i~v~gG~  312 (352)
                      ....-.+..++.-+|.=+++|++.+ .+..||.+-+  -|..+.--+       .-.....++.+.-++.++||-|.
T Consensus       605 tg~kwiS~msihp~GDnli~gs~d~-k~~WfDldlsskPyk~lr~H~~avr~Va~H~ryPLfas~sdDgtv~Vfhg~  680 (733)
T KOG0650|consen  605 TGSKWISSMSIHPNGDNLILGSYDK-KMCWFDLDLSSKPYKTLRLHEKAVRSVAFHKRYPLFASGSDDGTVIVFHGM  680 (733)
T ss_pred             cCCeeeeeeeecCCCCeEEEecCCC-eeEEEEcccCcchhHHhhhhhhhhhhhhhccccceeeeecCCCcEEEEeee
Confidence            1111112333444555566777776 8888877644  576654211       11122335555667888888763


No 242
>KOG1538 consensus Uncharacterized conserved protein WDR10, contains WD40 repeats [General function prediction only]
Probab=40.43  E-value=3.9e+02  Score=26.66  Aligned_cols=170  Identities=14%  Similarity=0.163  Sum_probs=0.0

Q ss_pred             eeEecCCCCCcceeeeeEEE--CCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECCEEEEEccCC---
Q 045821          147 TWTDAAPMSTARCYFPCGVL--NQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSAS---  221 (352)
Q Consensus       147 ~W~~~~~~~~~r~~~~~~~~--~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~---  221 (352)
                      +|...-..+..++-+..+.-  +..+.+..|    +.+++||+..++--+.-.-.....++.+-..+|+.|..||.+   
T Consensus         2 ~~~~~~r~~~~hci~d~afkPDGsqL~lAAg----~rlliyD~ndG~llqtLKgHKDtVycVAys~dGkrFASG~aDK~V   77 (1081)
T KOG1538|consen    2 RAVLTWRDKAEHCINDIAFKPDGTQLILAAG----SRLLVYDTSDGTLLQPLKGHKDTVYCVAYAKDGKRFASGSADKSV   77 (1081)
T ss_pred             chhhhhhcccccchheeEECCCCceEEEecC----CEEEEEeCCCcccccccccccceEEEEEEccCCceeccCCCceeE


Q ss_pred             -------------CCCCceEEEEEeCCC-----------CceeecCCCCCCCCCCcEEEE-----CCEEEEEecCCC---
Q 045821          222 -------------AATSHVCALVYEPST-----------DSWLHADANMASGWRGPAVVV-----DDALYVLDQSSG---  269 (352)
Q Consensus       222 -------------~~~~~~~i~~yd~~~-----------~~W~~~~~~~~~~~~~~~~~~-----~~~l~~~gg~~~---  269 (352)
                                   +++.......|+|-+           .-|+.-...........-+..     +|.++++|-.+|   
T Consensus        78 I~W~~klEG~LkYSH~D~IQCMsFNP~~h~LasCsLsdFglWS~~qK~V~K~kss~R~~~CsWtnDGqylalG~~nGTIs  157 (1081)
T KOG1538|consen   78 IIWTSKLEGILKYSHNDAIQCMSFNPITHQLASCSLSDFGLWSPEQKSVSKHKSSSRIICCSWTNDGQYLALGMFNGTIS  157 (1081)
T ss_pred             EEecccccceeeeccCCeeeEeecCchHHHhhhcchhhccccChhhhhHHhhhhheeEEEeeecCCCcEEEEeccCceEE


Q ss_pred             ---------------------------------------------CeEEEEeCCCCceEEccCCCCCcCCCCceEEEeCC
Q 045821          270 ---------------------------------------------TKLMMWQKESREWSPVGRLSTLLTRPPCKLVAIGK  304 (352)
Q Consensus       270 ---------------------------------------------~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~~  304 (352)
                                                                   ..+..|..+...-.+-..+.   ....+---..+|
T Consensus       158 iRNk~gEek~~I~Rpgg~Nspiwsi~~~p~sg~G~~di~aV~DW~qTLSFy~LsG~~Igk~r~L~---FdP~CisYf~NG  234 (1081)
T KOG1538|consen  158 IRNKNGEEKVKIERPGGSNSPIWSICWNPSSGEGRNDILAVADWGQTLSFYQLSGKQIGKDRALN---FDPCCISYFTNG  234 (1081)
T ss_pred             eecCCCCcceEEeCCCCCCCCceEEEecCCCCCCccceEEEEeccceeEEEEecceeecccccCC---CCchhheeccCC


Q ss_pred             EEEEEcCCceEEEEEcCCc
Q 045821          305 TIFVIGKGCSAVVIDVGNI  323 (352)
Q Consensus       305 ~i~v~gG~~~~~~~d~~~~  323 (352)
                      +..++||.++..-+-+.+.
T Consensus       235 Ey~LiGGsdk~L~~fTR~G  253 (1081)
T KOG1538|consen  235 EYILLGGSDKQLSLFTRDG  253 (1081)
T ss_pred             cEEEEccCCCceEEEeecC


No 243
>COG4101 Predicted mannose-6-phosphate isomerase [Carbohydrate transport and metabolism]
Probab=40.25  E-value=98  Score=23.01  Aligned_cols=48  Identities=10%  Similarity=0.082  Sum_probs=32.0

Q ss_pred             eCCEEEEEcCCceEEEEE-----cCCcccccceEEeccCCCCCCCCcceeeEEe
Q 045821          302 IGKTIFVIGKGCSAVVID-----VGNIGNIGGIMVSSSIPKLNDNDDIISCKCL  350 (352)
Q Consensus       302 ~~~~i~v~gG~~~~~~~d-----~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~~  350 (352)
                      +...||++.|....++=+     .+. .=.+-+-++..+|+++.|.+-.+|.|+
T Consensus        67 hEtaIYvlsG~ah~w~G~rLE~ha~~-~pGDf~YiPpgVPHqp~N~S~ep~s~v  119 (142)
T COG4101          67 HETAIYVLSGEAHTWYGNRLEEHAEV-GPGDFFYIPPGVPHQPANLSTEPLSAV  119 (142)
T ss_pred             ccEEEEEEeceeeeeeccceeeeEEe-cCCCeEEcCCCCCCcccccCCCCeEEE
Confidence            355699999977665522     221 123345568999999999977777765


No 244
>COG3391 Uncharacterized conserved protein [Function unknown]
Probab=39.81  E-value=3e+02  Score=25.25  Aligned_cols=153  Identities=16%  Similarity=0.175  Sum_probs=83.7

Q ss_pred             ECCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcceeeeeEE--ECCEEEEEeCCCCCCceEEEeCCCCcEEEe
Q 045821          118 LGKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCYFPCGV--LNQKIYCIGGLGDTHSWDVYDPRTNNWKLH  195 (352)
Q Consensus       118 ~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~--~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~  195 (352)
                      .++++|+..+.      .+.+.++|..+++-...  .+..+..+..+.  .++.+||.......+.+.+.|..+++=...
T Consensus        84 ~~~~vyv~~~~------~~~v~vid~~~~~~~~~--~~vG~~P~~~~~~~~~~~vYV~n~~~~~~~vsvid~~t~~~~~~  155 (381)
T COG3391          84 AGNKVYVTTGD------SNTVSVIDTATNTVLGS--IPVGLGPVGLAVDPDGKYVYVANAGNGNNTVSVIDAATNKVTAT  155 (381)
T ss_pred             CCCeEEEecCC------CCeEEEEcCcccceeeE--eeeccCCceEEECCCCCEEEEEecccCCceEEEEeCCCCeEEEE
Confidence            46779998653      56788888666653322  222223233332  356899987643238889999998864433


Q ss_pred             cCCCCcccceeEEE-ECCEEEEEccCCCCCCceEEEEEeCCCCceeecCC---CCCCCCCCc--EEE-ECCEEEEEecCC
Q 045821          196 TEPNIFTEIEDSFV-MDGKIYIRCSASAATSHVCALVYEPSTDSWLHADA---NMASGWRGP--AVV-VDDALYVLDQSS  268 (352)
Q Consensus       196 ~~~~~~~~~~~~~~-~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~---~~~~~~~~~--~~~-~~~~l~~~gg~~  268 (352)
                      .+-.... ...++. .++++|+.-     ...+.+..+|.+++.=.. +.   .........  .+. .+.++|+....+
T Consensus       156 ~~vG~~P-~~~a~~p~g~~vyv~~-----~~~~~v~vi~~~~~~v~~-~~~~~~~~~~~~P~~i~v~~~g~~~yV~~~~~  228 (381)
T COG3391         156 IPVGNTP-TGVAVDPDGNKVYVTN-----SDDNTVSVIDTSGNSVVR-GSVGSLVGVGTGPAGIAVDPDGNRVYVANDGS  228 (381)
T ss_pred             EecCCCc-ceEEECCCCCeEEEEe-----cCCCeEEEEeCCCcceec-cccccccccCCCCceEEECCCCCEEEEEeccC
Confidence            1111111 222222 345689884     223467888887664332 21   111111111  232 355688886655


Q ss_pred             C-CeEEEEeCCCCceEEc
Q 045821          269 G-TKLMMWQKESREWSPV  285 (352)
Q Consensus       269 ~-~~v~~yd~~~~~W~~~  285 (352)
                      . ..+.++|..++.-...
T Consensus       229 ~~~~v~~id~~~~~v~~~  246 (381)
T COG3391         229 GSNNVLKIDTATGNVTAT  246 (381)
T ss_pred             CCceEEEEeCCCceEEEe
Confidence            2 4788888888665444


No 245
>PRK01029 tolB translocation protein TolB; Provisional
Probab=39.71  E-value=3.2e+02  Score=25.56  Aligned_cols=185  Identities=8%  Similarity=-0.015  Sum_probs=82.5

Q ss_pred             CCeEEEEECCCCeeEecCCCCCcceeeeeEEECCEEEEEeCCCCCCceEE--EeCCCC---cEEEecCCCCcccceeEEE
Q 045821          135 TSEVYCYDASMNTWTDAAPMSTARCYFPCGVLNQKIYCIGGLGDTHSWDV--YDPRTN---NWKLHTEPNIFTEIEDSFV  209 (352)
Q Consensus       135 ~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~~~v~~--yd~~t~---~W~~~~~~~~~~~~~~~~~  209 (352)
                      ...+|++|..+++-+.+...+..........-+.+|..........+++.  ||..++   .=+++..............
T Consensus       210 ~~~I~~~~l~~g~~~~lt~~~g~~~~p~wSPDG~~Laf~s~~~g~~di~~~~~~~~~g~~g~~~~lt~~~~~~~~~p~wS  289 (428)
T PRK01029        210 VPKIFLGSLENPAGKKILALQGNQLMPTFSPRKKLLAFISDRYGNPDLFIQSFSLETGAIGKPRRLLNEAFGTQGNPSFS  289 (428)
T ss_pred             CceEEEEECCCCCceEeecCCCCccceEECCCCCEEEEEECCCCCcceeEEEeecccCCCCcceEeecCCCCCcCCeEEC
Confidence            36899999998887777665443222211111335544432211133333  555442   1122221111111112333


Q ss_pred             ECCE-EEEEccCCCCCCceEEEEEeCCC--CceeecCCCCCCCCCCcEEEECC-EEEEEecCCC-CeEEEEeCCCCceEE
Q 045821          210 MDGK-IYIRCSASAATSHVCALVYEPST--DSWLHADANMASGWRGPAVVVDD-ALYVLDQSSG-TKLMMWQKESREWSP  284 (352)
Q Consensus       210 ~~g~-iyv~GG~~~~~~~~~i~~yd~~~--~~W~~~~~~~~~~~~~~~~~~~~-~l~~~gg~~~-~~v~~yd~~~~~W~~  284 (352)
                      -||+ |+.. ....  ....++.++...  ..=+.+... ..........-+| .|++.....+ ..+++||+++++.+.
T Consensus       290 PDG~~Laf~-s~~~--g~~~ly~~~~~~~g~~~~~lt~~-~~~~~~p~wSPDG~~Laf~~~~~g~~~I~v~dl~~g~~~~  365 (428)
T PRK01029        290 PDGTRLVFV-SNKD--GRPRIYIMQIDPEGQSPRLLTKK-YRNSSCPAWSPDGKKIAFCSVIKGVRQICVYDLATGRDYQ  365 (428)
T ss_pred             CCCCEEEEE-ECCC--CCceEEEEECcccccceEEeccC-CCCccceeECCCCCEEEEEEcCCCCcEEEEEECCCCCeEE
Confidence            4665 4433 2211  123566666532  211222211 1111122233344 4555543332 489999999999887


Q ss_pred             ccCCCCCcCCCCceEEEeCCEEEEEc---CCceEEEEEcCCccc
Q 045821          285 VGRLSTLLTRPPCKLVAIGKTIFVIG---KGCSAVVIDVGNIGN  325 (352)
Q Consensus       285 ~~~~~~~~~~~~~~~~~~~~~i~v~g---G~~~~~~~d~~~~~w  325 (352)
                      +...+.  ..........+..|+...   |....+++|+.....
T Consensus       366 Lt~~~~--~~~~p~wSpDG~~L~f~~~~~g~~~L~~vdl~~g~~  407 (428)
T PRK01029        366 LTTSPE--NKESPSWAIDSLHLVYSAGNSNESELYLISLITKKT  407 (428)
T ss_pred             ccCCCC--CccceEECCCCCEEEEEECCCCCceEEEEECCCCCE
Confidence            764321  111222222334455443   334566678766543


No 246
>KOG1446 consensus Histone H3 (Lys4) methyltransferase complex and RNA cleavage factor II complex, subunit SWD2 [RNA processing and modification; Chromatin structure and dynamics; Posttranslational modification, protein turnover, chaperones]
Probab=39.60  E-value=2.7e+02  Score=24.63  Aligned_cols=147  Identities=14%  Similarity=0.146  Sum_probs=75.7

Q ss_pred             CceEEEeCCCCcEEEecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEEECC
Q 045821          180 HSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVVVDD  259 (352)
Q Consensus       180 ~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~  259 (352)
                      ..+..||..+..-.+.-....-..............+.+. .  ..-..|...+..+|+--+.=......-....+.-.+
T Consensus        36 Dsl~LYd~~~g~~~~ti~skkyG~~~~~Fth~~~~~i~sS-t--k~d~tIryLsl~dNkylRYF~GH~~~V~sL~~sP~~  112 (311)
T KOG1446|consen   36 DSLRLYDSLSGKQVKTINSKKYGVDLACFTHHSNTVIHSS-T--KEDDTIRYLSLHDNKYLRYFPGHKKRVNSLSVSPKD  112 (311)
T ss_pred             CeEEEEEcCCCceeeEeecccccccEEEEecCCceEEEcc-C--CCCCceEEEEeecCceEEEcCCCCceEEEEEecCCC
Confidence            6789999998875544323322222233444444444422 2  111345555666666544311111111111222345


Q ss_pred             EEEEEecCCCCeEEEEeCCCCceEEccCCCCCcCCCCceEEEeCCEEEEEc-CCceEEEEEcCCcccccceEEeccCC
Q 045821          260 ALYVLDQSSGTKLMMWQKESREWSPVGRLSTLLTRPPCKLVAIGKTIFVIG-KGCSAVVIDVGNIGNIGGIMVSSSIP  336 (352)
Q Consensus       260 ~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~~~i~v~g-G~~~~~~~d~~~~~w~~~~~~~~~~~  336 (352)
                      ..++-++.+. .+..||+....-.-+-.+..   + ..+..-..|-|+.++ |.+.+-+||.....  ++.+.+..++
T Consensus       113 d~FlS~S~D~-tvrLWDlR~~~cqg~l~~~~---~-pi~AfDp~GLifA~~~~~~~IkLyD~Rs~d--kgPF~tf~i~  183 (311)
T KOG1446|consen  113 DTFLSSSLDK-TVRLWDLRVKKCQGLLNLSG---R-PIAAFDPEGLIFALANGSELIKLYDLRSFD--KGPFTTFSIT  183 (311)
T ss_pred             CeEEecccCC-eEEeeEecCCCCceEEecCC---C-cceeECCCCcEEEEecCCCeEEEEEecccC--CCCceeEccC
Confidence            7777777665 88999988665544432221   1 122222345566555 34356678887654  5667766666


No 247
>PF13088 BNR_2:  BNR repeat-like domain; PDB: 2F11_A 2F0Z_A 1VCU_B 2F25_B 1SO7_A 2F29_A 1SNT_A 2F13_A 2F28_A 2F27_A ....
Probab=39.06  E-value=2.5e+02  Score=24.02  Aligned_cols=143  Identities=13%  Similarity=0.088  Sum_probs=70.2

Q ss_pred             CCEEEEEEecCCCC-ceeEEEe-cCCCCCCCeeEeccCCCcccCCcceEEEEE-CCEEEEEeCCCCCCCCCCeEEEEECC
Q 045821           68 ETWIYAFCRDNKLE-RVCCYVL-DPNSTRRSWKMIHELPARTLRRKGMGFEVL-GKNAYLLGGCGWSEDATSEVYCYDAS  144 (352)
Q Consensus        68 ~~~l~~~gg~~~~~-~~~~~~~-d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~-~~~iyv~GG~~~~~~~~~~~~~~d~~  144 (352)
                      ++.+++..-..... ......+ +-  ...+|+.....+.. ......+++.. ++.|+++--.... .  ...+.+...
T Consensus       118 ~G~l~~~~~~~~~~~~~~~~~~S~D--~G~tW~~~~~~~~~-~~~~e~~~~~~~dG~l~~~~R~~~~-~--~~~~~~S~D  191 (275)
T PF13088_consen  118 DGRLIAPYYHESGGSFSAFVYYSDD--GGKTWSSGSPIPDG-QGECEPSIVELPDGRLLAVFRTEGN-D--DIYISRSTD  191 (275)
T ss_dssp             TTEEEEEEEEESSCEEEEEEEEESS--TTSSEEEEEECECS-EEEEEEEEEEETTSEEEEEEEECSS-T--EEEEEEESS
T ss_pred             CCCEEEEEeeccccCcceEEEEeCC--CCceeecccccccc-CCcceeEEEECCCCcEEEEEEccCC-C--cEEEEEECC
Confidence            56666653222112 2223333 33  56789888766310 11222333333 6678877543211 1  333444555


Q ss_pred             C-CeeEecC--CCCCcceeeeeEEE-CCEEEEEeCCCCC---CceEEEeCCCCcEEEecCCCCcc----cceeEEEE-CC
Q 045821          145 M-NTWTDAA--PMSTARCYFPCGVL-NQKIYCIGGLGDT---HSWDVYDPRTNNWKLHTEPNIFT----EIEDSFVM-DG  212 (352)
Q Consensus       145 t-~~W~~~~--~~~~~r~~~~~~~~-~~~iyv~gG~~~~---~~v~~yd~~t~~W~~~~~~~~~~----~~~~~~~~-~g  212 (352)
                      . .+|....  .+|.+........+ +++++++......   ..+..-.....+|.....+....    .+..++.. +|
T Consensus       192 ~G~TWs~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~r~~l~l~~S~D~g~tW~~~~~i~~~~~~~~~Y~~~~~~~dg  271 (275)
T PF13088_consen  192 GGRTWSPPQPTNLPNPNSSISLVRLSDGRLLLVYNNPDGRSNLSLYVSEDGGKTWSRPKTIDDGPNGDSGYPSLTQLPDG  271 (275)
T ss_dssp             TTSS-EEEEEEECSSCCEEEEEEECTTSEEEEEEECSSTSEEEEEEEECTTCEEEEEEEEEEEEE-CCEEEEEEEEEETT
T ss_pred             CCCcCCCceecccCcccCCceEEEcCCCCEEEEEECCCCCCceEEEEEeCCCCcCCccEEEeCCCCCcEECCeeEEeCCC
Confidence            3 3598754  34555555555554 5688877773222   22222233366898765433322    44455555 67


Q ss_pred             EEEE
Q 045821          213 KIYI  216 (352)
Q Consensus       213 ~iyv  216 (352)
                      +|||
T Consensus       272 ~l~i  275 (275)
T PF13088_consen  272 KLYI  275 (275)
T ss_dssp             EEEE
T ss_pred             cCCC
Confidence            9886


No 248
>KOG1963 consensus WD40 repeat protein [General function prediction only]
Probab=38.64  E-value=4.4e+02  Score=26.80  Aligned_cols=95  Identities=15%  Similarity=0.163  Sum_probs=51.4

Q ss_pred             EEEEEeCCCCceeecCC-CCCCCCCCcE-EEE---CCEEEEEecCCC-CeEEEE------eCCCCceEEccCCCCCcCCC
Q 045821          228 CALVYEPSTDSWLHADA-NMASGWRGPA-VVV---DDALYVLDQSSG-TKLMMW------QKESREWSPVGRLSTLLTRP  295 (352)
Q Consensus       228 ~i~~yd~~~~~W~~~~~-~~~~~~~~~~-~~~---~~~l~~~gg~~~-~~v~~y------d~~~~~W~~~~~~~~~~~~~  295 (352)
                      ..|.|++...+|..... ..|+.-...+ +..   ....++..|.++ -.+|++      ++....|+..+--.-.....
T Consensus       433 KFW~~n~~~kt~~L~T~I~~PH~~~~vat~~~~~~rs~~~vta~~dg~~KiW~~~~~~n~~k~~s~W~c~~i~sy~k~~i  512 (792)
T KOG1963|consen  433 KFWQYNPNSKTFILNTKINNPHGNAFVATIFLNPTRSVRCVTASVDGDFKIWVFTDDSNIYKKSSNWTCKAIGSYHKTPI  512 (792)
T ss_pred             EEEEEcCCcceeEEEEEEecCCCceeEEEEEecCcccceeEEeccCCeEEEEEEecccccCcCccceEEeeeeccccCcc
Confidence            78999999999976533 1222222222 111   111444444443 367887      66677898776322111111


Q ss_pred             CceEEEeCCEEEEEcCCceEEEEEcCC
Q 045821          296 PCKLVAIGKTIFVIGKGCSAVVIDVGN  322 (352)
Q Consensus       296 ~~~~~~~~~~i~v~gG~~~~~~~d~~~  322 (352)
                      ...+..-+|+++..|-.+...++|..+
T Consensus       513 ~a~~fs~dGslla~s~~~~Itiwd~~~  539 (792)
T KOG1963|consen  513 TALCFSQDGSLLAVSFDDTITIWDYDT  539 (792)
T ss_pred             cchhhcCCCcEEEEecCCEEEEecCCC
Confidence            223334466777777666666777766


No 249
>PF13013 F-box-like_2:  F-box-like domain
Probab=37.83  E-value=32  Score=25.27  Aligned_cols=29  Identities=10%  Similarity=0.079  Sum_probs=23.8

Q ss_pred             cCCCcHHHHHHhhccCccccchhhhhhhh
Q 045821           18 ISGLPDDIALFCLARVPRRYHAVLKCVSS   46 (352)
Q Consensus        18 ~~~lp~dl~~~il~r~p~~~~~~l~~v~k   46 (352)
                      +.+||+||...|+..=....+..+...++
T Consensus        22 l~DLP~ELl~~I~~~C~~~~l~~l~~~~~   50 (109)
T PF13013_consen   22 LLDLPWELLQLIFDYCNDPILLALSRTCR   50 (109)
T ss_pred             hhhChHHHHHHHHhhcCcHHHHHHHHHHH
Confidence            77899999999999888887766666655


No 250
>KOG0639 consensus Transducin-like enhancer of split protein (contains WD40 repeats) [Chromatin structure and dynamics]
Probab=37.27  E-value=3.4e+02  Score=25.95  Aligned_cols=34  Identities=12%  Similarity=0.250  Sum_probs=26.3

Q ss_pred             EECCEEEEEecCCCCeEEEEeCCCCceEEccCCCC
Q 045821          256 VVDDALYVLDQSSGTKLMMWQKESREWSPVGRLST  290 (352)
Q Consensus       256 ~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~  290 (352)
                      .-+++-+++||+.. .+.+||...-+=.....++.
T Consensus       474 ~pdgrtLivGGeas-tlsiWDLAapTprikaelts  507 (705)
T KOG0639|consen  474 LPDGRTLIVGGEAS-TLSIWDLAAPTPRIKAELTS  507 (705)
T ss_pred             cCCCceEEeccccc-eeeeeeccCCCcchhhhcCC
Confidence            34888899999854 89999999887766666654


No 251
>COG3823 Glutamine cyclotransferase [Posttranslational modification, protein turnover, chaperones]
Probab=36.93  E-value=2.5e+02  Score=23.53  Aligned_cols=116  Identities=15%  Similarity=0.146  Sum_probs=69.7

Q ss_pred             EEEECCEEEEEccCCCCCCceEEEEEeCCCCc--eeecCCCCCCCCCCcEEEECCEEEEEecCCCCeEEEEeCCCCceEE
Q 045821          207 SFVMDGKIYIRCSASAATSHVCALVYEPSTDS--WLHADANMASGWRGPAVVVDDALYVLDQSSGTKLMMWQKESREWSP  284 (352)
Q Consensus       207 ~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~--W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~~  284 (352)
                      ....+|+||...|..+   ...+.++|..+.+  |++--+ ++..+.-.....++.+|.+-=..+ .-+.||+.  +.++
T Consensus        51 L~~~~g~i~esTG~yg---~S~ir~~~L~~gq~~~s~~l~-~~~~FgEGit~~gd~~y~LTw~eg-vaf~~d~~--t~~~  123 (262)
T COG3823          51 LEYLDGHILESTGLYG---FSKIRVSDLTTGQEIFSEKLA-PDTVFGEGITKLGDYFYQLTWKEG-VAFKYDAD--TLEE  123 (262)
T ss_pred             eeeeCCEEEEeccccc---cceeEEEeccCceEEEEeecC-CccccccceeeccceEEEEEeccc-eeEEEChH--Hhhh
Confidence            3455788888766543   3468888887554  554322 233344445678899998865443 56778764  4566


Q ss_pred             ccCCCCCcCCCCceEEEeCCEEEEEcCCceEEEEEcCCcccccceEE
Q 045821          285 VGRLSTLLTRPPCKLVAIGKTIFVIGKGCSAVVIDVGNIGNIGGIMV  331 (352)
Q Consensus       285 ~~~~~~~~~~~~~~~~~~~~~i~v~gG~~~~~~~d~~~~~w~~~~~~  331 (352)
                      ++..+-  ...+-+++.-+..|.+-.|.....+-||+...=.+..++
T Consensus       124 lg~~~y--~GeGWgLt~d~~~LimsdGsatL~frdP~tfa~~~~v~V  168 (262)
T COG3823         124 LGRFSY--EGEGWGLTSDDKNLIMSDGSATLQFRDPKTFAELDTVQV  168 (262)
T ss_pred             hccccc--CCcceeeecCCcceEeeCCceEEEecCHHHhhhcceEEE
Confidence            665543  244456666666666666766555667766654443333


No 252
>KOG0322 consensus G-protein beta subunit-like protein GNB1L, contains WD repeats [General function prediction only]
Probab=36.78  E-value=2.9e+02  Score=24.08  Aligned_cols=113  Identities=13%  Similarity=0.138  Sum_probs=57.0

Q ss_pred             CCEEEEEeCCCCCCceEEEeCCCC--------cEEEecCCCCcccce-eEEEECCEEEEEccCCCC-CCceEEEEEeCCC
Q 045821          167 NQKIYCIGGLGDTHSWDVYDPRTN--------NWKLHTEPNIFTEIE-DSFVMDGKIYIRCSASAA-TSHVCALVYEPST  236 (352)
Q Consensus       167 ~~~iyv~gG~~~~~~v~~yd~~t~--------~W~~~~~~~~~~~~~-~~~~~~g~iyv~GG~~~~-~~~~~i~~yd~~~  236 (352)
                      +..+++..|+.. ..|..||..++        .|..+.+ ...+... ..+.++ ...- +|..+. .....++.|+-++
T Consensus       163 ~s~~lllaGyEs-ghvv~wd~S~~~~~~~~~~~~kv~~~-~ash~qpvlsldya-s~~~-rGisgga~dkl~~~Sl~~s~  238 (323)
T KOG0322|consen  163 GSTFLLLAGYES-GHVVIWDLSTGDKIIQLPQSSKVESP-NASHKQPVLSLDYA-SSCD-RGISGGADDKLVMYSLNHST  238 (323)
T ss_pred             cceEEEEEeccC-CeEEEEEccCCceeeccccccccccc-hhhccCcceeeeec-hhhc-CCcCCCccccceeeeecccc
Confidence            457888888877 78899999886        2322221 1111111 112222 1111 122211 1112455555444


Q ss_pred             CceeecCC--CCCCCCCCcEEEECCEEEEEecCCCCeEEEEeCCCCceEEccCCC
Q 045821          237 DSWLHADA--NMASGWRGPAVVVDDALYVLDQSSGTKLMMWQKESREWSPVGRLS  289 (352)
Q Consensus       237 ~~W~~~~~--~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~  289 (352)
                      ++-+.-..  ...+.-.+..+-.++||+...|.++ .+-+|+     |+.+..+.
T Consensus       239 gslq~~~e~~lknpGv~gvrIRpD~KIlATAGWD~-RiRVys-----wrtl~pLA  287 (323)
T KOG0322|consen  239 GSLQIRKEITLKNPGVSGVRIRPDGKILATAGWDH-RIRVYS-----WRTLNPLA  287 (323)
T ss_pred             CcccccceEEecCCCccceEEccCCcEEeecccCC-cEEEEE-----eccCCchh
Confidence            43321111  1112223334667999999999988 788876     55665544


No 253
>KOG0639 consensus Transducin-like enhancer of split protein (contains WD40 repeats) [Chromatin structure and dynamics]
Probab=36.63  E-value=1.1e+02  Score=28.94  Aligned_cols=101  Identities=11%  Similarity=0.216  Sum_probs=50.9

Q ss_pred             CCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCC--cccceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCceeecCC
Q 045821          167 NQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNI--FTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWLHADA  244 (352)
Q Consensus       167 ~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~--~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~  244 (352)
                      |++-.++||+.  ..+-++|..+-+=+.-..++.  ..++..++.-|.++......++     .|.+||+.+.+-..-  
T Consensus       476 dgrtLivGGea--stlsiWDLAapTprikaeltssapaCyALa~spDakvcFsccsdG-----nI~vwDLhnq~~Vrq--  546 (705)
T KOG0639|consen  476 DGRTLIVGGEA--STLSIWDLAAPTPRIKAELTSSAPACYALAISPDAKVCFSCCSDG-----NIAVWDLHNQTLVRQ--  546 (705)
T ss_pred             CCceEEecccc--ceeeeeeccCCCcchhhhcCCcchhhhhhhcCCccceeeeeccCC-----cEEEEEcccceeeec--
Confidence            78888889975  667777777655332222333  2233334445666655533222     366777766543221  


Q ss_pred             CCCCCCCCcE---EEECCEEEEEecCCCCeEEEEeCC
Q 045821          245 NMASGWRGPA---VVVDDALYVLDQSSGTKLMMWQKE  278 (352)
Q Consensus       245 ~~~~~~~~~~---~~~~~~l~~~gg~~~~~v~~yd~~  278 (352)
                       .+...-++.   +..+|.=+..||.++ .+-+||..
T Consensus       547 -fqGhtDGascIdis~dGtklWTGGlDn-tvRcWDlr  581 (705)
T KOG0639|consen  547 -FQGHTDGASCIDISKDGTKLWTGGLDN-TVRCWDLR  581 (705)
T ss_pred             -ccCCCCCceeEEecCCCceeecCCCcc-ceeehhhh
Confidence             111111222   222454455677665 44444443


No 254
>KOG4649 consensus PQQ (pyrrolo-quinoline quinone) repeat protein [Secondary metabolites biosynthesis, transport and catabolism]
Probab=35.52  E-value=3e+02  Score=23.95  Aligned_cols=71  Identities=14%  Similarity=0.191  Sum_probs=42.9

Q ss_pred             EEEEC-CEEEEEeCCCCCCCCCCeEEEEECCCCe--eEecCCCCCcceeeeeEEECCEEEEEeCCCCCCceEEEeCCCC-
Q 045821          115 FEVLG-KNAYLLGGCGWSEDATSEVYCYDASMNT--WTDAAPMSTARCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTN-  190 (352)
Q Consensus       115 ~~~~~-~~iyv~GG~~~~~~~~~~~~~~d~~t~~--W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~-  190 (352)
                      +++++ .+-||+=|     .....+...|+.++.  |+.+-   ..|...++.++++. .|+|-+.  ..++..+.+|+ 
T Consensus        16 LVV~~dskT~v~ig-----SHs~~~~avd~~sG~~~We~il---g~RiE~sa~vvgdf-VV~GCy~--g~lYfl~~~tGs   84 (354)
T KOG4649|consen   16 LVVCNDSKTLVVIG-----SHSGIVIAVDPQSGNLIWEAIL---GVRIECSAIVVGDF-VVLGCYS--GGLYFLCVKTGS   84 (354)
T ss_pred             EEEecCCceEEEEe-----cCCceEEEecCCCCcEEeehhh---CceeeeeeEEECCE-EEEEEcc--CcEEEEEecchh
Confidence            34443 34444444     334567788999886  77654   24666666777776 4556554  45666777766 


Q ss_pred             -cEEEec
Q 045821          191 -NWKLHT  196 (352)
Q Consensus       191 -~W~~~~  196 (352)
                       -|....
T Consensus        85 ~~w~f~~   91 (354)
T KOG4649|consen   85 QIWNFVI   91 (354)
T ss_pred             heeeeee
Confidence             476543


No 255
>COG3386 Gluconolactonase [Carbohydrate transport and metabolism]
Probab=35.20  E-value=3.3e+02  Score=24.28  Aligned_cols=205  Identities=15%  Similarity=0.072  Sum_probs=96.4

Q ss_pred             ceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEE--CCEEEEEeCCCCCCCCCCeEEEEECCCCe-eEecCCC----
Q 045821           82 RVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVL--GKNAYLLGGCGWSEDATSEVYCYDASMNT-WTDAAPM----  154 (352)
Q Consensus        82 ~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~--~~~iyv~GG~~~~~~~~~~~~~~d~~t~~-W~~~~~~----  154 (352)
                      ...++.||+  .++.-+......    ..  .+...+  ++.|....         ..+..+++.+.. ++.+...    
T Consensus        46 ~~~i~r~~~--~~g~~~~~~~p~----~~--~~~~~~d~~g~Lv~~~---------~g~~~~~~~~~~~~t~~~~~~~~~  108 (307)
T COG3386          46 GGRIHRLDP--ETGKKRVFPSPG----GF--SSGALIDAGGRLIACE---------HGVRLLDPDTGGKITLLAEPEDGL  108 (307)
T ss_pred             CCeEEEecC--CcCceEEEECCC----Cc--ccceeecCCCeEEEEc---------cccEEEeccCCceeEEeccccCCC
Confidence            346777788  656544444322    11  223333  33444332         234556665443 3444432    


Q ss_pred             CCcceeeeeEEECCEEEEEeCC----CCC-----CceEEEeCCCCcEEEecCCCCcccceeEEEECC-EEEEEccCCCCC
Q 045821          155 STARCYFPCGVLNQKIYCIGGL----GDT-----HSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDG-KIYIRCSASAAT  224 (352)
Q Consensus       155 ~~~r~~~~~~~~~~~iyv~gG~----~~~-----~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g-~iyv~GG~~~~~  224 (352)
                      +.-|..-..+--+|.+|+-.-.    ...     ..+++||+. ....++-.-........+..-++ .+|+.-     .
T Consensus       109 ~~~r~ND~~v~pdG~~wfgt~~~~~~~~~~~~~~G~lyr~~p~-g~~~~l~~~~~~~~NGla~SpDg~tly~aD-----T  182 (307)
T COG3386         109 PLNRPNDGVVDPDGRIWFGDMGYFDLGKSEERPTGSLYRVDPD-GGVVRLLDDDLTIPNGLAFSPDGKTLYVAD-----T  182 (307)
T ss_pred             CcCCCCceeEcCCCCEEEeCCCccccCccccCCcceEEEEcCC-CCEEEeecCcEEecCceEECCCCCEEEEEe-----C
Confidence            2223333444445666664332    011     678999984 55555432222333333444466 677762     2


Q ss_pred             CceEEEEEeCCC------Cc--eeecCCCCCCCCCCcEEEECCEEEEEecCCCCeEEEEeCCCCceEEccCCCCCcCCCC
Q 045821          225 SHVCALVYEPST------DS--WLHADANMASGWRGPAVVVDDALYVLDQSSGTKLMMWQKESREWSPVGRLSTLLTRPP  296 (352)
Q Consensus       225 ~~~~i~~yd~~~------~~--W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~  296 (352)
                      ....+++|+...      ++  +..... .+..-=+..+-.+|.||+.....+..|.+|+++...-.++ .+|.. .-..
T Consensus       183 ~~~~i~r~~~d~~~g~~~~~~~~~~~~~-~~G~PDG~~vDadG~lw~~a~~~g~~v~~~~pdG~l~~~i-~lP~~-~~t~  259 (307)
T COG3386         183 PANRIHRYDLDPATGPIGGRRGFVDFDE-EPGLPDGMAVDADGNLWVAAVWGGGRVVRFNPDGKLLGEI-KLPVK-RPTN  259 (307)
T ss_pred             CCCeEEEEecCcccCccCCcceEEEccC-CCCCCCceEEeCCCCEEEecccCCceEEEECCCCcEEEEE-ECCCC-CCcc
Confidence            224677776542      11  111111 0011112234457888875554445899999994444433 45531 1111


Q ss_pred             ceEEE-eCCEEEEEcCC
Q 045821          297 CKLVA-IGKTIFVIGKG  312 (352)
Q Consensus       297 ~~~~~-~~~~i~v~gG~  312 (352)
                      +++.. -.+.|||..-.
T Consensus       260 ~~FgG~~~~~L~iTs~~  276 (307)
T COG3386         260 PAFGGPDLNTLYITSAR  276 (307)
T ss_pred             ceEeCCCcCEEEEEecC
Confidence            11111 13678887643


No 256
>PF08662 eIF2A:  Eukaryotic translation initiation factor eIF2A;  InterPro: IPR013979  This entry contains beta propellor domains found in eukaryotic translation initiation factors and TolB domain-containing proteins. 
Probab=34.75  E-value=2.5e+02  Score=22.84  Aligned_cols=68  Identities=16%  Similarity=0.186  Sum_probs=37.8

Q ss_pred             CCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcceeeeeEEECCEEEEEeCCCCC-CceEEEeCCCCcEEEec
Q 045821          119 GKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCYFPCGVLNQKIYCIGGLGDT-HSWDVYDPRTNNWKLHT  196 (352)
Q Consensus       119 ~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~-~~v~~yd~~t~~W~~~~  196 (352)
                      ++++.++-|.     ....+..||.....-..+   +........-.-+|+..+++|.... ..+..||..+  .+.+.
T Consensus        71 g~~favi~g~-----~~~~v~lyd~~~~~i~~~---~~~~~n~i~wsP~G~~l~~~g~~n~~G~l~~wd~~~--~~~i~  139 (194)
T PF08662_consen   71 GNEFAVIYGS-----MPAKVTLYDVKGKKIFSF---GTQPRNTISWSPDGRFLVLAGFGNLNGDLEFWDVRK--KKKIS  139 (194)
T ss_pred             CCEEEEEEcc-----CCcccEEEcCcccEeEee---cCCCceEEEECCCCCEEEEEEccCCCcEEEEEECCC--CEEee
Confidence            5566666552     123788899864333333   2211111111126788888886543 6789999984  44454


No 257
>PF09910 DUF2139:  Uncharacterized protein conserved in archaea (DUF2139);  InterPro: IPR016675 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=34.48  E-value=3.3e+02  Score=24.16  Aligned_cols=112  Identities=13%  Similarity=0.141  Sum_probs=66.2

Q ss_pred             CCEEEEEEecCCCCceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEECCCCe
Q 045821           68 ETWIYAFCRDNKLERVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDASMNT  147 (352)
Q Consensus        68 ~~~l~~~gg~~~~~~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~  147 (352)
                      ++.|++.-+ +.....-+|..|.  .++.-+.+..-|.   ..   .....+..+|  += +.-......+.++|..+++
T Consensus       117 ~D~LLlAR~-DGh~nLGvy~ldr--~~g~~~~L~~~ps---~K---G~~~~D~a~F--~i-~~~~~g~~~i~~~Dli~~~  184 (339)
T PF09910_consen  117 EDRLLLARA-DGHANLGVYSLDR--RTGKAEKLSSNPS---LK---GTLVHDYACF--GI-NNFHKGVSGIHCLDLISGK  184 (339)
T ss_pred             cCEEEEEec-CCcceeeeEEEcc--cCCceeeccCCCC---cC---ceEeeeeEEE--ec-cccccCCceEEEEEccCCe
Confidence            677776533 3445666888898  8998888877662   11   1222232333  22 3333567889999999999


Q ss_pred             e--EecCCCCC-------cceeeeeEEECCEEEEEeCCCCCCceEEEeCCC-CcEEEe
Q 045821          148 W--TDAAPMST-------ARCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRT-NNWKLH  195 (352)
Q Consensus       148 W--~~~~~~~~-------~r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t-~~W~~~  195 (352)
                      |  +..+.-..       -+..-.++...+++|++-+    ..+.+.||.. +..+..
T Consensus       185 ~~~e~f~~~~s~Dg~~~~~~~~G~~~s~ynR~faF~r----GGi~vgnP~~~e~~~f~  238 (339)
T PF09910_consen  185 WVIESFDVSLSVDGGPVIRPELGAMASAYNRLFAFVR----GGIFVGNPYNGEEFRFY  238 (339)
T ss_pred             EEEEecccccCCCCCceEeeccccEEEEeeeEEEEEe----ccEEEeCCCCCCceeEE
Confidence            9  44432111       1223345556677777654    3478899984 344333


No 258
>KOG2106 consensus Uncharacterized conserved protein, contains HELP and WD40 domains [Function unknown]
Probab=34.09  E-value=4.3e+02  Score=25.32  Aligned_cols=98  Identities=13%  Similarity=0.229  Sum_probs=55.2

Q ss_pred             CCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCC-CCc--EEEECCEEEEEecCCCCeEEEEeCCCCceEEccC
Q 045821          211 DGKIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGW-RGP--AVVVDDALYVLDQSSGTKLMMWQKESREWSPVGR  287 (352)
Q Consensus       211 ~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~-~~~--~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~  287 (352)
                      +..+.+..|..      .++.+++..+.-.+-......+. ...  .+..+|.=.+.|...+ .|.+|++.++..++...
T Consensus       212 d~nliit~Gk~------H~~Fw~~~~~~l~k~~~~fek~ekk~Vl~v~F~engdviTgDS~G-~i~Iw~~~~~~~~k~~~  284 (626)
T KOG2106|consen  212 DPNLIITCGKG------HLYFWTLRGGSLVKRQGIFEKREKKFVLCVTFLENGDVITGDSGG-NILIWSKGTNRISKQVH  284 (626)
T ss_pred             CCcEEEEeCCc------eEEEEEccCCceEEEeeccccccceEEEEEEEcCCCCEEeecCCc-eEEEEeCCCceEEeEee
Confidence            44555554543      37777888887765543222221 111  1233444445554444 99999999888776543


Q ss_pred             CCCCcCCCCceEEEe-CCEEEEEcCCceEEE-EE
Q 045821          288 LSTLLTRPPCKLVAI-GKTIFVIGKGCSAVV-ID  319 (352)
Q Consensus       288 ~~~~~~~~~~~~~~~-~~~i~v~gG~~~~~~-~d  319 (352)
                       .  ....-++++.+ +++|+- ||.+...+ .|
T Consensus       285 -a--H~ggv~~L~~lr~GtllS-GgKDRki~~Wd  314 (626)
T KOG2106|consen  285 -A--HDGGVFSLCMLRDGTLLS-GGKDRKIILWD  314 (626)
T ss_pred             -e--cCCceEEEEEecCccEee-cCccceEEecc
Confidence             1  23445666666 455666 88775443 44


No 259
>TIGR03032 conserved hypothetical protein TIGR03032. This protein family is uncharacterized. A number of motifs are conserved perfectly among all member sequences. The function of this protein is unknown.
Probab=33.79  E-value=3.5e+02  Score=24.25  Aligned_cols=33  Identities=21%  Similarity=0.407  Sum_probs=28.9

Q ss_pred             ECCEEEEEecCCCCeEEEEeCCCCceEEccCCCC
Q 045821          257 VDDALYVLDQSSGTKLMMWQKESREWSPVGRLST  290 (352)
Q Consensus       257 ~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~  290 (352)
                      .+|+||+..-..+ .+..+|+++++.+.+..+|.
T Consensus       211 hdgrLwvldsgtG-ev~~vD~~~G~~e~Va~vpG  243 (335)
T TIGR03032       211 YQGKLWLLNSGRG-ELGYVDPQAGKFQPVAFLPG  243 (335)
T ss_pred             eCCeEEEEECCCC-EEEEEcCCCCcEEEEEECCC
Confidence            4899999987666 99999999999999988876


No 260
>KOG0299 consensus U3 snoRNP-associated protein (contains WD40 repeats) [RNA processing and modification]
Probab=32.62  E-value=4.3e+02  Score=24.85  Aligned_cols=141  Identities=9%  Similarity=0.080  Sum_probs=69.5

Q ss_pred             EEECCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCccee-eeeEEEC--CEEEEEeCCCCCCceEEEeCCCCcE
Q 045821          116 EVLGKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCY-FPCGVLN--QKIYCIGGLGDTHSWDVYDPRTNNW  192 (352)
Q Consensus       116 ~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~-~~~~~~~--~~iyv~gG~~~~~~v~~yd~~t~~W  192 (352)
                      +..+++.+++||.+      ..+.++|+.|.+=  +..++.-|-. .+.+...  +.+|..+- +  .++-+|+......
T Consensus       210 vS~Dgkylatgg~d------~~v~Iw~~~t~eh--v~~~~ghr~~V~~L~fr~gt~~lys~s~-D--rsvkvw~~~~~s~  278 (479)
T KOG0299|consen  210 VSSDGKYLATGGRD------RHVQIWDCDTLEH--VKVFKGHRGAVSSLAFRKGTSELYSASA-D--RSVKVWSIDQLSY  278 (479)
T ss_pred             EcCCCcEEEecCCC------ceEEEecCcccch--hhcccccccceeeeeeecCccceeeeec-C--CceEEEehhHhHH
Confidence            33477888888853      4467888887642  2222222221 2222222  35666442 1  4455555544332


Q ss_pred             EEecCCCCcccceeE------EEECCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEEECCEEEEEec
Q 045821          193 KLHTEPNIFTEIEDS------FVMDGKIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVVVDDALYVLDQ  266 (352)
Q Consensus       193 ~~~~~~~~~~~~~~~------~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~l~~~gg  266 (352)
                        +.    .-++|+.      +.-.++...+||.+.     .+..|+... .-+.+-.. .....-.+++++..-|+.|+
T Consensus       279 --ve----tlyGHqd~v~~IdaL~reR~vtVGgrDr-----T~rlwKi~e-esqlifrg-~~~sidcv~~In~~HfvsGS  345 (479)
T KOG0299|consen  279 --VE----TLYGHQDGVLGIDALSRERCVTVGGRDR-----TVRLWKIPE-ESQLIFRG-GEGSIDCVAFINDEHFVSGS  345 (479)
T ss_pred             --HH----HHhCCccceeeechhcccceEEeccccc-----eeEEEeccc-cceeeeeC-CCCCeeeEEEecccceeecc
Confidence              21    1222221      122467777877664     344444411 11111110 01111224567888899998


Q ss_pred             CCCCeEEEEeCCCCc
Q 045821          267 SSGTKLMMWQKESRE  281 (352)
Q Consensus       267 ~~~~~v~~yd~~~~~  281 (352)
                      .+| .|..++.-.+.
T Consensus       346 dnG-~IaLWs~~KKk  359 (479)
T KOG0299|consen  346 DNG-SIALWSLLKKK  359 (479)
T ss_pred             CCc-eEEEeeecccC
Confidence            877 88888776654


No 261
>KOG0308 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=32.02  E-value=3.2e+02  Score=26.94  Aligned_cols=95  Identities=7%  Similarity=0.125  Sum_probs=48.6

Q ss_pred             CceEEEeCCCCc-EEEecCCCCcccceeEEEE---CCEEEEEccCCCCCCceEEEEEeCCCCce------eecCC--CC-
Q 045821          180 HSWDVYDPRTNN-WKLHTEPNIFTEIEDSFVM---DGKIYIRCSASAATSHVCALVYEPSTDSW------LHADA--NM-  246 (352)
Q Consensus       180 ~~v~~yd~~t~~-W~~~~~~~~~~~~~~~~~~---~g~iyv~GG~~~~~~~~~i~~yd~~~~~W------~~~~~--~~-  246 (352)
                      ..|-++++..+- |- +..+.....+..+.++   ++.+++.||.+.     .|..||.++..=      ..++.  .. 
T Consensus        95 tTVK~W~~~~~~~~c-~stir~H~DYVkcla~~ak~~~lvaSgGLD~-----~IflWDin~~~~~l~~s~n~~t~~sl~s  168 (735)
T KOG0308|consen   95 TTVKVWNAHKDNTFC-MSTIRTHKDYVKCLAYIAKNNELVASGGLDR-----KIFLWDINTGTATLVASFNNVTVNSLGS  168 (735)
T ss_pred             ceEEEeecccCcchh-HhhhhcccchheeeeecccCceeEEecCCCc-----cEEEEEccCcchhhhhhccccccccCCC
Confidence            677777777653 31 1112333333333333   677888887764     466666664422      22211  11 


Q ss_pred             CCC--CCCcEEEECCEEEEEecCCCCeEEEEeCCCCc
Q 045821          247 ASG--WRGPAVVVDDALYVLDQSSGTKLMMWQKESRE  281 (352)
Q Consensus       247 ~~~--~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~  281 (352)
                      .+.  .+..+....+.+++-||..+ .+..||+.+..
T Consensus       169 G~k~siYSLA~N~t~t~ivsGgtek-~lr~wDprt~~  204 (735)
T KOG0308|consen  169 GPKDSIYSLAMNQTGTIIVSGGTEK-DLRLWDPRTCK  204 (735)
T ss_pred             CCccceeeeecCCcceEEEecCccc-ceEEecccccc
Confidence            111  11112223344666677665 88999998743


No 262
>PF06977 SdiA-regulated:  SdiA-regulated;  InterPro: IPR009722 This entry represents a conserved region approximately 100 residues long within a number of hypothetical bacterial proteins that may be regulated by SdiA, a member of the LuxR family of transcriptional regulators []. Some proteins contain the IPR001258 from INTERPRO repeat.; PDB: 3QQZ_A.
Probab=31.79  E-value=2.1e+02  Score=24.53  Aligned_cols=59  Identities=12%  Similarity=0.119  Sum_probs=33.0

Q ss_pred             CCEEEEEecCCCCeEEEEeCCCCceEEccCCC------CCcCCCCceEEE-eCCEEEEEcCCceEEEE
Q 045821          258 DDALYVLDQSSGTKLMMWQKESREWSPVGRLS------TLLTRPPCKLVA-IGKTIFVIGKGCSAVVI  318 (352)
Q Consensus       258 ~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~------~~~~~~~~~~~~-~~~~i~v~gG~~~~~~~  318 (352)
                      .+.||++...+. .+.++|.+.+.-..+.-..      .. ..+.-+++. -+|.|||.+.-+-++.|
T Consensus       182 t~~lliLS~es~-~l~~~d~~G~~~~~~~L~~g~~gl~~~-~~QpEGIa~d~~G~LYIvsEpNlfy~f  247 (248)
T PF06977_consen  182 TGHLLILSDESR-LLLELDRQGRVVSSLSLDRGFHGLSKD-IPQPEGIAFDPDGNLYIVSEPNLFYRF  247 (248)
T ss_dssp             TTEEEEEETTTT-EEEEE-TT--EEEEEE-STTGGG-SS----SEEEEEE-TT--EEEEETTTEEEEE
T ss_pred             CCeEEEEECCCC-eEEEECCCCCEEEEEEeCCcccCcccc-cCCccEEEECCCCCEEEEcCCceEEEe
Confidence            678999987665 8999997766554443211      11 222334443 47899999987666665


No 263
>KOG0265 consensus U5 snRNP-specific protein-like factor and related proteins [RNA processing and modification]
Probab=31.50  E-value=3.7e+02  Score=23.82  Aligned_cols=96  Identities=16%  Similarity=0.205  Sum_probs=46.5

Q ss_pred             CCEEEEEeCCCCCCceEE---EeCCCCcEEEecCCCCcccceeEEEE------C-CEEEEEccCCCCCCceEEEEEeCCC
Q 045821          167 NQKIYCIGGLGDTHSWDV---YDPRTNNWKLHTEPNIFTEIEDSFVM------D-GKIYIRCSASAATSHVCALVYEPST  236 (352)
Q Consensus       167 ~~~iyv~gG~~~~~~v~~---yd~~t~~W~~~~~~~~~~~~~~~~~~------~-g~iyv~GG~~~~~~~~~i~~yd~~~  236 (352)
                      +|..++.||++.  .+..   |.-..|-|..-        .|..+++      + +.|+-. |.+.     .+..+|..+
T Consensus        58 ~gs~~aSgG~Dr--~I~LWnv~gdceN~~~lk--------gHsgAVM~l~~~~d~s~i~S~-gtDk-----~v~~wD~~t  121 (338)
T KOG0265|consen   58 DGSCFASGGSDR--AIVLWNVYGDCENFWVLK--------GHSGAVMELHGMRDGSHILSC-GTDK-----TVRGWDAET  121 (338)
T ss_pred             CCCeEeecCCcc--eEEEEeccccccceeeec--------cccceeEeeeeccCCCEEEEe-cCCc-----eEEEEeccc
Confidence            566777777763  3333   55556667532        2222222      3 345555 4442     578888887


Q ss_pred             CceeecCCCCCCCCCCcE--EEECCEEEEEecCCCCeEEEEeCCCC
Q 045821          237 DSWLHADANMASGWRGPA--VVVDDALYVLDQSSGTKLMMWQKESR  280 (352)
Q Consensus       237 ~~W~~~~~~~~~~~~~~~--~~~~~~l~~~gg~~~~~v~~yd~~~~  280 (352)
                      ++-..--.  .....-.+  ..--|..++..|.....+-+||..++
T Consensus       122 G~~~rk~k--~h~~~vNs~~p~rrg~~lv~SgsdD~t~kl~D~R~k  165 (338)
T KOG0265|consen  122 GKRIRKHK--GHTSFVNSLDPSRRGPQLVCSGSDDGTLKLWDIRKK  165 (338)
T ss_pred             ceeeehhc--cccceeeecCccccCCeEEEecCCCceEEEEeeccc
Confidence            76443211  00000001  11123333444444448888988754


No 264
>PF00780 CNH:  CNH domain;  InterPro: IPR001180 Based on sequence similarities a domain of homology has been identified in the following proteins []:  Citron and Citron kinase. These two proteins interact with the GTP-bound forms of the small GTPases Rho and Rac but not with Cdc42. Myotonic dystrophy kinase-related Cdc42-binding kinase (MRCKalpha). This serine/threonine kinase interacts with the GTP-bound form of the small GTPase Cdc42 and to a lesser extent with that of Rac. NCK Interacting Kinase (NIK), a serine/threonine protein kinase. ROM-1 and ROM-2, from yeast. These proteins are GDP/GTP exchange proteins (GEPs) for the small GTP binding protein Rho1.  This domain, called the citron homology domain, is often found after cysteine rich and pleckstrin homology (PH) domains at the C-terminal end of the proteins []. It acts as a regulatory domain and could be involved in macromolecular interactions [, ].; GO: 0005083 small GTPase regulator activity
Probab=30.88  E-value=3.4e+02  Score=23.17  Aligned_cols=83  Identities=11%  Similarity=0.069  Sum_probs=42.0

Q ss_pred             EECCEEEEEecCCCCeEEEEeCCCCceEEccCCCCCc--------CCCCceEEEeCCEEEEEcCCceEEEEEcCCccccc
Q 045821          256 VVDDALYVLDQSSGTKLMMWQKESREWSPVGRLSTLL--------TRPPCKLVAIGKTIFVIGKGCSAVVIDVGNIGNIG  327 (352)
Q Consensus       256 ~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~~~--------~~~~~~~~~~~~~i~v~gG~~~~~~~d~~~~~w~~  327 (352)
                      ..++.|++....   ....+|+.+.+=+.+..+....        ......+..+++.-+++...+-....|.......+
T Consensus       146 ~~~~~i~v~~~~---~f~~idl~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~e~Ll~~~~~g~fv~~~G~~~r~  222 (275)
T PF00780_consen  146 FLGNKICVGTSK---GFYLIDLNTGSPSELLDPSDSSSSFKSRNSSSKPLGIFQLSDNEFLLCYDNIGVFVNKNGEPSRK  222 (275)
T ss_pred             EeCCEEEEEeCC---ceEEEecCCCCceEEeCccCCcchhhhcccCCCceEEEEeCCceEEEEecceEEEEcCCCCcCcc
Confidence            345555544322   6788888877666665433211        11123444555334444333333345555544445


Q ss_pred             ceEEeccCCCCCCC
Q 045821          328 GIMVSSSIPKLNDN  341 (352)
Q Consensus       328 ~~~~~~~~~~~~~~  341 (352)
                      ....|.+.|.....
T Consensus       223 ~~i~W~~~p~~~~~  236 (275)
T PF00780_consen  223 STIQWSSAPQSVAY  236 (275)
T ss_pred             cEEEcCCchhEEEE
Confidence            66667776654443


No 265
>KOG3545 consensus Olfactomedin and related extracellular matrix glycoproteins [Extracellular structures]
Probab=30.44  E-value=3.5e+02  Score=23.20  Aligned_cols=190  Identities=11%  Similarity=0.122  Sum_probs=103.6

Q ss_pred             CCEEEEEeCCCCCCCCCCeEEEEEC----CCCeeEecCCCCCcceeeeeEEECCEEEEEeCCCCCCceEEEeCCCC---c
Q 045821          119 GKNAYLLGGCGWSEDATSEVYCYDA----SMNTWTDAAPMSTARCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTN---N  191 (352)
Q Consensus       119 ~~~iyv~GG~~~~~~~~~~~~~~d~----~t~~W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~---~  191 (352)
                      ++++|++.+...     ..+.-|..    ..+.+...=.+|.+-.+-.-++++|.+|.-.+..  ..+..||..+.   .
T Consensus        30 ~~r~~~~~~~~~-----~~l~E~~~~~~~~~~~~~~~~~lp~~~~gTg~VVynGs~yynk~~t--~~ivky~l~~~~~~~  102 (249)
T KOG3545|consen   30 DDRIYVMNYFDG-----LMLTEYTNLEDFKRGRKAEKYRLPYSWDGTGHVVYNGSLYYNKAGT--RNIIKYDLETRTVAG  102 (249)
T ss_pred             cCceEEeccccC-----ceEEEeccHHHhhccCcceEEeCCCCccccceEEEcceEEeeccCC--cceEEEEeecceeee
Confidence            667788744322     23333433    2334444445666666677788888888765433  67889999884   3


Q ss_pred             EEEecCCCCccc---------ceeEEEECCEEEEEccCCCCCCceEEEEEeCC----CCceeecCCCCCCCCCCcEEEEC
Q 045821          192 WKLHTEPNIFTE---------IEDSFVMDGKIYIRCSASAATSHVCALVYEPS----TDSWLHADANMASGWRGPAVVVD  258 (352)
Q Consensus       192 W~~~~~~~~~~~---------~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~----~~~W~~~~~~~~~~~~~~~~~~~  258 (352)
                      |+.++.+.....         ..-.++-.+-|+++=-..+......+...|+.    ..+|..-   .+++..+.++++-
T Consensus       103 ~~~lp~a~y~~~~~y~~~g~sdiD~avDE~GLWviYat~~~~g~iv~skLdp~tl~~e~tW~T~---~~k~~~~~aF~iC  179 (249)
T KOG3545|consen  103 SAALPYAGYHNPSPYYWGGHSDIDLAVDENGLWVIYATPENAGTIVLSKLDPETLEVERTWNTT---LPKRSAGNAFMIC  179 (249)
T ss_pred             eeeccccccCCCcccccCCCccccceecccceeEEecccccCCcEEeeccCHHHhheeeeeccc---cCCCCcCceEEEe
Confidence            555553221111         11234445556666333333333355677773    3456432   3344455566677


Q ss_pred             CEEEEEecCCC--CeE-EEEeCCCCceEEccCCCCCcCCCCceEE---EeCCEEEEEcCCceEEEEEc
Q 045821          259 DALYVLDQSSG--TKL-MMWQKESREWSPVGRLSTLLTRPPCKLV---AIGKTIFVIGKGCSAVVIDV  320 (352)
Q Consensus       259 ~~l~~~gg~~~--~~v-~~yd~~~~~W~~~~~~~~~~~~~~~~~~---~~~~~i~v~gG~~~~~~~d~  320 (352)
                      |-||++-....  ..| +.||..+++=+.+. +|-........+.   ..+.++|+..-.. ...|++
T Consensus       180 GvLY~v~S~~~~~~~i~yaydt~~~~~~~~~-ipf~N~y~~~~~idYNP~D~~LY~wdng~-~l~y~l  245 (249)
T KOG3545|consen  180 GVLYVVHSYNCTHTQISYAYDTTTGTQERID-LPFPNPYSYATMIDYNPRDRRLYAWDNGH-QLTYNL  245 (249)
T ss_pred             eeeEEEeccccCCceEEEEEEcCCCceeccc-ccccchhhhhhccCCCcccceeeEecCCc-EEEEEe
Confidence            88999976554  334 78999988775443 3321122222333   3467899886432 344554


No 266
>KOG0306 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=30.41  E-value=5.9e+02  Score=25.80  Aligned_cols=169  Identities=10%  Similarity=0.045  Sum_probs=81.4

Q ss_pred             CCeEEEEECCCCeeEecCCCCCcc-eeeeeEEECCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEE--EE-
Q 045821          135 TSEVYCYDASMNTWTDAAPMSTAR-CYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSF--VM-  210 (352)
Q Consensus       135 ~~~~~~~d~~t~~W~~~~~~~~~r-~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~--~~-  210 (352)
                      ...+..||..++.-...-.....+ --+.....++.+.+.=||.+ ..+..|+..++.   +.-.+..+.. ++.  -+ 
T Consensus        42 ~E~vn~WdlRtge~~~~l~~~~~k~evt~l~~~~d~l~lAVGYaD-GsVqif~~~s~~---~~~tfngHK~-AVt~l~fd  116 (888)
T KOG0306|consen   42 LEQVNIWDLRTGEIEKKLILLKKKAEVTCLRSSDDILLLAVGYAD-GSVQIFSLESEE---ILITFNGHKA-AVTTLKFD  116 (888)
T ss_pred             cccEeEEeeecchhhhhhhhhcccceEEEeeccCCcceEEEEecC-ceEEeeccCCCc---eeeeeccccc-ceEEEEEc
Confidence            456778888877432221122223 22333334555555555554 678888888772   2111111111 111  12 


Q ss_pred             -CCEEEEEccCCCCCCceEEEEEeCCCCc--eeecCCCCCCCCCCcEEEECCEEEEEecCCCCeEEEEeCCCCceEEccC
Q 045821          211 -DGKIYIRCSASAATSHVCALVYEPSTDS--WLHADANMASGWRGPAVVVDDALYVLDQSSGTKLMMWQKESREWSPVGR  287 (352)
Q Consensus       211 -~g~iyv~GG~~~~~~~~~i~~yd~~~~~--W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~  287 (352)
                       .|.-++.||.++     .|..||.-...  .+.-+.  ...-...-+..+..+.+--+.++ .+-.||.++....+.- 
T Consensus       117 ~~G~rlaSGskDt-----~IIvwDlV~E~Gl~rL~GH--kd~iT~~~F~~~~~~lvS~sKDs-~iK~WdL~tqhCf~Th-  187 (888)
T KOG0306|consen  117 KIGTRLASGSKDT-----DIIVWDLVGEEGLFRLRGH--KDSITQALFLNGDSFLVSVSKDS-MIKFWDLETQHCFETH-  187 (888)
T ss_pred             ccCceEeecCCCc-----cEEEEEeccceeeEEeecc--hHHHhHHhccCCCeEEEEeccCc-eEEEEecccceeeeEE-
Confidence             466777777664     57777764332  111110  00000001222456666555554 7888999988765442 


Q ss_pred             CCCCcCCCCceEEEeCCEEEEEcCCce-EEEEEc
Q 045821          288 LSTLLTRPPCKLVAIGKTIFVIGKGCS-AVVIDV  320 (352)
Q Consensus       288 ~~~~~~~~~~~~~~~~~~i~v~gG~~~-~~~~d~  320 (352)
                      +..  ..-..+++..+ ++.|.+|.+. .-+|++
T Consensus       188 vd~--r~Eiw~l~~~~-~~lvt~~~dse~~v~~L  218 (888)
T KOG0306|consen  188 VDH--RGEIWALVLDE-KLLVTAGTDSELKVWEL  218 (888)
T ss_pred             ecc--cceEEEEEEec-ceEEEEecCCceEEEEe
Confidence            111  01123455545 6666655543 445665


No 267
>KOG2315 consensus Predicted translation initiation factor related to eIF-3a [Translation, ribosomal structure and biogenesis]
Probab=29.80  E-value=5.2e+02  Score=24.98  Aligned_cols=64  Identities=16%  Similarity=0.270  Sum_probs=37.5

Q ss_pred             CCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcceeeeeEEECCEEEEEeCCCCC-CceEEEeCCCC
Q 045821          119 GKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCYFPCGVLNQKIYCIGGLGDT-HSWDVYDPRTN  190 (352)
Q Consensus       119 ~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~-~~v~~yd~~t~  190 (352)
                      +...-|+-|+     .-..+-+||+.-+--..++.-|.-+.++   .-.|+|.++.|.+.- .+|+++|..+.
T Consensus       282 ~~EF~VvyGf-----MPAkvtifnlr~~~v~df~egpRN~~~f---np~g~ii~lAGFGNL~G~mEvwDv~n~  346 (566)
T KOG2315|consen  282 GREFAVVYGF-----MPAKVTIFNLRGKPVFDFPEGPRNTAFF---NPHGNIILLAGFGNLPGDMEVWDVPNR  346 (566)
T ss_pred             CCEEEEEEec-----ccceEEEEcCCCCEeEeCCCCCccceEE---CCCCCEEEEeecCCCCCceEEEeccch
Confidence            3345555553     2355677888765444444333222222   124678888887655 89999999873


No 268
>KOG1188 consensus WD40 repeat protein [General function prediction only]
Probab=29.68  E-value=4.2e+02  Score=23.91  Aligned_cols=176  Identities=9%  Similarity=0.072  Sum_probs=83.8

Q ss_pred             CCeEEEEECCCCee-EecCCCCCcceeeeeEEE--CCEEEEEeCCCCCCceEEEeCCCCc------EEEecCCCCcccce
Q 045821          135 TSEVYCYDASMNTW-TDAAPMSTARCYFPCGVL--NQKIYCIGGLGDTHSWDVYDPRTNN------WKLHTEPNIFTEIE  205 (352)
Q Consensus       135 ~~~~~~~d~~t~~W-~~~~~~~~~r~~~~~~~~--~~~iyv~gG~~~~~~v~~yd~~t~~------W~~~~~~~~~~~~~  205 (352)
                      ..++.+||+.|.+- +.+.-.|..-........  ...++..+ .+  ..|-.||..+..      |+.-+   .....+
T Consensus        49 ngsv~lyd~~tg~~l~~fk~~~~~~N~vrf~~~ds~h~v~s~s-sD--G~Vr~wD~Rs~~e~a~~~~~~~~---~~~f~~  122 (376)
T KOG1188|consen   49 NGSVRLYDKGTGQLLEEFKGPPATTNGVRFISCDSPHGVISCS-SD--GTVRLWDIRSQAESARISWTQQS---GTPFIC  122 (376)
T ss_pred             CCeEEEEeccchhhhheecCCCCcccceEEecCCCCCeeEEec-cC--CeEEEEEeecchhhhheeccCCC---CCcceE
Confidence            35677888888653 233222221111111111  12333332 22  468889988763      32211   112222


Q ss_pred             eEEEECCEEEEEccCCCCCCceEEEEEeCCCCceee-cCCCCCCCCCC---cE--EEECCEEEEEecCCCCeEEEEeCCC
Q 045821          206 DSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWLH-ADANMASGWRG---PA--VVVDDALYVLDQSSGTKLMMWQKES  279 (352)
Q Consensus       206 ~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~-~~~~~~~~~~~---~~--~~~~~~l~~~gg~~~~~v~~yd~~~  279 (352)
                      ...-.+++|...| .........+..||...  |++ +.. ......-   ..  ..-+..+++.|..+| -+-+||.+.
T Consensus       123 ld~nck~~ii~~G-tE~~~s~A~v~lwDvR~--~qq~l~~-~~eSH~DDVT~lrFHP~~pnlLlSGSvDG-LvnlfD~~~  197 (376)
T KOG1188|consen  123 LDLNCKKNIIACG-TELTRSDASVVLWDVRS--EQQLLRQ-LNESHNDDVTQLRFHPSDPNLLLSGSVDG-LVNLFDTKK  197 (376)
T ss_pred             eeccCcCCeEEec-cccccCceEEEEEEecc--ccchhhh-hhhhccCcceeEEecCCCCCeEEeecccc-eEEeeecCC
Confidence            2222367777774 33223334677888643  222 211 0000000   01  123667888887777 788999887


Q ss_pred             CceEEccCCCCCcCCCCceEEE---e-CC--EEEEEcCCceEEEEEcCCcc
Q 045821          280 REWSPVGRLSTLLTRPPCKLVA---I-GK--TIFVIGKGCSAVVIDVGNIG  324 (352)
Q Consensus       280 ~~W~~~~~~~~~~~~~~~~~~~---~-~~--~i~v~gG~~~~~~~d~~~~~  324 (352)
                      +  .+=..|-. ......++..   + ++  +|+++.-.....+|+.+...
T Consensus       198 d--~EeDaL~~-viN~~sSI~~igw~~~~ykrI~clTH~Etf~~~ele~~~  245 (376)
T KOG1188|consen  198 D--NEEDALLH-VINHGSSIHLIGWLSKKYKRIMCLTHMETFAIYELEDGS  245 (376)
T ss_pred             C--cchhhHHH-hhcccceeeeeeeecCCcceEEEEEccCceeEEEccCCC
Confidence            6  22222211 1122222221   2 23  79999877778888877655


No 269
>KOG1274 consensus WD40 repeat protein [General function prediction only]
Probab=29.67  E-value=6.5e+02  Score=26.04  Aligned_cols=60  Identities=12%  Similarity=0.130  Sum_probs=31.9

Q ss_pred             CCEEEEEecCCCCeEEEEeCCCCceEEccCCCCCcCCCCceEEEe--CCEEEEEcCCce-EEEEEcC
Q 045821          258 DDALYVLDQSSGTKLMMWQKESREWSPVGRLSTLLTRPPCKLVAI--GKTIFVIGKGCS-AVVIDVG  321 (352)
Q Consensus       258 ~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~--~~~i~v~gG~~~-~~~~d~~  321 (352)
                      +|.+.+.+-.+  .|.+|+.+  .|+....+........+..+..  +|+-...++.++ +.++|++
T Consensus       200 ~g~la~~~~d~--~Vkvy~r~--~we~~f~Lr~~~~ss~~~~~~wsPnG~YiAAs~~~g~I~vWnv~  262 (933)
T KOG1274|consen  200 GGTLAVPPVDN--TVKVYSRK--GWELQFKLRDKLSSSKFSDLQWSPNGKYIAASTLDGQILVWNVD  262 (933)
T ss_pred             CCeEEeeccCC--eEEEEccC--CceeheeecccccccceEEEEEcCCCcEEeeeccCCcEEEEecc
Confidence            46666665433  78888865  4887776654333332333322  344333444332 4467776


No 270
>PF11768 DUF3312:  Protein of unknown function (DUF3312);  InterPro: IPR024511 This is a eukaryotic family of uncharacterised proteins that contain WD40 repeats.
Probab=28.27  E-value=5.6e+02  Score=24.87  Aligned_cols=96  Identities=10%  Similarity=0.062  Sum_probs=50.5

Q ss_pred             ceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcceee
Q 045821           82 RVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCYF  161 (352)
Q Consensus        82 ~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~~  161 (352)
                      ...+.+|+.  ..+.-+.+.....+.....-++.......-+++|..+      .++..||..++.-....    ...-.
T Consensus       235 ~~d~ciYE~--~r~klqrvsvtsipL~s~v~~ca~sp~E~kLvlGC~D------gSiiLyD~~~~~t~~~k----a~~~P  302 (545)
T PF11768_consen  235 SADSCIYEC--SRNKLQRVSVTSIPLPSQVICCARSPSEDKLVLGCED------GSIILYDTTRGVTLLAK----AEFIP  302 (545)
T ss_pred             eeEEEEEEe--ecCceeEEEEEEEecCCcceEEecCcccceEEEEecC------CeEEEEEcCCCeeeeee----ecccc
Confidence            345666766  5555544432211101122222233345677777654      45888998776422221    11111


Q ss_pred             eeEEE--CCEEEEEeCCCCCCceEEEeCCCCc
Q 045821          162 PCGVL--NQKIYCIGGLGDTHSWDVYDPRTNN  191 (352)
Q Consensus       162 ~~~~~--~~~iyv~gG~~~~~~v~~yd~~t~~  191 (352)
                      ..+..  +|.++++|+..  ..+.+||..-+.
T Consensus       303 ~~iaWHp~gai~~V~s~q--GelQ~FD~ALsp  332 (545)
T PF11768_consen  303 TLIAWHPDGAIFVVGSEQ--GELQCFDMALSP  332 (545)
T ss_pred             eEEEEcCCCcEEEEEcCC--ceEEEEEeecCc
Confidence            22222  77888888865  678899987664


No 271
>KOG4341 consensus F-box protein containing LRR [General function prediction only]
Probab=28.12  E-value=32  Score=31.73  Aligned_cols=38  Identities=13%  Similarity=0.242  Sum_probs=33.1

Q ss_pred             CccCCCcHHHHHHhhccCccccchhhhhhhhhhccccc
Q 045821           16 PLISGLPDDIALFCLARVPRRYHAVLKCVSSRWRGLLH   53 (352)
Q Consensus        16 ~~~~~lp~dl~~~il~r~p~~~~~~l~~v~k~W~~l~~   53 (352)
                      ...-.||+|++..+.+.+.-.++.+.+.+|+.|+.+..
T Consensus        70 ~~~~~LPpEl~lkvFS~LDtksl~r~a~~c~~~n~~Al  107 (483)
T KOG4341|consen   70 SISRSLPPELLLKVFSMLDTKSLCRAAQCCTMWNKLAL  107 (483)
T ss_pred             cccccCCHHHHHHHHHHHhHHHHHHHHHHHHHhhhhhh
Confidence            33447999999999999999999999999999997643


No 272
>smart00564 PQQ beta-propeller repeat. Beta-propeller repeat occurring in enzymes with pyrrolo-quinoline quinone (PQQ) as cofactor, in Ire1p-like Ser/Thr kinases, and in prokaryotic dehydrogenases.
Probab=27.91  E-value=1e+02  Score=16.21  Aligned_cols=24  Identities=25%  Similarity=0.444  Sum_probs=14.5

Q ss_pred             EECCEEEEEecCCCCeEEEEeCCCCc
Q 045821          256 VVDDALYVLDQSSGTKLMMWQKESRE  281 (352)
Q Consensus       256 ~~~~~l~~~gg~~~~~v~~yd~~~~~  281 (352)
                      ..++.+|+. ..++ .++.+|.++++
T Consensus         4 ~~~~~v~~~-~~~g-~l~a~d~~~G~   27 (33)
T smart00564        4 LSDGTVYVG-STDG-TLYALDAKTGE   27 (33)
T ss_pred             EECCEEEEE-cCCC-EEEEEEcccCc
Confidence            345555544 3333 88899887654


No 273
>KOG0293 consensus WD40 repeat-containing protein [Function unknown]
Probab=27.50  E-value=5.1e+02  Score=24.14  Aligned_cols=130  Identities=12%  Similarity=0.168  Sum_probs=69.6

Q ss_pred             eEEEEECCCCeeEecCCCCCcceeeeeEEE-CCEEEEEeCCCCCCceEEEeCCCC---cEEEecCCCCcccceeEEEECC
Q 045821          137 EVYCYDASMNTWTDAAPMSTARCYFPCGVL-NQKIYCIGGLGDTHSWDVYDPRTN---NWKLHTEPNIFTEIEDSFVMDG  212 (352)
Q Consensus       137 ~~~~~d~~t~~W~~~~~~~~~r~~~~~~~~-~~~iyv~gG~~~~~~v~~yd~~t~---~W~~~~~~~~~~~~~~~~~~~g  212 (352)
                      .++.+|..|+.-...-+-..+-+..+++-. ++.-+|.|+.+  ..+..+|...+   .|+.+..   +.-...+...||
T Consensus       292 ~~~lwDv~tgd~~~~y~~~~~~S~~sc~W~pDg~~~V~Gs~d--r~i~~wdlDgn~~~~W~gvr~---~~v~dlait~Dg  366 (519)
T KOG0293|consen  292 VLSLWDVDTGDLRHLYPSGLGFSVSSCAWCPDGFRFVTGSPD--RTIIMWDLDGNILGNWEGVRD---PKVHDLAITYDG  366 (519)
T ss_pred             heeeccCCcchhhhhcccCcCCCcceeEEccCCceeEecCCC--CcEEEecCCcchhhccccccc---ceeEEEEEcCCC
Confidence            367778877753332211111122222222 67778888876  56677777644   6887764   233344556666


Q ss_pred             E-EEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEEECCEEEEEecCCCCeEEEEeCCC
Q 045821          213 K-IYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVVVDDALYVLDQSSGTKLMMWQKES  279 (352)
Q Consensus       213 ~-iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd~~~  279 (352)
                      + ++.+ +.+.     .+..|+..+..=..+-. ...+.....+.-++++.++.=... .++.||.+.
T Consensus       367 k~vl~v-~~d~-----~i~l~~~e~~~dr~lis-e~~~its~~iS~d~k~~LvnL~~q-ei~LWDl~e  426 (519)
T KOG0293|consen  367 KYVLLV-TVDK-----KIRLYNREARVDRGLIS-EEQPITSFSISKDGKLALVNLQDQ-EIHLWDLEE  426 (519)
T ss_pred             cEEEEE-eccc-----ceeeechhhhhhhcccc-ccCceeEEEEcCCCcEEEEEcccC-eeEEeecch
Confidence            5 5555 3332     45666665443331111 111222233455788777765443 899999983


No 274
>PRK02888 nitrous-oxide reductase; Validated
Probab=27.35  E-value=6.3e+02  Score=25.15  Aligned_cols=54  Identities=17%  Similarity=0.097  Sum_probs=34.7

Q ss_pred             eEEEEeCCC---CceEEccCCCCCcCCCCceEEEeCCEEEEEcCCc-eEEEEEcCCcc
Q 045821          271 KLMMWQKES---REWSPVGRLSTLLTRPPCKLVAIGKTIFVIGKGC-SAVVIDVGNIG  324 (352)
Q Consensus       271 ~v~~yd~~~---~~W~~~~~~~~~~~~~~~~~~~~~~~i~v~gG~~-~~~~~d~~~~~  324 (352)
                      .|.+.|..+   ..+..+..+|....-++..+..-+..+|+-|+.. .+.++|+++..
T Consensus       297 ~V~VID~~t~~~~~~~v~~yIPVGKsPHGV~vSPDGkylyVanklS~tVSVIDv~k~k  354 (635)
T PRK02888        297 KVPVVDGRKAANAGSALTRYVPVPKNPHGVNTSPDGKYFIANGKLSPTVTVIDVRKLD  354 (635)
T ss_pred             EEEEEECCccccCCcceEEEEECCCCccceEECCCCCEEEEeCCCCCcEEEEEChhhh
Confidence            788999876   1244455455543444555555667788888754 46689998744


No 275
>KOG0308 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=27.10  E-value=6.3e+02  Score=25.07  Aligned_cols=66  Identities=18%  Similarity=0.305  Sum_probs=39.8

Q ss_pred             ECCEEEEEeCCCCCCCCCCeEEEEECCCCeeEe------c--CCCC-Cccee-eeeEE-ECCEEEEEeCCCCCCceEEEe
Q 045821          118 LGKNAYLLGGCGWSEDATSEVYCYDASMNTWTD------A--APMS-TARCY-FPCGV-LNQKIYCIGGLGDTHSWDVYD  186 (352)
Q Consensus       118 ~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~------~--~~~~-~~r~~-~~~~~-~~~~iyv~gG~~~~~~v~~yd  186 (352)
                      -++.+++-||.+.      .++.||..+..-+.      +  .+++ .++.. ++.+. -.+.++|.||..  ..+..||
T Consensus       128 k~~~lvaSgGLD~------~IflWDin~~~~~l~~s~n~~t~~sl~sG~k~siYSLA~N~t~t~ivsGgte--k~lr~wD  199 (735)
T KOG0308|consen  128 KNNELVASGGLDR------KIFLWDINTGTATLVASFNNVTVNSLGSGPKDSIYSLAMNQTGTIIVSGGTE--KDLRLWD  199 (735)
T ss_pred             cCceeEEecCCCc------cEEEEEccCcchhhhhhccccccccCCCCCccceeeeecCCcceEEEecCcc--cceEEec
Confidence            3677888888644      46777777553211      1  2233 23322 22222 235688888877  7888999


Q ss_pred             CCCCc
Q 045821          187 PRTNN  191 (352)
Q Consensus       187 ~~t~~  191 (352)
                      +.|..
T Consensus       200 prt~~  204 (735)
T KOG0308|consen  200 PRTCK  204 (735)
T ss_pred             ccccc
Confidence            99874


No 276
>KOG0306 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=27.10  E-value=4.9e+02  Score=26.34  Aligned_cols=99  Identities=13%  Similarity=0.160  Sum_probs=48.1

Q ss_pred             ECCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEE-CCEEEEEccCCCCCCceEEEEEeCCCCceee-cC
Q 045821          166 LNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVM-DGKIYIRCSASAATSHVCALVYEPSTDSWLH-AD  243 (352)
Q Consensus       166 ~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~-~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~-~~  243 (352)
                      .++.+.++.|..  .++-+++..|.+..+.  ++- .+..++.++ +++..++|+..+     .+..||.....--+ ++
T Consensus       382 S~d~~~~~Sga~--~SikiWn~~t~kciRT--i~~-~y~l~~~Fvpgd~~Iv~G~k~G-----el~vfdlaS~~l~Eti~  451 (888)
T KOG0306|consen  382 SSDSILLASGAG--ESIKIWNRDTLKCIRT--ITC-GYILASKFVPGDRYIVLGTKNG-----ELQVFDLASASLVETIR  451 (888)
T ss_pred             ecCceeeeecCC--CcEEEEEccCcceeEE--ecc-ccEEEEEecCCCceEEEeccCC-----ceEEEEeehhhhhhhhh
Confidence            345555555544  5678888887665332  111 233344445 455555654443     47777775433222 21


Q ss_pred             CCCCCCCCCcEEEECCEEEEEecCCCCeEEEEe
Q 045821          244 ANMASGWRGPAVVVDDALYVLDQSSGTKLMMWQ  276 (352)
Q Consensus       244 ~~~~~~~~~~~~~~~~~l~~~gg~~~~~v~~yd  276 (352)
                      . .....-..+...+++-++.|+-+. .+-.||
T Consensus       452 A-HdgaIWsi~~~pD~~g~vT~saDk-tVkfWd  482 (888)
T KOG0306|consen  452 A-HDGAIWSISLSPDNKGFVTGSADK-TVKFWD  482 (888)
T ss_pred             c-cccceeeeeecCCCCceEEecCCc-EEEEEe
Confidence            1 111111112344667677776554 444443


No 277
>KOG2314 consensus Translation initiation factor 3, subunit b (eIF-3b) [Translation, ribosomal structure and biogenesis]
Probab=26.50  E-value=4.8e+02  Score=25.36  Aligned_cols=70  Identities=10%  Similarity=0.083  Sum_probs=44.0

Q ss_pred             CCEEEEEEecCCCCceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEECCC
Q 045821           68 ETWIYAFCRDNKLERVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDASM  145 (352)
Q Consensus        68 ~~~l~~~gg~~~~~~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t  145 (352)
                      +++..++-|......+++|....  ...+|..+..++   ....+...-+..|...++++..+   ...+++.||...
T Consensus       457 gdkF~vi~g~~~k~tvsfY~~e~--~~~~~~lVk~~d---k~~~N~vfwsPkG~fvvva~l~s---~~g~l~F~D~~~  526 (698)
T KOG2314|consen  457 GDKFAVISGNTVKNTVSFYAVET--NIKKPSLVKELD---KKFANTVFWSPKGRFVVVAALVS---RRGDLEFYDTDY  526 (698)
T ss_pred             CCeEEEEEccccccceeEEEeec--CCCchhhhhhhc---ccccceEEEcCCCcEEEEEEecc---cccceEEEecch
Confidence            56666666665666677776665  667777777666   33344444444667666666543   346788888764


No 278
>KOG0301 consensus Phospholipase A2-activating protein (contains WD40 repeats) [Lipid transport and metabolism]
Probab=26.22  E-value=6.7e+02  Score=25.07  Aligned_cols=66  Identities=26%  Similarity=0.257  Sum_probs=35.6

Q ss_pred             EECCEEEEEecCCCCeEEEEeCCCCceEEccCCCCCcCCCCceEEEeCCEEEEEcCCceEEEEEcCCcccc
Q 045821          256 VVDDALYVLDQSSGTKLMMWQKESREWSPVGRLSTLLTRPPCKLVAIGKTIFVIGKGCSAVVIDVGNIGNI  326 (352)
Q Consensus       256 ~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~~~i~v~gG~~~~~~~d~~~~~w~  326 (352)
                      ..++.+.+..|.++ .+.+|+.+  ...+.=.+|.  .-...+-+..++.|++-|.-..+.+|...+++|-
T Consensus       228 ~~~~~~Ivs~gEDr-tlriW~~~--e~~q~I~lPt--tsiWsa~~L~NgDIvvg~SDG~VrVfT~~k~R~A  293 (745)
T KOG0301|consen  228 ALSDGLIVSTGEDR-TLRIWKKD--ECVQVITLPT--TSIWSAKVLLNGDIVVGGSDGRVRVFTVDKDRKA  293 (745)
T ss_pred             cCCCCeEEEecCCc-eEEEeecC--ceEEEEecCc--cceEEEEEeeCCCEEEeccCceEEEEEecccccC
Confidence            45677777777665 67777655  2222223443  1122333334666665443334558888877764


No 279
>COG3490 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=26.15  E-value=4.6e+02  Score=23.21  Aligned_cols=56  Identities=14%  Similarity=0.224  Sum_probs=34.6

Q ss_pred             CCEEEEEecCCCCeEEEEeCCCCceEEccCCCCCcCCCCceEEEeCCEEEEEcCCceEEEE
Q 045821          258 DDALYVLDQSSGTKLMMWQKESREWSPVGRLSTLLTRPPCKLVAIGKTIFVIGKGCSAVVI  318 (352)
Q Consensus       258 ~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~~~~i~v~gG~~~~~~~  318 (352)
                      ...+..+..-.++...++|.+++.-.....++.     .++++.-.+.+.+-.|.-.+..|
T Consensus       290 ~~glV~lTSP~GN~~vi~da~tG~vv~~a~l~d-----aaGva~~~~gf~vssg~G~~~~~  345 (366)
T COG3490         290 RDGLVALTSPRGNRAVIWDAATGAVVSEAALPD-----AAGVAAAKGGFAVSSGQGRIIFY  345 (366)
T ss_pred             cCCeEEEecCCCCeEEEEEcCCCcEEecccccc-----cccceeccCceEEecCCceEEec
Confidence            345555544455689999999998877776665     23444445555555555445444


No 280
>KOG0646 consensus WD40 repeat protein [General function prediction only]
Probab=25.44  E-value=5.8e+02  Score=24.06  Aligned_cols=59  Identities=12%  Similarity=-0.007  Sum_probs=32.5

Q ss_pred             eeeeEEECCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCccccee---EEEECCEEEEEccCCC
Q 045821          160 YFPCGVLNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIED---SFVMDGKIYIRCSASA  222 (352)
Q Consensus       160 ~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~---~~~~~g~iyv~GG~~~  222 (352)
                      -++.+..+.--|+++|... .+++.+...|+.--.+  . .+++..-   .+..||..++.||.++
T Consensus        84 v~al~s~n~G~~l~ag~i~-g~lYlWelssG~LL~v--~-~aHYQ~ITcL~fs~dgs~iiTgskDg  145 (476)
T KOG0646|consen   84 VHALASSNLGYFLLAGTIS-GNLYLWELSSGILLNV--L-SAHYQSITCLKFSDDGSHIITGSKDG  145 (476)
T ss_pred             eeeeecCCCceEEEeeccc-CcEEEEEeccccHHHH--H-HhhccceeEEEEeCCCcEEEecCCCc
Confidence            3455555555666666322 5678877777752111  1 2333332   2334778888877664


No 281
>KOG1898 consensus Splicing factor 3b, subunit 3 [RNA processing and modification]
Probab=25.09  E-value=8.5e+02  Score=25.91  Aligned_cols=160  Identities=11%  Similarity=0.080  Sum_probs=79.8

Q ss_pred             CCCeEEEEECCCCeeEecCCCCCcceeeeeEE--E---CCE-EEEEeCCCCC---------CceEEEeCCCC--cEEEec
Q 045821          134 ATSEVYCYDASMNTWTDAAPMSTARCYFPCGV--L---NQK-IYCIGGLGDT---------HSWDVYDPRTN--NWKLHT  196 (352)
Q Consensus       134 ~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~--~---~~~-iyv~gG~~~~---------~~v~~yd~~t~--~W~~~~  196 (352)
                      ....+..+|+.+++-...-.++..-..++++.  +   +.. +..+|+....         ..++.|+...+  +-+.+.
T Consensus       851 w~s~I~~~d~~s~~~~~~~~l~~ne~a~~v~~~~fs~~~~~~~~~v~~~~~~~l~~~~~~~g~~ytyk~~~~g~~lellh  930 (1205)
T KOG1898|consen  851 WVSSIRVFDPKSGKIICLVELGQNEAAFSVCAVDFSSSEYQPFVAVGVATTEQLDSKSISSGFVYTYKFVRNGDKLELLH  930 (1205)
T ss_pred             ccceEEEEcCCCCceEEEEeecCCcchhheeeeeeccCCCceEEEEEeeccccccccccCCCceEEEEEEecCceeeeee
Confidence            44567788888776544433333222222222  2   222 4444443221         44677766533  333333


Q ss_pred             CCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcE-EEECCEEEEEecCCCCeEEEE
Q 045821          197 EPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPA-VVVDDALYVLDQSSGTKLMMW  275 (352)
Q Consensus       197 ~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~-~~~~~~l~~~gg~~~~~v~~y  275 (352)
                      .-..+..-++.+.+.|+++.--|.       .+..||...++--........+..-.. .....+|++-.-..+-.+++|
T Consensus       931 ~T~~~~~v~Ai~~f~~~~LagvG~-------~l~~YdlG~K~lLRk~e~k~~p~~Is~iqt~~~RI~VgD~qeSV~~~~y 1003 (1205)
T KOG1898|consen  931 KTEIPGPVGAICPFQGRVLAGVGR-------FLRLYDLGKKKLLRKCELKFIPNRISSIQTYGARIVVGDIQESVHFVRY 1003 (1205)
T ss_pred             ccCCCccceEEeccCCEEEEeccc-------EEEEeeCChHHHHhhhhhccCceEEEEEeecceEEEEeeccceEEEEEE
Confidence            323333334567778866553332       488899876654332111111111112 234455555444444456888


Q ss_pred             eCCCCceEEccCCCCCcCCCCceEEEe
Q 045821          276 QKESREWSPVGRLSTLLTRPPCKLVAI  302 (352)
Q Consensus       276 d~~~~~W~~~~~~~~~~~~~~~~~~~~  302 (352)
                      +++.|+......-|.  .|+-.++..+
T Consensus      1004 ~~~~n~l~~fadD~~--pR~Vt~~~~l 1028 (1205)
T KOG1898|consen 1004 RREDNQLIVFADDPV--PRHVTALELL 1028 (1205)
T ss_pred             ecCCCeEEEEeCCCc--cceeeEEEEe
Confidence            999998888876554  3544444443


No 282
>KOG0270 consensus WD40 repeat-containing protein [Function unknown]
Probab=24.75  E-value=5.8e+02  Score=23.89  Aligned_cols=28  Identities=14%  Similarity=-0.056  Sum_probs=19.3

Q ss_pred             ceEEEeCCEEEEEcCCce-EEEEEcCCcc
Q 045821          297 CKLVAIGKTIFVIGKGCS-AVVIDVGNIG  324 (352)
Q Consensus       297 ~~~~~~~~~i~v~gG~~~-~~~~d~~~~~  324 (352)
                      ++.......+|++||... ..++|+.++.
T Consensus       424 ~~~~~~~a~~la~GG~k~~~~vwd~~~~~  452 (463)
T KOG0270|consen  424 FALDPDVAFTLAFGGEKAVLRVWDIFTNS  452 (463)
T ss_pred             cccCCCcceEEEecCccceEEEeecccCh
Confidence            344445567899999887 4478877654


No 283
>PF14298 DUF4374:  Domain of unknown function (DUF4374)
Probab=24.19  E-value=6.1e+02  Score=23.90  Aligned_cols=61  Identities=13%  Similarity=0.081  Sum_probs=40.5

Q ss_pred             CeEEEEeCCCCceEEccCCCCCc-CCCCceEEEeCCEEEEE-c---C-CceEEEEEcCCcccccceE
Q 045821          270 TKLMMWQKESREWSPVGRLSTLL-TRPPCKLVAIGKTIFVI-G---K-GCSAVVIDVGNIGNIGGIM  330 (352)
Q Consensus       270 ~~v~~yd~~~~~W~~~~~~~~~~-~~~~~~~~~~~~~i~v~-g---G-~~~~~~~d~~~~~w~~~~~  330 (352)
                      ..+.++|..+++.+.+..+|... ....-...+.++++|+- .   | ..-++.+|+.+.+=.++..
T Consensus       367 ~~laI~d~~~kt~t~V~glP~~~is~~~~~~~ve~G~aYi~Vtt~~g~~~~IY~iDp~TatAtKGl~  433 (435)
T PF14298_consen  367 KKLAIFDVSNKTFTWVTGLPADLISGFGNAPYVENGKAYIPVTTEDGSDPYIYKIDPATATATKGLK  433 (435)
T ss_pred             ceEEEEEccCceeEEeccCChhhccccccceEeeCCEEEEEEeecCCCceeEEEEcCcccccccceE
Confidence            57889999999999998888651 22222344568888863 2   2 1235568888877666554


No 284
>PF15525 DUF4652:  Domain of unknown function (DUF4652)
Probab=23.80  E-value=4.1e+02  Score=21.78  Aligned_cols=69  Identities=12%  Similarity=0.126  Sum_probs=41.8

Q ss_pred             eEEEecCCCCCCCeeEeccCC--CcccCCcceEEEEECCE-EEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCC
Q 045821           84 CCYVLDPNSTRRSWKMIHELP--ARTLRRKGMGFEVLGKN-AYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMST  156 (352)
Q Consensus        84 ~~~~~d~~~~~~~w~~~~~~~--~~~~~~~~~~~~~~~~~-iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~  156 (352)
                      .+|++|.  .++.|..+.--+  ....|.  ...-.-+.. ++++|...+....-..+++|++.+++-..+-+...
T Consensus        89 kIYIkn~--~~~~~~~L~i~~~~~k~sPK--~i~WiDD~~L~vIIG~a~GTvS~GGnLy~~nl~tg~~~~ly~~~d  160 (200)
T PF15525_consen   89 KIYIKNL--NNNNWWSLQIDQNEEKYSPK--YIEWIDDNNLAVIIGYAHGTVSKGGNLYKYNLNTGNLTELYEWKD  160 (200)
T ss_pred             eEEEEec--CCCceEEEEecCcccccCCc--eeEEecCCcEEEEEccccceEccCCeEEEEEccCCceeEeeeccc
Confidence            5788888  888887664322  122233  222223444 55556443433445789999999999888876644


No 285
>KOG3669 consensus Uncharacterized conserved protein, contains dysferlin, TECPR and PH domains [General function prediction only]
Probab=23.68  E-value=7e+02  Score=24.43  Aligned_cols=54  Identities=15%  Similarity=0.241  Sum_probs=33.8

Q ss_pred             EECCCCeeEecCCCC-Cc----c-eeee-eEEECCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCC
Q 045821          141 YDASMNTWTDAAPMS-TA----R-CYFP-CGVLNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNI  200 (352)
Q Consensus       141 ~d~~t~~W~~~~~~~-~~----r-~~~~-~~~~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~  200 (352)
                      -.|.-+.|+.+++.+ ..    + ..-. ++.-++++|.=-|      |-+.++.-++|+.+..+..
T Consensus       213 ~~P~GraW~~i~~~t~L~qISagPtg~VwAvt~nG~vf~R~G------VsRqNp~GdsWkdI~tP~~  273 (705)
T KOG3669|consen  213 DRPCGRAWKVICPYTDLSQISAGPTGVVWAVTENGAVFYREG------VSRQNPEGDSWKDIVTPRQ  273 (705)
T ss_pred             CCCCCceeeecCCCCccceEeecCcceEEEEeeCCcEEEEec------ccccCCCCchhhhccCccc
Confidence            357777888887644 11    1 1111 2223777777666      5679999999998875443


No 286
>KOG0279 consensus G protein beta subunit-like protein [Signal transduction mechanisms]
Probab=23.43  E-value=5.1e+02  Score=22.75  Aligned_cols=186  Identities=10%  Similarity=0.050  Sum_probs=86.7

Q ss_pred             CCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcceeeeeEE-ECCEEEEEeCCCCCCceEEEeCCCCcEEEecC
Q 045821          119 GKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTARCYFPCGV-LNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTE  197 (352)
Q Consensus       119 ~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r~~~~~~~-~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~  197 (352)
                      ++...+-++++      ..+..+|..+++=+..-- ...+--.++++ -+++-.|.|..+  +.+..||...+.=-++..
T Consensus        74 dg~~alS~swD------~~lrlWDl~~g~~t~~f~-GH~~dVlsva~s~dn~qivSGSrD--kTiklwnt~g~ck~t~~~  144 (315)
T KOG0279|consen   74 DGNFALSASWD------GTLRLWDLATGESTRRFV-GHTKDVLSVAFSTDNRQIVSGSRD--KTIKLWNTLGVCKYTIHE  144 (315)
T ss_pred             CCceEEecccc------ceEEEEEecCCcEEEEEE-ecCCceEEEEecCCCceeecCCCc--ceeeeeeecccEEEEEec
Confidence            55555555543      345667777653221110 01111112222 255555656555  667777776654323322


Q ss_pred             CCCcccc-eeEEEE-C--CEEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCC--cEEEECCEEEEEecCCCCe
Q 045821          198 PNIFTEI-EDSFVM-D--GKIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRG--PAVVVDDALYVLDQSSGTK  271 (352)
Q Consensus       198 ~~~~~~~-~~~~~~-~--g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~--~~~~~~~~l~~~gg~~~~~  271 (352)
                      -.. +.- .+..+. +  +-+.+.+|.+.     .+-++|..+.+-...  .+.+...-  .++..+|.++.-||.++ .
T Consensus       145 ~~~-~~WVscvrfsP~~~~p~Ivs~s~Dk-----tvKvWnl~~~~l~~~--~~gh~~~v~t~~vSpDGslcasGgkdg-~  215 (315)
T KOG0279|consen  145 DSH-REWVSCVRFSPNESNPIIVSASWDK-----TVKVWNLRNCQLRTT--FIGHSGYVNTVTVSPDGSLCASGGKDG-E  215 (315)
T ss_pred             CCC-cCcEEEEEEcCCCCCcEEEEccCCc-----eEEEEccCCcchhhc--cccccccEEEEEECCCCCEEecCCCCc-e
Confidence            111 111 122222 2  33444443332     355566655433221  11111111  23456999999999888 8


Q ss_pred             EEEEeCCCCce-EEccCCCCCcCCCCceEEEeCCEEEEEcC-CceEEEEEcCCccccc
Q 045821          272 LMMWQKESREW-SPVGRLSTLLTRPPCKLVAIGKTIFVIGK-GCSAVVIDVGNIGNIG  327 (352)
Q Consensus       272 v~~yd~~~~~W-~~~~~~~~~~~~~~~~~~~~~~~i~v~gG-~~~~~~~d~~~~~w~~  327 (352)
                      +..+|....+= ..++..     ..-.+++..-++.++.-+ ..++-|+|.+...-..
T Consensus       216 ~~LwdL~~~k~lysl~a~-----~~v~sl~fspnrywL~~at~~sIkIwdl~~~~~v~  268 (315)
T KOG0279|consen  216 AMLWDLNEGKNLYSLEAF-----DIVNSLCFSPNRYWLCAATATSIKIWDLESKAVVE  268 (315)
T ss_pred             EEEEEccCCceeEeccCC-----CeEeeEEecCCceeEeeccCCceEEEeccchhhhh
Confidence            88888876543 222221     112344444555544433 2245567776655433


No 287
>KOG0319 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=22.93  E-value=7.9e+02  Score=24.77  Aligned_cols=169  Identities=9%  Similarity=0.062  Sum_probs=77.4

Q ss_pred             EEEEECCCCe----eEecCCCCCcceeeeeEEECCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCC-cccceeEEEECC
Q 045821          138 VYCYDASMNT----WTDAAPMSTARCYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNI-FTEIEDSFVMDG  212 (352)
Q Consensus       138 ~~~~d~~t~~----W~~~~~~~~~r~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~-~~~~~~~~~~~g  212 (352)
                      +.+|+..+++    |..+-.-|.-+..+.  . .+.+...||.+  ..+-++|...+...--  +.. +..-+++.+..+
T Consensus        86 lrv~~L~tgk~irswKa~He~Pvi~ma~~--~-~g~LlAtggaD--~~v~VWdi~~~~~th~--fkG~gGvVssl~F~~~  158 (775)
T KOG0319|consen   86 LRVWSLPTGKLIRSWKAIHEAPVITMAFD--P-TGTLLATGGAD--GRVKVWDIKNGYCTHS--FKGHGGVVSSLLFHPH  158 (775)
T ss_pred             EEEEEcccchHhHhHhhccCCCeEEEEEc--C-CCceEEecccc--ceEEEEEeeCCEEEEE--ecCCCceEEEEEeCCc
Confidence            4556666653    544333333222111  1 23677778877  5677777766543321  111 111122333333


Q ss_pred             E---EEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCC--CCcEEEECCEEEEEecCCCCeEEEEeCCCCceEEccC
Q 045821          213 K---IYIRCSASAATSHVCALVYEPSTDSWLHADANMASGW--RGPAVVVDDALYVLDQSSGTKLMMWQKESREWSPVGR  287 (352)
Q Consensus       213 ~---iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~--~~~~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~  287 (352)
                      .   +...|+.+     ..+..||..+..= .+.....+..  .+.+...++.-.+.+|.+. -+.+||.  .+.+.+..
T Consensus       159 ~~~~lL~sg~~D-----~~v~vwnl~~~~t-cl~~~~~H~S~vtsL~~~~d~~~~ls~~RDk-vi~vwd~--~~~~~l~~  229 (775)
T KOG0319|consen  159 WNRWLLASGATD-----GTVRVWNLNDKRT-CLHTMILHKSAVTSLAFSEDSLELLSVGRDK-VIIVWDL--VQYKKLKT  229 (775)
T ss_pred             cchhheeecCCC-----ceEEEEEcccCch-HHHHHHhhhhheeeeeeccCCceEEEeccCc-EEEEeeh--hhhhhhhe
Confidence            3   33554333     2577777764322 1111111111  1112222333333344332 7889998  44445555


Q ss_pred             CCCCcCCCCceEEEeC-------CEEEEEcCCceEEEEEcCCcc
Q 045821          288 LSTLLTRPPCKLVAIG-------KTIFVIGKGCSAVVIDVGNIG  324 (352)
Q Consensus       288 ~~~~~~~~~~~~~~~~-------~~i~v~gG~~~~~~~d~~~~~  324 (352)
                      +|..  ...-+++...       +-++..||.....++|.+.-.
T Consensus       230 lp~y--e~~E~vv~l~~~~~~~~~~~~TaG~~g~~~~~d~es~~  271 (775)
T KOG0319|consen  230 LPLY--ESLESVVRLREELGGKGEYIITAGGSGVVQYWDSESGK  271 (775)
T ss_pred             echh--hheeeEEEechhcCCcceEEEEecCCceEEEEecccch
Confidence            5532  1112233222       246667877777788887644


No 288
>smart00120 HX Hemopexin-like repeats. Hemopexin is a heme-binding protein that transports heme to the liver. Hemopexin-like repeats occur in vitronectin and some matrix  metalloproteinases family (matrixins). The HX repeats of some matrixins  bind tissue inhibitor of metalloproteinases (TIMPs).
Probab=22.61  E-value=1.7e+02  Score=16.80  Aligned_cols=19  Identities=5%  Similarity=-0.010  Sum_probs=14.3

Q ss_pred             CCEEEEEcCCceEEEEEcCC
Q 045821          303 GKTIFVIGKGCSAVVIDVGN  322 (352)
Q Consensus       303 ~~~i~v~gG~~~~~~~d~~~  322 (352)
                      ++++|+|-|. ..+.||..+
T Consensus         9 ~~~~yfFkg~-~yw~~~~~~   27 (45)
T smart00120        9 NGKTYFFKGD-KYWRFDPKR   27 (45)
T ss_pred             CCeEEEEeCC-EEEEEcCCc
Confidence            5689999876 566788654


No 289
>COG3823 Glutamine cyclotransferase [Posttranslational modification, protein turnover, chaperones]
Probab=22.50  E-value=4.7e+02  Score=22.01  Aligned_cols=186  Identities=14%  Similarity=0.007  Sum_probs=95.1

Q ss_pred             eEEEEECCEEEEEeCCCCCCCCCCeEEEEECCCCeeEecCCCCCcc-eeeeeEEECCEEEEEeCCCCCCceEEEeCCCCc
Q 045821          113 MGFEVLGKNAYLLGGCGWSEDATSEVYCYDASMNTWTDAAPMSTAR-CYFPCGVLNQKIYCIGGLGDTHSWDVYDPRTNN  191 (352)
Q Consensus       113 ~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~W~~~~~~~~~r-~~~~~~~~~~~iyv~gG~~~~~~v~~yd~~t~~  191 (352)
                      +.+...+|++|.--|..+    .+.+.++|..+++-..-..++.++ ++-..+..++.+|...=.+  ...+.||+.|- 
T Consensus        49 QGL~~~~g~i~esTG~yg----~S~ir~~~L~~gq~~~s~~l~~~~~FgEGit~~gd~~y~LTw~e--gvaf~~d~~t~-  121 (262)
T COG3823          49 QGLEYLDGHILESTGLYG----FSKIRVSDLTTGQEIFSEKLAPDTVFGEGITKLGDYFYQLTWKE--GVAFKYDADTL-  121 (262)
T ss_pred             cceeeeCCEEEEeccccc----cceeEEEeccCceEEEEeecCCccccccceeeccceEEEEEecc--ceeEEEChHHh-
Confidence            456777888888877644    377889999977633222233233 3456777788999886543  44567776643 


Q ss_pred             EEEecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCce---eecCCC-CCCCCCCcEEEECCEEEEEecC
Q 045821          192 WKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSW---LHADAN-MASGWRGPAVVVDDALYVLDQS  267 (352)
Q Consensus       192 W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W---~~~~~~-~~~~~~~~~~~~~~~l~~~gg~  267 (352)
                       +.+...+....+-+.+.-+.+|.+.-|.      ..+..-||.+-.=   .++... .+...-.-.-.++|.+|.== .
T Consensus       122 -~~lg~~~y~GeGWgLt~d~~~LimsdGs------atL~frdP~tfa~~~~v~VT~~g~pv~~LNELE~VdG~lyANV-w  193 (262)
T COG3823         122 -EELGRFSYEGEGWGLTSDDKNLIMSDGS------ATLQFRDPKTFAELDTVQVTDDGVPVSKLNELEWVDGELYANV-W  193 (262)
T ss_pred             -hhhcccccCCcceeeecCCcceEeeCCc------eEEEecCHHHhhhcceEEEEECCeecccccceeeeccEEEEee-e
Confidence             3333333333333444444455444332      2344445543211   111110 00000000112344444210 1


Q ss_pred             CCCeEEEEeCCCCc---eEEccCCCCCcCCC--------CceEEEeCCEEEEEcCCc
Q 045821          268 SGTKLMMWQKESRE---WSPVGRLSTLLTRP--------PCKLVAIGKTIFVIGKGC  313 (352)
Q Consensus       268 ~~~~v~~yd~~~~~---W~~~~~~~~~~~~~--------~~~~~~~~~~i~v~gG~~  313 (352)
                      ..+.+.+.||+++.   |..+..++......        +-+...-++++|+-|-..
T Consensus       194 ~t~~I~rI~p~sGrV~~widlS~L~~~~~~~~~~~nvlNGIA~~~~~~r~~iTGK~w  250 (262)
T COG3823         194 QTTRIARIDPDSGRVVAWIDLSGLLKELNLDKSNDNVLNGIAHDPQQDRFLITGKLW  250 (262)
T ss_pred             eecceEEEcCCCCcEEEEEEccCCchhcCccccccccccceeecCcCCeEEEecCcC
Confidence            11478899999875   88888765422111        112222356888888544


No 290
>KOG1963 consensus WD40 repeat protein [General function prediction only]
Probab=22.36  E-value=8.5e+02  Score=24.91  Aligned_cols=59  Identities=22%  Similarity=0.278  Sum_probs=38.1

Q ss_pred             EEEECCEEEEEecCCCCeEEEEeCCCCceEEccCCCCCc-------CCCCceEEEeCCEEEEEcCCc
Q 045821          254 AVVVDDALYVLDQSSGTKLMMWQKESREWSPVGRLSTLL-------TRPPCKLVAIGKTIFVIGKGC  313 (352)
Q Consensus       254 ~~~~~~~l~~~gg~~~~~v~~yd~~~~~W~~~~~~~~~~-------~~~~~~~~~~~~~i~v~gG~~  313 (352)
                      +...+|.-++-||..+ .+.+|-.+++.=+-++.+....       .-..++++.-+++|.++-+.+
T Consensus       258 ~fS~~G~~LlSGG~E~-VLv~Wq~~T~~kqfLPRLgs~I~~i~vS~ds~~~sl~~~DNqI~li~~~d  323 (792)
T KOG1963|consen  258 SFSSDGAYLLSGGREG-VLVLWQLETGKKQFLPRLGSPILHIVVSPDSDLYSLVLEDNQIHLIKASD  323 (792)
T ss_pred             EEecCCceEeecccce-EEEEEeecCCCcccccccCCeeEEEEEcCCCCeEEEEecCceEEEEeccc
Confidence            3455777667788776 7888888887744455544321       233456666788999887643


No 291
>KOG0294 consensus WD40 repeat-containing protein [Function unknown]
Probab=21.86  E-value=5.8e+02  Score=22.84  Aligned_cols=27  Identities=15%  Similarity=0.292  Sum_probs=15.9

Q ss_pred             EEEECCEEEEEeCCCCCCCCCCeEEEEECCCCe
Q 045821          115 FEVLGKNAYLLGGCGWSEDATSEVYCYDASMNT  147 (352)
Q Consensus       115 ~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~  147 (352)
                      ++++++ .|+..|.     ....+.+||..++.
T Consensus        48 avAVs~-~~~aSGs-----sDetI~IYDm~k~~   74 (362)
T KOG0294|consen   48 ALAVSG-PYVASGS-----SDETIHIYDMRKRK   74 (362)
T ss_pred             EEEecc-eeEeccC-----CCCcEEEEeccchh
Confidence            344444 4555442     23568999998764


No 292
>KOG0303 consensus Actin-binding protein Coronin, contains WD40 repeats [Cytoskeleton]
Probab=21.39  E-value=6.6e+02  Score=23.29  Aligned_cols=135  Identities=14%  Similarity=0.192  Sum_probs=66.4

Q ss_pred             CceEEEeCCCCcEEEecCCCCcccceeEEEECCEEEEEccCCCCCCceEEEEEeCCCCceeecCCCCCCCCCCcEEEE-C
Q 045821          180 HSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDGKIYIRCSASAATSHVCALVYEPSTDSWLHADANMASGWRGPAVVV-D  258 (352)
Q Consensus       180 ~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g~iyv~GG~~~~~~~~~i~~yd~~~~~W~~~~~~~~~~~~~~~~~~-~  258 (352)
                      +.|..+|..|+.=-.--..| ..........+|.+++-...+.     .+-++|+.+++-.............-++.+ +
T Consensus       154 n~v~iWnv~tgeali~l~hp-d~i~S~sfn~dGs~l~TtckDK-----kvRv~dpr~~~~v~e~~~heG~k~~Raifl~~  227 (472)
T KOG0303|consen  154 NTVSIWNVGTGEALITLDHP-DMVYSMSFNRDGSLLCTTCKDK-----KVRVIDPRRGTVVSEGVAHEGAKPARAIFLAS  227 (472)
T ss_pred             ceEEEEeccCCceeeecCCC-CeEEEEEeccCCceeeeecccc-----eeEEEcCCCCcEeeecccccCCCcceeEEecc
Confidence            77888888887521111111 1122234445777777644432     688899988876555432222222223444 4


Q ss_pred             CEEEEEecCCC---CeEEEEeCCCCceEEccCCCCCcCCCCceEEEe----CCEEEEEcCCce-EEEEEcCCcc
Q 045821          259 DALYVLDQSSG---TKLMMWQKESREWSPVGRLSTLLTRPPCKLVAI----GKTIFVIGKGCS-AVVIDVGNIG  324 (352)
Q Consensus       259 ~~l~~~gg~~~---~~v~~yd~~~~~W~~~~~~~~~~~~~~~~~~~~----~~~i~v~gG~~~-~~~~d~~~~~  324 (352)
                      ++ ++..|++.   ..+..+|+++-+= .+.  -..+......+..+    .+-||+.|-.+. +-.|.+..+.
T Consensus       228 g~-i~tTGfsr~seRq~aLwdp~nl~e-P~~--~~elDtSnGvl~PFyD~dt~ivYl~GKGD~~IRYyEit~d~  297 (472)
T KOG0303|consen  228 GK-IFTTGFSRMSERQIALWDPNNLEE-PIA--LQELDTSNGVLLPFYDPDTSIVYLCGKGDSSIRYFEITNEP  297 (472)
T ss_pred             Cc-eeeeccccccccceeccCcccccC-cce--eEEeccCCceEEeeecCCCCEEEEEecCCcceEEEEecCCC
Confidence            55 55555543   5677777765221 010  00011222222222    245899996554 3355555443


No 293
>KOG0265 consensus U5 snRNP-specific protein-like factor and related proteins [RNA processing and modification]
Probab=21.12  E-value=5.9e+02  Score=22.62  Aligned_cols=148  Identities=11%  Similarity=-0.028  Sum_probs=68.5

Q ss_pred             CCEEEEEEecCCCCceeEEEecCCCCCCCeeEeccCCCcccCCcceEEEEECCEEEEEeCCCCCCCCCCeEEEEECCCCe
Q 045821           68 ETWIYAFCRDNKLERVCCYVLDPNSTRRSWKMIHELPARTLRRKGMGFEVLGKNAYLLGGCGWSEDATSEVYCYDASMNT  147 (352)
Q Consensus        68 ~~~l~~~gg~~~~~~~~~~~~d~~~~~~~w~~~~~~~~~~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~d~~t~~  147 (352)
                      ++..++.||. ...-+.|-+|+-  ..|.|.....-.    +........-++.|+-.|       ..+.+..||..+++
T Consensus        58 ~gs~~aSgG~-Dr~I~LWnv~gd--ceN~~~lkgHsg----AVM~l~~~~d~s~i~S~g-------tDk~v~~wD~~tG~  123 (338)
T KOG0265|consen   58 DGSCFASGGS-DRAIVLWNVYGD--CENFWVLKGHSG----AVMELHGMRDGSHILSCG-------TDKTVRGWDAETGK  123 (338)
T ss_pred             CCCeEeecCC-cceEEEEecccc--ccceeeeccccc----eeEeeeeccCCCEEEEec-------CCceEEEEecccce
Confidence            5667777775 333344555766  677776542111    000011111134455543       35778899999886


Q ss_pred             eEecCCCCCcceeeeeEEE-CCEEEEEeCCCCCCceEEEeCCCCcEEEecCCCCcccceeEEEECC--EEEEEccCCCCC
Q 045821          148 WTDAAPMSTARCYFPCGVL-NQKIYCIGGLGDTHSWDVYDPRTNNWKLHTEPNIFTEIEDSFVMDG--KIYIRCSASAAT  224 (352)
Q Consensus       148 W~~~~~~~~~r~~~~~~~~-~~~iyv~gG~~~~~~v~~yd~~t~~W~~~~~~~~~~~~~~~~~~~g--~iyv~GG~~~~~  224 (352)
                      -..--..-. +.--++... -|...|..|.++ ..+-+||..+..--+   ..+..+.-.++.+++  .=.+.||.+   
T Consensus       124 ~~rk~k~h~-~~vNs~~p~rrg~~lv~SgsdD-~t~kl~D~R~k~~~~---t~~~kyqltAv~f~d~s~qv~sggId---  195 (338)
T KOG0265|consen  124 RIRKHKGHT-SFVNSLDPSRRGPQLVCSGSDD-GTLKLWDIRKKEAIK---TFENKYQLTAVGFKDTSDQVISGGID---  195 (338)
T ss_pred             eeehhcccc-ceeeecCccccCCeEEEecCCC-ceEEEEeecccchhh---ccccceeEEEEEecccccceeecccc---
Confidence            432111000 000000011 133444455444 778889998664222   223344444554432  223344554   


Q ss_pred             CceEEEEEeCCCCce
Q 045821          225 SHVCALVYEPSTDSW  239 (352)
Q Consensus       225 ~~~~i~~yd~~~~~W  239 (352)
                        +.+.++|+..+.-
T Consensus       196 --n~ikvWd~r~~d~  208 (338)
T KOG0265|consen  196 --NDIKVWDLRKNDG  208 (338)
T ss_pred             --CceeeeccccCcc
Confidence              2466666654443


Done!