Query         003800
Match_columns 794
No_of_seqs    292 out of 1142
Neff          7.3 
Searched_HMMs 46136
Date          Thu Mar 28 12:16:11 2013
Command       hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/003800.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/003800hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG2103 Uncharacterized conser 100.0  1E-113  3E-118  960.3  56.7  702   12-792     7-730 (910)
  2 PRK11138 outer membrane biogen  99.9 2.1E-20 4.5E-25  210.6  34.8  241    1-258     1-278 (394)
  3 TIGR03300 assembly_YfgL outer   99.8 4.5E-18 9.8E-23  190.3  34.9  216   24-258    35-263 (377)
  4 PRK11138 outer membrane biogen  99.8 1.6E-16 3.4E-21  179.1  29.3  213   26-258    86-316 (394)
  5 PF13360 PQQ_2:  PQQ-like domai  99.7 9.3E-16   2E-20  159.6  27.7  216   24-258     8-234 (238)
  6 TIGR03300 assembly_YfgL outer   99.7 6.8E-15 1.5E-19  164.7  30.6  209   26-258    82-301 (377)
  7 cd00216 PQQ_DH Dehydrogenases   99.7 1.1E-14 2.4E-19  168.4  24.2  220   27-257    37-322 (488)
  8 cd00216 PQQ_DH Dehydrogenases   99.6 1.9E-14   4E-19  166.6  25.1  230   25-265   126-434 (488)
  9 PF13360 PQQ_2:  PQQ-like domai  99.6 5.6E-14 1.2E-18  146.2  22.1  181   61-257     1-194 (238)
 10 TIGR03075 PQQ_enz_alc_DH PQQ-d  99.5   1E-12 2.3E-17  152.8  23.1  219   29-257    47-336 (527)
 11 COG1520 FOG: WD40-like repeat   99.5 1.4E-12   3E-17  145.9  23.0  216   26-257    40-271 (370)
 12 TIGR03074 PQQ_membr_DH membran  99.5 1.7E-12 3.6E-17  155.5  24.6  202   49-258   190-481 (764)
 13 COG1520 FOG: WD40-like repeat   99.3 2.4E-10 5.2E-15  127.9  23.3  186   24-221    83-279 (370)
 14 TIGR03074 PQQ_membr_DH membran  99.3 9.8E-11 2.1E-15  140.4  21.3  191   25-220   210-487 (764)
 15 TIGR03075 PQQ_enz_alc_DH PQQ-d  99.3 1.8E-10 3.9E-15  134.2  19.9  187   25-218    85-341 (527)
 16 KOG4649 PQQ (pyrrolo-quinoline  99.0   2E-08 4.4E-13  102.4  19.0  183   53-257    23-209 (354)
 17 KOG4649 PQQ (pyrrolo-quinoline  98.9 4.8E-07   1E-11   92.5  23.7  178   25-223    39-219 (354)
 18 COG4993 Gcd Glucose dehydrogen  98.7 5.1E-07 1.1E-11  101.8  17.3  205   52-257   213-491 (773)
 19 TIGR03866 PQQ_ABC_repeats PQQ-  97.8   0.071 1.5E-06   56.4  34.6  183   56-257     4-190 (300)
 20 PF02239 Cytochrom_D1:  Cytochr  97.7   0.067 1.5E-06   60.1  31.6  187   21-220    18-212 (369)
 21 COG4993 Gcd Glucose dehydrogen  97.7 0.00079 1.7E-08   76.7  15.1  165   52-218   271-496 (773)
 22 PF01011 PQQ:  PQQ enzyme repea  97.6  0.0001 2.3E-09   54.3   4.6   31   54-84      1-31  (38)
 23 TIGR03866 PQQ_ABC_repeats PQQ-  97.6    0.16 3.4E-06   53.7  31.8  189   52-257    41-240 (300)
 24 cd00200 WD40 WD40 domain, foun  97.4    0.11 2.4E-06   53.1  26.7  186   53-257    21-210 (289)
 25 cd00200 WD40 WD40 domain, foun  97.4   0.069 1.5E-06   54.6  25.1  187   53-257    63-252 (289)
 26 PF05096 Glu_cyclase_2:  Glutam  97.4   0.024 5.2E-07   59.9  21.0  155   52-221    54-214 (264)
 27 TIGR02658 TTQ_MADH_Hv methylam  97.4    0.35 7.7E-06   53.7  33.9  191   51-257    10-226 (352)
 28 PF02239 Cytochrom_D1:  Cytochr  97.3    0.23   5E-06   55.8  29.5  190   54-257     6-205 (369)
 29 PF10282 Lactonase:  Lactonase,  96.9       1 2.2E-05   50.1  32.7  222   25-253    21-274 (345)
 30 PF13570 PQQ_3:  PQQ-like domai  96.9  0.0019 4.1E-08   48.1   4.6   40   73-115     1-40  (40)
 31 PTZ00421 coronin; Provisional   96.8    0.23   5E-06   57.9  23.9  195   53-257    88-293 (493)
 32 smart00564 PQQ beta-propeller   96.8  0.0015 3.3E-08   46.1   3.8   27   53-79      6-32  (33)
 33 PF13570 PQQ_3:  PQQ-like domai  96.8  0.0025 5.5E-08   47.4   5.0   40   29-72      1-40  (40)
 34 PF10282 Lactonase:  Lactonase,  96.8     1.3 2.7E-05   49.3  37.9  195   56-257     2-227 (345)
 35 TIGR02658 TTQ_MADH_Hv methylam  96.8     0.8 1.7E-05   51.0  26.4   79   49-127    53-149 (352)
 36 PF01011 PQQ:  PQQ enzyme repea  96.7  0.0039 8.6E-08   45.9   5.1   31   98-128     2-32  (38)
 37 KOG0296 Angio-associated migra  96.4    0.17 3.7E-06   54.9  17.5  156   51-219   200-365 (399)
 38 KOG2103 Uncharacterized conser  96.4   0.089 1.9E-06   62.3  16.4  192   26-246    64-267 (910)
 39 KOG2048 WD40 repeat protein [G  96.3     1.1 2.4E-05   52.3  24.0  188   53-257    37-236 (691)
 40 KOG1539 WD repeat protein [Gen  96.3     1.7 3.8E-05   51.9  25.7  186   53-251   124-315 (910)
 41 PF05935 Arylsulfotrans:  Aryls  96.1    0.24 5.3E-06   57.6  18.2  151   53-219   113-310 (477)
 42 PTZ00420 coronin; Provisional   96.0     2.4 5.2E-05   50.3  26.1  191   53-257    87-296 (568)
 43 smart00564 PQQ beta-propeller   95.9   0.014   3E-07   41.1   4.2   29   94-122     4-32  (33)
 44 KOG2055 WD40 repeat protein [G  95.9     1.3 2.8E-05   49.6  21.1  199   39-255   214-418 (514)
 45 KOG0318 WD40 repeat stress pro  95.2     7.6 0.00016   44.5  33.2  151   51-214   200-354 (603)
 46 KOG0316 Conserved WD40 repeat-  95.2    0.78 1.7E-05   47.4  15.3  190   56-264    74-266 (307)
 47 KOG0296 Angio-associated migra  95.1     6.6 0.00014   43.1  28.9  141   52-219    75-229 (399)
 48 KOG0316 Conserved WD40 repeat-  95.1     5.1 0.00011   41.6  22.6  146   94-257    28-176 (307)
 49 PHA02790 Kelch-like protein; P  94.9     2.6 5.7E-05   49.1  21.1  167   53-241   271-453 (480)
 50 PHA02713 hypothetical protein;  94.8     1.9   4E-05   51.3  19.8  173   53-241   303-519 (557)
 51 PTZ00421 coronin; Provisional   94.7     3.2   7E-05   48.5  21.3  154   53-218   138-298 (493)
 52 KOG0278 Serine/threonine kinas  94.7       2 4.2E-05   44.8  16.6  106   53-170   155-262 (334)
 53 PLN02919 haloacid dehalogenase  94.4     3.9 8.4E-05   52.4  22.6  200   53-257   635-891 (1057)
 54 PLN00181 protein SPA1-RELATED;  94.2      20 0.00043   44.6  28.1  190   52-256   494-692 (793)
 55 KOG0291 WD40-repeat-containing  94.2      17 0.00036   43.6  34.4   66  186-257   402-469 (893)
 56 KOG0319 WD40-repeat-containing  94.0      17 0.00038   43.3  25.2  193   52-257    73-314 (775)
 57 PRK11028 6-phosphogluconolacto  94.0      11 0.00025   41.1  28.3  197   51-255    44-259 (330)
 58 PRK11028 6-phosphogluconolacto  93.9      12 0.00025   40.9  29.8  188   55-255     3-206 (330)
 59 KOG2048 WD40 repeat protein [G  93.7     5.2 0.00011   47.0  19.5  153   52-218   121-283 (691)
 60 PHA03098 kelch-like protein; P  93.7     3.8 8.1E-05   48.3  19.5  189   53-257   294-514 (534)
 61 PF05935 Arylsulfotrans:  Aryls  93.6     3.4 7.3E-05   48.2  18.3  115   94-220   112-241 (477)
 62 PF05096 Glu_cyclase_2:  Glutam  93.5     3.8 8.1E-05   43.6  16.8  154   95-268    54-216 (264)
 63 TIGR03548 mutarot_permut cycli  93.5      13 0.00028   40.6  22.0  162   25-196    45-231 (323)
 64 KOG0291 WD40-repeat-containing  93.4      13 0.00029   44.3  22.2  110   95-218   361-474 (893)
 65 KOG1446 Histone H3 (Lys4) meth  93.4      13 0.00029   40.0  23.2  217   20-257    37-265 (311)
 66 KOG0315 G-protein beta subunit  93.4      12 0.00025   39.4  21.6  143   98-257    11-157 (311)
 67 COG3823 Glutamine cyclotransfe  93.2     2.8 6.1E-05   42.8  14.3  147   52-218    54-212 (262)
 68 KOG0310 Conserved WD40 repeat-  93.0     4.1 8.8E-05   46.1  16.7  151   52-218   121-276 (487)
 69 TIGR03548 mutarot_permut cycli  92.8      17 0.00037   39.7  21.8  145   64-218    40-200 (323)
 70 COG4257 Vgb Streptogramin lyas  92.8     6.9 0.00015   41.6  16.9  193   53-266    72-272 (353)
 71 PRK14131 N-acetylneuraminic ac  92.6      19 0.00042   40.4  22.1   70   53-124    38-122 (376)
 72 KOG0266 WD40 repeat-containing  92.5      15 0.00032   42.5  21.5  193   52-257   214-412 (456)
 73 KOG0278 Serine/threonine kinas  92.3     3.4 7.3E-05   43.2  13.7  108   95-218   154-262 (334)
 74 PHA02713 hypothetical protein;  92.1     3.5 7.5E-05   49.1  16.0  148   53-218   351-539 (557)
 75 PF14269 Arylsulfotran_2:  Aryl  92.0     5.1 0.00011   43.7  15.9  147   62-219    95-297 (299)
 76 PRK05137 tolB translocation pr  91.7      29 0.00063   39.7  24.1  188   51-257   211-415 (435)
 77 KOG3881 Uncharacterized conser  91.6     7.5 0.00016   43.0  16.2  188   51-257   113-323 (412)
 78 PTZ00420 coronin; Provisional   91.2      21 0.00045   42.6  20.9   69   56-126   141-209 (568)
 79 PLN00181 protein SPA1-RELATED;  90.7      52  0.0011   40.9  29.7  106   53-166   545-652 (793)
 80 TIGR03547 muta_rot_YjhT mutatr  90.0      29 0.00064   38.2  20.0  160   53-223    17-238 (346)
 81 KOG1445 Tumor-specific antigen  89.9     2.4 5.1E-05   49.3  10.9  150   51-218   638-808 (1012)
 82 KOG1539 WD repeat protein [Gen  89.9      24 0.00053   42.7  19.4  155   49-215   168-325 (910)
 83 PLN02919 haloacid dehalogenase  89.9      19  0.0004   46.3  20.3  157   53-215   694-893 (1057)
 84 PF06433 Me-amine-dh_H:  Methyl  89.9      36 0.00078   37.7  27.9  195   51-257   104-323 (342)
 85 KOG0274 Cdc4 and related F-box  89.8      30 0.00064   41.0  20.5  180   53-256   218-402 (537)
 86 PF08450 SGL:  SMP-30/Gluconola  89.7      28 0.00061   36.2  26.9  142   53-210    11-164 (246)
 87 KOG0285 Pleiotropic regulator   89.6      28 0.00061   38.3  18.0  232   55-314   207-441 (460)
 88 PF14269 Arylsulfotran_2:  Aryl  89.6       6 0.00013   43.2  13.7  112   52-171   153-297 (299)
 89 PRK04922 tolB translocation pr  88.0      55  0.0012   37.5  22.9  150   52-215   214-373 (433)
 90 PRK04792 tolB translocation pr  88.0      57  0.0012   37.6  23.7  148   51-214   227-386 (448)
 91 COG4257 Vgb Streptogramin lyas  87.3      27 0.00059   37.3  15.8  194   53-266   114-315 (353)
 92 PRK03629 tolB translocation pr  87.1      62  0.0014   37.0  23.5  151   51-215   208-368 (429)
 93 KOG0275 Conserved WD40 repeat-  86.6     5.9 0.00013   42.6  10.7  184   62-257   274-470 (508)
 94 KOG4441 Proteins containing BT  86.6      14 0.00031   44.1  15.3  173   53-241   284-482 (571)
 95 PRK00178 tolB translocation pr  86.4      65  0.0014   36.6  23.5  148   51-214   208-367 (430)
 96 KOG0310 Conserved WD40 repeat-  86.0      14  0.0003   42.0  13.6  113   55-177   168-283 (487)
 97 KOG4547 WD40 repeat-containing  85.9      20 0.00043   41.7  15.1  106  145-258    69-176 (541)
 98 KOG0649 WD40 repeat protein [G  84.7      56  0.0012   34.3  17.0  105   58-170    76-194 (325)
 99 KOG0286 G-protein beta subunit  84.6      40 0.00087   36.3  15.5  152   52-216   155-309 (343)
100 KOG0270 WD40 repeat-containing  84.4      29 0.00064   39.1  15.1  119   97-231   257-381 (463)
101 KOG0266 WD40 repeat-containing  84.2      44 0.00095   38.7  17.7  158   53-218   258-417 (456)
102 KOG0315 G-protein beta subunit  83.8      63  0.0014   34.2  18.3   60   53-114    95-154 (311)
103 TIGR03547 muta_rot_YjhT mutatr  83.1      79  0.0017   34.8  19.8  178   53-241    63-328 (346)
104 KOG1274 WD40 repeat protein [G  83.0      63  0.0014   39.7  18.1  186   52-254    65-262 (933)
105 KOG0303 Actin-binding protein   83.0      21 0.00047   39.7  13.2   71   53-126   144-215 (472)
106 PHA03098 kelch-like protein; P  82.1      47   0.001   39.1  17.2  135   93-240   292-443 (534)
107 PHA02790 Kelch-like protein; P  82.1      57  0.0012   38.0  17.6  147   94-257   270-426 (480)
108 KOG2321 WD40 repeat protein [G  81.3      36 0.00077   39.7  14.6  176   56-241   148-331 (703)
109 KOG0293 WD40 repeat-containing  81.2      36 0.00079   38.2  14.1  212   31-257   259-473 (519)
110 PRK00178 tolB translocation pr  80.1 1.1E+02  0.0025   34.7  23.2  187   54-257   164-366 (430)
111 COG3391 Uncharacterized conser  79.5 1.1E+02  0.0025   34.3  22.7  191   52-257    84-286 (381)
112 KOG0279 G protein beta subunit  79.4      78  0.0017   34.0  15.3   70   52-121   116-187 (315)
113 KOG4441 Proteins containing BT  79.1      68  0.0015   38.4  17.0  172   53-241   332-529 (571)
114 PRK04043 tolB translocation pr  78.9 1.3E+02  0.0028   34.5  23.5  148   51-214   197-361 (419)
115 COG4946 Uncharacterized protei  78.5 1.3E+02  0.0029   34.5  20.4  190   49-257   231-434 (668)
116 PF14727 PHTB1_N:  PTHB1 N-term  78.3      41 0.00088   38.5  14.2   93   31-128   231-330 (418)
117 KOG4499 Ca2+-binding protein R  77.6      43 0.00093   35.1  12.5  115  105-236   138-265 (310)
118 PLN02193 nitrile-specifier pro  77.6 1.5E+02  0.0032   34.5  23.0  198   53-263   175-417 (470)
119 KOG0270 WD40 repeat-containing  77.6      54  0.0012   37.1  14.2   94   30-129   268-376 (463)
120 PLN02153 epithiospecifier prot  77.6 1.2E+02  0.0026   33.4  23.0  196   53-257    32-287 (341)
121 KOG1036 Mitotic spindle checkp  77.0      79  0.0017   34.3  14.7  109   86-211    15-125 (323)
122 COG3391 Uncharacterized conser  76.9 1.3E+02  0.0029   33.8  19.7  157   51-218   125-291 (381)
123 PRK04792 tolB translocation pr  76.9 1.5E+02  0.0032   34.2  22.8  150   94-257   228-385 (448)
124 PRK14131 N-acetylneuraminic ac  76.4 1.4E+02   0.003   33.5  20.2   36  204-241   314-350 (376)
125 TIGR02800 propeller_TolB tol-p  75.7 1.4E+02  0.0031   33.4  23.6  149   51-214   199-358 (417)
126 PLN02193 nitrile-specifier pro  75.6 1.3E+02  0.0029   34.9  17.9  152   53-218   228-416 (470)
127 KOG0282 mRNA splicing factor [  74.4      32  0.0007   39.2  11.6   73   52-125   269-341 (503)
128 KOG1274 WD40 repeat protein [G  74.0      74  0.0016   39.1  15.1  119   52-174   107-230 (933)
129 KOG0275 Conserved WD40 repeat-  73.9      89  0.0019   34.0  14.1  181   26-220   282-477 (508)
130 TIGR02800 propeller_TolB tol-p  73.5 1.6E+02  0.0035   33.0  22.9  149   94-257   200-357 (417)
131 COG2706 3-carboxymuconate cycl  72.8 1.6E+02  0.0034   32.6  24.0   69  187-257    50-123 (346)
132 KOG0643 Translation initiation  72.8 1.4E+02   0.003   32.0  19.4  103  102-218    70-185 (327)
133 KOG2106 Uncharacterized conser  71.2 1.7E+02  0.0037   33.9  16.2  147   53-218   339-486 (626)
134 PRK05137 tolB translocation pr  69.1 2.1E+02  0.0046   32.6  23.3  137   64-214   183-326 (435)
135 PF14870 PSII_BNR:  Photosynthe  68.9 1.6E+02  0.0034   32.3  15.3  170   19-198   122-296 (302)
136 KOG4547 WD40 repeat-containing  68.8 2.4E+02  0.0052   33.1  17.7  114   53-174    70-184 (541)
137 PRK02888 nitrous-oxide reducta  68.7 1.3E+02  0.0028   36.1  15.5  150   46-215   196-356 (635)
138 KOG0646 WD40 repeat protein [G  68.5 2.2E+02  0.0048   32.6  18.9   60  188-255   188-248 (476)
139 KOG0282 mRNA splicing factor [  67.3      45 0.00097   38.1  10.7  141   97-255   227-373 (503)
140 KOG0295 WD40 repeat-containing  67.2 2.1E+02  0.0046   31.9  17.6   52  203-257   315-367 (406)
141 PLN00033 photosystem II stabil  67.0 2.3E+02   0.005   32.3  20.4  129   29-171    73-214 (398)
142 PRK13684 Ycf48-like protein; P  67.0 2.1E+02  0.0045   31.7  18.8  179   21-218   109-294 (334)
143 KOG0318 WD40 repeat stress pro  66.4 2.6E+02  0.0057   32.6  28.1  182   53-257   290-476 (603)
144 COG3823 Glutamine cyclotransfe  66.2      69  0.0015   33.1  10.8  110   53-170   100-213 (262)
145 KOG1036 Mitotic spindle checkp  65.9 1.5E+02  0.0032   32.2  13.8   61   53-115    65-125 (323)
146 KOG0649 WD40 repeat protein [G  65.8 1.9E+02   0.004   30.7  24.3  159   96-265    72-245 (325)
147 PLN02153 epithiospecifier prot  65.5 2.2E+02  0.0047   31.3  18.2  152   53-218    85-290 (341)
148 KOG0647 mRNA export protein (c  65.2 2.1E+02  0.0046   31.1  16.6  154   52-222    83-240 (347)
149 cd00028 B_lectin Bulb-type man  64.5      48  0.0011   30.4   9.1   71   73-170    41-112 (116)
150 KOG1027 Serine/threonine prote  63.4      30 0.00065   42.3   9.0  109   52-177   106-216 (903)
151 PF08450 SGL:  SMP-30/Gluconola  63.3 1.9E+02  0.0041   29.9  27.5  145   95-255    11-165 (246)
152 KOG0306 WD40-repeat-containing  63.3 3.5E+02  0.0076   33.0  26.2  101   56-167   339-447 (888)
153 PRK03629 tolB translocation pr  63.0 2.8E+02   0.006   31.7  23.6  149   94-257   209-366 (429)
154 KOG0303 Actin-binding protein   62.9   1E+02  0.0022   34.6  12.1   92   96-198   144-237 (472)
155 smart00108 B_lectin Bulb-type   62.7      63  0.0014   29.5   9.4   81   63-170    30-111 (114)
156 PF14583 Pectate_lyase22:  Olig  62.3 2.3E+02   0.005   32.1  15.2  102   67-176    14-124 (386)
157 KOG0288 WD40 repeat protein Ti  61.8      73  0.0016   35.8  10.9  108  100-219   315-426 (459)
158 KOG1446 Histone H3 (Lys4) meth  61.1 2.5E+02  0.0054   30.6  26.4  202   37-265    13-227 (311)
159 KOG0271 Notchless-like WD40 re  61.1      40 0.00087   37.5   8.7  109   50-166   166-280 (480)
160 KOG2106 Uncharacterized conser  60.4 3.3E+02  0.0071   31.7  23.8  220   53-312   257-489 (626)
161 KOG2055 WD40 repeat protein [G  60.4   1E+02  0.0022   35.3  11.8   77   50-128   312-388 (514)
162 KOG0639 Transducin-like enhanc  59.5 1.3E+02  0.0028   34.7  12.5  112   51-172   519-631 (705)
163 PF05262 Borrelia_P83:  Borreli  59.1      59  0.0013   37.9  10.2   98  153-256   374-472 (489)
164 PF14870 PSII_BNR:  Photosynthe  58.9 2.8E+02   0.006   30.4  19.7  180   23-219    83-268 (302)
165 KOG0285 Pleiotropic regulator   58.7   3E+02  0.0065   30.7  21.8  146   56-217   166-314 (460)
166 KOG0646 WD40 repeat protein [G  58.4 3.4E+02  0.0073   31.2  17.0  132   56-198    96-239 (476)
167 KOG1272 WD40-repeat-containing  58.2      28  0.0006   39.6   7.0  179   53-255   141-324 (545)
168 KOG4378 Nuclear protein COP1 [  58.1 2.6E+02  0.0056   32.4  14.5  139   56-210   136-280 (673)
169 COG3386 Gluconolactonase [Carb  57.3 2.7E+02  0.0058   30.6  14.6  105   98-215    39-155 (307)
170 COG2706 3-carboxymuconate cycl  57.0 3.1E+02  0.0068   30.4  30.8  192   55-255     4-222 (346)
171 KOG0265 U5 snRNP-specific prot  55.2 3.1E+02  0.0068   29.9  14.3   33   95-127   101-133 (338)
172 KOG0295 WD40 repeat-containing  54.9 1.6E+02  0.0034   32.8  11.8   65   97-169   305-371 (406)
173 KOG0288 WD40 repeat protein Ti  54.9 3.3E+02  0.0072   30.9  14.5  185   53-258   231-421 (459)
174 PF04841 Vps16_N:  Vps16, N-ter  54.6 3.7E+02  0.0081   30.6  16.5   98   65-170    63-163 (410)
175 KOG0274 Cdc4 and related F-box  53.8 4.5E+02  0.0097   31.3  22.7  180   53-257   261-444 (537)
176 KOG3881 Uncharacterized conser  53.1      24 0.00052   39.2   5.5   73   52-124   258-330 (412)
177 KOG0280 Uncharacterized conser  53.0      28 0.00061   37.4   5.8   73   53-127   178-255 (339)
178 KOG0294 WD40 repeat-containing  52.7 3.5E+02  0.0076   29.7  16.8  186   45-256    90-283 (362)
179 PF06977 SdiA-regulated:  SdiA-  50.4 3.4E+02  0.0073   28.8  22.2  187   52-252    32-239 (248)
180 COG3419 PilY1 Tfp pilus assemb  49.9 2.4E+02  0.0053   35.7  13.7   27   97-123   583-609 (1036)
181 PF09910 DUF2139:  Uncharacteri  48.8 1.2E+02  0.0027   32.9   9.7   98  153-257    77-184 (339)
182 PF01453 B_lectin:  D-mannose b  48.7 1.2E+02  0.0027   27.8   8.9   60   95-170    19-78  (114)
183 KOG0263 Transcription initiati  48.6 1.4E+02  0.0031   36.1  11.2   63  102-172   553-617 (707)
184 PRK01742 tolB translocation pr  48.4 4.6E+02    0.01   29.8  20.7  144   51-215   213-366 (429)
185 KOG4499 Ca2+-binding protein R  48.1      92   0.002   32.8   8.4   83   95-177   169-256 (310)
186 smart00108 B_lectin Bulb-type   46.8 1.9E+02  0.0042   26.3   9.9   52  107-173    31-82  (114)
187 KOG1188 WD40 repeat protein [G  46.8 2.2E+02  0.0047   31.5  11.3   64  148-215    43-107 (376)
188 PF14727 PHTB1_N:  PTHB1 N-term  46.5 5.1E+02   0.011   29.8  20.9  187   53-257   145-363 (418)
189 PRK13684 Ycf48-like protein; P  43.8 4.8E+02    0.01   28.7  22.6  168   29-219    33-209 (334)
190 PF08553 VID27:  VID27 cytoplas  43.5 3.9E+02  0.0085   33.3  14.2  110   96-213   493-608 (794)
191 TIGR02276 beta_rpt_yvtn 40-res  43.2      68  0.0015   23.1   5.1   31   52-82      2-33  (42)
192 cd00028 B_lectin Bulb-type man  42.6   2E+02  0.0043   26.3   9.3   22  151-173    62-83  (116)
193 KOG1517 Guanine nucleotide bin  42.3 4.2E+02  0.0092   33.8  13.9  147   96-254  1177-1333(1387)
194 PF06433 Me-amine-dh_H:  Methyl  42.1 5.3E+02   0.012   28.7  30.5  189   54-258     3-217 (342)
195 KOG0308 Conserved WD40 repeat-  42.1 2.2E+02  0.0047   34.1  11.1  101   55-163   184-286 (735)
196 PRK04922 tolB translocation pr  42.0 5.7E+02   0.012   29.1  22.7  149   94-257   214-371 (433)
197 PF15525 DUF4652:  Domain of un  40.8 3.8E+02  0.0083   27.3  11.2   65  497-573    88-153 (200)
198 PF03178 CPSF_A:  CPSF A subuni  40.4 5.1E+02   0.011   28.0  22.0  175   64-254     3-202 (321)
199 KOG0265 U5 snRNP-specific prot  38.6      82  0.0018   34.1   6.6   63   52-115   101-164 (338)
200 PF14339 DUF4394:  Domain of un  38.5   5E+02   0.011   27.4  14.6  165   50-221    35-224 (236)
201 PF09910 DUF2139:  Uncharacteri  38.2 5.7E+02   0.012   28.0  18.9  157   26-216    16-187 (339)
202 PRK02889 tolB translocation pr  37.8 6.6E+02   0.014   28.6  23.6  149   51-215   205-365 (427)
203 COG3045 CreA Uncharacterized p  37.5      95  0.0021   30.2   6.2   58    1-61      3-62  (165)
204 KOG0283 WD40 repeat-containing  36.7 4.9E+02   0.011   31.8  13.3  110   96-218   421-540 (712)
205 KOG0379 Kelch repeat-containin  36.7 6.2E+02   0.013   29.6  14.2  155   94-257    69-252 (482)
206 KOG1188 WD40 repeat protein [G  36.4 4.8E+02    0.01   29.0  11.9  172   69-255    17-197 (376)
207 KOG2321 WD40 repeat protein [G  36.3 7.5E+02   0.016   29.5  14.0  157   94-257    62-261 (703)
208 KOG3914 WD repeat protein WDR4  35.6      83  0.0018   35.2   6.3   72   52-125   162-234 (390)
209 KOG0292 Vesicle coat complex C  35.3   1E+03   0.022   30.0  21.2  107   75-211   239-349 (1202)
210 PF02897 Peptidase_S9_N:  Proly  34.5   7E+02   0.015   27.9  19.1  146   63-215   252-409 (414)
211 PF05567 Neisseria_PilC:  Neiss  34.4 6.6E+02   0.014   27.8  13.4   55  201-257   180-242 (335)
212 KOG0281 Beta-TrCP (transducin   33.8 1.3E+02  0.0028   33.3   7.2   98   96-212   331-430 (499)
213 PF14783 BBS2_Mid:  Ciliary BBS  33.6 3.9E+02  0.0086   24.8  11.8   68   53-127    15-82  (111)
214 KOG0771 Prolactin regulatory e  32.3 5.6E+02   0.012   29.0  12.0   19  237-255   294-312 (398)
215 KOG2395 Protein involved in va  31.6 7.1E+02   0.015   29.4  12.8  116   95-217   344-465 (644)
216 KOG0639 Transducin-like enhanc  30.9 2.2E+02  0.0048   33.0   8.7   75   50-127   560-634 (705)
217 TIGR02276 beta_rpt_yvtn 40-res  30.8 1.1E+02  0.0025   21.8   4.6   30  188-220     3-32  (42)
218 PF08596 Lgl_C:  Lethal giant l  30.1 8.6E+02   0.019   27.6  13.6  182   53-265    97-300 (395)
219 KOG0281 Beta-TrCP (transducin   30.1 1.1E+02  0.0025   33.6   6.1   72   52-126   329-400 (499)
220 KOG1912 WD40 repeat protein [G  29.9 5.4E+02   0.012   31.8  11.8   76   98-177    81-158 (1062)
221 KOG0263 Transcription initiati  29.6 2.1E+02  0.0045   34.7   8.6   69   56-125   550-618 (707)
222 KOG0271 Notchless-like WD40 re  29.6 8.7E+02   0.019   27.5  18.9   64   95-167   127-192 (480)
223 KOG0289 mRNA splicing factor [  29.5 9.2E+02    0.02   27.7  19.7   75   51-126   313-389 (506)
224 PRK01742 tolB translocation pr  28.9   9E+02    0.02   27.4  22.2  183   53-257   168-364 (429)
225 COG3386 Gluconolactonase [Carb  28.2 8.1E+02   0.018   26.8  12.6   75   51-127   172-255 (307)
226 PF11589 DUF3244:  Domain of un  27.0 1.2E+02  0.0026   27.4   5.0   24  731-755    48-71  (106)
227 PF08553 VID27:  VID27 cytoplas  27.0 2.6E+02  0.0056   34.8   9.1   63   53-117   542-608 (794)
228 PRK02889 tolB translocation pr  26.7 9.8E+02   0.021   27.2  27.0  149   94-257   206-363 (427)
229 PF01456 Mucin:  Mucin-like gly  26.1      49  0.0011   31.7   2.4   27    1-27      1-27  (143)
230 COG4946 Uncharacterized protei  25.8 1.1E+03   0.024   27.5  24.2  195   92-312   232-441 (668)
231 PF02897 Peptidase_S9_N:  Proly  25.3 9.8E+02   0.021   26.7  27.5   65  154-220   252-320 (414)
232 KOG0650 WD40 repeat nucleolar   25.3   2E+02  0.0044   34.0   7.3   31   98-128   414-444 (733)
233 COG4447 Uncharacterized protei  24.5 6.6E+02   0.014   27.3  10.3  174   54-242   139-322 (339)
234 COG3292 Predicted periplasmic   24.3 3.3E+02  0.0071   32.4   8.7   70   53-129   175-244 (671)
235 KOG0279 G protein beta subunit  24.3 9.4E+02    0.02   26.1  22.1   57   69-127    49-106 (315)
236 TIGR00548 lolB outer membrane   22.4 1.4E+02  0.0031   30.4   5.1   58   53-118    51-108 (202)
237 KOG1240 Protein kinase contain  21.4 4.8E+02    0.01   33.8   9.8   70   55-124  1165-1235(1431)
238 KOG0319 WD40-repeat-containing  21.2 1.6E+03   0.034   27.6  22.0   72   53-124    30-102 (775)
239 PRK13861 type IV secretion sys  20.9 3.6E+02  0.0077   29.4   8.0   33    3-35      2-34  (292)
240 PF14583 Pectate_lyase22:  Olig  20.7 3.3E+02  0.0072   30.9   7.8   68   53-123    47-119 (386)
241 KOG0301 Phospholipase A2-activ  20.5 8.2E+02   0.018   29.7  11.0   94   56-161   193-287 (745)
242 KOG1445 Tumor-specific antigen  20.4 1.1E+03   0.024   28.5  11.8   60   94-162   139-200 (1012)
243 PF01453 B_lectin:  D-mannose b  20.4 6.6E+02   0.014   22.9   8.7   60   53-122    19-78  (114)
244 KOG2110 Uncharacterized conser  20.2 1.3E+03   0.027   26.1  19.9  176   63-255    68-249 (391)
245 PF08894 DUF1838:  Protein of u  20.2      74  0.0016   33.3   2.4   67  700-769    24-90  (238)

No 1  
>KOG2103 consensus Uncharacterized conserved protein [Function unknown]
Probab=100.00  E-value=1.4e-113  Score=960.33  Aligned_cols=702  Identities=34%  Similarity=0.487  Sum_probs=551.9

Q ss_pred             HHHhccccccceeecccccEeeEEeccCceeeeeeeeeccCCCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeee
Q 003800           12 FLSSCTIPSLSLYEDQVGLMDWHQQYIGKVKHAVFHTQKTGRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGID   91 (794)
Q Consensus        12 ~l~~~~~~~~Al~edqvG~~dW~~~~vG~~~~~~f~~~~~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~   91 (794)
                      +|+++..+|+|+||||+||+|||++++| ++...|+.-.+..+++||+|++|+||+||.+||+++|||.++.+....+..
T Consensus         7 ~~~~~~~~~aav~edq~gkfdwr~~~vG-~~k~~~~~~~t~~~rlivsT~~~vlAsL~~~tGei~WRqvl~~~~~~~~~~   85 (910)
T KOG2103|consen    7 ALALLLYRAAAVYEDQAGKFDWRQQLVG-VKKVNFLVYDTKSKRLIVSTEKGVLASLNLRTGEIIWRQVLEPKTSGLGVP   85 (910)
T ss_pred             HHHHHHHHHHHHHHHHhhhcchhhhccc-ceeEEEEeecCCCceEEEEeccchhheecccCCcEEEEEeccCCCcccCcc
Confidence            3333445667999999999999999999 555556666667899999999999999999999999999998874432331


Q ss_pred             eeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEEECCEEEEEECCCCcEEEEEe
Q 003800           92 IALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSSKGCLHAVSSIDGEILWTRD  171 (794)
Q Consensus        92 ~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~~g~l~ald~~tG~~~W~~~  171 (794)
                      .     .-++|.+|..+|+||.++|.+.|+..+..+ . ....+..       ...+.|+.+     .....|+..|...
T Consensus        86 ~-----~~~iS~dg~~lr~wn~~~g~l~~~i~l~~g-~-~~~~~~v-------~~~i~v~~g-----~~~~~g~l~w~~~  146 (910)
T KOG2103|consen   86 L-----TNTISVDGRYLRSWNTNNGILDWEIELADG-F-KGLLLEV-------NKGIAVLNG-----HTRKFGELKWVES  146 (910)
T ss_pred             e-----eEEEccCCcEEEeecCCCceeeeecccccc-c-ceeEEEE-------ccceEEEcc-----eeccccceeehhh
Confidence            1     115788889999999999999999999876 3 1111111       222333333     5667899999998


Q ss_pred             ccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCcee-eeeeeecccCccCceEEEcCcEEEEEECCCCeEEE
Q 003800          172 FAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELL-NHETAAFSGGFVGDVALVSSDTLVTLDTTRSILVT  250 (794)
Q Consensus       172 ~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~-w~~~v~~~~~~s~~~~~vg~~~lv~~d~~~g~L~v  250 (794)
                      .+.......|.+.+...+.+|++++--.++..|.+++..+|+.. |+.++..|+.-...|.-+.+.+++|.+   |.+..
T Consensus       147 ~~~~~~~~~q~~~~~~t~vvy~~~~l~~s~~~V~~~~~~~g~v~~~~~~v~~pw~~~~~c~~~k~~vl~~s~---g~l~s  223 (910)
T KOG2103|consen  147 FSISIEEDLQDAKIYGTDVVYVLGLLKRSGSCVQQVFSDDGEVTGPQSTVLGPWFKVLSCSTDKEVVLVCSN---GTLIS  223 (910)
T ss_pred             ccccchhHHHHhhhccCcEEEEEEEEecCCceEEEEEccCCcEecceeeeecCcccccccccccceEEEcCC---CCeEE
Confidence            87654434454334578889999887667779999999999999 888888886555566555666788885   57888


Q ss_pred             EEeecceeeeEEEeecccCCCCCCceEEeecCCcc-eeEEEecCcEEEEEEecCCcEEEEEeecCcceeeeeeeecCCce
Q 003800          251 VSFKNRKIAFQETHLSNLGEDSSGMVEILPSSLTG-MFTVKINNYKLFIRLTSEDKLEVVHKVDHETVVSDALVFSEGKE  329 (794)
Q Consensus       251 ~~l~sg~~~~~~~~l~~l~~~~~~~~~~~~~~~~~-~~~~~~~~~~~l~~~~~~~~~~v~~~~~~~~~~s~~~~~~~~~~  329 (794)
                      .|+..++....+...           +++-. +.| ...+..++|..++.+++.|...++......-..+.+++..++..
T Consensus       224 ~di~~~~~~~~q~~~-----------e~l~~-l~g~~i~~~g~~~~~~V~V~s~~~~~v~~~~~~e~~lsdsl~~~~d~e  291 (910)
T KOG2103|consen  224 LDISSQKVQISQLLA-----------EILLP-LTGDLILLDGNKHTAMVSVNSSSNHWVYLFCRSEVDLSDSLEAGGDTE  291 (910)
T ss_pred             EEEEeeccchhhhhh-----------hhhhc-cCCceEEecCCCceeEEEEecCCCeEEEeecccceeeccccccccccc
Confidence            888776521111111           11110 111 44455556778899987776666544332223344455556666


Q ss_pred             EEEEEEEcCce----EEEEEeeeeeeecCccceeeeeccCCceeEEEEEEEEEecCCcceEEEEEEEcCCcEEEEECCeE
Q 003800          330 AFAVVEHGGSK----VDITVKPGQDWNNNLVQESIEMDHQRGLVHKVFINNYLRTDRSHGFRALIVMEDHSLLLVQQGKI  405 (794)
Q Consensus       330 ~~~~~~~~~~~----v~~~~~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~~~~~~~~~~~r~l~~t~d~~~~l~~~g~~  405 (794)
                      ++.++.+..+.    |+.......+..    +....++...+.|+.+..  +..++++.+||++++++|+.+.+.|||.+
T Consensus       292 ~~~si~~~ss~~~~~V~~vn~l~~~~~----~~~~~~~~~l~~p~~F~~--~~~~~~e~~~~al~~~~d~~~~~~qng~i  365 (910)
T KOG2103|consen  292 ASKSIHPESSYLFDQVFIVNNLYLVLD----AQSILLEQKLSRPEVFGT--FEYFDREIGALALVVNDDHSLLFLQNGLI  365 (910)
T ss_pred             cceeeecccchhhheeeehhhhhhcch----hhhhhhhcccCcchhcce--eEEeccccceEEEEEecCceEEEEeCcce
Confidence            66665555433    222222222222    223344444555644322  34455566999999999999999999887


Q ss_pred             E-EEeccccccceeEEEEeCCCCcccchhhhhhhhhc----hhHHHHHH-hhhcccccCChhhHHHHhh-------cc-c
Q 003800          406 V-WNREDALASIIDVTTSELPVEKEGVSVAKVEHSLF----EWLKGHML-KLKGTLMLASPEDVAAIQA-------IR-L  471 (794)
Q Consensus       406 ~-W~ReEsLa~i~~~~~vdlp~~~~~~~~~~le~e~~----~~~~~~~~-Rl~~~~~~~~~~~~~~l~~-------~~-~  471 (794)
                      . |+|||+||++++++|+|||++++   ++.+|.||.    +++++||+ |+.+        |+.+|++       .+ .
T Consensus       366 ~~WsREEsLa~vvd~~~vdlpLs~~---~~~~e~e~~~~~~~~l~~afl~R~~t--------q~~ql~~~~~h~~~~~~~  434 (910)
T KOG2103|consen  366 LVWSREESLANVVDVEMVDLPLSRD---QGLLEDEFEDKESNSLWGAFLKRLTT--------QFNQLINLLKHNQGLPTP  434 (910)
T ss_pred             EEeehhhhhhhhccceeeccccccc---hhhHHHHhhccccchHHHHHHHHHHH--------HHHHHHHHHHhhhccCCC
Confidence            7 99999999999999999999998   667777763    36999999 9999        8888766       22 4


Q ss_pred             cccCccc-ccccCCCceEEEEEEecCceEEEEECCCCcEEEEEecccCCCCCCCceee-EEeeecCcccCCCCCCeEEEE
Q 003800          472 KSSEKSK-MTRDHNGFRKLLIVLTKARKIFALHSGDGRVVWSLLLHKSEACDSPTELN-LYQWQTPHHHAMDENPSVLVV  549 (794)
Q Consensus       472 ~~~~~~~-~~rD~FGf~Klivv~T~~Gkl~alds~~G~i~W~~~l~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~vv  549 (794)
                      +++.+++ +.||.||||||||++|++|||||||+.+|+++|++.+++...  +++.++ ++|+..+|||   +++.|.|+
T Consensus       435 ~s~~~n~~l~rD~Fgl~K~iIvlT~tGkiFglds~~G~i~Wkl~L~~~~~--~~e~v~l~vqr~~~H~~---~d~~~svl  509 (910)
T KOG2103|consen  435 LSALKNKDLSRDKFGLRKMIIVLTSTGKIFGLDSVDGQIHWKLWLPNVQQ--NPEGVKLFVQRTTAHFP---LDEDPSVL  509 (910)
T ss_pred             cccccccceeecccCceeEEEEEecCceEEEEEcCCCeEEEEEecCcccC--CcccceEEEEeccccCC---CCCCCeEE
Confidence            4555666 999999999999999999999999999999999999997432  356899 7888888998   78888888


Q ss_pred             EEecCCCCCCcEEEEEEccCCceecccccccceeEEEeecccCCccceEEEEEcCCCceEEccCChhhhhhhhhcccceE
Q 003800          550 GRCGVSSKAPAILSFVDTYTGKELNSFDLVHSAVQVMPLPFTDSTEQRLHLLVDDDRRIHLYPKTSEAISIFQQEFSNIY  629 (794)
Q Consensus       550 ~~~~~~~~~~~~~~~~d~~tG~~~~~~~l~~~~~~~~~lp~~~~~~~~~~~~~d~~~~v~~~P~~~~~~~~~~~~~~~~~  629 (794)
                      ++++  .+++++++.|||++|++.++.+++++++|.++||.++.++++.++++|+.+.+++||.+.+.+..++++++++|
T Consensus       510 f~~k--~s~~gvly~fn~~~Gkv~s~~~l~~~v~q~sllp~~~~d~~~~illidd~~~v~l~P~~~~~l~~~~~~a~s~y  587 (910)
T KOG2103|consen  510 FVHK--GSGNGVLYEFNPITGKVISRSPLDYRVKQLSLLPVTEHDHQYLILLIDDHLKVKLYPGTSTDLEIVANEASSIY  587 (910)
T ss_pred             EEec--cCCCeEEEEEecCcceeeecCccCCceeeEEeccccccccceeEEEecccceEEecCCCcccchhhhhccCccE
Confidence            8876  57899999999999999998889999999999999999999999999999999999999999999999999999


Q ss_pred             EEEEEccCCeEEEEEEeecCCCcccccccceeeEeEEEEcCCCCceEEEEeeccCCcccccceeeecCCeeEeeccCCce
Q 003800          630 WYSVEADNGIIKGHAVKSKCAGEVLDDFCFETRVLWSIIFPMESEKIIAAVSRKQNEVVHTQAKVTSEQDVMYKYISKNL  709 (794)
Q Consensus       630 ~~~~d~~~~~l~G~~~~~~~~~~~~~~~~~~~~~~W~~~~~~~~e~Iv~~~~r~~~e~v~S~g~VLgDRsVLYKYLNPNl  709 (794)
                      +|++|.++|.|+||.++.+          ++..++|+.++|++.|+||++..|+++|+|||+|||||||+||||||||||
T Consensus       588 ~Yt~e~~~~~i~Gy~i~~~----------lT~~~~W~~~l~~e~e~IIav~~r~p~e~VhSqGrVlgdrsVlYKYlnPNL  657 (910)
T KOG2103|consen  588 LYTVEADTGGIYGYIIKAD----------LTTTQTWKKNLPSEKEKIIAVKGRNPNEHVHSQGRVLGDRSVLYKYLNPNL  657 (910)
T ss_pred             EEEEEcccCcEEEEEEecc----------cceeeeeeeccCchhheeeEeccCCcchheeecceecccceeeeeccCcch
Confidence            9999999999999999844          578899999999777999999999999999999999999999999999999


Q ss_pred             EEEEEEcCCCCCCcCCCCCCCcEEEEEEEEceeeeEEEEEEecCCCCCceEEEEecEEEEEEEeCCcceEEEEEEEEecC
Q 003800          710 LFVATVAPKASGHIGSADPDEAWLVVYLIDTITGRILHRMTHHGAQGPVHAVLSENWVVYHYFNLRAHRYEMSVTEIYDQ  789 (794)
Q Consensus       710 ~~v~t~~~~~~~~~~~~~~~~~~l~v~liD~VTG~il~s~~h~~~~~pi~~v~~ENWvvYsy~~~~~~~~~i~vvELyE~  789 (794)
                      +||+|.++++       ++   ..++||||+|||+|+|+++|+++++|||+||||||+||||||++.+|+||+|+|||||
T Consensus       658 ~A~~t~~~~~-------~~---~~~~~LiD~VTG~Ivht~~h~k~~~PvhiVfSENWvvYsYfs~k~~rteltvvELYEg  727 (910)
T KOG2103|consen  658 AAVATANPDD-------HH---ETFLYLIDTVTGSIVHTQSHQKARGPVHIVFSENWVVYSYFSDKARRTELTVVELYEG  727 (910)
T ss_pred             hheeecCcCC-------ce---eEEEEEEeeeeeEEEEeeehhhhcCceEEEEecceEEEEEeccccccceEEEEEEecC
Confidence            9999999983       21   1256999999999999999999999999999999999999999999999999999999


Q ss_pred             Ccc
Q 003800          790 SRA  792 (794)
Q Consensus       790 ~~~  792 (794)
                      ++.
T Consensus       728 s~~  730 (910)
T KOG2103|consen  728 SEQ  730 (910)
T ss_pred             Ccc
Confidence            864


No 2  
>PRK11138 outer membrane biogenesis protein BamB; Provisional
Probab=99.89  E-value=2.1e-20  Score=210.56  Aligned_cols=241  Identities=20%  Similarity=0.304  Sum_probs=168.0

Q ss_pred             ChHHHHHHHHHHHHhccccccceeec---------------ccccEeeEEeccCceeeeee--eeeccCCCEEEEEeCCC
Q 003800            1 MAIRFIILTLLFLSSCTIPSLSLYED---------------QVGLMDWHQQYIGKVKHAVF--HTQKTGRKRVVVSTEEN   63 (794)
Q Consensus         1 ~~~~~~l~~l~~l~~~~~~~~Al~ed---------------qvG~~dW~~~~vG~~~~~~f--~~~~~~~~~Vyv~t~~g   63 (794)
                      |-+|.+++..|++++|++.|+.++..               ..++..|+.++ |......+  ..|...+++||+++.+|
T Consensus         1 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~W~~~~-g~g~~~~~~~~sPvv~~~~vy~~~~~g   79 (394)
T PRK11138          1 MQLRKTLLPGLLSVTLLSGCSSFNSEEDVVKMSPLPQVENQFTPTTVWSTSV-GDGVGDYYSRLHPAVAYNKVYAADRAG   79 (394)
T ss_pred             CcHHHHHHHHHHHHHHhhhcCCCCCCccccCCCCcccccccCCcceeeEEEc-CCCCccceeeeccEEECCEEEEECCCC
Confidence            56788777777777777777765421               25678999986 43321111  13555689999999999


Q ss_pred             EEEEEECcCCccceEEEcCcccc---------eeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCCcc
Q 003800           64 VIASLDLRHGEIFWRHVLGINDV---------VDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLL  134 (794)
Q Consensus        64 ~l~ALn~~tG~ivWR~~l~~~~~---------i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~  134 (794)
                      .|+|||++||+++|++.+.....         +.+. +...++.|++++.++.++|+|++||+++|+.++.++..  +.+
T Consensus        80 ~l~ald~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~-~~v~~~~v~v~~~~g~l~ald~~tG~~~W~~~~~~~~~--ssP  156 (394)
T PRK11138         80 LVKALDADTGKEIWSVDLSEKDGWFSKNKSALLSGG-VTVAGGKVYIGSEKGQVYALNAEDGEVAWQTKVAGEAL--SRP  156 (394)
T ss_pred             eEEEEECCCCcEeeEEcCCCcccccccccccccccc-cEEECCEEEEEcCCCEEEEEECCCCCCcccccCCCcee--cCC
Confidence            99999999999999999876211         1111 24456677777766799999999999999999876543  223


Q ss_pred             ccccccccccCCeEEEE-ECCEEEEEECCCCcEEEEEeccCcceeee-eEEEEecCCEEEEEEecCCceeEEEEEEcCCC
Q 003800          135 LVPTNLKVDKDSLILVS-SKGCLHAVSSIDGEILWTRDFAAESVEVQ-QVIQLDESDQIYVVGYAGSSQFHAYQINAMNG  212 (794)
Q Consensus       135 ~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG~~~W~~~~~~~~~~~~-~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG  212 (794)
                      ++.       ++.+++. .+|.|+++|.+||+++|+++...+..... ...+...++.+|+.+..|    .++++|+++|
T Consensus       157 ~v~-------~~~v~v~~~~g~l~ald~~tG~~~W~~~~~~~~~~~~~~~sP~v~~~~v~~~~~~g----~v~a~d~~~G  225 (394)
T PRK11138        157 VVS-------DGLVLVHTSNGMLQALNESDGAVKWTVNLDVPSLTLRGESAPATAFGGAIVGGDNG----RVSAVLMEQG  225 (394)
T ss_pred             EEE-------CCEEEEECCCCEEEEEEccCCCEeeeecCCCCcccccCCCCCEEECCEEEEEcCCC----EEEEEEccCC
Confidence            332       5677776 48999999999999999998754322100 011123567888766555    8999999999


Q ss_pred             ceeeeeeeecccC---------ccCceEEEcCcEEEEEECCCCeEEEEEeeccee
Q 003800          213 ELLNHETAAFSGG---------FVGDVALVSSDTLVTLDTTRSILVTVSFKNRKI  258 (794)
Q Consensus       213 ~~~w~~~v~~~~~---------~s~~~~~vg~~~lv~~d~~~g~L~v~~l~sg~~  258 (794)
                      +.+|+.++..+.+         +..++++.++.++ +.+ ..|.++++|+.+|++
T Consensus       226 ~~~W~~~~~~~~~~~~~~~~~~~~~sP~v~~~~vy-~~~-~~g~l~ald~~tG~~  278 (394)
T PRK11138        226 QLIWQQRISQPTGATEIDRLVDVDTTPVVVGGVVY-ALA-YNGNLVALDLRSGQI  278 (394)
T ss_pred             hhhheeccccCCCccchhcccccCCCcEEECCEEE-EEE-cCCeEEEEECCCCCE
Confidence            9999987655422         2234555454444 444 358999999999984


No 3  
>TIGR03300 assembly_YfgL outer membrane assembly lipoprotein YfgL. Members of this protein family are YfgL, a lipoprotein component of a complex that acts in protein insertion into the bacterial outer membrane. Other members of this complex are NlpB, YfiO, and YaeT. This protein contains multiple copies of a repeat that, in other contexts, are associated with binding of the coenzyme PQQ.
Probab=99.84  E-value=4.5e-18  Score=190.28  Aligned_cols=216  Identities=16%  Similarity=0.275  Sum_probs=151.6

Q ss_pred             eecccccEeeEEeccCceee-e-eeeeeccCCCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEE
Q 003800           24 YEDQVGLMDWHQQYIGKVKH-A-VFHTQKTGRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITL  101 (794)
Q Consensus        24 ~edqvG~~dW~~~~vG~~~~-~-~f~~~~~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~V  101 (794)
                      ..++.|++.|+.++ |.... . .-..|...+++||+++.+|.|+|+|++||+++|++.+...  +.+. +..+++.+++
T Consensus        35 ~~~~~~~~~W~~~~-~~~~~~~~~~~~p~v~~~~v~v~~~~g~v~a~d~~tG~~~W~~~~~~~--~~~~-p~v~~~~v~v  110 (377)
T TIGR03300        35 QPTVKVDQVWSASV-GDGVGHYYLRLQPAVAGGKVYAADADGTVVALDAETGKRLWRVDLDER--LSGG-VGADGGLVFV  110 (377)
T ss_pred             cccCcceeeeEEEc-CCCcCccccccceEEECCEEEEECCCCeEEEEEccCCcEeeeecCCCC--cccc-eEEcCCEEEE
Confidence            45678999999987 44321 1 1123555689999999999999999999999999999765  3322 3456677878


Q ss_pred             EccCCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEE-ECCEEEEEECCCCcEEEEEeccCcceeee
Q 003800          102 SSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVS-SKGCLHAVSSIDGEILWTRDFAAESVEVQ  180 (794)
Q Consensus       102 s~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG~~~W~~~~~~~~~~~~  180 (794)
                      ++.++.+++||+.+|+++|+..+.++..  ..+++      . ++.+++. .+|.|+++|.++|+++|+++...+.....
T Consensus       111 ~~~~g~l~ald~~tG~~~W~~~~~~~~~--~~p~v------~-~~~v~v~~~~g~l~a~d~~tG~~~W~~~~~~~~~~~~  181 (377)
T TIGR03300       111 GTEKGEVIALDAEDGKELWRAKLSSEVL--SPPLV------A-NGLVVVRTNDGRLTALDAATGERLWTYSRVTPALTLR  181 (377)
T ss_pred             EcCCCEEEEEECCCCcEeeeeccCceee--cCCEE------E-CCEEEEECCCCeEEEEEcCCCceeeEEccCCCceeec
Confidence            7766799999999999999998876543  22222      2 5667776 58999999999999999998765432110


Q ss_pred             e-EEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeeeeeecccC---------ccCceEEEcCcEEEEEECCCCeEEE
Q 003800          181 Q-VIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHETAAFSGG---------FVGDVALVSSDTLVTLDTTRSILVT  250 (794)
Q Consensus       181 ~-~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~~~---------~s~~~~~vg~~~lv~~d~~~g~L~v  250 (794)
                      . ..+...++.+|+....|    +++++|+++|+.+|+..+..+.+         ....+.+ .++.+++.+ ..|.+++
T Consensus       182 ~~~sp~~~~~~v~~~~~~g----~v~ald~~tG~~~W~~~~~~~~g~~~~~~~~~~~~~p~~-~~~~vy~~~-~~g~l~a  255 (377)
T TIGR03300       182 GSASPVIADGGVLVGFAGG----KLVALDLQTGQPLWEQRVALPKGRTELERLVDVDGDPVV-DGGQVYAVS-YQGRVAA  255 (377)
T ss_pred             CCCCCEEECCEEEEECCCC----EEEEEEccCCCEeeeeccccCCCCCchhhhhccCCccEE-ECCEEEEEE-cCCEEEE
Confidence            0 00113456777544334    89999999999999987654422         1223443 334444444 3588999


Q ss_pred             EEeeccee
Q 003800          251 VSFKNRKI  258 (794)
Q Consensus       251 ~~l~sg~~  258 (794)
                      +|+++|++
T Consensus       256 ~d~~tG~~  263 (377)
T TIGR03300       256 LDLRSGRV  263 (377)
T ss_pred             EECCCCcE
Confidence            99999874


No 4  
>PRK11138 outer membrane biogenesis protein BamB; Provisional
Probab=99.77  E-value=1.6e-16  Score=179.15  Aligned_cols=213  Identities=15%  Similarity=0.244  Sum_probs=146.7

Q ss_pred             cccccEeeEEeccCceee------ee-eeeeccCCCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEE
Q 003800           26 DQVGLMDWHQQYIGKVKH------AV-FHTQKTGRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYV   98 (794)
Q Consensus        26 dqvG~~dW~~~~vG~~~~------~~-f~~~~~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~   98 (794)
                      .+.|+..|++++-+....      .. ...|...+++||+++.+|.|+|||++||+++|++.+...  +... +...++.
T Consensus        86 ~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~v~~~~v~v~~~~g~l~ald~~tG~~~W~~~~~~~--~~ss-P~v~~~~  162 (394)
T PRK11138         86 ADTGKEIWSVDLSEKDGWFSKNKSALLSGGVTVAGGKVYIGSEKGQVYALNAEDGEVAWQTKVAGE--ALSR-PVVSDGL  162 (394)
T ss_pred             CCCCcEeeEEcCCCcccccccccccccccccEEECCEEEEEcCCCEEEEEECCCCCCcccccCCCc--eecC-CEEECCE
Confidence            458999999987542110      01 112445688999999999999999999999999988654  3332 2344566


Q ss_pred             EEEEccCCeEEEEeCCCCcEeEEEeccCcccc---CCccccccccccccCCeEEEE-ECCEEEEEECCCCcEEEEEeccC
Q 003800           99 ITLSSDGSTLRAWNLPDGQMVWESFLRGSKHS---KPLLLVPTNLKVDKDSLILVS-SKGCLHAVSSIDGEILWTRDFAA  174 (794)
Q Consensus        99 V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s---~~~~~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG~~~W~~~~~~  174 (794)
                      ++++..++.++|+|++||+++|+.....+...   ...|.+      . ++.+++. .+|.++++|..+|+++|+.+...
T Consensus       163 v~v~~~~g~l~ald~~tG~~~W~~~~~~~~~~~~~~~sP~v------~-~~~v~~~~~~g~v~a~d~~~G~~~W~~~~~~  235 (394)
T PRK11138        163 VLVHTSNGMLQALNESDGAVKWTVNLDVPSLTLRGESAPAT------A-FGGAIVGGDNGRVSAVLMEQGQLIWQQRISQ  235 (394)
T ss_pred             EEEECCCCEEEEEEccCCCEeeeecCCCCcccccCCCCCEE------E-CCEEEEEcCCCEEEEEEccCChhhheecccc
Confidence            77766567999999999999999987643220   111222      2 4566666 58999999999999999987543


Q ss_pred             cce--ee-----eeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeeeeeecccCccCceEEEcCcEEEEEECCCCe
Q 003800          175 ESV--EV-----QQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHETAAFSGGFVGDVALVSSDTLVTLDTTRSI  247 (794)
Q Consensus       175 ~~~--~~-----~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~~~~s~~~~~vg~~~lv~~d~~~g~  247 (794)
                      +..  ..     ....+...++.+|+.+..|    .++|+|++||+.+|+.....+.    .+.+.++.+++ .+ .+|.
T Consensus       236 ~~~~~~~~~~~~~~~sP~v~~~~vy~~~~~g----~l~ald~~tG~~~W~~~~~~~~----~~~~~~~~vy~-~~-~~g~  305 (394)
T PRK11138        236 PTGATEIDRLVDVDTTPVVVGGVVYALAYNG----NLVALDLRSGQIVWKREYGSVN----DFAVDGGRIYL-VD-QNDR  305 (394)
T ss_pred             CCCccchhcccccCCCcEEECCEEEEEEcCC----eEEEEECCCCCEEEeecCCCcc----CcEEECCEEEE-Ec-CCCe
Confidence            310  00     0011124688999887666    8999999999999998654321    23333444444 43 3689


Q ss_pred             EEEEEeeccee
Q 003800          248 LVTVSFKNRKI  258 (794)
Q Consensus       248 L~v~~l~sg~~  258 (794)
                      ++++|..+|++
T Consensus       306 l~ald~~tG~~  316 (394)
T PRK11138        306 VYALDTRGGVE  316 (394)
T ss_pred             EEEEECCCCcE
Confidence            99999999873


No 5  
>PF13360 PQQ_2:  PQQ-like domain; PDB: 3HXJ_B 1YIQ_A 1KV9_A 3Q54_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A ....
Probab=99.74  E-value=9.3e-16  Score=159.57  Aligned_cols=216  Identities=19%  Similarity=0.301  Sum_probs=145.0

Q ss_pred             eecccccEeeEEeccCceeeeeeeeeccCCCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEc
Q 003800           24 YEDQVGLMDWHQQYIGKVKHAVFHTQKTGRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSS  103 (794)
Q Consensus        24 ~edqvG~~dW~~~~vG~~~~~~f~~~~~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~  103 (794)
                      +..+.|+..|+.++ +.........+...++++|+++.++.|+|+|++||+++|++.++..  +...+...++.+++.+.
T Consensus         8 ~d~~tG~~~W~~~~-~~~~~~~~~~~~~~~~~v~~~~~~~~l~~~d~~tG~~~W~~~~~~~--~~~~~~~~~~~v~v~~~   84 (238)
T PF13360_consen    8 LDPRTGKELWSYDL-GPGIGGPVATAVPDGGRVYVASGDGNLYALDAKTGKVLWRFDLPGP--ISGAPVVDGGRVYVGTS   84 (238)
T ss_dssp             EETTTTEEEEEEEC-SSSCSSEEETEEEETTEEEEEETTSEEEEEETTTSEEEEEEECSSC--GGSGEEEETTEEEEEET
T ss_pred             EECCCCCEEEEEEC-CCCCCCccceEEEeCCEEEEEcCCCEEEEEECCCCCEEEEeecccc--ccceeeecccccccccc
Confidence            45569999999987 4322111211223488999999999999999999999999999655  22222345555555454


Q ss_pred             cCCeEEEEeCCCCcEeEEE-eccCccccCCccccccccccccCCeEEEEE-CCEEEEEECCCCcEEEEEeccCcce-e--
Q 003800          104 DGSTLRAWNLPDGQMVWES-FLRGSKHSKPLLLVPTNLKVDKDSLILVSS-KGCLHAVSSIDGEILWTRDFAAESV-E--  178 (794)
Q Consensus       104 ~g~~v~A~d~~tG~llWe~-~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~-~g~l~ald~~tG~~~W~~~~~~~~~-~--  178 (794)
                       ++.++++|+.||+++|+. ....+..  .. ......... ++.+++.. ++.|+++|++||+++|+++...+.. .  
T Consensus        85 -~~~l~~~d~~tG~~~W~~~~~~~~~~--~~-~~~~~~~~~-~~~~~~~~~~g~l~~~d~~tG~~~w~~~~~~~~~~~~~  159 (238)
T PF13360_consen   85 -DGSLYALDAKTGKVLWSIYLTSSPPA--GV-RSSSSPAVD-GDRLYVGTSSGKLVALDPKTGKLLWKYPVGEPRGSSPI  159 (238)
T ss_dssp             -TSEEEEEETTTSCEEEEEEE-SSCTC--ST-B--SEEEEE-TTEEEEEETCSEEEEEETTTTEEEEEEESSTT-SS--E
T ss_pred             -eeeeEecccCCcceeeeecccccccc--cc-ccccCceEe-cCEEEEEeccCcEEEEecCCCcEEEEeecCCCCCCcce
Confidence             459999999999999995 4332221  10 000000222 56677765 9999999999999999998865331 1  


Q ss_pred             ------eeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeeeeeecccCccCceEEEcCcEEEEEECCCCeEEEEE
Q 003800          179 ------VQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHETAAFSGGFVGDVALVSSDTLVTLDTTRSILVTVS  252 (794)
Q Consensus       179 ------~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~~~~s~~~~~vg~~~lv~~d~~~g~L~v~~  252 (794)
                            ..++  ...++.+|+.+..|    .+.++|..+|+.+|+.....   .. ..+...++.+++.+ ..+.++++|
T Consensus       160 ~~~~~~~~~~--~~~~~~v~~~~~~g----~~~~~d~~tg~~~w~~~~~~---~~-~~~~~~~~~l~~~~-~~~~l~~~d  228 (238)
T PF13360_consen  160 SSFSDINGSP--VISDGRVYVSSGDG----RVVAVDLATGEKLWSKPISG---IY-SLPSVDGGTLYVTS-SDGRLYALD  228 (238)
T ss_dssp             EEETTEEEEE--ECCTTEEEEECCTS----SEEEEETTTTEEEEEECSS----EC-ECEECCCTEEEEEE-TTTEEEEEE
T ss_pred             eeecccccce--EEECCEEEEEcCCC----eEEEEECCCCCEEEEecCCC---cc-CCceeeCCEEEEEe-CCCEEEEEE
Confidence                  1122  24567899876666    48888999999999664222   11 22334556777777 579999999


Q ss_pred             eeccee
Q 003800          253 FKNRKI  258 (794)
Q Consensus       253 l~sg~~  258 (794)
                      +.+|++
T Consensus       229 ~~tG~~  234 (238)
T PF13360_consen  229 LKTGKV  234 (238)
T ss_dssp             TTTTEE
T ss_pred             CCCCCE
Confidence            999984


No 6  
>TIGR03300 assembly_YfgL outer membrane assembly lipoprotein YfgL. Members of this protein family are YfgL, a lipoprotein component of a complex that acts in protein insertion into the bacterial outer membrane. Other members of this complex are NlpB, YfiO, and YaeT. This protein contains multiple copies of a repeat that, in other contexts, are associated with binding of the coenzyme PQQ.
Probab=99.71  E-value=6.8e-15  Score=164.66  Aligned_cols=209  Identities=19%  Similarity=0.283  Sum_probs=144.4

Q ss_pred             cccccEeeEEeccCceeeeeeeeeccCCCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccC
Q 003800           26 DQVGLMDWHQQYIGKVKHAVFHTQKTGRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDG  105 (794)
Q Consensus        26 dqvG~~dW~~~~vG~~~~~~f~~~~~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g  105 (794)
                      .+.|+..|++++-+...    ..|..+++++|+++.+|.|+|||++||+++|+..+...  +... +...++.+++...+
T Consensus        82 ~~tG~~~W~~~~~~~~~----~~p~v~~~~v~v~~~~g~l~ald~~tG~~~W~~~~~~~--~~~~-p~v~~~~v~v~~~~  154 (377)
T TIGR03300        82 AETGKRLWRVDLDERLS----GGVGADGGLVFVGTEKGEVIALDAEDGKELWRAKLSSE--VLSP-PLVANGLVVVRTND  154 (377)
T ss_pred             ccCCcEeeeecCCCCcc----cceEEcCCEEEEEcCCCEEEEEECCCCcEeeeeccCce--eecC-CEEECCEEEEECCC
Confidence            46899999998755432    23445688999999999999999999999999988654  3322 23445567766656


Q ss_pred             CeEEEEeCCCCcEeEEEeccCcccc---CCccccccccccccCCeEEEE-ECCEEEEEECCCCcEEEEEeccCcce--ee
Q 003800          106 STLRAWNLPDGQMVWESFLRGSKHS---KPLLLVPTNLKVDKDSLILVS-SKGCLHAVSSIDGEILWTRDFAAESV--EV  179 (794)
Q Consensus       106 ~~v~A~d~~tG~llWe~~l~~~~~s---~~~~~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG~~~W~~~~~~~~~--~~  179 (794)
                      +.+++||+++|+++|+.....+...   ...+.      .. ++.+++. .+|.++++|..+|+.+|+.+...+..  ..
T Consensus       155 g~l~a~d~~tG~~~W~~~~~~~~~~~~~~~sp~------~~-~~~v~~~~~~g~v~ald~~tG~~~W~~~~~~~~g~~~~  227 (377)
T TIGR03300       155 GRLTALDAATGERLWTYSRVTPALTLRGSASPV------IA-DGGVLVGFAGGKLVALDLQTGQPLWEQRVALPKGRTEL  227 (377)
T ss_pred             CeEEEEEcCCCceeeEEccCCCceeecCCCCCE------EE-CCEEEEECCCCEEEEEEccCCCEeeeeccccCCCCCch
Confidence            7999999999999999987654320   01111      12 4556555 47999999999999999986543210  00


Q ss_pred             -----eeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeeeeeecccCccCceEEEcCcEEEEEECCCCeEEEEEee
Q 003800          180 -----QQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHETAAFSGGFVGDVALVSSDTLVTLDTTRSILVTVSFK  254 (794)
Q Consensus       180 -----~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~~~~s~~~~~vg~~~lv~~d~~~g~L~v~~l~  254 (794)
                           ....+...++.+|+.+..|    .++|+|++||+.+|+......    ..+.+.++.+++ .+ .+|.++++|..
T Consensus       228 ~~~~~~~~~p~~~~~~vy~~~~~g----~l~a~d~~tG~~~W~~~~~~~----~~p~~~~~~vyv-~~-~~G~l~~~d~~  297 (377)
T TIGR03300       228 ERLVDVDGDPVVDGGQVYAVSYQG----RVAALDLRSGRVLWKRDASSY----QGPAVDDNRLYV-TD-ADGVVVALDRR  297 (377)
T ss_pred             hhhhccCCccEEECCEEEEEEcCC----EEEEEECCCCcEEEeeccCCc----cCceEeCCEEEE-EC-CCCeEEEEECC
Confidence                 0001123578999877666    799999999999999863221    123333434444 43 46899999998


Q ss_pred             ccee
Q 003800          255 NRKI  258 (794)
Q Consensus       255 sg~~  258 (794)
                      +|++
T Consensus       298 tG~~  301 (377)
T TIGR03300       298 SGSE  301 (377)
T ss_pred             CCcE
Confidence            8873


No 7  
>cd00216 PQQ_DH Dehydrogenases with pyrrolo-quinoline quinone (PQQ) as cofactor, like ethanol, methanol, and membrane bound glucose dehydrogenases. The alignment model contains an 8-bladed beta-propeller.
Probab=99.65  E-value=1.1e-14  Score=168.43  Aligned_cols=220  Identities=15%  Similarity=0.158  Sum_probs=141.6

Q ss_pred             ccccEeeEEeccCceeeeeeeeeccCCCEEEEEeCCCEEEEEECcCCccceEEEcCccc-----ceeeeeeeeCC-EEEE
Q 003800           27 QVGLMDWHQQYIGKVKHAVFHTQKTGRKRVVVSTEENVIASLDLRHGEIFWRHVLGIND-----VVDGIDIALGK-YVIT  100 (794)
Q Consensus        27 qvG~~dW~~~~vG~~~~~~f~~~~~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~-----~i~~l~~~~g~-~~V~  100 (794)
                      +.+++.|+.+. |.. ......|...+++||+++.++.|+|||++||+++|++.+....     .+..-.+...+ +.|+
T Consensus        37 ~~~~~~W~~~~-~~~-~~~~~sPvv~~g~vy~~~~~g~l~AlD~~tG~~~W~~~~~~~~~~~~~~~~~~g~~~~~~~~V~  114 (488)
T cd00216          37 KKLKVAWTFST-GDE-RGQEGTPLVVDGDMYFTTSHSALFALDAATGKVLWRYDPKLPADRGCCDVVNRGVAYWDPRKVF  114 (488)
T ss_pred             hcceeeEEEEC-CCC-CCcccCCEEECCEEEEeCCCCcEEEEECCCChhhceeCCCCCccccccccccCCcEEccCCeEE
Confidence            45779999987 310 0112234445899999999999999999999999999886541     00000112334 7788


Q ss_pred             EEccCCeEEEEeCCCCcEeEEEeccCcc-----ccCCccccccccccccCCeEEEEE----------CCEEEEEECCCCc
Q 003800          101 LSSDGSTLRAWNLPDGQMVWESFLRGSK-----HSKPLLLVPTNLKVDKDSLILVSS----------KGCLHAVSSIDGE  165 (794)
Q Consensus       101 Vs~~g~~v~A~d~~tG~llWe~~l~~~~-----~s~~~~~~~~~~~~~~~~~V~V~~----------~g~l~ald~~tG~  165 (794)
                      ++..++.|+|+|++||+++|+.......     . ...+.+      . ++.+++.+          +|.|+|||+.||+
T Consensus       115 v~~~~g~v~AlD~~TG~~~W~~~~~~~~~~~~~i-~ssP~v------~-~~~v~vg~~~~~~~~~~~~g~v~alD~~TG~  186 (488)
T cd00216         115 FGTFDGRLVALDAETGKQVWKFGNNDQVPPGYTM-TGAPTI------V-KKLVIIGSSGAEFFACGVRGALRAYDVETGK  186 (488)
T ss_pred             EecCCCeEEEEECCCCCEeeeecCCCCcCcceEe-cCCCEE------E-CCEEEEeccccccccCCCCcEEEEEECCCCc
Confidence            8776789999999999999999987642     1 112222      2 46666643          4789999999999


Q ss_pred             EEEEEeccCcc-eee------------------eeEEEEecCCEEEEEEecCC--------------ceeEEEEEEcCCC
Q 003800          166 ILWTRDFAAES-VEV------------------QQVIQLDESDQIYVVGYAGS--------------SQFHAYQINAMNG  212 (794)
Q Consensus       166 ~~W~~~~~~~~-~~~------------------~~~v~s~~~~~vyv~~~~g~--------------~~~~v~ald~~tG  212 (794)
                      ++|+++...+. ...                  .........+.||+.+..+.              ..-.++|||++||
T Consensus       187 ~~W~~~~~~~~~~~~~~~~~~~~~~~~~g~~vw~~pa~d~~~g~V~vg~~~g~~~~~~~~~~~~~~~~~~~l~Ald~~tG  266 (488)
T cd00216         187 LLWRFYTTEPDPNAFPTWGPDRQMWGPGGGTSWASPTYDPKTNLVYVGTGNGSPWNWGGRRTPGDNLYTDSIVALDADTG  266 (488)
T ss_pred             eeeEeeccCCCcCCCCCCCCCcceecCCCCCccCCeeEeCCCCEEEEECCCCCCCccCCccCCCCCCceeeEEEEcCCCC
Confidence            99999774221 000                  01111124678887643320              1237999999999


Q ss_pred             ceeeeeeeeccc----CccCceEEE-----cCc---EEEEEECCCCeEEEEEeecce
Q 003800          213 ELLNHETAAFSG----GFVGDVALV-----SSD---TLVTLDTTRSILVTVSFKNRK  257 (794)
Q Consensus       213 ~~~w~~~v~~~~----~~s~~~~~v-----g~~---~lv~~d~~~g~L~v~~l~sg~  257 (794)
                      +++|+.+...+.    .....+.+.     .+.   ++++.. .+|.++++|..+|+
T Consensus       267 ~~~W~~~~~~~~~~~~~~~s~p~~~~~~~~~g~~~~~V~~g~-~~G~l~ald~~tG~  322 (488)
T cd00216         267 KVKWFYQTTPHDLWDYDGPNQPSLADIKPKDGKPVPAIVHAP-KNGFFYVLDRTTGK  322 (488)
T ss_pred             CEEEEeeCCCCCCcccccCCCCeEEeccccCCCeeEEEEEEC-CCceEEEEECCCCc
Confidence            999999764331    111122222     111   334443 56889999999998


No 8  
>cd00216 PQQ_DH Dehydrogenases with pyrrolo-quinoline quinone (PQQ) as cofactor, like ethanol, methanol, and membrane bound glucose dehydrogenases. The alignment model contains an 8-bladed beta-propeller.
Probab=99.65  E-value=1.9e-14  Score=166.58  Aligned_cols=230  Identities=10%  Similarity=0.158  Sum_probs=143.9

Q ss_pred             ecccccEeeEEeccCcee--eeeeeeeccCCCEEEEEeC---------CCEEEEEECcCCccceEEEcCcccc-------
Q 003800           25 EDQVGLMDWHQQYIGKVK--HAVFHTQKTGRKRVVVSTE---------ENVIASLDLRHGEIFWRHVLGINDV-------   86 (794)
Q Consensus        25 edqvG~~dW~~~~vG~~~--~~~f~~~~~~~~~Vyv~t~---------~g~l~ALn~~tG~ivWR~~l~~~~~-------   86 (794)
                      ..+.|+..|++++-+...  ...-..|...++.+|+++.         .|.|+|||++||+++|++.+.....       
T Consensus       126 D~~TG~~~W~~~~~~~~~~~~~i~ssP~v~~~~v~vg~~~~~~~~~~~~g~v~alD~~TG~~~W~~~~~~~~~~~~~~~~  205 (488)
T cd00216         126 DAETGKQVWKFGNNDQVPPGYTMTGAPTIVKKLVIIGSSGAEFFACGVRGALRAYDVETGKLLWRFYTTEPDPNAFPTWG  205 (488)
T ss_pred             ECCCCCEeeeecCCCCcCcceEecCCCEEECCEEEEeccccccccCCCCcEEEEEECCCCceeeEeeccCCCcCCCCCCC
Confidence            456899999998755421  1111224444788998874         5789999999999999998853210       


Q ss_pred             ------------eeeeeeee--CCEEEEEEccC------------------CeEEEEeCCCCcEeEEEeccCccc----c
Q 003800           87 ------------VDGIDIAL--GKYVITLSSDG------------------STLRAWNLPDGQMVWESFLRGSKH----S  130 (794)
Q Consensus        87 ------------i~~l~~~~--g~~~V~Vs~~g------------------~~v~A~d~~tG~llWe~~l~~~~~----s  130 (794)
                                  +-.. ++.  .+++|+++..+                  +.|+|+|++||+++|+.+......    .
T Consensus       206 ~~~~~~~~~g~~vw~~-pa~d~~~g~V~vg~~~g~~~~~~~~~~~~~~~~~~~l~Ald~~tG~~~W~~~~~~~~~~~~~~  284 (488)
T cd00216         206 PDRQMWGPGGGTSWAS-PTYDPKTNLVYVGTGNGSPWNWGGRRTPGDNLYTDSIVALDADTGKVKWFYQTTPHDLWDYDG  284 (488)
T ss_pred             CCcceecCCCCCccCC-eeEeCCCCEEEEECCCCCCCccCCccCCCCCCceeeEEEEcCCCCCEEEEeeCCCCCCccccc
Confidence                        0011 122  45778886533                  279999999999999998653211    0


Q ss_pred             CCccccccccc-cccC--CeEEEE-ECCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEec---------
Q 003800          131 KPLLLVPTNLK-VDKD--SLILVS-SKGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYA---------  197 (794)
Q Consensus       131 ~~~~~~~~~~~-~~~~--~~V~V~-~~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~---------  197 (794)
                      ...+.+. ... .++.  ..|++. .+|.|+|||++||+++|+.+......       +..++.||+.+..         
T Consensus       285 ~s~p~~~-~~~~~~g~~~~~V~~g~~~G~l~ald~~tG~~~W~~~~~~~~~-------~~~~~~vyv~~~~~~~~~~~~~  356 (488)
T cd00216         285 PNQPSLA-DIKPKDGKPVPAIVHAPKNGFFYVLDRTTGKLISARPEVEQPM-------AYDPGLVYLGAFHIPLGLPPQK  356 (488)
T ss_pred             CCCCeEE-eccccCCCeeEEEEEECCCceEEEEECCCCcEeeEeEeecccc-------ccCCceEEEccccccccCcccc
Confidence            1111111 000 1111  124444 48999999999999999987642111       2345778874321         


Q ss_pred             -----CCceeEEEEEEcCCCceeeeeeeecc-------cCccCceEEEcCcEEEEEECCCCeEEEEEeecceeeeEEEee
Q 003800          198 -----GSSQFHAYQINAMNGELLNHETAAFS-------GGFVGDVALVSSDTLVTLDTTRSILVTVSFKNRKIAFQETHL  265 (794)
Q Consensus       198 -----g~~~~~v~ald~~tG~~~w~~~v~~~-------~~~s~~~~~vg~~~lv~~d~~~g~L~v~~l~sg~~~~~~~~l  265 (794)
                           ......++|||+.||+.+|+......       .......+.+.++.+++.+ .+|.|+++|..+|++ +-+..+
T Consensus       357 ~~~~~~~~~G~l~AlD~~tG~~~W~~~~~~~~~~~~~g~~~~~~~~~~~g~~v~~g~-~dG~l~ald~~tG~~-lW~~~~  434 (488)
T cd00216         357 KKRCKKPGKGGLAALDPKTGKVVWEKREGTIRDSWNIGFPHWGGSLATAGNLVFAGA-ADGYFRAFDATTGKE-LWKFRT  434 (488)
T ss_pred             cCCCCCCCceEEEEEeCCCCcEeeEeeCCccccccccCCcccCcceEecCCeEEEEC-CCCeEEEEECCCCce-eeEEEC
Confidence                 01234899999999999999976511       1111223334556665665 468999999999984 333444


No 9  
>PF13360 PQQ_2:  PQQ-like domain; PDB: 3HXJ_B 1YIQ_A 1KV9_A 3Q54_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A ....
Probab=99.61  E-value=5.6e-14  Score=146.20  Aligned_cols=181  Identities=19%  Similarity=0.285  Sum_probs=119.4

Q ss_pred             CCCEEEEEECcCCccceEEEcCccc-ceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCCccccccc
Q 003800           61 EENVIASLDLRHGEIFWRHVLGIND-VVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTN  139 (794)
Q Consensus        61 ~~g~l~ALn~~tG~ivWR~~l~~~~-~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~  139 (794)
                      ++|.|.|+|++||+++|+..++... ..... +...++.++++..++.|++||+.||+++|+..+..+.. . .+..   
T Consensus         1 ~~g~l~~~d~~tG~~~W~~~~~~~~~~~~~~-~~~~~~~v~~~~~~~~l~~~d~~tG~~~W~~~~~~~~~-~-~~~~---   74 (238)
T PF13360_consen    1 DDGTLSALDPRTGKELWSYDLGPGIGGPVAT-AVPDGGRVYVASGDGNLYALDAKTGKVLWRFDLPGPIS-G-APVV---   74 (238)
T ss_dssp             -TSEEEEEETTTTEEEEEEECSSSCSSEEET-EEEETTEEEEEETTSEEEEEETTTSEEEEEEECSSCGG-S-GEEE---
T ss_pred             CCCEEEEEECCCCCEEEEEECCCCCCCccce-EEEeCCEEEEEcCCCEEEEEECCCCCEEEEeecccccc-c-eeee---
Confidence            4789999999999999999995431 11111 23355566666556799999999999999999965433 1 1222   


Q ss_pred             cccccCCeEEEEE-CCEEEEEECCCCcEEEEE-eccCccee-eeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceee
Q 003800          140 LKVDKDSLILVSS-KGCLHAVSSIDGEILWTR-DFAAESVE-VQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLN  216 (794)
Q Consensus       140 ~~~~~~~~V~V~~-~g~l~ald~~tG~~~W~~-~~~~~~~~-~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w  216 (794)
                         . ++.+++.. ++.|+++|..||+++|+. ....+... .........++.+|+....|    .++++|++||+++|
T Consensus        75 ---~-~~~v~v~~~~~~l~~~d~~tG~~~W~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~g----~l~~~d~~tG~~~w  146 (238)
T PF13360_consen   75 ---D-GGRVYVGTSDGSLYALDAKTGKVLWSIYLTSSPPAGVRSSSSPAVDGDRLYVGTSSG----KLVALDPKTGKLLW  146 (238)
T ss_dssp             ---E-TTEEEEEETTSEEEEEETTTSCEEEEEEE-SSCTCSTB--SEEEEETTEEEEEETCS----EEEEEETTTTEEEE
T ss_pred             ---c-ccccccccceeeeEecccCCcceeeeeccccccccccccccCceEecCEEEEEeccC----cEEEEecCCCcEEE
Confidence               2 67788875 789999999999999994 54422211 11111123577787655455    89999999999999


Q ss_pred             eeeeecccCcc---------CceEEEcCcEEEEEECCCCeEEEEEeecce
Q 003800          217 HETAAFSGGFV---------GDVALVSSDTLVTLDTTRSILVTVSFKNRK  257 (794)
Q Consensus       217 ~~~v~~~~~~s---------~~~~~vg~~~lv~~d~~~g~L~v~~l~sg~  257 (794)
                      +..+..+....         +.+++.++ .++..+ ..+.+..+|+.+|+
T Consensus       147 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~v~~~~-~~g~~~~~d~~tg~  194 (238)
T PF13360_consen  147 KYPVGEPRGSSPISSFSDINGSPVISDG-RVYVSS-GDGRVVAVDLATGE  194 (238)
T ss_dssp             EEESSTT-SS--EEEETTEEEEEECCTT-EEEEEC-CTSSEEEEETTTTE
T ss_pred             EeecCCCCCCcceeeecccccceEEECC-EEEEEc-CCCeEEEEECCCCC
Confidence            99885543221         23333333 333333 34555666999887


No 10 
>TIGR03075 PQQ_enz_alc_DH PQQ-dependent dehydrogenase, methanol/ethanol family. This protein family has a phylogenetic distribution very similar to that of coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Genes in this family often are found adjacent to the PQQ biosynthesis genes themselves. An unusual, strained disulfide bond between adjacent Cys residues contributes to PQQ-binding, as does a Trp residue that is part of a PQQ enzyme repeat (see pfam01011). Characterized members include the dehydrogenase subunit of a membrane-anchored, three subunit alcohol (ethanol) dehydrogenase of Gluconobacter suboxydans, a homodimeric ethanol dehydrogenase in Pseudomonas aeruginosa, and the large subunit of an alpha2/beta2 heterotetrameric methanol dehydrogenase in Methylobacterium extorquens.
Probab=99.51  E-value=1e-12  Score=152.81  Aligned_cols=219  Identities=17%  Similarity=0.194  Sum_probs=139.4

Q ss_pred             ccEeeEEeccCceeeeeeeeeccCCCEEEEEeCCCEEEEEECcCCccceEEEcCcccceee---e-----eeeeCCEEEE
Q 003800           29 GLMDWHQQYIGKVKHAVFHTQKTGRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDG---I-----DIALGKYVIT  100 (794)
Q Consensus        29 G~~dW~~~~vG~~~~~~f~~~~~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~---l-----~~~~g~~~V~  100 (794)
                      .++.|+.++ |... ....+|...+++||+++..|.|+|||++||+++|++.......+..   .     .++..++.|+
T Consensus        47 L~~~W~~~~-g~~~-g~~stPvv~~g~vyv~s~~g~v~AlDa~TGk~lW~~~~~~~~~~~~~~~~~~~~rg~av~~~~v~  124 (527)
T TIGR03075        47 LQPAWTFSL-GKLR-GQESQPLVVDGVMYVTTSYSRVYALDAKTGKELWKYDPKLPDDVIPVMCCDVVNRGVALYDGKVF  124 (527)
T ss_pred             ceEEEEEEC-CCCC-CcccCCEEECCEEEEECCCCcEEEEECCCCceeeEecCCCCcccccccccccccccceEECCEEE
Confidence            347799887 4221 1122344558999999999999999999999999998754321110   0     0234456777


Q ss_pred             EEccCCeEEEEeCCCCcEeEEEeccCccc---cCCccccccccccccCCeEEEEE-------CCEEEEEECCCCcEEEEE
Q 003800          101 LSSDGSTLRAWNLPDGQMVWESFLRGSKH---SKPLLLVPTNLKVDKDSLILVSS-------KGCLHAVSSIDGEILWTR  170 (794)
Q Consensus       101 Vs~~g~~v~A~d~~tG~llWe~~l~~~~~---s~~~~~~~~~~~~~~~~~V~V~~-------~g~l~ald~~tG~~~W~~  170 (794)
                      +++.++.|+|+|+.||+++|+........   ..+.+++      . ++.|++..       +|.|+|+|++||+++|++
T Consensus       125 v~t~dg~l~ALDa~TGk~~W~~~~~~~~~~~~~tssP~v------~-~g~Vivg~~~~~~~~~G~v~AlD~~TG~~lW~~  197 (527)
T TIGR03075       125 FGTLDARLVALDAKTGKVVWSKKNGDYKAGYTITAAPLV------V-KGKVITGISGGEFGVRGYVTAYDAKTGKLVWRR  197 (527)
T ss_pred             EEcCCCEEEEEECCCCCEEeecccccccccccccCCcEE------E-CCEEEEeecccccCCCcEEEEEECCCCceeEec
Confidence            77666799999999999999998743211   0112222      2 56777753       589999999999999998


Q ss_pred             eccCcce------------ee------------------eeEEEEecCCEEEEEEec-----CC-------ceeEEEEEE
Q 003800          171 DFAAESV------------EV------------------QQVIQLDESDQIYVVGYA-----GS-------SQFHAYQIN  208 (794)
Q Consensus       171 ~~~~~~~------------~~------------------~~~v~s~~~~~vyv~~~~-----g~-------~~~~v~ald  208 (794)
                      ....+.-            .+                  ..+..-...+.||+....     +.       +.-.++|||
T Consensus       198 ~~~p~~~~~~~~~~~~~~~~~~~~tw~~~~~~~gg~~~W~~~s~D~~~~lvy~~tGnp~p~~~~~r~gdnl~~~s~vAld  277 (527)
T TIGR03075       198 YTVPGDMGYLDKADKPVGGEPGAKTWPGDAWKTGGGATWGTGSYDPETNLIYFGTGNPSPWNSHLRPGDNLYTSSIVARD  277 (527)
T ss_pred             cCcCCCcccccccccccccccccCCCCCCccccCCCCccCceeEcCCCCeEEEeCCCCCCCCCCCCCCCCccceeEEEEc
Confidence            6632110            00                  001100124577765422     11       123799999


Q ss_pred             cCCCceeeeeeeecc--cCc--cCceEEE----cCc---EEEEEECCCCeEEEEEeecce
Q 003800          209 AMNGELLNHETAAFS--GGF--VGDVALV----SSD---TLVTLDTTRSILVTVSFKNRK  257 (794)
Q Consensus       209 ~~tG~~~w~~~v~~~--~~~--s~~~~~v----g~~---~lv~~d~~~g~L~v~~l~sg~  257 (794)
                      ++||+.+|.++..-.  ++.  ...++++    ++.   .++..+ .+|.++++|-.+|+
T Consensus       278 ~~TG~~~W~~Q~~~~D~wD~d~~~~p~l~d~~~~G~~~~~v~~~~-K~G~~~vlDr~tG~  336 (527)
T TIGR03075       278 PDTGKIKWHYQTTPHDEWDYDGVNEMILFDLKKDGKPRKLLAHAD-RNGFFYVLDRTNGK  336 (527)
T ss_pred             cccCCEEEeeeCCCCCCccccCCCCcEEEEeccCCcEEEEEEEeC-CCceEEEEECCCCc
Confidence            999999999985221  222  2233433    222   444554 57999999999987


No 11 
>COG1520 FOG: WD40-like repeat [Function unknown]
Probab=99.51  E-value=1.4e-12  Score=145.89  Aligned_cols=216  Identities=21%  Similarity=0.249  Sum_probs=148.3

Q ss_pred             cccccEeeEEeccCceeeeeeeee--ccCCCEEEEEeCCCEEEEEECcCCccceEEEcCc-ccceeeeeeeeCCEEEEEE
Q 003800           26 DQVGLMDWHQQYIGKVKHAVFHTQ--KTGRKRVVVSTEENVIASLDLRHGEIFWRHVLGI-NDVVDGIDIALGKYVITLS  102 (794)
Q Consensus        26 dqvG~~dW~~~~vG~~~~~~f~~~--~~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~-~~~i~~l~~~~g~~~V~Vs  102 (794)
                      ...|...|.... +......+..|  ...+++||+.+.+|.|.|+|+.+|+++|+..+.. ...+.+. +...++.++++
T Consensus        40 ~~~g~~~W~~~~-~~~~~~~~~~~~~~~~dg~v~~~~~~G~i~A~d~~~g~~~W~~~~~~~~~~~~~~-~~~~~G~i~~g  117 (370)
T COG1520          40 NTSGTLLWSVSL-GSGGGGIYAGPAPADGDGTVYVGTRDGNIFALNPDTGLVKWSYPLLGAVAQLSGP-ILGSDGKIYVG  117 (370)
T ss_pred             ccCcceeeeeec-ccCccceEeccccEeeCCeEEEecCCCcEEEEeCCCCcEEecccCcCcceeccCc-eEEeCCeEEEe
Confidence            445888897653 22222233334  5669999999999999999999999999998875 2112222 23446678888


Q ss_pred             ccCCeEEEEeCCCCcEeEEEeccC-ccccCCccccccccccccCCeEEEE-ECCEEEEEECCCCcEEEEEeccCc-cee-
Q 003800          103 SDGSTLRAWNLPDGQMVWESFLRG-SKHSKPLLLVPTNLKVDKDSLILVS-SKGCLHAVSSIDGEILWTRDFAAE-SVE-  178 (794)
Q Consensus       103 ~~g~~v~A~d~~tG~llWe~~l~~-~~~s~~~~~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG~~~W~~~~~~~-~~~-  178 (794)
                      ...+.++++|+.||+++|+..... ... ...+++       .++.|++. .+|.++++|+.||.++|+++.+.+ ... 
T Consensus       118 ~~~g~~y~ld~~~G~~~W~~~~~~~~~~-~~~~v~-------~~~~v~~~s~~g~~~al~~~tG~~~W~~~~~~~~~~~~  189 (370)
T COG1520         118 SWDGKLYALDASTGTLVWSRNVGGSPYY-ASPPVV-------GDGTVYVGTDDGHLYALNADTGTLKWTYETPAPLSLSI  189 (370)
T ss_pred             cccceEEEEECCCCcEEEEEecCCCeEE-ecCcEE-------cCcEEEEecCCCeEEEEEccCCcEEEEEecCCcccccc
Confidence            776799999999999999999987 222 122222       26777777 489999999999999999988763 111 


Q ss_pred             eeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeeeeeecccC---------ccCceEEEcCcEEEEEECCCCeEE
Q 003800          179 VQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHETAAFSGG---------FVGDVALVSSDTLVTLDTTRSILV  249 (794)
Q Consensus       179 ~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~~~---------~s~~~~~vg~~~lv~~d~~~g~L~  249 (794)
                      ....  ...++.+|+.... . ...++++|+.+|+..|+.+...+.+         +....+++++++  |.-..++.+.
T Consensus       190 ~~~~--~~~~~~vy~~~~~-~-~~~~~a~~~~~G~~~w~~~~~~~~~~~~~~~~~~~~~~~v~v~~~~--~~~~~~g~~~  263 (370)
T COG1520         190 YGSP--AIASGTVYVGSDG-Y-DGILYALNAEDGTLKWSQKVSQTIGRTAISTTPAVDGGPVYVDGGV--YAGSYGGKLL  263 (370)
T ss_pred             ccCc--eeecceEEEecCC-C-cceEEEEEccCCcEeeeeeeecccCcccccccccccCceEEECCcE--EEEecCCeEE
Confidence            1111  2468888875542 1 2289999999999999975544322         222344455554  2333456788


Q ss_pred             EEEeecce
Q 003800          250 TVSFKNRK  257 (794)
Q Consensus       250 v~~l~sg~  257 (794)
                      .++..+|+
T Consensus       264 ~l~~~~G~  271 (370)
T COG1520         264 CLDADTGE  271 (370)
T ss_pred             EEEcCCCc
Confidence            88888887


No 12 
>TIGR03074 PQQ_membr_DH membrane-bound PQQ-dependent dehydrogenase, glucose/quinate/shikimate family. This protein family has a phylogenetic distribution very similar to that of coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Members of this family have several predicted transmembrane helices in the N-terminal region, and include the quinoprotein glucose dehydrogenase (EC 1.1.5.2) of Escherichia coli and the quinate/shikimate dehydrogenase of Acinetobacter sp. ADP1 (EC 1.1.99.25). Sequences closely related except for the absence of the N-terminal hydrophobic region, scoring in the gray zone between the trusted and noise cutoffs, include PQQ-dependent glycerol (EC 1.1.99.22) and other polyol (sugar alcohol) dehydrogenases.
Probab=99.51  E-value=1.7e-12  Score=155.46  Aligned_cols=202  Identities=14%  Similarity=0.160  Sum_probs=130.1

Q ss_pred             eccCCCEEEEEeCCCEEEEEECcCCccceEEEcCcccce-------eeee----------------eeeCCEEEEEEccC
Q 003800           49 QKTGRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVV-------DGID----------------IALGKYVITLSSDG  105 (794)
Q Consensus        49 ~~~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i-------~~l~----------------~~~g~~~V~Vs~~g  105 (794)
                      |...+++||+.|..|.|+|||++||+++||+........       .++.                +...++.|++++.+
T Consensus       190 Plvvgg~lYv~t~~~~V~ALDa~TGk~lW~~d~~~~~~~~~~~~~cRGvay~~~p~~~~~~~~~~~p~~~~~rV~~~T~D  269 (764)
T TIGR03074       190 PLKVGDTLYLCTPHNKVIALDAATGKEKWKFDPKLKTEAGRQHQTCRGVSYYDAPAAAAGPAAPAAPADCARRIILPTSD  269 (764)
T ss_pred             CEEECCEEEEECCCCeEEEEECCCCcEEEEEcCCCCcccccccccccceEEecCCcccccccccccccccCCEEEEecCC
Confidence            445589999999999999999999999999988654210       0110                11234577777767


Q ss_pred             CeEEEEeCCCCcEeEEEeccCccc--------------cCCccccccccccccCCeEEEEE-----------CCEEEEEE
Q 003800          106 STLRAWNLPDGQMVWESFLRGSKH--------------SKPLLLVPTNLKVDKDSLILVSS-----------KGCLHAVS  160 (794)
Q Consensus       106 ~~v~A~d~~tG~llWe~~l~~~~~--------------s~~~~~~~~~~~~~~~~~V~V~~-----------~g~l~ald  160 (794)
                      ++|+|+|+.||+++|++...+...              ..+.+++      . ++.|++..           +|.|+|+|
T Consensus       270 g~LiALDA~TGk~~W~fg~~G~vdl~~~~g~~~~g~~~~ts~P~V------~-~g~VIvG~~v~d~~~~~~~~G~I~A~D  342 (764)
T TIGR03074       270 ARLIALDADTGKLCEDFGNNGTVDLTAGMGTTPPGYYYPTSPPLV------A-GTTVVIGGRVADNYSTDEPSGVIRAFD  342 (764)
T ss_pred             CeEEEEECCCCCEEEEecCCCceeeecccCcCCCcccccccCCEE------E-CCEEEEEecccccccccCCCcEEEEEE
Confidence            899999999999999876543210              0111222      2 56777752           58899999


Q ss_pred             CCCCcEEEEEeccCccee--------e--------eeEEEEecCCEEEEEEec------C--------CceeEEEEEEcC
Q 003800          161 SIDGEILWTRDFAAESVE--------V--------QQVIQLDESDQIYVVGYA------G--------SSQFHAYQINAM  210 (794)
Q Consensus       161 ~~tG~~~W~~~~~~~~~~--------~--------~~~v~s~~~~~vyv~~~~------g--------~~~~~v~ald~~  210 (794)
                      +.||+++|++....+...        .        .....-...+.+|+-...      |        .+.-.++|||++
T Consensus       343 a~TGkl~W~~~~g~p~~~~~~~~g~~~~~gg~n~W~~~s~D~~~glvy~ptGn~~pd~~g~~r~~~~n~y~~slvALD~~  422 (764)
T TIGR03074       343 VNTGALVWAWDPGNPDPTAPPAPGETYTRNTPNSWSVASYDEKLGLVYLPMGNQTPDQWGGDRTPADEKYSSSLVALDAT  422 (764)
T ss_pred             CCCCcEeeEEecCCCCcccCCCCCCEeccCCCCccCceEEcCCCCeEEEeCCCccccccCCccccCcccccceEEEEeCC
Confidence            999999999986422110        0        001101223556652210      1        123579999999


Q ss_pred             CCceeeeeeeecc----cCccCceEEEc----Cc----EEEEEECCCCeEEEEEeeccee
Q 003800          211 NGELLNHETAAFS----GGFVGDVALVS----SD----TLVTLDTTRSILVTVSFKNRKI  258 (794)
Q Consensus       211 tG~~~w~~~v~~~----~~~s~~~~~vg----~~----~lv~~d~~~g~L~v~~l~sg~~  258 (794)
                      ||+.+|+++..-.    .++...++++.    ++    .++..+ .+|.++++|-++|+.
T Consensus       423 TGk~~W~~Q~~~hD~WD~D~~~~p~L~d~~~~~G~~~~~v~~~~-K~G~~~vlDr~tG~~  481 (764)
T TIGR03074       423 TGKERWVFQTVHHDLWDMDVPAQPSLVDLPDADGTTVPALVAPT-KQGQIYVLDRRTGEP  481 (764)
T ss_pred             CCceEEEecccCCccccccccCCceEEeeecCCCcEeeEEEEEC-CCCEEEEEECCCCCE
Confidence            9999999975221    12222344331    22    555555 579999999999883


No 13 
>COG1520 FOG: WD40-like repeat [Function unknown]
Probab=99.30  E-value=2.4e-10  Score=127.86  Aligned_cols=186  Identities=19%  Similarity=0.334  Sum_probs=124.1

Q ss_pred             eecccccEeeEEeccCceeeeeeeee-ccCCCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEE
Q 003800           24 YEDQVGLMDWHQQYIGKVKHAVFHTQ-KTGRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLS  102 (794)
Q Consensus        24 ~edqvG~~dW~~~~vG~~~~~~f~~~-~~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs  102 (794)
                      +..+.|+..|+..+.+..  ..+..| ...+++||+++.+|.++|||++||+++|++..+....... ++..+++.|++.
T Consensus        83 ~d~~~g~~~W~~~~~~~~--~~~~~~~~~~~G~i~~g~~~g~~y~ld~~~G~~~W~~~~~~~~~~~~-~~v~~~~~v~~~  159 (370)
T COG1520          83 LNPDTGLVKWSYPLLGAV--AQLSGPILGSDGKIYVGSWDGKLYALDASTGTLVWSRNVGGSPYYAS-PPVVGDGTVYVG  159 (370)
T ss_pred             EeCCCCcEEecccCcCcc--eeccCceEEeCCeEEEecccceEEEEECCCCcEEEEEecCCCeEEec-CcEEcCcEEEEe
Confidence            446678888999987611  112222 1237889999999999999999999999999987100112 235677888877


Q ss_pred             ccCCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEEE---CCEEEEEECCCCcEEEEEeccCccee-
Q 003800          103 SDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSS---KGCLHAVSSIDGEILWTRDFAAESVE-  178 (794)
Q Consensus       103 ~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~---~g~l~ald~~tG~~~W~~~~~~~~~~-  178 (794)
                      +..++++++|+.||+++|+.....+ ...  .....  ....++.+++..   ++.++|+|+.+|..+|+.+...+... 
T Consensus       160 s~~g~~~al~~~tG~~~W~~~~~~~-~~~--~~~~~--~~~~~~~vy~~~~~~~~~~~a~~~~~G~~~w~~~~~~~~~~~  234 (370)
T COG1520         160 TDDGHLYALNADTGTLKWTYETPAP-LSL--SIYGS--PAIASGTVYVGSDGYDGILYALNAEDGTLKWSQKVSQTIGRT  234 (370)
T ss_pred             cCCCeEEEEEccCCcEEEEEecCCc-ccc--ccccC--ceeecceEEEecCCCcceEEEEEccCCcEeeeeeeecccCcc
Confidence            5557999999999999999888653 201  11110  112256667763   45899999999999999643221110 


Q ss_pred             -e--eeEE---EEecCCEEEEEEecCCceeEEEEEEcCCCceeeeeeee
Q 003800          179 -V--QQVI---QLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHETAA  221 (794)
Q Consensus       179 -~--~~~v---~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~  221 (794)
                       .  .+.+   ....++.+|..+..|    +++|+|+.+|+.+|+....
T Consensus       235 ~~~~~~~~~~~~v~v~~~~~~~~~~g----~~~~l~~~~G~~~W~~~~~  279 (370)
T COG1520         235 AISTTPAVDGGPVYVDGGVYAGSYGG----KLLCLDADTGELIWSFPAG  279 (370)
T ss_pred             cccccccccCceEEECCcEEEEecCC----eEEEEEcCCCceEEEEecc
Confidence             0  0110   012344555544444    7999999999999999754


No 14 
>TIGR03074 PQQ_membr_DH membrane-bound PQQ-dependent dehydrogenase, glucose/quinate/shikimate family. This protein family has a phylogenetic distribution very similar to that of coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Members of this family have several predicted transmembrane helices in the N-terminal region, and include the quinoprotein glucose dehydrogenase (EC 1.1.5.2) of Escherichia coli and the quinate/shikimate dehydrogenase of Acinetobacter sp. ADP1 (EC 1.1.99.25). Sequences closely related except for the absence of the N-terminal hydrophobic region, scoring in the gray zone between the trusted and noise cutoffs, include PQQ-dependent glycerol (EC 1.1.99.22) and other polyol (sugar alcohol) dehydrogenases.
Probab=99.30  E-value=9.8e-11  Score=140.38  Aligned_cols=191  Identities=14%  Similarity=0.213  Sum_probs=117.1

Q ss_pred             ecccccEeeEEeccCceee----------eeee------------eeccCCCEEEEEeCCCEEEEEECcCCccceEEEcC
Q 003800           25 EDQVGLMDWHQQYIGKVKH----------AVFH------------TQKTGRKRVVVSTEENVIASLDLRHGEIFWRHVLG   82 (794)
Q Consensus        25 edqvG~~dW~~~~vG~~~~----------~~f~------------~~~~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~   82 (794)
                      ..+.|+..|++..-.....          +.+.            .|...+++||+.|.++.|+|||++||+++|++..+
T Consensus       210 Da~TGk~lW~~d~~~~~~~~~~~~~cRGvay~~~p~~~~~~~~~~~p~~~~~rV~~~T~Dg~LiALDA~TGk~~W~fg~~  289 (764)
T TIGR03074       210 DAATGKEKWKFDPKLKTEAGRQHQTCRGVSYYDAPAAAAGPAAPAAPADCARRIILPTSDARLIALDADTGKLCEDFGNN  289 (764)
T ss_pred             ECCCCcEEEEEcCCCCcccccccccccceEEecCCcccccccccccccccCCEEEEecCCCeEEEEECCCCCEEEEecCC
Confidence            3568999999986332211          0111            12345779999999999999999999999987543


Q ss_pred             ccc--------------ceeeeeeeeCCEEEEEEcc----------CCeEEEEeCCCCcEeEEEeccCccccC-----Cc
Q 003800           83 IND--------------VVDGIDIALGKYVITLSSD----------GSTLRAWNLPDGQMVWESFLRGSKHSK-----PL  133 (794)
Q Consensus        83 ~~~--------------~i~~l~~~~g~~~V~Vs~~----------g~~v~A~d~~tG~llWe~~l~~~~~s~-----~~  133 (794)
                      ...              .+.+. +.+.+++|++++.          .+.|+|+|++||+++|++....+....     ..
T Consensus       290 G~vdl~~~~g~~~~g~~~~ts~-P~V~~g~VIvG~~v~d~~~~~~~~G~I~A~Da~TGkl~W~~~~g~p~~~~~~~~g~~  368 (764)
T TIGR03074       290 GTVDLTAGMGTTPPGYYYPTSP-PLVAGTTVVIGGRVADNYSTDEPSGVIRAFDVNTGALVWAWDPGNPDPTAPPAPGET  368 (764)
T ss_pred             CceeeecccCcCCCcccccccC-CEEECCEEEEEecccccccccCCCcEEEEEECCCCcEeeEEecCCCCcccCCCCCCE
Confidence            210              01122 3455667777632          468999999999999999864322100     00


Q ss_pred             ccccc-----cccccc-CCeEEE-------------------EECCEEEEEECCCCcEEEEEeccCcce----eeee--E
Q 003800          134 LLVPT-----NLKVDK-DSLILV-------------------SSKGCLHAVSSIDGEILWTRDFAAESV----EVQQ--V  182 (794)
Q Consensus       134 ~~~~~-----~~~~~~-~~~V~V-------------------~~~g~l~ald~~tG~~~W~~~~~~~~~----~~~~--~  182 (794)
                      ...+.     ..+.+. .+.+|+                   ...+.|.|||++||+++|.++.....+    .+.+  +
T Consensus       369 ~~~gg~n~W~~~s~D~~~glvy~ptGn~~pd~~g~~r~~~~n~y~~slvALD~~TGk~~W~~Q~~~hD~WD~D~~~~p~L  448 (764)
T TIGR03074       369 YTRNTPNSWSVASYDEKLGLVYLPMGNQTPDQWGGDRTPADEKYSSSLVALDATTGKERWVFQTVHHDLWDMDVPAQPSL  448 (764)
T ss_pred             eccCCCCccCceEEcCCCCeEEEeCCCccccccCCccccCcccccceEEEEeCCCCceEEEecccCCccccccccCCceE
Confidence            00000     001111 133433                   125789999999999999997732211    1112  2


Q ss_pred             EEEec-CC----EEEEEEecCCceeEEEEEEcCCCceeeeeee
Q 003800          183 IQLDE-SD----QIYVVGYAGSSQFHAYQINAMNGELLNHETA  220 (794)
Q Consensus       183 v~s~~-~~----~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v  220 (794)
                      ++... ++    .||..+-+|    .+++||.+||+++|..+.
T Consensus       449 ~d~~~~~G~~~~~v~~~~K~G----~~~vlDr~tG~~l~~~~e  487 (764)
T TIGR03074       449 VDLPDADGTTVPALVAPTKQG----QIYVLDRRTGEPIVPVEE  487 (764)
T ss_pred             EeeecCCCcEeeEEEEECCCC----EEEEEECCCCCEEeecee
Confidence            22112 44    456555455    899999999999998753


No 15 
>TIGR03075 PQQ_enz_alc_DH PQQ-dependent dehydrogenase, methanol/ethanol family. This protein family has a phylogenetic distribution very similar to that of coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Genes in this family often are found adjacent to the PQQ biosynthesis genes themselves. An unusual, strained disulfide bond between adjacent Cys residues contributes to PQQ-binding, as does a Trp residue that is part of a PQQ enzyme repeat (see pfam01011). Characterized members include the dehydrogenase subunit of a membrane-anchored, three-subunit alcohol (ethanol) dehydrogenase of Gluconobacter suboxydans, a homodimeric ethanol dehydrogenase in Pseudomonas aeruginosa, and the large subunit of an alpha2/beta2 heterotetrameric methanol dehydrogenase in Methylobacterium extorquens.
Probab=99.26  E-value=1.8e-10  Score=134.23  Aligned_cols=187  Identities=16%  Similarity=0.229  Sum_probs=115.9

Q ss_pred             ecccccEeeEEeccCcee-ee-----ee-eeeccCCCEEEEEeCCCEEEEEECcCCccceEEEcCccc---ceeeeeeee
Q 003800           25 EDQVGLMDWHQQYIGKVK-HA-----VF-HTQKTGRKRVVVSTEENVIASLDLRHGEIFWRHVLGIND---VVDGIDIAL   94 (794)
Q Consensus        25 edqvG~~dW~~~~vG~~~-~~-----~f-~~~~~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~---~i~~l~~~~   94 (794)
                      ..+.|+..|++..-.... ..     .. ..++..+++||+++.++.|+|||++||+++|++.+....   .+.+. +..
T Consensus        85 Da~TGk~lW~~~~~~~~~~~~~~~~~~~~rg~av~~~~v~v~t~dg~l~ALDa~TGk~~W~~~~~~~~~~~~~tss-P~v  163 (527)
T TIGR03075        85 DAKTGKELWKYDPKLPDDVIPVMCCDVVNRGVALYDGKVFFGTLDARLVALDAKTGKVVWSKKNGDYKAGYTITAA-PLV  163 (527)
T ss_pred             ECCCCceeeEecCCCCcccccccccccccccceEECCEEEEEcCCCEEEEEECCCCCEEeecccccccccccccCC-cEE
Confidence            457899999998622111 01     00 113445789999999999999999999999999875321   12223 234


Q ss_pred             CCEEEEEEcc------CCeEEEEeCCCCcEeEEEeccCcccc---------------------------CCccccccccc
Q 003800           95 GKYVITLSSD------GSTLRAWNLPDGQMVWESFLRGSKHS---------------------------KPLLLVPTNLK  141 (794)
Q Consensus        95 g~~~V~Vs~~------g~~v~A~d~~tG~llWe~~l~~~~~s---------------------------~~~~~~~~~~~  141 (794)
                      .++.|+++..      .+.|+|+|++||+++|++....+...                           ...+...   .
T Consensus       164 ~~g~Vivg~~~~~~~~~G~v~AlD~~TG~~lW~~~~~p~~~~~~~~~~~~~~~~~~~~tw~~~~~~~gg~~~W~~~---s  240 (527)
T TIGR03075       164 VKGKVITGISGGEFGVRGYVTAYDAKTGKLVWRRYTVPGDMGYLDKADKPVGGEPGAKTWPGDAWKTGGGATWGTG---S  240 (527)
T ss_pred             ECCEEEEeecccccCCCcEEEEEECCCCceeEeccCcCCCcccccccccccccccccCCCCCCccccCCCCccCce---e
Confidence            4556666532      36899999999999999887533200                           0011111   2


Q ss_pred             ccc-CCeEEEEE------CC-----------EEEEEECCCCcEEEEEeccCcce------eeeeEEEEecCCE---EEEE
Q 003800          142 VDK-DSLILVSS------KG-----------CLHAVSSIDGEILWTRDFAAESV------EVQQVIQLDESDQ---IYVV  194 (794)
Q Consensus       142 ~~~-~~~V~V~~------~g-----------~l~ald~~tG~~~W~~~~~~~~~------~~~~~v~s~~~~~---vyv~  194 (794)
                      .|. .+.||+..      ++           .|.|||++||+.+|.++......      ....+++...+++   +++.
T Consensus       241 ~D~~~~lvy~~tGnp~p~~~~~r~gdnl~~~s~vAld~~TG~~~W~~Q~~~~D~wD~d~~~~p~l~d~~~~G~~~~~v~~  320 (527)
T TIGR03075       241 YDPETNLIYFGTGNPSPWNSHLRPGDNLYTSSIVARDPDTGKIKWHYQTTPHDEWDYDGVNEMILFDLKKDGKPRKLLAH  320 (527)
T ss_pred             EcCCCCeEEEeCCCCCCCCCCCCCCCCccceeEEEEccccCCEEEeeeCCCCCCccccCCCCcEEEEeccCCcEEEEEEE
Confidence            222 34566643      12           79999999999999998743321      1112232212333   4432


Q ss_pred             EecCCceeEEEEEEcCCCceeeee
Q 003800          195 GYAGSSQFHAYQINAMNGELLNHE  218 (794)
Q Consensus       195 ~~~g~~~~~v~ald~~tG~~~w~~  218 (794)
                      +.   .+..+++||..||+++|..
T Consensus       321 ~~---K~G~~~vlDr~tG~~i~~~  341 (527)
T TIGR03075       321 AD---RNGFFYVLDRTNGKLLSAE  341 (527)
T ss_pred             eC---CCceEEEEECCCCceeccc
Confidence            22   2238999999999998754


No 16 
>KOG4649 consensus PQQ (pyrrolo-quinoline quinone) repeat protein [Secondary metabolites biosynthesis, transport and catabolism]
Probab=99.02  E-value=2e-08  Score=102.37  Aligned_cols=183  Identities=17%  Similarity=0.316  Sum_probs=131.3

Q ss_pred             CCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCC
Q 003800           53 RKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKP  132 (794)
Q Consensus        53 ~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~  132 (794)
                      .-.||.++..+.+.|+|+.+|+..|++.++..  +.+.+...|+. |+++-..+.++-++-+||.+.|....-+..-.+ 
T Consensus        23 kT~v~igSHs~~~~avd~~sG~~~We~ilg~R--iE~sa~vvgdf-VV~GCy~g~lYfl~~~tGs~~w~f~~~~~vk~~-   98 (354)
T KOG4649|consen   23 KTLVVIGSHSGIVIAVDPQSGNLIWEAILGVR--IECSAIVVGDF-VVLGCYSGGLYFLCVKTGSQIWNFVILETVKVR-   98 (354)
T ss_pred             ceEEEEecCCceEEEecCCCCcEEeehhhCce--eeeeeEEECCE-EEEEEccCcEEEEEecchhheeeeeehhhhccc-
Confidence            44699999999999999999999999999876  55544455654 666776778999999999999999887654311 


Q ss_pred             ccccccccccccCCeEEEEE-CCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCC
Q 003800          133 LLLVPTNLKVDKDSLILVSS-KGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMN  211 (794)
Q Consensus       133 ~~~~~~~~~~~~~~~V~V~~-~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~t  211 (794)
                       +.+    +.+ .+.++..+ |+++||||..+-.-+|+.+-+.... ...++ ...++.+|+....|    .|.+.+.++
T Consensus        99 -a~~----d~~-~glIycgshd~~~yalD~~~~~cVykskcgG~~f-~sP~i-~~g~~sly~a~t~G----~vlavt~~~  166 (354)
T KOG4649|consen   99 -AQC----DFD-GGLIYCGSHDGNFYALDPKTYGCVYKSKCGGGTF-VSPVI-APGDGSLYAAITAG----AVLAVTKNP  166 (354)
T ss_pred             -eEE----cCC-CceEEEecCCCcEEEecccccceEEecccCCcee-cccee-cCCCceEEEEeccc----eEEEEccCC
Confidence             122    112 45667764 9999999999999999987776543 22232 34578899887777    899999999


Q ss_pred             C--ceeeeeeeecccCccCceEEEcCc-EEEEEECCCCeEEEEEeecce
Q 003800          212 G--ELLNHETAAFSGGFVGDVALVSSD-TLVTLDTTRSILVTVSFKNRK  257 (794)
Q Consensus       212 G--~~~w~~~v~~~~~~s~~~~~vg~~-~lv~~d~~~g~L~v~~l~sg~  257 (794)
                      +  ..+|......|  +-+++..++.. ++-|+|   |.|...+ ++|+
T Consensus       167 ~~~~~~w~~~~~~P--iF~splcv~~sv~i~~Vd---G~l~~f~-~sG~  209 (354)
T KOG4649|consen  167 YSSTEFWAATRFGP--IFASPLCVGSSVIITTVD---GVLTSFD-ESGR  209 (354)
T ss_pred             CCcceehhhhcCCc--cccCceeccceEEEEEec---cEEEEEc-CCCc
Confidence            9  88898865554  22333444433 233443   5666666 6665


No 17 
>KOG4649 consensus PQQ (pyrrolo-quinoline quinone) repeat protein [Secondary metabolites biosynthesis, transport and catabolism]
Probab=98.90  E-value=4.8e-07  Score=92.50  Aligned_cols=178  Identities=13%  Similarity=0.137  Sum_probs=131.7

Q ss_pred             ecccccEeeEEeccCceeeeeeeeeccCCCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEcc
Q 003800           25 EDQVGLMDWHQQYIGKVKHAVFHTQKTGRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSD  104 (794)
Q Consensus        25 edqvG~~dW~~~~vG~~~~~~f~~~~~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~  104 (794)
                      ..|.|+..|++-+-++....+..    -++-|+++-.+|.|+-|+-+||+..|..+..+....... ..-..++++.++.
T Consensus        39 d~~sG~~~We~ilg~RiE~sa~v----vgdfVV~GCy~g~lYfl~~~tGs~~w~f~~~~~vk~~a~-~d~~~glIycgsh  113 (354)
T KOG4649|consen   39 DPQSGNLIWEAILGVRIECSAIV----VGDFVVLGCYSGGLYFLCVKTGSQIWNFVILETVKVRAQ-CDFDGGLIYCGSH  113 (354)
T ss_pred             cCCCCcEEeehhhCceeeeeeEE----ECCEEEEEEccCcEEEEEecchhheeeeeehhhhccceE-EcCCCceEEEecC
Confidence            47899999999886666533222    267799999999999999999999999988665211111 2346678998988


Q ss_pred             CCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEE-ECCEEEEEECCCC--cEEEEEeccCcceeeee
Q 003800          105 GSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVS-SKGCLHAVSSIDG--EILWTRDFAAESVEVQQ  181 (794)
Q Consensus       105 g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG--~~~W~~~~~~~~~~~~~  181 (794)
                      +++.+|+|..+=.-+|+.+-.+...+ + |.+.     .+++.+|+. ..|.|.|++.+++  ...|.+....|-..-.+
T Consensus       114 d~~~yalD~~~~~cVykskcgG~~f~-s-P~i~-----~g~~sly~a~t~G~vlavt~~~~~~~~~w~~~~~~PiF~spl  186 (354)
T KOG4649|consen  114 DGNFYALDPKTYGCVYKSKCGGGTFV-S-PVIA-----PGDGSLYAAITAGAVLAVTKNPYSSTEFWAATRFGPIFASPL  186 (354)
T ss_pred             CCcEEEecccccceEEecccCCceec-c-ceec-----CCCceEEEEeccceEEEEccCCCCcceehhhhcCCccccCce
Confidence            88999999999999999888776552 2 2221     125678887 5999999999999  89999988777542223


Q ss_pred             EEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeeeeeecc
Q 003800          182 VIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHETAAFS  223 (794)
Q Consensus       182 ~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~  223 (794)
                      ++    ...+.....+|    .+.++| .+|+.+|+.+...|
T Consensus       187 cv----~~sv~i~~VdG----~l~~f~-~sG~qvwr~~t~Gp  219 (354)
T KOG4649|consen  187 CV----GSSVIITTVDG----VLTSFD-ESGRQVWRPATKGP  219 (354)
T ss_pred             ec----cceEEEEEecc----EEEEEc-CCCcEEEeecCCCc
Confidence            32    23344445566    899999 79999998865443


No 18 
>COG4993 Gcd Glucose dehydrogenase [Carbohydrate transport and metabolism]
Probab=98.69  E-value=5.1e-07  Score=101.80  Aligned_cols=205  Identities=18%  Similarity=0.214  Sum_probs=124.7

Q ss_pred             CCCEEEEEeCCCEEEEEECcCCccceEEEcCccccee-------eee--eeeCC------EEEEEEccCCeEEEEeCCCC
Q 003800           52 GRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVD-------GID--IALGK------YVITLSSDGSTLRAWNLPDG  116 (794)
Q Consensus        52 ~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~-------~l~--~~~g~------~~V~Vs~~g~~v~A~d~~tG  116 (794)
                      .++.+|+.|.-|.+.|||++||++.||.+-..+.++.       ++.  .....      ..|++...+.+|.|+|++||
T Consensus       213 vgdtlYvcTphn~v~ALDa~TGkekWkydp~~~~nv~~~~~tCrgVsy~~a~a~~k~pc~~rIflpt~DarlIALdA~tG  292 (773)
T COG4993         213 VGDTLYVCTPHNRVFALDAATGKEKWKYDPNLKSNVDPQHQTCRGVSYGAAKADAKSPCPRRIFLPTADARLIALDADTG  292 (773)
T ss_pred             ECCEEEEecCcceeEEeeccCCceeeecCCCCCCCcccccccccceecccccccccCCCceeEEeecCCceEEEEeCCCC
Confidence            3778999999999999999999999999876553222       110  01112      34777766789999999999


Q ss_pred             cEeEEEeccCccc-------cCCccccccccccccCCeEEEE-E----------CCEEEEEECCCCcEEEEEeccCccee
Q 003800          117 QMVWESFLRGSKH-------SKPLLLVPTNLKVDKDSLILVS-S----------KGCLHAVSSIDGEILWTRDFAAESVE  178 (794)
Q Consensus       117 ~llWe~~l~~~~~-------s~~~~~~~~~~~~~~~~~V~V~-~----------~g~l~ald~~tG~~~W~~~~~~~~~~  178 (794)
                      +..|.+.-.+...       ..+-...+.+...-....+++. +          .|.+.++|..+|+..|.++...+...
T Consensus       293 kvc~~Fa~~Ga~~l~tgm~~~k~g~y~~tS~p~~~~~~~v~~g~v~Dn~st~e~sgVir~fdv~tG~l~w~~D~gnpD~t  372 (773)
T COG4993         293 KVCWSFANKGALNLETGMKDTKDGLYYGTSPPEFGVKGIVIAGSVADNESTWEPSGVIRGFDVLTGKLTWAGDPGNPDPT  372 (773)
T ss_pred             cEeheeccCceeeeeccCCCCCCCeEeecCCCcccceeEEEeeccCCCceeeccCccccccccccCceEEccCCCCCCCC
Confidence            9999976443210       0111111111011112333332 1          57888999999999999987655421


Q ss_pred             ----eeeE----------EEE--ecCCEEEEEEec------C--------CceeEEEEEEcCCCceeeeeeeecc--cCc
Q 003800          179 ----VQQV----------IQL--DESDQIYVVGYA------G--------SSQFHAYQINAMNGELLNHETAAFS--GGF  226 (794)
Q Consensus       179 ----~~~~----------v~s--~~~~~vyv~~~~------g--------~~~~~v~ald~~tG~~~w~~~v~~~--~~~  226 (794)
                          +.+-          ..+  ..-+.||+-.-.      |        .++-.++|+|+.||+..|-++..-.  ++.
T Consensus       373 ~p~~~g~tyt~nspn~W~~~SyD~~lnlVy~p~Gn~~pd~wg~trtp~dekysssivAlD~~TG~~kW~yQtvhhDlWDm  452 (773)
T COG4993         373 APTAPGQTYTRNSPNSWASASYDAKLNLVYVPMGNQTPDTWGGTRTPGDEKYSSSIVALDATTGKLKWVYQTVHHDLWDM  452 (773)
T ss_pred             CCCCCCceeecCCCCcccccccCCCCCeEEEeCCCCChhhccCCCCcccccccceeEEecCCCcceeeeeeccCcchhcc
Confidence                1010          001  234567763221      1        1245789999999999998864221  222


Q ss_pred             cC--ceEEE----cC---cEEEEEECCCCeEEEEEeecce
Q 003800          227 VG--DVALV----SS---DTLVTLDTTRSILVTVSFKNRK  257 (794)
Q Consensus       227 s~--~~~~v----g~---~~lv~~d~~~g~L~v~~l~sg~  257 (794)
                      +.  .+.+.    .+   ..++..+ .+|.++++|-.+|+
T Consensus       453 Dvp~qp~L~D~~~DG~~vpalv~pt-k~G~~YVlDRrtGe  491 (773)
T COG4993         453 DVPAQPTLLDITKDGKVVPALVHPT-KNGFIYVLDRRTGE  491 (773)
T ss_pred             cCCCCceEEEeecCCcEeeeeeccc-ccCcEEEEEcCCCc
Confidence            22  22222    11   1455555 46899999999988


No 19 
>TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol.
Probab=97.80  E-value=0.071  Score=56.41  Aligned_cols=183  Identities=13%  Similarity=0.148  Sum_probs=102.7

Q ss_pred             EEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEE-EccCCeEEEEeCCCCcEeEEEeccCccccCCcc
Q 003800           56 VVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITL-SSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLL  134 (794)
Q Consensus        56 Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~V-s~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~  134 (794)
                      ++..+.+|.|..+|.++|+.+.+......  ..++....++..+++ ++.++.++.||..+|+.+.+........  ...
T Consensus         4 ~~s~~~d~~v~~~d~~t~~~~~~~~~~~~--~~~l~~~~dg~~l~~~~~~~~~v~~~d~~~~~~~~~~~~~~~~~--~~~   79 (300)
T TIGR03866         4 YVSNEKDNTISVIDTATLEVTRTFPVGQR--PRGITLSKDGKLLYVCASDSDTIQVIDLATGEVIGTLPSGPDPE--LFA   79 (300)
T ss_pred             EEEecCCCEEEEEECCCCceEEEEECCCC--CCceEECCCCCEEEEEECCCCeEEEEECCCCcEEEeccCCCCcc--EEE
Confidence            44556789999999999998777654332  223322333444544 4456799999999999876654332211  111


Q ss_pred             ccccccccccCCeEEEEE--CCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCC
Q 003800          135 LVPTNLKVDKDSLILVSS--KGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNG  212 (794)
Q Consensus       135 ~~~~~~~~~~~~~V~V~~--~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG  212 (794)
                      +     ..+ ++.+++..  ++.+..+|..+++.+...+....   +..+.. ..++..++++..++  ..+..+|..+|
T Consensus        80 ~-----~~~-g~~l~~~~~~~~~l~~~d~~~~~~~~~~~~~~~---~~~~~~-~~dg~~l~~~~~~~--~~~~~~d~~~~  147 (300)
T TIGR03866        80 L-----HPN-GKILYIANEDDNLVTVIDIETRKVLAEIPVGVE---PEGMAV-SPDGKIVVNTSETT--NMAHFIDTKTY  147 (300)
T ss_pred             E-----CCC-CCEEEEEcCCCCeEEEEECCCCeEEeEeeCCCC---cceEEE-CCCCCEEEEEecCC--CeEEEEeCCCC
Confidence            1     112 34465552  78999999999888777654321   122221 23444444443322  13555788888


Q ss_pred             ceeeeeeeecccCccCceEEE-cCcEEEEEECCCCeEEEEEeecce
Q 003800          213 ELLNHETAAFSGGFVGDVALV-SSDTLVTLDTTRSILVTVSFKNRK  257 (794)
Q Consensus       213 ~~~w~~~v~~~~~~s~~~~~v-g~~~lv~~d~~~g~L~v~~l~sg~  257 (794)
                      +........  ... ..+.+- .+..++......+.+.+.|+++++
T Consensus       148 ~~~~~~~~~--~~~-~~~~~s~dg~~l~~~~~~~~~v~i~d~~~~~  190 (300)
T TIGR03866       148 EIVDNVLVD--QRP-RFAEFTADGKELWVSSEIGGTVSVIDVATRK  190 (300)
T ss_pred             eEEEEEEcC--CCc-cEEEECCCCCEEEEEcCCCCEEEEEEcCcce
Confidence            776543211  111 112222 223333333345789999999876


No 20 
>PF02239 Cytochrom_D1:  Cytochrome D1 heme domain; PDB: 1NNO_B 1HZU_A 1N15_B 1N50_A 1GJQ_A 1BL9_B 1NIR_B 1N90_B 1HZV_A 1AOQ_A ....
Probab=97.71  E-value=0.067  Score=60.06  Aligned_cols=187  Identities=15%  Similarity=0.148  Sum_probs=101.8

Q ss_pred             cceeecccccEeeEEeccCceeeeeeeeeccCCCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEE
Q 003800           21 LSLYEDQVGLMDWHQQYIGKVKHAVFHTQKTGRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVIT  100 (794)
Q Consensus        21 ~Al~edqvG~~dW~~~~vG~~~~~~f~~~~~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~  100 (794)
                      .++.+.+..++.-+.+..|.+ +... ..+.+++.+|+++.+|.|.-+|+.+++++-+......  ..++....++..++
T Consensus        18 v~viD~~t~~~~~~i~~~~~~-h~~~-~~s~Dgr~~yv~~rdg~vsviD~~~~~~v~~i~~G~~--~~~i~~s~DG~~~~   93 (369)
T PF02239_consen   18 VAVIDGATNKVVARIPTGGAP-HAGL-KFSPDGRYLYVANRDGTVSVIDLATGKVVATIKVGGN--PRGIAVSPDGKYVY   93 (369)
T ss_dssp             EEEEETTT-SEEEEEE-STTE-EEEE-E-TT-SSEEEEEETTSEEEEEETTSSSEEEEEE-SSE--EEEEEE--TTTEEE
T ss_pred             EEEEECCCCeEEEEEcCCCCc-eeEE-EecCCCCEEEEEcCCCeEEEEECCcccEEEEEecCCC--cceEEEcCCCCEEE
Confidence            466777777777777765544 2111 1123456799999999999999999999999887654  33443344555666


Q ss_pred             EEc-cCCeEEEEeCCCCcEeEEEeccCccc----cCCccccccccccccCCeEEEE--E-CCEEEEEECCCCcEEEEEec
Q 003800          101 LSS-DGSTLRAWNLPDGQMVWESFLRGSKH----SKPLLLVPTNLKVDKDSLILVS--S-KGCLHAVSSIDGEILWTRDF  172 (794)
Q Consensus       101 Vs~-~g~~v~A~d~~tG~llWe~~l~~~~~----s~~~~~~~~~~~~~~~~~V~V~--~-~g~l~ald~~tG~~~W~~~~  172 (794)
                      ++. ..+.+..+|++|.+++=+....+...    +....++.    .. .+.-++.  . .+++.-+|-.+.+.+.....
T Consensus        94 v~n~~~~~v~v~D~~tle~v~~I~~~~~~~~~~~~Rv~aIv~----s~-~~~~fVv~lkd~~~I~vVdy~d~~~~~~~~i  168 (369)
T PF02239_consen   94 VANYEPGTVSVIDAETLEPVKTIPTGGMPVDGPESRVAAIVA----SP-GRPEFVVNLKDTGEIWVVDYSDPKNLKVTTI  168 (369)
T ss_dssp             EEEEETTEEEEEETTT--EEEEEE--EE-TTTS---EEEEEE-----S-SSSEEEEEETTTTEEEEEETTTSSCEEEEEE
T ss_pred             EEecCCCceeEeccccccceeecccccccccccCCCceeEEe----cC-CCCEEEEEEccCCeEEEEEeccccccceeee
Confidence            665 46799999999999998877653221    00001111    01 2222222  2 46666666655554443322


Q ss_pred             cCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeeeee
Q 003800          173 AAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHETA  220 (794)
Q Consensus       173 ~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v  220 (794)
                      ..... +.-.. ...+++.|+++..++.  ++..+|.++++.++....
T Consensus       169 ~~g~~-~~D~~-~dpdgry~~va~~~sn--~i~viD~~~~k~v~~i~~  212 (369)
T PF02239_consen  169 KVGRF-PHDGG-FDPDGRYFLVAANGSN--KIAVIDTKTGKLVALIDT  212 (369)
T ss_dssp             E--TT-EEEEE-E-TTSSEEEEEEGGGT--EEEEEETTTTEEEEEEE-
T ss_pred             ccccc-ccccc-cCcccceeeecccccc--eeEEEeeccceEEEEeec
Confidence            22111 11111 1234555555544322  888999999999876543


No 21 
>COG4993 Gcd Glucose dehydrogenase [Carbohydrate transport and metabolism]
Probab=97.67  E-value=0.00079  Score=76.75  Aligned_cols=165  Identities=15%  Similarity=0.282  Sum_probs=95.5

Q ss_pred             CCCEEEEEeCCCEEEEEECcCCccceEEEcCccc--------c----eeee-eeeeCCEEEEEEc----------cCCeE
Q 003800           52 GRKRVVVSTEENVIASLDLRHGEIFWRHVLGIND--------V----VDGI-DIALGKYVITLSS----------DGSTL  108 (794)
Q Consensus        52 ~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~--------~----i~~l-~~~~g~~~V~Vs~----------~g~~v  108 (794)
                      ...|||.-|.+..|.|||++||++.|.+.-....        .    ..+. ++..+...+++++          ..+.+
T Consensus       271 c~~rIflpt~DarlIALdA~tGkvc~~Fa~~Ga~~l~tgm~~~k~g~y~~tS~p~~~~~~~v~~g~v~Dn~st~e~sgVi  350 (773)
T COG4993         271 CPRRIFLPTADARLIALDADTGKVCWSFANKGALNLETGMKDTKDGLYYGTSPPEFGVKGIVIAGSVADNESTWEPSGVI  350 (773)
T ss_pred             CceeEEeecCCceEEEEeCCCCcEeheeccCceeeeeccCCCCCCCeEeecCCCcccceeEEEeeccCCCceeeccCccc
Confidence            3567999999999999999999999995432210        0    0000 0112222222222          13578


Q ss_pred             EEEeCCCCcEeEEEeccCcccc------------CCcccccccccccc-CCeEEEE-E------------------CCEE
Q 003800          109 RAWNLPDGQMVWESFLRGSKHS------------KPLLLVPTNLKVDK-DSLILVS-S------------------KGCL  156 (794)
Q Consensus       109 ~A~d~~tG~llWe~~l~~~~~s------------~~~~~~~~~~~~~~-~~~V~V~-~------------------~g~l  156 (794)
                      |++|..+|+++|...-..+..-            .+.....+  ..|. -+.||+- .                  ...+
T Consensus       351 r~fdv~tG~l~w~~D~gnpD~t~p~~~g~tyt~nspn~W~~~--SyD~~lnlVy~p~Gn~~pd~wg~trtp~dekysssi  428 (773)
T COG4993         351 RGFDVLTGKLTWAGDPGNPDPTAPTAPGQTYTRNSPNSWASA--SYDAKLNLVYVPMGNQTPDTWGGTRTPGDEKYSSSI  428 (773)
T ss_pred             cccccccCceEEccCCCCCCCCCCCCCCceeecCCCCccccc--ccCCCCCeEEEeCCCCChhhccCCCCccccccccee
Confidence            9999999999999876543210            00000000  1111 2456652 1                  3479


Q ss_pred             EEEECCCCcEEEEEeccCcce----eeee--EEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeee
Q 003800          157 HAVSSIDGEILWTRDFAAESV----EVQQ--VIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHE  218 (794)
Q Consensus       157 ~ald~~tG~~~W~~~~~~~~~----~~~~--~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~  218 (794)
                      .|+|+.||+.+|.++..-..+    .+.|  +.+...++++.=+-.....+..++.+|..||+++-..
T Consensus       429 vAlD~~TG~~kW~yQtvhhDlWDmDvp~qp~L~D~~~DG~~vpalv~ptk~G~~YVlDRrtGe~lv~~  496 (773)
T COG4993         429 VALDATTGKLKWVYQTVHHDLWDMDVPAQPTLLDITKDGKVVPALVHPTKNGFIYVLDRRTGELLVPI  496 (773)
T ss_pred             EEecCCCcceeeeeeccCcchhcccCCCCceEEEeecCCcEeeeeecccccCcEEEEEcCCCcccccc
Confidence            999999999999987754322    1233  2223345544322222222347999999999987544


No 22 
>PF01011 PQQ:  PQQ enzyme repeat family.;  InterPro: IPR002372 Pyrrolo-quinoline quinone (PQQ) is a redox coenzyme, which serves as a cofactor for a number of enzymes (quinoproteins) and particularly for some bacterial dehydrogenases [, ]. A number of bacterial quinoproteins belong to this family. Enzymes in this group have repeats of a beta propeller.; PDB: 1H4I_C 1H4J_E 1W6S_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A 1G72_A ....
Probab=97.59  E-value=0.0001  Score=54.31  Aligned_cols=31  Identities=29%  Similarity=0.595  Sum_probs=28.7

Q ss_pred             CEEEEEeCCCEEEEEECcCCccceEEEcCcc
Q 003800           54 KRVVVSTEENVIASLDLRHGEIFWRHVLGIN   84 (794)
Q Consensus        54 ~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~   84 (794)
                      ++||+++.+|.|+|||++||+++|+++.+..
T Consensus         1 ~~v~~~~~~g~l~AlD~~TG~~~W~~~~~~~   31 (38)
T PF01011_consen    1 GRVYVGTPDGYLYALDAKTGKVLWKFQTGPP   31 (38)
T ss_dssp             TEEEEETTTSEEEEEETTTTSEEEEEESSSG
T ss_pred             CEEEEeCCCCEEEEEECCCCCEEEeeeCCCC
Confidence            5799999999999999999999999998765


No 23 
>TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol.
Probab=97.56  E-value=0.16  Score=53.71  Aligned_cols=189  Identities=14%  Similarity=0.156  Sum_probs=102.8

Q ss_pred             CCCEEEEE-eCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEE-ccCCeEEEEeCCCCcEeEEEeccCccc
Q 003800           52 GRKRVVVS-TEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLS-SDGSTLRAWNLPDGQMVWESFLRGSKH  129 (794)
Q Consensus        52 ~~~~Vyv~-t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs-~~g~~v~A~d~~tG~llWe~~l~~~~~  129 (794)
                      +++.+|++ +.++.|..+|.++|+...+......  ...+....++..++++ ..++.++.||..+++.+.+........
T Consensus        41 dg~~l~~~~~~~~~v~~~d~~~~~~~~~~~~~~~--~~~~~~~~~g~~l~~~~~~~~~l~~~d~~~~~~~~~~~~~~~~~  118 (300)
T TIGR03866        41 DGKLLYVCASDSDTIQVIDLATGEVIGTLPSGPD--PELFALHPNGKILYIANEDDNLVTVIDIETRKVLAEIPVGVEPE  118 (300)
T ss_pred             CCCEEEEEECCCCeEEEEECCCCcEEEeccCCCC--ccEEEECCCCCEEEEEcCCCCeEEEEECCCCeEEeEeeCCCCcc
Confidence            34557654 5678999999999987654433222  2222122334455555 345799999999998887766432211


Q ss_pred             cCCccccccccccccCCeEEE-EE-C-CEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEE
Q 003800          130 SKPLLLVPTNLKVDKDSLILV-SS-K-GCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQ  206 (794)
Q Consensus       130 s~~~~~~~~~~~~~~~~~V~V-~~-~-g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~a  206 (794)
                        ...+       ..++..++ .. + ..++.+|..+|+..........   +..+..+..+..+++.+..++   .+..
T Consensus       119 --~~~~-------~~dg~~l~~~~~~~~~~~~~d~~~~~~~~~~~~~~~---~~~~~~s~dg~~l~~~~~~~~---~v~i  183 (300)
T TIGR03866       119 --GMAV-------SPDGKIVVNTSETTNMAHFIDTKTYEIVDNVLVDQR---PRFAEFTADGKELWVSSEIGG---TVSV  183 (300)
T ss_pred             --eEEE-------CCCCCEEEEEecCCCeEEEEeCCCCeEEEEEEcCCC---ccEEEECCCCCEEEEEcCCCC---EEEE
Confidence              1111       11344444 33 2 3567778888877655433221   222222234445655443233   7888


Q ss_pred             EEcCCCceeeeeeeeccc----CccC-ceEEE-cCcEEEEEECCCCeEEEEEeecce
Q 003800          207 INAMNGELLNHETAAFSG----GFVG-DVALV-SSDTLVTLDTTRSILVTVSFKNRK  257 (794)
Q Consensus       207 ld~~tG~~~w~~~v~~~~----~~s~-~~~~v-g~~~lv~~d~~~g~L~v~~l~sg~  257 (794)
                      +|..+|+.+.+.....+.    .... .+.+- .+..+++.....+.+++.|+++++
T Consensus       184 ~d~~~~~~~~~~~~~~~~~~~~~~~~~~i~~s~dg~~~~~~~~~~~~i~v~d~~~~~  240 (300)
T TIGR03866       184 IDVATRKVIKKITFEIPGVHPEAVQPVGIKLTKDGKTAFVALGPANRVAVVDAKTYE  240 (300)
T ss_pred             EEcCcceeeeeeeecccccccccCCccceEECCCCCEEEEEcCCCCeEEEEECCCCc
Confidence            999999876554322211    1111 12221 233433433345678888988776


No 24 
>cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto
Probab=97.44  E-value=0.11  Score=53.13  Aligned_cols=186  Identities=18%  Similarity=0.182  Sum_probs=110.5

Q ss_pred             CCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCC
Q 003800           53 RKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKP  132 (794)
Q Consensus        53 ~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~  132 (794)
                      ++.+++++.+|.+...|..+++...+...... .+..+.....+..+++++.++.++.||..+++...+........ ..
T Consensus        21 ~~~l~~~~~~g~i~i~~~~~~~~~~~~~~~~~-~i~~~~~~~~~~~l~~~~~~~~i~i~~~~~~~~~~~~~~~~~~i-~~   98 (289)
T cd00200          21 GKLLATGSGDGTIKVWDLETGELLRTLKGHTG-PVRDVAASADGTYLASGSSDKTIRLWDLETGECVRTLTGHTSYV-SS   98 (289)
T ss_pred             CCEEEEeecCcEEEEEEeeCCCcEEEEecCCc-ceeEEEECCCCCEEEEEcCCCeEEEEEcCcccceEEEeccCCcE-EE
Confidence            56788888899999999999987777654332 23222222233355556656799999999998888877544222 11


Q ss_pred             ccccccccccccCCeEEE-EE-CCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEe-cCCceeEEEEEEc
Q 003800          133 LLLVPTNLKVDKDSLILV-SS-KGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGY-AGSSQFHAYQINA  209 (794)
Q Consensus       133 ~~~~~~~~~~~~~~~V~V-~~-~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~-~g~~~~~v~ald~  209 (794)
                      ...       ..++.+++ .. +|.+..+|..+++...........  ...+.. ...+.+++.+. +|    .+..+|.
T Consensus        99 ~~~-------~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~--i~~~~~-~~~~~~l~~~~~~~----~i~i~d~  164 (289)
T cd00200          99 VAF-------SPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDW--VNSVAF-SPDGTFVASSSQDG----TIKLWDL  164 (289)
T ss_pred             EEE-------cCCCCEEEEecCCCeEEEEECCCcEEEEEeccCCCc--EEEEEE-cCcCCEEEEEcCCC----cEEEEEc
Confidence            111       11334444 44 899999999999888877633222  122221 12244444444 44    6888899


Q ss_pred             CCCceeeeeeeecccCccCceEEE-cCcEEEEEECCCCeEEEEEeecce
Q 003800          210 MNGELLNHETAAFSGGFVGDVALV-SSDTLVTLDTTRSILVTVSFKNRK  257 (794)
Q Consensus       210 ~tG~~~w~~~v~~~~~~s~~~~~v-g~~~lv~~d~~~g~L~v~~l~sg~  257 (794)
                      .+++.+...... ...+.. +.+. .++.+++... .+.+.+.++.+++
T Consensus       165 ~~~~~~~~~~~~-~~~i~~-~~~~~~~~~l~~~~~-~~~i~i~d~~~~~  210 (289)
T cd00200         165 RTGKCVATLTGH-TGEVNS-VAFSPDGEKLLSSSS-DGTIKLWDLSTGK  210 (289)
T ss_pred             cccccceeEecC-ccccce-EEECCCcCEEEEecC-CCcEEEEECCCCc
Confidence            888887665411 111211 2222 2224444432 6888888888765


No 25 
>cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto
Probab=97.43  E-value=0.069  Score=54.64  Aligned_cols=187  Identities=14%  Similarity=0.156  Sum_probs=111.7

Q ss_pred             CCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCC
Q 003800           53 RKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKP  132 (794)
Q Consensus        53 ~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~  132 (794)
                      ++.+++++.+|.+...|..+++...+...... .+..+.....+.+++.++.++.++.||..+++............ ..
T Consensus        63 ~~~l~~~~~~~~i~i~~~~~~~~~~~~~~~~~-~i~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~~~i-~~  140 (289)
T cd00200          63 GTYLASGSSDKTIRLWDLETGECVRTLTGHTS-YVSSVAFSPDGRILSSSSRDKTIKVWDVETGKCLTTLRGHTDWV-NS  140 (289)
T ss_pred             CCEEEEEcCCCeEEEEEcCcccceEEEeccCC-cEEEEEEcCCCCEEEEecCCCeEEEEECCCcEEEEEeccCCCcE-EE
Confidence            45789999999999999999988777654332 23333222233455555546799999999999988877433222 11


Q ss_pred             ccccccccccccCCeEEE-EE-CCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcC
Q 003800          133 LLLVPTNLKVDKDSLILV-SS-KGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAM  210 (794)
Q Consensus       133 ~~~~~~~~~~~~~~~V~V-~~-~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~  210 (794)
                      +.       ...++.+++ .. ++.+..+|..+++....+.......  ..+.....+..+++.+.+|    .+..+|..
T Consensus       141 ~~-------~~~~~~~l~~~~~~~~i~i~d~~~~~~~~~~~~~~~~i--~~~~~~~~~~~l~~~~~~~----~i~i~d~~  207 (289)
T cd00200         141 VA-------FSPDGTFVASSSQDGTIKLWDLRTGKCVATLTGHTGEV--NSVAFSPDGEKLLSSSSDG----TIKLWDLS  207 (289)
T ss_pred             EE-------EcCcCCEEEEEcCCCcEEEEEccccccceeEecCcccc--ceEEECCCcCEEEEecCCC----cEEEEECC
Confidence            11       121233444 45 8999999999998877776433221  2222112233566554444    68888998


Q ss_pred             CCceeeeeeeecccCccCceEEE-cCcEEEEEECCCCeEEEEEeecce
Q 003800          211 NGELLNHETAAFSGGFVGDVALV-SSDTLVTLDTTRSILVTVSFKNRK  257 (794)
Q Consensus       211 tG~~~w~~~v~~~~~~s~~~~~v-g~~~lv~~d~~~g~L~v~~l~sg~  257 (794)
                      +|+.+.+... .+..+.. +.+- .+.++++.+ .++.+++.++.+++
T Consensus       208 ~~~~~~~~~~-~~~~i~~-~~~~~~~~~~~~~~-~~~~i~i~~~~~~~  252 (289)
T cd00200         208 TGKCLGTLRG-HENGVNS-VAFSPDGYLLASGS-EDGTIRVWDLRTGE  252 (289)
T ss_pred             CCceecchhh-cCCceEE-EEEcCCCcEEEEEc-CCCcEEEEEcCCce
Confidence            8887765521 1111211 1111 223444443 46889888888765


No 26 
>PF05096 Glu_cyclase_2:  Glutamine cyclotransferase;  InterPro: IPR007788 This family of enzymes 2.3.2.5 from EC catalyse the cyclization of free L-glutamine and N-terminal glutaminyl residues in proteins to pyroglutamate (5-oxoproline) and pyroglutamyl residues respectively []. This family includes plant and bacterial enzymes and seems unrelated to the mammalian enzymes.; PDB: 3NOK_B 2FAW_A 2IWA_A 3NOM_A 3NOL_A 3MBR_X.
Probab=97.41  E-value=0.024  Score=59.93  Aligned_cols=155  Identities=15%  Similarity=0.102  Sum_probs=99.0

Q ss_pred             CCCEEEEEeCC---CEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCcc
Q 003800           52 GRKRVVVSTEE---NVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSK  128 (794)
Q Consensus        52 ~~~~Vyv~t~~---g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~  128 (794)
                      .++.+|-+|..   ..|..+|++||++..++.++...-..|+ ...++.+.-++=..+....||+++-+++=+.+..++.
T Consensus        54 ~~g~LyESTG~yG~S~l~~~d~~tg~~~~~~~l~~~~FgEGi-t~~~d~l~qLTWk~~~~f~yd~~tl~~~~~~~y~~EG  132 (264)
T PF05096_consen   54 DDGTLYESTGLYGQSSLRKVDLETGKVLQSVPLPPRYFGEGI-TILGDKLYQLTWKEGTGFVYDPNTLKKIGTFPYPGEG  132 (264)
T ss_dssp             ETTEEEEEECSTTEEEEEEEETTTSSEEEEEE-TTT--EEEE-EEETTEEEEEESSSSEEEEEETTTTEEEEEEE-SSS-
T ss_pred             CCCEEEEeCCCCCcEEEEEEECCCCcEEEEEECCccccceeE-EEECCEEEEEEecCCeEEEEccccceEEEEEecCCcc
Confidence            37899999973   4899999999999999999876322244 2356667776765679999999999999888877665


Q ss_pred             ccCCccccccccccccCCeEEEEE--CCEEEEEECCCCcEEEEEeccCcceeeeeEE-EEecCCEEEEEEecCCceeEEE
Q 003800          129 HSKPLLLVPTNLKVDKDSLILVSS--KGCLHAVSSIDGEILWTRDFAAESVEVQQVI-QLDESDQIYVVGYAGSSQFHAY  205 (794)
Q Consensus       129 ~s~~~~~~~~~~~~~~~~~V~V~~--~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v-~s~~~~~vyv~~~~g~~~~~v~  205 (794)
                      .  +  +     .-  ++.-++.+  ..+|+-+|+++-+..=+.+.........++- ....+|.+|+=-....   .++
T Consensus       133 W--G--L-----t~--dg~~Li~SDGS~~L~~~dP~~f~~~~~i~V~~~g~pv~~LNELE~i~G~IyANVW~td---~I~  198 (264)
T PF05096_consen  133 W--G--L-----TS--DGKRLIMSDGSSRLYFLDPETFKEVRTIQVTDNGRPVSNLNELEYINGKIYANVWQTD---RIV  198 (264)
T ss_dssp             ---E--E-----EE--CSSCEEEE-SSSEEEEE-TTT-SEEEEEE-EETTEE---EEEEEEETTEEEEEETTSS---EEE
T ss_pred             e--E--E-----Ec--CCCEEEEECCccceEEECCcccceEEEEEEEECCEECCCcEeEEEEcCEEEEEeCCCC---eEE
Confidence            4  1  1     11  34444444  5689999999887665554432221111110 0124889997444433   789


Q ss_pred             EEEcCCCceeeeeeee
Q 003800          206 QINAMNGELLNHETAA  221 (794)
Q Consensus       206 ald~~tG~~~w~~~v~  221 (794)
                      .+|++||++.-...++
T Consensus       199 ~Idp~tG~V~~~iDls  214 (264)
T PF05096_consen  199 RIDPETGKVVGWIDLS  214 (264)
T ss_dssp             EEETTT-BEEEEEE-H
T ss_pred             EEeCCCCeEEEEEEhh
Confidence            9999999999777653


No 27 
>TIGR02658 TTQ_MADH_Hv methylamine dehydrogenase heavy chain. This family consists of the heavy chain of methylamine dehydrogenase, a periplasmic enzyme. The enzyme contains a tryptophan tryptophylquinone (TTQ) prosthetic group derived from two Trp residues in the light subunit. The enzyme forms a complex with the type I blue copper protein amicyanin and a cytochrome. Electron transfer proceeds from TTQ to the copper and then to the heme group of the cytochrome.
Probab=97.39  E-value=0.35  Score=53.73  Aligned_cols=191  Identities=12%  Similarity=0.072  Sum_probs=115.9

Q ss_pred             cCCCEEEEEeCC-----CEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEE-Ec---------cCCeEEEEeCCC
Q 003800           51 TGRKRVVVSTEE-----NVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITL-SS---------DGSTLRAWNLPD  115 (794)
Q Consensus        51 ~~~~~Vyv~t~~-----g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~V-s~---------~g~~v~A~d~~t  115 (794)
                      .+..++||....     |.|..+|.++++++=.........  +. ++.++..+|+ .+         ....|..||++|
T Consensus        10 ~~~~~v~V~d~~~~~~~~~v~ViD~~~~~v~g~i~~G~~P~--~~-~spDg~~lyva~~~~~R~~~G~~~d~V~v~D~~t   86 (352)
T TIGR02658        10 SDARRVYVLDPGHFAATTQVYTIDGEAGRVLGMTDGGFLPN--PV-VASDGSFFAHASTVYSRIARGKRTDYVEVIDPQT   86 (352)
T ss_pred             CCCCEEEEECCcccccCceEEEEECCCCEEEEEEEccCCCc--ee-ECCCCCEEEEEeccccccccCCCCCEEEEEECcc
Confidence            346789999886     899999999998875555543321  22 3445555655 44         457999999999


Q ss_pred             CcEeEEEeccCc-cc--cCCccccccccccccCCeEEEE--E-CCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCC
Q 003800          116 GQMVWESFLRGS-KH--SKPLLLVPTNLKVDKDSLILVS--S-KGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESD  189 (794)
Q Consensus       116 G~llWe~~l~~~-~~--s~~~~~~~~~~~~~~~~~V~V~--~-~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~  189 (794)
                      ++++.+..+... ..  ........  ...+ ++.+||.  + +..|..+|..+++++=+.+.+....    +. ...++
T Consensus        87 ~~~~~~i~~p~~p~~~~~~~~~~~~--ls~d-gk~l~V~n~~p~~~V~VvD~~~~kvv~ei~vp~~~~----vy-~t~e~  158 (352)
T TIGR02658        87 HLPIADIELPEGPRFLVGTYPWMTS--LTPD-NKTLLFYQFSPSPAVGVVDLEGKAFVRMMDVPDCYH----IF-PTAND  158 (352)
T ss_pred             CcEEeEEccCCCchhhccCccceEE--ECCC-CCEEEEecCCCCCEEEEEECCCCcEEEEEeCCCCcE----EE-EecCC
Confidence            999999998643 10  00011111  0222 4567775  3 7899999999999999988876433    22 24455


Q ss_pred             EEEEEEecCCceeEEEEEEcCCCceeeeeeeec--c--cCc-cCceEEEcCcEEEEEECCCCeEEEEEeecce
Q 003800          190 QIYVVGYAGSSQFHAYQINAMNGELLNHETAAF--S--GGF-VGDVALVSSDTLVTLDTTRSILVTVSFKNRK  257 (794)
Q Consensus       190 ~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~--~--~~~-s~~~~~vg~~~lv~~d~~~g~L~v~~l~sg~  257 (794)
                      .-++.+.+|.  ...+.+| .+|+.. ..+...  +  -.+ ..+.+...++..+|.+.. |.++++|+....
T Consensus       159 ~~~~~~~Dg~--~~~v~~d-~~g~~~-~~~~~vf~~~~~~v~~rP~~~~~dg~~~~vs~e-G~V~~id~~~~~  226 (352)
T TIGR02658       159 TFFMHCRDGS--LAKVGYG-TKGNPK-IKPTEVFHPEDEYLINHPAYSNKSGRLVWPTYT-GKIFQIDLSSGD  226 (352)
T ss_pred             ccEEEeecCc--eEEEEec-CCCceE-EeeeeeecCCccccccCCceEcCCCcEEEEecC-CeEEEEecCCCc
Confidence            5556677764  2334455 356633 222111  1  011 112122224556667654 999999986644


No 28 
>PF02239 Cytochrom_D1:  Cytochrome D1 heme domain; PDB: 1NNO_B 1HZU_A 1N15_B 1N50_A 1GJQ_A 1BL9_B 1NIR_B 1N90_B 1HZV_A 1AOQ_A ....
Probab=97.34  E-value=0.23  Score=55.79  Aligned_cols=190  Identities=12%  Similarity=0.107  Sum_probs=105.8

Q ss_pred             CEEEEEe-CCCEEEEEECcCCccceEEEcCcccceee-eeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccC
Q 003800           54 KRVVVST-EENVIASLDLRHGEIFWRHVLGINDVVDG-IDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSK  131 (794)
Q Consensus        54 ~~Vyv~t-~~g~l~ALn~~tG~ivWR~~l~~~~~i~~-l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~  131 (794)
                      +..||.. +.|.|+.+|.+|.+++-+......  ..+ +....++..+++++.++.|.-+|+.+++++-+.+......  
T Consensus         6 ~l~~V~~~~~~~v~viD~~t~~~~~~i~~~~~--~h~~~~~s~Dgr~~yv~~rdg~vsviD~~~~~~v~~i~~G~~~~--   81 (369)
T PF02239_consen    6 NLFYVVERGSGSVAVIDGATNKVVARIPTGGA--PHAGLKFSPDGRYLYVANRDGTVSVIDLATGKVVATIKVGGNPR--   81 (369)
T ss_dssp             GEEEEEEGGGTEEEEEETTT-SEEEEEE-STT--EEEEEE-TT-SSEEEEEETTSEEEEEETTSSSEEEEEE-SSEEE--
T ss_pred             cEEEEEecCCCEEEEEECCCCeEEEEEcCCCC--ceeEEEecCCCCEEEEEcCCCeEEEEECCcccEEEEEecCCCcc--
Confidence            4455544 579999999999999999877543  221 1112334456666556799999999999999998875432  


Q ss_pred             CccccccccccccCCeEEEE--ECCEEEEEECCCCcEEEEEeccCc-----ceeeeeEEEEecCCEEEEEEecCCceeEE
Q 003800          132 PLLLVPTNLKVDKDSLILVS--SKGCLHAVSSIDGEILWTRDFAAE-----SVEVQQVIQLDESDQIYVVGYAGSSQFHA  204 (794)
Q Consensus       132 ~~~~~~~~~~~~~~~~V~V~--~~g~l~ald~~tG~~~W~~~~~~~-----~~~~~~~v~s~~~~~vyv~~~~g~~~~~v  204 (794)
                      ...+     ..+ ++.+++.  ..+.+..+|.+|.+++=+.+....     ......++ .......|+++....  .++
T Consensus        82 ~i~~-----s~D-G~~~~v~n~~~~~v~v~D~~tle~v~~I~~~~~~~~~~~~Rv~aIv-~s~~~~~fVv~lkd~--~~I  152 (369)
T PF02239_consen   82 GIAV-----SPD-GKYVYVANYEPGTVSVIDAETLEPVKTIPTGGMPVDGPESRVAAIV-ASPGRPEFVVNLKDT--GEI  152 (369)
T ss_dssp             EEEE-------T-TTEEEEEEEETTEEEEEETTT--EEEEEE--EE-TTTS---EEEEE-E-SSSSEEEEEETTT--TEE
T ss_pred             eEEE-----cCC-CCEEEEEecCCCceeEeccccccceeecccccccccccCCCceeEE-ecCCCCEEEEEEccC--CeE
Confidence            1111     122 4567775  389999999999998877654321     11122233 234555576666531  178


Q ss_pred             EEEEcCCCceeeeeeeecccCccCceEEE-cCcEEEEEECCCCeEEEEEeecce
Q 003800          205 YQINAMNGELLNHETAAFSGGFVGDVALV-SSDTLVTLDTTRSILVTVSFKNRK  257 (794)
Q Consensus       205 ~ald~~tG~~~w~~~v~~~~~~s~~~~~v-g~~~lv~~d~~~g~L~v~~l~sg~  257 (794)
                      ..+|..+.+.+....+.....+.+ ..+- .+.+++......+.+.++|.++++
T Consensus       153 ~vVdy~d~~~~~~~~i~~g~~~~D-~~~dpdgry~~va~~~sn~i~viD~~~~k  205 (369)
T PF02239_consen  153 WVVDYSDPKNLKVTTIKVGRFPHD-GGFDPDGRYFLVAANGSNKIAVIDTKTGK  205 (369)
T ss_dssp             EEEETTTSSCEEEEEEE--TTEEE-EEE-TTSSEEEEEEGGGTEEEEEETTTTE
T ss_pred             EEEEeccccccceeeecccccccc-cccCcccceeeecccccceeEEEeeccce
Confidence            888988877766555544332222 1111 122332222233456666666654


No 29 
>PF10282 Lactonase:  Lactonase, 7-bladed beta-propeller;  InterPro: IPR019405  6-phosphogluconolactonases (6PGL) 3.1.1.31 from EC, which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate is one of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE IPR022528 from INTERPRO [] and the Pseudomonas aeruginosa DevB IPR005900 from INTERPRO [] types.  This entry contains bacterial 6-phosphogluconolactonases (6PGL) YbhE-type 3.1.1.31 from EC which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase 5.5.1.5 from EC and muconate cycloisomerase 5.5.1.1 from EC, which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold [].; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H.
Probab=96.91  E-value=1  Score=50.08  Aligned_cols=222  Identities=14%  Similarity=0.207  Sum_probs=115.7

Q ss_pred             ecccccEeeEEeccCceeeeeeeeeccCCCEEEEEeCC----CEEEEE--ECcCCccceEEEcCccccee-eeeeeeCCE
Q 003800           25 EDQVGLMDWHQQYIGKVKHAVFHTQKTGRKRVVVSTEE----NVIASL--DLRHGEIFWRHVLGINDVVD-GIDIALGKY   97 (794)
Q Consensus        25 edqvG~~dW~~~~vG~~~~~~f~~~~~~~~~Vyv~t~~----g~l~AL--n~~tG~ivWR~~l~~~~~i~-~l~~~~g~~   97 (794)
                      .++.|++...+.. .....+.+.....+++.+|++++.    |.|.++  +.++|+..-.......+... .+.+...+.
T Consensus        21 d~~~g~l~~~~~~-~~~~~Ps~l~~~~~~~~LY~~~e~~~~~g~v~~~~i~~~~g~L~~~~~~~~~g~~p~~i~~~~~g~   99 (345)
T PF10282_consen   21 DEETGTLTLVQTV-AEGENPSWLAVSPDGRRLYVVNEGSGDSGGVSSYRIDPDTGTLTLLNSVPSGGSSPCHIAVDPDGR   99 (345)
T ss_dssp             ETTTTEEEEEEEE-EESSSECCEEE-TTSSEEEEEETTSSTTTEEEEEEEETTTTEEEEEEEEEESSSCEEEEEECTTSS
T ss_pred             cCCCCCceEeeee-cCCCCCceEEEEeCCCEEEEEEccccCCCCEEEEEECCCcceeEEeeeeccCCCCcEEEEEecCCC
Confidence            3455665544432 111123333334568889999984    566555  55557776665554322221 221223566


Q ss_pred             EEEEEc-cCCeEEEEeCCC-CcEeEEEecc-----Cccc-----cCCccccccccccccCCeEEEEE--CCEEEEEECCC
Q 003800           98 VITLSS-DGSTLRAWNLPD-GQMVWESFLR-----GSKH-----SKPLLLVPTNLKVDKDSLILVSS--KGCLHAVSSID  163 (794)
Q Consensus        98 ~V~Vs~-~g~~v~A~d~~t-G~llWe~~l~-----~~~~-----s~~~~~~~~~~~~~~~~~V~V~~--~g~l~ald~~t  163 (794)
                      .++++. .++.+..++..+ |++.-.....     ++..     +.+-.+..   ..+ ++.++|..  ..+|+.++...
T Consensus       100 ~l~vany~~g~v~v~~l~~~g~l~~~~~~~~~~g~g~~~~rq~~~h~H~v~~---~pd-g~~v~v~dlG~D~v~~~~~~~  175 (345)
T PF10282_consen  100 FLYVANYGGGSVSVFPLDDDGSLGEVVQTVRHEGSGPNPDRQEGPHPHQVVF---SPD-GRFVYVPDLGADRVYVYDIDD  175 (345)
T ss_dssp             EEEEEETTTTEEEEEEECTTSEEEEEEEEEESEEEESSTTTTSSTCEEEEEE----TT-SSEEEEEETTTTEEEEEEE-T
T ss_pred             EEEEEEccCCeEEEEEccCCcccceeeeecccCCCCCcccccccccceeEEE---CCC-CCEEEEEecCCCEEEEEEEeC
Confidence            777665 367898888875 8777664321     1110     00001110   112 34566643  55666666554


Q ss_pred             Cc--EEEEE--eccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeeeeeec-ccCccC-----ceEEE
Q 003800          164 GE--ILWTR--DFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHETAAF-SGGFVG-----DVALV  233 (794)
Q Consensus       164 G~--~~W~~--~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~-~~~~s~-----~~~~v  233 (794)
                      +.  ..-..  ..+.+. .|+.+....++..+|++.-. +..+.++.++..+|+......+.. |.+..+     .+.+-
T Consensus       176 ~~~~l~~~~~~~~~~G~-GPRh~~f~pdg~~~Yv~~e~-s~~v~v~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~i~is  253 (345)
T PF10282_consen  176 DTGKLTPVDSIKVPPGS-GPRHLAFSPDGKYAYVVNEL-SNTVSVFDYDPSDGSLTEIQTISTLPEGFTGENAPAEIAIS  253 (345)
T ss_dssp             TS-TEEEEEEEECSTTS-SEEEEEE-TTSSEEEEEETT-TTEEEEEEEETTTTEEEEEEEEESCETTSCSSSSEEEEEE-
T ss_pred             CCceEEEeeccccccCC-CCcEEEEcCCcCEEEEecCC-CCcEEEEeecccCCceeEEEEeeeccccccccCCceeEEEe
Confidence            43  33211  222222 36677655566678887644 445666777766897766555543 333322     22222


Q ss_pred             -cCcEEEEEECCCCeEEEEEe
Q 003800          234 -SSDTLVTLDTTRSILVTVSF  253 (794)
Q Consensus       234 -g~~~lv~~d~~~g~L~v~~l  253 (794)
                       .++.+++.+...+++.+.++
T Consensus       254 pdg~~lyvsnr~~~sI~vf~~  274 (345)
T PF10282_consen  254 PDGRFLYVSNRGSNSISVFDL  274 (345)
T ss_dssp             TTSSEEEEEECTTTEEEEEEE
T ss_pred             cCCCEEEEEeccCCEEEEEEE
Confidence             35577778877788888888


No 30 
>PF13570 PQQ_3:  PQQ-like domain; PDB: 3HXJ_B 3Q54_A.
Probab=96.86  E-value=0.0019  Score=48.07  Aligned_cols=40  Identities=15%  Similarity=0.205  Sum_probs=26.1

Q ss_pred             CccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCC
Q 003800           73 GEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPD  115 (794)
Q Consensus        73 G~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~t  115 (794)
                      |+++|++.++..  +.+. ++..++.||+++.+++++|+|++|
T Consensus         1 G~~~W~~~~~~~--~~~~-~~v~~g~vyv~~~dg~l~ald~~t   40 (40)
T PF13570_consen    1 GKVLWSYDTGGP--IWSS-PAVAGGRVYVGTGDGNLYALDAAT   40 (40)
T ss_dssp             S-EEEEEE-SS-----S---EECTSEEEEE-TTSEEEEEETT-
T ss_pred             CceeEEEECCCC--cCcC-CEEECCEEEEEcCCCEEEEEeCCC
Confidence            899999999764  3333 356677888888778999999975


No 31 
>PTZ00421 coronin; Provisional
Probab=96.83  E-value=0.23  Score=57.94  Aligned_cols=195  Identities=13%  Similarity=0.111  Sum_probs=105.6

Q ss_pred             CCEEEEEeCCCEEEEEECcCCccceE-----EEcCc-ccceeeeeee-eCCEEEEEEccCCeEEEEeCCCCcEeEEEecc
Q 003800           53 RKRVVVSTEENVIASLDLRHGEIFWR-----HVLGI-NDVVDGIDIA-LGKYVITLSSDGSTLRAWNLPDGQMVWESFLR  125 (794)
Q Consensus        53 ~~~Vyv~t~~g~l~ALn~~tG~ivWR-----~~l~~-~~~i~~l~~~-~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~  125 (794)
                      ++.+++++++|.|...|..++.....     ..+.. ...+..+... .++.+++.++.++.|+.||..+|+.+=.....
T Consensus        88 ~~~LaSgS~DgtIkIWdi~~~~~~~~~~~~l~~L~gH~~~V~~l~f~P~~~~iLaSgs~DgtVrIWDl~tg~~~~~l~~h  167 (493)
T PTZ00421         88 PQKLFTASEDGTIMGWGIPEEGLTQNISDPIVHLQGHTKKVGIVSFHPSAMNVLASAGADMVVNVWDVERGKAVEVIKCH  167 (493)
T ss_pred             CCEEEEEeCCCEEEEEecCCCccccccCcceEEecCCCCcEEEEEeCcCCCCEEEEEeCCCEEEEEECCCCeEEEEEcCC
Confidence            55788999999999999877643211     11211 1123333222 23345555566789999999999876555433


Q ss_pred             CccccCCccccccccccccCCeEEE-E-ECCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCCceeE
Q 003800          126 GSKHSKPLLLVPTNLKVDKDSLILV-S-SKGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFH  203 (794)
Q Consensus       126 ~~~~s~~~~~~~~~~~~~~~~~V~V-~-~~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~  203 (794)
                      ...+ ..+       ....++.+++ . .|+.+...|..+|+...+........ ...+......+.++.+++.++....
T Consensus       168 ~~~V-~sl-------a~spdG~lLatgs~Dg~IrIwD~rsg~~v~tl~~H~~~~-~~~~~w~~~~~~ivt~G~s~s~Dr~  238 (493)
T PTZ00421        168 SDQI-TSL-------EWNLDGSLLCTTSKDKKLNIIDPRDGTIVSSVEAHASAK-SQRCLWAKRKDLIITLGCSKSQQRQ  238 (493)
T ss_pred             CCce-EEE-------EEECCCCEEEEecCCCEEEEEECCCCcEEEEEecCCCCc-ceEEEEcCCCCeEEEEecCCCCCCe
Confidence            3222 111       1122344444 3 38999999999999887765433221 1122222344566655654333346


Q ss_pred             EEEEEcCCCceeee-eeeecccCccCceEEE-cCcEEEEEECCCCeEEEEEeecce
Q 003800          204 AYQINAMNGELLNH-ETAAFSGGFVGDVALV-SSDTLVTLDTTRSILVTVSFKNRK  257 (794)
Q Consensus       204 v~ald~~tG~~~w~-~~v~~~~~~s~~~~~v-g~~~lv~~d~~~g~L~v~~l~sg~  257 (794)
                      +..+|..+.....+ ..+...... ..+.+- ..++++......+.+++.++.+++
T Consensus       239 VklWDlr~~~~p~~~~~~d~~~~~-~~~~~d~d~~~L~lggkgDg~Iriwdl~~~~  293 (493)
T PTZ00421        239 IMLWDTRKMASPYSTVDLDQSSAL-FIPFFDEDTNLLYIGSKGEGNIRCFELMNER  293 (493)
T ss_pred             EEEEeCCCCCCceeEeccCCCCce-EEEEEcCCCCEEEEEEeCCCeEEEEEeeCCc
Confidence            77778776543221 111111111 011121 334554444346789999998877


No 32 
>smart00564 PQQ beta-propeller repeat. Beta-propeller repeat occurring in enzymes with pyrrolo-quinoline quinone (PQQ) as cofactor, in Ire1p-like Ser/Thr kinases, and in prokaryotic dehydrogenases.
Probab=96.83  E-value=0.0015  Score=46.13  Aligned_cols=27  Identities=30%  Similarity=0.601  Sum_probs=25.4

Q ss_pred             CCEEEEEeCCCEEEEEECcCCccceEE
Q 003800           53 RKRVVVSTEENVIASLDLRHGEIFWRH   79 (794)
Q Consensus        53 ~~~Vyv~t~~g~l~ALn~~tG~ivWR~   79 (794)
                      ++.+|+++.+|.|.|+|++||+++|++
T Consensus         6 ~~~v~~~~~~g~l~a~d~~~G~~~W~~   32 (33)
T smart00564        6 DGTVYVGSTDGTLYALDAKTGEILWTY   32 (33)
T ss_pred             CCEEEEEcCCCEEEEEEcccCcEEEEc
Confidence            668999999999999999999999985


No 33 
>PF13570 PQQ_3:  PQQ-like domain; PDB: 3HXJ_B 3Q54_A.
Probab=96.80  E-value=0.0025  Score=47.35  Aligned_cols=40  Identities=23%  Similarity=0.355  Sum_probs=27.2

Q ss_pred             ccEeeEEeccCceeeeeeeeeccCCCEEEEEeCCCEEEEEECcC
Q 003800           29 GLMDWHQQYIGKVKHAVFHTQKTGRKRVVVSTEENVIASLDLRH   72 (794)
Q Consensus        29 G~~dW~~~~vG~~~~~~f~~~~~~~~~Vyv~t~~g~l~ALn~~t   72 (794)
                      |+..|++++-|..    ...|...+++||+++.+|.|+|||++|
T Consensus         1 G~~~W~~~~~~~~----~~~~~v~~g~vyv~~~dg~l~ald~~t   40 (40)
T PF13570_consen    1 GKVLWSYDTGGPI----WSSPAVAGGRVYVGTGDGNLYALDAAT   40 (40)
T ss_dssp             S-EEEEEE-SS-------S--EECTSEEEEE-TTSEEEEEETT-
T ss_pred             CceeEEEECCCCc----CcCCEEECCEEEEEcCCCEEEEEeCCC
Confidence            7889999885522    344556699999999999999999986


No 34 
>PF10282 Lactonase:  Lactonase, 7-bladed beta-propeller;  InterPro: IPR019405  6-phosphogluconolactonase (6PGL, EC 3.1.1.31), which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate, is one of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE type (InterPro IPR022528) and the Pseudomonas aeruginosa DevB type (InterPro IPR005900).  This entry contains bacterial YbhE-type 6-phosphogluconolactonases (6PGL, EC 3.1.1.31), which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase (EC 5.5.1.5) and muconate cycloisomerase (EC 5.5.1.1), which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold.; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H.
Probab=96.78  E-value=1.3  Score=49.27  Aligned_cols=195  Identities=10%  Similarity=0.150  Sum_probs=107.9

Q ss_pred             EEEEeCC----CE--EEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEcc----CCeEEEEeCCC--CcEeEEEe
Q 003800           56 VVVSTEE----NV--IASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSD----GSTLRAWNLPD--GQMVWESF  123 (794)
Q Consensus        56 Vyv~t~~----g~--l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~----g~~v~A~d~~t--G~llWe~~  123 (794)
                      +|+++..    +-  ++.+|.++|++--.+..........+.....+..+|+...    .+.|.+|+..+  |++.--..
T Consensus         2 ~~vgsy~~~~~~gI~~~~~d~~~g~l~~~~~~~~~~~Ps~l~~~~~~~~LY~~~e~~~~~g~v~~~~i~~~~g~L~~~~~   81 (345)
T PF10282_consen    2 LYVGSYTNGKGGGIYVFRFDEETGTLTLVQTVAEGENPSWLAVSPDGRRLYVVNEGSGDSGGVSSYRIDPDTGTLTLLNS   81 (345)
T ss_dssp             EEEEECCSSSSTEEEEEEEETTTTEEEEEEEEEESSSECCEEE-TTSSEEEEEETTSSTTTEEEEEEEETTTTEEEEEEE
T ss_pred             EEEEcCCCCCCCcEEEEEEcCCCCCceEeeeecCCCCCceEEEEeCCCEEEEEEccccCCCCEEEEEECCCcceeEEeee
Confidence            5677765    33  5566779998877766543332333322335566666432    45777766553  88877666


Q ss_pred             ccCccccCCcccccccccccc-CCeEEEEE--CCEEEEEECCC-CcEEEE-----Eec--cCcc----eeeeeEEEEecC
Q 003800          124 LRGSKHSKPLLLVPTNLKVDK-DSLILVSS--KGCLHAVSSID-GEILWT-----RDF--AAES----VEVQQVIQLDES  188 (794)
Q Consensus       124 l~~~~~s~~~~~~~~~~~~~~-~~~V~V~~--~g~l~ald~~t-G~~~W~-----~~~--~~~~----~~~~~~v~s~~~  188 (794)
                      .....  ..+..+    ..+. ++.+++..  +|.+..++..+ |++.-.     ...  +.+.    .-+-++....++
T Consensus        82 ~~~~g--~~p~~i----~~~~~g~~l~vany~~g~v~v~~l~~~g~l~~~~~~~~~~g~g~~~~rq~~~h~H~v~~~pdg  155 (345)
T PF10282_consen   82 VPSGG--SSPCHI----AVDPDGRFLYVANYGGGSVSVFPLDDDGSLGEVVQTVRHEGSGPNPDRQEGPHPHQVVFSPDG  155 (345)
T ss_dssp             EEESS--SCEEEE----EECTTSSEEEEEETTTTEEEEEEECTTSEEEEEEEEEESEEEESSTTTTSSTCEEEEEE-TTS
T ss_pred             eccCC--CCcEEE----EEecCCCEEEEEEccCCeEEEEEccCCcccceeeeecccCCCCCcccccccccceeEEECCCC
Confidence            54221  122222    2222 45677763  88888888764 765433     211  1110    112334333345


Q ss_pred             CEEEEEEecCCceeEEEEEEcCCCceeeeeeeecccCccCceEEE--cCcEEEEEECCCCeEEEEEee--cce
Q 003800          189 DQIYVVGYAGSSQFHAYQINAMNGELLNHETAAFSGGFVGDVALV--SSDTLVTLDTTRSILVTVSFK--NRK  257 (794)
Q Consensus       189 ~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~~~~s~~~~~v--g~~~lv~~d~~~g~L~v~~l~--sg~  257 (794)
                      ..+|+. .-|...+.++.+|..+|+......+..+.+-....+..  .++++++++...+.+.++++.  +|+
T Consensus       156 ~~v~v~-dlG~D~v~~~~~~~~~~~l~~~~~~~~~~G~GPRh~~f~pdg~~~Yv~~e~s~~v~v~~~~~~~g~  227 (345)
T PF10282_consen  156 RFVYVP-DLGADRVYVYDIDDDTGKLTPVDSIKVPPGSGPRHLAFSPDGKYAYVVNELSNTVSVFDYDPSDGS  227 (345)
T ss_dssp             SEEEEE-ETTTTEEEEEEE-TTS-TEEEEEEEECSTTSSEEEEEE-TTSSEEEEEETTTTEEEEEEEETTTTE
T ss_pred             CEEEEE-ecCCCEEEEEEEeCCCceEEEeeccccccCCCCcEEEEcCCcCEEEEecCCCCcEEEEeecccCCc
Confidence            567764 45667788888888888766545454443322222322  445777888788899999998  565


No 35 
>TIGR02658 TTQ_MADH_Hv methylamine dehydrogenase heavy chain. This family consists of the heavy chain of methylamine dehydrogenase, a periplasmic enzyme. The enzyme contains a tryptophan tryptophylquinone (TTQ) prosthetic group derived from two Trp residues in the light subunit. The enzyme forms a complex with the type I blue copper protein amicyanin and a cytochrome. Electron transfer proceeds from TTQ to the copper and then to the heme group of the cytochrome.
Probab=96.77  E-value=0.8  Score=50.96  Aligned_cols=79  Identities=5%  Similarity=-0.052  Sum_probs=59.9

Q ss_pred             eccCCCEEEEEeC----------CCEEEEEECcCCccceEEEcCcccc-----ee-eeeeeeCCEEEEEEc-c-CCeEEE
Q 003800           49 QKTGRKRVVVSTE----------ENVIASLDLRHGEIFWRHVLGINDV-----VD-GIDIALGKYVITLSS-D-GSTLRA  110 (794)
Q Consensus        49 ~~~~~~~Vyv~t~----------~g~l~ALn~~tG~ivWR~~l~~~~~-----i~-~l~~~~g~~~V~Vs~-~-g~~v~A  110 (794)
                      .+.+++.+|+++.          .+.|..+|++|++++.+..++....     .. .+.+..++..++|+. . .+.|..
T Consensus        53 ~spDg~~lyva~~~~~R~~~G~~~d~V~v~D~~t~~~~~~i~~p~~p~~~~~~~~~~~~ls~dgk~l~V~n~~p~~~V~V  132 (352)
T TIGR02658        53 VASDGSFFAHASTVYSRIARGKRTDYVEVIDPQTHLPIADIELPEGPRFLVGTYPWMTSLTPDNKTLLFYQFSPSPAVGV  132 (352)
T ss_pred             ECCCCCEEEEEeccccccccCCCCCEEEEEECccCcEEeEEccCCCchhhccCccceEEECCCCCEEEEecCCCCCEEEE
Confidence            4556778998776          7899999999999999999865511     01 222345666777765 3 578999


Q ss_pred             EeCCCCcEeEEEeccCc
Q 003800          111 WNLPDGQMVWESFLRGS  127 (794)
Q Consensus       111 ~d~~tG~llWe~~l~~~  127 (794)
                      +|.++|+.+=+....+.
T Consensus       133 vD~~~~kvv~ei~vp~~  149 (352)
T TIGR02658       133 VDLEGKAFVRMMDVPDC  149 (352)
T ss_pred             EECCCCcEEEEEeCCCC
Confidence            99999999999998653


No 36 
>PF01011 PQQ:  PQQ enzyme repeat family.;  InterPro: IPR002372 Pyrrolo-quinoline quinone (PQQ) is a redox coenzyme, which serves as a cofactor for a number of enzymes (quinoproteins) and particularly for some bacterial dehydrogenases. A number of bacterial quinoproteins belong to this family. Enzymes in this group have repeats of a beta propeller.; PDB: 1H4I_C 1H4J_E 1W6S_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A 1G72_A ....
Probab=96.66  E-value=0.0039  Score=45.87  Aligned_cols=31  Identities=13%  Similarity=0.232  Sum_probs=26.6

Q ss_pred             EEEEEccCCeEEEEeCCCCcEeEEEeccCcc
Q 003800           98 VITLSSDGSTLRAWNLPDGQMVWESFLRGSK  128 (794)
Q Consensus        98 ~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~  128 (794)
                      .|+++..++.++|+|+.||+++|+++.....
T Consensus         2 ~v~~~~~~g~l~AlD~~TG~~~W~~~~~~~~   32 (38)
T PF01011_consen    2 RVYVGTPDGYLYALDAKTGKVLWKFQTGPPV   32 (38)
T ss_dssp             EEEEETTTSEEEEEETTTTSEEEEEESSSGG
T ss_pred             EEEEeCCCCEEEEEECCCCCEEEeeeCCCCC
Confidence            5677777789999999999999999987654


No 37 
>KOG0296 consensus Angio-associated migratory cell protein (contains WD40 repeats) [Function unknown]
Probab=96.43  E-value=0.17  Score=54.89  Aligned_cols=156  Identities=13%  Similarity=0.125  Sum_probs=97.9

Q ss_pred             cCCCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccc-
Q 003800           51 TGRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKH-  129 (794)
Q Consensus        51 ~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~-  129 (794)
                      ++++|++++.++|.|.+.|++||+++-+..-.+.............-+++-+..++.+...+..+|+++--..-..+.+ 
T Consensus       200 pdGKr~~tgy~dgti~~Wn~ktg~p~~~~~~~e~~~~~~~~~~~~~~~~~~g~~e~~~~~~~~~sgKVv~~~n~~~~~l~  279 (399)
T KOG0296|consen  200 PDGKRILTGYDDGTIIVWNPKTGQPLHKITQAEGLELPCISLNLAGSTLTKGNSEGVACGVNNGSGKVVNCNNGTVPELK  279 (399)
T ss_pred             CCCceEEEEecCceEEEEecCCCceeEEecccccCcCCccccccccceeEeccCCccEEEEccccceEEEecCCCCcccc
Confidence            3588899999999999999999999988764443222222222333344434456788888888999887766322211 


Q ss_pred             ------cCCccccccccccccCCe-EEE-EE-CCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCCc
Q 003800          130 ------SKPLLLVPTNLKVDKDSL-ILV-SS-KGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSS  200 (794)
Q Consensus       130 ------s~~~~~~~~~~~~~~~~~-V~V-~~-~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~  200 (794)
                            .......+.     .... +.. .+ +|++.-+|.+.-+++-..+.+.+.   .++.. .....+|..+.+|  
T Consensus       280 ~~~e~~~esve~~~~-----ss~lpL~A~G~vdG~i~iyD~a~~~~R~~c~he~~V---~~l~w-~~t~~l~t~c~~g--  348 (399)
T KOG0296|consen  280 PSQEELDESVESIPS-----SSKLPLAACGSVDGTIAIYDLAASTLRHICEHEDGV---TKLKW-LNTDYLLTACANG--  348 (399)
T ss_pred             ccchhhhhhhhhccc-----ccccchhhcccccceEEEEecccchhheeccCCCce---EEEEE-cCcchheeeccCc--
Confidence                  011111110     0111 111 22 888888888766666555555542   23332 1256778777777  


Q ss_pred             eeEEEEEEcCCCceeeeee
Q 003800          201 QFHAYQINAMNGELLNHET  219 (794)
Q Consensus       201 ~~~v~ald~~tG~~~w~~~  219 (794)
                        +|..+|+.||+.+..++
T Consensus       349 --~v~~wDaRtG~l~~~y~  365 (399)
T KOG0296|consen  349 --KVRQWDARTGQLKFTYT  365 (399)
T ss_pred             --eEEeeeccccceEEEEe
Confidence              89999999999998885


No 38 
>KOG2103 consensus Uncharacterized conserved protein [Function unknown]
Probab=96.41  E-value=0.089  Score=62.25  Aligned_cols=192  Identities=16%  Similarity=0.185  Sum_probs=111.7

Q ss_pred             cccccEeeEEeccCceeeeeeeeeccCCCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccC
Q 003800           26 DQVGLMDWHQQYIGKVKHAVFHTQKTGRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDG  105 (794)
Q Consensus        26 dqvG~~dW~~~~vG~~~~~~f~~~~~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g  105 (794)
                      -..|.+-|||.+-+++...  +-+-    .-++..+.-.+.+-|.++|...|...+..+.....+  ....++.++++  
T Consensus        64 ~~tGei~WRqvl~~~~~~~--~~~~----~~~iS~dg~~lr~wn~~~g~l~~~i~l~~g~~~~~~--~v~~~i~v~~g--  133 (910)
T KOG2103|consen   64 LRTGEIIWRQVLEPKTSGL--GVPL----TNTISVDGRYLRSWNTNNGILDWEIELADGFKGLLL--EVNKGIAVLNG--  133 (910)
T ss_pred             ccCCcEEEEEeccCCCccc--Ccce----eEEEccCCcEEEeecCCCceeeeecccccccceeEE--EEccceEEEcc--
Confidence            4478999999774443322  1111    125555566799999999999999988766222233  34444445444  


Q ss_pred             CeEEEEeCCCCcEeEEEeccCccc--cCCccccccccccccCCeEEEE-----ECCEEEEEECCCCcEE-EEEeccCcce
Q 003800          106 STLRAWNLPDGQMVWESFLRGSKH--SKPLLLVPTNLKVDKDSLILVS-----SKGCLHAVSSIDGEIL-WTRDFAAESV  177 (794)
Q Consensus       106 ~~v~A~d~~tG~llWe~~l~~~~~--s~~~~~~~~~~~~~~~~~V~V~-----~~g~l~ald~~tG~~~-W~~~~~~~~~  177 (794)
                           |....|.+.|+..+.....  .+++.+.+       .+.++++     ++..+++++..+|++. |+...-.|+.
T Consensus       134 -----~~~~~g~l~w~~~~~~~~~~~~q~~~~~~-------t~vvy~~~~l~~s~~~V~~~~~~~g~v~~~~~~v~~pw~  201 (910)
T KOG2103|consen  134 -----HTRKFGELKWVESFSISIEEDLQDAKIYG-------TDVVYVLGLLKRSGSCVQQVFSDDGEVTGPQSTVLGPWF  201 (910)
T ss_pred             -----eeccccceeehhhccccchhHHHHhhhcc-------CcEEEEEEEEecCCceEEEEEccCCcEecceeeeecCcc
Confidence                 7899999999998875432  01122221       3444443     2668999999999988 9888777776


Q ss_pred             eeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeee-eeecccCccCceEEE-cC--cEEEEEECCCC
Q 003800          178 EVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHE-TAAFSGGFVGDVALV-SS--DTLVTLDTTRS  246 (794)
Q Consensus       178 ~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~-~v~~~~~~s~~~~~v-g~--~~lv~~d~~~g  246 (794)
                      .+..|  + .+..++.++..|    .+..+|...++.--.+ ....-..+.+..+++ |+  ++++|+++.++
T Consensus       202 ~~~~c--~-~~k~~vl~~s~g----~l~s~di~~~~~~~~q~~~e~l~~l~g~~i~~~g~~~~~~V~V~s~~~  267 (910)
T KOG2103|consen  202 KVLSC--S-TDKEVVLVCSNG----TLISLDISSQKVQISQLLAEILLPLTGDLILLDGNKHTAMVSVNSSSN  267 (910)
T ss_pred             ccccc--c-cccceEEEcCCC----CeEEEEEEeeccchhhhhhhhhhccCCceEEecCCCceeEEEEecCCC
Confidence            55444  2 233444456666    3555555433322111 111112334444444 32  37888886433


No 39 
>KOG2048 consensus WD40 repeat protein [General function prediction only]
Probab=96.29  E-value=1.1  Score=52.28  Aligned_cols=188  Identities=13%  Similarity=0.163  Sum_probs=118.4

Q ss_pred             CCEEEEEeCCCEEEEEECcCCccceEEEcCc----ccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCcc
Q 003800           53 RKRVVVSTEENVIASLDLRHGEIFWRHVLGI----NDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSK  128 (794)
Q Consensus        53 ~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~----~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~  128 (794)
                      ++.+-++-.+|.|---|++.+   |-+...-    ...+.++.-+.++.+..++. .+.+.-||+.+|+.+-+....+..
T Consensus        37 S~~lAvsRt~g~IEiwN~~~~---w~~~~vi~g~~drsIE~L~W~e~~RLFS~g~-sg~i~EwDl~~lk~~~~~d~~gg~  112 (691)
T KOG2048|consen   37 SNQLAVSRTDGNIEIWNLSNN---WFLEPVIHGPEDRSIESLAWAEGGRLFSSGL-SGSITEWDLHTLKQKYNIDSNGGA  112 (691)
T ss_pred             CCceeeeccCCcEEEEccCCC---ceeeEEEecCCCCceeeEEEccCCeEEeecC-CceEEEEecccCceeEEecCCCcc
Confidence            555666666788888888874   8776532    22455552223444444344 569999999999999988876653


Q ss_pred             ccCCccccccccccccCCeEEEE-ECCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEE
Q 003800          129 HSKPLLLVPTNLKVDKDSLILVS-SKGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQI  207 (794)
Q Consensus       129 ~s~~~~~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~al  207 (794)
                      .    +.+.   .-.....+.|. .+|.++-++...|+...+..++........+.-..++-+++..+.+|    .+.+.
T Consensus       113 I----Wsia---i~p~~~~l~IgcddGvl~~~s~~p~~I~~~r~l~rq~sRvLslsw~~~~~~i~~Gs~Dg----~Iriw  181 (691)
T KOG2048|consen  113 I----WSIA---INPENTILAIGCDDGVLYDFSIGPDKITYKRSLMRQKSRVLSLSWNPTGTKIAGGSIDG----VIRIW  181 (691)
T ss_pred             e----eEEE---eCCccceEEeecCCceEEEEecCCceEEEEeecccccceEEEEEecCCccEEEecccCc----eEEEE
Confidence            3    3332   11113455566 48899999999999888877765432122222112233355444444    89999


Q ss_pred             EcCCCceeeeeeeecccCccC-------ceEEEcCcEEEEEECCCCeEEEEEeecce
Q 003800          208 NAMNGELLNHETAAFSGGFVG-------DVALVSSDTLVTLDTTRSILVTVSFKNRK  257 (794)
Q Consensus       208 d~~tG~~~w~~~v~~~~~~s~-------~~~~vg~~~lv~~d~~~g~L~v~~l~sg~  257 (794)
                      |+++|+.+.-.+.... .+..       ++.++..+.++|.|+ +|.+..=|-..|+
T Consensus       182 d~~~~~t~~~~~~~~d-~l~k~~~~iVWSv~~Lrd~tI~sgDS-~G~V~FWd~~~gT  236 (691)
T KOG2048|consen  182 DVKSGQTLHIITMQLD-RLSKREPTIVWSVLFLRDSTIASGDS-AGTVTFWDSIFGT  236 (691)
T ss_pred             EcCCCceEEEeeeccc-ccccCCceEEEEEEEeecCcEEEecC-CceEEEEcccCcc
Confidence            9999998872222111 1111       344567889999995 6988888888887


No 40 
>KOG1539 consensus WD repeat protein [General function prediction only]
Probab=96.25  E-value=1.7  Score=51.89  Aligned_cols=186  Identities=14%  Similarity=0.129  Sum_probs=105.8

Q ss_pred             CCEEEEEeCCCEEEEEECcCC-ccceEE--EcCccc-cee-eeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCc
Q 003800           53 RKRVVVSTEENVIASLDLRHG-EIFWRH--VLGIND-VVD-GIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGS  127 (794)
Q Consensus        53 ~~~Vyv~t~~g~l~ALn~~tG-~ivWR~--~l~~~~-~i~-~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~  127 (794)
                      +..++..+.+|.+.-.+..++ +++--+  .++..+ .|. .+++..-=+-++++..+|.+.-||..+|+++.+++....
T Consensus       124 Ge~lia~d~~~~l~vw~~s~~~~e~~l~~~~~~~~~~~Ital~HP~TYLNKIvvGs~~G~lql~Nvrt~K~v~~f~~~~s  203 (910)
T KOG1539|consen  124 GEHLIAVDISNILFVWKTSSIQEELYLQSTFLKVEGDFITALLHPSTYLNKIVVGSSQGRLQLWNVRTGKVVYTFQEFFS  203 (910)
T ss_pred             cceEEEEEccCcEEEEEeccccccccccceeeeccCCceeeEecchhheeeEEEeecCCcEEEEEeccCcEEEEeccccc
Confidence            345666666666666665554 221111  000011 122 122222223345555567999999999999999987654


Q ss_pred             cccCCccccccccccccCCeEEEE-ECCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEE
Q 003800          128 KHSKPLLLVPTNLKVDKDSLILVS-SKGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQ  206 (794)
Q Consensus       128 ~~s~~~~~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~a  206 (794)
                      .. ..  +.+   +++ -+.|.+. .+|++.-+|.+.|+.+-+++.+.+..  ..+.-..++..+-+.+-.   .+.+.-
T Consensus       204 ~I-T~--ieq---sPa-LDVVaiG~~~G~ViifNlK~dkil~sFk~d~g~V--tslSFrtDG~p~las~~~---~G~m~~  271 (910)
T KOG1539|consen  204 RI-TA--IEQ---SPA-LDVVAIGLENGTVIIFNLKFDKILMSFKQDWGRV--TSLSFRTDGNPLLASGRS---NGDMAF  271 (910)
T ss_pred             ce-eE--ecc---CCc-ceEEEEeccCceEEEEEcccCcEEEEEEccccce--eEEEeccCCCeeEEeccC---CceEEE
Confidence            33 11  111   111 1233343 49999999999999999998874332  122212345555554433   237888


Q ss_pred             EEcCCCceeeeeeeecccCccCceEEEcCcEEEEEECCCCeEEEE
Q 003800          207 INAMNGELLNHETAAFSGGFVGDVALVSSDTLVTLDTTRSILVTV  251 (794)
Q Consensus       207 ld~~tG~~~w~~~v~~~~~~s~~~~~vg~~~lv~~d~~~g~L~v~  251 (794)
                      .|+..-+.+|+.+-+...++.+...+.|..+++...++ .+|++-
T Consensus       272 wDLe~kkl~~v~~nah~~sv~~~~fl~~epVl~ta~~D-nSlk~~  315 (910)
T KOG1539|consen  272 WDLEKKKLINVTRNAHYGSVTGATFLPGEPVLVTAGAD-NSLKVW  315 (910)
T ss_pred             EEcCCCeeeeeeeccccCCcccceecCCCceEeeccCC-CceeEE
Confidence            99988888888874444555555555566666655443 444433


No 41 
>PF05935 Arylsulfotrans:  Arylsulfotransferase (ASST);  InterPro: IPR010262 This family consists of several bacterial arylsulphotransferase proteins. Arylsulphotransferase (ASST) transfers a sulphate group from phenolic sulphate esters to a phenolic acceptor substrate [].; PDB: 3ETT_B 3ELQ_A 3ETS_A.
Probab=96.09  E-value=0.24  Score=57.59  Aligned_cols=151  Identities=18%  Similarity=0.236  Sum_probs=80.4

Q ss_pred             CCEEEEEeC-----CCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCc
Q 003800           53 RKRVVVSTE-----ENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGS  127 (794)
Q Consensus        53 ~~~Vyv~t~-----~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~  127 (794)
                      .+.+|+.+.     ....+++| .+|.++|...+....... +....++.+++.++  ..++.+|. .|+++|++.+...
T Consensus       113 ~~gl~~~~~~~~~~~~~~~~iD-~~G~Vrw~~~~~~~~~~~-~~~l~nG~ll~~~~--~~~~e~D~-~G~v~~~~~l~~~  187 (477)
T PF05935_consen  113 EDGLYFVNGNDWDSSSYTYLID-NNGDVRWYLPLDSGSDNS-FKQLPNGNLLIGSG--NRLYEIDL-LGKVIWEYDLPGG  187 (477)
T ss_dssp             TT-EEEEEETT--BEEEEEEEE-TTS-EEEEE-GGGT--SS-EEE-TTS-EEEEEB--TEEEEE-T-T--EEEEEE--TT
T ss_pred             CCcEEEEeCCCCCCCceEEEEC-CCccEEEEEccCccccce-eeEcCCCCEEEecC--CceEEEcC-CCCEEEeeecCCc
Confidence            555777666     67899999 589999999887653211 21223344444333  68999998 6999999999874


Q ss_pred             c--ccCCccccccccccccCCeEEEE-E--------------CCEEEEEECCCCcEEEEEeccCcc---ee---------
Q 003800          128 K--HSKPLLLVPTNLKVDKDSLILVS-S--------------KGCLHAVSSIDGEILWTRDFAAES---VE---------  178 (794)
Q Consensus       128 ~--~s~~~~~~~~~~~~~~~~~V~V~-~--------------~g~l~ald~~tG~~~W~~~~~~~~---~~---------  178 (794)
                      .  ..-+....+       ++.++++ .              ...+.-+| .+|+++|+|+....-   ..         
T Consensus       188 ~~~~HHD~~~l~-------nGn~L~l~~~~~~~~~~~~~~~~~D~Ivevd-~tG~vv~~wd~~d~ld~~~~~~~~~~~~~  259 (477)
T PF05935_consen  188 YYDFHHDIDELP-------NGNLLILASETKYVDEDKDVDTVEDVIVEVD-PTGEVVWEWDFFDHLDPYRDTVLKPYPYG  259 (477)
T ss_dssp             EE-B-S-EEE-T-------TS-EEEEEEETTEE-TS-EE---S-EEEEE--TTS-EEEEEEGGGTS-TT--TTGGT--SS
T ss_pred             ccccccccEECC-------CCCEEEEEeecccccCCCCccEecCEEEEEC-CCCCEEEEEehHHhCCccccccccccccc
Confidence            3  111222222       3444442 3              45799999 999999999774311   00         


Q ss_pred             ----------ee---eEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeeee
Q 003800          179 ----------VQ---QVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHET  219 (794)
Q Consensus       179 ----------~~---~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~  219 (794)
                                ..   .+.+...++.+.+.+-.-+   .|..+|..||++.|-.-
T Consensus       260 ~~~~~~~~~DW~H~Nsi~yd~~dd~iivSsR~~s---~V~~Id~~t~~i~Wilg  310 (477)
T PF05935_consen  260 DISGSGGGRDWLHINSIDYDPSDDSIIVSSRHQS---AVIKIDYRTGKIKWILG  310 (477)
T ss_dssp             SSS-SSTTSBS--EEEEEEETTTTEEEEEETTT----EEEEEE-TTS-EEEEES
T ss_pred             ccccCCCCCCccccCccEEeCCCCeEEEEcCcce---EEEEEECCCCcEEEEeC
Confidence                      00   0111123566665443222   68999999999999873


No 42 
>PTZ00420 coronin; Provisional
Probab=96.03  E-value=2.4  Score=50.28  Aligned_cols=191  Identities=12%  Similarity=0.086  Sum_probs=103.3

Q ss_pred             CCEEEEEeCCCEEEEEECcCCccce------EEEcCc-ccceeeeeee-eCCEEEEEEccCCeEEEEeCCCCcEeEEEec
Q 003800           53 RKRVVVSTEENVIASLDLRHGEIFW------RHVLGI-NDVVDGIDIA-LGKYVITLSSDGSTLRAWNLPDGQMVWESFL  124 (794)
Q Consensus        53 ~~~Vyv~t~~g~l~ALn~~tG~ivW------R~~l~~-~~~i~~l~~~-~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l  124 (794)
                      ++.++.++.+|.|.-.|..++...-      ...+.. ...+..+... .+..+++.++.++.++.||..+|+.+++...
T Consensus        87 ~~lLASgS~DgtIrIWDi~t~~~~~~~i~~p~~~L~gH~~~V~sVaf~P~g~~iLaSgS~DgtIrIWDl~tg~~~~~i~~  166 (568)
T PTZ00420         87 SEILASGSEDLTIRVWEIPHNDESVKEIKDPQCILKGHKKKISIIDWNPMNYYIMCSSGFDSFVNIWDIENEKRAFQINM  166 (568)
T ss_pred             CCEEEEEeCCCeEEEEECCCCCccccccccceEEeecCCCcEEEEEECCCCCeEEEEEeCCCeEEEEECCCCcEEEEEec
Confidence            4578889999999999988764311      111211 1223332212 2444444455568999999999999888764


Q ss_pred             cCccccCCccccccccccccCCeEEEE-E-CCEEEEEECCCCcEEEEEeccCcceeeeeEEE----EecCCEEEEEEecC
Q 003800          125 RGSKHSKPLLLVPTNLKVDKDSLILVS-S-KGCLHAVSSIDGEILWTRDFAAESVEVQQVIQ----LDESDQIYVVGYAG  198 (794)
Q Consensus       125 ~~~~~s~~~~~~~~~~~~~~~~~V~V~-~-~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~----s~~~~~vyv~~~~g  198 (794)
                      .....  .+       ....++.+++. + ++.+...|..+|+.+-++....... ....+.    +..++.+...++++
T Consensus       167 ~~~V~--Sl-------swspdG~lLat~s~D~~IrIwD~Rsg~~i~tl~gH~g~~-~s~~v~~~~fs~d~~~IlTtG~d~  236 (568)
T PTZ00420        167 PKKLS--SL-------KWNIKGNLLSGTCVGKHMHIIDPRKQEIASSFHIHDGGK-NTKNIWIDGLGGDDNYILSTGFSK  236 (568)
T ss_pred             CCcEE--EE-------EECCCCCEEEEEecCCEEEEEECCCCcEEEEEecccCCc-eeEEEEeeeEcCCCCEEEEEEcCC
Confidence            33221  11       12224555554 3 8899999999999876655433221 111111    12334555555554


Q ss_pred             CceeEEEEEEcCC-CceeeeeeeecccCccCce-EEE--c-CcEEEEEECCCCeEEEEEeecce
Q 003800          199 SSQFHAYQINAMN-GELLNHETAAFSGGFVGDV-ALV--S-SDTLVTLDTTRSILVTVSFKNRK  257 (794)
Q Consensus       199 ~~~~~v~ald~~t-G~~~w~~~v~~~~~~s~~~-~~v--g-~~~lv~~d~~~g~L~v~~l~sg~  257 (794)
                      .....+.-.|+.+ ++++-...+....   +.+ .+.  . +.++++.. ..+.+++.++..+.
T Consensus       237 ~~~R~VkLWDlr~~~~pl~~~~ld~~~---~~L~p~~D~~tg~l~lsGk-GD~tIr~~e~~~~~  296 (568)
T PTZ00420        237 NNMREMKLWDLKNTTSALVTMSIDNAS---APLIPHYDESTGLIYLIGK-GDGNCRYYQHSLGS  296 (568)
T ss_pred             CCccEEEEEECCCCCCceEEEEecCCc---cceEEeeeCCCCCEEEEEE-CCCeEEEEEccCCc
Confidence            3323566677774 5555443221111   111 111  1 23455553 45778888877665


No 43 
>smart00564 PQQ beta-propeller repeat. Beta-propeller repeat occurring in enzymes with pyrrolo-quinoline quinone (PQQ) as cofactor, in Ire1p-like Ser/Thr kinases, and in prokaryotic dehydrogenases.
Probab=95.87  E-value=0.014  Score=41.12  Aligned_cols=29  Identities=24%  Similarity=0.445  Sum_probs=24.0

Q ss_pred             eCCEEEEEEccCCeEEEEeCCCCcEeEEE
Q 003800           94 LGKYVITLSSDGSTLRAWNLPDGQMVWES  122 (794)
Q Consensus        94 ~g~~~V~Vs~~g~~v~A~d~~tG~llWe~  122 (794)
                      ..++.+++++.++.++|+|+++|+++|+.
T Consensus         4 ~~~~~v~~~~~~g~l~a~d~~~G~~~W~~   32 (33)
T smart00564        4 LSDGTVYVGSTDGTLYALDAKTGEILWTY   32 (33)
T ss_pred             EECCEEEEEcCCCEEEEEEcccCcEEEEc
Confidence            34556777776789999999999999986


No 44 
>KOG2055 consensus WD40 repeat protein [General function prediction only]
Probab=95.85  E-value=1.3  Score=49.65  Aligned_cols=199  Identities=16%  Similarity=0.165  Sum_probs=112.5

Q ss_pred             CceeeeeeeeeccCCCEEEEEeCCC--EEEEEECcCCccceEEEcCcccceeeeee-eeCCEEEEEEccCCeEEEEeCCC
Q 003800           39 GKVKHAVFHTQKTGRKRVVVSTEEN--VIASLDLRHGEIFWRHVLGINDVVDGIDI-ALGKYVITLSSDGSTLRAWNLPD  115 (794)
Q Consensus        39 G~~~~~~f~~~~~~~~~Vyv~t~~g--~l~ALn~~tG~ivWR~~l~~~~~i~~l~~-~~g~~~V~Vs~~g~~v~A~d~~t  115 (794)
                      |......||.   ....+.|+.-+|  .|+.+|-++-..+=...|+.- +|..... ..|...++.++....++.||..+
T Consensus       214 ~~I~sv~FHp---~~plllvaG~d~~lrifqvDGk~N~~lqS~~l~~f-Pi~~a~f~p~G~~~i~~s~rrky~ysyDle~  289 (514)
T KOG2055|consen  214 GGITSVQFHP---TAPLLLVAGLDGTLRIFQVDGKVNPKLQSIHLEKF-PIQKAEFAPNGHSVIFTSGRRKYLYSYDLET  289 (514)
T ss_pred             CCceEEEecC---CCceEEEecCCCcEEEEEecCccChhheeeeeccC-ccceeeecCCCceEEEecccceEEEEeeccc
Confidence            3334456773   244678887776  477888777665544444332 2332222 34555777787777899999999


Q ss_pred             CcEeEEEeccCccccCCccccccccccccCCeEEEE-ECCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEE
Q 003800          116 GQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVS-SKGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVV  194 (794)
Q Consensus       116 G~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~  194 (794)
                      +++.=-.+..+-. .+.......  ..+ +..+++. ..|.++.|.++||+..=+.+.++...   .+..+..+..+++.
T Consensus       290 ak~~k~~~~~g~e-~~~~e~FeV--Shd-~~fia~~G~~G~I~lLhakT~eli~s~KieG~v~---~~~fsSdsk~l~~~  362 (514)
T KOG2055|consen  290 AKVTKLKPPYGVE-EKSMERFEV--SHD-SNFIAIAGNNGHIHLLHAKTKELITSFKIEGVVS---DFTFSSDSKELLAS  362 (514)
T ss_pred             cccccccCCCCcc-cchhheeEe--cCC-CCeEEEcccCceEEeehhhhhhhhheeeeccEEe---eEEEecCCcEEEEE
Confidence            8875322222211 112222210  111 2333333 48999999999998877777654321   12223455667776


Q ss_pred             EecCCceeEEEEEEcCCCceeeeeeeecccCccCc--eEEEcCcEEEEEECCCCeEEEEEeec
Q 003800          195 GYAGSSQFHAYQINAMNGELLNHETAAFSGGFVGD--VALVSSDTLVTLDTTRSILVTVSFKN  255 (794)
Q Consensus       195 ~~~g~~~~~v~ald~~tG~~~w~~~v~~~~~~s~~--~~~vg~~~lv~~d~~~g~L~v~~l~s  255 (794)
                      +..|    .|+.+|+..-..+...  .-...+.+.  |.-..+.+++|.. +.|.+-+.|.++
T Consensus       363 ~~~G----eV~v~nl~~~~~~~rf--~D~G~v~gts~~~S~ng~ylA~GS-~~GiVNIYd~~s  418 (514)
T KOG2055|consen  363 GGTG----EVYVWNLRQNSCLHRF--VDDGSVHGTSLCISLNGSYLATGS-DSGIVNIYDGNS  418 (514)
T ss_pred             cCCc----eEEEEecCCcceEEEE--eecCccceeeeeecCCCceEEecc-CcceEEEeccch
Confidence            6666    7888888766444322  223344442  3223445666664 567777777665


No 45 
>KOG0318 consensus WD40 repeat stress protein/actin interacting protein [Cytoskeleton]
Probab=95.22  E-value=7.6  Score=44.50  Aligned_cols=151  Identities=15%  Similarity=0.183  Sum_probs=97.5

Q ss_pred             cCCCEEEEEeCCCEEEEEECcCCccceEEEcCcc--cceeeee-eeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCc
Q 003800           51 TGRKRVVVSTEENVIASLDLRHGEIFWRHVLGIN--DVVDGID-IALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGS  127 (794)
Q Consensus        51 ~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~--~~i~~l~-~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~  127 (794)
                      +++.+...++.+|.++..|-+||+.+=...-+..  +.|-++. -..+..++++|++ ..++-||.+++++.=++..+..
T Consensus       200 PDG~~Fat~gsDgki~iyDGktge~vg~l~~~~aHkGsIfalsWsPDs~~~~T~SaD-kt~KIWdVs~~slv~t~~~~~~  278 (603)
T KOG0318|consen  200 PDGSRFATAGSDGKIYIYDGKTGEKVGELEDSDAHKGSIFALSWSPDSTQFLTVSAD-KTIKIWDVSTNSLVSTWPMGST  278 (603)
T ss_pred             CCCCeEEEecCCccEEEEcCCCccEEEEecCCCCccccEEEEEECCCCceEEEecCC-ceEEEEEeeccceEEEeecCCc
Confidence            4566777788899999999999999876543221  2333332 1256778888875 6899999999999988887654


Q ss_pred             cccCCccccccccccccCCeEEEE-ECCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEE
Q 003800          128 KHSKPLLLVPTNLKVDKDSLILVS-SKGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQ  206 (794)
Q Consensus       128 ~~s~~~~~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~a  206 (794)
                      ...+   .++.  ..- .+.++.. -+|.+.-|++.++.+.=...--...+....+  +.++..+|-.+.+|    .+..
T Consensus       279 v~dq---qvG~--lWq-kd~lItVSl~G~in~ln~~d~~~~~~i~GHnK~ITaLtv--~~d~~~i~SgsyDG----~I~~  346 (603)
T KOG0318|consen  279 VEDQ---QVGC--LWQ-KDHLITVSLSGTINYLNPSDPSVLKVISGHNKSITALTV--SPDGKTIYSGSYDG----HINS  346 (603)
T ss_pred             hhce---EEEE--EEe-CCeEEEEEcCcEEEEecccCCChhheecccccceeEEEE--cCCCCEEEeeccCc----eEEE
Confidence            2211   1210  112 3344444 4999999999999866555444333322222  23455566655565    7888


Q ss_pred             EEcCCCce
Q 003800          207 INAMNGEL  214 (794)
Q Consensus       207 ld~~tG~~  214 (794)
                      .|..+|.-
T Consensus       347 W~~~~g~~  354 (603)
T KOG0318|consen  347 WDSGSGTS  354 (603)
T ss_pred             EecCCccc
Confidence            88777753


No 46 
>KOG0316 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=95.19  E-value=0.78  Score=47.35  Aligned_cols=190  Identities=13%  Similarity=0.113  Sum_probs=100.8

Q ss_pred             EEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEE-ccCCeEEEEeCCCCcEeEEEeccCccccCCcc
Q 003800           56 VVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLS-SDGSTLRAWNLPDGQMVWESFLRGSKHSKPLL  134 (794)
Q Consensus        56 Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs-~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~  134 (794)
                      +-....+-.+...|..||++.=|..--. ..+...+. ..+-.|++| +.+..+|+||..+-...=-.-+. +.. .+..
T Consensus        74 f~s~GgDk~v~vwDV~TGkv~Rr~rgH~-aqVNtV~f-NeesSVv~SgsfD~s~r~wDCRS~s~ePiQild-ea~-D~V~  149 (307)
T KOG0316|consen   74 FASCGGDKAVQVWDVNTGKVDRRFRGHL-AQVNTVRF-NEESSVVASGSFDSSVRLWDCRSRSFEPIQILD-EAK-DGVS  149 (307)
T ss_pred             cccCCCCceEEEEEcccCeeeeeccccc-ceeeEEEe-cCcceEEEeccccceeEEEEcccCCCCccchhh-hhc-Ccee
Confidence            3344457789999999999875543221 22333322 233344445 45889999998654322111111 100 0000


Q ss_pred             ccccccccccCCeEEEEE-CCEEEEEECCCCcEEEEEeccCcceeeeeEEE-EecCCEEEEEEecCCceeEEEEEEcCCC
Q 003800          135 LVPTNLKVDKDSLILVSS-KGCLHAVSSIDGEILWTRDFAAESVEVQQVIQ-LDESDQIYVVGYAGSSQFHAYQINAMNG  212 (794)
Q Consensus       135 ~~~~~~~~~~~~~V~V~~-~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~-s~~~~~vyv~~~~g~~~~~v~ald~~tG  212 (794)
                      .+    .+. +..++..+ ||+++.+|...|..---+- ..    |..++- +.+++-+.+.++++    .+.-||-.||
T Consensus       150 Si----~v~-~heIvaGS~DGtvRtydiR~G~l~sDy~-g~----pit~vs~s~d~nc~La~~l~s----tlrLlDk~tG  215 (307)
T KOG0316|consen  150 SI----DVA-EHEIVAGSVDGTVRTYDIRKGTLSSDYF-GH----PITSVSFSKDGNCSLASSLDS----TLRLLDKETG  215 (307)
T ss_pred             EE----Eec-ccEEEeeccCCcEEEEEeecceeehhhc-CC----cceeEEecCCCCEEEEeeccc----eeeecccchh
Confidence            01    111 34444444 9999999999886543331 11    112221 22344445545554    7888999999


Q ss_pred             ceeeeeeeecccCccCceEEEcCcEEEEEECCCCeEEEEEeecceeeeEEEe
Q 003800          213 ELLNHETAAFSGGFVGDVALVSSDTLVTLDTTRSILVTVSFKNRKIAFQETH  264 (794)
Q Consensus       213 ~~~w~~~v~~~~~~s~~~~~vg~~~lv~~d~~~g~L~v~~l~sg~~~~~~~~  264 (794)
                      +++..+.-......--.|-+-.....|..-+..|.++.-||..+.+ +..++
T Consensus       216 klL~sYkGhkn~eykldc~l~qsdthV~sgSEDG~Vy~wdLvd~~~-~sk~~  266 (307)
T KOG0316|consen  216 KLLKSYKGHKNMEYKLDCCLNQSDTHVFSGSEDGKVYFWDLVDETQ-ISKLS  266 (307)
T ss_pred             HHHHHhcccccceeeeeeeecccceeEEeccCCceEEEEEecccee-eeeec
Confidence            9998775322211111333323333334444678899999988773 33333


No 47 
>KOG0296 consensus Angio-associated migratory cell protein (contains WD40 repeats) [Function unknown]
Probab=95.09  E-value=6.6  Score=43.08  Aligned_cols=141  Identities=15%  Similarity=0.231  Sum_probs=80.4

Q ss_pred             CCCEEEEEeCCCEEEEEECcCCccceEEEcCcc-cceeeeeeeeCCEEEEEEc-cCCeEEEEeCCCCcEeEEEeccCccc
Q 003800           52 GRKRVVVSTEENVIASLDLRHGEIFWRHVLGIN-DVVDGIDIALGKYVITLSS-DGSTLRAWNLPDGQMVWESFLRGSKH  129 (794)
Q Consensus        52 ~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~-~~i~~l~~~~g~~~V~Vs~-~g~~v~A~d~~tG~llWe~~l~~~~~  129 (794)
                      ..+.+.++.++..-+-.+..||+  |-..+... +++..... ..++.+.++| -.+.|+.|...+|...|...-..   
T Consensus        75 ~~~l~aTGGgDD~AflW~~~~ge--~~~eltgHKDSVt~~~F-shdgtlLATGdmsG~v~v~~~stg~~~~~~~~e~---  148 (399)
T KOG0296|consen   75 NNNLVATGGGDDLAFLWDISTGE--FAGELTGHKDSVTCCSF-SHDGTLLATGDMSGKVLVFKVSTGGEQWKLDQEV---  148 (399)
T ss_pred             CCceEEecCCCceEEEEEccCCc--ceeEecCCCCceEEEEE-ccCceEEEecCCCccEEEEEcccCceEEEeeccc---
Confidence            34556677778877778888888  66666543 45544422 3334444455 47899999999999999986222   


Q ss_pred             cCCccccccccccccCCeEEEE-E-CCEEEEEECCCCcEEEEEeccCcceeeeeEEE----------EecCCEEEEEEec
Q 003800          130 SKPLLLVPTNLKVDKDSLILVS-S-KGCLHAVSSIDGEILWTRDFAAESVEVQQVIQ----------LDESDQIYVVGYA  197 (794)
Q Consensus       130 s~~~~~~~~~~~~~~~~~V~V~-~-~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~----------s~~~~~vyv~~~~  197 (794)
                       .++..+-    .+....++.. + +|.           +|-|+.++...  .++..          -..+|+-.+.++.
T Consensus       149 -~dieWl~----WHp~a~illAG~~DGs-----------vWmw~ip~~~~--~kv~~Gh~~~ct~G~f~pdGKr~~tgy~  210 (399)
T KOG0296|consen  149 -EDIEWLK----WHPRAHILLAGSTDGS-----------VWMWQIPSQAL--CKVMSGHNSPCTCGEFIPDGKRILTGYD  210 (399)
T ss_pred             -CceEEEE----ecccccEEEeecCCCc-----------EEEEECCCcce--eeEecCCCCCcccccccCCCceEEEEec
Confidence             1222331    2222333332 2 454           45555554211  11110          0123443334444


Q ss_pred             CCceeEEEEEEcCCCceeeeee
Q 003800          198 GSSQFHAYQINAMNGELLNHET  219 (794)
Q Consensus       198 g~~~~~v~ald~~tG~~~w~~~  219 (794)
                      .+   .+...|++||+++-...
T Consensus       211 dg---ti~~Wn~ktg~p~~~~~  229 (399)
T KOG0296|consen  211 DG---TIIVWNPKTGQPLHKIT  229 (399)
T ss_pred             Cc---eEEEEecCCCceeEEec
Confidence            33   79999999999997664


No 48 
>KOG0316 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=95.06  E-value=5.1  Score=41.62  Aligned_cols=146  Identities=14%  Similarity=0.121  Sum_probs=78.9

Q ss_pred             eCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEE--ECCEEEEEECCCCcEEEEEe
Q 003800           94 LGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVS--SKGCLHAVSSIDGEILWTRD  171 (794)
Q Consensus        94 ~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~--~~g~l~ald~~tG~~~W~~~  171 (794)
                      .|+..++ .|.+.+|+.||+..|.++-++...+... .++...       .++.=+..  .|..++..|..||++.-++.
T Consensus        28 dGnY~lt-cGsdrtvrLWNp~rg~liktYsghG~EV-lD~~~s-------~Dnskf~s~GgDk~v~vwDV~TGkv~Rr~r   98 (307)
T KOG0316|consen   28 DGNYCLT-CGSDRTVRLWNPLRGALIKTYSGHGHEV-LDAALS-------SDNSKFASCGGDKAVQVWDVNTGKVDRRFR   98 (307)
T ss_pred             CCCEEEE-cCCCceEEeecccccceeeeecCCCcee-eecccc-------ccccccccCCCCceEEEEEcccCeeeeecc
Confidence            3555555 4456899999999999999998876433 122111       13333333  36678899999999876665


Q ss_pred             ccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeeeeeecc-cCccCceEEEcCcEEEEEECCCCeEEE
Q 003800          172 FAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHETAAFS-GGFVGDVALVSSDTLVTLDTTRSILVT  250 (794)
Q Consensus       172 ~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~-~~~s~~~~~vg~~~lv~~d~~~g~L~v  250 (794)
                      --......  +........++-.+++.    .+.++|-.+-..---+.+... -++  ..+-+.+..++.... .|.+..
T Consensus        99 gH~aqVNt--V~fNeesSVv~SgsfD~----s~r~wDCRS~s~ePiQildea~D~V--~Si~v~~heIvaGS~-DGtvRt  169 (307)
T KOG0316|consen   99 GHLAQVNT--VRFNEESSVVASGSFDS----SVRLWDCRSRSFEPIQILDEAKDGV--SSIDVAEHEIVAGSV-DGTVRT  169 (307)
T ss_pred             cccceeeE--EEecCcceEEEeccccc----eeEEEEcccCCCCccchhhhhcCce--eEEEecccEEEeecc-CCcEEE
Confidence            44332211  11111122223223332    566666554322111111100 011  112234556666653 699999


Q ss_pred             EEeecce
Q 003800          251 VSFKNRK  257 (794)
Q Consensus       251 ~~l~sg~  257 (794)
                      .|+..|+
T Consensus       170 ydiR~G~  176 (307)
T KOG0316|consen  170 YDIRKGT  176 (307)
T ss_pred             EEeecce
Confidence            9999988


No 49 
>PHA02790 Kelch-like protein; Provisional
Probab=94.86  E-value=2.6  Score=49.08  Aligned_cols=167  Identities=10%  Similarity=0.066  Sum_probs=91.8

Q ss_pred             CCEEEEEeCC------CEEEEEECcCCccceEEEcCccc--ceeeeeeeeCCEEEEEEcc--CCeEEEEeCCCCcEeEEE
Q 003800           53 RKRVVVSTEE------NVIASLDLRHGEIFWRHVLGIND--VVDGIDIALGKYVITLSSD--GSTLRAWNLPDGQMVWES  122 (794)
Q Consensus        53 ~~~Vyv~t~~------g~l~ALn~~tG~ivWR~~l~~~~--~i~~l~~~~g~~~V~Vs~~--g~~v~A~d~~tG~llWe~  122 (794)
                      ++.||+..+.      ..+...|+++++  |+..-+-+.  .-.+. +..++.+.++||.  ...+..||+.++  .|+.
T Consensus       271 ~~~lyviGG~~~~~~~~~v~~Ydp~~~~--W~~~~~m~~~r~~~~~-v~~~~~iYviGG~~~~~sve~ydp~~n--~W~~  345 (480)
T PHA02790        271 GEVVYLIGGWMNNEIHNNAIAVNYISNN--WIPIPPMNSPRLYASG-VPANNKLYVVGGLPNPTSVERWFHGDA--AWVN  345 (480)
T ss_pred             CCEEEEEcCCCCCCcCCeEEEEECCCCE--EEECCCCCchhhcceE-EEECCEEEEECCcCCCCceEEEECCCC--eEEE
Confidence            6678887653      357788999875  987543321  11122 3467767777764  246888987655  5875


Q ss_pred             eccCccccCCccccccccccccCCeEEEEEC-----CEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEec
Q 003800          123 FLRGSKHSKPLLLVPTNLKVDKDSLILVSSK-----GCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYA  197 (794)
Q Consensus       123 ~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~~-----g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~  197 (794)
                      -..-+.. .  .-..   ...-++.+||.++     ..+..+|+.++  .|+...+.+.-...... ..-++.+|++|  
T Consensus       346 ~~~l~~~-r--~~~~---~~~~~g~IYviGG~~~~~~~ve~ydp~~~--~W~~~~~m~~~r~~~~~-~~~~~~IYv~G--  414 (480)
T PHA02790        346 MPSLLKP-R--CNPA---VASINNVIYVIGGHSETDTTTEYLLPNHD--QWQFGPSTYYPHYKSCA-LVFGRRLFLVG--  414 (480)
T ss_pred             CCCCCCC-C--cccE---EEEECCEEEEecCcCCCCccEEEEeCCCC--EEEeCCCCCCccccceE-EEECCEEEEEC--
Confidence            3221111 1  0000   1222677888632     34667887754  79875543321111111 24688999976  


Q ss_pred             CCceeEEEEEEcCCCceeeeeeeecccCccC-ceEEEcCcEEEEE
Q 003800          198 GSSQFHAYQINAMNGELLNHETAAFSGGFVG-DVALVSSDTLVTL  241 (794)
Q Consensus       198 g~~~~~v~ald~~tG~~~w~~~v~~~~~~s~-~~~~vg~~~lv~~  241 (794)
                      |    .+.++|+.++  .|+.--..+....+ .+.++++.++++.
T Consensus       415 G----~~e~ydp~~~--~W~~~~~m~~~r~~~~~~v~~~~IYviG  453 (480)
T PHA02790        415 R----NAEFYCESSN--TWTLIDDPIYPRDNPELIIVDNKLLLIG  453 (480)
T ss_pred             C----ceEEecCCCC--cEeEcCCCCCCccccEEEEECCEEEEEC
Confidence            3    4677888765  68764333332322 2333465566554


No 50 
>PHA02713 hypothetical protein; Provisional
Probab=94.77  E-value=1.9  Score=51.33  Aligned_cols=173  Identities=9%  Similarity=0.101  Sum_probs=96.4

Q ss_pred             CCEEEEEeCC-------CEEEEEECcCCccceEEEcCcccce--eeeeeeeCCEEEEEEccC-----CeEEEEeCCCCcE
Q 003800           53 RKRVVVSTEE-------NVIASLDLRHGEIFWRHVLGINDVV--DGIDIALGKYVITLSSDG-----STLRAWNLPDGQM  118 (794)
Q Consensus        53 ~~~Vyv~t~~-------g~l~ALn~~tG~ivWR~~l~~~~~i--~~l~~~~g~~~V~Vs~~g-----~~v~A~d~~tG~l  118 (794)
                      ++.||+..+.       +.+..+|+++..  |+..-+-+..-  .+. +..++.+.++||.+     ..+..||+.+.  
T Consensus       303 ~~~IYviGG~~~~~~~~~~v~~Yd~~~n~--W~~~~~m~~~R~~~~~-~~~~g~IYviGG~~~~~~~~sve~Ydp~~~--  377 (557)
T PHA02713        303 DNEIIIAGGYNFNNPSLNKVYKINIENKI--HVELPPMIKNRCRFSL-AVIDDTIYAIGGQNGTNVERTIECYTMGDD--  377 (557)
T ss_pred             CCEEEEEcCCCCCCCccceEEEEECCCCe--EeeCCCCcchhhceeE-EEECCEEEEECCcCCCCCCceEEEEECCCC--
Confidence            7789988763       358889999874  97644322111  122 34577777777742     34889999876  


Q ss_pred             eEEEeccCccccCCccccccccccccCCeEEEEEC-------------------------CEEEEEECCCCcEEEEEecc
Q 003800          119 VWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSSK-------------------------GCLHAVSSIDGEILWTRDFA  173 (794)
Q Consensus       119 lWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~~-------------------------g~l~ald~~tG~~~W~~~~~  173 (794)
                      .|+.-..-+..   ....+   ...-++.+||.++                         ..+.++|+.+.  .|+.-.+
T Consensus       378 ~W~~~~~mp~~---r~~~~---~~~~~g~IYviGG~~~~~~~~~~~~~~~~~~~~~~~~~~~ve~YDP~td--~W~~v~~  449 (557)
T PHA02713        378 KWKMLPDMPIA---LSSYG---MCVLDQYIYIIGGRTEHIDYTSVHHMNSIDMEEDTHSSNKVIRYDTVNN--IWETLPN  449 (557)
T ss_pred             eEEECCCCCcc---ccccc---EEEECCEEEEEeCCCcccccccccccccccccccccccceEEEECCCCC--eEeecCC
Confidence            58864321111   00011   1122577777642                         24778888775  5886554


Q ss_pred             Ccce-eeeeEEEEecCCEEEEEEecCC-c--eeEEEEEEcCCCceeeeeeeecccCccC-ceEEEcCcEEEEE
Q 003800          174 AESV-EVQQVIQLDESDQIYVVGYAGS-S--QFHAYQINAMNGELLNHETAAFSGGFVG-DVALVSSDTLVTL  241 (794)
Q Consensus       174 ~~~~-~~~~~v~s~~~~~vyv~~~~g~-~--~~~v~ald~~tG~~~w~~~v~~~~~~s~-~~~~vg~~~lv~~  241 (794)
                      .+.. ....+  ..-++.+|++|...+ .  .-.+.++|+.+ .-.|+.--..|..... .+..+++.+++..
T Consensus       450 m~~~r~~~~~--~~~~~~IYv~GG~~~~~~~~~~ve~Ydp~~-~~~W~~~~~m~~~r~~~~~~~~~~~iyv~G  519 (557)
T PHA02713        450 FWTGTIRPGV--VSHKDDIYVVCDIKDEKNVKTCIFRYNTNT-YNGWELITTTESRLSALHTILHDNTIMMLH  519 (557)
T ss_pred             CCcccccCcE--EEECCEEEEEeCCCCCCccceeEEEecCCC-CCCeeEccccCcccccceeEEECCEEEEEe
Confidence            3221 11112  246889999874321 1  12467899987 1248875455544443 3333465565544


No 51 
>PTZ00421 coronin; Provisional
Probab=94.74  E-value=3.2  Score=48.52  Aligned_cols=154  Identities=15%  Similarity=0.135  Sum_probs=82.7

Q ss_pred             CCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCC
Q 003800           53 RKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKP  132 (794)
Q Consensus        53 ~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~  132 (794)
                      .+.++.++.++.|.-.|.++|+.+=.... ....+..+.....+..++.++.++.++.||+.+|+.+.+...........
T Consensus       138 ~~iLaSgs~DgtVrIWDl~tg~~~~~l~~-h~~~V~sla~spdG~lLatgs~Dg~IrIwD~rsg~~v~tl~~H~~~~~~~  216 (493)
T PTZ00421        138 MNVLASAGADMVVNVWDVERGKAVEVIKC-HSDQITSLEWNLDGSLLCTTSKDKKLNIIDPRDGTIVSSVEAHASAKSQR  216 (493)
T ss_pred             CCEEEEEeCCCEEEEEECCCCeEEEEEcC-CCCceEEEEEECCCCEEEEecCCCEEEEEECCCCcEEEEEecCCCCcceE
Confidence            35677888999999999999976432211 12234444222334455546667899999999999988766543221000


Q ss_pred             ccccccccccccCCeEEEEE-----CCEEEEEECCCCcE-EEEEeccCcceeeeeEEEEecCCEEEEEEe-cCCceeEEE
Q 003800          133 LLLVPTNLKVDKDSLILVSS-----KGCLHAVSSIDGEI-LWTRDFAAESVEVQQVIQLDESDQIYVVGY-AGSSQFHAY  205 (794)
Q Consensus       133 ~~~~~~~~~~~~~~~V~V~~-----~g~l~ald~~tG~~-~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~-~g~~~~~v~  205 (794)
                      ....+     . .+.++...     ++.+...|..+... .-......... ...+....+++.+|+.+. +|    .+.
T Consensus       217 ~~w~~-----~-~~~ivt~G~s~s~Dr~VklWDlr~~~~p~~~~~~d~~~~-~~~~~~d~d~~~L~lggkgDg----~Ir  285 (493)
T PTZ00421        217 CLWAK-----R-KDLIITLGCSKSQQRQIMLWDTRKMASPYSTVDLDQSSA-LFIPFFDEDTNLLYIGSKGEG----NIR  285 (493)
T ss_pred             EEEcC-----C-CCeEEEEecCCCCCCeEEEEeCCCCCCceeEeccCCCCc-eEEEEEcCCCCEEEEEEeCCC----eEE
Confidence            11111     1 23333321     46777777765442 22222211111 111111234455665543 33    677


Q ss_pred             EEEcCCCceeeee
Q 003800          206 QINAMNGELLNHE  218 (794)
Q Consensus       206 ald~~tG~~~w~~  218 (794)
                      .+|..+|++....
T Consensus       286 iwdl~~~~~~~~~  298 (493)
T PTZ00421        286 CFELMNERLTFCS  298 (493)
T ss_pred             EEEeeCCceEEEe
Confidence            8888888876554


No 52 
>KOG0278 consensus Serine/threonine kinase receptor-associated protein [Lipid transport and metabolism]
Probab=94.66  E-value=2  Score=44.85  Aligned_cols=106  Identities=14%  Similarity=0.187  Sum_probs=77.5

Q ss_pred             CCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCC
Q 003800           53 RKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKP  132 (794)
Q Consensus        53 ~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~  132 (794)
                      ++.++-+++++.|---|-+||.++=+..++.+  +..+.+...+++++++ +|+.|.-||+.+=.++=++.+.-.+.  +
T Consensus       155 D~~iLSSadd~tVRLWD~rTgt~v~sL~~~s~--VtSlEvs~dG~ilTia-~gssV~Fwdaksf~~lKs~k~P~nV~--S  229 (334)
T KOG0278|consen  155 DKCILSSADDKTVRLWDHRTGTEVQSLEFNSP--VTSLEVSQDGRILTIA-YGSSVKFWDAKSFGLLKSYKMPCNVE--S  229 (334)
T ss_pred             CceEEeeccCCceEEEEeccCcEEEEEecCCC--CcceeeccCCCEEEEe-cCceeEEeccccccceeeccCccccc--c
Confidence            44566667888888899999999988777665  4455445566677644 56789999999999998888765443  2


Q ss_pred             ccccccccccccCCeEEEE--ECCEEEEEECCCCcEEEEE
Q 003800          133 LLLVPTNLKVDKDSLILVS--SKGCLHAVSSIDGEILWTR  170 (794)
Q Consensus       133 ~~~~~~~~~~~~~~~V~V~--~~g~l~ald~~tG~~~W~~  170 (794)
                      +.+-|       .+.+||.  .+..++.+|-.||+.+=.+
T Consensus       230 ASL~P-------~k~~fVaGged~~~~kfDy~TgeEi~~~  262 (334)
T KOG0278|consen  230 ASLHP-------KKEFFVAGGEDFKVYKFDYNTGEEIGSY  262 (334)
T ss_pred             ccccC-------CCceEEecCcceEEEEEeccCCceeeec
Confidence            22222       4567775  3889999999999877665


No 53 
>PLN02919 haloacid dehalogenase-like hydrolase family protein
Probab=94.43  E-value=3.9  Score=52.36  Aligned_cols=200  Identities=11%  Similarity=0.112  Sum_probs=102.4

Q ss_pred             CCEEEEEeCC-CEEEEEECcCCccceEEE-------cCcc--c------ceeeeeeeeCCEEEEEEc-cCCeEEEEeCCC
Q 003800           53 RKRVVVSTEE-NVIASLDLRHGEIFWRHV-------LGIN--D------VVDGIDIALGKYVITLSS-DGSTLRAWNLPD  115 (794)
Q Consensus        53 ~~~Vyv~t~~-g~l~ALn~~tG~ivWR~~-------l~~~--~------~i~~l~~~~g~~~V~Vs~-~g~~v~A~d~~t  115 (794)
                      ++.|||+... +.|.-+|..+|.+.=-..       ....  .      ...++.....++.++|+. .+++|+-||..+
T Consensus       635 gn~LYVaDt~n~~Ir~id~~~~~V~tlag~G~~g~~~~gg~~~~~~~ln~P~gVa~dp~~g~LyVad~~~~~I~v~d~~~  714 (1057)
T PLN02919        635 KNLLYVADTENHALREIDFVNETVRTLAGNGTKGSDYQGGKKGTSQVLNSPWDVCFEPVNEKVYIAMAGQHQIWEYNISD  714 (1057)
T ss_pred             CCEEEEEeCCCceEEEEecCCCEEEEEeccCcccCCCCCChhhhHhhcCCCeEEEEecCCCeEEEEECCCCeEEEEECCC
Confidence            4568887764 568888887775310000       0000  0      001221222245566653 457899999999


Q ss_pred             CcEeEEEeccCcc------ccC-CccccccccccccC-CeEEEEE--CCEEEEEECCCCcEEEEEecc------------
Q 003800          116 GQMVWESFLRGSK------HSK-PLLLVPTNLKVDKD-SLILVSS--KGCLHAVSSIDGEILWTRDFA------------  173 (794)
Q Consensus       116 G~llWe~~l~~~~------~s~-~~~~~~~~~~~~~~-~~V~V~~--~g~l~ald~~tG~~~W~~~~~------------  173 (794)
                      |...- ....+..      ... .....|..+..+.+ +.+||..  +++|+.+|..+|...|.....            
T Consensus       715 g~v~~-~~G~G~~~~~~g~~~~~~~~~~P~GIavspdG~~LYVADs~n~~Irv~D~~tg~~~~~~gg~~~~~~~l~~fG~  793 (1057)
T PLN02919        715 GVTRV-FSGDGYERNLNGSSGTSTSFAQPSGISLSPDLKELYIADSESSSIRALDLKTGGSRLLAGGDPTFSDNLFKFGD  793 (1057)
T ss_pred             CeEEE-EecCCccccCCCCccccccccCccEEEEeCCCCEEEEEECCCCeEEEEECCCCcEEEEEecccccCcccccccC
Confidence            87541 1111000      000 00001111122323 3477763  789999999988866543100            


Q ss_pred             --Cc----ce-eeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeeeeeec---------ccCccC--ceEEEcC
Q 003800          174 --AE----SV-EVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHETAAF---------SGGFVG--DVALVSS  235 (794)
Q Consensus       174 --~~----~~-~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~---------~~~~s~--~~~~vg~  235 (794)
                        ..    .+ .|..+. ...++.+|+....++   ++..+|+.+|....-...+.         ...+..  .+.+..+
T Consensus       794 ~dG~g~~~~l~~P~Gva-vd~dG~LYVADs~N~---rIrviD~~tg~v~tiaG~G~~G~~dG~~~~a~l~~P~GIavd~d  869 (1057)
T PLN02919        794 HDGVGSEVLLQHPLGVL-CAKDGQIYVADSYNH---KIKKLDPATKRVTTLAGTGKAGFKDGKALKAQLSEPAGLALGEN  869 (1057)
T ss_pred             CCCchhhhhccCCceee-EeCCCcEEEEECCCC---EEEEEECCCCeEEEEeccCCcCCCCCcccccccCCceEEEEeCC
Confidence              00    00 133332 234567888665444   78889998887764332111         111111  1222233


Q ss_pred             cEEEEEECCCCeEEEEEeecce
Q 003800          236 DTLVTLDTTRSILVTVSFKNRK  257 (794)
Q Consensus       236 ~~lv~~d~~~g~L~v~~l~sg~  257 (794)
                      +.++.+|..++.++++|+.+++
T Consensus       870 G~lyVaDt~Nn~Irvid~~~~~  891 (1057)
T PLN02919        870 GRLFVADTNNSLIRYLDLNKGE  891 (1057)
T ss_pred             CCEEEEECCCCEEEEEECCCCc
Confidence            4456778888889999998876


No 54 
>PLN00181 protein SPA1-RELATED; Provisional
Probab=94.21  E-value=20  Score=44.61  Aligned_cols=190  Identities=18%  Similarity=0.108  Sum_probs=96.6

Q ss_pred             CCCEEEEEeCCCEEEEEECcCCccceEE------EcCcccceeeeeee-eCCEEEEEEccCCeEEEEeCCCCcEeEEEec
Q 003800           52 GRKRVVVSTEENVIASLDLRHGEIFWRH------VLGINDVVDGIDIA-LGKYVITLSSDGSTLRAWNLPDGQMVWESFL  124 (794)
Q Consensus        52 ~~~~Vyv~t~~g~l~ALn~~tG~ivWR~------~l~~~~~i~~l~~~-~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l  124 (794)
                      +++.+.+++.++.|.-.|..+...-++.      .+.....+..+... ..+..++.++.++.|+.||..+|+.+.+...
T Consensus       494 dg~~latgg~D~~I~iwd~~~~~~~~~~~~~~~~~~~~~~~v~~l~~~~~~~~~las~~~Dg~v~lWd~~~~~~~~~~~~  573 (793)
T PLN00181        494 DGEFFATAGVNKKIKIFECESIIKDGRDIHYPVVELASRSKLSGICWNSYIKSQVASSNFEGVVQVWDVARSQLVTEMKE  573 (793)
T ss_pred             CCCEEEEEeCCCEEEEEECCcccccccccccceEEecccCceeeEEeccCCCCEEEEEeCCCeEEEEECCCCeEEEEecC
Confidence            3556778888898888886542111110      01111112222111 1234455466678999999999999988765


Q ss_pred             cCccccCCccccccccccccCCeEEE-E-ECCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCCcee
Q 003800          125 RGSKHSKPLLLVPTNLKVDKDSLILV-S-SKGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQF  202 (794)
Q Consensus       125 ~~~~~s~~~~~~~~~~~~~~~~~V~V-~-~~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~  202 (794)
                      ....+ ..+.+.+     . ++.+++ . .+|.+...|..+|...-+......   ...+.....++..++.+...+   
T Consensus       574 H~~~V-~~l~~~p-----~-~~~~L~Sgs~Dg~v~iWd~~~~~~~~~~~~~~~---v~~v~~~~~~g~~latgs~dg---  640 (793)
T PLN00181        574 HEKRV-WSIDYSS-----A-DPTLLASGSDDGSVKLWSINQGVSIGTIKTKAN---ICCVQFPSESGRSLAFGSADH---  640 (793)
T ss_pred             CCCCE-EEEEEcC-----C-CCCEEEEEcCCCEEEEEECCCCcEEEEEecCCC---eEEEEEeCCCCCEEEEEeCCC---
Confidence            54322 1111111     1 334444 3 389999999998876655443221   111111123344444443322   


Q ss_pred             EEEEEEcCCCceeeeeeeecccCccCceEEEcCcEEEEEECCCCeEEEEEeecc
Q 003800          203 HAYQINAMNGELLNHETAAFSGGFVGDVALVSSDTLVTLDTTRSILVTVSFKNR  256 (794)
Q Consensus       203 ~v~ald~~tG~~~w~~~v~~~~~~s~~~~~vg~~~lv~~d~~~g~L~v~~l~sg  256 (794)
                      .+..+|..+++.......+-...+. .+.+..+..+++.. ..+.+.+-|+..+
T Consensus       641 ~I~iwD~~~~~~~~~~~~~h~~~V~-~v~f~~~~~lvs~s-~D~~ikiWd~~~~  692 (793)
T PLN00181        641 KVYYYDLRNPKLPLCTMIGHSKTVS-YVRFVDSSTLVSSS-TDNTLKLWDLSMS  692 (793)
T ss_pred             eEEEEECCCCCccceEecCCCCCEE-EEEEeCCCEEEEEE-CCCEEEEEeCCCC
Confidence            7888898877532211111111121 12223344555554 3577888777643


No 55 
>KOG0291 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=94.16  E-value=17  Score=43.59  Aligned_cols=66  Identities=11%  Similarity=-0.003  Sum_probs=45.7

Q ss_pred             ecCCEEEEEEecCCceeEEEEEEcCCCceeeeeeeecccCccCceEEEc-CcEEEEEE-CCCCeEEEEEeecce
Q 003800          186 DESDQIYVVGYAGSSQFHAYQINAMNGELLNHETAAFSGGFVGDVALVS-SDTLVTLD-TTRSILVTVSFKNRK  257 (794)
Q Consensus       186 ~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~~~~s~~~~~vg-~~~lv~~d-~~~g~L~v~~l~sg~  257 (794)
                      ..+..++-.+++|    .|.|.|.+.++--.+.+  +|..++-+|+-+. .+.+||+- .+.=.+++-++++|+
T Consensus       402 ~~g~~llssSLDG----tVRAwDlkRYrNfRTft--~P~p~QfscvavD~sGelV~AG~~d~F~IfvWS~qTGq  469 (893)
T KOG0291|consen  402 ARGNVLLSSSLDG----TVRAWDLKRYRNFRTFT--SPEPIQFSCVAVDPSGELVCAGAQDSFEIFVWSVQTGQ  469 (893)
T ss_pred             ecCCEEEEeecCC----eEEeeeecccceeeeec--CCCceeeeEEEEcCCCCEEEeeccceEEEEEEEeecCe
Confidence            4566777778888    89999999988776664  4555555777763 34555653 222257888888888


No 56 
>KOG0319 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=94.01  E-value=17  Score=43.26  Aligned_cols=193  Identities=14%  Similarity=0.234  Sum_probs=107.9

Q ss_pred             CCCEEEEEeCCCEEEEEECcCCccc--eEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccc
Q 003800           52 GRKRVVVSTEENVIASLDLRHGEIF--WRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKH  129 (794)
Q Consensus        52 ~~~~Vyv~t~~g~l~ALn~~tG~iv--WR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~  129 (794)
                      ++..+|++.....+.-.+..+|+.+  |+..-+.+  +..+.....+.++..+|.++.++.||...|...=..+..++..
T Consensus        73 d~~~L~~a~rs~llrv~~L~tgk~irswKa~He~P--vi~ma~~~~g~LlAtggaD~~v~VWdi~~~~~th~fkG~gGvV  150 (775)
T KOG0319|consen   73 DEEVLVTASRSQLLRVWSLPTGKLIRSWKAIHEAP--VITMAFDPTGTLLATGGADGRVKVWDIKNGYCTHSFKGHGGVV  150 (775)
T ss_pred             CccEEEEeeccceEEEEEcccchHhHhHhhccCCC--eEEEEEcCCCceEEeccccceEEEEEeeCCEEEEEecCCCceE
Confidence            4677899999998888888999654  55433333  3233223344566656667899999999998888877765554


Q ss_pred             cCCccccccccccccCCeEEE-E-ECCEEEEEECCCCcE----------------------------------EEEEecc
Q 003800          130 SKPLLLVPTNLKVDKDSLILV-S-SKGCLHAVSSIDGEI----------------------------------LWTRDFA  173 (794)
Q Consensus       130 s~~~~~~~~~~~~~~~~~V~V-~-~~g~l~ald~~tG~~----------------------------------~W~~~~~  173 (794)
                       ..+.+-+     +....+++ . .|+.+++.|..++..                                  +|.+..-
T Consensus       151 -ssl~F~~-----~~~~~lL~sg~~D~~v~vwnl~~~~tcl~~~~~H~S~vtsL~~~~d~~~~ls~~RDkvi~vwd~~~~  224 (775)
T KOG0319|consen  151 -SSLLFHP-----HWNRWLLASGATDGTVRVWNLNDKRTCLHTMILHKSAVTSLAFSEDSLELLSVGRDKVIIVWDLVQY  224 (775)
T ss_pred             -EEEEeCC-----ccchhheeecCCCceEEEEEcccCchHHHHHHhhhhheeeeeeccCCceEEEeccCcEEEEeehhhh
Confidence             2222222     10111222 2 266666666654433                                  3444211


Q ss_pred             Ccc--e----eeeeEEEEec-----CCEEEEEEecCCceeEEEEEEcCCCceeeeeeeecccCccCceEEEcCcEEEEEE
Q 003800          174 AES--V----EVQQVIQLDE-----SDQIYVVGYAGSSQFHAYQINAMNGELLNHETAAFSGGFVGDVALVSSDTLVTLD  242 (794)
Q Consensus       174 ~~~--~----~~~~~v~s~~-----~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~~~~s~~~~~vg~~~lv~~d  242 (794)
                      ...  +    ..+.++....     +..++.+|..|    .+--+|+++|+.+...+.+....+..-..+.+.+.+.++.
T Consensus       225 ~~l~~lp~ye~~E~vv~l~~~~~~~~~~~~TaG~~g----~~~~~d~es~~~~~~~~~~~~~e~~~~~~~~~~~~~l~vt  300 (775)
T KOG0319|consen  225 KKLKTLPLYESLESVVRLREELGGKGEYIITAGGSG----VVQYWDSESGKCVYKQRQSDSEEIDHLLAIESMSQLLLVT  300 (775)
T ss_pred             hhhheechhhheeeEEEechhcCCcceEEEEecCCc----eEEEEecccchhhhhhccCCchhhhcceeccccCceEEEE
Confidence            100  0    0111221111     22444444444    7888899999988776544222254444445556666665


Q ss_pred             CCCCeEEEEEeecce
Q 003800          243 TTRSILVTVSFKNRK  257 (794)
Q Consensus       243 ~~~g~L~v~~l~sg~  257 (794)
                      ++ -+|..+|..+.+
T Consensus       301 ae-Qnl~l~d~~~l~  314 (775)
T KOG0319|consen  301 AE-QNLFLYDEDELT  314 (775)
T ss_pred             cc-ceEEEEEccccE
Confidence            43 467777877766


No 57 
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=94.00  E-value=11  Score=41.06  Aligned_cols=197  Identities=15%  Similarity=0.169  Sum_probs=92.6

Q ss_pred             cCCCEEEEEeC-CCEEEEEECc-CCccceEEEcCcccceeeeeeeeCCEEEEEEc-cCCeEEEEeCC-CCcEe-EEEecc
Q 003800           51 TGRKRVVVSTE-ENVIASLDLR-HGEIFWRHVLGINDVVDGIDIALGKYVITLSS-DGSTLRAWNLP-DGQMV-WESFLR  125 (794)
Q Consensus        51 ~~~~~Vyv~t~-~g~l~ALn~~-tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~-~g~~v~A~d~~-tG~ll-We~~l~  125 (794)
                      ++++.+|+++. ++.|..++.. +|++.=.......+....+.....+..++++. .++.+..||.+ +|.+. -.....
T Consensus        44 pd~~~lyv~~~~~~~i~~~~~~~~g~l~~~~~~~~~~~p~~i~~~~~g~~l~v~~~~~~~v~v~~~~~~g~~~~~~~~~~  123 (330)
T PRK11028         44 PDKRHLYVGVRPEFRVLSYRIADDGALTFAAESPLPGSPTHISTDHQGRFLFSASYNANCVSVSPLDKDGIPVAPIQIIE  123 (330)
T ss_pred             CCCCEEEEEECCCCcEEEEEECCCCceEEeeeecCCCCceEEEECCCCCEEEEEEcCCCeEEEEEECCCCCCCCceeecc
Confidence            34667898875 5667666554 56531111111111122232223455666654 35789999886 45321 111111


Q ss_pred             CccccCCccccccccccccCCeEEEEE--CCEEEEEECCC-CcEE----EEEeccCcceeeeeEEEEecCCEEEEEEecC
Q 003800          126 GSKHSKPLLLVPTNLKVDKDSLILVSS--KGCLHAVSSID-GEIL----WTRDFAAESVEVQQVIQLDESDQIYVVGYAG  198 (794)
Q Consensus       126 ~~~~s~~~~~~~~~~~~~~~~~V~V~~--~g~l~ald~~t-G~~~----W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g  198 (794)
                      ...  .+..+.-   ..+ ++.++|..  ++.|..+|..+ |...    .....+.+. .|..+....++..+|++.. +
T Consensus       124 ~~~--~~~~~~~---~p~-g~~l~v~~~~~~~v~v~d~~~~g~l~~~~~~~~~~~~g~-~p~~~~~~pdg~~lyv~~~-~  195 (330)
T PRK11028        124 GLE--GCHSANI---DPD-NRTLWVPCLKEDRIRLFTLSDDGHLVAQEPAEVTTVEGA-GPRHMVFHPNQQYAYCVNE-L  195 (330)
T ss_pred             CCC--cccEeEe---CCC-CCEEEEeeCCCCEEEEEEECCCCcccccCCCceecCCCC-CCceEEECCCCCEEEEEec-C
Confidence            110  0011100   112 35666653  68888888765 4321    222222111 2444543345557777553 3


Q ss_pred             CceeEEEEEEcCCCceeeeeeee-cccCccC-----ceEE-EcCcEEEEEECCCCeEEEEEeec
Q 003800          199 SSQFHAYQINAMNGELLNHETAA-FSGGFVG-----DVAL-VSSDTLVTLDTTRSILVTVSFKN  255 (794)
Q Consensus       199 ~~~~~v~ald~~tG~~~w~~~v~-~~~~~s~-----~~~~-vg~~~lv~~d~~~g~L~v~~l~s  255 (794)
                      +..+.++.++..+|+......+. .|....+     .+.+ ..+..+++.+...+.+.+.++.+
T Consensus       196 ~~~v~v~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~i~~~pdg~~lyv~~~~~~~I~v~~i~~  259 (330)
T PRK11028        196 NSSVDVWQLKDPHGEIECVQTLDMMPADFSDTRWAADIHITPDGRHLYACDRTASLISVFSVSE  259 (330)
T ss_pred             CCEEEEEEEeCCCCCEEEEEEEecCCCcCCCCccceeEEECCCCCEEEEecCCCCeEEEEEEeC
Confidence            33455555655567654433332 2322211     1222 13445556666567788888754


No 58 
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=93.91  E-value=12  Score=40.94  Aligned_cols=188  Identities=12%  Similarity=0.147  Sum_probs=90.7

Q ss_pred             EEEEEeC-CCEEEEEECcC-CccceEEEcCcccceeeeeeeeCCEEEEEEc-cCCeEEEEeCC-CCcEeEEEeccCcccc
Q 003800           55 RVVVSTE-ENVIASLDLRH-GEIFWRHVLGINDVVDGIDIALGKYVITLSS-DGSTLRAWNLP-DGQMVWESFLRGSKHS  130 (794)
Q Consensus        55 ~Vyv~t~-~g~l~ALn~~t-G~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~-~g~~v~A~d~~-tG~llWe~~l~~~~~s  130 (794)
                      ++|+++. ++.|..+|..+ |++.=.+.++..+....+....++..+++++ ..+.+..|+.. +|++.=........  
T Consensus         3 ~~y~~~~~~~~I~~~~~~~~g~l~~~~~~~~~~~~~~l~~spd~~~lyv~~~~~~~i~~~~~~~~g~l~~~~~~~~~~--   80 (330)
T PRK11028          3 IVYIASPESQQIHVWNLNHEGALTLLQVVDVPGQVQPMVISPDKRHLYVGVRPEFRVLSYRIADDGALTFAAESPLPG--   80 (330)
T ss_pred             EEEEEcCCCCCEEEEEECCCCceeeeeEEecCCCCccEEECCCCCEEEEEECCCCcEEEEEECCCCceEEeeeecCCC--
Confidence            4788854 67788888864 5433223343322222332233455666654 35678888875 56542111111110  


Q ss_pred             CCcccccccccccc-CCeEEEEE--CCEEEEEECC-CCcE---EEEEeccCcceeeeeEEEEecCCEEEEEEecCCceeE
Q 003800          131 KPLLLVPTNLKVDK-DSLILVSS--KGCLHAVSSI-DGEI---LWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFH  203 (794)
Q Consensus       131 ~~~~~~~~~~~~~~-~~~V~V~~--~g~l~ald~~-tG~~---~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~  203 (794)
                       .+..+    ..+. ++.+++..  ++.+..++.. +|..   .-..  +... .+..+....++..+|+.+...+   .
T Consensus        81 -~p~~i----~~~~~g~~l~v~~~~~~~v~v~~~~~~g~~~~~~~~~--~~~~-~~~~~~~~p~g~~l~v~~~~~~---~  149 (330)
T PRK11028         81 -SPTHI----STDHQGRFLFSASYNANCVSVSPLDKDGIPVAPIQII--EGLE-GCHSANIDPDNRTLWVPCLKED---R  149 (330)
T ss_pred             -CceEE----EECCCCCEEEEEEcCCCeEEEEEECCCCCCCCceeec--cCCC-cccEeEeCCCCCEEEEeeCCCC---E
Confidence             11111    1122 34566653  7888888775 4422   2211  1110 1223322334557777665433   5


Q ss_pred             EEEEEcCC-Cceeee--eeeecccCcc-CceEEE-cCcEEEEEECCCCeEEEEEeec
Q 003800          204 AYQINAMN-GELLNH--ETAAFSGGFV-GDVALV-SSDTLVTLDTTRSILVTVSFKN  255 (794)
Q Consensus       204 v~ald~~t-G~~~w~--~~v~~~~~~s-~~~~~v-g~~~lv~~d~~~g~L~v~~l~s  255 (794)
                      +..+|..+ |...-.  ..+..+.+-. ..+.+- ++..+++.+...+.+.+.++..
T Consensus       150 v~v~d~~~~g~l~~~~~~~~~~~~g~~p~~~~~~pdg~~lyv~~~~~~~v~v~~~~~  206 (330)
T PRK11028        150 IRLFTLSDDGHLVAQEPAEVTTVEGAGPRHMVFHPNQQYAYCVNELNSSVDVWQLKD  206 (330)
T ss_pred             EEEEEECCCCcccccCCCceecCCCCCCceEEECCCCCEEEEEecCCCEEEEEEEeC
Confidence            66666655 543211  1111221111 122222 4456667776678999999873


No 59 
>KOG2048 consensus WD40 repeat protein [General function prediction only]
Probab=93.75  E-value=5.2  Score=46.96  Aligned_cols=153  Identities=14%  Similarity=0.172  Sum_probs=92.6

Q ss_pred             CCCEEEEEeCCCEEEEEECcCCccceEEEcCcc-cceeeeeeeeCCEE-EEEEccCCeEEEEeCCCCcEeEEEeccCccc
Q 003800           52 GRKRVVVSTEENVIASLDLRHGEIFWRHVLGIN-DVVDGIDIALGKYV-ITLSSDGSTLRAWNLPDGQMVWESFLRGSKH  129 (794)
Q Consensus        52 ~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~-~~i~~l~~~~g~~~-V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~  129 (794)
                      ..+.+-++.++|++.-++-..|.+.-...|... ..+-.+. -...+. ++.|..++.+|+||+..|..+-.....-..+
T Consensus       121 ~~~~l~IgcddGvl~~~s~~p~~I~~~r~l~rq~sRvLsls-w~~~~~~i~~Gs~Dg~Iriwd~~~~~t~~~~~~~~d~l  199 (691)
T KOG2048|consen  121 ENTILAIGCDDGVLYDFSIGPDKITYKRSLMRQKSRVLSLS-WNPTGTKIAGGSIDGVIRIWDVKSGQTLHIITMQLDRL  199 (691)
T ss_pred             ccceEEeecCCceEEEEecCCceEEEEeecccccceEEEEE-ecCCccEEEecccCceEEEEEcCCCceEEEeeeccccc
Confidence            356788899999999999999999999999766 2333331 233344 4545567889999999999887433322222


Q ss_pred             cCCccccccccccccCCeEEEEECCEEEEEECCCCcE-EEEEeccCcce-------eeeeEEEEecCCEEEEEEecCCce
Q 003800          130 SKPLLLVPTNLKVDKDSLILVSSKGCLHAVSSIDGEI-LWTRDFAAESV-------EVQQVIQLDESDQIYVVGYAGSSQ  201 (794)
Q Consensus       130 s~~~~~~~~~~~~~~~~~V~V~~~g~l~ald~~tG~~-~W~~~~~~~~~-------~~~~~v~s~~~~~vyv~~~~g~~~  201 (794)
                      +..-+.+.        =.|.++.++.+.+-| .+|.+ -|-.....-.-       ....+.-+...+.++..|.++   
T Consensus       200 ~k~~~~iV--------WSv~~Lrd~tI~sgD-S~G~V~FWd~~~gTLiqS~~~h~adVl~Lav~~~~d~vfsaGvd~---  267 (691)
T KOG2048|consen  200 SKREPTIV--------WSVLFLRDSTIASGD-SAGTVTFWDSIFGTLIQSHSCHDADVLALAVADNEDRVFSAGVDP---  267 (691)
T ss_pred             ccCCceEE--------EEEEEeecCcEEEec-CCceEEEEcccCcchhhhhhhhhcceeEEEEcCCCCeEEEccCCC---
Confidence            11011110        113334577777777 45654 36544332100       011121123457888888887   


Q ss_pred             eEEEEEEcCCCceeeee
Q 003800          202 FHAYQINAMNGELLNHE  218 (794)
Q Consensus       202 ~~v~ald~~tG~~~w~~  218 (794)
                       ++.-+...++..-|..
T Consensus       268 -~ii~~~~~~~~~~wv~  283 (691)
T KOG2048|consen  268 -KIIQYSLTTNKSEWVI  283 (691)
T ss_pred             -ceEEEEecCCccceee
Confidence             7888888877666766


No 60 
>PHA03098 kelch-like protein; Provisional
Probab=93.74  E-value=3.8  Score=48.30  Aligned_cols=189  Identities=12%  Similarity=0.122  Sum_probs=97.8

Q ss_pred             CCEEEEEeCC-------CEEEEEECcCCccceEEEcCcccc--eeeeeeeeCCEEEEEEccC-----CeEEEEeCCCCcE
Q 003800           53 RKRVVVSTEE-------NVIASLDLRHGEIFWRHVLGINDV--VDGIDIALGKYVITLSSDG-----STLRAWNLPDGQM  118 (794)
Q Consensus        53 ~~~Vyv~t~~-------g~l~ALn~~tG~ivWR~~l~~~~~--i~~l~~~~g~~~V~Vs~~g-----~~v~A~d~~tG~l  118 (794)
                      ++.||+..+.       +.+..+|+.+++  |+..-+-+..  -.+. +..++.++++||.+     ..+..||+.++  
T Consensus       294 ~~~lyv~GG~~~~~~~~~~v~~yd~~~~~--W~~~~~~~~~R~~~~~-~~~~~~lyv~GG~~~~~~~~~v~~yd~~~~--  368 (534)
T PHA03098        294 NNVIYFIGGMNKNNLSVNSVVSYDTKTKS--WNKVPELIYPRKNPGV-TVFNNRIYVIGGIYNSISLNTVESWKPGES--  368 (534)
T ss_pred             CCEEEEECCCcCCCCeeccEEEEeCCCCe--eeECCCCCcccccceE-EEECCEEEEEeCCCCCEecceEEEEcCCCC--
Confidence            6678876642       358899998874  8654322211  1122 34566677777743     35788998766  


Q ss_pred             eEEEeccCccccCCccccccccccccCCeEEEEEC--------CEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCE
Q 003800          119 VWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSSK--------GCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQ  190 (794)
Q Consensus       119 lWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~~--------g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~  190 (794)
                      .|+....-+..     ..... ....++.+++.++        ..+..+|..++  .|+.-.+.+.-...... ...++.
T Consensus       369 ~W~~~~~lp~~-----r~~~~-~~~~~~~iYv~GG~~~~~~~~~~v~~yd~~t~--~W~~~~~~p~~r~~~~~-~~~~~~  439 (534)
T PHA03098        369 KWREEPPLIFP-----RYNPC-VVNVNNLIYVIGGISKNDELLKTVECFSLNTN--KWSKGSPLPISHYGGCA-IYHDGK  439 (534)
T ss_pred             ceeeCCCcCcC-----Cccce-EEEECCEEEEECCcCCCCcccceEEEEeCCCC--eeeecCCCCccccCceE-EEECCE
Confidence            48754322111     11100 1112567777632        45788888775  58765443321001111 245789


Q ss_pred             EEEEEecCCc-----eeEEEEEEcCCCceeeeeeeecccCccC-ceEEEcCcEEEEEECC----CCeEEEEEeecce
Q 003800          191 IYVVGYAGSS-----QFHAYQINAMNGELLNHETAAFSGGFVG-DVALVSSDTLVTLDTT----RSILVTVSFKNRK  257 (794)
Q Consensus       191 vyv~~~~g~~-----~~~v~ald~~tG~~~w~~~v~~~~~~s~-~~~~vg~~~lv~~d~~----~g~L~v~~l~sg~  257 (794)
                      +|++|.....     --.+.++|+.++  .|+..-..+....+ .....++.++++.-..    .+.+.+.|..+++
T Consensus       440 iyv~GG~~~~~~~~~~~~v~~yd~~~~--~W~~~~~~~~~r~~~~~~~~~~~iyv~GG~~~~~~~~~v~~yd~~~~~  514 (534)
T PHA03098        440 IYVIGGISYIDNIKVYNIVESYNPVTN--KWTELSSLNFPRINASLCIFNNKIYVVGGDKYEYYINEIEVYDDKTNT  514 (534)
T ss_pred             EEEECCccCCCCCcccceEEEecCCCC--ceeeCCCCCcccccceEEEECCEEEEEcCCcCCcccceeEEEeCCCCE
Confidence            9988743211     124888998876  46653222222222 2222354555443211    2356667766665


No 61 
>PF05935 Arylsulfotrans:  Arylsulfotransferase (ASST);  InterPro: IPR010262 This family consists of several bacterial arylsulphotransferase proteins. Arylsulphotransferase (ASST) transfers a sulphate group from phenolic sulphate esters to a phenolic acceptor substrate.; PDB: 3ETT_B 3ELQ_A 3ETS_A.
Probab=93.56  E-value=3.4  Score=48.15  Aligned_cols=115  Identities=10%  Similarity=0.181  Sum_probs=62.1

Q ss_pred             eCCEEEEEEc----cCCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEEECCEEEEEECCCCcEEEE
Q 003800           94 LGKYVITLSS----DGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSSKGCLHAVSSIDGEILWT  169 (794)
Q Consensus        94 ~g~~~V~Vs~----~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~~g~l~ald~~tG~~~W~  169 (794)
                      ..+++.++..    .....+++|. +|.++|......... ......       .++.+++.....+..+|. .|++.|+
T Consensus       112 ~~~gl~~~~~~~~~~~~~~~~iD~-~G~Vrw~~~~~~~~~-~~~~~l-------~nG~ll~~~~~~~~e~D~-~G~v~~~  181 (477)
T PF05935_consen  112 MEDGLYFVNGNDWDSSSYTYLIDN-NGDVRWYLPLDSGSD-NSFKQL-------PNGNLLIGSGNRLYEIDL-LGKVIWE  181 (477)
T ss_dssp             -TT-EEEEEETT--BEEEEEEEET-TS-EEEEE-GGGT---SSEEE--------TTS-EEEEEBTEEEEE-T-T--EEEE
T ss_pred             cCCcEEEEeCCCCCCCceEEEECC-CccEEEEEccCcccc-ceeeEc-------CCCCEEEecCCceEEEcC-CCCEEEe
Confidence            3566777766    4568999996 799999999876532 111222       267777777899999996 5999999


Q ss_pred             EeccCccee-eeeEEEEecCCEEEEEEec-------CC---ceeEEEEEEcCCCceeeeeee
Q 003800          170 RDFAAESVE-VQQVIQLDESDQIYVVGYA-------GS---SQFHAYQINAMNGELLNHETA  220 (794)
Q Consensus       170 ~~~~~~~~~-~~~~v~s~~~~~vyv~~~~-------g~---~~~~v~ald~~tG~~~w~~~v  220 (794)
                      ++.+..... .=.+.. ..+|.+.+++..       .+   ..=.+..+| .+|+++|+...
T Consensus       182 ~~l~~~~~~~HHD~~~-l~nGn~L~l~~~~~~~~~~~~~~~~~D~Ivevd-~tG~vv~~wd~  241 (477)
T PF05935_consen  182 YDLPGGYYDFHHDIDE-LPNGNLLILASETKYVDEDKDVDTVEDVIVEVD-PTGEVVWEWDF  241 (477)
T ss_dssp             EE--TTEE-B-S-EEE--TTS-EEEEEEETTEE-TS-EE---S-EEEEE--TTS-EEEEEEG
T ss_pred             eecCCcccccccccEE-CCCCCEEEEEeecccccCCCCccEecCEEEEEC-CCCCEEEEEeh
Confidence            999874310 000111 233344433331       10   011588899 99999999875


No 62 
>PF05096 Glu_cyclase_2:  Glutamine cyclotransferase;  InterPro: IPR007788 This family of enzymes (EC 2.3.2.5) catalyses the cyclization of free L-glutamine and N-terminal glutaminyl residues in proteins to pyroglutamate (5-oxoproline) and pyroglutamyl residues, respectively. This family includes plant and bacterial enzymes and seems unrelated to the mammalian enzymes.; PDB: 3NOK_B 2FAW_A 2IWA_A 3NOM_A 3NOL_A 3MBR_X.
Probab=93.54  E-value=3.8  Score=43.64  Aligned_cols=154  Identities=14%  Similarity=0.142  Sum_probs=86.4

Q ss_pred             CCEEEEEEc--cC-CeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEEE--CCEEEEEECCCCcEEEE
Q 003800           95 GKYVITLSS--DG-STLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSS--KGCLHAVSSIDGEILWT  169 (794)
Q Consensus        95 g~~~V~Vs~--~g-~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~--~g~l~ald~~tG~~~W~  169 (794)
                      +++.++.|+  .| +.||-+|..+|+.+.+..+.......++       +.. ++.++.++  .+..+.+|+.+-+++=+
T Consensus        54 ~~g~LyESTG~yG~S~l~~~d~~tg~~~~~~~l~~~~FgEGi-------t~~-~d~l~qLTWk~~~~f~yd~~tl~~~~~  125 (264)
T PF05096_consen   54 DDGTLYESTGLYGQSSLRKVDLETGKVLQSVPLPPRYFGEGI-------TIL-GDKLYQLTWKEGTGFVYDPNTLKKIGT  125 (264)
T ss_dssp             ETTEEEEEECSTTEEEEEEEETTTSSEEEEEE-TTT--EEEE-------EEE-TTEEEEEESSSSEEEEEETTTTEEEEE
T ss_pred             CCCEEEEeCCCCCcEEEEEEECCCCcEEEEEECCccccceeE-------EEE-CCEEEEEEecCCeEEEEccccceEEEE
Confidence            445566643  23 6899999999999999999875442222       222 56788874  89999999999988877


Q ss_pred             EeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeeeeeeccc----CccCceEEEcCcEEEEEECCC
Q 003800          170 RDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHETAAFSG----GFVGDVALVSSDTLVTLDTTR  245 (794)
Q Consensus       170 ~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~~----~~s~~~~~vg~~~lv~~d~~~  245 (794)
                      ++.+...+   .+.  .++..+++ + +|++  +++-+|+++-+...+.++....    .+.+ .-++++.+++=+- .+
T Consensus       126 ~~y~~EGW---GLt--~dg~~Li~-S-DGS~--~L~~~dP~~f~~~~~i~V~~~g~pv~~LNE-LE~i~G~IyANVW-~t  194 (264)
T PF05096_consen  126 FPYPGEGW---GLT--SDGKRLIM-S-DGSS--RLYFLDPETFKEVRTIQVTDNGRPVSNLNE-LEYINGKIYANVW-QT  194 (264)
T ss_dssp             EE-SSS-----EEE--ECSSCEEE-E--SSS--EEEEE-TTT-SEEEEEE-EETTEE---EEE-EEEETTEEEEEET-TS
T ss_pred             EecCCcce---EEE--cCCCEEEE-E-CCcc--ceEEECCcccceEEEEEEEECCEECCCcEe-EEEEcCEEEEEeC-CC
Confidence            77665433   221  34555554 2 4433  7888999998888777654321    1111 1223333333222 23


Q ss_pred             CeEEEEEeecceeeeEEEeeccc
Q 003800          246 SILVTVSFKNRKIAFQETHLSNL  268 (794)
Q Consensus       246 g~L~v~~l~sg~~~~~~~~l~~l  268 (794)
                      ..+.++|.++|.+ ...+.++.|
T Consensus       195 d~I~~Idp~tG~V-~~~iDls~L  216 (264)
T PF05096_consen  195 DRIVRIDPETGKV-VGWIDLSGL  216 (264)
T ss_dssp             SEEEEEETTT-BE-EEEEE-HHH
T ss_pred             CeEEEEeCCCCeE-EEEEEhhHh
Confidence            4566777777763 344444443


No 63 
>TIGR03548 mutarot_permut cyclically-permuted mutarotase family protein. Members of this protein family show essentially full-length homology, cyclically permuted, to YjhT from Escherichia coli. YjhT was shown to act as a mutarotase for sialic acid, and by this ability to be able to act as a virulence factor. Members of the YjhT family (TIGR03547) and this cyclically-permuted family have multiple repeats of the beta-propeller-forming Kelch repeat.
Probab=93.45  E-value=13  Score=40.63  Aligned_cols=162  Identities=9%  Similarity=0.040  Sum_probs=78.2

Q ss_pred             ecccccEeeEEeccCceeeeeeeeeccCCCEEEEEeCC------CEEEEEECcCCcc--ceEEEcCccccee-eeeeeeC
Q 003800           25 EDQVGLMDWHQQYIGKVKHAVFHTQKTGRKRVVVSTEE------NVIASLDLRHGEI--FWRHVLGINDVVD-GIDIALG   95 (794)
Q Consensus        25 edqvG~~dW~~~~vG~~~~~~f~~~~~~~~~Vyv~t~~------g~l~ALn~~tG~i--vWR~~l~~~~~i~-~l~~~~g   95 (794)
                      .++..+..|+..- ..|.....+....-++.||+....      +.+..+|+.+.+-  .|+..-+-+.... ......+
T Consensus        45 ~~~~~~~~W~~~~-~lp~~r~~~~~~~~~~~lyviGG~~~~~~~~~v~~~d~~~~~w~~~~~~~~~lp~~~~~~~~~~~~  123 (323)
T TIGR03548        45 KDENSNLKWVKDG-QLPYEAAYGASVSVENGIYYIGGSNSSERFSSVYRITLDESKEELICETIGNLPFTFENGSACYKD  123 (323)
T ss_pred             ecCCCceeEEEcc-cCCccccceEEEEECCEEEEEcCCCCCCCceeEEEEEEcCCceeeeeeEcCCCCcCccCceEEEEC
Confidence            3445556787621 222211111122226778887653      4688889888752  4554322221111 1112345


Q ss_pred             CEEEEEEcc-----CCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEEEC------CEEEEEECCCC
Q 003800           96 KYVITLSSD-----GSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSSK------GCLHAVSSIDG  164 (794)
Q Consensus        96 ~~~V~Vs~~-----g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~~------g~l~ald~~tG  164 (794)
                      +.+++++|.     -..+..||+.+.  .|+..-.-+........     ....++.++|..+      ..+.++|..+.
T Consensus       124 ~~iYv~GG~~~~~~~~~v~~yd~~~~--~W~~~~~~p~~~r~~~~-----~~~~~~~iYv~GG~~~~~~~~~~~yd~~~~  196 (323)
T TIGR03548       124 GTLYVGGGNRNGKPSNKSYLFNLETQ--EWFELPDFPGEPRVQPV-----CVKLQNELYVFGGGSNIAYTDGYKYSPKKN  196 (323)
T ss_pred             CEEEEEeCcCCCccCceEEEEcCCCC--CeeECCCCCCCCCCcce-----EEEECCEEEEEcCCCCccccceEEEecCCC
Confidence            555555653     146899998765  48864321110011001     1112567777642      23568888765


Q ss_pred             cEEEEEeccCcce-eeee----EEEEecCCEEEEEEe
Q 003800          165 EILWTRDFAAESV-EVQQ----VIQLDESDQIYVVGY  196 (794)
Q Consensus       165 ~~~W~~~~~~~~~-~~~~----~v~s~~~~~vyv~~~  196 (794)
                        .|+.-.+.+.. .|..    ......++.+|++|-
T Consensus       197 --~W~~~~~~~~~~~p~~~~~~~~~~~~~~~iyv~GG  231 (323)
T TIGR03548       197 --QWQKVADPTTDSEPISLLGAASIKINESLLLCIGG  231 (323)
T ss_pred             --eeEECCCCCCCCCceeccceeEEEECCCEEEEECC
Confidence              58764432110 1111    111234788998764


No 64 
>KOG0291 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=93.45  E-value=13  Score=44.32  Aligned_cols=110  Identities=14%  Similarity=0.143  Sum_probs=71.5

Q ss_pred             CCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEE-E-CCEEEEEECCCCcEEEEEec
Q 003800           95 GKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVS-S-KGCLHAVSSIDGEILWTRDF  172 (794)
Q Consensus        95 g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~-~-~g~l~ald~~tG~~~W~~~~  172 (794)
                      ++..++-|+++++|..||...|-=.=.+.-+.... ....+       ...+.+++- + ||+|.|.|...++--=++..
T Consensus       361 Dgq~iaTG~eDgKVKvWn~~SgfC~vTFteHts~V-t~v~f-------~~~g~~llssSLDGtVRAwDlkRYrNfRTft~  432 (893)
T KOG0291|consen  361 DGQLIATGAEDGKVKVWNTQSGFCFVTFTEHTSGV-TAVQF-------TARGNVLLSSSLDGTVRAWDLKRYRNFRTFTS  432 (893)
T ss_pred             CCcEEEeccCCCcEEEEeccCceEEEEeccCCCce-EEEEE-------EecCCEEEEeecCCeEEeeeecccceeeeecC
Confidence            33344446678899999999998777766554332 11111       224555554 4 99999999999988778877


Q ss_pred             cCcceeeeeEEEEec--CCEEEEEEecCCceeEEEEEEcCCCceeeee
Q 003800          173 AAESVEVQQVIQLDE--SDQIYVVGYAGSSQFHAYQINAMNGELLNHE  218 (794)
Q Consensus       173 ~~~~~~~~~~v~s~~--~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~  218 (794)
                      |.+..  ..++ +.+  +..|++.+.   .++.++..+.+||+.+.-.
T Consensus       433 P~p~Q--fscv-avD~sGelV~AG~~---d~F~IfvWS~qTGqllDiL  474 (893)
T KOG0291|consen  433 PEPIQ--FSCV-AVDPSGELVCAGAQ---DSFEIFVWSVQTGQLLDIL  474 (893)
T ss_pred             CCcee--eeEE-EEcCCCCEEEeecc---ceEEEEEEEeecCeeeehh
Confidence            76542  2333 222  445554332   2468888999999988544


No 65 
>KOG1446 consensus Histone H3 (Lys4) methyltransferase complex and RNA cleavage factor II complex, subunit SWD2 [RNA processing and modification; Chromatin structure and dynamics; Posttranslational modification, protein turnover, chaperones]
Probab=93.44  E-value=13  Score=40.02  Aligned_cols=217  Identities=17%  Similarity=0.199  Sum_probs=118.7

Q ss_pred             ccceeecccccE---eeEEeccCceeeeeeeeeccCCCEEEEEeC--CCEEEEEECcCCccceEEEcCcccceeeeeeee
Q 003800           20 SLSLYEDQVGLM---DWHQQYIGKVKHAVFHTQKTGRKRVVVSTE--ENVIASLDLRHGEIFWRHVLGINDVVDGIDIAL   94 (794)
Q Consensus        20 ~~Al~edqvG~~---dW~~~~vG~~~~~~f~~~~~~~~~Vyv~t~--~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~   94 (794)
                      +.-||....|+.   .-+++| |. ....|..   .+..++-+|.  +..|--|+..|-+-+ |+=-+....+..+.+.-
T Consensus        37 sl~LYd~~~g~~~~ti~skky-G~-~~~~Fth---~~~~~i~sStk~d~tIryLsl~dNkyl-RYF~GH~~~V~sL~~sP  110 (311)
T KOG1446|consen   37 SLRLYDSLSGKQVKTINSKKY-GV-DLACFTH---HSNTVIHSSTKEDDTIRYLSLHDNKYL-RYFPGHKKRVNSLSVSP  110 (311)
T ss_pred             eEEEEEcCCCceeeEeecccc-cc-cEEEEec---CCceEEEccCCCCCceEEEEeecCceE-EEcCCCCceEEEEEecC
Confidence            456787777773   222233 22 2233442   2556666665  678999998886543 22222223344443333


Q ss_pred             CCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEEE-CC-EEEEEECC--CCcEEEEE
Q 003800           95 GKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSS-KG-CLHAVSSI--DGEILWTR  170 (794)
Q Consensus        95 g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~-~g-~l~ald~~--tG~~~W~~  170 (794)
                      .++.+.-++.+.+||.||...=+-.=-..+.+      .++.    +-+..+.+++.. ++ .+.-+|..  ++.+-=++
T Consensus       111 ~~d~FlS~S~D~tvrLWDlR~~~cqg~l~~~~------~pi~----AfDp~GLifA~~~~~~~IkLyD~Rs~dkgPF~tf  180 (311)
T KOG1446|consen  111 KDDTFLSSSLDKTVRLWDLRVKKCQGLLNLSG------RPIA----AFDPEGLIFALANGSELIKLYDLRSFDKGPFTTF  180 (311)
T ss_pred             CCCeEEecccCCeEEeeEecCCCCceEEecCC------Ccce----eECCCCcEEEEecCCCeEEEEEecccCCCCceeE
Confidence            44555535567899999986332221112221      1222    234457777764 33 56666665  34444444


Q ss_pred             eccCcc-eeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeeeeeeccc-CccCceEE-EcCcEEEEEECCCCe
Q 003800          171 DFAAES-VEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHETAAFSG-GFVGDVAL-VSSDTLVTLDTTRSI  247 (794)
Q Consensus       171 ~~~~~~-~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~~-~~s~~~~~-vg~~~lv~~d~~~g~  247 (794)
                      ..+.+. .+...+-. ..+|+..+++-.++   .++.+|+-+|.++......... .+..++.+ ..++.+.+.+ +.|.
T Consensus       181 ~i~~~~~~ew~~l~F-S~dGK~iLlsT~~s---~~~~lDAf~G~~~~tfs~~~~~~~~~~~a~ftPds~Fvl~gs-~dg~  255 (311)
T KOG1446|consen  181 SITDNDEAEWTDLEF-SPDGKSILLSTNAS---FIYLLDAFDGTVKSTFSGYPNAGNLPLSATFTPDSKFVLSGS-DDGT  255 (311)
T ss_pred             ccCCCCccceeeeEE-cCCCCEEEEEeCCC---cEEEEEccCCcEeeeEeeccCCCCcceeEEECCCCcEEEEec-CCCc
Confidence            444221 11222322 35666666666654   7889999999988776543322 23334444 3445555554 5799


Q ss_pred             EEEEEeecce
Q 003800          248 LVTVSFKNRK  257 (794)
Q Consensus       248 L~v~~l~sg~  257 (794)
                      +++-++++|.
T Consensus       256 i~vw~~~tg~  265 (311)
T KOG1446|consen  256 IHVWNLETGK  265 (311)
T ss_pred             EEEEEcCCCc
Confidence            9999999987


No 66 
>KOG0315 consensus G-protein beta subunit-like protein (contains WD40 repeats) [General function prediction only]
Probab=93.42  E-value=12  Score=39.36  Aligned_cols=143  Identities=13%  Similarity=0.096  Sum_probs=82.9

Q ss_pred             EEEE-EccCCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEEECCEEEEEECCCCc--EEEEEeccC
Q 003800           98 VITL-SSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSSKGCLHAVSSIDGE--ILWTRDFAA  174 (794)
Q Consensus        98 ~V~V-s~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~~g~l~ald~~tG~--~~W~~~~~~  174 (794)
                      ++.+ .+.+-++|-|.+.+|.=.-..+...... ..+++.+     + .+++.+.....++.+|..+++  ++=+++...
T Consensus        11 viLvsA~YDhTIRfWqa~tG~C~rTiqh~dsqV-NrLeiTp-----d-k~~LAaa~~qhvRlyD~~S~np~Pv~t~e~h~   83 (311)
T KOG0315|consen   11 VILVSAGYDHTIRFWQALTGICSRTIQHPDSQV-NRLEITP-----D-KKDLAAAGNQHVRLYDLNSNNPNPVATFEGHT   83 (311)
T ss_pred             eEEEeccCcceeeeeehhcCeEEEEEecCccce-eeEEEcC-----C-cchhhhccCCeeEEEEccCCCCCceeEEeccC
Confidence            4444 4568899999999999998888776554 4445544     2 345555577888888888876  466666554


Q ss_pred             cceeeeeEEEEecCCE-EEEEEecCCceeEEEEEEcCCCceeeeeeeecccCccCceEEEcCcEEEEEECCCCeEEEEEe
Q 003800          175 ESVEVQQVIQLDESDQ-IYVVGYAGSSQFHAYQINAMNGELLNHETAAFSGGFVGDVALVSSDTLVTLDTTRSILVTVSF  253 (794)
Q Consensus       175 ~~~~~~~~v~s~~~~~-vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~~~~s~~~~~vg~~~lv~~d~~~g~L~v~~l  253 (794)
                      ....   .+.-..+++ .|-.+-+|    .+-..|+.+  +.-|+....++.+..-++-..+..++..| .+|.+++=||
T Consensus        84 kNVt---aVgF~~dgrWMyTgseDg----t~kIWdlR~--~~~qR~~~~~spVn~vvlhpnQteLis~d-qsg~irvWDl  153 (311)
T KOG0315|consen   84 KNVT---AVGFQCDGRWMYTGSEDG----TVKIWDLRS--LSCQRNYQHNSPVNTVVLHPNQTELISGD-QSGNIRVWDL  153 (311)
T ss_pred             CceE---EEEEeecCeEEEecCCCc----eEEEEeccC--cccchhccCCCCcceEEecCCcceEEeec-CCCcEEEEEc
Confidence            4331   111112222 33322222    455555554  22233333334443323333555666666 3689999999


Q ss_pred             ecce
Q 003800          254 KNRK  257 (794)
Q Consensus       254 ~sg~  257 (794)
                      +...
T Consensus       154 ~~~~  157 (311)
T KOG0315|consen  154 GENS  157 (311)
T ss_pred             cCCc
Confidence            8764


No 67 
>COG3823 Glutamine cyclotransferase [Posttranslational modification, protein turnover, chaperones]
Probab=93.20  E-value=2.8  Score=42.77  Aligned_cols=147  Identities=17%  Similarity=0.148  Sum_probs=84.4

Q ss_pred             CCCEEEEEeC---CCEEEEEECcCCccceEEEcCccccee--eeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccC
Q 003800           52 GRKRVVVSTE---ENVIASLDLRHGEIFWRHVLGINDVVD--GIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRG  126 (794)
Q Consensus        52 ~~~~Vyv~t~---~g~l~ALn~~tG~ivWR~~l~~~~~i~--~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~  126 (794)
                      .++.+|.+|.   ...|...|..+|++.|.+.+..+ .+-  |+ ...++.+..++=..+..+-+|+.|=+.+=+++..+
T Consensus        54 ~~g~i~esTG~yg~S~ir~~~L~~gq~~~s~~l~~~-~~FgEGi-t~~gd~~y~LTw~egvaf~~d~~t~~~lg~~~y~G  131 (262)
T COG3823          54 LDGHILESTGLYGFSKIRVSDLTTGQEIFSEKLAPD-TVFGEGI-TKLGDYFYQLTWKEGVAFKYDADTLEELGRFSYEG  131 (262)
T ss_pred             eCCEEEEeccccccceeEEEeccCceEEEEeecCCc-cccccce-eeccceEEEEEeccceeEEEChHHhhhhcccccCC
Confidence            3668888886   46899999999999999999842 221  33 12344444445445678888988887777777666


Q ss_pred             ccccCCccccccccccccCCeEEEEEC--CEEEEEECCCCcEEEEEeccCcc-----eeeeeEEEEecCCEEEEEEecCC
Q 003800          127 SKHSKPLLLVPTNLKVDKDSLILVSSK--GCLHAVSSIDGEILWTRDFAAES-----VEVQQVIQLDESDQIYVVGYAGS  199 (794)
Q Consensus       127 ~~~s~~~~~~~~~~~~~~~~~V~V~~~--g~l~ald~~tG~~~W~~~~~~~~-----~~~~~~v~s~~~~~vyv~~~~g~  199 (794)
                      +..  +  +.     -  ++.-++.++  ..|+-.|++|=+..=+.......     +.-..+    -+|.+|+--....
T Consensus       132 eGW--g--Lt-----~--d~~~LimsdGsatL~frdP~tfa~~~~v~VT~~g~pv~~LNELE~----VdG~lyANVw~t~  196 (262)
T COG3823         132 EGW--G--LT-----S--DDKNLIMSDGSATLQFRDPKTFAELDTVQVTDDGVPVSKLNELEW----VDGELYANVWQTT  196 (262)
T ss_pred             cce--e--ee-----c--CCcceEeeCCceEEEecCHHHhhhcceEEEEECCeecccccceee----eccEEEEeeeeec
Confidence            554  1  11     1  222234443  35666666543322222211111     101112    3667776444432


Q ss_pred             ceeEEEEEEcCCCceeeee
Q 003800          200 SQFHAYQINAMNGELLNHE  218 (794)
Q Consensus       200 ~~~~v~ald~~tG~~~w~~  218 (794)
                         ++.-+|+++|+++.-.
T Consensus       197 ---~I~rI~p~sGrV~~wi  212 (262)
T COG3823         197 ---RIARIDPDSGRVVAWI  212 (262)
T ss_pred             ---ceEEEcCCCCcEEEEE
Confidence               6778888888877444


No 68 
>KOG0310 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=93.04  E-value=4.1  Score=46.10  Aligned_cols=151  Identities=17%  Similarity=0.191  Sum_probs=83.8

Q ss_pred             CCCEEEE-EeCCCEEEEEECcCCccceEEEcCcc-cceeeeeeeeCCEEEEE-EccCCeEEEEeCCCCcEeEEEeccCcc
Q 003800           52 GRKRVVV-STEENVIASLDLRHGEIFWRHVLGIN-DVVDGIDIALGKYVITL-SSDGSTLRAWNLPDGQMVWESFLRGSK  128 (794)
Q Consensus        52 ~~~~Vyv-~t~~g~l~ALn~~tG~ivWR~~l~~~-~~i~~l~~~~g~~~V~V-s~~g~~v~A~d~~tG~llWe~~l~~~~  128 (794)
                      .++.+++ ++++.++--.|+.++.+  ...+... +-+.+.....+.+.+++ |+.++.||.||+.+-. -|...+.-+.
T Consensus       121 ~d~t~l~s~sDd~v~k~~d~s~a~v--~~~l~~htDYVR~g~~~~~~~hivvtGsYDg~vrl~DtR~~~-~~v~elnhg~  197 (487)
T KOG0310|consen  121 QDNTMLVSGSDDKVVKYWDLSTAYV--QAELSGHTDYVRCGDISPANDHIVVTGSYDGKVRLWDTRSLT-SRVVELNHGC  197 (487)
T ss_pred             cCCeEEEecCCCceEEEEEcCCcEE--EEEecCCcceeEeeccccCCCeEEEecCCCceEEEEEeccCC-ceeEEecCCC
Confidence            3555554 56677777778777764  4445433 22333222334444444 5678999999998765 6666665331


Q ss_pred             ccCCccccccccccccCCeEEEE-ECCEEEEEECCCCcEEEEEec-cCcceeeeeEEEEecCCEEEEEEecCCceeEEEE
Q 003800          129 HSKPLLLVPTNLKVDKDSLILVS-SKGCLHAVSSIDGEILWTRDF-AAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQ  206 (794)
Q Consensus       129 ~s~~~~~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG~~~W~~~~-~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~a  206 (794)
                      --.....+|       .+..++. ++..+...|..+|.++=.... -...  ...+..+..+..++-++++|    +|-.
T Consensus       198 pVe~vl~lp-------sgs~iasAgGn~vkVWDl~~G~qll~~~~~H~Kt--VTcL~l~s~~~rLlS~sLD~----~VKV  264 (487)
T KOG0310|consen  198 PVESVLALP-------SGSLIASAGGNSVKVWDLTTGGQLLTSMFNHNKT--VTCLRLASDSTRLLSGSLDR----HVKV  264 (487)
T ss_pred             ceeeEEEcC-------CCCEEEEcCCCeEEEEEecCCceehhhhhcccce--EEEEEeecCCceEeeccccc----ceEE
Confidence            101222222       3445554 456677777776655432221 1111  22233334567788888887    7888


Q ss_pred             EEcCCCceeeee
Q 003800          207 INAMNGELLNHE  218 (794)
Q Consensus       207 ld~~tG~~~w~~  218 (794)
                      +|..+=+.+...
T Consensus       265 fd~t~~Kvv~s~  276 (487)
T KOG0310|consen  265 FDTTNYKVVHSW  276 (487)
T ss_pred             EEccceEEEEee
Confidence            886666666444


No 69 
>TIGR03548 mutarot_permut cyclically-permuted mutarotase family protein. Members of this protein family show essentially full-length homology, cyclically permuted, to YjhT from Escherichia coli. YjhT was shown to act as a mutarotase for sialic acid, and by this ability to be able to act as a virulence factor. Members of the YjhT family (TIGR03547) and this cyclically-permuted family have multiple repeats of the beta-propeller-forming Kelch repeat.
Probab=92.84  E-value=17  Score=39.70  Aligned_cols=145  Identities=10%  Similarity=0.066  Sum_probs=75.2

Q ss_pred             EEEEEECcCCccceEEEcCccccee-eeeeeeCCEEEEEEccC-----CeEEEEeCCCCcE--eEEEeccCccccCCccc
Q 003800           64 VIASLDLRHGEIFWRHVLGINDVVD-GIDIALGKYVITLSSDG-----STLRAWNLPDGQM--VWESFLRGSKHSKPLLL  135 (794)
Q Consensus        64 ~l~ALn~~tG~ivWR~~l~~~~~i~-~l~~~~g~~~V~Vs~~g-----~~v~A~d~~tG~l--lWe~~l~~~~~s~~~~~  135 (794)
                      .++.|+..+.+..|+..-+-+..-. +..+..++.++++||..     ..+..+|..+.+-  .|+..-.-+.     +.
T Consensus        40 ~v~~~~~~~~~~~W~~~~~lp~~r~~~~~~~~~~~lyviGG~~~~~~~~~v~~~d~~~~~w~~~~~~~~~lp~-----~~  114 (323)
T TIGR03548        40 GIYIAKDENSNLKWVKDGQLPYEAAYGASVSVENGIYYIGGSNSSERFSSVYRITLDESKEELICETIGNLPF-----TF  114 (323)
T ss_pred             eeEEEecCCCceeEEEcccCCccccceEEEEECCEEEEEcCCCCCCCceeEEEEEEcCCceeeeeeEcCCCCc-----Cc
Confidence            4666653344567987543332111 11134577777777642     3688888877653  4443211111     01


Q ss_pred             cccccccccCCeEEEEEC-------CEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecC-CceeEEEEE
Q 003800          136 VPTNLKVDKDSLILVSSK-------GCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAG-SSQFHAYQI  207 (794)
Q Consensus       136 ~~~~~~~~~~~~V~V~~~-------g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g-~~~~~v~al  207 (794)
                      .... ....++.+||..+       ..++++|..+.  .|+.-.+.+............++.+|+++... .....+.++
T Consensus       115 ~~~~-~~~~~~~iYv~GG~~~~~~~~~v~~yd~~~~--~W~~~~~~p~~~r~~~~~~~~~~~iYv~GG~~~~~~~~~~~y  191 (323)
T TIGR03548       115 ENGS-ACYKDGTLYVGGGNRNGKPSNKSYLFNLETQ--EWFELPDFPGEPRVQPVCVKLQNELYVFGGGSNIAYTDGYKY  191 (323)
T ss_pred             cCce-EEEECCEEEEEeCcCCCccCceEEEEcCCCC--CeeECCCCCCCCCCcceEEEECCEEEEEcCCCCccccceEEE
Confidence            1000 1122567777642       36888998765  48864432211001111124678999987432 112346789


Q ss_pred             EcCCCceeeee
Q 003800          208 NAMNGELLNHE  218 (794)
Q Consensus       208 d~~tG~~~w~~  218 (794)
                      |+.+.  .|+.
T Consensus       192 d~~~~--~W~~  200 (323)
T TIGR03548       192 SPKKN--QWQK  200 (323)
T ss_pred             ecCCC--eeEE
Confidence            99875  4765


No 70 
>COG4257 Vgb Streptogramin lyase [Defense mechanisms]
Probab=92.79  E-value=6.9  Score=41.64  Aligned_cols=193  Identities=13%  Similarity=0.092  Sum_probs=103.0

Q ss_pred             CCE-EEEEeCCCEEEEEECcCCccceEEEcCcccceeeeee-eeCCEEEEEEccCCeEEEEeCCCCc-EeEEEeccCccc
Q 003800           53 RKR-VVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDI-ALGKYVITLSSDGSTLRAWNLPDGQ-MVWESFLRGSKH  129 (794)
Q Consensus        53 ~~~-Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~-~~g~~~V~Vs~~g~~v~A~d~~tG~-llWe~~l~~~~~  129 (794)
                      ++. =|.+...+.+.-||++||+.. ++.|.+...-.++.+ ..+.-.++  ..+.-+.-+|.+++. ..|...+.-...
T Consensus        72 dG~VWft~qg~gaiGhLdP~tGev~-~ypLg~Ga~Phgiv~gpdg~~Wit--d~~~aI~R~dpkt~evt~f~lp~~~a~~  148 (353)
T COG4257          72 DGAVWFTAQGTGAIGHLDPATGEVE-TYPLGSGASPHGIVVGPDGSAWIT--DTGLAIGRLDPKTLEVTRFPLPLEHADA  148 (353)
T ss_pred             CCceEEecCccccceecCCCCCceE-EEecCCCCCCceEEECCCCCeeEe--cCcceeEEecCcccceEEeecccccCCC
Confidence            554 455667899999999999864 556655422222211 12333444  323368888887764 455555332111


Q ss_pred             cCCccccccccccccCCeEEEE-ECCEEEEEECCCCcE-EEEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEE
Q 003800          130 SKPLLLVPTNLKVDKDSLILVS-SKGCLHAVSSIDGEI-LWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQI  207 (794)
Q Consensus       130 s~~~~~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG~~-~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~al  207 (794)
                        -+...    ..+..+.+.+. ..|.-=+||+.++.+ +|.....  .- ++.+. ...++.||+.++.|+   .+.-+
T Consensus       149 --nlet~----vfD~~G~lWFt~q~G~yGrLdPa~~~i~vfpaPqG--~g-pyGi~-atpdGsvwyaslagn---aiari  215 (353)
T COG4257         149 --NLETA----VFDPWGNLWFTGQIGAYGRLDPARNVISVFPAPQG--GG-PYGIC-ATPDGSVWYASLAGN---AIARI  215 (353)
T ss_pred             --cccce----eeCCCccEEEeeccccceecCcccCceeeeccCCC--CC-CcceE-ECCCCcEEEEecccc---ceEEc
Confidence              11111    22334555443 455555889888764 3554432  21 33443 467899999999987   68889


Q ss_pred             EcCCCceeeeeeeecccCccC-c-eEEE-cCcEEEEEECCCCeEEEEEeecceeeeEEEeec
Q 003800          208 NAMNGELLNHETAAFSGGFVG-D-VALV-SSDTLVTLDTTRSILVTVSFKNRKIAFQETHLS  266 (794)
Q Consensus       208 d~~tG~~~w~~~v~~~~~~s~-~-~~~v-g~~~lv~~d~~~g~L~v~~l~sg~~~~~~~~l~  266 (794)
                      |+.+|..   ..+..|..+.. + -+-+ ..+-+-..+-.+++++..|-.+.+  ..+-+|-
T Consensus       216 dp~~~~a---ev~p~P~~~~~gsRriwsdpig~~wittwg~g~l~rfdPs~~s--W~eypLP  272 (353)
T COG4257         216 DPFAGHA---EVVPQPNALKAGSRRIWSDPIGRAWITTWGTGSLHRFDPSVTS--WIEYPLP  272 (353)
T ss_pred             ccccCCc---ceecCCCcccccccccccCccCcEEEeccCCceeeEeCccccc--ceeeeCC
Confidence            9999932   22344443222 1 1100 111222233445666666655544  5555653


No 71 
>PRK14131 N-acetylneuraminic acid mutarotase; Provisional
Probab=92.56  E-value=19  Score=40.39  Aligned_cols=70  Identities=11%  Similarity=0.197  Sum_probs=42.6

Q ss_pred             CCEEEEEeC--CCEEEEEECcCCccceEEEcCccc--ceeeeeeeeCCEEEEEEccC-----------CeEEEEeCCCCc
Q 003800           53 RKRVVVSTE--ENVIASLDLRHGEIFWRHVLGIND--VVDGIDIALGKYVITLSSDG-----------STLRAWNLPDGQ  117 (794)
Q Consensus        53 ~~~Vyv~t~--~g~l~ALn~~tG~ivWR~~l~~~~--~i~~l~~~~g~~~V~Vs~~g-----------~~v~A~d~~tG~  117 (794)
                      ++.||+...  .+.+..+|.++-+-.|+..-+-+.  ......+..++.++++||..           ..+..||+.+. 
T Consensus        38 ~~~iyv~gG~~~~~~~~~d~~~~~~~W~~l~~~p~~~r~~~~~v~~~~~IYV~GG~~~~~~~~~~~~~~~v~~YD~~~n-  116 (376)
T PRK14131         38 NNTVYVGLGSAGTSWYKLDLNAPSKGWTKIAAFPGGPREQAVAAFIDGKLYVFGGIGKTNSEGSPQVFDDVYKYDPKTN-  116 (376)
T ss_pred             CCEEEEEeCCCCCeEEEEECCCCCCCeEECCcCCCCCcccceEEEECCEEEEEcCCCCCCCCCceeEcccEEEEeCCCC-
Confidence            678998654  367889998766667986443221  11111134566666667642           24788888764 


Q ss_pred             EeEEEec
Q 003800          118 MVWESFL  124 (794)
Q Consensus       118 llWe~~l  124 (794)
                       .|+.-.
T Consensus       117 -~W~~~~  122 (376)
T PRK14131        117 -SWQKLD  122 (376)
T ss_pred             -EEEeCC
Confidence             588753


No 72 
>KOG0266 consensus WD40 repeat-containing protein [General function prediction only]
Probab=92.48  E-value=15  Score=42.51  Aligned_cols=193  Identities=12%  Similarity=0.106  Sum_probs=98.3

Q ss_pred             CCCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccC
Q 003800           52 GRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSK  131 (794)
Q Consensus        52 ~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~  131 (794)
                      ++..+..++.+..|...|.+.+...=|...+....+..+.....+..++-++.++.+|.||..+|+.+=......... .
T Consensus       214 d~~~l~s~s~D~tiriwd~~~~~~~~~~l~gH~~~v~~~~f~p~g~~i~Sgs~D~tvriWd~~~~~~~~~l~~hs~~i-s  292 (456)
T KOG0266|consen  214 DGSYLLSGSDDKTLRIWDLKDDGRNLKTLKGHSTYVTSVAFSPDGNLLVSGSDDGTVRIWDVRTGECVRKLKGHSDGI-S  292 (456)
T ss_pred             CCcEEEEecCCceEEEeeccCCCeEEEEecCCCCceEEEEecCCCCEEEEecCCCcEEEEeccCCeEEEeeeccCCce-E
Confidence            344677788899999888844422223222222223222122222444445567899999999999998877766543 1


Q ss_pred             CccccccccccccCCeEEE-E-ECCEEEEEECCCCcEE--EEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEE
Q 003800          132 PLLLVPTNLKVDKDSLILV-S-SKGCLHAVSSIDGEIL--WTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQI  207 (794)
Q Consensus       132 ~~~~~~~~~~~~~~~~V~V-~-~~g~l~ald~~tG~~~--W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~al  207 (794)
                      ...+       ..++..++ . .++.+...|..+|..+  =+............+..+..+..++....++    .+.-.
T Consensus       293 ~~~f-------~~d~~~l~s~s~d~~i~vwd~~~~~~~~~~~~~~~~~~~~~~~~~fsp~~~~ll~~~~d~----~~~~w  361 (456)
T KOG0266|consen  293 GLAF-------SPDGNLLVSASYDGTIRVWDLETGSKLCLKLLSGAENSAPVTSVQFSPNGKYLLSASLDR----TLKLW  361 (456)
T ss_pred             EEEE-------CCCCCEEEEcCCCccEEEEECCCCceeeeecccCCCCCCceeEEEECCCCcEEEEecCCC----eEEEE
Confidence            1111       11344444 4 3999999999999843  1111111110012222122233333332232    56667


Q ss_pred             EcCCCceeeeeeeeccc--CccCceEEEcCcEEEEEECCCCeEEEEEeecce
Q 003800          208 NAMNGELLNHETAAFSG--GFVGDVALVSSDTLVTLDTTRSILVTVSFKNRK  257 (794)
Q Consensus       208 d~~tG~~~w~~~v~~~~--~~s~~~~~vg~~~lv~~d~~~g~L~v~~l~sg~  257 (794)
                      |..+|...-++......  .+...+...++..++... ..+.++.-++.++.
T Consensus       362 ~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i~sg~-~d~~v~~~~~~s~~  412 (456)
T KOG0266|consen  362 DLRSGKSVGTYTGHSNLVRCIFSPTLSTGGKLIYSGS-EDGSVYVWDSSSGG  412 (456)
T ss_pred             EccCCcceeeecccCCcceeEecccccCCCCeEEEEe-CCceEEEEeCCccc
Confidence            88888877666422211  011111111333333332 34566677766654


No 73 
>KOG0278 consensus Serine/threonine kinase receptor-associated protein [Lipid transport and metabolism]
Probab=92.27  E-value=3.4  Score=43.17  Aligned_cols=108  Identities=18%  Similarity=0.240  Sum_probs=74.7

Q ss_pred             CCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEE-ECCEEEEEECCCCcEEEEEecc
Q 003800           95 GKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVS-SKGCLHAVSSIDGEILWTRDFA  173 (794)
Q Consensus        95 g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG~~~W~~~~~  173 (794)
                      .+..+.-+++++.||.||..+|...=+..+..++-  ++++..       ++.++.. .++.+.-.|+++=.++=+++.|
T Consensus       154 eD~~iLSSadd~tVRLWD~rTgt~v~sL~~~s~Vt--SlEvs~-------dG~ilTia~gssV~Fwdaksf~~lKs~k~P  224 (334)
T KOG0278|consen  154 EDKCILSSADDKTVRLWDHRTGTEVQSLEFNSPVT--SLEVSQ-------DGRILTIAYGSSVKFWDAKSFGLLKSYKMP  224 (334)
T ss_pred             cCceEEeeccCCceEEEEeccCcEEEEEecCCCCc--ceeecc-------CCCEEEEecCceeEEeccccccceeeccCc
Confidence            33444435677899999999999998888877653  444443       5666665 5788889999988888888877


Q ss_pred             CcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeee
Q 003800          174 AESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHE  218 (794)
Q Consensus       174 ~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~  218 (794)
                      -... ...+   .-...+||.   |+..++++-+|-.||+.+-.+
T Consensus       225 ~nV~-SASL---~P~k~~fVa---Gged~~~~kfDy~TgeEi~~~  262 (334)
T KOG0278|consen  225 CNVE-SASL---HPKKEFFVA---GGEDFKVYKFDYNTGEEIGSY  262 (334)
T ss_pred             cccc-cccc---cCCCceEEe---cCcceEEEEEeccCCceeeec
Confidence            4321 1112   122356654   455579999999999988664


No 74 
>PHA02713 hypothetical protein; Provisional
Probab=92.14  E-value=3.5  Score=49.07  Aligned_cols=148  Identities=12%  Similarity=0.195  Sum_probs=83.5

Q ss_pred             CCEEEEEeCC------CEEEEEECcCCccceEEEcCccccee--eeeeeeCCEEEEEEccC-------------------
Q 003800           53 RKRVVVSTEE------NVIASLDLRHGEIFWRHVLGINDVVD--GIDIALGKYVITLSSDG-------------------  105 (794)
Q Consensus        53 ~~~Vyv~t~~------g~l~ALn~~tG~ivWR~~l~~~~~i~--~l~~~~g~~~V~Vs~~g-------------------  105 (794)
                      +++||+..+.      +.+..+|+++.  .|+..-+-+....  +. ++.++.+.++||..                   
T Consensus       351 ~g~IYviGG~~~~~~~~sve~Ydp~~~--~W~~~~~mp~~r~~~~~-~~~~g~IYviGG~~~~~~~~~~~~~~~~~~~~~  427 (557)
T PHA02713        351 DDTIYAIGGQNGTNVERTIECYTMGDD--KWKMLPDMPIALSSYGM-CVLDQYIYIIGGRTEHIDYTSVHHMNSIDMEED  427 (557)
T ss_pred             CCEEEEECCcCCCCCCceEEEEECCCC--eEEECCCCCcccccccE-EEECCEEEEEeCCCccccccccccccccccccc
Confidence            7789987763      34888999987  5997443221111  22 24566666667642                   


Q ss_pred             ----CeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEEEC--------CEEEEEECCC-CcEEEEEec
Q 003800          106 ----STLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSSK--------GCLHAVSSID-GEILWTRDF  172 (794)
Q Consensus       106 ----~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~~--------g~l~ald~~t-G~~~W~~~~  172 (794)
                          ..+..||+.+.  .|+.-..-.......   +   .+.-++.+||.++        ..+.++|+.+ .  .|+.-.
T Consensus       428 ~~~~~~ve~YDP~td--~W~~v~~m~~~r~~~---~---~~~~~~~IYv~GG~~~~~~~~~~ve~Ydp~~~~--~W~~~~  497 (557)
T PHA02713        428 THSSNKVIRYDTVNN--IWETLPNFWTGTIRP---G---VVSHKDDIYVVCDIKDEKNVKTCIFRYNTNTYN--GWELIT  497 (557)
T ss_pred             ccccceEEEECCCCC--eEeecCCCCcccccC---c---EEEECCEEEEEeCCCCCCccceeEEEecCCCCC--CeeEcc
Confidence                35888999876  587543221110111   1   1222677888642        2467899887 3  498654


Q ss_pred             cCcce-eeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeee
Q 003800          173 AAESV-EVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHE  218 (794)
Q Consensus       173 ~~~~~-~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~  218 (794)
                      +.+.- ....+  +.-++.+|++|...+. ..+-++|+.|++  |+.
T Consensus       498 ~m~~~r~~~~~--~~~~~~iyv~Gg~~~~-~~~e~yd~~~~~--W~~  539 (557)
T PHA02713        498 TTESRLSALHT--ILHDNTIMMLHCYESY-MLQDTFNVYTYE--WNH  539 (557)
T ss_pred             ccCccccccee--EEECCEEEEEeeecce-eehhhcCccccc--ccc
Confidence            43321 01112  3468999998754322 246778877653  554


No 75 
>PF14269 Arylsulfotran_2:  Arylsulfotransferase (ASST)
Probab=91.97  E-value=5.1  Score=43.68  Aligned_cols=147  Identities=13%  Similarity=0.199  Sum_probs=83.0

Q ss_pred             CCEEEEEECcCCccceEEEcCccc-----c---------------------eeeeeeeeCCEEEEEEc-cCCeEEEEeCC
Q 003800           62 ENVIASLDLRHGEIFWRHVLGIND-----V---------------------VDGIDIALGKYVITLSS-DGSTLRAWNLP  114 (794)
Q Consensus        62 ~g~l~ALn~~tG~ivWR~~l~~~~-----~---------------------i~~l~~~~g~~~V~Vs~-~g~~v~A~d~~  114 (794)
                      ++.+.-+|++||+++|+-....-.     .                     +..+. ...++-+.||. .-..|+.+|..
T Consensus        95 d~~~~EiDi~TgevlfeW~a~DH~~~~~~~~~~~~~~~~g~~~~~~~D~~HiNsV~-~~~~G~yLiS~R~~~~i~~I~~~  173 (299)
T PF14269_consen   95 DDVFQEIDIETGEVLFEWSASDHVDPNDSYDSQDPLPGSGGSSSFPWDYFHINSVD-KDDDGDYLISSRNTSTIYKIDPS  173 (299)
T ss_pred             cceeEEeccCCCCEEEEEEhhheecccccccccccccCCCcCCCCCCCccEeeeee-ecCCccEEEEecccCEEEEEECC
Confidence            567889999999999998753210     0                     00111 12233345565 45789999999


Q ss_pred             CCcEeEEEecc-Ccc-------c--cCCccccccccccccCCeEEEEE------------CCEEEEEECCCCcEEEEEec
Q 003800          115 DGQMVWESFLR-GSK-------H--SKPLLLVPTNLKVDKDSLILVSS------------KGCLHAVSSIDGEILWTRDF  172 (794)
Q Consensus       115 tG~llWe~~l~-~~~-------~--s~~~~~~~~~~~~~~~~~V~V~~------------~g~l~ald~~tG~~~W~~~~  172 (794)
                      ||+++|+.... ...       .  +-++.+.+   .-..++.+.++.            .+.+..||..+..+.|..+.
T Consensus       174 tG~I~W~lgG~~~~df~~~~~~f~~QHdar~~~---~~~~~~~IslFDN~~~~~~~~~~s~~~v~~ld~~~~~~~~~~~~  250 (299)
T PF14269_consen  174 TGKIIWRLGGKRNSDFTLPATNFSWQHDARFLN---ESNDDGTISLFDNANSDFNGTEPSRGLVLELDPETMTVTLVREY  250 (299)
T ss_pred             CCcEEEEeCCCCCCcccccCCcEeeccCCEEec---cCCCCCEEEEEcCCCCCCCCCcCCCceEEEEECCCCEEEEEEEe
Confidence            99999998654 111       1  11222221   001133444432            46899999997776665543


Q ss_pred             c---Ccceee----eeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeeee
Q 003800          173 A---AESVEV----QQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHET  219 (794)
Q Consensus       173 ~---~~~~~~----~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~  219 (794)
                      .   .+-.++    .|.   ..++.+++.=..   ..++.-+++ +|+++|+..
T Consensus       251 ~~~~~~~~s~~~G~~Q~---L~nGn~li~~g~---~g~~~E~~~-~G~vv~~~~  297 (299)
T PF14269_consen  251 SDHPDGFYSPSQGSAQR---LPNGNVLIGWGN---NGRISEFTP-DGEVVWEAQ  297 (299)
T ss_pred             ecCCCcccccCCCcceE---CCCCCEEEecCC---CceEEEECC-CCCEEEEEE
Confidence            3   211111    122   234555542111   227888885 799999985


No 76 
>PRK05137 tolB translocation protein TolB; Provisional
Probab=91.67  E-value=29  Score=39.71  Aligned_cols=188  Identities=15%  Similarity=0.075  Sum_probs=90.5

Q ss_pred             cCCCEEEEEeC---CCEEEEEECcCCccceEEEcCcccceeeeeee-eCCEEEEEEcc--CCeEEEEeCCCCcEeEEEec
Q 003800           51 TGRKRVVVSTE---ENVIASLDLRHGEIFWRHVLGINDVVDGIDIA-LGKYVITLSSD--GSTLRAWNLPDGQMVWESFL  124 (794)
Q Consensus        51 ~~~~~Vyv~t~---~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~-~g~~~V~Vs~~--g~~v~A~d~~tG~llWe~~l  124 (794)
                      +++++|+..+.   ...|+.+|.++|+.  ++.....+.+...... .|+.+++....  ...++.||..+|.+.   ++
T Consensus       211 pDG~~lay~s~~~g~~~i~~~dl~~g~~--~~l~~~~g~~~~~~~SPDG~~la~~~~~~g~~~Iy~~d~~~~~~~---~L  285 (435)
T PRK05137        211 PNRQEITYMSYANGRPRVYLLDLETGQR--ELVGNFPGMTFAPRFSPDGRKVVMSLSQGGNTDIYTMDLRSGTTT---RL  285 (435)
T ss_pred             CCCCEEEEEEecCCCCEEEEEECCCCcE--EEeecCCCcccCcEECCCCCEEEEEEecCCCceEEEEECCCCceE---Ec
Confidence            34556655543   46899999999864  3322222222222122 45555554332  246999999988753   23


Q ss_pred             cCcc-ccCCccccccccccccCCeEEEEE----CCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCC
Q 003800          125 RGSK-HSKPLLLVPTNLKVDKDSLILVSS----KGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGS  199 (794)
Q Consensus       125 ~~~~-~s~~~~~~~~~~~~~~~~~V~V~~----~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~  199 (794)
                      .... ....+..     ..+ ++.+++.+    ...++.+|..+|++.--..... ..  ..+..+.++..+++....++
T Consensus       286 t~~~~~~~~~~~-----spD-G~~i~f~s~~~g~~~Iy~~d~~g~~~~~lt~~~~-~~--~~~~~SpdG~~ia~~~~~~~  356 (435)
T PRK05137        286 TDSPAIDTSPSY-----SPD-GSQIVFESDRSGSPQLYVMNADGSNPRRISFGGG-RY--STPVWSPRGDLIAFTKQGGG  356 (435)
T ss_pred             cCCCCccCceeE-----cCC-CCEEEEEECCCCCCeEEEEECCCCCeEEeecCCC-cc--cCeEECCCCCEEEEEEcCCC
Confidence            2211 1011111     123 23344433    2379999988776543221111 11  11212345566665554332


Q ss_pred             ceeEEEEEEcCCCceeeeeeeecccCccCceEEE-cCcEEEEEECCC-----CeEEEEEeecce
Q 003800          200 SQFHAYQINAMNGELLNHETAAFSGGFVGDVALV-SSDTLVTLDTTR-----SILVTVSFKNRK  257 (794)
Q Consensus       200 ~~~~v~ald~~tG~~~w~~~v~~~~~~s~~~~~v-g~~~lv~~d~~~-----g~L~v~~l~sg~  257 (794)
                       ...+..+|+.+|...   .+...... +.+.+- .+..+++.....     ..|+.+++..+.
T Consensus       357 -~~~i~~~d~~~~~~~---~lt~~~~~-~~p~~spDG~~i~~~~~~~~~~~~~~L~~~dl~g~~  415 (435)
T PRK05137        357 -QFSIGVMKPDGSGER---ILTSGFLV-EGPTWAPNGRVIMFFRQTPGSGGAPKLYTVDLTGRN  415 (435)
T ss_pred             -ceEEEEEECCCCceE---eccCCCCC-CCCeECCCCCEEEEEEccCCCCCcceEEEEECCCCc
Confidence             346778887666532   11112122 222222 334444443222     368888887665


No 77 
>KOG3881 consensus Uncharacterized conserved protein [Function unknown]
Probab=91.65  E-value=7.5  Score=43.01  Aligned_cols=188  Identities=11%  Similarity=0.175  Sum_probs=107.7

Q ss_pred             cCCCEEEEEeCCCEEEEEECcCCc-----cceEEEcCcccceeeeeee-eCCEEEEEEccC--CeEEEEeCCCCcEeEEE
Q 003800           51 TGRKRVVVSTEENVIASLDLRHGE-----IFWRHVLGINDVVDGIDIA-LGKYVITLSSDG--STLRAWNLPDGQMVWES  122 (794)
Q Consensus        51 ~~~~~Vyv~t~~g~l~ALn~~tG~-----ivWR~~l~~~~~i~~l~~~-~g~~~V~Vs~~g--~~v~A~d~~tG~llWe~  122 (794)
                      ..++.|++...+|.+...+.+.|.     .+|-+..+.-   ..++-. ....++..||.-  ..+--||.+.++.+|+.
T Consensus       113 ~~dg~Litc~~sG~l~~~~~k~~d~hss~l~~la~g~g~---~~~r~~~~~p~Iva~GGke~~n~lkiwdle~~~qiw~a  189 (412)
T KOG3881|consen  113 LADGTLITCVSSGNLQVRHDKSGDLHSSKLIKLATGPGL---YDVRQTDTDPYIVATGGKENINELKIWDLEQSKQIWSA  189 (412)
T ss_pred             hcCCEEEEEecCCcEEEEeccCCccccccceeeecCCce---eeeccCCCCCceEecCchhcccceeeeecccceeeeec
Confidence            347789999999998888888554     5555444221   112111 233455546644  57999999999999997


Q ss_pred             eccC-ccc-------cCCccccccccccccCCeEEEE--ECCEEEEEECCCCc-EEEEEeccCcceeeeeEEEEecCCEE
Q 003800          123 FLRG-SKH-------SKPLLLVPTNLKVDKDSLILVS--SKGCLHAVSSIDGE-ILWTRDFAAESVEVQQVIQLDESDQI  191 (794)
Q Consensus       123 ~l~~-~~~-------s~~~~~~~~~~~~~~~~~V~V~--~~g~l~ald~~tG~-~~W~~~~~~~~~~~~~~v~s~~~~~v  191 (794)
                      .=-. ..+       -.++.+++     ......|+.  .-+.|+-+|...|+ ++=+++.....++...+.  ..++.+
T Consensus       190 KNvpnD~L~LrVPvW~tdi~Fl~-----g~~~~~fat~T~~hqvR~YDt~~qRRPV~~fd~~E~~is~~~l~--p~gn~I  262 (412)
T KOG3881|consen  190 KNVPNDRLGLRVPVWITDIRFLE-----GSPNYKFATITRYHQVRLYDTRHQRRPVAQFDFLENPISSTGLT--PSGNFI  262 (412)
T ss_pred             cCCCCccccceeeeeeccceecC-----CCCCceEEEEecceeEEEecCcccCcceeEeccccCcceeeeec--CCCcEE
Confidence            6321 111       11222222     112455554  37899999999885 666666654433222222  356778


Q ss_pred             EEEEecCCceeEEEEEEcCCCceeeeeeeecccCccCce--EEE--cCcEEEEEECCCCeEEEEEeecce
Q 003800          192 YVVGYAGSSQFHAYQINAMNGELLNHETAAFSGGFVGDV--ALV--SSDTLVTLDTTRSILVTVSFKNRK  257 (794)
Q Consensus       192 yv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~~~~s~~~--~~v--g~~~lv~~d~~~g~L~v~~l~sg~  257 (794)
                      |+....|    .+..||..+|+..-...    .+++|++  +.+  +..+++..- -...+.+.|+++.+
T Consensus       263 y~gn~~g----~l~~FD~r~~kl~g~~~----kg~tGsirsih~hp~~~~las~G-LDRyvRIhD~ktrk  323 (412)
T KOG3881|consen  263 YTGNTKG----QLAKFDLRGGKLLGCGL----KGITGSIRSIHCHPTHPVLASCG-LDRYVRIHDIKTRK  323 (412)
T ss_pred             EEecccc----hhheecccCceeecccc----CCccCCcceEEEcCCCceEEeec-cceeEEEeecccch
Confidence            8755555    79999999998875531    1233311  111  223333221 12457888887743


No 78 
>PTZ00420 coronin; Provisional
Probab=91.23  E-value=21  Score=42.57  Aligned_cols=69  Identities=4%  Similarity=0.091  Sum_probs=48.2

Q ss_pred             EEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccC
Q 003800           56 VVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRG  126 (794)
Q Consensus        56 Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~  126 (794)
                      +..++.+|.|.-.|.++|+.+++....  ..+..+.....+.+++.++.++.++.||+.+|+.+-+...+.
T Consensus       141 LaSgS~DgtIrIWDl~tg~~~~~i~~~--~~V~SlswspdG~lLat~s~D~~IrIwD~Rsg~~i~tl~gH~  209 (568)
T PTZ00420        141 MCSSGFDSFVNIWDIENEKRAFQINMP--KKLSSLKWNIKGNLLSGTCVGKHMHIIDPRKQEIASSFHIHD  209 (568)
T ss_pred             EEEEeCCCeEEEEECCCCcEEEEEecC--CcEEEEEECCCCCEEEEEecCCEEEEEECCCCcEEEEEeccc
Confidence            346778999999999999988776543  234433223344455546556799999999999986665543


No 79 
>PLN00181 protein SPA1-RELATED; Provisional
Probab=90.74  E-value=52  Score=40.91  Aligned_cols=106  Identities=11%  Similarity=0.125  Sum_probs=65.4

Q ss_pred             CCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeee-eCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccC
Q 003800           53 RKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIA-LGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSK  131 (794)
Q Consensus        53 ~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~-~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~  131 (794)
                      ...+++++.+|.|...|..+|+.+....-.. ..+..+... .++..++.++.++.++.||..+|..+-.........  
T Consensus       545 ~~~las~~~Dg~v~lWd~~~~~~~~~~~~H~-~~V~~l~~~p~~~~~L~Sgs~Dg~v~iWd~~~~~~~~~~~~~~~v~--  621 (793)
T PLN00181        545 KSQVASSNFEGVVQVWDVARSQLVTEMKEHE-KRVWSIDYSSADPTLLASGSDDGSVKLWSINQGVSIGTIKTKANIC--  621 (793)
T ss_pred             CCEEEEEeCCCeEEEEECCCCeEEEEecCCC-CCEEEEEEcCCCCCEEEEEcCCCEEEEEECCCCcEEEEEecCCCeE--
Confidence            4568888889999999999998887764322 234444222 233455546667899999999998765554332211  


Q ss_pred             CccccccccccccCCeEEEE-ECCEEEEEECCCCcE
Q 003800          132 PLLLVPTNLKVDKDSLILVS-SKGCLHAVSSIDGEI  166 (794)
Q Consensus       132 ~~~~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG~~  166 (794)
                      ...+.     ...+..+++. .+|.++..|..+++.
T Consensus       622 ~v~~~-----~~~g~~latgs~dg~I~iwD~~~~~~  652 (793)
T PLN00181        622 CVQFP-----SESGRSLAFGSADHKVYYYDLRNPKL  652 (793)
T ss_pred             EEEEe-----CCCCCEEEEEeCCCeEEEEECCCCCc
Confidence            01110     1112333444 489999999887753


No 80 
>TIGR03547 muta_rot_YjhT mutarotase, YjhT family. Members of this protein family contain multiple copies of the beta-propeller-forming Kelch repeat. All are full-length homologs to YjhT of Escherichia coli, which has been identified as a mutarotase for sialic acid. This protein improves bacterial ability to obtain host sialic acid, and thus serves as a virulence factor. Some bacteria carry what appears to be a cyclically permuted homolog of this protein.
Probab=89.96  E-value=29  Score=38.22  Aligned_cols=160  Identities=9%  Similarity=0.096  Sum_probs=86.6

Q ss_pred             CCEEEEEeCC--CEEEEEECcCCccceEEEcCccc--cee-eeeeeeCCEEEEEEccC-----------CeEEEEeCCCC
Q 003800           53 RKRVVVSTEE--NVIASLDLRHGEIFWRHVLGIND--VVD-GIDIALGKYVITLSSDG-----------STLRAWNLPDG  116 (794)
Q Consensus        53 ~~~Vyv~t~~--g~l~ALn~~tG~ivWR~~l~~~~--~i~-~l~~~~g~~~V~Vs~~g-----------~~v~A~d~~tG  116 (794)
                      ++.||+....  +.+..+|+++.+-.|+...+-+.  ... ++ +..++.+.++||..           ..+..||+.+.
T Consensus        17 ~~~vyv~GG~~~~~~~~~d~~~~~~~W~~l~~~p~~~R~~~~~-~~~~~~iYv~GG~~~~~~~~~~~~~~~v~~Yd~~~~   95 (346)
T TIGR03547        17 GDKVYVGLGSAGTSWYKLDLKKPSKGWQKIADFPGGPRNQAVA-AAIDGKLYVFGGIGKANSEGSPQVFDDVYRYDPKKN   95 (346)
T ss_pred             CCEEEEEccccCCeeEEEECCCCCCCceECCCCCCCCcccceE-EEECCEEEEEeCCCCCCCCCcceecccEEEEECCCC
Confidence            6788886653  57888998766778997554321  111 22 34577777777742           24777888654


Q ss_pred             cEeEEEeccCccccCCccccccccccccCCeEEEEE--C---------------------------------------CE
Q 003800          117 QMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSS--K---------------------------------------GC  155 (794)
Q Consensus       117 ~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~--~---------------------------------------g~  155 (794)
                        .|+.-......  .  ..+.......++.|++..  +                                       ..
T Consensus        96 --~W~~~~~~~p~--~--~~~~~~~~~~~g~IYviGG~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  169 (346)
T TIGR03547        96 --SWQKLDTRSPV--G--LLGASGFSLHNGQAYFTGGVNKNIFDGYFADLSAADKDSEPKDKLIAAYFSQPPEDYFWNKN  169 (346)
T ss_pred             --EEecCCCCCCC--c--ccceeEEEEeCCEEEEEcCcChHHHHHHHhhHhhcCccchhhhhhHHHHhCCChhHcCccce
Confidence              48764321110  0  111000101256777763  1                                       35


Q ss_pred             EEEEECCCCcEEEEEeccCcc--eeeeeEEEEecCCEEEEEEecCCc---eeEEEEEEcCCCceeeeeeeecc
Q 003800          156 LHAVSSIDGEILWTRDFAAES--VEVQQVIQLDESDQIYVVGYAGSS---QFHAYQINAMNGELLNHETAAFS  223 (794)
Q Consensus       156 l~ald~~tG~~~W~~~~~~~~--~~~~~~v~s~~~~~vyv~~~~g~~---~~~v~ald~~tG~~~w~~~v~~~  223 (794)
                      +..+|+.+.  .|+.-.+.+.  ..-..+  ..-++++|+++.....   ...+..+|.......|+..-.++
T Consensus       170 v~~YDp~t~--~W~~~~~~p~~~r~~~~~--~~~~~~iyv~GG~~~~~~~~~~~~~y~~~~~~~~W~~~~~m~  238 (346)
T TIGR03547       170 VLSYDPSTN--QWRNLGENPFLGTAGSAI--VHKGNKLLLINGEIKPGLRTAEVKQYLFTGGKLEWNKLPPLP  238 (346)
T ss_pred             EEEEECCCC--ceeECccCCCCcCCCceE--EEECCEEEEEeeeeCCCccchheEEEEecCCCceeeecCCCC
Confidence            777787664  5876544332  111112  2457899998753211   12345566666677798754443


No 81 
>KOG1445 consensus Tumor-specific antigen (contains WD repeats) [Cytoskeleton]
Probab=89.90  E-value=2.4  Score=49.29  Aligned_cols=150  Identities=16%  Similarity=0.217  Sum_probs=90.9

Q ss_pred             cCCCEEEEEeCCCEEEEEECcCCccceEEEcCccc---------------ceeeeee-eeCCEEEEEEccCCeEEEEeCC
Q 003800           51 TGRKRVVVSTEENVIASLDLRHGEIFWRHVLGIND---------------VVDGIDI-ALGKYVITLSSDGSTLRAWNLP  114 (794)
Q Consensus        51 ~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~---------------~i~~l~~-~~g~~~V~Vs~~g~~v~A~d~~  114 (794)
                      .++.++-|++++|.|-         +||...+.-.               -|..++. ....+++.++..+.+++.||..
T Consensus       638 FD~~rLAVa~ddg~i~---------lWr~~a~gl~e~~~tPe~~lt~h~eKI~slRfHPLAadvLa~asyd~Ti~lWDl~  708 (1012)
T KOG1445|consen  638 FDDERLAVATDDGQIN---------LWRLTANGLPENEMTPEKILTIHGEKITSLRFHPLAADVLAVASYDSTIELWDLA  708 (1012)
T ss_pred             CChHHeeecccCceEE---------EEEeccCCCCcccCCcceeeecccceEEEEEecchhhhHhhhhhccceeeeeehh
Confidence            4477899999999863         5776543220               1222221 1344566666777899999999


Q ss_pred             CCcEeEEEeccCccccCCccccccccccccCCeEEE-E-ECCEEEEEECCCCc-EEEEEeccCcceeeeeEEEEecCCEE
Q 003800          115 DGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILV-S-SKGCLHAVSSIDGE-ILWTRDFAAESVEVQQVIQLDESDQI  191 (794)
Q Consensus       115 tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V-~-~~g~l~ald~~tG~-~~W~~~~~~~~~~~~~~v~s~~~~~v  191 (794)
                      ++++.=+........ .+.       ++..++..+. . .||+|+.+++.+++ ++.+-+-+.+.. -.+++.+.++..+
T Consensus       709 ~~~~~~~l~gHtdqI-f~~-------AWSpdGr~~AtVcKDg~~rVy~Prs~e~pv~Eg~gpvgtR-gARi~wacdgr~v  779 (1012)
T KOG1445|consen  709 NAKLYSRLVGHTDQI-FGI-------AWSPDGRRIATVCKDGTLRVYEPRSREQPVYEGKGPVGTR-GARILWACDGRIV  779 (1012)
T ss_pred             hhhhhheeccCcCce-eEE-------EECCCCcceeeeecCceEEEeCCCCCCCccccCCCCccCc-ceeEEEEecCcEE
Confidence            999987776654432 111       2222343333 3 59999999999886 445544444332 3445445567777


Q ss_pred             EEEEecCCceeEEEEEEcC--CCceeeee
Q 003800          192 YVVGYAGSSQFHAYQINAM--NGELLNHE  218 (794)
Q Consensus       192 yv~~~~g~~~~~v~ald~~--tG~~~w~~  218 (794)
                      .++|++..+...+..+|++  .|.++...
T Consensus       780 iv~Gfdk~SeRQv~~Y~Aq~l~~~pl~t~  808 (1012)
T KOG1445|consen  780 IVVGFDKSSERQVQMYDAQTLDLRPLYTQ  808 (1012)
T ss_pred             EEecccccchhhhhhhhhhhccCCcceee
Confidence            7888876555556556655  34455444


No 82 
>KOG1539 consensus WD repeat protein [General function prediction only]
Probab=89.89  E-value=24  Score=42.68  Aligned_cols=155  Identities=14%  Similarity=0.167  Sum_probs=101.7

Q ss_pred             eccCCCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCcc
Q 003800           49 QKTGRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSK  128 (794)
Q Consensus        49 ~~~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~  128 (794)
                      |++.=++|.+++.+|.+.-+|-+||+++...+--.. .|..+..+-.=++|.+|-.+|+|.-+|...|+.+-+++..-+.
T Consensus       168 P~TYLNKIvvGs~~G~lql~Nvrt~K~v~~f~~~~s-~IT~ieqsPaLDVVaiG~~~G~ViifNlK~dkil~sFk~d~g~  246 (910)
T KOG1539|consen  168 PSTYLNKIVVGSSQGRLQLWNVRTGKVVYTFQEFFS-RITAIEQSPALDVVAIGLENGTVIIFNLKFDKILMSFKQDWGR  246 (910)
T ss_pred             chhheeeEEEeecCCcEEEEEeccCcEEEEeccccc-ceeEeccCCcceEEEEeccCceEEEEEcccCcEEEEEEccccc
Confidence            555578899999999999999999999988754332 2333322223468888887789999999999999999986222


Q ss_pred             ccCCccccccccccccCCeEEEE--ECCEEEEEECCCCcEEEEEeccC-cceeeeeEEEEecCCEEEEEEecCCceeEEE
Q 003800          129 HSKPLLLVPTNLKVDKDSLILVS--SKGCLHAVSSIDGEILWTRDFAA-ESVEVQQVIQLDESDQIYVVGYAGSSQFHAY  205 (794)
Q Consensus       129 ~s~~~~~~~~~~~~~~~~~V~V~--~~g~l~ald~~tG~~~W~~~~~~-~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~  205 (794)
                      . ..+.+.     .| +..+++.  ..|.+.-.|.+.-+..|...... +...-..+.   .+..|.+ +..+...+++.
T Consensus       247 V-tslSFr-----tD-G~p~las~~~~G~m~~wDLe~kkl~~v~~nah~~sv~~~~fl---~~epVl~-ta~~DnSlk~~  315 (910)
T KOG1539|consen  247 V-TSLSFR-----TD-GNPLLASGRSNGDMAFWDLEKKKLINVTRNAHYGSVTGATFL---PGEPVLV-TAGADNSLKVW  315 (910)
T ss_pred             e-eEEEec-----cC-CCeeEEeccCCceEEEEEcCCCeeeeeeeccccCCcccceec---CCCceEe-eccCCCceeEE
Confidence            2 122222     22 3344444  36889899988888888876443 221111221   2333443 22233568999


Q ss_pred             EEEcCCCcee
Q 003800          206 QINAMNGELL  215 (794)
Q Consensus       206 ald~~tG~~~  215 (794)
                      .+|..+|.++
T Consensus       316 vfD~~dg~pR  325 (910)
T KOG1539|consen  316 VFDSGDGVPR  325 (910)
T ss_pred             EeeCCCCcch
Confidence            9998888644


No 83 
>PLN02919 haloacid dehalogenase-like hydrolase family protein
Probab=89.86  E-value=19  Score=46.33  Aligned_cols=157  Identities=17%  Similarity=0.181  Sum_probs=86.4

Q ss_pred             CCEEEEEeC-CCEEEEEECcCCccceEEEcCcc------c---------ceeeeeeeeCCEEEEEE-ccCCeEEEEeCCC
Q 003800           53 RKRVVVSTE-ENVIASLDLRHGEIFWRHVLGIN------D---------VVDGIDIALGKYVITLS-SDGSTLRAWNLPD  115 (794)
Q Consensus        53 ~~~Vyv~t~-~g~l~ALn~~tG~ivWR~~l~~~------~---------~i~~l~~~~g~~~V~Vs-~~g~~v~A~d~~t  115 (794)
                      ++.+|++.. .+.|.-+|+.+|.+.  ......      +         ...++.....++.++|+ ..+++|+.||..+
T Consensus       694 ~g~LyVad~~~~~I~v~d~~~g~v~--~~~G~G~~~~~~g~~~~~~~~~~P~GIavspdG~~LYVADs~n~~Irv~D~~t  771 (1057)
T PLN02919        694 NEKVYIAMAGQHQIWEYNISDGVTR--VFSGDGYERNLNGSSGTSTSFAQPSGISLSPDLKELYIADSESSSIRALDLKT  771 (1057)
T ss_pred             CCeEEEEECCCCeEEEEECCCCeEE--EEecCCccccCCCCccccccccCccEEEEeCCCCEEEEEECCCCeEEEEECCC
Confidence            567888764 678999999888542  110000      0         01123222334445554 3457999999999


Q ss_pred             CcEeEEEeccCc------------cccC-CccccccccccccCCeEEEE--ECCEEEEEECCCCcEEEEEeccCc-----
Q 003800          116 GQMVWESFLRGS------------KHSK-PLLLVPTNLKVDKDSLILVS--SKGCLHAVSSIDGEILWTRDFAAE-----  175 (794)
Q Consensus       116 G~llWe~~l~~~------------~~s~-~~~~~~~~~~~~~~~~V~V~--~~g~l~ald~~tG~~~W~~~~~~~-----  175 (794)
                      |...|-......            .... .....|.....+.++.+||.  .++++..+|..+|.+.........     
T Consensus       772 g~~~~~~gg~~~~~~~l~~fG~~dG~g~~~~l~~P~Gvavd~dG~LYVADs~N~rIrviD~~tg~v~tiaG~G~~G~~dG  851 (1057)
T PLN02919        772 GGSRLLAGGDPTFSDNLFKFGDHDGVGSEVLLQHPLGVLCAKDGQIYVADSYNHKIKKLDPATKRVTTLAGTGKAGFKDG  851 (1057)
T ss_pred             CcEEEEEecccccCcccccccCCCCchhhhhccCCceeeEeCCCcEEEEECCCCEEEEEECCCCeEEEEeccCCcCCCCC
Confidence            887654321100            0000 00001111133445678886  388999999999987754432210     


Q ss_pred             -----ce-eeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCcee
Q 003800          176 -----SV-EVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELL  215 (794)
Q Consensus       176 -----~~-~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~  215 (794)
                           .+ .|..+. ...++.+|+.....+   .+..+|+.+|+..
T Consensus       852 ~~~~a~l~~P~GIa-vd~dG~lyVaDt~Nn---~Irvid~~~~~~~  893 (1057)
T PLN02919        852 KALKAQLSEPAGLA-LGENGRLFVADTNNS---LIRYLDLNKGEAA  893 (1057)
T ss_pred             cccccccCCceEEE-EeCCCCEEEEECCCC---EEEEEECCCCccc
Confidence                 01 233343 234677888654433   7888899998764


No 84 
>PF06433 Me-amine-dh_H:  Methylamine dehydrogenase heavy chain (MADH);  InterPro: IPR009451 Methylamine dehydrogenase (1.4.99.3 from EC) is a periplasmic quinoprotein found in several methylotrophic bacteria. It is induced when the bacteria are grown on methylamine as a carbon source, and it catalyses the oxidative deamination of amines to their corresponding aldehydes. The redox cofactor of this enzyme is tryptophan tryptophylquinone (TTQ). Electrons derived from the oxidation of methylamine are passed to an electron acceptor, which is usually the blue-copper protein amicyanin (IPR002386 from INTERPRO).  RCH2NH2 + H2O + acceptor = RCHO + NH3 + reduced acceptor  MADH is a hetero-tetramer composed of two heavy subunits and two light subunits. The heavy subunit forms a seven-bladed beta-propeller-like structure.; GO: 0030058 amine dehydrogenase activity, 0030416 methylamine metabolic process, 0055114 oxidation-reduction process, 0042597 periplasmic space; PDB: 3RN1_F 3SVW_F 3PXT_F 3L4O_F 3L4M_D 3SJL_F 3PXS_D 3ORV_F 3RMZ_F 3RLM_F ....
Probab=89.86  E-value=36  Score=37.68  Aligned_cols=195  Identities=14%  Similarity=0.118  Sum_probs=111.3

Q ss_pred             cCCCEEEE--EeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCC-CCcEeEEEeccCc
Q 003800           51 TGRKRVVV--STEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLP-DGQMVWESFLRGS  127 (794)
Q Consensus        51 ~~~~~Vyv--~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~-tG~llWe~~l~~~  127 (794)
                      .+++.+||  .|...-|..+|...++.+=  .++.++....+ +....+...++++| .+...... +|++. +....  
T Consensus       104 ~dgk~~~V~N~TPa~SVtVVDl~~~kvv~--ei~~PGC~~iy-P~~~~~F~~lC~DG-sl~~v~Ld~~Gk~~-~~~t~--  176 (342)
T PF06433_consen  104 ADGKFLYVQNFTPATSVTVVDLAAKKVVG--EIDTPGCWLIY-PSGNRGFSMLCGDG-SLLTVTLDADGKEA-QKSTK--  176 (342)
T ss_dssp             TTSSEEEEEEESSSEEEEEEETTTTEEEE--EEEGTSEEEEE-EEETTEEEEEETTS-CEEEEEETSTSSEE-EEEEE--
T ss_pred             cCCcEEEEEccCCCCeEEEEECCCCceee--eecCCCEEEEE-ecCCCceEEEecCC-ceEEEEECCCCCEe-Eeecc--
Confidence            34555666  4567789999999998863  34444433333 23445666667875 45544444 89997 43321  


Q ss_pred             cc--cCCcccccccccc-ccCC-eEEEEECCEEEEEECCCCcEEEEEeccC-------cceee--eeEEE-EecCCEEEE
Q 003800          128 KH--SKPLLLVPTNLKV-DKDS-LILVSSKGCLHAVSSIDGEILWTRDFAA-------ESVEV--QQVIQ-LDESDQIYV  193 (794)
Q Consensus       128 ~~--s~~~~~~~~~~~~-~~~~-~V~V~~~g~l~ald~~tG~~~W~~~~~~-------~~~~~--~~~v~-s~~~~~vyv  193 (794)
                      ..  ..++.+.. + .. ..++ .+|+...|.|+.+|.....+.|......       ..+.|  .|++. ....+.+|+
T Consensus       177 ~F~~~~dp~f~~-~-~~~~~~~~~~F~Sy~G~v~~~dlsg~~~~~~~~~~~~t~~e~~~~WrPGG~Q~~A~~~~~~rlyv  254 (342)
T PF06433_consen  177 VFDPDDDPLFEH-P-AYSRDGGRLYFVSYEGNVYSADLSGDSAKFGKPWSLLTDAEKADGWRPGGWQLIAYHAASGRLYV  254 (342)
T ss_dssp             ESSTTTS-B-S----EEETTTTEEEEEBTTSEEEEEEETTSSEEEEEEEESS-HHHHHTTEEE-SSS-EEEETTTTEEEE
T ss_pred             ccCCCCcccccc-c-ceECCCCeEEEEecCCEEEEEeccCCcccccCcccccCccccccCcCCcceeeeeeccccCeEEE
Confidence            11  12222211 0 11 1123 3444469999999988777665433211       12222  22221 235789998


Q ss_pred             EEecCC------ceeEEEEEEcCCCceeeeeeeecccCccCceEEE--cCcEEEEEECCCCeEEEEEeecce
Q 003800          194 VGYAGS------SQFHAYQINAMNGELLNHETAAFSGGFVGDVALV--SSDTLVTLDTTRSILVTVSFKNRK  257 (794)
Q Consensus       194 ~~~~g~------~~~~v~ald~~tG~~~w~~~v~~~~~~s~~~~~v--g~~~lv~~d~~~g~L~v~~l~sg~  257 (794)
                      +-..|.      ..-.|..+|++|++++-...+..+..   ++-+.  ....+++++..++.|.+.|..+|+
T Consensus       255 LMh~g~~gsHKdpgteVWv~D~~t~krv~Ri~l~~~~~---Si~Vsqd~~P~L~~~~~~~~~l~v~D~~tGk  323 (342)
T PF06433_consen  255 LMHQGGEGSHKDPGTEVWVYDLKTHKRVARIPLEHPID---SIAVSQDDKPLLYALSAGDGTLDVYDAATGK  323 (342)
T ss_dssp             EEEE--TT-TTS-EEEEEEEETTTTEEEEEEEEEEEES---EEEEESSSS-EEEEEETTTTEEEEEETTT--
T ss_pred             EecCCCCCCccCCceEEEEEECCCCeEEEEEeCCCccc---eEEEccCCCcEEEEEcCCCCeEEEEeCcCCc
Confidence            765552      24789999999999998776544421   12221  223777888777899999999998


No 85 
>KOG0274 consensus Cdc4 and related F-box and WD-40 proteins [General function prediction only]
Probab=89.78  E-value=30  Score=41.02  Aligned_cols=180  Identities=15%  Similarity=0.179  Sum_probs=114.3

Q ss_pred             CCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCC
Q 003800           53 RKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKP  132 (794)
Q Consensus        53 ~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~  132 (794)
                      ++.++.++.++.|-..|..+|..+=+.-.+..+.+.++....+++.++-++.+.++|-||..+|.-.=.........   
T Consensus       218 ~~~~~~~s~~~tl~~~~~~~~~~i~~~l~GH~g~V~~l~~~~~~~~lvsgS~D~t~rvWd~~sg~C~~~l~gh~stv---  294 (537)
T KOG0274|consen  218 DGFFKSGSDDSTLHLWDLNNGYLILTRLVGHFGGVWGLAFPSGGDKLVSGSTDKTERVWDCSTGECTHSLQGHTSSV---  294 (537)
T ss_pred             cCeEEecCCCceeEEeecccceEEEeeccCCCCCceeEEEecCCCEEEEEecCCcEEeEecCCCcEEEEecCCCceE---
Confidence            77788999999999999999987766555544555555444456666655557899999999998877766554432   


Q ss_pred             ccccccccccccCCeEEEE--ECCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcC
Q 003800          133 LLLVPTNLKVDKDSLILVS--SKGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAM  210 (794)
Q Consensus       133 ~~~~~~~~~~~~~~~V~V~--~~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~  210 (794)
                       ..+      ...+.+.+.  .|.+|.+-+..+|+.+=.......   +-..+ ....+.++..+.+|    .+-..|+.
T Consensus       295 -~~~------~~~~~~~~sgs~D~tVkVW~v~n~~~l~l~~~h~~---~V~~v-~~~~~~lvsgs~d~----~v~VW~~~  359 (537)
T KOG0274|consen  295 -RCL------TIDPFLLVSGSRDNTVKVWDVTNGACLNLLRGHTG---PVNCV-QLDEPLLVSGSYDG----TVKVWDPR  359 (537)
T ss_pred             -EEE------EccCceEeeccCCceEEEEeccCcceEEEeccccc---cEEEE-EecCCEEEEEecCc----eEEEEEhh
Confidence             111      113344444  377777777777766655542111   11222 23567777777666    68888999


Q ss_pred             CCceeeeeeeecccCccC--ceEEEcC-cEEEEEECCCCeEEEEEeecc
Q 003800          211 NGELLNHETAAFSGGFVG--DVALVSS-DTLVTLDTTRSILVTVSFKNR  256 (794)
Q Consensus       211 tG~~~w~~~v~~~~~~s~--~~~~vg~-~~lv~~d~~~g~L~v~~l~sg  256 (794)
                      +|+.+...+-     -++  .++++++ +.++-... ++.+.+=|+.++
T Consensus       360 ~~~cl~sl~g-----H~~~V~sl~~~~~~~~~Sgs~-D~~IkvWdl~~~  402 (537)
T KOG0274|consen  360 TGKCLKSLSG-----HTGRVYSLIVDSENRLLSGSL-DTTIKVWDLRTK  402 (537)
T ss_pred             hceeeeeecC-----CcceEEEEEecCcceEEeeee-ccceEeecCCch
Confidence            9988876642     112  2344565 55554443 366788888776


No 86 
>PF08450 SGL:  SMP-30/Gluconolactonase/LRE-like region;  InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE.; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=89.70  E-value=28  Score=36.22  Aligned_cols=142  Identities=14%  Similarity=0.156  Sum_probs=76.4

Q ss_pred             CCEEEEEeC-CCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccC
Q 003800           53 RKRVVVSTE-ENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSK  131 (794)
Q Consensus        53 ~~~Vyv~t~-~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~  131 (794)
                      ++.+|+.+- .+.|..+|+++|+.. ...++.   ..++.....++.++++..+ .++.+|..+|+..--........  
T Consensus        11 ~g~l~~~D~~~~~i~~~~~~~~~~~-~~~~~~---~~G~~~~~~~g~l~v~~~~-~~~~~d~~~g~~~~~~~~~~~~~--   83 (246)
T PF08450_consen   11 DGRLYWVDIPGGRIYRVDPDTGEVE-VIDLPG---PNGMAFDRPDGRLYVADSG-GIAVVDPDTGKVTVLADLPDGGV--   83 (246)
T ss_dssp             TTEEEEEETTTTEEEEEETTTTEEE-EEESSS---EEEEEEECTTSEEEEEETT-CEEEEETTTTEEEEEEEEETTCS--
T ss_pred             CCEEEEEEcCCCEEEEEECCCCeEE-EEecCC---CceEEEEccCCEEEEEEcC-ceEEEecCCCcEEEEeeccCCCc--
Confidence            677888874 789999999888652 222332   2233223234666666654 45666999996554444321110  


Q ss_pred             CccccccccccccCCeEEEEE--C--------CEEEEEECCCCcEEEEEe-ccCcceeeeeEEEEecCCEEEEEEecCCc
Q 003800          132 PLLLVPTNLKVDKDSLILVSS--K--------GCLHAVSSIDGEILWTRD-FAAESVEVQQVIQLDESDQIYVVGYAGSS  200 (794)
Q Consensus       132 ~~~~~~~~~~~~~~~~V~V~~--~--------g~l~ald~~tG~~~W~~~-~~~~~~~~~~~v~s~~~~~vyv~~~~g~~  200 (794)
                       ....+-+...+.++.+++..  .        |.|++++.. |++..... ...    +-.+..+.++..+|+.....+ 
T Consensus        84 -~~~~~ND~~vd~~G~ly~t~~~~~~~~~~~~g~v~~~~~~-~~~~~~~~~~~~----pNGi~~s~dg~~lyv~ds~~~-  156 (246)
T PF08450_consen   84 -PFNRPNDVAVDPDGNLYVTDSGGGGASGIDPGSVYRIDPD-GKVTVVADGLGF----PNGIAFSPDGKTLYVADSFNG-  156 (246)
T ss_dssp             -CTEEEEEEEE-TTS-EEEEEECCBCTTCGGSEEEEEEETT-SEEEEEEEEESS----EEEEEEETTSSEEEEEETTTT-
T ss_pred             -ccCCCceEEEcCCCCEEEEecCCCccccccccceEEECCC-CeEEEEecCccc----ccceEECCcchheeecccccc-
Confidence             01111122344467788853  1        789999988 76443332 222    223332345567887554433 


Q ss_pred             eeEEEEEEcC
Q 003800          201 QFHAYQINAM  210 (794)
Q Consensus       201 ~~~v~ald~~  210 (794)
                        ++..++..
T Consensus       157 --~i~~~~~~  164 (246)
T PF08450_consen  157 --RIWRFDLD  164 (246)
T ss_dssp             --EEEEEEEE
T ss_pred             --eeEEEecc
Confidence              56666664


No 87 
>KOG0285 consensus Pleiotropic regulator 1 [RNA processing and modification]
Probab=89.58  E-value=28  Score=38.34  Aligned_cols=232  Identities=15%  Similarity=0.195  Sum_probs=112.9

Q ss_pred             EEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCCcc
Q 003800           55 RVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLL  134 (794)
Q Consensus        55 ~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~  134 (794)
                      -+|-+.+++.|-|-|.+.-+++=.+- +.-..+.++...-..++++-++.+..+|.||..+-..+-......... ....
T Consensus       207 YlFs~gedk~VKCwDLe~nkvIR~Yh-GHlS~V~~L~lhPTldvl~t~grDst~RvWDiRtr~~V~~l~GH~~~V-~~V~  284 (460)
T KOG0285|consen  207 YLFSAGEDKQVKCWDLEYNKVIRHYH-GHLSGVYCLDLHPTLDVLVTGGRDSTIRVWDIRTRASVHVLSGHTNPV-ASVM  284 (460)
T ss_pred             eEEEecCCCeeEEEechhhhhHHHhc-cccceeEEEeccccceeEEecCCcceEEEeeecccceEEEecCCCCcc-eeEE
Confidence            37788888899999888765432210 000112233222234566656667899999998877776666443322 1111


Q ss_pred             ccccccccccCCeEEEEE-CCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCc
Q 003800          135 LVPTNLKVDKDSLILVSS-KGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGE  213 (794)
Q Consensus       135 ~~~~~~~~~~~~~V~V~~-~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~  213 (794)
                      .-      ..+..|+-.+ |+++.--|...|+..=+........  .-+. ..-....|+.+...    .+-+.+.-.|+
T Consensus       285 ~~------~~dpqvit~S~D~tvrlWDl~agkt~~tlt~hkksv--ral~-lhP~e~~fASas~d----nik~w~~p~g~  351 (460)
T KOG0285|consen  285 CQ------PTDPQVITGSHDSTVRLWDLRAGKTMITLTHHKKSV--RALC-LHPKENLFASASPD----NIKQWKLPEGE  351 (460)
T ss_pred             ee------cCCCceEEecCCceEEEeeeccCceeEeeeccccee--eEEe-cCCchhhhhccCCc----cceeccCCccc
Confidence            11      1145555554 7787777777776654433322111  1111 01111233222111    45566666666


Q ss_pred             eeeeeeeecccCccCceEEE-cCcEEEEEECCCCeEEEEEeecceeeeEEEeecccCCCCCCceEEeecCCcceeEEEec
Q 003800          214 LLNHETAAFSGGFVGDVALV-SSDTLVTLDTTRSILVTVSFKNRKIAFQETHLSNLGEDSSGMVEILPSSLTGMFTVKIN  292 (794)
Q Consensus       214 ~~w~~~v~~~~~~s~~~~~v-g~~~lv~~d~~~g~L~v~~l~sg~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~  292 (794)
                      .+-..  +....+-. ++-+ .+++++.. .++|.+..-|-++|. .+|...  ...++++.  +-    -.|.|.--.+
T Consensus       352 f~~nl--sgh~~iin-tl~~nsD~v~~~G-~dng~~~fwdwksg~-nyQ~~~--t~vqpGSl--~s----EagI~as~fD  418 (460)
T KOG0285|consen  352 FLQNL--SGHNAIIN-TLSVNSDGVLVSG-GDNGSIMFWDWKSGH-NYQRGQ--TIVQPGSL--ES----EAGIFASCFD  418 (460)
T ss_pred             hhhcc--ccccceee-eeeeccCceEEEc-CCceEEEEEecCcCc-cccccc--ccccCCcc--cc----ccceeEEeec
Confidence            55441  22211111 2222 33444443 357889888888887 444331  11111110  00    0122333222


Q ss_pred             C-cEEEEEEecCCcEEEEEeecC
Q 003800          293 N-YKLFIRLTSEDKLEVVHKVDH  314 (794)
Q Consensus       293 ~-~~~l~~~~~~~~~~v~~~~~~  314 (794)
                      . +.-|+.-+.+..+++++.++.
T Consensus       419 ktg~rlit~eadKtIk~~keDe~  441 (460)
T KOG0285|consen  419 KTGSRLITGEADKTIKMYKEDEH  441 (460)
T ss_pred             ccCceEEeccCCcceEEEecccc
Confidence            2 334555554556777776653


No 88 
>PF14269 Arylsulfotran_2:  Arylsulfotransferase (ASST)
Probab=89.56  E-value=6  Score=43.16  Aligned_cols=112  Identities=15%  Similarity=0.240  Sum_probs=66.4

Q ss_pred             CCCEEEEEeC-CCEEEEEECcCCccceEEEcCccc-------cee---eeeee---eCCEEEEE-Ec----------cCC
Q 003800           52 GRKRVVVSTE-ENVIASLDLRHGEIFWRHVLGIND-------VVD---GIDIA---LGKYVITL-SS----------DGS  106 (794)
Q Consensus        52 ~~~~Vyv~t~-~g~l~ALn~~tG~ivWR~~l~~~~-------~i~---~l~~~---~g~~~V~V-s~----------~g~  106 (794)
                      .++.+++.++ ...|+.+|++||+++||..=+...       ...   -.+..   .+++.+.+ =.          ..+
T Consensus       153 ~~G~yLiS~R~~~~i~~I~~~tG~I~W~lgG~~~~df~~~~~~f~~QHdar~~~~~~~~~~IslFDN~~~~~~~~~~s~~  232 (299)
T PF14269_consen  153 DDGDYLISSRNTSTIYKIDPSTGKIIWRLGGKRNSDFTLPATNFSWQHDARFLNESNDDGTISLFDNANSDFNGTEPSRG  232 (299)
T ss_pred             CCccEEEEecccCEEEEEECCCCcEEEEeCCCCCCcccccCCcEeeccCCEEeccCCCCCEEEEEcCCCCCCCCCcCCCc
Confidence            3556777776 589999999999999997433110       010   00111   12333332 11          236


Q ss_pred             eEEEEeCCCCcEeEEEecc-Cc-cc----cCCccccccccccccCCeEEEE--ECCEEEEEECCCCcEEEEEe
Q 003800          107 TLRAWNLPDGQMVWESFLR-GS-KH----SKPLLLVPTNLKVDKDSLILVS--SKGCLHAVSSIDGEILWTRD  171 (794)
Q Consensus       107 ~v~A~d~~tG~llWe~~l~-~~-~~----s~~~~~~~~~~~~~~~~~V~V~--~~g~l~ald~~tG~~~W~~~  171 (794)
                      .+..+|..+.+..|..... .+ ..    +.....++       .+.++|.  ..+++.-++ .+|+++|++.
T Consensus       233 ~v~~ld~~~~~~~~~~~~~~~~~~~~s~~~G~~Q~L~-------nGn~li~~g~~g~~~E~~-~~G~vv~~~~  297 (299)
T PF14269_consen  233 LVLELDPETMTVTLVREYSDHPDGFYSPSQGSAQRLP-------NGNVLIGWGNNGRISEFT-PDGEVVWEAQ  297 (299)
T ss_pred             eEEEEECCCCEEEEEEEeecCCCcccccCCCcceECC-------CCCEEEecCCCceEEEEC-CCCCEEEEEE
Confidence            8999999988776666554 11 11    11122222       4667775  378888887 6899999975


No 89 
>PRK04922 tolB translocation protein TolB; Provisional
Probab=88.03  E-value=55  Score=37.46  Aligned_cols=150  Identities=14%  Similarity=0.091  Sum_probs=74.3

Q ss_pred             CCCEEEEEeC---CCEEEEEECcCCccceEEEcCcccceeeeeee-eCCEEEEEEc-cC-CeEEEEeCCCCcEeEEEecc
Q 003800           52 GRKRVVVSTE---ENVIASLDLRHGEIFWRHVLGINDVVDGIDIA-LGKYVITLSS-DG-STLRAWNLPDGQMVWESFLR  125 (794)
Q Consensus        52 ~~~~Vyv~t~---~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~-~g~~~V~Vs~-~g-~~v~A~d~~tG~llWe~~l~  125 (794)
                      +++.|++.+.   ...|+.+|.++|+..--..+  ++........ .|+.+++... +| ..++.||..+|+.. +....
T Consensus       214 Dg~~la~~s~~~~~~~l~~~dl~~g~~~~l~~~--~g~~~~~~~SpDG~~l~~~~s~~g~~~Iy~~d~~~g~~~-~lt~~  290 (433)
T PRK04922        214 DGKKLAYVSFERGRSAIYVQDLATGQRELVASF--RGINGAPSFSPDGRRLALTLSRDGNPEIYVMDLGSRQLT-RLTNH  290 (433)
T ss_pred             CCCEEEEEecCCCCcEEEEEECCCCCEEEeccC--CCCccCceECCCCCEEEEEEeCCCCceEEEEECCCCCeE-ECccC
Confidence            4556666553   34799999999875322112  2111112112 3555555432 22 47999999998753 21111


Q ss_pred             CccccCCccccccccccccCCeEEEEE--C--CEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCCce
Q 003800          126 GSKHSKPLLLVPTNLKVDKDSLILVSS--K--GCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQ  201 (794)
Q Consensus       126 ~~~~s~~~~~~~~~~~~~~~~~V~V~~--~--g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~  201 (794)
                      .... ..+.+.     .+ ++.+++.+  +  ..++.+|..+|+..--.......   ..+..+.++..+++.+..+ ..
T Consensus       291 ~~~~-~~~~~s-----pD-G~~l~f~sd~~g~~~iy~~dl~~g~~~~lt~~g~~~---~~~~~SpDG~~Ia~~~~~~-~~  359 (433)
T PRK04922        291 FGID-TEPTWA-----PD-GKSIYFTSDRGGRPQIYRVAASGGSAERLTFQGNYN---ARASVSPDGKKIAMVHGSG-GQ  359 (433)
T ss_pred             CCCc-cceEEC-----CC-CCEEEEEECCCCCceEEEEECCCCCeEEeecCCCCc---cCEEECCCCCEEEEEECCC-Cc
Confidence            1111 111121     22 33444443  2  35889998888654221111111   1122234556666554432 23


Q ss_pred             eEEEEEEcCCCcee
Q 003800          202 FHAYQINAMNGELL  215 (794)
Q Consensus       202 ~~v~ald~~tG~~~  215 (794)
                      ..+..+|+.+|+..
T Consensus       360 ~~I~v~d~~~g~~~  373 (433)
T PRK04922        360 YRIAVMDLSTGSVR  373 (433)
T ss_pred             eeEEEEECCCCCeE
Confidence            46888899888765


No 90 
>PRK04792 tolB translocation protein TolB; Provisional
Probab=88.01  E-value=57  Score=37.63  Aligned_cols=148  Identities=11%  Similarity=0.099  Sum_probs=74.0

Q ss_pred             cCCCEEEEEe-CC--CEEEEEECcCCccceEEEcCcccceeeeee-eeCCEEEEEEccC--CeEEEEeCCCCcEeEEEec
Q 003800           51 TGRKRVVVST-EE--NVIASLDLRHGEIFWRHVLGINDVVDGIDI-ALGKYVITLSSDG--STLRAWNLPDGQMVWESFL  124 (794)
Q Consensus        51 ~~~~~Vyv~t-~~--g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~-~~g~~~V~Vs~~g--~~v~A~d~~tG~llWe~~l  124 (794)
                      +++++|+..+ ++  ..|+.+|..+|+..  +....++....... ..|+.+++.+..+  ..++.+|..+|++.   .+
T Consensus       227 PDG~~La~~s~~~g~~~L~~~dl~tg~~~--~lt~~~g~~~~~~wSPDG~~La~~~~~~g~~~Iy~~dl~tg~~~---~l  301 (448)
T PRK04792        227 PDGRKLAYVSFENRKAEIFVQDIYTQVRE--KVTSFPGINGAPRFSPDGKKLALVLSKDGQPEIYVVDIATKALT---RI  301 (448)
T ss_pred             CCCCEEEEEEecCCCcEEEEEECCCCCeE--EecCCCCCcCCeeECCCCCEEEEEEeCCCCeEEEEEECCCCCeE---EC
Confidence            3455555544 32  47999999998752  22111111111111 2455566654332  36999999888742   22


Q ss_pred             cCcc-ccCCccccccccccccCCeEEEEE----CCEEEEEECCCCcEEE-EEeccCcceeeeeEEEEecCCEEEEEEecC
Q 003800          125 RGSK-HSKPLLLVPTNLKVDKDSLILVSS----KGCLHAVSSIDGEILW-TRDFAAESVEVQQVIQLDESDQIYVVGYAG  198 (794)
Q Consensus       125 ~~~~-~s~~~~~~~~~~~~~~~~~V~V~~----~g~l~ald~~tG~~~W-~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g  198 (794)
                      .... ....+.+.     .+ ++.+++.+    ...++.+|..+|+..- ++.... ..   ....+.++..+++.+..+
T Consensus       302 t~~~~~~~~p~wS-----pD-G~~I~f~s~~~g~~~Iy~~dl~~g~~~~Lt~~g~~-~~---~~~~SpDG~~l~~~~~~~  371 (448)
T PRK04792        302 TRHRAIDTEPSWH-----PD-GKSLIFTSERGGKPQIYRVNLASGKVSRLTFEGEQ-NL---GGSITPDGRSMIMVNRTN  371 (448)
T ss_pred             ccCCCCccceEEC-----CC-CCEEEEEECCCCCceEEEEECCCCCEEEEecCCCC-Cc---CeeECCCCCEEEEEEecC
Confidence            2111 10111111     22 23444433    3479999998887532 211111 11   111134566676655443


Q ss_pred             CceeEEEEEEcCCCce
Q 003800          199 SSQFHAYQINAMNGEL  214 (794)
Q Consensus       199 ~~~~~v~ald~~tG~~  214 (794)
                       ....++.+|+.+|+.
T Consensus       372 -g~~~I~~~dl~~g~~  386 (448)
T PRK04792        372 -GKFNIARQDLETGAM  386 (448)
T ss_pred             -CceEEEEEECCCCCe
Confidence             235788899999875


No 91 
>COG4257 Vgb Streptogramin lyase [Defense mechanisms]
Probab=87.35  E-value=27  Score=37.34  Aligned_cols=194  Identities=9%  Similarity=0.056  Sum_probs=108.9

Q ss_pred             CCEEEEEeCCCEEEEEECcCCcc-ceEEEcCccc-ceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcE-eEEEeccCccc
Q 003800           53 RKRVVVSTEENVIASLDLRHGEI-FWRHVLGIND-VVDGIDIALGKYVITLSSDGSTLRAWNLPDGQM-VWESFLRGSKH  129 (794)
Q Consensus        53 ~~~Vyv~t~~g~l~ALn~~tG~i-vWR~~l~~~~-~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~l-lWe~~l~~~~~  129 (794)
                      ++...+....+.|..||++|++. .|...++... +.... +-...+.+-.++..+.-=-+|+.++.+ +|.........
T Consensus       114 dg~~Witd~~~aI~R~dpkt~evt~f~lp~~~a~~nlet~-vfD~~G~lWFt~q~G~yGrLdPa~~~i~vfpaPqG~gpy  192 (353)
T COG4257         114 DGSAWITDTGLAIGRLDPKTLEVTRFPLPLEHADANLETA-VFDPWGNLWFTGQIGAYGRLDPARNVISVFPAPQGGGPY  192 (353)
T ss_pred             CCCeeEecCcceeEEecCcccceEEeecccccCCCcccce-eeCCCccEEEeeccccceecCcccCceeeeccCCCCCCc
Confidence            44455555556899999999864 3433333221 22222 234555554444322333578887754 56666444332


Q ss_pred             cCCccccccccccccCCeEEEE--ECCEEEEEECCCCcEEEEEeccCcce-eeeeEEEEecCCEEEEEEecCCceeEEEE
Q 003800          130 SKPLLLVPTNLKVDKDSLILVS--SKGCLHAVSSIDGEILWTRDFAAESV-EVQQVIQLDESDQIYVVGYAGSSQFHAYQ  206 (794)
Q Consensus       130 s~~~~~~~~~~~~~~~~~V~V~--~~g~l~ald~~tG~~~W~~~~~~~~~-~~~~~v~s~~~~~vyv~~~~g~~~~~v~a  206 (794)
                        .++..+       ++.|++.  .+..+.++|..+|..- ....|.+.- ..+++ .+...+.++.....++   .++.
T Consensus       193 --Gi~atp-------dGsvwyaslagnaiaridp~~~~ae-v~p~P~~~~~gsRri-wsdpig~~wittwg~g---~l~r  258 (353)
T COG4257         193 --GICATP-------DGSVWYASLAGNAIARIDPFAGHAE-VVPQPNALKAGSRRI-WSDPIGRAWITTWGTG---SLHR  258 (353)
T ss_pred             --ceEECC-------CCcEEEEeccccceEEcccccCCcc-eecCCCccccccccc-ccCccCcEEEeccCCc---eeeE
Confidence              333333       6778876  4889999999999421 112222100 01112 1234567776433333   7899


Q ss_pred             EEcCCCceeeeeeeeccc-CccCceEEEcCcEEEEE-ECCCCeEEEEEeecceeeeEEEeec
Q 003800          207 INAMNGELLNHETAAFSG-GFVGDVALVSSDTLVTL-DTTRSILVTVSFKNRKIAFQETHLS  266 (794)
Q Consensus       207 ld~~tG~~~w~~~v~~~~-~~s~~~~~vg~~~lv~~-d~~~g~L~v~~l~sg~~~~~~~~l~  266 (794)
                      +|+.+-.  |+.= ..|. ....-.+.|.+.-.||+ |.+.|.|+..|-++.+  +..+|+.
T Consensus       259 fdPs~~s--W~ey-pLPgs~arpys~rVD~~grVW~sea~agai~rfdpeta~--ftv~p~p  315 (353)
T COG4257         259 FDPSVTS--WIEY-PLPGSKARPYSMRVDRHGRVWLSEADAGAIGRFDPETAR--FTVLPIP  315 (353)
T ss_pred             eCccccc--ceee-eCCCCCCCcceeeeccCCcEEeeccccCceeecCcccce--EEEecCC
Confidence            9998765  7541 2232 22223455666556777 7778889999888876  7777764


No 92 
>PRK03629 tolB translocation protein TolB; Provisional
Probab=87.05  E-value=62  Score=37.04  Aligned_cols=151  Identities=14%  Similarity=0.056  Sum_probs=73.0

Q ss_pred             cCCCEEEEEe---CCCEEEEEECcCCccceEEEcCcccceeeeee-eeCCEEEEEEc-cC-CeEEEEeCCCCcEeEEEec
Q 003800           51 TGRKRVVVST---EENVIASLDLRHGEIFWRHVLGINDVVDGIDI-ALGKYVITLSS-DG-STLRAWNLPDGQMVWESFL  124 (794)
Q Consensus        51 ~~~~~Vyv~t---~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~-~~g~~~V~Vs~-~g-~~v~A~d~~tG~llWe~~l  124 (794)
                      ++++++.+.+   ....|+.+|.++|+..--..++..  ...... ..|+.+++++. .+ ..++.||..+|++.=-.. 
T Consensus       208 PDG~~la~~s~~~g~~~i~i~dl~~G~~~~l~~~~~~--~~~~~~SPDG~~La~~~~~~g~~~I~~~d~~tg~~~~lt~-  284 (429)
T PRK03629        208 PDGSKLAYVTFESGRSALVIQTLANGAVRQVASFPRH--NGAPAFSPDGSKLAFALSKTGSLNLYVMDLASGQIRQVTD-  284 (429)
T ss_pred             CCCCEEEEEEecCCCcEEEEEECCCCCeEEccCCCCC--cCCeEECCCCCEEEEEEcCCCCcEEEEEECCCCCEEEccC-
Confidence            3455555443   245788899988874322122211  111111 24555666533 22 369999999887642111 


Q ss_pred             cCccccCCccccccccccccCCeEEEEE--C--CEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCCc
Q 003800          125 RGSKHSKPLLLVPTNLKVDKDSLILVSS--K--GCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSS  200 (794)
Q Consensus       125 ~~~~~s~~~~~~~~~~~~~~~~~V~V~~--~--g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~  200 (794)
                      ..... ..+...     .+ ++.+++.+  +  -.++.+|..+|+..--.... ...  .....+.++..+++.+..++ 
T Consensus       285 ~~~~~-~~~~wS-----PD-G~~I~f~s~~~g~~~Iy~~d~~~g~~~~lt~~~-~~~--~~~~~SpDG~~Ia~~~~~~g-  353 (429)
T PRK03629        285 GRSNN-TEPTWF-----PD-SQNLAYTSDQAGRPQVYKVNINGGAPQRITWEG-SQN--QDADVSSDGKFMVMVSSNGG-  353 (429)
T ss_pred             CCCCc-CceEEC-----CC-CCEEEEEeCCCCCceEEEEECCCCCeEEeecCC-CCc--cCEEECCCCCEEEEEEccCC-
Confidence            11111 111122     22 23344433  2  27888998888654221111 111  11111334555555444432 


Q ss_pred             eeEEEEEEcCCCcee
Q 003800          201 QFHAYQINAMNGELL  215 (794)
Q Consensus       201 ~~~v~ald~~tG~~~  215 (794)
                      ...++.+|+.+|+..
T Consensus       354 ~~~I~~~dl~~g~~~  368 (429)
T PRK03629        354 QQHIAKQDLATGGVQ  368 (429)
T ss_pred             CceEEEEECCCCCeE
Confidence            246788899998743


No 93 
>KOG0275 consensus Conserved WD40 repeat-containing protein [General function prediction only]
Probab=86.61  E-value=5.9  Score=42.60  Aligned_cols=184  Identities=18%  Similarity=0.221  Sum_probs=97.5

Q ss_pred             CCEEEEEECcCCcc-ceEEEcCcc---------cceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccC
Q 003800           62 ENVIASLDLRHGEI-FWRHVLGIN---------DVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSK  131 (794)
Q Consensus        62 ~g~l~ALn~~tG~i-vWR~~l~~~---------~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~  131 (794)
                      +....|--+.||++ +||...+.-         ..+.+++...++..+.-++.+..+|.--..+|+.+=|++..+.-. .
T Consensus       274 DsEMlAsGsqDGkIKvWri~tG~ClRrFdrAHtkGvt~l~FSrD~SqiLS~sfD~tvRiHGlKSGK~LKEfrGHsSyv-n  352 (508)
T KOG0275|consen  274 DSEMLASGSQDGKIKVWRIETGQCLRRFDRAHTKGVTCLSFSRDNSQILSASFDQTVRIHGLKSGKCLKEFRGHSSYV-N  352 (508)
T ss_pred             cHHHhhccCcCCcEEEEEEecchHHHHhhhhhccCeeEEEEccCcchhhcccccceEEEeccccchhHHHhcCccccc-c
Confidence            34444444555654 577654321         123344333333334334557789999999999999999876543 2


Q ss_pred             CccccccccccccCCeEEEE-ECCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcC
Q 003800          132 PLLLVPTNLKVDKDSLILVS-SKGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAM  210 (794)
Q Consensus       132 ~~~~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~  210 (794)
                      .+.+.+     + ++.++-. +||++..-+.+|++-+=+++..........++....+-.-++++...+   .++..+. 
T Consensus       353 ~a~ft~-----d-G~~iisaSsDgtvkvW~~KtteC~~Tfk~~~~d~~vnsv~~~PKnpeh~iVCNrsn---tv~imn~-  422 (508)
T KOG0275|consen  353 EATFTD-----D-GHHIISASSDGTVKVWHGKTTECLSTFKPLGTDYPVNSVILLPKNPEHFIVCNRSN---TVYIMNM-  422 (508)
T ss_pred             ceEEcC-----C-CCeEEEecCCccEEEecCcchhhhhhccCCCCcccceeEEEcCCCCceEEEEcCCC---eEEEEec-
Confidence            322222     2 3444444 599999999999888777766554432222222112222233333222   3444443 


Q ss_pred             CCceeeeeeeecc--cCccCceEEEcCcEEEEEECCCCeEEEEEeecce
Q 003800          211 NGELLNHETAAFS--GGFVGDVALVSSDTLVTLDTTRSILVTVSFKNRK  257 (794)
Q Consensus       211 tG~~~w~~~v~~~--~~~s~~~~~vg~~~lv~~d~~~g~L~v~~l~sg~  257 (794)
                      .|+.+....-+-.  .++-..++-.-+..++|+- ..+.|+.....+|+
T Consensus       423 qGQvVrsfsSGkREgGdFi~~~lSpkGewiYcig-ED~vlYCF~~~sG~  470 (508)
T KOG0275|consen  423 QGQVVRSFSSGKREGGDFINAILSPKGEWIYCIG-EDGVLYCFSVLSGK  470 (508)
T ss_pred             cceEEeeeccCCccCCceEEEEecCCCcEEEEEc-cCcEEEEEEeecCc
Confidence            4555544421111  1111122222344666775 45788888888887


No 94 
>KOG4441 consensus Proteins containing BTB/POZ and Kelch domains, involved in regulatory/signal transduction processes [Signal transduction mechanisms; General function prediction only]
Probab=86.55  E-value=14  Score=44.07  Aligned_cols=173  Identities=12%  Similarity=0.124  Sum_probs=94.6

Q ss_pred             CCEEEEEeCC-------CEEEEEECcCCccceEEEcCccc--ceeeeeeeeCCEEEEEEccC------CeEEEEeCCCCc
Q 003800           53 RKRVVVSTEE-------NVIASLDLRHGEIFWRHVLGIND--VVDGIDIALGKYVITLSSDG------STLRAWNLPDGQ  117 (794)
Q Consensus        53 ~~~Vyv~t~~-------g~l~ALn~~tG~ivWR~~l~~~~--~i~~l~~~~g~~~V~Vs~~g------~~v~A~d~~tG~  117 (794)
                      .+.+|+....       ..+-++|++++  .|+...+-+.  .-.+. +..++.+.++||.+      ..+.-+|+.+++
T Consensus       284 ~~~l~~vGG~~~~~~~~~~ve~yd~~~~--~w~~~a~m~~~r~~~~~-~~~~~~lYv~GG~~~~~~~l~~ve~YD~~~~~  360 (571)
T KOG4441|consen  284 SGKLVAVGGYNRQGQSLRSVECYDPKTN--EWSSLAPMPSPRCRVGV-AVLNGKLYVVGGYDSGSDRLSSVERYDPRTNQ  360 (571)
T ss_pred             CCeEEEECCCCCCCcccceeEEecCCcC--cEeecCCCCcccccccE-EEECCEEEEEccccCCCcccceEEEecCCCCc
Confidence            4556665542       46889999999  6877654442  11122 34566666667755      568899999888


Q ss_pred             EeEEEeccCccccCCccccccccccccCCeEEEEE--CC-----EEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCE
Q 003800          118 MVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSS--KG-----CLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQ  190 (794)
Q Consensus       118 llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~--~g-----~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~  190 (794)
                        |..-..-.....   ..+   ...-++.+|+.+  +|     .+-++|+.+  -.|+...+.... ....-...-++.
T Consensus       361 --W~~~a~M~~~R~---~~~---v~~l~g~iYavGG~dg~~~l~svE~YDp~~--~~W~~va~m~~~-r~~~gv~~~~g~  429 (571)
T KOG4441|consen  361 --WTPVAPMNTKRS---DFG---VAVLDGKLYAVGGFDGEKSLNSVECYDPVT--NKWTPVAPMLTR-RSGHGVAVLGGK  429 (571)
T ss_pred             --eeccCCccCccc---cce---eEEECCEEEEEeccccccccccEEEecCCC--CcccccCCCCcc-eeeeEEEEECCE
Confidence              886322111101   111   111145666643  22     355555543  468887765432 112111356899


Q ss_pred             EEEEEecCCce---eEEEEEEcCCCceeeeeeeecccCccC-ceEEEcCcEEEEE
Q 003800          191 IYVVGYAGSSQ---FHAYQINAMNGELLNHETAAFSGGFVG-DVALVSSDTLVTL  241 (794)
Q Consensus       191 vyv~~~~g~~~---~~v~ald~~tG~~~w~~~v~~~~~~s~-~~~~vg~~~lv~~  241 (794)
                      +|++|...+..   -.+-++|+.|++  |...-.++....+ .+...++.+|++.
T Consensus       430 iYi~GG~~~~~~~l~sve~YDP~t~~--W~~~~~M~~~R~~~g~a~~~~~iYvvG  482 (571)
T KOG4441|consen  430 LYIIGGGDGSSNCLNSVECYDPETNT--WTLIAPMNTRRSGFGVAVLNGKIYVVG  482 (571)
T ss_pred             EEEEcCcCCCccccceEEEEcCCCCc--eeecCCcccccccceEEEECCEEEEEC
Confidence            99987643222   568899998874  6654333322222 2333355555554


No 95 
>PRK00178 tolB translocation protein TolB; Provisional
Probab=86.39  E-value=65  Score=36.61  Aligned_cols=148  Identities=14%  Similarity=0.093  Sum_probs=74.2

Q ss_pred             cCCCEEEEEeCC---CEEEEEECcCCccceEEEcCcccceeeeeee-eCCEEEEEEc-c-CCeEEEEeCCCCcEeEEEec
Q 003800           51 TGRKRVVVSTEE---NVIASLDLRHGEIFWRHVLGINDVVDGIDIA-LGKYVITLSS-D-GSTLRAWNLPDGQMVWESFL  124 (794)
Q Consensus        51 ~~~~~Vyv~t~~---g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~-~g~~~V~Vs~-~-g~~v~A~d~~tG~llWe~~l  124 (794)
                      +++++|++.+.+   ..|+.+|.++|+..  +.....+........ .|+.+++... . ...++.+|..+|+..-   +
T Consensus       208 pDG~~la~~s~~~~~~~l~~~~l~~g~~~--~l~~~~g~~~~~~~SpDG~~la~~~~~~g~~~Iy~~d~~~~~~~~---l  282 (430)
T PRK00178        208 PDGKRIAYVSFEQKRPRIFVQNLDTGRRE--QITNFEGLNGAPAWSPDGSKLAFVLSKDGNPEIYVMDLASRQLSR---V  282 (430)
T ss_pred             CCCCEEEEEEcCCCCCEEEEEECCCCCEE--EccCCCCCcCCeEECCCCCEEEEEEccCCCceEEEEECCCCCeEE---c
Confidence            345566554432   47899999988752  222112111111112 4555555433 2 2479999999887531   2


Q ss_pred             cCcc-ccCCccccccccccccCCeEEEEE----CCEEEEEECCCCcEE-EEEeccCcceeeeeEEEEecCCEEEEEEecC
Q 003800          125 RGSK-HSKPLLLVPTNLKVDKDSLILVSS----KGCLHAVSSIDGEIL-WTRDFAAESVEVQQVIQLDESDQIYVVGYAG  198 (794)
Q Consensus       125 ~~~~-~s~~~~~~~~~~~~~~~~~V~V~~----~g~l~ald~~tG~~~-W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g  198 (794)
                      .... ....+.+.     .+ ++.+++.+    ...++.+|..+|+.. .+...  ...  .....+.++..+++....+
T Consensus       283 t~~~~~~~~~~~s-----pD-g~~i~f~s~~~g~~~iy~~d~~~g~~~~lt~~~--~~~--~~~~~Spdg~~i~~~~~~~  352 (430)
T PRK00178        283 TNHPAIDTEPFWG-----KD-GRTLYFTSDRGGKPQIYKVNVNGGRAERVTFVG--NYN--ARPRLSADGKTLVMVHRQD  352 (430)
T ss_pred             ccCCCCcCCeEEC-----CC-CCEEEEEECCCCCceEEEEECCCCCEEEeecCC--CCc--cceEECCCCCEEEEEEccC
Confidence            2111 10111111     22 34444443    347999999888753 22211  111  1111134566666655433


Q ss_pred             CceeEEEEEEcCCCce
Q 003800          199 SSQFHAYQINAMNGEL  214 (794)
Q Consensus       199 ~~~~~v~ald~~tG~~  214 (794)
                      + ...++.+|+.+|+.
T Consensus       353 ~-~~~l~~~dl~tg~~  367 (430)
T PRK00178        353 G-NFHVAAQDLQRGSV  367 (430)
T ss_pred             C-ceEEEEEECCCCCE
Confidence            2 34688899999875


No 96 
>KOG0310 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=86.03  E-value=14  Score=42.04  Aligned_cols=113  Identities=17%  Similarity=0.252  Sum_probs=71.8

Q ss_pred             EEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEec--cCccccCC
Q 003800           55 RVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFL--RGSKHSKP  132 (794)
Q Consensus        55 ~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l--~~~~~s~~  132 (794)
                      .++.++.+|.|---|.++=. -|-..++.+.++... .....+..+++..|+.|+.||..+|..+=-...  ...+.  .
T Consensus       168 ivvtGsYDg~vrl~DtR~~~-~~v~elnhg~pVe~v-l~lpsgs~iasAgGn~vkVWDl~~G~qll~~~~~H~KtVT--c  243 (487)
T KOG0310|consen  168 IVVTGSYDGKVRLWDTRSLT-SRVVELNHGCPVESV-LALPSGSLIASAGGNSVKVWDLTTGGQLLTSMFNHNKTVT--C  243 (487)
T ss_pred             EEEecCCCceEEEEEeccCC-ceeEEecCCCceeeE-EEcCCCCEEEEcCCCeEEEEEecCCceehhhhhcccceEE--E
Confidence            47788889999999988865 788888776555533 234444444465578999999997755433322  12111  1


Q ss_pred             ccccccccccccCCeEEEEE-CCEEEEEECCCCcEEEEEeccCcce
Q 003800          133 LLLVPTNLKVDKDSLILVSS-KGCLHAVSSIDGEILWTRDFAAESV  177 (794)
Q Consensus       133 ~~~~~~~~~~~~~~~V~V~~-~g~l~ald~~tG~~~W~~~~~~~~~  177 (794)
                      +.+..     + +..++-.+ |+.|-.+|..+=+++-.+..++|-+
T Consensus       244 L~l~s-----~-~~rLlS~sLD~~VKVfd~t~~Kvv~s~~~~~pvL  283 (487)
T KOG0310|consen  244 LRLAS-----D-STRLLSGSLDRHVKVFDTTNYKVVHSWKYPGPVL  283 (487)
T ss_pred             EEeec-----C-CceEeecccccceEEEEccceEEEEeeeccccee
Confidence            11111     1 23333344 9999999988888887777777654


No 97 
>KOG4547 consensus WD40 repeat-containing protein [General function prediction only]
Probab=85.87  E-value=20  Score=41.66  Aligned_cols=106  Identities=15%  Similarity=0.144  Sum_probs=64.2

Q ss_pred             CCeEEEEE--CCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeeeeeec
Q 003800          145 DSLILVSS--KGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHETAAF  222 (794)
Q Consensus       145 ~~~V~V~~--~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~  222 (794)
                      +-.++|++  .|.+..++...|++.|+...+...-..-.+..+...+-+|-++.+    .++.-++.++++.+-.....-
T Consensus        69 ~t~~lvlgt~~g~v~~ys~~~g~it~~~st~~h~~~v~~~~~~~~~~ciyS~~ad----~~v~~~~~~~~~~~~~~~~~~  144 (541)
T KOG4547|consen   69 DTSMLVLGTPQGSVLLYSVAGGEITAKLSTDKHYGNVNEILDAQRLGCIYSVGAD----LKVVYILEKEKVIIRIWKEQK  144 (541)
T ss_pred             CceEEEeecCCccEEEEEecCCeEEEEEecCCCCCcceeeecccccCceEecCCc----eeEEEEecccceeeeeeccCC
Confidence            34456663  899999999999999999854432101111112234455644333    488889999998874443222


Q ss_pred             ccCccCceEEEcCcEEEEEECCCCeEEEEEeeccee
Q 003800          223 SGGFVGDVALVSSDTLVTLDTTRSILVTVSFKNRKI  258 (794)
Q Consensus       223 ~~~~s~~~~~vg~~~lv~~d~~~g~L~v~~l~sg~~  258 (794)
                      + ..+.-|+...+.+++.+   .+++.++|++++++
T Consensus       145 ~-~~~sl~is~D~~~l~~a---s~~ik~~~~~~kev  176 (541)
T KOG4547|consen  145 P-LVSSLCISPDGKILLTA---SRQIKVLDIETKEV  176 (541)
T ss_pred             C-ccceEEEcCCCCEEEec---cceEEEEEccCceE
Confidence            2 22233443333455544   46899999999884


No 98 
>KOG0649 consensus WD40 repeat protein [General function prediction only]
Probab=84.67  E-value=56  Score=34.35  Aligned_cols=105  Identities=10%  Similarity=0.125  Sum_probs=62.8

Q ss_pred             EEeCCCEEEEEEC------cCCccceEEEcCccc------ceeeeeee-eCCEEEEEEccCCeEEEEeCCCCcEeEEEec
Q 003800           58 VSTEENVIASLDL------RHGEIFWRHVLGIND------VVDGIDIA-LGKYVITLSSDGSTLRAWNLPDGQMVWESFL  124 (794)
Q Consensus        58 v~t~~g~l~ALn~------~tG~ivWR~~l~~~~------~i~~l~~~-~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l  124 (794)
                      ....+|.|++.-=      .-=+.+|+...+...      .|..+.+. ..+-+++.+| ++.++.||.+||+..-+++.
T Consensus        76 ls~gdG~V~gw~W~E~~es~~~K~lwe~~~P~~~~~~evPeINam~ldP~enSi~~AgG-D~~~y~~dlE~G~i~r~~rG  154 (325)
T KOG0649|consen   76 LSGGDGLVYGWEWNEEEESLATKRLWEVKIPMQVDAVEVPEINAMWLDPSENSILFAGG-DGVIYQVDLEDGRIQREYRG  154 (325)
T ss_pred             eeccCceEEEeeehhhhhhccchhhhhhcCccccCcccCCccceeEeccCCCcEEEecC-CeEEEEEEecCCEEEEEEcC
Confidence            3334588888731      233567877665441      12222122 2333555455 57999999999999999998


Q ss_pred             cCccccCCccccccccccccCCeEEEE-ECCEEEEEECCCCcEEEEE
Q 003800          125 RGSKHSKPLLLVPTNLKVDKDSLILVS-SKGCLHAVSSIDGEILWTR  170 (794)
Q Consensus       125 ~~~~~s~~~~~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG~~~W~~  170 (794)
                      ....+ -  .+++    -...+.|+-. .||+++.-|.+|++-+=..
T Consensus       155 HtDYv-H--~vv~----R~~~~qilsG~EDGtvRvWd~kt~k~v~~i  194 (325)
T KOG0649|consen  155 HTDYV-H--SVVG----RNANGQILSGAEDGTVRVWDTKTQKHVSMI  194 (325)
T ss_pred             Cccee-e--eeee----cccCcceeecCCCccEEEEeccccceeEEe
Confidence            76543 1  1111    1113455555 3899999999988755443


No 99 
>KOG0286 consensus G-protein beta subunit [General function prediction only]
Probab=84.59  E-value=40  Score=36.27  Aligned_cols=152  Identities=11%  Similarity=0.125  Sum_probs=94.7

Q ss_pred             CCCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEE-EccCCeEEEEeCCCCcEeEEEeccCcccc
Q 003800           52 GRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITL-SSDGSTLRAWNLPDGQMVWESFLRGSKHS  130 (794)
Q Consensus        52 ~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~V-s~~g~~v~A~d~~tG~llWe~~l~~~~~s  130 (794)
                      +++.|++++.+..-+--|.++|+..=-+.--. +.+-++.+.-.+.-.|| ++.+...+.||...|.-+=.+....... 
T Consensus       155 dD~~ilT~SGD~TCalWDie~g~~~~~f~GH~-gDV~slsl~p~~~ntFvSg~cD~~aklWD~R~~~c~qtF~ghesDI-  232 (343)
T KOG0286|consen  155 DDNHILTGSGDMTCALWDIETGQQTQVFHGHT-GDVMSLSLSPSDGNTFVSGGCDKSAKLWDVRSGQCVQTFEGHESDI-  232 (343)
T ss_pred             CCCceEecCCCceEEEEEcccceEEEEecCCc-ccEEEEecCCCCCCeEEecccccceeeeeccCcceeEeeccccccc-
Confidence            37779999999999999999997654333211 22323322221333344 4557899999999998877777665444 


Q ss_pred             CCccccccccccccCCeEEEE--ECCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEE
Q 003800          131 KPLLLVPTNLKVDKDSLILVS--SKGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQIN  208 (794)
Q Consensus       131 ~~~~~~~~~~~~~~~~~V~V~--~~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald  208 (794)
                      .+..+.|       ++.-|+.  .|+.-..+|....+.+=.|+.+....-...+-.+.++..+|+ ++..   ......|
T Consensus       233 Nsv~ffP-------~G~afatGSDD~tcRlyDlRaD~~~a~ys~~~~~~gitSv~FS~SGRlLfa-gy~d---~~c~vWD  301 (343)
T KOG0286|consen  233 NSVRFFP-------SGDAFATGSDDATCRLYDLRADQELAVYSHDSIICGITSVAFSKSGRLLFA-GYDD---FTCNVWD  301 (343)
T ss_pred             ceEEEcc-------CCCeeeecCCCceeEEEeecCCcEEeeeccCcccCCceeEEEcccccEEEe-eecC---CceeEee
Confidence            3444554       5666665  388889999998888877775543321222322334444553 4433   2677778


Q ss_pred             cCCCceee
Q 003800          209 AMNGELLN  216 (794)
Q Consensus       209 ~~tG~~~w  216 (794)
                      ...|++.-
T Consensus       302 tlk~e~vg  309 (343)
T KOG0286|consen  302 TLKGERVG  309 (343)
T ss_pred             ccccceEE
Confidence            77776653


No 100
>KOG0270 consensus WD40 repeat-containing protein [Function unknown]
Probab=84.41  E-value=29  Score=39.10  Aligned_cols=119  Identities=19%  Similarity=0.218  Sum_probs=70.6

Q ss_pred             EEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEE-E-CCEEEEEECC---CCcEEEEEe
Q 003800           97 YVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVS-S-KGCLHAVSSI---DGEILWTRD  171 (794)
Q Consensus        97 ~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~-~-~g~l~ald~~---tG~~~W~~~  171 (794)
                      .+++-++.+.+|..||.++|+..=.....+... +.+..-+     . ...+++. + +++|...|..   .-...|++.
T Consensus       257 nVLaSgsaD~TV~lWD~~~g~p~~s~~~~~k~V-q~l~wh~-----~-~p~~LLsGs~D~~V~l~D~R~~~~s~~~wk~~  329 (463)
T KOG0270|consen  257 NVLASGSADKTVKLWDVDTGKPKSSITHHGKKV-QTLEWHP-----Y-EPSVLLSGSYDGTVALKDCRDPSNSGKEWKFD  329 (463)
T ss_pred             eeEEecCCCceEEEEEcCCCCcceehhhcCCce-eEEEecC-----C-CceEEEeccccceEEeeeccCccccCceEEec
Confidence            344434457899999999999998877555444 2333332     1 2334443 2 8888888877   344678886


Q ss_pred             ccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcC-CCceeeeeeeecccCccCceE
Q 003800          172 FAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAM-NGELLNHETAAFSGGFVGDVA  231 (794)
Q Consensus       172 ~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~-tG~~~w~~~v~~~~~~s~~~~  231 (794)
                      ..-...     ..-......|+++.+.|   .|+-+|+. .|+++|+....- .++++-++
T Consensus       330 g~VEkv-----~w~~~se~~f~~~tddG---~v~~~D~R~~~~~vwt~~AHd-~~ISgl~~  381 (463)
T KOG0270|consen  330 GEVEKV-----AWDPHSENSFFVSTDDG---TVYYFDIRNPGKPVWTLKAHD-DEISGLSV  381 (463)
T ss_pred             cceEEE-----EecCCCceeEEEecCCc---eEEeeecCCCCCceeEEEecc-CCcceEEe
Confidence            543322     11122334454554433   78888876 579999986432 35555333


No 101
>KOG0266 consensus WD40 repeat-containing protein [General function prediction only]
Probab=84.23  E-value=44  Score=38.68  Aligned_cols=158  Identities=14%  Similarity=0.176  Sum_probs=84.4

Q ss_pred             CCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCC
Q 003800           53 RKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKP  132 (794)
Q Consensus        53 ~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~  132 (794)
                      +..|+-++.++.+..-|.++|+.+=....... .+.++.....+..++.++.++.++.||..+|..+ -........ ..
T Consensus       258 g~~i~Sgs~D~tvriWd~~~~~~~~~l~~hs~-~is~~~f~~d~~~l~s~s~d~~i~vwd~~~~~~~-~~~~~~~~~-~~  334 (456)
T KOG0266|consen  258 GNLLVSGSDDGTVRIWDVRTGECVRKLKGHSD-GISGLAFSPDGNLLVSASYDGTIRVWDLETGSKL-CLKLLSGAE-NS  334 (456)
T ss_pred             CCEEEEecCCCcEEEEeccCCeEEEeeeccCC-ceEEEEECCCCCEEEEcCCCccEEEEECCCCcee-eeecccCCC-CC
Confidence            56788899999999999999876544333332 3444422233344444555789999999999954 111111000 01


Q ss_pred             ccccccccccccCCeEEEEE-CCEEEEEECCCCcEEEEEeccCcc-eeeeeEEEEecCCEEEEEEecCCceeEEEEEEcC
Q 003800          133 LLLVPTNLKVDKDSLILVSS-KGCLHAVSSIDGEILWTRDFAAES-VEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAM  210 (794)
Q Consensus       133 ~~~~~~~~~~~~~~~V~V~~-~g~l~ald~~tG~~~W~~~~~~~~-~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~  210 (794)
                      .++.-.....+ ...+++.. ++.+.-.|..+|...=++...... ........ ..++...+.+...   ..+..+|..
T Consensus       335 ~~~~~~~fsp~-~~~ll~~~~d~~~~~w~l~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~i~sg~~d---~~v~~~~~~  409 (456)
T KOG0266|consen  335 APVTSVQFSPN-GKYLLSASLDRTLKLWDLRSGKSVGTYTGHSNLVRCIFSPTL-STGGKLIYSGSED---GSVYVWDSS  409 (456)
T ss_pred             CceeEEEECCC-CcEEEEecCCCeEEEEEccCCcceeeecccCCcceeEecccc-cCCCCeEEEEeCC---ceEEEEeCC
Confidence            01110000112 33444444 667777777777655444332221 11222322 2334433333332   278999999


Q ss_pred             CCceeeee
Q 003800          211 NGELLNHE  218 (794)
Q Consensus       211 tG~~~w~~  218 (794)
                      +|..+-..
T Consensus       410 s~~~~~~l  417 (456)
T KOG0266|consen  410 SGGILQRL  417 (456)
T ss_pred             ccchhhhh
Confidence            88777554


No 102
>KOG0315 consensus G-protein beta subunit-like protein (contains WD40 repeats) [General function prediction only]
Probab=83.76  E-value=63  Score=34.15  Aligned_cols=60  Identities=15%  Similarity=0.296  Sum_probs=38.9

Q ss_pred             CCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCC
Q 003800           53 RKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLP  114 (794)
Q Consensus        53 ~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~  114 (794)
                      ++-.|.++++|.+---|.+.  +.=.+.+....++..+-+.-.+.-++++...+.||.||..
T Consensus        95 grWMyTgseDgt~kIWdlR~--~~~qR~~~~~spVn~vvlhpnQteLis~dqsg~irvWDl~  154 (311)
T KOG0315|consen   95 GRWMYTGSEDGTVKIWDLRS--LSCQRNYQHNSPVNTVVLHPNQTELISGDQSGNIRVWDLG  154 (311)
T ss_pred             CeEEEecCCCceEEEEeccC--cccchhccCCCCcceEEecCCcceEEeecCCCcEEEEEcc
Confidence            44499999999999888887  2222233322223333233456667767777899999984


No 103
>TIGR03547 muta_rot_YjhT mutarotase, YjhT family. Members of this protein family contain multiple copies of the beta-propeller-forming Kelch repeat. All are full-length homologs to YjhT of Escherichia coli, which has been identified as a mutarotase for sialic acid. This protein improves bacterial ability to obtain host sialic acid, and thus serves as a virulence factor. Some bacteria carry what appears to be a cyclically permuted homolog of this protein.
Probab=83.06  E-value=79  Score=34.79  Aligned_cols=178  Identities=10%  Similarity=0.072  Sum_probs=88.7

Q ss_pred             CCEEEEEeCC------------CEEEEEECcCCccceEEEcC-cccceeeee-e-eeCCEEEEEEccC------------
Q 003800           53 RKRVVVSTEE------------NVIASLDLRHGEIFWRHVLG-INDVVDGID-I-ALGKYVITLSSDG------------  105 (794)
Q Consensus        53 ~~~Vyv~t~~------------g~l~ALn~~tG~ivWR~~l~-~~~~i~~l~-~-~~g~~~V~Vs~~g------------  105 (794)
                      ++.||+....            +.+..+|+.+.  .|+.... .+....+.. . ..++.+.+++|.+            
T Consensus        63 ~~~iYv~GG~~~~~~~~~~~~~~~v~~Yd~~~~--~W~~~~~~~p~~~~~~~~~~~~~g~IYviGG~~~~~~~~~~~~~~  140 (346)
T TIGR03547        63 DGKLYVFGGIGKANSEGSPQVFDDVYRYDPKKN--SWQKLDTRSPVGLLGASGFSLHNGQAYFTGGVNKNIFDGYFADLS  140 (346)
T ss_pred             CCEEEEEeCCCCCCCCCcceecccEEEEECCCC--EEecCCCCCCCcccceeEEEEeCCEEEEEcCcChHHHHHHHhhHh
Confidence            7789987753            24778888876  4987642 111111111 1 2455566666642            


Q ss_pred             ---------------------------CeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEEEC-----
Q 003800          106 ---------------------------STLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSSK-----  153 (794)
Q Consensus       106 ---------------------------~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~~-----  153 (794)
                                                 +.+..||+.+.  .|+..-.-+..    +..... ....++.++|..+     
T Consensus       141 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~v~~YDp~t~--~W~~~~~~p~~----~r~~~~-~~~~~~~iyv~GG~~~~~  213 (346)
T TIGR03547       141 AADKDSEPKDKLIAAYFSQPPEDYFWNKNVLSYDPSTN--QWRNLGENPFL----GTAGSA-IVHKGNKLLLINGEIKPG  213 (346)
T ss_pred             hcCccchhhhhhHHHHhCCChhHcCccceEEEEECCCC--ceeECccCCCC----cCCCce-EEEECCEEEEEeeeeCCC
Confidence                                       46888888765  58764322110    001100 1122567777531     


Q ss_pred             ---CEEEEEECCCCcEEEEEeccCcce--e-e---eeEEEEecCCEEEEEEecCC-------------------ceeEEE
Q 003800          154 ---GCLHAVSSIDGEILWTRDFAAESV--E-V---QQVIQLDESDQIYVVGYAGS-------------------SQFHAY  205 (794)
Q Consensus       154 ---g~l~ald~~tG~~~W~~~~~~~~~--~-~---~~~v~s~~~~~vyv~~~~g~-------------------~~~~v~  205 (794)
                         ..++.++.....-.|+.-.+.+.-  . +   .......-++.+|+++....                   ..-.+.
T Consensus       214 ~~~~~~~~y~~~~~~~~W~~~~~m~~~r~~~~~~~~~~~a~~~~~~Iyv~GG~~~~~~~~~~~~~~~~~~~~~~~~~~~e  293 (346)
T TIGR03547       214 LRTAEVKQYLFTGGKLEWNKLPPLPPPKSSSQEGLAGAFAGISNGVLLVAGGANFPGAQENYKNGKLYAHEGLIKAWSSE  293 (346)
T ss_pred             ccchheEEEEecCCCceeeecCCCCCCCCCccccccEEeeeEECCEEEEeecCCCCCchhhhhcCCccccCCCCceeEee
Confidence               124556655566679865443220  0 0   01101245889999875310                   001356


Q ss_pred             EEEcCCCceeeeeeeecccCccCceEE-EcCcEEEEE
Q 003800          206 QINAMNGELLNHETAAFSGGFVGDVAL-VSSDTLVTL  241 (794)
Q Consensus       206 ald~~tG~~~w~~~v~~~~~~s~~~~~-vg~~~lv~~  241 (794)
                      ++|+.+.  .|+..-..|........+ +++.++++.
T Consensus       294 ~yd~~~~--~W~~~~~lp~~~~~~~~~~~~~~iyv~G  328 (346)
T TIGR03547       294 VYALDNG--KWSKVGKLPQGLAYGVSVSWNNGVLLIG  328 (346)
T ss_pred             EEEecCC--cccccCCCCCCceeeEEEEcCCEEEEEe
Confidence            6777765  487754555444332222 244444433


No 104
>KOG1274 consensus WD40 repeat protein [General function prediction only]
Probab=82.98  E-value=63  Score=39.71  Aligned_cols=186  Identities=15%  Similarity=0.110  Sum_probs=107.7

Q ss_pred             CCCEEEEEeCCCEEEEEECcCCc---cceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCcc
Q 003800           52 GRKRVVVSTEENVIASLDLRHGE---IFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSK  128 (794)
Q Consensus        52 ~~~~Vyv~t~~g~l~ALn~~tG~---ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~  128 (794)
                      ..+.+.++|+++.|.+..--.|+   ++=|+.++-.    .+.+..++..+..++++-.|..+|..|+...-..+...+.
T Consensus        65 ~s~~f~~~s~~~tv~~y~fps~~~~~iL~Rftlp~r----~~~v~g~g~~iaagsdD~~vK~~~~~D~s~~~~lrgh~ap  140 (933)
T KOG1274|consen   65 YSNHFLTGSEQNTVLRYKFPSGEEDTILARFTLPIR----DLAVSGSGKMIAAGSDDTAVKLLNLDDSSQEKVLRGHDAP  140 (933)
T ss_pred             cccceEEeeccceEEEeeCCCCCccceeeeeeccce----EEEEecCCcEEEeecCceeEEEEeccccchheeecccCCc
Confidence            45678889999988888666554   4455555432    3322334446665777788999999999888777665443


Q ss_pred             ccCCccccccccccccCCeEEEE-ECCEEEEEECCCCcEEEEEeccCcce--e----eeeEEEEecCCEEEEEEecCCce
Q 003800          129 HSKPLLLVPTNLKVDKDSLILVS-SKGCLHAVSSIDGEILWTRDFAAESV--E----VQQVIQLDESDQIYVVGYAGSSQ  201 (794)
Q Consensus       129 ~s~~~~~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG~~~W~~~~~~~~~--~----~~~~v~s~~~~~vyv~~~~g~~~  201 (794)
                      . ..+.+-|     . ++.+.+. .+|.|+..|..+|...=++..-.+..  .    ..++.....++.+-+.+.++   
T Consensus       141 V-l~l~~~p-----~-~~fLAvss~dG~v~iw~~~~~~~~~tl~~v~k~n~~~~s~i~~~~aW~Pk~g~la~~~~d~---  210 (933)
T KOG1274|consen  141 V-LQLSYDP-----K-GNFLAVSSCDGKVQIWDLQDGILSKTLTGVDKDNEFILSRICTRLAWHPKGGTLAVPPVDN---  210 (933)
T ss_pred             e-eeeeEcC-----C-CCEEEEEecCceEEEEEcccchhhhhcccCCccccccccceeeeeeecCCCCeEEeeccCC---
Confidence            2 1111111     1 3344444 49999999999998764443322211  1    11222234567777777676   


Q ss_pred             eEEEEEEcCCCceeeeeeeeccc-CccCceEE-EcCcEEEEEECCCCeEEEEEee
Q 003800          202 FHAYQINAMNGELLNHETAAFSG-GFVGDVAL-VSSDTLVTLDTTRSILVTVSFK  254 (794)
Q Consensus       202 ~~v~ald~~tG~~~w~~~v~~~~-~~s~~~~~-vg~~~lv~~d~~~g~L~v~~l~  254 (794)
                       .|..++..+++.....+....+ .++- +-+ ..+.++++.+. +|.+.+-|.+
T Consensus       211 -~Vkvy~r~~we~~f~Lr~~~~ss~~~~-~~wsPnG~YiAAs~~-~g~I~vWnv~  262 (933)
T KOG1274|consen  211 -TVKVYSRKGWELQFKLRDKLSSSKFSD-LQWSPNGKYIAASTL-DGQILVWNVD  262 (933)
T ss_pred             -eEEEEccCCceeheeecccccccceEE-EEEcCCCcEEeeecc-CCcEEEEecc
Confidence             6888898888877666432211 1111 111 13345555553 4666666655


No 105
>KOG0303 consensus Actin-binding protein Coronin, contains WD40 repeats [Cytoskeleton]
Probab=82.98  E-value=21  Score=39.67  Aligned_cols=71  Identities=15%  Similarity=0.222  Sum_probs=49.4

Q ss_pred             CCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeee-eCCEEEEEEccCCeEEEEeCCCCcEeEEEeccC
Q 003800           53 RKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIA-LGKYVITLSSDGSTLRAWNLPDGQMVWESFLRG  126 (794)
Q Consensus        53 ~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~-~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~  126 (794)
                      .+.+..++.+|+|.--|..||+.+=+..  .++-+...... .|..+++ +..+.+||.||+.+|+++|+.....
T Consensus       144 ~NVLlsag~Dn~v~iWnv~tgeali~l~--hpd~i~S~sfn~dGs~l~T-tckDKkvRv~dpr~~~~v~e~~~he  215 (472)
T KOG0303|consen  144 PNVLLSAGSDNTVSIWNVGTGEALITLD--HPDMVYSMSFNRDGSLLCT-TCKDKKVRVIDPRRGTVVSEGVAHE  215 (472)
T ss_pred             hhhHhhccCCceEEEEeccCCceeeecC--CCCeEEEEEeccCCceeee-ecccceeEEEcCCCCcEeeeccccc
Confidence            5557778889999999999999877743  44434433222 2333334 3346799999999999999985443


No 106
>PHA03098 kelch-like protein; Provisional
Probab=82.12  E-value=47  Score=39.06  Aligned_cols=135  Identities=9%  Similarity=0.080  Sum_probs=69.4

Q ss_pred             eeCCEEEEEEccC------CeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEEEC-------CEEEEE
Q 003800           93 ALGKYVITLSSDG------STLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSSK-------GCLHAV  159 (794)
Q Consensus        93 ~~g~~~V~Vs~~g------~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~~-------g~l~al  159 (794)
                      ..++.++++||.+      ..++.||+.+++  |+.- .....  +.....   ....++.+++.++       ..+..+
T Consensus       292 ~~~~~lyv~GG~~~~~~~~~~v~~yd~~~~~--W~~~-~~~~~--~R~~~~---~~~~~~~lyv~GG~~~~~~~~~v~~y  363 (534)
T PHA03098        292 VLNNVIYFIGGMNKNNLSVNSVVSYDTKTKS--WNKV-PELIY--PRKNPG---VTVFNNRIYVIGGIYNSISLNTVESW  363 (534)
T ss_pred             EECCEEEEECCCcCCCCeeccEEEEeCCCCe--eeEC-CCCCc--ccccce---EEEECCEEEEEeCCCCCEecceEEEE
Confidence            4567777777642      258889988764  7532 21110  000010   1122566777542       346777


Q ss_pred             ECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCC---ceeEEEEEEcCCCceeeeeeeecccCccCce-EEEcC
Q 003800          160 SSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGS---SQFHAYQINAMNGELLNHETAAFSGGFVGDV-ALVSS  235 (794)
Q Consensus       160 d~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~---~~~~v~ald~~tG~~~w~~~v~~~~~~s~~~-~~vg~  235 (794)
                      |..++  .|+.-.+.+.-...... ...++.+|++|....   ..-.+..+|+.++  .|+..-..|....+.+ ...++
T Consensus       364 d~~~~--~W~~~~~lp~~r~~~~~-~~~~~~iYv~GG~~~~~~~~~~v~~yd~~t~--~W~~~~~~p~~r~~~~~~~~~~  438 (534)
T PHA03098        364 KPGES--KWREEPPLIFPRYNPCV-VNVNNLIYVIGGISKNDELLKTVECFSLNTN--KWSKGSPLPISHYGGCAIYHDG  438 (534)
T ss_pred             cCCCC--ceeeCCCcCcCCccceE-EEECCEEEEECCcCCCCcccceEEEEeCCCC--eeeecCCCCccccCceEEEECC
Confidence            87765  58764433221000111 245889999875311   1135788998875  4877544454444433 33354


Q ss_pred             cEEEE
Q 003800          236 DTLVT  240 (794)
Q Consensus       236 ~~lv~  240 (794)
                      .++++
T Consensus       439 ~iyv~  443 (534)
T PHA03098        439 KIYVI  443 (534)
T ss_pred             EEEEE
Confidence            44444


No 107
>PHA02790 Kelch-like protein; Provisional
Probab=82.11  E-value=57  Score=38.01  Aligned_cols=147  Identities=10%  Similarity=0.004  Sum_probs=75.6

Q ss_pred             eCCEEEEEEccC-----CeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEEEC----CEEEEEECCCC
Q 003800           94 LGKYVITLSSDG-----STLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSSK----GCLHAVSSIDG  164 (794)
Q Consensus        94 ~g~~~V~Vs~~g-----~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~~----g~l~ald~~tG  164 (794)
                      .++.++++||.+     ..+..+|+.+++  |..-..-+.. .  ....   ...-++.+|+.++    ..+.++|+.++
T Consensus       270 ~~~~lyviGG~~~~~~~~~v~~Ydp~~~~--W~~~~~m~~~-r--~~~~---~v~~~~~iYviGG~~~~~sve~ydp~~n  341 (480)
T PHA02790        270 VGEVVYLIGGWMNNEIHNNAIAVNYISNN--WIPIPPMNSP-R--LYAS---GVPANNKLYVVGGLPNPTSVERWFHGDA  341 (480)
T ss_pred             ECCEEEEEcCCCCCCcCCeEEEEECCCCE--EEECCCCCch-h--hcce---EEEECCEEEEECCcCCCCceEEEECCCC
Confidence            566666667632     357889998765  7654322111 0  0011   1122567777632    34677776544


Q ss_pred             cEEEEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeeeeeecccCccCceE-EEcCcEEEEEEC
Q 003800          165 EILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHETAAFSGGFVGDVA-LVSSDTLVTLDT  243 (794)
Q Consensus       165 ~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~~~~s~~~~-~vg~~~lv~~d~  243 (794)
                        .|+.-.+.+.- ........-++.+|++|...+..-.+.++|+.+.  .|+..-..+......+. .+++.++++.  
T Consensus       342 --~W~~~~~l~~~-r~~~~~~~~~g~IYviGG~~~~~~~ve~ydp~~~--~W~~~~~m~~~r~~~~~~~~~~~IYv~G--  414 (480)
T PHA02790        342 --AWVNMPSLLKP-RCNPAVASINNVIYVIGGHSETDTTTEYLLPNHD--QWQFGPSTYYPHYKSCALVFGRRLFLVG--  414 (480)
T ss_pred             --eEEECCCCCCC-CcccEEEEECCEEEEecCcCCCCccEEEEeCCCC--EEEeCCCCCCccccceEEEECCEEEEEC--
Confidence              58764443321 1111113568999998754322235678898765  68874333333333233 2354555443  


Q ss_pred             CCCeEEEEEeecce
Q 003800          244 TRSILVTVSFKNRK  257 (794)
Q Consensus       244 ~~g~L~v~~l~sg~  257 (794)
                        |...+.|.++++
T Consensus       415 --G~~e~ydp~~~~  426 (480)
T PHA02790        415 --RNAEFYCESSNT  426 (480)
T ss_pred             --CceEEecCCCCc
Confidence              334455655554


No 108
>KOG2321 consensus WD40 repeat protein [General function prediction only]
Probab=81.32  E-value=36  Score=39.74  Aligned_cols=176  Identities=13%  Similarity=0.131  Sum_probs=92.5

Q ss_pred             EEEEeCCCEEEEEECcCCccceEEEcCcc-cceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccc----c
Q 003800           56 VVVSTEENVIASLDLRHGEIFWRHVLGIN-DVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKH----S  130 (794)
Q Consensus        56 Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~-~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~----s  130 (794)
                      +|++.....|+.||++-|+  |=..++.. +.+....+..-.+++.+|+..+.|-+||+.+-...=.......+.    .
T Consensus       148 ly~~gsg~evYRlNLEqGr--fL~P~~~~~~~lN~v~in~~hgLla~Gt~~g~VEfwDpR~ksrv~~l~~~~~v~s~pg~  225 (703)
T KOG2321|consen  148 LYLVGSGSEVYRLNLEQGR--FLNPFETDSGELNVVSINEEHGLLACGTEDGVVEFWDPRDKSRVGTLDAASSVNSHPGG  225 (703)
T ss_pred             EEEeecCcceEEEEccccc--cccccccccccceeeeecCccceEEecccCceEEEecchhhhhheeeecccccCCCccc
Confidence            8888888889999999995  44444433 122223223345677778877899999998876655554433211    0


Q ss_pred             CCccccccccccccCC-eEEEE-ECCEEEEEECCCCcEEEEEeccCcc-eeeeeEEEEecCCEEEEEEecCCceeEEEEE
Q 003800          131 KPLLLVPTNLKVDKDS-LILVS-SKGCLHAVSSIDGEILWTRDFAAES-VEVQQVIQLDESDQIYVVGYAGSSQFHAYQI  207 (794)
Q Consensus       131 ~~~~~~~~~~~~~~~~-~V~V~-~~g~l~ald~~tG~~~W~~~~~~~~-~~~~~~v~s~~~~~vyv~~~~g~~~~~v~al  207 (794)
                      ...+.+. ++.-..++ .+-|. +.|.++-+|..+-+++-.-+....- +......+....++|+  +.+..   .+-..
T Consensus       226 ~~~~svT-al~F~d~gL~~aVGts~G~v~iyDLRa~~pl~~kdh~~e~pi~~l~~~~~~~q~~v~--S~Dk~---~~kiW  299 (703)
T KOG2321|consen  226 DAAPSVT-ALKFRDDGLHVAVGTSTGSVLIYDLRASKPLLVKDHGYELPIKKLDWQDTDQQNKVV--SMDKR---ILKIW  299 (703)
T ss_pred             cccCcce-EEEecCCceeEEeeccCCcEEEEEcccCCceeecccCCccceeeecccccCCCceEE--ecchH---Hhhhc
Confidence            0111111 00111112 23344 4888888888877776654433211 0000011111112222  33321   34445


Q ss_pred             EcCCCceeeeeeeecccCccCceEEEcCcEEEEE
Q 003800          208 NAMNGELLNHETAAFSGGFVGDVALVSSDTLVTL  241 (794)
Q Consensus       208 d~~tG~~~w~~~v~~~~~~s~~~~~vg~~~lv~~  241 (794)
                      |..||++.-..  ....++..-|.+.+.+++..+
T Consensus       300 d~~~Gk~~asi--Ept~~lND~C~~p~sGm~f~A  331 (703)
T KOG2321|consen  300 DECTGKPMASI--EPTSDLNDFCFVPGSGMFFTA  331 (703)
T ss_pred             ccccCCceeec--cccCCcCceeeecCCceEEEe
Confidence            67777766443  223456667888777765444


No 109
>KOG0293 consensus WD40 repeat-containing protein [Function unknown]
Probab=81.16  E-value=36  Score=38.16  Aligned_cols=212  Identities=10%  Similarity=0.053  Sum_probs=97.1

Q ss_pred             EeeEEeccCceeeeeeeeeccCCCEEEEEeCCCEEEEEECcCCccceEEEcCcccceee-eeeeeCCEEEEEEccCCeEE
Q 003800           31 MDWHQQYIGKVKHAVFHTQKTGRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDG-IDIALGKYVITLSSDGSTLR  109 (794)
Q Consensus        31 ~dW~~~~vG~~~~~~f~~~~~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~-l~~~~g~~~V~Vs~~g~~v~  109 (794)
                      +.=.+.++|..+...|..=++++..+.+-..+-.+.--|+.||+.+=...-+.+.+... .-...|..+|+ |+.++.+.
T Consensus       259 ~kl~~tlvgh~~~V~yi~wSPDdryLlaCg~~e~~~lwDv~tgd~~~~y~~~~~~S~~sc~W~pDg~~~V~-Gs~dr~i~  337 (519)
T KOG0293|consen  259 FKLKKTLVGHSQPVSYIMWSPDDRYLLACGFDEVLSLWDVDTGDLRHLYPSGLGFSVSSCAWCPDGFRFVT-GSPDRTII  337 (519)
T ss_pred             eeeeeeeecccCceEEEEECCCCCeEEecCchHheeeccCCcchhhhhcccCcCCCcceeEEccCCceeEe-cCCCCcEE
Confidence            33344455554444444434555556665556677788999998754443331112221 11234555444 66678999


Q ss_pred             EEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEE-ECCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecC
Q 003800          110 AWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVS-SKGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDES  188 (794)
Q Consensus       110 A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~  188 (794)
                      +||. ||.++=.++.-..     +.+..++.+.+ ++.++.. .+.++..++..+-.-+=......+-   .+.. ...+
T Consensus       338 ~wdl-Dgn~~~~W~gvr~-----~~v~dlait~D-gk~vl~v~~d~~i~l~~~e~~~dr~lise~~~i---ts~~-iS~d  406 (519)
T KOG0293|consen  338 MWDL-DGNILGNWEGVRD-----PKVHDLAITYD-GKYVLLVTVDKKIRLYNREARVDRGLISEEQPI---TSFS-ISKD  406 (519)
T ss_pred             EecC-Ccchhhccccccc-----ceeEEEEEcCC-CcEEEEEecccceeeechhhhhhhccccccCce---eEEE-EcCC
Confidence            9997 7887643332211     11111111222 3444444 4777777765432111000001110   0111 1345


Q ss_pred             CEEEEEEecCCceeEEEEEEcCCCceeeeeeeecc-cCccCceEEEcCcEEEEEECCCCeEEEEEeecce
Q 003800          189 DQIYVVGYAGSSQFHAYQINAMNGELLNHETAAFS-GGFVGDVALVSSDTLVTLDTTRSILVTVSFKNRK  257 (794)
Q Consensus       189 ~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~-~~~s~~~~~vg~~~lv~~d~~~g~L~v~~l~sg~  257 (794)
                      +++..+.+...   .+.-.|.+.-..+-++.-... .-+-++|+-.++..++..-+..+++++=+..+|+
T Consensus       407 ~k~~LvnL~~q---ei~LWDl~e~~lv~kY~Ghkq~~fiIrSCFgg~~~~fiaSGSED~kvyIWhr~sgk  473 (519)
T KOG0293|consen  407 GKLALVNLQDQ---EIHLWDLEENKLVRKYFGHKQGHFIIRSCFGGGNDKFIASGSEDSKVYIWHRISGK  473 (519)
T ss_pred             CcEEEEEcccC---eeEEeecchhhHHHHhhcccccceEEEeccCCCCcceEEecCCCceEEEEEccCCc
Confidence            55555555432   334444443222222210000 0111244322232455555566788888877777


No 110
>PRK00178 tolB translocation protein TolB; Provisional
Probab=80.10  E-value=1.1e+02  Score=34.65  Aligned_cols=187  Identities=14%  Similarity=0.148  Sum_probs=88.6

Q ss_pred             CEEEEEeCCC------EEEEEECcCCccceEEEcCcccceeeeee-eeCCEEEEEEcc--CCeEEEEeCCCCcEeEEEec
Q 003800           54 KRVVVSTEEN------VIASLDLRHGEIFWRHVLGINDVVDGIDI-ALGKYVITLSSD--GSTLRAWNLPDGQMVWESFL  124 (794)
Q Consensus        54 ~~Vyv~t~~g------~l~ALn~~tG~ivWR~~l~~~~~i~~l~~-~~g~~~V~Vs~~--g~~v~A~d~~tG~llWe~~l  124 (794)
                      ..+|+.+...      .|...|...+. . ++.+.....+..... ..|+.+++++..  ...|+.||..+|+..--...
T Consensus       164 ~ia~v~~~~~~~~~~~~l~~~d~~g~~-~-~~l~~~~~~~~~p~wSpDG~~la~~s~~~~~~~l~~~~l~~g~~~~l~~~  241 (430)
T PRK00178        164 RILYVTAERFSVNTRYTLQRSDYDGAR-A-VTLLQSREPILSPRWSPDGKRIAYVSFEQKRPRIFVQNLDTGRREQITNF  241 (430)
T ss_pred             eEEEEEeeCCCCCcceEEEEECCCCCC-c-eEEecCCCceeeeeECCCCCEEEEEEcCCCCCEEEEEECCCCCEEEccCC
Confidence            3466654322      47777876443 3 222222222222111 246667776643  35799999999976432222


Q ss_pred             cCccccCCccccccccccccCCeEEE-EE-C--CEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCCc
Q 003800          125 RGSKHSKPLLLVPTNLKVDKDSLILV-SS-K--GCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSS  200 (794)
Q Consensus       125 ~~~~~s~~~~~~~~~~~~~~~~~V~V-~~-~--g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~  200 (794)
                      .+..  ..+.+.     .+ ++.+++ .. +  ..++.+|..+|+..--........   ....+.++..+++.+..++ 
T Consensus       242 ~g~~--~~~~~S-----pD-G~~la~~~~~~g~~~Iy~~d~~~~~~~~lt~~~~~~~---~~~~spDg~~i~f~s~~~g-  309 (430)
T PRK00178        242 EGLN--GAPAWS-----PD-GSKLAFVLSKDGNPEIYVMDLASRQLSRVTNHPAIDT---EPFWGKDGRTLYFTSDRGG-  309 (430)
T ss_pred             CCCc--CCeEEC-----CC-CCEEEEEEccCCCceEEEEECCCCCeEEcccCCCCcC---CeEECCCCCEEEEEECCCC-
Confidence            2111  111121     22 334443 32 3  379999999887532111111111   1111335556665553322 


Q ss_pred             eeEEEEEEcCCCceeeeeeeecccCccCceEE-EcCcEEEEEECCCC--eEEEEEeecce
Q 003800          201 QFHAYQINAMNGELLNHETAAFSGGFVGDVAL-VSSDTLVTLDTTRS--ILVTVSFKNRK  257 (794)
Q Consensus       201 ~~~v~ald~~tG~~~w~~~v~~~~~~s~~~~~-vg~~~lv~~d~~~g--~L~v~~l~sg~  257 (794)
                      ...++.+|+.+|+...   +.........+.+ ..++.+++.....+  .++..|+.+++
T Consensus       310 ~~~iy~~d~~~g~~~~---lt~~~~~~~~~~~Spdg~~i~~~~~~~~~~~l~~~dl~tg~  366 (430)
T PRK00178        310 KPQIYKVNVNGGRAER---VTFVGNYNARPRLSADGKTLVMVHRQDGNFHVAAQDLQRGS  366 (430)
T ss_pred             CceEEEEECCCCCEEE---eecCCCCccceEECCCCCEEEEEEccCCceEEEEEECCCCC
Confidence            2368888998887431   1111111111222 23345555543333  57788888776


No 111
>COG3391 Uncharacterized conserved protein [Function unknown]
Probab=79.53  E-value=1.1e+02  Score=34.35  Aligned_cols=191  Identities=10%  Similarity=0.170  Sum_probs=106.6

Q ss_pred             CCCEEEEEeCC-CEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEcc---CCeEEEEeCCCCcEeEEEeccCc
Q 003800           52 GRKRVVVSTEE-NVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSD---GSTLRAWNLPDGQMVWESFLRGS  127 (794)
Q Consensus        52 ~~~~Vyv~t~~-g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~---g~~v~A~d~~tG~llWe~~l~~~  127 (794)
                      ..+++|+.+.+ +.+..+|.++=.+.=.......  ..++.+...+..++|+..   .+.+..+|..++++.=+......
T Consensus        84 ~~~~vyv~~~~~~~v~vid~~~~~~~~~~~vG~~--P~~~~~~~~~~~vYV~n~~~~~~~vsvid~~t~~~~~~~~vG~~  161 (381)
T COG3391          84 AGNKVYVTTGDSNTVSVIDTATNTVLGSIPVGLG--PVGLAVDPDGKYVYVANAGNGNNTVSVIDAATNKVTATIPVGNT  161 (381)
T ss_pred             CCCeEEEecCCCCeEEEEcCcccceeeEeeeccC--CceEEECCCCCEEEEEecccCCceEEEEeCCCCeEEEEEecCCC
Confidence            46779998875 8899999554333222222211  113323334556666543   57999999999988766554331


Q ss_pred             cccCCccccccccccccCCeEEEEE--CCEEEEEECCCCcEEEEEeccCcc----eeeeeEEEEecCCEEEEEEecCCce
Q 003800          128 KHSKPLLLVPTNLKVDKDSLILVSS--KGCLHAVSSIDGEILWTRDFAAES----VEVQQVIQLDESDQIYVVGYAGSSQ  201 (794)
Q Consensus       128 ~~s~~~~~~~~~~~~~~~~~V~V~~--~g~l~ald~~tG~~~W~~~~~~~~----~~~~~~v~s~~~~~vyv~~~~g~~~  201 (794)
                      ..  .  +..   ..+ +..+++..  ++.+..+| .++..+|+ ..+...    ..|..+....++..+|+.... ...
T Consensus       162 P~--~--~a~---~p~-g~~vyv~~~~~~~v~vi~-~~~~~v~~-~~~~~~~~~~~~P~~i~v~~~g~~~yV~~~~-~~~  230 (381)
T COG3391         162 PT--G--VAV---DPD-GNKVYVTNSDDNTVSVID-TSGNSVVR-GSVGSLVGVGTGPAGIAVDPDGNRVYVANDG-SGS  230 (381)
T ss_pred             cc--e--EEE---CCC-CCeEEEEecCCCeEEEEe-CCCcceec-cccccccccCCCCceEEECCCCCEEEEEecc-CCC
Confidence            11  1  111   122 45577774  89999999 55566665 222111    123444323466678875543 223


Q ss_pred             eEEEEEEcCCCceeeee-eeecccCccCceEEE-cCcEEEEEECCCCeEEEEEeecce
Q 003800          202 FHAYQINAMNGELLNHE-TAAFSGGFVGDVALV-SSDTLVTLDTTRSILVTVSFKNRK  257 (794)
Q Consensus       202 ~~v~ald~~tG~~~w~~-~v~~~~~~s~~~~~v-g~~~lv~~d~~~g~L~v~~l~sg~  257 (794)
                      ..+..+|..+|...+.. ..... ...+ ..+. .+..++..+...+.+.++|..+..
T Consensus       231 ~~v~~id~~~~~v~~~~~~~~~~-~~~~-v~~~p~g~~~yv~~~~~~~V~vid~~~~~  286 (381)
T COG3391         231 NNVLKIDTATGNVTATDLPVGSG-APRG-VAVDPAGKAAYVANSQGGTVSVIDGATDR  286 (381)
T ss_pred             ceEEEEeCCCceEEEeccccccC-CCCc-eeECCCCCEEEEEecCCCeEEEEeCCCCc
Confidence            47899999999999873 22221 1111 1111 223444444445778888877755


No 112
>KOG0279 consensus G protein beta subunit-like protein [Signal transduction mechanisms]
Probab=79.40  E-value=78  Score=33.96  Aligned_cols=70  Identities=16%  Similarity=0.179  Sum_probs=41.9

Q ss_pred             CCCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeC-CEEEEE-EccCCeEEEEeCCCCcEeEE
Q 003800           52 GRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALG-KYVITL-SSDGSTLRAWNLPDGQMVWE  121 (794)
Q Consensus        52 ~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g-~~~V~V-s~~g~~v~A~d~~tG~llWe  121 (794)
                      ++..|+.++.+..+---|...+...=++.-.+.+=+..++..-. ..-+++ ++.+.+|+.||..+=+++=.
T Consensus       116 dn~qivSGSrDkTiklwnt~g~ck~t~~~~~~~~WVscvrfsP~~~~p~Ivs~s~DktvKvWnl~~~~l~~~  187 (315)
T KOG0279|consen  116 DNRQIVSGSRDKTIKLWNTLGVCKYTIHEDSHREWVSCVRFSPNESNPIIVSASWDKTVKVWNLRNCQLRTT  187 (315)
T ss_pred             CCceeecCCCcceeeeeeecccEEEEEecCCCcCcEEEEEEcCCCCCcEEEEccCCceEEEEccCCcchhhc
Confidence            35558888999988888877665444433322222334432222 234444 45678999999976665533


No 113
>KOG4441 consensus Proteins containing BTB/POZ and Kelch domains, involved in regulatory/signal transduction processes [Signal transduction mechanisms; General function prediction only]
Probab=79.06  E-value=68  Score=38.37  Aligned_cols=172  Identities=11%  Similarity=0.096  Sum_probs=94.6

Q ss_pred             CCEEEEEeCCC-------EEEEEECcCCccceEEEcCccc--ceeeeeeeeCCEEEEEEccC-----CeEEEEeCCCCcE
Q 003800           53 RKRVVVSTEEN-------VIASLDLRHGEIFWRHVLGIND--VVDGIDIALGKYVITLSSDG-----STLRAWNLPDGQM  118 (794)
Q Consensus        53 ~~~Vyv~t~~g-------~l~ALn~~tG~ivWR~~l~~~~--~i~~l~~~~g~~~V~Vs~~g-----~~v~A~d~~tG~l  118 (794)
                      ++.||+..+.+       .+...|+++++  |++.-+-..  .-.++ .+.++.+.+|||.+     ..+--||+.  .-
T Consensus       332 ~~~lYv~GG~~~~~~~l~~ve~YD~~~~~--W~~~a~M~~~R~~~~v-~~l~g~iYavGG~dg~~~l~svE~YDp~--~~  406 (571)
T KOG4441|consen  332 NGKLYVVGGYDSGSDRLSSVERYDPRTNQ--WTPVAPMNTKRSDFGV-AVLDGKLYAVGGFDGEKSLNSVECYDPV--TN  406 (571)
T ss_pred             CCEEEEEccccCCCcccceEEEecCCCCc--eeccCCccCcccccee-EEECCEEEEEeccccccccccEEEecCC--CC
Confidence            77899877643       57888999998  998443221  11123 34577777777753     236667664  45


Q ss_pred             eEEEeccCccccCCccccccccccccCCeEEEEEC--------CEEEEEECCCCcEEEEEeccCcce-eeeeEEEEecCC
Q 003800          119 VWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSSK--------GCLHAVSSIDGEILWTRDFAAESV-EVQQVIQLDESD  189 (794)
Q Consensus       119 lWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~~--------g~l~ald~~tG~~~W~~~~~~~~~-~~~~~v~s~~~~  189 (794)
                      .|+.-..-... ..  ..+   ...-++.+|+..+        ..+.++|+.++  .|+...+.... ....+  +.-++
T Consensus       407 ~W~~va~m~~~-r~--~~g---v~~~~g~iYi~GG~~~~~~~l~sve~YDP~t~--~W~~~~~M~~~R~~~g~--a~~~~  476 (571)
T KOG4441|consen  407 KWTPVAPMLTR-RS--GHG---VAVLGGKLYIIGGGDGSSNCLNSVECYDPETN--TWTLIAPMNTRRSGFGV--AVLNG  476 (571)
T ss_pred             cccccCCCCcc-ee--eeE---EEEECCEEEEEcCcCCCccccceEEEEcCCCC--ceeecCCcccccccceE--EEECC
Confidence            67765532211 00  111   1122567777532        46788888775  58876654432 11112  35689


Q ss_pred             EEEEEEecCCc--eeEEEEEEcCCCceeeeeeeecccCccC-ceEEEcCcEEEEE
Q 003800          190 QIYVVGYAGSS--QFHAYQINAMNGELLNHETAAFSGGFVG-DVALVSSDTLVTL  241 (794)
Q Consensus       190 ~vyv~~~~g~~--~~~v~ald~~tG~~~w~~~v~~~~~~s~-~~~~vg~~~lv~~  241 (794)
                      .+|++|...+.  --.+.++|+.+-  .|..--..+...++ .+..+++.++++.
T Consensus       477 ~iYvvGG~~~~~~~~~VE~ydp~~~--~W~~v~~m~~~rs~~g~~~~~~~ly~vG  529 (571)
T KOG4441|consen  477 KIYVVGGFDGTSALSSVERYDPETN--QWTMVAPMTSPRSAVGVVVLGGKLYAVG  529 (571)
T ss_pred             EEEEECCccCCCccceEEEEcCCCC--ceeEcccCccccccccEEEECCEEEEEe
Confidence            99988754321  134788998865  35553223333333 2344454444443


No 114
>PRK04043 tolB translocation protein TolB; Provisional
Probab=78.92  E-value=1.3e+02  Score=34.54  Aligned_cols=148  Identities=10%  Similarity=0.059  Sum_probs=74.5

Q ss_pred             cCCCE-EEEEeC---CCEEEEEECcCCccceEEEcCcccceeeeee-eeCCEEEEEEc--cCCeEEEEeCCCCcEeEEEe
Q 003800           51 TGRKR-VVVSTE---ENVIASLDLRHGEIFWRHVLGINDVVDGIDI-ALGKYVITLSS--DGSTLRAWNLPDGQMVWESF  123 (794)
Q Consensus        51 ~~~~~-Vyv~t~---~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~-~~g~~~V~Vs~--~g~~v~A~d~~tG~llWe~~  123 (794)
                      +++++ +|+.+.   ...|+.+|..+|+..  +....++....... ..|+.+++...  ....++.+|..+|..  +.-
T Consensus       197 pDG~~~i~y~s~~~~~~~Iyv~dl~tg~~~--~lt~~~g~~~~~~~SPDG~~la~~~~~~g~~~Iy~~dl~~g~~--~~L  272 (419)
T PRK04043        197 NKEQTAFYYTSYGERKPTLYKYNLYTGKKE--KIASSQGMLVVSDVSKDGSKLLLTMAPKGQPDIYLYDTNTKTL--TQI  272 (419)
T ss_pred             CCCCcEEEEEEccCCCCEEEEEECCCCcEE--EEecCCCcEEeeEECCCCCEEEEEEccCCCcEEEEEECCCCcE--EEc
Confidence            34443 665443   357999999998652  22222221111111 24555665533  235799999988863  222


Q ss_pred             ccCccccCCccccccccccccCCeEEEEEC----CEEEEEECCCCcE-EEEEeccCcceeeeeEEEEecCCEEEEEEecC
Q 003800          124 LRGSKHSKPLLLVPTNLKVDKDSLILVSSK----GCLHAVSSIDGEI-LWTRDFAAESVEVQQVIQLDESDQIYVVGYAG  198 (794)
Q Consensus       124 l~~~~~s~~~~~~~~~~~~~~~~~V~V~~~----g~l~ald~~tG~~-~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g  198 (794)
                      ...+.....+...     ++ ++.+++.++    ..|+.+|..+|+. +-++.. .  ..+ .+  +.++..+.+.+...
T Consensus       273 T~~~~~d~~p~~S-----PD-G~~I~F~Sdr~g~~~Iy~~dl~~g~~~rlt~~g-~--~~~-~~--SPDG~~Ia~~~~~~  340 (419)
T PRK04043        273 TNYPGIDVNGNFV-----ED-DKRIVFVSDRLGYPNIFMKKLNSGSVEQVVFHG-K--NNS-SV--STYKNYIVYSSRET  340 (419)
T ss_pred             ccCCCccCccEEC-----CC-CCEEEEEECCCCCceEEEEECCCCCeEeCccCC-C--cCc-eE--CCCCCEEEEEEcCC
Confidence            2222110112222     23 344555442    3899999999886 333221 1  111 12  33455554444332


Q ss_pred             C-----ceeEEEEEEcCCCce
Q 003800          199 S-----SQFHAYQINAMNGEL  214 (794)
Q Consensus       199 ~-----~~~~v~ald~~tG~~  214 (794)
                      .     ....++.+|+.+|+.
T Consensus       341 ~~~~~~~~~~I~v~d~~~g~~  361 (419)
T PRK04043        341 NNEFGKNTFNLYLISTNSDYI  361 (419)
T ss_pred             CcccCCCCcEEEEEECCCCCe
Confidence            1     124788889998874


No 115
>COG4946 Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system [Function unknown]
Probab=78.45  E-value=1.3e+02  Score=34.51  Aligned_cols=190  Identities=15%  Similarity=0.160  Sum_probs=96.0

Q ss_pred             eccCCCEEEEEeC---CCEEEEEECcCCccceEEE-cCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEec
Q 003800           49 QKTGRKRVVVSTE---ENVIASLDLRHGEIFWRHV-LGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFL  124 (794)
Q Consensus        49 ~~~~~~~Vyv~t~---~g~l~ALn~~tG~ivWR~~-l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l  124 (794)
                      |...++|||+.|+   -|.|++.|. +|+-+=||. +..-- ...+ -+.|+.+|+ +. +|.++.+|+++-++. ...+
T Consensus       231 PmIV~~RvYFlsD~eG~GnlYSvdl-dGkDlrrHTnFtdYY-~R~~-nsDGkrIvF-q~-~GdIylydP~td~le-kldI  304 (668)
T COG4946         231 PMIVGERVYFLSDHEGVGNLYSVDL-DGKDLRRHTNFTDYY-PRNA-NSDGKRIVF-QN-AGDIYLYDPETDSLE-KLDI  304 (668)
T ss_pred             ceEEcceEEEEecccCccceEEecc-CCchhhhcCCchhcc-cccc-CCCCcEEEE-ec-CCcEEEeCCCcCcce-eeec
Confidence            5556889999997   478999997 576665553 22110 0011 134566666 54 457999999887653 2222


Q ss_pred             c--Cc---c---ccCCccccccccccccCCeEEEEECCEEEEEECCCCcEEEEEeccCcc-eeeeeEEEEecCCEEEEEE
Q 003800          125 R--GS---K---HSKPLLLVPTNLKVDKDSLILVSSKGCLHAVSSIDGEILWTRDFAAES-VEVQQVIQLDESDQIYVVG  195 (794)
Q Consensus       125 ~--~~---~---~s~~~~~~~~~~~~~~~~~V~V~~~g~l~ald~~tG~~~W~~~~~~~~-~~~~~~v~s~~~~~vyv~~  195 (794)
                      .  -.   .   ...+...+. ..+...++.+...+.|..+-.+.-.|-.+   +.+.+. ....+.  ...+..+.+..
T Consensus       305 ~lpl~rk~k~~k~~~pskyle-dfa~~~Gd~ia~VSRGkaFi~~~~~~~~i---qv~~~~~VrY~r~--~~~~e~~vigt  378 (668)
T COG4946         305 GLPLDRKKKQPKFVNPSKYLE-DFAVVNGDYIALVSRGKAFIMRPWDGYSI---QVGKKGGVRYRRI--QVDPEGDVIGT  378 (668)
T ss_pred             CCccccccccccccCHHHhhh-hhccCCCcEEEEEecCcEEEECCCCCeeE---EcCCCCceEEEEE--ccCCcceEEec
Confidence            2  11   0   001111111 01222233333347777777776555322   222221 112222  23444444444


Q ss_pred             ecCCceeEEEEEEcCCCceeeeeeeecccCccCceEEE-cCcEEEEEECCCCeEEEEEeecce
Q 003800          196 YAGSSQFHAYQINAMNGELLNHETAAFSGGFVGDVALV-SSDTLVTLDTTRSILVTVSFKNRK  257 (794)
Q Consensus       196 ~~g~~~~~v~ald~~tG~~~w~~~v~~~~~~s~~~~~v-g~~~lv~~d~~~g~L~v~~l~sg~  257 (794)
                      .+|.   .+..+|..+|+...-.   .+-+.-..+-+- .+..+++. .++..|.++|+.+|.
T Consensus       379 ~dgD---~l~iyd~~~~e~kr~e---~~lg~I~av~vs~dGK~~vva-Ndr~el~vididngn  434 (668)
T COG4946         379 NDGD---KLGIYDKDGGEVKRIE---KDLGNIEAVKVSPDGKKVVVA-NDRFELWVIDIDNGN  434 (668)
T ss_pred             cCCc---eEEEEecCCceEEEee---CCccceEEEEEcCCCcEEEEE-cCceEEEEEEecCCC
Confidence            4554   6788899899854222   111111112211 22334444 357889999999988


No 116
>PF14727 PHTB1_N:  PTHB1 N-terminus
Probab=78.31  E-value=41  Score=38.53  Aligned_cols=93  Identities=17%  Similarity=0.259  Sum_probs=58.6

Q ss_pred             EeeEEeccCcee-eeeeeeeccCCCEEEEEeCCCEEEEEECcCCccceEEEcCcccc-eeeeee---eeCC--EEEEEEc
Q 003800           31 MDWHQQYIGKVK-HAVFHTQKTGRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDV-VDGIDI---ALGK--YVITLSS  103 (794)
Q Consensus        31 ~dW~~~~vG~~~-~~~f~~~~~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~-i~~l~~---~~g~--~~V~Vs~  103 (794)
                      .||...+ |.+. .-...+-......|+|..+.+ |++|+. +|++.|..+|+.... ...++.   ..++  ..+.|++
T Consensus       231 ~dWs~nl-GE~~l~i~v~~~~~~~~~IvvLger~-Lf~l~~-~G~l~~~krLd~~p~~~~~Y~~~~~~~~~~~~~llV~t  307 (418)
T PF14727_consen  231 PDWSFNL-GEQALDIQVVRFSSSESDIVVLGERS-LFCLKD-NGSLRFQKRLDYNPSCFCPYRVPWYNEPSTRLNLLVGT  307 (418)
T ss_pred             ceeEEEC-CceeEEEEEEEcCCCCceEEEEecce-EEEEcC-CCeEEEEEecCCceeeEEEEEeecccCCCCceEEEEEe
Confidence            8999865 7654 211111111244577777665 899996 799999999976521 111111   1111  2356677


Q ss_pred             cCCeEEEEeCCCCcEeEEEeccCcc
Q 003800          104 DGSTLRAWNLPDGQMVWESFLRGSK  128 (794)
Q Consensus       104 ~g~~v~A~d~~tG~llWe~~l~~~~  128 (794)
                      ..+++.-|  .+.+++|...+....
T Consensus       308 ~t~~LlVy--~d~~L~WsA~l~~~P  330 (418)
T PF14727_consen  308 HTGTLLVY--EDTTLVWSAQLPHVP  330 (418)
T ss_pred             cCCeEEEE--eCCeEEEecCCCCCC
Confidence            66799999  489999999986543


No 117
>KOG4499 consensus Ca2+-binding protein Regucalcin/SMP30 [Inorganic ion transport and metabolism; Signal transduction mechanisms]
Probab=77.57  E-value=43  Score=35.11  Aligned_cols=115  Identities=16%  Similarity=0.299  Sum_probs=66.1

Q ss_pred             CCeEEEEeCCCC-cEeEEEeccCccccCCccccccccccccCCeEEE-E-E-CCEEEEEE--CCCCcE-----EEEEecc
Q 003800          105 GSTLRAWNLPDG-QMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILV-S-S-KGCLHAVS--SIDGEI-----LWTRDFA  173 (794)
Q Consensus       105 g~~v~A~d~~tG-~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V-~-~-~g~l~ald--~~tG~~-----~W~~~~~  173 (794)
                      ++.+|.|-+..- +++|..-.-+..+           +++.+...+. . + +-++-|+|  ..+|..     +...+..
T Consensus       138 ~g~Ly~~~~~h~v~~i~~~v~IsNgl-----------~Wd~d~K~fY~iDsln~~V~a~dyd~~tG~~snr~~i~dlrk~  206 (310)
T KOG4499|consen  138 GGELYSWLAGHQVELIWNCVGISNGL-----------AWDSDAKKFYYIDSLNYEVDAYDYDCPTGDLSNRKVIFDLRKS  206 (310)
T ss_pred             ccEEEEeccCCCceeeehhccCCccc-----------cccccCcEEEEEccCceEEeeeecCCCcccccCcceeEEeccC
Confidence            456777765322 3445443322211           3343444433 3 2 56775555  777753     3333221


Q ss_pred             C--cceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeeeeeecccCccCceEEEcCc
Q 003800          174 A--ESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHETAAFSGGFVGDVALVSSD  236 (794)
Q Consensus       174 ~--~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~~~~s~~~~~vg~~  236 (794)
                      .  ....|..+. .-..|.+|+..+.|+   +|.-+|+.||+++-+..+..+. + .+|-++|.|
T Consensus       207 ~~~e~~~PDGm~-ID~eG~L~Va~~ng~---~V~~~dp~tGK~L~eiklPt~q-i-tsccFgGkn  265 (310)
T KOG4499|consen  207 QPFESLEPDGMT-IDTEGNLYVATFNGG---TVQKVDPTTGKILLEIKLPTPQ-I-TSCCFGGKN  265 (310)
T ss_pred             CCcCCCCCCcce-EccCCcEEEEEecCc---EEEEECCCCCcEEEEEEcCCCc-e-EEEEecCCC
Confidence            1  112233332 236889999999987   9999999999999999776542 2 356666664


No 118
>PLN02193 nitrile-specifier protein
Probab=77.57  E-value=1.5e+02  Score=34.52  Aligned_cols=198  Identities=10%  Similarity=0.129  Sum_probs=99.3

Q ss_pred             CCEEEEEeCC--------CEEEEEECcCCccceEEEcCccc--ce--eeee-eeeCCEEEEEEccC-----CeEEEEeCC
Q 003800           53 RKRVVVSTEE--------NVIASLDLRHGEIFWRHVLGIND--VV--DGID-IALGKYVITLSSDG-----STLRAWNLP  114 (794)
Q Consensus        53 ~~~Vyv~t~~--------g~l~ALn~~tG~ivWR~~l~~~~--~i--~~l~-~~~g~~~V~Vs~~g-----~~v~A~d~~  114 (794)
                      ++.||+....        +.+..+|+++.  .|+..-....  ..  .+.. +..++.+++++|.+     +.++.||+.
T Consensus       175 ~~~iyv~GG~~~~~~~~~~~v~~yD~~~~--~W~~~~~~g~~P~~~~~~~~~v~~~~~lYvfGG~~~~~~~ndv~~yD~~  252 (470)
T PLN02193        175 GNKIYSFGGEFTPNQPIDKHLYVFDLETR--TWSISPATGDVPHLSCLGVRMVSIGSTLYVFGGRDASRQYNGFYSFDTT  252 (470)
T ss_pred             CCEEEEECCcCCCCCCeeCcEEEEECCCC--EEEeCCCCCCCCCCcccceEEEEECCEEEEECCCCCCCCCccEEEEECC
Confidence            6778886552        35889999885  5986322110  00  0111 23566666667642     468999998


Q ss_pred             CCcEeEEEeccCccccCCccccccccccccCCeEEEEEC-------CEEEEEECCCCcEEEEEeccCcce-eee-eEEEE
Q 003800          115 DGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSSK-------GCLHAVSSIDGEILWTRDFAAESV-EVQ-QVIQL  185 (794)
Q Consensus       115 tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~~-------g~l~ald~~tG~~~W~~~~~~~~~-~~~-~~v~s  185 (794)
                      +.  .|+.-......  +.+......... ++.++|+.+       ..+.++|..+.  .|+.-.+.... .++ .....
T Consensus       253 t~--~W~~l~~~~~~--P~~R~~h~~~~~-~~~iYv~GG~~~~~~~~~~~~yd~~t~--~W~~~~~~~~~~~~R~~~~~~  325 (470)
T PLN02193        253 TN--EWKLLTPVEEG--PTPRSFHSMAAD-EENVYVFGGVSATARLKTLDSYNIVDK--KWFHCSTPGDSFSIRGGAGLE  325 (470)
T ss_pred             CC--EEEEcCcCCCC--CCCccceEEEEE-CCEEEEECCCCCCCCcceEEEEECCCC--EEEeCCCCCCCCCCCCCcEEE
Confidence            64  58764321100  111111111122 566777531       34778888764  58753321100 000 00002


Q ss_pred             ecCCEEEEEEec-CCceeEEEEEEcCCCceeeeeeeec---ccCccC-ceEEEcCcEEEEEECC-------------CCe
Q 003800          186 DESDQIYVVGYA-GSSQFHAYQINAMNGELLNHETAAF---SGGFVG-DVALVSSDTLVTLDTT-------------RSI  247 (794)
Q Consensus       186 ~~~~~vyv~~~~-g~~~~~v~ald~~tG~~~w~~~v~~---~~~~s~-~~~~vg~~~lv~~d~~-------------~g~  247 (794)
                      .-++.+|+++.. |...-.+.++|+.+.+  |+..-..   |..... .+..+++.+++..-..             .+.
T Consensus       326 ~~~gkiyviGG~~g~~~~dv~~yD~~t~~--W~~~~~~g~~P~~R~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~~nd  403 (470)
T PLN02193        326 VVQGKVWVVYGFNGCEVDDVHYYDPVQDK--WTQVETFGVRPSERSVFASAAVGKHIVIFGGEIAMDPLAHVGPGQLTDG  403 (470)
T ss_pred             EECCcEEEEECCCCCccCceEEEECCCCE--EEEeccCCCCCCCcceeEEEEECCEEEEECCccCCccccccCccceecc
Confidence            346788877643 2112368899998764  8764221   322222 3334455555443211             124


Q ss_pred             EEEEEeecceeeeEEE
Q 003800          248 LVTVSFKNRKIAFQET  263 (794)
Q Consensus       248 L~v~~l~sg~~~~~~~  263 (794)
                      ++++|+.+.+  ...+
T Consensus       404 v~~~D~~t~~--W~~~  417 (470)
T PLN02193        404 TFALDTETLQ--WERL  417 (470)
T ss_pred             EEEEEcCcCE--EEEc
Confidence            7788887776  5443


No 119
>KOG0270 consensus WD40 repeat-containing protein [Function unknown]
Probab=77.56  E-value=54  Score=37.08  Aligned_cols=94  Identities=17%  Similarity=0.265  Sum_probs=62.6

Q ss_pred             cEeeEEeccCceeee-----------eeeeeccCCCEEEEEeCCCEEEEEECc---CCccceEEEcCcccceeeeeeeeC
Q 003800           30 LMDWHQQYIGKVKHA-----------VFHTQKTGRKRVVVSTEENVIASLDLR---HGEIFWRHVLGINDVVDGIDIALG   95 (794)
Q Consensus        30 ~~dW~~~~vG~~~~~-----------~f~~~~~~~~~Vyv~t~~g~l~ALn~~---tG~ivWR~~l~~~~~i~~l~~~~g   95 (794)
                      .+.|--.. |+|+..           .++ | ...-.++.++.++.|+-.|-|   .-...|+..-+-.. + .. -...
T Consensus       268 V~lWD~~~-g~p~~s~~~~~k~Vq~l~wh-~-~~p~~LLsGs~D~~V~l~D~R~~~~s~~~wk~~g~VEk-v-~w-~~~s  341 (463)
T KOG0270|consen  268 VKLWDVDT-GKPKSSITHHGKKVQTLEWH-P-YEPSVLLSGSYDGTVALKDCRDPSNSGKEWKFDGEVEK-V-AW-DPHS  341 (463)
T ss_pred             EEEEEcCC-CCcceehhhcCCceeEEEec-C-CCceEEEeccccceEEeeeccCccccCceEEeccceEE-E-Ee-cCCC
Confidence            37788765 666532           222 1 113347778889999999888   55677886554331 1 11 0234


Q ss_pred             CEEEEEEccCCeEEEEeCC-CCcEeEEEeccCccc
Q 003800           96 KYVITLSSDGSTLRAWNLP-DGQMVWESFLRGSKH  129 (794)
Q Consensus        96 ~~~V~Vs~~g~~v~A~d~~-tG~llWe~~l~~~~~  129 (794)
                      ....++|.++|.||.+|+. .|+++|+...+....
T Consensus       342 e~~f~~~tddG~v~~~D~R~~~~~vwt~~AHd~~I  376 (463)
T KOG0270|consen  342 ENSFFVSTDDGTVYYFDIRNPGKPVWTLKAHDDEI  376 (463)
T ss_pred             ceeEEEecCCceEEeeecCCCCCceeEEEeccCCc
Confidence            5667778888899999987 679999999887654


No 120
>PLN02153 epithiospecifier protein
Probab=77.55  E-value=1.2e+02  Score=33.42  Aligned_cols=196  Identities=12%  Similarity=0.099  Sum_probs=97.9

Q ss_pred             CCEEEEEeCC--------CEEEEEECcCCccceEEEcCccc--ce--eeee-eeeCCEEEEEEccC-----CeEEEEeCC
Q 003800           53 RKRVVVSTEE--------NVIASLDLRHGEIFWRHVLGIND--VV--DGID-IALGKYVITLSSDG-----STLRAWNLP  114 (794)
Q Consensus        53 ~~~Vyv~t~~--------g~l~ALn~~tG~ivWR~~l~~~~--~i--~~l~-~~~g~~~V~Vs~~g-----~~v~A~d~~  114 (794)
                      ++.||+....        +.+..+|+.+.  .|+..-....  ..  .+.. +..++.++++||..     ..+..||+.
T Consensus        32 ~~~iyv~GG~~~~~~~~~~~~~~yd~~~~--~W~~~~~~~~~p~~~~~~~~~~~~~~~iyv~GG~~~~~~~~~v~~yd~~  109 (341)
T PLN02153         32 GDKLYSFGGELKPNEHIDKDLYVFDFNTH--TWSIAPANGDVPRISCLGVRMVAVGTKLYIFGGRDEKREFSDFYSYDTV  109 (341)
T ss_pred             CCEEEEECCccCCCCceeCcEEEEECCCC--EEEEcCccCCCCCCccCceEEEEECCEEEEECCCCCCCccCcEEEEECC
Confidence            6788886542        46899999886  5986542211  11  0111 34577777777631     358889987


Q ss_pred             CCcEeEEEeccCccccCCccccccccccccCCeEEEEEC-------------CEEEEEECCCCcEEEEEeccCcce-eee
Q 003800          115 DGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSSK-------------GCLHAVSSIDGEILWTRDFAAESV-EVQ  180 (794)
Q Consensus       115 tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~~-------------g~l~ald~~tG~~~W~~~~~~~~~-~~~  180 (794)
                      +  ..|+.--.-.....+.+..... ....++.++|+.+             ..+.++|.++.  .|+.-.+.... .++
T Consensus       110 t--~~W~~~~~~~~~~~p~~R~~~~-~~~~~~~iyv~GG~~~~~~~~~~~~~~~v~~yd~~~~--~W~~l~~~~~~~~~r  184 (341)
T PLN02153        110 K--NEWTFLTKLDEEGGPEARTFHS-MASDENHVYVFGGVSKGGLMKTPERFRTIEAYNIADG--KWVQLPDPGENFEKR  184 (341)
T ss_pred             C--CEEEEeccCCCCCCCCCceeeE-EEEECCEEEEECCccCCCccCCCcccceEEEEECCCC--eEeeCCCCCCCCCCC
Confidence            5  4587532110000010111100 1222566777631             14778888765  58853322110 000


Q ss_pred             -eEEEEecCCEEEEEEec------CCc----eeEEEEEEcCCCceeeeeeee---cccCccC-ceEEEcCcEEEEEECC-
Q 003800          181 -QVIQLDESDQIYVVGYA------GSS----QFHAYQINAMNGELLNHETAA---FSGGFVG-DVALVSSDTLVTLDTT-  244 (794)
Q Consensus       181 -~~v~s~~~~~vyv~~~~------g~~----~~~v~ald~~tG~~~w~~~v~---~~~~~s~-~~~~vg~~~lv~~d~~-  244 (794)
                       ......-++.+|+++..      |+.    .-.+.++|+.+.  .|+..-.   .|..... .++++++.++++.-.. 
T Consensus       185 ~~~~~~~~~~~iyv~GG~~~~~~~gG~~~~~~~~v~~yd~~~~--~W~~~~~~g~~P~~r~~~~~~~~~~~iyv~GG~~~  262 (341)
T PLN02153        185 GGAGFAVVQGKIWVVYGFATSILPGGKSDYESNAVQFFDPASG--KWTEVETTGAKPSARSVFAHAVVGKYIIIFGGEVW  262 (341)
T ss_pred             CcceEEEECCeEEEEeccccccccCCccceecCceEEEEcCCC--cEEeccccCCCCCCcceeeeEEECCEEEEECcccC
Confidence             01112457888886532      110    125788998764  4776321   2333222 3444465555554210 


Q ss_pred             ------------CCeEEEEEeecce
Q 003800          245 ------------RSILVTVSFKNRK  257 (794)
Q Consensus       245 ------------~g~L~v~~l~sg~  257 (794)
                                  ...+++.|+.+.+
T Consensus       263 ~~~~~~~~~~~~~n~v~~~d~~~~~  287 (341)
T PLN02153        263 PDLKGHLGPGTLSNEGYALDTETLV  287 (341)
T ss_pred             CccccccccccccccEEEEEcCccE
Confidence                        1257777877665


No 121
>KOG1036 consensus Mitotic spindle checkpoint protein BUB3, WD repeat superfamily [Cell cycle control, cell division, chromosome partitioning]
Probab=77.03  E-value=79  Score=34.26  Aligned_cols=109  Identities=11%  Similarity=0.048  Sum_probs=64.3

Q ss_pred             ceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEE-ECCEEEEEECCCC
Q 003800           86 VVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVS-SKGCLHAVSSIDG  164 (794)
Q Consensus        86 ~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG  164 (794)
                      .|..+.....++.+.++..++.+|.+|...-.++=+.....+.+  +..+.+       ...+++. -+|.|..+|..+|
T Consensus        15 ~IS~v~f~~~~~~LLvssWDgslrlYdv~~~~l~~~~~~~~plL--~c~F~d-------~~~~~~G~~dg~vr~~Dln~~   85 (323)
T KOG1036|consen   15 GISSVKFSPSSSDLLVSSWDGSLRLYDVPANSLKLKFKHGAPLL--DCAFAD-------ESTIVTGGLDGQVRRYDLNTG   85 (323)
T ss_pred             ceeeEEEcCcCCcEEEEeccCcEEEEeccchhhhhheecCCcee--eeeccC-------CceEEEeccCceEEEEEecCC
Confidence            44444333333445557788899999998777776666665544  112221       3456666 4999999999998


Q ss_pred             cEEEEEeccCcceeeeeEE-EEecCCEEEEEEecCCceeEEEEEEcCC
Q 003800          165 EILWTRDFAAESVEVQQVI-QLDESDQIYVVGYAGSSQFHAYQINAMN  211 (794)
Q Consensus       165 ~~~W~~~~~~~~~~~~~~v-~s~~~~~vyv~~~~g~~~~~v~ald~~t  211 (794)
                      +..=--....+.    +++ ..-..+.+...+.++    .+-.+|+.+
T Consensus        86 ~~~~igth~~~i----~ci~~~~~~~~vIsgsWD~----~ik~wD~R~  125 (323)
T KOG1036|consen   86 NEDQIGTHDEGI----RCIEYSYEVGCVISGSWDK----TIKFWDPRN  125 (323)
T ss_pred             cceeeccCCCce----EEEEeeccCCeEEEcccCc----cEEEEeccc
Confidence            754333333222    232 122356666555554    677778765


No 122
>COG3391 Uncharacterized conserved protein [Function unknown]
Probab=76.93  E-value=1.3e+02  Score=33.77  Aligned_cols=157  Identities=15%  Similarity=0.188  Sum_probs=91.8

Q ss_pred             cCCCEEEEEeC---CCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEc-cCCeEEEEeCCCCcEeEEEeccC
Q 003800           51 TGRKRVVVSTE---ENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSS-DGSTLRAWNLPDGQMVWESFLRG  126 (794)
Q Consensus        51 ~~~~~Vyv~t~---~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~-~g~~v~A~d~~tG~llWe~~l~~  126 (794)
                      .+++.+||+..   .+.+..+|+.++++.=....+.. . .+..+...+..+++.. ..+.+..+|. ++..+|+ ....
T Consensus       125 ~~~~~vYV~n~~~~~~~vsvid~~t~~~~~~~~vG~~-P-~~~a~~p~g~~vyv~~~~~~~v~vi~~-~~~~v~~-~~~~  200 (381)
T COG3391         125 PDGKYVYVANAGNGNNTVSVIDAATNKVTATIPVGNT-P-TGVAVDPDGNKVYVTNSDDNTVSVIDT-SGNSVVR-GSVG  200 (381)
T ss_pred             CCCCEEEEEecccCCceEEEEeCCCCeEEEEEecCCC-c-ceEEECCCCCeEEEEecCCCeEEEEeC-CCcceec-cccc
Confidence            34778999988   68999999999987655433322 1 2222233444455543 4579999994 6777776 3211


Q ss_pred             ccccCCcccccccccccc-CCeEEEEE--C--CEEEEEECCCCcEEEE-EeccCcceeeeeEEEEecCCEEEEEEecCCc
Q 003800          127 SKHSKPLLLVPTNLKVDK-DSLILVSS--K--GCLHAVSSIDGEILWT-RDFAAESVEVQQVIQLDESDQIYVVGYAGSS  200 (794)
Q Consensus       127 ~~~s~~~~~~~~~~~~~~-~~~V~V~~--~--g~l~ald~~tG~~~W~-~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~  200 (794)
                      ...  ...-.+....++. +..++|..  +  +.+..+|..+|.+.|. ......  .+..+.....+..+|+....++ 
T Consensus       201 ~~~--~~~~~P~~i~v~~~g~~~yV~~~~~~~~~v~~id~~~~~v~~~~~~~~~~--~~~~v~~~p~g~~~yv~~~~~~-  275 (381)
T COG3391         201 SLV--GVGTGPAGIAVDPDGNRVYVANDGSGSNNVLKIDTATGNVTATDLPVGSG--APRGVAVDPAGKAAYVANSQGG-  275 (381)
T ss_pred             ccc--ccCCCCceEEECCCCCEEEEEeccCCCceEEEEeCCCceEEEeccccccC--CCCceeECCCCCEEEEEecCCC-
Confidence            111  0000000001222 34577753  3  6999999999999887 333332  1222222345667777654433 


Q ss_pred             eeEEEEEEcCCCceeeee
Q 003800          201 QFHAYQINAMNGELLNHE  218 (794)
Q Consensus       201 ~~~v~ald~~tG~~~w~~  218 (794)
                        .+..+|..+.+.....
T Consensus       276 --~V~vid~~~~~v~~~~  291 (381)
T COG3391         276 --TVSVIDGATDRVVKTG  291 (381)
T ss_pred             --eEEEEeCCCCceeeee
Confidence              7888998888777655


No 123
>PRK04792 tolB translocation protein TolB; Provisional
Probab=76.93  E-value=1.5e+02  Score=34.23  Aligned_cols=150  Identities=9%  Similarity=0.071  Sum_probs=71.9

Q ss_pred             eCCEEEEEEcc--CCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEE-EE-CC--EEEEEECCCCcEE
Q 003800           94 LGKYVITLSSD--GSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILV-SS-KG--CLHAVSSIDGEIL  167 (794)
Q Consensus        94 ~g~~~V~Vs~~--g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V-~~-~g--~l~ald~~tG~~~  167 (794)
                      .|+.+++++..  ...|+.+|..+|+..--....+..  ..+.+.     .+ ++.+++ .. +|  .|+.+|..+|+..
T Consensus       228 DG~~La~~s~~~g~~~L~~~dl~tg~~~~lt~~~g~~--~~~~wS-----PD-G~~La~~~~~~g~~~Iy~~dl~tg~~~  299 (448)
T PRK04792        228 DGRKLAYVSFENRKAEIFVQDIYTQVREKVTSFPGIN--GAPRFS-----PD-GKKLALVLSKDGQPEIYVVDIATKALT  299 (448)
T ss_pred             CCCEEEEEEecCCCcEEEEEECCCCCeEEecCCCCCc--CCeeEC-----CC-CCEEEEEEeCCCCeEEEEEECCCCCeE
Confidence            46667776532  247999999999764322222211  111121     23 334444 33 44  5999999888642


Q ss_pred             EEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeeeeeecccCccCceEEEcCcEEEEEECCCC-
Q 003800          168 WTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHETAAFSGGFVGDVALVSSDTLVTLDTTRS-  246 (794)
Q Consensus       168 W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~~~~s~~~~~vg~~~lv~~d~~~g-  246 (794)
                         +..........+..+.++..+++.+..++ ...++.+|+.+|+...-. ..... .........++.+++.....+ 
T Consensus       300 ---~lt~~~~~~~~p~wSpDG~~I~f~s~~~g-~~~Iy~~dl~~g~~~~Lt-~~g~~-~~~~~~SpDG~~l~~~~~~~g~  373 (448)
T PRK04792        300 ---RITRHRAIDTEPSWHPDGKSLIFTSERGG-KPQIYRVNLASGKVSRLT-FEGEQ-NLGGSITPDGRSMIMVNRTNGK  373 (448)
T ss_pred             ---ECccCCCCccceEECCCCCEEEEEECCCC-CceEEEEECCCCCEEEEe-cCCCC-CcCeeECCCCCEEEEEEecCCc
Confidence               11111100111211334555655443322 247889999988753211 11111 111122223345555443333 


Q ss_pred             -eEEEEEeecce
Q 003800          247 -ILVTVSFKNRK  257 (794)
Q Consensus       247 -~L~v~~l~sg~  257 (794)
                       .++.+++.++.
T Consensus       374 ~~I~~~dl~~g~  385 (448)
T PRK04792        374 FNIARQDLETGA  385 (448)
T ss_pred             eEEEEEECCCCC
Confidence             56777887776


No 124
>PRK14131 N-acetylneuraminic acid mutarotase; Provisional
Probab=76.37  E-value=1.4e+02  Score=33.54  Aligned_cols=36  Identities=11%  Similarity=0.025  Sum_probs=20.4

Q ss_pred             EEEEEcCCCceeeeeeeecccCccC-ceEEEcCcEEEEE
Q 003800          204 AYQINAMNGELLNHETAAFSGGFVG-DVALVSSDTLVTL  241 (794)
Q Consensus       204 v~ald~~tG~~~w~~~v~~~~~~s~-~~~~vg~~~lv~~  241 (794)
                      +.++|+.++  .|+..-..|..... .++.+++.++++.
T Consensus       314 ~e~yd~~~~--~W~~~~~lp~~r~~~~av~~~~~iyv~G  350 (376)
T PRK14131        314 DEIYALVNG--KWQKVGELPQGLAYGVSVSWNNGVLLIG  350 (376)
T ss_pred             hheEEecCC--cccccCcCCCCccceEEEEeCCEEEEEc
Confidence            456888875  48765445544433 2333466666655


No 125
>TIGR02800 propeller_TolB tol-pal system beta propeller repeat protein TolB. The Tol-PAL system is required for bacterial outer membrane integrity. E. coli TolB is involved in the tonB-independent uptake of group A colicins (colicins A, E1, E2, E3 and K), and is necessary for the colicins to reach their respective targets after initial binding to the bacteria. It is also involved in uptake of filamentous DNA. Study of its structure suggest that the TolB protein might be involved in the recycling of peptidoglycan or in its covalent linking with lipoproteins. The Tol-Pal system is also implicated in pathogenesis of E. coli, Haemophilus ducreyi, Salmonella enterica and Vibrio cholerae, but the mechanism(s) is unclear.
Probab=75.66  E-value=1.4e+02  Score=33.41  Aligned_cols=149  Identities=13%  Similarity=0.014  Sum_probs=72.2

Q ss_pred             cCCCEEEEEeCC---CEEEEEECcCCccceEEEcCcccceeeeee-eeCCEEEEEEcc--CCeEEEEeCCCCcEeEEEec
Q 003800           51 TGRKRVVVSTEE---NVIASLDLRHGEIFWRHVLGINDVVDGIDI-ALGKYVITLSSD--GSTLRAWNLPDGQMVWESFL  124 (794)
Q Consensus        51 ~~~~~Vyv~t~~---g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~-~~g~~~V~Vs~~--g~~v~A~d~~tG~llWe~~l  124 (794)
                      ++++.|++.+..   ..|+.+|.++|+..--......  ...... ..++.+++....  ...++.||..+|...   .+
T Consensus       199 pdg~~la~~~~~~~~~~i~v~d~~~g~~~~~~~~~~~--~~~~~~spDg~~l~~~~~~~~~~~i~~~d~~~~~~~---~l  273 (417)
T TIGR02800       199 PDGQKLAYVSFESGKPEIYVQDLATGQREKVASFPGM--NGAPAFSPDGSKLAVSLSKDGNPDIYVMDLDGKQLT---RL  273 (417)
T ss_pred             CCCCEEEEEEcCCCCcEEEEEECCCCCEEEeecCCCC--ccceEECCCCCEEEEEECCCCCccEEEEECCCCCEE---EC
Confidence            445556555532   5799999999975432222211  111111 234455554332  246999999888642   22


Q ss_pred             cC-ccccCCccccccccccccCCeEEEEE----CCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCC
Q 003800          125 RG-SKHSKPLLLVPTNLKVDKDSLILVSS----KGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGS  199 (794)
Q Consensus       125 ~~-~~~s~~~~~~~~~~~~~~~~~V~V~~----~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~  199 (794)
                      .. ......+.+.     .+ ++.+++.+    ...++.+|..+|+..--.. ....  ...+..+..+..+++.+.. +
T Consensus       274 ~~~~~~~~~~~~s-----~d-g~~l~~~s~~~g~~~iy~~d~~~~~~~~l~~-~~~~--~~~~~~spdg~~i~~~~~~-~  343 (417)
T TIGR02800       274 TNGPGIDTEPSWS-----PD-GKSIAFTSDRGGSPQIYMMDADGGEVRRLTF-RGGY--NASPSWSPDGDLIAFVHRE-G  343 (417)
T ss_pred             CCCCCCCCCEEEC-----CC-CCEEEEEECCCCCceEEEEECCCCCEEEeec-CCCC--ccCeEECCCCCEEEEEEcc-C
Confidence            11 1110111111     12 33444433    2379999988887432111 1111  1112112344455544333 2


Q ss_pred             ceeEEEEEEcCCCce
Q 003800          200 SQFHAYQINAMNGEL  214 (794)
Q Consensus       200 ~~~~v~ald~~tG~~  214 (794)
                      ...+++.+|+.+|..
T Consensus       344 ~~~~i~~~d~~~~~~  358 (417)
T TIGR02800       344 GGFNIAVMDLDGGGE  358 (417)
T ss_pred             CceEEEEEeCCCCCe
Confidence            345788899988754


No 126
>PLN02193 nitrile-specifier protein
Probab=75.62  E-value=1.3e+02  Score=34.90  Aligned_cols=152  Identities=14%  Similarity=0.103  Sum_probs=79.8

Q ss_pred             CCEEEEEeCC------CEEEEEECcCCccceEEEcCcc---ccee-eeeeeeCCEEEEEEccC-----CeEEEEeCCCCc
Q 003800           53 RKRVVVSTEE------NVIASLDLRHGEIFWRHVLGIN---DVVD-GIDIALGKYVITLSSDG-----STLRAWNLPDGQ  117 (794)
Q Consensus        53 ~~~Vyv~t~~------g~l~ALn~~tG~ivWR~~l~~~---~~i~-~l~~~~g~~~V~Vs~~g-----~~v~A~d~~tG~  117 (794)
                      ++.||+....      +.+.++|+++.  .|++..+..   ..-. ......++.+++++|.+     ..+..||+.+. 
T Consensus       228 ~~~lYvfGG~~~~~~~ndv~~yD~~t~--~W~~l~~~~~~P~~R~~h~~~~~~~~iYv~GG~~~~~~~~~~~~yd~~t~-  304 (470)
T PLN02193        228 GSTLYVFGGRDASRQYNGFYSFDTTTN--EWKLLTPVEEGPTPRSFHSMAADEENVYVFGGVSATARLKTLDSYNIVDK-  304 (470)
T ss_pred             CCEEEEECCCCCCCCCccEEEEECCCC--EEEEcCcCCCCCCCccceEEEEECCEEEEECCCCCCCCcceEEEEECCCC-
Confidence            6778887652      57999999986  699854321   1111 11123455666666642     34788998865 


Q ss_pred             EeEEEeccCccccCCccccccccccccCCeEEEEE--C----CEEEEEECCCCcEEEEEeccC-----cceeeeeEEEEe
Q 003800          118 MVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSS--K----GCLHAVSSIDGEILWTRDFAA-----ESVEVQQVIQLD  186 (794)
Q Consensus       118 llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~--~----g~l~ald~~tG~~~W~~~~~~-----~~~~~~~~v~s~  186 (794)
                       .|+.--.....  ..+-........ ++.+++..  +    ..++.+|..+.  .|+.-.+.     +.. ...+  +.
T Consensus       305 -~W~~~~~~~~~--~~~R~~~~~~~~-~gkiyviGG~~g~~~~dv~~yD~~t~--~W~~~~~~g~~P~~R~-~~~~--~~  375 (470)
T PLN02193        305 -KWFHCSTPGDS--FSIRGGAGLEVV-QGKVWVVYGFNGCEVDDVHYYDPVQD--KWTQVETFGVRPSERS-VFAS--AA  375 (470)
T ss_pred             -EEEeCCCCCCC--CCCCCCcEEEEE-CCcEEEEECCCCCccCceEEEECCCC--EEEEeccCCCCCCCcc-eeEE--EE
Confidence             58754221110  000000000112 45566653  2    56889998875  48765432     111 1112  24


Q ss_pred             cCCEEEEEEecCC-----------ceeEEEEEEcCCCceeeee
Q 003800          187 ESDQIYVVGYAGS-----------SQFHAYQINAMNGELLNHE  218 (794)
Q Consensus       187 ~~~~vyv~~~~g~-----------~~~~v~ald~~tG~~~w~~  218 (794)
                      -++.+|+.+-...           ..-.+.+||+.|.  .|+.
T Consensus       376 ~~~~iyv~GG~~~~~~~~~~~~~~~~ndv~~~D~~t~--~W~~  416 (470)
T PLN02193        376 VGKHIVIFGGEIAMDPLAHVGPGQLTDGTFALDTETL--QWER  416 (470)
T ss_pred             ECCEEEEECCccCCccccccCccceeccEEEEEcCcC--EEEE
Confidence            5778888764210           0013678887755  4664


No 127
>KOG0282 consensus mRNA splicing factor [Function unknown]
Probab=74.38  E-value=32  Score=39.18  Aligned_cols=73  Identities=12%  Similarity=0.216  Sum_probs=54.6

Q ss_pred             CCCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEecc
Q 003800           52 GRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLR  125 (794)
Q Consensus        52 ~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~  125 (794)
                      .+-+...++-+..|--.|.+||+..=|..+.......-++ +.+..++++|+.+++++.||..+|+++=++.-.
T Consensus       269 ~g~~fLS~sfD~~lKlwDtETG~~~~~f~~~~~~~cvkf~-pd~~n~fl~G~sd~ki~~wDiRs~kvvqeYd~h  341 (503)
T KOG0282|consen  269 CGTSFLSASFDRFLKLWDTETGQVLSRFHLDKVPTCVKFH-PDNQNIFLVGGSDKKIRQWDIRSGKVVQEYDRH  341 (503)
T ss_pred             cCCeeeeeecceeeeeeccccceEEEEEecCCCceeeecC-CCCCcEEEEecCCCcEEEEeccchHHHHHHHhh
Confidence            3556888888999999999999999998886552211222 234577787887889999999999977665543


No 128
>KOG1274 consensus WD40 repeat protein [General function prediction only]
Probab=74.00  E-value=74  Score=39.15  Aligned_cols=119  Identities=12%  Similarity=0.086  Sum_probs=74.1

Q ss_pred             CCCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCcccc-
Q 003800           52 GRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHS-  130 (794)
Q Consensus        52 ~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s-  130 (794)
                      ++..+.+++++-.|-.+|..|+...=... +....+.++.....+..+.++..+|.|+-||..+|.+.-....-..... 
T Consensus       107 ~g~~iaagsdD~~vK~~~~~D~s~~~~lr-gh~apVl~l~~~p~~~fLAvss~dG~v~iw~~~~~~~~~tl~~v~k~n~~  185 (933)
T KOG1274|consen  107 SGKMIAAGSDDTAVKLLNLDDSSQEKVLR-GHDAPVLQLSYDPKGNFLAVSSCDGKVQIWDLQDGILSKTLTGVDKDNEF  185 (933)
T ss_pred             CCcEEEeecCceeEEEEeccccchheeec-ccCCceeeeeEcCCCCEEEEEecCceEEEEEcccchhhhhcccCCccccc
Confidence            45568889999999999999987643321 1122343443334555666666668999999999998866554322110 


Q ss_pred             --CCccccccccccccC-CeEEEE-ECCEEEEEECCCCcEEEEEeccC
Q 003800          131 --KPLLLVPTNLKVDKD-SLILVS-SKGCLHAVSSIDGEILWTRDFAA  174 (794)
Q Consensus       131 --~~~~~~~~~~~~~~~-~~V~V~-~~g~l~ald~~tG~~~W~~~~~~  174 (794)
                        ..+...+   ++..+ +...+. .++.|..++..+++.....+...
T Consensus       186 ~~s~i~~~~---aW~Pk~g~la~~~~d~~Vkvy~r~~we~~f~Lr~~~  230 (933)
T KOG1274|consen  186 ILSRICTRL---AWHPKGGTLAVPPVDNTVKVYSRKGWELQFKLRDKL  230 (933)
T ss_pred             cccceeeee---eecCCCCeEEeeccCCeEEEEccCCceeheeecccc
Confidence              0011111   22222 444444 58999999999888887776543


No 129
>KOG0275 consensus Conserved WD40 repeat-containing protein [General function prediction only]
Probab=73.87  E-value=89  Score=34.02  Aligned_cols=181  Identities=15%  Similarity=0.250  Sum_probs=100.4

Q ss_pred             ccccc-EeeEEeccCceeeeeeeee----------ccCCCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeee-e
Q 003800           26 DQVGL-MDWHQQYIGKVKHAVFHTQ----------KTGRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDI-A   93 (794)
Q Consensus        26 dqvG~-~dW~~~~vG~~~~~~f~~~----------~~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~-~   93 (794)
                      .|.|+ ..|+...--..+  .|++.          +.+...|.-++-+..+----.++|+.+=...--.. -+.-... .
T Consensus       282 sqDGkIKvWri~tG~ClR--rFdrAHtkGvt~l~FSrD~SqiLS~sfD~tvRiHGlKSGK~LKEfrGHsS-yvn~a~ft~  358 (508)
T KOG0275|consen  282 SQDGKIKVWRIETGQCLR--RFDRAHTKGVTCLSFSRDNSQILSASFDQTVRIHGLKSGKCLKEFRGHSS-YVNEATFTD  358 (508)
T ss_pred             CcCCcEEEEEEecchHHH--HhhhhhccCeeEEEEccCcchhhcccccceEEEeccccchhHHHhcCccc-cccceEEcC
Confidence            57788 779987622221  12211          11233466666666776667788876533221111 0111111 2


Q ss_pred             eCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCcccc-CCccccccccccccCCeEEEE-ECCEEEEEECCCCcEEEEEe
Q 003800           94 LGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHS-KPLLLVPTNLKVDKDSLILVS-SKGCLHAVSSIDGEILWTRD  171 (794)
Q Consensus        94 ~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s-~~~~~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG~~~W~~~  171 (794)
                      .|..++..|++ ++|+.|+..++.-+=.+.-.+...+ ......|     -.....+|- ..++++-++ -.|+++-++.
T Consensus       359 dG~~iisaSsD-gtvkvW~~KtteC~~Tfk~~~~d~~vnsv~~~P-----Knpeh~iVCNrsntv~imn-~qGQvVrsfs  431 (508)
T KOG0275|consen  359 DGHHIISASSD-GTVKVWHGKTTECLSTFKPLGTDYPVNSVILLP-----KNPEHFIVCNRSNTVYIMN-MQGQVVRSFS  431 (508)
T ss_pred             CCCeEEEecCC-ccEEEecCcchhhhhhccCCCCcccceeEEEcC-----CCCceEEEEcCCCeEEEEe-ccceEEeeec
Confidence            45556665654 6999999999987766654443221 1111222     112233333 467788777 4577777765


Q ss_pred             ccCcc-eeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeeeee
Q 003800          172 FAAES-VEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHETA  220 (794)
Q Consensus       172 ~~~~~-~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v  220 (794)
                      ...-. -....+..+..+.-+|.++-++    .+||+...+|..-....+
T Consensus       432 SGkREgGdFi~~~lSpkGewiYcigED~----vlYCF~~~sG~LE~tl~V  477 (508)
T KOG0275|consen  432 SGKREGGDFINAILSPKGEWIYCIGEDG----VLYCFSVLSGKLERTLPV  477 (508)
T ss_pred             cCCccCCceEEEEecCCCcEEEEEccCc----EEEEEEeecCceeeeeec
Confidence            54211 0122233356677888887776    899999999987665543


No 130
>TIGR02800 propeller_TolB tol-pal system beta propeller repeat protein TolB. The Tol-PAL system is required for bacterial outer membrane integrity. E. coli TolB is involved in the tonB-independent uptake of group A colicins (colicins A, E1, E2, E3 and K), and is necessary for the colicins to reach their respective targets after initial binding to the bacteria. It is also involved in uptake of filamentous DNA. Study of its structure suggest that the TolB protein might be involved in the recycling of peptidoglycan or in its covalent linking with lipoproteins. The Tol-Pal system is also implicated in pathogenesis of E. coli, Haemophilus ducreyi, Salmonella enterica and Vibrio cholerae, but the mechanism(s) is unclear.
Probab=73.50  E-value=1.6e+02  Score=33.00  Aligned_cols=149  Identities=14%  Similarity=0.131  Sum_probs=71.2

Q ss_pred             eCCEEEEEEcc--CCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEE-E---CCEEEEEECCCCcEE
Q 003800           94 LGKYVITLSSD--GSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVS-S---KGCLHAVSSIDGEIL  167 (794)
Q Consensus        94 ~g~~~V~Vs~~--g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~-~---~g~l~ald~~tG~~~  167 (794)
                      .|+.+++++..  ...++.||..+|+..-.....+...  .+.+.     .+ ++.+++. .   ...++.+|..+|...
T Consensus       200 dg~~la~~~~~~~~~~i~v~d~~~g~~~~~~~~~~~~~--~~~~s-----pD-g~~l~~~~~~~~~~~i~~~d~~~~~~~  271 (417)
T TIGR02800       200 DGQKLAYVSFESGKPEIYVQDLATGQREKVASFPGMNG--APAFS-----PD-GSKLAVSLSKDGNPDIYVMDLDGKQLT  271 (417)
T ss_pred             CCCEEEEEEcCCCCcEEEEEECCCCCEEEeecCCCCcc--ceEEC-----CC-CCEEEEEECCCCCccEEEEECCCCCEE
Confidence            45556665432  2579999999997654433332211  11111     22 2345443 2   346899998887542


Q ss_pred             EEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeeeeeecccCccCceEE-EcCcEEEEEECCC-
Q 003800          168 WTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHETAAFSGGFVGDVAL-VSSDTLVTLDTTR-  245 (794)
Q Consensus       168 W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~~~~s~~~~~-vg~~~lv~~d~~~-  245 (794)
                      =-........   ....+.++..+++.+..++ ...++.+|+.+|+..   ++.....-...+.+ ..+..+++.+... 
T Consensus       272 ~l~~~~~~~~---~~~~s~dg~~l~~~s~~~g-~~~iy~~d~~~~~~~---~l~~~~~~~~~~~~spdg~~i~~~~~~~~  344 (417)
T TIGR02800       272 RLTNGPGIDT---EPSWSPDGKSIAFTSDRGG-SPQIYMMDADGGEVR---RLTFRGGYNASPSWSPDGDLIAFVHREGG  344 (417)
T ss_pred             ECCCCCCCCC---CEEECCCCCEEEEEECCCC-CceEEEEECCCCCEE---EeecCCCCccCeEECCCCCEEEEEEccCC
Confidence            1111111111   1111234455655444332 236888898888743   11111111111222 2334555554322 


Q ss_pred             -CeEEEEEeecce
Q 003800          246 -SILVTVSFKNRK  257 (794)
Q Consensus       246 -g~L~v~~l~sg~  257 (794)
                       ..++..++.++.
T Consensus       345 ~~~i~~~d~~~~~  357 (417)
T TIGR02800       345 GFNIAVMDLDGGG  357 (417)
T ss_pred             ceEEEEEeCCCCC
Confidence             267777877765


No 131
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=72.85  E-value=1.6e+02  Score=32.65  Aligned_cols=69  Identities=14%  Similarity=0.253  Sum_probs=37.4

Q ss_pred             cCCEEEEEEecCC-ceeEEEEEEcCCCceeeeeeeecccCccCceEEE---cCcEEEEEECCCCeEEEEEeec-ce
Q 003800          187 ESDQIYVVGYAGS-SQFHAYQINAMNGELLNHETAAFSGGFVGDVALV---SSDTLVTLDTTRSILVTVSFKN-RK  257 (794)
Q Consensus       187 ~~~~vyv~~~~g~-~~~~v~ald~~tG~~~w~~~v~~~~~~s~~~~~v---g~~~lv~~d~~~g~L~v~~l~s-g~  257 (794)
                      ....+|++...|. -.+..+.+|..+|+.-.-.+...+.  +..|.+.   .+.++++++...|.+.+.-+.. |.
T Consensus        50 ~~~~LY~v~~~~~~ggvaay~iD~~~G~Lt~ln~~~~~g--~~p~yvsvd~~g~~vf~AnY~~g~v~v~p~~~dG~  123 (346)
T COG2706          50 DQRHLYVVNEPGEEGGVAAYRIDPDDGRLTFLNRQTLPG--SPPCYVSVDEDGRFVFVANYHSGSVSVYPLQADGS  123 (346)
T ss_pred             CCCEEEEEEecCCcCcEEEEEEcCCCCeEEEeeccccCC--CCCeEEEECCCCCEEEEEEccCceEEEEEcccCCc
Confidence            4446888776642 2356677888889876444322221  1124332   2235566665556666666644 44


No 132
>KOG0643 consensus Translation initiation factor 3, subunit i (eIF-3i)/TGF-beta receptor-interacting protein (TRIP-1) [Translation, ribosomal structure and biogenesis; Signal transduction mechanisms]
Probab=72.81  E-value=1.4e+02  Score=31.99  Aligned_cols=103  Identities=14%  Similarity=0.075  Sum_probs=55.0

Q ss_pred             EccCCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEEE------CCEEEEEECCCC-------cEEE
Q 003800          102 SSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSS------KGCLHAVSSIDG-------EILW  168 (794)
Q Consensus       102 s~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~------~g~l~ald~~tG-------~~~W  168 (794)
                      ++.+...+.||.++|+.+-.+....++-  ...+..     + ++.+++..      .+.|..+|..+-       ++.-
T Consensus        70 GSAD~t~kLWDv~tGk~la~~k~~~~Vk--~~~F~~-----~-gn~~l~~tD~~mg~~~~v~~fdi~~~~~~~~s~ep~~  141 (327)
T KOG0643|consen   70 GSADQTAKLWDVETGKQLATWKTNSPVK--RVDFSF-----G-GNLILASTDKQMGYTCFVSVFDIRDDSSDIDSEEPYL  141 (327)
T ss_pred             ccccceeEEEEcCCCcEEEEeecCCeeE--EEeecc-----C-CcEEEEEehhhcCcceEEEEEEccCChhhhcccCceE
Confidence            4446789999999999999988776542  222221     1 23333322      455666665421       2222


Q ss_pred             EEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeee
Q 003800          169 TRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHE  218 (794)
Q Consensus       169 ~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~  218 (794)
                      ....+..  .+...+....+..++...-+    ..+..+|+.+|+.+-+.
T Consensus       142 kI~t~~s--kit~a~Wg~l~~~ii~Ghe~----G~is~~da~~g~~~v~s  185 (327)
T KOG0643|consen  142 KIPTPDS--KITSALWGPLGETIIAGHED----GSISIYDARTGKELVDS  185 (327)
T ss_pred             EecCCcc--ceeeeeecccCCEEEEecCC----CcEEEEEcccCceeeec
Confidence            2222221  11222222234444432223    38999999999776554


No 133
>KOG2106 consensus Uncharacterized conserved protein, contains HELP and WD40 domains [Function unknown]
Probab=71.17  E-value=1.7e+02  Score=33.89  Aligned_cols=147  Identities=18%  Similarity=0.292  Sum_probs=78.7

Q ss_pred             CCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCC
Q 003800           53 RKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKP  132 (794)
Q Consensus        53 ~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~  132 (794)
                      .+-+|++|..|.|--=+..+|=..=-  +...+..-++.....+..+.-++.++.|+.||  +-++.|...+..+..  .
T Consensus       339 ~~di~vGTtrN~iL~Gt~~~~f~~~v--~gh~delwgla~hps~~q~~T~gqdk~v~lW~--~~k~~wt~~~~d~~~--~  412 (626)
T KOG2106|consen  339 KGDILVGTTRNFILQGTLENGFTLTV--QGHGDELWGLATHPSKNQLLTCGQDKHVRLWN--DHKLEWTKIIEDPAE--C  412 (626)
T ss_pred             CCcEEEeeccceEEEeeecCCceEEE--EecccceeeEEcCCChhheeeccCcceEEEcc--CCceeEEEEecCcee--E
Confidence            33399999888765544444421111  11111222342233444444366678999999  889999999877643  1


Q ss_pred             ccccccccccccCCeEEEE-ECCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCC
Q 003800          133 LLLVPTNLKVDKDSLILVS-SKGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMN  211 (794)
Q Consensus       133 ~~~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~t  211 (794)
                      +.+       +..+.+.+. ..|+...+|.++-. +=+......   +..++.-..+|..++++... .++.++-+|. +
T Consensus       413 ~~f-------hpsg~va~Gt~~G~w~V~d~e~~~-lv~~~~d~~---~ls~v~ysp~G~~lAvgs~d-~~iyiy~Vs~-~  479 (626)
T KOG2106|consen  413 ADF-------HPSGVVAVGTATGRWFVLDTETQD-LVTIHTDNE---QLSVVRYSPDGAFLAVGSHD-NHIYIYRVSA-N  479 (626)
T ss_pred             eec-------cCcceEEEeeccceEEEEecccce-eEEEEecCC---ceEEEEEcCCCCEEEEecCC-CeEEEEEECC-C
Confidence            112       224545555 48999999998843 333333332   22333212344444444432 2456666663 5


Q ss_pred             Cceeeee
Q 003800          212 GELLNHE  218 (794)
Q Consensus       212 G~~~w~~  218 (794)
                      |+.....
T Consensus       480 g~~y~r~  486 (626)
T KOG2106|consen  480 GRKYSRV  486 (626)
T ss_pred             CcEEEEe
Confidence            5554433


No 134
>PRK05137 tolB translocation protein TolB; Provisional
Probab=69.06  E-value=2.1e+02  Score=32.61  Aligned_cols=137  Identities=14%  Similarity=0.060  Sum_probs=65.0

Q ss_pred             EEEEEECcCCccceEEEcCcccceeeeee-eeCCEEEEEEc--cCCeEEEEeCCCCcEeEEEeccCccccCCcccccccc
Q 003800           64 VIASLDLRHGEIFWRHVLGINDVVDGIDI-ALGKYVITLSS--DGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNL  140 (794)
Q Consensus        64 ~l~ALn~~tG~ivWR~~l~~~~~i~~l~~-~~g~~~V~Vs~--~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~  140 (794)
                      .|...|...+..  ++.......+..... ..|+.+++++.  ....|+.||..+|+..=-....+..  ..+.+     
T Consensus       183 ~l~~~d~dg~~~--~~lt~~~~~v~~p~wSpDG~~lay~s~~~g~~~i~~~dl~~g~~~~l~~~~g~~--~~~~~-----  253 (435)
T PRK05137        183 RLAIMDQDGANV--RYLTDGSSLVLTPRFSPNRQEITYMSYANGRPRVYLLDLETGQRELVGNFPGMT--FAPRF-----  253 (435)
T ss_pred             EEEEECCCCCCc--EEEecCCCCeEeeEECCCCCEEEEEEecCCCCEEEEEECCCCcEEEeecCCCcc--cCcEE-----
Confidence            677777754433  222222212222211 25666777763  2368999999999753111111111  11111     


Q ss_pred             ccccCCeEEE-EE---CCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCce
Q 003800          141 KVDKDSLILV-SS---KGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGEL  214 (794)
Q Consensus       141 ~~~~~~~V~V-~~---~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~  214 (794)
                      ..+ ++.+++ ..   ...++.+|..+|+..=-...+...   .....+.++..+++.+..++ ...++.+|+.+|+.
T Consensus       254 SPD-G~~la~~~~~~g~~~Iy~~d~~~~~~~~Lt~~~~~~---~~~~~spDG~~i~f~s~~~g-~~~Iy~~d~~g~~~  326 (435)
T PRK05137        254 SPD-GRKVVMSLSQGGNTDIYTMDLRSGTTTRLTDSPAID---TSPSYSPDGSQIVFESDRSG-SPQLYVMNADGSNP  326 (435)
T ss_pred             CCC-CCEEEEEEecCCCceEEEEECCCCceEEccCCCCcc---CceeEcCCCCEEEEEECCCC-CCeEEEEECCCCCe
Confidence            123 334443 33   346999999888653211111111   11111334555554443221 23678888877765


No 135
>PF14870 PSII_BNR:  Photosynthesis system II assembly factor YCF48; PDB: 2XBG_A.
Probab=68.87  E-value=1.6e+02  Score=32.28  Aligned_cols=170  Identities=18%  Similarity=0.280  Sum_probs=78.4

Q ss_pred             cccceeecccccEeeEEeccCceeeeeeeeeccCCCEEEEEeCCCEEE-EEECcCCccceEEEcCc-ccceeeeeeeeCC
Q 003800           19 PSLSLYEDQVGLMDWHQQYIGKVKHAVFHTQKTGRKRVVVSTEENVIA-SLDLRHGEIFWRHVLGI-NDVVDGIDIALGK   96 (794)
Q Consensus        19 ~~~Al~edqvG~~dW~~~~vG~~~~~~f~~~~~~~~~Vyv~t~~g~l~-ALn~~tG~ivWR~~l~~-~~~i~~l~~~~g~   96 (794)
                      ...++|....|-..|+...-+... ....-....++++++.+..|.++ ..|  .|+-.|+..-.. ...+..+....++
T Consensus       122 ~~G~iy~T~DgG~tW~~~~~~~~g-s~~~~~r~~dG~~vavs~~G~~~~s~~--~G~~~w~~~~r~~~~riq~~gf~~~~  198 (302)
T PF14870_consen  122 DRGAIYRTTDGGKTWQAVVSETSG-SINDITRSSDGRYVAVSSRGNFYSSWD--PGQTTWQPHNRNSSRRIQSMGFSPDG  198 (302)
T ss_dssp             TT--EEEESSTTSSEEEEE-S-----EEEEEE-TTS-EEEEETTSSEEEEE---TT-SS-EEEE--SSS-EEEEEE-TTS
T ss_pred             CCCcEEEeCCCCCCeeEcccCCcc-eeEeEEECCCCcEEEEECcccEEEEec--CCCccceEEccCccceehhceecCCC
Confidence            446889888888899986643332 22221222466766666666554 555  688999975432 3334433222333


Q ss_pred             EEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEEE-CCEEEEEECCCCcEEEEEeccCc
Q 003800           97 YVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSS-KGCLHAVSSIDGEILWTRDFAAE  175 (794)
Q Consensus        97 ~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~-~g~l~ald~~tG~~~W~~~~~~~  175 (794)
                      .+..++ .|+.++-=+..+...-|+........ ..-.++.  .+...++.+++.. +|.|+.  ..+|-.-|+......
T Consensus       199 ~lw~~~-~Gg~~~~s~~~~~~~~w~~~~~~~~~-~~~~~ld--~a~~~~~~~wa~gg~G~l~~--S~DgGktW~~~~~~~  272 (302)
T PF14870_consen  199 NLWMLA-RGGQIQFSDDPDDGETWSEPIIPIKT-NGYGILD--LAYRPPNEIWAVGGSGTLLV--STDGGKTWQKDRVGE  272 (302)
T ss_dssp             -EEEEE-TTTEEEEEE-TTEEEEE---B-TTSS---S-EEE--EEESSSS-EEEEESTT-EEE--ESSTTSS-EE-GGGT
T ss_pred             CEEEEe-CCcEEEEccCCCCccccccccCCccc-CceeeEE--EEecCCCCEEEEeCCccEEE--eCCCCccceECcccc
Confidence            444434 57788888867788889886544311 1111111  1222246677764 554432  356667899876533


Q ss_pred             ce--eeeeEEEEecCCEEEEEEecC
Q 003800          176 SV--EVQQVIQLDESDQIYVVGYAG  198 (794)
Q Consensus       176 ~~--~~~~~v~s~~~~~vyv~~~~g  198 (794)
                      ..  .++.++. ..+++-|+++..|
T Consensus       273 ~~~~n~~~i~f-~~~~~gf~lG~~G  296 (302)
T PF14870_consen  273 NVPSNLYRIVF-VNPDKGFVLGQDG  296 (302)
T ss_dssp             TSSS---EEEE-EETTEEEEE-STT
T ss_pred             CCCCceEEEEE-cCCCceEEECCCc
Confidence            22  2445542 4667888887666


No 136
>KOG4547 consensus WD40 repeat-containing protein [General function prediction only]
Probab=68.85  E-value=2.4e+02  Score=33.12  Aligned_cols=114  Identities=14%  Similarity=0.017  Sum_probs=78.7

Q ss_pred             CCEEEEEeCCCEEEEEECcCCccceEEEcCccc-ceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccC
Q 003800           53 RKRVVVSTEENVIASLDLRHGEIFWRHVLGIND-VVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSK  131 (794)
Q Consensus        53 ~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~-~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~  131 (794)
                      ...+..++..|.+...+..-|++-|+...+... .+....-...-+.++-++.+.++--|+..+++..-.+....+.. .
T Consensus        70 t~~lvlgt~~g~v~~ys~~~g~it~~~st~~h~~~v~~~~~~~~~~ciyS~~ad~~v~~~~~~~~~~~~~~~~~~~~~-~  148 (541)
T KOG4547|consen   70 TSMLVLGTPQGSVLLYSVAGGEITAKLSTDKHYGNVNEILDAQRLGCIYSVGADLKVVYILEKEKVIIRIWKEQKPLV-S  148 (541)
T ss_pred             ceEEEeecCCccEEEEEecCCeEEEEEecCCCCCcceeeecccccCceEecCCceeEEEEecccceeeeeeccCCCcc-c
Confidence            344788899999999999999999998754432 22222111222345434445799999999999988877766544 2


Q ss_pred             CccccccccccccCCeEEEEECCEEEEEECCCCcEEEEEeccC
Q 003800          132 PLLLVPTNLKVDKDSLILVSSKGCLHAVSSIDGEILWTRDFAA  174 (794)
Q Consensus       132 ~~~~~~~~~~~~~~~~V~V~~~g~l~ald~~tG~~~W~~~~~~  174 (794)
                      +..+.+       ++.+.+...+.+-.+|..+++++=++..-.
T Consensus       149 sl~is~-------D~~~l~~as~~ik~~~~~~kevv~~ftgh~  184 (541)
T KOG4547|consen  149 SLCISP-------DGKILLTASRQIKVLDIETKEVVITFTGHG  184 (541)
T ss_pred             eEEEcC-------CCCEEEeccceEEEEEccCceEEEEecCCC
Confidence            333332       455666678899999999999998886543


No 137
>PRK02888 nitrous-oxide reductase; Validated
Probab=68.65  E-value=1.3e+02  Score=36.13  Aligned_cols=150  Identities=12%  Similarity=0.127  Sum_probs=81.9

Q ss_pred             eeeeccCCC-EEEEEeC-CCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEc----cCCeEEEEeCCCCcEe
Q 003800           46 FHTQKTGRK-RVVVSTE-ENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSS----DGSTLRAWNLPDGQMV  119 (794)
Q Consensus        46 f~~~~~~~~-~Vyv~t~-~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~----~g~~v~A~d~~tG~ll  119 (794)
                      |.-|-..++ .++..++ .|.+.++|+++-++.|+..++..  .+.....-++..++++.    .+..+...++.+-.  
T Consensus       196 ~~~PlpnDGk~l~~~~ey~~~vSvID~etmeV~~qV~Vdgn--pd~v~~spdGk~afvTsyNsE~G~tl~em~a~e~d--  271 (635)
T PRK02888        196 FRIPLPNDGKDLDDPKKYRSLFTAVDAETMEVAWQVMVDGN--LDNVDTDYDGKYAFSTCYNSEEGVTLAEMMAAERD--  271 (635)
T ss_pred             cccccCCCCCEeecccceeEEEEEEECccceEEEEEEeCCC--cccceECCCCCEEEEeccCcccCcceeeeccccCc--
Confidence            333433244 4555544 58999999999999999988764  22222233556676664    24556666654433  


Q ss_pred             EEEeccCccccCCccccccccccccCCeEEEEECCEEEEEECCC----C-cEEEEEeccCcceeeeeEEEEecCCEEEEE
Q 003800          120 WESFLRGSKHSKPLLLVPTNLKVDKDSLILVSSKGCLHAVSSID----G-EILWTRDFAAESVEVQQVIQLDESDQIYVV  194 (794)
Q Consensus       120 We~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~~g~l~ald~~t----G-~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~  194 (794)
                      |-..+.-...   ...+     .+ ++..++ .+++|..+|..+    | +++=....+.  . |-.+-.+.++..+|+.
T Consensus       272 ~~vvfni~~i---ea~v-----kd-GK~~~V-~gn~V~VID~~t~~~~~~~v~~yIPVGK--s-PHGV~vSPDGkylyVa  338 (635)
T PRK02888        272 WVVVFNIARI---EEAV-----KA-GKFKTI-GGSKVPVVDGRKAANAGSALTRYVPVPK--N-PHGVNTSPDGKYFIAN  338 (635)
T ss_pred             eEEEEchHHH---HHhh-----hC-CCEEEE-CCCEEEEEECCccccCCcceEEEEECCC--C-ccceEECCCCCEEEEe
Confidence            4433332211   0111     11 333443 577899999988    4 3333333332  2 3344323445556654


Q ss_pred             EecCCceeEEEEEEcCCCcee
Q 003800          195 GYAGSSQFHAYQINAMNGELL  215 (794)
Q Consensus       195 ~~~g~~~~~v~ald~~tG~~~  215 (794)
                      +--..   .+..+|.++-+..
T Consensus       339 nklS~---tVSVIDv~k~k~~  356 (635)
T PRK02888        339 GKLSP---TVTVIDVRKLDDL  356 (635)
T ss_pred             CCCCC---cEEEEEChhhhhh
Confidence            43222   6888888876654


No 138
>KOG0646 consensus WD40 repeat protein [General function prediction only]
Probab=68.47  E-value=2.2e+02  Score=32.58  Aligned_cols=60  Identities=15%  Similarity=0.101  Sum_probs=33.8

Q ss_pred             CCEEEEEEecCCceeEEEEEEcCCCceeeeeeeecccCccCceEE-EcCcEEEEEECCCCeEEEEEeec
Q 003800          188 SDQIYVVGYAGSSQFHAYQINAMNGELLNHETAAFSGGFVGDVAL-VSSDTLVTLDTTRSILVTVSFKN  255 (794)
Q Consensus       188 ~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~~~~s~~~~~-vg~~~lv~~d~~~g~L~v~~l~s  255 (794)
                      ..++|-++.+.    .+-+.|...|.++-...  .|..+.. +.+ .+...++|... .|.++..++..
T Consensus       188 ~~rl~TaS~D~----t~k~wdlS~g~LLlti~--fp~si~a-v~lDpae~~~yiGt~-~G~I~~~~~~~  248 (476)
T KOG0646|consen  188 NARLYTASEDR----TIKLWDLSLGVLLLTIT--FPSSIKA-VALDPAERVVYIGTE-EGKIFQNLLFK  248 (476)
T ss_pred             cceEEEecCCc----eEEEEEeccceeeEEEe--cCCccee-EEEcccccEEEecCC-cceEEeeehhc
Confidence            45667665553    56667888898887663  4443322 211 13334445543 47777777654


No 139
>KOG0282 consensus mRNA splicing factor [Function unknown]
Probab=67.25  E-value=45  Score=38.08  Aligned_cols=141  Identities=11%  Similarity=0.141  Sum_probs=84.5

Q ss_pred             EEEEEE-ccCCeEEEEeCCC-CcEeEEEeccCccccCCccccccccccccCCeEEE--EECCEEEEEECCCCcEEEEEec
Q 003800           97 YVITLS-SDGSTLRAWNLPD-GQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILV--SSKGCLHAVSSIDGEILWTRDF  172 (794)
Q Consensus        97 ~~V~Vs-~~g~~v~A~d~~t-G~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V--~~~g~l~ald~~tG~~~W~~~~  172 (794)
                      +.+++| +.++.|..||.-+ |+.+-.+....... .++..-       ..+.=|.  ..|+.+.--|.+||+++=++..
T Consensus       227 ~hLlLS~gmD~~vklW~vy~~~~~lrtf~gH~k~V-rd~~~s-------~~g~~fLS~sfD~~lKlwDtETG~~~~~f~~  298 (503)
T KOG0282|consen  227 GHLLLSGGMDGLVKLWNVYDDRRCLRTFKGHRKPV-RDASFN-------NCGTSFLSASFDRFLKLWDTETGQVLSRFHL  298 (503)
T ss_pred             eeEEEecCCCceEEEEEEecCcceehhhhcchhhh-hhhhcc-------ccCCeeeeeecceeeeeeccccceEEEEEec
Confidence            345554 5578999999987 88887777766544 222221       1333333  2499999999999999998877


Q ss_pred             cCcceeeeeEEE-EecCCEEEEEEecCCceeEEEEEEcCCCceeeeeeeecccCccCceEEE-cCcEEEEEECCCCeEEE
Q 003800          173 AAESVEVQQVIQ-LDESDQIYVVGYAGSSQFHAYQINAMNGELLNHETAAFSGGFVGDVALV-SSDTLVTLDTTRSILVT  250 (794)
Q Consensus       173 ~~~~~~~~~~v~-s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~~~~s~~~~~v-g~~~lv~~d~~~g~L~v  250 (794)
                      .....    ++- -.++..+|++|...+   ++...|..+|+++-++.-.+....  +..|+ ++..++.. ++.+++.+
T Consensus       299 ~~~~~----cvkf~pd~~n~fl~G~sd~---ki~~wDiRs~kvvqeYd~hLg~i~--~i~F~~~g~rFiss-SDdks~ri  368 (503)
T KOG0282|consen  299 DKVPT----CVKFHPDNQNIFLVGGSDK---KIRQWDIRSGKVVQEYDRHLGAIL--DITFVDEGRRFISS-SDDKSVRI  368 (503)
T ss_pred             CCCce----eeecCCCCCcEEEEecCCC---cEEEEeccchHHHHHHHhhhhhee--eeEEccCCceEeee-ccCccEEE
Confidence            65321    221 123446666554433   899999999999887742222211  33344 33344333 23455555


Q ss_pred             EEeec
Q 003800          251 VSFKN  255 (794)
Q Consensus       251 ~~l~s  255 (794)
                      -+...
T Consensus       369 We~~~  373 (503)
T KOG0282|consen  369 WENRI  373 (503)
T ss_pred             EEcCC
Confidence            44443


No 140
>KOG0295 consensus WD40 repeat-containing protein [Function unknown]
Probab=67.19  E-value=2.1e+02  Score=31.85  Aligned_cols=52  Identities=15%  Similarity=0.019  Sum_probs=34.2

Q ss_pred             EEEEEEcCCCceeeeeeeecccCccCceEEEcCc-EEEEEECCCCeEEEEEeecce
Q 003800          203 HAYQINAMNGELLNHETAAFSGGFVGDVALVSSD-TLVTLDTTRSILVTVSFKNRK  257 (794)
Q Consensus       203 ~v~ald~~tG~~~w~~~v~~~~~~s~~~~~vg~~-~lv~~d~~~g~L~v~~l~sg~  257 (794)
                      .+-..|..||..+-+.. +-...+.+..+-.|+. ++-|.|  +++|++-|+++++
T Consensus       315 tIk~wdv~tg~cL~tL~-ghdnwVr~~af~p~Gkyi~ScaD--Dktlrvwdl~~~~  367 (406)
T KOG0295|consen  315 TIKIWDVSTGMCLFTLV-GHDNWVRGVAFSPGGKYILSCAD--DKTLRVWDLKNLQ  367 (406)
T ss_pred             eEEEEeccCCeEEEEEe-cccceeeeeEEcCCCeEEEEEec--CCcEEEEEeccce
Confidence            67888999998876652 2233444433323444 444665  7899999999987


No 141
>PLN00033 photosystem II stability/assembly factor; Provisional
Probab=67.05  E-value=2.3e+02  Score=32.27  Aligned_cols=129  Identities=14%  Similarity=0.160  Sum_probs=64.4

Q ss_pred             ccEeeEEeccCceeee--eeeeecc---CCCEEEEEeCCCEEEEEECcCCccceEEEcCcc----c---ceeeeeeeeCC
Q 003800           29 GLMDWHQQYIGKVKHA--VFHTQKT---GRKRVVVSTEENVIASLDLRHGEIFWRHVLGIN----D---VVDGIDIALGK   96 (794)
Q Consensus        29 G~~dW~~~~vG~~~~~--~f~~~~~---~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~----~---~i~~l~~~~g~   96 (794)
                      +-.-|++..  .|..+  .+.....   +.++-++....|.|.  -.+||-.-|++.....    +   ....+.. .++
T Consensus        73 ~G~~W~q~~--~p~~~~~~L~~V~F~~~d~~~GwAVG~~G~IL--~T~DGG~tW~~~~~~~~~~~~~~~~l~~v~f-~~~  147 (398)
T PLN00033         73 QSSEWEQVD--LPIDPGVVLLDIAFVPDDPTHGFLLGTRQTLL--ETKDGGKTWVPRSIPSAEDEDFNYRFNSISF-KGK  147 (398)
T ss_pred             CCCccEEee--cCCCCCCceEEEEeccCCCCEEEEEcCCCEEE--EEcCCCCCceECccCcccccccccceeeeEE-ECC
Confidence            334599876  34322  2222222   355677777788764  4458999999854211    1   1122212 344


Q ss_pred             EEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEE-ECCEEEEEECCCCcEEEEEe
Q 003800           97 YVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVS-SKGCLHAVSSIDGEILWTRD  171 (794)
Q Consensus        97 ~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG~~~W~~~  171 (794)
                      ..++++..   -+.+-..||-.-|+..............+    ....++..++. ..|.+++-  .+|-..|+..
T Consensus       148 ~g~~vG~~---G~il~T~DgG~tW~~~~~~~~~p~~~~~i----~~~~~~~~~ivg~~G~v~~S--~D~G~tW~~~  214 (398)
T PLN00033        148 EGWIIGKP---AILLHTSDGGETWERIPLSPKLPGEPVLI----KATGPKSAEMVTDEGAIYVT--SNAGRNWKAA  214 (398)
T ss_pred             EEEEEcCc---eEEEEEcCCCCCceECccccCCCCCceEE----EEECCCceEEEeccceEEEE--CCCCCCceEc
Confidence            44443332   36666779999998754321110111111    11113333333 45654444  4666788864


No 142
>PRK13684 Ycf48-like protein; Provisional
Probab=67.01  E-value=2.1e+02  Score=31.67  Aligned_cols=179  Identities=15%  Similarity=0.219  Sum_probs=88.2

Q ss_pred             cceeecccccEeeEEeccCc--eeeeeeeeeccCCCEEEEEeCCCEEEEEECcCCccceEEEcCcc-cceeeeeeeeCCE
Q 003800           21 LSLYEDQVGLMDWHQQYIGK--VKHAVFHTQKTGRKRVVVSTEENVIASLDLRHGEIFWRHVLGIN-DVVDGIDIALGKY   97 (794)
Q Consensus        21 ~Al~edqvG~~dW~~~~vG~--~~~~~f~~~~~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~-~~i~~l~~~~g~~   97 (794)
                      ..+|..+.|-..|+....+.  +.+. +.-.....+.++++++.|.|+.-  .||-.-|+...... ..+..+....++.
T Consensus       109 g~i~~S~DgG~tW~~~~~~~~~~~~~-~~i~~~~~~~~~~~g~~G~i~~S--~DgG~tW~~~~~~~~g~~~~i~~~~~g~  185 (334)
T PRK13684        109 SLLLHTTDGGKNWTRIPLSEKLPGSP-YLITALGPGTAEMATNVGAIYRT--TDGGKNWEALVEDAAGVVRNLRRSPDGK  185 (334)
T ss_pred             ceEEEECCCCCCCeEccCCcCCCCCc-eEEEEECCCcceeeeccceEEEE--CCCCCCceeCcCCCcceEEEEEECCCCe
Confidence            45788777777898765431  1111 11111234557777877766554  47888899755432 2233332223334


Q ss_pred             EEEEEccCCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEE-ECCEEEEEECCCCcEEEEE-eccCc
Q 003800           98 VITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVS-SKGCLHAVSSIDGEILWTR-DFAAE  175 (794)
Q Consensus        98 ~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG~~~W~~-~~~~~  175 (794)
                      .++++..| .++.- ..+|..-|+..-....  ..+..+    ....++.+++. .+|.+. +...+|-..|+. ..+..
T Consensus       186 ~v~~g~~G-~i~~s-~~~gg~tW~~~~~~~~--~~l~~i----~~~~~g~~~~vg~~G~~~-~~s~d~G~sW~~~~~~~~  256 (334)
T PRK13684        186 YVAVSSRG-NFYST-WEPGQTAWTPHQRNSS--RRLQSM----GFQPDGNLWMLARGGQIR-FNDPDDLESWSKPIIPEI  256 (334)
T ss_pred             EEEEeCCc-eEEEE-cCCCCCeEEEeeCCCc--ccceee----eEcCCCCEEEEecCCEEE-EccCCCCCccccccCCcc
Confidence            44445544 44432 2467788986533221  111111    11113444444 466543 434566678885 22211


Q ss_pred             --ceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeee
Q 003800          176 --SVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHE  218 (794)
Q Consensus       176 --~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~  218 (794)
                        ......+. ...++.+|+++..|    .++ .. .+|-.-|+.
T Consensus       257 ~~~~~l~~v~-~~~~~~~~~~G~~G----~v~-~S-~d~G~tW~~  294 (334)
T PRK13684        257 TNGYGYLDLA-YRTPGEIWAGGGNG----TLL-VS-KDGGKTWEK  294 (334)
T ss_pred             ccccceeeEE-EcCCCCEEEEcCCC----eEE-Ee-CCCCCCCeE
Confidence              11122222 13466788877666    222 22 355556776


No 143
>KOG0318 consensus WD40 repeat stress protein/actin interacting protein [Cytoskeleton]
Probab=66.38  E-value=2.6e+02  Score=32.63  Aligned_cols=182  Identities=10%  Similarity=0.141  Sum_probs=96.9

Q ss_pred             CCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCC
Q 003800           53 RKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKP  132 (794)
Q Consensus        53 ~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~  132 (794)
                      .++|++.+-.|.|--||+.++++ =++.-+...+|..+.+..++..++-++.+|.+..||..+|.--   ++.+...  +
T Consensus       290 kd~lItVSl~G~in~ln~~d~~~-~~~i~GHnK~ITaLtv~~d~~~i~SgsyDG~I~~W~~~~g~~~---~~~g~~h--~  363 (603)
T KOG0318|consen  290 KDHLITVSLSGTINYLNPSDPSV-LKVISGHNKSITALTVSPDGKTIYSGSYDGHINSWDSGSGTSD---RLAGKGH--T  363 (603)
T ss_pred             CCeEEEEEcCcEEEEecccCCCh-hheecccccceeEEEEcCCCCEEEeeccCceEEEEecCCcccc---ccccccc--c
Confidence            67899999999999999999994 3433333345665644444455664556789999999988643   2222111  1


Q ss_pred             ccccccccccccCCeEEEE-ECCEEEEEECCCCcEEEE--EeccCcceeeeeEEEEecC-CEEEEEEecCCceeEEEEEE
Q 003800          133 LLLVPTNLKVDKDSLILVS-SKGCLHAVSSIDGEILWT--RDFAAESVEVQQVIQLDES-DQIYVVGYAGSSQFHAYQIN  208 (794)
Q Consensus       133 ~~~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG~~~W~--~~~~~~~~~~~~~v~s~~~-~~vyv~~~~g~~~~~v~ald  208 (794)
                      ..+..+  +....+.++-. .|.+|..++...+..-=.  .+.+..   |..+- ...+ +.+.+++..     .++-|.
T Consensus       364 nqI~~~--~~~~~~~~~t~g~Dd~l~~~~~~~~~~t~~~~~~lg~Q---P~~la-v~~d~~~avv~~~~-----~iv~l~  432 (603)
T KOG0318|consen  364 NQIKGM--AASESGELFTIGWDDTLRVISLKDNGYTKSEVVKLGSQ---PKGLA-VLSDGGTAVVACIS-----DIVLLQ  432 (603)
T ss_pred             ceEEEE--eecCCCcEEEEecCCeEEEEecccCcccccceeecCCC---ceeEE-EcCCCCEEEEEecC-----cEEEEe
Confidence            122221  22223455554 588999998764322111  222221   22221 1233 344444433     244444


Q ss_pred             cCCCceeeeeeeecccCccCceEEE-cCcEEEEEECCCCeEEEEEeecce
Q 003800          209 AMNGELLNHETAAFSGGFVGDVALV-SSDTLVTLDTTRSILVTVSFKNRK  257 (794)
Q Consensus       209 ~~tG~~~w~~~v~~~~~~s~~~~~v-g~~~lv~~d~~~g~L~v~~l~sg~  257 (794)
                      -.++  +.+.    |-+...+++.+ -++-.+|+-...+.+|+..|..+.
T Consensus       433 ~~~~--~~~~----~~~y~~s~vAv~~~~~~vaVGG~Dgkvhvysl~g~~  476 (603)
T KOG0318|consen  433 DQTK--VSSI----PIGYESSAVAVSPDGSEVAVGGQDGKVHVYSLSGDE  476 (603)
T ss_pred             cCCc--ceee----ccccccceEEEcCCCCEEEEecccceEEEEEecCCc
Confidence            3222  2222    22333344444 223344554456788888887655


No 144
>COG3823 Glutamine cyclotransferase [Posttranslational modification, protein turnover, chaperones]
Probab=66.16  E-value=69  Score=33.10  Aligned_cols=110  Identities=13%  Similarity=0.020  Sum_probs=74.5

Q ss_pred             CCEEEEEeC-CCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccC
Q 003800           53 RKRVVVSTE-ENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSK  131 (794)
Q Consensus        53 ~~~Vyv~t~-~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~  131 (794)
                      ++.+|..|- +|+-.-.|++|=+++=|+..+..+  -++  ..++.-+.+|.....++-.|++|=.+.=+........  
T Consensus       100 gd~~y~LTw~egvaf~~d~~t~~~lg~~~y~GeG--WgL--t~d~~~LimsdGsatL~frdP~tfa~~~~v~VT~~g~--  173 (262)
T COG3823         100 GDYFYQLTWKEGVAFKYDADTLEELGRFSYEGEG--WGL--TSDDKNLIMSDGSATLQFRDPKTFAELDTVQVTDDGV--  173 (262)
T ss_pred             cceEEEEEeccceeEEEChHHhhhhcccccCCcc--eee--ecCCcceEeeCCceEEEecCHHHhhhcceEEEEECCe--
Confidence            677999885 688888999998888888776652  234  3444445556555789999999887777766654321  


Q ss_pred             CccccccccccccCCeEEEE--ECCEEEEEECCCCcEE-EEE
Q 003800          132 PLLLVPTNLKVDKDSLILVS--SKGCLHAVSSIDGEIL-WTR  170 (794)
Q Consensus       132 ~~~~~~~~~~~~~~~~V~V~--~~g~l~ald~~tG~~~-W~~  170 (794)
                        |+.-.+...-.++.++.-  ...++.++++++|+++ |-.
T Consensus       174 --pv~~LNELE~VdG~lyANVw~t~~I~rI~p~sGrV~~wid  213 (262)
T COG3823         174 --PVSKLNELEWVDGELYANVWQTTRIARIDPDSGRVVAWID  213 (262)
T ss_pred             --ecccccceeeeccEEEEeeeeecceEEEcCCCCcEEEEEE
Confidence              222222223336777773  4888999999999976 543


No 145
>KOG1036 consensus Mitotic spindle checkpoint protein BUB3, WD repeat superfamily [Cell cycle control, cell division, chromosome partitioning]
Probab=65.88  E-value=1.5e+02  Score=32.24  Aligned_cols=61  Identities=13%  Similarity=0.142  Sum_probs=41.7

Q ss_pred             CCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCC
Q 003800           53 RKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPD  115 (794)
Q Consensus        53 ~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~t  115 (794)
                      ...+++++-+|.|--+|..+|...==  ....+.+.++...-..+.|+-++.++++..||+..
T Consensus        65 ~~~~~~G~~dg~vr~~Dln~~~~~~i--gth~~~i~ci~~~~~~~~vIsgsWD~~ik~wD~R~  125 (323)
T KOG1036|consen   65 ESTIVTGGLDGQVRRYDLNTGNEDQI--GTHDEGIRCIEYSYEVGCVISGSWDKTIKFWDPRN  125 (323)
T ss_pred             CceEEEeccCceEEEEEecCCcceee--ccCCCceEEEEeeccCCeEEEcccCccEEEEeccc
Confidence            56799999999999999999865321  11222344443233445555577889999999976


No 146
>KOG0649 consensus WD40 repeat protein [General function prediction only]
Probab=65.81  E-value=1.9e+02  Score=30.68  Aligned_cols=159  Identities=14%  Similarity=0.131  Sum_probs=86.3

Q ss_pred             CEEEEEEccCCeEEEEeCC------CCcEeEEEeccCccccCCccccccccccc-cCCeEEEE-ECCEEEEEECCCCcEE
Q 003800           96 KYVITLSSDGSTLRAWNLP------DGQMVWESFLRGSKHSKPLLLVPTNLKVD-KDSLILVS-SKGCLHAVSSIDGEIL  167 (794)
Q Consensus        96 ~~~V~Vs~~g~~v~A~d~~------tG~llWe~~l~~~~~s~~~~~~~~~~~~~-~~~~V~V~-~~g~l~ald~~tG~~~  167 (794)
                      ++.+..+++ |.|++|.=+      -=+.+||....-...+...|-+. ..-.+ ..+.++.. .|+.+|..|.++|+..
T Consensus        72 d~~Lls~gd-G~V~gw~W~E~~es~~~K~lwe~~~P~~~~~~evPeIN-am~ldP~enSi~~AgGD~~~y~~dlE~G~i~  149 (325)
T KOG0649|consen   72 DDFLLSGGD-GLVYGWEWNEEEESLATKRLWEVKIPMQVDAVEVPEIN-AMWLDPSENSILFAGGDGVIYQVDLEDGRIQ  149 (325)
T ss_pred             hhheeeccC-ceEEEeeehhhhhhccchhhhhhcCccccCcccCCccc-eeEeccCCCcEEEecCCeEEEEEEecCCEEE
Confidence            445554554 799999632      33678988764322101111110 00111 13444444 6999999999999999


Q ss_pred             EEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeeeeeec-c---c-CccC--ceEEEcCcEEEE
Q 003800          168 WTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHETAAF-S---G-GFVG--DVALVSSDTLVT  240 (794)
Q Consensus       168 W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~-~---~-~~s~--~~~~vg~~~lv~  240 (794)
                      =+++--...+  -.++.-...+.++-.+-+|    .+...|.+|++-+.....-- +   + ....  .++-++..-++|
T Consensus       150 r~~rGHtDYv--H~vv~R~~~~qilsG~EDG----tvRvWd~kt~k~v~~ie~yk~~~~lRp~~g~wigala~~edWlvC  223 (325)
T KOG0649|consen  150 REYRGHTDYV--HSVVGRNANGQILSGAEDG----TVRVWDTKTQKHVSMIEPYKNPNLLRPDWGKWIGALAVNEDWLVC  223 (325)
T ss_pred             EEEcCCccee--eeeeecccCcceeecCCCc----cEEEEeccccceeEEeccccChhhcCcccCceeEEEeccCceEEe
Confidence            8887654432  1122113455666544444    78889999998764432111 1   0 1111  244445667788


Q ss_pred             EECCCCeEEEEEeecceeeeEEEee
Q 003800          241 LDTTRSILVTVSFKNRKIAFQETHL  265 (794)
Q Consensus       241 ~d~~~g~L~v~~l~sg~~~~~~~~l  265 (794)
                      .-  ...|..-.|.+-+ ....+|+
T Consensus       224 Gg--Gp~lslwhLrsse-~t~vfpi  245 (325)
T KOG0649|consen  224 GG--GPKLSLWHLRSSE-STCVFPI  245 (325)
T ss_pred             cC--CCceeEEeccCCC-ceEEEec
Confidence            73  3355555666544 3555565


No 147
>PLN02153 epithiospecifier protein
Probab=65.49  E-value=2.2e+02  Score=31.33  Aligned_cols=152  Identities=13%  Similarity=0.096  Sum_probs=77.4

Q ss_pred             CCEEEEEeCC------CEEEEEECcCCccceEEEcCc-----ccc-eeeeeeeeCCEEEEEEccC-----------CeEE
Q 003800           53 RKRVVVSTEE------NVIASLDLRHGEIFWRHVLGI-----NDV-VDGIDIALGKYVITLSSDG-----------STLR  109 (794)
Q Consensus        53 ~~~Vyv~t~~------g~l~ALn~~tG~ivWR~~l~~-----~~~-i~~l~~~~g~~~V~Vs~~g-----------~~v~  109 (794)
                      +++||+....      +.+..+|+++.  .|+..-.-     +.. .....+..++.+++++|..           ..+.
T Consensus        85 ~~~iyv~GG~~~~~~~~~v~~yd~~t~--~W~~~~~~~~~~~p~~R~~~~~~~~~~~iyv~GG~~~~~~~~~~~~~~~v~  162 (341)
T PLN02153         85 GTKLYIFGGRDEKREFSDFYSYDTVKN--EWTFLTKLDEEGGPEARTFHSMASDENHVYVFGGVSKGGLMKTPERFRTIE  162 (341)
T ss_pred             CCEEEEECCCCCCCccCcEEEEECCCC--EEEEeccCCCCCCCCCceeeEEEEECCEEEEECCccCCCccCCCcccceEE
Confidence            6778887652      46889999875  59864321     100 1111123455566666632           2578


Q ss_pred             EEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEEEC---------------CEEEEEECCCCcEEEEEecc-
Q 003800          110 AWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSSK---------------GCLHAVSSIDGEILWTRDFA-  173 (794)
Q Consensus       110 A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~~---------------g~l~ald~~tG~~~W~~~~~-  173 (794)
                      .||+.+.  .|+..-.....  ..+-.... ....++.+++..+               ..+.++|..+.  .|+.-.. 
T Consensus       163 ~yd~~~~--~W~~l~~~~~~--~~~r~~~~-~~~~~~~iyv~GG~~~~~~~gG~~~~~~~~v~~yd~~~~--~W~~~~~~  235 (341)
T PLN02153        163 AYNIADG--KWVQLPDPGEN--FEKRGGAG-FAVVQGKIWVVYGFATSILPGGKSDYESNAVQFFDPASG--KWTEVETT  235 (341)
T ss_pred             EEECCCC--eEeeCCCCCCC--CCCCCcce-EEEECCeEEEEeccccccccCCccceecCceEEEEcCCC--cEEecccc
Confidence            8998765  58853221100  00000000 1112456666421               35788887754  4876432 


Q ss_pred             ----CcceeeeeEEEEecCCEEEEEEecC---------C--ceeEEEEEEcCCCceeeee
Q 003800          174 ----AESVEVQQVIQLDESDQIYVVGYAG---------S--SQFHAYQINAMNGELLNHE  218 (794)
Q Consensus       174 ----~~~~~~~~~v~s~~~~~vyv~~~~g---------~--~~~~v~ald~~tG~~~w~~  218 (794)
                          .+.. ...+  ..-++.+|+.+...         .  ..-.++++|+.+.  .|+.
T Consensus       236 g~~P~~r~-~~~~--~~~~~~iyv~GG~~~~~~~~~~~~~~~~n~v~~~d~~~~--~W~~  290 (341)
T PLN02153        236 GAKPSARS-VFAH--AVVGKYIIIFGGEVWPDLKGHLGPGTLSNEGYALDTETL--VWEK  290 (341)
T ss_pred             CCCCCCcc-eeee--EEECCEEEEECcccCCccccccccccccccEEEEEcCcc--EEEe
Confidence                1111 1111  23578899876531         0  0125788887644  5764


No 148
>KOG0647 consensus mRNA export protein (contains WD40 repeats) [RNA processing and modification]
Probab=65.24  E-value=2.1e+02  Score=31.15  Aligned_cols=154  Identities=11%  Similarity=0.058  Sum_probs=86.9

Q ss_pred             CCCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCE--EEEEEccCCeEEEEeCCCCcEeEEEeccCccc
Q 003800           52 GRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKY--VITLSSDGSTLRAWNLPDGQMVWESFLRGSKH  129 (794)
Q Consensus        52 ~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~--~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~  129 (794)
                      ++.+||.++-++.+--.|..+|++.  +.-.....+...+-..+..  .++-|+.+.+|+-||...-.++=+..+..-.+
T Consensus        83 dgskVf~g~~Dk~~k~wDL~S~Q~~--~v~~Hd~pvkt~~wv~~~~~~cl~TGSWDKTlKfWD~R~~~pv~t~~LPeRvY  160 (347)
T KOG0647|consen   83 DGSKVFSGGCDKQAKLWDLASGQVS--QVAAHDAPVKTCHWVPGMNYQCLVTGSWDKTLKFWDTRSSNPVATLQLPERVY  160 (347)
T ss_pred             CCceEEeeccCCceEEEEccCCCee--eeeecccceeEEEEecCCCcceeEecccccceeecccCCCCeeeeeeccceee
Confidence            4667999999999999999999652  2222222344333222222  33325568899999999999998888876544


Q ss_pred             cCCccccccccccccCCeEEEE-ECCEEEEEECCCCcEEEEE-eccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEE
Q 003800          130 SKPLLLVPTNLKVDKDSLILVS-SKGCLHAVSSIDGEILWTR-DFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQI  207 (794)
Q Consensus       130 s~~~~~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG~~~W~~-~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~al  207 (794)
                      +.+  +       . -...+|. .+..+..+++.+|-..-+. +.|...  ..+++....+..-|++|..-|   ++..-
T Consensus       161 a~D--v-------~-~pm~vVata~r~i~vynL~n~~te~k~~~SpLk~--Q~R~va~f~d~~~~alGsiEG---rv~iq  225 (347)
T KOG0647|consen  161 AAD--V-------L-YPMAVVATAERHIAVYNLENPPTEFKRIESPLKW--QTRCVACFQDKDGFALGSIEG---RVAIQ  225 (347)
T ss_pred             ehh--c-------c-CceeEEEecCCcEEEEEcCCCcchhhhhcCcccc--eeeEEEEEecCCceEeeeecc---eEEEE
Confidence            111  1       1 2233443 5788999998887543221 111111  112222233444455543322   66666


Q ss_pred             EcCCCceeeeeeeec
Q 003800          208 NAMNGELLNHETAAF  222 (794)
Q Consensus       208 d~~tG~~~w~~~v~~  222 (794)
                      ....|.+.....+.+
T Consensus       226 ~id~~~~~~nFtFkC  240 (347)
T KOG0647|consen  226 YIDDPNPKDNFTFKC  240 (347)
T ss_pred             ecCCCCccCceeEEE
Confidence            666666544444333


No 149
>cd00028 B_lectin Bulb-type mannose-specific lectin. The domain contains a three-fold internal repeat (beta-prism architecture). The consensus sequence motif QXDXNXVXY is involved in alpha-D-mannose recognition. Lectins are carbohydrate-binding proteins which specifically recognize diverse carbohydrates and mediate a wide variety of biological processes, such as cell-cell and host-pathogen interactions, serum glycoprotein turnover, and innate immune responses.
Probab=64.52  E-value=48  Score=30.42  Aligned_cols=71  Identities=27%  Similarity=0.442  Sum_probs=42.3

Q ss_pred             CccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEE-E
Q 003800           73 GEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILV-S  151 (794)
Q Consensus        73 G~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V-~  151 (794)
                      +.++|......+        ......+.+..+ |.++..|. +|..+|...... ..               ...+++ .
T Consensus        41 ~~~vW~snt~~~--------~~~~~~l~l~~d-GnLvl~~~-~g~~vW~S~~~~-~~---------------~~~~~~L~   94 (116)
T cd00028          41 RTVVWVANRDNP--------SGSSCTLTLQSD-GNLVIYDG-SGTVVWSSNTTR-VN---------------GNYVLVLL   94 (116)
T ss_pred             CeEEEECCCCCC--------CCCCEEEEEecC-CCeEEEcC-CCcEEEEecccC-CC---------------CceEEEEe
Confidence            678898655332        112223444554 46777776 689999866543 10               122333 3


Q ss_pred             ECCEEEEEECCCCcEEEEE
Q 003800          152 SKGCLHAVSSIDGEILWTR  170 (794)
Q Consensus       152 ~~g~l~ald~~tG~~~W~~  170 (794)
                      .+|.|.-++. +|+++|+-
T Consensus        95 ddGnlvl~~~-~~~~~W~S  112 (116)
T cd00028          95 DDGNLVLYDS-DGNFLWQS  112 (116)
T ss_pred             CCCCEEEECC-CCCEEEcC
Confidence            6788777775 58999974


No 150
>KOG1027 consensus Serine/threonine protein kinase and endoribonuclease ERN1/IRE1, sensor of the unfolded protein response pathway [Signal transduction mechanisms]
Probab=63.42  E-value=30  Score=42.30  Aligned_cols=109  Identities=17%  Similarity=0.246  Sum_probs=66.5

Q ss_pred             CCCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccC
Q 003800           52 GRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSK  131 (794)
Q Consensus        52 ~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~  131 (794)
                      .++.+|.++.++.-+-+|++||+..|.....++  +..+ +..+..-       .+|.-.|..+=...|......... .
T Consensus       106 sdGi~ysg~k~d~~~lvD~~tg~~~~tf~~~~~--~~~~-v~~grt~-------ytv~m~d~~~~~~~wn~t~~dy~a-~  174 (903)
T KOG1027|consen  106 SDGILYSGSKQDIWYLVDPKTGEIDYTFNTAEP--IKQL-VYLGRTN-------YTVTMYDKNVRGKTWNTTFGDYSA-Q  174 (903)
T ss_pred             CCCeEEecccccceEEecCCccceeEEEecCCc--chhh-eecccce-------eEEecccCcccCceeeccccchhc-c
Confidence            477799999999999999999999999887664  3322 1222222       233333444445556555443221 1


Q ss_pred             CccccccccccccCCeEEE--EECCEEEEEECCCCcEEEEEeccCcce
Q 003800          132 PLLLVPTNLKVDKDSLILV--SSKGCLHAVSSIDGEILWTRDFAAESV  177 (794)
Q Consensus       132 ~~~~~~~~~~~~~~~~V~V--~~~g~l~ald~~tG~~~W~~~~~~~~~  177 (794)
                      .++-.      .+.....+  .++|-+.-+|.++|+.+|..+...+..
T Consensus       175 ~~~~~------~~~~~~~~~~~~~g~i~t~D~~~g~~~~~q~~~spvv  216 (903)
T KOG1027|consen  175 YPSGV------RGEKMSHFHSLGNGYIVTVDSESGEKLWLQDLLSPVV  216 (903)
T ss_pred             CCCcc------CCceeEEEeecCCccEEeccCcccceeeccccCCceE
Confidence            11111      11122222  247777789999999999998876643


No 151
>PF08450 SGL:  SMP-30/Gluconolaconase/LRE-like region;  InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE. ; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=63.25  E-value=1.9e+02  Score=29.91  Aligned_cols=145  Identities=17%  Similarity=0.234  Sum_probs=82.1

Q ss_pred             CCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCCccccccccccc-cCCeEEEEECCEEEEEECCCCcEEEEEecc
Q 003800           95 GKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVD-KDSLILVSSKGCLHAVSSIDGEILWTRDFA  173 (794)
Q Consensus        95 g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~-~~~~V~V~~~g~l~ald~~tG~~~W~~~~~  173 (794)
                      .+.++++...++.|+.||+.+|+.. ...+..+.   +.       ... .++.+++...+.+..+|..+|+..--.+..
T Consensus        11 ~g~l~~~D~~~~~i~~~~~~~~~~~-~~~~~~~~---G~-------~~~~~~g~l~v~~~~~~~~~d~~~g~~~~~~~~~   79 (246)
T PF08450_consen   11 DGRLYWVDIPGGRIYRVDPDTGEVE-VIDLPGPN---GM-------AFDRPDGRLYVADSGGIAVVDPDTGKVTVLADLP   79 (246)
T ss_dssp             TTEEEEEETTTTEEEEEETTTTEEE-EEESSSEE---EE-------EEECTTSEEEEEETTCEEEEETTTTEEEEEEEEE
T ss_pred             CCEEEEEEcCCCEEEEEECCCCeEE-EEecCCCc---eE-------EEEccCCEEEEEEcCceEEEecCCCcEEEEeecc
Confidence            3445554445789999999888653 22322211   11       112 257777877666677799999766544442


Q ss_pred             --C-cceeeeeEEEEecCCEEEEEEecCC---ce--eEEEEEEcCCCceee-eeeeecccCccCceEEEcCcEEEEEECC
Q 003800          174 --A-ESVEVQQVIQLDESDQIYVVGYAGS---SQ--FHAYQINAMNGELLN-HETAAFSGGFVGDVALVSSDTLVTLDTT  244 (794)
Q Consensus       174 --~-~~~~~~~~v~s~~~~~vyv~~~~g~---~~--~~v~ald~~tG~~~w-~~~v~~~~~~s~~~~~vg~~~lv~~d~~  244 (794)
                        . +...+--+. ...++.+|+......   ..  ..++.+++. |+... ...+..|.++   ++-..++.+++.|+.
T Consensus        80 ~~~~~~~~~ND~~-vd~~G~ly~t~~~~~~~~~~~~g~v~~~~~~-~~~~~~~~~~~~pNGi---~~s~dg~~lyv~ds~  154 (246)
T PF08450_consen   80 DGGVPFNRPNDVA-VDPDGNLYVTDSGGGGASGIDPGSVYRIDPD-GKVTVVADGLGFPNGI---AFSPDGKTLYVADSF  154 (246)
T ss_dssp             TTCSCTEEEEEEE-E-TTS-EEEEEECCBCTTCGGSEEEEEEETT-SEEEEEEEEESSEEEE---EEETTSSEEEEEETT
T ss_pred             CCCcccCCCceEE-EcCCCCEEEEecCCCccccccccceEEECCC-CeEEEEecCcccccce---EECCcchheeecccc
Confidence              1 222232232 245777998655321   11  579999998 66432 2223333322   222234567778888


Q ss_pred             CCeEEEEEeec
Q 003800          245 RSILVTVSFKN  255 (794)
Q Consensus       245 ~g~L~v~~l~s  255 (794)
                      ++.++..++..
T Consensus       155 ~~~i~~~~~~~  165 (246)
T PF08450_consen  155 NGRIWRFDLDA  165 (246)
T ss_dssp             TTEEEEEEEET
T ss_pred             cceeEEEeccc
Confidence            88899999874


No 152
>KOG0306 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=63.25  E-value=3.5e+02  Score=33.02  Aligned_cols=101  Identities=16%  Similarity=0.175  Sum_probs=61.8

Q ss_pred             EEEEeCCCEEEEEECcCCccceEEEcCcc-------cceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCcc
Q 003800           56 VVVSTEENVIASLDLRHGEIFWRHVLGIN-------DVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSK  128 (794)
Q Consensus        56 Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~-------~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~  128 (794)
                      |+.+.....+++|+. +|+..=+-.....       ..+.++. ...+..++.|+.|+.+.-||..+++-+-...-. ..
T Consensus       339 v~l~nNtv~~ysl~~-s~~~~p~~~~~~~i~~~GHR~dVRsl~-vS~d~~~~~Sga~~SikiWn~~t~kciRTi~~~-y~  415 (888)
T KOG0306|consen  339 VLLANNTVEWYSLEN-SGKTSPEADRTSNIEIGGHRSDVRSLC-VSSDSILLASGAGESIKIWNRDTLKCIRTITCG-YI  415 (888)
T ss_pred             EEeecCceEEEEecc-CCCCCccccccceeeeccchhheeEEE-eecCceeeeecCCCcEEEEEccCcceeEEeccc-cE
Confidence            556666778999998 6665411110000       0122332 234455666777789999999999988776643 22


Q ss_pred             ccCCccccccccccccCCeEEEE-ECCEEEEEECCCCcEE
Q 003800          129 HSKPLLLVPTNLKVDKDSLILVS-SKGCLHAVSSIDGEIL  167 (794)
Q Consensus       129 ~s~~~~~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG~~~  167 (794)
                      +  ...++|      ++..|++. .+|+|..+|..++..+
T Consensus       416 l--~~~Fvp------gd~~Iv~G~k~Gel~vfdlaS~~l~  447 (888)
T KOG0306|consen  416 L--ASKFVP------GDRYIVLGTKNGELQVFDLASASLV  447 (888)
T ss_pred             E--EEEecC------CCceEEEeccCCceEEEEeehhhhh
Confidence            2  223444      24555555 4999999999887644


No 153
>PRK03629 tolB translocation protein TolB; Provisional
Probab=63.02  E-value=2.8e+02  Score=31.74  Aligned_cols=149  Identities=13%  Similarity=0.139  Sum_probs=72.0

Q ss_pred             eCCEEEEEEc--cCCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEE-E-CC--EEEEEECCCCcEE
Q 003800           94 LGKYVITLSS--DGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVS-S-KG--CLHAVSSIDGEIL  167 (794)
Q Consensus        94 ~g~~~V~Vs~--~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~-~-~g--~l~ald~~tG~~~  167 (794)
                      .|+.+++++.  .+..++.||..+|+..--....+..  ..+.+.     ++ +..+++. . +|  .|+.+|.++|+..
T Consensus       209 DG~~la~~s~~~g~~~i~i~dl~~G~~~~l~~~~~~~--~~~~~S-----PD-G~~La~~~~~~g~~~I~~~d~~tg~~~  280 (429)
T PRK03629        209 DGSKLAYVTFESGRSALVIQTLANGAVRQVASFPRHN--GAPAFS-----PD-GSKLAFALSKTGSLNLYVMDLASGQIR  280 (429)
T ss_pred             CCCEEEEEEecCCCcEEEEEECCCCCeEEccCCCCCc--CCeEEC-----CC-CCEEEEEEcCCCCcEEEEEECCCCCEE
Confidence            4666777653  2357999999999754333222211  112222     23 3344443 2 33  6888998888653


Q ss_pred             EEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeeeeeecccCccCceEE-EcCcEEEEEECCC-
Q 003800          168 WTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHETAAFSGGFVGDVAL-VSSDTLVTLDTTR-  245 (794)
Q Consensus       168 W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~~~~s~~~~~-vg~~~lv~~d~~~-  245 (794)
                      =-......   ...+..+.++..+++.+..++ ...++.+|+.+|+...   +.........+.+ ..+..++...... 
T Consensus       281 ~lt~~~~~---~~~~~wSPDG~~I~f~s~~~g-~~~Iy~~d~~~g~~~~---lt~~~~~~~~~~~SpDG~~Ia~~~~~~g  353 (429)
T PRK03629        281 QVTDGRSN---NTEPTWFPDSQNLAYTSDQAG-RPQVYKVNINGGAPQR---ITWEGSQNQDADVSSDGKFMVMVSSNGG  353 (429)
T ss_pred             EccCCCCC---cCceEECCCCCEEEEEeCCCC-CceEEEEECCCCCeEE---eecCCCCccCEEECCCCCEEEEEEccCC
Confidence            21111111   111221334555554443332 2478888998886531   1111111111222 2334444443322 


Q ss_pred             -CeEEEEEeecce
Q 003800          246 -SILVTVSFKNRK  257 (794)
Q Consensus       246 -g~L~v~~l~sg~  257 (794)
                       ..+++.|+.+|.
T Consensus       354 ~~~I~~~dl~~g~  366 (429)
T PRK03629        354 QQHIAKQDLATGG  366 (429)
T ss_pred             CceEEEEECCCCC
Confidence             357778887776


No 154
>KOG0303 consensus Actin-binding protein Coronin, contains WD40 repeats [Cytoskeleton]
Probab=62.93  E-value=1e+02  Score=34.64  Aligned_cols=92  Identities=13%  Similarity=0.204  Sum_probs=55.8

Q ss_pred             CEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEE--ECCEEEEEECCCCcEEEEEecc
Q 003800           96 KYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVS--SKGCLHAVSSIDGEILWTRDFA  173 (794)
Q Consensus        96 ~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~--~~g~l~ald~~tG~~~W~~~~~  173 (794)
                      .+++.-+|.++.|..||..||+-+-+........     .+    ....++..++.  .|..++.+|+.+|+++|+-..-
T Consensus       144 ~NVLlsag~Dn~v~iWnv~tgeali~l~hpd~i~-----S~----sfn~dGs~l~TtckDKkvRv~dpr~~~~v~e~~~h  214 (472)
T KOG0303|consen  144 PNVLLSAGSDNTVSIWNVGTGEALITLDHPDMVY-----SM----SFNRDGSLLCTTCKDKKVRVIDPRRGTVVSEGVAH  214 (472)
T ss_pred             hhhHhhccCCceEEEEeccCCceeeecCCCCeEE-----EE----EeccCCceeeeecccceeEEEcCCCCcEeeecccc
Confidence            3444435557899999999999887766333221     11    22235666665  3899999999999999997333


Q ss_pred             CcceeeeeEEEEecCCEEEEEEecC
Q 003800          174 AESVEVQQVIQLDESDQIYVVGYAG  198 (794)
Q Consensus       174 ~~~~~~~~~v~s~~~~~vyv~~~~g  198 (794)
                      .+.. +.+.+. ..++.++..|+..
T Consensus       215 eG~k-~~Raif-l~~g~i~tTGfsr  237 (472)
T KOG0303|consen  215 EGAK-PARAIF-LASGKIFTTGFSR  237 (472)
T ss_pred             cCCC-cceeEE-eccCceeeecccc
Confidence            3222 333332 2344455544443


No 155
>smart00108 B_lectin Bulb-type mannose-specific lectin.
Probab=62.66  E-value=63  Score=29.52  Aligned_cols=81  Identities=26%  Similarity=0.435  Sum_probs=45.0

Q ss_pred             CEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCCcccccccccc
Q 003800           63 NVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKV  142 (794)
Q Consensus        63 g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~  142 (794)
                      +.+.-.+..++.++|......+        ......+.+..+ |.+...|. +|..+|+...... .             
T Consensus        30 gnlV~~~~~~~~~vW~snt~~~--------~~~~~~l~l~~d-GnLvl~~~-~g~~vW~S~t~~~-~-------------   85 (114)
T smart00108       30 YNLILYKSSSRTVVWVANRDNP--------VSDSCTLTLQSD-GNLVLYDG-DGRVVWSSNTTGA-N-------------   85 (114)
T ss_pred             EEEEEEECCCCcEEEECCCCCC--------CCCCEEEEEeCC-CCEEEEeC-CCCEEEEecccCC-C-------------
Confidence            3333334333678898544322        111134444554 46777775 5899999754311 0             


Q ss_pred             ccCCeEEEE-ECCEEEEEECCCCcEEEEE
Q 003800          143 DKDSLILVS-SKGCLHAVSSIDGEILWTR  170 (794)
Q Consensus       143 ~~~~~V~V~-~~g~l~ald~~tG~~~W~~  170 (794)
                        ....+++ .+|.|.-++. .|+++|+-
T Consensus        86 --~~~~~~L~ddGnlvl~~~-~~~~~W~S  111 (114)
T smart00108       86 --GNYVLVLLDDGNLVIYDS-DGNFLWQS  111 (114)
T ss_pred             --CceEEEEeCCCCEEEECC-CCCEEeCC
Confidence              1223333 5788877774 67899973


No 156
>PF14583 Pectate_lyase22:  Oligogalacturonate lyase; PDB: 3C5M_C 3PE7_A.
Probab=62.33  E-value=2.3e+02  Score=32.08  Aligned_cols=102  Identities=15%  Similarity=0.024  Sum_probs=46.6

Q ss_pred             EEECcCCccceEEEcCcccc------eeeeeeeeCCEEEEEEcc--CCeEEEEeCCCCcEeEEEeccCccccCCcccccc
Q 003800           67 SLDLRHGEIFWRHVLGINDV------VDGIDIALGKYVITLSSD--GSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPT  138 (794)
Q Consensus        67 ALn~~tG~ivWR~~l~~~~~------i~~l~~~~g~~~V~Vs~~--g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~  138 (794)
                      -.|+.||..+=|-.-.....      -.+. -..|..++|.|..  ..+++.+|.++|+..==....+... .+..+.  
T Consensus        14 ~~D~~TG~~VtrLT~~~~~~h~~YF~~~~f-t~dG~kllF~s~~dg~~nly~lDL~t~~i~QLTdg~g~~~-~g~~~s--   89 (386)
T PF14583_consen   14 WIDPDTGHRVTRLTPPDGHSHRLYFYQNCF-TDDGRKLLFASDFDGNRNLYLLDLATGEITQLTDGPGDNT-FGGFLS--   89 (386)
T ss_dssp             EE-TTT--EEEE-S-TTS-EE---TTS--B--TTS-EEEEEE-TTSS-EEEEEETTT-EEEE---SS-B-T-TT-EE---
T ss_pred             EeCCCCCceEEEecCCCCcccceeecCCCc-CCCCCEEEEEeccCCCcceEEEEcccCEEEECccCCCCCc-cceEEe--
Confidence            35778887766532221100      1122 1346677886642  4789999999999873222222111 111111  


Q ss_pred             ccccccCCeEEEE-ECCEEEEEECCCCcEEEEEeccCcc
Q 003800          139 NLKVDKDSLILVS-SKGCLHAVSSIDGEILWTRDFAAES  176 (794)
Q Consensus       139 ~~~~~~~~~V~V~-~~g~l~ald~~tG~~~W~~~~~~~~  176 (794)
                         .. ++.++.. .+..|.++|..|++..=-+..|...
T Consensus        90 ---~~-~~~~~Yv~~~~~l~~vdL~T~e~~~vy~~p~~~  124 (386)
T PF14583_consen   90 ---PD-DRALYYVKNGRSLRRVDLDTLEERVVYEVPDDW  124 (386)
T ss_dssp             ---TT-SSEEEEEETTTEEEEEETTT--EEEEEE--TTE
T ss_pred             ---cC-CCeEEEEECCCeEEEEECCcCcEEEEEECCccc
Confidence               22 4454444 5679999999999877666666544


No 157
>KOG0288 consensus WD40 repeat protein TipD [General function prediction only]
Probab=61.78  E-value=73  Score=35.83  Aligned_cols=108  Identities=19%  Similarity=0.205  Sum_probs=66.4

Q ss_pred             EEEcc-CCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEEE-CCEEEEEECCCCcEEEEEeccCcce
Q 003800          100 TLSSD-GSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSS-KGCLHAVSSIDGEILWTRDFAAESV  177 (794)
Q Consensus       100 ~Vs~~-g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~-~g~l~ald~~tG~~~W~~~~~~~~~  177 (794)
                      ++||. +.+||.||..++...-+.++.+-..  ++.+..     + +..+...+ +..+-.+|..+-+++=.+..+.-..
T Consensus       315 ~~SgH~DkkvRfwD~Rs~~~~~sv~~gg~vt--Sl~ls~-----~-g~~lLsssRDdtl~viDlRt~eI~~~~sA~g~k~  386 (459)
T KOG0288|consen  315 VISGHFDKKVRFWDIRSADKTRSVPLGGRVT--SLDLSM-----D-GLELLSSSRDDTLKVIDLRTKEIRQTFSAEGFKC  386 (459)
T ss_pred             eeecccccceEEEeccCCceeeEeecCccee--eEeecc-----C-CeEEeeecCCCceeeeecccccEEEEeecccccc
Confidence            44663 6789999999999999999887432  221111     1 33444443 8888888888877776665543211


Q ss_pred             --eeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeeee
Q 003800          178 --EVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHET  219 (794)
Q Consensus       178 --~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~  219 (794)
                        ....++.+.++..|-+    |+.+..|+..+..+|+......
T Consensus       387 asDwtrvvfSpd~~YvaA----GS~dgsv~iW~v~tgKlE~~l~  426 (459)
T KOG0288|consen  387 ASDWTRVVFSPDGSYVAA----GSADGSVYIWSVFTGKLEKVLS  426 (459)
T ss_pred             ccccceeEECCCCceeee----ccCCCcEEEEEccCceEEEEec
Confidence              1223332333333333    3333479999999998876654


No 158
>KOG1446 consensus Histone H3 (Lys4) methyltransferase complex and RNA cleavage factor II complex, subunit SWD2 [RNA processing and modification; Chromatin structure and dynamics; Posttranslational modification, protein turnover, chaperones]
Probab=61.09  E-value=2.5e+02  Score=30.60  Aligned_cols=202  Identities=13%  Similarity=0.110  Sum_probs=104.8

Q ss_pred             ccCceeeeeeeeeccCCCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEcc--CCeEEEEeCC
Q 003800           37 YIGKVKHAVFHTQKTGRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSD--GSTLRAWNLP  114 (794)
Q Consensus        37 ~vG~~~~~~f~~~~~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~--g~~v~A~d~~  114 (794)
                      +-|++..-.|+.   ++..+++.+++..|.-.|..+|+.+=...-...+ +...........+.-|+.  +..+|-++..
T Consensus        13 ~~~~i~sl~fs~---~G~~litss~dDsl~LYd~~~g~~~~ti~skkyG-~~~~~Fth~~~~~i~sStk~d~tIryLsl~   88 (311)
T KOG1446|consen   13 TNGKINSLDFSD---DGLLLITSSEDDSLRLYDSLSGKQVKTINSKKYG-VDLACFTHHSNTVIHSSTKEDDTIRYLSLH   88 (311)
T ss_pred             CCCceeEEEecC---CCCEEEEecCCCeEEEEEcCCCceeeEeeccccc-ccEEEEecCCceEEEccCCCCCceEEEEee
Confidence            445554445652   3566888889999999999999877655444332 222222333333333432  5789999999


Q ss_pred             CCcEeEEEeccCccccCCccccccccccccCCeEEEEE--CCEEEEEECCCCcEEEEEeccCcce----eeeeEEEEec-
Q 003800          115 DGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSS--KGCLHAVSSIDGEILWTRDFAAESV----EVQQVIQLDE-  187 (794)
Q Consensus       115 tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~--~g~l~ald~~tG~~~W~~~~~~~~~----~~~~~v~s~~-  187 (794)
                      |-+-+--+......+ .++.+.|       .++.|+.+  |.+++         +|-.+.+...-    ....+. +-+ 
T Consensus        89 dNkylRYF~GH~~~V-~sL~~sP-------~~d~FlS~S~D~tvr---------LWDlR~~~cqg~l~~~~~pi~-AfDp  150 (311)
T KOG1446|consen   89 DNKYLRYFPGHKKRV-NSLSVSP-------KDDTFLSSSLDKTVR---------LWDLRVKKCQGLLNLSGRPIA-AFDP  150 (311)
T ss_pred             cCceEEEcCCCCceE-EEEEecC-------CCCeEEecccCCeEE---------eeEecCCCCceEEecCCCcce-eECC
Confidence            998887777665443 3333333       34566642  55543         46555332210    011111 223 


Q ss_pred             CCEEEEEEecCCceeEEEEEEcCCCceeeeeeeecccCccC--ceEEEcCc--EEEEEECCCCeEEEEEeecceeeeEEE
Q 003800          188 SDQIYVVGYAGSSQFHAYQINAMNGELLNHETAAFSGGFVG--DVALVSSD--TLVTLDTTRSILVTVSFKNRKIAFQET  263 (794)
Q Consensus       188 ~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~~~~s~--~~~~vg~~--~lv~~d~~~g~L~v~~l~sg~~~~~~~  263 (794)
                      .|.+|+++..+ ..++++-+-.-.+.+--...+..+ ...+  ..-+-.++  ++++.  ..+..+++|--+|.+ .+.+
T Consensus       151 ~GLifA~~~~~-~~IkLyD~Rs~dkgPF~tf~i~~~-~~~ew~~l~FS~dGK~iLlsT--~~s~~~~lDAf~G~~-~~tf  225 (311)
T KOG1446|consen  151 EGLIFALANGS-ELIKLYDLRSFDKGPFTTFSITDN-DEAEWTDLEFSPDGKSILLST--NASFIYLLDAFDGTV-KSTF  225 (311)
T ss_pred             CCcEEEEecCC-CeEEEEEecccCCCCceeEccCCC-CccceeeeEEcCCCCEEEEEe--CCCcEEEEEccCCcE-eeeE
Confidence            45566655443 356666555444555444433321 1111  11122122  33333  356777788777773 3434


Q ss_pred             ee
Q 003800          264 HL  265 (794)
Q Consensus       264 ~l  265 (794)
                      ..
T Consensus       226 s~  227 (311)
T KOG1446|consen  226 SG  227 (311)
T ss_pred             ee
Confidence            43


No 159
>KOG0271 consensus Notchless-like WD40 repeat-containing protein [Function unknown]
Probab=61.06  E-value=40  Score=37.47  Aligned_cols=109  Identities=17%  Similarity=0.246  Sum_probs=66.2

Q ss_pred             ccCCCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeee-----eeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEec
Q 003800           50 KTGRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGI-----DIALGKYVITLSSDGSTLRAWNLPDGQMVWESFL  124 (794)
Q Consensus        50 ~~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l-----~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l  124 (794)
                      +++++.|..++-+|.|-.-||++|+..=|.--....-|.++     +.......+.-++.++.+|-||..-|+.+-....
T Consensus       166 sPDgk~iASG~~dg~I~lwdpktg~~~g~~l~gH~K~It~Lawep~hl~p~~r~las~skDg~vrIWd~~~~~~~~~lsg  245 (480)
T KOG0271|consen  166 SPDGKKIASGSKDGSIRLWDPKTGQQIGRALRGHKKWITALAWEPLHLVPPCRRLASSSKDGSVRIWDTKLGTCVRTLSG  245 (480)
T ss_pred             CCCcchhhccccCCeEEEecCCCCCcccccccCcccceeEEeecccccCCCccceecccCCCCEEEEEccCceEEEEecc
Confidence            34566677788899999999999998766544433223322     1122333344345567999999999998877665


Q ss_pred             cCccccCCccccccccccccCCeEEEEE-CCEEEEEECCCCcE
Q 003800          125 RGSKHSKPLLLVPTNLKVDKDSLILVSS-KGCLHAVSSIDGEI  166 (794)
Q Consensus       125 ~~~~~s~~~~~~~~~~~~~~~~~V~V~~-~g~l~ald~~tG~~  166 (794)
                      .....    ..+    ...+++.+|-.+ |+++...++.+|..
T Consensus       246 HT~~V----TCv----rwGG~gliySgS~DrtIkvw~a~dG~~  280 (480)
T KOG0271|consen  246 HTASV----TCV----RWGGEGLIYSGSQDRTIKVWRALDGKL  280 (480)
T ss_pred             Cccce----EEE----EEcCCceEEecCCCceEEEEEccchhH
Confidence            54332    111    223234444443 77777666666543


No 160
>KOG2106 consensus Uncharacterized conserved protein, contains HELP and WD40 domains [Function unknown]
Probab=60.42  E-value=3.3e+02  Score=31.74  Aligned_cols=220  Identities=13%  Similarity=0.118  Sum_probs=111.4

Q ss_pred             CCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCC
Q 003800           53 RKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKP  132 (794)
Q Consensus        53 ~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~  132 (794)
                      ++.|+++...|.+.--++.+- ..=|+....++.+-++ -...++.+.-++.++++.+|| .+=+.+-+.+++.+.-  +
T Consensus       257 ngdviTgDS~G~i~Iw~~~~~-~~~k~~~aH~ggv~~L-~~lr~GtllSGgKDRki~~Wd-~~y~k~r~~elPe~~G--~  331 (626)
T KOG2106|consen  257 NGDVITGDSGGNILIWSKGTN-RISKQVHAHDGGVFSL-CMLRDGTLLSGGKDRKIILWD-DNYRKLRETELPEQFG--P  331 (626)
T ss_pred             CCCEEeecCCceEEEEeCCCc-eEEeEeeecCCceEEE-EEecCccEeecCccceEEecc-ccccccccccCchhcC--C
Confidence            556888888898888887544 4445555444455555 234555554366789999999 4555555666654321  1


Q ss_pred             ccccccccccccCCeEEEEE-CCEEE---------EEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCCcee
Q 003800          133 LLLVPTNLKVDKDSLILVSS-KGCLH---------AVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQF  202 (794)
Q Consensus       133 ~~~~~~~~~~~~~~~V~V~~-~g~l~---------ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~  202 (794)
                      +..+    . .+.++++|.. .+.+.         -.-..-|+.+|.....-             ....|+.+.+..   
T Consensus       332 iRtv----~-e~~~di~vGTtrN~iL~Gt~~~~f~~~v~gh~delwgla~hp-------------s~~q~~T~gqdk---  390 (626)
T KOG2106|consen  332 IRTV----A-EGKGDILVGTTRNFILQGTLENGFTLTVQGHGDELWGLATHP-------------SKNQLLTCGQDK---  390 (626)
T ss_pred             eeEE----e-cCCCcEEEeeccceEEEeeecCCceEEEEecccceeeEEcCC-------------ChhheeeccCcc---
Confidence            1111    1 2234566652 22222         22223345667654321             122233333321   


Q ss_pred             EEEEEEcCCCceeeeeeeecccCccCceEEEcCcEEEEEECCCCeEEEEEeecceeeeEEEeecccCCCCCCceEEeecC
Q 003800          203 HAYQINAMNGELLNHETAAFSGGFVGDVALVSSDTLVTLDTTRSILVTVSFKNRKIAFQETHLSNLGEDSSGMVEILPSS  282 (794)
Q Consensus       203 ~v~ald~~tG~~~w~~~v~~~~~~s~~~~~vg~~~lv~~d~~~g~L~v~~l~sg~~~~~~~~l~~l~~~~~~~~~~~~~~  282 (794)
                      .+.-.+  .-++.|...+.-|..-.+  +-+.+ .++... ..|...++|.++..  +-++.-+      ..........
T Consensus       391 ~v~lW~--~~k~~wt~~~~d~~~~~~--fhpsg-~va~Gt-~~G~w~V~d~e~~~--lv~~~~d------~~~ls~v~ys  456 (626)
T KOG2106|consen  391 HVRLWN--DHKLEWTKIIEDPAECAD--FHPSG-VVAVGT-ATGRWFVLDTETQD--LVTIHTD------NEQLSVVRYS  456 (626)
T ss_pred             eEEEcc--CCceeEEEEecCceeEee--ccCcc-eEEEee-ccceEEEEecccce--eEEEEec------CCceEEEEEc
Confidence            344445  667889997765532211  11122 344333 46888899888855  2222221      1222233333


Q ss_pred             Ccc-eeEEEecC-cEEEEEEecCC-cEEEEEee
Q 003800          283 LTG-MFTVKINN-YKLFIRLTSED-KLEVVHKV  312 (794)
Q Consensus       283 ~~~-~~~~~~~~-~~~l~~~~~~~-~~~v~~~~  312 (794)
                      +.| .+.+.+.+ +..+++++.+| +...+..-
T Consensus       457 p~G~~lAvgs~d~~iyiy~Vs~~g~~y~r~~k~  489 (626)
T KOG2106|consen  457 PDGAFLAVGSHDNHIYIYRVSANGRKYSRVGKC  489 (626)
T ss_pred             CCCCEEEEecCCCeEEEEEECCCCcEEEEeeee
Confidence            344 33344444 55677777555 44444433


No 161
>KOG2055 consensus WD40 repeat protein [General function prediction only]
Probab=60.39  E-value=1e+02  Score=35.28  Aligned_cols=77  Identities=13%  Similarity=0.146  Sum_probs=53.3

Q ss_pred             ccCCCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCcc
Q 003800           50 KTGRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSK  128 (794)
Q Consensus        50 ~~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~  128 (794)
                      +++++.|.++...|.|.-|.++||+.+=...++..  +..+.....+..+++++..|.|+-||...-..+-++.-.+..
T Consensus       312 Shd~~fia~~G~~G~I~lLhakT~eli~s~KieG~--v~~~~fsSdsk~l~~~~~~GeV~v~nl~~~~~~~rf~D~G~v  388 (514)
T KOG2055|consen  312 SHDSNFIAIAGNNGHIHLLHAKTKELITSFKIEGV--VSDFTFSSDSKELLASGGTGEVYVWNLRQNSCLHRFVDDGSV  388 (514)
T ss_pred             cCCCCeEEEcccCceEEeehhhhhhhhheeeeccE--EeeEEEecCCcEEEEEcCCceEEEEecCCcceEEEEeecCcc
Confidence            45577788888899999999999998877777655  433322333345554544459999999887777666655543


No 162
>KOG0639 consensus Transducin-like enhancer of split protein (contains WD40 repeats) [Chromatin structure and dynamics]
Probab=59.52  E-value=1.3e+02  Score=34.75  Aligned_cols=112  Identities=18%  Similarity=0.234  Sum_probs=68.5

Q ss_pred             cCCCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCcccc
Q 003800           51 TGRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHS  130 (794)
Q Consensus        51 ~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s  130 (794)
                      .+.+..|..-.+|.|+.-|..+-. +=|+--+..+...++-+..++.-+--||-++.||.||...|+.+=+..+.+... 
T Consensus       519 pDakvcFsccsdGnI~vwDLhnq~-~VrqfqGhtDGascIdis~dGtklWTGGlDntvRcWDlregrqlqqhdF~SQIf-  596 (705)
T KOG0639|consen  519 PDAKVCFSCCSDGNIAVWDLHNQT-LVRQFQGHTDGASCIDISKDGTKLWTGGLDNTVRCWDLREGRQLQQHDFSSQIF-  596 (705)
T ss_pred             CccceeeeeccCCcEEEEEcccce-eeecccCCCCCceeEEecCCCceeecCCCccceeehhhhhhhhhhhhhhhhhhe-
Confidence            345556666678999999987643 344433333333444333344455546767899999999999999999887665 


Q ss_pred             CCccccccccccccCCeEEEE-ECCEEEEEECCCCcEEEEEec
Q 003800          131 KPLLLVPTNLKVDKDSLILVS-SKGCLHAVSSIDGEILWTRDF  172 (794)
Q Consensus       131 ~~~~~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG~~~W~~~~  172 (794)
                       ++...+.      ++-+.|. .++.+-.+. .+|..+.....
T Consensus       597 -SLg~cP~------~dWlavGMens~vevlh-~skp~kyqlhl  631 (705)
T KOG0639|consen  597 -SLGYCPT------GDWLAVGMENSNVEVLH-TSKPEKYQLHL  631 (705)
T ss_pred             -ecccCCC------ccceeeecccCcEEEEe-cCCccceeecc
Confidence             3334441      3444444 466666665 45555555433


No 163
>PF05262 Borrelia_P83:  Borrelia P83/100 protein;  InterPro: IPR007926 This family consists of several Borrelia P83/P100 antigen proteins.
Probab=59.09  E-value=59  Score=37.88  Aligned_cols=98  Identities=13%  Similarity=0.101  Sum_probs=59.6

Q ss_pred             CCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeeeeeecccCccCceEE
Q 003800          153 KGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHETAAFSGGFVGDVAL  232 (794)
Q Consensus       153 ~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~~~~s~~~~~  232 (794)
                      -+.|..||+.+|.++-+-....-..   +-++...++.|-+.+..|...++++-||+.|=++..+......+   .++++
T Consensus       374 ls~LvllD~~tg~~l~~S~~~~Ir~---r~~~~~~~~~vaI~g~~G~~~ikLvlid~~tLev~kes~~~i~~---~S~l~  447 (489)
T PF05262_consen  374 LSELVLLDSDTGDTLKRSPVNGIRG---RTFYEREDDLVAIAGCSGNAAIKLVLIDPETLEVKKESEDEISW---QSSLI  447 (489)
T ss_pred             ceeEEEEeCCCCceecccccceecc---ceeEEcCCCEEEEeccCCchheEEEecCcccceeeeeccccccc---cCceE
Confidence            4789999999999887654432221   11222334444433344556789999999998888777432221   24555


Q ss_pred             E-cCcEEEEEECCCCeEEEEEeecc
Q 003800          233 V-SSDTLVTLDTTRSILVTVSFKNR  256 (794)
Q Consensus       233 v-g~~~lv~~d~~~g~L~v~~l~sg  256 (794)
                      + |+.+|+++...+|..+..-..++
T Consensus       448 ~~~~~iyaVv~~~~g~~~L~rF~~~  472 (489)
T PF05262_consen  448 VDGQMIYAVVKKDNGKWYLGRFDSN  472 (489)
T ss_pred             EcCCeEEEEEEcCCCeEEEeecCcc
Confidence            5 55566666345677666655543


No 164
>PF14870 PSII_BNR:  Photosynthesis system II assembly factor YCF48; PDB: 2XBG_A.
Probab=58.93  E-value=2.8e+02  Score=30.40  Aligned_cols=180  Identities=16%  Similarity=0.296  Sum_probs=83.4

Q ss_pred             eeecccccEeeEEeccCceee-eeeeeeccCCCEEEEEeCCCEEEEEECcCCccceEEEcCcc-cceeeeeeeeCCEEEE
Q 003800           23 LYEDQVGLMDWHQQYIGKVKH-AVFHTQKTGRKRVVVSTEENVIASLDLRHGEIFWRHVLGIN-DVVDGIDIALGKYVIT  100 (794)
Q Consensus        23 l~edqvG~~dW~~~~vG~~~~-~~f~~~~~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~-~~i~~l~~~~g~~~V~  100 (794)
                      ++....|-..|..-.+..+.. ..+.-....++.+++++..|.|+.=  .||-.-|+...... +.+.......++..|.
T Consensus        83 ll~T~DgG~tW~~v~l~~~lpgs~~~i~~l~~~~~~l~~~~G~iy~T--~DgG~tW~~~~~~~~gs~~~~~r~~dG~~va  160 (302)
T PF14870_consen   83 LLHTTDGGKTWERVPLSSKLPGSPFGITALGDGSAELAGDRGAIYRT--TDGGKTWQAVVSETSGSINDITRSSDGRYVA  160 (302)
T ss_dssp             EEEESSTTSS-EE----TT-SS-EEEEEEEETTEEEEEETT--EEEE--SSTTSSEEEEE-S----EEEEEE-TTS-EEE
T ss_pred             EEEecCCCCCcEEeecCCCCCCCeeEEEEcCCCcEEEEcCCCcEEEe--CCCCCCeeEcccCCcceeEeEEECCCCcEEE
Confidence            455555666687643221111 1111112235567777877766544  57888999877654 2333332234556777


Q ss_pred             EEccCCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEE-ECCEEEEEECCCCcEEEEEeccCccee-
Q 003800          101 LSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVS-SKGCLHAVSSIDGEILWTRDFAAESVE-  178 (794)
Q Consensus       101 Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG~~~W~~~~~~~~~~-  178 (794)
                      |+..|.-+..||.  |+--|+..-....  ..+..++    ...++.+.+. .+|.++.-+..+.-..|.......... 
T Consensus       161 vs~~G~~~~s~~~--G~~~w~~~~r~~~--~riq~~g----f~~~~~lw~~~~Gg~~~~s~~~~~~~~w~~~~~~~~~~~  232 (302)
T PF14870_consen  161 VSSRGNFYSSWDP--GQTTWQPHNRNSS--RRIQSMG----FSPDGNLWMLARGGQIQFSDDPDDGETWSEPIIPIKTNG  232 (302)
T ss_dssp             EETTSSEEEEE-T--T-SS-EEEE--SS--S-EEEEE----E-TTS-EEEEETTTEEEEEE-TTEEEEE---B-TTSS--
T ss_pred             EECcccEEEEecC--CCccceEEccCcc--ceehhce----ecCCCCEEEEeCCcEEEEccCCCCccccccccCCcccCc
Confidence            7888877889975  9999986644221  1222222    1224556555 478888877555567788744322111 


Q ss_pred             --eeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeeee
Q 003800          179 --VQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHET  219 (794)
Q Consensus       179 --~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~  219 (794)
                        ...+.. ..++.+|+++..|.    ++ .. .+|=.-|+..
T Consensus       233 ~~~ld~a~-~~~~~~wa~gg~G~----l~-~S-~DgGktW~~~  268 (302)
T PF14870_consen  233 YGILDLAY-RPPNEIWAVGGSGT----LL-VS-TDGGKTWQKD  268 (302)
T ss_dssp             S-EEEEEE-SSSS-EEEEESTT-----EE-EE-SSTTSS-EE-
T ss_pred             eeeEEEEe-cCCCCEEEEeCCcc----EE-Ee-CCCCccceEC
Confidence              122221 35788998876662    22 12 3555668874


No 165
>KOG0285 consensus Pleiotropic regulator 1 [RNA processing and modification]
Probab=58.70  E-value=3e+02  Score=30.72  Aligned_cols=146  Identities=13%  Similarity=0.126  Sum_probs=79.0

Q ss_pred             EEEEeCCCEEEEEECcCCccceEEEcCcc-cceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCCcc
Q 003800           56 VVVSTEENVIASLDLRHGEIFWRHVLGIN-DVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLL  134 (794)
Q Consensus        56 Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~-~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~  134 (794)
                      ...++.++.+.-.|..||++.=.  +..- ..+.++.+..-.--+|-.+.+++|-.||+++-+.+-++...-..+ ..+.
T Consensus       166 f~tgs~DrtikIwDlatg~Lklt--ltGhi~~vr~vavS~rHpYlFs~gedk~VKCwDLe~nkvIR~YhGHlS~V-~~L~  242 (460)
T KOG0285|consen  166 FATGSADRTIKIWDLATGQLKLT--LTGHIETVRGVAVSKRHPYLFSAGEDKQVKCWDLEYNKVIRHYHGHLSGV-YCLD  242 (460)
T ss_pred             EEecCCCceeEEEEcccCeEEEe--ecchhheeeeeeecccCceEEEecCCCeeEEEechhhhhHHHhcccccee-EEEe
Confidence            55667789999999999976433  3221 123344222222234435667899999999999988877653322 1122


Q ss_pred             ccccccccccCCeEEEE-E-CCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCC
Q 003800          135 LVPTNLKVDKDSLILVS-S-KGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNG  212 (794)
Q Consensus       135 ~~~~~~~~~~~~~V~V~-~-~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG  212 (794)
                      +.|       .-++++. + |.....-|..+-..+-...--  .....+++....+..||-.+.++    .+--.|...|
T Consensus       243 lhP-------Tldvl~t~grDst~RvWDiRtr~~V~~l~GH--~~~V~~V~~~~~dpqvit~S~D~----tvrlWDl~ag  309 (460)
T KOG0285|consen  243 LHP-------TLDVLVTGGRDSTIRVWDIRTRASVHVLSGH--TNPVASVMCQPTDPQVITGSHDS----TVRLWDLRAG  309 (460)
T ss_pred             ccc-------cceeEEecCCcceEEEeeecccceEEEecCC--CCcceeEEeecCCCceEEecCCc----eEEEeeeccC
Confidence            222       2344443 2 444444444444433333211  11123333223466777655555    5666687777


Q ss_pred             ceeee
Q 003800          213 ELLNH  217 (794)
Q Consensus       213 ~~~w~  217 (794)
                      +.+-.
T Consensus       310 kt~~t  314 (460)
T KOG0285|consen  310 KTMIT  314 (460)
T ss_pred             ceeEe
Confidence            76543


No 166
>KOG0646 consensus WD40 repeat protein [General function prediction only]
Probab=58.45  E-value=3.4e+02  Score=31.22  Aligned_cols=132  Identities=12%  Similarity=0.115  Sum_probs=68.0

Q ss_pred             EEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEE-EccCCeEEEEeCCC-------CcEeEEEeccCc
Q 003800           56 VVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITL-SSDGSTLRAWNLPD-------GQMVWESFLRGS  127 (794)
Q Consensus        56 Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~V-s~~g~~v~A~d~~t-------G~llWe~~l~~~  127 (794)
                      |+.+|.+|.||.--..||+.+=-..- ..-++..+. ..+++.+++ ++.++.|++|...+       +...=...+.+-
T Consensus        96 l~ag~i~g~lYlWelssG~LL~v~~a-HYQ~ITcL~-fs~dgs~iiTgskDg~V~vW~l~~lv~a~~~~~~~p~~~f~~H  173 (476)
T KOG0646|consen   96 LLAGTISGNLYLWELSSGILLNVLSA-HYQSITCLK-FSDDGSHIITGSKDGAVLVWLLTDLVSADNDHSVKPLHIFSDH  173 (476)
T ss_pred             EEeecccCcEEEEEeccccHHHHHHh-hccceeEEE-EeCCCcEEEecCCCccEEEEEEEeecccccCCCccceeeeccC
Confidence            55666899999999999987643311 112355553 345555555 45578899997632       111111111110


Q ss_pred             cccCCcccccccccccc---CCeEEEEE-CCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecC
Q 003800          128 KHSKPLLLVPTNLKVDK---DSLILVSS-KGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAG  198 (794)
Q Consensus       128 ~~s~~~~~~~~~~~~~~---~~~V~V~~-~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g  198 (794)
                          .+++...  ....   +..++-.+ |.++...|...|..+=+...|.+-.   .+....++..+|+.+-.|
T Consensus       174 ----tlsITDl--~ig~Gg~~~rl~TaS~D~t~k~wdlS~g~LLlti~fp~si~---av~lDpae~~~yiGt~~G  239 (476)
T KOG0646|consen  174 ----TLSITDL--QIGSGGTNARLYTASEDRTIKLWDLSLGVLLLTITFPSSIK---AVALDPAERVVYIGTEEG  239 (476)
T ss_pred             ----cceeEEE--EecCCCccceEEEecCCceEEEEEeccceeeEEEecCCcce---eEEEcccccEEEecCCcc
Confidence                1111110  1111   12333333 7777777888888777776665321   222123455666654444


No 167
>KOG1272 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=58.17  E-value=28  Score=39.60  Aligned_cols=179  Identities=11%  Similarity=0.101  Sum_probs=102.7

Q ss_pred             CCEEEEEeCCCEEEEEECcCCccceEEEcCccc-ceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccC
Q 003800           53 RKRVVVSTEENVIASLDLRHGEIFWRHVLGIND-VVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSK  131 (794)
Q Consensus        53 ~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~-~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~  131 (794)
                      +..+..+..+|.|+|||-.|+++.-...+.+.. .+.-+   ..+..+.|.. ...++-+| ..|.++=-..-..++.  
T Consensus       141 GrhlllgGrKGHlAa~Dw~t~~L~~Ei~v~Etv~Dv~~L---Hneq~~AVAQ-K~y~yvYD-~~GtElHClk~~~~v~--  213 (545)
T KOG1272|consen  141 GRHLLLGGRKGHLAAFDWVTKKLHFEINVMETVRDVTFL---HNEQFFAVAQ-KKYVYVYD-NNGTELHCLKRHIRVA--  213 (545)
T ss_pred             ccEEEecCCccceeeeecccceeeeeeehhhhhhhhhhh---cchHHHHhhh-hceEEEec-CCCcEEeehhhcCchh--
Confidence            334888889999999999999998888776551 11112   2233333333 45777777 4687776666555442  


Q ss_pred             CccccccccccccCCeEEEE--ECCEEEEEECCCCcEEEEEeccCcceeeeeEEEEec-CCEEEEEEecCCceeEEEEEE
Q 003800          132 PLLLVPTNLKVDKDSLILVS--SKGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDE-SDQIYVVGYAGSSQFHAYQIN  208 (794)
Q Consensus       132 ~~~~~~~~~~~~~~~~V~V~--~~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~-~~~vyv~~~~g~~~~~v~ald  208 (794)
                      -+.++|       -..+++.  ..|-|.-.|..+|+.+=+.....+.+   .++ ... -+.|.-+|..   ++.|.-..
T Consensus       214 rLeFLP-------yHfLL~~~~~~G~L~Y~DVS~GklVa~~~t~~G~~---~vm-~qNP~NaVih~Ghs---nGtVSlWS  279 (545)
T KOG1272|consen  214 RLEFLP-------YHFLLVAASEAGFLKYQDVSTGKLVASIRTGAGRT---DVM-KQNPYNAVIHLGHS---NGTVSLWS  279 (545)
T ss_pred             hhcccc-------hhheeeecccCCceEEEeechhhhhHHHHccCCcc---chh-hcCCccceEEEcCC---CceEEecC
Confidence            345555       3556664  37899999999999887776655443   111 011 1222223322   23676667


Q ss_pred             cCCCceeeeeeeecc-cCccCceEEEcCcEEEEEECCCCeEEEEEeec
Q 003800          209 AMNGELLNHETAAFS-GGFVGDVALVSSDTLVTLDTTRSILVTVSFKN  255 (794)
Q Consensus       209 ~~tG~~~w~~~v~~~-~~~s~~~~~vg~~~lv~~d~~~g~L~v~~l~s  255 (794)
                      +.+-+++-+.  -+. +.++ ++.+-.++.|.++......+.+-||..
T Consensus       280 P~skePLvKi--LcH~g~V~-siAv~~~G~YMaTtG~Dr~~kIWDlR~  324 (545)
T KOG1272|consen  280 PNSKEPLVKI--LCHRGPVS-SIAVDRGGRYMATTGLDRKVKIWDLRN  324 (545)
T ss_pred             CCCcchHHHH--HhcCCCcc-eEEECCCCcEEeecccccceeEeeecc
Confidence            7666655333  111 2222 233323334444444456788888876


No 168
>KOG4378 consensus Nuclear protein COP1 [Signal transduction mechanisms]
Probab=58.10  E-value=2.6e+02  Score=32.44  Aligned_cols=139  Identities=10%  Similarity=0.132  Sum_probs=73.7

Q ss_pred             EEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEE-EEEccCCeEEEEeCCCCcEeEEEecc-CccccCCc
Q 003800           56 VVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVI-TLSSDGSTLRAWNLPDGQMVWESFLR-GSKHSKPL  133 (794)
Q Consensus        56 Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V-~Vs~~g~~v~A~d~~tG~llWe~~l~-~~~~s~~~  133 (794)
                      |-.++..|.|.-.+.+||..-=.+..+.+..+..++....+..+ ...+++|.|..||...-.+...+.-. .... .++
T Consensus       136 iAsvs~gGdiiih~~~t~~~tt~f~~~sgqsvRll~ys~skr~lL~~asd~G~VtlwDv~g~sp~~~~~~~HsAP~-~gi  214 (673)
T KOG4378|consen  136 IASVSDGGDIIIHGTKTKQKTTTFTIDSGQSVRLLRYSPSKRFLLSIASDKGAVTLWDVQGMSPIFHASEAHSAPC-RGI  214 (673)
T ss_pred             eEEeccCCcEEEEecccCccccceecCCCCeEEEeecccccceeeEeeccCCeEEEEeccCCCcccchhhhccCCc-Ccc
Confidence            44455678888888888866544444433333344333333333 33456689999998655555443321 2122 444


Q ss_pred             cccccccccccCCeEEEE--ECCEEEEEECCCCcEE--EEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEc
Q 003800          134 LLVPTNLKVDKDSLILVS--SKGCLHAVSSIDGEIL--WTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINA  209 (794)
Q Consensus       134 ~~~~~~~~~~~~~~V~V~--~~g~l~ald~~tG~~~--W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~  209 (794)
                      .+.+.      ...++|.  .|.+++-+|..+-+..  -.|+.|      ...+.-...|.+.++|   .++++++++|.
T Consensus       215 cfsps------ne~l~vsVG~Dkki~~yD~~s~~s~~~l~y~~P------lstvaf~~~G~~L~aG---~s~G~~i~YD~  279 (673)
T KOG4378|consen  215 CFSPS------NEALLVSVGYDKKINIYDIRSQASTDRLTYSHP------LSTVAFSECGTYLCAG---NSKGELIAYDM  279 (673)
T ss_pred             eecCC------ccceEEEecccceEEEeecccccccceeeecCC------cceeeecCCceEEEee---cCCceEEEEec
Confidence            55541      3445553  4899999986543221  122222      1222112344444443   33448999997


Q ss_pred             C
Q 003800          210 M  210 (794)
Q Consensus       210 ~  210 (794)
                      .
T Consensus       280 R  280 (673)
T KOG4378|consen  280 R  280 (673)
T ss_pred             c
Confidence            5


No 169
>COG3386 Gluconolactonase [Carbohydrate transport and metabolism]
Probab=57.26  E-value=2.7e+02  Score=30.58  Aligned_cols=105  Identities=13%  Similarity=0.160  Sum_probs=52.8

Q ss_pred             EEEEEccCCeEEEEeCCCC-cEeEEEeccCccccCCccccccccccccCCeEEEEECCEEEEEECCCCcE--EEEEeccC
Q 003800           98 VITLSSDGSTLRAWNLPDG-QMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSSKGCLHAVSSIDGEI--LWTRDFAA  174 (794)
Q Consensus        98 ~V~Vs~~g~~v~A~d~~tG-~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~~g~l~ald~~tG~~--~W~~~~~~  174 (794)
                      ++++--.+++++.||+.+| ...|...-.-...    .      ..+ .+..++.....++.++.++|..  .+....+.
T Consensus        39 L~w~DI~~~~i~r~~~~~g~~~~~~~p~~~~~~----~------~~d-~~g~Lv~~~~g~~~~~~~~~~~~t~~~~~~~~  107 (307)
T COG3386          39 LLWVDILGGRIHRLDPETGKKRVFPSPGGFSSG----A------LID-AGGRLIACEHGVRLLDPDTGGKITLLAEPEDG  107 (307)
T ss_pred             EEEEeCCCCeEEEecCCcCceEEEECCCCcccc----e------eec-CCCeEEEEccccEEEeccCCceeEEeccccCC
Confidence            4555445789999999988 6667655432111    1      223 3334444444456666565654  44333222


Q ss_pred             ccee-eeeEEEEecCCEEEEEEec----C----CceeEEEEEEcCCCcee
Q 003800          175 ESVE-VQQVIQLDESDQIYVVGYA----G----SSQFHAYQINAMNGELL  215 (794)
Q Consensus       175 ~~~~-~~~~v~s~~~~~vyv~~~~----g----~~~~~v~ald~~tG~~~  215 (794)
                      .... +--.+ ...++.+|+....    +    .....|+-+|+. |...
T Consensus       108 ~~~~r~ND~~-v~pdG~~wfgt~~~~~~~~~~~~~~G~lyr~~p~-g~~~  155 (307)
T COG3386         108 LPLNRPNDGV-VDPDGRIWFGDMGYFDLGKSEERPTGSLYRVDPD-GGVV  155 (307)
T ss_pred             CCcCCCCcee-EcCCCCEEEeCCCccccCccccCCcceEEEEcCC-CCEE
Confidence            1110 10111 2356777765444    1    112467888873 4433


No 170
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=56.98  E-value=3.1e+02  Score=30.42  Aligned_cols=192  Identities=13%  Similarity=0.171  Sum_probs=92.8

Q ss_pred             EEEEEeCC-----C-EEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEc-c--CC--eEEEEeCCCCcEeEEEe
Q 003800           55 RVVVSTEE-----N-VIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSS-D--GS--TLRAWNLPDGQMVWESF  123 (794)
Q Consensus        55 ~Vyv~t~~-----g-~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~-~--g~--~v~A~d~~tG~llWe~~  123 (794)
                      .+|++|..     | .+.-||.++|++-=-+.....++..-+...-.+..+|+.. .  .+  ..+.||..+|++---.+
T Consensus         4 ~~YiGtyT~~~s~gI~v~~ld~~~g~l~~~~~v~~~~nptyl~~~~~~~~LY~v~~~~~~ggvaay~iD~~~G~Lt~ln~   83 (346)
T COG2706           4 TVYIGTYTKRESQGIYVFNLDTKTGELSLLQLVAELGNPTYLAVNPDQRHLYVVNEPGEEGGVAAYRIDPDDGRLTFLNR   83 (346)
T ss_pred             EEEEeeecccCCCceEEEEEeCcccccchhhhccccCCCceEEECCCCCEEEEEEecCCcCcEEEEEEcCCCCeEEEeec
Confidence            46777753     2 3666777777653333333333333332223444555421 2  23  35667777898876655


Q ss_pred             ccCccccCCccccccccccccCC-eEEEE--ECCEEEEEECC-CCcEEEE---EeccCcceeeee------EEEEe-cCC
Q 003800          124 LRGSKHSKPLLLVPTNLKVDKDS-LILVS--SKGCLHAVSSI-DGEILWT---RDFAAESVEVQQ------VIQLD-ESD  189 (794)
Q Consensus       124 l~~~~~s~~~~~~~~~~~~~~~~-~V~V~--~~g~l~ald~~-tG~~~W~---~~~~~~~~~~~~------~v~s~-~~~  189 (794)
                      ...+.  .++..+    .++.++ .|++.  ..|.+..+-.. +|.+.=.   .....+.--++|      ..... .+.
T Consensus        84 ~~~~g--~~p~yv----svd~~g~~vf~AnY~~g~v~v~p~~~dG~l~~~v~~~~h~g~~p~~rQ~~~h~H~a~~tP~~~  157 (346)
T COG2706          84 QTLPG--SPPCYV----SVDEDGRFVFVANYHSGSVSVYPLQADGSLQPVVQVVKHTGSGPHERQESPHVHSANFTPDGR  157 (346)
T ss_pred             cccCC--CCCeEE----EECCCCCEEEEEEccCceEEEEEcccCCccccceeeeecCCCCCCccccCCccceeeeCCCCC
Confidence            44322  122233    334344 56665  36777777664 4654321   111111000111      11112 233


Q ss_pred             EEEEEEecCCceeEEEEEEcCCCceeeeeeeecccCccCceEE--EcCcEEEEEECCCCeEEEEEeec
Q 003800          190 QIYVVGYAGSSQFHAYQINAMNGELLNHETAAFSGGFVGDVAL--VSSDTLVTLDTTRSILVTVSFKN  255 (794)
Q Consensus       190 ~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~~~~s~~~~~--vg~~~lv~~d~~~g~L~v~~l~s  255 (794)
                      .+++..+ |.+  +++.+++..|...-......+.+--...++  ..+.+.+|+..-++.+-+.....
T Consensus       158 ~l~v~DL-G~D--ri~~y~~~dg~L~~~~~~~v~~G~GPRHi~FHpn~k~aY~v~EL~stV~v~~y~~  222 (346)
T COG2706         158 YLVVPDL-GTD--RIFLYDLDDGKLTPADPAEVKPGAGPRHIVFHPNGKYAYLVNELNSTVDVLEYNP  222 (346)
T ss_pred             EEEEeec-CCc--eEEEEEcccCccccccccccCCCCCcceEEEcCCCcEEEEEeccCCEEEEEEEcC
Confidence            4454333 333  566666668887754443333222112222  25567777766677777777766


No 171
>KOG0265 consensus U5 snRNP-specific protein-like factor and related proteins [RNA processing and modification]
Probab=55.16  E-value=3.1e+02  Score=29.86  Aligned_cols=33  Identities=18%  Similarity=0.239  Sum_probs=24.8

Q ss_pred             CCEEEEEEccCCeEEEEeCCCCcEeEEEeccCc
Q 003800           95 GKYVITLSSDGSTLRAWNLPDGQMVWESFLRGS  127 (794)
Q Consensus        95 g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~  127 (794)
                      ++..++-.+.+.+|++||+++|+..-+.+....
T Consensus       101 d~s~i~S~gtDk~v~~wD~~tG~~~rk~k~h~~  133 (338)
T KOG0265|consen  101 DGSHILSCGTDKTVRGWDAETGKRIRKHKGHTS  133 (338)
T ss_pred             CCCEEEEecCCceEEEEecccceeeehhccccc
Confidence            334444344567999999999999999888764


No 172
>KOG0295 consensus WD40 repeat-containing protein [Function unknown]
Probab=54.94  E-value=1.6e+02  Score=32.83  Aligned_cols=65  Identities=14%  Similarity=0.218  Sum_probs=42.4

Q ss_pred             EEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEE--ECCEEEEEECCCCcEEEE
Q 003800           97 YVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVS--SKGCLHAVSSIDGEILWT  169 (794)
Q Consensus        97 ~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~--~~g~l~ald~~tG~~~W~  169 (794)
                      ...+.++.++.++.||..+|..+-+......-. .+..+-+       ++..++.  .|+.|...|.++++..=.
T Consensus       305 ~~l~s~SrDktIk~wdv~tg~cL~tL~ghdnwV-r~~af~p-------~Gkyi~ScaDDktlrvwdl~~~~cmk~  371 (406)
T KOG0295|consen  305 QVLGSGSRDKTIKIWDVSTGMCLFTLVGHDNWV-RGVAFSP-------GGKYILSCADDKTLRVWDLKNLQCMKT  371 (406)
T ss_pred             cEEEeecccceEEEEeccCCeEEEEEeccccee-eeeEEcC-------CCeEEEEEecCCcEEEEEeccceeeec
Confidence            344545567899999999999999887765433 2222222       3433332  488888888877765433


No 173
>KOG0288 consensus WD40 repeat protein TipD [General function prediction only]
Probab=54.87  E-value=3.3e+02  Score=30.85  Aligned_cols=185  Identities=11%  Similarity=0.064  Sum_probs=90.3

Q ss_pred             CCEEEEEeCCCEEEEEECcCCccceEEEcCcc-cceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccC
Q 003800           53 RKRVVVSTEENVIASLDLRHGEIFWRHVLGIN-DVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSK  131 (794)
Q Consensus        53 ~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~-~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~  131 (794)
                      ...++.+|.++.+.--|..++++.  +.|... +.+..+....+...|+-++.+.++--||...+.-.=+.. ..+.+ .
T Consensus       231 ~~~~iAas~d~~~r~Wnvd~~r~~--~TLsGHtdkVt~ak~~~~~~~vVsgs~DRtiK~WDl~k~~C~kt~l-~~S~c-n  306 (459)
T KOG0288|consen  231 NKHVIAASNDKNLRLWNVDSLRLR--HTLSGHTDKVTAAKFKLSHSRVVSGSADRTIKLWDLQKAYCSKTVL-PGSQC-N  306 (459)
T ss_pred             CceEEeecCCCceeeeeccchhhh--hhhcccccceeeehhhccccceeeccccchhhhhhhhhhheecccc-ccccc-c
Confidence            444777777777666665555443  223221 112222112233322213346778888876533221111 11110 0


Q ss_pred             CccccccccccccCCeEEEE--ECCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEc
Q 003800          132 PLLLVPTNLKVDKDSLILVS--SKGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINA  209 (794)
Q Consensus       132 ~~~~~~~~~~~~~~~~V~V~--~~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~  209 (794)
                      +         +......++.  .+++|...|..++..+-+.+..+..   ..+-.+.++..+...+-+.    .+-.+|.
T Consensus       307 D---------I~~~~~~~~SgH~DkkvRfwD~Rs~~~~~sv~~gg~v---tSl~ls~~g~~lLsssRDd----tl~viDl  370 (459)
T KOG0288|consen  307 D---------IVCSISDVISGHFDKKVRFWDIRSADKTRSVPLGGRV---TSLDLSMDGLELLSSSRDD----TLKVIDL  370 (459)
T ss_pred             c---------eEecceeeeecccccceEEEeccCCceeeEeecCcce---eeEeeccCCeEEeeecCCC----ceeeeec
Confidence            0         0101111111  2788999998888888887765521   1221112233333222222    4556677


Q ss_pred             CCCceeeeeeeec---ccCccCceEEEcCcEEEEEECCCCeEEEEEeeccee
Q 003800          210 MNGELLNHETAAF---SGGFVGDVALVSSDTLVTLDTTRSILVTVSFKNRKI  258 (794)
Q Consensus       210 ~tG~~~w~~~v~~---~~~~s~~~~~vg~~~lv~~d~~~g~L~v~~l~sg~~  258 (794)
                      .+-++.-.++...   .++.+..++-.++.++++. +.+|++++=++.+|++
T Consensus       371 Rt~eI~~~~sA~g~k~asDwtrvvfSpd~~YvaAG-S~dgsv~iW~v~tgKl  421 (459)
T KOG0288|consen  371 RTKEIRQTFSAEGFKCASDWTRVVFSPDGSYVAAG-SADGSVYIWSVFTGKL  421 (459)
T ss_pred             ccccEEEEeeccccccccccceeEECCCCceeeec-cCCCcEEEEEccCceE
Confidence            6666665554222   2333333333354555555 4679999999999884


No 174
>PF04841 Vps16_N:  Vps16, N-terminal region;  InterPro: IPR006926 This protein forms part of the Class C vacuolar protein sorting (Vps) complex. Vps16 is essential for vacuolar protein sorting, which is essential for viability in plants, but not yeast. The Class C Vps complex is required for SNARE-mediated membrane fusion at the lysosome-like yeast vacuole. It is thought to play essential roles in membrane docking and fusion at the Golgi-to-endosome and endosome-to-vacuole stages of transport. The role of VPS16 in this complex is not known.; GO: 0006886 intracellular protein transport, 0005737 cytoplasm
Probab=54.65  E-value=3.7e+02  Score=30.62  Aligned_cols=98  Identities=16%  Similarity=0.165  Sum_probs=59.1

Q ss_pred             EEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCCccccccc--ccc
Q 003800           65 IASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTN--LKV  142 (794)
Q Consensus        65 l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~--~~~  142 (794)
                      |.-.| .+|+++|+...+. +.+.++.-...+.+|+|..+ |.++-+|.. |..  ++.+..+..  ...+....  ...
T Consensus        63 I~iys-~sG~ll~~i~w~~-~~iv~~~wt~~e~LvvV~~d-G~v~vy~~~-G~~--~fsl~~~i~--~~~v~e~~i~~~~  134 (410)
T PF04841_consen   63 IQIYS-SSGKLLSSIPWDS-GRIVGMGWTDDEELVVVQSD-GTVRVYDLF-GEF--QFSLGEEIE--EEKVLECRIFAIW  134 (410)
T ss_pred             EEEEC-CCCCEeEEEEECC-CCEEEEEECCCCeEEEEEcC-CEEEEEeCC-Cce--eechhhhcc--ccCcccccccccc
Confidence            55555 4799999988877 34444433567788888876 589999975 777  666554321  11111100  011


Q ss_pred             ccCCeEEEE-ECCEEEEEECCCCcEEEEE
Q 003800          143 DKDSLILVS-SKGCLHAVSSIDGEILWTR  170 (794)
Q Consensus       143 ~~~~~V~V~-~~g~l~ald~~tG~~~W~~  170 (794)
                      ..+..++++ .+++++.++.-+...+|+.
T Consensus       135 ~~~~GivvLt~~~~~~~v~n~~~~~~~~~  163 (410)
T PF04841_consen  135 FYKNGIVVLTGNNRFYVVNNIDEPVKLRR  163 (410)
T ss_pred             cCCCCEEEECCCCeEEEEeCccccchhhc
Confidence            222446665 5888999976665555553


No 175
>KOG0274 consensus Cdc4 and related F-box and WD-40 proteins [General function prediction only]
Probab=53.80  E-value=4.5e+02  Score=31.27  Aligned_cols=180  Identities=13%  Similarity=0.135  Sum_probs=98.8

Q ss_pred             CCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEE-ccCCeEEEEeCCCCcEeEEEeccCccccC
Q 003800           53 RKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLS-SDGSTLRAWNLPDGQMVWESFLRGSKHSK  131 (794)
Q Consensus        53 ~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs-~~g~~v~A~d~~tG~llWe~~l~~~~~s~  131 (794)
                      ++.++.++.+..+--=|.+||+-.=-...- ...+..+  ...+ .+.+| +.+.+|++||..+|+.+=-.......+  
T Consensus       261 ~~~lvsgS~D~t~rvWd~~sg~C~~~l~gh-~stv~~~--~~~~-~~~~sgs~D~tVkVW~v~n~~~l~l~~~h~~~V--  334 (537)
T KOG0274|consen  261 GDKLVSGSTDKTERVWDCSTGECTHSLQGH-TSSVRCL--TIDP-FLLVSGSRDNTVKVWDVTNGACLNLLRGHTGPV--  334 (537)
T ss_pred             CCEEEEEecCCcEEeEecCCCcEEEEecCC-CceEEEE--EccC-ceEeeccCCceEEEEeccCcceEEEeccccccE--
Confidence            556666776777766666676543222211 1112111  2333 34444 457899999999999886665433221  


Q ss_pred             CccccccccccccCCeEEEE-ECCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecC-CEEEEEEecCCceeEEEEEEc
Q 003800          132 PLLLVPTNLKVDKDSLILVS-SKGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDES-DQIYVVGYAGSSQFHAYQINA  209 (794)
Q Consensus       132 ~~~~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~-~~vyv~~~~g~~~~~v~ald~  209 (794)
                        -.+    ..+ .+.++.. .+|.+..-|..+|+.+=+.+.-...  .+.+.  .++ ..+|-.+.++    .+-+.|+
T Consensus       335 --~~v----~~~-~~~lvsgs~d~~v~VW~~~~~~cl~sl~gH~~~--V~sl~--~~~~~~~~Sgs~D~----~IkvWdl  399 (537)
T KOG0274|consen  335 --NCV----QLD-EPLLVSGSYDGTVKVWDPRTGKCLKSLSGHTGR--VYSLI--VDSENRLLSGSLDT----TIKVWDL  399 (537)
T ss_pred             --EEE----Eec-CCEEEEEecCceEEEEEhhhceeeeeecCCcce--EEEEE--ecCcceEEeeeecc----ceEeecC
Confidence              111    111 3334444 3888888888888877666543222  22222  234 6666656664    6778888


Q ss_pred             CCC-ceeeeeeeecccCccCceEEEcCcEEEEEECCCCeEEEEEeecce
Q 003800          210 MNG-ELLNHETAAFSGGFVGDVALVSSDTLVTLDTTRSILVTVSFKNRK  257 (794)
Q Consensus       210 ~tG-~~~w~~~v~~~~~~s~~~~~vg~~~lv~~d~~~g~L~v~~l~sg~  257 (794)
                      .++ +.+-...  .+..+. ..+..-++.+++... .+.+++-|.++++
T Consensus       400 ~~~~~c~~tl~--~h~~~v-~~l~~~~~~Lvs~~a-D~~Ik~WD~~~~~  444 (537)
T KOG0274|consen  400 RTKRKCIHTLQ--GHTSLV-SSLLLRDNFLVSSSA-DGTIKLWDAEEGE  444 (537)
T ss_pred             Cchhhhhhhhc--CCcccc-cccccccceeEeccc-cccEEEeecccCc
Confidence            887 3332221  122222 122234566776654 4678888888877


No 176
>KOG3881 consensus Uncharacterized conserved protein [Function unknown]
Probab=53.06  E-value=24  Score=39.22  Aligned_cols=73  Identities=14%  Similarity=0.247  Sum_probs=50.3

Q ss_pred             CCCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEec
Q 003800           52 GRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFL  124 (794)
Q Consensus        52 ~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l  124 (794)
                      +.+.||+++..|.|+.+|.++|+..=+.-=+-.+++.+++...+..++.-+|-++.||-+|..+-+++=...+
T Consensus       258 ~gn~Iy~gn~~g~l~~FD~r~~kl~g~~~kg~tGsirsih~hp~~~~las~GLDRyvRIhD~ktrkll~kvYv  330 (412)
T KOG3881|consen  258 SGNFIYTGNTKGQLAKFDLRGGKLLGCGLKGITGSIRSIHCHPTHPVLASCGLDRYVRIHDIKTRKLLHKVYV  330 (412)
T ss_pred             CCcEEEEecccchhheecccCceeeccccCCccCCcceEEEcCCCceEEeeccceeEEEeecccchhhhhhhh
Confidence            4677999999999999999999876553222234455554333444555456679999999998666544433


No 177
>KOG0280 consensus Uncharacterized conserved protein [Amino acid transport and metabolism]
Probab=53.03  E-value=28  Score=37.42  Aligned_cols=73  Identities=12%  Similarity=0.199  Sum_probs=48.9

Q ss_pred             CCEEEEEeCCCEEEEEECc-CCccceEEEcCcc-c--ceeeeeeeeCCEEEEEEccCCeEEEEeCC-CCcEeEEEeccCc
Q 003800           53 RKRVVVSTEENVIASLDLR-HGEIFWRHVLGIN-D--VVDGIDIALGKYVITLSSDGSTLRAWNLP-DGQMVWESFLRGS  127 (794)
Q Consensus        53 ~~~Vyv~t~~g~l~ALn~~-tG~ivWR~~l~~~-~--~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~-tG~llWe~~l~~~  127 (794)
                      .+.||.+++++.+.+.|.| -++-+|+...-.. +  .|..-  ......++.|+.+..++.||.. -|+++.+....++
T Consensus       178 pnlvytGgDD~~l~~~D~R~p~~~i~~n~kvH~~GV~SI~ss--~~~~~~I~TGsYDe~i~~~DtRnm~kPl~~~~v~GG  255 (339)
T KOG0280|consen  178 PNLVYTGGDDGSLSCWDIRIPKTFIWHNSKVHTSGVVSIYSS--PPKPTYIATGSYDECIRVLDTRNMGKPLFKAKVGGG  255 (339)
T ss_pred             CceEEecCCCceEEEEEecCCcceeeecceeeecceEEEecC--CCCCceEEEeccccceeeeehhcccCccccCccccc
Confidence            4679999999999999999 8888998433221 1  12111  1123355657788899999987 5666655554443


No 178
>KOG0294 consensus WD40 repeat-containing protein [Function unknown]
Probab=52.73  E-value=3.5e+02  Score=29.71  Aligned_cols=186  Identities=14%  Similarity=0.208  Sum_probs=95.5

Q ss_pred             eeeeeccCCCEEEEEeCCCEEEEEECcCCccceEEEcC---cccceeeeeee-eCCEEEEEEccCCeEEEEeCCCCcEeE
Q 003800           45 VFHTQKTGRKRVVVSTEENVIASLDLRHGEIFWRHVLG---INDVVDGIDIA-LGKYVITLSSDGSTLRAWNLPDGQMVW  120 (794)
Q Consensus        45 ~f~~~~~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~---~~~~i~~l~~~-~g~~~V~Vs~~g~~v~A~d~~tG~llW  120 (794)
                      .|..|.. ...++-++++|.+...+..+    |.-.=.   ..+.+..+.+. .++-.+.|+++ ..+|.||.-+|+.-.
T Consensus        90 ~F~~~~S-~shLlS~sdDG~i~iw~~~~----W~~~~slK~H~~~Vt~lsiHPS~KLALsVg~D-~~lr~WNLV~Gr~a~  163 (362)
T KOG0294|consen   90 KFYPPLS-KSHLLSGSDDGHIIIWRVGS----WELLKSLKAHKGQVTDLSIHPSGKLALSVGGD-QVLRTWNLVRGRVAF  163 (362)
T ss_pred             EecCCcc-hhheeeecCCCcEEEEEcCC----eEEeeeecccccccceeEecCCCceEEEEcCC-ceeeeehhhcCccce
Confidence            4554432 34689999999999988766    632211   11223333222 46667787875 599999999999998


Q ss_pred             EEeccCccccCCccccccccccccCCeEEE-EECCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCC
Q 003800          121 ESFLRGSKHSKPLLLVPTNLKVDKDSLILV-SSKGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGS  199 (794)
Q Consensus       121 e~~l~~~~~s~~~~~~~~~~~~~~~~~V~V-~~~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~  199 (794)
                      -.++.....  -....+       .++-|+ .....+-.+-..+-++.=+...+...    -++.-...+.+++.+-++ 
T Consensus       164 v~~L~~~at--~v~w~~-------~Gd~F~v~~~~~i~i~q~d~A~v~~~i~~~~r~----l~~~~l~~~~L~vG~d~~-  229 (362)
T KOG0294|consen  164 VLNLKNKAT--LVSWSP-------QGDHFVVSGRNKIDIYQLDNASVFREIENPKRI----LCATFLDGSELLVGGDNE-  229 (362)
T ss_pred             eeccCCcce--eeEEcC-------CCCEEEEEeccEEEEEecccHhHhhhhhccccc----eeeeecCCceEEEecCCc-
Confidence            888876432  111111       233222 23333322222222222111111101    111112455566533232 


Q ss_pred             ceeEEEEEEcCCCceeeeeeeecccCccCceEEEcC---cEEEEEECCCCeEEEEEeecc
Q 003800          200 SQFHAYQINAMNGELLNHETAAFSGGFVGDVALVSS---DTLVTLDTTRSILVTVSFKNR  256 (794)
Q Consensus       200 ~~~~v~ald~~tG~~~w~~~v~~~~~~s~~~~~vg~---~~lv~~d~~~g~L~v~~l~sg  256 (794)
                         .+..+|..++.+...... -+..+-+ ++.+.+   .+++.+. +.|.+.+=|+...
T Consensus       230 ---~i~~~D~ds~~~~~~~~A-H~~RVK~-i~~~~~~~~~~lvTaS-SDG~I~vWd~~~~  283 (362)
T KOG0294|consen  230 ---WISLKDTDSDTPLTEFLA-HENRVKD-IASYTNPEHEYLVTAS-SDGFIKVWDIDME  283 (362)
T ss_pred             ---eEEEeccCCCccceeeec-chhheee-eEEEecCCceEEEEec-cCceEEEEEcccc
Confidence               788889888777665531 1222222 222222   2555554 4688877777654


No 179
>PF06977 SdiA-regulated:  SdiA-regulated;  InterPro: IPR009722 This entry represents a conserved region approximately 100 residues long within a number of hypothetical bacterial proteins that may be regulated by SdiA, a member of the LuxR family of transcriptional regulators. Some proteins contain the IPR001258 from INTERPRO repeat.; PDB: 3QQZ_A.
Probab=50.41  E-value=3.4e+02  Score=28.83  Aligned_cols=187  Identities=15%  Similarity=0.177  Sum_probs=81.1

Q ss_pred             CCCEEEEEeC-CCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEcc-CCeEEEEeCCC--CcEeE----EEe
Q 003800           52 GRKRVVVSTE-ENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSD-GSTLRAWNLPD--GQMVW----ESF  123 (794)
Q Consensus        52 ~~~~Vyv~t~-~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~-g~~v~A~d~~t--G~llW----e~~  123 (794)
                      +.+++|+.++ .+.|+.||. +|+++-|..+...+..-++. ..+++.++++.. .+.++.++..+  ..+-=    +..
T Consensus        32 d~~tLfaV~d~~~~i~els~-~G~vlr~i~l~g~~D~EgI~-y~g~~~~vl~~Er~~~L~~~~~~~~~~~~~~~~~~~~~  109 (248)
T PF06977_consen   32 DTGTLFAVQDEPGEIYELSL-DGKVLRRIPLDGFGDYEGIT-YLGNGRYVLSEERDQRLYIFTIDDDTTSLDRADVQKIS  109 (248)
T ss_dssp             TTTEEEEEETTTTEEEEEET-T--EEEEEE-SS-SSEEEEE-E-STTEEEEEETTTTEEEEEEE----TT--EEEEEEEE
T ss_pred             CCCeEEEEECCCCEEEEEcC-CCCEEEEEeCCCCCCceeEE-EECCCEEEEEEcCCCcEEEEEEeccccccchhhceEEe
Confidence            4677888776 589999996 79999999997654444553 356666666553 56787777632  22111    111


Q ss_pred             ccCcc-ccCCcccccccccccc-CCeEEEEE---CCEEEEEEC--CCCcEEEEEec--cCcce---eeeeEEEEecCCEE
Q 003800          124 LRGSK-HSKPLLLVPTNLKVDK-DSLILVSS---KGCLHAVSS--IDGEILWTRDF--AAESV---EVQQVIQLDESDQI  191 (794)
Q Consensus       124 l~~~~-~s~~~~~~~~~~~~~~-~~~V~V~~---~g~l~ald~--~tG~~~W~~~~--~~~~~---~~~~~v~s~~~~~v  191 (794)
                      +.... .-.+.+-+    +.+. .+.+++..   -..++.++.  ...........  .....   .+..+..-...+.+
T Consensus       110 l~~~~~~N~G~EGl----a~D~~~~~L~v~kE~~P~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~d~S~l~~~p~t~~l  185 (248)
T PF06977_consen  110 LGFPNKGNKGFEGL----AYDPKTNRLFVAKERKPKRLYEVNGFPGGFDLFVSDDQDLDDDKLFVRDLSGLSYDPRTGHL  185 (248)
T ss_dssp             ---S---SS--EEE----EEETTTTEEEEEEESSSEEEEEEESTT-SS--EEEE-HHHH-HT--SS---EEEEETTTTEE
T ss_pred             cccccCCCcceEEE----EEcCCCCEEEEEeCCCChhhEEEccccCccceeeccccccccccceeccccceEEcCCCCeE
Confidence            11110 00111111    1222 34555553   345777775  22222222211  11110   11122212346678


Q ss_pred             EEEEecCCceeEEEEEEcCCCceeeeeeeecc-cCccCceEEEcCcEEEEEECCCCeEEEEE
Q 003800          192 YVVGYAGSSQFHAYQINAMNGELLNHETAAFS-GGFVGDVALVSSDTLVTLDTTRSILVTVS  252 (794)
Q Consensus       192 yv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~-~~~s~~~~~vg~~~lv~~d~~~g~L~v~~  252 (794)
                      |+++....   .+..+| .+|+++....+... .++...   +-+..=+|.|. +|.|++..
T Consensus       186 liLS~es~---~l~~~d-~~G~~~~~~~L~~g~~gl~~~---~~QpEGIa~d~-~G~LYIvs  239 (248)
T PF06977_consen  186 LILSDESR---LLLELD-RQGRVVSSLSLDRGFHGLSKD---IPQPEGIAFDP-DGNLYIVS  239 (248)
T ss_dssp             EEEETTTT---EEEEE--TT--EEEEEE-STTGGG-SS------SEEEEEE-T-T--EEEEE
T ss_pred             EEEECCCC---eEEEEC-CCCCEEEEEEeCCcccCcccc---cCCccEEEECC-CCCEEEEc
Confidence            88775554   778888 67887766654332 121111   11223356664 46665543


No 180
>COG3419 PilY1 Tfp pilus assembly protein, tip-associated adhesin PilY1 [Cell motility and secretion / Intracellular trafficking and secretion]
Probab=49.91  E-value=2.4e+02  Score=35.65  Aligned_cols=27  Identities=15%  Similarity=0.276  Sum_probs=24.0

Q ss_pred             EEEEEEccCCeEEEEeCCCCcEeEEEe
Q 003800           97 YVITLSSDGSTLRAWNLPDGQMVWESF  123 (794)
Q Consensus        97 ~~V~Vs~~g~~v~A~d~~tG~llWe~~  123 (794)
                      -+|+|+..++++.+||+.+|.++.-+-
T Consensus       583 ~~VyvgandGmLhaFd~~tG~E~fA~~  609 (1036)
T COG3419         583 PVVYVGANDGMLHAFDANTGSERFAYV  609 (1036)
T ss_pred             ceEEEecCCceeeeccCCccceeeecC
Confidence            478889888999999999999998765


No 181
>PF09910 DUF2139:  Uncharacterized protein conserved in archaea (DUF2139);  InterPro: IPR016675 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=48.84  E-value=1.2e+02  Score=32.90  Aligned_cols=98  Identities=12%  Similarity=0.232  Sum_probs=64.2

Q ss_pred             CCEEEEEECCCCcE--EEEEeccCcce---eeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeeeeeecccCcc
Q 003800          153 KGCLHAVSSIDGEI--LWTRDFAAESV---EVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHETAAFSGGFV  227 (794)
Q Consensus       153 ~g~l~ald~~tG~~--~W~~~~~~~~~---~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~~~~s  227 (794)
                      =..|+.+|.+++++  +|+.....+.-   +...+++-.-.+.+++.=.+|..++-|+.+|..+|+..+-....++.   
T Consensus        77 YSHVH~yd~e~~~VrLLWkesih~~~~WaGEVSdIlYdP~~D~LLlAR~DGh~nLGvy~ldr~~g~~~~L~~~ps~K---  153 (339)
T PF09910_consen   77 YSHVHEYDTENDSVRLLWKESIHDKTKWAGEVSDILYDPYEDRLLLARADGHANLGVYSLDRRTGKAEKLSSNPSLK---  153 (339)
T ss_pred             cceEEEEEcCCCeEEEEEecccCCccccccchhheeeCCCcCEEEEEecCCcceeeeEEEcccCCceeeccCCCCcC---
Confidence            56899999999974  69876654321   23345544567888887778888899999999999998766433332   


Q ss_pred             CceEEEcCcEEEEEEC-----CCCeEEEEEeecce
Q 003800          228 GDVALVSSDTLVTLDT-----TRSILVTVSFKNRK  257 (794)
Q Consensus       228 ~~~~~vg~~~lv~~d~-----~~g~L~v~~l~sg~  257 (794)
                      +..+.    -.+|.+-     ....++++||.+|+
T Consensus       154 G~~~~----D~a~F~i~~~~~g~~~i~~~Dli~~~  184 (339)
T PF09910_consen  154 GTLVH----DYACFGINNFHKGVSGIHCLDLISGK  184 (339)
T ss_pred             ceEee----eeEEEeccccccCCceEEEEEccCCe
Confidence            21111    1223322     12358888888887


No 182
>PF01453 B_lectin:  D-mannose binding lectin;  InterPro: IPR001480 A bulb lectin super-family (Amaryllidaceae, Orchidaceae and Alliaceae) contains a ~115-residue-long domain whose overall three dimensional fold is very similar to that of [, ]:  Dictyostelium discoideum comitin, an actin binding protein Curculigo latifolia curculin, a sweet tasting and taste-modifying protein   This domain generally binds mannose, but in at least one protein, curculin, it is apparently devoid of mannose-binding activity.  Each bulb-type lectin domain consists of three sequential beta-sheet subdomains (I, II, III) that are inter-related by pseudo three-fold symmetry. The three subdomains are flat four-stranded, antiparallel beta-sheets. Together they form a 12-stranded beta-barrel in which the barrel axis coincides with the pseudo 3-fold axis.; GO: 0005529 sugar binding; PDB: 3M7H_A 3M7J_B 3MEZ_D 1DLP_A 1BWU_D 1KJ1_A 1B2P_A 1XD6_A 2DPF_C 2D04_B ....
Probab=48.75  E-value=1.2e+02  Score=27.80  Aligned_cols=60  Identities=23%  Similarity=0.528  Sum_probs=38.0

Q ss_pred             CCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEEECCEEEEEECCCCcEEEEE
Q 003800           95 GKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSSKGCLHAVSSIDGEILWTR  170 (794)
Q Consensus        95 g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~~g~l~ald~~tG~~~W~~  170 (794)
                      +...+.+..+ +.+..+|.. |+.+|.........     . .       ...+....+|.|..+| .+|+++|+-
T Consensus        19 ~~~~L~l~~d-GnLvl~~~~-~~~iWss~~t~~~~-----~-~-------~~~~~L~~~GNlvl~d-~~~~~lW~S   78 (114)
T PF01453_consen   19 GNYTLILQSD-GNLVLYDSN-GSVIWSSNNTSGRG-----N-S-------GCYLVLQDDGNLVLYD-SSGNVLWQS   78 (114)
T ss_dssp             TTEEEEEETT-SEEEEEETT-TEEEEE--S-TTSS-------S-------SEEEEEETTSEEEEEE-TTSEEEEES
T ss_pred             ccccceECCC-CeEEEEcCC-CCEEEEecccCCcc-----c-c-------CeEEEEeCCCCEEEEe-ecceEEEee
Confidence            5667777876 478888865 88899983222110     0 0       1122233588888888 699999986


No 183
>KOG0263 consensus Transcription initiation factor TFIID, subunit TAF5 (also component of histone acetyltransferase SAGA) [Transcription]
Probab=48.64  E-value=1.4e+02  Score=36.07  Aligned_cols=63  Identities=16%  Similarity=0.156  Sum_probs=37.7

Q ss_pred             EccCCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEE--ECCEEEEEECCCCcEEEEEec
Q 003800          102 SSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVS--SKGCLHAVSSIDGEILWTRDF  172 (794)
Q Consensus       102 s~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~--~~g~l~ald~~tG~~~W~~~~  172 (794)
                      |+.+.+||.||..+|..+--+......+    ..+.    ....+.-++.  .+|.+---|..+|+++=+...
T Consensus       553 GSsD~tVRlWDv~~G~~VRiF~GH~~~V----~al~----~Sp~Gr~LaSg~ed~~I~iWDl~~~~~v~~l~~  617 (707)
T KOG0263|consen  553 GSSDRTVRLWDVSTGNSVRIFTGHKGPV----TALA----FSPCGRYLASGDEDGLIKIWDLANGSLVKQLKG  617 (707)
T ss_pred             CCCCceEEEEEcCCCcEEEEecCCCCce----EEEE----EcCCCceEeecccCCcEEEEEcCCCcchhhhhc
Confidence            5557899999999999987776554332    1111    1112333332  267777777777766655433


No 184
>PRK01742 tolB translocation protein TolB; Provisional
Probab=48.41  E-value=4.6e+02  Score=29.84  Aligned_cols=144  Identities=11%  Similarity=0.055  Sum_probs=67.2

Q ss_pred             cCCCEEEEEeC---CCEEEEEECcCCccceEEEcCcccceeeeee-eeCCEEEEEEccCC--eEEEEeCCCCcEeEEEec
Q 003800           51 TGRKRVVVSTE---ENVIASLDLRHGEIFWRHVLGINDVVDGIDI-ALGKYVITLSSDGS--TLRAWNLPDGQMVWESFL  124 (794)
Q Consensus        51 ~~~~~Vyv~t~---~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~-~~g~~~V~Vs~~g~--~v~A~d~~tG~llWe~~l  124 (794)
                      +++++++.++.   ...|+.+|.++|+..--..+...  ...... ..|+.+++.+..++  .++.||..+|.+. +...
T Consensus       213 PDG~~la~~s~~~~~~~i~i~dl~tg~~~~l~~~~g~--~~~~~wSPDG~~La~~~~~~g~~~Iy~~d~~~~~~~-~lt~  289 (429)
T PRK01742        213 PDGSKLAYVSFENKKSQLVVHDLRSGARKVVASFRGH--NGAPAFSPDGSRLAFASSKDGVLNIYVMGANGGTPS-QLTS  289 (429)
T ss_pred             CCCCEEEEEEecCCCcEEEEEeCCCCceEEEecCCCc--cCceeECCCCCEEEEEEecCCcEEEEEEECCCCCeE-eecc
Confidence            34555655543   24799999999975322222221  111111 23445555443222  5788888777643 1111


Q ss_pred             cCccccCCccccccccccccCCeEEEEE--CC--EEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCCc
Q 003800          125 RGSKHSKPLLLVPTNLKVDKDSLILVSS--KG--CLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSS  200 (794)
Q Consensus       125 ~~~~~s~~~~~~~~~~~~~~~~~V~V~~--~g--~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~  200 (794)
                      ..... ..+...     .+ +..+++.+  +|  .++.++..+|..... . ... .   ....+.++..+++.+..   
T Consensus       290 ~~~~~-~~~~wS-----pD-G~~i~f~s~~~g~~~I~~~~~~~~~~~~l-~-~~~-~---~~~~SpDG~~ia~~~~~---  353 (429)
T PRK01742        290 GAGNN-TEPSWS-----PD-GQSILFTSDRSGSPQVYRMSASGGGASLV-G-GRG-Y---SAQISADGKTLVMINGD---  353 (429)
T ss_pred             CCCCc-CCEEEC-----CC-CCEEEEEECCCCCceEEEEECCCCCeEEe-c-CCC-C---CccCCCCCCEEEEEcCC---
Confidence            11111 111121     22 23344332  22  677777666654432 1 111 1   11112345555554432   


Q ss_pred             eeEEEEEEcCCCcee
Q 003800          201 QFHAYQINAMNGELL  215 (794)
Q Consensus       201 ~~~v~ald~~tG~~~  215 (794)
                        .+..+|+.+|+..
T Consensus       354 --~i~~~Dl~~g~~~  366 (429)
T PRK01742        354 --NVVKQDLTSGSTE  366 (429)
T ss_pred             --CEEEEECCCCCeE
Confidence              3666899999754


No 185
>KOG4499 consensus Ca2+-binding protein Regucalcin/SMP30 [Inorganic ion transport and metabolism; Signal transduction mechanisms]
Probab=48.10  E-value=92  Score=32.75  Aligned_cols=83  Identities=19%  Similarity=0.356  Sum_probs=52.1

Q ss_pred             CCEEEEEEccCCeEEEEe--CCCCcEeEEEeccCccccCC-ccccccccccccCCeEEEE--ECCEEEEEECCCCcEEEE
Q 003800           95 GKYVITLSSDGSTLRAWN--LPDGQMVWESFLRGSKHSKP-LLLVPTNLKVDKDSLILVS--SKGCLHAVSSIDGEILWT  169 (794)
Q Consensus        95 g~~~V~Vs~~g~~v~A~d--~~tG~llWe~~l~~~~~s~~-~~~~~~~~~~~~~~~V~V~--~~g~l~ald~~tG~~~W~  169 (794)
                      .+...++-+.+-.|-|||  ..+|.+.=+..+-.-.-+++ -+..|..+.++..+.++|.  ++|+++.+|+.||+.+=+
T Consensus       169 ~K~fY~iDsln~~V~a~dyd~~tG~~snr~~i~dlrk~~~~e~~~PDGm~ID~eG~L~Va~~ng~~V~~~dp~tGK~L~e  248 (310)
T KOG4499|consen  169 AKKFYYIDSLNYEVDAYDYDCPTGDLSNRKVIFDLRKSQPFESLEPDGMTIDTEGNLYVATFNGGTVQKVDPTTGKILLE  248 (310)
T ss_pred             CcEEEEEccCceEEeeeecCCCcccccCcceeEEeccCCCcCCCCCCcceEccCCcEEEEEecCcEEEEECCCCCcEEEE
Confidence            344455544456898888  88887654433321000000 0122223355667778885  589999999999999999


Q ss_pred             EeccCcce
Q 003800          170 RDFAAESV  177 (794)
Q Consensus       170 ~~~~~~~~  177 (794)
                      ...|.+..
T Consensus       249 iklPt~qi  256 (310)
T KOG4499|consen  249 IKLPTPQI  256 (310)
T ss_pred             EEcCCCce
Confidence            99997654


No 186
>smart00108 B_lectin Bulb-type mannose-specific lectin.
Probab=46.81  E-value=1.9e+02  Score=26.27  Aligned_cols=52  Identities=19%  Similarity=0.409  Sum_probs=28.9

Q ss_pred             eEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEEECCEEEEEECCCCcEEEEEecc
Q 003800          107 TLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSSKGCLHAVSSIDGEILWTRDFA  173 (794)
Q Consensus       107 ~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~~g~l~ald~~tG~~~W~~~~~  173 (794)
                      .+..++...+..+|......+..             . ...+.+..+|.|.-+|. +|.++|.-...
T Consensus        31 nlV~~~~~~~~~vW~snt~~~~~-------------~-~~~l~l~~dGnLvl~~~-~g~~vW~S~t~   82 (114)
T smart00108       31 NLILYKSSSRTVVWVANRDNPVS-------------D-SCTLTLQSDGNLVLYDG-DGRVVWSSNTT   82 (114)
T ss_pred             EEEEEECCCCcEEEECCCCCCCC-------------C-CEEEEEeCCCCEEEEeC-CCCEEEEeccc
Confidence            44444443367888865433211             0 11222335888887774 48899986443


No 187
>KOG1188 consensus WD40 repeat protein [General function prediction only]
Probab=46.80  E-value=2.2e+02  Score=31.54  Aligned_cols=64  Identities=13%  Similarity=0.078  Sum_probs=44.4

Q ss_pred             EEE-EECCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCcee
Q 003800          148 ILV-SSKGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELL  215 (794)
Q Consensus       148 V~V-~~~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~  215 (794)
                      |.| +++|.+..+|..||+.+=+++.+.+.....+++...++..|+..+.+|    .|.++|+.+-...
T Consensus        43 vav~lSngsv~lyd~~tg~~l~~fk~~~~~~N~vrf~~~ds~h~v~s~ssDG----~Vr~wD~Rs~~e~  107 (376)
T KOG1188|consen   43 VAVSLSNGSVRLYDKGTGQLLEEFKGPPATTNGVRFISCDSPHGVISCSSDG----TVRLWDIRSQAES  107 (376)
T ss_pred             EEEEecCCeEEEEeccchhhhheecCCCCcccceEEecCCCCCeeEEeccCC----eEEEEEeecchhh
Confidence            555 479999999999999988887766554333333212567788777777    7888887765444


No 188
>PF14727 PHTB1_N:  PTHB1 N-terminus
Probab=46.53  E-value=5.1e+02  Score=29.78  Aligned_cols=187  Identities=14%  Similarity=0.144  Sum_probs=106.4

Q ss_pred             CCEEEEEeCCCEEEEEECcCCccceEEEcCcccceee-eee-eeCCEEEEEEccCCeEEEEeC-----C-----------
Q 003800           53 RKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDG-IDI-ALGKYVITLSSDGSTLRAWNL-----P-----------  114 (794)
Q Consensus        53 ~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~-l~~-~~g~~~V~Vs~~g~~v~A~d~-----~-----------  114 (794)
                      .+.|.|-+-+|.|.-++.+.  ..-++.++.-. +++ +.. ...+-.|+.++ ...+.++.-     .           
T Consensus       145 ~~~IcVQS~DG~L~~feqe~--~~f~~~lp~~l-lPgPl~Y~~~tDsfvt~ss-s~~l~~Yky~~La~~s~~~~~~~~~~  220 (418)
T PF14727_consen  145 RDFICVQSMDGSLSFFEQES--FAFSRFLPDFL-LPGPLCYCPRTDSFVTASS-SWTLECYKYQDLASASEASSRQSGTE  220 (418)
T ss_pred             ceEEEEEecCceEEEEeCCc--EEEEEEcCCCC-CCcCeEEeecCCEEEEecC-ceeEEEecHHHhhhcccccccccccc
Confidence            56699999999999998654  45566665531 111 111 12333444333 235555431     0           


Q ss_pred             ----CC---cEeEEEeccCccccCCccccccccccccCCeEEEEECCEEEEEECCCCcEEEEEeccCcce--eeeeEEEE
Q 003800          115 ----DG---QMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSSKGCLHAVSSIDGEILWTRDFAAESV--EVQQVIQL  185 (794)
Q Consensus       115 ----tG---~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~~g~l~ald~~tG~~~W~~~~~~~~~--~~~~~v~s  185 (794)
                          +|   ..-|.+.++.+.+  ++.++..   ......++|++...|++++. +|.++|..++.-...  -++.+...
T Consensus       221 ~~~~~~k~l~~dWs~nlGE~~l--~i~v~~~---~~~~~~IvvLger~Lf~l~~-~G~l~~~krLd~~p~~~~~Y~~~~~  294 (418)
T PF14727_consen  221 QDISSGKKLNPDWSFNLGEQAL--DIQVVRF---SSSESDIVVLGERSLFCLKD-NGSLRFQKRLDYNPSCFCPYRVPWY  294 (418)
T ss_pred             ccccccccccceeEEECCceeE--EEEEEEc---CCCCceEEEEecceEEEEcC-CCeEEEEEecCCceeeEEEEEeecc
Confidence                22   4679999887664  3333321   11245789999999999995 799999999865432  22333111


Q ss_pred             ecCC---EEEEEEecCCceeEEEEEEcCCCceeeeeeeecc-cCccCceEEE-cCcEEEEEECCCCeEEEEEeecce
Q 003800          186 DESD---QIYVVGYAGSSQFHAYQINAMNGELLNHETAAFS-GGFVGDVALV-SSDTLVTLDTTRSILVTVSFKNRK  257 (794)
Q Consensus       186 ~~~~---~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~-~~~s~~~~~v-g~~~lv~~d~~~g~L~v~~l~sg~  257 (794)
                      ..++   .+.+.+..+    .+..+  ++.+.+|..++... -.++- +-+- -.+.+|.++. +|.|.+.=|+|..
T Consensus       295 ~~~~~~~~llV~t~t~----~LlVy--~d~~L~WsA~l~~~PVal~v-~~~~~~~G~IV~Ls~-~G~L~v~YLGTdP  363 (418)
T PF14727_consen  295 NEPSTRLNLLVGTHTG----TLLVY--EDTTLVWSAQLPHVPVALSV-ANFNGLKGLIVSLSD-EGQLSVSYLGTDP  363 (418)
T ss_pred             cCCCCceEEEEEecCC----eEEEE--eCCeEEEecCCCCCCEEEEe-cccCCCCceEEEEcC-CCcEEEEEeCCCC
Confidence            1222   234333333    34443  37788999976321 11110 0000 1468888874 6999999999865


No 189
>PRK13684 Ycf48-like protein; Provisional
Probab=43.82  E-value=4.8e+02  Score=28.74  Aligned_cols=168  Identities=13%  Similarity=0.114  Sum_probs=76.6

Q ss_pred             ccEeeEEeccCcee---eeeeeeeccCCCEEEEEeCCCEEEEEECcCCccceEEEcCcc--c--ceeeeeeeeCCEEEEE
Q 003800           29 GLMDWHQQYIGKVK---HAVFHTQKTGRKRVVVSTEENVIASLDLRHGEIFWRHVLGIN--D--VVDGIDIALGKYVITL  101 (794)
Q Consensus        29 G~~dW~~~~vG~~~---~~~f~~~~~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~--~--~i~~l~~~~g~~~V~V  101 (794)
                      ....|++...+...   .-.|.    +++..|+....|.|..  ..||-.-|++.....  .  .+..+.. .++..+++
T Consensus        33 ~~~~W~~~~~~~~~~l~~v~F~----d~~~g~avG~~G~il~--T~DgG~tW~~~~~~~~~~~~~l~~v~~-~~~~~~~~  105 (334)
T PRK13684         33 SSSPWQVIDLPTEANLLDIAFT----DPNHGWLVGSNRTLLE--TNDGGETWEERSLDLPEENFRLISISF-KGDEGWIV  105 (334)
T ss_pred             cCCCcEEEecCCCCceEEEEEe----CCCcEEEEECCCEEEE--EcCCCCCceECccCCcccccceeeeEE-cCCcEEEe
Confidence            33459988754322   12333    2445555555665543  346788899864321  1  1112211 23333333


Q ss_pred             EccCCeEEEEeCCCCcEeEEEeccCccccCC-ccccccccccccCCeEEEE-ECCEEEEEECCCCcEEEEEeccCcceee
Q 003800          102 SSDGSTLRAWNLPDGQMVWESFLRGSKHSKP-LLLVPTNLKVDKDSLILVS-SKGCLHAVSSIDGEILWTRDFAAESVEV  179 (794)
Q Consensus       102 s~~g~~v~A~d~~tG~llWe~~l~~~~~s~~-~~~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG~~~W~~~~~~~~~~~  179 (794)
                      + ..+.  .|-..||-.-|+........... ..+.     ...++.+++. ..|.+++-  .+|-..|+..........
T Consensus       106 G-~~g~--i~~S~DgG~tW~~~~~~~~~~~~~~~i~-----~~~~~~~~~~g~~G~i~~S--~DgG~tW~~~~~~~~g~~  175 (334)
T PRK13684        106 G-QPSL--LLHTTDGGKNWTRIPLSEKLPGSPYLIT-----ALGPGTAEMATNVGAIYRT--TDGGKNWEALVEDAAGVV  175 (334)
T ss_pred             C-CCce--EEEECCCCCCCeEccCCcCCCCCceEEE-----EECCCcceeeeccceEEEE--CCCCCCceeCcCCCcceE
Confidence            3 3333  34467899999876422111001 1111     1112333333 34544333  456678886443221112


Q ss_pred             eeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeeee
Q 003800          180 QQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHET  219 (794)
Q Consensus       180 ~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~  219 (794)
                      ..+. ...++.+++++..|    .++.. ...|..-|+..
T Consensus       176 ~~i~-~~~~g~~v~~g~~G----~i~~s-~~~gg~tW~~~  209 (334)
T PRK13684        176 RNLR-RSPDGKYVAVSSRG----NFYST-WEPGQTAWTPH  209 (334)
T ss_pred             EEEE-ECCCCeEEEEeCCc----eEEEE-cCCCCCeEEEe
Confidence            2222 12445555555555    34432 23566677663


No 190
>PF08553 VID27:  VID27 cytoplasmic protein;  InterPro: IPR013863  This entry represents fungal and plant proteins and contains many hypothetical proteins. Vid27p is a cytoplasmic protein of unknown function, possibly regulates import of fructose-1,6-bisphosphatase into Vacuolar Import and Degradation (Vid) vesicles and is not essential for proteasome-dependent degradation of fructose-1,6-bisphosphatase (FBPase) [, ].
Probab=43.53  E-value=3.9e+02  Score=33.30  Aligned_cols=110  Identities=15%  Similarity=0.163  Sum_probs=63.0

Q ss_pred             CEEEEEEc-cCCeEEEEeCCCCcEeEEEeccCcc-ccCCcccccc-ccccccCCeEEEE-ECCEEEEEECC-CC-cEEEE
Q 003800           96 KYVITLSS-DGSTLRAWNLPDGQMVWESFLRGSK-HSKPLLLVPT-NLKVDKDSLILVS-SKGCLHAVSSI-DG-EILWT  169 (794)
Q Consensus        96 ~~~V~Vs~-~g~~v~A~d~~tG~llWe~~l~~~~-~s~~~~~~~~-~~~~~~~~~V~V~-~~g~l~ald~~-tG-~~~W~  169 (794)
                      ..++.... +...|+-+|.+.|+++=+|...... .   ..+.+. ..+.-.....|+. ++..|+++|+. .| +++|.
T Consensus       493 ~~mil~~~~~~~~ly~mDLe~GKVV~eW~~~~~~~v---~~~~p~~K~aqlt~e~tflGls~n~lfriDpR~~~~k~v~~  569 (794)
T PF08553_consen  493 RNMILLDPNNPNKLYKMDLERGKVVEEWKVHDDIPV---VDIAPDSKFAQLTNEQTFLGLSDNSLFRIDPRLSGNKLVDS  569 (794)
T ss_pred             cceEeecCCCCCceEEEecCCCcEEEEeecCCCcce---eEecccccccccCCCceEEEECCCceEEeccCCCCCceeec
Confidence            34555543 4578999999999998888776432 1   011110 0000012345554 79999999998 45 46775


Q ss_pred             EeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCc
Q 003800          170 RDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGE  213 (794)
Q Consensus       170 ~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~  213 (794)
                      ....-..-...++.....+|.+.+++..|    .+.-+| ..|.
T Consensus       570 ~~k~Y~~~~~Fs~~aTt~~G~iavgs~~G----~IRLyd-~~g~  608 (794)
T PF08553_consen  570 QSKQYSSKNNFSCFATTEDGYIAVGSNKG----DIRLYD-RLGK  608 (794)
T ss_pred             cccccccCCCceEEEecCCceEEEEeCCC----cEEeec-ccch
Confidence            43221111234565455677777766666    344445 4564


No 191
>TIGR02276 beta_rpt_yvtn 40-residue YVTN family beta-propeller repeat. This repeat of about 40 amino acids is found in up to 14 copies per protein. Archaea Methanosarcina mazei and Methanosarcina acetivorans each have over 10 genes that encode tandem copies of this repeat, which is also found in other species. PSIPRED predicts with high confidence that each 40-residue repeats contains four beta strands. This model overlaps somewhat with the NHL repeat (Pfam pfam01436) and also shows sequence similarity to the WD domain, G-beta repeat (Pfam pfam00400).
Probab=43.17  E-value=68  Score=23.08  Aligned_cols=31  Identities=13%  Similarity=0.292  Sum_probs=24.3

Q ss_pred             CCCEEEEEeC-CCEEEEEECcCCccceEEEcC
Q 003800           52 GRKRVVVSTE-ENVIASLDLRHGEIFWRHVLG   82 (794)
Q Consensus        52 ~~~~Vyv~t~-~g~l~ALn~~tG~ivWR~~l~   82 (794)
                      +++++|++.. .+.|..+|+++|+++=+....
T Consensus         2 d~~~lyv~~~~~~~v~~id~~~~~~~~~i~vg   33 (42)
T TIGR02276         2 DGTKLYVTNSGSNTVSVIDTATNKVIATIPVG   33 (42)
T ss_pred             CCCEEEEEeCCCCEEEEEECCCCeEEEEEECC
Confidence            3677999886 689999999999776665553


No 192
>cd00028 B_lectin Bulb-type mannose-specific lectin. The domain contains a three-fold internal repeat (beta-prism architecture). The consensus sequence motif QXDXNXVXY is involved in alpha-D-mannose recognition. Lectins are carbohydrate-binding proteins which specifically recognize diverse carbohydrates and mediate a wide variety of biological processes, such as cell-cell and host-pathogen interactions, serum glycoprotein turnover, and innate immune responses.
Probab=42.59  E-value=2e+02  Score=26.29  Aligned_cols=22  Identities=23%  Similarity=0.557  Sum_probs=15.8

Q ss_pred             EECCEEEEEECCCCcEEEEEecc
Q 003800          151 SSKGCLHAVSSIDGEILWTRDFA  173 (794)
Q Consensus       151 ~~~g~l~ald~~tG~~~W~~~~~  173 (794)
                      ..+|.|+..|. +|.++|.-...
T Consensus        62 ~~dGnLvl~~~-~g~~vW~S~~~   83 (116)
T cd00028          62 QSDGNLVIYDG-SGTVVWSSNTT   83 (116)
T ss_pred             ecCCCeEEEcC-CCcEEEEeccc
Confidence            35788877774 67899986543


No 193
>KOG1517 consensus Guanine nucleotide binding protein MIP1 [Cell cycle control, cell division, chromosome partitioning]
Probab=42.28  E-value=4.2e+02  Score=33.78  Aligned_cols=147  Identities=9%  Similarity=0.068  Sum_probs=73.4

Q ss_pred             CEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEE-ECCEEEEEECCCCc-----EEEE
Q 003800           96 KYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVS-SKGCLHAVSSIDGE-----ILWT  169 (794)
Q Consensus        96 ~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG~-----~~W~  169 (794)
                      .+.++++|+-..||-||+..-...=.....++.+   +..+.  .....++.+++. .||.|..+|...-.     -.|+
T Consensus      1177 ~G~Ll~tGd~r~IRIWDa~~E~~~~diP~~s~t~---vTaLS--~~~~~gn~i~AGfaDGsvRvyD~R~a~~ds~v~~~R 1251 (1387)
T KOG1517|consen 1177 SGHLLVTGDVRSIRIWDAHKEQVVADIPYGSSTL---VTALS--ADLVHGNIIAAGFADGSVRVYDRRMAPPDSLVCVYR 1251 (1387)
T ss_pred             CCeEEecCCeeEEEEEecccceeEeecccCCCcc---ceeec--ccccCCceEEEeecCCceEEeecccCCccccceeec
Confidence            3556667766789999998766666555554432   11111  122323444444 59999999876433     3465


Q ss_pred             EeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeeeeeecccCcc--CceEEE--cCcEEEEEECCC
Q 003800          170 RDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHETAAFSGGFV--GDVALV--SSDTLVTLDTTR  245 (794)
Q Consensus       170 ~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~~~~s--~~~~~v--g~~~lv~~d~~~  245 (794)
                      .-...+.+.-..+. ...-+.++.++.+|    .+.-+|+..-....-.++..++.-.  -.++.|  -..+++|...  
T Consensus      1252 ~h~~~~~Iv~~slq-~~G~~elvSgs~~G----~I~~~DlR~~~~e~~~~iv~~~~yGs~lTal~VH~hapiiAsGs~-- 1324 (1387)
T KOG1517|consen 1252 EHNDVEPIVHLSLQ-RQGLGELVSGSQDG----DIQLLDLRMSSKETFLTIVAHWEYGSALTALTVHEHAPIIASGSA-- 1324 (1387)
T ss_pred             ccCCcccceeEEee-cCCCcceeeeccCC----eEEEEecccCcccccceeeeccccCccceeeeeccCCCeeeecCc--
Confidence            43332222111221 11223455555555    6777787653222222333333111  133333  4457777753  


Q ss_pred             CeEEEEEee
Q 003800          246 SILVTVSFK  254 (794)
Q Consensus       246 g~L~v~~l~  254 (794)
                      +.+.+.++.
T Consensus      1325 q~ikIy~~~ 1333 (1387)
T KOG1517|consen 1325 QLIKIYSLS 1333 (1387)
T ss_pred             ceEEEEecC
Confidence            445555543


No 194
>PF06433 Me-amine-dh_H:  Methylamine dehydrogenase heavy chain (MADH);  InterPro: IPR009451 Methylamine dehydrogenase (1.4.99.3 from EC) is a periplasmic quinoprotein found in several methylotrophic bacteria []. It is induced when grown on methylamine as a carbon source MADH and catalyses the oxidative deamination of amines to their corresponding aldehydes. The redox cofactor of this enzyme is tryptophan tryptophylquinone (TTQ). Electrons derived from the oxidation of methylamine are passed to an electron acceptor, which is usually the blue-copper protein amicyanin (IPR002386 from INTERPRO).  RCH2NH2 + H2O + acceptor = RCHO + NH3 + reduced acceptor  MADH is a hetero-tetramer, comprised of two heavy subunits and two light subunits. The heavy subunit forms a seven-bladed beta-propeller like structure [].; GO: 0030058 amine dehydrogenase activity, 0030416 methylamine metabolic process, 0055114 oxidation-reduction process, 0042597 periplasmic space; PDB: 3RN1_F 3SVW_F 3PXT_F 3L4O_F 3L4M_D 3SJL_F 3PXS_D 3ORV_F 3RMZ_F 3RLM_F ....
Probab=42.12  E-value=5.3e+02  Score=28.75  Aligned_cols=189  Identities=10%  Similarity=0.106  Sum_probs=108.0

Q ss_pred             CEEEEEeC-----CCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEE-Ec------cC---CeEEEEeCCCCcE
Q 003800           54 KRVVVSTE-----ENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITL-SS------DG---STLRAWNLPDGQM  118 (794)
Q Consensus        54 ~~Vyv~t~-----~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~V-s~------~g---~~v~A~d~~tG~l  118 (794)
                      .||||.+-     .+.++-+|+++|+.+=.....-..+   +.+..++..+++ ++      .|   ..|..||+.|=.+
T Consensus         3 ~rvyV~D~~~~~~~~rv~viD~d~~k~lGmi~~g~~~~---~~~spdgk~~y~a~T~~sR~~rG~RtDvv~~~D~~TL~~   79 (342)
T PF06433_consen    3 HRVYVQDPVFFHMTSRVYVIDADSGKLLGMIDTGFLGN---VALSPDGKTIYVAETFYSRGTRGERTDVVEIWDTQTLSP   79 (342)
T ss_dssp             TEEEEEE-GGGGSSEEEEEEETTTTEEEEEEEEESSEE---EEE-TTSSEEEEEEEEEEETTEEEEEEEEEEEETTTTEE
T ss_pred             cEEEEECCccccccceEEEEECCCCcEEEEeecccCCc---eeECCCCCEEEEEEEEEeccccccceeEEEEEecCcCcc
Confidence            46666664     3578888888887644433322211   111223333332 21      11   3589999999999


Q ss_pred             eEEEeccCc-cccCCcccccccccc-ccCCeEEEEE---CCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEE
Q 003800          119 VWESFLRGS-KHSKPLLLVPTNLKV-DKDSLILVSS---KGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYV  193 (794)
Q Consensus       119 lWe~~l~~~-~~s~~~~~~~~~~~~-~~~~~V~V~~---~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv  193 (794)
                      .||..+... .. ...+... .... +.++.++|..   ...|..+|.+.++++=+.+.|.=..    +.+ ..+...+.
T Consensus        80 ~~EI~iP~k~R~-~~~~~~~-~~~ls~dgk~~~V~N~TPa~SVtVVDl~~~kvv~ei~~PGC~~----iyP-~~~~~F~~  152 (342)
T PF06433_consen   80 TGEIEIPPKPRA-QVVPYKN-MFALSADGKFLYVQNFTPATSVTVVDLAAKKVVGEIDTPGCWL----IYP-SGNRGFSM  152 (342)
T ss_dssp             EEEEEETTS-B---BS--GG-GEEE-TTSSEEEEEEESSSEEEEEEETTTTEEEEEEEGTSEEE----EEE-EETTEEEE
T ss_pred             cceEecCCcchh-eeccccc-ceEEccCCcEEEEEccCCCCeEEEEECCCCceeeeecCCCEEE----EEe-cCCCceEE
Confidence            999999864 22 1111111 0111 2256777763   7789999999999988777765332    222 35677888


Q ss_pred             EEecCCceeEEEEEEcCCCceeeeeeeecccCccC-----ceEEE-cCcEEEEEECCCCeEEEEEeeccee
Q 003800          194 VGYAGSSQFHAYQINAMNGELLNHETAAFSGGFVG-----DVALV-SSDTLVTLDTTRSILVTVSFKNRKI  258 (794)
Q Consensus       194 ~~~~g~~~~~v~ald~~tG~~~w~~~v~~~~~~s~-----~~~~v-g~~~lv~~d~~~g~L~v~~l~sg~~  258 (794)
                      +|.+|+  +..+.||. .|+.. +..-. ......     ...++ .++.+++.. ++|.++..++.....
T Consensus       153 lC~DGs--l~~v~Ld~-~Gk~~-~~~t~-~F~~~~dp~f~~~~~~~~~~~~~F~S-y~G~v~~~dlsg~~~  217 (342)
T PF06433_consen  153 LCGDGS--LLTVTLDA-DGKEA-QKSTK-VFDPDDDPLFEHPAYSRDGGRLYFVS-YEGNVYSADLSGDSA  217 (342)
T ss_dssp             EETTSC--EEEEEETS-TSSEE-EEEEE-ESSTTTS-B-S--EEETTTTEEEEEB-TTSEEEEEEETTSSE
T ss_pred             EecCCc--eEEEEECC-CCCEe-Eeecc-ccCCCCcccccccceECCCCeEEEEe-cCCEEEEEeccCCcc
Confidence            888873  33344442 78887 33222 111111     22223 345677776 679999999987653


No 195
>KOG0308 consensus Conserved WD40 repeat-containing protein [Function unknown]
Probab=42.08  E-value=2.2e+02  Score=34.12  Aligned_cols=101  Identities=14%  Similarity=0.141  Sum_probs=58.7

Q ss_pred             EEEEEe-CCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCCc
Q 003800           55 RVVVST-EENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPL  133 (794)
Q Consensus        55 ~Vyv~t-~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~  133 (794)
                      .++|+. -++.|.-.|++|++.+=+.+ +..+++..+.+..++..+.-++.+++++.||..--+=+-.+.+..+..    
T Consensus       184 t~ivsGgtek~lr~wDprt~~kimkLr-GHTdNVr~ll~~dDGt~~ls~sSDgtIrlWdLgqQrCl~T~~vH~e~V----  258 (735)
T KOG0308|consen  184 TIIVSGGTEKDLRLWDPRTCKKIMKLR-GHTDNVRVLLVNDDGTRLLSASSDGTIRLWDLGQQRCLATYIVHKEGV----  258 (735)
T ss_pred             eEEEecCcccceEEeccccccceeeee-ccccceEEEEEcCCCCeEeecCCCceEEeeeccccceeeeEEeccCce----
Confidence            355544 47889999999999988877 444566655333333233323446799999986555555555544321    


Q ss_pred             cccccccccccCCeEEEE-ECCEEEEEECCC
Q 003800          134 LLVPTNLKVDKDSLILVS-SKGCLHAVSSID  163 (794)
Q Consensus       134 ~~~~~~~~~~~~~~V~V~-~~g~l~ald~~t  163 (794)
                      +....  ... -..+|.. .+|.+++-|..+
T Consensus       259 WaL~~--~~s-f~~vYsG~rd~~i~~Tdl~n  286 (735)
T KOG0308|consen  259 WALQS--SPS-FTHVYSGGRDGNIYRTDLRN  286 (735)
T ss_pred             EEEee--CCC-cceEEecCCCCcEEecccCC
Confidence            22211  000 1334444 377788877765


No 196
>PRK04922 tolB translocation protein TolB; Provisional
Probab=42.02  E-value=5.7e+02  Score=29.09  Aligned_cols=149  Identities=13%  Similarity=0.170  Sum_probs=71.9

Q ss_pred             eCCEEEEEEcc--CCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEE-EE-C--CEEEEEECCCCcEE
Q 003800           94 LGKYVITLSSD--GSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILV-SS-K--GCLHAVSSIDGEIL  167 (794)
Q Consensus        94 ~g~~~V~Vs~~--g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V-~~-~--g~l~ald~~tG~~~  167 (794)
                      .++.+++++..  ...++.||..+|+..--....+..  ..+.+     ..+ ++.+++ .+ +  ..|+.+|..+|+..
T Consensus       214 Dg~~la~~s~~~~~~~l~~~dl~~g~~~~l~~~~g~~--~~~~~-----SpD-G~~l~~~~s~~g~~~Iy~~d~~~g~~~  285 (433)
T PRK04922        214 DGKKLAYVSFERGRSAIYVQDLATGQRELVASFRGIN--GAPSF-----SPD-GRRLALTLSRDGNPEIYVMDLGSRQLT  285 (433)
T ss_pred             CCCEEEEEecCCCCcEEEEEECCCCCEEEeccCCCCc--cCceE-----CCC-CCEEEEEEeCCCCceEEEEECCCCCeE
Confidence            46667776532  357999999999865333322211  11111     123 334444 33 3  37999999988753


Q ss_pred             EEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeeeeeecccCccCceEE-EcCcEEEEEECCCC
Q 003800          168 WTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHETAAFSGGFVGDVAL-VSSDTLVTLDTTRS  246 (794)
Q Consensus       168 W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~~~~s~~~~~-vg~~~lv~~d~~~g  246 (794)
                      =-........   .+..+.++..+++.+..++ ...++.+|+.+|+...   +.........+.+ ..++.++......+
T Consensus       286 ~lt~~~~~~~---~~~~spDG~~l~f~sd~~g-~~~iy~~dl~~g~~~~---lt~~g~~~~~~~~SpDG~~Ia~~~~~~~  358 (433)
T PRK04922        286 RLTNHFGIDT---EPTWAPDGKSIYFTSDRGG-RPQIYRVAASGGSAER---LTFQGNYNARASVSPDGKKIAMVHGSGG  358 (433)
T ss_pred             ECccCCCCcc---ceEECCCCCEEEEEECCCC-CceEEEEECCCCCeEE---eecCCCCccCEEECCCCCEEEEEECCCC
Confidence            2111111111   1111234455555443322 2368888988887432   1111111111222 13344444433322


Q ss_pred             --eEEEEEeecce
Q 003800          247 --ILVTVSFKNRK  257 (794)
Q Consensus       247 --~L~v~~l~sg~  257 (794)
                        .+++.++.+|+
T Consensus       359 ~~~I~v~d~~~g~  371 (433)
T PRK04922        359 QYRIAVMDLSTGS  371 (433)
T ss_pred             ceeEEEEECCCCC
Confidence              57777877765


No 197
>PF15525 DUF4652:  Domain of unknown function (DUF4652)
Probab=40.85  E-value=3.8e+02  Score=27.26  Aligned_cols=65  Identities=25%  Similarity=0.412  Sum_probs=39.6

Q ss_pred             ceEEEEECCCCcEEEEEecccCCCCCCCceeeEEeeecCcccCCCCCCeEEEEEE-ecCCCCCCcEEEEEEccCCcee
Q 003800          497 RKIFALHSGDGRVVWSLLLHKSEACDSPTELNLYQWQTPHHHAMDENPSVLVVGR-CGVSSKAPAILSFVDTYTGKEL  573 (794)
Q Consensus       497 Gkl~alds~~G~i~W~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~vv~~-~~~~~~~~~~~~~~d~~tG~~~  573 (794)
                      |+||--|..+|.. |++.+.+.+....|   +.+-|-       |+...++|++. .| .-..-|.+|.+|..||+..
T Consensus        88 GkIYIkn~~~~~~-~~L~i~~~~~k~sP---K~i~Wi-------DD~~L~vIIG~a~G-TvS~GGnLy~~nl~tg~~~  153 (200)
T PF15525_consen   88 GKIYIKNLNNNNW-WSLQIDQNEEKYSP---KYIEWI-------DDNNLAVIIGYAHG-TVSKGGNLYKYNLNTGNLT  153 (200)
T ss_pred             eeEEEEecCCCce-EEEEecCcccccCC---ceeEEe-------cCCcEEEEEccccc-eEccCCeEEEEEccCCcee
Confidence            7888888887776 88877653211111   134552       23444555553 11 0245588999999999865


No 198
>PF03178 CPSF_A:  CPSF A subunit region;  InterPro: IPR004871 This family includes a region that lies towards the C terminus of the cleavage and polyadenylation specificity factor (CPSF) A (160 kDa) subunit. CPSF is involved in mRNA polyadenylation and binds the AAUAAA conserved sequence in pre-mRNA. CPSF has also been found to be necessary for splicing of single-intron pre-mRNAs []. The function of the aligned region is unknown but may be involved in RNA/DNA binding.; GO: 0003676 nucleic acid binding, 0005634 nucleus; PDB: 2B5M_A 4A0K_C 4A0B_C 3I7L_A 3I8E_A 4A09_A 4A0A_A 3EI4_C 2B5L_A 3I7O_A ....
Probab=40.38  E-value=5.1e+02  Score=28.04  Aligned_cols=175  Identities=13%  Similarity=0.104  Sum_probs=88.4

Q ss_pred             EEEEEECcCCccceEEEcCcccceeeeee-e--eC----CEEEEEEcc---------C-CeEEEEeCCCC-------cEe
Q 003800           64 VIASLDLRHGEIFWRHVLGINDVVDGIDI-A--LG----KYVITLSSD---------G-STLRAWNLPDG-------QMV  119 (794)
Q Consensus        64 ~l~ALn~~tG~ivWR~~l~~~~~i~~l~~-~--~g----~~~V~Vs~~---------g-~~v~A~d~~tG-------~ll  119 (794)
                      .|--+|+.+.+++=++.|+....+..+.. .  .+    ...++||+.         . |+++.++...+       +++
T Consensus         3 ~i~l~d~~~~~~~~~~~l~~~E~~~s~~~~~l~~~~~~~~~~ivVGT~~~~~~~~~~~~Gri~v~~i~~~~~~~~~l~~i   82 (321)
T PF03178_consen    3 SIRLVDPTTFEVLDSFELEPNEHVTSLCSVKLKGDSTGKKEYIVVGTAFNYGEDPEPSSGRILVFEISESPENNFKLKLI   82 (321)
T ss_dssp             EEEEEETTTSSEEEEEEEETTEEEEEEEEEEETTS---SSEEEEEEEEE--TTSSS-S-EEEEEEEECSS-----EEEEE
T ss_pred             EEEEEeCCCCeEEEEEECCCCceEEEEEEEEEcCccccccCEEEEEecccccccccccCcEEEEEEEEcccccceEEEEE
Confidence            46667888888887777776643332211 1  11    345555432         1 68999999885       333


Q ss_pred             EEEeccCccccCCccccccccccccCCeEEEEECCEEEEEECCCCc-EEEEEeccCcceeeeeEEEEecCCEEEEEEecC
Q 003800          120 WESFLRGSKHSKPLLLVPTNLKVDKDSLILVSSKGCLHAVSSIDGE-ILWTRDFAAESVEVQQVIQLDESDQIYVVGYAG  198 (794)
Q Consensus       120 We~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~~g~l~ald~~tG~-~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g  198 (794)
                      .+....+++.     .+    ... .+.+++..++.|+.++....+ ..=......+.. ...+  ...++.+++.....
T Consensus        83 ~~~~~~g~V~-----ai----~~~-~~~lv~~~g~~l~v~~l~~~~~l~~~~~~~~~~~-i~sl--~~~~~~I~vgD~~~  149 (321)
T PF03178_consen   83 HSTEVKGPVT-----AI----CSF-NGRLVVAVGNKLYVYDLDNSKTLLKKAFYDSPFY-ITSL--SVFKNYILVGDAMK  149 (321)
T ss_dssp             EEEEESS-EE-----EE----EEE-TTEEEEEETTEEEEEEEETTSSEEEEEEE-BSSS-EEEE--EEETTEEEEEESSS
T ss_pred             EEEeecCcce-----Eh----hhh-CCEEEEeecCEEEEEEccCcccchhhheecceEE-EEEE--eccccEEEEEEccc
Confidence            3444433322     11    122 566777778888888877666 221111111111 1122  23466666554433


Q ss_pred             CceeEEEEEEcCCCceeeeeeeecccCccCceEEEcCcEEEEEECCCCeEEEEEee
Q 003800          199 SSQFHAYQINAMNGELLNHETAAFSGGFVGDVALVSSDTLVTLDTTRSILVTVSFK  254 (794)
Q Consensus       199 ~~~~~v~ald~~tG~~~w~~~v~~~~~~s~~~~~vg~~~lv~~d~~~g~L~v~~l~  254 (794)
                        ++.++.++...-+...-.+-..+..+....+++.++.+++.|. .|+++++...
T Consensus       150 --sv~~~~~~~~~~~l~~va~d~~~~~v~~~~~l~d~~~~i~~D~-~gnl~~l~~~  202 (321)
T PF03178_consen  150 --SVSLLRYDEENNKLILVARDYQPRWVTAAEFLVDEDTIIVGDK-DGNLFVLRYN  202 (321)
T ss_dssp             --SEEEEEEETTTE-EEEEEEESS-BEEEEEEEE-SSSEEEEEET-TSEEEEEEE-
T ss_pred             --CEEEEEEEccCCEEEEEEecCCCccEEEEEEecCCcEEEEEcC-CCeEEEEEEC
Confidence              2566666763332332222122333333333334457788885 5888777664


No 199
>KOG0265 consensus U5 snRNP-specific protein-like factor and related proteins [RNA processing and modification]
Probab=38.59  E-value=82  Score=34.10  Aligned_cols=63  Identities=17%  Similarity=0.344  Sum_probs=44.8

Q ss_pred             CCCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeee-eCCEEEEEEccCCeEEEEeCCC
Q 003800           52 GRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIA-LGKYVITLSSDGSTLRAWNLPD  115 (794)
Q Consensus        52 ~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~-~g~~~V~Vs~~g~~v~A~d~~t  115 (794)
                      +...||-++.+-.|+..|.+||+..-|+.....- +..+.+. .|-.+|.-++++++++.||...
T Consensus       101 d~s~i~S~gtDk~v~~wD~~tG~~~rk~k~h~~~-vNs~~p~rrg~~lv~SgsdD~t~kl~D~R~  164 (338)
T KOG0265|consen  101 DGSHILSCGTDKTVRGWDAETGKRIRKHKGHTSF-VNSLDPSRRGPQLVCSGSDDGTLKLWDIRK  164 (338)
T ss_pred             CCCEEEEecCCceEEEEecccceeeehhccccce-eeecCccccCCeEEEecCCCceEEEEeecc
Confidence            3556888888999999999999999998887662 2222222 3444444344678999999863


No 200
>PF14339 DUF4394:  Domain of unknown function (DUF4394)
Probab=38.50  E-value=5e+02  Score=27.39  Aligned_cols=165  Identities=8%  Similarity=-0.003  Sum_probs=0.0

Q ss_pred             ccCCCEEEEEeCCCEEEEEECcCCccceE--EEcCcccceeeeeee---eCCEEEEEEccCCeEEEEeCCCCc------E
Q 003800           50 KTGRKRVVVSTEENVIASLDLRHGEIFWR--HVLGINDVVDGIDIA---LGKYVITLSSDGSTLRAWNLPDGQ------M  118 (794)
Q Consensus        50 ~~~~~~Vyv~t~~g~l~ALn~~tG~ivWR--~~l~~~~~i~~l~~~---~g~~~V~Vs~~g~~v~A~d~~tG~------l  118 (794)
                      .+..+.+|..+..|.||-||+.||.--.-  -.+........+.+-   ..+.+=+||..| +-.-+|+.+|.      .
T Consensus        35 Rpa~G~LYgl~~~g~lYtIn~~tG~aT~vg~s~~~~al~g~~~gvDFNP~aDRlRvvs~~G-qNlR~npdtGav~~~Dg~  113 (236)
T PF14339_consen   35 RPANGQLYGLGSTGRLYTINPATGAATPVGASPLTVALSGTAFGVDFNPAADRLRVVSNTG-QNLRLNPDTGAVTIVDGN  113 (236)
T ss_pred             ecCCCCEEEEeCCCcEEEEECCCCeEEEeecccccccccCceEEEecCcccCcEEEEccCC-cEEEECCCCCCceeccCc


Q ss_pred             eEEEeccCccccCCccccccccccccCCeEEEEECCEEEEEECCCCcEEEEEeccCcceeeeeEEE--------------
Q 003800          119 VWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSSKGCLHAVSSIDGEILWTRDFAAESVEVQQVIQ--------------  184 (794)
Q Consensus       119 lWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~--------------  184 (794)
                      ++-.......- ..+.+..   ........=.-...+||.+|...+...=+-.-..+.+.....+.              
T Consensus       114 L~y~~gd~~~G-~~p~v~a---aAYTNs~~g~~t~TtLy~ID~~~~~Lv~Q~ppN~GtL~~vG~LGvd~~~~~gFDI~~~  189 (236)
T PF14339_consen  114 LAYAAGDMNAG-TTPGVTA---AAYTNSFAGATTSTTLYDIDTTLDALVTQNPPNDGTLNTVGPLGVDAAGDAGFDIAGD  189 (236)
T ss_pred             cccCCCccccC-CCCceEE---EEEecccCCCccceEEEEEecCCCeEEEecCCCCCcEEeeeccccccCcccceeeecC


Q ss_pred             EecCCEEEEEEecCCceeEEEEEEcCCCceeeeeeee
Q 003800          185 LDESDQIYVVGYAGSSQFHAYQINAMNGELLNHETAA  221 (794)
Q Consensus       185 s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~  221 (794)
                      .......|.+...++  -.+|.+|+.||+...--.+.
T Consensus       190 ~~~~~~a~a~~~~~~--~~LY~vdL~TG~at~~g~i~  224 (236)
T PF14339_consen  190 GNGGNAAYAVLGVGG--SGLYTVDLTTGAATLVGQIG  224 (236)
T ss_pred             CCcceEEEEEecCCC--cEEEEEECCCcccEEeeecC


No 201
>PF09910 DUF2139:  Uncharacterized protein conserved in archaea (DUF2139);  InterPro: IPR016675 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=38.25  E-value=5.7e+02  Score=28.01  Aligned_cols=157  Identities=10%  Similarity=0.116  Sum_probs=85.3

Q ss_pred             cccccEeeEEeccCceeeeeeee---eccCCCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeee-eeCCEEEEE
Q 003800           26 DQVGLMDWHQQYIGKVKHAVFHT---QKTGRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDI-ALGKYVITL  101 (794)
Q Consensus        26 dqvG~~dW~~~~vG~~~~~~f~~---~~~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~-~~g~~~V~V  101 (794)
                      .+.++..++.+++|.+-...-++   ....++.||++.                |-+.=.      .+.. ..+...+-.
T Consensus        16 ~~d~~~iY~felvG~~P~SGGDTYNAV~~vDd~IyFGG----------------WVHAPa------~y~gk~~g~~~IdF   73 (339)
T PF09910_consen   16 RDDSEKIYRFELVGPPPTSGGDTYNAVEWVDDFIYFGG----------------WVHAPA------VYEGKGDGRATIDF   73 (339)
T ss_pred             cCCceEEEEeeeccCCCCCCCccceeeeeecceEEEee----------------eecCCc------eeeeccCCceEEEE
Confidence            56677889999999764222221   111245555553                432110      1100 122333433


Q ss_pred             EccCCeEEEEeCCCC--cEeEEEeccCccc-c---CCccccccccccccCCeEEEEECC----EEEEEECCCCcEEEEEe
Q 003800          102 SSDGSTLRAWNLPDG--QMVWESFLRGSKH-S---KPLLLVPTNLKVDKDSLILVSSKG----CLHAVSSIDGEILWTRD  171 (794)
Q Consensus       102 s~~g~~v~A~d~~tG--~llWe~~l~~~~~-s---~~~~~~~~~~~~~~~~~V~V~~~g----~l~ald~~tG~~~W~~~  171 (794)
                      ...=+.|..+|.++|  +++|.-....+.. .   .++ +.    .+..+..++...||    .|+.+|..+|+..|-.+
T Consensus        74 ~NKYSHVH~yd~e~~~VrLLWkesih~~~~WaGEVSdI-lY----dP~~D~LLlAR~DGh~nLGvy~ldr~~g~~~~L~~  148 (339)
T PF09910_consen   74 RNKYSHVHEYDTENDSVRLLWKESIHDKTKWAGEVSDI-LY----DPYEDRLLLARADGHANLGVYSLDRRTGKAEKLSS  148 (339)
T ss_pred             eeccceEEEEEcCCCeEEEEEecccCCccccccchhhe-ee----CCCcCEEEEEecCCcceeeeEEEcccCCceeeccC
Confidence            432368999999999  6899988765421 0   111 11    12223333334454    69999999999999887


Q ss_pred             ccCcceeeeeEEEEecCCEEEEEEecC-CceeEEEEEEcCCCceee
Q 003800          172 FAAESVEVQQVIQLDESDQIYVVGYAG-SSQFHAYQINAMNGELLN  216 (794)
Q Consensus       172 ~~~~~~~~~~~v~s~~~~~vyv~~~~g-~~~~~v~ald~~tG~~~w  216 (794)
                      .|...-    .+  ..+...|-+ ... ...-.+.|+|+.+|+.+-
T Consensus       149 ~ps~KG----~~--~~D~a~F~i-~~~~~g~~~i~~~Dli~~~~~~  187 (339)
T PF09910_consen  149 NPSLKG----TL--VHDYACFGI-NNFHKGVSGIHCLDLISGKWVI  187 (339)
T ss_pred             CCCcCc----eE--eeeeEEEec-cccccCCceEEEEEccCCeEEE
Confidence            765432    11  123333322 110 111268999999999854


No 202
>PRK02889 tolB translocation protein TolB; Provisional
Probab=37.84  E-value=6.6e+02  Score=28.58  Aligned_cols=149  Identities=13%  Similarity=0.100  Sum_probs=69.7

Q ss_pred             cCCCEEEEEeC---CCEEEEEECcCCccceEEEcC-cccceeeeeee-eCCEEEEEEcc-C-CeEEEEeCCCCcEeEEEe
Q 003800           51 TGRKRVVVSTE---ENVIASLDLRHGEIFWRHVLG-INDVVDGIDIA-LGKYVITLSSD-G-STLRAWNLPDGQMVWESF  123 (794)
Q Consensus        51 ~~~~~Vyv~t~---~g~l~ALn~~tG~ivWR~~l~-~~~~i~~l~~~-~g~~~V~Vs~~-g-~~v~A~d~~tG~llWe~~  123 (794)
                      +++++|++.+.   ...|+..|..+|+..   .+. .++........ .|+.+++.... + ..++.+|..+|.+. +..
T Consensus       205 PDG~~la~~s~~~~~~~I~~~dl~~g~~~---~l~~~~g~~~~~~~SPDG~~la~~~~~~g~~~Iy~~d~~~~~~~-~lt  280 (427)
T PRK02889        205 PDGTKLAYVSFESKKPVVYVHDLATGRRR---VVANFKGSNSAPAWSPDGRTLAVALSRDGNSQIYTVNADGSGLR-RLT  280 (427)
T ss_pred             CCCCEEEEEEccCCCcEEEEEECCCCCEE---EeecCCCCccceEECCCCCEEEEEEccCCCceEEEEECCCCCcE-ECC
Confidence            34556666553   246999999999753   221 11111112112 34445554332 2 46888888766532 211


Q ss_pred             ccCccccCCccccccccccccCCeEEEEE----CCEEEEEECCCCcEE-EEEeccCcceeeeeEEEEecCCEEEEEEecC
Q 003800          124 LRGSKHSKPLLLVPTNLKVDKDSLILVSS----KGCLHAVSSIDGEIL-WTRDFAAESVEVQQVIQLDESDQIYVVGYAG  198 (794)
Q Consensus       124 l~~~~~s~~~~~~~~~~~~~~~~~V~V~~----~g~l~ald~~tG~~~-W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g  198 (794)
                      -..... ..+...     .+ +..+++.+    .-.++.++..+|+.. -++.  ....  .....+.++..+++.+..+
T Consensus       281 ~~~~~~-~~~~wS-----pD-G~~l~f~s~~~g~~~Iy~~~~~~g~~~~lt~~--g~~~--~~~~~SpDG~~Ia~~s~~~  349 (427)
T PRK02889        281 QSSGID-TEPFFS-----PD-GRSIYFTSDRGGAPQIYRMPASGGAAQRVTFT--GSYN--TSPRISPDGKLLAYISRVG  349 (427)
T ss_pred             CCCCCC-cCeEEc-----CC-CCEEEEEecCCCCcEEEEEECCCCceEEEecC--CCCc--CceEECCCCCEEEEEEccC
Confidence            111111 111121     22 23344333    236777887766532 1111  1111  0111134455565555443


Q ss_pred             CceeEEEEEEcCCCcee
Q 003800          199 SSQFHAYQINAMNGELL  215 (794)
Q Consensus       199 ~~~~~v~ald~~tG~~~  215 (794)
                      + ...++.+|+.+|+..
T Consensus       350 g-~~~I~v~d~~~g~~~  365 (427)
T PRK02889        350 G-AFKLYVQDLATGQVT  365 (427)
T ss_pred             C-cEEEEEEECCCCCeE
Confidence            2 246888899998765


No 203
>COG3045 CreA Uncharacterized protein conserved in bacteria [Function unknown]
Probab=37.55  E-value=95  Score=30.15  Aligned_cols=58  Identities=19%  Similarity=0.231  Sum_probs=35.1

Q ss_pred             ChHHHHHHHHHHHHhccccccceeecccccEeeEEeccCce--eeeeeeeeccCCCEEEEEeC
Q 003800            1 MAIRFIILTLLFLSSCTIPSLSLYEDQVGLMDWHQQYIGKV--KHAVFHTQKTGRKRVVVSTE   61 (794)
Q Consensus         1 ~~~~~~l~~l~~l~~~~~~~~Al~edqvG~~dW~~~~vG~~--~~~~f~~~~~~~~~Vyv~t~   61 (794)
                      |++|.+|++.+++++++.++.   .+++|+++=-...+|.-  .-..|+.|...+=..|++..
T Consensus         3 ~~~~~~ll~~~~~~~l~~~a~---aE~iG~V~tvf~~~G~D~IvveafdDP~V~gVTCyvs~a   62 (165)
T COG3045           3 MKIRLLLLAGLLLLLLVGLAH---AEEIGSVSTVFDWLGNDHIVVEAFDDPDVKGVTCYVSRA   62 (165)
T ss_pred             chHHHHHHHHHHHHHhccccc---hhhccccceeEEEecCCcEEEEecCCCCcCcEEEEEEEe
Confidence            678888888875555555444   45567654323334443  33568877765555777664


No 204
>KOG0283 consensus WD40 repeat-containing protein [Function unknown]
Probab=36.73  E-value=4.9e+02  Score=31.81  Aligned_cols=110  Identities=13%  Similarity=0.168  Sum_probs=66.8

Q ss_pred             CEEEEEEcc-CCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEE-ECCEEEEEECCCCcE--EEEEe
Q 003800           96 KYVITLSSD-GSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVS-SKGCLHAVSSIDGEI--LWTRD  171 (794)
Q Consensus        96 ~~~V~Vs~~-g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~-~~g~l~ald~~tG~~--~W~~~  171 (794)
                      ++-.|+||. ++++|.|+..+=++.-.+.+..-+  .++.+.|     + ++..+|. .+|..+.++..+=+.  .|...
T Consensus       421 DDryFiSGSLD~KvRiWsI~d~~Vv~W~Dl~~lI--TAvcy~P-----d-Gk~avIGt~~G~C~fY~t~~lk~~~~~~I~  492 (712)
T KOG0283|consen  421 DDRYFISGSLDGKVRLWSISDKKVVDWNDLRDLI--TAVCYSP-----D-GKGAVIGTFNGYCRFYDTEGLKLVSDFHIR  492 (712)
T ss_pred             CCCcEeecccccceEEeecCcCeeEeehhhhhhh--eeEEecc-----C-CceEEEEEeccEEEEEEccCCeEEEeeeEe
Confidence            456677664 789999999999988888877433  3444444     4 4566666 488888887554433  35554


Q ss_pred             ccC------cceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeee
Q 003800          172 FAA------ESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHE  218 (794)
Q Consensus       172 ~~~------~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~  218 (794)
                      ...      ..++-.|+.+ ...+.|.|.+.+.    ++..+|..+=+++-..
T Consensus       493 ~~~~Kk~~~~rITG~Q~~p-~~~~~vLVTSnDS----rIRI~d~~~~~lv~Kf  540 (712)
T KOG0283|consen  493 LHNKKKKQGKRITGLQFFP-GDPDEVLVTSNDS----RIRIYDGRDKDLVHKF  540 (712)
T ss_pred             eccCccccCceeeeeEecC-CCCCeEEEecCCC----ceEEEeccchhhhhhh
Confidence            432      2233444442 3455677666554    5666676555555444


No 205
>KOG0379 consensus Kelch repeat-containing proteins [General function prediction only]
Probab=36.65  E-value=6.2e+02  Score=29.58  Aligned_cols=155  Identities=12%  Similarity=0.096  Sum_probs=86.3

Q ss_pred             eCCEEEEEEccC-----Ce--EEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEEE--CC------EEEE
Q 003800           94 LGKYVITLSSDG-----ST--LRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSS--KG------CLHA  158 (794)
Q Consensus        94 ~g~~~V~Vs~~g-----~~--v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~--~g------~l~a  158 (794)
                      .++.+++++|.+     ..  ++.+|..+  .+|......+..  +.+..+...... ++.++++.  +.      .|+.
T Consensus        69 ~~~~~~vfGG~~~~~~~~~~dl~~~d~~~--~~w~~~~~~g~~--p~~r~g~~~~~~-~~~l~lfGG~~~~~~~~~~l~~  143 (482)
T KOG0379|consen   69 IGNKLYVFGGYGSGDRLTDLDLYVLDLES--QLWTKPAATGDE--PSPRYGHSLSAV-GDKLYLFGGTDKKYRNLNELHS  143 (482)
T ss_pred             ECCEEEEECCCCCCCccccceeEEeecCC--cccccccccCCC--CCcccceeEEEE-CCeEEEEccccCCCCChhheEe
Confidence            466666666532     22  77787765  777776654432  222332211122 35555552  32      7899


Q ss_pred             EECCCCcEEEEEeccCcceeee--eEEEEecCCEEEEEEecCC---ceeEEEEEEcCCCceeeeeeee---cccCccC-c
Q 003800          159 VSSIDGEILWTRDFAAESVEVQ--QVIQLDESDQIYVVGYAGS---SQFHAYQINAMNGELLNHETAA---FSGGFVG-D  229 (794)
Q Consensus       159 ld~~tG~~~W~~~~~~~~~~~~--~~v~s~~~~~vyv~~~~g~---~~~~v~ald~~tG~~~w~~~v~---~~~~~s~-~  229 (794)
                      +|..|++  |+...+.+...+.  .......++++|+.|..+.   ..-.++++|+.+=+  |+.-..   .|+...+ .
T Consensus       144 ~d~~t~~--W~~l~~~~~~P~~r~~Hs~~~~g~~l~vfGG~~~~~~~~ndl~i~d~~~~~--W~~~~~~g~~P~pR~gH~  219 (482)
T KOG0379|consen  144 LDLSTRT--WSLLSPTGDPPPPRAGHSATVVGTKLVVFGGIGGTGDSLNDLHIYDLETST--WSELDTQGEAPSPRYGHA  219 (482)
T ss_pred             ccCCCCc--EEEecCcCCCCCCcccceEEEECCEEEEECCccCcccceeeeeeecccccc--ceecccCCCCCCCCCCce
Confidence            9988864  5554433221000  1111245688888775542   34578999998766  887432   2333333 4


Q ss_pred             eEEEcCcEEEEEECC-----CCeEEEEEeecce
Q 003800          230 VALVSSDTLVTLDTT-----RSILVTVSFKNRK  257 (794)
Q Consensus       230 ~~~vg~~~lv~~d~~-----~g~L~v~~l~sg~  257 (794)
                      ++++++..+++....     .+.++.+||.+.+
T Consensus       220 ~~~~~~~~~v~gG~~~~~~~l~D~~~ldl~~~~  252 (482)
T KOG0379|consen  220 MVVVGNKLLVFGGGDDGDVYLNDVHILDLSTWE  252 (482)
T ss_pred             EEEECCeEEEEeccccCCceecceEeeecccce
Confidence            555676766655433     2458899988855


No 206
>KOG1188 consensus WD40 repeat protein [General function prediction only]
Probab=36.36  E-value=4.8e+02  Score=28.97  Aligned_cols=172  Identities=13%  Similarity=0.121  Sum_probs=86.6

Q ss_pred             ECcCCccceEEEcCcccceeeeeeeeC-CEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCe
Q 003800           69 DLRHGEIFWRHVLGINDVVDGIDIALG-KYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSL  147 (794)
Q Consensus        69 n~~tG~ivWR~~l~~~~~i~~l~~~~g-~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~  147 (794)
                      .++++..-|-........+     ..+ +..|.|+-..+.++.||..+|+.+=++....+.. ....+..    -++.+.
T Consensus        17 S~~~~~~~~~Lk~~~q~~~-----~~~~e~~vav~lSngsv~lyd~~tg~~l~~fk~~~~~~-N~vrf~~----~ds~h~   86 (376)
T KOG1188|consen   17 SVRVSNEDFCLKYDIQEQV-----KDGFETAVAVSLSNGSVRLYDKGTGQLLEEFKGPPATT-NGVRFIS----CDSPHG   86 (376)
T ss_pred             ccccccccceeeccchhhh-----ccCcceeEEEEecCCeEEEEeccchhhhheecCCCCcc-cceEEec----CCCCCe
Confidence            3456666666555422211     112 2455555445789999999999998888766544 2333332    112345


Q ss_pred             EEEE-ECCEEEEEECCCCc----EEEEEeccCcceeeeeEEEEecCCEEEEEEe-cCCceeEEEEEEcCCCce-eeeeee
Q 003800          148 ILVS-SKGCLHAVSSIDGE----ILWTRDFAAESVEVQQVIQLDESDQIYVVGY-AGSSQFHAYQINAMNGEL-LNHETA  220 (794)
Q Consensus       148 V~V~-~~g~l~ald~~tG~----~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~-~g~~~~~v~ald~~tG~~-~w~~~v  220 (794)
                      |+.. ++|+|...|..+-.    ..|+...+.    +..+.+.-..+.++..+. .-++...|+-+|...-+. +.+..-
T Consensus        87 v~s~ssDG~Vr~wD~Rs~~e~a~~~~~~~~~~----~f~~ld~nck~~ii~~GtE~~~s~A~v~lwDvR~~qq~l~~~~e  162 (376)
T KOG1188|consen   87 VISCSSDGTVRLWDIRSQAESARISWTQQSGT----PFICLDLNCKKNIIACGTELTRSDASVVLWDVRSEQQLLRQLNE  162 (376)
T ss_pred             eEEeccCCeEEEEEeecchhhhheeccCCCCC----cceEeeccCcCCeEEeccccccCceEEEEEEeccccchhhhhhh
Confidence            5555 59999999887543    456654433    223332222334444332 112334566667654333 222211


Q ss_pred             ecccCccCceEEEcCc-EEEEEECCCCeEEEEEeec
Q 003800          221 AFSGGFVGDVALVSSD-TLVTLDTTRSILVTVSFKN  255 (794)
Q Consensus       221 ~~~~~~s~~~~~vg~~-~lv~~d~~~g~L~v~~l~s  255 (794)
                      +-.-+++.-++...++ +++... -.|-+.+.|++.
T Consensus       163 SH~DDVT~lrFHP~~pnlLlSGS-vDGLvnlfD~~~  197 (376)
T KOG1188|consen  163 SHNDDVTQLRFHPSDPNLLLSGS-VDGLVNLFDTKK  197 (376)
T ss_pred             hccCcceeEEecCCCCCeEEeec-ccceEEeeecCC
Confidence            1112333333333333 444443 346666666553


No 207
>KOG2321 consensus WD40 repeat protein [General function prediction only]
Probab=36.30  E-value=7.5e+02  Score=29.47  Aligned_cols=157  Identities=13%  Similarity=0.163  Sum_probs=86.0

Q ss_pred             eCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccc---------cCC----------------------cccccccccc
Q 003800           94 LGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKH---------SKP----------------------LLLVPTNLKV  142 (794)
Q Consensus        94 ~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~---------s~~----------------------~~~~~~~~~~  142 (794)
                      .|+.++..|....+|+.||.++=.+..+.-+..+..         +..                      +|-.+-++..
T Consensus        62 DGqY~lAtG~YKP~ikvydlanLSLKFERhlDae~V~feiLsDD~SK~v~L~~DR~IefHak~G~hy~~RIP~~GRDm~y  141 (703)
T KOG2321|consen   62 DGQYLLATGTYKPQIKVYDLANLSLKFERHLDAEVVDFEILSDDYSKSVFLQNDRTIEFHAKYGRHYRTRIPKFGRDMKY  141 (703)
T ss_pred             CCcEEEEecccCCceEEEEcccceeeeeecccccceeEEEeccchhhheEeecCceeeehhhcCeeeeeecCcCCccccc
Confidence            466666666677899999999999999988876653         000                      0000000000


Q ss_pred             cc-CCeEEEE-ECCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeeeee
Q 003800          143 DK-DSLILVS-SKGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHETA  220 (794)
Q Consensus       143 ~~-~~~V~V~-~~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v  220 (794)
                      +. ..++++. ++..||+|+..-|.-+=-++...+.+   .++....-..+++.|..   ...|-++|+.+-+.......
T Consensus       142 ~~~scDly~~gsg~evYRlNLEqGrfL~P~~~~~~~l---N~v~in~~hgLla~Gt~---~g~VEfwDpR~ksrv~~l~~  215 (703)
T KOG2321|consen  142 HKPSCDLYLVGSGSEVYRLNLEQGRFLNPFETDSGEL---NVVSINEEHGLLACGTE---DGVVEFWDPRDKSRVGTLDA  215 (703)
T ss_pred             cCCCccEEEeecCcceEEEEccccccccccccccccc---eeeeecCccceEEeccc---CceEEEecchhhhhheeeec
Confidence            00 1234444 57789999988887665555544433   22221223334444332   22899999988777765543


Q ss_pred             ecc----cCccC-----ceEEEcCcE-EEEEECCCCeEEEEEeecce
Q 003800          221 AFS----GGFVG-----DVALVSSDT-LVTLDTTRSILVTVSFKNRK  257 (794)
Q Consensus       221 ~~~----~~~s~-----~~~~vg~~~-lv~~d~~~g~L~v~~l~sg~  257 (794)
                      +..    .+...     ++-|-++++ ++|. ..+|+.++.||.+.+
T Consensus       216 ~~~v~s~pg~~~~~svTal~F~d~gL~~aVG-ts~G~v~iyDLRa~~  261 (703)
T KOG2321|consen  216 ASSVNSHPGGDAAPSVTALKFRDDGLHVAVG-TSTGSVLIYDLRASK  261 (703)
T ss_pred             ccccCCCccccccCcceEEEecCCceeEEee-ccCCcEEEEEcccCC
Confidence            322    11111     122223343 3344 356888888887755


No 208
>KOG3914 consensus WD repeat protein WDR4 [Function unknown]
Probab=35.62  E-value=83  Score=35.20  Aligned_cols=72  Identities=19%  Similarity=0.309  Sum_probs=36.4

Q ss_pred             CCCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEE-EccCCeEEEEeCCCCcEeEEEecc
Q 003800           52 GRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITL-SSDGSTLRAWNLPDGQMVWESFLR  125 (794)
Q Consensus        52 ~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~V-s~~g~~v~A~d~~tG~llWe~~l~  125 (794)
                      ++..|+++..+..|-......=-.+=..-++...-+..+  ...++-..+ ++.+++||.||..+|+.+=...+.
T Consensus       162 D~~~IitaDRDEkIRvs~ypa~f~IesfclGH~eFVS~i--sl~~~~~LlS~sGD~tlr~Wd~~sgk~L~t~dl~  234 (390)
T KOG3914|consen  162 DDQFIITADRDEKIRVSRYPATFVIESFCLGHKEFVSTI--SLTDNYLLLSGSGDKTLRLWDITSGKLLDTCDLS  234 (390)
T ss_pred             CCCEEEEecCCceEEEEecCcccchhhhccccHhheeee--eeccCceeeecCCCCcEEEEecccCCcccccchh
Confidence            345566666666665554322111111112111112222  333332233 344579999999999999555544


No 209
>KOG0292 consensus Vesicle coat complex COPI, alpha subunit [Intracellular trafficking, secretion, and vesicular transport]
Probab=35.28  E-value=1e+03  Score=30.03  Aligned_cols=107  Identities=11%  Similarity=0.150  Sum_probs=56.3

Q ss_pred             cceEEEcCcc--cceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEEE
Q 003800           75 IFWRHVLGIN--DVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSS  152 (794)
Q Consensus        75 ivWR~~l~~~--~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~  152 (794)
                      ..|....=.+  .++.++-....++++.-.+.++.+|.||+..-+-+=.++-..+                  ..-++..
T Consensus       239 KaWEvDtcrgH~nnVssvlfhp~q~lIlSnsEDksirVwDm~kRt~v~tfrrend------------------RFW~laa  300 (1202)
T KOG0292|consen  239 KAWEVDTCRGHYNNVSSVLFHPHQDLILSNSEDKSIRVWDMTKRTSVQTFRREND------------------RFWILAA  300 (1202)
T ss_pred             cceeehhhhcccCCcceEEecCccceeEecCCCccEEEEecccccceeeeeccCC------------------eEEEEEe
Confidence            3565544322  2344331223345555355678999999975444433332221                  2222222


Q ss_pred             --CCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCC
Q 003800          153 --KGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMN  211 (794)
Q Consensus       153 --~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~t  211 (794)
                        ...|+|---++|-.+...+...|..    +   ..++.+|.+- . .   .+..+|..|
T Consensus       301 hP~lNLfAAgHDsGm~VFkleRErpa~----~---v~~n~LfYvk-d-~---~i~~~d~~t  349 (1202)
T KOG0292|consen  301 HPELNLFAAGHDSGMIVFKLERERPAY----A---VNGNGLFYVK-D-R---FIRSYDLRT  349 (1202)
T ss_pred             cCCcceeeeecCCceEEEEEcccCceE----E---EcCCEEEEEc-c-c---eEEeeeccc
Confidence              3555655556777888776554332    2   4667776654 2 2   577777776


No 210
>PF02897 Peptidase_S9_N:  Prolyl oligopeptidase, N-terminal beta-propeller domain;  InterPro: IPR004106 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. 
They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This entry represents the beta-propeller domain found at the N-terminal of prolyl oligopeptidase, including acylamino-acid-releasing enzyme (also known as acylaminoacyl peptidase), which belong to the MEROPS peptidase family S9 (clan SC), subfamily S9A. The prolyl oligopeptidase family consist of a number of evolutionary related peptidases whose catalytic activity seems to be provided by a charge relay system similar to that of the trypsin family of serine proteases, but which evolved by independent convergent evolution. The N-terminal domain of prolyl oligopeptidases form an unusual 7-bladed beta-propeller consisting of seven 4-stranded beta-sheet motifs.  
Prolyl oligopeptidase is a large cytosolic enzyme involved in the maturation and degradation of peptide hormones and neuropeptides, which relate to the induction of amnesia. The enzyme contains a peptidase domain, where its catalytic triad (Ser554, His680, Asp641) is covered by the central tunnel of the N-terminal beta-propeller domain. In this way, large structured peptides are excluded from the active site, thereby protecting larger peptides and proteins from proteolysis in the cytosol []. The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. Mammalian acylaminoacyl peptidase is an exopeptidase that is a member of the same prolyl oligopeptidase family of serine peptidases. This enzyme removes acylated amino acid residues from the N terminus of oligopeptides [].; GO: 0004252 serine-type endopeptidase activity, 0006508 proteolysis; PDB: 2BKL_B 3DDU_A 1YR2_A 2XE4_A 1VZ3_A 3EQ9_A 1O6F_A 3EQ7_A 4AN0_A 1UOP_A ....
Probab=34.54  E-value=7e+02  Score=27.94  Aligned_cols=146  Identities=20%  Similarity=0.199  Sum_probs=75.7

Q ss_pred             CEEEEEECcCC---ccceEEEcCcccceeeeeeeeCCEEEEEEcc---CCeEEEEeCCCCcE-eEEEeccCccccCCccc
Q 003800           63 NVIASLDLRHG---EIFWRHVLGINDVVDGIDIALGKYVITLSSD---GSTLRAWNLPDGQM-VWESFLRGSKHSKPLLL  135 (794)
Q Consensus        63 g~l~ALn~~tG---~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~---g~~v~A~d~~tG~l-lWe~~l~~~~~s~~~~~  135 (794)
                      +.++.+|..++   ...|+.............-..++...+++..   .+.|.+.+..+... .|+..+..+.-  ...+
T Consensus       252 s~v~~~d~~~~~~~~~~~~~l~~~~~~~~~~v~~~~~~~yi~Tn~~a~~~~l~~~~l~~~~~~~~~~~l~~~~~--~~~l  329 (414)
T PF02897_consen  252 SEVYLLDLDDGGSPDAKPKLLSPREDGVEYYVDHHGDRLYILTNDDAPNGRLVAVDLADPSPAEWWTVLIPEDE--DVSL  329 (414)
T ss_dssp             EEEEEEECCCTTTSS-SEEEEEESSSS-EEEEEEETTEEEEEE-TT-TT-EEEEEETTSTSGGGEEEEEE--SS--SEEE
T ss_pred             CeEEEEeccccCCCcCCcEEEeCCCCceEEEEEccCCEEEEeeCCCCCCcEEEEecccccccccceeEEcCCCC--ceeE
Confidence            57999999886   7888887765433322212346666666643   36899999988875 56654433221  1111


Q ss_pred             cccccccccCCeEEEE--EC--CEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecC-CceeEEEEEEcC
Q 003800          136 VPTNLKVDKDSLILVS--SK--GCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAG-SSQFHAYQINAM  210 (794)
Q Consensus       136 ~~~~~~~~~~~~V~V~--~~--g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g-~~~~~v~ald~~  210 (794)
                      ...  ... .+.+++.  .+  .+|..++...|...-....+.... ...+......+.+++ .+.+ -.-..++.+|+.
T Consensus       330 ~~~--~~~-~~~Lvl~~~~~~~~~l~v~~~~~~~~~~~~~~p~~g~-v~~~~~~~~~~~~~~-~~ss~~~P~~~y~~d~~  404 (414)
T PF02897_consen  330 EDV--SLF-KDYLVLSYRENGSSRLRVYDLDDGKESREIPLPEAGS-VSGVSGDFDSDELRF-SYSSFTTPPTVYRYDLA  404 (414)
T ss_dssp             EEE--EEE-TTEEEEEEEETTEEEEEEEETT-TEEEEEEESSSSSE-EEEEES-TT-SEEEE-EEEETTEEEEEEEEETT
T ss_pred             EEE--EEE-CCEEEEEEEECCccEEEEEECCCCcEEeeecCCcceE-EeccCCCCCCCEEEE-EEeCCCCCCEEEEEECC
Confidence            110  122 3444443  33  468888877566666666654332 111111123444443 3322 112378889999


Q ss_pred             CCcee
Q 003800          211 NGELL  215 (794)
Q Consensus       211 tG~~~  215 (794)
                      +|+..
T Consensus       405 t~~~~  409 (414)
T PF02897_consen  405 TGELT  409 (414)
T ss_dssp             TTCEE
T ss_pred             CCCEE
Confidence            98864


No 211
>PF05567 Neisseria_PilC:  Neisseria PilC beta-propeller domain;  InterPro: IPR008707 This domain is found in several PilC protein sequences from Neisseria gonorrhoeae and Neisseria meningitidis. PilC is a phase-variable protein associated with pilus-mediated adherence of pathogenic Neisseria to target cells [].; PDB: 3HX6_A.
Probab=34.44  E-value=6.6e+02  Score=27.83  Aligned_cols=55  Identities=20%  Similarity=0.250  Sum_probs=32.0

Q ss_pred             eeEEEEEEcCC-Cceeeeeeeecc-cCccCceEEEc---C---cEEEEEECCCCeEEEEEeecce
Q 003800          201 QFHAYQINAMN-GELLNHETAAFS-GGFVGDVALVS---S---DTLVTLDTTRSILVTVSFKNRK  257 (794)
Q Consensus       201 ~~~v~ald~~t-G~~~w~~~v~~~-~~~s~~~~~vg---~---~~lv~~d~~~g~L~v~~l~sg~  257 (794)
                      +..++.+|++| |..+|...+... .+++. +.++.   +   ..++..|. .|+++.+|+.+..
T Consensus       180 ~~~lyi~d~~t~G~l~~~i~~~~~~~gl~~-~~~~D~d~DG~~D~vYaGDl-~GnlwR~dl~~~~  242 (335)
T PF05567_consen  180 GAALYILDADTTGALIKKIDVPGGSGGLSS-PAVVDSDGDGYVDRVYAGDL-GGNLWRFDLSSAN  242 (335)
T ss_dssp             -EEEEEEETTT---EEEEEEE--STT-EEE-EEEE-TTSSSEE-EEEEEET-TSEEEEEE--TTS
T ss_pred             CcEEEEEECCCCCceEEEEecCCCCccccc-cEEEeccCCCeEEEEEEEcC-CCcEEEEECCCCC
Confidence            47899999999 999998765443 23333 33331   1   26778886 5999999997643


No 212
>KOG0281 consensus Beta-TrCP (transducin repeats containing)/Slimb proteins [Function unknown]
Probab=33.85  E-value=1.3e+02  Score=33.26  Aligned_cols=98  Identities=16%  Similarity=0.165  Sum_probs=54.6

Q ss_pred             CEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEE--ECCEEEEEECCCCcEEEEEecc
Q 003800           96 KYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVS--SKGCLHAVSSIDGEILWTRDFA  173 (794)
Q Consensus        96 ~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~--~~g~l~ald~~tG~~~W~~~~~  173 (794)
                      +.+|..|| +.+++.|+..||+-+-......-..    ..     ... .+.++|.  +|.++.-.|...|.-+=..+--
T Consensus       331 kyIVsASg-DRTikvW~~st~efvRtl~gHkRGI----AC-----lQY-r~rlvVSGSSDntIRlwdi~~G~cLRvLeGH  399 (499)
T KOG0281|consen  331 KYIVSASG-DRTIKVWSTSTCEFVRTLNGHKRGI----AC-----LQY-RDRLVVSGSSDNTIRLWDIECGACLRVLEGH  399 (499)
T ss_pred             ceEEEecC-CceEEEEeccceeeehhhhcccccc----ee-----hhc-cCeEEEecCCCceEEEEeccccHHHHHHhch
Confidence            33444444 5799999999998886655443221    11     223 4555554  4888888888888654332211


Q ss_pred             CcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCC
Q 003800          174 AESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNG  212 (794)
Q Consensus       174 ~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG  212 (794)
                      +.   ..+++. .++.++.-.+++|    ++-..|..+|
T Consensus       400 Ee---LvRciR-Fd~krIVSGaYDG----kikvWdl~aa  430 (499)
T KOG0281|consen  400 EE---LVRCIR-FDNKRIVSGAYDG----KIKVWDLQAA  430 (499)
T ss_pred             HH---hhhhee-ecCceeeeccccc----eEEEEecccc
Confidence            11   122321 3455555444455    6666666554


No 213
>PF14783 BBS2_Mid:  Ciliary BBSome complex subunit 2, middle region
Probab=33.55  E-value=3.9e+02  Score=24.75  Aligned_cols=68  Identities=13%  Similarity=0.139  Sum_probs=47.6

Q ss_pred             CCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCc
Q 003800           53 RKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGS  127 (794)
Q Consensus        53 ~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~  127 (794)
                      .+-++++|++..|-.++  ..++++...-...  +..+.....+...| +-.+|+|-.++.  ...+|+..-...
T Consensus        15 ~~eLlvGs~D~~IRvf~--~~e~~~Ei~e~~~--v~~L~~~~~~~F~Y-~l~NGTVGvY~~--~~RlWRiKSK~~   82 (111)
T PF14783_consen   15 ENELLVGSDDFEIRVFK--GDEIVAEITETDK--VTSLCSLGGGRFAY-ALANGTVGVYDR--SQRLWRIKSKNQ   82 (111)
T ss_pred             cceEEEecCCcEEEEEe--CCcEEEEEecccc--eEEEEEcCCCEEEE-EecCCEEEEEeC--cceeeeeccCCC
Confidence            46699999999999997  4578888665544  44442223444445 444579999976  889999986554


No 214
>KOG0771 consensus Prolactin regulatory element-binding protein/Protein transport protein SEC12p [Intracellular trafficking, secretion, and vesicular transport]
Probab=32.25  E-value=5.6e+02  Score=28.99  Aligned_cols=19  Identities=16%  Similarity=0.200  Sum_probs=9.0

Q ss_pred             EEEEEECCCCeEEEEEeec
Q 003800          237 TLVTLDTTRSILVTVSFKN  255 (794)
Q Consensus       237 ~lv~~d~~~g~L~v~~l~s  255 (794)
                      .++++-...|++.+.+..+
T Consensus       294 kf~AlGT~dGsVai~~~~~  312 (398)
T KOG0771|consen  294 KFLALGTMDGSVAIYDAKS  312 (398)
T ss_pred             cEEEEeccCCcEEEEEece
Confidence            3334433455555555443


No 215
>KOG2395 consensus Protein involved in vacuole import and degradation [Intracellular trafficking, secretion, and vesicular transport]
Probab=31.57  E-value=7.1e+02  Score=29.41  Aligned_cols=116  Identities=12%  Similarity=0.157  Sum_probs=66.6

Q ss_pred             CCEEEEEEc-cCCeEEEEeCCCCcEeEEEeccCccccCCcccccc-cc-ccccCCeEEEEECCEEEEEECC-CCc--EEE
Q 003800           95 GKYVITLSS-DGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPT-NL-KVDKDSLILVSSKGCLHAVSSI-DGE--ILW  168 (794)
Q Consensus        95 g~~~V~Vs~-~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~-~~-~~~~~~~V~V~~~g~l~ald~~-tG~--~~W  168 (794)
                      ...+++.++ .-..|+-+|.+.|+++=+|.+.....  -..+.+. .. .......++-+++..|+++|+. .|.  ..|
T Consensus       344 dsnlil~~~~~~~~l~klDIE~GKIVeEWk~~~di~--mv~~t~d~K~~Ql~~e~TlvGLs~n~vfriDpRv~~~~kl~~  421 (644)
T KOG2395|consen  344 DSNLILMDGGEQDKLYKLDIERGKIVEEWKFEDDIN--MVDITPDFKFAQLTSEQTLVGLSDNSVFRIDPRVQGKNKLAV  421 (644)
T ss_pred             ccceEeeCCCCcCcceeeecccceeeeEeeccCCcc--eeeccCCcchhcccccccEEeecCCceEEecccccCcceeee
Confidence            445667654 34679999999999998887765411  0001000 00 0111233444689999999986 443  557


Q ss_pred             EEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeee
Q 003800          169 TRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNH  217 (794)
Q Consensus       169 ~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~  217 (794)
                      .....-..-.-.+|...+.+|.+.+.+..|    .+.-+|. .|.+..+
T Consensus       422 ~q~kqy~~k~nFsc~aTT~sG~IvvgS~~G----dIRLYdr-i~~~AKT  465 (644)
T KOG2395|consen  422 VQSKQYSTKNNFSCFATTESGYIVVGSLKG----DIRLYDR-IGRRAKT  465 (644)
T ss_pred             eeccccccccccceeeecCCceEEEeecCC----cEEeehh-hhhhhhh
Confidence            654432221234666556778888777776    4444554 5555433


No 216
>KOG0639 consensus Transducin-like enhancer of split protein (contains WD40 repeats) [Chromatin structure and dynamics]
Probab=30.94  E-value=2.2e+02  Score=32.99  Aligned_cols=75  Identities=12%  Similarity=0.174  Sum_probs=57.3

Q ss_pred             ccCCCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCc
Q 003800           50 KTGRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGS  127 (794)
Q Consensus        50 ~~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~  127 (794)
                      +.++-+++++.-+|.|-+-|.++|+.+=++++...  |-.+..--.++++.||-.++.+..+.. +|....+..+...
T Consensus       560 s~dGtklWTGGlDntvRcWDlregrqlqqhdF~SQ--IfSLg~cP~~dWlavGMens~vevlh~-skp~kyqlhlheS  634 (705)
T KOG0639|consen  560 SKDGTKLWTGGLDNTVRCWDLREGRQLQQHDFSSQ--IFSLGYCPTGDWLAVGMENSNVEVLHT-SKPEKYQLHLHES  634 (705)
T ss_pred             cCCCceeecCCCccceeehhhhhhhhhhhhhhhhh--heecccCCCccceeeecccCcEEEEec-CCccceeeccccc
Confidence            33466799999999999999999999988888766  333321246789998877778988875 7888888776653


No 217
>TIGR02276 beta_rpt_yvtn 40-residue YVTN family beta-propeller repeat. This repeat of about 40 amino acids is found in up to 14 copies per protein. Archaea Methanosarcina mazei and Methanosarcina acetivorans each have over 10 genes that encode tandem copies of this repeat, which is also found in other species. PSIPRED predicts with high confidence that each 40-residue repeat contains four beta strands. This model overlaps somewhat with the NHL repeat (Pfam pfam01436) and also shows sequence similarity to the WD domain, G-beta repeat (Pfam pfam00400).
Probab=30.78  E-value=1.1e+02  Score=21.85  Aligned_cols=30  Identities=10%  Similarity=0.249  Sum_probs=21.7

Q ss_pred             CCEEEEEEecCCceeEEEEEEcCCCceeeeeee
Q 003800          188 SDQIYVVGYAGSSQFHAYQINAMNGELLNHETA  220 (794)
Q Consensus       188 ~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v  220 (794)
                      +..+|+....++   .+..+|+.+++.+.+..+
T Consensus         3 ~~~lyv~~~~~~---~v~~id~~~~~~~~~i~v   32 (42)
T TIGR02276         3 GTKLYVTNSGSN---TVSVIDTATNKVIATIPV   32 (42)
T ss_pred             CCEEEEEeCCCC---EEEEEECCCCeEEEEEEC
Confidence            456887654433   788899999988877654


No 218
>PF08596 Lgl_C:  Lethal giant larvae(Lgl) like, C-terminal;  InterPro: IPR013905  The Lethal giant larvae (Lgl) tumour suppressor protein is conserved from yeast to mammals. The Lgl protein functions in cell polarity, at least in part, by regulating SNARE-mediated membrane delivery events at the cell surface []. The N-terminal half of Lgl members contains WD40 repeats (see IPR001680 from INTERPRO), while the C-terminal half appears specific to the protein []. ; PDB: 2OAJ_A.
Probab=30.12  E-value=8.6e+02  Score=27.64  Aligned_cols=182  Identities=13%  Similarity=0.143  Sum_probs=84.3

Q ss_pred             CCEEEEEeCCCEEEEEECcCCccceEEEcCcc--c-----ceeeee---eeeCC-----EEEEEEccCCeEEEEeCC-CC
Q 003800           53 RKRVVVSTEENVIASLDLRHGEIFWRHVLGIN--D-----VVDGID---IALGK-----YVITLSSDGSTLRAWNLP-DG  116 (794)
Q Consensus        53 ~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~--~-----~i~~l~---~~~g~-----~~V~Vs~~g~~v~A~d~~-tG  116 (794)
                      =+-|-++.++|.|.-+|.|--+++-+..+.+.  .     .+..+.   ...++     -.++||++.|.+..|... .+
T Consensus        97 iGFvaigy~~G~l~viD~RGPavI~~~~i~~~~~~~~~~~~vt~ieF~vm~~~~D~ySSi~L~vGTn~G~v~~fkIlp~~  176 (395)
T PF08596_consen   97 IGFVAIGYESGSLVVIDLRGPAVIYNENIRESFLSKSSSSYVTSIEFSVMTLGGDGYSSICLLVGTNSGNVLTFKILPSS  176 (395)
T ss_dssp             TSEEEEEETTSEEEEEETTTTEEEEEEEGGG--T-SS----EEEEEEEEEE-TTSSSEEEEEEEEETTSEEEEEEEEE-G
T ss_pred             CcEEEEEecCCcEEEEECCCCeEEeeccccccccccccccCeeEEEEEEEecCCCcccceEEEEEeCCCCEEEEEEecCC
Confidence            34578888999999999999999999887661  0     111111   11222     245666666788888654 34


Q ss_pred             cEeEEEeccCccccCCccccccccccccCCeEEEEECCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEe
Q 003800          117 QMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSSKGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGY  196 (794)
Q Consensus       117 ~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~  196 (794)
                      .-.|+....+.....             ++.     -..+..+|.++|+..+........+.....    .++.+.+++.
T Consensus       177 ~g~f~v~~~~~~~~~-------------~~~-----i~~I~~i~~~~G~~a~At~~~~~~l~~g~~----i~g~vVvvSe  234 (395)
T PF08596_consen  177 NGRFSVQFAGATTNH-------------DSP-----ILSIIPINADTGESALATISAMQGLSKGIS----IPGYVVVVSE  234 (395)
T ss_dssp             GG-EEEEEEEEE--S-------------S---------EEEEEETTT--B-B-BHHHHHGGGGT--------EEEEEE-S
T ss_pred             CCceEEEEeeccccC-------------CCc-----eEEEEEEECCCCCcccCchhHhhccccCCC----cCcEEEEEcc
Confidence            455776654321000             111     113556688888776553221111100000    1223333322


Q ss_pred             cCCceeEEEEEEcCCCceeeeeeeecccCccCceEEE------cCcEEEEEECCCCeEEEEEeecceeeeEEEee
Q 003800          197 AGSSQFHAYQINAMNGELLNHETAAFSGGFVGDVALV------SSDTLVTLDTTRSILVTVSFKNRKIAFQETHL  265 (794)
Q Consensus       197 ~g~~~~~v~ald~~tG~~~w~~~v~~~~~~s~~~~~v------g~~~lv~~d~~~g~L~v~~l~sg~~~~~~~~l  265 (794)
                      .     .+..+.+.+++...... ..+ -+...+.++      ++..++|+.. +|.+.+..|-.=+ ++.++.+
T Consensus       235 ~-----~irv~~~~~~k~~~K~~-~~~-~~~~~~~vv~~~~~~~~~~Lv~l~~-~G~i~i~SLP~Lk-ei~~~~l  300 (395)
T PF08596_consen  235 S-----DIRVFKPPKSKGAHKSF-DDP-FLCSSASVVPTISRNGGYCLVCLFN-NGSIRIYSLPSLK-EIKSVSL  300 (395)
T ss_dssp             S-----EEEEE-TT---EEEEE--SS--EEEEEEEEEEEE-EEEEEEEEEEET-TSEEEEEETTT---EEEEEE-
T ss_pred             c-----ceEEEeCCCCcccceee-ccc-cccceEEEEeecccCCceEEEEEEC-CCcEEEEECCCch-HhhcccC
Confidence            2     35556666666543332 111 112222222      4457888874 6888888877633 2454544


No 219
>KOG0281 consensus Beta-TrCP (transducin repeats containing)/Slimb proteins [Function unknown]
Probab=30.08  E-value=1.1e+02  Score=33.61  Aligned_cols=72  Identities=14%  Similarity=0.182  Sum_probs=42.7

Q ss_pred             CCCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccC
Q 003800           52 GRKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRG  126 (794)
Q Consensus        52 ~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~  126 (794)
                      +++.|+.++.+-.+-.-|..||+-+=. .-+....|..+  .-.+..|+-|+.+.++|.||+..|..+--.+...
T Consensus       329 d~kyIVsASgDRTikvW~~st~efvRt-l~gHkRGIACl--QYr~rlvVSGSSDntIRlwdi~~G~cLRvLeGHE  400 (499)
T KOG0281|consen  329 DDKYIVSASGDRTIKVWSTSTCEFVRT-LNGHKRGIACL--QYRDRLVVSGSSDNTIRLWDIECGACLRVLEGHE  400 (499)
T ss_pred             ccceEEEecCCceEEEEeccceeeehh-hhcccccceeh--hccCeEEEecCCCceEEEEeccccHHHHHHhchH
Confidence            355577777788888888777754321 11111123333  2233344324457899999999999885544443


No 220
>KOG1912 consensus WD40 repeat protein [General function prediction only]
Probab=29.90  E-value=5.4e+02  Score=31.75  Aligned_cols=76  Identities=16%  Similarity=0.172  Sum_probs=49.7

Q ss_pred             EEEEEccCCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEE-EE-ECCEEEEEECCCCcEEEEEeccCc
Q 003800           98 VITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLIL-VS-SKGCLHAVSSIDGEILWTRDFAAE  175 (794)
Q Consensus        98 ~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~-V~-~~g~l~ald~~tG~~~W~~~~~~~  175 (794)
                      ++.++...|++..||-..|..+=+..-..... +++..++   ..+...+++ +. ....+.-.+..||+..|++.....
T Consensus        81 liAsaD~~GrIil~d~~~~s~~~~l~~~~~~~-qdl~W~~---~rd~Srd~LlaIh~ss~lvLwntdtG~k~Wk~~ys~~  156 (1062)
T KOG1912|consen   81 LIASADISGRIILVDFVLASVINWLSHSNDSV-QDLCWVP---ARDDSRDVLLAIHGSSTLVLWNTDTGEKFWKYDYSHE  156 (1062)
T ss_pred             eEEeccccCcEEEEEehhhhhhhhhcCCCcch-hheeeee---ccCcchheeEEecCCcEEEEEEccCCceeeccccCCc
Confidence            44444446799999999986654444443333 5666665   233233444 34 477888999999999999987655


Q ss_pred             ce
Q 003800          176 SV  177 (794)
Q Consensus       176 ~~  177 (794)
                      .+
T Consensus       157 iL  158 (1062)
T KOG1912|consen  157 IL  158 (1062)
T ss_pred             ce
Confidence            44


No 221
>KOG0263 consensus Transcription initiation factor TFIID, subunit TAF5 (also component of histone acetyltransferase SAGA) [Transcription]
Probab=29.64  E-value=2.1e+02  Score=34.75  Aligned_cols=69  Identities=13%  Similarity=0.260  Sum_probs=45.9

Q ss_pred             EEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEecc
Q 003800           56 VVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLR  125 (794)
Q Consensus        56 Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~  125 (794)
                      +.+++.+-.+---|..+|..+=++. +...++..+.....+..+.-++.++.|.-||..+|+++=+....
T Consensus       550 ~aTGSsD~tVRlWDv~~G~~VRiF~-GH~~~V~al~~Sp~Gr~LaSg~ed~~I~iWDl~~~~~v~~l~~H  618 (707)
T KOG0263|consen  550 VATGSSDRTVRLWDVSTGNSVRIFT-GHKGPVTALAFSPCGRYLASGDEDGLIKIWDLANGSLVKQLKGH  618 (707)
T ss_pred             cccCCCCceEEEEEcCCCcEEEEec-CCCCceEEEEEcCCCceEeecccCCcEEEEEcCCCcchhhhhcc
Confidence            5555567889999999998865542 23344555533333334443555789999999999998766655


No 222
>KOG0271 consensus Notchless-like WD40 repeat-containing protein [Function unknown]
Probab=29.58  E-value=8.7e+02  Score=27.50  Aligned_cols=64  Identities=20%  Similarity=0.279  Sum_probs=40.5

Q ss_pred             CCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEE-E-CCEEEEEECCCCcEE
Q 003800           95 GKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVS-S-KGCLHAVSSIDGEIL  167 (794)
Q Consensus        95 g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~-~-~g~l~ald~~tG~~~  167 (794)
                      +..++. |+.+.++|-||..|-.+.-.......=. .   .+    +...++..++. + +|.+...|+++|++.
T Consensus       127 g~~l~t-GsGD~TvR~WD~~TeTp~~t~KgH~~WV-l---cv----awsPDgk~iASG~~dg~I~lwdpktg~~~  192 (480)
T KOG0271|consen  127 GSRLVT-GSGDTTVRLWDLDTETPLFTCKGHKNWV-L---CV----AWSPDGKKIASGSKDGSIRLWDPKTGQQI  192 (480)
T ss_pred             CceEEe-cCCCceEEeeccCCCCcceeecCCccEE-E---EE----EECCCcchhhccccCCeEEEecCCCCCcc
Confidence            334444 4446899999999988887777654311 0   11    22224555554 3 899999999888653


No 223
>KOG0289 consensus mRNA splicing factor [General function prediction only]
Probab=29.50  E-value=9.2e+02  Score=27.74  Aligned_cols=75  Identities=11%  Similarity=0.033  Sum_probs=51.1

Q ss_pred             cCCCEEEEEeCCCEEEEEECcCCccceEEEcCccc-ceeeeeeeeCCEEEEEE-ccCCeEEEEeCCCCcEeEEEeccC
Q 003800           51 TGRKRVVVSTEENVIASLDLRHGEIFWRHVLGIND-VVDGIDIALGKYVITLS-SDGSTLRAWNLPDGQMVWESFLRG  126 (794)
Q Consensus        51 ~~~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~-~i~~l~~~~g~~~V~Vs-~~g~~v~A~d~~tG~llWe~~l~~  126 (794)
                      +.++-++-++.++..+=-|.++|..+=.+.-+..+ .+... ...-++.++.. ..++.|+-||..++...=.+....
T Consensus       313 ~tgeYllsAs~d~~w~Fsd~~~g~~lt~vs~~~s~v~~ts~-~fHpDgLifgtgt~d~~vkiwdlks~~~~a~Fpght  389 (506)
T KOG0289|consen  313 PTGEYLLSASNDGTWAFSDISSGSQLTVVSDETSDVEYTSA-AFHPDGLIFGTGTPDGVVKIWDLKSQTNVAKFPGHT  389 (506)
T ss_pred             cCCcEEEEecCCceEEEEEccCCcEEEEEeeccccceeEEe-eEcCCceEEeccCCCceEEEEEcCCccccccCCCCC
Confidence            34667888888999888899999988777665331 22222 12345566654 457899999999988665555443


No 224
>PRK01742 tolB translocation protein TolB; Provisional
Probab=28.87  E-value=9e+02  Score=27.45  Aligned_cols=183  Identities=15%  Similarity=0.127  Sum_probs=83.4

Q ss_pred             CCEE-EEEeCC-----CEEEEEECcCCccceEEEcCcc-cceeeeee-eeCCEEEEEEcc--CCeEEEEeCCCCcEeEEE
Q 003800           53 RKRV-VVSTEE-----NVIASLDLRHGEIFWRHVLGIN-DVVDGIDI-ALGKYVITLSSD--GSTLRAWNLPDGQMVWES  122 (794)
Q Consensus        53 ~~~V-yv~t~~-----g~l~ALn~~tG~ivWR~~l~~~-~~i~~l~~-~~g~~~V~Vs~~--g~~v~A~d~~tG~llWe~  122 (794)
                      ..+| |+.+..     ..|.-.|.. |.-.  +.+... ..+..... ..|+.+++++..  +..++.||..+|+..--.
T Consensus       168 ~~ria~v~~~~~~~~~~~i~i~d~d-g~~~--~~lt~~~~~v~~p~wSPDG~~la~~s~~~~~~~i~i~dl~tg~~~~l~  244 (429)
T PRK01742        168 RTRIAYVVQKNGGSQPYEVRVADYD-GFNQ--FIVNRSSQPLMSPAWSPDGSKLAYVSFENKKSQLVVHDLRSGARKVVA  244 (429)
T ss_pred             CCEEEEEEEEcCCCceEEEEEECCC-CCCc--eEeccCCCccccceEcCCCCEEEEEEecCCCcEEEEEeCCCCceEEEe
Confidence            3444 766542     366666764 4332  233222 11222211 256667776643  357999999999754333


Q ss_pred             eccCccccCCccccccccccccCCeEEEE-E-CC--EEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecC
Q 003800          123 FLRGSKHSKPLLLVPTNLKVDKDSLILVS-S-KG--CLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAG  198 (794)
Q Consensus       123 ~l~~~~~s~~~~~~~~~~~~~~~~~V~V~-~-~g--~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g  198 (794)
                      ...+..  ..+.+.     ++ ++.+++. . +|  .++.+|..+|+..=-......   ...+..+.++..+++.+..+
T Consensus       245 ~~~g~~--~~~~wS-----PD-G~~La~~~~~~g~~~Iy~~d~~~~~~~~lt~~~~~---~~~~~wSpDG~~i~f~s~~~  313 (429)
T PRK01742        245 SFRGHN--GAPAFS-----PD-GSRLAFASSKDGVLNIYVMGANGGTPSQLTSGAGN---NTEPSWSPDGQSILFTSDRS  313 (429)
T ss_pred             cCCCcc--CceeEC-----CC-CCEEEEEEecCCcEEEEEEECCCCCeEeeccCCCC---cCCEEECCCCCEEEEEECCC
Confidence            332211  111121     22 2334443 2 44  477888877764211111111   11122133444555444322


Q ss_pred             CceeEEEEEEcCCCceeeeeeeecccCccCceEEEcCcEEEEEECCCCeEEEEEeecce
Q 003800          199 SSQFHAYQINAMNGELLNHETAAFSGGFVGDVALVSSDTLVTLDTTRSILVTVSFKNRK  257 (794)
Q Consensus       199 ~~~~~v~ald~~tG~~~w~~~v~~~~~~s~~~~~vg~~~lv~~d~~~g~L~v~~l~sg~  257 (794)
                       ....++.++..+|.....   . ... ........+..++....  ..+...|+.+|+
T Consensus       314 -g~~~I~~~~~~~~~~~~l---~-~~~-~~~~~SpDG~~ia~~~~--~~i~~~Dl~~g~  364 (429)
T PRK01742        314 -GSPQVYRMSASGGGASLV---G-GRG-YSAQISADGKTLVMING--DNVVKQDLTSGS  364 (429)
T ss_pred             -CCceEEEEECCCCCeEEe---c-CCC-CCccCCCCCCEEEEEcC--CCEEEEECCCCC
Confidence             234778888877654321   1 111 11111113334444432  356668888776


No 225
>COG3386 Gluconolactonase [Carbohydrate transport and metabolism]
Probab=28.17  E-value=8.1e+02  Score=26.81  Aligned_cols=75  Identities=21%  Similarity=0.171  Sum_probs=45.0

Q ss_pred             cCCCEEEEEeC-CCEEEEEECc--CCccceE----EEcCcccceeeeeeeeCCEEEEEEc-c-CCeEEEEeCCCCcEeEE
Q 003800           51 TGRKRVVVSTE-ENVIASLDLR--HGEIFWR----HVLGINDVVDGIDIALGKYVITLSS-D-GSTLRAWNLPDGQMVWE  121 (794)
Q Consensus        51 ~~~~~Vyv~t~-~g~l~ALn~~--tG~ivWR----~~l~~~~~i~~l~~~~g~~~V~Vs~-~-g~~v~A~d~~tG~llWe  121 (794)
                      ++++.+|++.- .+.|.+++..  +|.+-=|    ..-..++..+++. ...++.+.++. . |+.|..|++. |+++=+
T Consensus       172 pDg~tly~aDT~~~~i~r~~~d~~~g~~~~~~~~~~~~~~~G~PDG~~-vDadG~lw~~a~~~g~~v~~~~pd-G~l~~~  249 (307)
T COG3386         172 PDGKTLYVADTPANRIHRYDLDPATGPIGGRRGFVDFDEEPGLPDGMA-VDADGNLWVAAVWGGGRVVRFNPD-GKLLGE  249 (307)
T ss_pred             CCCCEEEEEeCCCCeEEEEecCcccCccCCcceEEEccCCCCCCCceE-EeCCCCEEEecccCCceEEEECCC-CcEEEE
Confidence            34556777665 4778777553  3444333    2112223344553 45556665433 2 3489999997 999999


Q ss_pred             EeccCc
Q 003800          122 SFLRGS  127 (794)
Q Consensus       122 ~~l~~~  127 (794)
                      ..+...
T Consensus       250 i~lP~~  255 (307)
T COG3386         250 IKLPVK  255 (307)
T ss_pred             EECCCC
Confidence            998743


No 226
>PF11589 DUF3244:  Domain of unknown function (DUF3244);  InterPro: IPR021638  This family of proteins with unknown function appears to be restricted to Bacteroidetes. The protein may have an immunoglobulin-like beta-sandwich fold; however, this cannot be confirmed.; PDB: 3D33_B 3SD2_A.
Probab=27.01  E-value=1.2e+02  Score=27.45  Aligned_cols=24  Identities=17%  Similarity=0.188  Sum_probs=19.6

Q ss_pred             cEEEEEEEEceeeeEEEEEEecCCC
Q 003800          731 AWLVVYLIDTITGRILHRMTHHGAQ  755 (794)
Q Consensus       731 ~~l~v~liD~VTG~il~s~~h~~~~  755 (794)
                      ..++|.+.| .+|+++|+.......
T Consensus        48 ~~vtI~I~d-~~G~vVy~~~~~~~~   71 (106)
T PF11589_consen   48 GDVTITIKD-STGNVVYSETVSNSA   71 (106)
T ss_dssp             SEEEEEEEE-TT--EEEEEEESCGG
T ss_pred             CCEEEEEEe-CCCCEEEEEEccCCC
Confidence            689999999 999999999988853


No 227
>PF08553 VID27:  VID27 cytoplasmic protein;  InterPro: IPR013863  This entry represents fungal and plant proteins and contains many hypothetical proteins. Vid27p is a cytoplasmic protein of unknown function; it possibly regulates import of fructose-1,6-bisphosphatase into Vacuolar Import and Degradation (Vid) vesicles and is not essential for proteasome-dependent degradation of fructose-1,6-bisphosphatase (FBPase) [, ].
Probab=27.00  E-value=2.6e+02  Score=34.79  Aligned_cols=63  Identities=11%  Similarity=0.171  Sum_probs=45.3

Q ss_pred             CCEEEEEeCCCEEEEEECcCC--ccceEEEcCc--ccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCc
Q 003800           53 RKRVVVSTEENVIASLDLRHG--EIFWRHVLGI--NDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQ  117 (794)
Q Consensus        53 ~~~Vyv~t~~g~l~ALn~~tG--~ivWR~~l~~--~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~  117 (794)
                      ....|++-.+|.|..+|||-.  +++|.+.-..  .-.+.++ ++.++|.++||+..|.||.||. .|+
T Consensus       542 ~e~tflGls~n~lfriDpR~~~~k~v~~~~k~Y~~~~~Fs~~-aTt~~G~iavgs~~G~IRLyd~-~g~  608 (794)
T PF08553_consen  542 NEQTFLGLSDNSLFRIDPRLSGNKLVDSQSKQYSSKNNFSCF-ATTEDGYIAVGSNKGDIRLYDR-LGK  608 (794)
T ss_pred             CCceEEEECCCceEEeccCCCCCceeeccccccccCCCceEE-EecCCceEEEEeCCCcEEeecc-cch
Confidence            445899999999999999974  3677654322  2224444 4678888888887789999995 563


No 228
>PRK02889 tolB translocation protein TolB; Provisional
Probab=26.73  E-value=9.8e+02  Score=27.16  Aligned_cols=149  Identities=13%  Similarity=0.153  Sum_probs=69.2

Q ss_pred             eCCEEEEEEcc--CCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEE-EE-C--CEEEEEECCCCcEE
Q 003800           94 LGKYVITLSSD--GSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILV-SS-K--GCLHAVSSIDGEIL  167 (794)
Q Consensus        94 ~g~~~V~Vs~~--g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V-~~-~--g~l~ald~~tG~~~  167 (794)
                      .|+.+++++..  ...++.||..+|+..--....+..  ..+.+.     .+ ++.+++ .. +  ..++.+|..+|...
T Consensus       206 DG~~la~~s~~~~~~~I~~~dl~~g~~~~l~~~~g~~--~~~~~S-----PD-G~~la~~~~~~g~~~Iy~~d~~~~~~~  277 (427)
T PRK02889        206 DGTKLAYVSFESKKPVVYVHDLATGRRRVVANFKGSN--SAPAWS-----PD-GRTLAVALSRDGNSQIYTVNADGSGLR  277 (427)
T ss_pred             CCCEEEEEEccCCCcEEEEEECCCCCEEEeecCCCCc--cceEEC-----CC-CCEEEEEEccCCCceEEEEECCCCCcE
Confidence            45556666532  256999999999764222222211  111111     22 234443 32 3  36888888776532


Q ss_pred             EEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeeeeeecccCccCceEE-EcCcEEEEEECCCC
Q 003800          168 WTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHETAAFSGGFVGDVAL-VSSDTLVTLDTTRS  246 (794)
Q Consensus       168 W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~~~~s~~~~~-vg~~~lv~~d~~~g  246 (794)
                       ........  ...+..+.++..+++.+..++ ...++.+|..+|+..-   +.........+.+ ..++.++......+
T Consensus       278 -~lt~~~~~--~~~~~wSpDG~~l~f~s~~~g-~~~Iy~~~~~~g~~~~---lt~~g~~~~~~~~SpDG~~Ia~~s~~~g  350 (427)
T PRK02889        278 -RLTQSSGI--DTEPFFSPDGRSIYFTSDRGG-APQIYRMPASGGAAQR---VTFTGSYNTSPRISPDGKLLAYISRVGG  350 (427)
T ss_pred             -ECCCCCCC--CcCeEEcCCCCEEEEEecCCC-CcEEEEEECCCCceEE---EecCCCCcCceEECCCCCEEEEEEccCC
Confidence             11111111  011112344555655443322 3478888887775321   1111111111222 12334444432333


Q ss_pred             --eEEEEEeecce
Q 003800          247 --ILVTVSFKNRK  257 (794)
Q Consensus       247 --~L~v~~l~sg~  257 (794)
                        .+++.++.+++
T Consensus       351 ~~~I~v~d~~~g~  363 (427)
T PRK02889        351 AFKLYVQDLATGQ  363 (427)
T ss_pred             cEEEEEEECCCCC
Confidence              58888888776


No 229
>PF01456 Mucin:  Mucin-like glycoprotein;  InterPro: IPR000458 This family of trypanosomal proteins resembles vertebrate mucins. The protein consists of three regions. The N and C termini are conserved between all members of the family, whereas the central region is not well conserved and contains a large number of threonine residues which can be glycosylated []. Indirect evidence suggested that these genes might encode the core protein of parasite mucins, glycoproteins that were proposed to be involved in the interaction with, and invasion of, mammalian host cells.
Probab=26.14  E-value=49  Score=31.67  Aligned_cols=27  Identities=26%  Similarity=0.422  Sum_probs=17.8

Q ss_pred             ChHHHHHHHHHHHHhccccccceeecc
Q 003800            1 MAIRFIILTLLFLSSCTIPSLSLYEDQ   27 (794)
Q Consensus         1 ~~~~~~l~~l~~l~~~~~~~~Al~edq   27 (794)
                      |=-++|||+||+|++|.-++-..-+.+
T Consensus         1 MmtcRLLCalLvlaLcCCpsvc~t~~~   27 (143)
T PF01456_consen    1 MMTCRLLCALLVLALCCCPSVCATASE   27 (143)
T ss_pred             CchHHHHHHHHHHHHHcCcchhccccc
Confidence            335789999999999763333333433


No 230
>COG4946 Uncharacterized protein related to the periplasmic component of the Tol biopolymer transport system [Function unknown]
Probab=25.83  E-value=1.1e+03  Score=27.48  Aligned_cols=195  Identities=14%  Similarity=0.150  Sum_probs=92.0

Q ss_pred             eeeCCEEEEEEcc-C-CeEEEEeCCCCcEeEE-EeccCccccCCccccccccccccCCeEEE-EECCEEEEEECCCCcEE
Q 003800           92 IALGKYVITLSSD-G-STLRAWNLPDGQMVWE-SFLRGSKHSKPLLLVPTNLKVDKDSLILV-SSKGCLHAVSSIDGEIL  167 (794)
Q Consensus        92 ~~~g~~~V~Vs~~-g-~~v~A~d~~tG~llWe-~~l~~~~~s~~~~~~~~~~~~~~~~~V~V-~~~g~l~ald~~tG~~~  167 (794)
                      +-+++.+.+++.. | |++++-|. +|+-+-+ +.+..... .         ....++.-+| ...|.++.+|+++-++.
T Consensus       232 mIV~~RvYFlsD~eG~GnlYSvdl-dGkDlrrHTnFtdYY~-R---------~~nsDGkrIvFq~~GdIylydP~td~le  300 (668)
T COG4946         232 MIVGERVYFLSDHEGVGNLYSVDL-DGKDLRRHTNFTDYYP-R---------NANSDGKRIVFQNAGDIYLYDPETDSLE  300 (668)
T ss_pred             eEEcceEEEEecccCccceEEecc-CCchhhhcCCchhccc-c---------ccCCCCcEEEEecCCcEEEeCCCcCcce
Confidence            4578888888863 3 68999997 5655443 34333221 1         1122444444 46899999999876542


Q ss_pred             -EEEeccCcce-eeeeEE-E-------EecCCEEEEEEecCCceeEEEEEEcCCCceeeeeeeecccCccCceEEEcCcE
Q 003800          168 -WTRDFAAESV-EVQQVI-Q-------LDESDQIYVVGYAGSSQFHAYQINAMNGELLNHETAAFSGGFVGDVALVSSDT  237 (794)
Q Consensus       168 -W~~~~~~~~~-~~~~~v-~-------s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~~v~~~~~~s~~~~~vg~~~  237 (794)
                       =...+|...- ...+.+ +       +..+|..++...-|    +....++-.|-.+   ++..+.++-=....+..+-
T Consensus       301 kldI~lpl~rk~k~~k~~~pskyledfa~~~Gd~ia~VSRG----kaFi~~~~~~~~i---qv~~~~~VrY~r~~~~~e~  373 (668)
T COG4946         301 KLDIGLPLDRKKKQPKFVNPSKYLEDFAVVNGDYIALVSRG----KAFIMRPWDGYSI---QVGKKGGVRYRRIQVDPEG  373 (668)
T ss_pred             eeecCCccccccccccccCHHHhhhhhccCCCcEEEEEecC----cEEEECCCCCeeE---EcCCCCceEEEEEccCCcc
Confidence             2222221100 000000 0       22344444433344    4555554433222   1111111111111112223


Q ss_pred             EEEEECCCCeEEEEEeecceeeeEEEeecccCCCCCCceEEeecCCcceeEEEecCcEEEEEEe-cCCcEEEEEee
Q 003800          238 LVTLDTTRSILVTVSFKNRKIAFQETHLSNLGEDSSGMVEILPSSLTGMFTVKINNYKLFIRLT-SEDKLEVVHKV  312 (794)
Q Consensus       238 lv~~d~~~g~L~v~~l~sg~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~-~~~~~~v~~~~  312 (794)
                      ++..+.+...|-+.+.++++  ++.+-      ++-+.++......+|-+++-.+++..|.-++ ++|.+.+.+.-
T Consensus       374 ~vigt~dgD~l~iyd~~~~e--~kr~e------~~lg~I~av~vs~dGK~~vvaNdr~el~vididngnv~~idkS  441 (668)
T COG4946         374 DVIGTNDGDKLGIYDKDGGE--VKRIE------KDLGNIEAVKVSPDGKKVVVANDRFELWVIDIDNGNVRLIDKS  441 (668)
T ss_pred             eEEeccCCceEEEEecCCce--EEEee------CCccceEEEEEcCCCcEEEEEcCceEEEEEEecCCCeeEeccc
Confidence            34444444578888888877  33332      1113344444445566666666644444444 47776665533


No 231
>PF02897 Peptidase_S9_N:  Prolyl oligopeptidase, N-terminal beta-propeller domain;  InterPro: IPR004106 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold. Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate; these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes [].
They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Notwithstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example, the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This entry represents the beta-propeller domain found at the N terminus of prolyl oligopeptidase, including acylamino-acid-releasing enzyme (also known as acylaminoacyl peptidase), which belong to the MEROPS peptidase family S9 (clan SC), subfamily S9A. The prolyl oligopeptidase family consists of a number of evolutionarily related peptidases whose catalytic activity seems to be provided by a charge relay system similar to that of the trypsin family of serine proteases, but which evolved by independent convergent evolution. The N-terminal domain of prolyl oligopeptidases forms an unusual 7-bladed beta-propeller consisting of seven 4-stranded beta-sheet motifs.
Prolyl oligopeptidase is a large cytosolic enzyme involved in the maturation and degradation of peptide hormones and neuropeptides, which relate to the induction of amnesia. The enzyme contains a peptidase domain, where its catalytic triad (Ser554, His680, Asp641) is covered by the central tunnel of the N-terminal beta-propeller domain. In this way, large structured peptides are excluded from the active site, thereby protecting larger peptides and proteins from proteolysis in the cytosol []. The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. Mammalian acylaminoacyl peptidase is an exopeptidase that is a member of the same prolyl oligopeptidase family of serine peptidases. This enzyme removes acylated amino acid residues from the N terminus of oligopeptides [].; GO: 0004252 serine-type endopeptidase activity, 0006508 proteolysis; PDB: 2BKL_B 3DDU_A 1YR2_A 2XE4_A 1VZ3_A 3EQ9_A 1O6F_A 3EQ7_A 4AN0_A 1UOP_A ....
Probab=25.35  E-value=9.8e+02  Score=26.73  Aligned_cols=65  Identities=9%  Similarity=-0.044  Sum_probs=37.7

Q ss_pred             CEEEEEECCCC---cEEEEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCce-eeeeee
Q 003800          154 GCLHAVSSIDG---EILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGEL-LNHETA  220 (794)
Q Consensus       154 g~l~ald~~tG---~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~-~w~~~v  220 (794)
                      ..++.++..++   ...|..-.+...-....+  ...++.+|+.+..+....+|+++++.+... -|+..+
T Consensus       252 s~v~~~d~~~~~~~~~~~~~l~~~~~~~~~~v--~~~~~~~yi~Tn~~a~~~~l~~~~l~~~~~~~~~~~l  320 (414)
T PF02897_consen  252 SEVYLLDLDDGGSPDAKPKLLSPREDGVEYYV--DHHGDRLYILTNDDAPNGRLVAVDLADPSPAEWWTVL  320 (414)
T ss_dssp             EEEEEEECCCTTTSS-SEEEEEESSSS-EEEE--EEETTEEEEEE-TT-TT-EEEEEETTSTSGGGEEEEE
T ss_pred             CeEEEEeccccCCCcCCcEEEeCCCCceEEEE--EccCCEEEEeeCCCCCCcEEEEecccccccccceeEE
Confidence            57888888875   445544332111111112  235888998887766678999999998886 355433


No 232
>KOG0650 consensus WD40 repeat nucleolar protein Bop1, involved in ribosome biogenesis [Translation, ribosomal structure and biogenesis]
Probab=25.35  E-value=2e+02  Score=34.02  Aligned_cols=31  Identities=23%  Similarity=0.414  Sum_probs=25.6

Q ss_pred             EEEEEccCCeEEEEeCCCCcEeEEEeccCcc
Q 003800           98 VITLSSDGSTLRAWNLPDGQMVWESFLRGSK  128 (794)
Q Consensus        98 ~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~  128 (794)
                      .+.-|+++|.||.|...||+-+|.+.+.+..
T Consensus       414 wlasGsdDGtvriWEi~TgRcvr~~~~d~~I  444 (733)
T KOG0650|consen  414 WLASGSDDGTVRIWEIATGRCVRTVQFDSEI  444 (733)
T ss_pred             eeeecCCCCcEEEEEeecceEEEEEeeccee
Confidence            4443566789999999999999999998754


No 233
>COG4447 Uncharacterized protein related to plant photosystem II stability/assembly factor [General function prediction only]
Probab=24.48  E-value=6.6e+02  Score=27.33  Aligned_cols=174  Identities=14%  Similarity=0.205  Sum_probs=86.1

Q ss_pred             CEEEEEeCCCEEEEEECcCCccceEEEcCcccce---eeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCcccc
Q 003800           54 KRVVVSTEENVIASLDLRHGEIFWRHVLGINDVV---DGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHS  130 (794)
Q Consensus        54 ~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i---~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s  130 (794)
                      ++=+..++.|  +-+-.+||-.-|+...+.....   .....+.+++.|.|+..|.-.+.|++  |+-.|.-.-..+.  
T Consensus       139 q~g~m~gd~G--ail~T~DgGk~Wk~l~e~~v~~~~~n~ia~s~dng~vaVg~rGs~f~T~~a--Gqt~~~~~g~~s~--  212 (339)
T COG4447         139 QRGEMLGDQG--AILKTTDGGKNWKALVEKAVGLAVPNEIARSADNGYVAVGARGSFFSTWGA--GQTVWLPHGRNSS--  212 (339)
T ss_pred             hhhhhhcccc--eEEEecCCcccHhHhcccccchhhhhhhhhhccCCeEEEecCcceEecCCC--CccEEeccCCCcc--
Confidence            3344455556  4567789999999877765321   11112456778888888877888886  8886655443322  


Q ss_pred             CCccccccccccccCCeEEEEECCEEEEEECCCCcEEEEEeccCccee--eeeEEE--EecCCEEEEEEecCCceeEEEE
Q 003800          131 KPLLLVPTNLKVDKDSLILVSSKGCLHAVSSIDGEILWTRDFAAESVE--VQQVIQ--LDESDQIYVVGYAGSSQFHAYQ  206 (794)
Q Consensus       131 ~~~~~~~~~~~~~~~~~V~V~~~g~l~ald~~tG~~~W~~~~~~~~~~--~~~~v~--s~~~~~vyv~~~~g~~~~~v~a  206 (794)
                      .....++.  ..++..-+++.......-.+...| --|+-........  +..+.+  -.+++.+|+.+..|+    |  
T Consensus       213 ~~letmg~--adag~~g~la~g~qg~~f~~~~~g-D~wsd~~~~~~~g~~~~Gl~d~a~~a~~~v~v~G~gGn----v--  283 (339)
T COG4447         213 RRLETMGL--ADAGSKGLLARGGQGDQFSWVCGG-DEWSDQGEPVNLGRRSWGLLDFAPRAPPEVWVSGIGGN----V--  283 (339)
T ss_pred             chhccccc--ccCCccceEEEccccceeecCCCc-ccccccccchhcccCCCccccccccCCCCeEEeccCcc----E--
Confidence            22233331  112122455543211222232333 3454322110000  001110  136788998777552    2  


Q ss_pred             EEcCCCceeeeeeeecccCccC--ceEEEcCc-EEEEEE
Q 003800          207 INAMNGELLNHETAAFSGGFVG--DVALVSSD-TLVTLD  242 (794)
Q Consensus       207 ld~~tG~~~w~~~v~~~~~~s~--~~~~vg~~-~lv~~d  242 (794)
                      +-...|-..|+.....+..+++  ++++.+.+ -++|.+
T Consensus       284 l~StdgG~t~skd~g~~er~s~l~~V~~ts~~~~~l~Gq  322 (339)
T COG4447         284 LASTDGGTTWSKDGGVEERVSNLYSVVFTSPKAGFLCGQ  322 (339)
T ss_pred             EEecCCCeeEeccCChhhhhhhhheEEeccCCceEEEcC
Confidence            2235677788875544433332  34443332 445554


No 234
>COG3292 Predicted periplasmic ligand-binding sensor domain [Signal transduction mechanisms]
Probab=24.32  E-value=3.3e+02  Score=32.36  Aligned_cols=70  Identities=11%  Similarity=0.181  Sum_probs=44.1

Q ss_pred             CCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccc
Q 003800           53 RKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKH  129 (794)
Q Consensus        53 ~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~  129 (794)
                      .+.++++|++| |.-+|+.+|+++=+-..+....|..+.....+ -+.|+++. .++-.+++.    |+..-.+..+
T Consensus       175 ~g~lWvgT~dG-L~~fd~~~gkalql~s~~~dk~I~al~~d~qg-~LWVGTdq-Gv~~~e~~G----~~~sn~~~~l  244 (671)
T COG3292         175 NGRLWVGTPDG-LSYFDAGRGKALQLASPPLDKAINALIADVQG-RLWVGTDQ-GVYLQEAEG----WRASNWGPML  244 (671)
T ss_pred             cCcEEEecCCc-ceEEccccceEEEcCCCcchhhHHHHHHHhcC-cEEEEecc-ceEEEchhh----ccccccCCCC
Confidence            67899999999 78899999998765444433334433122333 34446653 377777654    7777655443


No 235
>KOG0279 consensus G protein beta subunit-like protein [Signal transduction mechanisms]
Probab=24.27  E-value=9.4e+02  Score=26.11  Aligned_cols=57  Identities=21%  Similarity=0.326  Sum_probs=31.7

Q ss_pred             ECcCCccceEEEcCcccceeeeeee-eCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCc
Q 003800           69 DLRHGEIFWRHVLGINDVVDGIDIA-LGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGS  127 (794)
Q Consensus        69 n~~tG~ivWR~~l~~~~~i~~l~~~-~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~  127 (794)
                      |.+.|.++=|..-- ...+...... .|+..+. ++.++++|.||.++|+..=++..+..
T Consensus        49 d~~~G~~~r~~~GH-sH~v~dv~~s~dg~~alS-~swD~~lrlWDl~~g~~t~~f~GH~~  106 (315)
T KOG0279|consen   49 DIKYGVPVRRLTGH-SHFVSDVVLSSDGNFALS-ASWDGTLRLWDLATGESTRRFVGHTK  106 (315)
T ss_pred             ccccCceeeeeecc-ceEecceEEccCCceEEe-ccccceEEEEEecCCcEEEEEEecCC
Confidence            55566655554331 1112222111 2333333 34468999999999988877776653


No 236
>TIGR00548 lolB outer membrane lipoprotein LolB. This protein, LolB, is known so far only in the gamma and beta subdivisions of the Proteobacteria. It is a processed, lipid-modified outer membrane protein. It is required in E. coli for insertion of the major outer lipoprotein (Lpp) into the outer membrane. Lpp is transferred to LolB from the carrier protein LolA in the periplasm. Previously, this protein was thought to play a role in 5-aminolevulinic acid synthesis and was designated HemM.
Probab=22.40  E-value=1.4e+02  Score=30.45  Aligned_cols=58  Identities=14%  Similarity=0.137  Sum_probs=32.0

Q ss_pred             CCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcE
Q 003800           53 RKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQM  118 (794)
Q Consensus        53 ~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~l  118 (794)
                      .+++-+-+.+.      .-+|...|++.-+....+... -..|...+-+.++++.+...+ .+|+.
T Consensus        51 ~Gria~~~~~~------~~sa~~~W~q~~~~~~~l~L~-~PlG~~~~~l~~~~~~v~l~~-~~g~~  108 (202)
T TIGR00548        51 DGKVGYISPRD------SGSGRFFWQQRNQGYYDLRLS-GPLGRGALRLTGREGAVSLED-NGGGR  108 (202)
T ss_pred             eeeEEEECCCc------eeEEEEEEEECCCCceEEEEE-ccCCCcEEEEEEcCCEEEEEE-CCCCE
Confidence            55666666553      234556799985444334322 135666666655555566655 45554


No 237
>KOG1240 consensus Protein kinase containing WD40 repeats [Signal transduction mechanisms]
Probab=21.38  E-value=4.8e+02  Score=33.84  Aligned_cols=70  Identities=20%  Similarity=0.163  Sum_probs=54.0

Q ss_pred             EEEEEeCCCEEEEEECcCCccceEEEcCcc-cceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEec
Q 003800           55 RVVVSTEENVIASLDLRHGEIFWRHVLGIN-DVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFL  124 (794)
Q Consensus        55 ~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~-~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l  124 (794)
                      .|.++|.-+.+...|.++-.-+||.+.+.. +.+..+.+.....++++|+..|.+..||..=+.++=++..
T Consensus      1165 ~lvy~T~~~~iv~~D~r~~~~~w~lk~~~~hG~vTSi~idp~~~WlviGts~G~l~lWDLRF~~~i~sw~~ 1235 (1431)
T KOG1240|consen 1165 VLVYATDLSRIVSWDTRMRHDAWRLKNQLRHGLVTSIVIDPWCNWLVIGTSRGQLVLWDLRFRVPILSWEH 1235 (1431)
T ss_pred             eEEEEEeccceEEecchhhhhHHhhhcCccccceeEEEecCCceEEEEecCCceEEEEEeecCceeecccC
Confidence            688899999999999999999999888765 3343442334566888888788999999998877644443


No 238
>KOG0319 consensus WD40-repeat-containing subunit of the 18S rRNA processing complex [RNA processing and modification]
Probab=21.23  E-value=1.6e+03  Score=27.62  Aligned_cols=72  Identities=17%  Similarity=0.288  Sum_probs=43.5

Q ss_pred             CCEEEEEeCCCEEEEEECcCCccceEE-EcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEec
Q 003800           53 RKRVVVSTEENVIASLDLRHGEIFWRH-VLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFL  124 (794)
Q Consensus        53 ~~~Vyv~t~~g~l~ALn~~tG~ivWR~-~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l  124 (794)
                      ++...+..-.+.|--+|.+||++.=+. .-+..+.+..+.+.-++..++....+..++-|+..+|+++-++..
T Consensus        30 nG~~L~t~~~d~Vi~idv~t~~~~l~s~~~ed~d~ita~~l~~d~~~L~~a~rs~llrv~~L~tgk~irswKa  102 (775)
T KOG0319|consen   30 NGQHLYTACGDRVIIIDVATGSIALPSGSNEDEDEITALALTPDEEVLVTASRSQLLRVWSLPTGKLIRSWKA  102 (775)
T ss_pred             CCCEEEEecCceEEEEEccCCceecccCCccchhhhheeeecCCccEEEEeeccceEEEEEcccchHhHhHhh
Confidence            333444444667888999999987111 111112344443344444555445567899999999988766655


No 239
>PRK13861 type IV secretion system protein VirB9; Provisional
Probab=20.92  E-value=3.6e+02  Score=29.40  Aligned_cols=33  Identities=27%  Similarity=0.250  Sum_probs=21.0

Q ss_pred             HHHHHHHHHHHHhccccccceeecccccEeeEE
Q 003800            3 IRFIILTLLFLSSCTIPSLSLYEDQVGLMDWHQ   35 (794)
Q Consensus         3 ~~~~l~~l~~l~~~~~~~~Al~edqvG~~dW~~   35 (794)
                      +|.|+++|++|++|+.++.|.-....+..|=|-
T Consensus         2 ~~~~~~~~~~~~~~~~~a~A~~~p~~~~~D~RI   34 (292)
T PRK13861          2 IKKLFLTLACLLFAAIGALAEDTPAAGKLDPRM   34 (292)
T ss_pred             hhHHHHHHHHHHHhccchhHhhcCCCCCCCCce
Confidence            456777887877777777666555555444443


No 240
>PF14583 Pectate_lyase22:  Oligogalacturonate lyase; PDB: 3C5M_C 3PE7_A.
Probab=20.70  E-value=3.3e+02  Score=30.85  Aligned_cols=68  Identities=16%  Similarity=0.169  Sum_probs=35.8

Q ss_pred             CCEEEEEeC---CCEEEEEECcCCccceEEEcCccc--ceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEe
Q 003800           53 RKRVVVSTE---ENVIASLDLRHGEIFWRHVLGIND--VVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESF  123 (794)
Q Consensus        53 ~~~Vyv~t~---~g~l~ALn~~tG~ivWR~~l~~~~--~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~  123 (794)
                      +++++++++   ...++.||.+||++.  |..+.++  ...+.-...+..++++.. +..|+++|++|++..=-+.
T Consensus        47 G~kllF~s~~dg~~nly~lDL~t~~i~--QLTdg~g~~~~g~~~s~~~~~~~Yv~~-~~~l~~vdL~T~e~~~vy~  119 (386)
T PF14583_consen   47 GRKLLFASDFDGNRNLYLLDLATGEIT--QLTDGPGDNTFGGFLSPDDRALYYVKN-GRSLRRVDLDTLEERVVYE  119 (386)
T ss_dssp             S-EEEEEE-TTSS-EEEEEETTT-EEE--E---SS-B-TTT-EE-TTSSEEEEEET-TTEEEEEETTT--EEEEEE
T ss_pred             CCEEEEEeccCCCcceEEEEcccCEEE--ECccCCCCCccceEEecCCCeEEEEEC-CCeEEEEECCcCcEEEEEE
Confidence            445665665   457999999999873  3333221  222221223555667765 3589999999998753333


No 241
>KOG0301 consensus Phospholipase A2-activating protein (contains WD40 repeats) [Lipid transport and metabolism]
Probab=20.50  E-value=8.2e+02  Score=29.73  Aligned_cols=94  Identities=22%  Similarity=0.268  Sum_probs=49.4

Q ss_pred             EEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCCccc
Q 003800           56 VVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLL  135 (794)
Q Consensus        56 Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~  135 (794)
                      ..-.+.+|.|---|. ||+.+=|..-.+.- +..+....+++.++-+|.++++|-|+..  ...=...+++..    .+.
T Consensus       193 flScsNDg~Ir~w~~-~ge~l~~~~ghtn~-vYsis~~~~~~~Ivs~gEDrtlriW~~~--e~~q~I~lPtts----iWs  264 (745)
T KOG0301|consen  193 FLSCSNDGSIRLWDL-DGEVLLEMHGHTNF-VYSISMALSDGLIVSTGEDRTLRIWKKD--ECVQVITLPTTS----IWS  264 (745)
T ss_pred             eEeecCCceEEEEec-cCceeeeeeccceE-EEEEEecCCCCeEEEecCCceEEEeecC--ceEEEEecCccc----eEE
Confidence            444445666666665 66666554433221 1122223455555546778999999864  444344443321    121


Q ss_pred             cccccccccCCeEEEE-ECCEEEEEEC
Q 003800          136 VPTNLKVDKDSLILVS-SKGCLHAVSS  161 (794)
Q Consensus       136 ~~~~~~~~~~~~V~V~-~~g~l~ald~  161 (794)
                      +-   ... .+++++. +||.|+.+..
T Consensus       265 a~---~L~-NgDIvvg~SDG~VrVfT~  287 (745)
T KOG0301|consen  265 AK---VLL-NGDIVVGGSDGRVRVFTV  287 (745)
T ss_pred             EE---Eee-CCCEEEeccCceEEEEEe
Confidence            11   111 5677776 6888776643


No 242
>KOG1445 consensus Tumor-specific antigen (contains WD repeats) [Cytoskeleton]
Probab=20.43  E-value=1.1e+03  Score=28.50  Aligned_cols=60  Identities=8%  Similarity=0.034  Sum_probs=40.9

Q ss_pred             eCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCCccccccccccccCCeEEEEE--CCEEEEEECC
Q 003800           94 LGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKVDKDSLILVSS--KGCLHAVSSI  162 (794)
Q Consensus        94 ~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~~~~~~V~V~~--~g~l~ald~~  162 (794)
                      ..+++++.+. ++.++.||+.+++.+-+....+... +++..       ..++.++..+  |..+.-+|+.
T Consensus       139 TaDgil~s~a-~g~v~i~D~stqk~~~el~~h~d~v-QSa~W-------seDG~llatscKdkqirifDPR  200 (1012)
T KOG1445|consen  139 TADGILASGA-HGSVYITDISTQKTAVELSGHTDKV-QSADW-------SEDGKLLATSCKDKQIRIFDPR  200 (1012)
T ss_pred             CcCceEEecc-CceEEEEEcccCceeecccCCchhh-hcccc-------ccCCceEeeecCCcceEEeCCc
Confidence            3566777444 5799999999999999988777655 22222       2255555543  6677777765


No 243
>PF01453 B_lectin:  D-mannose binding lectin;  InterPro: IPR001480 A bulb lectin super-family (Amaryllidaceae, Orchidaceae and Alliaceae) contains a ~115-residue-long domain whose overall three-dimensional fold is very similar to that of Dictyostelium discoideum comitin, an actin-binding protein, and Curculigo latifolia curculin, a sweet-tasting and taste-modifying protein. This domain generally binds mannose, but in at least one protein, curculin, it is apparently devoid of mannose-binding activity. Each bulb-type lectin domain consists of three sequential beta-sheet subdomains (I, II, III) that are inter-related by pseudo three-fold symmetry. The three subdomains are flat four-stranded, antiparallel beta-sheets. Together they form a 12-stranded beta-barrel in which the barrel axis coincides with the pseudo 3-fold axis.; GO: 0005529 sugar binding; PDB: 3M7H_A 3M7J_B 3MEZ_D 1DLP_A 1BWU_D 1KJ1_A 1B2P_A 1XD6_A 2DPF_C 2D04_B ....
Probab=20.40  E-value=6.6e+02  Score=22.93  Aligned_cols=60  Identities=17%  Similarity=0.410  Sum_probs=38.2

Q ss_pred             CCEEEEEeCCCEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEE
Q 003800           53 RKRVVVSTEENVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWES  122 (794)
Q Consensus        53 ~~~Vyv~t~~g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~  122 (794)
                      ++..+..+.+|.|.-.|.. |+++|...-....   +    ...-.+.+..+ |.+..+| .+|+.+|+.
T Consensus        19 ~~~~L~l~~dGnLvl~~~~-~~~iWss~~t~~~---~----~~~~~~~L~~~-GNlvl~d-~~~~~lW~S   78 (114)
T PF01453_consen   19 GNYTLILQSDGNLVLYDSN-GSVIWSSNNTSGR---G----NSGCYLVLQDD-GNLVLYD-SSGNVLWQS   78 (114)
T ss_dssp             TTEEEEEETTSEEEEEETT-TEEEEE--S-TTS---S-----SSEEEEEETT-SEEEEEE-TTSEEEEES
T ss_pred             ccccceECCCCeEEEEcCC-CCEEEEecccCCc---c----ccCeEEEEeCC-CCEEEEe-ecceEEEee
Confidence            4457778889999888865 8889997221110   0    01223444554 5788888 599999997


No 244
>KOG2110 consensus Uncharacterized conserved protein, contains WD40 repeats [Function unknown]
Probab=20.15  E-value=1.3e+03  Score=26.06  Aligned_cols=176  Identities=6%  Similarity=0.086  Sum_probs=98.1

Q ss_pred             CEEEEEECcCCccceEEEcCcccceeeeeeeeCCEEEEEEccCCeEEEEeCCCCcEeEEEeccCccccCCcccccccccc
Q 003800           63 NVIASLDLRHGEIFWRHVLGINDVVDGIDIALGKYVITLSSDGSTLRAWNLPDGQMVWESFLRGSKHSKPLLLVPTNLKV  142 (794)
Q Consensus        63 g~l~ALn~~tG~ivWR~~l~~~~~i~~l~~~~g~~~V~Vs~~g~~v~A~d~~tG~llWe~~l~~~~~s~~~~~~~~~~~~  142 (794)
                      ..+--+|-+-+.++=+..++.+  +-.+  ......++|--. ..++-+|..+=+++=......+.. .++...    ..
T Consensus        68 r~Lkv~~~Kk~~~ICe~~fpt~--IL~V--rmNr~RLvV~Le-e~IyIydI~~MklLhTI~t~~~n~-~gl~Al----S~  137 (391)
T KOG2110|consen   68 RKLKVVHFKKKTTICEIFFPTS--ILAV--RMNRKRLVVCLE-ESIYIYDIKDMKLLHTIETTPPNP-KGLCAL----SP  137 (391)
T ss_pred             ceEEEEEcccCceEEEEecCCc--eEEE--EEccceEEEEEc-ccEEEEecccceeehhhhccCCCc-cceEee----cc
Confidence            3577778888888888777665  4333  334444443333 259999999999987776653221 222222    11


Q ss_pred             ccCCeEEEE----ECCEEEEEECCCCcEEEEEeccCcceeeeeEEEEecCCEEEEEEecCCceeEEEEEEcCCCceeeee
Q 003800          143 DKDSLILVS----SKGCLHAVSSIDGEILWTRDFAAESVEVQQVIQLDESDQIYVVGYAGSSQFHAYQINAMNGELLNHE  218 (794)
Q Consensus       143 ~~~~~V~V~----~~g~l~ald~~tG~~~W~~~~~~~~~~~~~~v~s~~~~~vyv~~~~g~~~~~v~ald~~tG~~~w~~  218 (794)
                      ..++-.+++    +.|.|+-+|..+=++.=..+.-...+   .++.-..+|.+.+-+...|  ..+..++..+|+.+.|.
T Consensus       138 n~~n~ylAyp~s~t~GdV~l~d~~nl~~v~~I~aH~~~l---Aalafs~~G~llATASeKG--TVIRVf~v~~G~kl~eF  212 (391)
T KOG2110|consen  138 NNANCYLAYPGSTTSGDVVLFDTINLQPVNTINAHKGPL---AALAFSPDGTLLATASEKG--TVIRVFSVPEGQKLYEF  212 (391)
T ss_pred             CCCCceEEecCCCCCceEEEEEcccceeeeEEEecCCce---eEEEECCCCCEEEEeccCc--eEEEEEEcCCccEeeee
Confidence            212222222    26788888877776666665444333   3332234566655444433  24556677899999998


Q ss_pred             eeecc-cCccCceEEE-cCcEEEEEECCCCeEEEEEeec
Q 003800          219 TAAFS-GGFVGDVALV-SSDTLVTLDTTRSILVTVSFKN  255 (794)
Q Consensus       219 ~v~~~-~~~s~~~~~v-g~~~lv~~d~~~g~L~v~~l~s  255 (794)
                      +-+.. ..+- +..|- ...++ |..++++.+|+.-|+.
T Consensus       213 RRG~~~~~Iy-SL~Fs~ds~~L-~~sS~TeTVHiFKL~~  249 (391)
T KOG2110|consen  213 RRGTYPVSIY-SLSFSPDSQFL-AASSNTETVHIFKLEK  249 (391)
T ss_pred             eCCceeeEEE-EEEECCCCCeE-EEecCCCeEEEEEecc
Confidence            74432 2221 12222 23344 4444677777777654


No 245
>PF08894 DUF1838:  Protein of unknown function (DUF1838);  InterPro: IPR014990 This group of proteins are functionally uncharacterised. 
Probab=20.15  E-value=74  Score=33.29  Aligned_cols=67  Identities=24%  Similarity=0.198  Sum_probs=41.5

Q ss_pred             eEeeccCCceEEEEEEcCCCCCCcCCCCCCCcEEEEEEEEceeeeEEEEEEecCCCCCceEEEEecEEEE
Q 003800          700 VMYKYISKNLLFVATVAPKASGHIGSADPDEAWLVVYLIDTITGRILHRMTHHGAQGPVHAVLSENWVVY  769 (794)
Q Consensus       700 VLYKYLNPNl~~v~t~~~~~~~~~~~~~~~~~~l~v~liD~VTG~il~s~~h~~~~~pi~~v~~ENWvvY  769 (794)
                      .|+|-.-=|..-.+.....+.+. | -.--...|++|+ |.+||+||++-.-+-....|.+||..|=.|=
T Consensus        24 ~LF~ieGmnv~rcv~~~~g~~~~-~-~r~lSREl~~Y~-DP~TgeIL~~W~npwt~e~vpVvhVaNdpv~   90 (238)
T PF08894_consen   24 LLFKIEGMNVARCVPDEDGEGGE-G-YRFLSRELTFYL-DPVTGEILETWENPWTGEVVPVVHVANDPVN   90 (238)
T ss_pred             eeeeeeeeeeeEeeecCCCcchh-h-hhhhhheeeEEe-CCchhhHHHhhcCCCcCCccceEEeccCccc
Confidence            45555555666665554432110 0 000134677777 9999999999888877777888887654443


Done!