Query         047021
Match_columns 388
No_of_seqs    17 out of 19
Neff          2.9 
Searched_HMMs 46136
Date          Fri Mar 29 06:50:27 2013
Command       hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/047021.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/047021hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 TIGR02168 SMC_prok_B chromosom  98.0    0.04 8.6E-07   58.9  34.5   73  207-279   864-936 (1179)
  2 TIGR02169 SMC_prok_A chromosom  97.9   0.054 1.2E-06   58.3  41.8  105    4-109   674-778 (1164)
  3 TIGR02168 SMC_prok_B chromosom  97.8   0.088 1.9E-06   56.3  34.3   22  335-356   993-1014(1179)
  4 PF00038 Filament:  Intermediat  97.5   0.084 1.8E-06   50.0  28.1   96   11-106    18-113 (312)
  5 TIGR02169 SMC_prok_A chromosom  97.4    0.33   7E-06   52.5  34.3   37  209-245   427-463 (1164)
  6 KOG0161 Myosin class II heavy   97.2    0.93   2E-05   54.6  34.6  345    5-356   937-1336(1930)
  7 PF13514 AAA_27:  AAA domain     97.0    0.94   2E-05   51.0  40.4  253    3-264   345-669 (1111)
  8 COG1196 Smc Chromosome segrega  96.9    0.34 7.4E-06   54.7  24.4   76    5-81    731-806 (1163)
  9 PF00038 Filament:  Intermediat  95.9     1.6 3.4E-05   41.5  21.3  119  155-283    10-142 (312)
 10 TIGR00606 rad50 rad50. This fa  95.9     4.8  0.0001   46.4  39.4   32  322-353  1062-1093(1311)
 11 PRK11281 hypothetical protein;  95.8     5.2 0.00011   46.2  26.5  189  132-359   125-323 (1113)
 12 PF14662 CCDC155:  Coiled-coil   95.3     2.7 5.9E-05   40.0  20.5  166    2-180     6-189 (193)
 13 PRK04863 mukB cell division pr  95.3     7.6 0.00016   46.1  25.4  182   68-249   435-671 (1486)
 14 PRK02224 chromosome segregatio  95.2     5.6 0.00012   43.1  34.1   34   79-112   555-588 (880)
 15 PF07888 CALCOCO1:  Calcium bin  95.2     5.6 0.00012   43.0  31.3  225    5-284   144-376 (546)
 16 PHA02562 46 endonuclease subun  95.0     4.8  0.0001   40.9  26.2  138   11-152   174-323 (562)
 17 PF10174 Cast:  RIM-binding pro  94.7     9.1  0.0002   42.8  31.4   84   10-107     2-85  (775)
 18 KOG0161 Myosin class II heavy   94.5      16 0.00035   44.7  37.6  157   70-255   903-1059(1930)
 19 PF09730 BicD:  Microtubule-ass  93.3      16 0.00036   40.6  30.5  151    8-181    31-181 (717)
 20 PF05622 HOOK:  HOOK protein;    93.2   0.025 5.4E-07   60.4   0.0  132    8-150   190-330 (713)
 21 COG1196 Smc Chromosome segrega  93.2      19 0.00042   41.2  41.3   42  328-369   970-1011(1163)
 22 TIGR00606 rad50 rad50. This fa  92.9      23 0.00049   41.1  41.2  188   45-256   742-935 (1311)
 23 PRK11637 AmiB activator; Provi  92.8      12 0.00026   37.9  25.2   40   73-112    94-133 (428)
 24 PF15070 GOLGA2L5:  Putative go  92.8      17 0.00037   39.6  27.3   79    9-92     99-177 (617)
 25 PF13514 AAA_27:  AAA domain     92.7      21 0.00046   40.5  35.0  161   70-244   668-836 (1111)
 26 PF05557 MAD:  Mitotic checkpoi  92.5     1.8 3.8E-05   46.7  12.5  138  202-362   503-647 (722)
 27 PF09728 Taxilin:  Myosin-like   92.1      14  0.0003   36.8  29.7  254   13-286    17-307 (309)
 28 TIGR02680 conserved hypothetic  92.1      30 0.00065   40.7  31.1  121   11-148   742-864 (1353)
 29 PF04849 HAP1_N:  HAP1 N-termin  91.8      16 0.00035   37.0  17.8  166    3-183    81-254 (306)
 30 TIGR01005 eps_transp_fam exopo  91.4      13 0.00029   39.9  17.5   90   54-147   237-335 (754)
 31 PRK09039 hypothetical protein;  91.3      17 0.00038   36.4  18.4   35   43-77     70-104 (343)
 32 PRK11637 AmiB activator; Provi  91.0      19 0.00042   36.4  21.6   88    5-93     48-135 (428)
 33 PF07888 CALCOCO1:  Calcium bin  91.0      27 0.00058   38.0  33.7   75    6-81    152-226 (546)
 34 KOG0980 Actin-binding protein   91.0      35 0.00076   39.3  23.6  119  154-283   387-505 (980)
 35 KOG1029 Endocytic adaptor prot  90.6      37 0.00079   39.0  26.3  228    6-249   319-582 (1118)
 36 PF15619 Lebercilin:  Ciliary p  90.4      15 0.00034   34.4  19.9   93   75-184    82-185 (194)
 37 PF00769 ERM:  Ezrin/radixin/mo  90.1      18  0.0004   34.7  15.8   29  327-355   174-202 (246)
 38 PF04849 HAP1_N:  HAP1 N-termin  90.1      22 0.00047   36.1  16.5   84    4-112   160-243 (306)
 39 PF05557 MAD:  Mitotic checkpoi  88.8     6.2 0.00013   42.6  12.5   23  163-185   566-588 (722)
 40 PF01576 Myosin_tail_1:  Myosin  88.5    0.14 2.9E-06   56.6   0.0  304   14-364   134-455 (859)
 41 COG4477 EzrA Negative regulato  88.0      45 0.00098   36.5  21.3  125    4-150   347-476 (570)
 42 PF09726 Macoilin:  Transmembra  87.9      49  0.0011   36.7  22.0  179   16-242   451-648 (697)
 43 PF12128 DUF3584:  Protein of u  87.3      64  0.0014   37.4  35.8  218    5-223   615-869 (1201)
 44 PF08317 Spc7:  Spc7 kinetochor  86.6      27 0.00058   34.5  14.5   53    7-60    152-204 (325)
 45 PRK09039 hypothetical protein;  86.3      39 0.00085   34.0  16.4   25  205-229   175-200 (343)
 46 KOG4807 F-actin binding protei  86.3      52  0.0011   35.3  19.1   59  223-281   414-479 (593)
 47 PF10174 Cast:  RIM-binding pro  86.1      66  0.0014   36.3  32.2  103    5-113   239-353 (775)
 48 PF07106 TBPIP:  Tat binding pr  83.1      21 0.00046   31.7  11.0   80  125-214    69-157 (169)
 49 PF08317 Spc7:  Spc7 kinetochor  80.2      65  0.0014   31.9  18.4  107   92-212   180-286 (325)
 50 TIGR03017 EpsF chain length de  79.2      72  0.0016   31.8  18.2   45   99-147   257-301 (444)
 51 PRK03918 chromosome segregatio  79.1   1E+02  0.0022   33.5  33.8   28  157-184   406-433 (880)
 52 TIGR03752 conj_TIGR03752 integ  78.3      11 0.00023   40.3   8.6   76    2-112    64-139 (472)
 53 PF09726 Macoilin:  Transmembra  77.6 1.2E+02  0.0027   33.7  21.2   46    4-49    460-505 (697)
 54 KOG4674 Uncharacterized conser  77.4   2E+02  0.0042   35.9  36.8  155  202-357   391-561 (1822)
 55 TIGR01005 eps_transp_fam exopo  76.5      37  0.0008   36.6  12.2  108    4-111   288-398 (754)
 56 PF14992 TMCO5:  TMCO5 family    76.4      91   0.002   31.5  17.8  158    8-185     8-182 (280)
 57 PF11932 DUF3450:  Protein of u  75.6      14  0.0003   34.9   7.9   54    5-58     57-113 (251)
 58 KOG1962 B-cell receptor-associ  75.5      22 0.00048   34.5   9.3   56  198-253   154-209 (216)
 59 PF15397 DUF4618:  Domain of un  75.1      93   0.002   31.0  22.1  194   34-258    22-221 (258)
 60 PRK10929 putative mechanosensi  74.9 1.8E+02   0.004   34.3  28.1  105  245-361   198-305 (1109)
 61 PF12718 Tropomyosin_1:  Tropom  73.0      70  0.0015   28.6  13.6   86    4-108    21-106 (143)
 62 smart00338 BRLZ basic region l  72.8     8.8 0.00019   29.3   4.8   38    4-41     26-63  (65)
 63 KOG0996 Structural maintenance  72.6 2.2E+02  0.0048   34.2  22.4  233    2-240   790-1032(1293)
 64 PRK11281 hypothetical protein;  72.4 2.1E+02  0.0045   33.8  26.3   82   99-183   166-254 (1113)
 65 TIGR03007 pepcterm_ChnLen poly  72.4 1.2E+02  0.0026   31.0  20.9   34   82-115   204-237 (498)
 66 PF01576 Myosin_tail_1:  Myosin  71.5     1.3 2.8E-05   49.2   0.0   70  205-274   380-449 (859)
 67 PF06005 DUF904:  Protein of un  70.6      56  0.0012   26.5   9.6   65  218-285     6-70  (72)
 68 KOG0977 Nuclear envelope prote  70.5 1.7E+02  0.0037   32.1  15.6   50   35-85    165-214 (546)
 69 PF11932 DUF3450:  Protein of u  69.6   1E+02  0.0022   29.2  13.8   72  201-272    48-119 (251)
 70 COG4942 Membrane-bound metallo  69.4 1.6E+02  0.0035   31.3  23.2   24  160-183   196-219 (420)
 71 PF09755 DUF2046:  Uncharacteri  69.0 1.4E+02  0.0031   30.6  23.9   61    7-67     23-97  (310)
 72 PF00170 bZIP_1:  bZIP transcri  68.7      13 0.00029   28.3   5.0   36    5-40     27-62  (64)
 73 PF12777 MT:  Microtubule-bindi  68.6      31 0.00068   34.3   8.8  108   25-132     8-121 (344)
 74 TIGR03185 DNA_S_dndD DNA sulfu  67.6 1.8E+02   0.004   31.3  30.2   42   71-112   205-246 (650)
 75 PF08614 ATG16:  Autophagy prot  65.2      39 0.00085   30.9   8.2  145   96-251    31-179 (194)
 76 PRK02224 chromosome segregatio  65.2 2.2E+02  0.0047   31.3  42.6   33   77-109   407-439 (880)
 77 PF15035 Rootletin:  Ciliary ro  64.8      14 0.00031   34.4   5.3   43    2-44     65-107 (182)
 78 TIGR01843 type_I_hlyD type I s  64.0 1.5E+02  0.0031   28.8  18.1  169   13-182    76-265 (423)
 79 TIGR03017 EpsF chain length de  63.8 1.3E+02  0.0029   30.0  12.2   80    6-85    256-335 (444)
 80 PF05701 WEMBL:  Weak chloropla  62.1 2.2E+02  0.0048   30.3  31.8  167    4-185    27-194 (522)
 81 KOG0977 Nuclear envelope prote  62.0 2.5E+02  0.0054   30.9  24.2   83   30-112    61-157 (546)
 82 PF01166 TSC22:  TSC-22/dip/bun  60.9     9.8 0.00021   30.5   3.0   25    2-26     19-43  (59)
 83 COG4372 Uncharacterized protei  60.0 2.5E+02  0.0055   30.3  24.0   87   10-111    94-180 (499)
 84 PRK10884 SH3 domain-containing  59.8 1.1E+02  0.0023   29.2  10.2   33   83-115   140-172 (206)
 85 KOG3977 Troponin I [Cytoskelet  58.6      23  0.0005   34.4   5.7   75    7-85     98-181 (221)
 86 TIGR02231 conserved hypothetic  58.6 2.1E+02  0.0045   29.9  13.0   44   70-113    66-109 (525)
 87 PF04012 PspA_IM30:  PspA/IM30   58.5 1.5E+02  0.0032   27.2  16.5  137    9-146    28-183 (221)
 88 PF14197 Cep57_CLD_2:  Centroso  58.4      45 0.00098   26.8   6.5   39   30-69      3-41  (69)
 89 COG4942 Membrane-bound metallo  58.3 2.6E+02  0.0056   29.8  28.3  105   11-130    38-142 (420)
 90 TIGR00219 mreC rod shape-deter  58.0      32 0.00068   33.7   6.6   52  229-283    58-109 (283)
 91 PF09755 DUF2046:  Uncharacteri  56.3 2.4E+02  0.0053   29.0  22.5   91   21-115   109-204 (310)
 92 PF13935 Ead_Ea22:  Ead/Ea22-li  56.1      67  0.0015   28.3   7.8   59  227-285    71-132 (139)
 93 KOG0249 LAR-interacting protei  54.8 1.6E+02  0.0035   33.8  11.9   91   17-110   162-258 (916)
 94 PF12329 TMF_DNA_bd:  TATA elem  54.7      27 0.00058   28.2   4.6   38    5-42     34-71  (74)
 95 PF04859 DUF641:  Plant protein  54.7      28 0.00061   31.3   5.2   46    2-47     78-123 (131)
 96 PF15294 Leu_zip:  Leucine zipp  54.2 1.7E+02  0.0038   29.5  11.1  124  235-385   130-254 (278)
 97 KOG4674 Uncharacterized conser  54.1 5.5E+02   0.012   32.3  33.8  131    8-145   569-706 (1822)
 98 PF12709 Kinetocho_Slk19:  Cent  52.9      28 0.00062   29.6   4.7   38    4-41     49-86  (87)
 99 PF06156 DUF972:  Protein of un  52.7      79  0.0017   27.4   7.5   53  213-286     5-57  (107)
100 PF13870 DUF4201:  Domain of un  52.7 1.7E+02  0.0038   26.2  12.9   90   15-112    46-135 (177)
101 PF08614 ATG16:  Autophagy prot  52.6 1.4E+02  0.0031   27.3   9.6   37    6-42    118-154 (194)
102 KOG0612 Rho-associated, coiled  51.7 5.2E+02   0.011   31.4  30.1   81   22-106   465-549 (1317)
103 PHA02562 46 endonuclease subun  51.5 2.9E+02  0.0063   28.4  23.6  146    4-150   174-328 (562)
104 PRK10929 putative mechanosensi  50.7   5E+02   0.011   30.9  27.4  116   70-188   104-240 (1109)
105 PRK10246 exonuclease subunit S  50.6 4.5E+02  0.0098   30.3  31.1   25  199-223   774-798 (1047)
106 cd07429 Cby_like Chibby, a nuc  50.4      17 0.00037   31.9   3.1   25    5-29     73-97  (108)
107 PF05667 DUF812:  Protein of un  49.9 3.9E+02  0.0085   29.4  23.7  139    4-156   335-475 (594)
108 PF00261 Tropomyosin:  Tropomyo  49.5 2.3E+02  0.0051   26.7  27.2  103  154-256   125-230 (237)
109 PF05622 HOOK:  HOOK protein;    48.7     5.8 0.00013   42.9   0.0  100   51-151   278-386 (713)
110 KOG0964 Structural maintenance  48.4 5.5E+02   0.012   30.7  28.8  169   71-256   268-444 (1200)
111 PF01724 DUF29:  Domain of unkn  48.2 1.9E+02  0.0042   25.7   9.4   45  227-273    69-113 (139)
112 COG1579 Zn-ribbon protein, pos  48.0 2.9E+02  0.0062   27.3  20.7   96   25-144    10-105 (239)
113 PRK14154 heat shock protein Gr  48.0      93   0.002   30.0   7.9   56  323-378    54-121 (208)
114 PF08172 CASP_C:  CASP C termin  47.9      33 0.00072   33.4   5.0   42    3-44     85-126 (248)
115 PRK04778 septation ring format  47.8 3.8E+02  0.0082   28.7  27.9  260    4-281   230-506 (569)
116 TIGR02231 conserved hypothetic  47.5 1.7E+02  0.0037   30.6  10.3   95   13-109    73-172 (525)
117 PF00769 ERM:  Ezrin/radixin/mo  47.5 2.7E+02  0.0059   26.9  12.6  106    6-111     7-118 (246)
118 PF05377 FlaC_arch:  Flagella a  46.9      38 0.00082   26.7   4.2   38   12-49      1-38  (55)
119 TIGR02449 conserved hypothetic  46.5      43 0.00093   27.1   4.6   38    4-41      7-44  (65)
120 KOG4643 Uncharacterized coiled  46.0   6E+02   0.013   30.5  27.5  103    8-112   174-310 (1195)
121 COG4985 ABC-type phosphate tra  45.8 1.6E+02  0.0034   29.8   9.2   65   23-87    162-233 (289)
122 PF12795 MscS_porin:  Mechanose  45.2 2.7E+02  0.0058   26.2  23.3  186   76-290    32-217 (240)
123 PRK04863 mukB cell division pr  45.1 6.7E+02   0.014   30.7  38.5   27  162-188   462-488 (1486)
124 PF07989 Microtub_assoc:  Micro  45.0      64  0.0014   26.3   5.4   53  235-287     5-58  (75)
125 PF04880 NUDE_C:  NUDE protein,  45.0      27  0.0006   32.5   3.8   43    6-48      2-47  (166)
126 TIGR03752 conj_TIGR03752 integ  44.6 2.4E+02  0.0053   30.5  11.0   23    4-26     73-95  (472)
127 PF12128 DUF3584:  Protein of u  44.3 5.9E+02   0.013   29.9  41.4   50   64-113   486-535 (1201)
128 PF15294 Leu_zip:  Leucine zipp  43.6 3.6E+02  0.0079   27.3  18.2  174  203-382     2-205 (278)
129 PF08826 DMPK_coil:  DMPK coile  43.5      77  0.0017   25.3   5.5   40    8-47     15-54  (61)
130 KOG0018 Structural maintenance  43.0 6.7E+02   0.014   30.1  25.5  226   13-273   222-473 (1141)
131 PF04156 IncA:  IncA protein;    42.5 2.5E+02  0.0054   25.0  13.2   34    6-39     83-116 (191)
132 PF09730 BicD:  Microtubule-ass  41.5 5.8E+02   0.013   29.0  26.5  140  218-361   113-305 (717)
133 PF05600 DUF773:  Protein of un  41.2 4.6E+02  0.0099   28.2  12.4  114  235-372    96-229 (507)
134 PF05103 DivIVA:  DivIVA protei  40.9      13 0.00028   31.0   0.9   19   94-112   105-123 (131)
135 PRK10698 phage shock protein P  40.4 3.3E+02  0.0072   25.9  13.8   68   70-144   115-182 (222)
136 COG2433 Uncharacterized conser  40.0 3.4E+02  0.0073   30.6  11.4   87    4-109   422-508 (652)
137 KOG4403 Cell surface glycoprot  39.9 5.5E+02   0.012   28.3  15.4  137  125-293   239-382 (575)
138 PF09403 FadA:  Adhesion protei  39.9 2.8E+02   0.006   24.9  13.5   60  123-185    15-74  (126)
139 PF10168 Nup88:  Nuclear pore c  39.7   6E+02   0.013   28.6  16.2   43    4-46    579-621 (717)
140 PF04111 APG6:  Autophagy prote  39.0 4.1E+02  0.0089   26.6  11.5   88  203-292    51-138 (314)
141 PF06005 DUF904:  Protein of un  38.9 1.4E+02  0.0031   24.2   6.6   36    5-40     19-54  (72)
142 PRK10884 SH3 domain-containing  38.1 2.9E+02  0.0062   26.4   9.4   20  127-146    92-111 (206)
143 PF00261 Tropomyosin:  Tropomyo  37.7 3.6E+02  0.0078   25.5  22.4  197   71-286    11-218 (237)
144 PF15035 Rootletin:  Ciliary ro  37.5 3.5E+02  0.0076   25.3  15.2  117  104-220    31-166 (182)
145 KOG4797 Transcriptional regula  37.2      33 0.00072   30.8   2.9   24    2-25     72-95  (123)
146 PF02994 Transposase_22:  L1 tr  37.1      74  0.0016   32.4   5.7   45    3-47    143-187 (370)
147 smart00787 Spc7 Spc7 kinetocho  36.6 4.6E+02    0.01   26.5  18.2  102   98-213   181-282 (312)
148 KOG4643 Uncharacterized coiled  35.9 8.5E+02   0.018   29.3  31.6   85  203-287   489-600 (1195)
149 TIGR02894 DNA_bind_RsfA transc  35.6      96  0.0021   29.1   5.7   39    6-44     99-137 (161)
150 TIGR01010 BexC_CtrB_KpsE polys  35.6 4.4E+02  0.0096   26.0  14.4   83   62-148   164-262 (362)
151 PRK13922 rod shape-determining  35.2 3.6E+02  0.0078   25.6   9.7   70  210-283    42-111 (276)
152 PRK04406 hypothetical protein;  35.1 1.3E+02  0.0028   24.5   5.8   52   34-88      6-59  (75)
153 PF04102 SlyX:  SlyX;  InterPro  34.3 1.4E+02  0.0031   23.5   5.8   45    3-47      3-47  (69)
154 PF11336 DUF3138:  Protein of u  34.0      39 0.00084   36.4   3.3   67   71-139    28-94  (514)
155 PRK10246 exonuclease subunit S  33.5 8.1E+02   0.018   28.4  42.6   20  289-308   498-517 (1047)
156 PF10168 Nup88:  Nuclear pore c  33.4 7.4E+02   0.016   27.9  13.2   32  195-227   633-664 (717)
157 PRK02119 hypothetical protein;  33.1 1.5E+02  0.0032   24.0   5.8   53   33-88      3-57  (73)
158 TIGR02680 conserved hypothetic  33.0 9.3E+02    0.02   28.9  32.5  108    5-112   284-398 (1353)
159 PF09304 Cortex-I_coil:  Cortex  32.8 3.6E+02  0.0077   24.0  12.6   57  101-168    42-98  (107)
160 PRK00736 hypothetical protein;  32.6 1.3E+02  0.0028   24.0   5.3   50   36-88      2-53  (68)
161 PRK11546 zraP zinc resistance   32.5 1.9E+02  0.0041   26.6   7.0   25  126-150    87-111 (143)
162 COG3206 GumC Uncharacterized p  32.4 5.7E+02   0.012   26.2  11.7  104    6-109   287-393 (458)
163 COG3880 Modulator of heat shoc  32.3   1E+02  0.0022   29.4   5.3   40   70-112   131-171 (176)
164 PF00992 Troponin:  Troponin;    32.2 1.5E+02  0.0033   26.3   6.3   75    6-80     52-131 (132)
165 TIGR02449 conserved hypothetic  31.7 1.3E+02  0.0029   24.3   5.2   37   12-48     29-65  (65)
166 PF08429 PLU-1:  PLU-1-like pro  31.7 4.3E+02  0.0094   25.6   9.8   79  206-285   202-294 (335)
167 PF06632 XRCC4:  DNA double-str  31.6 3.6E+02  0.0078   27.7   9.5   49    1-49    134-182 (342)
168 PF14988 DUF4515:  Domain of un  31.2 4.7E+02    0.01   24.9  17.7  114  131-258    57-177 (206)
169 KOG0946 ER-Golgi vesicle-tethe  31.2 9.4E+02    0.02   28.4  20.6   72    5-77    672-743 (970)
170 PF11336 DUF3138:  Protein of u  31.1 1.2E+02  0.0027   32.8   6.3   79  117-223    14-104 (514)
171 TIGR02894 DNA_bind_RsfA transc  30.8 1.1E+02  0.0024   28.7   5.3   32   15-46    101-132 (161)
172 PF08537 NBP1:  Fungal Nap bind  30.6      95  0.0021   32.0   5.2   39    5-43    183-221 (323)
173 PF03112 DUF244:  Uncharacteriz  30.5      53  0.0012   30.7   3.2   32  228-273    75-106 (158)
174 PRK02793 phi X174 lysis protei  30.2 1.4E+02  0.0031   24.0   5.2   55   32-89      1-57  (72)
175 smart00751 BSD domain in trans  30.0      43 0.00094   24.7   2.1   25  232-256     5-29  (51)
176 COG2433 Uncharacterized conser  29.9 3.2E+02  0.0069   30.7   9.3   53    4-56    429-490 (652)
177 PRK14145 heat shock protein Gr  29.1 2.5E+02  0.0054   26.9   7.5   61  318-378    42-114 (196)
178 PF10883 DUF2681:  Protein of u  29.1 1.6E+02  0.0034   25.1   5.5   36    1-36     20-55  (87)
179 KOG2129 Uncharacterized conser  28.3 4.2E+02  0.0091   28.9   9.6   83   20-110   203-299 (552)
180 PF00170 bZIP_1:  bZIP transcri  28.2 2.5E+02  0.0054   21.3   6.0   43  204-246    21-63  (64)
181 PF09304 Cortex-I_coil:  Cortex  28.0 4.3E+02  0.0094   23.5   9.5   60   15-82     34-93  (107)
182 PRK14143 heat shock protein Gr  27.8 4.7E+02    0.01   25.7   9.2   62  320-381    66-139 (238)
183 TIGR01010 BexC_CtrB_KpsE polys  27.8 3.9E+02  0.0084   26.4   8.8   10   10-19    220-229 (362)
184 PRK00295 hypothetical protein;  27.7   2E+02  0.0043   22.9   5.6   50   36-88      2-53  (68)
185 PF04111 APG6:  Autophagy prote  27.7 6.3E+02   0.014   25.3  10.7   78  202-279    57-134 (314)
186 PF09789 DUF2353:  Uncharacteri  27.7 3.7E+02  0.0081   27.6   8.8   97    2-113   131-227 (319)
187 PF10481 CENP-F_N:  Cenp-F N-te  27.4 7.1E+02   0.015   25.8  15.2  118  206-350    64-191 (307)
188 PF12329 TMF_DNA_bd:  TATA elem  27.3 2.8E+02   0.006   22.4   6.4   15   71-85     57-71  (74)
189 PF10805 DUF2730:  Protein of u  27.1 1.7E+02  0.0037   24.8   5.5   33   24-56     71-103 (106)
190 TIGR01000 bacteriocin_acc bact  26.7 7.1E+02   0.015   25.6  11.7   20  127-146   290-309 (457)
191 PF06156 DUF972:  Protein of un  26.1 1.7E+02  0.0038   25.3   5.4   44    3-46      7-57  (107)
192 PRK13922 rod shape-determining  26.0 1.2E+02  0.0027   28.7   4.9   39    4-42     69-110 (276)
193 PF13870 DUF4201:  Domain of un  26.0 4.3E+02  0.0093   23.7   8.1   76    9-88     96-172 (177)
194 KOG0980 Actin-binding protein   25.5 1.2E+03   0.026   27.7  19.4   71  202-279   459-543 (980)
195 PF14389 Lzipper-MIP1:  Leucine  25.4 3.9E+02  0.0085   22.1   8.3   27  126-152     6-32  (88)
196 PF03962 Mnd1:  Mnd1 family;  I  25.1 5.7E+02   0.012   23.9  10.4   36   16-51     67-102 (188)
197 PF14645 Chibby:  Chibby family  25.1      76  0.0016   27.8   3.1   25    4-28     71-95  (116)
198 PF08826 DMPK_coil:  DMPK coile  24.8 3.6E+02  0.0078   21.5   8.1   49   64-112     7-55  (61)
199 KOG1029 Endocytic adaptor prot  24.8 1.2E+03   0.026   27.6  13.1   26    2-27    491-516 (1118)
200 PF10779 XhlA:  Haemolysin XhlA  24.4 2.5E+02  0.0054   22.1   5.6   43    6-48      8-50  (71)
201 PRK10997 yieM hypothetical pro  24.3 4.3E+02  0.0093   28.6   8.9   83  194-286   127-225 (487)
202 PRK03947 prefoldin subunit alp  24.2 4.6E+02    0.01   22.5   9.1   42   72-113    91-132 (140)
203 PF05377 FlaC_arch:  Flagella a  23.7 2.2E+02  0.0048   22.6   5.0   39    5-43      1-39  (55)
204 PF02151 UVR:  UvrB/uvrC motif;  23.6      89  0.0019   21.8   2.6   16   96-111    21-36  (36)
205 COG4372 Uncharacterized protei  23.2 9.9E+02   0.022   26.0  23.4   96   70-183    76-171 (499)
206 PF10506 MCC-bdg_PDZ:  PDZ doma  22.9 2.7E+02  0.0059   22.6   5.6   48    8-56      2-49  (67)
207 PF10458 Val_tRNA-synt_C:  Valy  22.9   3E+02  0.0065   21.3   5.7   24  154-180     5-28  (66)
208 PF04568 IATP:  Mitochondrial A  22.8 2.6E+02  0.0057   24.2   5.8   47  195-243    50-96  (100)
209 cd00890 Prefoldin Prefoldin is  21.8 4.6E+02  0.0099   21.6   8.3   36   77-112    89-124 (129)
210 PF15619 Lebercilin:  Ciliary p  21.7 6.9E+02   0.015   23.6  11.4   32   56-89    108-139 (194)
211 PF03999 MAP65_ASE1:  Microtubu  21.5      31 0.00067   37.0   0.0  151   68-243   133-299 (619)
212 TIGR03185 DNA_S_dndD DNA sulfu  21.3 1.1E+03   0.023   25.7  31.0   78  162-241   390-467 (650)
213 KOG0946 ER-Golgi vesicle-tethe  21.2 1.4E+03   0.031   27.0  17.3   38    5-42    679-716 (970)
214 PF07106 TBPIP:  Tat binding pr  21.2 1.8E+02   0.004   25.8   4.8   20   29-48    120-139 (169)
215 PF12240 Angiomotin_C:  Angiomo  21.1 1.4E+02   0.003   29.1   4.2   38    6-43    131-168 (205)
216 KOG0804 Cytoplasmic Zn-finger   21.0 1.1E+03   0.024   25.8  14.7   20   71-90    381-400 (493)
217 PF14282 FlxA:  FlxA-like prote  20.9 5.2E+02   0.011   21.9   8.2   25  123-147    14-38  (106)
218 PRK00409 recombination and DNA  20.9 1.1E+03   0.024   26.6  11.6   50    7-56    523-572 (782)
219 PF10234 Cluap1:  Clusterin-ass  20.8   2E+02  0.0044   28.7   5.4   47    3-50    175-221 (267)
220 PF09789 DUF2353:  Uncharacteri  20.4 9.6E+02   0.021   24.8  10.6   72    7-90     75-148 (319)
221 KOG0995 Centromere-associated   20.3 1.2E+03   0.027   26.1  23.4  207  126-361   257-465 (581)
222 PF04899 MbeD_MobD:  MbeD/MobD   20.3 2.9E+02  0.0064   22.5   5.3   40  316-355    23-62  (70)
223 TIGR01069 mutS2 MutS2 family p  20.1 8.4E+02   0.018   27.5  10.5   50    7-56    518-567 (771)
224 PF10224 DUF2205:  Predicted co  20.1 4.3E+02  0.0094   22.1   6.4   51  198-255    12-62  (80)
225 PF09766 FimP:  Fms-interacting  20.0   2E+02  0.0044   29.2   5.3   51    5-56    102-152 (355)
226 PF06825 HSBP1:  Heat shock fac  20.0 1.4E+02   0.003   23.4   3.3   15   34-48     30-44  (54)
227 PF08649 DASH_Dad1:  DASH compl  20.0 2.8E+02  0.0062   22.1   5.0   41  222-265     4-44  (58)

No 1  
>TIGR02168 SMC_prok_B chromosome segregation protein SMC, common bacterial type. SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
Probab=97.97  E-value=0.04  Score=58.86  Aligned_cols=73  Identities=23%  Similarity=0.191  Sum_probs=29.3

Q ss_pred             HHHHHhHHHHHHHHHHhhHHHHHHHhhhhhHHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 047021          207 LEKQLHDMAVVVERLESSRQKLLMEIDNQSSEIERLFEENSSLSSSYQESMGIAKHWENQVKDCLKQNEELRG  279 (388)
Q Consensus       207 le~Ql~dma~~VeRLE~SRqKLL~EID~QS~EIErLf~ENs~Lsss~qea~~iaaqWEnQvqDclkQNeeLr~  279 (388)
                      ++..+.++..-+..++.....+-.+++.-..++..+..+-..+...+.....--..++.++.++-.+...+..
T Consensus       864 ~~~~~~~l~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~l~~l~~~~~~~~~~~~~l~~  936 (1179)
T TIGR02168       864 LEELIEELESELEALLNERASLEEALALLRSELEELSEELRELESKRSELRRELEELREKLAQLELRLEGLEV  936 (1179)
T ss_pred             HHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            3333333333333344444444444444333343333333333333333333334444444444444444433


No 2  
>TIGR02169 SMC_prok_A chromosome segregation protein SMC, primarily archaeal type. SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent.
Probab=97.91  E-value=0.054  Score=58.34  Aligned_cols=105  Identities=18%  Similarity=0.326  Sum_probs=53.6

Q ss_pred             hhhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHH
Q 047021            4 SQLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKR   83 (388)
Q Consensus         4 ~~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkR   83 (388)
                      ..+..+......++.+...+...+...+.+...+...+..+.. ..+.++..+..+..+.+....-.+.+...+..++.+
T Consensus       674 ~~l~~l~~~l~~l~~~l~~l~~~~~~~~~~l~~l~~~~~~~~~-~~~~l~~~l~~~~~~l~~~~~~~~~l~~~~~~~~~~  752 (1164)
T TIGR02169       674 AELQRLRERLEGLKRELSSLQSELRRIENRLDELSQELSDASR-KIGEIEKEIEQLEQEEEKLKERLEELEEDLSSLEQE  752 (1164)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            3455566666666666666666666666666556555554432 234455555544444444444444455555555555


Q ss_pred             hHHHHHHHhhhHhhHHHHHHHHHHHH
Q 047021           84 LKEAEEEQYRAEEDAAALRAELNSLQ  109 (388)
Q Consensus        84 lkEAEEEQyRAEEDaAaLRAEln~lQ  109 (388)
                      +...+.+--..+.+...+..+++.+.
T Consensus       753 ~~~~~~el~~l~~~i~~l~~~i~~l~  778 (1164)
T TIGR02169       753 IENVKSELKELEARIEELEEDLHKLE  778 (1164)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            55554444444444444444444433


No 3  
>TIGR02168 SMC_prok_B chromosome segregation protein SMC, common bacterial type. SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. This family represents the SMC protein of most bacteria. The smc gene is often associated with scpB (TIGR00281) and scpA genes, where scp stands for segregation and condensation protein. SMC was shown (in Caulobacter crescentus) to be induced early in S phase but present and bound to DNA throughout the cell cycle.
Probab=97.76  E-value=0.088  Score=56.30  Aligned_cols=22  Identities=27%  Similarity=0.213  Sum_probs=11.3

Q ss_pred             hHHHHhHHHHHHHHHHHHHHHH
Q 047021          335 SRAEALSAQVLQLSAQLQQTTQ  356 (388)
Q Consensus       335 sRseaLsaqv~qLsAeL~ra~Q  356 (388)
                      .|-+.|+.|..-|.........
T Consensus       993 er~~~l~~q~~dL~~~~~~L~~ 1014 (1179)
T TIGR02168       993 EEYEELKERYDFLTAQKEDLTE 1014 (1179)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHH
Confidence            4555555555555554444333


No 4  
>PF00038 Filament:  Intermediate filament protein;  InterPro: IPR016044 Intermediate filaments (IF) [, , ] are proteins which are primordial components of the cytoskeleton and the nuclear envelope. They generally form filamentous structures 8 to 14 nm wide. IF proteins are members of a very large multigene family of proteins which has been subdivided in five major subgroups:  Type I: Acidic cytokeratins. Type II: Basic cytokeratins. Type III: Vimentin, desmin, glial fibrillary acidic protein (GFAP), peripherin, and plasticin. Type IV: Neurofilaments L, H and M, alpha-internexin and nestin. Type V: Nuclear lamins A, B1, B2 and C.   All IF proteins are structurally similar in that they consist of: a central rod domain comprising some 300 to 350 residues which is arranged in coiled-coiled alpha-helices, with at least two short characteristic interruptions; a N-terminal non-helical domain (head) of variable length; and a C-terminal domain (tail) which is also non-helical, and which shows extreme length variation between different IF proteins. While IF proteins are evolutionary and structurally related, they have limited sequence homologies except in several regions of the rod domain. This entry represents the central rod domain found in IF proteins.; PDB: 3TNU_B 3KLT_D 1GK4_F 3TRT_A 3G1E_A 3UF1_C 1GK6_B 1GK7_A 3TYY_B 3V4W_A ....
Probab=97.53  E-value=0.084  Score=49.95  Aligned_cols=96  Identities=25%  Similarity=0.389  Sum_probs=64.9

Q ss_pred             HhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhHHHHHH
Q 047021           11 KKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRLKEAEEE   90 (388)
Q Consensus        11 ~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRlkEAEEE   90 (388)
                      .|.-.||.+|+.|..+|.........-..++.+.=...+..+|+-|.+++.|+....+-.+.+-..+..++.|+.+.-..
T Consensus        18 ekVr~LE~~N~~Le~~i~~~~~~~~~~~~~~~~~ye~el~~lr~~id~~~~eka~l~~e~~~l~~e~~~~r~k~e~e~~~   97 (312)
T PF00038_consen   18 EKVRFLEQENKRLESEIEELREKKGEEVSRIKEMYEEELRELRRQIDDLSKEKARLELEIDNLKEELEDLRRKYEEELAE   97 (312)
T ss_dssp             HHHHHHHHHHHHHHHHHHH---------HHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHhhhhHHHHHHHHhcccccCcccccchhhHHHHhHHhhhhHHHHhhHHhhhhhhHHHHHHHHHHHHHHHHHH
Confidence            34445566666666666655555444444455555666778889999999999999998998888888888888888888


Q ss_pred             HhhhHhhHHHHHHHHH
Q 047021           91 QYRAEEDAAALRAELN  106 (388)
Q Consensus        91 QyRAEEDaAaLRAEln  106 (388)
                      ...++.+-..||.++.
T Consensus        98 ~~~le~el~~lrk~ld  113 (312)
T PF00038_consen   98 RKDLEEELESLRKDLD  113 (312)
T ss_dssp             HHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHhhhhhhhh
Confidence            8888877777775554


No 5  
>TIGR02169 SMC_prok_A chromosome segregation protein SMC, primarily archaeal type. SMC (structural maintenance of chromosomes) proteins bind DNA and act in organizing and segregating chromosomes for partition. SMC proteins are found in bacteria, archaea, and eukaryotes. It is found in a single copy and is homodimeric in prokaryotes, but six paralogs (excluded from this family) are found in eukarotes, where SMC proteins are heterodimeric. This family represents the SMC protein of archaea and a few bacteria (Aquifex, Synechocystis, etc); the SMC of other bacteria is described by TIGR02168. The N- and C-terminal domains of this protein are well conserved, but the central hinge region is skewed in composition and highly divergent.
Probab=97.35  E-value=0.33  Score=52.53  Aligned_cols=37  Identities=19%  Similarity=0.372  Sum_probs=15.9

Q ss_pred             HHHhHHHHHHHHHHhhHHHHHHHhhhhhHHHHHHhhh
Q 047021          209 KQLHDMAVVVERLESSRQKLLMEIDNQSSEIERLFEE  245 (388)
Q Consensus       209 ~Ql~dma~~VeRLE~SRqKLL~EID~QS~EIErLf~E  245 (388)
                      .++..+..-++.++.....+-.+|+.-..+++.+-.+
T Consensus       427 ~~~~~l~~~~~~l~~~~~~~~~~l~~l~~~~~~~~~~  463 (1164)
T TIGR02169       427 AAIAGIEAKINELEEEKEDKALEIKKQEWKLEQLAAD  463 (1164)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            3333333334444444444444444444444444333


No 6  
>KOG0161 consensus Myosin class II heavy chain [Cytoskeleton]
Probab=97.20  E-value=0.93  Score=54.62  Aligned_cols=345  Identities=23%  Similarity=0.277  Sum_probs=170.8

Q ss_pred             hhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhh--hhHHH--------HhhhhhhhhhhHHHh---hh
Q 047021            5 QLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTV--PSLRK--------ALKDVAMEKDAAVVA---RE   71 (388)
Q Consensus         5 ~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v--~slrk--------aL~~v~mEkDaA~va---re   71 (388)
                      .+.+|.+...++|..+.++.+++.+.+.++..|..+++.++.+.-  ...+|        -..+|..++|.+--.   +.
T Consensus       937 e~~~l~~~~~~~E~~~~k~~~Ek~~~e~~~~~l~~e~~~~~e~~~kL~kekk~lEe~~~~l~~~l~~~eek~~~l~k~~~ 1016 (1930)
T KOG0161|consen  937 EVQELKEQLEELELTLQKLELEKNAAENKLKNLEEEINSLDENISKLSKEKKELEERIRELQDDLQAEEEKAKSLNKAKA 1016 (1930)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            355666667777777778888888888888888888886544322  33333        344555555554332   45


Q ss_pred             hHHHHHHHHHHHhHHHHHHHhhhHhh--HHHHHHHHHHHHHHHhcCCCCCCCCCCCChhhHHHHHHHHHHHHHHHHHHHH
Q 047021           72 DLSAQLRTLKKRLKEAEEEQYRAEED--AAALRAELNSLQQQAMDGSLGGISSMGISPDHVQTLEKELATLKSQLQQESQ  149 (388)
Q Consensus        72 Dl~AQLR~~kkRlkEAEEEQyRAEED--aAaLRAEln~lQqq~~~~~~s~~~~~g~s~~~~~~lekEl~~l~~~lqqes~  149 (388)
                      .+-++|..+..++..  +...|.|=+  --.|..++..+|-+.+.-  -+  ..--...+....+-|+..++.++..+..
T Consensus      1017 kle~~l~~le~~le~--e~~~r~e~Ek~~rkle~el~~~~e~~~~~--~~--~~~el~~~l~kke~El~~l~~k~e~e~~ 1090 (1930)
T KOG0161|consen 1017 KLEQQLDDLEVTLER--EKRIRMELEKAKRKLEGELKDLQESIEEL--KK--QKEELDNQLKKKESELSQLQSKLEDEQA 1090 (1930)
T ss_pred             HHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHhhhHHHHH--HH--HHHHHHHHHHHHHHHHHHHHHHhhHHHH
Confidence            555566655555543  333333333  123444444443333210  00  0000011122233344444444444444


Q ss_pred             HHHHHHHH--------------HHHHHHHHHHHHHHHHHHHHHHHHHhhhchhhhhhhh---------------------
Q 047021          150 LRQQERKQ--------------LTEEQARISVLMSEKQELEEKLVMFSRKASEEVSNAA---------------------  194 (388)
Q Consensus       150 lrqqeqqr--------------la~Eq~~~a~l~aEkqeLeekl~~~~~~~se~~~~~a---------------------  194 (388)
                      ++.+-+..              |..|-...+-+...+..|...+..+...-.+..+..+                     
T Consensus      1091 ~~~~l~k~i~eL~~~i~el~e~le~er~~r~K~ek~r~dL~~ele~l~~~Lee~~~~t~~q~e~~~k~e~e~~~l~~~le 1170 (1930)
T KOG0161|consen 1091 EVAQLQKQIKELEARIKELEEELEAERASRAKAERQRRDLSEELEELKEELEEQGGTTAAQLELNKKREAEVQKLRRDLE 1170 (1930)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHH
Confidence            44331111              1111111122222222233333332221111100000                     


Q ss_pred             ----hhhhcchhhHHHHHHHHhHHHHHHHHHHhhHHHHHHHhhhhhHHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHH
Q 047021          195 ----ASKAFSVEDKEKLEKQLHDMAVVVERLESSRQKLLMEIDNQSSEIERLFEENSSLSSSYQESMGIAKHWENQVKDC  270 (388)
Q Consensus       195 ----~~k~~S~eDkekle~Ql~dma~~VeRLE~SRqKLL~EID~QS~EIErLf~ENs~Lsss~qea~~iaaqWEnQvqDc  270 (388)
                          .+......-|.++...++++...++.+.--.++|.-|=..=..|+..|-.+.+.++++-.++--....-|.|+.+-
T Consensus      1171 ee~~~~e~~~~~lr~~~~~~~~el~~qle~l~~~k~~lekek~~lq~e~~~l~~ev~~~~~~k~~~e~~~k~~E~~l~el 1250 (1930)
T KOG0161|consen 1171 EETLDHEAQIEELRKKHADSLAELQEQLEQLQKDKAKLEKEKSDLQREIADLAAELEQLSSEKKDLEKKDKKLEAQLSEL 1250 (1930)
T ss_pred             HHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccHHHHHHHHHHHHHHH
Confidence                1111122446677778888999999988888887777666667777788888888888888887777777666654


Q ss_pred             HHHHHHHHHHHHHHHhhhhccCCCCccccccccCCCCCCccc-cCCCchhhhhHHhhHHHHHHHhhHHHHhHHHHHHHHH
Q 047021          271 LKQNEELRGILEKLRTEQASFMSVNDKDILRDFSEPSKDAVS-QNGSETHKNEFLALKGQLAKEQSRAEALSAQVLQLSA  349 (388)
Q Consensus       271 lkQNeeLr~lLeklR~EQa~~~~~~~~~~~g~~~e~~~dg~~-~~~~~~~~te~l~LK~eL~kEqsRseaLsaqv~qLsA  349 (388)
                      =.-..++...+..+= -|.+-+-.......+...+....... ...-......+..+|.+|-.+--+..+|..++.++.-
T Consensus      1251 q~k~~~~~~~~~~l~-~q~~~l~~E~~~l~~~lee~e~~~~~~~r~~~~~~~qle~~k~qle~e~r~k~~l~~~l~~l~~ 1329 (1930)
T KOG0161|consen 1251 QLKLDEQERLRNDLT-AKRSRLQNENEELSRQLEEAEAKLSALSRDKQALESQLEELKRQLEEETREKSALENALRQLEH 1329 (1930)
T ss_pred             HHHHHHHHHHHHHHH-HHHHHhhhhHHHHhhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            444444333322210 01111100000000000000000000 0012233346888899999888888888888888887


Q ss_pred             HHHHHHH
Q 047021          350 QLQQTTQ  356 (388)
Q Consensus       350 eL~ra~Q  356 (388)
                      +.....-
T Consensus      1330 e~~~l~e 1336 (1930)
T KOG0161|consen 1330 ELDLLRE 1336 (1930)
T ss_pred             HHHHHHH
Confidence            7665543


No 7  
>PF13514 AAA_27:  AAA domain
Probab=96.99  E-value=0.94  Score=50.96  Aligned_cols=253  Identities=19%  Similarity=0.240  Sum_probs=137.6

Q ss_pred             hhhhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhh-hHHHHhhhhhh-----------------hhh
Q 047021            3 LSQLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVP-SLRKALKDVAM-----------------EKD   64 (388)
Q Consensus         3 ~~~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~-slrkaL~~v~m-----------------EkD   64 (388)
                      ..+|..|-.....++.....+...+.+...+.+.+..++..+....+| .++.+|..+-=                 .+.
T Consensus       345 ~~~i~~l~~~~~~l~~~~~~~~~~l~~~~~~~~~~~~~~~~l~~~~~~~~l~~al~~~~~~~d~~~~~~~~~~~~~~~~~  424 (1111)
T PF13514_consen  345 RERIRELLQEREQLEQALAQARRELEEAERELEQLQAELAALPAPPDPEALRAALEAAQRLGDLEARLQEAEQALEAAER  424 (1111)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhCcccCCChHHHHHHHHHHhcccHHHHHHHHHHHHHHHHH
Confidence            457888889999999999999999999999999999999988887763 35555442211                 111


Q ss_pred             hHHHhhhhH-----------------HHHHHHHHHHhHHHHHHHhhhHhhHHHHHHHHHHHHHHHhcCCCCCCCC-----
Q 047021           65 AAVVAREDL-----------------SAQLRTLKKRLKEAEEEQYRAEEDAAALRAELNSLQQQAMDGSLGGISS-----  122 (388)
Q Consensus        65 aA~vareDl-----------------~AQLR~~kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq~~~~~~s~~~~-----  122 (388)
                      ....+-..|                 .+++....++..+.+...-++......++.++..+..+...-..+|-.+     
T Consensus       425 ~l~~~l~~L~~w~~~~~~l~~~~~P~~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~v~t~~~l  504 (1111)
T PF13514_consen  425 RLAAALAALGPWSGDLDALAALPLPSRETVEAFRAEFEELERQLRRARDRLEELEEELARLEARLRRLAAAGDVPTEEEL  504 (1111)
T ss_pred             HHHHHHHhcCCCCCChHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCHHHH
Confidence            111111111                 1455566666667666666666666666666666655544333332211     


Q ss_pred             ----------------CCCChhh---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q 047021          123 ----------------MGISPDH---VQTLEKELATLKSQLQQESQLRQQERKQLTEEQARISVLMSEKQELEEKLVMFS  183 (388)
Q Consensus       123 ----------------~g~s~~~---~~~lekEl~~l~~~lqqes~lrqqeqqrla~Eq~~~a~l~aEkqeLeekl~~~~  183 (388)
                                      -..+++.   .+.-=-.++|.....-+...-.+.-+.+++........+..+..+++..+..+.
T Consensus       505 ~~aR~~Rd~~W~~~~~~~~~~~~fe~a~~~aD~laD~~~~~a~~~a~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  584 (1111)
T PF13514_consen  505 AAARARRDAAWQLAALDAALAEAFEAAVREADELADRRLREAERAARLAQLRARLEEARARLARAQARLAAAEAALAALE  584 (1111)
T ss_pred             HHHHHHHHHHhcccCCccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence                            0111111   112223556666655555555555666666666666666666666666666554


Q ss_pred             hh-------ch-hhhhhhhhhhhcch-----hhHHHHHHHHhHHHHHHHHHHhhHHHHHHHhhhhhHHHHHHhhhhhhhh
Q 047021          184 RK-------AS-EEVSNAAASKAFSV-----EDKEKLEKQLHDMAVVVERLESSRQKLLMEIDNQSSEIERLFEENSSLS  250 (388)
Q Consensus       184 ~~-------~s-e~~~~~a~~k~~S~-----eDkekle~Ql~dma~~VeRLE~SRqKLL~EID~QS~EIErLf~ENs~Ls  250 (388)
                      ..       .. |..| .+...|+..     +.-+.+...-..+.....++...++.|...++.-..        +..|.
T Consensus       585 ~~W~~~~~~~g~p~~p-~~~~~Wl~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~l~~~~~--------~~~l~  655 (1111)
T PF13514_consen  585 AAWAALWAAAGLPLSP-AEMRDWLARREAALEAAEELRAARAELEALRARRAAARAALAAALAALGP--------AEELA  655 (1111)
T ss_pred             HHHHHHHhhcCCCCCh-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCc--------cccHH
Confidence            32       11 1112 223444422     233333444344445555555566655555544322        34444


Q ss_pred             hhHHHHHHHHHHHH
Q 047021          251 SSYQESMGIAKHWE  264 (388)
Q Consensus       251 ss~qea~~iaaqWE  264 (388)
                      .-+..+......|+
T Consensus       656 ~~l~~a~~~~~~~~  669 (1111)
T PF13514_consen  656 ALLEEAEALLEEWE  669 (1111)
T ss_pred             HHHHHHHHHHHHHH
Confidence            44444444444443


No 8  
>COG1196 Smc Chromosome segregation ATPases [Cell division and chromosome partitioning]
Probab=96.89  E-value=0.34  Score=54.74  Aligned_cols=76  Identities=22%  Similarity=0.358  Sum_probs=42.2

Q ss_pred             hhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHH
Q 047021            5 QLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLK   81 (388)
Q Consensus         5 ~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~k   81 (388)
                      ..+.++.++.+++.+...+...+.+...+...+..+++.+.... +.++..+..++-.+++..-.++++-.+++..+
T Consensus       731 ~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~-~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  806 (1163)
T COG1196         731 ELEQLQSRLEELEEELEELEEELEELQERLEELEEELESLEEAL-AKLKEEIEELEEKRQALQEELEELEEELEEAE  806 (1163)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            34455555555555555555555555555555555555544444 55555555555555666655666666666665


No 9  
>PF00038 Filament:  Intermediate filament protein;  InterPro: IPR016044 Intermediate filaments (IF) [, , ] are proteins which are primordial components of the cytoskeleton and the nuclear envelope. They generally form filamentous structures 8 to 14 nm wide. IF proteins are members of a very large multigene family of proteins which has been subdivided in five major subgroups:  Type I: Acidic cytokeratins. Type II: Basic cytokeratins. Type III: Vimentin, desmin, glial fibrillary acidic protein (GFAP), peripherin, and plasticin. Type IV: Neurofilaments L, H and M, alpha-internexin and nestin. Type V: Nuclear lamins A, B1, B2 and C.   All IF proteins are structurally similar in that they consist of: a central rod domain comprising some 300 to 350 residues which is arranged in coiled-coiled alpha-helices, with at least two short characteristic interruptions; a N-terminal non-helical domain (head) of variable length; and a C-terminal domain (tail) which is also non-helical, and which shows extreme length variation between different IF proteins. While IF proteins are evolutionary and structurally related, they have limited sequence homologies except in several regions of the rod domain. This entry represents the central rod domain found in IF proteins.; PDB: 3TNU_B 3KLT_D 1GK4_F 3TRT_A 3G1E_A 3UF1_C 1GK6_B 1GK7_A 3TYY_B 3V4W_A ....
Probab=95.93  E-value=1.6  Score=41.47  Aligned_cols=119  Identities=29%  Similarity=0.353  Sum_probs=81.9

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhhhhhhhhhhhcchhhHHHHHHHHhHHHHHHHHHHhhHHHHHHHhhh
Q 047021          155 RKQLTEEQARISVLMSEKQELEEKLVMFSRKASEEVSNAAASKAFSVEDKEKLEKQLHDMAVVVERLESSRQKLLMEIDN  234 (388)
Q Consensus       155 qqrla~Eq~~~a~l~aEkqeLeekl~~~~~~~se~~~~~a~~k~~S~eDkekle~Ql~dma~~VeRLE~SRqKLL~EID~  234 (388)
                      -.|||.=..++..|..+-..|+.+|..+.....+..+          .++..++..|+++-..|..+-.-+.+|..++|+
T Consensus        10 NdRla~YIekVr~LE~~N~~Le~~i~~~~~~~~~~~~----------~~~~~ye~el~~lr~~id~~~~eka~l~~e~~~   79 (312)
T PF00038_consen   10 NDRLASYIEKVRFLEQENKRLESEIEELREKKGEEVS----------RIKEMYEEELRELRRQIDDLSKEKARLELEIDN   79 (312)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHHHHHHHH---------H----------HHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHHHHHhhhhHHHHHHHHhcccccCc----------ccccchhhHHHHhHHhhhhHHHHhhHHhhhhhh
Confidence            4577888889999999999999999998866433222          234555666666666666666666666666666


Q ss_pred             hhHHHHHH--------------hhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 047021          235 QSSEIERL--------------FEENSSLSSSYQESMGIAKHWENQVKDCLKQNEELRGILEK  283 (388)
Q Consensus       235 QS~EIErL--------------f~ENs~Lsss~qea~~iaaqWEnQvqDclkQNeeLr~lLek  283 (388)
                      -..+|+.+              =.+..+|...+.+++.+-..++++++---.+.+-++..-+.
T Consensus        80 l~~e~~~~r~k~e~e~~~~~~le~el~~lrk~ld~~~~~r~~le~~i~~L~eEl~fl~~~hee  142 (312)
T PF00038_consen   80 LKEELEDLRRKYEEELAERKDLEEELESLRKDLDEETLARVDLENQIQSLKEELEFLKQNHEE  142 (312)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhhHhHHHHHHHHHHHHHHHHHhhhhh
Confidence            66665543              45778888999999999999999998877777777765554


No 10 
>TIGR00606 rad50 rad50. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=95.85  E-value=4.8  Score=46.37  Aligned_cols=32  Identities=9%  Similarity=0.187  Sum_probs=19.4

Q ss_pred             hHHhhHHHHHHHhhHHHHhHHHHHHHHHHHHH
Q 047021          322 EFLALKGQLAKEQSRAEALSAQVLQLSAQLQQ  353 (388)
Q Consensus       322 e~l~LK~eL~kEqsRseaLsaqv~qLsAeL~r  353 (388)
                      ++..|...-+.-.+....|..+|.+|-.+|..
T Consensus      1062 ~~~~l~~~~a~l~g~~k~le~qi~~l~~eL~e 1093 (1311)
T TIGR00606      1062 NIDLIKRNHVLALGRQKGYEKEIKHFKKELRE 1093 (1311)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcc
Confidence            34444444444455566677777777777743


No 11 
>PRK11281 hypothetical protein; Provisional
Probab=95.79  E-value=5.2  Score=46.24  Aligned_cols=189  Identities=19%  Similarity=0.137  Sum_probs=109.9

Q ss_pred             HHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhhhhhhhhhhhcchhhH
Q 047021          132 TLEKELATLKSQLQQESQLRQ-------QERKQLTEEQARISVLMSEKQELEEKLVMFSRKASEEVSNAAASKAFSVEDK  204 (388)
Q Consensus       132 ~lekEl~~l~~~lqqes~lrq-------qeqqrla~Eq~~~a~l~aEkqeLeekl~~~~~~~se~~~~~a~~k~~S~eDk  204 (388)
                      .||..+.++..+|++...--.       .-+-+.++.|.+++....+.|++...+....          +..+++|..-+
T Consensus       125 qLEq~L~q~~~~Lq~~Q~~La~~NsqLi~~qT~PERAQ~~lsea~~RlqeI~~~L~~~~----------~~~~~l~~~~~  194 (1113)
T PRK11281        125 QLESRLAQTLDQLQNAQNDLAEYNSQLVSLQTQPERAQAALYANSQRLQQIRNLLKGGK----------VGGKALRPSQR  194 (1113)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcchHHHHHHHHHHHHHHHHHHHHHhCCC----------CCCCcCCHHHH
Confidence            366666666555555333222       2455566777777888888888887665411          11223666667


Q ss_pred             HHHHHHHhHHHHHHHHHHhhHHHHHHHhhhhhHHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 047021          205 EKLEKQLHDMAVVVERLESSRQKLLMEIDNQSSEIERLFEENSSLSSSYQESMGIAKHWENQVKDCLKQNEELRGILEKL  284 (388)
Q Consensus       205 ekle~Ql~dma~~VeRLE~SRqKLL~EID~QS~EIErLf~ENs~Lsss~qea~~iaaqWEnQvqDclkQNeeLr~lLekl  284 (388)
                      .+|..++.-+-..              ++-+-.|    -..|+.+.+=++.=+.....--.+.+   .+...|...++.-
T Consensus       195 ~~l~ae~~~l~~~--------------~~~~~~~----l~~~~~l~~l~~~q~d~~~~~~~~~~---~~~~~lq~~in~k  253 (1113)
T PRK11281        195 VLLQAEQALLNAQ--------------NDLQRKS----LEGNTQLQDLLQKQRDYLTARIQRLE---HQLQLLQEAINSK  253 (1113)
T ss_pred             HHHHHHHHHHHHH--------------HHHHHHH----HhcchHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHH
Confidence            7776664433322              2222111    12677777777776666655444333   3344566666665


Q ss_pred             HhhhhccCCCCccccccccCCCCCCccccCCCchhhh---hHHhhHHHHHHHhhHHHHhHHHHHHHHHHHHHHHHHhh
Q 047021          285 RTEQASFMSVNDKDILRDFSEPSKDAVSQNGSETHKN---EFLALKGQLAKEQSRAEALSAQVLQLSAQLQQTTQAYN  359 (388)
Q Consensus       285 R~EQa~~~~~~~~~~~g~~~e~~~dg~~~~~~~~~~t---e~l~LK~eL~kEqsRseaLsaqv~qLsAeL~ra~QA~n  359 (388)
                      |.+++-..-...    .    .+.+.....++|....   .|..|-.+|.+--.|...|+.+-.+.-..|.+.+|+..
T Consensus       254 r~~~se~~~~~a----~----~~~~~~~~~~~p~i~~~~~~N~~Ls~~L~~~t~~~~~l~~~~~~~~~~l~~~~q~~~  323 (1113)
T PRK11281        254 RLTLSEKTVQEA----Q----SQDEAARIQANPLVAQELEINLQLSQRLLKATEKLNTLTQQNLRVKNWLDRLTQSER  323 (1113)
T ss_pred             HHHHHHHHHHHH----h----hhhhhcccCCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            555443311000    0    0000111222444433   48899999999999999999999999999999999987


No 12 
>PF14662 CCDC155:  Coiled-coil region of CCDC155
Probab=95.33  E-value=2.7  Score=40.03  Aligned_cols=166  Identities=31%  Similarity=0.349  Sum_probs=103.3

Q ss_pred             hhhhhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHH
Q 047021            2 LLSQLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLK   81 (388)
Q Consensus         2 l~~~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~k   81 (388)
                      |++.|.+|+..|..|..+|.+|...+++++.-+.-|-.++-++..... |+++||.--     -+   -++=..-|++.=
T Consensus         6 L~~~v~dL~~~n~~L~~en~kL~~~ve~~ee~na~L~~e~~~L~~q~~-s~Qqal~~a-----K~---l~eEledLk~~~   76 (193)
T PF14662_consen    6 LLSCVEDLQLNNQKLADENAKLQRSVETAEEGNAQLAEEITDLRKQLK-SLQQALQKA-----KA---LEEELEDLKTLA   76 (193)
T ss_pred             HHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHH-----HH---HHHHHHHHHHHH
Confidence            689999999999999999999999999999999999999999877654 778877321     01   111122222222


Q ss_pred             HHhHH----HHHHHhhhHhhHHHHHHHHHHHHHHHhcCCCCCCCCCCCChhhHHHHHHHHHHHHHHH-HHHHHHHHH---
Q 047021           82 KRLKE----AEEEQYRAEEDAAALRAELNSLQQQAMDGSLGGISSMGISPDHVQTLEKELATLKSQL-QQESQLRQQ---  153 (388)
Q Consensus        82 kRlkE----AEEEQyRAEEDaAaLRAEln~lQqq~~~~~~s~~~~~g~s~~~~~~lekEl~~l~~~l-qqes~lrqq---  153 (388)
                      +-+.|    .=..-|-.|...-.|-+++.+||..-+....    -.+..-+.+..|-.+-.+|+.++ .=++++.+.   
T Consensus        77 ~~lEE~~~~L~aq~rqlEkE~q~L~~~i~~Lqeen~kl~~----e~~~lk~~~~eL~~~~~~Lq~Ql~~~e~l~~~~da~  152 (193)
T PF14662_consen   77 KSLEEENRSLLAQARQLEKEQQSLVAEIETLQEENGKLLA----ERDGLKKRSKELATEKATLQRQLCEFESLICQRDAI  152 (193)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHH----hhhhHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHH
Confidence            22111    0111122344556788888888877643322    12234566777777788888887 444444442   


Q ss_pred             --H----HHHH----HHHHHHHHHHHHHHHHHHHHHH
Q 047021          154 --E----RKQL----TEEQARISVLMSEKQELEEKLV  180 (388)
Q Consensus       154 --e----qqrl----a~Eq~~~a~l~aEkqeLeekl~  180 (388)
                        +    .+-|    ++=...+..|++||-.||+++.
T Consensus       153 l~e~t~~i~eL~~~ieEy~~~teeLR~e~s~LEeql~  189 (193)
T PF14662_consen  153 LSERTQQIEELKKTIEEYRSITEELRLEKSRLEEQLS  189 (193)
T ss_pred             HHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence              1    1112    2223345567777777777764


No 13 
>PRK04863 mukB cell division protein MukB; Provisional
Probab=95.29  E-value=7.6  Score=46.15  Aligned_cols=182  Identities=20%  Similarity=0.269  Sum_probs=106.2

Q ss_pred             HhhhhHHHHHHHHHHHhHHHHHHHhhhHhhHHHHHHHHHHHHHHHhcC--CCCCCC----------------CCCCChhh
Q 047021           68 VAREDLSAQLRTLKKRLKEAEEEQYRAEEDAAALRAELNSLQQQAMDG--SLGGIS----------------SMGISPDH  129 (388)
Q Consensus        68 vareDl~AQLR~~kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq~~~~--~~s~~~----------------~~g~s~~~  129 (388)
                      ++-++|-.-+.....++.+.+.+--..|..-..+.+.+..+......-  -.+.++                ...+-.++
T Consensus       435 ~SdEeLe~~LenF~aklee~e~qL~elE~kL~~lea~leql~~~~~~l~~~~Gkv~~~~a~~~~~~~~~~~~~~~~~~~~  514 (1486)
T PRK04863        435 LTADNAEDWLEEFQAKEQEATEELLSLEQKLSVAQAAHSQFEQAYQLVRKIAGEVSRSEAWDVARELLRRLREQRHLAEQ  514 (1486)
T ss_pred             CCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCcCHHHHHHHHHHHHHHhHHHHHHHHh
Confidence            345777777778888888888777777777777777777777664321  122222                11133444


Q ss_pred             HHHHHHHHHHHHHHHHH----HHHHHH----------------HHHHH-------H----HHHHHHHHHHHHHHHHHHHH
Q 047021          130 VQTLEKELATLKSQLQQ----ESQLRQ----------------QERKQ-------L----TEEQARISVLMSEKQELEEK  178 (388)
Q Consensus       130 ~~~lekEl~~l~~~lqq----es~lrq----------------qeqqr-------l----a~Eq~~~a~l~aEkqeLeek  178 (388)
                      ...+...+++|+-+|++    +.++.+                +....       +    +.--++-..+..+-++|...
T Consensus       515 ~~~~~~~~~~l~~~~~~q~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~~qL~~~  594 (1486)
T PRK04863        515 LQQLRMRLSELEQRLRQQQRAERLLAEFCKRLGKNLDDEDELEQLQEELEARLESLSESVSEARERRMALRQQLEQLQAR  594 (1486)
T ss_pred             hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            66788888888886554    222222                01111       1    11222335566677778888


Q ss_pred             HHHHhhhchh-hhhhhhhhh-----hcchhhHHHHHHHHhHHHHHHHHHHhhHHHHHHHhhhhhHHHHHHhhhhhhh
Q 047021          179 LVMFSRKASE-EVSNAAASK-----AFSVEDKEKLEKQLHDMAVVVERLESSRQKLLMEIDNQSSEIERLFEENSSL  249 (388)
Q Consensus       179 l~~~~~~~se-~~~~~a~~k-----~~S~eDkekle~Ql~dma~~VeRLE~SRqKLL~EID~QS~EIErLf~ENs~L  249 (388)
                      |..+++.+|. -....|..+     .-..+|...+......+..-.--+--.|+++=..|..--.+|++||.-+.+.
T Consensus       595 i~~l~~~ap~W~~a~~al~~L~eq~g~~~~~~~~v~~~mq~~~~~~~~~~~~~~~~~~~~~~L~~~i~~l~~~~~g~  671 (1486)
T PRK04863        595 IQRLAARAPAWLAAQDALARLREQSGEEFEDSQDVTEYMQQLLERERELTVERDELAARKQALDEEIERLSQPGGSE  671 (1486)
T ss_pred             HHHHHHhChHHHhhHHHHHHHHHhcchhhcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccCCCc
Confidence            8888877776 222222222     1123666666655555555444455566677777777777788888765554


No 14 
>PRK02224 chromosome segregation protein; Provisional
Probab=95.24  E-value=5.6  Score=43.09  Aligned_cols=34  Identities=32%  Similarity=0.431  Sum_probs=19.1

Q ss_pred             HHHHHhHHHHHHHhhhHhhHHHHHHHHHHHHHHH
Q 047021           79 TLKKRLKEAEEEQYRAEEDAAALRAELNSLQQQA  112 (388)
Q Consensus        79 ~~kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq~  112 (388)
                      ...++..+++.+.+.+.++...+..+++.+-...
T Consensus       555 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~  588 (880)
T PRK02224        555 EKREAAAEAEEEAEEAREEVAELNSKLAELKERI  588 (880)
T ss_pred             HHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHH
Confidence            3334445555566666666666666666655533


No 15 
>PF07888 CALCOCO1:  Calcium binding and coiled-coil domain (CALCOCO1) like;  InterPro: IPR012852 Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein expressed by Mus musculus (CoCoA, Q8CGU1 from SWISSPROT). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1 (Q61026 from SWISSPROT), and thus enhances transcriptional activation by a number of nuclear receptors. CoCoA has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region []. 
Probab=95.22  E-value=5.6  Score=42.97  Aligned_cols=225  Identities=24%  Similarity=0.387  Sum_probs=111.4

Q ss_pred             hhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHh
Q 047021            5 QLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRL   84 (388)
Q Consensus         5 ~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRl   84 (388)
                      +|...++.+.+|...|..|..+......+++.|+..|+.... ..-.|+.-.++++-....+.--++.|..++-.+..|+
T Consensus       144 qlE~~qkE~eeL~~~~~~Le~e~~~l~~~v~~l~~eL~~~~e-e~e~L~~~~kel~~~~e~l~~E~~~L~~q~~e~~~ri  222 (546)
T PF07888_consen  144 QLEECQKEKEELLKENEQLEEEVEQLREEVERLEAELEQEEE-EMEQLKQQQKELTESSEELKEERESLKEQLAEARQRI  222 (546)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            445555666666666666666666666666666666664433 3445555555555544444433444444444444444


Q ss_pred             HHHHHHHhhhHhhHHHHHHHHHHHHHHHhcCCCCCCCCCCCChhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 047021           85 KEAEEEQYRAEEDAAALRAELNSLQQQAMDGSLGGISSMGISPDHVQTLEKELATLKSQLQQESQLRQQERKQLTEEQAR  164 (388)
Q Consensus        85 kEAEEEQyRAEEDaAaLRAEln~lQqq~~~~~~s~~~~~g~s~~~~~~lekEl~~l~~~lqqes~lrqqeqqrla~Eq~~  164 (388)
                      .+-|              .++.++.+......        --.+++..+..++...+.+|+          +||......
T Consensus       223 ~~LE--------------edi~~l~qk~~E~e--------~~~~~lk~~~~elEq~~~eLk----------~rLk~~~~~  270 (546)
T PF07888_consen  223 RELE--------------EDIKTLTQKEKEQE--------KELDKLKELKAELEQLEAELK----------QRLKETVVQ  270 (546)
T ss_pred             HHHH--------------HHHHHHHHHHHHHH--------HHHHHHHHHHHHHHHHHHHHH----------HHHHHHHHH
Confidence            4333              33334444430000        000111222222222222222          222222222


Q ss_pred             H-------HHHHHHHHHHHHHHHHHhhhchhhhhhhhhhhhcchhhHHHHHHHHhHHHHHHHHHHhhHHHHHHHhhhhhH
Q 047021          165 I-------SVLMSEKQELEEKLVMFSRKASEEVSNAAASKAFSVEDKEKLEKQLHDMAVVVERLESSRQKLLMEIDNQSS  237 (388)
Q Consensus       165 ~-------a~l~aEkqeLeekl~~~~~~~se~~~~~a~~k~~S~eDkekle~Ql~dma~~VeRLE~SRqKLL~EID~QS~  237 (388)
                      .       ..+..+...|.++|...-.....           |......|.+.|.||+       +-|++.+.|+-....
T Consensus       271 ~~~~~~~~~~~~~e~e~LkeqLr~~qe~lqa-----------Sqq~~~~L~~EL~~~~-------~~RDrt~aeLh~aRL  332 (546)
T PF07888_consen  271 LKQEETQAQQLQQENEALKEQLRSAQEQLQA-----------SQQEAELLRKELSDAV-------NVRDRTMAELHQARL  332 (546)
T ss_pred             HHHhhhhhhhHHHHHHHHHHHHHHHHHHHHH-----------HHHHHHHHHHHHHHHH-------HHHHHHHHHHHHhhh
Confidence            2       23334444444444433211110           1233445666665554       457889999988888


Q ss_pred             HHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHH
Q 047021          238 EIERLFEENSSLSSSYQESMGIAKHWENQVKDCLKQNEELR-GILEKL  284 (388)
Q Consensus       238 EIErLf~ENs~Lsss~qea~~iaaqWEnQvqDclkQNeeLr-~lLekl  284 (388)
                      |.+.|   +-.|+...-+..+-.++|..--. .|.+.-+.. +-+++|
T Consensus       333 e~aql---~~qLad~~l~lke~~~q~~qEk~-~l~~~~e~~k~~ie~L  376 (546)
T PF07888_consen  333 EAAQL---KLQLADASLELKEGRSQWAQEKQ-ALQHSAEADKDEIEKL  376 (546)
T ss_pred             hHHHH---HHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHhHHHHHHH
Confidence            88888   55677766677777889986555 555544333 445443


No 16 
>PHA02562 46 endonuclease subunit; Provisional
Probab=94.97  E-value=4.8  Score=40.93  Aligned_cols=138  Identities=15%  Similarity=0.214  Sum_probs=77.9

Q ss_pred             HhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhHHHHHH
Q 047021           11 KKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRLKEAEEE   90 (388)
Q Consensus        11 ~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRlkEAEEE   90 (388)
                      .+..+++.++..|..++.....+.+.+.+.+.+++.....    .+.++.=+.|.......++-+++..++..+.....+
T Consensus       174 ~~~~e~~~~i~~l~~~i~~l~~~i~~~~~~i~~~~~~~~~----~i~~l~~e~~~l~~~~~~l~~~l~~l~~~i~~l~~~  249 (562)
T PHA02562        174 DKIRELNQQIQTLDMKIDHIQQQIKTYNKNIEEQRKKNGE----NIARKQNKYDELVEEAKTIKAEIEELTDELLNLVMD  249 (562)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc
Confidence            3444566666666666666666666666666666554332    233444455566666666777777666666554322


Q ss_pred             -------HhhhHhhHHHHHHHHHHHHHHHhcCCCC-CCC----CCCCChhhHHHHHHHHHHHHHHHHHHHHHHH
Q 047021           91 -------QYRAEEDAAALRAELNSLQQQAMDGSLG-GIS----SMGISPDHVQTLEKELATLKSQLQQESQLRQ  152 (388)
Q Consensus        91 -------QyRAEEDaAaLRAEln~lQqq~~~~~~s-~~~----~~g~s~~~~~~lekEl~~l~~~lqqes~lrq  152 (388)
                             --.++.+-+.+...++.++....-.... -.+    ++..+++.+..+...+.+++.++..-.....
T Consensus       250 i~~~~~~L~~l~~~~~~~~~~l~~~~~~~~~~~~~~~Cp~C~~~~~~~~~~~~~l~d~i~~l~~~l~~l~~~i~  323 (562)
T PHA02562        250 IEDPSAALNKLNTAAAKIKSKIEQFQKVIKMYEKGGVCPTCTQQISEGPDRITKIKDKLKELQHSLEKLDTAID  323 (562)
T ss_pred             cccHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCCCcCCCcHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence                   2235555555666666665544322111 111    3334477788888888888888776444433


No 17 
>PF10174 Cast:  RIM-binding protein of the cytomatrix active zone;  InterPro: IPR019323  This entry represents a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion []. Located at the C terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). These proteins also contain four coiled-coil domains []. 
Probab=94.71  E-value=9.1  Score=42.83  Aligned_cols=84  Identities=30%  Similarity=0.423  Sum_probs=57.6

Q ss_pred             HHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhHHHHH
Q 047021           10 QKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRLKEAEE   89 (388)
Q Consensus        10 q~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRlkEAEE   89 (388)
                      +-++..+.++|..|..+++.+..+..+....++   ...-|.|         +|+.++  |++-.|.++.++..++.--.
T Consensus         2 q~ql~~~q~E~e~L~~ele~~~~~l~~~~~~i~---~fwspEl---------krer~~--rkee~a~l~~~k~qlr~~q~   67 (775)
T PF10174_consen    2 QAQLERLQRENERLRRELERKQSKLGSSMNSIK---TFWSPEL---------KRERAL--RKEEAAELSRLKEQLRVTQE   67 (775)
T ss_pred             ccHHHHHHHHHHHHHHHHHHHHhHHHHHHHhHh---cccchhh---------HHHHHH--HHHHHHHHHhHHHHHHHHHh
Confidence            345556667777777777766655555544332   2333543         344443  67788999999999988888


Q ss_pred             HHhhhHhhHHHHHHHHHH
Q 047021           90 EQYRAEEDAAALRAELNS  107 (388)
Q Consensus        90 EQyRAEEDaAaLRAEln~  107 (388)
                      +-.++.+...+|..+|.+
T Consensus        68 e~q~~~~ei~~LqeELr~   85 (775)
T PF10174_consen   68 ENQKAQEEIQALQEELRA   85 (775)
T ss_pred             hHHHHHHHHHHHHHHHHH
Confidence            888999999999988833


No 18 
>KOG0161 consensus Myosin class II heavy chain [Cytoskeleton]
Probab=94.47  E-value=16  Score=44.69  Aligned_cols=157  Identities=28%  Similarity=0.395  Sum_probs=83.6

Q ss_pred             hhhHHHHHHHHHHHhHHHHHHHhhhHhhHHHHHHHHHHHHHHHhcCCCCCCCCCCCChhhHHHHHHHHHHHHHHHHHHHH
Q 047021           70 REDLSAQLRTLKKRLKEAEEEQYRAEEDAAALRAELNSLQQQAMDGSLGGISSMGISPDHVQTLEKELATLKSQLQQESQ  149 (388)
Q Consensus        70 reDl~AQLR~~kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq~~~~~~s~~~~~g~s~~~~~~lekEl~~l~~~lqqes~  149 (388)
                      +..+-..|+.+..|+-       -.|+..+.|-.+.+.+++-...           -.+++.+++..+..+..+...-..
T Consensus       903 k~~le~~l~~~~~~~e-------~~ee~~~~le~~~~~~~~e~~~-----------l~~~~~~~E~~~~k~~~Ek~~~e~  964 (1930)
T KOG0161|consen  903 KQELEKELKELKERLE-------EEEEKNAELERKKRKLEQEVQE-----------LKEQLEELELTLQKLELEKNAAEN  964 (1930)
T ss_pred             HHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHH-----------HHHHHHHHHHHHHHHHHHHHHHHH
Confidence            4444444444444443       3344455566666665544321           123445555555555555555444


Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhhhhhhhhhhhcchhhHHHHHHHHhHHHHHHHHHHhhHHHHH
Q 047021          150 LRQQERKQLTEEQARISVLMSEKQELEEKLVMFSRKASEEVSNAAASKAFSVEDKEKLEKQLHDMAVVVERLESSRQKLL  229 (388)
Q Consensus       150 lrqqeqqrla~Eq~~~a~l~aEkqeLeekl~~~~~~~se~~~~~a~~k~~S~eDkekle~Ql~dma~~VeRLE~SRqKLL  229 (388)
                      -....+..++.=-..++.|..++.++++++..+............   .+ ..-+.||+.+|.||-.-.++=+..|..+ 
T Consensus       965 ~~~~l~~e~~~~~e~~~kL~kekk~lEe~~~~l~~~l~~~eek~~---~l-~k~~~kle~~l~~le~~le~e~~~r~e~- 1039 (1930)
T KOG0161|consen  965 KLKNLEEEINSLDENISKLSKEKKELEERIRELQDDLQAEEEKAK---SL-NKAKAKLEQQLDDLEVTLEREKRIRMEL- 1039 (1930)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HH-HHHHHHHHHHHHHHHHHHHHHHHHHHHH-
Confidence            444445555554566788888888888888887755333222110   01 2346777888777777776655554433 


Q ss_pred             HHhhhhhHHHHHHhhhhhhhhhhHHH
Q 047021          230 MEIDNQSSEIERLFEENSSLSSSYQE  255 (388)
Q Consensus       230 ~EID~QS~EIErLf~ENs~Lsss~qe  255 (388)
                         +   ..+-+|=+++..+-.+..+
T Consensus      1040 ---E---k~~rkle~el~~~~e~~~~ 1059 (1930)
T KOG0161|consen 1040 ---E---KAKRKLEGELKDLQESIEE 1059 (1930)
T ss_pred             ---H---HHHHHHHHHHHHhhhHHHH
Confidence               3   3334444555444444444


No 19 
>PF09730 BicD:  Microtubule-associated protein Bicaudal-D;  InterPro: IPR018477 BicD proteins consist of three coiled-coiled domains and are involved in dynein-mediated minus end-directed transport from the Golgi apparatus to the endoplasmic reticulum (ER) []. Glycogen synthase kinase-3beta (GSK-3beta) is required for the binding of BICD to dynein but not to dynactin, acting to maintain the anchoring of microtubules to the centromere []. It appears that amino-acid residues 437-617 of BicD and the kinase activity of GSK-3 are necessary for the formation of a complex between BicD and GSK-3beta in intact cells [].; GO: 0006810 transport, 0005794 Golgi apparatus
Probab=93.28  E-value=16  Score=40.64  Aligned_cols=151  Identities=27%  Similarity=0.292  Sum_probs=102.9

Q ss_pred             HHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhHHH
Q 047021            8 DFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRLKEA   87 (388)
Q Consensus         8 ~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRlkEA   87 (388)
                      .|..+..+|+.+.+-+...+.....+++-|....+++.        +.              .+++-.+-+.++.-+||.
T Consensus        31 ~~~~~i~~l~~elk~~~~~~~~~~~e~~rl~~~~~~~~--------~~--------------~~~~e~~~~~lr~e~ke~   88 (717)
T PF09730_consen   31 YLQQRILELENELKQLRQELSNVQAENERLSQLNQELR--------KE--------------CEDLELERKRLREEIKEY   88 (717)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--------HH--------------HHHHHHHHHHHHHHHHHH
Confidence            34556666666666666666666666666665555432        22              345555666667777777


Q ss_pred             HHHHhhhHhhHHHHHHHHHHHHHHHhcCCCCCCCCCCCChhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 047021           88 EEEQYRAEEDAAALRAELNSLQQQAMDGSLGGISSMGISPDHVQTLEKELATLKSQLQQESQLRQQERKQLTEEQARISV  167 (388)
Q Consensus        88 EEEQyRAEEDaAaLRAEln~lQqq~~~~~~s~~~~~g~s~~~~~~lekEl~~l~~~lqqes~lrqqeqqrla~Eq~~~a~  167 (388)
                      -.-..|-=-|-.-|-.|-=+||.|+..=- ++.+-|----.+|..|+.|+.-+.+++.....|+---..+|.+...-+..
T Consensus        89 K~rE~rll~dyselEeENislQKqvs~Lk-~sQvefE~~Khei~rl~Ee~~~l~~qlee~~rLk~iae~qleEALesl~~  167 (717)
T PF09730_consen   89 KFREARLLQDYSELEEENISLQKQVSVLK-QSQVEFEGLKHEIKRLEEEIELLNSQLEEAARLKEIAEKQLEEALESLKS  167 (717)
T ss_pred             HHHHHHHhhhhHHHHHHHHHHHHHHHHHH-HhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            77778888899999999999999994322 22222221234588899999999999999999988777777777655555


Q ss_pred             HHHHHHHHHHHHHH
Q 047021          168 LMSEKQELEEKLVM  181 (388)
Q Consensus       168 l~aEkqeLeekl~~  181 (388)
                      =+-.|-.|+..|..
T Consensus       168 EReqk~~LrkEL~~  181 (717)
T PF09730_consen  168 EREQKNALRKELDQ  181 (717)
T ss_pred             HHHHHHHHHHHHHH
Confidence            55556666666666


No 20 
>PF05622 HOOK:  HOOK protein;  InterPro: IPR008636 This family consists of several HOOK1, 2 and 3 proteins from different eukaryotic organisms. The different members of the Homo sapiens gene family are HOOK1, HOOK2 and HOOK3. Different domains have been identified in the three Homo sapiens HOOK proteins, and it was demonstrated that the highly conserved NH2-domain mediates attachment to microtubules, whereas the central coiled-coil motif mediates homodimerisation and the more divergent C-terminal domains are involved in binding to specific organelles (organelle-binding domains). It has been demonstrated that endogenous HOOK3 binds to Golgi membranes [], whereas both HOOK1 and HOOK2 are localised to discrete but unidentified cellular structures. In mice the Hook1 gene is predominantly expressed in the testis. Hook1 function is necessary for the correct positioning of microtubular structures within the haploid germ cell. Disruption of Hook1 function in mice causes abnormal sperm head shape and fragile attachment of the flagellum to the sperm head [].; GO: 0008017 microtubule binding, 0000226 microtubule cytoskeleton organization, 0005737 cytoplasm; PDB: 1WIX_A.
Probab=93.22  E-value=0.025  Score=60.44  Aligned_cols=132  Identities=26%  Similarity=0.346  Sum_probs=0.0

Q ss_pred             HHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhHHH
Q 047021            8 DFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRLKEA   87 (388)
Q Consensus         8 ~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRlkEA   87 (388)
                      .|.++|.+++.+...|....+....+++.|..+++..+..           +.-...-+.+...|+-+++|.++..+.+.
T Consensus       190 ~l~q~~~el~~~i~~L~~e~~~L~~e~~~l~~~~~~~~~~-----------~~~~~~~~~~~~~~l~~ql~~L~~el~~~  258 (713)
T PF05622_consen  190 ELAQRCHELEKQISDLQEEKESLQSENEELQERLSQLEGS-----------SEEPSQHLSVELADLRAQLRRLREELERL  258 (713)
T ss_dssp             --------------------------------------------------------------------------------
T ss_pred             HHHHHHHHHHHHHHHHHHhHHHhhhhhhhhhcccCCCCCC-----------CCCcchHHHHHHHHHHHHHHHHHHHHHHH
Confidence            3445555565555555555555555666666666655510           01111112233457888888888877766


Q ss_pred             HHHHhhhHhhHHHHHHHHHHHHHHHhcCCCC--CCC-------CCCCChhhHHHHHHHHHHHHHHHHHHHHH
Q 047021           88 EEEQYRAEEDAAALRAELNSLQQQAMDGSLG--GIS-------SMGISPDHVQTLEKELATLKSQLQQESQL  150 (388)
Q Consensus        88 EEEQyRAEEDaAaLRAEln~lQqq~~~~~~s--~~~-------~~g~s~~~~~~lekEl~~l~~~lqqes~l  150 (388)
                      |+-.--.+.....+..+|..|++....-..-  .+.       -+.-..+....++.++..|+-.|.--.-+
T Consensus       259 e~~~~d~~~~~e~le~ei~~L~q~~~eL~~~A~~a~~LrDElD~lR~~a~r~~klE~~ve~YKkKLed~~~l  330 (713)
T PF05622_consen  259 EEQRDDLKIELEELEKEIDELRQENEELQAEAREARALRDELDELREKADRADKLENEVEKYKKKLEDLEDL  330 (713)
T ss_dssp             ------------------------------------------------------------------------
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            5532222333445667777777776432111  111       12223333444666666666555543333


No 21 
>COG1196 Smc Chromosome segregation ATPases [Cell division and chromosome partitioning]
Probab=93.19  E-value=19  Score=41.19  Aligned_cols=42  Identities=14%  Similarity=0.137  Sum_probs=27.1

Q ss_pred             HHHHHHhhHHHHhHHHHHHHHHHHHHHHHHhhhhHHHHHHHH
Q 047021          328 GQLAKEQSRAEALSAQVLQLSAQLQQTTQAYNGLVRLYKPVL  369 (388)
Q Consensus       328 ~eL~kEqsRseaLsaqv~qLsAeL~ra~QA~n~L~Rly~PvL  369 (388)
                      .++-.-..|-+.|..+...|...........+.+-+..+.++
T Consensus       970 ee~e~~~~r~~~l~~~~~dl~~a~~~l~~~i~~~d~~~~~~f 1011 (1163)
T COG1196         970 EEYEEVEERYEELKSQREDLEEAKEKLLEVIEELDKEKRERF 1011 (1163)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            344455566677777777777777777766666666555544


No 22 
>TIGR00606 rad50 rad50. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=92.88  E-value=23  Score=41.15  Aligned_cols=188  Identities=15%  Similarity=0.164  Sum_probs=89.7

Q ss_pred             hhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhHHHHHHHh------hhHhhHHHHHHHHHHHHHHHhcCCCC
Q 047021           45 EQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRLKEAEEEQY------RAEEDAAALRAELNSLQQQAMDGSLG  118 (388)
Q Consensus        45 e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRlkEAEEEQy------RAEEDaAaLRAEln~lQqq~~~~~~s  118 (388)
                      ....+|.+++-+.++.-+.+.+.-.-+++-+.|-.++..+..++.-.-      |...+...++.+|..|......    
T Consensus       742 ~~~eip~l~~~l~~le~~l~~~~~~le~~~~~l~~~~~~~~~~esL~~~v~~i~r~~~ei~~l~~qie~l~~~l~~----  817 (1311)
T TIGR00606       742 KEKEIPELRNKLQKVNRDIQRLKNDIEEQETLLGTIMPEEESAKVCLTDVTIMERFQMELKDVERKIAQQAAKLQG----  817 (1311)
T ss_pred             HHhhchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcc----
Confidence            345678888888777666555555555555555555555554443211      2244555555555555444421    


Q ss_pred             CCCCCCCChhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhhhhhhhhhhh
Q 047021          119 GISSMGISPDHVQTLEKELATLKSQLQQESQLRQQERKQLTEEQARISVLMSEKQELEEKLVMFSRKASEEVSNAAASKA  198 (388)
Q Consensus       119 ~~~~~g~s~~~~~~lekEl~~l~~~lqqes~lrqqeqqrla~Eq~~~a~l~aEkqeLeekl~~~~~~~se~~~~~a~~k~  198 (388)
                          +|. ...+..++.++..+..++..-..-       +..-+.....+..+...|..++..+...-..      ... 
T Consensus       818 ----~~~-~~s~~ele~ei~~~~~el~~l~~~-------~e~l~~e~e~~~~eI~~Lq~ki~el~~~klk------l~~-  878 (1311)
T TIGR00606       818 ----SDL-DRTVQQVNQEKQEKQHELDTVVSK-------IELNRKLIQDQQEQIQHLKSKTNELKSEKLQ------IGT-  878 (1311)
T ss_pred             ----ccc-cCCHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------HHH-
Confidence                111 124566777777666555442111       1111222222333333333333322211000      000 


Q ss_pred             cchhhHHHHHHHHhHHHHHHHHHHhhHHHHHHHhhhhhHHHHHHhhhhhhhhhhHHHH
Q 047021          199 FSVEDKEKLEKQLHDMAVVVERLESSRQKLLMEIDNQSSEIERLFEENSSLSSSYQES  256 (388)
Q Consensus       199 ~S~eDkekle~Ql~dma~~VeRLE~SRqKLL~EID~QS~EIErLf~ENs~Lsss~qea  256 (388)
                       ....+..++.+|.++...++-+...=..+=.+|..-..+|+++..+...+.......
T Consensus       879 -~l~~r~~le~~L~el~~el~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  935 (1311)
T TIGR00606       879 -NLQRRQQFEEQLVELSTEVQSLIREIKDAKEQDSPLETFLEKDQQEKEELISSKETS  935 (1311)
T ss_pred             -HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHHHHHHHHHHHHH
Confidence             344666677776666655555544444454555555555555555555544444333


No 23 
>PRK11637 AmiB activator; Provisional
Probab=92.84  E-value=12  Score=37.86  Aligned_cols=40  Identities=23%  Similarity=0.231  Sum_probs=19.7

Q ss_pred             HHHHHHHHHHHhHHHHHHHhhhHhhHHHHHHHHHHHHHHH
Q 047021           73 LSAQLRTLKKRLKEAEEEQYRAEEDAAALRAELNSLQQQA  112 (388)
Q Consensus        73 l~AQLR~~kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq~  112 (388)
                      +..++..+++++...+.+--.++.+-..++..++.+-..+
T Consensus        94 ~~~~i~~~~~ei~~l~~eI~~~q~~l~~~~~~l~~rlra~  133 (428)
T PRK11637         94 TQNTLNQLNKQIDELNASIAKLEQQQAAQERLLAAQLDAA  133 (428)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            3444445555555555555555555555555554444333


No 24 
>PF15070 GOLGA2L5:  Putative golgin subfamily A member 2-like protein 5
Probab=92.81  E-value=17  Score=39.61  Aligned_cols=79  Identities=24%  Similarity=0.297  Sum_probs=50.0

Q ss_pred             HHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhHHHH
Q 047021            9 FQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRLKEAE   88 (388)
Q Consensus         9 lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRlkEAE   88 (388)
                      |..++....+.|..|+....+++.....|+.+|.++.... |...+-|.  +|+-|.+-++|  +++|=+.+|..|.|-+
T Consensus        99 L~~qlqaqv~~ne~Ls~L~~EqEerL~ELE~~le~~~e~~-~D~~kLLe--~lqsdk~t~SR--AlsQN~eLK~QL~Elq  173 (617)
T PF15070_consen   99 LEEQLQAQVENNEQLSRLNQEQEERLAELEEELERLQEQQ-EDRQKLLE--QLQSDKATASR--ALSQNRELKEQLAELQ  173 (617)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHh--hhcccchHHHH--HHHhHHHHHHHHHHHH
Confidence            3333333344455555555666666666666666655433 44555554  36778888888  5888899999999987


Q ss_pred             HHHh
Q 047021           89 EEQY   92 (388)
Q Consensus        89 EEQy   92 (388)
                      +-=.
T Consensus       174 ~~Fv  177 (617)
T PF15070_consen  174 DAFV  177 (617)
T ss_pred             HHHH
Confidence            7333


No 25 
>PF13514 AAA_27:  AAA domain
Probab=92.74  E-value=21  Score=40.53  Aligned_cols=161  Identities=18%  Similarity=0.283  Sum_probs=90.3

Q ss_pred             hhhHHHHHHHHHHHhHHHHHHHhhhHhhHHHHHHHHHHHHHHHhcC-CCCCCCCCCCChhhHHHHHHHHHHHHHHHHHHH
Q 047021           70 REDLSAQLRTLKKRLKEAEEEQYRAEEDAAALRAELNSLQQQAMDG-SLGGISSMGISPDHVQTLEKELATLKSQLQQES  148 (388)
Q Consensus        70 reDl~AQLR~~kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq~~~~-~~s~~~~~g~s~~~~~~lekEl~~l~~~lqqes  148 (388)
                      -+...+.++.+..++..++.+--+++.....+..++...+++.... ...|+ +-+++|..+...-..+.++...+.+-.
T Consensus       668 ~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~w~~~l~~~gL-~~~~~~~~~~~~l~~l~~l~~~~~~~~  746 (1111)
T PF13514_consen  668 WEQAAARREQLEEELQQLEQELEEAEAELQEAQEALEEWQEEWQEALAELGL-PADASPEEALEALELLEELREALAEIR  746 (1111)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC-CCCCCHHHHHHHHHHHHHHHHHHHHHH
Confidence            3455566666666666666666666666666666777666665322 22345 445677777776666665555544433


Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhhhhhh----h---hhhhcchhhHHHHHHHHhHHHHHHHHH
Q 047021          149 QLRQQERKQLTEEQARISVLMSEKQELEEKLVMFSRKASEEVSNA----A---ASKAFSVEDKEKLEKQLHDMAVVVERL  221 (388)
Q Consensus       149 ~lrqqeqqrla~Eq~~~a~l~aEkqeLeekl~~~~~~~se~~~~~----a---~~k~~S~eDkekle~Ql~dma~~VeRL  221 (388)
                      .+           ..++..+..+...++.++..+.....+..+..    +   ....+..  -..-..++..+...++.+
T Consensus       747 ~~-----------~~ri~~~~~~~~~f~~~~~~L~~~l~~~~~~~~~~~~~~~L~~~l~~--a~~~~~~~~~l~~~~~~~  813 (1111)
T PF13514_consen  747 EL-----------RRRIEQMEADLAAFEEQVAALAERLGPDLPEDPAEEALEALRARLEE--AREAQEERERLQEQLEEL  813 (1111)
T ss_pred             HH-----------HHHHHHHHHHHHHHHHHHHHHHHHcCcccccCcHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHH
Confidence            33           23556666777777777777665522211111    1   1222211  111123334444556666


Q ss_pred             HhhHHHHHHHhhhhhHHHHHHhh
Q 047021          222 ESSRQKLLMEIDNQSSEIERLFE  244 (388)
Q Consensus       222 E~SRqKLL~EID~QS~EIErLf~  244 (388)
                      +...+.+-.++..--.+|+.||.
T Consensus       814 ~~~~~~~~~~l~~~~~~l~~L~~  836 (1111)
T PF13514_consen  814 EEELEQAEEELEELEAELAELLE  836 (1111)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHH
Confidence            66666666677777777888887


No 26 
>PF05557 MAD:  Mitotic checkpoint protein;  InterPro: IPR008672 This family consists of several eukaryotic mitotic checkpoint (Mitotic arrest deficient or MAD) proteins. The mitotic spindle checkpoint monitors proper attachment of the bipolar spindle to the kinetochores of aligned sister chromatids and causes a cell cycle arrest in prometaphase when failures occur. Multiple components of the mitotic spindle checkpoint have been identified in Saccharomyces cerevisiae and higher eukaryotes. In Saccharomyces cerevisiae, the existence of a Mad1-dependent complex containing Mad2, Mad3, Bub3 and Cdc20 has been demonstrated [].; PDB: 1GO4_F 4DZO_A.
Probab=92.46  E-value=1.8  Score=46.69  Aligned_cols=138  Identities=23%  Similarity=0.314  Sum_probs=74.9

Q ss_pred             hhHHHHHHHHhHHHHHHHHHHhhHHHHHHHhhhhhHHHHHH--hhhhhhhhh-hHHHHHHHHHHHH----HHHHHHHHHH
Q 047021          202 EDKEKLEKQLHDMAVVVERLESSRQKLLMEIDNQSSEIERL--FEENSSLSS-SYQESMGIAKHWE----NQVKDCLKQN  274 (388)
Q Consensus       202 eDkekle~Ql~dma~~VeRLE~SRqKLL~EID~QS~EIErL--f~ENs~Lss-s~qea~~iaaqWE----nQvqDclkQN  274 (388)
                      +....|...       +.-|+....+|-.+|+.=..+|+++  -++...-.. -+.=...=++.|+    +.+.--=+-|
T Consensus       503 e~~~~L~~~-------~~~Le~e~~~L~~~~~~Le~~l~~~~L~g~~~~~~trVL~lr~NP~~~~~~~k~~~l~~L~~En  575 (722)
T PF05557_consen  503 EELNELQKE-------IEELERENERLRQELEELESELEKLTLQGEFNPSKTRVLHLRDNPTSKAEQIKKSTLEALQAEN  575 (722)
T ss_dssp             HHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHHCCCT--BTTTEEEEEESS-HHHHHHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHhhhccccCCCCceeeeeCCCcHHHHHHHHHHHHHHHHHHH
Confidence            444556666       7778888888888999888899983  122211000 0000011233343    2222222346


Q ss_pred             HHHHHHHHHHHhhhhccCCCCccccccccCCCCCCccccCCCchhhhhHHhhHHHHHHHhhHHHHhHHHHHHHHHHHHHH
Q 047021          275 EELRGILEKLRTEQASFMSVNDKDILRDFSEPSKDAVSQNGSETHKNEFLALKGQLAKEQSRAEALSAQVLQLSAQLQQT  354 (388)
Q Consensus       275 eeLr~lLeklR~EQa~~~~~~~~~~~g~~~e~~~dg~~~~~~~~~~te~l~LK~eL~kEqsRseaLsaqv~qLsAeL~ra  354 (388)
                      +.|+..|..+-.  .+..|..              -+...+......++..|+.++.....|..-|--=..+-|.+.+.|
T Consensus       576 ~~L~~~l~~le~--~~~~~~~--------------~~p~~~~~~~~~e~~~l~~~~~~~ekr~~RLkevf~~ks~eFr~a  639 (722)
T PF05557_consen  576 EDLLARLRSLEE--GNSQPVD--------------AVPTSSLESQEKEIAELKAELASAEKRNQRLKEVFKAKSQEFREA  639 (722)
T ss_dssp             HHHHHHHHHHTT--TT------------------------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHhccc--CCCCCcc--------------cccchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            666665544421  1111111              111111233345788899999998888888888888889999999


Q ss_pred             HHHhhhhH
Q 047021          355 TQAYNGLV  362 (388)
Q Consensus       355 ~QA~n~L~  362 (388)
                      +..+.|+-
T Consensus       640 v~~llGyk  647 (722)
T PF05557_consen  640 VYSLLGYK  647 (722)
T ss_dssp             HHHHHSEE
T ss_pred             HHHHhcce
Confidence            98888853


No 27 
>PF09728 Taxilin:  Myosin-like coiled-coil protein;  InterPro: IPR019132  Taxilin contains an extraordinarily long coiled-coil domain in its C-terminal half and is ubiquitously expressed. It is a novel binding partner of several syntaxin family members and is possibly involved in Ca(2+)-dependent exocytosis in neuroendocrine cells []. Gamma-taxilin, described as leucine zipper protein Factor Inhibiting ATF4-mediated Transcription (FIAT), localises to the nucleus in osteoblasts and dimerises with ATF4 to form inactive dimers, thus inhibiting ATF4-mediated transcription []. 
Probab=92.11  E-value=14  Score=36.82  Aligned_cols=254  Identities=22%  Similarity=0.319  Sum_probs=150.5

Q ss_pred             hHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhHHH-----
Q 047021           13 NAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRLKEA-----   87 (388)
Q Consensus        13 ~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRlkEA-----   87 (388)
                      +...++....|..+..+-..++..+++++..+++ .+-.+.+-...+.-|.+.++.+|+.|=.=-|-+++.-+..     
T Consensus        17 l~~~eeK~~~L~kk~~ell~e~k~~~k~~~~~~K-k~~~l~kek~~l~~E~~k~~~~k~KLE~LCRELQk~Nk~lkeE~~   95 (309)
T PF09728_consen   17 LSSPEEKLEALCKKYAELLEEMKRLQKQLKKLQK-KQEQLQKEKDQLQSELSKAILAKSKLESLCRELQKQNKKLKEESK   95 (309)
T ss_pred             CCChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            3333444444555555555555555555544432 2333444455556677788888887766656666554443     


Q ss_pred             ---HHHHhhhHhhHHHHHHHHHHHHHHHhcCCCCCCC-----------------CCCCChhh----HHHHHHHHHHHHHH
Q 047021           88 ---EEEQYRAEEDAAALRAELNSLQQQAMDGSLGGIS-----------------SMGISPDH----VQTLEKELATLKSQ  143 (388)
Q Consensus        88 ---EEEQyRAEEDaAaLRAEln~lQqq~~~~~~s~~~-----------------~~g~s~~~----~~~lekEl~~l~~~  143 (388)
                         .++.-+-.+=+.-+..-|+.||.++..+..-...                 .|.....|    +...+.|++=+.+.
T Consensus        96 ~~~~eee~kR~el~~kFq~~L~dIq~~~ee~~~~~~k~~~eN~~L~eKlK~l~eQye~rE~~~~~~~k~keLE~Ql~~AK  175 (309)
T PF09728_consen   96 RRAREEEEKRKELSEKFQATLKDIQAQMEEQSERNIKLREENEELREKLKSLIEQYELREEHFEKLLKQKELEVQLAEAK  175 (309)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHH
Confidence               3344444555677888889999988776544332                 22223333    33455566666667


Q ss_pred             HHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHhhhchhhhhhhh-hhhhcchhhHHHHHHHHhHHH
Q 047021          144 LQQESQLRQQERKQLTEEQA-------RISVLMSEKQELEEKLVMFSRKASEEVSNAA-ASKAFSVEDKEKLEKQLHDMA  215 (388)
Q Consensus       144 lqqes~lrqqeqqrla~Eq~-------~~a~l~aEkqeLeekl~~~~~~~se~~~~~a-~~k~~S~eDkekle~Ql~dma  215 (388)
                      |++......++.++...+..       .+..+...=.+|..+|..-+.+-.+--+..+ ++.+|.     .+.+.+..|.
T Consensus       176 l~q~~~~~~~e~~k~~~~~~~~l~~~~~~~~~~~~E~~Lr~QL~~Y~~Kf~efq~tL~kSNe~F~-----tfk~Emekm~  250 (309)
T PF09728_consen  176 LEQQQEEAEQEKEKAKQEKEILLEEAAQVQTLKETEKELREQLNLYSEKFEEFQDTLNKSNEVFE-----TFKKEMEKMS  250 (309)
T ss_pred             HHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHH-----HHHHHHHHHH
Confidence            77777777777777777777       8888888888888888888766555322222 455552     3334444444


Q ss_pred             HHHHHHHhhHHHHHHHhhhhhHHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q 047021          216 VVVERLESSRQKLLMEIDNQSSEIERLFEENSSLSSSYQESMGIAKHWENQVKDCLKQNEELRGILEKLRT  286 (388)
Q Consensus       216 ~~VeRLE~SRqKLL~EID~QS~EIErLf~ENs~Lsss~qea~~iaaqWEnQvqDclkQNeeLr~lLeklR~  286 (388)
                      -.+.+||.--..+-.=.|.-          |.    .+-++++=-..|..++.-+-+|++.|.++.--|..
T Consensus       251 Kk~kklEKE~~~~k~k~e~~----------n~----~l~~m~eer~~~~~~~~~~~~k~~kLe~LcRaLQ~  307 (309)
T PF09728_consen  251 KKIKKLEKENQTWKSKWEKS----------NK----ALIEMAEERQKLEKELEKLKKKIEKLEKLCRALQA  307 (309)
T ss_pred             HHHHHHHHHHHHHHHHHHHH----------hH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence            44666665443333222221          22    33444555566999999999999999988766654


No 28 
>TIGR02680 conserved hypothetical protein TIGR02680. Members of this protein family belong to a conserved gene four-gene neighborhood found sporadically in a phylogenetically broad range of bacteria: Nocardia farcinica, Symbiobacterium thermophilum, and Streptomyces avermitilis (Actinobacteria), Geobacillus kaustophilus (Firmicutes), Azoarcus sp. EbN1 and Ralstonia solanacearum (Betaproteobacteria). Proteins in this family average over 1400 amino acids in length.
Probab=92.08  E-value=30  Score=40.67  Aligned_cols=121  Identities=21%  Similarity=0.230  Sum_probs=59.3

Q ss_pred             HhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHh--hhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhHHHH
Q 047021           11 KKNAELEEKNSILSSKLQVKEVESESLQTRINEL--EQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRLKEAE   88 (388)
Q Consensus        11 ~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~--e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRlkEAE   88 (388)
                      .+.++|+++...|.-+|.....+...+..++..+  +....|+.+...        .|...-+.....+.+..+++..+.
T Consensus       742 ~ri~el~~~IaeL~~~i~~l~~~l~~l~~r~~~L~~e~~~~Ps~~dL~--------~A~~~l~~A~~~~~~a~~~l~~a~  813 (1353)
T TIGR02680       742 RRIAELDARLAAVDDELAELARELRALGARQRALADELAGAPSDRSLR--------AAHRRAAEAERQAESAERELARAA  813 (1353)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCchHHH--------HHHHHHHHHHHHHHHHHHHHHHHH
Confidence            4455555555555555555555555555555533  334556642211        111112222333444444444444


Q ss_pred             HHHhhhHhhHHHHHHHHHHHHHHHhcCCCCCCCCCCCChhhHHHHHHHHHHHHHHHHHHH
Q 047021           89 EEQYRAEEDAAALRAELNSLQQQAMDGSLGGISSMGISPDHVQTLEKELATLKSQLQQES  148 (388)
Q Consensus        89 EEQyRAEEDaAaLRAEln~lQqq~~~~~~s~~~~~g~s~~~~~~lekEl~~l~~~lqqes  148 (388)
                      ++=-.+..+.+..+.++...        +++. .++..++.+..+...+.+|...++.=.
T Consensus       814 ~~l~~a~~~~~~a~~~l~~a--------aa~l-~L~a~~~~l~~~~~aL~~y~~~l~~l~  864 (1353)
T TIGR02680       814 RKAAAAAAAWKQARRELERD--------AADL-DLPTDPDALEAVGLALKRFGDHLHTLE  864 (1353)
T ss_pred             HHHHHHHHHHHHHHHHHHHH--------HhcC-CCCCChhHHHHHHHHHHHHHHHHHHHH
Confidence            44444444444444444321        1222 344466777778888877777766633


No 29 
>PF04849 HAP1_N:  HAP1 N-terminal conserved region;  InterPro: IPR006933 This family is defined by an N-terminal conserved region found in several huntingtin-associated protein 1 (HAP1) homologues. HAP1 binds to huntingtin in a polyglutamine repeat-length-dependent manner. However, its possible role in the pathogenesis of Huntingtons disease is unclear. This family also includes a similar N-terminal conserved region from hypothetical protein products of ALS2CR3 genes found in the human juvenile amyotrophic lateral sclerosis critical region 2q33-2q34 [].
Probab=91.79  E-value=16  Score=36.98  Aligned_cols=166  Identities=20%  Similarity=0.305  Sum_probs=92.5

Q ss_pred             hhhh-HHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhh---HhhhhhhhhHHHHhhhhhhh---hhhHHHhhhhHHH
Q 047021            3 LSQL-KDFQKKNAELEEKNSILSSKLQVKEVESESLQTRIN---ELEQNTVPSLRKALKDVAME---KDAAVVAREDLSA   75 (388)
Q Consensus         3 ~~~v-~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Ln---d~e~n~v~slrkaL~~v~mE---kDaA~vareDl~A   75 (388)
                      .++| |.|-++|..|.+.|..|...+...-..+.-|.-.|+   +|=+.....-..+..+-+-.   |+.....----..
T Consensus        81 aA~iGqsLl~~N~~L~~~~~~le~~L~~~~e~v~qLrHeL~~kdeLL~~ys~~~ee~~~~~~~~~~~~~~~~~~~~~~~~  160 (306)
T PF04849_consen   81 AARIGQSLLEQNQDLSERNEALEEQLGAALEQVEQLRHELSMKDELLQIYSNDDEESEPESSESTPLRRNESSLSSQKCI  160 (306)
T ss_pred             HHHHhHHHHHhcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcHhhhcccccCCCccccccccccccccch
Confidence            3455 778888888888888888888888777777777776   22111111111111110000   1110000111236


Q ss_pred             HHHHHHHHhHHHHHHHhhhHhhHHHHHHHHHHHHHHHhcCCCCCC-CCCCCChhhHHHHHHHHHHHHHHHHHHHHHHHHH
Q 047021           76 QLRTLKKRLKEAEEEQYRAEEDAAALRAELNSLQQQAMDGSLGGI-SSMGISPDHVQTLEKELATLKSQLQQESQLRQQE  154 (388)
Q Consensus        76 QLR~~kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq~~~~~~s~~-~~~g~s~~~~~~lekEl~~l~~~lqqes~lrqqe  154 (388)
                      ++-.+++|+|.-|+|-+.=.+.|+-|..+-.+...+.-.- ..+. -.++....++..|..||+.     +.+...+|| 
T Consensus       161 ~le~Lq~Klk~LEeEN~~LR~Ea~~L~~et~~~EekEqqL-v~dcv~QL~~An~qia~LseELa~-----k~Ee~~rQQ-  233 (306)
T PF04849_consen  161 QLEALQEKLKSLEEENEQLRSEASQLKTETDTYEEKEQQL-VLDCVKQLSEANQQIASLSEELAR-----KTEENRRQQ-  233 (306)
T ss_pred             hHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhhccHHHHHH-HHHHHHHhhhcchhHHHHHHHHHH-----HHHHHHHHH-
Confidence            7889999999999988887777888876666554332111 1111 1445566778888888872     234444443 


Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q 047021          155 RKQLTEEQARISVLMSEKQELEEKLVMFS  183 (388)
Q Consensus       155 qqrla~Eq~~~a~l~aEkqeLeekl~~~~  183 (388)
                              .-|.+|+++.-.|+.++..++
T Consensus       234 --------EEIt~LlsqivdlQ~r~k~~~  254 (306)
T PF04849_consen  234 --------EEITSLLSQIVDLQQRCKQLA  254 (306)
T ss_pred             --------HHHHHHHHHHHHHHHHHHHHh
Confidence                    223455555555555444443


No 30 
>TIGR01005 eps_transp_fam exopolysaccharide transport protein family. The model describes the exopolysaccharide transport protein family in bacteria. The transport protein is part of a large genetic locus which is associated with exopolysaccharide (EPS) biosynthesis. Detailed molecular characterization and gene fusion analysis revealed atleast seven gene products are involved in the overall regulation, which among other things, include exopolysaccharide biosynthesis, property of conferring virulence and exopolysaccharide export.
Probab=91.41  E-value=13  Score=39.92  Aligned_cols=90  Identities=22%  Similarity=0.244  Sum_probs=47.9

Q ss_pred             HHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhHHHHHHHh---------hhHhhHHHHHHHHHHHHHHHhcCCCCCCCCCC
Q 047021           54 KALKDVAMEKDAAVVAREDLSAQLRTLKKRLKEAEEEQY---------RAEEDAAALRAELNSLQQQAMDGSLGGISSMG  124 (388)
Q Consensus        54 kaL~~v~mEkDaA~vareDl~AQLR~~kkRlkEAEEEQy---------RAEEDaAaLRAEln~lQqq~~~~~~s~~~~~g  124 (388)
                      .-|.++.-..-.|...+.+.-+++..+++.++.....-.         ...---+.||.++..++.+...-+.    .+|
T Consensus       237 ~~L~~l~~ql~~a~~~~~~a~a~~~~l~~~l~~~~~~~~~~~~~~~~~~~~~~i~~L~~~l~~l~~~~~~l~~----~y~  312 (754)
T TIGR01005       237 QQLAELNTELSRARANRAAAEGTADSVKKALQNGGSLDVLPEVLSSQLKLEDLIQRLRERQAELRATIADLST----TML  312 (754)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccchhhhhcCcccccHHHHHHHHHHHHHHHHHHHHHH----hhC
Confidence            345555444444555555666666666666653211000         0011234566666666665432111    345


Q ss_pred             CChhhHHHHHHHHHHHHHHHHHH
Q 047021          125 ISPDHVQTLEKELATLKSQLQQE  147 (388)
Q Consensus       125 ~s~~~~~~lekEl~~l~~~lqqe  147 (388)
                      +.+-.+..+..++++++.++.++
T Consensus       313 ~~hP~v~~l~~qi~~l~~~i~~e  335 (754)
T TIGR01005       313 ANHPRVVAAKSSLADLDAQIRSE  335 (754)
T ss_pred             CCCHHHHHHHHHHHHHHHHHHHH
Confidence            56666788888888888777653


No 31 
>PRK09039 hypothetical protein; Validated
Probab=91.31  E-value=17  Score=36.43  Aligned_cols=35  Identities=26%  Similarity=0.320  Sum_probs=19.2

Q ss_pred             HhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHH
Q 047021           43 ELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQL   77 (388)
Q Consensus        43 d~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQL   77 (388)
                      +++++.--.+...|.+|.-..++|.-.|++|-+++
T Consensus        70 ~le~~~~~~l~~~l~~l~~~l~~a~~~r~~Le~~~  104 (343)
T PRK09039         70 SLERQGNQDLQDSVANLRASLSAAEAERSRLQALL  104 (343)
T ss_pred             HHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            45555555566666555555555555555554433


No 32 
>PRK11637 AmiB activator; Provisional
Probab=90.99  E-value=19  Score=36.39  Aligned_cols=88  Identities=15%  Similarity=0.159  Sum_probs=41.7

Q ss_pred             hhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHh
Q 047021            5 QLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRL   84 (388)
Q Consensus         5 ~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRl   84 (388)
                      +++++++...+++.+...+++++.....+...+...|+.+... +..+.+.++.+.=+-+...-.-.++-+.+...++.+
T Consensus        48 ~l~~l~~qi~~~~~~i~~~~~~~~~~~~~l~~l~~qi~~~~~~-i~~~~~~i~~~~~ei~~l~~eI~~~q~~l~~~~~~l  126 (428)
T PRK11637         48 QLKSIQQDIAAKEKSVRQQQQQRASLLAQLKKQEEAISQASRK-LRETQNTLNQLNKQIDELNASIAKLEQQQAAQERLL  126 (428)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            4555566666666666555566655555555555555554332 233333443333333333333333344444444444


Q ss_pred             HHHHHHHhh
Q 047021           85 KEAEEEQYR   93 (388)
Q Consensus        85 kEAEEEQyR   93 (388)
                      +.-=-..|+
T Consensus       127 ~~rlra~Y~  135 (428)
T PRK11637        127 AAQLDAAFR  135 (428)
T ss_pred             HHHHHHHHH
Confidence            444444455


No 33 
>PF07888 CALCOCO1:  Calcium binding and coiled-coil domain (CALCOCO1) like;  InterPro: IPR012852 Proteins found in this family are similar to the coiled-coil transcriptional coactivator protein expressed by Mus musculus (CoCoA, Q8CGU1 from SWISSPROT). This protein binds to a highly conserved N-terminal domain of p160 coactivators, such as GRIP1 (Q61026 from SWISSPROT), and thus enhances transcriptional activation by a number of nuclear receptors. CoCoA has a central coiled-coil region with three leucine zipper motifs, which is required for its interaction with GRIP1 and may regulate the autonomous transcriptional activation activity of the C-terminal region []. 
Probab=90.98  E-value=27  Score=37.98  Aligned_cols=75  Identities=20%  Similarity=0.315  Sum_probs=52.0

Q ss_pred             hHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHH
Q 047021            6 LKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLK   81 (388)
Q Consensus         6 v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~k   81 (388)
                      ..+|.+.|..|+.++..|..+++....+.....++...+... +-.+......++.|+|-...-.+++.+.++.+.
T Consensus       152 ~eeL~~~~~~Le~e~~~l~~~v~~l~~eL~~~~ee~e~L~~~-~kel~~~~e~l~~E~~~L~~q~~e~~~ri~~LE  226 (546)
T PF07888_consen  152 KEELLKENEQLEEEVEQLREEVERLEAELEQEEEEMEQLKQQ-QKELTESSEELKEERESLKEQLAEARQRIRELE  226 (546)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            356777777777777777777777777777777766666654 334555666778888887777777666665554


No 34 
>KOG0980 consensus Actin-binding protein SLA2/Huntingtin-interacting protein Hip1 [Cytoskeleton]
Probab=90.95  E-value=35  Score=39.28  Aligned_cols=119  Identities=24%  Similarity=0.295  Sum_probs=67.4

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhhhhhhhhhhhcchhhHHHHHHHHhHHHHHHHHHHhhHHHHHHHhh
Q 047021          154 ERKQLTEEQARISVLMSEKQELEEKLVMFSRKASEEVSNAAASKAFSVEDKEKLEKQLHDMAVVVERLESSRQKLLMEID  233 (388)
Q Consensus       154 eqqrla~Eq~~~a~l~aEkqeLeekl~~~~~~~se~~~~~a~~k~~S~eDkekle~Ql~dma~~VeRLE~SRqKLL~EID  233 (388)
                      ++|+|..|.   +++.+.-.+++.....+....-.   ..+...-+ -.+++|+.+.-+.=+.+.+...---.+|=+.+-
T Consensus       387 e~eqLr~el---aql~a~r~q~eka~~~~ee~e~~---~l~~e~ry-~klkek~t~l~~~h~~lL~K~~di~kQle~~~~  459 (980)
T KOG0980|consen  387 EQEQLRNEL---AQLLASRTQLEKAQVLVEEAENK---ALAAENRY-EKLKEKYTELRQEHADLLRKYDDIQKQLESAEQ  459 (980)
T ss_pred             HHHHHHHHH---HHHHHHHHHHHHHHHHHHhHHHH---HHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            455666665   56666666666554332111000   00112222 356777776665555555544433333333222


Q ss_pred             hhhHHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 047021          234 NQSSEIERLFEENSSLSSSYQESMGIAKHWENQVKDCLKQNEELRGILEK  283 (388)
Q Consensus       234 ~QS~EIErLf~ENs~Lsss~qea~~iaaqWEnQvqDclkQNeeLr~lLek  283 (388)
                      +    |+.+-++|.+|...+-+.-..+-+||.-..+-.+..+.||-=|..
T Consensus       460 s----~~~~~~~~~~L~d~le~~~~~~~~~~~K~e~~~~~le~l~~El~~  505 (980)
T KOG0980|consen  460 S----IDDVEEENTNLNDQLEELQRAAGRAETKTESQAKALESLRQELAL  505 (980)
T ss_pred             h----HHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHH
Confidence            2    337788999999999999999999998776665555555544433


No 35 
>KOG1029 consensus Endocytic adaptor protein intersectin [Signal transduction mechanisms; Intracellular trafficking, secretion, and vesicular transport]
Probab=90.65  E-value=37  Score=39.00  Aligned_cols=228  Identities=23%  Similarity=0.265  Sum_probs=116.8

Q ss_pred             hHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHH--------HHHhhHhhhhhhhhHHHHhhh---hhhhh--------hhH
Q 047021            6 LKDFQKKNAELEEKNSILSSKLQVKEVESESL--------QTRINELEQNTVPSLRKALKD---VAMEK--------DAA   66 (388)
Q Consensus         6 v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L--------~~~Lnd~e~n~v~slrkaL~~---v~mEk--------DaA   66 (388)
                      -.++.++|++||+.++.|.-+-+.-..+++-.        ++.....|++.--.|.|-|..   |-++|        .+-
T Consensus       319 keNy~kGqaELerRRq~leeqqqreree~eqkEreE~ekkererqEqErk~qlElekqLerQReiE~qrEEerkkeie~r  398 (1118)
T KOG1029|consen  319 KENYEKGQAELERRRQALEEQQQREREEVEQKEREEEEKKERERQEQERKAQLELEKQLERQREIERQREEERKKEIERR  398 (1118)
T ss_pred             HHhHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            35789999999999998876544333333222        223334444444444444421   11111        011


Q ss_pred             HHhhhhHHHHHHH--HHHHhHHHHHHHhhhHhhHHHHHHHHHHHHHHHhcCCCCCCCCCCCChhhHHHHHHHHHHHHHHH
Q 047021           67 VVAREDLSAQLRT--LKKRLKEAEEEQYRAEEDAAALRAELNSLQQQAMDGSLGGISSMGISPDHVQTLEKELATLKSQL  144 (388)
Q Consensus        67 ~vareDl~AQLR~--~kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq~~~~~~s~~~~~g~s~~~~~~lekEl~~l~~~l  144 (388)
                      -.||++|--|-+.  .+-|.+|--..--|--|+--.+.|-.+.|++...+           ..+.++.|+..+.|+...+
T Consensus       399 Eaar~ElEkqRqlewErar~qem~~Qk~reqe~iv~~nak~~ql~~elet-----------Ln~k~qqls~kl~Dvr~~~  467 (1118)
T KOG1029|consen  399 EAAREELEKQRQLEWERARRQEMLNQKNREQEWIVYLNAKKKQLQQELET-----------LNFKLQQLSGKLQDVRVDI  467 (1118)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHH-----------HHHHHHHHhhhhhhheecc
Confidence            1234444433332  23333333333334445555555555566555422           2334444555555444433


Q ss_pred             HHH--------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhhhhhhh-hhhhcchhhHHHHHH
Q 047021          145 QQE--------------SQLRQQERKQLTEEQARISVLMSEKQELEEKLVMFSRKASEEVSNAA-ASKAFSVEDKEKLEK  209 (388)
Q Consensus       145 qqe--------------s~lrqqeqqrla~Eq~~~a~l~aEkqeLeekl~~~~~~~se~~~~~a-~~k~~S~eDkekle~  209 (388)
                      ...              -.-+.+.++|+.+=|..+-.|.-|+|.|..++-..-..+-+..+... -+.+|  ..|+-+.+
T Consensus       468 tt~kt~ie~~~~q~e~~isei~qlqarikE~q~kl~~l~~Ekq~l~~qlkq~q~a~~~~~~~~s~L~aa~--~~ke~irq  545 (1118)
T KOG1029|consen  468 TTQKTEIEEVTKQRELMISEIDQLQARIKELQEKLQKLAPEKQELNHQLKQKQSAHKETTQRKSELEAAR--RKKELIRQ  545 (1118)
T ss_pred             chHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHhhhhccCcchHHHHHHHHH--HHHHHHHH
Confidence            221              11112567777778888888888888888888776655554433221 13333  55777788


Q ss_pred             HHhHHHHHHHHHHhhHHHHHHHhhhhhHHHHHHhhhhhhh
Q 047021          210 QLHDMAVVVERLESSRQKLLMEIDNQSSEIERLFEENSSL  249 (388)
Q Consensus       210 Ql~dma~~VeRLE~SRqKLL~EID~QS~EIErLf~ENs~L  249 (388)
                      +|.|-++.|+.=.   .+=|.|||+=-.-.+.|-++-+..
T Consensus       546 ~ikdqldelskE~---esk~~eidi~n~qlkelk~~~~~q  582 (1118)
T KOG1029|consen  546 AIKDQLDELSKET---ESKLNEIDIFNNQLKELKEDVNSQ  582 (1118)
T ss_pred             HHHHHHHHHHHHH---HHHHHhhhhHHHHHHHHHHHHHHH
Confidence            8777666555322   223556665444444444443333


No 36 
>PF15619 Lebercilin:  Ciliary protein causing Leber congenital amaurosis disease
Probab=90.45  E-value=15  Score=34.37  Aligned_cols=93  Identities=27%  Similarity=0.428  Sum_probs=53.7

Q ss_pred             HHHHHHHHHhHHHHHHHhhhHhhHHHHHHHHHHHHHHHhcCCCCCCCCCCCChhhHHHHHHHHHHHHHHHHHHHHHHH--
Q 047021           75 AQLRTLKKRLKEAEEEQYRAEEDAAALRAELNSLQQQAMDGSLGGISSMGISPDHVQTLEKELATLKSQLQQESQLRQ--  152 (388)
Q Consensus        75 AQLR~~kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq~~~~~~s~~~~~g~s~~~~~~lekEl~~l~~~lqqes~lrq--  152 (388)
                      .+.|.+.+++++.+.+=|+       ++.+++.|++-++..+...          ...|..++..+...++..-.-.+  
T Consensus        82 ~~~r~~~~klk~~~~el~k-------~~~~l~~L~~L~~dknL~e----------ReeL~~kL~~~~~~l~~~~~ki~~L  144 (194)
T PF15619_consen   82 EQERELERKLKDKDEELLK-------TKDELKHLKKLSEDKNLAE----------REELQRKLSQLEQKLQEKEKKIQEL  144 (194)
T ss_pred             HHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHcCCchh----------HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            4667777788887777776       6777888888775544322          34455555555555444222111  


Q ss_pred             ---------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q 047021          153 ---------QERKQLTEEQARISVLMSEKQELEEKLVMFSR  184 (388)
Q Consensus       153 ---------qeqqrla~Eq~~~a~l~aEkqeLeekl~~~~~  184 (388)
                               --+..++.|...+..++.+.+.|...+.-+..
T Consensus       145 ek~leL~~k~~~rql~~e~kK~~~~~~~~~~l~~ei~~L~~  185 (194)
T PF15619_consen  145 EKQLELENKSFRRQLASEKKKHKEAQEEVKSLQEEIQRLNQ  185 (194)
T ss_pred             HHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence                     13444556666666666666666655555443


No 37 
>PF00769 ERM:  Ezrin/radixin/moesin family;  InterPro: IPR011259 The ERM family consists of three closely-related proteins, ezrin, radixin and moesin []. Ezrin was first identified as a constituent of microvilli [], radixin as a barbed, end-capping actin-modulating protein from isolated junctional fractions [], and moesin as a heparin binding protein []. A tumour suppressor molecule responsible for neurofibromatosis type 2 (NF2) is highly similar to ERM proteins and has been designated merlin (moesin-ezrin-radixin-like protein). ERM molecules contain 3 domains, an N-terminal globular domain; an extended alpha-helical domain; and a charged C-terminal domain []. Ezrin, radixin and merlin also contain a polyproline region between the helical and C-terminal domains. The N-terminal domain is highly conserved, and is also found in merlin, band 4.1 proteins and members of the band 4.1 superfamily. ERM proteins crosslink actin filaments with plasma membranes. They co-localise with CD44 at actin filament-plasma membrane interaction sites, associating with CD44 via their N-terminal domains and with actin filaments via their C-terminal domains [].; GO: 0008092 cytoskeletal protein binding, 0005737 cytoplasm, 0019898 extrinsic to membrane; PDB: 2I1J_A 2I1K_A 1E5W_A 1EF1_C.
Probab=90.10  E-value=18  Score=34.71  Aligned_cols=29  Identities=17%  Similarity=0.264  Sum_probs=18.9

Q ss_pred             HHHHHHHhhHHHHhHHHHHHHHHHHHHHH
Q 047021          327 KGQLAKEQSRAEALSAQVLQLSAQLQQTT  355 (388)
Q Consensus       327 K~eL~kEqsRseaLsaqv~qLsAeL~ra~  355 (388)
                      ..+-+...-++.-|--|+..|..+|..+.
T Consensus       174 EeeR~t~~EKnk~lq~QL~~L~~EL~~~k  202 (246)
T PF00769_consen  174 EEERVTYAEKNKRLQEQLKELKSELEQLK  202 (246)
T ss_dssp             GGC---HHHH-HHHHHHHHHHHHHHHTTB
T ss_pred             HHHHHHHHHhhHHHHHHHHHHHHHHHHHh
Confidence            34445566677788889999999997665


No 38 
>PF04849 HAP1_N:  HAP1 N-terminal conserved region;  InterPro: IPR006933 This family is defined by an N-terminal conserved region found in several huntingtin-associated protein 1 (HAP1) homologues. HAP1 binds to huntingtin in a polyglutamine repeat-length-dependent manner. However, its possible role in the pathogenesis of Huntingtons disease is unclear. This family also includes a similar N-terminal conserved region from hypothetical protein products of ALS2CR3 genes found in the human juvenile amyotrophic lateral sclerosis critical region 2q33-2q34 [].
Probab=90.10  E-value=22  Score=36.12  Aligned_cols=84  Identities=31%  Similarity=0.347  Sum_probs=50.9

Q ss_pred             hhhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHH
Q 047021            4 SQLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKR   83 (388)
Q Consensus         4 ~~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkR   83 (388)
                      .++..||+|+..||++|..|.++......+-..++    +-|+..|                     .|.+.||+.+...
T Consensus       160 ~~le~Lq~Klk~LEeEN~~LR~Ea~~L~~et~~~E----ekEqqLv---------------------~dcv~QL~~An~q  214 (306)
T PF04849_consen  160 IQLEALQEKLKSLEEENEQLRSEASQLKTETDTYE----EKEQQLV---------------------LDCVKQLSEANQQ  214 (306)
T ss_pred             hhHHHHHHHHHHHHHHHHHHHHHHHHhhHHHhhcc----HHHHHHH---------------------HHHHHHhhhcchh
Confidence            46788999999999999999886554333332221    1222222                     2456666666666


Q ss_pred             hHHHHHHHhhhHhhHHHHHHHHHHHHHHH
Q 047021           84 LKEAEEEQYRAEEDAAALRAELNSLQQQA  112 (388)
Q Consensus        84 lkEAEEEQyRAEEDaAaLRAEln~lQqq~  112 (388)
                      +..--+|=-+--||...-.-||-+|+.|.
T Consensus       215 ia~LseELa~k~Ee~~rQQEEIt~Llsqi  243 (306)
T PF04849_consen  215 IASLSEELARKTEENRRQQEEITSLLSQI  243 (306)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            66666666666666655555555555443


No 39 
>PF05557 MAD:  Mitotic checkpoint protein;  InterPro: IPR008672 This family consists of several eukaryotic mitotic checkpoint (Mitotic arrest deficient or MAD) proteins. The mitotic spindle checkpoint monitors proper attachment of the bipolar spindle to the kinetochores of aligned sister chromatids and causes a cell cycle arrest in prometaphase when failures occur. Multiple components of the mitotic spindle checkpoint have been identified in Saccharomyces cerevisiae and higher eukaryotes. In Saccharomyces cerevisiae, the existence of a Mad1-dependent complex containing Mad2, Mad3, Bub3 and Cdc20 has been demonstrated [].; PDB: 1GO4_F 4DZO_A.
Probab=88.79  E-value=6.2  Score=42.65  Aligned_cols=23  Identities=17%  Similarity=0.333  Sum_probs=16.1

Q ss_pred             HHHHHHHHHHHHHHHHHHHHhhh
Q 047021          163 ARISVLMSEKQELEEKLVMFSRK  185 (388)
Q Consensus       163 ~~~a~l~aEkqeLeekl~~~~~~  185 (388)
                      .-+..|.+|-+.|...+..+...
T Consensus       566 ~~l~~L~~En~~L~~~l~~le~~  588 (722)
T PF05557_consen  566 STLEALQAENEDLLARLRSLEEG  588 (722)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHTTT
T ss_pred             HHHHHHHHHHHHHHHHHHhcccC
Confidence            44567777888888777776544


No 40 
>PF01576 Myosin_tail_1:  Myosin tail;  InterPro: IPR002928 Muscle contraction is caused by sliding between the thick and thin filaments of the myofibril. Myosin is a major component of thick filaments and exists as a hexamer of 2 heavy chains [], 2 alkali light chains, and 2 regulatory light chains. The heavy chain can be subdivided into the N-terminal globular head and the C-terminal coiled-coil rod-like tail, although some forms have a globular region in their C-terminal. There are many cell-specific isoforms of myosin heavy chains, coded for by a multi-gene family []. Myosin interacts with actin to convert chemical energy, in the form of ATP, to mechanical energy []. The 3-D structure of the head portion of myosin has been determined [] and a model for actin-myosin complex has been constructed []. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament []. The coiled-coil region provides the structural backbone of the thick filament [].; GO: 0003774 motor activity, 0016459 myosin complex; PDB: 2LNK_C 3ZWH_Q.
Probab=88.53  E-value=0.14  Score=56.64  Aligned_cols=304  Identities=27%  Similarity=0.347  Sum_probs=0.0

Q ss_pred             HHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhHHHHHHHhh
Q 047021           14 AELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRLKEAEEEQYR   93 (388)
Q Consensus        14 ~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRlkEAEEEQyR   93 (388)
                      ++|-.++.-++......+-++..|..+++|+-.+. .++.|+       |.-+--.+-.+-+||..++.++.+.+-.--.
T Consensus       134 ~eL~eqle~lqk~k~~lEK~k~~l~~e~~dL~~~l-~~~~k~-------k~~~Ek~~K~lE~qL~El~~klee~er~~~e  205 (859)
T PF01576_consen  134 AELNEQLEQLQKQKAKLEKEKSQLEAELDDLQAQL-DSLQKA-------KQEAEKKRKQLEAQLNELQAKLEESERQRNE  205 (859)
T ss_dssp             --------------------------------------------------------------------------------
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHH-HHHHHH-------HHHHHhHHhhHHHHHHHHHHHHHHHHHHHHH
Confidence            44555555555555555555555555555543322 333333       3333333555677888888888887776666


Q ss_pred             hHhhHHHHHHHHHHHHHHHhcCCCCCCCCCCCChhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 047021           94 AEEDAAALRAELNSLQQQAMDGSLGGISSMGISPDHVQTLEKELATLKSQLQQESQLRQQERKQLTEEQARISVLMSEKQ  173 (388)
Q Consensus        94 AEEDaAaLRAEln~lQqq~~~~~~s~~~~~g~s~~~~~~lekEl~~l~~~lqqes~lrqqeqqrla~Eq~~~a~l~aEkq  173 (388)
                      .....+-|.+|+..|..+......... .   ..-....|+..|.+++.+|..++..+.....++..-..-+..|.....
T Consensus       206 l~~~k~kL~~E~~eL~~qLee~e~~~~-~---l~r~k~~L~~qLeelk~~leeEtr~k~~L~~~l~~le~e~~~L~eqle  281 (859)
T PF01576_consen  206 LTEQKAKLQSENSELTRQLEEAESQLS-Q---LQREKSSLESQLEELKRQLEEETRAKQALEKQLRQLEHELEQLREQLE  281 (859)
T ss_dssp             --------------------------------------------------------------------------------
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHH-H---HHHHHHHHHHHHHhhHHHHHhHhhhhhhhHHHHHHHHHHHHHHHHHHh
Confidence            667777888888888888754432211 1   222344578888888888888776655433322211111111211111


Q ss_pred             HHHHHHHHHhhhchhhhhhhhhhhh-c---------c-hhhHHHHHH-------HHhHHHHHHHHHHhhHHHHHHHhhhh
Q 047021          174 ELEEKLVMFSRKASEEVSNAAASKA-F---------S-VEDKEKLEK-------QLHDMAVVVERLESSRQKLLMEIDNQ  235 (388)
Q Consensus       174 eLeekl~~~~~~~se~~~~~a~~k~-~---------S-~eDkekle~-------Ql~dma~~VeRLE~SRqKLL~EID~Q  235 (388)
                      +-.+....+.+.-+....+.+.++. |         . .+-|-+|.+       ++.+.-..+..|+-.+.+|-.||+-=
T Consensus       282 eE~e~k~~l~~qlsk~~~El~~~k~K~e~e~~~~~EelEeaKKkL~~~L~el~e~le~~~~~~~~LeK~k~rL~~EleDl  361 (859)
T PF01576_consen  282 EEEEAKSELERQLSKLNAELEQWKKKYEEEAEQRTEELEEAKKKLERKLQELQEQLEEANAKVSSLEKTKKRLQGELEDL  361 (859)
T ss_dssp             --------------------------------------------------------------------------------
T ss_pred             hhhhhHHHHHHHHHHHhhHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            1111111111111111111111111 0         0 122334444       44444555667777777777777766


Q ss_pred             hHHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCCccccccccCCCCCCccccCC
Q 047021          236 SSEIERLFEENSSLSSSYQESMGIAKHWENQVKDCLKQNEELRGILEKLRTEQASFMSVNDKDILRDFSEPSKDAVSQNG  315 (388)
Q Consensus       236 S~EIErLf~ENs~Lsss~qea~~iaaqWEnQvqDclkQNeeLr~lLeklR~EQa~~~~~~~~~~~g~~~e~~~dg~~~~~  315 (388)
                      ..++|+....++              .++.-.+-.=+++.+++.-++.+..|-..                     ....
T Consensus       362 ~~eLe~~~~~~~--------------~LeKKqr~fDk~l~e~k~~~~~~~~e~d~---------------------~q~e  406 (859)
T PF01576_consen  362 TSELEKAQAAAA--------------ELEKKQRKFDKQLAEWKAKVEELQAERDA---------------------AQRE  406 (859)
T ss_dssp             --------------------------------------------------------------------------------
T ss_pred             HHHHHHHHHHHH--------------HHHHHHHhHHHHHHHHHHHHHHHHHHHHH---------------------HHHH
Confidence            666666544444              44444444445555555544433321111                     1122


Q ss_pred             CchhhhhHHhhHHHHHHHhhHHHHhHHHHHHHHHHHHHHHHHhhhhHHH
Q 047021          316 SETHKNEFLALKGQLAKEQSRAEALSAQVLQLSAQLQQTTQAYNGLVRL  364 (388)
Q Consensus       316 ~~~~~te~l~LK~eL~kEqsRseaLsaqv~qLsAeL~ra~QA~n~L~Rl  364 (388)
                      ...+.|++.+|+..|-......+.|.-+...|..+|.-.+-....-++.
T Consensus       407 ~r~~~te~~~Lk~~lee~~e~~e~lere~k~L~~El~dl~~q~~~~~k~  455 (859)
T PF01576_consen  407 ARELETELFKLKNELEELQEQLEELERENKQLQDELEDLTSQLDDAGKS  455 (859)
T ss_dssp             -------------------------------------------------
T ss_pred             hHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhhccchhhhhhhccc
Confidence            5567788999999998888889999999999999988877666655554


No 41 
>COG4477 EzrA Negative regulator of septation ring formation [Cell division and chromosome partitioning]
Probab=88.03  E-value=45  Score=36.49  Aligned_cols=125  Identities=26%  Similarity=0.418  Sum_probs=87.7

Q ss_pred             hhhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHH
Q 047021            4 SQLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKR   83 (388)
Q Consensus         4 ~~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkR   83 (388)
                      +.|+.+++++.+++....-+...++..++.--.|+..|++        +.++|.+|.=+-       +++..-|+.++| 
T Consensus       347 ~~vr~~e~eL~el~~~~~~i~~~~~~~~~~yS~lq~~l~~--------~~~~l~~i~~~q-------~~~~e~L~~Lrk-  410 (570)
T COG4477         347 GSVRKFEKELKELESVLDEILENIEAQEVAYSELQDNLEE--------IEKALTDIEDEQ-------EKVQEHLTSLRK-  410 (570)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHhhcccccHHHHHHHHHH--------HHHHHHHHhhhH-------HHHHHHHHHHHH-
Confidence            4678888888888888888888888888888888877664        567776665442       345555555544 


Q ss_pred             hHHHHHHHhhhHhhHHHHHHHHHHHHHHHhcCCCCCCC-----CCCCChhhHHHHHHHHHHHHHHHHHHHHH
Q 047021           84 LKEAEEEQYRAEEDAAALRAELNSLQQQAMDGSLGGIS-----SMGISPDHVQTLEKELATLKSQLQQESQL  150 (388)
Q Consensus        84 lkEAEEEQyRAEEDaAaLRAEln~lQqq~~~~~~s~~~-----~~g~s~~~~~~lekEl~~l~~~lqqes~l  150 (388)
                            +--.|.+++-.++..|-.+-+-+-.....|+|     -+-...++++++.+++.+.-....+-+.+
T Consensus       411 ------dEl~Are~l~~~~~~l~eikR~mek~nLPGlPe~~l~l~~~~~~~i~~l~~eLse~pinm~~v~~~  476 (570)
T COG4477         411 ------DELEARENLERLKSKLHEIKRYMEKSNLPGLPETFLSLFFTAGHEIQDLMKELSEVPINMEAVSAL  476 (570)
T ss_pred             ------HHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCcHHHHHHHHhhhhHHHHHHHHHhhcCCcHHHHHHH
Confidence                  45567889999999999999999888888886     12234455666666666554444444444


No 42 
>PF09726 Macoilin:  Transmembrane protein;  InterPro: IPR019130  This entry represents the multi-pass transmembrane protein Macoilin, which is highly conserved in eukaryotes. ; GO: 0016021 integral to membrane
Probab=87.91  E-value=49  Score=36.73  Aligned_cols=179  Identities=31%  Similarity=0.415  Sum_probs=90.2

Q ss_pred             HHHHhHHhhhhHHHHHHhhHHHHHHhhHh------hhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhHHHHH
Q 047021           16 LEEKNSILSSKLQVKEVESESLQTRINEL------EQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRLKEAEE   89 (388)
Q Consensus        16 le~~n~~L~sk~~~k~~~~~~L~~~Lnd~------e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRlkEAEE   89 (388)
                      |....+-+.+.|+.+..++|.|+++++.|      |+.+.-+|+|=|   ..|++    +|--+=+||...+|+-+++|+
T Consensus       451 l~~~Er~lk~eL~qlr~ene~Lq~Kl~~L~~aRq~DKq~l~~LEkrL---~eE~~----~R~~lEkQL~eErk~r~~ee~  523 (697)
T PF09726_consen  451 LTNNERSLKSELSQLRQENEQLQNKLQNLVQARQQDKQSLQQLEKRL---AEERR----QRASLEKQLQEERKARKEEEE  523 (697)
T ss_pred             ccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHH----HHHHHHHHHHHHHHHHhHHHH
Confidence            33333455566777777777777777744      333344444444   23333    345566777777776666443


Q ss_pred             HHhhhHhhHHHHHHHHHHHHHHHhcCCCCCCCCCCCChh----hHHHHHHHHHHHHHHHHH-HHHHHHHHHHH--HH---
Q 047021           90 EQYRAEEDAAALRAELNSLQQQAMDGSLGGISSMGISPD----HVQTLEKELATLKSQLQQ-ESQLRQQERKQ--LT---  159 (388)
Q Consensus        90 EQyRAEEDaAaLRAEln~lQqq~~~~~~s~~~~~g~s~~----~~~~lekEl~~l~~~lqq-es~lrqqeqqr--la---  159 (388)
                              + +.|+=-         .   ..+.-+...|    ..+.||.|+..|..+|+. +..++..+.+.  +.   
T Consensus       524 --------~-aar~~~---------~---~~~~r~e~~e~~r~r~~~lE~E~~~lr~elk~kee~~~~~e~~~~~lr~~~  582 (697)
T PF09726_consen  524 --------K-AARALA---------Q---AQATRQECAESCRQRRRQLESELKKLRRELKQKEEQIRELESELQELRKYE  582 (697)
T ss_pred             --------h-hhhccc---------c---chhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence                    1 222200         0   0111122233    356688888887777766 33333333221  11   


Q ss_pred             HH-HHHHHHHHHHHHHHHHHHHHHhhhchhhhhhhhhhhhcchhhHHHHH--HHHhHHHHHHHHHHhhHHHHHHHhhhhh
Q 047021          160 EE-QARISVLMSEKQELEEKLVMFSRKASEEVSNAAASKAFSVEDKEKLE--KQLHDMAVVVERLESSRQKLLMEIDNQS  236 (388)
Q Consensus       160 ~E-q~~~a~l~aEkqeLeekl~~~~~~~se~~~~~a~~k~~S~eDkekle--~Ql~dma~~VeRLE~SRqKLL~EID~QS  236 (388)
                      .| ...+-.||+..+.|.+|-+.+             .+.||.|.|=||+  .-|+|-.+.+|=+++       -|.++-
T Consensus       583 ~e~~~~~e~L~~aL~amqdk~~~L-------------E~sLsaEtriKldLfsaLg~akrq~ei~~~-------~~~~~d  642 (697)
T PF09726_consen  583 KESEKDTEVLMSALSAMQDKNQHL-------------ENSLSAETRIKLDLFSALGDAKRQLEIAQG-------QLRKKD  642 (697)
T ss_pred             hhhhhhHHHHHHHHHHHHHHHHHH-------------HHhhhHHHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHH
Confidence            11 223344555544444443332             4557788887776  334444444443333       245566


Q ss_pred             HHHHHH
Q 047021          237 SEIERL  242 (388)
Q Consensus       237 ~EIErL  242 (388)
                      .||+-|
T Consensus       643 ~ei~~l  648 (697)
T PF09726_consen  643 KEIEEL  648 (697)
T ss_pred             HHHHHH
Confidence            777654


No 43 
>PF12128 DUF3584:  Protein of unknown function (DUF3584);  InterPro: IPR021979  This family consist of uncharacterised bacterial proteins. 
Probab=87.33  E-value=64  Score=37.42  Aligned_cols=218  Identities=22%  Similarity=0.333  Sum_probs=110.6

Q ss_pred             hhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHh-hhhHHHHHHHHHHH
Q 047021            5 QLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVA-REDLSAQLRTLKKR   83 (388)
Q Consensus         5 ~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~va-reDl~AQLR~~kkR   83 (388)
                      ++..+.++..++|.......+++.....+.......+.-.+ +.+..++...+....+...+.-. ++.+-++|..+...
T Consensus       615 ~l~~~~~~~~~~e~~l~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~l~~l~~~  693 (1201)
T PF12128_consen  615 QLQSAEERQEELEKQLKQINKKIEELKREITQAEQELKQAE-QDLQRLKNEREQLKQEIEEAKEERKEQIEEQLNELEEE  693 (1201)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            45566666677777777777777776666666666655443 34455555555555555544444 23333333333333


Q ss_pred             hHHHHHHHhhhHh-------------------hHHHHHHHHHHHHHHHhcC--------------CCCCCCCCCCChhhH
Q 047021           84 LKEAEEEQYRAEE-------------------DAAALRAELNSLQQQAMDG--------------SLGGISSMGISPDHV  130 (388)
Q Consensus        84 lkEAEEEQyRAEE-------------------DaAaLRAEln~lQqq~~~~--------------~~s~~~~~g~s~~~~  130 (388)
                      ++....+.-...+                   -.+.+.+.+..+.++....              -...+.+-||.+..+
T Consensus       694 l~~~~~e~~~~~~~~~~~~~e~~~e~~~~~~~~~~~~d~~i~~i~~~i~~~~~~~~~~~~~le~~~~~eL~~~GvD~~~I  773 (1201)
T PF12128_consen  694 LKQLKQELEELLEELKEQLKELRNELKAQWQELEAELDEQIEQIKQEIAAAKQEAKEQLKELEQQYNQELAGKGVDPERI  773 (1201)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCHHHH
Confidence            3322222211111                   1122223333333322211              112344789999999


Q ss_pred             HHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhhhhhhhhhhhcchhhHHHH
Q 047021          131 QTLEKELATLKSQLQQESQLRQ---QERKQLTEEQARISVLMSEKQELEEKLVMFSRKASEEVSNAAASKAFSVEDKEKL  207 (388)
Q Consensus       131 ~~lekEl~~l~~~lqqes~lrq---qeqqrla~Eq~~~a~l~aEkqeLeekl~~~~~~~se~~~~~a~~k~~S~eDkekl  207 (388)
                      ..+++++..+..+|+.-..-+.   .=+.=+...-.....+..++..++.++..+......--..+...+.==...+..+
T Consensus       774 ~~l~~~i~~L~~~l~~ie~~r~~V~eY~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~l  853 (1201)
T PF12128_consen  774 QQLKQEIEQLEKELKRIEERRAEVIEYEDWLQEEWDKVDELREEKPELEEQLRDLEQELQELEQELNQLQKEVKQRRKEL  853 (1201)
T ss_pred             HHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            9999999999999988544433   2222223333334455556666666666665544443333321111113444555


Q ss_pred             HHHHhHHHHHHHHHHh
Q 047021          208 EKQLHDMAVVVERLES  223 (388)
Q Consensus       208 e~Ql~dma~~VeRLE~  223 (388)
                      ++.++.+.....+++.
T Consensus       854 e~~~~~~~~~~~~~~~  869 (1201)
T PF12128_consen  854 EEELKALEEQLEQLEE  869 (1201)
T ss_pred             HHHHHHHHHHHHHHHH
Confidence            5555555544444443


No 44 
>PF08317 Spc7:  Spc7 kinetochore protein;  InterPro: IPR013253 This entry consists of cell division proteins which are required for kinetochore-spindle association [].
Probab=86.55  E-value=27  Score=34.50  Aligned_cols=53  Identities=19%  Similarity=0.308  Sum_probs=31.7

Q ss_pred             HHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhh
Q 047021            7 KDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVA   60 (388)
Q Consensus         7 ~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~   60 (388)
                      ..|+.....|..+...|...++....-...+..+.+.+.... ..|+....++.
T Consensus       152 ~~L~~~~~~L~~D~~~L~~~~~~l~~~~~~l~~~~~~L~~e~-~~Lk~~~~e~~  204 (325)
T PF08317_consen  152 EGLEENLELLQEDYAKLDKQLEQLDELLPKLRERKAELEEEL-ENLKQLVEEIE  204 (325)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHhhhh
Confidence            345566666777777777777666666666666666655433 25555554433


No 45 
>PRK09039 hypothetical protein; Validated
Probab=86.34  E-value=39  Score=33.99  Aligned_cols=25  Identities=20%  Similarity=0.311  Sum_probs=13.5

Q ss_pred             HHHHHHHhHHH-HHHHHHHhhHHHHH
Q 047021          205 EKLEKQLHDMA-VVVERLESSRQKLL  229 (388)
Q Consensus       205 ekle~Ql~dma-~~VeRLE~SRqKLL  229 (388)
                      +.|...|...- .-|..|+.-|..+.
T Consensus       175 ~~L~~~L~~a~~~~~~~l~~~~~~~~  200 (343)
T PRK09039        175 ADLGRRLNVALAQRVQELNRYRSEFF  200 (343)
T ss_pred             HHHHHHHHHHHHHHHHHHHHhHHHHH
Confidence            44444444432 33666777776663


No 46 
>KOG4807 consensus F-actin binding protein, regulates actin cytoskeletal organization [Cytoskeleton]
Probab=86.27  E-value=52  Score=35.32  Aligned_cols=59  Identities=24%  Similarity=0.343  Sum_probs=35.3

Q ss_pred             hhHHHHHHHhhhhhHHHHHHhh-------hhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 047021          223 SSRQKLLMEIDNQSSEIERLFE-------ENSSLSSSYQESMGIAKHWENQVKDCLKQNEELRGIL  281 (388)
Q Consensus       223 ~SRqKLL~EID~QS~EIErLf~-------ENs~Lsss~qea~~iaaqWEnQvqDclkQNeeLr~lL  281 (388)
                      +-|..-|.|+.+=--|+|=|++       ||..|...+-+-+-.-.+--+--|+-+.-|-+|-+-|
T Consensus       414 aLRrQyleelqsvqRELeVLSEQYSQKCLEnahLaqalEaerqaLRqCQrEnQELnaHNQELnnRL  479 (593)
T KOG4807|consen  414 ALRRQYLEELQSVQRELEVLSEQYSQKCLENAHLAQALEAERQALRQCQRENQELNAHNQELNNRL  479 (593)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHhhHH
Confidence            3344455666555566666654       7888877776655555444444456666666666644


No 47 
>PF10174 Cast:  RIM-binding protein of the cytomatrix active zone;  InterPro: IPR019323  This entry represents a family of proteins that form part of the CAZ (cytomatrix at the active zone) complex which is involved in determining the site of synaptic vesicle fusion []. Located at the C terminus is a PDZ-binding motif that binds directly to RIM (a small G protein Rab-3A effector). These proteins also contain four coiled-coil domains []. 
Probab=86.09  E-value=66  Score=36.34  Aligned_cols=103  Identities=16%  Similarity=0.295  Sum_probs=61.9

Q ss_pred             hhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhh------------
Q 047021            5 QLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVARED------------   72 (388)
Q Consensus         5 ~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareD------------   72 (388)
                      .+..|++.+..|+.++.+|.+.+.+....++-+.+.|-.-...+..+=-|      |++=..-++|-|            
T Consensus       239 ki~~lEr~l~~le~Ei~~L~~~~~~~~~~r~~~~k~le~~~s~~~~mK~k------~d~~~~eL~rk~~E~~~~qt~l~~  312 (775)
T PF10174_consen  239 KIASLERMLRDLEDEIYRLRSRGELSEADRDRLDKQLEVYKSHSLAMKSK------MDRLKLELSRKKSELEALQTRLET  312 (775)
T ss_pred             HHHHHHHHHHHHHHHHHHHHhcccccccchHHHHHHHHHHHhhHHHHHHH------HHHHHHHHHHHHHHHHHHHHHHHH
Confidence            46678888888888888888888887777777777776555444432212      333333344333            


Q ss_pred             HHHHHHHHHHHhHHHHHHHhhhHhhHHHHHHHHHHHHHHHh
Q 047021           73 LSAQLRTLKKRLKEAEEEQYRAEEDAAALRAELNSLQQQAM  113 (388)
Q Consensus        73 l~AQLR~~kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq~~  113 (388)
                      +.-+.--++.++--..+.=-+++..+.-|.+++..|.....
T Consensus       313 ~~~~~~d~r~hi~~lkesl~~ke~~~~~Lqsdve~Lr~rle  353 (775)
T PF10174_consen  313 LEEQDSDMRQHIEVLKESLRAKEQEAEMLQSDVEALRFRLE  353 (775)
T ss_pred             HHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHH
Confidence            33333334555554555555666666666666666665553


No 48 
>PF07106 TBPIP:  Tat binding protein 1(TBP-1)-interacting protein (TBPIP);  InterPro: IPR010776 This family consists of several eukaryotic TBP-1 interacting protein (TBPIP) sequences. TBP-1 has been demonstrated to interact with the human immunodeficiency virus type 1 (HIV-1) viral protein Tat, then modulate the essential replication process of HIV. In addition, TBP-1 has been shown to be a component of the 26S proteasome, a basic multiprotein complex that degrades ubiquitinated proteins in an ATP-dependent fashion. Human TBPIP interacts with human TBP-1 then modulates the inhibitory action of human TBP-1 on HIV-Tat-mediated transactivation [].
Probab=83.07  E-value=21  Score=31.71  Aligned_cols=80  Identities=31%  Similarity=0.409  Sum_probs=52.7

Q ss_pred             CChhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---------HHHHHHHHHHHHHHHHHHHHHHHhhhchhhhhhhhh
Q 047021          125 ISPDHVQTLEKELATLKSQLQQESQLRQQERKQLT---------EEQARISVLMSEKQELEEKLVMFSRKASEEVSNAAA  195 (388)
Q Consensus       125 ~s~~~~~~lekEl~~l~~~lqqes~lrqqeqqrla---------~Eq~~~a~l~aEkqeLeekl~~~~~~~se~~~~~a~  195 (388)
                      ++++++..|..++.++..++..-..-.......|+         +=...+..|..+..+|+.+|..+.....        
T Consensus        69 ~s~eel~~ld~ei~~L~~el~~l~~~~k~l~~eL~~L~~~~t~~el~~~i~~l~~e~~~l~~kL~~l~~~~~--------  140 (169)
T PF07106_consen   69 PSPEELAELDAEIKELREELAELKKEVKSLEAELASLSSEPTNEELREEIEELEEEIEELEEKLEKLRSGSK--------  140 (169)
T ss_pred             CCchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhCCC--------
Confidence            57888999999999998888774333333333333         2345567788888888888877764222        


Q ss_pred             hhhcchhhHHHHHHHHhHH
Q 047021          196 SKAFSVEDKEKLEKQLHDM  214 (388)
Q Consensus       196 ~k~~S~eDkekle~Ql~dm  214 (388)
                        ..|.+|+.++.+..+.+
T Consensus       141 --~vs~ee~~~~~~~~~~~  157 (169)
T PF07106_consen  141 --PVSPEEKEKLEKEYKKW  157 (169)
T ss_pred             --CCCHHHHHHHHHHHHHH
Confidence              26678888887765443


No 49 
>PF08317 Spc7:  Spc7 kinetochore protein;  InterPro: IPR013253 This entry consists of cell division proteins which are required for kinetochore-spindle association [].
Probab=80.18  E-value=65  Score=31.88  Aligned_cols=107  Identities=20%  Similarity=0.312  Sum_probs=74.2

Q ss_pred             hhhHhhHHHHHHHHHHHHHHHhcCCCCCCCCCCCChhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 047021           92 YRAEEDAAALRAELNSLQQQAMDGSLGGISSMGISPDHVQTLEKELATLKSQLQQESQLRQQERKQLTEEQARISVLMSE  171 (388)
Q Consensus        92 yRAEEDaAaLRAEln~lQqq~~~~~~s~~~~~g~s~~~~~~lekEl~~l~~~lqqes~lrqqeqqrla~Eq~~~a~l~aE  171 (388)
                      -+..+--++|..++..|++....       +-...++.+..+..+|..++..+..-.....+.+.++..=...+..+..+
T Consensus       180 ~~l~~~~~~L~~e~~~Lk~~~~e-------~~~~D~~eL~~lr~eL~~~~~~i~~~k~~l~el~~el~~l~~~i~~~~~~  252 (325)
T PF08317_consen  180 PKLRERKAELEEELENLKQLVEE-------IESCDQEELEALRQELAEQKEEIEAKKKELAELQEELEELEEKIEELEEQ  252 (325)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHhh-------hhhcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            34445567889999999887643       22347788999999999999888876666666666666666667777788


Q ss_pred             HHHHHHHHHHHhhhchhhhhhhhhhhhcchhhHHHHHHHHh
Q 047021          172 KQELEEKLVMFSRKASEEVSNAAASKAFSVEDKEKLEKQLH  212 (388)
Q Consensus       172 kqeLeekl~~~~~~~se~~~~~a~~k~~S~eDkekle~Ql~  212 (388)
                      |+++...|+.+.+.-     +  ..+.+|..|-.+|...++
T Consensus       253 k~~l~~eI~e~~~~~-----~--~~r~~t~~Ev~~Lk~~~~  286 (325)
T PF08317_consen  253 KQELLAEIAEAEKIR-----E--ECRGWTRSEVKRLKAKVD  286 (325)
T ss_pred             HHHHHHHHHHHHHHH-----H--HhcCCCHHHHHHHHHHHH
Confidence            888877777544321     1  245567777777766643


No 50 
>TIGR03017 EpsF chain length determinant protein EpsF. Sequences in this family of proteins are members of the chain length determinant family (pfam02706) which includes the wzc protein from E.coli. This family of proteins are homologous to the EpsF protein of the methanolan biosynthesis operon of Methylobacillus species strain 12S. The distribution of this protein appears to be restricted to a subset of exopolysaccharide operons containing a syntenic grouping of genes including a variant of the EpsH exosortase protein. Exosortase has been proposed to be involved in the targetting and processing of proteins containing the PEP-CTERM domain to the exopolysaccharide layer.
Probab=79.21  E-value=72  Score=31.85  Aligned_cols=45  Identities=22%  Similarity=0.355  Sum_probs=28.5

Q ss_pred             HHHHHHHHHHHHHHhcCCCCCCCCCCCChhhHHHHHHHHHHHHHHHHHH
Q 047021           99 AALRAELNSLQQQAMDGSLGGISSMGISPDHVQTLEKELATLKSQLQQE  147 (388)
Q Consensus        99 AaLRAEln~lQqq~~~~~~s~~~~~g~s~~~~~~lekEl~~l~~~lqqe  147 (388)
                      ..|+.+|..++.+...-.    ..+|+.+-.+..+..++++++.++..+
T Consensus       257 ~~l~~~l~~le~~l~~l~----~~y~~~hP~v~~l~~~i~~l~~~l~~e  301 (444)
T TIGR03017       257 QNLKTDIARAESKLAELS----QRLGPNHPQYKRAQAEINSLKSQLNAE  301 (444)
T ss_pred             HHHHHHHHHHHHHHHHHH----HHhCCCCcHHHHHHHHHHHHHHHHHHH
Confidence            457777777776663321    134556666777888888777777554


No 51 
>PRK03918 chromosome segregation protein; Provisional
Probab=79.07  E-value=1e+02  Score=33.51  Aligned_cols=28  Identities=25%  Similarity=0.395  Sum_probs=18.1

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q 047021          157 QLTEEQARISVLMSEKQELEEKLVMFSR  184 (388)
Q Consensus       157 rla~Eq~~~a~l~aEkqeLeekl~~~~~  184 (388)
                      ++..-..+...+..+..+|++.+..+..
T Consensus       406 ~i~~l~~~~~~~~~~i~eL~~~l~~L~~  433 (880)
T PRK03918        406 EISKITARIGELKKEIKELKKAIEELKK  433 (880)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence            3444455556777777777777776653


No 52 
>TIGR03752 conj_TIGR03752 integrating conjugative element protein, PFL_4705 family. Members of this protein family are found occasionally on plasmids such as the Pseudomonas putida toluene catabolic TOL plasmid pWWO_p085. Usually, however, they are found on the bacterial main chromosome in regions flanked by markers of conjugative transfer and/or transposition.
Probab=78.29  E-value=11  Score=40.28  Aligned_cols=76  Identities=25%  Similarity=0.417  Sum_probs=49.3

Q ss_pred             hhhhhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHH
Q 047021            2 LLSQLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLK   81 (388)
Q Consensus         2 l~~~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~k   81 (388)
                      |+.+++++.++.+.|..+|+.|..       +|+.|.+|.++.+++.    ..||...          |           
T Consensus        64 lva~~k~~r~~~~~l~~~N~~l~~-------eN~~L~~r~~~id~~i----~~av~~~----------~-----------  111 (472)
T TIGR03752        64 LVAEVKELRKRLAKLISENEALKA-------ENERLQKREQSIDQQI----QQAVQSE----------T-----------  111 (472)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHhhhhHHHHH----HHHHHhh----------h-----------
Confidence            578899999999999999988875       4556677766666553    3333211          1           


Q ss_pred             HHhHHHHHHHhhhHhhHHHHHHHHHHHHHHH
Q 047021           82 KRLKEAEEEQYRAEEDAAALRAELNSLQQQA  112 (388)
Q Consensus        82 kRlkEAEEEQyRAEEDaAaLRAEln~lQqq~  112 (388)
                         ++...++....++-+.+...|..||.|.
T Consensus       112 ---~~~~~~~~ql~~~~~~~~~~l~~l~~~l  139 (472)
T TIGR03752       112 ---QELTKEIEQLKSERQQLQGLIDQLQRRL  139 (472)
T ss_pred             ---HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence               2222333333444556788888898887


No 53 
>PF09726 Macoilin:  Transmembrane protein;  InterPro: IPR019130  This entry represents the multi-pass transmembrane protein Macoilin, which is highly conserved in eukaryotes. ; GO: 0016021 integral to membrane
Probab=77.58  E-value=1.2e+02  Score=33.69  Aligned_cols=46  Identities=22%  Similarity=0.314  Sum_probs=35.5

Q ss_pred             hhhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhh
Q 047021            4 SQLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTV   49 (388)
Q Consensus         4 ~~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v   49 (388)
                      +.|..|++.|.+|+..+..|....+.--.-..+|+++|.+..+...
T Consensus       460 ~eL~qlr~ene~Lq~Kl~~L~~aRq~DKq~l~~LEkrL~eE~~~R~  505 (697)
T PF09726_consen  460 SELSQLRQENEQLQNKLQNLVQARQQDKQSLQQLEKRLAEERRQRA  505 (697)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            4577888888888888888887777777777888888887665543


No 54 
>KOG4674 consensus Uncharacterized conserved coiled-coil protein [Function unknown]
Probab=77.37  E-value=2e+02  Score=35.87  Aligned_cols=155  Identities=21%  Similarity=0.244  Sum_probs=95.6

Q ss_pred             hhHHHHHHHHhHHHHHHHHHHhhHHHHHHHhh-------hhhHHHHHHhhhhhhhhhhHHHHHHHH-------HHHHHHH
Q 047021          202 EDKEKLEKQLHDMAVVVERLESSRQKLLMEID-------NQSSEIERLFEENSSLSSSYQESMGIA-------KHWENQV  267 (388)
Q Consensus       202 eDkekle~Ql~dma~~VeRLE~SRqKLL~EID-------~QS~EIErLf~ENs~Lsss~qea~~ia-------aqWEnQv  267 (388)
                      .++.++.+|+.....-++|+...=.-...|++       .|.++++|+-+.++.++..|+....=-       ..-..|.
T Consensus       391 s~~~~~qqqle~~~lele~~~~~l~s~~eev~~~~p~lk~qr~~~e~~~~~~~~l~~el~~~~q~~~~~e~~~~~l~~~~  470 (1822)
T KOG4674|consen  391 SKYSKLQQQLESLKLELERLQNILSSFKEEVKQKAPILKEQRSELERMQETKAELSEELDFSNQKIQKLEKELESLKKQL  470 (1822)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            67889999999999999999988777777765       578899999999999999888776433       3344566


Q ss_pred             HHHHHHHHHHHHHHHHHHhhhhccCCCCccccccccCCCCCC-ccccCCCchhhh-hHHhhHHHHHHHhhHHHHhHHHHH
Q 047021          268 KDCLKQNEELRGILEKLRTEQASFMSVNDKDILRDFSEPSKD-AVSQNGSETHKN-EFLALKGQLAKEQSRAEALSAQVL  345 (388)
Q Consensus       268 qDclkQNeeLr~lLeklR~EQa~~~~~~~~~~~g~~~e~~~d-g~~~~~~~~~~t-e~l~LK~eL~kEqsRseaLsaqv~  345 (388)
                      .||--||-.|+--.-+|-.+=..++..-.....|.-...+.+ -..+++++.+.+ ++..+++-.--...=.+=|+ .|+
T Consensus       471 ~~~~renk~l~~~~sdlsrqv~~Ll~el~e~~~~~~~~~~s~~~~~es~S~~iIse~Lv~F~nI~eLqekN~eLL~-~vR  549 (1822)
T KOG4674|consen  471 NDLERENKLLEQQISDLSRQVNVLLLELDELRKGSKITVSSDSTENESDSEEIISERLVEFSNINELQEKNVELLN-AVR  549 (1822)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccCccccccCccHHHHHHHHHHHhccHHHHHHHHHHHHH-HHH
Confidence            777777777765554443322222211111111110011111 122334555555 56666665443333444444 778


Q ss_pred             HHHHHHHHHHHH
Q 047021          346 QLSAQLQQTTQA  357 (388)
Q Consensus       346 qLsAeL~ra~QA  357 (388)
                      -|+..+....+-
T Consensus       550 ~Lae~lE~~E~~  561 (1822)
T KOG4674|consen  550 ELAEKLEAAEKT  561 (1822)
T ss_pred             HHHHHHHHHhhh
Confidence            888777777665


No 55 
>TIGR01005 eps_transp_fam exopolysaccharide transport protein family. The model describes the exopolysaccharide transport protein family in bacteria. The transport protein is part of a large genetic locus which is associated with exopolysaccharide (EPS) biosynthesis. Detailed molecular characterization and gene fusion analysis revealed atleast seven gene products are involved in the overall regulation, which among other things, include exopolysaccharide biosynthesis, property of conferring virulence and exopolysaccharide export.
Probab=76.47  E-value=37  Score=36.65  Aligned_cols=108  Identities=21%  Similarity=0.272  Sum_probs=80.9

Q ss_pred             hhhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHH
Q 047021            4 SQLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKR   83 (388)
Q Consensus         4 ~~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkR   83 (388)
                      ..+++|..+..+++.+...++.+.-.+|-....+..++++++++.-..+.+.+..+.-+-+++..-...|.+++..+++|
T Consensus       288 ~~i~~L~~~l~~l~~~~~~l~~~y~~~hP~v~~l~~qi~~l~~~i~~e~~~~~~~~~~~~~~a~~~~~~L~~~l~~~~~~  367 (754)
T TIGR01005       288 DLIQRLRERQAELRATIADLSTTMLANHPRVVAAKSSLADLDAQIRSELQKITKSLLMQADAAQARESQLVSDVNQLKAA  367 (754)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHhhCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            45788888899999999999999999999999999999999998877777777766666666665566888888888887


Q ss_pred             hHHH---HHHHhhhHhhHHHHHHHHHHHHHH
Q 047021           84 LKEA---EEEQYRAEEDAAALRAELNSLQQQ  111 (388)
Q Consensus        84 lkEA---EEEQyRAEEDaAaLRAEln~lQqq  111 (388)
                      +...   +-+-.+-+-|+.+.+.-.+.+.++
T Consensus       368 ~~~~~~~~~e~~~L~Re~~~~~~~Y~~ll~r  398 (754)
T TIGR01005       368 SAQAGEQQVDLDALQRDAAAKRQLYESYLTN  398 (754)
T ss_pred             HHhCcHhHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            7543   444444556666666665555443


No 56 
>PF14992 TMCO5:  TMCO5 family
Probab=76.36  E-value=91  Score=31.49  Aligned_cols=158  Identities=26%  Similarity=0.321  Sum_probs=95.4

Q ss_pred             HHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhHHH
Q 047021            8 DFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRLKEA   87 (388)
Q Consensus         8 ~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRlkEA   87 (388)
                      +|+++.+.|.+.|+.+..||++++-.+.+|.+++--.+...-.+  .=.+.|++++..|                 +++-
T Consensus         8 dle~d~Q~ldE~Nq~lL~ki~~~E~~iq~Le~Eit~~~~~~~~~--e~e~~~~~~~e~~-----------------l~~l   68 (280)
T PF14992_consen    8 DLEKDEQRLDEANQSLLQKIQEKEGAIQSLEREITKMDHIADRS--EEEDIISEERETD-----------------LQEL   68 (280)
T ss_pred             chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHccccCch--hHHhhhhhchHHH-----------------HHHH
Confidence            57888999999999999999999999999999998666555555  3346666666544                 2344


Q ss_pred             HHHHhhhHhhHHHHHHHHHHHHHHHhcCCCC---CCCCCCCChh-----------hHHHHHHHHHHHHHHHHHHHHHHHH
Q 047021           88 EEEQYRAEEDAAALRAELNSLQQQAMDGSLG---GISSMGISPD-----------HVQTLEKELATLKSQLQQESQLRQQ  153 (388)
Q Consensus        88 EEEQyRAEEDaAaLRAEln~lQqq~~~~~~s---~~~~~g~s~~-----------~~~~lekEl~~l~~~lqqes~lrqq  153 (388)
                      |.+.-+-|.-+..+---+..||........-   .-+++..+++           ....++++++.+....++-..+--.
T Consensus        69 e~e~~~LE~~ne~l~~~~~elq~k~~e~~~~~~~e~~~~~~~lq~sk~~lqql~~~~~~qE~ei~kve~d~~~v~~l~eD  148 (280)
T PF14992_consen   69 ELETAKLEKENEHLSKSVQELQRKQDEQETNVQCEDPQLSQSLQFSKNKLQQLLESCASQEKEIAKVEDDYQQVHQLCED  148 (280)
T ss_pred             HhhhHHHhhhhHhhhhhhhhhhhhhccccCCCCCCccchhcccHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            5555666666666766667777775432211   1112222222           2445777777777666554333211


Q ss_pred             ---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh
Q 047021          154 ---ERKQLTEEQARISVLMSEKQELEEKLVMFSRK  185 (388)
Q Consensus       154 ---eqqrla~Eq~~~a~l~aEkqeLeekl~~~~~~  185 (388)
                         +-..+-+-..++-. .-|+--|+.++.+.-..
T Consensus       149 q~~~i~klkE~L~rmE~-ekE~~lLe~el~k~q~~  182 (280)
T PF14992_consen  149 QANEIKKLKEKLRRMEE-EKEMLLLEKELSKYQMQ  182 (280)
T ss_pred             HHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhch
Confidence               22233333333333 44555667666665443


No 57 
>PF11932 DUF3450:  Protein of unknown function (DUF3450);  InterPro: IPR016866 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function. However, they are found in an operon along with components of a TonB transport system (typified by Vibrio cholerae TonB2 [], and are predicted to be localized to the periplasmic space. Caution: the low-complexity nature of these sequences produces spurious BLAST hits to chromosome segregation ATPases (which are much longer in length and contain canonical Walker motifs). Accordingly, some members are misidentified as such.
Probab=75.58  E-value=14  Score=34.91  Aligned_cols=54  Identities=22%  Similarity=0.398  Sum_probs=25.1

Q ss_pred             hhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhH---hhhhhhhhHHHHhhh
Q 047021            5 QLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINE---LEQNTVPSLRKALKD   58 (388)
Q Consensus         5 ~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd---~e~n~v~slrkaL~~   58 (388)
                      .+..|.+....|+..|+.+.+.+...+.++.+|+.++++   ..+..+|.+.+.++.
T Consensus        57 e~~~l~~e~e~L~~~~~~l~~~v~~q~~el~~L~~qi~~~~~~~~~l~p~m~~m~~~  113 (251)
T PF11932_consen   57 EYRQLEREIENLEVYNEQLERQVASQEQELASLEQQIEQIEETRQELVPLMEQMIDE  113 (251)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            344444444444444444444444444555555444442   223444555444443


No 58 
>KOG1962 consensus B-cell receptor-associated protein and related proteins [Defense mechanisms]
Probab=75.47  E-value=22  Score=34.45  Aligned_cols=56  Identities=27%  Similarity=0.296  Sum_probs=50.3

Q ss_pred             hcchhhHHHHHHHHhHHHHHHHHHHhhHHHHHHHhhhhhHHHHHHhhhhhhhhhhH
Q 047021          198 AFSVEDKEKLEKQLHDMAVVVERLESSRQKLLMEIDNQSSEIERLFEENSSLSSSY  253 (388)
Q Consensus       198 ~~S~eDkekle~Ql~dma~~VeRLE~SRqKLL~EID~QS~EIErLf~ENs~Lsss~  253 (388)
                      .--.+|.++|+..+..-..-.+..++...-|..=+|.=.-|++||-+||+.|-+.+
T Consensus       154 ~~~~~~~~kL~~el~~~~~~Le~~~~~~~al~Kq~e~~~~EydrLlee~~~Lq~~i  209 (216)
T KOG1962|consen  154 DKLKADLEKLETELEKKQKKLEKAQKKVDALKKQSEGLQDEYDRLLEEYSKLQEQI  209 (216)
T ss_pred             HHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcccHHHHHHHHHHHHHHHH
Confidence            33468999999999999999999999999999999999999999999999987654


No 59 
>PF15397 DUF4618:  Domain of unknown function (DUF4618)
Probab=75.09  E-value=93  Score=30.99  Aligned_cols=194  Identities=25%  Similarity=0.252  Sum_probs=100.7

Q ss_pred             hHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhHHHHHHHhhhHhhHHHHHHHHHHHHHHHh
Q 047021           34 SESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRLKEAEEEQYRAEEDAAALRAELNSLQQQAM  113 (388)
Q Consensus        34 ~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq~~  113 (388)
                      |.-|.+.+.|.|..|....|+-|..=-+=+-+.-+-.---..+|-.++..|++.++   +.+.+.+.|..++..|.+..-
T Consensus        22 N~~L~~~IqdtE~st~~~Vr~lLqqy~~~~~~i~~le~~~~~~l~~ak~eLqe~ee---k~e~~l~~Lq~ql~~l~akI~   98 (258)
T PF15397_consen   22 NKELIKEIQDTEDSTALKVRKLLQQYDIYRTAIDILEYSNHKQLQQAKAELQEWEE---KEESKLSKLQQQLEQLDAKIQ   98 (258)
T ss_pred             hHHHHHHHHhHHhhHHHHHHHHHHHHHHHHHHHHHHHccChHHHHHHHHHHHHHHH---HHHhHHHHHHHHHHHHHHHHH
Confidence            44455555666777777777766443333332222222223456677777777776   466677777777777666553


Q ss_pred             cC-----CCCCCCCCCCChhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHhhhch
Q 047021          114 DG-----SLGGISSMGISPDHVQTLEKELATLKSQLQQESQLRQQERKQLTEEQARI-SVLMSEKQELEEKLVMFSRKAS  187 (388)
Q Consensus       114 ~~-----~~s~~~~~g~s~~~~~~lekEl~~l~~~lqqes~lrqqeqqrla~Eq~~~-a~l~aEkqeLeekl~~~~~~~s  187 (388)
                      -.     ..++|-   .  .+...---.|++|.-++++-.-.-+.|..-|-+--... .++....+.-.+          
T Consensus        99 k~~~el~~L~TYk---D--~EYPvK~vqIa~L~rqlq~lk~~qqdEldel~e~~~~el~~l~~~~q~k~~----------  163 (258)
T PF15397_consen   99 KTQEELNFLSTYK---D--HEYPVKAVQIANLVRQLQQLKDSQQDELDELNEMRQMELASLSRKIQEKKE----------  163 (258)
T ss_pred             HHHHHHHHHHHHh---h--hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----------
Confidence            22     233331   1  11111122456666666554443333333232211111 122222222222          


Q ss_pred             hhhhhhhhhhhcchhhHHHHHHHHhHHHHHHHHHHhhHHHHHHHhhhhhHHHHHHhhhhhhhhhhHHHHHH
Q 047021          188 EEVSNAAASKAFSVEDKEKLEKQLHDMAVVVERLESSRQKLLMEIDNQSSEIERLFEENSSLSSSYQESMG  258 (388)
Q Consensus       188 e~~~~~a~~k~~S~eDkekle~Ql~dma~~VeRLE~SRqKLL~EID~QS~EIErLf~ENs~Lsss~qea~~  258 (388)
                       ..-+.++.++++.=.+.-+.+-            ..-+.++.|||.+..+|+.|-++.+-|-..+++...
T Consensus       164 -~il~~~~~k~~~~~~~~l~~~~------------~~N~~m~kei~~~re~i~el~e~I~~L~~eV~~L~~  221 (258)
T PF15397_consen  164 -EILSSAAEKTQSPMQPALLQRT------------LENQVMQKEIVQFREEIDELEEEIPQLRAEVEQLQA  221 (258)
T ss_pred             -HHHHHHHHHHHhhchHHHHHHH------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence             2223334455543222222221            234678899999999999999999999998887653


No 60 
>PRK10929 putative mechanosensitive channel protein; Provisional
Probab=74.86  E-value=1.8e+02  Score=34.27  Aligned_cols=105  Identities=19%  Similarity=0.133  Sum_probs=60.4

Q ss_pred             hhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCCccccccccCCCCCCccccCCCchh---hh
Q 047021          245 ENSSLSSSYQESMGIAKHWENQVKDCLKQNEELRGILEKLRTEQASFMSVNDKDILRDFSEPSKDAVSQNGSETH---KN  321 (388)
Q Consensus       245 ENs~Lsss~qea~~iaaqWEnQvqDclkQNeeLr~lLeklR~EQa~~~~~~~~~~~g~~~e~~~dg~~~~~~~~~---~t  321 (388)
                      .|+++.+=++-=+......-.+.+   .+...|+..++..|.+++-..-.......        + ...+.+|..   ..
T Consensus       198 s~~~~~~L~~~q~dl~~~~~~~l~---~~~~~Lq~~in~kR~~~se~~~~~~~~~~--------~-~~~~~~~~i~~~~~  265 (1109)
T PRK10929        198 SANNRQELARLRSELAKKRSQQLD---AYLQALRNQLNSQRQREAERALESTELLA--------E-QSGDLPKSIVAQFK  265 (1109)
T ss_pred             ccHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHhH--------H-hhccCChHHHHHHH
Confidence            455555555555555444333332   34456667777766555433211100000        0 000112222   23


Q ss_pred             hHHhhHHHHHHHhhHHHHhHHHHHHHHHHHHHHHHHhhhh
Q 047021          322 EFLALKGQLAKEQSRAEALSAQVLQLSAQLQQTTQAYNGL  361 (388)
Q Consensus       322 e~l~LK~eL~kEqsRseaLsaqv~qLsAeL~ra~QA~n~L  361 (388)
                      .|..|-.+|.+---|...|+.+-.+.-..+.+.+|++.++
T Consensus       266 ~N~~Ls~~L~~~t~~~n~l~~~~~~~~~~l~~~~q~~~~i  305 (1109)
T PRK10929        266 INRELSQALNQQAQRMDLIASQQRQAASQTLQVRQALNTL  305 (1109)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            4888999999998999999999999999999999987743


No 61 
>PF12718 Tropomyosin_1:  Tropomyosin like;  InterPro: IPR000533 Tropomyosins [], are a family of closely related proteins present in muscle and non-muscle cells. In striated muscle, tropomyosin mediate the interactions between the troponin complex and actin so as to regulate muscle contraction []. The role of tropomyosin in smooth muscle and non-muscle tissues is not clear. Tropomyosin is an alpha-helical protein that forms a coiled-coil structure of 2 parallel helices containing 2 sets of 7 alternating actin binding sites []. There are multiple cell-specific isoforms, created by differential splicing of the messenger RNA from one gene, but the proportions of the isoforms vary between different cell types. Muscle isoforms of tropomyosin are characterised by having 284 amino acid residues and a highly conserved N-terminal region, whereas non-muscle forms are generally smaller and are heterogeneous in their N-terminal region. This entry represents tropomyosin (Tmp) 1, 2 and 3. Within the yeast Tmp1 and Tmp2, biochemical and sequence analyses indicate that Tpm2 spans four actin monomers along a filament, whereas Tpm1 spans five. Despite its shorter length, Tpm2 can compete with Tpm1 for binding to F-actin. Over-expression of Tpm2 in vivo alters the axial budding of haploids to a bipolar pattern, and this can be partially suppressed by co-over-expression of Tpm1. This suggests distinct functions for the two tropomyosins, and indicates that the ratio between them is important for correct morphogenesis [].
Probab=72.96  E-value=70  Score=28.59  Aligned_cols=86  Identities=22%  Similarity=0.307  Sum_probs=56.6

Q ss_pred             hhhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHH
Q 047021            4 SQLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKR   83 (388)
Q Consensus         4 ~~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkR   83 (388)
                      .+++.|+.++..+|.+..-|+.|++..+.+++.+..+|.+...+.--+-..                   ..+.-.+.+|
T Consensus        21 ~~~K~le~~~~~~E~EI~sL~~K~~~lE~eld~~~~~l~~~k~~lee~~~~-------------------~~~~E~l~rr   81 (143)
T PF12718_consen   21 AKVKQLEQENEQKEQEITSLQKKNQQLEEELDKLEEQLKEAKEKLEESEKR-------------------KSNAEQLNRR   81 (143)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHH-------------------HHhHHHHHhh
Confidence            567888888888888888888888888888888888887665544322211                   1112266677


Q ss_pred             hHHHHHHHhhhHhhHHHHHHHHHHH
Q 047021           84 LKEAEEEQYRAEEDAAALRAELNSL  108 (388)
Q Consensus        84 lkEAEEEQyRAEEDaAaLRAEln~l  108 (388)
                      ++--|++=-+++......-..|...
T Consensus        82 iq~LEeele~ae~~L~e~~ekl~e~  106 (143)
T PF12718_consen   82 IQLLEEELEEAEKKLKETTEKLREA  106 (143)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            7777777666666655555444443


No 62 
>smart00338 BRLZ basic region leucin zipper.
Probab=72.79  E-value=8.8  Score=29.26  Aligned_cols=38  Identities=21%  Similarity=0.427  Sum_probs=33.4

Q ss_pred             hhhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHh
Q 047021            4 SQLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRI   41 (388)
Q Consensus         4 ~~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~L   41 (388)
                      .++.+|+.+...|+.+|..|.+++.....++..|...+
T Consensus        26 ~~~~~Le~~~~~L~~en~~L~~~~~~l~~e~~~lk~~~   63 (65)
T smart00338       26 AEIEELERKVEQLEAENERLKKEIERLRRELEKLKSEL   63 (65)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence            46789999999999999999999999999988887655


No 63 
>KOG0996 consensus Structural maintenance of chromosome protein 4 (chromosome condensation complex Condensin, subunit C) [Chromatin structure and dynamics; Cell cycle control, cell division, chromosome partitioning]
Probab=72.65  E-value=2.2e+02  Score=34.21  Aligned_cols=233  Identities=22%  Similarity=0.296  Sum_probs=0.0

Q ss_pred             hhhhhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHH
Q 047021            2 LLSQLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLK   81 (388)
Q Consensus         2 l~~~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~k   81 (388)
                      +.....+++...-++++....|...+.+.+...+.+.-...++-. .+..|.+.+    -+=-+++..+.+=...|+..+
T Consensus       790 ~~~~~~~~~~~~~~~ee~~~~lr~~~~~l~~~l~~~~~~~k~~~~-~~~~l~~~i----~~~E~~~~k~~~d~~~l~~~~  864 (1293)
T KOG0996|consen  790 MSDKARQHQEQLHELEERVRKLRERIPELENRLEKLTASVKRLAE-LIEYLESQI----AELEAAVLKKVVDKKRLKELE  864 (1293)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHhhHHHHhHHHHHHHHHHHHHH-HHHHHHHHH----HHHHHHhhhccCcHHHHHHHH


Q ss_pred             HHhHHHHHHHhhhHhhHHHHHHHHHHHHHHHhcCCCCCCC----CCCCChhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 047021           82 KRLKEAEEEQYRAEEDAAALRAELNSLQQQAMDGSLGGIS----SMGISPDHVQTLEKELATLKSQLQQESQLRQQERKQ  157 (388)
Q Consensus        82 kRlkEAEEEQyRAEEDaAaLRAEln~lQqq~~~~~~s~~~----~~g~s~~~~~~lekEl~~l~~~lqqes~lrqqeqqr  157 (388)
                      +-+-+.+-|=--+-|.++. .+++..||...|.-..--.-    .+--..+++..++.+|..+.+.++.+-...++-+..
T Consensus       865 ~~ie~l~kE~e~~qe~~~K-k~~i~~lq~~i~~i~~e~~q~qk~kv~~~~~~~~~l~~~i~k~~~~i~~s~~~i~k~q~~  943 (1293)
T KOG0996|consen  865 EQIEELKKEVEELQEKAAK-KARIKELQNKIDEIGGEKVQAQKDKVEKINEQLDKLEADIAKLTVAIKTSDRNIAKAQKK  943 (1293)
T ss_pred             HHHHHHHHHHHHHHHhhhH-HHHHHHHHHHHHHhhchhhHHhHHHHHHHHHHHHHHHHHHHHhHHHHhcCcccHHHHHHH


Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHhhhchh---hhhhhhhhhhcchhhHHHHHHHHhHHHHHHHHHHhhH---HHHHHH
Q 047021          158 LTEEQARISVLMSEKQELEEKLVMFSRKASE---EVSNAAASKAFSVEDKEKLEKQLHDMAVVVERLESSR---QKLLME  231 (388)
Q Consensus       158 la~Eq~~~a~l~aEkqeLeekl~~~~~~~se---~~~~~a~~k~~S~eDkekle~Ql~dma~~VeRLE~SR---qKLL~E  231 (388)
                      +-+=..-+-.+..+...|.+.+..+..+..+   ..-+.....+=-...-.-+-..+.++--++--|++.|   +-.+++
T Consensus       944 l~~le~~~~~~e~e~~~L~e~~~~~~~k~~E~~~~~~e~~~~~~E~k~~~~~~k~~~e~i~k~~~~lk~~rId~~~K~e~ 1023 (1293)
T KOG0996|consen  944 LSELEREIEDTEKELDDLTEELKGLEEKAAELEKEYKEAEESLKEIKKELRDLKSELENIKKSENELKAERIDIENKLEA 1023 (1293)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhccHHHHHHH


Q ss_pred             hhhhhHHHH
Q 047021          232 IDNQSSEIE  240 (388)
Q Consensus       232 ID~QS~EIE  240 (388)
                      ++.+-.||+
T Consensus      1024 ~~~~l~e~~ 1032 (1293)
T KOG0996|consen 1024 INGELNEIE 1032 (1293)
T ss_pred             HHHHHHHHH


No 64 
>PRK11281 hypothetical protein; Provisional
Probab=72.45  E-value=2.1e+02  Score=33.82  Aligned_cols=82  Identities=24%  Similarity=0.225  Sum_probs=44.0

Q ss_pred             HHHHHHHHHHHHHHhcCCCCCCCCCCCChhhHHHHHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHH
Q 047021           99 AALRAELNSLQQQAMDGSLGGISSMGISPDHVQTLEKELATLKSQLQQESQLRQ-------QERKQLTEEQARISVLMSE  171 (388)
Q Consensus        99 AaLRAEln~lQqq~~~~~~s~~~~~g~s~~~~~~lekEl~~l~~~lqqes~lrq-------qeqqrla~Eq~~~a~l~aE  171 (388)
                      ...+..+..+.++..++..+|- +  .++.+...|+-|++-++++...+.+.-.       =.+.+..--..++..+..+
T Consensus       166 sea~~RlqeI~~~L~~~~~~~~-~--l~~~~~~~l~ae~~~l~~~~~~~~~~l~~~~~l~~l~~~q~d~~~~~~~~~~~~  242 (1113)
T PRK11281        166 YANSQRLQQIRNLLKGGKVGGK-A--LRPSQRVLLQAEQALLNAQNDLQRKSLEGNTQLQDLLQKQRDYLTARIQRLEHQ  242 (1113)
T ss_pred             HHHHHHHHHHHHHHhCCCCCCC-c--CCHHHHHHHHHHHHHHHHHHHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHH
Confidence            4445556666666644444443 2  6777788888888877766433222211       1233333333445555555


Q ss_pred             HHHHHHHHHHHh
Q 047021          172 KQELEEKLVMFS  183 (388)
Q Consensus       172 kqeLeekl~~~~  183 (388)
                      .+.|.+.+..-.
T Consensus       243 ~~~lq~~in~kr  254 (1113)
T PRK11281        243 LQLLQEAINSKR  254 (1113)
T ss_pred             HHHHHHHHHHHH
Confidence            566655555533


No 65 
>TIGR03007 pepcterm_ChnLen polysaccharide chain length determinant protein, PEP-CTERM locus subfamily. Members of this protein family belong to the family of polysaccharide chain length determinant proteins (pfam02706). All are found in species that encode the PEP-CTERM/exosortase system predicted to act in protein sorting in a number of Gram-negative bacteria, and are found near the epsH homolog that is the putative exosortase gene.
Probab=72.43  E-value=1.2e+02  Score=30.99  Aligned_cols=34  Identities=24%  Similarity=0.203  Sum_probs=21.7

Q ss_pred             HHhHHHHHHHhhhHhhHHHHHHHHHHHHHHHhcC
Q 047021           82 KRLKEAEEEQYRAEEDAAALRAELNSLQQQAMDG  115 (388)
Q Consensus        82 kRlkEAEEEQyRAEEDaAaLRAEln~lQqq~~~~  115 (388)
                      .++.....+...++.+-+.+.+.+++|+++....
T Consensus       204 ~~l~~l~~~l~~~~~~l~~~~a~~~~l~~~l~~~  237 (498)
T TIGR03007       204 SEISEAQEELEAARLELNEAIAQRDALKRQLGGE  237 (498)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC
Confidence            3455555555566666777777777787776543


No 66 
>PF01576 Myosin_tail_1:  Myosin tail;  InterPro: IPR002928 Muscle contraction is caused by sliding between the thick and thin filaments of the myofibril. Myosin is a major component of thick filaments and exists as a hexamer of 2 heavy chains [], 2 alkali light chains, and 2 regulatory light chains. The heavy chain can be subdivided into the N-terminal globular head and the C-terminal coiled-coil rod-like tail, although some forms have a globular region in their C-terminal. There are many cell-specific isoforms of myosin heavy chains, coded for by a multi-gene family []. Myosin interacts with actin to convert chemical energy, in the form of ATP, to mechanical energy []. The 3-D structure of the head portion of myosin has been determined [] and a model for actin-myosin complex has been constructed []. This family consists of the coiled-coil myosin heavy chain tail region. The coiled-coil is composed of the tail from two molecules of myosin. These can then assemble into the macromolecular thick filament []. The coiled-coil region provides the structural backbone of the thick filament [].; GO: 0003774 motor activity, 0016459 myosin complex; PDB: 2LNK_C 3ZWH_Q.
Probab=71.52  E-value=1.3  Score=49.23  Aligned_cols=70  Identities=24%  Similarity=0.284  Sum_probs=0.0

Q ss_pred             HHHHHHHhHHHHHHHHHHhhHHHHHHHhhhhhHHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHH
Q 047021          205 EKLEKQLHDMAVVVERLESSRQKLLMEIDNQSSEIERLFEENSSLSSSYQESMGIAKHWENQVKDCLKQN  274 (388)
Q Consensus       205 ekle~Ql~dma~~VeRLE~SRqKLL~EID~QS~EIErLf~ENs~Lsss~qea~~iaaqWEnQvqDclkQN  274 (388)
                      -+++++|.+...-++.+...|+.+=-|...-+.||=+|..++..+...+-.+...-..+.+.+.|+..|.
T Consensus       380 r~fDk~l~e~k~~~~~~~~e~d~~q~e~r~~~te~~~Lk~~lee~~e~~e~lere~k~L~~El~dl~~q~  449 (859)
T PF01576_consen  380 RKFDKQLAEWKAKVEELQAERDAAQREARELETELFKLKNELEELQEQLEELERENKQLQDELEDLTSQL  449 (859)
T ss_dssp             ----------------------------------------------------------------------
T ss_pred             HhHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHhhccchhhh
Confidence            3466777777777777777777777777777777777777666666666666655666666666655443


No 67 
>PF06005 DUF904:  Protein of unknown function (DUF904);  InterPro: IPR009252 Cell division protein ZapB is a non-essential, abundant cell division factor that is required for proper Z-ring formation. It is recruited early to the divisome by direct interaction with FtsZ, stimulating Z-ring assembly and thereby promoting cell division earlier in the cell cycle. Its recruitment to the Z-ring requires functional FtsA or ZipA.; GO: 0000917 barrier septum formation, 0043093 cytokinesis by binary fission, 0005737 cytoplasm; PDB: 2JEE_A.
Probab=70.63  E-value=56  Score=26.52  Aligned_cols=65  Identities=29%  Similarity=0.276  Sum_probs=39.2

Q ss_pred             HHHHHhhHHHHHHHhhhhhHHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 047021          218 VERLESSRQKLLMEIDNQSSEIERLFEENSSLSSSYQESMGIAKHWENQVKDCLKQNEELRGILEKLR  285 (388)
Q Consensus       218 VeRLE~SRqKLL~EID~QS~EIErLf~ENs~Lsss~qea~~iaaqWEnQvqDclkQNeeLr~lLeklR  285 (388)
                      .++||.-=++++.-|+.=.-||+.|-++|.+|++.-.+-..   .=+.--+|-=...+.|+++|.|+.
T Consensus         6 l~~LE~ki~~aveti~~Lq~e~eeLke~n~~L~~e~~~L~~---en~~L~~e~~~~~~rl~~LL~kl~   70 (72)
T PF06005_consen    6 LEQLEEKIQQAVETIALLQMENEELKEKNNELKEENEELKE---ENEQLKQERNAWQERLRSLLGKLE   70 (72)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHhhh
Confidence            45666666677777777777888888887777733322222   111222444455677788887764


No 68 
>KOG0977 consensus Nuclear envelope protein lamin, intermediate filament superfamily [Cell cycle control, cell division, chromosome partitioning; Nuclear structure]
Probab=70.53  E-value=1.7e+02  Score=32.08  Aligned_cols=50  Identities=24%  Similarity=0.406  Sum_probs=35.1

Q ss_pred             HHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhH
Q 047021           35 ESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRLK   85 (388)
Q Consensus        35 ~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRlk   85 (388)
                      ..|+.++.++.++. --|+..|..+-...|..++.|-|+..+.-++..+|.
T Consensus       165 ~~le~e~~~Lk~en-~rl~~~l~~~r~~ld~Etllr~d~~n~~q~Lleel~  214 (546)
T KOG0977|consen  165 KALEDELKRLKAEN-SRLREELARARKQLDDETLLRVDLQNRVQTLLEELA  214 (546)
T ss_pred             HHHHHHHHHHHHHh-hhhHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHH
Confidence            33333333443332 346778888889999999999999988888888773


No 69 
>PF11932 DUF3450:  Protein of unknown function (DUF3450);  InterPro: IPR016866 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function. However, they are found in an operon along with components of a TonB transport system (typified by Vibrio cholerae TonB2 [], and are predicted to be localized to the periplasmic space. Caution: the low-complexity nature of these sequences produces spurious BLAST hits to chromosome segregation ATPases (which are much longer in length and contain canonical Walker motifs). Accordingly, some members are misidentified as such.
Probab=69.64  E-value=1e+02  Score=29.15  Aligned_cols=72  Identities=15%  Similarity=0.279  Sum_probs=62.1

Q ss_pred             hhhHHHHHHHHhHHHHHHHHHHhhHHHHHHHhhhhhHHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHH
Q 047021          201 VEDKEKLEKQLHDMAVVVERLESSRQKLLMEIDNQSSEIERLFEENSSLSSSYQESMGIAKHWENQVKDCLK  272 (388)
Q Consensus       201 ~eDkekle~Ql~dma~~VeRLE~SRqKLL~EID~QS~EIErLf~ENs~Lsss~qea~~iaaqWEnQvqDclk  272 (388)
                      .+++..|...++.+-..++.|+.-.++|=.-+++|..+|++|-..-..+....++...+-.++-..++.-+.
T Consensus        48 ~~e~~~L~~e~~~l~~e~e~L~~~~~~l~~~v~~q~~el~~L~~qi~~~~~~~~~l~p~m~~m~~~L~~~v~  119 (251)
T PF11932_consen   48 DDEKQELLAEYRQLEREIENLEVYNEQLERQVASQEQELASLEQQIEQIEETRQELVPLMEQMIDELEQFVE  119 (251)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence            477888888888888889999999999999999999999999999999999999988888888777766444


No 70 
>COG4942 Membrane-bound metallopeptidase [Cell division and chromosome partitioning]
Probab=69.42  E-value=1.6e+02  Score=31.32  Aligned_cols=24  Identities=21%  Similarity=0.275  Sum_probs=12.8

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHh
Q 047021          160 EEQARISVLMSEKQELEEKLVMFS  183 (388)
Q Consensus       160 ~Eq~~~a~l~aEkqeLeekl~~~~  183 (388)
                      .++.+++.+..|++.++.++.+-.
T Consensus       196 ~q~~kl~~~~~E~kk~~~~l~~~l  219 (420)
T COG4942         196 AQQAKLAQLLEERKKTLAQLNSEL  219 (420)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHH
Confidence            344455555555555555555533


No 71 
>PF09755 DUF2046:  Uncharacterized conserved protein H4 (DUF2046);  InterPro: IPR019152  This is the conserved N-terminal 350 residues of a family of proteins of unknown function possibly containing a coiled-coil domain. 
Probab=68.98  E-value=1.4e+02  Score=30.56  Aligned_cols=61  Identities=23%  Similarity=0.410  Sum_probs=44.9

Q ss_pred             HHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhh--------------hHHHHhhhhhhhhhhHH
Q 047021            7 KDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVP--------------SLRKALKDVAMEKDAAV   67 (388)
Q Consensus         7 ~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~--------------slrkaL~~v~mEkDaA~   67 (388)
                      .+|..+...|..+|+.|...+.+-..+..+|+..+..+....|.              +|=|-|..+.-||+..+
T Consensus        23 ~~l~~~~~sL~qen~~Lk~El~~ek~~~~~L~~e~~~lr~~sv~~~~~aEqEEE~isN~LlKkl~~l~keKe~L~   97 (310)
T PF09755_consen   23 EQLRKRIESLQQENRVLKRELETEKARCKHLQEENRALREASVRIQAKAEQEEEFISNTLLKKLQQLKKEKETLA   97 (310)
T ss_pred             HHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            57888899999999999999999999999998888766555443              34444555555555554


No 72 
>PF00170 bZIP_1:  bZIP transcription factor cAMP response element binding (CREB) protein signature fos transforming protein signature jun transcription factor signature;  InterPro: IPR011616  The basic-leucine zipper (bZIP) transcription factors [, ] of eukaryotic are proteins that contain a basic region mediating sequence-specific DNA-binding followed by a leucine zipper region (see IPR002158 from INTERPRO) required for dimerization.; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0043565 sequence-specific DNA binding, 0046983 protein dimerization activity, 0006355 regulation of transcription, DNA-dependent; PDB: 2H7H_B 2OQQ_B 1S9K_E 1JNM_A 1JUN_A 1FOS_H 1A02_J 1T2K_C 1CI6_A 1DH3_C ....
Probab=68.74  E-value=13  Score=28.25  Aligned_cols=36  Identities=28%  Similarity=0.417  Sum_probs=29.7

Q ss_pred             hhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHH
Q 047021            5 QLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTR   40 (388)
Q Consensus         5 ~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~   40 (388)
                      .+..|+.++..|+.+|..|...+.....++..|...
T Consensus        27 ~~~~Le~~~~~L~~en~~L~~~~~~L~~~~~~L~~e   62 (64)
T PF00170_consen   27 YIEELEEKVEELESENEELKKELEQLKKEIQSLKSE   62 (64)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence            578889999999999999988888888777777654


No 73 
>PF12777 MT:  Microtubule-binding stalk of dynein motor;  InterPro: IPR024743 The 380 kDa motor unit of dynein belongs to the AAA class of chaperone-like ATPases. The core of the 380 kDa motor unit contains a concatenated chain of six AAA modules (D1-6), of which four correspond to the ATP binding sites with P-loop signatures, and two are modules in which the P loop has been lost in evolution. This domain occurs between D4 and D5 and includes the two predicted alpha-helical coiled coil segments that form the stalk supporting the ATP-sensitive microtubule binding component [].; PDB: 3VKH_A 3VKG_A 3ERR_A 4AKI_A 4AI6_B 4AKH_A 4AKG_A 3QMZ_A 2RR7_A.
Probab=68.56  E-value=31  Score=34.25  Aligned_cols=108  Identities=23%  Similarity=0.263  Sum_probs=51.3

Q ss_pred             hhHHHHHHhhHHHHHHhhHhhhhhh---hhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhHHHHHHHhhhHhhHHHH
Q 047021           25 SKLQVKEVESESLQTRINELEQNTV---PSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRLKEAEEEQYRAEEDAAAL  101 (388)
Q Consensus        25 sk~~~k~~~~~~L~~~Lnd~e~n~v---~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRlkEAEEEQyRAEEDaAaL  101 (388)
                      .||.+....++.|+.+|....-...   ....+.++.|..++..|-.-+......=..+....+++.+.+-.|++|-+..
T Consensus         8 ~KL~et~~~V~~m~~~L~~~~~~L~~k~~e~e~ll~~i~~~~~~a~~~~~~~~~ee~~~~~~~~ei~~~~~~a~~~L~~a   87 (344)
T PF12777_consen    8 DKLKETEEQVEEMQEELEEKQPELEEKQKEAEELLEEIEKEQEEAEKKKAIVEEEEEEAEKQAKEIEEIKEEAEEELAEA   87 (344)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHCCHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            3445555555555555553321111   1233455555555444443333333333344455556666666666666555


Q ss_pred             HHHHHHHHHHHhcCCCCCCC---CCCCChhhHHH
Q 047021          102 RAELNSLQQQAMDGSLGGIS---SMGISPDHVQT  132 (388)
Q Consensus       102 RAEln~lQqq~~~~~~s~~~---~~g~s~~~~~~  132 (388)
                      .-.|..-+..+..=..++++   +|+.||.-+..
T Consensus        88 ~P~L~~A~~al~~l~k~di~Eiks~~~PP~~V~~  121 (344)
T PF12777_consen   88 EPALEEAQEALKSLDKSDISEIKSYANPPEAVKL  121 (344)
T ss_dssp             HHHHHHHHHHHHCS-HHHHHHHHHSSS--HHHHH
T ss_pred             HHHHHHHHHHHHhCCHHHHHHHHhhCCCcHHHHH
Confidence            55555555555443333443   77777777554


No 74 
>TIGR03185 DNA_S_dndD DNA sulfur modification protein DndD. This model describes the DndB protein encoded by an operon associated with a sulfur-containing modification to DNA. The operon is sporadically distributed in bacteria, much like some restriction enzyme operons. DndD is described as a putative ATPase. The small number of examples known so far include species from among the Firmicutes, Actinomycetes, Proteobacteria, and Cyanobacteria.
Probab=67.63  E-value=1.8e+02  Score=31.28  Aligned_cols=42  Identities=26%  Similarity=0.376  Sum_probs=30.2

Q ss_pred             hhHHHHHHHHHHHhHHHHHHHhhhHhhHHHHHHHHHHHHHHH
Q 047021           71 EDLSAQLRTLKKRLKEAEEEQYRAEEDAAALRAELNSLQQQA  112 (388)
Q Consensus        71 eDl~AQLR~~kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq~  112 (388)
                      .++..++..+.+++++.+.+.-..+++.+.+.+++..+++..
T Consensus       205 ~~~~~~~~~le~el~~l~~~~e~l~~~i~~l~~ele~a~~~l  246 (650)
T TIGR03185       205 SSILSEIEALEAELKEQSEKYEDLAQEIAHLRNELEEAQRSL  246 (650)
T ss_pred             hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            346677777777777777777777777777777777666554


No 75 
>PF08614 ATG16:  Autophagy protein 16 (ATG16);  InterPro: IPR013923 Macroautophagy is a bulk degradation process induced by starvation in eukaryotic cells. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. No molecule involved in autophagy has yet been identified in higher eukaryotes []. The pre-autophagosomal structure contains at least five Apg proteins: Apg1p, Apg2p, Apg5p, Aut7p/Apg8p and Apg16p. It is found in the vacuole []. The C-terminal glycine of Apg12p is conjugated to a lysine residue of Apg5p via an isopeptide bond. During autophagy, cytoplasmic components are enclosed in autophagosomes and delivered to lysosomes/vacuoles. Auotphagy protein 16 (Apg16) has been shown to be bind to Apg5 and is required for the function of the Apg12p-Apg5p conjugate []. Autophagy protein 5 (Apg5) is directly required for the import of aminopeptidase I via the cytoplasm-to-vacuole targeting pathway []. This entry represents auotphagy protein 16 (Apg16), which is required for the function of the Apg12p-Apg5p conjugate.; PDB: 3A7O_D 3A7P_B.
Probab=65.24  E-value=39  Score=30.90  Aligned_cols=145  Identities=28%  Similarity=0.316  Sum_probs=32.0

Q ss_pred             hhHHHHHHHHHHHHHHHhcC----CCCCCCCCCCChhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 047021           96 EDAAALRAELNSLQQQAMDG----SLGGISSMGISPDHVQTLEKELATLKSQLQQESQLRQQERKQLTEEQARISVLMSE  171 (388)
Q Consensus        96 EDaAaLRAEln~lQqq~~~~----~~s~~~~~g~s~~~~~~lekEl~~l~~~lqqes~lrqqeqqrla~Eq~~~a~l~aE  171 (388)
                      +.++.|.++...|+......    +.|+..+.++++..+..++..+..+..+|..--.-+.+-.++|-.       +..+
T Consensus        31 d~~~~l~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~le~~~~~l~~ELael~r~~~el~~~L~~-------~~~~  103 (194)
T PF08614_consen   31 DRTSLLKAENEQLQPEAESLPSSSSSSPSESGSVSSAQISSLEQKLAKLQEELAELYRSKGELAQQLVE-------LNDE  103 (194)
T ss_dssp             --------------------------------------------------------------------------------
T ss_pred             ccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccccc-------cccc
Confidence            45667777777777643222    122222333445555566666555555444322222223333333       3333


Q ss_pred             HHHHHHHHHHHhhhchhhhhhhhhhhhcchhhHHHHHHHHhHHHHHHHHHHhhHHHHHHHhhhhhHHHHHHhhhhhhhhh
Q 047021          172 KQELEEKLVMFSRKASEEVSNAAASKAFSVEDKEKLEKQLHDMAVVVERLESSRQKLLMEIDNQSSEIERLFEENSSLSS  251 (388)
Q Consensus       172 kqeLeekl~~~~~~~se~~~~~a~~k~~S~eDkekle~Ql~dma~~VeRLE~SRqKLL~EID~QS~EIErLf~ENs~Lss  251 (388)
                      .+.++.++..-......-....    +--......++..|.++...++.|.--=.-|-.++...-..+.+|=.||..|..
T Consensus       104 l~~l~~~~~~~~~~l~~l~~~~----~~L~~~~~~l~~~l~ek~k~~e~l~DE~~~L~l~~~~~e~k~~~l~~En~~Lv~  179 (194)
T PF08614_consen  104 LQELEKELSEKERRLAELEAEL----AQLEEKIKDLEEELKEKNKANEILQDELQALQLQLNMLEEKLRKLEEENRELVE  179 (194)
T ss_dssp             ----------HHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             cchhhhhHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            3344433333222211111111    111223344555555555555555555555556666666667777777776643


No 76 
>PRK02224 chromosome segregation protein; Provisional
Probab=65.16  E-value=2.2e+02  Score=31.26  Aligned_cols=33  Identities=30%  Similarity=0.333  Sum_probs=15.4

Q ss_pred             HHHHHHHhHHHHHHHhhhHhhHHHHHHHHHHHH
Q 047021           77 LRTLKKRLKEAEEEQYRAEEDAAALRAELNSLQ  109 (388)
Q Consensus        77 LR~~kkRlkEAEEEQyRAEEDaAaLRAEln~lQ  109 (388)
                      +.....++.+++++--.+-++-+.+++.++.++
T Consensus       407 ~~~~e~~l~~l~~~~~~l~~~~~~~~~~~~~~~  439 (880)
T PRK02224        407 LGNAEDFLEELREERDELREREAELEATLRTAR  439 (880)
T ss_pred             hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            334444444444444444445455555444444


No 77 
>PF15035 Rootletin:  Ciliary rootlet component, centrosome cohesion
Probab=64.84  E-value=14  Score=34.38  Aligned_cols=43  Identities=28%  Similarity=0.452  Sum_probs=38.9

Q ss_pred             hhhhhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHh
Q 047021            2 LLSQLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINEL   44 (388)
Q Consensus         2 l~~~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~   44 (388)
                      +++++.+=+.+|..|..-|.+|..+++.+...|+.|..+|.-+
T Consensus        65 ~l~rLeEEqqR~~~L~qvN~lLReQLEq~~~~N~~L~~dl~kl  107 (182)
T PF15035_consen   65 ALIRLEEEQQRSEELAQVNALLREQLEQARKANEALQEDLQKL  107 (182)
T ss_pred             HHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            4678889999999999999999999999999999999887644


No 78 
>TIGR01843 type_I_hlyD type I secretion membrane fusion protein, HlyD family. Type I secretion is an ABC transport process that exports proteins, without cleavage of any signal sequence, from the cytosol to extracellular medium across both inner and outer membranes. The secretion signal is found in the C-terminus of the transported protein. This model represents the adaptor protein between the ATP-binding cassette (ABC) protein of the inner membrane and the outer membrane protein, and is called the membrane fusion protein. This model selects a subfamily closely related to HlyD; it is defined narrowly and excludes, for example, colicin V secretion protein CvaA and multidrug efflux proteins.
Probab=64.03  E-value=1.5e+02  Score=28.84  Aligned_cols=169  Identities=16%  Similarity=0.171  Sum_probs=0.0

Q ss_pred             hHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhh-------------HHHHhhhhhhhhhhHHHhhhhHHHHHHH
Q 047021           13 NAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPS-------------LRKALKDVAMEKDAAVVAREDLSAQLRT   79 (388)
Q Consensus        13 ~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~s-------------lrkaL~~v~mEkDaA~vareDl~AQLR~   79 (388)
                      +..++.+..-+..++.........|...++....+..|.             +......+........-..+.+-+++..
T Consensus        76 ~~~~~~~l~~l~~~~~~l~a~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~i~~  155 (423)
T TIGR01843        76 ATDVEADAAELESQVLRLEAEVARLRAEADSQAAIEFPDDLLSAEDPAVPELIKGQQSLFESRKSTLRAQLELILAQIKQ  155 (423)
T ss_pred             cchhhhHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCCChHHhhcccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH


Q ss_pred             HHHHhHHHHHHHhhhHhhHHHHHHHHHHHHHHHhcCCCCCCCCCCCChhhHHHHHHHHHHHHHHHHH--------HHHHH
Q 047021           80 LKKRLKEAEEEQYRAEEDAAALRAELNSLQQQAMDGSLGGISSMGISPDHVQTLEKELATLKSQLQQ--------ESQLR  151 (388)
Q Consensus        80 ~kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq~~~~~~s~~~~~g~s~~~~~~lekEl~~l~~~lqq--------es~lr  151 (388)
                      ++..+...+...-.++.....+..+++..+.-...+..|.. .+-.+..++..++.++..++.++.+        +..+.
T Consensus       156 ~~~~i~~~~~~l~~~~~~l~~~~~~~~~~~~L~~~g~is~~-~~~~~~~~~~~~~~~l~~~~~~l~~~~~~l~~~~~~~~  234 (423)
T TIGR01843       156 LEAELAGLQAQLQALRQQLEVISEELEARRKLKEKGLVSRL-ELLELERERAEAQGELGRLEAELEVLKRQIDELQLERQ  234 (423)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCHH-HHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHH


Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 047021          152 QQERKQLTEEQARISVLMSEKQELEEKLVMF  182 (388)
Q Consensus       152 qqeqqrla~Eq~~~a~l~aEkqeLeekl~~~  182 (388)
                      ................+..+...++.++..+
T Consensus       235 ~~~~~~~~~~~~~l~~~~~~l~~~~~~l~~~  265 (423)
T TIGR01843       235 QIEQTFREEVLEELTEAQARLAELRERLNKA  265 (423)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH


No 79 
>TIGR03017 EpsF chain length determinant protein EpsF. Sequences in this family of proteins are members of the chain length determinant family (pfam02706) which includes the wzc protein from E.coli. This family of proteins are homologous to the EpsF protein of the methanolan biosynthesis operon of Methylobacillus species strain 12S. The distribution of this protein appears to be restricted to a subset of exopolysaccharide operons containing a syntenic grouping of genes including a variant of the EpsH exosortase protein. Exosortase has been proposed to be involved in the targetting and processing of proteins containing the PEP-CTERM domain to the exopolysaccharide layer.
Probab=63.77  E-value=1.3e+02  Score=29.97  Aligned_cols=80  Identities=19%  Similarity=0.241  Sum_probs=48.4

Q ss_pred             hHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhH
Q 047021            6 LKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRLK   85 (388)
Q Consensus         6 v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRlk   85 (388)
                      +.+|..+..+++.+...|...+-..|-..-.+..+++.++...-..+++....+.-+-..+.....++.+++...++++.
T Consensus       256 i~~l~~~l~~le~~l~~l~~~y~~~hP~v~~l~~~i~~l~~~l~~e~~~~~~~~~~~~~~~~~~~~~l~~~l~~~~~~~~  335 (444)
T TIGR03017       256 IQNLKTDIARAESKLAELSQRLGPNHPQYKRAQAEINSLKSQLNAEIKKVTSSVGTNSRILKQREAELREALENQKAKVL  335 (444)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHhCCCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            45677777788888888888777777777777777777776655444444443333333333334444444444444433


No 80 
>PF05701 WEMBL:  Weak chloroplast movement under blue light;  InterPro: IPR008545 This family consists of several plant proteins of unknown function. Several sequences in this family are described as being myosin heavy chain-like.
Probab=62.07  E-value=2.2e+02  Score=30.26  Aligned_cols=167  Identities=23%  Similarity=0.255  Sum_probs=93.5

Q ss_pred             hhhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHH
Q 047021            4 SQLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKR   83 (388)
Q Consensus         4 ~~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkR   83 (388)
                      .++...+..+..+.++...+..++..++.++--..++|..- +.+|-.|..-|..+..+++.|.-.-+       ..|-|
T Consensus        27 e~~~~~e~eL~~~qeel~~~k~~l~~~E~~k~~~l~ELe~a-kr~veel~~kLe~~~~~~~~a~~~~e-------~~k~r   98 (522)
T PF05701_consen   27 ERVKEKETELEKAQEELAKLKEQLEAAEREKAQALSELESA-KRTVEELKLKLEKAQAEEKQAEEDSE-------LAKFR   98 (522)
T ss_pred             hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHhhH-------HhHHH
Confidence            56778888888899999999999999999988887777643 45677777778777777776654433       44556


Q ss_pred             hHHHHHHHhhhHhhHHHHHHHHHHHHHHHhcCCCCCCCCCCCChhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHH
Q 047021           84 LKEAEEEQYRAEEDAAALRAELNSLQQQAMDGSLGGISSMGISPDHVQTLEKELATLKSQLQQESQLRQQERKQLT-EEQ  162 (388)
Q Consensus        84 lkEAEEEQyRAEEDaAaLRAEln~lQqq~~~~~~s~~~~~g~s~~~~~~lekEl~~l~~~lqqes~lrqqeqqrla-~Eq  162 (388)
                      ++|.+.  --+......++++|...+.|-..    .++-....-+++..+..|++.+...-. ...=+..+-.+.+ .-.
T Consensus        99 ~~e~e~--~~~~~~~~~~k~ele~~~~q~~~----~~~eL~~~k~EL~~lr~e~~~~~~~k~-~A~~~aeea~~~a~~~~  171 (522)
T PF05701_consen   99 AKELEQ--GIAEEASVAWKAELESAREQYAS----AVAELDSVKQELEKLRQELASALDAKN-AALKQAEEAVSAAEENE  171 (522)
T ss_pred             HHHHhh--hhcccchHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHH
Confidence            665543  23333334466666666655411    111111123334444444443322110 0111111111111 233


Q ss_pred             HHHHHHHHHHHHHHHHHHHHhhh
Q 047021          163 ARISVLMSEKQELEEKLVMFSRK  185 (388)
Q Consensus       163 ~~~a~l~aEkqeLeekl~~~~~~  185 (388)
                      .++..|..|...+.+-|.++...
T Consensus       172 ~kve~L~~Ei~~lke~l~~~~~a  194 (522)
T PF05701_consen  172 EKVEELSKEIIALKESLESAKLA  194 (522)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHH
Confidence            44556666666666666665433


No 81 
>KOG0977 consensus Nuclear envelope protein lamin, intermediate filament superfamily [Cell cycle control, cell division, chromosome partitioning; Nuclear structure]
Probab=62.02  E-value=2.5e+02  Score=30.88  Aligned_cols=83  Identities=27%  Similarity=0.375  Sum_probs=58.5

Q ss_pred             HHHhhHHHHHHhhHhhhhh--------------hhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhHHHHHHHhhhH
Q 047021           30 KEVESESLQTRINELEQNT--------------VPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRLKEAEEEQYRAE   95 (388)
Q Consensus        30 k~~~~~~L~~~Lnd~e~n~--------------v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRlkEAEEEQyRAE   95 (388)
                      .+.+|..|+.+||.++...              .+++++.|++++=+|--+.+==.-|..++-.+++++-+++.+--.++
T Consensus        61 LEaqN~~L~~di~~lr~~~~~~ts~ik~~ye~El~~ar~~l~e~~~~ra~~e~ei~kl~~e~~elr~~~~~~~k~~~~~r  140 (546)
T KOG0977|consen   61 LEAQNRKLEHDINLLRGVVGRETSGIKAKYEAELATARKLLDETARERAKLEIEITKLREELKELRKKLEKAEKERRGAR  140 (546)
T ss_pred             HHHHHHHHHHHHHHHHhhccCCCcchhHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHhhhH
Confidence            3455566666666554332              35788889888777766666666677888889999999988888888


Q ss_pred             hhHHHHHHHHHHHHHHH
Q 047021           96 EDAAALRAELNSLQQQA  112 (388)
Q Consensus        96 EDaAaLRAEln~lQqq~  112 (388)
                      ++.-....-|+.++...
T Consensus       141 e~~~~~~~~l~~leAe~  157 (546)
T KOG0977|consen  141 EKLDDYLSRLSELEAEI  157 (546)
T ss_pred             HHHHHHhhhhhhhhhHH
Confidence            88776666666665544


No 82 
>PF01166 TSC22:  TSC-22/dip/bun family;  InterPro: IPR000580 Several eukaryotic proteins are evolutionary related and are thought to be involved in transcriptional regulation. These proteins are highly similar in a region of about 50 residues that include a conserved leucine-zipper domain most probably involved in homo- or hetero-dimerisation. Proteins containing this signature include:   Vertebrate protein TSC-22 [], a transcriptional regulator which seems to act on C-type natriuretic peptide (CNP) promoter. Mammalian protein DIP (DSIP-immunoreactive peptide) [], a protein whose function is not yet known. Drosophila protein bunched [] (gene bun) (also known as shortsighted), a probable transcription factor required for peripheral nervous system morphogenesis, eye development and oogenesis.  Caenorhabditis elegans hypothetical protein T18D3.7.  ; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0006355 regulation of transcription, DNA-dependent; PDB: 1DIP_B.
Probab=60.90  E-value=9.8  Score=30.47  Aligned_cols=25  Identities=32%  Similarity=0.655  Sum_probs=21.3

Q ss_pred             hhhhhHHHHHhhHHHHHHhHHhhhh
Q 047021            2 LLSQLKDFQKKNAELEEKNSILSSK   26 (388)
Q Consensus         2 l~~~v~~lq~~~~~le~~n~~L~sk   26 (388)
                      |-.++.+|+.+|.+||.||.+|.+.
T Consensus        19 LK~~I~eL~~~n~~Le~EN~~Lk~~   43 (59)
T PF01166_consen   19 LKEQIAELEERNSQLEEENNLLKQN   43 (59)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHhc
Confidence            4568899999999999999998764


No 83 
>COG4372 Uncharacterized protein conserved in bacteria with the myosin-like domain [Function unknown]
Probab=60.02  E-value=2.5e+02  Score=30.27  Aligned_cols=87  Identities=28%  Similarity=0.350  Sum_probs=61.7

Q ss_pred             HHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhHHHHH
Q 047021           10 QKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRLKEAEE   89 (388)
Q Consensus        10 q~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRlkEAEE   89 (388)
                      +....+.|.|+.+-.+++++...+.+....+++...+|.    -||+-+.           --+++|-+-++.||+---+
T Consensus        94 ~~~k~~~e~er~~~~~El~~~r~e~~~v~~~~~~a~~n~----~kAqQ~l-----------ar~t~Q~q~lqtrl~~l~~  158 (499)
T COG4372          94 QGEKRAAETEREAARSELQKARQEREAVRQELAAARQNL----AKAQQEL-----------ARLTKQAQDLQTRLKTLAE  158 (499)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHH-----------HHHHHHHHHHHHHHHHHHH
Confidence            334445566666666777777777777777776666653    2444222           1367888999999999888


Q ss_pred             HHhhhHhhHHHHHHHHHHHHHH
Q 047021           90 EQYRAEEDAAALRAELNSLQQQ  111 (388)
Q Consensus        90 EQyRAEEDaAaLRAEln~lQqq  111 (388)
                      +-..+++.+-+|-|+-+.||..
T Consensus       159 qr~ql~aq~qsl~a~~k~LQ~s  180 (499)
T COG4372         159 QRRQLEAQAQSLQASQKQLQAS  180 (499)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHH
Confidence            8888899999999999999865


No 84 
>PRK10884 SH3 domain-containing protein; Provisional
Probab=59.80  E-value=1.1e+02  Score=29.21  Aligned_cols=33  Identities=12%  Similarity=0.128  Sum_probs=20.2

Q ss_pred             HhHHHHHHHhhhHhhHHHHHHHHHHHHHHHhcC
Q 047021           83 RLKEAEEEQYRAEEDAAALRAELNSLQQQAMDG  115 (388)
Q Consensus        83 RlkEAEEEQyRAEEDaAaLRAEln~lQqq~~~~  115 (388)
                      +.++..++-..+......|+++++.++...+..
T Consensus       140 ~n~~L~~~l~~~~~~~~~l~~~~~~~~~~~~~~  172 (206)
T PRK10884        140 ENQKLKNQLIVAQKKVDAANLQLDDKQRTIIMQ  172 (206)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            333334444446666777888888888665443


No 85 
>KOG3977 consensus Troponin I [Cytoskeleton]
Probab=58.61  E-value=23  Score=34.40  Aligned_cols=75  Identities=27%  Similarity=0.428  Sum_probs=58.8

Q ss_pred             HHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhh-hhHHHHhhhhhhhhhhHHH--------hhhhHHHHH
Q 047021            7 KDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTV-PSLRKALKDVAMEKDAAVV--------AREDLSAQL   77 (388)
Q Consensus         7 ~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v-~slrkaL~~v~mEkDaA~v--------areDl~AQL   77 (388)
                      .+|.-+...||+++=-+.-+....+.++..|-..+|||+-+.| |+|++-    ++=-|+---        ...|+-|+|
T Consensus        98 ~~l~arv~~leEEkYDi~~~v~qt~~EIndLtikvnDLRGKFvkPtLkkV----sks~~kf~ka~~~~k~~~k~DlRanL  173 (221)
T KOG3977|consen   98 RELHARVDALEEEKYDIEAKVTQTETEINDLTIKVNDLRGKFVKPTLKKV----SKSADKFLKALLGSKHKVKMDLRANL  173 (221)
T ss_pred             HHHHHHHHHHHHhhcchhheeehhhhhHHHHHHHHHHhcccccCccHHHH----HhhhHHHHHHhhccchhhhHHHHHHH
Confidence            4677788888898888888888899999999999999999999 998763    333333221        267999999


Q ss_pred             HHHHHHhH
Q 047021           78 RTLKKRLK   85 (388)
Q Consensus        78 R~~kkRlk   85 (388)
                      .+.||-=-
T Consensus       174 K~VKKed~  181 (221)
T KOG3977|consen  174 KQVKKEDT  181 (221)
T ss_pred             HHhhhhhH
Confidence            99998633


No 86 
>TIGR02231 conserved hypothetical protein. This family consists of proteins over 500 amino acids long in Caenorhabditis elegans and several bacteria (Pseudomonas aeruginosa, Nostoc sp. PCC 7120, Leptospira interrogans, etc.). The function is unknown.
Probab=58.55  E-value=2.1e+02  Score=29.89  Aligned_cols=44  Identities=25%  Similarity=0.275  Sum_probs=38.0

Q ss_pred             hhhHHHHHHHHHHHhHHHHHHHhhhHhhHHHHHHHHHHHHHHHh
Q 047021           70 REDLSAQLRTLKKRLKEAEEEQYRAEEDAAALRAELNSLQQQAM  113 (388)
Q Consensus        70 reDl~AQLR~~kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq~~  113 (388)
                      .+.....++.+++++++++.+..+++...+++.+.++-|..-..
T Consensus        66 ~~~~~~~~~~l~~~l~~l~~~~~~~~~~~~~~~~~~~~l~~~~~  109 (525)
T TIGR02231        66 SRPDPERLAELRKQIRELEAELRDLEDRGDALKALAKFLEDIRE  109 (525)
T ss_pred             CcCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence            35567789999999999999999999999999999988877653


No 87 
>PF04012 PspA_IM30:  PspA/IM30 family;  InterPro: IPR007157 This family includes PspA a protein that suppresses sigma54-dependent transcription. The PspA protein, a negative regulator of the Escherichia coli phage shock psp operon, is produced when virulence factors are exported through secretins in many Gram-negative pathogenic bacteria and its homologue in plants, VIPP1, plays a critical role in thylakoid biogenesis, essential for photosynthesis. Activation of transcription by the enhancer-dependent bacterial sigma54-containing RNA polymerase occurs through ATP hydrolysis-driven protein conformational changes enabled by activator proteins that belong to the large AAA(+) mechanochemical protein family. It has been shown that PspA directly and specifically acts upon and binds to the AAA(+) domain of the PspF transcription activator [].
Probab=58.47  E-value=1.5e+02  Score=27.19  Aligned_cols=137  Identities=19%  Similarity=0.250  Sum_probs=63.3

Q ss_pred             HHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhh-hhHHHh-----------------h
Q 047021            9 FQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEK-DAAVVA-----------------R   70 (388)
Q Consensus         9 lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEk-DaA~va-----------------r   70 (388)
                      |..-..+++.....+..-+.+.......+.++++..+....---.+|..-+.-.+ |.|..|                 -
T Consensus        28 l~q~ird~e~~l~~a~~~~a~~~a~~~~le~~~~~~~~~~~~~~~~A~~Al~~g~edLAr~al~~k~~~e~~~~~l~~~~  107 (221)
T PF04012_consen   28 LEQAIRDMEEQLRKARQALARVMANQKRLERKLDEAEEEAEKWEKQAELALAAGREDLAREALQRKADLEEQAERLEQQL  107 (221)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            3444455555555555555555555555666555555444433333321111111 111111                 2


Q ss_pred             hhHHHHHHHHHHHhHHHHHHHhhhHhhHHHHHHHHHHHHHHHhcCC-CCCCCCCCCChhhHHHHHHHHHHHHHHHHH
Q 047021           71 EDLSAQLRTLKKRLKEAEEEQYRAEEDAAALRAELNSLQQQAMDGS-LGGISSMGISPDHVQTLEKELATLKSQLQQ  146 (388)
Q Consensus        71 eDl~AQLR~~kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq~~~~~-~s~~~~~g~s~~~~~~lekEl~~l~~~lqq  146 (388)
                      +.+..++-.++..+.+.+..-....--...|.+..+.-+.+...+. .+|+ ++|.+.+.+..|+..+....++..-
T Consensus       108 ~~~~~~~~~l~~~l~~l~~kl~e~k~k~~~l~ar~~~a~a~~~~~~~~~~~-~~~~a~~~~er~e~ki~~~ea~a~a  183 (221)
T PF04012_consen  108 DQAEAQVEKLKEQLEELEAKLEELKSKREELKARENAAKAQKKVNEALASF-SVSSAMDSFERMEEKIEEMEARAEA  183 (221)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccC-CccchHHHHHHHHHHHHHHHHHHHH
Confidence            2233333333334444333332222223334444444444443333 3344 3677778888888888866555443


No 88 
>PF14197 Cep57_CLD_2:  Centrosome localisation domain of PPC89 
Probab=58.42  E-value=45  Score=26.80  Aligned_cols=39  Identities=23%  Similarity=0.273  Sum_probs=24.5

Q ss_pred             HHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHh
Q 047021           30 KEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVA   69 (388)
Q Consensus        30 k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~va   69 (388)
                      .+.++-+|+.+|+.+.++.- .-.-.++.++-|||.++..
T Consensus         3 Lea~~~~Lr~rLd~~~rk~~-~~~~~~k~L~~ERd~~~~~   41 (69)
T PF14197_consen    3 LEAEIATLRNRLDSLTRKNS-VHEIENKRLRRERDSAERQ   41 (69)
T ss_pred             HHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHH
Confidence            45566677777776666543 2235566667788887654


No 89 
>COG4942 Membrane-bound metallopeptidase [Cell division and chromosome partitioning]
Probab=58.29  E-value=2.6e+02  Score=29.85  Aligned_cols=105  Identities=22%  Similarity=0.370  Sum_probs=58.6

Q ss_pred             HhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhHHHHHH
Q 047021           11 KKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRLKEAEEE   90 (388)
Q Consensus        11 ~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRlkEAEEE   90 (388)
                      ++......+...+.++|....-+...|++.|.+++.. +.++..+|.....+.+       .+..++..+-+++.-.+..
T Consensus        38 ~~l~q~q~ei~~~~~~i~~~~~~~~kL~~~lk~~e~~-i~~~~~ql~~s~~~l~-------~~~~~I~~~~~~l~~l~~q  109 (420)
T COG4942          38 KQLKQIQKEIAALEKKIREQQDQRAKLEKQLKSLETE-IASLEAQLIETADDLK-------KLRKQIADLNARLNALEVQ  109 (420)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHhHHH-------HHHhhHHHHHHHHHHHHHH
Confidence            4455555666666666666667777777777776643 4556666655444433       3334444444444433333


Q ss_pred             HhhhHhhHHHHHHHHHHHHHHHhcCCCCCCCCCCCChhhH
Q 047021           91 QYRAEEDAAALRAELNSLQQQAMDGSLGGISSMGISPDHV  130 (388)
Q Consensus        91 QyRAEEDaAaLRAEln~lQqq~~~~~~s~~~~~g~s~~~~  130 (388)
                      .   .+....|-..|+++|+.- -|++   +...+++++.
T Consensus       110 ~---r~qr~~La~~L~A~~r~g-~~p~---~~ll~~~eda  142 (420)
T COG4942         110 E---REQRRRLAEQLAALQRSG-RNPP---PALLVSPEDA  142 (420)
T ss_pred             H---HHHHHHHHHHHHHHHhcc-CCCC---chhhcChhhh
Confidence            3   444556667777777643 2221   2456777773


No 90 
>TIGR00219 mreC rod shape-determining protein MreC. MreC (murein formation C) is involved in the rod shape determination in E. coli, and more generally in cell shape determination of bacteria whether or not they are rod-shaped. Cells defective in MreC are round. Species with MreC include many of the Proteobacteria, Gram-positives, and spirochetes.
Probab=58.04  E-value=32  Score=33.73  Aligned_cols=52  Identities=19%  Similarity=0.219  Sum_probs=38.5

Q ss_pred             HHHhhhhhHHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 047021          229 LMEIDNQSSEIERLFEENSSLSSSYQESMGIAKHWENQVKDCLKQNEELRGILEK  283 (388)
Q Consensus       229 L~EID~QS~EIErLf~ENs~Lsss~qea~~iaaqWEnQvqDclkQNeeLr~lLek  283 (388)
                      ...+.........|..||..|..-+.   ...++|....++|...|++||.+|+-
T Consensus        58 ~~~~~~~~~~~~~l~~EN~~Lr~e~~---~l~~~~~~~~~~l~~EN~rLr~LL~~  109 (283)
T TIGR00219        58 FDGISENLKDVNNLEYENYKLRQELL---KKNQQLEILTQNLKQENVRLRELLNS  109 (283)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHhcC
Confidence            34455566677788899998885443   33567777777899999999998875


No 91 
>PF09755 DUF2046:  Uncharacterized conserved protein H4 (DUF2046);  InterPro: IPR019152  This is the conserved N-terminal 350 residues of a family of proteins of unknown function possibly containing a coiled-coil domain. 
Probab=56.32  E-value=2.4e+02  Score=28.95  Aligned_cols=91  Identities=26%  Similarity=0.297  Sum_probs=39.4

Q ss_pred             HHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhHH---HHHHH--hhhH
Q 047021           21 SILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRLKE---AEEEQ--YRAE   95 (388)
Q Consensus        21 ~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRlkE---AEEEQ--yRAE   95 (388)
                      +.|+.||...--++..|.+.|-..-...|.-|++-+.++.-|.    .+...=.-+||..|=.+.-   .|.|-  -|=-
T Consensus       109 n~L~rkl~qLr~EK~~lE~~Le~EqE~~V~kL~k~i~~Le~e~----~~~q~~le~Lr~EKVdlEn~LE~EQE~lvN~L~  184 (310)
T PF09755_consen  109 NDLSRKLNQLRQEKVELENQLEQEQEYLVNKLQKKIERLEKEK----SAKQEELERLRREKVDLENTLEQEQEALVNRLW  184 (310)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHH----HHhHHHHHHHHHHHHhHHHHHHHHHHHHHHHHH
Confidence            3444444444444444444444333344455555554432222    1111222255554444322   12111  1112


Q ss_pred             hhHHHHHHHHHHHHHHHhcC
Q 047021           96 EDAAALRAELNSLQQQAMDG  115 (388)
Q Consensus        96 EDaAaLRAEln~lQqq~~~~  115 (388)
                      -...-|.++-..||......
T Consensus       185 Kqm~~l~~eKr~Lq~~l~~~  204 (310)
T PF09755_consen  185 KQMDKLEAEKRRLQEKLEQP  204 (310)
T ss_pred             HHHHHHHHHHHHHHHHHccc
Confidence            23455677777788877653


No 92 
>PF13935 Ead_Ea22:  Ead/Ea22-like protein
Probab=56.08  E-value=67  Score=28.32  Aligned_cols=59  Identities=20%  Similarity=0.242  Sum_probs=47.5

Q ss_pred             HHHHHhhhhhHHHHHHhhh--hhhhhhhHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHH
Q 047021          227 KLLMEIDNQSSEIERLFEE--NSSLSSSYQESMGIAKHWENQV-KDCLKQNEELRGILEKLR  285 (388)
Q Consensus       227 KLL~EID~QS~EIErLf~E--Ns~Lsss~qea~~iaaqWEnQv-qDclkQNeeLr~lLeklR  285 (388)
                      -||.||+.-...|++++.+  |..++..+++++.--...+.-+ ..|..+++...++...++
T Consensus        71 ALLDElE~~~~~i~~~~~~~e~~~~a~~~~~l~~~Le~ae~~~~~~~~~~~~~~e~~~~~~~  132 (139)
T PF13935_consen   71 ALLDELERAQQRIAELEQECENEDIALDVQKLRVELEAAEKRIAAELAEQAEAYEGEIADYA  132 (139)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHH
Confidence            4899999999999999977  9999999999988666666655 677778777777666644


No 93 
>KOG0249 consensus LAR-interacting protein and related proteins [General function prediction only]
Probab=54.77  E-value=1.6e+02  Score=33.80  Aligned_cols=91  Identities=29%  Similarity=0.391  Sum_probs=63.5

Q ss_pred             HHHhHHhhhhHHHHHHhhHHHHHHhh-Hhhhh-----hhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhHHHHHH
Q 047021           17 EEKNSILSSKLQVKEVESESLQTRIN-ELEQN-----TVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRLKEAEEE   90 (388)
Q Consensus        17 e~~n~~L~sk~~~k~~~~~~L~~~Ln-d~e~n-----~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRlkEAEEE   90 (388)
                      ++...+|...++++..+..-..+++. ..+-|     ||+   .-|.+-.=|+-||.--++-|.-+++++||++-+..-+
T Consensus       162 eer~~kl~~~~qe~naeL~rarqreemneeh~~rlsdtvd---ErlqlhlkermaAle~kn~L~~e~~s~kk~l~~~~~~  238 (916)
T KOG0249|consen  162 EERTRKLEEQLEELNAELQRARQREKMNEEHNKRLSDTVD---ERLQLHLKERMAALEDKNRLEQELESVKKQLEEMRHD  238 (916)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccccccccH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            34455555555555555555544443 11111     332   3333333678888888999999999999999999999


Q ss_pred             HhhhHhhHHHHHHHHHHHHH
Q 047021           91 QYRAEEDAAALRAELNSLQQ  110 (388)
Q Consensus        91 QyRAEEDaAaLRAEln~lQq  110 (388)
                      --|--.|--.||.|++.|-.
T Consensus       239 k~rl~~d~E~Lr~e~~qL~~  258 (916)
T KOG0249|consen  239 KDKLRTDIEDLRGELDQLRR  258 (916)
T ss_pred             HHHHhhhHHHHHHHHHHHHH
Confidence            88888999999999999874


No 94 
>PF12329 TMF_DNA_bd:  TATA element modulatory factor 1 DNA binding;  InterPro: IPR022092  This is the middle region of a family of TATA element modulatory factor 1 proteins conserved in eukaryotes that contains at its N-terminal section a number of leucine zippers that could potentially form coiled coil structures. The whole proteins bind to the TATA element of some RNA polymerase II promoters and repress their activity. by competing with the binding of TATA binding protein. TMFs are evolutionarily conserved golgins that bind Rab6, a ubiquitous ras-like GTP-binding Golgi protein, and contribute to Golgi organisation in animal [] and plant [] cells. 
Probab=54.74  E-value=27  Score=28.18  Aligned_cols=38  Identities=37%  Similarity=0.442  Sum_probs=24.2

Q ss_pred             hhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhh
Q 047021            5 QLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRIN   42 (388)
Q Consensus         5 ~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Ln   42 (388)
                      .|+.|..++.++|.....+..+++..+.+.+.|..+|.
T Consensus        34 ~IKKLr~~~~e~e~~~~~l~~~~~~~e~~~~~l~~~l~   71 (74)
T PF12329_consen   34 TIKKLRAKIKELEKQIKELKKKLEELEKELESLEERLK   71 (74)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence            35566666666666666666666666666666666554


No 95 
>PF04859 DUF641:  Plant protein of unknown function (DUF641);  InterPro: IPR006943 This conserved region is found in a number of plant proteins of unknown function.
Probab=54.72  E-value=28  Score=31.33  Aligned_cols=46  Identities=22%  Similarity=0.338  Sum_probs=41.4

Q ss_pred             hhhhhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhh
Q 047021            2 LLSQLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQN   47 (388)
Q Consensus         2 l~~~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n   47 (388)
                      |.+++++++..+...|.-.++|.+++..|+.++..|...|.++...
T Consensus        78 l~a~~~e~qsli~~yE~~~~kLe~e~~~Kdsei~~Lr~~L~~~~~~  123 (131)
T PF04859_consen   78 LAAEIQEQQSLIKTYEIVVKKLEAELRAKDSEIDRLREKLDELNRA  123 (131)
T ss_pred             cccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            5678899999999999999999999999999999999999987653


No 96 
>PF15294 Leu_zip:  Leucine zipper
Probab=54.17  E-value=1.7e+02  Score=29.47  Aligned_cols=124  Identities=27%  Similarity=0.322  Sum_probs=83.1

Q ss_pred             hhHHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCCccccccccCCCCCCcccc-
Q 047021          235 QSSEIERLFEENSSLSSSYQESMGIAKHWENQVKDCLKQNEELRGILEKLRTEQASFMSVNDKDILRDFSEPSKDAVSQ-  313 (388)
Q Consensus       235 QS~EIErLf~ENs~Lsss~qea~~iaaqWEnQvqDclkQNeeLr~lLeklR~EQa~~~~~~~~~~~g~~~e~~~dg~~~-  313 (388)
                      -+-||.||=+||..|.+-++       .-|.++-.|+.....|..-|.++|..|++.--..             ++... 
T Consensus       130 l~kEi~rLq~EN~kLk~rl~-------~le~~at~~l~Ek~kl~~~L~~lq~~~~~~~~k~-------------~~~~~~  189 (278)
T PF15294_consen  130 LNKEIDRLQEENEKLKERLK-------SLEKQATSALDEKSKLEAQLKELQDEQGDQKGKK-------------DLSFKA  189 (278)
T ss_pred             HHHHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhccc-------------cccccc
Confidence            56799999999999887665       3577888999999999999999999887654221             01111 


Q ss_pred             CCCchhhhhHHhhHHHHHHHhhHHHHhHHHHHHHHHHHHHHHHHhhhhHHHHHHHHhhHHHHHHHhhhcccc
Q 047021          314 NGSETHKNEFLALKGQLAKEQSRAEALSAQVLQLSAQLQQTTQAYNGLVRLYKPVLRNIESSLIKMKQDGSV  385 (388)
Q Consensus       314 ~~~~~~~te~l~LK~eL~kEqsRseaLsaqv~qLsAeL~ra~QA~n~L~Rly~PvL~~IEsrLikmkq~~~~  385 (388)
                      ........-+..+|.+|-+.   ...++.+.-.|.-.|.-|.+-   |-|.-.| |+.+|+-|-+.-|.+++
T Consensus       190 q~l~dLE~k~a~lK~e~ek~---~~d~~~~~k~L~e~L~~~Khe---lL~~Qeq-L~~aekeLekKfqqT~a  254 (278)
T PF15294_consen  190 QDLSDLENKMAALKSELEKA---LQDKESQQKALEETLQSCKHE---LLRVQEQ-LSLAEKELEKKFQQTAA  254 (278)
T ss_pred             cchhhHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHH---HHhcchh-hhcchhhHHHHhCccHH
Confidence            00111111255566666665   334445666777777777654   5566666 88888888887776653


No 97 
>KOG4674 consensus Uncharacterized conserved coiled-coil protein [Function unknown]
Probab=54.10  E-value=5.5e+02  Score=32.33  Aligned_cols=131  Identities=23%  Similarity=0.246  Sum_probs=85.0

Q ss_pred             HHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHH-hh--h---hhhhhhhHHH-hhhhHHHHHHHH
Q 047021            8 DFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKA-LK--D---VAMEKDAAVV-AREDLSAQLRTL   80 (388)
Q Consensus         8 ~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrka-L~--~---v~mEkDaA~v-areDl~AQLR~~   80 (388)
                      .+++.+..+-+....|.+.+...+..+++|++..+-+...++++.-.. ++  +   ..++.++++- .=.|+-..|+..
T Consensus       569 ~~k~~~~~a~e~i~~L~~~l~e~~~~i~sLl~erd~y~e~l~~~e~~~~~k~nss~~~~t~~~~~~e~~l~qLe~~le~~  648 (1822)
T KOG4674|consen  569 ILKETINEASEKIAELEKELEEQEQRIESLLTERDMYKELLAELEDSHQLKPNSSALDQTEAPRAKEKRLRQLENELESY  648 (1822)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhcccccccCCCCchhhcccccchhHHHHHHHHHHHHHHH
Confidence            455566666666666777777777777777777777766666655551 11  1   2456666522 234555555543


Q ss_pred             HHHhHHHHHHHhhhHhhHHHHHHHHHHHHHHHhcCCCCCCCCCCCChhhHHHHHHHHHHHHHHHH
Q 047021           81 KKRLKEAEEEQYRAEEDAAALRAELNSLQQQAMDGSLGGISSMGISPDHVQTLEKELATLKSQLQ  145 (388)
Q Consensus        81 kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq~~~~~~s~~~~~g~s~~~~~~lekEl~~l~~~lq  145 (388)
                      +   .|.-+-....+||-..|+.++-.|.-+...-.+    ..+.+.+.+.-|++.+..++.++-
T Consensus       649 ~---~E~~~~~~~l~e~~~~l~~ev~~ir~~l~k~~~----~~~fA~ekle~L~~~ie~~K~e~~  706 (1822)
T KOG4674|consen  649 K---KEKRENLKKLQEDFDSLQKEVTAIRSQLEKLKN----ELNLAKEKLENLEKNLELTKEEVE  706 (1822)
T ss_pred             H---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHH----HHHHHHHHHHHHHHHHHHHHHHHH
Confidence            3   355666777889999999999999877733221    344577888888888887777765


No 98 
>PF12709 Kinetocho_Slk19:  Central kinetochore-associated;  InterPro: IPR024312 This is a family of proteins integrally involved in the central kinetochore. Slk19 is a yeast member and it may play an important role in the timing of nuclear migration. It may also participate, directly or indirectly, in the maintenance of centromeric tensile strength during mitotic stagnation, for instance during activation of checkpoint controls, when cells need to preserve nuclear integrity until cell cycle progression can be resumed [].
Probab=52.85  E-value=28  Score=29.62  Aligned_cols=38  Identities=21%  Similarity=0.399  Sum_probs=32.2

Q ss_pred             hhhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHh
Q 047021            4 SQLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRI   41 (388)
Q Consensus         4 ~~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~L   41 (388)
                      .+|.+|+..+..+.++|..|..++++.-.+...|.+=|
T Consensus        49 k~v~~L~~e~~~l~~E~e~L~~~l~~e~~Ek~~Ll~ll   86 (87)
T PF12709_consen   49 KKVDELENENKALKRENEQLKKKLDTEREEKQELLKLL   86 (87)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence            57888999999999999999999999888888877644


No 99 
>PF06156 DUF972:  Protein of unknown function (DUF972);  InterPro: IPR010377 FUNCTION: Involved in initiation control of chromosome replication. SUBUNIT: Interacts with both DnaA and DnaN, acting as a bridge between these two proteins. SIMILARITY: Belongs to the YabA family.
Probab=52.74  E-value=79  Score=27.35  Aligned_cols=53  Identities=32%  Similarity=0.501  Sum_probs=43.6

Q ss_pred             HHHHHHHHHHhhHHHHHHHhhhhhHHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q 047021          213 DMAVVVERLESSRQKLLMEIDNQSSEIERLFEENSSLSSSYQESMGIAKHWENQVKDCLKQNEELRGILEKLRT  286 (388)
Q Consensus       213 dma~~VeRLE~SRqKLL~EID~QS~EIErLf~ENs~Lsss~qea~~iaaqWEnQvqDclkQNeeLr~lLeklR~  286 (388)
                      ++-..|.+||..=..|+.+|..=...|-.|.+||..|.                     ..|..||..|.++..
T Consensus         5 ~l~~~l~~le~~l~~l~~~~~~LK~~~~~l~EEN~~L~---------------------~EN~~Lr~~l~~~~~   57 (107)
T PF06156_consen    5 ELFDRLDQLEQQLGQLLEELEELKKQLQELLEENARLR---------------------IENEHLRERLEELEQ   57 (107)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---------------------HHHHHHHHHHHHHhc
Confidence            55666888888888888888888889999999999885                     367889998888765


No 100
>PF13870 DUF4201:  Domain of unknown function (DUF4201)
Probab=52.65  E-value=1.7e+02  Score=26.17  Aligned_cols=90  Identities=24%  Similarity=0.381  Sum_probs=59.7

Q ss_pred             HHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhHHHHHHHhhh
Q 047021           15 ELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRLKEAEEEQYRA   94 (388)
Q Consensus        15 ~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRlkEAEEEQyRA   94 (388)
                      .|.-+|..|..+|+++..+.-.|...... .-.+....++-|..+.-+       -..+-..|...+..+....++=|++
T Consensus        46 qLkien~~l~~kIeERn~eL~~Lk~~~~~-~v~~L~h~keKl~~~~~~-------~~~l~~~l~~~~~~~~~~r~~l~~~  117 (177)
T PF13870_consen   46 QLKIENQQLNEKIEERNKELLKLKKKIGK-TVQILTHVKEKLHFLSEE-------LERLKQELKDREEELAKLREELYRV  117 (177)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            35557777777777776665555443321 112224455555444333       2345567777888888888899999


Q ss_pred             HhhHHHHHHHHHHHHHHH
Q 047021           95 EEDAAALRAELNSLQQQA  112 (388)
Q Consensus        95 EEDaAaLRAEln~lQqq~  112 (388)
                      ..+-..+|...+.|+++.
T Consensus       118 k~~r~k~~~~~~~l~~~~  135 (177)
T PF13870_consen  118 KKERDKLRKQNKKLRQQG  135 (177)
T ss_pred             HHHHHHHHHHHHHHHHhc
Confidence            999999999999998876


No 101
>PF08614 ATG16:  Autophagy protein 16 (ATG16);  InterPro: IPR013923 Macroautophagy is a bulk degradation process induced by starvation in eukaryotic cells. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. No molecule involved in autophagy has yet been identified in higher eukaryotes []. The pre-autophagosomal structure contains at least five Apg proteins: Apg1p, Apg2p, Apg5p, Aut7p/Apg8p and Apg16p. It is found in the vacuole []. The C-terminal glycine of Apg12p is conjugated to a lysine residue of Apg5p via an isopeptide bond. During autophagy, cytoplasmic components are enclosed in autophagosomes and delivered to lysosomes/vacuoles. Auotphagy protein 16 (Apg16) has been shown to be bind to Apg5 and is required for the function of the Apg12p-Apg5p conjugate []. Autophagy protein 5 (Apg5) is directly required for the import of aminopeptidase I via the cytoplasm-to-vacuole targeting pathway []. This entry represents auotphagy protein 16 (Apg16), which is required for the function of the Apg12p-Apg5p conjugate.; PDB: 3A7O_D 3A7P_B.
Probab=52.59  E-value=1.4e+02  Score=27.31  Aligned_cols=37  Identities=32%  Similarity=0.404  Sum_probs=16.8

Q ss_pred             hHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhh
Q 047021            6 LKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRIN   42 (388)
Q Consensus         6 v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Ln   42 (388)
                      |..|+..++.|+.+...|...|.++...++.|.+++.
T Consensus       118 l~~l~~~~~~L~~~~~~l~~~l~ek~k~~e~l~DE~~  154 (194)
T PF08614_consen  118 LAELEAELAQLEEKIKDLEEELKEKNKANEILQDELQ  154 (194)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            3444444444444444444444444444444444444


No 102
>KOG0612 consensus Rho-associated, coiled-coil containing protein kinase [Signal transduction mechanisms]
Probab=51.70  E-value=5.2e+02  Score=31.40  Aligned_cols=81  Identities=17%  Similarity=0.283  Sum_probs=42.5

Q ss_pred             HhhhhHHHHHHhhHHHHHHhhHhhhhhh-hhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhHHHHHHHhhh---Hhh
Q 047021           22 ILSSKLQVKEVESESLQTRINELEQNTV-PSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRLKEAEEEQYRA---EED   97 (388)
Q Consensus        22 ~L~sk~~~k~~~~~~L~~~Lnd~e~n~v-~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRlkEAEEEQyRA---EED   97 (388)
                      ++..++++....+.+-...|++-+.++. .+.......+..++.    -+.++.+++|..+..|+.+...--.+   .+-
T Consensus       465 ~~~keL~e~i~~lk~~~~el~~~q~~l~q~~~ke~~ek~~~~~~----~~~~l~~~~~~~~eele~~q~~~~~~~~~~~k  540 (1317)
T KOG0612|consen  465 EMDKELEETIEKLKSEESELQREQKALLQHEQKEVEEKLSEEEA----KKRKLEALVRQLEEELEDAQKKNDNAADSLEK  540 (1317)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence            3444444444444444455554333333 233333323333332    27899999999999999994433333   344


Q ss_pred             HHHHHHHHH
Q 047021           98 AAALRAELN  106 (388)
Q Consensus        98 aAaLRAEln  106 (388)
                      ..++|.+|.
T Consensus       541 v~~~rk~le  549 (1317)
T KOG0612|consen  541 VNSLRKQLE  549 (1317)
T ss_pred             HHHHHHHHH
Confidence            444444443


No 103
>PHA02562 46 endonuclease subunit; Provisional
Probab=51.51  E-value=2.9e+02  Score=28.38  Aligned_cols=146  Identities=9%  Similarity=0.189  Sum_probs=68.7

Q ss_pred             hhhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhh----HhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHH
Q 047021            4 SQLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRIN----ELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRT   79 (388)
Q Consensus         4 ~~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Ln----d~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~   79 (388)
                      .+++++++.+..+..+...+..++.........+....+    +++. ....+....+.+--+.+...-.-.++-..+-.
T Consensus       174 ~~~~e~~~~i~~l~~~i~~l~~~i~~~~~~i~~~~~~~~~~i~~l~~-e~~~l~~~~~~l~~~l~~l~~~i~~l~~~i~~  252 (562)
T PHA02562        174 DKIRELNQQIQTLDMKIDHIQQQIKTYNKNIEEQRKKNGENIARKQN-KYDELVEEAKTIKAEIEELTDELLNLVMDIED  252 (562)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhccccc
Confidence            355667777777777777776666644433333322221    1110 11111222222211111111111111111111


Q ss_pred             HHHHhHHHHHHHhhhHhhHHHHHHHHHHHHHHHh----cCCCC-CCCCCCCChhhHHHHHHHHHHHHHHHHHHHHH
Q 047021           80 LKKRLKEAEEEQYRAEEDAAALRAELNSLQQQAM----DGSLG-GISSMGISPDHVQTLEKELATLKSQLQQESQL  150 (388)
Q Consensus        80 ~kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq~~----~~~~s-~~~~~g~s~~~~~~lekEl~~l~~~lqqes~l  150 (388)
                      ...-+++++.+--.++.+-..+...++-+...-.    ..+.. +...++.-.+.+..++.++..+...+......
T Consensus       253 ~~~~L~~l~~~~~~~~~~l~~~~~~~~~~~~~~~Cp~C~~~~~~~~~~~~~l~d~i~~l~~~l~~l~~~i~~~~~~  328 (562)
T PHA02562        253 PSAALNKLNTAAAKIKSKIEQFQKVIKMYEKGGVCPTCTQQISEGPDRITKIKDKLKELQHSLEKLDTAIDELEEI  328 (562)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCCCCcCCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            2223455566666666666777777666642000    00111 12244566778888999999888888754443


No 104
>PRK10929 putative mechanosensitive channel protein; Provisional
Probab=50.71  E-value=5e+02  Score=30.89  Aligned_cols=116  Identities=18%  Similarity=0.266  Sum_probs=59.0

Q ss_pred             hhhHHHHHHHHHHHhHHHHHHHhhhHhhH--------------HHHHHHHHHHHHHHhcCCCCCCCCCCCChhhHHHHHH
Q 047021           70 REDLSAQLRTLKKRLKEAEEEQYRAEEDA--------------AALRAELNSLQQQAMDGSLGGISSMGISPDHVQTLEK  135 (388)
Q Consensus        70 reDl~AQLR~~kkRlkEAEEEQyRAEEDa--------------AaLRAEln~lQqq~~~~~~s~~~~~g~s~~~~~~lek  135 (388)
                      -.||-.++-...-.|.+..++....-...              ...|..+..+..+.-+....+- +  .++.+...++-
T Consensus       104 ~~~Leq~l~~~~~~L~~~q~~l~~~~~~~~~~~~~l~~~pq~~~~~~~~l~~i~~~L~~~~~~~~-~--l~~a~~~~lqa  180 (1109)
T PRK10929        104 TDALEQEILQVSSQLLEKSRQAQQEQDRAREISDSLSQLPQQQTEARRQLNEIERRLQTLGTPNT-P--LAQAQLTALQA  180 (1109)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHhhhhHHHHHHHhhchhhHHHHHHHHHHHHHHHhCCCCCCC-c--ccHHHHHHHHH
Confidence            35566666666666666555544433333              4566677777766633222222 2  45556666777


Q ss_pred             HHHHHHHHHHHHHH--HHHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHHhhhchh
Q 047021          136 ELATLKSQLQQESQ--LRQQERKQLT-----EEQARISVLMSEKQELEEKLVMFSRKASE  188 (388)
Q Consensus       136 El~~l~~~lqqes~--lrqqeqqrla-----~Eq~~~a~l~aEkqeLeekl~~~~~~~se  188 (388)
                      |.+-++.+......  +-...++-|.     --..++..+..+.+.|.+.+..-..+-++
T Consensus       181 e~~~l~~~~~~l~~~l~s~~~~~~L~~~q~dl~~~~~~~l~~~~~~Lq~~in~kR~~~se  240 (1109)
T PRK10929        181 ESAALKALVDELELAQLSANNRQELARLRSELAKKRSQQLDAYLQALRNQLNSQRQREAE  240 (1109)
T ss_pred             HHHHHHHHHHHHHHHHhccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            77666665443222  1112222222     22334455666666666666654444443


No 105
>PRK10246 exonuclease subunit SbcC; Provisional
Probab=50.58  E-value=4.5e+02  Score=30.35  Aligned_cols=25  Identities=12%  Similarity=0.197  Sum_probs=19.9

Q ss_pred             cchhhHHHHHHHHhHHHHHHHHHHh
Q 047021          199 FSVEDKEKLEKQLHDMAVVVERLES  223 (388)
Q Consensus       199 ~S~eDkekle~Ql~dma~~VeRLE~  223 (388)
                      ++..+.+.++..+.+.-..+..+..
T Consensus       774 ~~~~~~~~l~~~i~~~~~~~~~~~~  798 (1047)
T PRK10246        774 LDEETLTQLEQLKQNLENQRQQAQT  798 (1047)
T ss_pred             CCHHHHHHHHHHHHHHHHHHHHHHH
Confidence            4668888888888888887777766


No 106
>cd07429 Cby_like Chibby, a nuclear inhibitor of Wnt/beta-catenin mediated transcription, and similar proteins. Chibby(Cby) is a well-conserved nuclear protein that functions as part of the Wnt/beta-catenin signaling pathway. Specifically, Cby binds directly to beta-catenin by interacting with its central region, which harbors armadillo repeats. Cby-beta-catenin interactions may also involve 14-3-3 proteins. By competing with other binding partners of beta-catenin, the Tcf/Lef transcription factors, Cby inhibits transcriptional activation. Cby has been shown to play a role in adipocyte differentiation. The C-terminal region of Cby appears to contain an alpha-helical coiled-coil motif.
Probab=50.36  E-value=17  Score=31.89  Aligned_cols=25  Identities=40%  Similarity=0.658  Sum_probs=21.8

Q ss_pred             hhHHHHHhhHHHHHHhHHhhhhHHH
Q 047021            5 QLKDFQKKNAELEEKNSILSSKLQV   29 (388)
Q Consensus         5 ~v~~lq~~~~~le~~n~~L~sk~~~   29 (388)
                      .+..|.++|..|++||++|.-||+-
T Consensus        73 e~~rlkkk~~~LeEENNlLklKiev   97 (108)
T cd07429          73 EVLRLKKKNQQLEEENNLLKLKIEV   97 (108)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            4667889999999999999998864


No 107
>PF05667 DUF812:  Protein of unknown function (DUF812);  InterPro: IPR008530 This family consists of several eukaryotic proteins of unknown function.
Probab=49.89  E-value=3.9e+02  Score=29.43  Aligned_cols=139  Identities=24%  Similarity=0.343  Sum_probs=83.2

Q ss_pred             hhhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHH
Q 047021            4 SQLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKR   83 (388)
Q Consensus         4 ~~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkR   83 (388)
                      .+|.+|......++.+.+++...+.....+..........++++..  +++-+.++--+.|..+   +.|-+.+-....|
T Consensus       335 ~~l~~l~~~i~~~~~~~~~l~~~~~q~~~e~~~~~~~~~~le~~~~--l~~k~~~lL~d~e~ni---~kL~~~v~~s~~r  409 (594)
T PF05667_consen  335 EQLDELESQIEELEAEIKMLKSSLKQLEEELEEKEAENEELEEELK--LKKKTVELLPDAEENI---AKLQALVEASEQR  409 (594)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHHHHHHhcCcHHHH---HHHHHHHHHHHHH
Confidence            4566666666666777777777777666666666666666666554  5555555555555555   4455555555555


Q ss_pred             hHHHHH--HHhhhHhhHHHHHHHHHHHHHHHhcCCCCCCCCCCCChhhHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 047021           84 LKEAEE--EQYRAEEDAAALRAELNSLQQQAMDGSLGGISSMGISPDHVQTLEKELATLKSQLQQESQLRQQERK  156 (388)
Q Consensus        84 lkEAEE--EQyRAEEDaAaLRAEln~lQqq~~~~~~s~~~~~g~s~~~~~~lekEl~~l~~~lqqes~lrqqeqq  156 (388)
                      +.+-..  |.||+     -|..|+..|....... -+....   -.+.++.+..++..+..+++.-..+..+...
T Consensus       410 l~~L~~qWe~~R~-----pL~~e~r~lk~~~~~~-~~e~~~---~~~~ik~~r~~~k~~~~e~~~Kee~~~qL~~  475 (594)
T PF05667_consen  410 LVELAQQWEKHRA-----PLIEEYRRLKEKASNR-ESESKQ---KLQEIKELREEIKEIEEEIRQKEELYKQLVK  475 (594)
T ss_pred             HHHHHHHHHHHHh-----HHHHHHHHHHHHHhhc-chHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            554332  44554     5677777777666322 222211   3466777888888888777776555554333


No 108
>PF00261 Tropomyosin:  Tropomyosin;  InterPro: IPR000533 Tropomyosins [], are a family of closely related proteins present in muscle and non-muscle cells. In striated muscle, tropomyosin mediate the interactions between the troponin complex and actin so as to regulate muscle contraction []. The role of tropomyosin in smooth muscle and non-muscle tissues is not clear. Tropomyosin is an alpha-helical protein that forms a coiled-coil structure of 2 parallel helices containing 2 sets of 7 alternating actin binding sites []. There are multiple cell-specific isoforms, created by differential splicing of the messenger RNA from one gene, but the proportions of the isoforms vary between different cell types. Muscle isoforms of tropomyosin are characterised by having 284 amino acid residues and a highly conserved N-terminal region, whereas non-muscle forms are generally smaller and are heterogeneous in their N-terminal region. This entry represents tropomyosin (Tmp) 1, 2 and 3. Within the yeast Tmp1 and Tmp2, biochemical and sequence analyses indicate that Tpm2 spans four actin monomers along a filament, whereas Tpm1 spans five. Despite its shorter length, Tpm2 can compete with Tpm1 for binding to F-actin. Over-expression of Tpm2 in vivo alters the axial budding of haploids to a bipolar pattern, and this can be partially suppressed by co-over-expression of Tpm1. This suggests distinct functions for the two tropomyosins, and indicates that the ratio between them is important for correct morphogenesis [].; PDB: 2EFR_A 2Z5H_C 2Z5I_D 2D3E_B 2EFS_D 3U59_B 1C1G_C 1IHQ_A 3AZD_B 1MV4_B ....
Probab=49.47  E-value=2.3e+02  Score=26.73  Aligned_cols=103  Identities=23%  Similarity=0.265  Sum_probs=73.7

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh--chhhhhhhhhhhhcchh-hHHHHHHHHhHHHHHHHHHHhhHHHHHH
Q 047021          154 ERKQLTEEQARISVLMSEKQELEEKLVMFSRK--ASEEVSNAAASKAFSVE-DKEKLEKQLHDMAVVVERLESSRQKLLM  230 (388)
Q Consensus       154 eqqrla~Eq~~~a~l~aEkqeLeekl~~~~~~--~se~~~~~a~~k~~S~e-Dkekle~Ql~dma~~VeRLE~SRqKLL~  230 (388)
                      -...|..--.+...+.+...+|++.|..+...  ..+...+.++.+--..+ .-..|..+|.+.-.-.+-.|.+=.+|=.
T Consensus       125 ~E~~Le~aEeR~e~~E~ki~eLE~el~~~~~~lk~lE~~~~~~~~re~~~e~~i~~L~~~lkeaE~Rae~aE~~v~~Le~  204 (237)
T PF00261_consen  125 LEQELERAEERAEAAESKIKELEEELKSVGNNLKSLEASEEKASEREDEYEEKIRDLEEKLKEAENRAEFAERRVKKLEK  204 (237)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHhhhchhHHHHHHHHHHHHHHHHHhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            44556677778889999999999999998865  33444444444333333 3567788888887778888888888888


Q ss_pred             HhhhhhHHHHHHhhhhhhhhhhHHHH
Q 047021          231 EIDNQSSEIERLFEENSSLSSSYQES  256 (388)
Q Consensus       231 EID~QS~EIErLf~ENs~Lsss~qea  256 (388)
                      +||.--.+|+..=..+..++..+-.+
T Consensus       205 ~id~le~eL~~~k~~~~~~~~eld~~  230 (237)
T PF00261_consen  205 EIDRLEDELEKEKEKYKKVQEELDQT  230 (237)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            88888888887777776666655444


No 109
>PF05622 HOOK:  HOOK protein;  InterPro: IPR008636 This family consists of several HOOK1, 2 and 3 proteins from different eukaryotic organisms. The different members of the Homo sapiens gene family are HOOK1, HOOK2 and HOOK3. Different domains have been identified in the three Homo sapiens HOOK proteins, and it was demonstrated that the highly conserved NH2-domain mediates attachment to microtubules, whereas the central coiled-coil motif mediates homodimerisation and the more divergent C-terminal domains are involved in binding to specific organelles (organelle-binding domains). It has been demonstrated that endogenous HOOK3 binds to Golgi membranes [], whereas both HOOK1 and HOOK2 are localised to discrete but unidentified cellular structures. In mice the Hook1 gene is predominantly expressed in the testis. Hook1 function is necessary for the correct positioning of microtubular structures within the haploid germ cell. Disruption of Hook1 function in mice causes abnormal sperm head shape and fragile attachment of the flagellum to the sperm head [].; GO: 0008017 microtubule binding, 0000226 microtubule cytoskeleton organization, 0005737 cytoplasm; PDB: 1WIX_A.
Probab=48.71  E-value=5.8  Score=42.89  Aligned_cols=100  Identities=22%  Similarity=0.333  Sum_probs=0.0

Q ss_pred             hHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhH--HHHHHHhhh-HhhHHHHHHHHHHHHHHHhcC---CCC---CCC
Q 047021           51 SLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRLK--EAEEEQYRA-EEDAAALRAELNSLQQQAMDG---SLG---GIS  121 (388)
Q Consensus        51 slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRlk--EAEEEQyRA-EEDaAaLRAEln~lQqq~~~~---~~s---~~~  121 (388)
                      .|++-..+...+-+.+..-||++- -||..-.|+.  |++=+-||. -+|...++.++..|.-+-+..   ...   -+.
T Consensus       278 ~L~q~~~eL~~~A~~a~~LrDElD-~lR~~a~r~~klE~~ve~YKkKLed~~~lk~qvk~Lee~N~~l~e~~~~LEeel~  356 (713)
T PF05622_consen  278 ELRQENEELQAEAREARALRDELD-ELREKADRADKLENEVEKYKKKLEDLEDLKRQVKELEEDNAVLLETKAMLEEELK  356 (713)
T ss_dssp             --------------------------------------------------------------------------------
T ss_pred             HHHHHHHHHHHHHHHHHHHhhhHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            344444444444444444466543 2443333332  444466764 367888888888887765322   000   000


Q ss_pred             CCCCChhhHHHHHHHHHHHHHHHHHHHHHH
Q 047021          122 SMGISPDHVQTLEKELATLKSQLQQESQLR  151 (388)
Q Consensus       122 ~~g~s~~~~~~lekEl~~l~~~lqqes~lr  151 (388)
                      .++---.++..+++++.+++..+-.+...+
T Consensus       357 ~~~~~~~qle~~k~qi~eLe~~l~~~~~~~  386 (713)
T PF05622_consen  357 KARALKSQLEEYKKQIQELEQKLSEESRRA  386 (713)
T ss_dssp             ------------------------------
T ss_pred             HhHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            111223445556666666666665554443


No 110
>KOG0964 consensus Structural maintenance of chromosome protein 3 (sister chromatid cohesion complex Cohesin, subunit SMC3) [Cell cycle control, cell division, chromosome partitioning]
Probab=48.41  E-value=5.5e+02  Score=30.74  Aligned_cols=169  Identities=21%  Similarity=0.308  Sum_probs=92.6

Q ss_pred             hhHHHHHHHHHHHhHH--HHHHHhhhHhh-----HHHHHHHHHHHHHHHhcCCCCCCCCCCCChhhHHHHHHHHHHHHHH
Q 047021           71 EDLSAQLRTLKKRLKE--AEEEQYRAEED-----AAALRAELNSLQQQAMDGSLGGISSMGISPDHVQTLEKELATLKSQ  143 (388)
Q Consensus        71 eDl~AQLR~~kkRlkE--AEEEQyRAEED-----aAaLRAEln~lQqq~~~~~~s~~~~~g~s~~~~~~lekEl~~l~~~  143 (388)
                      +|+.++++.+...|+-  +|-+|..|++-     --.|-=+++.||+|...|.-+--    .....++.++-+|.+-+-+
T Consensus       268 ~~~~~~i~ele~~l~~l~~ekeq~~a~~t~~~k~kt~lel~~kdlq~~i~~n~q~r~----~~l~~l~~~~~ki~e~~~E  343 (1200)
T KOG0964|consen  268 EDLKCEIKELENKLTNLREEKEQLKARETKISKKKTKLELKIKDLQDQITGNEQQRN----LALHVLQKVKDKIEEKKDE  343 (1200)
T ss_pred             HHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhhHHHHHHhhhhhhhhh----hHHHHHHHHHHHHHHHHHH
Confidence            3455555555544432  35566666644     33456678888888866543321    1223344455555544444


Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhhhhhhhhhhhcchhhHHH-HHHHHhHHHHHHHHHH
Q 047021          144 LQQESQLRQQERKQLTEEQARISVLMSEKQELEEKLVMFSRKASEEVSNAAASKAFSVEDKEK-LEKQLHDMAVVVERLE  222 (388)
Q Consensus       144 lqqes~lrqqeqqrla~Eq~~~a~l~aEkqeLeekl~~~~~~~se~~~~~a~~k~~S~eDkek-le~Ql~dma~~VeRLE  222 (388)
                      |..---..+--...-...-.+++.|....+.|=.|-       . ..+.|     -|+++|.+ +.++|.++...+--..
T Consensus       344 L~~I~Pky~~l~~ee~~~~~rl~~l~~~~~~l~~Kq-------g-r~sqF-----ssk~eRDkwir~ei~~l~~~i~~~k  410 (1200)
T KOG0964|consen  344 LSKIEPKYNSLVDEEKRLKKRLAKLEQKQRDLLAKQ-------G-RYSQF-----SSKEERDKWIRSEIEKLKRGINDTK  410 (1200)
T ss_pred             HHHhhhHHHHHHhHHHHHHHHHHHHHHHHHHHHHhh-------c-ccccc-----CcHHHHHHHHHHHHHHHHHHHhhhh
Confidence            433111111111111222333444444433332221       0 11122     25566665 6788888888888888


Q ss_pred             hhHHHHHHHhhhhhHHHHHHhhhhhhhhhhHHHH
Q 047021          223 SSRQKLLMEIDNQSSEIERLFEENSSLSSSYQES  256 (388)
Q Consensus       223 ~SRqKLL~EID~QS~EIErLf~ENs~Lsss~qea  256 (388)
                      ..++.|=+||+.=-.+.+..-++=..|.+|..+.
T Consensus       411 e~e~~lq~e~~~~e~~l~~~~e~i~~l~~si~e~  444 (1200)
T KOG0964|consen  411 EQENILQKEIEDLESELKEKLEEIKELESSINET  444 (1200)
T ss_pred             hHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHhhh
Confidence            8888888888888888888878777787777744


No 111
>PF01724 DUF29:  Domain of unknown function DUF29;  InterPro: IPR002636 This entry is represented by Ralstonia phage RSS1, Orf11. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of various hypothetical proteins from cyanobacteria, none of which are functionally described. The aligned region is approximately 120-140 amino acids long corresponding to almost the entire length of the proteins in the family.; PDB: 3FCN_A.
Probab=48.22  E-value=1.9e+02  Score=25.71  Aligned_cols=45  Identities=24%  Similarity=0.368  Sum_probs=29.3

Q ss_pred             HHHHHhhhhhHHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHH
Q 047021          227 KLLMEIDNQSSEIERLFEENSSLSSSYQESMGIAKHWENQVKDCLKQ  273 (388)
Q Consensus       227 KLL~EID~QS~EIErLf~ENs~Lsss~qea~~iaaqWEnQvqDclkQ  273 (388)
                      -=..+|+.+-.+|+++|.+|.+|..-+.++  +..-|.+-++++.+.
T Consensus        69 sW~~tI~~~R~~i~~~l~~sPSLk~~l~~~--l~~~Y~~A~~~a~~e  113 (139)
T PF01724_consen   69 SWRATIRNQRRQIEDLLEDSPSLKNYLEEI--LEEAYQDARKLAARE  113 (139)
T ss_dssp             HHHHHHHHHHHHHHHH----GGGGGG--HH--HHHHHHHH-HHHHHH
T ss_pred             hHHHHHHHHHHHHHHHhcccccHHHHHHHH--HHHHHHHHHHHHHHH
Confidence            345699999999999999999999988887  566676665555543


No 112
>COG1579 Zn-ribbon protein, possibly nucleic acid-binding [General function prediction only]
Probab=48.05  E-value=2.9e+02  Score=27.31  Aligned_cols=96  Identities=22%  Similarity=0.338  Sum_probs=55.0

Q ss_pred             hhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhHHHHHHHhhhHhhHHHHHHH
Q 047021           25 SKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRLKEAEEEQYRAEEDAAALRAE  104 (388)
Q Consensus        25 sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRlkEAEEEQyRAEEDaAaLRAE  104 (388)
                      -+|++-..+++++.-+.+        +.+++|+.+--|.+++-       .-+-.++..+..-+.+=-+=+.|-..+|+.
T Consensus        10 ~~iq~lD~e~~rl~~~~~--------~~~~~l~k~~~e~e~~~-------~~~~~~~~e~e~le~qv~~~e~ei~~~r~r   74 (239)
T COG1579          10 LAIQKLDLEKDRLEPRIK--------EIRKALKKAKAELEALN-------KALEALEIELEDLENQVSQLESEIQEIRER   74 (239)
T ss_pred             HHHHHHHHHHHHHHHhhh--------hhHHHHHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            345555555555554443        56777755555555443       223334444444444444556677777777


Q ss_pred             HHHHHHHHhcCCCCCCCCCCCChhhHHHHHHHHHHHHHHH
Q 047021          105 LNSLQQQAMDGSLGGISSMGISPDHVQTLEKELATLKSQL  144 (388)
Q Consensus       105 ln~lQqq~~~~~~s~~~~~g~s~~~~~~lekEl~~l~~~l  144 (388)
                      ++..+-..+         .+-...+..+|..|+..++.+.
T Consensus        75 ~~~~e~kl~---------~v~~~~e~~aL~~E~~~ak~r~  105 (239)
T COG1579          75 IKRAEEKLS---------AVKDERELRALNIEIQIAKERI  105 (239)
T ss_pred             HHHHHHHHh---------ccccHHHHHHHHHHHHHHHHHH
Confidence            777777662         2225677888888887655443


No 113
>PRK14154 heat shock protein GrpE; Provisional
Probab=48.02  E-value=93  Score=29.97  Aligned_cols=56  Identities=20%  Similarity=0.259  Sum_probs=42.4

Q ss_pred             HHhhHHHHHHHhhHHHHhHHHHHHHHHHHHHH------------HHHhhhhHHHHHHHHhhHHHHHHH
Q 047021          323 FLALKGQLAKEQSRAEALSAQVLQLSAQLQQT------------TQAYNGLVRLYKPVLRNIESSLIK  378 (388)
Q Consensus       323 ~l~LK~eL~kEqsRseaLsaqv~qLsAeL~ra------------~QA~n~L~Rly~PvL~~IEsrLik  378 (388)
                      +..|+.++.....+.+.|....+-+.|+..-.            .-|.-+|.+-+.||+-|+|.-|-.
T Consensus        54 ~~~l~~el~~le~e~~elkd~~lRl~ADfeNyRKR~~kE~e~~~~~a~e~~~~~LLpVlDnLeRAL~~  121 (208)
T PRK14154         54 REKLEGQLTRMERKVDEYKTQYLRAQAEMDNLRKRIEREKADIIKFGSKQLITDLLPVADSLIHGLES  121 (208)
T ss_pred             hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHhHHHHHHhc
Confidence            56677777777777777777777777776533            346778999999999999988754


No 114
>PF08172 CASP_C:  CASP C terminal;  InterPro: IPR012955 This domain is the C-terminal region of the CASP family of proteins. These are Golgi membrane proteins which are thought to have a role in vesicle transport [].; GO: 0006891 intra-Golgi vesicle-mediated transport, 0030173 integral to Golgi membrane
Probab=47.89  E-value=33  Score=33.38  Aligned_cols=42  Identities=33%  Similarity=0.341  Sum_probs=34.5

Q ss_pred             hhhhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHh
Q 047021            3 LSQLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINEL   44 (388)
Q Consensus         3 ~~~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~   44 (388)
                      ++|=.-+..||.+||++++++..++.....++++|+.+=-.|
T Consensus        85 tsQRDRFR~Rn~ELE~elr~~~~~~~~L~~Ev~~L~~DN~kL  126 (248)
T PF08172_consen   85 TSQRDRFRQRNAELEEELRKQQQTISSLRREVESLRADNVKL  126 (248)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            455567889999999999999999999888888888874433


No 115
>PRK04778 septation ring formation regulator EzrA; Provisional
Probab=47.84  E-value=3.8e+02  Score=28.67  Aligned_cols=260  Identities=20%  Similarity=0.248  Sum_probs=0.0

Q ss_pred             hhhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhH----hhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHH
Q 047021            4 SQLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINE----LEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRT   79 (388)
Q Consensus         4 ~~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd----~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~   79 (388)
                      .|+.+|..|+.+|..+.=.+.-  ...+.++..|.+++.+    +..-.+...+..++.|.-.=|..--.=+--..--..
T Consensus       230 ~ql~el~~gy~~m~~~gy~~~~--~~i~~~i~~l~~~i~~~~~~l~~l~l~~~~~~~~~i~~~Id~Lyd~lekE~~A~~~  307 (569)
T PRK04778        230 DQLQELKAGYRELVEEGYHLDH--LDIEKEIQDLKEQIDENLALLEELDLDEAEEKNEEIQERIDQLYDILEREVKARKY  307 (569)
T ss_pred             HHHHHHHHHHHHHHHcCCCCCC--CChHHHHHHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH


Q ss_pred             HHHHhHHHHHHHhhhHhhHHHHHHHHHHHHHHHhcCCCCCCCCCCCChhh---HHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 047021           80 LKKRLKEAEEEQYRAEEDAAALRAELNSLQQQAMDGSLGGISSMGISPDH---VQTLEKELATLKSQLQQESQLRQQERK  156 (388)
Q Consensus        80 ~kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq~~~~~~s~~~~~g~s~~~---~~~lekEl~~l~~~lqqes~lrqqeqq  156 (388)
                      ..+....-...=..+.+..-.|..|+..+++           +++.+.+.   .+.+++++..+..++..-..-......
T Consensus       308 vek~~~~l~~~l~~~~e~~~~l~~Ei~~l~~-----------sY~l~~~e~~~~~~lekeL~~Le~~~~~~~~~i~~~~~  376 (569)
T PRK04778        308 VEKNSDTLPDFLEHAKEQNKELKEEIDRVKQ-----------SYTLNESELESVRQLEKQLESLEKQYDEITERIAEQEI  376 (569)
T ss_pred             HHHhhHHHHHHHHHHHHHHHHHHHHHHHHHH-----------ccccCchhHHHHHHHHHHHHHHHHHHHHHHHHHHcCCC


Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHhhhchh-hhhhhhhhhhcchhhHHHHHHHHhHHHHHHHH--HHhhHHHHHHHhh
Q 047021          157 QLTEEQARISVLMSEKQELEEKLVMFSRKASE-EVSNAAASKAFSVEDKEKLEKQLHDMAVVVER--LESSRQKLLMEID  233 (388)
Q Consensus       157 rla~Eq~~~a~l~aEkqeLeekl~~~~~~~se-~~~~~a~~k~~S~eDkekle~Ql~dma~~VeR--LE~SRqKLL~EID  233 (388)
                      ....-+.....+.....++++....+...-.. ...+.-+...+     .++...|+.+.+.|++  |-+.-+..+.-++
T Consensus       377 ~ysel~e~leel~e~leeie~eq~ei~e~l~~Lrk~E~eAr~kL-----~~~~~~L~~ikr~l~k~~lpgip~~y~~~~~  451 (569)
T PRK04778        377 AYSELQEELEEILKQLEEIEKEQEKLSEMLQGLRKDELEAREKL-----ERYRNKLHEIKRYLEKSNLPGLPEDYLEMFF  451 (569)
T ss_pred             CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHcCCCCCcHHHHHHHH


Q ss_pred             hhhHHHHHHhhh------hhhhhh-hHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 047021          234 NQSSEIERLFEE------NSSLSS-SYQESMGIAKHWENQVKDCLKQNEELRGIL  281 (388)
Q Consensus       234 ~QS~EIErLf~E------Ns~Lss-s~qea~~iaaqWEnQvqDclkQNeeLr~lL  281 (388)
                      .=+.+|++|-.+      |-.--. -|.++..--..-+.|..|....-.-+..++
T Consensus       452 ~~~~~i~~l~~~L~~g~VNm~ai~~e~~e~~~~~~~L~~q~~dL~~~a~~lE~~I  506 (569)
T PRK04778        452 EVSDEIEALAEELEEKPINMEAVNRLLEEATEDVETLEEETEELVENATLTEQLI  506 (569)
T ss_pred             HHHHHHHHHHHHhccCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH


No 116
>TIGR02231 conserved hypothetical protein. This family consists of proteins over 500 amino acids long in Caenorhabditis elegans and several bacteria (Pseudomonas aeruginosa, Nostoc sp. PCC 7120, Leptospira interrogans, etc.). The function is unknown.
Probab=47.53  E-value=1.7e+02  Score=30.55  Aligned_cols=95  Identities=23%  Similarity=0.197  Sum_probs=44.1

Q ss_pred             hHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhh---hHHHHhhhhhhhhhhHHHh--hhhHHHHHHHHHHHhHHH
Q 047021           13 NAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVP---SLRKALKDVAMEKDAAVVA--REDLSAQLRTLKKRLKEA   87 (388)
Q Consensus        13 ~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~---slrkaL~~v~mEkDaA~va--reDl~AQLR~~kkRlkEA   87 (388)
                      ..+|+.+.+.|..++.....+++.+..++.=++.+..+   ...+++.  .=..|.+-..  -+-+..++..+..++.++
T Consensus        73 ~~~l~~~l~~l~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  150 (525)
T TIGR02231        73 LAELRKQIRELEAELRDLEDRGDALKALAKFLEDIREGLTEPIKDSAK--RNEPDLKEWFQAFDFNGSEIERLLTEDREA  150 (525)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccccccccc--cCCCCHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            44455555555555555555666666666555544321   1222110  0011211111  122334445555555555


Q ss_pred             HHHHhhhHhhHHHHHHHHHHHH
Q 047021           88 EEEQYRAEEDAAALRAELNSLQ  109 (388)
Q Consensus        88 EEEQyRAEEDaAaLRAEln~lQ  109 (388)
                      +.+.-..++.-+.|+.+|+.+.
T Consensus       151 ~~~~~~~~~~l~~l~~~l~~l~  172 (525)
T TIGR02231       151 ERRIRELEKQLSELQNELNALL  172 (525)
T ss_pred             HHHHHHHHHHHHHHHHHHHhhc
Confidence            5555555555556666665554


No 117
>PF00769 ERM:  Ezrin/radixin/moesin family;  InterPro: IPR011259 The ERM family consists of three closely-related proteins, ezrin, radixin and moesin []. Ezrin was first identified as a constituent of microvilli [], radixin as a barbed, end-capping actin-modulating protein from isolated junctional fractions [], and moesin as a heparin binding protein []. A tumour suppressor molecule responsible for neurofibromatosis type 2 (NF2) is highly similar to ERM proteins and has been designated merlin (moesin-ezrin-radixin-like protein). ERM molecules contain 3 domains, an N-terminal globular domain; an extended alpha-helical domain; and a charged C-terminal domain []. Ezrin, radixin and merlin also contain a polyproline region between the helical and C-terminal domains. The N-terminal domain is highly conserved, and is also found in merlin, band 4.1 proteins and members of the band 4.1 superfamily. ERM proteins crosslink actin filaments with plasma membranes. They co-localise with CD44 at actin filament-plasma membrane interaction sites, associating with CD44 via their N-terminal domains and with actin filaments via their C-terminal domains [].; GO: 0008092 cytoskeletal protein binding, 0005737 cytoplasm, 0019898 extrinsic to membrane; PDB: 2I1J_A 2I1K_A 1E5W_A 1EF1_C.
Probab=47.49  E-value=2.7e+02  Score=26.89  Aligned_cols=106  Identities=21%  Similarity=0.335  Sum_probs=64.3

Q ss_pred             hHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhh------hHHHHhhhhhhhhhhHHHhhhhHHHHHHH
Q 047021            6 LKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVP------SLRKALKDVAMEKDAAVVAREDLSAQLRT   79 (388)
Q Consensus         6 v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~------slrkaL~~v~mEkDaA~vareDl~AQLR~   79 (388)
                      -++|+.++.++++++.+-+..+...+....-|...+...+.....      .+...-.++..+.-.-.-.|+.|.+.++-
T Consensus         7 k~Ele~rL~q~eee~~~a~~~L~e~e~~a~~Leek~k~aeeea~~Le~k~~eaee~~~rL~~~~~~~~eEk~~Le~e~~e   86 (246)
T PF00769_consen    7 KQELEERLRQMEEEMRRAQEALEESEETAEELEEKLKQAEEEAEELEQKRQEAEEEKQRLEEEAEMQEEEKEQLEQELRE   86 (246)
T ss_dssp             CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------------HHHHHH
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            367889999999999999999998888888888887755543322      12222222222222222236677777777


Q ss_pred             HHHHhHHHHHHHhhhHhhHHHHHHHHHHHHHH
Q 047021           80 LKKRLKEAEEEQYRAEEDAAALRAELNSLQQQ  111 (388)
Q Consensus        80 ~kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq  111 (388)
                      ....+..-+++--+.+..+..|+.++-.-+..
T Consensus        87 ~~~~i~~l~ee~~~ke~Ea~~lq~el~~ar~~  118 (246)
T PF00769_consen   87 AEAEIARLEEESERKEEEAEELQEELEEARED  118 (246)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            77777666666667777777777776665553


No 118
>PF05377 FlaC_arch:  Flagella accessory protein C (FlaC);  InterPro: IPR008039 Although archaeal flagella appear superficially similar to those of bacteria, they are quite distinct []. In several archaea, the flagellin genes are followed immediately by the flagellar accessory genes flaCDEFGHIJ. The gene products may have a role in translocation, secretion, or assembly of the flagellum. FlaC is a protein whose exact role is unknown but it has been shown to be membrane-associated (by immuno-blotting fractionated cells) [].
Probab=46.88  E-value=38  Score=26.74  Aligned_cols=38  Identities=21%  Similarity=0.360  Sum_probs=31.7

Q ss_pred             hhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhh
Q 047021           12 KNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTV   49 (388)
Q Consensus        12 ~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v   49 (388)
                      |..++|.+...+.+.+.+...+++.+-+.++++++|+.
T Consensus         1 Ri~elEn~~~~~~~~i~tvk~en~~i~~~ve~i~envk   38 (55)
T PF05377_consen    1 RIDELENELPRIESSINTVKKENEEISESVEKIEENVK   38 (55)
T ss_pred             CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            34678888888888899999999999999998888873


No 119
>TIGR02449 conserved hypothetical protein TIGR02449. Members of this family are small proteins, typically 73 amino acids in length, with single copies in each of several Proteobacteria, including Xylella fastidiosa, Pseudomonas aeruginosa, and Xanthomonas campestris. The function is unknown.
Probab=46.49  E-value=43  Score=27.06  Aligned_cols=38  Identities=16%  Similarity=0.204  Sum_probs=23.1

Q ss_pred             hhhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHh
Q 047021            4 SQLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRI   41 (388)
Q Consensus         4 ~~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~L   41 (388)
                      .+|-.|=..|.+|.++|..|..+......++..|....
T Consensus         7 ~kle~Li~~~~~L~~EN~~Lr~q~~~~~~ER~~L~ekn   44 (65)
T TIGR02449         7 AQVEHLLEYLERLKSENRLLRAQEKTWREERAQLLEKN   44 (65)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            34555666666666666666666666666665555443


No 120
>KOG4643 consensus Uncharacterized coiled-coil protein [Function unknown]
Probab=45.99  E-value=6e+02  Score=30.48  Aligned_cols=103  Identities=24%  Similarity=0.302  Sum_probs=59.6

Q ss_pred             HHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhh--------------------------------h--hhHH
Q 047021            8 DFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNT--------------------------------V--PSLR   53 (388)
Q Consensus         8 ~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~--------------------------------v--~slr   53 (388)
                      .|.--|+++++.++.|+.++++|-.-+.-|.+.|+|++.-.                                +  |.-+
T Consensus       174 hL~velAdle~kir~LrqElEEK~enll~lr~eLddleae~~klrqe~~e~l~ea~ra~~yrdeldalre~aer~d~~yk  253 (1195)
T KOG4643|consen  174 HLEVELADLEKKIRTLRQELEEKFENLLRLRNELDDLEAEISKLRQEIEEFLDEAHRADRYRDELDALREQAERPDTTYK  253 (1195)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhhhHHHHHHHhhhcCCCccc
Confidence            45667788888888888888877766666666666543211                                1  3344


Q ss_pred             HHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhHHHHHHHhhhHhhHHHHHHHHHHHHHHH
Q 047021           54 KALKDVAMEKDAAVVAREDLSAQLRTLKKRLKEAEEEQYRAEEDAAALRAELNSLQQQA  112 (388)
Q Consensus        54 kaL~~v~mEkDaA~vareDl~AQLR~~kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq~  112 (388)
                      .+|-|+-+=||++.-+|+|=.+  ...-|-+=+++=.-|||--|-+.+-.+|=.++|+.
T Consensus       254 erlmDs~fykdRveelkedN~v--LleekeMLeeQLq~lrarse~~tleseiiqlkqkl  310 (1195)
T KOG4643|consen  254 ERLMDSDFYKDRVEELKEDNRV--LLEEKEMLEEQLQKLRARSEGATLESEIIQLKQKL  310 (1195)
T ss_pred             hhhhhhHHHHHHHHHHHhhhHH--HHHHHHHHHHHHHHHHhccccCChHHHHHHHHHHH
Confidence            5555555566666666666221  11222233334445666555566666666666665


No 121
>COG4985 ABC-type phosphate transport system, auxiliary component [Inorganic ion transport and metabolism]
Probab=45.78  E-value=1.6e+02  Score=29.81  Aligned_cols=65  Identities=23%  Similarity=0.393  Sum_probs=51.6

Q ss_pred             hhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHh-------hhhHHHHHHHHHHHhHHH
Q 047021           23 LSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVA-------REDLSAQLRTLKKRLKEA   87 (388)
Q Consensus        23 L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~va-------reDl~AQLR~~kkRlkEA   87 (388)
                      |...++..--+.+.|...+|+++..-|..+-.-|+.+.++|-+..++       -.|+.|-+-.+++|++--
T Consensus       162 l~~eLqkr~~~v~~l~~q~~k~~~~qv~~in~qlErLRL~krrlQl~g~Ld~~~q~~~~ae~seLq~r~~~l  233 (289)
T COG4985         162 LERELQKRLLEVETLRDQVDKMVEQQVRVINSQLERLRLEKRRLQLNGQLDDEFQQHYVAEKSELQKRLAQL  233 (289)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcccccHHHHHHHHHHHHHHHHHHHHH
Confidence            44556666677889999999999999999999999999999877765       566777777777777543


No 122
>PF12795 MscS_porin:  Mechanosensitive ion channel porin domain
Probab=45.23  E-value=2.7e+02  Score=26.20  Aligned_cols=186  Identities=23%  Similarity=0.259  Sum_probs=92.4

Q ss_pred             HHHHHHHHhHHHHHHHhhhHhhHHHHHHHHHHHHHHHhcCCCCCCCCCCCChhhHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 047021           76 QLRTLKKRLKEAEEEQYRAEEDAAALRAELNSLQQQAMDGSLGGISSMGISPDHVQTLEKELATLKSQLQQESQLRQQER  155 (388)
Q Consensus        76 QLR~~kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq~~~~~~s~~~~~g~s~~~~~~lekEl~~l~~~lqqes~lrqqeq  155 (388)
                      +....+++..+-..---.|-.....++++|+.+... ...+...  +.+.|   +..|+..|....++|.....-.++..
T Consensus        32 ~~~~~~~~~~~~~~~i~~aP~~~~~l~~~l~~l~~~-~~~~~~~--~~~~s---~~eLeq~l~~~~~~L~~~q~~l~~~~  105 (240)
T PF12795_consen   32 EIKKQKKRAAEYQKQIDQAPKEIRELQKELEALKSQ-DAPSKEI--LANLS---LEELEQRLSQEQAQLQELQEQLQQEN  105 (240)
T ss_pred             HHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHhhhcc-ccccccC--cccCC---HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            333444444443333344555677788888888766 1112111  22223   45677777777777777666555555


Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhhhhhhhhhhhcchhhHHHHHHHHhHHHHHHHHHHhhHHHHHHHhhhh
Q 047021          156 KQLTEEQARISVLMSEKQELEEKLVMFSRKASEEVSNAAASKAFSVEDKEKLEKQLHDMAVVVERLESSRQKLLMEIDNQ  235 (388)
Q Consensus       156 qrla~Eq~~~a~l~aEkqeLeekl~~~~~~~se~~~~~a~~k~~S~eDkekle~Ql~dma~~VeRLE~SRqKLL~EID~Q  235 (388)
                      -++..=+.+...+...+-+....+..+...-....++.  ...++...+-.|...+.-+-.-+.-|              
T Consensus       106 ~~l~~~~~~p~~aq~~l~~~~~~l~ei~~~L~~~~~~~--~~~l~~a~~~~l~ae~~~l~~~~~~l--------------  169 (240)
T PF12795_consen  106 SQLIEIQTRPERAQQQLSEARQRLQEIRNQLQNLPPNG--ESPLSEAQRWLLQAELAALEAQIEML--------------  169 (240)
T ss_pred             HHHHHHHccHHHHHHHHHHHHHHHHHHHHHHhccCCCC--cchhhHHHHHHHHHHHHHHHHHHHHH--------------
Confidence            55554444444444444444444444333311100110  13444444444444422222222221              


Q ss_pred             hHHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhc
Q 047021          236 SSEIERLFEENSSLSSSYQESMGIAKHWENQVKDCLKQNEELRGILEKLRTEQAS  290 (388)
Q Consensus       236 S~EIErLf~ENs~Lsss~qea~~iaaqWEnQvqDclkQNeeLr~lLeklR~EQa~  290 (388)
                          +..---|+++.+=|+-=+......-++...   +..-|++.|+..|...+-
T Consensus       170 ----e~el~s~~~rq~L~~~qrdl~~~~~~~l~~---~l~~Lq~~ln~~R~~eae  217 (240)
T PF12795_consen  170 ----EQELLSNNNRQELLQLQRDLLKARIQRLQQ---QLQALQNLLNQKRRQEAE  217 (240)
T ss_pred             ----HHHHHCcHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHH
Confidence                222234666777777667766665555543   445678888887776553


No 123
>PRK04863 mukB cell division protein MukB; Provisional
Probab=45.08  E-value=6.7e+02  Score=30.74  Aligned_cols=27  Identities=19%  Similarity=0.361  Sum_probs=16.0

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHhhhchh
Q 047021          162 QARISVLMSEKQELEEKLVMFSRKASE  188 (388)
Q Consensus       162 q~~~a~l~aEkqeLeekl~~~~~~~se  188 (388)
                      ......+...++.++++...+.+....
T Consensus       462 E~kL~~lea~leql~~~~~~l~~~~Gk  488 (1486)
T PRK04863        462 EQKLSVAQAAHSQFEQAYQLVRKIAGE  488 (1486)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHcCC
Confidence            334455666667777777666655444


No 124
>PF07989 Microtub_assoc:  Microtubule associated;  InterPro: IPR012943 Proteins with this domain associate with the spindle body during cell division [].
Probab=45.03  E-value=64  Score=26.28  Aligned_cols=53  Identities=28%  Similarity=0.271  Sum_probs=38.3

Q ss_pred             hhHHHHHHhhhhhhhhhhHHHHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHhh
Q 047021          235 QSSEIERLFEENSSLSSSYQESMGI-AKHWENQVKDCLKQNEELRGILEKLRTE  287 (388)
Q Consensus       235 QS~EIErLf~ENs~Lsss~qea~~i-aaqWEnQvqDclkQNeeLr~lLeklR~E  287 (388)
                      |...|++|--||=+|-=-+--...- ...-...+.+++++|.+|++-++.|+.|
T Consensus         5 qe~~i~~L~KENF~LKLrI~fLee~l~~~~~~~~~~~~keNieLKve~~~L~~e   58 (75)
T PF07989_consen    5 QEEQIDKLKKENFNLKLRIYFLEERLQKLGPESIEELLKENIELKVEVESLKRE   58 (75)
T ss_pred             HHHHHHHHHHhhhhHHHHHHHHHHHHHhcccccHHHHHHHHHHHHHHHHHHHHH
Confidence            6677999999998875332222111 1334677899999999999999988876


No 125
>PF04880 NUDE_C:  NUDE protein, C-terminal conserved region;  InterPro: IPR006964 This domain represents the C-terminal conserved region of NUDE proteins. Emericella nidulans (Aspergillus nidulans) NUDE, acts in the cytoplasmic dynein/dynactin pathway and is required for distribution of nuclei []. It is a homologue of the nuclear distribution protein RO11 of Neurospora crassa. NUDE interacts with the NUDF via an N-terminal coiled coil domain; this is the only domain which is absolutely required for NUDE function.; PDB: 2V66_B 2V71_B.
Probab=45.01  E-value=27  Score=32.50  Aligned_cols=43  Identities=35%  Similarity=0.537  Sum_probs=24.9

Q ss_pred             hHHHHHhhHHHHHHhHHhhhhHHHHH---HhhHHHHHHhhHhhhhh
Q 047021            6 LKDFQKKNAELEEKNSILSSKLQVKE---VESESLQTRINELEQNT   48 (388)
Q Consensus         6 v~~lq~~~~~le~~n~~L~sk~~~k~---~~~~~L~~~Lnd~e~n~   48 (388)
                      |.+++.|...-=|.|-+|.++|++|+   ++++-|.+++-||.+-.
T Consensus         2 LeD~EsklN~AIERnalLE~ELdEKE~L~~~~QRLkDE~RDLKqEl   47 (166)
T PF04880_consen    2 LEDFESKLNQAIERNALLESELDEKENLREEVQRLKDELRDLKQEL   47 (166)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCH--------------
T ss_pred             HHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            67899999999999999999999887   44444555555554433


No 126
>TIGR03752 conj_TIGR03752 integrating conjugative element protein, PFL_4705 family. Members of this protein family are found occasionally on plasmids such as the Pseudomonas putida toluene catabolic TOL plasmid pWWO_p085. Usually, however, they are found on the bacterial main chromosome in regions flanked by markers of conjugative transfer and/or transposition.
Probab=44.56  E-value=2.4e+02  Score=30.47  Aligned_cols=23  Identities=22%  Similarity=0.377  Sum_probs=14.6

Q ss_pred             hhhHHHHHhhHHHHHHhHHhhhh
Q 047021            4 SQLKDFQKKNAELEEKNSILSSK   26 (388)
Q Consensus         4 ~~v~~lq~~~~~le~~n~~L~sk   26 (388)
                      .++..|...|..|-.+|..|+.+
T Consensus        73 ~~~~~l~~~N~~l~~eN~~L~~r   95 (472)
T TIGR03752        73 KRLAKLISENEALKAENERLQKR   95 (472)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHh
Confidence            45666666677777777666443


No 127
>PF12128 DUF3584:  Protein of unknown function (DUF3584);  InterPro: IPR021979  This family consist of uncharacterised bacterial proteins. 
Probab=44.28  E-value=5.9e+02  Score=29.89  Aligned_cols=50  Identities=36%  Similarity=0.425  Sum_probs=29.6

Q ss_pred             hhHHHhhhhHHHHHHHHHHHhHHHHHHHhhhHhhHHHHHHHHHHHHHHHh
Q 047021           64 DAAVVAREDLSAQLRTLKKRLKEAEEEQYRAEEDAAALRAELNSLQQQAM  113 (388)
Q Consensus        64 DaA~vareDl~AQLR~~kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq~~  113 (388)
                      ..+.-+..++-.++..+++.-..|+.+--.++....-+++++..|+.+..
T Consensus       486 ~~~~~~~~~~~~~~~~~~~~~~~a~~~l~~~~~~~~~~~~~~~~l~~~L~  535 (1201)
T PF12128_consen  486 NQAQQAVEELQAEEQELRKERDQAEEELRQARRELEELRAQIAELQRQLD  535 (1201)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhC
Confidence            33333344555555555555555555555555666667777777777774


No 128
>PF15294 Leu_zip:  Leucine zipper
Probab=43.64  E-value=3.6e+02  Score=27.28  Aligned_cols=174  Identities=18%  Similarity=0.223  Sum_probs=99.1

Q ss_pred             hHHHHHHHHhHHHHHHHHHHhhHHHHHHHhhhhhHHHHHHhhh--------------hhhhh--hhHHHHHHHHHHHH--
Q 047021          203 DKEKLEKQLHDMAVVVERLESSRQKLLMEIDNQSSEIERLFEE--------------NSSLS--SSYQESMGIAKHWE--  264 (388)
Q Consensus       203 Dkekle~Ql~dma~~VeRLE~SRqKLL~EID~QS~EIErLf~E--------------Ns~Ls--ss~qea~~iaaqWE--  264 (388)
                      .+.|.+++|...-..+.=|.+||   |.|---=+.||..+|..              |..-.  --++...+-|-.|-  
T Consensus         2 ~r~kr~~~Lk~Vds~F~Dlk~sr---L~e~t~T~~EV~~~ldgL~~~v~~~vesEL~N~~htn~lllrql~~qAek~~lk   78 (278)
T PF15294_consen    2 ARSKREQHLKEVDSCFQDLKSSR---LREDTYTSDEVTEMLDGLQVVVKSEVESELINTSHTNVLLLRQLFSQAEKWYLK   78 (278)
T ss_pred             chhHHHHHHHHHHHhhhHHHHHH---HHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHhHHHhHHHHHHHHHHHHHHHHHH
Confidence            36678888888877788888877   44522223365555533              22111  23677888889994  


Q ss_pred             HHHHHHHHHHHHHHHHHHHHHhhhhccCCCCccccccccCCCCCCccccCCCchhhhhHHhhHHHHHHHhhHHHHhHHHH
Q 047021          265 NQVKDCLKQNEELRGILEKLRTEQASFMSVNDKDILRDFSEPSKDAVSQNGSETHKNEFLALKGQLAKEQSRAEALSAQV  344 (388)
Q Consensus       265 nQvqDclkQNeeLr~lLeklR~EQa~~~~~~~~~~~g~~~e~~~dg~~~~~~~~~~te~l~LK~eL~kEqsRseaLsaqv  344 (388)
                      =|+-=.-..|.+|-.-.++...-+++........+.+ +.=+|-+++  .|++....||.+|+.+..+-.+|.-.+..+.
T Consensus        79 l~~diselEn~eLLe~i~~~E~~~~~~~~~~~~~~~~-~KL~pl~e~--g~~~ll~kEi~rLq~EN~kLk~rl~~le~~a  155 (278)
T PF15294_consen   79 LQTDISELENRELLEQIAEFEKQEFTSSFKPNQETSK-PKLEPLNES--GGSELLNKEIDRLQEENEKLKERLKSLEKQA  155 (278)
T ss_pred             hcccHHHHHHHHHHHHHHHHHHhhhcccCCccccccc-ccccccccc--chHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            3333333445555554444444444433311111111 111122211  1256677789999999888888888777776


Q ss_pred             HHHHHHHHHHHHHhhhhHH------------HHHHHHhhHHHHHHHhhhc
Q 047021          345 LQLSAQLQQTTQAYNGLVR------------LYKPVLRNIESSLIKMKQD  382 (388)
Q Consensus       345 ~qLsAeL~ra~QA~n~L~R------------ly~PvL~~IEsrLikmkq~  382 (388)
                      ....-+=.......+.|..            .--+-+.++|+-+..||-+
T Consensus       156 t~~l~Ek~kl~~~L~~lq~~~~~~~~k~~~~~~~q~l~dLE~k~a~lK~e  205 (278)
T PF15294_consen  156 TSALDEKSKLEAQLKELQDEQGDQKGKKDLSFKAQDLSDLENKMAALKSE  205 (278)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHhhhccccccccccchhhHHHHHHHHHHH
Confidence            6666555555555555554            2234577788888888744


No 129
>PF08826 DMPK_coil:  DMPK coiled coil domain like;  InterPro: IPR014930 This domain is found in the myotonic dystrophy protein kinase (DMPK) and adopts a coiled coil structure. It plays a role in dimerisation []. ; GO: 0004674 protein serine/threonine kinase activity, 0005524 ATP binding, 0006468 protein phosphorylation; PDB: 1WT6_D.
Probab=43.46  E-value=77  Score=25.26  Aligned_cols=40  Identities=25%  Similarity=0.361  Sum_probs=27.7

Q ss_pred             HHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhh
Q 047021            8 DFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQN   47 (388)
Q Consensus         8 ~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n   47 (388)
                      .++..+..+-..|..+.++|++.++.|..|...+..++..
T Consensus        15 ~~~eEL~kvk~~n~~~e~kLqeaE~rn~eL~~ei~~L~~e   54 (61)
T PF08826_consen   15 AIQEELTKVKSANLAFESKLQEAEKRNRELEQEIERLKKE   54 (61)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            4555666666777777777777777777777777766543


No 130
>KOG0018 consensus Structural maintenance of chromosome protein 1 (sister chromatid cohesion complex Cohesin, subunit SMC1) [Cell cycle control, cell division, chromosome partitioning]
Probab=42.96  E-value=6.7e+02  Score=30.13  Aligned_cols=226  Identities=24%  Similarity=0.304  Sum_probs=135.1

Q ss_pred             hHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhh--h-hHHHHhhhhhhhhhhHHHhhhhHHHHHH----------H
Q 047021           13 NAELEEKNSILSSKLQVKEVESESLQTRINELEQNTV--P-SLRKALKDVAMEKDAAVVAREDLSAQLR----------T   79 (388)
Q Consensus        13 ~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v--~-slrkaL~~v~mEkDaA~vareDl~AQLR----------~   79 (388)
                      .-++|..+.++...+.....++..+..++|.-+...+  | ...|.++++ ..-|.-+--++..++..+          -
T Consensus       222 LfhvE~~i~k~~~els~~~~ei~~~~~~~d~~e~ei~~~k~e~~ki~re~-~~~Dk~i~~ke~~l~erp~li~~ke~~~~  300 (1141)
T KOG0018|consen  222 LFHVEACIEKANDELSRLNAEIPKLKERMDKKEREIRVRKKERGKIRREL-QKVDKKISEKEEKLAERPELIKVKENASH  300 (1141)
T ss_pred             HhhhhhhHhhhhHHHHHHhhhhHHHHhhhhHHHHHHHHHHHHHHHHHHHH-HHHHHHHHHHHHHHhhhhHHhhcchhhcc
Confidence            4467888888888888888888899999987766655  3 234555555 355666666777777621          2


Q ss_pred             HHHHhHHHHHHHhhhHhhHHHHHHHHHHHHHHHhcCCCCCCCCCCCChhhHHHHHHHHHHHHHHHHHHHHHHH-------
Q 047021           80 LKKRLKEAEEEQYRAEEDAAALRAELNSLQQQAMDGSLGGISSMGISPDHVQTLEKELATLKSQLQQESQLRQ-------  152 (388)
Q Consensus        80 ~kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq~~~~~~s~~~~~g~s~~~~~~lekEl~~l~~~lqqes~lrq-------  152 (388)
                      .++|+.+++-.=|-++.|+-++..++..+.-+.-+-+++           -..+++||.       +.++.+=       
T Consensus       301 ~k~rl~~~~k~i~~~kk~~~~~~~~ie~~ek~l~av~~~-----------~~~fekei~-------~~~q~rg~~lnl~d  362 (1141)
T KOG0018|consen  301 LKKRLEEIEKDIETAKKDYRALKETIERLEKELKAVEGA-----------KEEFEKEIE-------ERSQERGSELNLKD  362 (1141)
T ss_pred             chhHHHHhhhhHHHHHHHHHhhHHHHHHHHHHHHHHHHH-----------HHHHHHHHH-------HHHhhccccCCcch
Confidence            468888888888888888888888888887776332221           123445544       4443221       


Q ss_pred             ---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhchh---hhhhhhhhhhcchhhHHHHHHHHhHHHHHHHHHHhhHH
Q 047021          153 ---QERKQLTEEQARISVLMSEKQELEEKLVMFSRKASE---EVSNAAASKAFSVEDKEKLEKQLHDMAVVVERLESSRQ  226 (388)
Q Consensus       153 ---qeqqrla~Eq~~~a~l~aEkqeLeekl~~~~~~~se---~~~~~a~~k~~S~eDkekle~Ql~dma~~VeRLE~SRq  226 (388)
                         ++-.||-.|....+         .++|.++-..--.   +...-       ..-+..++..+..+-.+|+|+.--|.
T Consensus       363 ~~~~ey~rlk~ea~~~~---------~~el~~ln~~~r~~~~~ld~~-------~~~~~elE~r~k~l~~sver~~~~~~  426 (1141)
T KOG0018|consen  363 DQVEEYERLKEEACKEA---------LEELEVLNRNMRSDQDTLDHE-------LERRAELEARIKQLKESVERLDKRRN  426 (1141)
T ss_pred             HHHHHHHHHHHHHhhhh---------HHHHHHHHHHHHHHHHHHhhH-------HHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence               24445554443322         3334433322100   11110       12356677777777888888888888


Q ss_pred             HHHHHhhhhhHHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHH
Q 047021          227 KLLMEIDNQSSEIERLFEENSSLSSSYQESMGIAKHWENQVKDCLKQ  273 (388)
Q Consensus       227 KLL~EID~QS~EIErLf~ENs~Lsss~qea~~iaaqWEnQvqDclkQ  273 (388)
                      +|+.=|++-+..-+-+-.+=..|...|-.+.+-.+---.-+.+|+-|
T Consensus       427 ~L~~~i~s~~~~~~e~~~d~~~l~~~~~~~~~~~~e~n~eL~~~~~q  473 (1141)
T KOG0018|consen  427 KLAAKITSLSRSYEELKHDLDSLESLVSSAEEEPYELNEELVEVLDQ  473 (1141)
T ss_pred             HHHHHHHHHHHHHHHHhhcHHHHHHHHhhhhhhHHHHHHHHHHHHHH
Confidence            88888877666666555555555555555544444333333333333


No 131
>PF04156 IncA:  IncA protein;  InterPro: IPR007285 Chlamydia trachomatis is an obligate intracellular bacterium that develops within a parasitophorous vacuole termed an inclusion. The inclusion is nonfusogenic with lysosomes but intercepts lipids from a host cell exocytic pathway. Initiation of chlamydial development is concurrent with modification of the inclusion membrane by a set of C. trachomatis-encoded proteins collectively designated Incs. One of these Incs, IncA (Inclusion membrane protein A), is functionally associated with the homotypic fusion of inclusions [].
Probab=42.53  E-value=2.5e+02  Score=25.05  Aligned_cols=34  Identities=29%  Similarity=0.402  Sum_probs=15.5

Q ss_pred             hHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHH
Q 047021            6 LKDFQKKNAELEEKNSILSSKLQVKEVESESLQT   39 (388)
Q Consensus         6 v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~   39 (388)
                      +.++++..+++..+...++.++.+...+...+.+
T Consensus        83 ~~~~~~~l~~l~~el~~l~~~~~~~~~~l~~~~~  116 (191)
T PF04156_consen   83 LSELQQQLQQLQEELDQLQERIQELESELEKLKE  116 (191)
T ss_pred             HHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            3444444444444444444444444444444333


No 132
>PF09730 BicD:  Microtubule-associated protein Bicaudal-D;  InterPro: IPR018477 BicD proteins consist of three coiled-coiled domains and are involved in dynein-mediated minus end-directed transport from the Golgi apparatus to the endoplasmic reticulum (ER) []. Glycogen synthase kinase-3beta (GSK-3beta) is required for the binding of BICD to dynein but not to dynactin, acting to maintain the anchoring of microtubules to the centromere []. It appears that amino-acid residues 437-617 of BicD and the kinase activity of GSK-3 are necessary for the formation of a complex between BicD and GSK-3beta in intact cells [].; GO: 0006810 transport, 0005794 Golgi apparatus
Probab=41.48  E-value=5.8e+02  Score=29.02  Aligned_cols=140  Identities=26%  Similarity=0.315  Sum_probs=92.5

Q ss_pred             HHHHHhhHHHHHHHhhhhhHHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCC----
Q 047021          218 VERLESSRQKLLMEIDNQSSEIERLFEENSSLSSSYQESMGIAKHWENQVKDCLKQNEELRGILEKLRTEQASFMS----  293 (388)
Q Consensus       218 VeRLE~SRqKLL~EID~QS~EIErLf~ENs~Lsss~qea~~iaaqWEnQvqDclkQNeeLr~lLeklR~EQa~~~~----  293 (388)
                      |-=|+.|-    -|++.=--||-||-+|+.-|-+.+.|++.+-.--|.|+-|||-=...=|.-=..||.|-...+.    
T Consensus       113 vs~Lk~sQ----vefE~~Khei~rl~Ee~~~l~~qlee~~rLk~iae~qleEALesl~~EReqk~~LrkEL~~~~~~~~~  188 (717)
T PF09730_consen  113 VSVLKQSQ----VEFEGLKHEIKRLEEEIELLNSQLEEAARLKEIAEKQLEEALESLKSEREQKNALRKELDQHLNIESI  188 (717)
T ss_pred             HHHHHHhH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcccc
Confidence            65666653    5888888999999999999999999999999999999999985433333322245555443221    


Q ss_pred             --CCcccc----------ccccCCCCCC------------------c--------cccCC-----------CchhhhhHH
Q 047021          294 --VNDKDI----------LRDFSEPSKD------------------A--------VSQNG-----------SETHKNEFL  324 (388)
Q Consensus       294 --~~~~~~----------~g~~~e~~~d------------------g--------~~~~~-----------~~~~~te~l  324 (388)
                        .++..+          .+..++|..|                  |        .+...           |.-+.+||.
T Consensus       189 ~~~~~~~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~lv~DLfSEl~~~Eiq  268 (717)
T PF09730_consen  189 SYLSNLAISLDGLKFSEDPRAATEPNNDDEEENGGLNGGPGLAKGNGDNRMSTPRKSESFSPAPSLVSDLFSELNLSEIQ  268 (717)
T ss_pred             ccccchhhcccccccccccccccCCCCchhhhcchhhccchhcccccccccCCCCCCCCCCCCCcccchhhhhcchHHHH
Confidence              111111          0111122222                  1        00000           334567999


Q ss_pred             hhHHHHHHHhhHHHHhHHHHHHHHHHHHHHHHHhhhh
Q 047021          325 ALKGQLAKEQSRAEALSAQVLQLSAQLQQTTQAYNGL  361 (388)
Q Consensus       325 ~LK~eL~kEqsRseaLsaqv~qLsAeL~ra~QA~n~L  361 (388)
                      +||.+|..--.--..|..-+.-.-.+|..+..+....
T Consensus       269 KL~qQL~qve~EK~~L~~~L~e~Q~qLe~a~~als~q  305 (717)
T PF09730_consen  269 KLKQQLLQVEREKSSLLSNLQESQKQLEHAQGALSEQ  305 (717)
T ss_pred             HHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            9999999988778889888888888888887776543


No 133
>PF05600 DUF773:  Protein of unknown function (DUF773);  InterPro: IPR008491 This family contains several eukaryotic sequences which are thought to be CDK5 activator-binding proteins, however, the function of this family is unknown.
Probab=41.16  E-value=4.6e+02  Score=28.20  Aligned_cols=114  Identities=20%  Similarity=0.271  Sum_probs=74.5

Q ss_pred             hhHHHHHHhhhhh-hhhhhHHHHHHH-----------HHHHHHHHHHHHHHHHHHHHHHHHHHhhhhccCCCCccccccc
Q 047021          235 QSSEIERLFEENS-SLSSSYQESMGI-----------AKHWENQVKDCLKQNEELRGILEKLRTEQASFMSVNDKDILRD  302 (388)
Q Consensus       235 QS~EIErLf~ENs-~Lsss~qea~~i-----------aaqWEnQvqDclkQNeeLr~lLeklR~EQa~~~~~~~~~~~g~  302 (388)
                      -|.+|-|+|+.++ -|...++-.+-.           -+..+.|+.||-|...++..-....+.+-......-  |+.| 
T Consensus        96 ~W~~Iv~~yeK~n~~L~E~a~~L~r~v~YeIP~lkKqi~k~~q~~~d~~kk~~e~~~~~~~~~~~~~~~c~~l--GI~G-  172 (507)
T PF05600_consen   96 DWQEIVKLYEKDNLYLAEAAQILVRNVNYEIPALKKQIAKCQQQLEDLDKKEEELQRSAAEARERYKKACKQL--GIKG-  172 (507)
T ss_pred             HHHHHHHHHHhccchHHHHHHHHHHHhcccchHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHh--CCcc-
Confidence            3889999998654 455555544322           357889999999999998887777777666655433  2222 


Q ss_pred             cCCCCCCccccCCCchhhhhHHhhHHHHHHHhhHHHHhHHHHHHHHHHHHHHHHHhhhhHHH--------HHHHHhhH
Q 047021          303 FSEPSKDAVSQNGSETHKNEFLALKGQLAKEQSRAEALSAQVLQLSAQLQQTTQAYNGLVRL--------YKPVLRNI  372 (388)
Q Consensus       303 ~~e~~~dg~~~~~~~~~~te~l~LK~eL~kEqsRseaLsaqv~qLsAeL~ra~QA~n~L~Rl--------y~PvL~~I  372 (388)
                                   . .       ++.||..-..-.-.+=.+|.+.++.|..|.+-|..+..+        ..|+|+-|
T Consensus       173 -------------~-n-------ir~ELl~l~~~LP~~~~~i~~~i~~l~~aie~Y~~f~~~~~~~~~~~~Lp~L~~v  229 (507)
T PF05600_consen  173 -------------E-N-------IREELLELVKELPSLFDEIVEAISDLQEAIEYYQAFVEFVHDESESEVLPLLRFV  229 (507)
T ss_pred             -------------c-h-------hHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCccchhhHHHHHH
Confidence                         1 1       344444433334444455667778899999999998886        56776443


No 134
>PF05103 DivIVA:  DivIVA protein;  InterPro: IPR007793 The Bacillus subtilis divIVA1 mutation causes misplacement of the septum during cell division, resulting in the formation of small, circular, anucleate minicells []. Inactivation of divIVA produces a minicell phenotype, whereas overproduction of DivIVA results in a filamentation phenotype []. These proteins appear to contain coiled-coils.; PDB: 2WUK_C 2WUJ_A.
Probab=40.88  E-value=13  Score=30.97  Aligned_cols=19  Identities=42%  Similarity=0.625  Sum_probs=0.0

Q ss_pred             hHhhHHHHHHHHHHHHHHH
Q 047021           94 AEEDAAALRAELNSLQQQA  112 (388)
Q Consensus        94 AEEDaAaLRAEln~lQqq~  112 (388)
                      |...+..++.++..|.++.
T Consensus       105 A~~~~~~l~~~~~~lk~~~  123 (131)
T PF05103_consen  105 ARAEAERLREEIEELKRQA  123 (131)
T ss_dssp             -------------------
T ss_pred             HHHHHHHHHHHHHHHHHHH
Confidence            4455555555555555443


No 135
>PRK10698 phage shock protein PspA; Provisional
Probab=40.36  E-value=3.3e+02  Score=25.90  Aligned_cols=68  Identities=15%  Similarity=0.262  Sum_probs=30.0

Q ss_pred             hhhHHHHHHHHHHHhHHHHHHHhhhHhhHHHHHHHHHHHHHHHhcCCCCCCCCCCCChhhHHHHHHHHHHHHHHH
Q 047021           70 REDLSAQLRTLKKRLKEAEEEQYRAEEDAAALRAELNSLQQQAMDGSLGGISSMGISPDHVQTLEKELATLKSQL  144 (388)
Q Consensus        70 reDl~AQLR~~kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq~~~~~~s~~~~~g~s~~~~~~lekEl~~l~~~l  144 (388)
                      -+.|-.+++.++.++.++...+     +.-..|..-..-+.++- ...+|+ ..|.+..-+..||..+..+.++-
T Consensus       115 ~~~L~~~l~~L~~ki~eak~k~-----~~L~aR~~~A~a~~~~~-~~~~~~-~~~~a~~~f~rmE~ki~~~Ea~a  182 (222)
T PRK10698        115 LARMKKEIGELENKLSETRARQ-----QALMLRHQAASSSRDVR-RQLDSG-KLDEAMARFESFERRIDQMEAEA  182 (222)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHH-----HHHHHHHHHHHHHHHHH-HHHhCC-CcchHHHHHHHHHHHHHHHHHHH
Confidence            3455555666666666554332     22222322222222221 111222 22334455666777666554443


No 136
>COG2433 Uncharacterized conserved protein [Function unknown]
Probab=40.03  E-value=3.4e+02  Score=30.58  Aligned_cols=87  Identities=26%  Similarity=0.431  Sum_probs=61.9

Q ss_pred             hhhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHH
Q 047021            4 SQLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKR   83 (388)
Q Consensus         4 ~~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkR   83 (388)
                      ..++.+.++...|+++|+.|...+.++..+++-|..+|....+..--..++         |.-+.+++.=   ...+.++
T Consensus       422 ~~i~~~~~~ve~l~~e~~~L~~~~ee~k~eie~L~~~l~~~~r~~~~~~~~---------~rei~~~~~~---I~~L~~~  489 (652)
T COG2433         422 KRIKKLEETVERLEEENSELKRELEELKREIEKLESELERFRREVRDKVRK---------DREIRARDRR---IERLEKE  489 (652)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh---------hHHHHHHHHH---HHHHHHH
Confidence            467888999999999999999999999999999999999888776644443         3344444432   3455666


Q ss_pred             hHHHHHHHhhhHhhHHHHHHHHHHHH
Q 047021           84 LKEAEEEQYRAEEDAAALRAELNSLQ  109 (388)
Q Consensus        84 lkEAEEEQyRAEEDaAaLRAEln~lQ  109 (388)
                      |.|....       .--|...|+.+-
T Consensus       490 L~e~~~~-------ve~L~~~l~~l~  508 (652)
T COG2433         490 LEEKKKR-------VEELERKLAELR  508 (652)
T ss_pred             HHHHHHH-------HHHHHHHHHHHH
Confidence            6655433       234666666665


No 137
>KOG4403 consensus Cell surface glycoprotein STIM, contains SAM domain [General function prediction only]
Probab=39.95  E-value=5.5e+02  Score=28.25  Aligned_cols=137  Identities=26%  Similarity=0.338  Sum_probs=89.2

Q ss_pred             CChhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhhhhhhhhhhhcchhhH
Q 047021          125 ISPDHVQTLEKELATLKSQLQQESQLRQQERKQLTEEQARISVLMSEKQELEEKLVMFSRKASEEVSNAAASKAFSVEDK  204 (388)
Q Consensus       125 ~s~~~~~~lekEl~~l~~~lqqes~lrqqeqqrla~Eq~~~a~l~aEkqeLeekl~~~~~~~se~~~~~a~~k~~S~eDk  204 (388)
                      .+-+|+..|-+.|.+|    |...+=-+--|++|..++..-.....||+-||.++...     +..++.          +
T Consensus       239 ~akehv~km~kdle~L----q~aEqsl~dlQk~Lekar~e~rnvavek~~lerkl~ea-----~rl~el----------r  299 (575)
T KOG4403|consen  239 KAKEHVNKMMKDLEGL----QRAEQSLEDLQKRLEKAREEQRNVAVEKLDLERKLDEA-----PRLSEL----------R  299 (575)
T ss_pred             HHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHhhhchhhhhhhHHHHHhhh-----hhhhhh----------h
Confidence            4556677777776644    44444445678888888887788889999999988711     111111          1


Q ss_pred             HHHHHHHhHHHHHHHHHHhhHHHHHHHhhhhhHHHHHHhhhhhh------hhhhHHHHHHHHHHH-HHHHHHHHHHHHHH
Q 047021          205 EKLEKQLHDMAVVVERLESSRQKLLMEIDNQSSEIERLFEENSS------LSSSYQESMGIAKHW-ENQVKDCLKQNEEL  277 (388)
Q Consensus       205 ekle~Ql~dma~~VeRLE~SRqKLL~EID~QS~EIErLf~ENs~------Lsss~qea~~iaaqW-EnQvqDclkQNeeL  277 (388)
                      +-.+.+            -+| |=|.-+-.+-.+.|+=|+-||+      |-.=+|=+-++-.+. +.+=|+|-||.-+-
T Consensus       300 eg~e~e------------~~r-kelE~lR~~L~kAEkele~nS~wsaP~aLQ~wLq~T~E~E~q~~~kkrqnaekql~~A  366 (575)
T KOG4403|consen  300 EGVENE------------TSR-KELEQLRVALEKAEKELEANSSWSAPLALQKWLQLTHEVEVQYYNKKRQNAEKQLKEA  366 (575)
T ss_pred             cchhHH------------HHH-HHHHHHHHHHHHHHHHHHhccCCCCcHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHH
Confidence            111222            111 1111122344678888999964      444566677776664 56889999999999


Q ss_pred             HHHHHHHHhhhhccCC
Q 047021          278 RGILEKLRTEQASFMS  293 (388)
Q Consensus       278 r~lLeklR~EQa~~~~  293 (388)
                      +.+-||++.-+.+...
T Consensus       367 ke~~eklkKKrssv~g  382 (575)
T KOG4403|consen  367 KEMAEKLKKKRSSVFG  382 (575)
T ss_pred             HHHHHHHHHhhcchhe
Confidence            9999999998877764


No 138
>PF09403 FadA:  Adhesion protein FadA;  InterPro: IPR018543  FadA (Fusobacterium adhesin A) is an adhesin which forms two alpha helices. ; PDB: 3ETZ_B 3ETY_A 2GL2_B 3ETX_C 3ETW_A.
Probab=39.94  E-value=2.8e+02  Score=24.87  Aligned_cols=60  Identities=27%  Similarity=0.374  Sum_probs=37.6

Q ss_pred             CCCChhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh
Q 047021          123 MGISPDHVQTLEKELATLKSQLQQESQLRQQERKQLTEEQARISVLMSEKQELEEKLVMFSRK  185 (388)
Q Consensus       123 ~g~s~~~~~~lekEl~~l~~~lqqes~lrqqeqqrla~Eq~~~a~l~aEkqeLeekl~~~~~~  185 (388)
                      ++.+.+....++.++..|.+++++   |.+.|.+|-.++.....++.++..++.+.-+.+...
T Consensus        15 ~sfaA~~~~~v~~~l~~LEae~q~---L~~kE~~r~~~~k~~ae~a~~~L~~~~~~~~~i~e~   74 (126)
T PF09403_consen   15 ISFAATATASVESELNQLEAEYQQ---LEQKEEARYNEEKQEAEAAEAELAELKELYAEIEEK   74 (126)
T ss_dssp             --------HHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             HHHHcccchHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHH
Confidence            333444467788888888887764   777788888877777777777776666665555433


No 139
>PF10168 Nup88:  Nuclear pore component;  InterPro: IPR019321  Nup88 can be divided into two structural domains; the N-terminal two-thirds of the protein have no obvious structural motifs. It is, however, where it binds to Nup98; one of the components of the nuclear pore. The C-terminal end is a predicted coiled-coil domain []. Nup88 is over expressed in tumour cells []. 
Probab=39.67  E-value=6e+02  Score=28.62  Aligned_cols=43  Identities=21%  Similarity=0.382  Sum_probs=30.3

Q ss_pred             hhhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhh
Q 047021            4 SQLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQ   46 (388)
Q Consensus         4 ~~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~   46 (388)
                      ..+.+++++...+.+.-.+|..+++.....-+.|.+|++.+-+
T Consensus       579 ~~L~~l~e~~~~l~~~ae~LaeR~e~a~d~Qe~L~~R~~~vl~  621 (717)
T PF10168_consen  579 KELQELQEERKSLRESAEKLAERYEEAKDKQEKLMKRVDRVLQ  621 (717)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            3455666677777777777777777777777888888775544


No 140
>PF04111 APG6:  Autophagy protein Apg6;  InterPro: IPR007243 Macroautophagy is a bulk degradation process induced by starvation in eukaryotic cells. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. No molecule involved in autophagy has yet been identified in higher eukaryotes []. The pre-autophagosomal structure contains at least five Apg proteins: Apg1p, Apg2p, Apg5p, Aut7p/Apg8p and Apg16p. It is found in the vacuole []. The C-terminal glycine of Apg12p is conjugated to a lysine residue of Apg5p via an isopeptide bond. During autophagy, cytoplasmic components are enclosed in autophagosomes and delivered to lysosomes/vacuoles. Auotphagy protein 16 (Apg16) has been shown to be bind to Apg5 and is required for the function of the Apg12p-Apg5p conjugate []. Autophagy protein 5 (Apg5) is directly required for the import of aminopeptidase I via the cytoplasm-to-vacuole targeting pathway []. Apg6/Vps30p has two distinct functions in the autophagic process, either associated with the membrane or in a retrieval step of the carboxypeptidase Y sorting pathway [].; GO: 0006914 autophagy; PDB: 3Q8T_A 3VP7_A 4DDP_A.
Probab=39.05  E-value=4.1e+02  Score=26.59  Aligned_cols=88  Identities=25%  Similarity=0.316  Sum_probs=44.3

Q ss_pred             hHHHHHHHHhHHHHHHHHHHhhHHHHHHHhhhhhHHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 047021          203 DKEKLEKQLHDMAVVVERLESSRQKLLMEIDNQSSEIERLFEENSSLSSSYQESMGIAKHWENQVKDCLKQNEELRGILE  282 (388)
Q Consensus       203 Dkekle~Ql~dma~~VeRLE~SRqKLL~EID~QS~EIErLf~ENs~Lsss~qea~~iaaqWEnQvqDclkQNeeLr~lLe  282 (388)
                      +-.+|++.-..+..-++.||.-+..|..||.....|.+.|=.+-...=..|..-.-.-.+-.+.......|..-....|+
T Consensus        51 el~~le~Ee~~l~~eL~~LE~e~~~l~~el~~le~e~~~l~~eE~~~~~~~n~~~~~l~~~~~e~~sl~~q~~~~~~~L~  130 (314)
T PF04111_consen   51 ELEKLEQEEEELLQELEELEKEREELDQELEELEEELEELDEEEEEYWREYNELQLELIEFQEERDSLKNQYEYASNQLD  130 (314)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            33444445455555566666666666666655555555544333333333333333333334444445555566666677


Q ss_pred             HHHhhhhccC
Q 047021          283 KLRTEQASFM  292 (388)
Q Consensus       283 klR~EQa~~~  292 (388)
                      +||.  .|..
T Consensus       131 ~L~k--tNv~  138 (314)
T PF04111_consen  131 RLRK--TNVY  138 (314)
T ss_dssp             CHHT----TT
T ss_pred             HHHh--cCch
Confidence            7766  5555


No 141
>PF06005 DUF904:  Protein of unknown function (DUF904);  InterPro: IPR009252 Cell division protein ZapB is a non-essential, abundant cell division factor that is required for proper Z-ring formation. It is recruited early to the divisome by direct interaction with FtsZ, stimulating Z-ring assembly and thereby promoting cell division earlier in the cell cycle. Its recruitment to the Z-ring requires functional FtsA or ZipA.; GO: 0000917 barrier septum formation, 0043093 cytokinesis by binary fission, 0005737 cytoplasm; PDB: 2JEE_A.
Probab=38.93  E-value=1.4e+02  Score=24.17  Aligned_cols=36  Identities=31%  Similarity=0.353  Sum_probs=19.7

Q ss_pred             hhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHH
Q 047021            5 QLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTR   40 (388)
Q Consensus         5 ~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~   40 (388)
                      .+.-|+.++.+|.++|..|.+...+...+|.-|..+
T Consensus        19 ti~~Lq~e~eeLke~n~~L~~e~~~L~~en~~L~~e   54 (72)
T PF06005_consen   19 TIALLQMENEELKEKNNELKEENEELKEENEQLKQE   54 (72)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHH
Confidence            344455556666666555555555555555555533


No 142
>PRK10884 SH3 domain-containing protein; Provisional
Probab=38.13  E-value=2.9e+02  Score=26.36  Aligned_cols=20  Identities=30%  Similarity=0.474  Sum_probs=14.7

Q ss_pred             hhhHHHHHHHHHHHHHHHHH
Q 047021          127 PDHVQTLEKELATLKSQLQQ  146 (388)
Q Consensus       127 ~~~~~~lekEl~~l~~~lqq  146 (388)
                      -+.+..|++|+++++.+|.+
T Consensus        92 ~~rlp~le~el~~l~~~l~~  111 (206)
T PRK10884         92 RTRVPDLENQVKTLTDKLNN  111 (206)
T ss_pred             HHHHHHHHHHHHHHHHHHHH
Confidence            34577788888888877665


No 143
>PF00261 Tropomyosin:  Tropomyosin;  InterPro: IPR000533 Tropomyosins [], are a family of closely related proteins present in muscle and non-muscle cells. In striated muscle, tropomyosin mediate the interactions between the troponin complex and actin so as to regulate muscle contraction []. The role of tropomyosin in smooth muscle and non-muscle tissues is not clear. Tropomyosin is an alpha-helical protein that forms a coiled-coil structure of 2 parallel helices containing 2 sets of 7 alternating actin binding sites []. There are multiple cell-specific isoforms, created by differential splicing of the messenger RNA from one gene, but the proportions of the isoforms vary between different cell types. Muscle isoforms of tropomyosin are characterised by having 284 amino acid residues and a highly conserved N-terminal region, whereas non-muscle forms are generally smaller and are heterogeneous in their N-terminal region. This entry represents tropomyosin (Tmp) 1, 2 and 3. Within the yeast Tmp1 and Tmp2, biochemical and sequence analyses indicate that Tpm2 spans four actin monomers along a filament, whereas Tpm1 spans five. Despite its shorter length, Tpm2 can compete with Tpm1 for binding to F-actin. Over-expression of Tpm2 in vivo alters the axial budding of haploids to a bipolar pattern, and this can be partially suppressed by co-over-expression of Tpm1. This suggests distinct functions for the two tropomyosins, and indicates that the ratio between them is important for correct morphogenesis [].; PDB: 2EFR_A 2Z5H_C 2Z5I_D 2D3E_B 2EFS_D 3U59_B 1C1G_C 1IHQ_A 3AZD_B 1MV4_B ....
Probab=37.68  E-value=3.6e+02  Score=25.50  Aligned_cols=197  Identities=21%  Similarity=0.297  Sum_probs=104.5

Q ss_pred             hhHHHHHHHHHHHhHHHHHHHhhhHhhHHHHHHHHHHHHHHHhcCCCCCCCCCCCChhhHHHHHHHHHHHHHHHHHHHHH
Q 047021           71 EDLSAQLRTLKKRLKEAEEEQYRAEEDAAALRAELNSLQQQAMDGSLGGISSMGISPDHVQTLEKELATLKSQLQQESQL  150 (388)
Q Consensus        71 eDl~AQLR~~kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq~~~~~~s~~~~~g~s~~~~~~lekEl~~l~~~lqqes~l  150 (388)
                      |++-..+..+...|++++..-..||...++|.--|..|-.......           +.+......+.++...+......
T Consensus        11 d~~~~~~~~~~~~l~~~~~~~~~aE~e~~~l~rri~~lE~~le~~e-----------erL~~~~~kL~~~e~~~de~er~   79 (237)
T PF00261_consen   11 DEAEERLEEAEEKLKEAEKRAEKAEAEVASLQRRIQLLEEELERAE-----------ERLEEATEKLEEAEKRADESERA   79 (237)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCHHHH-----------CCCCHHHHHHHHHHHHHHHHCHH
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHH-----------HHHHHHHHHHHHHHHHHHHHHHH
Confidence            4556677778888888888888888888888888877765553221           11222233333333333333333


Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhhhhhhh-hhhhc---------chhhHHHHHHHHhHHHHHHHH
Q 047021          151 RQQERKQLTEEQARISVLMSEKQELEEKLVMFSRKASEEVSNAA-ASKAF---------SVEDKEKLEKQLHDMAVVVER  220 (388)
Q Consensus       151 rqqeqqrla~Eq~~~a~l~aEkqeLeekl~~~~~~~se~~~~~a-~~k~~---------S~eDkekle~Ql~dma~~VeR  220 (388)
                      +..-..|...--.++..|.....+....+..+-.+..+...... ....+         .-.--..|+..|+.+.--+-.
T Consensus        80 ~k~lE~r~~~~eeri~~lE~~l~ea~~~~ee~e~k~~E~~rkl~~~E~~Le~aEeR~e~~E~ki~eLE~el~~~~~~lk~  159 (237)
T PF00261_consen   80 RKVLENREQSDEERIEELEQQLKEAKRRAEEAERKYEEVERKLKVLEQELERAEERAEAAESKIKELEEELKSVGNNLKS  159 (237)
T ss_dssp             HHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             HHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHHHHHHH
Confidence            33222333333333334433333333333332222222111110 00000         011123455555555555555


Q ss_pred             HHhhHHHHHHHhhhhhHHHHHHhhh-hhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q 047021          221 LESSRQKLLMEIDNQSSEIERLFEE-NSSLSSSYQESMGIAKHWENQVKDCLKQNEELRGILEKLRT  286 (388)
Q Consensus       221 LE~SRqKLL~EID~QS~EIErLf~E-Ns~Lsss~qea~~iaaqWEnQvqDclkQNeeLr~lLeklR~  286 (388)
                      |+.+-.        +.++-+.-|++ -..|..-+.++-.=+-.=|+.|..+=++++.|.+-|..++.
T Consensus       160 lE~~~~--------~~~~re~~~e~~i~~L~~~lkeaE~Rae~aE~~v~~Le~~id~le~eL~~~k~  218 (237)
T PF00261_consen  160 LEASEE--------KASEREDEYEEKIRDLEEKLKEAENRAEFAERRVKKLEKEIDRLEDELEKEKE  218 (237)
T ss_dssp             HHHHHH--------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             hhhhhh--------hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            555444        44444333433 46677888888888888889999999999999998887664


No 144
>PF15035 Rootletin:  Ciliary rootlet component, centrosome cohesion
Probab=37.54  E-value=3.5e+02  Score=25.33  Aligned_cols=117  Identities=22%  Similarity=0.263  Sum_probs=63.5

Q ss_pred             HHHHHHHHHhcCCCCCCCCC--CCChhhHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHH
Q 047021          104 ELNSLQQQAMDGSLGGISSM--GISPDHVQTLEKELATLKSQLQQESQLRQ---QERKQLTEEQARISVLMSEKQELEEK  178 (388)
Q Consensus       104 Eln~lQqq~~~~~~s~~~~~--g~s~~~~~~lekEl~~l~~~lqqes~lrq---qeqqrla~Eq~~~a~l~aEkqeLeek  178 (388)
                      -..-|.++..+++.++.++.  +...+|+..|+.=+.-|..+-+++.-|.+   ..++.|.....-...|..+.+-|...
T Consensus        31 rc~ele~~l~~~~~l~~~~~~~~~~~e~s~dLe~~l~rLeEEqqR~~~L~qvN~lLReQLEq~~~~N~~L~~dl~klt~~  110 (182)
T PF15035_consen   31 RCAELEQQLSASQVLESPSQRRRSEEEHSPDLEEALIRLEEEQQRSEELAQVNALLREQLEQARKANEALQEDLQKLTQD  110 (182)
T ss_pred             HHHHHHHHHhcccCcCcccccccccccCcccHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            34455666655544433332  34566777888877777777777666544   24555555555555566666665555


Q ss_pred             HHHHhh----hchh-hhhhhhhhhhcchh---------hHHHHHHHHhHHHHHHHH
Q 047021          179 LVMFSR----KASE-EVSNAAASKAFSVE---------DKEKLEKQLHDMAVVVER  220 (388)
Q Consensus       179 l~~~~~----~~se-~~~~~a~~k~~S~e---------Dkekle~Ql~dma~~VeR  220 (388)
                      ...+..    +..+ ....-++..-||.+         .--.+.+++.+|....||
T Consensus       111 ~~~l~~eL~~ke~~~~~ee~~~~~y~~~eh~rll~LWr~v~~lRr~f~elr~~Ter  166 (182)
T PF15035_consen  111 WERLRDELEQKEAEWREEEENFNQYLSSEHSRLLSLWREVVALRRQFAELRTATER  166 (182)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHhhhcccccHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence            555321    2112 12222233444443         334556677777766665


No 145
>KOG4797 consensus Transcriptional regulator [Transcription]
Probab=37.24  E-value=33  Score=30.79  Aligned_cols=24  Identities=33%  Similarity=0.682  Sum_probs=20.1

Q ss_pred             hhhhhHHHHHhhHHHHHHhHHhhh
Q 047021            2 LLSQLKDFQKKNAELEEKNSILSS   25 (388)
Q Consensus         2 l~~~v~~lq~~~~~le~~n~~L~s   25 (388)
                      |-.++++|.++|..||+||..|..
T Consensus        72 Lk~qI~eL~er~~~Le~EN~lLk~   95 (123)
T KOG4797|consen   72 LKEQIRELEERNSALERENSLLKT   95 (123)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHh
Confidence            346789999999999999998864


No 146
>PF02994 Transposase_22:  L1 transposable element;  InterPro: IPR004244 Many human L1 elements are capable of retrotransposition. Some of these have been shown to exhibit reverse transcriptase (RT) activity [] although the function of many are, as yet, unknown. More information about these proteins can be found at Protein of the Month: Transposase [].; PDB: 2LDY_A 3SOO_A 2YKQ_A 2YKO_C 2YKP_B 2W7A_B 2JRB_A.
Probab=37.10  E-value=74  Score=32.37  Aligned_cols=45  Identities=18%  Similarity=0.398  Sum_probs=26.2

Q ss_pred             hhhhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhh
Q 047021            3 LSQLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQN   47 (388)
Q Consensus         3 ~~~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n   47 (388)
                      .++|.+++.+..++|.....+...++.++..+..|...|+|++-+
T Consensus       143 ~~Ri~e~Eeris~lEd~~~~i~~~~~~~~k~i~~l~~kl~DlEnr  187 (370)
T PF02994_consen  143 NSRIDELEERISELEDRIEEIEQAIKELEKRIKKLEDKLDDLENR  187 (370)
T ss_dssp             -HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHhHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHhh
Confidence            355666666666666666666655555555555666666665544


No 147
>smart00787 Spc7 Spc7 kinetochore protein. This domain is found in cell division proteins which are required for kinetochore-spindle association.
Probab=36.61  E-value=4.6e+02  Score=26.45  Aligned_cols=102  Identities=23%  Similarity=0.332  Sum_probs=72.7

Q ss_pred             HHHHHHHHHHHHHHHhcCCCCCCCCCCCChhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 047021           98 AAALRAELNSLQQQAMDGSLGGISSMGISPDHVQTLEKELATLKSQLQQESQLRQQERKQLTEEQARISVLMSEKQELEE  177 (388)
Q Consensus        98 aAaLRAEln~lQqq~~~~~~s~~~~~g~s~~~~~~lekEl~~l~~~lqqes~lrqqeqqrla~Eq~~~a~l~aEkqeLee  177 (388)
                      -+.|..|+..|++....-.       ...++....+..+|+....++..-.....+-+.++..=...+.....+|+++..
T Consensus       181 ~~~L~~e~~~L~~~~~e~~-------~~d~~eL~~lk~~l~~~~~ei~~~~~~l~e~~~~l~~l~~~I~~~~~~k~e~~~  253 (312)
T smart00787      181 KDALEEELRQLKQLEDELE-------DCDPTELDRAKEKLKKLLQEIMIKVKKLEELEEELQELESKIEDLTNKKSELNT  253 (312)
T ss_pred             HHHHHHHHHHHHHhHHHHH-------hCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            4578888888887763211       125788888888998888888877777777777777777788888888888888


Q ss_pred             HHHHHhhhchhhhhhhhhhhhcchhhHHHHHHHHhH
Q 047021          178 KLVMFSRKASEEVSNAAASKAFSVEDKEKLEKQLHD  213 (388)
Q Consensus       178 kl~~~~~~~se~~~~~a~~k~~S~eDkekle~Ql~d  213 (388)
                      .|+.+.+.       .--.+.||.-|-.+|..+++.
T Consensus       254 ~I~~ae~~-------~~~~r~~t~~Ei~~Lk~~~~~  282 (312)
T smart00787      254 EIAEAEKK-------LEQCRGFTFKEIEKLKEQLKL  282 (312)
T ss_pred             HHHHHHHH-------HHhcCCCCHHHHHHHHHHHHH
Confidence            88776542       123455666676677666443


No 148
>KOG4643 consensus Uncharacterized coiled-coil protein [Function unknown]
Probab=35.90  E-value=8.5e+02  Score=29.33  Aligned_cols=85  Identities=28%  Similarity=0.381  Sum_probs=50.2

Q ss_pred             hHHHHHHHHhHHHHHHHHHHhhHHHHHHH----------hhhhhHHHHHHh----hhhhhhhhhHH---HHHHHHHHHH-
Q 047021          203 DKEKLEKQLHDMAVVVERLESSRQKLLME----------IDNQSSEIERLF----EENSSLSSSYQ---ESMGIAKHWE-  264 (388)
Q Consensus       203 Dkekle~Ql~dma~~VeRLE~SRqKLL~E----------ID~QS~EIErLf----~ENs~Lsss~q---ea~~iaaqWE-  264 (388)
                      +...+-+-|++=+.+.+||.++-.+|..-          .+.+-.||+++.    .||..|=.-++   .+..=.+..| 
T Consensus       489 ~iknlnk~L~~r~~elsrl~a~~~elkeQ~kt~~~qye~~~~k~eeLe~~l~~lE~ENa~LlkqI~~Lk~t~qn~~~LEq  568 (1195)
T KOG4643|consen  489 QIKNLNKSLNNRDLELSRLHALKNELKEQYKTCDIQYELLSNKLEELEELLGNLEEENAHLLKQIQSLKTTSQNGALLEQ  568 (1195)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHhHHHHHHHH
Confidence            35566777888899999999998888643          344445666654    47776643332   2211222222 


Q ss_pred             ---------HHHHHHHHHHHHHHHHHHHHHhh
Q 047021          265 ---------NQVKDCLKQNEELRGILEKLRTE  287 (388)
Q Consensus       265 ---------nQvqDclkQNeeLr~lLeklR~E  287 (388)
                               +-.+--+.-.+.||.+.++|+.|
T Consensus       569 ~~n~lE~~~~elkk~idaL~alrrhke~LE~e  600 (1195)
T KOG4643|consen  569 NNNDLELIHNELKKYIDALNALRRHKEKLEEE  600 (1195)
T ss_pred             hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence                     12222333334778888888877


No 149
>TIGR02894 DNA_bind_RsfA transcription factor, RsfA family. In a subset of endospore-forming members of the Firmcutes, members of this protein family are found, several to a genome. Two very strongly conserved sequences regions are separated by a highly variable linker region. Much of the linker region was excised from the seed alignment for this model. A characterized member is the prespore-specific transcription RsfA from Bacillus subtilis, previously called YwfN, which is controlled by sigma factor F and seems to fine-tune expression of some genes in the sigma-F regulon. A paralog in Bacillus subtilis is designated YlbO.
Probab=35.64  E-value=96  Score=29.14  Aligned_cols=39  Identities=28%  Similarity=0.392  Sum_probs=24.2

Q ss_pred             hHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHh
Q 047021            6 LKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINEL   44 (388)
Q Consensus         6 v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~   44 (388)
                      .+.+++.|..|+.++..|+.++...+.+++.|++++.-.
T Consensus        99 ~~~~~~e~~~l~~e~~~l~~~~e~Le~e~~~L~~~~~~~  137 (161)
T TIGR02894        99 DQALQKENERLKNQNESLQKRNEELEKELEKLRQRLSTI  137 (161)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            344555666666666666666666666666666666533


No 150
>TIGR01010 BexC_CtrB_KpsE polysaccharide export inner-membrane protein, BexC/CtrB/KpsE family. This family contains gamma proteobacterial proteins involved in capsule polysaccharide export.
Probab=35.57  E-value=4.4e+02  Score=25.96  Aligned_cols=83  Identities=17%  Similarity=0.251  Sum_probs=51.9

Q ss_pred             hhhhHHHhhhhHHHHHHHHHHHhHHHHHH--Hhhh-------Hhh-------HHHHHHHHHHHHHHHhcCCCCCCCCCCC
Q 047021           62 EKDAAVVAREDLSAQLRTLKKRLKEAEEE--QYRA-------EED-------AAALRAELNSLQQQAMDGSLGGISSMGI  125 (388)
Q Consensus        62 EkDaA~vareDl~AQLR~~kkRlkEAEEE--QyRA-------EED-------aAaLRAEln~lQqq~~~~~~s~~~~~g~  125 (388)
                      ....+.-+++-+-.||-.+++++.+||..  .||.       +.+       .+.|++++..++.+...-.  +  .+|+
T Consensus       164 ~~~~~~~a~~fl~~ql~~~~~~l~~ae~~l~~fr~~~~~~d~~~~~~~~~~~i~~L~~~l~~~~~~l~~l~--~--~~~~  239 (362)
T TIGR01010       164 NERARKDTIAFAENEVKEAEQRLNATKAELLKYQIKNKVFDPKAQSSAQLSLISTLEGELIRVQAQLAQLR--S--ITPE  239 (362)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcChHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--h--hCCC
Confidence            34445556777888888999999988853  3443       222       3456666666666653221  1  2344


Q ss_pred             ChhhHHHHHHHHHHHHHHHHHHH
Q 047021          126 SPDHVQTLEKELATLKSQLQQES  148 (388)
Q Consensus       126 s~~~~~~lekEl~~l~~~lqqes  148 (388)
                      .+-.+..+..+++.++.++..+.
T Consensus       240 ~~P~v~~l~~~i~~l~~~i~~e~  262 (362)
T TIGR01010       240 QNPQVPSLQARIKSLRKQIDEQR  262 (362)
T ss_pred             CCCchHHHHHHHHHHHHHHHHHH
Confidence            45567788888888888877644


No 151
>PRK13922 rod shape-determining protein MreC; Provisional
Probab=35.17  E-value=3.6e+02  Score=25.65  Aligned_cols=70  Identities=19%  Similarity=0.240  Sum_probs=43.0

Q ss_pred             HHhHHHHHHHHHHhhHHHHHHHhhhhhHHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 047021          210 QLHDMAVVVERLESSRQKLLMEIDNQSSEIERLFEENSSLSSSYQESMGIAKHWENQVKDCLKQNEELRGILEK  283 (388)
Q Consensus       210 Ql~dma~~VeRLE~SRqKLL~EID~QS~EIErLf~ENs~Lsss~qea~~iaaqWEnQvqDclkQNeeLr~lLek  283 (388)
                      -+.++..-+.+.-..--..+.++-.....+..+-.||..|..-..+    -..|-.++..|..+|++||.+|+-
T Consensus        42 ~~~~~~~p~~~~~~~~~~~~~~~~~~~~~~~~l~~en~~L~~e~~~----l~~~~~~~~~l~~en~~L~~lL~~  111 (276)
T PRK13922         42 VVGDVVSPVQRVVNAPREFVSGVFESLASLFDLREENEELKKELLE----LESRLQELEQLEAENARLRELLNL  111 (276)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHhcC
Confidence            3344444444444444444555555556666666666666444333    345667778999999999998864


No 152
>PRK04406 hypothetical protein; Provisional
Probab=35.11  E-value=1.3e+02  Score=24.53  Aligned_cols=52  Identities=21%  Similarity=0.357  Sum_probs=0.0

Q ss_pred             hHHHHHHhhHhhhhhh--hhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhHHHH
Q 047021           34 SESLQTRINELEQNTV--PSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRLKEAE   88 (388)
Q Consensus        34 ~~~L~~~Lnd~e~n~v--~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRlkEAE   88 (388)
                      .+.+..||.+||-...  --.=..|++|-.+-+.-+   +.|..+|+.+..|+++.+
T Consensus         6 ~~~le~Ri~~LE~~lAfQE~tIe~LN~~v~~Qq~~I---~~L~~ql~~L~~rl~~~~   59 (75)
T PRK04406          6 IEQLEERINDLECQLAFQEQTIEELNDALSQQQLLI---TKMQDQMKYVVGKVKNMD   59 (75)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHhhc


No 153
>PF04102 SlyX:  SlyX;  InterPro: IPR007236 The SlyX protein has no known function. It is short, less than 80 amino acids, and its gene is found close to the slyD gene. The SlyX protein has a conserved PPH(Y/W) motif at its C terminus. The protein may be a coiled-coil structure.; PDB: 3EFG_A.
Probab=34.26  E-value=1.4e+02  Score=23.48  Aligned_cols=45  Identities=16%  Similarity=0.204  Sum_probs=31.8

Q ss_pred             hhhhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhh
Q 047021            3 LSQLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQN   47 (388)
Q Consensus         3 ~~~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n   47 (388)
                      -.||-+|+-+.+..|.-...|+.-+-.-...++.|..++..+...
T Consensus         3 e~Ri~~LE~~la~qe~~ie~Ln~~v~~Qq~~I~~L~~~l~~L~~r   47 (69)
T PF04102_consen    3 EERIEELEIKLAFQEDTIEELNDVVTEQQRQIDRLQRQLRLLRER   47 (69)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHT
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            457788888888888888777777766666666666666655443


No 154
>PF11336 DUF3138:  Protein of unknown function (DUF3138);  InterPro: IPR021485  This family of proteins with unknown function appear to be restricted to Proteobacteria. 
Probab=33.99  E-value=39  Score=36.37  Aligned_cols=67  Identities=21%  Similarity=0.312  Sum_probs=33.7

Q ss_pred             hhHHHHHHHHHHHhHHHHHHHhhhHhhHHHHHHHHHHHHHHHhcCCCCCCCCCCCChhhHHHHHHHHHH
Q 047021           71 EDLSAQLRTLKKRLKEAEEEQYRAEEDAAALRAELNSLQQQAMDGSLGGISSMGISPDHVQTLEKELAT  139 (388)
Q Consensus        71 eDl~AQLR~~kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq~~~~~~s~~~~~g~s~~~~~~lekEl~~  139 (388)
                      ++|.+||+.+|+|.+|-+-. ..|-.-+|.--|-+-.. .-+....++.-++.+..+|++.+|..++++
T Consensus        28 ~~L~~ql~aLq~~v~eL~~~-laa~~~aa~~gA~~~~~-~~a~~~aP~~~a~~~~T~d~~~~~~qqiAn   94 (514)
T PF11336_consen   28 KALQAQLQALQDQVNELRAK-LAAKPAAAPGGAAIGPA-ATAAAAAPSSDAQAGLTNDDATEMRQQIAN   94 (514)
T ss_pred             HHHHHHHHHHHHHHHHHHHH-HhcCCCCCCcccccccc-ccccccCCCcccccccChHHHHHHHHHHHh
Confidence            78999999999988874432 11111111100000000 000000111113566778999888888885


No 155
>PRK10246 exonuclease subunit SbcC; Provisional
Probab=33.51  E-value=8.1e+02  Score=28.37  Aligned_cols=20  Identities=15%  Similarity=0.104  Sum_probs=14.1

Q ss_pred             hccCCCCccccccccCCCCC
Q 047021          289 ASFMSVNDKDILRDFSEPSK  308 (388)
Q Consensus       289 a~~~~~~~~~~~g~~~e~~~  308 (388)
                      +.+-|..||+.-|+..-|++
T Consensus       498 ~~L~~GePCPVCGS~~HP~~  517 (1047)
T PRK10246        498 AQLQAGQPCPLCGSTSHPAV  517 (1047)
T ss_pred             HhCCCCCCcCCCCcccCcCc
Confidence            46678889999887554433


No 156
>PF10168 Nup88:  Nuclear pore component;  InterPro: IPR019321  Nup88 can be divided into two structural domains; the N-terminal two-thirds of the protein have no obvious structural motifs. It is, however, where it binds to Nup98; one of the components of the nuclear pore. The C-terminal end is a predicted coiled-coil domain []. Nup88 is over expressed in tumour cells []. 
Probab=33.36  E-value=7.4e+02  Score=27.89  Aligned_cols=32  Identities=25%  Similarity=0.496  Sum_probs=19.4

Q ss_pred             hhhhcchhhHHHHHHHHhHHHHHHHHHHhhHHH
Q 047021          195 ASKAFSVEDKEKLEKQLHDMAVVVERLESSRQK  227 (388)
Q Consensus       195 ~~k~~S~eDkekle~Ql~dma~~VeRLE~SRqK  227 (388)
                      +.+.| .++-+.+..+|++|+-.++.++.--++
T Consensus       633 AEr~~-~~EL~~~~~~l~~l~~si~~lk~k~~~  664 (717)
T PF10168_consen  633 AEREF-KKELERMKDQLQDLKASIEQLKKKLDY  664 (717)
T ss_pred             HHHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            34555 345667777777777777666544333


No 157
>PRK02119 hypothetical protein; Provisional
Probab=33.11  E-value=1.5e+02  Score=24.01  Aligned_cols=53  Identities=26%  Similarity=0.474  Sum_probs=0.0

Q ss_pred             hhHHHHHHhhHhhhhhh--hhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhHHHH
Q 047021           33 ESESLQTRINELEQNTV--PSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRLKEAE   88 (388)
Q Consensus        33 ~~~~L~~~Lnd~e~n~v--~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRlkEAE   88 (388)
                      +...+..|+.+||-.+.  --.=..|++|-.+-+.-+   +.|..+|+.+..|+++.+
T Consensus         3 ~~~~~e~Ri~~LE~rla~QE~tie~LN~~v~~Qq~~i---d~L~~ql~~L~~rl~~~~   57 (73)
T PRK02119          3 IQQNLENRIAELEMKIAFQENLLEELNQALIEQQFVI---DKMQVQLRYMANKLKDMQ   57 (73)
T ss_pred             chHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHhhc


No 158
>TIGR02680 conserved hypothetical protein TIGR02680. Members of this protein family belong to a conserved gene four-gene neighborhood found sporadically in a phylogenetically broad range of bacteria: Nocardia farcinica, Symbiobacterium thermophilum, and Streptomyces avermitilis (Actinobacteria), Geobacillus kaustophilus (Firmicutes), Azoarcus sp. EbN1 and Ralstonia solanacearum (Betaproteobacteria). Proteins in this family average over 1400 amino acids in length.
Probab=32.99  E-value=9.3e+02  Score=28.90  Aligned_cols=108  Identities=20%  Similarity=0.248  Sum_probs=61.1

Q ss_pred             hhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhh-------hhhhhhHHHhhhhHHHHH
Q 047021            5 QLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDV-------AMEKDAAVVAREDLSAQL   77 (388)
Q Consensus         5 ~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v-------~mEkDaA~vareDl~AQL   77 (388)
                      .+..++......+++...+..++.+.+.+.+.+..++..+...-..-..+-|.++       .-.-+...-.-.....++
T Consensus       284 ~~~~~~~~~~~~~~~~~~~~~~~~~le~~~~~l~~~~~~l~~~~a~~~~~eL~el~~ql~~~~~~a~~~~~~~~~a~~~~  363 (1353)
T TIGR02680       284 DLGRARDELETAREEERELDARTEALEREADALRTRLEALQGSPAYQDAEELERARADAEALQAAAADARQAIREAESRL  363 (1353)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            3445555556666666666666666666666677777666544444333444322       111111111122235566


Q ss_pred             HHHHHHhHHHHHHHhhhHhhHHHHHHHHHHHHHHH
Q 047021           78 RTLKKRLKEAEEEQYRAEEDAAALRAELNSLQQQA  112 (388)
Q Consensus        78 R~~kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq~  112 (388)
                      +..+.++.+++..-|.++...+..+.++..+...+
T Consensus       364 e~~~~~~~~~~~r~~~~~~~l~~~~~el~~~a~~~  398 (1353)
T TIGR02680       364 EEERRRLDEEAGRLDDAERELRAAREQLARAAERA  398 (1353)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence            77777777777777777777777777776555444


No 159
>PF09304 Cortex-I_coil:  Cortexillin I, coiled coil;  InterPro: IPR015383 This domain is predominantly found in the actin-bundling protein cortexillin I from Dictyostelium discoideum (Slime mold). The domain has a structure consisting of an 18-heptad-repeat alpha-helical coiled-coil, and is a prerequisite for the assembly of Cortexillin I []. ; PDB: 1D7M_A.
Probab=32.79  E-value=3.6e+02  Score=24.01  Aligned_cols=57  Identities=25%  Similarity=0.375  Sum_probs=37.5

Q ss_pred             HHHHHHHHHHHHhcCCCCCCCCCCCChhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 047021          101 LRAELNSLQQQAMDGSLGGISSMGISPDHVQTLEKELATLKSQLQQESQLRQQERKQLTEEQARISVL  168 (388)
Q Consensus       101 LRAEln~lQqq~~~~~~s~~~~~g~s~~~~~~lekEl~~l~~~lqqes~lrqqeqqrla~Eq~~~a~l  168 (388)
                      |++-+++|++|..+           -...+.+|+.+++++...+--+-+-+-+-+-|+++++...+-+
T Consensus        42 L~~~l~~L~~q~~s-----------~~qr~~eLqaki~ea~~~le~eK~ak~~l~~r~~k~~~dka~l   98 (107)
T PF09304_consen   42 LRNALQSLQAQNAS-----------RNQRIAELQAKIDEARRNLEDEKQAKLELESRLLKAQKDKAIL   98 (107)
T ss_dssp             HHHHHHHHHHHHHH-----------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHH-----------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhHH
Confidence            77777788777632           2345777888888887777666555545556666666555544


No 160
>PRK00736 hypothetical protein; Provisional
Probab=32.61  E-value=1.3e+02  Score=23.97  Aligned_cols=50  Identities=26%  Similarity=0.332  Sum_probs=0.0

Q ss_pred             HHHHHhhHhhhhhh--hhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhHHHH
Q 047021           36 SLQTRINELEQNTV--PSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRLKEAE   88 (388)
Q Consensus        36 ~L~~~Lnd~e~n~v--~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRlkEAE   88 (388)
                      ++..||.+||-++.  --.=..|++|-.+-+..+   +.|..||+.+..|+++.+
T Consensus         2 ~~e~Ri~~LE~klafqe~tie~Ln~~v~~Qq~~i---~~L~~ql~~L~~rl~~~~   53 (68)
T PRK00736          2 DAEERLTELEIRVAEQEKTIEELSDQLAEQWKTV---EQMRKKLDALTERFLSLE   53 (68)
T ss_pred             CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHhc


No 161
>PRK11546 zraP zinc resistance protein; Provisional
Probab=32.47  E-value=1.9e+02  Score=26.59  Aligned_cols=25  Identities=16%  Similarity=0.340  Sum_probs=19.6

Q ss_pred             ChhhHHHHHHHHHHHHHHHHHHHHH
Q 047021          126 SPDHVQTLEKELATLKSQLQQESQL  150 (388)
Q Consensus       126 s~~~~~~lekEl~~l~~~lqqes~l  150 (388)
                      .+..|.+|.+||.+|...|......
T Consensus        87 D~~kI~aL~kEI~~Lr~kL~e~r~~  111 (143)
T PRK11546         87 DSSKINAVAKEMENLRQSLDELRVK  111 (143)
T ss_pred             CHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            4455999999999999888765543


No 162
>COG3206 GumC Uncharacterized protein involved in exopolysaccharide biosynthesis [Cell envelope biogenesis, outer membrane]
Probab=32.36  E-value=5.7e+02  Score=26.22  Aligned_cols=104  Identities=25%  Similarity=0.325  Sum_probs=78.1

Q ss_pred             hHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhH
Q 047021            6 LKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRLK   85 (388)
Q Consensus         6 v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRlk   85 (388)
                      +++|.++.+.+......+...+-.+|-..-.+...++++++.....+++.+...--+.+++.--...|..++-.+++|+.
T Consensus       287 i~~Lr~~~~~~~~~~~~l~~~~~~~~p~~~~~~~q~~~~~~~~~~e~~~~~~~~~~~~~~l~~~~~~L~~~~~~l~~~~~  366 (458)
T COG3206         287 IQDLRQQYAQVRQQIADLSTELGAKHPQLVALEAQLAELRQQIAAELRQILASLPNELALLEQQEAALEKELAQLKGRLS  366 (458)
T ss_pred             HHHHHHHHHHHHHHHHHHHHhhcccChHHHhHHHHHHHHHHHHHHHHHHHHHhchhHHHHHHHHHHHHHHHHHHHHHHHh
Confidence            67788888888888888888999999999999999999999999888888888777777766666667666666666665


Q ss_pred             HHHHHH---hhhHhhHHHHHHHHHHHH
Q 047021           86 EAEEEQ---YRAEEDAAALRAELNSLQ  109 (388)
Q Consensus        86 EAEEEQ---yRAEEDaAaLRAEln~lQ  109 (388)
                      ..=.-|   -+-+-|+.+.|.=+.++-
T Consensus       367 ~~~~~~~~l~~L~Re~~~~r~~ye~lL  393 (458)
T COG3206         367 KLPKLQVQLRELEREAEAARSLYETLL  393 (458)
T ss_pred             hchHhhhHHHHHHHHHHHHHHHHHHHH
Confidence            443333   344556666666555543


No 163
>COG3880 Modulator of heat shock repressor CtsR, McsA [Signal transduction    mechanisms]
Probab=32.33  E-value=1e+02  Score=29.45  Aligned_cols=40  Identities=25%  Similarity=0.491  Sum_probs=32.4

Q ss_pred             hhhHHHHHHHHHHHhHHHHH-HHhhhHhhHHHHHHHHHHHHHHH
Q 047021           70 REDLSAQLRTLKKRLKEAEE-EQYRAEEDAAALRAELNSLQQQA  112 (388)
Q Consensus        70 reDl~AQLR~~kkRlkEAEE-EQyRAEEDaAaLRAEln~lQqq~  112 (388)
                      ..++.-+++.++.-|+..=+ |-|   |.||.+|.+|++|++..
T Consensus       131 ~i~~~~~I~~L~e~Lq~~i~~Eef---EeAA~iRDqIr~Lk~k~  171 (176)
T COG3880         131 KINPKRKIIALKEALQDLIEREEF---EEAAVIRDQIRALKAKN  171 (176)
T ss_pred             cccHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHhhc
Confidence            46777889999998888743 444   78999999999999884


No 164
>PF00992 Troponin:  Troponin;  InterPro: IPR001978 The troponin (Tn) complex regulates Ca2+ induced muscle contraction. Tn contains three subunits, Ca2+ binding (TnC), inhibitory (TnI), and tropomyosin binding (TnT). This family includes troponin T and troponin I. Troponin I binds to actin and troponin T binds to tropomyosin [, , ].; PDB: 1J1D_C 1MXL_I 1LXF_I 2KRD_I 1J1E_F 2L1R_B 2KGB_I 1YTZ_I 1YV0_I 1VDJ_A ....
Probab=32.20  E-value=1.5e+02  Score=26.26  Aligned_cols=75  Identities=25%  Similarity=0.380  Sum_probs=53.4

Q ss_pred             hHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhh-hhHHHHhhhhhhhhhhHHHh----hhhHHHHHHHH
Q 047021            6 LKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTV-PSLRKALKDVAMEKDAAVVA----REDLSAQLRTL   80 (388)
Q Consensus         6 v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v-~slrkaL~~v~mEkDaA~va----reDl~AQLR~~   80 (388)
                      .++|..+...++++.--+..+..-...+++.|-.+.+|+....+ |+|++-=+.+--.+.+.--+    -+||-++|-.+
T Consensus        52 ~k~lh~ri~~leeEryd~E~kv~k~~~Ei~elk~kv~d~rgkfkkp~lkkvkk~~~~~~~~~l~~k~~~~~~lr~~lK~v  131 (132)
T PF00992_consen   52 CKELHERIDKLEEERYDLEEKVAKQDYEIEELKKKVNDLRGKFKKPTLKKVKKSADAMRRKLLGIKHKVSMDLRANLKEV  131 (132)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCSHCCHHHHHHHHHHHHHHHHHHCTTCSHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHhcccHHHHHhHHHHHcccccchHHHHHHHHHHHHHHhhhCcccccHHHHHHHhhcc
Confidence            45677778888888888888888888888888888888888444 98887554444444443333    56666666554


No 165
>TIGR02449 conserved hypothetical protein TIGR02449. Members of this family are small proteins, typically 73 amino acids in length, with single copies in each of several Proteobacteria, including Xylella fastidiosa, Pseudomonas aeruginosa, and Xanthomonas campestris. The function is unknown.
Probab=31.69  E-value=1.3e+02  Score=24.32  Aligned_cols=37  Identities=27%  Similarity=0.339  Sum_probs=29.9

Q ss_pred             hhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhh
Q 047021           12 KNAELEEKNSILSSKLQVKEVESESLQTRINELEQNT   48 (388)
Q Consensus        12 ~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~   48 (388)
                      ....+..++..|..|.+.+--.+++|-.||-.+|+.|
T Consensus        29 q~~~~~~ER~~L~ekne~Ar~rvEamI~RLk~leq~~   65 (65)
T TIGR02449        29 QEKTWREERAQLLEKNEQARQKVEAMITRLKALEQHT   65 (65)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhccCC
Confidence            3445677888888999999999999999998888764


No 166
>PF08429 PLU-1:  PLU-1-like protein;  InterPro: IPR013637 This domain is found in the central region of lysine-specific demethylases, which are nuclear proteins that may have a role in DNA-binding and transcription, and are associated with malignant cancer phenotypes []. The domain is also found in various other Jumonji/ARID domain-containing proteins (see IPR013129 from INTERPRO, IPR001606 from INTERPRO). ; GO: 0016706 oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen, 2-oxoglutarate as one donor, and incorporation of one atom each of oxygen into both donors, 0055114 oxidation-reduction process
Probab=31.67  E-value=4.3e+02  Score=25.61  Aligned_cols=79  Identities=24%  Similarity=0.368  Sum_probs=58.2

Q ss_pred             HHHHHHhHHHHHHHHHHhhHHHHHH-------HhhhhhHHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHH-----
Q 047021          206 KLEKQLHDMAVVVERLESSRQKLLM-------EIDNQSSEIERLFEENSSLSSSYQESMGIAKHWENQVKDCLKQ-----  273 (388)
Q Consensus       206 kle~Ql~dma~~VeRLE~SRqKLL~-------EID~QS~EIErLf~ENs~Lsss~qea~~iaaqWEnQvqDclkQ-----  273 (388)
                      ++-..|+.+-...++.+..=..+|.       .+|.=+.+.++++.....+. .+.+++.-+..|-.+++.++.+     
T Consensus       202 ~~~~~L~~~l~~~~~We~ka~~~L~~~~~~l~~Le~l~~~~~~ipv~~~~~~-~L~~~l~kak~w~~~i~~ll~~~~~~~  280 (335)
T PF08429_consen  202 KLMAELQELLKQGEEWEEKAKELLSRPRVSLEQLEALLEEAENIPVSLPSLD-KLKDALQKAKEWLRQIEELLEQNGSKR  280 (335)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHhcCCCchHHHH-HHHHHHHHHHHHHHHHHHHhcccCCCC
Confidence            3444566666666666665555554       67777888888888887754 8999999999999999999998     


Q ss_pred             --HHHHHHHHHHHH
Q 047021          274 --NEELRGILEKLR  285 (388)
Q Consensus       274 --NeeLr~lLeklR  285 (388)
                        ..+|+.++...+
T Consensus       281 p~~~el~~l~~~~~  294 (335)
T PF08429_consen  281 PTLDELEELVAESE  294 (335)
T ss_pred             CcHHHHHHHHHHHh
Confidence              466777665543


No 167
>PF06632 XRCC4:  DNA double-strand break repair and V(D)J recombination protein XRCC4;  InterPro: IPR010585 This entry represents the DNA double-strand break repair and V(D)J recombination protein XRCC4, which is found in certain Metazoans, fungi and plants. XRCC4 binds to DNA, and to DNA ligase IV (LIG4) to form the LIG4-XRCC4 complex []. The LIG4-XRCC4 complex is responsible for the ligation step in the non-homologous end joining (NHEJ) pathway of DNA double-strand break repair. XRCC4 enhances the joining activity of LIG4. It is thought that XRCC4 and LIG4 are essential for alignment-based gap filling, as well as for final ligation of the breaks []. Binding of the LIG4-XRCC4 complex to DNA ends is dependent on the assembly of the DNA-dependent protein kinase complex DNA-PK to these DNA ends. ; GO: 0003677 DNA binding, 0006302 double-strand break repair, 0006310 DNA recombination, 0005634 nucleus; PDB: 3RWR_R 1IK9_B 3SR2_E 3Q4F_H 3II6_B 1FU1_A 3MUD_B.
Probab=31.59  E-value=3.6e+02  Score=27.73  Aligned_cols=49  Identities=22%  Similarity=0.283  Sum_probs=35.2

Q ss_pred             ChhhhhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhh
Q 047021            1 WLLSQLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTV   49 (388)
Q Consensus         1 ~l~~~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v   49 (388)
                      |++..+..|+.+|.+|..+|..|.+.+.......+-+++.=+..|....
T Consensus       134 ~~l~~~~~l~~~~~~L~~enerL~~e~~~~~~qlE~~v~~K~~~E~~L~  182 (342)
T PF06632_consen  134 WCLDANSRLQAENEHLQKENERLESEANKLLKQLEKFVNAKEEHEEDLY  182 (342)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            6778888888888888888888877776666666666665555554443


No 168
>PF14988 DUF4515:  Domain of unknown function (DUF4515)
Probab=31.23  E-value=4.7e+02  Score=24.88  Aligned_cols=114  Identities=25%  Similarity=0.314  Sum_probs=69.1

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhhhhhhhhhhhcchhhHHHHHHH
Q 047021          131 QTLEKELATLKSQLQQESQLRQQERKQLTEEQARISVLMSEKQELEEKLVMFSRKASEEVSNAAASKAFSVEDKEKLEKQ  210 (388)
Q Consensus       131 ~~lekEl~~l~~~lqqes~lrqqeqqrla~Eq~~~a~l~aEkqeLeekl~~~~~~~se~~~~~a~~k~~S~eDkekle~Q  210 (388)
                      ...+++++.++.+|+.=.-.+.           .=.....|++.|++.+..+...++......  +.-| ..+|..|+++
T Consensus        57 lq~~k~~~~l~~eLq~l~~~~~-----------~k~~qe~eI~~Le~e~~~~~~e~~~~l~~~--~~qf-l~EK~~LEke  122 (206)
T PF14988_consen   57 LQKEKEQAKLQQELQALKEFRR-----------LKEQQEREIQTLEEELEKMRAEHAEKLQEA--ESQF-LQEKARLEKE  122 (206)
T ss_pred             HHHHHHHHHHHHHHHHhHHHHH-----------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--HHHH-HHHHHHHHHH
Confidence            4455666666666655333311           113456788889998888888777755554  5556 6899999999


Q ss_pred             HhHHHH--HHHHHHhh-HHH----HHHHhhhhhHHHHHHhhhhhhhhhhHHHHHH
Q 047021          211 LHDMAV--VVERLESS-RQK----LLMEIDNQSSEIERLFEENSSLSSSYQESMG  258 (388)
Q Consensus       211 l~dma~--~VeRLE~S-RqK----LL~EID~QS~EIErLf~ENs~Lsss~qea~~  258 (388)
                      ++++-.  +.+|-... .+|    =++-+.+=+.---.+|.||.-|...+..-+.
T Consensus       123 ~~e~~i~~l~e~a~~el~~k~~ale~~A~~~l~e~~~~i~~EN~~L~k~L~~l~~  177 (206)
T PF14988_consen  123 ASELKILQLGERAHKELKKKAQALELAAKKSLDEFTRSIKRENQQLRKELLQLIQ  177 (206)
T ss_pred             HHHhhHHHhHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            865433  33332221 111    0113444444556789999999888766554


No 169
>KOG0946 consensus ER-Golgi vesicle-tethering protein p115 [Intracellular trafficking, secretion, and vesicular transport]
Probab=31.17  E-value=9.4e+02  Score=28.38  Aligned_cols=72  Identities=21%  Similarity=0.302  Sum_probs=44.0

Q ss_pred             hhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHH
Q 047021            5 QLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQL   77 (388)
Q Consensus         5 ~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQL   77 (388)
                      ++.+|++.+.+|.-+|..|..+++........|.+.++.++-... -.+....+.+=+.|+--.--+.+-|++
T Consensus       672 ~~e~lkQ~~~~l~~e~eeL~~~vq~~~s~hsql~~q~~~Lk~qLg-~~~~~~~~~~q~~e~~~t~~eel~a~~  743 (970)
T KOG0946|consen  672 QIENLKQMEKELQVENEELEEEVQDFISEHSQLKDQLDLLKNQLG-IISSKQRDLLQGAEASKTQNEELNAAL  743 (970)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc-ccccchhhHHhHHHhccCChHHHHHHH
Confidence            456677788888888888888888888888888888887765544 011122233333444444444444443


No 170
>PF11336 DUF3138:  Protein of unknown function (DUF3138);  InterPro: IPR021485  This family of proteins with unknown function appear to be restricted to Proteobacteria. 
Probab=31.11  E-value=1.2e+02  Score=32.75  Aligned_cols=79  Identities=23%  Similarity=0.371  Sum_probs=49.3

Q ss_pred             CCCCCCCCCChhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhchh--------
Q 047021          117 LGGISSMGISPDHVQTLEKELATLKSQLQQESQLRQQERKQLTEEQARISVLMSEKQELEEKLVMFSRKASE--------  188 (388)
Q Consensus       117 ~s~~~~~g~s~~~~~~lekEl~~l~~~lqqes~lrqqeqqrla~Eq~~~a~l~aEkqeLeekl~~~~~~~se--------  188 (388)
                      ..+++..+-..+++.+|++++.                            .|..++.||+.+|++-...+|.        
T Consensus        14 l~~~a~a~a~a~~i~~L~~ql~----------------------------aLq~~v~eL~~~laa~~~aa~~gA~~~~~~   65 (514)
T PF11336_consen   14 LPGAAMAAATADQIKALQAQLQ----------------------------ALQDQVNELRAKLAAKPAAAPGGAAIGPAA   65 (514)
T ss_pred             ccccccccCCHHHHHHHHHHHH----------------------------HHHHHHHHHHHHHhcCCCCCCccccccccc
Confidence            3455566667778888877766                            3445566777777665443322        


Q ss_pred             ----hhhhhhhhhhcchhhHHHHHHHHhHHHHHHHHHHh
Q 047021          189 ----EVSNAAASKAFSVEDKEKLEKQLHDMAVVVERLES  223 (388)
Q Consensus       189 ----~~~~~a~~k~~S~eDkekle~Ql~dma~~VeRLE~  223 (388)
                          .+|...+.-.|+.+|-..+.+|+-..-+-|+-|+-
T Consensus        66 ~a~~~aP~~~a~~~~T~d~~~~~~qqiAn~~lKv~~l~d  104 (514)
T PF11336_consen   66 TAAAAAPSSDAQAGLTNDDATEMRQQIANAQLKVESLED  104 (514)
T ss_pred             cccccCCCcccccccChHHHHHHHHHHHhhhhhHHHHhh
Confidence                11222245668888888888887776666766653


No 171
>TIGR02894 DNA_bind_RsfA transcription factor, RsfA family. In a subset of endospore-forming members of the Firmcutes, members of this protein family are found, several to a genome. Two very strongly conserved sequences regions are separated by a highly variable linker region. Much of the linker region was excised from the seed alignment for this model. A characterized member is the prespore-specific transcription RsfA from Bacillus subtilis, previously called YwfN, which is controlled by sigma factor F and seems to fine-tune expression of some genes in the sigma-F regulon. A paralog in Bacillus subtilis is designated YlbO.
Probab=30.80  E-value=1.1e+02  Score=28.72  Aligned_cols=32  Identities=22%  Similarity=0.367  Sum_probs=16.1

Q ss_pred             HHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhh
Q 047021           15 ELEEKNSILSSKLQVKEVESESLQTRINELEQ   46 (388)
Q Consensus        15 ~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~   46 (388)
                      ++..+|..|..++......++.|.+.+..+..
T Consensus       101 ~~~~e~~~l~~e~~~l~~~~e~Le~e~~~L~~  132 (161)
T TIGR02894       101 ALQKENERLKNQNESLQKRNEELEKELEKLRQ  132 (161)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            44445555555555555555555555544443


No 172
>PF08537 NBP1:  Fungal Nap binding protein NBP1;  InterPro: IPR013743 NBP1 is a nuclear protein which has been shown in Saccharomyces cerevisiae (Bakers yeast) to be essential for the G2/M transition of the cell cycle. 
Probab=30.64  E-value=95  Score=32.00  Aligned_cols=39  Identities=15%  Similarity=0.376  Sum_probs=23.4

Q ss_pred             hhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhH
Q 047021            5 QLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINE   43 (388)
Q Consensus         5 ~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd   43 (388)
                      .|.+|+.++.+++.++..+..+|..+...|.-|+.-|+|
T Consensus       183 k~~~l~~~l~~~~~eL~~~~k~L~faqekn~LlqslLdd  221 (323)
T PF08537_consen  183 KIDELEERLNDLEKELEITKKDLKFAQEKNALLQSLLDD  221 (323)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence            344555555555556666666666666666666666664


No 173
>PF03112 DUF244:  Uncharacterized protein family (ORF7) DUF;  InterPro: IPR004335 Many of the proteins in this entry are Borrelia burgdorferi plasmid proteins of unknown function.
Probab=30.54  E-value=53  Score=30.74  Aligned_cols=32  Identities=41%  Similarity=0.574  Sum_probs=28.1

Q ss_pred             HHHHhhhhhHHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHH
Q 047021          228 LLMEIDNQSSEIERLFEENSSLSSSYQESMGIAKHWENQVKDCLKQ  273 (388)
Q Consensus       228 LL~EID~QS~EIErLf~ENs~Lsss~qea~~iaaqWEnQvqDclkQ  273 (388)
                      =|+|||+--.||++.-.||...              |+..+|-||+
T Consensus        75 ~L~EI~~lq~ElnKiqnEn~k~--------------ekp~Kd~LK~  106 (158)
T PF03112_consen   75 SLMEIDSLQTELNKIQNENKKR--------------EKPIKDLLKI  106 (158)
T ss_pred             hHHHHHHHHHHHHHHHHHHHhh--------------hchHHHHHHH
Confidence            4899999999999999999876              7888888776


No 174
>PRK02793 phi X174 lysis protein; Provisional
Probab=30.17  E-value=1.4e+02  Score=24.00  Aligned_cols=55  Identities=24%  Similarity=0.372  Sum_probs=0.0

Q ss_pred             HhhHHHHHHhhHhhhhhh--hhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhHHHHH
Q 047021           32 VESESLQTRINELEQNTV--PSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRLKEAEE   89 (388)
Q Consensus        32 ~~~~~L~~~Lnd~e~n~v--~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRlkEAEE   89 (388)
                      +....+..||.+||-.+.  --.=..|++|-.+-+..+   +.|..+|+.+..|+++.+.
T Consensus         1 m~~~~~e~Ri~~LE~~lafQe~tIe~Ln~~v~~Qq~~I---~~L~~~l~~L~~rl~~~~~   57 (72)
T PRK02793          1 MQDSSLEARLAELESRLAFQEITIEELNVTVTAHEMEM---AKLRDHLRLLTEKLKASQP   57 (72)
T ss_pred             CChhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHhhcc


No 175
>smart00751 BSD domain in transcription factors and synapse-associated proteins.
Probab=30.01  E-value=43  Score=24.71  Aligned_cols=25  Identities=40%  Similarity=0.601  Sum_probs=22.0

Q ss_pred             hhhhhHHHHHHhhhhhhhhhhHHHH
Q 047021          232 IDNQSSEIERLFEENSSLSSSYQES  256 (388)
Q Consensus       232 ID~QS~EIErLf~ENs~Lsss~qea  256 (388)
                      +|.+..+|+.|+.+|.+|..-+.+.
T Consensus         5 ~~~~~~~i~~il~~~p~l~~~~~~l   29 (51)
T smart00751        5 LDAKKEEIESLLKENPLLKKLYNEL   29 (51)
T ss_pred             ccCCHHHHHHHHHHCHHHHHHHHHH
Confidence            5788899999999999999888874


No 176
>COG2433 Uncharacterized conserved protein [Function unknown]
Probab=29.91  E-value=3.2e+02  Score=30.74  Aligned_cols=53  Identities=19%  Similarity=0.340  Sum_probs=30.5

Q ss_pred             hhhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhh---------HhhhhhhhhHHHHh
Q 047021            4 SQLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRIN---------ELEQNTVPSLRKAL   56 (388)
Q Consensus         4 ~~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Ln---------d~e~n~v~slrkaL   56 (388)
                      ..|+.|+..|.+|+.++..+...|...+.+.+.+..+..         ....+++-.|++.|
T Consensus       429 ~~ve~l~~e~~~L~~~~ee~k~eie~L~~~l~~~~r~~~~~~~~~rei~~~~~~I~~L~~~L  490 (652)
T COG2433         429 ETVERLEEENSELKRELEELKREIEKLESELERFRREVRDKVRKDREIRARDRRIERLEKEL  490 (652)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHH
Confidence            345556666666655555555555555555555544443         23456677888888


No 177
>PRK14145 heat shock protein GrpE; Provisional
Probab=29.15  E-value=2.5e+02  Score=26.87  Aligned_cols=61  Identities=20%  Similarity=0.265  Sum_probs=46.4

Q ss_pred             hhhhhHHhhHHHHHHHhhHHHHhHHHHHHHHHHHHHH------------HHHhhhhHHHHHHHHhhHHHHHHH
Q 047021          318 THKNEFLALKGQLAKEQSRAEALSAQVLQLSAQLQQT------------TQAYNGLVRLYKPVLRNIESSLIK  378 (388)
Q Consensus       318 ~~~te~l~LK~eL~kEqsRseaLsaqv~qLsAeL~ra------------~QA~n~L~Rly~PvL~~IEsrLik  378 (388)
                      ....++..|+.++.+-..+...|...+.-+-|+..-.            .-|.-+|.+-+.||+-|+|.-|-.
T Consensus        42 ~~~~e~~~l~~~l~~le~e~~el~d~~lR~~AEfeN~rkR~~kE~e~~~~~a~e~~~~~LLpV~DnLerAl~~  114 (196)
T PRK14145         42 QTVDEIEELKQKLQQKEVEAQEYLDIAQRLKAEFENYRKRTEKEKSEMVEYGKEQVILELLPVMDNFERALAS  114 (196)
T ss_pred             CchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHhHHHHHHhc
Confidence            3445677788888877777788877777777776432            357788999999999999987743


No 178
>PF10883 DUF2681:  Protein of unknown function (DUF2681);  InterPro: IPR020274 This entry contains membrane proteins with no known function.
Probab=29.12  E-value=1.6e+02  Score=25.08  Aligned_cols=36  Identities=28%  Similarity=0.432  Sum_probs=28.7

Q ss_pred             ChhhhhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHH
Q 047021            1 WLLSQLKDFQKKNAELEEKNSILSSKLQVKEVESES   36 (388)
Q Consensus         1 ~l~~~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~   36 (388)
                      |+.=+++.+++.|+.|..+|..|..++..++.++.-
T Consensus        20 y~~~k~~ka~~~~~kL~~en~qlk~Ek~~~~~qvkn   55 (87)
T PF10883_consen   20 YLWWKVKKAKKQNAKLQKENEQLKTEKAVAETQVKN   55 (87)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            455678999999999999999999888776655443


No 179
>KOG2129 consensus Uncharacterized conserved protein H4 [Function unknown]
Probab=28.32  E-value=4.2e+02  Score=28.94  Aligned_cols=83  Identities=34%  Similarity=0.394  Sum_probs=46.2

Q ss_pred             hHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhh----hhhhhHHHh----------hhhHHHHHHHHHHHhH
Q 047021           20 NSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVA----MEKDAAVVA----------REDLSAQLRTLKKRLK   85 (388)
Q Consensus        20 n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~----mEkDaA~va----------reDl~AQLR~~kkRlk   85 (388)
                      -+.|-++++..+.++++|++.|| ..--+ |++   -+||+    |+.|-|---          =+-+-.+|+.++|-.+
T Consensus       203 vN~LwKrmdkLe~ekr~Lq~KlD-qpvs~-p~~---prdia~~~~~~gD~a~~~~~hi~~l~~EveRlrt~l~~Aqk~~~  277 (552)
T KOG2129|consen  203 VNSLWKRMDKLEQEKRYLQKKLD-QPVST-PSL---PRDIAKIPDVHGDEAAAEKLHIDKLQAEVERLRTYLSRAQKSYQ  277 (552)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHhc-CcccC-CCc---hhhhhcCccccCchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            35666777778888888888772 22222 322   25666    888866322          1334445555555444


Q ss_pred             HHHHHHhhhHhhHHHHHHHHHHHHH
Q 047021           86 EAEEEQYRAEEDAAALRAELNSLQQ  110 (388)
Q Consensus        86 EAEEEQyRAEEDaAaLRAEln~lQq  110 (388)
                      | +=-||++||-  ..|.|...||.
T Consensus       278 e-k~~qy~~Ee~--~~reen~rlQr  299 (552)
T KOG2129|consen  278 E-KLMQYRAEEV--DHREENERLQR  299 (552)
T ss_pred             H-HHHHHHHHHh--hHHHHHHHHHH
Confidence            3 2347777764  34555555544


No 180
>PF00170 bZIP_1:  bZIP transcription factor cAMP response element binding (CREB) protein signature fos transforming protein signature jun transcription factor signature;  InterPro: IPR011616  The basic-leucine zipper (bZIP) transcription factors [, ] of eukaryotic are proteins that contain a basic region mediating sequence-specific DNA-binding followed by a leucine zipper region (see IPR002158 from INTERPRO) required for dimerization.; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0043565 sequence-specific DNA binding, 0046983 protein dimerization activity, 0006355 regulation of transcription, DNA-dependent; PDB: 2H7H_B 2OQQ_B 1S9K_E 1JNM_A 1JUN_A 1FOS_H 1A02_J 1T2K_C 1CI6_A 1DH3_C ....
Probab=28.17  E-value=2.5e+02  Score=21.34  Aligned_cols=43  Identities=28%  Similarity=0.451  Sum_probs=30.6

Q ss_pred             HHHHHHHHhHHHHHHHHHHhhHHHHHHHhhhhhHHHHHHhhhh
Q 047021          204 KEKLEKQLHDMAVVVERLESSRQKLLMEIDNQSSEIERLFEEN  246 (388)
Q Consensus       204 kekle~Ql~dma~~VeRLE~SRqKLL~EID~QS~EIErLf~EN  246 (388)
                      |.+-..++.+|-.-|..|++--+.|..+|+.=..++..|-.+|
T Consensus        21 R~RKk~~~~~Le~~~~~L~~en~~L~~~~~~L~~~~~~L~~e~   63 (64)
T PF00170_consen   21 RQRKKQYIEELEEKVEELESENEELKKELEQLKKEIQSLKSEN   63 (64)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             HHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhc
Confidence            4555566667777777777777777777777777777776665


No 181
>PF09304 Cortex-I_coil:  Cortexillin I, coiled coil;  InterPro: IPR015383 This domain is predominantly found in the actin-bundling protein cortexillin I from Dictyostelium discoideum (Slime mold). The domain has a structure consisting of an 18-heptad-repeat alpha-helical coiled-coil, and is a prerequisite for the assembly of Cortexillin I []. ; PDB: 1D7M_A.
Probab=27.99  E-value=4.3e+02  Score=23.49  Aligned_cols=60  Identities=27%  Similarity=0.361  Sum_probs=35.1

Q ss_pred             HHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHH
Q 047021           15 ELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLKK   82 (388)
Q Consensus        15 ~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~kk   82 (388)
                      +|-.+++-|.+-+++....+.+.++|++++..++ ..+++.|+       .--+||.+|...|--+.+
T Consensus        34 eL~kqkd~L~~~l~~L~~q~~s~~qr~~eLqaki-~ea~~~le-------~eK~ak~~l~~r~~k~~~   93 (107)
T PF09304_consen   34 ELAKQKDQLRNALQSLQAQNASRNQRIAELQAKI-DEARRNLE-------DEKQAKLELESRLLKAQK   93 (107)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHH-------HHHHHHHHHHHHHHHHHH
T ss_pred             HHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HHHHHHHH-------HHHHHHHHHHHHHHHHHh
Confidence            3444555666666667777777777777776653 34444442       234566666666544443


No 182
>PRK14143 heat shock protein GrpE; Provisional
Probab=27.79  E-value=4.7e+02  Score=25.65  Aligned_cols=62  Identities=19%  Similarity=0.275  Sum_probs=46.7

Q ss_pred             hhhHHhhHHHHHHHhhHHHHhHHHHHHHHHHHHH------------HHHHhhhhHHHHHHHHhhHHHHHHHhhh
Q 047021          320 KNEFLALKGQLAKEQSRAEALSAQVLQLSAQLQQ------------TTQAYNGLVRLYKPVLRNIESSLIKMKQ  381 (388)
Q Consensus       320 ~te~l~LK~eL~kEqsRseaLsaqv~qLsAeL~r------------a~QA~n~L~Rly~PvL~~IEsrLikmkq  381 (388)
                      ..++..|+.++..-....+.|.-+.+-+.|+..-            -..|..+|.+-+.||+-|+|.-|..++.
T Consensus        66 ~~~~~~l~~el~~l~~e~~elkd~~lR~~AdfeN~RKR~~kE~e~~~~~a~~~~~~~lLpV~DnLerAl~~~~~  139 (238)
T PRK14143         66 AARLAQLEQELESLKQELEELNSQYMRIAADFDNFRKRTSREQEDLRLQLKCNTLSEILPVVDNFERARQQLKP  139 (238)
T ss_pred             hhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHhcccc
Confidence            3457778888777777777777777777776532            2357888999999999999998876543


No 183
>TIGR01010 BexC_CtrB_KpsE polysaccharide export inner-membrane protein, BexC/CtrB/KpsE family. This family contains gamma proteobacterial proteins involved in capsule polysaccharide export.
Probab=27.75  E-value=3.9e+02  Score=26.37  Aligned_cols=10  Identities=0%  Similarity=0.116  Sum_probs=3.6

Q ss_pred             HHhhHHHHHH
Q 047021           10 QKKNAELEEK   19 (388)
Q Consensus        10 q~~~~~le~~   19 (388)
                      +...++++.+
T Consensus       220 ~~~l~~~~~~  229 (362)
T TIGR01010       220 EGELIRVQAQ  229 (362)
T ss_pred             HHHHHHHHHH
Confidence            3333333333


No 184
>PRK00295 hypothetical protein; Provisional
Probab=27.72  E-value=2e+02  Score=22.93  Aligned_cols=50  Identities=36%  Similarity=0.481  Sum_probs=0.0

Q ss_pred             HHHHHhhHhhhhhh--hhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHHHHhHHHH
Q 047021           36 SLQTRINELEQNTV--PSLRKALKDVAMEKDAAVVAREDLSAQLRTLKKRLKEAE   88 (388)
Q Consensus        36 ~L~~~Lnd~e~n~v--~slrkaL~~v~mEkDaA~vareDl~AQLR~~kkRlkEAE   88 (388)
                      ++..|+.+||-+..  --.=..|++|-.+-+.-+   +.|..||+.+..|+++.+
T Consensus         2 ~~e~Ri~~LE~kla~qE~tie~Ln~~v~~Qq~~I---~~L~~ql~~L~~rl~~~~   53 (68)
T PRK00295          2 SLEERVTELESRQAFQDDTIQALNDVLVEQQRVI---ERLQLQMAALIKRQEEMV   53 (68)
T ss_pred             CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHhh


No 185
>PF04111 APG6:  Autophagy protein Apg6;  InterPro: IPR007243 Macroautophagy is a bulk degradation process induced by starvation in eukaryotic cells. In yeast, 15 Apg proteins coordinate the formation of autophagosomes. No molecule involved in autophagy has yet been identified in higher eukaryotes []. The pre-autophagosomal structure contains at least five Apg proteins: Apg1p, Apg2p, Apg5p, Aut7p/Apg8p and Apg16p. It is found in the vacuole []. The C-terminal glycine of Apg12p is conjugated to a lysine residue of Apg5p via an isopeptide bond. During autophagy, cytoplasmic components are enclosed in autophagosomes and delivered to lysosomes/vacuoles. Auotphagy protein 16 (Apg16) has been shown to be bind to Apg5 and is required for the function of the Apg12p-Apg5p conjugate []. Autophagy protein 5 (Apg5) is directly required for the import of aminopeptidase I via the cytoplasm-to-vacuole targeting pathway []. Apg6/Vps30p has two distinct functions in the autophagic process, either associated with the membrane or in a retrieval step of the carboxypeptidase Y sorting pathway [].; GO: 0006914 autophagy; PDB: 3Q8T_A 3VP7_A 4DDP_A.
Probab=27.71  E-value=6.3e+02  Score=25.30  Aligned_cols=78  Identities=22%  Similarity=0.265  Sum_probs=57.2

Q ss_pred             hhHHHHHHHHhHHHHHHHHHHhhHHHHHHHhhhhhHHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 047021          202 EDKEKLEKQLHDMAVVVERLESSRQKLLMEIDNQSSEIERLFEENSSLSSSYQESMGIAKHWENQVKDCLKQNEELRG  279 (388)
Q Consensus       202 eDkekle~Ql~dma~~VeRLE~SRqKLL~EID~QS~EIErLf~ENs~Lsss~qea~~iaaqWEnQvqDclkQNeeLr~  279 (388)
                      .+...+.+.|..+=.-.+.|..-..+|-.|.+....+.++.+.+.+.+---+.+...--....+|..-...+.+.|+.
T Consensus        57 ~Ee~~l~~eL~~LE~e~~~l~~el~~le~e~~~l~~eE~~~~~~~n~~~~~l~~~~~e~~sl~~q~~~~~~~L~~L~k  134 (314)
T PF04111_consen   57 QEEEELLQELEELEKEREELDQELEELEEELEELDEEEEEYWREYNELQLELIEFQEERDSLKNQYEYASNQLDRLRK  134 (314)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCHHT
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence            333444444444444455555555556666666667788999999999988888888888999999999999999885


No 186
>PF09789 DUF2353:  Uncharacterized coiled-coil protein (DUF2353);  InterPro: IPR019179  Members of this family have been annotated as being coiled-coil domain-containing protein 149, however they currently have no known function. 
Probab=27.69  E-value=3.7e+02  Score=27.62  Aligned_cols=97  Identities=23%  Similarity=0.325  Sum_probs=67.3

Q ss_pred             hhhhhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHHHhhhhHHHHHHHHH
Q 047021            2 LLSQLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAVVAREDLSAQLRTLK   81 (388)
Q Consensus         2 l~~~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~vareDl~AQLR~~k   81 (388)
                      |+.++..+..++.+|+.+.+-+.-.+++...++|......+-|-.    .|--.|    =..+.-+|-=|.|.+-=|-++
T Consensus       131 lV~qLEk~~~q~~qLe~d~qs~lDEkeEl~~ERD~yk~K~~RLN~----ELn~~L----~g~~~rivDIDaLi~ENRyL~  202 (319)
T PF09789_consen  131 LVEQLEKLREQIEQLERDLQSLLDEKEELVTERDAYKCKAHRLNH----ELNYIL----NGDENRIVDIDALIMENRYLK  202 (319)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHh----CCCCCCcccHHHHHHHHHHHH
Confidence            577888999999999999999999999999999998754443311    122222    222234445678889999999


Q ss_pred             HHhHHHHHHHhhhHhhHHHHHHHHHHHHHHHh
Q 047021           82 KRLKEAEEEQYRAEEDAAALRAELNSLQQQAM  113 (388)
Q Consensus        82 kRlkEAEEEQyRAEEDaAaLRAEln~lQqq~~  113 (388)
                      .|++.++||-       ..++.-|+-...-+.
T Consensus       203 erl~q~qeE~-------~l~k~~i~KYK~~le  227 (319)
T PF09789_consen  203 ERLKQLQEEK-------ELLKQTINKYKSALE  227 (319)
T ss_pred             HHHHHHHHHH-------HHHHHHHHHHHHHHH
Confidence            9999988774       344444554444443


No 187
>PF10481 CENP-F_N:  Cenp-F N-terminal domain;  InterPro: IPR018463 Mitosin or centromere-associated protein-F (Cenp-F) is found bound across the centromere as one of the proteins of the outer layer of the kinetochore []. Most of the kinetochore/centromere functions appear to depend upon binding of the C-terminal part of the molecule, whereas the N-terminal part, here, may be a cytoplasmic player in controlling the function of microtubules and dynein [].
Probab=27.45  E-value=7.1e+02  Score=25.76  Aligned_cols=118  Identities=28%  Similarity=0.361  Sum_probs=70.0

Q ss_pred             HHHHHHhHHHHHHHHHHhhHHHHHHHhhhhhHHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHH-------H
Q 047021          206 KLEKQLHDMAVVVERLESSRQKLLMEIDNQSSEIERLFEENSSLSSSYQESMGIAKHWENQVKDCLKQNEEL-------R  278 (388)
Q Consensus       206 kle~Ql~dma~~VeRLE~SRqKLL~EID~QS~EIErLf~ENs~Lsss~qea~~iaaqWEnQvqDclkQNeeL-------r  278 (388)
                      -|.+-.+-+....+.|+-+||||--|+-.--.-+--                     -|-|+--|=+|.+.|       +
T Consensus        64 ~LkREnq~l~e~c~~lek~rqKlshdlq~Ke~qv~~---------------------lEgQl~s~Kkqie~Leqelkr~K  122 (307)
T PF10481_consen   64 ALKRENQSLMESCENLEKTRQKLSHDLQVKESQVNF---------------------LEGQLNSCKKQIEKLEQELKRCK  122 (307)
T ss_pred             hhhhhhhhHHHHHHHHHHHHHHhhHHHhhhHHHHHH---------------------HHHHHHHHHHHHHHHHHHHHHHH
Confidence            355667788888999999999999887544332221                     244555555555544       4


Q ss_pred             HHHHHHHhhhhccC-CCCccccccccCCCCCC-ccccCCCchh-hhhHHhhHHHHHHHhhHHHHhHHHHHHHHHH
Q 047021          279 GILEKLRTEQASFM-SVNDKDILRDFSEPSKD-AVSQNGSETH-KNEFLALKGQLAKEQSRAEALSAQVLQLSAQ  350 (388)
Q Consensus       279 ~lLeklR~EQa~~~-~~~~~~~~g~~~e~~~d-g~~~~~~~~~-~te~l~LK~eL~kEqsRseaLsaqv~qLsAe  350 (388)
                      .-||+.+.+..+.- +-+++.|      |.+. .+.-+.++.| .+-+..|+...-+|--..-.|.++|..|-+.
T Consensus       123 sELErsQ~~~~~~~~sl~~~st------pqk~f~~p~tp~q~~~~sk~e~L~ekynkeveerkrle~e~k~lq~k  191 (307)
T PF10481_consen  123 SELERSQQAASSGDVSLNPCST------PQKSFATPLTPSQYYSDSKYEELQEKYNKEVEERKRLEAEVKALQAK  191 (307)
T ss_pred             HHHHHHHHhhccCCccccccCC------chhhccCCCChhhhhhhhhHHHHHHHHHHHHHHHhhHHHHHHHHhcc
Confidence            55555554333111 1112211      1111 1112223333 3468899999999999999999999888653


No 188
>PF12329 TMF_DNA_bd:  TATA element modulatory factor 1 DNA binding;  InterPro: IPR022092  This is the middle region of a family of TATA element modulatory factor 1 proteins conserved in eukaryotes that contains at its N-terminal section a number of leucine zippers that could potentially form coiled coil structures. The whole proteins bind to the TATA element of some RNA polymerase II promoters and repress their activity. by competing with the binding of TATA binding protein. TMFs are evolutionarily conserved golgins that bind Rab6, a ubiquitous ras-like GTP-binding Golgi protein, and contribute to Golgi organisation in animal [] and plant [] cells. 
Probab=27.33  E-value=2.8e+02  Score=22.38  Aligned_cols=15  Identities=47%  Similarity=0.800  Sum_probs=6.4

Q ss_pred             hhHHHHHHHHHHHhH
Q 047021           71 EDLSAQLRTLKKRLK   85 (388)
Q Consensus        71 eDl~AQLR~~kkRlk   85 (388)
                      +++...+-.++.|++
T Consensus        57 ~~~e~~~~~l~~~l~   71 (74)
T PF12329_consen   57 EELEKELESLEERLK   71 (74)
T ss_pred             HHHHHHHHHHHHHhh
Confidence            334444444444444


No 189
>PF10805 DUF2730:  Protein of unknown function (DUF2730);  InterPro: IPR020269 This entry represents a family of various hypothetical proteins. The proteins, which include HI1498 and Gp25, from phage Mu, are currently uncharacterised.
Probab=27.06  E-value=1.7e+02  Score=24.83  Aligned_cols=33  Identities=18%  Similarity=0.162  Sum_probs=13.1

Q ss_pred             hhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHh
Q 047021           24 SSKLQVKEVESESLQTRINELEQNTVPSLRKAL   56 (388)
Q Consensus        24 ~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL   56 (388)
                      ...+.+..-+...+..+++.+.+.+---|.+.|
T Consensus        71 ~l~l~el~G~~~~l~~~l~~v~~~~~lLlE~~l  103 (106)
T PF10805_consen   71 QLELAELRGELKELSARLQGVSHQLDLLLENEL  103 (106)
T ss_pred             HHHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHh
Confidence            333333333344444444444443333333333


No 190
>TIGR01000 bacteriocin_acc bacteriocin secretion accessory protein. This family represents an accessory protein that works with the bacteriocin maturation and ABC transport secretion protein described by TIGR01193.
Probab=26.71  E-value=7.1e+02  Score=25.56  Aligned_cols=20  Identities=20%  Similarity=0.459  Sum_probs=10.3

Q ss_pred             hhhHHHHHHHHHHHHHHHHH
Q 047021          127 PDHVQTLEKELATLKSQLQQ  146 (388)
Q Consensus       127 ~~~~~~lekEl~~l~~~lqq  146 (388)
                      .+++...+.++..++.++..
T Consensus       290 ~~~l~~~~~~l~~~~~~l~~  309 (457)
T TIGR01000       290 KQEITDLNQKLLELESKIKS  309 (457)
T ss_pred             HHHHHHHHHHHHHHHHHHHH
Confidence            44455555555555555543


No 191
>PF06156 DUF972:  Protein of unknown function (DUF972);  InterPro: IPR010377 FUNCTION: Involved in initiation control of chromosome replication. SUBUNIT: Interacts with both DnaA and DnaN, acting as a bridge between these two proteins. SIMILARITY: Belongs to the YabA family.
Probab=26.10  E-value=1.7e+02  Score=25.28  Aligned_cols=44  Identities=32%  Similarity=0.480  Sum_probs=23.0

Q ss_pred             hhhhHHHHHhhHHHHHHhHHhhhhHHHH-------HHhhHHHHHHhhHhhh
Q 047021            3 LSQLKDFQKKNAELEEKNSILSSKLQVK-------EVESESLQTRINELEQ   46 (388)
Q Consensus         3 ~~~v~~lq~~~~~le~~n~~L~sk~~~k-------~~~~~~L~~~Lnd~e~   46 (388)
                      +.+|..|++....+-.+...|...+.+.       .++|+-|..+|...+.
T Consensus         7 ~~~l~~le~~l~~l~~~~~~LK~~~~~l~EEN~~L~~EN~~Lr~~l~~~~~   57 (107)
T PF06156_consen    7 FDRLDQLEQQLGQLLEELEELKKQLQELLEENARLRIENEHLRERLEELEQ   57 (107)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence            3444444444444444444444444433       3466666777776655


No 192
>PRK13922 rod shape-determining protein MreC; Provisional
Probab=25.97  E-value=1.2e+02  Score=28.74  Aligned_cols=39  Identities=33%  Similarity=0.427  Sum_probs=28.9

Q ss_pred             hhhHHHHHhhHHHHHHhHHhhhhHH---HHHHhhHHHHHHhh
Q 047021            4 SQLKDFQKKNAELEEKNSILSSKLQ---VKEVESESLQTRIN   42 (388)
Q Consensus         4 ~~v~~lq~~~~~le~~n~~L~sk~~---~k~~~~~~L~~~Ln   42 (388)
                      ....++.+.|.+|+++|..|..++.   ....+++.|.+-|+
T Consensus        69 ~~~~~l~~en~~L~~e~~~l~~~~~~~~~l~~en~~L~~lL~  110 (276)
T PRK13922         69 ASLFDLREENEELKKELLELESRLQELEQLEAENARLRELLN  110 (276)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence            3567788888899988888888777   55666766666555


No 193
>PF13870 DUF4201:  Domain of unknown function (DUF4201)
Probab=25.96  E-value=4.3e+02  Score=23.70  Aligned_cols=76  Identities=20%  Similarity=0.242  Sum_probs=33.7

Q ss_pred             HHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhh-hhhhHHHhhhhHHHHHHHHHHHhHHH
Q 047021            9 FQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAM-EKDAAVVAREDLSAQLRTLKKRLKEA   87 (388)
Q Consensus         9 lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~m-EkDaA~vareDl~AQLR~~kkRlkEA   87 (388)
                      +.....+.+.....+...+.....+.+.+.+..+.+....+..-.-+|    | +=|..+-.-+++-..++.++++...+
T Consensus        96 l~~~l~~~~~~~~~~r~~l~~~k~~r~k~~~~~~~l~~~~~~~~~P~l----l~Dy~~~~~~~~~l~~~i~~l~rk~~~l  171 (177)
T PF13870_consen   96 LKQELKDREEELAKLREELYRVKKERDKLRKQNKKLRQQGGLLGVPAL----LRDYDKTKEEVEELRKEIKELERKVEIL  171 (177)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCcHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            334444444444444444444444444444444444443332111111    2 23344444455555555666655555


Q ss_pred             H
Q 047021           88 E   88 (388)
Q Consensus        88 E   88 (388)
                      +
T Consensus       172 ~  172 (177)
T PF13870_consen  172 E  172 (177)
T ss_pred             H
Confidence            4


No 194
>KOG0980 consensus Actin-binding protein SLA2/Huntingtin-interacting protein Hip1 [Cytoskeleton]
Probab=25.47  E-value=1.2e+03  Score=27.71  Aligned_cols=71  Identities=23%  Similarity=0.260  Sum_probs=40.0

Q ss_pred             hhHHHHHHHHhHHHHHHHHHHhhHHHHH--------------HHhhhhhHHHHHHhhhhhhhhhhHHHHHHHHHHHHHHH
Q 047021          202 EDKEKLEKQLHDMAVVVERLESSRQKLL--------------MEIDNQSSEIERLFEENSSLSSSYQESMGIAKHWENQV  267 (388)
Q Consensus       202 eDkekle~Ql~dma~~VeRLE~SRqKLL--------------~EID~QS~EIErLf~ENs~Lsss~qea~~iaaqWEnQv  267 (388)
                      ++...++++...++.+||++..-++-+.              .|.-.=--|+++|-..=+++++++       ++|-.|+
T Consensus       459 ~s~~~~~~~~~~L~d~le~~~~~~~~~~~K~e~~~~~le~l~~El~~l~~e~~~lq~~~~~~~qs~-------~~~~~~l  531 (980)
T KOG0980|consen  459 QSIDDVEEENTNLNDQLEELQRAAGRAETKTESQAKALESLRQELALLLIELEELQRTLSNLAQSH-------NNQLAQL  531 (980)
T ss_pred             HhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHHHHH-------HHHHHHH
Confidence            4444666677777777776655444321              122222235555555544444444       5688888


Q ss_pred             HHHHHHHHHHHH
Q 047021          268 KDCLKQNEELRG  279 (388)
Q Consensus       268 qDclkQNeeLr~  279 (388)
                      .|-++|-+.+-.
T Consensus       532 ~~~l~~KD~~~~  543 (980)
T KOG0980|consen  532 EDLLKQKDRLAA  543 (980)
T ss_pred             HHHHHhhHHHHH
Confidence            888888665543


No 195
>PF14389 Lzipper-MIP1:  Leucine-zipper of ternary complex factor MIP1
Probab=25.45  E-value=3.9e+02  Score=22.15  Aligned_cols=27  Identities=41%  Similarity=0.585  Sum_probs=21.4

Q ss_pred             ChhhHHHHHHHHHHHHHHHHHHHHHHH
Q 047021          126 SPDHVQTLEKELATLKSQLQQESQLRQ  152 (388)
Q Consensus       126 s~~~~~~lekEl~~l~~~lqqes~lrq  152 (388)
                      ..+...+|++++..|+.+|+.+..+|.
T Consensus         6 ~~~~r~~LeqeV~~Lq~~L~~E~~~r~   32 (88)
T PF14389_consen    6 LHERRSALEQEVAELQKQLQEEQDLRR   32 (88)
T ss_pred             hhhHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            445677899999999999988877754


No 196
>PF03962 Mnd1:  Mnd1 family;  InterPro: IPR005647 This family of proteins includes meiotic nuclear division protein 1 (MND1) from Saccharomyces cerevisiae (Baker's yeast). The mnd1 protein forms a complex with hop2 to promote homologous chromosome pairing and meiotic double-strand break repair [].
Probab=25.09  E-value=5.7e+02  Score=23.85  Aligned_cols=36  Identities=22%  Similarity=0.262  Sum_probs=19.6

Q ss_pred             HHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhh
Q 047021           16 LEEKNSILSSKLQVKEVESESLQTRINELEQNTVPS   51 (388)
Q Consensus        16 le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~s   51 (388)
                      +......|..+++....++..|...|.+......++
T Consensus        67 ~~~~~~~l~~~~~~~~~~i~~l~~~i~~~~~~r~~~  102 (188)
T PF03962_consen   67 RQNKLEKLQKEIEELEKKIEELEEKIEEAKKGREES  102 (188)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccc
Confidence            344445555555556666666666666554444444


No 197
>PF14645 Chibby:  Chibby family
Probab=25.09  E-value=76  Score=27.79  Aligned_cols=25  Identities=32%  Similarity=0.573  Sum_probs=20.7

Q ss_pred             hhhHHHHHhhHHHHHHhHHhhhhHH
Q 047021            4 SQLKDFQKKNAELEEKNSILSSKLQ   28 (388)
Q Consensus         4 ~~v~~lq~~~~~le~~n~~L~sk~~   28 (388)
                      .-.+.+.++|..|++||++|.-|++
T Consensus        71 ~~~~~l~~~n~~L~EENN~Lklk~e   95 (116)
T PF14645_consen   71 EENQRLRKENQQLEEENNLLKLKIE   95 (116)
T ss_pred             hHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            3457789999999999999987764


No 198
>PF08826 DMPK_coil:  DMPK coiled coil domain like;  InterPro: IPR014930 This domain is found in the myotonic dystrophy protein kinase (DMPK) and adopts a coiled coil structure. It plays a role in dimerisation []. ; GO: 0004674 protein serine/threonine kinase activity, 0005524 ATP binding, 0006468 protein phosphorylation; PDB: 1WT6_D.
Probab=24.85  E-value=3.6e+02  Score=21.51  Aligned_cols=49  Identities=20%  Similarity=0.313  Sum_probs=30.6

Q ss_pred             hhHHHhhhhHHHHHHHHHHHhHHHHHHHhhhHhhHHHHHHHHHHHHHHH
Q 047021           64 DAAVVAREDLSAQLRTLKKRLKEAEEEQYRAEEDAAALRAELNSLQQQA  112 (388)
Q Consensus        64 DaA~vareDl~AQLR~~kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq~  112 (388)
                      |+=+.|+..+.-.|+.+|.--..++..-.-||..-..|-++|+.|..++
T Consensus         7 ~~EirakQ~~~eEL~kvk~~n~~~e~kLqeaE~rn~eL~~ei~~L~~e~   55 (61)
T PF08826_consen    7 EAEIRAKQAIQEELTKVKSANLAFESKLQEAEKRNRELEQEIERLKKEM   55 (61)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            5666777777777766665555555555555556666666766666554


No 199
>KOG1029 consensus Endocytic adaptor protein intersectin [Signal transduction mechanisms; Intracellular trafficking, secretion, and vesicular transport]
Probab=24.85  E-value=1.2e+03  Score=27.62  Aligned_cols=26  Identities=27%  Similarity=0.437  Sum_probs=17.6

Q ss_pred             hhhhhHHHHHhhHHHHHHhHHhhhhH
Q 047021            2 LLSQLKDFQKKNAELEEKNSILSSKL   27 (388)
Q Consensus         2 l~~~v~~lq~~~~~le~~n~~L~sk~   27 (388)
                      |..++|++|+++..|--+..+|.-++
T Consensus       491 lqarikE~q~kl~~l~~Ekq~l~~ql  516 (1118)
T KOG1029|consen  491 LQARIKELQEKLQKLAPEKQELNHQL  516 (1118)
T ss_pred             HHHHHHHHHHHHHhhhhHHHHHHHHH
Confidence            45677888888777766666665444


No 200
>PF10779 XhlA:  Haemolysin XhlA;  InterPro: IPR019715 Haemolysin XhlA is a cell-surface associated haemolysin that lyses the two most prevalent types of insect immune cells (granulocytes and plasmatocytes) as well as rabbit and horse erythrocytes []. 
Probab=24.35  E-value=2.5e+02  Score=22.12  Aligned_cols=43  Identities=21%  Similarity=0.327  Sum_probs=23.7

Q ss_pred             hHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhh
Q 047021            6 LKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNT   48 (388)
Q Consensus         6 v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~   48 (388)
                      |...+.+...++...+.|.......+.++..+..+|++++.|+
T Consensus         8 l~~ie~~l~~~~~~i~~lE~~~~~~e~~i~~~~~~l~~I~~n~   50 (71)
T PF10779_consen    8 LNRIETKLDNHEERIDKLEKRDAANEKDIKNLNKQLEKIKSNT   50 (71)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            3334444444444444555555555555666677777776665


No 201
>PRK10997 yieM hypothetical protein; Provisional
Probab=24.28  E-value=4.3e+02  Score=28.56  Aligned_cols=83  Identities=29%  Similarity=0.331  Sum_probs=55.4

Q ss_pred             hhhhhcchhhHHHHHHHHhHHHHHHHHHHhhHHHHHHHhhh---hhHHHHHHhhhhhhh-------------hhhHHHHH
Q 047021          194 AASKAFSVEDKEKLEKQLHDMAVVVERLESSRQKLLMEIDN---QSSEIERLFEENSSL-------------SSSYQESM  257 (388)
Q Consensus       194 a~~k~~S~eDkekle~Ql~dma~~VeRLE~SRqKLL~EID~---QS~EIErLf~ENs~L-------------sss~qea~  257 (388)
                      +++.-|-..=+..|..++..  ..++-+|..|.|||.||-.   -+..+|..|.+|..-             ..+++.. 
T Consensus       127 ~~~~~f~~~w~~~l~~~~~~--~~~~~~e~~re~~l~el~~r~e~~~~l~~v~~~~~~~~grlWd~s~g~l~~~d~~~~-  203 (487)
T PRK10997        127 ALHTLFLQRWRLSLVVQTTT--LNQQLLEQEREQLLAELQQRMTLSGQLEPVLAENDEAAGRLWDMSAGQLKRGDYQLI-  203 (487)
T ss_pred             HHHHHHHHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhccccccccCcccccCHHHH-
Confidence            34555544445666666543  3578899999999999953   356677788877641             1122222 


Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q 047021          258 GIAKHWENQVKDCLKQNEELRGILEKLRT  286 (388)
Q Consensus       258 ~iaaqWEnQvqDclkQNeeLr~lLeklR~  286 (388)
                             .+.-+-|++|.+|+.+.+.|=.
T Consensus       204 -------~~~a~~L~~~p~L~~ia~~LGR  225 (487)
T PRK10997        204 -------VQYGEFLKQQPELQKLAEQLGR  225 (487)
T ss_pred             -------HHHHHHHHhCHHHHHHHHHhCc
Confidence                   2456789999999999999643


No 202
>PRK03947 prefoldin subunit alpha; Reviewed
Probab=24.17  E-value=4.6e+02  Score=22.52  Aligned_cols=42  Identities=29%  Similarity=0.368  Sum_probs=30.8

Q ss_pred             hHHHHHHHHHHHhHHHHHHHhhhHhhHHHHHHHHHHHHHHHh
Q 047021           72 DLSAQLRTLKKRLKEAEEEQYRAEEDAAALRAELNSLQQQAM  113 (388)
Q Consensus        72 Dl~AQLR~~kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq~~  113 (388)
                      ++..-+-.+++|+...++.--.=+.+.+.++.+++.+++.++
T Consensus        91 ~~~eA~~~l~~~~~~l~~~~~~l~~~l~~~~~~~~~~~~~l~  132 (140)
T PRK03947         91 DLDEAIEILDKRKEELEKALEKLEEALQKLASRIAQLAQELQ  132 (140)
T ss_pred             cHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            455556667777777777777777788888888888877764


No 203
>PF05377 FlaC_arch:  Flagella accessory protein C (FlaC);  InterPro: IPR008039 Although archaeal flagella appear superficially similar to those of bacteria, they are quite distinct []. In several archaea, the flagellin genes are followed immediately by the flagellar accessory genes flaCDEFGHIJ. The gene products may have a role in translocation, secretion, or assembly of the flagellum. FlaC is a protein whose exact role is unknown but it has been shown to be membrane-associated (by immuno-blotting fractionated cells) [].
Probab=23.69  E-value=2.2e+02  Score=22.55  Aligned_cols=39  Identities=5%  Similarity=0.220  Sum_probs=28.5

Q ss_pred             hhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhH
Q 047021            5 QLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINE   43 (388)
Q Consensus         5 ~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd   43 (388)
                      ++.+|+++.-.++....-+.++++.....++.+.+...+
T Consensus         1 Ri~elEn~~~~~~~~i~tvk~en~~i~~~ve~i~envk~   39 (55)
T PF05377_consen    1 RIDELENELPRIESSINTVKKENEEISESVEKIEENVKD   39 (55)
T ss_pred             CHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            467788888888877777777777777777776665543


No 204
>PF02151 UVR:  UvrB/uvrC motif;  InterPro: IPR001943 During the process of Escherichia coli nucleotide excision repair, DNA damage recognition and processing are achieved by the action of the uvrA, uvrB, and uvrC gene products []. UvrB and UvrC share a common domain of around 35 amino acids, the so called UVR domain. This domain in UvrB can interact with the homologous domain in UvrC throughout a putative coiled coil structure. This interaction is important for the incision of the damaged strand [].; GO: 0003677 DNA binding, 0004518 nuclease activity, 0006289 nucleotide-excision repair; PDB: 3PXG_D 3PXI_C 1E52_B 1QOJ_B 2D7D_B 2NMV_B.
Probab=23.58  E-value=89  Score=21.81  Aligned_cols=16  Identities=44%  Similarity=0.684  Sum_probs=14.4

Q ss_pred             hhHHHHHHHHHHHHHH
Q 047021           96 EDAAALRAELNSLQQQ  111 (388)
Q Consensus        96 EDaAaLRAEln~lQqq  111 (388)
                      |.||.+|-+|+.|+.|
T Consensus        21 E~Aa~~Rd~i~~l~~q   36 (36)
T PF02151_consen   21 EKAARLRDQIKALKKQ   36 (36)
T ss_dssp             HHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHHcC
Confidence            6899999999999875


No 205
>COG4372 Uncharacterized protein conserved in bacteria with the myosin-like domain [Function unknown]
Probab=23.24  E-value=9.9e+02  Score=26.03  Aligned_cols=96  Identities=28%  Similarity=0.401  Sum_probs=61.7

Q ss_pred             hhhHHHHHHHHHHHhHHHHHHHhhhHhhHHHHHHHHHHHHHHHhcCCCCCCCCCCCChhhHHHHHHHHHHHHHHHHHHHH
Q 047021           70 REDLSAQLRTLKKRLKEAEEEQYRAEEDAAALRAELNSLQQQAMDGSLGGISSMGISPDHVQTLEKELATLKSQLQQESQ  149 (388)
Q Consensus        70 reDl~AQLR~~kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq~~~~~~s~~~~~g~s~~~~~~lekEl~~l~~~lqqes~  149 (388)
                      =||+..|||.++--|+-|+++---+|-.--+.|.|+..--+                  +-...+.+++.++-.+-...+
T Consensus        76 lddi~~qlr~~rtel~~a~~~k~~~e~er~~~~~El~~~r~------------------e~~~v~~~~~~a~~n~~kAqQ  137 (499)
T COG4372          76 LDDIRPQLRALRTELGTAQGEKRAAETEREAARSELQKARQ------------------EREAVRQELAAARQNLAKAQQ  137 (499)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH------------------HHHHHHHHHHHHHHHHHHHHH
Confidence            58999999999999999999888888777777777754321                  223344444443333323222


Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHh
Q 047021          150 LRQQERKQLTEEQARISVLMSEKQELEEKLVMFS  183 (388)
Q Consensus       150 lrqqeqqrla~Eq~~~a~l~aEkqeLeekl~~~~  183 (388)
                      .-.+-..+.-.=+.++.+|-.+-++|+++.+++-
T Consensus       138 ~lar~t~Q~q~lqtrl~~l~~qr~ql~aq~qsl~  171 (499)
T COG4372         138 ELARLTKQAQDLQTRLKTLAEQRRQLEAQAQSLQ  171 (499)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            2222233333456677888888888888887754


No 206
>PF10506 MCC-bdg_PDZ:  PDZ domain of MCC-2 bdg protein for Usher syndrome;  InterPro: IPR019536  The entry represents a protein that has a high homology to the tumour suppressor Usher syndrome type-1C protein-binding protein 1, or known as MCC2 (mutated in colon cancer).  MCC2 protein binds the first PDZ domain of AIE-75 with its C-terminal amino acids -DTFL. A possible role of MCC2 as a tumour suppressor has been put forward. The carboxyl terminus of the predicted protein was DTFL which matched the consensus motif X-S/T-X-phi (phi: hydrophobic amino acid residue) for binding to the PDZ domain of AIE-75 [, ]. 
Probab=22.87  E-value=2.7e+02  Score=22.60  Aligned_cols=48  Identities=33%  Similarity=0.517  Sum_probs=39.9

Q ss_pred             HHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHh
Q 047021            8 DFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKAL   56 (388)
Q Consensus         8 ~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL   56 (388)
                      .|+.+..+|...|.+|.+.+++-....+.|..+|---|-|.+- ||-|+
T Consensus         2 rL~~~ie~L~~~n~~L~~~le~~k~~se~Ls~~lgk~es~~~a-lrlal   49 (67)
T PF10506_consen    2 RLKRRIEELKSQNEMLSSTLEERKQQSEELSMDLGKYESNATA-LRLAL   49 (67)
T ss_pred             hHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhHH-HHHHH
Confidence            5778889999999999999999999999999998887777653 24444


No 207
>PF10458 Val_tRNA-synt_C:  Valyl tRNA synthetase tRNA binding arm;  InterPro: IPR019499 The aminoacyl-tRNA synthetases (6.1.1. from EC) catalyse the attachment of an amino acid to its cognate transfer RNA molecule in a highly specific two-step reaction. These proteins differ widely in size and oligomeric state, and have limited sequence homology []. The 20 aminoacyl-tRNA synthetases are divided into two classes, I and II. Class I aminoacyl-tRNA synthetases contain a characteristic Rossman fold catalytic domain and are mostly monomeric []. Class II aminoacyl-tRNA synthetases share an anti-parallel beta-sheet fold flanked by alpha-helices [], and are mostly dimeric or multimeric, containing at least three conserved regions [, , ]. However, tRNA binding involves an alpha-helical structure that is conserved between class I and class II synthetases. In reactions catalysed by the class I aminoacyl-tRNA synthetases, the aminoacyl group is coupled to the 2'-hydroxyl of the tRNA, while, in class II reactions, the 3'-hydroxyl site is preferred. The synthetases specific for arginine, cysteine, glutamic acid, glutamine, isoleucine, leucine, methionine, tyrosine, tryptophan and valine belong to class I synthetases. The synthetases specific for alanine, asparagine, aspartic acid, glycine, histidine, lysine, phenylalanine, proline, serine, and threonine belong to class-II synthetases []. Based on their mode of binding to the tRNA acceptor stem, both classes of tRNA synthetases have been subdivided into three subclasses, designated 1a, 1b, 1c and 2a, 2b, 2c. This entry represents the C-terminal domain of Valyl-tRNA synthetase, which consists of two helices in a long alpha-hairpin. Valyl-tRNA synthetase (6.1.1.9 from EC) is an alpha monomer that belongs to class Ia.; GO: 0000166 nucleotide binding, 0004832 valine-tRNA ligase activity, 0005524 ATP binding, 0006438 valyl-tRNA aminoacylation, 0005737 cytoplasm; PDB: 1IVS_B 1GAX_B.
Probab=22.85  E-value=3e+02  Score=21.32  Aligned_cols=24  Identities=38%  Similarity=0.421  Sum_probs=11.1

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 047021          154 ERKQLTEEQARISVLMSEKQELEEKLV  180 (388)
Q Consensus       154 eqqrla~Eq~~~a~l~aEkqeLeekl~  180 (388)
                      |..||..+.   ..+..+...++.+|+
T Consensus         5 E~~rL~Kel---~kl~~~i~~~~~kL~   28 (66)
T PF10458_consen    5 EIERLEKEL---EKLEKEIERLEKKLS   28 (66)
T ss_dssp             HHHHHHHHH---HHHHHHHHHHHHHHC
T ss_pred             HHHHHHHHH---HHHHHHHHHHHHHHc
Confidence            344444444   334444444555544


No 208
>PF04568 IATP:  Mitochondrial ATPase inhibitor, IATP;  InterPro: IPR007648  ATP synthase inhibitor prevents the enzyme from switching to ATP hydrolysis during collapse of the electrochemical gradient, for example during oxygen deprivation [] ATP synthase inhibitor forms a one to one complex with the F1 ATPase, possibly by binding at the alpha-beta interface. It is thought to inhibit ATP synthesis by preventing the release of ATP []. The minimum inhibitory region for bovine inhibitor (P01096 from SWISSPROT) is from residues 39 to 72 []. The inhibitor has two oligomeric states, dimer (the active state) and tetramer. At low pH , the inhibitor forms a dimer via antiparallel coiled coil interactions between the C-terminal regions of two monomers. At high pH, the inhibitor forms tetramers and higher oligomers by coiled coil interactions involving the N terminus and inhibitory region, thus preventing the inhibitory activity []. ; GO: 0004857 enzyme inhibitor activity, 0045980 negative regulation of nucleotide metabolic process, 0005739 mitochondrion; PDB: 1GMJ_B 1OHH_H 1HF9_B 2V7Q_J.
Probab=22.78  E-value=2.6e+02  Score=24.20  Aligned_cols=47  Identities=28%  Similarity=0.295  Sum_probs=33.3

Q ss_pred             hhhhcchhhHHHHHHHHhHHHHHHHHHHhhHHHHHHHhhhhhHHHHHHh
Q 047021          195 ASKAFSVEDKEKLEKQLHDMAVVVERLESSRQKLLMEIDNQSSEIERLF  243 (388)
Q Consensus       195 ~~k~~S~eDkekle~Ql~dma~~VeRLE~SRqKLL~EID~QS~EIErLf  243 (388)
                      +..+|+.-+++.=+.  +=-..--|+|+.-|.||-.||+.+-.+|++|=
T Consensus        50 ag~~f~krE~A~E~~--Y~r~~EkEqL~~Lk~kl~~e~~~~~k~i~~le   96 (100)
T PF04568_consen   50 AGGAFGKREAAQEEQ--YFRKKEKEQLKKLKEKLKEEIEHHRKEIDELE   96 (100)
T ss_dssp             CCHHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             hCCccchHHHhhHHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            455676544443222  22233479999999999999999999999873


No 209
>cd00890 Prefoldin Prefoldin is a hexameric molecular chaperone complex, found in both eukaryotes and archaea, that binds and stabilizes newly synthesized polypeptides allowing them to fold correctly.  The complex contains two alpha and four beta subunits, the two subunits being evolutionarily related. In archaea, there is usually only one gene for each subunit while in eukaryotes there two or more paralogous genes encoding each subunit adding heterogeneity to the structure of the hexamer. The structure of the complex consists of a double beta barrel assembly with six protruding coiled-coils.
Probab=21.77  E-value=4.6e+02  Score=21.58  Aligned_cols=36  Identities=28%  Similarity=0.434  Sum_probs=25.7

Q ss_pred             HHHHHHHhHHHHHHHhhhHhhHHHHHHHHHHHHHHH
Q 047021           77 LRTLKKRLKEAEEEQYRAEEDAAALRAELNSLQQQA  112 (388)
Q Consensus        77 LR~~kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq~  112 (388)
                      ...+++|+...+...-.-+.+...++.+++.+...+
T Consensus        89 ~~~l~~r~~~l~~~~~~l~~~~~~~~~~~~~l~~~l  124 (129)
T cd00890          89 IEFLKKRLETLEKQIEKLEKQLEKLQDQITELQEEL  124 (129)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            466677777777777777777777777777777655


No 210
>PF15619 Lebercilin:  Ciliary protein causing Leber congenital amaurosis disease
Probab=21.75  E-value=6.9e+02  Score=23.61  Aligned_cols=32  Identities=38%  Similarity=0.613  Sum_probs=19.8

Q ss_pred             hhhhhhhhhhHHHhhhhHHHHHHHHHHHhHHHHH
Q 047021           56 LKDVAMEKDAAVVAREDLSAQLRTLKKRLKEAEE   89 (388)
Q Consensus        56 L~~v~mEkDaA~vareDl~AQLR~~kkRlkEAEE   89 (388)
                      |+..+..|+-+-  |+.|.-+|-.+..++.+++.
T Consensus       108 L~~L~~dknL~e--ReeL~~kL~~~~~~l~~~~~  139 (194)
T PF15619_consen  108 LKKLSEDKNLAE--REELQRKLSQLEQKLQEKEK  139 (194)
T ss_pred             HHHHHHcCCchh--HHHHHHHHHHHHHHHHHHHH
Confidence            444444444333  67777777777777776654


No 211
>PF03999 MAP65_ASE1:  Microtubule associated protein (MAP65/ASE1 family);  InterPro: IPR007145 This is a family of microtubule associated proteins. One of its members is the yeast anaphase spindle elongation protein.; PDB: 3NRX_A 3NRY_A.
Probab=21.53  E-value=31  Score=37.03  Aligned_cols=151  Identities=26%  Similarity=0.393  Sum_probs=0.0

Q ss_pred             HhhhhHH--HHHHHHHHHhHHHHHHHhhhHhhHHHHHHHHHHHHHHHhcCCC------------CCCCCCCCChhhHHHH
Q 047021           68 VAREDLS--AQLRTLKKRLKEAEEEQYRAEEDAAALRAELNSLQQQAMDGSL------------GGISSMGISPDHVQTL  133 (388)
Q Consensus        68 vareDl~--AQLR~~kkRlkEAEEEQyRAEEDaAaLRAEln~lQqq~~~~~~------------s~~~~~g~s~~~~~~l  133 (388)
                      +-..|+.  ..|..++..+..-++|--+--+....++.+|..|-....-...            .+..+++++++.+..|
T Consensus       133 ~~~~~l~S~~~l~~l~~~l~~L~~e~~~R~~~v~~l~~~I~~l~~~L~~~~~~~~~e~~l~~~~~~~~~~~Ls~~~l~~L  212 (619)
T PF03999_consen  133 IDESDLPSLEELEELRQHLQRLQEEKERRLEEVRELREEIISLMEELGIDPERTSFEKDLLSYSEDEESFCLSDENLEKL  212 (619)
T ss_dssp             --------------------------------------------------------------------------------
T ss_pred             CCCCCCCcHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcccccchhhccccccccccCCCCHHHHHHH
Confidence            3456777  7899999999999988888888888899888877555433221            1122567777777777


Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhhhhhh-hhhhhcchhhHHHHHHHHh
Q 047021          134 EKELATLKSQLQQESQLRQQERKQLTEEQARISVLMSEKQELEEKLVMFSRKASEEVSNA-AASKAFSVEDKEKLEKQLH  212 (388)
Q Consensus       134 ekEl~~l~~~lqqes~lrqqeqqrla~Eq~~~a~l~aEkqeLeekl~~~~~~~se~~~~~-a~~k~~S~eDkekle~Ql~  212 (388)
                      ..-+..|..+..+.              ..++..+......|=+.|..    +.+....| ..+..+|...-..++..  
T Consensus       213 ~~~~~~L~~~k~~r--------------~~~~~~l~~~i~~LW~~L~~----~~ee~~~F~~~~~~ls~~~i~~l~~E--  272 (619)
T PF03999_consen  213 QELLQELEEEKEER--------------EEKLQELREKIEELWNRLDV----PEEEREAFLEENSGLSLDTIEALEEE--  272 (619)
T ss_dssp             --------------------------------------------------------------------------------
T ss_pred             HHHHHHHHHHHHHH--------------HHHHHHHHHHHHHHHHHhCC----CHHHHHHHhhccCcchHHHHHHHHHH--
Confidence            66665443322211              11223333333333333321    23344444 34566777666777777  


Q ss_pred             HHHHHHHHHHhhHHHHHHHh-hhhhHHHHHHh
Q 047021          213 DMAVVVERLESSRQKLLMEI-DNQSSEIERLF  243 (388)
Q Consensus       213 dma~~VeRLE~SRqKLL~EI-D~QS~EIErLf  243 (388)
                           |.||+.-|..-|-++ +.-..||+.|+
T Consensus       273 -----l~RL~~lK~~~lk~~I~~~R~ei~elW  299 (619)
T PF03999_consen  273 -----LERLEELKKQNLKEFIEKKRQEIEELW  299 (619)
T ss_dssp             --------------------------------
T ss_pred             -----HHHHHHHHHHhHHHHHHHHHHHHHHHH
Confidence                 888888887777664 66677777654


No 212
>TIGR03185 DNA_S_dndD DNA sulfur modification protein DndD. This model describes the DndB protein encoded by an operon associated with a sulfur-containing modification to DNA. The operon is sporadically distributed in bacteria, much like some restriction enzyme operons. DndD is described as a putative ATPase. The small number of examples known so far include species from among the Firmicutes, Actinomycetes, Proteobacteria, and Cyanobacteria.
Probab=21.34  E-value=1.1e+03  Score=25.66  Aligned_cols=78  Identities=23%  Similarity=0.272  Sum_probs=39.4

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHhhhchhhhhhhhhhhhcchhhHHHHHHHHhHHHHHHHHHHhhHHHHHHHhhhhhHHHHH
Q 047021          162 QARISVLMSEKQELEEKLVMFSRKASEEVSNAAASKAFSVEDKEKLEKQLHDMAVVVERLESSRQKLLMEIDNQSSEIER  241 (388)
Q Consensus       162 q~~~a~l~aEkqeLeekl~~~~~~~se~~~~~a~~k~~S~eDkekle~Ql~dma~~VeRLE~SRqKLL~EID~QS~EIEr  241 (388)
                      +..+..+..+...+++.+..+.++-....+.... +.+ .+....+++.+.....-++.++.....+-.+|..--.+|.+
T Consensus       390 ~~~~~~~~~~~~~~e~el~~l~~~l~~~~~~e~i-~~l-~e~l~~l~~~l~~~~~~~~~~~~~~~~~~~~i~~~~~~~~~  467 (650)
T TIGR03185       390 QDAKSQLLKELRELEEELAEVDKKISTIPSEEQI-AQL-LEELGEAQNELFRSEAEIEELLRQLETLKEAIEALRKTLDE  467 (650)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHhcCCChHHH-HHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            4556677777777777777777653321111101 111 14445555555555555555555555444444444444433


No 213
>KOG0946 consensus ER-Golgi vesicle-tethering protein p115 [Intracellular trafficking, secretion, and vesicular transport]
Probab=21.18  E-value=1.4e+03  Score=27.03  Aligned_cols=38  Identities=24%  Similarity=0.246  Sum_probs=31.4

Q ss_pred             hhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhh
Q 047021            5 QLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRIN   42 (388)
Q Consensus         5 ~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Ln   42 (388)
                      .++.|+..|.+|+++.+...|.........+.|..+|-
T Consensus       679 ~~~~l~~e~eeL~~~vq~~~s~hsql~~q~~~Lk~qLg  716 (970)
T KOG0946|consen  679 MEKELQVENEELEEEVQDFISEHSQLKDQLDLLKNQLG  716 (970)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence            46778888888888888888888888888888888887


No 214
>PF07106 TBPIP:  Tat binding protein 1(TBP-1)-interacting protein (TBPIP);  InterPro: IPR010776 This family consists of several eukaryotic TBP-1 interacting protein (TBPIP) sequences. TBP-1 has been demonstrated to interact with the human immunodeficiency virus type 1 (HIV-1) viral protein Tat, then modulate the essential replication process of HIV. In addition, TBP-1 has been shown to be a component of the 26S proteasome, a basic multiprotein complex that degrades ubiquitinated proteins in an ATP-dependent fashion. Human TBPIP interacts with human TBP-1 then modulates the inhibitory action of human TBP-1 on HIV-Tat-mediated transactivation [].
Probab=21.18  E-value=1.8e+02  Score=25.85  Aligned_cols=20  Identities=25%  Similarity=0.385  Sum_probs=9.2

Q ss_pred             HHHHhhHHHHHHhhHhhhhh
Q 047021           29 VKEVESESLQTRINELEQNT   48 (388)
Q Consensus        29 ~k~~~~~~L~~~Lnd~e~n~   48 (388)
                      +...++..|..+|..+..++
T Consensus       120 ~l~~e~~~l~~kL~~l~~~~  139 (169)
T PF07106_consen  120 ELEEEIEELEEKLEKLRSGS  139 (169)
T ss_pred             HHHHHHHHHHHHHHHHHhCC
Confidence            33444445555555444433


No 215
>PF12240 Angiomotin_C:  Angiomotin C terminal;  InterPro: IPR024646 This domain represents the C-terminal region of angiomotin. Angiomotin regulates the action of angiogenesis-inhibitor angiostatin []. The C-terminal region of angiomotin appears to be involved in directing the protein chemotactically [].
Probab=21.05  E-value=1.4e+02  Score=29.12  Aligned_cols=38  Identities=24%  Similarity=0.305  Sum_probs=33.8

Q ss_pred             hHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhH
Q 047021            6 LKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINE   43 (388)
Q Consensus         6 v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd   43 (388)
                      |.--..||.+||-..+-|-.+|.+|+.-+.+|+++.-.
T Consensus       131 l~~a~~K~qemE~RIK~LhaqI~EKDAmIkVLQqrs~~  168 (205)
T PF12240_consen  131 LHMANRKCQEMENRIKALHAQIAEKDAMIKVLQQRSRK  168 (205)
T ss_pred             HHHhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHhhccc
Confidence            44557899999999999999999999999999998763


No 216
>KOG0804 consensus Cytoplasmic Zn-finger protein BRAP2 (BRCA1 associated protein) [General function prediction only]
Probab=20.98  E-value=1.1e+03  Score=25.83  Aligned_cols=20  Identities=30%  Similarity=0.243  Sum_probs=10.9

Q ss_pred             hhHHHHHHHHHHHhHHHHHH
Q 047021           71 EDLSAQLRTLKKRLKEAEEE   90 (388)
Q Consensus        71 eDl~AQLR~~kkRlkEAEEE   90 (388)
                      |+..+|+-+--+|+..-..+
T Consensus       381 e~k~~q~q~k~~k~~kel~~  400 (493)
T KOG0804|consen  381 ERKLQQLQTKLKKCQKELKE  400 (493)
T ss_pred             HHHHHHHHHHHHHHHHHHHH
Confidence            45566666655555444333


No 217
>PF14282 FlxA:  FlxA-like protein
Probab=20.89  E-value=5.2e+02  Score=21.93  Aligned_cols=25  Identities=32%  Similarity=0.513  Sum_probs=20.5

Q ss_pred             CCCChhhHHHHHHHHHHHHHHHHHH
Q 047021          123 MGISPDHVQTLEKELATLKSQLQQE  147 (388)
Q Consensus       123 ~g~s~~~~~~lekEl~~l~~~lqqe  147 (388)
                      .|.++.+|..|++.|..|+.+|+.-
T Consensus        14 ~~~~~~~I~~L~~Qi~~Lq~ql~~l   38 (106)
T PF14282_consen   14 SGSSDSQIEQLQKQIKQLQEQLQEL   38 (106)
T ss_pred             CCcHHHHHHHHHHHHHHHHHHHHHH
Confidence            3445888999999999999998753


No 218
>PRK00409 recombination and DNA strand exchange inhibitor protein; Reviewed
Probab=20.87  E-value=1.1e+03  Score=26.58  Aligned_cols=50  Identities=22%  Similarity=0.247  Sum_probs=26.5

Q ss_pred             HHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHh
Q 047021            7 KDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKAL   56 (388)
Q Consensus         7 ~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL   56 (388)
                      ..|+....+++.+...+...+++.+.....+...++.++.+.-.-++++.
T Consensus       523 ~~l~~~~~~~e~~~~~~~~~~~e~~~~~~~l~~~~~~l~~~~~~~~~~~~  572 (782)
T PRK00409        523 ASLEELERELEQKAEEAEALLKEAEKLKEELEEKKEKLQEEEDKLLEEAE  572 (782)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            44555555555555555555555555555555555555555444444444


No 219
>PF10234 Cluap1:  Clusterin-associated protein-1;  InterPro: IPR019366 This protein of 413 amino acids contains a central coiled-coil domain, possibly the region that binds to clusterin. Cluap1 expression is highest in the nucleus and gradually increases during late S to G2/M phases of the cell cycle and returns to the basal level in the G0/G1 phases. In addition, it is upregulated in colon cancer tissues compared to corresponding non-cancerous mucosa. It thus plays a crucial role in the life of the cell []. 
Probab=20.83  E-value=2e+02  Score=28.73  Aligned_cols=47  Identities=28%  Similarity=0.372  Sum_probs=40.5

Q ss_pred             hhhhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhh
Q 047021            3 LSQLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVP   50 (388)
Q Consensus         3 ~~~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~   50 (388)
                      ..+|..+++.+..+..+...|..||+-|..+.+-.++||..|. ++.|
T Consensus       175 ~~~~~~~~~~l~~l~~de~~Le~KIekkk~ELER~qKRL~sLq-~vRP  221 (267)
T PF10234_consen  175 QQQLQQTQQQLNNLASDEANLEAKIEKKKQELERNQKRLQSLQ-SVRP  221 (267)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-hcCh
Confidence            4578888899999999999999999999999999999999774 3444


No 220
>PF09789 DUF2353:  Uncharacterized coiled-coil protein (DUF2353);  InterPro: IPR019179  Members of this family have been annotated as being coiled-coil domain-containing protein 149, however they currently have no known function. 
Probab=20.36  E-value=9.6e+02  Score=24.76  Aligned_cols=72  Identities=22%  Similarity=0.311  Sum_probs=40.5

Q ss_pred             HHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHhhhhhhhhhhHH--HhhhhHHHHHHHHHHHh
Q 047021            7 KDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKALKDVAMEKDAAV--VAREDLSAQLRTLKKRL   84 (388)
Q Consensus         7 ~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL~~v~mEkDaA~--vareDl~AQLR~~kkRl   84 (388)
                      .+....|..|..+...|..++.+..-++..|-..+++.+--..+            -++..  ..|++++.||-.++++.
T Consensus        75 ~~sre~Nk~L~~Ev~~Lrqkl~E~qGD~KlLR~~la~~r~~~~~------------~~~~~~~~ere~lV~qLEk~~~q~  142 (319)
T PF09789_consen   75 SESREQNKKLKEEVEELRQKLNEAQGDIKLLREKLARQRVGDEG------------IGARHFPHEREDLVEQLEKLREQI  142 (319)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHhchHHHHHHHHHhhhhhhcc------------ccccccchHHHHHHHHHHHHHHHH
Confidence            34445555555555555555555555555555555443221111            11111  55999999998888887


Q ss_pred             HHHHHH
Q 047021           85 KEAEEE   90 (388)
Q Consensus        85 kEAEEE   90 (388)
                      ..-|-+
T Consensus       143 ~qLe~d  148 (319)
T PF09789_consen  143 EQLERD  148 (319)
T ss_pred             HHHHHH
Confidence            766543


No 221
>KOG0995 consensus Centromere-associated protein HEC1 [Cell cycle control, cell division, chromosome partitioning]
Probab=20.32  E-value=1.2e+03  Score=26.06  Aligned_cols=207  Identities=17%  Similarity=0.243  Sum_probs=0.0

Q ss_pred             ChhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhchhhhhhhhhhhhcchhhHH
Q 047021          126 SPDHVQTLEKELATLKSQLQQESQLRQQERKQLTEEQARISVLMSEKQELEEKLVMFSRKASEEVSNAAASKAFSVEDKE  205 (388)
Q Consensus       126 s~~~~~~lekEl~~l~~~lqqes~lrqqeqqrla~Eq~~~a~l~aEkqeLeekl~~~~~~~se~~~~~a~~k~~S~eDke  205 (388)
                      ++.-+.-|+++.++|+.-.+.-.--+.+--.+-..=.-.++.+-+|..+-|+.+..+...-.+=--.. -.+.||.+|-+
T Consensus       257 ~~~~~eslre~~~~L~~D~nK~~~y~~~~~~k~~~~~~~l~~l~~Eie~kEeE~e~lq~~~d~Lk~~I-e~Q~iS~~dve  335 (581)
T KOG0995|consen  257 DPGKEESLREKKARLQDDVNKFQAYVSQMKSKKQHMEKKLEMLKSEIEEKEEEIEKLQKENDELKKQI-ELQGISGEDVE  335 (581)
T ss_pred             CcchHHHHHHHHHHHHhHHHHHHHHHHHHHhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-HhcCCCHHHHH


Q ss_pred             HHHHHHhHHHHHHHHHHhhHHHHHHHhhhhhHHHHHHhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q 047021          206 KLEKQLHDMAVVVERLESSRQKLLMEIDNQSSEIERLFEENSSLSSSYQESMGIAKHWENQVKDCLKQNEELRGILEKLR  285 (388)
Q Consensus       206 kle~Ql~dma~~VeRLE~SRqKLL~EID~QS~EIErLf~ENs~Lsss~qea~~iaaqWEnQvqDclkQNeeLr~lLeklR  285 (388)
                      ++-.....+-.-|.++..-+++|--++=.+-++|++-|.+                 -+.++.||-.+--+|.-.     
T Consensus       336 ~mn~Er~~l~r~l~~i~~~~d~l~k~vw~~~l~~~~~f~~-----------------le~~~~~~~~l~~~i~l~-----  393 (581)
T KOG0995|consen  336 RMNLERNKLKRELNKIQSELDRLSKEVWELKLEIEDFFKE-----------------LEKKFIDLNSLIRRIKLG-----  393 (581)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHH-----------------HHHHHHHHHHHHHHHHHH-----


Q ss_pred             hhhhccCCCCccccccccCCCCCCccccCC--CchhhhhHHhhHHHHHHHhhHHHHhHHHHHHHHHHHHHHHHHhhhh
Q 047021          286 TEQASFMSVNDKDILRDFSEPSKDAVSQNG--SETHKNEFLALKGQLAKEQSRAEALSAQVLQLSAQLQQTTQAYNGL  361 (388)
Q Consensus       286 ~EQa~~~~~~~~~~~g~~~e~~~dg~~~~~--~~~~~te~l~LK~eL~kEqsRseaLsaqv~qLsAeL~ra~QA~n~L  361 (388)
                            ...+..+....+..+.+.|+.-.+  -|..--.+....+++.++.-....|.--+-.+..-...-.|-..++
T Consensus       394 ------~~~~~~n~~~~pe~~~~~~~d~k~~V~~~l~el~~ei~~~~~~~~~~~~tLq~~~~~~~~~i~E~~~~l~~~  465 (581)
T KOG0995|consen  394 ------IAENSKNLERNPERAATNGVDLKSYVKPLLKELLDEISEELHEAENELETLQEHFSNKASTIEEKIQILGEI  465 (581)
T ss_pred             ------HHHHhccCCcCCccCccccccchhHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH


No 222
>PF04899 MbeD_MobD:  MbeD/MobD like ;  InterPro: IPR006983 The MbeD and MobD proteins are plasmid encoded, and are involved in the plasmid mobilisation and transfer in the presence of conjugative plasmids [].
Probab=20.29  E-value=2.9e+02  Score=22.53  Aligned_cols=40  Identities=25%  Similarity=0.347  Sum_probs=33.5

Q ss_pred             CchhhhhHHhhHHHHHHHhhHHHHhHHHHHHHHHHHHHHH
Q 047021          316 SETHKNEFLALKGQLAKEQSRAEALSAQVLQLSAQLQQTT  355 (388)
Q Consensus       316 ~~~~~te~l~LK~eL~kEqsRseaLsaqv~qLsAeL~ra~  355 (388)
                      ..+|......|+-.|.....+-.+|+++|..|+-++..--
T Consensus        23 ~~~Wq~sy~~Lq~~~~~t~~~~a~L~~qv~~Ls~qv~~Ls   62 (70)
T PF04899_consen   23 QQEWQSSYADLQHMFEQTSQENAALSEQVNNLSQQVQRLS   62 (70)
T ss_pred             HHHHHHHHHHHHHHHHHhHHHHHHHHHHHHHHHHHHHHHH
Confidence            5667778889999999999999999999999988776543


No 223
>TIGR01069 mutS2 MutS2 family protein. Function of MutS2 is unknown. It should not be considered a DNA mismatch repair protein. It is likely a DNA mismatch binding protein of unknown cellular function.
Probab=20.14  E-value=8.4e+02  Score=27.55  Aligned_cols=50  Identities=22%  Similarity=0.255  Sum_probs=27.3

Q ss_pred             HHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHh
Q 047021            7 KDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKAL   56 (388)
Q Consensus         7 ~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL   56 (388)
                      .+|+....+++.....+...+.+.+.....|..+++.++...---+.+|-
T Consensus       518 ~~L~~~~~~~e~~~~~~~~~~~e~~~~~~~l~~~~~~l~~~~~~~~~~a~  567 (771)
T TIGR01069       518 EKLSALEKELEQKNEHLEKLLKEQEKLKKELEQEMEELKERERNKKLELE  567 (771)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            34555555555555555555555555566666666555554444443333


No 224
>PF10224 DUF2205:  Predicted coiled-coil protein (DUF2205);  InterPro: IPR019357  This entry represents a highly conserved 100 residue region which is likely to have a coiled-coil structure. The exact function is unknown. 
Probab=20.08  E-value=4.3e+02  Score=22.11  Aligned_cols=51  Identities=25%  Similarity=0.305  Sum_probs=38.8

Q ss_pred             hcchhhHHHHHHHHhHHHHHHHHHHhhHHHHHHHhhhhhHHHHHHhhhhhhhhhhHHH
Q 047021          198 AFSVEDKEKLEKQLHDMAVVVERLESSRQKLLMEIDNQSSEIERLFEENSSLSSSYQE  255 (388)
Q Consensus       198 ~~S~eDkekle~Ql~dma~~VeRLE~SRqKLL~EID~QS~EIErLf~ENs~Lsss~qe  255 (388)
                      .+..+.++.|-+++       --|-.+=+-|+.-||..-.|.++|-.||..|..=..-
T Consensus        12 ~~~~e~k~~Li~ei-------~~LQ~sL~~L~~Rve~Vk~E~~kL~~EN~~Lq~YI~n   62 (80)
T PF10224_consen   12 KLEKEEKEELIQEI-------LELQDSLEALSDRVEEVKEENEKLESENEYLQQYIGN   62 (80)
T ss_pred             HHhHHHHHHHHHHH-------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            45677888888884       4455555678889999999999999999888654433


No 225
>PF09766 FimP:  Fms-interacting protein;  InterPro: IPR019163 This entry represents Thoc5 which is one of the subunits of the THO complex, which additionally contains: HPR1, Thoc2, Thoc6 and Thoc7. The evolutionarily conserved multisubunit THO complex, which is recruited to actively transcribed genes is required for the efficient expression of genes that have internal tandem repeats. It is suggested that the THO complex functions to rectify aberrant structures that arise during transcription [, ] and is required for cell proliferation and for proper export of heat-shock mRNAs under heat stress [].   This entry also identifies the crucial 144 N-terminal residues of the FmiP protein, which is essential for the binding of the protein to the cytoplasmic domain of activated Fms-molecules in M-CSF induced haematopoietic differentiation of macrophages. The C terminus contains a putative nuclear localisation sequence and a leucine zipper which suggest further, as yet unknown, nuclear functions. The level of FMIP expression might form a threshold that determines whether cells differentiate into macrophages or into granulocytes []. 
Probab=20.04  E-value=2e+02  Score=29.16  Aligned_cols=51  Identities=25%  Similarity=0.363  Sum_probs=37.1

Q ss_pred             hhHHHHHhhHHHHHHhHHhhhhHHHHHHhhHHHHHHhhHhhhhhhhhHHHHh
Q 047021            5 QLKDFQKKNAELEEKNSILSSKLQVKEVESESLQTRINELEQNTVPSLRKAL   56 (388)
Q Consensus         5 ~v~~lq~~~~~le~~n~~L~sk~~~k~~~~~~L~~~Lnd~e~n~v~slrkaL   56 (388)
                      +=+.|.+.+.+|+.+...|...+.++....++|...|.++...+.| +++.|
T Consensus       102 ~Rk~L~~~~~el~~~k~~l~~~~~~k~~~L~~l~~~L~~l~~a~~p-lq~~l  152 (355)
T PF09766_consen  102 QRKRLEEQLKELEQRKKKLQQENKKKKKFLDSLPPQLKSLKKAAKP-LQEYL  152 (355)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHHHHHHHHHH-HHHHh
Confidence            3467777788888888888888888888888888888777666654 34444


No 226
>PF06825 HSBP1:  Heat shock factor binding protein 1;  InterPro: IPR009643 Heat shock factor binding protein 1 (HSBP1) appears to be a negative regulator of the heat shock response [].; PDB: 3CI9_A.
Probab=20.04  E-value=1.4e+02  Score=23.42  Aligned_cols=15  Identities=27%  Similarity=0.636  Sum_probs=8.8

Q ss_pred             hHHHHHHhhHhhhhh
Q 047021           34 SESLQTRINELEQNT   48 (388)
Q Consensus        34 ~~~L~~~Lnd~e~n~   48 (388)
                      .|-+-+|+++||++.
T Consensus        30 iDeM~~RIDdLE~si   44 (54)
T PF06825_consen   30 IDEMSSRIDDLEKSI   44 (54)
T ss_dssp             HHHHHHHHHCCHHHH
T ss_pred             HHHHHhhHHHHHHHH
Confidence            455556666666554


No 227
>PF08649 DASH_Dad1:  DASH complex subunit Dad1;  InterPro: IPR013958  The DASH complex is a ~10 subunit microtubule-binding complex that is transferred to the kinetochore prior to mitosis []. In Saccharomyces cerevisiae (Baker's yeast) DASH forms both rings and spiral structures on microtubules in vitro [, ]. Components of the DASH complex, including Dam1, Duo1, Spc34, Dad1 and Ask1, are essential and connect the centromere to the plus end of spindle microtubules []. Throughout the cell cycle Dad1 remains bound to kinetochores and its association is dependent on the Mis6 and Mal2 []. 
Probab=20.03  E-value=2.8e+02  Score=22.10  Aligned_cols=41  Identities=24%  Similarity=0.511  Sum_probs=33.1

Q ss_pred             HhhHHHHHHHhhhhhHHHHHHhhhhhhhhhhHHHHHHHHHHHHH
Q 047021          222 ESSRQKLLMEIDNQSSEIERLFEENSSLSSSYQESMGIAKHWEN  265 (388)
Q Consensus       222 E~SRqKLL~EID~QS~EIErLf~ENs~Lsss~qea~~iaaqWEn  265 (388)
                      |.-|++|+.||-   ..+|.++.-=+.|--|+..+++|+...+.
T Consensus         4 e~qR~~Li~eI~---~~~e~vl~nlN~LNRsLE~~i~VGkEF~~   44 (58)
T PF08649_consen    4 ERQRDRLIQEIS---ESMESVLNNLNALNRSLESVISVGKEFES   44 (58)
T ss_pred             HHHHHHHHHHHH---HHHHHHHHHHHHHHHhHHHHHHHhhhHHH
Confidence            456999999995   56778887778888999999998887654


Done!