Query         psy5736
Match_columns 346
No_of_seqs    336 out of 1547
Neff          7.2 
Searched_HMMs 46136
Date          Fri Aug 16 16:42:00 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy5736.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/5736hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG2084|consensus               99.6 3.5E-15 7.6E-20  149.8  16.9  180  136-333   197-377 (482)
  2 PF00856 SET:  SET domain;  Int  99.4 3.1E-13 6.7E-18  114.5   5.2   50  135-184   109-162 (162)
  3 KOG1517|consensus               99.2 1.2E-11 2.7E-16  129.7   5.6   77    6-86   1086-1164(1387)
  4 smart00317 SET SET (Su(var)3-9  99.1   6E-11 1.3E-15   95.9   3.9   46  138-183    67-116 (116)
  5 KOG2589|consensus               98.6 1.6E-08 3.4E-13   96.4   3.0   60  144-215   196-256 (453)
  6 COG2940 Proteins containing SE  98.0 4.2E-06 9.1E-11   85.3   3.3   71  143-215   404-478 (480)
  7 KOG4442|consensus               97.9   7E-06 1.5E-10   84.3   3.6   41  145-185   194-238 (729)
  8 KOG1080|consensus               97.8 1.1E-05 2.4E-10   87.4   3.6   41  144-184   939-983 (1005)
  9 KOG1082|consensus               97.2 0.00026 5.6E-09   69.8   3.0   42  146-187   274-323 (364)
 10 KOG1141|consensus               96.4  0.0012 2.5E-08   69.3   1.2   41  145-185  1190-1238(1262)
 11 KOG1083|consensus               96.3  0.0023   5E-08   68.8   2.9   41  145-185  1251-1295(1306)
 12 KOG1337|consensus               95.9  0.0067 1.4E-07   61.8   3.5   61  138-206   230-291 (472)
 13 KOG1085|consensus               95.7  0.0074 1.6E-07   56.6   2.6   41  145-185   334-378 (392)
 14 KOG0294|consensus               95.0   0.036 7.8E-07   52.8   4.8   65    3-74    226-292 (362)
 15 KOG0282|consensus               94.2   0.071 1.5E-06   53.3   5.1   53    2-63    276-328 (503)
 16 KOG0296|consensus               94.1   0.048   1E-06   52.8   3.6   51    2-63    304-354 (399)
 17 PF13424 TPR_12:  Tetratricopep  94.1    0.19 4.2E-06   37.3   6.2   51  283-334    12-62  (78)
 18 KOG1079|consensus               93.3   0.074 1.6E-06   55.3   3.5   42  144-185   665-710 (739)
 19 PLN00181 protein SPA1-RELATED;  92.8    0.19 4.1E-06   54.4   6.0   54    3-64    552-605 (793)
 20 PF13374 TPR_10:  Tetratricopep  92.3    0.32 6.9E-06   31.2   4.5   34  283-316     9-42  (42)
 21 KOG0302|consensus               91.3     0.3 6.4E-06   47.8   4.8   57    3-67    277-336 (440)
 22 KOG1188|consensus               91.3     0.3 6.5E-06   47.1   4.7   51    5-62     49-99  (376)
 23 KOG0646|consensus               90.8    0.19 4.1E-06   50.1   3.0   51    3-62    100-150 (476)
 24 KOG0283|consensus               90.6    0.31 6.6E-06   51.5   4.4   54    3-65    387-440 (712)
 25 PF00400 WD40:  WD domain, G-be  90.3     0.4 8.7E-06   30.4   3.3   31   31-63      9-39  (39)
 26 PTZ00421 coronin; Provisional   90.3    0.47   1E-05   48.8   5.4   60    3-63     95-154 (493)
 27 PTZ00421 coronin; Provisional   90.2    0.54 1.2E-05   48.3   5.8   52    3-63    145-196 (493)
 28 KOG0264|consensus               89.8    0.51 1.1E-05   46.8   5.0   57    3-65    247-305 (422)
 29 KOG0291|consensus               89.7    0.43 9.3E-06   50.4   4.5   51    5-64    329-379 (893)
 30 PF13512 TPR_18:  Tetratricopep  88.2     1.6 3.4E-05   37.4   6.2   52  277-333    11-62  (142)
 31 PF13525 YfiO:  Outer membrane   87.9       2 4.2E-05   38.5   7.2   54  276-334     5-58  (203)
 32 PTZ00420 coronin; Provisional   87.8     1.2 2.6E-05   46.6   6.4   55    3-65     51-105 (568)
 33 KOG0294|consensus               87.7    0.67 1.5E-05   44.4   4.1   55    2-63     59-113 (362)
 34 KOG0319|consensus               87.3     0.8 1.7E-05   48.2   4.7   55    2-63    123-177 (775)
 35 KOG1188|consensus               87.2    0.69 1.5E-05   44.7   3.9   55    3-64    140-195 (376)
 36 KOG0302|consensus               87.2    0.85 1.9E-05   44.7   4.5   55    3-65    321-378 (440)
 37 KOG0281|consensus               86.8    0.41 8.9E-06   46.5   2.1   61    4-64    255-347 (499)
 38 KOG0269|consensus               86.4    0.57 1.2E-05   49.5   3.1   52    4-63    108-162 (839)
 39 PTZ00420 coronin; Provisional   86.3     1.4 2.9E-05   46.2   5.8   52    3-64    145-196 (568)
 40 KOG0270|consensus               86.1     1.2 2.6E-05   44.4   4.9   53    2-62    262-314 (463)
 41 KOG0316|consensus               85.9     1.4 3.1E-05   40.8   5.0   51    3-62     78-128 (307)
 42 KOG1081|consensus               85.7    0.33 7.2E-06   49.4   1.0   50  136-185   362-416 (463)
 43 KOG0273|consensus               85.6     1.6 3.5E-05   43.9   5.6   60    2-63    294-387 (524)
 44 KOG0276|consensus               85.5    0.96 2.1E-05   47.0   4.1   50    4-62     33-82  (794)
 45 cd00200 WD40 WD40 domain, foun  85.0       2 4.3E-05   37.8   5.6   54    3-65     70-123 (289)
 46 KOG1338|consensus               83.9    0.93   2E-05   44.7   3.1   53  133-185   207-260 (466)
 47 KOG0306|consensus               83.7       1 2.2E-05   47.7   3.5   54    5-65     42-95  (888)
 48 KOG0293|consensus               82.6     1.4   3E-05   43.8   3.7   54    4-65    460-513 (519)
 49 KOG1334|consensus               82.0     1.3 2.8E-05   44.8   3.3   57    3-65    161-217 (559)
 50 KOG0274|consensus               81.8     2.7 5.9E-05   43.7   5.8   50    3-63    308-357 (537)
 51 PF13432 TPR_16:  Tetratricopep  81.8     2.9 6.3E-05   29.6   4.4   46  281-334     2-47  (65)
 52 PLN00181 protein SPA1-RELATED;  81.7     2.8   6E-05   45.5   6.1   53    3-64    595-647 (793)
 53 COG1729 Uncharacterized protei  81.6       3 6.6E-05   39.2   5.5   52  278-334   143-194 (262)
 54 KOG0263|consensus               81.4     2.9 6.2E-05   44.3   5.7   54    3-65    554-607 (707)
 55 KOG0310|consensus               81.3     1.7 3.8E-05   43.7   3.9   49    6-63    217-266 (487)
 56 KOG0271|consensus               81.3     2.3 4.9E-05   42.0   4.6   54    4-63    177-233 (480)
 57 TIGR02795 tol_pal_ybgF tol-pal  81.1     4.3 9.4E-05   31.6   5.6   48  282-334    45-92  (119)
 58 KOG0266|consensus               81.1     3.1 6.6E-05   42.2   5.8   51    3-62    222-273 (456)
 59 KOG0266|consensus               80.4     2.9 6.4E-05   42.3   5.4   55    3-66    265-319 (456)
 60 KOG1332|consensus               80.4     2.3 5.1E-05   39.6   4.2   55    4-63     78-132 (299)
 61 KOG0272|consensus               80.4     2.2 4.7E-05   42.4   4.2   52    3-62    364-415 (459)
 62 KOG2461|consensus               80.0    0.77 1.7E-05   45.8   1.0   33  155-187   114-146 (396)
 63 PF14559 TPR_19:  Tetratricopep  80.0     3.3 7.2E-05   29.4   4.2   40  287-334     2-41  (68)
 64 KOG0289|consensus               79.9     3.4 7.4E-05   41.3   5.3   50    4-62    367-416 (506)
 65 PF13414 TPR_11:  TPR repeat; P  79.8     9.4  0.0002   27.1   6.6   49  277-333     4-52  (69)
 66 KOG0292|consensus               79.5     2.5 5.3E-05   45.8   4.5   49    5-62    114-162 (1202)
 67 cd00200 WD40 WD40 domain, foun  79.3     4.5 9.7E-05   35.4   5.7   51    5-64    114-164 (289)
 68 PRK10866 outer membrane biogen  79.1     7.2 0.00016   36.1   7.1   51  278-333    34-84  (243)
 69 KOG0263|consensus               78.8     2.2 4.9E-05   45.0   4.0   51    3-62    596-646 (707)
 70 KOG0647|consensus               78.7       3 6.5E-05   39.9   4.4   55    3-65     91-145 (347)
 71 KOG0276|consensus               78.5     3.3 7.3E-05   43.2   5.0   53    3-62    160-212 (794)
 72 KOG0308|consensus               78.0     3.1 6.8E-05   43.5   4.6   53    3-64    190-242 (735)
 73 smart00320 WD40 WD40 repeats.   77.9     2.6 5.5E-05   24.3   2.6   30   32-63     11-40  (40)
 74 KOG1310|consensus               77.8     3.9 8.4E-05   42.2   5.1   55    5-65     71-125 (758)
 75 KOG0278|consensus               77.3     4.9 0.00011   37.7   5.2   53    3-63    243-295 (334)
 76 PRK10803 tol-pal system protei  76.4     7.9 0.00017   36.4   6.7   51  279-334   145-196 (263)
 77 KOG0273|consensus               76.4     4.4 9.5E-05   40.9   5.0   51    3-62    429-479 (524)
 78 KOG0277|consensus               75.9     4.8  0.0001   37.8   4.8   56    4-68    168-226 (311)
 79 PF13174 TPR_6:  Tetratricopept  75.6       5 0.00011   24.0   3.5   29  279-307     3-31  (33)
 80 KOG0275|consensus               75.5     1.6 3.4E-05   42.1   1.6   60    3-65    232-293 (508)
 81 KOG1034|consensus               75.5     3.7 7.9E-05   39.8   4.0   53    2-62    111-163 (385)
 82 KOG0272|consensus               75.2     5.7 0.00012   39.6   5.4   57    3-66    194-250 (459)
 83 KOG0265|consensus               75.0     4.4 9.6E-05   38.7   4.4   52    3-62    109-160 (338)
 84 KOG0274|consensus               74.6     5.5 0.00012   41.5   5.5   54    3-67    430-483 (537)
 85 TIGR03302 OM_YfiO outer membra  74.4     5.7 0.00012   35.7   5.0   50  279-333    36-85  (235)
 86 KOG1517|consensus               74.2     6.3 0.00014   43.6   5.9   69    2-77   1183-1251(1387)
 87 PF10516 SHNi-TPR:  SHNi-TPR;    73.5     4.6  0.0001   26.4   3.0   27  286-312    11-37  (38)
 88 PF09986 DUF2225:  Uncharacteri  73.5      10 0.00022   34.5   6.4   80  254-334    48-134 (214)
 89 cd02681 MIT_calpain7_1 MIT: do  73.3      24 0.00052   26.8   7.4   57  275-331     5-64  (76)
 90 KOG0275|consensus               72.4     3.3 7.2E-05   40.0   3.0   51    4-62    283-333 (508)
 91 KOG2055|consensus               72.3     7.4 0.00016   39.2   5.4   65    3-76    363-431 (514)
 92 KOG0308|consensus               72.0       6 0.00013   41.4   4.9   59    2-62    136-198 (735)
 93 KOG0271|consensus               71.7     7.2 0.00016   38.6   5.1   54    3-66    224-277 (480)
 94 KOG1840|consensus               70.6     9.3  0.0002   39.5   6.0   54  283-336   248-301 (508)
 95 COG4105 ComL DNA uptake lipopr  70.3      13 0.00027   34.9   6.2   52  276-332    34-85  (254)
 96 KOG0268|consensus               70.2     5.5 0.00012   39.1   3.9   53    6-66    167-219 (433)
 97 KOG0315|consensus               70.2     9.7 0.00021   35.7   5.4   62    3-74    102-164 (311)
 98 KOG0305|consensus               70.1     6.7 0.00015   40.2   4.8   54    4-65    366-419 (484)
 99 TIGR02795 tol_pal_ybgF tol-pal  69.6      16 0.00034   28.4   6.0   27  279-305     5-31  (119)
100 KOG0646|consensus               69.4     7.4 0.00016   39.2   4.7   61    3-65    142-206 (476)
101 KOG0296|consensus               69.3      11 0.00023   37.1   5.6   53    5-66    169-221 (399)
102 cd02678 MIT_VPS4 MIT: domain c  69.0      35 0.00076   25.4   7.4   58  273-330     3-62  (75)
103 cd02684 MIT_2 MIT: domain cont  68.7      34 0.00073   25.7   7.3   58  273-330     3-62  (75)
104 PLN03098 LPA1 LOW PSII ACCUMUL  67.9     8.9 0.00019   38.9   5.0   53  277-334    76-128 (453)
105 PF00515 TPR_1:  Tetratricopept  66.6      16 0.00034   22.1   4.4   29  278-306     3-31  (34)
106 KOG0281|consensus               66.6     7.2 0.00016   38.2   3.9   49    4-63    338-386 (499)
107 KOG0285|consensus               66.1      13 0.00027   36.7   5.4   54    3-65    254-307 (460)
108 cd02680 MIT_calpain7_2 MIT: do  65.7      13 0.00028   28.2   4.5   36  273-308     3-38  (75)
109 KOG1007|consensus               65.7       6 0.00013   37.8   3.1   53    4-63    191-243 (370)
110 KOG0291|consensus               65.2      13 0.00027   39.9   5.6   51    3-62    369-419 (893)
111 PF10579 Rapsyn_N:  Rapsyn N-te  65.2      35 0.00075   26.3   6.6   54  276-334     6-59  (80)
112 KOG0306|consensus               64.6     9.7 0.00021   40.7   4.7   51    4-63     85-135 (888)
113 PF07719 TPR_2:  Tetratricopept  64.2      18 0.00039   21.6   4.3   28  279-306     4-31  (34)
114 KOG0284|consensus               64.0     6.6 0.00014   39.0   3.2   73    3-84    241-322 (464)
115 KOG4328|consensus               64.0     7.9 0.00017   39.0   3.7   51    4-62    208-262 (498)
116 PRK00398 rpoP DNA-directed RNA  63.9     5.2 0.00011   27.0   1.8   30  228-265     3-32  (46)
117 cd02682 MIT_AAA_Arch MIT: doma  63.6      53  0.0012   24.9   7.4   57  275-331     5-63  (75)
118 KOG0772|consensus               63.6     6.4 0.00014   40.3   3.1   56    4-64    337-393 (641)
119 PF14783 BBS2_Mid:  Ciliary BBS  63.5       8 0.00017   31.6   3.1   30   36-65      2-33  (111)
120 KOG2110|consensus               63.0      14 0.00031   36.2   5.2   49    5-62    152-201 (391)
121 KOG1840|consensus               62.6      19 0.00042   37.2   6.4   57  287-343   336-392 (508)
122 KOG4328|consensus               61.8     8.2 0.00018   38.9   3.4   56    4-67    299-357 (498)
123 KOG0279|consensus               61.8      15 0.00032   35.0   4.9   59    3-63    251-311 (315)
124 cd02677 MIT_SNX15 MIT: domain   61.6      55  0.0012   24.7   7.2   44  273-316     3-47  (75)
125 KOG0771|consensus               61.6     8.4 0.00018   38.2   3.4   53    3-65    163-215 (398)
126 KOG0277|consensus               61.5      15 0.00033   34.5   4.9   52    4-63    125-176 (311)
127 KOG0279|consensus               61.4      13 0.00029   35.3   4.5   54    3-65     82-135 (315)
128 KOG0268|consensus               60.1     7.9 0.00017   38.0   2.9   57    3-67    291-347 (433)
129 KOG0640|consensus               59.4      16 0.00035   35.3   4.8   54    7-66    239-292 (430)
130 PF13717 zinc_ribbon_4:  zinc-r  58.0     5.6 0.00012   25.6   1.1   33  229-264     3-35  (36)
131 KOG0313|consensus               57.8      13 0.00028   36.7   4.0   51    3-63    278-328 (423)
132 TIGR03866 PQQ_ABC_repeats PQQ-  57.1      29 0.00062   31.4   6.1   52    3-63      8-59  (300)
133 KOG0639|consensus               57.0      12 0.00027   38.2   3.8   52    4-64    529-580 (705)
134 PF04212 MIT:  MIT (microtubule  56.9      25 0.00054   25.6   4.6   37  274-310     3-39  (69)
135 KOG0643|consensus               55.7      17 0.00037   34.4   4.2   51    4-62    167-217 (327)
136 cd02656 MIT MIT: domain contai  55.7      81  0.0017   23.2   7.7   57  274-330     4-62  (75)
137 CHL00033 ycf3 photosystem I as  55.5      54  0.0012   27.8   7.3   53  277-334    36-88  (168)
138 cd02683 MIT_1 MIT: domain cont  55.2      73  0.0016   24.1   7.0   56  275-330     5-62  (77)
139 KOG0290|consensus               55.2      16 0.00034   35.1   3.9   55    3-62    170-224 (364)
140 KOG0286|consensus               54.8      26 0.00057   33.6   5.4   50    5-62    165-214 (343)
141 PRK10803 tol-pal system protei  54.8      36 0.00078   32.0   6.4   50  282-336   186-235 (263)
142 KOG1446|consensus               54.7      27 0.00058   33.6   5.4   48    5-62    253-300 (311)
143 PRK00420 hypothetical protein;  54.6      60  0.0013   26.6   6.8   10  255-264    41-50  (112)
144 PRK06266 transcription initiat  53.4      13 0.00029   32.8   3.1   10  255-264   137-146 (178)
145 KOG0303|consensus               53.1      26 0.00056   34.9   5.2   52    2-63    150-201 (472)
146 KOG4714|consensus               53.1     9.9 0.00021   35.9   2.2   54    3-63    199-252 (319)
147 PF13719 zinc_ribbon_5:  zinc-r  52.7       7 0.00015   25.3   0.9   33  229-264     3-35  (37)
148 PF13240 zinc_ribbon_2:  zinc-r  52.4     8.9 0.00019   22.2   1.2    9  255-263    14-22  (23)
149 PF13248 zf-ribbon_3:  zinc-rib  52.3     8.7 0.00019   22.7   1.2   23  229-263     3-25  (26)
150 PF14783 BBS2_Mid:  Ciliary BBS  52.0      32  0.0007   28.1   4.8   47    5-61     62-110 (111)
151 PF08666 SAF:  SAF domain;  Int  51.6     9.5 0.00021   26.9   1.5   18  166-183     3-20  (63)
152 COG2888 Predicted Zn-ribbon RN  51.6      13 0.00029   26.8   2.2   49  208-263    11-59  (61)
153 PF04810 zf-Sec23_Sec24:  Sec23  51.3     6.5 0.00014   25.9   0.6   32  229-264     3-34  (40)
154 KOG1587|consensus               51.2      25 0.00053   36.8   5.0   55    5-62    264-320 (555)
155 KOG0299|consensus               51.0      28  0.0006   35.2   5.1   60    4-65    346-410 (479)
156 smart00661 RPOL9 RNA polymeras  51.0      12 0.00025   25.6   1.8   30  230-265     2-31  (52)
157 TIGR00373 conserved hypothetic  50.9     8.8 0.00019   33.3   1.5   10  255-264   129-138 (158)
158 KOG2445|consensus               50.7      29 0.00062   33.5   4.9   34   33-66    112-145 (361)
159 KOG2096|consensus               50.6      14  0.0003   35.9   2.8   42   30-73     83-126 (420)
160 KOG0299|consensus               50.3      26 0.00055   35.4   4.7   52    3-65    305-356 (479)
161 TIGR02098 MJ0042_CXXC MJ0042 f  49.8     7.7 0.00017   24.9   0.7   33  229-264     3-35  (38)
162 smart00745 MIT Microtubule Int  49.7   1E+02  0.0022   22.7   7.6   38  273-310     5-42  (77)
163 PRK02603 photosystem I assembl  49.6      73  0.0016   27.2   7.1   50  279-333    38-87  (172)
164 KOG0318|consensus               49.5      23 0.00049   36.5   4.3   17    4-20     79-95  (603)
165 KOG0318|consensus               49.4      37  0.0008   35.0   5.7   56    2-63    208-263 (603)
166 COG2956 Predicted N-acetylgluc  49.1      36 0.00077   33.3   5.3   45  290-334   221-265 (389)
167 KOG0286|consensus               48.5      32  0.0007   33.0   4.8   53    3-64    206-258 (343)
168 KOG2111|consensus               48.4      45 0.00097   32.3   5.8   54    8-68    206-259 (346)
169 PF10571 UPF0547:  Uncharacteri  48.2      13 0.00027   22.3   1.4   23  230-264     2-24  (26)
170 PF09577 Spore_YpjB:  Sporulati  47.7      43 0.00093   31.0   5.5   57  275-331     4-60  (232)
171 PF13181 TPR_8:  Tetratricopept  47.4      51  0.0011   19.6   4.3   26  281-306     6-31  (34)
172 TIGR03302 OM_YfiO outer membra  47.2      50  0.0011   29.4   6.0   48  283-335   173-220 (235)
173 PF14561 TPR_20:  Tetratricopep  47.1      58  0.0013   25.3   5.5   51  277-333    23-73  (90)
174 KOG0315|consensus               46.4      30 0.00065   32.6   4.2   50    5-63     60-111 (311)
175 PF09976 TPR_21:  Tetratricopep  45.9 1.5E+02  0.0033   24.4   8.4   49  281-334    53-101 (145)
176 TIGR02059 swm_rep_I cyanobacte  45.8      32 0.00068   27.6   3.7   23  163-185    75-97  (101)
177 KOG0772|consensus               44.2      29 0.00062   35.8   4.1   56    2-63    287-345 (641)
178 PF07754 DUF1610:  Domain of un  43.9      18 0.00039   21.3   1.6    9  254-262    16-24  (24)
179 KOG0319|consensus               43.4      36 0.00077   36.4   4.7   50    4-62    525-574 (775)
180 KOG0293|consensus               43.1      23 0.00049   35.5   3.1   55    5-65    416-470 (519)
181 PF02150 RNA_POL_M_15KD:  RNA p  43.1     8.4 0.00018   24.7   0.1   30  230-266     3-32  (35)
182 PF11817 Foie-gras_1:  Foie gra  43.1      82  0.0018   29.1   6.8   51  283-335   185-235 (247)
183 KOG0289|consensus               42.8      47   0.001   33.5   5.2   56    3-65    322-377 (506)
184 cd00350 rubredoxin_like Rubred  42.7      19 0.00042   22.5   1.7   25  229-263     2-26  (33)
185 KOG0640|consensus               42.6      18 0.00038   35.1   2.2   55    4-65    281-335 (430)
186 KOG0645|consensus               42.5      29 0.00062   33.0   3.5   33   31-64    278-310 (312)
187 PF06957 COPI_C:  Coatomer (COP  41.8   1E+02  0.0022   31.2   7.5   55  273-327   201-255 (422)
188 PF09538 FYDLN_acid:  Protein o  41.8      17 0.00037   29.6   1.7   30  229-267    10-39  (108)
189 KOG0973|consensus               41.3      33 0.00071   37.9   4.2   55    3-66    148-202 (942)
190 PF14276 DUF4363:  Domain of un  41.1 1.7E+02  0.0036   23.7   7.6   56  276-331    28-88  (121)
191 PF09976 TPR_21:  Tetratricopep  40.6 1.2E+02  0.0027   25.0   7.0   39  287-334    96-134 (145)
192 KOG1539|consensus               40.2      45 0.00098   36.2   4.9   51    4-62    265-315 (910)
193 cd00189 TPR Tetratricopeptide   39.6      62  0.0013   22.3   4.4   16  319-334    69-84  (100)
194 KOG2445|consensus               39.2      52  0.0011   31.8   4.7   59    3-64     32-90  (361)
195 PRK06266 transcription initiat  39.2      22 0.00049   31.4   2.2   33  204-243   117-149 (178)
196 KOG0553|consensus               39.0      86  0.0019   30.2   6.2   33  274-306    79-111 (304)
197 PF08271 TF_Zn_Ribbon:  TFIIB z  38.9      16 0.00034   24.3   0.9   11  255-265    20-30  (43)
198 PF14803 Nudix_N_2:  Nudix N-te  38.8      17 0.00036   23.2   1.0   30  230-263     2-31  (34)
199 KOG1445|consensus               38.4      40 0.00086   35.7   4.1   60    2-66    599-659 (1012)
200 KOG0278|consensus               38.2      66  0.0014   30.4   5.1   48    4-62    163-210 (334)
201 KOG0305|consensus               38.0      41 0.00089   34.6   4.2   56    2-65    319-376 (484)
202 KOG1240|consensus               37.9      33 0.00072   38.8   3.6   38   26-63   1031-1077(1431)
203 PRK00464 nrdR transcriptional   37.9 1.6E+02  0.0035   25.4   7.3   24  313-336   101-124 (154)
204 KOG1446|consensus               37.0      69  0.0015   30.8   5.2   53    3-64     77-129 (311)
205 PLN03088 SGT1,  suppressor of   36.9      74  0.0016   31.1   5.7   44  283-334    43-86  (356)
206 PRK05580 primosome assembly pr  36.8      54  0.0012   35.2   5.1   18  317-334   519-536 (679)
207 PF13371 TPR_9:  Tetratricopept  36.7      61  0.0013   23.0   4.0   40  287-334     6-45  (73)
208 KOG0285|consensus               35.9      28 0.00062   34.3   2.5   53    2-63    211-263 (460)
209 PF04423 Rad50_zn_hook:  Rad50   35.7      53  0.0012   22.8   3.3   25  255-279    21-45  (54)
210 COG1998 RPS31 Ribosomal protei  35.4      22 0.00047   24.7   1.2   27  229-263    20-46  (51)
211 PRK10866 outer membrane biogen  35.2 1.2E+02  0.0025   28.0   6.5   44  284-332    77-120 (243)
212 KOG1063|consensus               35.1      72  0.0016   34.0   5.4   54    3-63     31-85  (764)
213 PF13525 YfiO:  Outer membrane   35.1 1.4E+02   0.003   26.5   6.7   46  281-331    47-92  (203)
214 PF08969 USP8_dimer:  USP8 dime  35.1      83  0.0018   25.4   4.9   45  270-316    32-76  (115)
215 KOG0300|consensus               35.0      31 0.00067   33.5   2.5   51    4-63    168-218 (481)
216 PF01599 Ribosomal_S27:  Riboso  35.0      24 0.00052   24.3   1.3   30  227-262    17-46  (47)
217 COG2319 FOG: WD40 repeat [Gene  34.9      83  0.0018   28.7   5.5   51    4-62    176-226 (466)
218 KOG0649|consensus               34.7      73  0.0016   30.0   4.8   55    2-65    132-186 (325)
219 PF13176 TPR_7:  Tetratricopept  34.7   1E+02  0.0022   19.1   4.2   22  287-308    10-31  (36)
220 PRK10454 PTS system N,N'-diace  34.6      52  0.0011   27.1   3.5   32  275-306    30-61  (115)
221 KOG0647|consensus               34.5      65  0.0014   31.1   4.6   58    3-67     47-104 (347)
222 PF12773 DZR:  Double zinc ribb  34.4      32  0.0007   23.2   2.0   41  209-266     1-41  (50)
223 TIGR03866 PQQ_ABC_repeats PQQ-  34.1 1.1E+02  0.0025   27.4   6.3   59    4-63    177-235 (300)
224 PF01485 IBR:  IBR domain;  Int  34.1      27 0.00058   24.5   1.6   31  229-263    19-49  (64)
225 PF13428 TPR_14:  Tetratricopep  34.0      93   0.002   20.1   4.2   29  283-316     8-36  (44)
226 smart00531 TFIIE Transcription  33.3      11 0.00025   32.1  -0.6   10  255-264   124-133 (147)
227 smart00834 CxxC_CXXC_SSSS Puta  32.6      31 0.00068   22.1   1.6   31  228-263     5-35  (41)
228 TIGR03504 FimV_Cterm FimV C-te  32.4 1.4E+02   0.003   20.1   4.8   37  283-325     6-42  (44)
229 KOG1274|consensus               32.3      32  0.0007   37.5   2.4   37    2-43    250-292 (933)
230 PF12688 TPR_5:  Tetratrico pep  32.0 2.2E+02  0.0047   23.4   6.9   16  289-304    14-29  (120)
231 KOG0321|consensus               31.9      61  0.0013   34.2   4.3   60    3-62    164-245 (720)
232 KOG0295|consensus               31.5      84  0.0018   31.1   4.9   51    4-63    312-362 (406)
233 KOG2376|consensus               31.4      83  0.0018   33.1   5.1   67  277-343   176-249 (652)
234 KOG0316|consensus               31.3      82  0.0018   29.5   4.5   69    4-86    163-231 (307)
235 KOG1407|consensus               31.2      70  0.0015   30.3   4.2   55    4-64     40-94  (313)
236 PF12862 Apc5:  Anaphase-promot  30.9 2.3E+02   0.005   21.7   6.6   51  284-334     6-57  (94)
237 PF14853 Fis1_TPR_C:  Fis1 C-te  30.8 1.9E+02  0.0041   20.2   7.2   47  279-330     4-50  (53)
238 PF10058 DUF2296:  Predicted in  30.7      29 0.00063   24.5   1.3   33  225-263    19-53  (54)
239 KOG0547|consensus               30.6      96  0.0021   32.1   5.3   59  273-339   112-170 (606)
240 TIGR01206 lysW lysine biosynth  30.4      35 0.00075   24.2   1.6   31  229-265     3-33  (54)
241 COG1594 RPB9 DNA-directed RNA   30.4      27 0.00058   28.6   1.2   32  230-267     4-35  (113)
242 KOG0284|consensus               30.4      37  0.0008   33.9   2.3   58    5-71    159-216 (464)
243 KOG0543|consensus               30.3 1.5E+02  0.0032   29.8   6.4   49  277-330   292-340 (397)
244 KOG1036|consensus               30.1      55  0.0012   31.5   3.3   45    2-55    250-294 (323)
245 PF14828 Amnionless:  Amnionles  30.1      29 0.00063   35.2   1.6   31  227-267   207-239 (437)
246 PF02748 PyrI_C:  Aspartate car  30.0      45 0.00098   23.4   2.1   17  253-269    34-50  (52)
247 PF13424 TPR_12:  Tetratricopep  29.8 1.4E+02   0.003   21.4   5.0   26  283-308    53-78  (78)
248 COG2319 FOG: WD40 repeat [Gene  29.6 1.2E+02  0.0026   27.5   5.7   54    3-65    131-186 (466)
249 PF12895 Apc3:  Anaphase-promot  29.5 1.2E+02  0.0027   22.3   4.7   33  300-334    42-74  (84)
250 KOG0292|consensus               29.2      44 0.00095   36.8   2.8   49    5-62     30-78  (1202)
251 PRK00432 30S ribosomal protein  29.2      33 0.00073   23.8   1.3   26  228-263    20-46  (50)
252 KOG0313|consensus               29.2 1.2E+02  0.0025   30.3   5.4   52    4-63    123-175 (423)
253 PF07282 OrfB_Zn_ribbon:  Putat  28.9      58  0.0013   23.5   2.7   39  222-268    22-60  (69)
254 PRK14890 putative Zn-ribbon RN  28.9      36 0.00077   24.6   1.4   26  208-239     9-34  (59)
255 KOG0267|consensus               28.8      28  0.0006   37.2   1.2   50    4-62     90-139 (825)
256 KOG4283|consensus               28.6      96  0.0021   30.0   4.6   59    2-63     62-130 (397)
257 PRK11827 hypothetical protein;  28.6      31 0.00068   25.0   1.1   34  227-268     7-40  (60)
258 PF10083 DUF2321:  Uncharacteri  28.3 3.5E+02  0.0076   23.5   7.6   24  254-279    68-91  (158)
259 smart00028 TPR Tetratricopepti  28.2      97  0.0021   16.5   3.2   24  282-305     7-30  (34)
260 KOG0307|consensus               28.1      50  0.0011   36.8   3.1   56    3-63     87-145 (1049)
261 smart00858 SAF This domain fam  28.0      41 0.00089   23.5   1.7   16  166-181     3-18  (64)
262 PRK14714 DNA polymerase II lar  27.2      43 0.00094   38.2   2.4   12  254-265   709-720 (1337)
263 PF12895 Apc3:  Anaphase-promot  27.1 1.2E+02  0.0027   22.3   4.4   20  282-301    31-50  (84)
264 KOG1034|consensus               27.1 1.3E+02  0.0028   29.5   5.3   55    5-65    157-211 (385)
265 KOG1240|consensus               27.0      71  0.0015   36.4   3.9   59    3-64   1068-1127(1431)
266 TIGR00373 conserved hypothetic  26.9      32 0.00069   29.8   1.1   34  203-243   108-141 (158)
267 KOG1539|consensus               26.8 1.2E+02  0.0026   33.1   5.5   52    4-63    222-273 (910)
268 PF09862 DUF2089:  Protein of u  26.5      47   0.001   27.2   2.0   16  292-307    63-78  (113)
269 KOG1036|consensus               26.4 1.2E+02  0.0027   29.2   4.9   49    2-62    112-160 (323)
270 KOG0264|consensus               26.2 1.2E+02  0.0026   30.5   5.0   55    3-65    292-347 (422)
271 KOG0265|consensus               26.1      72  0.0016   30.8   3.3   52    4-65    153-204 (338)
272 KOG3914|consensus               26.1      41 0.00088   33.4   1.8   20    3-22    212-231 (390)
273 smart00659 RPOLCX RNA polymera  26.0      46   0.001   22.4   1.5    9  255-263    20-28  (44)
274 COG1997 RPL43A Ribosomal prote  25.9      41 0.00089   26.3   1.4   11  254-264    53-63  (89)
275 KOG0644|consensus               25.6      37  0.0008   37.0   1.5   51    6-65    212-262 (1113)
276 TIGR02521 type_IV_pilW type IV  25.3 2.3E+02   0.005   23.8   6.4   16  289-304    78-93  (234)
277 TIGR02552 LcrH_SycD type III s  25.3 1.9E+02  0.0042   22.9   5.6   40  287-334    62-101 (135)
278 PF03604 DNA_RNApol_7kD:  DNA d  25.0      64  0.0014   20.3   1.9    9  255-263    18-26  (32)
279 COG1198 PriA Primosomal protei  24.9 2.2E+02  0.0048   31.0   7.1   30  311-340   567-596 (730)
280 COG1675 TFA1 Transcription ini  24.7      62  0.0013   28.7   2.5   24  162-185    59-82  (176)
281 TIGR01384 TFS_arch transcripti  24.5      43 0.00094   26.5   1.4   26  230-265     2-27  (104)
282 TIGR00595 priA primosomal prot  24.0   1E+02  0.0022   31.9   4.3   18  317-334   351-368 (505)
283 KOG1523|consensus               23.9      77  0.0017   30.8   3.1   44   19-65    126-176 (361)
284 PF14938 SNAP:  Soluble NSF att  23.8 3.7E+02   0.008   25.0   7.9   44  290-335   129-172 (282)
285 CHL00033 ycf3 photosystem I as  23.8   2E+02  0.0044   24.2   5.6   52  281-333    77-128 (168)
286 PRK00564 hypA hydrogenase nick  23.2      85  0.0018   25.7   2.9   41  173-239    57-97  (117)
287 PRK14892 putative transcriptio  23.1      37  0.0008   27.2   0.7   58  227-291    20-77  (99)
288 TIGR02300 FYDLN_acid conserved  23.1      54  0.0012   27.5   1.7   32  229-269    10-41  (129)
289 PRK04016 DNA-directed RNA poly  23.0 1.3E+02  0.0028   22.0   3.4   13  254-266     4-16  (62)
290 PF13913 zf-C2HC_2:  zinc-finge  23.0      89  0.0019   18.2   2.2   18  255-272     3-20  (25)
291 TIGR02878 spore_ypjB sporulati  22.7 1.6E+02  0.0034   27.3   4.8   35  275-309     4-38  (233)
292 cd02679 MIT_spastin MIT: domai  22.3 1.8E+02   0.004   22.1   4.4   40  274-313     6-45  (79)
293 KOG2055|consensus               22.3 1.1E+02  0.0024   31.2   3.9   29   33-63    213-241 (514)
294 KOG0321|consensus               22.1 1.1E+02  0.0024   32.4   4.0   52    5-62    121-172 (720)
295 PF01927 Mut7-C:  Mut7-C RNAse   22.1      50  0.0011   28.0   1.4   34  229-264    92-134 (147)
296 PF12894 Apc4_WD40:  Anaphase-p  22.0 1.2E+02  0.0025   20.7   3.0   29   34-64     12-40  (47)
297 KOG1273|consensus               22.0 1.1E+02  0.0024   29.8   3.8   54    4-66     43-96  (405)
298 KOG4056|consensus               21.9 1.6E+02  0.0036   25.0   4.4   43  281-323    86-138 (143)
299 smart00647 IBR In Between Ring  21.9      64  0.0014   22.5   1.7   32  228-263    18-49  (64)
300 PF04652 DUF605:  Vta1 like;  I  21.8 1.9E+02  0.0041   28.3   5.7   37  270-306   343-379 (380)
301 KOG0301|consensus               21.3      99  0.0021   33.0   3.5   32   29-63    136-167 (745)
302 KOG4378|consensus               21.2 1.4E+02   0.003   30.9   4.4   54    2-62    183-236 (673)
303 PRK14873 primosome assembly pr  21.0 1.3E+02  0.0028   32.4   4.4   24  311-334   503-526 (665)
304 PLN03086 PRLI-interacting fact  20.9 1.3E+02  0.0029   31.6   4.4   57  206-269   407-468 (567)
305 KOG0310|consensus               20.8 1.7E+02  0.0037   29.9   5.0   52    3-62    130-181 (487)
306 TIGR00990 3a0801s09 mitochondr  20.7 2.8E+02  0.0062   28.9   7.0   51  275-334   126-176 (615)
307 PF02561 FliS:  Flagellar prote  20.6 4.3E+02  0.0094   21.3   6.7   45  269-313    22-69  (122)
308 KOG0282|consensus               20.5      74  0.0016   32.4   2.4   37   29-66    210-246 (503)
309 PRK00415 rps27e 30S ribosomal   20.4      70  0.0015   23.1   1.6   33  224-264     7-40  (59)
310 COG2178 Predicted RNA-binding   20.3 5.8E+02   0.013   23.1   7.8   47  266-312    19-68  (204)
311 COG1675 TFA1 Transcription ini  20.3 1.4E+02  0.0029   26.6   3.8   39  196-241   105-143 (176)
312 PF08631 SPO22:  Meiosis protei  20.2   4E+02  0.0087   24.8   7.3   57  277-333    36-99  (278)
313 PF10122 Mu-like_Com:  Mu-like   20.1      43 0.00093   23.4   0.5   31  227-263     3-33  (51)

No 1  
>KOG2084|consensus
Probab=99.65  E-value=3.5e-15  Score=149.75  Aligned_cols=180  Identities=21%  Similarity=0.351  Sum_probs=135.6

Q ss_pred             eeeeeeccccccCCCCCCCCeEEEEeCCEEEEEEeeccCCCC-ceeecccccccCCCHHHHHHHHHhhCCCceecCCCCC
Q psy5736         136 YVGVGIYPTVSLFNHDCYPAVTRYFNGKNIIVKALRPLKPKE-VVAENYGLVFSRKHLIDRQKVLSARYWFECKCRACVE  214 (346)
Q Consensus       136 ~~g~glyp~~s~~NHSC~PN~~~~f~g~~~~v~a~r~I~~Ge-el~~~Y~~~~~~~~~~~R~~~L~~~~~F~C~C~~C~~  214 (346)
                      .++.|+||..+++||||.||+...|++....+++...+.+++ +|+++|.+  ..+++..||+.|+..|.|.|.|++|.+
T Consensus       197 ~~~~~l~~~~~~~~hsC~pn~~~~~~~~~~~~~~~~~~~~~~~~l~~~y~~--~~~~~~~r~~~l~~~~~f~c~c~rc~d  274 (482)
T KOG2084|consen  197 FLGRGLFPGSSLFNHSCFPNISVIFDGRGLALLVPAGIDAGEEELTISYTD--PLLSTASRQKQLRQSKLFSCQCPRCLD  274 (482)
T ss_pred             cceeeecccchhcccCCCCCeEEEECCceeEEEeecccCCCCCEEEEeecc--cccCHHHHHHHHhhccceeeecCCCCC
Confidence            589999999999999999999999999999999999999987 99999998  689999999999999999999999998


Q ss_pred             CCccccccccCccccccCCcCcCCcccccccCCcccccCcccccccCCcCChHHHHHHHHHHHHHHHHHHHHhccccHHH
Q psy5736         215 NWPLMESLEKYPIRIRCSNDNCGQIIATVKKLEPSAKKVEKKCESCNSTSDLTEIKTKLSELNEMFYRGIEQMNTSCFRE  294 (346)
Q Consensus       215 ~~~~~~~~~~~~~~~~C~~~~C~g~~~~~~~~~~~~~~~~~~C~~C~~~~~~~~~~~~l~~~~~~~~~a~~~~~~~~~~~  294 (346)
                      ++    +.+.+..+++|.+..|.+.+.+.....+   ...|.|..|........+..........      .....   .
T Consensus       275 ~~----~~~~~~~~~~c~~~~~~~~~~~~~~~~~---~~~w~~~~~~~~~~~~~~~~~~~~~~~~------~~~~~---~  338 (482)
T KOG2084|consen  275 PT----ELGTFLSSLRCENCTCGGLLGTSFLDKE---DLQWPCTECALVRLKAYVVESREELQNE------LLDAF---S  338 (482)
T ss_pred             CC----ccccchhhhhhcCCCCCCccCCCccccc---CCCccccccccchhHHHHHHHHHHHHhh------ccccC---C
Confidence            76    5566678899998888877665543221   2359999999987766555443332211      01111   1


Q ss_pred             HHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCC
Q psy5736         295 AVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGN  333 (346)
Q Consensus       295 ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~  333 (346)
                      .....+.+.......+++.|.........+...+....+
T Consensus       339 ~~~~~e~l~~~~~~~~~~~~~~~~~~~~~~~~il~~~~~  377 (482)
T KOG2084|consen  339 DLLIEELLLLRQESLELPNDFEVLLLKLHLLFILGSLLG  377 (482)
T ss_pred             hhhhHHHHHHHHHhhhCcchHHHHHHHHHHHHHHHHHHh
Confidence            122344555666667888888777777766666665544


No 2  
>PF00856 SET:  SET domain;  InterPro: IPR001214 The SET domain appears generally as one part of a larger multidomain protein, and recently there were described three structures of very different proteins with distinct domain compositions: Neurospora crassa DIM-5, a member of the Su(var) family of HKMTs which methylate histone H3 on lysine 9,human SET7 (also called SET9), which methylates H3 on lysine 4 and garden pea Rubisco LSMT, an enzyme that does not modify histones, but instead methylates lysine 14 in the flexible tail of the large subunit of the enzyme Rubisco. The SET domain itself turned out to be an uncommon structure. Although in all three studies, electron density maps revealed the location of the AdoMet or AdoHcy cofactor, the SET domain bears no similarity at all to the canonical/AdoMet-dependent methyltransferase fold. Strictly conserved in the C-terminal motif of the SET domain tyrosine could be involved in abstracting a proton from the protonated amino group of the substrate lysine, promoting its nucleophilic attack on the sulphonium methyl group of the AdoMet cofactor. In contrast to the AdoMet-dependent protein methyltranferases of the classical type, which tend to bind their polypeptide substrates on top of the cofactor, it is noted from the Rubisco LSMT structure that the AdoMet seems to bind in a separate cleft, suggesting how a polypeptide substrate could be subjected to multiple rounds of methylation without having to be released from the enzyme. In contrast, SET7/9 is able to add only a single methyl group to its substrate. It has been demonstrated that association of SET domain and myotubularin-related proteins modulates growth control []. The SET domain-containing Drosophila melanogaster (Fruit fly) protein, enhancer of zeste, has a function in segment determination and the mammalian homologue may be involved in the regulation of gene transcription and chromatin structure. Histone lysine methylation is part of the histone code that regulated chromatin function and epigenetic control of gene function. Histone lysine methyltransferases (HMTase) differ both in their substrate specificity for the various acceptor lysines as well as in their product specificity for the number of methyl groups (one, two, or three) they transfer. With just one exception [], the HMTases belong to SET family that can be classified according to the sequences surrounding the SET domain [, ]. Structural studies on the human SET7/9, a mono-methylase, have revealed the molecular basis for the specificity of the enzyme for the histone-target and the roles of the invariant residues in the SET domain in determining the methylation specificities [].  The pre-SET domain, as found in the SUV39 SET family, contains nine invariant cysteine residues that are grouped into two segments separated by a region of variable length. These 9 cysteines coordinate 3 zinc ions to form to form a triangular cluster, where each of the zinc ions is coordinated by 4 four cysteines to give a tetrahedral configuration. The function of this domain is structural, holding together 2 long segments of random coils. The C-terminal region including the post-SET domain is disordered when not interacting with a histone tail and in the absence of zinc. The three conserved cysteines in the post-SET domain form a zinc-binding site when coupled to a fourth conserved cysteine in the knot-like structure close to the SET domain active site []. The structured post-SET region brings in the C-terminal residues that participate in S-adenosylmethine-binding and histone tail interactions. The three conserved cysteine residues are essential for HMTase activity, as replacement with serine abolishes HMTase activity [], []. ; GO: 0005515 protein binding; PDB: 3TG5_A 3S7F_A 3RIB_B 3TG4_A 3S7J_A 3S7D_A 3S7B_A 3H6L_A 3SMT_A 3K5K_A ....
Probab=99.39  E-value=3.1e-13  Score=114.54  Aligned_cols=50  Identities=32%  Similarity=0.672  Sum_probs=45.5

Q ss_pred             ceeeeeeccccccCCCCCCCCeEEEEe----CCEEEEEEeeccCCCCceeeccc
Q psy5736         135 NYVGVGIYPTVSLFNHDCYPAVTRYFN----GKNIIVKALRPLKPKEVVAENYG  184 (346)
Q Consensus       135 ~~~g~glyp~~s~~NHSC~PN~~~~f~----g~~~~v~a~r~I~~Geel~~~Y~  184 (346)
                      ...+.++||.++++||||.|||.+.|.    ++.++|+|+|||++||||+++||
T Consensus       109 ~~~~~~l~p~~d~~NHsc~pn~~~~~~~~~~~~~~~~~a~r~I~~GeEi~isYG  162 (162)
T PF00856_consen  109 DRDGIALYPFADMLNHSCDPNCEVSFDFDGDGGCLVVRATRDIKKGEEIFISYG  162 (162)
T ss_dssp             EEEEEEEETGGGGSEEESSTSEEEEEEEETTTTEEEEEESS-B-TTSBEEEEST
T ss_pred             cccccccCcHhHheccccccccceeeEeecccceEEEEECCccCCCCEEEEEEC
Confidence            457899999999999999999999998    79999999999999999999997


No 3  
>KOG1517|consensus
Probab=99.20  E-value=1.2e-11  Score=129.67  Aligned_cols=77  Identities=31%  Similarity=0.521  Sum_probs=72.0

Q ss_pred             ceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCCcc--ccchhhhhhhhhcccc
Q psy5736           6 LKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNYAS--HEHRLVTSFQALTDTS   83 (346)
Q Consensus         6 ~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~~s--h~~~Lv~a~r~L~~~~   83 (346)
                      .+|.||||+.++.+++|.|+.+    ..++||+|++||++|++|+|+|++||.||||++|..  ..++||++|..|+++.
T Consensus      1086 ~~i~vwd~e~~~~l~~F~n~~~----~~t~Vs~l~liNe~D~aLlLtas~dGvIRIwk~y~~~~~~~eLVTaw~~Ls~~~ 1161 (1387)
T KOG1517|consen 1086 ERIRVWDWEKGRLLNGFDNGAF----PDTRVSDLELINEQDDALLLTASSDGVIRIWKDYADKWKKPELVTAWSSLSDQL 1161 (1387)
T ss_pred             ceEEEEecccCceeccccCCCC----CCCccceeeeecccchhheeeeccCceEEEecccccccCCceeEEeeccccccC
Confidence            6899999999999999999997    789999999999999999999999999999999987  4689999999998887


Q ss_pred             ccc
Q psy5736          84 FVN   86 (346)
Q Consensus        84 y~~   86 (346)
                      +-.
T Consensus      1162 ~~~ 1164 (1387)
T KOG1517|consen 1162 PGA 1164 (1387)
T ss_pred             ccC
Confidence            765


No 4  
>smart00317 SET SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain. Putative methyl transferase, based on outlier plant homologues
Probab=99.10  E-value=6e-11  Score=95.90  Aligned_cols=46  Identities=28%  Similarity=0.445  Sum_probs=41.0

Q ss_pred             eeeeccccccCCCCCCCCeEEEEeC--C--EEEEEEeeccCCCCceeecc
Q psy5736         138 GVGIYPTVSLFNHDCYPAVTRYFNG--K--NIIVKALRPLKPKEVVAENY  183 (346)
Q Consensus       138 g~glyp~~s~~NHSC~PN~~~~f~g--~--~~~v~a~r~I~~Geel~~~Y  183 (346)
                      +..++|.++++||||.||+...+..  +  .+.++|+|+|++||||+++|
T Consensus        67 ~~~~~~~~~~iNHsc~pN~~~~~~~~~~~~~~~~~a~r~I~~GeEi~i~Y  116 (116)
T smart00317       67 ARRKGNIARFINHSCEPNCELLFVEVNGDSRIVIFALRDIKPGEELTIDY  116 (116)
T ss_pred             CCccCcHHHeeCCCCCCCEEEEEEEECCCcEEEEEECCCcCCCCEEeecC
Confidence            3448999999999999999988753  2  69999999999999999998


No 5  
>KOG2589|consensus
Probab=98.63  E-value=1.6e-08  Score=96.39  Aligned_cols=60  Identities=32%  Similarity=0.560  Sum_probs=52.0

Q ss_pred             ccccCCCCCCCCeEEEEeC-CEEEEEEeeccCCCCceeecccccccCCCHHHHHHHHHhhCCCceecCCCCCC
Q psy5736         144 TVSLFNHDCYPAVTRYFNG-KNIIVKALRPLKPKEVVAENYGLVFSRKHLIDRQKVLSARYWFECKCRACVEN  215 (346)
Q Consensus       144 ~~s~~NHSC~PN~~~~f~g-~~~~v~a~r~I~~Geel~~~Y~~~~~~~~~~~R~~~L~~~~~F~C~C~~C~~~  215 (346)
                      -++++||.|.|||.++-.| +++.|+++|||++|||||.=|+..|...            -.-.|.|..|...
T Consensus       196 PaafINHDCrpnCkFvs~g~~tacvkvlRDIePGeEITcFYgs~fFG~------------~N~~CeC~TCER~  256 (453)
T KOG2589|consen  196 PAAFINHDCRPNCKFVSTGRDTACVKVLRDIEPGEEITCFYGSGFFGE------------NNEECECVTCERR  256 (453)
T ss_pred             cHHhhcCCCCCCceeecCCCceeeeehhhcCCCCceeEEeecccccCC------------CCceeEEeecccc
Confidence            4779999999999999887 8999999999999999999999976443            2358999999864


No 6  
>COG2940 Proteins containing SET domain [General function prediction only]
Probab=97.98  E-value=4.2e-06  Score=85.29  Aligned_cols=71  Identities=23%  Similarity=0.358  Sum_probs=57.3

Q ss_pred             cccccCCCCCCCCeEEEEeC----CEEEEEEeeccCCCCceeecccccccCCCHHHHHHHHHhhCCCceecCCCCCC
Q psy5736         143 PTVSLFNHDCYPAVTRYFNG----KNIIVKALRPLKPKEVVAENYGLVFSRKHLIDRQKVLSARYWFECKCRACVEN  215 (346)
Q Consensus       143 p~~s~~NHSC~PN~~~~f~g----~~~~v~a~r~I~~Geel~~~Y~~~~~~~~~~~R~~~L~~~~~F~C~C~~C~~~  215 (346)
                      ..+.++||||.||+.....+    ..+.++|+|||.+||||+..|+..  .....++...+...+...|.|.+|...
T Consensus       404 ~~~r~~nHS~~pN~~~~~~~~~g~~~~~~~~~rDI~~geEl~~dy~~~--~~~~~~~~~~~~~~~~~~~~~~~~~~~  478 (480)
T COG2940         404 DVARFINHSCTPNCEASPIEVNGIFKISIYAIRDIKAGEELTYDYGPS--LEDNRELKKLLEKRWGCACGEDRCSHT  478 (480)
T ss_pred             cccceeecCCCCCcceecccccccceeeecccccchhhhhhccccccc--cccchhhhhhhhhhhccccCCCccCCC
Confidence            34669999999999999764    389999999999999999999873  333333466777789999999999764


No 7  
>KOG4442|consensus
Probab=97.91  E-value=7e-06  Score=84.34  Aligned_cols=41  Identities=22%  Similarity=0.388  Sum_probs=34.0

Q ss_pred             cccCCCCCCCCeEEE---EeC-CEEEEEEeeccCCCCceeecccc
Q psy5736         145 VSLFNHDCYPAVTRY---FNG-KNIIVKALRPLKPKEVVAENYGL  185 (346)
Q Consensus       145 ~s~~NHSC~PN~~~~---f~g-~~~~v~a~r~I~~Geel~~~Y~~  185 (346)
                      +-++||||+|||.+-   ..| .+|-+.|.|+|++|||||+.|.-
T Consensus       194 aRFiNHSC~PNa~~~KWtV~~~lRvGiFakk~I~~GEEITFDYqf  238 (729)
T KOG4442|consen  194 ARFINHSCDPNAEVQKWTVPDELRVGIFAKKVIKPGEEITFDYQF  238 (729)
T ss_pred             HHhhcCCCCCCceeeeeeeCCeeEEEEeEecccCCCceeeEeccc
Confidence            447899999999854   223 57888999999999999999964


No 8  
>KOG1080|consensus
Probab=97.83  E-value=1.1e-05  Score=87.36  Aligned_cols=41  Identities=24%  Similarity=0.455  Sum_probs=36.1

Q ss_pred             ccccCCCCCCCCeEEEEe---C-CEEEEEEeeccCCCCceeeccc
Q psy5736         144 TVSLFNHDCYPAVTRYFN---G-KNIIVKALRPLKPKEVVAENYG  184 (346)
Q Consensus       144 ~~s~~NHSC~PN~~~~f~---g-~~~~v~a~r~I~~Geel~~~Y~  184 (346)
                      .|.+|||||.|||..-..   | .+++|.|.|+|++|||||..|-
T Consensus       939 iAr~InHsC~PNCyakvi~V~g~~~IvIyakr~I~~~EElTYDYk  983 (1005)
T KOG1080|consen  939 IARFINHSCNPNCYAKVITVEGDKRIVIYSKRDIAAGEELTYDYK  983 (1005)
T ss_pred             hhheeecccCCCceeeEEEecCeeEEEEEEecccccCceeeeecc
Confidence            378999999999996653   3 6899999999999999999994


No 9  
>KOG1082|consensus
Probab=97.16  E-value=0.00026  Score=69.77  Aligned_cols=42  Identities=29%  Similarity=0.460  Sum_probs=35.9

Q ss_pred             ccCCCCCCCCeEEEEe--C------CEEEEEEeeccCCCCceeecccccc
Q psy5736         146 SLFNHDCYPAVTRYFN--G------KNIIVKALRPLKPKEVVAENYGLVF  187 (346)
Q Consensus       146 s~~NHSC~PN~~~~f~--g------~~~~v~a~r~I~~Geel~~~Y~~~~  187 (346)
                      -++||||.||+.+...  +      -.+.+.|+++|++|+|||..|+..+
T Consensus       274 RfinHSC~PN~~~~~v~~~~~~~~~~~i~ffa~~~I~p~~ELT~dYg~~~  323 (364)
T KOG1082|consen  274 RFINHSCSPNLLYQAVFQDEFVLLYLRIGFFALRDISPGEELTLDYGKAY  323 (364)
T ss_pred             ccccCCCCccceeeeeeecCCccchheeeeeeccccCCCcccchhhcccc
Confidence            3789999999998753  3      3689999999999999999999743


No 10 
>KOG1141|consensus
Probab=96.42  E-value=0.0012  Score=69.32  Aligned_cols=41  Identities=17%  Similarity=0.265  Sum_probs=33.1

Q ss_pred             cccCCCCCCCCeEEE--EeCC------EEEEEEeeccCCCCceeecccc
Q psy5736         145 VSLFNHDCYPAVTRY--FNGK------NIIVKALRPLKPKEVVAENYGL  185 (346)
Q Consensus       145 ~s~~NHSC~PN~~~~--f~g~------~~~v~a~r~I~~Geel~~~Y~~  185 (346)
                      +-++||||+||+.+.  |.+.      -+.+.|.+-|++|+|||..|..
T Consensus      1190 GRfLNHSC~PNl~VQnVfvdTHdlrfPwVAFFt~kyVkAgtELTWDY~Y 1238 (1262)
T KOG1141|consen 1190 GRFLNHSCDPNLHVQNVFVDTHDLRFPWVAFFTRKYVKAGTELTWDYQY 1238 (1262)
T ss_pred             hhhhccCCCccceeeeeeeeccccCCchhhhhhhhhhccCceeeeeccc
Confidence            347899999999865  5442      3567888999999999999975


No 11 
>KOG1083|consensus
Probab=96.35  E-value=0.0023  Score=68.78  Aligned_cols=41  Identities=29%  Similarity=0.473  Sum_probs=34.4

Q ss_pred             cccCCCCCCCCeEEE---EeC-CEEEEEEeeccCCCCceeecccc
Q psy5736         145 VSLFNHDCYPAVTRY---FNG-KNIIVKALRPLKPKEVVAENYGL  185 (346)
Q Consensus       145 ~s~~NHSC~PN~~~~---f~g-~~~~v~a~r~I~~Geel~~~Y~~  185 (346)
                      +-.+||||.|||...   .+| .++.|.|.|||++||||+..|--
T Consensus      1251 ~RfinhscKPNc~~qkwSVNG~~Rv~L~A~rDi~kGEELtYDYN~ 1295 (1306)
T KOG1083|consen 1251 ARFINHSCKPNCEMQKWSVNGEYRVGLFALRDLPKGEELTYDYNF 1295 (1306)
T ss_pred             ccccccccCCCCccccccccceeeeeeeecCCCCCCceEEEeccc
Confidence            347899999999854   345 68999999999999999999853


No 12 
>KOG1337|consensus
Probab=95.87  E-value=0.0067  Score=61.84  Aligned_cols=61  Identities=25%  Similarity=0.273  Sum_probs=48.5

Q ss_pred             eeeeccccccCCCCCCCCeEEEEe-CCEEEEEEeeccCCCCceeecccccccCCCHHHHHHHHHhhCCCc
Q psy5736         138 GVGIYPTVSLFNHDCYPAVTRYFN-GKNIIVKALRPLKPKEVVAENYGLVFSRKHLIDRQKVLSARYWFE  206 (346)
Q Consensus       138 g~glyp~~s~~NHSC~PN~~~~f~-g~~~~v~a~r~I~~Geel~~~Y~~~~~~~~~~~R~~~L~~~~~F~  206 (346)
                      -.++.|..-+.||+|.+....+.. +..+.+.+.++|.+||||+++||+.    +    ...|...|||.
T Consensus       230 ~~~L~P~~D~~NH~~~~~~~~~~~~d~~~~l~~~~~v~~geevfi~YG~~----~----N~eLL~~YGFv  291 (472)
T KOG1337|consen  230 NEALAPLIDLLNHSPEVIKAGYNQEDEAVELVAERDVSAGEEVFINYGPK----S----NAELLLHYGFV  291 (472)
T ss_pred             chhhhhhHHhhccCchhccccccCCCCcEEEEEeeeecCCCeEEEecCCC----c----hHHHHHhcCCC
Confidence            367899999999999994443332 4589999999999999999999972    2    23466689995


No 13 
>KOG1085|consensus
Probab=95.66  E-value=0.0074  Score=56.58  Aligned_cols=41  Identities=24%  Similarity=0.340  Sum_probs=35.6

Q ss_pred             cccCCCCCCCCeEEEE---eC-CEEEEEEeeccCCCCceeecccc
Q psy5736         145 VSLFNHDCYPAVTRYF---NG-KNIIVKALRPLKPKEVVAENYGL  185 (346)
Q Consensus       145 ~s~~NHSC~PN~~~~f---~g-~~~~v~a~r~I~~Geel~~~Y~~  185 (346)
                      .-|||||=.+|+....   +| -.+++.|.|+|.+||||+..||+
T Consensus       334 GRLINHS~~gNl~TKvv~Idg~pHLiLvA~rdIa~GEELlYDYGD  378 (392)
T KOG1085|consen  334 GRLINHSVRGNLKTKVVEIDGSPHLILVARRDIAQGEELLYDYGD  378 (392)
T ss_pred             hhhhcccccCcceeeEEEecCCceEEEEeccccccchhhhhhccc
Confidence            4589999999998553   34 58999999999999999999998


No 14 
>KOG0294|consensus
Probab=94.97  E-value=0.036  Score=52.83  Aligned_cols=65  Identities=15%  Similarity=0.176  Sum_probs=51.5

Q ss_pred             cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCCcc--ccchhhh
Q psy5736           3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNYAS--HEHRLVT   74 (346)
Q Consensus         3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~~s--h~~~Lv~   74 (346)
                      |-+++|++||..+...++.|.       -|..||+++.++-..+.-++.++++||.|++|+.=+.  ..+++++
T Consensus       226 ~d~~~i~~~D~ds~~~~~~~~-------AH~~RVK~i~~~~~~~~~~lvTaSSDG~I~vWd~~~~~k~~~~~l~  292 (362)
T KOG0294|consen  226 GDNEWISLKDTDSDTPLTEFL-------AHENRVKDIASYTNPEHEYLVTASSDGFIKVWDIDMETKKRPTLLA  292 (362)
T ss_pred             cCCceEEEeccCCCccceeee-------cchhheeeeEEEecCCceEEEEeccCceEEEEEccccccCCcceeE
Confidence            346899999999988888773       2789999999877777899999999999999965443  2344444


No 15 
>KOG0282|consensus
Probab=94.22  E-value=0.071  Score=53.27  Aligned_cols=53  Identities=19%  Similarity=0.340  Sum_probs=44.2

Q ss_pred             CcccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736           2 TGLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYR   63 (346)
Q Consensus         2 ~~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r   63 (346)
                      +|.++.|.+||.|+|+-+.+|..+.        ..+++| +++++..+.|+|.+||.|+.|.
T Consensus       276 ~sfD~~lKlwDtETG~~~~~f~~~~--------~~~cvk-f~pd~~n~fl~G~sd~ki~~wD  328 (503)
T KOG0282|consen  276 ASFDRFLKLWDTETGQVLSRFHLDK--------VPTCVK-FHPDNQNIFLVGGSDKKIRQWD  328 (503)
T ss_pred             eecceeeeeeccccceEEEEEecCC--------Cceeee-cCCCCCcEEEEecCCCcEEEEe
Confidence            4788999999999999999997665        356666 4556678999999999999994


No 16 
>KOG0296|consensus
Probab=94.10  E-value=0.048  Score=52.81  Aligned_cols=51  Identities=20%  Similarity=0.211  Sum_probs=39.7

Q ss_pred             CcccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736           2 TGLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYR   63 (346)
Q Consensus         2 ~~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r   63 (346)
                      .+++|+|.|||..+.+.  ++.++++      .-|+.|+++|   ..+++++..+|.||+|.
T Consensus       304 G~vdG~i~iyD~a~~~~--R~~c~he------~~V~~l~w~~---t~~l~t~c~~g~v~~wD  354 (399)
T KOG0296|consen  304 GSVDGTIAIYDLAASTL--RHICEHE------DGVTKLKWLN---TDYLLTACANGKVRQWD  354 (399)
T ss_pred             ccccceEEEEecccchh--heeccCC------CceEEEEEcC---cchheeeccCceEEeee
Confidence            36899999999987744  3344443      3499999999   55668899999999993


No 17 
>PF13424 TPR_12:  Tetratricopeptide repeat; PDB: 3RO2_A 3Q15_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 4A1S_B 3CEQ_B 3EDT_H ....
Probab=94.07  E-value=0.19  Score=37.31  Aligned_cols=51  Identities=16%  Similarity=0.114  Sum_probs=43.7

Q ss_pred             HHHHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCCc
Q psy5736         283 GIEQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGNK  334 (346)
Q Consensus       283 a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~~  334 (346)
                      |.-....|++++|+..+++.++. .+.+.+.|+.+..+...|+.+|...|+.
T Consensus        12 a~~~~~~~~~~~A~~~~~~al~~-~~~~~~~~~~~a~~~~~lg~~~~~~g~~   62 (78)
T PF13424_consen   12 ARVYRELGRYDEALDYYEKALDI-EEQLGDDHPDTANTLNNLGECYYRLGDY   62 (78)
T ss_dssp             HHHHHHTT-HHHHHHHHHHHHHH-HHHTTTHHHHHHHHHHHHHHHHHHTTHH
T ss_pred             HHHHHHcCCHHHHHHHHHHHHHH-HHHHCCCCHHHHHHHHHHHHHHHHcCCH
Confidence            44445679999999999999999 7777889999999999999999999875


No 18 
>KOG1079|consensus
Probab=93.27  E-value=0.074  Score=55.26  Aligned_cols=42  Identities=21%  Similarity=0.263  Sum_probs=35.1

Q ss_pred             ccccCCCCCCCCeEEEE--e-C-CEEEEEEeeccCCCCceeecccc
Q psy5736         144 TVSLFNHDCYPAVTRYF--N-G-KNIIVKALRPLKPKEVVAENYGL  185 (346)
Q Consensus       144 ~~s~~NHSC~PN~~~~f--~-g-~~~~v~a~r~I~~Geel~~~Y~~  185 (346)
                      .+-+.|||=.|||....  . | .++-++|.|.|.+|||||+.|..
T Consensus       665 k~rFANHS~nPNCYAkvm~V~GdhRIGifAkRaIeagEELffDYrY  710 (739)
T KOG1079|consen  665 KIRFANHSFNPNCYAKVMMVAGDHRIGIFAKRAIEAGEELFFDYRY  710 (739)
T ss_pred             hhhhccCCCCCCcEEEEEEecCCcceeeeehhhcccCceeeeeecc
Confidence            34578999999998543  2 4 68889999999999999999975


No 19 
>PLN00181 protein SPA1-RELATED; Provisional
Probab=92.78  E-value=0.19  Score=54.41  Aligned_cols=54  Identities=20%  Similarity=0.343  Sum_probs=43.8

Q ss_pred             cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeC
Q psy5736           3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRN   64 (346)
Q Consensus         3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~   64 (346)
                      +.+|+|.|||.++++.+..+.       .|...|+++.+. ..++.++++|+.||.|++|..
T Consensus       552 ~~Dg~v~lWd~~~~~~~~~~~-------~H~~~V~~l~~~-p~~~~~L~Sgs~Dg~v~iWd~  605 (793)
T PLN00181        552 NFEGVVQVWDVARSQLVTEMK-------EHEKRVWSIDYS-SADPTLLASGSDDGSVKLWSI  605 (793)
T ss_pred             eCCCeEEEEECCCCeEEEEec-------CCCCCEEEEEEc-CCCCCEEEEEcCCCEEEEEEC
Confidence            568999999999988777663       266789999964 345678899999999999954


No 20 
>PF13374 TPR_10:  Tetratricopeptide repeat; PDB: 3CEQ_B 3EDT_H 3NF1_A.
Probab=92.35  E-value=0.32  Score=31.19  Aligned_cols=34  Identities=18%  Similarity=0.095  Sum_probs=19.9

Q ss_pred             HHHHhccccHHHHHHHHHHHHHHhhcccCCCcHh
Q psy5736         283 GIEQMNTSCFREAVESLTKFSDQIHELVVPPYKL  316 (346)
Q Consensus       283 a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~  316 (346)
                      |.-....|++++|++++++.++.+.+++.+.||.
T Consensus         9 a~~~~~~g~~~~A~~~~~~al~~~~~~~G~~Hpd   42 (42)
T PF13374_consen    9 ANAYRAQGRYEEALELLEEALEIRERLLGPDHPD   42 (42)
T ss_dssp             HHHHHHCT-HHHHHHHHHHHHHHH----------
T ss_pred             HHHHHhhhhcchhhHHHHHHHHHHHHHhcccccC
Confidence            4445677999999999999999999999999984


No 21 
>KOG0302|consensus
Probab=91.34  E-value=0.3  Score=47.78  Aligned_cols=57  Identities=26%  Similarity=0.473  Sum_probs=40.3

Q ss_pred             cccceEEEEecCCC-ceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE--eCCcc
Q psy5736           3 GLFLKEGIWDHITG-SRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY--RNYAS   67 (346)
Q Consensus         3 ~~~~~~~~w~~~~~-~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~--r~~~s   67 (346)
                      +.+|+|.|||..++ ++-+.+.  +.    |.+.|.-|-+ | ...++++.|.+||.++||  |.+.+
T Consensus       277 S~DgsIrIWDiRs~~~~~~~~~--kA----h~sDVNVISW-n-r~~~lLasG~DdGt~~iwDLR~~~~  336 (440)
T KOG0302|consen  277 SCDGSIRIWDIRSGPKKAAVST--KA----HNSDVNVISW-N-RREPLLASGGDDGTLSIWDLRQFKS  336 (440)
T ss_pred             ecCceEEEEEecCCCccceeEe--ec----cCCceeeEEc-c-CCcceeeecCCCceEEEEEhhhccC
Confidence            46899999999998 3444333  32    6666666554 3 234489999999999999  77754


No 22 
>KOG1188|consensus
Probab=91.32  E-value=0.3  Score=47.14  Aligned_cols=51  Identities=14%  Similarity=0.236  Sum_probs=42.1

Q ss_pred             cceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736           5 FLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY   62 (346)
Q Consensus         5 ~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~   62 (346)
                      -|.|+++||.+++.|--|.-       +..-+.-++|++++.-..+.++++||.||+|
T Consensus        49 ngsv~lyd~~tg~~l~~fk~-------~~~~~N~vrf~~~ds~h~v~s~ssDG~Vr~w   99 (376)
T KOG1188|consen   49 NGSVRLYDKGTGQLLEEFKG-------PPATTNGVRFISCDSPHGVISCSSDGTVRLW   99 (376)
T ss_pred             CCeEEEEeccchhhhheecC-------CCCcccceEEecCCCCCeeEEeccCCeEEEE
Confidence            48999999999988887752       3455779999999665666889999999999


No 23 
>KOG0646|consensus
Probab=90.84  E-value=0.19  Score=50.10  Aligned_cols=51  Identities=24%  Similarity=0.343  Sum_probs=45.1

Q ss_pred             cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736           3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY   62 (346)
Q Consensus         3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~   62 (346)
                      +..|+|.+|-..+|+.|+.++       .|.-.||.|+|.+  |++++.+|+.||.|.+|
T Consensus       100 ~i~g~lYlWelssG~LL~v~~-------aHYQ~ITcL~fs~--dgs~iiTgskDg~V~vW  150 (476)
T KOG0646|consen  100 TISGNLYLWELSSGILLNVLS-------AHYQSITCLKFSD--DGSHIITGSKDGAVLVW  150 (476)
T ss_pred             cccCcEEEEEeccccHHHHHH-------hhccceeEEEEeC--CCcEEEecCCCccEEEE
Confidence            378999999999999998762       3788999999875  88999999999999999


No 24 
>KOG0283|consensus
Probab=90.60  E-value=0.31  Score=51.55  Aligned_cols=54  Identities=17%  Similarity=0.274  Sum_probs=47.6

Q ss_pred             cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCC
Q psy5736           3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNY   65 (346)
Q Consensus         3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~   65 (346)
                      ..|.||.+|+-++..-|+.|.        |..-+|+++ +|+-||.+.+.|+=||.+|+|.-.
T Consensus       387 SMDKTVRLWh~~~~~CL~~F~--------HndfVTcVa-FnPvDDryFiSGSLD~KvRiWsI~  440 (712)
T KOG0283|consen  387 SMDKTVRLWHPGRKECLKVFS--------HNDFVTCVA-FNPVDDRYFISGSLDGKVRLWSIS  440 (712)
T ss_pred             cccccEEeecCCCcceeeEEe--------cCCeeEEEE-ecccCCCcEeecccccceEEeecC
Confidence            468899999999999999995        556799999 588999999999999999999544


No 25 
>PF00400 WD40:  WD domain, G-beta repeat;  InterPro: IPR019781 WD-40 repeats (also known as WD or beta-transducin repeats) are short ~40 amino acid motifs, often terminating in a Trp-Asp (W-D) dipeptide. WD40 repeats usually assume a 7-8 bladed beta-propeller fold, but proteins have been found with 4 to 16 repeated units, which also form a circularised beta-propeller structure. WD-repeat proteins are a large family found in all eukaryotes and are implicated in a variety of functions ranging from signal transduction and transcription regulation to cell cycle control and apoptosis. Repeated WD40 motifs act as a site for protein-protein interaction, and proteins containing WD40 repeats are known to serve as platforms for the assembly of protein complexes or mediators of transient interplay among other proteins. The specificity of the proteins is determined by the sequences outside the repeats themselves. Examples of such complexes are G proteins (beta subunit is a beta-propeller), TAFII transcription factor, and E3 ubiquitin ligase [, ]. In Arabidopsis spp., several WD40-containing proteins act as key regulators of plant-specific developmental events.; PDB: 2ZKQ_a 3CFV_B 3CFS_B 1PEV_A 1NR0_A 1VYH_T 3RFH_A 3O2Z_T 3FRX_C 3U5G_g ....
Probab=90.27  E-value=0.4  Score=30.41  Aligned_cols=31  Identities=19%  Similarity=0.445  Sum_probs=25.5

Q ss_pred             CCCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736          31 RNSARCTALTLLNEHDDSLLLTGYDDGSLSVYR   63 (346)
Q Consensus        31 ~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r   63 (346)
                      .|...|++|.+...  ...+++|+.||.|++|.
T Consensus         9 ~h~~~i~~i~~~~~--~~~~~s~~~D~~i~vwd   39 (39)
T PF00400_consen    9 GHSSSINSIAWSPD--GNFLASGSSDGTIRVWD   39 (39)
T ss_dssp             SSSSSEEEEEEETT--SSEEEEEETTSEEEEEE
T ss_pred             CCCCcEEEEEEecc--cccceeeCCCCEEEEEC
Confidence            36778999996543  78889999999999984


No 26 
>PTZ00421 coronin; Provisional
Probab=90.25  E-value=0.47  Score=48.77  Aligned_cols=60  Identities=7%  Similarity=-0.068  Sum_probs=39.3

Q ss_pred             cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736           3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYR   63 (346)
Q Consensus         3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r   63 (346)
                      +.+|+|.|||..++.........-.....|...|.++.+. +....++++|+.||.|++|.
T Consensus        95 S~DgtIkIWdi~~~~~~~~~~~~l~~L~gH~~~V~~l~f~-P~~~~iLaSgs~DgtVrIWD  154 (493)
T PTZ00421         95 SEDGTIMGWGIPEEGLTQNISDPIVHLQGHTKKVGIVSFH-PSAMNVLASAGADMVVNVWD  154 (493)
T ss_pred             eCCCEEEEEecCCCccccccCcceEEecCCCCcEEEEEeC-cCCCCEEEEEeCCCEEEEEE
Confidence            5689999999976532110000000122367789999874 44456889999999999994


No 27 
>PTZ00421 coronin; Provisional
Probab=90.20  E-value=0.54  Score=48.32  Aligned_cols=52  Identities=27%  Similarity=0.317  Sum_probs=41.8

Q ss_pred             cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736           3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYR   63 (346)
Q Consensus         3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r   63 (346)
                      |.+|+|.|||.++++.+..+..       |...|.++.+-  .+..++++|+.||.|++|.
T Consensus       145 s~DgtVrIWDl~tg~~~~~l~~-------h~~~V~sla~s--pdG~lLatgs~Dg~IrIwD  196 (493)
T PTZ00421        145 GADMVVNVWDVERGKAVEVIKC-------HSDQITSLEWN--LDGSLLCTTSKDKKLNIID  196 (493)
T ss_pred             eCCCEEEEEECCCCeEEEEEcC-------CCCceEEEEEE--CCCCEEEEecCCCEEEEEE
Confidence            5689999999999987776642       44568888863  3567889999999999995


No 28 
>KOG0264|consensus
Probab=89.82  E-value=0.51  Score=46.79  Aligned_cols=57  Identities=25%  Similarity=0.320  Sum_probs=42.5

Q ss_pred             cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE--eCC
Q psy5736           3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY--RNY   65 (346)
Q Consensus         3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~--r~~   65 (346)
                      |-++.+.|||-.++ ...--.. ..   .|..++.++. +|++++.++++|++||.|.+|  |++
T Consensus       247 ~dd~~L~iwD~R~~-~~~~~~~-~~---ah~~~vn~~~-fnp~~~~ilAT~S~D~tV~LwDlRnL  305 (422)
T KOG0264|consen  247 GDDGKLMIWDTRSN-TSKPSHS-VK---AHSAEVNCVA-FNPFNEFILATGSADKTVALWDLRNL  305 (422)
T ss_pred             cCCCeEEEEEcCCC-CCCCccc-cc---ccCCceeEEE-eCCCCCceEEeccCCCcEEEeechhc
Confidence            45678899999864 1111111 12   3678899988 799999999999999999999  665


No 29 
>KOG0291|consensus
Probab=89.66  E-value=0.43  Score=50.39  Aligned_cols=51  Identities=27%  Similarity=0.476  Sum_probs=39.4

Q ss_pred             cceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeC
Q psy5736           5 FLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRN   64 (346)
Q Consensus         5 ~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~   64 (346)
                      -|++-||+|.+..-+   .-|..    |..++++|.  =.+|..+..+|.+||.|++|..
T Consensus       329 lgQLlVweWqsEsYV---lKQQg----H~~~i~~l~--YSpDgq~iaTG~eDgKVKvWn~  379 (893)
T KOG0291|consen  329 LGQLLVWEWQSESYV---LKQQG----HSDRITSLA--YSPDGQLIATGAEDGKVKVWNT  379 (893)
T ss_pred             cceEEEEEeecccee---eeccc----cccceeeEE--ECCCCcEEEeccCCCcEEEEec
Confidence            378999999876322   22333    888999988  3478889999999999999953


No 30 
>PF13512 TPR_18:  Tetratricopeptide repeat
Probab=88.15  E-value=1.6  Score=37.38  Aligned_cols=52  Identities=19%  Similarity=0.332  Sum_probs=36.0

Q ss_pred             HHHHHHHHHHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCC
Q psy5736         277 NEMFYRGIEQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGN  333 (346)
Q Consensus       277 ~~~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~  333 (346)
                      .++|.+|.+++++|++++|++.++.+...|     |..+.-.+++..|+.+|-.-|+
T Consensus        11 ~~ly~~a~~~l~~~~Y~~A~~~le~L~~ry-----P~g~ya~qAqL~l~yayy~~~~   62 (142)
T PF13512_consen   11 QELYQEAQEALQKGNYEEAIKQLEALDTRY-----PFGEYAEQAQLDLAYAYYKQGD   62 (142)
T ss_pred             HHHHHHHHHHHHhCCHHHHHHHHHHHHhcC-----CCCcccHHHHHHHHHHHHHccC
Confidence            457778888888888888888888887766     5555555555555555555443


No 31 
>PF13525 YfiO:  Outer membrane lipoprotein; PDB: 3TGO_A 3Q5M_A 2YHC_A.
Probab=87.90  E-value=2  Score=38.55  Aligned_cols=54  Identities=19%  Similarity=0.341  Sum_probs=42.1

Q ss_pred             HHHHHHHHHHHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCCc
Q psy5736         276 LNEMFYRGIEQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGNK  334 (346)
Q Consensus       276 ~~~~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~~  334 (346)
                      ...+|..|...++.|++.+|++.|++++..+     |.++....+...|+.+|-..|+.
T Consensus         5 ~~~lY~~a~~~~~~g~y~~Ai~~f~~l~~~~-----P~s~~a~~A~l~la~a~y~~~~y   58 (203)
T PF13525_consen    5 AEALYQKALEALQQGDYEEAIKLFEKLIDRY-----PNSPYAPQAQLMLAYAYYKQGDY   58 (203)
T ss_dssp             HHHHHHHHHHHHHCT-HHHHHHHHHHHHHH------TTSTTHHHHHHHHHHHHHHTT-H
T ss_pred             HHHHHHHHHHHHHCCCHHHHHHHHHHHHHHC-----CCChHHHHHHHHHHHHHHHcCCH
Confidence            4567888888899999999999999998886     77788888888888888777653


No 32 
>PTZ00420 coronin; Provisional
Probab=87.77  E-value=1.2  Score=46.59  Aligned_cols=55  Identities=16%  Similarity=0.236  Sum_probs=43.0

Q ss_pred             cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCC
Q psy5736           3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNY   65 (346)
Q Consensus         3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~   65 (346)
                      |.+|.|.+|+-.+...+..+.       .|+..|.++.+ +++++.++++|+.||.|++|..-
T Consensus        51 G~~gvI~L~~~~r~~~v~~L~-------gH~~~V~~laf-sP~~~~lLASgS~DgtIrIWDi~  105 (568)
T PTZ00420         51 GLIGAIRLENQMRKPPVIKLK-------GHTSSILDLQF-NPCFSEILASGSEDLTIRVWEIP  105 (568)
T ss_pred             CceeEEEeeecCCCceEEEEc-------CCCCCEEEEEE-cCCCCCEEEEEeCCCeEEEEECC
Confidence            678999999987766555553       26778999986 55567788999999999999643


No 33 
>KOG0294|consensus
Probab=87.74  E-value=0.67  Score=44.42  Aligned_cols=55  Identities=25%  Similarity=0.266  Sum_probs=44.8

Q ss_pred             CcccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736           2 TGLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYR   63 (346)
Q Consensus         2 ~~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r   63 (346)
                      +|-+.+|.|||..+...+-...       .|...||+|+|.-.-..+=|+.|++||.|-+|+
T Consensus        59 GssDetI~IYDm~k~~qlg~ll-------~HagsitaL~F~~~~S~shLlS~sdDG~i~iw~  113 (362)
T KOG0294|consen   59 GSSDETIHIYDMRKRKQLGILL-------SHAGSITALKFYPPLSKSHLLSGSDDGHIIIWR  113 (362)
T ss_pred             cCCCCcEEEEeccchhhhccee-------ccccceEEEEecCCcchhheeeecCCCcEEEEE
Confidence            4678999999998876665443       256789999998887777889999999999984


No 34 
>KOG0319|consensus
Probab=87.28  E-value=0.8  Score=48.20  Aligned_cols=55  Identities=22%  Similarity=0.350  Sum_probs=45.6

Q ss_pred             CcccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736           2 TGLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYR   63 (346)
Q Consensus         2 ~~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r   63 (346)
                      .|.+|.|.|||+..+.-.-+|.       .|+.-+.++.|..-..--++..|++||.+++|.
T Consensus       123 ggaD~~v~VWdi~~~~~th~fk-------G~gGvVssl~F~~~~~~~lL~sg~~D~~v~vwn  177 (775)
T KOG0319|consen  123 GGADGRVKVWDIKNGYCTHSFK-------GHGGVVSSLLFHPHWNRWLLASGATDGTVRVWN  177 (775)
T ss_pred             ccccceEEEEEeeCCEEEEEec-------CCCceEEEEEeCCccchhheeecCCCceEEEEE
Confidence            4789999999999999998884       256678888876555567888999999999994


No 35 
>KOG1188|consensus
Probab=87.18  E-value=0.69  Score=44.71  Aligned_cols=55  Identities=24%  Similarity=0.334  Sum_probs=39.8

Q ss_pred             cccceEEEEecCCCcee-eeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeC
Q psy5736           3 GLFLKEGIWDHITGSRL-ISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRN   64 (346)
Q Consensus         3 ~~~~~~~~w~~~~~~~~-~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~   64 (346)
                      +-.-.|-+|||.+.+++ -.+.   .+   |.-.||.|+|+-. +..++|+||.||.+.+|..
T Consensus       140 ~s~A~v~lwDvR~~qq~l~~~~---eS---H~DDVT~lrFHP~-~pnlLlSGSvDGLvnlfD~  195 (376)
T KOG1188|consen  140 RSDASVVLWDVRSEQQLLRQLN---ES---HNDDVTQLRFHPS-DPNLLLSGSVDGLVNLFDT  195 (376)
T ss_pred             cCceEEEEEEeccccchhhhhh---hh---ccCcceeEEecCC-CCCeEEeecccceEEeeec
Confidence            33456899999888773 3221   21   6678999998764 4556689999999999943


No 36 
>KOG0302|consensus
Probab=87.15  E-value=0.85  Score=44.69  Aligned_cols=55  Identities=20%  Similarity=0.353  Sum_probs=42.9

Q ss_pred             cccceEEEEecCC---CceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCC
Q psy5736           3 GLFLKEGIWDHIT---GSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNY   65 (346)
Q Consensus         3 ~~~~~~~~w~~~~---~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~   65 (346)
                      |=+|+++|||...   ++.+-.       ++-|+..||++.+ ++++++++++++.|.+|.+|..-
T Consensus       321 ~DdGt~~iwDLR~~~~~~pVA~-------fk~Hk~pItsieW-~p~e~s~iaasg~D~QitiWDls  378 (440)
T KOG0302|consen  321 GDDGTLSIWDLRQFKSGQPVAT-------FKYHKAPITSIEW-HPHEDSVIAASGEDNQITIWDLS  378 (440)
T ss_pred             CCCceEEEEEhhhccCCCccee-------EEeccCCeeEEEe-ccccCceEEeccCCCcEEEEEee
Confidence            4579999999742   333333       4468899999996 56889999999999999999654


No 37 
>KOG0281|consensus
Probab=86.76  E-value=0.41  Score=46.46  Aligned_cols=61  Identities=21%  Similarity=0.323  Sum_probs=42.5

Q ss_pred             ccceEEEEecCCCceee------------eeCCCCCC----------CCCCCC-cccceec-------ccc--CCCceEE
Q psy5736           4 LFLKEGIWDHITGSRLI------------SWRPDRGS----------WARNSA-RCTALTL-------LNE--HDDSLLL   51 (346)
Q Consensus         4 ~~~~~~~w~~~~~~~~~------------~~~~~~~~----------w~~~~~-~~t~l~~-------ine--~~~~lll   51 (346)
                      -+.+|.||||++|+.++            .|+|+-.-          |++..+ .||.+.+       +|.  -++.+.+
T Consensus       255 SDsTvrvWDv~tge~l~tlihHceaVLhlrf~ng~mvtcSkDrsiaVWdm~sps~it~rrVLvGHrAaVNvVdfd~kyIV  334 (499)
T KOG0281|consen  255 SDSTVRVWDVNTGEPLNTLIHHCEAVLHLRFSNGYMVTCSKDRSIAVWDMASPTDITLRRVLVGHRAAVNVVDFDDKYIV  334 (499)
T ss_pred             CCceEEEEeccCCchhhHHhhhcceeEEEEEeCCEEEEecCCceeEEEeccCchHHHHHHHHhhhhhheeeeccccceEE
Confidence            36799999999999887            45554321          766553 4555443       344  4577888


Q ss_pred             eeecCCceEEEeC
Q psy5736          52 TGYDDGSLSVYRN   64 (346)
Q Consensus        52 ~g~~dG~vri~r~   64 (346)
                      +++.|-.+++|..
T Consensus       335 sASgDRTikvW~~  347 (499)
T KOG0281|consen  335 SASGDRTIKVWST  347 (499)
T ss_pred             EecCCceEEEEec
Confidence            8999999999843


No 38 
>KOG0269|consensus
Probab=86.37  E-value=0.57  Score=49.53  Aligned_cols=52  Identities=15%  Similarity=0.341  Sum_probs=39.5

Q ss_pred             ccceEEEEecCC---CceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736           4 LFLKEGIWDHIT---GSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYR   63 (346)
Q Consensus         4 ~~~~~~~w~~~~---~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r   63 (346)
                      --|.|+|||...   .+.+..|       ..|.+-++.+.|.- ..-.++++|++||.|+.|.
T Consensus       108 ~nG~i~vWdlnk~~rnk~l~~f-------~EH~Rs~~~ldfh~-tep~iliSGSQDg~vK~~D  162 (839)
T KOG0269|consen  108 TNGVISVWDLNKSIRNKLLTVF-------NEHERSANKLDFHS-TEPNILISGSQDGTVKCWD  162 (839)
T ss_pred             CCCcEEEEecCccccchhhhHh-------hhhccceeeeeecc-CCccEEEecCCCceEEEEe
Confidence            458999999988   3444444       34778899999655 4456668999999999993


No 39 
>PTZ00420 coronin; Provisional
Probab=86.28  E-value=1.4  Score=46.21  Aligned_cols=52  Identities=15%  Similarity=0.146  Sum_probs=39.9

Q ss_pred             cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeC
Q psy5736           3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRN   64 (346)
Q Consensus         3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~   64 (346)
                      |.+|+|.|||.++++.+..+.        +...|+++.+-  .+..++++++.||.+++|..
T Consensus       145 S~DgtIrIWDl~tg~~~~~i~--------~~~~V~Slsws--pdG~lLat~s~D~~IrIwD~  196 (568)
T PTZ00420        145 GFDSFVNIWDIENEKRAFQIN--------MPKKLSSLKWN--IKGNLLSGTCVGKHMHIIDP  196 (568)
T ss_pred             eCCCeEEEEECCCCcEEEEEe--------cCCcEEEEEEC--CCCCEEEEEecCCEEEEEEC
Confidence            678999999999998776553        22468888853  35667788889999999953


No 40 
>KOG0270|consensus
Probab=86.07  E-value=1.2  Score=44.40  Aligned_cols=53  Identities=23%  Similarity=0.268  Sum_probs=43.8

Q ss_pred             CcccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736           2 TGLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY   62 (346)
Q Consensus         2 ~~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~   62 (346)
                      +|.+-+|.+||..+|+....+.       .|+.+|.+|++.+-.+ .++|+|+-||.++++
T Consensus       262 gsaD~TV~lWD~~~g~p~~s~~-------~~~k~Vq~l~wh~~~p-~~LLsGs~D~~V~l~  314 (463)
T KOG0270|consen  262 GSADKTVKLWDVDTGKPKSSIT-------HHGKKVQTLEWHPYEP-SVLLSGSYDGTVALK  314 (463)
T ss_pred             cCCCceEEEEEcCCCCcceehh-------hcCCceeEEEecCCCc-eEEEeccccceEEee
Confidence            4788999999999998777663       3678999999888544 555779999999998


No 41 
>KOG0316|consensus
Probab=85.87  E-value=1.4  Score=40.82  Aligned_cols=51  Identities=25%  Similarity=0.373  Sum_probs=42.5

Q ss_pred             cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736           3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY   62 (346)
Q Consensus         3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~   62 (346)
                      |-+..|.|||.++|+.+-+|.       -|..++..++ +|| +.++++.|+-|..+|.|
T Consensus        78 GgDk~v~vwDV~TGkv~Rr~r-------gH~aqVNtV~-fNe-esSVv~SgsfD~s~r~w  128 (307)
T KOG0316|consen   78 GGDKAVQVWDVNTGKVDRRFR-------GHLAQVNTVR-FNE-ESSVVASGSFDSSVRLW  128 (307)
T ss_pred             CCCceEEEEEcccCeeeeecc-------cccceeeEEE-ecC-cceEEEeccccceeEEE
Confidence            667899999999999887763       3778999998 464 45667889999999999


No 42 
>KOG1081|consensus
Probab=85.74  E-value=0.33  Score=49.40  Aligned_cols=50  Identities=24%  Similarity=0.291  Sum_probs=39.9

Q ss_pred             eeeeeeccccc-cCCCCCCCCeEEEE---e-CCEEEEEEeeccCCCCceeecccc
Q psy5736         136 YVGVGIYPTVS-LFNHDCYPAVTRYF---N-GKNIIVKALRPLKPKEVVAENYGL  185 (346)
Q Consensus       136 ~~g~glyp~~s-~~NHSC~PN~~~~f---~-g~~~~v~a~r~I~~Geel~~~Y~~  185 (346)
                      .+..+.++..| ++||||.||+.-.-   . ...+.+.|.+.|++|+|+|++|..
T Consensus       362 ~id~~~~~n~sr~~nh~~~~~v~~~k~~~~~~t~~~~~a~~~i~~g~e~t~~~n~  416 (463)
T KOG1081|consen  362 IIDAGPKGNYSRFLNHSCQPNVETEKWQVIGDTRVGLFAPRQIEAGEELTFNYNG  416 (463)
T ss_pred             ccccccccchhhhhcccCCCceeechhheecccccccccccccccchhhhheeec
Confidence            34556677666 57999999998653   2 367899999999999999999975


No 43 
>KOG0273|consensus
Probab=85.57  E-value=1.6  Score=43.88  Aligned_cols=60  Identities=27%  Similarity=0.431  Sum_probs=43.2

Q ss_pred             CcccceEEEEecCCCceeeeeCCCCCC-----C-----------------------------CCCCCcccceeccccCCC
Q psy5736           2 TGLFLKEGIWDHITGSRLISWRPDRGS-----W-----------------------------ARNSARCTALTLLNEHDD   47 (346)
Q Consensus         2 ~~~~~~~~~w~~~~~~~~~~~~~~~~~-----w-----------------------------~~~~~~~t~l~~ine~~~   47 (346)
                      .|.+|++.+||-.+|.....|.-...-     |                             .-|..+|.+||+ | ++.
T Consensus       294 ~~vD~ttilwd~~~g~~~q~f~~~s~~~lDVdW~~~~~F~ts~td~~i~V~kv~~~~P~~t~~GH~g~V~alk~-n-~tg  371 (524)
T KOG0273|consen  294 GGVDGTTILWDAHTGTVKQQFEFHSAPALDVDWQSNDEFATSSTDGCIHVCKVGEDRPVKTFIGHHGEVNALKW-N-PTG  371 (524)
T ss_pred             ccCCccEEEEeccCceEEEeeeeccCCccceEEecCceEeecCCCceEEEEEecCCCcceeeecccCceEEEEE-C-CCC
Confidence            367899999999888777655422110     2                             335667777773 3 568


Q ss_pred             ceEEeeecCCceEEEe
Q psy5736          48 SLLLTGYDDGSLSVYR   63 (346)
Q Consensus        48 ~lll~g~~dG~vri~r   63 (346)
                      .+|+++++||.++||.
T Consensus       372 ~LLaS~SdD~TlkiWs  387 (524)
T KOG0273|consen  372 SLLASCSDDGTLKIWS  387 (524)
T ss_pred             ceEEEecCCCeeEeee
Confidence            8999999999999994


No 44 
>KOG0276|consensus
Probab=85.53  E-value=0.96  Score=47.00  Aligned_cols=50  Identities=20%  Similarity=0.313  Sum_probs=38.8

Q ss_pred             ccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736           4 LFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY   62 (346)
Q Consensus         4 ~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~   62 (346)
                      ..|+|.|||++++..+.+|.=.+       ..|-+-|||.-  ..-.++|++|+.||+|
T Consensus        33 ynG~V~IWnyetqtmVksfeV~~-------~PvRa~kfiaR--knWiv~GsDD~~IrVf   82 (794)
T KOG0276|consen   33 YNGDVQIWNYETQTMVKSFEVSE-------VPVRAAKFIAR--KNWIVTGSDDMQIRVF   82 (794)
T ss_pred             ecCeeEEEecccceeeeeeeecc-------cchhhheeeec--cceEEEecCCceEEEE
Confidence            35999999999999999996444       34556666643  3456889999999998


No 45 
>cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto
Probab=85.01  E-value=2  Score=37.75  Aligned_cols=54  Identities=13%  Similarity=0.212  Sum_probs=40.7

Q ss_pred             cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCC
Q psy5736           3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNY   65 (346)
Q Consensus         3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~   65 (346)
                      +.+|.|.|||.++++.+..+..       |...++++.+...  ..++++++.||.+++|..-
T Consensus        70 ~~~~~i~i~~~~~~~~~~~~~~-------~~~~i~~~~~~~~--~~~~~~~~~~~~i~~~~~~  123 (289)
T cd00200          70 SSDKTIRLWDLETGECVRTLTG-------HTSYVSSVAFSPD--GRILSSSSRDKTIKVWDVE  123 (289)
T ss_pred             cCCCeEEEEEcCcccceEEEec-------cCCcEEEEEEcCC--CCEEEEecCCCeEEEEECC
Confidence            4589999999998877766642       4457888887654  5676767779999999543


No 46 
>KOG1338|consensus
Probab=83.86  E-value=0.93  Score=44.73  Aligned_cols=53  Identities=21%  Similarity=0.305  Sum_probs=45.7

Q ss_pred             ccceeeeeeccccccCCCCCC-CCeEEEEeCCEEEEEEeeccCCCCceeecccc
Q psy5736         133 KINYVGVGIYPTVSLFNHDCY-PAVTRYFNGKNIIVKALRPLKPKEVVAENYGL  185 (346)
Q Consensus       133 ~~~~~g~glyp~~s~~NHSC~-PN~~~~f~g~~~~v~a~r~I~~Geel~~~Y~~  185 (346)
                      +....|-..-|.+-++||.=. -|+.-.+.++.+.+.|.|+|++|+|++.+|+.
T Consensus       207 e~e~ngk~m~p~ad~lNhd~~k~nanl~y~~NcL~mva~r~iekgdev~n~dg~  260 (466)
T KOG1338|consen  207 EIECNGKLMTPIADFLNHDGLKANANLRYEDNCLEMVADRNIEKGDEVDNSDGL  260 (466)
T ss_pred             ccccCcccccchhhhhccchhhcccceeccCcceeeeecCCCCCcccccccccc
Confidence            345678889999999999866 56666778899999999999999999999984


No 47 
>KOG0306|consensus
Probab=83.74  E-value=1  Score=47.66  Aligned_cols=54  Identities=30%  Similarity=0.388  Sum_probs=39.5

Q ss_pred             cceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCC
Q psy5736           5 FLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNY   65 (346)
Q Consensus         5 ~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~   65 (346)
                      -.+|.+||..+|.......-.+     .+.+||++.  +.++..++.+|+.||.|++|.+-
T Consensus        42 ~E~vn~WdlRtge~~~~l~~~~-----~k~evt~l~--~~~d~l~lAVGYaDGsVqif~~~   95 (888)
T KOG0306|consen   42 LEQVNIWDLRTGEIEKKLILLK-----KKAEVTCLR--SSDDILLLAVGYADGSVQIFSLE   95 (888)
T ss_pred             cccEeEEeeecchhhhhhhhhc-----ccceEEEee--ccCCcceEEEEecCceEEeeccC
Confidence            4678999999994433332222     235899987  55667778999999999999776


No 48 
>KOG0293|consensus
Probab=82.62  E-value=1.4  Score=43.78  Aligned_cols=54  Identities=22%  Similarity=0.364  Sum_probs=42.0

Q ss_pred             ccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCC
Q psy5736           4 LFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNY   65 (346)
Q Consensus         4 ~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~   65 (346)
                      -++.|.||+-.+|..+...+ |      |...|..+. -|+.+-.|...|++||.||||..-
T Consensus       460 ED~kvyIWhr~sgkll~~Ls-G------Hs~~vNcVs-wNP~~p~m~ASasDDgtIRIWg~~  513 (519)
T KOG0293|consen  460 EDSKVYIWHRISGKLLAVLS-G------HSKTVNCVS-WNPADPEMFASASDDGTIRIWGPS  513 (519)
T ss_pred             CCceEEEEEccCCceeEeec-C------CcceeeEEe-cCCCCHHHhhccCCCCeEEEecCC
Confidence            36899999999998886553 2      444455544 588999999999999999999643


No 49 
>KOG1334|consensus
Probab=81.99  E-value=1.3  Score=44.80  Aligned_cols=57  Identities=19%  Similarity=0.304  Sum_probs=45.6

Q ss_pred             cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCC
Q psy5736           3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNY   65 (346)
Q Consensus         3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~   65 (346)
                      +-+.+|-||||.+++...+|..+      |...+---|||--.++.-+.+.+.||.+|.=...
T Consensus       161 SDD~~vv~WdW~~~~~~l~f~SG------H~~NvfQaKFiP~s~d~ti~~~s~dgqvr~s~i~  217 (559)
T KOG1334|consen  161 SDDLQVVVWDWVSGSPKLSFESG------HCNNVFQAKFIPFSGDRTIVTSSRDGQVRVSEIL  217 (559)
T ss_pred             CccceEEeehhhccCcccccccc------cccchhhhhccCCCCCcCceeccccCceeeeeec
Confidence            44678999999999999999765      5667877788888777777778889999875433


No 50 
>KOG0274|consensus
Probab=81.81  E-value=2.7  Score=43.67  Aligned_cols=50  Identities=24%  Similarity=0.388  Sum_probs=39.5

Q ss_pred             cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736           3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYR   63 (346)
Q Consensus         3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r   63 (346)
                      +.+.+|.|||+++++.++-+.-       |..-|.++..-    +.++++|+.||.|++|+
T Consensus       308 s~D~tVkVW~v~n~~~l~l~~~-------h~~~V~~v~~~----~~~lvsgs~d~~v~VW~  357 (537)
T KOG0274|consen  308 SRDNTVKVWDVTNGACLNLLRG-------HTGPVNCVQLD----EPLLVSGSYDGTVKVWD  357 (537)
T ss_pred             cCCceEEEEeccCcceEEEecc-------ccccEEEEEec----CCEEEEEecCceEEEEE
Confidence            3678999999999999987631       45557776633    78889999999999994


No 51 
>PF13432 TPR_16:  Tetratricopeptide repeat; PDB: 3CVP_A 3CVL_A 3CVQ_A 3CV0_A 2GW1_B 3CVN_A 3QKY_A 2PL2_B.
Probab=81.76  E-value=2.9  Score=29.58  Aligned_cols=46  Identities=13%  Similarity=-0.025  Sum_probs=35.3

Q ss_pred             HHHHHHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCCc
Q psy5736         281 YRGIEQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGNK  334 (346)
Q Consensus       281 ~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~~  334 (346)
                      ..|...++.|++++|++.++++++..        |.-..+...|..++...|+.
T Consensus         2 ~~a~~~~~~g~~~~A~~~~~~~l~~~--------P~~~~a~~~lg~~~~~~g~~   47 (65)
T PF13432_consen    2 ALARALYQQGDYDEAIAAFEQALKQD--------PDNPEAWYLLGRILYQQGRY   47 (65)
T ss_dssp             HHHHHHHHCTHHHHHHHHHHHHHCCS--------TTHHHHHHHHHHHHHHTT-H
T ss_pred             hHHHHHHHcCCHHHHHHHHHHHHHHC--------CCCHHHHHHHHHHHHHcCCH
Confidence            34566788899999999998888765        33567888889999888875


No 52 
>PLN00181 protein SPA1-RELATED; Provisional
Probab=81.66  E-value=2.8  Score=45.49  Aligned_cols=53  Identities=13%  Similarity=0.114  Sum_probs=41.0

Q ss_pred             cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeC
Q psy5736           3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRN   64 (346)
Q Consensus         3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~   64 (346)
                      |.+|+|.|||..++..+..+..        +..+.++.+.. ++..++++|+.||.|++|..
T Consensus       595 s~Dg~v~iWd~~~~~~~~~~~~--------~~~v~~v~~~~-~~g~~latgs~dg~I~iwD~  647 (793)
T PLN00181        595 SDDGSVKLWSINQGVSIGTIKT--------KANICCVQFPS-ESGRSLAFGSADHKVYYYDL  647 (793)
T ss_pred             cCCCEEEEEECCCCcEEEEEec--------CCCeEEEEEeC-CCCCEEEEEeCCCeEEEEEC
Confidence            5689999999999887776642        23577777643 56778899999999999953


No 53 
>COG1729 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=81.60  E-value=3  Score=39.22  Aligned_cols=52  Identities=13%  Similarity=0.218  Sum_probs=46.0

Q ss_pred             HHHHHHHHHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCCc
Q psy5736         278 EMFYRGIEQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGNK  334 (346)
Q Consensus       278 ~~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~~  334 (346)
                      .+|+.|.+.+.+|++.+|...|.++++.|     |+....-+|+..|-++|-..|+.
T Consensus       143 ~~Y~~A~~~~ksgdy~~A~~~F~~fi~~Y-----P~s~~~~nA~yWLGe~~y~qg~y  194 (262)
T COG1729         143 KLYNAALDLYKSGDYAEAEQAFQAFIKKY-----PNSTYTPNAYYWLGESLYAQGDY  194 (262)
T ss_pred             HHHHHHHHHHHcCCHHHHHHHHHHHHHcC-----CCCcccchhHHHHHHHHHhcccc
Confidence            37889999999999999999999999999     77788888888888888888775


No 54 
>KOG0263|consensus
Probab=81.38  E-value=2.9  Score=44.26  Aligned_cols=54  Identities=20%  Similarity=0.298  Sum_probs=45.3

Q ss_pred             cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCC
Q psy5736           3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNY   65 (346)
Q Consensus         3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~   65 (346)
                      +.+-+|.+||-.+|..+-.|. +      |+..|++|.+-  +.+..+.+|+.||.|.+|+.-
T Consensus       554 SsD~tVRlWDv~~G~~VRiF~-G------H~~~V~al~~S--p~Gr~LaSg~ed~~I~iWDl~  607 (707)
T KOG0263|consen  554 SSDRTVRLWDVSTGNSVRIFT-G------HKGPVTALAFS--PCGRYLASGDEDGLIKIWDLA  607 (707)
T ss_pred             CCCceEEEEEcCCCcEEEEec-C------CCCceEEEEEc--CCCceEeecccCCcEEEEEcC
Confidence            467899999999998887772 2      78899999953  588888999999999999654


No 55 
>KOG0310|consensus
Probab=81.35  E-value=1.7  Score=43.67  Aligned_cols=49  Identities=31%  Similarity=0.419  Sum_probs=38.1

Q ss_pred             ceEEEEecCCC-ceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736           6 LKEGIWDHITG-SRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYR   63 (346)
Q Consensus         6 ~~~~~w~~~~~-~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r   63 (346)
                      ..|.|||.-+| +.+..+.|       |...+|+|.+-.  +...|++|+=||.+++|.
T Consensus       217 n~vkVWDl~~G~qll~~~~~-------H~KtVTcL~l~s--~~~rLlS~sLD~~VKVfd  266 (487)
T KOG0310|consen  217 NSVKVWDLTTGGQLLTSMFN-------HNKTVTCLRLAS--DSTRLLSGSLDRHVKVFD  266 (487)
T ss_pred             CeEEEEEecCCceehhhhhc-------ccceEEEEEeec--CCceEeecccccceEEEE
Confidence            47999999966 55555544       345799999655  558889999999999996


No 56 
>KOG0271|consensus
Probab=81.26  E-value=2.3  Score=41.95  Aligned_cols=54  Identities=20%  Similarity=0.187  Sum_probs=41.6

Q ss_pred             ccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceecccc---CCCceEEeeecCCceEEEe
Q psy5736           4 LFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNE---HDDSLLLTGYDDGSLSVYR   63 (346)
Q Consensus         4 ~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine---~~~~lll~g~~dG~vri~r   63 (346)
                      .+|+|.+||=++|+.+-+      +.+.|+.-|++|.+---   ++-.++.+++-||.+|||.
T Consensus       177 ~dg~I~lwdpktg~~~g~------~l~gH~K~It~Lawep~hl~p~~r~las~skDg~vrIWd  233 (480)
T KOG0271|consen  177 KDGSIRLWDPKTGQQIGR------ALRGHKKWITALAWEPLHLVPPCRRLASSSKDGSVRIWD  233 (480)
T ss_pred             cCCeEEEecCCCCCcccc------cccCcccceeEEeecccccCCCccceecccCCCCEEEEE
Confidence            689999999999988753      34568889999987433   4455666677799999994


No 57 
>TIGR02795 tol_pal_ybgF tol-pal system protein YbgF. Members of this protein family are the product of one of seven genes regularly clustered in operons to encode the proteins of the tol-pal system, which is critical for maintaining the integrity of the bacterial outer membrane. The gene for this periplasmic protein has been designated orf2 and ybgF. All members of the seed alignment were from unique tol-pal gene regions from completed bacterial genomes. The architecture of this protein is a signal sequence, a low-complexity region usually rich in Asn and Gln, a well-conserved region with tandem repeats that resemble the tetratricopeptide (TPR) repeat, involved in protein-protein interaction.
Probab=81.12  E-value=4.3  Score=31.65  Aligned_cols=48  Identities=17%  Similarity=0.118  Sum_probs=33.7

Q ss_pred             HHHHHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCCc
Q psy5736         282 RGIEQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGNK  334 (346)
Q Consensus       282 ~a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~~  334 (346)
                      .+...+..+++++|++.+.+++..+     |.++....+...++.+|...|+.
T Consensus        45 l~~~~~~~~~~~~A~~~~~~~~~~~-----p~~~~~~~~~~~~~~~~~~~~~~   92 (119)
T TIGR02795        45 LGEAYYAQGKYADAAKAFLAVVKKY-----PKSPKAPDALLKLGMSLQELGDK   92 (119)
T ss_pred             HHHHHHhhccHHHHHHHHHHHHHHC-----CCCCcccHHHHHHHHHHHHhCCh
Confidence            3444566677777777777777654     67776677788888888877765


No 58 
>KOG0266|consensus
Probab=81.06  E-value=3.1  Score=42.21  Aligned_cols=51  Identities=24%  Similarity=0.426  Sum_probs=38.9

Q ss_pred             cccceEEEEecCCC-ceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736           3 GLFLKEGIWDHITG-SRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY   62 (346)
Q Consensus         3 ~~~~~~~~w~~~~~-~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~   62 (346)
                      +.+++|.|||-+.. .-+-.+. +      |...++++.| ++.. .+++.|+.||.||+|
T Consensus       222 s~D~tiriwd~~~~~~~~~~l~-g------H~~~v~~~~f-~p~g-~~i~Sgs~D~tvriW  273 (456)
T KOG0266|consen  222 SDDKTLRIWDLKDDGRNLKTLK-G------HSTYVTSVAF-SPDG-NLLVSGSDDGTVRIW  273 (456)
T ss_pred             cCCceEEEeeccCCCeEEEEec-C------CCCceEEEEe-cCCC-CEEEEecCCCcEEEE
Confidence            45889999999444 4444553 2      6778899884 5455 888999999999999


No 59 
>KOG0266|consensus
Probab=80.42  E-value=2.9  Score=42.34  Aligned_cols=55  Identities=24%  Similarity=0.308  Sum_probs=42.5

Q ss_pred             cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCCc
Q psy5736           3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNYA   66 (346)
Q Consensus         3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~~   66 (346)
                      +.+++|.|||.++++.+-.+.       .|+..|+.+.| + .++.++++++.||.+++|..+.
T Consensus       265 s~D~tvriWd~~~~~~~~~l~-------~hs~~is~~~f-~-~d~~~l~s~s~d~~i~vwd~~~  319 (456)
T KOG0266|consen  265 SDDGTVRIWDVRTGECVRKLK-------GHSDGISGLAF-S-PDGNLLVSASYDGTIRVWDLET  319 (456)
T ss_pred             cCCCcEEEEeccCCeEEEeee-------ccCCceEEEEE-C-CCCCEEEEcCCCccEEEEECCC
Confidence            568999999999987766553       46778998863 3 4666778888899999996653


No 60 
>KOG1332|consensus
Probab=80.39  E-value=2.3  Score=39.59  Aligned_cols=55  Identities=25%  Similarity=0.234  Sum_probs=40.6

Q ss_pred             ccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736           4 LFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYR   63 (346)
Q Consensus         4 ~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r   63 (346)
                      .+|.|.||..+.| |-+.    ..-|..|+..++++.+-=-.-.-+|+.|++||.|++++
T Consensus        78 YDgkVIiWke~~g-~w~k----~~e~~~h~~SVNsV~wapheygl~LacasSDG~vsvl~  132 (299)
T KOG1332|consen   78 YDGKVIIWKEENG-RWTK----AYEHAAHSASVNSVAWAPHEYGLLLACASSDGKVSVLT  132 (299)
T ss_pred             cCceEEEEecCCC-chhh----hhhhhhhcccceeecccccccceEEEEeeCCCcEEEEE
Confidence            5799999999988 4432    22355688889988854323367778889999999984


No 61 
>KOG0272|consensus
Probab=80.36  E-value=2.2  Score=42.40  Aligned_cols=52  Identities=15%  Similarity=0.213  Sum_probs=43.2

Q ss_pred             cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736           3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY   62 (346)
Q Consensus         3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~   62 (346)
                      |-++++.|||-....-+...    |+   |...||.+++.- ....++++++-|+.+++|
T Consensus       364 s~Dnt~kVWDLR~r~~ly~i----pA---H~nlVS~Vk~~p-~~g~fL~TasyD~t~kiW  415 (459)
T KOG0272|consen  364 SSDNTCKVWDLRMRSELYTI----PA---HSNLVSQVKYSP-QEGYFLVTASYDNTVKIW  415 (459)
T ss_pred             CCCCcEEEeeecccccceec----cc---ccchhhheEecc-cCCeEEEEcccCcceeee
Confidence            46899999999977666543    54   788899999765 678899999999999999


No 62 
>KOG2461|consensus
Probab=80.00  E-value=0.77  Score=45.81  Aligned_cols=33  Identities=24%  Similarity=0.517  Sum_probs=27.4

Q ss_pred             CeEEEEeCCEEEEEEeeccCCCCceeecccccc
Q psy5736         155 AVTRYFNGKNIIVKALRPLKPKEVVAENYGLVF  187 (346)
Q Consensus       155 N~~~~f~g~~~~v~a~r~I~~Geel~~~Y~~~~  187 (346)
                      |....-.+..+..+|+|+|++||||.+.|++.|
T Consensus       114 NL~A~Q~~~~Ifyrt~r~I~p~eELlVWY~~e~  146 (396)
T KOG2461|consen  114 NLLAFQIGENIFYRTIRDIRPNEELLVWYGSEY  146 (396)
T ss_pred             hHHHHhccCceEEEecccCCCCCeEEEEeccch
Confidence            444444578999999999999999999999864


No 63 
>PF14559 TPR_19:  Tetratricopeptide repeat; PDB: 2R5S_A 3QDN_B 3QOU_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 3FP3_A 3LCA_A ....
Probab=79.99  E-value=3.3  Score=29.43  Aligned_cols=40  Identities=25%  Similarity=0.336  Sum_probs=29.8

Q ss_pred             hccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCCc
Q psy5736         287 MNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGNK  334 (346)
Q Consensus       287 ~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~~  334 (346)
                      ++.|++++|++.+++.++..     |.+.   .+...|+.||...|+.
T Consensus         2 l~~~~~~~A~~~~~~~l~~~-----p~~~---~~~~~la~~~~~~g~~   41 (68)
T PF14559_consen    2 LKQGDYDEAIELLEKALQRN-----PDNP---EARLLLAQCYLKQGQY   41 (68)
T ss_dssp             HHTTHHHHHHHHHHHHHHHT-----TTSH---HHHHHHHHHHHHTT-H
T ss_pred             hhccCHHHHHHHHHHHHHHC-----CCCH---HHHHHHHHHHHHcCCH
Confidence            56788999999999988775     4444   4556788999888875


No 64 
>KOG0289|consensus
Probab=79.86  E-value=3.4  Score=41.28  Aligned_cols=50  Identities=20%  Similarity=0.321  Sum_probs=40.5

Q ss_pred             ccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736           4 LFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY   62 (346)
Q Consensus         4 ~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~   62 (346)
                      .+|.|.|||..++.-+..|       +.|...|++++|-  ..+=.+.++.+||.|++|
T Consensus       367 ~d~~vkiwdlks~~~~a~F-------pght~~vk~i~Fs--ENGY~Lat~add~~V~lw  416 (506)
T KOG0289|consen  367 PDGVVKIWDLKSQTNVAKF-------PGHTGPVKAISFS--ENGYWLATAADDGSVKLW  416 (506)
T ss_pred             CCceEEEEEcCCccccccC-------CCCCCceeEEEec--cCceEEEEEecCCeEEEE
Confidence            3789999999999866665       3578899999963  355667889999999999


No 65 
>PF13414 TPR_11:  TPR repeat; PDB: 2HO1_B 2FI7_B 2DBA_A 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2PL2_B 3IEG_B 2FBN_A ....
Probab=79.81  E-value=9.4  Score=27.11  Aligned_cols=49  Identities=18%  Similarity=0.154  Sum_probs=38.7

Q ss_pred             HHHHHHHHHHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCC
Q psy5736         277 NEMFYRGIEQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGN  333 (346)
Q Consensus       277 ~~~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~  333 (346)
                      ..++..|...+..+++++|+..+.+.++.-     |.|+   .+...++.+|..+|+
T Consensus         4 ~~~~~~g~~~~~~~~~~~A~~~~~~ai~~~-----p~~~---~~~~~~g~~~~~~~~   52 (69)
T PF13414_consen    4 EAWYNLGQIYFQQGDYEEAIEYFEKAIELD-----PNNA---EAYYNLGLAYMKLGK   52 (69)
T ss_dssp             HHHHHHHHHHHHTTHHHHHHHHHHHHHHHS-----TTHH---HHHHHHHHHHHHTTT
T ss_pred             HHHHHHHHHHHHcCCHHHHHHHHHHHHHcC-----CCCH---HHHHHHHHHHHHhCc
Confidence            345566777888999999999999999873     5554   477888889988883


No 66 
>KOG0292|consensus
Probab=79.51  E-value=2.5  Score=45.84  Aligned_cols=49  Identities=10%  Similarity=0.171  Sum_probs=35.6

Q ss_pred             cceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736           5 FLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY   62 (346)
Q Consensus         5 ~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~   62 (346)
                      +-||.||||.++.-++...-.|     |-  +-+-+|+  +++.+..+||-|-.||+|
T Consensus       114 DQTIrIWNwqsr~~iavltGHn-----HY--VMcAqFh--ptEDlIVSaSLDQTVRVW  162 (1202)
T KOG0292|consen  114 DQTIRIWNWQSRKCIAVLTGHN-----HY--VMCAQFH--PTEDLIVSASLDQTVRVW  162 (1202)
T ss_pred             CCeEEEEeccCCceEEEEecCc-----eE--EEeeccC--CccceEEEecccceEEEE
Confidence            5799999999998887665444     44  3333333  346666889999999999


No 67 
>cd00200 WD40 WD40 domain, found in a number of eukaryotic proteins that cover a wide variety of functions including adaptor/regulatory modules in signal transduction, pre-mRNA processing and cytoskeleton assembly; typically contains a GH dipeptide 11-24 residues from its N-terminus and the WD dipeptide at its C-terminus and is 40 residues long, hence the name WD40; between GH and WD lies a conserved core; serves as a stable propeller-like platform to which proteins can bind either stably or reversibly; forms a propeller-like structure with several blades where each blade is composed of a four-stranded anti-parallel b-sheet; instances with few detectable copies are hypothesized to form larger structures by dimerization; each WD40 sequence repeat forms the first three strands of one blade and the last strand in the next blade; the last C-terminal WD40 repeat completes the blade structure of the first WD40 repeat to create the closed ring propeller-structure; residues on the top and botto
Probab=79.26  E-value=4.5  Score=35.44  Aligned_cols=51  Identities=16%  Similarity=0.340  Sum_probs=38.8

Q ss_pred             cceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeC
Q psy5736           5 FLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRN   64 (346)
Q Consensus         5 ~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~   64 (346)
                      +|.|.+||..+++....+.       .|...++++.+...  ..++++|..||.+++|..
T Consensus       114 ~~~i~~~~~~~~~~~~~~~-------~~~~~i~~~~~~~~--~~~l~~~~~~~~i~i~d~  164 (289)
T cd00200         114 DKTIKVWDVETGKCLTTLR-------GHTDWVNSVAFSPD--GTFVASSSQDGTIKLWDL  164 (289)
T ss_pred             CCeEEEEECCCcEEEEEec-------cCCCcEEEEEEcCc--CCEEEEEcCCCcEEEEEc
Confidence            8999999999887776664       24567888886543  566666666999999954


No 68 
>PRK10866 outer membrane biogenesis protein BamD; Provisional
Probab=79.06  E-value=7.2  Score=36.14  Aligned_cols=51  Identities=10%  Similarity=0.234  Sum_probs=28.7

Q ss_pred             HHHHHHHHHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCC
Q psy5736         278 EMFYRGIEQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGN  333 (346)
Q Consensus       278 ~~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~  333 (346)
                      ..|.+|.+.++.|++++|++.|++++..+     |..+....+...|+.+|-..|+
T Consensus        34 ~~Y~~A~~~~~~g~y~~Ai~~f~~l~~~y-----P~s~~a~~a~l~la~ayy~~~~   84 (243)
T PRK10866         34 EIYATAQQKLQDGNWKQAITQLEALDNRY-----PFGPYSQQVQLDLIYAYYKNAD   84 (243)
T ss_pred             HHHHHHHHHHHCCCHHHHHHHHHHHHHhC-----CCChHHHHHHHHHHHHHHhcCC
Confidence            35556666666666776666666666543     4444444444455555544443


No 69 
>KOG0263|consensus
Probab=78.83  E-value=2.2  Score=45.04  Aligned_cols=51  Identities=22%  Similarity=0.322  Sum_probs=44.0

Q ss_pred             cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736           3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY   62 (346)
Q Consensus         3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~   62 (346)
                      +-+|.|.|||..+|.++-.+.       .|+.-|.+|.|  .-+..++.+|+.|..|++|
T Consensus       596 ~ed~~I~iWDl~~~~~v~~l~-------~Ht~ti~SlsF--S~dg~vLasgg~DnsV~lW  646 (707)
T KOG0263|consen  596 DEDGLIKIWDLANGSLVKQLK-------GHTGTIYSLSF--SRDGNVLASGGADNSVRLW  646 (707)
T ss_pred             ccCCcEEEEEcCCCcchhhhh-------cccCceeEEEE--ecCCCEEEecCCCCeEEEE
Confidence            458999999999999998774       25567899997  6788899999999999999


No 70 
>KOG0647|consensus
Probab=78.67  E-value=3  Score=39.92  Aligned_cols=55  Identities=18%  Similarity=0.247  Sum_probs=46.1

Q ss_pred             cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCC
Q psy5736           3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNY   65 (346)
Q Consensus         3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~   65 (346)
                      |-++++.+||.++++ +..+.       .|..-++.+++++.....+++|||=|..||.|..=
T Consensus        91 ~~Dk~~k~wDL~S~Q-~~~v~-------~Hd~pvkt~~wv~~~~~~cl~TGSWDKTlKfWD~R  145 (347)
T KOG0647|consen   91 GCDKQAKLWDLASGQ-VSQVA-------AHDAPVKTCHWVPGMNYQCLVTGSWDKTLKFWDTR  145 (347)
T ss_pred             ccCCceEEEEccCCC-eeeee-------ecccceeEEEEecCCCcceeEecccccceeecccC
Confidence            568999999999994 44442       37788999999999999999999999999999443


No 71 
>KOG0276|consensus
Probab=78.55  E-value=3.3  Score=43.17  Aligned_cols=53  Identities=15%  Similarity=0.250  Sum_probs=45.8

Q ss_pred             cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736           3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY   62 (346)
Q Consensus         3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~   62 (346)
                      .|++||.||...+...-..+.       -|.+-+.++.++.-.|.+++.+|++|..+++|
T Consensus       160 sLDrTVKVWslgs~~~nfTl~-------gHekGVN~Vdyy~~gdkpylIsgaDD~tiKvW  212 (794)
T KOG0276|consen  160 SLDRTVKVWSLGSPHPNFTLE-------GHEKGVNCVDYYTGGDKPYLISGADDLTIKVW  212 (794)
T ss_pred             eccccEEEEEcCCCCCceeee-------ccccCcceEEeccCCCcceEEecCCCceEEEe
Confidence            589999999999876655553       26678999999999999999999999999999


No 72 
>KOG0308|consensus
Probab=77.98  E-value=3.1  Score=43.46  Aligned_cols=53  Identities=23%  Similarity=0.435  Sum_probs=42.5

Q ss_pred             cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeC
Q psy5736           3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRN   64 (346)
Q Consensus         3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~   64 (346)
                      |..+-+.+||..+.+++-+.. +      |...|..|- +| +|+..+++|++||.||+|..
T Consensus       190 gtek~lr~wDprt~~kimkLr-G------HTdNVr~ll-~~-dDGt~~ls~sSDgtIrlWdL  242 (735)
T KOG0308|consen  190 GTEKDLRLWDPRTCKKIMKLR-G------HTDNVRVLL-VN-DDGTRLLSASSDGTIRLWDL  242 (735)
T ss_pred             CcccceEEeccccccceeeee-c------cccceEEEE-Ec-CCCCeEeecCCCceEEeeec
Confidence            667889999999999998775 2      667788776 33 56777799999999999943


No 73 
>smart00320 WD40 WD40 repeats. Note that these repeats are permuted with respect to the structural repeats (blades) of the beta propeller domain.
Probab=77.94  E-value=2.6  Score=24.32  Aligned_cols=30  Identities=17%  Similarity=0.419  Sum_probs=22.5

Q ss_pred             CCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736          32 NSARCTALTLLNEHDDSLLLTGYDDGSLSVYR   63 (346)
Q Consensus        32 ~~~~~t~l~~ine~~~~lll~g~~dG~vri~r   63 (346)
                      |...++++.+....  .++++|+.||.+++|.
T Consensus        11 ~~~~i~~~~~~~~~--~~~~~~~~d~~~~~~~   40 (40)
T smart00320       11 HTGPVTSVAFSPDG--KYLASASDDGTIKLWD   40 (40)
T ss_pred             cCCceeEEEECCCC--CEEEEecCCCeEEEcC
Confidence            45578888865433  6778899999999873


No 74 
>KOG1310|consensus
Probab=77.84  E-value=3.9  Score=42.16  Aligned_cols=55  Identities=15%  Similarity=0.223  Sum_probs=47.9

Q ss_pred             cceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCC
Q psy5736           5 FLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNY   65 (346)
Q Consensus         5 ~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~   65 (346)
                      +-++.|||.-+-+.+-+...+      |...|.+.||+-...+.++++|.-|..|++|..=
T Consensus        71 D~r~ivWd~~~~KllhsI~Tg------HtaNIFsvKFvP~tnnriv~sgAgDk~i~lfdl~  125 (758)
T KOG1310|consen   71 DTRLIVWDPFEYKLLHSISTG------HTANIFSVKFVPYTNNRIVLSGAGDKLIKLFDLD  125 (758)
T ss_pred             cceEEeecchhcceeeeeecc------cccceeEEeeeccCCCeEEEeccCcceEEEEecc
Confidence            457899999988888888766      5689999999999999999999999999999543


No 75 
>KOG0278|consensus
Probab=77.30  E-value=4.9  Score=37.70  Aligned_cols=53  Identities=21%  Similarity=0.343  Sum_probs=41.6

Q ss_pred             cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736           3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYR   63 (346)
Q Consensus         3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r   63 (346)
                      |-++.+..+|+.+|.-+-+|.-|      |-..+-+++|  .+|..+...||.||.||+|.
T Consensus       243 ged~~~~kfDy~TgeEi~~~nkg------h~gpVhcVrF--SPdGE~yAsGSEDGTirlWQ  295 (334)
T KOG0278|consen  243 GEDFKVYKFDYNTGEEIGSYNKG------HFGPVHCVRF--SPDGELYASGSEDGTIRLWQ  295 (334)
T ss_pred             CcceEEEEEeccCCceeeecccC------CCCceEEEEE--CCCCceeeccCCCceEEEEE
Confidence            56788999999999888877333      4456888874  36666778899999999994


No 76 
>PRK10803 tol-pal system protein YbgF; Provisional
Probab=76.44  E-value=7.9  Score=36.42  Aligned_cols=51  Identities=18%  Similarity=0.148  Sum_probs=31.3

Q ss_pred             HHHHHHHH-hccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCCc
Q psy5736         279 MFYRGIEQ-MNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGNK  334 (346)
Q Consensus       279 ~~~~a~~~-~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~~  334 (346)
                      .|..|..+ ++.+++++|+..|.++++.|     |.+.....++..|..+|-..|+.
T Consensus       145 ~Y~~A~~l~~~~~~y~~Ai~af~~fl~~y-----P~s~~a~~A~y~LG~~y~~~g~~  196 (263)
T PRK10803        145 DYNAAIALVQDKSRQDDAIVAFQNFVKKY-----PDSTYQPNANYWLGQLNYNKGKK  196 (263)
T ss_pred             HHHHHHHHHHhcCCHHHHHHHHHHHHHHC-----cCCcchHHHHHHHHHHHHHcCCH
Confidence            45555554 44567777777777777766     44555555566666666555543


No 77 
>KOG0273|consensus
Probab=76.36  E-value=4.4  Score=40.90  Aligned_cols=51  Identities=14%  Similarity=0.307  Sum_probs=44.2

Q ss_pred             cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736           3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY   62 (346)
Q Consensus         3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~   62 (346)
                      +.+++|.+||-+++..+..|.       +|...+-+|+|.  ++..++++|+-||.|-+|
T Consensus       429 s~dstV~lwdv~~gv~i~~f~-------kH~~pVysvafS--~~g~ylAsGs~dg~V~iw  479 (524)
T KOG0273|consen  429 SFDSTVKLWDVESGVPIHTLM-------KHQEPVYSVAFS--PNGRYLASGSLDGCVHIW  479 (524)
T ss_pred             ecCCeEEEEEccCCceeEeec-------cCCCceEEEEec--CCCcEEEecCCCCeeEec
Confidence            467899999999999999883       367789999965  578999999999999888


No 78 
>KOG0277|consensus
Probab=75.94  E-value=4.8  Score=37.77  Aligned_cols=56  Identities=16%  Similarity=0.167  Sum_probs=42.9

Q ss_pred             ccceEEEEecCC-CceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE--eCCccc
Q psy5736           4 LFLKEGIWDHIT-GSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY--RNYASH   68 (346)
Q Consensus         4 ~~~~~~~w~~~~-~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~--r~~~sh   68 (346)
                      -+|+..+||... |+.++ +.       .|..||.++.+ |..++.++.||+.|+.||.|  |++.-+
T Consensus       168 gd~~l~lwdvr~~gk~~~-i~-------ah~~Eil~cdw-~ky~~~vl~Tg~vd~~vr~wDir~~r~p  226 (311)
T KOG0277|consen  168 GDGTLRLWDVRSPGKFMS-IE-------AHNSEILCCDW-SKYNHNVLATGGVDNLVRGWDIRNLRTP  226 (311)
T ss_pred             CCceEEEEEecCCCceeE-EE-------eccceeEeecc-cccCCcEEEecCCCceEEEEehhhcccc
Confidence            478999999854 45554 42       36679998885 66889999999999999999  666433


No 79 
>PF13174 TPR_6:  Tetratricopeptide repeat; PDB: 3QKY_A 2XEV_A 3URZ_B 2Q7F_A.
Probab=75.64  E-value=5  Score=23.98  Aligned_cols=29  Identities=10%  Similarity=0.168  Sum_probs=22.4

Q ss_pred             HHHHHHHHhccccHHHHHHHHHHHHHHhh
Q psy5736         279 MFYRGIEQMNTSCFREAVESLTKFSDQIH  307 (346)
Q Consensus       279 ~~~~a~~~~~~~~~~~ai~~l~~~l~~~~  307 (346)
                      ++..|.-....|++++|++.++++++.|-
T Consensus         3 ~~~~a~~~~~~g~~~~A~~~~~~~~~~~P   31 (33)
T PF13174_consen    3 LYRLARCYYKLGDYDEAIEYFQRLIKRYP   31 (33)
T ss_dssp             HHHHHHHHHHHCHHHHHHHHHHHHHHHST
T ss_pred             HHHHHHHHHHccCHHHHHHHHHHHHHHCc
Confidence            34455555667899999999999999873


No 80 
>KOG0275|consensus
Probab=75.46  E-value=1.6  Score=42.12  Aligned_cols=60  Identities=17%  Similarity=0.320  Sum_probs=39.5

Q ss_pred             cccceEEEEecCCCceee--eeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCC
Q psy5736           3 GLFLKEGIWDHITGSRLI--SWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNY   65 (346)
Q Consensus         3 ~~~~~~~~w~~~~~~~~~--~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~   65 (346)
                      ..+|-|-|||+.+|+.--  .|+-|. ++-++..-+-+|.|  .-|..|+.+|++||.|++||--
T Consensus       232 SvDGFiEVWny~~GKlrKDLkYQAqd-~fMMmd~aVlci~F--SRDsEMlAsGsqDGkIKvWri~  293 (508)
T KOG0275|consen  232 SVDGFIEVWNYTTGKLRKDLKYQAQD-NFMMMDDAVLCISF--SRDSEMLASGSQDGKIKVWRIE  293 (508)
T ss_pred             cccceeeeehhccchhhhhhhhhhhc-ceeecccceEEEee--cccHHHhhccCcCCcEEEEEEe
Confidence            368899999999994322  343332 23344444444443  2567788999999999999643


No 81 
>KOG1034|consensus
Probab=75.46  E-value=3.7  Score=39.77  Aligned_cols=53  Identities=15%  Similarity=0.265  Sum_probs=43.0

Q ss_pred             CcccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736           2 TGLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY   62 (346)
Q Consensus         2 ~~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~   62 (346)
                      +|.-|.|.|-|..+++-..++.       -|+..|.+||+.- +.-.++|+||.|..||+|
T Consensus       111 ~G~~GvIrVid~~~~~~~~~~~-------ghG~sINeik~~p-~~~qlvls~SkD~svRlw  163 (385)
T KOG1034|consen  111 GGYLGVIRVIDVVSGQCSKNYR-------GHGGSINEIKFHP-DRPQLVLSASKDHSVRLW  163 (385)
T ss_pred             ecceeEEEEEecchhhhcccee-------ccCccchhhhcCC-CCCcEEEEecCCceEEEE
Confidence            5778999999999987666552       2788899998653 556688999999999998


No 82 
>KOG0272|consensus
Probab=75.18  E-value=5.7  Score=39.55  Aligned_cols=57  Identities=18%  Similarity=0.190  Sum_probs=47.1

Q ss_pred             cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCCc
Q psy5736           3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNYA   66 (346)
Q Consensus         3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~~   66 (346)
                      +..|.+.||+-.+...++.|.       -|..++.++.|.-.+++.-+.+|+.||.+++|+.-.
T Consensus       194 swsG~~kvW~~~~~~~~~~l~-------gH~~~v~~~~fhP~~~~~~lat~s~Dgtvklw~~~~  250 (459)
T KOG0272|consen  194 SWSGLVKVWSVPQCNLLQTLR-------GHTSRVGAAVFHPVDSDLNLATASADGTVKLWKLSQ  250 (459)
T ss_pred             ecCCceeEeecCCcceeEEEe-------ccccceeeEEEccCCCccceeeeccCCceeeeccCC
Confidence            468999999999999998884       378899999988775555568899999999996543


No 83 
>KOG0265|consensus
Probab=75.04  E-value=4.4  Score=38.72  Aligned_cols=52  Identities=25%  Similarity=0.417  Sum_probs=36.9

Q ss_pred             cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736           3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY   62 (346)
Q Consensus         3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~   62 (346)
                      |-+-+|.+||-++|++.-+|.-       |..=+.++. -.--.-.|+-+|++||.+|+|
T Consensus       109 gtDk~v~~wD~~tG~~~rk~k~-------h~~~vNs~~-p~rrg~~lv~SgsdD~t~kl~  160 (338)
T KOG0265|consen  109 GTDKTVRGWDAETGKRIRKHKG-------HTSFVNSLD-PSRRGPQLVCSGSDDGTLKLW  160 (338)
T ss_pred             cCCceEEEEecccceeeehhcc-------ccceeeecC-ccccCCeEEEecCCCceEEEE
Confidence            6688999999999998876532       233234444 222446677788899999999


No 84 
>KOG0274|consensus
Probab=74.63  E-value=5.5  Score=41.47  Aligned_cols=54  Identities=24%  Similarity=0.313  Sum_probs=42.4

Q ss_pred             cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCCcc
Q psy5736           3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNYAS   67 (346)
Q Consensus         3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~~s   67 (346)
                      .++|+|.+||-+++..+......      |...++++.+.    +...+.+++||.+++| ++.+
T Consensus       430 ~aD~~Ik~WD~~~~~~~~~~~~~------~~~~v~~l~~~----~~~il~s~~~~~~~l~-dl~~  483 (537)
T KOG0274|consen  430 SADGTIKLWDAEEGECLRTLEGR------HVGGVSALALG----KEEILCSSDDGSVKLW-DLRS  483 (537)
T ss_pred             cccccEEEeecccCceeeeeccC------CcccEEEeecC----cceEEEEecCCeeEEE-eccc
Confidence            57899999999999988866533      34678888866    6667889999999999 4433


No 85 
>TIGR03302 OM_YfiO outer membrane assembly lipoprotein YfiO. Members of this protein family include YfiO, a near-essential protein of the outer membrane, part of a complex involved in protein insertion into the bacterial outer membrane. Many proteins in this family are annotated as ComL, based on the involvement of this protein in natural transformation with exogenous DNA in Neisseria gonorrhoeae. This protein family shows sequence similarity to, but is distinct from, the tol-pal system protein YbgF (TIGR02795).
Probab=74.41  E-value=5.7  Score=35.70  Aligned_cols=50  Identities=16%  Similarity=0.299  Sum_probs=27.4

Q ss_pred             HHHHHHHHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCC
Q psy5736         279 MFYRGIEQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGN  333 (346)
Q Consensus       279 ~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~  333 (346)
                      .+..+...+..|++++|+..++++++.+     |.++....+...++.+|...|+
T Consensus        36 ~~~~g~~~~~~~~~~~A~~~~~~~~~~~-----p~~~~~~~a~~~la~~~~~~~~   85 (235)
T TIGR03302        36 LYEEAKEALDSGDYTEAIKYFEALESRY-----PFSPYAEQAQLDLAYAYYKSGD   85 (235)
T ss_pred             HHHHHHHHHHcCCHHHHHHHHHHHHHhC-----CCchhHHHHHHHHHHHHHhcCC
Confidence            3444555555666666666666655543     4445445555555555555544


No 86 
>KOG1517|consensus
Probab=74.24  E-value=6.3  Score=43.64  Aligned_cols=69  Identities=25%  Similarity=0.356  Sum_probs=45.4

Q ss_pred             CcccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCCccccchhhhhhh
Q psy5736           2 TGLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNYASHEHRLVTSFQ   77 (346)
Q Consensus         2 ~~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~~sh~~~Lv~a~r   77 (346)
                      ||--..|.|||-++-..+-...-+.      .+-+|+|-- +.+..++.++|+.||.||+|..=+.....++..+|
T Consensus      1183 tGd~r~IRIWDa~~E~~~~diP~~s------~t~vTaLS~-~~~~gn~i~AGfaDGsvRvyD~R~a~~ds~v~~~R 1251 (1387)
T KOG1517|consen 1183 TGDVRSIRIWDAHKEQVVADIPYGS------STLVTALSA-DLVHGNIIAAGFADGSVRVYDRRMAPPDSLVCVYR 1251 (1387)
T ss_pred             cCCeeEEEEEecccceeEeecccCC------Cccceeecc-cccCCceEEEeecCCceEEeecccCCccccceeec
Confidence            3445689999999887777654331      344677763 33447888999999999999433223334555554


No 87 
>PF10516 SHNi-TPR:  SHNi-TPR;  InterPro: IPR019544 The tetratrico peptide repeat region (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. The X-ray structure of a domain containing three TPRs from protein phosphatase 5 revealed that TPR adopts a helix-turn-helix arrangement, with adjacent TPR motifs packing in a parallel fashion, resulting in a spiral of repeating anti-parallel alpha-helices []. The two helices are denoted helix A and helix B. The packing angle between helix A and helix B is ~24 degrees within a single TPR and generates a right-handed superhelical shape. Helix A interacts with helix B and with helix A' of the next TPR. Two protein surfaces are generated: the inner concave surface is contributed to mainly by residue on helices A, and the other surface presents residues from both helices A and B.  This entry represents SHNi-TPR (Sim3-Hif1-NASP interrupted TPR), a sequence that is an interrupted form of TPR repeat []. 
Probab=73.49  E-value=4.6  Score=26.44  Aligned_cols=27  Identities=26%  Similarity=0.429  Sum_probs=24.1

Q ss_pred             HhccccHHHHHHHHHHHHHHhhcccCC
Q psy5736         286 QMNTSCFREAVESLTKFSDQIHELVVP  312 (346)
Q Consensus       286 ~~~~~~~~~ai~~l~~~l~~~~~~l~p  312 (346)
                      .++++++++|++-|+++|+...++++|
T Consensus        11 sle~e~f~qA~~D~~~aL~i~~~l~~~   37 (38)
T PF10516_consen   11 SLENENFEQAIEDYEKALEIQEELLPP   37 (38)
T ss_pred             HHHhccHHHHHHHHHHHHHHHHHhcCC
Confidence            467889999999999999999998876


No 88 
>PF09986 DUF2225:  Uncharacterized protein conserved in bacteria (DUF2225);  InterPro: IPR018708 This conserved bacterial family has no known function.
Probab=73.45  E-value=10  Score=34.51  Aligned_cols=80  Identities=11%  Similarity=0.041  Sum_probs=47.4

Q ss_pred             cccccccCCcCChHHHHH----HHHHHHHHHHHHH---HHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHH
Q psy5736         254 EKKCESCNSTSDLTEIKT----KLSELNEMFYRGI---EQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRN  326 (346)
Q Consensus       254 ~~~C~~C~~~~~~~~~~~----~l~~~~~~~~~a~---~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~  326 (346)
                      .|+|+.||-....+....    ..+.+.+....-.   +.-..-++++|++.|.-.|-.+ .+...++.....+...|+.
T Consensus        48 V~vCP~CgyA~~~~~F~~l~~~~~~~i~~~i~~~~~~~~~~~~Rt~~~ai~~YkLAll~~-~~~~~~~s~~A~l~LrlAW  126 (214)
T PF09986_consen   48 VWVCPHCGYAAFEEDFEKLSPEQKEKIKENISSRWKPRDFSGERTLEEAIESYKLALLCA-QIKKEKPSKKAGLCLRLAW  126 (214)
T ss_pred             EEECCCCCCcccccccccCCHHHHHHHHHHHHhhcccCCCCCCCCHHHHHHHHHHHHHHH-HHhCCCHHHHHHHHHHHHH
Confidence            599999998654332111    1111111111111   1111234677887776666553 4456777789999999999


Q ss_pred             HHHHhCCc
Q psy5736         327 CWSLAGNK  334 (346)
Q Consensus       327 ~y~~~G~~  334 (346)
                      +|+..|+.
T Consensus       127 lyR~~~~~  134 (214)
T PF09986_consen  127 LYRDLGDE  134 (214)
T ss_pred             HhhccCCH
Confidence            99999984


No 89 
>cd02681 MIT_calpain7_1 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in the nuclear thiol protease PalBH. The molecular function of the MIT domain is unclear.
Probab=73.33  E-value=24  Score=26.76  Aligned_cols=57  Identities=9%  Similarity=-0.042  Sum_probs=39.2

Q ss_pred             HHHHHHHHHHHHhccccHHHHHHHHHHHHHHhhcc--cC-CCcHhHHHHHHHHHHHHHHh
Q psy5736         275 ELNEMFYRGIEQMNTSCFREAVESLTKFSDQIHEL--VV-PPYKLASLAHEALRNCWSLA  331 (346)
Q Consensus       275 ~~~~~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~--l~-p~h~~~~~~~~~L~~~y~~~  331 (346)
                      .+.++..+|++.=+.|++++|+.+|.+.++.+...  +. +|-......+..+...+.+.
T Consensus         5 ~Ai~~a~~Ave~D~~g~y~eA~~~Y~~aie~l~~~~~~~~~n~~~k~~ir~K~~eYl~RA   64 (76)
T cd02681           5 DAVQFARLAVQRDQEGRYSEAVFYYKEAAQLLIYAEMAGTLNDSHLKTIQEKSNEYLDRA   64 (76)
T ss_pred             HHHHHHHHHHHHHHccCHHHHHHHHHHHHHHHHHHHHhcCCChHHHHHHHHHHHHHHHHH
Confidence            45566778888878899999999999999998874  22 22222333466666665554


No 90 
>KOG0275|consensus
Probab=72.41  E-value=3.3  Score=39.98  Aligned_cols=51  Identities=24%  Similarity=0.300  Sum_probs=37.8

Q ss_pred             ccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736           4 LFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY   62 (346)
Q Consensus         4 ~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~   62 (346)
                      .+|.|.||..++|.-+-+|.  +.    |...+|+|.|-.  |.+-+|.++-|-.+|+-
T Consensus       283 qDGkIKvWri~tG~ClRrFd--rA----HtkGvt~l~FSr--D~SqiLS~sfD~tvRiH  333 (508)
T KOG0275|consen  283 QDGKIKVWRIETGQCLRRFD--RA----HTKGVTCLSFSR--DNSQILSASFDQTVRIH  333 (508)
T ss_pred             cCCcEEEEEEecchHHHHhh--hh----hccCeeEEEEcc--CcchhhcccccceEEEe
Confidence            58999999999999888884  22    566788888643  34444667778878774


No 91 
>KOG2055|consensus
Probab=72.28  E-value=7.4  Score=39.23  Aligned_cols=65  Identities=23%  Similarity=0.390  Sum_probs=46.8

Q ss_pred             cccceEEEEecCCCceeeeeCCCCCCCCCCCCc-ccceeccccCCCceEEeeecCCceEEEeC---Cccccchhhhhh
Q psy5736           3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSAR-CTALTLLNEHDDSLLLTGYDDGSLSVYRN---YASHEHRLVTSF   76 (346)
Q Consensus         3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~-~t~l~~ine~~~~lll~g~~dG~vri~r~---~~sh~~~Lv~a~   76 (346)
                      |-.|.|.|||..+..-+.+|..+..   .|++. |++      .+.+++.+|++.|.|.||..   +.+..|+-+.++
T Consensus       363 ~~~GeV~v~nl~~~~~~~rf~D~G~---v~gts~~~S------~ng~ylA~GS~~GiVNIYd~~s~~~s~~PkPik~~  431 (514)
T KOG2055|consen  363 GGTGEVYVWNLRQNSCLHRFVDDGS---VHGTSLCIS------LNGSYLATGSDSGIVNIYDGNSCFASTNPKPIKTV  431 (514)
T ss_pred             cCCceEEEEecCCcceEEEEeecCc---cceeeeeec------CCCceEEeccCcceEEEeccchhhccCCCCchhhh
Confidence            5578999999999988889987764   34432 233      34569999999999999952   334556666554


No 92 
>KOG0308|consensus
Probab=72.01  E-value=6  Score=41.43  Aligned_cols=59  Identities=20%  Similarity=0.298  Sum_probs=39.0

Q ss_pred             CcccceEEEEecCCC--ceeeeeCCCCCCCCC--CCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736           2 TGLFLKEGIWDHITG--SRLISWRPDRGSWAR--NSARCTALTLLNEHDDSLLLTGYDDGSLSVY   62 (346)
Q Consensus         2 ~~~~~~~~~w~~~~~--~~~~~~~~~~~~w~~--~~~~~t~l~~ine~~~~lll~g~~dG~vri~   62 (346)
                      .||++.|-|||.+++  +.+-+|.|-..+...  ++..|=+|. .| ++..+.+.|+..+.+|+|
T Consensus       136 gGLD~~IflWDin~~~~~l~~s~n~~t~~sl~sG~k~siYSLA-~N-~t~t~ivsGgtek~lr~w  198 (735)
T KOG0308|consen  136 GGLDRKIFLWDINTGTATLVASFNNVTVNSLGSGPKDSIYSLA-MN-QTGTIIVSGGTEKDLRLW  198 (735)
T ss_pred             cCCCccEEEEEccCcchhhhhhccccccccCCCCCccceeeee-cC-CcceEEEecCcccceEEe
Confidence            589999999999998  445555543322111  233344443 23 445788999999999999


No 93 
>KOG0271|consensus
Probab=71.70  E-value=7.2  Score=38.55  Aligned_cols=54  Identities=20%  Similarity=0.323  Sum_probs=40.7

Q ss_pred             cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCCc
Q psy5736           3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNYA   66 (346)
Q Consensus         3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~~   66 (346)
                      +-+|+|.|||...++-+-. ..+      |...+|++++=   .+.++-+|+.|+.|++|+...
T Consensus       224 skDg~vrIWd~~~~~~~~~-lsg------HT~~VTCvrwG---G~gliySgS~DrtIkvw~a~d  277 (480)
T KOG0271|consen  224 SKDGSVRIWDTKLGTCVRT-LSG------HTASVTCVRWG---GEGLIYSGSQDRTIKVWRALD  277 (480)
T ss_pred             cCCCCEEEEEccCceEEEE-ecc------CccceEEEEEc---CCceEEecCCCceEEEEEccc
Confidence            4589999999987754432 222      66789998853   367888999999999997664


No 94 
>KOG1840|consensus
Probab=70.62  E-value=9.3  Score=39.48  Aligned_cols=54  Identities=11%  Similarity=-0.025  Sum_probs=48.0

Q ss_pred             HHHHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCCccc
Q psy5736         283 GIEQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGNKWV  336 (346)
Q Consensus       283 a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~~~~  336 (346)
                      |.-.+..+++.+|+..|++.+..+..++.+.|+.+..+...|+.+|...|+.-.
T Consensus       248 a~~y~~~~k~~eAv~ly~~AL~i~e~~~G~~h~~va~~l~nLa~ly~~~GKf~E  301 (508)
T KOG1840|consen  248 ALVYRSLGKYDEAVNLYEEALTIREEVFGEDHPAVAATLNNLAVLYYKQGKFAE  301 (508)
T ss_pred             HHHHHHhccHHHHHHHHHHHHHHHHHhcCCCCHHHHHHHHHHHHHHhccCChHH
Confidence            334567788999999999999999999999999999999999999999887643


No 95 
>COG4105 ComL DNA uptake lipoprotein [General function prediction only]
Probab=70.26  E-value=13  Score=34.94  Aligned_cols=52  Identities=15%  Similarity=0.332  Sum_probs=38.6

Q ss_pred             HHHHHHHHHHHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhC
Q psy5736         276 LNEMFYRGIEQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAG  332 (346)
Q Consensus       276 ~~~~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G  332 (346)
                      +..+|+++++.++.|++++|++.|+++...+     |..+...+++..|+..+-..|
T Consensus        34 ~~~LY~~g~~~L~~gn~~~A~~~fe~l~~~~-----p~s~~~~qa~l~l~yA~Yk~~   85 (254)
T COG4105          34 ASELYNEGLTELQKGNYEEAIKYFEALDSRH-----PFSPYSEQAQLDLAYAYYKNG   85 (254)
T ss_pred             HHHHHHHHHHHHhcCCHHHHHHHHHHHHHcC-----CCCcccHHHHHHHHHHHHhcc
Confidence            4568888999999999999999998887554     666666666666666655544


No 96 
>KOG0268|consensus
Probab=70.25  E-value=5.5  Score=39.07  Aligned_cols=53  Identities=19%  Similarity=0.252  Sum_probs=42.8

Q ss_pred             ceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCCc
Q psy5736           6 LKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNYA   66 (346)
Q Consensus         6 ~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~~   66 (346)
                      .+|.|||.+....+.+|+-+-+       .|.+++ +|+-.-+++.++++|+.|.+|..=+
T Consensus       167 e~i~IWD~~R~~Pv~smswG~D-------ti~svk-fNpvETsILas~~sDrsIvLyD~R~  219 (433)
T KOG0268|consen  167 EQIDIWDEQRDNPVSSMSWGAD-------SISSVK-FNPVETSILASCASDRSIVLYDLRQ  219 (433)
T ss_pred             ceeeecccccCCccceeecCCC-------ceeEEe-cCCCcchheeeeccCCceEEEeccc
Confidence            4699999999999988765543       477777 5888888889999999999995543


No 97 
>KOG0315|consensus
Probab=70.19  E-value=9.7  Score=35.72  Aligned_cols=62  Identities=18%  Similarity=0.294  Sum_probs=38.5

Q ss_pred             cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCCcc-ccchhhh
Q psy5736           3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNYAS-HEHRLVT   74 (346)
Q Consensus         3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~~s-h~~~Lv~   74 (346)
                      |-+|++.|||..+  ..+    |+-  ..+.+.++++- +++ ...-|++|-++|.|++|..-++ +..++++
T Consensus       102 seDgt~kIWdlR~--~~~----qR~--~~~~spVn~vv-lhp-nQteLis~dqsg~irvWDl~~~~c~~~liP  164 (311)
T KOG0315|consen  102 SEDGTVKIWDLRS--LSC----QRN--YQHNSPVNTVV-LHP-NQTELISGDQSGNIRVWDLGENSCTHELIP  164 (311)
T ss_pred             CCCceEEEEeccC--ccc----chh--ccCCCCcceEE-ecC-CcceEEeecCCCcEEEEEccCCccccccCC
Confidence            5689999999986  222    221  02345677655 343 3445578999999999965543 3344444


No 98 
>KOG0305|consensus
Probab=70.13  E-value=6.7  Score=40.16  Aligned_cols=54  Identities=17%  Similarity=0.278  Sum_probs=46.6

Q ss_pred             ccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCC
Q psy5736           4 LFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNY   65 (346)
Q Consensus         4 ~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~   65 (346)
                      -++.|.+||..+|+++.....+        .-|++|.+.-..-+-+...|++++.|++|+--
T Consensus       366 ~D~~i~fwn~~~g~~i~~vdtg--------sQVcsL~Wsk~~kEi~sthG~s~n~i~lw~~p  419 (484)
T KOG0305|consen  366 ADRCIKFWNTNTGARIDSVDTG--------SQVCSLIWSKKYKELLSTHGYSENQITLWKYP  419 (484)
T ss_pred             cccEEEEEEcCCCcEecccccC--------CceeeEEEcCCCCEEEEecCCCCCcEEEEecc
Confidence            4789999999999999876544        46999999998889999999999999999754


No 99 
>TIGR02795 tol_pal_ybgF tol-pal system protein YbgF. Members of this protein family are the product of one of seven genes regularly clustered in operons to encode the proteins of the tol-pal system, which is critical for maintaining the integrity of the bacterial outer membrane. The gene for this periplasmic protein has been designated orf2 and ybgF. All members of the seed alignment were from unique tol-pal gene regions from completed bacterial genomes. The architecture of this protein is a signal sequence, a low-complexity region usually rich in Asn and Gln, a well-conserved region with tandem repeats that resemble the tetratricopeptide (TPR) repeat, involved in protein-protein interaction.
Probab=69.57  E-value=16  Score=28.37  Aligned_cols=27  Identities=7%  Similarity=0.321  Sum_probs=14.8

Q ss_pred             HHHHHHHHhccccHHHHHHHHHHHHHH
Q psy5736         279 MFYRGIEQMNTSCFREAVESLTKFSDQ  305 (346)
Q Consensus       279 ~~~~a~~~~~~~~~~~ai~~l~~~l~~  305 (346)
                      .+..+...+..+++++|++.+.++++.
T Consensus         5 ~~~~~~~~~~~~~~~~A~~~~~~~~~~   31 (119)
T TIGR02795         5 YYDAALLVLKAGDYADAIQAFQAFLKK   31 (119)
T ss_pred             HHHHHHHHHHcCCHHHHHHHHHHHHHH
Confidence            344444455556666666666666554


No 100
>KOG0646|consensus
Probab=69.37  E-value=7.4  Score=39.16  Aligned_cols=61  Identities=10%  Similarity=0.027  Sum_probs=45.8

Q ss_pred             cccceEEEEecCCCceeeeeCCCCCC----CCCCCCcccceeccccCCCceEEeeecCCceEEEeCC
Q psy5736           3 GLFLKEGIWDHITGSRLISWRPDRGS----WARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNY   65 (346)
Q Consensus         3 ~~~~~~~~w~~~~~~~~~~~~~~~~~----w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~   65 (346)
                      |-||.|.||+.-.-.  .-++.+.+.    |..|+.-||+|+.=+-...+.+.+.+.|..+++|..-
T Consensus       142 skDg~V~vW~l~~lv--~a~~~~~~~p~~~f~~HtlsITDl~ig~Gg~~~rl~TaS~D~t~k~wdlS  206 (476)
T KOG0646|consen  142 SKDGAVLVWLLTDLV--SADNDHSVKPLHIFSDHTLSITDLQIGSGGTNARLYTASEDRTIKLWDLS  206 (476)
T ss_pred             CCCccEEEEEEEeec--ccccCCCccceeeeccCcceeEEEEecCCCccceEEEecCCceEEEEEec
Confidence            678999999875431  111111222    8899999999997666789999999999999999543


No 101
>KOG0296|consensus
Probab=69.29  E-value=11  Score=37.09  Aligned_cols=53  Identities=21%  Similarity=0.345  Sum_probs=40.4

Q ss_pred             cceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCCc
Q psy5736           5 FLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNYA   66 (346)
Q Consensus         5 ~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~~   66 (346)
                      +|.|=+|.-.+++....|+-       |..+||+=+|++.  ...+++|++||.|++|....
T Consensus       169 DGsvWmw~ip~~~~~kv~~G-------h~~~ct~G~f~pd--GKr~~tgy~dgti~~Wn~kt  221 (399)
T KOG0296|consen  169 DGSVWMWQIPSQALCKVMSG-------HNSPCTCGEFIPD--GKRILTGYDDGTIIVWNPKT  221 (399)
T ss_pred             CCcEEEEECCCcceeeEecC-------CCCCcccccccCC--CceEEEEecCceEEEEecCC
Confidence            57777777777655555542       4578999888874  88889999999999997664


No 102
>cd02678 MIT_VPS4 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in intracellular protein transport proteins of the AAA-ATPase family. The molecular function of the MIT domain is unclear.
Probab=69.00  E-value=35  Score=25.45  Aligned_cols=58  Identities=12%  Similarity=0.096  Sum_probs=40.7

Q ss_pred             HHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHhhccc-CCCcHhH-HHHHHHHHHHHHH
Q psy5736         273 LSELNEMFYRGIEQMNTSCFREAVESLTKFSDQIHELV-VPPYKLA-SLAHEALRNCWSL  330 (346)
Q Consensus       273 l~~~~~~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l-~p~h~~~-~~~~~~L~~~y~~  330 (346)
                      ++++..++.+|++.=..|++++|+..|.+.++.+-..+ ..+.+.. ...+..+...+.+
T Consensus         3 ~~~A~~l~~~Av~~D~~g~y~eA~~~Y~~aie~l~~~~k~e~~~~~k~~~~~k~~eyl~R   62 (75)
T cd02678           3 LQKAIELVKKAIEEDNAGNYEEALRLYQHALEYFMHALKYEKNPKSKESIRAKCTEYLDR   62 (75)
T ss_pred             HHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHHhhCCCHHHHHHHHHHHHHHHHH
Confidence            45667777888887788999999999999999998865 3344433 3344444444444


No 103
>cd02684 MIT_2 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in proteins with an n-terminal serine/threonine kinase domain. The molecular function of the MIT domain is unclear.
Probab=68.74  E-value=34  Score=25.75  Aligned_cols=58  Identities=5%  Similarity=-0.063  Sum_probs=41.3

Q ss_pred             HHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHhhccc-CCCcHhH-HHHHHHHHHHHHH
Q psy5736         273 LSELNEMFYRGIEQMNTSCFREAVESLTKFSDQIHELV-VPPYKLA-SLAHEALRNCWSL  330 (346)
Q Consensus       273 l~~~~~~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l-~p~h~~~-~~~~~~L~~~y~~  330 (346)
                      +.++..++.+|++.-..|++++|+.+|...++.|-..+ ..+++.. ...+..+...+.+
T Consensus         3 l~~Ai~lv~~Av~~D~~g~y~eA~~lY~~ale~~~~~~k~e~~~~~k~~lr~k~~eyl~R   62 (75)
T cd02684           3 LEKAIALVVQAVKKDQRGDAAAALSLYCSALQYFVPALHYETDAQRKEALRQKVLQYVSR   62 (75)
T ss_pred             HHHHHHHHHHHHHHHHhccHHHHHHHHHHHHHHHHHHHhhCCCHHHHHHHHHHHHHHHHH
Confidence            45667778888888888999999999999999998866 4455544 3344444444443


No 104
>PLN03098 LPA1 LOW PSII ACCUMULATION1; Provisional
Probab=67.86  E-value=8.9  Score=38.87  Aligned_cols=53  Identities=11%  Similarity=0.125  Sum_probs=42.4

Q ss_pred             HHHHHHHHHHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCCc
Q psy5736         277 NEMFYRGIEQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGNK  334 (346)
Q Consensus       277 ~~~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~~  334 (346)
                      ...+..+..++..|++++|+..|++.|+.-     |.+.....+...++-||+.+|+.
T Consensus        76 ~a~~NLG~AL~~lGryeEAIa~f~rALeL~-----Pd~aeA~~A~yNLAcaya~LGr~  128 (453)
T PLN03098         76 EDAVNLGLSLFSKGRVKDALAQFETALELN-----PNPDEAQAAYYNKACCHAYREEG  128 (453)
T ss_pred             HHHHHHHHHHHHcCCHHHHHHHHHHHHhhC-----CCchHHHHHHHHHHHHHHHcCCH
Confidence            344556677788899999999999999874     77776667788999999998874


No 105
>PF00515 TPR_1:  Tetratricopeptide repeat;  InterPro: IPR001440 The tetratrico peptide repeat (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. The X-ray structure of a domain containing three TPRs from protein phosphatase 5 revealed that TPR adopts a helix-turn-helix arrangement, with adjacent TPR motifs packing in a parallel fashion, resulting in a spiral of repeating anti-parallel alpha-helices []. The two helices are denoted helix A and helix B. The packing angle between helix A and helix B is ~24 degrees; within a single TPR and generates a right-handed superhelical shape. Helix A interacts with helix B and with helix A' of the next TPR. Two protein surfaces are generated: the inner concave surface is contributed to mainly by residue on helices A, and the other surface presents residues from both helices A and B. ; GO: 0005515 protein binding; PDB: 3SF4_C 2LNI_A 1ELW_A 2C0M_A 1FCH_B 3R9A_B 2J9Q_A 2C0L_A 1KT1_A 3FWV_A ....
Probab=66.62  E-value=16  Score=22.14  Aligned_cols=29  Identities=14%  Similarity=0.142  Sum_probs=22.4

Q ss_pred             HHHHHHHHHhccccHHHHHHHHHHHHHHh
Q psy5736         278 EMFYRGIEQMNTSCFREAVESLTKFSDQI  306 (346)
Q Consensus       278 ~~~~~a~~~~~~~~~~~ai~~l~~~l~~~  306 (346)
                      ..+..|...+..+++++|+..+++.|+.-
T Consensus         3 ~~~~~g~~~~~~~~~~~A~~~~~~al~~~   31 (34)
T PF00515_consen    3 AYYNLGNAYFQLGDYEEALEYYQRALELD   31 (34)
T ss_dssp             HHHHHHHHHHHTT-HHHHHHHHHHHHHHS
T ss_pred             HHHHHHHHHHHhCCchHHHHHHHHHHHHC
Confidence            34566777788899999999999999864


No 106
>KOG0281|consensus
Probab=66.57  E-value=7.2  Score=38.15  Aligned_cols=49  Identities=14%  Similarity=0.213  Sum_probs=38.7

Q ss_pred             ccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736           4 LFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYR   63 (346)
Q Consensus         4 ~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r   63 (346)
                      -+.+|.|||..++.-+-. -|+      |++.|.+++    ..+.+.+.|++|..||+|.
T Consensus       338 gDRTikvW~~st~efvRt-l~g------HkRGIAClQ----Yr~rlvVSGSSDntIRlwd  386 (499)
T KOG0281|consen  338 GDRTIKVWSTSTCEFVRT-LNG------HKRGIACLQ----YRDRLVVSGSSDNTIRLWD  386 (499)
T ss_pred             CCceEEEEeccceeeehh-hhc------ccccceehh----ccCeEEEecCCCceEEEEe
Confidence            367899999998854422 222      778899987    6789999999999999994


No 107
>KOG0285|consensus
Probab=66.13  E-value=13  Score=36.66  Aligned_cols=54  Identities=15%  Similarity=0.242  Sum_probs=40.4

Q ss_pred             cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCC
Q psy5736           3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNY   65 (346)
Q Consensus         3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~   65 (346)
                      |-+-++.|||..+...+..+.       -|...++++. .++.+..+ .+||.|+.||+|..-
T Consensus       254 grDst~RvWDiRtr~~V~~l~-------GH~~~V~~V~-~~~~dpqv-it~S~D~tvrlWDl~  307 (460)
T KOG0285|consen  254 GRDSTIRVWDIRTRASVHVLS-------GHTNPVASVM-CQPTDPQV-ITGSHDSTVRLWDLR  307 (460)
T ss_pred             CCcceEEEeeecccceEEEec-------CCCCcceeEE-eecCCCce-EEecCCceEEEeeec
Confidence            567899999999888777663       2566777765 35555555 779999999999543


No 108
>cd02680 MIT_calpain7_2 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in the nuclear thiol protease PalBH. The molecular function of the MIT domain is unclear.
Probab=65.72  E-value=13  Score=28.17  Aligned_cols=36  Identities=14%  Similarity=0.127  Sum_probs=29.2

Q ss_pred             HHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHhhc
Q psy5736         273 LSELNEMFYRGIEQMNTSCFREAVESLTKFSDQIHE  308 (346)
Q Consensus       273 l~~~~~~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~  308 (346)
                      +.++..++++|++.=+.|++++|+++|...++.+-.
T Consensus         3 l~kai~Lv~~A~~eD~~gny~eA~~lY~~ale~~~~   38 (75)
T cd02680           3 LERAHFLVTQAFDEDEKGNAEEAIELYTEAVELCIN   38 (75)
T ss_pred             HHHHHHHHHHHHHhhHhhhHHHHHHHHHHHHHHHHH
Confidence            345666777777777789999999999999998866


No 109
>KOG1007|consensus
Probab=65.70  E-value=6  Score=37.78  Aligned_cols=53  Identities=23%  Similarity=0.278  Sum_probs=43.7

Q ss_pred             ccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736           4 LFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYR   63 (346)
Q Consensus         4 ~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r   63 (346)
                      -++++.-||..+..+.++..+-      |+-++-+|. +|+.-.-++.++++||-||+|.
T Consensus       191 ~d~tl~~~D~RT~~~~~sI~dA------Hgq~vrdlD-fNpnkq~~lvt~gDdgyvriWD  243 (370)
T KOG1007|consen  191 SDSTLQFWDLRTMKKNNSIEDA------HGQRVRDLD-FNPNKQHILVTCGDDGYVRIWD  243 (370)
T ss_pred             CCCcEEEEEccchhhhcchhhh------hcceeeecc-CCCCceEEEEEcCCCccEEEEe
Confidence            4688999999988777766432      777888888 6888888999999999999993


No 110
>KOG0291|consensus
Probab=65.24  E-value=13  Score=39.86  Aligned_cols=51  Identities=18%  Similarity=0.354  Sum_probs=42.0

Q ss_pred             cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736           3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY   62 (346)
Q Consensus         3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~   62 (346)
                      +-+|.|.|||-.+|--...|..       |.+-+|.++|.-  ....+++.+-||.||.|
T Consensus       369 ~eDgKVKvWn~~SgfC~vTFte-------Hts~Vt~v~f~~--~g~~llssSLDGtVRAw  419 (893)
T KOG0291|consen  369 AEDGKVKVWNTQSGFCFVTFTE-------HTSGVTAVQFTA--RGNVLLSSSLDGTVRAW  419 (893)
T ss_pred             cCCCcEEEEeccCceEEEEecc-------CCCceEEEEEEe--cCCEEEEeecCCeEEee
Confidence            4589999999999988888863       556799999764  45666889999999998


No 111
>PF10579 Rapsyn_N:  Rapsyn N-terminal myristoylation and linker region;  InterPro: IPR019568  Neuromuscular junction formation relies upon the clustering of acetylcholine receptors and other proteins in the muscle membrane. Rapsyn is a peripheral membrane protein that is selectively concentrated at the neuromuscular junction and is essential for the formation of synaptic acetylcholine receptor aggregates. Acetylcholine receptors fail to aggregate beneath nerve terminals in mice where rapsyn has been knocked out. The N-terminal six amino acids of rapsyn are its myristoylation site, and myristoylation is necessary for the targeting of the protein to the membrane []. ; GO: 0008270 zinc ion binding, 0033130 acetylcholine receptor binding, 0007268 synaptic transmission, 0005856 cytoskeleton, 0030054 cell junction, 0045211 postsynaptic membrane
Probab=65.17  E-value=35  Score=26.25  Aligned_cols=54  Identities=11%  Similarity=0.059  Sum_probs=41.6

Q ss_pred             HHHHHHHHHHHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCCc
Q psy5736         276 LNEMFYRGIEQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGNK  334 (346)
Q Consensus       276 ~~~~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~~  334 (346)
                      +...+++++++.++.+.++|+....+.|++..+     .+.-..+--.|+++|.+.|..
T Consensus         6 ak~~ie~GlkLY~~~~~~~Al~~W~~aL~k~~~-----~~~rf~~lG~l~qA~~e~Gky   59 (80)
T PF10579_consen    6 AKQQIEKGLKLYHQNETQQALQKWRKALEKITD-----REDRFRVLGYLIQAHMEWGKY   59 (80)
T ss_pred             HHHHHHHHHHHhccchHHHHHHHHHHHHhhcCC-----hHHHHHHHHHHHHHHHHHHHH
Confidence            445567788888888888999999999988744     346777777888888888753


No 112
>KOG0306|consensus
Probab=64.62  E-value=9.7  Score=40.67  Aligned_cols=51  Identities=20%  Similarity=0.184  Sum_probs=41.2

Q ss_pred             ccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736           4 LFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYR   63 (346)
Q Consensus         4 ~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r   63 (346)
                      -+|.|.|||-+++..+..|+       -||..||.|++=  ..+..+++|+-||.|-+|.
T Consensus        85 aDGsVqif~~~s~~~~~tfn-------gHK~AVt~l~fd--~~G~rlaSGskDt~IIvwD  135 (888)
T KOG0306|consen   85 ADGSVQIFSLESEEILITFN-------GHKAAVTTLKFD--KIGTRLASGSKDTDIIVWD  135 (888)
T ss_pred             cCceEEeeccCCCceeeeec-------ccccceEEEEEc--ccCceEeecCCCccEEEEE
Confidence            37999999999998888772       288999999953  3456678899999998884


No 113
>PF07719 TPR_2:  Tetratricopeptide repeat;  InterPro: IPR013105 The tetratrico peptide repeat (TPR) is a structural motif present in a wide range of proteins [, , ]. It mediates protein-protein interactions and the assembly of multiprotein complexes []. The TPR motif consists of 3-16 tandem-repeats of 34 amino acids residues, although individual TPR motifs can be dispersed in the protein sequence. Sequence alignment of the TPR domains reveals a consensus sequence defined by a pattern of small and large amino acids. TPR motifs have been identified in various different organisms, ranging from bacteria to humans. Proteins containing TPRs are involved in a variety of biological processes, such as cell cycle regulation, transcriptional control, mitochondrial and peroxisomal protein transport, neurogenesis and protein folding. This repeat includes outlying Tetratricopeptide-like repeats (TPR) that are not matched by IPR001440 from INTERPRO.; PDB: 1XNF_B 3Q15_A 4ABN_A 1OUV_A 3U4T_A 3MA5_C 2KCV_A 2KCL_A 2XEV_A 3NF1_A ....
Probab=64.22  E-value=18  Score=21.63  Aligned_cols=28  Identities=21%  Similarity=0.297  Sum_probs=21.3

Q ss_pred             HHHHHHHHhccccHHHHHHHHHHHHHHh
Q psy5736         279 MFYRGIEQMNTSCFREAVESLTKFSDQI  306 (346)
Q Consensus       279 ~~~~a~~~~~~~~~~~ai~~l~~~l~~~  306 (346)
                      .+..|.-.+..|++++|++.+++.++..
T Consensus         4 ~~~lg~~~~~~~~~~~A~~~~~~al~l~   31 (34)
T PF07719_consen    4 WYYLGQAYYQLGNYEEAIEYFEKALELD   31 (34)
T ss_dssp             HHHHHHHHHHTT-HHHHHHHHHHHHHHS
T ss_pred             HHHHHHHHHHhCCHHHHHHHHHHHHHHC
Confidence            3445566678899999999999999865


No 114
>KOG0284|consensus
Probab=64.03  E-value=6.6  Score=39.04  Aligned_cols=73  Identities=26%  Similarity=0.435  Sum_probs=51.3

Q ss_pred             cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE--e------CCccccchhhh
Q psy5736           3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY--R------NYASHEHRLVT   74 (346)
Q Consensus         3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~--r------~~~sh~~~Lv~   74 (346)
                      |-+..|.+||-.+|.-+++..       .||.-|..++| |.+. ..||+++-|..+++|  |      -|..|+.+..+
T Consensus       241 skDnlVKlWDprSg~cl~tlh-------~HKntVl~~~f-~~n~-N~Llt~skD~~~kv~DiR~mkEl~~~r~Hkkdv~~  311 (464)
T KOG0284|consen  241 SKDNLVKLWDPRSGSCLATLH-------GHKNTVLAVKF-NPNG-NWLLTGSKDQSCKVFDIRTMKELFTYRGHKKDVTS  311 (464)
T ss_pred             cCCceeEeecCCCcchhhhhh-------hccceEEEEEE-cCCC-CeeEEccCCceEEEEehhHhHHHHHhhcchhhhee
Confidence            446689999999999888653       37778899985 5454 999999999999998  3      12235555444


Q ss_pred             h-hhhhccccc
Q psy5736          75 S-FQALTDTSF   84 (346)
Q Consensus        75 a-~r~L~~~~y   84 (346)
                      . |+-+....|
T Consensus       312 ~~WhP~~~~lf  322 (464)
T KOG0284|consen  312 LTWHPLNESLF  322 (464)
T ss_pred             eccccccccce
Confidence            3 555544433


No 115
>KOG4328|consensus
Probab=63.95  E-value=7.9  Score=38.98  Aligned_cols=51  Identities=14%  Similarity=0.236  Sum_probs=35.1

Q ss_pred             ccceEEEEecC---CCc-eeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736           4 LFLKEGIWDHI---TGS-RLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY   62 (346)
Q Consensus         4 ~~~~~~~w~~~---~~~-~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~   62 (346)
                      -+|+|.+||..   ... +.-.|   .    +|+..|+.|+|. +.+.+..+..|.||.+|.-
T Consensus       208 K~G~VG~Wn~~~~~~d~d~v~~f---~----~hs~~Vs~l~F~-P~n~s~i~ssSyDGtiR~~  262 (498)
T KOG4328|consen  208 KGGQVGLWNFGTQEKDKDGVYLF---T----PHSGPVSGLKFS-PANTSQIYSSSYDGTIRLQ  262 (498)
T ss_pred             CCCcEEEEecCCCCCccCceEEe---c----cCCccccceEec-CCChhheeeeccCceeeee
Confidence            47999999994   222 22233   2    377889999965 4454555668999999874


No 116
>PRK00398 rpoP DNA-directed RNA polymerase subunit P; Provisional
Probab=63.90  E-value=5.2  Score=27.05  Aligned_cols=30  Identities=17%  Similarity=0.435  Sum_probs=18.9

Q ss_pred             ccccCCcCcCCcccccccCCcccccCcccccccCCcCC
Q psy5736         228 RIRCSNDNCGQIIATVKKLEPSAKKVEKKCESCNSTSD  265 (346)
Q Consensus       228 ~~~C~~~~C~g~~~~~~~~~~~~~~~~~~C~~C~~~~~  265 (346)
                      -++|+  +|+..+......      ..+.|+.||..+.
T Consensus         3 ~y~C~--~CG~~~~~~~~~------~~~~Cp~CG~~~~   32 (46)
T PRK00398          3 EYKCA--RCGREVELDEYG------TGVRCPYCGYRIL   32 (46)
T ss_pred             EEECC--CCCCEEEECCCC------CceECCCCCCeEE
Confidence            46788  888765444221      1478888887643


No 117
>cd02682 MIT_AAA_Arch MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in mostly archaebacterial AAA-ATPases. The molecular function of the MIT domain is unclear.
Probab=63.64  E-value=53  Score=24.89  Aligned_cols=57  Identities=7%  Similarity=0.081  Sum_probs=40.6

Q ss_pred             HHHHHHHHHHHHhccccHHHHHHHHHHHHHHhhccc--CCCcHhHHHHHHHHHHHHHHh
Q psy5736         275 ELNEMFYRGIEQMNTSCFREAVESLTKFSDQIHELV--VPPYKLASLAHEALRNCWSLA  331 (346)
Q Consensus       275 ~~~~~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l--~p~h~~~~~~~~~L~~~y~~~  331 (346)
                      .+.....+|++.=..|++++|+..|.+.++.+.+++  .|.++.....++.+..+..+.
T Consensus         5 ~A~~~a~~AVe~D~~gr~~eAi~~Y~~aIe~L~q~~~~~pD~~~k~~yr~ki~eY~~Ra   63 (75)
T cd02682           5 MARKYAINAVKAEKEGNAEDAITNYKKAIEVLSQIVKNYPDSPTRLIYEQMINEYKRRI   63 (75)
T ss_pred             HHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHhCCChHHHHHHHHHHHHHHHHH
Confidence            344556677877778999999999999999998865  345555555566665555443


No 118
>KOG0772|consensus
Probab=63.61  E-value=6.4  Score=40.29  Aligned_cols=56  Identities=21%  Similarity=0.193  Sum_probs=37.8

Q ss_pred             ccceEEEEecCCC-ceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeC
Q psy5736           4 LFLKEGIWDHITG-SRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRN   64 (346)
Q Consensus         4 ~~~~~~~w~~~~~-~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~   64 (346)
                      ++|.|.+||..+- .+.+..- . .++. .+..||++.|-  .|..+||+-+-|+.+++|.+
T Consensus       337 ~DGSIQ~W~~~~~~v~p~~~v-k-~AH~-~g~~Itsi~FS--~dg~~LlSRg~D~tLKvWDL  393 (641)
T KOG0772|consen  337 LDGSIQIWDKGSRTVRPVMKV-K-DAHL-PGQDITSISFS--YDGNYLLSRGFDDTLKVWDL  393 (641)
T ss_pred             cCCceeeeecCCcccccceEe-e-eccC-CCCceeEEEec--cccchhhhccCCCceeeeec
Confidence            6899999998432 2222221 1 2211 24589999963  67888898889999999943


No 119
>PF14783 BBS2_Mid:  Ciliary BBSome complex subunit 2, middle region
Probab=63.50  E-value=8  Score=31.63  Aligned_cols=30  Identities=33%  Similarity=0.473  Sum_probs=24.0

Q ss_pred             ccceeccccCC--CceEEeeecCCceEEEeCC
Q psy5736          36 CTALTLLNEHD--DSLLLTGYDDGSLSVYRNY   65 (346)
Q Consensus        36 ~t~l~~ine~~--~~lll~g~~dG~vri~r~~   65 (346)
                      +++|-+..-+.  ..-|++|++|..||+|++=
T Consensus         2 V~al~~~d~d~dg~~eLlvGs~D~~IRvf~~~   33 (111)
T PF14783_consen    2 VTALCLFDFDGDGENELLVGSDDFEIRVFKGD   33 (111)
T ss_pred             eeEEEEEecCCCCcceEEEecCCcEEEEEeCC
Confidence            67777777644  4778999999999999765


No 120
>KOG2110|consensus
Probab=62.98  E-value=14  Score=36.25  Aligned_cols=49  Identities=20%  Similarity=0.289  Sum_probs=41.3

Q ss_pred             cceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCce-EEE
Q psy5736           5 FLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSL-SVY   62 (346)
Q Consensus         5 ~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~v-ri~   62 (346)
                      -|.|.|||-.+-+..+.+.   .    |+..+.+|.| | +++.++.+||+-|.| |+|
T Consensus       152 ~GdV~l~d~~nl~~v~~I~---a----H~~~lAalaf-s-~~G~llATASeKGTVIRVf  201 (391)
T KOG2110|consen  152 SGDVVLFDTINLQPVNTIN---A----HKGPLAALAF-S-PDGTLLATASEKGTVIRVF  201 (391)
T ss_pred             CceEEEEEcccceeeeEEE---e----cCCceeEEEE-C-CCCCEEEEeccCceEEEEE
Confidence            4889999999999999885   2    7889999985 3 788999999998876 776


No 121
>KOG1840|consensus
Probab=62.58  E-value=19  Score=37.18  Aligned_cols=57  Identities=14%  Similarity=-0.003  Sum_probs=50.7

Q ss_pred             hccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCCccccCccccc
Q psy5736         287 MNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGNKWVIPENYSQ  343 (346)
Q Consensus       287 ~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~~~~~~~~~~~  343 (346)
                      ...+++++|+.++.+.++++.+.+.+.|+.+...+--|+.+|-..|+...--++++.
T Consensus       336 ~~~~~~Eea~~l~q~al~i~~~~~g~~~~~~a~~~~nl~~l~~~~gk~~ea~~~~k~  392 (508)
T KOG1840|consen  336 QSMNEYEEAKKLLQKALKIYLDAPGEDNVNLAKIYANLAELYLKMGKYKEAEELYKK  392 (508)
T ss_pred             HHhcchhHHHHHHHHHHHHHHhhccccchHHHHHHHHHHHHHHHhcchhHHHHHHHH
Confidence            345779999999999999999999999999999999999999999998777666654


No 122
>KOG4328|consensus
Probab=61.84  E-value=8.2  Score=38.88  Aligned_cols=56  Identities=16%  Similarity=0.087  Sum_probs=44.0

Q ss_pred             ccceEEEEecCCCce-eeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE--eCCcc
Q psy5736           4 LFLKEGIWDHITGSR-LISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY--RNYAS   67 (346)
Q Consensus         4 ~~~~~~~w~~~~~~~-~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~--r~~~s   67 (346)
                      -+|.++|||...+.. +.       .|..|+.+|++|. +|+.-.-++++++.|+..+||  |.+..
T Consensus       299 ~~G~f~~iD~R~~~s~~~-------~~~lh~kKI~sv~-~NP~~p~~laT~s~D~T~kIWD~R~l~~  357 (498)
T KOG4328|consen  299 NVGNFNVIDLRTDGSEYE-------NLRLHKKKITSVA-LNPVCPWFLATASLDQTAKIWDLRQLRG  357 (498)
T ss_pred             cccceEEEEeecCCccch-------hhhhhhcccceee-cCCCCchheeecccCcceeeeehhhhcC
Confidence            467899999977643 32       2567889999998 788777788999999999999  56643


No 123
>KOG0279|consensus
Probab=61.84  E-value=15  Score=34.98  Aligned_cols=59  Identities=22%  Similarity=0.328  Sum_probs=39.1

Q ss_pred             cccceEEEEecCCCceeeeeCCCCC--CCCCCCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736           3 GLFLKEGIWDHITGSRLISWRPDRG--SWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYR   63 (346)
Q Consensus         3 ~~~~~~~~w~~~~~~~~~~~~~~~~--~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r   63 (346)
                      +....|.|||-++++.+-...-.-.  +-+....+|.+|.+.+  |+.-|+.|++|+.||+|.
T Consensus       251 at~~sIkIwdl~~~~~v~~l~~d~~g~s~~~~~~~clslaws~--dG~tLf~g~td~~irv~q  311 (315)
T KOG0279|consen  251 ATATSIKIWDLESKAVVEELKLDGIGPSSKAGDPICLSLAWSA--DGQTLFAGYTDNVIRVWQ  311 (315)
T ss_pred             ccCCceEEEeccchhhhhhccccccccccccCCcEEEEEEEcC--CCcEEEeeecCCcEEEEE
Confidence            3445688999988877664432111  1122245688877654  677778899999999984


No 124
>cd02677 MIT_SNX15 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This MIT domain sub-family is found in sorting nexin 15 and related proteins. The molecular function of the MIT domain is unclear.
Probab=61.59  E-value=55  Score=24.66  Aligned_cols=44  Identities=16%  Similarity=0.074  Sum_probs=34.6

Q ss_pred             HHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHhhccc-CCCcHh
Q psy5736         273 LSELNEMFYRGIEQMNTSCFREAVESLTKFSDQIHELV-VPPYKL  316 (346)
Q Consensus       273 l~~~~~~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l-~p~h~~  316 (346)
                      +.++..++.+|++.=+.+++++|++.|.+.++.|...+ ..+++.
T Consensus         3 l~~A~~l~~~Ave~d~~~~y~eA~~~Y~~~i~~~~~~~k~e~~~~   47 (75)
T cd02677           3 LEQAAELIRLALEKEEEGDYEAAFEFYRAGVDLLLKGVQGDSSPE   47 (75)
T ss_pred             HHHHHHHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHhccCCCHH
Confidence            44566777888888788999999999999999998866 345543


No 125
>KOG0771|consensus
Probab=61.57  E-value=8.4  Score=38.17  Aligned_cols=53  Identities=17%  Similarity=0.156  Sum_probs=37.9

Q ss_pred             cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCC
Q psy5736           3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNY   65 (346)
Q Consensus         3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~   65 (346)
                      |-+|+++||+|.+-..+..+.       .|+.+|++|.| . .|...+.+=+.| ..++|.--
T Consensus       163 g~dg~lRv~~~Ps~~t~l~e~-------~~~~eV~DL~F-S-~dgk~lasig~d-~~~VW~~~  215 (398)
T KOG0771|consen  163 GTDGTLRVWEWPSMLTILEEI-------AHHAEVKDLDF-S-PDGKFLASIGAD-SARVWSVN  215 (398)
T ss_pred             cccceEEEEecCcchhhhhhH-------hhcCcccccee-C-CCCcEEEEecCC-ceEEEEec
Confidence            679999999999887776542       37889999985 3 344444544455 88999543


No 126
>KOG0277|consensus
Probab=61.55  E-value=15  Score=34.53  Aligned_cols=52  Identities=13%  Similarity=0.229  Sum_probs=41.5

Q ss_pred             ccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736           4 LFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYR   63 (346)
Q Consensus         4 ~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r   63 (346)
                      -+|+|.+||...++-+..|.-.       ...|=..+ .+++...+.+.+|.||.+|+|.
T Consensus       125 WD~TiKLW~~~r~~Sv~Tf~gh-------~~~Iy~a~-~sp~~~nlfas~Sgd~~l~lwd  176 (311)
T KOG0277|consen  125 WDGTIKLWDPNRPNSVQTFNGH-------NSCIYQAA-FSPHIPNLFASASGDGTLRLWD  176 (311)
T ss_pred             cCCceEeecCCCCcceEeecCC-------ccEEEEEe-cCCCCCCeEEEccCCceEEEEE
Confidence            5899999999999999887533       34453333 5788889999999999999995


No 127
>KOG0279|consensus
Probab=61.41  E-value=13  Score=35.27  Aligned_cols=54  Identities=13%  Similarity=0.304  Sum_probs=42.1

Q ss_pred             cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCC
Q psy5736           3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNY   65 (346)
Q Consensus         3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~   65 (346)
                      +-+|++.+||-++|+...+|.       -|...+.++. |+ .|+.-..+||-|..+.+|.-.
T Consensus        82 swD~~lrlWDl~~g~~t~~f~-------GH~~dVlsva-~s-~dn~qivSGSrDkTiklwnt~  135 (315)
T KOG0279|consen   82 SWDGTLRLWDLATGESTRRFV-------GHTKDVLSVA-FS-TDNRQIVSGSRDKTIKLWNTL  135 (315)
T ss_pred             cccceEEEEEecCCcEEEEEE-------ecCCceEEEE-ec-CCCceeecCCCcceeeeeeec
Confidence            468999999999998888885       2667788877 44 455556889999999999543


No 128
>KOG0268|consensus
Probab=60.07  E-value=7.9  Score=38.01  Aligned_cols=57  Identities=18%  Similarity=0.252  Sum_probs=42.5

Q ss_pred             cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCCcc
Q psy5736           3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNYAS   67 (346)
Q Consensus         3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~~s   67 (346)
                      +.+.+|.|+....+.----|.+.++      -.+.++|+  .+|..+.+.||+||+||+|+-+-+
T Consensus       291 syDksIRIf~~~~~~SRdiYhtkRM------q~V~~Vk~--S~Dskyi~SGSdd~nvRlWka~As  347 (433)
T KOG0268|consen  291 SYDKSIRIFPVNHGHSRDIYHTKRM------QHVFCVKY--SMDSKYIISGSDDGNVRLWKAKAS  347 (433)
T ss_pred             cccceEEEeecCCCcchhhhhHhhh------heeeEEEE--eccccEEEecCCCcceeeeecchh
Confidence            4577899998877754444555544      35777774  478899999999999999987754


No 129
>KOG0640|consensus
Probab=59.41  E-value=16  Score=35.33  Aligned_cols=54  Identities=20%  Similarity=0.332  Sum_probs=40.9

Q ss_pred             eEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCCc
Q psy5736           7 KEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNYA   66 (346)
Q Consensus         7 ~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~~   66 (346)
                      ++++||.++-+-..+-..+.    -|+.-|++++.  ..+.++..+|+-||.|++|....
T Consensus       239 ~~rlYdv~T~QcfvsanPd~----qht~ai~~V~Y--s~t~~lYvTaSkDG~IklwDGVS  292 (430)
T KOG0640|consen  239 TLRLYDVNTYQCFVSANPDD----QHTGAITQVRY--SSTGSLYVTASKDGAIKLWDGVS  292 (430)
T ss_pred             ceeEEeccceeEeeecCccc----ccccceeEEEe--cCCccEEEEeccCCcEEeecccc
Confidence            57899999776555444333    37778999873  36788889999999999997664


No 130
>PF13717 zinc_ribbon_4:  zinc-ribbon domain
Probab=57.96  E-value=5.6  Score=25.63  Aligned_cols=33  Identities=24%  Similarity=0.519  Sum_probs=18.5

Q ss_pred             cccCCcCcCCcccccccCCcccccCcccccccCCcC
Q psy5736         229 IRCSNDNCGQIIATVKKLEPSAKKVEKKCESCNSTS  264 (346)
Q Consensus       229 ~~C~~~~C~g~~~~~~~~~~~~~~~~~~C~~C~~~~  264 (346)
                      +.||  +|...+..++..-| ......+|+.|+...
T Consensus         3 i~Cp--~C~~~y~i~d~~ip-~~g~~v~C~~C~~~f   35 (36)
T PF13717_consen    3 ITCP--NCQAKYEIDDEKIP-PKGRKVRCSKCGHVF   35 (36)
T ss_pred             EECC--CCCCEEeCCHHHCC-CCCcEEECCCCCCEe
Confidence            5677  78765443322111 223458899998753


No 131
>KOG0313|consensus
Probab=57.75  E-value=13  Score=36.66  Aligned_cols=51  Identities=16%  Similarity=0.251  Sum_probs=35.9

Q ss_pred             cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736           3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYR   63 (346)
Q Consensus         3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r   63 (346)
                      +-+-+|.+||.++|.-......++        -++++...  ....++++|++|-.+|+|.
T Consensus       278 SwDHTIk~WDletg~~~~~~~~~k--------sl~~i~~~--~~~~Ll~~gssdr~irl~D  328 (423)
T KOG0313|consen  278 SWDHTIKVWDLETGGLKSTLTTNK--------SLNCISYS--PLSKLLASGSSDRHIRLWD  328 (423)
T ss_pred             cccceEEEEEeecccceeeeecCc--------ceeEeecc--cccceeeecCCCCceeecC
Confidence            357799999999997665443332        23443322  3678889999999999993


No 132
>TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol.
Probab=57.08  E-value=29  Score=31.41  Aligned_cols=52  Identities=15%  Similarity=0.112  Sum_probs=36.0

Q ss_pred             cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736           3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYR   63 (346)
Q Consensus         3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r   63 (346)
                      +.+|+|.+||-++++.+..|....        .+..+.+ .++...+.++++.++.+++|.
T Consensus         8 ~~d~~v~~~d~~t~~~~~~~~~~~--------~~~~l~~-~~dg~~l~~~~~~~~~v~~~d   59 (300)
T TIGR03866         8 EKDNTISVIDTATLEVTRTFPVGQ--------RPRGITL-SKDGKLLYVCASDSDTIQVID   59 (300)
T ss_pred             cCCCEEEEEECCCCceEEEEECCC--------CCCceEE-CCCCCEEEEEECCCCeEEEEE
Confidence            568999999999998887775322        2445554 334344556778899999985


No 133
>KOG0639|consensus
Probab=57.02  E-value=12  Score=38.22  Aligned_cols=52  Identities=17%  Similarity=0.293  Sum_probs=38.1

Q ss_pred             ccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeC
Q psy5736           4 LFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRN   64 (346)
Q Consensus         4 ~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~   64 (346)
                      -+|+|.|||..++..+-.|.-.-     -+..|-+|+    +|..-+-+|+-|..||-|..
T Consensus       529 sdGnI~vwDLhnq~~VrqfqGht-----DGascIdis----~dGtklWTGGlDntvRcWDl  580 (705)
T KOG0639|consen  529 SDGNIAVWDLHNQTLVRQFQGHT-----DGASCIDIS----KDGTKLWTGGLDNTVRCWDL  580 (705)
T ss_pred             cCCcEEEEEcccceeeecccCCC-----CCceeEEec----CCCceeecCCCccceeehhh
Confidence            37999999999998888885332     234444433    56666788999999999943


No 134
>PF04212 MIT:  MIT (microtubule interacting and transport) domain;  InterPro: IPR007330 The MIT domain is found in vacuolar sorting proteins, spastin (probable ATPase involved in the assembly or function of nuclear protein complexes), and a sorting nexin, which may play a role in intracellular trafficking.; PDB: 2DL1_A 2JQK_A 1WR0_A 2CPT_A 2JQH_A 2V6Y_A 2JQ9_A 2K3W_A 1YXR_A 3EAB_E ....
Probab=56.85  E-value=25  Score=25.60  Aligned_cols=37  Identities=14%  Similarity=0.241  Sum_probs=29.4

Q ss_pred             HHHHHHHHHHHHHhccccHHHHHHHHHHHHHHhhccc
Q psy5736         274 SELNEMFYRGIEQMNTSCFREAVESLTKFSDQIHELV  310 (346)
Q Consensus       274 ~~~~~~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l  310 (346)
                      +.+.....+|++.=+.|++++|++.|.+.++.+...+
T Consensus         3 ~~A~~~~~~Av~~D~~g~~~~A~~~Y~~ai~~l~~~~   39 (69)
T PF04212_consen    3 DKAIELIKKAVEADEAGNYEEALELYKEAIEYLMQAL   39 (69)
T ss_dssp             HHHHHHHHHHHHHHHTTSHHHHHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHh
Confidence            4455666777777778999999999999998887754


No 135
>KOG0643|consensus
Probab=55.73  E-value=17  Score=34.44  Aligned_cols=51  Identities=24%  Similarity=0.334  Sum_probs=40.8

Q ss_pred             ccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736           4 LFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY   62 (346)
Q Consensus         4 ~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~   62 (346)
                      -+|.|++||-.+|..+..-  .    ..|...|++|++  ..|..+.++|+.|..-++|
T Consensus       167 e~G~is~~da~~g~~~v~s--~----~~h~~~Ind~q~--s~d~T~FiT~s~Dttakl~  217 (327)
T KOG0643|consen  167 EDGSISIYDARTGKELVDS--D----EEHSSKINDLQF--SRDRTYFITGSKDTTAKLV  217 (327)
T ss_pred             CCCcEEEEEcccCceeeec--h----hhhccccccccc--cCCcceEEecccCccceee
Confidence            4799999999999777632  1    257889999984  4788999999988888776


No 136
>cd02656 MIT MIT: domain contained within Microtubule Interacting and Trafficking molecules. The MIT domain is found in sorting nexins, the nuclear thiol protease PalBH, the AAA protein spastin and archaebacterial proteins with similar domain architecture, vacuolar sorting proteins and others. The molecular function of the MIT domain is unclear.
Probab=55.71  E-value=81  Score=23.23  Aligned_cols=57  Identities=11%  Similarity=0.158  Sum_probs=36.9

Q ss_pred             HHHHHHHHHHHHHhccccHHHHHHHHHHHHHHhhcccC-CCcHhH-HHHHHHHHHHHHH
Q psy5736         274 SELNEMFYRGIEQMNTSCFREAVESLTKFSDQIHELVV-PPYKLA-SLAHEALRNCWSL  330 (346)
Q Consensus       274 ~~~~~~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l~-p~h~~~-~~~~~~L~~~y~~  330 (346)
                      ..+..+...|++.=+.|++++|+..|...++.+...+. .+.+.. ...+..+...+.+
T Consensus         4 ~~a~~l~~~Av~~D~~g~~~~Al~~Y~~a~e~l~~~~~~~~~~~~k~~l~~k~~~yl~R   62 (75)
T cd02656           4 QQAKELIKQAVKEDEDGNYEEALELYKEALDYLLQALKAEKEPKLRKLLRKKVKEYLDR   62 (75)
T ss_pred             HHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHhccCCCHHHHHHHHHHHHHHHHH
Confidence            34455666676666679999999999999999988663 233332 3334444444443


No 137
>CHL00033 ycf3 photosystem I assembly protein Ycf3
Probab=55.46  E-value=54  Score=27.82  Aligned_cols=53  Identities=9%  Similarity=-0.002  Sum_probs=30.1

Q ss_pred             HHHHHHHHHHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCCc
Q psy5736         277 NEMFYRGIEQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGNK  334 (346)
Q Consensus       277 ~~~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~~  334 (346)
                      ...+..+......+++++|+..+.+.+...     |.+.....+...|..+|...|+.
T Consensus        36 ~~~~~~g~~~~~~g~~~~A~~~~~~al~l~-----~~~~~~~~~~~~lg~~~~~~g~~   88 (168)
T CHL00033         36 FTYYRDGMSAQSEGEYAEALQNYYEAMRLE-----IDPYDRSYILYNIGLIHTSNGEH   88 (168)
T ss_pred             HHHHHHHHHHHHcCCHHHHHHHHHHHHhcc-----ccchhhHHHHHHHHHHHHHcCCH
Confidence            334445555566677777777777776652     33333344555666666665543


No 138
>cd02683 MIT_1 MIT: domain contained within Microtubule Interacting and Trafficking molecules. This sub-family of MIT domains is found in proteins with unknown function, co-occuring with an as yet undescribed domain. The molecular function of the MIT domain is unclear.
Probab=55.24  E-value=73  Score=24.07  Aligned_cols=56  Identities=13%  Similarity=0.128  Sum_probs=37.5

Q ss_pred             HHHHHHHHHHHHhccccHHHHHHHHHHHHHHhhcccC-CCcH-hHHHHHHHHHHHHHH
Q psy5736         275 ELNEMFYRGIEQMNTSCFREAVESLTKFSDQIHELVV-PPYK-LASLAHEALRNCWSL  330 (346)
Q Consensus       275 ~~~~~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l~-p~h~-~~~~~~~~L~~~y~~  330 (346)
                      .+..++.+|++.=..|++++|+..|.+.|+.|..++. .+++ .-...+..+...+.+
T Consensus         5 ~a~~l~~~Ave~D~~g~y~eAl~~Y~~aie~l~~~lk~e~d~~~k~~~r~ki~eY~~R   62 (77)
T cd02683           5 AAKEVLKRAVELDQEGRFQEALVCYQEGIDLLMQVLKGTKDEAKKKNLRQKISEYMDR   62 (77)
T ss_pred             HHHHHHHHHHHHHHhccHHHHHHHHHHHHHHHHHHHhhCCCHHHHHHHHHHHHHHHHH
Confidence            3455667777777789999999999999999988653 2333 333334444444443


No 139
>KOG0290|consensus
Probab=55.21  E-value=16  Score=35.10  Aligned_cols=55  Identities=16%  Similarity=0.239  Sum_probs=38.1

Q ss_pred             cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736           3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY   62 (346)
Q Consensus         3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~   62 (346)
                      .++-|-+|||.|+|.- -+...|=-+   |.+|+-++.|.. ..-.+..+-+.||.+|+|
T Consensus       170 SiDTTCTiWdie~~~~-~~vkTQLIA---HDKEV~DIaf~~-~s~~~FASvgaDGSvRmF  224 (364)
T KOG0290|consen  170 SIDTTCTIWDIETGVS-GTVKTQLIA---HDKEVYDIAFLK-GSRDVFASVGADGSVRMF  224 (364)
T ss_pred             cccCeEEEEEEeeccc-cceeeEEEe---cCcceeEEEecc-CccceEEEecCCCcEEEE
Confidence            3577889999999721 011122222   778999999887 445555667789999999


No 140
>KOG0286|consensus
Probab=54.84  E-value=26  Score=33.56  Aligned_cols=50  Identities=20%  Similarity=0.350  Sum_probs=42.2

Q ss_pred             cceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736           5 FLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY   62 (346)
Q Consensus         5 ~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~   62 (346)
                      +.+..+||.|+|+++..|.       .|...|-+|.+--+ +.....+|+-|+.-++|
T Consensus       165 D~TCalWDie~g~~~~~f~-------GH~gDV~slsl~p~-~~ntFvSg~cD~~aklW  214 (343)
T KOG0286|consen  165 DMTCALWDIETGQQTQVFH-------GHTGDVMSLSLSPS-DGNTFVSGGCDKSAKLW  214 (343)
T ss_pred             CceEEEEEcccceEEEEec-------CCcccEEEEecCCC-CCCeEEecccccceeee
Confidence            5688999999999999985       26788999886554 67777899999999999


No 141
>PRK10803 tol-pal system protein YbgF; Provisional
Probab=54.77  E-value=36  Score=31.97  Aligned_cols=50  Identities=14%  Similarity=-0.027  Sum_probs=40.2

Q ss_pred             HHHHHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCCccc
Q psy5736         282 RGIEQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGNKWV  336 (346)
Q Consensus       282 ~a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~~~~  336 (346)
                      .|.-.+..|++++|+..+.++++.|     |.|+....+...+..+|..+|+.-.
T Consensus       186 LG~~y~~~g~~~~A~~~f~~vv~~y-----P~s~~~~dAl~klg~~~~~~g~~~~  235 (263)
T PRK10803        186 LGQLNYNKGKKDDAAYYFASVVKNY-----PKSPKAADAMFKVGVIMQDKGDTAK  235 (263)
T ss_pred             HHHHHHHcCCHHHHHHHHHHHHHHC-----CCCcchhHHHHHHHHHHHHcCCHHH
Confidence            3444566788999999999988887     7888888899889999988887643


No 142
>KOG1446|consensus
Probab=54.75  E-value=27  Score=33.57  Aligned_cols=48  Identities=13%  Similarity=0.189  Sum_probs=34.5

Q ss_pred             cceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736           5 FLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY   62 (346)
Q Consensus         5 ~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~   62 (346)
                      +|+|+|||.++|.++..+...+      ...++.++ +|+ --.  |-.+.+..+..|
T Consensus       253 dg~i~vw~~~tg~~v~~~~~~~------~~~~~~~~-fnP-~~~--mf~sa~s~l~fw  300 (311)
T KOG1446|consen  253 DGTIHVWNLETGKKVAVLRGPN------GGPVSCVR-FNP-RYA--MFVSASSNLVFW  300 (311)
T ss_pred             CCcEEEEEcCCCcEeeEecCCC------CCCccccc-cCC-cee--eeeecCceEEEE
Confidence            5999999999999999997654      45677777 453 122  335566677776


No 143
>PRK00420 hypothetical protein; Validated
Probab=54.60  E-value=60  Score=26.60  Aligned_cols=10  Identities=10%  Similarity=0.132  Sum_probs=5.7

Q ss_pred             ccccccCCcC
Q psy5736         255 KKCESCNSTS  264 (346)
Q Consensus       255 ~~C~~C~~~~  264 (346)
                      ..|+.||...
T Consensus        41 ~~Cp~Cg~~~   50 (112)
T PRK00420         41 VVCPVHGKVY   50 (112)
T ss_pred             eECCCCCCee
Confidence            5566666643


No 144
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=53.41  E-value=13  Score=32.85  Aligned_cols=10  Identities=20%  Similarity=0.667  Sum_probs=5.7

Q ss_pred             ccccccCCcC
Q psy5736         255 KKCESCNSTS  264 (346)
Q Consensus       255 ~~C~~C~~~~  264 (346)
                      ..|+.||..+
T Consensus       137 F~Cp~Cg~~L  146 (178)
T PRK06266        137 FRCPQCGEML  146 (178)
T ss_pred             CcCCCCCCCC
Confidence            5555665554


No 145
>KOG0303|consensus
Probab=53.12  E-value=26  Score=34.92  Aligned_cols=52  Identities=29%  Similarity=0.382  Sum_probs=41.2

Q ss_pred             CcccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736           2 TGLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYR   63 (346)
Q Consensus         2 ~~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r   63 (346)
                      .|-+.+|.|||.++|.-+....        |---|.++.| | .+.+++.+..-|..||+|.
T Consensus       150 ag~Dn~v~iWnv~tgeali~l~--------hpd~i~S~sf-n-~dGs~l~TtckDKkvRv~d  201 (472)
T KOG0303|consen  150 AGSDNTVSIWNVGTGEALITLD--------HPDMVYSMSF-N-RDGSLLCTTCKDKKVRVID  201 (472)
T ss_pred             ccCCceEEEEeccCCceeeecC--------CCCeEEEEEe-c-cCCceeeeecccceeEEEc
Confidence            4678899999999998887643        3446777763 4 6788888899999999993


No 146
>KOG4714|consensus
Probab=53.08  E-value=9.9  Score=35.88  Aligned_cols=54  Identities=20%  Similarity=0.190  Sum_probs=41.8

Q ss_pred             cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736           3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYR   63 (346)
Q Consensus         3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r   63 (346)
                      +-+|.|++|||...+-..+.-      +-|+.+|..+.|.-..++.+ ++.+.||.+=-|.
T Consensus       199 t~dg~~~l~d~rn~~~p~S~l------~ahk~~i~eV~FHpk~p~~L-ft~sedGslw~wd  252 (319)
T KOG4714|consen  199 TDDGIVGLWDARNVAMPVSLL------KAHKAEIWEVHFHPKNPEHL-FTCSEDGSLWHWD  252 (319)
T ss_pred             cCCCeEEEEEcccccchHHHH------HHhhhhhhheeccCCCchhe-eEecCCCcEEEEc
Confidence            347899999998874433322      23889999999999888888 6799999987663


No 147
>PF13719 zinc_ribbon_5:  zinc-ribbon domain
Probab=52.72  E-value=7  Score=25.27  Aligned_cols=33  Identities=24%  Similarity=0.509  Sum_probs=18.6

Q ss_pred             cccCCcCcCCcccccccCCcccccCcccccccCCcC
Q psy5736         229 IRCSNDNCGQIIATVKKLEPSAKKVEKKCESCNSTS  264 (346)
Q Consensus       229 ~~C~~~~C~g~~~~~~~~~~~~~~~~~~C~~C~~~~  264 (346)
                      +.||  +|+..+..+++.-+ .......|++|+...
T Consensus         3 i~CP--~C~~~f~v~~~~l~-~~~~~vrC~~C~~~f   35 (37)
T PF13719_consen    3 ITCP--NCQTRFRVPDDKLP-AGGRKVRCPKCGHVF   35 (37)
T ss_pred             EECC--CCCceEEcCHHHcc-cCCcEEECCCCCcEe
Confidence            4688  88765544432101 123358888888754


No 148
>PF13240 zinc_ribbon_2:  zinc-ribbon domain
Probab=52.38  E-value=8.9  Score=22.17  Aligned_cols=9  Identities=33%  Similarity=0.936  Sum_probs=6.7

Q ss_pred             ccccccCCc
Q psy5736         255 KKCESCNST  263 (346)
Q Consensus       255 ~~C~~C~~~  263 (346)
                      -.|+.||..
T Consensus        14 ~fC~~CG~~   22 (23)
T PF13240_consen   14 KFCPNCGTP   22 (23)
T ss_pred             cchhhhCCc
Confidence            568888875


No 149
>PF13248 zf-ribbon_3:  zinc-ribbon domain
Probab=52.26  E-value=8.7  Score=22.74  Aligned_cols=23  Identities=35%  Similarity=0.811  Sum_probs=14.6

Q ss_pred             cccCCcCcCCcccccccCCcccccCcccccccCCc
Q psy5736         229 IRCSNDNCGQIIATVKKLEPSAKKVEKKCESCNST  263 (346)
Q Consensus       229 ~~C~~~~C~g~~~~~~~~~~~~~~~~~~C~~C~~~  263 (346)
                      ..||  +|+..+.+.          .-.|+.||..
T Consensus         3 ~~Cp--~Cg~~~~~~----------~~fC~~CG~~   25 (26)
T PF13248_consen    3 MFCP--NCGAEIDPD----------AKFCPNCGAK   25 (26)
T ss_pred             CCCc--ccCCcCCcc----------cccChhhCCC
Confidence            4688  887643211          2678889875


No 150
>PF14783 BBS2_Mid:  Ciliary BBSome complex subunit 2, middle region
Probab=51.98  E-value=32  Score=28.12  Aligned_cols=47  Identities=19%  Similarity=0.400  Sum_probs=33.0

Q ss_pred             cceEEEEecCCCceeeeeCCCCCCCCCCCCcccceecccc--CCCceEEeeecCCceEE
Q psy5736           5 FLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNE--HDDSLLLTGYDDGSLSV   61 (346)
Q Consensus         5 ~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine--~~~~lll~g~~dG~vri   61 (346)
                      .|+|.|||=  ..|+=+..        .|.++++|..++.  +...-+.+|+++|.|.+
T Consensus        62 NGTVGvY~~--~~RlWRiK--------SK~~~~~~~~~D~~gdG~~eLI~GwsnGkve~  110 (111)
T PF14783_consen   62 NGTVGVYDR--SQRLWRIK--------SKNQVTSMAFYDINGDGVPELIVGWSNGKVEV  110 (111)
T ss_pred             CCEEEEEeC--cceeeeec--------cCCCeEEEEEEcCCCCCceEEEEEecCCeEEe
Confidence            488999854  44554332        2456899998887  44566789999999864


No 151
>PF08666 SAF:  SAF domain;  InterPro: IPR013974  This entry includes a range of different proteins, such as antifreeze proteins, flagellar FlgA proteins, and CpaB pilus proteins. ; PDB: 1C89_A 3NLA_A 3RDN_A 1C8A_A 3FRN_A 1WVO_A 3K3S_H 3G8R_B 1XUU_A 1XUZ_A ....
Probab=51.60  E-value=9.5  Score=26.94  Aligned_cols=18  Identities=22%  Similarity=0.307  Sum_probs=13.0

Q ss_pred             EEEEeeccCCCCceeecc
Q psy5736         166 IVKALRPLKPKEVVAENY  183 (346)
Q Consensus       166 ~v~a~r~I~~Geel~~~Y  183 (346)
                      ++.|.|+|++|+.|+-.-
T Consensus         3 vvVA~~di~~G~~i~~~d   20 (63)
T PF08666_consen    3 VVVAARDIPAGTVITAED   20 (63)
T ss_dssp             EEEESSTB-TT-BECTTT
T ss_pred             EEEEeCccCCCCEEccCC
Confidence            578999999999996544


No 152
>COG2888 Predicted Zn-ribbon RNA-binding protein with a function in translation [Translation, ribosomal structure and biogenesis]
Probab=51.56  E-value=13  Score=26.82  Aligned_cols=49  Identities=16%  Similarity=0.392  Sum_probs=25.7

Q ss_pred             ecCCCCCCCccccccccCccccccCCcCcCCcccccccCCcccccCcccccccCCc
Q psy5736         208 KCRACVENWPLMESLEKYPIRIRCSNDNCGQIIATVKKLEPSAKKVEKKCESCNST  263 (346)
Q Consensus       208 ~C~~C~~~~~~~~~~~~~~~~~~C~~~~C~g~~~~~~~~~~~~~~~~~~C~~C~~~  263 (346)
                      .|..|-..-+    .+.....|.||  +|+..+...-..-- .......|++||..
T Consensus        11 ~CtSCg~~i~----p~e~~v~F~CP--nCGe~~I~Rc~~CR-k~g~~Y~Cp~CGF~   59 (61)
T COG2888          11 VCTSCGREIA----PGETAVKFPCP--NCGEVEIYRCAKCR-KLGNPYRCPKCGFE   59 (61)
T ss_pred             eeccCCCEec----cCCceeEeeCC--CCCceeeehhhhHH-HcCCceECCCcCcc
Confidence            4666765431    12335788999  89854432210000 00124788888863


No 153
>PF04810 zf-Sec23_Sec24:  Sec23/Sec24 zinc finger;  InterPro: IPR006895 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  COPII (coat protein complex II)-coated vesicles carry proteins from the endoplasmic reticulum (ER) to the Golgi complex []. COPII-coated vesicles form on the ER by the stepwise recruitment of three cytosolic components: Sar1-GTP to initiate coat formation, Sec23/24 heterodimer to select SNARE and cargo molecules, and Sec13/31 to induce coat polymerisation and membrane deformation [].  Sec23 p and Sec24p are structurally related, folding into five distinct domains: a beta-barrel, a zinc-finger, an alpha/beta trunk domain (IPR006896 from INTERPRO), an all-helical region (IPR006900 from INTERPRO), and a C-terminal gelsolin-like domain (IPR007123 from INTERPRO). This entry describes an approximately 55-residue Sec23/24 zinc-binding domain, which lies against the beta-barrel at the periphery of the complex. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0006886 intracellular protein transport, 0006888 ER to Golgi vesicle-mediated transport, 0030127 COPII vesicle coat; PDB: 3EFO_B 3EG9_B 3EGD_A 2YRC_A 2NUP_A 2YRD_A 3EGX_A 2NUT_A 3EH1_A 1PD0_A ....
Probab=51.27  E-value=6.5  Score=25.87  Aligned_cols=32  Identities=16%  Similarity=0.509  Sum_probs=16.1

Q ss_pred             cccCCcCcCCcccccccCCcccccCcccccccCCcC
Q psy5736         229 IRCSNDNCGQIIATVKKLEPSAKKVEKKCESCNSTS  264 (346)
Q Consensus       229 ~~C~~~~C~g~~~~~~~~~~~~~~~~~~C~~C~~~~  264 (346)
                      ++|.  .|++.+.|--..+.  ....|.|.-|+..-
T Consensus         3 ~rC~--~C~aylNp~~~~~~--~~~~w~C~~C~~~N   34 (40)
T PF04810_consen    3 VRCR--RCRAYLNPFCQFDD--GGKTWICNFCGTKN   34 (40)
T ss_dssp             -B-T--TT--BS-TTSEEET--TTTEEEETTT--EE
T ss_pred             cccC--CCCCEECCcceEcC--CCCEEECcCCCCcC
Confidence            5788  89988876543211  22469999998853


No 154
>KOG1587|consensus
Probab=51.22  E-value=25  Score=36.84  Aligned_cols=55  Identities=20%  Similarity=0.241  Sum_probs=40.0

Q ss_pred             cceEEEEecCCCcee--eeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736           5 FLKEGIWDHITGSRL--ISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY   62 (346)
Q Consensus         5 ~~~~~~w~~~~~~~~--~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~   62 (346)
                      .|+|-+||-..+.+.  ..-+..-.   -|...|+.+.+++.....=..++++||.|..|
T Consensus       264 ~GqV~lWD~~~~~~~~~s~ls~~~~---sh~~~v~~vvW~~~~~~~~f~s~ssDG~i~~W  320 (555)
T KOG1587|consen  264 NGQVVLWDLRKGSDTPPSGLSALEV---SHSEPVTAVVWLQNEHNTEFFSLSSDGSICSW  320 (555)
T ss_pred             CceEEEEEccCCCCCCCcccccccc---cCCcCeEEEEEeccCCCCceEEEecCCcEeee
Confidence            699999999988773  22222222   36678999998887433445778999999999


No 155
>KOG0299|consensus
Probab=51.00  E-value=28  Score=35.18  Aligned_cols=60  Identities=17%  Similarity=0.207  Sum_probs=40.9

Q ss_pred             ccceEEEEecCCCceeeeeCCCCCCCCCC--C---CcccceeccccCCCceEEeeecCCceEEEeCC
Q psy5736           4 LFLKEGIWDHITGSRLISWRPDRGSWARN--S---ARCTALTLLNEHDDSLLLTGYDDGSLSVYRNY   65 (346)
Q Consensus         4 ~~~~~~~w~~~~~~~~~~~~~~~~~w~~~--~---~~~t~l~~ine~~~~lll~g~~dG~vri~r~~   65 (346)
                      -.|+|.+|+.-+.+.+..+.+..+--+.+  .   .=||+|-.|  ....|+.+|+-+|.||+|+--
T Consensus       346 dnG~IaLWs~~KKkplf~~~~AHgv~~~~~~~~~~~Witsla~i--~~sdL~asGS~~G~vrLW~i~  410 (479)
T KOG0299|consen  346 DNGSIALWSLLKKKPLFTSRLAHGVIPELDPVNGNFWITSLAVI--PGSDLLASGSWSGCVRLWKIE  410 (479)
T ss_pred             CCceEEEeeecccCceeEeeccccccCCccccccccceeeeEec--ccCceEEecCCCCceEEEEec
Confidence            46899999999999988776544321111  0   024555544  456788899999999999644


No 156
>smart00661 RPOL9 RNA polymerase subunit 9.
Probab=50.99  E-value=12  Score=25.65  Aligned_cols=30  Identities=20%  Similarity=0.538  Sum_probs=19.5

Q ss_pred             ccCCcCcCCcccccccCCcccccCcccccccCCcCC
Q psy5736         230 RCSNDNCGQIIATVKKLEPSAKKVEKKCESCNSTSD  265 (346)
Q Consensus       230 ~C~~~~C~g~~~~~~~~~~~~~~~~~~C~~C~~~~~  265 (346)
                      .||  .|++.+.+....    ....+.|+.||....
T Consensus         2 FCp--~Cg~~l~~~~~~----~~~~~vC~~Cg~~~~   31 (52)
T smart00661        2 FCP--KCGNMLIPKEGK----EKRRFVCRKCGYEEP   31 (52)
T ss_pred             CCC--CCCCccccccCC----CCCEEECCcCCCeEE
Confidence            488  898877665321    112589999998543


No 157
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=50.92  E-value=8.8  Score=33.28  Aligned_cols=10  Identities=20%  Similarity=0.597  Sum_probs=5.8

Q ss_pred             ccccccCCcC
Q psy5736         255 KKCESCNSTS  264 (346)
Q Consensus       255 ~~C~~C~~~~  264 (346)
                      ..|+.||..+
T Consensus       129 F~Cp~Cg~~L  138 (158)
T TIGR00373       129 FTCPRCGAML  138 (158)
T ss_pred             CcCCCCCCEe
Confidence            5566666654


No 158
>KOG2445|consensus
Probab=50.67  E-value=29  Score=33.55  Aligned_cols=34  Identities=21%  Similarity=0.260  Sum_probs=28.4

Q ss_pred             CCcccceeccccCCCceEEeeecCCceEEEeCCc
Q psy5736          33 SARCTALTLLNEHDDSLLLTGYDDGSLSVYRNYA   66 (346)
Q Consensus        33 ~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~~   66 (346)
                      .+.+++++|--.+-+-++.+++.||.+|||...+
T Consensus       112 rssV~DV~FaP~hlGLklA~~~aDG~lRIYEA~d  145 (361)
T KOG2445|consen  112 RSSVTDVKFAPKHLGLKLAAASADGILRIYEAPD  145 (361)
T ss_pred             CcceeEEEecchhcceEEEEeccCcEEEEEecCC
Confidence            4578999987778888888999999999997654


No 159
>KOG2096|consensus
Probab=50.64  E-value=14  Score=35.89  Aligned_cols=42  Identities=24%  Similarity=0.396  Sum_probs=32.6

Q ss_pred             CCCCCcccceeccccCCCceEEeeecCCceEEE--eCCccccchhh
Q psy5736          30 ARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY--RNYASHEHRLV   73 (346)
Q Consensus        30 ~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~--r~~~sh~~~Lv   73 (346)
                      +-|+.+||+|.|-  .|+..+.+.+.|+.||+|  ++|+..+++.+
T Consensus        83 KgH~~~vt~~~Fs--SdGK~lat~~~Dr~Ir~w~~~DF~~~eHr~~  126 (420)
T KOG2096|consen   83 KGHKKEVTDVAFS--SDGKKLATISGDRSIRLWDVRDFENKEHRCI  126 (420)
T ss_pred             hccCCceeeeEEc--CCCceeEEEeCCceEEEEecchhhhhhhhHh
Confidence            4588899999864  466777999999999999  78876555443


No 160
>KOG0299|consensus
Probab=50.28  E-value=26  Score=35.42  Aligned_cols=52  Identities=19%  Similarity=0.370  Sum_probs=41.3

Q ss_pred             cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCC
Q psy5736           3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNY   65 (346)
Q Consensus         3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~   65 (346)
                      |-+.++.+|+..+..+|.+..+        ...+-+..+||+.  - .++|+++|.|-+|...
T Consensus       305 grDrT~rlwKi~eesqlifrg~--------~~sidcv~~In~~--H-fvsGSdnG~IaLWs~~  356 (479)
T KOG0299|consen  305 GRDRTVRLWKIPEESQLIFRGG--------EGSIDCVAFINDE--H-FVSGSDNGSIALWSLL  356 (479)
T ss_pred             cccceeEEEeccccceeeeeCC--------CCCeeeEEEeccc--c-eeeccCCceEEEeeec
Confidence            5789999999999999987543        3457788888853  2 4779999999999665


No 161
>TIGR02098 MJ0042_CXXC MJ0042 family finger-like domain. This domain contains a CXXCX(19)CXXC motif suggestive of both zinc fingers and thioredoxin, usually found at the N-terminus of prokaryotic proteins. One partially characterized gene, agmX, is among a large set in Myxococcus whose interruption affects adventurous gliding motility.
Probab=49.83  E-value=7.7  Score=24.87  Aligned_cols=33  Identities=24%  Similarity=0.490  Sum_probs=17.4

Q ss_pred             cccCCcCcCCcccccccCCcccccCcccccccCCcC
Q psy5736         229 IRCSNDNCGQIIATVKKLEPSAKKVEKKCESCNSTS  264 (346)
Q Consensus       229 ~~C~~~~C~g~~~~~~~~~~~~~~~~~~C~~C~~~~  264 (346)
                      +.||  +|+..+....+.-+ .......|++|+..+
T Consensus         3 ~~CP--~C~~~~~v~~~~~~-~~~~~v~C~~C~~~~   35 (38)
T TIGR02098         3 IQCP--NCKTSFRVVDSQLG-ANGGKVRCGKCGHVW   35 (38)
T ss_pred             EECC--CCCCEEEeCHHHcC-CCCCEEECCCCCCEE
Confidence            5677  78765433321100 112247888888764


No 162
>smart00745 MIT Microtubule Interacting and Trafficking molecule domain.
Probab=49.68  E-value=1e+02  Score=22.67  Aligned_cols=38  Identities=21%  Similarity=0.294  Sum_probs=29.8

Q ss_pred             HHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHhhccc
Q psy5736         273 LSELNEMFYRGIEQMNTSCFREAVESLTKFSDQIHELV  310 (346)
Q Consensus       273 l~~~~~~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l  310 (346)
                      +.++..+..+|++.=+.|++++|+..|.+.++.+...+
T Consensus         5 ~~~A~~li~~Av~~d~~g~~~eAl~~Y~~a~e~l~~~~   42 (77)
T smart00745        5 LSKAKELISKALKADEAGDYEEALELYKKAIEYLLEGI   42 (77)
T ss_pred             HHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHh
Confidence            34555666677766668999999999999999888855


No 163
>PRK02603 photosystem I assembly protein Ycf3; Provisional
Probab=49.61  E-value=73  Score=27.23  Aligned_cols=50  Identities=12%  Similarity=0.104  Sum_probs=24.5

Q ss_pred             HHHHHHHHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCC
Q psy5736         279 MFYRGIEQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGN  333 (346)
Q Consensus       279 ~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~  333 (346)
                      .+..+......|++++|+..+++.++...     .+.....+...++.+|..+|+
T Consensus        38 ~~~lg~~~~~~g~~~~A~~~~~~al~~~~-----~~~~~~~~~~~la~~~~~~g~   87 (172)
T PRK02603         38 YYRDGMSAQADGEYAEALENYEEALKLEE-----DPNDRSYILYNMGIIYASNGE   87 (172)
T ss_pred             HHHHHHHHHHcCCHHHHHHHHHHHHHHhh-----ccchHHHHHHHHHHHHHHcCC
Confidence            34444444555666666666666655421     111123344555555555554


No 164
>KOG0318|consensus
Probab=49.54  E-value=23  Score=36.47  Aligned_cols=17  Identities=24%  Similarity=0.436  Sum_probs=12.7

Q ss_pred             ccceEEEEecCCCceee
Q psy5736           4 LFLKEGIWDHITGSRLI   20 (346)
Q Consensus         4 ~~~~~~~w~~~~~~~~~   20 (346)
                      ..|+|.||||....++-
T Consensus        79 ~sG~vRIWdtt~~~hiL   95 (603)
T KOG0318|consen   79 VSGKVRIWDTTQKEHIL   95 (603)
T ss_pred             CcCcEEEEeccCcceee
Confidence            46899999998755544


No 165
>KOG0318|consensus
Probab=49.42  E-value=37  Score=35.03  Aligned_cols=56  Identities=21%  Similarity=0.219  Sum_probs=48.5

Q ss_pred             CcccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736           2 TGLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYR   63 (346)
Q Consensus         2 ~~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r   63 (346)
                      +|-+|+|.|||=.+|.++..+..+.+    |+.-|-+|-+.  +|..-+++.+.|..+|+|.
T Consensus       208 ~gsDgki~iyDGktge~vg~l~~~~a----HkGsIfalsWs--PDs~~~~T~SaDkt~KIWd  263 (603)
T KOG0318|consen  208 AGSDGKIYIYDGKTGEKVGELEDSDA----HKGSIFALSWS--PDSTQFLTVSADKTIKIWD  263 (603)
T ss_pred             ecCCccEEEEcCCCccEEEEecCCCC----ccccEEEEEEC--CCCceEEEecCCceEEEEE
Confidence            58899999999999999999987654    89999998863  5667778899999999994


No 166
>COG2956 Predicted N-acetylglucosaminyl transferase [Carbohydrate transport and metabolism]
Probab=49.10  E-value=36  Score=33.32  Aligned_cols=45  Identities=16%  Similarity=0.180  Sum_probs=32.8

Q ss_pred             ccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCCc
Q psy5736         290 SCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGNK  334 (346)
Q Consensus       290 ~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~~  334 (346)
                      |+..-+...+++.++.+..++-.++-.+.++-+.|..||..+|+.
T Consensus       221 G~v~~~~g~y~~AV~~~e~v~eQn~~yl~evl~~L~~~Y~~lg~~  265 (389)
T COG2956         221 GRVELAKGDYQKAVEALERVLEQNPEYLSEVLEMLYECYAQLGKP  265 (389)
T ss_pred             hHHHHhccchHHHHHHHHHHHHhChHHHHHHHHHHHHHHHHhCCH
Confidence            444444444556666666666678888999999999999999874


No 167
>KOG0286|consensus
Probab=48.48  E-value=32  Score=33.01  Aligned_cols=53  Identities=19%  Similarity=0.394  Sum_probs=41.3

Q ss_pred             cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeC
Q psy5736           3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRN   64 (346)
Q Consensus         3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~   64 (346)
                      |-+..-.+||..++.-...|.       -|...|.+++|+- ..++. .+|++|+..|+|..
T Consensus       206 ~cD~~aklWD~R~~~c~qtF~-------ghesDINsv~ffP-~G~af-atGSDD~tcRlyDl  258 (343)
T KOG0286|consen  206 GCDKSAKLWDVRSGQCVQTFE-------GHESDINSVRFFP-SGDAF-ATGSDDATCRLYDL  258 (343)
T ss_pred             ccccceeeeeccCcceeEeec-------ccccccceEEEcc-CCCee-eecCCCceeEEEee
Confidence            457788999999997777773       3667899999874 33444 78999999999943


No 168
>KOG2111|consensus
Probab=48.44  E-value=45  Score=32.34  Aligned_cols=54  Identities=20%  Similarity=0.288  Sum_probs=41.6

Q ss_pred             EEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCCccc
Q psy5736           8 EGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNYASH   68 (346)
Q Consensus         8 ~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~~sh   68 (346)
                      |+|||-++|.++.-|.-|-.     ...|-+|.|  .++.+++.+.|+-|.+.+|......
T Consensus       206 IRIFdt~~g~~l~E~RRG~d-----~A~iy~iaF--Sp~~s~LavsSdKgTlHiF~l~~~~  259 (346)
T KOG2111|consen  206 IRIFDTEDGTLLQELRRGVD-----RADIYCIAF--SPNSSWLAVSSDKGTLHIFSLRDTE  259 (346)
T ss_pred             EEEEEcCCCcEeeeeecCCc-----hheEEEEEe--CCCccEEEEEcCCCeEEEEEeecCC
Confidence            79999999999999876653     345555442  2788888989999999999766543


No 169
>PF10571 UPF0547:  Uncharacterised protein family UPF0547;  InterPro: IPR018886  This domain may well be a type of zinc-finger as it carries two pairs of highly conserved cysteine residues though with no accompanying histidines. Several members are annotated as putative helicases. 
Probab=48.17  E-value=13  Score=22.29  Aligned_cols=23  Identities=26%  Similarity=0.696  Sum_probs=14.5

Q ss_pred             ccCCcCcCCcccccccCCcccccCcccccccCCcC
Q psy5736         230 RCSNDNCGQIIATVKKLEPSAKKVEKKCESCNSTS  264 (346)
Q Consensus       230 ~C~~~~C~g~~~~~~~~~~~~~~~~~~C~~C~~~~  264 (346)
                      .||  .|+..+...          .-.|+.||...
T Consensus         2 ~CP--~C~~~V~~~----------~~~Cp~CG~~F   24 (26)
T PF10571_consen    2 TCP--ECGAEVPES----------AKFCPHCGYDF   24 (26)
T ss_pred             cCC--CCcCCchhh----------cCcCCCCCCCC
Confidence            477  787665322          26788888764


No 170
>PF09577 Spore_YpjB:  Sporulation protein YpjB (SpoYpjB);  InterPro: IPR014231 Proteins in thie entry, typified by YpjB, are restricted to a subset of the endospore-forming bacteria which includes Bacillus species, but not species. In Bacillus subtilis, ypjB was found to be part of the sigma-E regulon []. Sigma-E is a sporulation sigma factor that regulates expression in the mother cell compartment. Null mutants of ypjB show a sporulation defect, but this gene is not, however, a part of the endospore formation minimal gene set.
Probab=47.73  E-value=43  Score=31.02  Aligned_cols=57  Identities=21%  Similarity=0.217  Sum_probs=38.2

Q ss_pred             HHHHHHHHHHHHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHh
Q psy5736         275 ELNEMFYRGIEQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLA  331 (346)
Q Consensus       275 ~~~~~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~  331 (346)
                      ++..+-+++.++.++|++++|.+.++.+-+.|...-...-.....-..+|...|-..
T Consensus         4 eLd~~sd~~lqlvk~~~yeeA~q~l~~fs~~f~~~~~~~~~~t~e~iralT~t~~~a   60 (232)
T PF09577_consen    4 ELDQLSDEALQLVKQGKYEEAKQLLEYFSEQFTSVDFKGRPLTMEEIRALTETIEEA   60 (232)
T ss_pred             HHHHHHHHHHHHHHcccHHHHHHHHHHHHHHHhhccccccccCHHHHHHHHHHHHHH
Confidence            445555677888999999999999999998887754443323333444455544443


No 171
>PF13181 TPR_8:  Tetratricopeptide repeat; PDB: 3GW4_B 3MA5_C 2KCV_A 2KCL_A 3FP3_A 3LCA_A 3FP4_A 3FP2_A 1W3B_B 1ELW_A ....
Probab=47.36  E-value=51  Score=19.63  Aligned_cols=26  Identities=19%  Similarity=0.068  Sum_probs=19.7

Q ss_pred             HHHHHHhccccHHHHHHHHHHHHHHh
Q psy5736         281 YRGIEQMNTSCFREAVESLTKFSDQI  306 (346)
Q Consensus       281 ~~a~~~~~~~~~~~ai~~l~~~l~~~  306 (346)
                      ..+.-....|++++|++.+++.++.-
T Consensus         6 ~lg~~y~~~~~~~~A~~~~~~a~~~~   31 (34)
T PF13181_consen    6 NLGKIYEQLGDYEEALEYFEKALELN   31 (34)
T ss_dssp             HHHHHHHHTTSHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHcCCHHHHHHHHHHHHhhC
Confidence            33444566789999999999998764


No 172
>TIGR03302 OM_YfiO outer membrane assembly lipoprotein YfiO. Members of this protein family include YfiO, a near-essential protein of the outer membrane, part of a complex involved in protein insertion into the bacterial outer membrane. Many proteins in this family are annotated as ComL, based on the involvement of this protein in natural transformation with exogenous DNA in Neisseria gonorrhoeae. This protein family shows sequence similarity to, but is distinct from, the tol-pal system protein YbgF (TIGR02795).
Probab=47.21  E-value=50  Score=29.44  Aligned_cols=48  Identities=13%  Similarity=0.104  Sum_probs=40.1

Q ss_pred             HHHHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCCcc
Q psy5736         283 GIEQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGNKW  335 (346)
Q Consensus       283 a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~~~  335 (346)
                      |.-.+..|++.+|+..++++++.+     |.++....+...+...|..+|+.-
T Consensus       173 a~~~~~~g~~~~A~~~~~~al~~~-----p~~~~~~~a~~~l~~~~~~lg~~~  220 (235)
T TIGR03302       173 ARFYLKRGAYVAAINRFETVVENY-----PDTPATEEALARLVEAYLKLGLKD  220 (235)
T ss_pred             HHHHHHcCChHHHHHHHHHHHHHC-----CCCcchHHHHHHHHHHHHHcCCHH
Confidence            344577899999999999999887     556778889999999999999863


No 173
>PF14561 TPR_20:  Tetratricopeptide repeat; PDB: 3QOU_A 2R5S_A 3QDN_B.
Probab=47.13  E-value=58  Score=25.27  Aligned_cols=51  Identities=14%  Similarity=0.220  Sum_probs=29.3

Q ss_pred             HHHHHHHHHHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCC
Q psy5736         277 NEMFYRGIEQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGN  333 (346)
Q Consensus       277 ~~~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~  333 (346)
                      +..+.-|...+..|++++|++.+.++++.-..+      .=..++..|..+|..+|.
T Consensus        23 ~ar~~lA~~~~~~g~~e~Al~~Ll~~v~~dr~~------~~~~ar~~ll~~f~~lg~   73 (90)
T PF14561_consen   23 DARYALADALLAAGDYEEALDQLLELVRRDRDY------EDDAARKRLLDIFELLGP   73 (90)
T ss_dssp             HHHHHHHHHHHHTT-HHHHHHHHHHHHCC-TTC------CCCHHHHHHHHHHHHH-T
T ss_pred             HHHHHHHHHHHHCCCHHHHHHHHHHHHHhCccc------cccHHHHHHHHHHHHcCC
Confidence            445566777788899999988887777543221      123345555555555544


No 174
>KOG0315|consensus
Probab=46.41  E-value=30  Score=32.57  Aligned_cols=50  Identities=16%  Similarity=0.351  Sum_probs=36.2

Q ss_pred             cceEEEEecCCCce--eeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736           5 FLKEGIWDHITGSR--LISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYR   63 (346)
Q Consensus         5 ~~~~~~w~~~~~~~--~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r   63 (346)
                      --+|.++|..++..  +-.|       .-|.+.++++-|-  .|.....+|+.||.+|||.
T Consensus        60 ~qhvRlyD~~S~np~Pv~t~-------e~h~kNVtaVgF~--~dgrWMyTgseDgt~kIWd  111 (311)
T KOG0315|consen   60 NQHVRLYDLNSNNPNPVATF-------EGHTKNVTAVGFQ--CDGRWMYTGSEDGTVKIWD  111 (311)
T ss_pred             CCeeEEEEccCCCCCceeEE-------eccCCceEEEEEe--ecCeEEEecCCCceEEEEe
Confidence            34788999887743  2233       3466678887753  5677778899999999993


No 175
>PF09976 TPR_21:  Tetratricopeptide repeat;  InterPro: IPR018704  This domain, found in various hypothetical prokaryotic proteins, has no known function. 
Probab=45.95  E-value=1.5e+02  Score=24.43  Aligned_cols=49  Identities=20%  Similarity=0.142  Sum_probs=32.9

Q ss_pred             HHHHHHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCCc
Q psy5736         281 YRGIEQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGNK  334 (346)
Q Consensus       281 ~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~~  334 (346)
                      ..|...++.|++++|+..|.+++...     |.......++..|++++...|+.
T Consensus        53 ~lA~~~~~~g~~~~A~~~l~~~~~~~-----~d~~l~~~a~l~LA~~~~~~~~~  101 (145)
T PF09976_consen   53 QLAKAAYEQGDYDEAKAALEKALANA-----PDPELKPLARLRLARILLQQGQY  101 (145)
T ss_pred             HHHHHHHHCCCHHHHHHHHHHHHhhC-----CCHHHHHHHHHHHHHHHHHcCCH
Confidence            34445567788888877777777633     44455666777788888777654


No 176
>TIGR02059 swm_rep_I cyanobacterial long protein repeat. This domain appears in 29 copies in a large (10000 amino protein in Synechococcus sp. WH8102 associated with a novel flagellar system, as one of three different repeats. Similar domains are found in two different large (<3500) proteins of Synechocystis PCC6803.
Probab=45.82  E-value=32  Score=27.61  Aligned_cols=23  Identities=13%  Similarity=0.174  Sum_probs=21.4

Q ss_pred             CEEEEEEeeccCCCCceeecccc
Q psy5736         163 KNIIVKALRPLKPKEVVAENYGL  185 (346)
Q Consensus       163 ~~~~v~a~r~I~~Geel~~~Y~~  185 (346)
                      +++.+.-.++|+.|++++++|.+
T Consensus        75 ktVTLTL~~~V~~Gq~VTVsYt~   97 (101)
T TIGR02059        75 TTITLTLAQVVEDGDEVTLSYTK   97 (101)
T ss_pred             cEEEEEecccccCCCEEEEEeeC
Confidence            38999999999999999999986


No 177
>KOG0772|consensus
Probab=44.18  E-value=29  Score=35.75  Aligned_cols=56  Identities=25%  Similarity=0.351  Sum_probs=36.1

Q ss_pred             CcccceEEEEecCCC-ceeeeeCCCCCCCCCCCCc--ccceeccccCCCceEEeeecCCceEEEe
Q psy5736           2 TGLFLKEGIWDHITG-SRLISWRPDRGSWARNSAR--CTALTLLNEHDDSLLLTGYDDGSLSVYR   63 (346)
Q Consensus         2 ~~~~~~~~~w~~~~~-~~~~~~~~~~~~w~~~~~~--~t~l~~ine~~~~lll~g~~dG~vri~r   63 (346)
                      ++.+|++.|||.++. .++..|...-.    .+.|  .|+.. .| .|..+..+|-.||.|.+|.
T Consensus       287 ~s~DgtlRiWdv~~~k~q~qVik~k~~----~g~Rv~~tsC~-~n-rdg~~iAagc~DGSIQ~W~  345 (641)
T KOG0772|consen  287 CSYDGTLRIWDVNNTKSQLQVIKTKPA----GGKRVPVTSCA-WN-RDGKLIAAGCLDGSIQIWD  345 (641)
T ss_pred             ecCCCcEEEEecCCchhheeEEeeccC----CCcccCceeee-cC-CCcchhhhcccCCceeeee
Confidence            478999999999765 44545543221    1233  24433 34 4555568888999999995


No 178
>PF07754 DUF1610:  Domain of unknown function (DUF1610);  InterPro: IPR011668 This domain is found in archaeal species. It is likely to bind zinc via its four well-conserved cysteine residues.
Probab=43.90  E-value=18  Score=21.27  Aligned_cols=9  Identities=22%  Similarity=0.615  Sum_probs=5.3

Q ss_pred             cccccccCC
Q psy5736         254 EKKCESCNS  262 (346)
Q Consensus       254 ~~~C~~C~~  262 (346)
                      ...|+.||+
T Consensus        16 ~f~CPnCG~   24 (24)
T PF07754_consen   16 PFPCPNCGF   24 (24)
T ss_pred             eEeCCCCCC
Confidence            366666663


No 179
>KOG0319|consensus
Probab=43.38  E-value=36  Score=36.38  Aligned_cols=50  Identities=16%  Similarity=0.228  Sum_probs=35.9

Q ss_pred             ccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736           4 LFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY   62 (346)
Q Consensus         4 ~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~   62 (346)
                      -+.||.||.-++..-+-.|.-       |..-+--..|++  .+.-++.+.+||.+++|
T Consensus       525 gD~TvKIW~is~fSClkT~eG-------H~~aVlra~F~~--~~~qliS~~adGliKlW  574 (775)
T KOG0319|consen  525 GDKTVKIWSISTFSCLKTFEG-------HTSAVLRASFIR--NGKQLISAGADGLIKLW  574 (775)
T ss_pred             CCceEEEEEeccceeeeeecC-------ccceeEeeeeee--CCcEEEeccCCCcEEEE
Confidence            478999999998888877752       222333334444  55566889999999999


No 180
>KOG0293|consensus
Probab=43.14  E-value=23  Score=35.53  Aligned_cols=55  Identities=9%  Similarity=0.208  Sum_probs=39.4

Q ss_pred             cceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCC
Q psy5736           5 FLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNY   65 (346)
Q Consensus         5 ~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~   65 (346)
                      .-.|.+||.+..+.+-+|.-++.+   |- -|-+  -++-.++...+.||.|+.|.||..-
T Consensus       416 ~qei~LWDl~e~~lv~kY~Ghkq~---~f-iIrS--CFgg~~~~fiaSGSED~kvyIWhr~  470 (519)
T KOG0293|consen  416 DQEIHLWDLEENKLVRKYFGHKQG---HF-IIRS--CFGGGNDKFIASGSEDSKVYIWHRI  470 (519)
T ss_pred             cCeeEEeecchhhHHHHhhccccc---ce-EEEe--ccCCCCcceEEecCCCceEEEEEcc
Confidence            346899999988777777766542   11 1222  1456888999999999999999443


No 181
>PF02150 RNA_POL_M_15KD:  RNA polymerases M/15 Kd subunit;  InterPro: IPR001529 DNA-directed RNA polymerases 2.7.7.6 from EC (also known as DNA-dependent RNA polymerases) are responsible for the polymerisation of ribonucleotides into a sequence complementary to the template DNA. In eukaryotes, there are three different forms of DNA-directed RNA polymerases transcribing different sets of genes. Most RNA polymerases are multimeric enzymes and are composed of a variable number of subunits. The core RNA polymerase complex consists of five subunits (two alpha, one beta, one beta-prime and one omega) and is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme []. The core RNA polymerase complex forms a "crab claw"-like structure with an internal channel running along the full length []. The key functional sites of the enzyme, as defined by mutational and cross-linking analysis, are located on the inner wall of this channel. RNA synthesis follows after the attachment of RNA polymerase to a specific site, the promoter, on the template DNA strand. The RNA synthesis process continues until a termination sequence is reached. The RNA product, which is synthesised in the 5' to 3'direction, is known as the primary transcript. Eukaryotic nuclei contain three distinct types of RNA polymerases that differ in the RNA they synthesise:  RNA polymerase I: located in the nucleoli, synthesises precursors of most ribosomal RNAs. RNA polymerase II: occurs in the nucleoplasm, synthesises mRNA precursors.  RNA polymerase III: also occurs in the nucleoplasm, synthesises the precursors of 5S ribosomal RNA, the tRNAs, and a variety of other small nuclear and cytosolic RNAs.   Eukaryotic cells are also known to contain separate mitochondrial and chloroplast RNA polymerases. Eukaryotic RNA polymerases, whose molecular masses vary in size from 500 to 700 kDa, contain two non-identical large (>100 kDa) subunits and an array of up to 12 different small (less than 50 kDa) subunits. In archaebacteria, there is generally a single form of RNA polymerase which also consist of an oligomeric assemblage of 10 to 13 polypeptides. It has recently been shown [], [] that small subunits of about 15 kDa, found in polymerase types I and II, are highly conserved. These proteins contain a probable zinc finger in their N-terminal region and a C-terminal zinc ribbon domain (see IPR001222 from INTERPRO).; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent; PDB: 3H0G_I 3M4O_I 3S14_I 2E2J_I 4A3J_I 3HOZ_I 1TWA_I 3S1Q_I 3S1N_I 1TWG_I ....
Probab=43.08  E-value=8.4  Score=24.68  Aligned_cols=30  Identities=23%  Similarity=0.672  Sum_probs=19.7

Q ss_pred             ccCCcCcCCcccccccCCcccccCcccccccCCcCCh
Q psy5736         230 RCSNDNCGQIIATVKKLEPSAKKVEKKCESCNSTSDL  266 (346)
Q Consensus       230 ~C~~~~C~g~~~~~~~~~~~~~~~~~~C~~C~~~~~~  266 (346)
                      .||  .|++.+.|..+..    . ...|..|+.....
T Consensus         3 FCp--~C~nlL~p~~~~~----~-~~~C~~C~Y~~~~   32 (35)
T PF02150_consen    3 FCP--ECGNLLYPKEDKE----K-RVACRTCGYEEPI   32 (35)
T ss_dssp             BET--TTTSBEEEEEETT----T-TEEESSSS-EEE-
T ss_pred             eCC--CCCccceEcCCCc----c-CcCCCCCCCccCC
Confidence            588  9999998875421    1 1269999987544


No 182
>PF11817 Foie-gras_1:  Foie gras liver health family 1;  InterPro: IPR021773  Mutating the gene foie gras in zebrafish has been shown to affect development; the mutants develop large, lipid-filled hepatocytes in the liver, resembling those in individuals with fatty liver disease []. Foie-gras protein is long and has several well-defined domains though none of them has a known function. We have annotated this one as the first []. THe C terminus of this region contains TPR repeats. 
Probab=43.07  E-value=82  Score=29.06  Aligned_cols=51  Identities=14%  Similarity=0.065  Sum_probs=42.5

Q ss_pred             HHHHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCCcc
Q psy5736         283 GIEQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGNKW  335 (346)
Q Consensus       283 a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~~~  335 (346)
                      |.+.+..|++++|+++|+.+...|.+-  ..+..+..+...|.+|+..+|+.-
T Consensus       185 A~ey~~~g~~~~A~~~l~~~~~~yr~e--gW~~l~~~~l~~l~~Ca~~~~~~~  235 (247)
T PF11817_consen  185 AEEYFRLGDYDKALKLLEPAASSYRRE--GWWSLLTEVLWRLLECAKRLGDVE  235 (247)
T ss_pred             HHHHHHCCCHHHHHHHHHHHHHHHHhC--CcHHHHHHHHHHHHHHHHHhCCHH
Confidence            445577899999999999998888753  566788899999999999999863


No 183
>KOG0289|consensus
Probab=42.79  E-value=47  Score=33.49  Aligned_cols=56  Identities=18%  Similarity=0.141  Sum_probs=40.9

Q ss_pred             cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCC
Q psy5736           3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNY   65 (346)
Q Consensus         3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~   65 (346)
                      ..+|++..-|.++|..|+.-+-..     ...++|+..|+  +|+-++-+|..||++++|..-
T Consensus       322 s~d~~w~Fsd~~~g~~lt~vs~~~-----s~v~~ts~~fH--pDgLifgtgt~d~~vkiwdlk  377 (506)
T KOG0289|consen  322 SNDGTWAFSDISSGSQLTVVSDET-----SDVEYTSAAFH--PDGLIFGTGTPDGVVKIWDLK  377 (506)
T ss_pred             cCCceEEEEEccCCcEEEEEeecc-----ccceeEEeeEc--CCceEEeccCCCceEEEEEcC
Confidence            357889999999999999765431     23567887764  345555667789999999544


No 184
>cd00350 rubredoxin_like Rubredoxin_like; nonheme iron binding domain containing a [Fe(SCys)4] center. The family includes rubredoxins, a small electron transfer protein, and a slightly smaller modular rubredoxin domain present in rubrerythrin and nigerythrin and detected either N- or C-terminal to such proteins as flavin reductase, NAD(P)H-nitrite reductase, and ferredoxin-thioredoxin reductase. In rubredoxin, the iron atom is coordinated by four cysteine residues (Fe(S-Cys)4), but iron can also be replaced by cobalt, nickel or zinc and believed to be involved in electron transfer.  Rubrerythrins and nigerythrins are small homodimeric proteins, generally consisting of 2 domains: a rubredoxin domain C-terminal to a non-sulfur, oxo-bridged diiron site in the N-terminal rubrerythrin domain.  Rubrerythrins and nigerythrins have putative peroxide activity.
Probab=42.75  E-value=19  Score=22.52  Aligned_cols=25  Identities=24%  Similarity=0.562  Sum_probs=16.9

Q ss_pred             cccCCcCcCCcccccccCCcccccCcccccccCCc
Q psy5736         229 IRCSNDNCGQIIATVKKLEPSAKKVEKKCESCNST  263 (346)
Q Consensus       229 ~~C~~~~C~g~~~~~~~~~~~~~~~~~~C~~C~~~  263 (346)
                      +.|+  .|+-...+..        ..|.|+.||..
T Consensus         2 ~~C~--~CGy~y~~~~--------~~~~CP~Cg~~   26 (33)
T cd00350           2 YVCP--VCGYIYDGEE--------APWVCPVCGAP   26 (33)
T ss_pred             EECC--CCCCEECCCc--------CCCcCcCCCCc
Confidence            5787  8875544332        24999999985


No 185
>KOG0640|consensus
Probab=42.57  E-value=18  Score=35.07  Aligned_cols=55  Identities=7%  Similarity=0.159  Sum_probs=45.5

Q ss_pred             ccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCC
Q psy5736           4 LFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNY   65 (346)
Q Consensus         4 ~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~   65 (346)
                      -+|.|.+||=-+++-+..|.+-.     -+.+|++.+|-  .+..++|+-+-|..+++|..-
T Consensus       281 kDG~IklwDGVS~rCv~t~~~AH-----~gsevcSa~Ft--kn~kyiLsSG~DS~vkLWEi~  335 (430)
T KOG0640|consen  281 KDGAIKLWDGVSNRCVRTIGNAH-----GGSEVCSAVFT--KNGKYILSSGKDSTVKLWEIS  335 (430)
T ss_pred             cCCcEEeeccccHHHHHHHHhhc-----CCceeeeEEEc--cCCeEEeecCCcceeeeeeec
Confidence            58999999999999999997664     36899998864  467888888889999999654


No 186
>KOG0645|consensus
Probab=42.46  E-value=29  Score=32.98  Aligned_cols=33  Identities=18%  Similarity=0.293  Sum_probs=26.1

Q ss_pred             CCCCcccceeccccCCCceEEeeecCCceEEEeC
Q psy5736          31 RNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRN   64 (346)
Q Consensus        31 ~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~   64 (346)
                      -|..+|.+++. |+....++++|++||.+++|+.
T Consensus       278 aHe~dVNsV~w-~p~~~~~L~s~~DDG~v~~W~l  310 (312)
T KOG0645|consen  278 AHEVDVNSVQW-NPKVSNRLASGGDDGIVNFWEL  310 (312)
T ss_pred             ccccccceEEE-cCCCCCceeecCCCceEEEEEe
Confidence            35667888884 5556788899999999999975


No 187
>PF06957 COPI_C:  Coatomer (COPI) alpha subunit C-terminus;  InterPro: IPR010714 Proteins synthesised on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. This traffic is bidirectional, to ensure that proteins required to form vesicles are recycled. Vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transfer []. While clathrin mediates endocytic protein transport, and transport from ER to Golgi, coatomers primarily mediate intra-Golgi transport, as well as the reverse Golgi to ER transport of dilysine-tagged proteins []. For example, the coatomer COP1 (coat protein complex 1) is responsible for reverse transport of recycled proteins from Golgi and pre-Golgi compartments back to the ER, while COPII buds vesicles from the ER to the Golgi []. Coatomers reversibly associate with Golgi (non-clathrin-coated) vesicles to mediate protein transport and for budding from Golgi membranes []. Activated small guanine triphosphatases (GTPases) attract coat proteins to specific membrane export sites, thereby linking coatomers to export cargos. As coat proteins polymerise, vesicles are formed and budded from membrane-bound organelles. Coatomer complexes also influence Golgi structural integrity, as well as the processing, activity, and endocytic recycling of LDL receptors. In mammals, coatomer complexes can only be recruited by membranes associated to ADP-ribosylation factors (ARFs), which are small GTP-binding proteins. Coatomer complexes are hetero-oligomers composed of at least an alpha, beta, beta', gamma, delta, epsilon and zeta subunits.  This entry represents the C terminus (approximately 500 residues) of the eukaryotic coatomer alpha subunit [, ]. This domain is found along with the IPR006692 from INTERPRO domain. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0005198 structural molecule activity, 0005515 protein binding, 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030126 COPI vesicle coat; PDB: 3MKR_B 3MV2_E 3MKQ_B 3MV3_A.
Probab=41.85  E-value=1e+02  Score=31.20  Aligned_cols=55  Identities=29%  Similarity=0.373  Sum_probs=39.2

Q ss_pred             HHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHH
Q psy5736         273 LSELNEMFYRGIEQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNC  327 (346)
Q Consensus       273 l~~~~~~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~  327 (346)
                      +..+.+.++.+-+++..|++.+|+..+..+|...--+.........++++.|.-|
T Consensus       201 l~~L~~~Lk~gyk~~t~gKF~eA~~~Fr~iL~~i~l~vv~~~~E~~e~~eli~ic  255 (422)
T PF06957_consen  201 LSSLEERLKEGYKLFTAGKFEEAIEIFRSILHSIPLLVVESREEEDEAKELIEIC  255 (422)
T ss_dssp             HHHHHHHHHHHHHHHHTT-HHHHHHHHHHHHHHHHC--BSSCHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHhheeeecCHHHHHHHHHHHHHH
Confidence            4445556677888889999999999999998877666677777777777666554


No 188
>PF09538 FYDLN_acid:  Protein of unknown function (FYDLN_acid);  InterPro: IPR012644 Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=41.76  E-value=17  Score=29.55  Aligned_cols=30  Identities=20%  Similarity=0.466  Sum_probs=21.4

Q ss_pred             cccCCcCcCCcccccccCCcccccCcccccccCCcCChH
Q psy5736         229 IRCSNDNCGQIIATVKKLEPSAKKVEKKCESCNSTSDLT  267 (346)
Q Consensus       229 ~~C~~~~C~g~~~~~~~~~~~~~~~~~~C~~C~~~~~~~  267 (346)
                      -.||  +|+..++--+-       ...+|++||..+..+
T Consensus        10 R~Cp--~CG~kFYDLnk-------~PivCP~CG~~~~~~   39 (108)
T PF09538_consen   10 RTCP--SCGAKFYDLNK-------DPIVCPKCGTEFPPE   39 (108)
T ss_pred             ccCC--CCcchhccCCC-------CCccCCCCCCccCcc
Confidence            4688  89876653321       238899999988776


No 189
>KOG0973|consensus
Probab=41.34  E-value=33  Score=37.91  Aligned_cols=55  Identities=20%  Similarity=0.204  Sum_probs=39.4

Q ss_pred             cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCCc
Q psy5736           3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNYA   66 (346)
Q Consensus         3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~~   66 (346)
                      |++++|.|||..+-.++-.+. +      |.+-++-+-+ + +.+.++.+-++|+.+++|+..+
T Consensus       148 s~DnsViiwn~~tF~~~~vl~-~------H~s~VKGvs~-D-P~Gky~ASqsdDrtikvwrt~d  202 (942)
T KOG0973|consen  148 SLDNSVIIWNAKTFELLKVLR-G------HQSLVKGVSW-D-PIGKYFASQSDDRTLKVWRTSD  202 (942)
T ss_pred             cccceEEEEccccceeeeeee-c------ccccccceEE-C-CccCeeeeecCCceEEEEEccc
Confidence            789999999999986665552 2      3444544331 1 4477888899999999998543


No 190
>PF14276 DUF4363:  Domain of unknown function (DUF4363)
Probab=41.05  E-value=1.7e+02  Score=23.71  Aligned_cols=56  Identities=7%  Similarity=-0.021  Sum_probs=41.4

Q ss_pred             HHHHHHHHHHHhccccHHHHHHHHHHHHHHhhcccC-----CCcHhHHHHHHHHHHHHHHh
Q psy5736         276 LNEMFYRGIEQMNTSCFREAVESLTKFSDQIHELVV-----PPYKLASLAHEALRNCWSLA  331 (346)
Q Consensus       276 ~~~~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l~-----p~h~~~~~~~~~L~~~y~~~  331 (346)
                      +.+....+.+.++++++++|.+.+.++.+.+.+.-.     -.|..+......|+++-+.+
T Consensus        28 i~~~l~~i~~~i~~~dW~~A~~~~~~l~~~W~k~~~~~~~~~~h~eid~i~~sl~rl~~~i   88 (121)
T PF14276_consen   28 IEEQLEQIEEAIENEDWEKAYKETEELEKEWDKNKKRWSILIEHQEIDNIDISLARLKGYI   88 (121)
T ss_pred             HHHHHHHHHHHHHhCCHHHHHHHHHHHHHHHHhhchheeeeecHHHHHHHHHHHHHHHHHH
Confidence            334445556677889999999999999988776533     37888888888888776654


No 191
>PF09976 TPR_21:  Tetratricopeptide repeat;  InterPro: IPR018704  This domain, found in various hypothetical prokaryotic proteins, has no known function. 
Probab=40.64  E-value=1.2e+02  Score=24.98  Aligned_cols=39  Identities=23%  Similarity=0.429  Sum_probs=22.9

Q ss_pred             hccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCCc
Q psy5736         287 MNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGNK  334 (346)
Q Consensus       287 ~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~~  334 (346)
                      +..+++++|+..|..         .+....-..+.+.+..+|...|+.
T Consensus        96 ~~~~~~d~Al~~L~~---------~~~~~~~~~~~~~~Gdi~~~~g~~  134 (145)
T PF09976_consen   96 LQQGQYDEALATLQQ---------IPDEAFKALAAELLGDIYLAQGDY  134 (145)
T ss_pred             HHcCCHHHHHHHHHh---------ccCcchHHHHHHHHHHHHHHCCCH
Confidence            445666665555433         233334455677788888887765


No 192
>KOG1539|consensus
Probab=40.19  E-value=45  Score=36.18  Aligned_cols=51  Identities=24%  Similarity=0.250  Sum_probs=39.2

Q ss_pred             ccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736           4 LFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY   62 (346)
Q Consensus         4 ~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~   62 (346)
                      -+|.+.+||-++.++.+.-.|-      |..-++.+.+++  ..+++++++-|..+++|
T Consensus       265 ~~G~m~~wDLe~kkl~~v~~na------h~~sv~~~~fl~--~epVl~ta~~DnSlk~~  315 (910)
T KOG1539|consen  265 SNGDMAFWDLEKKKLINVTRNA------HYGSVTGATFLP--GEPVLVTAGADNSLKVW  315 (910)
T ss_pred             CCceEEEEEcCCCeeeeeeecc------ccCCcccceecC--CCceEeeccCCCceeEE
Confidence            4699999999988887766543      556677777775  56778888888888887


No 193
>cd00189 TPR Tetratricopeptide repeat domain; typically contains 34 amino acids [WLF]-X(2)-[LIM]-[GAS]-X(2)-[YLF]-X(8)-[ASE]-X(3)-[FYL]-X(2)-[ASL]-X(4)-[PKE] is the consensus sequence; found in a variety of organisms including bacteria, cyanobacteria, yeast, fungi, plants, and humans in various subcellular locations; involved in a variety of functions including protein-protein interactions, but common features in the interaction partners have not been defined; involved in chaperone, cell-cycle, transciption, and protein transport complexes; the number of TPR motifs varies among proteins (1,3-11,13 15,16,19); 5-6 tandem repeats generate a right-handed helical structure with an amphipathic channel that is thought to accomodate an alpha-helix of a target protein; it has been proposed that TPR proteins preferably interact with WD-40 repeat proteins, but in many instances several TPR-proteins seem to aggregate to multi-protein complexes; examples of TPR-proteins include, Cdc16p, Cdc23p and C
Probab=39.56  E-value=62  Score=22.34  Aligned_cols=16  Identities=19%  Similarity=-0.002  Sum_probs=9.1

Q ss_pred             HHHHHHHHHHHHhCCc
Q psy5736         319 LAHEALRNCWSLAGNK  334 (346)
Q Consensus       319 ~~~~~L~~~y~~~G~~  334 (346)
                      .+...++.++...|+.
T Consensus        69 ~~~~~~~~~~~~~~~~   84 (100)
T cd00189          69 KAYYNLGLAYYKLGKY   84 (100)
T ss_pred             hHHHHHHHHHHHHHhH
Confidence            4455566666666553


No 194
>KOG2445|consensus
Probab=39.24  E-value=52  Score=31.84  Aligned_cols=59  Identities=17%  Similarity=0.294  Sum_probs=43.4

Q ss_pred             cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeC
Q psy5736           3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRN   64 (346)
Q Consensus         3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~   64 (346)
                      ..+++|.|||..++.-  .|.+- .+|.-|..-|=.+-+.-+.-+.++.+.+-|+.++||.+
T Consensus        32 SsDq~vkI~d~~~~s~--~W~~T-s~Wrah~~Si~rV~WAhPEfGqvvA~cS~Drtv~iWEE   90 (361)
T KOG2445|consen   32 SSDQTVKIWDSTSDSG--TWSCT-SSWRAHDGSIWRVVWAHPEFGQVVATCSYDRTVSIWEE   90 (361)
T ss_pred             cCCCcEEEEeccCCCC--ceEEe-eeEEecCCcEEEEEecCccccceEEEEecCCceeeeee
Confidence            3688999999754422  22222 45788888887777777767888899999999999955


No 195
>PRK06266 transcription initiation factor E subunit alpha; Validated
Probab=39.17  E-value=22  Score=31.42  Aligned_cols=33  Identities=27%  Similarity=0.792  Sum_probs=23.6

Q ss_pred             CCceecCCCCCCCccccccccCccccccCCcCcCCccccc
Q psy5736         204 WFECKCRACVENWPLMESLEKYPIRIRCSNDNCGQIIATV  243 (346)
Q Consensus       204 ~F~C~C~~C~~~~~~~~~~~~~~~~~~C~~~~C~g~~~~~  243 (346)
                      +|.|  +.|...+...+.+.   ..|.||  .|++.+...
T Consensus       117 ~Y~C--p~C~~rytf~eA~~---~~F~Cp--~Cg~~L~~~  149 (178)
T PRK06266        117 FFFC--PNCHIRFTFDEAME---YGFRCP--QCGEMLEEY  149 (178)
T ss_pred             EEEC--CCCCcEEeHHHHhh---cCCcCC--CCCCCCeec
Confidence            4544  88988776554443   589999  999987654


No 196
>KOG0553|consensus
Probab=38.98  E-value=86  Score=30.18  Aligned_cols=33  Identities=18%  Similarity=0.187  Sum_probs=25.9

Q ss_pred             HHHHHHHHHHHHHhccccHHHHHHHHHHHHHHh
Q psy5736         274 SELNEMFYRGIEQMNTSCFREAVESLTKFSDQI  306 (346)
Q Consensus       274 ~~~~~~~~~a~~~~~~~~~~~ai~~l~~~l~~~  306 (346)
                      ..++.+=.++-++|+.++|++|+..|.+.|+.-
T Consensus        79 ~~AE~LK~eGN~~m~~~~Y~eAv~kY~~AI~l~  111 (304)
T KOG0553|consen   79 ALAESLKNEGNKLMKNKDYQEAVDKYTEAIELD  111 (304)
T ss_pred             HHHHHHHHHHHHHHHhhhHHHHHHHHHHHHhcC
Confidence            345555567788899999999999999998754


No 197
>PF08271 TF_Zn_Ribbon:  TFIIB zinc-binding;  InterPro: IPR013137 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  This entry represents a zinc finger motif found in transcription factor IIB (TFIIB). In eukaryotes the initiation of transcription of protein encoding genes by the polymerase II complexe (Pol II) is modulated by general and specific transcription factors. The general transcription factors operate through common promoters elements (such as the TATA box). At least seven different proteins associate to form the general transcription factors: TFIIA, -IIB, -IID, -IIE, -IIF, -IIG, and -IIH [].  TFIIB and TFIID are responsible for promoter recognition and interaction with pol II; together with Pol II, they form a minimal initiation complex capable of transcription under certain conditions. The TATA box of a Pol II promoter is bound in the initiation complex by the TBP subunit of TFIID, which bends the DNA around the C-terminal domain of TFIIB whereas the N-terminal zinc finger of TFIIB interacts with Pol II [, ]. The TFIIB zinc finger adopts a zinc ribbon fold characterised by two beta-hairpins forming two structurally similar zinc-binding sub-sites []. The zinc finger contacts the rbp1 subunit of Pol II through its dock domain, a conserved region of about 70 amino acids located close to the polymerase active site []. In the Pol II complex this surface is located near the RNA exit groove. Interestingly this sequence is best conserved in the three polymerases that utilise a TFIIB-like general transcription factor (Pol II, Pol III, and archaeal RNA polymerase) but not in Pol I [].  More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 1VD4_A 1PFT_A 3K1F_M 3K7A_M 1RO4_A 1RLY_A 1DL6_A.
Probab=38.85  E-value=16  Score=24.30  Aligned_cols=11  Identities=18%  Similarity=0.561  Sum_probs=6.3

Q ss_pred             ccccccCCcCC
Q psy5736         255 KKCESCNSTSD  265 (346)
Q Consensus       255 ~~C~~C~~~~~  265 (346)
                      ..|..||..+.
T Consensus        20 ~vC~~CG~Vl~   30 (43)
T PF08271_consen   20 LVCPNCGLVLE   30 (43)
T ss_dssp             EEETTT-BBEE
T ss_pred             EECCCCCCEee
Confidence            67777776543


No 198
>PF14803 Nudix_N_2:  Nudix N-terminal; PDB: 3CNG_C.
Probab=38.82  E-value=17  Score=23.23  Aligned_cols=30  Identities=17%  Similarity=0.406  Sum_probs=13.8

Q ss_pred             ccCCcCcCCcccccccCCcccccCcccccccCCc
Q psy5736         230 RCSNDNCGQIIATVKKLEPSAKKVEKKCESCNST  263 (346)
Q Consensus       230 ~C~~~~C~g~~~~~~~~~~~~~~~~~~C~~C~~~  263 (346)
                      .|+  .|++.+...-+..  +....+.|+.||..
T Consensus         2 fC~--~CG~~l~~~ip~g--d~r~R~vC~~Cg~I   31 (34)
T PF14803_consen    2 FCP--QCGGPLERRIPEG--DDRERLVCPACGFI   31 (34)
T ss_dssp             B-T--TT--B-EEE--TT---SS-EEEETTTTEE
T ss_pred             ccc--cccChhhhhcCCC--CCccceECCCCCCE
Confidence            377  8887765432211  12346899999863


No 199
>KOG1445|consensus
Probab=38.38  E-value=40  Score=35.71  Aligned_cols=60  Identities=27%  Similarity=0.264  Sum_probs=43.1

Q ss_pred             CcccceEEEEecCCCceeee-eCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCCc
Q psy5736           2 TGLFLKEGIWDHITGSRLIS-WRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNYA   66 (346)
Q Consensus         2 ~~~~~~~~~w~~~~~~~~~~-~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~~   66 (346)
                      .|--|.|.|++..+--||-. .-.+-    -.++-||+|++ |+-|+..+.+|.+||.|++|+.-.
T Consensus       599 ~g~gG~iai~el~~PGrLPDgv~p~l----~Ngt~vtDl~W-dPFD~~rLAVa~ddg~i~lWr~~a  659 (1012)
T KOG1445|consen  599 AGSGGVIAIYELNEPGRLPDGVMPGL----FNGTLVTDLHW-DPFDDERLAVATDDGQINLWRLTA  659 (1012)
T ss_pred             cCCCceEEEEEcCCCCCCCccccccc----ccCceeeeccc-CCCChHHeeecccCceEEEEEecc
Confidence            46678999999987656531 11111    14578999985 777777779999999999997553


No 200
>KOG0278|consensus
Probab=38.23  E-value=66  Score=30.39  Aligned_cols=48  Identities=23%  Similarity=0.275  Sum_probs=31.8

Q ss_pred             ccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736           4 LFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY   62 (346)
Q Consensus         4 ~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~   62 (346)
                      -+++|++||..++.++++..-        +..+|+|++..  +..++. -...+.|..|
T Consensus       163 dd~tVRLWD~rTgt~v~sL~~--------~s~VtSlEvs~--dG~ilT-ia~gssV~Fw  210 (334)
T KOG0278|consen  163 DDKTVRLWDHRTGTEVQSLEF--------NSPVTSLEVSQ--DGRILT-IAYGSSVKFW  210 (334)
T ss_pred             cCCceEEEEeccCcEEEEEec--------CCCCcceeecc--CCCEEE-EecCceeEEe
Confidence            468999999999999987643        34588888543  334433 3334455555


No 201
>KOG0305|consensus
Probab=37.99  E-value=41  Score=34.56  Aligned_cols=56  Identities=21%  Similarity=0.267  Sum_probs=42.2

Q ss_pred             CcccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeee--cCCceEEEeCC
Q psy5736           2 TGLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGY--DDGSLSVYRNY   65 (346)
Q Consensus         2 ~~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~--~dG~vri~r~~   65 (346)
                      .|.+..+.|||-.....+-+|.+       |+.-+++|.+ -++...+|.+|.  .|+.|++|.-.
T Consensus       319 GgnDN~~~Iwd~~~~~p~~~~~~-------H~aAVKA~aw-cP~q~~lLAsGGGs~D~~i~fwn~~  376 (484)
T KOG0305|consen  319 GGNDNVVFIWDGLSPEPKFTFTE-------HTAAVKALAW-CPWQSGLLATGGGSADRCIKFWNTN  376 (484)
T ss_pred             CCCccceEeccCCCccccEEEec-------cceeeeEeee-CCCccCceEEcCCCcccEEEEEEcC
Confidence            37899999999976666666654       4566888775 457777878776  79999999655


No 202
>KOG1240|consensus
Probab=37.93  E-value=33  Score=38.84  Aligned_cols=38  Identities=24%  Similarity=0.478  Sum_probs=26.8

Q ss_pred             CCCCCCCCCcccceecccc--------CC-CceEEeeecCCceEEEe
Q psy5736          26 RGSWARNSARCTALTLLNE--------HD-DSLLLTGYDDGSLSVYR   63 (346)
Q Consensus        26 ~~~w~~~~~~~t~l~~ine--------~~-~~lll~g~~dG~vri~r   63 (346)
                      .+.|.+.|..|+-|....+        .+ .++.++|++||.||+|.
T Consensus      1031 ~~gW~p~G~lVAhL~Ehs~~v~k~a~s~~~~s~FvsgS~DGtVKvW~ 1077 (1431)
T KOG1240|consen 1031 PPGWNPRGILVAHLHEHSSAVIKLAVSSEHTSLFVSGSDDGTVKVWN 1077 (1431)
T ss_pred             CCCCCccceEeehhhhccccccceeecCCCCceEEEecCCceEEEee
Confidence            3447777776665544332        33 48999999999999994


No 203
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=37.88  E-value=1.6e+02  Score=25.41  Aligned_cols=24  Identities=4%  Similarity=-0.278  Sum_probs=18.0

Q ss_pred             CcHhHHHHHHHHHHHHHHhCCccc
Q psy5736         313 PYKLASLAHEALRNCWSLAGNKWV  336 (346)
Q Consensus       313 ~h~~~~~~~~~L~~~y~~~G~~~~  336 (346)
                      ..+...+.++.+.+.+...|..+.
T Consensus       101 ~~IsveEIqDiVE~~L~~~~~~ay  124 (154)
T PRK00464        101 REVPSKEIGELVMEELKKLDEVAY  124 (154)
T ss_pred             CCCCHHHHHHHHHHHHHhcCCEEE
Confidence            357778888888888888777654


No 204
>KOG1446|consensus
Probab=37.00  E-value=69  Score=30.85  Aligned_cols=53  Identities=13%  Similarity=0.090  Sum_probs=40.7

Q ss_pred             cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeC
Q psy5736           3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRN   64 (346)
Q Consensus         3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~   64 (346)
                      .-+-+|+--+....+.+-+|.       -|+.+|++|. +++.+| ..|.++-|+.||+|..
T Consensus        77 k~d~tIryLsl~dNkylRYF~-------GH~~~V~sL~-~sP~~d-~FlS~S~D~tvrLWDl  129 (311)
T KOG1446|consen   77 KEDDTIRYLSLHDNKYLRYFP-------GHKKRVNSLS-VSPKDD-TFLSSSLDKTVRLWDL  129 (311)
T ss_pred             CCCCceEEEEeecCceEEEcC-------CCCceEEEEE-ecCCCC-eEEecccCCeEEeeEe
Confidence            345678888888888887873       3788999988 565554 5588999999999943


No 205
>PLN03088 SGT1,  suppressor of G2 allele of SKP1; Provisional
Probab=36.86  E-value=74  Score=31.11  Aligned_cols=44  Identities=16%  Similarity=0.050  Sum_probs=24.6

Q ss_pred             HHHHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCCc
Q psy5736         283 GIEQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGNK  334 (346)
Q Consensus       283 a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~~  334 (346)
                      |...+..|++++|+..+.++++..     |.+   ..+...+..+|..+|+.
T Consensus        43 a~~~~~~g~~~eAl~~~~~Al~l~-----P~~---~~a~~~lg~~~~~lg~~   86 (356)
T PLN03088         43 AQANIKLGNFTEAVADANKAIELD-----PSL---AKAYLRKGTACMKLEEY   86 (356)
T ss_pred             HHHHHHcCCHHHHHHHHHHHHHhC-----cCC---HHHHHHHHHHHHHhCCH
Confidence            334455566666666666666542     333   23455666666666654


No 206
>PRK05580 primosome assembly protein PriA; Validated
Probab=36.78  E-value=54  Score=35.16  Aligned_cols=18  Identities=17%  Similarity=0.152  Sum_probs=12.3

Q ss_pred             HHHHHHHHHHHHHHhCCc
Q psy5736         317 ASLAHEALRNCWSLAGNK  334 (346)
Q Consensus       317 ~~~~~~~L~~~y~~~G~~  334 (346)
                      ...+.+.+.|..++.|+.
T Consensus       519 ~Er~~~~l~q~~GRagR~  536 (679)
T PRK05580        519 SERTFQLLTQVAGRAGRA  536 (679)
T ss_pred             HHHHHHHHHHHHhhccCC
Confidence            334567788888888773


No 207
>PF13371 TPR_9:  Tetratricopeptide repeat
Probab=36.75  E-value=61  Score=22.96  Aligned_cols=40  Identities=18%  Similarity=0.170  Sum_probs=22.7

Q ss_pred             hccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCCc
Q psy5736         287 MNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGNK  334 (346)
Q Consensus       287 ~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~~  334 (346)
                      +..+++++|++.+++.++..     |.+   ..+....+.+|..+|+.
T Consensus         6 ~~~~~~~~A~~~~~~~l~~~-----p~~---~~~~~~~a~~~~~~g~~   45 (73)
T PF13371_consen    6 LQQEDYEEALEVLERALELD-----PDD---PELWLQRARCLFQLGRY   45 (73)
T ss_pred             HhCCCHHHHHHHHHHHHHhC-----ccc---chhhHHHHHHHHHhccH
Confidence            45677777777777777663     222   23344455555555543


No 208
>KOG0285|consensus
Probab=35.94  E-value=28  Score=34.28  Aligned_cols=53  Identities=19%  Similarity=0.238  Sum_probs=37.2

Q ss_pred             CcccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736           2 TGLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYR   63 (346)
Q Consensus         2 ~~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r   63 (346)
                      .|.+++|.-||.|..+-+-.|.-       |-+-+-+|.+.  ++..++++|+.|..+|+|.
T Consensus       211 ~gedk~VKCwDLe~nkvIR~YhG-------HlS~V~~L~lh--PTldvl~t~grDst~RvWD  263 (460)
T KOG0285|consen  211 AGEDKQVKCWDLEYNKVIRHYHG-------HLSGVYCLDLH--PTLDVLVTGGRDSTIRVWD  263 (460)
T ss_pred             ecCCCeeEEEechhhhhHHHhcc-------ccceeEEEecc--ccceeEEecCCcceEEEee
Confidence            47899999999998866654431       23333333321  4577889999999999993


No 209
>PF04423 Rad50_zn_hook:  Rad50 zinc hook motif;  InterPro: IPR007517 The Mre11 complex (Mre11 Rad50 Nbs1) is central to chromosomal maintenance and functions in homologous recombination, telomere maintenance and sister chromatid association. The Rad50 coiled-coil region contains a dimer interface at the apex of the coiled coils in which pairs of conserved Cys-X-X-Cys motifs form interlocking hooks that bind one Zn ion. This alignment includes the zinc hook motif and a short stretch of coiled-coil on either side.; GO: 0004518 nuclease activity, 0005524 ATP binding, 0008270 zinc ion binding, 0006281 DNA repair; PDB: 1L8D_B.
Probab=35.74  E-value=53  Score=22.78  Aligned_cols=25  Identities=12%  Similarity=0.227  Sum_probs=13.5

Q ss_pred             ccccccCCcCChHHHHHHHHHHHHH
Q psy5736         255 KKCESCNSTSDLTEIKTKLSELNEM  279 (346)
Q Consensus       255 ~~C~~C~~~~~~~~~~~~l~~~~~~  279 (346)
                      ..|+.|+..++.+.-...+.+....
T Consensus        21 ~~CPlC~r~l~~e~~~~li~~~~~~   45 (54)
T PF04423_consen   21 GCCPLCGRPLDEEHRQELIKKYKSE   45 (54)
T ss_dssp             EE-TTT--EE-HHHHHHHHHHHHHH
T ss_pred             CcCCCCCCCCCHHHHHHHHHHHHHH
Confidence            4899999999887655555444443


No 210
>COG1998 RPS31 Ribosomal protein S27AE [Translation, ribosomal structure and biogenesis]
Probab=35.44  E-value=22  Score=24.75  Aligned_cols=27  Identities=22%  Similarity=0.448  Sum_probs=17.1

Q ss_pred             cccCCcCcCCcccccccCCcccccCcccccccCCc
Q psy5736         229 IRCSNDNCGQIIATVKKLEPSAKKVEKKCESCNST  263 (346)
Q Consensus       229 ~~C~~~~C~g~~~~~~~~~~~~~~~~~~C~~C~~~  263 (346)
                      -.||  .|+.++.-....      ..|.|.+||.+
T Consensus        20 ~~CP--rCG~gvfmA~H~------dR~~CGkCgyT   46 (51)
T COG1998          20 RFCP--RCGPGVFMADHK------DRWACGKCGYT   46 (51)
T ss_pred             ccCC--CCCCcchhhhcC------ceeEeccccce
Confidence            4688  897544333221      25999999975


No 211
>PRK10866 outer membrane biogenesis protein BamD; Provisional
Probab=35.21  E-value=1.2e+02  Score=28.04  Aligned_cols=44  Identities=7%  Similarity=-0.108  Sum_probs=36.0

Q ss_pred             HHHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhC
Q psy5736         284 IEQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAG  332 (346)
Q Consensus       284 ~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G  332 (346)
                      ...+..+++++|+..++++++.+     |.|+....+...+..++..+|
T Consensus        77 ~ayy~~~~y~~A~~~~e~fi~~~-----P~~~~~~~a~Y~~g~~~~~~~  120 (243)
T PRK10866         77 YAYYKNADLPLAQAAIDRFIRLN-----PTHPNIDYVLYMRGLTNMALD  120 (243)
T ss_pred             HHHHhcCCHHHHHHHHHHHHHhC-----cCCCchHHHHHHHHHhhhhcc
Confidence            34467888999999999999876     889988888888888766665


No 212
>KOG1063|consensus
Probab=35.10  E-value=72  Score=34.03  Aligned_cols=54  Identities=15%  Similarity=0.094  Sum_probs=40.0

Q ss_pred             cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceecccc-CCCceEEeeecCCceEEEe
Q psy5736           3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNE-HDDSLLLTGYDDGSLSVYR   63 (346)
Q Consensus         3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine-~~~~lll~g~~dG~vri~r   63 (346)
                      |..-+|.|||=++...+. ..+      .|..+|++++++-. .-.+.+++|.+||.|.+|+
T Consensus        31 Ga~~~Iav~dp~k~~i~t-~l~------GH~a~VnC~~~l~~s~~~a~~vsG~sD~~v~lW~   85 (764)
T KOG1063|consen   31 GAGPAIAVADPEKILIVT-TLD------GHVARVNCVHWLPTSEIVAEMVSGDSDGRVILWK   85 (764)
T ss_pred             cCCceEEEeCcccceeEE-ecc------CCccceEEEEEcccccccceEEEccCCCcEEEEE
Confidence            556689999988773333 222      37899999999843 3456789999999999984


No 213
>PF13525 YfiO:  Outer membrane lipoprotein; PDB: 3TGO_A 3Q5M_A 2YHC_A.
Probab=35.09  E-value=1.4e+02  Score=26.50  Aligned_cols=46  Identities=15%  Similarity=0.127  Sum_probs=33.9

Q ss_pred             HHHHHHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHh
Q psy5736         281 YRGIEQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLA  331 (346)
Q Consensus       281 ~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~  331 (346)
                      ..|......+++++|+..++++++.+     |.|+....+...+..++-..
T Consensus        47 ~la~a~y~~~~y~~A~~~~~~fi~~y-----P~~~~~~~A~Y~~g~~~~~~   92 (203)
T PF13525_consen   47 MLAYAYYKQGDYEEAIAAYERFIKLY-----PNSPKADYALYMLGLSYYKQ   92 (203)
T ss_dssp             HHHHHHHHTT-HHHHHHHHHHHHHH------TT-TTHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHcCCHHHHHHHHHHHHHHC-----CCCcchhhHHHHHHHHHHHh
Confidence            33444567789999999999999877     88888888888887776655


No 214
>PF08969 USP8_dimer:  USP8 dimerisation domain;  InterPro: IPR015063 This domain is predominantly found in the amino terminal region of Ubiquitin carboxyl-terminal hydrolase 8 (USP8). It has no known function. ; PDB: 2XZE_B 2A9U_A.
Probab=35.07  E-value=83  Score=25.39  Aligned_cols=45  Identities=13%  Similarity=0.110  Sum_probs=34.3

Q ss_pred             HHHHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHhhcccCCCcHh
Q psy5736         270 KTKLSELNEMFYRGIEQMNTSCFREAVESLTKFSDQIHELVVPPYKL  316 (346)
Q Consensus       270 ~~~l~~~~~~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~  316 (346)
                      ...+..+..++++|...+..|+.+.|--+|.+++..+..+  |.|+.
T Consensus        32 ~~y~rsa~~l~~~A~~~~~egd~E~AYvl~~R~~~L~~ki--~~Hpd   76 (115)
T PF08969_consen   32 KRYLRSANKLLREAEEYRQEGDEEQAYVLYMRYLTLVEKI--PKHPD   76 (115)
T ss_dssp             HHHHHHHHHHHHHHHHHHHCT-HHHHHHHHHHHHHHHCCH--CCSCC
T ss_pred             HHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHh--hcCcc
Confidence            3356677788888988899999999999999999999333  44443


No 215
>KOG0300|consensus
Probab=35.03  E-value=31  Score=33.51  Aligned_cols=51  Identities=20%  Similarity=0.371  Sum_probs=42.3

Q ss_pred             ccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736           4 LFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYR   63 (346)
Q Consensus         4 ~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r   63 (346)
                      .+-+-.||-.|+|+-|-.|.-       |...+.++||.|  ...++++|+-|+.-.||+
T Consensus       168 ADhTA~iWs~Esg~CL~~Y~G-------H~GSVNsikfh~--s~~L~lTaSGD~taHIW~  218 (481)
T KOG0300|consen  168 ADHTARIWSLESGACLATYTG-------HTGSVNSIKFHN--SGLLLLTASGDETAHIWK  218 (481)
T ss_pred             cccceeEEeeccccceeeecc-------cccceeeEEecc--ccceEEEccCCcchHHHH
Confidence            356778999999999988742       566799999988  566889999999999986


No 216
>PF01599 Ribosomal_S27:  Ribosomal protein S27a;  InterPro: IPR002906 Ribosomes are the particles that catalyse mRNA-directed protein synthesis in all organisms. The codons of the mRNA are exposed on the ribosome to allow tRNA binding. This leads to the incorporation of amino acids into the growing polypeptide chain in accordance with the genetic information. Incoming amino acid monomers enter the ribosomal A site in the form of aminoacyl-tRNAs complexed with elongation factor Tu (EF-Tu) and GTP. The growing polypeptide chain, situated in the P site as peptidyl-tRNA, is then transferred to aminoacyl-tRNA and the new peptidyl-tRNA, extended by one residue, is translocated to the P site with the aid the elongation factor G (EF-G) and GTP as the deacylated tRNA is released from the ribosome through one or more exit sites [, ]. About 2/3 of the mass of the ribosome consists of RNA and 1/3 of protein. The proteins are named in accordance with the subunit of the ribosome which they belong to - the small (S1 to S31) and the large (L1 to L44). Usually they decorate the rRNA cores of the subunits.  Many ribosomal proteins, particularly those of the large subunit, are composed of a globular, surfaced-exposed domain with long finger-like projections that extend into the rRNA core to stabilise its structure. Most of the proteins interact with multiple RNA elements, often from different domains. In the large subunit, about 1/3 of the 23S rRNA nucleotides are at least in van der Waal's contact with protein, and L22 interacts with all six domains of the 23S rRNA. Proteins S4 and S7, which initiate assembly of the 16S rRNA, are located at junctions of five and four RNA helices, respectively. In this way proteins serve to organise and stabilise the rRNA tertiary structure. While the crucial activities of decoding and peptide transfer are RNA based, proteins play an active role in functions that may have evolved to streamline the process of protein synthesis. In addition to their function in the ribosome, many ribosomal proteins have some function 'outside' the ribosome [, ]. This family of ribosomal proteins consists mainly of the 40S ribosomal protein S27a which is synthesized as a C-terminal extension of ubiquitin (CEP) (IPR000626 from INTERPRO). The S27a domain compromises the C-terminal half of the protein. The synthesis of ribosomal proteins as extensions of ubiquitin promotes their incorporation into nascent ribosomes by a transient metabolic stabilisation and is required for efficient ribosome biogenesis []. The ribosomal extension protein S27a contains a basic region that is proposed to form a zinc finger; its fusion gene is proposed as a mechanism to maintain a fixed ratio between ubiquitin necessary for degrading proteins and ribosomes a source of proteins [].; GO: 0003735 structural constituent of ribosome, 0006412 translation, 0005622 intracellular, 0005840 ribosome; PDB: 2K4X_A 3U5C_f 3U5G_f 2XZN_9 2XZM_9.
Probab=35.01  E-value=24  Score=24.31  Aligned_cols=30  Identities=23%  Similarity=0.597  Sum_probs=18.6

Q ss_pred             cccccCCcCcCCcccccccCCcccccCcccccccCC
Q psy5736         227 IRIRCSNDNCGQIIATVKKLEPSAKKVEKKCESCNS  262 (346)
Q Consensus       227 ~~~~C~~~~C~g~~~~~~~~~~~~~~~~~~C~~C~~  262 (346)
                      ..-.||++.|+.++.-...      ...+.|.+|+.
T Consensus        17 ~rk~CP~~~CG~GvFMA~H------~dR~~CGKCg~   46 (47)
T PF01599_consen   17 LRKECPSPRCGAGVFMAEH------KDRHYCGKCGY   46 (47)
T ss_dssp             SSEE-TSTTTTSSSEEEE-------SSEEEETTTSS
T ss_pred             hhhcCCCcccCCceEeeec------CCCccCCCccc
Confidence            4567998899865433322      12599999985


No 217
>COG2319 FOG: WD40 repeat [General function prediction only]
Probab=34.87  E-value=83  Score=28.67  Aligned_cols=51  Identities=22%  Similarity=0.381  Sum_probs=37.2

Q ss_pred             ccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736           4 LFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY   62 (346)
Q Consensus         4 ~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~   62 (346)
                      .++.+.+||-..+..+..+..       |...++.+.+. +....+.++++.||.+++|
T Consensus       176 ~~~~~~~~~~~~~~~~~~~~~-------~~~~v~~~~~~-~~~~~~~~~~~~d~~i~~w  226 (466)
T COG2319         176 LDGTIKLWDLRTGKPLSTLAG-------HTDPVSSLAFS-PDGGLLIASGSSDGTIRLW  226 (466)
T ss_pred             CCCceEEEEcCCCceEEeecc-------CCCceEEEEEc-CCcceEEEEecCCCcEEEE
Confidence            378999999998766665543       45688999988 3333354545899999988


No 218
>KOG0649|consensus
Probab=34.72  E-value=73  Score=30.01  Aligned_cols=55  Identities=18%  Similarity=0.168  Sum_probs=37.9

Q ss_pred             CcccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCC
Q psy5736           2 TGLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNY   65 (346)
Q Consensus         2 ~~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~   65 (346)
                      .|-++.+.-||.|+|+---.|.       -|.-.+-++-.=|. ... .++|+.||.+|+|..-
T Consensus       132 AgGD~~~y~~dlE~G~i~r~~r-------GHtDYvH~vv~R~~-~~q-ilsG~EDGtvRvWd~k  186 (325)
T KOG0649|consen  132 AGGDGVIYQVDLEDGRIQREYR-------GHTDYVHSVVGRNA-NGQ-ILSGAEDGTVRVWDTK  186 (325)
T ss_pred             ecCCeEEEEEEecCCEEEEEEc-------CCcceeeeeeeccc-Ccc-eeecCCCccEEEEecc
Confidence            4668999999999996555552       25556666553333 233 3689999999999543


No 219
>PF13176 TPR_7:  Tetratricopeptide repeat; PDB: 3SF4_C 3RO3_A 3RO2_A.
Probab=34.66  E-value=1e+02  Score=19.14  Aligned_cols=22  Identities=9%  Similarity=0.119  Sum_probs=16.5

Q ss_pred             hccccHHHHHHHHHHHHHHhhc
Q psy5736         287 MNTSCFREAVESLTKFSDQIHE  308 (346)
Q Consensus       287 ~~~~~~~~ai~~l~~~l~~~~~  308 (346)
                      ...|++++|++++++.|..-.+
T Consensus        10 ~~~g~~~~Ai~~y~~aL~l~~~   31 (36)
T PF13176_consen   10 RQQGDYEKAIEYYEQALALARD   31 (36)
T ss_dssp             HHCT-HHHHHHHHHHHHHHHHH
T ss_pred             HHcCCHHHHHHHHHHHHHhccc
Confidence            4578999999999998865443


No 220
>PRK10454 PTS system N,N'-diacetylchitobiose-specific transporter subunit IIA; Provisional
Probab=34.64  E-value=52  Score=27.09  Aligned_cols=32  Identities=13%  Similarity=0.208  Sum_probs=23.8

Q ss_pred             HHHHHHHHHHHHhccccHHHHHHHHHHHHHHh
Q psy5736         275 ELNEMFYRGIEQMNTSCFREAVESLTKFSDQI  306 (346)
Q Consensus       275 ~~~~~~~~a~~~~~~~~~~~ai~~l~~~l~~~  306 (346)
                      .+...+-+|++.++.|++++|-+.+.+.-+.+
T Consensus        30 ~ArS~~~eAl~~Ak~gdfe~A~~~l~eA~e~l   61 (115)
T PRK10454         30 QARSLAYAALKQAKQGDFAAAKAMMDQSRMAL   61 (115)
T ss_pred             HHHHHHHHHHHHHHhCCHHHHHHHHHHHHHHH
Confidence            45566778899999999999877776655443


No 221
>KOG0647|consensus
Probab=34.49  E-value=65  Score=31.11  Aligned_cols=58  Identities=12%  Similarity=0.204  Sum_probs=37.7

Q ss_pred             cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCCcc
Q psy5736           3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNYAS   67 (346)
Q Consensus         3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~~s   67 (346)
                      .-+|+|++|+.+..-++.- .-++    .|...+-+..+.+  |.+...+|+.||.+++|..-.+
T Consensus        47 SWD~tVR~wevq~~g~~~~-ka~~----~~~~PvL~v~Wsd--dgskVf~g~~Dk~~k~wDL~S~  104 (347)
T KOG0647|consen   47 SWDGTVRIWEVQNSGQLVP-KAQQ----SHDGPVLDVCWSD--DGSKVFSGGCDKQAKLWDLASG  104 (347)
T ss_pred             ccCCceEEEEEecCCcccc-hhhh----ccCCCeEEEEEcc--CCceEEeeccCCceEEEEccCC
Confidence            4589999999988534432 1111    2444555555543  4466678999999999976644


No 222
>PF12773 DZR:  Double zinc ribbon
Probab=34.40  E-value=32  Score=23.23  Aligned_cols=41  Identities=22%  Similarity=0.542  Sum_probs=25.2

Q ss_pred             cCCCCCCCccccccccCccccccCCcCcCCcccccccCCcccccCcccccccCCcCCh
Q psy5736         209 CRACVENWPLMESLEKYPIRIRCSNDNCGQIIATVKKLEPSAKKVEKKCESCNSTSDL  266 (346)
Q Consensus       209 C~~C~~~~~~~~~~~~~~~~~~C~~~~C~g~~~~~~~~~~~~~~~~~~C~~C~~~~~~  266 (346)
                      |+.|..+-+        ..+-.|+  .|+..+....       ...+.|+.|+..+..
T Consensus         1 Cp~Cg~~~~--------~~~~fC~--~CG~~l~~~~-------~~~~~C~~Cg~~~~~   41 (50)
T PF12773_consen    1 CPHCGTPNP--------DDAKFCP--HCGTPLPPPD-------QSKKICPNCGAENPP   41 (50)
T ss_pred             CCCcCCcCC--------ccccCCh--hhcCChhhcc-------CCCCCCcCCcCCCcC
Confidence            566766432        2456788  8887665111       124889999987553


No 223
>TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol.
Probab=34.09  E-value=1.1e+02  Score=27.37  Aligned_cols=59  Identities=7%  Similarity=-0.023  Sum_probs=30.6

Q ss_pred             ccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736           4 LFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYR   63 (346)
Q Consensus         4 ~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r   63 (346)
                      .+|+|.+||.++++.+..+.-..+...........+. ++++...+.++...++.+.+|.
T Consensus       177 ~~~~v~i~d~~~~~~~~~~~~~~~~~~~~~~~~~~i~-~s~dg~~~~~~~~~~~~i~v~d  235 (300)
T TIGR03866       177 IGGTVSVIDVATRKVIKKITFEIPGVHPEAVQPVGIK-LTKDGKTAFVALGPANRVAVVD  235 (300)
T ss_pred             CCCEEEEEEcCcceeeeeeeecccccccccCCccceE-ECCCCCEEEEEcCCCCeEEEEE
Confidence            4689999999998766554322111111111122333 2334444444445566788774


No 224
>PF01485 IBR:  IBR domain;  InterPro: IPR002867 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target.  This entry represents a cysteine-rich (C6HC) zinc finger domain that is present in Triad1, and which is conserved in other proteins encoded by various eukaryotes. The C6HC consensus pattern is:  C-x(4)-C-x(14-30)-C-x(1-4)-C-x(4)-C-x(2)-C-x(4)-H-x(4)-C  The C6HC zinc finger motif is the fourth family member of the zinc-binding RING, LIM, and LAP/PHD fingers. Strikingly, in most of the proteins the C6HC domain is flanked by two RING finger structures IPR001841 from INTERPRO. The novel C6HC motif has been called DRIL (double RING finger linked). The strong conservation of the larger tripartite TRIAD (twoRING fingers and DRIL) structure indicates that the three subdomains are functionally linked and identifies a novel class of proteins []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding; PDB: 2CT7_A 1WD2_A 2JMO_A 1WIM_A.
Probab=34.07  E-value=27  Score=24.49  Aligned_cols=31  Identities=26%  Similarity=0.506  Sum_probs=14.5

Q ss_pred             cccCCcCcCCcccccccCCcccccCcccccccCCc
Q psy5736         229 IRCSNDNCGQIIATVKKLEPSAKKVEKKCESCNST  263 (346)
Q Consensus       229 ~~C~~~~C~g~~~~~~~~~~~~~~~~~~C~~C~~~  263 (346)
                      ..||++.|..++........    ....|+.|+..
T Consensus        19 ~~Cp~~~C~~~~~~~~~~~~----~~~~C~~C~~~   49 (64)
T PF01485_consen   19 RWCPNPDCEYIIEKDDGCNS----PIVTCPSCGTE   49 (64)
T ss_dssp             C--TTSST---ECS-SSTTS------CCTTSCCSE
T ss_pred             cCCCCCCCcccEEecCCCCC----CeeECCCCCCc
Confidence            48998889887765543211    12678877764


No 225
>PF13428 TPR_14:  Tetratricopeptide repeat
Probab=33.97  E-value=93  Score=20.14  Aligned_cols=29  Identities=14%  Similarity=-0.145  Sum_probs=22.5

Q ss_pred             HHHHhccccHHHHHHHHHHHHHHhhcccCCCcHh
Q psy5736         283 GIEQMNTSCFREAVESLTKFSDQIHELVVPPYKL  316 (346)
Q Consensus       283 a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~  316 (346)
                      |...+..|++++|++.|++.++..     |.++.
T Consensus         8 a~~~~~~G~~~~A~~~~~~~l~~~-----P~~~~   36 (44)
T PF13428_consen    8 ARAYRRLGQPDEAERLLRRALALD-----PDDPE   36 (44)
T ss_pred             HHHHHHcCCHHHHHHHHHHHHHHC-----cCCHH
Confidence            344567899999999999999864     66654


No 226
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=33.30  E-value=11  Score=32.07  Aligned_cols=10  Identities=20%  Similarity=0.584  Sum_probs=6.0

Q ss_pred             ccccccCCcC
Q psy5736         255 KKCESCNSTS  264 (346)
Q Consensus       255 ~~C~~C~~~~  264 (346)
                      ..|+.||..+
T Consensus       124 f~Cp~Cg~~l  133 (147)
T smart00531      124 FTCPRCGEEL  133 (147)
T ss_pred             EECCCCCCEE
Confidence            5666666654


No 227
>smart00834 CxxC_CXXC_SSSS Putative regulatory protein. CxxC_CXXC_SSSS represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=32.58  E-value=31  Score=22.10  Aligned_cols=31  Identities=23%  Similarity=0.722  Sum_probs=16.7

Q ss_pred             ccccCCcCcCCcccccccCCcccccCcccccccCCc
Q psy5736         228 RIRCSNDNCGQIIATVKKLEPSAKKVEKKCESCNST  263 (346)
Q Consensus       228 ~~~C~~~~C~g~~~~~~~~~~~~~~~~~~C~~C~~~  263 (346)
                      .++|+  .|+..+.......   ......|+.||..
T Consensus         5 ~y~C~--~Cg~~fe~~~~~~---~~~~~~CP~Cg~~   35 (41)
T smart00834        5 EYRCE--DCGHTFEVLQKIS---DDPLATCPECGGD   35 (41)
T ss_pred             EEEcC--CCCCEEEEEEecC---CCCCCCCCCCCCc
Confidence            46777  7776443221110   0224778899874


No 228
>TIGR03504 FimV_Cterm FimV C-terminal domain. This protein is found at the extreme C-terminus of FimV from Pseudomonas aeruginosa, and of TspA of Neisseria meningitidis. Disruption of the former blocks twitching motility from type IV pili; Semmler, et al. suggest a role in peptidoglycan layer remodelling required by type IV fimbrial systems.
Probab=32.37  E-value=1.4e+02  Score=20.06  Aligned_cols=37  Identities=14%  Similarity=0.027  Sum_probs=25.4

Q ss_pred             HHHHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHH
Q psy5736         283 GIEQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALR  325 (346)
Q Consensus       283 a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~  325 (346)
                      |...++.|+.+.|++.+++.+..      .+.+.-..|+..|.
T Consensus         6 A~ayie~Gd~e~Ar~lL~evl~~------~~~~q~~eA~~LL~   42 (44)
T TIGR03504         6 ARAYIEMGDLEGARELLEEVIEE------GDEAQRQEARALLA   42 (44)
T ss_pred             HHHHHHcCChHHHHHHHHHHHHc------CCHHHHHHHHHHHh
Confidence            44567889999999999998852      34445555555554


No 229
>KOG1274|consensus
Probab=32.34  E-value=32  Score=37.55  Aligned_cols=37  Identities=19%  Similarity=0.217  Sum_probs=25.6

Q ss_pred             CcccceEEEEecC------CCceeeeeCCCCCCCCCCCCcccceeccc
Q psy5736           2 TGLFLKEGIWDHI------TGSRLISWRPDRGSWARNSARCTALTLLN   43 (346)
Q Consensus         2 ~~~~~~~~~w~~~------~~~~~~~~~~~~~~w~~~~~~~t~l~~in   43 (346)
                      .+++|.|.||||+      ...++|     ..+|.+....||-+..++
T Consensus       250 s~~~g~I~vWnv~t~~~~~~~~~Vc-----~~aw~p~~n~it~~~~~g  292 (933)
T KOG1274|consen  250 STLDGQILVWNVDTHERHEFKRAVC-----CEAWKPNANAITLITALG  292 (933)
T ss_pred             eccCCcEEEEecccchhccccceeE-----EEecCCCCCeeEEEeecc
Confidence            3689999999999      666666     355766666666555443


No 230
>PF12688 TPR_5:  Tetratrico peptide repeat
Probab=31.97  E-value=2.2e+02  Score=23.38  Aligned_cols=16  Identities=13%  Similarity=-0.045  Sum_probs=8.1

Q ss_pred             cccHHHHHHHHHHHHH
Q psy5736         289 TSCFREAVESLTKFSD  304 (346)
Q Consensus       289 ~~~~~~ai~~l~~~l~  304 (346)
                      .|+.++|+..|++.+.
T Consensus        14 ~G~~~~Ai~~Y~~Al~   29 (120)
T PF12688_consen   14 LGREEEAIPLYRRALA   29 (120)
T ss_pred             cCCHHHHHHHHHHHHH
Confidence            4455555555555544


No 231
>KOG0321|consensus
Probab=31.92  E-value=61  Score=34.21  Aligned_cols=60  Identities=20%  Similarity=0.267  Sum_probs=34.8

Q ss_pred             cccceEEEEecCCCc--eeeeeC-------CC-----------CCCCCCCCCcccc-eeccccCCCceEEeeec-CCceE
Q psy5736           3 GLFLKEGIWDHITGS--RLISWR-------PD-----------RGSWARNSARCTA-LTLLNEHDDSLLLTGYD-DGSLS   60 (346)
Q Consensus         3 ~~~~~~~~w~~~~~~--~~~~~~-------~~-----------~~~w~~~~~~~t~-l~~ine~~~~lll~g~~-dG~vr   60 (346)
                      |-+|.|-|||..-..  .+-.|.       |.           .+.|..+..-|++ +..+=-.|+..+.+++. ||.|+
T Consensus       164 gRDg~illWD~R~n~~d~~e~~~~~~~~~~n~~ptpskp~~kr~~k~kA~s~ti~ssvTvv~fkDe~tlaSaga~D~~iK  243 (720)
T KOG0321|consen  164 GRDGEILLWDCRCNGVDALEEFDNRIYGRHNTAPTPSKPLKKRIRKWKAASNTIFSSVTVVLFKDESTLASAGAADSTIK  243 (720)
T ss_pred             cCCCcEEEEEEeccchhhHHHHhhhhhccccCCCCCCchhhccccccccccCceeeeeEEEEEeccceeeeccCCCcceE
Confidence            678999999995443  222222       22           1226666666655 22222245555555554 99999


Q ss_pred             EE
Q psy5736          61 VY   62 (346)
Q Consensus        61 i~   62 (346)
                      +|
T Consensus       244 VW  245 (720)
T KOG0321|consen  244 VW  245 (720)
T ss_pred             EE
Confidence            99


No 232
>KOG0295|consensus
Probab=31.53  E-value=84  Score=31.06  Aligned_cols=51  Identities=22%  Similarity=0.272  Sum_probs=37.0

Q ss_pred             ccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736           4 LFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYR   63 (346)
Q Consensus         4 ~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r   63 (346)
                      -+++|.+||..++.-|-...  .     |..=|..+-|.  +.+.+++...+|+.+|+|.
T Consensus       312 rDktIk~wdv~tg~cL~tL~--g-----hdnwVr~~af~--p~Gkyi~ScaDDktlrvwd  362 (406)
T KOG0295|consen  312 RDKTIKIWDVSTGMCLFTLV--G-----HDNWVRGVAFS--PGGKYILSCADDKTLRVWD  362 (406)
T ss_pred             ccceEEEEeccCCeEEEEEe--c-----ccceeeeeEEc--CCCeEEEEEecCCcEEEEE
Confidence            47899999999996655442  1     44555554432  4578889999999999993


No 233
>KOG2376|consensus
Probab=31.42  E-value=83  Score=33.09  Aligned_cols=67  Identities=13%  Similarity=0.092  Sum_probs=50.5

Q ss_pred             HHHHHHHHHHhccccHHHHHHHHHHHHHHhhcccCCC-------cHhHHHHHHHHHHHHHHhCCccccCccccc
Q psy5736         277 NEMFYRGIEQMNTSCFREAVESLTKFSDQIHELVVPP-------YKLASLAHEALRNCWSLAGNKWVIPENYSQ  343 (346)
Q Consensus       277 ~~~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~-------h~~~~~~~~~L~~~y~~~G~~~~~~~~~~~  343 (346)
                      +..|+.|...++.|++.+|++.|++.++...+-|-..       --.+..++-.|+..+-.+|.+-..-++|.+
T Consensus       176 el~yN~Ac~~i~~gky~qA~elL~kA~~~~~e~l~~~d~~eEeie~el~~IrvQlayVlQ~~Gqt~ea~~iy~~  249 (652)
T KOG2376|consen  176 ELLYNTACILIENGKYNQAIELLEKALRICREKLEDEDTNEEEIEEELNPIRVQLAYVLQLQGQTAEASSIYVD  249 (652)
T ss_pred             HHHHHHHHHHHhcccHHHHHHHHHHHHHHHHHhhcccccchhhHHHHHHHHHHHHHHHHHHhcchHHHHHHHHH
Confidence            4568889999999999999999999987777655432       124666777888888888988655554444


No 234
>KOG0316|consensus
Probab=31.29  E-value=82  Score=29.55  Aligned_cols=69  Identities=13%  Similarity=0.163  Sum_probs=48.1

Q ss_pred             ccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCCccccchhhhhhhhhcccc
Q psy5736           4 LFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNYASHEHRLVTSFQALTDTS   83 (346)
Q Consensus         4 ~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~~sh~~~Lv~a~r~L~~~~   83 (346)
                      .+|++..||...|+....+         .+..||++++-  .|....|+|+-|+.+|+...   .-.+|....+..+.++
T Consensus       163 ~DGtvRtydiR~G~l~sDy---------~g~pit~vs~s--~d~nc~La~~l~stlrLlDk---~tGklL~sYkGhkn~e  228 (307)
T KOG0316|consen  163 VDGTVRTYDIRKGTLSSDY---------FGHPITSVSFS--KDGNCSLASSLDSTLRLLDK---ETGKLLKSYKGHKNME  228 (307)
T ss_pred             cCCcEEEEEeecceeehhh---------cCCcceeEEec--CCCCEEEEeeccceeeeccc---chhHHHHHhcccccce
Confidence            5899999999999766643         24457777754  46777888998998887632   2246666666666666


Q ss_pred             ccc
Q psy5736          84 FVN   86 (346)
Q Consensus        84 y~~   86 (346)
                      |..
T Consensus       229 ykl  231 (307)
T KOG0316|consen  229 YKL  231 (307)
T ss_pred             eee
Confidence            653


No 235
>KOG1407|consensus
Probab=31.16  E-value=70  Score=30.33  Aligned_cols=55  Identities=13%  Similarity=0.146  Sum_probs=36.8

Q ss_pred             ccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeC
Q psy5736           4 LFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRN   64 (346)
Q Consensus         4 ~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~   64 (346)
                      .++++.||+-+..+++....  +.   .|+.-+-.|-.--.+++ ++++++.|..+|+|..
T Consensus        40 ~dktv~v~n~e~~r~~~~~~--~~---gh~~svdql~w~~~~~d-~~atas~dk~ir~wd~   94 (313)
T KOG1407|consen   40 FDKTVSVWNLERDRFRKELV--YR---GHTDSVDQLCWDPKHPD-LFATASGDKTIRIWDI   94 (313)
T ss_pred             cCCceEEEEecchhhhhhhc--cc---CCCcchhhheeCCCCCc-ceEEecCCceEEEEEe
Confidence            46899999999885544332  22   25555655554444444 5588999999999943


No 236
>PF12862 Apc5:  Anaphase-promoting complex subunit 5
Probab=30.86  E-value=2.3e+02  Score=21.70  Aligned_cols=51  Identities=14%  Similarity=0.158  Sum_probs=36.2

Q ss_pred             HHHhccccHHHHHHHHHHHHHHhhcccCCC-cHhHHHHHHHHHHHHHHhCCc
Q psy5736         284 IEQMNTSCFREAVESLTKFSDQIHELVVPP-YKLASLAHEALRNCWSLAGNK  334 (346)
Q Consensus       284 ~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~-h~~~~~~~~~L~~~y~~~G~~  334 (346)
                      ...+..+++.+|++.+.++.+....-..+. ...+..+...++......|+.
T Consensus         6 ~~~~~~~dy~~A~d~L~~~fD~~~~~~~~~~~~~~~~all~lA~~~~~~G~~   57 (94)
T PF12862_consen    6 LNALRSGDYSEALDALHRYFDYAKQSNNSSSNSGLAYALLNLAELHRRFGHY   57 (94)
T ss_pred             HHHHHcCCHHHHHHHHHHHHHHHhhcccchhhHHHHHHHHHHHHHHHHhCCH
Confidence            455678899999999999888777666555 233445556677777777764


No 237
>PF14853 Fis1_TPR_C:  Fis1 C-terminal tetratricopeptide repeat; PDB: 1IYG_A 1PC2_A 1NZN_A 3UUX_C 1Y8M_A 2PQR_A 2PQN_A 3O48_A.
Probab=30.78  E-value=1.9e+02  Score=20.19  Aligned_cols=47  Identities=13%  Similarity=0.176  Sum_probs=34.2

Q ss_pred             HHHHHHHHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHH
Q psy5736         279 MFYRGIEQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSL  330 (346)
Q Consensus       279 ~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~  330 (346)
                      +|--|+.....|++++|+++...+|+.=     |.|.-....+..+..-...
T Consensus         4 lY~lAig~ykl~~Y~~A~~~~~~lL~~e-----P~N~Qa~~L~~~i~~~i~k   50 (53)
T PF14853_consen    4 LYYLAIGHYKLGEYEKARRYCDALLEIE-----PDNRQAQSLKELIEDKIQK   50 (53)
T ss_dssp             HHHHHHHHHHTT-HHHHHHHHHHHHHHT-----TS-HHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHhhhHHHHHHHHHHHHhhC-----CCcHHHHHHHHHHHHHHhc
Confidence            3455677788899999999999999864     8888877777777655443


No 238
>PF10058 DUF2296:  Predicted integral membrane metal-binding protein (DUF2296);  InterPro: IPR019273  This domain, found mainly in the eukaryotic lunapark proteins, has no known function []. 
Probab=30.74  E-value=29  Score=24.50  Aligned_cols=33  Identities=21%  Similarity=0.534  Sum_probs=21.9

Q ss_pred             CccccccCCcCcC--CcccccccCCcccccCcccccccCCc
Q psy5736         225 YPIRIRCSNDNCG--QIIATVKKLEPSAKKVEKKCESCNST  263 (346)
Q Consensus       225 ~~~~~~C~~~~C~--g~~~~~~~~~~~~~~~~~~C~~C~~~  263 (346)
                      ...++.|+  +|.  +++.|..+..    ...|.|+.||..
T Consensus        19 ~r~aLIC~--~C~~hNGla~~~~~~----~i~y~C~~Cg~~   53 (54)
T PF10058_consen   19 NRYALICS--KCFSHNGLAPKEEFE----EIQYRCPYCGAL   53 (54)
T ss_pred             CceeEECc--ccchhhcccccccCC----ceEEEcCCCCCc
Confidence            35789999  895  4555533221    246999999863


No 239
>KOG0547|consensus
Probab=30.55  E-value=96  Score=32.06  Aligned_cols=59  Identities=14%  Similarity=0.141  Sum_probs=42.3

Q ss_pred             HHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCCccccCc
Q psy5736         273 LSELNEMFYRGIEQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGNKWVIPE  339 (346)
Q Consensus       273 l~~~~~~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~~~~~~~  339 (346)
                      +..+.++-.++-.....++|++||++|...|+...+-     +.+   ..-++.||..+|++....|
T Consensus       112 ~k~A~~lK~~GN~~f~~kkY~eAIkyY~~AI~l~p~e-----piF---YsNraAcY~~lgd~~~Vie  170 (606)
T KOG0547|consen  112 LKYAAALKTKGNKFFRNKKYDEAIKYYTQAIELCPDE-----PIF---YSNRAACYESLGDWEKVIE  170 (606)
T ss_pred             HHHHHHHHhhhhhhhhcccHHHHHHHHHHHHhcCCCC-----chh---hhhHHHHHHHHhhHHHHHH
Confidence            4555555566777788999999999999999987553     222   2456788888888755433


No 240
>TIGR01206 lysW lysine biosynthesis protein LysW. This very small, poorly characterized protein has been shown essential in Thermus thermophilus for an unusual pathway of Lys biosynthesis from aspartate by way of alpha-aminoadipate (AAA) rather than diaminopimelate. It is found also in Deinococcus radiodurans and Pyrococcus horikoshii, which appear to share the AAA pathway.
Probab=30.42  E-value=35  Score=24.19  Aligned_cols=31  Identities=19%  Similarity=0.486  Sum_probs=20.4

Q ss_pred             cccCCcCcCCcccccccCCcccccCcccccccCCcCC
Q psy5736         229 IRCSNDNCGQIIATVKKLEPSAKKVEKKCESCNSTSD  265 (346)
Q Consensus       229 ~~C~~~~C~g~~~~~~~~~~~~~~~~~~C~~C~~~~~  265 (346)
                      +.||  .|+..+..+.+.    ......|+.||..+.
T Consensus         3 ~~CP--~CG~~iev~~~~----~GeiV~Cp~CGaele   33 (54)
T TIGR01206         3 FECP--DCGAEIELENPE----LGELVICDECGAELE   33 (54)
T ss_pred             cCCC--CCCCEEecCCCc----cCCEEeCCCCCCEEE
Confidence            5788  888776554321    134578999998653


No 241
>COG1594 RPB9 DNA-directed RNA polymerase, subunit M/Transcription elongation factor TFIIS [Transcription]
Probab=30.42  E-value=27  Score=28.60  Aligned_cols=32  Identities=22%  Similarity=0.532  Sum_probs=23.3

Q ss_pred             ccCCcCcCCcccccccCCcccccCcccccccCCcCChH
Q psy5736         230 RCSNDNCGQIIATVKKLEPSAKKVEKKCESCNSTSDLT  267 (346)
Q Consensus       230 ~C~~~~C~g~~~~~~~~~~~~~~~~~~C~~C~~~~~~~  267 (346)
                      .||  .|++.+.|.....    +..++|.+||......
T Consensus         4 FCp--~Cgsll~p~~~~~----~~~l~C~kCgye~~~~   35 (113)
T COG1594           4 FCP--KCGSLLYPKKDDE----GGKLVCRKCGYEEEAS   35 (113)
T ss_pred             ccC--CccCeeEEeEcCC----CcEEECCCCCcchhcc
Confidence            588  9999998875421    2369999999975543


No 242
>KOG0284|consensus
Probab=30.42  E-value=37  Score=33.95  Aligned_cols=58  Identities=21%  Similarity=0.243  Sum_probs=37.7

Q ss_pred             cceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCCccccch
Q psy5736           5 FLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNYASHEHR   71 (346)
Q Consensus         5 ~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~~sh~~~   71 (346)
                      +|.|.+|+=.-.--.. +  +.    -|...|++|.|-  +.|+-.+++++||.|+||.-.+.++.+
T Consensus       159 gG~iKyWqpnmnnVk~-~--~a----hh~eaIRdlafS--pnDskF~t~SdDg~ikiWdf~~~kee~  216 (464)
T KOG0284|consen  159 GGMIKYWQPNMNNVKI-I--QA----HHAEAIRDLAFS--PNDSKFLTCSDDGTIKIWDFRMPKEER  216 (464)
T ss_pred             CceEEecccchhhhHH-h--hH----hhhhhhheeccC--CCCceeEEecCCCeEEEEeccCCchhh
Confidence            6788888765331111 1  11    244678888753  478888999999999999655545433


No 243
>KOG0543|consensus
Probab=30.27  E-value=1.5e+02  Score=29.77  Aligned_cols=49  Identities=16%  Similarity=0.140  Sum_probs=37.0

Q ss_pred             HHHHHHHHHHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHH
Q psy5736         277 NEMFYRGIEQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSL  330 (346)
Q Consensus       277 ~~~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~  330 (346)
                      ..+|+++...+..++++.|+..|++.++..     |.|.....-.-.|.+.+.+
T Consensus       292 KALyRrG~A~l~~~e~~~A~~df~ka~k~~-----P~Nka~~~el~~l~~k~~~  340 (397)
T KOG0543|consen  292 KALYRRGQALLALGEYDLARDDFQKALKLE-----PSNKAARAELIKLKQKIRE  340 (397)
T ss_pred             hHHHHHHHHHHhhccHHHHHHHHHHHHHhC-----CCcHHHHHHHHHHHHHHHH
Confidence            467888888899999999999999999876     7777655555555554443


No 244
>KOG1036|consensus
Probab=30.10  E-value=55  Score=31.51  Aligned_cols=45  Identities=24%  Similarity=0.269  Sum_probs=33.7

Q ss_pred             CcccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeec
Q psy5736           2 TGLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYD   55 (346)
Q Consensus         2 ~~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~   55 (346)
                      .|-+|-|.+||-...+||+.|.--       .+.|.+|-| + +|..++.+|++
T Consensus       250 gGsDG~V~~Wd~~~rKrl~q~~~~-------~~SI~slsf-s-~dG~~LAia~s  294 (323)
T KOG1036|consen  250 GGSDGIVNIWDLFNRKRLKQLAKY-------ETSISSLSF-S-MDGSLLAIASS  294 (323)
T ss_pred             cCCCceEEEccCcchhhhhhccCC-------CCceEEEEe-c-cCCCeEEEEec
Confidence            377999999999999999987422       355777664 2 67777787773


No 245
>PF14828 Amnionless:  Amnionless
Probab=30.10  E-value=29  Score=35.19  Aligned_cols=31  Identities=19%  Similarity=0.530  Sum_probs=23.0

Q ss_pred             ccc--ccCCcCcCCcccccccCCcccccCcccccccCCcCChH
Q psy5736         227 IRI--RCSNDNCGQIIATVKKLEPSAKKVEKKCESCNSTSDLT  267 (346)
Q Consensus       227 ~~~--~C~~~~C~g~~~~~~~~~~~~~~~~~~C~~C~~~~~~~  267 (346)
                      ..+  +|+.+.|..++.|..          -.|..||..+..+
T Consensus       207 s~v~~~C~~~~C~~pl~P~G----------hCC~iCGa~v~~~  239 (437)
T PF14828_consen  207 SNVLQRCPKPHCRSPLRPEG----------HCCPICGAIVTLE  239 (437)
T ss_pred             HHhhCcCCCCccCCCCCCCC----------CchhhcceEEEEe
Confidence            445  899999988877652          6789999876543


No 246
>PF02748 PyrI_C:  Aspartate carbamoyltransferase regulatory chain, metal binding domain;  InterPro: IPR020542 Aspartate carbamoyltransferase (aspartate transcarbamylase, ATCase) 2.1.3.2 from EC is an allosteric enzyme that plays a central role in the regulation of the pyrimidine pathway in bacteria. The holoenzyme is a dodecamer composed of six catalytic chains, each with an active site, and six regulatory chains lacking catalytic activity []. The catalytic subunits exist as a dimer of catalytic trimers, (c3)2, while the regulatory subunits exist as a trimer of regulatory dimers, (r2)3, therefore the complete holoenzyme can be represented as (c3)2(r2)3. The association of the catalytic subunits c3 with the regulatory subunits r2 is responsible for the establishment of positive co-operativity between catalytic sites for the binding of aspartate and it dictates the pattern of allosteric response toward nucleotide effectors. ATCase from Escherichia coli is the most extensively studied allosteric enzyme []. The crystal structure of the T-state, the T-state with CTP bound, the R-state with N-phosphonacetyl-L-aspartate (PALA) bound, and the R-state with phosphonoacetamide plus malonate bound have been used in interpreting kinetic and mutational studies. A high-resolution structure of E. coli ATCase in the presence of PALA (a bisubstrate analog) allows a detailed description of the binding at the active site of the enzyme and allows a detailed model of the tetrahedral intermediate to be constructed. The entire regulatory chain has been traced showing that the N-terminal regions of the regulatory chains R1 and R6 are located in close proximity to each other and to the regulatory site. This portion of the molecule may be involved in the observed asymmetry between the regulatory binding sites as well as in the heterotropic response of the enzyme []. The C-terminal domain of the regulatory chains have a rubredoxin-like zinc-bound fold.  ATCase from Enterobacter agglomerans (Erwinia herbicola) (Pantoea agglomerans) differs from the other investigated enterobacterial ATCases by its absence of homotropic co-operativity toward the substrate aspartate and its lack of response to ATP which is an allosteric effector (activator) of this family of enzymes. Nevertheless, the E. herbicola ATCase has the same quaternary structure, two trimers of catalytic chains with three dimers of regulatory chains, (c3)2(r2)3, as other enterobacterial ATCases and shows extensive primary structure conservation [].  This entry represents the C-terminal domain.; PDB: 2YWW_B 1SKU_D 1Q95_L 8ATC_B 3AT1_D 1RAI_D 4E2F_D 1NBE_B 6AT1_B 2FZC_D ....
Probab=30.05  E-value=45  Score=23.37  Aligned_cols=17  Identities=24%  Similarity=0.579  Sum_probs=11.7

Q ss_pred             CcccccccCCcCChHHH
Q psy5736         253 VEKKCESCNSTSDLTEI  269 (346)
Q Consensus       253 ~~~~C~~C~~~~~~~~~  269 (346)
                      ....|.-|++.+..+++
T Consensus        34 ~~~rC~YCe~~~~~~eI   50 (52)
T PF02748_consen   34 IKLRCHYCERIITEDEI   50 (52)
T ss_dssp             CEEEETTT--EEEHHHH
T ss_pred             CEEEeeCCCCEecccEE
Confidence            45899999998887664


No 247
>PF13424 TPR_12:  Tetratricopeptide repeat; PDB: 3RO2_A 3Q15_A 3ASG_A 3ASD_A 3AS5_A 3AS4_A 3ASH_B 4A1S_B 3CEQ_B 3EDT_H ....
Probab=29.81  E-value=1.4e+02  Score=21.44  Aligned_cols=26  Identities=23%  Similarity=0.186  Sum_probs=19.6

Q ss_pred             HHHHhccccHHHHHHHHHHHHHHhhc
Q psy5736         283 GIEQMNTSCFREAVESLTKFSDQIHE  308 (346)
Q Consensus       283 a~~~~~~~~~~~ai~~l~~~l~~~~~  308 (346)
                      |.-....|++++|++++++.++.+++
T Consensus        53 g~~~~~~g~~~~A~~~~~~al~i~~k   78 (78)
T PF13424_consen   53 GECYYRLGDYEEALEYYQKALDIFEK   78 (78)
T ss_dssp             HHHHHHTTHHHHHHHHHHHHHHHHHH
T ss_pred             HHHHHHcCCHHHHHHHHHHHHhhhcC
Confidence            33345678899999999999887753


No 248
>COG2319 FOG: WD40 repeat [General function prediction only]
Probab=29.62  E-value=1.2e+02  Score=27.55  Aligned_cols=54  Identities=24%  Similarity=0.370  Sum_probs=37.0

Q ss_pred             cccceEEEEecCC-CceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeec-CCceEEEeCC
Q psy5736           3 GLFLKEGIWDHIT-GSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYD-DGSLSVYRNY   65 (346)
Q Consensus         3 ~~~~~~~~w~~~~-~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~-dG~vri~r~~   65 (346)
                      +.++.+.+||.+. ......+..       |...|+.+.+.....  .+..++. ||.+++|..-
T Consensus       131 ~~d~~~~~~~~~~~~~~~~~~~~-------~~~~v~~~~~~~~~~--~~~~~~~~~~~~~~~~~~  186 (466)
T COG2319         131 SLDGTVKLWDLSTPGKLIRTLEG-------HSESVTSLAFSPDGK--LLASGSSLDGTIKLWDLR  186 (466)
T ss_pred             CCCccEEEEEecCCCeEEEEEec-------CcccEEEEEECCCCC--EEEecCCCCCceEEEEcC
Confidence            4488999999998 555544432       456788777665433  5566665 9999999544


No 249
>PF12895 Apc3:  Anaphase-promoting complex, cyclosome, subunit 3; PDB: 3KAE_D 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2XPI_A 3ULQ_A.
Probab=29.51  E-value=1.2e+02  Score=22.27  Aligned_cols=33  Identities=12%  Similarity=0.088  Sum_probs=17.6

Q ss_pred             HHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCCc
Q psy5736         300 TKFSDQIHELVVPPYKLASLAHEALRNCWSLAGNK  334 (346)
Q Consensus       300 ~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~~  334 (346)
                      .+.++.+.+  ....+....+...+++|+..+|+.
T Consensus        42 ~~A~~~~~~--~~~~~~~~~~~~l~a~~~~~l~~y   74 (84)
T PF12895_consen   42 EEAIELLQK--LKLDPSNPDIHYLLARCLLKLGKY   74 (84)
T ss_dssp             HHHHHHHHC--HTHHHCHHHHHHHHHHHHHHTT-H
T ss_pred             HHHHHHHHH--hCCCCCCHHHHHHHHHHHHHhCCH
Confidence            444444444  222233355666778888888763


No 250
>KOG0292|consensus
Probab=29.24  E-value=44  Score=36.78  Aligned_cols=49  Identities=16%  Similarity=0.301  Sum_probs=41.0

Q ss_pred             cceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736           5 FLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY   62 (346)
Q Consensus         5 ~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~   62 (346)
                      .|+|..|||.=+..+.+|.       .|...+-.+.|+.  +.++.++|++|-.|++|
T Consensus        30 sG~IQlWDYRM~tli~rFd-------eHdGpVRgv~FH~--~qplFVSGGDDykIkVW   78 (1202)
T KOG0292|consen   30 SGVIQLWDYRMGTLIDRFD-------EHDGPVRGVDFHP--TQPLFVSGGDDYKIKVW   78 (1202)
T ss_pred             CceeeeehhhhhhHHhhhh-------ccCCccceeeecC--CCCeEEecCCccEEEEE
Confidence            5899999999999998885       3556777777764  57788999999999998


No 251
>PRK00432 30S ribosomal protein S27ae; Validated
Probab=29.23  E-value=33  Score=23.78  Aligned_cols=26  Identities=27%  Similarity=0.579  Sum_probs=17.0

Q ss_pred             ccccCCcCcCC-cccccccCCcccccCcccccccCCc
Q psy5736         228 RIRCSNDNCGQ-IIATVKKLEPSAKKVEKKCESCNST  263 (346)
Q Consensus       228 ~~~C~~~~C~g-~~~~~~~~~~~~~~~~~~C~~C~~~  263 (346)
                      .-.||  .|+. .+.+..        ..+.|.+||.+
T Consensus        20 ~~fCP--~Cg~~~m~~~~--------~r~~C~~Cgyt   46 (50)
T PRK00432         20 NKFCP--RCGSGFMAEHL--------DRWHCGKCGYT   46 (50)
T ss_pred             cCcCc--CCCcchheccC--------CcEECCCcCCE
Confidence            34798  8976 333322        24999999875


No 252
>KOG0313|consensus
Probab=29.17  E-value=1.2e+02  Score=30.26  Aligned_cols=52  Identities=13%  Similarity=0.181  Sum_probs=39.4

Q ss_pred             ccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCC-CceEEeeecCCceEEEe
Q psy5736           4 LFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHD-DSLLLTGYDDGSLSVYR   63 (346)
Q Consensus         4 ~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~-~~lll~g~~dG~vri~r   63 (346)
                      .+|.+.|||-........-        -|...|++.-.++..+ ...+++|+.|-.+++|+
T Consensus       123 YDg~~riWd~~Gk~~~~~~--------Ght~~ik~v~~v~~n~~~~~fvsas~Dqtl~Lw~  175 (423)
T KOG0313|consen  123 YDGTSRIWDLKGKSIKTIV--------GHTGPIKSVAWVIKNSSSCLFVSASMDQTLRLWK  175 (423)
T ss_pred             cCCeeEEEecCCceEEEEe--------cCCcceeeeEEEecCCccceEEEecCCceEEEEE
Confidence            5899999998855444332        1567898888888755 56788899999999994


No 253
>PF07282 OrfB_Zn_ribbon:  Putative transposase DNA-binding domain;  InterPro: IPR010095 This entry represents a region of a sequence similarity between a family of putative transposases of Thermoanaerobacter tengcongensis, smaller related proteins from Bacillus anthracis, putative transposes described by IPR001959 from INTERPRO, and other proteins. More information about these proteins can be found at Protein of the Month: Transposase [].
Probab=28.95  E-value=58  Score=23.54  Aligned_cols=39  Identities=15%  Similarity=0.311  Sum_probs=25.9

Q ss_pred             cccCccccccCCcCcCCcccccccCCcccccCcccccccCCcCChHH
Q psy5736         222 LEKYPIRIRCSNDNCGQIIATVKKLEPSAKKVEKKCESCNSTSDLTE  268 (346)
Q Consensus       222 ~~~~~~~~~C~~~~C~g~~~~~~~~~~~~~~~~~~C~~C~~~~~~~~  268 (346)
                      ++....+-.|+  .|+.......      ....+.|+.||...+.+.
T Consensus        22 v~~~~TSq~C~--~CG~~~~~~~------~~r~~~C~~Cg~~~~rD~   60 (69)
T PF07282_consen   22 VDEAYTSQTCP--RCGHRNKKRR------SGRVFTCPNCGFEMDRDV   60 (69)
T ss_pred             ECCCCCccCcc--Cccccccccc------ccceEEcCCCCCEECcHH
Confidence            44445777898  8876554311      123599999999977654


No 254
>PRK14890 putative Zn-ribbon RNA-binding protein; Provisional
Probab=28.86  E-value=36  Score=24.61  Aligned_cols=26  Identities=35%  Similarity=0.884  Sum_probs=15.0

Q ss_pred             ecCCCCCCCccccccccCccccccCCcCcCCc
Q psy5736         208 KCRACVENWPLMESLEKYPIRIRCSNDNCGQI  239 (346)
Q Consensus       208 ~C~~C~~~~~~~~~~~~~~~~~~C~~~~C~g~  239 (346)
                      .|..|-..-.   ..+. ...|.||  +|+..
T Consensus         9 ~CtSCg~~i~---~~~~-~~~F~CP--nCG~~   34 (59)
T PRK14890          9 KCTSCGIEIA---PREK-AVKFLCP--NCGEV   34 (59)
T ss_pred             cccCCCCccc---CCCc-cCEeeCC--CCCCe
Confidence            3666655431   2222 4788898  88755


No 255
>KOG0267|consensus
Probab=28.80  E-value=28  Score=37.18  Aligned_cols=50  Identities=12%  Similarity=0.203  Sum_probs=33.7

Q ss_pred             ccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736           4 LFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY   62 (346)
Q Consensus         4 ~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~   62 (346)
                      -+|+|.|||.|+++.+-...       -|...|+++. +++. .-+.+.|++|+.+.+|
T Consensus        90 asgtiK~wDleeAk~vrtLt-------gh~~~~~sv~-f~P~-~~~~a~gStdtd~~iw  139 (825)
T KOG0267|consen   90 ASGTIKVWDLEEAKIVRTLT-------GHLLNITSVD-FHPY-GEFFASGSTDTDLKIW  139 (825)
T ss_pred             cCCceeeeehhhhhhhhhhh-------ccccCcceee-eccc-eEEeccccccccceeh
Confidence            46899999999886443221       1456788887 4432 2333778888888887


No 256
>KOG4283|consensus
Probab=28.60  E-value=96  Score=30.01  Aligned_cols=59  Identities=10%  Similarity=0.040  Sum_probs=42.3

Q ss_pred             CcccceEEEEecCCCc----------eeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736           2 TGLFLKEGIWDHITGS----------RLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYR   63 (346)
Q Consensus         2 ~~~~~~~~~w~~~~~~----------~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r   63 (346)
                      .|-+|.|.|||.++..          .+|..  .++.=..|+=-|+++.+.- +|..|..+++=|..+++|.
T Consensus        62 Ggadgsi~v~Dl~n~t~~e~s~li~k~~c~v--~~~h~~~Hky~iss~~WyP-~DtGmFtssSFDhtlKVWD  130 (397)
T KOG4283|consen   62 GGADGSIAVFDLQNATDYEASGLIAKHKCIV--AKQHENGHKYAISSAIWYP-IDTGMFTSSSFDHTLKVWD  130 (397)
T ss_pred             cCCCccEEEEEeccccchhhccceeheeeec--cccCCccceeeeeeeEEee-ecCceeecccccceEEEee
Confidence            3779999999998765          22322  2332245777799999875 5666768788899999994


No 257
>PRK11827 hypothetical protein; Provisional
Probab=28.58  E-value=31  Score=24.99  Aligned_cols=34  Identities=21%  Similarity=0.228  Sum_probs=23.5

Q ss_pred             cccccCCcCcCCcccccccCCcccccCcccccccCCcCChHH
Q psy5736         227 IRIRCSNDNCGQIIATVKKLEPSAKKVEKKCESCNSTSDLTE  268 (346)
Q Consensus       227 ~~~~C~~~~C~g~~~~~~~~~~~~~~~~~~C~~C~~~~~~~~  268 (346)
                      .-+.||  .|+|.+....+.      ....|..|+......+
T Consensus         7 eILaCP--~ckg~L~~~~~~------~~Lic~~~~laYPI~d   40 (60)
T PRK11827          7 EIIACP--VCNGKLWYNQEK------QELICKLDNLAFPLRD   40 (60)
T ss_pred             hheECC--CCCCcCeEcCCC------CeEECCccCeeccccC
Confidence            457899  899887654321      2378999998766543


No 258
>PF10083 DUF2321:  Uncharacterized protein conserved in bacteria (DUF2321);  InterPro: IPR016891 This entry is represented by Bacteriophage 'Lactobacillus prophage Lj928', Orf-Ljo1454. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=28.30  E-value=3.5e+02  Score=23.53  Aligned_cols=24  Identities=29%  Similarity=0.535  Sum_probs=16.3

Q ss_pred             cccccccCCcCChHHHHHHHHHHHHH
Q psy5736         254 EKKCESCNSTSDLTEIKTKLSELNEM  279 (346)
Q Consensus       254 ~~~C~~C~~~~~~~~~~~~l~~~~~~  279 (346)
                      ...|..||+..+..+  ..|+.+.++
T Consensus        68 PsYC~~CGkpyPWt~--~~L~aa~el   91 (158)
T PF10083_consen   68 PSYCHNCGKPYPWTE--NALEAANEL   91 (158)
T ss_pred             ChhHHhCCCCCchHH--HHHHHHHHH
Confidence            478999999998754  334444444


No 259
>smart00028 TPR Tetratricopeptide repeats. Repeats present in 4 or more copies in proteins. Contain a minimum of 34 amino acids each and self-associate via a "knobs and holes" mechanism.
Probab=28.20  E-value=97  Score=16.49  Aligned_cols=24  Identities=21%  Similarity=0.189  Sum_probs=17.2

Q ss_pred             HHHHHhccccHHHHHHHHHHHHHH
Q psy5736         282 RGIEQMNTSCFREAVESLTKFSDQ  305 (346)
Q Consensus       282 ~a~~~~~~~~~~~ai~~l~~~l~~  305 (346)
                      .+.-.+..+++++|+..+.+.++.
T Consensus         7 ~a~~~~~~~~~~~a~~~~~~~~~~   30 (34)
T smart00028        7 LGNAYLKLGDYDEALEYYEKALEL   30 (34)
T ss_pred             HHHHHHHHhhHHHHHHHHHHHHcc
Confidence            344456667888888888887764


No 260
>KOG0307|consensus
Probab=28.08  E-value=50  Score=36.83  Aligned_cols=56  Identities=16%  Similarity=0.214  Sum_probs=42.1

Q ss_pred             cccceEEEEecCC---CceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736           3 GLFLKEGIWDHIT---GSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYR   63 (346)
Q Consensus         3 ~~~~~~~~w~~~~---~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r   63 (346)
                      +-+|+|.+||-.+   +.......-.+    .|..-|..|+ +|..-..++..|.+||.|=||.
T Consensus        87 ~edG~I~ly~p~~~~~~~~~~~la~~~----~h~G~V~gLD-fN~~q~nlLASGa~~geI~iWD  145 (1049)
T KOG0307|consen   87 LEDGNIVLYDPASIIANASEEVLATKS----KHTGPVLGLD-FNPFQGNLLASGADDGEILIWD  145 (1049)
T ss_pred             ccCCceEEecchhhccCcchHHHhhhc----ccCCceeeee-ccccCCceeeccCCCCcEEEec
Confidence            3479999999875   33333332222    4778898888 6888888999999999999993


No 261
>smart00858 SAF This domain family includes a range of different proteins. Such as antifreeze proteins and flagellar FlgA proteins, and CpaB pilus proteins.
Probab=28.04  E-value=41  Score=23.49  Aligned_cols=16  Identities=38%  Similarity=0.486  Sum_probs=14.0

Q ss_pred             EEEEeeccCCCCceee
Q psy5736         166 IVKALRPLKPKEVVAE  181 (346)
Q Consensus       166 ~v~a~r~I~~Geel~~  181 (346)
                      +++|.++|++|+.|+-
T Consensus         3 v~va~~~i~~G~~i~~   18 (64)
T smart00858        3 VVVAARDLPAGEVITA   18 (64)
T ss_pred             EEEEeCccCCCCCcch
Confidence            5778899999999985


No 262
>PRK14714 DNA polymerase II large subunit; Provisional
Probab=27.22  E-value=43  Score=38.20  Aligned_cols=12  Identities=17%  Similarity=0.589  Sum_probs=8.6

Q ss_pred             cccccccCCcCC
Q psy5736         254 EKKCESCNSTSD  265 (346)
Q Consensus       254 ~~~C~~C~~~~~  265 (346)
                      ...|+.|+..+.
T Consensus       709 a~~CP~CGtplv  720 (1337)
T PRK14714        709 RVECPRCDVELT  720 (1337)
T ss_pred             cccCCCCCCccc
Confidence            357999987643


No 263
>PF12895 Apc3:  Anaphase-promoting complex, cyclosome, subunit 3; PDB: 3KAE_D 3Q4A_B 2C2L_D 3Q47_B 3Q49_B 2XPI_A 3ULQ_A.
Probab=27.14  E-value=1.2e+02  Score=22.27  Aligned_cols=20  Identities=25%  Similarity=0.250  Sum_probs=13.0

Q ss_pred             HHHHHhccccHHHHHHHHHH
Q psy5736         282 RGIEQMNTSCFREAVESLTK  301 (346)
Q Consensus       282 ~a~~~~~~~~~~~ai~~l~~  301 (346)
                      .|.-.+..|++++|++.+.+
T Consensus        31 la~~~~~~~~y~~A~~~~~~   50 (84)
T PF12895_consen   31 LAQCYFQQGKYEEAIELLQK   50 (84)
T ss_dssp             HHHHHHHTTHHHHHHHHHHC
T ss_pred             HHHHHHHCCCHHHHHHHHHH
Confidence            34455666777777777766


No 264
>KOG1034|consensus
Probab=27.09  E-value=1.3e+02  Score=29.49  Aligned_cols=55  Identities=13%  Similarity=0.114  Sum_probs=42.5

Q ss_pred             cceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCC
Q psy5736           5 FLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNY   65 (346)
Q Consensus         5 ~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~   65 (346)
                      +.+|.+||.++..-+.-|.---    -|.-++=++.+ +.+++ .++.++.|..+.+|+.-
T Consensus       157 D~svRlwnI~~~~Cv~VfGG~e----gHrdeVLSvD~-~~~gd-~i~ScGmDhslk~W~l~  211 (385)
T KOG1034|consen  157 DHSVRLWNIQTDVCVAVFGGVE----GHRDEVLSVDF-SLDGD-RIASCGMDHSLKLWRLN  211 (385)
T ss_pred             CceEEEEeccCCeEEEEecccc----cccCcEEEEEE-cCCCC-eeeccCCcceEEEEecC
Confidence            6789999999999999886433    26678888774 33344 67889999999999765


No 265
>KOG1240|consensus
Probab=26.95  E-value=71  Score=36.38  Aligned_cols=59  Identities=17%  Similarity=0.167  Sum_probs=33.7

Q ss_pred             cccceEEEEecCCCcee-eeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeC
Q psy5736           3 GLFLKEGIWDHITGSRL-ISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRN   64 (346)
Q Consensus         3 ~~~~~~~~w~~~~~~~~-~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~   64 (346)
                      +-+|+|.|||..+-.+= ..+.+- --|.+.+++++.+..+.  ...-..+|++||.|++.+-
T Consensus      1068 S~DGtVKvW~~~k~~~~~~s~rS~-ltys~~~sr~~~vt~~~--~~~~~Av~t~DG~v~~~~i 1127 (1431)
T KOG1240|consen 1068 SDDGTVKVWNLRKLEGEGGSARSE-LTYSPEGSRVEKVTMCG--NGDQFAVSTKDGSVRVLRI 1127 (1431)
T ss_pred             cCCceEEEeeehhhhcCcceeeee-EEEeccCCceEEEEecc--CCCeEEEEcCCCeEEEEEc
Confidence            45899999998654222 122111 11344455555544333  2333466899999998853


No 266
>TIGR00373 conserved hypothetical protein TIGR00373. This family of proteins is, so far, restricted to archaeal genomes. The family appears to be distantly related to the N-terminal region of the eukaryotic transcription initiation factor IIE alpha chain.
Probab=26.88  E-value=32  Score=29.80  Aligned_cols=34  Identities=21%  Similarity=0.677  Sum_probs=24.6

Q ss_pred             CCCceecCCCCCCCccccccccCccccccCCcCcCCccccc
Q psy5736         203 YWFECKCRACVENWPLMESLEKYPIRIRCSNDNCGQIIATV  243 (346)
Q Consensus       203 ~~F~C~C~~C~~~~~~~~~~~~~~~~~~C~~~~C~g~~~~~  243 (346)
                      -+|.|  +.|...+...+.+.   ..|.||  .|++.+...
T Consensus       108 ~~Y~C--p~c~~r~tf~eA~~---~~F~Cp--~Cg~~L~~~  141 (158)
T TIGR00373       108 MFFIC--PNMCVRFTFNEAME---LNFTCP--RCGAMLDYL  141 (158)
T ss_pred             CeEEC--CCCCcEeeHHHHHH---cCCcCC--CCCCEeeec
Confidence            45555  88988776555543   589999  999987655


No 267
>KOG1539|consensus
Probab=26.76  E-value=1.2e+02  Score=33.07  Aligned_cols=52  Identities=21%  Similarity=0.330  Sum_probs=43.0

Q ss_pred             ccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736           4 LFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYR   63 (346)
Q Consensus         4 ~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r   63 (346)
                      ..|+|.|+|...++.+-+|.-.       ..+||+|-| ..+..+++.+|++.|.+.+|.
T Consensus       222 ~~G~ViifNlK~dkil~sFk~d-------~g~VtslSF-rtDG~p~las~~~~G~m~~wD  273 (910)
T KOG1539|consen  222 ENGTVIIFNLKFDKILMSFKQD-------WGRVTSLSF-RTDGNPLLASGRSNGDMAFWD  273 (910)
T ss_pred             cCceEEEEEcccCcEEEEEEcc-------ccceeEEEe-ccCCCeeEEeccCCceEEEEE
Confidence            3799999999999999999522       368999885 557788889899999998883


No 268
>PF09862 DUF2089:  Protein of unknown function (DUF2089);  InterPro: IPR018658  This family consists of various hypothetical prokaryotic proteins. 
Probab=26.48  E-value=47  Score=27.23  Aligned_cols=16  Identities=13%  Similarity=0.312  Sum_probs=7.3

Q ss_pred             HHHHHHHHHHHHHHhh
Q psy5736         292 FREAVESLTKFSDQIH  307 (346)
Q Consensus       292 ~~~ai~~l~~~l~~~~  307 (346)
                      |-.++.-+.++++.+.
T Consensus        63 YPTvR~rLd~ii~~lg   78 (113)
T PF09862_consen   63 YPTVRNRLDKIIEKLG   78 (113)
T ss_pred             cHHHHHHHHHHHHHhC
Confidence            3334444555554443


No 269
>KOG1036|consensus
Probab=26.43  E-value=1.2e+02  Score=29.23  Aligned_cols=49  Identities=12%  Similarity=0.203  Sum_probs=33.9

Q ss_pred             CcccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736           2 TGLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY   62 (346)
Q Consensus         2 ~~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~   62 (346)
                      +|.+++|.+||-........+.++        .+|-+|.    -.+..+++|.+|..+-+|
T Consensus       112 gsWD~~ik~wD~R~~~~~~~~d~~--------kkVy~~~----v~g~~LvVg~~~r~v~iy  160 (323)
T KOG1036|consen  112 GSWDKTIKFWDPRNKVVVGTFDQG--------KKVYCMD----VSGNRLVVGTSDRKVLIY  160 (323)
T ss_pred             cccCccEEEEeccccccccccccC--------ceEEEEe----ccCCEEEEeecCceEEEE
Confidence            477899999999874444444322        2455544    335667889999999999


No 270
>KOG0264|consensus
Probab=26.23  E-value=1.2e+02  Score=30.54  Aligned_cols=55  Identities=15%  Similarity=0.222  Sum_probs=43.3

Q ss_pred             cccceEEEEecCCC-ceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCC
Q psy5736           3 GLFLKEGIWDHITG-SRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNY   65 (346)
Q Consensus         3 ~~~~~~~~w~~~~~-~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~   65 (346)
                      +-+++|.+||-.+= +++..|.       .|+.+|..+++ .++.+.++.+.++||.+-+|..-
T Consensus       292 S~D~tV~LwDlRnL~~~lh~~e-------~H~dev~~V~W-SPh~etvLASSg~D~rl~vWDls  347 (422)
T KOG0264|consen  292 SADKTVALWDLRNLNKPLHTFE-------GHEDEVFQVEW-SPHNETVLASSGTDRRLNVWDLS  347 (422)
T ss_pred             cCCCcEEEeechhcccCceecc-------CCCcceEEEEe-CCCCCceeEecccCCcEEEEecc
Confidence            45899999999653 5555553       37788999884 77899999999999999999543


No 271
>KOG0265|consensus
Probab=26.14  E-value=72  Score=30.78  Aligned_cols=52  Identities=15%  Similarity=0.201  Sum_probs=38.1

Q ss_pred             ccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCC
Q psy5736           4 LFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNY   65 (346)
Q Consensus         4 ~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~   65 (346)
                      .+|++.|||..+....-.|.|        +.++|++-|-.+.+. + ..|.-|+.|++|..-
T Consensus       153 dD~t~kl~D~R~k~~~~t~~~--------kyqltAv~f~d~s~q-v-~sggIdn~ikvWd~r  204 (338)
T KOG0265|consen  153 DDGTLKLWDIRKKEAIKTFEN--------KYQLTAVGFKDTSDQ-V-ISGGIDNDIKVWDLR  204 (338)
T ss_pred             CCceEEEEeecccchhhcccc--------ceeEEEEEecccccc-e-eeccccCceeeeccc
Confidence            589999999998777766643        467888886544443 3 468889999999553


No 272
>KOG3914|consensus
Probab=26.09  E-value=41  Score=33.37  Aligned_cols=20  Identities=20%  Similarity=0.190  Sum_probs=17.1

Q ss_pred             cccceEEEEecCCCceeeee
Q psy5736           3 GLFLKEGIWDHITGSRLISW   22 (346)
Q Consensus         3 ~~~~~~~~w~~~~~~~~~~~   22 (346)
                      |-+++|.+|||.+|+.|..|
T Consensus       212 sGD~tlr~Wd~~sgk~L~t~  231 (390)
T KOG3914|consen  212 SGDKTLRLWDITSGKLLDTC  231 (390)
T ss_pred             CCCCcEEEEecccCCccccc
Confidence            66899999999999888544


No 273
>smart00659 RPOLCX RNA polymerase subunit CX. present in RNA polymerase I, II and III
Probab=26.01  E-value=46  Score=22.44  Aligned_cols=9  Identities=22%  Similarity=0.848  Sum_probs=4.7

Q ss_pred             ccccccCCc
Q psy5736         255 KKCESCNST  263 (346)
Q Consensus       255 ~~C~~C~~~  263 (346)
                      ..|+.||..
T Consensus        20 irC~~CG~r   28 (44)
T smart00659       20 VRCRECGYR   28 (44)
T ss_pred             eECCCCCce
Confidence            555555543


No 274
>COG1997 RPL43A Ribosomal protein L37AE/L43A [Translation, ribosomal structure and biogenesis]
Probab=25.95  E-value=41  Score=26.26  Aligned_cols=11  Identities=27%  Similarity=0.586  Sum_probs=9.2

Q ss_pred             cccccccCCcC
Q psy5736         254 EKKCESCNSTS  264 (346)
Q Consensus       254 ~~~C~~C~~~~  264 (346)
                      .|.|.+||...
T Consensus        53 IW~C~kCg~~f   63 (89)
T COG1997          53 IWKCRKCGAKF   63 (89)
T ss_pred             eEEcCCCCCee
Confidence            49999999865


No 275
>KOG0644|consensus
Probab=25.61  E-value=37  Score=37.04  Aligned_cols=51  Identities=24%  Similarity=0.329  Sum_probs=39.2

Q ss_pred             ceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCC
Q psy5736           6 LKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNY   65 (346)
Q Consensus         6 ~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~   65 (346)
                      --|.||-.+++..+.+..       -|.++||+|..  .....+..+|+.|.+||+|+.-
T Consensus       212 ~lvKiwS~et~~~lAs~r-------Ghs~ditdlav--s~~n~~iaaaS~D~vIrvWrl~  262 (1113)
T KOG0644|consen  212 RLVKIWSMETARCLASCR-------GHSGDITDLAV--SSNNTMIAAASNDKVIRVWRLP  262 (1113)
T ss_pred             ceeeeeeccchhhhccCC-------CCccccchhcc--chhhhhhhhcccCceEEEEecC
Confidence            458899999998887652       27889999873  2445666778899999999764


No 276
>TIGR02521 type_IV_pilW type IV pilus biogenesis/stability protein PilW. Members of this family are designated PilF in ref (PubMed:8973346) and PilW in ref (PubMed:15612916). This outer membrane protein is required both for pilus stability and for pilus function such as adherence to human cells. Members of this family contain copies of the TPR (tetratricopeptide repeat) domain.
Probab=25.27  E-value=2.3e+02  Score=23.84  Aligned_cols=16  Identities=13%  Similarity=0.110  Sum_probs=7.8

Q ss_pred             cccHHHHHHHHHHHHH
Q psy5736         289 TSCFREAVESLTKFSD  304 (346)
Q Consensus       289 ~~~~~~ai~~l~~~l~  304 (346)
                      .|++++|++.+.+.++
T Consensus        78 ~~~~~~A~~~~~~al~   93 (234)
T TIGR02521        78 LGELEKAEDSFRRALT   93 (234)
T ss_pred             cCCHHHHHHHHHHHHh
Confidence            3445555555544444


No 277
>TIGR02552 LcrH_SycD type III secretion low calcium response chaperone LcrH/SycD. ScyD/LcrH contains three central tetratricopeptide-like repeats that are predicted to fold into an all-alpha-helical array.
Probab=25.25  E-value=1.9e+02  Score=22.86  Aligned_cols=40  Identities=13%  Similarity=0.119  Sum_probs=21.0

Q ss_pred             hccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCCc
Q psy5736         287 MNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGNK  334 (346)
Q Consensus       287 ~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~~  334 (346)
                      +..+++++|+..+.+.++.     .|.   .......++.+|...|+.
T Consensus        62 ~~~~~~~~A~~~~~~~~~~-----~p~---~~~~~~~la~~~~~~g~~  101 (135)
T TIGR02552        62 QMLKEYEEAIDAYALAAAL-----DPD---DPRPYFHAAECLLALGEP  101 (135)
T ss_pred             HHHHHHHHHHHHHHHHHhc-----CCC---ChHHHHHHHHHHHHcCCH
Confidence            3445555555555555443     122   234445666777777664


No 278
>PF03604 DNA_RNApol_7kD:  DNA directed RNA polymerase, 7 kDa subunit;  InterPro: IPR006591 DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates. Each class of RNA polymerase is assembled from 9 to 15 different polypeptides. Rbp10 (RNA polymerase CX) is a domain found in RNA polymerase subunit 10; present in RNA polymerase I, II and III.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent; PDB: 2PMZ_Z 3HKZ_X 2NVX_L 3S1Q_L 2JA6_L 3S17_L 3HOW_L 3HOV_L 3PO2_L 3HOZ_L ....
Probab=25.03  E-value=64  Score=20.27  Aligned_cols=9  Identities=22%  Similarity=0.862  Sum_probs=4.8

Q ss_pred             ccccccCCc
Q psy5736         255 KKCESCNST  263 (346)
Q Consensus       255 ~~C~~C~~~  263 (346)
                      ..|..||..
T Consensus        18 irC~~CG~R   26 (32)
T PF03604_consen   18 IRCPECGHR   26 (32)
T ss_dssp             SSBSSSS-S
T ss_pred             EECCcCCCe
Confidence            566666654


No 279
>COG1198 PriA Primosomal protein N' (replication factor Y) - superfamily II helicase [DNA replication, recombination, and repair]
Probab=24.88  E-value=2.2e+02  Score=30.99  Aligned_cols=30  Identities=17%  Similarity=0.120  Sum_probs=22.2

Q ss_pred             CCCcHhHHHHHHHHHHHHHHhCCccccCcc
Q psy5736         311 VPPYKLASLAHEALRNCWSLAGNKWVIPEN  340 (346)
Q Consensus       311 ~p~h~~~~~~~~~L~~~y~~~G~~~~~~~~  340 (346)
                      -|........-+.|.|.-++.|+.-...+.
T Consensus       567 ~~DfRA~Er~fqll~QvaGRAgR~~~~G~V  596 (730)
T COG1198         567 SPDFRASERTFQLLMQVAGRAGRAGKPGEV  596 (730)
T ss_pred             CCCcchHHHHHHHHHHHHhhhccCCCCCeE
Confidence            445556778888999999999997555443


No 280
>COG1675 TFA1 Transcription initiation factor IIE, alpha subunit [Transcription]
Probab=24.67  E-value=62  Score=28.68  Aligned_cols=24  Identities=13%  Similarity=-0.201  Sum_probs=15.0

Q ss_pred             CCEEEEEEeeccCCCCceeecccc
Q psy5736         162 GKNIIVKALRPLKPKEVVAENYGL  185 (346)
Q Consensus       162 g~~~~v~a~r~I~~Geel~~~Y~~  185 (346)
                      .+.+..+-.|+=..|-+.+..|.+
T Consensus        59 ~~li~~~k~rd~~~~~~~y~w~~~   82 (176)
T COG1675          59 DGLISYRKKRDEESGWEEYTWYIN   82 (176)
T ss_pred             CCceEEEeecccCCCcEEEEEEec
Confidence            345555666777777666666654


No 281
>TIGR01384 TFS_arch transcription factor S, archaeal. There has been an apparent duplication event in the Halobacteriaceae lineage (Haloarcula, Haloferax, Haloquadratum, Halobacterium and Natromonas). There appears to be a separate duplication in Methanosphaera stadtmanae.
Probab=24.50  E-value=43  Score=26.48  Aligned_cols=26  Identities=23%  Similarity=0.639  Sum_probs=19.0

Q ss_pred             ccCCcCcCCcccccccCCcccccCcccccccCCcCC
Q psy5736         230 RCSNDNCGQIIATVKKLEPSAKKVEKKCESCNSTSD  265 (346)
Q Consensus       230 ~C~~~~C~g~~~~~~~~~~~~~~~~~~C~~C~~~~~  265 (346)
                      .||  .|++.+.+..        ..+.|..|+....
T Consensus         2 fC~--~Cg~~l~~~~--------~~~~C~~C~~~~~   27 (104)
T TIGR01384         2 FCP--KCGSLMTPKN--------GVYVCPSCGYEKE   27 (104)
T ss_pred             CCc--ccCcccccCC--------CeEECcCCCCccc
Confidence            588  8998886542        1389999998644


No 282
>TIGR00595 priA primosomal protein N'. All proteins in this family for which functions are known are components of the primosome which is involved in replication, repair, and recombination.This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University).
Probab=24.05  E-value=1e+02  Score=31.87  Aligned_cols=18  Identities=22%  Similarity=0.174  Sum_probs=12.5

Q ss_pred             HHHHHHHHHHHHHHhCCc
Q psy5736         317 ASLAHEALRNCWSLAGNK  334 (346)
Q Consensus       317 ~~~~~~~L~~~y~~~G~~  334 (346)
                      ...+.+.|.|..++.|+.
T Consensus       351 ~E~~~~ll~q~~GRagR~  368 (505)
T TIGR00595       351 AERGFQLLTQVAGRAGRA  368 (505)
T ss_pred             HHHHHHHHHHHHhccCCC
Confidence            345567788888888773


No 283
>KOG1523|consensus
Probab=23.87  E-value=77  Score=30.84  Aligned_cols=44  Identities=23%  Similarity=0.470  Sum_probs=30.4

Q ss_pred             eeeeCCCCCCC-------CCCCCcccceeccccCCCceEEeeecCCceEEEeCC
Q psy5736          19 LISWRPDRGSW-------ARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNY   65 (346)
Q Consensus        19 ~~~~~~~~~~w-------~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~   65 (346)
                      +|||.-+|. |       +++.+.|++|.++  ++..++.+|+.||..|+|.-|
T Consensus       126 Vcy~E~ENd-WWVsKhikkPirStv~sldWh--pnnVLlaaGs~D~k~rVfSay  176 (361)
T KOG1523|consen  126 VCYYEQEND-WWVSKHIKKPIRSTVTSLDWH--PNNVLLAAGSTDGKCRVFSAY  176 (361)
T ss_pred             EEEEecccc-eehhhhhCCccccceeeeecc--CCcceecccccCcceeEEEEe
Confidence            356665553 5       4445567887765  456677788899999999655


No 284
>PF14938 SNAP:  Soluble NSF attachment protein, SNAP; PDB: 1QQE_A 2IFU_A.
Probab=23.77  E-value=3.7e+02  Score=25.02  Aligned_cols=44  Identities=11%  Similarity=0.022  Sum_probs=33.3

Q ss_pred             ccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCCcc
Q psy5736         290 SCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGNKW  335 (346)
Q Consensus       290 ~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~~~  335 (346)
                      +++++|++.|++.++.|..--  .......+...++.++..+|+.-
T Consensus       129 ~d~e~Ai~~Y~~A~~~y~~e~--~~~~a~~~~~~~A~l~~~l~~y~  172 (282)
T PF14938_consen  129 GDYEKAIEYYQKAAELYEQEG--SPHSAAECLLKAADLYARLGRYE  172 (282)
T ss_dssp             --HHHHHHHHHHHHHHHHHTT---HHHHHHHHHHHHHHHHHTT-HH
T ss_pred             CCHHHHHHHHHHHHHHHHHCC--ChhhHHHHHHHHHHHHHHhCCHH
Confidence            589999999999999998764  44556778888999999988653


No 285
>CHL00033 ycf3 photosystem I assembly protein Ycf3
Probab=23.76  E-value=2e+02  Score=24.19  Aligned_cols=52  Identities=8%  Similarity=-0.047  Sum_probs=30.2

Q ss_pred             HHHHHHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCC
Q psy5736         281 YRGIEQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGN  333 (346)
Q Consensus       281 ~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~  333 (346)
                      .-+.-....|++++|+..+++.+..... ....+..+..+...+.+.+..+|+
T Consensus        77 ~lg~~~~~~g~~~eA~~~~~~Al~~~~~-~~~~~~~la~i~~~~~~~~~~~g~  128 (168)
T CHL00033         77 NIGLIHTSNGEHTKALEYYFQALERNPF-LPQALNNMAVICHYRGEQAIEQGD  128 (168)
T ss_pred             HHHHHHHHcCCHHHHHHHHHHHHHhCcC-cHHHHHHHHHHHHHhhHHHHHccc
Confidence            3345556678999999999999976322 222333344444444444445555


No 286
>PRK00564 hypA hydrogenase nickel incorporation protein; Provisional
Probab=23.23  E-value=85  Score=25.72  Aligned_cols=41  Identities=12%  Similarity=0.216  Sum_probs=26.5

Q ss_pred             cCCCCceeecccccccCCCHHHHHHHHHhhCCCceecCCCCCCCccccccccCccccccCCcCcCCc
Q psy5736         173 LKPKEVVAENYGLVFSRKHLIDRQKVLSARYWFECKCRACVENWPLMESLEKYPIRIRCSNDNCGQI  239 (346)
Q Consensus       173 I~~Geel~~~Y~~~~~~~~~~~R~~~L~~~~~F~C~C~~C~~~~~~~~~~~~~~~~~~C~~~~C~g~  239 (346)
                      +.+|.+|.|.-.+                   ....|..|...++    ... ...++||  .|++.
T Consensus        57 ~~ega~L~Ie~vp-------------------~~~~C~~Cg~~~~----~~~-~~~~~CP--~Cgs~   97 (117)
T PRK00564         57 VCKDAILDIVDEK-------------------VELECKDCSHVFK----PNA-LDYGVCE--KCHSK   97 (117)
T ss_pred             ccCCCEEEEEecC-------------------CEEEhhhCCCccc----cCC-ccCCcCc--CCCCC
Confidence            5678888888776                   2677888876653    221 1234588  88764


No 287
>PRK14892 putative transcription elongation factor Elf1; Provisional
Probab=23.14  E-value=37  Score=27.18  Aligned_cols=58  Identities=14%  Similarity=0.308  Sum_probs=30.7

Q ss_pred             cccccCCcCcCCcccccccCCcccccCcccccccCCcCChHHHHHHHHHHHHHHHHHHHHhcccc
Q psy5736         227 IRIRCSNDNCGQIIATVKKLEPSAKKVEKKCESCNSTSDLTEIKTKLSELNEMFYRGIEQMNTSC  291 (346)
Q Consensus       227 ~~~~C~~~~C~g~~~~~~~~~~~~~~~~~~C~~C~~~~~~~~~~~~l~~~~~~~~~a~~~~~~~~  291 (346)
                      ..|.||  +|+....+..-.   .+.....|..||......  ...|.+..+.|.+-+++...|.
T Consensus        20 t~f~CP--~Cge~~v~v~~~---k~~~h~~C~~CG~y~~~~--V~~l~epIDVY~~wiD~~~eg~   77 (99)
T PRK14892         20 KIFECP--RCGKVSISVKIK---KNIAIITCGNCGLYTEFE--VPSVYDEVDVYNKFIDLYLEGK   77 (99)
T ss_pred             cEeECC--CCCCeEeeeecC---CCcceEECCCCCCccCEE--CCccccchhhHHHHHHHHHhcC
Confidence            567898  888433221100   012358899999865432  2234444445555555554444


No 288
>TIGR02300 FYDLN_acid conserved hypothetical protein TIGR02300. Members of this family are bacterial proteins with a conserved motif [KR]FYDLN, sometimes flanked by a pair of CXXC motifs, followed by a long region of low complexity sequence in which roughly half the residues are Asp and Glu, including multiple runs of five or more acidic residues. The function of members of this family is unknown.
Probab=23.11  E-value=54  Score=27.46  Aligned_cols=32  Identities=13%  Similarity=0.099  Sum_probs=21.4

Q ss_pred             cccCCcCcCCcccccccCCcccccCcccccccCCcCChHHH
Q psy5736         229 IRCSNDNCGQIIATVKKLEPSAKKVEKKCESCNSTSDLTEI  269 (346)
Q Consensus       229 ~~C~~~~C~g~~~~~~~~~~~~~~~~~~C~~C~~~~~~~~~  269 (346)
                      -.||  +|+..++--+       ....+|++||..+..+.+
T Consensus        10 r~Cp--~cg~kFYDLn-------k~p~vcP~cg~~~~~~~~   41 (129)
T TIGR02300        10 RICP--NTGSKFYDLN-------RRPAVSPYTGEQFPPEEA   41 (129)
T ss_pred             ccCC--CcCccccccC-------CCCccCCCcCCccCcchh
Confidence            4688  8876654332       124899999998776633


No 289
>PRK04016 DNA-directed RNA polymerase subunit N; Provisional
Probab=23.03  E-value=1.3e+02  Score=22.03  Aligned_cols=13  Identities=15%  Similarity=0.429  Sum_probs=10.4

Q ss_pred             cccccccCCcCCh
Q psy5736         254 EKKCESCNSTSDL  266 (346)
Q Consensus       254 ~~~C~~C~~~~~~  266 (346)
                      ...|-.||+.+..
T Consensus         4 PvRCFTCGkvi~~   16 (62)
T PRK04016          4 PVRCFTCGKVIAE   16 (62)
T ss_pred             CeEecCCCCChHH
Confidence            3789999998764


No 290
>PF13913 zf-C2HC_2:  zinc-finger of a C2HC-type
Probab=23.00  E-value=89  Score=18.21  Aligned_cols=18  Identities=11%  Similarity=0.388  Sum_probs=13.2

Q ss_pred             ccccccCCcCChHHHHHH
Q psy5736         255 KKCESCNSTSDLTEIKTK  272 (346)
Q Consensus       255 ~~C~~C~~~~~~~~~~~~  272 (346)
                      ..|+.||.....+.+.+.
T Consensus         3 ~~C~~CgR~F~~~~l~~H   20 (25)
T PF13913_consen    3 VPCPICGRKFNPDRLEKH   20 (25)
T ss_pred             CcCCCCCCEECHHHHHHH
Confidence            679999998876665443


No 291
>TIGR02878 spore_ypjB sporulation protein YpjB. Members of this protein, YpjB, family are restricted to a subset of endospore-forming bacteria, including Bacillus species but not CLostridium or some others. In Bacillus subtilis, ypjB was found to be part of the sigma-E regulon, where sigma-E is a sporulation sigma factor that regulates expression in the mother cell compartment. Null mutants of ypjB show a sporulation defect. This protein family is not, however, a part of the endospore formation minimal gene set.
Probab=22.68  E-value=1.6e+02  Score=27.32  Aligned_cols=35  Identities=26%  Similarity=0.327  Sum_probs=27.3

Q ss_pred             HHHHHHHHHHHHhccccHHHHHHHHHHHHHHhhcc
Q psy5736         275 ELNEMFYRGIEQMNTSCFREAVESLTKFSDQIHEL  309 (346)
Q Consensus       275 ~~~~~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~  309 (346)
                      ++..+-..++++.+++++++|.+.+.-+-+.|-..
T Consensus         4 eL~~lsD~~LQl~K~~~yeeAkqvL~yFs~~F~~~   38 (233)
T TIGR02878         4 ELNDLSDQVLQLTKQARYEEAKQVLEYFSKQFVSA   38 (233)
T ss_pred             hHhhhhHHHHHHHHhhhHHHHHHHHHHHHHHHHhc
Confidence            44455566788889999999999888888777664


No 292
>cd02679 MIT_spastin MIT: domain contained within Microtubule Interacting and Trafficking molecules. This MIT domain sub-family is found in the AAA protein spastin, a probable ATPase involved in the assembly or function of nuclear protein complexes; spastins might also be involved in microtubule dynamics. The molecular function of the MIT domain is unclear.
Probab=22.33  E-value=1.8e+02  Score=22.14  Aligned_cols=40  Identities=13%  Similarity=0.063  Sum_probs=29.5

Q ss_pred             HHHHHHHHHHHHHhccccHHHHHHHHHHHHHHhhcccCCC
Q psy5736         274 SELNEMFYRGIEQMNTSCFREAVESLTKFSDQIHELVVPP  313 (346)
Q Consensus       274 ~~~~~~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~  313 (346)
                      ..+-+...+|++.=+.|..++|+..|.+.++.+.+-+.-|
T Consensus         6 ~~A~~~I~kaL~~dE~g~~e~Al~~Y~~gi~~l~eg~ai~   45 (79)
T cd02679           6 KQAFEEISKALRADEWGDKEQALAHYRKGLRELEEGIAVP   45 (79)
T ss_pred             HHHHHHHHHHhhhhhcCCHHHHHHHHHHHHHHHHHHcCCC
Confidence            3445555667776677899999999999998888754443


No 293
>KOG2055|consensus
Probab=22.29  E-value=1.1e+02  Score=31.21  Aligned_cols=29  Identities=24%  Similarity=0.386  Sum_probs=24.6

Q ss_pred             CCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736          33 SARCTALTLLNEHDDSLLLTGYDDGSLSVYR   63 (346)
Q Consensus        33 ~~~~t~l~~ine~~~~lll~g~~dG~vri~r   63 (346)
                      ...|++++|+  ++-.++|+|+-||.+|+|.
T Consensus       213 ~~~I~sv~FH--p~~plllvaG~d~~lrifq  241 (514)
T KOG2055|consen  213 HGGITSVQFH--PTAPLLLVAGLDGTLRIFQ  241 (514)
T ss_pred             cCCceEEEec--CCCceEEEecCCCcEEEEE
Confidence            3579999987  4678999999999999994


No 294
>KOG0321|consensus
Probab=22.13  E-value=1.1e+02  Score=32.44  Aligned_cols=52  Identities=13%  Similarity=0.166  Sum_probs=35.7

Q ss_pred             cceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736           5 FLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY   62 (346)
Q Consensus         5 ~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~   62 (346)
                      +-++.+||.+.+...-.-.+     -.|..-++++-|.- .+..+..+|+-||.+-+|
T Consensus       121 DsT~r~Wdvk~s~l~G~~~~-----~GH~~SvkS~cf~~-~n~~vF~tGgRDg~illW  172 (720)
T KOG0321|consen  121 DSTIRPWDVKTSRLVGGRLN-----LGHTGSVKSECFMP-TNPAVFCTGGRDGEILLW  172 (720)
T ss_pred             Cceeeeeeeccceeecceee-----cccccccchhhhcc-CCCcceeeccCCCcEEEE
Confidence            56899999998744322111     12555567766544 566777889999999999


No 295
>PF01927 Mut7-C:  Mut7-C RNAse domain;  InterPro: IPR002782 This prokaryotic family of proteins have no known function. The proteins contain four conserved cysteines that may be involved in metal binding or disulphide bridges.
Probab=22.06  E-value=50  Score=28.05  Aligned_cols=34  Identities=24%  Similarity=0.433  Sum_probs=19.8

Q ss_pred             cccCCcCcCCcccccccCC---------cccccCcccccccCCcC
Q psy5736         229 IRCSNDNCGQIIATVKKLE---------PSAKKVEKKCESCNSTS  264 (346)
Q Consensus       229 ~~C~~~~C~g~~~~~~~~~---------~~~~~~~~~C~~C~~~~  264 (346)
                      -+|+  .|.+.+.+....+         ....+.-|.|+.||+..
T Consensus        92 sRC~--~CN~~L~~v~~~~v~~~vp~~v~~~~~~f~~C~~C~kiy  134 (147)
T PF01927_consen   92 SRCP--KCNGPLRPVSKEEVKDRVPPYVYETYDEFWRCPGCGKIY  134 (147)
T ss_pred             CccC--CCCcEeeechhhccccccCccccccCCeEEECCCCCCEe
Confidence            3677  7777655432110         01123479999999963


No 296
>PF12894 Apc4_WD40:  Anaphase-promoting complex subunit 4 WD40 domain
Probab=22.03  E-value=1.2e+02  Score=20.66  Aligned_cols=29  Identities=28%  Similarity=0.481  Sum_probs=21.3

Q ss_pred             CcccceeccccCCCceEEeeecCCceEEEeC
Q psy5736          34 ARCTALTLLNEHDDSLLLTGYDDGSLSVYRN   64 (346)
Q Consensus        34 ~~~t~l~~ine~~~~lll~g~~dG~vri~r~   64 (346)
                      .+|+.+.+ + ++-.++.+|+.+|.|.+||.
T Consensus        12 ~~v~~~~w-~-P~mdLiA~~t~~g~v~v~Rl   40 (47)
T PF12894_consen   12 SRVSCMSW-C-PTMDLIALGTEDGEVLVYRL   40 (47)
T ss_pred             CcEEEEEE-C-CCCCEEEEEECCCeEEEEEC
Confidence            46776664 3 33447788999999999986


No 297
>KOG1273|consensus
Probab=21.96  E-value=1.1e+02  Score=29.83  Aligned_cols=54  Identities=20%  Similarity=0.063  Sum_probs=40.3

Q ss_pred             ccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEEeCCc
Q psy5736           4 LFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNYA   66 (346)
Q Consensus         4 ~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~~   66 (346)
                      .+|.|-|||..+-..-..++       -|-..|++|-+  ..|+..+|++|.|-.+.+|....
T Consensus        43 ~nG~vvI~D~~T~~iar~ls-------aH~~pi~sl~W--S~dgr~LltsS~D~si~lwDl~~   96 (405)
T KOG1273|consen   43 ANGRVVIYDFDTFRIARMLS-------AHVRPITSLCW--SRDGRKLLTSSRDWSIKLWDLLK   96 (405)
T ss_pred             cCCcEEEEEccccchhhhhh-------ccccceeEEEe--cCCCCEeeeecCCceeEEEeccC
Confidence            37999999998764333331       25667888875  46788889999999999996653


No 298
>KOG4056|consensus
Probab=21.87  E-value=1.6e+02  Score=25.02  Aligned_cols=43  Identities=14%  Similarity=0.084  Sum_probs=31.1

Q ss_pred             HHHHHHhccccHHHHHHHH----------HHHHHHhhcccCCCcHhHHHHHHH
Q psy5736         281 YRGIEQMNTSCFREAVESL----------TKFSDQIHELVVPPYKLASLAHEA  323 (346)
Q Consensus       281 ~~a~~~~~~~~~~~ai~~l----------~~~l~~~~~~l~p~h~~~~~~~~~  323 (346)
                      +.+.+++.+|++++....+          .++|..|++.++|+++.+...+..
T Consensus        86 ~lGE~L~~qg~~e~ga~h~~nAi~vcgqpaqLL~vlq~tlp~~if~~i~~~l~  138 (143)
T KOG4056|consen   86 QLGEELLAQGNEEEGAEHLANAIVVCGQPAQLLQVLQQTLPEAIFAMLLKKLP  138 (143)
T ss_pred             HhHHHHHHccCHHHHHHHHHHHHhhcCCHHHHHHHHHhhCCHHHHHHHHHHhh
Confidence            4466677888877754444          457789999999999987776653


No 299
>smart00647 IBR In Between Ring fingers. the domains occurs between pairs og RING fingers
Probab=21.87  E-value=64  Score=22.52  Aligned_cols=32  Identities=16%  Similarity=0.369  Sum_probs=18.2

Q ss_pred             ccccCCcCcCCcccccccCCcccccCcccccccCCc
Q psy5736         228 RIRCSNDNCGQIIATVKKLEPSAKKVEKKCESCNST  263 (346)
Q Consensus       228 ~~~C~~~~C~g~~~~~~~~~~~~~~~~~~C~~C~~~  263 (346)
                      ..+||+++|...+....+.    ......|+.|+..
T Consensus        18 ~~~CP~~~C~~~~~~~~~~----~~~~v~C~~C~~~   49 (64)
T smart00647       18 LKWCPAPDCSAAIIVTEEE----GCNRVTCPKCGFS   49 (64)
T ss_pred             ccCCCCCCCcceEEecCCC----CCCeeECCCCCCe
Confidence            3468888887666544210    1124677777653


No 300
>PF04652 DUF605:  Vta1 like;  InterPro: IPR006745 This family contains proteins from the Eukaryota; functionally they are uncharacterised.; PDB: 2RKK_B 2RKL_B 3MHV_A.
Probab=21.76  E-value=1.9e+02  Score=28.28  Aligned_cols=37  Identities=14%  Similarity=0.147  Sum_probs=30.3

Q ss_pred             HHHHHHHHHHHHHHHHHhccccHHHHHHHHHHHHHHh
Q psy5736         270 KTKLSELNEMFYRGIEQMNTSCFREAVESLTKFSDQI  306 (346)
Q Consensus       270 ~~~l~~~~~~~~~a~~~~~~~~~~~ai~~l~~~l~~~  306 (346)
                      .+.+.+++...+=|+.+++-.|+..|++.|++.|+.+
T Consensus       343 ~~~i~~aQK~aK~AiSAL~feDv~tA~~~L~~AL~lL  379 (380)
T PF04652_consen  343 PEQIAKAQKHAKFAISALNFEDVPTAIKELRKALELL  379 (380)
T ss_dssp             HHHHHHHHHHHHHHHHHHTTT-HHHHHHHHHHHHHHH
T ss_pred             ccccccccccccccccccccccccccccccccccccC
Confidence            3456677777788899999999999999999999876


No 301
>KOG0301|consensus
Probab=21.26  E-value=99  Score=33.00  Aligned_cols=32  Identities=22%  Similarity=0.357  Sum_probs=22.7

Q ss_pred             CCCCCCcccceeccccCCCceEEeeecCCceEEEe
Q psy5736          29 WARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYR   63 (346)
Q Consensus        29 w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r   63 (346)
                      |..|..-+=+++.+++.   ..++|+.|..|++|+
T Consensus       136 l~gH~asVWAv~~l~e~---~~vTgsaDKtIklWk  167 (745)
T KOG0301|consen  136 LQGHTASVWAVASLPEN---TYVTGSADKTIKLWK  167 (745)
T ss_pred             cCCcchheeeeeecCCC---cEEeccCcceeeecc
Confidence            55566667777777765   557788888888884


No 302
>KOG4378|consensus
Probab=21.19  E-value=1.4e+02  Score=30.89  Aligned_cols=54  Identities=19%  Similarity=0.286  Sum_probs=35.2

Q ss_pred             CcccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736           2 TGLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY   62 (346)
Q Consensus         2 ~~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~   62 (346)
                      .|-.|.|++||......+-.|..+      |+..|..+-| .+..+.++++-+-|..|-+|
T Consensus       183 asd~G~VtlwDv~g~sp~~~~~~~------HsAP~~gicf-spsne~l~vsVG~Dkki~~y  236 (673)
T KOG4378|consen  183 ASDKGAVTLWDVQGMSPIFHASEA------HSAPCRGICF-SPSNEALLVSVGYDKKINIY  236 (673)
T ss_pred             eccCCeEEEEeccCCCcccchhhh------ccCCcCccee-cCCccceEEEecccceEEEe
Confidence            356799999999988877766544      5666666654 33445555555556666555


No 303
>PRK14873 primosome assembly protein PriA; Provisional
Probab=20.96  E-value=1.3e+02  Score=32.39  Aligned_cols=24  Identities=8%  Similarity=-0.054  Sum_probs=15.5

Q ss_pred             CCCcHhHHHHHHHHHHHHHHhCCc
Q psy5736         311 VPPYKLASLAHEALRNCWSLAGNK  334 (346)
Q Consensus       311 ~p~h~~~~~~~~~L~~~y~~~G~~  334 (346)
                      .|....-..+.+.|.+.-++.|..
T Consensus       503 ~pDfRA~Er~~qll~qvagragr~  526 (665)
T PRK14873        503 RQDLRAAEDTLRRWMAAAALVRPR  526 (665)
T ss_pred             CCCcChHHHHHHHHHHHHHhhcCC
Confidence            344445666677777777777664


No 304
>PLN03086 PRLI-interacting factor K; Provisional
Probab=20.93  E-value=1.3e+02  Score=31.56  Aligned_cols=57  Identities=25%  Similarity=0.569  Sum_probs=33.3

Q ss_pred             ceecCCCCCCCccccccccC-----ccccccCCcCcCCcccccccCCcccccCcccccccCCcCChHHH
Q psy5736         206 ECKCRACVENWPLMESLEKY-----PIRIRCSNDNCGQIIATVKKLEPSAKKVEKKCESCNSTSDLTEI  269 (346)
Q Consensus       206 ~C~C~~C~~~~~~~~~~~~~-----~~~~~C~~~~C~g~~~~~~~~~~~~~~~~~~C~~C~~~~~~~~~  269 (346)
                      +..|.-|...-|. ..+..+     ...+.||+..|+..+.....      ...|.|..|++.+.....
T Consensus       407 ~V~C~NC~~~i~l-~~l~lHe~~C~r~~V~Cp~~~Cg~v~~r~el------~~H~~C~~Cgk~f~~s~L  468 (567)
T PLN03086        407 TVECRNCKHYIPS-RSIALHEAYCSRHNVVCPHDGCGIVLRVEEA------KNHVHCEKCGQAFQQGEM  468 (567)
T ss_pred             eEECCCCCCccch-hHHHHHHhhCCCcceeCCcccccceeecccc------ccCccCCCCCCccchHHH
Confidence            4567778775442 122211     25567875558876643321      235999999998765443


No 305
>KOG0310|consensus
Probab=20.79  E-value=1.7e+02  Score=29.89  Aligned_cols=52  Identities=15%  Similarity=0.157  Sum_probs=34.1

Q ss_pred             cccceEEEEecCCCceeeeeCCCCCCCCCCCCcccceeccccCCCceEEeeecCCceEEE
Q psy5736           3 GLFLKEGIWDHITGSRLISWRPDRGSWARNSARCTALTLLNEHDDSLLLTGYDDGSLSVY   62 (346)
Q Consensus         3 ~~~~~~~~w~~~~~~~~~~~~~~~~~w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~   62 (346)
                      +=++.+.+||-.++.- ..--.+      |.-.+-+..++ +..+.++++|+=||.||+|
T Consensus       130 sDd~v~k~~d~s~a~v-~~~l~~------htDYVR~g~~~-~~~~hivvtGsYDg~vrl~  181 (487)
T KOG0310|consen  130 SDDKVVKYWDLSTAYV-QAELSG------HTDYVRCGDIS-PANDHIVVTGSYDGKVRLW  181 (487)
T ss_pred             CCCceEEEEEcCCcEE-EEEecC------CcceeEeeccc-cCCCeEEEecCCCceEEEE
Confidence            3467889999998874 222212      34344443333 3556788999999999999


No 306
>TIGR00990 3a0801s09 mitochondrial precursor proteins import receptor (72 kDa mitochondrial outermembrane protein) (mitochondrial import receptor for the ADP/ATP carrier) (translocase of outermembrane tom70).
Probab=20.69  E-value=2.8e+02  Score=28.93  Aligned_cols=51  Identities=12%  Similarity=0.113  Sum_probs=34.7

Q ss_pred             HHHHHHHHHHHHhccccHHHHHHHHHHHHHHhhcccCCCcHhHHHHHHHHHHHHHHhCCc
Q psy5736         275 ELNEMFYRGIEQMNTSCFREAVESLTKFSDQIHELVVPPYKLASLAHEALRNCWSLAGNK  334 (346)
Q Consensus       275 ~~~~~~~~a~~~~~~~~~~~ai~~l~~~l~~~~~~l~p~h~~~~~~~~~L~~~y~~~G~~  334 (346)
                      .+..+...+...+..|++++|++.|.+.++...     . +   .....++.||..+|+.
T Consensus       126 ~a~~~k~~G~~~~~~~~~~~Ai~~y~~al~~~p-----~-~---~~~~n~a~~~~~l~~~  176 (615)
T TIGR00990       126 YAAKLKEKGNKAYRNKDFNKAIKLYSKAIECKP-----D-P---VYYSNRAACHNALGDW  176 (615)
T ss_pred             HHHHHHHHHHHHHHcCCHHHHHHHHHHHHhcCC-----c-h---HHHHHHHHHHHHhCCH
Confidence            345555667778888999999999999886542     1 1   2345566677766654


No 307
>PF02561 FliS:  Flagellar protein FliS;  InterPro: IPR003713 The fliD operon of several bacteria consists of three flagellar genes, fliD, fliS, and fliT, and is transcribed in this order []. In Bacillus subtilis the operon encoding the flagellar proteins FliD, FliS, and FliT is sigma D-dependent [].; GO: 0009296 flagellum assembly, 0009288 bacterial-type flagellum; PDB: 1VH6_A 3IQC_B 3K1I_B 1ORJ_B 1ORY_A.
Probab=20.60  E-value=4.3e+02  Score=21.35  Aligned_cols=45  Identities=16%  Similarity=0.119  Sum_probs=29.7

Q ss_pred             HHHHHHHHHHHHHHHHHHhccccHHHHHHHHHH---HHHHhhcccCCC
Q psy5736         269 IKTKLSELNEMFYRGIEQMNTSCFREAVESLTK---FSDQIHELVVPP  313 (346)
Q Consensus       269 ~~~~l~~~~~~~~~a~~~~~~~~~~~ai~~l~~---~l~~~~~~l~p~  313 (346)
                      +.-.++.+...++.|...++.++++.+...+.+   ++..+...|.+.
T Consensus        22 i~~Lyd~ai~~l~~a~~a~~~~~~~~~~~~l~ka~~Ii~~L~~~Ld~e   69 (122)
T PF02561_consen   22 ILMLYDGAIEFLKQAKEAIEQGDIEEKNEALQKAQDIITELQSSLDFE   69 (122)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHTTHHHHHHHHHHHHHHHHHHHHHTCCTT
T ss_pred             HHHHHHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHHhhcCCC
Confidence            444456667777788888899998876555544   445555556554


No 308
>KOG0282|consensus
Probab=20.54  E-value=74  Score=32.45  Aligned_cols=37  Identities=22%  Similarity=0.402  Sum_probs=30.5

Q ss_pred             CCCCCCcccceeccccCCCceEEeeecCCceEEEeCCc
Q psy5736          29 WARNSARCTALTLLNEHDDSLLLTGYDDGSLSVYRNYA   66 (346)
Q Consensus        29 w~~~~~~~t~l~~ine~~~~lll~g~~dG~vri~r~~~   66 (346)
                      |..|..-+++++++- -..-|+|.|+.||.|.+|.-|.
T Consensus       210 ~~gH~kgvsai~~fp-~~~hLlLS~gmD~~vklW~vy~  246 (503)
T KOG0282|consen  210 LSGHTKGVSAIQWFP-KKGHLLLSGGMDGLVKLWNVYD  246 (503)
T ss_pred             ccCCccccchhhhcc-ceeeEEEecCCCceEEEEEEec
Confidence            788888899999886 3466789999999999996553


No 309
>PRK00415 rps27e 30S ribosomal protein S27e; Reviewed
Probab=20.40  E-value=70  Score=23.13  Aligned_cols=33  Identities=18%  Similarity=0.511  Sum_probs=21.7

Q ss_pred             cCccccccCCcCcCCcc-cccccCCcccccCcccccccCCcC
Q psy5736         224 KYPIRIRCSNDNCGQII-ATVKKLEPSAKKVEKKCESCNSTS  264 (346)
Q Consensus       224 ~~~~~~~C~~~~C~g~~-~~~~~~~~~~~~~~~~C~~C~~~~  264 (346)
                      .++..++||  .|.+.- ..+..      .....|..||..+
T Consensus         7 S~F~~VkCp--~C~n~q~vFsha------~t~V~C~~Cg~~L   40 (59)
T PRK00415          7 SRFLKVKCP--DCGNEQVVFSHA------STVVRCLVCGKTL   40 (59)
T ss_pred             CeEEEEECC--CCCCeEEEEecC------CcEEECcccCCCc
Confidence            456788999  897542 22322      2348899999876


No 310
>COG2178 Predicted RNA-binding protein of the translin family [Translation, ribosomal structure and biogenesis]
Probab=20.33  E-value=5.8e+02  Score=23.14  Aligned_cols=47  Identities=19%  Similarity=0.174  Sum_probs=31.6

Q ss_pred             hHHHHHHHHHHHHHHHHHHHHhccccHHHHHHHHHHH---HHHhhcccCC
Q psy5736         266 LTEIKTKLSELNEMFYRGIEQMNTSCFREAVESLTKF---SDQIHELVVP  312 (346)
Q Consensus       266 ~~~~~~~l~~~~~~~~~a~~~~~~~~~~~ai~~l~~~---l~~~~~~l~p  312 (346)
                      .+++.+...++..+-..++.++-.|++++|...+.++   ++.+++.+.+
T Consensus        19 REE~l~lsRei~r~s~~aI~~~H~~~~eeA~~~l~~a~~~v~~Lk~~l~~   68 (204)
T COG2178          19 REEALKLSREIVRLSGEAIFLLHRGDFEEAEKKLKKASEAVEKLKRLLAG   68 (204)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHhccHHHHHHHHHHHHHHHHHHHHHHhh
Confidence            4556666667777777888889999999886666554   4555544444


No 311
>COG1675 TFA1 Transcription initiation factor IIE, alpha subunit [Transcription]
Probab=20.32  E-value=1.4e+02  Score=26.55  Aligned_cols=39  Identities=15%  Similarity=0.566  Sum_probs=18.5

Q ss_pred             HHHHHhhCCCceecCCCCCCCccccccccCccccccCCcCcCCccc
Q psy5736         196 QKVLSARYWFECKCRACVENWPLMESLEKYPIRIRCSNDNCGQIIA  241 (346)
Q Consensus       196 ~~~L~~~~~F~C~C~~C~~~~~~~~~~~~~~~~~~C~~~~C~g~~~  241 (346)
                      .+...++-+|.|  +.|.-.....+.+   ...|.||  .|++.+.
T Consensus       105 le~~~~~~~y~C--~~~~~r~sfdeA~---~~~F~Cp--~Cg~~L~  143 (176)
T COG1675         105 LEKETENNYYVC--PNCHVKYSFDEAM---ELGFTCP--KCGEDLE  143 (176)
T ss_pred             HHhhccCCceeC--CCCCCcccHHHHH---HhCCCCC--CCCchhh
Confidence            334445566666  5555443221111   2446666  5655443


No 312
>PF08631 SPO22:  Meiosis protein SPO22/ZIP4 like;  InterPro: IPR013940  SPO22 is a meiosis-specific protein with similarity to phospholipase A2, involved in completion of nuclear divisions during meiosis; induced early in meiosis []. It is also involved in sporulation [].
Probab=20.19  E-value=4e+02  Score=24.80  Aligned_cols=57  Identities=18%  Similarity=0.165  Sum_probs=39.9

Q ss_pred             HHHHHHHHHHhccc-cHHHHHHHHHHHHHHhhc-----ccCCCc-HhHHHHHHHHHHHHHHhCC
Q psy5736         277 NEMFYRGIEQMNTS-CFREAVESLTKFSDQIHE-----LVVPPY-KLASLAHEALRNCWSLAGN  333 (346)
Q Consensus       277 ~~~~~~a~~~~~~~-~~~~ai~~l~~~l~~~~~-----~l~p~h-~~~~~~~~~L~~~y~~~G~  333 (346)
                      .-.|.-+..+++.+ ++++|+.++++.++.+.+     -.++.- -....+...|+++|...+.
T Consensus        36 ~~~yn~G~~l~~~~~~~~~a~~wL~~a~~~l~~~~~~~~~~~~~~elr~~iL~~La~~~l~~~~   99 (278)
T PF08631_consen   36 RVCYNIGKSLLSKKDKYEEAVKWLQRAYDILEKPGKMDKLSPDGSELRLSILRLLANAYLEWDT   99 (278)
T ss_pred             HHHHHHHHHHHHcCCChHHHHHHHHHHHHHHHhhhhccccCCcHHHHHHHHHHHHHHHHHcCCC
Confidence            44567788888888 999999999999988754     233322 2455566677777766543


No 313
>PF10122 Mu-like_Com:  Mu-like prophage protein Com;  InterPro: IPR019294  Members of this entry belong to the Com family of proteins that act as translational regulators of mom [, ]. 
Probab=20.10  E-value=43  Score=23.45  Aligned_cols=31  Identities=29%  Similarity=0.787  Sum_probs=17.5

Q ss_pred             cccccCCcCcCCcccccccCCcccccCcccccccCCc
Q psy5736         227 IRIRCSNDNCGQIIATVKKLEPSAKKVEKKCESCNST  263 (346)
Q Consensus       227 ~~~~C~~~~C~g~~~~~~~~~~~~~~~~~~C~~C~~~  263 (346)
                      ..+||+  .|...+.-...-    ....++|+.|+..
T Consensus         3 ~eiRC~--~CnklLa~~g~~----~~leIKCpRC~ti   33 (51)
T PF10122_consen    3 KEIRCG--HCNKLLAKAGEV----IELEIKCPRCKTI   33 (51)
T ss_pred             cceecc--chhHHHhhhcCc----cEEEEECCCCCcc
Confidence            456787  786554432111    1124788888874


Done!