Query         psy8269
Match_columns 386
No_of_seqs    294 out of 1297
Neff          6.5 
Searched_HMMs 46136
Date          Fri Aug 16 23:00:09 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy8269.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/8269hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PRK10907 intramembrane serine   99.9 2.4E-25 5.2E-30  215.3  17.0  177  170-364    93-270 (276)
  2 PTZ00101 rhomboid-1 protease;   99.9 3.9E-25 8.5E-30  213.6  17.1  183  172-364    54-240 (278)
  3 COG0705 Membrane associated se  99.9 8.9E-24 1.9E-28  198.7  13.7  187  171-364    17-211 (228)
  4 KOG2980|consensus               99.9 2.9E-24 6.3E-29  205.5   5.0  211   15-370    96-308 (310)
  5 COG5208 HAP5 CCAAT-binding fac  99.9 5.2E-23 1.1E-27  186.9   7.5  108   86-194    82-189 (286)
  6 PF01694 Rhomboid:  Rhomboid fa  99.8 2.3E-21 4.9E-26  168.7   5.9  141  216-365     3-143 (145)
  7 KOG2289|consensus               99.8 1.3E-21 2.9E-26  190.9   3.0  142  213-364   112-254 (316)
  8 KOG1657|consensus               99.7 1.6E-18 3.5E-23  163.5   2.4  104   91-196    53-156 (236)
  9 KOG2632|consensus               99.6 1.9E-15 4.1E-20  142.9  12.5  182  163-360     7-194 (258)
 10 KOG2290|consensus               99.6 5.9E-16 1.3E-20  154.5   3.6  136  218-364   450-585 (652)
 11 PF00808 CBFD_NFYB_HMF:  Histon  99.2 3.8E-11 8.2E-16   91.7   5.8   64  114-177     2-65  (65)
 12 KOG1659|consensus               99.0 2.9E-10 6.4E-15  104.0   6.2   84  109-192     8-91  (224)
 13 COG5247 BUR6 Class 2 transcrip  99.0 1.2E-09 2.6E-14   88.9   6.0   79  114-192    23-101 (113)
 14 PF08551 DUF1751:  Eukaryotic i  98.4 1.9E-07 4.1E-12   77.4   3.6   82  218-299     7-96  (99)
 15 KOG1658|consensus               98.3 3.5E-07 7.6E-12   80.2   3.2   82  111-192    56-137 (162)
 16 PF04511 DER1:  Der1-like famil  98.2 1.7E-05 3.8E-10   73.4  12.6  137  216-361    39-179 (197)
 17 KOG2890|consensus               98.1 1.3E-05 2.8E-10   77.8   8.1  154  212-370    58-221 (326)
 18 KOG0858|consensus               97.8 7.8E-05 1.7E-09   70.4   8.6  132  215-361    49-184 (239)
 19 COG2036 HHT1 Histones H3 and H  96.2  0.0092   2E-07   48.7   4.9   74  105-179    10-83  (91)
 20 cd00074 H2A Histone 2A; H2A is  95.8   0.036 7.8E-07   47.2   7.1   74  115-188    21-94  (115)
 21 PF00125 Histone:  Core histone  95.5   0.037 8.1E-07   42.8   5.6   64  114-177     8-72  (75)
 22 KOG0869|consensus               95.2   0.052 1.1E-06   48.2   6.2   72  113-184    31-103 (168)
 23 smart00803 TAF TATA box bindin  95.2   0.058 1.3E-06   41.3   5.8   63  114-177     2-64  (65)
 24 cd00076 H4 Histone H4, one of   95.2   0.067 1.5E-06   43.2   6.3   67  111-178    10-76  (85)
 25 PLN00035 histone H4; Provision  94.8   0.093   2E-06   43.8   6.3   66  113-179    28-93  (103)
 26 PTZ00015 histone H4; Provision  94.2    0.16 3.4E-06   42.4   6.4   68  110-178    26-93  (102)
 27 COG5262 HTA1 Histone H2A [Chro  93.8    0.21 4.5E-06   42.3   6.4   78  111-188    23-100 (132)
 28 COG5291 Predicted membrane pro  93.5    0.27 5.8E-06   47.1   7.3   45  212-256    53-99  (313)
 29 smart00417 H4 Histone H4.       93.2    0.24 5.2E-06   39.0   5.5   63  111-174    10-72  (74)
 30 cd07981 TAF12 TATA Binding Pro  92.0    0.53 1.2E-05   36.6   6.1   64  116-179     3-66  (72)
 31 smart00414 H2A Histone 2A.      90.0    0.96 2.1E-05   38.0   6.1   73  115-187    10-82  (106)
 32 cd08048 TAF11 TATA Binding Pro  89.9     1.2 2.6E-05   36.0   6.4   71  106-177     8-81  (85)
 33 KOG4463|consensus               89.6    0.24 5.3E-06   47.8   2.5   60  217-277    49-108 (323)
 34 KOG0870|consensus               87.4     1.9 4.1E-05   38.9   6.5   76  114-189    10-87  (172)
 35 PTZ00017 histone H2A; Provisio  86.5     1.9 4.2E-05   37.7   6.0   73  115-187    28-100 (134)
 36 PF09415 CENP-X:  CENP-S associ  85.9     2.1 4.7E-05   33.4   5.4   60  116-175     1-63  (72)
 37 KOG1756|consensus               84.7     3.6 7.7E-05   35.6   6.6   77  113-189    26-102 (131)
 38 PF13150 DUF3989:  Protein of u  83.7    0.66 1.4E-05   37.5   1.7   68   18-87      4-76  (85)
 39 PLN00154 histone H2A; Provisio  83.1     3.3 7.1E-05   36.3   5.8   74  115-188    39-113 (136)
 40 KOG0871|consensus               81.2     5.7 0.00012   35.2   6.6   77  110-186     8-85  (156)
 41 COG5150 Class 2 transcription   79.6     5.1 0.00011   34.6   5.6   84  109-192     6-93  (148)
 42 smart00428 H3 Histone H3.       78.2     8.5 0.00018   32.3   6.5   81   97-177    15-98  (105)
 43 PLN00156 histone H2AX; Provisi  77.3     6.1 0.00013   34.8   5.6   72  115-186    30-101 (139)
 44 PLN00157 histone H2A; Provisio  77.0     6.8 0.00015   34.2   5.8   70  115-184    27-96  (132)
 45 PLN00153 histone H2A; Provisio  76.8       6 0.00013   34.4   5.4   70  115-184    25-94  (129)
 46 PF03419 Peptidase_U4:  Sporula  67.7      57  0.0012   31.9  10.7   40  233-276    11-50  (293)
 47 COG4769 Predicted membrane pro  66.3      67  0.0014   29.3   9.7  119  237-364    41-163 (181)
 48 smart00576 BTP Bromodomain tra  65.3      14 0.00031   28.8   4.8   55  126-180    17-71  (77)
 49 PF04719 TAFII28:  hTAFII28-lik  61.8      21 0.00045   29.2   5.2   73  105-177    14-87  (90)
 50 cd08050 TAF6 TATA Binding Prot  61.4      17 0.00037   36.6   5.8   61  118-179     3-63  (343)
 51 cd07979 TAF9 TATA Binding Prot  59.3      28 0.00061   29.6   5.9   61  118-179     5-65  (117)
 52 PF02969 TAF:  TATA box binding  58.7      22 0.00048   27.3   4.6   49  129-177    17-65  (66)
 53 PLN00161 histone H3; Provision  57.4      38 0.00082   29.7   6.4   81   97-177    41-123 (135)
 54 TIGR02854 spore_II_GA sigma-E   57.1   1E+02  0.0022   30.3  10.2   41  232-276    10-50  (288)
 55 COG4603 ABC-type uncharacteriz  54.3 2.2E+02  0.0048   29.0  12.3  123  170-299   142-291 (356)
 56 PTZ00252 histone H2A; Provisio  54.3      35 0.00077   29.9   5.7   70  115-184    26-97  (134)
 57 PF13105 DUF3959:  Protein of u  53.4      66  0.0014   29.8   7.5   19  346-364   134-152 (239)
 58 PF04632 FUSC:  Fusaric acid re  50.3 3.3E+02  0.0072   29.3  13.8   35  347-381   474-511 (650)
 59 PF03847 TFIID_20kDa:  Transcri  48.3      57  0.0012   25.1   5.5   60  119-178     4-63  (68)
 60 TIGR00834 ae anion exchange pr  46.8 2.2E+02  0.0047   32.7  11.8   64  232-299   376-440 (900)
 61 PF09011 HMG_box_2:  HMG-box do  46.2     4.3 9.4E-05   31.1  -1.1   43   15-64     29-71  (73)
 62 KOG1658|consensus               45.5      24 0.00053   31.6   3.4   45  110-154     7-51  (162)
 63 PF06728 PIG-U:  GPI transamida  44.7      40 0.00088   34.4   5.4   88  211-309   254-342 (382)
 64 PF03540 TFIID_30kDa:  Transcri  43.5      74  0.0016   23.2   5.1   48  114-162     2-49  (51)
 65 smart00398 HMG high mobility g  43.5     6.9 0.00015   28.9  -0.3   40   17-63     28-67  (70)
 66 cd01390 HMGB-UBF_HMG-box HMGB-  40.9     7.1 0.00015   28.7  -0.6   24   17-40     27-50  (66)
 67 PTZ00199 high mobility group p  38.3     9.4  0.0002   31.1  -0.3   41   18-65     52-92  (94)
 68 cd01388 SOX-TCF_HMG-box SOX-TC  36.9     7.7 0.00017   29.6  -1.0   42   17-65     28-69  (72)
 69 cd01389 MATA_HMG-box MATA_HMG-  36.9     7.3 0.00016   30.1  -1.1   44   17-67     28-71  (77)
 70 PF15510 CENP-W:  Centromere ki  35.4      84  0.0018   25.8   4.7   63  113-176    15-92  (102)
 71 PF12805 FUSC-like:  FUSC-like   34.7 2.4E+02  0.0051   27.3   8.8   12  344-355    70-81  (284)
 72 COG0705 Membrane associated se  33.9      29 0.00063   32.3   2.2   69  216-302   137-205 (228)
 73 PLN00160 histone H3; Provision  33.8 1.4E+02  0.0031   24.7   6.0   80   97-176     7-88  (97)
 74 PF00505 HMG_box:  HMG (high mo  33.7     6.9 0.00015   29.1  -1.7   25   16-40     26-50  (69)
 75 PF10979 DUF2786:  Protein of u  33.7      49  0.0011   23.1   2.8   35  117-151     4-38  (43)
 76 PLN00121 histone H3; Provision  33.6 1.2E+02  0.0026   26.7   5.7   80   97-176    48-128 (136)
 77 PF09527 ATPase_gene1:  Putativ  33.2 1.8E+02   0.004   20.8   6.3   41  234-274     9-50  (55)
 78 PF06609 TRI12:  Fungal trichot  32.3 1.9E+02  0.0042   31.6   8.4   41  233-274    86-126 (599)
 79 PF07524 Bromo_TP:  Bromodomain  31.4 1.5E+02  0.0033   22.7   5.7   55  125-179    16-70  (77)
 80 COG3105 Uncharacterized protei  31.1      81  0.0018   27.5   4.2   24  341-364     4-27  (138)
 81 COG0370 FeoB Fe2+ transport sy  30.8   4E+02  0.0087   29.5  10.4   71  292-364   393-464 (653)
 82 PTZ00018 histone H3; Provision  30.4 1.5E+02  0.0032   26.2   5.8   79   97-176    48-128 (136)
 83 cd08045 TAF4 TATA Binding Prot  28.6 1.9E+02   0.004   27.0   6.7   66  118-184    52-123 (212)
 84 PF14898 DUF4491:  Domain of un  28.4      90   0.002   25.7   3.9   44  249-302    26-69  (94)
 85 PF15511 CENP-T:  Centromere ki  27.8      72  0.0016   33.1   4.1   67  106-172   343-414 (414)
 86 KOG1172|consensus               26.0 8.8E+02   0.019   27.8  12.1   59  236-298   371-430 (876)
 87 cd00084 HMG-box High Mobility   24.5      21 0.00045   25.9  -0.4   25   16-40     26-50  (66)
 88 KOG1142|consensus               24.4 1.1E+02  0.0023   29.9   4.2   71  107-177   147-217 (258)
 89 PRK09554 feoB ferrous iron tra  22.8 6.3E+02   0.014   28.5  10.5   51  312-364   425-475 (772)
 90 PF14241 DUF4341:  Domain of un  22.7 1.5E+02  0.0032   22.3   3.9   34  262-299     6-39  (62)
 91 PF04892 VanZ:  VanZ like famil  22.1 4.6E+02  0.0099   21.7   9.2   28  341-368   105-132 (133)
 92 PF04632 FUSC:  Fusaric acid re  21.9 3.3E+02  0.0071   29.3   8.0   31  341-371   464-494 (650)
 93 COG4956 Integral membrane prot  21.8 3.1E+02  0.0067   27.7   6.9   86  259-349    49-140 (356)
 94 smart00427 H2B Histone H2B.     21.0 2.7E+02  0.0059   22.7   5.3   60  120-180     7-67  (89)

No 1  
>PRK10907 intramembrane serine protease GlpG; Provisional
Probab=99.93  E-value=2.4e-25  Score=215.26  Aligned_cols=177  Identities=12%  Similarity=0.170  Sum_probs=126.9

Q ss_pred             hhhHHHHHHHHHHHHHHHhhcccccccccCCCCccccccccccCC-CCccccceeecccccCChhHHHHHHHHHHHHHHH
Q psy8269         170 RNDIAMAITKYDMFDFLIDIVPREEVKTATGPHRRDDATKTALNP-DQYLSVPMLLSTFSHQSPLHIFANMFVLHSFMPS  248 (386)
Q Consensus       170 ~~DIa~aI~~i~i~~FL~~ivp~~~~~~~~~p~~~~~~~~~~l~p-~~g~~wrllTs~FlH~~~~HLl~Nm~~L~~fG~~  248 (386)
                      ..+++..++.+|+.+|+.+.+.+.......        ..+..++ .++|+||++|++|+|.|+.|+++||+.+|.+|..
T Consensus        93 ~~p~T~~li~i~i~vf~l~~~~~~~~~~~~--------l~~~~~~~~~~q~WRl~T~~flH~~~~Hl~fNml~l~~lG~~  164 (276)
T PRK10907         93 AGPLTLGVMIACVVVFILMQILGDQTVMLW--------LAWPFDPSLKFELWRYFTHALLHFSLLHILFNLLWWWYLGGA  164 (276)
T ss_pred             CCCHHHHHHHHHHHHHHHHHHhccHHHHHH--------HhccccccccCCcHHHHhHHHHhCCHHHHHHHHHHHHHHHHH
Confidence            456888999999999999877553211000        0011111 3679999999999999999999999999999999


Q ss_pred             HHhhhchhHHHHHHHHHHHHHHHHHHHHHhhcCCCcceechHHHHHHHHHHHHHHcCCCcceeEechhhhHHHHHHHHHH
Q psy8269         249 SVEDLGKEQFVGFYLTAGVVASLLSYVHKILVRKPGLSIGASGAIMAVLAHTCITHPDTELGILFVPYVRFSAEHAIQGI  328 (386)
Q Consensus       249 lE~~~G~~rfl~lyl~sgv~gsl~s~l~~~~~~~~~~~vGASGaI~Gll~a~~~~~p~~~~~l~fip~~~~~a~~~~~~~  328 (386)
                      +|+.+|+++++.+|++++++|+++++++.     ....+|+||+|||++++......+.+.....+|.   ....++.++
T Consensus       165 iE~~~G~~~~l~l~l~s~i~~~~~~~~~~-----~~~~gGaSGvVygL~g~~~~~~~~~p~~~~~lp~---~~~~f~llw  236 (276)
T PRK10907        165 VEKRLGSGKLIVITLISALLSGWVQSKFS-----GPWFGGLSGVVYALMGYVWLRGERDPQSGIYLPR---GLIAFALLW  236 (276)
T ss_pred             HHHHHChHHHHHHHHHHHHHHHHHHHHHc-----cchhhHHHHHHHHHHHHHHHHhccccccchhhhH---HHHHHHHHH
Confidence            99999999999999999999999988765     3467899999999999876543222221111221   111122222


Q ss_pred             HHHHHHHhhhcCchhHHHHHHHHHHHHHHHHHHhhh
Q psy8269         329 MLLDFLGVLFRWRLFDHAAHLGGALFGILYSKYGEQ  364 (386)
Q Consensus       329 l~~~~~~~~~~~~~i~~~aHLgG~L~G~l~~~~~~~  364 (386)
                      +.+...+. + ..+|++.||+||+++|+++++...+
T Consensus       237 l~~g~~~~-~-g~~Ian~AHlgGli~Gll~g~~~~~  270 (276)
T PRK10907        237 LVAGYFDL-F-GMSIANAAHVAGLAVGLAMAFWDTR  270 (276)
T ss_pred             HHHHHHHc-c-CcccHHHHHHHHHHHHHHHHHHhhh
Confidence            22222221 2 3689999999999999999987654


No 2  
>PTZ00101 rhomboid-1 protease; Provisional
Probab=99.93  E-value=3.9e-25  Score=213.62  Aligned_cols=183  Identities=22%  Similarity=0.216  Sum_probs=122.6

Q ss_pred             hHHHHHHHHHHHHHHHhhcccccccccCCCCc--cccc-cccccCCCCccccceeecccccCChhHHHHHHHHHHHHHHH
Q psy8269         172 DIAMAITKYDMFDFLIDIVPREEVKTATGPHR--RDDA-TKTALNPDQYLSVPMLLSTFSHQSPLHIFANMFVLHSFMPS  248 (386)
Q Consensus       172 DIa~aI~~i~i~~FL~~ivp~~~~~~~~~p~~--~~~~-~~~~l~p~~g~~wrllTs~FlH~~~~HLl~Nm~~L~~fG~~  248 (386)
                      -+...|+.++++.|+.++.-+.+  ...+|..  .... ........++|+||++|++|+|.|+.|+++||++++.+|..
T Consensus        54 ~l~~~Iiii~iivfil~l~~~~~--~~l~p~~~~L~~~Ga~~~~~i~~gq~WRLiT~~FlH~~~~HLl~Nm~~l~~~G~~  131 (278)
T PTZ00101         54 SFIMAISIIQIIVFIISVSIKPA--DFLTPSDSLLVTLGANVASRIKQGEIHRLILPIFLHANIFHTFFNVFFQLRMGFT  131 (278)
T ss_pred             HHHHHHHHHHHHHHHHHHHhccc--ccCCCCHHHHHHHhCcchhhhhcCCCHHHHHHHHHccCHHHHHHHHHHHHHHHHH
Confidence            34567788888888887763211  1111110  0000 00111123789999999999999999999999999999999


Q ss_pred             HHhhhchhHHHHHHHHHHHHHHHHHHHHHhhcCCCcceechHHHHHHHHHHHHHHcCCCcceeEechhhhHHHHHHHHHH
Q psy8269         249 SVEDLGKEQFVGFYLTAGVVASLLSYVHKILVRKPGLSIGASGAIMAVLAHTCITHPDTELGILFVPYVRFSAEHAIQGI  328 (386)
Q Consensus       249 lE~~~G~~rfl~lyl~sgv~gsl~s~l~~~~~~~~~~~vGASGaI~Gll~a~~~~~p~~~~~l~fip~~~~~a~~~~~~~  328 (386)
                      +|+.+|+.|++.+|+++|++|++++.++.    +...++||||++||++|++....-......   +. +......+..+
T Consensus       132 lE~~~G~~r~~ilYl~sGi~G~l~s~~~~----~~~~svGASgAifGLiGa~~~~lil~w~~~---~~-~~~~~~~~i~~  203 (278)
T PTZ00101        132 LEKNYGIVKIIILYFLTGIYGNILSSSVT----YCPIKVGASTSGMGLLGIVTSELILLWHVI---RH-RERVVFNIIFF  203 (278)
T ss_pred             HHHHHChHHHHHHHHHHHHHHHHHHHHHc----cCCcEEehhHHHHHHHHHHHHHHHHHHHhh---cc-HHHHHHHHHHH
Confidence            99999999999999999999999988775    345799999999999999764210000000   00 00000001111


Q ss_pred             HHHHHH-HhhhcCchhHHHHHHHHHHHHHHHHHHhhh
Q psy8269         329 MLLDFL-GVLFRWRLFDHAAHLGGALFGILYSKYGEQ  364 (386)
Q Consensus       329 l~~~~~-~~~~~~~~i~~~aHLgG~L~G~l~~~~~~~  364 (386)
                      +++.+. ......+++|+.||+||+++|++++..+.+
T Consensus       204 ~li~~~l~~~~~g~~Id~~aHlGG~i~G~llg~~~~~  240 (278)
T PTZ00101        204 SLISFFYYFTFNGSNIDHVGHLGGLLSGISMGILYNS  240 (278)
T ss_pred             HHHHHHHHHhcCCCCccHHHHHHHHHHHHHHHHHHHh
Confidence            111111 111224789999999999999999998765


No 3  
>COG0705 Membrane associated serine protease [Amino acid transport and metabolism]
Probab=99.91  E-value=8.9e-24  Score=198.71  Aligned_cols=187  Identities=23%  Similarity=0.277  Sum_probs=139.0

Q ss_pred             hhHHHHHHHHHHHHHHHhhcccccccccCCCCccccccccccCCC-------CccccceeecccccCChhHHHHHHHHHH
Q psy8269         171 NDIAMAITKYDMFDFLIDIVPREEVKTATGPHRRDDATKTALNPD-------QYLSVPMLLSTFSHQSPLHIFANMFVLH  243 (386)
Q Consensus       171 ~DIa~aI~~i~i~~FL~~ivp~~~~~~~~~p~~~~~~~~~~l~p~-------~g~~wrllTs~FlH~~~~HLl~Nm~~L~  243 (386)
                      ..++..+..+|+..|+.+.+...+........    .....+.|.       ..|+||++|++|+|.|+.|+++||+.+|
T Consensus        17 ~~~~~~~~~~~~~v~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~w~lit~~FlH~~~~Hll~N~~~l~   92 (228)
T COG0705          17 PPVTLFLILLNILVFLLELVLGWSAIFLLTFL----FRLFGLYPLNLLGALARDQLWRLITAIFLHAGFLHLLFNMLALW   92 (228)
T ss_pred             chHHHHHHHHHHHHHHHHHHccchHHHHHHHh----hhHHhhcchhhhccccccchHHHHHHHHHHhhHHHHHHHHHHHH
Confidence            56677888899999999887664311110000    000111111       1189999999999999999999999999


Q ss_pred             HHHHHHHhhhchhHHHHHHHHHHHHHHHHHHHHHhhcCCCcceechHHHHHHHHHHHHHHcCCCcceeEechhhhHHHHH
Q psy8269         244 SFMPSSVEDLGKEQFVGFYLTAGVVASLLSYVHKILVRKPGLSIGASGAIMAVLAHTCITHPDTELGILFVPYVRFSAEH  323 (386)
Q Consensus       244 ~fG~~lE~~~G~~rfl~lyl~sgv~gsl~s~l~~~~~~~~~~~vGASGaI~Gll~a~~~~~p~~~~~l~fip~~~~~a~~  323 (386)
                      .+|..+|+.+|+.+|+.+|+.+|+++++++..+....  ..+++||||+++|++++++...|..+....... .+..+..
T Consensus        93 ~fg~~le~~~G~~~f~~~yl~~gl~~~~~~~~~~~~~--~~~~~GASG~i~gllga~~~~~~~~~~~~~~~~-~~~~~~~  169 (228)
T COG0705          93 VFGSNLERRLGTLRFLLFYLLSGLLAGLAQVLFGPKG--GAPSLGASGAIFGLLGAYFLLFPFARILLLFLS-LPRPALI  169 (228)
T ss_pred             HhhHHHHHHhchhHHHHHHHHHHHHHHHHHHHHcccc--cCcccchhHHHHHHHHHHHHHccccchhhhhcc-CchhHHH
Confidence            9999999999999999999999999999988877321  258999999999999999999999877655411 2233444


Q ss_pred             HHHHHHHHHHHHhhhcC-chhHHHHHHHHHHHHHHHHHHhhh
Q psy8269         324 AIQGIMLLDFLGVLFRW-RLFDHAAHLGGALFGILYSKYGEQ  364 (386)
Q Consensus       324 ~~~~~l~~~~~~~~~~~-~~i~~~aHLgG~L~G~l~~~~~~~  364 (386)
                      ++.+.+..++....... ++|++.||++|+++|++++....+
T Consensus       170 ~i~~~~~~~~~~~~~~~~~~va~~aHl~G~i~G~l~~~~~~~  211 (228)
T COG0705         170 LILIWLLYSLFSGAGSFGPSVAWSAHLGGLIGGLLLAALLSR  211 (228)
T ss_pred             HHHHHHHHHHHHHhcCCchHHHHHHHHHHHHHHHHHHHHHhh
Confidence            44455555554433221 379999999999999999998776


No 4  
>KOG2980|consensus
Probab=99.89  E-value=2.9e-24  Score=205.46  Aligned_cols=211  Identities=35%  Similarity=0.525  Sum_probs=189.2

Q ss_pred             ccchhhHHHHHHHHhccCCCCCeeeehhhhHHHHHHHhhcCCccchHHHhhhccCCCCcchhhcccCCCccccccccccc
Q psy8269          15 SNDIPKWRKDLRKRWNALHPGDRVYVPILFLNGVVFLAWFYPRLHPVLYKYFASNPQSMGITLRLHAVSPEFDSRLGRSF   94 (386)
Q Consensus        15 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~   94 (386)
                      .++.+++||++.+.|..++||  ++.+++.+|..||..|++|..+-.|..|+.+++..++                    
T Consensus        96 ~~~f~~F~~~~~k~w~~~~~g--~v~~ll~~n~~vf~lWrv~~~~~~~~~~mls~~~~~t--------------------  153 (310)
T KOG2980|consen   96 FGGFTGFQKRGWKHWISGANG--VVFGLLIANAFVFTLWRVPQKQFTMIPWMLSRNAYKT--------------------  153 (310)
T ss_pred             cCCcccceecchHHHhhcCCc--chhHHHHHHHHHHHHHHhcchhhhhhhHHhhcccccc--------------------
Confidence            567889999999999999999  9999999999999999999999999999998876555                    


Q ss_pred             hhhhhHHhhhhhhhhhhcccCchHHHHHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHH
Q psy8269          95 FGSVLEVQGALSMLDLKTQALPLARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIA  174 (386)
Q Consensus        95 ~~~~~~~~~~~~~~~~~~~~~p~~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa  174 (386)
                                                                                                      
T Consensus       154 --------------------------------------------------------------------------------  153 (310)
T KOG2980|consen  154 --------------------------------------------------------------------------------  153 (310)
T ss_pred             --------------------------------------------------------------------------------
Confidence                                                                                            


Q ss_pred             HHHHHHHHHHHHHhhcccccccccCCCCccccccccccCCCCccccceeecccccCChhHHHHHHHHHHHHHH-HHHhhh
Q psy8269         175 MAITKYDMFDFLIDIVPREEVKTATGPHRRDDATKTALNPDQYLSVPMLLSTFSHQSPLHIFANMFVLHSFMP-SSVEDL  253 (386)
Q Consensus       175 ~aI~~i~i~~FL~~ivp~~~~~~~~~p~~~~~~~~~~l~p~~g~~wrllTs~FlH~~~~HLl~Nm~~L~~fG~-~lE~~~  253 (386)
                                                                 -+|.+++|.|+|.+.+|+..||+.++.+.. .+.-.+
T Consensus       154 -------------------------------------------~~w~i~~s~Fsh~~a~h~g~~~~~~~~y~~~a~~~~~  190 (310)
T KOG2980|consen  154 -------------------------------------------GCWKIILSTFSHYSALHLGPNMLVLKSYLAGALKGSL  190 (310)
T ss_pred             -------------------------------------------cceeEEeehhcchhHhhhcHHHHHHHHHhcccccCCc
Confidence                                                       378999999999999999999999999988 888889


Q ss_pred             chhHHHHHHHHHHHHHHHHHHHHHhhcCCCcceechHHHHHHHHHHHHHHcCCCcceeEechhhhHHHHHHHHHHHHHHH
Q psy8269         254 GKEQFVGFYLTAGVVASLLSYVHKILVRKPGLSIGASGAIMAVLAHTCITHPDTELGILFVPYVRFSAEHAIQGIMLLDF  333 (386)
Q Consensus       254 G~~rfl~lyl~sgv~gsl~s~l~~~~~~~~~~~vGASGaI~Gll~a~~~~~p~~~~~l~fip~~~~~a~~~~~~~l~~~~  333 (386)
                      |...+.++|+++++.|......-.....+.++++||||++|++++..+.++|+.+..+.|++..+..+...+..++.+++
T Consensus       191 ~~~~~~AlylSa~~~~~~i~~~~~v~~~~~gp~LGAsGav~ai~a~~~~lfP~~~~~i~f~~~v~~ga~~~~~~i~~~~~  270 (310)
T KOG2980|consen  191 GFSSFFALYLSAGVKGLFISVKDKVPTSWAGPSLGASGAVYAILALDCTLFPKTTLYILFVFPVPAGAGLAFKAIAAYDF  270 (310)
T ss_pred             chhhcccceeccccccceeEeeccccccccccccccchHHHHHHHHHhhcCcCcceeEEEeecccccchhHHHHHHHhhh
Confidence            99999999998888887766655444456689999999999999999999999999999988888888888889999999


Q ss_pred             HHhhhcCchhHHHHHHHHHHHHHHHHHHhhh-cccccc
Q psy8269         334 LGVLFRWRLFDHAAHLGGALFGILYSKYGEQ-TWAHRA  370 (386)
Q Consensus       334 ~~~~~~~~~i~~~aHLgG~L~G~l~~~~~~~-~w~~r~  370 (386)
                      .+...++...+|+||++|.++|++++.++.. +|++|.
T Consensus       271 a~~~l~~~~~n~~Ah~~gsl~Gv~va~~~~~ri~kgR~  308 (310)
T KOG2980|consen  271 AGLILGWGFFNHAAHLSGSLFGVVVATYLWARIRKGRF  308 (310)
T ss_pred             cceeeccccchhHhhhcchHHHHHHHHHHHHHHHcCcc
Confidence            9988899999999999999999999999744 666664


No 5  
>COG5208 HAP5 CCAAT-binding factor, subunit C [Transcription]
Probab=99.88  E-value=5.2e-23  Score=186.93  Aligned_cols=108  Identities=57%  Similarity=0.905  Sum_probs=101.4

Q ss_pred             ccccccccchhhhhHHhhhhhhhhhhcccCchHHHHHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccc
Q psy8269          86 FDSRLGRSFFGSVLEVQGALSMLDLKTQALPLARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKR  165 (386)
Q Consensus        86 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr  165 (386)
                      +++...|.|=+++.+.++.+ +.++|.+.+|++||||+||.|+||+||++|+|+||+|+||.||.|+|.|+|..++++||
T Consensus        82 ~~e~i~ryWq~ti~~~e~~~-q~~~k~h~LPlARIkkvMKtdedVkMisaEaPvlFak~~EiFI~ELTmRAW~~ae~NkR  160 (286)
T COG5208          82 LDERISRYWQQTIKAAEEER-QILLKDHNLPLARIKKVMKTDEDVKMISAEAPVLFAKITEIFIEELTMRAWINAEENKR  160 (286)
T ss_pred             HHHHHHHHHHHHHHHHHHhH-HHHHHhccCcHHHHHHHHhcccchhheecccchHHHHHHHHHHHHHHHHHHHHHhHhhh
Confidence            67788887777888887755 58899999999999999999999999999999999999999999999999999999999


Q ss_pred             cccchhhHHHHHHHHHHHHHHHhhccccc
Q psy8269         166 RTLQRNDIAMAITKYDMFDFLIDIVPREE  194 (386)
Q Consensus       166 ~~iq~~DIa~aI~~i~i~~FL~~ivp~~~  194 (386)
                      |++|++||+.++...++||||++++|+..
T Consensus       161 RtLQksDia~Av~kSeMfDFLidivpr~p  189 (286)
T COG5208         161 RTLQKSDIAAAVKKSEMFDFLIDIVPRNP  189 (286)
T ss_pred             hHHHHHHHHHHHHHHHHHhHHhhhccCCC
Confidence            99999999999999999999999999964


No 6  
>PF01694 Rhomboid:  Rhomboid family;  InterPro: IPR022764 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This group of proteins contain serine peptidases belonging to the MEROPS peptidase family S54 (Rhomboid, clan ST). They are integral membrane proteins related to the Drosophila melanogaster (Fruit fly) rhomboid protein P20350 from SWISSPROT. Members of this family are found in archaea, bacteria and eukaryotes. The D. melanogaster rhomboid protease cleaves type-1 transmembrane domains using a catalytic triad composed of serine, histidine and asparagine contributed by different transmembrane domains. It cleaves the transmembrane proteins Spitz, Gurken and Keren within their transmembrane domains to release a soluble TGFalpha-like growth factor. Cleavage occurs in the Golgi, following translocation of the substrates from the endoplasmic reticulum membrane by Star, another transmembrane protein. The growth factors are then able to activate the epidermal growth factor receptor [, ]. Few substrates of mammalian rhomboid homologues have been determined, but rhomboid-like protein 2 (MEROPS S54.002) has been shown to cleave ephrin B3 []. Parasite-encoded rhomboid enzymes are also important for invasion of host cells by Toxoplasma and the malaria parasite.  In Saccharomyces cerevisiae (Baker's yeast) the Pcp1 (MDM37) protein (MEROPS S54.007) is a mitochondrial endopeptidase required for the activation of cytochrome c peroxidase and for the processing of the mitochondrial dynamin-like protein Mgm1 [, ]. Mutations in Pcp1 result in cells have fragmented mitochondria, which have very few short tubulues []. This entry represents the 6 transmembrane helix rhomboid domain.; GO: 0004252 serine-type endopeptidase activity, 0016021 integral to membrane; PDB: 3UBB_A 3B45_A 3B44_A 2NRF_A 3TXT_A 2O7L_A 2XTU_A 2IRV_A 2XOW_A 2XTV_A ....
Probab=99.84  E-value=2.3e-21  Score=168.72  Aligned_cols=141  Identities=26%  Similarity=0.336  Sum_probs=103.0

Q ss_pred             CccccceeecccccCChhHHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHHHHHHHHHHHhhcCCCcceechHHHHHH
Q psy8269         216 QYLSVPMLLSTFSHQSPLHIFANMFVLHSFMPSSVEDLGKEQFVGFYLTAGVVASLLSYVHKILVRKPGLSIGASGAIMA  295 (386)
Q Consensus       216 ~g~~wrllTs~FlH~~~~HLl~Nm~~L~~fG~~lE~~~G~~rfl~lyl~sgv~gsl~s~l~~~~~~~~~~~vGASGaI~G  295 (386)
                      ++|+||++|++|+|.|+.|+++|++.++.+|..+|+.+|+.+++.+|+++++++++.+.++..   ...+.+|+||+++|
T Consensus         3 ~~~~wrl~T~~f~h~~~~hl~~n~~~l~~~g~~lE~~~G~~~~~~~~l~~~~~~~l~~~~~~~---~~~~~~G~Sg~~~~   79 (145)
T PF01694_consen    3 NGQWWRLFTSPFVHANFLHLLFNLLALWFFGSLLERRLGSRRFLALYLLSGLLGSLLSLLFSP---PNQPYVGASGAVFG   79 (145)
T ss_dssp             CC-TTHHHHGGG--SSHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----S-----SSHHHHHH
T ss_pred             CCcchhhhHHHHHccCHHHHHHHHHHHHHhhhhHhhhccchHHHHHHHHHHHhhhhccccccc---cccccCCCcccchH
Confidence            569999999999999999999999999999999999999999999999999999999988772   22279999999999


Q ss_pred             HHHHHHHHcCCCcceeEechhhhHHHHHHHHHHHHHHHHHhhhcCchhHHHHHHHHHHHHHHHHHHhhhc
Q psy8269         296 VLAHTCITHPDTELGILFVPYVRFSAEHAIQGIMLLDFLGVLFRWRLFDHAAHLGGALFGILYSKYGEQT  365 (386)
Q Consensus       296 ll~a~~~~~p~~~~~l~fip~~~~~a~~~~~~~l~~~~~~~~~~~~~i~~~aHLgG~L~G~l~~~~~~~~  365 (386)
                      ++++.+...|+.+........    ........+........  .+++++.+|++|+++|++++...+++
T Consensus        80 l~~~~~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~--~~~~~~~~hl~G~~~G~~~~~~~~~~  143 (145)
T PF01694_consen   80 LLGAFLFLYPQNKKRLRFIYL----ALVVPIIVLVIILLLGF--IPNISFLGHLGGFLAGLLYGFLILRR  143 (145)
T ss_dssp             HHHHHHHHHHCCCCCS---HC----CCCCCCCCCCHHHCTSS--SSTTTHHHHHHHHHHHHHHHHHHCH-
T ss_pred             HHHHHHHHHhhccchhhcchH----HHHHHHHHHHHHHHHHH--HHhHHHHHHHHHHHHHHHHHHHHHHc
Confidence            999999988877544332000    00000001111111111  57999999999999999999987653


No 7  
>KOG2289|consensus
Probab=99.83  E-value=1.3e-21  Score=190.93  Aligned_cols=142  Identities=23%  Similarity=0.213  Sum_probs=109.2

Q ss_pred             CCC-CccccceeecccccCChhHHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHHHHHHHHHHHhhcCCCcceechHH
Q psy8269         213 NPD-QYLSVPMLLSTFSHQSPLHIFANMFVLHSFMPSSVEDLGKEQFVGFYLTAGVVASLLSYVHKILVRKPGLSIGASG  291 (386)
Q Consensus       213 ~p~-~g~~wrllTs~FlH~~~~HLl~Nm~~L~~fG~~lE~~~G~~rfl~lyl~sgv~gsl~s~l~~~~~~~~~~~vGASG  291 (386)
                      .|. ++|+||++|++|+|+|+.||++||+.+.++|..+|..+|..|+.++|++||++||+++.++.    +..++|||||
T Consensus       112 ~~~~r~E~WRllTym~LHaGi~HL~~N~~~ql~iGi~LE~~~G~~RiglIYl~gg~aGSlls~l~d----~~~~sVGASg  187 (316)
T KOG2289|consen  112 KPVHRGELWRLLTYMWLHAGIFHLLLNMLSQLFIGIPLEQVHGFLRIGLIYLAGGVAGSLLSSLFD----PNSISVGASG  187 (316)
T ss_pred             ChhhhchhHHHHHHHHHhcCHHHHHHHHHHHHhccccHHhhcCceEEeeehhhhhhhhHHHHHHhc----cCCceecccH
Confidence            444 78999999999999999999999999999999999999999999999999999999999987    5778999999


Q ss_pred             HHHHHHHHHHHHcCCCcceeEechhhhHHHHHHHHHHHHHHHHHhhhcCchhHHHHHHHHHHHHHHHHHHhhh
Q psy8269         292 AIMAVLAHTCITHPDTELGILFVPYVRFSAEHAIQGIMLLDFLGVLFRWRLFDHAAHLGGALFGILYSKYGEQ  364 (386)
Q Consensus       292 aI~Gll~a~~~~~p~~~~~l~fip~~~~~a~~~~~~~l~~~~~~~~~~~~~i~~~aHLgG~L~G~l~~~~~~~  364 (386)
                      ++|||+|+....--..+-...    -+..+...+..++.+++..  ...+.+++++|+||++.|..+++....
T Consensus       188 gvfaLlgA~Ls~l~~Nw~~m~----~~~~~l~~ll~Ii~i~l~~--G~~~~~~~~~h~gg~~~G~~~~fil~~  254 (316)
T KOG2289|consen  188 GVFALLGAHLSNLLTNWTIMK----NKFAALRTLLIIIFINLDL--GFAPYVDNFAHIGGLLAGFLLGFVLHI  254 (316)
T ss_pred             HHHHHHHHHHHHHHhhHHHhc----chHHHHHHHHHHHHHHHhh--ccccceeccccccccCCCcchhHHhhh
Confidence            999999998753211111000    0122223333333333321  123678889999999999999987664


No 8  
>KOG1657|consensus
Probab=99.72  E-value=1.6e-18  Score=163.54  Aligned_cols=104  Identities=59%  Similarity=0.891  Sum_probs=94.5

Q ss_pred             cccchhhhhHHhhhhhhhhhhcccCchHHHHHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccch
Q psy8269          91 GRSFFGSVLEVQGALSMLDLKTQALPLARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQR  170 (386)
Q Consensus        91 ~~~~~~~~~~~~~~~~~~~~~~~~~p~~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~  170 (386)
                      .++||....  .+.+...|++.+.+|++|||||||.|++|+||++|||++++||||+|+.+++.|+|.++++.+|+++++
T Consensus        53 l~~fw~~~~--~e~e~~~d~~~~~lPlaRiKkimK~dedv~mI~~Eapvl~aka~E~Fi~elt~~sw~~Tee~~rrtl~~  130 (236)
T KOG1657|consen   53 LQSFWSKQA--IEKEGQLDFKNHILPLARIKKIMKSDEDVSMITAEAPVLFAKACELFITELTLRSWVHTEENKRRTLQK  130 (236)
T ss_pred             HHhhhhccc--cccccccchhhccCcHhhccccccccccccccchhHHHHHHHHHHHHHHHHHHHhhhhhcccccccchH
Confidence            356777444  224567899999999999999999999999999999999999999999999999999999999999999


Q ss_pred             hhHHHHHHHHHHHHHHHhhccccccc
Q psy8269         171 NDIAMAITKYDMFDFLIDIVPREEVK  196 (386)
Q Consensus       171 ~DIa~aI~~i~i~~FL~~ivp~~~~~  196 (386)
                      +|++.++....+|+||+|++|.....
T Consensus       131 sdia~av~~s~~fdFL~DivP~~~~~  156 (236)
T KOG1657|consen  131 SDIAAAVTQSETFDFLRDIVPRKILA  156 (236)
T ss_pred             HHHHHHhccCCCccceeccccchhcc
Confidence            99999999999999999999997544


No 9  
>KOG2632|consensus
Probab=99.64  E-value=1.9e-15  Score=142.93  Aligned_cols=182  Identities=14%  Similarity=0.140  Sum_probs=134.4

Q ss_pred             ccccccchhhHHHHHHHHHHHHHHHhhcccccccccCCCCccccccccccCCCCccccceeecccccCChhHHHHHHHHH
Q psy8269         163 NKRRTLQRNDIAMAITKYDMFDFLIDIVPREEVKTATGPHRRDDATKTALNPDQYLSVPMLLSTFSHQSPLHIFANMFVL  242 (386)
Q Consensus       163 ~kr~~iq~~DIa~aI~~i~i~~FL~~ivp~~~~~~~~~p~~~~~~~~~~l~p~~g~~wrllTs~FlH~~~~HLl~Nm~~L  242 (386)
                      ..+.+......+.++...|+..|++-..+...          +..........+.|.||++||+++|.+..|+++||+.+
T Consensus         7 ~~~~~~~~p~~ts~~~~~~~~i~lv~~~~~i~----------~~~~l~~~~l~~~ql~RL~Ty~l~H~s~~hllfnmlaL   76 (258)
T KOG2632|consen    7 VGQFWMKIPLLTSIVVVLAILIYLVSFFPGIV----------EVLGLPSELLINWQLYRLITYALVHLSLPHLLFNMLAL   76 (258)
T ss_pred             CccccccchHHHHHHHHHHHHHHHHhccchhh----------hHhcCCHHHhhhHHHHHHHHHHHHhccHHHHHHHHHHH
Confidence            34445666677888899999999998887642          11112223344679999999999999999999999999


Q ss_pred             HHHHHHHHhhhc-hhHHHHHHHHHHHHHHHHHHHHHhh--cCC---CcceechHHHHHHHHHHHHHHcCCCcceeEechh
Q psy8269         243 HSFMPSSVEDLG-KEQFVGFYLTAGVVASLLSYVHKIL--VRK---PGLSIGASGAIMAVLAHTCITHPDTELGILFVPY  316 (386)
Q Consensus       243 ~~fG~~lE~~~G-~~rfl~lyl~sgv~gsl~s~l~~~~--~~~---~~~~vGASGaI~Gll~a~~~~~p~~~~~l~fip~  316 (386)
                      |.+|...|+.+| +.+++.+....++.++++.+++...  ..+   ...++|.||..|++++..+...|.....++..  
T Consensus        77 ~~~g~~fE~~~G~t~~~l~~~~llalf~gIl~ll~~~~~~~~d~~~~~~a~G~s~v~Fam~~~~~~~sp~r~~~~fg~--  154 (258)
T KOG2632|consen   77 WPLGSQFERTHGTTVRILMFTVLLALFSGILYLLAYHVFLLSDLVYVEGAIGFSGVLFAMMAVLEVQSPVRSRSVFGL--  154 (258)
T ss_pred             HhchhHHHhhccceehHHHHHHHHHHHHHHHHHHHHHHHhhcchhhhcccccccHHHHHHHHHHhhcCcccchhhccc--
Confidence            999999999999 8899999999999999988877641  111   24579999999999999998888877433321  


Q ss_pred             hhHHHHHHHHHHHHHHHHHhhhcCchhHHHHHHHHHHHHHHHHH
Q psy8269         317 VRFSAEHAIQGIMLLDFLGVLFRWRLFDHAAHLGGALFGILYSK  360 (386)
Q Consensus       317 ~~~~a~~~~~~~l~~~~~~~~~~~~~i~~~aHLgG~L~G~l~~~  360 (386)
                      +..+++......++...    ...++.++.+|++|+++|+.+++
T Consensus       155 ~siP~~l~Pw~lLi~~~----~lvp~aSFlghl~GllvG~ay~~  194 (258)
T KOG2632|consen  155 FSIPIVLAPWALLIATQ----ILVPQASFLGHLCGLLVGYAYAF  194 (258)
T ss_pred             ccccHHHHHHHHHHHHH----HHccCchHHHHHHHHHHHHHHHH
Confidence            11122222222222221    11378999999999999999998


No 10 
>KOG2290|consensus
Probab=99.58  E-value=5.9e-16  Score=154.54  Aligned_cols=136  Identities=20%  Similarity=0.240  Sum_probs=108.2

Q ss_pred             cccceeecccccCChhHHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHHHHHHHHHHHhhcCCCcceechHHHHHHHH
Q psy8269         218 LSVPMLLSTFSHQSPLHIFANMFVLHSFMPSSVEDLGKEQFVGFYLTAGVVASLLSYVHKILVRKPGLSIGASGAIMAVL  297 (386)
Q Consensus       218 ~~wrllTs~FlH~~~~HLl~Nm~~L~~fG~~lE~~~G~~rfl~lyl~sgv~gsl~s~l~~~~~~~~~~~vGASGaI~Gll  297 (386)
                      |.+|++||.|+|+++.|++..|.+++.+...+|+..|+.|..++|++||+.||+++.++.    ++.+.||.||+-+|++
T Consensus       450 QfYRL~~SLFlHagviH~~vSi~FQm~vmrdlEkL~g~~riAIiy~~SGitGNLASAIFl----pY~~eVgPa~sQ~Gil  525 (652)
T KOG2290|consen  450 QFYRLWLSLFLHAGVIHLLVSICFQMTVMRDLEKLAGWHRIAIIYFLSGITGNLASAIFL----PYRAEVGPAGSQFGIL  525 (652)
T ss_pred             HHHHHHHHHHHHhhHHHHHHHHHHHHHHHHHHHHhhcchhhheeeecccccccchheeee----ccccccCCcccccchH
Confidence            789999999999999999999999999999999999999999999999999999999887    5789999999999999


Q ss_pred             HHHHHHcCCCcceeEechhhhHHHHHHHHHHHHHHHHHhhhcCchhHHHHHHHHHHHHHHHHHHhhh
Q psy8269         298 AHTCITHPDTELGILFVPYVRFSAEHAIQGIMLLDFLGVLFRWRLFDHAAHLGGALFGILYSKYGEQ  364 (386)
Q Consensus       298 ~a~~~~~p~~~~~l~fip~~~~~a~~~~~~~l~~~~~~~~~~~~~i~~~aHLgG~L~G~l~~~~~~~  364 (386)
                      +.+.+-....+..+-    .+..+..-+...+++..+|+   .|.||++||+.|+++|++..+.+..
T Consensus       526 a~l~vEl~qs~~il~----~~w~a~~~Lia~~L~L~iGl---iPWiDN~aHlfG~i~GLl~s~~~~P  585 (652)
T KOG2290|consen  526 ACLFVELFQSWQILE----RPWRAFFHLIATLLVLCIGL---IPWIDNWAHLFGTIFGLLTSIIFLP  585 (652)
T ss_pred             HHHHHHHHhhhHhhh----hHHHHHHHHHHHHHHHHhcc---ccchhhHHHHHHHHHHHHHHHHhhc
Confidence            998775444332111    11222111222222222333   3889999999999999999987654


No 11 
>PF00808 CBFD_NFYB_HMF:  Histone-like transcription factor (CBF/NF-Y) and archaeal histone;  InterPro: IPR003958 The CCAAT-binding factor (CBF) is a mammalian transcription factor that binds to a CCAAT motif in the promoters of a wide variety of genes, including type I collagen and albumin. The factor is a heteromeric complex of A and B subunits, both of which are required for DNA-binding [, ]. The subunits can interact in the absence of DNA-binding, conserved regions in each being important in mediating this interaction.  The A subunit can be split into 3 domains on the basis of sequence similarity, a non-conserved N-terminal 'A domain'; a highly-conserved central 'B domain' involved in DNA-binding; and a C-terminal 'C domain', which contains a number of glutamine and acidic residues involved in protein-protein interactions []. The A subunit shows striking similarity to the HAP3 subunit of the yeast CCAAT-binding heterotrimeric transcription factor [, ]. The Kluyveromyces lactis HAP3 protein has been predicted to contain a 4-cysteine zinc finger, which is thought to be present in similar HAP3 and CBF subunit A proteins, in which the third cysteine is replaced by a serine []. This domain is found in the CCAAT transcription factor and archaeal histones.; GO: 0043565 sequence-specific DNA binding, 0005622 intracellular; PDB: 1F1E_A 2BYM_D 2BYK_D 1HTA_A 1B67_A 1JFI_B 1KU5_B 1N1J_A 1BFM_A 1B6W_A ....
Probab=99.18  E-value=3.8e-11  Score=91.72  Aligned_cols=64  Identities=44%  Similarity=0.538  Sum_probs=59.0

Q ss_pred             cCchHHHHHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHH
Q psy8269         114 ALPLARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAI  177 (386)
Q Consensus       114 ~~p~~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI  177 (386)
                      .+|+++||||||.++++.++++|+..++.+++|.|+++++..++..+.+++|++++..||..++
T Consensus         2 ~lP~a~vkri~k~~~~~~~vs~ea~~~i~~a~e~Fi~~l~~~A~~~a~~~~rkti~~~Dv~~Av   65 (65)
T PF00808_consen    2 SLPLARVKRIMKSDPDVMRVSKEAVEAIAKAAEEFIQYLAKEANEIAQRDKRKTITYEDVAKAV   65 (65)
T ss_dssp             SS-HHHHHHHHHHTSTTSEE-HHHHHHHHHHHHHHHHHHHHHHHHHHHHTTSSEE-HHHHHHHH
T ss_pred             CCChHHHHHHhccCCCccchhHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCccCHHHHHHHC
Confidence            5899999999999999999999999999999999999999999999999999999999998875


No 12 
>KOG1659|consensus
Probab=99.04  E-value=2.9e-10  Score=104.02  Aligned_cols=84  Identities=29%  Similarity=0.479  Sum_probs=79.0

Q ss_pred             hhhcccCchHHHHHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHHHHHHHHHHHHh
Q psy8269         109 DLKTQALPLARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAITKYDMFDFLID  188 (386)
Q Consensus       109 ~~~~~~~p~~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI~~i~i~~FL~~  188 (386)
                      +......|.+|||||||.|++|.+|+.-+|+++++|.|+|++.+..++...++.++.++++.+.+..+|-....||||.+
T Consensus         8 ~~~~trfp~aRiKKIMQ~dEdIGKvaqavPViisralElFl~~l~~~t~~~t~~~~aKt~s~~hlkq~v~~~~~FdFLk~   87 (224)
T KOG1659|consen    8 KKYKTRFPPARIKKIMQSDEDIGKVAQAVPVIISRALELFLESLLQKTLEITRSRGAKTVSSSHLKQAVESDPKFDFLKE   87 (224)
T ss_pred             hhhhccCCHHHHHHHHhhhhhhhhhhhhhhhHHHHHHHHHHHHHHHHHHHHHHhcCccccCHHHHHHHHhccchhHHHHH
Confidence            34455689999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             hccc
Q psy8269         189 IVPR  192 (386)
Q Consensus       189 ivp~  192 (386)
                      ++..
T Consensus        88 ~v~~   91 (224)
T KOG1659|consen   88 VVEK   91 (224)
T ss_pred             HHHh
Confidence            7654


No 13 
>COG5247 BUR6 Class 2 transcription repressor NC2, alpha subunit (DRAP1 homolog) [Transcription]
Probab=98.96  E-value=1.2e-09  Score=88.91  Aligned_cols=79  Identities=27%  Similarity=0.463  Sum_probs=75.3

Q ss_pred             cCchHHHHHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHHHHHHHHHHHHhhccc
Q psy8269         114 ALPLARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAITKYDMFDFLIDIVPR  192 (386)
Q Consensus       114 ~~p~~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI~~i~i~~FL~~ivp~  192 (386)
                      -.|.+|+|||||+|+|+.++++..|++.+||.|+|++++...+-..++....+.++.+.+..++-+.+-||||.+++..
T Consensus        23 rFP~ar~KkIMQ~deDiGKV~q~tPVIaskalE~Fl~~iv~~s~k~aR~~~skR~t~e~lk~a~~sdekFdFL~~~~~~  101 (113)
T COG5247          23 RFPIARLKKIMQLDEDIGKVGQSTPVIASKALEMFLTEIVGLSLKEARKKSSKRMTSEFLKRATESDEKFDFLKNMEQF  101 (113)
T ss_pred             cCCHHHHHHHHHhhhhhhhhhhcchHHHHHHHHHHHHHHHHHHHHHHHhccchhhHHHHHHHHHhhhHHHHHHHHHHHh
Confidence            4799999999999999999999999999999999999999999999999999999999999999999999999998754


No 14 
>PF08551 DUF1751:  Eukaryotic integral membrane protein (DUF1751);  InterPro: IPR013861  This entry is found in eukaryotic integral membrane proteins. Q12239 from SWISSPROT, a Saccharomyces cerevisiae (Baker's yeast) protein, has been shown to localise COP II vesicles []. 
Probab=98.42  E-value=1.9e-07  Score=77.42  Aligned_cols=82  Identities=16%  Similarity=0.235  Sum_probs=67.6

Q ss_pred             cccceeecccccCChhHHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHHHHHHHHHHHhh----cCCC----cceech
Q psy8269         218 LSVPMLLSTFSHQSPLHIFANMFVLHSFMPSSVEDLGKEQFVGFYLTAGVVASLLSYVHKIL----VRKP----GLSIGA  289 (386)
Q Consensus       218 ~~wrllTs~FlH~~~~HLl~Nm~~L~~fG~~lE~~~G~~rfl~lyl~sgv~gsl~s~l~~~~----~~~~----~~~vGA  289 (386)
                      .+|+++|+.|++.+++.+++|...++..|+.+|+.||+++++.++++.++++|+...+....    .+..    .+.-|.
T Consensus         7 ~pWtl~T~~fve~~i~~~l~~~~~l~~~g~~lE~~WGs~E~lkFi~vv~~~tnl~~~~~~~~~y~i~~~~~~l~~~i~G~   86 (99)
T PF08551_consen    7 YPWTLFTAGFVETNIIGLLFSLLTLFYGGRYLEPIWGSREFLKFILVVNVITNLLTFLLYLLLYAITGNESYLFVPISGF   86 (99)
T ss_pred             ehHHHHHHHHHHhHHHHHHHHHHHHHHhhHHHHHhcChHHHHHHHHHHHHHhHHHHHHHHHHHHHHhCCCceeEEEecCc
Confidence            58999999999999999999999999999999999999999999999999998877665422    2222    345677


Q ss_pred             HHHHHHHHHH
Q psy8269         290 SGAIMAVLAH  299 (386)
Q Consensus       290 SGaI~Gll~a  299 (386)
                      .|.+.|++.+
T Consensus        87 ~~~~~g~lVa   96 (99)
T PF08551_consen   87 MGVLAGFLVA   96 (99)
T ss_pred             HHhHhheEEE
Confidence            7777776544


No 15 
>KOG1658|consensus
Probab=98.32  E-value=3.5e-07  Score=80.16  Aligned_cols=82  Identities=43%  Similarity=0.584  Sum_probs=77.2

Q ss_pred             hcccCchHHHHHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHHHHHHHHHHHHhhc
Q psy8269         111 KTQALPLARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAITKYDMFDFLIDIV  190 (386)
Q Consensus       111 ~~~~~p~~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI~~i~i~~FL~~iv  190 (386)
                      +..-+|+.|||.+|+++|++.|.-+|+..++.||.|.|++++...+++|+..+++.++++.|+.++|...+-+-|+-...
T Consensus        56 ~l~rLpL~rik~vvkl~pdl~l~~dea~~l~a~aaelfi~~Ln~~~~~~~q~~k~kt~qr~d~D~ai~~~de~~fle~~~  135 (162)
T KOG1658|consen   56 SLSRLPLARIKQVVKLDPDLTLLNDEASQLIAKAAELFIQELNDVAYTTAQLRKRKTEQRRDYDTAIEAVDEFAFLEGAL  135 (162)
T ss_pred             hhhhccHHHHHhhccCCcchhhhhhHHHHHHHHHHHHHHHHHHhccchhHHHHHhhhhhhhcccccccchHHHHHHhhhc
Confidence            34458999999999999999999999999999999999999999999999999999999999999999999999998776


Q ss_pred             cc
Q psy8269         191 PR  192 (386)
Q Consensus       191 p~  192 (386)
                      +.
T Consensus       136 d~  137 (162)
T KOG1658|consen  136 DT  137 (162)
T ss_pred             cc
Confidence            65


No 16 
>PF04511 DER1:  Der1-like family;  InterPro: IPR007599 The endoplasmic reticulum (ER) of the yeast Saccharomyces cerevisiae (Baker's yeast) contains a proteolytic system able to selectively degrade misfolded lumenal secretory proteins. For examination of the components involved in this degradation process, mutants were isolated. They could be divided into four complementation groups. The mutations led to stabilisation of two different substrates for this process, and the classes were called der for degradation in the ER. DER1 was cloned by complementation of the der1-2 mutation. The DER1 gene codes for a novel, hydrophobic protein that is localized to the ER. Deletion of DER1 abolished degradation of the substrate proteins, suggesting that the function of the Der1 protein may be specifically required for the degradation process associated with the ER []. Interestingly this family seems distantly related to the Rhomboid family of membrane peptidases. This family may also mediate degradation of misfolded proteins.
Probab=98.23  E-value=1.7e-05  Score=73.43  Aligned_cols=137  Identities=17%  Similarity=0.150  Sum_probs=93.7

Q ss_pred             CccccceeecccccCCh-hHHHHHHHHHHHHHHHHHhh-hc-h-hHHHHHHHHHHHHHHHHHHHHHhhcCCCcceechHH
Q psy8269         216 QYLSVPMLLSTFSHQSP-LHIFANMFVLHSFMPSSVED-LG-K-EQFVGFYLTAGVVASLLSYVHKILVRKPGLSIGASG  291 (386)
Q Consensus       216 ~g~~wrllTs~FlH~~~-~HLl~Nm~~L~~fG~~lE~~-~G-~-~rfl~lyl~sgv~gsl~s~l~~~~~~~~~~~vGASG  291 (386)
                      ++|+||++|+.|.-++. .++++|++.++..+..+|+. ++ + ..++...+.+++.-.+.+.+.... .-+.+.+| ++
T Consensus        39 ~~q~WRl~Tsff~~g~~~~~~l~~~~~l~~~s~~LE~~~f~~~~ady~~~ll~~~~~i~~~~~~~~~~-~~~~~~l~-~~  116 (197)
T PF04511_consen   39 KFQIWRLFTSFFYFGPFSLNFLFNLYFLYQYSSSLEEGHFQGRSADYLWFLLFGASLILILSLLIGPY-FFNIPFLG-SS  116 (197)
T ss_pred             hcCceeeEEEEEEEcCCCHHHHHHHHHHHHHhhHhccCCCCCCHHHHHHHHHHHHHHHHHHHHhhccc-hhHHHHHH-HH
Confidence            57999999999975444 68999999999999999998 32 2 456666665555555555443210 00112232 44


Q ss_pred             HHHHHHHHHHHHcCCCcceeEechhhhHHHHHHHHHHHHHHHHHhhhcCchhHHHHHHHHHHHHHHHHHH
Q psy8269         292 AIMAVLAHTCITHPDTELGILFVPYVRFSAEHAIQGIMLLDFLGVLFRWRLFDHAAHLGGALFGILYSKY  361 (386)
Q Consensus       292 aI~Gll~a~~~~~p~~~~~l~fip~~~~~a~~~~~~~l~~~~~~~~~~~~~i~~~aHLgG~L~G~l~~~~  361 (386)
                      .+..++-.++-..|+.++.++++  ++++++++..++++++++..     +-+...++-|+++|-++-++
T Consensus       117 l~~~l~Y~wsr~np~~~v~~~g~--~~i~a~ylP~~~~~~~~l~~-----~~~~~~~l~Gi~~Ghly~fl  179 (197)
T PF04511_consen  117 LSFALTYIWSRKNPNAQVSFFGL--FTIKAKYLPWVLLAFSLLFG-----GSSPIPDLLGILVGHLYYFL  179 (197)
T ss_pred             HHHHHHHHHHHhCcccceeeEEE--EEEChhhHHHHHHHHHHHhC-----CCcHHHHHHHHHHHHHHHHH
Confidence            56666666777789999887764  35577777777777776531     22455899999999998865


No 17 
>KOG2890|consensus
Probab=98.06  E-value=1.3e-05  Score=77.83  Aligned_cols=154  Identities=18%  Similarity=0.187  Sum_probs=105.2

Q ss_pred             cCCCC--ccccceeecccccCChhHHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHHHHHHHHH----HHhhcCCC--
Q psy8269         212 LNPDQ--YLSVPMLLSTFSHQSPLHIFANMFVLHSFMPSSVEDLGKEQFVGFYLTAGVVASLLSYV----HKILVRKP--  283 (386)
Q Consensus       212 l~p~~--g~~wrllTs~FlH~~~~HLl~Nm~~L~~fG~~lE~~~G~~rfl~lyl~sgv~gsl~s~l----~~~~~~~~--  283 (386)
                      +.|.+  ..+|+++|+.|+-.+++-.+.|...|.+-|..+|+.||+..++.+|.+.-...+++..+    .++++...  
T Consensus        58 lvp~~~~~~~WtliTs~fie~~vw~V~~sv~~L~v~G~~lEp~Wg~~e~lkff~ivn~~~~l~v~v~~~l~Y~it~n~v~  137 (326)
T KOG2890|consen   58 LVPNALFFFPWTLITSGFIELNVWDVLVSVLTLSVGGKFLEPNWGSLELLKFFAIVNGSTTLVVLVPALLLYMITDNHVY  137 (326)
T ss_pred             cCCcchhhhhHHHHhcchhhhhHHHHHHHHHheeecceeeccCCCCHHHHHHHHHhhchhHHHHHHHHHHHHHHhcCceE
Confidence            34443  36999999999999999999999999999999999999999999887765555544332    23333322  


Q ss_pred             --cceechHHHHHHHHHHHHHHcCCCcceeEechhhhHHHHHHHHHHHHHHHHHhhhcCchhHHHHHHHHHHHHHHHHHH
Q psy8269         284 --GLSIGASGAIMAVLAHTCITHPDTELGILFVPYVRFSAEHAIQGIMLLDFLGVLFRWRLFDHAAHLGGALFGILYSKY  361 (386)
Q Consensus       284 --~~~vGASGaI~Gll~a~~~~~p~~~~~l~fip~~~~~a~~~~~~~l~~~~~~~~~~~~~i~~~aHLgG~L~G~l~~~~  361 (386)
                        .+..|+.|...|++.++-..+|+..+...  +.-++..+.+....+.+.++..+   -...+.+.+.-+.+|.+.+|.
T Consensus       138 L~~~i~G~~gilaGilVa~kQllpd~~il~~--~~~r~~~~~lP~~~l~~~~il~i---~~f~~f~~l~s~~~g~~~sWt  212 (326)
T KOG2890|consen  138 LYIPIHGTTGILAGILVAWKQLLPDTIILEL--KSGRFLYAHLPLLVLFLSLILSI---ITFLVFASLPSITFGVLVSWT  212 (326)
T ss_pred             EEEEeccchHHHHHHHHHHHHHcCceeEEec--cchhhhhhhCCHHHHHHHHHHHH---HHHHHhhhhHHHHHhhhhhhh
Confidence              35889999999999999999999764332  22223322222222222111111   134566777777888888887


Q ss_pred             hhhcccccc
Q psy8269         362 GEQTWAHRA  370 (386)
Q Consensus       362 ~~~~w~~r~  370 (386)
                      +.|..++++
T Consensus       213 YLRfyq~h~  221 (326)
T KOG2890|consen  213 YLRFYQRHP  221 (326)
T ss_pred             hheecccCC
Confidence            766555554


No 18 
>KOG0858|consensus
Probab=97.82  E-value=7.8e-05  Score=70.41  Aligned_cols=132  Identities=14%  Similarity=0.146  Sum_probs=91.4

Q ss_pred             CCccccceeecccccCC-hhHHHHHHHHHHHHHHHHHhhhc---hhHHHHHHHHHHHHHHHHHHHHHhhcCCCcceechH
Q psy8269         215 DQYLSVPMLLSTFSHQS-PLHIFANMFVLHSFMPSSVEDLG---KEQFVGFYLTAGVVASLLSYVHKILVRKPGLSIGAS  290 (386)
Q Consensus       215 ~~g~~wrllTs~FlH~~-~~HLl~Nm~~L~~fG~~lE~~~G---~~rfl~lyl~sgv~gsl~s~l~~~~~~~~~~~vGAS  290 (386)
                      .+.|+||++|+.+.-.. -+|.++||++++--++.+|+-.=   +..|+.+++.++++-.+.++...      ...+| +
T Consensus        49 kk~QiWRliTs~lyfg~~gf~fl~n~~FlyrY~~~LE~g~f~~rtadf~~mllf~~~l~~~~~~~~~------~~fLg-~  121 (239)
T KOG0858|consen   49 KKFQIWRLITSFLYFGPFGFDFLMNLYFLYRYSSMLEEGSFRGRTADFLYMLLFGAVLLTLTGLFVY------IVFLG-Q  121 (239)
T ss_pred             hHhHHHHhhhhhheeccccHHHHHHHHHHHHHHHHHhcCCCCCchhHHHHHHHHHHHHHHHHHHHHH------HHHHH-H
Confidence            48899999999987765 79999999999999999999742   25677777777777776555433      12233 2


Q ss_pred             HHHHHHHHHHHHHcCCCcceeEechhhhHHHHHHHHHHHHHHHHHhhhcCchhHHHHHHHHHHHHHHHHHH
Q psy8269         291 GAIMAVLAHTCITHPDTELGILFVPYVRFSAEHAIQGIMLLDFLGVLFRWRLFDHAAHLGGALFGILYSKY  361 (386)
Q Consensus       291 GaI~Gll~a~~~~~p~~~~~l~fip~~~~~a~~~~~~~l~~~~~~~~~~~~~i~~~aHLgG~L~G~l~~~~  361 (386)
                      +.++.+.=..+...|+..+..++  .+++++.++...+++++.+.   +  +.+ ..-+-|++.|-+|.++
T Consensus       122 ~l~~~l~YvWs~~Np~~~v~F~g--~~~f~a~YlPwvll~fs~l~---g--~~~-~~dllGi~~GHiy~fl  184 (239)
T KOG0858|consen  122 SLVFMLVYVWSKRNPDVIVSFFG--LITFKAPYLPWVLLGFSFLF---G--GSI-LVDLLGIIVGHIYYFL  184 (239)
T ss_pred             HHHHHHHHHHHhhCCCceEEEEE--EecCccccchHHHHHHHHHh---C--Cch-HHHHHhhhhheeEEEE
Confidence            33444444445556776666554  35778888887777777653   1  223 6778888999776653


No 19 
>COG2036 HHT1 Histones H3 and H4 [Chromatin structure and dynamics]
Probab=96.17  E-value=0.0092  Score=48.74  Aligned_cols=74  Identities=23%  Similarity=0.312  Sum_probs=65.4

Q ss_pred             hhhhhhhcccCchHHHHHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHHHH
Q psy8269         105 LSMLDLKTQALPLARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAITK  179 (386)
Q Consensus       105 ~~~~~~~~~~~p~~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI~~  179 (386)
                      ++.+......+|.+.+++||+..-.- .++.||...+.++.|.|..+++..+-.|+.-.+|+|++..||..++-.
T Consensus        10 r~~~~~~~~~Lp~apv~Ri~r~~~~~-Rvs~~A~~~l~~~~e~~~~~i~~~A~~~A~ha~RKTV~~~DI~la~~~   83 (91)
T COG2036          10 RRYQRSTDLLLPKAPVRRILRKAGAE-RVSSSAIEELQEALEEYLEEIAEDAVELAEHAKRKTVKAEDIKLALKR   83 (91)
T ss_pred             HhhhhhhhhhcCchHHHHHHHHHhHH-HhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCeecHHHHHHHHHH
Confidence            45566677789999999999985433 999999999999999999999999999999999999999999887744


No 20 
>cd00074 H2A Histone 2A; H2A is a subunit of the nucleosome. The nucleosome is an octamer containing two H2A, H2B, H3, and H4 subunits. The H2A subunit performs essential roles in maintaining structural integrity of the nucleosome, chromatin condensation, and binding of specific chromatin-associated proteins.
Probab=95.78  E-value=0.036  Score=47.23  Aligned_cols=74  Identities=20%  Similarity=0.349  Sum_probs=67.1

Q ss_pred             CchHHHHHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHHHHHHHHHHHHh
Q psy8269         115 LPLARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAITKYDMFDFLID  188 (386)
Q Consensus       115 ~p~~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI~~i~i~~FL~~  188 (386)
                      .|++||+++|+.+..-.-|+.+|+......+|-+..|+...+...+...+++.|++.||..+|-...-++.|..
T Consensus        21 fPV~ri~R~Lk~~~~a~RVs~~A~VyLaAvLEYL~aEIlelA~n~ak~~k~krItp~hi~lAi~nD~EL~~L~~   94 (115)
T cd00074          21 FPVGRIHRYLKKGRYAERVGAGAPVYLAAVLEYLTAEVLELAGNAARDNKKKRITPRHLQLAVRNDEELNKLLK   94 (115)
T ss_pred             CcHHHHHHHHHcCccccccccchHHHHHHHHHHHHHHHHHHHHHHHHHcCCCeEcHHHHHHHHhccHHHHHHHc
Confidence            79999999999877778999999999999999999999999999999999999999999999977666665553


No 21 
>PF00125 Histone:  Core histone H2A/H2B/H3/H4 histone h2a signature histone h2b signature histone h3 signature histone h4 signature;  InterPro: IPR007125 The core histones together with some other DNA binding proteins appear to form a superfamily defined by a common fold and distant sequence similarities [, ]. Some proteins contain local homology domains related to the histone fold [].; GO: 0003677 DNA binding; PDB: 2YFW_D 2YFV_B 1U35_H 2F8N_D 2PYO_D 2NQB_D 3AN2_C 3AZJ_C 3AV1_G 3AZM_G ....
Probab=95.46  E-value=0.037  Score=42.84  Aligned_cols=64  Identities=25%  Similarity=0.314  Sum_probs=58.0

Q ss_pred             cCchHHHHHhhhcCcccc-ccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHH
Q psy8269         114 ALPLARIKKIMKLDDGVK-MISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAI  177 (386)
Q Consensus       114 ~~p~~rik~im~~~~~v~-mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI  177 (386)
                      ..|..|+.|-+..+-+-+ .|+.++......++|-+..++...+..++...||.+++..||..++
T Consensus         8 ~~~~~r~~r~i~~~~~~~~ris~~a~~~L~~~~E~~~~~il~~A~~~a~~~kR~tI~~~DI~~A~   72 (75)
T PF00125_consen    8 KFPFSRLLREIGEEILSKYRISSEALVALQSVLEYLLVEILEEAGNLARHAKRKTITPRDIQLAV   72 (75)
T ss_dssp             HHHHHHHHHHHHHTTSSSSEECHHHHHHHHHHHHHHHHHHHHHHHHHHHHTTBSEEGHHHHHHHH
T ss_pred             ceEEeeeeehhhcccccccccccccchhhhhhhhhhhhhhhhHHHHHHhhcCCcEecHHHHHHHH
Confidence            467788888888877775 9999999999999999999999999999999999999999998876


No 22 
>KOG0869|consensus
Probab=95.21  E-value=0.052  Score=48.24  Aligned_cols=72  Identities=25%  Similarity=0.306  Sum_probs=63.1

Q ss_pred             ccCchHHHHHhhhcCc-cccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHHHHHHHHH
Q psy8269         113 QALPLARIKKIMKLDD-GVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAITKYDMFD  184 (386)
Q Consensus       113 ~~~p~~rik~im~~~~-~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI~~i~i~~  184 (386)
                      +-+|+++|-||||..= .-.+|+.||.-.+-.-+--||..+|-.|-.-+++.||+||+-.||--|+..+..-+
T Consensus        31 r~LPIANV~RIMK~~lP~naKIsKDAKE~vQECVSEfISFvT~EAsekC~~EkRKTIngdDllwAm~tLGFe~  103 (168)
T KOG0869|consen   31 RFLPIANVSRIMKKALPANAKISKDAKETVQECVSEFISFVTGEASEKCQREKRKTINGDDLLWAMSTLGFEN  103 (168)
T ss_pred             hhccHHHHHHHHHhcCCcccccchHHHHHHHHHHHHHHHHHhhHHHHHHHHHhcCcccHHHHHHHHHHcCcHh
Confidence            4599999999999874 55599999999999999999999999999999999999999999988776654433


No 23 
>smart00803 TAF TATA box binding protein associated factor. TAFs (TATA box binding protein associated factors) are part of the transcription initiation factor TFIID multimeric protein complex. TFIID is composed of the TATA box binding protein (TBP) and a number of TAFs. The TAFs provide binding sites for many different transcriptional activators and co-activators that modulate transcription initiation by Pol II. TAF proteins adopt a histone-like fold.
Probab=95.19  E-value=0.058  Score=41.27  Aligned_cols=63  Identities=32%  Similarity=0.385  Sum_probs=57.7

Q ss_pred             cCchHHHHHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHH
Q psy8269         114 ALPLARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAI  177 (386)
Q Consensus       114 ~~p~~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI  177 (386)
                      .+|.+-||++.+.. |+.-++.|+-.......|-++.++...+-.+.+-.+|++++..||..++
T Consensus         2 ~~p~~~i~ria~~~-Gi~ris~~a~~~l~~~~e~rl~~i~~~A~k~~~hakRktlt~~DI~~Al   64 (65)
T smart00803        2 WLPKETIKDVAESL-GIGNLSDEAAKLLAEDVEYRIKEIVQEALKFMRHSKRTTLTTSDIDSAL   64 (65)
T ss_pred             CCCHHHHHHHHHHC-CCccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCeecHHHHHHHh
Confidence            37889999999865 7888999999999999999999999999999999999999999998764


No 24 
>cd00076 H4 Histone H4, one of the four histones, along with H2A, H2B and H3, which forms the eukaryotic nucleosome core; along with H3, it plays a central role in nucleosome formation; histones bind to DNA and wrap the genetic material into "beads on a string" in which DNA (the string) is wrapped around small blobs of histones (the beads) at regular intervals; play a role in the inheritance of specialized chromosome structures and the control of gene activity; defects in the establishment of proper chromosome structure by histones may activate or silence genes aberrantly and thus lead to disease;  the sequence of histone H4 has remained almost invariant in more than 2 billion years of evolution
Probab=95.17  E-value=0.067  Score=43.19  Aligned_cols=67  Identities=21%  Similarity=0.300  Sum_probs=61.1

Q ss_pred             hcccCchHHHHHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHHH
Q psy8269         111 KTQALPLARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAIT  178 (386)
Q Consensus       111 ~~~~~p~~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI~  178 (386)
                      ..+.+|.+-|+++.+.+ |+.-||.|+-.-.....+.|..++...+-.+++-.+|+|++..||..++-
T Consensus        10 ~~~gi~k~~I~RLarr~-GvkRIS~d~y~e~~~~l~~~l~~I~~dav~ya~Ha~RKTVt~~DV~~alk   76 (85)
T cd00076          10 NIKGITKPAIRRLARRG-GVKRISGGVYDEVRNVLKSYLEDVIRDAVTYTEHAKRKTVTAMDVVYALK   76 (85)
T ss_pred             hhccCCHHHHHHHHHHc-CcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcCcHHHHHHHHH
Confidence            34569999999999976 69999999999999999999999999999999999999999999988774


No 25 
>PLN00035 histone H4; Provisional
Probab=94.81  E-value=0.093  Score=43.82  Aligned_cols=66  Identities=23%  Similarity=0.315  Sum_probs=60.9

Q ss_pred             ccCchHHHHHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHHHH
Q psy8269         113 QALPLARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAITK  179 (386)
Q Consensus       113 ~~~p~~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI~~  179 (386)
                      +.+|.+-|+++.+.. |+.-||.|+-..+..+.|.|..++...+-.+++-.+|+|++..||..++-.
T Consensus        28 ~~ipk~~IrRLARr~-GvkRIS~~ay~elr~vle~~l~~I~~dav~ya~HA~RKTV~~~DV~~Alkr   93 (103)
T PLN00035         28 QGITKPAIRRLARRG-GVKRISGLIYEETRGVLKIFLENVIRDAVTYTEHARRKTVTAMDVVYALKR   93 (103)
T ss_pred             ccCCHHHHHHHHHHc-CcccchHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcCcHHHHHHHHHH
Confidence            449999999999976 699999999999999999999999999999999999999999999888744


No 26 
>PTZ00015 histone H4; Provisional
Probab=94.24  E-value=0.16  Score=42.45  Aligned_cols=68  Identities=19%  Similarity=0.263  Sum_probs=61.7

Q ss_pred             hhcccCchHHHHHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHHH
Q psy8269         110 LKTQALPLARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAIT  178 (386)
Q Consensus       110 ~~~~~~p~~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI~  178 (386)
                      .+.+.+|.+-|+++.+.+ |+.-||.|+-.-.....|.|..++...+-.+++-.+|+|++..||..++-
T Consensus        26 ~~i~gI~k~~IrRLarr~-GvkRIS~d~y~e~r~vle~~l~~I~rdav~~aeHA~RKTVt~~DV~~AlK   93 (102)
T PTZ00015         26 DNIRGITKGAIRRLARRG-GVKRISGDIYEEVRGVLKAFLENVVRDSTAYTEYARRKTVTAMDVVYALK   93 (102)
T ss_pred             hcccCCCHHHHHHHHHHc-CCccchHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccHHHHHHHHH
Confidence            344669999999999964 99999999999999999999999999999999999999999999988763


No 27 
>COG5262 HTA1 Histone H2A [Chromatin structure and dynamics]
Probab=93.80  E-value=0.21  Score=42.35  Aligned_cols=78  Identities=22%  Similarity=0.328  Sum_probs=69.1

Q ss_pred             hcccCchHHHHHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHHHHHHHHHHHHh
Q psy8269         111 KTQALPLARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAITKYDMFDFLID  188 (386)
Q Consensus       111 ~~~~~p~~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI~~i~i~~FL~~  188 (386)
                      .--..|++||||+||.+..-+-|+.++++..+.-+|-.+-|++.-+=..+...+...+++.++..+|-.-+=.+||+-
T Consensus        23 agl~fpvgrvkr~lk~~~~~~Rig~~A~Vyl~AvleYL~aEilelAgNaA~d~kkkri~PrHlqlAIrnD~EL~~l~~  100 (132)
T COG5262          23 AGLIFPVGRVKRLLKKGNYRMRIGAGAPVYLAAVLEYLAAEILELAGNAARDNKKKRIIPRHLQLAIRNDEELNKLLG  100 (132)
T ss_pred             cCccccHHHHHHHHHcCccceeecCCcHHHHHHHHHHHHHHHHHHhhhhhhhcCcceechHHHHHHhcCcHHHHHHhh
Confidence            344589999999999887888899999999999989889999999999999999999999999999977777777764


No 28 
>COG5291 Predicted membrane protein [Function unknown]
Probab=93.46  E-value=0.27  Score=47.05  Aligned_cols=45  Identities=13%  Similarity=0.174  Sum_probs=35.6

Q ss_pred             cCCCCccccceeecccccC-ChhHHHHHHHHHHHHHHHHHhh-hchh
Q psy8269         212 LNPDQYLSVPMLLSTFSHQ-SPLHIFANMFVLHSFMPSSVED-LGKE  256 (386)
Q Consensus       212 l~p~~g~~wrllTs~FlH~-~~~HLl~Nm~~L~~fG~~lE~~-~G~~  256 (386)
                      +...+-|+||++||...-+ --+..++|++.+|--.+.+|+- +|+.
T Consensus        53 L~~k~~qiwRlfTs~~~~~~~~~d~~M~vyf~Y~yS~~LE~g~f~~~   99 (313)
T COG5291          53 LFLKRLQIWRLFTSFLYFGKPTLDMFMHVYFLYRYSRMLEEGCFNTS   99 (313)
T ss_pred             hHHHHHHHHHHHHHHHhhcCcchhHHHHHHHHHHHHHHHhccccCcc
Confidence            3445679999999876554 5678999999999999999986 4444


No 29 
>smart00417 H4 Histone H4.
Probab=93.22  E-value=0.24  Score=38.98  Aligned_cols=63  Identities=22%  Similarity=0.271  Sum_probs=57.4

Q ss_pred             hcccCchHHHHHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHH
Q psy8269         111 KTQALPLARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIA  174 (386)
Q Consensus       111 ~~~~~p~~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa  174 (386)
                      ..+.+|.+-|+++.+.+ |+.-||.|+-.-.....|.|..++...+-.+++-.+|+|++..||.
T Consensus        10 ~i~gI~k~~IrRLaRr~-GvkRIS~~~y~elr~vle~~l~~I~rdav~~a~ha~RKTV~~~DV~   72 (74)
T smart00417       10 NIQGITKPAIRRLARRG-GVKRISGLIYDETRNVLKSFLENVVRDAVTYTEHARRKTVTAMDVV   72 (74)
T ss_pred             hhcCCCHHHHHHHHHHc-CcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCcccHHHhe
Confidence            34569999999999965 9999999999999999999999999999999999999999988874


No 30 
>cd07981 TAF12 TATA Binding Protein (TBP) Associated Factor 12 (TAF12) is one of several TAFs that bind TBP and is involved in forming Transcription Factor IID (TFIID) complex. The TATA Binding Protein (TBP) Associated Factor 12 (TAF12) is one of several TAFs that bind TBP and are involved in forming the TFIID complex. TFIID is one of the seven General Transcription Factors (GTFs) (TFIIA, TFIIB, TFIID, TFIIE, TFIIF, and TFIID) that are involved in accurate initiation of transcription by RNA polymerase II in eukaryotes. TFIID plays an important role in the recognition of promoter DNA and assembly of the pre-initiation complex. TFIID complex is composed of the TBP and at least 13 TAFs. TAFs are named after their electrophoretic mobility in polyacrylamide gels in different species. A new, unified nomenclature has been suggested for the pol II TAFs to show the relationship between TAF orthologs and paralogs. Several hypotheses are proposed for TAFs function such as serving as activator-bind
Probab=92.00  E-value=0.53  Score=36.60  Aligned_cols=64  Identities=16%  Similarity=0.286  Sum_probs=54.8

Q ss_pred             chHHHHHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHHHH
Q psy8269         116 PLARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAITK  179 (386)
Q Consensus       116 p~~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI~~  179 (386)
                      +-.++.+.++.-+.-.-+++|+..++....|-|+..++..+-.++.-++|.++...||..++-.
T Consensus         3 ~k~~l~~lv~~id~~~~~~~da~~~l~~~~e~fv~~v~~~a~~lAkHr~~~tv~~~Di~l~l~r   66 (72)
T cd07981           3 TKRKLQELLKEIDPREQLDPDVEELLLEIADDFVDDVVEDACRLAKHRKSDTLEVKDVQLHLER   66 (72)
T ss_pred             cHHHHHHHHHhhCCCCCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCHHHHHHHHHH
Confidence            3455666666665557789999999999999999999999999999999999999999887744


No 31 
>smart00414 H2A Histone 2A.
Probab=89.96  E-value=0.96  Score=38.04  Aligned_cols=73  Identities=23%  Similarity=0.388  Sum_probs=64.1

Q ss_pred             CchHHHHHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHHHHHHHHHHHH
Q psy8269         115 LPLARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAITKYDMFDFLI  187 (386)
Q Consensus       115 ~p~~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI~~i~i~~FL~  187 (386)
                      .|++||++.|+....-.-|+..+++......|=...|+-..+...+..++++-|++.++..+|-...-++-|.
T Consensus        10 fPVgRi~r~Lk~~~~~~Rv~~~A~VyLaAvLEYLtaEILeLagn~a~~~k~~rItp~hi~lAi~nD~EL~~L~   82 (106)
T smart00414       10 FPVGRIHRLLRKGTYAKRVGAGAPVYLAAVLEYLTAEVLELAGNAARDNKKRRITPRHLQLAIRNDEELNKLL   82 (106)
T ss_pred             CchHHHHHHHHcCccccccccccHHHHHHHHHHHHHHHHHHHHHHHHhcCCCccchHHHhhhccCCHHHHHHH
Confidence            7999999999988777799999999999999999999999999999999999999999999996654444333


No 32 
>cd08048 TAF11 TATA Binding Protein (TBP) Associated Factor 11 (TAF11) is one of several TAFs that bind TBP and is involved in forming Transcription Factor IID (TFIID) complex. The TATA Binding Protein (TBP) Associated Factor 11 (TAF11) is one of several TAFs that bind TBP and are involved in forming the Transcription Factor IID (TFIID) complex. TFIID is one of seven General Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE, TFIIF, and TFIID) that are involved in accurate initiation of transcription by RNA polymerase II in eukaryotes. TFIID plays an important role in the recognition of promoter DNA and assembly of the pre-initiation complex. TFIID complex is composed of the TBP and at least 13 TAFs. TAFs from various species were originally named by their predicted molecular weight or their electrophoretic mobility in polyacrylamide gels. A new, unified nomenclature for the pol II TAFs has been suggested to show the relationship between TAF orthologs and paralogs. Several hypothes
Probab=89.90  E-value=1.2  Score=35.95  Aligned_cols=71  Identities=20%  Similarity=0.259  Sum_probs=60.3

Q ss_pred             hhhhhhcccCchHHHHHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccc---cccchhhHHHHH
Q psy8269         106 SMLDLKTQALPLARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKR---RTLQRNDIAMAI  177 (386)
Q Consensus       106 ~~~~~~~~~~p~~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr---~~iq~~DIa~aI  177 (386)
                      +=+++....+|-+.+||+|+.--+ +.+++++.++++-..-.|+.+++..|-.-.++...   ..+++++|..|.
T Consensus         8 Rye~~Rra~f~k~~iKr~~~~~~~-~~v~~~v~i~v~glaKvFVGeivE~A~~V~~~~~~~~~~Pl~P~HireA~   81 (85)
T cd08048           8 RYEMFRRSSFPKAAIKRLIQSVTG-QSVSQNVVIAVAGIAKVFVGEIVEEARDVQEEWGEANTGPLQPRHLREAY   81 (85)
T ss_pred             HHHHHHHhhccHHHHHHHHHHHcC-CCCCchHHHHHHHHHHHHHHHHHHHHHHHHHHhccccCCCCCcHHHHHHH
Confidence            445677778999999999998888 78899999999999999999999999888777666   677788877664


No 33 
>KOG4463|consensus
Probab=89.57  E-value=0.24  Score=47.77  Aligned_cols=60  Identities=20%  Similarity=0.351  Sum_probs=52.2

Q ss_pred             ccccceeecccccCChhHHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHHHHHHHHHHH
Q psy8269         217 YLSVPMLLSTFSHQSPLHIFANMFVLHSFMPSSVEDLGKEQFVGFYLTAGVVASLLSYVHK  277 (386)
Q Consensus       217 g~~wrllTs~FlH~~~~HLl~Nm~~L~~fG~~lE~~~G~~rfl~lyl~sgv~gsl~s~l~~  277 (386)
                      +|+||++.+.|.-.+-..+.+-++.+|.| +.+|+.+|+.+|..+.+.++..+-++...+.
T Consensus        49 ~qywrlL~~qF~~~n~~e~~~~l~I~Y~f-R~~ERlLGShky~~fiv~s~~~~~l~~~il~  108 (323)
T KOG4463|consen   49 FQYWRLLMSQFAFSNTPELMFGLYILYYF-RVFERLLGSHKYSVFIVFSGTVSLLLEVILL  108 (323)
T ss_pred             HHHHHHHHHHHHhcCChHHHHHHHHHHHH-HHHHHHhccccceeehhHHHHHHHHHHHHHH
Confidence            58999999999999988888888877766 6699999999999999999999888776654


No 34 
>KOG0870|consensus
Probab=87.42  E-value=1.9  Score=38.86  Aligned_cols=76  Identities=28%  Similarity=0.340  Sum_probs=65.0

Q ss_pred             cCchHHHHHhhhcCc--cccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHHHHHHHHHHHHhh
Q psy8269         114 ALPLARIKKIMKLDD--GVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAITKYDMFDFLIDI  189 (386)
Q Consensus       114 ~~p~~rik~im~~~~--~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI~~i~i~~FL~~i  189 (386)
                      .+|.+-|.++.|-..  +---|+.||...++.+.-.|+..+|..|=.-+...+|++++..||..++--++--.|+..+
T Consensus        10 ~lP~AiI~rlvke~l~E~~vsisKeA~~Ai~raAtVFv~~Lts~s~e~A~~q~rKt~sadDVl~aL~Eiefs~f~~pl   87 (172)
T KOG0870|consen   10 NLPNAIITRLVKEVLPESNVSISKEARLAIARAATVFVIFLTSVSNEIAKDQKRKTISADDVLKALDEIEFSSFVNPL   87 (172)
T ss_pred             hccHHHHHHHHHHhCccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCcccHHHHHHHHHHhchHHHhhHH
Confidence            489999998887554  3347899999999999999999999999999999999999999999999777766665533


No 35 
>PTZ00017 histone H2A; Provisional
Probab=86.50  E-value=1.9  Score=37.66  Aligned_cols=73  Identities=23%  Similarity=0.369  Sum_probs=64.7

Q ss_pred             CchHHHHHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHHHHHHHHHHHH
Q psy8269         115 LPLARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAITKYDMFDFLI  187 (386)
Q Consensus       115 ~p~~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI~~i~i~~FL~  187 (386)
                      .|++||.+.|+...--.-|+..+++......|-..-|+...+...+..+++.-|++.++..+|-..+-++-|+
T Consensus        28 FPVgRi~R~Lk~g~~a~RV~a~A~VYLAAVLEYLtaEILELAgNaa~d~kk~RItPrHi~lAI~nDeEL~~Ll  100 (134)
T PTZ00017         28 FPVGRVHRYLKKGRYAKRVGAGAPVYLAAVLEYLTAEVLELAGNAAKDNKKKRITPRHIQLAIRNDEELNKLL  100 (134)
T ss_pred             cchHHHHHHHhccchhccccccchhhhHHHHHHHHHHHHHHHHHHHHhcCCCeecHHHHHhhccCcHHHHHHH
Confidence            7999999999887666689999999999999999999999999999999999999999999996655555444


No 36 
>PF09415 CENP-X:  CENP-S associating Centromere protein X;  InterPro: IPR018552 Centromere protein X (CENP-X) is a component of the CENP-S complex. The CENP-S complex is composed of at least of CENP-S and CENP-X and is essential for the stable assembly of the outer kinetchore [].  CENP-X is also a DNA-binding component of the Fanconi anemia (FA) core complex involved in DNA damage repair and genome maintenance. The FA complex is composed of CENPS, FANCA, FANCB, FANCC, FANCE, FANCF, FANCG, FANCL/PHF9, FANCM, FAAP24 and CENPX. Interacts with CENPS, FANCM and FAAP24 [, ].; PDB: 4DRB_L 4DRA_H 3V9R_D.
Probab=85.87  E-value=2.1  Score=33.40  Aligned_cols=60  Identities=22%  Similarity=0.274  Sum_probs=44.6

Q ss_pred             chHHHHHhhhc--CccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccc-cchhhHHH
Q psy8269         116 PLARIKKIMKL--DDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRT-LQRNDIAM  175 (386)
Q Consensus       116 p~~rik~im~~--~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~-iq~~DIa~  175 (386)
                      |..-|.|+++.  .+....|++|+..+..+..+.|+.|-..||....+..+... +.-.|+..
T Consensus         1 p~~li~rll~~~f~~~~tkIs~dal~l~~eyl~iFV~EAv~Ra~~~a~~e~~~~~le~e~LEk   63 (72)
T PF09415_consen    1 PPELIARLLHEHFKDDKTKISKDALKLSAEYLRIFVREAVARAAEQAEAEGDEGFLEVEHLEK   63 (72)
T ss_dssp             -CHHHHHHHCTTSSSTT-EE-CCCHHHHHHHHHHHHHHHHHHHHHHHHHTT-SSEE-HHHHHH
T ss_pred             ChHHHHHHHHHHhcCCCCCcCHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCCHHHHHH
Confidence            55667888885  34566999999999999999999999999999777777666 66555544


No 37 
>KOG1756|consensus
Probab=84.73  E-value=3.6  Score=35.59  Aligned_cols=77  Identities=19%  Similarity=0.327  Sum_probs=68.3

Q ss_pred             ccCchHHHHHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHHHHHHHHHHHHhh
Q psy8269         113 QALPLARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAITKYDMFDFLIDI  189 (386)
Q Consensus       113 ~~~p~~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI~~i~i~~FL~~i  189 (386)
                      -.-|++||.++|+......-++.++|+..+--.|-..-+++..+=..+..+++.-+++.++..+|-.-+-..+|+.-
T Consensus        26 l~fPvgri~r~Lr~~~~~~ri~~gapV~laavLeYL~Aeile~agnaardnkk~ri~PrH~~lAI~NDeEL~~lL~~  102 (131)
T KOG1756|consen   26 LQFPVGRIHRLLRKGRYAQRVGAGAPVYLAAVLEYLTAEILELAGNAARDNKKTRITPRHLQLAIRNDEELNKLLGK  102 (131)
T ss_pred             cccCHHHHHHHHHccchhhhccCCChHHHHHHHHHHHHHHHHHhHHHhhhcCccccChHHHHHHHhCcHHHHHHhcc
Confidence            34799999999999888889999999999988887788888888889999999999999999999887778888764


No 38 
>PF13150 DUF3989:  Protein of unknown function (DUF3989)
Probab=83.69  E-value=0.66  Score=37.46  Aligned_cols=68  Identities=22%  Similarity=0.253  Sum_probs=42.6

Q ss_pred             hhhHHHH----HHHHhccCCCCCeeeehhhhHHHHHHHhhcCCccchHHHhhhccC-CCCcchhhcccCCCcccc
Q psy8269          18 IPKWRKD----LRKRWNALHPGDRVYVPILFLNGVVFLAWFYPRLHPVLYKYFASN-PQSMGITLRLHAVSPEFD   87 (386)
Q Consensus        18 ~~~~~~~----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~   87 (386)
                      +.++++.    |+..|..|+|.+|..+-++|+  +.|++-..=..-...++.=-.+ ...-.+||++...+++.|
T Consensus         4 ~~~~~~~~~~~Lr~~c~~Lsp~~R~~vvl~ml--~~fa~l~ly~~~~ai~~~Gk~~g~~~~IeHI~~l~l~~~~D   76 (85)
T PF13150_consen    4 IRKIKDRADDRLRRYCGRLSPKQRLRVVLVML--VLFAALCLYMTVSAIYDIGKEDGERMEIEHIRPLELPQPKD   76 (85)
T ss_pred             HHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHH--HHHHHHHHHHHHHHHHHhccCcccCccceeccccCCCCCCC
Confidence            3444444    999999999999999988887  7777766433333333332222 334456676666664433


No 39 
>PLN00154 histone H2A; Provisional
Probab=83.14  E-value=3.3  Score=36.29  Aligned_cols=74  Identities=22%  Similarity=0.253  Sum_probs=65.4

Q ss_pred             CchHHHHHhhhcCc-cccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHHHHHHHHHHHHh
Q psy8269         115 LPLARIKKIMKLDD-GVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAITKYDMFDFLID  188 (386)
Q Consensus       115 ~p~~rik~im~~~~-~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI~~i~i~~FL~~  188 (386)
                      .|++||++.|+..- .-.-|+..+|+.....+|=..-|+..-+...+..++++-|++.++..+|-..+-++.|..
T Consensus        39 FPVgRi~r~Lk~g~~~~~RVga~ApVYLAAVLEYLtAEVLELAGNaA~d~kk~RItPrHi~lAIrnDeEL~~Ll~  113 (136)
T PLN00154         39 FPVGRIHRQLKQRVSAHGRVGATAAVYTAAILEYLTAEVLELAGNASKDLKVKRITPRHLQLAIRGDEELDTLIK  113 (136)
T ss_pred             CchHHHHHHHHhhhhhccccccchHHHHHHHHHHHHHHHHHHHHHHHHhhCCceecHHHhhhhccCcHHHHHHhc
Confidence            79999999999976 456999999999999999999999999999999999999999999999977666665554


No 40 
>KOG0871|consensus
Probab=81.23  E-value=5.7  Score=35.23  Aligned_cols=77  Identities=17%  Similarity=0.211  Sum_probs=63.4

Q ss_pred             hhcccCchHHHHHhhhcCcccc-ccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHHHHHHHHHHH
Q psy8269         110 LKTQALPLARIKKIMKLDDGVK-MISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAITKYDMFDFL  186 (386)
Q Consensus       110 ~~~~~~p~~rik~im~~~~~v~-mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI~~i~i~~FL  186 (386)
                      .+...+|-+.|-||++--=.+. ++++|+.-++-..|--||+.++..+=.-+...+++||-..+|-.++-....-+|+
T Consensus         8 dde~sLPkAtv~KmIke~lP~d~rvakeareliincCvEFI~liSsEAneic~~e~KKTIa~EHV~KALe~LgF~eYi   85 (156)
T KOG0871|consen    8 DDELSLPKATVNKMIKEMLPKDVRVAKEARELIINCCVEFINLISSEANEICNKEAKKTIAPEHVIKALENLGFGEYI   85 (156)
T ss_pred             cccccCcHHHHHHHHHHhCCcccccchHHHHHHHHHHHHHHHHHHHHHHHHHhHHhcccCCHHHHHHHHHHcchHHHH
Confidence            3455699999999998544434 6799999999999999999999999999999999999999998888554443443


No 41 
>COG5150 Class 2 transcription repressor NC2, beta subunit (Dr1) [Transcription]
Probab=79.61  E-value=5.1  Score=34.61  Aligned_cols=84  Identities=18%  Similarity=0.237  Sum_probs=65.5

Q ss_pred             hhhcccCchHHHHHhhhcCcccc-ccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHHHHH---HHHH
Q psy8269         109 DLKTQALPLARIKKIMKLDDGVK-MISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAITKY---DMFD  184 (386)
Q Consensus       109 ~~~~~~~p~~rik~im~~~~~v~-mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI~~i---~i~~  184 (386)
                      +.+...+|-+.+.|+.---=.+. |.+.|+.-++-.+|.-|++.++..+=..++..++++|...+|-.++-..   .-+.
T Consensus         6 ~dDe~sLPKATVqKMvS~iLp~dl~ftKearei~in~cieFi~~lsseAne~ce~EaKKTIa~EHviKALenLef~eyi~   85 (148)
T COG5150           6 NDDENSLPKATVQKMVSSILPKDLVFTKEAREIFINACIEFINMLSSEANEACEEEAKKTIAYEHVIKALENLEFEEYIE   85 (148)
T ss_pred             ccccccCcHHHHHHHHHHhccccccccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccccHHHHHHHHHhccHHHHHH
Confidence            34455699999998876544555 6699999999999999999999999999999999999999998887443   3333


Q ss_pred             HHHhhccc
Q psy8269         185 FLIDIVPR  192 (386)
Q Consensus       185 FL~~ivp~  192 (386)
                      ++.++...
T Consensus        86 ~~~e~~~n   93 (148)
T COG5150          86 SCMEEHEN   93 (148)
T ss_pred             HHHHHHHH
Confidence            44444443


No 42 
>smart00428 H3 Histone H3.
Probab=78.25  E-value=8.5  Score=32.34  Aligned_cols=81  Identities=23%  Similarity=0.267  Sum_probs=65.3

Q ss_pred             hhhHHhhhhhhhhhhcccCchHHH-HHhhhcCcc-cc-ccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhH
Q psy8269          97 SVLEVQGALSMLDLKTQALPLARI-KKIMKLDDG-VK-MISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDI  173 (386)
Q Consensus        97 ~~~~~~~~~~~~~~~~~~~p~~ri-k~im~~~~~-v~-mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DI  173 (386)
                      ..+|+...++..|.--+.+|-.|+ |+|.+.-.. -. -++.||.....-|.|.|.-.+...+..|+.-.||-|++..|+
T Consensus        15 aLrEIr~yQkst~lLI~k~pF~RLVREI~~~~~~~~~~R~~~~Al~aLQeasE~ylv~lfeda~~~a~HAkRvTl~~kDi   94 (105)
T smart00428       15 ALREIRKYQKSTDLLIRKAPFQRLVREIAQKFTTGVDLRFQSSAIMALQEAAEAYLVGLFEDTNLLAIHAKRVTIMPKDI   94 (105)
T ss_pred             HHHHHHHHccCcccccccccHHHHHHHHHHHcCCCCCceeeHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCccCcHhhH
Confidence            445665556777888888998885 677665432 11 679999999999999999999999999999999999999999


Q ss_pred             HHHH
Q psy8269         174 AMAI  177 (386)
Q Consensus       174 a~aI  177 (386)
                      ..+.
T Consensus        95 ~La~   98 (105)
T smart00428       95 QLAR   98 (105)
T ss_pred             HHHH
Confidence            6654


No 43 
>PLN00156 histone H2AX; Provisional
Probab=77.31  E-value=6.1  Score=34.77  Aligned_cols=72  Identities=24%  Similarity=0.338  Sum_probs=62.5

Q ss_pred             CchHHHHHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHHHHHHHHHHH
Q psy8269         115 LPLARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAITKYDMFDFL  186 (386)
Q Consensus       115 ~p~~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI~~i~i~~FL  186 (386)
                      .|++||.+.|+...--.=|+..+|+..+...|=..-|+...+...+..++++-|++.++..+|-..+=++-|
T Consensus        30 FPVgRi~R~Lk~g~ya~RVga~ApVYLAAVLEYLtaEVLELAgNaa~d~kk~RItPrHi~lAIrnDeEL~~L  101 (139)
T PLN00156         30 FPVGRIARFLKAGKYAERVGAGAPVYLSAVLEYLAAEVLELAGNAARDNKKNRIVPRHIQLAVRNDEELSKL  101 (139)
T ss_pred             cchHHHHHHHhcCChhhccCCccHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcCcHHHHHhhccCcHHHHHH
Confidence            899999999988765568899999999999999999999999999999999999999999999554444433


No 44 
>PLN00157 histone H2A; Provisional
Probab=77.04  E-value=6.8  Score=34.23  Aligned_cols=70  Identities=21%  Similarity=0.323  Sum_probs=61.9

Q ss_pred             CchHHHHHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHHHHHHHHH
Q psy8269         115 LPLARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAITKYDMFD  184 (386)
Q Consensus       115 ~p~~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI~~i~i~~  184 (386)
                      .|++||++.|+...--.-|+..+++..+...|=..-|+...+...+..++++-|++.++..+|-..+-++
T Consensus        27 FPVgRi~R~Lk~g~~a~RIg~~A~VYLAAVLEYLtaEVLELAgnaa~d~kk~RItPrHi~lAI~nDeEL~   96 (132)
T PLN00157         27 FPVGRIARYLKAGKYATRVGAGAPVYLAAVLEYLAAEVLELAGNAARDNKKSRIVPRHIQLAVRNDEELS   96 (132)
T ss_pred             cchHHHHHHHhcCchhhhcCCCcHhHHHHHHHHHHHHHHHHHHHHHHhcCCccccHHHHhhcccCcHHHH
Confidence            8999999999887656688999999999999999999999999999999999999999999995544344


No 45 
>PLN00153 histone H2A; Provisional
Probab=76.85  E-value=6  Score=34.43  Aligned_cols=70  Identities=24%  Similarity=0.334  Sum_probs=62.1

Q ss_pred             CchHHHHHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHHHHHHHHH
Q psy8269         115 LPLARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAITKYDMFD  184 (386)
Q Consensus       115 ~p~~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI~~i~i~~  184 (386)
                      .|++||.+.|+..---.=|+..+|+..+...|=..-|+...+...+..++++-|++.++..+|-..+=++
T Consensus        25 FpVgRi~R~Lr~g~~a~Rvga~A~VYLAAVLEYLtaEVLELAgnaa~d~kk~RItPrHi~lAI~nDeEL~   94 (129)
T PLN00153         25 FPVGRIARYLKKGKYAERIGAGAPVYLAAVLEYLTAEVLELAGNAARDNKKNRIVPRHIQLAIRNDEELG   94 (129)
T ss_pred             cchHHHHHHHhcCchhhccCCCchHHHHHHHHHHHHHHHHHHHHHHHhcCCCccChHHHHhhccCcHHHH
Confidence            7999999999887666688999999999999999999999999999999999999999999996544444


No 46 
>PF03419 Peptidase_U4:  Sporulation factor SpoIIGA  This family belongs to family U4 of the peptidase classification.;  InterPro: IPR005081 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.   The peptidases families associated with clan U- have an unknown catalytic mechanism as the protein fold of the active site domain and the active site residues have not been reported. This group of peptidases belong to the MEROPS peptidase family U4 (SpoIIGA peptidase family, clan U-).  Sporulation in bacteria such as Bacillus subtilis involves the formation of a polar septum, which divides the sporangium into a mother cell and a forespore. The sigma E factor, which is encoded within the spoIIG operon, is a cell-specific regulatory protein that directs gene transcription in the mother cell. Sigma E is synthesised as an inactive proprotein pro-sigma E, which is converted to the mature factor by the putative processing enzyme SpoIIGA []. ; GO: 0004190 aspartic-type endopeptidase activity, 0006508 proteolysis, 0030436 asexual sporulation
Probab=67.73  E-value=57  Score=31.87  Aligned_cols=40  Identities=10%  Similarity=0.175  Sum_probs=29.0

Q ss_pred             hHHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHHHHHHHHHH
Q psy8269         233 LHIFANMFVLHSFMPSSVEDLGKEQFVGFYLTAGVVASLLSYVH  276 (386)
Q Consensus       233 ~HLl~Nm~~L~~fG~~lE~~~G~~rfl~lyl~sgv~gsl~s~l~  276 (386)
                      ..+++|.+.|+..+..+......+|.+    +++.+|++.+++.
T Consensus        11 ~N~~md~~lL~~t~~~~~~~~~~~Rll----~~A~~Gal~~~~~   50 (293)
T PF03419_consen   11 VNFLMDYFLLWLTARLLKRRASRWRLL----LGAAIGALYSLLI   50 (293)
T ss_pred             HHHHHHHHHHHHHHHHhcCCCcHHHHH----HHHHHHHHHHHHH
Confidence            467889999999999988888888775    4555555544443


No 47 
>COG4769 Predicted membrane protein [Function unknown]
Probab=66.29  E-value=67  Score=29.33  Aligned_cols=119  Identities=21%  Similarity=0.171  Sum_probs=55.9

Q ss_pred             HHHHHHHHHHHHHHhhhchhHHHHHHHHHHHHHHHHHHHHHhhcCCCcceechHHHHHHHHHHHHHHcCCC-cceeEech
Q psy8269         237 ANMFVLHSFMPSSVEDLGKEQFVGFYLTAGVVASLLSYVHKILVRKPGLSIGASGAIMAVLAHTCITHPDT-ELGILFVP  315 (386)
Q Consensus       237 ~Nm~~L~~fG~~lE~~~G~~rfl~lyl~sgv~gsl~s~l~~~~~~~~~~~vGASGaI~Gll~a~~~~~p~~-~~~l~fip  315 (386)
                      .|..++..+     +.++....+.+-++-.+++++.+.-+.    .+.-..+++|++.+.++.+.+..-+- ...+..+.
T Consensus        41 aNivvlIa~-----~~l~~~~~~~~i~lr~il~AL~sGtlf----s~~Fl~sfaG~i~S~L~m~~l~~f~~k~~S~lgiS  111 (181)
T COG4769          41 ANIVVLIAF-----ETLNFKDALQTILLRVILQALFSGTLF----SPVFLYSFAGAILSTLFMYFLYQFGPKYLSLLGIS  111 (181)
T ss_pred             hhhHHHHhH-----HhccHHHHHHHHHHHHHHHHHHcCccc----hHHHHHHHHHHHHHHHHHHHHHHcCCceEeeeehh
Confidence            355555433     224555555555555555554433211    12345677788777777776654442 22233222


Q ss_pred             hhhHHHHHHHHHHHHHHHH---HhhhcCchhHHHHHHHHHHHHHHHHHHhhh
Q psy8269         316 YVRFSAEHAIQGIMLLDFL---GVLFRWRLFDHAAHLGGALFGILYSKYGEQ  364 (386)
Q Consensus       316 ~~~~~a~~~~~~~l~~~~~---~~~~~~~~i~~~aHLgG~L~G~l~~~~~~~  364 (386)
                      ...--.-.+..++++--+.   ......|-....+-+.|...|++....+.+
T Consensus       112 ~mGaF~hNl~QLivas~Lv~~~~v~l~lPll~flGivsG~~vg~~~~~~i~~  163 (181)
T COG4769         112 VMGAFTHNLGQLIVASFLVFTTSVMLYLPLLIFLGIVSGTAVGILANTLIIT  163 (181)
T ss_pred             hHHHHHHhHHHHHHHHHHHhcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            1111111111112111111   111122556667777777777777766554


No 48 
>smart00576 BTP Bromodomain transcription factors and PHD domain containing proteins. subdomain of archael histone-like transcription factors
Probab=65.28  E-value=14  Score=28.81  Aligned_cols=55  Identities=18%  Similarity=0.191  Sum_probs=48.9

Q ss_pred             cCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHHHHH
Q psy8269         126 LDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAITKY  180 (386)
Q Consensus       126 ~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI~~i  180 (386)
                      .+-|..-+++++.-.+....+.|++++...+-.+.+-.+|..++..||.+++...
T Consensus        17 ~~~Gf~~~~~sale~ltdi~~~yl~~l~~~~~~~a~~agR~~~~~~Dv~~Al~~~   71 (77)
T smart00576       17 ESAGFDSFQESALETLTDILQSYIQELGRTAHSYAELAGRTEPNLGDVVLALENL   71 (77)
T ss_pred             HHcCccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCHHHHHHHHHHh
Confidence            3558888999999999999999999999999999999999999999998887553


No 49 
>PF04719 TAFII28:  hTAFII28-like protein conserved region;  InterPro: IPR006809 The general transcription factor, TFIID, consists of the TATA-binding protein (TBP) associated with a series of TBP-associated factors (TAFs) that together participate in the assembly of the transcription preinitiation complex. The conserved region is found at the C terminus of most member proteins. The crystal structure of hTAFII28 with hTAFII18 shows that this region is involved in the binding of these two subunits. The conserved region contains four alpha helices and three loops arranged as in histone H3 [, ].; GO: 0006367 transcription initiation from RNA polymerase II promoter, 0005634 nucleus; PDB: 1BH9_B 1BH8_B.
Probab=61.76  E-value=21  Score=29.19  Aligned_cols=73  Identities=19%  Similarity=0.193  Sum_probs=47.1

Q ss_pred             hhhhhhhcccCchHHHHHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhcccc-ccchhhHHHHH
Q psy8269         105 LSMLDLKTQALPLARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRR-TLQRNDIAMAI  177 (386)
Q Consensus       105 ~~~~~~~~~~~p~~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~-~iq~~DIa~aI  177 (386)
                      ++=+.+....+|.+.|||+|+.-.+=+-+++.+.++++--.-+|+=|+...|-.-.++.+.. .+++.++..+.
T Consensus        14 ~Rye~fRRs~~~k~~ikkli~~~~~~qsv~~~v~i~v~g~aKvFVGEiVE~A~~Vq~~~~~~~pl~P~hlreA~   87 (90)
T PF04719_consen   14 DRYEAFRRSSFNKAAIKKLINQVLGNQSVSQNVVIAVAGIAKVFVGEIVEEARDVQEEWGETGPLQPDHLREAY   87 (90)
T ss_dssp             HHHHHHHH----HHHHHHHHHHHHS-S---HHHHHHHHHHHHHHHHHHHHHHHHHHHHTT--SS--HHHHHHHH
T ss_pred             HHHHHHHHccCCHHHHHHHHHHHcCCCCCChhHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCcHHHHHHH
Confidence            45667888889999999999996664566777788888888999999998887666554433 66666666553


No 50 
>cd08050 TAF6 TATA Binding Protein (TBP) Associated Factor 6 (TAF6) is one of several TAFs that bind TBP and is involved in forming Transcription Factor IID (TFIID) complex. The TATA Binding Protein (TBP) Associated Factor 6 (TAF6) is one of several TAFs that bind TBP and are involved in forming Transcription Factor IID (TFIID) complex. TFIID is one of seven General Transcription Factors (GTFs) (TFIIA, TFIIB, TFIID, TFIIE, TFIIF, and TFIID) that are involved in accurate initiation of transcription by RNA polymerase II in eukaryotes. TFIID plays an important role in the recognition of promoter DNA and assembly of the pre-initiation complex. TFIID complex is composed of the TBP and at least 13 TAFs. TAFs are named after their electrophoretic mobility in polyacrylamide gels in different species. A new, unified nomenclature has been suggested for the pol II TAFs to show the relationship between TAF orthologs and paralogs. Several hypotheses are proposed for TAFs functions such as serving as
Probab=61.38  E-value=17  Score=36.59  Aligned_cols=61  Identities=26%  Similarity=0.330  Sum_probs=52.9

Q ss_pred             HHHHHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHHHH
Q psy8269         118 ARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAITK  179 (386)
Q Consensus       118 ~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI~~  179 (386)
                      ..||.|.+. -|+.-++.|+....+...|-++.++...|-...+..||++++..||..|+-.
T Consensus         3 ~~i~~ia~~-~Gi~~~~~~a~~~La~~~e~~~~~i~~~A~k~~~hskR~~l~~~Di~~Al~~   63 (343)
T cd08050           3 ESIKLIAES-LGIDSLSDEVAQLLAEDVEYRLREIIQEAAKFMRHSKRRKLTTSDVNHALRL   63 (343)
T ss_pred             hHHHHHHHH-cCCCcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCcCCHHHHHHHHHH
Confidence            345555554 3778889999999999999999999999999999999999999999999844


No 51 
>cd07979 TAF9 TATA Binding Protein (TBP) Associated Factor 9 (TAF9) is one of several TAFs that bind TBP and is involved in forming Transcription Factor IID (TFIID) complex. The TATA Binding Protein (TBP) Associated Factor 9 (TAF9) is one of several TAFs that bind TBP and are involved in forming the TFIID complex. TFIID is one of seven General Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE, TFIIF, and TFIID) that are involved in accurate initiation of transcription by RNA polymerase II in eukaryotes. TFIID plays an important role in the recognition of promoter DNA and assembly of the pre-initiation complex. The TFIID complex is composed of the TBP and at least 13 TAFs. TAFs from various species were originally named by their predicted molecular weight or their electrophoretic mobility in polyacrylamide gels. A new, unified nomenclature for the pol II TAFs has been suggested to show the relationship between TAFs orthologs and paralogs. Human TAF9 has a paralogue gene (TAF9L) whi
Probab=59.34  E-value=28  Score=29.62  Aligned_cols=61  Identities=16%  Similarity=0.195  Sum_probs=53.4

Q ss_pred             HHHHHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHHHH
Q psy8269         118 ARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAITK  179 (386)
Q Consensus       118 ~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI~~  179 (386)
                      .-|++|++.. |+.-.+++++.-.-.-.+.+..++...+-.+.+-.+|++++..||..+|..
T Consensus         5 ~~v~~iLk~~-Gv~~~~~~v~~~Lle~~~ry~~~il~dA~~~a~hA~r~tV~~eDV~lAi~~   65 (117)
T cd07979           5 RVIAAILKSM-GITEYEPRVINQLLEFAYRYTTDVLDDAKVYSEHAGKANIDADDVKLAIQS   65 (117)
T ss_pred             HHHHHHHHHC-CCCccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCHHHHHHHHHH
Confidence            3466777754 888889999999999999999999999999999999999999999999944


No 52 
>PF02969 TAF:  TATA box binding protein associated factor (TAF);  InterPro: IPR004823 The TATA box binding protein associated factor (TAF) is part of the transcription initiation factor TFIID multimeric protein complex. TFIID plays a central role in mediating promoter responses to various activators and repressors. It binds tightly to TAFII-250 and directly interacts with TAFII-40. TFIID is composed of TATA binding protein (TBP)and a number of TBP-associated factors (TAFS). TAF proteins adopt a histone-like fold.; GO: 0006352 transcription initiation, DNA-dependent, 0005634 nucleus; PDB: 1TAF_B.
Probab=58.68  E-value=22  Score=27.29  Aligned_cols=49  Identities=27%  Similarity=0.374  Sum_probs=39.6

Q ss_pred             cccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHH
Q psy8269         129 GVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAI  177 (386)
Q Consensus       129 ~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI  177 (386)
                      |+.-++.|+-...+..+|--++++...+-......||+.++..||..++
T Consensus        17 Gi~~l~de~a~~La~dveyrlreiiq~a~kfm~hskR~~Lt~~Di~~AL   65 (66)
T PF02969_consen   17 GISNLSDEAAKALAEDVEYRLREIIQEALKFMRHSKRTKLTTDDINSAL   65 (66)
T ss_dssp             T---B-HHHHHHHHHHHHHHHHHHHHHHHHHHHHTT-SSB-HHHHHHHH
T ss_pred             CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHhCCCCCCHHHHHHHh
Confidence            5667888999999999999999999999999999999999999998875


No 53 
>PLN00161 histone H3; Provisional
Probab=57.42  E-value=38  Score=29.74  Aligned_cols=81  Identities=23%  Similarity=0.235  Sum_probs=63.6

Q ss_pred             hhhHHhhhhhhhhhhcccCchHHH-HHhhhcCc-cccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHH
Q psy8269          97 SVLEVQGALSMLDLKTQALPLARI-KKIMKLDD-GVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIA  174 (386)
Q Consensus        97 ~~~~~~~~~~~~~~~~~~~p~~ri-k~im~~~~-~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa  174 (386)
                      ..+|+...++..|.--+.+|-.|+ |+|++.-. .---+..+|....--|.|.|+-.+-.-+..|+--.||-|++..|+.
T Consensus        41 aLrEIR~yQkst~lLIpklPF~RLVREI~~~~~~~~~Rfq~~Al~ALQEAsEayLV~lFeda~lcaiHAkRVTlm~kDm~  120 (135)
T PLN00161         41 ALREIRKYQKSTELLIRKLPFARLVREISNEMLREPFRWTAEALLALQEATEDFLVHLFEDCNLCAIHAKRVTIMPKDMQ  120 (135)
T ss_pred             HHHHHHHHccccccccccccHHHHHHHHHHhcCCCCcEeeHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCcccchhhHH
Confidence            345665556778888888999885 66765432 1125688888999999999999999999999999999999999986


Q ss_pred             HHH
Q psy8269         175 MAI  177 (386)
Q Consensus       175 ~aI  177 (386)
                      .+-
T Consensus       121 La~  123 (135)
T PLN00161        121 LAR  123 (135)
T ss_pred             HHH
Confidence            553


No 54 
>TIGR02854 spore_II_GA sigma-E processing peptidase SpoIIGA. Members of this protein family are the stage II sporulation protein SpoIIGA. This protein acts as an activating protease for Sigma-E, one of several specialized sigma factors of the sporulation process in Bacillus subtilis and related endospore-forming bacteria.
Probab=57.14  E-value=1e+02  Score=30.30  Aligned_cols=41  Identities=12%  Similarity=0.169  Sum_probs=30.1

Q ss_pred             hhHHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHHHHHHHHHH
Q psy8269         232 PLHIFANMFVLHSFMPSSVEDLGKEQFVGFYLTAGVVASLLSYVH  276 (386)
Q Consensus       232 ~~HLl~Nm~~L~~fG~~lE~~~G~~rfl~lyl~sgv~gsl~s~l~  276 (386)
                      +..+++|.+.|+..+..+.+....+|.+    +++++|++.+.+.
T Consensus        10 l~Nf~~d~~LL~~t~~~lk~~~~~~Rll----~ga~iGa~~~~~~   50 (288)
T TIGR02854        10 LENFIIDYFLLYLTARTLKDKVSQWRLL----LAALIGSLYVLFM   50 (288)
T ss_pred             HHHHHHHHHHHHHHHHHhhccchHHHHH----HHHHHHHHHHHHH
Confidence            3567889999999999998888888765    4555555544444


No 55 
>COG4603 ABC-type uncharacterized transport system, permease component [General function prediction only]
Probab=54.33  E-value=2.2e+02  Score=29.04  Aligned_cols=123  Identities=18%  Similarity=0.187  Sum_probs=61.4

Q ss_pred             hhhHHHHHHHHHHHHHHHhhcccccccccCCCCccccccccccCCCCccccceeecc-cccCChhHHHHHHHHHHHHHHH
Q psy8269         170 RNDIAMAITKYDMFDFLIDIVPREEVKTATGPHRRDDATKTALNPDQYLSVPMLLST-FSHQSPLHIFANMFVLHSFMPS  248 (386)
Q Consensus       170 ~~DIa~aI~~i~i~~FL~~ivp~~~~~~~~~p~~~~~~~~~~l~p~~g~~wrllTs~-FlH~~~~HLl~Nm~~L~~fG~~  248 (386)
                      -+++-.+|+.+.+..++.+.+-........+   . +...+..-|+..+.-+++... -+|.++.--+.-....|+   .
T Consensus       142 ~nEVIttiMlNyIa~~l~~~lV~~~~~~p~~---~-~~~~T~~i~~~a~lp~~~~~~~~~~~g~lial~~av~~~~---l  214 (356)
T COG4603         142 VNEVITTIMLNYIAVYLLNYLVEGPLRDPAG---T-NFPQTRPIPDAARLPSLFLASGRLHWGFLLALAAAVLVWF---L  214 (356)
T ss_pred             ccHHHHHHHHHHHHHHHHHHHHcCcccCCCc---C-CCccCccCchhhcCCccccccchHHHHHHHHHHHHHHHHH---H
Confidence            3567778888888888777765531111111   0 011222223333333332222 455555444444444443   2


Q ss_pred             HHhh-hch--------------------hHHHHHHHHHHHHHHHHHHHHH-----hhcCCCcceechHHHHHHHHHH
Q psy8269         249 SVED-LGK--------------------EQFVGFYLTAGVVASLLSYVHK-----ILVRKPGLSIGASGAIMAVLAH  299 (386)
Q Consensus       249 lE~~-~G~--------------------~rfl~lyl~sgv~gsl~s~l~~-----~~~~~~~~~vGASGaI~Gll~a  299 (386)
                      +++. +|-                    +..+..+++||.+++++..+..     ....+..+-.|.+|...++++-
T Consensus       215 l~kT~~G~~lr~~G~sp~Aa~yaGi~~~r~i~~sm~iSGalAGLaG~~ev~G~~~~~~~~~~~G~Gf~gI~Vallg~  291 (356)
T COG4603         215 LKKTVFGYELRVVGLSPRAARYAGISVKRTIFLSMLISGALAGLAGAVEVLGAFGRLRPSISPGYGFDGIAVALLGR  291 (356)
T ss_pred             HHHccCCeEEEEecCChhHHHHCCCChhHHHHHHHHHHHHHHHHHHHHHHhcchhhcccccCCCcChHHHHHHHHhc
Confidence            4432 111                    1234555666666666654432     1122346788999999888885


No 56 
>PTZ00252 histone H2A; Provisional
Probab=54.32  E-value=35  Score=29.86  Aligned_cols=70  Identities=16%  Similarity=0.281  Sum_probs=58.4

Q ss_pred             CchHHHHHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhh--ccccccchhhHHHHHHHHHHHH
Q psy8269         115 LPLARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTED--NKRRTLQRNDIAMAITKYDMFD  184 (386)
Q Consensus       115 ~p~~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~--~kr~~iq~~DIa~aI~~i~i~~  184 (386)
                      .|++||.+.|+..---.=|+..+++......|=..-|+-..+...+.+  +++.-|++.++..+|-..+=++
T Consensus        26 FPVgRi~R~Lr~g~ya~RIga~ApVYLAAVLEYLtaEVLELAgnaa~d~~~kk~RItPrHi~lAIrNDeEL~   97 (134)
T PTZ00252         26 FPVGRVGSLLRRGQYARRIGASGAVYMAAVLEYLTAELLELSVKAAAQQAKKPKRLTPRTVTLAVRHDDDLG   97 (134)
T ss_pred             CchHHHHHHHHcCCcccccCCccHHHHHHHHHHHHHHHHHHHHHHHHhccCCcccccHHHHHhhccChHHHH
Confidence            899999999988766668899999999999998888888888888754  6778899999999995544333


No 57 
>PF13105 DUF3959:  Protein of unknown function (DUF3959)
Probab=53.40  E-value=66  Score=29.84  Aligned_cols=19  Identities=21%  Similarity=0.308  Sum_probs=15.8

Q ss_pred             HHHHHHHHHHHHHHHHhhh
Q psy8269         346 AAHLGGALFGILYSKYGEQ  364 (386)
Q Consensus       346 ~aHLgG~L~G~l~~~~~~~  364 (386)
                      .--+||++.|-+++.+++|
T Consensus       134 llLvgGli~GGLlA~~~hR  152 (239)
T PF13105_consen  134 LLLVGGLILGGLLAMLIHR  152 (239)
T ss_pred             HHHHHHHHHHHHHHHHHHh
Confidence            3457999999999998887


No 58 
>PF04632 FUSC:  Fusaric acid resistance protein family;  InterPro: IPR006726 This entry represents the p-hydroxybenzoic acid efflux pump subunit AaeB (pHBA efflux pump protein B) whose substrates are p-hydroxybenzoic acid (pHBA), 6-hydroxy-2-naphthoic and 2-hydroxycinnamate. It could function as a metabolic relief valve, allowing to eliminate certain compounds when they accumulate to high levels in the cell []. This family also includes fusaric acid resistance proteins [], which are likely to be membrane transporter proteins, and uncharacterised transporter YdhK.; GO: 0006810 transport, 0005886 plasma membrane
Probab=50.26  E-value=3.3e+02  Score=29.28  Aligned_cols=35  Identities=20%  Similarity=0.289  Sum_probs=16.4

Q ss_pred             HHHHHHHHHHHHHHHhhh---ccccccchHHHHHHhhc
Q psy8269         347 AHLGGALFGILYSKYGEQ---TWAHRAPVVEYWKSLKK  381 (386)
Q Consensus       347 aHLgG~L~G~l~~~~~~~---~w~~r~~~~~~W~~~R~  381 (386)
                      |.+.|.+++.+...+...   .|..|+-....|+++++
T Consensus       474 a~l~G~~~a~l~~~li~p~~~~~~~rrl~~~~~~~l~~  511 (650)
T PF04632_consen  474 AILLGIVIAALVFRLIRPFSPEWRRRRLLRALRRDLAR  511 (650)
T ss_pred             HHHHHHHHHHHHHHHHCCCChhHHHHHHHHHHHHHHHH
Confidence            444444444443333322   23344444557777665


No 59 
>PF03847 TFIID_20kDa:  Transcription initiation factor TFIID subunit A;  InterPro: IPR003228 Human transcription initiation factor TFIID is composed of the TATA-binding polypeptide (TBP) and at least 13 TBP-associated factors (TAFs) that collectively or individually are involved in activator-dependent transcription [].; GO: 0006352 transcription initiation, DNA-dependent, 0005669 transcription factor TFIID complex; PDB: 1H3O_B.
Probab=48.32  E-value=57  Score=25.11  Aligned_cols=60  Identities=17%  Similarity=0.289  Sum_probs=43.8

Q ss_pred             HHHHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHHH
Q psy8269         119 RIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAIT  178 (386)
Q Consensus       119 rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI~  178 (386)
                      ++...++.=+.-..+.+|+.-++..-.+-|+..++.++-..+..++..++...||...+-
T Consensus         4 ~l~~Lv~~iDp~~~ld~~vee~Ll~laddFv~~v~~~ac~lAKhR~s~tle~~Dv~~~Le   63 (68)
T PF03847_consen    4 KLQELVKQIDPNEKLDPDVEELLLELADDFVDDVVSFACRLAKHRKSSTLEVKDVQLHLE   63 (68)
T ss_dssp             HHHHHHHCC-SS----HHHHHHHHHHHHHHHHHHHHHHHHHHHHTT-SEE-HHHHHHHHH
T ss_pred             HHHHHHHHcCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCCCCHHHHHHHHH
Confidence            344555555555567888999999999999999999999999999999999988876653


No 60 
>TIGR00834 ae anion exchange protein. They preferentially catalyze anion exchange (antiport) reactions, typically acting as HCO3-:Cl- antiporters, but also transporting a range of other inorganic and organic anions. Additionally, renal Na+:HCO3- cotransporters have been found to be members of the AE family. They catalyze the reabsorption of HCO3- in the renal proximal tubule.
Probab=46.77  E-value=2.2e+02  Score=32.74  Aligned_cols=64  Identities=17%  Similarity=0.165  Sum_probs=37.8

Q ss_pred             hhHHHHHHHHHHHHHHHHHhhhchh-HHHHHHHHHHHHHHHHHHHHHhhcCCCcceechHHHHHHHHHH
Q psy8269         232 PLHIFANMFVLHSFMPSSVEDLGKE-QFVGFYLTAGVVASLLSYVHKILVRKPGLSIGASGAIMAVLAH  299 (386)
Q Consensus       232 ~~HLl~Nm~~L~~fG~~lE~~~G~~-rfl~lyl~sgv~gsl~s~l~~~~~~~~~~~vGASGaI~Gll~a  299 (386)
                      ++=.+.|+.--+.||..+++.-+.. -..-+.+++++ +++...++.   +++-..+|.+|-+.-+..+
T Consensus       376 ifiyFA~L~PaIaFG~ll~~~T~g~~gv~E~Llstai-~Giifslf~---GQPL~IlG~TGPilvF~~~  440 (900)
T TIGR00834       376 IFIYFAALSPAITFGGLLGEKTRNMMGVSELLISTAV-QGVLFALLA---AQPLLVVGFSGPLLVFEEA  440 (900)
T ss_pred             HHHHHHHhhHHhhHHHHHHHhhCCcchHHHHHHHHHH-HHHHHhhhc---CCceEEecCcccHHHHHHH
Confidence            3344556666677888888774433 22333344444 444444444   5777889999966555544


No 61 
>PF09011 HMG_box_2:  HMG-box domain;  InterPro: IPR015101 This domain is predominantly found in Maelstrom homologue proteins. It has no known function. ; GO: 0005634 nucleus; PDB: 2EQZ_A 1V64_A 2CTO_A 1H5P_A 3TQ6_A 3FGH_A 3TMM_A 1J3X_A 2YRQ_A 1AAB_A ....
Probab=46.22  E-value=4.3  Score=31.11  Aligned_cols=43  Identities=12%  Similarity=0.117  Sum_probs=31.9

Q ss_pred             ccchhhHHHHHHHHhccCCCCCeeeehhhhHHHHHHHhhcCCccchHHHh
Q psy8269          15 SNDIPKWRKDLRKRWNALHPGDRVYVPILFLNGVVFLAWFYPRLHPVLYK   64 (386)
Q Consensus        15 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~   64 (386)
                      .....+..+.+...|++|++.||..|       ..-|..+..++...|..
T Consensus        29 ~~~~~e~~k~~~~~Wk~Ls~~EK~~Y-------~~~A~~~k~~y~~e~~~   71 (73)
T PF09011_consen   29 KQSFREVMKEISERWKSLSEEEKEPY-------EERAKEDKERYEREMKE   71 (73)
T ss_dssp             -SSHHHHHHHHHHHHHHS-HHHHHHH-------HHHHHHHHHHHHHHHHH
T ss_pred             CCCHHHHHHHHHHHHHhcCHHHHHHH-------HHHHHHHHHHHHHHHHh
Confidence            34577889999999999999999988       66666666666665544


No 62 
>KOG1658|consensus
Probab=45.46  E-value=24  Score=31.58  Aligned_cols=45  Identities=27%  Similarity=0.256  Sum_probs=40.9

Q ss_pred             hhcccCchHHHHHhhhcCccccccccchhhHHHHHHHHHHHHHhH
Q psy8269         110 LKTQALPLARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTI  154 (386)
Q Consensus       110 ~~~~~~p~~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~  154 (386)
                      .+.+.+|.+..|||=|.+|-..+.+.++-.-..-+.|.|+|-++.
T Consensus         7 e~~p~~p~ekvkkiak~dPey~~te~~a~~etafatE~fvq~lv~   51 (162)
T KOG1658|consen    7 ECSPKLPMEKVKKIAKNDPEYMDTEDDAFVETAFATEQFVQVLVH   51 (162)
T ss_pred             hhCccccHHHHHHhhcCCchhhhcccchHHHHHHHHHHHHhhhhh
Confidence            355679999999999999999999999998999999999998886


No 63 
>PF06728 PIG-U:  GPI transamidase subunit PIG-U;  InterPro: IPR009600 Many eukaryotic proteins are anchored to the cell surface via glycosylphosphatidylinositol (GPI), which is posttranslationally attached to the C terminus by GPI transamidase. The mammalian GPI transamidase is a complex of at least four subunits, GPI8, GAA1, PIG-S, and PIG-T. PIG-U is thought to represent a fifth subunit in this complex and may be involved in the recognition of either the GPI attachment signal or the lipid portion of GPI [].; GO: 0006506 GPI anchor biosynthetic process, 0005789 endoplasmic reticulum membrane, 0016021 integral to membrane
Probab=44.71  E-value=40  Score=34.41  Aligned_cols=88  Identities=18%  Similarity=0.338  Sum_probs=66.8

Q ss_pred             ccCCCCccccceeecccccC-ChhHHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHHHHHHHHHHHhhcCCCcceech
Q psy8269         211 ALNPDQYLSVPMLLSTFSHQ-SPLHIFANMFVLHSFMPSSVEDLGKEQFVGFYLTAGVVASLLSYVHKILVRKPGLSIGA  289 (386)
Q Consensus       211 ~l~p~~g~~wrllTs~FlH~-~~~HLl~Nm~~L~~fG~~lE~~~G~~rfl~lyl~sgv~gsl~s~l~~~~~~~~~~~vGA  289 (386)
                      -+.|+-|-+|=++|-+|-|. .+....+|...+....+..-| +.+..+.++.+..|+++-+          .+.+++|=
T Consensus       254 dltPNlGl~WYfF~EmFd~Fr~fF~~vF~~~~~~y~~Pl~ir-l~~~p~~~~~~~~~~~~~f----------k~Ypt~~D  322 (382)
T PF06728_consen  254 DLTPNLGLWWYFFTEMFDHFRPFFLFVFQLHPFIYIIPLTIR-LRKQPLFAFFILLGIISVF----------KPYPTVGD  322 (382)
T ss_pred             CCCCCcchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhh-cccCcHHHHHHHHHHHHHh----------CCCCchhH
Confidence            35688899999999999995 888899999988777775554 4555666666666655432          46799999


Q ss_pred             HHHHHHHHHHHHHHcCCCcc
Q psy8269         290 SGAIMAVLAHTCITHPDTEL  309 (386)
Q Consensus       290 SGaI~Gll~a~~~~~p~~~~  309 (386)
                      .|...+++..+--.++..+.
T Consensus       323 ~~~~lsll~~~~~~~~~~r~  342 (382)
T PF06728_consen  323 VGLYLSLLPLFSHLFPYMRY  342 (382)
T ss_pred             HHHHHHHHHHhHHHHhhccc
Confidence            99999999887666666553


No 64 
>PF03540 TFIID_30kDa:  Transcription initiation factor TFIID 23-30kDa subunit;  InterPro: IPR003923 Transcription initiation factor TFIID is a multimeric protein complex that plays a central role in mediating promoter responses to various activators and repressors. The complex includes TATA binding protein (TBP) and various TBP-associated factors (TAFS). TFIID a bona fide RNA polymerase II-specific TATA-binding protein-associated factor (TAF) and is essential for viability []. TFIID acts to nucleate the transcription complex, recruiting the rest of the factors through a direct interaction with TFIIB. The TBP subunit of TFIID is sufficient for TATA-element binding and TFIIB interaction, and can support basal transcription. The protein belongs to the TAF2H family.; GO: 0006352 transcription initiation, DNA-dependent, 0005634 nucleus
Probab=43.53  E-value=74  Score=23.25  Aligned_cols=48  Identities=19%  Similarity=0.181  Sum_probs=36.0

Q ss_pred             cCchHHHHHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhh
Q psy8269         114 ALPLARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTED  162 (386)
Q Consensus       114 ~~p~~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~  162 (386)
                      .+|.+.+.-.|+.. |++.--....-+++.|+++|+.++...++.|..-
T Consensus         2 ~IPD~v~~~yL~~~-G~~~~D~rv~RLvSLaaQKFisdI~~dA~q~~k~   49 (51)
T PF03540_consen    2 TIPDEVTDYYLERS-GFQTSDPRVKRLVSLAAQKFISDIANDAMQYCKI   49 (51)
T ss_pred             CCCHHHHHHHHHHC-CCCCCCHhHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence            46777777777764 3344444556788999999999999999988754


No 65 
>smart00398 HMG high mobility group.
Probab=43.50  E-value=6.9  Score=28.87  Aligned_cols=40  Identities=15%  Similarity=0.090  Sum_probs=29.0

Q ss_pred             chhhHHHHHHHHhccCCCCCeeeehhhhHHHHHHHhhcCCccchHHH
Q psy8269          17 DIPKWRKDLRKRWNALHPGDRVYVPILFLNGVVFLAWFYPRLHPVLY   63 (386)
Q Consensus        17 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~   63 (386)
                      ..+++.+.+.+.|+.|.+.+|.++       ..-+..+...+...+.
T Consensus        28 ~~~~i~~~~~~~W~~l~~~ek~~y-------~~~a~~~~~~y~~~~~   67 (70)
T smart00398       28 SNAEISKKLGERWKLLSEEEKAPY-------EEKAKKDKERYEEEMP   67 (70)
T ss_pred             CHHHHHHHHHHHHHcCCHHHHHHH-------HHHHHHHHHHHHHHHH
Confidence            367889999999999999999988       5555544444444433


No 66 
>cd01390 HMGB-UBF_HMG-box HMGB-UBF_HMG-box, class II and III members of the HMG-box superfamily of DNA-binding proteins. These proteins bind the minor groove of DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III members include nucleolar and mitochondrial transcription factors, UBF and mtTF1, which bind four-way DNA junctions.
Probab=40.88  E-value=7.1  Score=28.67  Aligned_cols=24  Identities=17%  Similarity=0.368  Sum_probs=22.0

Q ss_pred             chhhHHHHHHHHhccCCCCCeeee
Q psy8269          17 DIPKWRKDLRKRWNALHPGDRVYV   40 (386)
Q Consensus        17 ~~~~~~~~~~~~~~~~~~~~~~~~   40 (386)
                      ...++-+.+.+.|+.|++.+|..+
T Consensus        27 ~~~~i~~~~~~~W~~ls~~eK~~y   50 (66)
T cd01390          27 SVTEVTKILGEKWKELSEEEKKKY   50 (66)
T ss_pred             CHHHHHHHHHHHHHhCCHHHHHHH
Confidence            568899999999999999999888


No 67 
>PTZ00199 high mobility group protein; Provisional
Probab=38.34  E-value=9.4  Score=31.05  Aligned_cols=41  Identities=15%  Similarity=0.073  Sum_probs=34.2

Q ss_pred             hhhHHHHHHHHhccCCCCCeeeehhhhHHHHHHHhhcCCccchHHHhh
Q psy8269          18 IPKWRKDLRKRWNALHPGDRVYVPILFLNGVVFLAWFYPRLHPVLYKY   65 (386)
Q Consensus        18 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~   65 (386)
                      ..++-+.|...|+.|+++||..+       ..-+..+..+|...|-.|
T Consensus        52 ~~evsk~ige~Wk~ls~eeK~~y-------~~~A~~dk~rY~~e~~~Y   92 (94)
T PTZ00199         52 VAAVGKMVGEAWNKLSEEEKAPY-------EKKAQEDKVRYEKEKAEY   92 (94)
T ss_pred             HHHHHHHHHHHHHcCCHHHHHHH-------HHHHHHHHHHHHHHHHHH
Confidence            56778899999999999999988       777778877777777665


No 68 
>cd01388 SOX-TCF_HMG-box SOX-TCF_HMG-box, class I member of the HMG-box superfamily of DNA-binding proteins. These proteins contain a single HMG box, and bind the minor groove of DNA in a highly sequence-specific manner. Members include SRY and its homologs in insects and vertebrates, and transcription factor-like proteins, TCF-1, -3, -4, and LEF-1. They appear to bind the minor groove of the A/T C A A A G/C-motif.
Probab=36.95  E-value=7.7  Score=29.62  Aligned_cols=42  Identities=17%  Similarity=0.096  Sum_probs=29.9

Q ss_pred             chhhHHHHHHHHhccCCCCCeeeehhhhHHHHHHHhhcCCccchHHHhh
Q psy8269          17 DIPKWRKDLRKRWNALHPGDRVYVPILFLNGVVFLAWFYPRLHPVLYKY   65 (386)
Q Consensus        17 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~   65 (386)
                      ...++-|.+.+.|+.|++++|..+       ...+..+...+...+++|
T Consensus        28 ~~~eisk~l~~~Wk~ls~~eK~~y-------~~~a~~~k~~y~~~~p~y   69 (72)
T cd01388          28 ENRAISKILGDRWKALSNEEKQPY-------YEEAKKLKELHMKLYPDY   69 (72)
T ss_pred             CHHHHHHHHHHHHHcCCHHHHHHH-------HHHHHHHHHHHHHHCcCC
Confidence            456788899999999999999887       555555554444444443


No 69 
>cd01389 MATA_HMG-box MATA_HMG-box, class I member of the HMG-box superfamily of DNA-binding proteins. These proteins contain a single HMG box, and bind the minor groove of DNA in a highly sequence-specific manner. Members include the fungal mating type gene products MC, MATA1 and Ste11.
Probab=36.89  E-value=7.3  Score=30.11  Aligned_cols=44  Identities=11%  Similarity=0.067  Sum_probs=34.4

Q ss_pred             chhhHHHHHHHHhccCCCCCeeeehhhhHHHHHHHhhcCCccchHHHhhhc
Q psy8269          17 DIPKWRKDLRKRWNALHPGDRVYVPILFLNGVVFLAWFYPRLHPVLYKYFA   67 (386)
Q Consensus        17 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~   67 (386)
                      ...++-+.+.+.|+.|++.+|..+       ...|..+...+...+++|=.
T Consensus        28 ~~~eisk~~g~~Wk~ls~eeK~~y-------~~~A~~~k~~~~~~~p~Yky   71 (77)
T cd01389          28 TNNEISRIIGRMWRSESPEVKAYY-------KELAEEEKERHAREYPDYKY   71 (77)
T ss_pred             CHHHHHHHHHHHHhhCCHHHHHHH-------HHHHHHHHHHHHHHCCCCcc
Confidence            457889999999999999999988       66666666666666666543


No 70 
>PF15510 CENP-W:  Centromere kinetochore component W
Probab=35.40  E-value=84  Score=25.78  Aligned_cols=63  Identities=21%  Similarity=0.371  Sum_probs=47.4

Q ss_pred             ccCchHHHHHhhhcCc-cccccccchhhH--------------HHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHH
Q psy8269         113 QALPLARIKKIMKLDD-GVKMISAEAPML--------------FSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMA  176 (386)
Q Consensus       113 ~~~p~~rik~im~~~~-~v~mi~~~~~~l--------------~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~a  176 (386)
                      +..|.+-+|+++|-.+ -.. +..+.+.+              +-.-|=+|++.+...+-+.+-++|-.+|.+.+|..+
T Consensus        15 rkaPrgfLkrv~Kr~KphlR-l~~~~Dllv~~~~f~~~~~~~~vhLncLLFvhrLAEEaRtnA~EnK~~~Ik~~Hv~Aa   92 (102)
T PF15510_consen   15 RKAPRGFLKRVFKRQKPHLR-LETSGDLLVRFCPFSGWQWGGEVHLNCLLFVHRLAEEARTNACENKCGTIKKEHVLAA   92 (102)
T ss_pred             HhCchHHHHHHHHhcCCcee-ecccccHHHhhcccccccccceeehhHHHHHHHHHHHHHHHHHHHhhccccHHHHHHH
Confidence            3579999999999443 333 22223333              556899999999999999999999999998887544


No 71 
>PF12805 FUSC-like:  FUSC-like inner membrane protein yccS
Probab=34.69  E-value=2.4e+02  Score=27.30  Aligned_cols=12  Identities=25%  Similarity=0.077  Sum_probs=4.8

Q ss_pred             HHHHHHHHHHHH
Q psy8269         344 DHAAHLGGALFG  355 (386)
Q Consensus       344 ~~~aHLgG~L~G  355 (386)
                      +.+.|.+=+++|
T Consensus        70 ~~~~~~~l~~~G   81 (284)
T PF12805_consen   70 EALEHALLFLAG   81 (284)
T ss_pred             HHHHHHHHHHHH
Confidence            334444333333


No 72 
>COG0705 Membrane associated serine protease [Amino acid transport and metabolism]
Probab=33.93  E-value=29  Score=32.29  Aligned_cols=69  Identities=13%  Similarity=-0.011  Sum_probs=49.3

Q ss_pred             CccccceeecccccCChhHHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHHHHHHHHHHHhhcCCCcceechHHHHHH
Q psy8269         216 QYLSVPMLLSTFSHQSPLHIFANMFVLHSFMPSSVEDLGKEQFVGFYLTAGVVASLLSYVHKILVRKPGLSIGASGAIMA  295 (386)
Q Consensus       216 ~g~~wrllTs~FlH~~~~HLl~Nm~~L~~fG~~lE~~~G~~rfl~lyl~sgv~gsl~s~l~~~~~~~~~~~vGASGaI~G  295 (386)
                      +|.++.++++.++|....|...+...             ..+...+++...++.++......     ..+.++.++-+.|
T Consensus       137 SG~i~gllga~~~~~~~~~~~~~~~~-------------~~~~~~~~i~~~~~~~~~~~~~~-----~~~~va~~aHl~G  198 (228)
T COG0705         137 SGAIFGLLGAYFLLFPFARILLLFLS-------------LPRPALILILIWLLYSLFSGAGS-----FGPSVAWSAHLGG  198 (228)
T ss_pred             hHHHHHHHHHHHHHccccchhhhhcc-------------CchhHHHHHHHHHHHHHHHHhcC-----CchHHHHHHHHHH
Confidence            67899999999999988888777654             34455667777777776666544     2257888888888


Q ss_pred             HHHHHHH
Q psy8269         296 VLAHTCI  302 (386)
Q Consensus       296 ll~a~~~  302 (386)
                      +++....
T Consensus       199 ~i~G~l~  205 (228)
T COG0705         199 LIGGLLL  205 (228)
T ss_pred             HHHHHHH
Confidence            8866543


No 73 
>PLN00160 histone H3; Provisional
Probab=33.84  E-value=1.4e+02  Score=24.71  Aligned_cols=80  Identities=23%  Similarity=0.231  Sum_probs=62.2

Q ss_pred             hhhHHhhhhhhhhhhcccCchHHH-HHhhhcCc-cccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHH
Q psy8269          97 SVLEVQGALSMLDLKTQALPLARI-KKIMKLDD-GVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIA  174 (386)
Q Consensus        97 ~~~~~~~~~~~~~~~~~~~p~~ri-k~im~~~~-~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa  174 (386)
                      +.+|+..-++..|.--+.+|-.|+ |+|.+.-. +---+..+|....--|.|.|.-.+-.-+..|+--.||-|++..|+.
T Consensus         7 aLrEIR~yQkst~lLI~k~pF~RLVREI~~~~~~~~~Rfq~~Al~ALQeAsEayLv~lfed~~lca~HakRVTl~~kD~~   86 (97)
T PLN00160          7 ALKEIKMYQKSTDLLIRRLPFARLVREIQMEMSREAYRWQGSAILALQEAAEAHLVGLFEDSNLCAIHGKRVTIMPKDMQ   86 (97)
T ss_pred             HHHHHHHHccchhhhhccccHHHHHHHHHHHcCCCCcEeeHHHHHHHHHHHHHHHHHHHhhhHHHHHHhcccccchhhHH
Confidence            345555556777778888898885 66665322 2126688888899999999999999999999999999999999986


Q ss_pred             HH
Q psy8269         175 MA  176 (386)
Q Consensus       175 ~a  176 (386)
                      .+
T Consensus        87 L~   88 (97)
T PLN00160         87 LA   88 (97)
T ss_pred             HH
Confidence            54


No 74 
>PF00505 HMG_box:  HMG (high mobility group) box;  InterPro: IPR000910 High mobility group (HMG or HMGB) proteins are a family of relatively low molecular weight non-histone components in chromatin. HMG1 (also called HMG-T in fish) and HMG2 are two highly related proteins that bind single-stranded DNA preferentially and unwind double-stranded DNA. Although they have no sequence specificity, they have a high affinity for bent or distorted DNA, and bend linear DNA. HMG1 and HMG2 contain two DNA-binding HMG-box domains (A and B) that show structural and functional differences, and have a long acidic C-terminal domain rich in aspartic and glutamic acid residues. The acidic tail modulates the affinity of the tandem HMG boxes in HMG1 and 2 for a variety of DNA targets. HMG1 and 2 appear to play important architectural roles in the assembly of nucleoprotein complexes in a variety of biological processes, for example V(D)J recombination, the initiation of transcription, and DNA repair []. The profile in this entry describing the HMG-domains is much more general than the signature. In addition to the HMG1 and HMG2 proteins, HMG-domains occur in single or multiple copies in the following protein classes; the SOX family of transcription factors; SRY sex determining region Y protein and related proteins []; LEF1 lymphoid enhancer binding factor 1 []; SSRP recombination signal recognition protein; MTF1 mitochondrial transcription factor 1; UBF1/2 nucleolar transcription factors; Abf2 yeast ARS-binding factor []; and Saccharomyces cerevisiae transcription factors Ixr1, Rox1, Nhp6a, Nhp6b and Spp41.; GO: 0003677 DNA binding; PDB: 1I11_A 1J3C_A 1J3D_A 1WZ6_A 1WGF_A 2D7L_A 1GT0_D 3U2B_C 2CRJ_A 2CS1_A ....
Probab=33.69  E-value=6.9  Score=29.06  Aligned_cols=25  Identities=16%  Similarity=0.194  Sum_probs=22.0

Q ss_pred             cchhhHHHHHHHHhccCCCCCeeee
Q psy8269          16 NDIPKWRKDLRKRWNALHPGDRVYV   40 (386)
Q Consensus        16 ~~~~~~~~~~~~~~~~~~~~~~~~~   40 (386)
                      ...+++.+.+.+.|+.|++.+|..+
T Consensus        26 ~~~~~i~~~~~~~W~~l~~~eK~~y   50 (69)
T PF00505_consen   26 LSNKEISKILAQMWKNLSEEEKAPY   50 (69)
T ss_dssp             STHHHHHHHHHHHHHCSHHHHHHHH
T ss_pred             cccccchhhHHHHHhcCCHHHHHHH
Confidence            3578899999999999999998877


No 75 
>PF10979 DUF2786:  Protein of unknown function (DUF2786);  InterPro: IPR024498 This domain is found in proteins that have no known function.
Probab=33.69  E-value=49  Score=23.14  Aligned_cols=35  Identities=26%  Similarity=0.362  Sum_probs=29.1

Q ss_pred             hHHHHHhhhcCccccccccchhhHHHHHHHHHHHH
Q psy8269         117 LARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHE  151 (386)
Q Consensus       117 ~~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~  151 (386)
                      +.||||.+.+..+-.--..|+.....+|.++..++
T Consensus         4 l~kI~kLLalA~~~~~~~~EA~~A~~kAq~Lm~ky   38 (43)
T PF10979_consen    4 LEKIRKLLALAESTGSNEHEAEAALAKAQRLMAKY   38 (43)
T ss_pred             HHHHHHHHHHhhCCCCCHHHHHHHHHHHHHHHHHh
Confidence            57999999999987777789999999998766543


No 76 
>PLN00121 histone H3; Provisional
Probab=33.58  E-value=1.2e+02  Score=26.74  Aligned_cols=80  Identities=21%  Similarity=0.251  Sum_probs=63.2

Q ss_pred             hhhHHhhhhhhhhhhcccCchHHH-HHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHH
Q psy8269          97 SVLEVQGALSMLDLKTQALPLARI-KKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAM  175 (386)
Q Consensus        97 ~~~~~~~~~~~~~~~~~~~p~~ri-k~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~  175 (386)
                      ..+|+...++..|.--+.+|-.|+ |+|.+.-..---+..++....--|.|.|.-.+-.-+..|+--.||-+++..|+..
T Consensus        48 aLrEIr~yQkst~lLI~k~pF~RLVREI~~~~~~~~Rf~~~Al~ALQeaaE~yLv~lfed~~lca~HakRVTl~~kD~~L  127 (136)
T PLN00121         48 ALREIRKYQKSTELLIRKLPFQRLVREIAQDFKTDLRFQSSAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTIMPKDIQL  127 (136)
T ss_pred             HHHHHHHhccccccccccccHHHHHHHHHHHhCccceeeHHHHHHHHHHHHHHHHHHHhhhHHHHHHhcceecchhhHHH
Confidence            345665566788888888999885 6666542211156889999999999999999999999999999999999999855


Q ss_pred             H
Q psy8269         176 A  176 (386)
Q Consensus       176 a  176 (386)
                      +
T Consensus       128 ~  128 (136)
T PLN00121        128 A  128 (136)
T ss_pred             H
Confidence            4


No 77 
>PF09527 ATPase_gene1:  Putative F0F1-ATPase subunit (ATPase_gene1)
Probab=33.25  E-value=1.8e+02  Score=20.81  Aligned_cols=41  Identities=17%  Similarity=0.052  Sum_probs=29.7

Q ss_pred             HHHHHHHHHHHHHHHHHhhhch-hHHHHHHHHHHHHHHHHHH
Q psy8269         234 HIFANMFVLHSFMPSSVEDLGK-EQFVGFYLTAGVVASLLSY  274 (386)
Q Consensus       234 HLl~Nm~~L~~fG~~lE~~~G~-~rfl~lyl~sgv~gsl~s~  274 (386)
                      .++.+++.-..+|.-+++.+|+ .-+..+.++-|+++++.+.
T Consensus         9 ~~~~~i~~g~~~G~~lD~~~~t~p~~~~~g~llG~~~g~~~~   50 (55)
T PF09527_consen    9 TMAAPILVGFFLGYWLDKWFGTSPWFTLIGLLLGIAAGFYNV   50 (55)
T ss_pred             HHHHHHHHHHHHHHHHHHHcCCChHHHHHHHHHHHHHHHHHH
Confidence            4566777778889999999888 5566666666776666544


No 78 
>PF06609 TRI12:  Fungal trichothecene efflux pump (TRI12);  InterPro: IPR010573 This family consists of several fungal specific trichothecene efflux pump proteins. Many of the genes involved in trichothecene toxin biosynthesis in Fusarium sporotrichioides are present within a gene cluster. It has been suggested that TRI12 may play a role in F. sporotrichioides self-protection against trichothecenes [].
Probab=32.33  E-value=1.9e+02  Score=31.55  Aligned_cols=41  Identities=17%  Similarity=0.117  Sum_probs=27.2

Q ss_pred             hHHHHHHHHHHHHHHHHHhhhchhHHHHHHHHHHHHHHHHHH
Q psy8269         233 LHIFANMFVLHSFMPSSVEDLGKEQFVGFYLTAGVVASLLSY  274 (386)
Q Consensus       233 ~HLl~Nm~~L~~fG~~lE~~~G~~rfl~lyl~sgv~gsl~s~  274 (386)
                      ...+.+....-.+|+ +.+.+|++.++..=.+.+++|.+...
T Consensus        86 ~~~l~~av~~~~~G~-LSDlfGRr~~~i~g~~l~vvG~Iv~a  126 (599)
T PF06609_consen   86 AWTLASAVSFPFVGR-LSDLFGRRYFFIIGSLLGVVGSIVCA  126 (599)
T ss_pred             HHHHHHHHHHHhhHH-HHHHhcchHHHHHHHHHHHhHHHHhh
Confidence            355556666666666 89999999888666665665555443


No 79 
>PF07524 Bromo_TP:  Bromodomain associated;  InterPro: IPR006565 This bromodomain is found in eukaryotic transcription factors and PHD domain containing proteins (IPR001965 from INTERPRO). The tandem PHD finger-bromodomain is found in many chromatin-associated proteins. It is involved in gene silencing by the human co-repressor KRAB-associated protein 1 (KAP1). The tandem PHD finger-bromodomain of KAP1 has a distinct structure that joins the two protein modules. The first helix, alpha(Z), of an atypical bromodomain forms the central hydrophobic core that anchors the other three helices of the bromodomain on one side and the zinc binding PHD finger on the other [].  The Rap1 GTPase-activating protein, Sipa1, is modulated by the cellular bromodomain protein, Brd4. Brd4 belongs to the BET family and is a multifunctional protein involved in transcription, replication, the signal transduction pathway, and cell cycle progression. All of these functions are linked to its association with acetylated chromatin. It has tandem bromodomains []. The dysregulation of the Brd4-associated pathways may play an important role in breast cancer progression []. Bovine papillomavirus type 1 E2 also binds to chromosomes in a complex with Brd4. Interaction with Brd4 is additionally important for E2-mediated transcriptional regulation [, ]. 
Probab=31.38  E-value=1.5e+02  Score=22.70  Aligned_cols=55  Identities=18%  Similarity=0.158  Sum_probs=47.5

Q ss_pred             hcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHHHH
Q psy8269         125 KLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAITK  179 (386)
Q Consensus       125 ~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI~~  179 (386)
                      ....|..-+++.+.-.+......|++.+...+=.+++..+|......|+.+++..
T Consensus        16 l~~~GF~~~~~~al~~Ltdi~~~yl~~l~~~~~~~ae~~gRt~~~~~Dv~~al~~   70 (77)
T PF07524_consen   16 LKHAGFDSASPSALDTLTDILQRYLQELGRTAKRYAEHAGRTEPNLQDVEQALEE   70 (77)
T ss_pred             HHHcCccccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCCCCHHHHHHHHHH
Confidence            3556788888999999999999999999999999999888888888999877744


No 80 
>COG3105 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=31.15  E-value=81  Score=27.54  Aligned_cols=24  Identities=17%  Similarity=0.225  Sum_probs=16.2

Q ss_pred             chhHHHHHHHHHHHHHHHHHHhhh
Q psy8269         341 RLFDHAAHLGGALFGILYSKYGEQ  364 (386)
Q Consensus       341 ~~i~~~aHLgG~L~G~l~~~~~~~  364 (386)
                      ....|..-+.|+++|+++|++.-|
T Consensus         4 t~~~W~~a~igLvvGi~IG~li~R   27 (138)
T COG3105           4 TFMTWEYALIGLVVGIIIGALIAR   27 (138)
T ss_pred             hHHHHHHHHHHHHHHHHHHHHHHH
Confidence            445566667777777777776665


No 81 
>COG0370 FeoB Fe2+ transport system protein B [Inorganic ion transport and metabolism]
Probab=30.82  E-value=4e+02  Score=29.49  Aligned_cols=71  Identities=20%  Similarity=0.250  Sum_probs=44.1

Q ss_pred             HHHHHHHHHHHHcCCCcc-eeEechhhhHHHHHHHHHHHHHHHHHhhhcCchhHHHHHHHHHHHHHHHHHHhhh
Q psy8269         292 AIMAVLAHTCITHPDTEL-GILFVPYVRFSAEHAIQGIMLLDFLGVLFRWRLFDHAAHLGGALFGILYSKYGEQ  364 (386)
Q Consensus       292 aI~Gll~a~~~~~p~~~~-~l~fip~~~~~a~~~~~~~l~~~~~~~~~~~~~i~~~aHLgG~L~G~l~~~~~~~  364 (386)
                      .+-|++++=..-.++.++ ..+.+|+....++.-+...++-.+....  ...+.+.-.+.|++..++.++.+++
T Consensus       393 nVPaImaTRtle~~reRllt~lv~pfmsCsARLpVy~l~~~affp~~--~~~v~f~lY~lgi~~all~a~vl~k  464 (653)
T COG0370         393 NVPAIMATRTLETERERLLTILVIPFMSCSARLPVYVLFAAAFFPSN--GGLVLFSLYLLGIVVALLTAKVLKK  464 (653)
T ss_pred             chHHHHHHHHhcchhHHHHHHHHhhccchhhhHHHHHHHHHHHcccc--cchHHHHHHHHHHHHHHHHHHHHHH
Confidence            355566655444444443 3334677777777776555443332211  2456777889999999999998887


No 82 
>PTZ00018 histone H3; Provisional
Probab=30.44  E-value=1.5e+02  Score=26.16  Aligned_cols=79  Identities=22%  Similarity=0.307  Sum_probs=63.4

Q ss_pred             hhhHHhhhhhhhhhhcccCchHHH-HHhhhc-CccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHH
Q psy8269          97 SVLEVQGALSMLDLKTQALPLARI-KKIMKL-DDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIA  174 (386)
Q Consensus        97 ~~~~~~~~~~~~~~~~~~~p~~ri-k~im~~-~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa  174 (386)
                      ..+|+..-++..|.--+.+|-.|+ |+|.+. .+++ -+..++....--|.|.|.-.+-.-+..|+--.||-+++..|+.
T Consensus        48 aLrEIr~yQkst~lLI~k~pF~RLVREI~~~~~~~~-rf~~~al~aLQeaaE~yLv~lfed~~lca~HakRVTl~~kD~~  126 (136)
T PTZ00018         48 ALREIRRYQKSTELLIRKLPFQRLVREIAQDFKTDL-RFQSSAVLALQEAAEAYLVGLFEDTNLCAIHAKRVTIMPKDIQ  126 (136)
T ss_pred             HHHHHHHHcccchhccccccHHHHHHHHHHHcCCcc-eeeHHHHHHHHHHHHHHHHHHhhhhHHHHHhhcceecchhhHH
Confidence            445665556788888888999885 566543 2333 6688999999999999999999999999999999999999985


Q ss_pred             HH
Q psy8269         175 MA  176 (386)
Q Consensus       175 ~a  176 (386)
                      .+
T Consensus       127 L~  128 (136)
T PTZ00018        127 LA  128 (136)
T ss_pred             HH
Confidence            44


No 83 
>cd08045 TAF4 TATA Binding Protein (TBP) Associated Factor 4 (TAF4) is one of several TAFs that bind TBP and is involved in forming Transcription Factor IID (TFIID) complex. The TATA Binding Protein (TBP) Associated Factor 4 (TAF4) is one of several TAFs that bind TBP and are involved in forming the Transcription Factor IID (TFIID) complex. TFIID is one of seven General Transcription Factors (GTF) (TFIIA, TFIIB, TFIID, TFIIE, TFIIF, and TFIID) that are involved in accurate initiation of transcription by RNA polymerase II in eukaryote. TFIID plays an important role in the recognition of promoter DNA and assembly of the pre-initiation complex. TFIID complex is composed of the TBP and at least 13 TAFs. TAFs from various species were originally named by their predicted molecular weight or their electrophoretic mobility in polyacrylamide gels. A new, unified nomenclature for the pol II TAFs has been suggested to show the relationship between TAF orthologs and paralogs. Several hypotheses are
Probab=28.58  E-value=1.9e+02  Score=27.00  Aligned_cols=66  Identities=21%  Similarity=0.232  Sum_probs=47.7

Q ss_pred             HHHHHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhcc------ccccchhhHHHHHHHHHHHH
Q psy8269         118 ARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNK------RRTLQRNDIAMAITKYDMFD  184 (386)
Q Consensus       118 ~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~k------r~~iq~~DIa~aI~~i~i~~  184 (386)
                      .++++|++.. +++.+++|+..+++.||+.+++.+....-.+.+-+.      .+....+|+...+..+...+
T Consensus        52 ~~~~~i~~~~-g~~~~~~d~~~lis~a~e~rlr~li~k~~~~s~hR~~~~~~~~r~~~~sdvr~qL~~l~~~e  123 (212)
T cd08045          52 KKIRKIAKKH-GLKEVDEDVLDLISLALEERLRNLLEKLIEVSEHRVDSEKEDERYEITSDVRKQLRFLEQLE  123 (212)
T ss_pred             HHHHHHHHHc-CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHhhhhhcCCCCceeecchHHHHHHHHHHHH
Confidence            4566666654 444889999999999999999998888776665543      34566788877776654433


No 84 
>PF14898 DUF4491:  Domain of unknown function (DUF4491)
Probab=28.43  E-value=90  Score=25.69  Aligned_cols=44  Identities=20%  Similarity=0.353  Sum_probs=30.8

Q ss_pred             HHhhhchhHHHHHHHHHHHHHHHHHHHHHhhcCCCcceechHHHHHHHHHHHHH
Q psy8269         249 SVEDLGKEQFVGFYLTAGVVASLLSYVHKILVRKPGLSIGASGAIMAVLAHTCI  302 (386)
Q Consensus       249 lE~~~G~~rfl~lyl~sgv~gsl~s~l~~~~~~~~~~~vGASGaI~Gll~a~~~  302 (386)
                      .|.++|+ +.+-+|++.|+++-.++++...        +.. .++.|++|+.+.
T Consensus        26 ~EYyfg~-~~W~~FL~~Gi~~~~~Sl~~~~--------~~~-S~llgv~g~s~l   69 (94)
T PF14898_consen   26 GEYYFGT-RIWPIFLLAGIACIIASLFVSN--------VIW-SALLGVLGFSCL   69 (94)
T ss_pred             EEEecCC-CcHHHHHHHHHHHHHHHHHHcc--------hHH-HHHHHHHHHHHH
Confidence            4667888 5777999999999888888761        122 246677776553


No 85 
>PF15511 CENP-T:  Centromere kinetochore component CENP-T; PDB: 3B0D_T 3B0C_T 3VH5_T 3VH6_T.
Probab=27.77  E-value=72  Score=33.07  Aligned_cols=67  Identities=21%  Similarity=0.229  Sum_probs=40.4

Q ss_pred             hhhhhhcccCchHHHHHhhhcCc-----cccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhh
Q psy8269         106 SMLDLKTQALPLARIKKIMKLDD-----GVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRND  172 (386)
Q Consensus       106 ~~~~~~~~~~p~~rik~im~~~~-----~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~D  172 (386)
                      +.+-..+..+|.+-|||+..---     .+++|++|+...+.+|.+.|-++++.-==.|+.=-+|+||..+|
T Consensus       343 Skhgi~~P~lP~~~vK~la~~~ak~s~~sK~kiskdal~aleqasdwfFeQl~dDL~aYA~HAgRKTIdesD  414 (414)
T PF15511_consen  343 SKHGIPYPSLPPGVVKKLAQHFAKSSGGSKMKISKDALEALEQASDWFFEQLGDDLEAYAKHAGRKTIDESD  414 (414)
T ss_dssp             ---------S-HHHHHHHHHHHH-------S-B-HHHHHHHHHHHHHHHHHHHHHHHHHHHHTT-SEE-HHH
T ss_pred             CCCCCCCCCCCccHHHHHHHHHHHhhcccCCccCHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCCCcCCCCC
Confidence            55666677799999999865432     45689999999999999999999986555556666788887654


No 86 
>KOG1172|consensus
Probab=25.96  E-value=8.8e+02  Score=27.83  Aligned_cols=59  Identities=17%  Similarity=0.195  Sum_probs=33.2

Q ss_pred             HHHHHHHHHHHHHHHhhhchh-HHHHHHHHHHHHHHHHHHHHHhhcCCCcceechHHHHHHHHH
Q psy8269         236 FANMFVLHSFMPSSVEDLGKE-QFVGFYLTAGVVASLLSYVHKILVRKPGLSIGASGAIMAVLA  298 (386)
Q Consensus       236 l~Nm~~L~~fG~~lE~~~G~~-rfl~lyl~sgv~gsl~s~l~~~~~~~~~~~vGASGaI~Gll~  298 (386)
                      +.|+.--..||..+++.-... ...-. ++|..++++...++.   +++-..+|.+|-+.=+--
T Consensus       371 fa~l~P~ItFG~ll~~~Tdg~~~v~E~-L~stal~Giifslfg---gQPLlIlg~TgP~lVfe~  430 (876)
T KOG1172|consen  371 FACLLPAITFGGLLGEATDGLIGVVET-LLSTALCGIIFSLFG---GQPLLILGVTGPLLVFEK  430 (876)
T ss_pred             HHhhhhHhhHHHHhhhhccchHHHHHH-HHHHHHHHHHHHHhc---CCceEEEecCccHHHHHH
Confidence            344555667888887764222 22223 344444444444444   567778899986544333


No 87 
>cd00084 HMG-box High Mobility Group (HMG)-box is found in a variety of eukaryotic chromosomal proteins and transcription factors. HMGs bind to the minor groove of DNA and have been classified by DNA binding preferences. Two phylogenically distinct groups of Class I proteins bind DNA in a sequence specific fashion and contain a single HMG box. One group (SOX-TCF) includes transcription factors, TCF-1, -3, -4; and also SRY and LEF-1, which bind four-way DNA junctions and duplex DNA targets. The second group (MATA) includes fungal mating type gene products MC, MATA1 and Ste11. Class II and III proteins (HMGB-UBF) bind DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III member
Probab=24.54  E-value=21  Score=25.88  Aligned_cols=25  Identities=16%  Similarity=0.258  Sum_probs=22.2

Q ss_pred             cchhhHHHHHHHHhccCCCCCeeee
Q psy8269          16 NDIPKWRKDLRKRWNALHPGDRVYV   40 (386)
Q Consensus        16 ~~~~~~~~~~~~~~~~~~~~~~~~~   40 (386)
                      ....++.+.+.+.|+.|++.+|.++
T Consensus        26 ~~~~~i~~~~~~~W~~l~~~~k~~y   50 (66)
T cd00084          26 LSVGEISKILGEMWKSLSEEEKKKY   50 (66)
T ss_pred             CCHHHHHHHHHHHHHhCCHHHHHHH
Confidence            3567899999999999999999887


No 88 
>KOG1142|consensus
Probab=24.35  E-value=1.1e+02  Score=29.87  Aligned_cols=71  Identities=15%  Similarity=0.234  Sum_probs=57.3

Q ss_pred             hhhhhcccCchHHHHHhhhcCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHH
Q psy8269         107 MLDLKTQALPLARIKKIMKLDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAI  177 (386)
Q Consensus       107 ~~~~~~~~~p~~rik~im~~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI  177 (386)
                      +++..++.+=..++.+.++.-++-..+-.|+.-+...-++-||..++.++=..+.-+|..++--.||.-.+
T Consensus       147 ~~~~~~~il~k~kl~dLvqqId~~~~LD~dVedlLleiADdFV~sii~~sC~LAKHRKsdtlEvrDIqLhL  217 (258)
T KOG1142|consen  147 DEPGNNPILSKRKLDDLVQQIDGTTKLDDDVEDLLLEIADDFVSSIIHRSCKLAKHRKSDTVEVRDIQLHL  217 (258)
T ss_pred             ccCCCCccccccchhHHHHhhcCcccccHHHHHHHHHHHHHHHHHHHHHHHHHHHhcccCccchhheeeee
Confidence            34455555666777777777788889999999999999999999999999999999998888766665444


No 89 
>PRK09554 feoB ferrous iron transport protein B; Reviewed
Probab=22.75  E-value=6.3e+02  Score=28.51  Aligned_cols=51  Identities=22%  Similarity=0.257  Sum_probs=33.7

Q ss_pred             EechhhhHHHHHHHHHHHHHHHHHhhhcCchhHHHHHHHHHHHHHHHHHHhhh
Q psy8269         312 LFVPYVRFSAEHAIQGIMLLDFLGVLFRWRLFDHAAHLGGALFGILYSKYGEQ  364 (386)
Q Consensus       312 ~fip~~~~~a~~~~~~~l~~~~~~~~~~~~~i~~~aHLgG~L~G~l~~~~~~~  364 (386)
                      +..|+.+..++.-+..+++-.+..  .....+-+...+.|+++.++.++++++
T Consensus       425 l~~pfm~C~ArlpV~~l~~~aff~--~~~~~v~~~~y~~gi~~~~~~~~ll~~  475 (772)
T PRK09554        425 MMAPFMSCGARLAIFAVFAAAFFG--QNGALAVFSLYLLGIVMAILTGLMLKY  475 (772)
T ss_pred             HHhccCCccchHHHHHHHHHHHcC--cccHHHHHHHHHHHHHHHHHHHHHHhc
Confidence            346667777777765554433321  112356777888999999999988876


No 90 
>PF14241 DUF4341:  Domain of unknown function (DUF4341)
Probab=22.65  E-value=1.5e+02  Score=22.28  Aligned_cols=34  Identities=18%  Similarity=0.188  Sum_probs=21.4

Q ss_pred             HHHHHHHHHHHHHHHHhhcCCCcceechHHHHHHHHHH
Q psy8269         262 YLTAGVVASLLSYVHKILVRKPGLSIGASGAIMAVLAH  299 (386)
Q Consensus       262 yl~sgv~gsl~s~l~~~~~~~~~~~vGASGaI~Gll~a  299 (386)
                      |+.+.++|-.+...+.    .++...|+||.+.+++..
T Consensus         6 l~GG~lIGla~~~ll~----~~Gri~GiSGil~~~~~~   39 (62)
T PF14241_consen    6 LIGGLLIGLAASLLLL----LNGRIAGISGILSGLLSP   39 (62)
T ss_pred             HHHHHHHHHHHHHHHH----HcCcceehHHHHHHHhCC
Confidence            3444444444333333    257889999999988775


No 91 
>PF04892 VanZ:  VanZ like family ;  InterPro: IPR006976 This entry represents a conserved sequence region found in the VanZ protein and also several phosphotransbutyrylases. VanZ confers low-level resistance to the glycopeptide antibiotic teicoplanin (Te). Analysis of cytoplasmic peptidoglycan precursors, accumulated in the presence of ramoplanin, showed that VanZ-mediated Te resistance does not involve incorporation of a substituent of D-alanine into the peptidoglycan precursors [].
Probab=22.11  E-value=4.6e+02  Score=21.71  Aligned_cols=28  Identities=29%  Similarity=0.416  Sum_probs=21.4

Q ss_pred             chhHHHHHHHHHHHHHHHHHHhhhcccc
Q psy8269         341 RLFDHAAHLGGALFGILYSKYGEQTWAH  368 (386)
Q Consensus       341 ~~i~~~aHLgG~L~G~l~~~~~~~~w~~  368 (386)
                      +-.|......|++.|+++....+|+|++
T Consensus       105 d~~Dv~~n~~G~~lG~~l~~~~~~~~~r  132 (133)
T PF04892_consen  105 DIDDVLANTLGALLGYLLYRLIRKRWQR  132 (133)
T ss_pred             CHHHHHHHHHHHHHHHHHHHHHHHHhcc
Confidence            4567788889999999988887765544


No 92 
>PF04632 FUSC:  Fusaric acid resistance protein family;  InterPro: IPR006726 This entry represents the p-hydroxybenzoic acid efflux pump subunit AaeB (pHBA efflux pump protein B) whose substrates are p-hydroxybenzoic acid (pHBA), 6-hydroxy-2-naphthoic and 2-hydroxycinnamate. It could function as a metabolic relief valve, allowing to eliminate certain compounds when they accumulate to high levels in the cell []. This family also includes fusaric acid resistance proteins [], which are likely to be membrane transporter proteins, and uncharacterised transporter YdhK.; GO: 0006810 transport, 0005886 plasma membrane
Probab=21.89  E-value=3.3e+02  Score=29.30  Aligned_cols=31  Identities=13%  Similarity=0.066  Sum_probs=21.5

Q ss_pred             chhHHHHHHHHHHHHHHHHHHhhhccccccc
Q psy8269         341 RLFDHAAHLGGALFGILYSKYGEQTWAHRAP  371 (386)
Q Consensus       341 ~~i~~~aHLgG~L~G~l~~~~~~~~w~~r~~  371 (386)
                      +...+.-...|.+.|+++++...+......+
T Consensus       464 d~~~f~n~~la~l~G~~~a~l~~~li~p~~~  494 (650)
T PF04632_consen  464 DFATFLNRALAILLGIVIAALVFRLIRPFSP  494 (650)
T ss_pred             CHHHHHHHHHHHHHHHHHHHHHHHHHCCCCh
Confidence            3455667778899999999888774443333


No 93 
>COG4956 Integral membrane protein (PIN domain superfamily) [General function prediction only]
Probab=21.83  E-value=3.1e+02  Score=27.67  Aligned_cols=86  Identities=17%  Similarity=0.256  Sum_probs=0.0

Q ss_pred             HHHHHHHHHHHHHHHHHHHhh------cCCCcceechHHHHHHHHHHHHHHcCCCcceeEechhhhHHHHHHHHHHHHHH
Q psy8269         259 VGFYLTAGVVASLLSYVHKIL------VRKPGLSIGASGAIMAVLAHTCITHPDTELGILFVPYVRFSAEHAIQGIMLLD  332 (386)
Q Consensus       259 l~lyl~sgv~gsl~s~l~~~~------~~~~~~~vGASGaI~Gll~a~~~~~p~~~~~l~fip~~~~~a~~~~~~~l~~~  332 (386)
                      +.+|+++..++..+.-.+..+      .+......|.=|.+.|++.++.+..|     +..+|+.-++....+.+.+.+.
T Consensus        49 i~~~li~~~~~~~~~~~~~~le~~i~k~~~~~ilf~tiGLiiGLlia~l~~~p-----L~~~~ip~~~~ii~vi~t~il~  123 (356)
T COG4956          49 IIFFLISFWFGKYVLNWLKRLEEQIRKLPVTTILFGTIGLIIGLLIAVLLSSP-----LFLLPIPFISTIIPVILTIILA  123 (356)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHhhH-----HhhCCccHHHhHHHHHHHHHHH


Q ss_pred             HHHhhhcCchhHHHHHH
Q psy8269         333 FLGVLFRWRLFDHAAHL  349 (386)
Q Consensus       333 ~~~~~~~~~~i~~~aHL  349 (386)
                      .++...+..+-+-.+++
T Consensus       124 y~G~~~~~k~~de~~~l  140 (356)
T COG4956         124 YFGFQLADKKRDEFLRL  140 (356)
T ss_pred             HHhhHHhhhhhHHHHHh


No 94 
>smart00427 H2B Histone H2B.
Probab=20.99  E-value=2.7e+02  Score=22.71  Aligned_cols=60  Identities=22%  Similarity=0.242  Sum_probs=36.9

Q ss_pred             HHHhhh-cCccccccccchhhHHHHHHHHHHHHHhHHhhHhhhhccccccchhhHHHHHHHH
Q psy8269         120 IKKIMK-LDDGVKMISAEAPMLFSKAAELFIHELTIRAWIHTEDNKRRTLQRNDIAMAITKY  180 (386)
Q Consensus       120 ik~im~-~~~~v~mi~~~~~~l~~~a~e~fi~~lt~rsw~~~~~~kr~~iq~~DIa~aI~~i  180 (386)
                      |.|++| .+|+. .|+.++-.+..--..-..+.+...+-+....+||++++..||..++-.+
T Consensus         7 i~kvLKqVhpd~-giS~kam~imnSfvnDiferIa~EAs~L~~~nkr~TltsreIqtAvrl~   67 (89)
T smart00427        7 IYKVLKQVHPDT-GISSKAMSIMNSFVNDIFERIAAEASKLARYNKKSTLSSREIQTAVRLI   67 (89)
T ss_pred             HHHHHHHhCCCc-cccHHHHHHHHHHHHHHHHHHHHHHHHHHhccCCCcCCHHHHHHHHHHH
Confidence            334433 34444 3555554444433333344556667777888999999999998887553


Done!