Query         000708
Match_columns 1340
No_of_seqs    387 out of 657
Neff          4.9 
Searched_HMMs 46136
Date          Mon Apr  1 22:18:59 2013
Command       hhsearch -i /work/01045/syshi/lefta3m/000708.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/leftcdd/000708hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 COG5602 SIN3 Histone deacetyla 100.0  1E-142  2E-147 1260.1  54.0  859   67-1328  117-1042(1163)
  2 smart00761 HDAC_interact Histo 100.0 1.5E-43 3.2E-48  338.9   7.0  100  494-593     1-101 (102)
  3 PF08295 Sin3_corepress:  Sin3  100.0 1.1E-41 2.3E-46  327.9   7.8  100  494-593     1-100 (101)
  4 KOG4204 Histone deacetylase co 100.0 2.4E-39 5.1E-44  353.2  17.9  159   74-232    15-197 (231)
  5 KOG4204 Histone deacetylase co  99.8 1.8E-19 3.9E-24  197.7  17.1  178  157-430    14-193 (231)
  6 COG5602 SIN3 Histone deacetyla  99.8 8.8E-19 1.9E-23  210.9  17.6  211  157-430   123-335 (1163)
  7 PF02671 PAH:  Paired amphipath  99.4 8.3E-13 1.8E-17  111.4   5.5   47   96-142     1-47  (47)
  8 PF02671 PAH:  Paired amphipath  99.2 1.2E-11 2.6E-16  104.4   5.9   47  180-226     1-47  (47)
  9 PTZ00146 fibrillarin; Provisio  94.2   0.053 1.1E-06   62.9   4.9    8  485-492   265-272 (293)
 10 KOG3915 Transcription regulato  93.1   0.068 1.5E-06   63.8   3.3   12  179-190   200-211 (641)
 11 KOG3973 Uncharacterized conser  92.2    0.16 3.4E-06   59.2   4.5    7   17-23    309-315 (465)
 12 KOG3973 Uncharacterized conser  91.5    0.24 5.2E-06   57.7   5.0    6   19-24    300-305 (465)
 13 PLN03138 Protein TOC75; Provis  90.1    0.31 6.6E-06   63.1   4.7   29  382-412   282-310 (796)
 14 PLN03138 Protein TOC75; Provis  88.3    0.45 9.8E-06   61.7   4.4   16  196-211   180-195 (796)
 15 PHA00370 III attachment protei  85.8     1.6 3.5E-05   49.4   6.4   13   81-93    148-160 (297)
 16 KOG3915 Transcription regulato  85.6    0.62 1.3E-05   56.1   3.2    8  220-227   260-267 (641)
 17 PHA00370 III attachment protei  84.8     2.1 4.6E-05   48.5   6.6   16  161-176   257-272 (297)
 18 COG4371 Predicted membrane pro  82.1     1.7 3.6E-05   49.2   4.5    8  202-209   181-188 (334)
 19 PF02084 Bindin:  Bindin;  Inte  79.1      21 0.00046   40.2  11.6    8  128-135   138-145 (238)
 20 KOG3074 Transcriptional regula  75.2    0.86 1.9E-05   51.0  -0.2   14   73-86     27-40  (263)
 21 COG4371 Predicted membrane pro  72.8     4.1 8.9E-05   46.2   4.2   14  119-132   182-195 (334)
 22 cd07354 HN_L-delphilin-R1_like  72.3      13 0.00028   35.8   6.7   68  163-231     3-72  (80)
 23 PF04285 DUF444:  Protein of un  72.2     6.2 0.00013   48.4   5.9   40  159-212   160-199 (421)
 24 KOG1308 Hsp70-interacting prot  68.3      13 0.00028   44.5   7.1   35   95-130   336-376 (377)
 25 PF13865 FoP_duplication:  C-te  67.6      14 0.00031   34.9   6.0   27   73-99     37-67  (74)
 26 cd07354 HN_L-delphilin-R1_like  66.5      21 0.00046   34.4   6.8   68   79-147     3-72  (80)
 27 KOG1596 Fibrillarin and relate  66.0      11 0.00024   43.2   5.7   14  484-497   288-301 (317)
 28 PTZ00110 helicase; Provisional  57.7      23  0.0005   44.9   7.2   11  728-738   485-495 (545)
 29 COG4174 ABC-type uncharacteriz  56.6      22 0.00048   41.2   6.0   29   83-111    83-111 (364)
 30 PTZ00009 heat shock 70 kDa pro  54.4      11 0.00023   48.8   3.7    8   57-64    630-637 (653)
 31 PF05642 Sporozoite_P67:  Sporo  53.6      19 0.00042   45.2   5.3   11  403-413   371-381 (727)
 32 PF11867 DUF3387:  Domain of un  52.6      45 0.00098   39.7   8.2   99  114-212   117-229 (335)
 33 PF07466 DUF1517:  Protein of u  51.4      23  0.0005   41.6   5.4   19  199-217   138-156 (289)
 34 PF05642 Sporozoite_P67:  Sporo  47.4      28 0.00061   43.8   5.4    7  374-380   363-369 (727)
 35 KOG3074 Transcriptional regula  46.2      13 0.00028   42.1   2.2   10  184-193    71-80  (263)
 36 PF07655 Secretin_N_2:  Secreti  45.4      28  0.0006   34.5   4.1   29   65-93     52-80  (98)
 37 PF05387 Chorion_3:  Chorion fa  44.9      23 0.00051   40.2   3.9   17   78-94     70-86  (277)
 38 KOG1924 RhoA GTPase effector D  44.7      42 0.00091   43.8   6.4   19  679-697   968-986 (1102)
 39 PRK05325 hypothetical protein;  42.6      24 0.00053   43.2   3.9   19  159-177   144-162 (401)
 40 PRK04537 ATP-dependent RNA hel  42.1      43 0.00094   42.8   6.2   10  371-380   556-565 (572)
 41 PRK10930 FtsH protease regulat  42.0      25 0.00054   43.4   3.9   16  382-397   302-317 (419)
 42 KOG1924 RhoA GTPase effector D  41.0      59  0.0013   42.6   6.9   16  372-390   807-822 (1102)
 43 PF04285 DUF444:  Protein of un  38.1      54  0.0012   40.6   5.9    6   13-18     51-56  (421)
 44 PF12174 RST:  RCD1-SRO-TAF4 (R  37.6      78  0.0017   29.9   5.5   45  371-415    13-57  (70)
 45 KOG3570 MAPK-activating protei  35.2      34 0.00074   44.8   3.6   37  640-676   903-940 (1588)
 46 KOG3875 Peroxisomal biogenesis  34.2      66  0.0014   38.2   5.4   12  203-214   180-191 (362)
 47 PF04094 DUF390:  Protein of un  33.6      64  0.0014   41.9   5.6    8  130-137   197-204 (828)
 48 TIGR02877 spore_yhbH sporulati  33.0      43 0.00094   40.6   3.9   31  387-427   317-347 (371)
 49 PF09060 L27_N:  L27_N;  InterP  30.5      57  0.0012   28.8   3.1   26 1224-1250    2-27  (49)
 50 KOG3304 Surfeit family protein  29.1      87  0.0019   32.9   4.7   32   80-111    16-48  (148)
 51 KOG4462 WASP-interacting prote  28.8      65  0.0014   38.7   4.3   13    7-19     37-49  (437)
 52 KOG0037 Ca2+-binding protein,   28.4      62  0.0013   36.7   3.9   27   95-122    56-82  (221)
 53 KOG3262 H/ACA small nucleolar   27.7      81  0.0018   34.9   4.5   17  214-230   128-144 (215)
 54 COG5324 Uncharacterized conser  27.6      74  0.0016   39.8   4.6  126  558-690   523-680 (758)
 55 PF12083 DUF3560:  Domain of un  27.0 3.5E+02  0.0077   28.4   8.8   65  591-655    41-118 (126)
 56 PF04094 DUF390:  Protein of un  26.8      69  0.0015   41.6   4.3    6  142-147   165-170 (828)
 57 PF08963 DUF1878:  Protein of u  26.7      76  0.0016   32.6   3.8   64  365-428    42-110 (113)
 58 KOG1456 Heterogeneous nuclear   26.5      58  0.0013   39.4   3.4   20  131-150    46-65  (494)
 59 cd07347 harmonin_N_like N-term  26.2 1.9E+02  0.0041   28.1   6.1   60  370-430     6-67  (78)
 60 PF04228 Zn_peptidase:  Putativ  25.5      79  0.0017   37.4   4.3   14  166-179   152-165 (292)
 61 PF08349 DUF1722:  Protein of u  25.4 1.4E+02   0.003   30.4   5.5   49  370-418    56-104 (117)
 62 KOG0775 Transcription factor S  25.2 2.7E+02  0.0058   32.8   8.1   20   85-104    66-87  (304)
 63 PRK05325 hypothetical protein;  24.5      79  0.0017   38.9   4.1    8  388-395   339-346 (401)
 64 KOG1456 Heterogeneous nuclear   24.2      63  0.0014   39.1   3.1   27  400-428   359-385 (494)
 65 KOG0339 ATP-dependent RNA heli  23.8      97  0.0021   39.1   4.7   10   84-93    659-668 (731)
 66 PF03564 DUF1759:  Protein of u  21.5 5.1E+02   0.011   26.6   8.9   82   92-194     1-85  (145)
 67 TIGR02877 spore_yhbH sporulati  21.4      94   0.002   37.9   3.9   16  161-176   158-173 (371)
 68 PRK09807 hypothetical protein;  20.9      89  0.0019   31.9   3.0   51   88-138     2-64  (161)

No 1  
>COG5602 SIN3 Histone deacetylase complex, SIN3 component [Chromatin structure and dynamics]
Probab=100.00  E-value=9.8e-143  Score=1260.11  Aligned_cols=859  Identities=30%  Similarity=0.442  Sum_probs=624.3

Q ss_pred             CCCCCCCCccccchhhhhHHHHHHHhcCChHHHHHHHHHHHHHHhccCChHHHHHHHHHHhccChhHHhhhhhcCCcccc
Q 000708           67 TAGMGTSQKLTTSDALTYLKEVKDMFQDQREKYDMFLEVMKDFKAQRTDTAGVIARVKDLFKGHNNLIFGFNTFLPKGYE  146 (1340)
Q Consensus        67 ~~g~~~~~~l~~~dAl~YL~~VK~~F~d~p~vY~eFLeIMkdfk~~~idt~gVi~rVs~LFkGhpdLI~GFN~FLP~gy~  146 (1340)
                      .++....++|+|+||++||++||.+|..+|++|+.||+||||||+|.|||+|||+||+.||+|+|+||+|||+|||+||+
T Consensus       117 ~p~~~~~r~Ldv~DAlsyLe~vK~~f~~rp~iYn~FLdiMkdFKsqaiDtpgVI~RVS~LFrgYP~LIegFNtFLPsgYk  196 (1163)
T COG5602         117 PPPRIPYRPLDVSDALSYLEKVKEQFSNRPEIYNNFLDIMKDFKSQAIDTPGVIERVSVLFRGYPHLIEGFNTFLPSGYK  196 (1163)
T ss_pred             CCCCCCCCCCChHHHHHHHHHHHHHHhcCHHHHHHHHHHHHHHHhcccCcHHHHHHHHHHHcCChHHHHHHhhhCCCccE
Confidence            34566789999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             ccCCCCC--------------------------------------------CC-------------CCCccchHHHHHHH
Q 000708          147 ITLDEDE--------------------------------------------AP-------------PKKTVEFEEAITFV  169 (1340)
Q Consensus       147 I~l~~de--------------------------------------------~p-------------p~~~vef~~Ai~Fv  169 (1340)
                      |+++.+.                                            ||             ++..+.|++||.||
T Consensus       197 ie~S~~~~~~s~l~vtTP~gp~s~~pv~ss~y~a~~n~~qrts~p~lps~~Q~e~s~~~~~~sd~~~~~~~d~nqAI~~v  276 (1163)
T COG5602         197 IEGSLPQPNGSRLHVTTPQGPLSSPPVQSSYYVAPCNHDQRTSHPTLPSDSQPEPSAPSHMPSDARGKHQVDFNQAIIFV  276 (1163)
T ss_pred             EEEecCCCCCCeeeecCCCCCCCCCCcccccccccccccccccCCCCccccCCCCCCcccCCcccccccceehhHHHHHH
Confidence            9875210                                            00             02346799999999


Q ss_pred             HHHHHHcCCChHHHHHHHHHHHHhhhccCcHHHHHHHHHHhhccChhHHHHHhhhCCCCCCCCcCCCCccccCCcccCCC
Q 000708          170 NKIKKRFHNDEHVYKSFLEILNMYRKEHKDINEVYSEVASLFEDHADLLEEFTRFLPDTSATSLSHNIPFVRNSTQRGNE  249 (1340)
Q Consensus       170 nKIK~RF~~dPevYk~FLeIL~~y~~~~~si~eV~~qVa~LFk~hPDLL~EF~~FLP~~~~~~~~~~~~~~~~~~~~~~~  249 (1340)
                      |+||.||+++|+.|..||++|++|++.+++|++||.+|+.||+++|||+++|..|||+....+....+.. ..+      
T Consensus       277 nkVK~r~~~~pe~y~~fl~~Lrtyq~~qr~i~ev~~~Vt~lfa~~PdLleeFk~FLPd~~~~a~q~~a~a-q~p------  349 (1163)
T COG5602         277 NKVKVRFQNNPEMYYDFLDSLRTYQMKQRSIQEVYARVTKLFAEAPDLLEEFKEFLPDSSVSAEQSTANA-QKP------  349 (1163)
T ss_pred             HHHHHhcCCCchhHHHHHHHHHHHHhhhccHHHHHHHHHHHHhhChHHHHHHHHhCcccccccccccccc-CCC------
Confidence            9999999999999999999999999999999999999999999999999999999999865432111100 001      


Q ss_pred             CCCCCCCccccchhhhhccccccccCCCCCCCCCCCCchhHHHHHHHHHHHHHHHHhhhhcccCCCCccccccccccccc
Q 000708          250 RSAGIPPLRQMQMDKHRRRDRIATSHADRDLSVDRPEMDDEKLMIKMQKEQRRRAEKENRDRRNRDQDDREIEHDNNRDF  329 (1340)
Q Consensus       250 r~s~~P~~~q~~~~K~~~~~r~~~s~~~~d~s~~rP~~~~~~~~~~~~keqr~~~~k~~~~r~~~~~~~~~~~~~~~~~~  329 (1340)
                       +..+|+...+...+.- +.++.+++..     ..|..     .           .+..+   .+.        +...++
T Consensus       350 -~~~lPPiG~Fs~p~~a-~~~~~ps~~~-----~~p~~-----~-----------~~~~~---~r~--------d~~~S~  395 (1163)
T COG5602         350 -SKRLPPIGSFSLPTAA-PEQNRPSLLW-----ESPRS-----I-----------SNISR---YRA--------DLLTSF  395 (1163)
T ss_pred             -cccCCCCCCCCCCCcc-cccCChHHhh-----cCcch-----h-----------ccccc---ccc--------chhhhh
Confidence             1223333222111100 0000001000     00000     0           00000   000        000001


Q ss_pred             ccccCCCCccccccccCCCCCCCCCccCchhhhhhhhhhhhhHHHHHHhhcC-hHHHHHHHHHHhhhhcCCCCHHHHHHH
Q 000708          330 NLQRFPDKKKSIKKVEGFGANSSLASYDDKDALKSIYNQGFIFCDKVKEKLC-SDDYQAFLKCLHIYSNGIIKRNDLQNL  408 (1340)
Q Consensus       330 ~~q~~~~k~k~~~~~~~~g~~P~~s~~~~~~a~k~~~~eElaFFDKVKk~L~-~~~Y~eFLKcLnLYsQeIIsr~ELv~l  408 (1340)
                      ..+-.|.+.         . .|...          ..++.-.||++||.+|. ...|++||||||||+|+||++++||++
T Consensus       396 s~~~~P~~~---------~-~P~s~----------~~~~se~ffe~ikq~L~~~~~y~eFLKlLnLY~Q~iiDk~~Lver  455 (1163)
T COG5602         396 SRNFVPIRI---------H-MPISH----------LSNESEDFFERIKQYLKDKNLYHEFLKLLNLYSQKIIDKNDLVER  455 (1163)
T ss_pred             hhhcCCccc---------c-CCccc----------cCCchHHHHHHHHHHhccHHHHHHHHHHHHHHHHhhccHHHHHHH
Confidence            111111110         0 11110          00122279999999999 999999999999999999999999999


Q ss_pred             HHHHhccChHHHHHHhhhhccccccccccccccccccccCCCCCCcccccccchhhhhhhhhHhHhhhhhhhhhhhcccc
Q 000708          409 VTDLLGKYMDLMDEFNHFFERCENIDGFLAGVMSKKSLCNDGHVSRSVKIEDKDREHKREMEVTKEKDRYKEKYYAKSIQ  488 (1340)
Q Consensus       409 V~~fLG~~pDLm~~Fk~FLg~~e~~~~~~~g~~s~~~l~~~g~~~~~~~~~~~dr~~~~~~~~~ker~~~~ek~~~kp~~  488 (1340)
                      +-.|||.+.+||.|||+|++|.+...                   .|                +             +..
T Consensus       456 ~~~~L~sNe~L~~wFk~fi~y~~~~~-------------------~~----------------~-------------~~~  487 (1163)
T COG5602         456 LFAFLGSNEELIRWFKAFINYSEPEK-------------------EP----------------L-------------RET  487 (1163)
T ss_pred             HHHHhcCcHHHHHHHHHHhccCCCCC-------------------CC----------------c-------------ccc
Confidence            99999999999999999999975210                   00                0             012


Q ss_pred             ccccCCCcccCCccccCCCCCCCCCCCCccccccccccCeeeec-ccCCCCcccchhccchhhHHhhhcccchhhHHHHH
Q 000708          489 ELDLSNCQRCTPSYRLLPDDYPIPSASQRSELGAQVLNDHWVSV-TSGSEDYSFKHMRRNQYEESLFRCEDDRFELDMLL  567 (1340)
Q Consensus       489 elDls~c~r~gpSYRlLPk~~~~~~CSgR~eLc~sVLND~WVS~-t~~SED~~F~~~rKNqyEE~LfrcEDERfElDm~I  567 (1340)
                      ..||+.|+.||||||+||+++...+|||||+|||+||||.|||| ||+|||++|++||||||||+|||+||||||||.+|
T Consensus       488 ~~~le~~~~~GpSYR~LP~~e~~i~CSGRDDl~w~VLND~wVs~Pt~aSEdSgfI~hrkNqyEEaL~kiEeERyEyDr~I  567 (1163)
T COG5602         488 RKDLEKFEKCGPSYRLLPKSEEEIKCSGRDDLAWEVLNDDWVSHPTWASEDSGFIAHRKNQYEEALFKIEEERYEYDRHI  567 (1163)
T ss_pred             ccCHhHhhhhCcchhcCCchhhhCCccCcchhhHhhhccccccCCCcccccccceeecccHHHHHHHHHHHHHHHHHHHH
Confidence            46889999999999999999999999999999999999999999 69999999999999999999999999999999999


Q ss_pred             HHHHHHHHHHHHHHHHHhhcccccCCCCcccccc----hhHHHHHHHHhhC-CCHHHHHHHHHhCcCcchhHHHHHHHHH
Q 000708          568 ESVSSTAKRAEELLNSINENKITLETPFHLKDHF----SALNLRCIERLYG-DHGLDVMDILRKNPAIALPVMLTRLKQK  642 (1340)
Q Consensus       568 E~~~~tI~~Le~l~~ki~~~~~e~~~~~~l~~~L----~~i~~R~I~rIYG-d~g~evie~L~knP~~AlPVVL~RLKQK  642 (1340)
                      |++.+||+.|+.++++|..|...++..+.|.++|    .+||+++|++||| +||.+||++|+++|++|+||||+|||+|
T Consensus       568 ea~~~~Ik~Le~i~d~~~~~~e~Eka~~~Lp~glg~~S~sIyKkvik~VY~KEhA~eile~L~k~P~vt~PiVlkRLk~K  647 (1163)
T COG5602         568 EATQRTIKALEQIIDKIKDMEESEKANKTLPGGLGLPSKSIYKKVIKKVYDKEHAPEILEALLKKPHVTIPIVLKRLKMK  647 (1163)
T ss_pred             HHHHHHHHHHHHHHHHhhcchhhHHHHhcCCCcCCCccHHHHHHHHHHHhchhhHHHHHHHHHhCCCcchHHHHHHHHHh
Confidence            9999999999999999999998888888888766    5899999999999 7999999999999999999999999999


Q ss_pred             HHHHHHHHHHhhHHHHHHHHHhhhhhcccCccccccccccccchhHHHHHHHHHHHhhhcchhhHHHHhhCCCCCCCCCc
Q 000708          643 QEEWTKCRSDFNKVWAEIYAKNHYKSLDHRSFYFKQQDSKNLSTKSLVAEIKQFKENKQTEDDVLFVIAAGHRQPVIPHL  722 (1340)
Q Consensus       643 ~eEWr~aq~e~nKvWrev~eKNy~KSLDHq~~~FKq~DkK~ls~K~Lv~EIe~i~eeq~~edd~~~~~~~g~~~~~~P~l  722 (1340)
                      ++|||+|+|+|||+||+|.+|||+|||||||++||+.|||.|++|.||.||+.++.++..-   ++      +....-++
T Consensus       648 d~EWR~~kRew~kiWReIEeKN~~ksLD~~g~sfK~rDKk~lstk~il~eidd~~q~kih~---~~------~~k~~fqf  718 (1163)
T COG5602         648 DEEWRSCKREWNKIWREIEEKNYHKSLDHQGVSFKSRDKKILSTKNILEEIDDRSQAKIHV---SI------DDKKVFQF  718 (1163)
T ss_pred             hHHHHHHHHHHHHHHHHHHHHhHHhhhhhcccceeccccccchHHHHHHHhhHhhhcceee---ec------Ccccccee
Confidence            9999999999999999999999999999999999999999999999999999776643211   00      11234588


Q ss_pred             eeecCChhHHHHHHHHHHHHhhhcc-CCHHHHHHHHHHHHHhhhcccCCCCCCCCCcchhhhhccccCCCCCCCCCcccc
Q 000708          723 EYGYSDSNIHEDLYKLVQYSCEEMC-STKDQLNKAMKLWTTFLEPMLSVPSRPSDVEGAEDAGKARHSGKNNSASSMVES  801 (1340)
Q Consensus       723 ~f~y~D~~I~~D~~~Li~~~~~~~~-~s~e~~~ki~~~~~~Fv~~ff~i~~r~~~~~~~ed~~~~~~~~~~~~~~~~~~~  801 (1340)
                      .|.|+|.-|+.|++.|.......+. -+..+++++..++.+||..||++-.--     .+..+-... .+.+      +.
T Consensus       719 ~fvlcd~~If~nil~l~dt~itn~ssyl~~dkerlka~lk~~~slff~~~~~~-----i~~~~y~~~-~nv~------~~  786 (1163)
T COG5602         719 VFVLCDTYIFVNILDLKDTLITNMSSYLESDKERLKANLKTIVSLFFLLCIFR-----IIIIVYERL-LNVK------GL  786 (1163)
T ss_pred             eeecccceeEeeeccccceeeecchhhhhhhHHHHHHHHHHHHHHHhhhhhhh-----hhhhhhhhh-cccc------cc
Confidence            9999999999999998654332222 235789999999999999999844210     001110100 0000      00


Q ss_pred             CCCCCCCCCccCCCCCCCCCCCCCCcccccccccccccCCCccccccccccccccccccchhhhhhhhhhcccccccccc
Q 000708          802 DGSPGPDGTVNSRQPISSGNGDENTSTELNNLCRTALSNGDTITKENVLDSDRVYRDDLSCSALQLEKEQKNVDISDKRS  881 (1340)
Q Consensus       802 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  881 (1340)
                      +.                   +.                 +.        .+|..++                       
T Consensus       787 ni-------------------~g-----------------~k--------~~r~srs-----------------------  799 (1163)
T COG5602         787 NI-------------------DG-----------------LK--------ADRSSRS-----------------------  799 (1163)
T ss_pred             cc-------------------CC-----------------cc--------ccccccc-----------------------
Confidence            00                   00                 00        0000000                       


Q ss_pred             chhhhhhccccccCCCCCCccccccCCCCCCcccccCCCCCCcccccccccccCCCCCCCCcccccCCCCcccccccccc
Q 000708          882 GIIIQVAVGERVANSDASPAIGAENSHGRTGSEMMSASLRPCDAAKDDLKHEANVNPVPPSEVTQGCDLAKPTLLENGAL  961 (1340)
Q Consensus       882 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~~~~~~~~~~~~~~~~~~~  961 (1340)
                      ...+|-   +..++                     +                  +  +++-                   
T Consensus       800 ~~~~q~---~s~vk---------------------s------------------g--n~~q-------------------  816 (1163)
T COG5602         800 DESAQR---YSKVK---------------------S------------------G--NLEQ-------------------  816 (1163)
T ss_pred             chhhhh---HHHhh---------------------c------------------c--cchh-------------------
Confidence            000000   00000                     0                  0  0000                   


Q ss_pred             ccCcccccccccCCCCCcccccccccCCCCCcccccccccccccccccccccCCccchhccccccccccccccCCCCCCC
Q 000708          962 RDGAKGINYHEKSVGPTKIEKEEGELSPNGDFEEDNFGVYGDAAVKTLPKAKHGVESRQYQSKNEKGLQHQVVGGENDAD 1041 (1340)
Q Consensus       962 ~~~~~~~~~~~~~~~~~~~e~eegelsp~~d~e~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 1041 (1340)
                                               +|-+.|.          .++                                   
T Consensus       817 -------------------------vs~q~de----------~ai-----------------------------------  826 (1163)
T COG5602         817 -------------------------VSKQIDE----------YAI-----------------------------------  826 (1163)
T ss_pred             -------------------------cccCCch----------hHH-----------------------------------
Confidence                                     0000000          000                                   


Q ss_pred             CCcccCCCcCccCCCCCCCCCCCCccchhhhhhhhcccccccCCcccccCcccCccccccCCCCCCCCCcccccccccCc
Q 000708         1042 ADDEDSGNASVAGDDASGSESAGDEYSREEHEEEEDVERDDVDGKAESEGEADGMADQHFVGGDCMSLPMSERFLLSVKP 1121 (1340)
Q Consensus      1042 ~d~e~~~~~s~~~~~~s~ses~~~e~~~~~~~~~~~~~~~~~d~k~ese~~aeg~~~~~~~~~~~~~~~~se~~~~~~kp 1121 (1340)
                       +||       ..   +.+     .|                          .|..+.+.-.|.+.    -|        
T Consensus       827 -ede-------~k---~p~-----hp--------------------------d~l~eh~~skgi~e----ne--------  852 (1163)
T COG5602         827 -EDE-------IK---EPT-----HP--------------------------DGLKEHNISKGISE----NE--------  852 (1163)
T ss_pred             -HHH-------hc---CCC-----CC--------------------------chHHHhhhhhccch----hh--------
Confidence             000       00   000     00                          00000000001100    00        


Q ss_pred             ccccCCCccccccccccceeecCchhhHHHHHHHHHHHHHHHHHhchhHhhhcccCccCCChHHHHHHHHHHHhhccCCC
Q 000708         1122 LAKFVPATSVEERKDCRVFYGNDDFYVLFRLHQTLYERIQWAKMNTTGAEMKRRTSKEASCSDLYARFMTALYNLLDGSI 1201 (1340)
Q Consensus      1122 la~~~~~~~~~~~~~~~~FygN~~~YvffRLhqiLyERL~~aK~~s~~~e~r~k~~~d~~~~~~Y~~fL~~l~~LldG~~ 1201 (1340)
                        ++        ... ..||||.+.||||||...+||||..+|.--.+..+-      .....-|...|.++++|+.|.+
T Consensus       853 --k~--------~i~-~~~fgns~myvffrl~~~~yerLyeik~~~e~vti~------~~~~g~Yen~l~~s~rli~g~l  915 (1163)
T COG5602         853 --KQ--------VIA-LTEFGNSLMYVFFRLRFDSYERLYEIKKIKEAVTIG------MMEPGHYENELWDSYRLIFGAL  915 (1163)
T ss_pred             --HH--------HHH-HhhcccceeeeehhhhHhHHHHHHHHHHHHHHhhhh------hcccchhhhHHHHHHHHHhccc
Confidence              11        111 348999999999999999999999998533222211      1123489999999999999999


Q ss_pred             CchhhHHHHHhhhcCcceEEeeHHHHHHHHHHHHHHHhchhhhHHHHHHHHHHhhcCCC-CcchHHHHHHHHHhc-CCCC
Q 000708         1202 DNAKFEDECRAIIGNQSYVLFTLDKLLYRLCKQLQTVAADEMDNKLIQLYEYEESRKPG-KQIDSVYYENARVLL-HEEN 1279 (1340)
Q Consensus      1202 D~~~fED~~R~~fG~~aY~lFTiDKLi~~lvKqlq~iv~De~~~~ll~Ly~~E~~R~~~-~~~d~~Y~~~Ae~ll-~dEn 1279 (1340)
                      ++.+||+.+|..+++++|+++|||||.|.|+||||+++.|-+...++.||.....-++. ...+++||..+|.+| +||+
T Consensus       916 ~~~efekylr~fynntcykiytidrl~qsl~kq~h~i~~d~Ky~~~~~lfe~nsaaS~k~~~dq~~yrl~ves~l~p~e~  995 (1163)
T COG5602         916 TPCEFEKYLRPFYNNTCYKIYTIDRLRQSLKKQLHPITYDIKYIAKSELFEMNSAASSKRKQDQTKYRLEVESLLNPDEI  995 (1163)
T ss_pred             cHHHHHHHHHHhccCcccccccHHHHHHHHHHHhcccccchHHHHHHHHHHhccccCcCccchhhHHHHHHHHhcCchhh
Confidence            99999999999999999999999999999999999999999999999999554433332 246889999999999 8999


Q ss_pred             eEEEEEccCCCeeEEEEecCCcCCcchhhcccChHHHHHhhhhcccccC
Q 000708         1280 IYRIQLSSSPSRLSIQLMDNVIEKPEAFAVTMDPNFAAYLLNDFLSAFL 1328 (1340)
Q Consensus      1280 ~yrie~~~~~~~ltIqLld~~~~~~e~~~~~~~~~~~~Yl~~~fl~~~~ 1328 (1340)
                      ||||.|.+.-.++.||+|-...=-.+.+.+ -+..|-.||+ +|.-.-+
T Consensus       996 ~Fr~~w~~~~k~~~Iq~~~~adLt~~~~~~-~~~~wkyYV~-sYai~h~ 1042 (1163)
T COG5602         996 LFRFCWINKFKSFGIQIMKRADLTVDQSLD-TQRVWKYYVQ-SYAIQHL 1042 (1163)
T ss_pred             eeeeeecchhheeeeeEeeccccccccccC-hHHHHHHHHH-hhccccc
Confidence            999999999999999999554222222211 1258999999 7764433


No 2  
>smart00761 HDAC_interact Histone deacetylase (HDAC) interacting. This domain is found on transcriptional regulators. It forms interactions with histone deacetylases.
Probab=100.00  E-value=1.5e-43  Score=338.87  Aligned_cols=100  Identities=56%  Similarity=0.862  Sum_probs=96.6

Q ss_pred             CCcccCCccccCCCCCCCCCCCCccccccccccCeeeecccC-CCCcccchhccchhhHHhhhcccchhhHHHHHHHHHH
Q 000708          494 NCQRCTPSYRLLPDDYPIPSASQRSELGAQVLNDHWVSVTSG-SEDYSFKHMRRNQYEESLFRCEDDRFELDMLLESVSS  572 (1340)
Q Consensus       494 ~c~r~gpSYRlLPk~~~~~~CSgR~eLc~sVLND~WVS~t~~-SED~~F~~~rKNqyEE~LfrcEDERfElDm~IE~~~~  572 (1340)
                      +|++||||||+|||+|+.++||||++||++||||+|||||+| |||++|++||||||||+|||||||||||||+||++.+
T Consensus         1 ~c~~~gpSYr~Lpk~~~~~~cSgR~~l~~~VLND~wvsvps~~SED~~F~~~rKNqyEE~Lfr~EDeR~E~D~~ie~~~~   80 (102)
T smart00761        1 NCERCGPSYRLLPKSEKQPKCSGRDELCKEVLNDTWVSHPTWASEDSGFVAHRKNQYEEALFRCEDERFELDMVIESNSS   80 (102)
T ss_pred             CCcccCcchhhCCcccCCCCCCCccHHHHHHhCCceEeecCCcccccchhhhhccHHHHHHHHhhhhhhHHHHHHHHHHH
Confidence            599999999999999999999999999999999999999876 9999999999999999999999999999999999999


Q ss_pred             HHHHHHHHHHHHhhcccccCC
Q 000708          573 TAKRAEELLNSINENKITLET  593 (1340)
Q Consensus       573 tI~~Le~l~~ki~~~~~e~~~  593 (1340)
                      ||++|++|+++|+.|+++++.
T Consensus        81 ti~~le~l~~~~~~~~~~e~~  101 (102)
T smart00761       81 TIKLLEEILNKIEDMSDEERA  101 (102)
T ss_pred             HHHHHHHHHHHHhcCChhhcc
Confidence            999999999999999887654


No 3  
>PF08295 Sin3_corepress:  Sin3 family co-repressor;  InterPro: IPR013194 This domain is found on transcriptional regulators. It forms interactions with histone deacetylases [].
Probab=100.00  E-value=1.1e-41  Score=327.91  Aligned_cols=100  Identities=60%  Similarity=0.914  Sum_probs=97.3

Q ss_pred             CCcccCCccccCCCCCCCCCCCCccccccccccCeeeecccCCCCcccchhccchhhHHhhhcccchhhHHHHHHHHHHH
Q 000708          494 NCQRCTPSYRLLPDDYPIPSASQRSELGAQVLNDHWVSVTSGSEDYSFKHMRRNQYEESLFRCEDDRFELDMLLESVSST  573 (1340)
Q Consensus       494 ~c~r~gpSYRlLPk~~~~~~CSgR~eLc~sVLND~WVS~t~~SED~~F~~~rKNqyEE~LfrcEDERfElDm~IE~~~~t  573 (1340)
                      +|++||||||+||++|+.++||||++||++||||+|||||+||||++|++||||||||+|||||||||||||+||+|++|
T Consensus         1 ~c~~~gpSYr~LP~~~~~~~cSgR~~l~~~VLND~wvs~p~~sEd~~f~~~kKnqyEE~lf~~EDeR~E~D~~ie~~~~t   80 (101)
T PF08295_consen    1 NCERCGPSYRLLPKSYQNPPCSGRDELCWSVLNDTWVSVPSWSEDSSFKAMKKNQYEEALFRCEDERFELDMLIESNRST   80 (101)
T ss_pred             CCCcCccchhhCCcccCCCCCCCCchhhhhhcCCEEEEeCCccccccccchhhhHHHHHHHHhhhhhHHHHHHHHHHHHH
Confidence            69999999999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             HHHHHHHHHHHhhcccccCC
Q 000708          574 AKRAEELLNSINENKITLET  593 (1340)
Q Consensus       574 I~~Le~l~~ki~~~~~e~~~  593 (1340)
                      |+.|++|+++|..|+++++.
T Consensus        81 I~~Le~l~~~i~~~~~ee~~  100 (101)
T PF08295_consen   81 IKLLEELQEKIQDMSPEERA  100 (101)
T ss_pred             HHHHHHHHHHHhcCCHhHhc
Confidence            99999999999999887764


No 4  
>KOG4204 consensus Histone deacetylase complex, SIN3 component [Chromatin structure and dynamics]
Probab=100.00  E-value=2.4e-39  Score=353.24  Aligned_cols=159  Identities=63%  Similarity=1.042  Sum_probs=148.1

Q ss_pred             CccccchhhhhHHHHHHHhcCChHHHHHHHHHHHHHHhccCChHHHHHHHHHHhccChhHHhhhhhcCCccccccCCCCC
Q 000708           74 QKLTTSDALTYLKEVKDMFQDQREKYDMFLEVMKDFKAQRTDTAGVIARVKDLFKGHNNLIFGFNTFLPKGYEITLDEDE  153 (1340)
Q Consensus        74 ~~l~~~dAl~YL~~VK~~F~d~p~vY~eFLeIMkdfk~~~idt~gVi~rVs~LFkGhpdLI~GFN~FLP~gy~I~l~~de  153 (1340)
                      +.++++||++||++||.+|+++|++|++||+||+|||+++||+++||+||++||+|||+||+|||+|||+||+|+++.+.
T Consensus        15 ~~~t~~DAlsYl~~VK~~f~d~p~kY~~FL~im~d~ka~~iD~~~vi~rv~eLfK~h~~Ll~gfN~fLP~~~~i~~~~~~   94 (231)
T KOG4204|consen   15 PALTLDDALAYLKAVKEAFQDEPEKYDEFLEIMKDFKAQRIDTPGVIARVKELLKGHPDLLLGFNTFLPPGYKITLSSEA   94 (231)
T ss_pred             CCCChHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHHhccCCHHHHHHHHHHHHccCHHHHHHHHhhCcccceeccCccc
Confidence            34999999999999999999999999999999999999999999999999999999999999999999999999987431


Q ss_pred             -------------CCCCCc-----------cchHHHHHHHHHHHHHcCCChHHHHHHHHHHHHhhhccCcHHHHHHHHHH
Q 000708          154 -------------APPKKT-----------VEFEEAITFVNKIKKRFHNDEHVYKSFLEILNMYRKEHKDINEVYSEVAS  209 (1340)
Q Consensus       154 -------------~pp~~~-----------vef~~Ai~FvnKIK~RF~~dPevYk~FLeIL~~y~~~~~si~eV~~qVa~  209 (1340)
                                   ..++..           +++.+|++|+||||+||++++++|++||+||+||+++++++.+||++|..
T Consensus        95 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~a~~fv~klk~rf~~~~~v~~s~l~il~~y~~~~ks~~e~~~eV~~  174 (231)
T KOG4204|consen   95 KDEFTIYGATSYLPSPKVAFHEEILKLLEDVEFDRAISFVNKLKTRFQGDDHVYKSFLEILRMYQEGNKSVSEVYQEVEA  174 (231)
T ss_pred             ccccccccccccCCCccccccchhhhhcccchHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHhhccchHHHHHHHHHH
Confidence                         122333           88999999999999999999999999999999999999999999999999


Q ss_pred             hhccChhHHHHHhhhCCCCCCCC
Q 000708          210 LFEDHADLLEEFTRFLPDTSATS  232 (1340)
Q Consensus       210 LFk~hPDLL~EF~~FLP~~~~~~  232 (1340)
                      ||++|+||+.+|.+|||......
T Consensus       175 L~~~~~DL~~ef~~~lp~~~~~~  197 (231)
T KOG4204|consen  175 LLQGHEDLLEEFSHFLPTDPATG  197 (231)
T ss_pred             HHccCHHHHHHHHhhccCCcccc
Confidence            99999999999999999985544


No 5  
>KOG4204 consensus Histone deacetylase complex, SIN3 component [Chromatin structure and dynamics]
Probab=99.82  E-value=1.8e-19  Score=197.69  Aligned_cols=178  Identities=25%  Similarity=0.443  Sum_probs=137.0

Q ss_pred             CCccchHHHHHHHHHHHHHcCCChHHHHHHHHHHHHhhhccCcHHHHHHHHHHhhccChhHHHHHhhhCCCCCCCCcCCC
Q 000708          157 KKTVEFEEAITFVNKIKKRFHNDEHVYKSFLEILNMYRKEHKDINEVYSEVASLFEDHADLLEEFTRFLPDTSATSLSHN  236 (1340)
Q Consensus       157 ~~~vef~~Ai~FvnKIK~RF~~dPevYk~FLeIL~~y~~~~~si~eV~~qVa~LFk~hPDLL~EF~~FLP~~~~~~~~~~  236 (1340)
                      .+.+++++|++||++||.+|+++|++|.+||+||++|+.+.++..+|+++|+.||+||||||.+|+.|||..........
T Consensus        14 ~~~~t~~DAlsYl~~VK~~f~d~p~kY~~FL~im~d~ka~~iD~~~vi~rv~eLfK~h~~Ll~gfN~fLP~~~~i~~~~~   93 (231)
T KOG4204|consen   14 SPALTLDDALAYLKAVKEAFQDEPEKYDEFLEIMKDFKAQRIDTPGVIARVKELLKGHPDLLLGFNTFLPPGYKITLSSE   93 (231)
T ss_pred             CCCCChHHHHHHHHHHHHHHhcChHHHHHHHHHHHHHHhccCCHHHHHHHHHHHHccCHHHHHHHHhhCcccceeccCcc
Confidence            34578999999999999999999999999999999999999999999999999999999999999999999877654211


Q ss_pred             CccccCCcccCCCCCCCCCCccccchhhhhccccccccCCCCCCCCCCCCchhHHHHHHHHHHHHHHHHhhhhcccCCCC
Q 000708          237 IPFVRNSTQRGNERSAGIPPLRQMQMDKHRRRDRIATSHADRDLSVDRPEMDDEKLMIKMQKEQRRRAEKENRDRRNRDQ  316 (1340)
Q Consensus       237 ~~~~~~~~~~~~~r~s~~P~~~q~~~~K~~~~~r~~~s~~~~d~s~~rP~~~~~~~~~~~~keqr~~~~k~~~~r~~~~~  316 (1340)
                      ....           ..++...                       ...|...                            
T Consensus        94 ~~~~-----------~~~~~~~-----------------------~~~~~~~----------------------------  111 (231)
T KOG4204|consen   94 AKDE-----------FTIYGAT-----------------------SYLPSPK----------------------------  111 (231)
T ss_pred             cccc-----------ccccccc-----------------------ccCCCcc----------------------------
Confidence            1000           0000000                       0000000                            


Q ss_pred             cccccccccccccccccCCCCccccccccCCCCCCCCCccCchhhhhhhhhhhhhHHHHHHhhcC--hHHHHHHHHHHhh
Q 000708          317 DDREIEHDNNRDFNLQRFPDKKKSIKKVEGFGANSSLASYDDKDALKSIYNQGFIFCDKVKEKLC--SDDYQAFLKCLHI  394 (1340)
Q Consensus       317 ~~~~~~~~~~~~~~~q~~~~k~k~~~~~~~~g~~P~~s~~~~~~a~k~~~~eElaFFDKVKk~L~--~~~Y~eFLKcLnL  394 (1340)
                                     ..++  ...   ...++         +     ..+.+++.|.+|||+++.  .++|+.||.+|++
T Consensus       112 ---------------~~~~--~~~---~~~~~---------~-----~~~~~a~~fv~klk~rf~~~~~v~~s~l~il~~  157 (231)
T KOG4204|consen  112 ---------------VAFH--EEI---LKLLE---------D-----VEFDRAISFVNKLKTRFQGDDHVYKSFLEILRM  157 (231)
T ss_pred             ---------------cccc--chh---hhhcc---------c-----chHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHH
Confidence                           0000  000   00000         0     145678999999999998  7799999999999


Q ss_pred             hhcCCCCHHHHHHHHHHHhccChHHHHHHhhhhccc
Q 000708          395 YSNGIIKRNDLQNLVTDLLGKYMDLMDEFNHFFERC  430 (1340)
Q Consensus       395 YsQeIIsr~ELv~lV~~fLG~~pDLm~~Fk~FLg~~  430 (1340)
                      |.++-.+..|+...|..+++.|+||+.+|..||...
T Consensus       158 y~~~~ks~~e~~~eV~~L~~~~~DL~~ef~~~lp~~  193 (231)
T KOG4204|consen  158 YQEGNKSVSEVYQEVEALLQGHEDLLEEFSHFLPTD  193 (231)
T ss_pred             HhhccchHHHHHHHHHHHHccCHHHHHHHHhhccCC
Confidence            999999999999999999999999999999999654


No 6  
>COG5602 SIN3 Histone deacetylase complex, SIN3 component [Chromatin structure and dynamics]
Probab=99.79  E-value=8.8e-19  Score=210.87  Aligned_cols=211  Identities=24%  Similarity=0.406  Sum_probs=143.7

Q ss_pred             CCccchHHHHHHHHHHHHHcCCChHHHHHHHHHHHHhhhccCcHHHHHHHHHHhhccChhHHHHHhhhCCCCCCCCcCCC
Q 000708          157 KKTVEFEEAITFVNKIKKRFHNDEHVYKSFLEILNMYRKEHKDINEVYSEVASLFEDHADLLEEFTRFLPDTSATSLSHN  236 (1340)
Q Consensus       157 ~~~vef~~Ai~FvnKIK~RF~~dPevYk~FLeIL~~y~~~~~si~eV~~qVa~LFk~hPDLL~EF~~FLP~~~~~~~~~~  236 (1340)
                      -+++.+.+|++||++||.+|..+|++|..||+||++|+.+.+++.+|+++|+.||+|.|+||++|+.|||.......+..
T Consensus       123 ~r~Ldv~DAlsyLe~vK~~f~~rp~iYn~FLdiMkdFKsqaiDtpgVI~RVS~LFrgYP~LIegFNtFLPsgYkie~S~~  202 (1163)
T COG5602         123 YRPLDVSDALSYLEKVKEQFSNRPEIYNNFLDIMKDFKSQAIDTPGVIERVSVLFRGYPHLIEGFNTFLPSGYKIEGSLP  202 (1163)
T ss_pred             CCCCChHHHHHHHHHHHHHHhcCHHHHHHHHHHHHHHHhcccCcHHHHHHHHHHHcCChHHHHHHhhhCCCccEEEEecC
Confidence            56788999999999999999999999999999999999999999999999999999999999999999999987654221


Q ss_pred             CccccCCcccCCCCCCCCCCccccchhhhhccccccccCCCCCCCCCCCCchhHHHHHHHHHHHHHHHHhhhhcccCCCC
Q 000708          237 IPFVRNSTQRGNERSAGIPPLRQMQMDKHRRRDRIATSHADRDLSVDRPEMDDEKLMIKMQKEQRRRAEKENRDRRNRDQ  316 (1340)
Q Consensus       237 ~~~~~~~~~~~~~r~s~~P~~~q~~~~K~~~~~r~~~s~~~~d~s~~rP~~~~~~~~~~~~keqr~~~~k~~~~r~~~~~  316 (1340)
                      .+  . . .+...+.+.-|.+.+. +..     ..+.+...++....+|.++.                           
T Consensus       203 ~~--~-~-s~l~vtTP~gp~s~~p-v~s-----s~y~a~~n~~qrts~p~lps---------------------------  245 (1163)
T COG5602         203 QP--N-G-SRLHVTTPQGPLSSPP-VQS-----SYYVAPCNHDQRTSHPTLPS---------------------------  245 (1163)
T ss_pred             CC--C-C-CeeeecCCCCCCCCCC-ccc-----ccccccccccccccCCCCcc---------------------------
Confidence            10  0 0 1111111111111100 000     00000000011111111100                           


Q ss_pred             cccccccccccccccccCCCCccccccccCCCCCCCCCccCchhhhhhhhhhhhhHHHHHHhhcC--hHHHHHHHHHHhh
Q 000708          317 DDREIEHDNNRDFNLQRFPDKKKSIKKVEGFGANSSLASYDDKDALKSIYNQGFIFCDKVKEKLC--SDDYQAFLKCLHI  394 (1340)
Q Consensus       317 ~~~~~~~~~~~~~~~q~~~~k~k~~~~~~~~g~~P~~s~~~~~~a~k~~~~eElaFFDKVKk~L~--~~~Y~eFLKcLnL  394 (1340)
                                 +  .|+-+.         .....|.    +...-....+++.++|..|||.++.  ++.|.+||.+|.-
T Consensus       246 -----------~--~Q~e~s---------~~~~~~s----d~~~~~~~d~nqAI~~vnkVK~r~~~~pe~y~~fl~~Lrt  299 (1163)
T COG5602         246 -----------D--SQPEPS---------APSHMPS----DARGKHQVDFNQAIIFVNKVKVRFQNNPEMYYDFLDSLRT  299 (1163)
T ss_pred             -----------c--cCCCCC---------CcccCCc----ccccccceehhHHHHHHHHHHHhcCCCchhHHHHHHHHHH
Confidence                       0  000000         0000111    0000111247899999999999998  9999999999999


Q ss_pred             hhcCCCCHHHHHHHHHHHhccChHHHHHHhhhhccc
Q 000708          395 YSNGIIKRNDLQNLVTDLLGKYMDLMDEFNHFFERC  430 (1340)
Q Consensus       395 YsQeIIsr~ELv~lV~~fLG~~pDLm~~Fk~FLg~~  430 (1340)
                      |..+.-+..|++..|+.++..+|||+++||.||...
T Consensus       300 yq~~qr~i~ev~~~Vt~lfa~~PdLleeFk~FLPd~  335 (1163)
T COG5602         300 YQMKQRSIQEVYARVTKLFAEAPDLLEEFKEFLPDS  335 (1163)
T ss_pred             HHhhhccHHHHHHHHHHHHhhChHHHHHHHHhCccc
Confidence            999999999999999999999999999999999754


No 7  
>PF02671 PAH:  Paired amphipathic helix repeat;  InterPro: IPR003822 This family contains the paired amphipathic helix (PAH) repeat. The family contains the eukaryotic Sin3 proteins, which have at least three PAH domains (PAH1, PAH2, and PAH3). Sin3 proteins are components of a co-repressor complex that silences transcription, playing important roles in the transition between proliferation and differentiation. Sin3 proteins are recruited to the DNA by various DNA-binding transcription factors such as the Mad family of repressors, Mnt/Rox, PLZF, MeCP2, p53, REST/NRSF, MNFbeta, Sp1, TGIF and Ume6 []. Sin3 acts as a scaffold protein that in turn recruits histone-binding proteins RbAp46/RbAp48 and histone deacetylases HDAC1/HDAC2, which deacetylate the core histones resulting in a repressed state of the chromatin []. The PAH domains are protein-protein interaction domains through which Sin3 fulfils its role as a scaffold. The PAH2 domain of Sin3 can interact with a wide range of unrelated and structurally diverse transcription factors that bind using different interaction motifs. For example, the Sin3 PAH2 domain can interact with the unrelated Mad and HBP1 factors using alternative interaction motifs that involve binding in opposite helical orientations [].; GO: 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus; PDB: 1S5Q_B 2L9S_B 1G1E_B 1S5R_B 2CR7_A 2CZY_A 2LD7_B 2RMR_A 2RMS_A 1PD7_A ....
Probab=99.35  E-value=8.3e-13  Score=111.43  Aligned_cols=47  Identities=40%  Similarity=0.795  Sum_probs=45.4

Q ss_pred             hHHHHHHHHHHHHHHhccCChHHHHHHHHHHhccChhHHhhhhhcCC
Q 000708           96 REKYDMFLEVMKDFKAQRTDTAGVIARVKDLFKGHNNLIFGFNTFLP  142 (1340)
Q Consensus        96 p~vY~eFLeIMkdfk~~~idt~gVi~rVs~LFkGhpdLI~GFN~FLP  142 (1340)
                      |++|++||+||++|+.++++..+|+++|+.||+||||||.+||.|||
T Consensus         1 p~~Y~~FL~il~~y~~~~~~~~~v~~~v~~Ll~~hpdLl~~F~~FlP   47 (47)
T PF02671_consen    1 PEVYNEFLKILNDYKKGRISRSEVIEEVSELLRGHPDLLEEFNRFLP   47 (47)
T ss_dssp             HHHHHHHHHHHHHHHCTCSCHHHHHHHHHHHTTT-HHHHHHHHHHSS
T ss_pred             ChHHHHHHHHHHHHHhcCCCHHHHHHHHHHHHccCHHHHHHHHhhCc
Confidence            89999999999999999999999999999999999999999999998


No 8  
>PF02671 PAH:  Paired amphipathic helix repeat;  InterPro: IPR003822 This family contains the paired amphipathic helix (PAH) repeat. The family contains the eukaryotic Sin3 proteins, which have at least three PAH domains (PAH1, PAH2, and PAH3). Sin3 proteins are components of a co-repressor complex that silences transcription, playing important roles in the transition between proliferation and differentiation. Sin3 proteins are recruited to the DNA by various DNA-binding transcription factors such as the Mad family of repressors, Mnt/Rox, PLZF, MeCP2, p53, REST/NRSF, MNFbeta, Sp1, TGIF and Ume6 []. Sin3 acts as a scaffold protein that in turn recruits histone-binding proteins RbAp46/RbAp48 and histone deacetylases HDAC1/HDAC2, which deacetylate the core histones resulting in a repressed state of the chromatin []. The PAH domains are protein-protein interaction domains through which Sin3 fulfils its role as a scaffold. The PAH2 domain of Sin3 can interact with a wide range of unrelated and structurally diverse transcription factors that bind using different interaction motifs. For example, the Sin3 PAH2 domain can interact with the unrelated Mad and HBP1 factors using alternative interaction motifs that involve binding in opposite helical orientations [].; GO: 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus; PDB: 1S5Q_B 2L9S_B 1G1E_B 1S5R_B 2CR7_A 2CZY_A 2LD7_B 2RMR_A 2RMS_A 1PD7_A ....
Probab=99.23  E-value=1.2e-11  Score=104.42  Aligned_cols=47  Identities=53%  Similarity=0.863  Sum_probs=45.4

Q ss_pred             hHHHHHHHHHHHHhhhccCcHHHHHHHHHHhhccChhHHHHHhhhCC
Q 000708          180 EHVYKSFLEILNMYRKEHKDINEVYSEVASLFEDHADLLEEFTRFLP  226 (1340)
Q Consensus       180 PevYk~FLeIL~~y~~~~~si~eV~~qVa~LFk~hPDLL~EF~~FLP  226 (1340)
                      |++|++||+||++|+++.++..+|+++|+.||++||||+++|+.|||
T Consensus         1 p~~Y~~FL~il~~y~~~~~~~~~v~~~v~~Ll~~hpdLl~~F~~FlP   47 (47)
T PF02671_consen    1 PEVYNEFLKILNDYKKGRISRSEVIEEVSELLRGHPDLLEEFNRFLP   47 (47)
T ss_dssp             HHHHHHHHHHHHHHHCTCSCHHHHHHHHHHHTTT-HHHHHHHHHHSS
T ss_pred             ChHHHHHHHHHHHHHhcCCCHHHHHHHHHHHHccCHHHHHHHHhhCc
Confidence            79999999999999999999999999999999999999999999998


No 9  
>PTZ00146 fibrillarin; Provisional
Probab=94.22  E-value=0.053  Score=62.89  Aligned_cols=8  Identities=38%  Similarity=0.538  Sum_probs=3.4

Q ss_pred             cccccccc
Q 000708          485 KSIQELDL  492 (1340)
Q Consensus       485 kp~~elDl  492 (1340)
                      +++..++|
T Consensus       265 ~~~e~v~L  272 (293)
T PTZ00146        265 KPKEQLTL  272 (293)
T ss_pred             ceEEEEec
Confidence            33444444


No 10 
>KOG3915 consensus Transcription regulator dachshund, contains SKI/SNO domain [Transcription]
Probab=93.07  E-value=0.068  Score=63.79  Aligned_cols=12  Identities=25%  Similarity=0.398  Sum_probs=8.8

Q ss_pred             ChHHHHHHHHHH
Q 000708          179 DEHVYKSFLEIL  190 (1340)
Q Consensus       179 dPevYk~FLeIL  190 (1340)
                      =|.+|.-||+-|
T Consensus       200 LPQafdlFLKhl  211 (641)
T KOG3915|consen  200 LPQAFDLFLKHL  211 (641)
T ss_pred             cHHHHHHHHHHH
Confidence            477888888754


No 11 
>KOG3973 consensus Uncharacterized conserved glycine-rich protein [Function unknown]
Probab=92.16  E-value=0.16  Score=59.16  Aligned_cols=7  Identities=29%  Similarity=0.558  Sum_probs=2.5

Q ss_pred             CcccCCC
Q 000708           17 PLTTAPP   23 (1340)
Q Consensus        17 ~~~~~~~   23 (1340)
                      |+--+||
T Consensus       309 ~nE~~pp  315 (465)
T KOG3973|consen  309 TNEMVPP  315 (465)
T ss_pred             cccCCCC
Confidence            3333333


No 12 
>KOG3973 consensus Uncharacterized conserved glycine-rich protein [Function unknown]
Probab=91.48  E-value=0.24  Score=57.68  Aligned_cols=6  Identities=33%  Similarity=0.578  Sum_probs=2.4

Q ss_pred             ccCCCC
Q 000708           19 TTAPPR   24 (1340)
Q Consensus        19 ~~~~~~   24 (1340)
                      .-||-|
T Consensus       300 grvPDR  305 (465)
T KOG3973|consen  300 GRVPDR  305 (465)
T ss_pred             ccCCCC
Confidence            334444


No 13 
>PLN03138 Protein TOC75; Provisional
Probab=90.09  E-value=0.31  Score=63.14  Aligned_cols=29  Identities=17%  Similarity=0.181  Sum_probs=21.0

Q ss_pred             hHHHHHHHHHHhhhhcCCCCHHHHHHHHHHH
Q 000708          382 SDDYQAFLKCLHIYSNGIIKRNDLQNLVTDL  412 (1340)
Q Consensus       382 ~~~Y~eFLKcLnLYsQeIIsr~ELv~lV~~f  412 (1340)
                      +++.+||+..+  +.++.+++++|..-+..|
T Consensus       282 ~~v~~e~f~~~--~~~gklN~e~Lq~die~I  310 (796)
T PLN03138        282 KSVRREVLGML--RDQGKVSARLLQRIRDRV  310 (796)
T ss_pred             hHHHHHHHhhh--ccCCcCCHHHHHHHHHHH
Confidence            67788888777  566678888887665543


No 14 
>PLN03138 Protein TOC75; Provisional
Probab=88.27  E-value=0.45  Score=61.66  Aligned_cols=16  Identities=13%  Similarity=0.069  Sum_probs=10.0

Q ss_pred             ccCcHHHHHHHHHHhh
Q 000708          196 EHKDINEVYSEVASLF  211 (1340)
Q Consensus       196 ~~~si~eV~~qVa~LF  211 (1340)
                      ...+..+|.+.+..|+
T Consensus       180 ~v~trsqLq~dv~~I~  195 (796)
T PLN03138        180 GVYTKAQLQKELETLA  195 (796)
T ss_pred             CccCHHHHHHHHHHHH
Confidence            4445666667676666


No 15 
>PHA00370 III attachment protein
Probab=85.78  E-value=1.6  Score=49.37  Aligned_cols=13  Identities=15%  Similarity=0.054  Sum_probs=7.1

Q ss_pred             hhhhHHHHHHHhc
Q 000708           81 ALTYLKEVKDMFQ   93 (1340)
Q Consensus        81 Al~YL~~VK~~F~   93 (1340)
                      +..--+++|..+-
T Consensus       148 ~~kma~a~kdalt  160 (297)
T PHA00370        148 YPKMANANKDALT  160 (297)
T ss_pred             cHHHhhhhhhhhc
Confidence            4444456666654


No 16 
>KOG3915 consensus Transcription regulator dachshund, contains SKI/SNO domain [Transcription]
Probab=85.56  E-value=0.62  Score=56.10  Aligned_cols=8  Identities=25%  Similarity=0.634  Sum_probs=3.0

Q ss_pred             HHhhhCCC
Q 000708          220 EFTRFLPD  227 (1340)
Q Consensus       220 EF~~FLP~  227 (1340)
                      +|..+.-|
T Consensus       260 DFe~Ly~D  267 (641)
T KOG3915|consen  260 DFETLYND  267 (641)
T ss_pred             cHHHHHhh
Confidence            34333333


No 17 
>PHA00370 III attachment protein
Probab=84.75  E-value=2.1  Score=48.52  Aligned_cols=16  Identities=31%  Similarity=0.351  Sum_probs=7.8

Q ss_pred             chHHHHHHHHHHHHHc
Q 000708          161 EFEEAITFVNKIKKRF  176 (1340)
Q Consensus       161 ef~~Ai~FvnKIK~RF  176 (1340)
                      +|.-....|+.||--|
T Consensus       257 e~~I~CdKId~~k~v~  272 (297)
T PHA00370        257 EFIIGCDKINDFKGVF  272 (297)
T ss_pred             hhhhcchhHHHHHHHH
Confidence            3444445555555443


No 18 
>COG4371 Predicted membrane protein [Function unknown]
Probab=82.13  E-value=1.7  Score=49.22  Aligned_cols=8  Identities=38%  Similarity=0.335  Sum_probs=3.1

Q ss_pred             HHHHHHHH
Q 000708          202 EVYSEVAS  209 (1340)
Q Consensus       202 eV~~qVa~  209 (1340)
                      .|.++.+-
T Consensus       181 ~vlQEA~L  188 (334)
T COG4371         181 RVLQEAAL  188 (334)
T ss_pred             HHHHHHHH
Confidence            34444333


No 19 
>PF02084 Bindin:  Bindin;  InterPro: IPR000775 Bindin, the major protein component of the acrosome granule of sea urchin sperm, mediates species-specific adhesion of sperm to the egg surface during fertilisation [, ]. The protein coats the acrosomal process after externalisation by the acrosome reaction; it binds to sulphated, fucose-containing polysaccharides on the vitelline-layer receptor proteoglycans that cover the egg plasma membrane. Bindins from different genera show high levels of sequence similarity in both the mature bindin domain and in the probindin precursor region. The most highly conserved region is a 42-residue segment in the central portion of the mature bindin protein. This domain may be responsible for conserved functions of bindin, while the more highly divergent flanking regions may be responsible for its species-specific properties [].; GO: 0007342 fusion of sperm to egg plasma membrane
Probab=79.10  E-value=21  Score=40.24  Aligned_cols=8  Identities=38%  Similarity=0.688  Sum_probs=4.0

Q ss_pred             ccChhHHh
Q 000708          128 KGHNNLIF  135 (1340)
Q Consensus       128 kGhpdLI~  135 (1340)
                      +.|.+|+.
T Consensus       138 RHHSNLLA  145 (238)
T PF02084_consen  138 RHHSNLLA  145 (238)
T ss_pred             HHHHHHHh
Confidence            44555554


No 20 
>KOG3074 consensus Transcriptional regulator of the PUR family, single-stranded-DNA-binding [Transcription]
Probab=75.21  E-value=0.86  Score=50.98  Aligned_cols=14  Identities=29%  Similarity=0.427  Sum_probs=8.7

Q ss_pred             CCccccchhhhhHH
Q 000708           73 SQKLTTSDALTYLK   86 (1340)
Q Consensus        73 ~~~l~~~dAl~YL~   86 (1340)
                      ...|.++.-.-||+
T Consensus        27 sk~l~iq~KrfYlD   40 (263)
T KOG3074|consen   27 SKRLQIQHKRFYLD   40 (263)
T ss_pred             hhhhhhhhheEEEE
Confidence            45566666666664


No 21 
>COG4371 Predicted membrane protein [Function unknown]
Probab=72.81  E-value=4.1  Score=46.24  Aligned_cols=14  Identities=14%  Similarity=0.140  Sum_probs=5.9

Q ss_pred             HHHHHHHHhccChh
Q 000708          119 VIARVKDLFKGHNN  132 (1340)
Q Consensus       119 Vi~rVs~LFkGhpd  132 (1340)
                      |++...-.+-.||+
T Consensus       182 vlQEA~LalLRHPE  195 (334)
T COG4371         182 VLQEAALALLRHPE  195 (334)
T ss_pred             HHHHHHHHHHcCCc
Confidence            34443433444554


No 22 
>cd07354 HN_L-delphilin-R1_like First harmonin_N_like domain (repeat 1) of L-delphilin, and related domains. This subgroup contains the first of two harmonin_N_like domains of an alternatively spliced longer variant of mouse delphilin (L-delphilin, isoform 1), and related domains. Delphilin is a scaffold protein which binds the glutamate receptor delta-2 (GRID2) subunit and the monocarboxylate transporter 2 at the cerebellar parallel fiber-Purkinje cell synapses. The N-terminus of L-delphilin contains this harmonin_N_like domain preceded by a postsynaptic density-95/discs-large/ZO-1 (PDZ) protein-binding domain, PDZ1. L-delphilin, in common with the shorter C-terminal isoforms (S-delphilin/delphilin alpha and delphilin beta) has a second harmonin_N_like domain (not belonging to this subgroup) and a second PDZ domain, PDZ2. This first harmonin_N_like domain is a putative protein-binding module based on its sequence similarity to the harmonin N-domain.
Probab=72.31  E-value=13  Score=35.84  Aligned_cols=68  Identities=22%  Similarity=0.409  Sum_probs=56.9

Q ss_pred             HHHHHHHHHHHHHcCCChHHHHHHHHHHHHhhhccCcHHHHHHHHHHhh--ccChhHHHHHhhhCCCCCCC
Q 000708          163 EEAITFVNKIKKRFHNDEHVYKSFLEILNMYRKEHKDINEVYSEVASLF--EDHADLLEEFTRFLPDTSAT  231 (1340)
Q Consensus       163 ~~Ai~FvnKIK~RF~~dPevYk~FLeIL~~y~~~~~si~eV~~qVa~LF--k~hPDLL~EF~~FLP~~~~~  231 (1340)
                      +.|..|..||..-+.++|..=....++|++|..+ +.++.+...+..++  ..|--|+++-..|+|..+..
T Consensus         3 ~kA~eF~~Kvd~iL~~dp~~Ke~l~~aLk~Ya~~-k~vd~l~~aL~~~L~~e~~~~Lld~IR~fIP~KHr~   72 (80)
T cd07354           3 RKAQEFSRKVDAILGDDPVKKEQVFAALKQYAAD-KNVDCLVWALCGLLQTEAHKKLLDEIRIFIPKKHRA   72 (80)
T ss_pred             hHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHH-cCHHHHHHHHHHHhCcHHHHHHHHHhHhcCCchhHH
Confidence            5699999999999999999999999999999764 45666777777777  56677999999999987654


No 23 
>PF04285 DUF444:  Protein of unknown function (DUF444);  InterPro: IPR006698 This entry is represented by Thermus phage phiYS40, Orf56. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches [].
Probab=72.19  E-value=6.2  Score=48.40  Aligned_cols=40  Identities=18%  Similarity=0.326  Sum_probs=21.7

Q ss_pred             ccchHHHHHHHHHHHHHcCCChHHHHHHHHHHHHhhhccCcHHHHHHHHHHhhc
Q 000708          159 TVEFEEAITFVNKIKKRFHNDEHVYKSFLEILNMYRKEHKDINEVYSEVASLFE  212 (1340)
Q Consensus       159 ~vef~~Ai~FvnKIK~RF~~dPevYk~FLeIL~~y~~~~~si~eV~~qVa~LFk  212 (1340)
                      +..++-..++.+.|+.|..-.              +-....+.++-.+|..|..
T Consensus       160 ~s~l~~~RTlr~al~Rria~~--------------~~~~~~~~~~~~~~~~~~~  199 (421)
T PF04285_consen  160 PSNLDKKRTLREALKRRIALG--------------RPRRRELRELEEELEELEA  199 (421)
T ss_pred             HHHHHHHHHhhHHHHHHHHhc--------------CCccccHHHHHHHHHHHhc
Confidence            345566666666666654321              1113346666667776654


No 24 
>KOG1308 consensus Hsp70-interacting protein Hip/Transient component of progesterone receptor complexes and an Hsp70-binding protein [Posttranslational modification, protein turnover, chaperones; Signal transduction mechanisms]
Probab=68.34  E-value=13  Score=44.45  Aligned_cols=35  Identities=17%  Similarity=0.385  Sum_probs=22.7

Q ss_pred             ChHHHHHHHHHHH------HHHhccCChHHHHHHHHHHhccC
Q 000708           95 QREKYDMFLEVMK------DFKAQRTDTAGVIARVKDLFKGH  130 (1340)
Q Consensus        95 ~p~vY~eFLeIMk------dfk~~~idt~gVi~rVs~LFkGh  130 (1340)
                      +|++--.|.++|+      .|.+.. -+..||..++.+|-|-
T Consensus       336 dp~v~aal~d~~~np~n~~kyq~n~-kv~~~i~kl~~kf~g~  376 (377)
T KOG1308|consen  336 DPEVQAALMDVSQNPANMMKYQNNP-KVMDVISKLSQKFPGM  376 (377)
T ss_pred             ChHHHhhhhhcccChHHHHHhccCh-HHHHHHHHHHhhcCCC
Confidence            4566666666643      444433 3468999999999874


No 25 
>PF13865 FoP_duplication:  C-terminal duplication domain of Friend of PRMT1
Probab=67.60  E-value=14  Score=34.89  Aligned_cols=27  Identities=19%  Similarity=0.430  Sum_probs=15.8

Q ss_pred             CCccccch---hh-hhHHHHHHHhcCChHHH
Q 000708           73 SQKLTTSD---AL-TYLKEVKDMFQDQREKY   99 (1340)
Q Consensus        73 ~~~l~~~d---Al-~YL~~VK~~F~d~p~vY   99 (1340)
                      .+++|.++   -| .|....+..+-.+-+-|
T Consensus        37 ~~~kT~EeLDaELD~Ym~~~~~~LD~~Ld~Y   67 (74)
T PF13865_consen   37 KPPKTAEELDAELDAYMSKTKSKLDAELDSY   67 (74)
T ss_pred             CCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            55556443   23 78877777775554444


No 26 
>cd07354 HN_L-delphilin-R1_like First harmonin_N_like domain (repeat 1) of L-delphilin, and related domains. This subgroup contains the first of two harmonin_N_like domains of an alternatively spliced longer variant of mouse delphilin (L-delphilin, isoform 1), and related domains. Delphilin is a scaffold protein which binds the glutamate receptor delta-2 (GRID2) subunit and the monocarboxylate transporter 2 at the cerebellar parallel fiber-Purkinje cell synapses. The N-terminus of L-delphilin contains this harmonin_N_like domain preceded by a postsynaptic density-95/discs-large/ZO-1 (PDZ) protein-binding domain, PDZ1. L-delphilin, in common with the shorter C-terminal isoforms (S-delphilin/delphilin alpha and delphilin beta) has a second harmonin_N_like domain (not belonging to this subgroup) and a second PDZ domain, PDZ2. This first harmonin_N_like domain is a putative protein-binding module based on its sequence similarity to the harmonin N-domain.
Probab=66.48  E-value=21  Score=34.41  Aligned_cols=68  Identities=18%  Similarity=0.360  Sum_probs=56.5

Q ss_pred             chhhhhHHHHHHHhcCChHHHHHHHHHHHHHHhccCChHHHHHHHHHHhcc--ChhHHhhhhhcCCccccc
Q 000708           79 SDALTYLKEVKDMFQDQREKYDMFLEVMKDFKAQRTDTAGVIARVKDLFKG--HNNLIFGFNTFLPKGYEI  147 (1340)
Q Consensus        79 ~dAl~YL~~VK~~F~d~p~vY~eFLeIMkdfk~~~idt~gVi~rVs~LFkG--hpdLI~GFN~FLP~gy~I  147 (1340)
                      +-|..|..+|..-+.++|.+-+..+++++.|-+.+ +++.+...+.-++..  |--||...-.|+|.-++.
T Consensus         3 ~kA~eF~~Kvd~iL~~dp~~Ke~l~~aLk~Ya~~k-~vd~l~~aL~~~L~~e~~~~Lld~IR~fIP~KHr~   72 (80)
T cd07354           3 RKAQEFSRKVDAILGDDPVKKEQVFAALKQYAADK-NVDCLVWALCGLLQTEAHKKLLDEIRIFIPKKHRA   72 (80)
T ss_pred             hHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHc-CHHHHHHHHHHHhCcHHHHHHHHHhHhcCCchhHH
Confidence            45889999999999999999999999999997754 566777777767664  567999999999986654


No 27 
>KOG1596 consensus Fibrillarin and related nucleolar RNA-binding proteins [RNA processing and modification]
Probab=65.99  E-value=11  Score=43.18  Aligned_cols=14  Identities=21%  Similarity=0.344  Sum_probs=8.0

Q ss_pred             hccccccccCCCcc
Q 000708          484 AKSIQELDLSNCQR  497 (1340)
Q Consensus       484 ~kp~~elDls~c~r  497 (1340)
                      .+|...|-|.-|++
T Consensus       288 lkP~EqvtLEP~er  301 (317)
T KOG1596|consen  288 LKPKEQVTLEPFER  301 (317)
T ss_pred             cCchheeccccccC
Confidence            35555666666654


No 28 
>PTZ00110 helicase; Provisional
Probab=57.70  E-value=23  Score=44.88  Aligned_cols=11  Identities=27%  Similarity=0.422  Sum_probs=4.7

Q ss_pred             ChhHHHHHHHH
Q 000708          728 DSNIHEDLYKL  738 (1340)
Q Consensus       728 D~~I~~D~~~L  738 (1340)
                      |..+..++.++
T Consensus       485 ~~~~~~~l~~~  495 (545)
T PTZ00110        485 KYRLARDLVKV  495 (545)
T ss_pred             hHHHHHHHHHH
Confidence            33444444444


No 29 
>COG4174 ABC-type uncharacterized transport system, permease component [General function prediction only]
Probab=56.59  E-value=22  Score=41.23  Aligned_cols=29  Identities=7%  Similarity=0.310  Sum_probs=23.3

Q ss_pred             hhHHHHHHHhcCChHHHHHHHHHHHHHHh
Q 000708           83 TYLKEVKDMFQDQREKYDMFLEVMKDFKA  111 (1340)
Q Consensus        83 ~YL~~VK~~F~d~p~vY~eFLeIMkdfk~  111 (1340)
                      .++++++.+|.=+...-+.|+..|.+|-.
T Consensus        83 e~i~~i~~~~GFDKp~~eR~~~Ml~~y~r  111 (364)
T COG4174          83 ELIAEIEKQYGFDKPPLERYFLMLWDYAR  111 (364)
T ss_pred             HHHHHHHHHhCCCCCHHHHHHHHHHHHHh
Confidence            57899999999544456899999999964


No 30 
>PTZ00009 heat shock 70 kDa protein; Provisional
Probab=54.41  E-value=11  Score=48.78  Aligned_cols=8  Identities=50%  Similarity=1.136  Sum_probs=3.1

Q ss_pred             CCCCCCCC
Q 000708           57 SGVGGGGG   64 (1340)
Q Consensus        57 ~~~~~~g~   64 (1340)
                      ||++||||
T Consensus       630 ~~~~~~~~  637 (653)
T PTZ00009        630 GGMPGGAG  637 (653)
T ss_pred             CCCCCCCC
Confidence            33444433


No 31 
>PF05642 Sporozoite_P67:  Sporozoite P67 surface antigen;  InterPro: IPR008845 This family consists of several Theileria P67 surface antigens. A stage specific surface antigen of Theileria parva, p67, is the basis for the development of an anti-sporozoite vaccine for the control of East Coast fever (ECF) in Bos taurus. The antigen has been shown to contain five distinct linear peptide sequences recognised by sporozoite-neutralising murine monoclonal antibodies [].
Probab=53.59  E-value=19  Score=45.18  Aligned_cols=11  Identities=36%  Similarity=0.748  Sum_probs=5.6

Q ss_pred             HHHHHHHHHHh
Q 000708          403 NDLQNLVTDLL  413 (1340)
Q Consensus       403 ~ELv~lV~~fL  413 (1340)
                      +||-.||..||
T Consensus       371 ee~~~lv~tli  381 (727)
T PF05642_consen  371 EELKSLVNTLI  381 (727)
T ss_pred             HHHHHHHHHHH
Confidence            34555555555


No 32 
>PF11867 DUF3387:  Domain of unknown function (DUF3387);  InterPro: IPR021810  This domain is functionally uncharacterised. This domain is found in bacteria and archaea. This presumed domain is typically between 255 to 340 amino acids in length. This domain is found associated with PF04851 from PFAM, PF04313 from PFAM. 
Probab=52.61  E-value=45  Score=39.71  Aligned_cols=99  Identities=23%  Similarity=0.297  Sum_probs=60.9

Q ss_pred             CChHHHHHHHHHHhccChh---HHhhhhhcCCccccccCCCCC------CCCCCccchHHHHHHHH-HHHHHcCCChHHH
Q 000708          114 TDTAGVIARVKDLFKGHNN---LIFGFNTFLPKGYEITLDEDE------APPKKTVEFEEAITFVN-KIKKRFHNDEHVY  183 (1340)
Q Consensus       114 idt~gVi~rVs~LFkGhpd---LI~GFN~FLP~gy~I~l~~de------~pp~~~vef~~Ai~Fvn-KIK~RF~~dPevY  183 (1340)
                      +|..++-.+|++|+..|=.   .+.-|..+=|..-.|.+-.++      .-+.+......-..+++ +|+.++..+|..|
T Consensus       117 ~~~~~~~~~i~~Lid~~I~s~~v~~i~~~~~~~~~disild~eFl~~v~~~~~k~~~~e~L~~~l~~~I~~~~~~N~~~~  196 (335)
T PF11867_consen  117 PDIKEVEEKIRQLIDESIASEGVVDIFEAAGLKKPDISILDDEFLEEVKKMKSKNLKAELLEKLLRDEIKVRMKENPVRY  196 (335)
T ss_pred             CCHHHHHHHHHHHHHHHHhcccchhHHhhcCCCCCChhhcCHHHHHHHHhccCchHHHHHHHHHHHHHHHHHHhcCHHHH
Confidence            5778888888888887631   112232221111112211110      01122223333344444 7899999999999


Q ss_pred             HHHH----HHHHHhhhccCcHHHHHHHHHHhhc
Q 000708          184 KSFL----EILNMYRKEHKDINEVYSEVASLFE  212 (1340)
Q Consensus       184 k~FL----eIL~~y~~~~~si~eV~~qVa~LFk  212 (1340)
                      ++|-    ++|..|+++.++..++++++..|-+
T Consensus       197 ~~fsErLe~iI~~Y~~~~i~~~e~~~eLi~la~  229 (335)
T PF11867_consen  197 KKFSERLEEIIEKYNNRSISSEEVIEELIKLAK  229 (335)
T ss_pred             HHHHHHHHHHHHHHHcccchHHHHHHHHHHHHH
Confidence            9888    5889999999999999999988754


No 33 
>PF07466 DUF1517:  Protein of unknown function (DUF1517);  InterPro: IPR010903 This family consists of several hypothetical glycine rich plant and bacterial proteins of around 300 residues in length. The function of this family is unknown.
Probab=51.44  E-value=23  Score=41.60  Aligned_cols=19  Identities=16%  Similarity=0.272  Sum_probs=11.1

Q ss_pred             cHHHHHHHHHHhhccChhH
Q 000708          199 DINEVYSEVASLFEDHADL  217 (1340)
Q Consensus       199 si~eV~~qVa~LFk~hPDL  217 (1340)
                      -+..|.+++.-.|--|||.
T Consensus       138 Gl~~~L~E~~l~LLR~~~~  156 (289)
T PF07466_consen  138 GLARVLQETTLALLRHPEY  156 (289)
T ss_pred             HHHHHHHHHHHHHHhCcCc
Confidence            3456666666655555554


No 34 
>PF05642 Sporozoite_P67:  Sporozoite P67 surface antigen;  InterPro: IPR008845 This family consists of several Theileria P67 surface antigens. A stage specific surface antigen of Theileria parva, p67, is the basis for the development of an anti-sporozoite vaccine for the control of East Coast fever (ECF) in Bos taurus. The antigen has been shown to contain five distinct linear peptide sequences recognised by sporozoite-neutralising murine monoclonal antibodies [].
Probab=47.36  E-value=28  Score=43.83  Aligned_cols=7  Identities=43%  Similarity=0.610  Sum_probs=3.8

Q ss_pred             HHHHhhc
Q 000708          374 DKVKEKL  380 (1340)
Q Consensus       374 DKVKk~L  380 (1340)
                      .|+|++|
T Consensus       363 ~~lk~fl  369 (727)
T PF05642_consen  363 EKLKKFL  369 (727)
T ss_pred             HHHHHHH
Confidence            3555555


No 35 
>KOG3074 consensus Transcriptional regulator of the PUR family, single-stranded-DNA-binding [Transcription]
Probab=46.23  E-value=13  Score=42.07  Aligned_cols=10  Identities=40%  Similarity=0.707  Sum_probs=5.4

Q ss_pred             HHHHHHHHHh
Q 000708          184 KSFLEILNMY  193 (1340)
Q Consensus       184 k~FLeIL~~y  193 (1340)
                      -.|+++|+.|
T Consensus        71 aef~d~l~~f   80 (263)
T KOG3074|consen   71 AEFRDILNDF   80 (263)
T ss_pred             HHHHHHHHHH
Confidence            3455555554


No 36 
>PF07655 Secretin_N_2:  Secretin N-terminal domain;  InterPro: IPR011514 This is a short domain found in bacterial type II/III secretory system proteins. The architecture of these proteins suggests that this family may be functionally analogous to IPR005644 from INTERPRO.; GO: 0009297 pilus assembly, 0019867 outer membrane
Probab=45.41  E-value=28  Score=34.47  Aligned_cols=29  Identities=14%  Similarity=0.331  Sum_probs=21.5

Q ss_pred             CCCCCCCCCCccccchhhhhHHHHHHHhc
Q 000708           65 GMTAGMGTSQKLTTSDALTYLKEVKDMFQ   93 (1340)
Q Consensus        65 g~~~g~~~~~~l~~~dAl~YL~~VK~~F~   93 (1340)
                      +.++......++.+.....|-+.|+..++
T Consensus        52 ~~s~~~~s~~~i~t~s~~dfW~~L~~~l~   80 (98)
T PF07655_consen   52 GGSGSSSSGTSITTESKSDFWEDLQKTLQ   80 (98)
T ss_pred             CccccccCceEEEEEECCchHHHHHHHHH
Confidence            33345566788899999999988887764


No 37 
>PF05387 Chorion_3:  Chorion family 3;  InterPro: IPR008449 This family consists Drosophila chorion proteins S36 and S38, which are involved in chorion membrane formation. The proteins consist of a central domain and two flanking 'arms'. The central domain contains tandemly repetitive peptides, which apparently generate a secondary structure of beta-sheet strands alternating with beta-turns, most probably, forming a twisted beta-pleated sheet or beta-barrel []. 
Probab=44.92  E-value=23  Score=40.22  Aligned_cols=17  Identities=6%  Similarity=0.186  Sum_probs=14.2

Q ss_pred             cchhhhhHHHHHHHhcC
Q 000708           78 TSDALTYLKEVKDMFQD   94 (1340)
Q Consensus        78 ~~dAl~YL~~VK~~F~d   94 (1340)
                      --|+...|..|..+++.
T Consensus        70 ~peea~r~~~~Qaqygs   86 (277)
T PF05387_consen   70 SPEEAGRAQRVQAQYGS   86 (277)
T ss_pred             ChhHhhhhccchhhhcc
Confidence            36888999999999984


No 38 
>KOG1924 consensus RhoA GTPase effector DIA/Diaphanous [Signal transduction mechanisms; Cytoskeleton]
Probab=44.72  E-value=42  Score=43.80  Aligned_cols=19  Identities=32%  Similarity=0.581  Sum_probs=13.5

Q ss_pred             ccccccchhHHHHHHHHHH
Q 000708          679 QDSKNLSTKSLVAEIKQFK  697 (1340)
Q Consensus       679 ~DkK~ls~K~Lv~EIe~i~  697 (1340)
                      .|-|-.+..-+++.|.+.+
T Consensus       968 Fd~kkysmEEFFaDi~tFr  986 (1102)
T KOG1924|consen  968 FDPKKYSMEEFFADIRTFR  986 (1102)
T ss_pred             cCcccCcHHHHHHHHHHHH
Confidence            3566677888888887764


No 39 
>PRK05325 hypothetical protein; Provisional
Probab=42.58  E-value=24  Score=43.19  Aligned_cols=19  Identities=16%  Similarity=0.347  Sum_probs=11.7

Q ss_pred             ccchHHHHHHHHHHHHHcC
Q 000708          159 TVEFEEAITFVNKIKKRFH  177 (1340)
Q Consensus       159 ~vef~~Ai~FvnKIK~RF~  177 (1340)
                      +...+--..+.+.+|.|.+
T Consensus       144 ~~~l~~~RT~r~al~Rria  162 (401)
T PRK05325        144 PSNLDKKRTLREALKRRIA  162 (401)
T ss_pred             cccchHHHHHHHHHHHHhh
Confidence            3345555667777777664


No 40 
>PRK04537 ATP-dependent RNA helicase RhlB; Provisional
Probab=42.06  E-value=43  Score=42.75  Aligned_cols=10  Identities=10%  Similarity=0.355  Sum_probs=5.0

Q ss_pred             hHHHHHHhhc
Q 000708          371 IFCDKVKEKL  380 (1340)
Q Consensus       371 aFFDKVKk~L  380 (1340)
                      +|.-+++.++
T Consensus       556 ~~~~~~~~~~  565 (572)
T PRK04537        556 SLLGRIGRRL  565 (572)
T ss_pred             HHHHHHHHHH
Confidence            4555555544


No 41 
>PRK10930 FtsH protease regulator HflK; Provisional
Probab=42.04  E-value=25  Score=43.39  Aligned_cols=16  Identities=25%  Similarity=0.343  Sum_probs=11.1

Q ss_pred             hHHHHHHHHHHhhhhc
Q 000708          382 SDDYQAFLKCLHIYSN  397 (1340)
Q Consensus       382 ~~~Y~eFLKcLnLYsQ  397 (1340)
                      +-.=..|.++|.-|.+
T Consensus       302 eGda~rF~~i~~~Y~k  317 (419)
T PRK10930        302 QGEVARFAKLLPEYKA  317 (419)
T ss_pred             hhhHHHHHHHHHHHhh
Confidence            4455678888887765


No 42 
>KOG1924 consensus RhoA GTPase effector DIA/Diaphanous [Signal transduction mechanisms; Cytoskeleton]
Probab=41.05  E-value=59  Score=42.56  Aligned_cols=16  Identities=13%  Similarity=0.713  Sum_probs=7.1

Q ss_pred             HHHHHHhhcChHHHHHHHH
Q 000708          372 FCDKVKEKLCSDDYQAFLK  390 (1340)
Q Consensus       372 FFDKVKk~L~~~~Y~eFLK  390 (1340)
                      -|+-||+.   +.|..||+
T Consensus       807 ACEE~rkS---esFs~lLe  822 (1102)
T KOG1924|consen  807 ACEELRKS---ESFSKLLE  822 (1102)
T ss_pred             HHHHHHhh---hhHHHHHH
Confidence            34555543   34444443


No 43 
>PF04285 DUF444:  Protein of unknown function (DUF444);  InterPro: IPR006698 This entry is represented by Thermus phage phiYS40, Orf56. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches [].
Probab=38.08  E-value=54  Score=40.57  Aligned_cols=6  Identities=17%  Similarity=0.539  Sum_probs=2.4

Q ss_pred             cccCCc
Q 000708           13 QFKRPL   18 (1340)
Q Consensus        13 ~~~~~~   18 (1340)
                      .++-|.
T Consensus        51 ~V~IP~   56 (421)
T PF04285_consen   51 KVSIPI   56 (421)
T ss_pred             eEeecC
Confidence            334443


No 44 
>PF12174 RST:  RCD1-SRO-TAF4 (RST) plant domain;  InterPro: IPR022003  This domain is found in many plant proteins including SROs and RCD1s; it is required for interaction with multiple plant transcription factors. 
Probab=37.64  E-value=78  Score=29.91  Aligned_cols=45  Identities=11%  Similarity=0.279  Sum_probs=38.4

Q ss_pred             hHHHHHHhhcChHHHHHHHHHHhhhhcCCCCHHHHHHHHHHHhcc
Q 000708          371 IFCDKVKEKLCSDDYQAFLKCLHIYSNGIIKRNDLQNLVTDLLGK  415 (1340)
Q Consensus       371 aFFDKVKk~L~~~~Y~eFLKcLnLYsQeIIsr~ELv~lV~~fLG~  415 (1340)
                      ..|..+++.|.++......+..+-|-+..|+|+||+..+..+.|+
T Consensus        13 ~L~~~l~~~l~~~~~~~l~~~Y~~~k~~kIsR~~fvr~lR~IVGD   57 (70)
T PF12174_consen   13 MLFSALSKHLPPSKMDLLQKHYEEFKKKKISREEFVRKLRQIVGD   57 (70)
T ss_pred             HHHHHHHHHCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHH
Confidence            367888888886667777777788899999999999999999994


No 45 
>KOG3570 consensus MAPK-activating protein DENN [Signal transduction mechanisms]
Probab=35.19  E-value=34  Score=44.82  Aligned_cols=37  Identities=16%  Similarity=0.031  Sum_probs=18.6

Q ss_pred             HHHHHHHHHHHHHhhHHHHHHHHH-hhhhhcccCcccc
Q 000708          640 KQKQEEWTKCRSDFNKVWAEIYAK-NHYKSLDHRSFYF  676 (1340)
Q Consensus       640 KQK~eEWr~aq~e~nKvWrev~eK-Ny~KSLDHq~~~F  676 (1340)
                      .+|.|+-++-|+.-|.-=.++.-| -.+..||-||+-+
T Consensus       903 ~~tqe~pt~Qqrs~~~seNQqflKev~~qvL~GqgVGw  940 (1588)
T KOG3570|consen  903 TVKREPPSPQGRSSNSSENQQFLKEVVHSVLDGQGVGW  940 (1588)
T ss_pred             cccCCCCCccccccccchhhHHHHHHHHHHhcccccce
Confidence            345566665555433222222222 2345688888764


No 46 
>KOG3875 consensus Peroxisomal biogenesis protein peroxin [Intracellular trafficking, secretion, and vesicular transport]
Probab=34.17  E-value=66  Score=38.15  Aligned_cols=12  Identities=8%  Similarity=0.254  Sum_probs=5.7

Q ss_pred             HHHHHHHhhccC
Q 000708          203 VYSEVASLFEDH  214 (1340)
Q Consensus       203 V~~qVa~LFk~h  214 (1340)
                      +|..+...|+-.
T Consensus       180 lyR~~l~~L~l~  191 (362)
T KOG3875|consen  180 LYRLLLKMLKLS  191 (362)
T ss_pred             HHHHHHHHhCCC
Confidence            445555554433


No 47 
>PF04094 DUF390:  Protein of unknown function (DUF390);  InterPro: IPR007228 This domain is found in a family of long proteins that are currently found only in rice. They have no known function. However they may be some kind of transposable element. There is a putative gypsy type transposon domain (IPR007321 from INTERPRO) towards the N terminus of the proteins.
Probab=33.59  E-value=64  Score=41.92  Aligned_cols=8  Identities=25%  Similarity=0.114  Sum_probs=3.4

Q ss_pred             ChhHHhhh
Q 000708          130 HNNLIFGF  137 (1340)
Q Consensus       130 hpdLI~GF  137 (1340)
                      ||-|..-|
T Consensus       197 ~p~~~~~~  204 (828)
T PF04094_consen  197 SPFLFVFF  204 (828)
T ss_pred             CchHhhhc
Confidence            44444433


No 48 
>TIGR02877 spore_yhbH sporulation protein YhbH. This protein family, typified by YhbH in Bacillus subtilis, is found in nearly every endospore-forming bacterium and in no other genome (but note that the trusted cutoff score is set high to exclude a single high-scoring sequence from Nitrosococcus oceani ATCC 19707, which is classified in the Gammaproteobacteria). The gene in Bacillus subtilis was shown to be in the regulon of the sporulation sigma factor, sigma-E, and its mutation was shown to create a sporulation defect.
Probab=33.01  E-value=43  Score=40.63  Aligned_cols=31  Identities=19%  Similarity=0.325  Sum_probs=16.7

Q ss_pred             HHHHHHhhhhcCCCCHHHHHHHHHHHhccChHHHHHHhhhh
Q 000708          387 AFLKCLHIYSNGIIKRNDLQNLVTDLLGKYMDLMDEFNHFF  427 (1340)
Q Consensus       387 eFLKcLnLYsQeIIsr~ELv~lV~~fLG~~pDLm~~Fk~FL  427 (1340)
                      +.|.++|+|.=.-|...          ..+..||..|+.-|
T Consensus       317 ~llp~~~~f~Y~Ei~~~----------~~~~~l~~~y~~~i  347 (371)
T TIGR02877       317 KLLEVCNLFGYGEIMPY----------GYSNTLKNKFKNEI  347 (371)
T ss_pred             HHHHhhheEEEEEecCC----------CCcchHHHHHHhhh
Confidence            46677777755444320          23456666666534


No 49 
>PF09060 L27_N:  L27_N;  InterPro: IPR015145 The L27_N domain plays a role in the biogenesis of tight junctions and in the establishment of cell polarity in epithelial cells. Each L27_N domain consists of three alpha-helices, the first two of which form an antiparallel coiled-coil. Two L27 domains come together to form a four-helical bundle with the antiparallel coiled-coils formed by the first two helices. The third helix of each domain forms another coiled-coil packing at one end of the four-helix bundle, creating a large hydrophobic interface: the hydrophobic interactions are the major force that drives heterodimer formation []. ; PDB: 1Y76_B 3UIT_B 1VF6_C.
Probab=30.55  E-value=57  Score=28.82  Aligned_cols=26  Identities=31%  Similarity=0.479  Sum_probs=18.8

Q ss_pred             HHHHHHHHHHHHHHHhchhhhHHHHHH
Q 000708         1224 LDKLLYRLCKQLQTVAADEMDNKLIQL 1250 (1340)
Q Consensus      1224 iDKLi~~lvKqlq~iv~De~~~~ll~L 1250 (1340)
                      ||.|++++ |++||+++|.-+++=++|
T Consensus         2 leell~sL-k~iqh~L~D~qSQ~Dv~l   27 (49)
T PF09060_consen    2 LEELLSSL-KHIQHCLNDSQSQQDVEL   27 (49)
T ss_dssp             HHHHHHHH-HHHHCCH-SHHHHHHHHH
T ss_pred             HHHHHHHH-HHHHHhhccHhhHHhHHH
Confidence            56777664 999999999888774444


No 50 
>KOG3304 consensus Surfeit family protein 5 [General function prediction only]
Probab=29.10  E-value=87  Score=32.94  Aligned_cols=32  Identities=25%  Similarity=0.367  Sum_probs=21.8

Q ss_pred             hhhhhHHHHHHHhcCC-hHHHHHHHHHHHHHHh
Q 000708           80 DALTYLKEVKDMFQDQ-REKYDMFLEVMKDFKA  111 (1340)
Q Consensus        80 dAl~YL~~VK~~F~d~-p~vY~eFLeIMkdfk~  111 (1340)
                      -+-..|+..+.|+.|+ ..+-+.|-+|++--|-
T Consensus        16 sk~al~k~~~~Rl~ddIkS~~dNF~~Ii~~Akv   48 (148)
T KOG3304|consen   16 SKEALLKSYNKRLKDDIKSIMDNFTEIIKTAKV   48 (148)
T ss_pred             HHHHHHHHHHHHHHHhHHHHHHhHHHHHHHHhc
Confidence            3345566666666655 5688899999887664


No 51 
>KOG4462 consensus WASP-interacting protein VRP1/WIP, contains WH2 domain [Cytoskeleton]
Probab=28.80  E-value=65  Score=38.70  Aligned_cols=13  Identities=31%  Similarity=0.580  Sum_probs=8.3

Q ss_pred             cccCCCcccCCcc
Q 000708            7 DVYGGSQFKRPLT   19 (1340)
Q Consensus         7 ~~~~~~~~~~~~~   19 (1340)
                      |+.-|..||....
T Consensus        37 DI~KG~KLKK~~t   49 (437)
T KOG4462|consen   37 DIQKGKKLKKATT   49 (437)
T ss_pred             Hhhhcceecceec
Confidence            4555777777664


No 52 
>KOG0037 consensus Ca2+-binding protein, EF-Hand protein superfamily [Signal transduction mechanisms]
Probab=28.38  E-value=62  Score=36.71  Aligned_cols=27  Identities=15%  Similarity=0.099  Sum_probs=14.5

Q ss_pred             ChHHHHHHHHHHHHHHhccCChHHHHHH
Q 000708           95 QREKYDMFLEVMKDFKAQRTDTAGVIAR  122 (1340)
Q Consensus        95 ~p~vY~eFLeIMkdfk~~~idt~gVi~r  122 (1340)
                      .|.++..|-.+=+| .+++|...++..-
T Consensus        56 ~~~~~~~f~~vD~d-~sg~i~~~eLq~a   82 (221)
T KOG0037|consen   56 FPQLAGWFQSVDRD-RSGRILAKELQQA   82 (221)
T ss_pred             cHHHHHHHHhhCcc-ccccccHHHHHHH
Confidence            45666666555333 2456666666544


No 53 
>KOG3262 consensus H/ACA small nucleolar RNP component GAR1 [Translation, ribosomal structure and biogenesis]
Probab=27.68  E-value=81  Score=34.93  Aligned_cols=17  Identities=35%  Similarity=0.585  Sum_probs=11.8

Q ss_pred             ChhHHHHHhhhCCCCCC
Q 000708          214 HADLLEEFTRFLPDTSA  230 (1340)
Q Consensus       214 hPDLL~EF~~FLP~~~~  230 (1340)
                      .||-|.=..+|||....
T Consensus       128 ~p~KllPl~RFLP~p~~  144 (215)
T KOG3262|consen  128 DPDKLLPLDRFLPQPVG  144 (215)
T ss_pred             cccccCcHhhcCCCCCC
Confidence            36666677889988733


No 54 
>COG5324 Uncharacterized conserved protein [Function unknown]
Probab=27.58  E-value=74  Score=39.79  Aligned_cols=126  Identities=19%  Similarity=0.259  Sum_probs=77.0

Q ss_pred             cchhhHHHHHHHHHHHHHHHHHHHHHHhhcccccCCCCcccccchhH------HHHHHHHhhCCCH--------HHHHHH
Q 000708          558 DDRFELDMLLESVSSTAKRAEELLNSINENKITLETPFHLKDHFSAL------NLRCIERLYGDHG--------LDVMDI  623 (1340)
Q Consensus       558 DERfElDm~IE~~~~tI~~Le~l~~ki~~~~~e~~~~~~l~~~L~~i------~~R~I~rIYGd~g--------~evie~  623 (1340)
                      |+..|||.+|.+...|-+.|..+-+.|..+-++..+    ++...+-      |.-..++++|...        .|+|++
T Consensus       523 d~~ield~~~~sl~nar~i~n~~~k~~p~lip~~p~----ddd~~~a~~~a~ey~pt~rk~~gn~~kkik~~itae~i~~  598 (758)
T COG5324         523 DDIIELDPLIGSLENARRIVNYFKKNIPELIPNDPS----DDDYAAALNYAVEYVPTYRKTFGNDSKKIKNKITAEGITG  598 (758)
T ss_pred             cceeecccccchhhhHHHHHHHHHhhCccccCCCCC----chHHHHHHhhhhhcCchhhhhhCCchHHhhhheehhhccc
Confidence            678899999999988888887777776655443332    2222222      3334567777432        244444


Q ss_pred             HH---hCcCcchhHHHHHHHHHHHHHH--HHHHHhhHHHHHHHHH-------------hhhhhcccCccccccccccccc
Q 000708          624 LR---KNPAIALPVMLTRLKQKQEEWT--KCRSDFNKVWAEIYAK-------------NHYKSLDHRSFYFKQQDSKNLS  685 (1340)
Q Consensus       624 L~---knP~~AlPVVL~RLKQK~eEWr--~aq~e~nKvWrev~eK-------------Ny~KSLDHq~~~FKq~DkK~ls  685 (1340)
                      ..   +....-.-++|.   ||.-|-.  ..-...|++|.+...+             |-.|+...|...-|..||-+++
T Consensus       599 ~~~c~~~~~~~f~~ll~---~k~~~~sl~~vlt~a~~~wq~~~~ryt~q~s~~~~~i~~sq~~~n~q~~~~k~t~km~fr  675 (758)
T COG5324         599 SSTCFKKAPRYFGVLLD---RKTVESSLVQVLTIANLQWQEAFSRYTLQDSFHITMIHESQKPVNSQHMHDKNTTKMGFR  675 (758)
T ss_pred             hHHHHHHhHHHHHHHhh---hhhHHHHHHHHHHHHHHHHHHHHhhhhhhhhhhhhhhhhccCCCccccccccccccccee
Confidence            33   333333455665   4554444  4446889999988766             3456667777777777777766


Q ss_pred             hhHHH
Q 000708          686 TKSLV  690 (1340)
Q Consensus       686 ~K~Lv  690 (1340)
                      -.+|+
T Consensus       676 i~~l~  680 (758)
T COG5324         676 ITHLV  680 (758)
T ss_pred             EEEEe
Confidence            65554


No 55 
>PF12083 DUF3560:  Domain of unknown function (DUF3560);  InterPro: IPR021944  This presumed domain is functionally uncharacterised. This domain is found in bacteria. This domain is about 120 amino acids in length. This domain has a conserved GHHSE sequence motif. 
Probab=27.01  E-value=3.5e+02  Score=28.44  Aligned_cols=65  Identities=20%  Similarity=0.263  Sum_probs=49.3

Q ss_pred             cCCCCcccccchhHHHHHHHHhhC--CCHHHHHHHHH-----------hCcCcchhHHHHHHHHHHHHHHHHHHHhhH
Q 000708          591 LETPFHLKDHFSALNLRCIERLYG--DHGLDVMDILR-----------KNPAIALPVMLTRLKQKQEEWTKCRSDFNK  655 (1340)
Q Consensus       591 ~~~~~~l~~~L~~i~~R~I~rIYG--d~g~evie~L~-----------knP~~AlPVVL~RLKQK~eEWr~aq~e~nK  655 (1340)
                      +..||.++-|+..=|+|.|+|+..  +++.+..+.-.           .--..-.|.+|.||+.+..+.+..|+.+..
T Consensus        41 ~GQPIlVGHHSE~R~Rr~~eR~~~~m~kav~~~~kA~y~~~rA~aa~~~~i~~~~Pda~~ri~~~la~~r~~q~~mk~  118 (126)
T PF12083_consen   41 FGQPILVGHHSEKRHRRYRERIHNRMGKAVEAMDKAEYYEQRAAAAGRTAISSDDPDALRRIKKKLAELRASQRRMKA  118 (126)
T ss_pred             CCCCeeccccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcchhccChHHHHHHHHHHHHHHHHHHHHHH
Confidence            478888888999999999999988  45555443322           112334688899999999999999998754


No 56 
>PF04094 DUF390:  Protein of unknown function (DUF390);  InterPro: IPR007228 This domain is found in a family of long proteins that are currently found only in rice. They have no known function. However they may be some kind of transposable element. There is a putative gypsy type transposon domain (IPR007321 from INTERPRO) towards the N terminus of the proteins.
Probab=26.77  E-value=69  Score=41.64  Aligned_cols=6  Identities=17%  Similarity=0.606  Sum_probs=2.6

Q ss_pred             Cccccc
Q 000708          142 PKGYEI  147 (1340)
Q Consensus       142 P~gy~I  147 (1340)
                      ||+..+
T Consensus       165 Ppkwtf  170 (828)
T PF04094_consen  165 PPKWTF  170 (828)
T ss_pred             CCCCcc
Confidence            444444


No 57 
>PF08963 DUF1878:  Protein of unknown function (DUF1878);  InterPro: IPR015058 This family consist of hypothetical bacterial proteins. ; PDB: 1SED_B.
Probab=26.74  E-value=76  Score=32.63  Aligned_cols=64  Identities=20%  Similarity=0.424  Sum_probs=46.3

Q ss_pred             hhhhhhhHHHHHHhhcC---hHHHHHHHHHHhhhhcCCCCHHHHHHHHHHHh--ccChHHHHHHhhhhc
Q 000708          365 IYNQGFIFCDKVKEKLC---SDDYQAFLKCLHIYSNGIIKRNDLQNLVTDLL--GKYMDLMDEFNHFFE  428 (1340)
Q Consensus       365 ~~~eElaFFDKVKk~L~---~~~Y~eFLKcLnLYsQeIIsr~ELv~lV~~fL--G~~pDLm~~Fk~FLg  428 (1340)
                      ...+-+..|+++.+.+.   .+-|-.|-.+|..|...+..+=++-.-+..++  |-+++||.+|+.++.
T Consensus        42 evee~~~lce~l~ee~~~QKAeG~v~F~pLl~~F~~~L~~~L~~~eti~Al~~Qgl~~~lM~ef~~~i~  110 (113)
T PF08963_consen   42 EVEEFLRLCEELSEELEEQKAEGFVIFDPLLAQFAGMLHEKLDVHETIEALLRQGLFQPLMTEFKKIIK  110 (113)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHHTT----HHHHHHHHHHS-TTS-HHHHHHHHHHTT-SHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHHHHHHHHhcchhhHHHHHHHHHhhcCccCCHHHHHHHHHHccCcHHHHHHHHHHHh
Confidence            44456789999999887   66788999999999887777766666677777  569999999999874


No 58 
>KOG1456 consensus Heterogeneous nuclear ribonucleoprotein L (contains RRM repeats) [RNA processing and modification]
Probab=26.53  E-value=58  Score=39.37  Aligned_cols=20  Identities=20%  Similarity=0.409  Sum_probs=10.6

Q ss_pred             hhHHhhhhhcCCccccccCC
Q 000708          131 NNLIFGFNTFLPKGYEITLD  150 (1340)
Q Consensus       131 pdLI~GFN~FLP~gy~I~l~  150 (1340)
                      -||+.+...|=|-.|-..+|
T Consensus        46 adl~eal~~fG~i~yvt~~P   65 (494)
T KOG1456|consen   46 ADLVEALSNFGPIAYVTCMP   65 (494)
T ss_pred             hHHHHHHhcCCceEEEEecc
Confidence            35555555555555555444


No 59 
>cd07347 harmonin_N_like N-terminal protein-binding module of harmonin and similar domains. This domain is found in harmonin, and similar proteins such as delphilin, and whirlin. These are postsynaptic density-95/discs-large/ZO-1 (PDZ) domain-containing scaffold proteins. Harmonin and whirlin are organizers of the Usher protein network of the inner ear and the retina, delphilin is found at the cerebellar parallel fiber-Purkinje cell synapses. This harmonin_N_like domain is found in either one or two copies. Harmonin contains a single copy, which is found at its N-terminus and binds specifically to a short internal peptide fragment of the cadherin 23 cytoplasmic domain; cadherin 23 is a component of the Usher protein network. Whirlin contains two copies of the harmonin_N_like domain; the first of these has been assayed for interaction with the cytoplasmic domain of cadherin 23 and no interaction could be detected.
Probab=26.21  E-value=1.9e+02  Score=28.06  Aligned_cols=60  Identities=15%  Similarity=0.172  Sum_probs=48.5

Q ss_pred             hhHHHHHHhhcChHHHHHHHHHHhhhhcCCCCHHHHHHHHHHHhccC--hHHHHHHhhhhccc
Q 000708          370 FIFCDKVKEKLCSDDYQAFLKCLHIYSNGIIKRNDLQNLVTDLLGKY--MDLMDEFNHFFERC  430 (1340)
Q Consensus       370 laFFDKVKk~L~~~~Y~eFLKcLnLYsQeIIsr~ELv~lV~~fLG~~--pDLm~~Fk~FLg~~  430 (1340)
                      -.|.+||...|....-+.|..+|+.|.++- +-.+||.-+..+|...  -.|+.+-..+|...
T Consensus         6 r~f~~q~~~LL~~~Er~~~~~~L~~Y~~~~-~Vd~LV~~L~~vLdtPaK~~Ll~~iR~lIp~~   67 (78)
T cd07347           6 REFSQQVDHLLTDAEREQVTRALERYHQER-NVDDLVRDLYLVLDTPAKLPLLQFLRQVIPPK   67 (78)
T ss_pred             HHHHHHHHHHCCHHHHHHHHHHHHHHHhcC-CHHHHHHHHHHHcCcHhHHHHHHHHHHHcCHH
Confidence            358999999999334778999999999999 9999999999999642  25777777777543


No 60 
>PF04228 Zn_peptidase:  Putative neutral zinc metallopeptidase;  InterPro: IPR007343 Members of this family of bacterial proteins are described as hypothetical proteins or zinc metallopeptidases. The majority have a HExxH zinc-binding motif characteristic of neutral zinc metallopeptidases, however there is no evidence to support their function as metallopeptidases.
Probab=25.49  E-value=79  Score=37.37  Aligned_cols=14  Identities=21%  Similarity=0.740  Sum_probs=10.3

Q ss_pred             HHHHHHHHHHcCCC
Q 000708          166 ITFVNKIKKRFHND  179 (1340)
Q Consensus       166 i~FvnKIK~RF~~d  179 (1340)
                      +.|++.++.+|...
T Consensus       152 ~~f~~~L~~~~ga~  165 (292)
T PF04228_consen  152 LSFFDELQQRFGAS  165 (292)
T ss_pred             hHHHHHHHHHhCCc
Confidence            35888888888654


No 61 
>PF08349 DUF1722:  Protein of unknown function (DUF1722);  InterPro: IPR013560 This domain of unknown function is found in bacteria and archaea and is homologous to the hypothetical protein ybgA from Escherichia coli. 
Probab=25.45  E-value=1.4e+02  Score=30.42  Aligned_cols=49  Identities=29%  Similarity=0.353  Sum_probs=42.5

Q ss_pred             hhHHHHHHhhcChHHHHHHHHHHhhhhcCCCCHHHHHHHHHHHhccChH
Q 000708          370 FIFCDKVKEKLCSDDYQAFLKCLHIYSNGIIKRNDLQNLVTDLLGKYMD  418 (1340)
Q Consensus       370 laFFDKVKk~L~~~~Y~eFLKcLnLYsQeIIsr~ELv~lV~~fLG~~pD  418 (1340)
                      ...|.-+|+.+..+.=++|+.+|..|.++.|+...++.+...++.++++
T Consensus        56 ~Hi~Gyfk~~ls~~EK~~~~~~i~~yr~g~i~l~~~l~~L~~~~~ry~~  104 (117)
T PF08349_consen   56 QHIFGYFKKKLSSEEKQHFLDLIEDYREGKIPLSVPLTLLKHLARRYPD  104 (117)
T ss_pred             HHHHHHHHHhCCHHHHHHHHHHHHHHHcCCccHHHHHHHHHHHHHHCCC
Confidence            4566677888886667899999999999999999999999999988873


No 62 
>KOG0775 consensus Transcription factor SIX and related HOX domain proteins [Transcription]
Probab=25.24  E-value=2.7e+02  Score=32.84  Aligned_cols=20  Identities=25%  Similarity=0.265  Sum_probs=8.6

Q ss_pred             HHHHHHHhcC--ChHHHHHHHH
Q 000708           85 LKEVKDMFQD--QREKYDMFLE  104 (1340)
Q Consensus        85 L~~VK~~F~d--~p~vY~eFLe  104 (1340)
                      +..|=+.++.  +-+..-.||-
T Consensus        66 vacvCe~L~q~Gd~erL~rFlw   87 (304)
T KOG0775|consen   66 VACVCESLQQGGDIERLGRFLW   87 (304)
T ss_pred             HHHHHHHHHhccCHHHHHHHHH
Confidence            4444444443  3444444443


No 63 
>PRK05325 hypothetical protein; Provisional
Probab=24.50  E-value=79  Score=38.94  Aligned_cols=8  Identities=13%  Similarity=0.397  Sum_probs=3.5

Q ss_pred             HHHHHhhh
Q 000708          388 FLKCLHIY  395 (1340)
Q Consensus       388 FLKcLnLY  395 (1340)
                      .|..+|+|
T Consensus       339 llp~~~~f  346 (401)
T PRK05325        339 LLPVCNYF  346 (401)
T ss_pred             HHHHhhhe
Confidence            34444444


No 64 
>KOG1456 consensus Heterogeneous nuclear ribonucleoprotein L (contains RRM repeats) [RNA processing and modification]
Probab=24.20  E-value=63  Score=39.09  Aligned_cols=27  Identities=15%  Similarity=0.223  Sum_probs=11.7

Q ss_pred             CCHHHHHHHHHHHhccChHHHHHHhhhhc
Q 000708          400 IKRNDLQNLVTDLLGKYMDLMDEFNHFFE  428 (1340)
Q Consensus       400 Isr~ELv~lV~~fLG~~pDLm~~Fk~FLg  428 (1340)
                      +++++-+.-+.+||=  ||=--.||+|-+
T Consensus       359 ~SkQ~~v~~~~pflL--pDgSpSfKdys~  385 (494)
T KOG1456|consen  359 VSKQNFVSPVQPFLL--PDGSPSFKDYSG  385 (494)
T ss_pred             eccccccccCCceec--CCCCcchhhccc
Confidence            344454444544441  222234555544


No 65 
>KOG0339 consensus ATP-dependent RNA helicase [RNA processing and modification]
Probab=23.78  E-value=97  Score=39.15  Aligned_cols=10  Identities=20%  Similarity=0.481  Sum_probs=5.6

Q ss_pred             hHHHHHHHhc
Q 000708           84 YLKEVKDMFQ   93 (1340)
Q Consensus        84 YL~~VK~~F~   93 (1340)
                      -+..+|..|+
T Consensus       659 r~~~~r~af~  668 (731)
T KOG0339|consen  659 RITAMRTAFQ  668 (731)
T ss_pred             hHHHHHHHHH
Confidence            3455666654


No 66 
>PF03564 DUF1759:  Protein of unknown function (DUF1759);  InterPro: IPR005312 This is a small family of proteins of unknown function. 
Probab=21.45  E-value=5.1e+02  Score=26.65  Aligned_cols=82  Identities=18%  Similarity=0.373  Sum_probs=56.9

Q ss_pred             hcCChHHHHHHHHHHHHHHhc--cCChHHHHHHHHHHhccCh-hHHhhhhhcCCccccccCCCCCCCCCCccchHHHHHH
Q 000708           92 FQDQREKYDMFLEVMKDFKAQ--RTDTAGVIARVKDLFKGHN-NLIFGFNTFLPKGYEITLDEDEAPPKKTVEFEEAITF  168 (1340)
Q Consensus        92 F~d~p~vY~eFLeIMkdfk~~--~idt~gVi~rVs~LFkGhp-dLI~GFN~FLP~gy~I~l~~de~pp~~~vef~~Ai~F  168 (1340)
                      |..+|..|..|.++.+..-..  .++...=...+...++|-+ ++|.++..  ++                ..|..|+. 
T Consensus         1 F~G~~~~~~~F~~~F~~~v~~n~~~~d~~K~~~L~~~L~G~A~~~i~~~~~--~~----------------~~Y~~a~~-   61 (145)
T PF03564_consen    1 FDGDPSEWPEFIDQFDSLVHENPDLSDIEKLNYLRSCLKGEAKELIRGLPL--SE----------------ENYEEAWE-   61 (145)
T ss_pred             CCCCHHHHHHHHHHHHHHHhcccCCCHHHHHHHHHHHhcchHHHHHHcccc--cc----------------hhhHHHHH-
Confidence            667899999999999998765  4555555677888888865 67776632  21                24455543 


Q ss_pred             HHHHHHHcCCChHHHHHHHHHHHHhh
Q 000708          169 VNKIKKRFHNDEHVYKSFLEILNMYR  194 (1340)
Q Consensus       169 vnKIK~RF~~dPevYk~FLeIL~~y~  194 (1340)
                        .++.||.+...+...+++-|....
T Consensus        62 --~L~~~yg~~~~i~~~~~~~l~~l~   85 (145)
T PF03564_consen   62 --LLEERYGNPRRIIQALLEELRNLP   85 (145)
T ss_pred             --HHHHHhCCchHHHHHHHHHHhccc
Confidence              567888766667777777766654


No 67 
>TIGR02877 spore_yhbH sporulation protein YhbH. This protein family, typified by YhbH in Bacillus subtilis, is found in nearly every endospore-forming bacterium and in no other genome (but note that the trusted cutoff score is set high to exclude a single high-scoring sequence from Nitrosococcus oceani ATCC 19707, which is classified in the Gammaproteobacteria). The gene in Bacillus subtilis was shown to be in the regulon of the sporulation sigma factor, sigma-E, and its mutation was shown to create a sporulation defect.
Probab=21.44  E-value=94  Score=37.90  Aligned_cols=16  Identities=13%  Similarity=0.347  Sum_probs=8.1

Q ss_pred             chHHHHHHHHHHHHHc
Q 000708          161 EFEEAITFVNKIKKRF  176 (1340)
Q Consensus       161 ef~~Ai~FvnKIK~RF  176 (1340)
                      ..+--..+.+.+|.|.
T Consensus       158 ~l~~~RT~r~al~Rri  173 (371)
T TIGR02877       158 NLDKKRTVIEALKRNQ  173 (371)
T ss_pred             cchHHHHHHHHHHHHh
Confidence            3344455555555554


No 68 
>PRK09807 hypothetical protein; Provisional
Probab=20.93  E-value=89  Score=31.88  Aligned_cols=51  Identities=37%  Similarity=0.400  Sum_probs=40.2

Q ss_pred             HHHHhcCChHHHHHHHHHHHHHHh------c------cCChHHHHHHHHHHhccChhHHhhhh
Q 000708           88 VKDMFQDQREKYDMFLEVMKDFKA------Q------RTDTAGVIARVKDLFKGHNNLIFGFN  138 (1340)
Q Consensus        88 VK~~F~d~p~vY~eFLeIMkdfk~------~------~idt~gVi~rVs~LFkGhpdLI~GFN  138 (1340)
                      |-.-|..+|-....||.+=.||-+      |      .+|+.-.|--|..||.|.|.||.---
T Consensus         2 vaalfgcqpylvqrflavdndfaailkgngqhaavdfavdiavaipvvqtlfngqpqlisqam   64 (161)
T PRK09807          2 VAALFGCQPYLVQRFLAVDNDFAAILKGNGQHAAVDFAVDIAVAIPVVQTLFNGQPQLISQAM   64 (161)
T ss_pred             chhhhcCcHHHHHHHHHhcchHHHHHcCCCceeeeeeeeeeeeehHHHHHHhCCChHHHHHhh
Confidence            345688899999999999888864      1      24666778999999999999998643


Done!