Query         psy14480
Match_columns 190
No_of_seqs    220 out of 1456
Neff          7.4 
Searched_HMMs 46136
Date          Fri Aug 16 20:54:58 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy14480.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/14480hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG2084|consensus               99.6 4.3E-15 9.3E-20  131.8  13.5  113   76-189   169-288 (482)
  2 PF00856 SET:  SET domain;  Int  99.5 4.3E-15 9.4E-20  112.1   4.7   51   97-147   109-162 (162)
  3 smart00317 SET SET (Su(var)3-9  99.3 1.3E-12 2.9E-17   94.4   3.9   46  101-146    68-116 (116)
  4 KOG2589|consensus               98.8 2.6E-09 5.6E-14   91.9   4.5   80   88-177   176-257 (453)
  5 COG2940 Proteins containing SE  98.4 1.2E-07 2.6E-12   85.9   3.1   72  105-176   404-478 (480)
  6 KOG4442|consensus               98.3 4.9E-07 1.1E-11   83.3   4.0   51  103-153   189-243 (729)
  7 KOG1080|consensus               98.3 6.1E-07 1.3E-11   86.7   3.7   46  106-151   939-987 (1005)
  8 KOG1337|consensus               97.8 1.8E-05 3.9E-10   71.6   4.5   88   74-167   197-291 (472)
  9 KOG1082|consensus               97.7 2.4E-05 5.1E-10   68.7   3.1   44  107-150   273-323 (364)
 10 KOG1141|consensus               97.3 0.00011 2.4E-09   69.3   2.3   58  107-172  1190-1254(1262)
 11 KOG1083|consensus               97.0  0.0006 1.3E-08   65.9   3.5   42  108-149  1252-1296(1306)
 12 KOG1085|consensus               96.9 0.00061 1.3E-08   57.6   2.9   44  107-150   334-380 (392)
 13 KOG1338|consensus               96.8   0.004 8.6E-08   54.9   6.8   87   62-149   162-261 (466)
 14 KOG1079|consensus               95.8  0.0094   2E-07   55.5   3.7   44  106-149   665-711 (739)
 15 KOG1081|consensus               92.7   0.043 9.2E-07   49.8   0.7   50  101-150   365-418 (463)
 16 KOG2461|consensus               76.5     2.2 4.8E-05   38.1   2.7   27  124-150   120-146 (396)
 17 TIGR02059 swm_rep_I cyanobacte  75.9     3.9 8.4E-05   29.5   3.3   25  126-150    75-99  (101)
 18 PF08666 SAF:  SAF domain;  Int  69.7     3.1 6.8E-05   26.3   1.6   17  129-145     3-19  (63)
 19 KOG2155|consensus               60.2     9.7 0.00021   34.7   3.3   61  103-163   201-266 (631)
 20 smart00858 SAF This domain fam  45.9      15 0.00032   23.0   1.6   16  129-144     3-18  (64)
 21 KOG2084|consensus               40.0    0.21 4.5E-06   44.3 -10.6   80   99-178   365-452 (482)
 22 PF12947 EGF_3:  EGF domain;  I  24.7      29 0.00064   19.9   0.3   15  110-124     4-18  (36)
 23 COG4118 Phd Antitoxin of toxin  23.6      29 0.00063   24.1   0.2   16  133-148    18-33  (84)
 24 PF06718 DUF1203:  Protein of u  23.3      75  0.0016   23.5   2.4   20  131-150     6-26  (117)
 25 COG1188 Ribosome-associated he  22.7      62  0.0014   23.3   1.8   21  130-150    43-63  (100)
 26 COG1996 RPC10 DNA-directed RNA  22.3      40 0.00087   21.1   0.6   22  169-190     8-29  (49)
 27 PRK00398 rpoP DNA-directed RNA  21.8      53  0.0011   19.6   1.1   21  169-189     5-25  (46)
 28 PF14353 CpXC:  CpXC protein     20.5 1.5E+02  0.0033   21.6   3.6   33  145-178    17-49  (128)

No 1  
>KOG2084|consensus
Probab=99.63  E-value=4.3e-15  Score=131.85  Aligned_cols=113  Identities=35%  Similarity=0.620  Sum_probs=97.0

Q ss_pred             HHHHHHHHHHHHhcccccCC----CC--CceEEEcccccccCCCCCCCceeEecCCCEEEEEEcccCCCCC-eeeeeccC
Q psy14480         76 QEEILKICGIIQVNAHEMPL----TE--PSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGD-HLSICYTD  148 (190)
Q Consensus        76 ~~~~~~~~~~~~~N~f~i~~----~~--~~~~gl~p~~s~~NHSC~PN~~~~~~~~~~~~~~a~r~I~~Ge-EltisY~~  148 (190)
                      .+.+..++..+..+++....    ..  ..+.|+||..+++||||.||+...|+ +....+++...+.+++ |++++|++
T Consensus       169 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~hsC~pn~~~~~~-~~~~~~~~~~~~~~~~~~l~~~y~~  247 (482)
T KOG2084|consen  169 ADCISKLFPSLLCNSITNASSLRVPEPLFLGRGLFPGSSLFNHSCFPNISVIFD-GRGLALLVPAGIDAGEEELTISYTD  247 (482)
T ss_pred             HHHHHHHHHHHHHhhhhhhhhhhccccccceeeecccchhcccCCCCCeEEEEC-CceeEEEeecccCCCCCEEEEeecc
Confidence            44556667777777776654    11  48999999999999999999998886 6677777888888876 99999999


Q ss_pred             CCCCHHHHHHHhccCCCeeeecCCCCCCCCCCccccccccC
Q psy14480        149 PLWGTINRRHHLQTSKYFICQCERCRDPTELNTFYDGVKCP  189 (190)
Q Consensus       149 ~~~~~~~R~~~L~~~~~F~C~C~~C~~~~e~~~~~~~~~C~  189 (190)
                      ..+++..|++.|+..|.|.|.|++|.++++.+...++++|.
T Consensus       248 ~~~~~~~r~~~l~~~~~f~c~c~rc~d~~~~~~~~~~~~c~  288 (482)
T KOG2084|consen  248 PLLSTASRQKQLRQSKLFSCQCPRCLDPTELGTFLSSLRCE  288 (482)
T ss_pred             cccCHHHHHHHHhhccceeeecCCCCCCCccccchhhhhhc
Confidence            99999999999999999999999999999999999988884


No 2  
>PF00856 SET:  SET domain;  InterPro: IPR001214 The SET domain appears generally as one part of a larger multidomain protein, and recently there were described three structures of very different proteins with distinct domain compositions: Neurospora crassa DIM-5, a member of the Su(var) family of HKMTs which methylate histone H3 on lysine 9,human SET7 (also called SET9), which methylates H3 on lysine 4 and garden pea Rubisco LSMT, an enzyme that does not modify histones, but instead methylates lysine 14 in the flexible tail of the large subunit of the enzyme Rubisco. The SET domain itself turned out to be an uncommon structure. Although in all three studies, electron density maps revealed the location of the AdoMet or AdoHcy cofactor, the SET domain bears no similarity at all to the canonical/AdoMet-dependent methyltransferase fold. Strictly conserved in the C-terminal motif of the SET domain tyrosine could be involved in abstracting a proton from the protonated amino group of the substrate lysine, promoting its nucleophilic attack on the sulphonium methyl group of the AdoMet cofactor. In contrast to the AdoMet-dependent protein methyltranferases of the classical type, which tend to bind their polypeptide substrates on top of the cofactor, it is noted from the Rubisco LSMT structure that the AdoMet seems to bind in a separate cleft, suggesting how a polypeptide substrate could be subjected to multiple rounds of methylation without having to be released from the enzyme. In contrast, SET7/9 is able to add only a single methyl group to its substrate. It has been demonstrated that association of SET domain and myotubularin-related proteins modulates growth control []. The SET domain-containing Drosophila melanogaster (Fruit fly) protein, enhancer of zeste, has a function in segment determination and the mammalian homologue may be involved in the regulation of gene transcription and chromatin structure. Histone lysine methylation is part of the histone code that regulated chromatin function and epigenetic control of gene function. Histone lysine methyltransferases (HMTase) differ both in their substrate specificity for the various acceptor lysines as well as in their product specificity for the number of methyl groups (one, two, or three) they transfer. With just one exception [], the HMTases belong to SET family that can be classified according to the sequences surrounding the SET domain [, ]. Structural studies on the human SET7/9, a mono-methylase, have revealed the molecular basis for the specificity of the enzyme for the histone-target and the roles of the invariant residues in the SET domain in determining the methylation specificities [].  The pre-SET domain, as found in the SUV39 SET family, contains nine invariant cysteine residues that are grouped into two segments separated by a region of variable length. These 9 cysteines coordinate 3 zinc ions to form to form a triangular cluster, where each of the zinc ions is coordinated by 4 four cysteines to give a tetrahedral configuration. The function of this domain is structural, holding together 2 long segments of random coils. The C-terminal region including the post-SET domain is disordered when not interacting with a histone tail and in the absence of zinc. The three conserved cysteines in the post-SET domain form a zinc-binding site when coupled to a fourth conserved cysteine in the knot-like structure close to the SET domain active site []. The structured post-SET region brings in the C-terminal residues that participate in S-adenosylmethine-binding and histone tail interactions. The three conserved cysteine residues are essential for HMTase activity, as replacement with serine abolishes HMTase activity [], []. ; GO: 0005515 protein binding; PDB: 3TG5_A 3S7F_A 3RIB_B 3TG4_A 3S7J_A 3S7D_A 3S7B_A 3H6L_A 3SMT_A 3K5K_A ....
Probab=99.55  E-value=4.3e-15  Score=112.13  Aligned_cols=51  Identities=33%  Similarity=0.614  Sum_probs=45.4

Q ss_pred             CCceEEEcccccccCCCCCCCceeEec---CCCEEEEEEcccCCCCCeeeeecc
Q psy14480         97 EPSYIAIFDRASFIEHNCYPNLYKSFT---DSGQVLLRAMKPIAPGDHLSICYT  147 (190)
Q Consensus        97 ~~~~~gl~p~~s~~NHSC~PN~~~~~~---~~~~~~~~a~r~I~~GeEltisY~  147 (190)
                      +..+.+|||.++++||||.|||.+.++   .+..+.++|.|+|++||||+|+|+
T Consensus       109 ~~~~~~l~p~~d~~NHsc~pn~~~~~~~~~~~~~~~~~a~r~I~~GeEi~isYG  162 (162)
T PF00856_consen  109 DRDGIALYPFADMLNHSCDPNCEVSFDFDGDGGCLVVRATRDIKKGEEIFISYG  162 (162)
T ss_dssp             EEEEEEEETGGGGSEEESSTSEEEEEEEETTTTEEEEEESS-B-TTSBEEEEST
T ss_pred             cccccccCcHhHheccccccccceeeEeecccceEEEEECCccCCCCEEEEEEC
Confidence            356799999999999999999999996   478999999999999999999997


No 3  
>smart00317 SET SET (Su(var)3-9, Enhancer-of-zeste, Trithorax) domain. Putative methyl transferase, based on outlier plant homologues
Probab=99.32  E-value=1.3e-12  Score=94.37  Aligned_cols=46  Identities=35%  Similarity=0.636  Sum_probs=40.2

Q ss_pred             EEEcccccccCCCCCCCceeEecC-CC--EEEEEEcccCCCCCeeeeec
Q psy14480        101 IAIFDRASFIEHNCYPNLYKSFTD-SG--QVLLRAMKPIAPGDHLSICY  146 (190)
Q Consensus       101 ~gl~p~~s~~NHSC~PN~~~~~~~-~~--~~~~~a~r~I~~GeEltisY  146 (190)
                      ..++|.++++||||.||+.+.+.. ++  .+.++|+|+|++|||||++|
T Consensus        68 ~~~~~~~~~iNHsc~pN~~~~~~~~~~~~~~~~~a~r~I~~GeEi~i~Y  116 (116)
T smart00317       68 RRKGNIARFINHSCEPNCELLFVEVNGDSRIVIFALRDIKPGEELTIDY  116 (116)
T ss_pred             CccCcHHHeeCCCCCCCEEEEEEEECCCcEEEEEECCCcCCCCEEeecC
Confidence            447899999999999999988742 22  69999999999999999998


No 4  
>KOG2589|consensus
Probab=98.85  E-value=2.6e-09  Score=91.91  Aligned_cols=80  Identities=21%  Similarity=0.462  Sum_probs=60.5

Q ss_pred             hcccccCCCCC-ceEEEccc-ccccCCCCCCCceeEecCCCEEEEEEcccCCCCCeeeeeccCCCCCHHHHHHHhccCCC
Q psy14480         88 VNAHEMPLTEP-SYIAIFDR-ASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKY  165 (190)
Q Consensus        88 ~N~f~i~~~~~-~~~gl~p~-~s~~NHSC~PN~~~~~~~~~~~~~~a~r~I~~GeEltisY~~~~~~~~~R~~~L~~~~~  165 (190)
                      -|-|.|..+.+ .-..||.. ++++||+|.|||.++-.++.++.|+++|||++|||||.=|+..+.+.          ..
T Consensus       176 ~nDFSvmyStRk~caqLwLGPaafINHDCrpnCkFvs~g~~tacvkvlRDIePGeEITcFYgs~fFG~----------~N  245 (453)
T KOG2589|consen  176 GNDFSVMYSTRKRCAQLWLGPAAFINHDCRPNCKFVSTGRDTACVKVLRDIEPGEEITCFYGSGFFGE----------NN  245 (453)
T ss_pred             CCceeeeeecccchhhheeccHHhhcCCCCCCceeecCCCceeeeehhhcCCCCceeEEeecccccCC----------CC
Confidence            45566554222 22455543 67889999999998886558999999999999999999999887763          23


Q ss_pred             eeeecCCCCCCC
Q psy14480        166 FICQCERCRDPT  177 (190)
Q Consensus       166 F~C~C~~C~~~~  177 (190)
                      -.|.|.-|....
T Consensus       246 ~~CeC~TCER~g  257 (453)
T KOG2589|consen  246 EECECVTCERRG  257 (453)
T ss_pred             ceeEEeeccccc
Confidence            489999997653


No 5  
>COG2940 Proteins containing SET domain [General function prediction only]
Probab=98.44  E-value=1.2e-07  Score=85.89  Aligned_cols=72  Identities=26%  Similarity=0.407  Sum_probs=59.3

Q ss_pred             ccccccCCCCCCCceeEecCC---CEEEEEEcccCCCCCeeeeeccCCCCCHHHHHHHhccCCCeeeecCCCCCC
Q psy14480        105 DRASFIEHNCYPNLYKSFTDS---GQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCERCRDP  176 (190)
Q Consensus       105 p~~s~~NHSC~PN~~~~~~~~---~~~~~~a~r~I~~GeEltisY~~~~~~~~~R~~~L~~~~~F~C~C~~C~~~  176 (190)
                      ..+.++||||.||+.......   ..+.++|++||++||||++.|....+....+...+...++..|.|.+|...
T Consensus       404 ~~~r~~nHS~~pN~~~~~~~~~g~~~~~~~~~rDI~~geEl~~dy~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  478 (480)
T COG2940         404 DVARFINHSCTPNCEASPIEVNGIFKISIYAIRDIKAGEELTYDYGPSLEDNRELKKLLEKRWGCACGEDRCSHT  478 (480)
T ss_pred             cccceeecCCCCCcceecccccccceeeecccccchhhhhhccccccccccchhhhhhhhhhhccccCCCccCCC
Confidence            456689999999999887432   388999999999999999999988775554556677788999999999754


No 6  
>KOG4442|consensus
Probab=98.31  E-value=4.9e-07  Score=83.35  Aligned_cols=51  Identities=29%  Similarity=0.452  Sum_probs=38.9

Q ss_pred             Eccccc-ccCCCCCCCceeE-e--cCCCEEEEEEcccCCCCCeeeeeccCCCCCH
Q psy14480        103 IFDRAS-FIEHNCYPNLYKS-F--TDSGQVLLRAMKPIAPGDHLSICYTDPLWGT  153 (190)
Q Consensus       103 l~p~~s-~~NHSC~PN~~~~-~--~~~~~~~~~a~r~I~~GeEltisY~~~~~~~  153 (190)
                      .+-..| ++||||+|||..- |  .+..+|-+.|.++|++|||||+.|--..++.
T Consensus       189 ~KGnlaRFiNHSC~PNa~~~KWtV~~~lRvGiFakk~I~~GEEITFDYqf~rYGr  243 (729)
T KOG4442|consen  189 KKGNLARFINHSCDPNAEVQKWTVPDELRVGIFAKKVIKPGEEITFDYQFDRYGR  243 (729)
T ss_pred             ccCcHHHhhcCCCCCCceeeeeeeCCeeEEEEeEecccCCCceeeEecccccccc
Confidence            343444 4699999999553 3  2236888999999999999999999776654


No 7  
>KOG1080|consensus
Probab=98.26  E-value=6.1e-07  Score=86.71  Aligned_cols=46  Identities=30%  Similarity=0.566  Sum_probs=38.0

Q ss_pred             cccccCCCCCCCceeEe---cCCCEEEEEEcccCCCCCeeeeeccCCCC
Q psy14480        106 RASFIEHNCYPNLYKSF---TDSGQVLLRAMKPIAPGDHLSICYTDPLW  151 (190)
Q Consensus       106 ~~s~~NHSC~PN~~~~~---~~~~~~~~~a~r~I~~GeEltisY~~~~~  151 (190)
                      .|.++||||.|||..-.   ++..+++|+|.|+|++|||||..|-....
T Consensus       939 iAr~InHsC~PNCyakvi~V~g~~~IvIyakr~I~~~EElTYDYkF~~e  987 (1005)
T KOG1080|consen  939 IARFINHSCNPNCYAKVITVEGDKRIVIYSKRDIAAGEELTYDYKFPTE  987 (1005)
T ss_pred             hhheeecccCCCceeeEEEecCeeEEEEEEecccccCceeeeecccccc
Confidence            47789999999995432   33479999999999999999999986654


No 8  
>KOG1337|consensus
Probab=97.83  E-value=1.8e-05  Score=71.59  Aligned_cols=88  Identities=17%  Similarity=0.111  Sum_probs=69.1

Q ss_pred             CCHHHHHHHHHHHHhcccccCCC-------CCceEEEcccccccCCCCCCCceeEecCCCEEEEEEcccCCCCCeeeeec
Q psy14480         74 FTQEEILKICGIIQVNAHEMPLT-------EPSYIAIFDRASFIEHNCYPNLYKSFTDSGQVLLRAMKPIAPGDHLSICY  146 (190)
Q Consensus        74 ~~~~~~~~~~~~~~~N~f~i~~~-------~~~~~gl~p~~s~~NHSC~PN~~~~~~~~~~~~~~a~r~I~~GeEltisY  146 (190)
                      ++.+.+...++++.+.+|.....       ...-.++.|...+.||+|.+........+..+.+++.++|++||||+|+|
T Consensus       197 ~~~~~~~w~~~~~~sr~~~~~~~~~~~~~~~~~~~~L~P~~D~~NH~~~~~~~~~~~~d~~~~l~~~~~v~~geevfi~Y  276 (472)
T KOG1337|consen  197 FTFSAFKWAYSIVNSRAFYLPSLQRLTAGDPDDNEALAPLIDLLNHSPEVIKAGYNQEDEAVELVAERDVSAGEEVFINY  276 (472)
T ss_pred             cchHHHHHHHHHHhhhhhccccccccccCCCCcchhhhhhHHhhccCchhccccccCCCCcEEEEEeeeecCCCeEEEec
Confidence            56677889999999999987642       13447899999999999999444444334589999999999999999999


Q ss_pred             cCCCCCHHHHHHHhccCCCee
Q psy14480        147 TDPLWGTINRRHHLQTSKYFI  167 (190)
Q Consensus       147 ~~~~~~~~~R~~~L~~~~~F~  167 (190)
                      ++...      ..|...|||.
T Consensus       277 G~~~N------~eLL~~YGFv  291 (472)
T KOG1337|consen  277 GPKSN------AELLLHYGFV  291 (472)
T ss_pred             CCCch------HHHHHhcCCC
Confidence            98432      2566689996


No 9  
>KOG1082|consensus
Probab=97.71  E-value=2.4e-05  Score=68.69  Aligned_cols=44  Identities=34%  Similarity=0.633  Sum_probs=36.0

Q ss_pred             ccccCCCCCCCceeEec--CC-----CEEEEEEcccCCCCCeeeeeccCCC
Q psy14480        107 ASFIEHNCYPNLYKSFT--DS-----GQVLLRAMKPIAPGDHLSICYTDPL  150 (190)
Q Consensus       107 ~s~~NHSC~PN~~~~~~--~~-----~~~~~~a~r~I~~GeEltisY~~~~  150 (190)
                      +-++||||.||+.+...  ++     ..+.+.|+++|++|+|+|..|+...
T Consensus       273 ~RfinHSC~PN~~~~~v~~~~~~~~~~~i~ffa~~~I~p~~ELT~dYg~~~  323 (364)
T KOG1082|consen  273 ARFINHSCSPNLLYQAVFQDEFVLLYLRIGFFALRDISPGEELTLDYGKAY  323 (364)
T ss_pred             cccccCCCCccceeeeeeecCCccchheeeeeeccccCCCcccchhhcccc
Confidence            44679999999987652  11     4778999999999999999999764


No 10 
>KOG1141|consensus
Probab=97.32  E-value=0.00011  Score=69.30  Aligned_cols=58  Identities=28%  Similarity=0.461  Sum_probs=42.7

Q ss_pred             ccccCCCCCCCceeE--ecC--C---CEEEEEEcccCCCCCeeeeeccCCCCCHHHHHHHhccCCCeeeecCC
Q psy14480        107 ASFIEHNCYPNLYKS--FTD--S---GQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTSKYFICQCER  172 (190)
Q Consensus       107 ~s~~NHSC~PN~~~~--~~~--~---~~~~~~a~r~I~~GeEltisY~~~~~~~~~R~~~L~~~~~F~C~C~~  172 (190)
                      +-++||||.||..+.  |.+  +   .-+.+.|.+-|++|+|||..|....-...        ..-..|.|..
T Consensus      1190 GRfLNHSC~PNl~VQnVfvdTHdlrfPwVAFFt~kyVkAgtELTWDY~Ye~g~v~--------~keL~C~CGa 1254 (1262)
T KOG1141|consen 1190 GRFLNHSCDPNLHVQNVFVDTHDLRFPWVAFFTRKYVKAGTELTWDYQYEQGQVA--------TKELTCHCGA 1254 (1262)
T ss_pred             hhhhccCCCccceeeeeeeeccccCCchhhhhhhhhhccCceeeeeccccccccc--------cceEEEecCh
Confidence            557899999998653  421  1   46678899999999999999997654332        3456888864


No 11 
>KOG1083|consensus
Probab=96.98  E-value=0.0006  Score=65.90  Aligned_cols=42  Identities=29%  Similarity=0.502  Sum_probs=33.7

Q ss_pred             cccCCCCCCCceeE-ec-CC-CEEEEEEcccCCCCCeeeeeccCC
Q psy14480        108 SFIEHNCYPNLYKS-FT-DS-GQVLLRAMKPIAPGDHLSICYTDP  149 (190)
Q Consensus       108 s~~NHSC~PN~~~~-~~-~~-~~~~~~a~r~I~~GeEltisY~~~  149 (190)
                      -++||||.|||... |. +| .++.|.|.|||.+|||||..|-..
T Consensus      1252 RfinhscKPNc~~qkwSVNG~~Rv~L~A~rDi~kGEELtYDYN~k 1296 (1306)
T KOG1083|consen 1252 RFINHSCKPNCEMQKWSVNGEYRVGLFALRDLPKGEELTYDYNFK 1296 (1306)
T ss_pred             cccccccCCCCccccccccceeeeeeeecCCCCCCceEEEecccc
Confidence            34599999999543 32 23 689999999999999999998754


No 12 
>KOG1085|consensus
Probab=96.95  E-value=0.00061  Score=57.64  Aligned_cols=44  Identities=30%  Similarity=0.388  Sum_probs=37.0

Q ss_pred             ccccCCCCCCCceeE---ecCCCEEEEEEcccCCCCCeeeeeccCCC
Q psy14480        107 ASFIEHNCYPNLYKS---FTDSGQVLLRAMKPIAPGDHLSICYTDPL  150 (190)
Q Consensus       107 ~s~~NHSC~PN~~~~---~~~~~~~~~~a~r~I~~GeEltisY~~~~  150 (190)
                      .-++|||=.+|+.--   .+....+++.|.+||++||||+..|+|-.
T Consensus       334 GRLINHS~~gNl~TKvv~Idg~pHLiLvA~rdIa~GEELlYDYGDRS  380 (392)
T KOG1085|consen  334 GRLINHSVRGNLKTKVVEIDGSPHLILVARRDIAQGEELLYDYGDRS  380 (392)
T ss_pred             hhhhcccccCcceeeEEEecCCceEEEEeccccccchhhhhhccccc
Confidence            567899999998432   34448999999999999999999999865


No 13 
>KOG1338|consensus
Probab=96.80  E-value=0.004  Score=54.85  Aligned_cols=87  Identities=13%  Similarity=0.090  Sum_probs=69.5

Q ss_pred             HHHHHhhccCCCCCHHHHHHHHHHHHhcccccCC------------CCCceEEEcccccccCCCCC-CCceeEecCCCEE
Q psy14480         62 QFLLNFFKLNEEFTQEEILKICGIIQVNAHEMPL------------TEPSYIAIFDRASFIEHNCY-PNLYKSFTDSGQV  128 (190)
Q Consensus        62 ~~l~~~~~~~~~~~~~~~~~~~~~~~~N~f~i~~------------~~~~~~gl~p~~s~~NHSC~-PN~~~~~~~~~~~  128 (190)
                      .|...+...-..++.|.++.+.+++..-+|.+..            .+..|-..-|.+.++||.=. -|+...++ +..+
T Consensus       162 pf~~~~p~vfs~~slEdF~y~~Al~laysfdve~~~s~~~~eee~e~e~ngk~m~p~ad~lNhd~~k~nanl~y~-~NcL  240 (466)
T KOG1338|consen  162 PFKQHCPIVFSRPSLEDFMYAYALGLAYSFDVEFLLSLDNLEEESEIECNGKLMTPIADFLNHDGLKANANLRYE-DNCL  240 (466)
T ss_pred             HHHHhCcchhcccCHHHHHHHHHHHHHHheeeehhcchhhhhhhhccccCcccccchhhhhccchhhcccceecc-Ccce
Confidence            3444444333347899999999999999999864            23678889999999999876 67777775 6788


Q ss_pred             EEEEcccCCCCCeeeeeccCC
Q psy14480        129 LLRAMKPIAPGDHLSICYTDP  149 (190)
Q Consensus       129 ~~~a~r~I~~GeEltisY~~~  149 (190)
                      .++|.|+|++|+|+.-+|+-.
T Consensus       241 ~mva~r~iekgdev~n~dg~~  261 (466)
T KOG1338|consen  241 EMVADRNIEKGDEVDNSDGLK  261 (466)
T ss_pred             eeeecCCCCCccccccccccC
Confidence            999999999999999999843


No 14 
>KOG1079|consensus
Probab=95.77  E-value=0.0094  Score=55.52  Aligned_cols=44  Identities=23%  Similarity=0.408  Sum_probs=35.4

Q ss_pred             cccccCCCCCCCceeE--ec-CCCEEEEEEcccCCCCCeeeeeccCC
Q psy14480        106 RASFIEHNCYPNLYKS--FT-DSGQVLLRAMKPIAPGDHLSICYTDP  149 (190)
Q Consensus       106 ~~s~~NHSC~PN~~~~--~~-~~~~~~~~a~r~I~~GeEltisY~~~  149 (190)
                      .+-+.|||=.|||..-  +. +..++-++|.|.|.+|||||+.|...
T Consensus       665 k~rFANHS~nPNCYAkvm~V~GdhRIGifAkRaIeagEELffDYrYs  711 (739)
T KOG1079|consen  665 KIRFANHSFNPNCYAKVMMVAGDHRIGIFAKRAIEAGEELFFDYRYS  711 (739)
T ss_pred             hhhhccCCCCCCcEEEEEEecCCcceeeeehhhcccCceeeeeeccC
Confidence            4556799999998543  22 33788999999999999999999854


No 15 
>KOG1081|consensus
Probab=92.70  E-value=0.043  Score=49.85  Aligned_cols=50  Identities=24%  Similarity=0.310  Sum_probs=38.6

Q ss_pred             EEEcccccc-cCCCCCCCceeEe---cCCCEEEEEEcccCCCCCeeeeeccCCC
Q psy14480        101 IAIFDRASF-IEHNCYPNLYKSF---TDSGQVLLRAMKPIAPGDHLSICYTDPL  150 (190)
Q Consensus       101 ~gl~p~~s~-~NHSC~PN~~~~~---~~~~~~~~~a~r~I~~GeEltisY~~~~  150 (190)
                      .+-+...+. .||||.||+.-.-   .....+.+.|.+.|++|+|+|.+|....
T Consensus       365 ~~~~~n~sr~~nh~~~~~v~~~k~~~~~~t~~~~~a~~~i~~g~e~t~~~n~~~  418 (463)
T KOG1081|consen  365 AGPKGNYSRFLNHSCQPNVETEKWQVIGDTRVGLFAPRQIEAGEELTFNYNGNC  418 (463)
T ss_pred             cccccchhhhhcccCCCceeechhheecccccccccccccccchhhhheeeccc
Confidence            455566655 5999999986542   2246788999999999999999999654


No 16 
>KOG2461|consensus
Probab=76.52  E-value=2.2  Score=38.11  Aligned_cols=27  Identities=19%  Similarity=0.453  Sum_probs=24.3

Q ss_pred             CCCEEEEEEcccCCCCCeeeeeccCCC
Q psy14480        124 DSGQVLLRAMKPIAPGDHLSICYTDPL  150 (190)
Q Consensus       124 ~~~~~~~~a~r~I~~GeEltisY~~~~  150 (190)
                      .+..+..+|+|+|++||||.+=|++..
T Consensus       120 ~~~~Ifyrt~r~I~p~eELlVWY~~e~  146 (396)
T KOG2461|consen  120 IGENIFYRTIRDIRPNEELLVWYGSEY  146 (396)
T ss_pred             ccCceEEEecccCCCCCeEEEEeccch
Confidence            467899999999999999999999764


No 17 
>TIGR02059 swm_rep_I cyanobacterial long protein repeat. This domain appears in 29 copies in a large (10000 amino protein in Synechococcus sp. WH8102 associated with a novel flagellar system, as one of three different repeats. Similar domains are found in two different large (<3500) proteins of Synechocystis PCC6803.
Probab=75.88  E-value=3.9  Score=29.46  Aligned_cols=25  Identities=20%  Similarity=0.406  Sum_probs=22.3

Q ss_pred             CEEEEEEcccCCCCCeeeeeccCCC
Q psy14480        126 GQVLLRAMKPIAPGDHLSICYTDPL  150 (190)
Q Consensus       126 ~~~~~~a~r~I~~GeEltisY~~~~  150 (190)
                      ..+.+.-.++|+.|++++++|.++.
T Consensus        75 ktVTLTL~~~V~~Gq~VTVsYt~ps   99 (101)
T TIGR02059        75 TTITLTLAQVVEDGDEVTLSYTKNS   99 (101)
T ss_pred             cEEEEEecccccCCCEEEEEeeCCC
Confidence            4888988999999999999998764


No 18 
>PF08666 SAF:  SAF domain;  InterPro: IPR013974  This entry includes a range of different proteins, such as antifreeze proteins, flagellar FlgA proteins, and CpaB pilus proteins. ; PDB: 1C89_A 3NLA_A 3RDN_A 1C8A_A 3FRN_A 1WVO_A 3K3S_H 3G8R_B 1XUU_A 1XUZ_A ....
Probab=69.66  E-value=3.1  Score=26.34  Aligned_cols=17  Identities=18%  Similarity=0.296  Sum_probs=12.2

Q ss_pred             EEEEcccCCCCCeeeee
Q psy14480        129 LLRAMKPIAPGDHLSIC  145 (190)
Q Consensus       129 ~~~a~r~I~~GeEltis  145 (190)
                      +++|.++|++|+.|+-.
T Consensus         3 vvVA~~di~~G~~i~~~   19 (63)
T PF08666_consen    3 VVVAARDIPAGTVITAE   19 (63)
T ss_dssp             EEEESSTB-TT-BECTT
T ss_pred             EEEEeCccCCCCEEccC
Confidence            57899999999998543


No 19 
>KOG2155|consensus
Probab=60.23  E-value=9.7  Score=34.68  Aligned_cols=61  Identities=18%  Similarity=0.248  Sum_probs=46.0

Q ss_pred             EcccccccCCCCCCCceeE---ecCC--CEEEEEEcccCCCCCeeeeeccCCCCCHHHHHHHhccC
Q psy14480        103 IFDRASFIEHNCYPNLYKS---FTDS--GQVLLRAMKPIAPGDHLSICYTDPLWGTINRRHHLQTS  163 (190)
Q Consensus       103 l~p~~s~~NHSC~PN~~~~---~~~~--~~~~~~a~r~I~~GeEltisY~~~~~~~~~R~~~L~~~  163 (190)
                      +.-.+|++.||=.||..+.   |-+.  .--.+..++++..|||+|-.+.........|...|+-.
T Consensus       201 MDefGsrvrHsdePnf~~aPf~fmPq~vaYsimwp~k~~~tgeE~trDfasg~~~p~~Rk~~l~pW  266 (631)
T KOG2155|consen  201 MDEFGSRVRHSDEPNFRIAPFMFMPQNVAYSIMWPTKPVNTGEEITRDFASGVIHPEWRKYILQPW  266 (631)
T ss_pred             HhhhhhhhccCCCCcceeeeheecchhcceeEEeeccCCCCchHHHHHHhhcCCCHHHHHHHhccc
Confidence            3366899999999998664   3222  23457789999999999999887776677787777643


No 20 
>smart00858 SAF This domain family includes a range of different proteins. Such as antifreeze proteins and flagellar FlgA proteins, and CpaB pilus proteins.
Probab=45.90  E-value=15  Score=23.03  Aligned_cols=16  Identities=13%  Similarity=0.407  Sum_probs=13.8

Q ss_pred             EEEEcccCCCCCeeee
Q psy14480        129 LLRAMKPIAPGDHLSI  144 (190)
Q Consensus       129 ~~~a~r~I~~GeElti  144 (190)
                      .++|.++|++|+.|+-
T Consensus         3 v~va~~~i~~G~~i~~   18 (64)
T smart00858        3 VVVAARDLPAGEVITA   18 (64)
T ss_pred             EEEEeCccCCCCCcch
Confidence            5778899999999875


No 21 
>KOG2084|consensus
Probab=40.03  E-value=0.21  Score=44.28  Aligned_cols=80  Identities=25%  Similarity=0.401  Sum_probs=62.5

Q ss_pred             ceEEEcccccccC--CCCCCCceeEecCC---CEEEEEEcccCCCCCeeeeeccCCCCCHHH---HHHHhccCCCeeeec
Q psy14480         99 SYIAIFDRASFIE--HNCYPNLYKSFTDS---GQVLLRAMKPIAPGDHLSICYTDPLWGTIN---RRHHLQTSKYFICQC  170 (190)
Q Consensus        99 ~~~gl~p~~s~~N--HSC~PN~~~~~~~~---~~~~~~a~r~I~~GeEltisY~~~~~~~~~---R~~~L~~~~~F~C~C  170 (190)
                      ...++++..+.++  |+|.||........   ....+.+.+++.+|.+.++.|....++...   +...+...+.+.|.|
T Consensus       365 ~~~~~~il~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  444 (482)
T KOG2084|consen  365 KLHLLFILGSLLGAFLSCSPNAELERLLNLFECRELLKALRDVKPGEEPLIAYLDYELGKLARELREKVLAEDALKDCKC  444 (482)
T ss_pred             HHHHHHHHHHHHhhhhccchhhHHHHHHHhhhhhHHHHHHHhhcchhhhhhHHHHHHHHHHHHHHHHHHhhhcCcchhHH
Confidence            4567788888888  99999998766312   256788899999999999999988765443   445677789999999


Q ss_pred             CCCCCCCC
Q psy14480        171 ERCRDPTE  178 (190)
Q Consensus       171 ~~C~~~~e  178 (190)
                      ..|.....
T Consensus       445 ~~~~~~~~  452 (482)
T KOG2084|consen  445 IMCLARAE  452 (482)
T ss_pred             HHHHHHHH
Confidence            99976543


No 22 
>PF12947 EGF_3:  EGF domain;  InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=24.71  E-value=29  Score=19.93  Aligned_cols=15  Identities=27%  Similarity=0.288  Sum_probs=10.2

Q ss_pred             cCCCCCCCceeEecC
Q psy14480        110 IEHNCYPNLYKSFTD  124 (190)
Q Consensus       110 ~NHSC~PN~~~~~~~  124 (190)
                      -||.|.||+.=...+
T Consensus         4 ~~~~C~~nA~C~~~~   18 (36)
T PF12947_consen    4 NNGGCHPNATCTNTG   18 (36)
T ss_dssp             GGGGS-TTCEEEE-T
T ss_pred             CCCCCCCCcEeecCC
Confidence            489999999877653


No 23 
>COG4118 Phd Antitoxin of toxin-antitoxin stability system [Cell division and chromosome partitioning]
Probab=23.63  E-value=29  Score=24.08  Aligned_cols=16  Identities=13%  Similarity=0.385  Sum_probs=12.4

Q ss_pred             cccCCCCCeeeeeccC
Q psy14480        133 MKPIAPGDHLSICYTD  148 (190)
Q Consensus       133 ~r~I~~GeEltisY~~  148 (190)
                      ++.+++||||.|+=-.
T Consensus        18 L~rV~aGEev~IT~~G   33 (84)
T COG4118          18 LRRVRAGEEVIITKRG   33 (84)
T ss_pred             HHHHhCCCEEEEeeCC
Confidence            3568999999998654


No 24 
>PF06718 DUF1203:  Protein of unknown function (DUF1203);  InterPro: IPR009593 This family consists of several hypothetical bacterial proteins of around 155 residues in length. Family members are present in Rhizobium, Agrobacterium and Streptomyces species.
Probab=23.30  E-value=75  Score=23.46  Aligned_cols=20  Identities=20%  Similarity=0.468  Sum_probs=16.0

Q ss_pred             EEcccCCCCCe-eeeeccCCC
Q psy14480        131 RAMKPIAPGDH-LSICYTDPL  150 (190)
Q Consensus       131 ~a~r~I~~GeE-ltisY~~~~  150 (190)
                      ..++++++||+ |.+||.+..
T Consensus         6 ~cL~~~~~Ge~~lLlsy~p~~   26 (117)
T PF06718_consen    6 HCLRDAEPGEELLLLSYRPFP   26 (117)
T ss_pred             EecccCCCCCeEEEEecCCCC
Confidence            46899999998 568998764


No 25 
>COG1188 Ribosome-associated heat shock protein implicated in the recycling of the 50S subunit (S4 paralog) [Translation, ribosomal structure and biogenesis]
Probab=22.68  E-value=62  Score=23.29  Aligned_cols=21  Identities=24%  Similarity=0.370  Sum_probs=18.3

Q ss_pred             EEEcccCCCCCeeeeeccCCC
Q psy14480        130 LRAMKPIAPGDHLSICYTDPL  150 (190)
Q Consensus       130 ~~a~r~I~~GeEltisY~~~~  150 (190)
                      +.+...|+.|++|+|.|+...
T Consensus        43 aKpS~~VK~GD~l~i~~~~~~   63 (100)
T COG1188          43 AKPSKEVKVGDILTIRFGNKE   63 (100)
T ss_pred             cccccccCCCCEEEEEeCCcE
Confidence            488899999999999999753


No 26 
>COG1996 RPC10 DNA-directed RNA polymerase, subunit RPC10 (contains C4-type Zn-finger) [Transcription]
Probab=22.27  E-value=40  Score=21.05  Aligned_cols=22  Identities=36%  Similarity=0.907  Sum_probs=16.5

Q ss_pred             ecCCCCCCCCCCccccccccCC
Q psy14480        169 QCERCRDPTELNTFYDGVKCPE  190 (190)
Q Consensus       169 ~C~~C~~~~e~~~~~~~~~C~~  190 (190)
                      .|.+|...-++.....+++||.
T Consensus         8 ~C~~Cg~~~~~~~~~~~irCp~   29 (49)
T COG1996           8 KCARCGREVELDQETRGIRCPY   29 (49)
T ss_pred             EhhhcCCeeehhhccCceeCCC
Confidence            5888888777666777888873


No 27 
>PRK00398 rpoP DNA-directed RNA polymerase subunit P; Provisional
Probab=21.75  E-value=53  Score=19.64  Aligned_cols=21  Identities=43%  Similarity=1.110  Sum_probs=11.9

Q ss_pred             ecCCCCCCCCCCccccccccC
Q psy14480        169 QCERCRDPTELNTFYDGVKCP  189 (190)
Q Consensus       169 ~C~~C~~~~e~~~~~~~~~C~  189 (190)
                      .|+.|-..-+.+.....++||
T Consensus         5 ~C~~CG~~~~~~~~~~~~~Cp   25 (46)
T PRK00398          5 KCARCGREVELDEYGTGVRCP   25 (46)
T ss_pred             ECCCCCCEEEECCCCCceECC
Confidence            466666655444444466665


No 28 
>PF14353 CpXC:  CpXC protein
Probab=20.51  E-value=1.5e+02  Score=21.56  Aligned_cols=33  Identities=12%  Similarity=0.158  Sum_probs=23.5

Q ss_pred             eccCCCCCHHHHHHHhccCCCeeeecCCCCCCCC
Q psy14480        145 CYTDPLWGTINRRHHLQTSKYFICQCERCRDPTE  178 (190)
Q Consensus       145 sY~~~~~~~~~R~~~L~~~~~F~C~C~~C~~~~e  178 (190)
                      +.++.......|.+.| +.-.|.-.|+.|.....
T Consensus        17 ~~I~~~~~p~l~e~il-~g~l~~~~CP~Cg~~~~   49 (128)
T PF14353_consen   17 TSINADEDPELKEKIL-DGSLFSFTCPSCGHKFR   49 (128)
T ss_pred             eEEcCcCCHHHHHHHH-cCCcCEEECCCCCCcee
Confidence            3445555666777777 56789999999986543


Done!