Query         psy3274
Match_columns 115
No_of_seqs    96 out of 98
Neff          4.0 
Searched_HMMs 46136
Date          Fri Aug 16 20:52:29 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy3274.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/3274hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PF13867 SAP30_Sin3_bdg:  Sin3   99.9 7.6E-26 1.6E-30  144.3   6.1   53   46-98      1-53  (53)
  2 PF13866 zf-SAP30:  SAP30 zinc-  99.4 1.5E-14 3.2E-19   99.9   0.6   29    2-30     50-78  (78)
  3 PF10281 Ish1:  Putative stress  97.3 0.00071 1.5E-08   40.0   4.7   37   43-79      1-38  (38)
  4 PF02037 SAP:  SAP domain;  Int  94.5   0.092   2E-06   30.5   4.0   35   43-79      1-35  (35)
  5 smart00513 SAP Putative DNA-bi  90.4    0.92   2E-05   26.0   4.5   34   43-78      1-34  (35)
  6 PF07498 Rho_N:  Rho terminatio  87.8    0.84 1.8E-05   27.6   3.2   37   42-78      1-37  (43)
  7 PF12949 HeH:  HeH/LEM domain;   85.0    0.89 1.9E-05   27.0   2.2   33   44-76      2-34  (35)
  8 TIGR00578 ku70 ATP-dependent D  83.1     1.7 3.6E-05   38.9   4.1   37   42-80    548-584 (584)
  9 PF09124 Endonuc-dimeris:  T4 r  61.9      15 0.00032   23.9   3.5   39   42-80     12-50  (54)
 10 COG5126 FRQ1 Ca2+-binding prot  55.3      33  0.0007   26.4   5.0   48   40-89      8-58  (160)
 11 smart00540 LEM in nuclear memb  55.1      31 0.00067   21.4   4.0   37   42-78      2-40  (44)
 12 KOG3093|consensus               52.7     9.7 0.00021   30.7   1.8   46   37-89    133-191 (200)
 13 TIGR00988 hip integration host  49.5      31 0.00067   23.1   3.7   34   66-99      1-35  (94)
 14 PF02758 PYRIN:  PAAD/DAPIN/Pyr  46.4      35 0.00075   22.7   3.5   40   43-82      9-59  (83)
 15 PF15008 DUF4518:  Domain of un  46.1      43 0.00093   27.8   4.7   40   46-85     61-100 (262)
 16 PF05193 Peptidase_M16_C:  Pept  45.9      30 0.00066   23.3   3.2   40   46-85      2-47  (184)
 17 PF08006 DUF1700:  Protein of u  35.9      40 0.00086   25.1   2.8   40   49-93      8-50  (181)
 18 PRK00199 ihfB integration host  34.8      72  0.0016   21.4   3.7   34   66-99      1-35  (94)
 19 PF11923 DUF3441:  Domain of un  32.9      43 0.00093   24.2   2.5   31   47-79     37-68  (112)
 20 PLN02365 2-oxoglutarate-depend  31.5      56  0.0012   26.5   3.2   53   37-89      5-63  (300)
 21 PF13720 Acetyltransf_11:  Udp   30.7      80  0.0017   21.3   3.4   48   46-95     28-75  (83)
 22 PF01204 Trehalase:  Trehalase;  28.6      52  0.0011   29.0   2.7   51   47-97     18-76  (512)
 23 PHA00426 type II holin          27.5      62  0.0013   22.0   2.3   29   70-98     20-48  (67)
 24 PF11116 DUF2624:  Protein of u  26.9      92   0.002   21.9   3.2   37   43-84     12-48  (85)
 25 PLN02984 oxidoreductase, 2OG-F  25.4      79  0.0017   26.4   3.1   53   37-90     38-95  (341)
 26 PHA02564 V virion protein; Pro  25.3   2E+02  0.0043   21.7   5.0   25   69-93    111-136 (141)
 27 PRK12461 UDP-N-acetylglucosami  24.8 1.2E+02  0.0026   24.2   4.0   48   46-95    200-247 (255)
 28 PF08564 CDC37_C:  Cdc37 C term  24.1      81  0.0017   22.5   2.5   50   44-94      7-59  (99)
 29 PF10746 Phage_holin_6:  Phage   24.1      86  0.0019   21.3   2.5   27   71-97     20-46  (66)
 30 PF09106 SelB-wing_2:  Elongati  23.8      84  0.0018   19.4   2.3   39   49-92      3-42  (59)
 31 PF00797 Acetyltransf_2:  N-ace  23.1      59  0.0013   24.7   1.8   23   66-89    218-240 (240)
 32 cd01104 HTH_MlrA-CarA Helix-Tu  22.3      94   0.002   18.9   2.3   18   45-62     11-28  (68)
 33 KOG1946|consensus               21.3      95  0.0021   25.4   2.8   48   49-96    126-192 (240)
 34 smart00027 EH Eps15 homology d  21.2 2.5E+02  0.0055   18.4   5.4   32   46-77      4-38  (96)
 35 PF04501 Baculo_VP39:  Baculovi  20.8      38 0.00083   28.8   0.4   60    5-64     44-110 (300)
 36 COG2971 Predicted N-acetylgluc  20.6      84  0.0018   26.7   2.4   34   68-105   175-208 (301)
 37 PF12794 MscS_TM:  Mechanosensi  20.6      38 0.00081   28.2   0.3   16   37-52    291-306 (340)
 38 PF01436 NHL:  NHL repeat;  Int  20.2      28  0.0006   18.7  -0.4   14    4-17     13-26  (28)

No 1  
>PF13867 SAP30_Sin3_bdg:  Sin3 binding region of histone deacetylase complex subunit SAP30; PDB: 2LD7_A.
Probab=99.92  E-value=7.6e-26  Score=144.28  Aligned_cols=53  Identities=51%  Similarity=0.799  Sum_probs=41.2

Q ss_pred             CCHHHHHHHHHhcCCCCCCCCCHHHHHHHHHhhhcCCCCChHHHHHHHHHHHh
Q psy3274          46 LQVNTLRRYKKYFKVTSRPGLNKAQLAETLLKHFKTIPVVEKDVLSFFFYTVK   98 (115)
Q Consensus        46 L~~~tLrRYkr~f~L~~~p~~sK~QLa~~V~kHF~s~pV~E~evI~~Fiy~VK   98 (115)
                      |+++|||||+++|+|+..|+++|+|||++|++||+++||+|+|||++|||+||
T Consensus         1 L~~~tLrrY~~~~~l~~~~~~sK~qLa~~V~kHF~s~~v~E~evI~~Fly~vk   53 (53)
T PF13867_consen    1 LQTPTLRRYKKHYKLPERPRSSKEQLANAVRKHFNSQPVDENEVIANFLYKVK   53 (53)
T ss_dssp             S-HHHHHHHHHHTT----SS--HHHHHHHHHHHHTT----HHHHHHHHHHHHH
T ss_pred             CchHHHHHHHHHhCCCCCCCCCHHHHHHHHHHHHhcCCCCHHHHHHHHHHHhC
Confidence            68999999999999999999999999999999999999999999999999997


No 2  
>PF13866 zf-SAP30:  SAP30 zinc-finger; PDB: 2KDP_A 2LD7_A.
Probab=99.45  E-value=1.5e-14  Score=99.87  Aligned_cols=29  Identities=79%  Similarity=1.297  Sum_probs=18.1

Q ss_pred             CccceeecccchhhhHHhhhhccCCCCCC
Q psy3274           2 KARHIYICEYHKCLIQSVRNKRRRKDSED   30 (115)
Q Consensus         2 ~a~h~yicd~hk~~iqs~r~~r~~~~s~~   30 (115)
                      +|+|||||||||++|||||++|+|++|||
T Consensus        50 sa~H~YiCd~Hk~~Iqs~R~kr~rk~~ed   78 (78)
T PF13866_consen   50 SARHIYICDFHKNMIQSVRNKRKRKDSED   78 (78)
T ss_dssp             T-S-----HHHHHHHHTTSSS-S-SS--S
T ss_pred             ccchhhHHHHHHHHHhcccccccccccCC
Confidence            68999999999999999999999999986


No 3  
>PF10281 Ish1:  Putative stress-responsive nuclear envelope protein;  InterPro: IPR018803  This group of proteins, found primarily in fungi, consists of putative stress-responsive nuclear envelope protein Ish1 and homologues []. 
Probab=97.29  E-value=0.00071  Score=40.00  Aligned_cols=37  Identities=11%  Similarity=0.155  Sum_probs=34.2

Q ss_pred             cccCCHHHHHHHHHhcCCCCCCCC-CHHHHHHHHHhhh
Q psy3274          43 LFQLQVNTLRRYKKYFKVTSRPGL-NKAQLAETLLKHF   79 (115)
Q Consensus        43 f~kL~~~tLrRYkr~f~L~~~p~~-sK~QLa~~V~kHF   79 (115)
                      |+..+.+-|+.|-..+++.+++.. +|+||++.|++|+
T Consensus         1 fdtWs~~~L~~wL~~~gi~~~~~~~~rd~Ll~~~k~~y   38 (38)
T PF10281_consen    1 FDTWSDSDLKSWLKSHGIPVPKSAKTRDELLKLAKKNY   38 (38)
T ss_pred             CCCCCHHHHHHHHHHcCCCCCCCCCCHHHHHHHHHHhC
Confidence            677899999999999999999998 9999999999874


No 4  
>PF02037 SAP:  SAP domain;  InterPro: IPR003034 The SAP (after SAF-A/B, Acinus and PIAS) motif is a putative DNA binding domain found in diverse nuclear proteins involved in chromosomal organisation [], including in apoptosis []. In yeast, SAP is found in the most distal N-terminal region of E3 SUMO-protein ligase SIZ1, where it is involved in nuclear localization [].; GO: 0003676 nucleic acid binding; PDB: 2RNN_A 1JEQ_A 2KW9_A 2KVU_A 2DO1_A 1ZBU_B 1ZBH_A 2DO5_A 2RNO_A 1H1J_S ....
Probab=94.47  E-value=0.092  Score=30.53  Aligned_cols=35  Identities=31%  Similarity=0.333  Sum_probs=29.5

Q ss_pred             cccCCHHHHHHHHHhcCCCCCCCCCHHHHHHHHHhhh
Q psy3274          43 LFQLQVNTLRRYKKYFKVTSRPGLNKAQLAETLLKHF   79 (115)
Q Consensus        43 f~kL~~~tLrRYkr~f~L~~~p~~sK~QLa~~V~kHF   79 (115)
                      |+.|.++.|+.+.+..+|++  +-+|++|++-+..||
T Consensus         1 l~~l~v~eLk~~l~~~gL~~--~G~K~~Li~Rl~~~l   35 (35)
T PF02037_consen    1 LSKLTVAELKEELKERGLST--SGKKAELIERLKEHL   35 (35)
T ss_dssp             TTTSHHHHHHHHHHHTTS-S--TSSHHHHHHHHHHHH
T ss_pred             CCcCcHHHHHHHHHHCCCCC--CCCHHHHHHHHHHhC
Confidence            46788999999999999954  455999999999986


No 5  
>smart00513 SAP Putative DNA-binding (bihelical) motif predicted to be involved in chromosomal organisation.
Probab=90.36  E-value=0.92  Score=25.95  Aligned_cols=34  Identities=29%  Similarity=0.343  Sum_probs=29.0

Q ss_pred             cccCCHHHHHHHHHhcCCCCCCCCCHHHHHHHHHhh
Q psy3274          43 LFQLQVNTLRRYKKYFKVTSRPGLNKAQLAETLLKH   78 (115)
Q Consensus        43 f~kL~~~tLrRYkr~f~L~~~p~~sK~QLa~~V~kH   78 (115)
                      ++.+..+.|+...+.++|++  .-+|++|+.-+.+|
T Consensus         1 ~~~l~~~~Lk~~l~~~gl~~--~G~K~~Lv~Rl~~~   34 (35)
T smart00513        1 LAKLKVSELKDELKKRGLST--SGTKAELVDRLLEA   34 (35)
T ss_pred             CCcCcHHHHHHHHHHcCCCC--CCCHHHHHHHHHHh
Confidence            45788999999999999975  44699999998876


No 6  
>PF07498 Rho_N:  Rho termination factor, N-terminal domain;  InterPro: IPR011112 The Rho termination factor disengages newly transcribed RNA from its DNA template at certain, specific transcripts. It is thought that two copies of Rho bind to RNA and that Rho functions as a hexamer of protomers []. This domain is found to the N terminus of the RNA binding domain (IPR011113 from INTERPRO).; GO: 0006353 transcription termination, DNA-dependent; PDB: 1A8V_B 1PVO_A 1PV4_D 3ICE_A 1XPU_C 1XPO_D 1XPR_F 2A8V_B 2HT1_B 1A63_A ....
Probab=87.76  E-value=0.84  Score=27.56  Aligned_cols=37  Identities=22%  Similarity=0.327  Sum_probs=29.9

Q ss_pred             CcccCCHHHHHHHHHhcCCCCCCCCCHHHHHHHHHhh
Q psy3274          42 DLFQLQVNTLRRYKKYFKVTSRPGLNKAQLAETLLKH   78 (115)
Q Consensus        42 Df~kL~~~tLrRYkr~f~L~~~p~~sK~QLa~~V~kH   78 (115)
                      ||.+++..-|+.+-+.+++.-...+.|.||+.+|.+.
T Consensus         1 eL~~~~~~eL~~iAk~lgI~~~~~~~K~eLI~~Il~~   37 (43)
T PF07498_consen    1 ELKSMTLSELREIAKELGIEGYSKMRKQELIFAILKA   37 (43)
T ss_dssp             HHHCS-HHHHHHHHHCTT-TTGCCS-HHHHHHHHHHH
T ss_pred             CcccCCHHHHHHHHHHcCCCCCCcCCHHHHHHHHHHH
Confidence            4678899999999999999878889999999988765


No 7  
>PF12949 HeH:  HeH/LEM domain; PDB: 2OUT_A.
Probab=85.00  E-value=0.89  Score=27.04  Aligned_cols=33  Identities=21%  Similarity=0.184  Sum_probs=22.2

Q ss_pred             ccCCHHHHHHHHHhcCCCCCCCCCHHHHHHHHH
Q psy3274          44 FQLQVNTLRRYKKYFKVTSRPGLNKAQLAETLL   76 (115)
Q Consensus        44 ~kL~~~tLrRYkr~f~L~~~p~~sK~QLa~~V~   76 (115)
                      ..|.++-||+.-..+++..+.+..|++|+..+.
T Consensus         2 ~sltV~~Lk~iL~~~~I~~ps~AkKaeLv~L~~   34 (35)
T PF12949_consen    2 KSLTVAQLKRILDEHGIEFPSNAKKAELVALFN   34 (35)
T ss_dssp             TT--SHHHHHHHHHHT---SSS--SHHHHHH--
T ss_pred             CcCcHHHHHHHHHHcCCCCCCCCCHHHHHHHHc
Confidence            468899999999999999999999999998764


No 8  
>TIGR00578 ku70 ATP-dependent DNA helicase ii, 70 kDa subunit (ku70). Proteins in this family are involved in non-homologous end joining, a process used for the repair of double stranded DNA breaks. This family is based on the phylogenomic analysis of JA Eisen (1999, Ph.D. Thesis, Stanford University). Cutoff does not detect the putative ku70 homologs in yeast.
Probab=83.11  E-value=1.7  Score=38.88  Aligned_cols=37  Identities=32%  Similarity=0.559  Sum_probs=32.2

Q ss_pred             CcccCCHHHHHHHHHhcCCCCCCCCCHHHHHHHHHhhhc
Q psy3274          42 DLFQLQVNTLRRYKKYFKVTSRPGLNKAQLAETLLKHFK   80 (115)
Q Consensus        42 Df~kL~~~tLrRYkr~f~L~~~p~~sK~QLa~~V~kHF~   80 (115)
                      .|++|+++.|+-|.+.++|.  .+..|++|++.|.+||.
T Consensus       548 ~l~kltv~~Lk~~l~~~g~~--~~~kKadLi~~i~~~~~  584 (584)
T TIGR00578       548 TLGKLTVSVLKDFCRAYGLR--SGSKKQELLDALTKHFK  584 (584)
T ss_pred             ChhhccHHHHHHHHHHcCCC--ccccHHHHHHHHHHHhC
Confidence            67889999999999999876  44569999999999983


No 9  
>PF09124 Endonuc-dimeris:  T4 recombination endonuclease VII, dimerisation;  InterPro: IPR015208 This entry represents a dimerisation domain predominantly found in Bacteriophage T4 recombination endonuclease VII. It adopts a helical secondary structure, with three alpha helices oriented parallel to each other. As well as mediating dimerisation of the protein, this domain is also involved in binding to the DNA major groove []. ; PDB: 1EN7_B 1E7L_B 2QNF_A 2QNC_A 1E7D_A.
Probab=61.87  E-value=15  Score=23.94  Aligned_cols=39  Identities=18%  Similarity=0.095  Sum_probs=28.0

Q ss_pred             CcccCCHHHHHHHHHhcCCCCCCCCCHHHHHHHHHhhhc
Q psy3274          42 DLFQLQVNTLRRYKKYFKVTSRPGLNKAQLAETLLKHFK   80 (115)
Q Consensus        42 Df~kL~~~tLrRYkr~f~L~~~p~~sK~QLa~~V~kHF~   80 (115)
                      -|+.|+......=...+++..+++.+|+||+..-++-|.
T Consensus        12 ~FSRl~k~eMiaem~~~G~~y~~~~tK~~Lvk~fkKql~   50 (54)
T PF09124_consen   12 WFSRLTKPEMIAEMDSYGFEYNEKDTKAQLVKIFKKQLK   50 (54)
T ss_dssp             HHHTS-HHHHHHHHHHTT----TTS-HHHHHHHHHHHHH
T ss_pred             HHHhcCHHHHHHHHHHhCCcCCccccHHHHHHHHHHHHH
Confidence            477888888888888999999999999999998887664


No 10 
>COG5126 FRQ1 Ca2+-binding protein (EF-Hand superfamily) [Signal transduction mechanisms / Cytoskeleton / Cell division and chromosome partitioning / General function prediction only]
Probab=55.30  E-value=33  Score=26.41  Aligned_cols=48  Identities=15%  Similarity=0.202  Sum_probs=37.6

Q ss_pred             ccCcccCCHHHHHHHHHhcCCCC---CCCCCHHHHHHHHHhhhcCCCCChHHH
Q psy3274          40 DVDLFQLQVNTLRRYKKYFKVTS---RPGLNKAQLAETLLKHFKTIPVVEKDV   89 (115)
Q Consensus        40 ~vDf~kL~~~tLrRYkr~f~L~~---~p~~sK~QLa~~V~kHF~s~pV~E~ev   89 (115)
                      .+-|.+|+.+-+.++++.|.|--   .-.++|.+|..+++  |-.+...|.++
T Consensus         8 ~~~~~~~t~~qi~~lkeaF~l~D~d~~G~I~~~el~~ilr--~lg~~~s~~ei   58 (160)
T COG5126           8 LLTFTQLTEEQIQELKEAFQLFDRDSDGLIDRNELGKILR--SLGFNPSEAEI   58 (160)
T ss_pred             hhhcccCCHHHHHHHHHHHHHhCcCCCCCCcHHHHHHHHH--HcCCCCcHHHH
Confidence            34678899999999999999933   34589999999887  66666666554


No 11 
>smart00540 LEM in nuclear membrane-associated proteins. LEM, domain in nuclear membrane-associated proteins, including lamino-associated polypeptide 2 and emerin.
Probab=55.06  E-value=31  Score=21.35  Aligned_cols=37  Identities=22%  Similarity=0.240  Sum_probs=29.8

Q ss_pred             CcccCCHHHHHHHHHhcCCCCCC--CCCHHHHHHHHHhh
Q psy3274          42 DLFQLQVNTLRRYKKYFKVTSRP--GLNKAQLAETLLKH   78 (115)
Q Consensus        42 Df~kL~~~tLrRYkr~f~L~~~p--~~sK~QLa~~V~kH   78 (115)
                      |+..|+++-|++=...|+++..|  ..||.-+..-+.++
T Consensus         2 d~~~LSd~eL~~~L~~~G~~~gPIt~sTR~vy~kkL~~~   40 (44)
T smart00540        2 DVDRLSDAELRAELKQYGLPPGPITDTTRKLYEKKLRKL   40 (44)
T ss_pred             chhHcCHHHHHHHHHHcCCCCCCcCcchHHHHHHHHHHH
Confidence            78899999999999999997776  57787666655554


No 12 
>KOG3093|consensus
Probab=52.68  E-value=9.7  Score=30.67  Aligned_cols=46  Identities=33%  Similarity=0.361  Sum_probs=33.4

Q ss_pred             CCCccCcccC----------CHHHHHHHHHhcCCCCCC--CCC-HHHHHHHHHhhhcCCCCChHHH
Q psy3274          37 ESPDVDLFQL----------QVNTLRRYKKYFKVTSRP--GLN-KAQLAETLLKHFKTIPVVEKDV   89 (115)
Q Consensus        37 ~~p~vDf~kL----------~~~tLrRYkr~f~L~~~p--~~s-K~QLa~~V~kHF~s~pV~E~ev   89 (115)
                      ..|.|-|+..          =+++|.||..||.-+-+-  ++. |||++.       .+||.|+|+
T Consensus       133 ivPGvAFd~~g~RlGhGkGYYD~flkry~~~~~~~kp~~vgL~l~EQI~~-------~IPv~~~Dv  191 (200)
T KOG3093|consen  133 IVPGVAFDRKGARLGHGKGYYDDFLKRYQIHAPEQKPLLVGLCLKEQILS-------EIPVEEHDV  191 (200)
T ss_pred             EecccccchhhhhccCCcchHHHHHHHHHHhccccCchhhhhhhhHhhcc-------cCCCCccce
Confidence            3588888753          378999999998432221  344 899887       799999886


No 13 
>TIGR00988 hip integration host factor, beta subunit. This protein forms a site-specific DNA-binding heterodimer with the homologous integration host factor alpha subunit. It is closely related to the DNA-binding protein HU.
Probab=49.53  E-value=31  Score=23.06  Aligned_cols=34  Identities=18%  Similarity=0.338  Sum_probs=25.0

Q ss_pred             CCHHHHHHHHHhhhcCCCCChH-HHHHHHHHHHhh
Q psy3274          66 LNKAQLAETLLKHFKTIPVVEK-DVLSFFFYTVKS   99 (115)
Q Consensus        66 ~sK~QLa~~V~kHF~s~pV~E~-evI~~Fiy~VK~   99 (115)
                      ++|.||++.|.++...++-.+. .+|..|+-.+..
T Consensus         1 m~k~eli~~i~~~~~~~s~~~v~~vv~~~~~~i~~   35 (94)
T TIGR00988         1 MTKSELIERIATQQSHLPAKDVEDAVKTMLEHMAS   35 (94)
T ss_pred             CCHHHHHHHHHHHcCCCCHHHHHHHHHHHHHHHHH
Confidence            5899999999988766554443 677777776644


No 14 
>PF02758 PYRIN:  PAAD/DAPIN/Pyrin domain;  InterPro: IPR004020 Pyrin domain was identified as putative protein-protein interaction domain at the N-terminal region of several proteins thought to function in apoptotic and inflammatory signalling pathways. Using secondary structure prediction and potential-based fold recognition methods, the PYRIN domain is predicted to be a member of the six-helix bundle death domain-fold superfamily that includes death domains (DDs), death effector domains (DEDs), and caspase recruitment domains (CARDs). Members of the death domain-fold superfamily are well established mediators of protein-protein interactions found in many proteins involved in apoptosis and inflammation, indicating further that the PYRIN domains serve a similar function. Comparison of a circular dichroism spectrum of the PYRIN domain of CARD7/DEFCAP/NAC/NALP1 with spectra of several proteins known to adopt the death domain-fold provides experimental support for the structure prediction [] It is found in interferon-inducible proteins, pyrin and myeloid cell nuclear differentiation antigen.; PDB: 2DO9_A 2YU0_A 2KN6_A 1UCP_A 2L6A_A 2KM6_A 1PN5_A 2DBG_A 3QF2_B 2HM2_Q.
Probab=46.37  E-value=35  Score=22.70  Aligned_cols=40  Identities=23%  Similarity=0.420  Sum_probs=29.4

Q ss_pred             cccCCHHHHHHHHHhcCCCCCC-----------CCCHHHHHHHHHhhhcCC
Q psy3274          43 LFQLQVNTLRRYKKYFKVTSRP-----------GLNKAQLAETLLKHFKTI   82 (115)
Q Consensus        43 f~kL~~~tLrRYkr~f~L~~~p-----------~~sK~QLa~~V~kHF~s~   82 (115)
                      |..|+..-|++.|-+......+           +.++.+||+.+-+||...
T Consensus         9 Le~L~~~efk~FK~~L~~~~~~~~~~Ip~~~le~ad~~~la~lLv~~y~~~   59 (83)
T PF02758_consen    9 LEELSEEEFKRFKWLLKEPVKEGFPPIPRGELEKADREDLADLLVQHYGEQ   59 (83)
T ss_dssp             HHTS-HHHHHHHHHHHHSTSSTTTCSSSHCHHHHSSHHHHHHHHHHHTCHH
T ss_pred             HHhCCHHHHHHHHHHhcchhhcCCCCCCHHHHhhCCHHHHHHHHHHHcCHH
Confidence            5678899999999887422211           367899999999999754


No 15 
>PF15008 DUF4518:  Domain of unknown function (DUF4518)
Probab=46.15  E-value=43  Score=27.80  Aligned_cols=40  Identities=20%  Similarity=0.211  Sum_probs=35.3

Q ss_pred             CCHHHHHHHHHhcCCCCCCCCCHHHHHHHHHhhhcCCCCC
Q psy3274          46 LQVNTLRRYKKYFKVTSRPGLNKAQLAETLLKHFKTIPVV   85 (115)
Q Consensus        46 L~~~tLrRYkr~f~L~~~p~~sK~QLa~~V~kHF~s~pV~   85 (115)
                      ..-+.|=+|--..++.++|+.+|.+|+..+.++..+++..
T Consensus        61 V~RelLFkYLa~kgv~v~p~s~K~~LI~~~l~~W~~~~~~  100 (262)
T PF15008_consen   61 VKRELLFKYLASKGVIVPPDSEKADLIQRVLQYWSSQLQQ  100 (262)
T ss_pred             ccHHHHHHHHHHcCCCCCCCccHHHHHHHHHHHHhccccc
Confidence            5667788899999999999999999999999999887654


No 16 
>PF05193 Peptidase_M16_C:  Peptidase M16 inactive domain;  InterPro: IPR007863 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. These metallopeptidases belong to MEROPS peptidase family M16 (clan ME). They include proteins, which are classified as non-peptidase homologues either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity.  The peptidases in this group of sequences include:  Insulinase, insulin-degrading enzyme (3.4.24.56 from EC) Mitochondrial processing peptidase alpha subunit, (Alpha-MPP, 3.4.24.64 from EC) Pitrlysin, Protease III precursor (3.4.24.55 from EC) Nardilysin, (3.4.24.61 from EC) Ubiquinol-cytochrome C reductase complex core protein I,mitochondrial precursor (1.10.2.2 from EC) Coenzyme PQQ synthesis protein F (3.4.99 from EC)  These proteins do not share many regions of sequence similarity; the most noticeable is in the N-terminal section. This region includes a conserved histidine followed, two residues later by a glutamate and another histidine. In pitrilysin, it has been shown [] that this H-x-x-E-H motif is involved in enzymatic activity; the two histidines bind zinc and the glutamate is necessary for catalytic activity. The mitochondrial processing peptidase consists of two structurally related domains. One is the active peptidase whereas the other, the C-terminal region, is inactive. The two domains hold the substrate like a clamp [].; GO: 0004222 metalloendopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis; PDB: 1BE3_B 1PP9_B 2A06_B 1SQB_B 1SQP_B 1L0N_B 1SQX_B 1NU1_B 1L0L_B 2FYU_B ....
Probab=45.89  E-value=30  Score=23.34  Aligned_cols=40  Identities=23%  Similarity=0.378  Sum_probs=29.5

Q ss_pred             CCHHHHHHHHHhcCCCC------CCCCCHHHHHHHHHhhhcCCCCC
Q psy3274          46 LQVNTLRRYKKYFKVTS------RPGLNKAQLAETLLKHFKTIPVV   85 (115)
Q Consensus        46 L~~~tLrRYkr~f~L~~------~p~~sK~QLa~~V~kHF~s~pV~   85 (115)
                      ++.+.|+.|.+.|--+.      --+++-+++...|.++|.++|..
T Consensus         2 it~e~l~~f~~~~y~p~n~~l~i~Gd~~~~~~~~~i~~~~~~l~~~   47 (184)
T PF05193_consen    2 ITLEDLRAFYKKFYRPSNMTLVIVGDIDPDELEKLIEKYFGSLPKS   47 (184)
T ss_dssp             --HHHHHHHHHHHSSGGGEEEEEEESSGHHHHHHHHHHHHTTSSHS
T ss_pred             CCHHHHHHHHHHhcCccceEEEEEcCccHHHHHHHHHhhhhhhccc
Confidence            56788899887665432      12588999999999999998854


No 17 
>PF08006 DUF1700:  Protein of unknown function (DUF1700);  InterPro: IPR012963 This family contains many hypothetical bacterial proteins and two putative membrane proteins (Q6GFD0 from SWISSPROT and Q6G806 from SWISSPROT).
Probab=35.86  E-value=40  Score=25.09  Aligned_cols=40  Identities=20%  Similarity=0.346  Sum_probs=25.3

Q ss_pred             HHHHHHHHhcCCCCCCCCCHHHHHHHHHhhhcC---CCCChHHHHHHH
Q psy3274          49 NTLRRYKKYFKVTSRPGLNKAQLAETLLKHFKT---IPVVEKDVLSFF   93 (115)
Q Consensus        49 ~tLrRYkr~f~L~~~p~~sK~QLa~~V~kHF~s---~pV~E~evI~~F   93 (115)
                      +.|++|-+.  |   |...+++..+--..||..   -..+|.|++..+
T Consensus         8 ~~L~~~L~~--l---p~~e~~e~l~~Y~e~f~d~~~~G~sEeeii~~L   50 (181)
T PF08006_consen    8 NELEKYLKK--L---PEEEREEILEYYEEYFDDAGEEGKSEEEIIAEL   50 (181)
T ss_pred             HHHHHHHHc--C---CHHHHHHHHHHHHHHHHHhhhCCCCHHHHHHHc
Confidence            345555542  2   444577788888888874   345688887764


No 18 
>PRK00199 ihfB integration host factor subunit beta; Reviewed
Probab=34.76  E-value=72  Score=21.36  Aligned_cols=34  Identities=12%  Similarity=0.200  Sum_probs=22.9

Q ss_pred             CCHHHHHHHHHhhhcCCCCChH-HHHHHHHHHHhh
Q psy3274          66 LNKAQLAETLLKHFKTIPVVEK-DVLSFFFYTVKS   99 (115)
Q Consensus        66 ~sK~QLa~~V~kHF~s~pV~E~-evI~~Fiy~VK~   99 (115)
                      +||.||++.|...-..++-.+. .++..|+..+..
T Consensus         1 mtk~eli~~ia~~~~~~s~~~~~~vv~~~~~~i~~   35 (94)
T PRK00199          1 MTKSELIERLAARNPHLSAKDVENAVKEILEEMSD   35 (94)
T ss_pred             CCHHHHHHHHHHHcCCCCHHHHHHHHHHHHHHHHH
Confidence            5899999999876544443333 667777766543


No 19 
>PF11923 DUF3441:  Domain of unknown function (DUF3441);  InterPro: IPR021846  This presumed domain is functionally uncharacterised. This domain is found in archaea and eukaryotes. This domain is typically between 104 to 119 amino acids in length. This domain is found associated with PF05833 from PFAM, PF05670 from PFAM. This domain has two conserved residues (P and G) that may be functionally important. 
Probab=32.93  E-value=43  Score=24.17  Aligned_cols=31  Identities=32%  Similarity=0.552  Sum_probs=24.8

Q ss_pred             CHHHHHHHHHhcCCCCCCCCC-HHHHHHHHHhhh
Q psy3274          47 QVNTLRRYKKYFKVTSRPGLN-KAQLAETLLKHF   79 (115)
Q Consensus        47 ~~~tLrRYkr~f~L~~~p~~s-K~QLa~~V~kHF   79 (115)
                      +-.+|.+|+.+-+|  -||.. |-..|..+-.||
T Consensus        37 P~sal~~yKYkvKl--~PG~~KKGKaak~il~~f   68 (112)
T PF11923_consen   37 PYSALSKYKYKVKL--QPGNAKKGKAAKEILEYF   68 (112)
T ss_pred             CHHHHhhCceeEEE--cCCCcchHHHHHHHHHHH
Confidence            35699999888777  45655 679999999999


No 20 
>PLN02365 2-oxoglutarate-dependent dioxygenase
Probab=31.51  E-value=56  Score=26.49  Aligned_cols=53  Identities=21%  Similarity=0.343  Sum_probs=37.6

Q ss_pred             CCCccCcccC--CHHHHHHHHHhcCC--CCCCCCCHHHHHH--HHHhhhcCCCCChHHH
Q psy3274          37 ESPDVDLFQL--QVNTLRRYKKYFKV--TSRPGLNKAQLAE--TLLKHFKTIPVVEKDV   89 (115)
Q Consensus        37 ~~p~vDf~kL--~~~tLrRYkr~f~L--~~~p~~sK~QLa~--~V~kHF~s~pV~E~ev   89 (115)
                      ..|-|||+.+  ....|+.=++.++.  -+..+++.+.+.+  .+.+.|=++|..|+.-
T Consensus         5 ~iPvIDls~~~~~~~~l~~Ac~~~GfF~l~nHGi~~~l~~~~~~~~~~FF~LP~e~K~~   63 (300)
T PLN02365          5 NIPTIDLEEFPGQIEKLREACERWGCFRVVNHGVSLSLMAEMKKVVRSLFDLPDEVKRR   63 (300)
T ss_pred             CCCEEEChhhHHHHHHHHHHHHHCCEEEEEcCCCCHHHHHHHHHHHHHHHcCCHHHHhh
Confidence            5789999987  34667777777776  3567888754433  2346777899999875


No 21 
>PF13720 Acetyltransf_11:  Udp N-acetylglucosamine O-acyltransferase; Domain 2; PDB: 3I3A_A 3I3X_A 3HSQ_B 2JF2_A 1LXA_A 2AQ9_A 2QIV_X 2QIA_A 2JF3_A 4EQY_F ....
Probab=30.67  E-value=80  Score=21.28  Aligned_cols=48  Identities=15%  Similarity=0.265  Sum_probs=36.6

Q ss_pred             CCHHHHHHHHHhcCCCCCCCCCHHHHHHHHHhhhcCCCCChHHHHHHHHH
Q psy3274          46 LQVNTLRRYKKYFKVTSRPGLNKAQLAETLLKHFKTIPVVEKDVLSFFFY   95 (115)
Q Consensus        46 L~~~tLrRYkr~f~L~~~p~~sK~QLa~~V~kHF~s~pV~E~evI~~Fiy   95 (115)
                      ++.+++...++.|++=...+.+-+|-++.|..-+..  ..|.+-|..||-
T Consensus        28 fs~~~i~~l~~ayr~l~~~~~~~~~a~~~l~~~~~~--~~~v~~~~~Fi~   75 (83)
T PF13720_consen   28 FSKEEISALRRAYRILFRSGLTLEEALEELEEEYPD--SPEVREIVDFIR   75 (83)
T ss_dssp             S-HHHHHHHHHHHHHHHTSSS-HHHHHHHHHHHTTS--CHHHHHHHHHHH
T ss_pred             CCHHHHHHHHHHHHHHHhCCCCHHHHHHHHHHhccC--CHHHHHHHHHHH
Confidence            677888888888888666778889999999987754  667777778875


No 22 
>PF01204 Trehalase:  Trehalase;  InterPro: IPR001661 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 37 GH37 from CAZY comprises enzymes with only one known activity; trehalase (3.2.1.28 from EC). Trehalase is the enzyme responsible for the degradation of the disaccharide alpha,alpha-trehalose yielding two glucose subunits []. It is an enzyme found in a wide variety of organisms and whose sequence has been highly conserved throughout evolution.; GO: 0004555 alpha,alpha-trehalase activity, 0005991 trehalose metabolic process; PDB: 2JJB_B 2WYN_B 2JG0_A 2JF4_A 3C67_A 3D3I_B 3C69_A 3C68_A 2Z07_B.
Probab=28.56  E-value=52  Score=28.97  Aligned_cols=51  Identities=24%  Similarity=0.303  Sum_probs=24.1

Q ss_pred             CHHHHHHHHHhcCCCCCCCCCHHHHHHHHHhhhcCC--------CCChHHHHHHHHHHH
Q psy3274          47 QVNTLRRYKKYFKVTSRPGLNKAQLAETLLKHFKTI--------PVVEKDVLSFFFYTV   97 (115)
Q Consensus        47 ~~~tLrRYkr~f~L~~~p~~sK~QLa~~V~kHF~s~--------pV~E~evI~~Fiy~V   97 (115)
                      ...+|+.|+..+.-....++++++|..-|.+||..-        |.+..+--..|+..+
T Consensus        18 ~~~~~~~~~~~~~~~~~~~~~~~~l~~fv~~~f~~~G~el~~~~~~d~~~~~~~~~~~i   76 (512)
T PF01204_consen   18 PDEILAAFKKLFWNTLTRNISKFDLKQFVEDHFDEPGSELEPWTPPDWKEDPPYFLSRI   76 (512)
T ss_dssp             HHHHHHHHHHHHTS------TT--HHHHHHHHEE------------S------------
T ss_pred             HHHHHHHHHHhhhhccccCCCHHHHHHHHHHhCcCccccccccCCcccCCCCccccccC
Confidence            567899999998876677889999999999999864        344444334455544


No 23 
>PHA00426 type II holin
Probab=27.46  E-value=62  Score=22.01  Aligned_cols=29  Identities=21%  Similarity=0.106  Sum_probs=25.6

Q ss_pred             HHHHHHHhhhcCCCCChHHHHHHHHHHHh
Q psy3274          70 QLAETLLKHFKTIPVVEKDVLSFFFYTVK   98 (115)
Q Consensus        70 QLa~~V~kHF~s~pV~E~evI~~Fiy~VK   98 (115)
                      --|+.+.+-|-++..+|==-|+.|+|.|=
T Consensus        20 ~~ad~~~r~F~Glsl~~WfyvAai~YTVv   48 (67)
T PHA00426         20 AGADGAARLFFGLSLNEWFYVAAIAYTVV   48 (67)
T ss_pred             chhhHHHHHHhcccHHHHHHHHHHHHHHH
Confidence            35788999999999999999999999873


No 24 
>PF11116 DUF2624:  Protein of unknown function (DUF2624);  InterPro: IPR020277 This entry contains proteins with no known function.
Probab=26.87  E-value=92  Score=21.88  Aligned_cols=37  Identities=22%  Similarity=0.395  Sum_probs=24.5

Q ss_pred             cccCCHHHHHHHHHhcCCCCCCCCCHHHHHHHHHhhhcCCCC
Q psy3274          43 LFQLQVNTLRRYKKYFKVTSRPGLNKAQLAETLLKHFKTIPV   84 (115)
Q Consensus        43 f~kL~~~tLrRYkr~f~L~~~p~~sK~QLa~~V~kHF~s~pV   84 (115)
                      +..++.+-|..|.+.|+++.    +++| |+.|..+-.+.|+
T Consensus        12 ln~iT~~eLlkyskqy~i~i----t~~Q-A~~I~~~lr~k~i   48 (85)
T PF11116_consen   12 LNNITAKELLKYSKQYNISI----TKKQ-AEQIANILRGKNI   48 (85)
T ss_pred             HhcCCHHHHHHHHHHhCCCC----CHHH-HHHHHHHHhcCCC
Confidence            34688999999999999964    3333 3344445555443


No 25 
>PLN02984 oxidoreductase, 2OG-Fe(II) oxygenase family protein
Probab=25.42  E-value=79  Score=26.42  Aligned_cols=53  Identities=25%  Similarity=0.328  Sum_probs=40.2

Q ss_pred             CCCccCcccCCHHHHHHHHHhcCC--CCCCCCCHH---HHHHHHHhhhcCCCCChHHHH
Q psy3274          37 ESPDVDLFQLQVNTLRRYKKYFKV--TSRPGLNKA---QLAETLLKHFKTIPVVEKDVL   90 (115)
Q Consensus        37 ~~p~vDf~kL~~~tLrRYkr~f~L--~~~p~~sK~---QLa~~V~kHF~s~pV~E~evI   90 (115)
                      ..|-|||+.+..+.|++=.+.++.  -+..+++.+   ++....+ .|-++|..|+.-+
T Consensus        38 ~IPvIDls~~~~~~l~~A~~~~GFF~v~nHGI~~~li~~~~~~s~-~FF~LP~eeK~k~   95 (341)
T PLN02984         38 DIPVIDMECLDMEKLREACKDWGIFRLENHGIPLTLMSQLKEISE-SLLSLPFESKREL   95 (341)
T ss_pred             CCCeEeCcHHHHHHHHHHHHhCcEEEEECCCCCHHHHHHHHHHHH-HHHcCCHHHHhhh
Confidence            589999999988999999998887  457788753   4444444 4556999998765


No 26 
>PHA02564 V virion protein; Provisional
Probab=25.27  E-value=2e+02  Score=21.74  Aligned_cols=25  Identities=16%  Similarity=0.263  Sum_probs=17.9

Q ss_pred             HHHHHHHHhhhcC-CCCChHHHHHHH
Q psy3274          69 AQLAETLLKHFKT-IPVVEKDVLSFF   93 (115)
Q Consensus        69 ~QLa~~V~kHF~s-~pV~E~evI~~F   93 (115)
                      ..|+.++.+|=.+ +|..+.+|..+|
T Consensus       111 ~~l~~Aii~~EnG~~pys~~~i~~~~  136 (141)
T PHA02564        111 AALVTAIIRHENGQQPYSAEQIAQGV  136 (141)
T ss_pred             HHHHHHHHHHhcCCCCCCHHHHHHHH
Confidence            4677778888777 777776666655


No 27 
>PRK12461 UDP-N-acetylglucosamine acyltransferase; Provisional
Probab=24.76  E-value=1.2e+02  Score=24.22  Aligned_cols=48  Identities=23%  Similarity=0.246  Sum_probs=38.1

Q ss_pred             CCHHHHHHHHHhcCCCCCCCCCHHHHHHHHHhhhcCCCCChHHHHHHHHH
Q psy3274          46 LQVNTLRRYKKYFKVTSRPGLNKAQLAETLLKHFKTIPVVEKDVLSFFFY   95 (115)
Q Consensus        46 L~~~tLrRYkr~f~L~~~p~~sK~QLa~~V~kHF~s~pV~E~evI~~Fiy   95 (115)
                      ++.++++..++.|++=.+.+.+.+|.++.++.-+.  +-.|.+-|..||-
T Consensus       200 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~f~~  247 (255)
T PRK12461        200 FSSRAIRALKRAYKIIYRSGLSVQQAVAELELQQF--ESPEVEELIDFIK  247 (255)
T ss_pred             CCHHHHHHHHHHHHHHHhcCCCHHHHHHHHHHhcc--CCHHHHHHHHHHH
Confidence            77888888899999988889999998888887553  4556666777774


No 28 
>PF08564 CDC37_C:  Cdc37 C terminal domain;  InterPro: IPR013873  Cdc37 is a protein required for the activity of numerous eukaryotic protein kinases. This entry corresponds to the C-terminal domain whose function is unclear. It is found C-terminal to the Hsp90 chaperone (heat shock protein 90) binding domain IPR013874 from INTERPRO and the N-terminal kinase binding domain of Cdc37 IPR013855 from INTERPRO []. ; PDB: 1US7_B.
Probab=24.08  E-value=81  Score=22.53  Aligned_cols=50  Identities=18%  Similarity=0.189  Sum_probs=21.4

Q ss_pred             ccCCHHHHHHHHHhcCCCCCCCCCH---HHHHHHHHhhhcCCCCChHHHHHHHH
Q psy3274          44 FQLQVNTLRRYKKYFKVTSRPGLNK---AQLAETLLKHFKTIPVVEKDVLSFFF   94 (115)
Q Consensus        44 ~kL~~~tLrRYkr~f~L~~~p~~sK---~QLa~~V~kHF~s~pV~E~evI~~Fi   94 (115)
                      ..|+..--++=|.-|. ..+|.+.+   .+=++.|.+-|..+||.|.+-+...+
T Consensus         7 ~~lDp~~~~~~~evFe-slP~emq~Alet~~ld~vnkVl~~M~veeAE~~v~~~   59 (99)
T PF08564_consen    7 EGLDPEEERKAREVFE-SLPPEMQKALETGDLDEVNKVLGKMPVEEAEYHVERC   59 (99)
T ss_dssp             -S--HH------HHHH-HS-TT------------HHHHHT--SSSHHHHHHHHH
T ss_pred             CCCCcchhhhHHHHHH-HCCHHHHHHHHcCCHHHHHHHHHhCCHHHHHHHHHHH
Confidence            3455522223334454 55777665   47889999999999999998877654


No 29 
>PF10746 Phage_holin_6:  Phage holin family 6;  InterPro: IPR019682 This entry represents a protein conserved in Caudovirales (known as tailed bacteriophages). Holins are a diverse family of proteins that cause bacterial membrane lysis during late-protein synthesis. 
Probab=24.07  E-value=86  Score=21.25  Aligned_cols=27  Identities=22%  Similarity=0.194  Sum_probs=24.5

Q ss_pred             HHHHHHhhhcCCCCChHHHHHHHHHHH
Q psy3274          71 LAETLLKHFKTIPVVEKDVLSFFFYTV   97 (115)
Q Consensus        71 La~~V~kHF~s~pV~E~evI~~Fiy~V   97 (115)
                      -|+.+.+-|.+++.+|==.|+.++|.|
T Consensus        20 ~A~~~a~~f~GLslneWfyiati~Ytv   46 (66)
T PF10746_consen   20 GADVVARYFWGLSLNEWFYIATIAYTV   46 (66)
T ss_pred             hHHHHHHHHcCCCHHHHHHHHHHHHHH
Confidence            478889999999999999999999986


No 30 
>PF09106 SelB-wing_2:  Elongation factor SelB, winged helix ;  InterPro: IPR015190 This entry represents a domain with a winged helix-type fold, which consists of a closed 3-helical bundle with a right-handed twist, and a small beta-sheet wing []. Different winged helix domains share a common structure, but can differ in sequence. This entry is designated "type 2".  The winged helix motif is involved in both DNA and RNA binding. In the elongation factor SelB, the winged helix domains recognise RNA, allowing the complex to wrap around the small ribosomal subunit. In bacteria, the incorporation of the amino acid selenocysteine into proteins requires elongation factor SelB, which binds both transfer RNA (tRNA) and mRNA. SelB binds to an mRNA hairpin formed by the selenocysteine insertion sequence (SECIS) with extremely high specificity []. ; GO: 0003723 RNA binding, 0003746 translation elongation factor activity, 0005525 GTP binding, 0001514 selenocysteine incorporation, 0005737 cytoplasm; PDB: 2V9V_A 1LVA_A 2PLY_A 2UWM_A.
Probab=23.75  E-value=84  Score=19.42  Aligned_cols=39  Identities=31%  Similarity=0.469  Sum_probs=23.5

Q ss_pred             HHHHHHHHhcCCCCCCCCCHHHHHHHHHhhhc-CCCCChHHHHHH
Q psy3274          49 NTLRRYKKYFKVTSRPGLNKAQLAETLLKHFK-TIPVVEKDVLSF   92 (115)
Q Consensus        49 ~tLrRYkr~f~L~~~p~~sK~QLa~~V~kHF~-s~pV~E~evI~~   92 (115)
                      +.|..|-+.|-+  .+|++|++|.+-+   |. .+|-+..+.+..
T Consensus         3 ~~L~~yH~~~Pl--r~G~~keeLrsrl---~~~~l~~k~~~~ll~   42 (59)
T PF09106_consen    3 EILAEYHRENPL--RPGMPKEELRSRL---FKPRLPPKLFNALLE   42 (59)
T ss_dssp             HHHHHHHHH-TT--SS-EEHHHHHHHC---ST-TS-HCCHHHHHH
T ss_pred             HHHHHHHHHCcC--ccCcCHHHHHHHH---hhccCCHHHHHHHHH
Confidence            578888877755  8999999998733   33 445444444433


No 31 
>PF00797 Acetyltransf_2:  N-acetyltransferase;  InterPro: IPR001447 Arylamine N-acetyltransferase (NAT) is a cytosolic enzyme of approximately 30 kDa. It facilitates the transfer of an acetyl group from acetyl coenzyme A on to a wide range of arylamine, N-hydroxyarylamines and hydrazines. Acetylation of these compounds generally results in inactivation. NAT is found in many species from Mycobacteria (Mycobacterium tuberculosis, Mycobacterium smegmatis etc) to Homo sapiens (Human). It was the first enzyme to be observed to have polymorphic activity amongst human individuals. NAT is responsible for the inactivation of Isoniazid (a drug used to treat tuberculosis) in humans. The NAT protein has also been shown to be involved in the breakdown of folic acid. NAT catalyses the reaction:  Acetyl-coA + arylamine = coA + N-acetylarylamine   NAT is the target of a common genetic polymorphism of clinical relevance in humans. The N-acetylation polymorphism is determined by low or high NAT activity in liver. NAT has been implicated in the action and toxicity of amine-containing drugs, and in the susceptibility to cancer and systematic lupus erythematosus. Two highly similar human genes for NAT, termed NAT1 and NAT2, encode genetically invariant and variant NAT proteins, respectively. ; GO: 0016407 acetyltransferase activity, 0008152 metabolic process; PDB: 1W6F_A 1W5R_A 1GX3_D 2PQT_A 2IJA_A 1W4T_A 2BSZ_B 3D9W_B 3LTW_A 3LNB_A ....
Probab=23.06  E-value=59  Score=24.72  Aligned_cols=23  Identities=30%  Similarity=0.410  Sum_probs=15.6

Q ss_pred             CCHHHHHHHHHhhhcCCCCChHHH
Q psy3274          66 LNKAQLAETLLKHFKTIPVVEKDV   89 (115)
Q Consensus        66 ~sK~QLa~~V~kHF~s~pV~E~ev   89 (115)
                      .+.+|+.++++++| ++.++|.+|
T Consensus       218 ~~~ee~~~iL~~~F-gi~l~~~~~  240 (240)
T PF00797_consen  218 TSEEELLEILKEYF-GIDLDEEEV  240 (240)
T ss_dssp             SSHHHHHHHHHHTS----GSS---
T ss_pred             CCHHHHHHHHHHhC-CcccCCCCC
Confidence            67899999999999 588887664


No 32 
>cd01104 HTH_MlrA-CarA Helix-Turn-Helix DNA binding domain of the transcription regulators MlrA and CarA. Helix-turn-helix (HTH) transcription regulator MlrA (merR-like regulator A), N-terminal domain. The MlrA protein, also known as YehV, has been shown to control cell-cell aggregation by co-regulating the expression of curli and extracellular matrix production in Escherichia coli and Salmonella typhimurium.  Its close homolog, CarA from Myxococcus xanthus, is involved in activation of the carotenoid biosynthesis genes by light. These proteins belong to the MerR superfamily of transcription regulators that promote expression of several stress regulon genes by reconfiguring the spacer between the -35 and -10 promoter elements. Their conserved N-terminal domains contain predicted HTH motifs that mediate DNA binding, while the dissimilar C-terminal domains bind specific coactivator molecules. Many MlrA- and CarA-like proteins in this group appear to lack the long dimerization helix seen i
Probab=22.31  E-value=94  Score=18.88  Aligned_cols=18  Identities=17%  Similarity=0.468  Sum_probs=14.5

Q ss_pred             cCCHHHHHHHHHhcCCCC
Q psy3274          45 QLQVNTLRRYKKYFKVTS   62 (115)
Q Consensus        45 kL~~~tLrRYkr~f~L~~   62 (115)
                      .++..|||+|.+.+++..
T Consensus        11 gvs~~tlr~w~~~~g~~~   28 (68)
T cd01104          11 GVSPDTLRAWERRYGLPA   28 (68)
T ss_pred             CcCHHHHHHHHHhCCCCC
Confidence            478899999998887743


No 33 
>KOG1946|consensus
Probab=21.33  E-value=95  Score=25.38  Aligned_cols=48  Identities=17%  Similarity=0.135  Sum_probs=34.4

Q ss_pred             HHHHHHHHhcCCCCCCC------------------CCHHHHHHHHHhhhcCCC-CChHHHHHHHHHH
Q psy3274          49 NTLRRYKKYFKVTSRPG------------------LNKAQLAETLLKHFKTIP-VVEKDVLSFFFYT   96 (115)
Q Consensus        49 ~tLrRYkr~f~L~~~p~------------------~sK~QLa~~V~kHF~s~p-V~E~evI~~Fiy~   96 (115)
                      ..||.|-+.++|+.|.+                  +.-=++...+..||...+ ....+...+|++.
T Consensus       126 k~iw~YIke~nLqDP~nkr~IlCDekL~~iF~~k~v~~fem~KLL~~H~~~~~d~~~~~~~~~~~~e  192 (240)
T KOG1946|consen  126 KKIWAYIKEHNLQDPKNKREILCDEKLKSIFGKKRVGMFEMLKLLTKHFLKNQDMVGTDLKQGEETE  192 (240)
T ss_pred             HHHHHHHHHhccCCccccCeeeeHHHHHHHhccCccceeeHHHHHHHhccCccccccchhhcCcccc
Confidence            46899999999988764                  344578899999998643 3334555666654


No 34 
>smart00027 EH Eps15 homology domain. Pair of EF hand motifs that recognise proteins containing Asn-Pro-Phe (NPF) sequences.
Probab=21.20  E-value=2.5e+02  Score=18.40  Aligned_cols=32  Identities=22%  Similarity=0.368  Sum_probs=22.2

Q ss_pred             CCHHHHHHHHHhcCC-CC-CC-CCCHHHHHHHHHh
Q psy3274          46 LQVNTLRRYKKYFKV-TS-RP-GLNKAQLAETLLK   77 (115)
Q Consensus        46 L~~~tLrRYkr~f~L-~~-~p-~~sK~QLa~~V~k   77 (115)
                      ++.+-+.+|+..|.+ +. .. .++.++|..+++.
T Consensus         4 ls~~~~~~l~~~F~~~D~d~~G~Is~~el~~~l~~   38 (96)
T smart00027        4 ISPEDKAKYEQIFRSLDKNQDGTVTGAQAKPILLK   38 (96)
T ss_pred             CCHHHHHHHHHHHHHhCCCCCCeEeHHHHHHHHHH
Confidence            567778888888877 32 22 3777888777765


No 35 
>PF04501 Baculo_VP39:  Baculovirus major capsid protein VP39;  InterPro: IPR007589 This family constitutes the 39 kDa major capsid protein of the Baculoviridae [].; GO: 0005198 structural molecule activity, 0019028 viral capsid
Probab=20.77  E-value=38  Score=28.77  Aligned_cols=60  Identities=17%  Similarity=0.231  Sum_probs=29.0

Q ss_pred             ceeecccchhhhHHhhhhccCCCCCCCCCCCC--C-----CCccCcccCCHHHHHHHHHhcCCCCCC
Q psy3274           5 HIYICEYHKCLIQSVRNKRRRKDSEDDSNEND--E-----SPDVDLFQLQVNTLRRYKKYFKVTSRP   64 (115)
Q Consensus         5 h~yicd~hk~~iqs~r~~r~~~~s~~~s~~~d--~-----~p~vDf~kL~~~tLrRYkr~f~L~~~p   64 (115)
                      -.|||+||-+.-=...++.=...|.++-...-  +     ....+-...=..|=..|..+++++..+
T Consensus        44 G~fIC~yHL~~~FKm~K~~l~Ipdg~~n~~~r~vg~SLv~~~~~~~~RILIPt~~NY~tvlnv~~ms  110 (300)
T PF04501_consen   44 GTFICNYHLSKYFKMEKMVLPIPDGDDNSFKRLVGKSLVQHTATGNKRILIPTKENYETVLNVNSMS  110 (300)
T ss_pred             CeEEechhhhhheeeEEEeecCcCCCCceeeeEEeEeccCCCCCCCceEEeecchhHHHHhCccccC
Confidence            37999999875333322222233443311000  1     111122222233556799999986544


No 36 
>COG2971 Predicted N-acetylglucosamine kinase [Carbohydrate transport and metabolism]
Probab=20.59  E-value=84  Score=26.65  Aligned_cols=34  Identities=18%  Similarity=0.377  Sum_probs=26.8

Q ss_pred             HHHHHHHHHhhhcCCCCChHHHHHHHHHHHhhCCcccc
Q psy3274          68 KAQLAETLLKHFKTIPVVEKDVLSFFFYTVKSNTNKLD  105 (115)
Q Consensus        68 K~QLa~~V~kHF~s~pV~E~evI~~Fiy~VK~~k~Kfd  105 (115)
                      +.-|+++|++||..   +. |-|..|+|+.+-.++++-
T Consensus       175 ~t~L~d~v~~~f~~---d~-edlv~~~y~a~~~~~~ia  208 (301)
T COG2971         175 ATPLTDAVMAEFNL---DP-EDLVAFIYKAGPGDKKIA  208 (301)
T ss_pred             CChHHHHHHHHhCC---CH-HHHHHHHHhcCCchHHHH
Confidence            33499999999987   33 668899999988776654


No 37 
>PF12794 MscS_TM:  Mechanosensitive ion channel inner membrane domain 1
Probab=20.56  E-value=38  Score=28.20  Aligned_cols=16  Identities=44%  Similarity=0.746  Sum_probs=13.9

Q ss_pred             CCCccCcccCCHHHHH
Q psy3274          37 ESPDVDLFQLQVNTLR   52 (115)
Q Consensus        37 ~~p~vDf~kL~~~tLr   52 (115)
                      +.|++|+++.+.-++|
T Consensus       291 ee~~ldl~~I~~Qslr  306 (340)
T PF12794_consen  291 EEPELDLEQISQQSLR  306 (340)
T ss_pred             CCcccCHHHHHHHHHH
Confidence            7899999999888876


No 38 
>PF01436 NHL:  NHL repeat;  InterPro: IPR001258 The NHL repeat, named after NCL-1, HT2A and Lin-41, is found largely in a large number of eukaryotic and prokaryotic proteins. For example, the repeat is found in a variety of enzymes of the copper type II, ascorbate-dependent monooxygenase family which catalyse the C terminus alpha-amidation of biological peptides []. In many it occurs in tandem arrays, for example in the ringfinger beta-box, coiled-coil (RBCC) eukaryotic growth regulators []. The 'Brain Tumor' protein (Brat) is one such growth regulator that contains a 6-bladed NHL-repeat beta-propeller [, ].  The NHL repeats are also found in serine/threonine protein kinase (STPK) in diverse range of pathogenic bacteria. These STPK are transmembrane receptors with a intracellular N-terminal kinase domain and extracellular C-terminal sensor domain. In the STPK, PknD, from Mycobacterium tuberculosis, the sensor domain forms a rigid, six-bladed b-propeller composed of NHL repeats with a flexible tether to the transmembrane domain.; GO: 0005515 protein binding; PDB: 3FVZ_A 3FW0_A 1RWL_A 1RWI_A 1Q7F_A.
Probab=20.15  E-value=28  Score=18.74  Aligned_cols=14  Identities=21%  Similarity=0.432  Sum_probs=10.4

Q ss_pred             cceeecccchhhhH
Q psy3274           4 RHIYICEYHKCLIQ   17 (115)
Q Consensus         4 ~h~yicd~hk~~iq   17 (115)
                      -+||+||..-..||
T Consensus        13 g~i~VaD~~n~rV~   26 (28)
T PF01436_consen   13 GNIYVADSGNHRVQ   26 (28)
T ss_dssp             SEEEEEECCCTEEE
T ss_pred             CCEEEEECCCCEEE
Confidence            46999998766554


Done!