Query         psy16714
Match_columns 68
No_of_seqs    105 out of 549
Neff          7.1 
Searched_HMMs 46136
Date          Fri Aug 16 20:08:11 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy16714.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/16714hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG3589|consensus               99.8 8.7E-20 1.9E-24  119.3   7.8   61    8-68     84-146 (221)
  2 smart00315 RGS Regulator of G   99.8 4.5E-19 9.7E-24  103.7   7.7   63    5-68      3-65  (118)
  3 PF00615 RGS:  Regulator of G p  99.7 2.4E-17 5.3E-22   95.4   8.8   64    4-68      2-65  (118)
  4 PF09128 RGS-like:  Regulator o  98.3 3.3E-06 7.1E-11   54.9   6.6   62    4-67      8-69  (188)
  5 KOG3590|consensus               97.6 0.00014   3E-09   52.8   5.2   39    4-43    115-153 (602)
  6 KOG3590|consensus               97.1  0.0012 2.6E-08   48.0   4.9   58    9-66    322-386 (602)
  7 KOG0986|consensus               97.0  0.0019 4.2E-08   47.5   5.6   57    9-66     58-115 (591)
  8 PF15182 OTOS:  Otospiralin      86.1     1.6 3.5E-05   24.0   3.4   30   31-60     25-54  (69)
  9 PF02184 HAT:  HAT (Half-A-TPR)  80.7     1.3 2.9E-05   21.0   1.5   15   50-64      3-17  (32)
 10 PF11467 LEDGF:  Lens epitheliu  62.0      14 0.00031   21.9   3.2   18   44-61     59-76  (106)
 11 COG4790 EscR Type III secretor  60.8      13 0.00027   24.8   3.0   32   10-41     91-122 (214)
 12 PF05528 Coronavirus_5:  Corona  58.6     6.7 0.00014   22.2   1.3   24   23-46     54-77  (82)
 13 PRK10803 tol-pal system protei  52.2      71  0.0015   21.5   6.4   50   12-66    160-214 (263)
 14 PF10774 DUF4226:  Domain of un  47.9      16 0.00036   21.9   1.9   18    9-26     67-84  (112)
 15 COG4133 CcmA ABC-type transpor  45.8      13 0.00028   24.8   1.3   18   23-40     87-105 (209)
 16 PF09279 EF-hand_like:  Phospho  41.1      39 0.00084   18.2   2.6   18   14-31     17-34  (83)
 17 PF14894 Lsm_C:  Lsm C-terminal  40.4      28 0.00061   19.1   1.9   16   48-63     46-61  (64)
 18 PF11163 DUF2947:  Protein of u  34.6      44 0.00095   21.3   2.4   18   50-67     33-50  (153)
 19 PF09764 Nt_Gln_amidase:  N-ter  31.9      35 0.00076   22.2   1.7   17   24-42      4-20  (184)
 20 PF02671 PAH:  Paired amphipath  30.7      47   0.001   16.1   1.7   19    4-22     28-46  (47)
 21 PF14328 DUF4385:  Domain of un  29.2      47   0.001   21.0   1.8   17   46-62     47-63  (145)
 22 PF10312 Cactin_mid:  Conserved  28.4      88  0.0019   20.3   3.1   24   16-39     82-105 (191)
 23 PF12725 DUF3810:  Protein of u  28.3 2.1E+02  0.0045   19.9   5.3   49   10-66    251-302 (318)
 24 PF01193 RNA_pol_L:  RNA polyme  27.9      32  0.0007   18.1   0.8    9    3-11     15-23  (66)
 25 PF10875 DUF2670:  Protein of u  27.9      40 0.00087   20.9   1.3   18    2-19      3-20  (139)
 26 KOG2005|consensus               27.4 2.1E+02  0.0046   22.9   5.3   49   12-60    204-252 (878)
 27 PHA02901 virus redox protein;   27.0 1.1E+02  0.0024   17.2   2.9   21    3-23     34-54  (75)
 28 PF03096 Ndr:  Ndr family;  Int  26.9 1.6E+02  0.0034   20.5   4.2   29   10-38    177-206 (283)
 29 PF07631 PSD4:  Protein of unkn  26.8      78  0.0017   19.1   2.5   53    4-66     42-106 (128)
 30 PF05288 Pox_A3L:  Poxvirus A3L  26.6 1.2E+02  0.0027   16.8   3.0   22    3-24     29-50  (70)
 31 PHA02683 ORF078 thioredoxin-li  26.6 1.1E+02  0.0023   17.3   2.8   21    3-23     34-54  (75)
 32 PF10118 Metal_hydrol:  Predict  26.6   2E+02  0.0044   19.2   5.9   63    2-66     45-112 (253)
 33 PF05843 Suf:  Suppressor of fo  25.9 2.1E+02  0.0045   19.1   5.1   49   10-63     50-99  (280)
 34 KOG0063|consensus               25.9      93   0.002   23.7   3.1   31   13-43    495-525 (592)
 35 PHA03056 putative myristoylate  24.8 1.2E+02  0.0026   19.2   3.0   30   36-65     70-99  (165)
 36 KOG2348|consensus               24.2      45 0.00098   25.4   1.3   17   50-66    375-391 (667)
 37 PRK13722 lytic transglycosylas  24.1 1.2E+02  0.0026   19.5   3.0   54    8-62     85-144 (169)
 38 PF03134 TB2_DP1_HVA22:  TB2/DP  23.9      56  0.0012   18.2   1.4   14   50-63     79-92  (94)
 39 PF07956 DUF1690:  Protein of U  23.6      41  0.0009   20.9   0.9   20   24-43    121-140 (142)
 40 PF11709 Mit_ribos_Mrp51:  Mito  23.5 1.1E+02  0.0024   21.3   3.0   26    4-29    148-173 (312)
 41 PHA02335 hypothetical protein   22.7 1.9E+02  0.0041   17.5   4.0   43   24-67      4-54  (118)
 42 PF12122 DUF3582:  Protein of u  22.7      81  0.0018   18.4   1.9   18    9-26      7-24  (101)
 43 PF13981 SopA:  SopA-like centr  22.4      99  0.0021   19.0   2.4   35   30-65     92-126 (135)
 44 PF12335 SBF2:  Myotubularin pr  21.9 1.1E+02  0.0025   20.4   2.7   24    2-26     47-70  (225)
 45 PF08708 PriCT_1:  Primase C te  21.8 1.4E+02   0.003   15.6   3.6   11   48-58     57-67  (71)
 46 PF08361 TetR_C_2:  MAATS-type   21.7 1.8E+02  0.0039   16.8   4.4   34    8-41     17-51  (121)
 47 COG5562 Phage envelope protein  21.2      57  0.0012   20.4   1.1   23    2-24     75-97  (137)
 48 PF14159 CAAD:  CAAD domains of  21.2 1.7E+02  0.0036   16.7   3.0   29   33-61     61-89  (90)
 49 COG4748 Uncharacterized conser  20.6 2.9E+02  0.0064   19.9   4.6   51   13-63    172-227 (365)
 50 PF03909 BSD:  BSD domain  ;  I  20.4 1.1E+02  0.0023   15.9   2.0   31    4-36     17-47  (62)
 51 PRK09718 hypothetical protein;  20.1 1.8E+02   0.004   22.0   3.7   19   47-65    385-403 (512)

No 1  
>KOG3589|consensus
Probab=99.81  E-value=8.7e-20  Score=119.33  Aligned_cols=61  Identities=48%  Similarity=0.822  Sum_probs=57.4

Q ss_pred             HhhChHHHHHHHHHHHhhcchhHHHHHHHHHHHhccCCH--HHHHHHHHHHHHHhcCCCCCCC
Q psy16714          8 RNTFKAGRKLFREFLRCEYSEENILFWLACEDLKKESNP--DVIEEKARFIYEDYISILSPKE   68 (68)
Q Consensus         8 ll~dp~g~~~F~~FL~~e~s~Enl~Fw~ave~~k~~~~~--~~~~~~a~~Iy~~fi~~~sp~E   68 (68)
                      |++|+.|+..|..||++|||+|||.||++||+||+....  ..+..+|+.||.+||+++||+|
T Consensus        84 L~~~~~G~~~F~~fLk~e~Seeni~FW~ace~~k~~~~~~~~~~~~~a~~i~~~f~~~~ap~~  146 (221)
T KOG3589|consen   84 LLADEAGRAVFAEFLKSEFSEENLEFWLACEEFKKRRSEKSTKMSEKAKEIYEEFSAPAAPKE  146 (221)
T ss_pred             HhhChhhHHHHHHHHHHHhhHhHHHHHHHHHHHHhCcCchhhhhhHHHHHHHHHHhccCCCCC
Confidence            459999999999999999999999999999999998765  5799999999999999999986


No 2  
>smart00315 RGS Regulator of G protein signalling domain. RGS family members are GTPase-activating proteins for heterotrimeric G-protein alpha-subunits.
Probab=99.79  E-value=4.5e-19  Score=103.67  Aligned_cols=63  Identities=46%  Similarity=0.720  Sum_probs=58.5

Q ss_pred             HHHHhhChHHHHHHHHHHHhhcchhHHHHHHHHHHHhccCCHHHHHHHHHHHHHHhcCCCCCCC
Q psy16714          5 KRTRNTFKAGRKLFREFLRCEYSEENILFWLACEDLKKESNPDVIEEKARFIYEDYISILSPKE   68 (68)
Q Consensus         5 ~~~ll~dp~g~~~F~~FL~~e~s~Enl~Fw~ave~~k~~~~~~~~~~~a~~Iy~~fi~~~sp~E   68 (68)
                      ...| .||.|+..|+.||++++|.|||.||.+|++|+.......+.+.|+.||++||.++||+|
T Consensus         3 ~~il-~d~~~~~~F~~fl~~~~~~e~l~F~~~v~~~~~~~~~~~~~~~a~~I~~~yi~~~s~~~   65 (118)
T smart00315        3 ESLL-SDPIGRLLFREFLESEFSEENLEFWLAVEEFKKAEDDEERIAKAREIYDKFLSPNAPKE   65 (118)
T ss_pred             HHHh-cChHHHHHHHHHHHHhcchHhHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHhCCCCCCc
Confidence            4444 99999999999999999999999999999999998767889999999999999999975


No 3  
>PF00615 RGS:  Regulator of G protein signaling domain;  InterPro: IPR000342 RGS (Regulator of G Protein Signalling) proteins are multi-functional, GTPase-accelerating proteins that promote GTP hydrolysis by the alpha subunit of heterotrimeric G proteins, thereby inactivating the G protein and rapidly switching off G protein-coupled receptor signalling pathways []. Upon activation by GPCRs, heterotrimeric G proteins exchange GDP for GTP, are released from the receptor, and dissociate into free, active GTP-bound alpha subunit and beta-gamma dimer, both of which activate downstream effectors. The response is terminated upon GTP hydrolysis by the alpha subunit (IPR001019 from INTERPRO), which can then bind the beta-gamma dimer (IPR001632 from INTERPRO, IPR001770 from INTERPRO) and the receptor. RGS proteins markedly reduce the lifespan of GTP-bound alpha subunits by stabilising the G protein transition state. All RGS proteins contain an 'RGS-box' (or RGS domain), which is required for activity. Some small RGS proteins such as RGS1 and RGS4 are comprised of little more than an RGS domain, while others also contain additional domains that confer further functionality []. RGS domains can be found in conjunction with a variety of domains, including: DEP for membrane targeting (IPR000591 from INTERPRO), PDZ for binding to GPCRs (IPR001478 from INTERPRO), PTB for phosphotyrosine-binding (IPR006020 from INTERPRO), RBD for Ras-binding (IPR003116 from INTERPRO), GoLoco for guanine nucleotide inhibitor activity (IPR003109 from INTERPRO), PX for phosphatidylinositol-binding (IPR001683 from INTERPRO), PXA that is associated with PX (IPR003114 from INTERPRO), PH for stimulating guanine nucleotide exchange (IPR001849 from INTERPRO), and GGL (G protein gamma subunit-like) for binding G protein beta subunits (IPR001770 from INTERPRO) []. Those RGS proteins that contain GGL domains can interact with G protein beta subunits to form novel dimers that prevent G protein gamma subunit binding and G protein alpha subunit association, thereby preventing heterotrimer formation.; GO: 0004871 signal transducer activity; PDB: 2BCJ_A 3PSC_A 3PVU_A 1YM7_B 3PVW_A 1OMW_A 2EBZ_A 2BV1_B 2GTP_D 1CMZ_A ....
Probab=99.74  E-value=2.4e-17  Score=95.44  Aligned_cols=64  Identities=36%  Similarity=0.587  Sum_probs=59.1

Q ss_pred             HHHHHhhChHHHHHHHHHHHhhcchhHHHHHHHHHHHhccCCHHHHHHHHHHHHHHhcCCCCCCC
Q psy16714          4 IKRTRNTFKAGRKLFREFLRCEYSEENILFWLACEDLKKESNPDVIEEKARFIYEDYISILSPKE   68 (68)
Q Consensus         4 ~~~~ll~dp~g~~~F~~FL~~e~s~Enl~Fw~ave~~k~~~~~~~~~~~a~~Iy~~fi~~~sp~E   68 (68)
                      ++.+| .||.|+..|..||+++++.+||.||.+|+.|+...+...+...|..||++||+++||.+
T Consensus         2 ~~~il-~~~~~~~~F~~Fl~~~~~~~~l~F~~~v~~~~~~~~~~~~~~~a~~I~~~fi~~~s~~~   65 (118)
T PF00615_consen    2 FEKIL-EDPEGLELFKEFLEKENCEENLQFWLEVEEFKSSESEEQRKKLAQQIYNKFISPGSPNE   65 (118)
T ss_dssp             HHHHH-HSHHHHHHHHHHHHHTTTTHHHHHHHHHHHHHTSCSHHHHHHHHHHHHHHHTSTTSTTC
T ss_pred             HHHHH-CChHHHHHHHHHHhHCCCHHHHHHHHHHHHHHhhcchhhHHHHHHHHHHHHhccccccc
Confidence            44555 99999999999999999999999999999999997777889999999999999999975


No 4  
>PF09128 RGS-like:  Regulator of G protein signalling-like domain;  InterPro: IPR015212 This entry represents a domain consisting of twelve helices that fold into a compact structure that contains the overall structural scaffold observed in other regulator of G protein signalling (RGS) proteins and three additional helical elements that pack closely to it. Helices 1-9 comprise the RGS fold, in which helices 4-7 form a classic antiparallel bundle adjacent to the other helices. Like other RGS structures, helices 7 and 8 span the length of the folded domain and form essentially one continuous helix with a kink in the middle. Helices 10-12 form an apparently stable C-terminal extension of the structural domain, and although other RGS proteins lack this structure, these elements are intimately associated with the rest of the structural framework by hydrophobic interactions. This domain binds to active G-alpha proteins, promoting GTP hydrolysis by the alpha subunit of heterotrimeric G proteins, thereby inactivating the G protein and rapidly switching off G protein-coupled receptor signalling pathways []. ; GO: 0005089 Rho guanyl-nucleotide exchange factor activity, 0005737 cytoplasm; PDB: 3CX6_B 3CX8_B 3CX7_B 1HTJ_F 1SHZ_C 3AB3_D 1IAP_A.
Probab=98.30  E-value=3.3e-06  Score=54.88  Aligned_cols=62  Identities=15%  Similarity=0.246  Sum_probs=51.9

Q ss_pred             HHHHHhhChHHHHHHHHHHHhhcchhHHHHHHHHHHHhccCCHHHHHHHHHHHHHHhcCCCCCC
Q psy16714          4 IKRTRNTFKAGRKLFREFLRCEYSEENILFWLACEDLKKESNPDVIEEKARFIYEDYISILSPK   67 (68)
Q Consensus         4 ~~~~ll~dp~g~~~F~~FL~~e~s~Enl~Fw~ave~~k~~~~~~~~~~~a~~Iy~~fi~~~sp~   67 (68)
                      ++.+ .+.|.-...|.+|+-..+.-.-|+|++-.+-|+..+.+ ++.+.|.+||.+|+.++||.
T Consensus         8 l~~L-k~rPaHlaVFl~yv~sq~dP~~LLfyL~~d~yk~~~~K-e~rk~~~ei~stFL~~~ApL   69 (188)
T PF09128_consen    8 LELL-KSRPAHLAVFLNYVLSQFDPSPLLFYLIADLYKQGNAK-EMRKWAYEIHSTFLDPGAPL   69 (188)
T ss_dssp             HHHH-HC-HHHHHHHHHHHHHHS-SHHHHHHHHHHHHTTS-TT-CHHHHHHHHHHHHTSTT-TT
T ss_pred             HHHH-hcCcHHHHHHHHHHHhcCCcchHHHHHHHHHHhccCHH-HHHHHHHHHHHHHcCCCCCc
Confidence            4444 48999999999999999988889999999999997654 78999999999999999996


No 5  
>KOG3590|consensus
Probab=97.61  E-value=0.00014  Score=52.78  Aligned_cols=39  Identities=21%  Similarity=0.298  Sum_probs=34.6

Q ss_pred             HHHHHhhChHHHHHHHHHHHhhcchhHHHHHHHHHHHhcc
Q psy16714          4 IKRTRNTFKAGRKLFREFLRCEYSEENILFWLACEDLKKE   43 (68)
Q Consensus         4 ~~~~ll~dp~g~~~F~~FL~~e~s~Enl~Fw~ave~~k~~   43 (68)
                      +.++| .||..+.+|-+||...-.-..|.||+++|.||+.
T Consensus       115 L~~il-~D~~~LSYFiQYLD~r~~L~LIKFwl~~E~FKs~  153 (602)
T KOG3590|consen  115 LEQVL-HDTIVLSYFIQYLDLRRMLHLIKFWLEAESFKST  153 (602)
T ss_pred             HHHHh-ccchHHHHHHHHHHHHhHHHHHHHHHhhhhhhHH
Confidence            34444 9999999999999999999999999999999975


No 6  
>KOG3590|consensus
Probab=97.08  E-value=0.0012  Score=48.05  Aligned_cols=58  Identities=19%  Similarity=0.423  Sum_probs=47.3

Q ss_pred             hhChHHHHHHHHHHHhhcchhHHHHHHHHHHHhccC-------CHHHHHHHHHHHHHHhcCCCCC
Q psy16714          9 NTFKAGRKLFREFLRCEYSEENILFWLACEDLKKES-------NPDVIEEKARFIYEDYISILSP   66 (68)
Q Consensus         9 l~dp~g~~~F~~FL~~e~s~Enl~Fw~ave~~k~~~-------~~~~~~~~a~~Iy~~fi~~~sp   66 (68)
                      |...-.+-+|..|++++-..+-|.||++|.+|++.-       +.++...-|-.||++|++-.|.
T Consensus       322 L~~e~aLfyF~Ey~e~~~~~~~lqFwl~adnF~~~la~~E~~~d~~eAq~DAMilYdKYFSlQat  386 (602)
T KOG3590|consen  322 LFCESALFYFSEYMEKEDAVNILQFWLAADNFQSQLAAKEGQYDGQEAQNDAMILYDKYFSLQAT  386 (602)
T ss_pred             HhhhHHHHHHHHHHHHHhHHHHHHHHHHhhhHHHhhhhcccccchhhhhcchhHHHHHHhhhhcc
Confidence            355557789999999999999999999999999852       2335677788999999987664


No 7  
>KOG0986|consensus
Probab=97.03  E-value=0.0019  Score=47.53  Aligned_cols=57  Identities=26%  Similarity=0.335  Sum_probs=46.6

Q ss_pred             hhChHHHHHHHHHHHh-hcchhHHHHHHHHHHHhccCCHHHHHHHHHHHHHHhcCCCCC
Q psy16714          9 NTFKAGRKLFREFLRC-EYSEENILFWLACEDLKKESNPDVIEEKARFIYEDYISILSP   66 (68)
Q Consensus         9 l~dp~g~~~F~~FL~~-e~s~Enl~Fw~ave~~k~~~~~~~~~~~a~~Iy~~fi~~~sp   66 (68)
                      ..-|.|+.+|++||.. +--.--+.||.+|++|-...+ +....+++.|+..||....|
T Consensus        58 ~~QpIGrlLFr~Fc~~~~~~~~~i~Fl~~v~eYE~~~~-~~r~~~~~~i~~~~l~~~~~  115 (591)
T KOG0986|consen   58 EKQPIGRLLFRDFCENRPELRPAIEFLEEVKEYEKADD-DERKDKARQILDAYLMKELL  115 (591)
T ss_pred             ccCcHHHHHHHHHHhcCchhhhHHHHHHHHHHHhcCCc-ccchhhHHHHHHHHhhcccc
Confidence            3579999999999996 223345899999999999884 46789999999999986544


No 8  
>PF15182 OTOS:  Otospiralin
Probab=86.08  E-value=1.6  Score=24.03  Aligned_cols=30  Identities=20%  Similarity=0.372  Sum_probs=25.5

Q ss_pred             HHHHHHHHHHhccCCHHHHHHHHHHHHHHh
Q psy16714         31 ILFWLACEDLKKESNPDVIEEKARFIYEDY   60 (68)
Q Consensus        31 l~Fw~ave~~k~~~~~~~~~~~a~~Iy~~f   60 (68)
                      -+||-=||.|+....-.++..+|+..|.-|
T Consensus        25 sDFW~YveyFrtlGAY~~indmARtfFAh~   54 (69)
T PF15182_consen   25 SDFWNYVEYFRTLGAYNQINDMARTFFAHF   54 (69)
T ss_pred             hHHHHHHHHHHHhccHHHHHHHHHHHHhhC
Confidence            379999999999877778899999887665


No 9  
>PF02184 HAT:  HAT (Half-A-TPR) repeat;  InterPro: IPR003107 The HAT (Half A TPR) repeat has a repetitive pattern characterised by three aromatic residues with a conserved spacing. They are structurally and sequentially similar to TPRs (tetratricopeptide repeats), though they lack the highly conserved alanine and glycine residues found in TPRs. The number of HAT repeats found in different proteins varies between 9 and 12. HAT-repeat-containing proteins appear to be components of macromolecular complexes that are required for RNA processing []. The repeats may be involved in protein-protein interactions. The HAT motif has striking structural similarities to HEAT repeats (IPR000357 from INTERPRO), being of a similar length and consisting of two short helices connected by a loop domain, as in HEAT repeats.; GO: 0006396 RNA processing, 0005622 intracellular
Probab=80.72  E-value=1.3  Score=21.05  Aligned_cols=15  Identities=33%  Similarity=0.627  Sum_probs=12.2

Q ss_pred             HHHHHHHHHHhcCCC
Q psy16714         50 EEKARFIYEDYISIL   64 (68)
Q Consensus        50 ~~~a~~Iy~~fi~~~   64 (68)
                      ...|+.||++||.-.
T Consensus         3 ~dRAR~IyeR~v~~h   17 (32)
T PF02184_consen    3 FDRARSIYERFVLVH   17 (32)
T ss_pred             HHHHHHHHHHHHHhC
Confidence            368999999998753


No 10 
>PF11467 LEDGF:  Lens epithelium-derived growth factor (LEDGF) ;  InterPro: IPR021567  LEDGF is a chromatin-associated protein that protects cells from stress-induced apoptosis. It is the binding partner of HIV-1 integrase in human cells. The integrase binding domain (IBD) of LEDGF is a compact right-handed bundle composed of five alpha-helices. The residues essential for the interaction with the integrase are present in the inter-helical loop regions of the bundle structure. ; PDB: 3F9K_K 3HPG_G 3U88_C 3HPH_H 2B4J_D 1Z9E_A.
Probab=62.00  E-value=14  Score=21.90  Aligned_cols=18  Identities=22%  Similarity=0.392  Sum_probs=14.3

Q ss_pred             CCHHHHHHHHHHHHHHhc
Q psy16714         44 SNPDVIEEKARFIYEDYI   61 (68)
Q Consensus        44 ~~~~~~~~~a~~Iy~~fi   61 (68)
                      .....+.++|..||++|-
T Consensus        59 ~gn~~Ir~KA~~lYnkfK   76 (106)
T PF11467_consen   59 KGNQQIRKKATELYNKFK   76 (106)
T ss_dssp             TT-HHHHHHHHHHHHHHH
T ss_pred             hccHHHHHHHHHHHHHHH
Confidence            334689999999999984


No 11 
>COG4790 EscR Type III secretory pathway, component EscR [Intracellular trafficking and secretion]
Probab=60.78  E-value=13  Score=24.82  Aligned_cols=32  Identities=25%  Similarity=0.382  Sum_probs=27.2

Q ss_pred             hChHHHHHHHHHHHhhcchhHHHHHHHHHHHh
Q psy16714         10 TFKAGRKLFREFLRCEYSEENILFWLACEDLK   41 (68)
Q Consensus        10 ~dp~g~~~F~~FL~~e~s~Enl~Fw~ave~~k   41 (68)
                      -|..|+.-+|+||+|...+|+..|+....+=+
T Consensus        91 ~~~~~l~pyR~fL~k~t~~~~~~FF~~~~~~~  122 (214)
T COG4790          91 FDDKGLSPYRDFLKKNTEEEEVSFFERSAQKK  122 (214)
T ss_pred             HHHhhhhHHHHHHHHhCcHHHHHHHHHHHHHh
Confidence            45679999999999999999999999876433


No 12 
>PF05528 Coronavirus_5:  Coronavirus gene 5 protein;  InterPro: IPR008458 Infectious bronchitis virus, a member of Coronaviridae family, has a single-stranded positive-sense RNA genome, which is 27 kb in length. Gene 5 contains two (5a and 5b) open reading frames. The function of the 5a and 5b proteins is unknown [].
Probab=58.59  E-value=6.7  Score=22.16  Aligned_cols=24  Identities=25%  Similarity=0.487  Sum_probs=18.7

Q ss_pred             HhhcchhHHHHHHHHHHHhccCCH
Q psy16714         23 RCEYSEENILFWLACEDLKKESNP   46 (68)
Q Consensus        23 ~~e~s~Enl~Fw~ave~~k~~~~~   46 (68)
                      .+..+..||+-|.+|..+.+.+..
T Consensus        54 ~eH~~nNnllSw~av~~le~qt~~   77 (82)
T PF05528_consen   54 EEHQSNNNLLSWQAVKQLERQTPQ   77 (82)
T ss_pred             HHHhcccchHhHHHHHHHHhhCcc
Confidence            445577889999999999887643


No 13 
>PRK10803 tol-pal system protein YbgF; Provisional
Probab=52.21  E-value=71  Score=21.53  Aligned_cols=50  Identities=18%  Similarity=0.371  Sum_probs=33.6

Q ss_pred             hHHHHHHHHHHHhhcch---hHHHHHHHHHHHhccCCHHHHHHHHHHHHHHhcC--CCCC
Q psy16714         12 KAGRKLFREFLRCEYSE---ENILFWLACEDLKKESNPDVIEEKARFIYEDYIS--ILSP   66 (68)
Q Consensus        12 p~g~~~F~~FL~~e~s~---Enl~Fw~ave~~k~~~~~~~~~~~a~~Iy~~fi~--~~sp   66 (68)
                      ......|..|++..=.+   .|..||++--.|....     ...|...|.+++.  |++|
T Consensus       160 ~~Ai~af~~fl~~yP~s~~a~~A~y~LG~~y~~~g~-----~~~A~~~f~~vv~~yP~s~  214 (263)
T PRK10803        160 DDAIVAFQNFVKKYPDSTYQPNANYWLGQLNYNKGK-----KDDAAYYFASVVKNYPKSP  214 (263)
T ss_pred             HHHHHHHHHHHHHCcCCcchHHHHHHHHHHHHHcCC-----HHHHHHHHHHHHHHCCCCc
Confidence            34577899999865444   5899999998887653     2345555666662  5554


No 14 
>PF10774 DUF4226:  Domain of unknown function (DUF4226);  InterPro: IPR019710 This entry represents an uncharacterised family of proteins belonging to Mycobacteria. It was previously incorrectly annotated as Biofilm regulator BssS (also known as YliH). 
Probab=47.87  E-value=16  Score=21.89  Aligned_cols=18  Identities=33%  Similarity=0.488  Sum_probs=15.4

Q ss_pred             hhChHHHHHHHHHHHhhc
Q psy16714          9 NTFKAGRKLFREFLRCEY   26 (68)
Q Consensus         9 l~dp~g~~~F~~FL~~e~   26 (68)
                      +..|.|...|..||....
T Consensus        67 ldTPaG~r~f~~fL~~kl   84 (112)
T PF10774_consen   67 LDTPAGAREFQRFLRGKL   84 (112)
T ss_pred             cCCHHHHHHHHHHHHHHH
Confidence            588999999999998543


No 15 
>COG4133 CcmA ABC-type transport system involved in cytochrome c biogenesis, ATPase component [Posttranslational modification, protein turnover, chaperones]
Probab=45.82  E-value=13  Score=24.78  Aligned_cols=18  Identities=28%  Similarity=0.545  Sum_probs=13.1

Q ss_pred             Hhhcc-hhHHHHHHHHHHH
Q psy16714         23 RCEYS-EENILFWLACEDL   40 (68)
Q Consensus        23 ~~e~s-~Enl~Fw~ave~~   40 (68)
                      +.+.+ .|||.||.++-+.
T Consensus        87 K~eLTa~ENL~F~~~~~~~  105 (209)
T COG4133          87 KTELTALENLHFWQRFHGS  105 (209)
T ss_pred             cchhhHHHHHHHHHHHhCC
Confidence            34443 4999999997765


No 16 
>PF09279 EF-hand_like:  Phosphoinositide-specific phospholipase C, efhand-like;  InterPro: IPR015359 This domain is predominantly found in the enzyme phosphoinositol-specific phospholipase C. It adopts a structure consisting of a core of four alpha helices, in an EF like fold, and is required for functioning of the enzyme []. ; PDB: 3OHM_B 3QR0_A 2ZKM_X 2FJU_B 3QR1_D 1DJW_B 1DJI_B 1DJG_B 1QAS_B 2ISD_B ....
Probab=41.13  E-value=39  Score=18.20  Aligned_cols=18  Identities=39%  Similarity=0.556  Sum_probs=11.1

Q ss_pred             HHHHHHHHHHhhcchhHH
Q psy16714         14 GRKLFREFLRCEYSEENI   31 (68)
Q Consensus        14 g~~~F~~FL~~e~s~Enl   31 (68)
                      ....|+.||.++-.+.++
T Consensus        17 t~~~f~~FL~~eQ~~~~~   34 (83)
T PF09279_consen   17 TAEEFRRFLREEQGEPRL   34 (83)
T ss_dssp             EHHHHHHHHHHTSS-TTS
T ss_pred             CHHHHHHHHHHHhccccC
Confidence            356777788766666444


No 17 
>PF14894 Lsm_C:  Lsm C-terminal; PDB: 1M5Q_1.
Probab=40.41  E-value=28  Score=19.05  Aligned_cols=16  Identities=25%  Similarity=0.526  Sum_probs=12.8

Q ss_pred             HHHHHHHHHHHHhcCC
Q psy16714         48 VIEEKARFIYEDYISI   63 (68)
Q Consensus        48 ~~~~~a~~Iy~~fi~~   63 (68)
                      -+-+++..||++||..
T Consensus        46 PlAerv~~ly~eyi~~   61 (64)
T PF14894_consen   46 PLAERVYDLYNEYIEK   61 (64)
T ss_dssp             HHHHHHHHHHHHHHH-
T ss_pred             hHHHHHHHHHHHHHHh
Confidence            3678999999999864


No 18 
>PF11163 DUF2947:  Protein of unknown function (DUF2947);  InterPro: IPR021334  This family of proteins with unknown function appears to be restricted to Gammaproteobacteria. 
Probab=34.55  E-value=44  Score=21.28  Aligned_cols=18  Identities=28%  Similarity=0.473  Sum_probs=16.4

Q ss_pred             HHHHHHHHHHhcCCCCCC
Q psy16714         50 EEKARFIYEDYISILSPK   67 (68)
Q Consensus        50 ~~~a~~Iy~~fi~~~sp~   67 (68)
                      ...|..|++++|+..+|.
T Consensus        33 ~~~a~qvW~~~IS~~~~~   50 (153)
T PF11163_consen   33 QARAAQVWKENISNQSPH   50 (153)
T ss_pred             HHHHHHHHHHHhccCCCC
Confidence            688999999999999985


No 19 
>PF09764 Nt_Gln_amidase:  N-terminal glutamine amidase;  InterPro: IPR019161 This entry represents the Protein N-terminal glutamine amidohydrolase (NtQ-amidase) family of proteins. They contain a region of approximately 200 residues carrying several distinctive motifs including a WDYHV motif and one of three cysteines. Protein N-terminal glutamine amidohydrolase is responsible for degradation of N-terminal glutamine [].; GO: 0016811 hydrolase activity, acting on carbon-nitrogen (but not peptide) bonds, in linear amides; PDB: 3C9Q_A.
Probab=31.91  E-value=35  Score=22.25  Aligned_cols=17  Identities=41%  Similarity=0.786  Sum_probs=11.8

Q ss_pred             hhcchhHHHHHHHHHHHhc
Q psy16714         24 CEYSEENILFWLACEDLKK   42 (68)
Q Consensus        24 ~e~s~Enl~Fw~ave~~k~   42 (68)
                      .-+||||+  |.-|+.++.
T Consensus         4 scYCEENV--ykLce~~~~   20 (184)
T PF09764_consen    4 SCYCEENV--YKLCEYLRE   20 (184)
T ss_dssp             TT-HHHHH--HHHHHHHHH
T ss_pred             ccccHHHH--HHHHHHHHh
Confidence            35899998  777777743


No 20 
>PF02671 PAH:  Paired amphipathic helix repeat;  InterPro: IPR003822 This family contains the paired amphipathic helix (PAH) repeat. The family contains the eukaryotic Sin3 proteins, which have at least three PAH domains (PAH1, PAH2, and PAH3). Sin3 proteins are components of a co-repressor complex that silences transcription, playing important roles in the transition between proliferation and differentiation. Sin3 proteins are recruited to the DNA by various DNA-binding transcription factors such as the Mad family of repressors, Mnt/Rox, PLZF, MeCP2, p53, REST/NRSF, MNFbeta, Sp1, TGIF and Ume6 []. Sin3 acts as a scaffold protein that in turn recruits histone-binding proteins RbAp46/RbAp48 and histone deacetylases HDAC1/HDAC2, which deacetylate the core histones resulting in a repressed state of the chromatin []. The PAH domains are protein-protein interaction domains through which Sin3 fulfils its role as a scaffold. The PAH2 domain of Sin3 can interact with a wide range of unrelated and structurally diverse transcription factors that bind using different interaction motifs. For example, the Sin3 PAH2 domain can interact with the unrelated Mad and HBP1 factors using alternative interaction motifs that involve binding in opposite helical orientations [].; GO: 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus; PDB: 1S5Q_B 2L9S_B 1G1E_B 1S5R_B 2CR7_A 2CZY_A 2LD7_B 2RMR_A 2RMS_A 1PD7_A ....
Probab=30.71  E-value=47  Score=16.15  Aligned_cols=19  Identities=16%  Similarity=0.156  Sum_probs=12.5

Q ss_pred             HHHHHhhChHHHHHHHHHH
Q psy16714          4 IKRTRNTFKAGRKLFREFL   22 (68)
Q Consensus         4 ~~~~ll~dp~g~~~F~~FL   22 (68)
                      +...+-.+|.....|..||
T Consensus        28 v~~Ll~~hpdLl~~F~~Fl   46 (47)
T PF02671_consen   28 VSELLRGHPDLLEEFNRFL   46 (47)
T ss_dssp             HHHHTTT-HHHHHHHHHHS
T ss_pred             HHHHHccCHHHHHHHHhhC
Confidence            3344456888888888875


No 21 
>PF14328 DUF4385:  Domain of unknown function (DUF4385)
Probab=29.16  E-value=47  Score=20.98  Aligned_cols=17  Identities=29%  Similarity=0.700  Sum_probs=12.7

Q ss_pred             HHHHHHHHHHHHHHhcC
Q psy16714         46 PDVIEEKARFIYEDYIS   62 (68)
Q Consensus        46 ~~~~~~~a~~Iy~~fi~   62 (68)
                      +....+.|..||..|+.
T Consensus        47 p~iA~~Ss~kIy~~Fl~   63 (145)
T PF14328_consen   47 PEIARESSEKIYEMFLD   63 (145)
T ss_pred             HHHHHHHHHHHHHHHHH
Confidence            45667888888888763


No 22 
>PF10312 Cactin_mid:  Conserved mid region of cactin;  InterPro: IPR018816  This entry represents the conserved central domain of a family of proteins referred to as cactins. It contains two of three predicted coiled-coil motifs. Most proteins containing this domain also have IPR019134 from INTERPRO at the C-terminal end. Upstream of this domain in Drosophila proteins are a serine-rich region, some non-typical RD motifs and three predicted bipartite nuclear localisation signals, none of which are well-conserved. Cactin associates with IkappaB-cactus as one of the intracellular members of the Rel (NF-kappaB) pathway which is conserved in invertebrates and vertebrates. In mammals, this pathway controls the activities of the immune and inflammatory response genes as well as viral genes, and is critical for cell growth and survival. In Drosophila, the Rel pathway functions in the innate cellular and humoral immune response, in muscle development, and in the establishment of dorsal-ventral polarity in the early embryo []. 
Probab=28.38  E-value=88  Score=20.27  Aligned_cols=24  Identities=25%  Similarity=0.337  Sum_probs=17.1

Q ss_pred             HHHHHHHHhhcchhHHHHHHHHHH
Q psy16714         16 KLFREFLRCEYSEENILFWLACED   39 (68)
Q Consensus        16 ~~F~~FL~~e~s~Enl~Fw~ave~   39 (68)
                      .-...|+.-+....|..||.++--
T Consensus        82 ~dIk~y~~Le~~~~n~~fW~~~~~  105 (191)
T PF10312_consen   82 EDIKVYLELERDNKNREFWEAMLV  105 (191)
T ss_pred             HHHHHHHHHcccccHHHHHHHHHH
Confidence            344556776777779999998753


No 23 
>PF12725 DUF3810:  Protein of unknown function (DUF3810);  InterPro: IPR024294 This family of bacterial proteins is functionally uncharacterised. Proteins in this family are typically between 333 and 377 amino acids in length and contain a conserved HEXXH sequence motif that is characteristic of metallopeptidases. This family may therefore belong to an as yet uncharacterised family of peptidase enzymes.
Probab=28.34  E-value=2.1e+02  Score=19.92  Aligned_cols=49  Identities=27%  Similarity=0.364  Sum_probs=28.9

Q ss_pred             hChHHHHHHHHHHH---hhcchhHHHHHHHHHHHhccCCHHHHHHHHHHHHHHhcCCCCC
Q psy16714         10 TFKAGRKLFREFLR---CEYSEENILFWLACEDLKKESNPDVIEEKARFIYEDYISILSP   66 (68)
Q Consensus        10 ~dp~g~~~F~~FL~---~e~s~Enl~Fw~ave~~k~~~~~~~~~~~a~~Iy~~fi~~~sp   66 (68)
                      .||.....+..-+.   +.--.++-.||..-++        .+...+..+|++|++.|.-
T Consensus       251 ~d~e~~~~l~~~l~~~v~~d~~~~~~fW~~y~~--------~i~~~~~~~yd~yLKaN~q  302 (318)
T PF12725_consen  251 KDPEAYKELYSQLSPGVKKDLKENRAFWQKYEG--------PIEEVSDFVYDTYLKANNQ  302 (318)
T ss_pred             cCHHHHHHHHHhCCHHHHHHHHHHHHHHHHhcc--------hHHHHHHHHHHHHHHhcCc
Confidence            45655555543333   1112355566655442        3567789999999998864


No 24 
>PF01193 RNA_pol_L:  RNA polymerase Rpb3/Rpb11 dimerisation domain;  InterPro: IPR011261 DNA-directed RNA polymerases 2.7.7.6 from EC (also known as DNA-dependent RNA polymerases) are responsible for the polymerisation of ribonucleotides into a sequence complementary to the template DNA. In eukaryotes, there are three different forms of DNA-directed RNA polymerases transcribing different sets of genes. Most RNA polymerases are multimeric enzymes and are composed of a variable number of subunits. The core RNA polymerase complex consists of five subunits (two alpha, one beta, one beta-prime and one omega) and is sufficient for transcription elongation and termination but is unable to initiate transcription. Transcription initiation from promoter elements requires a sixth, dissociable subunit called a sigma factor, which reversibly associates with the core RNA polymerase complex to form a holoenzyme []. The core RNA polymerase complex forms a "crab claw"-like structure with an internal channel running along the full length []. The key functional sites of the enzyme, as defined by mutational and cross-linking analysis, are located on the inner wall of this channel. RNA synthesis follows after the attachment of RNA polymerase to a specific site, the promoter, on the template DNA strand. The RNA synthesis process continues until a termination sequence is reached. The RNA product, which is synthesised in the 5' to 3'direction, is known as the primary transcript. Eukaryotic nuclei contain three distinct types of RNA polymerases that differ in the RNA they synthesise:  RNA polymerase I: located in the nucleoli, synthesises precursors of most ribosomal RNAs. RNA polymerase II: occurs in the nucleoplasm, synthesises mRNA precursors.  RNA polymerase III: also occurs in the nucleoplasm, synthesises the precursors of 5S ribosomal RNA, the tRNAs, and a variety of other small nuclear and cytosolic RNAs.   Eukaryotic cells are also known to contain separate mitochondrial and chloroplast RNA polymerases. Eukaryotic RNA polymerases, whose molecular masses vary in size from 500 to 700 kDa, contain two non-identical large (>100 kDa) subunits and an array of up to 12 different small (less than 50 kDa) subunits. RNA polymerase (RNAP) II, which is responsible for all mRNA synthesis in eukaryotes, consists of 12 subunits. Subunits Rpb3 and Rpb11 form a heterodimer that is functionally analogous to the archaeal RNAP D/L heterodimer, and to the prokaryotic RNAP alpha subunit (RpoA) homodimer. In each case, they play a key role in RNAP assembly by forming a platform on which the catalytic subunits (eukaryotic Rpb1/Rpb2, and prokaryotic beta/beta') can interact []. These different subunits share regions of homology required for dimerisation. In eukaryotic Rpb11 and archaeal L subunits, the dimerisation domain consists of a contiguous Rpb11-like domain, whereas in eukaryotic Rpb3, archaeal D and bacterial RpoA subunits (IPR011263 from INTERPRO), the dimerisation domain consists of the Rpb11-like domain interrupted by an insert domain. In the prokaryotic alpha subunit, this dimerisation domain is the N-terminal domain [].; GO: 0003899 DNA-directed RNA polymerase activity, 0046983 protein dimerization activity, 0006351 transcription, DNA-dependent; PDB: 1HQM_B 1YNJ_A 1YNN_A 1I6V_A 2GHO_A 3HKZ_V 2PMZ_X 2PA8_L 3GTK_C 1TWH_C ....
Probab=27.91  E-value=32  Score=18.07  Aligned_cols=9  Identities=11%  Similarity=0.095  Sum_probs=6.5

Q ss_pred             HHHHHHhhC
Q psy16714          3 CIKRTRNTF   11 (68)
Q Consensus         3 ~~~~~ll~d   11 (68)
                      ++||+|+++
T Consensus        15 ~LRr~ll~~   23 (66)
T PF01193_consen   15 ALRRILLSE   23 (66)
T ss_dssp             HHHHHHHSS
T ss_pred             HHHHHHHhc
Confidence            577777776


No 25 
>PF10875 DUF2670:  Protein of unknown function (DUF2670);  InterPro: IPR022714  This bacterial family of proteins has no known function and appears to be restricted to Rickettsiaceae. 
Probab=27.88  E-value=40  Score=20.89  Aligned_cols=18  Identities=22%  Similarity=0.244  Sum_probs=14.3

Q ss_pred             hHHHHHHhhChHHHHHHH
Q psy16714          2 QCIKRTRNTFKAGRKLFR   19 (68)
Q Consensus         2 ~~~~~~ll~dp~g~~~F~   19 (68)
                      ||+||++..||-|.-...
T Consensus         3 qalrrlIaaNPMg~flw~   20 (139)
T PF10875_consen    3 QALRRLIAANPMGFFLWS   20 (139)
T ss_pred             HHHHHHHhhCCchhhHHH
Confidence            799999999998865443


No 26 
>KOG2005|consensus
Probab=27.37  E-value=2.1e+02  Score=22.92  Aligned_cols=49  Identities=16%  Similarity=0.258  Sum_probs=28.2

Q ss_pred             hHHHHHHHHHHHhhcchhHHHHHHHHHHHhccCCHHHHHHHHHHHHHHh
Q psy16714         12 KAGRKLFREFLRCEYSEENILFWLACEDLKKESNPDVIEEKARFIYEDY   60 (68)
Q Consensus        12 p~g~~~F~~FL~~e~s~Enl~Fw~ave~~k~~~~~~~~~~~a~~Iy~~f   60 (68)
                      -.+......|+...--..-.++..+|-.|--.++...+.+-|..||-+|
T Consensus       204 ve~id~l~~~Vd~~n~~RvclYl~sc~~~lP~Pdd~~ll~~a~~IYlKf  252 (878)
T KOG2005|consen  204 VEGIDLLLDYVDEHNYQRVCLYLTSCVPLLPGPDDVALLRTALKIYLKF  252 (878)
T ss_pred             hhhHhHHHHHhhhhhHHHHHHHHHHHhhcCCCchhhHHHHHHHHHHHHH
Confidence            3344444444443333333455555555555554446788999999887


No 27 
>PHA02901 virus redox protein; Provisional
Probab=27.00  E-value=1.1e+02  Score=17.24  Aligned_cols=21  Identities=19%  Similarity=0.385  Sum_probs=16.1

Q ss_pred             HHHHHHhhChHHHHHHHHHHH
Q psy16714          3 CIKRTRNTFKAGRKLFREFLR   23 (68)
Q Consensus         3 ~~~~~ll~dp~g~~~F~~FL~   23 (68)
                      .++-+|+++|.-......||.
T Consensus        34 ~ik~~L~sqP~k~~iLk~FL~   54 (75)
T PHA02901         34 TIRAILESQPYKKKILKQFLA   54 (75)
T ss_pred             HHHHHHHcCchHHHHHHHHHH
Confidence            456678888888888887776


No 28 
>PF03096 Ndr:  Ndr family;  InterPro: IPR004142 This family consists of proteins from different gene families: Ndr1/RTP/Drg1, Ndr2, and Ndr3. Their similarity was previously noted []. The precise molecular and cellular function of members of this family is still unknown, yet they are known to be involved in cellular differentiation events. The Ndr1 group was the first to be discovered. Their expression is repressed by the proto-oncogenes N-myc and c-myc, and in line with this observation, Ndr1 protein expression is down-regulated in neoplastic cells, and is reactivated when differentiation is induced by chemicals such as retinoic acid. Ndr2 and Ndr3 expression is not under the control of N-myc or c-myc. Ndr1 expression is also activated by several chemicals: tunicamycin and homocysteine induce Ndr1 in human umbilical endothelial cells; nickel induces Ndr1 in several cell types. Members of this family are found in wide variety of multicellular eukaryotes, including an Ndr1 type protein in Helianthus annuus (Common sunflower), known as Sf21. Interestingly, the highest scoring matches in the noise are all alpha/beta hydrolases (IPR000073 from INTERPRO), suggesting that this family may have an enzymatic function.; PDB: 2QMQ_A 2XMR_B 2XMQ_B 2XMS_A.
Probab=26.89  E-value=1.6e+02  Score=20.50  Aligned_cols=29  Identities=17%  Similarity=0.229  Sum_probs=22.8

Q ss_pred             hChHHHHHHHHHHHhhcchhHH-HHHHHHH
Q psy16714         10 TFKAGRKLFREFLRCEYSEENI-LFWLACE   38 (68)
Q Consensus        10 ~dp~g~~~F~~FL~~e~s~Enl-~Fw~ave   38 (68)
                      +++...+.|++.+.+.-+..|+ .||.+..
T Consensus       177 ~n~Dlv~~yr~~l~~~~Np~Nl~~f~~sy~  206 (283)
T PF03096_consen  177 NNSDLVQTYRQHLDERINPKNLALFLNSYN  206 (283)
T ss_dssp             CT-HHHHHHHHHHHT-TTHHHHHHHHHHHH
T ss_pred             ccHHHHHHHHHHHhcCCCHHHHHHHHHHHh
Confidence            6788899999999999998887 7877764


No 29 
>PF07631 PSD4:  Protein of unknown function (DUF1592);  InterPro: IPR013042  A region of similarity shared by several Rhodopirellula baltica cytochrome-like proteins that are predicted to be secreted. These proteins also contain IPR011478 from INTERPRO, IPR013036 from INTERPRO, IPR013039 from INTERPRO and IPR013043 from INTERPRO.
Probab=26.82  E-value=78  Score=19.15  Aligned_cols=53  Identities=17%  Similarity=0.334  Sum_probs=29.7

Q ss_pred             HHHHHhhChHHHHHHHHHHHhhcchhHHHHHHHHHHHhccCC------------HHHHHHHHHHHHHHhcCCCCC
Q psy16714          4 IKRTRNTFKAGRKLFREFLRCEYSEENILFWLACEDLKKESN------------PDVIEEKARFIYEDYISILSP   66 (68)
Q Consensus         4 ~~~~ll~dp~g~~~F~~FL~~e~s~Enl~Fw~ave~~k~~~~------------~~~~~~~a~~Iy~~fi~~~sp   66 (68)
                      ++|.| +||..+.....|..+         |+..+.+.....            ...+...+...+..-+..|.|
T Consensus        42 ~~RML-~dpr~~~~~~~F~~q---------WL~l~~~~~~~~d~~~~p~~~~~l~~~m~~E~~~f~~~vl~~n~~  106 (128)
T PF07631_consen   42 AERML-ADPRARRFVERFFRQ---------WLDLDRLDSIVKDPEKFPEFSPDLREAMREETDEFFEHVLEENGS  106 (128)
T ss_pred             HHHHH-cCccHHHHHHHHHHH---------HhCCCcccccCCChhhccccCHHHHHHHHHHHHHHHHHHHHcCCC
Confidence            34444 999888888877765         555554444321            123445555555555555543


No 30 
>PF05288 Pox_A3L:  Poxvirus A3L Protein;  InterPro: IPR007952 This family consists of several poxvirus A3L or A2_5L proteins. The entry of vaccinia virus (VV) into the host cell results in the delivery of the double-stranded DNA genome-containing core into the cytoplasm. The core is disassembled, releasing the viral DNA in order to initiate VV cytoplasmic transcription and DNA replication. A3L protein is a part of that core []. The A2.5L gene product is an all-alpha-helical protein with a conserved Cxx(x)C motif in the N-terminal alpha-helix. It appears to be an integral component of intracellular virions [].
Probab=26.64  E-value=1.2e+02  Score=16.81  Aligned_cols=22  Identities=23%  Similarity=0.288  Sum_probs=17.0

Q ss_pred             HHHHHHhhChHHHHHHHHHHHh
Q psy16714          3 CIKRTRNTFKAGRKLFREFLRC   24 (68)
Q Consensus         3 ~~~~~ll~dp~g~~~F~~FL~~   24 (68)
                      .++-+|+++|.-..-.++||..
T Consensus        29 ~ik~~L~sqp~k~~iLk~FL~~   50 (70)
T PF05288_consen   29 TIKMMLESQPNKLKILKRFLSF   50 (70)
T ss_pred             HHHHHHHcChhHHHHHHHHHHH
Confidence            4566788999888888888873


No 31 
>PHA02683 ORF078 thioredoxin-like protein; Provisional
Probab=26.57  E-value=1.1e+02  Score=17.27  Aligned_cols=21  Identities=19%  Similarity=0.332  Sum_probs=16.2

Q ss_pred             HHHHHHhhChHHHHHHHHHHH
Q psy16714          3 CIKRTRNTFKAGRKLFREFLR   23 (68)
Q Consensus         3 ~~~~~ll~dp~g~~~F~~FL~   23 (68)
                      .++-+|+++|.-......||.
T Consensus        34 ~ikm~L~sqP~k~~iLk~FL~   54 (75)
T PHA02683         34 NIRMVLESQPNKLRILKEFLA   54 (75)
T ss_pred             HHHHHHHcCccHHHHHHHHHH
Confidence            456678888988888887776


No 32 
>PF10118 Metal_hydrol:  Predicted metal-dependent hydrolase;  InterPro: IPR016516 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function
Probab=26.57  E-value=2e+02  Score=19.22  Aligned_cols=63  Identities=22%  Similarity=0.217  Sum_probs=43.1

Q ss_pred             hHHHHHH--hhChHHHHHHHHHHHhh--cchhHHHHHHHHHHHhccCCHHHHHHHHHHHHHHhcCC-CCC
Q psy16714          2 QCIKRTR--NTFKAGRKLFREFLRCE--YSEENILFWLACEDLKKESNPDVIEEKARFIYEDYISI-LSP   66 (68)
Q Consensus         2 ~~~~~~l--l~dp~g~~~F~~FL~~e--~s~Enl~Fw~ave~~k~~~~~~~~~~~a~~Iy~~fi~~-~sp   66 (68)
                      .++|.++  +.||..+..=+.|..+|  |+-+--.|-...+.  ..-+.+...+....++.+++.. .+|
T Consensus        45 ~svr~~~~~i~D~~L~~~i~~FIgQEA~H~r~H~~~n~~l~~--~G~~~~~~~~~~~~~~~~~l~~~~~~  112 (253)
T PF10118_consen   45 RSVRRARPQIKDPELREEIKGFIGQEAMHSREHRKFNEALEA--QGYDVRPFLEKMEKLFLKFLEKRLSL  112 (253)
T ss_pred             HHHHHHHhhCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH--cCCCCcHHHHHHHHHHHHHHhhhcCH
Confidence            5667764  68999999999999976  56666677666632  2222334567777777777776 444


No 33 
>PF05843 Suf:  Suppressor of forked protein (Suf);  InterPro: IPR008847 This domain consists of several eukaryotic suppressor of forked (Suf) like proteins. The Drosophila melanogaster suppressor of forked [Su(f)] protein shares homology with the Saccharomyces cerevisiae RNA14 protein and the 77 kDa subunit of Homo sapiens cleavage stimulation factor, which are proteins involved in mRNA 3' end formation. This suggests a role for Su(f) in mRNA 3' end formation in Drosophila. The su(f) gene produces three transcripts; two of them are polyadenylated at the end of the transcription unit, and one is a truncated transcript, polyadenylated in intron 4. It is thought that su(f) plays a role in the regulation of poly(A) site utilisation and the GU-rich sequence is important for this regulation to occur [].; GO: 0006397 mRNA processing, 0005634 nucleus; PDB: 2L9B_B 2OND_B 2OOE_A 4E85_B 4EBA_C 4E6H_A 2UY1_B.
Probab=25.90  E-value=2.1e+02  Score=19.12  Aligned_cols=49  Identities=29%  Similarity=0.317  Sum_probs=26.8

Q ss_pred             hChH-HHHHHHHHHHhhcchhHHHHHHHHHHHhccCCHHHHHHHHHHHHHHhcCC
Q psy16714         10 TFKA-GRKLFREFLRCEYSEENILFWLACEDLKKESNPDVIEEKARFIYEDYISI   63 (68)
Q Consensus        10 ~dp~-g~~~F~~FL~~e~s~Enl~Fw~ave~~k~~~~~~~~~~~a~~Iy~~fi~~   63 (68)
                      .|+. ++.-|..=|+.  -..+..||+.--+|-....  + ...|+.+|++.|..
T Consensus        50 ~d~~~A~~Ife~glk~--f~~~~~~~~~Y~~~l~~~~--d-~~~aR~lfer~i~~   99 (280)
T PF05843_consen   50 KDPKRARKIFERGLKK--FPSDPDFWLEYLDFLIKLN--D-INNARALFERAISS   99 (280)
T ss_dssp             S-HHHHHHHHHHHHHH--HTT-HHHHHHHHHHHHHTT----HHHHHHHHHHHCCT
T ss_pred             CCHHHHHHHHHHHHHH--CCCCHHHHHHHHHHHHHhC--c-HHHHHHHHHHHHHh
Confidence            5555 66666655543  3335566666555544332  1 35677788777765


No 34 
>KOG0063|consensus
Probab=25.87  E-value=93  Score=23.65  Aligned_cols=31  Identities=10%  Similarity=0.005  Sum_probs=27.6

Q ss_pred             HHHHHHHHHHHhhcchhHHHHHHHHHHHhcc
Q psy16714         13 AGRKLFREFLRCEYSEENILFWLACEDLKKE   43 (68)
Q Consensus        13 ~g~~~F~~FL~~e~s~Enl~Fw~ave~~k~~   43 (68)
                      .++..++-+|....++.++.+|+.++.|...
T Consensus       495 AskvikRfilhakktafvVEhdfImaTYlad  525 (592)
T KOG0063|consen  495 ASKVIKRFILHAKKTAFVVEHDFIMATYLAD  525 (592)
T ss_pred             HHHHHHHHHHhccchhhhhhhHHHHHHhhcc
Confidence            4677888899999999999999999999864


No 35 
>PHA03056 putative myristoylated protein; Provisional
Probab=24.85  E-value=1.2e+02  Score=19.16  Aligned_cols=30  Identities=30%  Similarity=0.597  Sum_probs=19.7

Q ss_pred             HHHHHhccCCHHHHHHHHHHHHHHhcCCCC
Q psy16714         36 ACEDLKKESNPDVIEEKARFIYEDYISILS   65 (68)
Q Consensus        36 ave~~k~~~~~~~~~~~a~~Iy~~fi~~~s   65 (68)
                      -|.+|-....+.......+.||..|.-|.|
T Consensus        70 LCKd~l~~~~p~T~~~~IK~Il~qy~IP~S   99 (165)
T PHA03056         70 LCKDHAEKSSPETQQMIIKHIYEQYLIPVS   99 (165)
T ss_pred             HHHHHHHcCCchHHHHHHHHHHHHhcCChh
Confidence            355554444444445668999999988876


No 36 
>KOG2348|consensus
Probab=24.23  E-value=45  Score=25.42  Aligned_cols=17  Identities=24%  Similarity=0.237  Sum_probs=13.9

Q ss_pred             HHHHHHHHHHhcCCCCC
Q psy16714         50 EEKARFIYEDYISILSP   66 (68)
Q Consensus        50 ~~~a~~Iy~~fi~~~sp   66 (68)
                      .-...+||++||+|+|.
T Consensus       375 SvfTYDIYr~Yi~PrAs  391 (667)
T KOG2348|consen  375 SVFTYDIYRNYIRPRAS  391 (667)
T ss_pred             hhhehHHHHhhcCCccC
Confidence            44568999999999874


No 37 
>PRK13722 lytic transglycosylase; Provisional
Probab=24.09  E-value=1.2e+02  Score=19.46  Aligned_cols=54  Identities=13%  Similarity=0.067  Sum_probs=29.9

Q ss_pred             HhhChHHHHHHHH-HHHhhcchhHHHHHHHHHHHhccCC-----HHHHHHHHHHHHHHhcC
Q psy16714          8 RNTFKAGRKLFRE-FLRCEYSEENILFWLACEDLKKESN-----PDVIEEKARFIYEDYIS   62 (68)
Q Consensus         8 ll~dp~g~~~F~~-FL~~e~s~Enl~Fw~ave~~k~~~~-----~~~~~~~a~~Iy~~fi~   62 (68)
                      |+.||..--..-. +|...+..-. ..|.+|-.|...+.     ......-|.+||..|-.
T Consensus        85 ll~DPc~NI~~Gaw~L~~~~~~~g-~~w~AvaaYNaG~~~~~~~~~~r~~Ya~kV~~~y~~  144 (169)
T PRK13722         85 LTTDPCMNIYTGAYYLAIAFKKWG-VSWEAVGAYNAGFRKTERQNQRRLAYASEVYRIYTG  144 (169)
T ss_pred             HccCHHHHHHHHHHHHHHHHHHcC-CchheeeecCCCCCccccchHHHHHHHHHHHHHHHH
Confidence            4578865433322 2222211111 36899999998642     24556667667666644


No 38 
>PF03134 TB2_DP1_HVA22:  TB2/DP1, HVA22 family;  InterPro: IPR004345 This family includes members from a wide variety of eukaryotes. It includes the TB2/DP1 (deleted in polyposis) protein which in human is deleted in severe forms of familial adenomatous polyposis, an autosomal dominant oncological inherited disease. The family also includes the plant protein of known similarity to TB2/DP1, the HVA22 abscisic acid-induced protein (e.g. Q07764 from SWISSPROT), which is thought to be a regulatory protein. 
Probab=23.85  E-value=56  Score=18.23  Aligned_cols=14  Identities=29%  Similarity=0.617  Sum_probs=11.2

Q ss_pred             HHHHHHHHHHhcCC
Q psy16714         50 EEKARFIYEDYISI   63 (68)
Q Consensus        50 ~~~a~~Iy~~fi~~   63 (68)
                      .+-|..||+++|+|
T Consensus        79 ~~Ga~~iy~~~i~P   92 (94)
T PF03134_consen   79 FQGAEYIYDKFIRP   92 (94)
T ss_pred             CCcHHHHHHHHccc
Confidence            46788899998876


No 39 
>PF07956 DUF1690:  Protein of Unknown function (DUF1690) ;  InterPro: IPR012471 Family of uncharacterised fungal proteins. 
Probab=23.65  E-value=41  Score=20.85  Aligned_cols=20  Identities=30%  Similarity=0.416  Sum_probs=16.6

Q ss_pred             hhcchhHHHHHHHHHHHhcc
Q psy16714         24 CEYSEENILFWLACEDLKKE   43 (68)
Q Consensus        24 ~e~s~Enl~Fw~ave~~k~~   43 (68)
                      ++...--|+.|-.|+.||+.
T Consensus       121 ~~N~~rPLnCw~EVe~FKk~  140 (142)
T PF07956_consen  121 RENDGRPLNCWEEVEAFKKE  140 (142)
T ss_pred             HHCCCCCCchHHHHHHHHHH
Confidence            55666779999999999974


No 40 
>PF11709 Mit_ribos_Mrp51:  Mitochondrial ribosomal protein subunit ;  InterPro: IPR016712 The function of mitochondrial ribosomal small-subunit protein MRP51 is not entirely clear, but deletion of the MRP51 gene completely blocks mitochondrial gene expression [].
Probab=23.49  E-value=1.1e+02  Score=21.26  Aligned_cols=26  Identities=23%  Similarity=0.349  Sum_probs=20.9

Q ss_pred             HHHHHhhChHHHHHHHHHHHhhcchh
Q psy16714          4 IKRTRNTFKAGRKLFREFLRCEYSEE   29 (68)
Q Consensus         4 ~~~~ll~dp~g~~~F~~FL~~e~s~E   29 (68)
                      |++-|-..+..+..|++||.+.+.+.
T Consensus       148 f~~yL~kvr~~R~eF~~~L~~~~~e~  173 (312)
T PF11709_consen  148 FERYLKKVRPLRPEFKKWLREKHPES  173 (312)
T ss_pred             HHHHHHHhHHHHHHHHHHHHHhChhh
Confidence            55566677889999999999888776


No 41 
>PHA02335 hypothetical protein
Probab=22.67  E-value=1.9e+02  Score=17.53  Aligned_cols=43  Identities=16%  Similarity=0.224  Sum_probs=28.0

Q ss_pred             hhcchhHHHHHHHHHHHhcc--CCHH------HHHHHHHHHHHHhcCCCCCC
Q psy16714         24 CEYSEENILFWLACEDLKKE--SNPD------VIEEKARFIYEDYISILSPK   67 (68)
Q Consensus        24 ~e~s~Enl~Fw~ave~~k~~--~~~~------~~~~~a~~Iy~~fi~~~sp~   67 (68)
                      +|..++|+.++ |+..|..-  .+.+      ++.+...+.+.+|+..++|+
T Consensus         4 ~el~~~n~m~f-Ai~~Y~np~sVt~ddf~~DlkRi~yIkrllKRy~~~~~~k   54 (118)
T PHA02335          4 DELNEGNYMLF-AIKNYNNPQSVTYDDFEEDLKRFKYIKRLFKRYLNTGELK   54 (118)
T ss_pred             ccCCchhHHHH-HHHhcCCcccccHHHHHHHHHHHHHHHHHHHhhcCCCChh
Confidence            45667777664 67777632  1121      34567778899999999884


No 42 
>PF12122 DUF3582:  Protein of unknown function (DUF3582);  InterPro: IPR022732 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ].  This entry represents the N-terminal domain of membrane-bound serine endopeptidases belonging to MEROPS peptidase family S54 (rhomboid-1, clan ST). This domain contains a conserved ASW sequence motif and a single completely conserved residue F that may be functionally important.  The tertiary structure of the GlpG protein from Escherichia coli has been determined []. The GlpG protein has six transmembrane domains (other members of the family are predicted to have seven), with the N- and C-terminal ends anchored in the cytoplasm. One transmembrane domain is shorter than the rest, creating an internal, aqueous cavity just below the membrane surface and it is here were proteolysis occurs. There is also a membrane-embedded loop between the first and second transmembrane domains which is postulated to act as a gate controlling substrate access to the active site. No other family of serine peptidases is known to have active site residues within transmembrane domains (although transmembrane active sites are known for aspartic peptidase and metallopeptidases), and the GlpG protein has the type structure for clan ST.; GO: 0004252 serine-type endopeptidase activity, 0016021 integral to membrane; PDB: 3UBB_A 3B45_A 3B44_A 2NRF_A 3TXT_A 2O7L_A 2XTU_A 2IRV_A 2XOW_A 2XTV_A ....
Probab=22.67  E-value=81  Score=18.40  Aligned_cols=18  Identities=11%  Similarity=0.132  Sum_probs=15.4

Q ss_pred             hhChHHHHHHHHHHHhhc
Q psy16714          9 NTFKAGRKLFREFLRCEY   26 (68)
Q Consensus         9 l~dp~g~~~F~~FL~~e~   26 (68)
                      ++||.....|.+||....
T Consensus         7 ~~n~r~AqaF~DYl~sqg   24 (101)
T PF12122_consen    7 LNNPRAAQAFIDYLASQG   24 (101)
T ss_dssp             ESSHHHHHHHHHHHHHTT
T ss_pred             cCCHHHHHHHHHHHHHCC
Confidence            478999999999999765


No 43 
>PF13981 SopA:  SopA-like central domain; PDB: 3NB2_B 3NAW_B 3SQV_B 2QZA_B 3SY2_B 2QYU_A.
Probab=22.36  E-value=99  Score=19.01  Aligned_cols=35  Identities=26%  Similarity=0.379  Sum_probs=21.6

Q ss_pred             HHHHHHHHHHHhccCCHHHHHHHHHHHHHHhcCCCC
Q psy16714         30 NILFWLACEDLKKESNPDVIEEKARFIYEDYISILS   65 (68)
Q Consensus        30 nl~Fw~ave~~k~~~~~~~~~~~a~~Iy~~fi~~~s   65 (68)
                      |=.|-+-+..=....+ ..+.+.|..+|++|++...
T Consensus        92 N~~FIQ~i~~~~~~~~-~~~k~~A~~LY~~YL~~~~  126 (135)
T PF13981_consen   92 NGAFIQLIAQAMTHGD-DEIKQKARDLYKKYLQLPR  126 (135)
T ss_dssp             HHHHHHHHHHHCC-TS-CCCHHHHHHHHHHHCCSTT
T ss_pred             ccHHHHHHHHHHHhcc-HHHHHHHHHHHHHHhcChh
Confidence            4444444443333332 3568999999999998653


No 44 
>PF12335 SBF2:  Myotubularin protein ;  InterPro: IPR022096  This domain family is found in eukaryotes, and is approximately 220 amino acids in length. The family is found in association with PF02141 from PFAM, PF03456 from PFAM, PF03455 from PFAM. This family is the middle region of SBF2, a member of the myotubularin family. Myotubularin-related proteins have been suggested to work in phosphoinositide-mediated signalling events that may also convey control of myelination. Mutations of SBF2 are implicated in Charcot-Marie-Tooth disease. 
Probab=21.94  E-value=1.1e+02  Score=20.40  Aligned_cols=24  Identities=17%  Similarity=0.177  Sum_probs=19.5

Q ss_pred             hHHHHHHhhChHHHHHHHHHHHhhc
Q psy16714          2 QCIKRTRNTFKAGRKLFREFLRCEY   26 (68)
Q Consensus         2 ~~~~~~ll~dp~g~~~F~~FL~~e~   26 (68)
                      .+|.++| ....||..|...|.+..
T Consensus        47 ~av~~~l-k~~~aR~~~~~~L~~~~   70 (225)
T PF12335_consen   47 PAVLRAL-KSRSARQAFCRELSKHV   70 (225)
T ss_pred             HHHHHHH-ccchHHHHHHHHHHHHH
Confidence            5677777 88889999999998543


No 45 
>PF08708 PriCT_1:  Primase C terminal 1 (PriCT-1);  InterPro: IPR014820 This alpha helical domain is found at the C-terminal of primases. 
Probab=21.84  E-value=1.4e+02  Score=15.60  Aligned_cols=11  Identities=18%  Similarity=0.172  Sum_probs=5.3

Q ss_pred             HHHHHHHHHHH
Q psy16714         48 VIEEKARFIYE   58 (68)
Q Consensus        48 ~~~~~a~~Iy~   58 (68)
                      ++...++.|++
T Consensus        57 Ev~~i~kSi~k   67 (71)
T PF08708_consen   57 EVKAIAKSIAK   67 (71)
T ss_pred             HHHHHHHHHHH
Confidence            44455555443


No 46 
>PF08361 TetR_C_2:  MAATS-type transcriptional repressor, C-terminal region;  InterPro: IPR013572  This entry is named after the various transcriptional regulatory proteins that it contains, including MtrR (Q6RV06 from SWISSPROT), AcrR (P34000 from SWISSPROT), ArpR (Q9KJC4 from SWISSPROT), TtgR (Q9AIU0 from SWISSPROT) and SmeT (Q8KLP4 from SWISSPROT). These are members of the TetR (tetracycline resistance) family of transcriptional repressors, that are involved in the control of expression of multidrug resistance proteins [, , ]. ; GO: 0003677 DNA binding; PDB: 3BCG_B 2QOP_A 2UXP_B 2XDN_C 2UXH_A 2UXI_B 2UXO_A 2UXU_B 2WUI_A 2W53_A ....
Probab=21.69  E-value=1.8e+02  Score=16.84  Aligned_cols=34  Identities=21%  Similarity=0.327  Sum_probs=25.5

Q ss_pred             HhhChHHHHHHHHHH-HhhcchhHHHHHHHHHHHh
Q psy16714          8 RNTFKAGRKLFREFL-RCEYSEENILFWLACEDLK   41 (68)
Q Consensus         8 ll~dp~g~~~F~~FL-~~e~s~Enl~Fw~ave~~k   41 (68)
                      +.+||..+..|.-.+ +-|+.+|+--.........
T Consensus        17 l~~d~~~Rrv~~I~~~kcE~~~e~~~~~~r~~~~~   51 (121)
T PF08361_consen   17 LAEDERQRRVFEILFHKCEYVEEMAPVRERRREAQ   51 (121)
T ss_dssp             HHHSHHHHHHHHHHHHS--SSTTTHHHHHHHHHHH
T ss_pred             HhhCHHHHHHHHHHHHhcccCCcccHHHHHHHHHH
Confidence            349999999999855 5899999998887766544


No 47 
>COG5562 Phage envelope protein [General function prediction only]
Probab=21.18  E-value=57  Score=20.42  Aligned_cols=23  Identities=26%  Similarity=0.439  Sum_probs=18.9

Q ss_pred             hHHHHHHhhChHHHHHHHHHHHh
Q psy16714          2 QCIKRTRNTFKAGRKLFREFLRC   24 (68)
Q Consensus         2 ~~~~~~ll~dp~g~~~F~~FL~~   24 (68)
                      .+++-++..+..|...|+.|+..
T Consensus        75 ~~i~~al~~~qsGqttF~ef~~~   97 (137)
T COG5562          75 TLIKTALRRHQSGQTTFEEFCSA   97 (137)
T ss_pred             HHHHHHHHHHhcCCccHHHHHHH
Confidence            35677888999999999999874


No 48 
>PF14159 CAAD:  CAAD domains of cyanobacterial aminoacyl-tRNA synthetase
Probab=21.15  E-value=1.7e+02  Score=16.69  Aligned_cols=29  Identities=14%  Similarity=0.213  Sum_probs=21.3

Q ss_pred             HHHHHHHHhccCCHHHHHHHHHHHHHHhc
Q psy16714         33 FWLACEDLKKESNPDVIEEKARFIYEDYI   61 (68)
Q Consensus        33 Fw~ave~~k~~~~~~~~~~~a~~Iy~~fi   61 (68)
                      =|....++-..+++++..++...+.++++
T Consensus        61 ~wF~~ryLL~~~~R~el~~~i~~~k~~i~   89 (90)
T PF14159_consen   61 GWFVYRYLLFAENRQELLQKIQSLKKEIL   89 (90)
T ss_pred             hHHHHHHHcChHhHHHHHHHHHHHHHHhc
Confidence            37777777777777778888888777654


No 49 
>COG4748 Uncharacterized conserved protein [Function unknown]
Probab=20.65  E-value=2.9e+02  Score=19.89  Aligned_cols=51  Identities=20%  Similarity=0.225  Sum_probs=35.5

Q ss_pred             HHHHHHHHHHHhhc---chhHHHHHHHHH--HHhccCCHHHHHHHHHHHHHHhcCC
Q psy16714         13 AGRKLFREFLRCEY---SEENILFWLACE--DLKKESNPDVIEEKARFIYEDYISI   63 (68)
Q Consensus        13 ~g~~~F~~FL~~e~---s~Enl~Fw~ave--~~k~~~~~~~~~~~a~~Iy~~fi~~   63 (68)
                      .....+..||.+|.   |.|-+.|..+--  +++.......-....+.++..||..
T Consensus       172 kYt~eik~~la~e~~~Ps~Efvkffaa~v~~g~~t~~~~~kFt~ivk~~~~qFi~D  227 (365)
T COG4748         172 KYTKEIKAYLAKETNNPSVEFVKFFAAKVYTGFKTTSVDEKFTDIVKNAFSQFIND  227 (365)
T ss_pred             hHHHHHHHHHHHhccCCCHHHHHHHHHHHhccceeeeHHHHHHHHHHHHHHHHHHH
Confidence            45667788888666   678888888766  5555544445566777888888754


No 50 
>PF03909 BSD:  BSD domain  ;  InterPro: IPR005607 The BSD domain is an about 60-residue long domain named after the BTF2-like transcription factors, Synapse-associated proteins and DOS2-like proteins in which it is found. Additionally, it is also found in several hypothetical proteins. The BSD domain occurs in one or two copies in a variety of species ranging from primal protozoan to human. It can be found associated with other domains such as the BTB domain (see PDOC50097 from PROSITEDOC) or the U-box in multidomain proteins. The function of the BSD domain is yet unknown []. Secondary structure prediction indicates the presence of three predicted alpha helices, which probably form a three-helical bundle in small domains. The third predicted helix contains neighbouring phenylalanine and tryptophan residues - less common amino acids that are invariant in all the BSD domains identified and that are the most striking sequence features of the domain [].  Some proteins known to contain one or two BSD domains are listed below:  Mammalian TFIIH basal transcription factor complex p62 subunit (GTF2H1).  Yeast RNA polymerase II transcription factor B 73 kDa subunit (TFB1), the homologue of BTF2.  Yeast DOS2 protein. It is involved in single-copy DNA replication and ubiquitination.  Drosophila synapse-associated protein SAP47.  Mammalian SYAP1.  Various Arabidopsis thaliana (Mouse-ear cress) hypothetical proteins.; PDB: 1X3A_A 1PFJ_A 2RNR_B 2DII_A.
Probab=20.35  E-value=1.1e+02  Score=15.89  Aligned_cols=31  Identities=35%  Similarity=0.510  Sum_probs=20.8

Q ss_pred             HHHHHhhChHHHHHHHHHHHhhcchhHHHHHHH
Q psy16714          4 IKRTRNTFKAGRKLFREFLRCEYSEENILFWLA   36 (68)
Q Consensus         4 ~~~~ll~dp~g~~~F~~FL~~e~s~Enl~Fw~a   36 (68)
                      ...+|..||..+..+...+-...+++  .||..
T Consensus        17 i~~lL~~~p~l~~~~~~lVP~~~~e~--~FW~r   47 (62)
T PF03909_consen   17 IKKLLEEDPNLRKLYNELVPSKMSEE--EFWKR   47 (62)
T ss_dssp             HHHHHHH-HHHHHHHHHCCTTTS-HH--HHHHH
T ss_pred             HHHHHHhCHHHHHHHHHhCCCCCCHH--HHHHH
Confidence            34567799999999988885555544  67864


No 51 
>PRK09718 hypothetical protein; Validated
Probab=20.08  E-value=1.8e+02  Score=22.00  Aligned_cols=19  Identities=26%  Similarity=0.501  Sum_probs=16.7

Q ss_pred             HHHHHHHHHHHHHhcCCCC
Q psy16714         47 DVIEEKARFIYEDYISILS   65 (68)
Q Consensus        47 ~~~~~~a~~Iy~~fi~~~s   65 (68)
                      ..+.++|..+|++||+..+
T Consensus       385 ~~~k~~a~~ly~~~l~~~~  403 (512)
T PRK09718        385 DTIKANAWHLYKEWIRSDD  403 (512)
T ss_pred             HHHHHHHHHHHHHHhcCcc
Confidence            5789999999999998764


Done!