Query         020681
Match_columns 322
No_of_seqs    10 out of 12
Neff          1.8 
Searched_HMMs 46136
Date          Fri Mar 29 04:18:53 2013
Command       hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/020681.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/020681hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PF00003 7tm_3:  7 transmembran  96.0   0.012 2.6E-07   49.3   4.9  166   40-214    17-204 (238)
  2 PF12553 DUF3742:  Protein of u  39.0      11 0.00025   28.1   0.3   13   89-101    34-46  (54)
  3 PF13858 DUF4199:  Protein of u  36.0      53  0.0011   26.5   3.7   64  198-276     3-69  (163)
  4 PF14667 Polysacc_synt_C:  Poly  32.3 2.3E+02  0.0049   21.6   7.2   33  174-206    70-102 (146)
  5 PF08815 Nuc_rec_co-act:  Nucle  30.7      21 0.00045   27.2   0.5   27  252-278     8-36  (51)
  6 PF04129 Vps52:  Vps52 / Sac2 f  30.5      34 0.00074   33.9   2.1  101   34-139   267-386 (508)
  7 PF06570 DUF1129:  Protein of u  30.2 1.4E+02  0.0031   25.9   5.6   51  188-240   139-190 (206)
  8 COG3620 Predicted transcriptio  28.3      33 0.00071   31.7   1.4   19  279-299   133-151 (187)
  9 COG0828 RpsU Ribosomal protein  27.2      45 0.00097   26.3   1.8   21  287-309     2-22  (67)
 10 PF07725 LRR_3:  Leucine Rich R  25.2      25 0.00055   22.1   0.1   16  184-199     4-19  (20)
 11 PLN03232 ABC transporter C fam  25.2      65  0.0014   35.9   3.3  114   84-212    31-153 (1495)
 12 PF12149 HSV_VP16_C:  Herpes si  24.9      32 0.00069   23.8   0.6   18  263-280    11-29  (30)
 13 PF08009 CDP-OH_P_tran_2:  CDP-  24.1      96  0.0021   22.1   2.9   24  159-182     3-26  (39)
 14 PF08228 RNase_P_pop3:  RNase P  23.4      33 0.00072   30.2   0.6   18  268-285   114-131 (158)
 15 PF08261 Carcinustatin:  Carcin  22.7      34 0.00073   17.9   0.3    8   93-100     1-8   (8)
 16 KOG1586 Protein required for f  22.2      20 0.00042   34.9  -1.2   18  305-322   220-237 (288)
 17 COG4858 Uncharacterized membra  21.3      91   0.002   29.5   3.0   21  195-215   161-181 (226)
 18 PF09163 Form-deh_trans:  Forma  21.2      94   0.002   22.7   2.4   23  191-213     3-25  (44)
 19 PF04942 CC:  CC domain;  Inter  20.5      20 0.00043   24.8  -1.1   13    7-19     15-27  (36)
 20 COG4252 Predicted transmembran  20.1 1.3E+02  0.0027   30.2   3.8   25   49-75    371-395 (400)

No 1  
>PF00003 7tm_3:  7 transmembrane sweet-taste receptor of 3 GCPR;  InterPro: IPR017978 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/).  GPCR family 3 receptors (also known as family C) are structurally similar to other GPCRs, but do not show any significant sequence similarity and thus represent a distinct group. Structurally they are composed of four elements; an N-terminal signal sequence; a large hydrophilic extracellular agonist-binding region containing several conserved cysteine residues which could be involved in disulphide bonds; a shorter region containing seven transmembrane domains; and a C-terminal cytoplasmic domain of variable length []. Family 3 members include the metabotropic glutamate receptors, the extracellular calcium-sensing receptors, the gamma-amino-butyric acid (GABA) type B receptors, and the vomeronasal type-2 receptors [, , , ]. As these receptors regulate many important physiological processes they are potentially promising targets for drug development. This entry represents the C-terminal region of family 3 GPCR receptor proteins, which contains the seven transmembrane region. 
The seven TM regions assemble in such a way as to produce a docking pocket into which such molecules as cyclamate and lactisole have been found to bind and consequently confer the taste of sweetness. ; GO: 0004930 G-protein coupled receptor activity, 0007186 G-protein coupled receptor protein signaling pathway, 0016021 integral to membrane
Probab=96.00  E-value=0.012  Score=49.28  Aligned_cols=166  Identities=21%  Similarity=0.271  Sum_probs=98.6

Q ss_pred             eecCCCCCCceeehHhHHHHHHHHHHHHHhhhchhhhchhhhhhhhHhhhhhhcccccchhhhhhhHhhhhhhhhheecc
Q 020681           40 HKVPRTKSSGFWIPVIQVFASFNLLLSLVLSDNFLKFQRRRWWQSCYIWGVWIEGPLGFGLLMSCRIAQAFQLYYIFVRK  119 (322)
Q Consensus        40 hk~prt~~s~fwipvIQv~aSfnlL~Si~mS~N~lkf~~~hwwq~Cylw~vW~eGPlGFGlLlSCrI~QAfqLy~IFVkr  119 (322)
                      .|.|--++|+...-.+-.++.+-+..|..+..  .+-+.    ..|.+- .|. -++||.+..++=+++++|+|-+|-++
T Consensus        17 r~~~~i~~s~~~l~~~lL~G~~l~~~~~~~~~--~~~s~----~~C~~r-~~~-~~l~f~l~~~~ll~K~~ri~~if~~~   88 (238)
T PF00003_consen   17 RNTPIIRASSPELLYILLLGCLLLYSSVFLFL--LPPSD----ILCTLR-RWL-FSLGFTLIFSALLAKTWRIYRIFRNP   88 (238)
T ss_pred             cCCCceecCCHhHHHHHHHHHHHHHHHHHHhc--cCcCC----cEEEEe-eee-eeeehHhhhhHHHHhhhheeeeeccC
Confidence            45677777777655555566555444443322  22222    237754 666 46999999999999999999999865


Q ss_pred             cCCC--cch-----hhh-hhHHHHHHHhhhhhhhhccCCc-------------cccc-ceeEeeehhhhHHHHHHHHHHH
Q 020681          120 HLPP--IRS-----YVF-LPLVLMPWLIAATFIQVMRPLN-------------DRCH-MRVHWIIPFLFLHVVYVASLVG  177 (322)
Q Consensus       120 rLPp--irs-----Y~f-LPlillPWi~gaa~ih~~kpl~-------------~rCh-m~~~W~ipv~~LhalYva~Lv~  177 (322)
                      +...  .++     +.+ +.+++.+-+.-+. -....|..             ..|+ ....|..-.....++....-..
T Consensus        89 ~~~~~~~~~~~~~~~~~~~~~~~v~~ii~~~-w~~~~p~~~~~~~~~~~~~~~~~C~~~~~~~~~~~~~y~~~Ll~~~~~  167 (238)
T PF00003_consen   89 SRKRRRLISSNRSLLLLVLLLVLVQVIILII-WLILDPPTPVSDIDISSNEIYLSCSSNSNIWLILSLGYNGLLLLIGFF  167 (238)
T ss_pred             CCCCcccccCCcchhhheeeeeeehhhhhhh-hhhcccccccccccccceEEEEEecCCccchHHHHHHHHHHHHHHHHH
Confidence            5522  232     211 1222222222211 12233332             2895 3344444444444444455556


Q ss_pred             HHHhhheeeeechhHHHHHHHHHHHHHHHHHHHHHHH
Q 020681          178 FMAAIRHIEFRFDELRDLWQGIIVSASSIGLWVFAYL  214 (322)
Q Consensus       178 ~t~avrHIeFrFdElkdLwkgilVsa~si~vWv~ayi  214 (322)
                      .+++.||++-.|+|-|.+=-.+.+......+|+..|.
T Consensus       168 la~~~R~~~~~~nEa~~I~~~~~~~~~~~~~~~~~~~  204 (238)
T PF00003_consen  168 LAFKTRNVPSNFNEARYIAFAIYNITIIWIIFIPLYF  204 (238)
T ss_pred             HHHhhCCCCcccchhhhHhHhHHHHHHHHHHhhhhee
Confidence            7789999999999999876666666666666666554


No 2  
>PF12553 DUF3742:  Protein of unknown function (DUF3742);  InterPro: IPR022213  This domain family is found in bacteria, and is approximately 50 amino acids in length. There is a single completely conserved residue Y that may be functionally important. 
Probab=38.99  E-value=11  Score=28.09  Aligned_cols=13  Identities=62%  Similarity=1.257  Sum_probs=10.8

Q ss_pred             hhhhcccccchhh
Q 020681           89 GVWIEGPLGFGLL  101 (322)
Q Consensus        89 ~vW~eGPlGFGlL  101 (322)
                      .-|-+||-|||+-
T Consensus        34 ~E~R~G~~GfGlY   46 (54)
T PF12553_consen   34 PEWREGPAGFGLY   46 (54)
T ss_pred             HhheecCCCcccc
Confidence            3589999999974


No 3  
>PF13858 DUF4199:  Protein of unknown function (DUF4199)
Probab=36.05  E-value=53  Score=26.54  Aligned_cols=64  Identities=25%  Similarity=0.418  Sum_probs=38.6

Q ss_pred             HHHHHHHHHHHHHHHHHh---hhhhccchhHHHHHHHHHHhhhceeeeEEEeccCCccchhhhhccccCchhhhhhhhhc
Q 020681          198 GIIVSASSIGLWVFAYLL---NEIHDDISWLQVASRFLLLVMGGILVVVFFSISSSEPLLSQISLRKREPKEFETMGQAL  274 (322)
Q Consensus       198 gilVsa~si~vWv~ayil---nei~d~iswlqv~sRflllV~~~iLvl~ffsIsssqpllsqislrkRe~~ef~tMgqAL  274 (322)
                      |+.....+++.|+..|++   ++-.+.-+|+...+-.+..+..               ...-...|++++..+-+-+||+
T Consensus         3 g~i~G~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~i---------------~~~i~~~R~~~~~g~isf~~a~   67 (163)
T PF13858_consen    3 GLIFGLILILFFLLSYLLGMHDIKYPSNSWLGILSMVITIIFI---------------YFAIRRYRKKYNGGFISFGQAF   67 (163)
T ss_pred             HHHHHHHHHHHHHHHHHHHHccccccHhHHHHHHHHHHHHHHH---------------HHHHHHHHHHccCCCeeHHHHH
Confidence            566777888888988888   5444445555544422222111               1233345557777888888888


Q ss_pred             CC
Q 020681          275 GI  276 (322)
Q Consensus       275 GI  276 (322)
                      +.
T Consensus        68 ~~   69 (163)
T PF13858_consen   68 KV   69 (163)
T ss_pred             HH
Confidence            74


No 4  
>PF14667 Polysacc_synt_C:  Polysaccharide biosynthesis C-terminal domain
Probab=32.31  E-value=2.3e+02  Score=21.58  Aligned_cols=33  Identities=15%  Similarity=0.184  Sum_probs=23.0

Q ss_pred             HHHHHHHhhheeeeechhHHHHHHHHHHHHHHH
Q 020681          174 SLVGFMAAIRHIEFRFDELRDLWQGIIVSASSI  206 (322)
Q Consensus       174 ~Lv~~t~avrHIeFrFdElkdLwkgilVsa~si  206 (322)
                      ..+-.-..-||+..+.+-.|++||-++.++.+.
T Consensus        70 ~~~~~~~~~k~~~~~~~~~~~~~~~~~~~~~~~  102 (146)
T PF14667_consen   70 FILNLWYVRKKIGIKINWRRSLIKPILASIVMA  102 (146)
T ss_pred             HHHHHHHHHHHhCCChHHHHHHHHHHHHHHHHH
Confidence            334455566788888777779999887765543


No 5  
>PF08815 Nuc_rec_co-act:  Nuclear receptor coactivator;  InterPro: IPR014920 This entry represents the interlocking domain of the eukaryotic nuclear receptor coactivators Ncoa1, Ncoa2 and Ncoa3. The interlocking domain forms a 3-helical non-globular array that forms interlocked heterodimers with its target. Nuclear receptors are ligand-activated transcription factors involved in the regulation of many processes, including development, reproduction and homeostasis. Nuclear receptor coactivators act to modulate the function of nuclear receptors. Coactivators associate with promoters and enhancers primarily through protein-protein contacts to facilitate the interaction between DNA-bound transcription factors and the transcription machinery. In addition to their role as coactivators of various nuclear receptors, Ncoa1 and Ncoa3 both have histone acetyltransferase activity (2.3.1.48 from EC), but Ncoa2 does not [, ]. ; GO: 0003713 transcription coactivator activity, 0035257 nuclear hormone receptor binding, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus; PDB: 2C52_B 1KBH_A.
Probab=30.71  E-value=21  Score=27.20  Aligned_cols=27  Identities=37%  Similarity=0.615  Sum_probs=19.2

Q ss_pred             cchhhhhc--cccCchhhhhhhhhcCCCC
Q 020681          252 PLLSQISL--RKREPKEFETMGQALGIPD  278 (322)
Q Consensus       252 pllsqisl--rkRe~~ef~tMgqALGIpd  278 (322)
                      -||+|+.-  +.-+..+-+..-+|||||+
T Consensus         8 ALldQL~s~L~~~D~~~LeEIDraLGIp~   36 (51)
T PF08815_consen    8 ALLDQLYSLLSNTDVTGLEEIDRALGIPD   36 (51)
T ss_dssp             HHHHHHHHHCCTSSGCCCHCCHHHTTCCC
T ss_pred             HHHHHHHHHHhccchhhHHHHHHHhCcHH
Confidence            36777653  3344457888999999987


No 6  
>PF04129 Vps52:  Vps52 / Sac2 family ;  InterPro: IPR007258 Vps52 complexes with Vps53 and Vps54 to form a multi-subunit complex involved in regulating membrane trafficking events.
Probab=30.48  E-value=34  Score=33.91  Aligned_cols=101  Identities=21%  Similarity=0.361  Sum_probs=59.6

Q ss_pred             hccceeeecCCCCCCceeehHhHHHHHHHHHHHHHhh------hchhhhch---hhhhhh-----hH----hhhhhhcc-
Q 020681           34 ILPFVVHKVPRTKSSGFWIPVIQVFASFNLLLSLVLS------DNFLKFQR---RRWWQS-----CY----IWGVWIEG-   94 (322)
Q Consensus        34 ~~p~lvhk~prt~~s~fwipvIQv~aSfnlL~Si~mS------~N~lkf~~---~hwwq~-----Cy----lw~vW~eG-   94 (322)
                      --|.++..+.-+++.+||+.  ++|-|+|+++-=-.+      ..|++-+.   ..++..     +-    ...-++.+ 
T Consensus       267 ~~p~i~~~~a~~~~~k~~~E--~iFRS~~~~L~Dn~t~Ey~F~~~FF~~~~~~~~~if~~If~~t~~~~~~~~~~~l~~~  344 (508)
T PF04129_consen  267 DAPIIVPQIAEDNSQKYPIE--EIFRSLNKALIDNATSEYLFISEFFSGSGDAAEDIFNQIFEPTFSLLQEFTEQLLSNS  344 (508)
T ss_pred             cCCccccchhhcccccCCHH--HHHHHHHHHHHHhhhHHHHHHHHHHcccccchHHHHHHHHHHHHHHHHHHHHHHHHcC
Confidence            35777888888888888765  568999887632211      12222100   011100     00    01112332 


Q ss_pred             cccchhhhhhhHhhhhhhhhheecccCCCcchhhhhhHHHHHHHh
Q 020681           95 PLGFGLLMSCRIAQAFQLYYIFVRKHLPPIRSYVFLPLVLMPWLI  139 (322)
Q Consensus        95 PlGFGlLlSCrI~QAfqLy~IFVkrrLPpirsY~fLPlillPWi~  139 (322)
                      +=.+|+|+..||+|.||.  .--+|+.|++..|. =-+..+=|=-
T Consensus       345 ~D~iglll~Irl~~~~~~--~~~~R~ip~ld~y~-~~~~~~LWpr  386 (508)
T PF04129_consen  345 YDAIGLLLCIRLNQRYQF--EMQRRRIPVLDSYL-NSLLMLLWPR  386 (508)
T ss_pred             CcHHHHHHHHHHHHHHHH--HHHhCCCCchHHHH-HHHHHHHHHH
Confidence            236899999999999995  44789999999994 3444444543


No 7  
>PF06570 DUF1129:  Protein of unknown function (DUF1129);  InterPro: IPR009214 There are currently no experimental data for members of this group or their homologues. However, these proteins contain predicted integral membrane proteins (with several transmembrane segments).
Probab=30.15  E-value=1.4e+02  Score=25.88  Aligned_cols=51  Identities=20%  Similarity=0.498  Sum_probs=34.1

Q ss_pred             echhHHHHHHHHHHHHHHHHHHHHHHHhhhhhcc-chhHHHHHHHHHHhhhcee
Q 020681          188 RFDELRDLWQGIIVSASSIGLWVFAYLLNEIHDD-ISWLQVASRFLLLVMGGIL  240 (322)
Q Consensus       188 rFdElkdLwkgilVsa~si~vWv~ayilnei~d~-iswlqv~sRflllV~~~iL  240 (322)
                      ...+=+..||.+++++++.++|...+.+...-.. +.  -+.+-...++.|.+.
T Consensus       139 ~~~~r~~~~k~~~~~~~~~~~w~~~~~~~~~lp~~in--p~l~~~~~iiig~i~  190 (206)
T PF06570_consen  139 KKKKRPSWWKYILISVLAMVLWIVIFVLTSFLPPVIN--PVLPPWVYIIIGVIA  190 (206)
T ss_pred             ccccccHHHHHHHHHHHHHHHHHHHHHHHHHccccCC--cCCCHHHHHHHHHHH
Confidence            4555568999999999999999998887776322 21  123444455555443


No 8  
>COG3620 Predicted transcriptional regulator with C-terminal CBS domains [Transcription]
Probab=28.25  E-value=33  Score=31.70  Aligned_cols=19  Identities=32%  Similarity=0.721  Sum_probs=15.9

Q ss_pred             CcccccccCCCCCCCCChHHH
Q 020681          279 SGLLLRREPTPVIDPNEPLDK  299 (322)
Q Consensus       279 SG~l~~~ep~~~idpNePLdk  299 (322)
                      .++.  .||+|.+|||||++-
T Consensus       133 r~vM--~e~fP~Vs~~~~l~v  151 (187)
T COG3620         133 REVM--GEPFPTVSPDESLNV  151 (187)
T ss_pred             HHHh--cCCCCcCCCCCCHHH
Confidence            4555  799999999999984


No 9  
>COG0828 RpsU Ribosomal protein S21 [Translation, ribosomal structure and biogenesis]
Probab=27.18  E-value=45  Score=26.31  Aligned_cols=21  Identities=43%  Similarity=0.838  Sum_probs=15.4

Q ss_pred             CCCCCCCCChHHHHhhchhHHHH
Q 020681          287 PTPVIDPNEPLDKLLLNKKFRQS  309 (322)
Q Consensus       287 p~~~idpNePLdkLLlnkrFRqS  309 (322)
                      |-..|+.|||+|+-|  |||..+
T Consensus         2 ~~v~V~ene~~d~AL--rrFKr~   22 (67)
T COG0828           2 PQVKVRENEPLDKAL--RRFKRK   22 (67)
T ss_pred             CeeeecCCChHHHHH--HHHHHH
Confidence            345689999999988  566543


No 10 
>PF07725 LRR_3:  Leucine Rich Repeat;  InterPro: IPR011713 Leucine-rich repeats (LRR) consist of 2-45 motifs of 20-30 amino acids in length that generally folds into an arc or horseshoe shape []. LRRs occur in proteins ranging from viruses to eukaryotes, and appear to provide a structural framework for the formation of protein-protein interactions [, ].Proteins containing LRRs include tyrosine kinase receptors, cell-adhesion molecules, virulence factors, and extracellular matrix-binding glycoproteins, and are involved in a variety of biological processes, including signal transduction, cell adhesion, DNA repair, recombination, transcription, RNA processing, disease resistance, apoptosis, and the immune response []. Sequence analyses of LRR proteins suggested the existence of several different subfamilies of LRRs. The significance of this classification is that repeats from different subfamilies never occur simultaneously and have most probably evolved independently. It is, however, now clear that all major classes of LRR have curved horseshoe structures with a parallel beta sheet on the concave side and mostly helical elements on the convex side. At least six families of LRR proteins, characterised by different lengths and consensus sequences of the repeats, have been identified. Eleven-residue segments of the LRRs (LxxLxLxxN/CxL), corresponding to the beta-strand and adjacent loop regions, are conserved in LRR proteins, whereas the remaining parts of the repeats (herein termed variable) may be very different. Despite the differences, each of the variable parts contains two half-turns at both ends and a "linear" segment (as the chain follows a linear path overall), usually formed by a helix, in the middle. The concave face and the adjacent loops are the most common protein interaction surfaces on LRR proteins. 
3D structure of some LRR proteins-ligand complexes show that the concave surface of LRR domain is ideal for interaction with alpha-helix, thus supporting earlier conclusions that the elongated and curved LRR structure provides an outstanding framework for achieving diverse protein-protein interactions []. Molecular modeling suggests that the conserved pattern LxxLxL, which is shorter than the previously proposed LxxLxLxxN/CxL is sufficient to impart the characteristic horseshoe curvature to proteins with 20- to 30-residue repeats [].  This entry includes some LRRs that fail to be detected by the IPR001611 from INTERPRO model.
Probab=25.25  E-value=25  Score=22.07  Aligned_cols=16  Identities=25%  Similarity=0.960  Sum_probs=13.0

Q ss_pred             eeeeechhHHHHHHHH
Q 020681          184 HIEFRFDELRDLWQGI  199 (322)
Q Consensus       184 HIeFrFdElkdLwkgi  199 (322)
                      .+..+...++.||+|+
T Consensus         4 eL~m~~S~lekLW~G~   19 (20)
T PF07725_consen    4 ELNMPYSKLEKLWEGV   19 (20)
T ss_pred             EEECCCCChHHhcCcc
Confidence            3567888999999985


No 11 
>PLN03232 ABC transporter C family member; Provisional
Probab=25.17  E-value=65  Score=35.94  Aligned_cols=114  Identities=6%  Similarity=-0.039  Sum_probs=55.9

Q ss_pred             hhHhhhhhhcccccchhhhhhhHhhhhhhhhheecccCC-Cc------chhhhhhHHHHHHHhhhhhhhhccCCcccccc
Q 020681           84 SCYIWGVWIEGPLGFGLLMSCRIAQAFQLYYIFVRKHLP-PI------RSYVFLPLVLMPWLIAATFIQVMRPLNDRCHM  156 (322)
Q Consensus        84 ~Cylw~vW~eGPlGFGlLlSCrI~QAfqLy~IFVkrrLP-pi------rsY~fLPlillPWi~gaa~ih~~kpl~~rChm  156 (322)
                      -|..=+++.--|.-|++.+.     .++++++..+++-+ |+      |....+-+++..|.......+..--+|-+=+.
T Consensus        31 ~~~~~~~~~~~~~~~~~~~~-----~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~  105 (1495)
T PLN03232         31 PCAIDSLVMIVSHSVLLGLC-----FYRIWIILDNAKAQIYVLRKKYYNCVLGILACYCVVEPVLRLVMGISLFDMDEET  105 (1495)
T ss_pred             ccHHhhHHHHHHHHHHHHHH-----HHHHHHHhhccccCCccccchHHHHHHHHHHHHHHHHHHHHHHHHhhhhcccccc
Confidence            56666676667777765544     45566664333222 22      11122333334444333322211112222222


Q ss_pred             eeEeeehhhhH--HHHHHHHHHHHHHhhheeeeechhHHHHHHHHHHHHHHHHHHHHH
Q 020681          157 RVHWIIPFLFL--HVVYVASLVGFMAAIRHIEFRFDELRDLWQGIIVSASSIGLWVFA  212 (322)
Q Consensus       157 ~~~W~ipv~~L--halYva~Lv~~t~avrHIeFrFdElkdLwkgilVsa~si~vWv~a  212 (322)
                       -.|..++.++  ++++.+++.-.+. +++        ++-|||..-+......|+..
T Consensus       106 -~~~~~~~~~~~~~~~~~~~~~~~~~-~~~--------~~~~~~~~~~~~~~~~~~~~  153 (1495)
T PLN03232        106 -DLPPFEVASLMVEAFAWFSMLVLIG-LET--------KQYVKEFRWYVRFGVVYVLV  153 (1495)
T ss_pred             -ccccchhHHhHHHHHHHHHHHHHHH-HHH--------HHHHhcCCCCceeeehhHhh
Confidence             2344555544  5666665554444 444        34577777777777777776


No 12 
>PF12149 HSV_VP16_C:  Herpes simplex virus virion protein 16 C terminal;  InterPro: IPR021051  This domain is about 30 amino acids in length. It is found in association with PF02232 from PFAM. This domain is found in the C-terminal region of the HSV virion protein 16 (alpha-TIF). This protein is a transcription promoter. The C-terminal domain is the carboxyl subdomain of the acidic transcriptional activation domain. The protein binds to DNA binding proteins to carry out its function. Such proteins include TATA binding protein, CBP, TBP-binding protein, etc.   Alpha-TIF (VP16) from Herpes Simplex virus is an essential tegument protein involved in the transcriptional activation of viral immediate early (IE) promoters (alpha genes) during the lytic phase of viral infection. VP16 associates with cellular transcription factors to enhance transcription rates, including the general transcription factor TFIIB and the transcriptional coactivator PC4. The N-terminal residues of VP16 confer specificity for the IE genes, while the C-terminal residues are responsible for transcriptional activation. Within the C-terminal region are two activation regions that can independently and cooperatively activate transcription []. VP16 forms a transcriptional regulatory complex with two cellular proteins, the POU-domain transcription factor Oct-1 and the cell-proliferation factor HCF-1 []. VP16 is an alpha/beta protein with an unusual fold. Other transcription factors may have a similar topology.; PDB: 2K2U_B 2PHG_B 2PHE_C.
Probab=24.89  E-value=32  Score=23.84  Aligned_cols=18  Identities=50%  Similarity=0.687  Sum_probs=13.8

Q ss_pred             Cchhhhhh-hhhcCCCCCc
Q 020681          263 EPKEFETM-GQALGIPDSG  280 (322)
Q Consensus       263 e~~ef~tM-gqALGIpdSG  280 (322)
                      ..+||+.| -.||||.|-|
T Consensus        11 adfefeqmftdalgid~fg   29 (30)
T PF12149_consen   11 ADFEFEQMFTDALGIDEFG   29 (30)
T ss_dssp             CCCCHHCCCCCCCCTCCC-
T ss_pred             hhHHHHHHHhhhhCccccC
Confidence            46899988 5799998865


No 13 
>PF08009 CDP-OH_P_tran_2:  CDP-alcohol phosphatidyltransferase 2;  InterPro: IPR012616  This domain is found on CDP-alcohol phosphatidyltransferases. These enzymes catalyse the displacement of CMP from a CDP-alcohol by a second alcohol with formation of a phosphodiester bond and concomitant breaking of a phosphoride anhydride bond. 
Probab=24.11  E-value=96  Score=22.09  Aligned_cols=24  Identities=21%  Similarity=0.466  Sum_probs=20.9

Q ss_pred             EeeehhhhHHHHHHHHHHHHHHhh
Q 020681          159 HWIIPFLFLHVVYVASLVGFMAAI  182 (322)
Q Consensus       159 ~W~ipv~~LhalYva~Lv~~t~av  182 (322)
                      +|.+|+..+=++|+|.|+...|..
T Consensus         3 ~~vlpl~~~v~l~~a~Lis~PW~t   26 (39)
T PF08009_consen    3 DLVLPLILLVGLYAALLISYPWLT   26 (39)
T ss_pred             ceehHHHHHHHHHHHHHHHhhHHH
Confidence            588999999999999999888753


No 14 
>PF08228 RNase_P_pop3:  RNase P subunit Pop3;  InterPro: IPR013241 This family of fungal proteins forms a subunit of RNase P, the ribonucleoprotein enzyme that cleaves the leader sequence of precursor tRNAs to generate mature tRNAs. The structure of Pop3 has been assigned the L7Ae/L30e fold. This RNA-binding fold is also present in human RNase P subunit Rpp38, raising the possibility that Pop3p and Rpp38 are functional homologues.
Probab=23.39  E-value=33  Score=30.19  Aligned_cols=18  Identities=44%  Similarity=0.794  Sum_probs=15.4

Q ss_pred             hhhhhhcCCCCCcccccc
Q 020681          268 ETMGQALGIPDSGLLLRR  285 (322)
Q Consensus       268 ~tMgqALGIpdSG~l~~~  285 (322)
                      +++++|||+|+-|+|.=+
T Consensus       114 ~rLs~aLgi~r~g~l~v~  131 (158)
T PF08228_consen  114 ARLSEALGIPRVGILAVR  131 (158)
T ss_pred             HHHHHHhCCCCccEEEEe
Confidence            679999999999999544


No 15 
>PF08261 Carcinustatin:  Carcinustatin peptide
Probab=22.72  E-value=34  Score=17.92  Aligned_cols=8  Identities=63%  Similarity=1.340  Sum_probs=6.3

Q ss_pred             cccccchh
Q 020681           93 EGPLGFGL  100 (322)
Q Consensus        93 eGPlGFGl  100 (322)
                      .||..|||
T Consensus         1 agpy~fgl    8 (8)
T PF08261_consen    1 AGPYSFGL    8 (8)
T ss_pred             CCcccccC
Confidence            38998986


No 16 
>KOG1586 consensus Protein required for fusion of vesicles in vesicular transport, alpha-SNAP [Intracellular trafficking, secretion, and vesicular transport]
Probab=22.21  E-value=20  Score=34.88  Aligned_cols=18  Identities=50%  Similarity=0.886  Sum_probs=14.2

Q ss_pred             hHHHHHhhhhccccccCC
Q 020681          305 KFRQSFMAFADRRECSFL  322 (322)
Q Consensus       305 rFRqSfmaFADs~~~~~~  322 (322)
                      ++.+-+=+|+|||||.|+
T Consensus       220 ky~~~dP~F~dsREckfl  237 (288)
T KOG1586|consen  220 KYQELDPAFTDSRECKFL  237 (288)
T ss_pred             HHHhcCCcccccHHHHHH
Confidence            355667789999999875


No 17 
>COG4858 Uncharacterized membrane-bound protein conserved in bacteria [Function unknown]
Probab=21.34  E-value=91  Score=29.54  Aligned_cols=21  Identities=24%  Similarity=0.624  Sum_probs=19.0

Q ss_pred             HHHHHHHHHHHHHHHHHHHHh
Q 020681          195 LWQGIIVSASSIGLWVFAYLL  215 (322)
Q Consensus       195 LwkgilVsa~si~vWv~ayil  215 (322)
                      -||+++|..+|+.+|.+.|+.
T Consensus       161 ~~K~~lv~~~sm~lWi~v~i~  181 (226)
T COG4858         161 TWKYLLVAVLSMLLWIAVMIA  181 (226)
T ss_pred             hHHHHHHHHHHHHHHHHHHHH
Confidence            699999999999999988864


No 18 
>PF09163 Form-deh_trans:  Formate dehydrogenase N, transmembrane;  InterPro: IPR015246 The transmembrane domain of the beta subunit of formate dehydrogenase consists of a single transmembrane helix. This domain acts as a transmembrane anchor, allowing the conduction of electrons within the protein. ; PDB: 1KQG_B 1KQF_B.
Probab=21.21  E-value=94  Score=22.73  Aligned_cols=23  Identities=22%  Similarity=0.426  Sum_probs=17.7

Q ss_pred             hHHHHHHHHHHHHHHHHHHHHHH
Q 020681          191 ELRDLWQGIIVSASSIGLWVFAY  213 (322)
Q Consensus       191 ElkdLwkgilVsa~si~vWv~ay  213 (322)
                      ..-+||||++=-..++++|.++-
T Consensus         3 ~~V~lWKg~~Kpl~~~~~~~~~~   25 (44)
T PF09163_consen    3 PSVTLWKGVLKPLGAAGMGATAA   25 (44)
T ss_dssp             HHHHHHHTTHHHHHHHHHHHHHH
T ss_pred             chHHHHhhhHHHHHHHHHHHHHH
Confidence            34579999998888888887763


No 19 
>PF04942 CC:  CC domain;  InterPro: IPR007026 This short domain contains four conserved cysteines that are probably required for the formation of two disulphide bonds. The domain is only found in proteins from Caenorhabditis species. The domain is named after the characteristic CC motif.
Probab=20.54  E-value=20  Score=24.80  Aligned_cols=13  Identities=38%  Similarity=1.035  Sum_probs=10.7

Q ss_pred             ccccCCcchhHHH
Q 020681            7 AVKGGCPTDYIAL   19 (322)
Q Consensus         7 a~~GGCpsDYvAv   19 (322)
                      |..|-||++|+.+
T Consensus        15 ai~G~CP~G~~~i   27 (36)
T PF04942_consen   15 AINGVCPSGYTVI   27 (36)
T ss_pred             CcCCcCCCCCEEE
Confidence            5779999999764


No 20 
>COG4252 Predicted transmembrane sensor domain [Signal transduction mechanisms]
Probab=20.05  E-value=1.3e+02  Score=30.15  Aligned_cols=25  Identities=40%  Similarity=0.729  Sum_probs=21.2

Q ss_pred             ceeehHhHHHHHHHHHHHHHhhhchhh
Q 020681           49 GFWIPVIQVFASFNLLLSLVLSDNFLK   75 (322)
Q Consensus        49 ~fwipvIQv~aSfnlL~Si~mS~N~lk   75 (322)
                      |.|||+|=.+..+  +.+.+-+.|+++
T Consensus       371 gwwiP~ip~ll~l--~~~~i~~~~~~~  395 (400)
T COG4252         371 GWWIPLIPPLLAL--VGSGIWSTLFLK  395 (400)
T ss_pred             hccccchHHHHHH--HHHHHHHHHHHH
Confidence            4599999988887  788888888887


Done!