Query         psy7578
Match_columns 280
No_of_seqs    295 out of 1206
Neff          5.4 
Searched_HMMs 46136
Date          Fri Aug 16 22:01:16 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy7578.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/7578hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PTZ00199 high mobility group p  99.8   6E-21 1.3E-25  151.1  10.6   77   64-140    14-92  (94)
  2 cd01389 MATA_HMG-box MATA_HMG-  99.8 5.7E-21 1.2E-25  145.0   9.8   76   72-147     1-76  (77)
  3 KOG0527|consensus               99.8 1.7E-20 3.7E-25  178.1   8.3   85   67-151    57-141 (331)
  4 cd01388 SOX-TCF_HMG-box SOX-TC  99.8 2.6E-19 5.6E-24  134.4   9.4   70   72-141     1-70  (72)
  5 cd01390 HMGB-UBF_HMG-box HMGB-  99.8 1.8E-18   4E-23  125.8   9.1   66   73-138     1-66  (66)
  6 PF00505 HMG_box:  HMG (high mo  99.8 3.4E-18 7.3E-23  125.6   8.1   68   73-140     1-68  (69)
  7 smart00398 HMG high mobility g  99.7 1.7E-17 3.8E-22  121.2   9.6   69   72-140     1-69  (70)
  8 cd00084 HMG-box High Mobility   99.7   9E-17 1.9E-21  116.1   9.1   65   73-137     1-65  (66)
  9 PF09011 HMG_box_2:  HMG-box do  99.7   6E-17 1.3E-21  121.8   8.3   71   70-140     1-72  (73)
 10 COG5648 NHP6B Chromatin-associ  99.7 9.6E-17 2.1E-21  142.9   8.9   77   64-140    62-138 (211)
 11 KOG0381|consensus               99.7 5.7E-16 1.2E-20  121.3   9.6   70   71-140    21-91  (96)
 12 KOG0526|consensus               99.5 1.2E-14 2.6E-19  143.6   8.0   75   63-141   526-600 (615)
 13 KOG0528|consensus               99.4 8.8E-14 1.9E-18  136.2   2.8   81   65-145   318-398 (511)
 14 KOG3248|consensus               99.1 9.4E-11   2E-15  111.0   6.6   72   69-140   188-259 (421)
 15 KOG4715|consensus               98.9 2.4E-09 5.3E-14  100.8   7.5   75   68-142    60-134 (410)
 16 KOG2746|consensus               98.7 9.8E-09 2.1E-13  104.4   4.6   70   67-136   176-247 (683)
 17 PF14887 HMG_box_5:  HMG (high   98.1 2.1E-05 4.5E-10   60.7   8.7   68   73-141     4-71  (85)
 18 COG5648 NHP6B Chromatin-associ  97.2 0.00016 3.5E-09   65.2   1.5  113   14-139    96-210 (211)
 19 PF06382 DUF1074:  Protein of u  97.0  0.0024 5.3E-08   56.4   7.3   49   77-129    83-131 (183)
 20 PF04690 YABBY:  YABBY protein;  96.8  0.0028   6E-08   55.8   5.6   48   69-116   118-165 (170)
 21 PF08073 CHDNT:  CHDNT (NUC034)  94.5   0.052 1.1E-06   39.4   3.6   40   77-116    13-52  (55)
 22 PF04769 MAT_Alpha1:  Mating-ty  94.1    0.13 2.7E-06   46.5   6.1   55   68-128    39-93  (201)
 23 PF06244 DUF1014:  Protein of u  89.8    0.59 1.3E-05   39.1   4.7   48   70-117    69-117 (122)
 24 PRK15117 ABC transporter perip  84.4     4.5 9.7E-05   36.5   7.6   47   95-141    65-113 (211)
 25 TIGR03481 HpnM hopanoid biosyn  84.4     2.1 4.5E-05   38.3   5.3   46   96-141    62-109 (198)
 26 PF05494 Tol_Tol_Ttg2:  Toluene  69.1      12 0.00027   31.9   5.6   46   96-141    36-83  (170)
 27 PF12881 NUT_N:  NUT protein N   61.9      19 0.00041   34.8   5.7   55   77-131   229-283 (328)
 28 KOG3223|consensus               51.0      21 0.00046   32.4   3.9   55   69-126   160-215 (221)
 29 PF01352 KRAB:  KRAB box;  Inte  37.8      27 0.00059   23.6   2.0   29   99-127     2-31  (41)
 30 PF06945 DUF1289:  Protein of u  35.6      50  0.0011   23.2   3.1   25   99-128    23-47  (51)
 31 PF12650 DUF3784:  Domain of un  34.1      27 0.00058   27.3   1.7   15  111-125    26-40  (97)
 32 COG2854 Ttg2D ABC-type transpo  33.3      66  0.0014   29.3   4.3   51   96-146    68-120 (202)
 33 PF13875 DUF4202:  Domain of un  32.2      95  0.0021   27.9   5.0   40   78-120   130-169 (185)
 34 PRK09706 transcriptional repre  31.6      82  0.0018   25.8   4.3   38  103-140    88-125 (135)
 35 PRK12751 cpxP periplasmic stre  29.6 1.1E+02  0.0024   26.8   4.9   33  103-135   119-151 (162)
 36 PRK10236 hypothetical protein;  28.9      54  0.0012   30.6   3.0   22  103-124   118-139 (237)
 37 PF11304 DUF3106:  Protein of u  28.1   2E+02  0.0043   23.2   5.9   37   90-126    33-71  (107)
 38 PF11304 DUF3106:  Protein of u  28.0      94   0.002   25.1   4.0   26  102-127    11-36  (107)
 39 cd07081 ALDH_F20_ACDH_EutE-lik  27.1 1.2E+02  0.0026   30.3   5.3   39  102-140     6-44  (439)
 40 KOG1827|consensus               26.9     5.1 0.00011   42.0  -4.4   44   76-119   552-595 (629)
 41 smart00500 SFM Splicing Factor  26.0      27 0.00059   24.2   0.4   23  199-221     2-24  (44)
 42 PF12356 DUF3643:  Protein of u  25.7      35 0.00076   30.1   1.1   14  233-246    76-89  (165)
 43 PF08367 M16C_assoc:  Peptidase  23.6 1.4E+02   0.003   27.1   4.7   32  101-132    13-44  (248)
 44 cd07749 NT_Pol-beta-like_1 Nuc  22.4      61  0.0013   28.3   2.0   29  206-238     2-30  (156)
 45 PF05388 Carbpep_Y_N:  Carboxyp  22.4 1.4E+02  0.0029   24.7   4.0   29  100-128    45-73  (113)
 46 KOG1610|consensus               22.3 2.2E+02  0.0047   27.8   5.9   49   82-130   187-248 (322)
 47 PRK12750 cpxP periplasmic repr  20.8 2.3E+02  0.0049   24.8   5.3   35  104-138   127-161 (170)
 48 cd07133 ALDH_CALDH_CalB Conife  20.6 1.8E+02  0.0039   28.7   5.1   39  102-140     5-43  (434)
 49 cd07132 ALDH_F3AB Aldehyde deh  20.4 1.8E+02   0.004   28.8   5.2   39  102-140     5-43  (443)
 50 cd07661 BAR_ICA69 The Bin/Amph  20.3      54  0.0012   29.9   1.3   65  208-275    40-106 (204)
 51 PRK10363 cpxP periplasmic repr  20.3 2.2E+02  0.0047   25.2   5.0   37  102-139   112-148 (166)
 52 PRK14830 undecaprenyl pyrophos  20.2 3.8E+02  0.0082   25.0   6.9   58  200-257    84-141 (251)

No 1  
>PTZ00199 high mobility group protein; Provisional
Probab=99.85  E-value=6e-21  Score=151.09  Aligned_cols=77  Identities=26%  Similarity=0.502  Sum_probs=73.3

Q ss_pred             cccccCCCCCCCCCChHHHHHHHHHHHHHHhCCCCC--HHHHHHHHHHHHcCCChhhHHHHHHHHHHHHHHHHHHHhhC
Q psy7578          64 EDDLSQESIEKARFTAYMLWAKQIRQKLIKSNPEMD--FSQVSKKLGELWHTVPFNEKYGWKRQADRLAAKYTQKMSKA  140 (280)
Q Consensus        64 ~~~~~d~~~PKRP~sAY~LF~ke~R~kik~e~P~ls--~~eIsK~Lge~Wk~LseeEK~~Y~~kA~~~kekY~~ema~~  140 (280)
                      ++..+|+++||||+||||+|++++|.+|..+||+++  +.+|+++||++|++|+++||++|+++|+.++++|.++|.+|
T Consensus        14 ~k~~kdp~~PKrP~sAY~~F~~~~R~~i~~~~P~~~~~~~evsk~ige~Wk~ls~eeK~~y~~~A~~dk~rY~~e~~~Y   92 (94)
T PTZ00199         14 KRKKKDPNAPKRALSAYMFFAKEKRAEIIAENPELAKDVAAVGKMVGEAWNKLSEEEKAPYEKKAQEDKVRYEKEKAEY   92 (94)
T ss_pred             CCCCCCCCCCCCCCcHHHHHHHHHHHHHHHHCcCCcccHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            345679999999999999999999999999999986  89999999999999999999999999999999999999987


No 2  
>cd01389 MATA_HMG-box MATA_HMG-box, class I member of the HMG-box superfamily of DNA-binding proteins. These proteins contain a single HMG box, and bind the minor groove of DNA in a highly sequence-specific manner. Members include the fungal mating type gene products MC, MATA1 and Ste11.
Probab=99.85  E-value=5.7e-21  Score=145.03  Aligned_cols=76  Identities=17%  Similarity=0.279  Sum_probs=73.4

Q ss_pred             CCCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHcCCChhhHHHHHHHHHHHHHHHHHHHhhCCCCCCCc
Q psy7578          72 IEKARFTAYMLWAKQIRQKLIKSNPEMDFSQVSKKLGELWHTVPFNEKYGWKRQADRLAAKYTQKMSKAPAQKTKS  147 (280)
Q Consensus        72 ~PKRP~sAY~LF~ke~R~kik~e~P~ls~~eIsK~Lge~Wk~LseeEK~~Y~~kA~~~kekY~~ema~~~~~p~Kk  147 (280)
                      +||||+||||||+++.|.+++.+||++++.||+++||++|++|+++||++|.++|++++++|.+++.+|...|+++
T Consensus         1 ~~kRP~naf~lf~~~~r~~~~~~~p~~~~~eisk~~g~~Wk~ls~eeK~~y~~~A~~~k~~~~~~~p~Yky~p~~~   76 (77)
T cd01389           1 KIPRPRNAFILYRQDKHAQLKTENPGLTNNEISRIIGRMWRSESPEVKAYYKELAEEEKERHAREYPDYKYTPRKS   76 (77)
T ss_pred             CCCCCCcHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHhhCCHHHHHHHHHHHHHHHHHHHHHCCCCcccCCCC
Confidence            6899999999999999999999999999999999999999999999999999999999999999999998888774


No 3  
>KOG0527|consensus
Probab=99.82  E-value=1.7e-20  Score=178.12  Aligned_cols=85  Identities=27%  Similarity=0.441  Sum_probs=80.2

Q ss_pred             ccCCCCCCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHcCCChhhHHHHHHHHHHHHHHHHHHHhhCCCCCCC
Q psy7578          67 LSQESIEKARFTAYMLWAKQIRQKLIKSNPEMDFSQVSKKLGELWHTVPFNEKYGWKRQADRLAAKYTQKMSKAPAQKTK  146 (280)
Q Consensus        67 ~~d~~~PKRP~sAY~LF~ke~R~kik~e~P~ls~~eIsK~Lge~Wk~LseeEK~~Y~~kA~~~kekY~~ema~~~~~p~K  146 (280)
                      +...+++||||||||+|++.+|++|.++||++++.||||+||.+||.|+++||.+|.++|+++|++|.++..+|.+.|++
T Consensus        57 k~~~~hIKRPMNAFMVWSq~~RRkma~qnP~mHNSEISK~LG~~WK~Lse~EKrPFi~EAeRLR~~HmkehPdYKYRPRR  136 (331)
T KOG0527|consen   57 KTSTDRIKRPMNAFMVWSQGQRRKLAKQNPKMHNSEISKRLGAEWKLLSEEEKRPFVDEAERLRAQHMKEYPDYKYRPRR  136 (331)
T ss_pred             CCCccccCCCcchhhhhhHHHHHHHHHhCcchhhHHHHHHHHHHHhhcCHhhhccHHHHHHHHHHHHHHhCCCccccccc
Confidence            35678999999999999999999999999999999999999999999999999999999999999999999999999998


Q ss_pred             ccCCC
Q psy7578         147 STYTP  151 (280)
Q Consensus       147 k~~~~  151 (280)
                      |.+..
T Consensus       137 Kkk~~  141 (331)
T KOG0527|consen  137 KKKKR  141 (331)
T ss_pred             ccccc
Confidence            75443


No 4  
>cd01388 SOX-TCF_HMG-box SOX-TCF_HMG-box, class I member of the HMG-box superfamily of DNA-binding proteins. These proteins contain a single HMG box, and bind the minor groove of DNA in a highly sequence-specific manner. Members include SRY and its homologs in insects and vertebrates, and transcription factor-like proteins, TCF-1, -3, -4, and LEF-1. They appear to bind the minor groove of the A/T C A A A G/C-motif.
Probab=99.80  E-value=2.6e-19  Score=134.43  Aligned_cols=70  Identities=24%  Similarity=0.386  Sum_probs=67.8

Q ss_pred             CCCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHcCCChhhHHHHHHHHHHHHHHHHHHHhhCC
Q psy7578          72 IEKARFTAYMLWAKQIRQKLIKSNPEMDFSQVSKKLGELWHTVPFNEKYGWKRQADRLAAKYTQKMSKAP  141 (280)
Q Consensus        72 ~PKRP~sAY~LF~ke~R~kik~e~P~ls~~eIsK~Lge~Wk~LseeEK~~Y~~kA~~~kekY~~ema~~~  141 (280)
                      ++|||+||||+|++++|.+++.+||++++.||+++||++|+.|+++||++|.++|++++++|.+++..|.
T Consensus         1 ~iKrP~naf~~F~~~~r~~~~~~~p~~~~~eisk~l~~~Wk~ls~~eK~~y~~~a~~~k~~y~~~~p~y~   70 (72)
T cd01388           1 HIKRPMNAFMLFSKRHRRKVLQEYPLKENRAISKILGDRWKALSNEEKQPYYEEAKKLKELHMKLYPDYK   70 (72)
T ss_pred             CCCCCCcHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHHHHCcCCC
Confidence            5799999999999999999999999999999999999999999999999999999999999999998873


No 5  
>cd01390 HMGB-UBF_HMG-box HMGB-UBF_HMG-box, class II and III members of the HMG-box superfamily of DNA-binding proteins. These proteins bind the minor groove of DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III members include nucleolar and mitochondrial transcription factors, UBF and mtTF1, which bind four-way DNA junctions.
Probab=99.77  E-value=1.8e-18  Score=125.79  Aligned_cols=66  Identities=32%  Similarity=0.609  Sum_probs=64.3

Q ss_pred             CCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHcCCChhhHHHHHHHHHHHHHHHHHHHh
Q psy7578          73 EKARFTAYMLWAKQIRQKLIKSNPEMDFSQVSKKLGELWHTVPFNEKYGWKRQADRLAAKYTQKMS  138 (280)
Q Consensus        73 PKRP~sAY~LF~ke~R~kik~e~P~ls~~eIsK~Lge~Wk~LseeEK~~Y~~kA~~~kekY~~ema  138 (280)
                      ||||+|||++|++++|.+++.+||++++.+|++.||++|++|+++||++|.++|++++++|+++|.
T Consensus         1 Pkrp~saf~~f~~~~r~~~~~~~p~~~~~~i~~~~~~~W~~ls~~eK~~y~~~a~~~~~~y~~e~~   66 (66)
T cd01390           1 PKRPLSAYFLFSQEQRPKLKKENPDASVTEVTKILGEKWKELSEEEKKKYEEKAEKDKERYEKEMK   66 (66)
T ss_pred             CCCCCcHHHHHHHHHHHHHHHHCcCCCHHHHHHHHHHHHHhCCHHHHHHHHHHHHHHHHHHHHhhC
Confidence            799999999999999999999999999999999999999999999999999999999999999873


No 6  
>PF00505 HMG_box:  HMG (high mobility group) box;  InterPro: IPR000910 High mobility group (HMG or HMGB) proteins are a family of relatively low molecular weight non-histone components in chromatin. HMG1 (also called HMG-T in fish) and HMG2 are two highly related proteins that bind single-stranded DNA preferentially and unwind double-stranded DNA. Although they have no sequence specificity, they have a high affinity for bent or distorted DNA, and bend linear DNA. HMG1 and HMG2 contain two DNA-binding HMG-box domains (A and B) that show structural and functional differences, and have a long acidic C-terminal domain rich in aspartic and glutamic acid residues. The acidic tail modulates the affinity of the tandem HMG boxes in HMG1 and 2 for a variety of DNA targets. HMG1 and 2 appear to play important architectural roles in the assembly of nucleoprotein complexes in a variety of biological processes, for example V(D)J recombination, the initiation of transcription, and DNA repair []. The profile in this entry describing the HMG-domains is much more general than the signature. In addition to the HMG1 and HMG2 proteins, HMG-domains occur in single or multiple copies in the following protein classes; the SOX family of transcription factors; SRY sex determining region Y protein and related proteins []; LEF1 lymphoid enhancer binding factor 1 []; SSRP recombination signal recognition protein; MTF1 mitochondrial transcription factor 1; UBF1/2 nucleolar transcription factors; Abf2 yeast ARS-binding factor []; and Saccharomyces cerevisiae transcription factors Ixr1, Rox1, Nhp6a, Nhp6b and Spp41.; GO: 0003677 DNA binding; PDB: 1I11_A 1J3C_A 1J3D_A 1WZ6_A 1WGF_A 2D7L_A 1GT0_D 3U2B_C 2CRJ_A 2CS1_A ....
Probab=99.75  E-value=3.4e-18  Score=125.61  Aligned_cols=68  Identities=29%  Similarity=0.536  Sum_probs=64.3

Q ss_pred             CCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHcCCChhhHHHHHHHHHHHHHHHHHHHhhC
Q psy7578          73 EKARFTAYMLWAKQIRQKLIKSNPEMDFSQVSKKLGELWHTVPFNEKYGWKRQADRLAAKYTQKMSKA  140 (280)
Q Consensus        73 PKRP~sAY~LF~ke~R~kik~e~P~ls~~eIsK~Lge~Wk~LseeEK~~Y~~kA~~~kekY~~ema~~  140 (280)
                      ||||+|||++|++++|..++.+||++++.+|++++|++|++|+++||++|.++|++++++|++++..|
T Consensus         1 PkrP~~af~lf~~~~~~~~k~~~p~~~~~~i~~~~~~~W~~l~~~eK~~y~~~a~~~~~~y~~~~~~y   68 (69)
T PF00505_consen    1 PKRPPNAFMLFCKEKRAKLKEENPDLSNKEISKILAQMWKNLSEEEKAPYKEEAEEEKERYEKEMPEY   68 (69)
T ss_dssp             SSSS--HHHHHHHHHHHHHHHHSTTSTHHHHHHHHHHHHHCSHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             CcCCCCHHHHHHHHHHHHHHHHhcccccccchhhHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence            79999999999999999999999999999999999999999999999999999999999999999765


No 7  
>smart00398 HMG high mobility group.
Probab=99.73  E-value=1.7e-17  Score=121.16  Aligned_cols=69  Identities=29%  Similarity=0.548  Sum_probs=67.0

Q ss_pred             CCCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHcCCChhhHHHHHHHHHHHHHHHHHHHhhC
Q psy7578          72 IEKARFTAYMLWAKQIRQKLIKSNPEMDFSQVSKKLGELWHTVPFNEKYGWKRQADRLAAKYTQKMSKA  140 (280)
Q Consensus        72 ~PKRP~sAY~LF~ke~R~kik~e~P~ls~~eIsK~Lge~Wk~LseeEK~~Y~~kA~~~kekY~~ema~~  140 (280)
                      +||||+|||++|++++|..++.+||++++.+|++.+|++|+.|++++|++|.++|++++++|.+++..|
T Consensus         1 ~pkrp~~~y~~f~~~~r~~~~~~~~~~~~~~i~~~~~~~W~~l~~~ek~~y~~~a~~~~~~y~~~~~~y   69 (70)
T smart00398        1 KPKRPMSAFMLFSQENRAKIKAENPDLSNAEISKKLGERWKLLSEEEKAPYEEKAKKDKERYEEEMPEY   69 (70)
T ss_pred             CcCCCCcHHHHHHHHHHHHHHHHCcCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence            489999999999999999999999999999999999999999999999999999999999999999876


No 8  
>cd00084 HMG-box High Mobility Group (HMG)-box is found in a variety of eukaryotic chromosomal proteins and transcription factors. HMGs bind to the minor groove of DNA and have been classified by DNA binding preferences. Two phylogenically distinct groups of Class I proteins bind DNA in a sequence specific fashion and contain a single HMG box. One group (SOX-TCF) includes transcription factors, TCF-1, -3, -4; and also SRY and LEF-1, which bind four-way DNA junctions and duplex DNA targets. The second group (MATA) includes fungal mating type gene products MC, MATA1 and Ste11. Class II and III proteins (HMGB-UBF) bind DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III member
Probab=99.70  E-value=9e-17  Score=116.10  Aligned_cols=65  Identities=28%  Similarity=0.606  Sum_probs=63.4

Q ss_pred             CCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHcCCChhhHHHHHHHHHHHHHHHHHHH
Q psy7578          73 EKARFTAYMLWAKQIRQKLIKSNPEMDFSQVSKKLGELWHTVPFNEKYGWKRQADRLAAKYTQKM  137 (280)
Q Consensus        73 PKRP~sAY~LF~ke~R~kik~e~P~ls~~eIsK~Lge~Wk~LseeEK~~Y~~kA~~~kekY~~em  137 (280)
                      ||||+|||++|+++.|..++.+||++++.+|++.+|++|+.|++++|++|.++|+.++++|.+++
T Consensus         1 pkrp~~af~~f~~~~~~~~~~~~~~~~~~~i~~~~~~~W~~l~~~~k~~y~~~a~~~~~~y~~~~   65 (66)
T cd00084           1 PKRPLSAYFLFSQEHRAEVKAENPGLSVGEISKILGEMWKSLSEEEKKKYEEKAEKDKERYEKEM   65 (66)
T ss_pred             CCCCCcHHHHHHHHHHHHHHHHCcCCCHHHHHHHHHHHHHhCCHHHHHHHHHHHHHHHHHHHHhh
Confidence            79999999999999999999999999999999999999999999999999999999999999875


No 9  
>PF09011 HMG_box_2:  HMG-box domain;  InterPro: IPR015101 This domain is predominantly found in Maelstrom homologue proteins. It has no known function. ; GO: 0005634 nucleus; PDB: 2EQZ_A 1V64_A 2CTO_A 1H5P_A 3TQ6_A 3FGH_A 3TMM_A 1J3X_A 2YRQ_A 1AAB_A ....
Probab=99.70  E-value=6e-17  Score=121.85  Aligned_cols=71  Identities=23%  Similarity=0.409  Sum_probs=63.1

Q ss_pred             CCCCCCCCChHHHHHHHHHHHHHHh-CCCCCHHHHHHHHHHHHcCCChhhHHHHHHHHHHHHHHHHHHHhhC
Q psy7578          70 ESIEKARFTAYMLWAKQIRQKLIKS-NPEMDFSQVSKKLGELWHTVPFNEKYGWKRQADRLAAKYTQKMSKA  140 (280)
Q Consensus        70 ~~~PKRP~sAY~LF~ke~R~kik~e-~P~ls~~eIsK~Lge~Wk~LseeEK~~Y~~kA~~~kekY~~ema~~  140 (280)
                      |++||||+|||++|+++++.+++.+ .+...+.|+++.++++|++||++||.+|+++|+.++++|+++|..|
T Consensus         1 p~kpK~~~say~lF~~~~~~~~k~~G~~~~~~~e~~k~~~~~Wk~Ls~~EK~~Y~~~A~~~k~~y~~e~~~~   72 (73)
T PF09011_consen    1 PKKPKRPPSAYNLFMKEMRKEVKEEGGQKQSFREVMKEISERWKSLSEEEKEPYEERAKEDKERYEREMKEW   72 (73)
T ss_dssp             SSS--SSSSHHHHHHHHHHHHHHHHT-T-SSHHHHHHHHHHHHHHS-HHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             CcCCCCCCCHHHHHHHHHHHHHHHhcccCCCHHHHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence            5789999999999999999999988 7888999999999999999999999999999999999999999865


No 10 
>COG5648 NHP6B Chromatin-associated proteins containing the HMG domain [Chromatin structure and dynamics]
Probab=99.68  E-value=9.6e-17  Score=142.90  Aligned_cols=77  Identities=23%  Similarity=0.416  Sum_probs=74.3

Q ss_pred             cccccCCCCCCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHcCCChhhHHHHHHHHHHHHHHHHHHHhhC
Q psy7578          64 EDDLSQESIEKARFTAYMLWAKQIRQKLIKSNPEMDFSQVSKKLGELWHTVPFNEKYGWKRQADRLAAKYTQKMSKA  140 (280)
Q Consensus        64 ~~~~~d~~~PKRP~sAY~LF~ke~R~kik~e~P~ls~~eIsK~Lge~Wk~LseeEK~~Y~~kA~~~kekY~~ema~~  140 (280)
                      .++..|+|.||||+||||+|+.++|.+|++++|++.|.+|++.+|++|++|+++||++|..+|..++++|+.++..|
T Consensus        62 ~r~k~dpN~PKRp~sayf~y~~~~R~ei~~~~p~l~~~e~~k~~~e~WK~Ltd~eke~y~k~~~~~~erYq~ek~~y  138 (211)
T COG5648          62 VRKKKDPNGPKRPLSAYFLYSAENRDEIRKENPKLTFGEVGKLLSEKWKELTDEEKEPYYKEANSDRERYQREKEEY  138 (211)
T ss_pred             HHHhcCCCCCCCchhHHHHHHHHHHHHHHHhCCCCChHHHHHHHHHHHHhccHhhhhhHHHHHhhHHHHHHHHHHhh
Confidence            44567999999999999999999999999999999999999999999999999999999999999999999999998


No 11 
>KOG0381|consensus
Probab=99.66  E-value=5.7e-16  Score=121.32  Aligned_cols=70  Identities=27%  Similarity=0.580  Sum_probs=68.3

Q ss_pred             CCCCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHcCCChhhHHHHHHHHHHHHHHHHHHHh-hC
Q psy7578          71 SIEKARFTAYMLWAKQIRQKLIKSNPEMDFSQVSKKLGELWHTVPFNEKYGWKRQADRLAAKYTQKMS-KA  140 (280)
Q Consensus        71 ~~PKRP~sAY~LF~ke~R~kik~e~P~ls~~eIsK~Lge~Wk~LseeEK~~Y~~kA~~~kekY~~ema-~~  140 (280)
                      +.||||++||++|+++.|..++.+||++++.||++++|++|++|++++|++|..+|..++++|.++|. +|
T Consensus        21 ~~pkrp~sa~~~f~~~~~~~~k~~~p~~~~~~v~k~~g~~W~~l~~~~k~~y~~ka~~~k~~Y~~~~~~~~   91 (96)
T KOG0381|consen   21 QAPKRPLSAFFLFSSEQRSKIKAENPGLSVGEVAKALGEMWKNLAEEEKQPYEEKASKLKEKYEKELAGEY   91 (96)
T ss_pred             CCCCCCCcHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            59999999999999999999999999999999999999999999999999999999999999999998 66


No 12 
>KOG0526|consensus
Probab=99.54  E-value=1.2e-14  Score=143.60  Aligned_cols=75  Identities=24%  Similarity=0.455  Sum_probs=70.7

Q ss_pred             ccccccCCCCCCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHcCCChhhHHHHHHHHHHHHHHHHHHHhhCC
Q psy7578          63 DEDDLSQESIEKARFTAYMLWAKQIRQKLIKSNPEMDFSQVSKKLGELWHTVPFNEKYGWKRQADRLAAKYTQKMSKAP  141 (280)
Q Consensus        63 k~~~~~d~~~PKRP~sAY~LF~ke~R~kik~e~P~ls~~eIsK~Lge~Wk~LseeEK~~Y~~kA~~~kekY~~ema~~~  141 (280)
                      +-++.+|||+|||+++|||+|.+..|..|+.+  +++++||+|++|++|+.|+.  |.+|+++|+.+|.+|+.+|++|.
T Consensus       526 ~~kk~kdpnapkra~sa~m~w~~~~r~~ik~d--gi~~~dv~kk~g~~wk~ms~--k~~we~ka~~dk~ry~~em~~yk  600 (615)
T KOG0526|consen  526 KGKKKKDPNAPKRATSAYMLWLNASRESIKED--GISVGDVAKKAGEKWKQMSA--KEEWEDKAAVDKQRYEDEMKEYK  600 (615)
T ss_pred             CcccCCCCCCCccchhHHHHHHHhhhhhHhhc--CchHHHHHHHHhHHHhhhcc--cchhhHHHHHHHHHHHHHHHhhc
Confidence            34567899999999999999999999999988  99999999999999999999  88999999999999999999994


No 13 
>KOG0528|consensus
Probab=99.40  E-value=8.8e-14  Score=136.17  Aligned_cols=81  Identities=28%  Similarity=0.439  Sum_probs=75.5

Q ss_pred             ccccCCCCCCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHcCCChhhHHHHHHHHHHHHHHHHHHHhhCCCCC
Q psy7578          65 DDLSQESIEKARFTAYMLWAKQIRQKLIKSNPEMDFSQVSKKLGELWHTVPFNEKYGWKRQADRLAAKYTQKMSKAPAQK  144 (280)
Q Consensus        65 ~~~~d~~~PKRP~sAY~LF~ke~R~kik~e~P~ls~~eIsK~Lge~Wk~LseeEK~~Y~~kA~~~kekY~~ema~~~~~p  144 (280)
                      .+...++|+||||||||+|.++.|.+|.+.+||+.+.+|+|+||.+||.|+..||++|+++-.++-..|.+....|.+.|
T Consensus       318 g~~ss~PHIKRPMNAFMVWAkDERRKILqA~PDMHNSnISKILGSRWKaMSN~eKQPYYEEQaRLSk~HlEk~PdYrYkP  397 (511)
T KOG0528|consen  318 GRASSEPHIKRPMNAFMVWAKDERRKILQAFPDMHNSNISKILGSRWKAMSNTEKQPYYEEQARLSKLHLEKYPDYRYKP  397 (511)
T ss_pred             CcCCCCccccCCcchhhcccchhhhhhhhcCccccccchhHHhcccccccccccccchHHHHHHHHHhhhccCcccccCC
Confidence            35567899999999999999999999999999999999999999999999999999999999999999999888887766


Q ss_pred             C
Q psy7578         145 T  145 (280)
Q Consensus       145 ~  145 (280)
                      +
T Consensus       398 R  398 (511)
T KOG0528|consen  398 R  398 (511)
T ss_pred             C
Confidence            6


No 14 
>KOG3248|consensus
Probab=99.12  E-value=9.4e-11  Score=111.02  Aligned_cols=72  Identities=19%  Similarity=0.300  Sum_probs=66.0

Q ss_pred             CCCCCCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHcCCChhhHHHHHHHHHHHHHHHHHHHhhC
Q psy7578          69 QESIEKARFTAYMLWAKQIRQKLIKSNPEMDFSQVSKKLGELWHTVPFNEKYGWKRQADRLAAKYTQKMSKA  140 (280)
Q Consensus        69 d~~~PKRP~sAY~LF~ke~R~kik~e~P~ls~~eIsK~Lge~Wk~LseeEK~~Y~~kA~~~kekY~~ema~~  140 (280)
                      .+.++|+|+||||+|++|+|.++..|+---.-.+|.++||++|.+||.||..+|+++|+++++-|.+....+
T Consensus       188 KkphiKKPLNAFmlyMKEmRa~vvaEctlKeSAaiNqiLGrRWH~LSrEEQAKYyElArKerqlH~qlYP~W  259 (421)
T KOG3248|consen  188 KKPHIKKPLNAFMLYMKEMRAKVVAECTLKESAAINQILGRRWHALSREEQAKYYELARKERQLHMQLYPGW  259 (421)
T ss_pred             cCccccccHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHhHHHhhhhHHHHHHHHHHHHHHHHHHHHhcCCc
Confidence            456899999999999999999999999766688999999999999999999999999999999998876544


No 15 
>KOG4715|consensus
Probab=98.92  E-value=2.4e-09  Score=100.82  Aligned_cols=75  Identities=19%  Similarity=0.407  Sum_probs=70.2

Q ss_pred             cCCCCCCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHcCCChhhHHHHHHHHHHHHHHHHHHHhhCCC
Q psy7578          68 SQESIEKARFTAYMLWAKQIRQKLIKSNPEMDFSQVSKKLGELWHTVPFNEKYGWKRQADRLAAKYTQKMSKAPA  142 (280)
Q Consensus        68 ~d~~~PKRP~sAY~LF~ke~R~kik~e~P~ls~~eIsK~Lge~Wk~LseeEK~~Y~~kA~~~kekY~~ema~~~~  142 (280)
                      +.+.-|-+|+-.||.|++..|++++..||++...||.|+||.||..|+++||+.|..+++..|..|++.|+.|-.
T Consensus        60 kpPkppekpl~pymrySrkvWd~VkA~nPe~kLWeiGK~Ig~mW~dLpd~EK~ey~~EYeaEKieY~~smkayh~  134 (410)
T KOG4715|consen   60 KPPKPPEKPLMPYMRYSRKVWDQVKASNPELKLWEIGKIIGGMWLDLPDEEKQEYLNEYEAEKIEYNESMKAYHN  134 (410)
T ss_pred             CCCCCCCcccchhhHHhhhhhhhhhccCcchHHHHHHHHHHHHHhhCcchHHHHHHHHHHHHHHHHHHHHHHhhC
Confidence            346677889999999999999999999999999999999999999999999999999999999999999998743


No 16 
>KOG2746|consensus
Probab=98.72  E-value=9.8e-09  Score=104.36  Aligned_cols=70  Identities=23%  Similarity=0.339  Sum_probs=66.4

Q ss_pred             ccCCCCCCCCCChHHHHHHHHH--HHHHHhCCCCCHHHHHHHHHHHHcCCChhhHHHHHHHHHHHHHHHHHH
Q psy7578          67 LSQESIEKARFTAYMLWAKQIR--QKLIKSNPEMDFSQVSKKLGELWHTVPFNEKYGWKRQADRLAAKYTQK  136 (280)
Q Consensus        67 ~~d~~~PKRP~sAY~LF~ke~R--~kik~e~P~ls~~eIsK~Lge~Wk~LseeEK~~Y~~kA~~~kekY~~e  136 (280)
                      +.+++|++||||||++|++.+|  ..+.+.||+.+++-|+++|||+|-.|-+.||+.|.++|.+.++.|-+.
T Consensus       176 kr~k~HirrPMnaf~ifskrhr~~g~vhq~~pn~DNrtIskiLgewWytL~~~Ekq~yhdLa~Qvk~Ahfka  247 (683)
T KOG2746|consen  176 KRDKDHIRRPMNAFHIFSKRHRGEGRVHQRHPNQDNRTISKILGEWWYTLGPNEKQKYHDLAFQVKEAHFKA  247 (683)
T ss_pred             cCcchhhhhhhHHHHHHHhhcCCccchhccCccccchhHHHHHhhhHhhhCchhhhhHHHHHHHHHHHHhhh
Confidence            4678899999999999999999  889999999999999999999999999999999999999999998874


No 17 
>PF14887 HMG_box_5:  HMG (high mobility group) box 5; PDB: 1L8Y_A 1L8Z_A 2HDZ_A.
Probab=98.12  E-value=2.1e-05  Score=60.72  Aligned_cols=68  Identities=15%  Similarity=0.289  Sum_probs=58.9

Q ss_pred             CCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHcCCChhhHHHHHHHHHHHHHHHHHHHhhCC
Q psy7578          73 EKARFTAYMLWAKQIRQKLIKSNPEMDFSQVSKKLGELWHTVPFNEKYGWKRQADRLAAKYTQKMSKAP  141 (280)
Q Consensus        73 PKRP~sAY~LF~ke~R~kik~e~P~ls~~eIsK~Lge~Wk~LseeEK~~Y~~kA~~~kekY~~ema~~~  141 (280)
                      |..|-+|--+|.+.....+...+++-...+ -+.+...|++|++.+|-+|..+|.++..+|+.+|.++.
T Consensus         4 PE~PKt~qe~Wqq~vi~dYla~~~~dr~K~-~kam~~~W~~me~Kekl~WIkKA~EdqKrYE~el~e~r   71 (85)
T PF14887_consen    4 PETPKTAQEIWQQSVIGDYLAKFRNDRKKA-LKAMEAQWSQMEKKEKLKWIKKAAEDQKRYERELREMR   71 (85)
T ss_dssp             S----THHHHHHHHHHHHHHHHTTSTHHHH-HHHHHHHHHTTGGGHHHHHHHHHHHHHHHHHHHHHCCS
T ss_pred             CCCCCCHHHHHHHHHHHHHHHHhhHhHHHH-HHHHHHHHHHhhhhhhhHHHHHHHHHHHHHHHHHHHHh
Confidence            667789999999999999999998877776 56999999999999999999999999999999999983


No 18 
>COG5648 NHP6B Chromatin-associated proteins containing the HMG domain [Chromatin structure and dynamics]
Probab=97.17  E-value=0.00016  Score=65.18  Aligned_cols=113  Identities=13%  Similarity=0.234  Sum_probs=83.4

Q ss_pred             ccccccch--hhhhhhhccCCCCCCccccccCCcccccccCCCCcccccccccccccCCCCCCCCCChHHHHHHHHHHHH
Q psy7578          14 VNISDISD--SELYNSAQAVGSGADEEFGEADSSLDAESIGSGDLADNLLIDEDDLSQESIEKARFTAYMLWAKQIRQKL   91 (280)
Q Consensus        14 v~~se~sk--se~~~t~s~~~~g~~e~~~~a~s~~~~~s~g~~k~~kk~~~k~~~~~d~~~PKRP~sAY~LF~ke~R~ki   91 (280)
                      +.|.++++  +|.|+.-.+.+.......+..+..    -|...+       ..  -.....+++|.-.|.-+..+.|.++
T Consensus        96 l~~~e~~k~~~e~WK~Ltd~eke~y~k~~~~~~e----rYq~ek-------~~--y~~k~~~~~~~~~~~e~~~~~r~~~  162 (211)
T COG5648          96 LTFGEVGKLLSEKWKELTDEEKEPYYKEANSDRE----RYQREK-------EE--YNKKLPNKAPIGPFIENEPKIRPKV  162 (211)
T ss_pred             CChHHHHHHHHHHHHhccHhhhhhHHHHHhhHHH----HHHHHH-------Hh--hhcccCCCCCCchhhhccHHhcccc
Confidence            46788888  999998766555544433322211    010000       00  1112346778888888999999999


Q ss_pred             HHhCCCCCHHHHHHHHHHHHcCCChhhHHHHHHHHHHHHHHHHHHHhh
Q psy7578          92 IKSNPEMDFSQVSKKLGELWHTVPFNEKYGWKRQADRLAAKYTQKMSK  139 (280)
Q Consensus        92 k~e~P~ls~~eIsK~Lge~Wk~LseeEK~~Y~~kA~~~kekY~~ema~  139 (280)
                      ...+|+....+..++++..|++|+++-|.+|.+.++.++.+|...+.+
T Consensus       163 ~~~~~~~~~~e~~k~~~~~w~el~~skK~~~~~~~Kk~k~~~~~~~~~  210 (211)
T COG5648         163 EGPSPDKALVEETKIISKAWSELDESKKKKYIDKYKKLKEEYDSFYPE  210 (211)
T ss_pred             CCCCcchhhhHHhhhhhhhhhhhChhhhhHHHHHHHHHHHHHhhhccc
Confidence            999999999999999999999999999999999999999999887653


No 19 
>PF06382 DUF1074:  Protein of unknown function (DUF1074);  InterPro: IPR024460 This family consists of several proteins which appear to be specific to Insecta. The function of this family is unknown.
Probab=97.00  E-value=0.0024  Score=56.37  Aligned_cols=49  Identities=18%  Similarity=0.369  Sum_probs=43.9

Q ss_pred             CChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHcCCChhhHHHHHHHHHHH
Q psy7578          77 FTAYMLWAKQIRQKLIKSNPEMDFSQVSKKLGELWHTVPFNEKYGWKRQADRL  129 (280)
Q Consensus        77 ~sAY~LF~ke~R~kik~e~P~ls~~eIsK~Lge~Wk~LseeEK~~Y~~kA~~~  129 (280)
                      -+||+-|.+++|.+    |.+++..|+....+.+|..|+++||..|+.++...
T Consensus        83 nnaYLNFLReFRrk----h~~L~p~dlI~~AAraW~rLSe~eK~rYrr~~~~~  131 (183)
T PF06382_consen   83 NNAYLNFLREFRRK----HCGLSPQDLIQRAARAWCRLSEAEKNRYRRMAPSV  131 (183)
T ss_pred             chHHHHHHHHHHHH----ccCCCHHHHHHHHHHHHHhCCHHHHHHHHhhcchh
Confidence            47899999998874    78999999999999999999999999999976643


No 20 
>PF04690 YABBY:  YABBY protein;  InterPro: IPR006780 YABBY proteins are a group of plant-specific transcription factors involved in the specification of abaxial polarity in lateral organs such as leaves and floral organs [, ].
Probab=96.75  E-value=0.0028  Score=55.79  Aligned_cols=48  Identities=17%  Similarity=0.299  Sum_probs=42.9

Q ss_pred             CCCCCCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHcCCCh
Q psy7578          69 QESIEKARFTAYMLWAKQIRQKLIKSNPEMDFSQVSKKLGELWHTVPF  116 (280)
Q Consensus        69 d~~~PKRP~sAY~LF~ke~R~kik~e~P~ls~~eIsK~Lge~Wk~Lse  116 (280)
                      .+.+..|-++||..|+++.-++||.+||+++..|.-+..+..|+..+.
T Consensus       118 PPEKRqR~psaYn~f~k~ei~rik~~~p~ishkeaFs~aAknW~h~ph  165 (170)
T PF04690_consen  118 PPEKRQRVPSAYNRFMKEEIQRIKAENPDISHKEAFSAAAKNWAHFPH  165 (170)
T ss_pred             CccccCCCchhHHHHHHHHHHHHHhcCCCCCHHHHHHHHHHhhhhCcc
Confidence            345566889999999999999999999999999999999999987653


No 21 
>PF08073 CHDNT:  CHDNT (NUC034) domain;  InterPro: IPR012958 The CHD N-terminal domain is found in PHD/RING fingers and chromo domain-associated helicases [].; GO: 0003677 DNA binding, 0005524 ATP binding, 0008270 zinc ion binding, 0016818 hydrolase activity, acting on acid anhydrides, in phosphorus-containing anhydrides, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=94.46  E-value=0.052  Score=39.44  Aligned_cols=40  Identities=25%  Similarity=0.458  Sum_probs=35.8

Q ss_pred             CChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHcCCCh
Q psy7578          77 FTAYMLWAKQIRQKLIKSNPEMDFSQVSKKLGELWHTVPF  116 (280)
Q Consensus        77 ~sAY~LF~ke~R~kik~e~P~ls~~eIsK~Lge~Wk~Lse  116 (280)
                      .+-|-+|.+..|+.|.+.||++..+.+...++.+|+.-++
T Consensus        13 lt~yK~Fsq~vRP~l~~~NPk~~~sKl~~l~~AKwrEF~~   52 (55)
T PF08073_consen   13 LTNYKAFSQHVRPLLAKANPKAPMSKLMMLLQAKWREFQE   52 (55)
T ss_pred             HHHHHHHHHHHHHHHHHHCCCCcHHHHHHHHHHHHHHHHh
Confidence            3568899999999999999999999999999999986543


No 22 
>PF04769 MAT_Alpha1:  Mating-type protein MAT alpha 1;  InterPro: IPR006856 This family includes Saccharomyces cerevisiae (Baker's yeast) mating type protein alpha 1 (P01365 from SWISSPROT). MAT alpha 1 is a transcription activator that activates mating-type alpha-specific genes with the help of the MADS-box containing MCM1 transcription factor, which together bind cooperatively to PQ elements upstream of alpha-specific genes. The MCM1-MATalpha1 complex is required for the proper DNA-bending that is needed for transcriptional activation []. Alpha 1 interacts in vivo with STE12, linking expression of alpha-specific genes to the alpha-pheromone (IPR006742 from INTERPRO) response pathway [].; GO: 0000772 mating pheromone activity, 0003677 DNA binding, 0045895 positive regulation of transcription, mating-type specific, 0005634 nucleus
Probab=94.12  E-value=0.13  Score=46.52  Aligned_cols=55  Identities=18%  Similarity=0.268  Sum_probs=42.7

Q ss_pred             cCCCCCCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHcCCChhhHHHHHHHHHH
Q psy7578          68 SQESIEKARFTAYMLWAKQIRQKLIKSNPEMDFSQVSKKLGELWHTVPFNEKYGWKRQADR  128 (280)
Q Consensus        68 ~d~~~PKRP~sAY~LF~ke~R~kik~e~P~ls~~eIsK~Lge~Wk~LseeEK~~Y~~kA~~  128 (280)
                      ....++|||.|+||.|+.-+-    ...|+....++|..|+..|+.=  --|..|.-.|+.
T Consensus        39 ~~~~~~kr~lN~Fm~FRsyy~----~~~~~~~Qk~~S~~l~~lW~~d--p~k~~W~l~ak~   93 (201)
T PF04769_consen   39 RSPEKAKRPLNGFMAFRSYYS----PIFPPLPQKELSGILTKLWEKD--PFKNKWSLMAKA   93 (201)
T ss_pred             ccccccccchhHHHHHHHHHH----hhcCCcCHHHHHHHHHHHHhCC--ccHhHHHHHhhh
Confidence            445678999999999987665    3568889999999999999773  335667766653


No 23 
>PF06244 DUF1014:  Protein of unknown function (DUF1014);  InterPro: IPR010422 This family consists of several hypothetical eukaryotic proteins of unknown function.
Probab=89.77  E-value=0.59  Score=39.14  Aligned_cols=48  Identities=25%  Similarity=0.421  Sum_probs=41.9

Q ss_pred             CCCCCCCC-ChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHcCCChh
Q psy7578          70 ESIEKARF-TAYMLWAKQIRQKLIKSNPEMDFSQVSKKLGELWHTVPFN  117 (280)
Q Consensus        70 ~~~PKRP~-sAY~LF~ke~R~kik~e~P~ls~~eIsK~Lge~Wk~Lsee  117 (280)
                      ..||-|.+ -||.-|...+-+.++.+||++..+++-.+|-.+|..-++.
T Consensus        69 drHPErR~KAAy~afeE~~Lp~lK~E~PgLrlsQ~kq~l~K~w~KSPeN  117 (122)
T PF06244_consen   69 DRHPERRMKAAYKAFEERRLPELKEENPGLRLSQYKQMLWKEWQKSPEN  117 (122)
T ss_pred             CCCcchhHHHHHHHHHHHHhHHHHhhCCCchHHHHHHHHHHHHhcCCCC
Confidence            45776666 6799999999999999999999999999999999876653


No 24 
>PRK15117 ABC transporter periplasmic binding protein MlaC; Provisional
Probab=84.44  E-value=4.5  Score=36.48  Aligned_cols=47  Identities=13%  Similarity=0.174  Sum_probs=41.1

Q ss_pred             CCCCCHHHHHH-HHHHHHcCCChhhHHHHHHHHHH-HHHHHHHHHhhCC
Q psy7578          95 NPEMDFSQVSK-KLGELWHTVPFNEKYGWKRQADR-LAAKYTQKMSKAP  141 (280)
Q Consensus        95 ~P~ls~~eIsK-~Lge~Wk~LseeEK~~Y~~kA~~-~kekY~~ema~~~  141 (280)
                      .|..+|..+++ .||.-|++.++++|+.|.+.-.+ +..-|-..+..|.
T Consensus        65 ~p~~Df~~~s~~vLG~~wr~as~eQr~~F~~~F~~~Lv~tYa~~l~~y~  113 (211)
T PRK15117         65 LPYVQVKYAGALVLGRYYKDATPAQREAYFAAFREYLKQAYGQALAMYH  113 (211)
T ss_pred             cccCCHHHHHHHHhhhhhhhCCHHHHHHHHHHHHHHHHHHHHHHHHHhC
Confidence            38899999987 78999999999999999998887 5668889998884


No 25 
>TIGR03481 HpnM hopanoid biosynthesis associated membrane protein HpnM. The genomes containing members of this family share the machinery for the biosynthesis of hopanoid lipids. Furthermore, the genes of this family are usually located proximal to other components of this biological process. The proteins are members of the pfam05494 family of putative transporters known as "toluene tolerance protein Ttg2D", although it is unlikely that the members included here have anything to do with toluene per-se.
Probab=84.38  E-value=2.1  Score=38.26  Aligned_cols=46  Identities=13%  Similarity=0.336  Sum_probs=40.6

Q ss_pred             CCCCHHHHHH-HHHHHHcCCChhhHHHHHHHHHH-HHHHHHHHHhhCC
Q psy7578          96 PEMDFSQVSK-KLGELWHTVPFNEKYGWKRQADR-LAAKYTQKMSKAP  141 (280)
Q Consensus        96 P~ls~~eIsK-~Lge~Wk~LseeEK~~Y~~kA~~-~kekY~~ema~~~  141 (280)
                      |..++..|++ .||.-|+.+++++|+.|.+...+ +...|-..+..|.
T Consensus        62 ~~~Df~~mar~vLG~~W~~~s~~Qr~~F~~~F~~~l~~tY~~~l~~y~  109 (198)
T TIGR03481        62 EAFDLPAMARLTLGSSWTSLSPEQRRRFIGAFRELSIATYASQFKSYA  109 (198)
T ss_pred             HhCCHHHHHHHHhhhhhhhCCHHHHHHHHHHHHHHHHHHHHHHHHhhc
Confidence            4789999987 78999999999999999999988 6788988888873


No 26 
>PF05494 Tol_Tol_Ttg2:  Toluene tolerance, Ttg2 ;  InterPro: IPR008869 Toluene tolerance is mediated by increased cell membrane rigidity resulting from changes in fatty acid and phospholipid compositions, exclusion of toluene from the cell membrane, and removal of intracellular toluene by degradation []. Many proteins are involved in these processes. This family is a transporter which shows similarity to ABC transporters [].; PDB: 2QGU_A.
Probab=69.09  E-value=12  Score=31.89  Aligned_cols=46  Identities=17%  Similarity=0.496  Sum_probs=36.4

Q ss_pred             CCCCHHHHHH-HHHHHHcCCChhhHHHHHHHHHH-HHHHHHHHHhhCC
Q psy7578          96 PEMDFSQVSK-KLGELWHTVPFNEKYGWKRQADR-LAAKYTQKMSKAP  141 (280)
Q Consensus        96 P~ls~~eIsK-~Lge~Wk~LseeEK~~Y~~kA~~-~kekY~~ema~~~  141 (280)
                      |.+++..+++ .||.-|+.+++++|+.|.+...+ +...|-..+..|.
T Consensus        36 ~~~D~~~~ar~~LG~~w~~~s~~q~~~F~~~f~~~l~~~Y~~~l~~y~   83 (170)
T PF05494_consen   36 PYFDFERMARRVLGRYWRKASPAQRQRFVEAFKQLLVRTYAKRLDEYS   83 (170)
T ss_dssp             GGB-HHHHHHHHHGGGTTTS-HHHHHHHHHHHHHHHHHHHHHHHHT-S
T ss_pred             HhCCHHHHHHHHHHHhHhhCCHHHHHHHHHHHHHHHHHHHHHHHHhhC
Confidence            7888998886 46889999999999999998887 5667888888874


No 27 
>PF12881 NUT_N:  NUT protein N terminus;  InterPro: IPR024309 This domain is found in the N-terminal region of Nuclear Testis (NUT) proteins. It is also found in FAM22, which are a family of uncharacterised mammalian proteins.
Probab=61.87  E-value=19  Score=34.85  Aligned_cols=55  Identities=11%  Similarity=0.173  Sum_probs=43.3

Q ss_pred             CChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHcCCChhhHHHHHHHHHHHHH
Q psy7578          77 FTAYMLWAKQIRQKLIKSNPEMDFSQVSKKLGELWHTVPFNEKYGWKRQADRLAA  131 (280)
Q Consensus        77 ~sAY~LF~ke~R~kik~e~P~ls~~eIsK~Lge~Wk~LseeEK~~Y~~kA~~~ke  131 (280)
                      ..||..|.--.-..+....|.++..|--.+.-+.|...|.=||..|+++|++..|
T Consensus       229 ~EAlSCFLIpvLrsLar~kPtMtlEeGl~ra~qEW~~~SnfdRmifyemaekFmE  283 (328)
T PF12881_consen  229 AEALSCFLIPVLRSLARLKPTMTLEEGLWRAVQEWQHTSNFDRMIFYEMAEKFME  283 (328)
T ss_pred             hhhhhhhHHHHHHHHHhcCCCccHHHHHHHHHHHhhccccccHHHHHHHHHHHcc
Confidence            3455555555555556667778888888888899999999999999999998755


No 28 
>KOG3223|consensus
Probab=51.05  E-value=21  Score=32.37  Aligned_cols=55  Identities=24%  Similarity=0.418  Sum_probs=44.1

Q ss_pred             CCCCCCCCC-ChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHcCCChhhHHHHHHHH
Q psy7578          69 QESIEKARF-TAYMLWAKQIRQKLIKSNPEMDFSQVSKKLGELWHTVPFNEKYGWKRQA  126 (280)
Q Consensus        69 d~~~PKRP~-sAY~LF~ke~R~kik~e~P~ls~~eIsK~Lge~Wk~LseeEK~~Y~~kA  126 (280)
                      |..||-|.| -||.-|-...-+.++.+||++..++.-.+|-.+|+.-++.   ||.+.+
T Consensus       160 ddrHPEkRmrAA~~afEe~~LPrLK~e~P~lrlsQ~Kqll~Kew~KsPDN---P~Nq~~  215 (221)
T KOG3223|consen  160 DDRHPEKRMRAAFKAFEEARLPRLKKENPGLRLSQYKQLLKKEWQKSPDN---PFNQAA  215 (221)
T ss_pred             cccChHHHHHHHHHHHHHhhchhhhhcCCCccHHHHHHHHHHHHhhCCCC---hhhHHh
Confidence            346775555 5588888888999999999999999999999999887765   555443


No 29 
>PF01352 KRAB:  KRAB box;  InterPro: IPR001909 The Krueppel-associated box (KRAB) is a domain of around 75 amino acids that is found in the N-terminal part of about one third of eukaryotic Krueppel-type C2H2 zinc finger proteins (ZFPs) []. It is enriched in charged amino acids and can be divided into subregions A and B, which are predicted to fold into two amphipathic alpha-helices. The KRAB A and B boxes can be separated by variable spacer segments and many KRAB proteins contain only the A box []. The functions currently known for members of the KRAB-containing protein family include transcriptional repression of RNA polymerase I, II, and III promoters, binding and splicing of RNA, and control of nucleolus function. The KRAB domain functions as a transcriptional repressor when tethered to the template DNA by a DNA-binding domain. A sequence of 45 amino acids in the KRAB A subdomain has been shown to be necessary and sufficient for transcriptional repression. The B box does not repress by itself but does potentiate the repression exerted by the KRAB A subdomain [, ]. Gene silencing requires the binding of the KRAB domain to the RING-B box-coiled coil (RBCC) domain of the KAP-1/TIF1-beta corepressor. As KAP-1 binds to the heterochromatin proteins HP1, it has been proposed that the KRAB-ZFP-bound target gene could be silenced following recruitment to heterochromatin [, ]. KRAB-ZFPs probably constitute the single largest class of transcription factors within the human genome []. Although the function of KRAB-ZFPs is largely unknown, they appear to play important roles during cell differentiation and development. The KRAB domain is generally encoded by two exons. The regions coded by the two exons are known as KRAB-A and KRAB-B.; GO: 0003676 nucleic acid binding, 0006355 regulation of transcription, DNA-dependent, 0005622 intracellular; PDB: 1V65_A.
Probab=37.80  E-value=27  Score=23.61  Aligned_cols=29  Identities=17%  Similarity=0.251  Sum_probs=15.9

Q ss_pred             CHHHHHHHHH-HHHcCCChhhHHHHHHHHH
Q psy7578          99 DFSQVSKKLG-ELWHTVPFNEKYGWKRQAD  127 (280)
Q Consensus        99 s~~eIsK~Lg-e~Wk~LseeEK~~Y~~kA~  127 (280)
                      .|.||+--.+ +.|..|.+.+|.-|.+--.
T Consensus         2 tf~Dvav~fs~eEW~~L~~~Qk~ly~dvm~   31 (41)
T PF01352_consen    2 TFEDVAVYFSQEEWELLDPAQKNLYRDVML   31 (41)
T ss_dssp             -----TT---HHHHHTS-HHHHHHHHHHHH
T ss_pred             eEEEEEEEcChhhcccccceecccchhHHH
Confidence            3555655555 5699999999999887543


No 30 
>PF06945 DUF1289:  Protein of unknown function (DUF1289);  InterPro: IPR010710 This family consists of a number of hypothetical bacterial proteins. The aligned region spans around 56 residues and contains 4 highly conserved cysteine residues towards the N terminus. The function of this family is unknown.
Probab=35.64  E-value=50  Score=23.19  Aligned_cols=25  Identities=8%  Similarity=0.315  Sum_probs=18.2

Q ss_pred             CHHHHHHHHHHHHcCCChhhHHHHHHHHHH
Q psy7578          99 DFSQVSKKLGELWHTVPFNEKYGWKRQADR  128 (280)
Q Consensus        99 s~~eIsK~Lge~Wk~LseeEK~~Y~~kA~~  128 (280)
                      +..||..     |+.|+++||....+....
T Consensus        23 T~dEI~~-----W~~~s~~er~~i~~~l~~   47 (51)
T PF06945_consen   23 TLDEIRD-----WKSMSDDERRAILARLRA   47 (51)
T ss_pred             cHHHHHH-----HhhCCHHHHHHHHHHHHH
Confidence            4566665     999999999876665443


No 31 
>PF12650 DUF3784:  Domain of unknown function (DUF3784);  InterPro: IPR017259 This group represents an uncharacterised conserved protein.
Probab=34.09  E-value=27  Score=27.27  Aligned_cols=15  Identities=20%  Similarity=0.428  Sum_probs=12.8

Q ss_pred             HcCCChhhHHHHHHH
Q psy7578         111 WHTVPFNEKYGWKRQ  125 (280)
Q Consensus       111 Wk~LseeEK~~Y~~k  125 (280)
                      |++||+|||+.|.++
T Consensus        26 yntms~eEk~~~D~~   40 (97)
T PF12650_consen   26 YNTMSKEEKEKYDKK   40 (97)
T ss_pred             cccCCHHHHHHhhHH
Confidence            899999999988653


No 32 
>COG2854 Ttg2D ABC-type transport system involved in resistance to organic solvents, auxiliary component [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=33.27  E-value=66  Score=29.26  Aligned_cols=51  Identities=22%  Similarity=0.330  Sum_probs=41.2

Q ss_pred             CCCCHHHHH-HHHHHHHcCCChhhHHHHHHHHHH-HHHHHHHHHhhCCCCCCC
Q psy7578          96 PEMDFSQVS-KKLGELWHTVPFNEKYGWKRQADR-LAAKYTQKMSKAPAQKTK  146 (280)
Q Consensus        96 P~ls~~eIs-K~Lge~Wk~LseeEK~~Y~~kA~~-~kekY~~ema~~~~~p~K  146 (280)
                      |-+++.-++ ..||.-|++.++++++.|.+.... +...|-..+..|..++-+
T Consensus        68 p~vd~~~~a~~vLGk~~k~aspeQ~~~F~~aF~~yl~q~Y~~aL~~Y~~q~~~  120 (202)
T COG2854          68 PYVDFKYAAKLVLGKYYKTASPEQRQAFFKAFRTYLEQTYGQALLDYKGQTLK  120 (202)
T ss_pred             hhhcHHHHHHHHhccccccCCHHHHHHHHHHHHHHHHHHHHHHHHHccCCCce
Confidence            777776654 568999999999999999988877 567899999999655544


No 33 
>PF13875 DUF4202:  Domain of unknown function (DUF4202)
Probab=32.16  E-value=95  Score=27.91  Aligned_cols=40  Identities=10%  Similarity=0.211  Sum_probs=34.0

Q ss_pred             ChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHcCCChhhHH
Q psy7578          78 TAYMLWAKQIRQKLIKSNPEMDFSQVSKKLGELWHTVPFNEKY  120 (280)
Q Consensus        78 sAY~LF~ke~R~kik~e~P~ls~~eIsK~Lge~Wk~LseeEK~  120 (280)
                      -+-++|...+-..+...|   +-..+-.++..-|+.||+.-++
T Consensus       130 vacLVFL~~~f~~F~~~~---deeK~v~Il~KTw~KMS~~g~~  169 (185)
T PF13875_consen  130 VACLVFLEYYFEDFAAKH---DEEKIVDILRKTWRKMSERGHE  169 (185)
T ss_pred             hHHHHhHHHHHHHHHhcC---CHHHHHHHHHHHHHHCCHHHHH
Confidence            458899999999998887   5667888999999999998764


No 34 
>PRK09706 transcriptional repressor DicA; Reviewed
Probab=31.64  E-value=82  Score=25.81  Aligned_cols=38  Identities=16%  Similarity=0.206  Sum_probs=33.9

Q ss_pred             HHHHHHHHHcCCChhhHHHHHHHHHHHHHHHHHHHhhC
Q psy7578         103 VSKKLGELWHTVPFNEKYGWKRQADRLAAKYTQKMSKA  140 (280)
Q Consensus       103 IsK~Lge~Wk~LseeEK~~Y~~kA~~~kekY~~ema~~  140 (280)
                      -.+.+-+.|+.|++++++...+..+...+.|++.++++
T Consensus        88 ~~~~ll~~~~~L~~~~~~~~l~~l~~~~~~~~~~~~~~  125 (135)
T PRK09706         88 DQKELLELFDALPESEQDAQLSEMRARVENFNKLFEEL  125 (135)
T ss_pred             HHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            35788899999999999999999999989998888876


No 35 
>PRK12751 cpxP periplasmic stress adaptor protein CpxP; Reviewed
Probab=29.60  E-value=1.1e+02  Score=26.78  Aligned_cols=33  Identities=12%  Similarity=0.207  Sum_probs=26.1

Q ss_pred             HHHHHHHHHcCCChhhHHHHHHHHHHHHHHHHH
Q psy7578         103 VSKKLGELWHTVPFNEKYGWKRQADRLAAKYTQ  135 (280)
Q Consensus       103 IsK~Lge~Wk~LseeEK~~Y~~kA~~~kekY~~  135 (280)
                      ..+..-++++-|++|+|..|.+..++..++..+
T Consensus       119 ~~~~~~qmy~lLTPEQra~l~~~~e~r~~~~~~  151 (162)
T PRK12751        119 MAKVRNQMYNLLTPEQKEALNKKHQERIEKLQQ  151 (162)
T ss_pred             HHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHHh
Confidence            346667889999999999999988876666544


No 36 
>PRK10236 hypothetical protein; Provisional
Probab=28.92  E-value=54  Score=30.56  Aligned_cols=22  Identities=14%  Similarity=0.286  Sum_probs=18.6

Q ss_pred             HHHHHHHHHcCCChhhHHHHHH
Q psy7578         103 VSKKLGELWHTVPFNEKYGWKR  124 (280)
Q Consensus       103 IsK~Lge~Wk~LseeEK~~Y~~  124 (280)
                      +.|.+...|+.||++|++.+.+
T Consensus       118 l~kll~~a~~kms~eE~~~L~~  139 (237)
T PRK10236        118 LEQFLRNTWKKMDEEHKQEFLH  139 (237)
T ss_pred             HHHHHHHHHHHCCHHHHHHHHH
Confidence            4789999999999999976554


No 37 
>PF11304 DUF3106:  Protein of unknown function (DUF3106);  InterPro: IPR021455  Some members in this family of proteins are annotated as transmembrane proteins however this cannot be confirmed. Currently no function is known. 
Probab=28.07  E-value=2e+02  Score=23.23  Aligned_cols=37  Identities=11%  Similarity=0.183  Sum_probs=19.5

Q ss_pred             HHHHhCCCCCHHHHHHHHH--HHHcCCChhhHHHHHHHH
Q psy7578          90 KLIKSNPEMDFSQVSKKLG--ELWHTVPFNEKYGWKRQA  126 (280)
Q Consensus        90 kik~e~P~ls~~eIsK~Lg--e~Wk~LseeEK~~Y~~kA  126 (280)
                      .+...++.++..+-.+...  ..|.+|++++|..-.+.+
T Consensus        33 ~~a~r~~~mspeqq~r~~~rm~~W~~LspeqR~~~R~~~   71 (107)
T PF11304_consen   33 QIAERWPSMSPEQQQRLRERMRRWAALSPEQRQQARENY   71 (107)
T ss_pred             HHHHHHhcCCHHHHHHHHHHHHHHHhCCHHHHHHHHHHH
Confidence            3444556666554433322  367777777665444433


No 38 
>PF11304 DUF3106:  Protein of unknown function (DUF3106);  InterPro: IPR021455  Some members in this family of proteins are annotated as transmembrane proteins however this cannot be confirmed. Currently no function is known. 
Probab=27.99  E-value=94  Score=25.10  Aligned_cols=26  Identities=27%  Similarity=0.607  Sum_probs=16.8

Q ss_pred             HHHHHHHHHHcCCChhhHHHHHHHHH
Q psy7578         102 QVSKKLGELWHTVPFNEKYGWKRQAD  127 (280)
Q Consensus       102 eIsK~Lge~Wk~LseeEK~~Y~~kA~  127 (280)
                      ++-.-+.+.|+.|+++.|..|...|.
T Consensus        11 ~~L~pl~~~W~~l~~~qr~k~l~~a~   36 (107)
T PF11304_consen   11 QALAPLAERWNSLPPEQRRKWLQIAE   36 (107)
T ss_pred             HHHHHHHHHHhcCCHHHHHHHHHHHH
Confidence            34556666777777777776666654


No 39 
>cd07081 ALDH_F20_ACDH_EutE-like Coenzyme A acylating aldehyde dehydrogenase (ACDH), Ethanolamine utilization protein EutE, and related proteins. Coenzyme A acylating aldehyde dehydrogenase (ACDH), an NAD+ and CoA-dependent acetaldehyde dehydrogenase, acetylating (EC=1.2.1.10), functions as a single enzyme (such as the Ethanolamine utilization protein, EutE, in Salmonella typhimurium) or as part of a multifunctional enzyme to convert acetaldehyde into acetyl-CoA. The E. coli aldehyde-alcohol dehydrogenase includes the functional domains, alcohol dehydrogenase (ADH), ACDH, and pyruvate-formate-lyase deactivase; and the Entamoeba histolytica aldehyde-alcohol dehydrogenase 2 (ALDH20A1) includes the functional domains ADH and ACDH, and may be critical enzymes in the fermentative pathway.
Probab=27.11  E-value=1.2e+02  Score=30.30  Aligned_cols=39  Identities=13%  Similarity=-0.100  Sum_probs=33.0

Q ss_pred             HHHHHHHHHHcCCChhhHHHHHHHHHHHHHHHHHHHhhC
Q psy7578         102 QVSKKLGELWHTVPFNEKYGWKRQADRLAAKYTQKMSKA  140 (280)
Q Consensus       102 eIsK~Lge~Wk~LseeEK~~Y~~kA~~~kekY~~ema~~  140 (280)
                      +.++..-+.|+.++.++|..+..++.+..+++..++++.
T Consensus         6 ~~A~~A~~~W~~~~~~~R~~iL~~~a~~l~~~~~ela~~   44 (439)
T cd07081           6 AAAKVAQQGLSCKSQEMVDLIFRAAAEAAEDARIDLAKL   44 (439)
T ss_pred             HHHHHHHHHHhhCCHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            345666678999999999999999999999999888765


No 40 
>KOG1827|consensus
Probab=26.86  E-value=5.1  Score=41.97  Aligned_cols=44  Identities=27%  Similarity=0.416  Sum_probs=40.7

Q ss_pred             CCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHcCCChhhH
Q psy7578          76 RFTAYMLWAKQIRQKLIKSNPEMDFSQVSKKLGELWHTVPFNEK  119 (280)
Q Consensus        76 P~sAY~LF~ke~R~kik~e~P~ls~~eIsK~Lge~Wk~LseeEK  119 (280)
                      -+++|+.|..+.|..+-.++|+..+++++.+.|..|+.++..-|
T Consensus       552 ~~~~~~~~s~~~~~~~~~~np~v~~~~~~~~vg~~~~~lp~~~k  595 (629)
T KOG1827|consen  552 SPEPYILDSIENRTIIWFENPTVGFGEVSIIVGNDWDKLPNINK  595 (629)
T ss_pred             CCccccccccccCceeeeeCCCcccceeEEeecCCcccCccccc
Confidence            57889999999999999999999999999999999999995554


No 41 
>smart00500 SFM Splicing Factor Motif, present in Prp18 and Pr04.
Probab=25.96  E-value=27  Score=24.21  Aligned_cols=23  Identities=30%  Similarity=0.512  Sum_probs=19.5

Q ss_pred             ChhhHHHHHHHhhhhHHHhhhhh
Q psy7578         199 QPLDIAAHLRLLGDNLTIIGERL  221 (280)
Q Consensus       199 ~~id~aahl~ll~~~l~~~g~~~  221 (280)
                      .-.||-+-|+.|||..+++||.-
T Consensus         2 ~d~eV~~~LR~lgePi~lFGE~~   24 (44)
T smart00500        2 PDSEVIRRLRELGEPITLFGEDD   24 (44)
T ss_pred             CHHHHHHHHHHcCCCeeecCCCh
Confidence            34688899999999999999853


No 42 
>PF12356 DUF3643:  Protein of unknown function (DUF3643) ;  InterPro: IPR022103  This family of proteins is found in eukaryotes. Proteins in this family are typically between 217 and 4852 amino acids in length. There is a conserved TLA sequence motif. 
Probab=25.70  E-value=35  Score=30.10  Aligned_cols=14  Identities=29%  Similarity=0.646  Sum_probs=11.1

Q ss_pred             chhhhhHHHHHhhh
Q psy7578         233 GMSLLLDSFLCALG  246 (280)
Q Consensus       233 ~~s~lld~~~c~~~  246 (280)
                      .|=-=+||+|||||
T Consensus        76 ~LKkaiDSlLCS~C   89 (165)
T PF12356_consen   76 PLKKAIDSLLCSMC   89 (165)
T ss_pred             HHHHHHHHHHHHHH
Confidence            44555899999997


No 43 
>PF08367 M16C_assoc:  Peptidase M16C associated;  InterPro: IPR013578 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This domain appears in eukaryotes as well as bacteria and tends to be found near the C terminus of metalloproteases and related sequences belonging to MEROPS peptidase family M16 (subfamily M16C, clan ME). These include: eupitrilysin, falcilysin, PreP peptidase, CYM1 peptidase and subfamily M16C non-peptidase homologues.; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis; PDB: 2FGE_B 3S5I_A 3S5H_A 3S5M_A 3S5K_A.
Probab=23.59  E-value=1.4e+02  Score=27.08  Aligned_cols=32  Identities=19%  Similarity=0.163  Sum_probs=26.8

Q ss_pred             HHHHHHHHHHHcCCChhhHHHHHHHHHHHHHH
Q psy7578         101 SQVSKKLGELWHTVPFNEKYGWKRQADRLAAK  132 (280)
Q Consensus       101 ~eIsK~Lge~Wk~LseeEK~~Y~~kA~~~kek  132 (280)
                      .+..+.|.+.+.+|+++|++...+.++++++.
T Consensus        13 ~~e~~~L~~~k~~Ls~~e~~~i~~~~~~L~~~   44 (248)
T PF08367_consen   13 EEEKEKLAAYKASLSEEEKEKIIEQTKELKER   44 (248)
T ss_dssp             HHHHHHHHHHHHCS-HHHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHhhCCHHHHHHHHHHHHHHHHH
Confidence            45778999999999999999999999987554


No 44 
>cd07749 NT_Pol-beta-like_1 Nucleotidyltransferase (NT) domain of an uncharacterized subgroup of the Pol beta-like NT superfamily. The Pol beta-like NT superfamily includes DNA polymerase beta and other family X DNA Polymerases, as well as Class I and Class II CCA-adding enzymes, RelA- and SpoT-like ppGpp synthetases and hydrolases, 2'5'-oligoadenylate (2-5A)synthetases, Escherichia coli adenylyltransferase (GlnE), Escherichia coli uridylyl transferase (GlnD), poly(A) polymerases, terminal uridylyl transferases, Staphylococcus aureus kanamycin nucleotidyltransferase, and similar proteins. Proteins belonging to this subgroup are uncharacterized. In the majority of the Pol beta-like superfamily NTs, two carboxylates, Dx[D/E], together with a third more distal carboxylate, coordinate two divalent metal cations essential for catalysis. These divalent metal ions are involved in a two-metal ion mechanism of nucleotide addition. These carboxylate residues are conserved in this subgroup.
Probab=22.44  E-value=61  Score=28.33  Aligned_cols=29  Identities=38%  Similarity=0.476  Sum_probs=24.1

Q ss_pred             HHHHhhhhHHHhhhhhcccCCeeeeccchhhhh
Q psy7578         206 HLRLLGDNLTIIGERLKDTQGRMAISGGMSLLL  238 (280)
Q Consensus       206 hl~ll~~~l~~~g~~~~~~~~~~~~~~~~s~ll  238 (280)
                      |++.    |+-|+++|++.+..-+|.||+++-|
T Consensus         2 ~~~~----L~~i~~~ln~~~i~W~lgGS~~L~l   30 (156)
T cd07749           2 YLRA----LRKFYERLKNINVNWALTGSLSFAL   30 (156)
T ss_pred             hHHH----HHHHHHHhccCCceEEehhhHHHHH
Confidence            5555    5789999999999999999987544


No 45 
>PF05388 Carbpep_Y_N:  Carboxypeptidase Y pro-peptide;  InterPro: IPR008442 This signature is found at the N terminus of carboxypeptidase Y, which belong to MEROPS peptidase family S10. This region contains the signal peptide and pro-peptide regions [,].; GO: 0004185 serine-type carboxypeptidase activity, 0005773 vacuole
Probab=22.44  E-value=1.4e+02  Score=24.66  Aligned_cols=29  Identities=21%  Similarity=0.243  Sum_probs=25.4

Q ss_pred             HHHHHHHHHHHHcCCChhhHHHHHHHHHH
Q psy7578         100 FSQVSKKLGELWHTVPFNEKYGWKRQADR  128 (280)
Q Consensus       100 ~~eIsK~Lge~Wk~LseeEK~~Y~~kA~~  128 (280)
                      +..+++.++|.++.|+.|-|+.|.+....
T Consensus        45 ~~~~~~~l~e~l~~Lt~e~k~~W~E~~~~   73 (113)
T PF05388_consen   45 LEKISKYLNEPLKSLTSEAKALWDEMMLL   73 (113)
T ss_pred             HHHHHHHHHHHHhhccHHHHHHHHHHHHH
Confidence            55688889999999999999999997664


No 46 
>KOG1610|consensus
Probab=22.32  E-value=2.2e+02  Score=27.81  Aligned_cols=49  Identities=16%  Similarity=0.332  Sum_probs=37.2

Q ss_pred             HHHHHHHHHHHHhC-------CC------CCHHHHHHHHHHHHcCCChhhHHHHHHHHHHHH
Q psy7578          82 LWAKQIRQKLIKSN-------PE------MDFSQVSKKLGELWHTVPFNEKYGWKRQADRLA  130 (280)
Q Consensus        82 LF~ke~R~kik~e~-------P~------ls~~eIsK~Lge~Wk~LseeEK~~Y~~kA~~~k  130 (280)
                      .|+...|.++..-.       |+      .....+.+.+.+.|..++++.|+.|-+.+....
T Consensus       187 af~D~lR~EL~~fGV~VsiiePG~f~T~l~~~~~~~~~~~~~w~~l~~e~k~~YGedy~~~~  248 (322)
T KOG1610|consen  187 AFSDSLRRELRPFGVKVSIIEPGFFKTNLANPEKLEKRMKEIWERLPQETKDEYGEDYFEDY  248 (322)
T ss_pred             HHHHHHHHHHHhcCcEEEEeccCccccccCChHHHHHHHHHHHhcCCHHHHHHHHHHHHHHH
Confidence            47788888876421       33      224778999999999999999999988776553


No 47 
>PRK12750 cpxP periplasmic repressor CpxP; Reviewed
Probab=20.78  E-value=2.3e+02  Score=24.84  Aligned_cols=35  Identities=11%  Similarity=0.127  Sum_probs=28.7

Q ss_pred             HHHHHHHHcCCChhhHHHHHHHHHHHHHHHHHHHh
Q psy7578         104 SKKLGELWHTVPFNEKYGWKRQADRLAAKYTQKMS  138 (280)
Q Consensus       104 sK~Lge~Wk~LseeEK~~Y~~kA~~~kekY~~ema  138 (280)
                      .+..-+.+.-|++++|..|.+.-++..++|.+.+.
T Consensus       127 ~~~~~~~~~vLTpEQRak~~e~~~~r~~~~~~~~~  161 (170)
T PRK12750        127 LEKRHQMLSILTPEQKAKFQELQQERMQECQDKMH  161 (170)
T ss_pred             HHHHHHHHHhCCHHHHHHHHHHHHHHHHHHHHHHH
Confidence            34455689999999999999998888888887774


No 48 
>cd07133 ALDH_CALDH_CalB Coniferyl aldehyde dehydrogenase-like. Coniferyl aldehyde dehydrogenase (CALDH, EC=1.2.1.68) of Pseudomonas sp. strain HR199 (CalB) which catalyzes the NAD+-dependent oxidation of coniferyl aldehyde to ferulic acid, and similar sequences, are present in this CD.
Probab=20.57  E-value=1.8e+02  Score=28.71  Aligned_cols=39  Identities=10%  Similarity=-0.008  Sum_probs=31.8

Q ss_pred             HHHHHHHHHHcCCChhhHHHHHHHHHHHHHHHHHHHhhC
Q psy7578         102 QVSKKLGELWHTVPFNEKYGWKRQADRLAAKYTQKMSKA  140 (280)
Q Consensus       102 eIsK~Lge~Wk~LseeEK~~Y~~kA~~~kekY~~ema~~  140 (280)
                      +.++..-..|+.++..+|..+.+++.+..+++..++.+.
T Consensus         5 ~~a~~a~~~w~~~~~~~R~~~L~~~a~~l~~~~~el~~~   43 (434)
T cd07133           5 ERQKAAFLANPPPSLEERRDRLDRLKALLLDNQDALAEA   43 (434)
T ss_pred             HHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            345566677999999999999999888888888888764


No 49 
>cd07132 ALDH_F3AB Aldehyde dehydrogenase family 3 members A1, A2, and B1 and related proteins. NAD(P)+-dependent, aldehyde dehydrogenase, family 3 members A1 and B1  (ALDH3A1, ALDH3B1,  EC=1.2.1.5) and fatty aldehyde dehydrogenase, family 3 member A2 (ALDH3A2, EC=1.2.1.3), and similar sequences are included in this CD. Human ALDH3A1 is a homodimer with a critical role in cellular defense against oxidative stress; it catalyzes the oxidation of various cellular membrane lipid-derived aldehydes. Corneal crystalline ALDH3A1 protects the cornea and underlying lens against UV-induced oxidative stress. Human ALDH3A2, a microsomal homodimer, catalyzes the oxidation of long-chain aliphatic aldehydes to fatty acids. Human ALDH3B1 is highly expressed in the kidney and liver and catalyzes the oxidation of various medium- and long-chain saturated and unsaturated aliphatic aldehydes.
Probab=20.42  E-value=1.8e+02  Score=28.78  Aligned_cols=39  Identities=10%  Similarity=0.039  Sum_probs=32.2

Q ss_pred             HHHHHHHHHHcCCChhhHHHHHHHHHHHHHHHHHHHhhC
Q psy7578         102 QVSKKLGELWHTVPFNEKYGWKRQADRLAAKYTQKMSKA  140 (280)
Q Consensus       102 eIsK~Lge~Wk~LseeEK~~Y~~kA~~~kekY~~ema~~  140 (280)
                      +.++..-..|+.++..+|..+.+++....+++.+++.+.
T Consensus         5 ~~A~~A~~~w~~~~~~~R~~~L~~~a~~l~~~~~~l~~~   43 (443)
T cd07132           5 RRAREAFSSGKTRPLEFRIQQLEALLRMLEENEDEIVEA   43 (443)
T ss_pred             HHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHhHHHHHHH
Confidence            345666678999999999999999988888888887764


No 50 
>cd07661 BAR_ICA69 The Bin/Amphiphysin/Rvs (BAR) domain of Islet Cell Autoantigen 69-kDa. The BAR domain of Arfaptin-like proteins, also called the Arfaptin domain, is a dimerization and lipid binding module that can detect and drive membrane curvature. Islet cell autoantigen 69-kDa (ICA69) is a diabetes-associated autoantigen that is highly expressed in brain and beta cells. It is involved in membrane trafficking at the Golgi complex in neurosecretory cells. It is coexpressed with Protein Interacting with C Kinase 1 (PICK1), also a the BAR domain containing protein, in many tissues at different developmental stages. In neurons, ICA69 colocalizes with PICK1 in cell bodies and dendrites but is absent in synapses where PICK1 is enriched. ICA69 contains an N-terminal BAR domain and a conserved C-terminal domain of unknown function. BAR domains form dimers that bind to membranes, induce membrane bending and curvature, and may also be involved in protein-protein interactions. ICA69 associate
Probab=20.35  E-value=54  Score=29.87  Aligned_cols=65  Identities=28%  Similarity=0.315  Sum_probs=48.2

Q ss_pred             HHhhhhHHHhhhhhcccCCee--eeccchhhhhHHHHHhhhhHHHhhhcCCCccCCChhHHHHhhhhhhh
Q psy7578         208 RLLGDNLTIIGERLKDTQGRM--AISGGMSLLLDSFLCALGPLLCLTQQIPEENGCSPETLSHVLDNIAY  275 (280)
Q Consensus       208 ~ll~~~l~~~g~~~~~~~~~~--~~~~~~s~lld~~~c~~~pl~~~~~~~~~~~~~~~~~~~~~l~n~~~  275 (280)
                      .=||+.|.-.|.+=++..|.+  ++.-+|+-==--+|++-+||..+.+.|--|.   -.+++|||.+|.-
T Consensus        40 ~~LG~fl~e~~~~d~t~ag~~m~~t~KaL~~sg~qrl~~r~pl~~f~~~v~Tf~---~rai~Dtl~Ti~~  106 (204)
T cd07661          40 NVLGKFLKEQGKIDKTTAGKMMAATGKALSFSSQQRLALRVPLLRLYQEVETFR---ERAIADTLQTIQR  106 (204)
T ss_pred             HHHHHHHHHHhccChhhhccHHHHHHHHHHHhHHHHHHHHhhHHHHHHHHHHHH---HHHHHHHHHHHHH
Confidence            457888888887777666533  4555666666778999999999998876664   4678899988853


No 51 
>PRK10363 cpxP periplasmic repressor CpxP; Reviewed
Probab=20.31  E-value=2.2e+02  Score=25.22  Aligned_cols=37  Identities=3%  Similarity=0.148  Sum_probs=30.0

Q ss_pred             HHHHHHHHHHcCCChhhHHHHHHHHHHHHHHHHHHHhh
Q psy7578         102 QVSKKLGELWHTVPFNEKYGWKRQADRLAAKYTQKMSK  139 (280)
Q Consensus       102 eIsK~Lge~Wk~LseeEK~~Y~~kA~~~kekY~~ema~  139 (280)
                      +..|+--++++-|++|+|..|++..++...++++ ++.
T Consensus       112 em~k~~nqmy~lLTPEQKaq~~~~~~~rm~~~~~-~~~  148 (166)
T PRK10363        112 EMAKVRNQMYRLLTPEQQAVLNEKHQQRMEQLRD-VTQ  148 (166)
T ss_pred             HHHHHHHHHHHhCCHHHHHHHHHHHHHHHHHHHH-HHh
Confidence            4456677899999999999999998888888765 544


No 52 
>PRK14830 undecaprenyl pyrophosphate synthase; Provisional
Probab=20.21  E-value=3.8e+02  Score=24.98  Aligned_cols=58  Identities=16%  Similarity=0.094  Sum_probs=50.6

Q ss_pred             hhhHHHHHHHhhhhHHHhhhhhcccCCeeeeccchhhhhHHHHHhhhhHHHhhhcCCC
Q psy7578         200 PLDIAAHLRLLGDNLTIIGERLKDTQGRMAISGGMSLLLDSFLCALGPLLCLTQQIPE  257 (280)
Q Consensus       200 ~id~aahl~ll~~~l~~~g~~~~~~~~~~~~~~~~s~lld~~~c~~~pl~~~~~~~~~  257 (280)
                      +-+|.+++.|+-+.|.-...++.+.+-.|.|.|-++.|=|.+..++--+--.|..=..
T Consensus        84 ~~Ev~~Lm~l~~~~l~~~~~~~~~~~iri~viG~~~~Lp~~~~~~~~~~e~~T~~~~~  141 (251)
T PRK14830         84 KDEVKFLMNLPVEFLDKFVPELIENNVKVNVIGDTDRLPEHTLRALEKAIEKTKNNTG  141 (251)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHcCCEEEEEcChhhCCHHHHHHHHHHHHHccCCCc
Confidence            5688999999999998888889999999999999999999999998887777766444


Done!