Query         psy1937
Match_columns 307
No_of_seqs    298 out of 1165
Neff          6.2 
Searched_HMMs 46136
Date          Fri Aug 16 20:05:50 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy1937.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/1937hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG4715|consensus              100.0 2.5E-69 5.5E-74  498.2  18.3  215   79-302    56-279 (410)
  2 PTZ00199 high mobility group p  99.8 1.1E-19 2.4E-24  144.6   9.0   73   74-146     9-83  (94)
  3 cd01389 MATA_HMG-box MATA_HMG-  99.7 1.5E-17 3.2E-22  127.2   7.6   71   87-157     1-71  (77)
  4 KOG4715|consensus               99.7 8.2E-18 1.8E-22  156.7   2.5   58    1-58    181-238 (410)
  5 cd01388 SOX-TCF_HMG-box SOX-TC  99.7 1.3E-16 2.9E-21  120.5   7.4   68   88-155     2-69  (72)
  6 PF00505 HMG_box:  HMG (high mo  99.6 4.2E-16 9.2E-21  115.6   6.9   68   88-155     1-68  (69)
  7 cd01390 HMGB-UBF_HMG-box HMGB-  99.6 5.4E-16 1.2E-20  114.0   6.8   59   88-146     1-59  (66)
  8 smart00398 HMG high mobility g  99.6 1.3E-15 2.8E-20  112.6   7.8   69   87-155     1-69  (70)
  9 COG5648 NHP6B Chromatin-associ  99.6 7.7E-16 1.7E-20  137.0   6.2   83   77-159    60-142 (211)
 10 KOG0527|consensus               99.6 2.3E-15 5.1E-20  143.9   7.0   78   81-158    56-133 (331)
 11 cd00084 HMG-box High Mobility   99.5 1.3E-14 2.8E-19  106.1   6.8   59   88-146     1-59  (66)
 12 PF09011 HMG_box_2:  HMG-box do  99.5 1.1E-14 2.5E-19  110.2   6.5   62   85-146     1-63  (73)
 13 KOG3248|consensus               99.5 1.7E-14 3.7E-19  135.7   7.5   73   87-159   191-269 (421)
 14 KOG0381|consensus               99.5 1.3E-13 2.9E-18  109.0   8.3   63   84-146    17-81  (96)
 15 KOG0526|consensus               99.3 2.9E-12 6.2E-17  127.0   5.5   76   77-156   525-600 (615)
 16 KOG0528|consensus               98.9 7.7E-10 1.7E-14  108.9   4.1   73   84-156   322-394 (511)
 17 KOG2746|consensus               98.3 2.8E-07 6.1E-12   94.1   3.6   66   81-146   175-242 (683)
 18 PF08549 SWI-SNF_Ssr4:  Fungal   96.7  0.0036 7.9E-08   64.9   7.2   85  206-293   281-406 (669)
 19 PF04690 YABBY:  YABBY protein;  96.5  0.0038 8.3E-08   54.9   4.5   49   82-130   116-164 (170)
 20 PF06382 DUF1074:  Protein of u  95.9  0.0095 2.1E-07   52.5   4.0   49   92-144    83-131 (183)
 21 COG5648 NHP6B Chromatin-associ  95.9  0.0062 1.3E-07   55.0   2.9   62   85-146   141-202 (211)
 22 PF14887 HMG_box_5:  HMG (high   95.8   0.031 6.7E-07   43.0   5.8   69   87-158     3-73  (85)
 23 PF08073 CHDNT:  CHDNT (NUC034)  93.6   0.076 1.6E-06   38.3   3.0   39   92-130    13-51  (55)
 24 PF06320 GCN5L1:  GCN5-like pro  90.5     1.8 3.9E-05   36.1   8.1   59  239-297    28-89  (121)
 25 PF04769 MAT_Alpha1:  Mating-ty  90.0    0.39 8.6E-06   43.5   4.1   54   83-142    39-92  (201)
 26 PF06244 DUF1014:  Protein of u  84.8    0.93   2E-05   37.9   3.0   47   86-132    70-117 (122)
 27 PF11068 YlqD:  YlqD protein;    56.7      74  0.0016   26.9   7.9   61  243-303    26-94  (131)
 28 PF11629 Mst1_SARAH:  C termina  55.7      57  0.0012   23.0   5.8   35  247-284    11-45  (49)
 29 TIGR03481 HpnM hopanoid biosyn  54.8      14 0.00031   33.1   3.5   39  111-150    62-101 (198)
 30 PRK15117 ABC transporter perip  53.2      18 0.00038   32.8   3.9   55  103-158    57-115 (211)
 31 PF05494 Tol_Tol_Ttg2:  Toluene  52.6      15 0.00032   31.6   3.2   47  110-157    35-84  (170)
 32 COG4238 Murein lipoprotein [Ce  51.0 1.1E+02  0.0024   23.5   7.7   57  240-296    21-77  (78)
 33 PF04728 LPP:  Lipoprotein leuc  49.2   1E+02  0.0022   22.4   7.8   42  243-284     2-43  (56)
 34 COG1579 Zn-ribbon protein, pos  48.9   1E+02  0.0023   28.7   8.3   53  243-295   102-154 (239)
 35 KOG3223|consensus               46.9      16 0.00035   33.0   2.5   53   86-141   162-215 (221)
 36 PF02370 M:  M protein repeat;   42.3      39 0.00085   19.7   2.8   17  260-276     3-19  (21)
 37 PF02172 KIX:  KIX domain;  Int  40.3      80  0.0017   24.5   5.2   44  216-270    11-57  (81)
 38 PF12958 DUF3847:  Protein of u  39.4     8.2 0.00018   30.4  -0.4   24   14-37     39-63  (86)
 39 PF12210 Hrs_helical:  Hepatocy  38.5      78  0.0017   25.4   4.9   56  228-283    28-95  (96)
 40 PF07200 Mod_r:  Modifier of ru  33.8 2.1E+02  0.0044   24.0   7.3   70  218-289    12-93  (150)
 41 PF07334 IFP_35_N:  Interferon-  32.8      98  0.0021   23.8   4.5   30  248-277     4-33  (76)
 42 PRK09039 hypothetical protein;  32.7 1.9E+02  0.0041   28.2   7.7   54  237-290   128-183 (343)
 43 PF00038 Filament:  Intermediat  31.7 2.8E+02   0.006   25.9   8.6   53  243-295   222-274 (312)
 44 cd07598 BAR_FAM92 The Bin/Amph  27.4 2.7E+02   0.006   25.2   7.3   50  243-296     3-52  (211)
 45 PF12718 Tropomyosin_1:  Tropom  27.3 2.9E+02  0.0063   23.4   7.1   46  243-288    27-72  (143)
 46 PF07352 Phage_Mu_Gam:  Bacteri  26.6 3.2E+02   0.007   23.0   7.3   53  244-299    10-62  (149)
 47 COG3937 Uncharacterized conser  26.6 1.1E+02  0.0025   25.0   4.1   37  225-267    70-106 (108)
 48 COG3105 Uncharacterized protei  26.4 1.9E+02  0.0041   24.6   5.5   28  258-285    34-61  (138)
 49 PF10211 Ax_dynein_light:  Axon  24.5 4.6E+02    0.01   23.2   8.1   34  245-278   128-161 (189)
 50 PRK06800 fliH flagellar assemb  24.4 3.3E+02  0.0072   24.7   7.0   42  250-291    37-81  (228)
 51 PF03915 AIP3:  Actin interacti  24.4 2.2E+02  0.0049   28.8   6.7   59  236-297   231-289 (424)
 52 PF11304 DUF3106:  Protein of u  24.0 2.1E+02  0.0045   23.1   5.3   39  105-143    33-73  (107)
 53 PRK15335 type III secretion sy  23.2 1.9E+02  0.0041   24.6   4.9   45  245-289    74-121 (147)
 54 PRK10361 DNA recombination pro  23.1   4E+02  0.0086   27.4   8.2   52  245-296    68-119 (475)
 55 PF09006 Surfac_D-trimer:  Lung  23.0 2.3E+02  0.0049   19.8   4.5   23  247-269     2-24  (46)
 56 KOG0241|consensus               22.5   1E+02  0.0022   34.6   3.9   90  202-295   333-426 (1714)
 57 PF09340 NuA4:  Histone acetylt  22.0 2.1E+02  0.0046   21.9   4.7   32  252-283     3-34  (80)
 58 PRK10328 DNA binding protein,   21.4 2.6E+02  0.0056   23.7   5.6   55  225-281     1-58  (134)
 59 PF04340 DUF484:  Protein of un  20.4 2.2E+02  0.0047   25.7   5.3   63  207-276     5-72  (225)

No 1  
>KOG4715|consensus
Probab=100.00  E-value=2.5e-69  Score=498.17  Aligned_cols=215  Identities=62%  Similarity=0.921  Sum_probs=206.7

Q ss_pred             CCCCCCCCCCCCCccchhchhHHHHHHHHHHcCCCCHHHHHHHHHhhccCCChHHhHHHHHHHHHHHH-----HHHh---
Q psy1937          79 GRGPKAPKPPEKPLMPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKN-----LKAS---  150 (307)
Q Consensus        79 ~k~~kdp~~PKRPlsAy~lf~~e~r~~v~~e~P~~~~~eisk~lg~~Wk~Ls~eeK~~Y~~~ae~~K~-----~k~y---  150 (307)
                      ...++.|++|.+|+.+||+|++.+|++|++.||++.+|||+|+||.||..|+++||+.|...|+.+|.     |++|   
T Consensus        56 ~t~pkpPkppekpl~pymrySrkvWd~VkA~nPe~kLWeiGK~Ig~mW~dLpd~EK~ey~~EYeaEKieY~~smkayh~s  135 (410)
T KOG4715|consen   56 ITRPKPPKPPEKPLMPYMRYSRKVWDQVKASNPELKLWEIGKIIGGMWLDLPDEEKQEYLNEYEAEKIEYNESMKAYHNS  135 (410)
T ss_pred             ccCCCCCCCCCcccchhhHHhhhhhhhhhccCcchHHHHHHHHHHHHHhhCcchHHHHHHHHHHHHHHHHHHHHHHhhCC
Confidence            45678899999999999999999999999999999999999999999999999999999999999998     8999   


Q ss_pred             -hhhhccCccchhhhhhccChhhhhhhhhcCCCCccccccCCccccCCCCcccccCCCCccccccccccccchhhHHhhh
Q psy1937         151 -TKLASISRPTWRAKAKASKPKQLRKSREARPKDSGEKMQIGRIDIQPAEDEDEQEDGYSVKHVAYARYLRNHRLINEIF  229 (307)
Q Consensus       151 -~y~~y~~~k~~~~ka~~~~~~q~~~~~e~~~~~~~~~~~~~~~~~~p~ededd~ddg~S~k~~a~~Rf~rNH~l~~Eif  229 (307)
                       +|.+||++| ++++++.        +.++++..++...|+++|+|||++|+||+|||||+||+|.+||+||||||+|||
T Consensus       136 p~y~ayinaK-sra~a~l--------e~~sr~~~sr~~~ge~~~~IQPaeDeDD~ddg~stkhla~arf~rN~rLIsei~  206 (410)
T KOG4715|consen  136 PAYLAYINAK-SRAEAAL--------EEESRQRQSRMEKGEPYMSIQPAEDEDDYDDGFSTKHLATARFQRNHRLISEIL  206 (410)
T ss_pred             chHHHHhhhh-hhhhhhh--------ccccccccchhhcCCcceecccccCccccccccchhhhhhhhhhhhhHHHHHHh
Confidence             899999999 9999888        667777778888899999999999999999999999999999999999999999


Q ss_pred             cccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHhhhhHHHHHHHHhhhccCCCC
Q psy1937         230 SDSVVPDVRSVVTTNRMQVLKRQVQSLTMHQKKLEAELQQIEEKFEAKKRKFVESSEQFQEELKKEKASCSEN  302 (307)
Q Consensus       230 s~~~v~d~rs~~~~~r~~~l~~~~~sL~~hq~kle~el~~~e~~~~~kk~~~~~~s~~f~~~~~~~~~~~~~~  302 (307)
                      |.+||||+|||||+.||+||+|||+||++||+|||+||++||++|++|||+|+++||.|++|||++|+++|++
T Consensus       207 SesvVpDvrsVVtt~RMqvlkrQv~SL~~HQ~KLEaEL~q~Ee~hq~kKrk~~estdsf~~eLKr~c~~kvev  279 (410)
T KOG4715|consen  207 SESVVPDVRSVVTTARMQVLKRQVQSLMVHQRKLEAELLQIEERHQEKKRKFLESTDSFNNELKRLCGLKVEV  279 (410)
T ss_pred             ccccccchhhhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhccHHHHHHHHHhcCCcccc
Confidence            9999999999999999999999999999999999999999999999999999999999999999999999954


No 2  
>PTZ00199 high mobility group protein; Provisional
Probab=99.81  E-value=1.1e-19  Score=144.61  Aligned_cols=73  Identities=26%  Similarity=0.592  Sum_probs=67.7

Q ss_pred             ccccCCCCCCCCCCCCCCccchhchhHHHHHHHHHHcCCCC--HHHHHHHHHhhccCCChHHhHHHHHHHHHHHH
Q psy1937          74 TAKTDGRGPKAPKPPEKPLMPYMRYSRKVWDQVKAQNLDLK--LWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKN  146 (307)
Q Consensus        74 ~~k~~~k~~kdp~~PKRPlsAy~lf~~e~r~~v~~e~P~~~--~~eisk~lg~~Wk~Ls~eeK~~Y~~~ae~~K~  146 (307)
                      ..+++++..+||++||||+||||+||.++|..|..+||+++  +++|+++||++|+.||+++|.+|+++|+.+++
T Consensus         9 ~~k~~~k~~kdp~~PKrP~sAY~~F~~~~R~~i~~~~P~~~~~~~evsk~ige~Wk~ls~eeK~~y~~~A~~dk~   83 (94)
T PTZ00199          9 LVRKNKRKKKDPNAPKRALSAYMFFAKEKRAEIIAENPELAKDVAAVGKMVGEAWNKLSEEEKAPYEKKAQEDKV   83 (94)
T ss_pred             cccccCCCCCCCCCCCCCCcHHHHHHHHHHHHHHHHCcCCcccHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHH
Confidence            44556677899999999999999999999999999999986  89999999999999999999999999999975


No 3  
>cd01389 MATA_HMG-box MATA_HMG-box, class I member of the HMG-box superfamily of DNA-binding proteins. These proteins contain a single HMG box, and bind the minor groove of DNA in a highly sequence-specific manner. Members include the fungal mating type gene products MC, MATA1 and Ste11.
Probab=99.72  E-value=1.5e-17  Score=127.21  Aligned_cols=71  Identities=30%  Similarity=0.377  Sum_probs=64.0

Q ss_pred             CCCCCccchhchhHHHHHHHHHHcCCCCHHHHHHHHHhhccCCChHHhHHHHHHHHHHHHHHHhhhhhccC
Q psy1937          87 PPEKPLMPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKNLKASTKLASIS  157 (307)
Q Consensus        87 ~PKRPlsAy~lf~~e~r~~v~~e~P~~~~~eisk~lg~~Wk~Ls~eeK~~Y~~~ae~~K~~k~y~y~~y~~  157 (307)
                      .||||+||||+|+++.|..|+.+||++++.||+++||++|+.||+++|++|.++|+.+++...-+|.+|.-
T Consensus         1 ~~kRP~naf~lf~~~~r~~~~~~~p~~~~~eisk~~g~~Wk~ls~eeK~~y~~~A~~~k~~~~~~~p~Yky   71 (77)
T cd01389           1 KIPRPRNAFILYRQDKHAQLKTENPGLTNNEISRIIGRMWRSESPEVKAYYKELAEEEKERHAREYPDYKY   71 (77)
T ss_pred             CCCCCCcHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHhhCCHHHHHHHHHHHHHHHHHHHHHCCCCcc
Confidence            48999999999999999999999999999999999999999999999999999999998744435666543


No 4  
>KOG4715|consensus
Probab=99.68  E-value=8.2e-18  Score=156.66  Aligned_cols=58  Identities=74%  Similarity=1.032  Sum_probs=56.8

Q ss_pred             CCCCcccccccccccccchhhHHhhhcCCCCCCCCcccchhhHHHHHHhhhhhhhhhc
Q psy1937           1 QEDGYSVKHVAYARYLRNHRLINEIFSDSVVPDVRSVVTTNRMQVLKRQVQSLTMHQH   58 (307)
Q Consensus         1 ~~~~~~~~~~~~~~~~rnh~Li~~~fs~~~Vpd~rslVt~~r~q~lKkq~QSL~~hq~   58 (307)
                      +|||||+||+|.+||+||||||++|||++||||+||+||+.||++|++|++||++||.
T Consensus       181 ~ddg~stkhla~arf~rN~rLIsei~SesvVpDvrsVVtt~RMqvlkrQv~SL~~HQ~  238 (410)
T KOG4715|consen  181 YDDGFSTKHLATARFQRNHRLISEILSESVVPDVRSVVTTARMQVLKRQVQSLMVHQR  238 (410)
T ss_pred             cccccchhhhhhhhhhhhhHHHHHHhccccccchhhhhhHHHHHHHHHHHHHHHHHHH
Confidence            5899999999999999999999999999999999999999999999999999999995


No 5  
>cd01388 SOX-TCF_HMG-box SOX-TCF_HMG-box, class I member of the HMG-box superfamily of DNA-binding proteins. These proteins contain a single HMG box, and bind the minor groove of DNA in a highly sequence-specific manner. Members include SRY and its homologs in insects and vertebrates, and transcription factor-like proteins, TCF-1, -3, -4, and LEF-1. They appear to bind the minor groove of the A/T C A A A G/C-motif.
Probab=99.67  E-value=1.3e-16  Score=120.46  Aligned_cols=68  Identities=22%  Similarity=0.379  Sum_probs=61.5

Q ss_pred             CCCCccchhchhHHHHHHHHHHcCCCCHHHHHHHHHhhccCCChHHhHHHHHHHHHHHHHHHhhhhhc
Q psy1937          88 PEKPLMPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKNLKASTKLAS  155 (307)
Q Consensus        88 PKRPlsAy~lf~~e~r~~v~~e~P~~~~~eisk~lg~~Wk~Ls~eeK~~Y~~~ae~~K~~k~y~y~~y  155 (307)
                      .|||+||||+||+++|..++.+||++++.||+++||++|+.||+++|++|.++|+.+++...-+|.+|
T Consensus         2 iKrP~naf~~F~~~~r~~~~~~~p~~~~~eisk~l~~~Wk~ls~~eK~~y~~~a~~~k~~y~~~~p~y   69 (72)
T cd01388           2 IKRPMNAFMLFSKRHRRKVLQEYPLKENRAISKILGDRWKALSNEEKQPYYEEAKKLKELHMKLYPDY   69 (72)
T ss_pred             CCCCCcHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHHHHCcCC
Confidence            58999999999999999999999999999999999999999999999999999999976333355555


No 6  
>PF00505 HMG_box:  HMG (high mobility group) box;  InterPro: IPR000910 High mobility group (HMG or HMGB) proteins are a family of relatively low molecular weight non-histone components in chromatin. HMG1 (also called HMG-T in fish) and HMG2 are two highly related proteins that bind single-stranded DNA preferentially and unwind double-stranded DNA. Although they have no sequence specificity, they have a high affinity for bent or distorted DNA, and bend linear DNA. HMG1 and HMG2 contain two DNA-binding HMG-box domains (A and B) that show structural and functional differences, and have a long acidic C-terminal domain rich in aspartic and glutamic acid residues. The acidic tail modulates the affinity of the tandem HMG boxes in HMG1 and 2 for a variety of DNA targets. HMG1 and 2 appear to play important architectural roles in the assembly of nucleoprotein complexes in a variety of biological processes, for example V(D)J recombination, the initiation of transcription, and DNA repair []. The profile in this entry describing the HMG-domains is much more general than the signature. In addition to the HMG1 and HMG2 proteins, HMG-domains occur in single or multiple copies in the following protein classes; the SOX family of transcription factors; SRY sex determining region Y protein and related proteins []; LEF1 lymphoid enhancer binding factor 1 []; SSRP recombination signal recognition protein; MTF1 mitochondrial transcription factor 1; UBF1/2 nucleolar transcription factors; Abf2 yeast ARS-binding factor []; and Saccharomyces cerevisiae transcription factors Ixr1, Rox1, Nhp6a, Nhp6b and Spp41.; GO: 0003677 DNA binding; PDB: 1I11_A 1J3C_A 1J3D_A 1WZ6_A 1WGF_A 2D7L_A 1GT0_D 3U2B_C 2CRJ_A 2CS1_A ....
Probab=99.64  E-value=4.2e-16  Score=115.64  Aligned_cols=68  Identities=32%  Similarity=0.556  Sum_probs=58.8

Q ss_pred             CCCCccchhchhHHHHHHHHHHcCCCCHHHHHHHHHhhccCCChHHhHHHHHHHHHHHHHHHhhhhhc
Q psy1937          88 PEKPLMPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKNLKASTKLAS  155 (307)
Q Consensus        88 PKRPlsAy~lf~~e~r~~v~~e~P~~~~~eisk~lg~~Wk~Ls~eeK~~Y~~~ae~~K~~k~y~y~~y  155 (307)
                      ||||+|||++||.+.+..++.+||+++..+|+++||++|++||+++|.+|.++|+.++....-++.+|
T Consensus         1 PkrP~~af~lf~~~~~~~~k~~~p~~~~~~i~~~~~~~W~~l~~~eK~~y~~~a~~~~~~y~~~~~~y   68 (69)
T PF00505_consen    1 PKRPPNAFMLFCKEKRAKLKEENPDLSNKEISKILAQMWKNLSEEEKAPYKEEAEEEKERYEKEMPEY   68 (69)
T ss_dssp             SSSS--HHHHHHHHHHHHHHHHSTTSTHHHHHHHHHHHHHCSHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             CcCCCCHHHHHHHHHHHHHHHHhcccccccchhhHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence            89999999999999999999999999999999999999999999999999999999865222244443


No 7  
>cd01390 HMGB-UBF_HMG-box HMGB-UBF_HMG-box, class II and III members of the HMG-box superfamily of DNA-binding proteins. These proteins bind the minor groove of DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III members include nucleolar and mitochondrial transcription factors, UBF and mtTF1, which bind four-way DNA junctions.
Probab=99.63  E-value=5.4e-16  Score=113.97  Aligned_cols=59  Identities=34%  Similarity=0.708  Sum_probs=57.2

Q ss_pred             CCCCccchhchhHHHHHHHHHHcCCCCHHHHHHHHHhhccCCChHHhHHHHHHHHHHHH
Q psy1937          88 PEKPLMPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKN  146 (307)
Q Consensus        88 PKRPlsAy~lf~~e~r~~v~~e~P~~~~~eisk~lg~~Wk~Ls~eeK~~Y~~~ae~~K~  146 (307)
                      ||||+||||+|+++.|..++.+||++++.+|+++||++|++||+++|.+|.+.|+.++.
T Consensus         1 Pkrp~saf~~f~~~~r~~~~~~~p~~~~~~i~~~~~~~W~~ls~~eK~~y~~~a~~~~~   59 (66)
T cd01390           1 PKRPLSAYFLFSQEQRPKLKKENPDASVTEVTKILGEKWKELSEEEKKKYEEKAEKDKE   59 (66)
T ss_pred             CCCCCcHHHHHHHHHHHHHHHHCcCCCHHHHHHHHHHHHHhCCHHHHHHHHHHHHHHHH
Confidence            89999999999999999999999999999999999999999999999999999998854


No 8  
>smart00398 HMG high mobility group.
Probab=99.62  E-value=1.3e-15  Score=112.59  Aligned_cols=69  Identities=29%  Similarity=0.501  Sum_probs=61.4

Q ss_pred             CCCCCccchhchhHHHHHHHHHHcCCCCHHHHHHHHHhhccCCChHHhHHHHHHHHHHHHHHHhhhhhc
Q psy1937          87 PPEKPLMPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKNLKASTKLAS  155 (307)
Q Consensus        87 ~PKRPlsAy~lf~~e~r~~v~~e~P~~~~~eisk~lg~~Wk~Ls~eeK~~Y~~~ae~~K~~k~y~y~~y  155 (307)
                      +||+|+||||+|+.+.|..+..+||++++.+|+++||.+|+.||+++|.+|.++|+.++....-++..|
T Consensus         1 ~pkrp~~~y~~f~~~~r~~~~~~~~~~~~~~i~~~~~~~W~~l~~~ek~~y~~~a~~~~~~y~~~~~~y   69 (70)
T smart00398        1 KPKRPMSAFMLFSQENRAKIKAENPDLSNAEISKKLGERWKLLSEEEKAPYEEKAKKDKERYEEEMPEY   69 (70)
T ss_pred             CcCCCCcHHHHHHHHHHHHHHHHCcCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence            589999999999999999999999999999999999999999999999999999999865322244444


No 9  
>COG5648 NHP6B Chromatin-associated proteins containing the HMG domain [Chromatin structure and dynamics]
Probab=99.61  E-value=7.7e-16  Score=136.99  Aligned_cols=83  Identities=24%  Similarity=0.492  Sum_probs=72.5

Q ss_pred             cCCCCCCCCCCCCCCccchhchhHHHHHHHHHHcCCCCHHHHHHHHHhhccCCChHHhHHHHHHHHHHHHHHHhhhhhcc
Q psy1937          77 TDGRGPKAPKPPEKPLMPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKNLKASTKLASI  156 (307)
Q Consensus        77 ~~~k~~kdp~~PKRPlsAy~lf~~e~r~~v~~e~P~~~~~eisk~lg~~Wk~Ls~eeK~~Y~~~ae~~K~~k~y~y~~y~  156 (307)
                      ...+..+||+.||||+||||+|+.++|++++.++|++++.+|++++|++|++|+++||.+|..+|..+++...-....|.
T Consensus        60 ~~~r~k~dpN~PKRp~sayf~y~~~~R~ei~~~~p~l~~~e~~k~~~e~WK~Ltd~eke~y~k~~~~~~erYq~ek~~y~  139 (211)
T COG5648          60 RLVRKKKDPNGPKRPLSAYFLYSAENRDEIRKENPKLTFGEVGKLLSEKWKELTDEEKEPYYKEANSDRERYQREKEEYN  139 (211)
T ss_pred             HHHHHhcCCCCCCCchhHHHHHHHHHHHHHHHhCCCCChHHHHHHHHHHHHhccHhhhhhHHHHHhhHHHHHHHHHHhhh
Confidence            44667889999999999999999999999999999999999999999999999999999999999999652222555666


Q ss_pred             Ccc
Q psy1937         157 SRP  159 (307)
Q Consensus       157 ~~k  159 (307)
                      .+.
T Consensus       140 ~k~  142 (211)
T COG5648         140 KKL  142 (211)
T ss_pred             ccc
Confidence            555


No 10 
>KOG0527|consensus
Probab=99.58  E-value=2.3e-15  Score=143.90  Aligned_cols=78  Identities=21%  Similarity=0.334  Sum_probs=69.0

Q ss_pred             CCCCCCCCCCCccchhchhHHHHHHHHHHcCCCCHHHHHHHHHhhccCCChHHhHHHHHHHHHHHHHHHhhhhhccCc
Q psy1937          81 GPKAPKPPEKPLMPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKNLKASTKLASISR  158 (307)
Q Consensus        81 ~~kdp~~PKRPlsAy~lf~~e~r~~v~~e~P~~~~~eisk~lg~~Wk~Ls~eeK~~Y~~~ae~~K~~k~y~y~~y~~~  158 (307)
                      ..+.....||||||||+|++..|.++..+||++.+.||+|.||.+|+.|+++||.||+++|++.|+++.-+|.+|.-.
T Consensus        56 ~k~~~~hIKRPMNAFMVWSq~~RRkma~qnP~mHNSEISK~LG~~WK~Lse~EKrPFi~EAeRLR~~HmkehPdYKYR  133 (331)
T KOG0527|consen   56 DKTSTDRIKRPMNAFMVWSQGQRRKLAKQNPKMHNSEISKRLGAEWKLLSEEEKRPFVDEAERLRAQHMKEYPDYKYR  133 (331)
T ss_pred             CCCCccccCCCcchhhhhhHHHHHHHHHhCcchhhHHHHHHHHHHHhhcCHhhhccHHHHHHHHHHHHHHhCCCcccc
Confidence            445567789999999999999999999999999999999999999999999999999999999999555555555543


No 11 
>cd00084 HMG-box High Mobility Group (HMG)-box is found in a variety of eukaryotic chromosomal proteins and transcription factors. HMGs bind to the minor groove of DNA and have been classified by DNA binding preferences. Two phylogenically distinct groups of Class I proteins bind DNA in a sequence specific fashion and contain a single HMG box. One group (SOX-TCF) includes transcription factors, TCF-1, -3, -4; and also SRY and LEF-1, which bind four-way DNA junctions and duplex DNA targets. The second group (MATA) includes fungal mating type gene products MC, MATA1 and Ste11. Class II and III proteins (HMGB-UBF) bind DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III member
Probab=99.54  E-value=1.3e-14  Score=106.05  Aligned_cols=59  Identities=41%  Similarity=0.767  Sum_probs=57.2

Q ss_pred             CCCCccchhchhHHHHHHHHHHcCCCCHHHHHHHHHhhccCCChHHhHHHHHHHHHHHH
Q psy1937          88 PEKPLMPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKN  146 (307)
Q Consensus        88 PKRPlsAy~lf~~e~r~~v~~e~P~~~~~eisk~lg~~Wk~Ls~eeK~~Y~~~ae~~K~  146 (307)
                      ||||+||||+|+++.|..++.++|+++..+|+++||.+|+.|++++|.+|.+.|+.++.
T Consensus         1 pkrp~~af~~f~~~~~~~~~~~~~~~~~~~i~~~~~~~W~~l~~~~k~~y~~~a~~~~~   59 (66)
T cd00084           1 PKRPLSAYFLFSQEHRAEVKAENPGLSVGEISKILGEMWKSLSEEEKKKYEEKAEKDKE   59 (66)
T ss_pred             CCCCCcHHHHHHHHHHHHHHHHCcCCCHHHHHHHHHHHHHhCCHHHHHHHHHHHHHHHH
Confidence            79999999999999999999999999999999999999999999999999999999854


No 12 
>PF09011 HMG_box_2:  HMG-box domain;  InterPro: IPR015101 This domain is predominantly found in Maelstrom homologue proteins. It has no known function. ; GO: 0005634 nucleus; PDB: 2EQZ_A 1V64_A 2CTO_A 1H5P_A 3TQ6_A 3FGH_A 3TMM_A 1J3X_A 2YRQ_A 1AAB_A ....
Probab=99.54  E-value=1.1e-14  Score=110.19  Aligned_cols=62  Identities=27%  Similarity=0.553  Sum_probs=54.7

Q ss_pred             CCCCCCCccchhchhHHHHHHHHHH-cCCCCHHHHHHHHHhhccCCChHHhHHHHHHHHHHHH
Q psy1937          85 PKPPEKPLMPYMRYSRKVWDQVKAQ-NLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKN  146 (307)
Q Consensus        85 p~~PKRPlsAy~lf~~e~r~~v~~e-~P~~~~~eisk~lg~~Wk~Ls~eeK~~Y~~~ae~~K~  146 (307)
                      |++||+|+|||++|+.+++..++.. .+...+.|+++.||.+|++||++||.+|.+.|+.+++
T Consensus         1 p~kpK~~~say~lF~~~~~~~~k~~G~~~~~~~e~~k~~~~~Wk~Ls~~EK~~Y~~~A~~~k~   63 (73)
T PF09011_consen    1 PKKPKRPPSAYNLFMKEMRKEVKEEGGQKQSFREVMKEISERWKSLSEEEKEPYEERAKEDKE   63 (73)
T ss_dssp             SSS--SSSSHHHHHHHHHHHHHHHHT-T-SSHHHHHHHHHHHHHHS-HHHHHHHHHHHHHHHH
T ss_pred             CcCCCCCCCHHHHHHHHHHHHHHHhcccCCCHHHHHHHHHHHHHhcCHHHHHHHHHHHHHHHH
Confidence            6889999999999999999999998 7888999999999999999999999999999999976


No 13 
>KOG3248|consensus
Probab=99.52  E-value=1.7e-14  Score=135.66  Aligned_cols=73  Identities=21%  Similarity=0.350  Sum_probs=67.8

Q ss_pred             CCCCCccchhchhHHHHHHHHHHcCCCCHHHHHHHHHhhccCCChHHhHHHHHHHHHHHH--HHHh----hhhhccCcc
Q psy1937          87 PPEKPLMPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKN--LKAS----TKLASISRP  159 (307)
Q Consensus        87 ~PKRPlsAy~lf~~e~r~~v~~e~P~~~~~eisk~lg~~Wk~Ls~eeK~~Y~~~ae~~K~--~k~y----~y~~y~~~k  159 (307)
                      ..|+||||||+|++|+|.+|.+++......+|.+|||++|.+||.||..+|+++|.++++  |+.|    +.+||.++|
T Consensus       191 hiKKPLNAFmlyMKEmRa~vvaEctlKeSAaiNqiLGrRWH~LSrEEQAKYyElArKerqlH~qlYP~WSARdNYgKKk  269 (421)
T KOG3248|consen  191 HIKKPLNAFMLYMKEMRAKVVAECTLKESAAINQILGRRWHALSREEQAKYYELARKERQLHMQLYPGWSARDNYGKKK  269 (421)
T ss_pred             cccccHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHhHHHhhhhHHHHHHHHHHHHHHHHHHHHhcCCcchhhhhhhhh
Confidence            469999999999999999999999988899999999999999999999999999999999  6777    778888766


No 14 
>KOG0381|consensus
Probab=99.48  E-value=1.3e-13  Score=109.04  Aligned_cols=63  Identities=32%  Similarity=0.670  Sum_probs=59.8

Q ss_pred             CC--CCCCCCccchhchhHHHHHHHHHHcCCCCHHHHHHHHHhhccCCChHHhHHHHHHHHHHHH
Q psy1937          84 AP--KPPEKPLMPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKN  146 (307)
Q Consensus        84 dp--~~PKRPlsAy~lf~~e~r~~v~~e~P~~~~~eisk~lg~~Wk~Ls~eeK~~Y~~~ae~~K~  146 (307)
                      +|  ..||||++||++|+.+.|..++.+||++++.||+++||++|++|++++|.+|+..|..++.
T Consensus        17 ~p~~~~pkrp~sa~~~f~~~~~~~~k~~~p~~~~~~v~k~~g~~W~~l~~~~k~~y~~ka~~~k~   81 (96)
T KOG0381|consen   17 DPNAQAPKRPLSAFFLFSSEQRSKIKAENPGLSVGEVAKALGEMWKNLAEEEKQPYEEKASKLKE   81 (96)
T ss_pred             CCCCCCCCCCCcHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHH
Confidence            56  5999999999999999999999999999999999999999999999999999999988865


No 15 
>KOG0526|consensus
Probab=99.28  E-value=2.9e-12  Score=126.97  Aligned_cols=76  Identities=17%  Similarity=0.385  Sum_probs=66.1

Q ss_pred             cCCCCCCCCCCCCCCccchhchhHHHHHHHHHHcCCCCHHHHHHHHHhhccCCChHHhHHHHHHHHHHHHHHHhhhhhcc
Q psy1937          77 TDGRGPKAPKPPEKPLMPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKNLKASTKLASI  156 (307)
Q Consensus        77 ~~~k~~kdp~~PKRPlsAy~lf~~e~r~~v~~e~P~~~~~eisk~lg~~Wk~Ls~eeK~~Y~~~ae~~K~~k~y~y~~y~  156 (307)
                      ++.++.|||++||||++|||+|++..|..|+.+  +.+++||+|.+|++|+.|+.  |.+|+++|+.+|+...-++..|.
T Consensus       525 k~~kk~kdpnapkra~sa~m~w~~~~r~~ik~d--gi~~~dv~kk~g~~wk~ms~--k~~we~ka~~dk~ry~~em~~yk  600 (615)
T KOG0526|consen  525 KKGKKKKDPNAPKRATSAYMLWLNASRESIKED--GISVGDVAKKAGEKWKQMSA--KEEWEDKAAVDKQRYEDEMKEYK  600 (615)
T ss_pred             cCcccCCCCCCCccchhHHHHHHHhhhhhHhhc--CchHHHHHHHHhHHHhhhcc--cchhhHHHHHHHHHHHHHHHhhc
Confidence            556778999999999999999999999999988  89999999999999999999  99999999999862222444444


No 16 
>KOG0528|consensus
Probab=98.91  E-value=7.7e-10  Score=108.86  Aligned_cols=73  Identities=16%  Similarity=0.322  Sum_probs=61.5

Q ss_pred             CCCCCCCCccchhchhHHHHHHHHHHcCCCCHHHHHHHHHhhccCCChHHhHHHHHHHHHHHHHHHhhhhhcc
Q psy1937          84 APKPPEKPLMPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKNLKASTKLASI  156 (307)
Q Consensus        84 dp~~PKRPlsAy~lf~~e~r~~v~~e~P~~~~~eisk~lg~~Wk~Ls~eeK~~Y~~~ae~~K~~k~y~y~~y~  156 (307)
                      .+..-||||||||.|.++.|.+|...+||+.+..|+||||.+|+.|+..||+||+++-+..-.++.-.|.+|.
T Consensus       322 s~PHIKRPMNAFMVWAkDERRKILqA~PDMHNSnISKILGSRWKaMSN~eKQPYYEEQaRLSk~HlEk~PdYr  394 (511)
T KOG0528|consen  322 SEPHIKRPMNAFMVWAKDERRKILQAFPDMHNSNISKILGSRWKAMSNTEKQPYYEEQARLSKLHLEKYPDYR  394 (511)
T ss_pred             CCccccCCcchhhcccchhhhhhhhcCccccccchhHHhcccccccccccccchHHHHHHHHHhhhccCcccc
Confidence            3445699999999999999999999999999999999999999999999999999887765543333444444


No 17 
>KOG2746|consensus
Probab=98.34  E-value=2.8e-07  Score=94.12  Aligned_cols=66  Identities=20%  Similarity=0.356  Sum_probs=61.1

Q ss_pred             CCCCCCCCCCCccchhchhHHHH--HHHHHHcCCCCHHHHHHHHHhhccCCChHHhHHHHHHHHHHHH
Q psy1937          81 GPKAPKPPEKPLMPYMRYSRKVW--DQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKN  146 (307)
Q Consensus        81 ~~kdp~~PKRPlsAy~lf~~e~r--~~v~~e~P~~~~~eisk~lg~~Wk~Ls~eeK~~Y~~~ae~~K~  146 (307)
                      .+.+.....+||||||+||+.+|  ..+...||+..++.|++|||++|-.|.+.||+.|.++|.+.|.
T Consensus       175 nkr~k~HirrPMnaf~ifskrhr~~g~vhq~~pn~DNrtIskiLgewWytL~~~Ekq~yhdLa~Qvk~  242 (683)
T KOG2746|consen  175 NKRDKDHIRRPMNAFHIFSKRHRGEGRVHQRHPNQDNRTISKILGEWWYTLGPNEKQKYHDLAFQVKE  242 (683)
T ss_pred             CcCcchhhhhhhHHHHHHHhhcCCccchhccCccccchhHHHHHhhhHhhhCchhhhhHHHHHHHHHH
Confidence            44556677899999999999999  9999999999999999999999999999999999999999876


No 18 
>PF08549 SWI-SNF_Ssr4:  Fungal domain of unknown function (DUF1750);  InterPro: IPR013859  This is a fungal protein of unknown function. 
Probab=96.74  E-value=0.0036  Score=64.90  Aligned_cols=85  Identities=19%  Similarity=0.400  Sum_probs=61.2

Q ss_pred             CCCccccccccccccchhhHHhhhcc-----cCCC-C--------cchhhh---------------------------HH
Q psy1937         206 DGYSVKHVAYARYLRNHRLINEIFSD-----SVVP-D--------VRSVVT---------------------------TN  244 (307)
Q Consensus       206 dg~S~k~~a~~Rf~rNH~l~~Eifs~-----~~v~-d--------~rs~~~---------------------------~~  244 (307)
                      |-+|+|.|..+||+-||+.|+||||.     -++| |        +-||+.                           ..
T Consensus       281 D~lTPReIS~~RY~qhHEWMEEI~sSPY~i~qI~P~DLGLGrKGEL~sLTeGiF~ap~~~~~~~~~~~~~~~~~gkLdp~  360 (669)
T PF08549_consen  281 DHLTPREISKMRYQQHHEWMEEILSSPYRISQIEPVDLGLGRKGELESLTEGIFEAPGGQSGDASKEGPKKPYVGKLDPG  360 (669)
T ss_pred             ccCCHHHHHHHHHHHHHHHHHHHHhCccccccccccccCCccccchhhhhcccccCCCCCCccccccCCCcccccCCCHH
Confidence            45889999999999999999999997     2222 2        222221                           22


Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHhhhhHHHHHHHH
Q psy1937         245 RMQVLKRQVQSLTMHQKKLEAELQQIEEKFEAKKRKFVESSEQFQEELK  293 (307)
Q Consensus       245 r~~~l~~~~~sL~~hq~kle~el~~~e~~~~~kk~~~~~~s~~f~~~~~  293 (307)
                      ......+   +...|.....+||+.|+.+|+.+-.+|...|--=..|+.
T Consensus       361 ~aeeF~k---RV~~~ia~~~AEIekmK~~Hak~m~k~k~~s~lk~AE~~  406 (669)
T PF08549_consen  361 KAEEFRK---RVAKKIADMNAEIEKMKARHAKRMAKFKRNSLLKDAEKE  406 (669)
T ss_pred             HHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHhhccHHHHHHHH
Confidence            2233333   345688899999999999999999999998865555543


No 19 
>PF04690 YABBY:  YABBY protein;  InterPro: IPR006780 YABBY proteins are a group of plant-specific transcription factors involved in the specification of abaxial polarity in lateral organs such as leaves and floral organs [, ].
Probab=96.45  E-value=0.0038  Score=54.89  Aligned_cols=49  Identities=22%  Similarity=0.405  Sum_probs=44.0

Q ss_pred             CCCCCCCCCCccchhchhHHHHHHHHHHcCCCCHHHHHHHHHhhccCCC
Q psy1937          82 PKAPKPPEKPLMPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLP  130 (307)
Q Consensus        82 ~kdp~~PKRPlsAy~lf~~e~r~~v~~e~P~~~~~eisk~lg~~Wk~Ls  130 (307)
                      .|+|.+..|-++||-.|+++.-..|++.+|+++..|.-+..+..|...+
T Consensus       116 ~kPPEKRqR~psaYn~f~k~ei~rik~~~p~ishkeaFs~aAknW~h~p  164 (170)
T PF04690_consen  116 NKPPEKRQRVPSAYNRFMKEEIQRIKAENPDISHKEAFSAAAKNWAHFP  164 (170)
T ss_pred             cCCccccCCCchhHHHHHHHHHHHHHhcCCCCCHHHHHHHHHHhhhhCc
Confidence            4556666788999999999999999999999999999999999998765


No 20 
>PF06382 DUF1074:  Protein of unknown function (DUF1074);  InterPro: IPR024460 This family consists of several proteins which appear to be specific to Insecta. The function of this family is unknown.
Probab=95.88  E-value=0.0095  Score=52.53  Aligned_cols=49  Identities=16%  Similarity=0.261  Sum_probs=44.1

Q ss_pred             ccchhchhHHHHHHHHHHcCCCCHHHHHHHHHhhccCCChHHhHHHHHHHHHH
Q psy1937          92 LMPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQE  144 (307)
Q Consensus        92 lsAy~lf~~e~r~~v~~e~P~~~~~eisk~lg~~Wk~Ls~eeK~~Y~~~ae~~  144 (307)
                      -+||+-|++++|.    .|.++...|+....+.+|..||+.+|..|..++...
T Consensus        83 nnaYLNFLReFRr----kh~~L~p~dlI~~AAraW~rLSe~eK~rYrr~~~~~  131 (183)
T PF06382_consen   83 NNAYLNFLREFRR----KHCGLSPQDLIQRAARAWCRLSEAEKNRYRRMAPSV  131 (183)
T ss_pred             chHHHHHHHHHHH----HccCCCHHHHHHHHHHHHHhCCHHHHHHHHhhcchh
Confidence            4689999999887    468999999999999999999999999999987654


No 21 
>COG5648 NHP6B Chromatin-associated proteins containing the HMG domain [Chromatin structure and dynamics]
Probab=95.87  E-value=0.0062  Score=55.01  Aligned_cols=62  Identities=29%  Similarity=0.571  Sum_probs=57.1

Q ss_pred             CCCCCCCccchhchhHHHHHHHHHHcCCCCHHHHHHHHHhhccCCChHHhHHHHHHHHHHHH
Q psy1937          85 PKPPEKPLMPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKN  146 (307)
Q Consensus        85 p~~PKRPlsAy~lf~~e~r~~v~~e~P~~~~~eisk~lg~~Wk~Ls~eeK~~Y~~~ae~~K~  146 (307)
                      ..+|+.|..+|+-+-.+.|+.+...+|+....++++++|..|++|++.-|.+|.+.|.+.+.
T Consensus       141 k~~~~~~~~~~~e~~~~~r~~~~~~~~~~~~~e~~k~~~~~w~el~~skK~~~~~~~Kk~k~  202 (211)
T COG5648         141 KLPNKAPIGPFIENEPKIRPKVEGPSPDKALVEETKIISKAWSELDESKKKKYIDKYKKLKE  202 (211)
T ss_pred             ccCCCCCCchhhhccHHhccccCCCCcchhhhHHhhhhhhhhhhhChhhhhHHHHHHHHHHH
Confidence            35678899999999999999999999999999999999999999999999999999998753


No 22 
>PF14887 HMG_box_5:  HMG (high mobility group) box 5; PDB: 1L8Y_A 1L8Z_A 2HDZ_A.
Probab=95.76  E-value=0.031  Score=42.97  Aligned_cols=69  Identities=14%  Similarity=0.162  Sum_probs=52.2

Q ss_pred             CCCCCccchhchhHHHHHHHHHHcCCCCHHHHHHHHHhhccCCChHHhHHHHHHHHHHHHHHHh--hhhhccCc
Q psy1937          87 PPEKPLMPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYEQEKNLKAS--TKLASISR  158 (307)
Q Consensus        87 ~PKRPlsAy~lf~~e~r~~v~~e~P~~~~~eisk~lg~~Wk~Ls~eeK~~Y~~~ae~~K~~k~y--~y~~y~~~  158 (307)
                      -|..|-+|--+|.+.....+.+.+++-...+ -+.+...|++|++.+|-+|...|.++  .+.|  .+-.|..+
T Consensus         3 lPE~PKt~qe~Wqq~vi~dYla~~~~dr~K~-~kam~~~W~~me~Kekl~WIkKA~Ed--qKrYE~el~e~r~~   73 (85)
T PF14887_consen    3 LPETPKTAQEIWQQSVIGDYLAKFRNDRKKA-LKAMEAQWSQMEKKEKLKWIKKAAED--QKRYERELREMRSA   73 (85)
T ss_dssp             -S----THHHHHHHHHHHHHHHHTTSTHHHH-HHHHHHHHHTTGGGHHHHHHHHHHHH--HHHHHHHHHCCS-C
T ss_pred             CCCCCCCHHHHHHHHHHHHHHHHhhHhHHHH-HHHHHHHHHHhhhhhhhHHHHHHHHH--HHHHHHHHHHHhcC
Confidence            4777888889999999999999998877766 45899999999999999999999988  4666  44445443


No 23 
>PF08073 CHDNT:  CHDNT (NUC034) domain;  InterPro: IPR012958 The CHD N-terminal domain is found in PHD/RING fingers and chromo domain-associated helicases [].; GO: 0003677 DNA binding, 0005524 ATP binding, 0008270 zinc ion binding, 0016818 hydrolase activity, acting on acid anhydrides, in phosphorus-containing anhydrides, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=93.59  E-value=0.076  Score=38.27  Aligned_cols=39  Identities=18%  Similarity=0.370  Sum_probs=35.1

Q ss_pred             ccchhchhHHHHHHHHHHcCCCCHHHHHHHHHhhccCCC
Q psy1937          92 LMPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLP  130 (307)
Q Consensus        92 lsAy~lf~~e~r~~v~~e~P~~~~~eisk~lg~~Wk~Ls  130 (307)
                      ++.|=.|++-.|+.|.+.||++..+.|..+++.+|+.-+
T Consensus        13 lt~yK~Fsq~vRP~l~~~NPk~~~sKl~~l~~AKwrEF~   51 (55)
T PF08073_consen   13 LTNYKAFSQHVRPLLAKANPKAPMSKLMMLLQAKWREFQ   51 (55)
T ss_pred             HHHHHHHHHHHHHHHHHHCCCCcHHHHHHHHHHHHHHHH
Confidence            456788999999999999999999999999999998543


No 24 
>PF06320 GCN5L1:  GCN5-like protein 1 (GCN5L1);  InterPro: IPR009395 This family consists of several eukaryotic GCN5-like protein 1 (GCN5L1) sequences. The function of this family is unknown [,].
Probab=90.51  E-value=1.8  Score=36.08  Aligned_cols=59  Identities=20%  Similarity=0.328  Sum_probs=51.0

Q ss_pred             hhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHhHHhhhhHHHHHHHHhhhc
Q psy1937         239 SVVTTNRMQVLKRQVQSLTMHQKKLEAELQQIEEKF---EAKKRKFVESSEQFQEELKKEKA  297 (307)
Q Consensus       239 s~~~~~r~~~l~~~~~sL~~hq~kle~el~~~e~~~---~~kk~~~~~~s~~f~~~~~~~~~  297 (307)
                      ..++..-+..|...|+.+...|++|+.|..++....   ...-.+|+...+.|+..||.+++
T Consensus        28 ~~l~~~l~d~ln~~v~~~~~Nqk~ie~e~k~L~~~~~~l~kqt~qw~~~~~~~~~~LKEiGD   89 (121)
T PF06320_consen   28 QALTNALVDHLNSRVSEAYENQKKIEKEAKQLQRNTAKLAKQTDQWLKLVDSFNDALKEIGD   89 (121)
T ss_pred             HHHHHHHHHHHHHhHHHHHHhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc
Confidence            346677788899999999999999999888887655   66778899999999999999975


No 25 
>PF04769 MAT_Alpha1:  Mating-type protein MAT alpha 1;  InterPro: IPR006856 This family includes Saccharomyces cerevisiae (Baker's yeast) mating type protein alpha 1 (P01365 from SWISSPROT). MAT alpha 1 is a transcription activator that activates mating-type alpha-specific genes with the help of the MADS-box containing MCM1 transcription factor, which together bind cooperatively to PQ elements upstream of alpha-specific genes. The MCM1-MATalpha1 complex is required for the proper DNA-bending that is needed for transcriptional activation []. Alpha 1 interacts in vivo with STE12, linking expression of alpha-specific genes to the alpha-pheromone (IPR006742 from INTERPRO) response pathway [].; GO: 0000772 mating pheromone activity, 0003677 DNA binding, 0045895 positive regulation of transcription, mating-type specific, 0005634 nucleus
Probab=89.97  E-value=0.39  Score=43.46  Aligned_cols=54  Identities=19%  Similarity=0.330  Sum_probs=40.3

Q ss_pred             CCCCCCCCCccchhchhHHHHHHHHHHcCCCCHHHHHHHHHhhccCCChHHhHHHHHHHH
Q psy1937          83 KAPKPPEKPLMPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDYE  142 (307)
Q Consensus        83 kdp~~PKRPlsAy~lf~~e~r~~v~~e~P~~~~~eisk~lg~~Wk~Ls~eeK~~Y~~~ae  142 (307)
                      ..+..++||+|+||.|..-+-    ...|+....+++.+|+..|..=+-  |..|.-.|.
T Consensus        39 ~~~~~~kr~lN~Fm~FRsyy~----~~~~~~~Qk~~S~~l~~lW~~dp~--k~~W~l~ak   92 (201)
T PF04769_consen   39 RSPEKAKRPLNGFMAFRSYYS----PIFPPLPQKELSGILTKLWEKDPF--KNKWSLMAK   92 (201)
T ss_pred             ccccccccchhHHHHHHHHHH----hhcCCcCHHHHHHHHHHHHhCCcc--HhHHHHHhh
Confidence            345568999999999976655    446888999999999999986332  445555554


No 26 
>PF06244 DUF1014:  Protein of unknown function (DUF1014);  InterPro: IPR010422 This family consists of several hypothetical eukaryotic proteins of unknown function.
Probab=84.78  E-value=0.93  Score=37.93  Aligned_cols=47  Identities=21%  Similarity=0.497  Sum_probs=40.4

Q ss_pred             CCCCCC-ccchhchhHHHHHHHHHHcCCCCHHHHHHHHHhhccCCChH
Q psy1937          86 KPPEKP-LMPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPED  132 (307)
Q Consensus        86 ~~PKRP-lsAy~lf~~e~r~~v~~e~P~~~~~eisk~lg~~Wk~Ls~e  132 (307)
                      ..|-|. --||.-|....-+.++.+||++..+.+-.+|-.+|..-|+.
T Consensus        70 rHPErR~KAAy~afeE~~Lp~lK~E~PgLrlsQ~kq~l~K~w~KSPeN  117 (122)
T PF06244_consen   70 RHPERRMKAAYKAFEERRLPELKEENPGLRLSQYKQMLWKEWQKSPEN  117 (122)
T ss_pred             CCcchhHHHHHHHHHHHHhHHHHhhCCCchHHHHHHHHHHHHhcCCCC
Confidence            345444 47899999999999999999999999999999999877653


No 27 
>PF11068 YlqD:  YlqD protein;  InterPro: IPR021297  This bacterial family of proteins has no known function. ; PDB: 4DCI_C.
Probab=56.69  E-value=74  Score=26.89  Aligned_cols=61  Identities=25%  Similarity=0.426  Sum_probs=46.9

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHH--------HHHHHHHHHHHhHHhhhhHHHHHHHHhhhccCCCCC
Q psy1937         243 TNRMQVLKRQVQSLTMHQKKLEAEL--------QQIEEKFEAKKRKFVESSEQFQEELKKEKASCSENE  303 (307)
Q Consensus       243 ~~r~~~l~~~~~sL~~hq~kle~el--------~~~e~~~~~kk~~~~~~s~~f~~~~~~~~~~~~~~~  303 (307)
                      .+.+.-|-.+.+.|..-.+++..|.        ..+...|+..+.++.+--..+...|..+-.+.++.|
T Consensus        26 ~~~i~~~d~el~QLefq~kr~~~e~~~~~~~~~~~i~~q~~~e~~~r~e~k~~l~~ql~qv~~L~lgsE   94 (131)
T PF11068_consen   26 QEQIQQLDQELQQLEFQGKRMIKEIKKQNAQQIQSIQQQFEQEKQERLEQKNQLLQQLEQVQKLELGSE   94 (131)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHHHHHHHTTSSHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHS-TT-E
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHhhcchhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCE
Confidence            3456667777888888888877764        577889999999999999999999988877766654


No 28 
>PF11629 Mst1_SARAH:  C terminal SARAH domain of Mst1;  InterPro: IPR024205 The SARAH (Sav/Rassf/Hpo) domain is found at the C terminus in three classes of eukaryotic tumour suppressors that give the domain its name. In the Sav (Salvador) and Hpo (Hippo) families, the SARAH domain mediates signal transduction from Hpo via the Sav scaffolding protein to the downstream component Wts (Warts); the phosphorylation of Wts by Hpo triggers cell cycle arrest and apoptosis by down-regulating cyclin E, Diap 1 and other targets []. The SARAH domain is also involved in dimerisation, as in the human Hpo orthologue, Mst1, which homodimerises via its C-terminal SARAH domain. The SARAH domain is found associated with other domains, such as protein kinase domains, WW/rsp5/WWP domain (IPR001202 from INTERPRO), C1 domain (IPR002219 from INTERPRO), LIM domain (IPR001781 from INTERPRO), or the Ras-associating (RA) domain (IPR000159 from INTERPRO).; GO: 0004674 protein serine/threonine kinase activity; PDB: 2JO8_A.
Probab=55.72  E-value=57  Score=23.01  Aligned_cols=35  Identities=20%  Similarity=0.582  Sum_probs=27.7

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHhhh
Q psy1937         247 QVLKRQVQSLTMHQKKLEAELQQIEEKFEAKKRKFVES  284 (307)
Q Consensus       247 ~~l~~~~~sL~~hq~kle~el~~~e~~~~~kk~~~~~~  284 (307)
                      ..|.+...+|..   ..|.||..+..+|++|++=+++.
T Consensus        11 ~eL~~rl~~LD~---~ME~Eieelr~RY~~KRqPIldA   45 (49)
T PF11629_consen   11 EELQQRLASLDP---EMEQEIEELRQRYQAKRQPILDA   45 (49)
T ss_dssp             HHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHhCCH---HHHHHHHHHHHHHHHhhccHHHH
Confidence            456666667766   46889999999999999888765


No 29 
>TIGR03481 HpnM hopanoid biosynthesis associated membrane protein HpnM. The genomes containing members of this family share the machinery for the biosynthesis of hopanoid lipids. Furthermore, the genes of this family are usually located proximal to other components of this biological process. The proteins are members of the pfam05494 family of putative transporters known as "toluene tolerance protein Ttg2D", although it is unlikely that the members included here have anything to do with toluene per-se.
Probab=54.76  E-value=14  Score=33.10  Aligned_cols=39  Identities=13%  Similarity=0.438  Sum_probs=31.7

Q ss_pred             CCCCHHHHHH-HHHhhccCCChHHhHHHHHHHHHHHHHHHh
Q psy1937         111 LDLKLWEIGK-IIGQMWRDLPEDQKTEYVEDYEQEKNLKAS  150 (307)
Q Consensus       111 P~~~~~eisk-~lg~~Wk~Ls~eeK~~Y~~~ae~~K~~k~y  150 (307)
                      |...+..|++ .||.-|+.+|+++|+.|.+....-- ...|
T Consensus        62 ~~~Df~~mar~vLG~~W~~~s~~Qr~~F~~~F~~~l-~~tY  101 (198)
T TIGR03481        62 EAFDLPAMARLTLGSSWTSLSPEQRRRFIGAFRELS-IATY  101 (198)
T ss_pred             HhCCHHHHHHHHhhhhhhhCCHHHHHHHHHHHHHHH-HHHH
Confidence            4678888987 6899999999999999999987631 3555


No 30 
>PRK15117 ABC transporter periplasmic binding protein MlaC; Provisional
Probab=53.21  E-value=18  Score=32.82  Aligned_cols=55  Identities=13%  Similarity=0.190  Sum_probs=39.4

Q ss_pred             HHHHHHH-cCCCCHHHHHH-HHHhhccCCChHHhHHHHHHHHHHHHHHHh--hhhhccCc
Q psy1937         103 WDQVKAQ-NLDLKLWEIGK-IIGQMWRDLPEDQKTEYVEDYEQEKNLKAS--TKLASISR  158 (307)
Q Consensus       103 r~~v~~e-~P~~~~~eisk-~lg~~Wk~Ls~eeK~~Y~~~ae~~K~~k~y--~y~~y~~~  158 (307)
                      +..|... .|...+..|++ .||.-|+.+|+++|..|.+....-- ...|  +...|-..
T Consensus        57 ~~~v~~~l~p~~Df~~~s~~vLG~~wr~as~eQr~~F~~~F~~~L-v~tYa~~l~~y~~q  115 (211)
T PRK15117         57 RTIVDQELLPYVQVKYAGALVLGRYYKDATPAQREAYFAAFREYL-KQAYGQALAMYHGQ  115 (211)
T ss_pred             HHHHHHHccccCCHHHHHHHHhhhhhhhCCHHHHHHHHHHHHHHH-HHHHHHHHHHhCCc
Confidence            3444333 38889999987 6899999999999999999887642 3555  44455443


No 31 
>PF05494 Tol_Tol_Ttg2:  Toluene tolerance, Ttg2 ;  InterPro: IPR008869 Toluene tolerance is mediated by increased cell membrane rigidity resulting from changes in fatty acid and phospholipid compositions, exclusion of toluene from the cell membrane, and removal of intracellular toluene by degradation []. Many proteins are involved in these processes. This family is a transporter which shows similarity to ABC transporters [].; PDB: 2QGU_A.
Probab=52.61  E-value=15  Score=31.65  Aligned_cols=47  Identities=15%  Similarity=0.319  Sum_probs=32.3

Q ss_pred             cCCCCHHHHHHH-HHhhccCCChHHhHHHHHHHHHHHHHHHh--hhhhccC
Q psy1937         110 NLDLKLWEIGKI-IGQMWRDLPEDQKTEYVEDYEQEKNLKAS--TKLASIS  157 (307)
Q Consensus       110 ~P~~~~~eisk~-lg~~Wk~Ls~eeK~~Y~~~ae~~K~~k~y--~y~~y~~  157 (307)
                      .|...+..|++. ||.-|+.+|+++++.|.+....-- ...|  ....|..
T Consensus        35 ~~~~D~~~~ar~~LG~~w~~~s~~q~~~F~~~f~~~l-~~~Y~~~l~~y~~   84 (170)
T PF05494_consen   35 DPYFDFERMARRVLGRYWRKASPAQRQRFVEAFKQLL-VRTYAKRLDEYSG   84 (170)
T ss_dssp             GGGB-HHHHHHHHHGGGTTTS-HHHHHHHHHHHHHHH-HHHHHHHHHT-SS
T ss_pred             HHhCCHHHHHHHHHHHhHhhCCHHHHHHHHHHHHHHH-HHHHHHHHHhhCC
Confidence            477788888876 788899999999999999987642 3444  3344443


No 32 
>COG4238 Murein lipoprotein [Cell envelope biogenesis, outer membrane]
Probab=51.04  E-value=1.1e+02  Score=23.49  Aligned_cols=57  Identities=14%  Similarity=0.295  Sum_probs=46.3

Q ss_pred             hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHhhhhHHHHHHHHhhh
Q psy1937         240 VVTTNRMQVLKRQVQSLTMHQKKLEAELQQIEEKFEAKKRKFVESSEQFQEELKKEK  296 (307)
Q Consensus       240 ~~~~~r~~~l~~~~~sL~~hq~kle~el~~~e~~~~~kk~~~~~~s~~f~~~~~~~~  296 (307)
                      -+..++|.-|.-+||.|...+..|+....++...-++-|-.-...-+...+.-+..|
T Consensus        21 c~s~aK~dqlss~vq~LnAkv~qLe~dv~a~~~~~qAAk~eaarAn~rldn~a~s~~   77 (78)
T COG4238          21 CSSNAKIDQLSSDVQTLNAKVDQLENDVNAMRSDVQAAKDEAARANQRLDNQAQSYC   77 (78)
T ss_pred             cchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHhHHHHHHHHHHHHHHHhc
Confidence            345889999999999999999999999999999988877666666556655555555


No 33 
>PF04728 LPP:  Lipoprotein leucine-zipper;  InterPro: IPR006817 This repeating sequence, NAKVDQLSNDV, is found in the enterobacterial outer membrane lipoprotein LPP. The outer membrane lipoprotein is the most abundant protein in an Escherichia coli cell. The messenger RNA for the lipoprotein of the E. coli outer membrane codes for a putative precursor, prolipoprotein, which has 20 additional amino acid residues extending from the amino terminus of the lipoprotein.; GO: 0019867 outer membrane; PDB: 1JCC_A 2GUV_C 2GUS_A 1JCD_A 1KFM_A 1T8Z_D 1KFN_A 1EQ7_A.
Probab=49.22  E-value=1e+02  Score=22.40  Aligned_cols=42  Identities=17%  Similarity=0.345  Sum_probs=34.2

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHhhh
Q psy1937         243 TNRMQVLKRQVQSLTMHQKKLEAELQQIEEKFEAKKRKFVES  284 (307)
Q Consensus       243 ~~r~~~l~~~~~sL~~hq~kle~el~~~e~~~~~kk~~~~~~  284 (307)
                      +++|.-|..+|+.|...+.+|..++..+...-+.-|..-...
T Consensus         2 ~akid~Ls~dVq~L~~kvdqLs~dv~~lr~~v~~ak~EAaRA   43 (56)
T PF04728_consen    2 NAKIDQLSSDVQTLNSKVDQLSSDVNALRADVQAAKEEAARA   43 (56)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             chhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            467889999999999999999999999988777665544333


No 34 
>COG1579 Zn-ribbon protein, possibly nucleic acid-binding [General function prediction only]
Probab=48.93  E-value=1e+02  Score=28.69  Aligned_cols=53  Identities=23%  Similarity=0.466  Sum_probs=47.9

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHhhhhHHHHHHHHhh
Q psy1937         243 TNRMQVLKRQVQSLTMHQKKLEAELQQIEEKFEAKKRKFVESSEQFQEELKKE  295 (307)
Q Consensus       243 ~~r~~~l~~~~~sL~~hq~kle~el~~~e~~~~~kk~~~~~~s~~f~~~~~~~  295 (307)
                      ..|...|..++..|..-+.+|+.++..+.+++..-.+.|.+....+-.++..+
T Consensus       102 k~r~~~le~el~~l~~~~~~l~~~i~~l~~~~~~~e~~~~e~~~~~e~e~~~i  154 (239)
T COG1579         102 KERINSLEDELAELMEEIEKLEKEIEDLKERLERLEKNLAEAEARLEEEVAEI  154 (239)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            56788899999999999999999999999999999999999999998888766


No 35 
>KOG3223|consensus
Probab=46.91  E-value=16  Score=32.98  Aligned_cols=53  Identities=13%  Similarity=0.401  Sum_probs=43.3

Q ss_pred             CCC-CCCccchhchhHHHHHHHHHHcCCCCHHHHHHHHHhhccCCChHHhHHHHHHH
Q psy1937          86 KPP-EKPLMPYMRYSRKVWDQVKAQNLDLKLWEIGKIIGQMWRDLPEDQKTEYVEDY  141 (307)
Q Consensus        86 ~~P-KRPlsAy~lf~~e~r~~v~~e~P~~~~~eisk~lg~~Wk~Ls~eeK~~Y~~~a  141 (307)
                      ..| ||=.-||.-|-...-+.++.++|++.++..-.+|-.+|..-|+.   ||...+
T Consensus       162 rHPEkRmrAA~~afEe~~LPrLK~e~P~lrlsQ~Kqll~Kew~KsPDN---P~Nq~~  215 (221)
T KOG3223|consen  162 RHPEKRMRAAFKAFEEARLPRLKKENPGLRLSQYKQLLKKEWQKSPDN---PFNQAA  215 (221)
T ss_pred             cChHHHHHHHHHHHHHhhchhhhhcCCCccHHHHHHHHHHHHhhCCCC---hhhHHh
Confidence            445 45556788899999999999999999999999999999988875   554443


No 36 
>PF02370 M:  M protein repeat;  InterPro: IPR003345 This short repeat is found in multiple copies in bacterial M proteins. The M proteins bind to IgA and are closely associated with virulence. The M protein has been postulated to be a major group A streptococcal (GAS) virulence factor because of its contribution to the bacterial resistance to opsonophagocytosis [].; PDB: 2KK9_A.
Probab=42.28  E-value=39  Score=19.69  Aligned_cols=17  Identities=35%  Similarity=0.518  Sum_probs=13.6

Q ss_pred             HHHHHHHHHHHHHHHHH
Q psy1937         260 QKKLEAELQQIEEKFEA  276 (307)
Q Consensus       260 q~kle~el~~~e~~~~~  276 (307)
                      .++||++++.+++.++.
T Consensus         3 kk~lEa~~qkLe~e~q~   19 (21)
T PF02370_consen    3 KKQLEADHQKLEAEKQI   19 (21)
T ss_dssp             HHHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHHHHHhh
Confidence            46789999999888864


No 37 
>PF02172 KIX:  KIX domain;  InterPro: IPR003101 The nuclear factor CREB activates transcription of target genes in part through direct interactions with the KIX domain of the coactivator CBP in a phosphorylation-dependent manner []. This provides a model for activator:coactivator interactions. The KIX domain of CBP also binds to transactivation domains of other nuclear factors including Myb and Jun.; GO: 0003712 transcription cofactor activity, 0006355 regulation of transcription, DNA-dependent; PDB: 2KWF_A 1KDX_A 2LQH_A 2LQI_A 1SB0_A 2AGH_B.
Probab=40.26  E-value=80  Score=24.51  Aligned_cols=44  Identities=32%  Similarity=0.574  Sum_probs=28.3

Q ss_pred             cccccch---hhHHhhhcccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy1937         216 ARYLRNH---RLINEIFSDSVVPDVRSVVTTNRMQVLKRQVQSLTMHQKKLEAELQQI  270 (307)
Q Consensus       216 ~Rf~rNH---~l~~Eifs~~~v~d~rs~~~~~r~~~l~~~~~sL~~hq~kle~el~~~  270 (307)
                      +.=.|||   .|+.-||..   ||+... ...||.       +|....+++|.++...
T Consensus        11 t~~lR~hlV~KLv~aI~P~---pdp~a~-~d~rm~-------~l~~yarkvE~~~fe~   57 (81)
T PF02172_consen   11 TPDLRNHLVHKLVQAIFPT---PDPNAM-NDPRMK-------NLIEYARKVEKDMFET   57 (81)
T ss_dssp             -HHHHHHHHHHHHHHHS-S---SSCCCC-CSHHHH-------HHHHHHHHHHHHHHHC
T ss_pred             CHHHHHHHHHHHHHhhCCC---CChhhh-hhHHHH-------HHHHHHHHHHHHHHHH
Confidence            3456777   777777754   566544 578888       4666777788776543


No 38 
>PF12958 DUF3847:  Protein of unknown function (DUF3847);  InterPro: IPR024215 This entry represents a family of uncharacterised proteins that were found by clustering human gut metagenomic sequences [].
Probab=39.37  E-value=8.2  Score=30.36  Aligned_cols=24  Identities=33%  Similarity=0.621  Sum_probs=18.4

Q ss_pred             ccccchhhHHh-hhcCCCCCCCCcc
Q psy1937          14 RYLRNHRLINE-IFSDSVVPDVRSV   37 (307)
Q Consensus        14 ~~~rnh~Li~~-~fs~~~Vpd~rsl   37 (307)
                      |=.||||||.. .+-.+++|++..+
T Consensus        39 RK~RtHRLi~rGa~lEsi~~e~~~l   63 (86)
T PF12958_consen   39 RKERTHRLIERGAILESIFPEPKDL   63 (86)
T ss_pred             HHHHHHHHHHhhHHHHHHhhcchhc
Confidence            55799999994 6667888888554


No 39 
>PF12210 Hrs_helical:  Hepatocyte growth factor-regulated tyrosine kinase substrate;  InterPro: IPR024641 This domain comprises the helical region of hepatocyte growth factor-regulated tyrosine kinase substrate (HRS). It is approximately 100 amino acids in length. Hrs, together with signal transducing adaptor molecule (STAM), forms the ESCRT-0 complex, which sorts ubiquitinated cell surface receptors to lysosomes for degradation []. ; PDB: 3F1I_H.
Probab=38.46  E-value=78  Score=25.42  Aligned_cols=56  Identities=25%  Similarity=0.317  Sum_probs=26.3

Q ss_pred             hhcccCCCCcchhhhHHHHHHHH--HHHHHHHHHHHHHHHH----------HHHHHHHHHHHHhHHhh
Q psy1937         228 IFSDSVVPDVRSVVTTNRMQVLK--RQVQSLTMHQKKLEAE----------LQQIEEKFEAKKRKFVE  283 (307)
Q Consensus       228 ifs~~~v~d~rs~~~~~r~~~l~--~~~~sL~~hq~kle~e----------l~~~e~~~~~kk~~~~~  283 (307)
                      |.+|+.|..+-.-+|.-.-++|+  .+.+....|...|..-          |..|++.|.+|+++..|
T Consensus        28 IanDsaVqsLF~~lt~mH~~LL~~i~~~ee~R~~~E~lQdkL~qi~eAR~AlDalR~eH~~klrr~aE   95 (96)
T PF12210_consen   28 IANDSAVQSLFQTLTAMHPQLLKYIQEQEEKRVYYEGLQDKLAQIKEARAALDALREEHREKLRRQAE   95 (96)
T ss_dssp             GGG-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-
T ss_pred             CcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence            33444333333334444444443  3334444444444444          44556778777775544


No 40 
>PF07200 Mod_r:  Modifier of rudimentary (Mod(r)) protein;  InterPro: IPR009851 This entry represents a conserved region approximately 150 residues long within a number of eukaryotic proteins that show homology with Drosophila melanogaster Modifier of rudimentary (Mod(r)) proteins. The N-terminal half of Mod(r) proteins is acidic, whereas the C-terminal half is basic [], and both of these regions are represented in this family.; PDB: 2CAZ_F 2P22_C 2F66_F.
Probab=33.77  E-value=2.1e+02  Score=23.97  Aligned_cols=70  Identities=17%  Similarity=0.374  Sum_probs=35.2

Q ss_pred             cccchhhHHhhhcccCCCCcchhhh------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHhhhh
Q psy1937         218 YLRNHRLINEIFSDSVVPDVRSVVT------------TNRMQVLKRQVQSLTMHQKKLEAELQQIEEKFEAKKRKFVESS  285 (307)
Q Consensus       218 f~rNH~l~~Eifs~~~v~d~rs~~~------------~~r~~~l~~~~~sL~~hq~kle~el~~~e~~~~~kk~~~~~~s  285 (307)
                      ..-||.++.+++..  .|.+..+..            ..++-.++.++..+..+...+-.++..++..|..+..++-.-+
T Consensus        12 Ll~d~~~l~~~v~~--l~~~~~~~~~~~~l~~~n~~lAe~nL~~~~~l~~~r~~l~~~~~~~~~L~~~~~~k~~~~~~l~   89 (150)
T PF07200_consen   12 LLSDEEKLDAFVKS--LPQVQELQQEREELLAENEELAEQNLSLEPELEELRSQLQELYEELKELESEYQEKEQQQDELS   89 (150)
T ss_dssp             HHHH-HHHHHHGGG--GS--HHHHHHHHHHHHHHHHHHHHH----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             HHcCHHHHHHHHHc--CHHHHHHHHHHHHHHHHHHHHHHHhcccchHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            44566667776654  344333222            1122233345555555556666667777777777666666655


Q ss_pred             HHHH
Q psy1937         286 EQFQ  289 (307)
Q Consensus       286 ~~f~  289 (307)
                      ..|.
T Consensus        90 ~~~s   93 (150)
T PF07200_consen   90 SNYS   93 (150)
T ss_dssp             HCHH
T ss_pred             ccCC
Confidence            5554


No 41 
>PF07334 IFP_35_N:  Interferon-induced 35 kDa protein (IFP 35) N-terminus;  InterPro: IPR009938 This entry represents the N terminus of interferon-induced 35 kDa protein (IFP 35) (approximately 80 residues long), which contains a leucine zipper motif in an alpha helical configuration []. This group of proteins also includes N-myc-interactor (Nmi), a homologous interferon-induced protein.
Probab=32.76  E-value=98  Score=23.82  Aligned_cols=30  Identities=37%  Similarity=0.460  Sum_probs=20.6

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy1937         248 VLKRQVQSLTMHQKKLEAELQQIEEKFEAK  277 (307)
Q Consensus       248 ~l~~~~~sL~~hq~kle~el~~~e~~~~~k  277 (307)
                      .|...-..|.+-.+|||+||++....|+=+
T Consensus         4 ei~eEn~~Lk~eiqkle~ELq~~~~~~qIk   33 (76)
T PF07334_consen    4 EIQEENARLKEEIQKLEAELQQNKREFQIK   33 (76)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHhhhhhc
Confidence            344444567777778888888877776643


No 42 
>PRK09039 hypothetical protein; Validated
Probab=32.66  E-value=1.9e+02  Score=28.17  Aligned_cols=54  Identities=19%  Similarity=0.306  Sum_probs=37.7

Q ss_pred             cchhhhHH--HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHhhhhHHHHH
Q psy1937         237 VRSVVTTN--RMQVLKRQVQSLTMHQKKLEAELQQIEEKFEAKKRKFVESSEQFQE  290 (307)
Q Consensus       237 ~rs~~~~~--r~~~l~~~~~sL~~hq~kle~el~~~e~~~~~kk~~~~~~s~~f~~  290 (307)
                      .+.+.+.+  .+..|+.|+..|......|+++|...+++-.+...++.+-...+++
T Consensus       128 ~k~~~se~~~~V~~L~~qI~aLr~Qla~le~~L~~ae~~~~~~~~~i~~L~~~L~~  183 (343)
T PRK09039        128 EKQVSARALAQVELLNQQIAALRRQLAALEAALDASEKRDRESQAKIADLGRRLNV  183 (343)
T ss_pred             HHHHHHHhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            44444433  3777888888888888889988888888877766666554444433


No 43 
>PF00038 Filament:  Intermediate filament protein;  InterPro: IPR016044 Intermediate filaments (IF) [, , ] are proteins which are primordial components of the cytoskeleton and the nuclear envelope. They generally form filamentous structures 8 to 14 nm wide. IF proteins are members of a very large multigene family of proteins which has been subdivided in five major subgroups:  Type I: Acidic cytokeratins. Type II: Basic cytokeratins. Type III: Vimentin, desmin, glial fibrillary acidic protein (GFAP), peripherin, and plasticin. Type IV: Neurofilaments L, H and M, alpha-internexin and nestin. Type V: Nuclear lamins A, B1, B2 and C.   All IF proteins are structurally similar in that they consist of: a central rod domain comprising some 300 to 350 residues which is arranged in coiled-coiled alpha-helices, with at least two short characteristic interruptions; a N-terminal non-helical domain (head) of variable length; and a C-terminal domain (tail) which is also non-helical, and which shows extreme length variation between different IF proteins. While IF proteins are evolutionary and structurally related, they have limited sequence homologies except in several regions of the rod domain. This entry represents the central rod domain found in IF proteins.; PDB: 3TNU_B 3KLT_D 1GK4_F 3TRT_A 3G1E_A 3UF1_C 1GK6_B 1GK7_A 3TYY_B 3V4W_A ....
Probab=31.73  E-value=2.8e+02  Score=25.94  Aligned_cols=53  Identities=21%  Similarity=0.477  Sum_probs=39.8

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHhhhhHHHHHHHHhh
Q psy1937         243 TNRMQVLKRQVQSLTMHQKKLEAELQQIEEKFEAKKRKFVESSEQFQEELKKE  295 (307)
Q Consensus       243 ~~r~~~l~~~~~sL~~hq~kle~el~~~e~~~~~kk~~~~~~s~~f~~~~~~~  295 (307)
                      ...++.|..++.+|...-..|+..|..++..+......|...-.....+|..+
T Consensus       222 r~~~~~l~~el~~l~~~~~~Le~~l~~le~~~~~~~~~~~~~i~~le~el~~l  274 (312)
T PF00038_consen  222 RRQIQSLQAELESLRAKNASLERQLRELEQRLDEEREEYQAEIAELEEELAEL  274 (312)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             HhhhhHhhhhhhccccchhhhhhhHHHHHHHHHHHHHHHHHhhhccchhHHHH
Confidence            33467777777788888888888888888888888777777766666666554


No 44 
>cd07598 BAR_FAM92 The Bin/Amphiphysin/Rvs (BAR) domain of Family with sequence similarity 92 (FAM92). BAR domains are dimerization, lipid binding and curvature sensing modules found in many different proteins with diverse functions including organelle biogenesis, membrane trafficking or remodeling, and cell division and migration. This group is composed of proteins from the family with sequence similarity 92 (FAM92), which were originally identified by the presence of the unknown domain DUF1208. This domain shows similarity to the BAR domains of sorting nexins. Mammals contain at least two member types, FAM92A and FAM92B, which may exist in many variants. The Xenopus homolog of FAM92A1, xVAP019, is essential for embryo survival and cell differentiation. FAM92A1 may be involved in regulating cell proliferation and apoptosis. BAR domains form dimers that bind to membranes, induce membrane bending and curvature, and may also be involved in protein-protein interactions.
Probab=27.36  E-value=2.7e+02  Score=25.22  Aligned_cols=50  Identities=8%  Similarity=0.277  Sum_probs=41.3

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHhhhhHHHHHHHHhhh
Q psy1937         243 TNRMQVLKRQVQSLTMHQKKLEAELQQIEEKFEAKKRKFVESSEQFQEELKKEK  296 (307)
Q Consensus       243 ~~r~~~l~~~~~sL~~hq~kle~el~~~e~~~~~kk~~~~~~s~~f~~~~~~~~  296 (307)
                      .++.++++.+|.+|..|-.+||..+..+=.    +.+++...++.|...+.-+.
T Consensus         3 ~~~~k~i~~~i~~lE~hl~~l~~~~~~lv~----k~~~L~~~~~~fak~~~~la   52 (211)
T cd07598           3 DNQTKFIQERITNVEKHFGELCQDFAAYTR----KTARLRDKGDELAKSINAYA   52 (211)
T ss_pred             hHHHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHHHHHHhHHHHHHHHHHHH
Confidence            467899999999999999999999887654    44577888888988877664


No 45 
>PF12718 Tropomyosin_1:  Tropomyosin like;  InterPro: IPR000533 Tropomyosins [], are a family of closely related proteins present in muscle and non-muscle cells. In striated muscle, tropomyosin mediate the interactions between the troponin complex and actin so as to regulate muscle contraction []. The role of tropomyosin in smooth muscle and non-muscle tissues is not clear. Tropomyosin is an alpha-helical protein that forms a coiled-coil structure of 2 parallel helices containing 2 sets of 7 alternating actin binding sites []. There are multiple cell-specific isoforms, created by differential splicing of the messenger RNA from one gene, but the proportions of the isoforms vary between different cell types. Muscle isoforms of tropomyosin are characterised by having 284 amino acid residues and a highly conserved N-terminal region, whereas non-muscle forms are generally smaller and are heterogeneous in their N-terminal region. This entry represents tropomyosin (Tmp) 1, 2 and 3. Within the yeast Tmp1 and Tmp2, biochemical and sequence analyses indicate that Tpm2 spans four actin monomers along a filament, whereas Tpm1 spans five. Despite its shorter length, Tpm2 can compete with Tpm1 for binding to F-actin. Over-expression of Tpm2 in vivo alters the axial budding of haploids to a bipolar pattern, and this can be partially suppressed by co-over-expression of Tpm1. This suggests distinct functions for the two tropomyosins, and indicates that the ratio between them is important for correct morphogenesis [].
Probab=27.25  E-value=2.9e+02  Score=23.43  Aligned_cols=46  Identities=26%  Similarity=0.419  Sum_probs=34.9

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHhhhhHHH
Q psy1937         243 TNRMQVLKRQVQSLTMHQKKLEAELQQIEEKFEAKKRKFVESSEQF  288 (307)
Q Consensus       243 ~~r~~~l~~~~~sL~~hq~kle~el~~~e~~~~~kk~~~~~~s~~f  288 (307)
                      ..++.-+-.+|.+|...-..||.+|..+++....-+.++.++...+
T Consensus        27 e~~~~~~E~EI~sL~~K~~~lE~eld~~~~~l~~~k~~lee~~~~~   72 (143)
T PF12718_consen   27 EQENEQKEQEITSLQKKNQQLEEELDKLEEQLKEAKEKLEESEKRK   72 (143)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHHHH
Confidence            3455556678888888888888888888888888887777765543


No 46 
>PF07352 Phage_Mu_Gam:  Bacteriophage Mu Gam like protein;  InterPro: IPR009951 The Gam protein, originally characterised in Bacteriophage Mu, protects linear double stranded DNA from exonuclease degradation in vitro and in vivo []. This protein is also found in many bacterial species as part of a suspected prophage. Further studies have shown that Gam is a functional counterpart of the eukaryotic Ku protein, which has key roles in DNA repair and in certain transposition events. Gam displays DNA binding characteristics remarkably similar to those of human Ku []. In addition, Gam can interfere with Ty1 retrotransposition in Saccharomyces cerevisiae (Baker's yeast). These data reveal structural and functional parallels between bacteriophage Gam and eukaryotic Ku and suggest that their functions have been evolutionarily conserved [].; GO: 0003690 double-stranded DNA binding, 0042262 DNA protection; PDB: 2P2U_B.
Probab=26.62  E-value=3.2e+02  Score=23.02  Aligned_cols=53  Identities=17%  Similarity=0.287  Sum_probs=42.9

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHhhhhHHHHHHHHhhhccC
Q psy1937         244 NRMQVLKRQVQSLTMHQKKLEAELQQIEEKFEAKKRKFVESSEQFQEELKKEKASC  299 (307)
Q Consensus       244 ~r~~~l~~~~~sL~~hq~kle~el~~~e~~~~~kk~~~~~~s~~f~~~~~~~~~~~  299 (307)
                      .+|.-|++++..+..   .+..++..|.+.+++....+...-+.|..-|.-.|...
T Consensus        10 ~ki~~l~~~~~~i~~---~~~~~I~~i~~~~~~~~~~l~~~i~~l~~~l~~y~e~~   62 (149)
T PF07352_consen   10 RKIAELQREIARIEA---EANDEIARIKEWYEAEIAPLQNRIEYLEGLLQAYAEAN   62 (149)
T ss_dssp             HHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHCHHHHHHHHHHHHHHHHHHHCT
T ss_pred             HHHHHHHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHC
Confidence            356677777766654   55778999999999999999999999999999888543


No 47 
>COG3937 Uncharacterized conserved protein [Function unknown]
Probab=26.61  E-value=1.1e+02  Score=25.03  Aligned_cols=37  Identities=32%  Similarity=0.449  Sum_probs=24.0

Q ss_pred             HHhhhcccCCCCcchhhhHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy1937         225 INEIFSDSVVPDVRSVVTTNRMQVLKRQVQSLTMHQKKLEAEL  267 (307)
Q Consensus       225 ~~Eifs~~~v~d~rs~~~~~r~~~l~~~~~sL~~hq~kle~el  267 (307)
                      |.+++++.-      |+++.=|+.|...|+.|.....+|++++
T Consensus        70 i~~ml~~~~------~~r~~~~~~l~~rvd~Lerqv~~Lenk~  106 (108)
T COG3937          70 IEEMLSDLE------VARQSEMDELTERVDALERQVADLENKL  106 (108)
T ss_pred             HHHHHhhcc------ccccchHHHHHHHHHHHHHHHHHHHHHh
Confidence            888888732      5555556666666666666666666665


No 48 
>COG3105 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=26.41  E-value=1.9e+02  Score=24.62  Aligned_cols=28  Identities=21%  Similarity=0.464  Sum_probs=22.4

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHhHHhhhh
Q psy1937         258 MHQKKLEAELQQIEEKFEAKKRKFVESS  285 (307)
Q Consensus       258 ~hq~kle~el~~~e~~~~~kk~~~~~~s  285 (307)
                      -+|.+|+.||+.++.++++-++.++.-+
T Consensus        34 k~q~~~q~ELe~~K~~ld~~rqel~~HF   61 (138)
T COG3105          34 KQQQKLQYELEKVKAQLDEYRQELVKHF   61 (138)
T ss_pred             hhHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            4688999999999999998887765543


No 49 
>PF10211 Ax_dynein_light:  Axonemal dynein light chain;  InterPro: IPR019347  Axonemal dynein light chain proteins play a dynamic role in flagellar and cilial motility. Eukaryotic cilia and flagella are complex organelles consisting of a core structure, the axoneme, which is composed of nine microtubule doublets forming a cylinder that surrounds a pair of central singlet microtubules. This ultra-structural arrangement seems to be one of the most stable micro-tubular assemblies known and is responsible for the flagellar and ciliary movement of a large number of organisms ranging from protozoan to mammals. This light chain interacts directly with the N-terminal half of the heavy chains []. 
Probab=24.46  E-value=4.6e+02  Score=23.24  Aligned_cols=34  Identities=18%  Similarity=0.342  Sum_probs=25.0

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy1937         245 RMQVLKRQVQSLTMHQKKLEAELQQIEEKFEAKK  278 (307)
Q Consensus       245 r~~~l~~~~~sL~~hq~kle~el~~~e~~~~~kk  278 (307)
                      ++..|+.....|..+...|+..+..++.+.++.+
T Consensus       128 ~i~~L~~e~~~L~~~~~~l~~~~e~~ek~~~e~~  161 (189)
T PF10211_consen  128 EIEELEEEKEELEKQVQELKNKCEQLEKREEELR  161 (189)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            5666777777788888888888777777776543


No 50 
>PRK06800 fliH flagellar assembly protein H; Validated
Probab=24.38  E-value=3.3e+02  Score=24.68  Aligned_cols=42  Identities=33%  Similarity=0.582  Sum_probs=24.5

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHH---HHHHHHhHHhhhhHHHHHH
Q psy1937         250 KRQVQSLTMHQKKLEAELQQIEE---KFEAKKRKFVESSEQFQEE  291 (307)
Q Consensus       250 ~~~~~sL~~hq~kle~el~~~e~---~~~~kk~~~~~~s~~f~~~  291 (307)
                      ...-+.|..||+.|+.|+.+|+.   ..+..|..++..-+.|..-
T Consensus        37 ~~d~~~L~~~Q~~L~~e~~~l~~eqQ~l~~er~~l~~er~~~~~~   81 (228)
T PRK06800         37 QKDHEELLAQQKSLHKELNQLRQEQQKLERERQQLLADREQFQEH   81 (228)
T ss_pred             hhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            34445566666666666665543   3445666666666666543


No 51 
>PF03915 AIP3:  Actin interacting protein 3;  InterPro: IPR022782 This entry represents a domain found in yeast actin interacting protein 3 and bud site selection protein 6. In these proteins it is typically found towards the C terminus. It is also found in metazoan proteins, such as the mouse enhancer trap locus 4 protein. ; PDB: 3ONX_B 3OKQ_A.
Probab=24.36  E-value=2.2e+02  Score=28.75  Aligned_cols=59  Identities=19%  Similarity=0.315  Sum_probs=39.2

Q ss_pred             CcchhhhHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHhhhhHHHHHHHHhhhc
Q psy1937         236 DVRSVVTTNRMQVLKRQVQSLTMHQKKLEAELQQIEEKFEAKKRKFVESSEQFQEELKKEKA  297 (307)
Q Consensus       236 d~rs~~~~~r~~~l~~~~~sL~~hq~kle~el~~~e~~~~~kk~~~~~~s~~f~~~~~~~~~  297 (307)
                      ++|-=|++-|.....+|+..+......+..+|..+++-...-|.-|-.-   .-.||..||.
T Consensus       231 ~LRkDV~~RgvRp~~~qle~v~kdi~~a~~~L~~m~~~i~~~kp~WkKi---WE~EL~~V~e  289 (424)
T PF03915_consen  231 DLRKDVVQRGVRPSPKQLETVAKDISRASKELKKMKEYIKTEKPIWKKI---WESELQKVCE  289 (424)
T ss_dssp             HHHHHHHHH-----HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHH
T ss_pred             HHHHHHHHcCCcCCHHHHHHHHHHHHHHHHHHHHHHHHHHHhCHHHHHH---HHHHHHHHHH
Confidence            4566666666777778888888888888889998888887766666544   4456666663


No 52 
>PF11304 DUF3106:  Protein of unknown function (DUF3106);  InterPro: IPR021455  Some members in this family of proteins are annotated as transmembrane proteins however this cannot be confirmed. Currently no function is known. 
Probab=23.99  E-value=2.1e+02  Score=23.13  Aligned_cols=39  Identities=15%  Similarity=0.329  Sum_probs=20.8

Q ss_pred             HHHHHcCCCCHHHHHHHHHh--hccCCChHHhHHHHHHHHH
Q psy1937         105 QVKAQNLDLKLWEIGKIIGQ--MWRDLPEDQKTEYVEDYEQ  143 (307)
Q Consensus       105 ~v~~e~P~~~~~eisk~lg~--~Wk~Ls~eeK~~Y~~~ae~  143 (307)
                      .+...++.++..+-.++...  .|.+||++++..--+.+..
T Consensus        33 ~~a~r~~~mspeqq~r~~~rm~~W~~LspeqR~~~R~~~~~   73 (107)
T PF11304_consen   33 QIAERWPSMSPEQQQRLRERMRRWAALSPEQRQQARENYQR   73 (107)
T ss_pred             HHHHHHhcCCHHHHHHHHHHHHHHHhCCHHHHHHHHHHHHH
Confidence            34445566665554444442  3666666666554444443


No 53 
>PRK15335 type III secretion system protein SpaM; Provisional
Probab=23.20  E-value=1.9e+02  Score=24.64  Aligned_cols=45  Identities=20%  Similarity=0.533  Sum_probs=28.2

Q ss_pred             HHHHHHHHHHHHHHHHHHHHH---HHHHHHHHHHHHHhHHhhhhHHHH
Q psy1937         245 RMQVLKRQVQSLTMHQKKLEA---ELQQIEEKFEAKKRKFVESSEQFQ  289 (307)
Q Consensus       245 r~~~l~~~~~sL~~hq~kle~---el~~~e~~~~~kk~~~~~~s~~f~  289 (307)
                      |-.|++||.++|.-.--.+++   ||+.-.+.++.+..-|+.-++++.
T Consensus        74 rqaivRRQ~~~L~Lq~~~iqEKr~elqkeke~~~k~~~yWLRKe~kY~  121 (147)
T PRK15335         74 KQSIVRRQIKDLELQIIQIQEKRSELEKKREEFQKKSKYWLRKEGNYQ  121 (147)
T ss_pred             HHHHHHHHHHhccchhHHHHHHHHHHHHHHHHHHHHHHHHHHhccchH
Confidence            467888888887655444443   555556666666666666665543


No 54 
>PRK10361 DNA recombination protein RmuC; Provisional
Probab=23.11  E-value=4e+02  Score=27.44  Aligned_cols=52  Identities=13%  Similarity=0.315  Sum_probs=34.8

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHHhhhhHHHHHHHHhhh
Q psy1937         245 RMQVLKRQVQSLTMHQKKLEAELQQIEEKFEAKKRKFVESSEQFQEELKKEK  296 (307)
Q Consensus       245 r~~~l~~~~~sL~~hq~kle~el~~~e~~~~~kk~~~~~~s~~f~~~~~~~~  296 (307)
                      +...+..+...+......|++.|...++.+++++.-+.++.+.+..+.+-+.
T Consensus        68 ~l~~~~~~~~~~~~~~~~l~~~le~~~~~~~ek~~~l~~~~~~L~~~F~~LA  119 (475)
T PRK10361         68 EVRSLQSINTSLEADLREVTTRMEAAQQHADDKIRQMINSEQRLSEQFENLA  119 (475)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            3445555566666667777777777777777777777766666666665543


No 55 
>PF09006 Surfac_D-trimer:  Lung surfactant protein D coiled-coil trimerisation;  InterPro: IPR015097 This domain is found in the SFTPD family, which includes lung surfactant protein D (SFTPD), conglutinin, collectin-43 and collectin-46. It forms a triple-helical parallel coiled coil, and mediates trimerisation of the protein []. ; PDB: 4DN8_A 3G84_A 2RIE_C 3IKR_B 1B08_A 2GGX_B 2OS9_C 2ORK_B 1PWB_A 2RIA_C ....
Probab=22.96  E-value=2.3e+02  Score=19.78  Aligned_cols=23  Identities=30%  Similarity=0.512  Sum_probs=12.9

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHH
Q psy1937         247 QVLKRQVQSLTMHQKKLEAELQQ  269 (307)
Q Consensus       247 ~~l~~~~~sL~~hq~kle~el~~  269 (307)
                      ..|+.||..|....+-|+.-+.+
T Consensus         2 ~aLrqQv~aL~~qv~~Lq~~fs~   24 (46)
T PF09006_consen    2 NALRQQVEALQGQVQRLQAAFSQ   24 (46)
T ss_dssp             HHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             hHHHHHHHHHHHHHHHHHHHHHH
Confidence            45777777655555555544433


No 56 
>KOG0241|consensus
Probab=22.53  E-value=1e+02  Score=34.64  Aligned_cols=90  Identities=24%  Similarity=0.302  Sum_probs=51.9

Q ss_pred             cccCCCCcccccc-ccccccchhhHHhhhcccCCCCcchhhhHHHHHHHHH---HHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy1937         202 DEQEDGYSVKHVA-YARYLRNHRLINEIFSDSVVPDVRSVVTTNRMQVLKR---QVQSLTMHQKKLEAELQQIEEKFEAK  277 (307)
Q Consensus       202 dd~ddg~S~k~~a-~~Rf~rNH~l~~Eifs~~~v~d~rs~~~~~r~~~l~~---~~~sL~~hq~kle~el~~~e~~~~~k  277 (307)
                      |+||+.||+=+-| .|+-.-||-.+||==.-.+..|+|--|-+=|+++-+.   ++..|.-|-.++|.=+.+|...+++|
T Consensus       333 dnyeeTlStLRYadrAkrIvN~avvNedpnarvirElReEve~lr~qL~~ae~~~~~el~e~l~esekli~ei~~twEEk  412 (1714)
T KOG0241|consen  333 DNYEETLSTLRYADRAKRIVNHAVVNEDPNARVIRELREEVEKLREQLEQAEAMKLPELKEKLEESEKLIKEITVTWEEK  412 (1714)
T ss_pred             cchHHHHHHHHHHHHHHHhhccccccCCchHHHHHHHHHHHHHHHHHHhhhhhccchHHHHHHHHHHHHHHHHHhHHHHH
Confidence            6788888874432 3555666665555333333344444455555554442   33445556666666677888888888


Q ss_pred             HhHHhhhhHHHHHHHHhh
Q psy1937         278 KRKFVESSEQFQEELKKE  295 (307)
Q Consensus       278 k~~~~~~s~~f~~~~~~~  295 (307)
                      .++..    ..+.|+++.
T Consensus       413 l~ktE----~in~erq~~  426 (1714)
T KOG0241|consen  413 LRKTE----EINQERQAQ  426 (1714)
T ss_pred             HHHHH----HHHHHHHHH
Confidence            76654    445555544


No 57 
>PF09340 NuA4:  Histone acetyltransferase subunit NuA4;  InterPro: IPR015418 The NuA4 histone acetyltransferase (HAT) multisubunit complex is responsible for acetylation of histone H4 and H2A N-terminal tails in yeast []. NuA4 complexes are highly conserved in eukaryotes and play primary roles in transcription, cellular response to DNA damage, and cell cycle control []. 
Probab=21.99  E-value=2.1e+02  Score=21.92  Aligned_cols=32  Identities=28%  Similarity=0.530  Sum_probs=18.5

Q ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHhHHhh
Q psy1937         252 QVQSLTMHQKKLEAELQQIEEKFEAKKRKFVE  283 (307)
Q Consensus       252 ~~~sL~~hq~kle~el~~~e~~~~~kk~~~~~  283 (307)
                      ++..|....++|+.+|..||..--.+=-.+++
T Consensus         3 ~L~~l~~~k~~Le~~L~~lE~qIy~~Et~YL~   34 (80)
T PF09340_consen    3 ELKELLQKKKKLEKDLAALEKQIYDKETSYLE   34 (80)
T ss_pred             HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            34456666666666666666655444444444


No 58 
>PRK10328 DNA binding protein, nucleoid-associated; Provisional
Probab=21.43  E-value=2.6e+02  Score=23.69  Aligned_cols=55  Identities=15%  Similarity=0.354  Sum_probs=29.6

Q ss_pred             HHhhhcccCCCCcchhhhHHH---HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhHH
Q psy1937         225 INEIFSDSVVPDVRSVVTTNR---MQVLKRQVQSLTMHQKKLEAELQQIEEKFEAKKRKF  281 (307)
Q Consensus       225 ~~Eifs~~~v~d~rs~~~~~r---~~~l~~~~~sL~~hq~kle~el~~~e~~~~~kk~~~  281 (307)
                      |+|++  -+...+||+....|   +..|..=.+-|......-.++.......-+++..++
T Consensus         1 ms~~l--k~l~n~R~lra~~re~~~e~Lee~~ekl~~vv~er~~~~~~~~~~~~er~~~l   58 (134)
T PRK10328          1 MSVML--QSLNNIRTLRAMAREFSIDVLEEMLEKFRVVTKERREEEEQQQRELAERQEKI   58 (134)
T ss_pred             CcHHH--HHHhhHHHHHHHHHhCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            45555  24557888877776   455555555555555554444444444444444433


No 59 
>PF04340 DUF484:  Protein of unknown function, DUF484;  InterPro: IPR007435 This family consists of several proteins of uncharacterised function.; PDB: 3E98_B.
Probab=20.42  E-value=2.2e+02  Score=25.66  Aligned_cols=63  Identities=24%  Similarity=0.428  Sum_probs=19.9

Q ss_pred             CCccccccccccccchhh----HHhhhcccCCCCcchhhhHHH-HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy1937         207 GYSVKHVAYARYLRNHRL----INEIFSDSVVPDVRSVVTTNR-MQVLKRQVQSLTMHQKKLEAELQQIEEKFEA  276 (307)
Q Consensus       207 g~S~k~~a~~Rf~rNH~l----~~Eifs~~~v~d~rs~~~~~r-~~~l~~~~~sL~~hq~kle~el~~~e~~~~~  276 (307)
                      .++...|+.  |-++|.-    ..|+|..-.||...     .+ +....+|++.|...-+.|+.+|..|-....+
T Consensus         5 ~l~~~~V~~--yL~~~PdFf~~~~~ll~~l~~ph~~-----~~avSL~erQ~~~LR~~~~~L~~~l~~Li~~Ar~   72 (225)
T PF04340_consen    5 ALDAEDVAA--YLRQHPDFFERHPELLAELRLPHPS-----GGAVSLVERQLERLRERNRQLEEQLEELIENARE   72 (225)
T ss_dssp             -------------------------------------------HHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             cCCHHHHHH--HHHhCcHHHHhCHHHHHHcCCCCCC-----CCcccHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            455566666  7776631    12333333344321     22 3333477777777777777777777665543


Done!