Query         psy14923
Match_columns 180
No_of_seqs    187 out of 1489
Neff          7.6 
Searched_HMMs 46136
Date          Fri Aug 16 15:50:08 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy14923.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/14923hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 PTZ00199 high mobility group p  99.9 3.8E-23 8.3E-28  145.7  11.3   83   35-117     9-93  (94)
  2 cd01389 MATA_HMG-box MATA_HMG-  99.8 1.9E-20 4.2E-25  127.1   8.5   72   48-119     1-72  (77)
  3 COG5648 NHP6B Chromatin-associ  99.8 3.4E-20 7.3E-25  145.2   8.6   85   37-121    59-143 (211)
  4 cd01388 SOX-TCF_HMG-box SOX-TC  99.8 6.8E-20 1.5E-24  122.9   8.6   69   49-117     2-70  (72)
  5 PF00505 HMG_box:  HMG (high mo  99.8 1.6E-19 3.6E-24  119.4   9.4   69   49-117     1-69  (69)
  6 cd01390 HMGB-UBF_HMG-box HMGB-  99.8 4.3E-19 9.2E-24  116.4   9.4   65   49-113     1-65  (66)
  7 smart00398 HMG high mobility g  99.8 7.6E-19 1.7E-23  116.0   9.6   70   48-117     1-70  (70)
  8 PF09011 HMG_box_2:  HMG-box do  99.8 7.1E-19 1.5E-23  118.3   9.6   72   46-117     1-73  (73)
  9 cd00084 HMG-box High Mobility   99.7   3E-17 6.5E-22  107.1   9.4   65   49-113     1-65  (66)
 10 KOG0381|consensus               99.7 8.2E-17 1.8E-21  113.3  11.2   76   45-120    17-95  (96)
 11 KOG0527|consensus               99.7   1E-16 2.2E-21  135.2   9.7   79   42-120    56-134 (331)
 12 KOG0526|consensus               99.6   2E-15 4.4E-20  131.7   7.5   81   35-119   522-602 (615)
 13 KOG4715|consensus               99.6 1.8E-14   4E-19  118.8  12.1  111    6-119    19-135 (410)
 14 KOG3248|consensus               99.2 1.7E-11 3.7E-16  102.2   6.8   73   47-119   190-262 (421)
 15 KOG0528|consensus               99.1 8.6E-11 1.9E-15  102.0   3.9   80   42-121   319-398 (511)
 16 PF14887 HMG_box_5:  HMG (high   98.5 1.1E-06 2.4E-11   58.8   7.5   74   48-122     3-76  (85)
 17 PTZ00199 high mobility group p  98.4 1.5E-07 3.3E-12   66.1   1.9   36  142-177    11-46  (94)
 18 KOG2746|consensus               98.2 1.5E-06 3.2E-11   78.6   4.8   69   44-112   177-247 (683)
 19 PF09011 HMG_box_2:  HMG-box do  97.5 8.1E-05 1.8E-09   49.5   2.6   25  151-175     1-25  (73)
 20 cd01390 HMGB-UBF_HMG-box HMGB-  97.4  0.0001 2.3E-09   47.5   2.5   24  154-177     1-24  (66)
 21 PF04690 YABBY:  YABBY protein;  97.3  0.0005 1.1E-08   53.2   5.8   49   43-91    116-164 (170)
 22 PF00505 HMG_box:  HMG (high mo  97.3 0.00021 4.5E-09   46.5   2.4   24  154-177     1-24  (69)
 23 smart00398 HMG high mobility g  97.2 0.00024 5.3E-09   46.0   2.5   25  153-177     1-25  (70)
 24 PF06382 DUF1074:  Protein of u  97.2  0.0011 2.3E-08   51.4   5.6   49   53-105    83-131 (183)
 25 cd01389 MATA_HMG-box MATA_HMG-  97.1 0.00035 7.6E-09   46.9   2.6   25  153-177     1-25  (77)
 26 cd00084 HMG-box High Mobility   97.1 0.00035 7.5E-09   44.7   2.5   24  154-177     1-24  (66)
 27 KOG0526|consensus               97.1 0.00029 6.3E-09   62.7   2.3   36  143-178   525-560 (615)
 28 COG5648 NHP6B Chromatin-associ  97.1 0.00053 1.1E-08   54.4   3.3   68   47-114   142-209 (211)
 29 cd01388 SOX-TCF_HMG-box SOX-TC  96.8  0.0011 2.3E-08   44.0   2.5   24  154-177     2-25  (72)
 30 PF08073 CHDNT:  CHDNT (NUC034)  96.3  0.0062 1.4E-07   38.4   3.4   40   53-92     13-52  (55)
 31 KOG0381|consensus               96.1  0.0047   1E-07   42.9   2.7   28  150-177    17-46  (96)
 32 PF06244 DUF1014:  Protein of u  92.2    0.21 4.5E-06   36.7   3.7   45   48-92     72-116 (122)
 33 KOG3248|consensus               91.1    0.12 2.6E-06   44.1   1.6   23  153-175   191-213 (421)
 34 KOG0527|consensus               90.5    0.17 3.7E-06   43.3   2.0   30  148-177    57-86  (331)
 35 PF11304 DUF3106:  Protein of u  90.0       3 6.4E-05   29.8   7.8   53   77-130    10-69  (107)
 36 PF04769 MAT_Alpha1:  Mating-ty  89.0     1.4 3.1E-05   35.1   6.1   55   43-103    38-92  (201)
 37 TIGR03481 HpnM hopanoid biosyn  86.1     2.1 4.5E-05   33.9   5.5   46   73-118    63-110 (198)
 38 cd00225 API3 Ascaris pepsin in  83.3     1.9 4.1E-05   32.7   3.8   29   84-120    25-53  (159)
 39 PRK15117 ABC transporter perip  83.0       4 8.7E-05   32.6   5.9   48   72-119    66-115 (211)
 40 PF11304 DUF3106:  Protein of u  79.4      20 0.00043   25.5   9.5   32   68-99     35-68  (107)
 41 PF12881 NUT_N:  NUT protein N   77.2     6.8 0.00015   33.3   5.6   53   53-105   229-281 (328)
 42 KOG3223|consensus               76.6       4 8.7E-05   32.3   3.8   44   50-93    166-209 (221)
 43 PF05494 Tol_Tol_Ttg2:  Toluene  75.4       5 0.00011   30.5   4.2   47   72-118    36-84  (170)
 44 KOG3838|consensus               65.5     7.6 0.00016   34.2   3.5   37   89-125   268-304 (497)
 45 PF04690 YABBY:  YABBY protein;  65.3       6 0.00013   30.7   2.6   27  151-177   119-145 (170)
 46 PF01352 KRAB:  KRAB box;  Inte  63.6     5.7 0.00012   23.3   1.7   29   76-104     3-32  (41)
 47 PF13875 DUF4202:  Domain of un  62.7      19 0.00042   28.3   5.0   39   55-96    131-169 (185)
 48 PF06945 DUF1289:  Protein of u  56.3      21 0.00045   21.9   3.4   26   75-105    23-48  (51)
 49 PRK09706 transcriptional repre  52.9      53  0.0011   23.9   5.8   42   78-119    87-128 (135)
 50 KOG0528|consensus               49.2     6.9 0.00015   35.1   0.6   20  153-172   325-344 (511)
 51 PF12650 DUF3784:  Domain of un  43.1      17 0.00036   25.1   1.6   15   86-100    25-39  (97)
 52 COG2854 Ttg2D ABC-type transpo  41.7      37 0.00079   27.2   3.5   41   81-121    78-119 (202)
 53 PRK10236 hypothetical protein;  40.3      33 0.00072   28.1   3.1   26   79-104   118-143 (237)
 54 KOG1610|consensus               32.8 1.3E+02  0.0029   25.7   5.7   39   76-117   218-256 (322)
 55 PRK12751 cpxP periplasmic stre  32.2 1.2E+02  0.0026   23.3   5.0   23   83-105   123-145 (162)
 56 PRK12750 cpxP periplasmic repr  31.9 1.4E+02   0.003   23.0   5.4   23  111-133   128-150 (170)
 57 cd07081 ALDH_F20_ACDH_EutE-lik  31.5 1.2E+02  0.0027   26.8   5.6   41   78-118     6-46  (439)
 58 PF15581 Imm35:  Immunity prote  29.1      92   0.002   21.6   3.4   23   75-97     31-53  (93)
 59 PRK14296 chaperone protein Dna  26.8 1.3E+02  0.0027   26.2   4.8   38   64-101    26-67  (372)
 60 PF02026 RyR:  RyR domain;  Int  26.3      69  0.0015   22.2   2.5   21   86-106    60-80  (94)
 61 PF06628 Catalase-rel:  Catalas  26.0      76  0.0016   20.4   2.6   19   82-100    12-30  (68)
 62 PF13945 NST1:  Salt tolerance   25.7 1.9E+02   0.004   22.9   5.1   37   76-115   100-136 (190)
 63 KOG4715|consensus               25.6      53  0.0012   28.2   2.1   29  149-177    60-88  (410)
 64 PF05388 Carbpep_Y_N:  Carboxyp  25.4 1.4E+02   0.003   21.5   4.0   28   76-103    45-72  (113)
 65 PRK14279 chaperone protein Dna  24.2 1.3E+02  0.0027   26.4   4.3   41   62-102    29-74  (392)
 66 PF08367 M16C_assoc:  Peptidase  24.0 1.7E+02  0.0036   23.6   4.8   30   77-106    13-42  (248)
 67 cd07122 ALDH_F20_ACDH Coenzyme  23.7 2.2E+02  0.0048   25.2   5.9   40   79-118     7-46  (436)
 68 PRK14283 chaperone protein Dna  23.7 1.8E+02  0.0039   25.2   5.1   41   62-102    25-69  (378)
 69 TIGR00787 dctP tripartite ATP-  22.8 1.9E+02  0.0041   23.2   4.9   26   84-109   213-238 (257)
 70 PF06394 Pepsin-I3:  Pepsin inh  22.8      82  0.0018   21.1   2.2   26   88-121    38-63  (76)
 71 PRK14291 chaperone protein Dna  22.2 1.8E+02  0.0039   25.2   4.9   40   62-101    23-66  (382)
 72 cd07132 ALDH_F3AB Aldehyde deh  22.2 2.2E+02  0.0047   25.2   5.5   41   78-118     5-45  (443)
 73 PTZ00037 DnaJ_C chaperone prot  22.1 1.7E+02  0.0037   25.9   4.8   43   60-102    46-89  (421)
 74 cd08317 Death_ank Death domain  21.8      50  0.0011   22.1   1.1   21   72-92      3-23  (84)
 75 cd07085 ALDH_F6_MMSDH Methylma  21.2 2.5E+02  0.0055   24.9   5.7   37   80-116    47-83  (478)
 76 cd07133 ALDH_CALDH_CalB Conife  21.0 2.4E+02  0.0051   24.8   5.5   40   78-117     5-44  (434)
 77 PRK14299 chaperone protein Dna  20.8 2.2E+02  0.0047   23.7   5.0   39   63-101    25-67  (291)
 78 KOG1827|consensus               20.6     6.1 0.00013   36.7  -4.7   44   52-95    552-595 (629)
 79 PRK10266 curved DNA-binding pr  20.4 2.5E+02  0.0053   23.5   5.2   39   63-101    25-67  (306)
 80 PF09164 VitD-bind_III:  Vitami  20.1 2.7E+02  0.0058   18.2   4.8   32   54-85      9-40  (68)
 81 PRK10363 cpxP periplasmic repr  20.1 2.9E+02  0.0063   21.4   5.1   23   83-105   117-139 (166)

No 1  
>PTZ00199 high mobility group protein; Provisional
Probab=99.90  E-value=3.8e-23  Score=145.72  Aligned_cols=83  Identities=29%  Similarity=0.555  Sum_probs=77.3

Q ss_pred             cccccCCCCCCCCCCCCCCChHHHHHHHHHHHHHHhCCCCC--HHHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHH
Q psy14923         35 LKQQTSQPLSIPEKPKKPLTPYFKFQGQIRNKIMERNPNIK--IIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQN  112 (180)
Q Consensus        35 ~k~~~~k~~~~p~~PKrP~say~lF~~e~r~~~~~~~p~~~--~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~  112 (180)
                      .+++..++.+||+.|++|+|||++|+.++|..|..+||+++  +++|+++||++|+.||++||.+|+++|++++++|..+
T Consensus         9 ~~k~~~k~~kdp~~PKrP~sAY~~F~~~~R~~i~~~~P~~~~~~~evsk~ige~Wk~ls~eeK~~y~~~A~~dk~rY~~e   88 (94)
T PTZ00199          9 LVRKNKRKKKDPNAPKRALSAYMFFAKEKRAEIIAENPELAKDVAAVGKMVGEAWNKLSEEEKAPYEKKAQEDKVRYEKE   88 (94)
T ss_pred             cccccCCCCCCCCCCCCCCcHHHHHHHHHHHHHHHHCcCCcccHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHHHH
Confidence            34455567899999999999999999999999999999986  8999999999999999999999999999999999999


Q ss_pred             HHhhh
Q psy14923        113 VKKYH  117 (180)
Q Consensus       113 ~~~y~  117 (180)
                      |.+|.
T Consensus        89 ~~~Y~   93 (94)
T PTZ00199         89 KAEYA   93 (94)
T ss_pred             HHHHh
Confidence            99995


No 2  
>cd01389 MATA_HMG-box MATA_HMG-box, class I member of the HMG-box superfamily of DNA-binding proteins. These proteins contain a single HMG box, and bind the minor groove of DNA in a highly sequence-specific manner. Members include the fungal mating type gene products MC, MATA1 and Ste11.
Probab=99.83  E-value=1.9e-20  Score=127.10  Aligned_cols=72  Identities=17%  Similarity=0.335  Sum_probs=69.5

Q ss_pred             CCCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhhhhh
Q psy14923         48 KPKKPLTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYHDE  119 (180)
Q Consensus        48 ~PKrP~say~lF~~e~r~~~~~~~p~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~y~~~  119 (180)
                      .|+||+|||+||+++.|..++.+||++++.+|+++||++|+.||+++|++|.++|++++++|..++++|.-.
T Consensus         1 ~~kRP~naf~lf~~~~r~~~~~~~p~~~~~eisk~~g~~Wk~ls~eeK~~y~~~A~~~k~~~~~~~p~Yky~   72 (77)
T cd01389           1 KIPRPRNAFILYRQDKHAQLKTENPGLTNNEISRIIGRMWRSESPEVKAYYKELAEEEKERHAREYPDYKYT   72 (77)
T ss_pred             CCCCCCcHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHhhCCHHHHHHHHHHHHHHHHHHHHHCCCCccc
Confidence            478999999999999999999999999999999999999999999999999999999999999999999863


No 3  
>COG5648 NHP6B Chromatin-associated proteins containing the HMG domain [Chromatin structure and dynamics]
Probab=99.82  E-value=3.4e-20  Score=145.19  Aligned_cols=85  Identities=31%  Similarity=0.606  Sum_probs=81.3

Q ss_pred             cccCCCCCCCCCCCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy14923         37 QQTSQPLSIPEKPKKPLTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKY  116 (180)
Q Consensus        37 ~~~~k~~~~p~~PKrP~say~lF~~e~r~~~~~~~p~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~y  116 (180)
                      +..+++.+||+.||||+|||++|+.++|.++..++|++++.+|.+++|++|++|+|+||++|...|..++++|..++..|
T Consensus        59 k~~~r~k~dpN~PKRp~sayf~y~~~~R~ei~~~~p~l~~~e~~k~~~e~WK~Ltd~eke~y~k~~~~~~erYq~ek~~y  138 (211)
T COG5648          59 KRLVRKKKDPNGPKRPLSAYFLYSAENRDEIRKENPKLTFGEVGKLLSEKWKELTDEEKEPYYKEANSDRERYQREKEEY  138 (211)
T ss_pred             HHHHHHhcCCCCCCCchhHHHHHHHHHHHHHHHhCCCCChHHHHHHHHHHHHhccHhhhhhHHHHHhhHHHHHHHHHHhh
Confidence            45567889999999999999999999999999999999999999999999999999999999999999999999999999


Q ss_pred             hhhCC
Q psy14923        117 HDELT  121 (180)
Q Consensus       117 ~~~~~  121 (180)
                      ...++
T Consensus       139 ~~k~~  143 (211)
T COG5648         139 NKKLP  143 (211)
T ss_pred             hcccC
Confidence            99876


No 4  
>cd01388 SOX-TCF_HMG-box SOX-TCF_HMG-box, class I member of the HMG-box superfamily of DNA-binding proteins. These proteins contain a single HMG box, and bind the minor groove of DNA in a highly sequence-specific manner. Members include SRY and its homologs in insects and vertebrates, and transcription factor-like proteins, TCF-1, -3, -4, and LEF-1. They appear to bind the minor groove of the A/T C A A A G/C-motif.
Probab=99.82  E-value=6.8e-20  Score=122.93  Aligned_cols=69  Identities=23%  Similarity=0.456  Sum_probs=67.1

Q ss_pred             CCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhhh
Q psy14923         49 PKKPLTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYH  117 (180)
Q Consensus        49 PKrP~say~lF~~e~r~~~~~~~p~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~y~  117 (180)
                      .|||+|||++|++++|..++.+||++++.+|+++||++|+.||+++|++|.++|+.++++|.+++++|+
T Consensus         2 iKrP~naf~~F~~~~r~~~~~~~p~~~~~eisk~l~~~Wk~ls~~eK~~y~~~a~~~k~~y~~~~p~y~   70 (72)
T cd01388           2 IKRPMNAFMLFSKRHRRKVLQEYPLKENRAISKILGDRWKALSNEEKQPYYEEAKKLKELHMKLYPDYK   70 (72)
T ss_pred             CCCCCcHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHHHHCcCCC
Confidence            479999999999999999999999999999999999999999999999999999999999999999885


No 5  
>PF00505 HMG_box:  HMG (high mobility group) box;  InterPro: IPR000910 High mobility group (HMG or HMGB) proteins are a family of relatively low molecular weight non-histone components in chromatin. HMG1 (also called HMG-T in fish) and HMG2 are two highly related proteins that bind single-stranded DNA preferentially and unwind double-stranded DNA. Although they have no sequence specificity, they have a high affinity for bent or distorted DNA, and bend linear DNA. HMG1 and HMG2 contain two DNA-binding HMG-box domains (A and B) that show structural and functional differences, and have a long acidic C-terminal domain rich in aspartic and glutamic acid residues. The acidic tail modulates the affinity of the tandem HMG boxes in HMG1 and 2 for a variety of DNA targets. HMG1 and 2 appear to play important architectural roles in the assembly of nucleoprotein complexes in a variety of biological processes, for example V(D)J recombination, the initiation of transcription, and DNA repair []. The profile in this entry describing the HMG-domains is much more general than the signature. In addition to the HMG1 and HMG2 proteins, HMG-domains occur in single or multiple copies in the following protein classes; the SOX family of transcription factors; SRY sex determining region Y protein and related proteins []; LEF1 lymphoid enhancer binding factor 1 []; SSRP recombination signal recognition protein; MTF1 mitochondrial transcription factor 1; UBF1/2 nucleolar transcription factors; Abf2 yeast ARS-binding factor []; and Saccharomyces cerevisiae transcription factors Ixr1, Rox1, Nhp6a, Nhp6b and Spp41.; GO: 0003677 DNA binding; PDB: 1I11_A 1J3C_A 1J3D_A 1WZ6_A 1WGF_A 2D7L_A 1GT0_D 3U2B_C 2CRJ_A 2CS1_A ....
Probab=99.81  E-value=1.6e-19  Score=119.42  Aligned_cols=69  Identities=32%  Similarity=0.641  Sum_probs=65.8

Q ss_pred             CCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhhh
Q psy14923         49 PKKPLTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYH  117 (180)
Q Consensus        49 PKrP~say~lF~~e~r~~~~~~~p~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~y~  117 (180)
                      |+||+|||++|+.+.+..++.+||+++..+|+++||++|++||+++|.+|.+.|++++.+|.+++.+|+
T Consensus         1 PkrP~~af~lf~~~~~~~~k~~~p~~~~~~i~~~~~~~W~~l~~~eK~~y~~~a~~~~~~y~~~~~~y~   69 (69)
T PF00505_consen    1 PKRPPNAFMLFCKEKRAKLKEENPDLSNKEISKILAQMWKNLSEEEKAPYKEEAEEEKERYEKEMPEYK   69 (69)
T ss_dssp             SSSS--HHHHHHHHHHHHHHHHSTTSTHHHHHHHHHHHHHCSHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             CcCCCCHHHHHHHHHHHHHHHHhcccccccchhhHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHhcC
Confidence            899999999999999999999999999999999999999999999999999999999999999999985


No 6  
>cd01390 HMGB-UBF_HMG-box HMGB-UBF_HMG-box, class II and III members of the HMG-box superfamily of DNA-binding proteins. These proteins bind the minor groove of DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III members include nucleolar and mitochondrial transcription factors, UBF and mtTF1, which bind four-way DNA junctions.
Probab=99.80  E-value=4.3e-19  Score=116.35  Aligned_cols=65  Identities=42%  Similarity=0.806  Sum_probs=63.7

Q ss_pred             CCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHH
Q psy14923         49 PKKPLTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNV  113 (180)
Q Consensus        49 PKrP~say~lF~~e~r~~~~~~~p~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~  113 (180)
                      ||+|+|||++|+.+.|..++.+||++++.+|++.||++|++||+++|++|.+.|++++++|..+|
T Consensus         1 Pkrp~saf~~f~~~~r~~~~~~~p~~~~~~i~~~~~~~W~~ls~~eK~~y~~~a~~~~~~y~~e~   65 (66)
T cd01390           1 PKRPLSAYFLFSQEQRPKLKKENPDASVTEVTKILGEKWKELSEEEKKKYEEKAEKDKERYEKEM   65 (66)
T ss_pred             CCCCCcHHHHHHHHHHHHHHHHCcCCCHHHHHHHHHHHHHhCCHHHHHHHHHHHHHHHHHHHHhh
Confidence            89999999999999999999999999999999999999999999999999999999999999887


No 7  
>smart00398 HMG high mobility group.
Probab=99.79  E-value=7.6e-19  Score=116.04  Aligned_cols=70  Identities=34%  Similarity=0.665  Sum_probs=67.9

Q ss_pred             CCCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhhh
Q psy14923         48 KPKKPLTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYH  117 (180)
Q Consensus        48 ~PKrP~say~lF~~e~r~~~~~~~p~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~y~  117 (180)
                      +|++|+|||++|+++.|..+..+||++++.+|++.||++|+.||+++|++|.+.|++++++|..++..|.
T Consensus         1 ~pkrp~~~y~~f~~~~r~~~~~~~~~~~~~~i~~~~~~~W~~l~~~ek~~y~~~a~~~~~~y~~~~~~y~   70 (70)
T smart00398        1 KPKRPMSAFMLFSQENRAKIKAENPDLSNAEISKKLGERWKLLSEEEKAPYEEKAKKDKERYEEEMPEYK   70 (70)
T ss_pred             CcCCCCcHHHHHHHHHHHHHHHHCcCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHHHHHHhcC
Confidence            5899999999999999999999999999999999999999999999999999999999999999999874


No 8  
>PF09011 HMG_box_2:  HMG-box domain;  InterPro: IPR015101 This domain is predominantly found in Maelstrom homologue proteins. It has no known function. ; GO: 0005634 nucleus; PDB: 2EQZ_A 1V64_A 2CTO_A 1H5P_A 3TQ6_A 3FGH_A 3TMM_A 1J3X_A 2YRQ_A 1AAB_A ....
Probab=99.79  E-value=7.1e-19  Score=118.26  Aligned_cols=72  Identities=33%  Similarity=0.621  Sum_probs=63.8

Q ss_pred             CCCCCCCCChHHHHHHHHHHHHHHh-CCCCCHHHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhhh
Q psy14923         46 PEKPKKPLTPYFKFQGQIRNKIMER-NPNIKIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYH  117 (180)
Q Consensus        46 p~~PKrP~say~lF~~e~r~~~~~~-~p~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~y~  117 (180)
                      |++||+|+|||+||+.+.+..++.. ++.....|+++.|+..|++||++||.+|+++|++++++|..+|..|.
T Consensus         1 p~kpK~~~say~lF~~~~~~~~k~~G~~~~~~~e~~k~~~~~Wk~Ls~~EK~~Y~~~A~~~k~~y~~e~~~~~   73 (73)
T PF09011_consen    1 PKKPKRPPSAYNLFMKEMRKEVKEEGGQKQSFREVMKEISERWKSLSEEEKEPYEERAKEDKERYEREMKEWN   73 (73)
T ss_dssp             SSS--SSSSHHHHHHHHHHHHHHHHT-T-SSHHHHHHHHHHHHHHS-HHHHHHHHHHHHHHHHHHHHHHHHH-
T ss_pred             CcCCCCCCCHHHHHHHHHHHHHHHhcccCCCHHHHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhcC
Confidence            6899999999999999999999888 77788999999999999999999999999999999999999999984


No 9  
>cd00084 HMG-box High Mobility Group (HMG)-box is found in a variety of eukaryotic chromosomal proteins and transcription factors. HMGs bind to the minor groove of DNA and have been classified by DNA binding preferences. Two phylogenically distinct groups of Class I proteins bind DNA in a sequence specific fashion and contain a single HMG box. One group (SOX-TCF) includes transcription factors, TCF-1, -3, -4; and also SRY and LEF-1, which bind four-way DNA junctions and duplex DNA targets. The second group (MATA) includes fungal mating type gene products MC, MATA1 and Ste11. Class II and III proteins (HMGB-UBF) bind DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III member
Probab=99.73  E-value=3e-17  Score=107.15  Aligned_cols=65  Identities=37%  Similarity=0.753  Sum_probs=63.1

Q ss_pred             CCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHH
Q psy14923         49 PKKPLTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNV  113 (180)
Q Consensus        49 PKrP~say~lF~~e~r~~~~~~~p~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~  113 (180)
                      |++|+|||++|+++.+..++.++|+++..+|++.||++|+.|++++|.+|.+.|++++.+|.+++
T Consensus         1 pkrp~~af~~f~~~~~~~~~~~~~~~~~~~i~~~~~~~W~~l~~~~k~~y~~~a~~~~~~y~~~~   65 (66)
T cd00084           1 PKRPLSAYFLFSQEHRAEVKAENPGLSVGEISKILGEMWKSLSEEEKKKYEEKAEKDKERYEKEM   65 (66)
T ss_pred             CCCCCcHHHHHHHHHHHHHHHHCcCCCHHHHHHHHHHHHHhCCHHHHHHHHHHHHHHHHHHHHhh
Confidence            78999999999999999999999999999999999999999999999999999999999998775


No 10 
>KOG0381|consensus
Probab=99.72  E-value=8.2e-17  Score=113.30  Aligned_cols=76  Identities=37%  Similarity=0.681  Sum_probs=72.3

Q ss_pred             CC--CCCCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHH-hhhhhC
Q psy14923         45 IP--EKPKKPLTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVK-KYHDEL  120 (180)
Q Consensus        45 ~p--~~PKrP~say~lF~~e~r~~~~~~~p~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~-~y~~~~  120 (180)
                      +|  +.|++|+|||++|+.+.+..++.+||++++.+|++++|++|++|++++|.+|+..+..++++|..+|. .|+..+
T Consensus        17 ~p~~~~pkrp~sa~~~f~~~~~~~~k~~~p~~~~~~v~k~~g~~W~~l~~~~k~~y~~ka~~~k~~Y~~~~~~~~~~~~   95 (96)
T KOG0381|consen   17 DPNAQAPKRPLSAFFLFSSEQRSKIKAENPGLSVGEVAKALGEMWKNLAEEEKQPYEEKASKLKEKYEKELAGEYKASL   95 (96)
T ss_pred             CCCCCCCCCCCcHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc
Confidence            56  59999999999999999999999999999999999999999999999999999999999999999999 887754


No 11 
>KOG0527|consensus
Probab=99.69  E-value=1e-16  Score=135.21  Aligned_cols=79  Identities=20%  Similarity=0.413  Sum_probs=74.2

Q ss_pred             CCCCCCCCCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhhhhhC
Q psy14923         42 PLSIPEKPKKPLTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYHDEL  120 (180)
Q Consensus        42 ~~~~p~~PKrP~say~lF~~e~r~~~~~~~p~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~y~~~~  120 (180)
                      ........||||||||+|.++.|.+|..+||++.+.||+|.||.+|+.|+|+||.+|+++|++.+..|++++.+|+-.-
T Consensus        56 ~k~~~~hIKRPMNAFMVWSq~~RRkma~qnP~mHNSEISK~LG~~WK~Lse~EKrPFi~EAeRLR~~HmkehPdYKYRP  134 (331)
T KOG0527|consen   56 DKTSTDRIKRPMNAFMVWSQGQRRKLAKQNPKMHNSEISKRLGAEWKLLSEEEKRPFVDEAERLRAQHMKEYPDYKYRP  134 (331)
T ss_pred             CCCCccccCCCcchhhhhhHHHHHHHHHhCcchhhHHHHHHHHHHHhhcCHhhhccHHHHHHHHHHHHHHhCCCccccc
Confidence            3344578999999999999999999999999999999999999999999999999999999999999999999999753


No 12 
>KOG0526|consensus
Probab=99.59  E-value=2e-15  Score=131.73  Aligned_cols=81  Identities=26%  Similarity=0.490  Sum_probs=75.8

Q ss_pred             cccccCCCCCCCCCCCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHH
Q psy14923         35 LKQQTSQPLSIPEKPKKPLTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVK  114 (180)
Q Consensus        35 ~k~~~~k~~~~p~~PKrP~say~lF~~e~r~~~~~~~p~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~  114 (180)
                      .+++..|+.+||++|||++||||+|++..|..|+.+  ++++++|++.+|++|+.||+  |.+|++.|+.++++|+.+|.
T Consensus       522 ~~~k~~kk~kdpnapkra~sa~m~w~~~~r~~ik~d--gi~~~dv~kk~g~~wk~ms~--k~~we~ka~~dk~ry~~em~  597 (615)
T KOG0526|consen  522 EKKKKGKKKKDPNAPKRATSAYMLWLNASRESIKED--GISVGDVAKKAGEKWKQMSA--KEEWEDKAAVDKQRYEDEMK  597 (615)
T ss_pred             ccccCcccCCCCCCCccchhHHHHHHHhhhhhHhhc--CchHHHHHHHHhHHHhhhcc--cchhhHHHHHHHHHHHHHHH
Confidence            344777889999999999999999999999999887  89999999999999999999  99999999999999999999


Q ss_pred             hhhhh
Q psy14923        115 KYHDE  119 (180)
Q Consensus       115 ~y~~~  119 (180)
                      +|+..
T Consensus       598 ~yk~g  602 (615)
T KOG0526|consen  598 EYKNG  602 (615)
T ss_pred             hhcCC
Confidence            99954


No 13 
>KOG4715|consensus
Probab=99.59  E-value=1.8e-14  Score=118.81  Aligned_cols=111  Identities=30%  Similarity=0.516  Sum_probs=96.5

Q ss_pred             cCCCCCCCCccc------ccccCCCCCcccccccccccccCCCCCCCCCCCCCCChHHHHHHHHHHHHHHhCCCCCHHHH
Q psy14923          6 LTTGVFTGGNVF------NLRSCLMPHLNSVTTAGLKQQTSQPLSIPEKPKKPLTPYFKFQGQIRNKIMERNPNIKIIEV   79 (180)
Q Consensus         6 ~~~~~~~~~~~~------~~~~~~~p~~~~~~~~~~k~~~~k~~~~p~~PKrP~say~lF~~e~r~~~~~~~p~~~~~ei   79 (180)
                      .-|+-|+|++.|      |.+.++.+++++++....-   ....+-|.+|-+|+-+||.|.+..|++|++.||++...||
T Consensus        19 ~st~g~~g~~~~~h~~y~n~r~~~~~~T~sr~~~~~~---~t~pkpPkppekpl~pymrySrkvWd~VkA~nPe~kLWei   95 (410)
T KOG4715|consen   19 PSTPGFGGYNPYTHLAYSNYRLPGNPATNSRVFQNSS---ITRPKPPKPPEKPLMPYMRYSRKVWDQVKASNPELKLWEI   95 (410)
T ss_pred             CCCCCCCCCCcchhhhccCcCCCCCCCCccccccccc---ccCCCCCCCCCcccchhhHHhhhhhhhhhccCcchHHHHH
Confidence            446668888877      7777778887777666533   2344567889999999999999999999999999999999


Q ss_pred             HHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhhhhh
Q psy14923         80 TKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYHDE  119 (180)
Q Consensus        80 ~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~y~~~  119 (180)
                      -++||.+|..|+|+||+.|.+.++.++..|++.|..|..+
T Consensus        96 GK~Ig~mW~dLpd~EK~ey~~EYeaEKieY~~smkayh~s  135 (410)
T KOG4715|consen   96 GKIIGGMWLDLPDEEKQEYLNEYEAEKIEYNESMKAYHNS  135 (410)
T ss_pred             HHHHHHHHhhCcchHHHHHHHHHHHHHHHHHHHHHHhhCC
Confidence            9999999999999999999999999999999999999884


No 14 
>KOG3248|consensus
Probab=99.24  E-value=1.7e-11  Score=102.15  Aligned_cols=73  Identities=15%  Similarity=0.337  Sum_probs=67.8

Q ss_pred             CCCCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhhhhh
Q psy14923         47 EKPKKPLTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYHDE  119 (180)
Q Consensus        47 ~~PKrP~say~lF~~e~r~~~~~~~p~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~y~~~  119 (180)
                      .-.|+|+|||+||++|.|++|.+++.-....+|.++||.+|..||.||..+|.++|.++++.+.+-+..|-+.
T Consensus       190 phiKKPLNAFmlyMKEmRa~vvaEctlKeSAaiNqiLGrRWH~LSrEEQAKYyElArKerqlH~qlYP~WSAR  262 (421)
T KOG3248|consen  190 PHIKKPLNAFMLYMKEMRAKVVAECTLKESAAINQILGRRWHALSREEQAKYYELARKERQLHMQLYPGWSAR  262 (421)
T ss_pred             ccccccHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHhHHHhhhhHHHHHHHHHHHHHHHHHHHHhcCCcchh
Confidence            3568999999999999999999998756689999999999999999999999999999999999999999874


No 15 
>KOG0528|consensus
Probab=99.07  E-value=8.6e-11  Score=101.99  Aligned_cols=80  Identities=19%  Similarity=0.380  Sum_probs=72.3

Q ss_pred             CCCCCCCCCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhhhhhCC
Q psy14923         42 PLSIPEKPKKPLTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYHDELT  121 (180)
Q Consensus        42 ~~~~p~~PKrP~say~lF~~e~r~~~~~~~p~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~y~~~~~  121 (180)
                      ....+.-.||||||||+|.+|.|.+|...+||+.+..|+++||.+|+.||-.||++|.++-...-..|.+.+++|+-.-.
T Consensus       319 ~~ss~PHIKRPMNAFMVWAkDERRKILqA~PDMHNSnISKILGSRWKaMSN~eKQPYYEEQaRLSk~HlEk~PdYrYkPR  398 (511)
T KOG0528|consen  319 RASSEPHIKRPMNAFMVWAKDERRKILQAFPDMHNSNISKILGSRWKAMSNTEKQPYYEEQARLSKLHLEKYPDYRYKPR  398 (511)
T ss_pred             cCCCCccccCCcchhhcccchhhhhhhhcCccccccchhHHhcccccccccccccchHHHHHHHHHhhhccCcccccCCC
Confidence            33445667999999999999999999999999999999999999999999999999999999988899999999986543


No 16 
>PF14887 HMG_box_5:  HMG (high mobility group) box 5; PDB: 1L8Y_A 1L8Z_A 2HDZ_A.
Probab=98.46  E-value=1.1e-06  Score=58.79  Aligned_cols=74  Identities=15%  Similarity=0.277  Sum_probs=61.7

Q ss_pred             CCCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhhhhhCCH
Q psy14923         48 KPKKPLTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYHDELTP  122 (180)
Q Consensus        48 ~PKrP~say~lF~~e~r~~~~~~~p~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~y~~~~~~  122 (180)
                      -|..|-+|--+|.+.....+.+.++.....+ .+.+...|.+|++.+|.+|+..|.++..+|+.+|.+|+..+++
T Consensus         3 lPE~PKt~qe~Wqq~vi~dYla~~~~dr~K~-~kam~~~W~~me~Kekl~WIkKA~EdqKrYE~el~e~r~~~~~   76 (85)
T PF14887_consen    3 LPETPKTAQEIWQQSVIGDYLAKFRNDRKKA-LKAMEAQWSQMEKKEKLKWIKKAAEDQKRYERELREMRSAPAD   76 (85)
T ss_dssp             -S----THHHHHHHHHHHHHHHHTTSTHHHH-HHHHHHHHHTTGGGHHHHHHHHHHHHHHHHHHHHHCCS-CCCT
T ss_pred             CCCCCCCHHHHHHHHHHHHHHHHhhHhHHHH-HHHHHHHHHHhhhhhhhHHHHHHHHHHHHHHHHHHHHhcCCCC
Confidence            4778889999999999999999998877666 4589999999999999999999999999999999999987663


No 17 
>PTZ00199 high mobility group protein; Provisional
Probab=98.38  E-value=1.5e-07  Score=66.13  Aligned_cols=36  Identities=19%  Similarity=0.333  Sum_probs=30.5

Q ss_pred             HHHhhhhhccCCCCCCCchHHHHHHHhhhhhhhcCC
Q psy14923        142 REMKKLFRETGKPKKPGNSFYLFIKDNIDKKEYEGK  177 (180)
Q Consensus       142 ~~~kk~~k~~~~PKrP~say~lF~~e~r~~~~~~~~  177 (180)
                      ++.++..+|++.||+|+|||++||.++|..+..+|.
T Consensus        11 k~~~k~~kdp~~PKrP~sAY~~F~~~~R~~i~~~~P   46 (94)
T PTZ00199         11 RKNKRKKKDPNAPKRALSAYMFFAKEKRAEIIAENP   46 (94)
T ss_pred             cccCCCCCCCCCCCCCCcHHHHHHHHHHHHHHHHCc
Confidence            344567899999999999999999999998877663


No 18 
>KOG2746|consensus
Probab=98.21  E-value=1.5e-06  Score=78.63  Aligned_cols=69  Identities=20%  Similarity=0.409  Sum_probs=63.6

Q ss_pred             CCCCCCCCCCChHHHHHHHHH--HHHHHhCCCCCHHHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHH
Q psy14923         44 SIPEKPKKPLTPYFKFQGQIR--NKIMERNPNIKIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQN  112 (180)
Q Consensus        44 ~~p~~PKrP~say~lF~~e~r--~~~~~~~p~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~  112 (180)
                      .+..-..+|||+|++|++.+|  ..+...||+..+.-|+++||++|-.|.+.||+.|.++|.+.++.|-++
T Consensus       177 r~k~HirrPMnaf~ifskrhr~~g~vhq~~pn~DNrtIskiLgewWytL~~~Ekq~yhdLa~Qvk~Ahfka  247 (683)
T KOG2746|consen  177 RDKDHIRRPMNAFHIFSKRHRGEGRVHQRHPNQDNRTISKILGEWWYTLGPNEKQKYHDLAFQVKEAHFKA  247 (683)
T ss_pred             CcchhhhhhhHHHHHHHhhcCCccchhccCccccchhHHHHHhhhHhhhCchhhhhHHHHHHHHHHHHhhh
Confidence            344567899999999999999  888899999999999999999999999999999999999999998776


No 19 
>PF09011 HMG_box_2:  HMG-box domain;  InterPro: IPR015101 This domain is predominantly found in Maelstrom homologue proteins. It has no known function. ; GO: 0005634 nucleus; PDB: 2EQZ_A 1V64_A 2CTO_A 1H5P_A 3TQ6_A 3FGH_A 3TMM_A 1J3X_A 2YRQ_A 1AAB_A ....
Probab=97.48  E-value=8.1e-05  Score=49.54  Aligned_cols=25  Identities=32%  Similarity=0.591  Sum_probs=20.7

Q ss_pred             cCCCCCCCchHHHHHHHhhhhhhhc
Q psy14923        151 TGKPKKPGNSFYLFIKDNIDKKEYE  175 (180)
Q Consensus       151 ~~~PKrP~say~lF~~e~r~~~~~~  175 (180)
                      |++||+|+|||+||+.+++..+..+
T Consensus         1 p~kpK~~~say~lF~~~~~~~~k~~   25 (73)
T PF09011_consen    1 PKKPKRPPSAYNLFMKEMRKEVKEE   25 (73)
T ss_dssp             SSS--SSSSHHHHHHHHHHHHHHHH
T ss_pred             CcCCCCCCCHHHHHHHHHHHHHHHh
Confidence            5689999999999999999998766


No 20 
>cd01390 HMGB-UBF_HMG-box HMGB-UBF_HMG-box, class II and III members of the HMG-box superfamily of DNA-binding proteins. These proteins bind the minor groove of DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III members include nucleolar and mitochondrial transcription factors, UBF and mtTF1, which bind four-way DNA junctions.
Probab=97.43  E-value=0.0001  Score=47.47  Aligned_cols=24  Identities=29%  Similarity=0.605  Sum_probs=22.0

Q ss_pred             CCCCCchHHHHHHHhhhhhhhcCC
Q psy14923        154 PKKPGNSFYLFIKDNIDKKEYEGK  177 (180)
Q Consensus       154 PKrP~say~lF~~e~r~~~~~~~~  177 (180)
                      ||+|+|||++|+++.|..+..+|.
T Consensus         1 Pkrp~saf~~f~~~~r~~~~~~~p   24 (66)
T cd01390           1 PKRPLSAYFLFSQEQRPKLKKENP   24 (66)
T ss_pred             CCCCCcHHHHHHHHHHHHHHHHCc
Confidence            899999999999999999987764


No 21 
>PF04690 YABBY:  YABBY protein;  InterPro: IPR006780 YABBY proteins are a group of plant-specific transcription factors involved in the specification of abaxial polarity in lateral organs such as leaves and floral organs [, ].
Probab=97.35  E-value=0.0005  Score=53.21  Aligned_cols=49  Identities=22%  Similarity=0.383  Sum_probs=43.5

Q ss_pred             CCCCCCCCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHhhcCC
Q psy14923         43 LSIPEKPKKPLTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIG   91 (180)
Q Consensus        43 ~~~p~~PKrP~say~lF~~e~r~~~~~~~p~~~~~ei~k~l~~~Wk~Ls   91 (180)
                      .+-|.+-.|-+|||..|+++....|++.+|+++..|....++..|...+
T Consensus       116 ~kPPEKRqR~psaYn~f~k~ei~rik~~~p~ishkeaFs~aAknW~h~p  164 (170)
T PF04690_consen  116 NKPPEKRQRVPSAYNRFMKEEIQRIKAENPDISHKEAFSAAAKNWAHFP  164 (170)
T ss_pred             cCCccccCCCchhHHHHHHHHHHHHHhcCCCCCHHHHHHHHHHhhhhCc
Confidence            3445566678999999999999999999999999999999999998765


No 22 
>PF00505 HMG_box:  HMG (high mobility group) box;  InterPro: IPR000910 High mobility group (HMG or HMGB) proteins are a family of relatively low molecular weight non-histone components in chromatin. HMG1 (also called HMG-T in fish) and HMG2 are two highly related proteins that bind single-stranded DNA preferentially and unwind double-stranded DNA. Although they have no sequence specificity, they have a high affinity for bent or distorted DNA, and bend linear DNA. HMG1 and HMG2 contain two DNA-binding HMG-box domains (A and B) that show structural and functional differences, and have a long acidic C-terminal domain rich in aspartic and glutamic acid residues. The acidic tail modulates the affinity of the tandem HMG boxes in HMG1 and 2 for a variety of DNA targets. HMG1 and 2 appear to play important architectural roles in the assembly of nucleoprotein complexes in a variety of biological processes, for example V(D)J recombination, the initiation of transcription, and DNA repair []. The profile in this entry describing the HMG-domains is much more general than the signature. In addition to the HMG1 and HMG2 proteins, HMG-domains occur in single or multiple copies in the following protein classes; the SOX family of transcription factors; SRY sex determining region Y protein and related proteins []; LEF1 lymphoid enhancer binding factor 1 []; SSRP recombination signal recognition protein; MTF1 mitochondrial transcription factor 1; UBF1/2 nucleolar transcription factors; Abf2 yeast ARS-binding factor []; and Saccharomyces cerevisiae transcription factors Ixr1, Rox1, Nhp6a, Nhp6b and Spp41.; GO: 0003677 DNA binding; PDB: 1I11_A 1J3C_A 1J3D_A 1WZ6_A 1WGF_A 2D7L_A 1GT0_D 3U2B_C 2CRJ_A 2CS1_A ....
Probab=97.25  E-value=0.00021  Score=46.51  Aligned_cols=24  Identities=42%  Similarity=0.668  Sum_probs=20.1

Q ss_pred             CCCCCchHHHHHHHhhhhhhhcCC
Q psy14923        154 PKKPGNSFYLFIKDNIDKKEYEGK  177 (180)
Q Consensus       154 PKrP~say~lF~~e~r~~~~~~~~  177 (180)
                      |+||+|||+|||.+.+..+..+|.
T Consensus         1 PkrP~~af~lf~~~~~~~~k~~~p   24 (69)
T PF00505_consen    1 PKRPPNAFMLFCKEKRAKLKEENP   24 (69)
T ss_dssp             SSSS--HHHHHHHHHHHHHHHHST
T ss_pred             CcCCCCHHHHHHHHHHHHHHHHhc
Confidence            899999999999999999987765


No 23 
>smart00398 HMG high mobility group.
Probab=97.23  E-value=0.00024  Score=46.03  Aligned_cols=25  Identities=40%  Similarity=0.635  Sum_probs=22.2

Q ss_pred             CCCCCCchHHHHHHHhhhhhhhcCC
Q psy14923        153 KPKKPGNSFYLFIKDNIDKKEYEGK  177 (180)
Q Consensus       153 ~PKrP~say~lF~~e~r~~~~~~~~  177 (180)
                      +|++|+|||++|+++.+..+..+|.
T Consensus         1 ~pkrp~~~y~~f~~~~r~~~~~~~~   25 (70)
T smart00398        1 KPKRPMSAFMLFSQENRAKIKAENP   25 (70)
T ss_pred             CcCCCCcHHHHHHHHHHHHHHHHCc
Confidence            4899999999999999999877663


No 24 
>PF06382 DUF1074:  Protein of unknown function (DUF1074);  InterPro: IPR024460 This family consists of several proteins which appear to be specific to Insecta. The function of this family is unknown.
Probab=97.15  E-value=0.0011  Score=51.41  Aligned_cols=49  Identities=18%  Similarity=0.379  Sum_probs=42.7

Q ss_pred             CChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHhhcCCHHHHHHHHHHHHHH
Q psy14923         53 LTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAEKKRLENEYAKE  105 (180)
Q Consensus        53 ~say~lF~~e~r~~~~~~~p~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~  105 (180)
                      -+||+-|+.+.+.    .|.+++..|+....+..|..||+++|..|..++...
T Consensus        83 nnaYLNFLReFRr----kh~~L~p~dlI~~AAraW~rLSe~eK~rYrr~~~~~  131 (183)
T PF06382_consen   83 NNAYLNFLREFRR----KHCGLSPQDLIQRAARAWCRLSEAEKNRYRRMAPSV  131 (183)
T ss_pred             chHHHHHHHHHHH----HccCCCHHHHHHHHHHHHHhCCHHHHHHHHhhcchh
Confidence            4679999999875    567999999999999999999999999998865543


No 25 
>cd01389 MATA_HMG-box MATA_HMG-box, class I member of the HMG-box superfamily of DNA-binding proteins. These proteins contain a single HMG box, and bind the minor groove of DNA in a highly sequence-specific manner. Members include the fungal mating type gene products MC, MATA1 and Ste11.
Probab=97.14  E-value=0.00035  Score=46.89  Aligned_cols=25  Identities=28%  Similarity=0.452  Sum_probs=21.8

Q ss_pred             CCCCCCchHHHHHHHhhhhhhhcCC
Q psy14923        153 KPKKPGNSFYLFIKDNIDKKEYEGK  177 (180)
Q Consensus       153 ~PKrP~say~lF~~e~r~~~~~~~~  177 (180)
                      .||||+|||+||+++.+..+..++.
T Consensus         1 ~~kRP~naf~lf~~~~r~~~~~~~p   25 (77)
T cd01389           1 KIPRPRNAFILYRQDKHAQLKTENP   25 (77)
T ss_pred             CCCCCCcHHHHHHHHHHHHHHHHCC
Confidence            3899999999999999998876654


No 26 
>cd00084 HMG-box High Mobility Group (HMG)-box is found in a variety of eukaryotic chromosomal proteins and transcription factors. HMGs bind to the minor groove of DNA and have been classified by DNA binding preferences. Two phylogenically distinct groups of Class I proteins bind DNA in a sequence specific fashion and contain a single HMG box. One group (SOX-TCF) includes transcription factors, TCF-1, -3, -4; and also SRY and LEF-1, which bind four-way DNA junctions and duplex DNA targets. The second group (MATA) includes fungal mating type gene products MC, MATA1 and Ste11. Class II and III proteins (HMGB-UBF) bind DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III member
Probab=97.13  E-value=0.00035  Score=44.73  Aligned_cols=24  Identities=25%  Similarity=0.593  Sum_probs=21.7

Q ss_pred             CCCCCchHHHHHHHhhhhhhhcCC
Q psy14923        154 PKKPGNSFYLFIKDNIDKKEYEGK  177 (180)
Q Consensus       154 PKrP~say~lF~~e~r~~~~~~~~  177 (180)
                      ||+|+|||++|+++.+..+..+|.
T Consensus         1 pkrp~~af~~f~~~~~~~~~~~~~   24 (66)
T cd00084           1 PKRPLSAYFLFSQEHRAEVKAENP   24 (66)
T ss_pred             CCCCCcHHHHHHHHHHHHHHHHCc
Confidence            799999999999999999877764


No 27 
>KOG0526|consensus
Probab=97.09  E-value=0.00029  Score=62.72  Aligned_cols=36  Identities=17%  Similarity=0.368  Sum_probs=31.2

Q ss_pred             HHhhhhhccCCCCCCCchHHHHHHHhhhhhhhcCCC
Q psy14923        143 EMKKLFRETGKPKKPGNSFYLFIKDNIDKKEYEGKS  178 (180)
Q Consensus       143 ~~kk~~k~~~~PKrP~say~lF~~e~r~~~~~~~~~  178 (180)
                      +..++.|||++|||++||||+|....|.+++..+-+
T Consensus       525 k~~kk~kdpnapkra~sa~m~w~~~~r~~ik~dgi~  560 (615)
T KOG0526|consen  525 KKGKKKKDPNAPKRATSAYMLWLNASRESIKEDGIS  560 (615)
T ss_pred             cCcccCCCCCCCccchhHHHHHHHhhhhhHhhcCch
Confidence            345788999999999999999999999999877643


No 28 
>COG5648 NHP6B Chromatin-associated proteins containing the HMG domain [Chromatin structure and dynamics]
Probab=97.06  E-value=0.00053  Score=54.42  Aligned_cols=68  Identities=29%  Similarity=0.546  Sum_probs=61.3

Q ss_pred             CCCCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHH
Q psy14923         47 EKPKKPLTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVK  114 (180)
Q Consensus        47 ~~PKrP~say~lF~~e~r~~~~~~~p~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~  114 (180)
                      .+++.|..+|+-+-.+.+..+...+|+....+++++++..|++|++.-|.+|.+.+..++++|...++
T Consensus       142 ~~~~~~~~~~~e~~~~~r~~~~~~~~~~~~~e~~k~~~~~w~el~~skK~~~~~~~Kk~k~~~~~~~~  209 (211)
T COG5648         142 LPNKAPIGPFIENEPKIRPKVEGPSPDKALVEETKIISKAWSELDESKKKKYIDKYKKLKEEYDSFYP  209 (211)
T ss_pred             cCCCCCCchhhhccHHhccccCCCCcchhhhHHhhhhhhhhhhhChhhhhHHHHHHHHHHHHHhhhcc
Confidence            57778888888888888888888899999999999999999999999999999999999999877654


No 29 
>cd01388 SOX-TCF_HMG-box SOX-TCF_HMG-box, class I member of the HMG-box superfamily of DNA-binding proteins. These proteins contain a single HMG box, and bind the minor groove of DNA in a highly sequence-specific manner. Members include SRY and its homologs in insects and vertebrates, and transcription factor-like proteins, TCF-1, -3, -4, and LEF-1. They appear to bind the minor groove of the A/T C A A A G/C-motif.
Probab=96.78  E-value=0.0011  Score=43.98  Aligned_cols=24  Identities=38%  Similarity=0.358  Sum_probs=20.8

Q ss_pred             CCCCCchHHHHHHHhhhhhhhcCC
Q psy14923        154 PKKPGNSFYLFIKDNIDKKEYEGK  177 (180)
Q Consensus       154 PKrP~say~lF~~e~r~~~~~~~~  177 (180)
                      .|||+|||++||++.|..+..+|.
T Consensus         2 iKrP~naf~~F~~~~r~~~~~~~p   25 (72)
T cd01388           2 IKRPMNAFMLFSKRHRRKVLQEYP   25 (72)
T ss_pred             CCCCCcHHHHHHHHHHHHHHHHCC
Confidence            589999999999999998876653


No 30 
>PF08073 CHDNT:  CHDNT (NUC034) domain;  InterPro: IPR012958 The CHD N-terminal domain is found in PHD/RING fingers and chromo domain-associated helicases [].; GO: 0003677 DNA binding, 0005524 ATP binding, 0008270 zinc ion binding, 0016818 hydrolase activity, acting on acid anhydrides, in phosphorus-containing anhydrides, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=96.25  E-value=0.0062  Score=38.38  Aligned_cols=40  Identities=25%  Similarity=0.551  Sum_probs=36.3

Q ss_pred             CChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHhhcCCH
Q psy14923         53 LTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGE   92 (180)
Q Consensus        53 ~say~lF~~e~r~~~~~~~p~~~~~ei~k~l~~~Wk~Ls~   92 (180)
                      ++.|-+|.+..|+.+.+.||++....+...++.+|++-++
T Consensus        13 lt~yK~Fsq~vRP~l~~~NPk~~~sKl~~l~~AKwrEF~~   52 (55)
T PF08073_consen   13 LTNYKAFSQHVRPLLAKANPKAPMSKLMMLLQAKWREFQE   52 (55)
T ss_pred             HHHHHHHHHHHHHHHHHHCCCCcHHHHHHHHHHHHHHHHh
Confidence            4568899999999999999999999999999999997654


No 31 
>KOG0381|consensus
Probab=96.14  E-value=0.0047  Score=42.89  Aligned_cols=28  Identities=29%  Similarity=0.554  Sum_probs=24.8

Q ss_pred             cc--CCCCCCCchHHHHHHHhhhhhhhcCC
Q psy14923        150 ET--GKPKKPGNSFYLFIKDNIDKKEYEGK  177 (180)
Q Consensus       150 ~~--~~PKrP~say~lF~~e~r~~~~~~~~  177 (180)
                      |+  +.||+|+|||++|+.+.+..+..+|.
T Consensus        17 ~p~~~~pkrp~sa~~~f~~~~~~~~k~~~p   46 (96)
T KOG0381|consen   17 DPNAQAPKRPLSAFFLFSSEQRSKIKAENP   46 (96)
T ss_pred             CCCCCCCCCCCcHHHHHHHHHHHHHHHhCC
Confidence            55  59999999999999999999888775


No 32 
>PF06244 DUF1014:  Protein of unknown function (DUF1014);  InterPro: IPR010422 This family consists of several hypothetical eukaryotic proteins of unknown function.
Probab=92.24  E-value=0.21  Score=36.71  Aligned_cols=45  Identities=20%  Similarity=0.453  Sum_probs=39.9

Q ss_pred             CCCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHhhcCCH
Q psy14923         48 KPKKPLTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGE   92 (180)
Q Consensus        48 ~PKrP~say~lF~~e~r~~~~~~~p~~~~~ei~k~l~~~Wk~Ls~   92 (180)
                      |-+|---||.-|...+.+.++.++|++..+++...|-.+|..-++
T Consensus        72 PErR~KAAy~afeE~~Lp~lK~E~PgLrlsQ~kq~l~K~w~KSPe  116 (122)
T PF06244_consen   72 PERRMKAAYKAFEERRLPELKEENPGLRLSQYKQMLWKEWQKSPE  116 (122)
T ss_pred             cchhHHHHHHHHHHHHhHHHHhhCCCchHHHHHHHHHHHHhcCCC
Confidence            344445789999999999999999999999999999999988765


No 33 
>KOG3248|consensus
Probab=91.13  E-value=0.12  Score=44.08  Aligned_cols=23  Identities=39%  Similarity=0.478  Sum_probs=20.0

Q ss_pred             CCCCCCchHHHHHHHhhhhhhhc
Q psy14923        153 KPKKPGNSFYLFIKDNIDKKEYE  175 (180)
Q Consensus       153 ~PKrP~say~lF~~e~r~~~~~~  175 (180)
                      -.|+|+|||+||++|+|.++-.+
T Consensus       191 hiKKPLNAFmlyMKEmRa~vvaE  213 (421)
T KOG3248|consen  191 HIKKPLNAFMLYMKEMRAKVVAE  213 (421)
T ss_pred             cccccHHHHHHHHHHHHHHHHHH
Confidence            46899999999999999988654


No 34 
>KOG0527|consensus
Probab=90.49  E-value=0.17  Score=43.32  Aligned_cols=30  Identities=20%  Similarity=0.277  Sum_probs=25.5

Q ss_pred             hhccCCCCCCCchHHHHHHHhhhhhhhcCC
Q psy14923        148 FRETGKPKKPGNSFYLFIKDNIDKKEYEGK  177 (180)
Q Consensus       148 ~k~~~~PKrP~say~lF~~e~r~~~~~~~~  177 (180)
                      .+..+..|||+||||+|++..|.++-.++.
T Consensus        57 k~~~~hIKRPMNAFMVWSq~~RRkma~qnP   86 (331)
T KOG0527|consen   57 KTSTDRIKRPMNAFMVWSQGQRRKLAKQNP   86 (331)
T ss_pred             CCCccccCCCcchhhhhhHHHHHHHHHhCc
Confidence            355678999999999999999999877764


No 35 
>PF11304 DUF3106:  Protein of unknown function (DUF3106);  InterPro: IPR021455  Some members in this family of proteins are annotated as transmembrane proteins however this cannot be confirmed. Currently no function is known. 
Probab=89.97  E-value=3  Score=29.82  Aligned_cols=53  Identities=11%  Similarity=0.349  Sum_probs=38.6

Q ss_pred             HHHHHHHHHHhhcCCHHHHHHHHHHHHH-------HHHHHHHHHHhhhhhCCHHHHHHHHH
Q psy14923         77 IEVTKLVSEEWKKIGEAEKKRLENEYAK-------EKEIYAQNVKKYHDELTPEAIEFLRK  130 (180)
Q Consensus        77 ~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~-------~k~~y~~~~~~y~~~~~~~~~~~~~~  130 (180)
                      .++..-++..|..|+++.+..+.+.+..       +..+....|..|.. ++|+++..+..
T Consensus        10 q~~L~pl~~~W~~l~~~qr~k~l~~a~r~~~mspeqq~r~~~rm~~W~~-LspeqR~~~R~   69 (107)
T PF11304_consen   10 QQALAPLAERWNSLPPEQRRKWLQIAERWPSMSPEQQQRLRERMRRWAA-LSPEQRQQARE   69 (107)
T ss_pred             HHHHHHHHHHHhcCCHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHh-CCHHHHHHHHH
Confidence            4556678888888888888887777754       67777788887776 67777655443


No 36 
>PF04769 MAT_Alpha1:  Mating-type protein MAT alpha 1;  InterPro: IPR006856 This family includes Saccharomyces cerevisiae (Baker's yeast) mating type protein alpha 1 (P01365 from SWISSPROT). MAT alpha 1 is a transcription activator that activates mating-type alpha-specific genes with the help of the MADS-box containing MCM1 transcription factor, which together bind cooperatively to PQ elements upstream of alpha-specific genes. The MCM1-MATalpha1 complex is required for the proper DNA-bending that is needed for transcriptional activation []. Alpha 1 interacts in vivo with STE12, linking expression of alpha-specific genes to the alpha-pheromone (IPR006742 from INTERPRO) response pathway [].; GO: 0000772 mating pheromone activity, 0003677 DNA binding, 0045895 positive regulation of transcription, mating-type specific, 0005634 nucleus
Probab=88.99  E-value=1.4  Score=35.09  Aligned_cols=55  Identities=22%  Similarity=0.345  Sum_probs=41.4

Q ss_pred             CCCCCCCCCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHhhcCCHHHHHHHHHHHH
Q psy14923         43 LSIPEKPKKPLTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAEKKRLENEYA  103 (180)
Q Consensus        43 ~~~p~~PKrP~say~lF~~e~r~~~~~~~p~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~  103 (180)
                      ......++||+|+|+.|..=.-    ...|+....+++..|+..|..=+-  |..|.-.|.
T Consensus        38 ~~~~~~~kr~lN~Fm~FRsyy~----~~~~~~~Qk~~S~~l~~lW~~dp~--k~~W~l~ak   92 (201)
T PF04769_consen   38 KRSPEKAKRPLNGFMAFRSYYS----PIFPPLPQKELSGILTKLWEKDPF--KNKWSLMAK   92 (201)
T ss_pred             cccccccccchhHHHHHHHHHH----hhcCCcCHHHHHHHHHHHHhCCcc--HhHHHHHhh
Confidence            3445678999999999976664    456788899999999999987432  566655543


No 37 
>TIGR03481 HpnM hopanoid biosynthesis associated membrane protein HpnM. The genomes containing members of this family share the machinery for the biosynthesis of hopanoid lipids. Furthermore, the genes of this family are usually located proximal to other components of this biological process. The proteins are members of the pfam05494 family of putative transporters known as "toluene tolerance protein Ttg2D", although it is unlikely that the members included here have anything to do with toluene per-se.
Probab=86.13  E-value=2.1  Score=33.94  Aligned_cols=46  Identities=13%  Similarity=0.443  Sum_probs=38.8

Q ss_pred             CCCHHHHHH-HHHHHhhcCCHHHHHHHHHHHHH-HHHHHHHHHHhhhh
Q psy14923         73 NIKIIEVTK-LVSEEWKKIGEAEKKRLENEYAK-EKEIYAQNVKKYHD  118 (180)
Q Consensus        73 ~~~~~ei~k-~l~~~Wk~Ls~~eK~~y~~~a~~-~k~~y~~~~~~y~~  118 (180)
                      -.++..+++ .+|.-|+.+|+++++.|.+.... ....|-..+..|..
T Consensus        63 ~~Df~~mar~vLG~~W~~~s~~Qr~~F~~~F~~~l~~tY~~~l~~y~~  110 (198)
T TIGR03481        63 AFDLPAMARLTLGSSWTSLSPEQRRRFIGAFRELSIATYASQFKSYAG  110 (198)
T ss_pred             hCCHHHHHHHHhhhhhhhCCHHHHHHHHHHHHHHHHHHHHHHHHhhcC
Confidence            467777766 58999999999999999999988 66778888888865


No 38 
>cd00225 API3 Ascaris pepsin inhibitor-3 (API3); protein inhibitor that reversibly inhibits aspartic proteinase cathepsin E, and gastric enzymes pepsin and gastricsin.
Probab=83.27  E-value=1.9  Score=32.67  Aligned_cols=29  Identities=17%  Similarity=0.402  Sum_probs=19.1

Q ss_pred             HHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhhhhhC
Q psy14923         84 SEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYHDEL  120 (180)
Q Consensus        84 ~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~y~~~~  120 (180)
                      |-.|+.|+.+|.+.        ...|..+..+|+..+
T Consensus        25 g~~lReLt~~Eq~e--------l~~y~~d~~~yK~~~   53 (159)
T cd00225          25 GFPLRELTPDEQQE--------LAQYVEDVADYKEEV   53 (159)
T ss_pred             CceeeeCCHHHHHH--------HHHHHHHHHHHHHHH
Confidence            34699999999443        345666666666543


No 39 
>PRK15117 ABC transporter periplasmic binding protein MlaC; Provisional
Probab=82.99  E-value=4  Score=32.63  Aligned_cols=48  Identities=17%  Similarity=0.358  Sum_probs=39.1

Q ss_pred             CCCCHHHHHH-HHHHHhhcCCHHHHHHHHHHHHHH-HHHHHHHHHhhhhh
Q psy14923         72 PNIKIIEVTK-LVSEEWKKIGEAEKKRLENEYAKE-KEIYAQNVKKYHDE  119 (180)
Q Consensus        72 p~~~~~ei~k-~l~~~Wk~Ls~~eK~~y~~~a~~~-k~~y~~~~~~y~~~  119 (180)
                      |-.++..+++ .+|.-|+.+|+++++.|.+....- ...|-..+..|...
T Consensus        66 p~~Df~~~s~~vLG~~wr~as~eQr~~F~~~F~~~Lv~tYa~~l~~y~~q  115 (211)
T PRK15117         66 PYVQVKYAGALVLGRYYKDATPAQREAYFAAFREYLKQAYGQALAMYHGQ  115 (211)
T ss_pred             ccCCHHHHHHHHhhhhhhhCCHHHHHHHHHHHHHHHHHHHHHHHHHhCCc
Confidence            4567777766 689999999999999999988874 45788888888753


No 40 
>PF11304 DUF3106:  Protein of unknown function (DUF3106);  InterPro: IPR021455  Some members in this family of proteins are annotated as transmembrane proteins however this cannot be confirmed. Currently no function is known. 
Probab=79.41  E-value=20  Score=25.52  Aligned_cols=32  Identities=16%  Similarity=0.401  Sum_probs=14.8

Q ss_pred             HHhCCCCCHHHHHHHHH--HHhhcCCHHHHHHHH
Q psy14923         68 MERNPNIKIIEVTKLVS--EEWKKIGEAEKKRLE   99 (180)
Q Consensus        68 ~~~~p~~~~~ei~k~l~--~~Wk~Ls~~eK~~y~   99 (180)
                      ....+.++..+-.....  ..|.+||++++..-.
T Consensus        35 a~r~~~mspeqq~r~~~rm~~W~~LspeqR~~~R   68 (107)
T PF11304_consen   35 AERWPSMSPEQQQRLRERMRRWAALSPEQRQQAR   68 (107)
T ss_pred             HHHHhcCCHHHHHHHHHHHHHHHhCCHHHHHHHH
Confidence            33444555433332222  356666666655433


No 41 
>PF12881 NUT_N:  NUT protein N terminus;  InterPro: IPR024309 This domain is found in the N-terminal region of Nuclear Testis (NUT) proteins. It is also found in FAM22, which are a family of uncharacterised mammalian proteins.
Probab=77.19  E-value=6.8  Score=33.26  Aligned_cols=53  Identities=11%  Similarity=0.107  Sum_probs=44.4

Q ss_pred             CChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHhhcCCHHHHHHHHHHHHHH
Q psy14923         53 LTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAEKKRLENEYAKE  105 (180)
Q Consensus        53 ~say~lF~~e~r~~~~~~~p~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~  105 (180)
                      ..|+.+|..-....+....|.++..|-....-++|...|.-+|..|.++|++=
T Consensus       229 ~EAlSCFLIpvLrsLar~kPtMtlEeGl~ra~qEW~~~SnfdRmifyemaekF  281 (328)
T PF12881_consen  229 AEALSCFLIPVLRSLARLKPTMTLEEGLWRAVQEWQHTSNFDRMIFYEMAEKF  281 (328)
T ss_pred             hhhhhhhHHHHHHHHHhcCCCccHHHHHHHHHHHhhccccccHHHHHHHHHHH
Confidence            55677777777667777788999999999999999999999999999999753


No 42 
>KOG3223|consensus
Probab=76.58  E-value=4  Score=32.29  Aligned_cols=44  Identities=18%  Similarity=0.460  Sum_probs=39.5

Q ss_pred             CCCCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHhhcCCHH
Q psy14923         50 KKPLTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEA   93 (180)
Q Consensus        50 KrP~say~lF~~e~r~~~~~~~p~~~~~ei~k~l~~~Wk~Ls~~   93 (180)
                      +|=.-||.-|-....+.++.+||++..++.-.+|-.+|..-+|.
T Consensus       166 kRmrAA~~afEe~~LPrLK~e~P~lrlsQ~Kqll~Kew~KsPDN  209 (221)
T KOG3223|consen  166 KRMRAAFKAFEEARLPRLKKENPGLRLSQYKQLLKKEWQKSPDN  209 (221)
T ss_pred             HHHHHHHHHHHHhhchhhhhcCCCccHHHHHHHHHHHHhhCCCC
Confidence            44467799999999999999999999999999999999988874


No 43 
>PF05494 Tol_Tol_Ttg2:  Toluene tolerance, Ttg2 ;  InterPro: IPR008869 Toluene tolerance is mediated by increased cell membrane rigidity resulting from changes in fatty acid and phospholipid compositions, exclusion of toluene from the cell membrane, and removal of intracellular toluene by degradation []. Many proteins are involved in these processes. This family is a transporter which shows similarity to ABC transporters [].; PDB: 2QGU_A.
Probab=75.39  E-value=5  Score=30.55  Aligned_cols=47  Identities=17%  Similarity=0.475  Sum_probs=34.4

Q ss_pred             CCCCHHHHHHH-HHHHhhcCCHHHHHHHHHHHHHH-HHHHHHHHHhhhh
Q psy14923         72 PNIKIIEVTKL-VSEEWKKIGEAEKKRLENEYAKE-KEIYAQNVKKYHD  118 (180)
Q Consensus        72 p~~~~~ei~k~-l~~~Wk~Ls~~eK~~y~~~a~~~-k~~y~~~~~~y~~  118 (180)
                      |-.++..+++. ||.-|+.+|+++++.|.+...+. ...|-..+..|..
T Consensus        36 ~~~D~~~~ar~~LG~~w~~~s~~q~~~F~~~f~~~l~~~Y~~~l~~y~~   84 (170)
T PF05494_consen   36 PYFDFERMARRVLGRYWRKASPAQRQRFVEAFKQLLVRTYAKRLDEYSG   84 (170)
T ss_dssp             GGB-HHHHHHHHHGGGTTTS-HHHHHHHHHHHHHHHHHHHHHHHHT-SS
T ss_pred             HhCCHHHHHHHHHHHhHhhCCHHHHHHHHHHHHHHHHHHHHHHHHhhCC
Confidence            55677777665 78899999999999999988774 4556677777765


No 44 
>KOG3838|consensus
Probab=65.53  E-value=7.6  Score=34.17  Aligned_cols=37  Identities=11%  Similarity=0.173  Sum_probs=32.5

Q ss_pred             cCCHHHHHHHHHHHHHHHHHHHHHHHhhhhhCCHHHH
Q psy14923         89 KIGEAEKKRLENEYAKEKEIYAQNVKKYHDELTPEAI  125 (180)
Q Consensus        89 ~Ls~~eK~~y~~~a~~~k~~y~~~~~~y~~~~~~~~~  125 (180)
                      ++.+.||++|.++.+.....+.++.++|.+.+++++.
T Consensus       268 E~qe~ek~kyqeEfe~~q~elek~k~efkk~hpd~~~  304 (497)
T KOG3838|consen  268 EMQELEKAKYQEEFEWAQLELEKRKDEFKKSHPDAQG  304 (497)
T ss_pred             hhhHHHHHHHHHHHHHHHHHHhhhHhhhccCCchhhc
Confidence            4556789999999999999999999999999998753


No 45 
>PF04690 YABBY:  YABBY protein;  InterPro: IPR006780 YABBY proteins are a group of plant-specific transcription factors involved in the specification of abaxial polarity in lateral organs such as leaves and floral organs [, ].
Probab=65.27  E-value=6  Score=30.72  Aligned_cols=27  Identities=19%  Similarity=0.373  Sum_probs=21.6

Q ss_pred             cCCCCCCCchHHHHHHHhhhhhhhcCC
Q psy14923        151 TGKPKKPGNSFYLFIKDNIDKKEYEGK  177 (180)
Q Consensus       151 ~~~PKrP~say~lF~~e~r~~~~~~~~  177 (180)
                      +.+-.|-.|||+.|++|....++..+.
T Consensus       119 PEKRqR~psaYn~f~k~ei~rik~~~p  145 (170)
T PF04690_consen  119 PEKRQRVPSAYNRFMKEEIQRIKAENP  145 (170)
T ss_pred             ccccCCCchhHHHHHHHHHHHHHhcCC
Confidence            334456699999999999999887764


No 46 
>PF01352 KRAB:  KRAB box;  InterPro: IPR001909 The Krueppel-associated box (KRAB) is a domain of around 75 amino acids that is found in the N-terminal part of about one third of eukaryotic Krueppel-type C2H2 zinc finger proteins (ZFPs) []. It is enriched in charged amino acids and can be divided into subregions A and B, which are predicted to fold into two amphipathic alpha-helices. The KRAB A and B boxes can be separated by variable spacer segments and many KRAB proteins contain only the A box []. The functions currently known for members of the KRAB-containing protein family include transcriptional repression of RNA polymerase I, II, and III promoters, binding and splicing of RNA, and control of nucleolus function. The KRAB domain functions as a transcriptional repressor when tethered to the template DNA by a DNA-binding domain. A sequence of 45 amino acids in the KRAB A subdomain has been shown to be necessary and sufficient for transcriptional repression. The B box does not repress by itself but does potentiate the repression exerted by the KRAB A subdomain [, ]. Gene silencing requires the binding of the KRAB domain to the RING-B box-coiled coil (RBCC) domain of the KAP-1/TIF1-beta corepressor. As KAP-1 binds to the heterochromatin proteins HP1, it has been proposed that the KRAB-ZFP-bound target gene could be silenced following recruitment to heterochromatin [, ]. KRAB-ZFPs probably constitute the single largest class of transcription factors within the human genome []. Although the function of KRAB-ZFPs is largely unknown, they appear to play important roles during cell differentiation and development. The KRAB domain is generally encoded by two exons. The regions coded by the two exons are known as KRAB-A and KRAB-B.; GO: 0003676 nucleic acid binding, 0006355 regulation of transcription, DNA-dependent, 0005622 intracellular; PDB: 1V65_A.
Probab=63.64  E-value=5.7  Score=23.34  Aligned_cols=29  Identities=24%  Similarity=0.402  Sum_probs=16.6

Q ss_pred             HHHHHHHHH-HHhhcCCHHHHHHHHHHHHH
Q psy14923         76 IIEVTKLVS-EEWKKIGEAEKKRLENEYAK  104 (180)
Q Consensus        76 ~~ei~k~l~-~~Wk~Ls~~eK~~y~~~a~~  104 (180)
                      +.+|+--++ +.|..|.+.+|.-|.+...+
T Consensus         3 f~Dvav~fs~eEW~~L~~~Qk~ly~dvm~E   32 (41)
T PF01352_consen    3 FEDVAVYFSQEEWELLDPAQKNLYRDVMLE   32 (41)
T ss_dssp             ----TT---HHHHHTS-HHHHHHHHHHHHH
T ss_pred             EEEEEEEcChhhcccccceecccchhHHHH
Confidence            445555555 56999999999998876544


No 47 
>PF13875 DUF4202:  Domain of unknown function (DUF4202)
Probab=62.72  E-value=19  Score=28.33  Aligned_cols=39  Identities=10%  Similarity=0.319  Sum_probs=33.2

Q ss_pred             hHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHhhcCCHHHHH
Q psy14923         55 PYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAEKK   96 (180)
Q Consensus        55 ay~lF~~e~r~~~~~~~p~~~~~ei~k~l~~~Wk~Ls~~eK~   96 (180)
                      +=++|...+.+.+...|   +-..+..++..-|+.||+..++
T Consensus       131 acLVFL~~~f~~F~~~~---deeK~v~Il~KTw~KMS~~g~~  169 (185)
T PF13875_consen  131 ACLVFLEYYFEDFAAKH---DEEKIVDILRKTWRKMSERGHE  169 (185)
T ss_pred             HHHHhHHHHHHHHHhcC---CHHHHHHHHHHHHHHCCHHHHH
Confidence            46789999999998877   5567888999999999998775


No 48 
>PF06945 DUF1289:  Protein of unknown function (DUF1289);  InterPro: IPR010710 This family consists of a number of hypothetical bacterial proteins. The aligned region spans around 56 residues and contains 4 highly conserved cysteine residues towards the N terminus. The function of this family is unknown.
Probab=56.30  E-value=21  Score=21.85  Aligned_cols=26  Identities=15%  Similarity=0.473  Sum_probs=18.8

Q ss_pred             CHHHHHHHHHHHhhcCCHHHHHHHHHHHHHH
Q psy14923         75 KIIEVTKLVSEEWKKIGEAEKKRLENEYAKE  105 (180)
Q Consensus        75 ~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~  105 (180)
                      +..||..     |..|++++|....+.....
T Consensus        23 T~dEI~~-----W~~~s~~er~~i~~~l~~R   48 (51)
T PF06945_consen   23 TLDEIRD-----WKSMSDDERRAILARLRAR   48 (51)
T ss_pred             cHHHHHH-----HhhCCHHHHHHHHHHHHHH
Confidence            4566655     9999999998776655443


No 49 
>PRK09706 transcriptional repressor DicA; Reviewed
Probab=52.90  E-value=53  Score=23.87  Aligned_cols=42  Identities=17%  Similarity=0.153  Sum_probs=35.9

Q ss_pred             HHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhhhhh
Q psy14923         78 EVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYHDE  119 (180)
Q Consensus        78 ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~y~~~  119 (180)
                      +-...+-..|+.|++++++..........+.|.+-+++|...
T Consensus        87 ~~~~~ll~~~~~L~~~~~~~~l~~l~~~~~~~~~~~~~~~~~  128 (135)
T PRK09706         87 EDQKELLELFDALPESEQDAQLSEMRARVENFNKLFEELLKA  128 (135)
T ss_pred             HHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            345688999999999999999999999888888888877763


No 50 
>KOG0528|consensus
Probab=49.25  E-value=6.9  Score=35.11  Aligned_cols=20  Identities=35%  Similarity=0.526  Sum_probs=17.5

Q ss_pred             CCCCCCchHHHHHHHhhhhh
Q psy14923        153 KPKKPGNSFYLFIKDNIDKK  172 (180)
Q Consensus       153 ~PKrP~say~lF~~e~r~~~  172 (180)
                      --|||+||||++.++.|.++
T Consensus       325 HIKRPMNAFMVWAkDERRKI  344 (511)
T KOG0528|consen  325 HIKRPMNAFMVWAKDERRKI  344 (511)
T ss_pred             cccCCcchhhcccchhhhhh
Confidence            45999999999999998765


No 51 
>PF12650 DUF3784:  Domain of unknown function (DUF3784);  InterPro: IPR017259 This group represents an uncharacterised conserved protein.
Probab=43.11  E-value=17  Score=25.10  Aligned_cols=15  Identities=13%  Similarity=0.501  Sum_probs=13.0

Q ss_pred             HhhcCCHHHHHHHHH
Q psy14923         86 EWKKIGEAEKKRLEN  100 (180)
Q Consensus        86 ~Wk~Ls~~eK~~y~~  100 (180)
                      -|+.||+|||+.|.+
T Consensus        25 Gyntms~eEk~~~D~   39 (97)
T PF12650_consen   25 GYNTMSKEEKEKYDK   39 (97)
T ss_pred             hcccCCHHHHHHhhH
Confidence            489999999999965


No 52 
>COG2854 Ttg2D ABC-type transport system involved in resistance to organic solvents, auxiliary component [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=41.66  E-value=37  Score=27.16  Aligned_cols=41  Identities=10%  Similarity=0.276  Sum_probs=34.1

Q ss_pred             HHHHHHhhcCCHHHHHHHHHHHHHH-HHHHHHHHHhhhhhCC
Q psy14923         81 KLVSEEWKKIGEAEKKRLENEYAKE-KEIYAQNVKKYHDELT  121 (180)
Q Consensus        81 k~l~~~Wk~Ls~~eK~~y~~~a~~~-k~~y~~~~~~y~~~~~  121 (180)
                      ..||.-|+.+|+++++.|.+..... ...|-..+..|..+..
T Consensus        78 ~vLGk~~k~aspeQ~~~F~~aF~~yl~q~Y~~aL~~Y~~q~~  119 (202)
T COG2854          78 LVLGKYYKTASPEQRQAFFKAFRTYLEQTYGQALLDYKGQTL  119 (202)
T ss_pred             HHhccccccCCHHHHHHHHHHHHHHHHHHHHHHHHHccCCCc
Confidence            4588999999999999999888774 4568889999988653


No 53 
>PRK10236 hypothetical protein; Provisional
Probab=40.30  E-value=33  Score=28.06  Aligned_cols=26  Identities=19%  Similarity=0.494  Sum_probs=21.1

Q ss_pred             HHHHHHHHhhcCCHHHHHHHHHHHHH
Q psy14923         79 VTKLVSEEWKKIGEAEKKRLENEYAK  104 (180)
Q Consensus        79 i~k~l~~~Wk~Ls~~eK~~y~~~a~~  104 (180)
                      +.+.+...|..||++|++.+.+.-..
T Consensus       118 l~kll~~a~~kms~eE~~~L~~~l~~  143 (237)
T PRK10236        118 LEQFLRNTWKKMDEEHKQEFLHAVDA  143 (237)
T ss_pred             HHHHHHHHHHHCCHHHHHHHHHHHhh
Confidence            47789999999999999888765443


No 54 
>KOG1610|consensus
Probab=32.79  E-value=1.3e+02  Score=25.73  Aligned_cols=39  Identities=21%  Similarity=0.516  Sum_probs=30.9

Q ss_pred             HHHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhhh
Q psy14923         76 IIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYH  117 (180)
Q Consensus        76 ~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~y~  117 (180)
                      ...+.+.+.+.|..|+++.|+.|-+.+..+   |...+..|.
T Consensus       218 ~~~~~~~~~~~w~~l~~e~k~~YGedy~~~---~~~~~~~~~  256 (322)
T KOG1610|consen  218 PEKLEKRMKEIWERLPQETKDEYGEDYFED---YKKSLEKYL  256 (322)
T ss_pred             hHHHHHHHHHHHhcCCHHHHHHHHHHHHHH---HHHHHHhhh
Confidence            578889999999999999999998877665   444455444


No 55 
>PRK12751 cpxP periplasmic stress adaptor protein CpxP; Reviewed
Probab=32.16  E-value=1.2e+02  Score=23.27  Aligned_cols=23  Identities=9%  Similarity=0.208  Sum_probs=12.2

Q ss_pred             HHHHhhcCCHHHHHHHHHHHHHH
Q psy14923         83 VSEEWKKIGEAEKKRLENEYAKE  105 (180)
Q Consensus        83 l~~~Wk~Ls~~eK~~y~~~a~~~  105 (180)
                      .-+++..|++++|..|.+..++.
T Consensus       123 ~~qmy~lLTPEQra~l~~~~e~r  145 (162)
T PRK12751        123 RNQMYNLLTPEQKEALNKKHQER  145 (162)
T ss_pred             HHHHHHcCCHHHHHHHHHHHHHH
Confidence            34455566666666665544433


No 56 
>PRK12750 cpxP periplasmic repressor CpxP; Reviewed
Probab=31.94  E-value=1.4e+02  Score=22.99  Aligned_cols=23  Identities=17%  Similarity=0.226  Sum_probs=14.1

Q ss_pred             HHHHhhhhhCCHHHHHHHHHHHH
Q psy14923        111 QNVKKYHDELTPEAIEFLRKEKE  133 (180)
Q Consensus       111 ~~~~~y~~~~~~~~~~~~~~~~~  133 (180)
                      +...+....+||+++..+.....
T Consensus       128 ~~~~~~~~vLTpEQRak~~e~~~  150 (170)
T PRK12750        128 EKRHQMLSILTPEQKAKFQELQQ  150 (170)
T ss_pred             HHHHHHHHhCCHHHHHHHHHHHH
Confidence            33444566788888776665543


No 57 
>cd07081 ALDH_F20_ACDH_EutE-like Coenzyme A acylating aldehyde dehydrogenase (ACDH), Ethanolamine utilization protein EutE, and related proteins. Coenzyme A acylating aldehyde dehydrogenase (ACDH), an NAD+ and CoA-dependent acetaldehyde dehydrogenase, acetylating (EC=1.2.1.10), functions as a single enzyme (such as the Ethanolamine utilization protein, EutE, in Salmonella typhimurium) or as part of a multifunctional enzyme to convert acetaldehyde into acetyl-CoA. The E. coli aldehyde-alcohol dehydrogenase includes the functional domains, alcohol dehydrogenase (ADH), ACDH, and pyruvate-formate-lyase deactivase; and the Entamoeba histolytica aldehyde-alcohol dehydrogenase 2 (ALDH20A1) includes the functional domains ADH and ACDH, and may be critical enzymes in the fermentative pathway.
Probab=31.48  E-value=1.2e+02  Score=26.85  Aligned_cols=41  Identities=10%  Similarity=-0.026  Sum_probs=34.1

Q ss_pred             HHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhhhh
Q psy14923         78 EVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYHD  118 (180)
Q Consensus        78 ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~y~~  118 (180)
                      +.++..-..|+.++.++|..+...+.+..+++.+++.....
T Consensus         6 ~~A~~A~~~W~~~~~~~R~~iL~~~a~~l~~~~~ela~~~~   46 (439)
T cd07081           6 AAAKVAQQGLSCKSQEMVDLIFRAAAEAAEDARIDLAKLAV   46 (439)
T ss_pred             HHHHHHHHHHhhCCHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            34555667899999999999999999999999888887643


No 58 
>PF15581 Imm35:  Immunity protein 35
Probab=29.06  E-value=92  Score=21.55  Aligned_cols=23  Identities=22%  Similarity=0.471  Sum_probs=18.0

Q ss_pred             CHHHHHHHHHHHhhcCCHHHHHH
Q psy14923         75 KIIEVTKLVSEEWKKIGEAEKKR   97 (180)
Q Consensus        75 ~~~ei~k~l~~~Wk~Ls~~eK~~   97 (180)
                      +..-+...+...|+.|++++=..
T Consensus        31 ~i~~l~~lIe~eWRGl~~~qV~~   53 (93)
T PF15581_consen   31 TIRNLESLIEHEWRGLPEEQVLY   53 (93)
T ss_pred             HHHHHHHHHHHHHcCCCHHHHHH
Confidence            35667889999999999876543


No 59 
>PRK14296 chaperone protein DnaJ; Provisional
Probab=26.85  E-value=1.3e+02  Score=26.18  Aligned_cols=38  Identities=11%  Similarity=0.204  Sum_probs=27.2

Q ss_pred             HHHHHHhCCCCC----HHHHHHHHHHHhhcCCHHHHHHHHHH
Q psy14923         64 RNKIMERNPNIK----IIEVTKLVSEEWKKIGEAEKKRLENE  101 (180)
Q Consensus        64 r~~~~~~~p~~~----~~ei~k~l~~~Wk~Ls~~eK~~y~~~  101 (180)
                      |...+.-|||.+    ..+-.++|.+.|..|||.+|..-.++
T Consensus        26 rkla~~~HPD~n~~~~a~~~F~~i~~AyevLsD~~KR~~YD~   67 (372)
T PRK14296         26 RKLAKQYHPDLNKSPDAHDKMVEINEAADVLLDKDKRKQYDQ   67 (372)
T ss_pred             HHHHHHHCcCCCCCchHHHHHHHHHHHHHHhcCHHHhhhhhh
Confidence            333445677764    45778889999999999987755554


No 60 
>PF02026 RyR:  RyR domain;  InterPro: IPR003032 This domain is called RyR for Ryanodine receptor []. The domain is found in four copies in the ryanodine receptor. The function of this domain is unknown.; PDB: 4ETV_A 3RQR_A 4ETT_A 4ERT_A 4ESU_A 4ETU_A 4ERV_A 3NRT_E.
Probab=26.33  E-value=69  Score=22.18  Aligned_cols=21  Identities=14%  Similarity=0.267  Sum_probs=15.8

Q ss_pred             HhhcCCHHHHHHHHHHHHHHH
Q psy14923         86 EWKKIGEAEKKRLENEYAKEK  106 (180)
Q Consensus        86 ~Wk~Ls~~eK~~y~~~a~~~k  106 (180)
                      -|..|++++|+.+.+.+.+-.
T Consensus        60 py~~L~e~eK~~dr~~~~e~l   80 (94)
T PF02026_consen   60 PYDELSEEEKEKDRDMVRETL   80 (94)
T ss_dssp             -GGGS-HHHHHHHHHHHHHHH
T ss_pred             ChhhCCHHHHHHhHHHHHHHH
Confidence            499999999999988776543


No 61 
>PF06628 Catalase-rel:  Catalase-related immune-responsive;  InterPro: IPR010582 Catalases (1.11.1.6 from EC) are antioxidant enzymes that catalyse the conversion of hydrogen peroxide to water and molecular oxygen, serving to protect cells from its toxic effects []. Hydrogen peroxide is produced as a consequence of oxidative cellular metabolism and can be converted to the highly reactive hydroxyl radical via transition metals, this radical being able to damage a wide variety of molecules within a cell, leading to oxidative stress and cell death. Catalases act to neutralise hydrogen peroxide toxicity, and are produced by all aerobic organisms ranging from bacteria to man. Most catalases are mono-functional, haem-containing enzymes, although there are also bifunctional haem-containing peroxidase/catalases (IPR000763 from INTERPRO) that are closely related to plant peroxidases, and non-haem, manganese-containing catalases (IPR007760 from INTERPRO) that are found in bacteria []. This entry represents a small conserved region within catalase enzymes that carries the immune-responsive amphipathic octa-peptide that is recognised by T cells [].; PDB: 2CAH_A 1NM0_A 1H7K_A 1E93_A 1H6N_A 3HB6_A 2CAG_A 1M85_A 1MQF_A 1A4E_C ....
Probab=26.03  E-value=76  Score=20.39  Aligned_cols=19  Identities=16%  Similarity=0.429  Sum_probs=15.8

Q ss_pred             HHHHHhhcCCHHHHHHHHH
Q psy14923         82 LVSEEWKKIGEAEKKRLEN  100 (180)
Q Consensus        82 ~l~~~Wk~Ls~~eK~~y~~  100 (180)
                      ..+..|+.|+++||+.+.+
T Consensus        12 Qa~~ly~~l~~~er~~lv~   30 (68)
T PF06628_consen   12 QARDLYRVLSDEERERLVE   30 (68)
T ss_dssp             HHHHHHHHSSHHHHHHHHH
T ss_pred             hHHHHHHHCCHHHHHHHHH
Confidence            4678899999999987764


No 62 
>PF13945 NST1:  Salt tolerance down-regulator
Probab=25.68  E-value=1.9e+02  Score=22.94  Aligned_cols=37  Identities=19%  Similarity=0.300  Sum_probs=26.2

Q ss_pred             HHHHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHh
Q psy14923         76 IIEVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVKK  115 (180)
Q Consensus        76 ~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~  115 (180)
                      ..+-...|-+-|-.|+++||.....+-++   .-.+.|.+
T Consensus       100 s~eEre~LkeFW~SL~eeERr~LVkIEKe---~VLkkmKe  136 (190)
T PF13945_consen  100 SQEEREKLKEFWESLSEEERRSLVKIEKE---AVLKKMKE  136 (190)
T ss_pred             hHHHHHHHHHHHHccCHHHHHHHHHhhHH---HHHHHHHH
Confidence            44666789999999999999977665444   34444444


No 63 
>KOG4715|consensus
Probab=25.61  E-value=53  Score=28.17  Aligned_cols=29  Identities=14%  Similarity=0.263  Sum_probs=24.1

Q ss_pred             hccCCCCCCCchHHHHHHHhhhhhhhcCC
Q psy14923        149 RETGKPKKPGNSFYLFIKDNIDKKEYEGK  177 (180)
Q Consensus       149 k~~~~PKrP~say~lF~~e~r~~~~~~~~  177 (180)
                      +-|..|-+|+-.|+-|+.-+|++++..|.
T Consensus        60 kpPkppekpl~pymrySrkvWd~VkA~nP   88 (410)
T KOG4715|consen   60 KPPKPPEKPLMPYMRYSRKVWDQVKASNP   88 (410)
T ss_pred             CCCCCCCcccchhhHHhhhhhhhhhccCc
Confidence            34556777899999999999999988775


No 64 
>PF05388 Carbpep_Y_N:  Carboxypeptidase Y pro-peptide;  InterPro: IPR008442 This signature is found at the N terminus of carboxypeptidase Y, which belong to MEROPS peptidase family S10. This region contains the signal peptide and pro-peptide regions [,].; GO: 0004185 serine-type carboxypeptidase activity, 0005773 vacuole
Probab=25.41  E-value=1.4e+02  Score=21.51  Aligned_cols=28  Identities=14%  Similarity=0.234  Sum_probs=23.8

Q ss_pred             HHHHHHHHHHHhhcCCHHHHHHHHHHHH
Q psy14923         76 IIEVTKLVSEEWKKIGEAEKKRLENEYA  103 (180)
Q Consensus        76 ~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~  103 (180)
                      ..-+++.+++.++.|+.+-|+.|.|+..
T Consensus        45 ~~~~~~~l~e~l~~Lt~e~k~~W~E~~~   72 (113)
T PF05388_consen   45 LEKISKYLNEPLKSLTSEAKALWDEMML   72 (113)
T ss_pred             HHHHHHHHHHHHhhccHHHHHHHHHHHH
Confidence            4566778999999999999999988764


No 65 
>PRK14279 chaperone protein DnaJ; Provisional
Probab=24.23  E-value=1.3e+02  Score=26.39  Aligned_cols=41  Identities=24%  Similarity=0.251  Sum_probs=29.2

Q ss_pred             HHHHHHHHhCCCCC-----HHHHHHHHHHHhhcCCHHHHHHHHHHH
Q psy14923         62 QIRNKIMERNPNIK-----IIEVTKLVSEEWKKIGEAEKKRLENEY  102 (180)
Q Consensus        62 e~r~~~~~~~p~~~-----~~ei~k~l~~~Wk~Ls~~eK~~y~~~a  102 (180)
                      ..|...+.-|||..     ..+..+.|.+.|..|||++|..-.+++
T Consensus        29 ayr~la~~~HPD~~~~~~~a~~~f~~i~~Ay~vLsD~~KR~~YD~~   74 (392)
T PRK14279         29 AYRKLARELHPDANPGDPAAEERFKAVSEAHDVLSDPAKRKEYDET   74 (392)
T ss_pred             HHHHHHHHHCcCCCCCChHHHHHHHHHHHHHHHhcchhhhhHHHHh
Confidence            34444556688764     357778899999999999887655543


No 66 
>PF08367 M16C_assoc:  Peptidase M16C associated;  InterPro: IPR013578 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This domain appears in eukaryotes as well as bacteria and tends to be found near the C terminus of metalloproteases and related sequences belonging to MEROPS peptidase family M16 (subfamily M16C, clan ME). These include: eupitrilysin, falcilysin, PreP peptidase, CYM1 peptidase and subfamily M16C non-peptidase homologues.; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis; PDB: 2FGE_B 3S5I_A 3S5H_A 3S5M_A 3S5K_A.
Probab=24.02  E-value=1.7e+02  Score=23.59  Aligned_cols=30  Identities=17%  Similarity=0.159  Sum_probs=23.6

Q ss_pred             HHHHHHHHHHhhcCCHHHHHHHHHHHHHHH
Q psy14923         77 IEVTKLVSEEWKKIGEAEKKRLENEYAKEK  106 (180)
Q Consensus        77 ~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k  106 (180)
                      .+..+.|.+.+..||+++++...+.+++.+
T Consensus        13 ~~e~~~L~~~k~~Ls~~e~~~i~~~~~~L~   42 (248)
T PF08367_consen   13 EEEKEKLAAYKASLSEEEKEKIIEQTKELK   42 (248)
T ss_dssp             HHHHHHHHHHHHCS-HHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHhhCCHHHHHHHHHHHHHHH
Confidence            456778999999999999998888777654


No 67 
>cd07122 ALDH_F20_ACDH Coenzyme A acylating aldehyde dehydrogenase (ACDH), ALDH family 20-like. Coenzyme A acylating aldehyde dehydrogenase (ACDH, EC=1.2.1.10), an NAD+ and CoA-dependent acetaldehyde dehydrogenase, functions as a single enzyme (such as the Ethanolamine utilization protein, EutE, in Salmonella typhimurium) or as part of a multifunctional enzyme to convert acetaldehyde into acetyl-CoA . The E. coli aldehyde-alcohol dehydrogenase includes the functional domains, alcohol dehydrogenase (ADH), ACDH, and pyruvate-formate-lyase deactivase; and the Entamoeba histolytica aldehyde-alcohol dehydrogenase 2 (ALDH20A1) includes the functional domains ADH and ACDH and may be critical enzymes in the fermentative pathway.
Probab=23.73  E-value=2.2e+02  Score=25.22  Aligned_cols=40  Identities=10%  Similarity=0.102  Sum_probs=33.1

Q ss_pred             HHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhhhh
Q psy14923         79 VTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYHD  118 (180)
Q Consensus        79 i~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~y~~  118 (180)
                      .++..-..|+.++.++|..+...+.+..+++.+++.....
T Consensus         7 ~A~~A~~~W~~~~~~eR~~~L~~~a~~l~~~~eela~~~~   46 (436)
T cd07122           7 RARKAQREFATFSQEQVDKIVEAVAWAAADAAEELAKMAV   46 (436)
T ss_pred             HHHHHHHHHHhCCHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            3455567899999999999999999998888888877653


No 68 
>PRK14283 chaperone protein DnaJ; Provisional
Probab=23.68  E-value=1.8e+02  Score=25.24  Aligned_cols=41  Identities=20%  Similarity=0.341  Sum_probs=29.6

Q ss_pred             HHHHHHHHhCCCCC----HHHHHHHHHHHhhcCCHHHHHHHHHHH
Q psy14923         62 QIRNKIMERNPNIK----IIEVTKLVSEEWKKIGEAEKKRLENEY  102 (180)
Q Consensus        62 e~r~~~~~~~p~~~----~~ei~k~l~~~Wk~Ls~~eK~~y~~~a  102 (180)
                      ..|...+.-|||.+    ..+..+.|.+.|..|++.+|..-.+.+
T Consensus        25 ayr~la~~~HPD~~~~~~a~~~f~~i~~Ay~~Lsd~~kR~~YD~~   69 (378)
T PRK14283         25 AYRKLARKYHPDVSEEEGAEEKFKEISEAYAVLSDDEKRQRYDQF   69 (378)
T ss_pred             HHHHHHHHHCcCCCCCccHHHHHHHHHHHHHHhchhHHHHHHhhh
Confidence            34444556688864    467888999999999999876655543


No 69 
>TIGR00787 dctP tripartite ATP-independent periplasmic transporter solute receptor, DctP family. TRAP-T (Tripartite ATP-independent Periplasmic Transporter) family proteins generally consist of three components, and these systems have so far been found in Gram-negative bacteria, Gram-postive bacteria and archaea. The best characterized example is the DctPQM system of Rhodobacter capsulatus, a C4 dicarboxylate (malate, fumarate, succinate) transporter. This model represents the DctP family, one of at least three major families of extracytoplasmic solute receptor for TRAP family transporters. Other are the SnoM family (see pfam03480) and TAXI (TRAP-associated extracytoplasmic immunogenic) family.
Probab=22.83  E-value=1.9e+02  Score=23.19  Aligned_cols=26  Identities=12%  Similarity=0.169  Sum_probs=20.1

Q ss_pred             HHHhhcCCHHHHHHHHHHHHHHHHHH
Q psy14923         84 SEEWKKIGEAEKKRLENEYAKEKEIY  109 (180)
Q Consensus        84 ~~~Wk~Ls~~eK~~y~~~a~~~k~~y  109 (180)
                      ...|..||++.|+...+.+.+.-...
T Consensus       213 ~~~~~~L~~e~q~~i~~a~~~~~~~~  238 (257)
T TIGR00787       213 KAFWKSLPPDLQAVVKEAAKEAGEYQ  238 (257)
T ss_pred             HHHHhcCCHHHHHHHHHHHHHHHHHH
Confidence            57799999999999888776654333


No 70 
>PF06394 Pepsin-I3:  Pepsin inhibitor-3-like repeated domain;  InterPro: IPR010480 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties.  The members of this group of proteins belong to MEROPS inhibitor family I33, clan IR; the nematode aspartyl protease inhibitors or Aspins. They are restricted to parasitic nematode species. Structural features common to the nematode Aspins include the presence of a signal peptide sequence and the conservation of all four cysteine residues in the mature protein. The Y[V.A]RDLT sequence motif has been suggested as being of crucial functional importance in several filarial nematode inhibitors [], this sequence is not conserved in Tco-API-1 from Trichostrongylus colubriformis (Black scour worm) and it has been demonstrated that Tco-API-1, is not an Aspin as it does not inhibit porcine pepsin []. Related inhibitors from Onchocerca volvulus, Ov33 [] and Ascaris suum (Pig roundworm), PI-3 [] inhibit the in vitro activity of aspartyl proteases such as pepsin and cathepsin E (MEROPS peptidase family A1).  Aspin may facilitate the safe passage of the eggs of Ascaris through the host stomach without digestion by pepsin [, ]. The other parasitic nematodes known to express homologous proteins do not pass through the stomach of their hosts []. Several proteins in the family are potent allergens in mammals. The three-dimensional structures of pepsin inhibitor-3 (PI-3) from A. suum and of the complex between PI-3 and porcine pepsin at 1. 75 A and 2.45 A resolution, respectively, have revealed the mechanism of aspartic protease inhibition. PI-3 has a new fold consisting of two identical domains, each comprising an antiparallel beta-sheet flanked by an alpha-helix. In the enzyme-inhibitor complex, the N-terminal beta-strand of PI-3 pairs with one strand of the 'active site flap' (residues 70-82) of pepsin, thus forming an eight-stranded beta-sheet that spans the two proteins. PI-3 has a novel mode of inhibition, using its N-terminal residues to occupy and therefore block the first three binding pockets in pepsin for substrate residues C-terminal to the scissile bond (S1'-S3') [].; PDB: 1F32_A 1F34_B.
Probab=22.80  E-value=82  Score=21.13  Aligned_cols=26  Identities=23%  Similarity=0.577  Sum_probs=17.1

Q ss_pred             hcCCHHHHHHHHHHHHHHHHHHHHHHHhhhhhCC
Q psy14923         88 KKIGEAEKKRLENEYAKEKEIYAQNVKKYHDELT  121 (180)
Q Consensus        88 k~Ls~~eK~~y~~~a~~~k~~y~~~~~~y~~~~~  121 (180)
                      +.|+++|++.+        +.|.+++.+|...+.
T Consensus        38 R~Lt~~E~~eL--------~~y~~~v~~y~~~l~   63 (76)
T PF06394_consen   38 RDLTPDEQQEL--------KTYQKKVAAYKEQLQ   63 (76)
T ss_dssp             EE--HHHHHHH--------HHHHHHHHHHHHHHT
T ss_pred             ccCCHHHHHHH--------HHHHHHHHHHHHHHH
Confidence            56777777654        568888888887654


No 71 
>PRK14291 chaperone protein DnaJ; Provisional
Probab=22.20  E-value=1.8e+02  Score=25.25  Aligned_cols=40  Identities=20%  Similarity=0.363  Sum_probs=28.5

Q ss_pred             HHHHHHHHhCCCCC----HHHHHHHHHHHhhcCCHHHHHHHHHH
Q psy14923         62 QIRNKIMERNPNIK----IIEVTKLVSEEWKKIGEAEKKRLENE  101 (180)
Q Consensus        62 e~r~~~~~~~p~~~----~~ei~k~l~~~Wk~Ls~~eK~~y~~~  101 (180)
                      ..|...+.-|||.+    ..+..+.|.+.|..|++.+|..-.+.
T Consensus        23 ayr~la~~~HPD~~~~~~~~~~f~~i~~Ay~vLsd~~kR~~YD~   66 (382)
T PRK14291         23 AYRRLARKYHPDFNKNPEAEEKFKEINEAYQVLSDPEKRKLYDQ   66 (382)
T ss_pred             HHHHHHHHHCCCCCCCccHHHHHHHHHHHHHHhcCHHHHHHHhh
Confidence            34444556678764    45778899999999999887655554


No 72 
>cd07132 ALDH_F3AB Aldehyde dehydrogenase family 3 members A1, A2, and B1 and related proteins. NAD(P)+-dependent, aldehyde dehydrogenase, family 3 members A1 and B1  (ALDH3A1, ALDH3B1,  EC=1.2.1.5) and fatty aldehyde dehydrogenase, family 3 member A2 (ALDH3A2, EC=1.2.1.3), and similar sequences are included in this CD. Human ALDH3A1 is a homodimer with a critical role in cellular defense against oxidative stress; it catalyzes the oxidation of various cellular membrane lipid-derived aldehydes. Corneal crystalline ALDH3A1 protects the cornea and underlying lens against UV-induced oxidative stress. Human ALDH3A2, a microsomal homodimer, catalyzes the oxidation of long-chain aliphatic aldehydes to fatty acids. Human ALDH3B1 is highly expressed in the kidney and liver and catalyzes the oxidation of various medium- and long-chain saturated and unsaturated aliphatic aldehydes.
Probab=22.15  E-value=2.2e+02  Score=25.16  Aligned_cols=41  Identities=5%  Similarity=-0.154  Sum_probs=33.8

Q ss_pred             HHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhhhh
Q psy14923         78 EVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYHD  118 (180)
Q Consensus        78 ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~y~~  118 (180)
                      +.++..-..|+.++..+|..+........+.+.+++.+-..
T Consensus         5 ~~A~~A~~~w~~~~~~~R~~~L~~~a~~l~~~~~~l~~~~~   45 (443)
T cd07132           5 RRAREAFSSGKTRPLEFRIQQLEALLRMLEENEDEIVEALA   45 (443)
T ss_pred             HHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHhHHHHHHHHH
Confidence            34556667899999999999999999998888888876553


No 73 
>PTZ00037 DnaJ_C chaperone protein; Provisional
Probab=22.11  E-value=1.7e+02  Score=25.94  Aligned_cols=43  Identities=21%  Similarity=0.383  Sum_probs=32.2

Q ss_pred             HHHHHHHHHHhCCCCC-HHHHHHHHHHHhhcCCHHHHHHHHHHH
Q psy14923         60 QGQIRNKIMERNPNIK-IIEVTKLVSEEWKKIGEAEKKRLENEY  102 (180)
Q Consensus        60 ~~e~r~~~~~~~p~~~-~~ei~k~l~~~Wk~Ls~~eK~~y~~~a  102 (180)
                      -+..|...+.-|||.. ..+..+.|.+.|..|++.+|....+.+
T Consensus        46 KkAYrkla~k~HPDk~~~~e~F~~i~~AYevLsD~~kR~~YD~~   89 (421)
T PTZ00037         46 KKAYRKLAIKHHPDKGGDPEKFKEISRAYEVLSDPEKRKIYDEY   89 (421)
T ss_pred             HHHHHHHHHHHCCCCCchHHHHHHHHHHHHHhccHHHHHHHhhh
Confidence            4455555667799975 357888999999999999877666653


No 74 
>cd08317 Death_ank Death domain associated with Ankyrins. Death Domain (DD) associated with Ankyrins. Ankyrins are modular proteins comprising three conserved domains, an N-terminal membrane-binding domain containing ANK repeats, a spectrin-binding domain and a C-terminal DD. Ankyrins function as adaptor proteins and they interact, through ANK repeats, with structurally diverse membrane proteins, including ion channels/pumps, calcium release channels, and cell adhesion molecules. They play critical roles in the proper expression and membrane localization of these proteins. In mammals, this family includes ankyrin-R for restricted (or ANK1), ankyrin-B for broadly expressed (or ANK2) and ankyrin-G for general or giant (or ANK3). They are expressed in different combinations in many tissues and play non-overlapping functions. In general, DDs are protein-protein interaction domains found in a variety of domain architectures. Their common feature is that they form homodimers by self-associati
Probab=21.76  E-value=50  Score=22.06  Aligned_cols=21  Identities=14%  Similarity=0.749  Sum_probs=17.6

Q ss_pred             CCCCHHHHHHHHHHHhhcCCH
Q psy14923         72 PNIKIIEVTKLVSEEWKKIGE   92 (180)
Q Consensus        72 p~~~~~ei~k~l~~~Wk~Ls~   92 (180)
                      +++.+..|+..||..|+.|..
T Consensus         3 ~~~~l~~ia~~lG~dW~~LAr   23 (84)
T cd08317           3 ADIRLADISNLLGSDWPQLAR   23 (84)
T ss_pred             ccchHHHHHHHHhhHHHHHHH
Confidence            567789999999999998754


No 75 
>cd07085 ALDH_F6_MMSDH Methylmalonate semialdehyde dehydrogenase and ALDH family members 6A1 and 6B2. Methylmalonate semialdehyde dehydrogenase (MMSDH, EC=1.2.1.27) [acylating] from Bacillus subtilis is involved in valine metabolism and catalyses the NAD+- and CoA-dependent oxidation of methylmalonate semialdehyde into propionyl-CoA. Mitochondrial human MMSDH ALDH6A1 and Arabidopsis MMSDH ALDH6B2 are also present in this CD.
Probab=21.16  E-value=2.5e+02  Score=24.91  Aligned_cols=37  Identities=8%  Similarity=0.165  Sum_probs=30.7

Q ss_pred             HHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhh
Q psy14923         80 TKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKY  116 (180)
Q Consensus        80 ~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~y  116 (180)
                      ++.....|+.++.++|..+...+.+..+++.+++..-
T Consensus        47 A~~A~~~w~~~~~~~R~~~L~~~a~~l~~~~~el~~~   83 (478)
T cd07085          47 AKAAFPAWSATPVLKRQQVMFKFRQLLEENLDELARL   83 (478)
T ss_pred             HHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            4445567999999999999999999888888777764


No 76 
>cd07133 ALDH_CALDH_CalB Coniferyl aldehyde dehydrogenase-like. Coniferyl aldehyde dehydrogenase (CALDH, EC=1.2.1.68) of Pseudomonas sp. strain HR199 (CalB) which catalyzes the NAD+-dependent oxidation of coniferyl aldehyde to ferulic acid, and similar sequences, are present in this CD.
Probab=21.02  E-value=2.4e+02  Score=24.83  Aligned_cols=40  Identities=8%  Similarity=-0.168  Sum_probs=33.6

Q ss_pred             HHHHHHHHHhhcCCHHHHHHHHHHHHHHHHHHHHHHHhhh
Q psy14923         78 EVTKLVSEEWKKIGEAEKKRLENEYAKEKEIYAQNVKKYH  117 (180)
Q Consensus        78 ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~y~  117 (180)
                      +.++..-..|+.++..+|..+.....+..+.+.+++....
T Consensus         5 ~~a~~a~~~w~~~~~~~R~~~L~~~a~~l~~~~~el~~~~   44 (434)
T cd07133           5 ERQKAAFLANPPPSLEERRDRLDRLKALLLDNQDALAEAI   44 (434)
T ss_pred             HHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            3455666789999999999999999999999998888754


No 77 
>PRK14299 chaperone protein DnaJ; Provisional
Probab=20.82  E-value=2.2e+02  Score=23.70  Aligned_cols=39  Identities=18%  Similarity=0.373  Sum_probs=26.3

Q ss_pred             HHHHHHHhCCCCC----HHHHHHHHHHHhhcCCHHHHHHHHHH
Q psy14923         63 IRNKIMERNPNIK----IIEVTKLVSEEWKKIGEAEKKRLENE  101 (180)
Q Consensus        63 ~r~~~~~~~p~~~----~~ei~k~l~~~Wk~Ls~~eK~~y~~~  101 (180)
                      .|...+.-||+..    ..+..+.+.+.|..|++.+|..-.+.
T Consensus        25 yr~la~~~HPD~~~~~~~~~~f~~i~~Ay~~L~d~~kr~~yD~   67 (291)
T PRK14299         25 FKKLARKYHPDVNKSPGAEEKFKEINEAYTVLSDPEKRRIYDT   67 (291)
T ss_pred             HHHHHHHHCcCCCCChhHHHHHHHHHHHHHHhcCHHHHHHHHh
Confidence            3444445577764    45677889999999998876644443


No 78 
>KOG1827|consensus
Probab=20.56  E-value=6.1  Score=36.69  Aligned_cols=44  Identities=27%  Similarity=0.420  Sum_probs=39.6

Q ss_pred             CCChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHhhcCCHHHH
Q psy14923         52 PLTPYFKFQGQIRNKIMERNPNIKIIEVTKLVSEEWKKIGEAEK   95 (180)
Q Consensus        52 P~say~lF~~e~r~~~~~~~p~~~~~ei~k~l~~~Wk~Ls~~eK   95 (180)
                      -.++|++|..+.+..+-.++|+..+++++.+.|..|..|+..-|
T Consensus       552 ~~~~~~~~s~~~~~~~~~~np~v~~~~~~~~vg~~~~~lp~~~k  595 (629)
T KOG1827|consen  552 SPEPYILDSIENRTIIWFENPTVGFGEVSIIVGNDWDKLPNINK  595 (629)
T ss_pred             CCccccccccccCceeeeeCCCcccceeEEeecCCcccCccccc
Confidence            56789999999999999999999999999999999999994433


No 79 
>PRK10266 curved DNA-binding protein CbpA; Provisional
Probab=20.40  E-value=2.5e+02  Score=23.55  Aligned_cols=39  Identities=15%  Similarity=0.331  Sum_probs=27.4

Q ss_pred             HHHHHHHhCCCCC----HHHHHHHHHHHhhcCCHHHHHHHHHH
Q psy14923         63 IRNKIMERNPNIK----IIEVTKLVSEEWKKIGEAEKKRLENE  101 (180)
Q Consensus        63 ~r~~~~~~~p~~~----~~ei~k~l~~~Wk~Ls~~eK~~y~~~  101 (180)
                      .|...+.-|||..    ..+..+.+.+.|..|++.+|..-.+.
T Consensus        25 yr~la~k~HPD~~~~~~~~~~f~~i~~Ay~~L~~~~kr~~yD~   67 (306)
T PRK10266         25 YRRLARKYHPDVSKEPDAEARFKEVAEAWEVLSDEQRRAEYDQ   67 (306)
T ss_pred             HHHHHHHHCcCCCCCccHHHHHHHHHHHHHHhhhHHHHHHHHH
Confidence            3444455677754    56788899999999998776644443


No 80 
>PF09164 VitD-bind_III:  Vitamin D binding protein, domain III;  InterPro: IPR015247 This domain is predominantly found in Vitamin D binding proteins, and adopts a multihelical structure. It is required for formation of an actin 'clamp', allowing the protein to bind to actin []. ; PDB: 1MA9_A 1KW2_A 1KXP_D 1J7E_A 1J78_A 1LOT_A.
Probab=20.12  E-value=2.7e+02  Score=18.23  Aligned_cols=32  Identities=13%  Similarity=0.415  Sum_probs=21.3

Q ss_pred             ChHHHHHHHHHHHHHHhCCCCCHHHHHHHHHH
Q psy14923         54 TPYFKFQGQIRNKIMERNPNIKIIEVTKLVSE   85 (180)
Q Consensus        54 say~lF~~e~r~~~~~~~p~~~~~ei~k~l~~   85 (180)
                      +.|.-|-+.-.+.++...|+.+..++...+.+
T Consensus         9 ~tFtEyKKrL~e~l~~k~P~at~~~l~~lve~   40 (68)
T PF09164_consen    9 NTFTEYKKRLAERLRAKLPDATPTELKELVEK   40 (68)
T ss_dssp             S-HHHHHHHHHHHHHHH-TTS-HHHHHHHHHH
T ss_pred             ccHHHHHHHHHHHHHHHCCCCCHHHHHHHHHH
Confidence            34666777777788888888888877766544


No 81 
>PRK10363 cpxP periplasmic repressor CpxP; Reviewed
Probab=20.07  E-value=2.9e+02  Score=21.37  Aligned_cols=23  Identities=4%  Similarity=0.173  Sum_probs=10.6

Q ss_pred             HHHHhhcCCHHHHHHHHHHHHHH
Q psy14923         83 VSEEWKKIGEAEKKRLENEYAKE  105 (180)
Q Consensus        83 l~~~Wk~Ls~~eK~~y~~~a~~~  105 (180)
                      --+++.-|++|+|..|.+..++.
T Consensus       117 ~nqmy~lLTPEQKaq~~~~~~~r  139 (166)
T PRK10363        117 RNQMYRLLTPEQQAVLNEKHQQR  139 (166)
T ss_pred             HHHHHHhCCHHHHHHHHHHHHHH
Confidence            33444555555555554433333


Done!