Query         psy3264
Match_columns 156
No_of_seqs    141 out of 1679
Neff          8.5 
Searched_HMMs 46136
Date          Fri Aug 16 20:36:09 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy3264.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/3264hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 COG5648 NHP6B Chromatin-associ  99.9 4.8E-26   1E-30  166.2  10.4  132    3-136    65-209 (211)
  2 PTZ00199 high mobility group p  99.9 3.1E-23 6.7E-28  137.0   7.9   77    3-79     17-93  (94)
  3 cd01389 MATA_HMG-box MATA_HMG-  99.8 4.9E-20 1.1E-24  117.5   7.9   71    8-80      1-71  (77)
  4 cd01388 SOX-TCF_HMG-box SOX-TC  99.8 1.8E-19 3.9E-24  113.5   7.9   69    9-79      2-70  (72)
  5 cd01390 HMGB-UBF_HMG-box HMGB-  99.8 1.6E-19 3.4E-24  111.5   7.5   65    9-75      1-65  (66)
  6 PF09011 HMG_box_2:  HMG-box do  99.8 1.1E-19 2.3E-24  114.8   6.2   71    6-78      1-72  (73)
  7 PF00505 HMG_box:  HMG (high mo  99.8 1.4E-19   3E-24  112.8   5.9   68    9-78      1-68  (69)
  8 smart00398 HMG high mobility g  99.8 5.7E-19 1.2E-23  110.0   7.4   69    8-78      1-69  (70)
  9 cd00084 HMG-box High Mobility   99.7 1.5E-17 3.3E-22  102.3   7.5   65    9-75      1-65  (66)
 10 KOG0381|consensus               99.7 3.4E-16 7.3E-21  103.6   7.2   72    7-80     21-93  (96)
 11 cd01388 SOX-TCF_HMG-box SOX-TC  99.5 1.7E-15 3.7E-20   95.3   1.3   69   72-140     3-71  (72)
 12 KOG0527|consensus               99.5 1.5E-14 3.2E-19  114.3   6.5   76    3-80     57-132 (331)
 13 KOG0526|consensus               99.5 7.8E-15 1.7E-19  119.8   4.4   71    3-79    530-600 (615)
 14 PTZ00199 high mobility group p  99.5 1.3E-14 2.7E-19   95.9   3.3   73   67-139    19-93  (94)
 15 cd01389 MATA_HMG-box MATA_HMG-  99.5 8.4E-15 1.8E-19   93.3   1.3   71   72-142     3-73  (77)
 16 PF00505 HMG_box:  HMG (high mo  99.4   7E-14 1.5E-18   86.8   2.0   67   73-139     3-69  (69)
 17 cd01390 HMGB-UBF_HMG-box HMGB-  99.4 2.8E-13   6E-18   83.3   2.6   63   73-135     3-65  (66)
 18 KOG0527|consensus               99.4 2.3E-13 5.1E-18  107.5   2.0   77   67-143    59-135 (331)
 19 smart00398 HMG high mobility g  99.3 5.7E-13 1.2E-17   82.6   2.6   67   73-139     4-70  (70)
 20 PF09011 HMG_box_2:  HMG-box do  99.3 1.8E-12 3.9E-17   81.6   2.0   66   74-139     7-73  (73)
 21 KOG0381|consensus               99.2 9.2E-12   2E-16   82.2   3.8   73   70-142    22-95  (96)
 22 cd00084 HMG-box High Mobility   99.2 1.7E-11 3.7E-16   75.1   2.6   63   73-135     3-65  (66)
 23 KOG3248|consensus               99.1 9.3E-11   2E-15   91.5   6.0   70    8-79    191-260 (421)
 24 COG5648 NHP6B Chromatin-associ  99.1 4.1E-11 8.9E-16   88.1   2.9   83   67-149    67-149 (211)
 25 KOG4715|consensus               99.1 9.2E-11   2E-15   90.8   4.7   74    4-79     60-133 (410)
 26 PF14887 HMG_box_5:  HMG (high   98.8 1.3E-08 2.8E-13   63.4   5.9   73    8-83      3-75  (85)
 27 KOG0526|consensus               98.8 2.5E-09 5.5E-14   88.0   2.6   64   72-139   537-600 (615)
 28 KOG0528|consensus               98.6 1.6E-08 3.5E-13   82.4   2.1   71    8-80    325-395 (511)
 29 KOG0528|consensus               98.3 1.3E-07 2.8E-12   77.2   0.9   78   69-146   324-401 (511)
 30 KOG4715|consensus               98.3 8.6E-07 1.9E-11   69.1   4.3   56   85-140    79-134 (410)
 31 KOG2746|consensus               98.3 5.4E-07 1.2E-11   76.2   3.4   66    7-74    180-247 (683)
 32 PF14887 HMG_box_5:  HMG (high   98.1 4.9E-06 1.1E-10   51.9   4.5   54   91-145    24-77  (85)
 33 PF04690 YABBY:  YABBY protein;  97.9 2.2E-05 4.8E-10   56.8   5.5   48    4-53    117-164 (170)
 34 KOG3248|consensus               97.9 1.7E-06 3.6E-11   68.1  -1.1   79   67-145   188-266 (421)
 35 PF06382 DUF1074:  Protein of u  97.4 0.00045 9.8E-09   50.0   5.5   47   13-65     83-129 (183)
 36 KOG2746|consensus               96.8  0.0011 2.3E-08   56.8   2.9   68   67-134   178-247 (683)
 37 PF08073 CHDNT:  CHDNT (NUC034)  96.6  0.0034 7.4E-08   36.9   3.3   40   13-54     13-52  (55)
 38 PF11304 DUF3106:  Protein of u  94.5   0.071 1.5E-06   35.8   4.2   81   39-120    10-103 (107)
 39 PF06382 DUF1074:  Protein of u  94.5   0.038 8.3E-07   40.2   2.9   38   88-125    92-129 (183)
 40 PF06244 DUF1014:  Protein of u  93.1    0.18 3.9E-06   34.7   4.1   48    5-54     68-116 (122)
 41 PF04769 MAT_Alpha1:  Mating-ty  93.0    0.27 5.8E-06   36.8   5.2   55    4-66     39-93  (201)
 42 PF08073 CHDNT:  CHDNT (NUC034)  92.1   0.054 1.2E-06   31.9   0.4   33   82-114    20-52  (55)
 43 TIGR03481 HpnM hopanoid biosyn  92.0    0.38 8.2E-06   35.8   5.0   47   95-141    63-111 (198)
 44 PRK15117 ABC transporter perip  88.9     1.3 2.7E-05   33.4   5.4   49   93-141    65-115 (211)
 45 PF05494 Tol_Tol_Ttg2:  Toluene  80.5     2.8   6E-05   30.1   3.8   47   94-140    36-84  (170)
 46 PF04690 YABBY:  YABBY protein;  80.5    0.54 1.2E-05   34.2   0.1   45   69-113   120-164 (170)
 47 TIGR03481 HpnM hopanoid biosyn  77.8       3 6.6E-05   31.0   3.4   42   38-79     66-109 (198)
 48 PF11304 DUF3106:  Protein of u  75.4      14 0.00031   24.6   5.9   25  101-125    12-36  (107)
 49 PF01352 KRAB:  KRAB box;  Inte  73.3     2.4 5.2E-05   23.3   1.4   28   97-124     2-30  (41)
 50 KOG3223|consensus               73.1     5.1 0.00011   29.7   3.4   44   10-55    166-209 (221)
 51 PF13875 DUF4202:  Domain of un  71.7      10 0.00022   28.0   4.7   40   15-59    131-170 (185)
 52 COG2854 Ttg2D ABC-type transpo  69.3      12 0.00026   28.0   4.7   54   94-147    68-123 (202)
 53 PF12881 NUT_N:  NUT protein N   68.1      14 0.00031   29.5   5.1   52   13-66    229-280 (328)
 54 PF05494 Tol_Tol_Ttg2:  Toluene  64.3       4 8.7E-05   29.3   1.4   41   39-79     41-83  (170)
 55 PRK15117 ABC transporter perip  62.7     6.7 0.00015   29.5   2.3   42   38-79     70-113 (211)
 56 PF06945 DUF1289:  Protein of u  59.6      11 0.00023   21.6   2.3   21  107-127    28-48  (51)
 57 PRK10363 cpxP periplasmic repr  57.0      48   0.001   24.1   5.8   44   99-143   111-154 (166)
 58 PRK09706 transcriptional repre  55.3      62  0.0014   22.1   6.1   46   99-144    86-131 (135)
 59 PF15581 Imm35:  Immunity prote  51.7      38 0.00082   21.9   4.0   30   38-67     32-61  (93)
 60 KOG3838|consensus               51.2      19  0.0004   29.9   3.2   40  111-150   268-307 (497)
 61 PF06244 DUF1014:  Protein of u  51.0      12 0.00026   25.8   1.8   31   84-114    86-116 (122)
 62 PRK12751 cpxP periplasmic stre  49.0      47   0.001   24.0   4.7   77   50-129    55-147 (162)
 63 PF12650 DUF3784:  Domain of un  46.8      14 0.00031   23.9   1.6   18  106-123    23-40  (97)
 64 PRK10236 hypothetical protein;  46.4      23 0.00051   27.2   2.9   26  101-126   118-143 (237)
 65 PRK10236 hypothetical protein;  45.6      25 0.00055   27.0   3.0   28   39-66    116-143 (237)
 66 PF11616 EZH2_WD-Binding:  WD r  43.1      47   0.001   16.9   2.9   14   39-52     12-25  (30)
 67 PRK12750 cpxP periplasmic repr  40.3      90  0.0019   22.6   5.1   35  101-135   126-160 (170)
 68 cd07081 ALDH_F20_ACDH_EutE-lik  39.4      89  0.0019   26.2   5.6   42  100-141     6-47  (439)
 69 KOG3223|consensus               39.3      25 0.00055   26.1   2.1   32   84-115   178-209 (221)
 70 PF06424 PRP1_N:  PRP1 splicing  34.4      33 0.00071   24.0   2.0   31   25-55     93-127 (133)
 71 PF05914 RIB43A:  RIB43A;  Inte  34.2      29 0.00063   28.6   1.9   40   87-126   250-291 (379)
 72 cd07122 ALDH_F20_ACDH Coenzyme  32.8 1.3E+02  0.0028   25.2   5.6   41  101-141     7-47  (436)
 73 PF09791 Oxidored-like:  Oxidor  32.7      78  0.0017   18.0   3.0   18  123-140    30-47  (48)
 74 PF11278 DUF3079:  Protein of u  32.5      32 0.00069   19.6   1.3   13    1-13      1-13  (52)
 75 cd07133 ALDH_CALDH_CalB Conife  32.1 1.3E+02  0.0028   25.0   5.5   43  100-142     5-47  (434)
 76 cd07085 ALDH_F6_MMSDH Methylma  29.8 1.5E+02  0.0033   24.9   5.6   38  103-140    48-85  (478)
 77 cd07132 ALDH_F3AB Aldehyde deh  29.2 1.5E+02  0.0033   24.7   5.5   42  100-141     5-46  (443)
 78 PF11521 TFIIE-A_C-term:  C-ter  28.6      61  0.0013   20.8   2.3   29   96-127    50-80  (86)
 79 TIGR00787 dctP tripartite ATP-  28.5 1.4E+02   0.003   22.6   4.8   29  106-134   213-241 (257)
 80 PRK12751 cpxP periplasmic stre  28.0   1E+02  0.0022   22.2   3.7   32   40-71    118-149 (162)
 81 PF02026 RyR:  RyR domain;  Int  27.9      61  0.0013   21.1   2.3   19   49-67     61-79  (94)
 82 cd07087 ALDH_F3-13-14_CALDH-li  27.9 1.7E+02  0.0036   24.2   5.5   42  100-141     5-46  (426)
 83 PF05388 Carbpep_Y_N:  Carboxyp  27.9 1.1E+02  0.0024   20.7   3.6   29   38-66     45-73  (113)
 84 PF06628 Catalase-rel:  Catalas  27.5      55  0.0012   19.7   1.9   20   44-63     12-31  (68)
 85 cd08317 Death_ank Death domain  27.4      26 0.00055   22.1   0.5   21   94-114     3-23  (84)
 86 PRK13252 betaine aldehyde dehy  26.5 1.8E+02  0.0039   24.5   5.5   39  103-141    54-92  (488)
 87 cd07150 ALDH_VaniDH_like Pseud  26.4 1.8E+02  0.0039   24.1   5.4   37  103-139    31-67  (451)
 88 cd07152 ALDH_BenzADH NAD-depen  25.6   2E+02  0.0043   23.8   5.5   40  102-141    22-61  (443)
 89 PRK13968 putative succinate se  25.4   2E+02  0.0043   24.1   5.5   53   89-141    19-77  (462)
 90 cd07118 ALDH_SNDH Gluconobacte  25.3   2E+02  0.0043   24.0   5.5   51   90-140    10-68  (454)
 91 PRK10455 periplasmic protein;   25.1 1.5E+02  0.0032   21.3   4.1   27  100-126   118-144 (161)
 92 PF00226 DnaJ:  DnaJ domain;  I  25.1 1.1E+02  0.0024   17.6   3.0   38   21-58     19-60  (64)
 93 cd07084 ALDH_KGSADH-like ALDH   24.9 1.9E+02   0.004   24.2   5.2   42  100-141     6-47  (442)
 94 cd07101 ALDH_SSADH2_GabD2 Myco  24.9   2E+02  0.0044   23.9   5.5   50   92-141    11-66  (454)
 95 PF03480 SBP_bac_7:  Bacterial   24.9 1.5E+02  0.0033   22.7   4.5   32  106-137   213-244 (286)
 96 PRK11241 gabD succinate-semial  24.7   2E+02  0.0044   24.3   5.5   53   89-141    38-96  (482)
 97 cd07077 ALDH-like NAD(P)+-depe  24.3 1.9E+02  0.0041   23.6   5.1   39  102-140     3-41  (397)
 98 cd07099 ALDH_DDALDH Methylomon  24.2 2.2E+02  0.0048   23.6   5.6   38  102-139    27-64  (453)
 99 cd07108 ALDH_MGR_2402 Magnetos  24.0 2.2E+02  0.0047   23.7   5.5   39  103-141    29-67  (457)
100 PRK00197 proA gamma-glutamyl p  23.9 2.2E+02  0.0049   23.5   5.5   40  101-140    12-51  (417)
101 TIGR01804 BADH glycine betaine  23.7 2.2E+02  0.0047   23.8   5.4   38  103-140    45-82  (467)
102 PF07813 LTXXQ:  LTXXQ motif fa  23.3 1.3E+02  0.0028   18.8   3.3   24  100-123    76-99  (100)
103 PTZ00037 DnaJ_C chaperone prot  23.2 1.8E+02  0.0039   24.3   4.8   43   21-64     47-89  (421)
104 cd07131 ALDH_AldH-CAJ73105 Unc  23.2 2.3E+02   0.005   23.7   5.5   37  103-139    47-83  (478)
105 cd07098 ALDH_F15-22 Aldehyde d  23.2 2.5E+02  0.0054   23.4   5.7   37  103-139    28-64  (465)
106 cd07137 ALDH_F3FHI Plant aldeh  23.0 2.4E+02  0.0051   23.5   5.5   42  100-141     6-47  (432)
107 cd07147 ALDH_F21_RNP123 Aldehy  22.8 2.4E+02  0.0052   23.4   5.5   39  102-140    30-68  (452)
108 PF14098 SSPI:  Small, acid-sol  22.7   1E+02  0.0022   18.8   2.4   19   45-63     42-60  (65)
109 cd08332 CARD_CASP2 Caspase act  22.7      40 0.00086   21.7   0.7   17    9-25     61-77  (90)
110 cd07106 ALDH_AldA-AAD23400 Str  22.6 2.4E+02  0.0053   23.3   5.5   38  103-140    29-66  (446)
111 cd07559 ALDH_ACDHII_AcoD-like   22.6 2.4E+02  0.0052   23.7   5.5   38  103-140    48-85  (480)
112 PLN02174 aldehyde dehydrogenas  22.5 2.4E+02  0.0051   24.1   5.4   43  101-143    18-60  (484)
113 cd07100 ALDH_SSADH1_GabD1 Myco  22.5 2.5E+02  0.0053   23.2   5.5   39  101-139     7-45  (429)
114 cd07086 ALDH_F7_AASADH-like NA  22.2 2.4E+02  0.0052   23.7   5.5   38  103-140    45-82  (478)
115 cd07104 ALDH_BenzADH-like ALDH  22.1 2.5E+02  0.0055   23.0   5.5   39  101-139     8-46  (431)
116 TIGR01780 SSADH succinate-semi  22.1 2.5E+02  0.0054   23.3   5.5   40  102-141    28-67  (448)
117 cd07142 ALDH_F2BC Arabidosis a  22.1 2.6E+02  0.0055   23.5   5.6   53   90-142    32-92  (476)
118 TIGR01238 D1pyr5carbox3 delta-  22.0 2.5E+02  0.0054   23.9   5.5   52   89-140    64-121 (500)
119 cd07151 ALDH_HBenzADH NADP+-de  22.0 2.5E+02  0.0054   23.5   5.5   38  103-140    42-79  (465)
120 PF08367 M16C_assoc:  Peptidase  21.9 1.9E+02  0.0041   21.9   4.4   32   99-130    13-44  (248)
121 cd07129 ALDH_KGSADH Alpha-Keto  21.9 2.4E+02  0.0053   23.5   5.4   39  101-139     7-45  (454)
122 PRK13473 gamma-aminobutyraldeh  21.8 2.6E+02  0.0056   23.4   5.6   38  103-140    49-86  (475)
123 cd07130 ALDH_F7_AASADH NAD+-de  21.8 2.6E+02  0.0056   23.5   5.6   38  103-140    44-81  (474)
124 cd07140 ALDH_F1L_FTFDH 10-form  21.8 2.6E+02  0.0057   23.6   5.6   53   89-141    33-93  (486)
125 cd07136 ALDH_YwdH-P39616 Bacil  21.8 2.5E+02  0.0054   23.5   5.4   43  100-142     5-47  (449)
126 COG2854 Ttg2D ABC-type transpo  21.6      63  0.0014   24.3   1.6   39   41-79     76-115 (202)
127 PF05823 Gp-FAR-1:  Nematode fa  21.5 1.1E+02  0.0024   21.7   2.8   82   23-118    45-127 (154)
128 PRK12750 cpxP periplasmic repr  21.5 1.6E+02  0.0034   21.4   3.7   34   42-75    127-160 (170)
129 TIGR03092 SASP_sspI small, aci  21.5 1.1E+02  0.0023   18.7   2.3   19   45-63     41-59  (65)
130 PF00171 Aldedh:  Aldehyde dehy  21.4 2.4E+02  0.0052   23.5   5.3   40  101-140    37-76  (462)
131 cd07138 ALDH_CddD_SSP0762 Rhod  21.3 2.5E+02  0.0054   23.5   5.4   38  103-140    46-83  (466)
132 COG2938 Uncharacterized conser  21.2 1.5E+02  0.0033   19.4   3.2   27   40-66     32-58  (94)
133 cd07110 ALDH_F10_BADH Arabidop  21.2 2.6E+02  0.0057   23.2   5.5   38  103-140    29-66  (456)
134 PLN02315 aldehyde dehydrogenas  21.0 2.7E+02  0.0058   23.8   5.5   52   89-140    46-103 (508)
135 cd07088 ALDH_LactADH-AldA Esch  21.0 2.6E+02  0.0056   23.3   5.4   51   89-139    25-81  (468)
136 PLN02278 succinic semialdehyde  21.0 2.6E+02  0.0056   23.7   5.4   38  103-140    72-109 (498)
137 TIGR02518 EutH_ACDH acetaldehy  20.9 2.7E+02  0.0059   23.6   5.5   40  100-139    15-54  (488)
138 PF08855 DUF1825:  Domain of un  20.9 2.7E+02   0.006   18.7   5.0   37  104-140    26-62  (108)
139 cd07135 ALDH_F14-YMR110C Sacch  20.8 2.7E+02  0.0057   23.2   5.4   40  102-141    14-53  (436)
140 PRK11903 aldehyde dehydrogenas  20.8 2.3E+02   0.005   24.3   5.1   34  107-140    55-88  (521)
141 cd07097 ALDH_KGSADH-YcbD Bacil  20.7 2.8E+02   0.006   23.3   5.5   50   90-139    28-83  (473)
142 TIGR03216 OH_muco_semi_DH 2-hy  20.5 2.5E+02  0.0054   23.6   5.2   33  109-141    53-85  (481)
143 TIGR03374 ABALDH 1-pyrroline d  20.5 2.8E+02   0.006   23.4   5.5   39  103-141    48-86  (472)
144 cd07092 ALDH_ABALDH-YdcW Esche  20.4   3E+02  0.0065   22.7   5.7   39  103-141    29-67  (450)
145 TIGR01722 MMSDH methylmalonic   20.4 2.9E+02  0.0063   23.2   5.6   37  103-139    48-84  (477)
146 cd07111 ALDH_F16 Aldehyde dehy  20.4 2.8E+02  0.0061   23.4   5.5   50   91-140    51-106 (480)
147 KOG1610|consensus               20.3 1.9E+02  0.0041   23.4   4.2   33   38-70    218-250 (322)
148 TIGR03240 arg_catab_astD succi  20.2 2.9E+02  0.0062   23.3   5.5   39  103-141    45-83  (484)
149 PRK09406 gabD1 succinic semial  20.2 2.4E+02  0.0052   23.6   5.0   50   92-141    16-71  (457)
150 PLN02418 delta-1-pyrroline-5-c  20.2 2.8E+02   0.006   25.0   5.6   39  101-139   302-340 (718)
151 PLN02419 methylmalonate-semial  20.1 2.8E+02   0.006   24.5   5.5   38  103-140   161-198 (604)
152 cd07105 ALDH_SaliADH Salicylal  20.0 2.5E+02  0.0054   23.2   5.1   38  102-139     9-46  (432)

No 1  
>COG5648 NHP6B Chromatin-associated proteins containing the HMG domain [Chromatin structure and dynamics]
Probab=99.93  E-value=4.8e-26  Score=166.21  Aligned_cols=132  Identities=27%  Similarity=0.495  Sum_probs=121.7

Q ss_pred             CCCCCCCCCCCcHHHHHHHHHHHHHHHhCCCCcchHHHHHHHHHHHhccCCHHHHHHHHHhhhhhhhhhccccccCCCCC
Q psy3264           3 KEKDVKPRGRMTAYAFFVQVCREEHKKKHPEENVVFAEFSKKCAERWKTMSEKEKKRFHEMADKDKKRYDTEMQSYTPPA   82 (156)
Q Consensus         3 k~~p~~PkrP~say~lF~~e~r~~~~~~~p~~~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~~~~~~   82 (156)
                      +++||.||||+|||++|+.++|..+...+|++++  ++|++.+|+.|++||+++|++|...|..++.+|.+++..|+...
T Consensus        65 k~dpN~PKRp~sayf~y~~~~R~ei~~~~p~l~~--~e~~k~~~e~WK~Ltd~eke~y~k~~~~~~erYq~ek~~y~~k~  142 (211)
T COG5648          65 KKDPNGPKRPLSAYFLYSAENRDEIRKENPKLTF--GEVGKLLSEKWKELTDEEKEPYYKEANSDRERYQREKEEYNKKL  142 (211)
T ss_pred             hcCCCCCCCchhHHHHHHHHHHHHHHHhCCCCCh--HHHHHHHHHHHHhccHhhhhhHHHHHhhHHHHHHHHHHhhhccc
Confidence            5899999999999999999999999999999965  99999999999999999999999999999999999999997533


Q ss_pred             cc-------------ccccccccCCCCCchhHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHH
Q psy3264          83 GE-------------KRGKIKQDNPEYGVGDIAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMT  136 (156)
Q Consensus        83 ~~-------------~r~~~~~~~p~~~~~e~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~  136 (156)
                      ..             .++.+.+.+|..+..+.++.++..|.+|+++-|.+|++.+.+++..|...++
T Consensus       143 ~~~~~~~~~~e~~~~~r~~~~~~~~~~~~~e~~k~~~~~w~el~~skK~~~~~~~Kk~k~~~~~~~~  209 (211)
T COG5648         143 PNKAPIGPFIENEPKIRPKVEGPSPDKALVEETKIISKAWSELDESKKKKYIDKYKKLKEEYDSFYP  209 (211)
T ss_pred             CCCCCCchhhhccHHhccccCCCCcchhhhHHhhhhhhhhhhhChhhhhHHHHHHHHHHHHHhhhcc
Confidence            22             2778888899999999999999999999999999999999999999987654


No 2  
>PTZ00199 high mobility group protein; Provisional
Probab=99.89  E-value=3.1e-23  Score=137.02  Aligned_cols=77  Identities=32%  Similarity=0.542  Sum_probs=73.0

Q ss_pred             CCCCCCCCCCCcHHHHHHHHHHHHHHHhCCCCcchHHHHHHHHHHHhccCCHHHHHHHHHhhhhhhhhhccccccCC
Q psy3264           3 KEKDVKPRGRMTAYAFFVQVCREEHKKKHPEENVVFAEFSKKCAERWKTMSEKEKKRFHEMADKDKKRYDTEMQSYT   79 (156)
Q Consensus         3 k~~p~~PkrP~say~lF~~e~r~~~~~~~p~~~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~~~   79 (156)
                      ++||+.||||+|||+|||.++|..|..+||+++.++++|+++||++|++||+++|.+|.++|..++.+|..++.+|.
T Consensus        17 ~kdp~~PKrP~sAY~~F~~~~R~~i~~~~P~~~~~~~evsk~ige~Wk~ls~eeK~~y~~~A~~dk~rY~~e~~~Y~   93 (94)
T PTZ00199         17 KKDPNAPKRALSAYMFFAKEKRAEIIAENPELAKDVAAVGKMVGEAWNKLSEEEKAPYEKKAQEDKVRYEKEKAEYA   93 (94)
T ss_pred             CCCCCCCCCCCcHHHHHHHHHHHHHHHHCcCCcccHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence            57999999999999999999999999999998755699999999999999999999999999999999999998884


No 3  
>cd01389 MATA_HMG-box MATA_HMG-box, class I member of the HMG-box superfamily of DNA-binding proteins. These proteins contain a single HMG box, and bind the minor groove of DNA in a highly sequence-specific manner. Members include the fungal mating type gene products MC, MATA1 and Ste11.
Probab=99.82  E-value=4.9e-20  Score=117.51  Aligned_cols=71  Identities=23%  Similarity=0.346  Sum_probs=67.8

Q ss_pred             CCCCCCcHHHHHHHHHHHHHHHhCCCCcchHHHHHHHHHHHhccCCHHHHHHHHHhhhhhhhhhccccccCCC
Q psy3264           8 KPRGRMTAYAFFVQVCREEHKKKHPEENVVFAEFSKKCAERWKTMSEKEKKRFHEMADKDKKRYDTEMQSYTP   80 (156)
Q Consensus         8 ~PkrP~say~lF~~e~r~~~~~~~p~~~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~~~~   80 (156)
                      +||||+|||+||+++.|..|+.+||+++.  .+|+++||.+|+.||+++|++|.++|..++.+|..++..|.-
T Consensus         1 ~~kRP~naf~lf~~~~r~~~~~~~p~~~~--~eisk~~g~~Wk~ls~eeK~~y~~~A~~~k~~~~~~~p~Yky   71 (77)
T cd01389           1 KIPRPRNAFILYRQDKHAQLKTENPGLTN--NEISRIIGRMWRSESPEVKAYYKELAEEEKERHAREYPDYKY   71 (77)
T ss_pred             CCCCCCcHHHHHHHHHHHHHHHHCCCCCH--HHHHHHHHHHHhhCCHHHHHHHHHHHHHHHHHHHHHCCCCcc
Confidence            48999999999999999999999999875  899999999999999999999999999999999999999864


No 4  
>cd01388 SOX-TCF_HMG-box SOX-TCF_HMG-box, class I member of the HMG-box superfamily of DNA-binding proteins. These proteins contain a single HMG box, and bind the minor groove of DNA in a highly sequence-specific manner. Members include SRY and its homologs in insects and vertebrates, and transcription factor-like proteins, TCF-1, -3, -4, and LEF-1. They appear to bind the minor groove of the A/T C A A A G/C-motif.
Probab=99.80  E-value=1.8e-19  Score=113.47  Aligned_cols=69  Identities=26%  Similarity=0.363  Sum_probs=66.0

Q ss_pred             CCCCCcHHHHHHHHHHHHHHHhCCCCcchHHHHHHHHHHHhccCCHHHHHHHHHhhhhhhhhhccccccCC
Q psy3264           9 PRGRMTAYAFFVQVCREEHKKKHPEENVVFAEFSKKCAERWKTMSEKEKKRFHEMADKDKKRYDTEMQSYT   79 (156)
Q Consensus         9 PkrP~say~lF~~e~r~~~~~~~p~~~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~~~   79 (156)
                      .|||+|||++||+++|..++.+||++++  .+|+++||.+|+.||+++|++|.++|..++.+|.+++..|.
T Consensus         2 iKrP~naf~~F~~~~r~~~~~~~p~~~~--~eisk~l~~~Wk~ls~~eK~~y~~~a~~~k~~y~~~~p~y~   70 (72)
T cd01388           2 IKRPMNAFMLFSKRHRRKVLQEYPLKEN--RAISKILGDRWKALSNEEKQPYYEEAKKLKELHMKLYPDYK   70 (72)
T ss_pred             CCCCCcHHHHHHHHHHHHHHHHCCCCCH--HHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHHHHCcCCC
Confidence            5899999999999999999999999876  89999999999999999999999999999999999998874


No 5  
>cd01390 HMGB-UBF_HMG-box HMGB-UBF_HMG-box, class II and III members of the HMG-box superfamily of DNA-binding proteins. These proteins bind the minor groove of DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III members include nucleolar and mitochondrial transcription factors, UBF and mtTF1, which bind four-way DNA junctions.
Probab=99.80  E-value=1.6e-19  Score=111.50  Aligned_cols=65  Identities=45%  Similarity=0.791  Sum_probs=62.4

Q ss_pred             CCCCCcHHHHHHHHHHHHHHHhCCCCcchHHHHHHHHHHHhccCCHHHHHHHHHhhhhhhhhhcccc
Q psy3264           9 PRGRMTAYAFFVQVCREEHKKKHPEENVVFAEFSKKCAERWKTMSEKEKKRFHEMADKDKKRYDTEM   75 (156)
Q Consensus         9 PkrP~say~lF~~e~r~~~~~~~p~~~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~   75 (156)
                      ||+|+|||++|++++|..+...||+++.  .+|++.||..|++||+++|++|.+.|..++.+|..+|
T Consensus         1 Pkrp~saf~~f~~~~r~~~~~~~p~~~~--~~i~~~~~~~W~~ls~~eK~~y~~~a~~~~~~y~~e~   65 (66)
T cd01390           1 PKRPLSAYFLFSQEQRPKLKKENPDASV--TEVTKILGEKWKELSEEEKKKYEEKAEKDKERYEKEM   65 (66)
T ss_pred             CCCCCcHHHHHHHHHHHHHHHHCcCCCH--HHHHHHHHHHHHhCCHHHHHHHHHHHHHHHHHHHHhh
Confidence            8999999999999999999999999865  8999999999999999999999999999999999876


No 6  
>PF09011 HMG_box_2:  HMG-box domain;  InterPro: IPR015101 This domain is predominantly found in Maelstrom homologue proteins. It has no known function. ; GO: 0005634 nucleus; PDB: 2EQZ_A 1V64_A 2CTO_A 1H5P_A 3TQ6_A 3FGH_A 3TMM_A 1J3X_A 2YRQ_A 1AAB_A ....
Probab=99.80  E-value=1.1e-19  Score=114.84  Aligned_cols=71  Identities=38%  Similarity=0.701  Sum_probs=61.7

Q ss_pred             CCCCCCCCcHHHHHHHHHHHHHHHh-CCCCcchHHHHHHHHHHHhccCCHHHHHHHHHhhhhhhhhhccccccC
Q psy3264           6 DVKPRGRMTAYAFFVQVCREEHKKK-HPEENVVFAEFSKKCAERWKTMSEKEKKRFHEMADKDKKRYDTEMQSY   78 (156)
Q Consensus         6 p~~PkrP~say~lF~~e~r~~~~~~-~p~~~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~~   78 (156)
                      |++||+|+|||+|||.+++..++.. ++...  +.++++.||..|++||++||.+|.++|+.++.+|..++..|
T Consensus         1 p~kpK~~~say~lF~~~~~~~~k~~G~~~~~--~~e~~k~~~~~Wk~Ls~~EK~~Y~~~A~~~k~~y~~e~~~~   72 (73)
T PF09011_consen    1 PKKPKRPPSAYNLFMKEMRKEVKEEGGQKQS--FREVMKEISERWKSLSEEEKEPYEERAKEDKERYEREMKEW   72 (73)
T ss_dssp             SSS--SSSSHHHHHHHHHHHHHHHHT-T-SS--HHHHHHHHHHHHHHS-HHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             CcCCCCCCCHHHHHHHHHHHHHHHhcccCCC--HHHHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence            7899999999999999999999988 55443  58999999999999999999999999999999999998876


No 7  
>PF00505 HMG_box:  HMG (high mobility group) box;  InterPro: IPR000910 High mobility group (HMG or HMGB) proteins are a family of relatively low molecular weight non-histone components in chromatin. HMG1 (also called HMG-T in fish) and HMG2 are two highly related proteins that bind single-stranded DNA preferentially and unwind double-stranded DNA. Although they have no sequence specificity, they have a high affinity for bent or distorted DNA, and bend linear DNA. HMG1 and HMG2 contain two DNA-binding HMG-box domains (A and B) that show structural and functional differences, and have a long acidic C-terminal domain rich in aspartic and glutamic acid residues. The acidic tail modulates the affinity of the tandem HMG boxes in HMG1 and 2 for a variety of DNA targets. HMG1 and 2 appear to play important architectural roles in the assembly of nucleoprotein complexes in a variety of biological processes, for example V(D)J recombination, the initiation of transcription, and DNA repair []. The profile in this entry describing the HMG-domains is much more general than the signature. In addition to the HMG1 and HMG2 proteins, HMG-domains occur in single or multiple copies in the following protein classes; the SOX family of transcription factors; SRY sex determining region Y protein and related proteins []; LEF1 lymphoid enhancer binding factor 1 []; SSRP recombination signal recognition protein; MTF1 mitochondrial transcription factor 1; UBF1/2 nucleolar transcription factors; Abf2 yeast ARS-binding factor []; and Saccharomyces cerevisiae transcription factors Ixr1, Rox1, Nhp6a, Nhp6b and Spp41.; GO: 0003677 DNA binding; PDB: 1I11_A 1J3C_A 1J3D_A 1WZ6_A 1WGF_A 2D7L_A 1GT0_D 3U2B_C 2CRJ_A 2CS1_A ....
Probab=99.79  E-value=1.4e-19  Score=112.78  Aligned_cols=68  Identities=35%  Similarity=0.635  Sum_probs=63.5

Q ss_pred             CCCCCcHHHHHHHHHHHHHHHhCCCCcchHHHHHHHHHHHhccCCHHHHHHHHHhhhhhhhhhccccccC
Q psy3264           9 PRGRMTAYAFFVQVCREEHKKKHPEENVVFAEFSKKCAERWKTMSEKEKKRFHEMADKDKKRYDTEMQSY   78 (156)
Q Consensus         9 PkrP~say~lF~~e~r~~~~~~~p~~~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~~   78 (156)
                      |+||+|||+|||.+++..++..||+++.  .+|+++||.+|++||+++|.+|.+.|..++..|..++..|
T Consensus         1 PkrP~~af~lf~~~~~~~~k~~~p~~~~--~~i~~~~~~~W~~l~~~eK~~y~~~a~~~~~~y~~~~~~y   68 (69)
T PF00505_consen    1 PKRPPNAFMLFCKEKRAKLKEENPDLSN--KEISKILAQMWKNLSEEEKAPYKEEAEEEKERYEKEMPEY   68 (69)
T ss_dssp             SSSS--HHHHHHHHHHHHHHHHSTTSTH--HHHHHHHHHHHHCSHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             CcCCCCHHHHHHHHHHHHHHHHhccccc--ccchhhHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence            8999999999999999999999999985  9999999999999999999999999999999999988776


No 8  
>smart00398 HMG high mobility group.
Probab=99.78  E-value=5.7e-19  Score=109.96  Aligned_cols=69  Identities=46%  Similarity=0.745  Sum_probs=66.1

Q ss_pred             CCCCCCcHHHHHHHHHHHHHHHhCCCCcchHHHHHHHHHHHhccCCHHHHHHHHHhhhhhhhhhccccccC
Q psy3264           8 KPRGRMTAYAFFVQVCREEHKKKHPEENVVFAEFSKKCAERWKTMSEKEKKRFHEMADKDKKRYDTEMQSY   78 (156)
Q Consensus         8 ~PkrP~say~lF~~e~r~~~~~~~p~~~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~~   78 (156)
                      +|++|+|||++|++++|..+..+||+++.  .+|++.||.+|+.||+++|+.|.+.|..++..|.+.+..|
T Consensus         1 ~pkrp~~~y~~f~~~~r~~~~~~~~~~~~--~~i~~~~~~~W~~l~~~ek~~y~~~a~~~~~~y~~~~~~y   69 (70)
T smart00398        1 KPKRPMSAFMLFSQENRAKIKAENPDLSN--AEISKKLGERWKLLSEEEKAPYEEKAKKDKERYEEEMPEY   69 (70)
T ss_pred             CcCCCCcHHHHHHHHHHHHHHHHCcCCCH--HHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence            69999999999999999999999999875  8999999999999999999999999999999999998877


No 9  
>cd00084 HMG-box High Mobility Group (HMG)-box is found in a variety of eukaryotic chromosomal proteins and transcription factors. HMGs bind to the minor groove of DNA and have been classified by DNA binding preferences. Two phylogenically distinct groups of Class I proteins bind DNA in a sequence specific fashion and contain a single HMG box. One group (SOX-TCF) includes transcription factors, TCF-1, -3, -4; and also SRY and LEF-1, which bind four-way DNA junctions and duplex DNA targets. The second group (MATA) includes fungal mating type gene products MC, MATA1 and Ste11. Class II and III proteins (HMGB-UBF) bind DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III member
Probab=99.73  E-value=1.5e-17  Score=102.35  Aligned_cols=65  Identities=46%  Similarity=0.753  Sum_probs=61.5

Q ss_pred             CCCCCcHHHHHHHHHHHHHHHhCCCCcchHHHHHHHHHHHhccCCHHHHHHHHHhhhhhhhhhcccc
Q psy3264           9 PRGRMTAYAFFVQVCREEHKKKHPEENVVFAEFSKKCAERWKTMSEKEKKRFHEMADKDKKRYDTEM   75 (156)
Q Consensus         9 PkrP~say~lF~~e~r~~~~~~~p~~~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~   75 (156)
                      |+||+|||++|+++.+..++..+|+++.  .+|++.||.+|+.||+++|..|.+.|...+..|.+.+
T Consensus         1 pkrp~~af~~f~~~~~~~~~~~~~~~~~--~~i~~~~~~~W~~l~~~~k~~y~~~a~~~~~~y~~~~   65 (66)
T cd00084           1 PKRPLSAYFLFSQEHRAEVKAENPGLSV--GEISKILGEMWKSLSEEEKKKYEEKAEKDKERYEKEM   65 (66)
T ss_pred             CCCCCcHHHHHHHHHHHHHHHHCcCCCH--HHHHHHHHHHHHhCCHHHHHHHHHHHHHHHHHHHHhh
Confidence            8999999999999999999999999765  8999999999999999999999999999999998764


No 10 
>KOG0381|consensus
Probab=99.66  E-value=3.4e-16  Score=103.63  Aligned_cols=72  Identities=31%  Similarity=0.567  Sum_probs=67.7

Q ss_pred             CCCCCCCcHHHHHHHHHHHHHHHhCCCCcchHHHHHHHHHHHhccCCHHHHHHHHHhhhhhhhhhccccc-cCCC
Q psy3264           7 VKPRGRMTAYAFFVQVCREEHKKKHPEENVVFAEFSKKCAERWKTMSEKEKKRFHEMADKDKKRYDTEMQ-SYTP   80 (156)
Q Consensus         7 ~~PkrP~say~lF~~e~r~~~~~~~p~~~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~-~~~~   80 (156)
                      +.|+||+|||++|+.+.+..++..||++++  .+|++++|.+|++|++++|.+|...|..++.+|..+|. .|..
T Consensus        21 ~~pkrp~sa~~~f~~~~~~~~k~~~p~~~~--~~v~k~~g~~W~~l~~~~k~~y~~ka~~~k~~Y~~~~~~~~~~   93 (96)
T KOG0381|consen   21 QAPKRPLSAFFLFSSEQRSKIKAENPGLSV--GEVAKALGEMWKNLAEEEKQPYEEKASKLKEKYEKELAGEYKA   93 (96)
T ss_pred             CCCCCCCcHHHHHHHHHHHHHHHhCCCCCH--HHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence            489999999999999999999999999776  89999999999999999999999999999999999998 6643


No 11 
>cd01388 SOX-TCF_HMG-box SOX-TCF_HMG-box, class I member of the HMG-box superfamily of DNA-binding proteins. These proteins contain a single HMG box, and bind the minor groove of DNA in a highly sequence-specific manner. Members include SRY and its homologs in insects and vertebrates, and transcription factor-like proteins, TCF-1, -3, -4, and LEF-1. They appear to bind the minor groove of the A/T C A A A G/C-motif.
Probab=99.54  E-value=1.7e-15  Score=95.28  Aligned_cols=69  Identities=26%  Similarity=0.304  Sum_probs=64.9

Q ss_pred             ccccccCCCCCccccccccccCCCCCchhHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy3264          72 DTEMQSYTPPAGEKRGKIKQDNPEYGVGDIAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKN  140 (156)
Q Consensus        72 ~~~~~~~~~~~~~~r~~~~~~~p~~~~~e~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~  140 (156)
                      .+.+++|+.+..+.++.++.+||++++++|++.||+.|+.||+++|++|.++|+.++++|.++++.|+-
T Consensus         3 KrP~naf~~F~~~~r~~~~~~~p~~~~~eisk~l~~~Wk~ls~~eK~~y~~~a~~~k~~y~~~~p~y~y   71 (72)
T cd01388           3 KRPMNAFMLFSKRHRRKVLQEYPLKENRAISKILGDRWKALSNEEKQPYYEEAKKLKELHMKLYPDYKW   71 (72)
T ss_pred             CCCCcHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHHHHCcCCCC
Confidence            467888888889999999999999999999999999999999999999999999999999999999863


No 12 
>KOG0527|consensus
Probab=99.53  E-value=1.5e-14  Score=114.29  Aligned_cols=76  Identities=26%  Similarity=0.407  Sum_probs=71.4

Q ss_pred             CCCCCCCCCCCcHHHHHHHHHHHHHHHhCCCCcchHHHHHHHHHHHhccCCHHHHHHHHHhhhhhhhhhccccccCCC
Q psy3264           3 KEKDVKPRGRMTAYAFFVQVCREEHKKKHPEENVVFAEFSKKCAERWKTMSEKEKKRFHEMADKDKKRYDTEMQSYTP   80 (156)
Q Consensus         3 k~~p~~PkrP~say~lF~~e~r~~~~~~~p~~~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~~~~   80 (156)
                      ++...+.|||+||||+|.+..|.+|..++|++-+  .||+|.||.+|+.|+++||.+|++.|+..|..+.++...|.-
T Consensus        57 k~~~~hIKRPMNAFMVWSq~~RRkma~qnP~mHN--SEISK~LG~~WK~Lse~EKrPFi~EAeRLR~~HmkehPdYKY  132 (331)
T KOG0527|consen   57 KTSTDRIKRPMNAFMVWSQGQRRKLAKQNPKMHN--SEISKRLGAEWKLLSEEEKRPFVDEAERLRAQHMKEYPDYKY  132 (331)
T ss_pred             CCCccccCCCcchhhhhhHHHHHHHHHhCcchhh--HHHHHHHHHHHhhcCHhhhccHHHHHHHHHHHHHHhCCCccc
Confidence            4566789999999999999999999999999976  999999999999999999999999999999999999999864


No 13 
>KOG0526|consensus
Probab=99.52  E-value=7.8e-15  Score=119.84  Aligned_cols=71  Identities=34%  Similarity=0.637  Sum_probs=67.5

Q ss_pred             CCCCCCCCCCCcHHHHHHHHHHHHHHHhCCCCcchHHHHHHHHHHHhccCCHHHHHHHHHhhhhhhhhhccccccCC
Q psy3264           3 KEKDVKPRGRMTAYAFFVQVCREEHKKKHPEENVVFAEFSKKCAERWKTMSEKEKKRFHEMADKDKKRYDTEMQSYT   79 (156)
Q Consensus         3 k~~p~~PkrP~say~lF~~e~r~~~~~~~p~~~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~~~   79 (156)
                      +|+||.||||+||||+|++..|..|+..  ++++  ++|.+.+|++|++||.  |++|.+.|+.++.+|+.+|..|.
T Consensus       530 ~kdpnapkra~sa~m~w~~~~r~~ik~d--gi~~--~dv~kk~g~~wk~ms~--k~~we~ka~~dk~ry~~em~~yk  600 (615)
T KOG0526|consen  530 KKDPNAPKRATSAYMLWLNASRESIKED--GISV--GDVAKKAGEKWKQMSA--KEEWEDKAAVDKQRYEDEMKEYK  600 (615)
T ss_pred             CCCCCCCccchhHHHHHHHhhhhhHhhc--CchH--HHHHHHHhHHHhhhcc--cchhhHHHHHHHHHHHHHHHhhc
Confidence            6899999999999999999999999987  6655  8999999999999999  99999999999999999999996


No 14 
>PTZ00199 high mobility group protein; Provisional
Probab=99.50  E-value=1.3e-14  Score=95.87  Aligned_cols=73  Identities=36%  Similarity=0.590  Sum_probs=65.2

Q ss_pred             hhhhhccccccCCCCCccccccccccCCCCC--chhHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3264          67 DKKRYDTEMQSYTPPAGEKRGKIKQDNPEYG--VGDIAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYK  139 (156)
Q Consensus        67 ~k~~y~~~~~~~~~~~~~~r~~~~~~~p~~~--~~e~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~  139 (156)
                      +-..-.+.+.+|+.|..+.|..+..+||+++  +.+|+++||+.|+.||+++|.+|.++|+.++.+|..+|..|+
T Consensus        19 dp~~PKrP~sAY~~F~~~~R~~i~~~~P~~~~~~~evsk~ige~Wk~ls~eeK~~y~~~A~~dk~rY~~e~~~Y~   93 (94)
T PTZ00199         19 DPNAPKRALSAYMFFAKEKRAEIIAENPELAKDVAAVGKMVGEAWNKLSEEEKAPYEKKAQEDKVRYEKEKAEYA   93 (94)
T ss_pred             CCCCCCCCCcHHHHHHHHHHHHHHHHCcCCcccHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence            3344556778888888888999999999986  799999999999999999999999999999999999999995


No 15 
>cd01389 MATA_HMG-box MATA_HMG-box, class I member of the HMG-box superfamily of DNA-binding proteins. These proteins contain a single HMG box, and bind the minor groove of DNA in a highly sequence-specific manner. Members include the fungal mating type gene products MC, MATA1 and Ste11.
Probab=99.48  E-value=8.4e-15  Score=93.26  Aligned_cols=71  Identities=25%  Similarity=0.517  Sum_probs=65.9

Q ss_pred             ccccccCCCCCccccccccccCCCCCchhHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHhcC
Q psy3264          72 DTEMQSYTPPAGEKRGKIKQDNPEYGVGDIAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKNRS  142 (156)
Q Consensus        72 ~~~~~~~~~~~~~~r~~~~~~~p~~~~~e~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~~  142 (156)
                      .+.+++|+.+....+..++.+||++++++|+++||..|+.||+++|++|.++|+.++++|..+++.|+..-
T Consensus         3 kRP~naf~lf~~~~r~~~~~~~p~~~~~eisk~~g~~Wk~ls~eeK~~y~~~A~~~k~~~~~~~p~Yky~p   73 (77)
T cd01389           3 PRPRNAFILYRQDKHAQLKTENPGLTNNEISRIIGRMWRSESPEVKAYYKELAEEEKERHAREYPDYKYTP   73 (77)
T ss_pred             CCCCcHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHhhCCHHHHHHHHHHHHHHHHHHHHHCCCCcccC
Confidence            45678888888888999999999999999999999999999999999999999999999999999998753


No 16 
>PF00505 HMG_box:  HMG (high mobility group) box;  InterPro: IPR000910 High mobility group (HMG or HMGB) proteins are a family of relatively low molecular weight non-histone components in chromatin. HMG1 (also called HMG-T in fish) and HMG2 are two highly related proteins that bind single-stranded DNA preferentially and unwind double-stranded DNA. Although they have no sequence specificity, they have a high affinity for bent or distorted DNA, and bend linear DNA. HMG1 and HMG2 contain two DNA-binding HMG-box domains (A and B) that show structural and functional differences, and have a long acidic C-terminal domain rich in aspartic and glutamic acid residues. The acidic tail modulates the affinity of the tandem HMG boxes in HMG1 and 2 for a variety of DNA targets. HMG1 and 2 appear to play important architectural roles in the assembly of nucleoprotein complexes in a variety of biological processes, for example V(D)J recombination, the initiation of transcription, and DNA repair []. The profile in this entry describing the HMG-domains is much more general than the signature. In addition to the HMG1 and HMG2 proteins, HMG-domains occur in single or multiple copies in the following protein classes; the SOX family of transcription factors; SRY sex determining region Y protein and related proteins []; LEF1 lymphoid enhancer binding factor 1 []; SSRP recombination signal recognition protein; MTF1 mitochondrial transcription factor 1; UBF1/2 nucleolar transcription factors; Abf2 yeast ARS-binding factor []; and Saccharomyces cerevisiae transcription factors Ixr1, Rox1, Nhp6a, Nhp6b and Spp41.; GO: 0003677 DNA binding; PDB: 1I11_A 1J3C_A 1J3D_A 1WZ6_A 1WGF_A 2D7L_A 1GT0_D 3U2B_C 2CRJ_A 2CS1_A ....
Probab=99.42  E-value=7e-14  Score=86.84  Aligned_cols=67  Identities=36%  Similarity=0.625  Sum_probs=59.7

Q ss_pred             cccccCCCCCccccccccccCCCCCchhHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3264          73 TEMQSYTPPAGEKRGKIKQDNPEYGVGDIAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYK  139 (156)
Q Consensus        73 ~~~~~~~~~~~~~r~~~~~~~p~~~~~e~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~  139 (156)
                      +.++.|..+....++.++..||+++..+|++.||..|++||+++|++|.+.|+.++.+|..+++.|+
T Consensus         3 rP~~af~lf~~~~~~~~k~~~p~~~~~~i~~~~~~~W~~l~~~eK~~y~~~a~~~~~~y~~~~~~y~   69 (69)
T PF00505_consen    3 RPPNAFMLFCKEKRAKLKEENPDLSNKEISKILAQMWKNLSEEEKAPYKEEAEEEKERYEKEMPEYK   69 (69)
T ss_dssp             SS--HHHHHHHHHHHHHHHHSTTSTHHHHHHHHHHHHHCSHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             CCCCHHHHHHHHHHHHHHHHhcccccccchhhHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHhcC
Confidence            3455666666777888999999999999999999999999999999999999999999999999995


No 17 
>cd01390 HMGB-UBF_HMG-box HMGB-UBF_HMG-box, class II and III members of the HMG-box superfamily of DNA-binding proteins. These proteins bind the minor groove of DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III members include nucleolar and mitochondrial transcription factors, UBF and mtTF1, which bind four-way DNA junctions.
Probab=99.37  E-value=2.8e-13  Score=83.34  Aligned_cols=63  Identities=44%  Similarity=0.742  Sum_probs=58.1

Q ss_pred             cccccCCCCCccccccccccCCCCCchhHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHH
Q psy3264          73 TEMQSYTPPAGEKRGKIKQDNPEYGVGDIAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEM  135 (156)
Q Consensus        73 ~~~~~~~~~~~~~r~~~~~~~p~~~~~e~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~  135 (156)
                      +.+..|..+..+.+..+...||++++.+|++.||..|+.||+++|.+|.+.|+.++.+|..+|
T Consensus         3 rp~saf~~f~~~~r~~~~~~~p~~~~~~i~~~~~~~W~~ls~~eK~~y~~~a~~~~~~y~~e~   65 (66)
T cd01390           3 RPLSAYFLFSQEQRPKLKKENPDASVTEVTKILGEKWKELSEEEKKKYEEKAEKDKERYEKEM   65 (66)
T ss_pred             CCCcHHHHHHHHHHHHHHHHCcCCCHHHHHHHHHHHHHhCCHHHHHHHHHHHHHHHHHHHHhh
Confidence            456677777778899999999999999999999999999999999999999999999999987


No 18 
>KOG0527|consensus
Probab=99.35  E-value=2.3e-13  Score=107.55  Aligned_cols=77  Identities=26%  Similarity=0.407  Sum_probs=72.4

Q ss_pred             hhhhhccccccCCCCCccccccccccCCCCCchhHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHhcCC
Q psy3264          67 DKKRYDTEMQSYTPPAGEKRGKIKQDNPEYGVGDIAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKNRSV  143 (156)
Q Consensus        67 ~k~~y~~~~~~~~~~~~~~r~~~~~~~p~~~~~e~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~~~  143 (156)
                      ...+.++.|++|..+....|.+|..+||.+.+.||+|.||..||.|+++||.||+++|++++..|.+|+.+|+.+-.
T Consensus        59 ~~~hIKRPMNAFMVWSq~~RRkma~qnP~mHNSEISK~LG~~WK~Lse~EKrPFi~EAeRLR~~HmkehPdYKYRPR  135 (331)
T KOG0527|consen   59 STDRIKRPMNAFMVWSQGQRRKLAKQNPKMHNSEISKRLGAEWKLLSEEEKRPFVDEAERLRAQHMKEYPDYKYRPR  135 (331)
T ss_pred             CccccCCCcchhhhhhHHHHHHHHHhCcchhhHHHHHHHHHHHhhcCHhhhccHHHHHHHHHHHHHHhCCCcccccc
Confidence            45688899999999999999999999999999999999999999999999999999999999999999999987644


No 19 
>smart00398 HMG high mobility group.
Probab=99.34  E-value=5.7e-13  Score=82.62  Aligned_cols=67  Identities=40%  Similarity=0.703  Sum_probs=60.7

Q ss_pred             cccccCCCCCccccccccccCCCCCchhHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3264          73 TEMQSYTPPAGEKRGKIKQDNPEYGVGDIAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYK  139 (156)
Q Consensus        73 ~~~~~~~~~~~~~r~~~~~~~p~~~~~e~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~  139 (156)
                      +.+..|+.+....++.+...||++++++|++.||..|+.||+++|.+|.++|+.++++|..++..|+
T Consensus         4 rp~~~y~~f~~~~r~~~~~~~~~~~~~~i~~~~~~~W~~l~~~ek~~y~~~a~~~~~~y~~~~~~y~   70 (70)
T smart00398        4 RPMSAFMLFSQENRAKIKAENPDLSNAEISKKLGERWKLLSEEEKAPYEEKAKKDKERYEEEMPEYK   70 (70)
T ss_pred             CCCcHHHHHHHHHHHHHHHHCcCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHHHHHHhcC
Confidence            4556666777777888999999999999999999999999999999999999999999999999884


No 20 
>PF09011 HMG_box_2:  HMG-box domain;  InterPro: IPR015101 This domain is predominantly found in Maelstrom homologue proteins. It has no known function. ; GO: 0005634 nucleus; PDB: 2EQZ_A 1V64_A 2CTO_A 1H5P_A 3TQ6_A 3FGH_A 3TMM_A 1J3X_A 2YRQ_A 1AAB_A ....
Probab=99.27  E-value=1.8e-12  Score=81.63  Aligned_cols=66  Identities=27%  Similarity=0.543  Sum_probs=53.2

Q ss_pred             ccccCCCCCcccccccccc-CCCCCchhHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3264          74 EMQSYTPPAGEKRGKIKQD-NPEYGVGDIAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYK  139 (156)
Q Consensus        74 ~~~~~~~~~~~~r~~~~~~-~p~~~~~e~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~  139 (156)
                      ...+|..+....+..++.. .+..+..++++.|+..|++||++||.+|.+.|+.++.+|+.+|..|.
T Consensus         7 ~~say~lF~~~~~~~~k~~G~~~~~~~e~~k~~~~~Wk~Ls~~EK~~Y~~~A~~~k~~y~~e~~~~~   73 (73)
T PF09011_consen    7 PPSAYNLFMKEMRKEVKEEGGQKQSFREVMKEISERWKSLSEEEKEPYEERAKEDKERYEREMKEWN   73 (73)
T ss_dssp             SSSHHHHHHHHHHHHHHHHT-T-SSHHHHHHHHHHHHHHS-HHHHHHHHHHHHHHHHHHHHHHHHH-
T ss_pred             CCCHHHHHHHHHHHHHHHhcccCCCHHHHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhcC
Confidence            3445555555666677666 66788999999999999999999999999999999999999999984


No 21 
>KOG0381|consensus
Probab=99.22  E-value=9.2e-12  Score=82.25  Aligned_cols=73  Identities=36%  Similarity=0.602  Sum_probs=65.1

Q ss_pred             hhccccccCCCCCccccccccccCCCCCchhHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHH-HHHhcC
Q psy3264          70 RYDTEMQSYTPPAGEKRGKIKQDNPEYGVGDIAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMT-AYKNRS  142 (156)
Q Consensus        70 ~y~~~~~~~~~~~~~~r~~~~~~~p~~~~~e~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~-~~~~~~  142 (156)
                      ...+.+.+|..++...+..++..||++++.++++.+|++|++|++++|.+|...|..++++|..+|. .|+...
T Consensus        22 ~pkrp~sa~~~f~~~~~~~~k~~~p~~~~~~v~k~~g~~W~~l~~~~k~~y~~ka~~~k~~Y~~~~~~~~~~~~   95 (96)
T KOG0381|consen   22 APKRPLSAFFLFSSEQRSKIKAENPGLSVGEVAKALGEMWKNLAEEEKQPYEEKASKLKEKYEKELAGEYKASL   95 (96)
T ss_pred             CCCCCCcHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc
Confidence            4555666777777777899999999999999999999999999999999999999999999999999 887654


No 22 
>cd00084 HMG-box High Mobility Group (HMG)-box is found in a variety of eukaryotic chromosomal proteins and transcription factors. HMGs bind to the minor groove of DNA and have been classified by DNA binding preferences. Two phylogenically distinct groups of Class I proteins bind DNA in a sequence specific fashion and contain a single HMG box. One group (SOX-TCF) includes transcription factors, TCF-1, -3, -4; and also SRY and LEF-1, which bind four-way DNA junctions and duplex DNA targets. The second group (MATA) includes fungal mating type gene products MC, MATA1 and Ste11. Class II and III proteins (HMGB-UBF) bind DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III member
Probab=99.16  E-value=1.7e-11  Score=75.08  Aligned_cols=63  Identities=44%  Similarity=0.735  Sum_probs=57.2

Q ss_pred             cccccCCCCCccccccccccCCCCCchhHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHH
Q psy3264          73 TEMQSYTPPAGEKRGKIKQDNPEYGVGDIAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEM  135 (156)
Q Consensus        73 ~~~~~~~~~~~~~r~~~~~~~p~~~~~e~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~  135 (156)
                      +.+..|+.+..+.+..+...+|+++..+|++.||..|+.||+++|.+|.++|+.++.+|..++
T Consensus         3 rp~~af~~f~~~~~~~~~~~~~~~~~~~i~~~~~~~W~~l~~~~k~~y~~~a~~~~~~y~~~~   65 (66)
T cd00084           3 RPLSAYFLFSQEHRAEVKAENPGLSVGEISKILGEMWKSLSEEEKKKYEEKAEKDKERYEKEM   65 (66)
T ss_pred             CCCcHHHHHHHHHHHHHHHHCcCCCHHHHHHHHHHHHHhCCHHHHHHHHHHHHHHHHHHHHhh
Confidence            456667777777888899999999999999999999999999999999999999999999875


No 23 
>KOG3248|consensus
Probab=99.13  E-value=9.3e-11  Score=91.51  Aligned_cols=70  Identities=14%  Similarity=0.313  Sum_probs=62.5

Q ss_pred             CCCCCCcHHHHHHHHHHHHHHHhCCCCcchHHHHHHHHHHHhccCCHHHHHHHHHhhhhhhhhhccccccCC
Q psy3264           8 KPRGRMTAYAFFVQVCREEHKKKHPEENVVFAEFSKKCAERWKTMSEKEKKRFHEMADKDKKRYDTEMQSYT   79 (156)
Q Consensus         8 ~PkrP~say~lF~~e~r~~~~~~~p~~~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~~~   79 (156)
                      ..|+|+|||+||++++|+.|.+++. ++. -.+|-++||..|+.||-+|..+|.++|+++++.+...+..|.
T Consensus       191 hiKKPLNAFmlyMKEmRa~vvaEct-lKe-SAaiNqiLGrRWH~LSrEEQAKYyElArKerqlH~qlYP~WS  260 (421)
T KOG3248|consen  191 HIKKPLNAFMLYMKEMRAKVVAECT-LKE-SAAINQILGRRWHALSREEQAKYYELARKERQLHMQLYPGWS  260 (421)
T ss_pred             cccccHHHHHHHHHHHHHHHHHHhh-hhh-HHHHHHHHhHHHhhhhHHHHHHHHHHHHHHHHHHHHhcCCcc
Confidence            3599999999999999999999986 333 289999999999999999999999999999998888877774


No 24 
>COG5648 NHP6B Chromatin-associated proteins containing the HMG domain [Chromatin structure and dynamics]
Probab=99.11  E-value=4.1e-11  Score=88.12  Aligned_cols=83  Identities=24%  Similarity=0.472  Sum_probs=74.6

Q ss_pred             hhhhhccccccCCCCCccccccccccCCCCCchhHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCC
Q psy3264          67 DKKRYDTEMQSYTPPAGEKRGKIKQDNPEYGVGDIAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKNRSVAPV  146 (156)
Q Consensus        67 ~k~~y~~~~~~~~~~~~~~r~~~~~~~p~~~~~e~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~~~~~~  146 (156)
                      +...-++.+.+|..++...|..+...+|.++++++++.+|+.|++|+++||.+|...|..++++|+.++..|..+.+...
T Consensus        67 dpN~PKRp~sayf~y~~~~R~ei~~~~p~l~~~e~~k~~~e~WK~Ltd~eke~y~k~~~~~~erYq~ek~~y~~k~~~~~  146 (211)
T COG5648          67 DPNGPKRPLSAYFLYSAENRDEIRKENPKLTFGEVGKLLSEKWKELTDEEKEPYYKEANSDRERYQREKEEYNKKLPNKA  146 (211)
T ss_pred             CCCCCCCchhHHHHHHHHHHHHHHHhCCCCChHHHHHHHHHHHHhccHhhhhhHHHHHhhHHHHHHHHHHhhhcccCCCC
Confidence            55666778888888888889999999999999999999999999999999999999999999999999999999887765


Q ss_pred             CCc
Q psy3264         147 NEE  149 (156)
Q Consensus       147 ~~~  149 (156)
                      -.+
T Consensus       147 ~~~  149 (211)
T COG5648         147 PIG  149 (211)
T ss_pred             CCc
Confidence            433


No 25 
>KOG4715|consensus
Probab=99.10  E-value=9.2e-11  Score=90.83  Aligned_cols=74  Identities=19%  Similarity=0.398  Sum_probs=69.9

Q ss_pred             CCCCCCCCCCcHHHHHHHHHHHHHHHhCCCCcchHHHHHHHHHHHhccCCHHHHHHHHHhhhhhhhhhccccccCC
Q psy3264           4 EKDVKPRGRMTAYAFFVQVCREEHKKKHPEENVVFAEFSKKCAERWKTMSEKEKKRFHEMADKDKKRYDTEMQSYT   79 (156)
Q Consensus         4 ~~p~~PkrP~say~lF~~e~r~~~~~~~p~~~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~~~   79 (156)
                      +.|.+|-+|+-+|+.|.+..+..|++.||++.+  =+|.++||.+|..|+++||+.|.+.+...+..|.+.|.+|.
T Consensus        60 kpPkppekpl~pymrySrkvWd~VkA~nPe~kL--WeiGK~Ig~mW~dLpd~EK~ey~~EYeaEKieY~~smkayh  133 (410)
T KOG4715|consen   60 KPPKPPEKPLMPYMRYSRKVWDQVKASNPELKL--WEIGKIIGGMWLDLPDEEKQEYLNEYEAEKIEYNESMKAYH  133 (410)
T ss_pred             CCCCCCCcccchhhHHhhhhhhhhhccCcchHH--HHHHHHHHHHHhhCcchHHHHHHHHHHHHHHHHHHHHHHhh
Confidence            467888899999999999999999999999976  89999999999999999999999999999999999999884


No 26 
>PF14887 HMG_box_5:  HMG (high mobility group) box 5; PDB: 1L8Y_A 1L8Z_A 2HDZ_A.
Probab=98.83  E-value=1.3e-08  Score=63.36  Aligned_cols=73  Identities=23%  Similarity=0.360  Sum_probs=60.5

Q ss_pred             CCCCCCcHHHHHHHHHHHHHHHhCCCCcchHHHHHHHHHHHhccCCHHHHHHHHHhhhhhhhhhccccccCCCCCc
Q psy3264           8 KPRGRMTAYAFFVQVCREEHKKKHPEENVVFAEFSKKCAERWKTMSEKEKKRFHEMADKDKKRYDTEMQSYTPPAG   83 (156)
Q Consensus         8 ~PkrP~say~lF~~e~r~~~~~~~p~~~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~~~~~~~   83 (156)
                      .|..|.+|-.||.+.....+++.+++.+.   -..+.+...|++|++.+|-.|+..|.+++.+|+++|.+|..+..
T Consensus         3 lPE~PKt~qe~Wqq~vi~dYla~~~~dr~---K~~kam~~~W~~me~Kekl~WIkKA~EdqKrYE~el~e~r~~~~   75 (85)
T PF14887_consen    3 LPETPKTAQEIWQQSVIGDYLAKFRNDRK---KALKAMEAQWSQMEKKEKLKWIKKAAEDQKRYERELREMRSAPA   75 (85)
T ss_dssp             -S----THHHHHHHHHHHHHHHHTTSTHH---HHHHHHHHHHHTTGGGHHHHHHHHHHHHHHHHHHHHHCCS-CCC
T ss_pred             CCCCCCCHHHHHHHHHHHHHHHHhhHhHH---HHHHHHHHHHHHhhhhhhhHHHHHHHHHHHHHHHHHHHHhcCCC
Confidence            47778899999999999999999998764   44679999999999999999999999999999999999975443


No 27 
>KOG0526|consensus
Probab=98.79  E-value=2.5e-09  Score=88.00  Aligned_cols=64  Identities=39%  Similarity=0.645  Sum_probs=55.7

Q ss_pred             ccccccCCCCCccccccccccCCCCCchhHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3264          72 DTEMQSYTPPAGEKRGKIKQDNPEYGVGDIAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYK  139 (156)
Q Consensus        72 ~~~~~~~~~~~~~~r~~~~~~~p~~~~~e~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~  139 (156)
                      ++++.+|..+....|..++.+  |+++++++|.+|++|++||.  |.+|.+.|+.++++|+.||.+|+
T Consensus       537 kra~sa~m~w~~~~r~~ik~d--gi~~~dv~kk~g~~wk~ms~--k~~we~ka~~dk~ry~~em~~yk  600 (615)
T KOG0526|consen  537 KRATSAYMLWLNASRESIKED--GISVGDVAKKAGEKWKQMSA--KEEWEDKAAVDKQRYEDEMKEYK  600 (615)
T ss_pred             ccchhHHHHHHHhhhhhHhhc--CchHHHHHHHHhHHHhhhcc--cchhhHHHHHHHHHHHHHHHhhc
Confidence            345555555555558888887  99999999999999999999  99999999999999999999999


No 28 
>KOG0528|consensus
Probab=98.61  E-value=1.6e-08  Score=82.42  Aligned_cols=71  Identities=23%  Similarity=0.314  Sum_probs=61.0

Q ss_pred             CCCCCCcHHHHHHHHHHHHHHHhCCCCcchHHHHHHHHHHHhccCCHHHHHHHHHhhhhhhhhhccccccCCC
Q psy3264           8 KPRGRMTAYAFFVQVCREEHKKKHPEENVVFAEFSKKCAERWKTMSEKEKKRFHEMADKDKKRYDTEMQSYTP   80 (156)
Q Consensus         8 ~PkrP~say~lF~~e~r~~~~~~~p~~~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~~~~   80 (156)
                      .-|||+||||+|.++.|..|...+|++-.  ..|+++||..|+.||-.||++|.+.-...-..+-+....|..
T Consensus       325 HIKRPMNAFMVWAkDERRKILqA~PDMHN--SnISKILGSRWKaMSN~eKQPYYEEQaRLSk~HlEk~PdYrY  395 (511)
T KOG0528|consen  325 HIKRPMNAFMVWAKDERRKILQAFPDMHN--SNISKILGSRWKAMSNTEKQPYYEEQARLSKLHLEKYPDYRY  395 (511)
T ss_pred             cccCCcchhhcccchhhhhhhhcCccccc--cchhHHhcccccccccccccchHHHHHHHHHhhhccCccccc
Confidence            46999999999999999999999999986  899999999999999999999987655555566666666654


No 29 
>KOG0528|consensus
Probab=98.34  E-value=1.3e-07  Score=77.22  Aligned_cols=78  Identities=19%  Similarity=0.287  Sum_probs=69.2

Q ss_pred             hhhccccccCCCCCccccccccccCCCCCchhHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCC
Q psy3264          69 KRYDTEMQSYTPPAGEKRGKIKQDNPEYGVGDIAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKNRSVAPV  146 (156)
Q Consensus        69 ~~y~~~~~~~~~~~~~~r~~~~~~~p~~~~~e~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~~~~~~  146 (156)
                      -..++.|++|..+....|..+.+.+|++....|+|+||..|+.||..||+||.++-+++-..|....+.|+.+-.++.
T Consensus       324 PHIKRPMNAFMVWAkDERRKILqA~PDMHNSnISKILGSRWKaMSN~eKQPYYEEQaRLSk~HlEk~PdYrYkPRPKR  401 (511)
T KOG0528|consen  324 PHIKRPMNAFMVWAKDERRKILQAFPDMHNSNISKILGSRWKAMSNTEKQPYYEEQARLSKLHLEKYPDYRYKPRPKR  401 (511)
T ss_pred             ccccCCcchhhcccchhhhhhhhcCccccccchhHHhcccccccccccccchHHHHHHHHHhhhccCcccccCCCCCc
Confidence            355678888887777667778888999999999999999999999999999999999999999999999999877653


No 30 
>KOG4715|consensus
Probab=98.29  E-value=8.6e-07  Score=69.10  Aligned_cols=56  Identities=32%  Similarity=0.544  Sum_probs=52.9

Q ss_pred             ccccccccCCCCCchhHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy3264          85 KRGKIKQDNPEYGVGDIAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKN  140 (156)
Q Consensus        85 ~r~~~~~~~p~~~~~e~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~  140 (156)
                      ..+.++..||++...+|.|+||.+|..|+++||+.|+..++..+..|.+.|..|..
T Consensus        79 vWd~VkA~nPe~kLWeiGK~Ig~mW~dLpd~EK~ey~~EYeaEKieY~~smkayh~  134 (410)
T KOG4715|consen   79 VWDQVKASNPELKLWEIGKIIGGMWLDLPDEEKQEYLNEYEAEKIEYNESMKAYHN  134 (410)
T ss_pred             hhhhhhccCcchHHHHHHHHHHHHHhhCcchHHHHHHHHHHHHHHHHHHHHHHhhC
Confidence            36779999999999999999999999999999999999999999999999999853


No 31 
>KOG2746|consensus
Probab=98.29  E-value=5.4e-07  Score=76.22  Aligned_cols=66  Identities=26%  Similarity=0.415  Sum_probs=60.2

Q ss_pred             CCCCCCCcHHHHHHHHHH--HHHHHhCCCCcchHHHHHHHHHHHhccCCHHHHHHHHHhhhhhhhhhccc
Q psy3264           7 VKPRGRMTAYAFFVQVCR--EEHKKKHPEENVVFAEFSKKCAERWKTMSEKEKKRFHEMADKDKKRYDTE   74 (156)
Q Consensus         7 ~~PkrP~say~lF~~e~r--~~~~~~~p~~~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~   74 (156)
                      ..-+||+|+|++|++.+|  ..+...||+..+  +-|+++||+.|-.|.+.||+.|.++|.+.+..+-++
T Consensus       180 ~HirrPMnaf~ifskrhr~~g~vhq~~pn~DN--rtIskiLgewWytL~~~Ekq~yhdLa~Qvk~Ahfka  247 (683)
T KOG2746|consen  180 DHIRRPMNAFHIFSKRHRGEGRVHQRHPNQDN--RTISKILGEWWYTLGPNEKQKYHDLAFQVKEAHFKA  247 (683)
T ss_pred             hhhhhhhHHHHHHHhhcCCccchhccCccccc--hhHHHHHhhhHhhhCchhhhhHHHHHHHHHHHHhhh
Confidence            346899999999999999  899999999876  899999999999999999999999999988776654


No 32 
>PF14887 HMG_box_5:  HMG (high mobility group) box 5; PDB: 1L8Y_A 1L8Z_A 2HDZ_A.
Probab=98.13  E-value=4.9e-06  Score=51.92  Aligned_cols=54  Identities=19%  Similarity=0.358  Sum_probs=44.2

Q ss_pred             ccCCCCCchhHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCC
Q psy3264          91 QDNPEYGVGDIAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKNRSVAP  145 (156)
Q Consensus        91 ~~~p~~~~~e~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~~~~~  145 (156)
                      ..++.....+ .+.+...|++|++.+|-+|+..|.++..+|+.+|-+|+.-....
T Consensus        24 a~~~~dr~K~-~kam~~~W~~me~Kekl~WIkKA~EdqKrYE~el~e~r~~~~~~   77 (85)
T PF14887_consen   24 AKFRNDRKKA-LKAMEAQWSQMEKKEKLKWIKKAAEDQKRYERELREMRSAPADA   77 (85)
T ss_dssp             HHTTSTHHHH-HHHHHHHHHTTGGGHHHHHHHHHHHHHHHHHHHHHCCS-CCCTT
T ss_pred             HHhhHhHHHH-HHHHHHHHHHhhhhhhhHHHHHHHHHHHHHHHHHHHHhcCCCCC
Confidence            3455554444 45899999999999999999999999999999999999876654


No 33 
>PF04690 YABBY:  YABBY protein;  InterPro: IPR006780 YABBY proteins are a group of plant-specific transcription factors involved in the specification of abaxial polarity in lateral organs such as leaves and floral organs [, ].
Probab=97.94  E-value=2.2e-05  Score=56.75  Aligned_cols=48  Identities=19%  Similarity=0.257  Sum_probs=42.3

Q ss_pred             CCCCCCCCCCcHHHHHHHHHHHHHHHhCCCCcchHHHHHHHHHHHhccCC
Q psy3264           4 EKDVKPRGRMTAYAFFVQVCREEHKKKHPEENVVFAEFSKKCAERWKTMS   53 (156)
Q Consensus         4 ~~p~~PkrP~say~lF~~e~r~~~~~~~p~~~~~~~ei~k~l~~~Wk~Ls   53 (156)
                      |-|++-.|-+|||..|+++....|+..+|+++.  .|+++.++..|...+
T Consensus       117 kPPEKRqR~psaYn~f~k~ei~rik~~~p~ish--keaFs~aAknW~h~p  164 (170)
T PF04690_consen  117 KPPEKRQRVPSAYNRFMKEEIQRIKAENPDISH--KEAFSAAAKNWAHFP  164 (170)
T ss_pred             CCccccCCCchhHHHHHHHHHHHHHhcCCCCCH--HHHHHHHHHhhhhCc
Confidence            445555677899999999999999999999998  899999999998764


No 34 
>KOG3248|consensus
Probab=97.89  E-value=1.7e-06  Score=68.07  Aligned_cols=79  Identities=16%  Similarity=0.274  Sum_probs=68.3

Q ss_pred             hhhhhccccccCCCCCccccccccccCCCCCchhHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCC
Q psy3264          67 DKKRYDTEMQSYTPPAGEKRGKIKQDNPEYGVGDIAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKNRSVAP  145 (156)
Q Consensus        67 ~k~~y~~~~~~~~~~~~~~r~~~~~~~p~~~~~e~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~~~~~  145 (156)
                      .+...++.+++|..+-.+.|..+..+-...-..+|.++||.+|..||-+|.++|.++|+++++.|..-+..|-++.+..
T Consensus       188 KkphiKKPLNAFmlyMKEmRa~vvaEctlKeSAaiNqiLGrRWH~LSrEEQAKYyElArKerqlH~qlYP~WSARdNYg  266 (421)
T KOG3248|consen  188 KKPHIKKPLNAFMLYMKEMRAKVVAECTLKESAAINQILGRRWHALSREEQAKYYELARKERQLHMQLYPGWSARDNYG  266 (421)
T ss_pred             cCccccccHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHhHHHhhhhHHHHHHHHHHHHHHHHHHHHhcCCcchhhhhh
Confidence            3456678889998888898999888766555678999999999999999999999999999999999999998776654


No 35 
>PF06382 DUF1074:  Protein of unknown function (DUF1074);  InterPro: IPR024460 This family consists of several proteins which appear to be specific to Insecta. The function of this family is unknown.
Probab=97.39  E-value=0.00045  Score=49.98  Aligned_cols=47  Identities=32%  Similarity=0.511  Sum_probs=40.8

Q ss_pred             CcHHHHHHHHHHHHHHHhCCCCcchHHHHHHHHHHHhccCCHHHHHHHHHhhh
Q psy3264          13 MTAYAFFVQVCREEHKKKHPEENVVFAEFSKKCAERWKTMSEKEKKRFHEMAD   65 (156)
Q Consensus        13 ~say~lF~~e~r~~~~~~~p~~~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~   65 (156)
                      -+||+-|+.+.+.    .|.++..  .|++...+..|..||+.+|..|..++-
T Consensus        83 nnaYLNFLReFRr----kh~~L~p--~dlI~~AAraW~rLSe~eK~rYrr~~~  129 (183)
T PF06382_consen   83 NNAYLNFLREFRR----KHCGLSP--QDLIQRAARAWCRLSEAEKNRYRRMAP  129 (183)
T ss_pred             chHHHHHHHHHHH----HccCCCH--HHHHHHHHHHHHhCCHHHHHHHHhhcc
Confidence            4699999999876    5667765  899999999999999999999998654


No 36 
>KOG2746|consensus
Probab=96.77  E-value=0.0011  Score=56.80  Aligned_cols=68  Identities=28%  Similarity=0.429  Sum_probs=58.3

Q ss_pred             hhhhhccccccCCCCCcccc--ccccccCCCCCchhHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHH
Q psy3264          67 DKKRYDTEMQSYTPPAGEKR--GKIKQDNPEYGVGDIAKELGKKWADVDPSVKTKYEQMAEKDKARYEKE  134 (156)
Q Consensus        67 ~k~~y~~~~~~~~~~~~~~r--~~~~~~~p~~~~~e~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e  134 (156)
                      ++....+.|++|..+....|  +.+.+.||+..+..|+++||+.|-.|-+.||+.|.++|-+.++.|-+.
T Consensus       178 ~k~HirrPMnaf~ifskrhr~~g~vhq~~pn~DNrtIskiLgewWytL~~~Ekq~yhdLa~Qvk~Ahfka  247 (683)
T KOG2746|consen  178 DKDHIRRPMNAFHIFSKRHRGEGRVHQRHPNQDNRTISKILGEWWYTLGPNEKQKYHDLAFQVKEAHFKA  247 (683)
T ss_pred             cchhhhhhhHHHHHHHhhcCCccchhccCccccchhHHHHHhhhHhhhCchhhhhHHHHHHHHHHHHhhh
Confidence            33445567777777666666  778899999999999999999999999999999999999999998876


No 37 
>PF08073 CHDNT:  CHDNT (NUC034) domain;  InterPro: IPR012958 The CHD N-terminal domain is found in PHD/RING fingers and chromo domain-associated helicases [].; GO: 0003677 DNA binding, 0005524 ATP binding, 0008270 zinc ion binding, 0016818 hydrolase activity, acting on acid anhydrides, in phosphorus-containing anhydrides, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=96.57  E-value=0.0034  Score=36.92  Aligned_cols=40  Identities=23%  Similarity=0.434  Sum_probs=35.9

Q ss_pred             CcHHHHHHHHHHHHHHHhCCCCcchHHHHHHHHHHHhccCCH
Q psy3264          13 MTAYAFFVQVCREEHKKKHPEENVVFAEFSKKCAERWKTMSE   54 (156)
Q Consensus        13 ~say~lF~~e~r~~~~~~~p~~~~~~~ei~k~l~~~Wk~Ls~   54 (156)
                      +|.|-+|.+..|+.|...||++.+  +.+..+++.+|+..++
T Consensus        13 lt~yK~Fsq~vRP~l~~~NPk~~~--sKl~~l~~AKwrEF~~   52 (55)
T PF08073_consen   13 LTNYKAFSQHVRPLLAKANPKAPM--SKLMMLLQAKWREFQE   52 (55)
T ss_pred             HHHHHHHHHHHHHHHHHHCCCCcH--HHHHHHHHHHHHHHHh
Confidence            567889999999999999999976  8999999999997654


No 38 
>PF11304 DUF3106:  Protein of unknown function (DUF3106);  InterPro: IPR021455  Some members in this family of proteins are annotated as transmembrane proteins however this cannot be confirmed. Currently no function is known. 
Probab=94.55  E-value=0.071  Score=35.81  Aligned_cols=81  Identities=15%  Similarity=0.433  Sum_probs=55.3

Q ss_pred             HHHHHHHHHHhccCCHHHHHHHHHhhhh-------hhhhhccccccCCCCCccccccccccC---CCCCchhHHHHHHHH
Q psy3264          39 AEFSKKCAERWKTMSEKEKKRFHEMADK-------DKKRYDTEMQSYTPPAGEKRGKIKQDN---PEYGVGDIAKELGKK  108 (156)
Q Consensus        39 ~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~-------~k~~y~~~~~~~~~~~~~~r~~~~~~~---p~~~~~e~~k~~~~~  108 (156)
                      .++..-++..|+.|++..+..|...|..       .+.++...|..|..-.++.|..+...+   ..++..+.. .|...
T Consensus        10 q~~L~pl~~~W~~l~~~qr~k~l~~a~r~~~mspeqq~r~~~rm~~W~~LspeqR~~~R~~~~~~~~Lpp~qR~-~lr~~   88 (107)
T PF11304_consen   10 QQALAPLAERWNSLPPEQRRKWLQIAERWPSMSPEQQQRLRERMRRWAALSPEQRQQARENYQRFKQLPPEQRQ-ALRAR   88 (107)
T ss_pred             HHHHHHHHHHHhcCCHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHhCCHHHHHHHHHHHHHHHcCCHHHHH-HHHHH
Confidence            5677888899999999999988887755       566677777777665555555544433   344444333 46666


Q ss_pred             h---ccCChHHHHHH
Q psy3264         109 W---ADVDPSVKTKY  120 (156)
Q Consensus       109 W---k~ls~~eK~~y  120 (156)
                      |   ..||+++|...
T Consensus        89 w~~yq~l~~eeR~~l  103 (107)
T PF11304_consen   89 WEAYQQLPPEERQAL  103 (107)
T ss_pred             HHHHHcCCHHHHHHH
Confidence            6   78888888654


No 39 
>PF06382 DUF1074:  Protein of unknown function (DUF1074);  InterPro: IPR024460 This family consists of several proteins which appear to be specific to Insecta. The function of this family is unknown.
Probab=94.50  E-value=0.038  Score=40.16  Aligned_cols=38  Identities=16%  Similarity=0.437  Sum_probs=34.1

Q ss_pred             cccccCCCCCchhHHHHHHHHhccCChHHHHHHHHHHH
Q psy3264          88 KIKQDNPEYGVGDIAKELGKKWADVDPSVKTKYEQMAE  125 (156)
Q Consensus        88 ~~~~~~p~~~~~e~~k~~~~~Wk~ls~~eK~~y~~~a~  125 (156)
                      .+...|.+++..|+....+..|..||+.+|..|...+-
T Consensus        92 eFRrkh~~L~p~dlI~~AAraW~rLSe~eK~rYrr~~~  129 (183)
T PF06382_consen   92 EFRRKHCGLSPQDLIQRAARAWCRLSEAEKNRYRRMAP  129 (183)
T ss_pred             HHHHHccCCCHHHHHHHHHHHHHhCCHHHHHHHHhhcc
Confidence            36678899999999999999999999999999998654


No 40 
>PF06244 DUF1014:  Protein of unknown function (DUF1014);  InterPro: IPR010422 This family consists of several hypothetical eukaryotic proteins of unknown function.
Probab=93.07  E-value=0.18  Score=34.66  Aligned_cols=48  Identities=23%  Similarity=0.383  Sum_probs=40.0

Q ss_pred             CCCCC-CCCCcHHHHHHHHHHHHHHHhCCCCcchHHHHHHHHHHHhccCCH
Q psy3264           5 KDVKP-RGRMTAYAFFVQVCREEHKKKHPEENVVFAEFSKKCAERWKTMSE   54 (156)
Q Consensus         5 ~p~~P-krP~say~lF~~e~r~~~~~~~p~~~~~~~ei~k~l~~~Wk~Ls~   54 (156)
                      ...+| +|-.-||.-|...+.+.++.++|++..  +++..+|-..|..-++
T Consensus        68 ~drHPErR~KAAy~afeE~~Lp~lK~E~PgLrl--sQ~kq~l~K~w~KSPe  116 (122)
T PF06244_consen   68 IDRHPERRMKAAYKAFEERRLPELKEENPGLRL--SQYKQMLWKEWQKSPE  116 (122)
T ss_pred             CCCCcchhHHHHHHHHHHHHhHHHHhhCCCchH--HHHHHHHHHHHhcCCC
Confidence            34445 455579999999999999999999987  8999999999987654


No 41 
>PF04769 MAT_Alpha1:  Mating-type protein MAT alpha 1;  InterPro: IPR006856 This family includes Saccharomyces cerevisiae (Baker's yeast) mating type protein alpha 1 (P01365 from SWISSPROT). MAT alpha 1 is a transcription activator that activates mating-type alpha-specific genes with the help of the MADS-box containing MCM1 transcription factor, which together bind cooperatively to PQ elements upstream of alpha-specific genes. The MCM1-MATalpha1 complex is required for the proper DNA-bending that is needed for transcriptional activation []. Alpha 1 interacts in vivo with STE12, linking expression of alpha-specific genes to the alpha-pheromone (IPR006742 from INTERPRO) response pathway [].; GO: 0000772 mating pheromone activity, 0003677 DNA binding, 0045895 positive regulation of transcription, mating-type specific, 0005634 nucleus
Probab=93.02  E-value=0.27  Score=36.81  Aligned_cols=55  Identities=16%  Similarity=0.220  Sum_probs=40.8

Q ss_pred             CCCCCCCCCCcHHHHHHHHHHHHHHHhCCCCcchHHHHHHHHHHHhccCCHHHHHHHHHhhhh
Q psy3264           4 EKDVKPRGRMTAYAFFVQVCREEHKKKHPEENVVFAEFSKKCAERWKTMSEKEKKRFHEMADK   66 (156)
Q Consensus         4 ~~p~~PkrP~say~lF~~e~r~~~~~~~p~~~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~   66 (156)
                      ..+..++||+|+|++|..-.-    ...|+...  .+++.+|+..|..=+-  |..|.-+|..
T Consensus        39 ~~~~~~kr~lN~Fm~FRsyy~----~~~~~~~Q--k~~S~~l~~lW~~dp~--k~~W~l~ak~   93 (201)
T PF04769_consen   39 RSPEKAKRPLNGFMAFRSYYS----PIFPPLPQ--KELSGILTKLWEKDPF--KNKWSLMAKA   93 (201)
T ss_pred             ccccccccchhHHHHHHHHHH----hhcCCcCH--HHHHHHHHHHHhCCcc--HhHHHHHhhh
Confidence            345678999999999977654    44566654  8999999999997443  5666666654


No 42 
>PF08073 CHDNT:  CHDNT (NUC034) domain;  InterPro: IPR012958 The CHD N-terminal domain is found in PHD/RING fingers and chromo domain-associated helicases [].; GO: 0003677 DNA binding, 0005524 ATP binding, 0008270 zinc ion binding, 0016818 hydrolase activity, acting on acid anhydrides, in phosphorus-containing anhydrides, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=92.12  E-value=0.054  Score=31.88  Aligned_cols=33  Identities=18%  Similarity=0.357  Sum_probs=28.3

Q ss_pred             CccccccccccCCCCCchhHHHHHHHHhccCCh
Q psy3264          82 AGEKRGKIKQDNPEYGVGDIAKELGKKWADVDP  114 (156)
Q Consensus        82 ~~~~r~~~~~~~p~~~~~e~~k~~~~~Wk~ls~  114 (156)
                      ..--||.+...||++....+..+++.+|+..+.
T Consensus        20 sq~vRP~l~~~NPk~~~sKl~~l~~AKwrEF~~   52 (55)
T PF08073_consen   20 SQHVRPLLAKANPKAPMSKLMMLLQAKWREFQE   52 (55)
T ss_pred             HHHHHHHHHHHCCCCcHHHHHHHHHHHHHHHHh
Confidence            334489999999999999999999999997654


No 43 
>TIGR03481 HpnM hopanoid biosynthesis associated membrane protein HpnM. The genomes containing members of this family share the machinery for the biosynthesis of hopanoid lipids. Furthermore, the genes of this family are usually located proximal to other components of this biological process. The proteins are members of the pfam05494 family of putative transporters known as "toluene tolerance protein Ttg2D", although it is unlikely that the members included here have anything to do with toluene per-se.
Probab=92.04  E-value=0.38  Score=35.83  Aligned_cols=47  Identities=17%  Similarity=0.523  Sum_probs=40.9

Q ss_pred             CCCchhHHH-HHHHHhccCChHHHHHHHHHHHH-HHHHHHHHHHHHHhc
Q psy3264          95 EYGVGDIAK-ELGKKWADVDPSVKTKYEQMAEK-DKARYEKEMTAYKNR  141 (156)
Q Consensus        95 ~~~~~e~~k-~~~~~Wk~ls~~eK~~y~~~a~~-~k~~y~~e~~~~~~~  141 (156)
                      .+++..+++ .||..|+.+|+++++.|.+.... ....|-..+..|...
T Consensus        63 ~~Df~~mar~vLG~~W~~~s~~Qr~~F~~~F~~~l~~tY~~~l~~y~~~  111 (198)
T TIGR03481        63 AFDLPAMARLTLGSSWTSLSPEQRRRFIGAFRELSIATYASQFKSYAGE  111 (198)
T ss_pred             hCCHHHHHHHHhhhhhhhCCHHHHHHHHHHHHHHHHHHHHHHHHhhcCc
Confidence            456788875 78999999999999999999998 888999999999654


No 44 
>PRK15117 ABC transporter periplasmic binding protein MlaC; Provisional
Probab=88.89  E-value=1.3  Score=33.39  Aligned_cols=49  Identities=20%  Similarity=0.328  Sum_probs=41.5

Q ss_pred             CCCCCchhHHH-HHHHHhccCChHHHHHHHHHHHH-HHHHHHHHHHHHHhc
Q psy3264          93 NPEYGVGDIAK-ELGKKWADVDPSVKTKYEQMAEK-DKARYEKEMTAYKNR  141 (156)
Q Consensus        93 ~p~~~~~e~~k-~~~~~Wk~ls~~eK~~y~~~a~~-~k~~y~~e~~~~~~~  141 (156)
                      .|.+.+..+++ .||.-|+.+|+++++.|.+.... +...|-..+..|...
T Consensus        65 ~p~~Df~~~s~~vLG~~wr~as~eQr~~F~~~F~~~Lv~tYa~~l~~y~~q  115 (211)
T PRK15117         65 LPYVQVKYAGALVLGRYYKDATPAQREAYFAAFREYLKQAYGQALAMYHGQ  115 (211)
T ss_pred             cccCCHHHHHHHHhhhhhhhCCHHHHHHHHHHHHHHHHHHHHHHHHHhCCc
Confidence            46778888874 78999999999999999999887 556899999999754


No 45 
>PF05494 Tol_Tol_Ttg2:  Toluene tolerance, Ttg2 ;  InterPro: IPR008869 Toluene tolerance is mediated by increased cell membrane rigidity resulting from changes in fatty acid and phospholipid compositions, exclusion of toluene from the cell membrane, and removal of intracellular toluene by degradation []. Many proteins are involved in these processes. This family is a transporter which shows similarity to ABC transporters [].; PDB: 2QGU_A.
Probab=80.48  E-value=2.8  Score=30.11  Aligned_cols=47  Identities=19%  Similarity=0.525  Sum_probs=34.4

Q ss_pred             CCCCchhHH-HHHHHHhccCChHHHHHHHHHHHHH-HHHHHHHHHHHHh
Q psy3264          94 PEYGVGDIA-KELGKKWADVDPSVKTKYEQMAEKD-KARYEKEMTAYKN  140 (156)
Q Consensus        94 p~~~~~e~~-k~~~~~Wk~ls~~eK~~y~~~a~~~-k~~y~~e~~~~~~  140 (156)
                      |.+.+..++ ..||.-|+.+|+++++.|.+..... ...|-..+..|..
T Consensus        36 ~~~D~~~~ar~~LG~~w~~~s~~q~~~F~~~f~~~l~~~Y~~~l~~y~~   84 (170)
T PF05494_consen   36 PYFDFERMARRVLGRYWRKASPAQRQRFVEAFKQLLVRTYAKRLDEYSG   84 (170)
T ss_dssp             GGB-HHHHHHHHHGGGTTTS-HHHHHHHHHHHHHHHHHHHHHHHHT-SS
T ss_pred             HhCCHHHHHHHHHHHhHhhCCHHHHHHHHHHHHHHHHHHHHHHHHhhCC
Confidence            556677777 5677899999999999999998774 5567777777764


No 46 
>PF04690 YABBY:  YABBY protein;  InterPro: IPR006780 YABBY proteins are a group of plant-specific transcription factors involved in the specification of abaxial polarity in lateral organs such as leaves and floral organs [, ].
Probab=80.46  E-value=0.54  Score=34.19  Aligned_cols=45  Identities=20%  Similarity=0.338  Sum_probs=37.5

Q ss_pred             hhhccccccCCCCCccccccccccCCCCCchhHHHHHHHHhccCC
Q psy3264          69 KRYDTEMQSYTPPAGEKRGKIKQDNPEYGVGDIAKELGKKWADVD  113 (156)
Q Consensus        69 ~~y~~~~~~~~~~~~~~r~~~~~~~p~~~~~e~~k~~~~~Wk~ls  113 (156)
                      ++.++.-.+|+.+..+.-++++..+|+++-.|+++..+..|...+
T Consensus       120 EKRqR~psaYn~f~k~ei~rik~~~p~ishkeaFs~aAknW~h~p  164 (170)
T PF04690_consen  120 EKRQRVPSAYNRFMKEEIQRIKAENPDISHKEAFSAAAKNWAHFP  164 (170)
T ss_pred             cccCCCchhHHHHHHHHHHHHHhcCCCCCHHHHHHHHHHhhhhCc
Confidence            445566677777777778889999999999999999999998765


No 47 
>TIGR03481 HpnM hopanoid biosynthesis associated membrane protein HpnM. The genomes containing members of this family share the machinery for the biosynthesis of hopanoid lipids. Furthermore, the genes of this family are usually located proximal to other components of this biological process. The proteins are members of the pfam05494 family of putative transporters known as "toluene tolerance protein Ttg2D", although it is unlikely that the members included here have anything to do with toluene per-se.
Probab=77.78  E-value=3  Score=30.99  Aligned_cols=42  Identities=17%  Similarity=0.510  Sum_probs=33.7

Q ss_pred             HHHHHH-HHHHHhccCCHHHHHHHHHhhhh-hhhhhccccccCC
Q psy3264          38 FAEFSK-KCAERWKTMSEKEKKRFHEMADK-DKKRYDTEMQSYT   79 (156)
Q Consensus        38 ~~ei~k-~l~~~Wk~Ls~~eK~~y~~~a~~-~k~~y~~~~~~~~   79 (156)
                      |..+++ .||.-|+.+|+++|+.|.+.... ....|-..+..|.
T Consensus        66 f~~mar~vLG~~W~~~s~~Qr~~F~~~F~~~l~~tY~~~l~~y~  109 (198)
T TIGR03481        66 LPAMARLTLGSSWTSLSPEQRRRFIGAFRELSIATYASQFKSYA  109 (198)
T ss_pred             HHHHHHHHhhhhhhhCCHHHHHHHHHHHHHHHHHHHHHHHHhhc
Confidence            455654 78999999999999999998887 6677777777664


No 48 
>PF11304 DUF3106:  Protein of unknown function (DUF3106);  InterPro: IPR021455  Some members in this family of proteins are annotated as transmembrane proteins however this cannot be confirmed. Currently no function is known. 
Probab=75.42  E-value=14  Score=24.59  Aligned_cols=25  Identities=28%  Similarity=0.607  Sum_probs=12.6

Q ss_pred             HHHHHHHHhccCChHHHHHHHHHHH
Q psy3264         101 IAKELGKKWADVDPSVKTKYEQMAE  125 (156)
Q Consensus       101 ~~k~~~~~Wk~ls~~eK~~y~~~a~  125 (156)
                      +..-+...|+.|++..+..+...|.
T Consensus        12 ~L~pl~~~W~~l~~~qr~k~l~~a~   36 (107)
T PF11304_consen   12 ALAPLAERWNSLPPEQRRKWLQIAE   36 (107)
T ss_pred             HHHHHHHHHhcCCHHHHHHHHHHHH
Confidence            3344455555555555555554443


No 49 
>PF01352 KRAB:  KRAB box;  InterPro: IPR001909 The Krueppel-associated box (KRAB) is a domain of around 75 amino acids that is found in the N-terminal part of about one third of eukaryotic Krueppel-type C2H2 zinc finger proteins (ZFPs) []. It is enriched in charged amino acids and can be divided into subregions A and B, which are predicted to fold into two amphipathic alpha-helices. The KRAB A and B boxes can be separated by variable spacer segments and many KRAB proteins contain only the A box []. The functions currently known for members of the KRAB-containing protein family include transcriptional repression of RNA polymerase I, II, and III promoters, binding and splicing of RNA, and control of nucleolus function. The KRAB domain functions as a transcriptional repressor when tethered to the template DNA by a DNA-binding domain. A sequence of 45 amino acids in the KRAB A subdomain has been shown to be necessary and sufficient for transcriptional repression. The B box does not repress by itself but does potentiate the repression exerted by the KRAB A subdomain [, ]. Gene silencing requires the binding of the KRAB domain to the RING-B box-coiled coil (RBCC) domain of the KAP-1/TIF1-beta corepressor. As KAP-1 binds to the heterochromatin proteins HP1, it has been proposed that the KRAB-ZFP-bound target gene could be silenced following recruitment to heterochromatin [, ]. KRAB-ZFPs probably constitute the single largest class of transcription factors within the human genome []. Although the function of KRAB-ZFPs is largely unknown, they appear to play important roles during cell differentiation and development. The KRAB domain is generally encoded by two exons. The regions coded by the two exons are known as KRAB-A and KRAB-B.; GO: 0003676 nucleic acid binding, 0006355 regulation of transcription, DNA-dependent, 0005622 intracellular; PDB: 1V65_A.
Probab=73.30  E-value=2.4  Score=23.28  Aligned_cols=28  Identities=25%  Similarity=0.528  Sum_probs=15.6

Q ss_pred             CchhHHHHHH-HHhccCChHHHHHHHHHH
Q psy3264          97 GVGDIAKELG-KKWADVDPSVKTKYEQMA  124 (156)
Q Consensus        97 ~~~e~~k~~~-~~Wk~ls~~eK~~y~~~a  124 (156)
                      ++.|++.-++ +.|..|.+.+|.-|.+.-
T Consensus         2 tf~Dvav~fs~eEW~~L~~~Qk~ly~dvm   30 (41)
T PF01352_consen    2 TFEDVAVYFSQEEWELLDPAQKNLYRDVM   30 (41)
T ss_dssp             -----TT---HHHHHTS-HHHHHHHHHHH
T ss_pred             eEEEEEEEcChhhcccccceecccchhHH
Confidence            4566665555 459999999999998754


No 50 
>KOG3223|consensus
Probab=73.10  E-value=5.1  Score=29.71  Aligned_cols=44  Identities=14%  Similarity=0.339  Sum_probs=38.5

Q ss_pred             CCCCcHHHHHHHHHHHHHHHhCCCCcchHHHHHHHHHHHhccCCHH
Q psy3264          10 RGRMTAYAFFVQVCREEHKKKHPEENVVFAEFSKKCAERWKTMSEK   55 (156)
Q Consensus        10 krP~say~lF~~e~r~~~~~~~p~~~~~~~ei~k~l~~~Wk~Ls~~   55 (156)
                      +|=.-||.-|=....+.++..+|++.+  ++.-.+|-.+|..-++.
T Consensus       166 kRmrAA~~afEe~~LPrLK~e~P~lrl--sQ~Kqll~Kew~KsPDN  209 (221)
T KOG3223|consen  166 KRMRAAFKAFEEARLPRLKKENPGLRL--SQYKQLLKKEWQKSPDN  209 (221)
T ss_pred             HHHHHHHHHHHHhhchhhhhcCCCccH--HHHHHHHHHHHhhCCCC
Confidence            455678999999999999999999987  89999999999887763


No 51 
>PF13875 DUF4202:  Domain of unknown function (DUF4202)
Probab=71.66  E-value=10  Score=28.02  Aligned_cols=40  Identities=25%  Similarity=0.440  Sum_probs=34.0

Q ss_pred             HHHHHHHHHHHHHHHhCCCCcchHHHHHHHHHHHhccCCHHHHHH
Q psy3264          15 AYAFFVQVCREEHKKKHPEENVVFAEFSKKCAERWKTMSEKEKKR   59 (156)
Q Consensus        15 ay~lF~~e~r~~~~~~~p~~~~~~~ei~k~l~~~Wk~Ls~~eK~~   59 (156)
                      +.++|+...+..+...|..     ..+..+|..-|+.||+.-++.
T Consensus       131 acLVFL~~~f~~F~~~~de-----eK~v~Il~KTw~KMS~~g~~~  170 (185)
T PF13875_consen  131 ACLVFLEYYFEDFAAKHDE-----EKIVDILRKTWRKMSERGHEA  170 (185)
T ss_pred             HHHHhHHHHHHHHHhcCCH-----HHHHHHHHHHHHHCCHHHHHH
Confidence            5788999999999998832     688899999999999987753


No 52 
>COG2854 Ttg2D ABC-type transport system involved in resistance to organic solvents, auxiliary component [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=69.35  E-value=12  Score=28.01  Aligned_cols=54  Identities=17%  Similarity=0.251  Sum_probs=41.5

Q ss_pred             CCCCchhHH-HHHHHHhccCChHHHHHHHHHHHH-HHHHHHHHHHHHHhcCCCCCC
Q psy3264          94 PEYGVGDIA-KELGKKWADVDPSVKTKYEQMAEK-DKARYEKEMTAYKNRSVAPVN  147 (156)
Q Consensus        94 p~~~~~e~~-k~~~~~Wk~ls~~eK~~y~~~a~~-~k~~y~~e~~~~~~~~~~~~~  147 (156)
                      |-+.+.-++ ..||.-|+++|+++++.|.+.... +.+.|-..|..|+.+...=..
T Consensus        68 p~vd~~~~a~~vLGk~~k~aspeQ~~~F~~aF~~yl~q~Y~~aL~~Y~~q~~~v~~  123 (202)
T COG2854          68 PYVDFKYAAKLVLGKYYKTASPEQRQAFFKAFRTYLEQTYGQALLDYKGQTLKVKP  123 (202)
T ss_pred             hhhcHHHHHHHHhccccccCCHHHHHHHHHHHHHHHHHHHHHHHHHccCCCceeCC
Confidence            444444444 668899999999999999998877 567799999999887654333


No 53 
>PF12881 NUT_N:  NUT protein N terminus;  InterPro: IPR024309 This domain is found in the N-terminal region of Nuclear Testis (NUT) proteins. It is also found in FAM22, which are a family of uncharacterised mammalian proteins.
Probab=68.10  E-value=14  Score=29.47  Aligned_cols=52  Identities=21%  Similarity=0.306  Sum_probs=45.8

Q ss_pred             CcHHHHHHHHHHHHHHHhCCCCcchHHHHHHHHHHHhccCCHHHHHHHHHhhhh
Q psy3264          13 MTAYAFFVQVCREEHKKKHPEENVVFAEFSKKCAERWKTMSEKEKKRFHEMADK   66 (156)
Q Consensus        13 ~say~lF~~e~r~~~~~~~p~~~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~   66 (156)
                      ..|+.+|+.-....+....|.+++  .|-....=++|.-.|-=+|..|.++|.+
T Consensus       229 ~EAlSCFLIpvLrsLar~kPtMtl--EeGl~ra~qEW~~~SnfdRmifyemaek  280 (328)
T PF12881_consen  229 AEALSCFLIPVLRSLARLKPTMTL--EEGLWRAVQEWQHTSNFDRMIFYEMAEK  280 (328)
T ss_pred             hhhhhhhHHHHHHHHHhcCCCccH--HHHHHHHHHHhhccccccHHHHHHHHHH
Confidence            458889998888888888898876  7888888899999999999999999976


No 54 
>PF05494 Tol_Tol_Ttg2:  Toluene tolerance, Ttg2 ;  InterPro: IPR008869 Toluene tolerance is mediated by increased cell membrane rigidity resulting from changes in fatty acid and phospholipid compositions, exclusion of toluene from the cell membrane, and removal of intracellular toluene by degradation []. Many proteins are involved in these processes. This family is a transporter which shows similarity to ABC transporters [].; PDB: 2QGU_A.
Probab=64.31  E-value=4  Score=29.26  Aligned_cols=41  Identities=17%  Similarity=0.452  Sum_probs=29.8

Q ss_pred             HHHH-HHHHHHhccCCHHHHHHHHHhhhh-hhhhhccccccCC
Q psy3264          39 AEFS-KKCAERWKTMSEKEKKRFHEMADK-DKKRYDTEMQSYT   79 (156)
Q Consensus        39 ~ei~-k~l~~~Wk~Ls~~eK~~y~~~a~~-~k~~y~~~~~~~~   79 (156)
                      ..++ ..||.-|+.+|+++++.|.+.... ....|-..+..|.
T Consensus        41 ~~~ar~~LG~~w~~~s~~q~~~F~~~f~~~l~~~Y~~~l~~y~   83 (170)
T PF05494_consen   41 ERMARRVLGRYWRKASPAQRQRFVEAFKQLLVRTYAKRLDEYS   83 (170)
T ss_dssp             HHHHHHHHGGGTTTS-HHHHHHHHHHHHHHHHHHHHHHHHT-S
T ss_pred             HHHHHHHHHHhHhhCCHHHHHHHHHHHHHHHHHHHHHHHHhhC
Confidence            4444 467789999999999999988777 5566777777764


No 55 
>PRK15117 ABC transporter periplasmic binding protein MlaC; Provisional
Probab=62.72  E-value=6.7  Score=29.45  Aligned_cols=42  Identities=10%  Similarity=0.166  Sum_probs=32.6

Q ss_pred             HHHHH-HHHHHHhccCCHHHHHHHHHhhhh-hhhhhccccccCC
Q psy3264          38 FAEFS-KKCAERWKTMSEKEKKRFHEMADK-DKKRYDTEMQSYT   79 (156)
Q Consensus        38 ~~ei~-k~l~~~Wk~Ls~~eK~~y~~~a~~-~k~~y~~~~~~~~   79 (156)
                      |..++ ..||.-|+++|+++|..|.+.... ....|-..+..|.
T Consensus        70 f~~~s~~vLG~~wr~as~eQr~~F~~~F~~~Lv~tYa~~l~~y~  113 (211)
T PRK15117         70 VKYAGALVLGRYYKDATPAQREAYFAAFREYLKQAYGQALAMYH  113 (211)
T ss_pred             HHHHHHHHhhhhhhhCCHHHHHHHHHHHHHHHHHHHHHHHHHhC
Confidence            34554 478999999999999999988777 5567777777664


No 56 
>PF06945 DUF1289:  Protein of unknown function (DUF1289);  InterPro: IPR010710 This family consists of a number of hypothetical bacterial proteins. The aligned region spans around 56 residues and contains 4 highly conserved cysteine residues towards the N terminus. The function of this family is unknown.
Probab=59.58  E-value=11  Score=21.64  Aligned_cols=21  Identities=5%  Similarity=0.204  Sum_probs=16.6

Q ss_pred             HHhccCChHHHHHHHHHHHHH
Q psy3264         107 KKWADVDPSVKTKYEQMAEKD  127 (156)
Q Consensus       107 ~~Wk~ls~~eK~~y~~~a~~~  127 (156)
                      ..|+.||+++|..........
T Consensus        28 ~~W~~~s~~er~~i~~~l~~R   48 (51)
T PF06945_consen   28 RDWKSMSDDERRAILARLRAR   48 (51)
T ss_pred             HHHhhCCHHHHHHHHHHHHHH
Confidence            379999999998877765544


No 57 
>PRK10363 cpxP periplasmic repressor CpxP; Reviewed
Probab=57.02  E-value=48  Score=24.07  Aligned_cols=44  Identities=11%  Similarity=0.210  Sum_probs=32.3

Q ss_pred             hhHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHhcCC
Q psy3264          99 GDIAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKNRSV  143 (156)
Q Consensus        99 ~e~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~~~  143 (156)
                      .+.++.-...++-|+|++|..|.++.++-..++.. +..+.....
T Consensus       111 Vem~k~~nqmy~lLTPEQKaq~~~~~~~rm~~~~~-~~~~q~~s~  154 (166)
T PRK10363        111 VEMAKVRNQMYRLLTPEQQAVLNEKHQQRMEQLRD-VTQWQKSSS  154 (166)
T ss_pred             HHHHHHHHHHHHhCCHHHHHHHHHHHHHHHHHHHH-HHhcCcccc
Confidence            34556667789999999999999988877777744 665544433


No 58 
>PRK09706 transcriptional repressor DicA; Reviewed
Probab=55.33  E-value=62  Score=22.07  Aligned_cols=46  Identities=11%  Similarity=0.034  Sum_probs=39.6

Q ss_pred             hhHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC
Q psy3264          99 GDIAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKNRSVA  144 (156)
Q Consensus        99 ~e~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~~~~  144 (156)
                      ..-.+.|-..|+.||++++......++...+.|..-+++|..+...
T Consensus        86 ~~~~~~ll~~~~~L~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~  131 (135)
T PRK09706         86 SEDQKELLELFDALPESEQDAQLSEMRARVENFNKLFEELLKARKR  131 (135)
T ss_pred             CHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence            3445778899999999999999999999999999999999765443


No 59 
>PF15581 Imm35:  Immunity protein 35
Probab=51.65  E-value=38  Score=21.88  Aligned_cols=30  Identities=10%  Similarity=0.210  Sum_probs=23.2

Q ss_pred             HHHHHHHHHHHhccCCHHHHHHHHHhhhhh
Q psy3264          38 FAEFSKKCAERWKTMSEKEKKRFHEMADKD   67 (156)
Q Consensus        38 ~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~   67 (156)
                      ..-+...+...|+.|+.++=..-.+.+...
T Consensus        32 i~~l~~lIe~eWRGl~~~qV~~kl~ava~~   61 (93)
T PF15581_consen   32 IRNLESLIEHEWRGLPEEQVLYKLEAVAAK   61 (93)
T ss_pred             HHHHHHHHHHHHcCCCHHHHHHHHHHHHhc
Confidence            467889999999999999876665555443


No 60 
>KOG3838|consensus
Probab=51.23  E-value=19  Score=29.91  Aligned_cols=40  Identities=20%  Similarity=0.248  Sum_probs=34.6

Q ss_pred             cCChHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCcc
Q psy3264         111 DVDPSVKTKYEQMAEKDKARYEKEMTAYKNRSVAPVNEEA  150 (156)
Q Consensus       111 ~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~~~~~~~~~~  150 (156)
                      .+.+.+|++|.++.+.....|+++-++|++.++....+++
T Consensus       268 E~qe~ek~kyqeEfe~~q~elek~k~efkk~hpd~~~e~e  307 (497)
T KOG3838|consen  268 EMQELEKAKYQEEFEWAQLELEKRKDEFKKSHPDAQGEGE  307 (497)
T ss_pred             hhhHHHHHHHHHHHHHHHHHHhhhHhhhccCCchhhcchh
Confidence            4567799999999999999999999999999988776644


No 61 
>PF06244 DUF1014:  Protein of unknown function (DUF1014);  InterPro: IPR010422 This family consists of several hypothetical eukaryotic proteins of unknown function.
Probab=51.01  E-value=12  Score=25.78  Aligned_cols=31  Identities=19%  Similarity=0.417  Sum_probs=27.2

Q ss_pred             cccccccccCCCCCchhHHHHHHHHhccCCh
Q psy3264          84 EKRGKIKQDNPEYGVGDIAKELGKKWADVDP  114 (156)
Q Consensus        84 ~~r~~~~~~~p~~~~~e~~k~~~~~Wk~ls~  114 (156)
                      .+-|.|+.++||+..+.+-.+|-..|...++
T Consensus        86 ~~Lp~lK~E~PgLrlsQ~kq~l~K~w~KSPe  116 (122)
T PF06244_consen   86 RRLPELKEENPGLRLSQYKQMLWKEWQKSPE  116 (122)
T ss_pred             HHhHHHHhhCCCchHHHHHHHHHHHHhcCCC
Confidence            3468899999999999999999999988765


No 62 
>PRK12751 cpxP periplasmic stress adaptor protein CpxP; Reviewed
Probab=49.01  E-value=47  Score=23.95  Aligned_cols=77  Identities=9%  Similarity=0.124  Sum_probs=43.0

Q ss_pred             ccCCHHHHHHHHHhhhhhhhhhccccccCCCCCccccccccccCCC-C---------------CchhHHHHHHHHhccCC
Q psy3264          50 KTMSEKEKKRFHEMADKDKKRYDTEMQSYTPPAGEKRGKIKQDNPE-Y---------------GVGDIAKELGKKWADVD  113 (156)
Q Consensus        50 k~Ls~~eK~~y~~~a~~~k~~y~~~~~~~~~~~~~~r~~~~~~~p~-~---------------~~~e~~k~~~~~Wk~ls  113 (156)
                      -+||+++|....++....+.........   -......-+..+..+ .               -..+..+.....+..||
T Consensus        55 l~LTd~QR~qmr~im~~~r~~~~~~~~~---~~~~m~~Li~Ad~FDeaAvra~~~kma~~~~e~~v~~~~~~~qmy~lLT  131 (162)
T PRK12751         55 INLTEQQRQQMRDLMRQSHQSQPRLDLE---DREAMHKLITADKFDEAAVRAQAEKMSQNQIERHVEMAKVRNQMYNLLT  131 (162)
T ss_pred             CCCCHHHHHHHHHHHHHhhhcccchhHH---HHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCC
Confidence            6899999999998887766542111000   000001111122211 0               01234456677899999


Q ss_pred             hHHHHHHHHHHHHHHH
Q psy3264         114 PSVKTKYEQMAEKDKA  129 (156)
Q Consensus       114 ~~eK~~y~~~a~~~k~  129 (156)
                      +++|..|.++.++-..
T Consensus       132 PEQra~l~~~~e~r~~  147 (162)
T PRK12751        132 PEQKEALNKKHQERIE  147 (162)
T ss_pred             HHHHHHHHHHHHHHHH
Confidence            9999999995554433


No 63 
>PF12650 DUF3784:  Domain of unknown function (DUF3784);  InterPro: IPR017259 This group represents an uncharacterised conserved protein.
Probab=46.79  E-value=14  Score=23.88  Aligned_cols=18  Identities=17%  Similarity=0.270  Sum_probs=14.8

Q ss_pred             HHHhccCChHHHHHHHHH
Q psy3264         106 GKKWADVDPSVKTKYEQM  123 (156)
Q Consensus       106 ~~~Wk~ls~~eK~~y~~~  123 (156)
                      =+-|+.||++||+.|...
T Consensus        23 IaGyntms~eEk~~~D~~   40 (97)
T PF12650_consen   23 IAGYNTMSKEEKEKYDKK   40 (97)
T ss_pred             hhhcccCCHHHHHHhhHH
Confidence            357999999999999653


No 64 
>PRK10236 hypothetical protein; Provisional
Probab=46.35  E-value=23  Score=27.16  Aligned_cols=26  Identities=15%  Similarity=0.414  Sum_probs=21.8

Q ss_pred             HHHHHHHHhccCChHHHHHHHHHHHH
Q psy3264         101 IAKELGKKWADVDPSVKTKYEQMAEK  126 (156)
Q Consensus       101 ~~k~~~~~Wk~ls~~eK~~y~~~a~~  126 (156)
                      +++.+...|+.||++|++.+.+.-..
T Consensus       118 l~kll~~a~~kms~eE~~~L~~~l~~  143 (237)
T PRK10236        118 LEQFLRNTWKKMDEEHKQEFLHAVDA  143 (237)
T ss_pred             HHHHHHHHHHHCCHHHHHHHHHHHhh
Confidence            56999999999999999888765544


No 65 
>PRK10236 hypothetical protein; Provisional
Probab=45.62  E-value=25  Score=26.97  Aligned_cols=28  Identities=25%  Similarity=0.451  Sum_probs=22.8

Q ss_pred             HHHHHHHHHHhccCCHHHHHHHHHhhhh
Q psy3264          39 AEFSKKCAERWKTMSEKEKKRFHEMADK   66 (156)
Q Consensus        39 ~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~   66 (156)
                      .-+.+++...|..||++|++.+...-..
T Consensus       116 ~il~kll~~a~~kms~eE~~~L~~~l~~  143 (237)
T PRK10236        116 QLLEQFLRNTWKKMDEEHKQEFLHAVDA  143 (237)
T ss_pred             HHHHHHHHHHHHHCCHHHHHHHHHHHhh
Confidence            3467899999999999999888875544


No 66 
>PF11616 EZH2_WD-Binding:  WD repeat binding protein EZH2;  InterPro: IPR021654  This family of proteins represents Enhancer of zest homologue 2, (EZH2) a 30 residue peptide which binds to a WD-repeat domain of EED by residues 39-68. EED is a component of PRC2 complex which is involved in gene expression []. This interaction is required for the HMTase activity of PCR2 []. ; GO: 0018024 histone-lysine N-methyltransferase activity; PDB: 2QXV_B.
Probab=43.14  E-value=47  Score=16.89  Aligned_cols=14  Identities=29%  Similarity=0.539  Sum_probs=10.7

Q ss_pred             HHHHHHHHHHhccC
Q psy3264          39 AEFSKKCAERWKTM   52 (156)
Q Consensus        39 ~ei~k~l~~~Wk~L   52 (156)
                      .+-+.+|.+.|+.|
T Consensus        12 ~e~t~iLN~eWk~l   25 (30)
T PF11616_consen   12 QERTDILNEEWKKL   25 (30)
T ss_dssp             HHHHHHHHHHHHH-
T ss_pred             HHHHHHHHHHHHHh
Confidence            57778899999876


No 67 
>PRK12750 cpxP periplasmic repressor CpxP; Reviewed
Probab=40.30  E-value=90  Score=22.63  Aligned_cols=35  Identities=11%  Similarity=0.303  Sum_probs=28.4

Q ss_pred             HHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHH
Q psy3264         101 IAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEM  135 (156)
Q Consensus       101 ~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~  135 (156)
                      +.+.....+.-||+++|..|.+.-..-.+.+...+
T Consensus       126 ~~~~~~~~~~vLTpEQRak~~e~~~~r~~~~~~~~  160 (170)
T PRK12750        126 MLEKRHQMLSILTPEQKAKFQELQQERMQECQDKM  160 (170)
T ss_pred             HHHHHHHHHHhCCHHHHHHHHHHHHHHHHHHHHHH
Confidence            33455567999999999999999888877777766


No 68 
>cd07081 ALDH_F20_ACDH_EutE-like Coenzyme A acylating aldehyde dehydrogenase (ACDH), Ethanolamine utilization protein EutE, and related proteins. Coenzyme A acylating aldehyde dehydrogenase (ACDH), an NAD+ and CoA-dependent acetaldehyde dehydrogenase, acetylating (EC=1.2.1.10), functions as a single enzyme (such as the Ethanolamine utilization protein, EutE, in Salmonella typhimurium) or as part of a multifunctional enzyme to convert acetaldehyde into acetyl-CoA. The E. coli aldehyde-alcohol dehydrogenase includes the functional domains, alcohol dehydrogenase (ADH), ACDH, and pyruvate-formate-lyase deactivase; and the Entamoeba histolytica aldehyde-alcohol dehydrogenase 2 (ALDH20A1) includes the functional domains ADH and ACDH, and may be critical enzymes in the fermentative pathway.
Probab=39.41  E-value=89  Score=26.17  Aligned_cols=42  Identities=7%  Similarity=-0.058  Sum_probs=35.4

Q ss_pred             hHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy3264         100 DIAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKNR  141 (156)
Q Consensus       100 e~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~  141 (156)
                      +.++..-..|..+|..+|..+...+....+++..++......
T Consensus         6 ~~A~~A~~~W~~~~~~~R~~iL~~~a~~l~~~~~ela~~~~~   47 (439)
T cd07081           6 AAAKVAQQGLSCKSQEMVDLIFRAAAEAAEDARIDLAKLAVS   47 (439)
T ss_pred             HHHHHHHHHHhhCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            344556678999999999999999999999999999887443


No 69 
>KOG3223|consensus
Probab=39.26  E-value=25  Score=26.14  Aligned_cols=32  Identities=19%  Similarity=0.443  Sum_probs=28.5

Q ss_pred             cccccccccCCCCCchhHHHHHHHHhccCChH
Q psy3264          84 EKRGKIKQDNPEYGVGDIAKELGKKWADVDPS  115 (156)
Q Consensus        84 ~~r~~~~~~~p~~~~~e~~k~~~~~Wk~ls~~  115 (156)
                      .+-|+|+.++|++.....-.+|-..|...++.
T Consensus       178 ~~LPrLK~e~P~lrlsQ~Kqll~Kew~KsPDN  209 (221)
T KOG3223|consen  178 ARLPRLKKENPGLRLSQYKQLLKKEWQKSPDN  209 (221)
T ss_pred             hhchhhhhcCCCccHHHHHHHHHHHHhhCCCC
Confidence            44788999999999999999999999988764


No 70 
>PF06424 PRP1_N:  PRP1 splicing factor, N-terminal;  InterPro: IPR010491 This domain is specific to the N-terminal part of the prp1 splicing factor, which is involved in mRNA splicing (and possibly also poly(A)+ RNA nuclear export and cell cycle progression). This domain is specific to the N terminus of the RNA splicing factor encoded by prp1 []. It is involved in mRNA splicing and possibly also poly(A)and RNA nuclear export and cell cycle progression.; GO: 0000398 nuclear mRNA splicing, via spliceosome, 0005634 nucleus
Probab=34.41  E-value=33  Score=23.97  Aligned_cols=31  Identities=29%  Similarity=0.631  Sum_probs=21.9

Q ss_pred             HHHHHhCCCCcchHHHHHHHHH----HHhccCCHH
Q psy3264          25 EEHKKKHPEENVVFAEFSKKCA----ERWKTMSEK   55 (156)
Q Consensus        25 ~~~~~~~p~~~~~~~ei~k~l~----~~Wk~Ls~~   55 (156)
                      ..+...+|.+...|.+|-+.|+    +.|-+|++.
T Consensus        93 e~~~~~~pkI~~QFaDLKR~La~VS~eeW~~IPE~  127 (133)
T PF06424_consen   93 EKYRKENPKIQQQFADLKRSLATVSEEEWENIPEA  127 (133)
T ss_pred             HhhhccCchHHHHHHHHHHHHccCCHHHHhcCCcc
Confidence            3445567777777888877776    588888863


No 71 
>PF05914 RIB43A:  RIB43A;  InterPro: IPR008805 This family consists of several RIB43A-like eukaryotic proteins. Ciliary and flagellar microtubules contain a specialised set of protofilaments, termed ribbons, that are composed of tubulin and several associated proteins. RIB43A was first characterised in the unicellular biflagellate, Chlamydomonas reinhardtii although highly related sequences are present in several higher eukaryotes including humans. The function of this protein is unknown although the structure of RIB43A and its association with the specialised protofilament ribbons and with basal bodies is relevant to the proposed role of ribbons in forming and stabilising doublet and triplet microtubules and in organising their three-dimensional structure. Human RIB43A homologues could represent a structural requirement in centriole replication in dividing cells [].
Probab=34.19  E-value=29  Score=28.59  Aligned_cols=40  Identities=13%  Similarity=0.363  Sum_probs=27.5

Q ss_pred             ccccccCCCCCchh--HHHHHHHHhccCChHHHHHHHHHHHH
Q psy3264          87 GKIKQDNPEYGVGD--IAKELGKKWADVDPSVKTKYEQMAEK  126 (156)
Q Consensus        87 ~~~~~~~p~~~~~e--~~k~~~~~Wk~ls~~eK~~y~~~a~~  126 (156)
                      ..+..++|.+..+.  --+.+...||+||+++...+...-+.
T Consensus       250 sdlLtEnp~~a~s~~gp~Rv~~d~wKGMs~eQl~~i~~~Q~~  291 (379)
T PF05914_consen  250 SDLLTENPQVAQSSFGPHRVIPDRWKGMSPEQLEEIRKEQEQ  291 (379)
T ss_pred             CccccCCchhccccCCCCCCCCcccCCCCHHHHHHHHHHHHH
Confidence            34666777654222  23567789999999999988874443


No 72 
>cd07122 ALDH_F20_ACDH Coenzyme A acylating aldehyde dehydrogenase (ACDH), ALDH family 20-like. Coenzyme A acylating aldehyde dehydrogenase (ACDH, EC=1.2.1.10), an NAD+ and CoA-dependent acetaldehyde dehydrogenase, functions as a single enzyme (such as the Ethanolamine utilization protein, EutE, in Salmonella typhimurium) or as part of a multifunctional enzyme to convert acetaldehyde into acetyl-CoA . The E. coli aldehyde-alcohol dehydrogenase includes the functional domains, alcohol dehydrogenase (ADH), ACDH, and pyruvate-formate-lyase deactivase; and the Entamoeba histolytica aldehyde-alcohol dehydrogenase 2 (ALDH20A1) includes the functional domains ADH and ACDH and may be critical enzymes in the fermentative pathway.
Probab=32.82  E-value=1.3e+02  Score=25.23  Aligned_cols=41  Identities=12%  Similarity=0.098  Sum_probs=35.0

Q ss_pred             HHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy3264         101 IAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKNR  141 (156)
Q Consensus       101 ~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~  141 (156)
                      .++..-..|..+|..+|..+...+....+++..++......
T Consensus         7 ~A~~A~~~W~~~~~~eR~~~L~~~a~~l~~~~eela~~~~~   47 (436)
T cd07122           7 RARKAQREFATFSQEQVDKIVEAVAWAAADAAEELAKMAVE   47 (436)
T ss_pred             HHHHHHHHHHhCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            44555678999999999999999999999999999887544


No 73 
>PF09791 Oxidored-like:  Oxidoreductase-like protein, N-terminal;  InterPro: IPR019180 This entry represents the N-terminal domain of various oxidoreductase-like proteins whose exact function is, as yet, unknown. 
Probab=32.73  E-value=78  Score=17.97  Aligned_cols=18  Identities=11%  Similarity=0.433  Sum_probs=10.9

Q ss_pred             HHHHHHHHHHHHHHHHHh
Q psy3264         123 MAEKDKARYEKEMTAYKN  140 (156)
Q Consensus       123 ~a~~~k~~y~~e~~~~~~  140 (156)
                      ....+.++|...+++|++
T Consensus        30 ~Y~eel~~y~~~~~~~~~   47 (48)
T PF09791_consen   30 VYAEELEEYREALAAWQE   47 (48)
T ss_pred             HHHHHHHHHHHHHHHHHh
Confidence            445555666666666654


No 74 
>PF11278 DUF3079:  Protein of unknown function (DUF3079);  InterPro: IPR021430  This family of proteins with unknown function appears to be restricted to Proteobacteria. 
Probab=32.52  E-value=32  Score=19.65  Aligned_cols=13  Identities=23%  Similarity=0.345  Sum_probs=10.9

Q ss_pred             CCCCCCCCCCCCC
Q psy3264           1 MPKEKDVKPRGRM   13 (156)
Q Consensus         1 ~~k~~p~~PkrP~   13 (156)
                      |-||.|..|+.|-
T Consensus         1 MaKkFPlhP~hPE   13 (52)
T PF11278_consen    1 MAKKFPLHPKHPE   13 (52)
T ss_pred             CCCcCCCCCCCcc
Confidence            7889999998883


No 75 
>cd07133 ALDH_CALDH_CalB Coniferyl aldehyde dehydrogenase-like. Coniferyl aldehyde dehydrogenase (CALDH, EC=1.2.1.68) of Pseudomonas sp. strain HR199 (CalB) which catalyzes the NAD+-dependent oxidation of coniferyl aldehyde to ferulic acid, and similar sequences, are present in this CD.
Probab=32.10  E-value=1.3e+02  Score=25.03  Aligned_cols=43  Identities=2%  Similarity=-0.187  Sum_probs=36.1

Q ss_pred             hHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHhcC
Q psy3264         100 DIAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKNRS  142 (156)
Q Consensus       100 e~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~~  142 (156)
                      +.++..-..|+.+|..+|..+...+....+.+..++..-...-
T Consensus         5 ~~a~~a~~~w~~~~~~~R~~~L~~~a~~l~~~~~el~~~~~~e   47 (434)
T cd07133           5 ERQKAAFLANPPPSLEERRDRLDRLKALLLDNQDALAEAISAD   47 (434)
T ss_pred             HHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            3456666789999999999999999999999999998865543


No 76 
>cd07085 ALDH_F6_MMSDH Methylmalonate semialdehyde dehydrogenase and ALDH family members 6A1 and 6B2. Methylmalonate semialdehyde dehydrogenase (MMSDH, EC=1.2.1.27) [acylating] from Bacillus subtilis is involved in valine metabolism and catalyses the NAD+- and CoA-dependent oxidation of methylmalonate semialdehyde into propionyl-CoA. Mitochondrial human MMSDH ALDH6A1 and Arabidopsis MMSDH ALDH6B2 are also present in this CD.
Probab=29.75  E-value=1.5e+02  Score=24.86  Aligned_cols=38  Identities=8%  Similarity=-0.027  Sum_probs=31.9

Q ss_pred             HHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy3264         103 KELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKN  140 (156)
Q Consensus       103 k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~  140 (156)
                      +.....|..+|..+|..+...+.....++..++..-..
T Consensus        48 ~~A~~~w~~~~~~~R~~~L~~~a~~l~~~~~el~~~~~   85 (478)
T cd07085          48 KAAFPAWSATPVLKRQQVMFKFRQLLEENLDELARLIT   85 (478)
T ss_pred             HHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            44446799999999999999999999999988877543


No 77 
>cd07132 ALDH_F3AB Aldehyde dehydrogenase family 3 members A1, A2, and B1 and related proteins. NAD(P)+-dependent, aldehyde dehydrogenase, family 3 members A1 and B1  (ALDH3A1, ALDH3B1,  EC=1.2.1.5) and fatty aldehyde dehydrogenase, family 3 member A2 (ALDH3A2, EC=1.2.1.3), and similar sequences are included in this CD. Human ALDH3A1 is a homodimer with a critical role in cellular defense against oxidative stress; it catalyzes the oxidation of various cellular membrane lipid-derived aldehydes. Corneal crystalline ALDH3A1 protects the cornea and underlying lens against UV-induced oxidative stress. Human ALDH3A2, a microsomal homodimer, catalyzes the oxidation of long-chain aliphatic aldehydes to fatty acids. Human ALDH3B1 is highly expressed in the kidney and liver and catalyzes the oxidation of various medium- and long-chain saturated and unsaturated aliphatic aldehydes.
Probab=29.24  E-value=1.5e+02  Score=24.66  Aligned_cols=42  Identities=10%  Similarity=-0.063  Sum_probs=35.2

Q ss_pred             hHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy3264         100 DIAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKNR  141 (156)
Q Consensus       100 e~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~  141 (156)
                      +.++..-..|..++..+|..+...+....+.+..++..-...
T Consensus         5 ~~A~~A~~~w~~~~~~~R~~~L~~~a~~l~~~~~~l~~~~~~   46 (443)
T cd07132           5 RRAREAFSSGKTRPLEFRIQQLEALLRMLEENEDEIVEALAK   46 (443)
T ss_pred             HHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHhHHHHHHHHHH
Confidence            445666678999999999999999999999999998776544


No 78 
>PF11521 TFIIE-A_C-term:  C-terminal general transcription factor TFIIE alpha;  InterPro: IPR021600  TFIIE is compiled of two subunits, alpha and beta. This family of proteins are the C-terminal domain of the alpha subunit of the protein which is the largest subunit and contains several functional domains which are important for basal transcription and cell growth. The C-terminal end of the protein binds directly to the amino-terminal PH domain of p62/Tfb1 (of IIH) which is involved in the recruitment of the general transcription factor IIH to the transcription preinitiation complex. P53 competes for the same binding site as TFIIE alpha which shows their structural similarity. Like p53, TFIIE alpha 336-439 can activate transcription in vivo []. ; PDB: 2RNR_A 2RNQ_A 2JTX_A.
Probab=28.57  E-value=61  Score=20.84  Aligned_cols=29  Identities=28%  Similarity=0.448  Sum_probs=19.5

Q ss_pred             CCchhHH--HHHHHHhccCChHHHHHHHHHHHHH
Q psy3264          96 YGVGDIA--KELGKKWADVDPSVKTKYEQMAEKD  127 (156)
Q Consensus        96 ~~~~e~~--k~~~~~Wk~ls~~eK~~y~~~a~~~  127 (156)
                      .+..+|+  -.|-   ..|+++||..|+......
T Consensus        50 ~~~~eVtq~p~LV---~qMT~~EKEaYi~v~Q~~   80 (86)
T PF11521_consen   50 YPYSEVTQRPELV---AQMTPEEKEAYIQVGQEM   80 (86)
T ss_dssp             EEHHHHHH-HHHH---HHS-HHHHHHHHHHHHHH
T ss_pred             eehhhcCcchHHH---HHcCHHHHHHHHHHHHHH
Confidence            3456666  2333   489999999999987653


No 79 
>TIGR00787 dctP tripartite ATP-independent periplasmic transporter solute receptor, DctP family. TRAP-T (Tripartite ATP-independent Periplasmic Transporter) family proteins generally consist of three components, and these systems have so far been found in Gram-negative bacteria, Gram-postive bacteria and archaea. The best characterized example is the DctPQM system of Rhodobacter capsulatus, a C4 dicarboxylate (malate, fumarate, succinate) transporter. This model represents the DctP family, one of at least three major families of extracytoplasmic solute receptor for TRAP family transporters. Other are the SnoM family (see pfam03480) and TAXI (TRAP-associated extracytoplasmic immunogenic) family.
Probab=28.54  E-value=1.4e+02  Score=22.64  Aligned_cols=29  Identities=14%  Similarity=0.263  Sum_probs=22.3

Q ss_pred             HHHhccCChHHHHHHHHHHHHHHHHHHHH
Q psy3264         106 GKKWADVDPSVKTKYEQMAEKDKARYEKE  134 (156)
Q Consensus       106 ~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e  134 (156)
                      ...|..||++.|....+.+...-......
T Consensus       213 ~~~~~~L~~e~q~~i~~a~~~~~~~~~~~  241 (257)
T TIGR00787       213 KAFWKSLPPDLQAVVKEAAKEAGEYQRKL  241 (257)
T ss_pred             HHHHhcCCHHHHHHHHHHHHHHHHHHHHH
Confidence            46799999999999988877765544443


No 80 
>PRK12751 cpxP periplasmic stress adaptor protein CpxP; Reviewed
Probab=28.00  E-value=1e+02  Score=22.22  Aligned_cols=32  Identities=9%  Similarity=0.264  Sum_probs=25.1

Q ss_pred             HHHHHHHHHhccCCHHHHHHHHHhhhhhhhhh
Q psy3264          40 EFSKKCAERWKTMSEKEKKRFHEMADKDKKRY   71 (156)
Q Consensus        40 ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y   71 (156)
                      +..+..-+++..||+++|..|.+..++.-...
T Consensus       118 ~~~~~~~qmy~lLTPEQra~l~~~~e~r~~~~  149 (162)
T PRK12751        118 EMAKVRNQMYNLLTPEQKEALNKKHQERIEKL  149 (162)
T ss_pred             HHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHH
Confidence            44466678899999999999999887755544


No 81 
>PF02026 RyR:  RyR domain;  InterPro: IPR003032 This domain is called RyR for Ryanodine receptor []. The domain is found in four copies in the ryanodine receptor. The function of this domain is unknown.; PDB: 4ETV_A 3RQR_A 4ETT_A 4ERT_A 4ESU_A 4ETU_A 4ERV_A 3NRT_E.
Probab=27.91  E-value=61  Score=21.06  Aligned_cols=19  Identities=26%  Similarity=0.511  Sum_probs=16.0

Q ss_pred             hccCCHHHHHHHHHhhhhh
Q psy3264          49 WKTMSEKEKKRFHEMADKD   67 (156)
Q Consensus        49 Wk~Ls~~eK~~y~~~a~~~   67 (156)
                      |..||+.+|+.+...+..-
T Consensus        61 y~~L~e~eK~~dr~~~~e~   79 (94)
T PF02026_consen   61 YDELSEEEKEKDRDMVRET   79 (94)
T ss_dssp             GGGS-HHHHHHHHHHHHHH
T ss_pred             hhhCCHHHHHHhHHHHHHH
Confidence            9999999999999988763


No 82 
>cd07087 ALDH_F3-13-14_CALDH-like ALDH subfamily: Coniferyl aldehyde dehydrogenase, ALDH families 3, 13, and 14, and other related proteins. ALDH subfamily which includes NAD(P)+-dependent, aldehyde dehydrogenase, family 3 member A1 and B1  (ALDH3A1, ALDH3B1,  EC=1.2.1.5) and fatty aldehyde dehydrogenase, family 3 member A2 (ALDH3A2, EC=1.2.1.3), and also plant ALDH family members ALDH3F1, ALDH3H1, and ALDH3I1, fungal ALDH14 (YMR110C) and the protozoan family 13 member (ALDH13), as well as coniferyl aldehyde dehydrogenases (CALDH, EC=1.2.1.68), and other similar  sequences, such as the Pseudomonas putida benzaldehyde dehydrogenase I that is involved in the metabolism of mandelate.
Probab=27.88  E-value=1.7e+02  Score=24.22  Aligned_cols=42  Identities=10%  Similarity=-0.102  Sum_probs=34.9

Q ss_pred             hHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy3264         100 DIAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKNR  141 (156)
Q Consensus       100 e~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~  141 (156)
                      +.++..-..|..++..+|..+...+....+++..++..-...
T Consensus         5 ~~a~~a~~~w~~~~~~~R~~~L~~~a~~l~~~~~el~~~~~~   46 (426)
T cd07087           5 ARLRETFLTGKTRSLEWRKAQLKALKRMLTENEEEIAAALYA   46 (426)
T ss_pred             HHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHhHHHHHHHHHH
Confidence            345556678999999999999999999999999998876544


No 83 
>PF05388 Carbpep_Y_N:  Carboxypeptidase Y pro-peptide;  InterPro: IPR008442 This signature is found at the N terminus of carboxypeptidase Y, which belong to MEROPS peptidase family S10. This region contains the signal peptide and pro-peptide regions [,].; GO: 0004185 serine-type carboxypeptidase activity, 0005773 vacuole
Probab=27.88  E-value=1.1e+02  Score=20.66  Aligned_cols=29  Identities=24%  Similarity=0.275  Sum_probs=25.6

Q ss_pred             HHHHHHHHHHHhccCCHHHHHHHHHhhhh
Q psy3264          38 FAEFSKKCAERWKTMSEKEKKRFHEMADK   66 (156)
Q Consensus        38 ~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~   66 (156)
                      +..+++.+++..+.||.+-|..|.++...
T Consensus        45 ~~~~~~~l~e~l~~Lt~e~k~~W~E~~~~   73 (113)
T PF05388_consen   45 LEKISKYLNEPLKSLTSEAKALWDEMMLL   73 (113)
T ss_pred             HHHHHHHHHHHHhhccHHHHHHHHHHHHH
Confidence            46777889999999999999999998865


No 84 
>PF06628 Catalase-rel:  Catalase-related immune-responsive;  InterPro: IPR010582 Catalases (1.11.1.6 from EC) are antioxidant enzymes that catalyse the conversion of hydrogen peroxide to water and molecular oxygen, serving to protect cells from its toxic effects []. Hydrogen peroxide is produced as a consequence of oxidative cellular metabolism and can be converted to the highly reactive hydroxyl radical via transition metals, this radical being able to damage a wide variety of molecules within a cell, leading to oxidative stress and cell death. Catalases act to neutralise hydrogen peroxide toxicity, and are produced by all aerobic organisms ranging from bacteria to man. Most catalases are mono-functional, haem-containing enzymes, although there are also bifunctional haem-containing peroxidase/catalases (IPR000763 from INTERPRO) that are closely related to plant peroxidases, and non-haem, manganese-containing catalases (IPR007760 from INTERPRO) that are found in bacteria []. This entry represents a small conserved region within catalase enzymes that carries the immune-responsive amphipathic octa-peptide that is recognised by T cells [].; PDB: 2CAH_A 1NM0_A 1H7K_A 1E93_A 1H6N_A 3HB6_A 2CAG_A 1M85_A 1MQF_A 1A4E_C ....
Probab=27.47  E-value=55  Score=19.74  Aligned_cols=20  Identities=20%  Similarity=0.521  Sum_probs=16.2

Q ss_pred             HHHHHhccCCHHHHHHHHHh
Q psy3264          44 KCAERWKTMSEKEKKRFHEM   63 (156)
Q Consensus        44 ~l~~~Wk~Ls~~eK~~y~~~   63 (156)
                      ..+..|+.|++.+|+.+..-
T Consensus        12 Qa~~ly~~l~~~er~~lv~n   31 (68)
T PF06628_consen   12 QARDLYRVLSDEERERLVEN   31 (68)
T ss_dssp             HHHHHHHHSSHHHHHHHHHH
T ss_pred             hHHHHHHHCCHHHHHHHHHH
Confidence            44667999999999988853


No 85 
>cd08317 Death_ank Death domain associated with Ankyrins. Death Domain (DD) associated with Ankyrins. Ankyrins are modular proteins comprising three conserved domains, an N-terminal membrane-binding domain containing ANK repeats, a spectrin-binding domain and a C-terminal DD. Ankyrins function as adaptor proteins and they interact, through ANK repeats, with structurally diverse membrane proteins, including ion channels/pumps, calcium release channels, and cell adhesion molecules. They play critical roles in the proper expression and membrane localization of these proteins. In mammals, this family includes ankyrin-R for restricted (or ANK1), ankyrin-B for broadly expressed (or ANK2) and ankyrin-G for general or giant (or ANK3). They are expressed in different combinations in many tissues and play non-overlapping functions. In general, DDs are protein-protein interaction domains found in a variety of domain architectures. Their common feature is that they form homodimers by self-associati
Probab=27.38  E-value=26  Score=22.05  Aligned_cols=21  Identities=24%  Similarity=0.613  Sum_probs=17.1

Q ss_pred             CCCCchhHHHHHHHHhccCCh
Q psy3264          94 PEYGVGDIAKELGKKWADVDP  114 (156)
Q Consensus        94 p~~~~~e~~k~~~~~Wk~ls~  114 (156)
                      +.+.+..|++.||..|+.|-.
T Consensus         3 ~~~~l~~ia~~lG~dW~~LAr   23 (84)
T cd08317           3 ADIRLADISNLLGSDWPQLAR   23 (84)
T ss_pred             ccchHHHHHHHHhhHHHHHHH
Confidence            456788999999999998743


No 86 
>PRK13252 betaine aldehyde dehydrogenase; Provisional
Probab=26.47  E-value=1.8e+02  Score=24.53  Aligned_cols=39  Identities=18%  Similarity=0.191  Sum_probs=33.1

Q ss_pred             HHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy3264         103 KELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKNR  141 (156)
Q Consensus       103 k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~  141 (156)
                      +.....|..+|..+|..+...+......+..++......
T Consensus        54 ~~a~~~w~~~~~~~R~~~L~~~a~~l~~~~~ela~~~~~   92 (488)
T PRK13252         54 KQGQKIWAAMTAMERSRILRRAVDILRERNDELAALETL   92 (488)
T ss_pred             HHHHHHHhcCCHHHHHHHHHHHHHHHHHhHHHHHHHHHH
Confidence            445567999999999999999999999999998876543


No 87 
>cd07150 ALDH_VaniDH_like Pseudomonas putida vanillin dehydrogenase-like. Vanillin dehydrogenase (Vdh, VaniDH) involved in the metabolism of ferulic acid and other related  sequences are included in this CD.  The E. coli vanillin dehydrogenase (LigV) preferred NAD+ to NADP+  and exhibited a broad substrate preference, including vanillin,  benzaldehyde, protocatechualdehyde, m-anisaldehyde, and p-hydroxybenzaldehyde.
Probab=26.45  E-value=1.8e+02  Score=24.10  Aligned_cols=37  Identities=16%  Similarity=0.187  Sum_probs=31.1

Q ss_pred             HHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3264         103 KELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYK  139 (156)
Q Consensus       103 k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~  139 (156)
                      +..-..|..++..+|..+...+....+.+..++..-.
T Consensus        31 ~~A~~~w~~~~~~~R~~~L~~~a~~l~~~~~ela~~~   67 (451)
T cd07150          31 YDAFPAWAATTPSERERILLKAAEIMERRADDLIDLL   67 (451)
T ss_pred             HHHHHHHhcCCHHHHHHHHHHHHHHHHHhHHHHHHHH
Confidence            4445679999999999999999999999988877653


No 88 
>cd07152 ALDH_BenzADH NAD-dependent benzaldehyde dehydrogenase II-like. NAD-dependent, benzaldehyde dehydrogenase II (XylC, BenzADH, EC=1.2.1.28) is involved in the oxidation of benzyl alcohol to benzoate. In Acinetobacter calcoaceticus, this process is carried out by the chromosomally encoded, benzyl alcohol dehydrogenase (xylB) and benzaldehyde dehydrogenase II (xylC) enzymes; whereas in Pseudomonas putida they are encoded by TOL plasmids.
Probab=25.60  E-value=2e+02  Score=23.83  Aligned_cols=40  Identities=18%  Similarity=0.238  Sum_probs=32.9

Q ss_pred             HHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy3264         102 AKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKNR  141 (156)
Q Consensus       102 ~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~  141 (156)
                      ++..-..|..+|.++|..+...+......+..++......
T Consensus        22 A~~A~~~w~~~~~~~R~~~L~~~a~~l~~~~~ela~~~~~   61 (443)
T cd07152          22 AAAAQRAWAATPPRERAAVLRRAADLLEEHADEIADWIVR   61 (443)
T ss_pred             HHHHHHHHhcCCHHHHHHHHHHHHHHHHHhHHHHHHHHHH
Confidence            3444568999999999999999999999999888866443


No 89 
>PRK13968 putative succinate semialdehyde dehydrogenase; Provisional
Probab=25.36  E-value=2e+02  Score=24.13  Aligned_cols=53  Identities=17%  Similarity=0.144  Sum_probs=40.2

Q ss_pred             ccccCCCCCchhHH------HHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy3264          89 IKQDNPEYGVGDIA------KELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKNR  141 (156)
Q Consensus        89 ~~~~~p~~~~~e~~------k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~  141 (156)
                      +....|..+..++.      +..-..|..++..+|..+...+.....++..++......
T Consensus        19 ~i~~~~~~~~~~v~~av~~A~~a~~~w~~~~~~~R~~~L~~~a~~l~~~~~ela~~~~~   77 (462)
T PRK13968         19 QLSVLPWAGADDIENALQLAAAGFRDWRETNIDYRAQKLRDIGKALRARSEEMAQMITR   77 (462)
T ss_pred             EEEEEeCCCHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHhHHHHHHHHHHH
Confidence            44455555555543      444557999999999999999999999999999886543


No 90 
>cd07118 ALDH_SNDH Gluconobacter oxydans L-sorbosone dehydrogenase-like. Included in this CD is the L-sorbosone dehydrogenase (SNDH) from Gluconobacter oxydans UV10. In G. oxydans,  D-sorbitol is converted to 2-keto-L-gulonate (a precursor of L-ascorbic acid) in sequential oxidation steps catalyzed by a FAD-dependent, L-sorbose dehydrogenase and an NAD(P)+-dependent,  L-sorbosone dehydrogenase.
Probab=25.26  E-value=2e+02  Score=24.01  Aligned_cols=51  Identities=8%  Similarity=0.086  Sum_probs=37.2

Q ss_pred             cccCCCCCchhHHHHHHH--------HhccCChHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy3264          90 KQDNPEYGVGDIAKELGK--------KWADVDPSVKTKYEQMAEKDKARYEKEMTAYKN  140 (156)
Q Consensus        90 ~~~~p~~~~~e~~k~~~~--------~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~  140 (156)
                      ....|..+..++...+..        .|..+|-.+|..+...+......+..++..-..
T Consensus        10 i~~~~~~~~~~v~~av~~A~~a~~~~~w~~~~~~~R~~~l~~~a~~l~~~~~~la~~~~   68 (454)
T cd07118          10 VARYAEGTVEDVDAAVAAARKAFDKGPWPRMSGAERAAVLLKVADLIRARRERLALIET   68 (454)
T ss_pred             EEEEeCCCHHHHHHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHHhHHHHHHHHH
Confidence            344455555555433322        299999999999999999999999999877543


No 91 
>PRK10455 periplasmic protein; Reviewed
Probab=25.14  E-value=1.5e+02  Score=21.30  Aligned_cols=27  Identities=19%  Similarity=0.245  Sum_probs=20.5

Q ss_pred             hHHHHHHHHhccCChHHHHHHHHHHHH
Q psy3264         100 DIAKELGKKWADVDPSVKTKYEQMAEK  126 (156)
Q Consensus       100 e~~k~~~~~Wk~ls~~eK~~y~~~a~~  126 (156)
                      ..++.....+.-||+++|+.|.+..++
T Consensus       118 ~~~~~~~qiy~vLTPEQr~q~~~~~ek  144 (161)
T PRK10455        118 AHMETQNKIYNVLTPEQKKQFNANFEK  144 (161)
T ss_pred             HHHHHHHHHHHhCCHHHHHHHHHHHHH
Confidence            344556678999999999999985543


No 92 
>PF00226 DnaJ:  DnaJ domain;  InterPro: IPR001623 The prokaryotic heat shock protein DnaJ interacts with the chaperone hsp70-like DnaK protein []. Structurally, the DnaJ protein consists of an N-terminal conserved domain (called 'J' domain) of about 70 amino acids, a glycine-rich region ('G' domain') of about 30 residues, a central domain containing four repeats of a CXXCXGXG motif ('CRR' domain) and a C-terminal region of 120 to 170 residues. Such a structure is shown in the following schematic representation:  +------------+-+-------+-----+-----------+--------------------------------+ | N-terminal | | Gly-R | | CXXCXGXG | C-terminal | +------------+-+-------+-----+-----------+--------------------------------+   It is thought that the 'J' domain of DnaJ mediates the interaction with the dnaK protein and consists of four helices, the second of which has a charged surface that includes at least one pair of basic residues that are essential for interaction with the ATPase domain of Hsp70. The J- and CRR-domains are found in many prokaryotic and eukaryotic proteins [], either together or separately. In yeast, J-domains have been classified into 3 groups; the class III proteins are functionally distinct and do not appear to act as molecular chaperones []. ; GO: 0031072 heat shock protein binding; PDB: 2GUZ_C 2L6L_A 1HDJ_A 2EJ7_A 1FPO_C 2CUG_A 2QSA_A 2OCH_A 3BVO_B 3APQ_A ....
Probab=25.09  E-value=1.1e+02  Score=17.55  Aligned_cols=38  Identities=24%  Similarity=0.357  Sum_probs=26.1

Q ss_pred             HHHHHHHHHhCCCCcchHH----HHHHHHHHHhccCCHHHHH
Q psy3264          21 QVCREEHKKKHPEENVVFA----EFSKKCAERWKTMSEKEKK   58 (156)
Q Consensus        21 ~e~r~~~~~~~p~~~~~~~----ei~k~l~~~Wk~Ls~~eK~   58 (156)
                      +..+..++.-||+......    +.+..|..-|..|++..+.
T Consensus        19 ~~y~~l~~~~HPD~~~~~~~~~~~~~~~i~~Ay~~L~~~~~R   60 (64)
T PF00226_consen   19 KAYRRLSKQYHPDKNSGDEAEAEEKFARINEAYEILSDPERR   60 (64)
T ss_dssp             HHHHHHHHHTSTTTGTSTHHHHHHHHHHHHHHHHHHHSHHHH
T ss_pred             HHHHhhhhccccccchhhhhhhhHHHHHHHHHHHHhCCHHHH
Confidence            3445556666888643223    7889999999998877663


No 93 
>cd07084 ALDH_KGSADH-like ALDH subfamily: NAD(P)+-dependent alpha-ketoglutaric semialdehyde dehydrogenases and plant delta(1)-pyrroline-5-carboxylate dehydrogenase, ALDH family 12-like. ALDH subfamily which includes the NAD(P)+-dependent, alpha-ketoglutaric semialdehyde dehydrogenases (KGSADH, EC 1.2.1.26); plant delta(1)-pyrroline-5-carboxylate dehydrogenase (P5CDH, EC=1.5.1.12 ), ALDH family 12; the N-terminal domain of the MaoC (monoamine oxidase C) dehydratase regulatory protein; and orthologs of MaoC, PaaZ and PaaN, which are putative ring-opening enzymes of the aerobic phenylacetic acid catabolic pathway.
Probab=24.88  E-value=1.9e+02  Score=24.15  Aligned_cols=42  Identities=10%  Similarity=-0.113  Sum_probs=35.6

Q ss_pred             hHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy3264         100 DIAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKNR  141 (156)
Q Consensus       100 e~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~  141 (156)
                      +.++.....|+.++..+|......+....+.+..++......
T Consensus         6 ~~A~~A~~~W~~~~~~~R~~~L~~~a~~l~~~~~ela~~~~~   47 (442)
T cd07084           6 LAADISTKAARRLALPKRADFLARIIQRLAAKSYDIAAGAVL   47 (442)
T ss_pred             HHHHHHHHHHHhCCHHHHHHHHHHHHHHHHhhHHHHHHHHHH
Confidence            445666678999999999999999999999999998877543


No 94 
>cd07101 ALDH_SSADH2_GabD2 Mycobacterium tuberculosis succinate-semialdehyde dehydrogenase 2-like. Succinate-semialdehyde dehydrogenase 2 (SSADH2) and similar proteins are in this CD. SSADH1 (GabD1, EC=1.2.1.16) catalyzes the NADP(+)-dependent oxidation of succinate semialdehyde to succinate.  SSADH activity in Mycobacterium tuberculosis is encoded by both gabD1 (Rv0234c) and gabD2 (Rv1731), however ,the Vmax of GabD1 was shown to be much higher than that of GabD2, and GabD2 (SSADH2) is likely to serve physiologically as a dehydrogenase for a different aldehyde(s).
Probab=24.87  E-value=2e+02  Score=23.89  Aligned_cols=50  Identities=10%  Similarity=0.078  Sum_probs=37.5

Q ss_pred             cCCCCCchhHH------HHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy3264          92 DNPEYGVGDIA------KELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKNR  141 (156)
Q Consensus        92 ~~p~~~~~e~~------k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~  141 (156)
                      ..+..+..++.      +.....|..+|.++|..+...+......+..++......
T Consensus        11 ~~~~~~~~~v~~av~~A~~a~~~w~~~~~~~R~~~L~~~a~~l~~~~~el~~~~~~   66 (454)
T cd07101          11 ELPQSTPADVEAAFARARAAQRAWAARPFAERAAVFLRFHDLVLERRDELLDLIQL   66 (454)
T ss_pred             EEcCCCHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHhHHHHHHHHHH
Confidence            34444555543      444567999999999999999999999999888776543


No 95 
>PF03480 SBP_bac_7:  Bacterial extracellular solute-binding protein, family 7;  InterPro: IPR018389 This family of proteins are involved in binding extracellular solutes for transport across the bacterial cytoplasmic membrane. This family includes a C4-dicarboxylate-binding protein DctP [, ] and the sialic acid-binding protein SiaP. The structure of the SiaP receptor has revealed an overall topology similar to ATP binding cassette ESR (extracytoplasmic solute receptors) proteins []. Upon binding of sialic acid, SiaP undergoes domain closure about a hinge region and kinking of an alpha-helix hinge component [].; GO: 0006810 transport, 0030288 outer membrane-bounded periplasmic space; PDB: 2HZK_C 2HZL_B 2HPG_C 2XWI_A 2XWK_A 2WX9_A 2CEY_A 2WYP_A 3B50_A 2CEX_B ....
Probab=24.87  E-value=1.5e+02  Score=22.73  Aligned_cols=32  Identities=16%  Similarity=0.262  Sum_probs=23.5

Q ss_pred             HHHhccCChHHHHHHHHHHHHHHHHHHHHHHH
Q psy3264         106 GKKWADVDPSVKTKYEQMAEKDKARYEKEMTA  137 (156)
Q Consensus       106 ~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~  137 (156)
                      ...|.+||++.|+...+.+...-.........
T Consensus       213 ~~~w~~L~~e~q~~l~~~~~~~~~~~~~~~~~  244 (286)
T PF03480_consen  213 KDWWDSLPDEDQEALDDAADEAEARAREYYEA  244 (286)
T ss_dssp             HHHHHHS-HHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred             HHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            36799999999999999888776555544443


No 96 
>PRK11241 gabD succinate-semialdehyde dehydrogenase I; Provisional
Probab=24.72  E-value=2e+02  Score=24.32  Aligned_cols=53  Identities=6%  Similarity=0.056  Sum_probs=38.3

Q ss_pred             ccccCCCCCchhHH------HHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy3264          89 IKQDNPEYGVGDIA------KELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKNR  141 (156)
Q Consensus        89 ~~~~~p~~~~~e~~------k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~  141 (156)
                      .....+..+..++.      +..-..|..+|..+|..+...+....+.+..++..-...
T Consensus        38 ~v~~~~~~~~~~v~~av~~A~~a~~~W~~~~~~~R~~~L~~~a~~l~~~~~ela~~~~~   96 (482)
T PRK11241         38 KLGSVPKMGADETRAAIDAANRALPAWRALTAKERANILRRWFNLMMEHQDDLARLMTL   96 (482)
T ss_pred             EEEEEeCCCHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHhHHHHHHHHHH
Confidence            33344444555544      333346999999999999999999999999998876543


No 97 
>cd07077 ALDH-like NAD(P)+-dependent aldehyde dehydrogenase-like (ALDH-like) family. The aldehyde dehydrogenase-like (ALDH-like) group of the ALDH superfamily of NAD(P)+-dependent enzymes which, in general, oxidize a wide range of  endogenous and exogenous aliphatic and aromatic aldehydes to their corresponding carboxylic acids and play an  important role in detoxification. This group includes families ALDH18, ALDH19, and ALDH20 and represents such proteins as gamma-glutamyl phosphate reductase, LuxC-like acyl-CoA reductase, and coenzyme A acylating aldehyde dehydrogenase.  All of these proteins have a conserved cysteine that aligns with the catalytic cysteine of the ALDH group.
Probab=24.26  E-value=1.9e+02  Score=23.61  Aligned_cols=39  Identities=8%  Similarity=-0.049  Sum_probs=32.8

Q ss_pred             HHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy3264         102 AKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKN  140 (156)
Q Consensus       102 ~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~  140 (156)
                      ++.....|..+|..+|..+........+++..++..-..
T Consensus         3 A~~a~~~w~~~~~~~R~~~L~~~a~~l~~~~~~la~~~~   41 (397)
T cd07077           3 AKNAQRTLAVNHDEQRDLIINAIANALYDTRQRLASEAV   41 (397)
T ss_pred             HHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            455667899999999999999999999999988776443


No 98 
>cd07099 ALDH_DDALDH Methylomonas sp. 4,4'-diapolycopene-dialdehyde dehydrogenase-like. The 4,4'-diapolycopene-dialdehyde dehydrogenase (DDALDH) involved in C30 carotenoid synthesis in Methylomonas sp. strain 16a and other similar sequences are present in this CD. DDALDH converts 4,4'-diapolycopene-dialdehyde into 4,4'-diapolycopene-diacid.
Probab=24.16  E-value=2.2e+02  Score=23.60  Aligned_cols=38  Identities=13%  Similarity=0.165  Sum_probs=31.2

Q ss_pred             HHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3264         102 AKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYK  139 (156)
Q Consensus       102 ~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~  139 (156)
                      ++.....|..++..+|..+...+......+..++..-.
T Consensus        27 a~~a~~~w~~~~~~~R~~~L~~~a~~l~~~~~~la~~~   64 (453)
T cd07099          27 ARAAQRAWAALGVEGRAQRLLRWKRALADHADELAELL   64 (453)
T ss_pred             HHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            34445679999999999999999999988888877643


No 99 
>cd07108 ALDH_MGR_2402 Magnetospirillum NAD(P)+-dependent aldehyde dehydrogenase MSR-1-like. NAD(P)+-dependent aldehyde dehydrogenase of Magnetospirillum gryphiswaldense MSR-1 (MGR_2402) , and other similar sequences, are present in this CD.
Probab=24.01  E-value=2.2e+02  Score=23.71  Aligned_cols=39  Identities=13%  Similarity=0.083  Sum_probs=32.9

Q ss_pred             HHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy3264         103 KELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKNR  141 (156)
Q Consensus       103 k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~  141 (156)
                      +.....|..+|..+|..+...+......+..++......
T Consensus        29 ~~a~~~w~~~~~~~R~~~L~~~a~~l~~~~~ela~~~~~   67 (457)
T cd07108          29 KAAFPEWAATPARERGKLLARIADALEARSEELARLLAL   67 (457)
T ss_pred             HHHHHHHhcCCHHHHHHHHHHHHHHHHHhHHHHHHHHHH
Confidence            445567999999999999999999999999998776544


No 100
>PRK00197 proA gamma-glutamyl phosphate reductase; Provisional
Probab=23.88  E-value=2.2e+02  Score=23.48  Aligned_cols=40  Identities=18%  Similarity=0.106  Sum_probs=34.2

Q ss_pred             HHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy3264         101 IAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKN  140 (156)
Q Consensus       101 ~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~  140 (156)
                      .++..-..|..+|..+|..+...+....+.+..++.....
T Consensus        12 ~A~~a~~~w~~~~~~~R~~~L~~~a~~l~~~~~~la~~~~   51 (417)
T PRK00197         12 RAKAASRKLAQLSTAQKNRALLAIADALEANAAEILAANA   51 (417)
T ss_pred             HHHHHHHHHHhCCHHHHHHHHHHHHHHHHhhHHHHHHHHH
Confidence            3455567899999999999999999999999999888654


No 101
>TIGR01804 BADH glycine betaine aldehyde dehydrogenase. Betaine aldehyde dehydrogenase is a member of the aldehyde dehydrogenase family (pfam00171).
Probab=23.69  E-value=2.2e+02  Score=23.83  Aligned_cols=38  Identities=11%  Similarity=0.255  Sum_probs=32.5

Q ss_pred             HHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy3264         103 KELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKN  140 (156)
Q Consensus       103 k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~  140 (156)
                      +.....|..+|..+|..+...+......+..++.....
T Consensus        45 ~~a~~~w~~~~~~~R~~~L~~~a~~l~~~~~~la~~~~   82 (467)
T TIGR01804        45 RRAQGEWASMTPEERGRILRRIAELIRERNEELAKLET   82 (467)
T ss_pred             HHHHHHHhcCCHHHHHHHHHHHHHHHHHhHHHHHHHHH
Confidence            44446799999999999999999999999999887643


No 102
>PF07813 LTXXQ:  LTXXQ motif family protein;  InterPro: IPR012899 This five residue motif is found in a number of bacterial proteins bearing similarity to the protein CpxP (P32158 from SWISSPROT). This is a periplasmic protein that aids in combating extracytoplasmic protein-mediated toxicity, and may also be involved in the response to alkaline pH []. Another member of this family, Spy (P77754 from SWISSPROT) is also a periplasmic protein that may be involved in the response to stress []. The homology between CpxP and Spy may indicate that these two proteins are functionally related []. The motif is found repeated twice in many members of this entry. ; GO: 0042597 periplasmic space; PDB: 3ITF_B 3QZC_B 3OEO_D 3O39_A.
Probab=23.31  E-value=1.3e+02  Score=18.79  Aligned_cols=24  Identities=29%  Similarity=0.441  Sum_probs=18.1

Q ss_pred             hHHHHHHHHhccCChHHHHHHHHH
Q psy3264         100 DIAKELGKKWADVDPSVKTKYEQM  123 (156)
Q Consensus       100 e~~k~~~~~Wk~ls~~eK~~y~~~  123 (156)
                      .........+..||+++|..|..+
T Consensus        76 ~~~~~~~~~~~vLt~eQk~~~~~l   99 (100)
T PF07813_consen   76 ERAKAQHALYAVLTPEQKEKFDQL   99 (100)
T ss_dssp             HHHHHHHHHHTTS-HHHHHHHHHH
T ss_pred             HHHHHHHHHHhcCCHHHHHHHHHh
Confidence            355667778999999999998764


No 103
>PTZ00037 DnaJ_C chaperone protein; Provisional
Probab=23.22  E-value=1.8e+02  Score=24.32  Aligned_cols=43  Identities=21%  Similarity=0.317  Sum_probs=32.2

Q ss_pred             HHHHHHHHHhCCCCcchHHHHHHHHHHHhccCCHHHHHHHHHhh
Q psy3264          21 QVCREEHKKKHPEENVVFAEFSKKCAERWKTMSEKEKKRFHEMA   64 (156)
Q Consensus        21 ~e~r~~~~~~~p~~~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a   64 (156)
                      +..|...++-||+...+ .+.++.|.+.|..||+.+|....+..
T Consensus        47 kAYrkla~k~HPDk~~~-~e~F~~i~~AYevLsD~~kR~~YD~~   89 (421)
T PTZ00037         47 KAYRKLAIKHHPDKGGD-PEKFKEISRAYEVLSDPEKRKIYDEY   89 (421)
T ss_pred             HHHHHHHHHHCCCCCch-HHHHHHHHHHHHHhccHHHHHHHhhh
Confidence            34455666779997543 58889999999999999877666653


No 104
>cd07131 ALDH_AldH-CAJ73105 Uncharacterized Candidatus kuenenia aldehyde dehydrogenase AldH (CAJ73105)-like. Uncharacterized aldehyde dehydrogenase of Candidatus kuenenia AldH (locus CAJ73105) and similar sequences with similarity to alpha-aminoadipic semialdehyde dehydrogenase (AASADH, human ALDH7A1, EC=1.2.1.31), Arabidopsis ALDH7B4, and Streptomyces clavuligerus delta-1-piperideine-6-carboxylate dehydrogenase (P6CDH) are included in this CD.
Probab=23.16  E-value=2.3e+02  Score=23.70  Aligned_cols=37  Identities=16%  Similarity=0.239  Sum_probs=31.2

Q ss_pred             HHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3264         103 KELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYK  139 (156)
Q Consensus       103 k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~  139 (156)
                      +.....|..++.++|..+...+.....++..++..-.
T Consensus        47 ~~A~~~w~~~~~~~R~~~l~~~a~~l~~~~~ela~~~   83 (478)
T cd07131          47 REAFPEWRKVPAPRRAEYLFRAAELLKKRKEELARLV   83 (478)
T ss_pred             HHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            4445579999999999999999999999998887654


No 105
>cd07098 ALDH_F15-22 Aldehyde dehydrogenase family 15A1 and 22A1-like. Aldehyde dehydrogenase family members ALDH15A1 (Saccharomyces cerevisiae YHR039C) and ALDH22A1 (Arabidopsis thaliana, EC=1.2.1.3), and similar sequences, are in this CD. Significant improvement of stress tolerance in tobacco plants was observed by overexpressing the ALDH22A1 gene from maize (Zea mays) and was accompanied by a reduction of malondialdehyde  derived from cellular lipid peroxidation.
Probab=23.16  E-value=2.5e+02  Score=23.41  Aligned_cols=37  Identities=14%  Similarity=0.162  Sum_probs=30.8

Q ss_pred             HHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3264         103 KELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYK  139 (156)
Q Consensus       103 k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~  139 (156)
                      +.....|..+|..+|..+...+.....++..++....
T Consensus        28 ~~A~~~w~~~~~~~R~~~L~~~a~~l~~~~~ela~~~   64 (465)
T cd07098          28 RAAQREWAKTSFAERRKVLRSLLKYILENQEEICRVA   64 (465)
T ss_pred             HHHHHHhccCCHHHHHHHHHHHHHHHHHhHHHHHHHH
Confidence            4445679999999999999999999988888877553


No 106
>cd07137 ALDH_F3FHI Plant aldehyde dehydrogenase family 3 members F1, H1, and I1 and related proteins. Aldehyde dehydrogenase family members 3F1, 3H1, and 3I1 (ALDH3F1, ALDH3H1, and ALDH3I1), and similar plant sequences, are in this CD.  In Arabidopsis thaliana, stress-regulated expression of ALDH3I1  was observed in  leaves and osmotic stress expression of  ALDH3H1 was observed in root tissue, whereas, ALDH3F1 expression was not stress responsive. Functional analysis of ALDH3I1 suggest it may be involved in a detoxification pathway in plants that limits aldehyde accumulation and oxidative stress.
Probab=22.97  E-value=2.4e+02  Score=23.46  Aligned_cols=42  Identities=7%  Similarity=-0.114  Sum_probs=34.8

Q ss_pred             hHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy3264         100 DIAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKNR  141 (156)
Q Consensus       100 e~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~  141 (156)
                      +.++..-..|..++..+|..+...+.....++..++..-...
T Consensus         6 ~~a~~a~~~w~~~~~~~R~~~L~~~a~~l~~~~~~l~~~~~~   47 (432)
T cd07137           6 RELRETFRSGRTRSAEWRKSQLKGLLRLVDENEDDIFAALRQ   47 (432)
T ss_pred             HHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            345666678999999999999999999999999998765443


No 107
>cd07147 ALDH_F21_RNP123 Aldehyde dehydrogenase family 21A1-like. Aldehyde dehydrogenase ALDH21A1 (gene name RNP123) was first described in the moss Tortula ruralis and is believed to play an important role in the detoxification of aldehydes generated in response to desiccation- and salinity-stress, and ALDH21A1 expression represents a unique stress tolerance mechanism. So far, of plants, only the bryophyte sequence has been observed, but similar protein sequences from bacteria and archaea are also present in this CD.
Probab=22.80  E-value=2.4e+02  Score=23.40  Aligned_cols=39  Identities=5%  Similarity=-0.135  Sum_probs=32.0

Q ss_pred             HHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy3264         102 AKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKN  140 (156)
Q Consensus       102 ~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~  140 (156)
                      ++.....|..+|..+|......+......+..++..-..
T Consensus        30 A~~A~~~w~~~~~~~R~~~L~~~a~~l~~~~~~la~~~~   68 (452)
T cd07147          30 AVKAFRPMRALPAHRRAAILLHCVARLEERFEELAETIV   68 (452)
T ss_pred             HHHHHHHHhcCCHHHHHHHHHHHHHHHHHhHHHHHHHHH
Confidence            344556799999999999999999999999988876543


No 108
>PF14098 SSPI:  Small, acid-soluble spore protein I
Probab=22.71  E-value=1e+02  Score=18.78  Aligned_cols=19  Identities=26%  Similarity=0.426  Sum_probs=14.9

Q ss_pred             HHHHhccCCHHHHHHHHHh
Q psy3264          45 CAERWKTMSEKEKKRFHEM   63 (156)
Q Consensus        45 l~~~Wk~Ls~~eK~~y~~~   63 (156)
                      .-..|+.+++.+|....+.
T Consensus        42 FE~~W~~~~~~ek~~m~~~   60 (65)
T PF14098_consen   42 FEVIWKNSDESEKQEMVNT   60 (65)
T ss_pred             HHHHHHhCCHHHHHHHHHH
Confidence            3346999999999887654


No 109
>cd08332 CARD_CASP2 Caspase activation and recruitment domain of Caspase-2. Caspase activation and recruitment domain (CARD) similar to that found in caspase-2. Caspases are aspartate-specific cysteine proteases with functions in apoptosis and immune signaling. Caspase-2 (also known as ICH1, NEDD2, or CASP2) is one of the most evolutionarily conserved caspases, and plays a role in apoptosis, DNA damage response, cell cycle regulation, and tumor suppression. It is localized in the nucleus and exhibits properties of both an initiator and an effector caspase. In general, CARDs are death domains (DDs) found associated with caspases. They are known to be important in the signaling pathways for apoptosis, inflammation, and host-defense mechanisms. DDs are protein-protein interaction domains found in a variety of domain architectures. Their common feature is that they form homodimers by self-association or heterodimers by associating with other members of the DD superfamily including PYRIN and
Probab=22.70  E-value=40  Score=21.65  Aligned_cols=17  Identities=29%  Similarity=0.542  Sum_probs=14.5

Q ss_pred             CCCCCcHHHHHHHHHHH
Q psy3264           9 PRGRMTAYAFFVQVCRE   25 (156)
Q Consensus         9 PkrP~say~lF~~e~r~   25 (156)
                      |+|...||..||..-+.
T Consensus        61 ~~RG~~AF~~F~~aL~~   77 (90)
T cd08332          61 PKRGPRAFSAFCEALRE   77 (90)
T ss_pred             HHhChhHHHHHHHHHHh
Confidence            78888999999987764


No 110
>cd07106 ALDH_AldA-AAD23400 Streptomyces aureofaciens putative aldehyde dehydrogenase AldA (AAD23400)-like. Putative aldehyde dehydrogenase, AldA, from Streptomyces aureofaciens (locus AAD23400) and other similar sequences are present in this CD.
Probab=22.61  E-value=2.4e+02  Score=23.32  Aligned_cols=38  Identities=8%  Similarity=0.016  Sum_probs=32.4

Q ss_pred             HHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy3264         103 KELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKN  140 (156)
Q Consensus       103 k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~  140 (156)
                      +.....|..+|..+|..+...+......+..++.....
T Consensus        29 ~~a~~~w~~~~~~~R~~~L~~~a~~l~~~~~ela~~~~   66 (446)
T cd07106          29 KAAFPGWSATPLEERRAALLAIADAIEANAEELARLLT   66 (446)
T ss_pred             HHHHHHHhcCCHHHHHHHHHHHHHHHHHhHHHHHHHHH
Confidence            44456799999999999999999999999999887754


No 111
>cd07559 ALDH_ACDHII_AcoD-like Ralstonia eutrophus NAD+-dependent acetaldehyde dehydrogenase II and Staphylococcus aureus AldA1 (SACOL0154)-like. Included in this CD is the NAD+-dependent, acetaldehyde dehydrogenase II (AcDHII, AcoD, EC=1.2.1.3) from Ralstonia (Alcaligenes) eutrophus H16 involved in the catabolism of acetoin and ethanol, and similar proteins, such as, the dimeric dihydrolipoamide dehydrogenase of the acetoin dehydrogenase enzyme system of Klebsiella pneumonia. Also included are sequences similar to the NAD+-dependent chloroacetaldehyde dehydrogenases (AldA and AldB) of Xanthobacter autotrophicus GJ10 which are involved in the degradation of 1,2-dichloroethane, as well as, the uncharacterized aldehyde dehydrogenase from Staphylococcus aureus (AldA1, locus SACOL0154) and other similar sequences.
Probab=22.59  E-value=2.4e+02  Score=23.75  Aligned_cols=38  Identities=8%  Similarity=0.081  Sum_probs=32.1

Q ss_pred             HHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy3264         103 KELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKN  140 (156)
Q Consensus       103 k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~  140 (156)
                      +.....|..++..+|..+...+......+..++..-..
T Consensus        48 ~~A~~~w~~~~~~~R~~~L~~~a~~l~~~~~~la~~~~   85 (480)
T cd07559          48 HEAFKTWGKTSVAERANILNKIADRIEENLELLAVAET   85 (480)
T ss_pred             HHHHHHHhcCCHHHHHHHHHHHHHHHHHhHHHHHHHHH
Confidence            44455799999999999999999999999988877644


No 112
>PLN02174 aldehyde dehydrogenase family 3 member H1
Probab=22.52  E-value=2.4e+02  Score=24.05  Aligned_cols=43  Identities=7%  Similarity=-0.188  Sum_probs=35.4

Q ss_pred             HHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHhcCC
Q psy3264         101 IAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKNRSV  143 (156)
Q Consensus       101 ~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~~~  143 (156)
                      .++..-..|..+|..+|..+...+....+++..++........
T Consensus        18 ~a~~af~~w~~~~~~~R~~~L~~~a~~l~~~~~ela~~l~~e~   60 (484)
T PLN02174         18 ELRRSFDDGVTRGYEWRVTQLKKLMIICDNHEPEIVAALRDDL   60 (484)
T ss_pred             HHHHHHHhcCCCCHHHHHHHHHHHHHHHHHhHHHHHHHHHHHc
Confidence            3445556799999999999999999999999999988755443


No 113
>cd07100 ALDH_SSADH1_GabD1 Mycobacterium tuberculosis succinate-semialdehyde dehydrogenase 1-like. Succinate-semialdehyde dehydrogenase 1 (SSADH1, GabD1, EC=1.2.1.16) catalyzes the NADP(+)-dependent oxidation of succinate semialdehyde (SSA)  to succinate.  SSADH activity in Mycobacterium tuberculosis (Mtb) is encoded by both gabD1 (Rv0234c) and gabD2 (Rv1731).  The Mtb GabD1 SSADH1 reportedly is an enzyme of the gamma-aminobutyrate shunt, which forms a functional link between two TCA half-cycles by converting alpha-ketoglutarate to succinate.
Probab=22.49  E-value=2.5e+02  Score=23.21  Aligned_cols=39  Identities=8%  Similarity=-0.005  Sum_probs=31.7

Q ss_pred             HHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3264         101 IAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYK  139 (156)
Q Consensus       101 ~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~  139 (156)
                      .++..-..|..++..+|..+...+....+.+..++....
T Consensus         7 ~A~~A~~~W~~~~~~~R~~~L~~~a~~l~~~~~~la~~~   45 (429)
T cd07100           7 RAHAAFLAWRKTSFAERAALLRKLADLLRERKDELARLI   45 (429)
T ss_pred             HHHHHHHHHhcCCHHHHHHHHHHHHHHHHhHHHHHHHHH
Confidence            345556789999999999999999999888888876543


No 114
>cd07086 ALDH_F7_AASADH-like NAD+-dependent alpha-aminoadipic semialdehyde dehydrogenase and related proteins. ALDH subfamily which includes the NAD+-dependent, alpha-aminoadipic semialdehyde dehydrogenase (AASADH, EC=1.2.1.31), also known as Antiquitin-1, ALDH7A1, ALDH7B or delta-1-piperideine-6-carboxylate dehydrogenase (P6CDH), and other similar sequences, such as the uncharacterized aldehyde dehydrogenase of Candidatus kuenenia AldH (locus CAJ73105).
Probab=22.17  E-value=2.4e+02  Score=23.68  Aligned_cols=38  Identities=11%  Similarity=0.181  Sum_probs=31.8

Q ss_pred             HHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy3264         103 KELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKN  140 (156)
Q Consensus       103 k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~  140 (156)
                      +.....|..++.++|..+...+......+..++..-..
T Consensus        45 ~~A~~~w~~~~~~~R~~~L~~~a~~l~~~~~ela~~~~   82 (478)
T cd07086          45 REAFKEWRKVPAPRRGEIVRQIGEALRKKKEALGRLVS   82 (478)
T ss_pred             HHHHHHHhcCCHHHHHHHHHHHHHHHHHhHHHHHHHHH
Confidence            44445799999999999999999999999988877643


No 115
>cd07104 ALDH_BenzADH-like ALDH subfamily: NAD(P)+-dependent benzaldehyde dehydrogenase II, vanillin dehydrogenase, p-hydroxybenzaldehyde dehydrogenase and related proteins. ALDH subfamily which includes the NAD(P)+-dependent, benzaldehyde dehydrogenase II (XylC, BenzADH, EC=1.2.1.28)  involved in the oxidation of benzyl alcohol to benzoate; p-hydroxybenzaldehyde dehydrogenase (PchA, HBenzADH) which catalyzes the oxidation of p-hydroxybenzaldehyde to p-hydroxybenzoic acid; vanillin dehydrogenase (Vdh, VaniDH) involved in the metabolism of ferulic acid as seen in Pseudomonas putida KT2440; and other related sequences.
Probab=22.14  E-value=2.5e+02  Score=22.98  Aligned_cols=39  Identities=18%  Similarity=0.244  Sum_probs=32.6

Q ss_pred             HHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3264         101 IAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYK  139 (156)
Q Consensus       101 ~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~  139 (156)
                      .++..-..|+.++..+|..+...+......+..++..-.
T Consensus         8 ~A~~A~~~w~~~~~~~R~~~L~~~a~~l~~~~~ela~~~   46 (431)
T cd07104           8 AAAAAQKAWAATPPQERAAILRKAAEILEERRDEIADWL   46 (431)
T ss_pred             HHHHHHHHHhcCCHHHHHHHHHHHHHHHHHhHHHHHHHH
Confidence            445566789999999999999999999888888887743


No 116
>TIGR01780 SSADH succinate-semialdehyde dehydrogenase. SSADH enzyme belongs to the aldehyde dehydrogenase family (pfam00171), sharing a common evolutionary origin and enzymatic mechanism with lactaldehyde dehydrogenase. Like in lactaldehyde dehydrogenase and succinate semialdehyde dehydrogenase, the mammalian catalytic glutamic acid and cysteine residues are conserved in all the enzymes of this family (PS00687, PS00070).
Probab=22.09  E-value=2.5e+02  Score=23.35  Aligned_cols=40  Identities=10%  Similarity=0.066  Sum_probs=33.1

Q ss_pred             HHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy3264         102 AKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKNR  141 (156)
Q Consensus       102 ~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~  141 (156)
                      ++.....|..++.++|..+...+....+.+..++......
T Consensus        28 A~~a~~~w~~~~~~~R~~~L~~~a~~l~~~~~~la~~~~~   67 (448)
T TIGR01780        28 AYEAFKTWKNTTAKERSSLLRKWYNLMMENKDDLARLITL   67 (448)
T ss_pred             HHHHHHHHhcCCHHHHHHHHHHHHHHHHHhHHHHHHHHHH
Confidence            3455567999999999999999999999999988776543


No 117
>cd07142 ALDH_F2BC Arabidosis aldehyde dehydrogenase family 2 B4, B7, C4-like. Included in this CD is the Arabidosis aldehyde dehydrogenase family 2 members B4 and B7 (EC=1.2.1.3),  which are mitochondrial homotetramers that oxidize acetaldehyde and glycolaldehyde, but not L-lactaldehyde. Also in this group, is the Arabidosis cytosolic, homotetramer ALDH2C4 (EC=1.2.1.3), an enzyme involved in the oxidation of sinapalehyde and coniferaldehyde.
Probab=22.06  E-value=2.6e+02  Score=23.51  Aligned_cols=53  Identities=9%  Similarity=0.120  Sum_probs=38.9

Q ss_pred             cccCCCCCchhHHHHHHH--------HhccCChHHHHHHHHHHHHHHHHHHHHHHHHHhcC
Q psy3264          90 KQDNPEYGVGDIAKELGK--------KWADVDPSVKTKYEQMAEKDKARYEKEMTAYKNRS  142 (156)
Q Consensus        90 ~~~~p~~~~~e~~k~~~~--------~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~~  142 (156)
                      ....+..+..++...+..        .|..+|..+|..+...+......+..++.......
T Consensus        32 i~~v~~~~~~~v~~av~~A~~a~~~~~w~~~~~~~R~~~L~~~a~~l~~~~~~la~~~~~e   92 (476)
T cd07142          32 IAHVAEGDAEDVDRAVKAARKAFDEGPWPRMTGYERSRILLRFADLLEKHADELAALETWD   92 (476)
T ss_pred             EEEEcCCCHHHHHHHHHHHHHhcCCCchhhCCHHHHHHHHHHHHHHHHHhHHHHHHHHHHh
Confidence            334455555665544432        29999999999999999999999999988765443


No 118
>TIGR01238 D1pyr5carbox3 delta-1-pyrroline-5-carboxylate dehydrogenase (PutA C-terminal domain). This model represents one of several related branches of delta-1-pyrroline-5-carboxylate dehydrogenase. Members of this branch are the C-terminal domain of the PutA bifunctional proline dehydrogenase / delta-1-pyrroline-5-carboxylate dehydrogenase.
Probab=22.02  E-value=2.5e+02  Score=23.92  Aligned_cols=52  Identities=8%  Similarity=-0.031  Sum_probs=38.0

Q ss_pred             ccccCCCCCchhHH------HHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy3264          89 IKQDNPEYGVGDIA------KELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKN  140 (156)
Q Consensus        89 ~~~~~p~~~~~e~~------k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~  140 (156)
                      .....+..+..++.      +.....|..+|..+|..+...+......+..++..-..
T Consensus        64 ~i~~~~~~~~~dv~~av~~A~~a~~~W~~~~~~~R~~~L~~~a~~l~~~~~ela~~~~  121 (500)
T TIGR01238        64 IVGQVFHANLAHVQAAIDSAQQAFPTWNATPAKERAAKLDRLADLLELHMPELMALCV  121 (500)
T ss_pred             EEEEEeCCCHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHhHHHHHHHHHH
Confidence            33444444544544      44456799999999999999999999999988866443


No 119
>cd07151 ALDH_HBenzADH NADP+-dependent p-hydroxybenzaldehyde dehydrogenase-like. NADP+-dependent, p-hydroxybenzaldehyde dehydrogenase (PchA, HBenzADH) which catalyzes oxidation of p-hydroxybenzaldehyde to p-hydroxybenzoic acid and other related sequences are included in this CD.
Probab=22.00  E-value=2.5e+02  Score=23.49  Aligned_cols=38  Identities=16%  Similarity=0.213  Sum_probs=31.7

Q ss_pred             HHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy3264         103 KELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKN  140 (156)
Q Consensus       103 k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~  140 (156)
                      +.....|..++..+|..+...+......+..++..-..
T Consensus        42 ~~A~~~w~~~~~~~R~~~L~~~a~~l~~~~~ela~~~~   79 (465)
T cd07151          42 AAAQKEWAATLPQERAEILEKAAQILEERRDEIVEWLI   79 (465)
T ss_pred             HHHHHHHhcCCHHHHHHHHHHHHHHHHHhHHHHHHHHH
Confidence            44456799999999999999999999998888876543


No 120
>PF08367 M16C_assoc:  Peptidase M16C associated;  InterPro: IPR013578 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold:  Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases.   In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding.  Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This domain appears in eukaryotes as well as bacteria and tends to be found near the C terminus of metalloproteases and related sequences belonging to MEROPS peptidase family M16 (subfamily M16C, clan ME). These include: eupitrilysin, falcilysin, PreP peptidase, CYM1 peptidase and subfamily M16C non-peptidase homologues.; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis; PDB: 2FGE_B 3S5I_A 3S5H_A 3S5M_A 3S5K_A.
Probab=21.94  E-value=1.9e+02  Score=21.95  Aligned_cols=32  Identities=19%  Similarity=0.140  Sum_probs=26.3

Q ss_pred             hhHHHHHHHHhccCChHHHHHHHHHHHHHHHH
Q psy3264          99 GDIAKELGKKWADVDPSVKTKYEQMAEKDKAR  130 (156)
Q Consensus        99 ~e~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~  130 (156)
                      .+..+.|.+...+||+++++.-++.+.+.++.
T Consensus        13 ~~e~~~L~~~k~~Ls~~e~~~i~~~~~~L~~~   44 (248)
T PF08367_consen   13 EEEKEKLAAYKASLSEEEKEKIIEQTKELKER   44 (248)
T ss_dssp             HHHHHHHHHHHHCS-HHHHHHHHHHHHHHHHH
T ss_pred             HHHHHHHHHHHhhCCHHHHHHHHHHHHHHHHH
Confidence            45678899999999999999999988886554


No 121
>cd07129 ALDH_KGSADH Alpha-Ketoglutaric Semialdehyde Dehydrogenase. Alpha-Ketoglutaric Semialdehyde (KGSA) Dehydrogenase (KGSADH, EC 1.2.1.26) catalyzes the NAD(P)+-dependent conversion of KGSA to alpha-ketoglutarate. This CD contains such sequences as those seen in Azospirillum brasilense, KGSADH-II (D-glucarate/D-galactarate-inducible) and KGSADH-III (hydroxy-L-proline-inducible). Both show similar high substrate specificity for KGSA and different coenzyme specificity; KGSADH-II is NAD+-dependent and KGSADH-III is NADP+-dependent. Also included in this CD is the NADP(+)-dependent aldehyde dehydrogenase from Vibrio harveyi which catalyzes the oxidation of long-chain aliphatic aldehydes to acids.
Probab=21.87  E-value=2.4e+02  Score=23.52  Aligned_cols=39  Identities=10%  Similarity=0.177  Sum_probs=32.8

Q ss_pred             HHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3264         101 IAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYK  139 (156)
Q Consensus       101 ~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~  139 (156)
                      .++.....|..++..+|..+...+......+..++..-.
T Consensus         7 ~A~~A~~~w~~~~~~~R~~~L~~~a~~l~~~~~ela~~~   45 (454)
T cd07129           7 AAAAAFESYRALSPARRAAFLEAIADEIEALGDELVARA   45 (454)
T ss_pred             HHHHHHHHHhhCCHHHHHHHHHHHHHHHHHhHHHHHHHH
Confidence            445566789999999999999999999999988887653


No 122
>PRK13473 gamma-aminobutyraldehyde dehydrogenase; Provisional
Probab=21.84  E-value=2.6e+02  Score=23.44  Aligned_cols=38  Identities=8%  Similarity=0.150  Sum_probs=32.0

Q ss_pred             HHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy3264         103 KELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKN  140 (156)
Q Consensus       103 k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~  140 (156)
                      +.....|..+|.++|..+...+......+..++.....
T Consensus        49 ~~a~~~w~~~~~~~R~~~L~~~a~~l~~~~~~la~~~~   86 (475)
T PRK13473         49 DAAFPEWSQTTPKERAEALLKLADAIEENADEFARLES   86 (475)
T ss_pred             HHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            44446799999999999999999999999988887644


No 123
>cd07130 ALDH_F7_AASADH NAD+-dependent alpha-aminoadipic semialdehyde dehydrogenase, ALDH family members 7A1 and 7B. Alpha-aminoadipic semialdehyde dehydrogenase (AASADH, EC=1.2.1.31), also known as ALDH7A1, Antiquitin-1, ALDH7B, or delta-1-piperideine-6-carboxylate dehydrogenase (P6CDH), is a NAD+-dependent ALDH. Human ALDH7A1 is involved in the pipecolic acid pathway of lysine catabolism, catalyzing the oxidation of alpha-aminoadipic semialdehyde to alpha-aminoadipate.  Arabidopsis thaliana ALDH7B4 appears to be an osmotic-stress-inducible ALDH gene encoding a turgor-responsive or stress-inducible ALDH. The Streptomyces clavuligerus P6CDH appears to be involved in cephamycin biosynthesis, catalyzing the second stage of the two-step conversion of lysine to alpha-aminoadipic acid.  The ALDH7A1 enzyme and others in this group have been observed as tetramers, yet the bacterial P6CDH enzyme has been reported as a monomer.
Probab=21.80  E-value=2.6e+02  Score=23.48  Aligned_cols=38  Identities=13%  Similarity=0.205  Sum_probs=32.1

Q ss_pred             HHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy3264         103 KELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKN  140 (156)
Q Consensus       103 k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~  140 (156)
                      +.....|..+|.++|......+......+..++.....
T Consensus        44 ~~A~~~w~~~~~~~R~~~L~~~a~~l~~~~~~la~~~~   81 (474)
T cd07130          44 QEAFKEWRDVPAPKRGEIVRQIGDALRKKKEALGKLVS   81 (474)
T ss_pred             HHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            44456799999999999999999999999999876543


No 124
>cd07140 ALDH_F1L_FTFDH 10-formyltetrahydrofolate dehydrogenase, ALDH family 1L. 10-formyltetrahydrofolate dehydrogenase (FTHFDH, EC=1.5.1.6), also known as aldehyde dehydrogenase family 1 member L1 (ALDH1L1) in humans, is a multi-domain homotetramer with an N-terminal formyl transferase domain and a C-terminal ALDH domain. FTHFDH catalyzes an NADP+-dependent dehydrogenase reaction resulting in the conversion of 10-formyltetrahydrofolate to tetrahydrofolate and CO2. The ALDH domain is also capable of the oxidation of short chain aldehydes to their corresponding acids.
Probab=21.77  E-value=2.6e+02  Score=23.62  Aligned_cols=53  Identities=8%  Similarity=0.153  Sum_probs=39.8

Q ss_pred             ccccCCCCCchhHHHHHHH--------HhccCChHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy3264          89 IKQDNPEYGVGDIAKELGK--------KWADVDPSVKTKYEQMAEKDKARYEKEMTAYKNR  141 (156)
Q Consensus        89 ~~~~~p~~~~~e~~k~~~~--------~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~  141 (156)
                      .....+..+..++...+..        .|..++..+|..+...+......+..++......
T Consensus        33 ~i~~~~~~~~~~v~~av~~A~~A~~~~~W~~~~~~~R~~~L~~~a~~l~~~~~ela~~~~~   93 (486)
T cd07140          33 VICKVSLATVEDVDRAVAAAKEAFENGEWGKMNARDRGRLMYRLADLMEEHQEELATIESL   93 (486)
T ss_pred             EEEEEeCCCHHHHHHHHHHHHHhcCCCchhcCCHHHHHHHHHHHHHHHHHhHHHHHHHHHH
Confidence            4445555666666555433        4999999999999999999999999888876443


No 125
>cd07136 ALDH_YwdH-P39616 Bacillus subtilis aldehyde dehydrogenase ywdH-like. Uncharacterized Bacillus subtilis ywdH aldehyde dehydrogenase (locus P39616)  most closely related to the ALDHs and fatty ALDHs of families 3 and 14, and similar sequences, are included in this CD.
Probab=21.76  E-value=2.5e+02  Score=23.54  Aligned_cols=43  Identities=14%  Similarity=0.158  Sum_probs=36.0

Q ss_pred             hHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHhcC
Q psy3264         100 DIAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKNRS  142 (156)
Q Consensus       100 e~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~~  142 (156)
                      +.++..-..|..++..+|.....++....+.+..++.......
T Consensus         5 ~~a~~a~~~W~~~~~~~R~~~L~~~a~~l~~~~~ela~~~~~e   47 (449)
T cd07136           5 EKQRAFFKTGATKDVEFRIEQLKKLKQAIKKYENEILEALKKD   47 (449)
T ss_pred             HHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHhHHHHHHHHHHH
Confidence            4556666789999999999999999999999999988765443


No 126
>COG2854 Ttg2D ABC-type transport system involved in resistance to organic solvents, auxiliary component [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=21.65  E-value=63  Score=24.26  Aligned_cols=39  Identities=15%  Similarity=0.263  Sum_probs=31.8

Q ss_pred             HHHHHHHHhccCCHHHHHHHHHhhhh-hhhhhccccccCC
Q psy3264          41 FSKKCAERWKTMSEKEKKRFHEMADK-DKKRYDTEMQSYT   79 (156)
Q Consensus        41 i~k~l~~~Wk~Ls~~eK~~y~~~a~~-~k~~y~~~~~~~~   79 (156)
                      ....||.=|+++|+++++.|..-... ....|-..+..|.
T Consensus        76 a~~vLGk~~k~aspeQ~~~F~~aF~~yl~q~Y~~aL~~Y~  115 (202)
T COG2854          76 AKLVLGKYYKTASPEQRQAFFKAFRTYLEQTYGQALLDYK  115 (202)
T ss_pred             HHHHhccccccCCHHHHHHHHHHHHHHHHHHHHHHHHHcc
Confidence            34668889999999999999987777 5567888887775


No 127
>PF05823 Gp-FAR-1:  Nematode fatty acid retinoid binding protein (Gp-FAR-1);  InterPro: IPR008632 Parasitic nematodes produce at least two structurally novel classes of small helix-rich retinol- and fatty-acid-binding proteins that have no counterparts in their plant or animal hosts and thus represent potential targets for new nematicides. Gp-FAR-1 is a member of the nematode-specific fatty-acid- and retinol-binding (FAR) family of proteins but localises to the surface of the organism, placing it in a strategic position for interaction with the host. Gp-FAR-1 functions as a broad-spectrum retinol- and fatty-acid-binding protein, and it is thought that it is involved in the evasion of primary host plant defence systems [].; GO: 0008289 lipid binding; PDB: 2W9Y_A.
Probab=21.52  E-value=1.1e+02  Score=21.73  Aligned_cols=82  Identities=17%  Similarity=0.259  Sum_probs=41.0

Q ss_pred             HHHHHHHhCCCCcchHHHHHHHHHHHhccCCHHHHHHHHHhhhhhhhhhccccccCCCCCccccccccccCCCC-CchhH
Q psy3264          23 CREEHKKKHPEENVVFAEFSKKCAERWKTMSEKEKKRFHEMADKDKKRYDTEMQSYTPPAGEKRGKIKQDNPEY-GVGDI  101 (156)
Q Consensus        23 ~r~~~~~~~p~~~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~~~~~~~~~r~~~~~~~p~~-~~~e~  101 (156)
                      ....++..+|.+-..+..+...+..+...|+++-|..-...-..-+..|.....              +..|.. ....+
T Consensus        45 ~i~~LK~ksP~L~~k~~~l~~~~k~ki~~L~peak~Fv~~li~~~~~l~~~~~~--------------G~~~~~~~lk~~  110 (154)
T PF05823_consen   45 MIAALKEKSPSLYEKAEKLRDKLKKKIDKLSPEAKAFVKELIAKARSLYAQYSA--------------GEKPDLEELKQL  110 (154)
T ss_dssp             HHHHHHHH-HHHHHHHHHHHHHHHHTTTT--HHHHHHHHHHHHHHHHHHHHHHH--------------T----THHHHHH
T ss_pred             HHHHHHHhCHHHHHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHHHhcC--------------CCCCCHHHHHHH
Confidence            345566666665444556667777778888888887666654443333332222              111111 12455


Q ss_pred             HHHHHHHhccCChHHHH
Q psy3264         102 AKELGKKWADVDPSVKT  118 (156)
Q Consensus       102 ~k~~~~~Wk~ls~~eK~  118 (156)
                      .+.+...+++||++.|.
T Consensus       111 ~k~~~~~ykaLs~~ak~  127 (154)
T PF05823_consen  111 AKKVIDSYKALSPEAKD  127 (154)
T ss_dssp             H----HHHHTS-HHHHH
T ss_pred             HhhhHHHHHcCCHHHHH
Confidence            66667888888888775


No 128
>PRK12750 cpxP periplasmic repressor CpxP; Reviewed
Probab=21.48  E-value=1.6e+02  Score=21.36  Aligned_cols=34  Identities=15%  Similarity=0.324  Sum_probs=25.7

Q ss_pred             HHHHHHHhccCCHHHHHHHHHhhhhhhhhhcccc
Q psy3264          42 SKKCAERWKTMSEKEKKRFHEMADKDKKRYDTEM   75 (156)
Q Consensus        42 ~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~   75 (156)
                      .+...+.+.-||+++|..|.++..+....+...+
T Consensus       127 ~~~~~~~~~vLTpEQRak~~e~~~~r~~~~~~~~  160 (170)
T PRK12750        127 LEKRHQMLSILTPEQKAKFQELQQERMQECQDKM  160 (170)
T ss_pred             HHHHHHHHHhCCHHHHHHHHHHHHHHHHHHHHHH
Confidence            4455568999999999999998877665555443


No 129
>TIGR03092 SASP_sspI small, acid-soluble spore protein I. This protein family is restricted to a subset of endospore-forming bacteria such as Bacillus subtilis, all of which are in the Firmicutes (low-GC Gram-positive) lineage. It is a minor SASP (small, acid-soluble spore protein) designated SspI. The gene in Bacillus subtilis previously was designated ysfA.
Probab=21.46  E-value=1.1e+02  Score=18.67  Aligned_cols=19  Identities=32%  Similarity=0.520  Sum_probs=14.5

Q ss_pred             HHHHhccCCHHHHHHHHHh
Q psy3264          45 CAERWKTMSEKEKKRFHEM   63 (156)
Q Consensus        45 l~~~Wk~Ls~~eK~~y~~~   63 (156)
                      .-..|+.+++.+|......
T Consensus        41 FE~~W~~~~~~ek~~m~~~   59 (65)
T TIGR03092        41 FEAIWKHANEQEKDEMLET   59 (65)
T ss_pred             HHHHHHhcCHHHHHHHHHH
Confidence            3346999999999877653


No 130
>PF00171 Aldedh:  Aldehyde dehydrogenase family;  InterPro: IPR015590 Aldehyde dehydrogenases (1.2.1.3 from EC and 1.2.1.5 from EC) are enzymes that oxidize a wide variety of aliphatic and aromatic aldehydes using NADP as a cofactor. In mammals at least four different forms of the enzyme are known []: class-1 (or Ald C) a tetrameric cytosolic enzyme, class-2 (or Ald M) a tetrameric mitochondrial enzyme, class- 3 (or Ald D) a dimeric cytosolic enzyme, and class IV a microsomal enzyme. Aldehyde dehydrogenases have also been sequenced from fungal and bacterial species. A number of enzymes are known to be evolutionary related to aldehyde dehydrogenases. A glutamic acid and a cysteine residue have been implicated in the catalytic activity of mammalian aldehyde dehydrogenase. These residues are conserved in all the enzymes of this entry. Some of the proteins in this entry are allergens. Allergies are hypersensitivity reactions of the immune system to specific substances called allergens (such as pollen, stings, drugs, or food) that, in most people, result in no symptoms. A nomenclature system has been established for antigens (allergens) that cause IgE-mediated atopic allergies in humans [WHO/IUIS Allergen Nomenclature Subcommittee King T.P., Hoffmann D., Loewenstein H., Marsh D.G., Platts-Mills T.A.E., Thomas W. Bull. World Health Organ. 72:797-806(1994)]. This nomenclature system is defined by a designation that is composed of the first three letters of the genus; a space; the first letter of the species name; a space and an arabic number. In the event that two species names have identical designations, they are discriminated from one another by adding one or more letters (as necessary) to each species designation.  The allergens in this family include allergens with the following designations: Alt a 10 and Cla h 3.; GO: 0016491 oxidoreductase activity, 0008152 metabolic process, 0055114 oxidation-reduction process; PDB: 3R31_A 3HAZ_A 1UXQ_A 1UXP_A 1UXR_A 1UXU_A 1UXN_A 1KY8_A 1UXT_A 1UXV_A ....
Probab=21.36  E-value=2.4e+02  Score=23.48  Aligned_cols=40  Identities=10%  Similarity=0.147  Sum_probs=33.5

Q ss_pred             HHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy3264         101 IAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKN  140 (156)
Q Consensus       101 ~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~  140 (156)
                      .++.....|..++..+|......+....+++..++..-..
T Consensus        37 ~A~~A~~~w~~~~~~~R~~iL~~~a~~l~~~~~~la~~~~   76 (462)
T PF00171_consen   37 AARAAFKEWSKLPAAERARILERFADLLEERRDELAELIA   76 (462)
T ss_dssp             HHHHHHHHHTTSSHHHHHHHHHHHHHHHHHTHHHHHHHHH
T ss_pred             HHHHHHHhhhhhhhhhHHHHHHHHHHHHhhcccccccccc
Confidence            3455567899999999999999999999999998876544


No 131
>cd07138 ALDH_CddD_SSP0762 Rhodococcus ruber 6-oxolauric acid dehydrogenase-like. The 6-oxolauric acid dehydrogenase (CddD) from Rhodococcus ruber SC1 which converts 6-oxolauric acid to dodecanedioic acid, and the aldehyde dehydrogenase (locus SSP0762) from Staphylococcus saprophyticus subsp. saprophyticus ATCC 15305 and other similar sequences, are included in this CD.
Probab=21.34  E-value=2.5e+02  Score=23.45  Aligned_cols=38  Identities=5%  Similarity=0.063  Sum_probs=32.4

Q ss_pred             HHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy3264         103 KELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKN  140 (156)
Q Consensus       103 k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~  140 (156)
                      +..-..|..+|..+|..+...+......+..++.....
T Consensus        46 ~~A~~~w~~~~~~~R~~~L~~~a~~l~~~~~~la~~~~   83 (466)
T cd07138          46 RRAFPAWSATSVEERAALLERIAEAYEARADELAQAIT   83 (466)
T ss_pred             HHHHHHHhhCCHHHHHHHHHHHHHHHHHhHHHHHHHHH
Confidence            44455799999999999999999999999999887654


No 132
>COG2938 Uncharacterized conserved protein [Function unknown]
Probab=21.23  E-value=1.5e+02  Score=19.38  Aligned_cols=27  Identities=11%  Similarity=0.299  Sum_probs=22.1

Q ss_pred             HHHHHHHHHhccCCHHHHHHHHHhhhh
Q psy3264          40 EFSKKCAERWKTMSEKEKKRFHEMADK   66 (156)
Q Consensus        40 ei~k~l~~~Wk~Ls~~eK~~y~~~a~~   66 (156)
                      -++..+...|.+||++++..|..+-..
T Consensus        32 il~~Fae~~~~~lsd~el~~f~~LLe~   58 (94)
T COG2938          32 ILGPFAEKEFDSLSDEELDEFERLLEC   58 (94)
T ss_pred             HHHHHHHHHHhhCCHHHHHHHHHHHcC
Confidence            456777789999999999999986544


No 133
>cd07110 ALDH_F10_BADH Arabidopsis betaine aldehyde dehydrogenase 1 and 2, ALDH family 10A8 and 10A9-like. Present in this CD are the Arabidopsis betaine aldehyde dehydrogenase (BADH) 1 (chloroplast) and 2 (mitochondria), also known as, aldehyde dehydrogenase family 10 member A8 and aldehyde dehydrogenase family 10 member A9, respectively, and are putative dehydration- and salt-inducible BADHs (EC 1.2.1.8) that catalyze the oxidation of betaine aldehyde to the compatible solute glycine betaine.
Probab=21.22  E-value=2.6e+02  Score=23.20  Aligned_cols=38  Identities=11%  Similarity=0.259  Sum_probs=31.8

Q ss_pred             HHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy3264         103 KELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKN  140 (156)
Q Consensus       103 k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~  140 (156)
                      +.....|..+|.++|..+...+......+..++.....
T Consensus        29 ~~A~~~w~~~~~~~R~~~L~~~a~~l~~~~~~la~~~~   66 (456)
T cd07110          29 RRAFPRWKKTTGAERAKYLRAIAEGVRERREELAELEA   66 (456)
T ss_pred             HHHHHHHhcCCHHHHHHHHHHHHHHHHHhHHHHHHHHH
Confidence            44556899999999999999999999999888876543


No 134
>PLN02315 aldehyde dehydrogenase family 7 member
Probab=21.03  E-value=2.7e+02  Score=23.78  Aligned_cols=52  Identities=12%  Similarity=0.074  Sum_probs=38.1

Q ss_pred             ccccCCCCCchhHH------HHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy3264          89 IKQDNPEYGVGDIA------KELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKN  140 (156)
Q Consensus        89 ~~~~~p~~~~~e~~------k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~  140 (156)
                      .....+..+..++.      +.....|..+|..+|..+...+....+.+..++..-..
T Consensus        46 ~i~~~~~a~~~~v~~av~~A~~A~~~W~~~~~~~R~~~L~~~a~~l~~~~~~la~~~~  103 (508)
T PLN02315         46 PIAEVVEASLEDYEEGLRACEEAAKIWMQVPAPKRGEIVRQIGDALRAKLDYLGRLVS  103 (508)
T ss_pred             EEEEEeCCCHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHhHHHHHHHHHH
Confidence            33444555555544      33344699999999999999999999999988876543


No 135
>cd07088 ALDH_LactADH-AldA Escherichia coli lactaldehyde dehydrogenase AldA-like. Lactaldehyde dehydrogenase from Escherichia coli (AldA, LactADH, EC=1.2.1.22), an NAD(+)-dependent enzyme involved in the metabolism of L-fucose and L-rhamnose, and other similar sequences are present in this CD.
Probab=21.03  E-value=2.6e+02  Score=23.30  Aligned_cols=51  Identities=14%  Similarity=0.178  Sum_probs=37.5

Q ss_pred             ccccCCCCCchhH------HHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3264          89 IKQDNPEYGVGDI------AKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYK  139 (156)
Q Consensus        89 ~~~~~p~~~~~e~------~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~  139 (156)
                      .....+..+..++      ++.....|..+|..+|..+...+....+.+..++....
T Consensus        25 ~i~~~~~~~~~~v~~av~~A~~A~~~w~~~~~~~R~~~L~~~a~~l~~~~~~l~~~~   81 (468)
T cd07088          25 VVATVPAATAEDADRAVDAAEAAQKAWERLPAIERAAYLRKLADLIRENADELAKLI   81 (468)
T ss_pred             EEEEEeCCCHHHHHHHHHHHHHHHHHHhhCCHHHHHHHHHHHHHHHHHhHHHHHHHH
Confidence            3334455444444      34555679999999999999999999999988887643


No 136
>PLN02278 succinic semialdehyde dehydrogenase
Probab=20.97  E-value=2.6e+02  Score=23.72  Aligned_cols=38  Identities=8%  Similarity=0.099  Sum_probs=31.9

Q ss_pred             HHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy3264         103 KELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKN  140 (156)
Q Consensus       103 k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~  140 (156)
                      +..-..|..+|..+|..+...+......+..++.....
T Consensus        72 ~~A~~~W~~~~~~~R~~~L~~~a~~l~~~~~~la~~~~  109 (498)
T PLN02278         72 HDAFPSWSKLTASERSKILRRWYDLIIANKEDLAQLMT  109 (498)
T ss_pred             HHHHHHHhcCCHHHHHHHHHHHHHHHHHhHHHHHHHHH
Confidence            34445799999999999999999999999998877643


No 137
>TIGR02518 EutH_ACDH acetaldehyde dehydrogenase (acetylating).
Probab=20.94  E-value=2.7e+02  Score=23.63  Aligned_cols=40  Identities=15%  Similarity=0.043  Sum_probs=33.4

Q ss_pred             hHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3264         100 DIAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYK  139 (156)
Q Consensus       100 e~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~  139 (156)
                      +.++.....|..+|.++|................++...-
T Consensus        15 ~~A~~A~~~w~~~s~~~R~~iL~~~a~~l~~~~~ela~~~   54 (488)
T TIGR02518        15 RSAKVAQKKLANMTQEQIDKIVKAIVDAAYENAVKLAKMA   54 (488)
T ss_pred             HHHHHHHHHHhhCCHHHHHHHHHHHHHHHHhhHHHHHHHH
Confidence            3456666789999999999999999999999999888843


No 138
>PF08855 DUF1825:  Domain of unknown function (DUF1825);  InterPro: IPR014954 These roteins are uncharacterised and are principally found in cyanobacteria. 
Probab=20.89  E-value=2.7e+02  Score=18.69  Aligned_cols=37  Identities=14%  Similarity=0.070  Sum_probs=31.1

Q ss_pred             HHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy3264         104 ELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKN  140 (156)
Q Consensus       104 ~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~  140 (156)
                      .++.....++.+.|.-|++..+.+.++|+.=+..+.-
T Consensus        26 ~~~~~~~~fd~egK~~~Id~m~~LidkqkiF~~Rl~L   62 (108)
T PF08855_consen   26 QMGSKYGKFDREGKKIHIDKMEELIDKQKIFYKRLEL   62 (108)
T ss_pred             HHHhhcccCCHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence            4556699999999999999999999999887666543


No 139
>cd07135 ALDH_F14-YMR110C Saccharomyces cerevisiae aldehyde dehydrogenase family 14 and related proteins. Aldehyde dehydrogenase family 14 (ALDH14), isolated mainly from the mitochondrial outer membrane of Saccharomyces cerevisiae (YMR110C) and most closely related to the plant and animal ALDHs and fatty ALDHs family 3 members, and similar fungal sequences, are present in this CD.
Probab=20.83  E-value=2.7e+02  Score=23.21  Aligned_cols=40  Identities=5%  Similarity=-0.146  Sum_probs=33.2

Q ss_pred             HHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy3264         102 AKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKNR  141 (156)
Q Consensus       102 ~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~  141 (156)
                      ++..-..|..++..+|..+...+......+..++..-...
T Consensus        14 A~~a~~~w~~~~~~~R~~~L~~~a~~l~~~~~~l~~~~~~   53 (436)
T cd07135          14 LRATFRSGKTKDLEYRLWQLKQLYWAVKDNEEAIVEALKK   53 (436)
T ss_pred             HHHHHHhcCCCCHHHHHHHHHHHHHHHHHhHHHHHHHHHH
Confidence            4455567999999999999999999999999998875443


No 140
>PRK11903 aldehyde dehydrogenase; Provisional
Probab=20.79  E-value=2.3e+02  Score=24.31  Aligned_cols=34  Identities=3%  Similarity=-0.131  Sum_probs=30.3

Q ss_pred             HHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy3264         107 KKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKN  140 (156)
Q Consensus       107 ~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~  140 (156)
                      ..|+.++..+|..+...+....+....++.....
T Consensus        55 ~~W~~~~~~eR~~~L~~~a~~l~~~~~ela~~~~   88 (521)
T PRK11903         55 AALRALTYAQRAALLAAIVKVLQANRDAYYDIAT   88 (521)
T ss_pred             hhhhhCCHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            5799999999999999999999999999887644


No 141
>cd07097 ALDH_KGSADH-YcbD Bacillus subtilis NADP+-dependent alpha-ketoglutaric semialdehyde dehydrogenase ycbD-like. Kinetic studies of the Bacillus subtilis ALDH-like ycbD protein, which is involved in d-glucarate/d-galactarate utilization, reveal that it is a NADP+-dependent, alpha-ketoglutaric semialdehyde dehydrogenase (KGSADH). KGSADHs (EC 1.2.1.26) catalyze the NAD(P)+-dependent conversion of KGSA to alpha-ketoglutarate. Interestingly, the NADP+-dependent, tetrameric, 2,5-dioxopentanoate dehydrogenase (EC=1.2.1.26), an enzyme involved in the catabolic pathway for D-arabinose in Sulfolobus solfataricus, also clusters in this group. This CD shows a distant phylogenetic relationship to the Azospirillum brasilense KGSADH-II (-III) group.
Probab=20.66  E-value=2.8e+02  Score=23.26  Aligned_cols=50  Identities=10%  Similarity=0.128  Sum_probs=37.2

Q ss_pred             cccCCCCCchhHH------HHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3264          90 KQDNPEYGVGDIA------KELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYK  139 (156)
Q Consensus        90 ~~~~p~~~~~e~~------k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~  139 (156)
                      ....|..+..++.      +.....|..++..+|......+......+..++....
T Consensus        28 i~~~~~~~~~~v~~av~~A~~A~~~W~~~~~~~R~~~L~~~a~~l~~~~~ela~~~   83 (473)
T cd07097          28 VGKYARASAEDADAAIAAAAAAFPAWRRTSPEARADILDKAGDELEARKEELARLL   83 (473)
T ss_pred             EEEEeCCCHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHhHHHHHHHH
Confidence            3444444544443      3344579999999999999999999999988887764


No 142
>TIGR03216 OH_muco_semi_DH 2-hydroxymuconic semialdehyde dehydrogenase. Members of this protein family are 2-hydroxymuconic semialdehyde dehydrogenase. Many aromatic compounds are catabolized by way of the catechol, via the meta-cleavage pathway, to pyruvate and acetyl-CoA. This enzyme performs the second of seven steps in that pathway for catechol degradation.
Probab=20.47  E-value=2.5e+02  Score=23.62  Aligned_cols=33  Identities=9%  Similarity=0.155  Sum_probs=29.6

Q ss_pred             hccCChHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy3264         109 WADVDPSVKTKYEQMAEKDKARYEKEMTAYKNR  141 (156)
Q Consensus       109 Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~  141 (156)
                      |..+|..+|..+...+......+..++..-...
T Consensus        53 w~~~~~~~R~~~L~~~a~~l~~~~~ela~~~~~   85 (481)
T TIGR03216        53 WGKMTVAERADLLYAVADEIERRFDDFLAAEVA   85 (481)
T ss_pred             HhhCCHHHHHHHHHHHHHHHHHhHHHHHHHHHH
Confidence            999999999999999999999999998876543


No 143
>TIGR03374 ABALDH 1-pyrroline dehydrogenase. Members of this protein family are 1-pyrroline dehydrogenase (1.5.1.35), also called gamma-aminobutyraldehyde dehydrogenase. This enzyme can follow putrescine transaminase (EC 2.6.1.82) for a two-step conversion of putrescine to gamma-aminobutyric acid (GABA). The member from Escherichia coli is characterized as a homotetramer that binds one NADH per momomer. This enzyme belongs to the medium-chain aldehyde dehydrogenases, and is quite similar in sequence to the betaine aldehyde dehydrogenase (EC 1.2.1.8) family.
Probab=20.46  E-value=2.8e+02  Score=23.38  Aligned_cols=39  Identities=8%  Similarity=0.159  Sum_probs=32.8

Q ss_pred             HHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy3264         103 KELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKNR  141 (156)
Q Consensus       103 k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~  141 (156)
                      +..-..|..++..+|..+...+......+..++......
T Consensus        48 ~~a~~~w~~~~~~~R~~~L~~~a~~l~~~~~ela~~~~~   86 (472)
T TIGR03374        48 DAAFAEWGQTTPKARAECLLKLADVIEENAQVFAELESR   86 (472)
T ss_pred             HHHHHHHhcCCHHHHHHHHHHHHHHHHHhHHHHHHHHHH
Confidence            344457999999999999999999999999999876443


No 144
>cd07092 ALDH_ABALDH-YdcW Escherichia coli NAD+-dependent gamma-aminobutyraldehyde dehydrogenase YdcW-like. NAD+-dependent, tetrameric, gamma-aminobutyraldehyde dehydrogenase (ABALDH), YdcW of Escherichia coli K12, catalyzes the oxidation of gamma-aminobutyraldehyde to gamma-aminobutyric acid. ABALDH can also oxidize n-alkyl medium-chain aldehydes, but with a lower catalytic efficiency.
Probab=20.44  E-value=3e+02  Score=22.75  Aligned_cols=39  Identities=15%  Similarity=0.228  Sum_probs=32.7

Q ss_pred             HHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy3264         103 KELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKNR  141 (156)
Q Consensus       103 k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~  141 (156)
                      +.....|..+|..+|..+...+.....++..++......
T Consensus        29 ~~a~~~w~~~~~~~R~~~L~~~~~~l~~~~~~l~~~~~~   67 (450)
T cd07092          29 HAAFPSWRRTTPAERSKALLKLADAIEENAEELAALESR   67 (450)
T ss_pred             HHHHHHHhcCCHHHHHHHHHHHHHHHHHhHHHHHHHHHH
Confidence            445567999999999999999999999999888877543


No 145
>TIGR01722 MMSDH methylmalonic acid semialdehyde dehydrogenase. In Bacillus, a highly homologous protein to methylmalonic acid semialdehyde dehydrogenase, groups out from the main MMSDH clade with Listeria and Sulfolobus. This Bacillus protein has been suggested to be located in an iol operon and/or involved in myo-inositol catabolism, converting malonic semialdehyde to acetyl CoA ad CO2. The preceeding enzymes responsible for valine catabolism are present in Bacillus, Listeria, and Sulfolobus.
Probab=20.42  E-value=2.9e+02  Score=23.20  Aligned_cols=37  Identities=11%  Similarity=0.053  Sum_probs=31.1

Q ss_pred             HHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3264         103 KELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYK  139 (156)
Q Consensus       103 k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~  139 (156)
                      +.....|..++..+|......+......+..++..-.
T Consensus        48 ~~A~~~w~~~~~~~R~~~L~~~a~~l~~~~~~l~~~~   84 (477)
T TIGR01722        48 RETFLTWGQTSLAQRTSVLLRYQALLKEHRDEIAELI   84 (477)
T ss_pred             HHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence            4455679999999999999999999988888877653


No 146
>cd07111 ALDH_F16 Aldehyde dehydrogenase family 16A1-like. Uncharacterized aldehyde dehydrogenase family 16 member A1 (ALDH16A1) and other related sequences are present in this CD. The active site cysteine and glutamate residues are not conserved in the human ALDH16A1 protein sequence.
Probab=20.39  E-value=2.8e+02  Score=23.40  Aligned_cols=50  Identities=6%  Similarity=0.126  Sum_probs=37.1

Q ss_pred             ccCCCCCchhHH------HHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy3264          91 QDNPEYGVGDIA------KELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKN  140 (156)
Q Consensus        91 ~~~p~~~~~e~~------k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~  140 (156)
                      ...+..+..++.      +.....|+.+|..+|..+...+......+..++.....
T Consensus        51 ~~~~~a~~~~v~~ai~~A~~a~~~w~~~~~~~R~~~L~~~a~~l~~~~~ela~~~~  106 (480)
T cd07111          51 ASVLQAEEEDVDAAVAAARTAFESWSALPGHVRARHLYRIARHIQKHQRLFAVLES  106 (480)
T ss_pred             EEEcCCCHHHHHHHHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHhHHHHHHHHH
Confidence            334444445544      33346799999999999999999999999999876544


No 147
>KOG1610|consensus
Probab=20.28  E-value=1.9e+02  Score=23.35  Aligned_cols=33  Identities=21%  Similarity=0.385  Sum_probs=0.0

Q ss_pred             HHHHHHHHHHHhccCCHHHHHHHHHhhhhhhhh
Q psy3264          38 FAEFSKKCAERWKTMSEKEKKRFHEMADKDKKR   70 (156)
Q Consensus        38 ~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~   70 (156)
                      ...+.+.+...|..||++.|+.|-..+..+...
T Consensus       218 ~~~~~~~~~~~w~~l~~e~k~~YGedy~~~~~~  250 (322)
T KOG1610|consen  218 PEKLEKRMKEIWERLPQETKDEYGEDYFEDYKK  250 (322)
T ss_pred             hHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHH


No 148
>TIGR03240 arg_catab_astD succinylglutamic semialdehyde dehydrogenase. Members of this protein family are succinylglutamic semialdehyde dehydrogenase (EC 1.2.1.71), the fourth enzyme in the arginine succinyltransferase (AST) pathway for arginine catabolism.
Probab=20.19  E-value=2.9e+02  Score=23.29  Aligned_cols=39  Identities=10%  Similarity=0.052  Sum_probs=32.3

Q ss_pred             HHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy3264         103 KELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKNR  141 (156)
Q Consensus       103 k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~  141 (156)
                      +.....|..++..+|..+...+.....++..++......
T Consensus        45 ~~A~~~w~~~~~~~R~~~L~~~a~~l~~~~~ela~~~~~   83 (484)
T TIGR03240        45 RAAFPAWARLSLEERIAVVQRFAALLEERKEALARVIAR   83 (484)
T ss_pred             HHHHHHHccCCHHHHHHHHHHHHHHHHHhHHHHHHHHHH
Confidence            444467999999999999999999999999888776443


No 149
>PRK09406 gabD1 succinic semialdehyde dehydrogenase; Reviewed
Probab=20.18  E-value=2.4e+02  Score=23.60  Aligned_cols=50  Identities=4%  Similarity=0.013  Sum_probs=37.7

Q ss_pred             cCCCCCchhHH------HHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy3264          92 DNPEYGVGDIA------KELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKNR  141 (156)
Q Consensus        92 ~~p~~~~~e~~------k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~  141 (156)
                      ..+..+..++.      +..-..|..++.++|..+...+......+..++......
T Consensus        16 ~~~~~~~~~v~~av~~A~~A~~~w~~~~~~~R~~~L~~~a~~l~~~~~ela~~~~~   71 (457)
T PRK09406         16 TFTALTDDEVDAAIARAHARFRDYRTTTFAQRARWANAAADLLEAEADQVAALMTL   71 (457)
T ss_pred             EEeCCCHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHCHHHHHHHHHH
Confidence            34444444443      444456999999999999999999999999998876443


No 150
>PLN02418 delta-1-pyrroline-5-carboxylate synthase
Probab=20.17  E-value=2.8e+02  Score=24.98  Aligned_cols=39  Identities=15%  Similarity=0.097  Sum_probs=33.9

Q ss_pred             HHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3264         101 IAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYK  139 (156)
Q Consensus       101 ~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~  139 (156)
                      .++.....|..+|..+|..+...+......+..++....
T Consensus       302 aA~~A~~~W~~~~~~eR~~~L~~~a~~l~~~~~ela~~~  340 (718)
T PLN02418        302 AARESSRKLQALSSEERKKILLDVADALEANEELIKAEN  340 (718)
T ss_pred             HHHHHHHHHHhCCHHHHHHHHHHHHHHHHHhHHHHHHHH
Confidence            445556689999999999999999999999999998874


No 151
>PLN02419 methylmalonate-semialdehyde dehydrogenase [acylating]
Probab=20.13  E-value=2.8e+02  Score=24.47  Aligned_cols=38  Identities=5%  Similarity=0.035  Sum_probs=32.5

Q ss_pred             HHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy3264         103 KELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKN  140 (156)
Q Consensus       103 k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~  140 (156)
                      +.....|..++..+|..+...+....+.+..++.....
T Consensus       161 ~~A~~~W~~~~~~eR~~iL~k~a~~L~~~~~ela~~~~  198 (604)
T PLN02419        161 KQAFPLWRNTPITTRQRVMLKFQELIRKNMDKLAMNIT  198 (604)
T ss_pred             HHHHHHhccCCHHHHHHHHHHHHHHHHHhHHHHHHHHH
Confidence            44455799999999999999999999999999877653


No 152
>cd07105 ALDH_SaliADH Salicylaldehyde dehydrogenase, DoxF-like. Salicylaldehyde dehydrogenase (DoxF, SaliADH, EC=1.2.1.65) involved in the upper naphthalene catabolic pathway of Pseudomonas strain C18 and other similar sequences are present in this CD.
Probab=20.04  E-value=2.5e+02  Score=23.19  Aligned_cols=38  Identities=16%  Similarity=0.097  Sum_probs=31.5

Q ss_pred             HHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3264         102 AKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYK  139 (156)
Q Consensus       102 ~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~  139 (156)
                      ++..-..|..+|.++|..+...+....+.+..++..-.
T Consensus         9 A~~a~~~w~~~~~~~R~~~L~~~a~~l~~~~~~la~~~   46 (432)
T cd07105           9 AAAAFPAWSKTPPSERRDILLKAADLLESRRDEFIEAM   46 (432)
T ss_pred             HHHHHHHHhcCCHHHHHHHHHHHHHHHHhCHHHHHHHH
Confidence            44556789999999999999999999988888876654


Done!