Query psy3264
Match_columns 156
No_of_seqs 141 out of 1679
Neff 8.5
Searched_HMMs 46136
Date Fri Aug 16 20:36:09 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy3264.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/3264hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 COG5648 NHP6B Chromatin-associ 99.9 4.8E-26 1E-30 166.2 10.4 132 3-136 65-209 (211)
2 PTZ00199 high mobility group p 99.9 3.1E-23 6.7E-28 137.0 7.9 77 3-79 17-93 (94)
3 cd01389 MATA_HMG-box MATA_HMG- 99.8 4.9E-20 1.1E-24 117.5 7.9 71 8-80 1-71 (77)
4 cd01388 SOX-TCF_HMG-box SOX-TC 99.8 1.8E-19 3.9E-24 113.5 7.9 69 9-79 2-70 (72)
5 cd01390 HMGB-UBF_HMG-box HMGB- 99.8 1.6E-19 3.4E-24 111.5 7.5 65 9-75 1-65 (66)
6 PF09011 HMG_box_2: HMG-box do 99.8 1.1E-19 2.3E-24 114.8 6.2 71 6-78 1-72 (73)
7 PF00505 HMG_box: HMG (high mo 99.8 1.4E-19 3E-24 112.8 5.9 68 9-78 1-68 (69)
8 smart00398 HMG high mobility g 99.8 5.7E-19 1.2E-23 110.0 7.4 69 8-78 1-69 (70)
9 cd00084 HMG-box High Mobility 99.7 1.5E-17 3.3E-22 102.3 7.5 65 9-75 1-65 (66)
10 KOG0381|consensus 99.7 3.4E-16 7.3E-21 103.6 7.2 72 7-80 21-93 (96)
11 cd01388 SOX-TCF_HMG-box SOX-TC 99.5 1.7E-15 3.7E-20 95.3 1.3 69 72-140 3-71 (72)
12 KOG0527|consensus 99.5 1.5E-14 3.2E-19 114.3 6.5 76 3-80 57-132 (331)
13 KOG0526|consensus 99.5 7.8E-15 1.7E-19 119.8 4.4 71 3-79 530-600 (615)
14 PTZ00199 high mobility group p 99.5 1.3E-14 2.7E-19 95.9 3.3 73 67-139 19-93 (94)
15 cd01389 MATA_HMG-box MATA_HMG- 99.5 8.4E-15 1.8E-19 93.3 1.3 71 72-142 3-73 (77)
16 PF00505 HMG_box: HMG (high mo 99.4 7E-14 1.5E-18 86.8 2.0 67 73-139 3-69 (69)
17 cd01390 HMGB-UBF_HMG-box HMGB- 99.4 2.8E-13 6E-18 83.3 2.6 63 73-135 3-65 (66)
18 KOG0527|consensus 99.4 2.3E-13 5.1E-18 107.5 2.0 77 67-143 59-135 (331)
19 smart00398 HMG high mobility g 99.3 5.7E-13 1.2E-17 82.6 2.6 67 73-139 4-70 (70)
20 PF09011 HMG_box_2: HMG-box do 99.3 1.8E-12 3.9E-17 81.6 2.0 66 74-139 7-73 (73)
21 KOG0381|consensus 99.2 9.2E-12 2E-16 82.2 3.8 73 70-142 22-95 (96)
22 cd00084 HMG-box High Mobility 99.2 1.7E-11 3.7E-16 75.1 2.6 63 73-135 3-65 (66)
23 KOG3248|consensus 99.1 9.3E-11 2E-15 91.5 6.0 70 8-79 191-260 (421)
24 COG5648 NHP6B Chromatin-associ 99.1 4.1E-11 8.9E-16 88.1 2.9 83 67-149 67-149 (211)
25 KOG4715|consensus 99.1 9.2E-11 2E-15 90.8 4.7 74 4-79 60-133 (410)
26 PF14887 HMG_box_5: HMG (high 98.8 1.3E-08 2.8E-13 63.4 5.9 73 8-83 3-75 (85)
27 KOG0526|consensus 98.8 2.5E-09 5.5E-14 88.0 2.6 64 72-139 537-600 (615)
28 KOG0528|consensus 98.6 1.6E-08 3.5E-13 82.4 2.1 71 8-80 325-395 (511)
29 KOG0528|consensus 98.3 1.3E-07 2.8E-12 77.2 0.9 78 69-146 324-401 (511)
30 KOG4715|consensus 98.3 8.6E-07 1.9E-11 69.1 4.3 56 85-140 79-134 (410)
31 KOG2746|consensus 98.3 5.4E-07 1.2E-11 76.2 3.4 66 7-74 180-247 (683)
32 PF14887 HMG_box_5: HMG (high 98.1 4.9E-06 1.1E-10 51.9 4.5 54 91-145 24-77 (85)
33 PF04690 YABBY: YABBY protein; 97.9 2.2E-05 4.8E-10 56.8 5.5 48 4-53 117-164 (170)
34 KOG3248|consensus 97.9 1.7E-06 3.6E-11 68.1 -1.1 79 67-145 188-266 (421)
35 PF06382 DUF1074: Protein of u 97.4 0.00045 9.8E-09 50.0 5.5 47 13-65 83-129 (183)
36 KOG2746|consensus 96.8 0.0011 2.3E-08 56.8 2.9 68 67-134 178-247 (683)
37 PF08073 CHDNT: CHDNT (NUC034) 96.6 0.0034 7.4E-08 36.9 3.3 40 13-54 13-52 (55)
38 PF11304 DUF3106: Protein of u 94.5 0.071 1.5E-06 35.8 4.2 81 39-120 10-103 (107)
39 PF06382 DUF1074: Protein of u 94.5 0.038 8.3E-07 40.2 2.9 38 88-125 92-129 (183)
40 PF06244 DUF1014: Protein of u 93.1 0.18 3.9E-06 34.7 4.1 48 5-54 68-116 (122)
41 PF04769 MAT_Alpha1: Mating-ty 93.0 0.27 5.8E-06 36.8 5.2 55 4-66 39-93 (201)
42 PF08073 CHDNT: CHDNT (NUC034) 92.1 0.054 1.2E-06 31.9 0.4 33 82-114 20-52 (55)
43 TIGR03481 HpnM hopanoid biosyn 92.0 0.38 8.2E-06 35.8 5.0 47 95-141 63-111 (198)
44 PRK15117 ABC transporter perip 88.9 1.3 2.7E-05 33.4 5.4 49 93-141 65-115 (211)
45 PF05494 Tol_Tol_Ttg2: Toluene 80.5 2.8 6E-05 30.1 3.8 47 94-140 36-84 (170)
46 PF04690 YABBY: YABBY protein; 80.5 0.54 1.2E-05 34.2 0.1 45 69-113 120-164 (170)
47 TIGR03481 HpnM hopanoid biosyn 77.8 3 6.6E-05 31.0 3.4 42 38-79 66-109 (198)
48 PF11304 DUF3106: Protein of u 75.4 14 0.00031 24.6 5.9 25 101-125 12-36 (107)
49 PF01352 KRAB: KRAB box; Inte 73.3 2.4 5.2E-05 23.3 1.4 28 97-124 2-30 (41)
50 KOG3223|consensus 73.1 5.1 0.00011 29.7 3.4 44 10-55 166-209 (221)
51 PF13875 DUF4202: Domain of un 71.7 10 0.00022 28.0 4.7 40 15-59 131-170 (185)
52 COG2854 Ttg2D ABC-type transpo 69.3 12 0.00026 28.0 4.7 54 94-147 68-123 (202)
53 PF12881 NUT_N: NUT protein N 68.1 14 0.00031 29.5 5.1 52 13-66 229-280 (328)
54 PF05494 Tol_Tol_Ttg2: Toluene 64.3 4 8.7E-05 29.3 1.4 41 39-79 41-83 (170)
55 PRK15117 ABC transporter perip 62.7 6.7 0.00015 29.5 2.3 42 38-79 70-113 (211)
56 PF06945 DUF1289: Protein of u 59.6 11 0.00023 21.6 2.3 21 107-127 28-48 (51)
57 PRK10363 cpxP periplasmic repr 57.0 48 0.001 24.1 5.8 44 99-143 111-154 (166)
58 PRK09706 transcriptional repre 55.3 62 0.0014 22.1 6.1 46 99-144 86-131 (135)
59 PF15581 Imm35: Immunity prote 51.7 38 0.00082 21.9 4.0 30 38-67 32-61 (93)
60 KOG3838|consensus 51.2 19 0.0004 29.9 3.2 40 111-150 268-307 (497)
61 PF06244 DUF1014: Protein of u 51.0 12 0.00026 25.8 1.8 31 84-114 86-116 (122)
62 PRK12751 cpxP periplasmic stre 49.0 47 0.001 24.0 4.7 77 50-129 55-147 (162)
63 PF12650 DUF3784: Domain of un 46.8 14 0.00031 23.9 1.6 18 106-123 23-40 (97)
64 PRK10236 hypothetical protein; 46.4 23 0.00051 27.2 2.9 26 101-126 118-143 (237)
65 PRK10236 hypothetical protein; 45.6 25 0.00055 27.0 3.0 28 39-66 116-143 (237)
66 PF11616 EZH2_WD-Binding: WD r 43.1 47 0.001 16.9 2.9 14 39-52 12-25 (30)
67 PRK12750 cpxP periplasmic repr 40.3 90 0.0019 22.6 5.1 35 101-135 126-160 (170)
68 cd07081 ALDH_F20_ACDH_EutE-lik 39.4 89 0.0019 26.2 5.6 42 100-141 6-47 (439)
69 KOG3223|consensus 39.3 25 0.00055 26.1 2.1 32 84-115 178-209 (221)
70 PF06424 PRP1_N: PRP1 splicing 34.4 33 0.00071 24.0 2.0 31 25-55 93-127 (133)
71 PF05914 RIB43A: RIB43A; Inte 34.2 29 0.00063 28.6 1.9 40 87-126 250-291 (379)
72 cd07122 ALDH_F20_ACDH Coenzyme 32.8 1.3E+02 0.0028 25.2 5.6 41 101-141 7-47 (436)
73 PF09791 Oxidored-like: Oxidor 32.7 78 0.0017 18.0 3.0 18 123-140 30-47 (48)
74 PF11278 DUF3079: Protein of u 32.5 32 0.00069 19.6 1.3 13 1-13 1-13 (52)
75 cd07133 ALDH_CALDH_CalB Conife 32.1 1.3E+02 0.0028 25.0 5.5 43 100-142 5-47 (434)
76 cd07085 ALDH_F6_MMSDH Methylma 29.8 1.5E+02 0.0033 24.9 5.6 38 103-140 48-85 (478)
77 cd07132 ALDH_F3AB Aldehyde deh 29.2 1.5E+02 0.0033 24.7 5.5 42 100-141 5-46 (443)
78 PF11521 TFIIE-A_C-term: C-ter 28.6 61 0.0013 20.8 2.3 29 96-127 50-80 (86)
79 TIGR00787 dctP tripartite ATP- 28.5 1.4E+02 0.003 22.6 4.8 29 106-134 213-241 (257)
80 PRK12751 cpxP periplasmic stre 28.0 1E+02 0.0022 22.2 3.7 32 40-71 118-149 (162)
81 PF02026 RyR: RyR domain; Int 27.9 61 0.0013 21.1 2.3 19 49-67 61-79 (94)
82 cd07087 ALDH_F3-13-14_CALDH-li 27.9 1.7E+02 0.0036 24.2 5.5 42 100-141 5-46 (426)
83 PF05388 Carbpep_Y_N: Carboxyp 27.9 1.1E+02 0.0024 20.7 3.6 29 38-66 45-73 (113)
84 PF06628 Catalase-rel: Catalas 27.5 55 0.0012 19.7 1.9 20 44-63 12-31 (68)
85 cd08317 Death_ank Death domain 27.4 26 0.00055 22.1 0.5 21 94-114 3-23 (84)
86 PRK13252 betaine aldehyde dehy 26.5 1.8E+02 0.0039 24.5 5.5 39 103-141 54-92 (488)
87 cd07150 ALDH_VaniDH_like Pseud 26.4 1.8E+02 0.0039 24.1 5.4 37 103-139 31-67 (451)
88 cd07152 ALDH_BenzADH NAD-depen 25.6 2E+02 0.0043 23.8 5.5 40 102-141 22-61 (443)
89 PRK13968 putative succinate se 25.4 2E+02 0.0043 24.1 5.5 53 89-141 19-77 (462)
90 cd07118 ALDH_SNDH Gluconobacte 25.3 2E+02 0.0043 24.0 5.5 51 90-140 10-68 (454)
91 PRK10455 periplasmic protein; 25.1 1.5E+02 0.0032 21.3 4.1 27 100-126 118-144 (161)
92 PF00226 DnaJ: DnaJ domain; I 25.1 1.1E+02 0.0024 17.6 3.0 38 21-58 19-60 (64)
93 cd07084 ALDH_KGSADH-like ALDH 24.9 1.9E+02 0.004 24.2 5.2 42 100-141 6-47 (442)
94 cd07101 ALDH_SSADH2_GabD2 Myco 24.9 2E+02 0.0044 23.9 5.5 50 92-141 11-66 (454)
95 PF03480 SBP_bac_7: Bacterial 24.9 1.5E+02 0.0033 22.7 4.5 32 106-137 213-244 (286)
96 PRK11241 gabD succinate-semial 24.7 2E+02 0.0044 24.3 5.5 53 89-141 38-96 (482)
97 cd07077 ALDH-like NAD(P)+-depe 24.3 1.9E+02 0.0041 23.6 5.1 39 102-140 3-41 (397)
98 cd07099 ALDH_DDALDH Methylomon 24.2 2.2E+02 0.0048 23.6 5.6 38 102-139 27-64 (453)
99 cd07108 ALDH_MGR_2402 Magnetos 24.0 2.2E+02 0.0047 23.7 5.5 39 103-141 29-67 (457)
100 PRK00197 proA gamma-glutamyl p 23.9 2.2E+02 0.0049 23.5 5.5 40 101-140 12-51 (417)
101 TIGR01804 BADH glycine betaine 23.7 2.2E+02 0.0047 23.8 5.4 38 103-140 45-82 (467)
102 PF07813 LTXXQ: LTXXQ motif fa 23.3 1.3E+02 0.0028 18.8 3.3 24 100-123 76-99 (100)
103 PTZ00037 DnaJ_C chaperone prot 23.2 1.8E+02 0.0039 24.3 4.8 43 21-64 47-89 (421)
104 cd07131 ALDH_AldH-CAJ73105 Unc 23.2 2.3E+02 0.005 23.7 5.5 37 103-139 47-83 (478)
105 cd07098 ALDH_F15-22 Aldehyde d 23.2 2.5E+02 0.0054 23.4 5.7 37 103-139 28-64 (465)
106 cd07137 ALDH_F3FHI Plant aldeh 23.0 2.4E+02 0.0051 23.5 5.5 42 100-141 6-47 (432)
107 cd07147 ALDH_F21_RNP123 Aldehy 22.8 2.4E+02 0.0052 23.4 5.5 39 102-140 30-68 (452)
108 PF14098 SSPI: Small, acid-sol 22.7 1E+02 0.0022 18.8 2.4 19 45-63 42-60 (65)
109 cd08332 CARD_CASP2 Caspase act 22.7 40 0.00086 21.7 0.7 17 9-25 61-77 (90)
110 cd07106 ALDH_AldA-AAD23400 Str 22.6 2.4E+02 0.0053 23.3 5.5 38 103-140 29-66 (446)
111 cd07559 ALDH_ACDHII_AcoD-like 22.6 2.4E+02 0.0052 23.7 5.5 38 103-140 48-85 (480)
112 PLN02174 aldehyde dehydrogenas 22.5 2.4E+02 0.0051 24.1 5.4 43 101-143 18-60 (484)
113 cd07100 ALDH_SSADH1_GabD1 Myco 22.5 2.5E+02 0.0053 23.2 5.5 39 101-139 7-45 (429)
114 cd07086 ALDH_F7_AASADH-like NA 22.2 2.4E+02 0.0052 23.7 5.5 38 103-140 45-82 (478)
115 cd07104 ALDH_BenzADH-like ALDH 22.1 2.5E+02 0.0055 23.0 5.5 39 101-139 8-46 (431)
116 TIGR01780 SSADH succinate-semi 22.1 2.5E+02 0.0054 23.3 5.5 40 102-141 28-67 (448)
117 cd07142 ALDH_F2BC Arabidosis a 22.1 2.6E+02 0.0055 23.5 5.6 53 90-142 32-92 (476)
118 TIGR01238 D1pyr5carbox3 delta- 22.0 2.5E+02 0.0054 23.9 5.5 52 89-140 64-121 (500)
119 cd07151 ALDH_HBenzADH NADP+-de 22.0 2.5E+02 0.0054 23.5 5.5 38 103-140 42-79 (465)
120 PF08367 M16C_assoc: Peptidase 21.9 1.9E+02 0.0041 21.9 4.4 32 99-130 13-44 (248)
121 cd07129 ALDH_KGSADH Alpha-Keto 21.9 2.4E+02 0.0053 23.5 5.4 39 101-139 7-45 (454)
122 PRK13473 gamma-aminobutyraldeh 21.8 2.6E+02 0.0056 23.4 5.6 38 103-140 49-86 (475)
123 cd07130 ALDH_F7_AASADH NAD+-de 21.8 2.6E+02 0.0056 23.5 5.6 38 103-140 44-81 (474)
124 cd07140 ALDH_F1L_FTFDH 10-form 21.8 2.6E+02 0.0057 23.6 5.6 53 89-141 33-93 (486)
125 cd07136 ALDH_YwdH-P39616 Bacil 21.8 2.5E+02 0.0054 23.5 5.4 43 100-142 5-47 (449)
126 COG2854 Ttg2D ABC-type transpo 21.6 63 0.0014 24.3 1.6 39 41-79 76-115 (202)
127 PF05823 Gp-FAR-1: Nematode fa 21.5 1.1E+02 0.0024 21.7 2.8 82 23-118 45-127 (154)
128 PRK12750 cpxP periplasmic repr 21.5 1.6E+02 0.0034 21.4 3.7 34 42-75 127-160 (170)
129 TIGR03092 SASP_sspI small, aci 21.5 1.1E+02 0.0023 18.7 2.3 19 45-63 41-59 (65)
130 PF00171 Aldedh: Aldehyde dehy 21.4 2.4E+02 0.0052 23.5 5.3 40 101-140 37-76 (462)
131 cd07138 ALDH_CddD_SSP0762 Rhod 21.3 2.5E+02 0.0054 23.5 5.4 38 103-140 46-83 (466)
132 COG2938 Uncharacterized conser 21.2 1.5E+02 0.0033 19.4 3.2 27 40-66 32-58 (94)
133 cd07110 ALDH_F10_BADH Arabidop 21.2 2.6E+02 0.0057 23.2 5.5 38 103-140 29-66 (456)
134 PLN02315 aldehyde dehydrogenas 21.0 2.7E+02 0.0058 23.8 5.5 52 89-140 46-103 (508)
135 cd07088 ALDH_LactADH-AldA Esch 21.0 2.6E+02 0.0056 23.3 5.4 51 89-139 25-81 (468)
136 PLN02278 succinic semialdehyde 21.0 2.6E+02 0.0056 23.7 5.4 38 103-140 72-109 (498)
137 TIGR02518 EutH_ACDH acetaldehy 20.9 2.7E+02 0.0059 23.6 5.5 40 100-139 15-54 (488)
138 PF08855 DUF1825: Domain of un 20.9 2.7E+02 0.006 18.7 5.0 37 104-140 26-62 (108)
139 cd07135 ALDH_F14-YMR110C Sacch 20.8 2.7E+02 0.0057 23.2 5.4 40 102-141 14-53 (436)
140 PRK11903 aldehyde dehydrogenas 20.8 2.3E+02 0.005 24.3 5.1 34 107-140 55-88 (521)
141 cd07097 ALDH_KGSADH-YcbD Bacil 20.7 2.8E+02 0.006 23.3 5.5 50 90-139 28-83 (473)
142 TIGR03216 OH_muco_semi_DH 2-hy 20.5 2.5E+02 0.0054 23.6 5.2 33 109-141 53-85 (481)
143 TIGR03374 ABALDH 1-pyrroline d 20.5 2.8E+02 0.006 23.4 5.5 39 103-141 48-86 (472)
144 cd07092 ALDH_ABALDH-YdcW Esche 20.4 3E+02 0.0065 22.7 5.7 39 103-141 29-67 (450)
145 TIGR01722 MMSDH methylmalonic 20.4 2.9E+02 0.0063 23.2 5.6 37 103-139 48-84 (477)
146 cd07111 ALDH_F16 Aldehyde dehy 20.4 2.8E+02 0.0061 23.4 5.5 50 91-140 51-106 (480)
147 KOG1610|consensus 20.3 1.9E+02 0.0041 23.4 4.2 33 38-70 218-250 (322)
148 TIGR03240 arg_catab_astD succi 20.2 2.9E+02 0.0062 23.3 5.5 39 103-141 45-83 (484)
149 PRK09406 gabD1 succinic semial 20.2 2.4E+02 0.0052 23.6 5.0 50 92-141 16-71 (457)
150 PLN02418 delta-1-pyrroline-5-c 20.2 2.8E+02 0.006 25.0 5.6 39 101-139 302-340 (718)
151 PLN02419 methylmalonate-semial 20.1 2.8E+02 0.006 24.5 5.5 38 103-140 161-198 (604)
152 cd07105 ALDH_SaliADH Salicylal 20.0 2.5E+02 0.0054 23.2 5.1 38 102-139 9-46 (432)
No 1
>COG5648 NHP6B Chromatin-associated proteins containing the HMG domain [Chromatin structure and dynamics]
Probab=99.93 E-value=4.8e-26 Score=166.21 Aligned_cols=132 Identities=27% Similarity=0.495 Sum_probs=121.7
Q ss_pred CCCCCCCCCCCcHHHHHHHHHHHHHHHhCCCCcchHHHHHHHHHHHhccCCHHHHHHHHHhhhhhhhhhccccccCCCCC
Q psy3264 3 KEKDVKPRGRMTAYAFFVQVCREEHKKKHPEENVVFAEFSKKCAERWKTMSEKEKKRFHEMADKDKKRYDTEMQSYTPPA 82 (156)
Q Consensus 3 k~~p~~PkrP~say~lF~~e~r~~~~~~~p~~~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~~~~~~ 82 (156)
+++||.||||+|||++|+.++|..+...+|++++ ++|++.+|+.|++||+++|++|...|..++.+|.+++..|+...
T Consensus 65 k~dpN~PKRp~sayf~y~~~~R~ei~~~~p~l~~--~e~~k~~~e~WK~Ltd~eke~y~k~~~~~~erYq~ek~~y~~k~ 142 (211)
T COG5648 65 KKDPNGPKRPLSAYFLYSAENRDEIRKENPKLTF--GEVGKLLSEKWKELTDEEKEPYYKEANSDRERYQREKEEYNKKL 142 (211)
T ss_pred hcCCCCCCCchhHHHHHHHHHHHHHHHhCCCCCh--HHHHHHHHHHHHhccHhhhhhHHHHHhhHHHHHHHHHHhhhccc
Confidence 5899999999999999999999999999999965 99999999999999999999999999999999999999997533
Q ss_pred cc-------------ccccccccCCCCCchhHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHH
Q psy3264 83 GE-------------KRGKIKQDNPEYGVGDIAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMT 136 (156)
Q Consensus 83 ~~-------------~r~~~~~~~p~~~~~e~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~ 136 (156)
.. .++.+.+.+|..+..+.++.++..|.+|+++-|.+|++.+.+++..|...++
T Consensus 143 ~~~~~~~~~~e~~~~~r~~~~~~~~~~~~~e~~k~~~~~w~el~~skK~~~~~~~Kk~k~~~~~~~~ 209 (211)
T COG5648 143 PNKAPIGPFIENEPKIRPKVEGPSPDKALVEETKIISKAWSELDESKKKKYIDKYKKLKEEYDSFYP 209 (211)
T ss_pred CCCCCCchhhhccHHhccccCCCCcchhhhHHhhhhhhhhhhhChhhhhHHHHHHHHHHHHHhhhcc
Confidence 22 2778888899999999999999999999999999999999999999987654
No 2
>PTZ00199 high mobility group protein; Provisional
Probab=99.89 E-value=3.1e-23 Score=137.02 Aligned_cols=77 Identities=32% Similarity=0.542 Sum_probs=73.0
Q ss_pred CCCCCCCCCCCcHHHHHHHHHHHHHHHhCCCCcchHHHHHHHHHHHhccCCHHHHHHHHHhhhhhhhhhccccccCC
Q psy3264 3 KEKDVKPRGRMTAYAFFVQVCREEHKKKHPEENVVFAEFSKKCAERWKTMSEKEKKRFHEMADKDKKRYDTEMQSYT 79 (156)
Q Consensus 3 k~~p~~PkrP~say~lF~~e~r~~~~~~~p~~~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~~~ 79 (156)
++||+.||||+|||+|||.++|..|..+||+++.++++|+++||++|++||+++|.+|.++|..++.+|..++.+|.
T Consensus 17 ~kdp~~PKrP~sAY~~F~~~~R~~i~~~~P~~~~~~~evsk~ige~Wk~ls~eeK~~y~~~A~~dk~rY~~e~~~Y~ 93 (94)
T PTZ00199 17 KKDPNAPKRALSAYMFFAKEKRAEIIAENPELAKDVAAVGKMVGEAWNKLSEEEKAPYEKKAQEDKVRYEKEKAEYA 93 (94)
T ss_pred CCCCCCCCCCCcHHHHHHHHHHHHHHHHCcCCcccHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 57999999999999999999999999999998755699999999999999999999999999999999999998884
No 3
>cd01389 MATA_HMG-box MATA_HMG-box, class I member of the HMG-box superfamily of DNA-binding proteins. These proteins contain a single HMG box, and bind the minor groove of DNA in a highly sequence-specific manner. Members include the fungal mating type gene products MC, MATA1 and Ste11.
Probab=99.82 E-value=4.9e-20 Score=117.51 Aligned_cols=71 Identities=23% Similarity=0.346 Sum_probs=67.8
Q ss_pred CCCCCCcHHHHHHHHHHHHHHHhCCCCcchHHHHHHHHHHHhccCCHHHHHHHHHhhhhhhhhhccccccCCC
Q psy3264 8 KPRGRMTAYAFFVQVCREEHKKKHPEENVVFAEFSKKCAERWKTMSEKEKKRFHEMADKDKKRYDTEMQSYTP 80 (156)
Q Consensus 8 ~PkrP~say~lF~~e~r~~~~~~~p~~~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~~~~ 80 (156)
+||||+|||+||+++.|..|+.+||+++. .+|+++||.+|+.||+++|++|.++|..++.+|..++..|.-
T Consensus 1 ~~kRP~naf~lf~~~~r~~~~~~~p~~~~--~eisk~~g~~Wk~ls~eeK~~y~~~A~~~k~~~~~~~p~Yky 71 (77)
T cd01389 1 KIPRPRNAFILYRQDKHAQLKTENPGLTN--NEISRIIGRMWRSESPEVKAYYKELAEEEKERHAREYPDYKY 71 (77)
T ss_pred CCCCCCcHHHHHHHHHHHHHHHHCCCCCH--HHHHHHHHHHHhhCCHHHHHHHHHHHHHHHHHHHHHCCCCcc
Confidence 48999999999999999999999999875 899999999999999999999999999999999999999864
No 4
>cd01388 SOX-TCF_HMG-box SOX-TCF_HMG-box, class I member of the HMG-box superfamily of DNA-binding proteins. These proteins contain a single HMG box, and bind the minor groove of DNA in a highly sequence-specific manner. Members include SRY and its homologs in insects and vertebrates, and transcription factor-like proteins, TCF-1, -3, -4, and LEF-1. They appear to bind the minor groove of the A/T C A A A G/C-motif.
Probab=99.80 E-value=1.8e-19 Score=113.47 Aligned_cols=69 Identities=26% Similarity=0.363 Sum_probs=66.0
Q ss_pred CCCCCcHHHHHHHHHHHHHHHhCCCCcchHHHHHHHHHHHhccCCHHHHHHHHHhhhhhhhhhccccccCC
Q psy3264 9 PRGRMTAYAFFVQVCREEHKKKHPEENVVFAEFSKKCAERWKTMSEKEKKRFHEMADKDKKRYDTEMQSYT 79 (156)
Q Consensus 9 PkrP~say~lF~~e~r~~~~~~~p~~~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~~~ 79 (156)
.|||+|||++||+++|..++.+||++++ .+|+++||.+|+.||+++|++|.++|..++.+|.+++..|.
T Consensus 2 iKrP~naf~~F~~~~r~~~~~~~p~~~~--~eisk~l~~~Wk~ls~~eK~~y~~~a~~~k~~y~~~~p~y~ 70 (72)
T cd01388 2 IKRPMNAFMLFSKRHRRKVLQEYPLKEN--RAISKILGDRWKALSNEEKQPYYEEAKKLKELHMKLYPDYK 70 (72)
T ss_pred CCCCCcHHHHHHHHHHHHHHHHCCCCCH--HHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHHHHCcCCC
Confidence 5899999999999999999999999876 89999999999999999999999999999999999998874
No 5
>cd01390 HMGB-UBF_HMG-box HMGB-UBF_HMG-box, class II and III members of the HMG-box superfamily of DNA-binding proteins. These proteins bind the minor groove of DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III members include nucleolar and mitochondrial transcription factors, UBF and mtTF1, which bind four-way DNA junctions.
Probab=99.80 E-value=1.6e-19 Score=111.50 Aligned_cols=65 Identities=45% Similarity=0.791 Sum_probs=62.4
Q ss_pred CCCCCcHHHHHHHHHHHHHHHhCCCCcchHHHHHHHHHHHhccCCHHHHHHHHHhhhhhhhhhcccc
Q psy3264 9 PRGRMTAYAFFVQVCREEHKKKHPEENVVFAEFSKKCAERWKTMSEKEKKRFHEMADKDKKRYDTEM 75 (156)
Q Consensus 9 PkrP~say~lF~~e~r~~~~~~~p~~~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~ 75 (156)
||+|+|||++|++++|..+...||+++. .+|++.||..|++||+++|++|.+.|..++.+|..+|
T Consensus 1 Pkrp~saf~~f~~~~r~~~~~~~p~~~~--~~i~~~~~~~W~~ls~~eK~~y~~~a~~~~~~y~~e~ 65 (66)
T cd01390 1 PKRPLSAYFLFSQEQRPKLKKENPDASV--TEVTKILGEKWKELSEEEKKKYEEKAEKDKERYEKEM 65 (66)
T ss_pred CCCCCcHHHHHHHHHHHHHHHHCcCCCH--HHHHHHHHHHHHhCCHHHHHHHHHHHHHHHHHHHHhh
Confidence 8999999999999999999999999865 8999999999999999999999999999999999876
No 6
>PF09011 HMG_box_2: HMG-box domain; InterPro: IPR015101 This domain is predominantly found in Maelstrom homologue proteins. It has no known function. ; GO: 0005634 nucleus; PDB: 2EQZ_A 1V64_A 2CTO_A 1H5P_A 3TQ6_A 3FGH_A 3TMM_A 1J3X_A 2YRQ_A 1AAB_A ....
Probab=99.80 E-value=1.1e-19 Score=114.84 Aligned_cols=71 Identities=38% Similarity=0.701 Sum_probs=61.7
Q ss_pred CCCCCCCCcHHHHHHHHHHHHHHHh-CCCCcchHHHHHHHHHHHhccCCHHHHHHHHHhhhhhhhhhccccccC
Q psy3264 6 DVKPRGRMTAYAFFVQVCREEHKKK-HPEENVVFAEFSKKCAERWKTMSEKEKKRFHEMADKDKKRYDTEMQSY 78 (156)
Q Consensus 6 p~~PkrP~say~lF~~e~r~~~~~~-~p~~~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~~ 78 (156)
|++||+|+|||+|||.+++..++.. ++... +.++++.||..|++||++||.+|.++|+.++.+|..++..|
T Consensus 1 p~kpK~~~say~lF~~~~~~~~k~~G~~~~~--~~e~~k~~~~~Wk~Ls~~EK~~Y~~~A~~~k~~y~~e~~~~ 72 (73)
T PF09011_consen 1 PKKPKRPPSAYNLFMKEMRKEVKEEGGQKQS--FREVMKEISERWKSLSEEEKEPYEERAKEDKERYEREMKEW 72 (73)
T ss_dssp SSS--SSSSHHHHHHHHHHHHHHHHT-T-SS--HHHHHHHHHHHHHHS-HHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred CcCCCCCCCHHHHHHHHHHHHHHHhcccCCC--HHHHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 7899999999999999999999988 55443 58999999999999999999999999999999999998876
No 7
>PF00505 HMG_box: HMG (high mobility group) box; InterPro: IPR000910 High mobility group (HMG or HMGB) proteins are a family of relatively low molecular weight non-histone components in chromatin. HMG1 (also called HMG-T in fish) and HMG2 are two highly related proteins that bind single-stranded DNA preferentially and unwind double-stranded DNA. Although they have no sequence specificity, they have a high affinity for bent or distorted DNA, and bend linear DNA. HMG1 and HMG2 contain two DNA-binding HMG-box domains (A and B) that show structural and functional differences, and have a long acidic C-terminal domain rich in aspartic and glutamic acid residues. The acidic tail modulates the affinity of the tandem HMG boxes in HMG1 and 2 for a variety of DNA targets. HMG1 and 2 appear to play important architectural roles in the assembly of nucleoprotein complexes in a variety of biological processes, for example V(D)J recombination, the initiation of transcription, and DNA repair []. The profile in this entry describing the HMG-domains is much more general than the signature. In addition to the HMG1 and HMG2 proteins, HMG-domains occur in single or multiple copies in the following protein classes; the SOX family of transcription factors; SRY sex determining region Y protein and related proteins []; LEF1 lymphoid enhancer binding factor 1 []; SSRP recombination signal recognition protein; MTF1 mitochondrial transcription factor 1; UBF1/2 nucleolar transcription factors; Abf2 yeast ARS-binding factor []; and Saccharomyces cerevisiae transcription factors Ixr1, Rox1, Nhp6a, Nhp6b and Spp41.; GO: 0003677 DNA binding; PDB: 1I11_A 1J3C_A 1J3D_A 1WZ6_A 1WGF_A 2D7L_A 1GT0_D 3U2B_C 2CRJ_A 2CS1_A ....
Probab=99.79 E-value=1.4e-19 Score=112.78 Aligned_cols=68 Identities=35% Similarity=0.635 Sum_probs=63.5
Q ss_pred CCCCCcHHHHHHHHHHHHHHHhCCCCcchHHHHHHHHHHHhccCCHHHHHHHHHhhhhhhhhhccccccC
Q psy3264 9 PRGRMTAYAFFVQVCREEHKKKHPEENVVFAEFSKKCAERWKTMSEKEKKRFHEMADKDKKRYDTEMQSY 78 (156)
Q Consensus 9 PkrP~say~lF~~e~r~~~~~~~p~~~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~~ 78 (156)
|+||+|||+|||.+++..++..||+++. .+|+++||.+|++||+++|.+|.+.|..++..|..++..|
T Consensus 1 PkrP~~af~lf~~~~~~~~k~~~p~~~~--~~i~~~~~~~W~~l~~~eK~~y~~~a~~~~~~y~~~~~~y 68 (69)
T PF00505_consen 1 PKRPPNAFMLFCKEKRAKLKEENPDLSN--KEISKILAQMWKNLSEEEKAPYKEEAEEEKERYEKEMPEY 68 (69)
T ss_dssp SSSS--HHHHHHHHHHHHHHHHSTTSTH--HHHHHHHHHHHHCSHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred CcCCCCHHHHHHHHHHHHHHHHhccccc--ccchhhHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 8999999999999999999999999985 9999999999999999999999999999999999988776
No 8
>smart00398 HMG high mobility group.
Probab=99.78 E-value=5.7e-19 Score=109.96 Aligned_cols=69 Identities=46% Similarity=0.745 Sum_probs=66.1
Q ss_pred CCCCCCcHHHHHHHHHHHHHHHhCCCCcchHHHHHHHHHHHhccCCHHHHHHHHHhhhhhhhhhccccccC
Q psy3264 8 KPRGRMTAYAFFVQVCREEHKKKHPEENVVFAEFSKKCAERWKTMSEKEKKRFHEMADKDKKRYDTEMQSY 78 (156)
Q Consensus 8 ~PkrP~say~lF~~e~r~~~~~~~p~~~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~~ 78 (156)
+|++|+|||++|++++|..+..+||+++. .+|++.||.+|+.||+++|+.|.+.|..++..|.+.+..|
T Consensus 1 ~pkrp~~~y~~f~~~~r~~~~~~~~~~~~--~~i~~~~~~~W~~l~~~ek~~y~~~a~~~~~~y~~~~~~y 69 (70)
T smart00398 1 KPKRPMSAFMLFSQENRAKIKAENPDLSN--AEISKKLGERWKLLSEEEKAPYEEKAKKDKERYEEEMPEY 69 (70)
T ss_pred CcCCCCcHHHHHHHHHHHHHHHHCcCCCH--HHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHHHHHHhc
Confidence 69999999999999999999999999875 8999999999999999999999999999999999998877
No 9
>cd00084 HMG-box High Mobility Group (HMG)-box is found in a variety of eukaryotic chromosomal proteins and transcription factors. HMGs bind to the minor groove of DNA and have been classified by DNA binding preferences. Two phylogenically distinct groups of Class I proteins bind DNA in a sequence specific fashion and contain a single HMG box. One group (SOX-TCF) includes transcription factors, TCF-1, -3, -4; and also SRY and LEF-1, which bind four-way DNA junctions and duplex DNA targets. The second group (MATA) includes fungal mating type gene products MC, MATA1 and Ste11. Class II and III proteins (HMGB-UBF) bind DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III member
Probab=99.73 E-value=1.5e-17 Score=102.35 Aligned_cols=65 Identities=46% Similarity=0.753 Sum_probs=61.5
Q ss_pred CCCCCcHHHHHHHHHHHHHHHhCCCCcchHHHHHHHHHHHhccCCHHHHHHHHHhhhhhhhhhcccc
Q psy3264 9 PRGRMTAYAFFVQVCREEHKKKHPEENVVFAEFSKKCAERWKTMSEKEKKRFHEMADKDKKRYDTEM 75 (156)
Q Consensus 9 PkrP~say~lF~~e~r~~~~~~~p~~~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~ 75 (156)
|+||+|||++|+++.+..++..+|+++. .+|++.||.+|+.||+++|..|.+.|...+..|.+.+
T Consensus 1 pkrp~~af~~f~~~~~~~~~~~~~~~~~--~~i~~~~~~~W~~l~~~~k~~y~~~a~~~~~~y~~~~ 65 (66)
T cd00084 1 PKRPLSAYFLFSQEHRAEVKAENPGLSV--GEISKILGEMWKSLSEEEKKKYEEKAEKDKERYEKEM 65 (66)
T ss_pred CCCCCcHHHHHHHHHHHHHHHHCcCCCH--HHHHHHHHHHHHhCCHHHHHHHHHHHHHHHHHHHHhh
Confidence 8999999999999999999999999765 8999999999999999999999999999999998764
No 10
>KOG0381|consensus
Probab=99.66 E-value=3.4e-16 Score=103.63 Aligned_cols=72 Identities=31% Similarity=0.567 Sum_probs=67.7
Q ss_pred CCCCCCCcHHHHHHHHHHHHHHHhCCCCcchHHHHHHHHHHHhccCCHHHHHHHHHhhhhhhhhhccccc-cCCC
Q psy3264 7 VKPRGRMTAYAFFVQVCREEHKKKHPEENVVFAEFSKKCAERWKTMSEKEKKRFHEMADKDKKRYDTEMQ-SYTP 80 (156)
Q Consensus 7 ~~PkrP~say~lF~~e~r~~~~~~~p~~~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~-~~~~ 80 (156)
+.|+||+|||++|+.+.+..++..||++++ .+|++++|.+|++|++++|.+|...|..++.+|..+|. .|..
T Consensus 21 ~~pkrp~sa~~~f~~~~~~~~k~~~p~~~~--~~v~k~~g~~W~~l~~~~k~~y~~ka~~~k~~Y~~~~~~~~~~ 93 (96)
T KOG0381|consen 21 QAPKRPLSAFFLFSSEQRSKIKAENPGLSV--GEVAKALGEMWKNLAEEEKQPYEEKASKLKEKYEKELAGEYKA 93 (96)
T ss_pred CCCCCCCcHHHHHHHHHHHHHHHhCCCCCH--HHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 489999999999999999999999999776 89999999999999999999999999999999999998 6643
No 11
>cd01388 SOX-TCF_HMG-box SOX-TCF_HMG-box, class I member of the HMG-box superfamily of DNA-binding proteins. These proteins contain a single HMG box, and bind the minor groove of DNA in a highly sequence-specific manner. Members include SRY and its homologs in insects and vertebrates, and transcription factor-like proteins, TCF-1, -3, -4, and LEF-1. They appear to bind the minor groove of the A/T C A A A G/C-motif.
Probab=99.54 E-value=1.7e-15 Score=95.28 Aligned_cols=69 Identities=26% Similarity=0.304 Sum_probs=64.9
Q ss_pred ccccccCCCCCccccccccccCCCCCchhHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy3264 72 DTEMQSYTPPAGEKRGKIKQDNPEYGVGDIAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKN 140 (156)
Q Consensus 72 ~~~~~~~~~~~~~~r~~~~~~~p~~~~~e~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~ 140 (156)
.+.+++|+.+..+.++.++.+||++++++|++.||+.|+.||+++|++|.++|+.++++|.++++.|+-
T Consensus 3 KrP~naf~~F~~~~r~~~~~~~p~~~~~eisk~l~~~Wk~ls~~eK~~y~~~a~~~k~~y~~~~p~y~y 71 (72)
T cd01388 3 KRPMNAFMLFSKRHRRKVLQEYPLKENRAISKILGDRWKALSNEEKQPYYEEAKKLKELHMKLYPDYKW 71 (72)
T ss_pred CCCCcHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHHHHCcCCCC
Confidence 467888888889999999999999999999999999999999999999999999999999999999863
No 12
>KOG0527|consensus
Probab=99.53 E-value=1.5e-14 Score=114.29 Aligned_cols=76 Identities=26% Similarity=0.407 Sum_probs=71.4
Q ss_pred CCCCCCCCCCCcHHHHHHHHHHHHHHHhCCCCcchHHHHHHHHHHHhccCCHHHHHHHHHhhhhhhhhhccccccCCC
Q psy3264 3 KEKDVKPRGRMTAYAFFVQVCREEHKKKHPEENVVFAEFSKKCAERWKTMSEKEKKRFHEMADKDKKRYDTEMQSYTP 80 (156)
Q Consensus 3 k~~p~~PkrP~say~lF~~e~r~~~~~~~p~~~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~~~~ 80 (156)
++...+.|||+||||+|.+..|.+|..++|++-+ .||+|.||.+|+.|+++||.+|++.|+..|..+.++...|.-
T Consensus 57 k~~~~hIKRPMNAFMVWSq~~RRkma~qnP~mHN--SEISK~LG~~WK~Lse~EKrPFi~EAeRLR~~HmkehPdYKY 132 (331)
T KOG0527|consen 57 KTSTDRIKRPMNAFMVWSQGQRRKLAKQNPKMHN--SEISKRLGAEWKLLSEEEKRPFVDEAERLRAQHMKEYPDYKY 132 (331)
T ss_pred CCCccccCCCcchhhhhhHHHHHHHHHhCcchhh--HHHHHHHHHHHhhcCHhhhccHHHHHHHHHHHHHHhCCCccc
Confidence 4566789999999999999999999999999976 999999999999999999999999999999999999999864
No 13
>KOG0526|consensus
Probab=99.52 E-value=7.8e-15 Score=119.84 Aligned_cols=71 Identities=34% Similarity=0.637 Sum_probs=67.5
Q ss_pred CCCCCCCCCCCcHHHHHHHHHHHHHHHhCCCCcchHHHHHHHHHHHhccCCHHHHHHHHHhhhhhhhhhccccccCC
Q psy3264 3 KEKDVKPRGRMTAYAFFVQVCREEHKKKHPEENVVFAEFSKKCAERWKTMSEKEKKRFHEMADKDKKRYDTEMQSYT 79 (156)
Q Consensus 3 k~~p~~PkrP~say~lF~~e~r~~~~~~~p~~~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~~~ 79 (156)
+|+||.||||+||||+|++..|..|+.. ++++ ++|.+.+|++|++||. |++|.+.|+.++.+|+.+|..|.
T Consensus 530 ~kdpnapkra~sa~m~w~~~~r~~ik~d--gi~~--~dv~kk~g~~wk~ms~--k~~we~ka~~dk~ry~~em~~yk 600 (615)
T KOG0526|consen 530 KKDPNAPKRATSAYMLWLNASRESIKED--GISV--GDVAKKAGEKWKQMSA--KEEWEDKAAVDKQRYEDEMKEYK 600 (615)
T ss_pred CCCCCCCccchhHHHHHHHhhhhhHhhc--CchH--HHHHHHHhHHHhhhcc--cchhhHHHHHHHHHHHHHHHhhc
Confidence 6899999999999999999999999987 6655 8999999999999999 99999999999999999999996
No 14
>PTZ00199 high mobility group protein; Provisional
Probab=99.50 E-value=1.3e-14 Score=95.87 Aligned_cols=73 Identities=36% Similarity=0.590 Sum_probs=65.2
Q ss_pred hhhhhccccccCCCCCccccccccccCCCCC--chhHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3264 67 DKKRYDTEMQSYTPPAGEKRGKIKQDNPEYG--VGDIAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYK 139 (156)
Q Consensus 67 ~k~~y~~~~~~~~~~~~~~r~~~~~~~p~~~--~~e~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~ 139 (156)
+-..-.+.+.+|+.|..+.|..+..+||+++ +.+|+++||+.|+.||+++|.+|.++|+.++.+|..+|..|+
T Consensus 19 dp~~PKrP~sAY~~F~~~~R~~i~~~~P~~~~~~~evsk~ige~Wk~ls~eeK~~y~~~A~~dk~rY~~e~~~Y~ 93 (94)
T PTZ00199 19 DPNAPKRALSAYMFFAKEKRAEIIAENPELAKDVAAVGKMVGEAWNKLSEEEKAPYEKKAQEDKVRYEKEKAEYA 93 (94)
T ss_pred CCCCCCCCCcHHHHHHHHHHHHHHHHCcCCcccHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 3344556778888888888999999999986 799999999999999999999999999999999999999995
No 15
>cd01389 MATA_HMG-box MATA_HMG-box, class I member of the HMG-box superfamily of DNA-binding proteins. These proteins contain a single HMG box, and bind the minor groove of DNA in a highly sequence-specific manner. Members include the fungal mating type gene products MC, MATA1 and Ste11.
Probab=99.48 E-value=8.4e-15 Score=93.26 Aligned_cols=71 Identities=25% Similarity=0.517 Sum_probs=65.9
Q ss_pred ccccccCCCCCccccccccccCCCCCchhHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHhcC
Q psy3264 72 DTEMQSYTPPAGEKRGKIKQDNPEYGVGDIAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKNRS 142 (156)
Q Consensus 72 ~~~~~~~~~~~~~~r~~~~~~~p~~~~~e~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~~ 142 (156)
.+.+++|+.+....+..++.+||++++++|+++||..|+.||+++|++|.++|+.++++|..+++.|+..-
T Consensus 3 kRP~naf~lf~~~~r~~~~~~~p~~~~~eisk~~g~~Wk~ls~eeK~~y~~~A~~~k~~~~~~~p~Yky~p 73 (77)
T cd01389 3 PRPRNAFILYRQDKHAQLKTENPGLTNNEISRIIGRMWRSESPEVKAYYKELAEEEKERHAREYPDYKYTP 73 (77)
T ss_pred CCCCcHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHhhCCHHHHHHHHHHHHHHHHHHHHHCCCCcccC
Confidence 45678888888888999999999999999999999999999999999999999999999999999998753
No 16
>PF00505 HMG_box: HMG (high mobility group) box; InterPro: IPR000910 High mobility group (HMG or HMGB) proteins are a family of relatively low molecular weight non-histone components in chromatin. HMG1 (also called HMG-T in fish) and HMG2 are two highly related proteins that bind single-stranded DNA preferentially and unwind double-stranded DNA. Although they have no sequence specificity, they have a high affinity for bent or distorted DNA, and bend linear DNA. HMG1 and HMG2 contain two DNA-binding HMG-box domains (A and B) that show structural and functional differences, and have a long acidic C-terminal domain rich in aspartic and glutamic acid residues. The acidic tail modulates the affinity of the tandem HMG boxes in HMG1 and 2 for a variety of DNA targets. HMG1 and 2 appear to play important architectural roles in the assembly of nucleoprotein complexes in a variety of biological processes, for example V(D)J recombination, the initiation of transcription, and DNA repair []. The profile in this entry describing the HMG-domains is much more general than the signature. In addition to the HMG1 and HMG2 proteins, HMG-domains occur in single or multiple copies in the following protein classes; the SOX family of transcription factors; SRY sex determining region Y protein and related proteins []; LEF1 lymphoid enhancer binding factor 1 []; SSRP recombination signal recognition protein; MTF1 mitochondrial transcription factor 1; UBF1/2 nucleolar transcription factors; Abf2 yeast ARS-binding factor []; and Saccharomyces cerevisiae transcription factors Ixr1, Rox1, Nhp6a, Nhp6b and Spp41.; GO: 0003677 DNA binding; PDB: 1I11_A 1J3C_A 1J3D_A 1WZ6_A 1WGF_A 2D7L_A 1GT0_D 3U2B_C 2CRJ_A 2CS1_A ....
Probab=99.42 E-value=7e-14 Score=86.84 Aligned_cols=67 Identities=36% Similarity=0.625 Sum_probs=59.7
Q ss_pred cccccCCCCCccccccccccCCCCCchhHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3264 73 TEMQSYTPPAGEKRGKIKQDNPEYGVGDIAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYK 139 (156)
Q Consensus 73 ~~~~~~~~~~~~~r~~~~~~~p~~~~~e~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~ 139 (156)
+.++.|..+....++.++..||+++..+|++.||..|++||+++|++|.+.|+.++.+|..+++.|+
T Consensus 3 rP~~af~lf~~~~~~~~k~~~p~~~~~~i~~~~~~~W~~l~~~eK~~y~~~a~~~~~~y~~~~~~y~ 69 (69)
T PF00505_consen 3 RPPNAFMLFCKEKRAKLKEENPDLSNKEISKILAQMWKNLSEEEKAPYKEEAEEEKERYEKEMPEYK 69 (69)
T ss_dssp SS--HHHHHHHHHHHHHHHHSTTSTHHHHHHHHHHHHHCSHHHHHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred CCCCHHHHHHHHHHHHHHHHhcccccccchhhHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHhcC
Confidence 3455666666777888999999999999999999999999999999999999999999999999995
No 17
>cd01390 HMGB-UBF_HMG-box HMGB-UBF_HMG-box, class II and III members of the HMG-box superfamily of DNA-binding proteins. These proteins bind the minor groove of DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III members include nucleolar and mitochondrial transcription factors, UBF and mtTF1, which bind four-way DNA junctions.
Probab=99.37 E-value=2.8e-13 Score=83.34 Aligned_cols=63 Identities=44% Similarity=0.742 Sum_probs=58.1
Q ss_pred cccccCCCCCccccccccccCCCCCchhHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHH
Q psy3264 73 TEMQSYTPPAGEKRGKIKQDNPEYGVGDIAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEM 135 (156)
Q Consensus 73 ~~~~~~~~~~~~~r~~~~~~~p~~~~~e~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~ 135 (156)
+.+..|..+..+.+..+...||++++.+|++.||..|+.||+++|.+|.+.|+.++.+|..+|
T Consensus 3 rp~saf~~f~~~~r~~~~~~~p~~~~~~i~~~~~~~W~~ls~~eK~~y~~~a~~~~~~y~~e~ 65 (66)
T cd01390 3 RPLSAYFLFSQEQRPKLKKENPDASVTEVTKILGEKWKELSEEEKKKYEEKAEKDKERYEKEM 65 (66)
T ss_pred CCCcHHHHHHHHHHHHHHHHCcCCCHHHHHHHHHHHHHhCCHHHHHHHHHHHHHHHHHHHHhh
Confidence 456677777778899999999999999999999999999999999999999999999999987
No 18
>KOG0527|consensus
Probab=99.35 E-value=2.3e-13 Score=107.55 Aligned_cols=77 Identities=26% Similarity=0.407 Sum_probs=72.4
Q ss_pred hhhhhccccccCCCCCccccccccccCCCCCchhHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHhcCC
Q psy3264 67 DKKRYDTEMQSYTPPAGEKRGKIKQDNPEYGVGDIAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKNRSV 143 (156)
Q Consensus 67 ~k~~y~~~~~~~~~~~~~~r~~~~~~~p~~~~~e~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~~~ 143 (156)
...+.++.|++|..+....|.+|..+||.+.+.||+|.||..||.|+++||.||+++|++++..|.+|+.+|+.+-.
T Consensus 59 ~~~hIKRPMNAFMVWSq~~RRkma~qnP~mHNSEISK~LG~~WK~Lse~EKrPFi~EAeRLR~~HmkehPdYKYRPR 135 (331)
T KOG0527|consen 59 STDRIKRPMNAFMVWSQGQRRKLAKQNPKMHNSEISKRLGAEWKLLSEEEKRPFVDEAERLRAQHMKEYPDYKYRPR 135 (331)
T ss_pred CccccCCCcchhhhhhHHHHHHHHHhCcchhhHHHHHHHHHHHhhcCHhhhccHHHHHHHHHHHHHHhCCCcccccc
Confidence 45688899999999999999999999999999999999999999999999999999999999999999999987644
No 19
>smart00398 HMG high mobility group.
Probab=99.34 E-value=5.7e-13 Score=82.62 Aligned_cols=67 Identities=40% Similarity=0.703 Sum_probs=60.7
Q ss_pred cccccCCCCCccccccccccCCCCCchhHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3264 73 TEMQSYTPPAGEKRGKIKQDNPEYGVGDIAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYK 139 (156)
Q Consensus 73 ~~~~~~~~~~~~~r~~~~~~~p~~~~~e~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~ 139 (156)
+.+..|+.+....++.+...||++++++|++.||..|+.||+++|.+|.++|+.++++|..++..|+
T Consensus 4 rp~~~y~~f~~~~r~~~~~~~~~~~~~~i~~~~~~~W~~l~~~ek~~y~~~a~~~~~~y~~~~~~y~ 70 (70)
T smart00398 4 RPMSAFMLFSQENRAKIKAENPDLSNAEISKKLGERWKLLSEEEKAPYEEKAKKDKERYEEEMPEYK 70 (70)
T ss_pred CCCcHHHHHHHHHHHHHHHHCcCCCHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHHHHHHhcC
Confidence 4556666777777888999999999999999999999999999999999999999999999999884
No 20
>PF09011 HMG_box_2: HMG-box domain; InterPro: IPR015101 This domain is predominantly found in Maelstrom homologue proteins. It has no known function. ; GO: 0005634 nucleus; PDB: 2EQZ_A 1V64_A 2CTO_A 1H5P_A 3TQ6_A 3FGH_A 3TMM_A 1J3X_A 2YRQ_A 1AAB_A ....
Probab=99.27 E-value=1.8e-12 Score=81.63 Aligned_cols=66 Identities=27% Similarity=0.543 Sum_probs=53.2
Q ss_pred ccccCCCCCcccccccccc-CCCCCchhHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3264 74 EMQSYTPPAGEKRGKIKQD-NPEYGVGDIAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYK 139 (156)
Q Consensus 74 ~~~~~~~~~~~~r~~~~~~-~p~~~~~e~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~ 139 (156)
...+|..+....+..++.. .+..+..++++.|+..|++||++||.+|.+.|+.++.+|+.+|..|.
T Consensus 7 ~~say~lF~~~~~~~~k~~G~~~~~~~e~~k~~~~~Wk~Ls~~EK~~Y~~~A~~~k~~y~~e~~~~~ 73 (73)
T PF09011_consen 7 PPSAYNLFMKEMRKEVKEEGGQKQSFREVMKEISERWKSLSEEEKEPYEERAKEDKERYEREMKEWN 73 (73)
T ss_dssp SSSHHHHHHHHHHHHHHHHT-T-SSHHHHHHHHHHHHHHS-HHHHHHHHHHHHHHHHHHHHHHHHH-
T ss_pred CCCHHHHHHHHHHHHHHHhcccCCCHHHHHHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHhcC
Confidence 3445555555666677666 66788999999999999999999999999999999999999999984
No 21
>KOG0381|consensus
Probab=99.22 E-value=9.2e-12 Score=82.25 Aligned_cols=73 Identities=36% Similarity=0.602 Sum_probs=65.1
Q ss_pred hhccccccCCCCCccccccccccCCCCCchhHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHH-HHHhcC
Q psy3264 70 RYDTEMQSYTPPAGEKRGKIKQDNPEYGVGDIAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMT-AYKNRS 142 (156)
Q Consensus 70 ~y~~~~~~~~~~~~~~r~~~~~~~p~~~~~e~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~-~~~~~~ 142 (156)
...+.+.+|..++...+..++..||++++.++++.+|++|++|++++|.+|...|..++++|..+|. .|+...
T Consensus 22 ~pkrp~sa~~~f~~~~~~~~k~~~p~~~~~~v~k~~g~~W~~l~~~~k~~y~~ka~~~k~~Y~~~~~~~~~~~~ 95 (96)
T KOG0381|consen 22 APKRPLSAFFLFSSEQRSKIKAENPGLSVGEVAKALGEMWKNLAEEEKQPYEEKASKLKEKYEKELAGEYKASL 95 (96)
T ss_pred CCCCCCcHHHHHHHHHHHHHHHhCCCCCHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHhhcc
Confidence 4555666777777777899999999999999999999999999999999999999999999999999 887654
No 22
>cd00084 HMG-box High Mobility Group (HMG)-box is found in a variety of eukaryotic chromosomal proteins and transcription factors. HMGs bind to the minor groove of DNA and have been classified by DNA binding preferences. Two phylogenically distinct groups of Class I proteins bind DNA in a sequence specific fashion and contain a single HMG box. One group (SOX-TCF) includes transcription factors, TCF-1, -3, -4; and also SRY and LEF-1, which bind four-way DNA junctions and duplex DNA targets. The second group (MATA) includes fungal mating type gene products MC, MATA1 and Ste11. Class II and III proteins (HMGB-UBF) bind DNA in a non-sequence specific fashion and contain two or more tandem HMG boxes. Class II members include non-histone chromosomal proteins, HMG1 and HMG2, which bind to bent or distorted DNA such as four-way DNA junctions, synthetic DNA cruciforms, kinked cisplatin-modified DNA, DNA bulges, cross-overs in supercoiled DNA, and can cause looping of linear DNA. Class III member
Probab=99.16 E-value=1.7e-11 Score=75.08 Aligned_cols=63 Identities=44% Similarity=0.735 Sum_probs=57.2
Q ss_pred cccccCCCCCccccccccccCCCCCchhHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHH
Q psy3264 73 TEMQSYTPPAGEKRGKIKQDNPEYGVGDIAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEM 135 (156)
Q Consensus 73 ~~~~~~~~~~~~~r~~~~~~~p~~~~~e~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~ 135 (156)
+.+..|+.+..+.+..+...+|+++..+|++.||..|+.||+++|.+|.++|+.++.+|..++
T Consensus 3 rp~~af~~f~~~~~~~~~~~~~~~~~~~i~~~~~~~W~~l~~~~k~~y~~~a~~~~~~y~~~~ 65 (66)
T cd00084 3 RPLSAYFLFSQEHRAEVKAENPGLSVGEISKILGEMWKSLSEEEKKKYEEKAEKDKERYEKEM 65 (66)
T ss_pred CCCcHHHHHHHHHHHHHHHHCcCCCHHHHHHHHHHHHHhCCHHHHHHHHHHHHHHHHHHHHhh
Confidence 456667777777888899999999999999999999999999999999999999999999875
No 23
>KOG3248|consensus
Probab=99.13 E-value=9.3e-11 Score=91.51 Aligned_cols=70 Identities=14% Similarity=0.313 Sum_probs=62.5
Q ss_pred CCCCCCcHHHHHHHHHHHHHHHhCCCCcchHHHHHHHHHHHhccCCHHHHHHHHHhhhhhhhhhccccccCC
Q psy3264 8 KPRGRMTAYAFFVQVCREEHKKKHPEENVVFAEFSKKCAERWKTMSEKEKKRFHEMADKDKKRYDTEMQSYT 79 (156)
Q Consensus 8 ~PkrP~say~lF~~e~r~~~~~~~p~~~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~~~ 79 (156)
..|+|+|||+||++++|+.|.+++. ++. -.+|-++||..|+.||-+|..+|.++|+++++.+...+..|.
T Consensus 191 hiKKPLNAFmlyMKEmRa~vvaEct-lKe-SAaiNqiLGrRWH~LSrEEQAKYyElArKerqlH~qlYP~WS 260 (421)
T KOG3248|consen 191 HIKKPLNAFMLYMKEMRAKVVAECT-LKE-SAAINQILGRRWHALSREEQAKYYELARKERQLHMQLYPGWS 260 (421)
T ss_pred cccccHHHHHHHHHHHHHHHHHHhh-hhh-HHHHHHHHhHHHhhhhHHHHHHHHHHHHHHHHHHHHhcCCcc
Confidence 3599999999999999999999986 333 289999999999999999999999999999998888877774
No 24
>COG5648 NHP6B Chromatin-associated proteins containing the HMG domain [Chromatin structure and dynamics]
Probab=99.11 E-value=4.1e-11 Score=88.12 Aligned_cols=83 Identities=24% Similarity=0.472 Sum_probs=74.6
Q ss_pred hhhhhccccccCCCCCccccccccccCCCCCchhHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCC
Q psy3264 67 DKKRYDTEMQSYTPPAGEKRGKIKQDNPEYGVGDIAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKNRSVAPV 146 (156)
Q Consensus 67 ~k~~y~~~~~~~~~~~~~~r~~~~~~~p~~~~~e~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~~~~~~ 146 (156)
+...-++.+.+|..++...|..+...+|.++++++++.+|+.|++|+++||.+|...|..++++|+.++..|..+.+...
T Consensus 67 dpN~PKRp~sayf~y~~~~R~ei~~~~p~l~~~e~~k~~~e~WK~Ltd~eke~y~k~~~~~~erYq~ek~~y~~k~~~~~ 146 (211)
T COG5648 67 DPNGPKRPLSAYFLYSAENRDEIRKENPKLTFGEVGKLLSEKWKELTDEEKEPYYKEANSDRERYQREKEEYNKKLPNKA 146 (211)
T ss_pred CCCCCCCchhHHHHHHHHHHHHHHHhCCCCChHHHHHHHHHHHHhccHhhhhhHHHHHhhHHHHHHHHHHhhhcccCCCC
Confidence 55666778888888888889999999999999999999999999999999999999999999999999999999887765
Q ss_pred CCc
Q psy3264 147 NEE 149 (156)
Q Consensus 147 ~~~ 149 (156)
-.+
T Consensus 147 ~~~ 149 (211)
T COG5648 147 PIG 149 (211)
T ss_pred CCc
Confidence 433
No 25
>KOG4715|consensus
Probab=99.10 E-value=9.2e-11 Score=90.83 Aligned_cols=74 Identities=19% Similarity=0.398 Sum_probs=69.9
Q ss_pred CCCCCCCCCCcHHHHHHHHHHHHHHHhCCCCcchHHHHHHHHHHHhccCCHHHHHHHHHhhhhhhhhhccccccCC
Q psy3264 4 EKDVKPRGRMTAYAFFVQVCREEHKKKHPEENVVFAEFSKKCAERWKTMSEKEKKRFHEMADKDKKRYDTEMQSYT 79 (156)
Q Consensus 4 ~~p~~PkrP~say~lF~~e~r~~~~~~~p~~~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~~~ 79 (156)
+.|.+|-+|+-+|+.|.+..+..|++.||++.+ =+|.++||.+|..|+++||+.|.+.+...+..|.+.|.+|.
T Consensus 60 kpPkppekpl~pymrySrkvWd~VkA~nPe~kL--WeiGK~Ig~mW~dLpd~EK~ey~~EYeaEKieY~~smkayh 133 (410)
T KOG4715|consen 60 KPPKPPEKPLMPYMRYSRKVWDQVKASNPELKL--WEIGKIIGGMWLDLPDEEKQEYLNEYEAEKIEYNESMKAYH 133 (410)
T ss_pred CCCCCCCcccchhhHHhhhhhhhhhccCcchHH--HHHHHHHHHHHhhCcchHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 467888899999999999999999999999976 89999999999999999999999999999999999999884
No 26
>PF14887 HMG_box_5: HMG (high mobility group) box 5; PDB: 1L8Y_A 1L8Z_A 2HDZ_A.
Probab=98.83 E-value=1.3e-08 Score=63.36 Aligned_cols=73 Identities=23% Similarity=0.360 Sum_probs=60.5
Q ss_pred CCCCCCcHHHHHHHHHHHHHHHhCCCCcchHHHHHHHHHHHhccCCHHHHHHHHHhhhhhhhhhccccccCCCCCc
Q psy3264 8 KPRGRMTAYAFFVQVCREEHKKKHPEENVVFAEFSKKCAERWKTMSEKEKKRFHEMADKDKKRYDTEMQSYTPPAG 83 (156)
Q Consensus 8 ~PkrP~say~lF~~e~r~~~~~~~p~~~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~~~~~~~ 83 (156)
.|..|.+|-.||.+.....+++.+++.+. -..+.+...|++|++.+|-.|+..|.+++.+|+++|.+|..+..
T Consensus 3 lPE~PKt~qe~Wqq~vi~dYla~~~~dr~---K~~kam~~~W~~me~Kekl~WIkKA~EdqKrYE~el~e~r~~~~ 75 (85)
T PF14887_consen 3 LPETPKTAQEIWQQSVIGDYLAKFRNDRK---KALKAMEAQWSQMEKKEKLKWIKKAAEDQKRYERELREMRSAPA 75 (85)
T ss_dssp -S----THHHHHHHHHHHHHHHHTTSTHH---HHHHHHHHHHHTTGGGHHHHHHHHHHHHHHHHHHHHHCCS-CCC
T ss_pred CCCCCCCHHHHHHHHHHHHHHHHhhHhHH---HHHHHHHHHHHHhhhhhhhHHHHHHHHHHHHHHHHHHHHhcCCC
Confidence 47778899999999999999999998764 44679999999999999999999999999999999999975443
No 27
>KOG0526|consensus
Probab=98.79 E-value=2.5e-09 Score=88.00 Aligned_cols=64 Identities=39% Similarity=0.645 Sum_probs=55.7
Q ss_pred ccccccCCCCCccccccccccCCCCCchhHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3264 72 DTEMQSYTPPAGEKRGKIKQDNPEYGVGDIAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYK 139 (156)
Q Consensus 72 ~~~~~~~~~~~~~~r~~~~~~~p~~~~~e~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~ 139 (156)
++++.+|..+....|..++.+ |+++++++|.+|++|++||. |.+|.+.|+.++++|+.||.+|+
T Consensus 537 kra~sa~m~w~~~~r~~ik~d--gi~~~dv~kk~g~~wk~ms~--k~~we~ka~~dk~ry~~em~~yk 600 (615)
T KOG0526|consen 537 KRATSAYMLWLNASRESIKED--GISVGDVAKKAGEKWKQMSA--KEEWEDKAAVDKQRYEDEMKEYK 600 (615)
T ss_pred ccchhHHHHHHHhhhhhHhhc--CchHHHHHHHHhHHHhhhcc--cchhhHHHHHHHHHHHHHHHhhc
Confidence 345555555555558888887 99999999999999999999 99999999999999999999999
No 28
>KOG0528|consensus
Probab=98.61 E-value=1.6e-08 Score=82.42 Aligned_cols=71 Identities=23% Similarity=0.314 Sum_probs=61.0
Q ss_pred CCCCCCcHHHHHHHHHHHHHHHhCCCCcchHHHHHHHHHHHhccCCHHHHHHHHHhhhhhhhhhccccccCCC
Q psy3264 8 KPRGRMTAYAFFVQVCREEHKKKHPEENVVFAEFSKKCAERWKTMSEKEKKRFHEMADKDKKRYDTEMQSYTP 80 (156)
Q Consensus 8 ~PkrP~say~lF~~e~r~~~~~~~p~~~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~~~~ 80 (156)
.-|||+||||+|.++.|..|...+|++-. ..|+++||..|+.||-.||++|.+.-...-..+-+....|..
T Consensus 325 HIKRPMNAFMVWAkDERRKILqA~PDMHN--SnISKILGSRWKaMSN~eKQPYYEEQaRLSk~HlEk~PdYrY 395 (511)
T KOG0528|consen 325 HIKRPMNAFMVWAKDERRKILQAFPDMHN--SNISKILGSRWKAMSNTEKQPYYEEQARLSKLHLEKYPDYRY 395 (511)
T ss_pred cccCCcchhhcccchhhhhhhhcCccccc--cchhHHhcccccccccccccchHHHHHHHHHhhhccCccccc
Confidence 46999999999999999999999999986 899999999999999999999987655555566666666654
No 29
>KOG0528|consensus
Probab=98.34 E-value=1.3e-07 Score=77.22 Aligned_cols=78 Identities=19% Similarity=0.287 Sum_probs=69.2
Q ss_pred hhhccccccCCCCCccccccccccCCCCCchhHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCC
Q psy3264 69 KRYDTEMQSYTPPAGEKRGKIKQDNPEYGVGDIAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKNRSVAPV 146 (156)
Q Consensus 69 ~~y~~~~~~~~~~~~~~r~~~~~~~p~~~~~e~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~~~~~~ 146 (156)
-..++.|++|..+....|..+.+.+|++....|+|+||..|+.||..||+||.++-+++-..|....+.|+.+-.++.
T Consensus 324 PHIKRPMNAFMVWAkDERRKILqA~PDMHNSnISKILGSRWKaMSN~eKQPYYEEQaRLSk~HlEk~PdYrYkPRPKR 401 (511)
T KOG0528|consen 324 PHIKRPMNAFMVWAKDERRKILQAFPDMHNSNISKILGSRWKAMSNTEKQPYYEEQARLSKLHLEKYPDYRYKPRPKR 401 (511)
T ss_pred ccccCCcchhhcccchhhhhhhhcCccccccchhHHhcccccccccccccchHHHHHHHHHhhhccCcccccCCCCCc
Confidence 355678888887777667778888999999999999999999999999999999999999999999999999877653
No 30
>KOG4715|consensus
Probab=98.29 E-value=8.6e-07 Score=69.10 Aligned_cols=56 Identities=32% Similarity=0.544 Sum_probs=52.9
Q ss_pred ccccccccCCCCCchhHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy3264 85 KRGKIKQDNPEYGVGDIAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKN 140 (156)
Q Consensus 85 ~r~~~~~~~p~~~~~e~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~ 140 (156)
..+.++..||++...+|.|+||.+|..|+++||+.|+..++..+..|.+.|..|..
T Consensus 79 vWd~VkA~nPe~kLWeiGK~Ig~mW~dLpd~EK~ey~~EYeaEKieY~~smkayh~ 134 (410)
T KOG4715|consen 79 VWDQVKASNPELKLWEIGKIIGGMWLDLPDEEKQEYLNEYEAEKIEYNESMKAYHN 134 (410)
T ss_pred hhhhhhccCcchHHHHHHHHHHHHHhhCcchHHHHHHHHHHHHHHHHHHHHHHhhC
Confidence 36779999999999999999999999999999999999999999999999999853
No 31
>KOG2746|consensus
Probab=98.29 E-value=5.4e-07 Score=76.22 Aligned_cols=66 Identities=26% Similarity=0.415 Sum_probs=60.2
Q ss_pred CCCCCCCcHHHHHHHHHH--HHHHHhCCCCcchHHHHHHHHHHHhccCCHHHHHHHHHhhhhhhhhhccc
Q psy3264 7 VKPRGRMTAYAFFVQVCR--EEHKKKHPEENVVFAEFSKKCAERWKTMSEKEKKRFHEMADKDKKRYDTE 74 (156)
Q Consensus 7 ~~PkrP~say~lF~~e~r--~~~~~~~p~~~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~ 74 (156)
..-+||+|+|++|++.+| ..+...||+..+ +-|+++||+.|-.|.+.||+.|.++|.+.+..+-++
T Consensus 180 ~HirrPMnaf~ifskrhr~~g~vhq~~pn~DN--rtIskiLgewWytL~~~Ekq~yhdLa~Qvk~Ahfka 247 (683)
T KOG2746|consen 180 DHIRRPMNAFHIFSKRHRGEGRVHQRHPNQDN--RTISKILGEWWYTLGPNEKQKYHDLAFQVKEAHFKA 247 (683)
T ss_pred hhhhhhhHHHHHHHhhcCCccchhccCccccc--hhHHHHHhhhHhhhCchhhhhHHHHHHHHHHHHhhh
Confidence 346899999999999999 899999999876 899999999999999999999999999988776654
No 32
>PF14887 HMG_box_5: HMG (high mobility group) box 5; PDB: 1L8Y_A 1L8Z_A 2HDZ_A.
Probab=98.13 E-value=4.9e-06 Score=51.92 Aligned_cols=54 Identities=19% Similarity=0.358 Sum_probs=44.2
Q ss_pred ccCCCCCchhHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCC
Q psy3264 91 QDNPEYGVGDIAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKNRSVAP 145 (156)
Q Consensus 91 ~~~p~~~~~e~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~~~~~ 145 (156)
..++.....+ .+.+...|++|++.+|-+|+..|.++..+|+.+|-+|+.-....
T Consensus 24 a~~~~dr~K~-~kam~~~W~~me~Kekl~WIkKA~EdqKrYE~el~e~r~~~~~~ 77 (85)
T PF14887_consen 24 AKFRNDRKKA-LKAMEAQWSQMEKKEKLKWIKKAAEDQKRYERELREMRSAPADA 77 (85)
T ss_dssp HHTTSTHHHH-HHHHHHHHHTTGGGHHHHHHHHHHHHHHHHHHHHHCCS-CCCTT
T ss_pred HHhhHhHHHH-HHHHHHHHHHhhhhhhhHHHHHHHHHHHHHHHHHHHHhcCCCCC
Confidence 3455554444 45899999999999999999999999999999999999876654
No 33
>PF04690 YABBY: YABBY protein; InterPro: IPR006780 YABBY proteins are a group of plant-specific transcription factors involved in the specification of abaxial polarity in lateral organs such as leaves and floral organs [, ].
Probab=97.94 E-value=2.2e-05 Score=56.75 Aligned_cols=48 Identities=19% Similarity=0.257 Sum_probs=42.3
Q ss_pred CCCCCCCCCCcHHHHHHHHHHHHHHHhCCCCcchHHHHHHHHHHHhccCC
Q psy3264 4 EKDVKPRGRMTAYAFFVQVCREEHKKKHPEENVVFAEFSKKCAERWKTMS 53 (156)
Q Consensus 4 ~~p~~PkrP~say~lF~~e~r~~~~~~~p~~~~~~~ei~k~l~~~Wk~Ls 53 (156)
|-|++-.|-+|||..|+++....|+..+|+++. .|+++.++..|...+
T Consensus 117 kPPEKRqR~psaYn~f~k~ei~rik~~~p~ish--keaFs~aAknW~h~p 164 (170)
T PF04690_consen 117 KPPEKRQRVPSAYNRFMKEEIQRIKAENPDISH--KEAFSAAAKNWAHFP 164 (170)
T ss_pred CCccccCCCchhHHHHHHHHHHHHHhcCCCCCH--HHHHHHHHHhhhhCc
Confidence 445555677899999999999999999999998 899999999998764
No 34
>KOG3248|consensus
Probab=97.89 E-value=1.7e-06 Score=68.07 Aligned_cols=79 Identities=16% Similarity=0.274 Sum_probs=68.3
Q ss_pred hhhhhccccccCCCCCccccccccccCCCCCchhHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCC
Q psy3264 67 DKKRYDTEMQSYTPPAGEKRGKIKQDNPEYGVGDIAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKNRSVAP 145 (156)
Q Consensus 67 ~k~~y~~~~~~~~~~~~~~r~~~~~~~p~~~~~e~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~~~~~ 145 (156)
.+...++.+++|..+-.+.|..+..+-...-..+|.++||.+|..||-+|.++|.++|+++++.|..-+..|-++.+..
T Consensus 188 KkphiKKPLNAFmlyMKEmRa~vvaEctlKeSAaiNqiLGrRWH~LSrEEQAKYyElArKerqlH~qlYP~WSARdNYg 266 (421)
T KOG3248|consen 188 KKPHIKKPLNAFMLYMKEMRAKVVAECTLKESAAINQILGRRWHALSREEQAKYYELARKERQLHMQLYPGWSARDNYG 266 (421)
T ss_pred cCccccccHHHHHHHHHHHHHHHHHHhhhhhHHHHHHHHhHHHhhhhHHHHHHHHHHHHHHHHHHHHhcCCcchhhhhh
Confidence 3456678889998888898999888766555678999999999999999999999999999999999999998776654
No 35
>PF06382 DUF1074: Protein of unknown function (DUF1074); InterPro: IPR024460 This family consists of several proteins which appear to be specific to Insecta. The function of this family is unknown.
Probab=97.39 E-value=0.00045 Score=49.98 Aligned_cols=47 Identities=32% Similarity=0.511 Sum_probs=40.8
Q ss_pred CcHHHHHHHHHHHHHHHhCCCCcchHHHHHHHHHHHhccCCHHHHHHHHHhhh
Q psy3264 13 MTAYAFFVQVCREEHKKKHPEENVVFAEFSKKCAERWKTMSEKEKKRFHEMAD 65 (156)
Q Consensus 13 ~say~lF~~e~r~~~~~~~p~~~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~ 65 (156)
-+||+-|+.+.+. .|.++.. .|++...+..|..||+.+|..|..++-
T Consensus 83 nnaYLNFLReFRr----kh~~L~p--~dlI~~AAraW~rLSe~eK~rYrr~~~ 129 (183)
T PF06382_consen 83 NNAYLNFLREFRR----KHCGLSP--QDLIQRAARAWCRLSEAEKNRYRRMAP 129 (183)
T ss_pred chHHHHHHHHHHH----HccCCCH--HHHHHHHHHHHHhCCHHHHHHHHhhcc
Confidence 4699999999876 5667765 899999999999999999999998654
No 36
>KOG2746|consensus
Probab=96.77 E-value=0.0011 Score=56.80 Aligned_cols=68 Identities=28% Similarity=0.429 Sum_probs=58.3
Q ss_pred hhhhhccccccCCCCCcccc--ccccccCCCCCchhHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHH
Q psy3264 67 DKKRYDTEMQSYTPPAGEKR--GKIKQDNPEYGVGDIAKELGKKWADVDPSVKTKYEQMAEKDKARYEKE 134 (156)
Q Consensus 67 ~k~~y~~~~~~~~~~~~~~r--~~~~~~~p~~~~~e~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e 134 (156)
++....+.|++|..+....| +.+.+.||+..+..|+++||+.|-.|-+.||+.|.++|-+.++.|-+.
T Consensus 178 ~k~HirrPMnaf~ifskrhr~~g~vhq~~pn~DNrtIskiLgewWytL~~~Ekq~yhdLa~Qvk~Ahfka 247 (683)
T KOG2746|consen 178 DKDHIRRPMNAFHIFSKRHRGEGRVHQRHPNQDNRTISKILGEWWYTLGPNEKQKYHDLAFQVKEAHFKA 247 (683)
T ss_pred cchhhhhhhHHHHHHHhhcCCccchhccCccccchhHHHHHhhhHhhhCchhhhhHHHHHHHHHHHHhhh
Confidence 33445567777777666666 778899999999999999999999999999999999999999998876
No 37
>PF08073 CHDNT: CHDNT (NUC034) domain; InterPro: IPR012958 The CHD N-terminal domain is found in PHD/RING fingers and chromo domain-associated helicases [].; GO: 0003677 DNA binding, 0005524 ATP binding, 0008270 zinc ion binding, 0016818 hydrolase activity, acting on acid anhydrides, in phosphorus-containing anhydrides, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=96.57 E-value=0.0034 Score=36.92 Aligned_cols=40 Identities=23% Similarity=0.434 Sum_probs=35.9
Q ss_pred CcHHHHHHHHHHHHHHHhCCCCcchHHHHHHHHHHHhccCCH
Q psy3264 13 MTAYAFFVQVCREEHKKKHPEENVVFAEFSKKCAERWKTMSE 54 (156)
Q Consensus 13 ~say~lF~~e~r~~~~~~~p~~~~~~~ei~k~l~~~Wk~Ls~ 54 (156)
+|.|-+|.+..|+.|...||++.+ +.+..+++.+|+..++
T Consensus 13 lt~yK~Fsq~vRP~l~~~NPk~~~--sKl~~l~~AKwrEF~~ 52 (55)
T PF08073_consen 13 LTNYKAFSQHVRPLLAKANPKAPM--SKLMMLLQAKWREFQE 52 (55)
T ss_pred HHHHHHHHHHHHHHHHHHCCCCcH--HHHHHHHHHHHHHHHh
Confidence 567889999999999999999976 8999999999997654
No 38
>PF11304 DUF3106: Protein of unknown function (DUF3106); InterPro: IPR021455 Some members in this family of proteins are annotated as transmembrane proteins however this cannot be confirmed. Currently no function is known.
Probab=94.55 E-value=0.071 Score=35.81 Aligned_cols=81 Identities=15% Similarity=0.433 Sum_probs=55.3
Q ss_pred HHHHHHHHHHhccCCHHHHHHHHHhhhh-------hhhhhccccccCCCCCccccccccccC---CCCCchhHHHHHHHH
Q psy3264 39 AEFSKKCAERWKTMSEKEKKRFHEMADK-------DKKRYDTEMQSYTPPAGEKRGKIKQDN---PEYGVGDIAKELGKK 108 (156)
Q Consensus 39 ~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~-------~k~~y~~~~~~~~~~~~~~r~~~~~~~---p~~~~~e~~k~~~~~ 108 (156)
.++..-++..|+.|++..+..|...|.. .+.++...|..|..-.++.|..+...+ ..++..+.. .|...
T Consensus 10 q~~L~pl~~~W~~l~~~qr~k~l~~a~r~~~mspeqq~r~~~rm~~W~~LspeqR~~~R~~~~~~~~Lpp~qR~-~lr~~ 88 (107)
T PF11304_consen 10 QQALAPLAERWNSLPPEQRRKWLQIAERWPSMSPEQQQRLRERMRRWAALSPEQRQQARENYQRFKQLPPEQRQ-ALRAR 88 (107)
T ss_pred HHHHHHHHHHHhcCCHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHhCCHHHHHHHHHHHHHHHcCCHHHHH-HHHHH
Confidence 5677888899999999999988887755 566677777777665555555544433 344444333 46666
Q ss_pred h---ccCChHHHHHH
Q psy3264 109 W---ADVDPSVKTKY 120 (156)
Q Consensus 109 W---k~ls~~eK~~y 120 (156)
| ..||+++|...
T Consensus 89 w~~yq~l~~eeR~~l 103 (107)
T PF11304_consen 89 WEAYQQLPPEERQAL 103 (107)
T ss_pred HHHHHcCCHHHHHHH
Confidence 6 78888888654
No 39
>PF06382 DUF1074: Protein of unknown function (DUF1074); InterPro: IPR024460 This family consists of several proteins which appear to be specific to Insecta. The function of this family is unknown.
Probab=94.50 E-value=0.038 Score=40.16 Aligned_cols=38 Identities=16% Similarity=0.437 Sum_probs=34.1
Q ss_pred cccccCCCCCchhHHHHHHHHhccCChHHHHHHHHHHH
Q psy3264 88 KIKQDNPEYGVGDIAKELGKKWADVDPSVKTKYEQMAE 125 (156)
Q Consensus 88 ~~~~~~p~~~~~e~~k~~~~~Wk~ls~~eK~~y~~~a~ 125 (156)
.+...|.+++..|+....+..|..||+.+|..|...+-
T Consensus 92 eFRrkh~~L~p~dlI~~AAraW~rLSe~eK~rYrr~~~ 129 (183)
T PF06382_consen 92 EFRRKHCGLSPQDLIQRAARAWCRLSEAEKNRYRRMAP 129 (183)
T ss_pred HHHHHccCCCHHHHHHHHHHHHHhCCHHHHHHHHhhcc
Confidence 36678899999999999999999999999999998654
No 40
>PF06244 DUF1014: Protein of unknown function (DUF1014); InterPro: IPR010422 This family consists of several hypothetical eukaryotic proteins of unknown function.
Probab=93.07 E-value=0.18 Score=34.66 Aligned_cols=48 Identities=23% Similarity=0.383 Sum_probs=40.0
Q ss_pred CCCCC-CCCCcHHHHHHHHHHHHHHHhCCCCcchHHHHHHHHHHHhccCCH
Q psy3264 5 KDVKP-RGRMTAYAFFVQVCREEHKKKHPEENVVFAEFSKKCAERWKTMSE 54 (156)
Q Consensus 5 ~p~~P-krP~say~lF~~e~r~~~~~~~p~~~~~~~ei~k~l~~~Wk~Ls~ 54 (156)
...+| +|-.-||.-|...+.+.++.++|++.. +++..+|-..|..-++
T Consensus 68 ~drHPErR~KAAy~afeE~~Lp~lK~E~PgLrl--sQ~kq~l~K~w~KSPe 116 (122)
T PF06244_consen 68 IDRHPERRMKAAYKAFEERRLPELKEENPGLRL--SQYKQMLWKEWQKSPE 116 (122)
T ss_pred CCCCcchhHHHHHHHHHHHHhHHHHhhCCCchH--HHHHHHHHHHHhcCCC
Confidence 34445 455579999999999999999999987 8999999999987654
No 41
>PF04769 MAT_Alpha1: Mating-type protein MAT alpha 1; InterPro: IPR006856 This family includes Saccharomyces cerevisiae (Baker's yeast) mating type protein alpha 1 (P01365 from SWISSPROT). MAT alpha 1 is a transcription activator that activates mating-type alpha-specific genes with the help of the MADS-box containing MCM1 transcription factor, which together bind cooperatively to PQ elements upstream of alpha-specific genes. The MCM1-MATalpha1 complex is required for the proper DNA-bending that is needed for transcriptional activation []. Alpha 1 interacts in vivo with STE12, linking expression of alpha-specific genes to the alpha-pheromone (IPR006742 from INTERPRO) response pathway [].; GO: 0000772 mating pheromone activity, 0003677 DNA binding, 0045895 positive regulation of transcription, mating-type specific, 0005634 nucleus
Probab=93.02 E-value=0.27 Score=36.81 Aligned_cols=55 Identities=16% Similarity=0.220 Sum_probs=40.8
Q ss_pred CCCCCCCCCCcHHHHHHHHHHHHHHHhCCCCcchHHHHHHHHHHHhccCCHHHHHHHHHhhhh
Q psy3264 4 EKDVKPRGRMTAYAFFVQVCREEHKKKHPEENVVFAEFSKKCAERWKTMSEKEKKRFHEMADK 66 (156)
Q Consensus 4 ~~p~~PkrP~say~lF~~e~r~~~~~~~p~~~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~ 66 (156)
..+..++||+|+|++|..-.- ...|+... .+++.+|+..|..=+- |..|.-+|..
T Consensus 39 ~~~~~~kr~lN~Fm~FRsyy~----~~~~~~~Q--k~~S~~l~~lW~~dp~--k~~W~l~ak~ 93 (201)
T PF04769_consen 39 RSPEKAKRPLNGFMAFRSYYS----PIFPPLPQ--KELSGILTKLWEKDPF--KNKWSLMAKA 93 (201)
T ss_pred ccccccccchhHHHHHHHHHH----hhcCCcCH--HHHHHHHHHHHhCCcc--HhHHHHHhhh
Confidence 345678999999999977654 44566654 8999999999997443 5666666654
No 42
>PF08073 CHDNT: CHDNT (NUC034) domain; InterPro: IPR012958 The CHD N-terminal domain is found in PHD/RING fingers and chromo domain-associated helicases [].; GO: 0003677 DNA binding, 0005524 ATP binding, 0008270 zinc ion binding, 0016818 hydrolase activity, acting on acid anhydrides, in phosphorus-containing anhydrides, 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus
Probab=92.12 E-value=0.054 Score=31.88 Aligned_cols=33 Identities=18% Similarity=0.357 Sum_probs=28.3
Q ss_pred CccccccccccCCCCCchhHHHHHHHHhccCCh
Q psy3264 82 AGEKRGKIKQDNPEYGVGDIAKELGKKWADVDP 114 (156)
Q Consensus 82 ~~~~r~~~~~~~p~~~~~e~~k~~~~~Wk~ls~ 114 (156)
..--||.+...||++....+..+++.+|+..+.
T Consensus 20 sq~vRP~l~~~NPk~~~sKl~~l~~AKwrEF~~ 52 (55)
T PF08073_consen 20 SQHVRPLLAKANPKAPMSKLMMLLQAKWREFQE 52 (55)
T ss_pred HHHHHHHHHHHCCCCcHHHHHHHHHHHHHHHHh
Confidence 334489999999999999999999999997654
No 43
>TIGR03481 HpnM hopanoid biosynthesis associated membrane protein HpnM. The genomes containing members of this family share the machinery for the biosynthesis of hopanoid lipids. Furthermore, the genes of this family are usually located proximal to other components of this biological process. The proteins are members of the pfam05494 family of putative transporters known as "toluene tolerance protein Ttg2D", although it is unlikely that the members included here have anything to do with toluene per-se.
Probab=92.04 E-value=0.38 Score=35.83 Aligned_cols=47 Identities=17% Similarity=0.523 Sum_probs=40.9
Q ss_pred CCCchhHHH-HHHHHhccCChHHHHHHHHHHHH-HHHHHHHHHHHHHhc
Q psy3264 95 EYGVGDIAK-ELGKKWADVDPSVKTKYEQMAEK-DKARYEKEMTAYKNR 141 (156)
Q Consensus 95 ~~~~~e~~k-~~~~~Wk~ls~~eK~~y~~~a~~-~k~~y~~e~~~~~~~ 141 (156)
.+++..+++ .||..|+.+|+++++.|.+.... ....|-..+..|...
T Consensus 63 ~~Df~~mar~vLG~~W~~~s~~Qr~~F~~~F~~~l~~tY~~~l~~y~~~ 111 (198)
T TIGR03481 63 AFDLPAMARLTLGSSWTSLSPEQRRRFIGAFRELSIATYASQFKSYAGE 111 (198)
T ss_pred hCCHHHHHHHHhhhhhhhCCHHHHHHHHHHHHHHHHHHHHHHHHhhcCc
Confidence 456788875 78999999999999999999998 888999999999654
No 44
>PRK15117 ABC transporter periplasmic binding protein MlaC; Provisional
Probab=88.89 E-value=1.3 Score=33.39 Aligned_cols=49 Identities=20% Similarity=0.328 Sum_probs=41.5
Q ss_pred CCCCCchhHHH-HHHHHhccCChHHHHHHHHHHHH-HHHHHHHHHHHHHhc
Q psy3264 93 NPEYGVGDIAK-ELGKKWADVDPSVKTKYEQMAEK-DKARYEKEMTAYKNR 141 (156)
Q Consensus 93 ~p~~~~~e~~k-~~~~~Wk~ls~~eK~~y~~~a~~-~k~~y~~e~~~~~~~ 141 (156)
.|.+.+..+++ .||.-|+.+|+++++.|.+.... +...|-..+..|...
T Consensus 65 ~p~~Df~~~s~~vLG~~wr~as~eQr~~F~~~F~~~Lv~tYa~~l~~y~~q 115 (211)
T PRK15117 65 LPYVQVKYAGALVLGRYYKDATPAQREAYFAAFREYLKQAYGQALAMYHGQ 115 (211)
T ss_pred cccCCHHHHHHHHhhhhhhhCCHHHHHHHHHHHHHHHHHHHHHHHHHhCCc
Confidence 46778888874 78999999999999999999887 556899999999754
No 45
>PF05494 Tol_Tol_Ttg2: Toluene tolerance, Ttg2 ; InterPro: IPR008869 Toluene tolerance is mediated by increased cell membrane rigidity resulting from changes in fatty acid and phospholipid compositions, exclusion of toluene from the cell membrane, and removal of intracellular toluene by degradation []. Many proteins are involved in these processes. This family is a transporter which shows similarity to ABC transporters [].; PDB: 2QGU_A.
Probab=80.48 E-value=2.8 Score=30.11 Aligned_cols=47 Identities=19% Similarity=0.525 Sum_probs=34.4
Q ss_pred CCCCchhHH-HHHHHHhccCChHHHHHHHHHHHHH-HHHHHHHHHHHHh
Q psy3264 94 PEYGVGDIA-KELGKKWADVDPSVKTKYEQMAEKD-KARYEKEMTAYKN 140 (156)
Q Consensus 94 p~~~~~e~~-k~~~~~Wk~ls~~eK~~y~~~a~~~-k~~y~~e~~~~~~ 140 (156)
|.+.+..++ ..||.-|+.+|+++++.|.+..... ...|-..+..|..
T Consensus 36 ~~~D~~~~ar~~LG~~w~~~s~~q~~~F~~~f~~~l~~~Y~~~l~~y~~ 84 (170)
T PF05494_consen 36 PYFDFERMARRVLGRYWRKASPAQRQRFVEAFKQLLVRTYAKRLDEYSG 84 (170)
T ss_dssp GGB-HHHHHHHHHGGGTTTS-HHHHHHHHHHHHHHHHHHHHHHHHT-SS
T ss_pred HhCCHHHHHHHHHHHhHhhCCHHHHHHHHHHHHHHHHHHHHHHHHhhCC
Confidence 556677777 5677899999999999999998774 5567777777764
No 46
>PF04690 YABBY: YABBY protein; InterPro: IPR006780 YABBY proteins are a group of plant-specific transcription factors involved in the specification of abaxial polarity in lateral organs such as leaves and floral organs [, ].
Probab=80.46 E-value=0.54 Score=34.19 Aligned_cols=45 Identities=20% Similarity=0.338 Sum_probs=37.5
Q ss_pred hhhccccccCCCCCccccccccccCCCCCchhHHHHHHHHhccCC
Q psy3264 69 KRYDTEMQSYTPPAGEKRGKIKQDNPEYGVGDIAKELGKKWADVD 113 (156)
Q Consensus 69 ~~y~~~~~~~~~~~~~~r~~~~~~~p~~~~~e~~k~~~~~Wk~ls 113 (156)
++.++.-.+|+.+..+.-++++..+|+++-.|+++..+..|...+
T Consensus 120 EKRqR~psaYn~f~k~ei~rik~~~p~ishkeaFs~aAknW~h~p 164 (170)
T PF04690_consen 120 EKRQRVPSAYNRFMKEEIQRIKAENPDISHKEAFSAAAKNWAHFP 164 (170)
T ss_pred cccCCCchhHHHHHHHHHHHHHhcCCCCCHHHHHHHHHHhhhhCc
Confidence 445566677777777778889999999999999999999998765
No 47
>TIGR03481 HpnM hopanoid biosynthesis associated membrane protein HpnM. The genomes containing members of this family share the machinery for the biosynthesis of hopanoid lipids. Furthermore, the genes of this family are usually located proximal to other components of this biological process. The proteins are members of the pfam05494 family of putative transporters known as "toluene tolerance protein Ttg2D", although it is unlikely that the members included here have anything to do with toluene per-se.
Probab=77.78 E-value=3 Score=30.99 Aligned_cols=42 Identities=17% Similarity=0.510 Sum_probs=33.7
Q ss_pred HHHHHH-HHHHHhccCCHHHHHHHHHhhhh-hhhhhccccccCC
Q psy3264 38 FAEFSK-KCAERWKTMSEKEKKRFHEMADK-DKKRYDTEMQSYT 79 (156)
Q Consensus 38 ~~ei~k-~l~~~Wk~Ls~~eK~~y~~~a~~-~k~~y~~~~~~~~ 79 (156)
|..+++ .||.-|+.+|+++|+.|.+.... ....|-..+..|.
T Consensus 66 f~~mar~vLG~~W~~~s~~Qr~~F~~~F~~~l~~tY~~~l~~y~ 109 (198)
T TIGR03481 66 LPAMARLTLGSSWTSLSPEQRRRFIGAFRELSIATYASQFKSYA 109 (198)
T ss_pred HHHHHHHHhhhhhhhCCHHHHHHHHHHHHHHHHHHHHHHHHhhc
Confidence 455654 78999999999999999998887 6677777777664
No 48
>PF11304 DUF3106: Protein of unknown function (DUF3106); InterPro: IPR021455 Some members in this family of proteins are annotated as transmembrane proteins however this cannot be confirmed. Currently no function is known.
Probab=75.42 E-value=14 Score=24.59 Aligned_cols=25 Identities=28% Similarity=0.607 Sum_probs=12.6
Q ss_pred HHHHHHHHhccCChHHHHHHHHHHH
Q psy3264 101 IAKELGKKWADVDPSVKTKYEQMAE 125 (156)
Q Consensus 101 ~~k~~~~~Wk~ls~~eK~~y~~~a~ 125 (156)
+..-+...|+.|++..+..+...|.
T Consensus 12 ~L~pl~~~W~~l~~~qr~k~l~~a~ 36 (107)
T PF11304_consen 12 ALAPLAERWNSLPPEQRRKWLQIAE 36 (107)
T ss_pred HHHHHHHHHhcCCHHHHHHHHHHHH
Confidence 3344455555555555555554443
No 49
>PF01352 KRAB: KRAB box; InterPro: IPR001909 The Krueppel-associated box (KRAB) is a domain of around 75 amino acids that is found in the N-terminal part of about one third of eukaryotic Krueppel-type C2H2 zinc finger proteins (ZFPs) []. It is enriched in charged amino acids and can be divided into subregions A and B, which are predicted to fold into two amphipathic alpha-helices. The KRAB A and B boxes can be separated by variable spacer segments and many KRAB proteins contain only the A box []. The functions currently known for members of the KRAB-containing protein family include transcriptional repression of RNA polymerase I, II, and III promoters, binding and splicing of RNA, and control of nucleolus function. The KRAB domain functions as a transcriptional repressor when tethered to the template DNA by a DNA-binding domain. A sequence of 45 amino acids in the KRAB A subdomain has been shown to be necessary and sufficient for transcriptional repression. The B box does not repress by itself but does potentiate the repression exerted by the KRAB A subdomain [, ]. Gene silencing requires the binding of the KRAB domain to the RING-B box-coiled coil (RBCC) domain of the KAP-1/TIF1-beta corepressor. As KAP-1 binds to the heterochromatin proteins HP1, it has been proposed that the KRAB-ZFP-bound target gene could be silenced following recruitment to heterochromatin [, ]. KRAB-ZFPs probably constitute the single largest class of transcription factors within the human genome []. Although the function of KRAB-ZFPs is largely unknown, they appear to play important roles during cell differentiation and development. The KRAB domain is generally encoded by two exons. The regions coded by the two exons are known as KRAB-A and KRAB-B.; GO: 0003676 nucleic acid binding, 0006355 regulation of transcription, DNA-dependent, 0005622 intracellular; PDB: 1V65_A.
Probab=73.30 E-value=2.4 Score=23.28 Aligned_cols=28 Identities=25% Similarity=0.528 Sum_probs=15.6
Q ss_pred CchhHHHHHH-HHhccCChHHHHHHHHHH
Q psy3264 97 GVGDIAKELG-KKWADVDPSVKTKYEQMA 124 (156)
Q Consensus 97 ~~~e~~k~~~-~~Wk~ls~~eK~~y~~~a 124 (156)
++.|++.-++ +.|..|.+.+|.-|.+.-
T Consensus 2 tf~Dvav~fs~eEW~~L~~~Qk~ly~dvm 30 (41)
T PF01352_consen 2 TFEDVAVYFSQEEWELLDPAQKNLYRDVM 30 (41)
T ss_dssp -----TT---HHHHHTS-HHHHHHHHHHH
T ss_pred eEEEEEEEcChhhcccccceecccchhHH
Confidence 4566665555 459999999999998754
No 50
>KOG3223|consensus
Probab=73.10 E-value=5.1 Score=29.71 Aligned_cols=44 Identities=14% Similarity=0.339 Sum_probs=38.5
Q ss_pred CCCCcHHHHHHHHHHHHHHHhCCCCcchHHHHHHHHHHHhccCCHH
Q psy3264 10 RGRMTAYAFFVQVCREEHKKKHPEENVVFAEFSKKCAERWKTMSEK 55 (156)
Q Consensus 10 krP~say~lF~~e~r~~~~~~~p~~~~~~~ei~k~l~~~Wk~Ls~~ 55 (156)
+|=.-||.-|=....+.++..+|++.+ ++.-.+|-.+|..-++.
T Consensus 166 kRmrAA~~afEe~~LPrLK~e~P~lrl--sQ~Kqll~Kew~KsPDN 209 (221)
T KOG3223|consen 166 KRMRAAFKAFEEARLPRLKKENPGLRL--SQYKQLLKKEWQKSPDN 209 (221)
T ss_pred HHHHHHHHHHHHhhchhhhhcCCCccH--HHHHHHHHHHHhhCCCC
Confidence 455678999999999999999999987 89999999999887763
No 51
>PF13875 DUF4202: Domain of unknown function (DUF4202)
Probab=71.66 E-value=10 Score=28.02 Aligned_cols=40 Identities=25% Similarity=0.440 Sum_probs=34.0
Q ss_pred HHHHHHHHHHHHHHHhCCCCcchHHHHHHHHHHHhccCCHHHHHH
Q psy3264 15 AYAFFVQVCREEHKKKHPEENVVFAEFSKKCAERWKTMSEKEKKR 59 (156)
Q Consensus 15 ay~lF~~e~r~~~~~~~p~~~~~~~ei~k~l~~~Wk~Ls~~eK~~ 59 (156)
+.++|+...+..+...|.. ..+..+|..-|+.||+.-++.
T Consensus 131 acLVFL~~~f~~F~~~~de-----eK~v~Il~KTw~KMS~~g~~~ 170 (185)
T PF13875_consen 131 ACLVFLEYYFEDFAAKHDE-----EKIVDILRKTWRKMSERGHEA 170 (185)
T ss_pred HHHHhHHHHHHHHHhcCCH-----HHHHHHHHHHHHHCCHHHHHH
Confidence 5788999999999998832 688899999999999987753
No 52
>COG2854 Ttg2D ABC-type transport system involved in resistance to organic solvents, auxiliary component [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=69.35 E-value=12 Score=28.01 Aligned_cols=54 Identities=17% Similarity=0.251 Sum_probs=41.5
Q ss_pred CCCCchhHH-HHHHHHhccCChHHHHHHHHHHHH-HHHHHHHHHHHHHhcCCCCCC
Q psy3264 94 PEYGVGDIA-KELGKKWADVDPSVKTKYEQMAEK-DKARYEKEMTAYKNRSVAPVN 147 (156)
Q Consensus 94 p~~~~~e~~-k~~~~~Wk~ls~~eK~~y~~~a~~-~k~~y~~e~~~~~~~~~~~~~ 147 (156)
|-+.+.-++ ..||.-|+++|+++++.|.+.... +.+.|-..|..|+.+...=..
T Consensus 68 p~vd~~~~a~~vLGk~~k~aspeQ~~~F~~aF~~yl~q~Y~~aL~~Y~~q~~~v~~ 123 (202)
T COG2854 68 PYVDFKYAAKLVLGKYYKTASPEQRQAFFKAFRTYLEQTYGQALLDYKGQTLKVKP 123 (202)
T ss_pred hhhcHHHHHHHHhccccccCCHHHHHHHHHHHHHHHHHHHHHHHHHccCCCceeCC
Confidence 444444444 668899999999999999998877 567799999999887654333
No 53
>PF12881 NUT_N: NUT protein N terminus; InterPro: IPR024309 This domain is found in the N-terminal region of Nuclear Testis (NUT) proteins. It is also found in FAM22, which are a family of uncharacterised mammalian proteins.
Probab=68.10 E-value=14 Score=29.47 Aligned_cols=52 Identities=21% Similarity=0.306 Sum_probs=45.8
Q ss_pred CcHHHHHHHHHHHHHHHhCCCCcchHHHHHHHHHHHhccCCHHHHHHHHHhhhh
Q psy3264 13 MTAYAFFVQVCREEHKKKHPEENVVFAEFSKKCAERWKTMSEKEKKRFHEMADK 66 (156)
Q Consensus 13 ~say~lF~~e~r~~~~~~~p~~~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~ 66 (156)
..|+.+|+.-....+....|.+++ .|-....=++|.-.|-=+|..|.++|.+
T Consensus 229 ~EAlSCFLIpvLrsLar~kPtMtl--EeGl~ra~qEW~~~SnfdRmifyemaek 280 (328)
T PF12881_consen 229 AEALSCFLIPVLRSLARLKPTMTL--EEGLWRAVQEWQHTSNFDRMIFYEMAEK 280 (328)
T ss_pred hhhhhhhHHHHHHHHHhcCCCccH--HHHHHHHHHHhhccccccHHHHHHHHHH
Confidence 458889998888888888898876 7888888899999999999999999976
No 54
>PF05494 Tol_Tol_Ttg2: Toluene tolerance, Ttg2 ; InterPro: IPR008869 Toluene tolerance is mediated by increased cell membrane rigidity resulting from changes in fatty acid and phospholipid compositions, exclusion of toluene from the cell membrane, and removal of intracellular toluene by degradation []. Many proteins are involved in these processes. This family is a transporter which shows similarity to ABC transporters [].; PDB: 2QGU_A.
Probab=64.31 E-value=4 Score=29.26 Aligned_cols=41 Identities=17% Similarity=0.452 Sum_probs=29.8
Q ss_pred HHHH-HHHHHHhccCCHHHHHHHHHhhhh-hhhhhccccccCC
Q psy3264 39 AEFS-KKCAERWKTMSEKEKKRFHEMADK-DKKRYDTEMQSYT 79 (156)
Q Consensus 39 ~ei~-k~l~~~Wk~Ls~~eK~~y~~~a~~-~k~~y~~~~~~~~ 79 (156)
..++ ..||.-|+.+|+++++.|.+.... ....|-..+..|.
T Consensus 41 ~~~ar~~LG~~w~~~s~~q~~~F~~~f~~~l~~~Y~~~l~~y~ 83 (170)
T PF05494_consen 41 ERMARRVLGRYWRKASPAQRQRFVEAFKQLLVRTYAKRLDEYS 83 (170)
T ss_dssp HHHHHHHHGGGTTTS-HHHHHHHHHHHHHHHHHHHHHHHHT-S
T ss_pred HHHHHHHHHHhHhhCCHHHHHHHHHHHHHHHHHHHHHHHHhhC
Confidence 4444 467789999999999999988777 5566777777764
No 55
>PRK15117 ABC transporter periplasmic binding protein MlaC; Provisional
Probab=62.72 E-value=6.7 Score=29.45 Aligned_cols=42 Identities=10% Similarity=0.166 Sum_probs=32.6
Q ss_pred HHHHH-HHHHHHhccCCHHHHHHHHHhhhh-hhhhhccccccCC
Q psy3264 38 FAEFS-KKCAERWKTMSEKEKKRFHEMADK-DKKRYDTEMQSYT 79 (156)
Q Consensus 38 ~~ei~-k~l~~~Wk~Ls~~eK~~y~~~a~~-~k~~y~~~~~~~~ 79 (156)
|..++ ..||.-|+++|+++|..|.+.... ....|-..+..|.
T Consensus 70 f~~~s~~vLG~~wr~as~eQr~~F~~~F~~~Lv~tYa~~l~~y~ 113 (211)
T PRK15117 70 VKYAGALVLGRYYKDATPAQREAYFAAFREYLKQAYGQALAMYH 113 (211)
T ss_pred HHHHHHHHhhhhhhhCCHHHHHHHHHHHHHHHHHHHHHHHHHhC
Confidence 34554 478999999999999999988777 5567777777664
No 56
>PF06945 DUF1289: Protein of unknown function (DUF1289); InterPro: IPR010710 This family consists of a number of hypothetical bacterial proteins. The aligned region spans around 56 residues and contains 4 highly conserved cysteine residues towards the N terminus. The function of this family is unknown.
Probab=59.58 E-value=11 Score=21.64 Aligned_cols=21 Identities=5% Similarity=0.204 Sum_probs=16.6
Q ss_pred HHhccCChHHHHHHHHHHHHH
Q psy3264 107 KKWADVDPSVKTKYEQMAEKD 127 (156)
Q Consensus 107 ~~Wk~ls~~eK~~y~~~a~~~ 127 (156)
..|+.||+++|..........
T Consensus 28 ~~W~~~s~~er~~i~~~l~~R 48 (51)
T PF06945_consen 28 RDWKSMSDDERRAILARLRAR 48 (51)
T ss_pred HHHhhCCHHHHHHHHHHHHHH
Confidence 379999999998877765544
No 57
>PRK10363 cpxP periplasmic repressor CpxP; Reviewed
Probab=57.02 E-value=48 Score=24.07 Aligned_cols=44 Identities=11% Similarity=0.210 Sum_probs=32.3
Q ss_pred hhHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHhcCC
Q psy3264 99 GDIAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKNRSV 143 (156)
Q Consensus 99 ~e~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~~~ 143 (156)
.+.++.-...++-|+|++|..|.++.++-..++.. +..+.....
T Consensus 111 Vem~k~~nqmy~lLTPEQKaq~~~~~~~rm~~~~~-~~~~q~~s~ 154 (166)
T PRK10363 111 VEMAKVRNQMYRLLTPEQQAVLNEKHQQRMEQLRD-VTQWQKSSS 154 (166)
T ss_pred HHHHHHHHHHHHhCCHHHHHHHHHHHHHHHHHHHH-HHhcCcccc
Confidence 34556667789999999999999988877777744 665544433
No 58
>PRK09706 transcriptional repressor DicA; Reviewed
Probab=55.33 E-value=62 Score=22.07 Aligned_cols=46 Identities=11% Similarity=0.034 Sum_probs=39.6
Q ss_pred hhHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHhcCCC
Q psy3264 99 GDIAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKNRSVA 144 (156)
Q Consensus 99 ~e~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~~~~ 144 (156)
..-.+.|-..|+.||++++......++...+.|..-+++|..+...
T Consensus 86 ~~~~~~ll~~~~~L~~~~~~~~l~~l~~~~~~~~~~~~~~~~~~~~ 131 (135)
T PRK09706 86 SEDQKELLELFDALPESEQDAQLSEMRARVENFNKLFEELLKARKR 131 (135)
T ss_pred CHHHHHHHHHHHHCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHhh
Confidence 3445778899999999999999999999999999999999765443
No 59
>PF15581 Imm35: Immunity protein 35
Probab=51.65 E-value=38 Score=21.88 Aligned_cols=30 Identities=10% Similarity=0.210 Sum_probs=23.2
Q ss_pred HHHHHHHHHHHhccCCHHHHHHHHHhhhhh
Q psy3264 38 FAEFSKKCAERWKTMSEKEKKRFHEMADKD 67 (156)
Q Consensus 38 ~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~ 67 (156)
..-+...+...|+.|+.++=..-.+.+...
T Consensus 32 i~~l~~lIe~eWRGl~~~qV~~kl~ava~~ 61 (93)
T PF15581_consen 32 IRNLESLIEHEWRGLPEEQVLYKLEAVAAK 61 (93)
T ss_pred HHHHHHHHHHHHcCCCHHHHHHHHHHHHhc
Confidence 467889999999999999876665555443
No 60
>KOG3838|consensus
Probab=51.23 E-value=19 Score=29.91 Aligned_cols=40 Identities=20% Similarity=0.248 Sum_probs=34.6
Q ss_pred cCChHHHHHHHHHHHHHHHHHHHHHHHHHhcCCCCCCCcc
Q psy3264 111 DVDPSVKTKYEQMAEKDKARYEKEMTAYKNRSVAPVNEEA 150 (156)
Q Consensus 111 ~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~~~~~~~~~~ 150 (156)
.+.+.+|++|.++.+.....|+++-++|++.++....+++
T Consensus 268 E~qe~ek~kyqeEfe~~q~elek~k~efkk~hpd~~~e~e 307 (497)
T KOG3838|consen 268 EMQELEKAKYQEEFEWAQLELEKRKDEFKKSHPDAQGEGE 307 (497)
T ss_pred hhhHHHHHHHHHHHHHHHHHHhhhHhhhccCCchhhcchh
Confidence 4567799999999999999999999999999988776644
No 61
>PF06244 DUF1014: Protein of unknown function (DUF1014); InterPro: IPR010422 This family consists of several hypothetical eukaryotic proteins of unknown function.
Probab=51.01 E-value=12 Score=25.78 Aligned_cols=31 Identities=19% Similarity=0.417 Sum_probs=27.2
Q ss_pred cccccccccCCCCCchhHHHHHHHHhccCCh
Q psy3264 84 EKRGKIKQDNPEYGVGDIAKELGKKWADVDP 114 (156)
Q Consensus 84 ~~r~~~~~~~p~~~~~e~~k~~~~~Wk~ls~ 114 (156)
.+-|.|+.++||+..+.+-.+|-..|...++
T Consensus 86 ~~Lp~lK~E~PgLrlsQ~kq~l~K~w~KSPe 116 (122)
T PF06244_consen 86 RRLPELKEENPGLRLSQYKQMLWKEWQKSPE 116 (122)
T ss_pred HHhHHHHhhCCCchHHHHHHHHHHHHhcCCC
Confidence 3468899999999999999999999988765
No 62
>PRK12751 cpxP periplasmic stress adaptor protein CpxP; Reviewed
Probab=49.01 E-value=47 Score=23.95 Aligned_cols=77 Identities=9% Similarity=0.124 Sum_probs=43.0
Q ss_pred ccCCHHHHHHHHHhhhhhhhhhccccccCCCCCccccccccccCCC-C---------------CchhHHHHHHHHhccCC
Q psy3264 50 KTMSEKEKKRFHEMADKDKKRYDTEMQSYTPPAGEKRGKIKQDNPE-Y---------------GVGDIAKELGKKWADVD 113 (156)
Q Consensus 50 k~Ls~~eK~~y~~~a~~~k~~y~~~~~~~~~~~~~~r~~~~~~~p~-~---------------~~~e~~k~~~~~Wk~ls 113 (156)
-+||+++|....++....+......... -......-+..+..+ . -..+..+.....+..||
T Consensus 55 l~LTd~QR~qmr~im~~~r~~~~~~~~~---~~~~m~~Li~Ad~FDeaAvra~~~kma~~~~e~~v~~~~~~~qmy~lLT 131 (162)
T PRK12751 55 INLTEQQRQQMRDLMRQSHQSQPRLDLE---DREAMHKLITADKFDEAAVRAQAEKMSQNQIERHVEMAKVRNQMYNLLT 131 (162)
T ss_pred CCCCHHHHHHHHHHHHHhhhcccchhHH---HHHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHcCC
Confidence 6899999999998887766542111000 000001111122211 0 01234456677899999
Q ss_pred hHHHHHHHHHHHHHHH
Q psy3264 114 PSVKTKYEQMAEKDKA 129 (156)
Q Consensus 114 ~~eK~~y~~~a~~~k~ 129 (156)
+++|..|.++.++-..
T Consensus 132 PEQra~l~~~~e~r~~ 147 (162)
T PRK12751 132 PEQKEALNKKHQERIE 147 (162)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 9999999995554433
No 63
>PF12650 DUF3784: Domain of unknown function (DUF3784); InterPro: IPR017259 This group represents an uncharacterised conserved protein.
Probab=46.79 E-value=14 Score=23.88 Aligned_cols=18 Identities=17% Similarity=0.270 Sum_probs=14.8
Q ss_pred HHHhccCChHHHHHHHHH
Q psy3264 106 GKKWADVDPSVKTKYEQM 123 (156)
Q Consensus 106 ~~~Wk~ls~~eK~~y~~~ 123 (156)
=+-|+.||++||+.|...
T Consensus 23 IaGyntms~eEk~~~D~~ 40 (97)
T PF12650_consen 23 IAGYNTMSKEEKEKYDKK 40 (97)
T ss_pred hhhcccCCHHHHHHhhHH
Confidence 357999999999999653
No 64
>PRK10236 hypothetical protein; Provisional
Probab=46.35 E-value=23 Score=27.16 Aligned_cols=26 Identities=15% Similarity=0.414 Sum_probs=21.8
Q ss_pred HHHHHHHHhccCChHHHHHHHHHHHH
Q psy3264 101 IAKELGKKWADVDPSVKTKYEQMAEK 126 (156)
Q Consensus 101 ~~k~~~~~Wk~ls~~eK~~y~~~a~~ 126 (156)
+++.+...|+.||++|++.+.+.-..
T Consensus 118 l~kll~~a~~kms~eE~~~L~~~l~~ 143 (237)
T PRK10236 118 LEQFLRNTWKKMDEEHKQEFLHAVDA 143 (237)
T ss_pred HHHHHHHHHHHCCHHHHHHHHHHHhh
Confidence 56999999999999999888765544
No 65
>PRK10236 hypothetical protein; Provisional
Probab=45.62 E-value=25 Score=26.97 Aligned_cols=28 Identities=25% Similarity=0.451 Sum_probs=22.8
Q ss_pred HHHHHHHHHHhccCCHHHHHHHHHhhhh
Q psy3264 39 AEFSKKCAERWKTMSEKEKKRFHEMADK 66 (156)
Q Consensus 39 ~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~ 66 (156)
.-+.+++...|..||++|++.+...-..
T Consensus 116 ~il~kll~~a~~kms~eE~~~L~~~l~~ 143 (237)
T PRK10236 116 QLLEQFLRNTWKKMDEEHKQEFLHAVDA 143 (237)
T ss_pred HHHHHHHHHHHHHCCHHHHHHHHHHHhh
Confidence 3467899999999999999888875544
No 66
>PF11616 EZH2_WD-Binding: WD repeat binding protein EZH2; InterPro: IPR021654 This family of proteins represents Enhancer of zest homologue 2, (EZH2) a 30 residue peptide which binds to a WD-repeat domain of EED by residues 39-68. EED is a component of PRC2 complex which is involved in gene expression []. This interaction is required for the HMTase activity of PCR2 []. ; GO: 0018024 histone-lysine N-methyltransferase activity; PDB: 2QXV_B.
Probab=43.14 E-value=47 Score=16.89 Aligned_cols=14 Identities=29% Similarity=0.539 Sum_probs=10.7
Q ss_pred HHHHHHHHHHhccC
Q psy3264 39 AEFSKKCAERWKTM 52 (156)
Q Consensus 39 ~ei~k~l~~~Wk~L 52 (156)
.+-+.+|.+.|+.|
T Consensus 12 ~e~t~iLN~eWk~l 25 (30)
T PF11616_consen 12 QERTDILNEEWKKL 25 (30)
T ss_dssp HHHHHHHHHHHHH-
T ss_pred HHHHHHHHHHHHHh
Confidence 57778899999876
No 67
>PRK12750 cpxP periplasmic repressor CpxP; Reviewed
Probab=40.30 E-value=90 Score=22.63 Aligned_cols=35 Identities=11% Similarity=0.303 Sum_probs=28.4
Q ss_pred HHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHH
Q psy3264 101 IAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEM 135 (156)
Q Consensus 101 ~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~ 135 (156)
+.+.....+.-||+++|..|.+.-..-.+.+...+
T Consensus 126 ~~~~~~~~~~vLTpEQRak~~e~~~~r~~~~~~~~ 160 (170)
T PRK12750 126 MLEKRHQMLSILTPEQKAKFQELQQERMQECQDKM 160 (170)
T ss_pred HHHHHHHHHHhCCHHHHHHHHHHHHHHHHHHHHHH
Confidence 33455567999999999999999888877777766
No 68
>cd07081 ALDH_F20_ACDH_EutE-like Coenzyme A acylating aldehyde dehydrogenase (ACDH), Ethanolamine utilization protein EutE, and related proteins. Coenzyme A acylating aldehyde dehydrogenase (ACDH), an NAD+ and CoA-dependent acetaldehyde dehydrogenase, acetylating (EC=1.2.1.10), functions as a single enzyme (such as the Ethanolamine utilization protein, EutE, in Salmonella typhimurium) or as part of a multifunctional enzyme to convert acetaldehyde into acetyl-CoA. The E. coli aldehyde-alcohol dehydrogenase includes the functional domains, alcohol dehydrogenase (ADH), ACDH, and pyruvate-formate-lyase deactivase; and the Entamoeba histolytica aldehyde-alcohol dehydrogenase 2 (ALDH20A1) includes the functional domains ADH and ACDH, and may be critical enzymes in the fermentative pathway.
Probab=39.41 E-value=89 Score=26.17 Aligned_cols=42 Identities=7% Similarity=-0.058 Sum_probs=35.4
Q ss_pred hHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy3264 100 DIAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKNR 141 (156)
Q Consensus 100 e~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~ 141 (156)
+.++..-..|..+|..+|..+...+....+++..++......
T Consensus 6 ~~A~~A~~~W~~~~~~~R~~iL~~~a~~l~~~~~ela~~~~~ 47 (439)
T cd07081 6 AAAKVAQQGLSCKSQEMVDLIFRAAAEAAEDARIDLAKLAVS 47 (439)
T ss_pred HHHHHHHHHHhhCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 344556678999999999999999999999999999887443
No 69
>KOG3223|consensus
Probab=39.26 E-value=25 Score=26.14 Aligned_cols=32 Identities=19% Similarity=0.443 Sum_probs=28.5
Q ss_pred cccccccccCCCCCchhHHHHHHHHhccCChH
Q psy3264 84 EKRGKIKQDNPEYGVGDIAKELGKKWADVDPS 115 (156)
Q Consensus 84 ~~r~~~~~~~p~~~~~e~~k~~~~~Wk~ls~~ 115 (156)
.+-|+|+.++|++.....-.+|-..|...++.
T Consensus 178 ~~LPrLK~e~P~lrlsQ~Kqll~Kew~KsPDN 209 (221)
T KOG3223|consen 178 ARLPRLKKENPGLRLSQYKQLLKKEWQKSPDN 209 (221)
T ss_pred hhchhhhhcCCCccHHHHHHHHHHHHhhCCCC
Confidence 44788999999999999999999999988764
No 70
>PF06424 PRP1_N: PRP1 splicing factor, N-terminal; InterPro: IPR010491 This domain is specific to the N-terminal part of the prp1 splicing factor, which is involved in mRNA splicing (and possibly also poly(A)+ RNA nuclear export and cell cycle progression). This domain is specific to the N terminus of the RNA splicing factor encoded by prp1 []. It is involved in mRNA splicing and possibly also poly(A)and RNA nuclear export and cell cycle progression.; GO: 0000398 nuclear mRNA splicing, via spliceosome, 0005634 nucleus
Probab=34.41 E-value=33 Score=23.97 Aligned_cols=31 Identities=29% Similarity=0.631 Sum_probs=21.9
Q ss_pred HHHHHhCCCCcchHHHHHHHHH----HHhccCCHH
Q psy3264 25 EEHKKKHPEENVVFAEFSKKCA----ERWKTMSEK 55 (156)
Q Consensus 25 ~~~~~~~p~~~~~~~ei~k~l~----~~Wk~Ls~~ 55 (156)
..+...+|.+...|.+|-+.|+ +.|-+|++.
T Consensus 93 e~~~~~~pkI~~QFaDLKR~La~VS~eeW~~IPE~ 127 (133)
T PF06424_consen 93 EKYRKENPKIQQQFADLKRSLATVSEEEWENIPEA 127 (133)
T ss_pred HhhhccCchHHHHHHHHHHHHccCCHHHHhcCCcc
Confidence 3445567777777888877776 588888863
No 71
>PF05914 RIB43A: RIB43A; InterPro: IPR008805 This family consists of several RIB43A-like eukaryotic proteins. Ciliary and flagellar microtubules contain a specialised set of protofilaments, termed ribbons, that are composed of tubulin and several associated proteins. RIB43A was first characterised in the unicellular biflagellate, Chlamydomonas reinhardtii although highly related sequences are present in several higher eukaryotes including humans. The function of this protein is unknown although the structure of RIB43A and its association with the specialised protofilament ribbons and with basal bodies is relevant to the proposed role of ribbons in forming and stabilising doublet and triplet microtubules and in organising their three-dimensional structure. Human RIB43A homologues could represent a structural requirement in centriole replication in dividing cells [].
Probab=34.19 E-value=29 Score=28.59 Aligned_cols=40 Identities=13% Similarity=0.363 Sum_probs=27.5
Q ss_pred ccccccCCCCCchh--HHHHHHHHhccCChHHHHHHHHHHHH
Q psy3264 87 GKIKQDNPEYGVGD--IAKELGKKWADVDPSVKTKYEQMAEK 126 (156)
Q Consensus 87 ~~~~~~~p~~~~~e--~~k~~~~~Wk~ls~~eK~~y~~~a~~ 126 (156)
..+..++|.+..+. --+.+...||+||+++...+...-+.
T Consensus 250 sdlLtEnp~~a~s~~gp~Rv~~d~wKGMs~eQl~~i~~~Q~~ 291 (379)
T PF05914_consen 250 SDLLTENPQVAQSSFGPHRVIPDRWKGMSPEQLEEIRKEQEQ 291 (379)
T ss_pred CccccCCchhccccCCCCCCCCcccCCCCHHHHHHHHHHHHH
Confidence 34666777654222 23567789999999999988874443
No 72
>cd07122 ALDH_F20_ACDH Coenzyme A acylating aldehyde dehydrogenase (ACDH), ALDH family 20-like. Coenzyme A acylating aldehyde dehydrogenase (ACDH, EC=1.2.1.10), an NAD+ and CoA-dependent acetaldehyde dehydrogenase, functions as a single enzyme (such as the Ethanolamine utilization protein, EutE, in Salmonella typhimurium) or as part of a multifunctional enzyme to convert acetaldehyde into acetyl-CoA . The E. coli aldehyde-alcohol dehydrogenase includes the functional domains, alcohol dehydrogenase (ADH), ACDH, and pyruvate-formate-lyase deactivase; and the Entamoeba histolytica aldehyde-alcohol dehydrogenase 2 (ALDH20A1) includes the functional domains ADH and ACDH and may be critical enzymes in the fermentative pathway.
Probab=32.82 E-value=1.3e+02 Score=25.23 Aligned_cols=41 Identities=12% Similarity=0.098 Sum_probs=35.0
Q ss_pred HHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy3264 101 IAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKNR 141 (156)
Q Consensus 101 ~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~ 141 (156)
.++..-..|..+|..+|..+...+....+++..++......
T Consensus 7 ~A~~A~~~W~~~~~~eR~~~L~~~a~~l~~~~eela~~~~~ 47 (436)
T cd07122 7 RARKAQREFATFSQEQVDKIVEAVAWAAADAAEELAKMAVE 47 (436)
T ss_pred HHHHHHHHHHhCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44555678999999999999999999999999999887544
No 73
>PF09791 Oxidored-like: Oxidoreductase-like protein, N-terminal; InterPro: IPR019180 This entry represents the N-terminal domain of various oxidoreductase-like proteins whose exact function is, as yet, unknown.
Probab=32.73 E-value=78 Score=17.97 Aligned_cols=18 Identities=11% Similarity=0.433 Sum_probs=10.9
Q ss_pred HHHHHHHHHHHHHHHHHh
Q psy3264 123 MAEKDKARYEKEMTAYKN 140 (156)
Q Consensus 123 ~a~~~k~~y~~e~~~~~~ 140 (156)
....+.++|...+++|++
T Consensus 30 ~Y~eel~~y~~~~~~~~~ 47 (48)
T PF09791_consen 30 VYAEELEEYREALAAWQE 47 (48)
T ss_pred HHHHHHHHHHHHHHHHHh
Confidence 445555666666666654
No 74
>PF11278 DUF3079: Protein of unknown function (DUF3079); InterPro: IPR021430 This family of proteins with unknown function appears to be restricted to Proteobacteria.
Probab=32.52 E-value=32 Score=19.65 Aligned_cols=13 Identities=23% Similarity=0.345 Sum_probs=10.9
Q ss_pred CCCCCCCCCCCCC
Q psy3264 1 MPKEKDVKPRGRM 13 (156)
Q Consensus 1 ~~k~~p~~PkrP~ 13 (156)
|-||.|..|+.|-
T Consensus 1 MaKkFPlhP~hPE 13 (52)
T PF11278_consen 1 MAKKFPLHPKHPE 13 (52)
T ss_pred CCCcCCCCCCCcc
Confidence 7889999998883
No 75
>cd07133 ALDH_CALDH_CalB Coniferyl aldehyde dehydrogenase-like. Coniferyl aldehyde dehydrogenase (CALDH, EC=1.2.1.68) of Pseudomonas sp. strain HR199 (CalB) which catalyzes the NAD+-dependent oxidation of coniferyl aldehyde to ferulic acid, and similar sequences, are present in this CD.
Probab=32.10 E-value=1.3e+02 Score=25.03 Aligned_cols=43 Identities=2% Similarity=-0.187 Sum_probs=36.1
Q ss_pred hHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHhcC
Q psy3264 100 DIAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKNRS 142 (156)
Q Consensus 100 e~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~~ 142 (156)
+.++..-..|+.+|..+|..+...+....+.+..++..-...-
T Consensus 5 ~~a~~a~~~w~~~~~~~R~~~L~~~a~~l~~~~~el~~~~~~e 47 (434)
T cd07133 5 ERQKAAFLANPPPSLEERRDRLDRLKALLLDNQDALAEAISAD 47 (434)
T ss_pred HHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3456666789999999999999999999999999998865543
No 76
>cd07085 ALDH_F6_MMSDH Methylmalonate semialdehyde dehydrogenase and ALDH family members 6A1 and 6B2. Methylmalonate semialdehyde dehydrogenase (MMSDH, EC=1.2.1.27) [acylating] from Bacillus subtilis is involved in valine metabolism and catalyses the NAD+- and CoA-dependent oxidation of methylmalonate semialdehyde into propionyl-CoA. Mitochondrial human MMSDH ALDH6A1 and Arabidopsis MMSDH ALDH6B2 are also present in this CD.
Probab=29.75 E-value=1.5e+02 Score=24.86 Aligned_cols=38 Identities=8% Similarity=-0.027 Sum_probs=31.9
Q ss_pred HHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy3264 103 KELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKN 140 (156)
Q Consensus 103 k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~ 140 (156)
+.....|..+|..+|..+...+.....++..++..-..
T Consensus 48 ~~A~~~w~~~~~~~R~~~L~~~a~~l~~~~~el~~~~~ 85 (478)
T cd07085 48 KAAFPAWSATPVLKRQQVMFKFRQLLEENLDELARLIT 85 (478)
T ss_pred HHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44446799999999999999999999999988877543
No 77
>cd07132 ALDH_F3AB Aldehyde dehydrogenase family 3 members A1, A2, and B1 and related proteins. NAD(P)+-dependent, aldehyde dehydrogenase, family 3 members A1 and B1 (ALDH3A1, ALDH3B1, EC=1.2.1.5) and fatty aldehyde dehydrogenase, family 3 member A2 (ALDH3A2, EC=1.2.1.3), and similar sequences are included in this CD. Human ALDH3A1 is a homodimer with a critical role in cellular defense against oxidative stress; it catalyzes the oxidation of various cellular membrane lipid-derived aldehydes. Corneal crystalline ALDH3A1 protects the cornea and underlying lens against UV-induced oxidative stress. Human ALDH3A2, a microsomal homodimer, catalyzes the oxidation of long-chain aliphatic aldehydes to fatty acids. Human ALDH3B1 is highly expressed in the kidney and liver and catalyzes the oxidation of various medium- and long-chain saturated and unsaturated aliphatic aldehydes.
Probab=29.24 E-value=1.5e+02 Score=24.66 Aligned_cols=42 Identities=10% Similarity=-0.063 Sum_probs=35.2
Q ss_pred hHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy3264 100 DIAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKNR 141 (156)
Q Consensus 100 e~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~ 141 (156)
+.++..-..|..++..+|..+...+....+.+..++..-...
T Consensus 5 ~~A~~A~~~w~~~~~~~R~~~L~~~a~~l~~~~~~l~~~~~~ 46 (443)
T cd07132 5 RRAREAFSSGKTRPLEFRIQQLEALLRMLEENEDEIVEALAK 46 (443)
T ss_pred HHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHhHHHHHHHHHH
Confidence 445666678999999999999999999999999998776544
No 78
>PF11521 TFIIE-A_C-term: C-terminal general transcription factor TFIIE alpha; InterPro: IPR021600 TFIIE is compiled of two subunits, alpha and beta. This family of proteins are the C-terminal domain of the alpha subunit of the protein which is the largest subunit and contains several functional domains which are important for basal transcription and cell growth. The C-terminal end of the protein binds directly to the amino-terminal PH domain of p62/Tfb1 (of IIH) which is involved in the recruitment of the general transcription factor IIH to the transcription preinitiation complex. P53 competes for the same binding site as TFIIE alpha which shows their structural similarity. Like p53, TFIIE alpha 336-439 can activate transcription in vivo []. ; PDB: 2RNR_A 2RNQ_A 2JTX_A.
Probab=28.57 E-value=61 Score=20.84 Aligned_cols=29 Identities=28% Similarity=0.448 Sum_probs=19.5
Q ss_pred CCchhHH--HHHHHHhccCChHHHHHHHHHHHHH
Q psy3264 96 YGVGDIA--KELGKKWADVDPSVKTKYEQMAEKD 127 (156)
Q Consensus 96 ~~~~e~~--k~~~~~Wk~ls~~eK~~y~~~a~~~ 127 (156)
.+..+|+ -.|- ..|+++||..|+......
T Consensus 50 ~~~~eVtq~p~LV---~qMT~~EKEaYi~v~Q~~ 80 (86)
T PF11521_consen 50 YPYSEVTQRPELV---AQMTPEEKEAYIQVGQEM 80 (86)
T ss_dssp EEHHHHHH-HHHH---HHS-HHHHHHHHHHHHHH
T ss_pred eehhhcCcchHHH---HHcCHHHHHHHHHHHHHH
Confidence 3456666 2333 489999999999987653
No 79
>TIGR00787 dctP tripartite ATP-independent periplasmic transporter solute receptor, DctP family. TRAP-T (Tripartite ATP-independent Periplasmic Transporter) family proteins generally consist of three components, and these systems have so far been found in Gram-negative bacteria, Gram-postive bacteria and archaea. The best characterized example is the DctPQM system of Rhodobacter capsulatus, a C4 dicarboxylate (malate, fumarate, succinate) transporter. This model represents the DctP family, one of at least three major families of extracytoplasmic solute receptor for TRAP family transporters. Other are the SnoM family (see pfam03480) and TAXI (TRAP-associated extracytoplasmic immunogenic) family.
Probab=28.54 E-value=1.4e+02 Score=22.64 Aligned_cols=29 Identities=14% Similarity=0.263 Sum_probs=22.3
Q ss_pred HHHhccCChHHHHHHHHHHHHHHHHHHHH
Q psy3264 106 GKKWADVDPSVKTKYEQMAEKDKARYEKE 134 (156)
Q Consensus 106 ~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e 134 (156)
...|..||++.|....+.+...-......
T Consensus 213 ~~~~~~L~~e~q~~i~~a~~~~~~~~~~~ 241 (257)
T TIGR00787 213 KAFWKSLPPDLQAVVKEAAKEAGEYQRKL 241 (257)
T ss_pred HHHHhcCCHHHHHHHHHHHHHHHHHHHHH
Confidence 46799999999999988877765544443
No 80
>PRK12751 cpxP periplasmic stress adaptor protein CpxP; Reviewed
Probab=28.00 E-value=1e+02 Score=22.22 Aligned_cols=32 Identities=9% Similarity=0.264 Sum_probs=25.1
Q ss_pred HHHHHHHHHhccCCHHHHHHHHHhhhhhhhhh
Q psy3264 40 EFSKKCAERWKTMSEKEKKRFHEMADKDKKRY 71 (156)
Q Consensus 40 ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y 71 (156)
+..+..-+++..||+++|..|.+..++.-...
T Consensus 118 ~~~~~~~qmy~lLTPEQra~l~~~~e~r~~~~ 149 (162)
T PRK12751 118 EMAKVRNQMYNLLTPEQKEALNKKHQERIEKL 149 (162)
T ss_pred HHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHH
Confidence 44466678899999999999999887755544
No 81
>PF02026 RyR: RyR domain; InterPro: IPR003032 This domain is called RyR for Ryanodine receptor []. The domain is found in four copies in the ryanodine receptor. The function of this domain is unknown.; PDB: 4ETV_A 3RQR_A 4ETT_A 4ERT_A 4ESU_A 4ETU_A 4ERV_A 3NRT_E.
Probab=27.91 E-value=61 Score=21.06 Aligned_cols=19 Identities=26% Similarity=0.511 Sum_probs=16.0
Q ss_pred hccCCHHHHHHHHHhhhhh
Q psy3264 49 WKTMSEKEKKRFHEMADKD 67 (156)
Q Consensus 49 Wk~Ls~~eK~~y~~~a~~~ 67 (156)
|..||+.+|+.+...+..-
T Consensus 61 y~~L~e~eK~~dr~~~~e~ 79 (94)
T PF02026_consen 61 YDELSEEEKEKDRDMVRET 79 (94)
T ss_dssp GGGS-HHHHHHHHHHHHHH
T ss_pred hhhCCHHHHHHhHHHHHHH
Confidence 9999999999999988763
No 82
>cd07087 ALDH_F3-13-14_CALDH-like ALDH subfamily: Coniferyl aldehyde dehydrogenase, ALDH families 3, 13, and 14, and other related proteins. ALDH subfamily which includes NAD(P)+-dependent, aldehyde dehydrogenase, family 3 member A1 and B1 (ALDH3A1, ALDH3B1, EC=1.2.1.5) and fatty aldehyde dehydrogenase, family 3 member A2 (ALDH3A2, EC=1.2.1.3), and also plant ALDH family members ALDH3F1, ALDH3H1, and ALDH3I1, fungal ALDH14 (YMR110C) and the protozoan family 13 member (ALDH13), as well as coniferyl aldehyde dehydrogenases (CALDH, EC=1.2.1.68), and other similar sequences, such as the Pseudomonas putida benzaldehyde dehydrogenase I that is involved in the metabolism of mandelate.
Probab=27.88 E-value=1.7e+02 Score=24.22 Aligned_cols=42 Identities=10% Similarity=-0.102 Sum_probs=34.9
Q ss_pred hHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy3264 100 DIAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKNR 141 (156)
Q Consensus 100 e~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~ 141 (156)
+.++..-..|..++..+|..+...+....+++..++..-...
T Consensus 5 ~~a~~a~~~w~~~~~~~R~~~L~~~a~~l~~~~~el~~~~~~ 46 (426)
T cd07087 5 ARLRETFLTGKTRSLEWRKAQLKALKRMLTENEEEIAAALYA 46 (426)
T ss_pred HHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHhHHHHHHHHHH
Confidence 345556678999999999999999999999999998876544
No 83
>PF05388 Carbpep_Y_N: Carboxypeptidase Y pro-peptide; InterPro: IPR008442 This signature is found at the N terminus of carboxypeptidase Y, which belong to MEROPS peptidase family S10. This region contains the signal peptide and pro-peptide regions [,].; GO: 0004185 serine-type carboxypeptidase activity, 0005773 vacuole
Probab=27.88 E-value=1.1e+02 Score=20.66 Aligned_cols=29 Identities=24% Similarity=0.275 Sum_probs=25.6
Q ss_pred HHHHHHHHHHHhccCCHHHHHHHHHhhhh
Q psy3264 38 FAEFSKKCAERWKTMSEKEKKRFHEMADK 66 (156)
Q Consensus 38 ~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~ 66 (156)
+..+++.+++..+.||.+-|..|.++...
T Consensus 45 ~~~~~~~l~e~l~~Lt~e~k~~W~E~~~~ 73 (113)
T PF05388_consen 45 LEKISKYLNEPLKSLTSEAKALWDEMMLL 73 (113)
T ss_pred HHHHHHHHHHHHhhccHHHHHHHHHHHHH
Confidence 46777889999999999999999998865
No 84
>PF06628 Catalase-rel: Catalase-related immune-responsive; InterPro: IPR010582 Catalases (1.11.1.6 from EC) are antioxidant enzymes that catalyse the conversion of hydrogen peroxide to water and molecular oxygen, serving to protect cells from its toxic effects []. Hydrogen peroxide is produced as a consequence of oxidative cellular metabolism and can be converted to the highly reactive hydroxyl radical via transition metals, this radical being able to damage a wide variety of molecules within a cell, leading to oxidative stress and cell death. Catalases act to neutralise hydrogen peroxide toxicity, and are produced by all aerobic organisms ranging from bacteria to man. Most catalases are mono-functional, haem-containing enzymes, although there are also bifunctional haem-containing peroxidase/catalases (IPR000763 from INTERPRO) that are closely related to plant peroxidases, and non-haem, manganese-containing catalases (IPR007760 from INTERPRO) that are found in bacteria []. This entry represents a small conserved region within catalase enzymes that carries the immune-responsive amphipathic octa-peptide that is recognised by T cells [].; PDB: 2CAH_A 1NM0_A 1H7K_A 1E93_A 1H6N_A 3HB6_A 2CAG_A 1M85_A 1MQF_A 1A4E_C ....
Probab=27.47 E-value=55 Score=19.74 Aligned_cols=20 Identities=20% Similarity=0.521 Sum_probs=16.2
Q ss_pred HHHHHhccCCHHHHHHHHHh
Q psy3264 44 KCAERWKTMSEKEKKRFHEM 63 (156)
Q Consensus 44 ~l~~~Wk~Ls~~eK~~y~~~ 63 (156)
..+..|+.|++.+|+.+..-
T Consensus 12 Qa~~ly~~l~~~er~~lv~n 31 (68)
T PF06628_consen 12 QARDLYRVLSDEERERLVEN 31 (68)
T ss_dssp HHHHHHHHSSHHHHHHHHHH
T ss_pred hHHHHHHHCCHHHHHHHHHH
Confidence 44667999999999988853
No 85
>cd08317 Death_ank Death domain associated with Ankyrins. Death Domain (DD) associated with Ankyrins. Ankyrins are modular proteins comprising three conserved domains, an N-terminal membrane-binding domain containing ANK repeats, a spectrin-binding domain and a C-terminal DD. Ankyrins function as adaptor proteins and they interact, through ANK repeats, with structurally diverse membrane proteins, including ion channels/pumps, calcium release channels, and cell adhesion molecules. They play critical roles in the proper expression and membrane localization of these proteins. In mammals, this family includes ankyrin-R for restricted (or ANK1), ankyrin-B for broadly expressed (or ANK2) and ankyrin-G for general or giant (or ANK3). They are expressed in different combinations in many tissues and play non-overlapping functions. In general, DDs are protein-protein interaction domains found in a variety of domain architectures. Their common feature is that they form homodimers by self-associati
Probab=27.38 E-value=26 Score=22.05 Aligned_cols=21 Identities=24% Similarity=0.613 Sum_probs=17.1
Q ss_pred CCCCchhHHHHHHHHhccCCh
Q psy3264 94 PEYGVGDIAKELGKKWADVDP 114 (156)
Q Consensus 94 p~~~~~e~~k~~~~~Wk~ls~ 114 (156)
+.+.+..|++.||..|+.|-.
T Consensus 3 ~~~~l~~ia~~lG~dW~~LAr 23 (84)
T cd08317 3 ADIRLADISNLLGSDWPQLAR 23 (84)
T ss_pred ccchHHHHHHHHhhHHHHHHH
Confidence 456788999999999998743
No 86
>PRK13252 betaine aldehyde dehydrogenase; Provisional
Probab=26.47 E-value=1.8e+02 Score=24.53 Aligned_cols=39 Identities=18% Similarity=0.191 Sum_probs=33.1
Q ss_pred HHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy3264 103 KELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKNR 141 (156)
Q Consensus 103 k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~ 141 (156)
+.....|..+|..+|..+...+......+..++......
T Consensus 54 ~~a~~~w~~~~~~~R~~~L~~~a~~l~~~~~ela~~~~~ 92 (488)
T PRK13252 54 KQGQKIWAAMTAMERSRILRRAVDILRERNDELAALETL 92 (488)
T ss_pred HHHHHHHhcCCHHHHHHHHHHHHHHHHHhHHHHHHHHHH
Confidence 445567999999999999999999999999998876543
No 87
>cd07150 ALDH_VaniDH_like Pseudomonas putida vanillin dehydrogenase-like. Vanillin dehydrogenase (Vdh, VaniDH) involved in the metabolism of ferulic acid and other related sequences are included in this CD. The E. coli vanillin dehydrogenase (LigV) preferred NAD+ to NADP+ and exhibited a broad substrate preference, including vanillin, benzaldehyde, protocatechualdehyde, m-anisaldehyde, and p-hydroxybenzaldehyde.
Probab=26.45 E-value=1.8e+02 Score=24.10 Aligned_cols=37 Identities=16% Similarity=0.187 Sum_probs=31.1
Q ss_pred HHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3264 103 KELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYK 139 (156)
Q Consensus 103 k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~ 139 (156)
+..-..|..++..+|..+...+....+.+..++..-.
T Consensus 31 ~~A~~~w~~~~~~~R~~~L~~~a~~l~~~~~ela~~~ 67 (451)
T cd07150 31 YDAFPAWAATTPSERERILLKAAEIMERRADDLIDLL 67 (451)
T ss_pred HHHHHHHhcCCHHHHHHHHHHHHHHHHHhHHHHHHHH
Confidence 4445679999999999999999999999988877653
No 88
>cd07152 ALDH_BenzADH NAD-dependent benzaldehyde dehydrogenase II-like. NAD-dependent, benzaldehyde dehydrogenase II (XylC, BenzADH, EC=1.2.1.28) is involved in the oxidation of benzyl alcohol to benzoate. In Acinetobacter calcoaceticus, this process is carried out by the chromosomally encoded, benzyl alcohol dehydrogenase (xylB) and benzaldehyde dehydrogenase II (xylC) enzymes; whereas in Pseudomonas putida they are encoded by TOL plasmids.
Probab=25.60 E-value=2e+02 Score=23.83 Aligned_cols=40 Identities=18% Similarity=0.238 Sum_probs=32.9
Q ss_pred HHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy3264 102 AKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKNR 141 (156)
Q Consensus 102 ~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~ 141 (156)
++..-..|..+|.++|..+...+......+..++......
T Consensus 22 A~~A~~~w~~~~~~~R~~~L~~~a~~l~~~~~ela~~~~~ 61 (443)
T cd07152 22 AAAAQRAWAATPPRERAAVLRRAADLLEEHADEIADWIVR 61 (443)
T ss_pred HHHHHHHHhcCCHHHHHHHHHHHHHHHHHhHHHHHHHHHH
Confidence 3444568999999999999999999999999888866443
No 89
>PRK13968 putative succinate semialdehyde dehydrogenase; Provisional
Probab=25.36 E-value=2e+02 Score=24.13 Aligned_cols=53 Identities=17% Similarity=0.144 Sum_probs=40.2
Q ss_pred ccccCCCCCchhHH------HHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy3264 89 IKQDNPEYGVGDIA------KELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKNR 141 (156)
Q Consensus 89 ~~~~~p~~~~~e~~------k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~ 141 (156)
+....|..+..++. +..-..|..++..+|..+...+.....++..++......
T Consensus 19 ~i~~~~~~~~~~v~~av~~A~~a~~~w~~~~~~~R~~~L~~~a~~l~~~~~ela~~~~~ 77 (462)
T PRK13968 19 QLSVLPWAGADDIENALQLAAAGFRDWRETNIDYRAQKLRDIGKALRARSEEMAQMITR 77 (462)
T ss_pred EEEEEeCCCHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHhHHHHHHHHHHH
Confidence 44455555555543 444557999999999999999999999999999886543
No 90
>cd07118 ALDH_SNDH Gluconobacter oxydans L-sorbosone dehydrogenase-like. Included in this CD is the L-sorbosone dehydrogenase (SNDH) from Gluconobacter oxydans UV10. In G. oxydans, D-sorbitol is converted to 2-keto-L-gulonate (a precursor of L-ascorbic acid) in sequential oxidation steps catalyzed by a FAD-dependent, L-sorbose dehydrogenase and an NAD(P)+-dependent, L-sorbosone dehydrogenase.
Probab=25.26 E-value=2e+02 Score=24.01 Aligned_cols=51 Identities=8% Similarity=0.086 Sum_probs=37.2
Q ss_pred cccCCCCCchhHHHHHHH--------HhccCChHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy3264 90 KQDNPEYGVGDIAKELGK--------KWADVDPSVKTKYEQMAEKDKARYEKEMTAYKN 140 (156)
Q Consensus 90 ~~~~p~~~~~e~~k~~~~--------~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~ 140 (156)
....|..+..++...+.. .|..+|-.+|..+...+......+..++..-..
T Consensus 10 i~~~~~~~~~~v~~av~~A~~a~~~~~w~~~~~~~R~~~l~~~a~~l~~~~~~la~~~~ 68 (454)
T cd07118 10 VARYAEGTVEDVDAAVAAARKAFDKGPWPRMSGAERAAVLLKVADLIRARRERLALIET 68 (454)
T ss_pred EEEEeCCCHHHHHHHHHHHHHHcCCCccccCCHHHHHHHHHHHHHHHHHhHHHHHHHHH
Confidence 344455555555433322 299999999999999999999999999877543
No 91
>PRK10455 periplasmic protein; Reviewed
Probab=25.14 E-value=1.5e+02 Score=21.30 Aligned_cols=27 Identities=19% Similarity=0.245 Sum_probs=20.5
Q ss_pred hHHHHHHHHhccCChHHHHHHHHHHHH
Q psy3264 100 DIAKELGKKWADVDPSVKTKYEQMAEK 126 (156)
Q Consensus 100 e~~k~~~~~Wk~ls~~eK~~y~~~a~~ 126 (156)
..++.....+.-||+++|+.|.+..++
T Consensus 118 ~~~~~~~qiy~vLTPEQr~q~~~~~ek 144 (161)
T PRK10455 118 AHMETQNKIYNVLTPEQKKQFNANFEK 144 (161)
T ss_pred HHHHHHHHHHHhCCHHHHHHHHHHHHH
Confidence 344556678999999999999985543
No 92
>PF00226 DnaJ: DnaJ domain; InterPro: IPR001623 The prokaryotic heat shock protein DnaJ interacts with the chaperone hsp70-like DnaK protein []. Structurally, the DnaJ protein consists of an N-terminal conserved domain (called 'J' domain) of about 70 amino acids, a glycine-rich region ('G' domain') of about 30 residues, a central domain containing four repeats of a CXXCXGXG motif ('CRR' domain) and a C-terminal region of 120 to 170 residues. Such a structure is shown in the following schematic representation: +------------+-+-------+-----+-----------+--------------------------------+ | N-terminal | | Gly-R | | CXXCXGXG | C-terminal | +------------+-+-------+-----+-----------+--------------------------------+ It is thought that the 'J' domain of DnaJ mediates the interaction with the dnaK protein and consists of four helices, the second of which has a charged surface that includes at least one pair of basic residues that are essential for interaction with the ATPase domain of Hsp70. The J- and CRR-domains are found in many prokaryotic and eukaryotic proteins [], either together or separately. In yeast, J-domains have been classified into 3 groups; the class III proteins are functionally distinct and do not appear to act as molecular chaperones []. ; GO: 0031072 heat shock protein binding; PDB: 2GUZ_C 2L6L_A 1HDJ_A 2EJ7_A 1FPO_C 2CUG_A 2QSA_A 2OCH_A 3BVO_B 3APQ_A ....
Probab=25.09 E-value=1.1e+02 Score=17.55 Aligned_cols=38 Identities=24% Similarity=0.357 Sum_probs=26.1
Q ss_pred HHHHHHHHHhCCCCcchHH----HHHHHHHHHhccCCHHHHH
Q psy3264 21 QVCREEHKKKHPEENVVFA----EFSKKCAERWKTMSEKEKK 58 (156)
Q Consensus 21 ~e~r~~~~~~~p~~~~~~~----ei~k~l~~~Wk~Ls~~eK~ 58 (156)
+..+..++.-||+...... +.+..|..-|..|++..+.
T Consensus 19 ~~y~~l~~~~HPD~~~~~~~~~~~~~~~i~~Ay~~L~~~~~R 60 (64)
T PF00226_consen 19 KAYRRLSKQYHPDKNSGDEAEAEEKFARINEAYEILSDPERR 60 (64)
T ss_dssp HHHHHHHHHTSTTTGTSTHHHHHHHHHHHHHHHHHHHSHHHH
T ss_pred HHHHhhhhccccccchhhhhhhhHHHHHHHHHHHHhCCHHHH
Confidence 3445556666888643223 7889999999998877663
No 93
>cd07084 ALDH_KGSADH-like ALDH subfamily: NAD(P)+-dependent alpha-ketoglutaric semialdehyde dehydrogenases and plant delta(1)-pyrroline-5-carboxylate dehydrogenase, ALDH family 12-like. ALDH subfamily which includes the NAD(P)+-dependent, alpha-ketoglutaric semialdehyde dehydrogenases (KGSADH, EC 1.2.1.26); plant delta(1)-pyrroline-5-carboxylate dehydrogenase (P5CDH, EC=1.5.1.12 ), ALDH family 12; the N-terminal domain of the MaoC (monoamine oxidase C) dehydratase regulatory protein; and orthologs of MaoC, PaaZ and PaaN, which are putative ring-opening enzymes of the aerobic phenylacetic acid catabolic pathway.
Probab=24.88 E-value=1.9e+02 Score=24.15 Aligned_cols=42 Identities=10% Similarity=-0.113 Sum_probs=35.6
Q ss_pred hHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy3264 100 DIAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKNR 141 (156)
Q Consensus 100 e~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~ 141 (156)
+.++.....|+.++..+|......+....+.+..++......
T Consensus 6 ~~A~~A~~~W~~~~~~~R~~~L~~~a~~l~~~~~ela~~~~~ 47 (442)
T cd07084 6 LAADISTKAARRLALPKRADFLARIIQRLAAKSYDIAAGAVL 47 (442)
T ss_pred HHHHHHHHHHHhCCHHHHHHHHHHHHHHHHhhHHHHHHHHHH
Confidence 445666678999999999999999999999999998877543
No 94
>cd07101 ALDH_SSADH2_GabD2 Mycobacterium tuberculosis succinate-semialdehyde dehydrogenase 2-like. Succinate-semialdehyde dehydrogenase 2 (SSADH2) and similar proteins are in this CD. SSADH1 (GabD1, EC=1.2.1.16) catalyzes the NADP(+)-dependent oxidation of succinate semialdehyde to succinate. SSADH activity in Mycobacterium tuberculosis is encoded by both gabD1 (Rv0234c) and gabD2 (Rv1731), however ,the Vmax of GabD1 was shown to be much higher than that of GabD2, and GabD2 (SSADH2) is likely to serve physiologically as a dehydrogenase for a different aldehyde(s).
Probab=24.87 E-value=2e+02 Score=23.89 Aligned_cols=50 Identities=10% Similarity=0.078 Sum_probs=37.5
Q ss_pred cCCCCCchhHH------HHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy3264 92 DNPEYGVGDIA------KELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKNR 141 (156)
Q Consensus 92 ~~p~~~~~e~~------k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~ 141 (156)
..+..+..++. +.....|..+|.++|..+...+......+..++......
T Consensus 11 ~~~~~~~~~v~~av~~A~~a~~~w~~~~~~~R~~~L~~~a~~l~~~~~el~~~~~~ 66 (454)
T cd07101 11 ELPQSTPADVEAAFARARAAQRAWAARPFAERAAVFLRFHDLVLERRDELLDLIQL 66 (454)
T ss_pred EEcCCCHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHhHHHHHHHHHH
Confidence 34444555543 444567999999999999999999999999888776543
No 95
>PF03480 SBP_bac_7: Bacterial extracellular solute-binding protein, family 7; InterPro: IPR018389 This family of proteins are involved in binding extracellular solutes for transport across the bacterial cytoplasmic membrane. This family includes a C4-dicarboxylate-binding protein DctP [, ] and the sialic acid-binding protein SiaP. The structure of the SiaP receptor has revealed an overall topology similar to ATP binding cassette ESR (extracytoplasmic solute receptors) proteins []. Upon binding of sialic acid, SiaP undergoes domain closure about a hinge region and kinking of an alpha-helix hinge component [].; GO: 0006810 transport, 0030288 outer membrane-bounded periplasmic space; PDB: 2HZK_C 2HZL_B 2HPG_C 2XWI_A 2XWK_A 2WX9_A 2CEY_A 2WYP_A 3B50_A 2CEX_B ....
Probab=24.87 E-value=1.5e+02 Score=22.73 Aligned_cols=32 Identities=16% Similarity=0.262 Sum_probs=23.5
Q ss_pred HHHhccCChHHHHHHHHHHHHHHHHHHHHHHH
Q psy3264 106 GKKWADVDPSVKTKYEQMAEKDKARYEKEMTA 137 (156)
Q Consensus 106 ~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~ 137 (156)
...|.+||++.|+...+.+...-.........
T Consensus 213 ~~~w~~L~~e~q~~l~~~~~~~~~~~~~~~~~ 244 (286)
T PF03480_consen 213 KDWWDSLPDEDQEALDDAADEAEARAREYYEA 244 (286)
T ss_dssp HHHHHHS-HHHHHHHHHHHHHHHHHHHHHHHH
T ss_pred HHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 36799999999999999888776555544443
No 96
>PRK11241 gabD succinate-semialdehyde dehydrogenase I; Provisional
Probab=24.72 E-value=2e+02 Score=24.32 Aligned_cols=53 Identities=6% Similarity=0.056 Sum_probs=38.3
Q ss_pred ccccCCCCCchhHH------HHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy3264 89 IKQDNPEYGVGDIA------KELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKNR 141 (156)
Q Consensus 89 ~~~~~p~~~~~e~~------k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~ 141 (156)
.....+..+..++. +..-..|..+|..+|..+...+....+.+..++..-...
T Consensus 38 ~v~~~~~~~~~~v~~av~~A~~a~~~W~~~~~~~R~~~L~~~a~~l~~~~~ela~~~~~ 96 (482)
T PRK11241 38 KLGSVPKMGADETRAAIDAANRALPAWRALTAKERANILRRWFNLMMEHQDDLARLMTL 96 (482)
T ss_pred EEEEEeCCCHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHhHHHHHHHHHH
Confidence 33344444555544 333346999999999999999999999999998876543
No 97
>cd07077 ALDH-like NAD(P)+-dependent aldehyde dehydrogenase-like (ALDH-like) family. The aldehyde dehydrogenase-like (ALDH-like) group of the ALDH superfamily of NAD(P)+-dependent enzymes which, in general, oxidize a wide range of endogenous and exogenous aliphatic and aromatic aldehydes to their corresponding carboxylic acids and play an important role in detoxification. This group includes families ALDH18, ALDH19, and ALDH20 and represents such proteins as gamma-glutamyl phosphate reductase, LuxC-like acyl-CoA reductase, and coenzyme A acylating aldehyde dehydrogenase. All of these proteins have a conserved cysteine that aligns with the catalytic cysteine of the ALDH group.
Probab=24.26 E-value=1.9e+02 Score=23.61 Aligned_cols=39 Identities=8% Similarity=-0.049 Sum_probs=32.8
Q ss_pred HHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy3264 102 AKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKN 140 (156)
Q Consensus 102 ~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~ 140 (156)
++.....|..+|..+|..+........+++..++..-..
T Consensus 3 A~~a~~~w~~~~~~~R~~~L~~~a~~l~~~~~~la~~~~ 41 (397)
T cd07077 3 AKNAQRTLAVNHDEQRDLIINAIANALYDTRQRLASEAV 41 (397)
T ss_pred HHHHHHHHHhcCHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 455667899999999999999999999999988776443
No 98
>cd07099 ALDH_DDALDH Methylomonas sp. 4,4'-diapolycopene-dialdehyde dehydrogenase-like. The 4,4'-diapolycopene-dialdehyde dehydrogenase (DDALDH) involved in C30 carotenoid synthesis in Methylomonas sp. strain 16a and other similar sequences are present in this CD. DDALDH converts 4,4'-diapolycopene-dialdehyde into 4,4'-diapolycopene-diacid.
Probab=24.16 E-value=2.2e+02 Score=23.60 Aligned_cols=38 Identities=13% Similarity=0.165 Sum_probs=31.2
Q ss_pred HHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3264 102 AKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYK 139 (156)
Q Consensus 102 ~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~ 139 (156)
++.....|..++..+|..+...+......+..++..-.
T Consensus 27 a~~a~~~w~~~~~~~R~~~L~~~a~~l~~~~~~la~~~ 64 (453)
T cd07099 27 ARAAQRAWAALGVEGRAQRLLRWKRALADHADELAELL 64 (453)
T ss_pred HHHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 34445679999999999999999999988888877643
No 99
>cd07108 ALDH_MGR_2402 Magnetospirillum NAD(P)+-dependent aldehyde dehydrogenase MSR-1-like. NAD(P)+-dependent aldehyde dehydrogenase of Magnetospirillum gryphiswaldense MSR-1 (MGR_2402) , and other similar sequences, are present in this CD.
Probab=24.01 E-value=2.2e+02 Score=23.71 Aligned_cols=39 Identities=13% Similarity=0.083 Sum_probs=32.9
Q ss_pred HHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy3264 103 KELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKNR 141 (156)
Q Consensus 103 k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~ 141 (156)
+.....|..+|..+|..+...+......+..++......
T Consensus 29 ~~a~~~w~~~~~~~R~~~L~~~a~~l~~~~~ela~~~~~ 67 (457)
T cd07108 29 KAAFPEWAATPARERGKLLARIADALEARSEELARLLAL 67 (457)
T ss_pred HHHHHHHhcCCHHHHHHHHHHHHHHHHHhHHHHHHHHHH
Confidence 445567999999999999999999999999998776544
No 100
>PRK00197 proA gamma-glutamyl phosphate reductase; Provisional
Probab=23.88 E-value=2.2e+02 Score=23.48 Aligned_cols=40 Identities=18% Similarity=0.106 Sum_probs=34.2
Q ss_pred HHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy3264 101 IAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKN 140 (156)
Q Consensus 101 ~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~ 140 (156)
.++..-..|..+|..+|..+...+....+.+..++.....
T Consensus 12 ~A~~a~~~w~~~~~~~R~~~L~~~a~~l~~~~~~la~~~~ 51 (417)
T PRK00197 12 RAKAASRKLAQLSTAQKNRALLAIADALEANAAEILAANA 51 (417)
T ss_pred HHHHHHHHHHhCCHHHHHHHHHHHHHHHHhhHHHHHHHHH
Confidence 3455567899999999999999999999999999888654
No 101
>TIGR01804 BADH glycine betaine aldehyde dehydrogenase. Betaine aldehyde dehydrogenase is a member of the aldehyde dehydrogenase family (pfam00171).
Probab=23.69 E-value=2.2e+02 Score=23.83 Aligned_cols=38 Identities=11% Similarity=0.255 Sum_probs=32.5
Q ss_pred HHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy3264 103 KELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKN 140 (156)
Q Consensus 103 k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~ 140 (156)
+.....|..+|..+|..+...+......+..++.....
T Consensus 45 ~~a~~~w~~~~~~~R~~~L~~~a~~l~~~~~~la~~~~ 82 (467)
T TIGR01804 45 RRAQGEWASMTPEERGRILRRIAELIRERNEELAKLET 82 (467)
T ss_pred HHHHHHHhcCCHHHHHHHHHHHHHHHHHhHHHHHHHHH
Confidence 44446799999999999999999999999999887643
No 102
>PF07813 LTXXQ: LTXXQ motif family protein; InterPro: IPR012899 This five residue motif is found in a number of bacterial proteins bearing similarity to the protein CpxP (P32158 from SWISSPROT). This is a periplasmic protein that aids in combating extracytoplasmic protein-mediated toxicity, and may also be involved in the response to alkaline pH []. Another member of this family, Spy (P77754 from SWISSPROT) is also a periplasmic protein that may be involved in the response to stress []. The homology between CpxP and Spy may indicate that these two proteins are functionally related []. The motif is found repeated twice in many members of this entry. ; GO: 0042597 periplasmic space; PDB: 3ITF_B 3QZC_B 3OEO_D 3O39_A.
Probab=23.31 E-value=1.3e+02 Score=18.79 Aligned_cols=24 Identities=29% Similarity=0.441 Sum_probs=18.1
Q ss_pred hHHHHHHHHhccCChHHHHHHHHH
Q psy3264 100 DIAKELGKKWADVDPSVKTKYEQM 123 (156)
Q Consensus 100 e~~k~~~~~Wk~ls~~eK~~y~~~ 123 (156)
.........+..||+++|..|..+
T Consensus 76 ~~~~~~~~~~~vLt~eQk~~~~~l 99 (100)
T PF07813_consen 76 ERAKAQHALYAVLTPEQKEKFDQL 99 (100)
T ss_dssp HHHHHHHHHHTTS-HHHHHHHHHH
T ss_pred HHHHHHHHHHhcCCHHHHHHHHHh
Confidence 355667778999999999998764
No 103
>PTZ00037 DnaJ_C chaperone protein; Provisional
Probab=23.22 E-value=1.8e+02 Score=24.32 Aligned_cols=43 Identities=21% Similarity=0.317 Sum_probs=32.2
Q ss_pred HHHHHHHHHhCCCCcchHHHHHHHHHHHhccCCHHHHHHHHHhh
Q psy3264 21 QVCREEHKKKHPEENVVFAEFSKKCAERWKTMSEKEKKRFHEMA 64 (156)
Q Consensus 21 ~e~r~~~~~~~p~~~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a 64 (156)
+..|...++-||+...+ .+.++.|.+.|..||+.+|....+..
T Consensus 47 kAYrkla~k~HPDk~~~-~e~F~~i~~AYevLsD~~kR~~YD~~ 89 (421)
T PTZ00037 47 KAYRKLAIKHHPDKGGD-PEKFKEISRAYEVLSDPEKRKIYDEY 89 (421)
T ss_pred HHHHHHHHHHCCCCCch-HHHHHHHHHHHHHhccHHHHHHHhhh
Confidence 34455666779997543 58889999999999999877666653
No 104
>cd07131 ALDH_AldH-CAJ73105 Uncharacterized Candidatus kuenenia aldehyde dehydrogenase AldH (CAJ73105)-like. Uncharacterized aldehyde dehydrogenase of Candidatus kuenenia AldH (locus CAJ73105) and similar sequences with similarity to alpha-aminoadipic semialdehyde dehydrogenase (AASADH, human ALDH7A1, EC=1.2.1.31), Arabidopsis ALDH7B4, and Streptomyces clavuligerus delta-1-piperideine-6-carboxylate dehydrogenase (P6CDH) are included in this CD.
Probab=23.16 E-value=2.3e+02 Score=23.70 Aligned_cols=37 Identities=16% Similarity=0.239 Sum_probs=31.2
Q ss_pred HHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3264 103 KELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYK 139 (156)
Q Consensus 103 k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~ 139 (156)
+.....|..++.++|..+...+.....++..++..-.
T Consensus 47 ~~A~~~w~~~~~~~R~~~l~~~a~~l~~~~~ela~~~ 83 (478)
T cd07131 47 REAFPEWRKVPAPRRAEYLFRAAELLKKRKEELARLV 83 (478)
T ss_pred HHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4445579999999999999999999999998887654
No 105
>cd07098 ALDH_F15-22 Aldehyde dehydrogenase family 15A1 and 22A1-like. Aldehyde dehydrogenase family members ALDH15A1 (Saccharomyces cerevisiae YHR039C) and ALDH22A1 (Arabidopsis thaliana, EC=1.2.1.3), and similar sequences, are in this CD. Significant improvement of stress tolerance in tobacco plants was observed by overexpressing the ALDH22A1 gene from maize (Zea mays) and was accompanied by a reduction of malondialdehyde derived from cellular lipid peroxidation.
Probab=23.16 E-value=2.5e+02 Score=23.41 Aligned_cols=37 Identities=14% Similarity=0.162 Sum_probs=30.8
Q ss_pred HHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3264 103 KELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYK 139 (156)
Q Consensus 103 k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~ 139 (156)
+.....|..+|..+|..+...+.....++..++....
T Consensus 28 ~~A~~~w~~~~~~~R~~~L~~~a~~l~~~~~ela~~~ 64 (465)
T cd07098 28 RAAQREWAKTSFAERRKVLRSLLKYILENQEEICRVA 64 (465)
T ss_pred HHHHHHhccCCHHHHHHHHHHHHHHHHHhHHHHHHHH
Confidence 4445679999999999999999999988888877553
No 106
>cd07137 ALDH_F3FHI Plant aldehyde dehydrogenase family 3 members F1, H1, and I1 and related proteins. Aldehyde dehydrogenase family members 3F1, 3H1, and 3I1 (ALDH3F1, ALDH3H1, and ALDH3I1), and similar plant sequences, are in this CD. In Arabidopsis thaliana, stress-regulated expression of ALDH3I1 was observed in leaves and osmotic stress expression of ALDH3H1 was observed in root tissue, whereas, ALDH3F1 expression was not stress responsive. Functional analysis of ALDH3I1 suggest it may be involved in a detoxification pathway in plants that limits aldehyde accumulation and oxidative stress.
Probab=22.97 E-value=2.4e+02 Score=23.46 Aligned_cols=42 Identities=7% Similarity=-0.114 Sum_probs=34.8
Q ss_pred hHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy3264 100 DIAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKNR 141 (156)
Q Consensus 100 e~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~ 141 (156)
+.++..-..|..++..+|..+...+.....++..++..-...
T Consensus 6 ~~a~~a~~~w~~~~~~~R~~~L~~~a~~l~~~~~~l~~~~~~ 47 (432)
T cd07137 6 RELRETFRSGRTRSAEWRKSQLKGLLRLVDENEDDIFAALRQ 47 (432)
T ss_pred HHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 345666678999999999999999999999999998765443
No 107
>cd07147 ALDH_F21_RNP123 Aldehyde dehydrogenase family 21A1-like. Aldehyde dehydrogenase ALDH21A1 (gene name RNP123) was first described in the moss Tortula ruralis and is believed to play an important role in the detoxification of aldehydes generated in response to desiccation- and salinity-stress, and ALDH21A1 expression represents a unique stress tolerance mechanism. So far, of plants, only the bryophyte sequence has been observed, but similar protein sequences from bacteria and archaea are also present in this CD.
Probab=22.80 E-value=2.4e+02 Score=23.40 Aligned_cols=39 Identities=5% Similarity=-0.135 Sum_probs=32.0
Q ss_pred HHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy3264 102 AKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKN 140 (156)
Q Consensus 102 ~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~ 140 (156)
++.....|..+|..+|......+......+..++..-..
T Consensus 30 A~~A~~~w~~~~~~~R~~~L~~~a~~l~~~~~~la~~~~ 68 (452)
T cd07147 30 AVKAFRPMRALPAHRRAAILLHCVARLEERFEELAETIV 68 (452)
T ss_pred HHHHHHHHhcCCHHHHHHHHHHHHHHHHHhHHHHHHHHH
Confidence 344556799999999999999999999999988876543
No 108
>PF14098 SSPI: Small, acid-soluble spore protein I
Probab=22.71 E-value=1e+02 Score=18.78 Aligned_cols=19 Identities=26% Similarity=0.426 Sum_probs=14.9
Q ss_pred HHHHhccCCHHHHHHHHHh
Q psy3264 45 CAERWKTMSEKEKKRFHEM 63 (156)
Q Consensus 45 l~~~Wk~Ls~~eK~~y~~~ 63 (156)
.-..|+.+++.+|....+.
T Consensus 42 FE~~W~~~~~~ek~~m~~~ 60 (65)
T PF14098_consen 42 FEVIWKNSDESEKQEMVNT 60 (65)
T ss_pred HHHHHHhCCHHHHHHHHHH
Confidence 3346999999999887654
No 109
>cd08332 CARD_CASP2 Caspase activation and recruitment domain of Caspase-2. Caspase activation and recruitment domain (CARD) similar to that found in caspase-2. Caspases are aspartate-specific cysteine proteases with functions in apoptosis and immune signaling. Caspase-2 (also known as ICH1, NEDD2, or CASP2) is one of the most evolutionarily conserved caspases, and plays a role in apoptosis, DNA damage response, cell cycle regulation, and tumor suppression. It is localized in the nucleus and exhibits properties of both an initiator and an effector caspase. In general, CARDs are death domains (DDs) found associated with caspases. They are known to be important in the signaling pathways for apoptosis, inflammation, and host-defense mechanisms. DDs are protein-protein interaction domains found in a variety of domain architectures. Their common feature is that they form homodimers by self-association or heterodimers by associating with other members of the DD superfamily including PYRIN and
Probab=22.70 E-value=40 Score=21.65 Aligned_cols=17 Identities=29% Similarity=0.542 Sum_probs=14.5
Q ss_pred CCCCCcHHHHHHHHHHH
Q psy3264 9 PRGRMTAYAFFVQVCRE 25 (156)
Q Consensus 9 PkrP~say~lF~~e~r~ 25 (156)
|+|...||..||..-+.
T Consensus 61 ~~RG~~AF~~F~~aL~~ 77 (90)
T cd08332 61 PKRGPRAFSAFCEALRE 77 (90)
T ss_pred HHhChhHHHHHHHHHHh
Confidence 78888999999987764
No 110
>cd07106 ALDH_AldA-AAD23400 Streptomyces aureofaciens putative aldehyde dehydrogenase AldA (AAD23400)-like. Putative aldehyde dehydrogenase, AldA, from Streptomyces aureofaciens (locus AAD23400) and other similar sequences are present in this CD.
Probab=22.61 E-value=2.4e+02 Score=23.32 Aligned_cols=38 Identities=8% Similarity=0.016 Sum_probs=32.4
Q ss_pred HHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy3264 103 KELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKN 140 (156)
Q Consensus 103 k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~ 140 (156)
+.....|..+|..+|..+...+......+..++.....
T Consensus 29 ~~a~~~w~~~~~~~R~~~L~~~a~~l~~~~~ela~~~~ 66 (446)
T cd07106 29 KAAFPGWSATPLEERRAALLAIADAIEANAEELARLLT 66 (446)
T ss_pred HHHHHHHhcCCHHHHHHHHHHHHHHHHHhHHHHHHHHH
Confidence 44456799999999999999999999999999887754
No 111
>cd07559 ALDH_ACDHII_AcoD-like Ralstonia eutrophus NAD+-dependent acetaldehyde dehydrogenase II and Staphylococcus aureus AldA1 (SACOL0154)-like. Included in this CD is the NAD+-dependent, acetaldehyde dehydrogenase II (AcDHII, AcoD, EC=1.2.1.3) from Ralstonia (Alcaligenes) eutrophus H16 involved in the catabolism of acetoin and ethanol, and similar proteins, such as, the dimeric dihydrolipoamide dehydrogenase of the acetoin dehydrogenase enzyme system of Klebsiella pneumonia. Also included are sequences similar to the NAD+-dependent chloroacetaldehyde dehydrogenases (AldA and AldB) of Xanthobacter autotrophicus GJ10 which are involved in the degradation of 1,2-dichloroethane, as well as, the uncharacterized aldehyde dehydrogenase from Staphylococcus aureus (AldA1, locus SACOL0154) and other similar sequences.
Probab=22.59 E-value=2.4e+02 Score=23.75 Aligned_cols=38 Identities=8% Similarity=0.081 Sum_probs=32.1
Q ss_pred HHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy3264 103 KELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKN 140 (156)
Q Consensus 103 k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~ 140 (156)
+.....|..++..+|..+...+......+..++..-..
T Consensus 48 ~~A~~~w~~~~~~~R~~~L~~~a~~l~~~~~~la~~~~ 85 (480)
T cd07559 48 HEAFKTWGKTSVAERANILNKIADRIEENLELLAVAET 85 (480)
T ss_pred HHHHHHHhcCCHHHHHHHHHHHHHHHHHhHHHHHHHHH
Confidence 44455799999999999999999999999988877644
No 112
>PLN02174 aldehyde dehydrogenase family 3 member H1
Probab=22.52 E-value=2.4e+02 Score=24.05 Aligned_cols=43 Identities=7% Similarity=-0.188 Sum_probs=35.4
Q ss_pred HHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHhcCC
Q psy3264 101 IAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKNRSV 143 (156)
Q Consensus 101 ~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~~~ 143 (156)
.++..-..|..+|..+|..+...+....+++..++........
T Consensus 18 ~a~~af~~w~~~~~~~R~~~L~~~a~~l~~~~~ela~~l~~e~ 60 (484)
T PLN02174 18 ELRRSFDDGVTRGYEWRVTQLKKLMIICDNHEPEIVAALRDDL 60 (484)
T ss_pred HHHHHHHhcCCCCHHHHHHHHHHHHHHHHHhHHHHHHHHHHHc
Confidence 3445556799999999999999999999999999988755443
No 113
>cd07100 ALDH_SSADH1_GabD1 Mycobacterium tuberculosis succinate-semialdehyde dehydrogenase 1-like. Succinate-semialdehyde dehydrogenase 1 (SSADH1, GabD1, EC=1.2.1.16) catalyzes the NADP(+)-dependent oxidation of succinate semialdehyde (SSA) to succinate. SSADH activity in Mycobacterium tuberculosis (Mtb) is encoded by both gabD1 (Rv0234c) and gabD2 (Rv1731). The Mtb GabD1 SSADH1 reportedly is an enzyme of the gamma-aminobutyrate shunt, which forms a functional link between two TCA half-cycles by converting alpha-ketoglutarate to succinate.
Probab=22.49 E-value=2.5e+02 Score=23.21 Aligned_cols=39 Identities=8% Similarity=-0.005 Sum_probs=31.7
Q ss_pred HHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3264 101 IAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYK 139 (156)
Q Consensus 101 ~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~ 139 (156)
.++..-..|..++..+|..+...+....+.+..++....
T Consensus 7 ~A~~A~~~W~~~~~~~R~~~L~~~a~~l~~~~~~la~~~ 45 (429)
T cd07100 7 RAHAAFLAWRKTSFAERAALLRKLADLLRERKDELARLI 45 (429)
T ss_pred HHHHHHHHHhcCCHHHHHHHHHHHHHHHHhHHHHHHHHH
Confidence 345556789999999999999999999888888876543
No 114
>cd07086 ALDH_F7_AASADH-like NAD+-dependent alpha-aminoadipic semialdehyde dehydrogenase and related proteins. ALDH subfamily which includes the NAD+-dependent, alpha-aminoadipic semialdehyde dehydrogenase (AASADH, EC=1.2.1.31), also known as Antiquitin-1, ALDH7A1, ALDH7B or delta-1-piperideine-6-carboxylate dehydrogenase (P6CDH), and other similar sequences, such as the uncharacterized aldehyde dehydrogenase of Candidatus kuenenia AldH (locus CAJ73105).
Probab=22.17 E-value=2.4e+02 Score=23.68 Aligned_cols=38 Identities=11% Similarity=0.181 Sum_probs=31.8
Q ss_pred HHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy3264 103 KELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKN 140 (156)
Q Consensus 103 k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~ 140 (156)
+.....|..++.++|..+...+......+..++..-..
T Consensus 45 ~~A~~~w~~~~~~~R~~~L~~~a~~l~~~~~ela~~~~ 82 (478)
T cd07086 45 REAFKEWRKVPAPRRGEIVRQIGEALRKKKEALGRLVS 82 (478)
T ss_pred HHHHHHHhcCCHHHHHHHHHHHHHHHHHhHHHHHHHHH
Confidence 44445799999999999999999999999988877643
No 115
>cd07104 ALDH_BenzADH-like ALDH subfamily: NAD(P)+-dependent benzaldehyde dehydrogenase II, vanillin dehydrogenase, p-hydroxybenzaldehyde dehydrogenase and related proteins. ALDH subfamily which includes the NAD(P)+-dependent, benzaldehyde dehydrogenase II (XylC, BenzADH, EC=1.2.1.28) involved in the oxidation of benzyl alcohol to benzoate; p-hydroxybenzaldehyde dehydrogenase (PchA, HBenzADH) which catalyzes the oxidation of p-hydroxybenzaldehyde to p-hydroxybenzoic acid; vanillin dehydrogenase (Vdh, VaniDH) involved in the metabolism of ferulic acid as seen in Pseudomonas putida KT2440; and other related sequences.
Probab=22.14 E-value=2.5e+02 Score=22.98 Aligned_cols=39 Identities=18% Similarity=0.244 Sum_probs=32.6
Q ss_pred HHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3264 101 IAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYK 139 (156)
Q Consensus 101 ~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~ 139 (156)
.++..-..|+.++..+|..+...+......+..++..-.
T Consensus 8 ~A~~A~~~w~~~~~~~R~~~L~~~a~~l~~~~~ela~~~ 46 (431)
T cd07104 8 AAAAAQKAWAATPPQERAAILRKAAEILEERRDEIADWL 46 (431)
T ss_pred HHHHHHHHHhcCCHHHHHHHHHHHHHHHHHhHHHHHHHH
Confidence 445566789999999999999999999888888887743
No 116
>TIGR01780 SSADH succinate-semialdehyde dehydrogenase. SSADH enzyme belongs to the aldehyde dehydrogenase family (pfam00171), sharing a common evolutionary origin and enzymatic mechanism with lactaldehyde dehydrogenase. Like in lactaldehyde dehydrogenase and succinate semialdehyde dehydrogenase, the mammalian catalytic glutamic acid and cysteine residues are conserved in all the enzymes of this family (PS00687, PS00070).
Probab=22.09 E-value=2.5e+02 Score=23.35 Aligned_cols=40 Identities=10% Similarity=0.066 Sum_probs=33.1
Q ss_pred HHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy3264 102 AKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKNR 141 (156)
Q Consensus 102 ~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~ 141 (156)
++.....|..++.++|..+...+....+.+..++......
T Consensus 28 A~~a~~~w~~~~~~~R~~~L~~~a~~l~~~~~~la~~~~~ 67 (448)
T TIGR01780 28 AYEAFKTWKNTTAKERSSLLRKWYNLMMENKDDLARLITL 67 (448)
T ss_pred HHHHHHHHhcCCHHHHHHHHHHHHHHHHHhHHHHHHHHHH
Confidence 3455567999999999999999999999999988776543
No 117
>cd07142 ALDH_F2BC Arabidosis aldehyde dehydrogenase family 2 B4, B7, C4-like. Included in this CD is the Arabidosis aldehyde dehydrogenase family 2 members B4 and B7 (EC=1.2.1.3), which are mitochondrial homotetramers that oxidize acetaldehyde and glycolaldehyde, but not L-lactaldehyde. Also in this group, is the Arabidosis cytosolic, homotetramer ALDH2C4 (EC=1.2.1.3), an enzyme involved in the oxidation of sinapalehyde and coniferaldehyde.
Probab=22.06 E-value=2.6e+02 Score=23.51 Aligned_cols=53 Identities=9% Similarity=0.120 Sum_probs=38.9
Q ss_pred cccCCCCCchhHHHHHHH--------HhccCChHHHHHHHHHHHHHHHHHHHHHHHHHhcC
Q psy3264 90 KQDNPEYGVGDIAKELGK--------KWADVDPSVKTKYEQMAEKDKARYEKEMTAYKNRS 142 (156)
Q Consensus 90 ~~~~p~~~~~e~~k~~~~--------~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~~ 142 (156)
....+..+..++...+.. .|..+|..+|..+...+......+..++.......
T Consensus 32 i~~v~~~~~~~v~~av~~A~~a~~~~~w~~~~~~~R~~~L~~~a~~l~~~~~~la~~~~~e 92 (476)
T cd07142 32 IAHVAEGDAEDVDRAVKAARKAFDEGPWPRMTGYERSRILLRFADLLEKHADELAALETWD 92 (476)
T ss_pred EEEEcCCCHHHHHHHHHHHHHhcCCCchhhCCHHHHHHHHHHHHHHHHHhHHHHHHHHHHh
Confidence 334455555665544432 29999999999999999999999999988765443
No 118
>TIGR01238 D1pyr5carbox3 delta-1-pyrroline-5-carboxylate dehydrogenase (PutA C-terminal domain). This model represents one of several related branches of delta-1-pyrroline-5-carboxylate dehydrogenase. Members of this branch are the C-terminal domain of the PutA bifunctional proline dehydrogenase / delta-1-pyrroline-5-carboxylate dehydrogenase.
Probab=22.02 E-value=2.5e+02 Score=23.92 Aligned_cols=52 Identities=8% Similarity=-0.031 Sum_probs=38.0
Q ss_pred ccccCCCCCchhHH------HHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy3264 89 IKQDNPEYGVGDIA------KELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKN 140 (156)
Q Consensus 89 ~~~~~p~~~~~e~~------k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~ 140 (156)
.....+..+..++. +.....|..+|..+|..+...+......+..++..-..
T Consensus 64 ~i~~~~~~~~~dv~~av~~A~~a~~~W~~~~~~~R~~~L~~~a~~l~~~~~ela~~~~ 121 (500)
T TIGR01238 64 IVGQVFHANLAHVQAAIDSAQQAFPTWNATPAKERAAKLDRLADLLELHMPELMALCV 121 (500)
T ss_pred EEEEEeCCCHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHhHHHHHHHHHH
Confidence 33444444544544 44456799999999999999999999999988866443
No 119
>cd07151 ALDH_HBenzADH NADP+-dependent p-hydroxybenzaldehyde dehydrogenase-like. NADP+-dependent, p-hydroxybenzaldehyde dehydrogenase (PchA, HBenzADH) which catalyzes oxidation of p-hydroxybenzaldehyde to p-hydroxybenzoic acid and other related sequences are included in this CD.
Probab=22.00 E-value=2.5e+02 Score=23.49 Aligned_cols=38 Identities=16% Similarity=0.213 Sum_probs=31.7
Q ss_pred HHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy3264 103 KELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKN 140 (156)
Q Consensus 103 k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~ 140 (156)
+.....|..++..+|..+...+......+..++..-..
T Consensus 42 ~~A~~~w~~~~~~~R~~~L~~~a~~l~~~~~ela~~~~ 79 (465)
T cd07151 42 AAAQKEWAATLPQERAEILEKAAQILEERRDEIVEWLI 79 (465)
T ss_pred HHHHHHHhcCCHHHHHHHHHHHHHHHHHhHHHHHHHHH
Confidence 44456799999999999999999999998888876543
No 120
>PF08367 M16C_assoc: Peptidase M16C associated; InterPro: IPR013578 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This domain appears in eukaryotes as well as bacteria and tends to be found near the C terminus of metalloproteases and related sequences belonging to MEROPS peptidase family M16 (subfamily M16C, clan ME). These include: eupitrilysin, falcilysin, PreP peptidase, CYM1 peptidase and subfamily M16C non-peptidase homologues.; GO: 0008237 metallopeptidase activity, 0008270 zinc ion binding, 0006508 proteolysis; PDB: 2FGE_B 3S5I_A 3S5H_A 3S5M_A 3S5K_A.
Probab=21.94 E-value=1.9e+02 Score=21.95 Aligned_cols=32 Identities=19% Similarity=0.140 Sum_probs=26.3
Q ss_pred hhHHHHHHHHhccCChHHHHHHHHHHHHHHHH
Q psy3264 99 GDIAKELGKKWADVDPSVKTKYEQMAEKDKAR 130 (156)
Q Consensus 99 ~e~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~ 130 (156)
.+..+.|.+...+||+++++.-++.+.+.++.
T Consensus 13 ~~e~~~L~~~k~~Ls~~e~~~i~~~~~~L~~~ 44 (248)
T PF08367_consen 13 EEEKEKLAAYKASLSEEEKEKIIEQTKELKER 44 (248)
T ss_dssp HHHHHHHHHHHHCS-HHHHHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHhhCCHHHHHHHHHHHHHHHHH
Confidence 45678899999999999999999988886554
No 121
>cd07129 ALDH_KGSADH Alpha-Ketoglutaric Semialdehyde Dehydrogenase. Alpha-Ketoglutaric Semialdehyde (KGSA) Dehydrogenase (KGSADH, EC 1.2.1.26) catalyzes the NAD(P)+-dependent conversion of KGSA to alpha-ketoglutarate. This CD contains such sequences as those seen in Azospirillum brasilense, KGSADH-II (D-glucarate/D-galactarate-inducible) and KGSADH-III (hydroxy-L-proline-inducible). Both show similar high substrate specificity for KGSA and different coenzyme specificity; KGSADH-II is NAD+-dependent and KGSADH-III is NADP+-dependent. Also included in this CD is the NADP(+)-dependent aldehyde dehydrogenase from Vibrio harveyi which catalyzes the oxidation of long-chain aliphatic aldehydes to acids.
Probab=21.87 E-value=2.4e+02 Score=23.52 Aligned_cols=39 Identities=10% Similarity=0.177 Sum_probs=32.8
Q ss_pred HHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3264 101 IAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYK 139 (156)
Q Consensus 101 ~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~ 139 (156)
.++.....|..++..+|..+...+......+..++..-.
T Consensus 7 ~A~~A~~~w~~~~~~~R~~~L~~~a~~l~~~~~ela~~~ 45 (454)
T cd07129 7 AAAAAFESYRALSPARRAAFLEAIADEIEALGDELVARA 45 (454)
T ss_pred HHHHHHHHHhhCCHHHHHHHHHHHHHHHHHhHHHHHHHH
Confidence 445566789999999999999999999999988887653
No 122
>PRK13473 gamma-aminobutyraldehyde dehydrogenase; Provisional
Probab=21.84 E-value=2.6e+02 Score=23.44 Aligned_cols=38 Identities=8% Similarity=0.150 Sum_probs=32.0
Q ss_pred HHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy3264 103 KELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKN 140 (156)
Q Consensus 103 k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~ 140 (156)
+.....|..+|.++|..+...+......+..++.....
T Consensus 49 ~~a~~~w~~~~~~~R~~~L~~~a~~l~~~~~~la~~~~ 86 (475)
T PRK13473 49 DAAFPEWSQTTPKERAEALLKLADAIEENADEFARLES 86 (475)
T ss_pred HHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44446799999999999999999999999988887644
No 123
>cd07130 ALDH_F7_AASADH NAD+-dependent alpha-aminoadipic semialdehyde dehydrogenase, ALDH family members 7A1 and 7B. Alpha-aminoadipic semialdehyde dehydrogenase (AASADH, EC=1.2.1.31), also known as ALDH7A1, Antiquitin-1, ALDH7B, or delta-1-piperideine-6-carboxylate dehydrogenase (P6CDH), is a NAD+-dependent ALDH. Human ALDH7A1 is involved in the pipecolic acid pathway of lysine catabolism, catalyzing the oxidation of alpha-aminoadipic semialdehyde to alpha-aminoadipate. Arabidopsis thaliana ALDH7B4 appears to be an osmotic-stress-inducible ALDH gene encoding a turgor-responsive or stress-inducible ALDH. The Streptomyces clavuligerus P6CDH appears to be involved in cephamycin biosynthesis, catalyzing the second stage of the two-step conversion of lysine to alpha-aminoadipic acid. The ALDH7A1 enzyme and others in this group have been observed as tetramers, yet the bacterial P6CDH enzyme has been reported as a monomer.
Probab=21.80 E-value=2.6e+02 Score=23.48 Aligned_cols=38 Identities=13% Similarity=0.205 Sum_probs=32.1
Q ss_pred HHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy3264 103 KELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKN 140 (156)
Q Consensus 103 k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~ 140 (156)
+.....|..+|.++|......+......+..++.....
T Consensus 44 ~~A~~~w~~~~~~~R~~~L~~~a~~l~~~~~~la~~~~ 81 (474)
T cd07130 44 QEAFKEWRDVPAPKRGEIVRQIGDALRKKKEALGKLVS 81 (474)
T ss_pred HHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 44456799999999999999999999999999876543
No 124
>cd07140 ALDH_F1L_FTFDH 10-formyltetrahydrofolate dehydrogenase, ALDH family 1L. 10-formyltetrahydrofolate dehydrogenase (FTHFDH, EC=1.5.1.6), also known as aldehyde dehydrogenase family 1 member L1 (ALDH1L1) in humans, is a multi-domain homotetramer with an N-terminal formyl transferase domain and a C-terminal ALDH domain. FTHFDH catalyzes an NADP+-dependent dehydrogenase reaction resulting in the conversion of 10-formyltetrahydrofolate to tetrahydrofolate and CO2. The ALDH domain is also capable of the oxidation of short chain aldehydes to their corresponding acids.
Probab=21.77 E-value=2.6e+02 Score=23.62 Aligned_cols=53 Identities=8% Similarity=0.153 Sum_probs=39.8
Q ss_pred ccccCCCCCchhHHHHHHH--------HhccCChHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy3264 89 IKQDNPEYGVGDIAKELGK--------KWADVDPSVKTKYEQMAEKDKARYEKEMTAYKNR 141 (156)
Q Consensus 89 ~~~~~p~~~~~e~~k~~~~--------~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~ 141 (156)
.....+..+..++...+.. .|..++..+|..+...+......+..++......
T Consensus 33 ~i~~~~~~~~~~v~~av~~A~~A~~~~~W~~~~~~~R~~~L~~~a~~l~~~~~ela~~~~~ 93 (486)
T cd07140 33 VICKVSLATVEDVDRAVAAAKEAFENGEWGKMNARDRGRLMYRLADLMEEHQEELATIESL 93 (486)
T ss_pred EEEEEeCCCHHHHHHHHHHHHHhcCCCchhcCCHHHHHHHHHHHHHHHHHhHHHHHHHHHH
Confidence 4445555666666555433 4999999999999999999999999888876443
No 125
>cd07136 ALDH_YwdH-P39616 Bacillus subtilis aldehyde dehydrogenase ywdH-like. Uncharacterized Bacillus subtilis ywdH aldehyde dehydrogenase (locus P39616) most closely related to the ALDHs and fatty ALDHs of families 3 and 14, and similar sequences, are included in this CD.
Probab=21.76 E-value=2.5e+02 Score=23.54 Aligned_cols=43 Identities=14% Similarity=0.158 Sum_probs=36.0
Q ss_pred hHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHhcC
Q psy3264 100 DIAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKNRS 142 (156)
Q Consensus 100 e~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~~ 142 (156)
+.++..-..|..++..+|.....++....+.+..++.......
T Consensus 5 ~~a~~a~~~W~~~~~~~R~~~L~~~a~~l~~~~~ela~~~~~e 47 (449)
T cd07136 5 EKQRAFFKTGATKDVEFRIEQLKKLKQAIKKYENEILEALKKD 47 (449)
T ss_pred HHHHHHHHhcCCCCHHHHHHHHHHHHHHHHHhHHHHHHHHHHH
Confidence 4556666789999999999999999999999999988765443
No 126
>COG2854 Ttg2D ABC-type transport system involved in resistance to organic solvents, auxiliary component [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=21.65 E-value=63 Score=24.26 Aligned_cols=39 Identities=15% Similarity=0.263 Sum_probs=31.8
Q ss_pred HHHHHHHHhccCCHHHHHHHHHhhhh-hhhhhccccccCC
Q psy3264 41 FSKKCAERWKTMSEKEKKRFHEMADK-DKKRYDTEMQSYT 79 (156)
Q Consensus 41 i~k~l~~~Wk~Ls~~eK~~y~~~a~~-~k~~y~~~~~~~~ 79 (156)
....||.=|+++|+++++.|..-... ....|-..+..|.
T Consensus 76 a~~vLGk~~k~aspeQ~~~F~~aF~~yl~q~Y~~aL~~Y~ 115 (202)
T COG2854 76 AKLVLGKYYKTASPEQRQAFFKAFRTYLEQTYGQALLDYK 115 (202)
T ss_pred HHHHhccccccCCHHHHHHHHHHHHHHHHHHHHHHHHHcc
Confidence 34668889999999999999987777 5567888887775
No 127
>PF05823 Gp-FAR-1: Nematode fatty acid retinoid binding protein (Gp-FAR-1); InterPro: IPR008632 Parasitic nematodes produce at least two structurally novel classes of small helix-rich retinol- and fatty-acid-binding proteins that have no counterparts in their plant or animal hosts and thus represent potential targets for new nematicides. Gp-FAR-1 is a member of the nematode-specific fatty-acid- and retinol-binding (FAR) family of proteins but localises to the surface of the organism, placing it in a strategic position for interaction with the host. Gp-FAR-1 functions as a broad-spectrum retinol- and fatty-acid-binding protein, and it is thought that it is involved in the evasion of primary host plant defence systems [].; GO: 0008289 lipid binding; PDB: 2W9Y_A.
Probab=21.52 E-value=1.1e+02 Score=21.73 Aligned_cols=82 Identities=17% Similarity=0.259 Sum_probs=41.0
Q ss_pred HHHHHHHhCCCCcchHHHHHHHHHHHhccCCHHHHHHHHHhhhhhhhhhccccccCCCCCccccccccccCCCC-CchhH
Q psy3264 23 CREEHKKKHPEENVVFAEFSKKCAERWKTMSEKEKKRFHEMADKDKKRYDTEMQSYTPPAGEKRGKIKQDNPEY-GVGDI 101 (156)
Q Consensus 23 ~r~~~~~~~p~~~~~~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~~~~~~~~~~~r~~~~~~~p~~-~~~e~ 101 (156)
....++..+|.+-..+..+...+..+...|+++-|..-...-..-+..|..... +..|.. ....+
T Consensus 45 ~i~~LK~ksP~L~~k~~~l~~~~k~ki~~L~peak~Fv~~li~~~~~l~~~~~~--------------G~~~~~~~lk~~ 110 (154)
T PF05823_consen 45 MIAALKEKSPSLYEKAEKLRDKLKKKIDKLSPEAKAFVKELIAKARSLYAQYSA--------------GEKPDLEELKQL 110 (154)
T ss_dssp HHHHHHHH-HHHHHHHHHHHHHHHHTTTT--HHHHHHHHHHHHHHHHHHHHHHH--------------T----THHHHHH
T ss_pred HHHHHHHhCHHHHHHHHHHHHHHHHHHHcCCHHHHHHHHHHHHHHHHHHHHhcC--------------CCCCCHHHHHHH
Confidence 345566666665444556667777778888888887666654443333332222 111111 12455
Q ss_pred HHHHHHHhccCChHHHH
Q psy3264 102 AKELGKKWADVDPSVKT 118 (156)
Q Consensus 102 ~k~~~~~Wk~ls~~eK~ 118 (156)
.+.+...+++||++.|.
T Consensus 111 ~k~~~~~ykaLs~~ak~ 127 (154)
T PF05823_consen 111 AKKVIDSYKALSPEAKD 127 (154)
T ss_dssp H----HHHHTS-HHHHH
T ss_pred HhhhHHHHHcCCHHHHH
Confidence 66667888888888775
No 128
>PRK12750 cpxP periplasmic repressor CpxP; Reviewed
Probab=21.48 E-value=1.6e+02 Score=21.36 Aligned_cols=34 Identities=15% Similarity=0.324 Sum_probs=25.7
Q ss_pred HHHHHHHhccCCHHHHHHHHHhhhhhhhhhcccc
Q psy3264 42 SKKCAERWKTMSEKEKKRFHEMADKDKKRYDTEM 75 (156)
Q Consensus 42 ~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~y~~~~ 75 (156)
.+...+.+.-||+++|..|.++..+....+...+
T Consensus 127 ~~~~~~~~~vLTpEQRak~~e~~~~r~~~~~~~~ 160 (170)
T PRK12750 127 LEKRHQMLSILTPEQKAKFQELQQERMQECQDKM 160 (170)
T ss_pred HHHHHHHHHhCCHHHHHHHHHHHHHHHHHHHHHH
Confidence 4455568999999999999998877665555443
No 129
>TIGR03092 SASP_sspI small, acid-soluble spore protein I. This protein family is restricted to a subset of endospore-forming bacteria such as Bacillus subtilis, all of which are in the Firmicutes (low-GC Gram-positive) lineage. It is a minor SASP (small, acid-soluble spore protein) designated SspI. The gene in Bacillus subtilis previously was designated ysfA.
Probab=21.46 E-value=1.1e+02 Score=18.67 Aligned_cols=19 Identities=32% Similarity=0.520 Sum_probs=14.5
Q ss_pred HHHHhccCCHHHHHHHHHh
Q psy3264 45 CAERWKTMSEKEKKRFHEM 63 (156)
Q Consensus 45 l~~~Wk~Ls~~eK~~y~~~ 63 (156)
.-..|+.+++.+|......
T Consensus 41 FE~~W~~~~~~ek~~m~~~ 59 (65)
T TIGR03092 41 FEAIWKHANEQEKDEMLET 59 (65)
T ss_pred HHHHHHhcCHHHHHHHHHH
Confidence 3346999999999877653
No 130
>PF00171 Aldedh: Aldehyde dehydrogenase family; InterPro: IPR015590 Aldehyde dehydrogenases (1.2.1.3 from EC and 1.2.1.5 from EC) are enzymes that oxidize a wide variety of aliphatic and aromatic aldehydes using NADP as a cofactor. In mammals at least four different forms of the enzyme are known []: class-1 (or Ald C) a tetrameric cytosolic enzyme, class-2 (or Ald M) a tetrameric mitochondrial enzyme, class- 3 (or Ald D) a dimeric cytosolic enzyme, and class IV a microsomal enzyme. Aldehyde dehydrogenases have also been sequenced from fungal and bacterial species. A number of enzymes are known to be evolutionary related to aldehyde dehydrogenases. A glutamic acid and a cysteine residue have been implicated in the catalytic activity of mammalian aldehyde dehydrogenase. These residues are conserved in all the enzymes of this entry. Some of the proteins in this entry are allergens. Allergies are hypersensitivity reactions of the immune system to specific substances called allergens (such as pollen, stings, drugs, or food) that, in most people, result in no symptoms. A nomenclature system has been established for antigens (allergens) that cause IgE-mediated atopic allergies in humans [WHO/IUIS Allergen Nomenclature Subcommittee King T.P., Hoffmann D., Loewenstein H., Marsh D.G., Platts-Mills T.A.E., Thomas W. Bull. World Health Organ. 72:797-806(1994)]. This nomenclature system is defined by a designation that is composed of the first three letters of the genus; a space; the first letter of the species name; a space and an arabic number. In the event that two species names have identical designations, they are discriminated from one another by adding one or more letters (as necessary) to each species designation. The allergens in this family include allergens with the following designations: Alt a 10 and Cla h 3.; GO: 0016491 oxidoreductase activity, 0008152 metabolic process, 0055114 oxidation-reduction process; PDB: 3R31_A 3HAZ_A 1UXQ_A 1UXP_A 1UXR_A 1UXU_A 1UXN_A 1KY8_A 1UXT_A 1UXV_A ....
Probab=21.36 E-value=2.4e+02 Score=23.48 Aligned_cols=40 Identities=10% Similarity=0.147 Sum_probs=33.5
Q ss_pred HHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy3264 101 IAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKN 140 (156)
Q Consensus 101 ~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~ 140 (156)
.++.....|..++..+|......+....+++..++..-..
T Consensus 37 ~A~~A~~~w~~~~~~~R~~iL~~~a~~l~~~~~~la~~~~ 76 (462)
T PF00171_consen 37 AARAAFKEWSKLPAAERARILERFADLLEERRDELAELIA 76 (462)
T ss_dssp HHHHHHHHHTTSSHHHHHHHHHHHHHHHHHTHHHHHHHHH
T ss_pred HHHHHHHhhhhhhhhhHHHHHHHHHHHHhhcccccccccc
Confidence 3455567899999999999999999999999998876544
No 131
>cd07138 ALDH_CddD_SSP0762 Rhodococcus ruber 6-oxolauric acid dehydrogenase-like. The 6-oxolauric acid dehydrogenase (CddD) from Rhodococcus ruber SC1 which converts 6-oxolauric acid to dodecanedioic acid, and the aldehyde dehydrogenase (locus SSP0762) from Staphylococcus saprophyticus subsp. saprophyticus ATCC 15305 and other similar sequences, are included in this CD.
Probab=21.34 E-value=2.5e+02 Score=23.45 Aligned_cols=38 Identities=5% Similarity=0.063 Sum_probs=32.4
Q ss_pred HHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy3264 103 KELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKN 140 (156)
Q Consensus 103 k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~ 140 (156)
+..-..|..+|..+|..+...+......+..++.....
T Consensus 46 ~~A~~~w~~~~~~~R~~~L~~~a~~l~~~~~~la~~~~ 83 (466)
T cd07138 46 RRAFPAWSATSVEERAALLERIAEAYEARADELAQAIT 83 (466)
T ss_pred HHHHHHHhhCCHHHHHHHHHHHHHHHHHhHHHHHHHHH
Confidence 44455799999999999999999999999999887654
No 132
>COG2938 Uncharacterized conserved protein [Function unknown]
Probab=21.23 E-value=1.5e+02 Score=19.38 Aligned_cols=27 Identities=11% Similarity=0.299 Sum_probs=22.1
Q ss_pred HHHHHHHHHhccCCHHHHHHHHHhhhh
Q psy3264 40 EFSKKCAERWKTMSEKEKKRFHEMADK 66 (156)
Q Consensus 40 ei~k~l~~~Wk~Ls~~eK~~y~~~a~~ 66 (156)
-++..+...|.+||++++..|..+-..
T Consensus 32 il~~Fae~~~~~lsd~el~~f~~LLe~ 58 (94)
T COG2938 32 ILGPFAEKEFDSLSDEELDEFERLLEC 58 (94)
T ss_pred HHHHHHHHHHhhCCHHHHHHHHHHHcC
Confidence 456777789999999999999986544
No 133
>cd07110 ALDH_F10_BADH Arabidopsis betaine aldehyde dehydrogenase 1 and 2, ALDH family 10A8 and 10A9-like. Present in this CD are the Arabidopsis betaine aldehyde dehydrogenase (BADH) 1 (chloroplast) and 2 (mitochondria), also known as, aldehyde dehydrogenase family 10 member A8 and aldehyde dehydrogenase family 10 member A9, respectively, and are putative dehydration- and salt-inducible BADHs (EC 1.2.1.8) that catalyze the oxidation of betaine aldehyde to the compatible solute glycine betaine.
Probab=21.22 E-value=2.6e+02 Score=23.20 Aligned_cols=38 Identities=11% Similarity=0.259 Sum_probs=31.8
Q ss_pred HHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy3264 103 KELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKN 140 (156)
Q Consensus 103 k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~ 140 (156)
+.....|..+|.++|..+...+......+..++.....
T Consensus 29 ~~A~~~w~~~~~~~R~~~L~~~a~~l~~~~~~la~~~~ 66 (456)
T cd07110 29 RRAFPRWKKTTGAERAKYLRAIAEGVRERREELAELEA 66 (456)
T ss_pred HHHHHHHhcCCHHHHHHHHHHHHHHHHHhHHHHHHHHH
Confidence 44556899999999999999999999999888876543
No 134
>PLN02315 aldehyde dehydrogenase family 7 member
Probab=21.03 E-value=2.7e+02 Score=23.78 Aligned_cols=52 Identities=12% Similarity=0.074 Sum_probs=38.1
Q ss_pred ccccCCCCCchhHH------HHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy3264 89 IKQDNPEYGVGDIA------KELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKN 140 (156)
Q Consensus 89 ~~~~~p~~~~~e~~------k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~ 140 (156)
.....+..+..++. +.....|..+|..+|..+...+....+.+..++..-..
T Consensus 46 ~i~~~~~a~~~~v~~av~~A~~A~~~W~~~~~~~R~~~L~~~a~~l~~~~~~la~~~~ 103 (508)
T PLN02315 46 PIAEVVEASLEDYEEGLRACEEAAKIWMQVPAPKRGEIVRQIGDALRAKLDYLGRLVS 103 (508)
T ss_pred EEEEEeCCCHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHhHHHHHHHHHH
Confidence 33444555555544 33344699999999999999999999999988876543
No 135
>cd07088 ALDH_LactADH-AldA Escherichia coli lactaldehyde dehydrogenase AldA-like. Lactaldehyde dehydrogenase from Escherichia coli (AldA, LactADH, EC=1.2.1.22), an NAD(+)-dependent enzyme involved in the metabolism of L-fucose and L-rhamnose, and other similar sequences are present in this CD.
Probab=21.03 E-value=2.6e+02 Score=23.30 Aligned_cols=51 Identities=14% Similarity=0.178 Sum_probs=37.5
Q ss_pred ccccCCCCCchhH------HHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3264 89 IKQDNPEYGVGDI------AKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYK 139 (156)
Q Consensus 89 ~~~~~p~~~~~e~------~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~ 139 (156)
.....+..+..++ ++.....|..+|..+|..+...+....+.+..++....
T Consensus 25 ~i~~~~~~~~~~v~~av~~A~~A~~~w~~~~~~~R~~~L~~~a~~l~~~~~~l~~~~ 81 (468)
T cd07088 25 VVATVPAATAEDADRAVDAAEAAQKAWERLPAIERAAYLRKLADLIRENADELAKLI 81 (468)
T ss_pred EEEEEeCCCHHHHHHHHHHHHHHHHHHhhCCHHHHHHHHHHHHHHHHHhHHHHHHHH
Confidence 3334455444444 34555679999999999999999999999988887643
No 136
>PLN02278 succinic semialdehyde dehydrogenase
Probab=20.97 E-value=2.6e+02 Score=23.72 Aligned_cols=38 Identities=8% Similarity=0.099 Sum_probs=31.9
Q ss_pred HHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy3264 103 KELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKN 140 (156)
Q Consensus 103 k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~ 140 (156)
+..-..|..+|..+|..+...+......+..++.....
T Consensus 72 ~~A~~~W~~~~~~~R~~~L~~~a~~l~~~~~~la~~~~ 109 (498)
T PLN02278 72 HDAFPSWSKLTASERSKILRRWYDLIIANKEDLAQLMT 109 (498)
T ss_pred HHHHHHHhcCCHHHHHHHHHHHHHHHHHhHHHHHHHHH
Confidence 34445799999999999999999999999998877643
No 137
>TIGR02518 EutH_ACDH acetaldehyde dehydrogenase (acetylating).
Probab=20.94 E-value=2.7e+02 Score=23.63 Aligned_cols=40 Identities=15% Similarity=0.043 Sum_probs=33.4
Q ss_pred hHHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3264 100 DIAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYK 139 (156)
Q Consensus 100 e~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~ 139 (156)
+.++.....|..+|.++|................++...-
T Consensus 15 ~~A~~A~~~w~~~s~~~R~~iL~~~a~~l~~~~~ela~~~ 54 (488)
T TIGR02518 15 RSAKVAQKKLANMTQEQIDKIVKAIVDAAYENAVKLAKMA 54 (488)
T ss_pred HHHHHHHHHHhhCCHHHHHHHHHHHHHHHHhhHHHHHHHH
Confidence 3456666789999999999999999999999999888843
No 138
>PF08855 DUF1825: Domain of unknown function (DUF1825); InterPro: IPR014954 These roteins are uncharacterised and are principally found in cyanobacteria.
Probab=20.89 E-value=2.7e+02 Score=18.69 Aligned_cols=37 Identities=14% Similarity=0.070 Sum_probs=31.1
Q ss_pred HHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy3264 104 ELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKN 140 (156)
Q Consensus 104 ~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~ 140 (156)
.++.....++.+.|.-|++..+.+.++|+.=+..+.-
T Consensus 26 ~~~~~~~~fd~egK~~~Id~m~~LidkqkiF~~Rl~L 62 (108)
T PF08855_consen 26 QMGSKYGKFDREGKKIHIDKMEELIDKQKIFYKRLEL 62 (108)
T ss_pred HHHhhcccCCHHHHHHHHHHHHHHHHHHHHHHHHHHh
Confidence 4556699999999999999999999999887666543
No 139
>cd07135 ALDH_F14-YMR110C Saccharomyces cerevisiae aldehyde dehydrogenase family 14 and related proteins. Aldehyde dehydrogenase family 14 (ALDH14), isolated mainly from the mitochondrial outer membrane of Saccharomyces cerevisiae (YMR110C) and most closely related to the plant and animal ALDHs and fatty ALDHs family 3 members, and similar fungal sequences, are present in this CD.
Probab=20.83 E-value=2.7e+02 Score=23.21 Aligned_cols=40 Identities=5% Similarity=-0.146 Sum_probs=33.2
Q ss_pred HHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy3264 102 AKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKNR 141 (156)
Q Consensus 102 ~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~ 141 (156)
++..-..|..++..+|..+...+......+..++..-...
T Consensus 14 A~~a~~~w~~~~~~~R~~~L~~~a~~l~~~~~~l~~~~~~ 53 (436)
T cd07135 14 LRATFRSGKTKDLEYRLWQLKQLYWAVKDNEEAIVEALKK 53 (436)
T ss_pred HHHHHHhcCCCCHHHHHHHHHHHHHHHHHhHHHHHHHHHH
Confidence 4455567999999999999999999999999998875443
No 140
>PRK11903 aldehyde dehydrogenase; Provisional
Probab=20.79 E-value=2.3e+02 Score=24.31 Aligned_cols=34 Identities=3% Similarity=-0.131 Sum_probs=30.3
Q ss_pred HHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy3264 107 KKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKN 140 (156)
Q Consensus 107 ~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~ 140 (156)
..|+.++..+|..+...+....+....++.....
T Consensus 55 ~~W~~~~~~eR~~~L~~~a~~l~~~~~ela~~~~ 88 (521)
T PRK11903 55 AALRALTYAQRAALLAAIVKVLQANRDAYYDIAT 88 (521)
T ss_pred hhhhhCCHHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 5799999999999999999999999999887644
No 141
>cd07097 ALDH_KGSADH-YcbD Bacillus subtilis NADP+-dependent alpha-ketoglutaric semialdehyde dehydrogenase ycbD-like. Kinetic studies of the Bacillus subtilis ALDH-like ycbD protein, which is involved in d-glucarate/d-galactarate utilization, reveal that it is a NADP+-dependent, alpha-ketoglutaric semialdehyde dehydrogenase (KGSADH). KGSADHs (EC 1.2.1.26) catalyze the NAD(P)+-dependent conversion of KGSA to alpha-ketoglutarate. Interestingly, the NADP+-dependent, tetrameric, 2,5-dioxopentanoate dehydrogenase (EC=1.2.1.26), an enzyme involved in the catabolic pathway for D-arabinose in Sulfolobus solfataricus, also clusters in this group. This CD shows a distant phylogenetic relationship to the Azospirillum brasilense KGSADH-II (-III) group.
Probab=20.66 E-value=2.8e+02 Score=23.26 Aligned_cols=50 Identities=10% Similarity=0.128 Sum_probs=37.2
Q ss_pred cccCCCCCchhHH------HHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3264 90 KQDNPEYGVGDIA------KELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYK 139 (156)
Q Consensus 90 ~~~~p~~~~~e~~------k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~ 139 (156)
....|..+..++. +.....|..++..+|......+......+..++....
T Consensus 28 i~~~~~~~~~~v~~av~~A~~A~~~W~~~~~~~R~~~L~~~a~~l~~~~~ela~~~ 83 (473)
T cd07097 28 VGKYARASAEDADAAIAAAAAAFPAWRRTSPEARADILDKAGDELEARKEELARLL 83 (473)
T ss_pred EEEEeCCCHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHhHHHHHHHH
Confidence 3444444544443 3344579999999999999999999999988887764
No 142
>TIGR03216 OH_muco_semi_DH 2-hydroxymuconic semialdehyde dehydrogenase. Members of this protein family are 2-hydroxymuconic semialdehyde dehydrogenase. Many aromatic compounds are catabolized by way of the catechol, via the meta-cleavage pathway, to pyruvate and acetyl-CoA. This enzyme performs the second of seven steps in that pathway for catechol degradation.
Probab=20.47 E-value=2.5e+02 Score=23.62 Aligned_cols=33 Identities=9% Similarity=0.155 Sum_probs=29.6
Q ss_pred hccCChHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy3264 109 WADVDPSVKTKYEQMAEKDKARYEKEMTAYKNR 141 (156)
Q Consensus 109 Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~ 141 (156)
|..+|..+|..+...+......+..++..-...
T Consensus 53 w~~~~~~~R~~~L~~~a~~l~~~~~ela~~~~~ 85 (481)
T TIGR03216 53 WGKMTVAERADLLYAVADEIERRFDDFLAAEVA 85 (481)
T ss_pred HhhCCHHHHHHHHHHHHHHHHHhHHHHHHHHHH
Confidence 999999999999999999999999998876543
No 143
>TIGR03374 ABALDH 1-pyrroline dehydrogenase. Members of this protein family are 1-pyrroline dehydrogenase (1.5.1.35), also called gamma-aminobutyraldehyde dehydrogenase. This enzyme can follow putrescine transaminase (EC 2.6.1.82) for a two-step conversion of putrescine to gamma-aminobutyric acid (GABA). The member from Escherichia coli is characterized as a homotetramer that binds one NADH per momomer. This enzyme belongs to the medium-chain aldehyde dehydrogenases, and is quite similar in sequence to the betaine aldehyde dehydrogenase (EC 1.2.1.8) family.
Probab=20.46 E-value=2.8e+02 Score=23.38 Aligned_cols=39 Identities=8% Similarity=0.159 Sum_probs=32.8
Q ss_pred HHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy3264 103 KELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKNR 141 (156)
Q Consensus 103 k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~ 141 (156)
+..-..|..++..+|..+...+......+..++......
T Consensus 48 ~~a~~~w~~~~~~~R~~~L~~~a~~l~~~~~ela~~~~~ 86 (472)
T TIGR03374 48 DAAFAEWGQTTPKARAECLLKLADVIEENAQVFAELESR 86 (472)
T ss_pred HHHHHHHhcCCHHHHHHHHHHHHHHHHHhHHHHHHHHHH
Confidence 344457999999999999999999999999999876443
No 144
>cd07092 ALDH_ABALDH-YdcW Escherichia coli NAD+-dependent gamma-aminobutyraldehyde dehydrogenase YdcW-like. NAD+-dependent, tetrameric, gamma-aminobutyraldehyde dehydrogenase (ABALDH), YdcW of Escherichia coli K12, catalyzes the oxidation of gamma-aminobutyraldehyde to gamma-aminobutyric acid. ABALDH can also oxidize n-alkyl medium-chain aldehydes, but with a lower catalytic efficiency.
Probab=20.44 E-value=3e+02 Score=22.75 Aligned_cols=39 Identities=15% Similarity=0.228 Sum_probs=32.7
Q ss_pred HHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy3264 103 KELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKNR 141 (156)
Q Consensus 103 k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~ 141 (156)
+.....|..+|..+|..+...+.....++..++......
T Consensus 29 ~~a~~~w~~~~~~~R~~~L~~~~~~l~~~~~~l~~~~~~ 67 (450)
T cd07092 29 HAAFPSWRRTTPAERSKALLKLADAIEENAEELAALESR 67 (450)
T ss_pred HHHHHHHhcCCHHHHHHHHHHHHHHHHHhHHHHHHHHHH
Confidence 445567999999999999999999999999888877543
No 145
>TIGR01722 MMSDH methylmalonic acid semialdehyde dehydrogenase. In Bacillus, a highly homologous protein to methylmalonic acid semialdehyde dehydrogenase, groups out from the main MMSDH clade with Listeria and Sulfolobus. This Bacillus protein has been suggested to be located in an iol operon and/or involved in myo-inositol catabolism, converting malonic semialdehyde to acetyl CoA ad CO2. The preceeding enzymes responsible for valine catabolism are present in Bacillus, Listeria, and Sulfolobus.
Probab=20.42 E-value=2.9e+02 Score=23.20 Aligned_cols=37 Identities=11% Similarity=0.053 Sum_probs=31.1
Q ss_pred HHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3264 103 KELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYK 139 (156)
Q Consensus 103 k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~ 139 (156)
+.....|..++..+|......+......+..++..-.
T Consensus 48 ~~A~~~w~~~~~~~R~~~L~~~a~~l~~~~~~l~~~~ 84 (477)
T TIGR01722 48 RETFLTWGQTSLAQRTSVLLRYQALLKEHRDEIAELI 84 (477)
T ss_pred HHHHHHHhcCCHHHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 4455679999999999999999999988888877653
No 146
>cd07111 ALDH_F16 Aldehyde dehydrogenase family 16A1-like. Uncharacterized aldehyde dehydrogenase family 16 member A1 (ALDH16A1) and other related sequences are present in this CD. The active site cysteine and glutamate residues are not conserved in the human ALDH16A1 protein sequence.
Probab=20.39 E-value=2.8e+02 Score=23.40 Aligned_cols=50 Identities=6% Similarity=0.126 Sum_probs=37.1
Q ss_pred ccCCCCCchhHH------HHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy3264 91 QDNPEYGVGDIA------KELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKN 140 (156)
Q Consensus 91 ~~~p~~~~~e~~------k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~ 140 (156)
...+..+..++. +.....|+.+|..+|..+...+......+..++.....
T Consensus 51 ~~~~~a~~~~v~~ai~~A~~a~~~w~~~~~~~R~~~L~~~a~~l~~~~~ela~~~~ 106 (480)
T cd07111 51 ASVLQAEEEDVDAAVAAARTAFESWSALPGHVRARHLYRIARHIQKHQRLFAVLES 106 (480)
T ss_pred EEEcCCCHHHHHHHHHHHHHHHHHhccCCHHHHHHHHHHHHHHHHHhHHHHHHHHH
Confidence 334444445544 33346799999999999999999999999999876544
No 147
>KOG1610|consensus
Probab=20.28 E-value=1.9e+02 Score=23.35 Aligned_cols=33 Identities=21% Similarity=0.385 Sum_probs=0.0
Q ss_pred HHHHHHHHHHHhccCCHHHHHHHHHhhhhhhhh
Q psy3264 38 FAEFSKKCAERWKTMSEKEKKRFHEMADKDKKR 70 (156)
Q Consensus 38 ~~ei~k~l~~~Wk~Ls~~eK~~y~~~a~~~k~~ 70 (156)
...+.+.+...|..||++.|+.|-..+..+...
T Consensus 218 ~~~~~~~~~~~w~~l~~e~k~~YGedy~~~~~~ 250 (322)
T KOG1610|consen 218 PEKLEKRMKEIWERLPQETKDEYGEDYFEDYKK 250 (322)
T ss_pred hHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHH
No 148
>TIGR03240 arg_catab_astD succinylglutamic semialdehyde dehydrogenase. Members of this protein family are succinylglutamic semialdehyde dehydrogenase (EC 1.2.1.71), the fourth enzyme in the arginine succinyltransferase (AST) pathway for arginine catabolism.
Probab=20.19 E-value=2.9e+02 Score=23.29 Aligned_cols=39 Identities=10% Similarity=0.052 Sum_probs=32.3
Q ss_pred HHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy3264 103 KELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKNR 141 (156)
Q Consensus 103 k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~ 141 (156)
+.....|..++..+|..+...+.....++..++......
T Consensus 45 ~~A~~~w~~~~~~~R~~~L~~~a~~l~~~~~ela~~~~~ 83 (484)
T TIGR03240 45 RAAFPAWARLSLEERIAVVQRFAALLEERKEALARVIAR 83 (484)
T ss_pred HHHHHHHccCCHHHHHHHHHHHHHHHHHhHHHHHHHHHH
Confidence 444467999999999999999999999999888776443
No 149
>PRK09406 gabD1 succinic semialdehyde dehydrogenase; Reviewed
Probab=20.18 E-value=2.4e+02 Score=23.60 Aligned_cols=50 Identities=4% Similarity=0.013 Sum_probs=37.7
Q ss_pred cCCCCCchhHH------HHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHhc
Q psy3264 92 DNPEYGVGDIA------KELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKNR 141 (156)
Q Consensus 92 ~~p~~~~~e~~------k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~~ 141 (156)
..+..+..++. +..-..|..++.++|..+...+......+..++......
T Consensus 16 ~~~~~~~~~v~~av~~A~~A~~~w~~~~~~~R~~~L~~~a~~l~~~~~ela~~~~~ 71 (457)
T PRK09406 16 TFTALTDDEVDAAIARAHARFRDYRTTTFAQRARWANAAADLLEAEADQVAALMTL 71 (457)
T ss_pred EEeCCCHHHHHHHHHHHHHHHHHHhcCCHHHHHHHHHHHHHHHHHCHHHHHHHHHH
Confidence 34444444443 444456999999999999999999999999998876443
No 150
>PLN02418 delta-1-pyrroline-5-carboxylate synthase
Probab=20.17 E-value=2.8e+02 Score=24.98 Aligned_cols=39 Identities=15% Similarity=0.097 Sum_probs=33.9
Q ss_pred HHHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3264 101 IAKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYK 139 (156)
Q Consensus 101 ~~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~ 139 (156)
.++.....|..+|..+|..+...+......+..++....
T Consensus 302 aA~~A~~~W~~~~~~eR~~~L~~~a~~l~~~~~ela~~~ 340 (718)
T PLN02418 302 AARESSRKLQALSSEERKKILLDVADALEANEELIKAEN 340 (718)
T ss_pred HHHHHHHHHHhCCHHHHHHHHHHHHHHHHHhHHHHHHHH
Confidence 445556689999999999999999999999999998874
No 151
>PLN02419 methylmalonate-semialdehyde dehydrogenase [acylating]
Probab=20.13 E-value=2.8e+02 Score=24.47 Aligned_cols=38 Identities=5% Similarity=0.035 Sum_probs=32.5
Q ss_pred HHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHHh
Q psy3264 103 KELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYKN 140 (156)
Q Consensus 103 k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~~ 140 (156)
+.....|..++..+|..+...+....+.+..++.....
T Consensus 161 ~~A~~~W~~~~~~eR~~iL~k~a~~L~~~~~ela~~~~ 198 (604)
T PLN02419 161 KQAFPLWRNTPITTRQRVMLKFQELIRKNMDKLAMNIT 198 (604)
T ss_pred HHHHHHhccCCHHHHHHHHHHHHHHHHHhHHHHHHHHH
Confidence 44455799999999999999999999999999877653
No 152
>cd07105 ALDH_SaliADH Salicylaldehyde dehydrogenase, DoxF-like. Salicylaldehyde dehydrogenase (DoxF, SaliADH, EC=1.2.1.65) involved in the upper naphthalene catabolic pathway of Pseudomonas strain C18 and other similar sequences are present in this CD.
Probab=20.04 E-value=2.5e+02 Score=23.19 Aligned_cols=38 Identities=16% Similarity=0.097 Sum_probs=31.5
Q ss_pred HHHHHHHhccCChHHHHHHHHHHHHHHHHHHHHHHHHH
Q psy3264 102 AKELGKKWADVDPSVKTKYEQMAEKDKARYEKEMTAYK 139 (156)
Q Consensus 102 ~k~~~~~Wk~ls~~eK~~y~~~a~~~k~~y~~e~~~~~ 139 (156)
++..-..|..+|.++|..+...+....+.+..++..-.
T Consensus 9 A~~a~~~w~~~~~~~R~~~L~~~a~~l~~~~~~la~~~ 46 (432)
T cd07105 9 AAAAFPAWSKTPPSERRDILLKAADLLESRRDEFIEAM 46 (432)
T ss_pred HHHHHHHHhcCCHHHHHHHHHHHHHHHHhCHHHHHHHH
Confidence 44556789999999999999999999988888876654
Done!