Query 036651
Match_columns 60
No_of_seqs 117 out of 1076
Neff 8.4
Searched_HMMs 46136
Date Fri Mar 29 04:10:13 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/036651.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/036651hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG0868 Glutathione S-transfer 99.6 7.4E-16 1.6E-20 90.0 6.1 59 1-59 66-124 (217)
2 PRK15113 glutathione S-transfe 99.1 3.7E-10 8E-15 66.3 5.1 45 2-46 68-115 (214)
3 COG0625 Gst Glutathione S-tran 99.0 4.7E-10 1E-14 65.5 5.0 48 2-49 61-111 (211)
4 KOG0406 Glutathione S-transfer 99.0 8.1E-10 1.7E-14 66.6 4.4 45 2-46 68-113 (231)
5 PRK09481 sspA stringent starva 99.0 9.8E-10 2.1E-14 64.4 4.4 44 2-45 68-111 (211)
6 TIGR01262 maiA maleylacetoacet 98.9 4.4E-09 9.5E-14 61.1 6.4 47 2-48 61-107 (210)
7 PRK13972 GSH-dependent disulfi 98.9 4.3E-09 9.3E-14 61.7 5.4 45 2-48 68-112 (215)
8 PLN02395 glutathione S-transfe 98.9 4.1E-09 9E-14 61.5 5.1 46 2-47 62-109 (215)
9 PRK10542 glutathionine S-trans 98.9 4.2E-09 9.1E-14 60.8 4.5 45 2-46 62-107 (201)
10 PLN02473 glutathione S-transfe 98.8 1.7E-08 3.7E-13 58.9 5.6 45 2-46 63-109 (214)
11 PRK10357 putative glutathione 98.8 1.5E-08 3.2E-13 58.7 4.6 44 2-45 59-102 (202)
12 PRK11752 putative S-transferas 98.7 5.1E-08 1.1E-12 59.3 5.0 41 2-44 114-154 (264)
13 KOG0867 Glutathione S-transfer 98.4 8.3E-07 1.8E-11 53.1 5.2 46 2-47 63-110 (226)
14 PLN02907 glutamate-tRNA ligase 98.3 2E-06 4.3E-11 58.8 5.5 41 2-42 48-88 (722)
15 PRK10387 glutaredoxin 2; Provi 98.2 1E-06 2.2E-11 51.1 3.1 41 2-45 58-98 (210)
16 TIGR02182 GRXB Glutaredoxin, G 97.9 1.7E-05 3.7E-10 46.6 3.8 40 2-44 57-96 (209)
17 PLN02378 glutathione S-transfe 97.7 8.1E-05 1.8E-09 43.8 4.6 33 2-36 69-101 (213)
18 PLN02817 glutathione dehydroge 97.7 9.3E-05 2E-09 45.4 4.6 34 2-37 122-155 (265)
19 PF13417 GST_N_3: Glutathione 97.7 4E-05 8.6E-10 38.4 2.4 19 2-20 56-74 (75)
20 TIGR00862 O-ClC intracellular 97.5 0.00039 8.6E-09 42.1 5.3 31 2-32 68-101 (236)
21 cd03077 GST_N_Alpha GST_N fami 97.2 0.00029 6.3E-09 35.7 2.0 17 2-18 61-77 (79)
22 PTZ00057 glutathione s-transfe 97.1 0.0018 3.9E-08 37.8 4.7 18 2-19 68-85 (205)
23 cd03061 GST_N_CLIC GST_N famil 96.9 0.00099 2.1E-08 35.2 2.4 19 2-20 71-89 (91)
24 KOG1695 Glutathione S-transfer 96.6 0.006 1.3E-07 36.5 4.3 36 2-40 61-96 (206)
25 cd03189 GST_C_GTT1_like GST_C 92.8 0.11 2.4E-06 27.3 2.2 25 24-48 2-26 (119)
26 cd03191 GST_C_Zeta GST_C famil 89.8 0.53 1.2E-05 24.8 2.9 21 28-48 2-22 (121)
27 cd03200 GST_C_JTV1 GST_C famil 86.9 0.73 1.6E-05 24.0 2.2 32 10-44 1-32 (96)
28 cd03182 GST_C_GTT2_like GST_C 86.5 0.6 1.3E-05 24.4 1.8 23 26-48 1-23 (117)
29 cd03195 GST_C_4 GST_C family, 85.9 0.38 8.3E-06 25.6 0.9 21 27-47 1-21 (114)
30 PF11287 DUF3088: Protein of u 83.0 1 2.3E-05 24.8 1.7 18 2-19 92-109 (112)
31 cd03186 GST_C_SspA GST_N famil 79.9 1 2.2E-05 23.2 1.0 22 27-48 1-22 (107)
32 COG2999 GrxB Glutaredoxin 2 [P 78.6 4.2 9.1E-05 24.5 3.3 21 2-22 58-78 (215)
33 PF14834 GST_C_4: Glutathione 76.4 2.4 5.3E-05 23.5 1.9 23 26-48 1-23 (117)
34 KOG4420 Uncharacterized conser 75.8 5.1 0.00011 25.5 3.3 24 2-25 87-111 (325)
35 cd03188 GST_C_Beta GST_C famil 75.3 2.5 5.4E-05 21.7 1.7 20 29-48 2-21 (114)
36 cd03178 GST_C_Ure2p_like GST_C 74.7 2.9 6.2E-05 21.6 1.8 20 29-48 1-20 (113)
37 cd03196 GST_C_5 GST_C family, 73.4 2 4.4E-05 22.8 1.1 24 25-48 2-25 (115)
38 cd03179 GST_C_1 GST_C family, 66.2 6 0.00013 19.9 1.9 20 29-48 2-21 (105)
39 cd03185 GST_C_Tau GST_C family 65.7 3.1 6.7E-05 21.9 0.7 21 28-48 2-22 (126)
40 KOG1422 Intracellular Cl- chan 65.4 12 0.00027 22.9 3.3 21 2-22 70-90 (221)
41 cd03180 GST_C_2 GST_C family, 65.3 7.1 0.00015 19.8 2.1 20 29-48 2-21 (110)
42 cd03181 GST_C_EFB1gamma GST_C 63.9 4.8 0.0001 21.0 1.3 20 29-48 1-20 (123)
43 PF09635 MetRS-N: MetRS-N bind 63.8 5.5 0.00012 22.2 1.5 17 2-18 47-63 (122)
44 PF09314 DUF1972: Domain of un 62.0 6.1 0.00013 23.4 1.6 17 2-18 157-173 (185)
45 cd08531 SAM_PNT-ERG_FLI-1 Ster 60.3 14 0.0003 18.8 2.5 21 24-44 2-24 (75)
46 cd08542 SAM_PNT-ETS-1 Sterile 57.2 17 0.00037 19.1 2.6 23 22-44 14-38 (88)
47 cd08540 SAM_PNT-ERG Sterile al 56.7 17 0.00036 18.5 2.4 21 24-44 2-24 (75)
48 cd08533 SAM_PNT-ETS-1,2 Steril 56.5 17 0.00037 18.3 2.5 15 30-44 9-23 (71)
49 TIGR03412 iscX_yfhJ FeS assemb 55.1 15 0.00032 18.2 2.0 17 4-20 3-19 (63)
50 cd03187 GST_C_Phi GST_C family 54.8 16 0.00035 18.7 2.3 20 29-48 2-21 (118)
51 PRK10721 hypothetical protein; 50.9 19 0.00041 18.0 2.0 17 4-20 6-22 (66)
52 cd08534 SAM_PNT-GABP-alpha Ste 49.5 28 0.00061 18.3 2.7 23 22-44 14-38 (89)
53 cd08541 SAM_PNT-FLI-1 Sterile 49.5 27 0.00058 18.5 2.6 23 22-44 12-36 (91)
54 cd08532 SAM_PNT-PDEF-like Ster 48.1 27 0.00059 17.7 2.4 22 23-44 6-29 (76)
55 cd03177 GST_C_Delta_Epsilon GS 46.7 18 0.00039 18.8 1.7 20 29-48 2-21 (118)
56 cd08543 SAM_PNT-ETS-2 Sterile 45.7 33 0.00072 18.1 2.6 23 22-44 14-38 (89)
57 COG2975 Uncharacterized protei 45.2 26 0.00056 17.3 2.0 18 3-20 3-20 (64)
58 KOG0183 20S proteasome, regula 43.3 76 0.0016 19.7 4.2 38 6-43 166-203 (249)
59 KOG4244 Failed axon connection 40.9 91 0.002 19.9 4.3 19 3-21 104-122 (281)
60 PF09098 Dehyd-heme_bind: Quin 39.4 29 0.00062 20.5 1.9 14 5-18 55-68 (167)
61 PF04384 Fe-S_assembly: Iron-s 39.0 36 0.00078 16.9 2.0 17 4-20 4-20 (64)
62 cd03183 GST_C_Theta GST_C fami 38.6 11 0.00024 19.8 0.1 18 30-47 2-19 (126)
63 smart00251 SAM_PNT SAM / Point 36.7 58 0.0013 16.6 2.6 22 23-44 13-36 (82)
64 cd03203 GST_C_Lambda GST_C fam 35.7 45 0.00097 17.6 2.2 14 26-39 1-14 (120)
65 COG3109 ProQ Activator of osmo 34.5 36 0.00078 20.4 1.8 18 3-20 7-24 (208)
66 COG4669 EscJ Type III secretor 34.4 83 0.0018 19.7 3.4 24 13-36 86-109 (246)
67 KOG2903 Predicted glutathione 33.5 55 0.0012 21.0 2.6 39 4-44 138-185 (319)
68 PRK04950 ProP expression regul 28.0 51 0.0011 20.2 1.8 17 3-19 7-23 (213)
69 PF00538 Linker_histone: linke 27.5 60 0.0013 16.0 1.8 13 6-18 24-36 (77)
70 PF09664 DUF2399: Protein of u 26.5 1.1E+02 0.0024 17.4 2.9 41 1-41 21-63 (152)
71 PF09107 SelB-wing_3: Elongati 25.2 71 0.0015 14.8 1.6 11 6-16 27-37 (50)
72 KOG3631 Alpha-parvin and relat 24.2 1.1E+02 0.0023 19.9 2.7 34 25-60 84-117 (365)
73 COG2231 Uncharacterized protei 23.6 90 0.0019 19.2 2.2 23 7-29 9-31 (215)
74 cd00073 H15 linker histone 1 a 23.5 77 0.0017 16.2 1.7 14 7-20 27-40 (88)
75 PF10264 Stork_head: Winged he 23.3 77 0.0017 16.4 1.7 19 7-26 33-51 (80)
76 KOG0962 DNA repair protein RAD 22.8 1.3E+02 0.0028 23.4 3.2 25 3-27 142-166 (1294)
77 PF02198 SAM_PNT: Sterile alph 22.4 82 0.0018 15.9 1.7 23 22-44 12-36 (84)
78 PF10189 DUF2356: Conserved pr 22.1 2.1E+02 0.0045 17.8 3.6 32 7-38 79-111 (230)
79 cd03190 GST_C_ECM4_like GST_C 21.3 91 0.002 17.0 1.9 21 28-48 3-23 (142)
80 cd03184 GST_C_Omega GST_C fami 21.2 56 0.0012 17.1 1.0 20 29-48 2-21 (124)
81 KOG1757 Histone 2A [Chromatin 20.9 72 0.0016 17.7 1.3 10 6-15 58-67 (131)
82 cd00059 FH Forkhead (FH), also 20.3 80 0.0017 16.0 1.4 13 7-19 25-37 (78)
No 1
>KOG0868 consensus Glutathione S-transferase [Posttranslational modification, protein turnover, chaperones]
Probab=99.64 E-value=7.4e-16 Score=89.96 Aligned_cols=59 Identities=39% Similarity=0.596 Sum_probs=56.1
Q ss_pred CceehHHHHHHHHHhhCCCCCCCCcCHHHHHHHHHHHHHHhcccccccchHHHHhhhhc
Q 036651 1 MLYLILFAILMYLEEKNPQHPLLPSDLKRKAINYQAAYLVSSSIQPLQSLPVLVVSHQD 59 (60)
Q Consensus 1 ~~l~ES~aI~~yL~~~~p~~~l~p~~~~~ra~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 59 (60)
.+|+||.||++||+|.||+++++|.|+..|+.+++....+.+.++|.+|+.|+.++.++
T Consensus 66 ~tl~eS~AII~YLeEt~P~ppLLP~d~~KRA~~r~i~~~i~sgIQPlQNl~vl~~l~ek 124 (217)
T KOG0868|consen 66 LTLTESLAIIEYLEETYPDPPLLPKDPHKRAKARAISLLIASGIQPLQNLSVLKMLNEK 124 (217)
T ss_pred EEeehHHHHHHHHHhcCCCCCCCCcCHHHHHHHHHHHHHHHhCCCcchhhHHHHHhccc
Confidence 36899999999999999999999999999999999999999999999999999998765
No 2
>PRK15113 glutathione S-transferase; Provisional
Probab=99.06 E-value=3.7e-10 Score=66.32 Aligned_cols=45 Identities=29% Similarity=0.318 Sum_probs=40.5
Q ss_pred ceehHHHHHHHHHhhCCCCC---CCCcCHHHHHHHHHHHHHHhccccc
Q 036651 2 LYLILFAILMYLEEKNPQHP---LLPSDLKRKAINYQAAYLVSSSIQP 46 (60)
Q Consensus 2 ~l~ES~aI~~yL~~~~p~~~---l~p~~~~~ra~~~~~~~~~~~~~~~ 46 (60)
+|+||.||++||+++||++. ++|.++.+|+++++|+.|+.+++++
T Consensus 68 ~l~ES~aI~~YL~~~~~~~~~~~l~p~~~~~ra~~~~~~~~~~~~~~~ 115 (214)
T PRK15113 68 ELSESSAIAEYLEERFAPPAWERIYPADLQARARARQIQAWLRSDLMP 115 (214)
T ss_pred EEecHHHHHHHHHHHcCCCCccccCCCCHHHHHHHHHHHHHHHhhhHH
Confidence 68999999999999998765 9999999999999999999877654
No 3
>COG0625 Gst Glutathione S-transferase [Posttranslational modification, protein turnover, chaperones]
Probab=99.04 E-value=4.7e-10 Score=65.55 Aligned_cols=48 Identities=31% Similarity=0.455 Sum_probs=42.3
Q ss_pred ceehHHHHHHHHHhhCCCCCCCCcCHH---HHHHHHHHHHHHhcccccccc
Q 036651 2 LYLILFAILMYLEEKNPQHPLLPSDLK---RKAINYQAAYLVSSSIQPLQS 49 (60)
Q Consensus 2 ~l~ES~aI~~yL~~~~p~~~l~p~~~~---~ra~~~~~~~~~~~~~~~~~~ 49 (60)
+|+||.||++||+++||++.++|.++. +|+...+|+.+..+++++...
T Consensus 61 ~l~ES~AI~~YL~~~~~~~~l~p~~~~~r~~r~~~~~~~~~~~~~~~~~~~ 111 (211)
T COG0625 61 VLTESGAILEYLAERYPGPPLLPADPLARRARALLLWWLFFAASDLHPVIG 111 (211)
T ss_pred eeecHHHHHHHHHhhCCCCCcCCCCchhHHHHHHHHHHHHHHHhcccHHHH
Confidence 689999999999999998778998774 888888999999989888764
No 4
>KOG0406 consensus Glutathione S-transferase [Posttranslational modification, protein turnover, chaperones]
Probab=98.98 E-value=8.1e-10 Score=66.62 Aligned_cols=45 Identities=24% Similarity=0.258 Sum_probs=40.6
Q ss_pred ceehHHHHHHHHHhhCC-CCCCCCcCHHHHHHHHHHHHHHhccccc
Q 036651 2 LYLILFAILMYLEEKNP-QHPLLPSDLKRKAINYQAAYLVSSSIQP 46 (60)
Q Consensus 2 ~l~ES~aI~~yL~~~~p-~~~l~p~~~~~ra~~~~~~~~~~~~~~~ 46 (60)
.|.||..|++||++.+| +++++|+||.+|++++.|+.+++..+..
T Consensus 68 ~i~ESliiveYiDe~w~~~~~iLP~DPy~Ra~arfwa~~id~~~~~ 113 (231)
T KOG0406|consen 68 PICESLIIVEYIDETWPSGPPILPSDPYERAQARFWAEYIDKKVFF 113 (231)
T ss_pred eehhhHHHHHHHHhhccCCCCCCCCCHHHHHHHHHHHHHHHhHHHH
Confidence 47899999999999999 4899999999999999999999976543
No 5
>PRK09481 sspA stringent starvation protein A; Provisional
Probab=98.97 E-value=9.8e-10 Score=64.39 Aligned_cols=44 Identities=18% Similarity=0.201 Sum_probs=39.1
Q ss_pred ceehHHHHHHHHHhhCCCCCCCCcCHHHHHHHHHHHHHHhcccc
Q 036651 2 LYLILFAILMYLEEKNPQHPLLPSDLKRKAINYQAAYLVSSSIQ 45 (60)
Q Consensus 2 ~l~ES~aI~~yL~~~~p~~~l~p~~~~~ra~~~~~~~~~~~~~~ 45 (60)
+|+||.||++||+++||++.++|.++.+|+++++|+.++.+.++
T Consensus 68 ~l~ES~AIl~YL~~~~~~~~l~p~~~~~ra~~~~~~~~~~~~~~ 111 (211)
T PRK09481 68 TLYESRIIMEYLDERFPHPPLMPVYPVARGESRLMMHRIEKDWY 111 (211)
T ss_pred EeeCHHHHHHHHHHhCCCCCCCCCCHHHHHHHHHHHHHHHHHHH
Confidence 68999999999999999888999999999999999988765543
No 6
>TIGR01262 maiA maleylacetoacetate isomerase. Maleylacetoacetate isomerase is an enzyme of tyrosine and phenylalanine catabolism. It requires glutathione and belongs by homology to the zeta family of glutathione S-transferases. The enzyme (EC 5.2.1.2) is described as active also on maleylpyruvate, and the example from a Ralstonia sp. catabolic plasmid is described as a maleylpyruvate isomerase involved in gentisate catabolism.
Probab=98.95 E-value=4.4e-09 Score=61.12 Aligned_cols=47 Identities=38% Similarity=0.597 Sum_probs=42.2
Q ss_pred ceehHHHHHHHHHhhCCCCCCCCcCHHHHHHHHHHHHHHhccccccc
Q 036651 2 LYLILFAILMYLEEKNPQHPLLPSDLKRKAINYQAAYLVSSSIQPLQ 48 (60)
Q Consensus 2 ~l~ES~aI~~yL~~~~p~~~l~p~~~~~ra~~~~~~~~~~~~~~~~~ 48 (60)
+|+||.||++||++++++..+.|.++.+++++++|+.++..++++..
T Consensus 61 ~l~ES~aI~~yl~~~~~~~~l~p~~~~~~a~~~~~~~~~~~~~~~~~ 107 (210)
T TIGR01262 61 VLTQSLAIIEYLEETYPDPPLLPADPIKRARVRALALLIACDIHPLN 107 (210)
T ss_pred EeecHHHHHHHHHHhCCCCCCCCCCHHHHHHHHHHHHHHhcccChhh
Confidence 68999999999999999888999999999999999999987777643
No 7
>PRK13972 GSH-dependent disulfide bond oxidoreductase; Provisional
Probab=98.90 E-value=4.3e-09 Score=61.70 Aligned_cols=45 Identities=24% Similarity=0.335 Sum_probs=38.8
Q ss_pred ceehHHHHHHHHHhhCCCCCCCCcCHHHHHHHHHHHHHHhccccccc
Q 036651 2 LYLILFAILMYLEEKNPQHPLLPSDLKRKAINYQAAYLVSSSIQPLQ 48 (60)
Q Consensus 2 ~l~ES~aI~~yL~~~~p~~~l~p~~~~~ra~~~~~~~~~~~~~~~~~ 48 (60)
+|+||.||++||+++|+ .+.|.++.+|+++++|+.|..+.+++..
T Consensus 68 ~L~ES~AI~~YL~~~~~--~l~p~~~~~ra~~~~~~~~~~~~~~~~~ 112 (215)
T PRK13972 68 SLFESGAILLYLAEKTG--LFLSHETRERAATLQWLFWQVGGLGPML 112 (215)
T ss_pred eEEcHHHHHHHHHHhcC--CCCCCCHHHHHHHHHHHHHHhhccCcce
Confidence 58999999999999985 4667889999999999999987777643
No 8
>PLN02395 glutathione S-transferase
Probab=98.89 E-value=4.1e-09 Score=61.48 Aligned_cols=46 Identities=22% Similarity=0.164 Sum_probs=40.6
Q ss_pred ceehHHHHHHHHHhhCCC--CCCCCcCHHHHHHHHHHHHHHhcccccc
Q 036651 2 LYLILFAILMYLEEKNPQ--HPLLPSDLKRKAINYQAAYLVSSSIQPL 47 (60)
Q Consensus 2 ~l~ES~aI~~yL~~~~p~--~~l~p~~~~~ra~~~~~~~~~~~~~~~~ 47 (60)
+|+||.||++||+++|++ +.++|.++.+++++++|+.|.+..+++.
T Consensus 62 ~l~ES~aI~~YL~~~~~~~~~~l~p~~~~~~~~~~~~~~~~~~~~~~~ 109 (215)
T PLN02395 62 KIFESRAIMRYYAEKYRSQGPDLLGKTIEERGQVEQWLDVEATSYHPP 109 (215)
T ss_pred EEEcHHHHHHHHHHHcCCCCcCcCCCChhHHHHHHHHHHHHHHhcCch
Confidence 689999999999999985 3689999999999999999988777654
No 9
>PRK10542 glutathionine S-transferase; Provisional
Probab=98.87 E-value=4.2e-09 Score=60.85 Aligned_cols=45 Identities=20% Similarity=0.286 Sum_probs=38.8
Q ss_pred ceehHHHHHHHHHhhCCCCCCC-CcCHHHHHHHHHHHHHHhccccc
Q 036651 2 LYLILFAILMYLEEKNPQHPLL-PSDLKRKAINYQAAYLVSSSIQP 46 (60)
Q Consensus 2 ~l~ES~aI~~yL~~~~p~~~l~-p~~~~~ra~~~~~~~~~~~~~~~ 46 (60)
+|+||.||++||++++|++.++ |.++.+|+++++|+.+..+++++
T Consensus 62 ~l~eS~aI~~YL~~~~~~~~l~~p~~~~~ra~~~~~~~~~~~~~~~ 107 (201)
T PRK10542 62 LLTEGVAIMQYLADSVPDRQLLAPVGSLSRYHTIEWLNYIATELHK 107 (201)
T ss_pred EeecHHHHHHHHHHhCcccccCCCCCcHHHHHHHHHHHHHHhhhhh
Confidence 6899999999999999987766 56788999999999988777654
No 10
>PLN02473 glutathione S-transferase
Probab=98.80 E-value=1.7e-08 Score=58.90 Aligned_cols=45 Identities=22% Similarity=0.142 Sum_probs=39.2
Q ss_pred ceehHHHHHHHHHhhCCCC--CCCCcCHHHHHHHHHHHHHHhccccc
Q 036651 2 LYLILFAILMYLEEKNPQH--PLLPSDLKRKAINYQAAYLVSSSIQP 46 (60)
Q Consensus 2 ~l~ES~aI~~yL~~~~p~~--~l~p~~~~~ra~~~~~~~~~~~~~~~ 46 (60)
+|+||.||++||++++|+. +++|.++.+|+++++|+.+..+.+.+
T Consensus 63 ~l~ES~aI~~YL~~~~~~~~~~l~p~~~~~ra~~~~~~~~~~~~~~~ 109 (214)
T PLN02473 63 KLFESRAIARYYATKYADQGTDLLGKTLEHRAIVDQWVEVENNYFYA 109 (214)
T ss_pred EEEehHHHHHHHHHHcCCcCCCCCCCCHHHHHHHHHHHHHHHhcccH
Confidence 6899999999999999753 68899999999999999988776654
No 11
>PRK10357 putative glutathione S-transferase; Provisional
Probab=98.77 E-value=1.5e-08 Score=58.72 Aligned_cols=44 Identities=20% Similarity=0.131 Sum_probs=38.3
Q ss_pred ceehHHHHHHHHHhhCCCCCCCCcCHHHHHHHHHHHHHHhcccc
Q 036651 2 LYLILFAILMYLEEKNPQHPLLPSDLKRKAINYQAAYLVSSSIQ 45 (60)
Q Consensus 2 ~l~ES~aI~~yL~~~~p~~~l~p~~~~~ra~~~~~~~~~~~~~~ 45 (60)
+|+||.||++||++++|++.++|.++.+++++++|..|....++
T Consensus 59 ~l~eS~aI~~yL~~~~~~~~l~p~~~~~~a~~~~~~~~~~~~~~ 102 (202)
T PRK10357 59 CWFDSPIIAEYIELLNVAPAMLPRDPLAALRVRQLEALADGIMD 102 (202)
T ss_pred eeecHHHHHHHHHHhCCCCCCCCCCHHHHHHHHHHHHHHHHHHH
Confidence 68999999999999998878999999999999999888765543
No 12
>PRK11752 putative S-transferase; Provisional
Probab=98.67 E-value=5.1e-08 Score=59.28 Aligned_cols=41 Identities=29% Similarity=0.322 Sum_probs=36.2
Q ss_pred ceehHHHHHHHHHhhCCCCCCCCcCHHHHHHHHHHHHHHhccc
Q 036651 2 LYLILFAILMYLEEKNPQHPLLPSDLKRKAINYQAAYLVSSSI 44 (60)
Q Consensus 2 ~l~ES~aI~~yL~~~~p~~~l~p~~~~~ra~~~~~~~~~~~~~ 44 (60)
+|+||.||++||++.|+ +++|.++.+|+++++|+.|....+
T Consensus 114 ~L~ES~AIl~YL~~~~~--~L~P~~~~era~v~~wl~~~~~~~ 154 (264)
T PRK11752 114 RVFESGAILLYLAEKFG--AFLPKDLAARTETLNWLFWQQGSA 154 (264)
T ss_pred EEEcHHHHHHHHHHhcC--CcCCCCHHHHHHHHHHHHHHhhhh
Confidence 68999999999999986 488999999999999999876543
No 13
>KOG0867 consensus Glutathione S-transferase [Posttranslational modification, protein turnover, chaperones]
Probab=98.41 E-value=8.3e-07 Score=53.08 Aligned_cols=46 Identities=37% Similarity=0.390 Sum_probs=40.6
Q ss_pred ceehHHHHHHHHHhhCC-CCC-CCCcCHHHHHHHHHHHHHHhcccccc
Q 036651 2 LYLILFAILMYLEEKNP-QHP-LLPSDLKRKAINYQAAYLVSSSIQPL 47 (60)
Q Consensus 2 ~l~ES~aI~~yL~~~~p-~~~-l~p~~~~~ra~~~~~~~~~~~~~~~~ 47 (60)
.++||.||+.||.+.|. ... ++|.+..+|+.+.+|+.|.++.+.+.
T Consensus 63 ~l~eS~AI~~Yl~~ky~~~~~~l~p~~~~~ra~v~~~l~~~~~~l~~~ 110 (226)
T KOG0867|consen 63 TLWESHAILRYLAEKYGPLGGILLPKDLKERAIVDQWLEFENGVLDPV 110 (226)
T ss_pred EEeeHHHHHHHHHHHcCCCCcccCCcCHHHHHHHHHHHHhhhcccccc
Confidence 58999999999999996 334 88999999999999999999888765
No 14
>PLN02907 glutamate-tRNA ligase
Probab=98.29 E-value=2e-06 Score=58.77 Aligned_cols=41 Identities=7% Similarity=-0.067 Sum_probs=37.9
Q ss_pred ceehHHHHHHHHHhhCCCCCCCCcCHHHHHHHHHHHHHHhc
Q 036651 2 LYLILFAILMYLEEKNPQHPLLPSDLKRKAINYQAAYLVSS 42 (60)
Q Consensus 2 ~l~ES~aI~~yL~~~~p~~~l~p~~~~~ra~~~~~~~~~~~ 42 (60)
.|+||.||++||++.+|+..++|.++.+++++.+|+.|+..
T Consensus 48 ~L~ES~AIl~YLa~~~p~~~L~p~d~~erAqV~qWL~~~~~ 88 (722)
T PLN02907 48 KLTGTNVLLRYIARSASLPGFYGQDAFESSQVDEWLDYAPT 88 (722)
T ss_pred EEECHHHHHHHHHHhCCCcCCCCCCHHHHHHHHHHHHHHhh
Confidence 68999999999999998888999999999999999998865
No 15
>PRK10387 glutaredoxin 2; Provisional
Probab=98.25 E-value=1e-06 Score=51.14 Aligned_cols=41 Identities=17% Similarity=0.118 Sum_probs=31.7
Q ss_pred ceehHHHHHHHHHhhCCCCCCCCcCHHHHHHHHHHHHHHhcccc
Q 036651 2 LYLILFAILMYLEEKNPQHPLLPSDLKRKAINYQAAYLVSSSIQ 45 (60)
Q Consensus 2 ~l~ES~aI~~yL~~~~p~~~l~p~~~~~ra~~~~~~~~~~~~~~ 45 (60)
+|+||.||++||+++||++.+ + + .+++.+++|+.+....++
T Consensus 58 ~l~eS~aI~~yL~~~~~~~~l-~-~-~~~~~~~~~~~~~~~~~~ 98 (210)
T PRK10387 58 YMPESLDIVHYIDELDGKPLL-T-G-KRSPAIEEWLRKVFGYLN 98 (210)
T ss_pred EecCHHHHHHHHHHhCCCccC-C-C-cccHHHHHHHHHHHHHhh
Confidence 689999999999999987544 3 1 256788888888765554
No 16
>TIGR02182 GRXB Glutaredoxin, GrxB family. This model includes the highly abundant E. coli GrxB (Grx2) glutaredoxin which is notably longer than either GrxA or GrxC. Unlike the other two E. coli glutaredoxins, GrxB appears to be unable to reduce ribonucleotide reductase, and may have more to do with resistance to redox stress.
Probab=97.92 E-value=1.7e-05 Score=46.59 Aligned_cols=40 Identities=10% Similarity=-0.048 Sum_probs=28.1
Q ss_pred ceehHHHHHHHHHhhCCCCCCCCcCHHHHHHHHHHHHHHhccc
Q 036651 2 LYLILFAILMYLEEKNPQHPLLPSDLKRKAINYQAAYLVSSSI 44 (60)
Q Consensus 2 ~l~ES~aI~~yL~~~~p~~~l~p~~~~~ra~~~~~~~~~~~~~ 44 (60)
+|+||.+|++||+++||.+.+.+. .+..+++|+.++...+
T Consensus 57 ~l~es~~I~~yL~~~~~~~~~~~~---~~~~~~~~~~~~~~~~ 96 (209)
T TIGR02182 57 AMPESLDIVAYFDKLDGEPLLTGK---VSPEIEAWLRKVTGYA 96 (209)
T ss_pred EeccHHHHHHHHHHhCCCccCCCC---ChHHHHHHHHHHHHHh
Confidence 689999999999999987544332 3456667776655443
No 17
>PLN02378 glutathione S-transferase DHAR1
Probab=97.74 E-value=8.1e-05 Score=43.80 Aligned_cols=33 Identities=27% Similarity=0.093 Sum_probs=25.3
Q ss_pred ceehHHHHHHHHHhhCCCCCCCCcCHHHHHHHHHH
Q 036651 2 LYLILFAILMYLEEKNPQHPLLPSDLKRKAINYQA 36 (60)
Q Consensus 2 ~l~ES~aI~~yL~~~~p~~~l~p~~~~~ra~~~~~ 36 (60)
+|+||.||++||+++||++.+ .++.+++.+...
T Consensus 69 ~l~ES~aI~~YL~~~~~~~~l--~~~~~~a~i~~~ 101 (213)
T PLN02378 69 WVTDSDVIVGILEEKYPDPPL--KTPAEFASVGSN 101 (213)
T ss_pred EecCHHHHHHHHHHhCCCCCC--CCHHHHHHHHHH
Confidence 589999999999999987654 355666665543
No 18
>PLN02817 glutathione dehydrogenase (ascorbate)
Probab=97.70 E-value=9.3e-05 Score=45.38 Aligned_cols=34 Identities=29% Similarity=0.111 Sum_probs=27.4
Q ss_pred ceehHHHHHHHHHhhCCCCCCCCcCHHHHHHHHHHH
Q 036651 2 LYLILFAILMYLEEKNPQHPLLPSDLKRKAINYQAA 37 (60)
Q Consensus 2 ~l~ES~aI~~yL~~~~p~~~l~p~~~~~ra~~~~~~ 37 (60)
+|+||.||++||+++||++.+. ++.+++.+..++
T Consensus 122 ~L~ES~aI~~YL~e~~p~~~L~--~~~era~i~~~l 155 (265)
T PLN02817 122 WVADSDVITQALEEKYPDPPLA--TPPEKASVGSKI 155 (265)
T ss_pred EEecHHHHHHHHHHHCCCCCCC--CHHHHHHHHHHH
Confidence 5899999999999999987653 567777777654
No 19
>PF13417 GST_N_3: Glutathione S-transferase, N-terminal domain; PDB: 3ERG_B 3IBH_A 3ERF_A 3UBL_A 3UBK_A 3IR4_A 3M8N_B 2R4V_A 2PER_A 2R5G_A ....
Probab=97.68 E-value=4e-05 Score=38.39 Aligned_cols=19 Identities=37% Similarity=0.445 Sum_probs=17.5
Q ss_pred ceehHHHHHHHHHhhCCCC
Q 036651 2 LYLILFAILMYLEEKNPQH 20 (60)
Q Consensus 2 ~l~ES~aI~~yL~~~~p~~ 20 (60)
+|+||.+|++||+++||++
T Consensus 56 ~l~dS~~I~~yL~~~~~~~ 74 (75)
T PF13417_consen 56 VLTDSAAIIEYLEERYPGP 74 (75)
T ss_dssp EEESHHHHHHHHHHHSTSS
T ss_pred EEeCHHHHHHHHHHHcCCC
Confidence 6899999999999999875
No 20
>TIGR00862 O-ClC intracellular chloride channel protein. These proteins are thought to function in the regulation of the membrane potential and in transepithelial ion absorption and secretion in the kidney.
Probab=97.49 E-value=0.00039 Score=42.14 Aligned_cols=31 Identities=19% Similarity=0.108 Sum_probs=24.2
Q ss_pred ceehHHHHHHHHHhhCCC---CCCCCcCHHHHHH
Q 036651 2 LYLILFAILMYLEEKNPQ---HPLLPSDLKRKAI 32 (60)
Q Consensus 2 ~l~ES~aI~~yL~~~~p~---~~l~p~~~~~ra~ 32 (60)
+|+||.||++||+++|++ +.+.|.++..++.
T Consensus 68 ~l~ES~aI~eYL~e~~~~~~~p~l~p~~~~~~~~ 101 (236)
T TIGR00862 68 VKTDVNKIEEFLEETLCPPRYPKLSPKHPESNTA 101 (236)
T ss_pred EeecHHHHHHHHHHHcCCCCCCCCCCCCHHHHHH
Confidence 689999999999999975 4466667665554
No 21
>cd03077 GST_N_Alpha GST_N family, Class Alpha subfamily; GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins and products of oxidative stress. The GST fold contains an N-terminal TRX-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. The class Alpha subfamily is composed of eukaryotic GSTs which can form homodimer and heterodimers. There are at least six types of class Alpha GST subunits in rats, four of which have human counterparts, resulting in many possible isoenzymes with different activities, tissue distribution and substrate specificities. Human GSTA1-1 and GSTA2-2 show high GSH peroxidase activity. GSTA3-3 catalyzes the isomerization of intermediates in steroid hormone biosynthesis. GSTA4-4 preferentially catalyzes the
Probab=97.19 E-value=0.00029 Score=35.69 Aligned_cols=17 Identities=29% Similarity=0.241 Sum_probs=15.4
Q ss_pred ceehHHHHHHHHHhhCC
Q 036651 2 LYLILFAILMYLEEKNP 18 (60)
Q Consensus 2 ~l~ES~aI~~yL~~~~p 18 (60)
+|+||.||+.||+++|+
T Consensus 61 ~l~ES~AI~~YL~~~~~ 77 (79)
T cd03077 61 KLVQTRAILNYIAGKYN 77 (79)
T ss_pred EEeeHHHHHHHHHHHcC
Confidence 68999999999999874
No 22
>PTZ00057 glutathione s-transferase; Provisional
Probab=97.05 E-value=0.0018 Score=37.80 Aligned_cols=18 Identities=28% Similarity=0.355 Sum_probs=16.4
Q ss_pred ceehHHHHHHHHHhhCCC
Q 036651 2 LYLILFAILMYLEEKNPQ 19 (60)
Q Consensus 2 ~l~ES~aI~~yL~~~~p~ 19 (60)
+|+||.||++||+++|+.
T Consensus 68 ~l~eS~AI~~YLa~~~~~ 85 (205)
T PTZ00057 68 IFAQSQAIVRYLSKKYKI 85 (205)
T ss_pred EEecHHHHHHHHHHHcCC
Confidence 689999999999999964
No 23
>cd03061 GST_N_CLIC GST_N family, Chloride Intracellular Channel (CLIC) subfamily; composed of CLIC1-5, p64, parchorin and similar proteins. They are auto-inserting, self-assembling intracellular anion channels involved in a wide variety of functions including regulated secretion, cell division and apoptosis. They can exist in both water-soluble and membrane-bound states, and are found in various vesicles and membranes. Biochemical studies of the C. elegans homolog, EXC-4, show that the membrane localization domain is present in the N-terminal part of the protein. The structure of soluble human CLIC1 reveals that it is monomeric and it adopts a fold similar to GSTs, containing an N-terminal domain with a TRX fold and a C-terminal alpha helical domain. Upon oxidation, the N-terminal domain of CLIC1 undergoes a structural change to form a non-covalent dimer stabilized by the formation of an intramolecular disulfide bond between two cysteines that are far apart in the reduced form. The CLI
Probab=96.90 E-value=0.00099 Score=35.19 Aligned_cols=19 Identities=21% Similarity=-0.062 Sum_probs=17.1
Q ss_pred ceehHHHHHHHHHhhCCCC
Q 036651 2 LYLILFAILMYLEEKNPQH 20 (60)
Q Consensus 2 ~l~ES~aI~~yL~~~~p~~ 20 (60)
+|+||.+|++||+++++.+
T Consensus 71 ~i~eS~~I~eYLde~~~~~ 89 (91)
T cd03061 71 VKTDNNKIEEFLEETLCPP 89 (91)
T ss_pred EecCHHHHHHHHHHHccCC
Confidence 6899999999999998765
No 24
>KOG1695 consensus Glutathione S-transferase [Posttranslational modification, protein turnover, chaperones]
Probab=96.56 E-value=0.006 Score=36.53 Aligned_cols=36 Identities=22% Similarity=0.028 Sum_probs=28.4
Q ss_pred ceehHHHHHHHHHhhCCCCCCCCcCHHHHHHHHHHHHHH
Q 036651 2 LYLILFAILMYLEEKNPQHPLLPSDLKRKAINYQAAYLV 40 (60)
Q Consensus 2 ~l~ES~aI~~yL~~~~p~~~l~p~~~~~ra~~~~~~~~~ 40 (60)
.|.+|.||++||+.+|+ +.+.++.+++.+....+-+
T Consensus 61 ~i~QS~AI~RyLArk~g---l~Gkt~~E~a~vD~i~d~~ 96 (206)
T KOG1695|consen 61 KLVQSRAILRYLARKFG---LAGKTEEEEAWVDMIVDQF 96 (206)
T ss_pred eeccHHHHHHHHHHHhC---cCCCCHHHHHHHHHHHHhh
Confidence 47899999999999874 6677888888777665433
No 25
>cd03189 GST_C_GTT1_like GST_C family, Saccharomyces cerevisiae GTT1-like subfamily; composed of predominantly uncharacterized proteins with similarity to the S. cerevisiae GST protein, GTT1, and the Schizosaccharomyces pombe GST-III. GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins, and products of oxidative stress. GSTs also show GSH peroxidase activity and are involved in the synthesis of prostaglandins and leukotrienes. The GST fold contains an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. GSH binds to the N-terminal domain while the hydrophobic substrate occupies a pocket in the C-terminal domain. GTT1, a homodimer, exhibits GST activity with standard substrates and associates with the endopl
Probab=92.83 E-value=0.11 Score=27.33 Aligned_cols=25 Identities=20% Similarity=0.111 Sum_probs=22.0
Q ss_pred CcCHHHHHHHHHHHHHHhccccccc
Q 036651 24 PSDLKRKAINYQAAYLVSSSIQPLQ 48 (60)
Q Consensus 24 p~~~~~ra~~~~~~~~~~~~~~~~~ 48 (60)
|.++.+|+++++|+.|.++++++..
T Consensus 2 ~~~~~~ra~~~~wl~~~~~~~~~~~ 26 (119)
T cd03189 2 PPDTAEYADYLYWLHFAEGSLMPPL 26 (119)
T ss_pred CCCHHHHHHHHHHHHHHhHhhhHHH
Confidence 5688999999999999999988764
No 26
>cd03191 GST_C_Zeta GST_C family, Class Zeta subfamily; GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins, and products of oxidative stress. The GST fold contains an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. GSH binds to the N-terminal domain while the hydrophobic substrate occupies a pocket in the C-terminal domain. Class Zeta GSTs, also known as maleylacetoacetate (MAA) isomerases, catalyze the isomerization of MAA to fumarylacetoacetate, the penultimate step in tyrosine/phenylalanine catabolism, using GSH as a cofactor. They show little GSH-conjugating activity towards traditional GST substrates, but display modest GSH peroxidase activity. They are also implicated in the detoxification of th
Probab=89.77 E-value=0.53 Score=24.85 Aligned_cols=21 Identities=29% Similarity=0.488 Sum_probs=18.1
Q ss_pred HHHHHHHHHHHHHhccccccc
Q 036651 28 KRKAINYQAAYLVSSSIQPLQ 48 (60)
Q Consensus 28 ~~ra~~~~~~~~~~~~~~~~~ 48 (60)
.+|+++++|+.|++++++|..
T Consensus 2 ~~ra~~~~w~~~~~~~~~~~~ 22 (121)
T cd03191 2 KKRARVRALALIIACDIHPLN 22 (121)
T ss_pred hhHHHHHHHHHHHHccCCccc
Confidence 578999999999999998764
No 27
>cd03200 GST_C_JTV1 GST_C family, JTV-1 subfamily; composed of uncharacterized proteins with similarity to the translation product of the human JTV-1 gene. Human JTV-1, a gene of unknown function, initiates within the human PMS2 gene promoter, but is transcribed from the opposite strand. PMS2 encodes a protein involved in DNA mismatch repair and is mutated in a subset of patients with hereditary nonpolyposis colon cancer. It is unknown whether the expression of JTV-1 affects that of PMS2, or vice versa, as a result of their juxtaposition. JTV-1 is up-regulated while PMS2 is down-regulated in tumor cell spheroids that show increased resistance to anticancer cytotoxic drugs compared with tumor cell monolayers indicating that suppressed DNA mismatch repair may be a mechanism for multicellular resistance to alkylating agents.
Probab=86.91 E-value=0.73 Score=23.96 Aligned_cols=32 Identities=6% Similarity=-0.166 Sum_probs=25.4
Q ss_pred HHHHHhhCCCCCCCCcCHHHHHHHHHHHHHHhccc
Q 036651 10 LMYLEEKNPQHPLLPSDLKRKAINYQAAYLVSSSI 44 (60)
Q Consensus 10 ~~yL~~~~p~~~l~p~~~~~ra~~~~~~~~~~~~~ 44 (60)
++||++. .++.|.++.+++++..|+.+....+
T Consensus 1 ~r~~~~~---~~~~~~~~~~~~~vd~~~d~~~~~l 32 (96)
T cd03200 1 ARFLYRL---LGPAPNAPNAATNIDSWVDTAIFQL 32 (96)
T ss_pred CchHHHH---hcccCCCchHHHHHHHHHHHHHHHH
Confidence 3688876 3688999999999999998776444
No 28
>cd03182 GST_C_GTT2_like GST_C family, Saccharomyces cerevisiae GTT2-like subfamily; composed of predominantly uncharacterized proteins with similarity to the S. cerevisiae GST protein, GTT2. GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins, and products of oxidative stress. GSTs also show GSH peroxidase activity and are involved in the synthesis of prostaglandins and leukotrienes. The GST fold contains an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. GSH binds to the N-terminal domain while the hydrophobic substrate occupies a pocket in the C-terminal domain. GTT2, a homodimer, exhibits GST activity with standard substrates. Strains with deleted GTT2 genes are viable but exhibit increased sensiti
Probab=86.50 E-value=0.6 Score=24.36 Aligned_cols=23 Identities=4% Similarity=-0.432 Sum_probs=19.0
Q ss_pred CHHHHHHHHHHHHHHhccccccc
Q 036651 26 DLKRKAINYQAAYLVSSSIQPLQ 48 (60)
Q Consensus 26 ~~~~ra~~~~~~~~~~~~~~~~~ 48 (60)
|+.+|+++++|+.|+++++++..
T Consensus 1 d~~~ra~~~~w~~~~~~~~~~~~ 23 (117)
T cd03182 1 TPLERAQIEMWQRRAELQGLYPI 23 (117)
T ss_pred CHHHHHHHHHHHHHHHHHHHHHH
Confidence 57789999999999988876544
No 29
>cd03195 GST_C_4 GST_C family, unknown subfamily 4; composed of uncharacterized proteins with similarity to GSTs. GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins and products of oxidative stress. GSTs also show GSH peroxidase activity and are involved in the synthesis of prostaglandins and leukotrienes. The GST fold contains an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. GSH binds to the N-terminal domain while the hydrophobic substrate occupies a pocket in the C-terminal domain.
Probab=85.94 E-value=0.38 Score=25.57 Aligned_cols=21 Identities=19% Similarity=0.289 Sum_probs=18.3
Q ss_pred HHHHHHHHHHHHHHhcccccc
Q 036651 27 LKRKAINYQAAYLVSSSIQPL 47 (60)
Q Consensus 27 ~~~ra~~~~~~~~~~~~~~~~ 47 (60)
|.+|+++++|+.|++++++++
T Consensus 1 ~~~ra~~~~~~~~~~~~~~~~ 21 (114)
T cd03195 1 PRQRARARQVQAWLRSDLLPI 21 (114)
T ss_pred CHhhHHHHHHHHHHHhhHHHH
Confidence 357899999999999999874
No 30
>PF11287 DUF3088: Protein of unknown function (DUF3088); InterPro: IPR021439 This family of proteins with unknown function appears to be restricted to Proteobacteria.
Probab=83.03 E-value=1 Score=24.75 Aligned_cols=18 Identities=28% Similarity=0.165 Sum_probs=15.8
Q ss_pred ceehHHHHHHHHHhhCCC
Q 036651 2 LYLILFAILMYLEEKNPQ 19 (60)
Q Consensus 2 ~l~ES~aI~~yL~~~~p~ 19 (60)
.|+|...|+.||.++|+-
T Consensus 92 fi~d~~~I~~~La~r~g~ 109 (112)
T PF11287_consen 92 FIDDPRRILRYLAERHGF 109 (112)
T ss_pred EeCCHHHHHHHHHHHcCC
Confidence 578899999999999964
No 31
>cd03186 GST_C_SspA GST_N family, Stringent starvation protein A (SspA) subfamily; SspA is a RNA polymerase (RNAP)-associated protein required for the lytic development of phage P1 and for stationary phase-induced acid tolerance of E. coli. It is implicated in survival during nutrient starvation. SspA adopts the GST fold with an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain, but it does not bind glutathione (GSH) and lacks GST activity. SspA is highly conserved among gram-negative bacteria. Related proteins found in Neisseria (called RegF), Francisella and Vibrio regulate the expression of virulence factors necessary for pathogenesis.
Probab=79.86 E-value=1 Score=23.20 Aligned_cols=22 Identities=14% Similarity=0.001 Sum_probs=18.2
Q ss_pred HHHHHHHHHHHHHHhccccccc
Q 036651 27 LKRKAINYQAAYLVSSSIQPLQ 48 (60)
Q Consensus 27 ~~~ra~~~~~~~~~~~~~~~~~ 48 (60)
+.+|++++.|+.|+++++++..
T Consensus 1 p~~ra~~r~w~~~~~~~~~~~~ 22 (107)
T cd03186 1 PVARARSRLLMHRIEQDWYPLV 22 (107)
T ss_pred ChHHHHHHHHHHHHHHHHHHHH
Confidence 3578999999999998887654
No 32
>COG2999 GrxB Glutaredoxin 2 [Posttranslational modification, protein turnover, chaperones]
Probab=78.55 E-value=4.2 Score=24.50 Aligned_cols=21 Identities=19% Similarity=0.227 Sum_probs=17.1
Q ss_pred ceehHHHHHHHHHhhCCCCCC
Q 036651 2 LYLILFAILMYLEEKNPQHPL 22 (60)
Q Consensus 2 ~l~ES~aI~~yL~~~~p~~~l 22 (60)
.+.||..|..|+++.++++-+
T Consensus 58 ~m~ESlDIV~y~d~~~~~~~l 78 (215)
T COG2999 58 AMPESLDIVHYVDELDGKPLL 78 (215)
T ss_pred cchhhhHHHHHHHHhcCchhh
Confidence 478999999999998876533
No 33
>PF14834 GST_C_4: Glutathione S-transferase, C-terminal domain; PDB: 3BBY_A.
Probab=76.44 E-value=2.4 Score=23.48 Aligned_cols=23 Identities=22% Similarity=0.216 Sum_probs=19.3
Q ss_pred CHHHHHHHHHHHHHHhccccccc
Q 036651 26 DLKRKAINYQAAYLVSSSIQPLQ 48 (60)
Q Consensus 26 ~~~~ra~~~~~~~~~~~~~~~~~ 48 (60)
|..+|++.++...|+.|++.++.
T Consensus 1 D~~~RArAR~vqAwlrSdf~~lR 23 (117)
T PF14834_consen 1 DRQERARARQVQAWLRSDFMALR 23 (117)
T ss_dssp SHHHHHHHHHHHHHHHHS-HHHH
T ss_pred CHHHHHHHHHHHHHHHcccHHHH
Confidence 56899999999999999988765
No 34
>KOG4420 consensus Uncharacterized conserved protein (Ganglioside-induced differentiation associated protein 1, GDAP1) [Function unknown]
Probab=75.83 E-value=5.1 Score=25.51 Aligned_cols=24 Identities=21% Similarity=0.289 Sum_probs=19.8
Q ss_pred ceehHHHHHHHHHhhCCC-CCCCCc
Q 036651 2 LYLILFAILMYLEEKNPQ-HPLLPS 25 (60)
Q Consensus 2 ~l~ES~aI~~yL~~~~p~-~~l~p~ 25 (60)
+|.|+.-|+.|++..|-+ ..+.|.
T Consensus 87 II~d~tqIIdYvErtf~ger~l~pe 111 (325)
T KOG4420|consen 87 IISDYTQIIDYVERTFTGERVLMPE 111 (325)
T ss_pred ecccHHHHHHHHHHhhccccccccc
Confidence 588999999999998855 467784
No 35
>cd03188 GST_C_Beta GST_C family, Class Beta subfamily; GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins, and products of oxidative stress. The GST fold contains an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. GSH binds to the N-terminal domain while the hydrophobic substrate occupies a pocket in the C-terminal domain. Unlike mammalian GSTs which detoxify a broad range of compounds, the bacterial class Beta GSTs exhibit limited GSH conjugating activity with a narrow range of substrates. In addition to GSH conjugation, they also bind antibiotics and reduce the antimicrobial activity of beta-lactam drugs. The structure of the Proteus mirabilis enzyme reveals that the cysteine in the active site for
Probab=75.27 E-value=2.5 Score=21.68 Aligned_cols=20 Identities=15% Similarity=0.062 Sum_probs=17.0
Q ss_pred HHHHHHHHHHHHhccccccc
Q 036651 29 RKAINYQAAYLVSSSIQPLQ 48 (60)
Q Consensus 29 ~ra~~~~~~~~~~~~~~~~~ 48 (60)
+|+++++|+.|.++++.|..
T Consensus 2 ~ra~~~~w~~~~~~~~~~~~ 21 (114)
T cd03188 2 ERARLLEWLNFLSSELHKAF 21 (114)
T ss_pred cHHHHHHHHHHHhhhhchhh
Confidence 57899999999998887754
No 36
>cd03178 GST_C_Ure2p_like GST_C family, Ure2p-like subfamily; composed of the Saccharomyces cerevisiae Ure2p and related GSTs. Ure2p is a regulator for nitrogen catabolism in yeast. It represses the expression of several gene products involved in the use of poor nitrogen sources when rich sources are available. A transmissible conformational change of Ure2p results in a prion called [Ure3], an inactive, self-propagating and infectious amyloid. Ure2p displays a GST fold containing an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain. The N-terminal thioredoxin-fold domain is sufficient to induce the [Ure3] phenotype and is also called the prion domain of Ure2p. In addition to its role in nitrogen regulation, Ure2p confers protection to cells against heavy metal ion and oxidant toxicity, and shows glutathione (GSH) peroxidase activity. GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of GSH with a wide range of en
Probab=74.69 E-value=2.9 Score=21.56 Aligned_cols=20 Identities=10% Similarity=0.144 Sum_probs=17.2
Q ss_pred HHHHHHHHHHHHhccccccc
Q 036651 29 RKAINYQAAYLVSSSIQPLQ 48 (60)
Q Consensus 29 ~ra~~~~~~~~~~~~~~~~~ 48 (60)
+|+++++|+.|.++.+++..
T Consensus 1 ~ra~~~~wl~~~~~~~~~~~ 20 (113)
T cd03178 1 ERYEVLQWLFFQMGGLGPMF 20 (113)
T ss_pred ChHHHHHHHHHHHccCCCcc
Confidence 47899999999999998765
No 37
>cd03196 GST_C_5 GST_C family, unknown subfamily 5; composed of uncharacterized bacterial proteins with similarity to GSTs. GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins, and products of oxidative stress. GSTs also show GSH peroxidase activity and are involved in the synthesis of prostaglandins and leukotrienes. The GST fold contains an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. GSH binds to the N-terminal domain while the hydrophobic substrate occupies a pocket in the C-terminal domain.
Probab=73.39 E-value=2 Score=22.79 Aligned_cols=24 Identities=4% Similarity=-0.215 Sum_probs=18.8
Q ss_pred cCHHHHHHHHHHHHHHhccccccc
Q 036651 25 SDLKRKAINYQAAYLVSSSIQPLQ 48 (60)
Q Consensus 25 ~~~~~ra~~~~~~~~~~~~~~~~~ 48 (60)
.|+.+++.+++|.+|...++.+..
T Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~~~ 25 (115)
T cd03196 2 QDPAALKEMLALIAENDNEFKHHL 25 (115)
T ss_pred CchHHHHHHHHHHHHcchhhHHHH
Confidence 467888999999988888876544
No 38
>cd03179 GST_C_1 GST_C family, unknown subfamily 1; composed of uncharacterized bacterial proteins, with similarity to GSTs. GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins and products of oxidative stress. GSTs also show GSH peroxidase activity and are involved in the synthesis of prostaglandins and leukotrienes. The GST fold contains an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. GSH binds to the N-terminal domain while the hydrophobic substrate occupies a pocket in the C-terminal domain.
Probab=66.17 E-value=6 Score=19.91 Aligned_cols=20 Identities=15% Similarity=0.152 Sum_probs=16.5
Q ss_pred HHHHHHHHHHHHhccccccc
Q 036651 29 RKAINYQAAYLVSSSIQPLQ 48 (60)
Q Consensus 29 ~ra~~~~~~~~~~~~~~~~~ 48 (60)
+|+++++|+.|.++.+++..
T Consensus 2 ~ra~~~~wl~~~~~~~~~~~ 21 (105)
T cd03179 2 ERAQVLRWLFFEQYSHEPYI 21 (105)
T ss_pred cHHHHHHHHHHhhcccCccc
Confidence 57899999999988887654
No 39
>cd03185 GST_C_Tau GST_C family, Class Tau subfamily; GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins and products of oxidative stress. The GST fold contains an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. GSH binds to the N-terminal domain while the hydrophobic substrate occupies a pocket in the C-terminal domain. The plant-specific class Tau GST subfamily has undergone extensive gene duplication. The Arabidopsis and Oryza genomes contain 28 and 40 Tau GSTs, respectively. They are primarily responsible for herbicide detoxification together with class Phi GSTs, showing class specificity in substrate preference. Tau enzymes are highly efficient in detoxifying diphenylether and aryloxyphenoxypropi
Probab=65.72 E-value=3.1 Score=21.86 Aligned_cols=21 Identities=14% Similarity=0.164 Sum_probs=17.0
Q ss_pred HHHHHHHHHHHHHhccccccc
Q 036651 28 KRKAINYQAAYLVSSSIQPLQ 48 (60)
Q Consensus 28 ~~ra~~~~~~~~~~~~~~~~~ 48 (60)
.+|+++++|+.|+++.+++..
T Consensus 2 ~~ra~~~~w~~~~~~~~~~~~ 22 (126)
T cd03185 2 YERAVARFWAAFIDDKLFPAG 22 (126)
T ss_pred hhHHHHHHHHHHHHHHHHHHH
Confidence 578999999999988876544
No 40
>KOG1422 consensus Intracellular Cl- channel CLIC, contains GST domain [Inorganic ion transport and metabolism]
Probab=65.37 E-value=12 Score=22.87 Aligned_cols=21 Identities=33% Similarity=0.184 Sum_probs=17.9
Q ss_pred ceehHHHHHHHHHhhCCCCCC
Q 036651 2 LYLILFAILMYLEEKNPQHPL 22 (60)
Q Consensus 2 ~l~ES~aI~~yL~~~~p~~~l 22 (60)
.++||..|-++|++.+|.+.+
T Consensus 70 ~~tDs~~Ie~~Lee~l~~p~~ 90 (221)
T KOG1422|consen 70 WVTDSDKIEEFLEEKLPPPKL 90 (221)
T ss_pred eeccHHHHHHHHHHhcCCCCC
Confidence 368999999999999988653
No 41
>cd03180 GST_C_2 GST_C family, unknown subfamily 2; composed of uncharacterized bacterial proteins, with similarity to GSTs. GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins and products of oxidative stress. GSTs also show GSH peroxidase activity and are involved in the synthesis of prostaglandins and leukotrienes. The GST fold contains an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. GSH binds to the N-terminal domain while the hydrophobic substrate occupies a pocket in the C-terminal domain.
Probab=65.33 E-value=7.1 Score=19.78 Aligned_cols=20 Identities=15% Similarity=0.064 Sum_probs=16.0
Q ss_pred HHHHHHHHHHHHhccccccc
Q 036651 29 RKAINYQAAYLVSSSIQPLQ 48 (60)
Q Consensus 29 ~ra~~~~~~~~~~~~~~~~~ 48 (60)
+|+++++|+.|..+.+++..
T Consensus 2 ~ra~~~~~l~~~~~~~~~~~ 21 (110)
T cd03180 2 ARARADRWMDWQTSTLNPAF 21 (110)
T ss_pred chhHHHHHHHHHHhhcChHH
Confidence 47788899988888887765
No 42
>cd03181 GST_C_EFB1gamma GST_C family, Gamma subunit of Elongation Factor 1B (EFB1gamma) subfamily; EF1Bgamma is part of the eukaryotic translation elongation factor-1 (EF1) complex which plays a central role in the elongation cycle during protein biosynthesis. EF1 consists of two functionally distinct units, EF1A and EF1B. EF1A catalyzes the GTP-dependent binding of aminoacyl-tRNA to the ribosomal A site concomitant with the hydrolysis of GTP. The resulting inactive EF1A:GDP complex is recycled to the active GTP form by the guanine-nucleotide exchange factor EF1B, a complex composed of at least two subunits, alpha and gamma. Metazoan EFB1 contain a third subunit, beta. The EF1B gamma subunit contains a GST fold consisting of an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain. The GST-like domain of EF1Bgamma is believed to mediate the dimerization of the EF1 complex, which in yeast is a dimer of the heterotrimer EF1A:EF1Balpha:EF1Bgamma. In addition to its role
Probab=63.93 E-value=4.8 Score=21.00 Aligned_cols=20 Identities=15% Similarity=0.135 Sum_probs=15.7
Q ss_pred HHHHHHHHHHHHhccccccc
Q 036651 29 RKAINYQAAYLVSSSIQPLQ 48 (60)
Q Consensus 29 ~ra~~~~~~~~~~~~~~~~~ 48 (60)
+|+++++|+.|.++.+++..
T Consensus 1 ~ra~~~~wl~~~~~~~~~~~ 20 (123)
T cd03181 1 EEAQVLQWVSFANTELLPAV 20 (123)
T ss_pred ChHHHHHHHHHHHhhhHHHH
Confidence 36788889999888887754
No 43
>PF09635 MetRS-N: MetRS-N binding domain; InterPro: IPR018285 This entry represents the N-terminal domain of methionyl-tRNA synthetase (MetRS). This N-terminal appended domain mediates non-catalytic complex formation through its interaction with a domain in the tRNA aminoacylation cofactor Arc1p. The interacting domains of MetRS, GluRS (glutamyl-tRNA synthetase) and Arc1p form a ternary complex resembling a classical GST homo-dimer []. Domain-swapping between symmetrically related MetRS-N and Arc1p-N domains generates a 2:2 tetramer held together by van der Waals forces. This domain is necessary for formation of the aminoacyl-tRNA synthetase complex necessary for tRNA nuclear export and shuttling as part of the translational apparatus. ; PDB: 2HSN_A.
Probab=63.80 E-value=5.5 Score=22.24 Aligned_cols=17 Identities=24% Similarity=0.143 Sum_probs=9.9
Q ss_pred ceehHHHHHHHHHhhCC
Q 036651 2 LYLILFAILMYLEEKNP 18 (60)
Q Consensus 2 ~l~ES~aI~~yL~~~~p 18 (60)
.|+|..||++|+..-|-
T Consensus 47 ~L~e~NAIvrYl~nDF~ 63 (122)
T PF09635_consen 47 ELFEPNAIVRYLANDFE 63 (122)
T ss_dssp ----HHHHHHHHTT--T
T ss_pred EEecccHHHHHHHhhcC
Confidence 47899999999987663
No 44
>PF09314 DUF1972: Domain of unknown function (DUF1972); InterPro: IPR015393 This domain is functionally uncharacterised and found in bacterial glycosyltransferases and rhamnosyltransferases.
Probab=62.01 E-value=6.1 Score=23.36 Aligned_cols=17 Identities=24% Similarity=0.157 Sum_probs=16.0
Q ss_pred ceehHHHHHHHHHhhCC
Q 036651 2 LYLILFAILMYLEEKNP 18 (60)
Q Consensus 2 ~l~ES~aI~~yL~~~~p 18 (60)
+|.||..|-+|+.++|+
T Consensus 157 lIaDs~~I~~y~~~~y~ 173 (185)
T PF09314_consen 157 LIADSKGIQDYIKERYG 173 (185)
T ss_pred EEEcCHHHHHHHHHHcC
Confidence 68999999999999998
No 45
>cd08531 SAM_PNT-ERG_FLI-1 Sterile alpha motif (SAM)/Pointed domain of ERG (Ets related gene) and FLI-1 (Friend leukemia integration 1) transcription factors. SAM Pointed domain of ERG/FLI-1 subfamily of ETS transcriptional regulators is a putative protein-protein interaction domain. The ERG and FLI regulators are involved in endothelial cell differentiation, bone morphogenesis and neural crest development. They are proto-oncogenes implicated in cancer development such as myeloid leukemia, Ewing's sarcoma and erythroleukemia. Members of this subfamily are potential targets for cancer therapy.
Probab=60.29 E-value=14 Score=18.79 Aligned_cols=21 Identities=14% Similarity=-0.018 Sum_probs=15.8
Q ss_pred CcCHHH--HHHHHHHHHHHhccc
Q 036651 24 PSDLKR--KAINYQAAYLVSSSI 44 (60)
Q Consensus 24 p~~~~~--ra~~~~~~~~~~~~~ 44 (60)
|.||.. +.++.+|+.|+..++
T Consensus 2 P~DP~~Wt~~~V~~WL~Wa~~ef 24 (75)
T cd08531 2 PADPTLWTREHVRQWLEWAVKEY 24 (75)
T ss_pred CCChhhcCHHHHHHHHHHHHHHc
Confidence 555544 689999999998764
No 46
>cd08542 SAM_PNT-ETS-1 Sterile alpha motif (SAM)/Pointed domain of ETS-1. SAM Pointed domain of ETS-1 subfamily of ETS transcriptional activators is a protein-protein interaction domain. The ETS-1 activator is regulated by phosphorylation. It contains a docking site for the ERK2 MAP (Mitogen Activated Protein) kinase, while the ERK2 phosphorylation site is located in the N-terminal disordered region upstream of the SAM Pointed domain. Mutations of the kinase docking site residues inhibit phosphorylation. ETS-1 activators play role in a number of different physiological processes, and they are expressed during embryonic development, including blood vessel formation, hematopoietic, lymphoid, neuronal and osteogenic differentiation. The Ets-1 gene is a proto-oncogene involved in progression of different tumors (including breast cancer, meningioma, and prostate cancer). Members of this subfamily are potential molecular targets for selective cancer therapy.
Probab=57.16 E-value=17 Score=19.09 Aligned_cols=23 Identities=9% Similarity=0.089 Sum_probs=18.6
Q ss_pred CCCcCHHH--HHHHHHHHHHHhccc
Q 036651 22 LLPSDLKR--KAINYQAAYLVSSSI 44 (60)
Q Consensus 22 l~p~~~~~--ra~~~~~~~~~~~~~ 44 (60)
-.|.||.. +.++..|+.|+..++
T Consensus 14 ~Ip~DP~~Wt~~~V~~WL~Wa~~ef 38 (88)
T cd08542 14 GIPKDPRQWTETHVRDWVMWAVNEF 38 (88)
T ss_pred CCCCChhhCCHHHHHHHHHHHHHHc
Confidence 35778766 799999999998764
No 47
>cd08540 SAM_PNT-ERG Sterile alpha motif (SAM)/Pointed domain of ERG transcription factor. SAM Pointed domain of ERG subfamily of ETS transcriptional regulators is a putative protein-protein interaction domain. It may participate in formation of homodimers or heterodimers with ETS-2, Fli-1, ER81, and Pu-1. However, dimeric forms are inactive and SAM Pointed domain is not essential for dimerization, since ER81 and Pu-1 do not have it. In mouse, a regulator of this type binds the ESET histone H3-specific methyltransferase (human homolog is SETDB1), followed by modification of local chromatin structure through histone methylation. ERG regulators are involved in endothelial cell differentiation, bone morphogenesis and neural crest development. The Erg gene is a proto-oncogene. It is a target of chromosomal translocations resulting in fusions with new neighboring genes. Chimeric proteins were found in solid tumors such as myeloid leukemia or Ewing's sarcoma. Members of this subfamily are po
Probab=56.66 E-value=17 Score=18.51 Aligned_cols=21 Identities=14% Similarity=-0.064 Sum_probs=15.8
Q ss_pred CcCHHH--HHHHHHHHHHHhccc
Q 036651 24 PSDLKR--KAINYQAAYLVSSSI 44 (60)
Q Consensus 24 p~~~~~--ra~~~~~~~~~~~~~ 44 (60)
|.||.. +.++.+|+.|+..++
T Consensus 2 P~DP~~Wt~~~V~~WL~Wa~~ef 24 (75)
T cd08540 2 PADPTLWSTDHVRQWLEWAVKEY 24 (75)
T ss_pred CCChhhcCHHHHHHHHHHHHHHh
Confidence 455544 689999999998764
No 48
>cd08533 SAM_PNT-ETS-1,2 Sterile alpha motif (SAM)/Pointed domain of ETS-1,2 family. SAM Pointed domain of ETS-1,2 family of transcriptional activators is a protein-protein interaction domain. It carries a kinase docking site and mediates interaction between ETS transcriptional activators and protein kinases. This group of transcriptional factors is involved in the Ras/MAP kinase signaling pathway. MAP kinases phosphorylate the transcription factors. Phosphorylated factors then recruit coactivators and enhance transactivation. Members of this group play a role in regulation of different embryonic developmental processes. ETS-1,2 transcriptional activators are proto-oncogenes involved in malignant transformation and tumor progression. They are potential molecular targets for selective cancer therapy.
Probab=56.47 E-value=17 Score=18.26 Aligned_cols=15 Identities=7% Similarity=-0.130 Sum_probs=12.5
Q ss_pred HHHHHHHHHHHhccc
Q 036651 30 KAINYQAAYLVSSSI 44 (60)
Q Consensus 30 ra~~~~~~~~~~~~~ 44 (60)
+.++.+|+.|+..++
T Consensus 9 ~~~V~~WL~Wa~~ef 23 (71)
T cd08533 9 ETHVRQWLLWAVNEF 23 (71)
T ss_pred HHHHHHHHHHHHHHc
Confidence 578999999998764
No 49
>TIGR03412 iscX_yfhJ FeS assembly protein IscX. Members of this protein family are YfhJ, a protein of the ISC system for iron-sulfur cluster assembly. Other genes in the system include iscSUA, hscBA, and fdx.
Probab=55.07 E-value=15 Score=18.24 Aligned_cols=17 Identities=24% Similarity=0.163 Sum_probs=14.6
Q ss_pred ehHHHHHHHHHhhCCCC
Q 036651 4 LILFAILMYLEEKNPQH 20 (60)
Q Consensus 4 ~ES~aI~~yL~~~~p~~ 20 (60)
++|..|+.-|.++||+.
T Consensus 3 ~D~~eIA~~L~e~~pd~ 19 (63)
T TIGR03412 3 TDSQEIAIALAEAHPDV 19 (63)
T ss_pred cCHHHHHHHHHHHCCCC
Confidence 57888999999999874
No 50
>cd03187 GST_C_Phi GST_C family, Class Phi subfamily; composed of plant-specific class Phi GSTs and related fungal and bacterial proteins. GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins, and products of oxidative stress. The GST fold contains an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. GSH binds to the N-terminal domain while the hydrophobic substrate occupies a pocket in the C-terminal domain. The class Phi GST subfamily has experience extensive gene duplication. The Arabidopsis and Oryza genomes contain 13 and 16 Tau GSTs, respectively. They are primarily responsible for herbicide detoxification together with class Tau GSTs, showing class specificity in substrate preference. Phi enzymes a
Probab=54.78 E-value=16 Score=18.72 Aligned_cols=20 Identities=25% Similarity=0.150 Sum_probs=15.1
Q ss_pred HHHHHHHHHHHHhccccccc
Q 036651 29 RKAINYQAAYLVSSSIQPLQ 48 (60)
Q Consensus 29 ~ra~~~~~~~~~~~~~~~~~ 48 (60)
+|+++.+|+.|..+.+++..
T Consensus 2 ~ra~~~~~l~~~~~~~~~~~ 21 (118)
T cd03187 2 ERAIVEQWLEVESHQFDPPA 21 (118)
T ss_pred chHHHHHHHHHHHhhcchhH
Confidence 46788888888887777654
No 51
>PRK10721 hypothetical protein; Provisional
Probab=50.86 E-value=19 Score=18.02 Aligned_cols=17 Identities=18% Similarity=-0.119 Sum_probs=14.7
Q ss_pred ehHHHHHHHHHhhCCCC
Q 036651 4 LILFAILMYLEEKNPQH 20 (60)
Q Consensus 4 ~ES~aI~~yL~~~~p~~ 20 (60)
++|..|+.-|.++||+.
T Consensus 6 ~D~~dIA~~L~e~~Pd~ 22 (66)
T PRK10721 6 TDSREIGEALYDAYPDL 22 (66)
T ss_pred cCHHHHHHHHHHHCCCC
Confidence 57888999999999974
No 52
>cd08534 SAM_PNT-GABP-alpha Sterile alpha motif (SAM)/Pointed domain of GA-binding protein alpha chain. SAM Pointed domain of GABP-alpha subfamily of ETS transcriptional regulators is a putative protein-protein interaction domain. This type of transcriptional regulators forms heterotetramers containing two alpha and two beta subunits. It interacts with GA repeats (purine rich repeats). GABP transcriptional factors control gene expression in cell cycle control, apoptosis, and cellular respiration. GABP participates in regulation of transmembrane receptors and key hormones especially in myeloid cells and at the neuromuscular junction.
Probab=49.54 E-value=28 Score=18.27 Aligned_cols=23 Identities=9% Similarity=0.016 Sum_probs=18.6
Q ss_pred CCCcCHHH--HHHHHHHHHHHhccc
Q 036651 22 LLPSDLKR--KAINYQAAYLVSSSI 44 (60)
Q Consensus 22 l~p~~~~~--ra~~~~~~~~~~~~~ 44 (60)
-.|.||.. +.++..|+.|+..++
T Consensus 14 ~IP~DP~~Wt~~~V~~WL~Wa~~ef 38 (89)
T cd08534 14 KIPYDPMEWTEDQVLHWVVWAVKEF 38 (89)
T ss_pred CCCCChHHcCHHHHHHHHHHHHHHc
Confidence 36778766 789999999998764
No 53
>cd08541 SAM_PNT-FLI-1 Sterile alpha motif (SAM)/Pointed domain of friend leukemia integration 1 transcription activator. SAM Pointed domain of FLI-1 (Friend Leukemia Integration) subfamily of ETS transcriptional regulators is a putative protein-protein interaction domain. The FLI-1 protein participates in regulation of cellular differentiation, proliferation, and survival. The Fli-1 gene was initially described in Friend virus-induced erythroleukemias as a site for virus integration. It is highly expressed in hematopoietic tissues and at lower level in lungs, heart, and ovaries. Fli-1 is a proto-oncogene implicated in Ewing's sarcoma and erythroleukemia. Members of this subfamily are potential targets for cancer therapy.
Probab=49.48 E-value=27 Score=18.50 Aligned_cols=23 Identities=13% Similarity=0.021 Sum_probs=18.5
Q ss_pred CCCcCHHH--HHHHHHHHHHHhccc
Q 036651 22 LLPSDLKR--KAINYQAAYLVSSSI 44 (60)
Q Consensus 22 l~p~~~~~--ra~~~~~~~~~~~~~ 44 (60)
..|.||.. +.++.+|+.|+..++
T Consensus 12 ~IP~DP~~Wt~~hV~~WL~Wa~~ef 36 (91)
T cd08541 12 IVPADPTLWTQEHVRQWLEWAIKEY 36 (91)
T ss_pred eCCCChhhcCHHHHHHHHHHHHHHc
Confidence 35778765 789999999999774
No 54
>cd08532 SAM_PNT-PDEF-like Sterile alpha motif (SAM)/Pointed domain of prostate-derived ETS factor. SAM Pointed domain of PDEF-like (Prostate-Derived ETS Factor) subfamily of ETS transcriptional regulators is a putative protein-protein interaction domain. In human males this activator is highly expressed in the prostate gland and enhances androgen-mediated activation of the PSA promoter though interaction with the DNA binding domain of androgen receptor. PDEF may play a role in prostate cancer development as well as in goblet cell formation and mucus production in the epithelial lining of respiratory and intestinal tracts.
Probab=48.05 E-value=27 Score=17.74 Aligned_cols=22 Identities=9% Similarity=-0.058 Sum_probs=16.9
Q ss_pred CCcCHHH--HHHHHHHHHHHhccc
Q 036651 23 LPSDLKR--KAINYQAAYLVSSSI 44 (60)
Q Consensus 23 ~p~~~~~--ra~~~~~~~~~~~~~ 44 (60)
.|.||.. +.++..|+.|+..++
T Consensus 6 ip~DP~~Ws~~~V~~WL~w~~~ef 29 (76)
T cd08532 6 ISPDPYQWSPANVQKWLLWTEHQY 29 (76)
T ss_pred CCCChhhcCHHHHHHHHHHHHHHh
Confidence 4666655 789999999998764
No 55
>cd03177 GST_C_Delta_Epsilon GST_C family, Class Delta and Epsilon subfamily; GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins and products of oxidative stress. GSTs also show GSH peroxidase activity and are involved in the synthesis of prostaglandins and leukotrienes. The GST fold contains an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. GSH binds to the N-terminal domain while the hydrophobic substrate occupies a pocket in the C-terminal domain. The class Delta and Epsilon subfamily is made up primarily of insect GSTs, which play major roles in insecticide resistance by facilitating reductive dehydrochlorination of insecticides or conjugating them with GSH to produce water-soluble metabolites th
Probab=46.73 E-value=18 Score=18.82 Aligned_cols=20 Identities=20% Similarity=0.122 Sum_probs=14.4
Q ss_pred HHHHHHHHHHHHhccccccc
Q 036651 29 RKAINYQAAYLVSSSIQPLQ 48 (60)
Q Consensus 29 ~ra~~~~~~~~~~~~~~~~~ 48 (60)
+++++.+|+.|..+.+++..
T Consensus 2 ~~a~~~~wl~~~~~~~~~~~ 21 (118)
T cd03177 2 KRAIVNQRLHFDSGTLYQRL 21 (118)
T ss_pred hHHHHHHHHHhhhchHHHHH
Confidence 46788888888877776543
No 56
>cd08543 SAM_PNT-ETS-2 Sterile alpha motif (SAM)/Pointed domain of ETS-2. SAM Pointed domain of ETS-2 subfamily of ETS transcriptional regulators is a protein-protein interaction domain. It contains a docking site for Cdk10 (cyclin-dependent kinase 10), a member of the Cdc2 kinase family. The interaction between ETS-2 and Cdk10 kinase inhibits ETS-2 transactivation activity in mammals. ETS-2 is also regulated by ERK2 MAP kinase. ETS-2, which is phosphorylated by ERK2, can interact with coactivators and enhance transactivation. ETS-2 transcriptional activators are involved in embryonic development and cell cycle control. The Ets-2 gene is a proto-oncogene. It is overexpressed in breast and prostate cancer cells and its overexpression is necessary for transformation of such cells. Members of ETS-2 subfamily are potential molecular targets for selective cancer therapy.
Probab=45.74 E-value=33 Score=18.08 Aligned_cols=23 Identities=9% Similarity=-0.032 Sum_probs=18.1
Q ss_pred CCCcCHHH--HHHHHHHHHHHhccc
Q 036651 22 LLPSDLKR--KAINYQAAYLVSSSI 44 (60)
Q Consensus 22 l~p~~~~~--ra~~~~~~~~~~~~~ 44 (60)
-.|.||.. +.++..|+.|+..++
T Consensus 14 ~Ip~DP~~Wt~~~V~~WL~Wa~~ef 38 (89)
T cd08543 14 GIPKNPWLWTEQQVCQWLLWATNEF 38 (89)
T ss_pred CCCCChhhCCHHHHHHHHHHHHHHc
Confidence 35777765 789999999998764
No 57
>COG2975 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=45.24 E-value=26 Score=17.27 Aligned_cols=18 Identities=17% Similarity=0.032 Sum_probs=15.0
Q ss_pred eehHHHHHHHHHhhCCCC
Q 036651 3 YLILFAILMYLEEKNPQH 20 (60)
Q Consensus 3 l~ES~aI~~yL~~~~p~~ 20 (60)
-+++.-|++-|.++||+-
T Consensus 3 WtD~~~Iae~Lyd~~pdv 20 (64)
T COG2975 3 WTDSQEIAEALYDAYPDV 20 (64)
T ss_pred cchHHHHHHHHHhcCCCC
Confidence 468899999999999763
No 58
>KOG0183 consensus 20S proteasome, regulatory subunit alpha type PSMA7/PRE6 [Posttranslational modification, protein turnover, chaperones]
Probab=43.35 E-value=76 Score=19.73 Aligned_cols=38 Identities=13% Similarity=0.050 Sum_probs=28.6
Q ss_pred HHHHHHHHHhhCCCCCCCCcCHHHHHHHHHHHHHHhcc
Q 036651 6 LFAILMYLEEKNPQHPLLPSDLKRKAINYQAAYLVSSS 43 (60)
Q Consensus 6 S~aI~~yL~~~~p~~~l~p~~~~~ra~~~~~~~~~~~~ 43 (60)
|..+.+||+..|.+......+..-+.-++.+++...+.
T Consensus 166 sk~VrEflEK~y~e~~~~~~~~~ikL~ir~LleVvqs~ 203 (249)
T KOG0183|consen 166 SKTVREFLEKNYKEEAIATEGETIKLAIRALLEVVQSG 203 (249)
T ss_pred cHHHHHHHHHhcccccccccccHHHHHHHHHHHHhhcC
Confidence 57889999999988755455556667788888877754
No 59
>KOG4244 consensus Failed axon connections (fax) protein/glutathione S-transferase-like protein [Signal transduction mechanisms]
Probab=40.89 E-value=91 Score=19.92 Aligned_cols=19 Identities=11% Similarity=-0.317 Sum_probs=16.0
Q ss_pred eehHHHHHHHHHhhCCCCC
Q 036651 3 YLILFAILMYLEEKNPQHP 21 (60)
Q Consensus 3 l~ES~aI~~yL~~~~p~~~ 21 (60)
+.||.-|...|.+++.-+.
T Consensus 104 iaDS~~I~~~L~~hf~~~~ 122 (281)
T KOG4244|consen 104 IADSDLIEDRLRKHFKIPD 122 (281)
T ss_pred ccccHHHHHHHHHHcCCCC
Confidence 6789999999999886554
No 60
>PF09098 Dehyd-heme_bind: Quinohemoprotein amine dehydrogenase A, alpha subunit, haem binding; InterPro: IPR015182 Quinohemoprotein amine dehydrogenases (QHNDH) 1.4.99 from EC) are enzymes produced in the periplasmic space of certain Gram-negative bacteria, such as Paracoccus denitrificans and Pseudomonas putida, in response to primary amines, including n-butylamine and benzylamine. QHNDH catalyses the oxidative deamination of a wide range of aliphatic and aromatic amines through formation of a Schiff-base intermediate involving one of the quinone O atoms []. Catalysis requires the presence of a novel redox cofactor, cysteine tryptophylquinone (CTQ). CTQ is derived from the post-translational modification of specific residues, which involves the oxidation of the indole ring of a tryptophan residue to form tryptophylquinone, followed by covalent cross-linking with a cysteine residue []. There is one CTQ per subunit in QHNDH. In addition to CTQ, two haem c cofactors are present in QHNDH that mediate the transfer of the substrate-derived electrons from CTQ to an external electron acceptor, cytochrome c-550 [, ]. QHNDH is a heterotrimer of alpha, beta and gamma subunits. The alpha and beta subunits contain signal peptides necessary for the translocation of QHNDH to the periplasm. The alpha subunit is composed of four domains - domain 1 forming a dihaem cytochrome, and domains 2-4 forming antiparallel beta-barrel structures; the beta subunit is a 7-bladed beta-propeller that provides part of the active site; and the small, catalytic gamma subunit contains the novel cross-linked CTQ cofactor, in addition to additional thioester cross-links between Cys and Asp/Glu residues that encage CTQ. The gamma subunit assumes a globular secondary structure with two short alpha-helices having many turns and bends []. This entry represents the dihaem cytochrome c domain of the QHNDH alpha subunit. The domain contain two cysteine residues that are involved in thioether linkages to haem []. ; PDB: 1PBY_A 1JJU_A 1JMZ_A 1JMX_A.
Probab=39.45 E-value=29 Score=20.47 Aligned_cols=14 Identities=21% Similarity=0.275 Sum_probs=11.4
Q ss_pred hHHHHHHHHHhhCC
Q 036651 5 ILFAILMYLEEKNP 18 (60)
Q Consensus 5 ES~aI~~yL~~~~p 18 (60)
|-.+|+.||.+.++
T Consensus 55 er~avVkYLAd~~G 68 (167)
T PF09098_consen 55 ERRAVVKYLADTQG 68 (167)
T ss_dssp HHHHHHHHHHHHT-
T ss_pred HHHHHHHHHHHccC
Confidence 45899999999986
No 61
>PF04384 Fe-S_assembly: Iron-sulphur cluster assembly; InterPro: IPR007479 Iron-sulphur (FeS) clusters are important cofactors for numerous proteins involved in electron transfer, in redox and non-redox catalysis, in gene regulation, and as sensors of oxygen and iron. These functions depend on the various FeS cluster prosthetic groups, the most common being [2Fe-2S] and [4Fe-4S] []. FeS cluster assembly is a complex process involving the mobilisation of Fe and S atoms from storage sources, their assembly into [Fe-S] form, their transport to specific cellular locations, and their transfer to recipient apoproteins. So far, three FeS assembly machineries have been identified, which are capable of synthesising all types of [Fe-S] clusters: ISC (iron-sulphur cluster), SUF (sulphur assimilation), and NIF (nitrogen fixation) systems. The ISC system is conserved in eubacteria and eukaryotes (mitochondria), and has broad specificity, targeting general FeS proteins [, ]. It is encoded by the isc operon (iscRSUA-hscBA-fdx-iscX). IscS is a cysteine desulphurase, which obtains S from cysteine (converting it to alanine) and serves as a S donor for FeS cluster assembly. IscU and IscA act as scaffolds to accept S and Fe atoms, assembling clusters and transfering them to recipient apoproteins. HscA is a molecular chaperone and HscB is a co-chaperone. Fdx is a [2Fe-2S]-type ferredoxin. IscR is a transcription factor that regulates expression of the isc operon. IscX (also known as YfhJ) appears to interact with IscS and may function as an Fe donor during cluster assembly []. The SUF system is an alternative pathway to the ISC system that operates under iron starvation and oxidative stress. It is found in eubacteria, archaea and eukaryotes (plastids). The SUF system is encoded by the suf operon (sufABCDSE), and the six encoded proteins are arranged into two complexes (SufSE and SufBCD) and one protein (SufA). SufS is a pyridoxal-phosphate (PLP) protein displaying cysteine desulphurase activity. SufE acts as a scaffold protein that accepts S from SufS and donates it to SufA []. SufC is an ATPase with an unorthodox ATP-binding cassette (ABC)-like component. No specific functions have been assigned to SufB and SufD. SufA is homologous to IscA [], acting as a scaffold protein in which Fe and S atoms are assembled into [FeS] cluster forms, which can then easily be transferred to apoproteins targets. In the NIF system, NifS and NifU are required for the formation of metalloclusters of nitrogenase in Azotobacter vinelandii, and other organisms, as well as in the maturation of other FeS proteins. Nitrogenase catalyses the fixation of nitrogen. It contains a complex cluster, the FeMo cofactor, which contains molybdenum, Fe and S. NifS is a cysteine desulphurase. NifU binds one Fe atom at its N-terminal, assembling an FeS cluster that is transferred to nitrogenase apoproteins []. Nif proteins involved in the formation of FeS clusters can also be found in organisms that do not fix nitrogen []. This entry represents IscX proteins (also known as hypothetical protein YfhJ) that are part of the ISC system. IscX is active as a monomer. The structure of YfhJ is an orthogonal alpha-bundle []. YfhJ is a small acidic protein that binds IscS, and contains a modified winged helix motif that is usually found in DNA-binding proteins []. YfhJ/IscX can bind Fe, and may function as an Fe donor in the assembly of FeS clusters ; GO: 0016226 iron-sulfur cluster assembly; PDB: 2BZT_A 1UJ8_A.
Probab=39.05 E-value=36 Score=16.91 Aligned_cols=17 Identities=29% Similarity=0.094 Sum_probs=13.7
Q ss_pred ehHHHHHHHHHhhCCCC
Q 036651 4 LILFAILMYLEEKNPQH 20 (60)
Q Consensus 4 ~ES~aI~~yL~~~~p~~ 20 (60)
+|+.-|+.-|.+.||+-
T Consensus 4 ~D~~eIA~~L~e~~pd~ 20 (64)
T PF04384_consen 4 TDSEEIAIELYEKYPDV 20 (64)
T ss_dssp T-HHHHHHHHHHHSTTS
T ss_pred cCHHHHHHHHHHHCCCC
Confidence 57788999999999973
No 62
>cd03183 GST_C_Theta GST_C family, Class Theta subfamily; composed of eukaryotic class Theta GSTs and bacterial dichloromethane (DCM) dehalogenase. GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins and products of oxidative stress. The GST fold contains an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. GSH binds to the N-terminal domain while the hydrophobic substrate occupies a pocket in the C-terminal domain. Mammalian class Theta GSTs show poor GSH conjugating activity towards the standard substrates, CDNB and ethacrynic acid, differentiating them from other mammalian GSTs. GSTT1-1 shows similar cataytic activity as bacterial DCM dehalogenase, catalyzing the GSH-dependent hydrolytic dehalogenatio
Probab=38.60 E-value=11 Score=19.81 Aligned_cols=18 Identities=6% Similarity=-0.156 Sum_probs=14.0
Q ss_pred HHHHHHHHHHHhcccccc
Q 036651 30 KAINYQAAYLVSSSIQPL 47 (60)
Q Consensus 30 ra~~~~~~~~~~~~~~~~ 47 (60)
++++++|+.|..+++++.
T Consensus 2 ra~~~~wl~~~~~~~~~~ 19 (126)
T cd03183 2 RARVDEYLAWQHTNLRLG 19 (126)
T ss_pred cccHHHHHHHHHhhhHhh
Confidence 567888998888887754
No 63
>smart00251 SAM_PNT SAM / Pointed domain. A subfamily of the SAM domain
Probab=36.69 E-value=58 Score=16.63 Aligned_cols=22 Identities=9% Similarity=0.057 Sum_probs=17.0
Q ss_pred CCcCHHH--HHHHHHHHHHHhccc
Q 036651 23 LPSDLKR--KAINYQAAYLVSSSI 44 (60)
Q Consensus 23 ~p~~~~~--ra~~~~~~~~~~~~~ 44 (60)
.|.||.. +.++..|+.|+...+
T Consensus 13 ip~dP~~Wt~~~V~~Wl~w~~~ef 36 (82)
T smart00251 13 IPADPQLWTEDHVLEWLEWAVKEF 36 (82)
T ss_pred CCCChhhCCHHHHHHHHHHHHHhc
Confidence 4666655 689999999998764
No 64
>cd03203 GST_C_Lambda GST_C family, Class Lambda subfamily; composed of plant-specific class Lambda GSTs. GSTs are cytosolic, usually dimeric, proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins, and products of oxidative stress. The GST fold contains an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. GSH binds to the N-terminal domain while the hydrophobic substrate occupies a pocket in the C-terminal domain. The class Lambda subfamily was recently discovered, together with dehydroascorbate reductases (DHARs), as two outlying groups of the GST superfamily in Arabidopsis thaliana, which contain conserved active site cysteines. Characterization of recombinant A. thaliana proteins show that Lambda class GSTs are monomeric, similar
Probab=35.73 E-value=45 Score=17.62 Aligned_cols=14 Identities=7% Similarity=0.059 Sum_probs=11.5
Q ss_pred CHHHHHHHHHHHHH
Q 036651 26 DLKRKAINYQAAYL 39 (60)
Q Consensus 26 ~~~~ra~~~~~~~~ 39 (60)
|+.+|+.+.+++.|
T Consensus 1 d~~~ra~~~~~~~~ 14 (120)
T cd03203 1 DPAKREFADELLAY 14 (120)
T ss_pred CHHHHHHHHHHHHH
Confidence 56788999999877
No 65
>COG3109 ProQ Activator of osmoprotectant transporter ProP [Signal transduction mechanisms]
Probab=34.52 E-value=36 Score=20.45 Aligned_cols=18 Identities=22% Similarity=0.255 Sum_probs=15.1
Q ss_pred eehHHHHHHHHHhhCCCC
Q 036651 3 YLILFAILMYLEEKNPQH 20 (60)
Q Consensus 3 l~ES~aI~~yL~~~~p~~ 20 (60)
|.-|..++-||.++||-.
T Consensus 7 l~~~kEvIa~Lae~FPlc 24 (208)
T COG3109 7 LNNSKEVIAYLAERFPLC 24 (208)
T ss_pred ccchHHHHHHHHHhCccc
Confidence 566889999999999853
No 66
>COG4669 EscJ Type III secretory pathway, lipoprotein EscJ [Intracellular trafficking and secretion]
Probab=34.35 E-value=83 Score=19.71 Aligned_cols=24 Identities=29% Similarity=0.225 Sum_probs=17.2
Q ss_pred HHhhCCCCCCCCcCHHHHHHHHHH
Q 036651 13 LEEKNPQHPLLPSDLKRKAINYQA 36 (60)
Q Consensus 13 L~~~~p~~~l~p~~~~~ra~~~~~ 36 (60)
|.+.||..++.++..+++++....
T Consensus 86 l~d~Fp~dgLVsSP~eEkaR~~~~ 109 (246)
T COG4669 86 LGDIFPKDGLVSSPTEEKARLNYA 109 (246)
T ss_pred HHHhCCcccccCCcHHHHHHHHHH
Confidence 556677777777777888877654
No 67
>KOG2903 consensus Predicted glutathione S-transferase [Posttranslational modification, protein turnover, chaperones]
Probab=33.52 E-value=55 Score=21.02 Aligned_cols=39 Identities=21% Similarity=0.106 Sum_probs=27.5
Q ss_pred ehHHHHHHHHHhhC---------CCCCCCCcCHHHHHHHHHHHHHHhccc
Q 036651 4 LILFAILMYLEEKN---------PQHPLLPSDLKRKAINYQAAYLVSSSI 44 (60)
Q Consensus 4 ~ES~aI~~yL~~~~---------p~~~l~p~~~~~ra~~~~~~~~~~~~~ 44 (60)
.||..|++.+...| +.-.|.|+ ..++++..+.+|+...+
T Consensus 138 NES~eIIr~fNs~f~ef~~~~e~~~lDL~P~--~L~~~Ide~N~wvy~~I 185 (319)
T KOG2903|consen 138 NESSEIIRMFNSAFDEFNGIAENPVLDLYPS--SLRAQIDETNSWVYDKI 185 (319)
T ss_pred CchHHHHHHHhhhhhhhhccccCCccccCCH--HHHHHHhhhhceecccc
Confidence 58999999988333 33345554 56778899998887665
No 68
>PRK04950 ProP expression regulator; Provisional
Probab=28.02 E-value=51 Score=20.16 Aligned_cols=17 Identities=24% Similarity=0.348 Sum_probs=15.0
Q ss_pred eehHHHHHHHHHhhCCC
Q 036651 3 YLILFAILMYLEEKNPQ 19 (60)
Q Consensus 3 l~ES~aI~~yL~~~~p~ 19 (60)
+..+..|+.||.++||.
T Consensus 7 l~~~keiia~L~e~fP~ 23 (213)
T PRK04950 7 LTSSKEVIAYLAERFPL 23 (213)
T ss_pred cCCHHHHHHHHHHhChh
Confidence 56789999999999986
No 69
>PF00538 Linker_histone: linker histone H1 and H5 family; InterPro: IPR005818 Histone proteins have central roles in both chromatin organisation (as structural units of the nucleosome) and gene regulation (as dynamic components that have a direct impact on DNA transcription and replication). Eukaryotic DNA wraps around a histone octamer to form a nucleosome, the first order of compaction of eukaryotic chromatin. The core histone octamer is composed of a central H3-H4 tetramer and two flanking H2A-H2B dimers. Each of the core histone contains a common structural motif, called the histone fold, which facilitates the interactions between the individual core histones. In addition to the core histones, there is a "linker histone" called H1 (or H5 in avian species). The linker histones present in all multicellular eukaryotes are the most divergent group of histones, with numerous cell type- and stage-specific variant. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures. Histone H5 performs the same function as histone H1, and replaces H1 in certain cells. The structure of GH5, the globular domain of the linker histone H5 is known [, ]. The fold is similar to the DNA-binding domain of the catabolite gene activator protein, CAP, thus providing a possible model for the binding of GH5 to DNA. The linker histones, which do not contain the histone fold motif, are critical to the higher-order compaction of chromatin, because they bind to internucleosomal DNA and facilitate interactions between individual nucleosomes. In addition, H1 variants have been shown to be involved in the regulation of developmental genes. A common feature of this protein family is a tripartite structure in which a globular (H15) domain of about 80 amino acids is flanked by two less structured N- and C-terminal tails. The H15 domain is also characterised by high sequence homology among the family of linker histones. The highly conserved H15 domain is essential for the binding of H1 or H5 to the nucleosome. It consists of a three helix bundle (I-III), with a beta-hairpin at the C terminus. There is also a short three-residue stretch between helices I and II that is in the beta-strand conformation. Together with the C-terminal beta-hairpin, this strand forms the third strand of an antiparallel beta-sheet [, , , ]. Proteins known to contain a H15 domain are: - Eukaryotic histone H1. The histones H1 constitute a family with many variants, differing in their affinity for chromatin. Several variants are simultaneously present in a single cell. For example, the nucleated erythrocytes of birds contain both H1 and H5, the latter being an extreme variant of H1. - Eukaryotic MHYST family of histone acetyltransferase. Histone acetyltransferases transfer an acetyl group from acetyl-CoA to the epsylon- amino group of lysine within the basic NH2-termini of histones, which bind the acidic phosphates of DNA []. This entry represents the H15 domain.; GO: 0003677 DNA binding, 0006334 nucleosome assembly, 0000786 nucleosome, 0005634 nucleus; PDB: 2LSO_A 2RQP_A 1UHM_A 1UST_A 1GHC_A 1HST_A 1YQA_A 1USS_A.
Probab=27.49 E-value=60 Score=16.03 Aligned_cols=13 Identities=38% Similarity=0.445 Sum_probs=11.3
Q ss_pred HHHHHHHHHhhCC
Q 036651 6 LFAILMYLEEKNP 18 (60)
Q Consensus 6 S~aI~~yL~~~~p 18 (60)
..+|..||.+.|+
T Consensus 24 ~~aI~kyI~~~y~ 36 (77)
T PF00538_consen 24 LQAIKKYIKAKYK 36 (77)
T ss_dssp HHHHHHHHHHHSS
T ss_pred HHHHHHHHHHhcC
Confidence 3789999999996
No 70
>PF09664 DUF2399: Protein of unknown function C-terminus (DUF2399); InterPro: IPR024465 This domain is found in archaeal, bacterial and eukaryotic proteins. Its function is unknown.
Probab=26.52 E-value=1.1e+02 Score=17.44 Aligned_cols=41 Identities=12% Similarity=0.015 Sum_probs=23.6
Q ss_pred CceehHHHHHHHHHhhCC--CCCCCCcCHHHHHHHHHHHHHHh
Q 036651 1 MLYLILFAILMYLEEKNP--QHPLLPSDLKRKAINYQAAYLVS 41 (60)
Q Consensus 1 ~~l~ES~aI~~yL~~~~p--~~~l~p~~~~~ra~~~~~~~~~~ 41 (60)
+.|.|..++.+.+.++.+ .+++.=.+-.-+...+.++..+.
T Consensus 21 V~VvENp~Vf~~~~~~~~~~~~pLVCt~G~p~~A~~~LL~~L~ 63 (152)
T PF09664_consen 21 VYVVENPAVFSALADELGASCPPLVCTSGQPSAAARRLLDRLA 63 (152)
T ss_pred EEEEecHHHHHHHHHhcCCCCCeEEEcCCcHHHHHHHHHHHHH
Confidence 368899999999999843 33332222223334444555443
No 71
>PF09107 SelB-wing_3: Elongation factor SelB, winged helix ; InterPro: IPR015191 This entry represents a domain with a winged helix-type fold, which consists of a closed 3-helical bundle with a right-handed twist, and a small beta-sheet wing []. Different winged helix domains share a common structure, but can differ in sequence. This entry is designated "type 3". The winged helix motif is involved in both DNA and RNA binding. In the elongation factor SelB, the winged helix domains recognise RNA, allowing the complex to wrap around the small ribosomal subunit. In bacteria, the incorporation of the amino acid selenocysteine into proteins requires elongation factor SelB, which binds both transfer RNA (tRNA) and mRNA. SelB binds to an mRNA hairpin formed by the selenocysteine insertion sequence (SECIS) with extremely high specificity []. ; GO: 0003723 RNA binding, 0003746 translation elongation factor activity, 0005525 GTP binding, 0001514 selenocysteine incorporation, 0005737 cytoplasm; PDB: 2PJP_A 2UWM_A 1WSU_B 1LVA_A 2PLY_A.
Probab=25.15 E-value=71 Score=14.81 Aligned_cols=11 Identities=27% Similarity=0.637 Sum_probs=7.9
Q ss_pred HHHHHHHHHhh
Q 036651 6 LFAILMYLEEK 16 (60)
Q Consensus 6 S~aI~~yL~~~ 16 (60)
+.+|++|++..
T Consensus 27 ai~lLE~lD~~ 37 (50)
T PF09107_consen 27 AIPLLEYLDRE 37 (50)
T ss_dssp HHHHHHHHHHT
T ss_pred HHHHHHHHhcc
Confidence 36788888863
No 72
>KOG3631 consensus Alpha-parvin and related focal adhesion proteins [Cytoskeleton]
Probab=24.25 E-value=1.1e+02 Score=19.93 Aligned_cols=34 Identities=15% Similarity=0.090 Sum_probs=25.2
Q ss_pred cCHHHHHHHHHHHHHHhcccccccchHHHHhhhhcC
Q 036651 25 SDLKRKAINYQAAYLVSSSIQPLQSLPVLVVSHQDF 60 (60)
Q Consensus 25 ~~~~~ra~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 60 (60)
.|+.-.-.+..++.|++..+.+- +.|-+.+.+|+
T Consensus 84 ~DpK~~el~kvLi~WiN~~L~~e--rIvVr~LeEDl 117 (365)
T KOG3631|consen 84 KDPKFEELVKVLIDWINDVLVPE--RIVVRSLEEDL 117 (365)
T ss_pred cChhHHHHHHHHHHHHHHhhcch--hhhHHhhHHhh
Confidence 36766777888899999888773 36667777764
No 73
>COG2231 Uncharacterized protein related to Endonuclease III [DNA replication, recombination, and repair]
Probab=23.60 E-value=90 Score=19.17 Aligned_cols=23 Identities=22% Similarity=0.092 Sum_probs=18.5
Q ss_pred HHHHHHHHhhCCCCCCCCcCHHH
Q 036651 7 FAILMYLEEKNPQHPLLPSDLKR 29 (60)
Q Consensus 7 ~aI~~yL~~~~p~~~l~p~~~~~ 29 (60)
.-|...|.+.|+..++||.+...
T Consensus 9 ~~iy~~L~~~yg~q~WWp~~~~~ 31 (215)
T COG2231 9 TKIYKELLRLYGDQGWWPADNKD 31 (215)
T ss_pred HHHHHHHHHHcCCccCCCCCCch
Confidence 56788899999998999986533
No 74
>cd00073 H15 linker histone 1 and histone 5 domains; the basic subunit of chromatin is the nucleosome, consisting of an octamer of core histones, two full turns of DNA, a linker histone (H1 or H5) and a variable length of linker DNA; H1/H5 are chromatin-associated proteins that bind to the exterior of nucleosomes and dramatically stabilize the highly condensed states of chromatin fibers; stabilization of higher order folding occurs through electrostatic neutralization of the linker DNA segments, through a highly positively charged carboxy- terminal domain known as the AKP helix (Ala, Lys, Pro); thought to be involved in specific protein-protein and protein-DNA interactions and play a role in suppressing core histone tail domain acetylation in the chromatin fiber
Probab=23.46 E-value=77 Score=16.15 Aligned_cols=14 Identities=36% Similarity=0.349 Sum_probs=11.8
Q ss_pred HHHHHHHHhhCCCC
Q 036651 7 FAILMYLEEKNPQH 20 (60)
Q Consensus 7 ~aI~~yL~~~~p~~ 20 (60)
.+|..||.++|+..
T Consensus 27 ~aI~kyI~~~y~~~ 40 (88)
T cd00073 27 QAIKKYIEAKYKVD 40 (88)
T ss_pred HHHHHHHHHHCCcc
Confidence 68999999998753
No 75
>PF10264 Stork_head: Winged helix Storkhead-box1 domain; InterPro: IPR019391 In humans the Storkhead-box protein controls polyploidization of extravillus trophoblast and is implicated in pre-eclampsia []. This entry represents the conserved N-terminal winged-helix domain, which is likely to bind DNA.
Probab=23.33 E-value=77 Score=16.40 Aligned_cols=19 Identities=21% Similarity=0.284 Sum_probs=13.9
Q ss_pred HHHHHHHHhhCCCCCCCCcC
Q 036651 7 FAILMYLEEKNPQHPLLPSD 26 (60)
Q Consensus 7 ~aI~~yL~~~~p~~~l~p~~ 26 (60)
-+|.++|.++||+-. .|+.
T Consensus 33 E~l~~~L~~~yp~i~-~Ps~ 51 (80)
T PF10264_consen 33 ETLREHLRKHYPGIA-IPSQ 51 (80)
T ss_pred HHHHHHHHHhCCCCC-CCCH
Confidence 478999999998743 3553
No 76
>KOG0962 consensus DNA repair protein RAD50, ABC-type ATPase/SMC superfamily [Replication, recombination and repair]
Probab=22.80 E-value=1.3e+02 Score=23.36 Aligned_cols=25 Identities=16% Similarity=-0.032 Sum_probs=20.5
Q ss_pred eehHHHHHHHHHhhCCCCCCCCcCH
Q 036651 3 YLILFAILMYLEEKNPQHPLLPSDL 27 (60)
Q Consensus 3 l~ES~aI~~yL~~~~p~~~l~p~~~ 27 (60)
+..|.||+.|+..+|-+-..||-+.
T Consensus 142 lGVskAIl~~VIFcHQEdS~WPLsE 166 (1294)
T KOG0962|consen 142 LGVSKAILENVIFCHQEDSTWPLSE 166 (1294)
T ss_pred cCCcHHHHhhhheecccCCCCCCCC
Confidence 4568999999999998888889543
No 77
>PF02198 SAM_PNT: Sterile alpha motif (SAM)/Pointed domain; InterPro: IPR003118 Transcription factors are protein molecules that bind to specific DNA sequences in the genome, resulting in the induction or inhibition of gene transcription []. The ets oncogene is such a factor, possessing a region of 85-90 amino acids known as the ETS (erythroblast transformation specific) domain [, ]. This domain is rich in positively-charged and aromatic residues, and binds to purine-rich segments of DNA. The ETS domain IPR000418 from INTERPRO has been identified in other transcription factors such as PU.1, human erg, human elf-1, human elk-1, GA binding protein, and a number of others [, , ]. It is generally localized at the C terminus of the protein, with the exception of ELF-1, ELK-1, ELK-3, ELK-4 and ERF where it is found at the N terminus. This entry describes the highly conserved PNT (or Pointed) domain which is found within a subset of the ETs domain (IPR000418 from INTERPRO ), including mammalian Ets-1, Ets-2, Erg, Fli-1, GABPalpha, and Tel, as well as Drosophila Pnt-P2 and Yan. The PNT domain (IPR001660 from INTERPRO ) through a common tertiary arrangement of four alpha-helices. A role in protein-protein association has been established for the PNT domain [, ].; GO: 0043565 sequence-specific DNA binding, 0005634 nucleus; PDB: 1SXE_A 1SXD_A 2KMD_A 2JV3_A 2E8P_A 1SV4_B 1SV0_B 1LKY_F 1JI7_B 2DKX_A ....
Probab=22.37 E-value=82 Score=15.90 Aligned_cols=23 Identities=26% Similarity=0.146 Sum_probs=17.0
Q ss_pred CCCcCHHH--HHHHHHHHHHHhccc
Q 036651 22 LLPSDLKR--KAINYQAAYLVSSSI 44 (60)
Q Consensus 22 l~p~~~~~--ra~~~~~~~~~~~~~ 44 (60)
..|.||.. +.++..|+.|+..++
T Consensus 12 ~~p~DP~~Wt~~~V~~Wl~w~~~~f 36 (84)
T PF02198_consen 12 WLPKDPRLWTKEDVLQWLRWVVREF 36 (84)
T ss_dssp TSCSSGGG--HHHHHHHHHHHHHHT
T ss_pred CCCCChhhCCHHHHHHHHHHHHHhc
Confidence 45666644 688999999888764
No 78
>PF10189 DUF2356: Conserved protein (DUF2356); InterPro: IPR019333 The Integrator complex is involved in small nuclear RNA (snRNA) U1 and U2 transcription, and in their 3'-box-dependent processing. This complex associates with the C-terminal domain of RNA polymerase II largest subunit and is recruited to the U1 and U2 snRNAs genes []. This entry represents subunit 3 of this complex. The function of this subunit is unknown.
Probab=22.13 E-value=2.1e+02 Score=17.78 Aligned_cols=32 Identities=13% Similarity=0.094 Sum_probs=23.8
Q ss_pred HHHHHHHHhhC-CCCCCCCcCHHHHHHHHHHHH
Q 036651 7 FAILMYLEEKN-PQHPLLPSDLKRKAINYQAAY 38 (60)
Q Consensus 7 ~aI~~yL~~~~-p~~~l~p~~~~~ra~~~~~~~ 38 (60)
..|++|++..| |.....++|...|..+..|+-
T Consensus 79 ~DlIRfIc~~~hP~Nevl~S~i~pRwavIgwLl 111 (230)
T PF10189_consen 79 PDLIRFICCVYHPTNEVLQSDIIPRWAVIGWLL 111 (230)
T ss_pred HHHHHHHHHhcCChhcccccchhHHHHHHHHHH
Confidence 56889988765 445677888888888877753
No 79
>cd03190 GST_C_ECM4_like GST_C family, ECM4-like subfamily; composed of predominantly uncharacterized and taxonomically diverse proteins with similarity to the translation product of the Saccharomyces cerevisiae gene ECM4. ECM4, a gene of unknown function, is involved in cell surface biosynthesis and architecture. S. cerevisiae ECM4 mutants show increased amounts of the cell wall hexose, N-acetylglucosamine. More recently, global gene expression analysis shows that ECM4 is upregulated during genotoxic conditions and together with the expression profiles of 18 other genes could potentially differentiate between genotoxic and cytotoxic insults in yeast.
Probab=21.33 E-value=91 Score=16.96 Aligned_cols=21 Identities=5% Similarity=-0.127 Sum_probs=15.8
Q ss_pred HHHHHHHHHHHHHhccccccc
Q 036651 28 KRKAINYQAAYLVSSSIQPLQ 48 (60)
Q Consensus 28 ~~ra~~~~~~~~~~~~~~~~~ 48 (60)
..|+++.+|++|....+.+..
T Consensus 3 ~~~a~i~~~~~~~~~~~~~~~ 23 (142)
T cd03190 3 ELRSEIDELNEWIYDNINNGV 23 (142)
T ss_pred hHHHHHHHHHHHHHHHHhhHH
Confidence 457889999999987765543
No 80
>cd03184 GST_C_Omega GST_C family, Class Omega subfamily; GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins and products of oxidative stress. The GST fold contains an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. GSH binds to the N-terminal domain while the hydrophobic substrate occupies a pocket in the C-terminal domain. Class Omega GSTs show little or no GSH-conjugating activity towards standard GST substrates. Instead, they catalyze the GSH dependent reduction of protein disulfides, dehydroascorbate and monomethylarsonate, activities which are more characteristic of glutaredoxins. They contain a conserved cysteine equivalent to the first cysteine in the CXXC motif of glutaredoxins, which is a re
Probab=21.24 E-value=56 Score=17.14 Aligned_cols=20 Identities=15% Similarity=-0.012 Sum_probs=13.9
Q ss_pred HHHHHHHHHHHHhccccccc
Q 036651 29 RKAINYQAAYLVSSSIQPLQ 48 (60)
Q Consensus 29 ~ra~~~~~~~~~~~~~~~~~ 48 (60)
+|++.+.|+.++++.+.++.
T Consensus 2 ~ra~~r~~~~~~~~~~~~~~ 21 (124)
T cd03184 2 EKAQQKLLLERFSKVVSAFY 21 (124)
T ss_pred hHHHHHHHHHHHhhhhHHHH
Confidence 58888999998874444433
No 81
>KOG1757 consensus Histone 2A [Chromatin structure and dynamics]
Probab=20.86 E-value=72 Score=17.75 Aligned_cols=10 Identities=50% Similarity=0.524 Sum_probs=8.1
Q ss_pred HHHHHHHHHh
Q 036651 6 LFAILMYLEE 15 (60)
Q Consensus 6 S~aI~~yL~~ 15 (60)
+.||++||..
T Consensus 58 ~aaileYLTa 67 (131)
T KOG1757|consen 58 SAAILEYLTA 67 (131)
T ss_pred HHHHHHHHHH
Confidence 5789999864
No 82
>cd00059 FH Forkhead (FH), also known as a "winged helix". FH is named for the Drosophila fork head protein, a transcription factor which promotes terminal rather than segmental development. This family of transcription factor domains, which bind to B-DNA as monomers, are also found in the Hepatocyte nuclear factor (HNF) proteins, which provide tissue-specific gene regulation. The structure contains 2 flexible loops or "wings" in the C-terminal region, hence the term winged helix.
Probab=20.29 E-value=80 Score=15.98 Aligned_cols=13 Identities=15% Similarity=0.340 Sum_probs=10.7
Q ss_pred HHHHHHHHhhCCC
Q 036651 7 FAILMYLEEKNPQ 19 (60)
Q Consensus 7 ~aI~~yL~~~~p~ 19 (60)
..|.+|+.++||-
T Consensus 25 ~eIy~~I~~~~py 37 (78)
T cd00059 25 SEIYKWISDNFPY 37 (78)
T ss_pred HHHHHHHHHhCCc
Confidence 4699999999873
Done!