Query gi|254780594|ref|YP_003065007.1| Glutathione S-transferase domain protein [Candidatus Liberibacter asiaticus str. psy62] Match_columns 230 No_of_seqs 114 out of 6286 Neff 9.2 Searched_HMMs 39220 Date Sun May 29 21:52:45 2011 Command /home/congqian_1/programs/hhpred/hhsearch -i 254780594.hhm -d /home/congqian_1/database/cdd/Cdd.hhm No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM 1 PRK09481 sspA stringent starva 100.0 0 0 307.6 19.0 198 1-220 10-208 (211) 2 PRK13972 glutathione S-transfe 100.0 1.5E-44 0 275.1 19.0 198 1-216 1-207 (215) 3 PRK10357 predicted glutathione 100.0 9.9E-44 0 270.4 18.8 196 2-212 1-198 (202) 4 PRK10542 glutathionine S-trans 100.0 1.5E-43 0 269.3 19.4 194 2-215 1-198 (201) 5 TIGR01262 maiA maleylacetoacet 100.0 6.2E-43 0 265.8 13.6 202 3-218 1-216 (218) 6 COG0625 Gst Glutathione S-tran 100.0 2.9E-39 7.4E-44 244.7 17.3 194 2-209 1-199 (211) 7 PRK11752 putative glutathione 100.0 1.6E-37 4E-42 234.8 11.9 195 2-213 45-257 (264) 8 KOG0406 consensus 100.0 7.9E-36 2E-40 225.0 16.8 195 2-216 10-213 (231) 9 KOG0868 consensus 100.0 1.2E-34 3.1E-39 218.2 13.1 202 2-220 6-213 (217) 10 KOG0867 consensus 100.0 9.1E-34 2.3E-38 213.2 15.4 204 1-215 2-210 (226) 11 KOG4420 consensus 100.0 1.5E-27 3.8E-32 177.5 12.0 210 2-215 27-288 (325) 12 PTZ00057 glutathione s-transfe 99.9 7.7E-24 2E-28 156.2 17.5 189 2-216 5-207 (211) 13 PRK10387 glutaredoxin 2; Provi 99.9 4.6E-22 1.2E-26 146.0 12.7 190 2-206 1-205 (210) 14 cd03059 GST_N_SspA GST_N famil 99.9 5.6E-23 1.4E-27 151.2 5.8 73 2-75 1-73 (73) 15 KOG1695 consensus 99.9 6.5E-21 1.7E-25 139.4 13.3 194 2-215 4-201 (206) 16 cd03058 GST_N_Tau GST_N family 99.9 3.5E-22 9E-27 146.6 5.4 73 2-75 1-74 (74) 17 cd03055 GST_N_Omega GST_N fami 99.9 3.1E-22 7.9E-27 147.0 5.0 72 1-72 18-89 (89) 18 cd03052 GST_N_GDAP1 GST_N fami 99.8 2.5E-21 6.4E-26 141.7 5.1 70 2-72 1-73 (73) 19 cd03048 GST_N_Ure2p_like GST_N 99.8 3.4E-21 8.6E-26 141.0 5.1 75 1-76 1-80 (81) 20 cd03051 GST_N_GTT2_like GST_N 99.8 3.8E-21 9.7E-26 140.7 4.8 71 2-72 1-74 (74) 21 cd03049 GST_N_3 GST_N family, 99.8 5.7E-21 1.5E-25 139.7 5.3 71 2-72 1-73 (73) 22 cd03187 GST_C_Phi GST_C family 99.8 1.6E-19 4E-24 131.4 12.6 117 90-212 1-118 (118) 23 cd03053 GST_N_Phi GST_N family 99.8 6.7E-21 1.7E-25 139.3 5.3 72 2-74 2-76 (76) 24 cd03057 GST_N_Beta GST_N famil 99.8 1.5E-20 3.8E-25 137.3 5.7 74 2-76 1-77 (77) 25 cd03050 GST_N_Theta GST_N fami 99.8 2.2E-20 5.7E-25 136.3 5.8 73 2-75 1-76 (76) 26 cd03042 GST_N_Zeta GST_N famil 99.8 1.5E-20 3.8E-25 137.3 4.8 70 2-72 1-73 (73) 27 KOG1422 consensus 99.8 5.7E-19 1.5E-23 128.2 12.8 190 7-228 18-216 (221) 28 cd03045 GST_N_Delta_Epsilon GS 99.8 3.4E-20 8.5E-25 135.3 4.7 71 2-73 1-74 (74) 29 cd03056 GST_N_4 GST_N family, 99.8 4E-20 1E-24 134.8 5.0 70 2-72 1-73 (73) 30 cd03047 GST_N_2 GST_N family, 99.8 5.7E-20 1.4E-24 134.0 5.0 70 2-72 1-73 (73) 31 KOG3029 consensus 99.8 2.3E-18 5.9E-23 124.7 12.7 200 2-205 91-355 (370) 32 cd03060 GST_N_Omega_like GST_N 99.8 8.2E-20 2.1E-24 133.0 5.1 70 2-71 1-70 (71) 33 cd03046 GST_N_GTT1_like GST_N 99.8 2.3E-19 5.9E-24 130.5 5.4 73 2-76 1-76 (76) 34 cd03188 GST_C_Beta GST_C famil 99.8 4.6E-18 1.2E-22 123.0 11.5 113 91-212 2-114 (114) 35 cd03181 GST_C_EFB1gamma GST_C 99.8 5.3E-18 1.3E-22 122.7 11.8 115 91-215 1-117 (123) 36 cd03178 GST_C_Ure2p_like GST_C 99.8 7.6E-18 1.9E-22 121.7 12.4 113 91-213 1-113 (113) 37 cd03044 GST_N_EF1Bgamma GST_N 99.8 6.3E-19 1.6E-23 127.9 5.6 71 3-73 2-74 (75) 38 KOG4244 consensus 99.8 9.3E-18 2.4E-22 121.2 11.2 187 3-205 47-273 (281) 39 cd03180 GST_C_2 GST_C family, 99.7 2.3E-17 5.8E-22 119.0 10.4 110 90-208 1-110 (110) 40 cd03182 GST_C_GTT2_like GST_C 99.7 8.3E-17 2.1E-21 115.8 11.3 116 88-207 1-116 (117) 41 cd03038 GST_N_etherase_LigE GS 99.7 3.6E-18 9.2E-23 123.6 3.4 75 2-76 1-84 (84) 42 TIGR02182 GRXB Glutaredoxin, G 99.7 2.5E-16 6.3E-21 113.1 10.4 193 3-205 1-206 (212) 43 cd03177 GST_C_Delta_Epsilon GS 99.7 3.5E-16 8.9E-21 112.2 10.7 113 91-216 2-114 (118) 44 cd03183 GST_C_Theta GST_C fami 99.7 5.8E-16 1.5E-20 110.9 11.0 117 91-212 1-120 (126) 45 cd03179 GST_C_1 GST_C family, 99.7 1.7E-15 4.4E-20 108.2 11.8 104 91-203 2-105 (105) 46 cd00570 GST_N_family Glutathio 99.6 9.1E-16 2.3E-20 109.8 4.6 70 2-72 1-71 (71) 47 cd03206 GST_C_7 GST_C family, 99.6 5.3E-15 1.3E-19 105.4 8.3 100 95-208 1-100 (100) 48 cd03043 GST_N_1 GST_N family, 99.6 1.1E-15 2.8E-20 109.4 4.4 64 8-72 8-73 (73) 49 cd03061 GST_N_CLIC GST_N famil 99.6 3.5E-15 8.9E-20 106.5 5.3 69 8-77 20-88 (91) 50 cd03189 GST_C_GTT1_like GST_C 99.5 6.7E-14 1.7E-18 99.1 10.7 118 86-206 2-119 (119) 51 COG2999 GrxB Glutaredoxin 2 [P 99.5 1E-13 2.5E-18 98.1 10.8 195 2-208 1-207 (215) 52 pfam02798 GST_N Glutathione S- 99.5 7.5E-15 1.9E-19 104.6 4.6 71 2-73 1-72 (72) 53 cd03041 GST_N_2GST_N GST_N fam 99.5 1.3E-14 3.2E-19 103.3 5.4 75 1-75 1-77 (77) 54 cd03185 GST_C_Tau GST_C family 99.5 2.4E-13 6E-18 96.0 10.9 112 89-217 1-118 (126) 55 cd03207 GST_C_8 GST_C family, 99.5 6.1E-14 1.5E-18 99.3 7.7 75 137-213 27-101 (103) 56 cd03191 GST_C_Zeta GST_C famil 99.5 4.8E-13 1.2E-17 94.2 10.7 118 89-215 1-120 (121) 57 pfam00043 GST_C Glutathione S- 99.5 2.2E-13 5.6E-18 96.1 8.3 74 133-206 19-92 (92) 58 cd03037 GST_N_GRX2 GST_N famil 99.4 7.9E-14 2E-18 98.7 4.2 71 2-73 1-71 (71) 59 cd03080 GST_N_Metaxin_like GST 99.4 8.3E-14 2.1E-18 98.6 3.9 68 1-76 1-75 (75) 60 cd03040 GST_N_mPGES2 GST_N fam 99.4 2E-13 5.1E-18 96.4 3.8 71 2-74 2-75 (77) 61 COG0435 ECM4 Predicted glutath 99.4 5.2E-13 1.3E-17 94.0 5.3 194 2-213 52-285 (324) 62 cd03186 GST_C_SspA GST_N famil 99.4 6.2E-12 1.6E-16 87.8 10.1 107 89-212 1-107 (107) 63 cd03204 GST_C_GDAP1 GST_C fami 99.2 2.7E-11 6.8E-16 84.2 7.5 75 134-208 23-111 (111) 64 KOG2903 consensus 99.2 2.9E-11 7.3E-16 84.0 6.4 192 3-212 39-286 (319) 65 cd00299 GST_C_family Glutathio 99.2 2.3E-10 5.8E-15 78.8 9.3 97 96-202 2-100 (100) 66 cd03190 GST_C_ECM4_like GST_C 99.2 3.7E-10 9.6E-15 77.6 10.0 108 90-213 3-117 (142) 67 cd03209 GST_C_Mu GST_C family, 99.1 1.8E-09 4.6E-14 73.7 11.6 81 136-216 31-111 (121) 68 cd03184 GST_C_Omega GST_C fami 99.1 3.7E-10 9.5E-15 77.6 7.5 77 140-216 32-115 (124) 69 TIGR00862 O-ClC intracellular 99.1 4.4E-09 1.1E-13 71.4 11.8 184 8-213 17-219 (237) 70 cd03210 GST_C_Pi GST_C family, 99.0 6.5E-09 1.7E-13 70.5 11.9 79 138-216 33-114 (126) 71 KOG3028 consensus 99.0 2.9E-08 7.3E-13 66.8 15.0 187 9-205 16-234 (313) 72 cd03039 GST_N_Sigma_like GST_N 99.0 2.5E-10 6.3E-15 78.6 3.1 70 2-72 1-71 (72) 73 KOG3027 consensus 99.0 1.1E-07 2.7E-12 63.5 16.4 184 9-204 33-247 (257) 74 cd03076 GST_N_Pi GST_N family, 99.0 3.6E-10 9.1E-15 77.7 2.9 72 2-74 2-73 (73) 75 cd03077 GST_N_Alpha GST_N fami 98.9 1.3E-09 3.2E-14 74.6 3.3 74 2-76 2-77 (79) 76 cd03208 GST_C_Alpha GST_C fami 98.9 4.7E-08 1.2E-12 65.5 10.9 72 142-213 41-114 (137) 77 cd03196 GST_C_5 GST_C family, 98.8 2.8E-08 7.1E-13 66.8 9.1 110 88-211 3-114 (115) 78 cd03195 GST_C_4 GST_C family, 98.8 5.2E-08 1.3E-12 65.3 9.3 111 89-213 1-112 (114) 79 cd03054 GST_N_Metaxin GST_N fa 98.8 3.8E-09 9.7E-14 71.8 3.5 65 2-74 1-72 (72) 80 cd03194 GST_C_3 GST_C family, 98.7 1.7E-07 4.5E-12 62.3 10.7 113 89-213 1-113 (114) 81 cd03203 GST_C_Lambda GST_C fam 98.7 5.2E-08 1.3E-12 65.3 7.1 104 88-215 1-112 (120) 82 cd03201 GST_C_DHAR GST_C famil 98.6 1.1E-07 2.7E-12 63.5 7.1 71 143-213 33-109 (121) 83 cd03075 GST_N_Mu GST_N family, 98.6 6.3E-08 1.6E-12 64.8 4.3 73 2-75 1-82 (82) 84 cd03192 GST_C_Sigma_like GST_C 98.6 4.9E-07 1.3E-11 59.7 8.6 62 141-202 40-104 (104) 85 cd03205 GST_C_6 GST_C family, 98.5 1.6E-06 4E-11 56.8 9.5 67 133-202 30-98 (98) 86 cd03202 GST_C_etherase_LigE GS 98.5 2.1E-06 5.3E-11 56.1 9.7 66 139-204 57-123 (124) 87 cd03200 GST_C_JTV1 GST_C famil 98.4 1.3E-06 3.3E-11 57.3 8.2 96 68-205 1-96 (96) 88 cd03197 GST_C_mPGES2 GST_C fam 98.3 5.4E-06 1.4E-10 53.7 8.8 60 145-204 84-145 (149) 89 cd03212 GST_C_Metaxin1_3 GST_C 98.2 2.1E-05 5.4E-10 50.3 10.7 68 138-205 62-135 (137) 90 cd03193 GST_C_Metaxin GST_C fa 98.2 4.9E-06 1.3E-10 53.9 7.4 61 143-203 22-88 (88) 91 cd03198 GST_C_CLIC GST_C famil 98.2 1E-05 2.5E-10 52.2 7.6 57 157-213 62-123 (134) 92 cd02976 NrdH NrdH-redoxin (Nrd 98.1 3.4E-06 8.6E-11 54.9 4.0 63 1-64 1-64 (73) 93 cd03211 GST_C_Metaxin2 GST_C f 98.1 2.7E-05 7E-10 49.7 8.5 64 140-203 57-126 (126) 94 cd03078 GST_N_Metaxin1_like GS 98.0 6.3E-06 1.6E-10 53.3 3.7 58 9-74 15-72 (73) 95 TIGR02196 GlrX_YruB Glutaredox 97.9 8.3E-06 2.1E-10 52.6 3.6 63 1-63 1-68 (79) 96 cd03029 GRX_hybridPRX5 Glutare 97.8 2.8E-05 7E-10 49.6 4.2 69 2-71 3-71 (72) 97 COG0695 GrxC Glutaredoxin and 97.7 4.6E-05 1.2E-09 48.4 4.7 69 1-70 2-73 (80) 98 pfam00462 Glutaredoxin Glutare 97.6 7.5E-05 1.9E-09 47.1 3.6 58 2-60 1-59 (60) 99 PRK10638 glutaredoxin 3; Provi 97.4 0.00015 3.7E-09 45.5 3.8 69 2-72 4-73 (83) 100 TIGR02181 GRX_bact glutaredoxi 97.4 0.00018 4.7E-09 44.9 4.1 67 2-69 1-71 (82) 101 cd02066 GRX_family Glutaredoxi 97.4 0.00018 4.7E-09 44.9 3.8 66 2-68 2-68 (72) 102 TIGR02200 GlrX_actino Glutared 97.3 0.0001 2.6E-09 46.4 1.8 62 2-63 2-66 (78) 103 cd03027 GRX_DEP Glutaredoxin ( 97.2 0.00046 1.2E-08 42.6 4.4 64 2-66 3-67 (73) 104 cd03419 GRX_GRXh_1_2_like Glut 97.2 0.00067 1.7E-08 41.7 4.9 70 2-72 2-75 (82) 105 cd03418 GRX_GRXb_1_3_like Glut 97.1 0.00055 1.4E-08 42.2 3.7 67 2-69 2-70 (75) 106 PRK10853 hypothetical protein; 97.0 0.0016 4E-08 39.6 5.2 40 1-40 1-40 (118) 107 TIGR02190 GlrX-dom Glutaredoxi 96.9 0.00036 9.2E-09 43.2 1.4 69 3-72 11-79 (79) 108 cd03032 ArsC_Spx Arsenate Redu 96.8 0.003 7.7E-08 37.9 5.2 38 1-38 1-38 (115) 109 PRK12559 transcriptional regul 96.7 0.0029 7.4E-08 38.0 4.9 35 1-35 1-35 (131) 110 PRK10329 glutaredoxin-like pro 96.6 0.0024 6.1E-08 38.5 3.9 56 2-57 3-58 (81) 111 PRK13344 spxA transcriptional 96.5 0.0051 1.3E-07 36.6 5.1 35 1-35 1-35 (132) 112 PRK11200 grxA glutaredoxin 1; 96.4 0.0052 1.3E-07 36.6 4.6 74 2-76 3-84 (87) 113 KOG1147 consensus 96.4 0.003 7.6E-08 38.0 3.3 120 54-212 41-162 (712) 114 pfam10568 Tom37 Outer mitochon 96.4 0.0036 9.3E-08 37.5 3.5 57 9-72 13-71 (71) 115 COG1393 ArsC Arsenate reductas 96.3 0.0077 2E-07 35.6 4.8 37 1-37 2-38 (117) 116 PRK01655 spxA transcriptional 96.2 0.0038 9.8E-08 37.3 2.8 33 1-33 1-33 (131) 117 cd03033 ArsC_15kD Arsenate Red 96.2 0.0098 2.5E-07 35.0 4.8 37 2-38 2-38 (113) 118 cd03028 GRX_PICOT_like Glutare 95.7 0.011 2.8E-07 34.7 3.5 64 7-71 20-84 (90) 119 cd03079 GST_N_Metaxin2 GST_N f 95.6 0.01 2.6E-07 34.9 2.9 58 9-73 16-73 (74) 120 COG4545 Glutaredoxin-related p 95.6 0.012 3E-07 34.5 3.2 62 1-62 3-77 (85) 121 cd03199 GST_C_GRX2 GST_C famil 95.3 0.093 2.4E-06 29.4 7.0 64 140-206 60-123 (128) 122 pfam04399 Glutaredoxin2_C Glut 95.0 0.13 3.3E-06 28.6 7.0 66 140-208 59-124 (132) 123 cd03034 ArsC_ArsC Arsenate Red 95.0 0.053 1.4E-06 30.8 5.1 33 2-34 1-33 (112) 124 cd03035 ArsC_Yffb Arsenate Red 95.0 0.055 1.4E-06 30.7 5.0 33 2-34 1-33 (105) 125 TIGR02180 GRX_euk Glutaredoxin 94.9 0.053 1.3E-06 30.8 4.7 67 3-70 2-75 (85) 126 cd03036 ArsC_like Arsenate Red 94.6 0.086 2.2E-06 29.6 5.3 33 2-34 1-33 (111) 127 cd02977 ArsC_family Arsenate R 94.1 0.092 2.3E-06 29.4 4.5 33 2-34 1-33 (105) 128 PRK10026 arsenate reductase; P 94.1 0.057 1.5E-06 30.6 3.4 32 1-32 3-34 (141) 129 KOG1752 consensus 94.0 0.12 3.1E-06 28.7 4.9 68 3-71 17-88 (104) 130 TIGR01617 arsC_related conserv 93.8 0.1 2.6E-06 29.1 4.4 39 2-40 1-39 (122) 131 cd01659 TRX_superfamily Thiore 93.7 0.089 2.3E-06 29.5 3.8 57 2-58 1-62 (69) 132 PRK10824 hypothetical protein; 92.1 0.24 6.1E-06 27.0 4.1 64 6-70 26-90 (115) 133 TIGR02194 GlrX_NrdH Glutaredox 91.2 0.24 6.2E-06 27.0 3.4 62 2-63 1-62 (72) 134 pfam11801 Tom37_C Tom37 C-term 90.7 0.9 2.3E-05 23.7 5.9 38 145-182 112-150 (164) 135 pfam03960 ArsC ArsC family. Th 90.1 0.82 2.1E-05 24.0 5.3 30 5-34 1-30 (111) 136 TIGR00411 redox_disulf_1 redox 89.0 0.23 5.8E-06 27.2 1.8 57 1-58 2-62 (82) 137 PTZ00062 glutaredoxin; Provisi 86.5 0.67 1.7E-05 24.4 3.0 20 191-210 166-185 (219) 138 pfam11287 DUF3088 Protein of u 85.8 1.1 2.8E-05 23.3 3.7 69 9-77 22-108 (111) 139 TIGR00014 arsC arsenate reduct 78.0 2.4 6.2E-05 21.2 3.1 30 2-31 1-30 (114) 140 PRK12759 bifunctional gluaredo 74.3 5.1 0.00013 19.4 4.0 69 2-71 4-81 (410) 141 COG0278 Glutaredoxin-related p 74.2 3.1 8E-05 20.6 2.9 66 6-72 26-93 (105) 142 pfam05768 DUF836 Glutaredoxin- 72.0 5.2 0.00013 19.3 3.6 52 2-57 2-57 (76) 143 PHA02125 thioredoxin-like prot 64.8 3.4 8.6E-05 20.4 1.4 55 1-60 1-55 (75) 144 cd02973 TRX_GRX_like Thioredox 64.3 7.3 0.00019 18.5 3.0 53 2-57 3-60 (67) 145 TIGR02183 GRXA Glutaredoxin, G 62.3 9.9 0.00025 17.7 3.4 73 2-75 2-82 (86) 146 TIGR00365 TIGR00365 glutaredox 60.3 8.2 0.00021 18.2 2.7 66 6-72 23-92 (103) 147 cd03031 GRX_GRX_like Glutaredo 59.5 10 0.00026 17.6 3.1 66 3-69 3-79 (147) 148 KOG0911 consensus 54.9 15 0.00039 16.7 3.7 14 195-208 182-195 (227) 149 pfam09635 MetRS-N MetRS-N bind 51.4 6.7 0.00017 18.7 1.0 32 45-76 31-63 (122) 150 TIGR01246 dapE_proteo succinyl 50.6 10 0.00025 17.7 1.8 36 39-74 115-155 (383) 151 cd03020 DsbA_DsbC_DsbG DsbA fa 50.2 18 0.00047 16.2 3.8 19 4-22 83-101 (197) 152 cd03030 GRX_SH3BGR Glutaredoxi 49.2 19 0.00048 16.1 3.8 60 3-63 3-73 (92) 153 pfam04908 SH3BGR SH3-binding, 45.6 22 0.00055 15.8 3.7 47 13-60 20-69 (92) 154 TIGR01162 purE phosphoribosyla 45.5 22 0.00055 15.8 4.3 32 10-42 11-42 (159) 155 TIGR00679 hpr-ser HPr(Ser) kin 31.2 37 0.00095 14.4 2.6 47 31-79 90-137 (309) 156 cd04911 ACT_AKiii-YclM-BS_1 AC 31.0 37 0.00095 14.4 2.9 23 11-33 16-38 (76) 157 TIGR02661 MauD methylamine deh 30.8 38 0.00096 14.4 3.1 12 5-16 81-92 (189) 158 cd03026 AhpF_NTD_C TRX-GRX-lik 29.6 28 0.00073 15.1 1.4 67 3-72 17-88 (89) 159 cd02999 PDI_a_ERp44_like PDIa 28.4 34 0.00087 14.7 1.6 57 1-57 21-80 (100) 160 KOG3425 consensus 28.2 42 0.0011 14.1 2.4 66 7-72 42-120 (128) 161 COG2900 SlyX Uncharacterized p 27.7 41 0.0011 14.2 2.0 30 192-225 42-72 (72) 162 TIGR01967 DEAH_box_HrpA ATP-de 24.0 50 0.0013 13.7 6.9 56 148-204 1185-1241(1320) 163 cd02951 SoxW SoxW family; SoxW 24.0 35 0.00088 14.6 1.0 17 1-17 17-33 (125) 164 cd02960 AGR Anterior Gradient 23.7 38 0.00097 14.4 1.2 80 2-91 27-114 (130) 165 PRK11657 dsbG disulfide isomer 21.9 55 0.0014 13.4 3.3 16 19-34 31-46 (253) 166 TIGR02630 xylose_isom_A xylose 21.5 49 0.0012 13.8 1.4 13 57-69 106-118 (437) 167 TIGR01811 sdhA_Bsu succinate d 20.9 58 0.0015 13.3 2.5 29 69-100 292-320 (620) 168 PRK02119 hypothetical protein; 20.7 53 0.0014 13.5 1.4 28 194-225 45-73 (73) 169 cd02998 PDI_a_ERp38 PDIa famil 20.5 39 0.001 14.3 0.7 54 2-55 22-81 (105) No 1 >PRK09481 sspA stringent starvation protein A; Provisional Probab=100.00 E-value=0 Score=307.61 Aligned_cols=198 Identities=22% Similarity=0.308 Sum_probs=171.2 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHCCCCCCCHHHHCCHHHHCCCCHHHHHHHHHCCCCCC Q ss_conf 95213589986899999999986998299988867778899985568775201110001110121035677752887233 Q gi|254780594|r 1 MSILYHYPLSSSSRFIRLILSEYEFTTDMIEEFPWGKRREFLELNPSGTLPVYIDEHMQALCGVFVISEYLDETYGAIAQ 80 (230) Q Consensus 1 M~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~~~~~~~~~lnP~g~vP~L~~~dg~~l~eS~aI~~yL~~~~~~~~~ 80 (230) |++||+++.||||+|||++|.||||+|+.+.+++++++++|+++||.|+||+|++ ||.+|+||.+|++||+++||. T Consensus 10 mm~LY~~~~sp~s~rVri~L~EKgl~~e~~~vd~~~~~~~~l~iNP~GkVP~L~d-~~~~l~ES~aI~eYLde~~p~--- 85 (211) T PRK09481 10 VMTLFSGPTDIYSHQVRIVLAEKGVSVEIEQVEKDNLPQDLIDLNPYQSVPTLVD-RELTLYESRIIMEYLDERFPH--- 85 (211) T ss_pred EEEEECCCCCHHHHHHHHHHHHCCCCCEEEEECCCCCCHHHHHHCCCCCCCEEEE-CCEEEECCCHHHHHHHHHCCC--- T ss_conf 3777699998479999999996699866987386648799997488999998987-998786570689999985899--- Q ss_pred CCCCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCC Q ss_conf 43356799999999999999986123425778999988776642122102334399999999999999855654534798 Q gi|254780594|r 81 KNRLLSTDPLQRAETRRMIEWFMYQMEQDVTRPLAHERVYKLHMTTEQGGGSPDSKVLRIARNNMREHIKYITWLIKSRS 160 (230) Q Consensus 81 ~~~L~p~~~~~ra~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~l~~le~~l~~~~ 160 (230) ++|+|.++.+|++++.|..+++..+...+. .+. .......+.++.++...+..++..+++++ T Consensus 86 -p~L~P~~p~~ra~~r~~~~~~~~~~~~~~~------~~~-----------~~~~~~~~~~~~~l~~~l~~l~~~l~~~~ 147 (211) T PRK09481 86 -PPLMPVYPVARGESRLMMHRIEKDWYSLMN------KIV-----------NGSASEADAARKQLREELLAIAPVFGEKP 147 (211) T ss_pred -CCCCCCCHHHHHHHHHHHHHHHHHHHHHHH------HHH-----------CCCHHHHHHHHHHHHHHHHHHHHHHCCCC T ss_conf -888999999999999999999988999999------996-----------48876699999999999999999855287 Q ss_pred CCCCCCCCHHHHHHHHHHHHHHHCCC-CCHHHCHHHHHHHHHHHCCHHHHHHHHHHHCCCC Q ss_conf 42667502899999999999996189-9945676889999999709988984133321899 Q gi|254780594|r 161 WIAGHHMSYADFAASATISILDYLGE-IDWDTAPIVKEWYQRMKSRPSFRPLLSERIRGLL 220 (230) Q Consensus 161 fl~Gd~~s~ADi~~~~~l~~~~~~~~-~~~~~~p~l~~w~~r~~~rps~~~~l~e~~~~~~ 220 (230) |++||+||+|||+++++++++..+|. .++..+|+|++|++||.+||||+++++|+++++. T Consensus 148 ~l~Gd~ftlADi~lap~l~~l~~~g~~~~~~~~p~l~~w~~Ri~~RpSf~~tl~e~e~~~~ 208 (211) T PRK09481 148 YFMSEEFSLVDCYLAPLLWRLPVLGIELSGPGAKELKGYMTRVFERDSFLASLTEAEREMR 208 (211) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHHHCCHHHHHHHHHHHHHHH T ss_conf 5567876399999999999999849777864388999999999779899998757999875 No 2 >PRK13972 glutathione S-transferase; Provisional Probab=100.00 E-value=1.5e-44 Score=275.08 Aligned_cols=198 Identities=23% Similarity=0.310 Sum_probs=160.0 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCC---CCCHHHHHHCCCCCCCHHHHCC----H--HHHCCCCHHHHHH Q ss_conf 95213589986899999999986998299988867---7788999855687752011100----0--1110121035677 Q gi|254780594|r 1 MSILYHYPLSSSSRFIRLILSEYEFTTDMIEEFPW---GKRREFLELNPSGTLPVYIDEH----M--QALCGVFVISEYL 71 (230) Q Consensus 1 M~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~---~~~~~~~~lnP~g~vP~L~~~d----g--~~l~eS~aI~~yL 71 (230) |++||+.| |++|+||+++|+|+||+|+.+.++.. +++|+|+++||.|+||+|+|+| | .+|+||.||++|| T Consensus 1 M~kLY~~p-~~n~~rv~~~L~Elgl~ye~~~vd~~~ge~~~peflaiNP~G~VPvL~Dgd~~d~ge~~~l~ES~AIl~YL 79 (215) T PRK13972 1 MIDLYFAP-TPNGHKITLFLEEAGLDYRLIKVDLGKGGQFRPEFLRISPNNKIPAIVDHSPADGGEPLSLFESGAILLYL 79 (215) T ss_pred CEEECCCC-CCCHHHHHHHHHHCCCCCEEEEECCCCCCCCCHHHHHHCCCCCCCEEECCCCCCCCCEEEEECHHHHHHHH T ss_conf 90331499-98789999999976999869985487887388789973979858845326767888567777299999999 Q ss_pred HHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHH Q ss_conf 75288723343356799999999999999986123425778999988776642122102334399999999999999855 Q gi|254780594|r 72 DETYGAIAQKNRLLSTDPLQRAETRRMIEWFMYQMEQDVTRPLAHERVYKLHMTTEQGGGSPDSKVLRIARNNMREHIKY 151 (230) Q Consensus 72 ~~~~~~~~~~~~L~p~~~~~ra~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~l~~ 151 (230) +++++ .+.|.++.+|+++.+|+.|....+.+.+..... +... . .......++....++.+.++. T Consensus 80 a~k~~------~~~~~~~~~ra~~~~Wl~~~~~~~~p~~~~~~~----~~~~----~--~~~~~~~~~~~~~~~~~~~~~ 143 (215) T PRK13972 80 AEKTG------LFLSHETRERAATLQWLFWQVGGLGPMLGQNHH----FNHA----A--PQTIPYAIERYQVETQRLYHV 143 (215) T ss_pred HHHCC------CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHH----HHCC----C--CCCCHHHHHHHHHHHHHHHHH T ss_conf 98437------789989999999999999987413258888765----3104----8--420458999999999999999 Q ss_pred HHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCHHHCHHHHHHHHHHHCCHHHHHHHHHHH Q ss_conf 65453479842667502899999999999996189994567688999999970998898413332 Q gi|254780594|r 152 ITWLIKSRSWIAGHHMSYADFAASATISILDYLGEIDWDTAPIVKEWYQRMKSRPSFRPLLSERI 216 (230) Q Consensus 152 le~~l~~~~fl~Gd~~s~ADi~~~~~l~~~~~~~~~~~~~~p~l~~w~~r~~~rps~~~~l~e~~ 216 (230) +|++|++++|++||+||+|||++++++...... .+++..||+|++|++||.+|||||+++.++- T Consensus 144 le~~L~~~~yl~Gd~fTiADi~~~~~~~~~~~~-~~dl~~~P~l~~w~~rv~~RPa~~~al~~~~ 207 (215) T PRK13972 144 LNKRLENSPWLGGENYSIADIACWPWVNAWTRQ-RIDLAMYPAVKNWHERIRSRPATGQALLKAQ 207 (215) T ss_pred HHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHC-CCCCCCCHHHHHHHHHHHHCHHHHHHHHHHH T ss_conf 999864898401799889999999999999855-8993308799999999985999999999873 No 3 >PRK10357 predicted glutathione S-transferase; Provisional Probab=100.00 E-value=9.9e-44 Score=270.39 Aligned_cols=196 Identities=18% Similarity=0.264 Sum_probs=166.0 Q ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHCCCCCCCHHHHCCHHHHCCCCHHHHHHHHHCCCCCCC Q ss_conf 52135899868999999999869982999888677788999855687752011100011101210356777528872334 Q gi|254780594|r 2 SILYHYPLSSSSRFIRLILSEYEFTTDMIEEFPWGKRREFLELNPSGTLPVYIDEHMQALCGVFVISEYLDETYGAIAQK 81 (230) Q Consensus 2 ~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~~~~~~~~~lnP~g~vP~L~~~dg~~l~eS~aI~~yL~~~~~~~~~~ 81 (230) |+||+.++|||+||||++|.||||+||.+.+++++..+++.++||.||||+|+++||.+|+||.+||+|||+.++ + T Consensus 1 MkLy~~~~SPy~RkVri~l~Ekgl~~e~v~~~~~~~~~~~~~~NP~gkVP~Lv~ddG~~l~eS~~I~~YLd~~~~----~ 76 (202) T PRK10357 1 MKLIGSYTSPFVRKISVLLLEKGITFEFINELPYNADNGVAQYNPLGKVPALVTEEGECWFDSPIIAEYIELLNV----A 76 (202) T ss_pred CEECCCCCCHHHHHHHHHHHHCCCCCEEEECCCCCCCHHHHHHCCCCCCCEEEECCCCEECCHHHHHHHHHHHCC----C T ss_conf 977468998399999999998399857985588787266797597978787985899885588999999997389----9 Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 33567999999999999999861234257789999887766421221023343999999999999998556545347984 Q gi|254780594|r 82 NRLLSTDPLQRAETRRMIEWFMYQMEQDVTRPLAHERVYKLHMTTEQGGGSPDSKVLRIARNNMREHIKYITWLIKSRSW 161 (230) Q Consensus 82 ~~L~p~~~~~ra~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~l~~le~~l~~~~f 161 (230) ++|+|+++.+++++++|..++++.++..+.. ..+. .......+...+..++.++.+.|+.+|.++.++. T Consensus 77 p~L~P~dp~~~~~~~~~~~~ad~~~~~~~~~--~~~~--------~r~~~~~~~~~~~~~~~~i~~~l~~le~~l~~~~- 145 (202) T PRK10357 77 PAMLPRDPLAALRVRQIEALADGIMDAALVS--VREQ--------ARPAAQQSEDELLRQREKINRSLDVLEGYLVDGT- 145 (202) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHH--HHHH--------HCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCC- T ss_conf 8889989999999999999873446899999--9984--------0771105999999999999999999998736598- Q ss_pred CCCCCCCHHHHHHHHHHHHHHHCCC-CCH-HHCHHHHHHHHHHHCCHHHHHHH Q ss_conf 2667502899999999999996189-994-56768899999997099889841 Q gi|254780594|r 162 IAGHHMSYADFAASATISILDYLGE-IDW-DTAPIVKEWYQRMKSRPSFRPLL 212 (230) Q Consensus 162 l~Gd~~s~ADi~~~~~l~~~~~~~~-~~~-~~~p~l~~w~~r~~~rps~~~~l 212 (230) +.|+++|+|||+++|.|.++++... ..| .++|+|.+|++|+.+||||+++. T Consensus 146 ~~g~~~sladia~a~~l~yl~~r~~~~~w~~~~P~L~~w~~~~~~RpSf~~T~ 198 (202) T PRK10357 146 LKTDTVNLATIAIACAVGYLNFRRVAPGWCVDRPHLVKLVETLFQRESFARTE 198 (202) T ss_pred CCCCCCCHHHHHHHHHHHHHHHHCCCHHHHHCCHHHHHHHHHHHCCHHHHCCC T ss_conf 67887769999999999999852568004337989999999997492875449 No 4 >PRK10542 glutathionine S-transferase; Provisional Probab=100.00 E-value=1.5e-43 Score=269.28 Aligned_cols=194 Identities=18% Similarity=0.306 Sum_probs=160.0 Q ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCCCC----CHHHHHHCCCCCCCHHHHCCHHHHCCCCHHHHHHHHHCCC Q ss_conf 521358998689999999998699829998886777----8899985568775201110001110121035677752887 Q gi|254780594|r 2 SILYHYPLSSSSRFIRLILSEYEFTTDMIEEFPWGK----RREFLELNPSGTLPVYIDEHMQALCGVFVISEYLDETYGA 77 (230) Q Consensus 2 ~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~~~----~~~~~~lnP~g~vP~L~~~dg~~l~eS~aI~~yL~~~~~~ 77 (230) ++||+.| +++++||+++|+|+||+|+.+.++...+ +++|+++||.|+||+|+++||.+|+||.||++||+++|+. T Consensus 1 mkLY~~p-~~~s~~~~i~L~E~gi~ye~~~vd~~~~~~~~~~~fl~iNP~gkVP~L~~ddg~~l~ES~AI~~YL~~~~p~ 79 (201) T PRK10542 1 MKLFYKP-GACSLASHITLRESGKDFTLVSVDLAKKRLENGDDYLAINPKGQVPALLLDDGTLLTEGVAIMQYLADSVPD 79 (201) T ss_pred CEEEECC-CCCHHHHHHHHHHCCCCCEEEEECCCCCCCCCCHHHHHHCCCCCCCEEEECCCEEEECHHHHHHHHHHHCCC T ss_conf 9534379-852899999999869998799877888766688889973979998569966983897119999999985887 Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 23343356799999999999999986123425778999988776642122102334399999999999999855654534 Q gi|254780594|r 78 IAQKNRLLSTDPLQRAETRRMIEWFMYQMEQDVTRPLAHERVYKLHMTTEQGGGSPDSKVLRIARNNMREHIKYITWLIK 157 (230) Q Consensus 78 ~~~~~~L~p~~~~~ra~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~l~~le~~l~ 157 (230) . +.+.|.++.+|+++++|+.+....++..... +++. ..+ ......+++++.+.++.+|..|+ T Consensus 80 ~---~l~~p~~~~~r~~~~~wl~~~~~~~~~~~~~------~~~~--------~~~-~~~~~~~~~~l~~~l~~le~~L~ 141 (201) T PRK10542 80 R---QLLAPVGSLSRYHTIEWLNYIATELHKGFTP------LFRP--------DTP-EEYKPTVRAQLEKKFQYVDEALK 141 (201) T ss_pred C---CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHH------HHCC--------CCH-HHHHHHHHHHHHHHHHHHHHHHC T ss_conf 5---4789989999999999999999878788999------8676--------986-99999999999999999999973 Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCHHHCHHHHHHHHHHHCCHHHHHHHHHH Q ss_conf 7984266750289999999999999618999456768899999997099889841333 Q gi|254780594|r 158 SRSWIAGHHMSYADFAASATISILDYLGEIDWDTAPIVKEWYQRMKSRPSFRPLLSER 215 (230) Q Consensus 158 ~~~fl~Gd~~s~ADi~~~~~l~~~~~~~~~~~~~~p~l~~w~~r~~~rps~~~~l~e~ 215 (230) +++||+||++|+|||++++.+.+....+ ++++++|+|.+|++||.+|||||++++.+ T Consensus 142 ~~~~l~G~~~t~ADi~~~~~l~~~~~~~-~~l~~~p~l~~w~~r~~~RPs~q~al~~e 198 (201) T PRK10542 142 DEQWICGQRFTIADAYLFTVLRWAYAVK-LNLEGLSHIAAYMQRVAERPAVAAALKAE 198 (201) T ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHCC-CCCCCCHHHHHHHHHHHHCHHHHHHHHHC T ss_conf 5985158998899999999999998659-99541989999999997299999999980 No 5 >TIGR01262 maiA maleylacetoacetate isomerase; InterPro: IPR005955 Maleylacetoacetate isomerase is an enzyme of tyrosine and phenylalanine catabolism. It requires glutathione and belongs by homology to the zeta family of glutathione S-transferases. The enzyme (5.2.1.2 from EC) is described as active also on maleylpyruvate.; GO: 0003824 catalytic activity, 0009072 aromatic amino acid family metabolic process, 0005737 cytoplasm. Probab=100.00 E-value=6.2e-43 Score=265.80 Aligned_cols=202 Identities=22% Similarity=0.311 Sum_probs=170.0 Q ss_pred CCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCCC-----CCHHHHHHCCCCCCCHHHHCCHHHHCCCCHHHHHHHHHCCC Q ss_conf 2135899868999999999869982999888677-----78899985568775201110001110121035677752887 Q gi|254780594|r 3 ILYHYPLSSSSRFIRLILSEYEFTTDMIEEFPWG-----KRREFLELNPSGTLPVYIDEHMQALCGVFVISEYLDETYGA 77 (230) Q Consensus 3 ~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~~-----~~~~~~~lnP~g~vP~L~~~dg~~l~eS~aI~~yL~~~~~~ 77 (230) +||+|+.|+.|+||||+|+.|||+|+.+.|+... ..++|.++||.|+||+|++++|.+|++|.||+|||||+||+ T Consensus 1 ~LYsYwRSS~SyRVRIALaLK~~~Y~~~pVnL~~dGGeQ~~~~f~~~NP~~~VP~L~~~~g~~l~QSLAIIEYLeE~yP~ 80 (218) T TIGR01262 1 KLYSYWRSSASYRVRIALALKGIDYESVPVNLLKDGGEQREPEFRALNPQGLVPTLDIDEGEVLTQSLAIIEYLEETYPD 80 (218) T ss_pred CCCCCCHHHHHHHHHHHHHHCCCCCCEEEEECCCCCCCCHHHHHHHCCCHHHCCEEECCCCCCHHHHHHHHHHHHHCCCC T ss_conf 94365102268999999984377871000113324621104888742711317834538852123457899988630788 Q ss_pred CCCCCC-CCC--CCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHH-HHHHHHHHHHHHHH Q ss_conf 233433-567--99999999999999986123425778999988776642122102334399999-99999999985565 Q gi|254780594|r 78 IAQKNR-LLS--TDPLQRAETRRMIEWFMYQMEQDVTRPLAHERVYKLHMTTEQGGGSPDSKVLR-IARNNMREHIKYIT 153 (230) Q Consensus 78 ~~~~~~-L~p--~~~~~ra~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~l~~~l~~le 153 (230) ++ |+| +|+..||+++.+...+...+|+ +.+-+|.......... +++... ....-+.++|..+| T Consensus 81 ----~p~LLPPi~dp~~Ra~vRalA~~iAcDIhP-----L~Nl~Vl~~L~~~~g~----~~~~~~~W~~~wI~~Gl~A~E 147 (218) T TIGR01262 81 ----PPRLLPPIADPAKRARVRALALLIACDIHP-----LNNLRVLQYLREKLGV----DEEERRRWYQHWISKGLAALE 147 (218) T ss_pred ----CCCCCCCCCCHHHHHHHHHHHHHHHCCCCC-----CCCHHHHHHHHHHCCC----CHHHHHHHHHHHHHHHHHHHH T ss_conf ----888898767978899999999998338887-----5328999888653289----988888899999998899999 Q ss_pred HHHC----CCCCCCCCCCCHHHHHHHHHHHHHH-HCCCCCHHHCHHHHHHHHHHHCCHHHHHHHHHHHCC Q ss_conf 4534----7984266750289999999999999-618999456768899999997099889841333218 Q gi|254780594|r 154 WLIK----SRSWIAGHHMSYADFAASATISILD-YLGEIDWDTAPIVKEWYQRMKSRPSFRPLLSERIRG 218 (230) Q Consensus 154 ~~l~----~~~fl~Gd~~s~ADi~~~~~l~~~~-~~~~~~~~~~p~l~~w~~r~~~rps~~~~l~e~~~~ 218 (230) .+|. .+.|.+||++|+||||+.|.++.+. +|+ +|+..||.|++-.+++.+.|+|+++..+.-++ T Consensus 148 ~lL~~~~~~G~fc~Gd~~tlAD~cLvPQvynA~~Rf~-vDL~~YP~l~~i~~~~~~lpAF~~AhP~~QPD 216 (218) T TIGR01262 148 ALLQSDPHAGAFCVGDTPTLADLCLVPQVYNAEERFG-VDLTPYPTLRRIAEALAALPAFQAAHPENQPD 216 (218) T ss_pred HHHHCCCCCCCEEECCCCCEEEEEHHHHHHHHHHHHC-CCCCCCCHHHHHHHHHHHCHHHHHHCCCCCCC T ss_conf 9860444478265178242201215379999887414-78777856899999986243788714304688 No 6 >COG0625 Gst Glutathione S-transferase [Posttranslational modification, protein turnover, chaperones] Probab=100.00 E-value=2.9e-39 Score=244.74 Aligned_cols=194 Identities=29% Similarity=0.424 Sum_probs=160.6 Q ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCC--CCCHHHHHHCCCCCCCHHHHCCHHHHCCCCHHHHHHHHHCCCCC Q ss_conf 5213589986899999999986998299988867--77889998556877520111000111012103567775288723 Q gi|254780594|r 2 SILYHYPLSSSSRFIRLILSEYEFTTDMIEEFPW--GKRREFLELNPSGTLPVYIDEHMQALCGVFVISEYLDETYGAIA 79 (230) Q Consensus 2 ~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~--~~~~~~~~lnP~g~vP~L~~~dg~~l~eS~aI~~yL~~~~~~~~ 79 (230) ++||+.+.||||+||+++|.|+|++|+.+.++.. .+.++|+++||+|+||+|+++||.+|+||.+|++||+++|+. T Consensus 1 ~~L~~~~~sp~~~kv~l~l~e~g~~ye~~~v~~~~~~~~~~~~~~nP~g~VPvL~~~~~~~l~ES~AI~~YL~~~~~~-- 78 (211) T COG0625 1 MKLYGSPTSPYSRKVRLALEEKGLPYEIVLVDLDAEQKPPDFLALNPLGKVPALVDDDGEVLTESGAILEYLAERYPG-- 78 (211) T ss_pred CEEEECCCCCHHHHHHHHHHHCCCCCEEEEECCCCCCCCHHHHHCCCCCCCCEEEECCCCEEECHHHHHHHHHHHCCC-- T ss_conf 957657999679999999984899846998455555466789965989898979879998898689999999985899-- Q ss_pred CCCCCCCCCHH---HHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 34335679999---999999999998612342577899998877664212210233439999999999999985565453 Q gi|254780594|r 80 QKNRLLSTDPL---QRAETRRMIEWFMYQMEQDVTRPLAHERVYKLHMTTEQGGGSPDSKVLRIARNNMREHIKYITWLI 156 (230) Q Consensus 80 ~~~~L~p~~~~---~ra~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~l~~le~~l 156 (230) ++|+|.++. +|+++.+|..+....+++.+...... . . .... ..+....+.+...+...+..+|..| T Consensus 79 --~~l~p~~~~~r~~r~~~~~~~~~~~~~~~~~~~~~~~~-----~-~-~~~~--~~~~~~~~~~~~~~~~~l~~le~~L 147 (211) T COG0625 79 --PPLLPADPLARRARALLLWWLFFAASDLHPVIGQRRRA-----L-L-GSEP--ELLEAALEAARAEIRALLALLEALL 147 (211) T ss_pred --CCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHHH-----H-C-CCCC--CCHHHHHHHHHHHHHHHHHHHHHHH T ss_conf --77647980668999999999999985368999998652-----0-0-1222--0128899999999999999999873 Q ss_pred CCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCHHHCHHHHHHHHHHHCCHHHH Q ss_conf 47984266750289999999999999618999456768899999997099889 Q gi|254780594|r 157 KSRSWIAGHHMSYADFAASATISILDYLGEIDWDTAPIVKEWYQRMKSRPSFR 209 (230) Q Consensus 157 ~~~~fl~Gd~~s~ADi~~~~~l~~~~~~~~~~~~~~p~l~~w~~r~~~rps~~ 209 (230) ++++|++|+++|+||+++++.++++...+ ..+..+|++++|++||.+||+++ T Consensus 148 ~~~~~l~G~~~tiAD~~~~~~~~~~~~~~-~~~~~~p~l~~w~~r~~~rp~~~ 199 (211) T COG0625 148 ADGPYLAGDRFTIADIALAPLLWRLALLG-EELADYPALKAWYERVLARPAFR 199 (211) T ss_pred CCCCEECCCCCCHHHHHHHHHHHHHHHCC-CCCCCCHHHHHHHHHHHHCCCCH T ss_conf 55965118988899999989999987424-66535889999999998487730 No 7 >PRK11752 putative glutathione S-transferase YghU; Provisional Probab=100.00 E-value=1.6e-37 Score=234.76 Aligned_cols=195 Identities=18% Similarity=0.228 Sum_probs=153.4 Q ss_pred CCCCCCCCCHHHHHHHHHHHHC------CCCEEEEECCCC---CCCHHHHHHCCCCCCCHHHHCCHH---HHCCCCHHHH Q ss_conf 5213589986899999999986------998299988867---778899985568775201110001---1101210356 Q gi|254780594|r 2 SILYHYPLSSSSRFIRLILSEY------EFTTDMIEEFPW---GKRREFLELNPSGTLPVYIDEHMQ---ALCGVFVISE 69 (230) Q Consensus 2 ~~LY~~~~sp~~~kvr~~L~e~------gi~y~~~~~~~~---~~~~~~~~lnP~g~vP~L~~~dg~---~l~eS~aI~~ 69 (230) +.||++++ |+.+||-++|+|+ |++|+.+.++.. ..+++|+++||+||||+|+|+++. .|+||.||+. T Consensus 45 ~qLYt~~T-PNG~KvsI~LEEl~~~g~~gl~Y~~~~idI~~geQf~peFl~INPN~KIPaIVD~~~~~pi~vFESGAIL~ 123 (264) T PRK11752 45 LQLYSLGT-PNGQKVTIMLEELLALGVKGAEYDAWLIRIGEGDQFSSGFVEINPNSKIPALLDRSGNPPIRVFESGAILL 123 (264) T ss_pred CEEEECCC-CCCEEHHHHHHHHHHCCCCCCCCEEEEECCCCCCCCCHHHHEECCCCCCCEEECCCCCCCEEEECHHHHHH T ss_conf 11462589-99740022898787546678973589954888872686765048799887687279998536864789999 Q ss_pred HHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHH Q ss_conf 77752887233433567999999999999999861234257789999887766421221023343999999999999998 Q gi|254780594|r 70 YLDETYGAIAQKNRLLSTDPLQRAETRRMIEWFMYQMEQDVTRPLAHERVYKLHMTTEQGGGSPDSKVLRIARNNMREHI 149 (230) Q Consensus 70 yL~~~~~~~~~~~~L~p~~~~~ra~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~l 149 (230) ||+++|+ +|+|+++.+|+++.+|+.|.... .+.+.....+-..+.+ . ...-.++....+.++.+ T Consensus 124 YLAeK~g------kflP~d~~~r~~vlqWL~wQmg~-~p~m~gq~~hF~~yap-----e----k~~yai~RY~~E~~Rly 187 (264) T PRK11752 124 YLAEKFG------AFLPKDLAARTEVLNWLFWQQGS-APFLGGGFGHFYAYAP-----E----KIEYAINRFTMEAKRQL 187 (264) T ss_pred HHHHHHC------CCCCCCHHHHHHHHHHHHHHHCC-CCCCHHHHHHHHHHCC-----C----CCCHHHHHHHHHHHHHH T ss_conf 9999737------64799989999999999998616-7640788899998488-----6----55217899999999999 Q ss_pred HHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHH---CC--C-CCHHHCHHHHHHHHHHHCCHHHHHHHH Q ss_conf 5565453479842667502899999999999996---18--9-994567688999999970998898413 Q gi|254780594|r 150 KYITWLIKSRSWIAGHHMSYADFAASATISILDY---LG--E-IDWDTAPIVKEWYQRMKSRPSFRPLLS 213 (230) Q Consensus 150 ~~le~~l~~~~fl~Gd~~s~ADi~~~~~l~~~~~---~~--~-~~~~~~p~l~~w~~r~~~rps~~~~l~ 213 (230) ..||.+|++++||+||++|+|||+++|++..+.. +. . ++.++||+|.+|++|+.+||+|++.++ T Consensus 188 ~VLd~rL~~~~ylaGd~YSIADIA~~PWv~~~~~~~~~~~~~~~~l~~yPnv~rW~~~I~~RPAVqRG~~ 257 (264) T PRK11752 188 DVLDKQLAEHEYIAGDEYTIADIAIWPWYGNLVLGNLYDAAEFLDVGSYKHVQRWAKEIAERPAVKRGRI 257 (264) T ss_pred HHHHHHHCCCCEECCCCCCEEEEEECCCHHHHHHCCCCCCCCCCCHHHCHHHHHHHHHHHCCHHHHHHHH T ss_conf 9999987359555468742022453300156641321341111672018289999999864888786403 No 8 >KOG0406 consensus Probab=100.00 E-value=7.9e-36 Score=225.00 Aligned_cols=195 Identities=22% Similarity=0.314 Sum_probs=160.0 Q ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHC-CCCCCCHHHHCCHHHHCCCCHHHHHHHHHCCCCCC Q ss_conf 52135899868999999999869982999888677788999855-68775201110001110121035677752887233 Q gi|254780594|r 2 SILYHYPLSSSSRFIRLILSEYEFTTDMIEEFPWGKRREFLELN-PSGTLPVYIDEHMQALCGVFVISEYLDETYGAIAQ 80 (230) Q Consensus 2 ~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~~~~~~~~~ln-P~g~vP~L~~~dg~~l~eS~aI~~yL~~~~~~~~~ 80 (230) .+||++++|||++||+++|++|||+||.+.++..++++.+++.| +.+|||+|++++ ..|+||..|++|||+.+++ T Consensus 10 vrL~~~w~sPfa~R~~iaL~~KgI~yE~veedl~~Ks~~ll~~np~hkKVPvL~Hn~-k~i~ESliiveYiDe~w~~--- 85 (231) T KOG0406 10 VKLLGMWFSPFAQRVRIALKLKGIPYEYVEEDLTNKSEWLLEKNPVHKKVPVLEHNG-KPICESLIIVEYIDETWPS--- 85 (231) T ss_pred EEEEEEECCHHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHCCCCCCCCEEEECC-CEEHHHHHHHHHHHHHCCC--- T ss_conf 799974168499999999985599618873487778889987466556687799889-3010367899988742568--- Q ss_pred CCCCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHC-CC Q ss_conf 43356799999999999999986123425778999988776642122102334399999999999999855654534-79 Q gi|254780594|r 81 KNRLLSTDPLQRAETRRMIEWFMYQMEQDVTRPLAHERVYKLHMTTEQGGGSPDSKVLRIARNNMREHIKYITWLIK-SR 159 (230) Q Consensus 81 ~~~L~p~~~~~ra~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~l~~le~~l~-~~ 159 (230) +++++|+||.+||+++.|..+++..+...... +... ...+..+.+.++..+.|+.+|..|. ++ T Consensus 86 ~~~iLP~DPy~Ra~arfwa~~id~~~~~~~~~------~~~~----------~~~e~~~~~~~e~~e~l~~lE~el~k~k 149 (231) T KOG0406 86 GPPILPSDPYERAQARFWAEYIDKKVFFVGRF------VVAA----------KGGEEQEAAKEELREALKVLEEELGKGK 149 (231) T ss_pred CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHH------HHHH----------CCCHHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 99889998799999999999987588898889------9751----------5765788899999999999999970489 Q ss_pred CCCCCCCCCHHHHHHHHHHHHHH----HCC---CCCHHHCHHHHHHHHHHHCCHHHHHHHHHHH Q ss_conf 84266750289999999999999----618---9994567688999999970998898413332 Q gi|254780594|r 160 SWIAGHHMSYADFAASATISILD----YLG---EIDWDTAPIVKEWYQRMKSRPSFRPLLSERI 216 (230) Q Consensus 160 ~fl~Gd~~s~ADi~~~~~l~~~~----~~~---~~~~~~~p~l~~w~~r~~~rps~~~~l~e~~ 216 (230) +|+.|+++++.|+++.+++.++. -++ ..+-+++|.|.+|.+||.++|+|++++.+.+ T Consensus 150 ~~fgG~~~G~vDi~~~p~~~~~~~~~~~~~~~~~~~~~~~P~L~~W~~~~~~~~~V~~~~p~~e 213 (231) T KOG0406 150 DFFGGETIGFVDIAIGPSFERWLAVLEKFGGVKFIIEEETPKLIKWIKRMKEDEAVKAVLPDSE 213 (231) T ss_pred CCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHCCHHHHHHCCCHH T ss_conf 9888997677466688638999999998367635787789629999999735726775469999 No 9 >KOG0868 consensus Probab=100.00 E-value=1.2e-34 Score=218.23 Aligned_cols=202 Identities=22% Similarity=0.283 Sum_probs=164.3 Q ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCCCC----CHHHHHHCCCCCCCHHHHCCHHHHCCCCHHHHHHHHHCCC Q ss_conf 521358998689999999998699829998886777----8899985568775201110001110121035677752887 Q gi|254780594|r 2 SILYHYPLSSSSRFIRLILSEYEFTTDMIEEFPWGK----RREFLELNPSGTLPVYIDEHMQALCGVFVISEYLDETYGA 77 (230) Q Consensus 2 ~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~~~----~~~~~~lnP~g~vP~L~~~dg~~l~eS~aI~~yL~~~~~~ 77 (230) .+||+|+.|.+++|||++|+.|||+||.+.++.... ..+|.++||+++||+|++ ||.+|+||.||++||||+|| T Consensus 6 piLYSYWrSSCswRVRiALaLK~iDYey~PvnLlk~~~q~~~ef~~iNPm~kVP~L~i-~g~tl~eS~AII~YLeEt~P- 83 (217) T KOG0868 6 PILYSYWRSSCSWRVRIALALKGIDYEYKPVNLLKEEDQSDSEFKEINPMEKVPTLVI-DGLTLTESLAIIEYLEETYP- 83 (217) T ss_pred CHHHHHHCCCCHHHHHHHHHHCCCCCCEEEHHHHCCHHHHHHHHHHCCCHHHCCEEEE-CCEEEEHHHHHHHHHHHCCC- T ss_conf 2133430342148999999870798520311643004441367763492420974788-99874148999999985189- Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 23343356799999999999999986123425778999988776642122102334399999999999999855654534 Q gi|254780594|r 78 IAQKNRLLSTDPLQRAETRRMIEWFMYQMEQDVTRPLAHERVYKLHMTTEQGGGSPDSKVLRIARNNMREHIKYITWLIK 157 (230) Q Consensus 78 ~~~~~~L~p~~~~~ra~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~l~~le~~l~ 157 (230) +++|+|.|+..||.+++....+.+.+.+-... .+.+.......+. ....++.-+..++..+|..|. T Consensus 84 ---~ppLLP~d~~KRA~~r~i~~~i~sgIQPlQNl-----~vl~~l~ek~~~~------~~~W~q~~ItkGF~ALEklL~ 149 (217) T KOG0868 84 ---DPPLLPKDPHKRAKARAISLLIASGIQPLQNL-----SVLKMLNEKEPGY------GDQWAQHFITKGFTALEKLLK 149 (217) T ss_pred ---CCCCCCCCHHHHHHHHHHHHHHHHCCCCCHHH-----HHHHHHCCCCCCH------HHHHHHHHHHHHHHHHHHHHH T ss_conf ---99889867788999999999998178800215-----9999860004543------668999999886999999999 Q ss_pred C--CCCCCCCCCCHHHHHHHHHHHHHHHCCCCCHHHCHHHHHHHHHHHCCHHHHHHHHHHHCCCC Q ss_conf 7--98426675028999999999999961899945676889999999709988984133321899 Q gi|254780594|r 158 S--RSWIAGHHMSYADFAASATISILDYLGEIDWDTAPIVKEWYQRMKSRPSFRPLLSERIRGLL 220 (230) Q Consensus 158 ~--~~fl~Gd~~s~ADi~~~~~l~~~~~~~~~~~~~~p~l~~w~~r~~~rps~~~~l~e~~~~~~ 220 (230) . ++|.+||++|+||+++.+.++.+..|. ++...||.+.+..+.+.+.|+||.+..+..++-+ T Consensus 150 ~~aGkycvGDevtiADl~L~pqv~nA~rf~-vdl~PYPti~ri~e~l~elpaFq~ahP~nQPD~P 213 (217) T KOG0868 150 SHAGKYCVGDEVTIADLCLPPQVYNANRFH-VDLTPYPTITRINEELAELPAFQAAHPDNQPDTP 213 (217) T ss_pred HCCCCCCCCCEEEHHHHCCCHHHHHHHHCC-CCCCCCCHHHHHHHHHHHCHHHHHCCCCCCCCCC T ss_conf 746872127600255632226663001210-3687673199999998757788852987899998 No 10 >KOG0867 consensus Probab=100.00 E-value=9.1e-34 Score=213.19 Aligned_cols=204 Identities=23% Similarity=0.349 Sum_probs=165.9 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCC---CCCCHHHHHHCCCCCCCHHHHCCHHHHCCCCHHHHHHHHHCCC Q ss_conf 9521358998689999999998699829998886---7778899985568775201110001110121035677752887 Q gi|254780594|r 1 MSILYHYPLSSSSRFIRLILSEYEFTTDMIEEFP---WGKRREFLELNPSGTLPVYIDEHMQALCGVFVISEYLDETYGA 77 (230) Q Consensus 1 M~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~---~~~~~~~~~lnP~g~vP~L~~~dg~~l~eS~aI~~yL~~~~~~ 77 (230) .++||+++.||.|++|.+++.++|++|+.+.++. .+++|+|+++||.|+||+|+++ |..++||.||+.||.++|. T Consensus 2 ~~~ly~~~~s~~~r~vl~~~~~~~l~~e~~~v~~~~ge~~~pefl~~nP~~kVP~l~d~-~~~l~eS~AI~~Yl~~ky~- 79 (226) T KOG0867 2 KLKLYGHLGSPPARAVLIAAKELGLEVELKPVDLVKGEQKSPEFLKLNPLGKVPALEDG-GLTLWESHAILRYLAEKYG- 79 (226) T ss_pred CCEEECCCCCHHHHHHHHHHHHCCCCCCEEECHHHCCCCCCHHHHHCCCCCCCCEEECC-CEEEEEHHHHHHHHHHHHC- T ss_conf 81672478870179999999983996221221011134599688842878888867469-8568609999999999856- Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHC Q ss_conf 23343356799999999999999986123425778999988776642122102334399999999999999855654534 Q gi|254780594|r 78 IAQKNRLLSTDPLQRAETRRMIEWFMYQMEQDVTRPLAHERVYKLHMTTEQGGGSPDSKVLRIARNNMREHIKYITWLIK 157 (230) Q Consensus 78 ~~~~~~L~p~~~~~ra~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~l~~le~~l~ 157 (230) ..+..|+|.+..+||.+.+|+.|..+.+..... ...+..+ .......+....+....++...++.+|..|. T Consensus 80 -~~~~~l~p~~~~~ra~v~~~l~~~~~~l~~~~~----~~~~~~p----~~~~~~~~~~~~~~~~~~~~~~~~~~e~~l~ 150 (226) T KOG0867 80 -PLGGILLPKDLKERAIVDQWLEFENGVLDPVTF----ERPILAP----LLVGLPLNPTAVKELEAKLRKALDNLERFLK 150 (226) T ss_pred -CCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCCC----EEEEECC----HHHCCCCCHHHHHHHHHHHHHHHHHHHHHHC T ss_conf -578667876788999999999886353053212----1034313----1112555416788889999999999999973 Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHH--HHCCCCCHHHCHHHHHHHHHHHCCHHHHHHHHHH Q ss_conf 798426675028999999999999--9618999456768899999997099889841333 Q gi|254780594|r 158 SRSWIAGHHMSYADFAASATISIL--DYLGEIDWDTAPIVKEWYQRMKSRPSFRPLLSER 215 (230) Q Consensus 158 ~~~fl~Gd~~s~ADi~~~~~l~~~--~~~~~~~~~~~p~l~~w~~r~~~rps~~~~l~e~ 215 (230) ++.|++|+++|+||+++++.+..+ ......+..++|++.+|++++.++|+++++.... T Consensus 151 ~~~yl~g~~~tlADl~~~~~~~~~~~~~~~~~~~~~~p~v~~W~~~~~~~P~~~e~~~~~ 210 (226) T KOG0867 151 TQVYLAGDQLTLADLSLASTLSQFQGKFATEKDFEKYPKVARWYERIQKRPAYEEANEKG 210 (226) T ss_pred CCCCCCCCCCCHHHHHHHHHHHHHHHHHHCHHHHHHCHHHHHHHHHHHHCCHHHHHHHHH T ss_conf 487147886009999973207541114322001123567889999887384177751665 No 11 >KOG4420 consensus Probab=99.95 E-value=1.5e-27 Score=177.53 Aligned_cols=210 Identities=23% Similarity=0.374 Sum_probs=144.0 Q ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCC---CCCHHHHHHCCCCCCCHHHHCCHHHHCCCCHHHHHHHHHCCCC Q ss_conf 5213589986899999999986998299988867---7788999855687752011100011101210356777528872 Q gi|254780594|r 2 SILYHYPLSSSSRFIRLILSEYEFTTDMIEEFPW---GKRREFLELNPSGTLPVYIDEHMQALCGVFVISEYLDETYGAI 78 (230) Q Consensus 2 ~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~---~~~~~~~~lnP~g~vP~L~~~dg~~l~eS~aI~~yL~~~~~~~ 78 (230) ++|||+|.|--|+|||++++||||+|+-..+++- +..|+|..+||.|.||||++++ .+|+|+.-|++|++.+|-. T Consensus 27 ~vLyhhpysf~sQkVrlvi~EK~id~~~y~V~l~~geh~epwFmrlNp~gevPVl~~g~-~II~d~tqIIdYvErtf~g- 104 (325) T KOG4420 27 LVLYHHPYSFSSQKVRLVIAEKGIDCEEYDVSLPQGEHKEPWFMRLNPGGEVPVLIHGD-NIISDYTQIIDYVERTFTG- 104 (325) T ss_pred CEEEECCCCCCCCEEEEEHHHCCCCCCEEECCCCCCCCCCCHHEECCCCCCCCEEECCC-EECCCHHHHHHHHHHHHCC- T ss_conf 35661576554310243211206565000026766554673212407899774685487-0125579999999986146- Q ss_pred CCCCCCCCCCHHHH-HHHH--HHH-------HHHHH-CCCHHHHH----H---HHH-------HHHHHHHHH-------- Q ss_conf 33433567999999-9999--999-------99861-23425778----9---999-------887766421-------- Q gi|254780594|r 79 AQKNRLLSTDPLQR-AETR--RMI-------EWFMY-QMEQDVTR----P---LAH-------ERVYKLHMT-------- 125 (230) Q Consensus 79 ~~~~~L~p~~~~~r-a~~~--~~~-------~~~~~-~~~~~~~~----~---~~~-------~~~~~~~~~-------- 125 (230) +..|.|+-.... .++. +-. .|..+ .+|+++.. + .+. +.+.+.... T Consensus 105 --er~l~pe~~S~~~d~~l~~e~~l~~lpm~~~t~g~~lh~eL~~~s~iP~~~~iR~~~~k~~~~v~~l~~~e~pdla~a 182 (325) T KOG4420 105 --ERVLMPEVGSLQHDRVLQYEELLDALPMDAYTHGCILHPELTTDSMIPKYAEIRRHLAKATTDVMKLDHEEEPDLAEA 182 (325) T ss_pred --CCCCCCCCCCCCCHHHHHHHHHHHHCCCCHHHCCCCCCCHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCHHHHH T ss_conf --521034423431278999999997457422322560360231033476307999999877899999875228135679 Q ss_pred -------HH-HCCCCCHHHHHHHHHHHHHHHHHHHHHHHCC----CCCCCCCCCCHHHHHHHHHHHHHHHCCCC--CHH- Q ss_conf -------22-1023343999999999999998556545347----98426675028999999999999961899--945- Q gi|254780594|r 126 -------TE-QGGGSPDSKVLRIARNNMREHIKYITWLIKS----RSWIAGHHMSYADFAASATISILDYLGEI--DWD- 190 (230) Q Consensus 126 -------~~-~~~~~~~~~~~~~~~~~l~~~l~~le~~l~~----~~fl~Gd~~s~ADi~~~~~l~~~~~~~~~--~~~- 190 (230) .. ....-.+.+..++...++...|+.+|+.|.+ ..||+|++||+||+++.+.|+.+..+|.. -|. T Consensus 183 y~akqkkl~~kl~~hdd~s~lkkild~l~~~Ld~VEteLe~r~~~~~wL~G~efslADVsLg~~LhRL~~Lg~e~~yw~~ 262 (325) T KOG4420 183 YLAKQKKLMAKLLEHDDVSYLKKILDELAMVLDQVETELEKRKLCELWLCGCEFSLADVSLGATLHRLKFLGLEKKYWED 262 (325) T ss_pred HHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCEEECCCCCHHHHHHHHHHHHHHHHCCCHHHHCCC T ss_conf 99999999988870125899999999999999999999860464523422662118889999999999870437875125 Q ss_pred -HCHHHHHHHHHHHCCHHHHHHHHHH Q ss_conf -6768899999997099889841333 Q gi|254780594|r 191 -TAPIVKEWYQRMKSRPSFRPLLSER 215 (230) Q Consensus 191 -~~p~l~~w~~r~~~rps~~~~l~e~ 215 (230) .+|+|.+|++|+..|+||++++++- T Consensus 263 gsrpnle~Yf~rvrrR~sf~kvlg~~ 288 (325) T KOG4420 263 GSRPNLESYFERVRRRFSFRKVLGDI 288 (325) T ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHH T ss_conf 78746999999998436788750367 No 12 >PTZ00057 glutathione s-transferase; Provisional Probab=99.92 E-value=7.7e-24 Score=156.18 Aligned_cols=189 Identities=15% Similarity=0.215 Sum_probs=134.8 Q ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCCCCC-HHHH-----HHCCCCCCCHHHHCCHHHHCCCCHHHHHHHHHC Q ss_conf 5213589986899999999986998299988867778-8999-----855687752011100011101210356777528 Q gi|254780594|r 2 SILYHYPLSSSSRFIRLILSEYEFTTDMIEEFPWGKR-REFL-----ELNPSGTLPVYIDEHMQALCGVFVISEYLDETY 75 (230) Q Consensus 2 ~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~~~~-~~~~-----~lnP~g~vP~L~~~dg~~l~eS~aI~~yL~~~~ 75 (230) +|||+++.-..+.-+|++|++.|++|+-+........ .++. +..|.|++|+|++ ||..|++|.||++||+.++ T Consensus 5 ~~L~YF~~RGraE~iRlll~~agv~yeD~r~~~~~~~~~e~~~~~~~~~~pfgqlP~Le~-dg~~i~QS~AI~RYlark~ 83 (211) T PTZ00057 5 IVLYYFDARGKAELIRLIFAYLGIEYTDKRFGENGDAFIEFKNFKKEKDTPFEQVPILEI-DDIIFAQSQAIVRYLSKKY 83 (211) T ss_pred CEEEEECCCHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHHHCCCCCCCCCCEEEE-CCEEEECHHHHHHHHHHHC T ss_conf 089851984476999999998499837886366740389988514316998889998998-9999861899999999984 Q ss_pred CCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHH Q ss_conf 87233433567999999999999999861234257789999887766421221023343999999999999998556545 Q gi|254780594|r 76 GAIAQKNRLLSTDPLQRAETRRMIEWFMYQMEQDVTRPLAHERVYKLHMTTEQGGGSPDSKVLRIARNNMREHIKYITWL 155 (230) Q Consensus 76 ~~~~~~~~L~p~~~~~ra~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~l~~le~~ 155 (230) + |.|+++.+++.+.++....... .. ........+ ....+...+.+...+..+|.. T Consensus 84 g-------l~g~~~~e~~~~D~i~~~~~D~-~~----~~~~~~~~~-------------~~~~~~~~~~lp~~~~~~e~~ 138 (211) T PTZ00057 84 K-------ICGESELNEFYADMIFCGVQDI-HY----KFNNTNLFK-------------QNETTFLNEDLPKWSGYFEKI 138 (211) T ss_pred C-------CCCCCHHHHHHHHHHHHHHHHH-HH----HHHHHHHHH-------------HHHHHHHHHHHHHHHHHHHHH T ss_conf 9-------8999999999999999999999-98----852256666-------------669999998889999999999 Q ss_pred H--------CCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCHHHCHHHHHHHHHHHCCHHHHHHHHHHH Q ss_conf 3--------479842667502899999999999996189994567688999999970998898413332 Q gi|254780594|r 156 I--------KSRSWIAGHHMSYADFAASATISILDYLGEIDWDTAPIVKEWYQRMKSRPSFRPLLSERI 216 (230) Q Consensus 156 l--------~~~~fl~Gd~~s~ADi~~~~~l~~~~~~~~~~~~~~p~l~~w~~r~~~rps~~~~l~e~~ 216 (230) | ++++|++|+++|+||++++..+..+.......++.||+|+++++|++++|.+++.++.+. T Consensus 139 L~~n~~~~~~~~~~~vG~~lT~aD~~~~~~ld~l~~~~p~~l~~~P~L~~~~~~v~~~P~Ik~yl~sRp 207 (211) T PTZ00057 139 LKKNHCNNNNDKYYFVGDDLTYADLAVFNLYDDIETKYPNSLKNFPLLKAHNEFISNLPNIKNYISNRK 207 (211) T ss_pred HHHCCCCCCCCCCEEECCCCCHHHHHHHHHHHHHHHHCCCHHHCCHHHHHHHHHHHCCHHHHHHHHCCC T ss_conf 986434446898776689974999999999998988690432219769999999983978899985399 No 13 >PRK10387 glutaredoxin 2; Provisional Probab=99.88 E-value=4.6e-22 Score=145.99 Aligned_cols=190 Identities=15% Similarity=0.187 Sum_probs=125.7 Q ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHCCCCCCCHHHHCCHHHHCCCCHHHHHHHHHCCCCCCC Q ss_conf 52135899868999999999869982999888677788999855687752011100011101210356777528872334 Q gi|254780594|r 2 SILYHYPLSSSSRFIRLILSEYEFTTDMIEEFPWGKRREFLELNPSGTLPVYIDEHMQALCGVFVISEYLDETYGAIAQK 81 (230) Q Consensus 2 ~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~~~~~~~~~lnP~g~vP~L~~~dg~~l~eS~aI~~yL~~~~~~~~~~ 81 (230) |+||.|..||||.|||+++..+||++|.+.+.. ...+....+.+.++||+|+.+||.++.||..|+.|+++..+. T Consensus 1 MKLy~y~~CPfc~rarm~~~~k~i~~e~~~l~~-dd~~t~~~~~g~k~VPvL~~~dG~~~~ESlDIi~yiD~~~~~---- 75 (210) T PRK10387 1 MKLYIYDHCPFCVKARMIAGLKNIPVELIVLAN-DDEATPIRMIGAKQVPILQKDDGSYMPESLDIVHYVDELDGK---- 75 (210) T ss_pred CCCEEECCCHHHHHHHHHHHHCCCCEEEEEECC-CCCCCHHHHHCCCCCCEEECCCCCCCHHHHHHHHHHHHCCCC---- T ss_conf 973142647899999999985499738875024-884037887157778979718988625599999999853797---- Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHH-----H-------HHHHHHHHHHHCCCCCHHHHH---HHHHHHHH Q ss_conf 33567999999999999999861234257789999-----8-------877664212210233439999---99999999 Q gi|254780594|r 82 NRLLSTDPLQRAETRRMIEWFMYQMEQDVTRPLAH-----E-------RVYKLHMTTEQGGGSPDSKVL---RIARNNMR 146 (230) Q Consensus 82 ~~L~p~~~~~ra~~~~~~~~~~~~~~~~~~~~~~~-----~-------~~~~~~~~~~~~~~~~~~~~~---~~~~~~l~ 146 (230) ++..... +.++..|..-..... ..+..+.+. | ..+.. ...... + ...+.. ....++++ T Consensus 76 -~~l~~~~--~~~i~~W~~~v~~~~-~~L~~PR~~~~~lpEf~t~~A~~yf~~-kKe~~~-g-~f~~~l~~t~~~~~~l~ 148 (210) T PRK10387 76 -PLLTGKR--SPAIEEWLRKVNGYL-NKLLYPRFAKADLPEFATPSARQYFID-KKEASI-G-SFDALLAHSPGLIKEIN 148 (210) T ss_pred -CCCCCCC--CHHHHHHHHHHHHHH-HHHHCCHHHCCCCCCCCCHHHHHHHHH-HHHHHC-C-CHHHHHHCCHHHHHHHH T ss_conf -3356888--889999999988898-877500454089844277879999997-622410-8-99999856799999999 Q ss_pred HHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCHHHCHHHHHHHHHHHCCH Q ss_conf 998556545347984266750289999999999999618999456768899999997099 Q gi|254780594|r 147 EHIKYITWLIKSRSWIAGHHMSYADFAASATISILDYLGEIDWDTAPIVKEWYQRMKSRP 206 (230) Q Consensus 147 ~~l~~le~~l~~~~fl~Gd~~s~ADi~~~~~l~~~~~~~~~~~~~~p~l~~w~~r~~~rp 206 (230) ..|..++..+...+++.| ++|+.||.+++.|..+.....+.|. |++.+|+++|.++- T Consensus 149 ~~L~~L~~ll~~~~~~~g-~ls~dDi~lFp~LR~lt~vkgi~~p--~~v~~y~~~~s~~t 205 (210) T PRK10387 149 ADLRALDPLIVKPNAVNG-ELSEDDIHLFPLLRNLTLVKGIEWP--PRVADYRDNMSKKT 205 (210) T ss_pred HHHHHHHHHHCCCHHHCC-CCCHHHHHHHHHHCCCEEECCCCCC--HHHHHHHHHHHHHH T ss_conf 999999998528231078-7879999998986052033488788--79999999999984 No 14 >cd03059 GST_N_SspA GST_N family, Stringent starvation protein A (SspA) subfamily; SspA is a RNA polymerase (RNAP)-associated protein required for the lytic development of phage P1 and for stationary phase-induced acid tolerance of E. coli. It is implicated in survival during nutrient starvation. SspA adopts the GST fold with an N-terminal TRX-fold domain and a C-terminal alpha helical domain, but it does not bind glutathione (GSH) and lacks GST activity. SspA is highly conserved among gram-negative bacteria. Related proteins found in Neisseria (called RegF), Francisella and Vibrio regulate the expression of virulence factors necessary for pathogenesis. Probab=99.88 E-value=5.6e-23 Score=151.23 Aligned_cols=73 Identities=33% Similarity=0.520 Sum_probs=69.4 Q ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHCCCCCCCHHHHCCHHHHCCCCHHHHHHHHHC Q ss_conf 52135899868999999999869982999888677788999855687752011100011101210356777528 Q gi|254780594|r 2 SILYHYPLSSSSRFIRLILSEYEFTTDMIEEFPWGKRREFLELNPSGTLPVYIDEHMQALCGVFVISEYLDETY 75 (230) Q Consensus 2 ~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~~~~~~~~~lnP~g~vP~L~~~dg~~l~eS~aI~~yL~~~~ 75 (230) ++||+++.||||+|||++|.||||+|+.+.+++++++++|.++||.|+||+|+++| .+|+||.+|++|||++| T Consensus 1 m~LY~~~~sp~~~rvri~L~ekgl~~e~~~v~~~~~~~~~~~lNP~G~VP~L~d~d-~~l~eS~~I~eYLde~y 73 (73) T cd03059 1 MTLYSGPDDVYSHRVRIVLAEKGVSVEIIDVDPDNPPEDLAELNPYGTVPTLVDRD-LVLYESRIIMEYLDERF 73 (73) T ss_pred CEEECCCCCHHHHHHHHHHHHCCCCCEEEEECCCCCCHHHHHHCCCCCCCEEEECC-EEEECHHHHHHHHHHHC T ss_conf 98975899846999999999759962899848777969999658589988899899-99877899999998239 No 15 >KOG1695 consensus Probab=99.86 E-value=6.5e-21 Score=139.38 Aligned_cols=194 Identities=19% Similarity=0.233 Sum_probs=140.6 Q ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHCCCCCCCHHHHCCHHHHCCCCHHHHHHHHHCCCCCCC Q ss_conf 52135899868999999999869982999888677788999855687752011100011101210356777528872334 Q gi|254780594|r 2 SILYHYPLSSSSRFIRLILSEYEFTTDMIEEFPWGKRREFLELNPSGTLPVYIDEHMQALCGVFVISEYLDETYGAIAQK 81 (230) Q Consensus 2 ~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~~~~~~~~~lnP~g~vP~L~~~dg~~l~eS~aI~~yL~~~~~~~~~~ 81 (230) ++|++++....+.-+|++++..|++||-+.+...+.-+......|.|++|+|.. ||..|.+|.||++||+.+|+ T Consensus 4 ykL~Yf~~RG~ae~iR~lf~~a~v~fEd~r~~~~~~w~~~K~~~pfgqlP~l~v-Dg~~i~QS~AI~RyLArk~g----- 77 (206) T KOG1695 4 YKLTYFNIRGLAEPIRLLFAYAGVSFEDKRITMEDAWEELKDKMPFGQLPVLEV-DGKKLVQSRAILRYLARKFG----- 77 (206) T ss_pred EEEEECCCCHHHHHHHHHHHHCCCCCCEEEECCCCCHHHHCCCCCCCCCCEEEE-CCEEECCHHHHHHHHHHHHC----- T ss_conf 278840740108999999996599810022032330332133587888877747-99850319999999998718----- Q ss_pred CCCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHH-HHHHHHHHHHHHHHHHHC--C Q ss_conf 3356799999999999999986123425778999988776642122102334399999-999999999855654534--7 Q gi|254780594|r 82 NRLLSTDPLQRAETRRMIEWFMYQMEQDVTRPLAHERVYKLHMTTEQGGGSPDSKVLR-IARNNMREHIKYITWLIK--S 158 (230) Q Consensus 82 ~~L~p~~~~~ra~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~-~~~~~l~~~l~~le~~l~--~ 158 (230) |.++|+.++++++.+.+.+......-... .......+... ..++ .........+..++..|+ + T Consensus 78 --l~Gkt~~E~a~vD~i~d~~~D~~~~~~~~---------~~~~~~~g~~~---~~~~~~~~Pa~~~~~~~~~~~L~~~~ 143 (206) T KOG1695 78 --LAGKTEEEEAWVDMIVDQFKDFRWEIFRQ---------PYTAPEAGKSE---EELDKLYLPAKPKYFKILEKILKKNK 143 (206) T ss_pred --CCCCCHHHHHHHHHHHHHHHHHHHHHHHH---------HHHHHHHCCCH---HHHHHHHCCCHHHHHHHHHHHHHHCC T ss_conf --68898678999999998600689988876---------20225305546---65554525336889999999997579 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHHH-CCCCCHHHCHHHHHHHHHHHCCHHHHHHHHHH Q ss_conf 9842667502899999999999996-18999456768899999997099889841333 Q gi|254780594|r 159 RSWIAGHHMSYADFAASATISILDY-LGEIDWDTAPIVKEWYQRMKSRPSFRPLLSER 215 (230) Q Consensus 159 ~~fl~Gd~~s~ADi~~~~~l~~~~~-~~~~~~~~~p~l~~w~~r~~~rps~~~~l~e~ 215 (230) +.||+||++|.||+.++..+..+.. +.....+.+|+|+++.+|+.++|..++.+.-+ T Consensus 144 sgflvGd~lT~aDl~i~e~l~~l~~~~~~~~~~~~P~L~a~~~kv~~~p~ik~~i~~r 201 (206) T KOG1695 144 SGFLVGDKLTWADLVIAEHLDTLEELLDPSALDHFPKLKAFKERVSSIPNIKKYLESR 201 (206) T ss_pred CCEEECCCCCHHHHHHHHHHHHHHHHCCCHHHCCCHHHHHHHHHHHCCCHHHHHHHCC T ss_conf 8766768764999999989999997538010011768999999996690579998418 No 16 >cd03058 GST_N_Tau GST_N family, Class Tau subfamily; GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins and products of oxidative stress. The GST fold contains an N-terminal TRX-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. The plant-specific class Tau GST subfamily has undergone extensive gene duplication. The Arabidopsis and Oryza genomes contain 28 and 40 Tau GSTs, respectively. They are primarily responsible for herbicide detoxification together with class Phi GSTs, showing class specificity in substrate preference. Tau enzymes are highly efficient in detoxifying diphenylether and aryloxyphenoxypropionate herbicides. In addition, Tau GSTs play important roles in intracellular signalling, biosynthesis of anthocyanin, Probab=99.86 E-value=3.5e-22 Score=146.64 Aligned_cols=73 Identities=25% Similarity=0.333 Sum_probs=68.8 Q ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHCCC-CCCCHHHHCCHHHHCCCCHHHHHHHHHC Q ss_conf 5213589986899999999986998299988867778899985568-7752011100011101210356777528 Q gi|254780594|r 2 SILYHYPLSSSSRFIRLILSEYEFTTDMIEEFPWGKRREFLELNPS-GTLPVYIDEHMQALCGVFVISEYLDETY 75 (230) Q Consensus 2 ~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~~~~~~~~~lnP~-g~vP~L~~~dg~~l~eS~aI~~yL~~~~ 75 (230) ++||++..||||+|||++|+||||+||.+.+++++++++|.++||. |+||+|++ ||.+|+||.+|++|||++| T Consensus 1 mkLyg~~~Sp~~~rvriaL~eKgi~ye~~~~~~~~~~~~~~~~nP~~gkVPvL~~-~~~~l~eS~~I~eYLde~f 74 (74) T cd03058 1 VKLLGAWASPFVLRVRIALALKGVPYEYVEEDLGNKSELLLASNPVHKKIPVLLH-NGKPICESLIIVEYIDEAW 74 (74) T ss_pred CEEEECCCCHHHHHHHHHHHHCCCCCEEEEECCCCCCHHHHHHCCCCCCCCEEEE-CCEEEECHHHHHHHHHCCC T ss_conf 9785247885899999999986997389975645587789965978899999998-9999635999999987319 No 17 >cd03055 GST_N_Omega GST_N family, Class Omega subfamily; GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins and products of oxidative stress. The GST fold contains an N-terminal TRX-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. Class Omega GSTs show little or no GSH-conjugating activity towards standard GST substrates. Instead, they catalyze the GSH dependent reduction of protein disulfides, dehydroascorbate and monomethylarsonate, activities which are more characteristic of glutaredoxins. They contain a conserved cysteine equivalent to the first cysteine in the CXXC motif of glutaredoxins, which is a redox active residue capable of reducing GSH mixed disulfides in a monothiol mechanism. Polymorphisms of the class Omega Probab=99.86 E-value=3.1e-22 Score=146.95 Aligned_cols=72 Identities=26% Similarity=0.389 Sum_probs=69.4 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHCCCCCCCHHHHCCHHHHCCCCHHHHHHH Q ss_conf 952135899868999999999869982999888677788999855687752011100011101210356777 Q gi|254780594|r 1 MSILYHYPLSSSSRFIRLILSEYEFTTDMIEEFPWGKRREFLELNPSGTLPVYIDEHMQALCGVFVISEYLD 72 (230) Q Consensus 1 M~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~~~~~~~~~lnP~g~vP~L~~~dg~~l~eS~aI~~yL~ 72 (230) |++||+++.||||+|||++|++|||+|+++.++..+++++|+++||.|+||+|+.+||.+|+||.+|++||| T Consensus 18 ~mkLY~~~~cPfa~rVri~L~eKgi~~e~v~v~l~~kpe~~l~~NP~gkVP~L~~d~g~~i~ES~~I~eYLD 89 (89) T cd03055 18 IIRLYSMRFCPYAQRARLVLAAKNIPHEVININLKDKPDWFLEKNPQGKVPALEIDEGKVVYESLIICEYLD 89 (89) T ss_pred EEEEECCCCCHHHHHHHHHHHHCCCCCEEEEECCCCCCHHHHHHCCCCCCCEEEECCCEEEECHHHHHHHHC T ss_conf 489972699919999999999839987899957865887889758799866699479989955889998649 No 18 >cd03052 GST_N_GDAP1 GST_N family, Ganglioside-induced differentiation-associated protein 1 (GDAP1) subfamily; GDAP1 was originally identified as a highly expressed gene at the differentiated stage of GD3 synthase-transfected cells. More recently, mutations in GDAP1 have been reported to cause both axonal and demyelinating autosomal-recessive Charcot-Marie-Tooth (CMT) type 4A neuropathy. CMT is characterized by slow and progressive weakness and atrophy of muscles. Sequence analysis of GDAP1 shows similarities and differences with GSTs; it appears to contain both N-terminal TRX-fold and C-terminal alpha helical domains of GSTs, however, it also contains additional C-terminal transmembrane domains unlike GSTs. GDAP1 is mainly expressed in neuronal cells and is localized in the mitochondria through its transmembrane domains. It does not exhibit GST activity using standard substrates. Probab=99.83 E-value=2.5e-21 Score=141.73 Aligned_cols=70 Identities=30% Similarity=0.497 Sum_probs=64.5 Q ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCC---CCCHHHHHHCCCCCCCHHHHCCHHHHCCCCHHHHHHH Q ss_conf 5213589986899999999986998299988867---7788999855687752011100011101210356777 Q gi|254780594|r 2 SILYHYPLSSSSRFIRLILSEYEFTTDMIEEFPW---GKRREFLELNPSGTLPVYIDEHMQALCGVFVISEYLD 72 (230) Q Consensus 2 ~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~---~~~~~~~~lnP~g~vP~L~~~dg~~l~eS~aI~~yL~ 72 (230) ++|||++.|++|+|||++|+|||++|+.+.++.. +.+|+|+++||.|+||+|++ ||.+|+||.+|++||| T Consensus 1 L~LYh~~~S~~s~kVrl~L~EKgl~~e~~~vdl~~~e~~~p~fl~lNP~G~VPvLv~-~~~vi~eS~~I~eYLe 73 (73) T cd03052 1 LVLYHWTQSFSSQKVRLVIAEKGLRCEEYDVSLPLSEHNEPWFMRLNPTGEVPVLIH-GDNIICDPTQIIDYLE 73 (73) T ss_pred CEEECCCCCHHHHHHHHHHHHCCCCCEEEECCCCCHHHCCHHHHHHCCCCCCCEEEE-CCEEEECHHHHHHHHC T ss_conf 987628998467999999996799777997667630027888995396998999978-9999878799998659 No 19 >cd03048 GST_N_Ure2p_like GST_N family, Ure2p-like subfamily; composed of the Saccharomyces cerevisiae Ure2p and related GSTs. Ure2p is a regulator for nitrogen catabolism in yeast. It represses the expression of several gene products involved in the use of poor nitrogen sources when rich sources are available. A transmissible conformational change of Ure2p results in a prion called [Ure3], an inactive, self-propagating and infectious amyloid. Ure2p displays a GST fold containing an N-terminal TRX-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. The N-terminal TRX-fold domain is sufficient to induce the [Ure3] phenotype and is also called the prion domain of Ure2p. In addition to its role in nitrogen regulation, Ure2p confers protection to cells against heavy metal ion and oxidant toxicity, and shows glutathione (GSH) peroxidase activity. Characterized GSTs in this subfamily include Aspergillus fumigatus GSTs 1 and 2, and Probab=99.83 E-value=3.4e-21 Score=141.01 Aligned_cols=75 Identities=25% Similarity=0.374 Sum_probs=67.8 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCC---CCCHHHHHHCCCCCCCHHHHC--CHHHHCCCCHHHHHHHHHC Q ss_conf 95213589986899999999986998299988867---778899985568775201110--0011101210356777528 Q gi|254780594|r 1 MSILYHYPLSSSSRFIRLILSEYEFTTDMIEEFPW---GKRREFLELNPSGTLPVYIDE--HMQALCGVFVISEYLDETY 75 (230) Q Consensus 1 M~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~---~~~~~~~~lnP~g~vP~L~~~--dg~~l~eS~aI~~yL~~~~ 75 (230) |++||+++ ||+|+||+++|+|+||+|+.+.++.. +++++|+++||.|+||+|+++ +|.+|+||.||++||+++| T Consensus 1 MikLY~~~-~~~~~rv~~~L~e~gi~~e~~~v~~~~ge~~~~efl~iNP~g~VP~L~d~~~~~~~l~ES~AI~~YL~~~y 79 (81) T cd03048 1 MITLYTHG-TPNGFKVSIMLEELGLPYEIHPVDISKGEQKKPEFLKINPNGRIPAIVDHNGTPLTVFESGAILLYLAEKY 79 (81) T ss_pred CEEEECCC-CCCHHHHHHHHHHCCCCCEEEEECCCCCCCCCHHHHHHCCCCCEEEEEECCCCCEEEECHHHHHHHHHHHH T ss_conf 94994399-98689999999985999889997899875379899955959986389955999989872999999999975 Q ss_pred C Q ss_conf 8 Q gi|254780594|r 76 G 76 (230) Q Consensus 76 ~ 76 (230) + T Consensus 80 ~ 80 (81) T cd03048 80 D 80 (81) T ss_pred C T ss_conf 9 No 20 >cd03051 GST_N_GTT2_like GST_N family, Saccharomyces cerevisiae GTT2-like subfamily; composed of predominantly uncharacterized proteins with similarity to the S. cerevisiae GST protein, GTT2. GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins and products of oxidative stress. GSTs also show GSH peroxidase activity and are involved in the synthesis of prostaglandins and leukotrienes. The GST fold contains an N-terminal TRX-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. GTT2, a homodimer, exhibits GST activity with standard substrates. Strains with deleted GTT2 genes are viable but exhibit increased sensitivity to heat shock. Probab=99.83 E-value=3.8e-21 Score=140.70 Aligned_cols=71 Identities=27% Similarity=0.418 Sum_probs=66.1 Q ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCC---CCCHHHHHHCCCCCCCHHHHCCHHHHCCCCHHHHHHH Q ss_conf 5213589986899999999986998299988867---7788999855687752011100011101210356777 Q gi|254780594|r 2 SILYHYPLSSSSRFIRLILSEYEFTTDMIEEFPW---GKRREFLELNPSGTLPVYIDEHMQALCGVFVISEYLD 72 (230) Q Consensus 2 ~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~---~~~~~~~~lnP~g~vP~L~~~dg~~l~eS~aI~~yL~ 72 (230) ++||+++.||||+|||++|.|||++|+.+.++.. +++++|+++||.|+||+|+++||.+|+||.+|++||| T Consensus 1 mkLy~~~~Sp~~~rvri~L~ekgl~~e~~~v~~~~~~~~~~~~~~~NP~g~vP~L~~ddg~~l~ES~aI~~YLd 74 (74) T cd03051 1 MKLYDSPTAPNPRRVRIFLAEKGIDVPLVTVDLAAGEQRSPEFLAKNPAGTVPVLELDDGTVITESVAICRYLE 74 (74) T ss_pred CEEEECCCCHHHHHHHHHHHHHCCCCEEEEEECCCCCCCCHHHHHHCCCCCCCEEEECCCCEEECHHHHHHHHC T ss_conf 98982799927999999999969998699975777877898999779898646799789979974899998629 No 21 >cd03049 GST_N_3 GST_N family, unknown subfamily 3; composed of uncharacterized bacterial proteins with similarity to GSTs. GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins and products of oxidative stress. GSTs also show GSH peroxidase activity and are involved in the synthesis of prostaglandins and leukotrienes. The GST fold contains an N-terminal TRX-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. Probab=99.83 E-value=5.7e-21 Score=139.70 Aligned_cols=71 Identities=30% Similarity=0.478 Sum_probs=66.4 Q ss_pred CCCCCCCCCHHHHHHHHHHHHCCC--CEEEEECCCCCCCHHHHHHCCCCCCCHHHHCCHHHHCCCCHHHHHHH Q ss_conf 521358998689999999998699--82999888677788999855687752011100011101210356777 Q gi|254780594|r 2 SILYHYPLSSSSRFIRLILSEYEF--TTDMIEEFPWGKRREFLELNPSGTLPVYIDEHMQALCGVFVISEYLD 72 (230) Q Consensus 2 ~~LY~~~~sp~~~kvr~~L~e~gi--~y~~~~~~~~~~~~~~~~lnP~g~vP~L~~~dg~~l~eS~aI~~yL~ 72 (230) |+||+.++|||+||||++|.|||+ +++.+.++.++++++|.++||.|+||+|+++||.+|+||.+||+||| T Consensus 1 MkLy~~~~SP~~rkvri~l~ekgl~~~~e~v~~~~~~~~~~~~~~NP~gkVP~L~~ddG~~l~eS~~I~eYLd 73 (73) T cd03049 1 MKLLYSPTSPYVRKVRVAAHETGLGDDVELVLVNPWSDDESLLAVNPLGKIPALVLDDGEALFDSRVICEYLD 73 (73) T ss_pred CEEECCCCCCHHHHHHHHHHHHCCCCCEEEEECCCCCCCHHHHHHCCCCCCCEEEECCCCEEECHHHHHHHHC T ss_conf 9882699991899999999981999866873247777878999869898899699689988858999996639 No 22 >cd03187 GST_C_Phi GST_C family, Class Phi subfamily; composed of plant-specific class Phi GSTs and related fungal and bacterial proteins. GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins, and products of oxidative stress. The GST fold contains an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. GSH binds to the N-terminal domain while the hydrophobic substrate occupies a pocket in the C-terminal domain. The class Phi GST subfamily has experience extensive gene duplication. The Arabidopsis and Oryza genomes contain 13 and 16 Tau GSTs, respectively. They are primarily responsible for herbicide detoxification together with class Tau GSTs, showing class specificity in substrate preference. Phi enzymes a Probab=99.83 E-value=1.6e-19 Score=131.40 Aligned_cols=117 Identities=21% Similarity=0.336 Sum_probs=96.1 Q ss_pred HHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCH Q ss_conf 99999999999986123425778999988776642122102334399999999999999855654534798426675028 Q gi|254780594|r 90 LQRAETRRMIEWFMYQMEQDVTRPLAHERVYKLHMTTEQGGGSPDSKVLRIARNNMREHIKYITWLIKSRSWIAGHHMSY 169 (230) Q Consensus 90 ~~ra~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~l~~le~~l~~~~fl~Gd~~s~ 169 (230) .+||++.+|++|..+.+.+.+.. +..+.+.++. .+..+|...++.+.+.+.+.++.+|++|++++||+||++|+ T Consensus 1 ~erA~vdqWl~w~~~~~~p~~~~-~~~~~~~~p~-----~g~~~D~~~i~~~~~~l~~~l~vle~~L~~~~yl~Gd~fTl 74 (118) T cd03187 1 KERAIVEQWLEVESHQFDPPASA-LAFELVFKPM-----LGLPTDEAVVEENEEKLKKVLDVYEARLSKSKYLAGDSFTL 74 (118) T ss_pred CCHHHHHHHHHHHHHCCCHHHHH-HHHHHHHCCC-----CCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCEECCCCCCH T ss_conf 90999999999998367778999-9999960664-----68998899999999999999999999874098748888688 Q ss_pred HHHHHHHHHHHHHHCCC-CCHHHCHHHHHHHHHHHCCHHHHHHH Q ss_conf 99999999999996189-99456768899999997099889841 Q gi|254780594|r 170 ADFAASATISILDYLGE-IDWDTAPIVKEWYQRMKSRPSFRPLL 212 (230) Q Consensus 170 ADi~~~~~l~~~~~~~~-~~~~~~p~l~~w~~r~~~rps~~~~l 212 (230) |||++++.+.++...+. ..++++|+|.+|++|+.+|||||+++ T Consensus 75 ADi~~~~~~~~~~~~~~~~~~~~~P~v~~W~~~l~~RPa~kkv~ 118 (118) T cd03187 75 ADLSHLPYLQYLMATPFAKLFDSRPHVKAWWEDISARPAWKKVL 118 (118) T ss_pred HHHHHHHHHHHHHHCCCHHHHHCCHHHHHHHHHHHCCHHHHHHC T ss_conf 99988899999983871322111879999999987799889779 No 23 >cd03053 GST_N_Phi GST_N family, Class Phi subfamily; composed of plant-specific class Phi GSTs and related fungal and bacterial proteins. GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins and products of oxidative stress. The GST fold contains an N-terminal TRX-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. The class Phi GST subfamily has experience extensive gene duplication. The Arabidopsis and Oryza genomes contain 13 and 16 Phi GSTs, respectively. They are primarily responsible for herbicide detoxification together with class Tau GSTs, showing class specificity in substrate preference. Phi enzymes are highly reactive toward chloroacetanilide and thiocarbamate herbicides. Some Phi GSTs have other functions including t Probab=99.82 E-value=6.7e-21 Score=139.29 Aligned_cols=72 Identities=28% Similarity=0.283 Sum_probs=66.2 Q ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCC---CCCHHHHHHCCCCCCCHHHHCCHHHHCCCCHHHHHHHHH Q ss_conf 5213589986899999999986998299988867---778899985568775201110001110121035677752 Q gi|254780594|r 2 SILYHYPLSSSSRFIRLILSEYEFTTDMIEEFPW---GKRREFLELNPSGTLPVYIDEHMQALCGVFVISEYLDET 74 (230) Q Consensus 2 ~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~---~~~~~~~~lnP~g~vP~L~~~dg~~l~eS~aI~~yL~~~ 74 (230) ++||+++.||+|+|||++|.|+||+||.+.++.. +++++|+++||.|+||+|++ ||.+|+||.+|++||+++ T Consensus 2 mkLy~~~~S~~~~~v~~~L~e~gi~~e~~~v~~~~~e~~~~~~~~iNP~g~VP~L~d-~~~~l~ES~aI~~YL~ek 76 (76) T cd03053 2 LKLYGAAMSTCVRRVLLCLEEKGVDYELVPVDLTKGEHKSPEHLARNPFGQIPALED-GDLKLFESRAITRYLAEK 76 (76) T ss_pred EEEECCCCCHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHCCCCCCCEEEE-CCEEEECHHHHHHHHHCC T ss_conf 899669999689999999999389987999578766658999997583677568998-997897689999998539 No 24 >cd03057 GST_N_Beta GST_N family, Class Beta subfamily; GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins and products of oxidative stress. The GST fold contains an N-terminal TRX-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. Unlike mammalian GSTs which detoxify a broad range of compounds, the bacterial class Beta GSTs exhibit limited GSH conjugating activity with a narrow range of substrates. In addition to GSH conjugation, they also bind antibiotics and reduce the antimicrobial activity of beta-lactam drugs. The structure of the Proteus mirabilis enzyme reveals that the cysteine in the active site forms a covalent bond with GSH. Probab=99.82 E-value=1.5e-20 Score=137.31 Aligned_cols=74 Identities=22% Similarity=0.305 Sum_probs=67.5 Q ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCC---CCCHHHHHHCCCCCCCHHHHCCHHHHCCCCHHHHHHHHHCC Q ss_conf 5213589986899999999986998299988867---77889998556877520111000111012103567775288 Q gi|254780594|r 2 SILYHYPLSSSSRFIRLILSEYEFTTDMIEEFPW---GKRREFLELNPSGTLPVYIDEHMQALCGVFVISEYLDETYG 76 (230) Q Consensus 2 ~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~---~~~~~~~~lnP~g~vP~L~~~dg~~l~eS~aI~~yL~~~~~ 76 (230) ++||++|.| .++|||++|+|+|++|+.+.++.. +++++|+++||.|+||+|+++||.+|+||.||++||+++|| T Consensus 1 mkLY~~p~s-~s~~v~~~L~e~gi~~e~~~v~~~~~~~~~~~~~~inP~g~VP~L~~~dg~~l~ES~AI~~yL~e~~P 77 (77) T cd03057 1 MKLYYSPGA-CSLAPHIALEELGLPFELVRVDLRTKTQKGADYLAINPKGQVPALVLDDGEVLTESAAILQYLADLHP 77 (77) T ss_pred CEEECCCCC-HHHHHHHHHHHCCCCCEEEEECCCCCCCCCHHHHHHCCCCCCCEEEECCCCEEECHHHHHHHHHHHCC T ss_conf 988617996-19999999998299988999789887658987997697988888997999899859999999987394 No 25 >cd03050 GST_N_Theta GST_N family, Class Theta subfamily; composed of eukaryotic class Theta GSTs and bacterial dichloromethane (DCM) dehalogenase. GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins and products of oxidative stress. The GST fold contains an N-terminal TRX-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. Mammalian class Theta GSTs show poor GSH conjugating activity towards the standard substrates, CDNB and ethacrynic acid, differentiating them from other mammalian GSTs. GSTT1-1 shows similar cataytic activity as bacterial DCM dehalogenase, catalyzing the GSH-dependent hydrolytic dehalogenation of dihalomethanes. This is an essential process in methylotrophic bacteria to enable them to use chloromethane and DC Probab=99.81 E-value=2.2e-20 Score=136.28 Aligned_cols=73 Identities=23% Similarity=0.274 Sum_probs=67.0 Q ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCC---CCCHHHHHHCCCCCCCHHHHCCHHHHCCCCHHHHHHHHHC Q ss_conf 5213589986899999999986998299988867---7788999855687752011100011101210356777528 Q gi|254780594|r 2 SILYHYPLSSSSRFIRLILSEYEFTTDMIEEFPW---GKRREFLELNPSGTLPVYIDEHMQALCGVFVISEYLDETY 75 (230) Q Consensus 2 ~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~---~~~~~~~~lnP~g~vP~L~~~dg~~l~eS~aI~~yL~~~~ 75 (230) ++||+++.|++||+|+++|+|+|++|+.+.++.. +++++|+++||.|+||+|+| ||.+|+||.||++||+++| T Consensus 1 ~kLY~~~~S~~~r~v~~~l~e~gi~ye~~~vdl~~ge~~~~~f~~iNP~g~VP~L~d-~~~~l~ES~AI~~YL~~~y 76 (76) T cd03050 1 LKLYYDLMSQPSRAVYIFLKLNKIPFEECPIDLRKGEQLTPEFKKINPFGKVPAIVD-GDFTLAESVAILRYLARKF 76 (76) T ss_pred CEEECCCCCHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHCCCCCCCEEEE-CCEEEECHHHHHHHHHHHC T ss_conf 988179998789999999999399988999768778768987997697988898997-9999861999999999669 No 26 >cd03042 GST_N_Zeta GST_N family, Class Zeta subfamily; GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins and products of oxidative stress. The GST fold contains an N-terminal TRX-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. Class Zeta GSTs, also known as maleylacetoacetate (MAA) isomerases, catalyze the isomerization of MAA to fumarylacetoacetate, the penultimate step in tyrosine/phenylalanine catabolism, using GSH as a cofactor. They show little GSH-conjugating activity towards traditional GST substrates but display modest GSH peroxidase activity. They are also implicated in the detoxification of the carcinogen dichloroacetic acid by catalyzing its dechlorination to glyoxylic acid. Probab=99.81 E-value=1.5e-20 Score=137.32 Aligned_cols=70 Identities=29% Similarity=0.356 Sum_probs=64.4 Q ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCCC---CCHHHHHHCCCCCCCHHHHCCHHHHCCCCHHHHHHH Q ss_conf 52135899868999999999869982999888677---788999855687752011100011101210356777 Q gi|254780594|r 2 SILYHYPLSSSSRFIRLILSEYEFTTDMIEEFPWG---KRREFLELNPSGTLPVYIDEHMQALCGVFVISEYLD 72 (230) Q Consensus 2 ~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~~---~~~~~~~lnP~g~vP~L~~~dg~~l~eS~aI~~yL~ 72 (230) ++||+++.||+|+|||++|+||||+|+.+.+++.. ++++|+++||.|+||+|++ ||.+|+||.+|++||| T Consensus 1 m~LY~~~~S~~~~rvrlaL~ekgi~~e~~~vd~~~ge~~~~~~~~iNP~g~VPvL~~-~~~~l~ES~aIl~YLd 73 (73) T cd03042 1 MILYSYFRSSASYRVRIALNLKGLDYEYVPVNLLKGEQLSPAYRALNPQGLVPTLVI-DGLVLTQSLAIIEYLD 73 (73) T ss_pred CEEECCCCCHHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHHCCCCCCCEEEE-CCEEEECHHHHHHHHC T ss_conf 988389998379999999998399687999758776230499998797988487988-9989836999999749 No 27 >KOG1422 consensus Probab=99.81 E-value=5.7e-19 Score=128.19 Aligned_cols=190 Identities=13% Similarity=0.124 Sum_probs=132.1 Q ss_pred CCCCHHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHCCCCCCCHHHHCCHHHHCCCCHHHHHHHHHCCCCCCCCCCCC Q ss_conf 89986899999999986998299988867778899985568775201110001110121035677752887233433567 Q gi|254780594|r 7 YPLSSSSRFIRLILSEYEFTTDMIEEFPWGKRREFLELNPSGTLPVYIDEHMQALCGVFVISEYLDETYGAIAQKNRLLS 86 (230) Q Consensus 7 ~~~sp~~~kvr~~L~e~gi~y~~~~~~~~~~~~~~~~lnP~g~vP~L~~~dg~~l~eS~aI~~yL~~~~~~~~~~~~L~p 86 (230) -.-||||+++.+.|.++|++|.+..++...++++|+++.|.|++|+|.. |+.+++||..|.+.|+++++.... +++ T Consensus 18 ~Gdcpf~qr~~m~L~~k~~~f~vttVd~~~kp~~f~~~sp~~~~P~l~~-d~~~~tDs~~Ie~~Lee~l~~p~~-~~~-- 93 (221) T KOG1422 18 LGDCPFCQRLFMTLELKGVPFKVTTVDLSRKPEWFLDISPGGKPPVLKF-DEKWVTDSDKIEEFLEEKLPPPKL-PTL-- 93 (221) T ss_pred CCCCHHHHHHHHHHHHCCCCCEEEEEECCCCCHHHHHHCCCCCCCEEEE-CCCEECCHHHHHHHHHHHCCCCCC-CCC-- T ss_conf 7897447899999998188734898616889699995288889986975-872420489999999985689987-666-- Q ss_pred CCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHCC---CCCCC Q ss_conf 999999999999999861234257789999887766421221023343999999999999998556545347---98426 Q gi|254780594|r 87 TDPLQRAETRRMIEWFMYQMEQDVTRPLAHERVYKLHMTTEQGGGSPDSKVLRIARNNMREHIKYITWLIKS---RSWIA 163 (230) Q Consensus 87 ~~~~~ra~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~l~~le~~l~~---~~fl~ 163 (230) ++.+.+.+- ..+. ........... ...-+.....+...+..|+..|+. ++||. T Consensus 94 -~~~E~asag---------------~diF-~kF~~fi~ksk-------~~~n~~~e~~Ll~~L~~Ld~yL~sp~~~~Fl~ 149 (221) T KOG1422 94 -APPESASAG---------------SDIF-AKFSAFIKKSK-------DAANDGLEKALLKELEKLDDYLKSPSRRKFLD 149 (221) T ss_pred -CCHHHHHHH---------------HHHH-HHHHHHHHCCH-------HHCCCHHHHHHHHHHHHHHHHHCCCCCCCCCC T ss_conf -877777567---------------8899-99999981763-------22001478999999999998835866786556 Q ss_pred CCCCCHHHHHHHHHHHHHH----HCCCCC-HHHCHHHHHHHHHHHCCHHHHHHH-HHHHCCCCCCCCCCCC Q ss_conf 6750289999999999999----618999-456768899999997099889841-3332189984324778 Q gi|254780594|r 164 GHHMSYADFAASATISILD----YLGEID-WDTAPIVKEWYQRMKSRPSFRPLL-SERIRGLLPVSHYTNL 228 (230) Q Consensus 164 Gd~~s~ADi~~~~~l~~~~----~~~~~~-~~~~p~l~~w~~r~~~rps~~~~l-~e~~~~~~p~~~~~~~ 228 (230) ||++|+|||.++|-|+.+. ++.... .+..+.+.+|++.+.+|.+|...- +|++-. .||++. T Consensus 150 Gd~lt~aDcsLlPKL~~i~va~k~yk~~~IP~~lt~V~rYl~~~ya~d~F~~tcp~d~ei~----~~y~~~ 216 (221) T KOG1422 150 GDKLTLADCSLLPKLHHIKVAAKHYKNFEIPASLTGVWRYLKNAYARDEFTNTCPADQEII----LAYAPV 216 (221) T ss_pred CCEEEEEHHHHCHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCHHHHHH----HHHHHH T ss_conf 7712342010010578899999774188783230699999998876887615795089999----753321 No 28 >cd03045 GST_N_Delta_Epsilon GST_N family, Class Delta and Epsilon subfamily; GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins and products of oxidative stress. GSTs also show GSH peroxidase activity and are involved in the synthesis of prostaglandins and leukotrienes. The GST fold contains an N-terminal TRX-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. The class Delta and Epsilon subfamily is made up primarily of insect GSTs, which play major roles in insecticide resistance by facilitating reductive dehydrochlorination of insecticides or conjugating them with GSH to produce water-soluble metabolites that are easily excreted. They are also implicated in protection against cellular damage by oxidative stress. Probab=99.80 E-value=3.4e-20 Score=135.28 Aligned_cols=71 Identities=30% Similarity=0.331 Sum_probs=64.8 Q ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCCC---CCHHHHHHCCCCCCCHHHHCCHHHHCCCCHHHHHHHH Q ss_conf 52135899868999999999869982999888677---7889998556877520111000111012103567775 Q gi|254780594|r 2 SILYHYPLSSSSRFIRLILSEYEFTTDMIEEFPWG---KRREFLELNPSGTLPVYIDEHMQALCGVFVISEYLDE 73 (230) Q Consensus 2 ~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~~---~~~~~~~lnP~g~vP~L~~~dg~~l~eS~aI~~yL~~ 73 (230) ++||+++.||+||||+++|.|+|++|+.+.++... ++++|+++||.|+||+|++ ||.+|+||.||++||++ T Consensus 1 mkLY~~~~S~~~r~v~~~l~e~~i~~e~~~vd~~~ge~~~~ef~~iNP~g~vP~L~d-~~~~l~ES~AI~~YLad 74 (74) T cd03045 1 IDLYYLPGSPPCRAVLLTAKALGLELNLKEVNLMKGEHLKPEFLKLNPQHTVPTLVD-NGFVLWESHAILIYLVE 74 (74) T ss_pred CEEECCCCCHHHHHHHHHHHHHCCCCEEEEECCCCCCCCHHHHHHHCCCCCCEEEEE-CCEEEEEHHHHHHHHCC T ss_conf 988789998759999999999599888999758788624299997387985008998-99699708999998569 No 29 >cd03056 GST_N_4 GST_N family, unknown subfamily 4; composed of uncharacterized bacterial proteins with similarity to GSTs. GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins and products of oxidative stress. GSTs also show GSH peroxidase activity and are involved in the synthesis of prostaglandins and leukotrienes. The GST fold contains an N-terminal TRX-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. Probab=99.80 E-value=4e-20 Score=134.81 Aligned_cols=70 Identities=31% Similarity=0.429 Sum_probs=64.3 Q ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCC---CCCHHHHHHCCCCCCCHHHHCCHHHHCCCCHHHHHHH Q ss_conf 5213589986899999999986998299988867---7788999855687752011100011101210356777 Q gi|254780594|r 2 SILYHYPLSSSSRFIRLILSEYEFTTDMIEEFPW---GKRREFLELNPSGTLPVYIDEHMQALCGVFVISEYLD 72 (230) Q Consensus 2 ~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~---~~~~~~~~lnP~g~vP~L~~~dg~~l~eS~aI~~yL~ 72 (230) ++||+++.||+|+|||++|+|+|++|+.+.++.. +++++|+++||.|+||+|++ ||.+|+||.||++||+ T Consensus 1 mkLY~~~~S~~~~rv~~~L~e~gl~~e~~~v~~~~~e~~~~~f~~iNP~g~VP~L~~-~g~~i~ES~aI~~YLA 73 (73) T cd03056 1 MKLYGFPLSGNCYKVRLLLALLGIPYEWVEVDILKGETRTPEFLALNPNGEVPVLEL-DGRVLAESNAILVYLA 73 (73) T ss_pred CEEECCCCCHHHHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHCCCCCCCEEEE-CCEEEECHHHHHHHHC T ss_conf 989558999039999999999299987999679898648999997786888488998-9989864899999749 No 30 >cd03047 GST_N_2 GST_N family, unknown subfamily 2; composed of uncharacterized bacterial proteins with similarity to GSTs. GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins and products of oxidative stress. GSTs also show GSH peroxidase activity and are involved in the synthesis of prostaglandins and leukotrienes. The GST fold contains an N-terminal TRX-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. The sequence from Burkholderia cepacia was identified as part of a gene cluster involved in the degradation of 2,4,5-trichlorophenoxyacetic acid. Some GSTs (e.g. Class Zeta and Delta) are known to catalyze dechlorination reactions. Probab=99.79 E-value=5.7e-20 Score=133.97 Aligned_cols=70 Identities=24% Similarity=0.263 Sum_probs=64.0 Q ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCC---CCCCHHHHHHCCCCCCCHHHHCCHHHHCCCCHHHHHHH Q ss_conf 521358998689999999998699829998886---77788999855687752011100011101210356777 Q gi|254780594|r 2 SILYHYPLSSSSRFIRLILSEYEFTTDMIEEFP---WGKRREFLELNPSGTLPVYIDEHMQALCGVFVISEYLD 72 (230) Q Consensus 2 ~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~---~~~~~~~~~lnP~g~vP~L~~~dg~~l~eS~aI~~yL~ 72 (230) ++||+.+.|++|+||+++|+|+|++|+.+.++. ..++++|+++||.||||+|+| |+.+|+||.||++||+ T Consensus 1 iklYg~~~S~~~rkV~~~l~e~gl~ye~~~v~~~~g~~~~p~f~~lnP~gkvP~l~d-g~~~l~ES~AI~~YLA 73 (73) T cd03047 1 LTIWGRRSSINVQKVLWLLDELGLPYERIDAGGQFGGLDTPEFLAMNPNGRVPVLED-GDFVLWESNAILRYLA 73 (73) T ss_pred CEEECCCCCCCHHHHHHHHHHCCCCCEEEECCCCCCCCCCHHHHHHCCCCCCCEEEE-CCEEEECHHHHHHHHC T ss_conf 989558999029999999998599988998078778768978997697997999998-9999987999999759 No 31 >KOG3029 consensus Probab=99.79 E-value=2.3e-18 Score=124.73 Aligned_cols=200 Identities=18% Similarity=0.285 Sum_probs=119.0 Q ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHCCCCCCCHHHHCCHHHHCCCCHHHHHHH--------- Q ss_conf 52135899868999999999869982999888677788999855687752011100011101210356777--------- Q gi|254780594|r 2 SILYHYPLSSSSRFIRLILSEYEFTTDMIEEFPWGKRREFLELNPSGTLPVYIDEHMQALCGVFVISEYLD--------- 72 (230) Q Consensus 2 ~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~~~~~~~~~lnP~g~vP~L~~~dg~~l~eS~aI~~yL~--------- 72 (230) .+||.|.+||||.|||.+|.+.||+|++|.+++..+++ .+.+-..|||+|.. +|+.+.||++|+.-|. T Consensus 91 l~LyQyetCPFCcKVrAFLDyhgisY~VVEVnpV~r~e--Ik~SsykKVPil~~-~Geqm~dSsvIIs~laTyLq~~~q~ 167 (370) T KOG3029 91 LVLYQYETCPFCCKVRAFLDYHGISYAVVEVNPVLRQE--IKWSSYKKVPILLI-RGEQMVDSSVIISLLATYLQDKRQD 167 (370) T ss_pred EEEEEECCCCHHHHHHHHHHHCCCCEEEEEECCHHHHH--CCCCCCCCCCEEEE-CCCEECHHHHHHHHHHHHHCCCCCC T ss_conf 79986146836888999886427832789844014643--13211453307996-3614020589999999985467888 Q ss_pred -----HHCCCCC----CC-------CCC-------CC-CCHHHHHHHHHHHHHHHHCC----CHHHHHHHHHHHHHHHHH Q ss_conf -----5288723----34-------335-------67-99999999999999986123----425778999988776642 Q gi|254780594|r 73 -----ETYGAIA----QK-------NRL-------LS-TDPLQRAETRRMIEWFMYQM----EQDVTRPLAHERVYKLHM 124 (230) Q Consensus 73 -----~~~~~~~----~~-------~~L-------~p-~~~~~ra~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~ 124 (230) +.||..+ .+ +++ -| .+...+-+-+.|-.|.++-+ .+.+... ..|.+..+.. T Consensus 168 l~eiiq~yPa~~~~ne~GK~v~~~~NKyflM~~e~d~~~~ke~~~eerkWR~WvDn~lVHLiSPNvYrn-~~EsletFew 246 (370) T KOG3029 168 LGEIIQMYPATSFFNEDGKEVNDILNKYFLMYREHDPGVSKETDEEERKWRSWVDNHLVHLISPNVYRN-MGESLETFEW 246 (370) T ss_pred HHHHHHHCCCCCCCCCCCCCHHHCCHHHEEEEECCCCCCCCCCHHHHHHHHHHHHHHHHHHCCCCCCCC-HHHHHHHHHH T ss_conf 899998666400013346304321313503420347775421037776899988655132228510067-3567788899 Q ss_pred HHHHCC---CCC-HHHH----------------------HHHHHHHHHHHH-HHHHHHHCCCCCCCCCCCCHHHHHHHHH Q ss_conf 122102---334-3999----------------------999999999998-5565453479842667502899999999 Q gi|254780594|r 125 TTEQGG---GSP-DSKV----------------------LRIARNNMREHI-KYITWLIKSRSWIAGHHMSYADFAASAT 177 (230) Q Consensus 125 ~~~~~~---~~~-~~~~----------------------~~~~~~~l~~~l-~~le~~l~~~~fl~Gd~~s~ADi~~~~~ 177 (230) ....+. ..+ .+.. +...++++..+. ++....-++++|+.|+++++||+++++. T Consensus 247 f~q~G~w~~~FpawEr~lavY~GAtAM~lisK~LKkkhni~D~Re~lydA~d~Wvaalgknr~flGG~kPnLaDLsvfGv 326 (370) T KOG3029 247 FSQAGEWDVHFPAWERDLAVYCGATAMYLISKMLKKKHNISDEREHLYDAADQWVAALGKNRPFLGGKKPNLADLSVFGV 326 (370) T ss_pred HHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHH T ss_conf 99738744567268989999962899999999987650551689999999999999857788766899986456666654 Q ss_pred HHHHHHCCCC-CHHHCHHHHHHHHHHHCC Q ss_conf 9999961899-945676889999999709 Q gi|254780594|r 178 ISILDYLGEI-DWDTAPIVKEWYQRMKSR 205 (230) Q Consensus 178 l~~~~~~~~~-~~~~~p~l~~w~~r~~~r 205 (230) |..+....-+ +.-...+|..|+.||+.. T Consensus 327 l~sm~gc~afkd~~q~t~I~eW~~rmeal 355 (370) T KOG3029 327 LRSMEGCQAFKDCLQNTSIGEWYYRMEAL 355 (370) T ss_pred HHHHHHHHHHHHHHHCCHHHHHHHHHHHH T ss_conf 56766566788987404188999999998 No 32 >cd03060 GST_N_Omega_like GST_N family, Omega-like subfamily; composed of uncharacterized proteins with similarity to class Omega GSTs. GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins and products of oxidative stress. The GST fold contains an N-terminal TRX-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. Class Omega GSTs show little or no GSH-conjugating activity towards standard GST substrates. Instead, they catalyze the GSH dependent reduction of protein disulfides, dehydroascorbate and monomethylarsonate, activities which are more characteristic of glutaredoxins. Like Omega enzymes, proteins in this subfamily contain a conserved cysteine equivalent to the first cysteine in the CXXC motif of glutaredoxins, which is a r Probab=99.79 E-value=8.2e-20 Score=133.05 Aligned_cols=70 Identities=23% Similarity=0.255 Sum_probs=67.2 Q ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHCCCCCCCHHHHCCHHHHCCCCHHHHHH Q ss_conf 5213589986899999999986998299988867778899985568775201110001110121035677 Q gi|254780594|r 2 SILYHYPLSSSSRFIRLILSEYEFTTDMIEEFPWGKRREFLELNPSGTLPVYIDEHMQALCGVFVISEYL 71 (230) Q Consensus 2 ~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~~~~~~~~~lnP~g~vP~L~~~dg~~l~eS~aI~~yL 71 (230) .+||++..||||+|||++|++||++||.+.++..+++++|+++||.|+||+|+.+||+++.||.+|++|- T Consensus 1 piLYs~~~CPya~R~rlaL~~k~i~~e~~~vdL~~kp~~~l~~nP~G~VPvL~~~dg~vi~ESl~I~~ya 70 (71) T cd03060 1 PILYSFRRCPYAMRARMALLLAGITVELREVELKNKPAEMLAASPKGTVPVLVLGNGTVIEESLDIMRWA 70 (71) T ss_pred CEEEECCCCCHHHHHHHHHHHCCCCEEEEEECCCCCCHHHHHHCCCCCCCEEEECCCEEEHHHHHHHHHH T ss_conf 9678238993799999999986997799983676798999955989860869948997851499999966 No 33 >cd03046 GST_N_GTT1_like GST_N family, Saccharomyces cerevisiae GTT1-like subfamily; composed of predominantly uncharacterized proteins with similarity to the S. cerevisiae GST protein, GTT1, and the Schizosaccharomyces pombe GST-III. GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins and products of oxidative stress. GSTs also show GSH peroxidase activity and are involved in the synthesis of prostaglandins and leukotrienes. The GST fold contains an N-terminal TRX-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. GTT1, a homodimer, exhibits GST activity with standard substrates and associates with the endoplasmic reticulum. Its expression is induced after diauxic shift and remains high throughout the stationary phase. S. pomb Probab=99.78 E-value=2.3e-19 Score=130.48 Aligned_cols=73 Identities=33% Similarity=0.434 Sum_probs=65.9 Q ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCC---CCCHHHHHHCCCCCCCHHHHCCHHHHCCCCHHHHHHHHHCC Q ss_conf 5213589986899999999986998299988867---77889998556877520111000111012103567775288 Q gi|254780594|r 2 SILYHYPLSSSSRFIRLILSEYEFTTDMIEEFPW---GKRREFLELNPSGTLPVYIDEHMQALCGVFVISEYLDETYG 76 (230) Q Consensus 2 ~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~---~~~~~~~~lnP~g~vP~L~~~dg~~l~eS~aI~~yL~~~~~ 76 (230) ++||+.|.| .|+|||++|+|+|++|+.+.++.. .++++|+++||.|+||+|++ ||.+|+||.||++||+++|+ T Consensus 1 m~LY~~p~s-~s~rv~~~L~e~~i~ye~~~v~~~~~~~~~~~~~~iNP~g~vP~L~~-dg~~l~ES~aI~~YL~e~yG 76 (76) T cd03046 1 ITLYHLPRS-RSFRILWLLEELGLPYELVLYDRGPGEQAPPEYLAINPLGKVPVLVD-GDLVLTESAAIILYLAEKYG 76 (76) T ss_pred CEEECCCCC-CHHHHHHHHHHCCCCCEEEEECCCCCCCCCHHHHHHCCCCCCCEEEE-CCEEEECHHHHHHHHHHHCC T ss_conf 989668998-59999999998099988999779998758999996595789888998-99899879999999998519 No 34 >cd03188 GST_C_Beta GST_C family, Class Beta subfamily; GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins, and products of oxidative stress. The GST fold contains an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. GSH binds to the N-terminal domain while the hydrophobic substrate occupies a pocket in the C-terminal domain. Unlike mammalian GSTs which detoxify a broad range of compounds, the bacterial class Beta GSTs exhibit limited GSH conjugating activity with a narrow range of substrates. In addition to GSH conjugation, they also bind antibiotics and reduce the antimicrobial activity of beta-lactam drugs. The structure of the Proteus mirabilis enzyme reveals that the cysteine in the active site for Probab=99.77 E-value=4.6e-18 Score=123.02 Aligned_cols=113 Identities=16% Similarity=0.263 Sum_probs=91.8 Q ss_pred HHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHH Q ss_conf 99999999999861234257789999887766421221023343999999999999998556545347984266750289 Q gi|254780594|r 91 QRAETRRMIEWFMYQMEQDVTRPLAHERVYKLHMTTEQGGGSPDSKVLRIARNNMREHIKYITWLIKSRSWIAGHHMSYA 170 (230) Q Consensus 91 ~ra~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~l~~le~~l~~~~fl~Gd~~s~A 170 (230) +||++.+|+.|..+.+++.........+.... ...+....+.+++++.+.++.+|.+|++++|++||++|+| T Consensus 2 eRA~~~qWl~f~~s~l~~~~~~~~~~~~~~~~--------~~~~~~~~~~~~~~~~~~~~~le~~L~~~~~l~G~~~t~a 73 (114) T cd03188 2 ERARLLEWLNFLSSELHKAFGPLFYPARWATD--------EAAQEEVKAAARERLAARLAYLDAQLAGGPYLLGDRFSVA 73 (114) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCC--------CCCHHHHHHHHHHHHHHHHHHHHHHHHHCCEECCCCCCHH T ss_conf 47999999999984237999999821003796--------0030999999999999999999998710982078887789 Q ss_pred HHHHHHHHHHHHHCCCCCHHHCHHHHHHHHHHHCCHHHHHHH Q ss_conf 999999999999618999456768899999997099889841 Q gi|254780594|r 171 DFAASATISILDYLGEIDWDTAPIVKEWYQRMKSRPSFRPLL 212 (230) Q Consensus 171 Di~~~~~l~~~~~~~~~~~~~~p~l~~w~~r~~~rps~~~~l 212 (230) ||++++.+.+....+ .+.+++|++.+|++||.+||+||+++ T Consensus 74 Di~l~~~~~w~~~~~-~~~~~~P~~~a~~~r~~~rPavq~al 114 (114) T cd03188 74 DAYLFVVLRWAPGVG-LDLSDWPNLAAYLARVAARPAVQAAL 114 (114) T ss_pred HHHHHHHHHHHHHCC-CCCCCCHHHHHHHHHHHHCHHHHHHC T ss_conf 999999999998748-86321899999999998398989779 No 35 >cd03181 GST_C_EFB1gamma GST_C family, Gamma subunit of Elongation Factor 1B (EFB1gamma) subfamily; EF1Bgamma is part of the eukaryotic translation elongation factor-1 (EF1) complex which plays a central role in the elongation cycle during protein biosynthesis. EF1 consists of two functionally distinct units, EF1A and EF1B. EF1A catalyzes the GTP-dependent binding of aminoacyl-tRNA to the ribosomal A site concomitant with the hydrolysis of GTP. The resulting inactive EF1A:GDP complex is recycled to the active GTP form by the guanine-nucleotide exchange factor EF1B, a complex composed of at least two subunits, alpha and gamma. Metazoan EFB1 contain a third subunit, beta. The EF1B gamma subunit contains a GST fold consisting of an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain. The GST-like domain of EF1Bgamma is believed to mediate the dimerization of the EF1 complex, which in yeast is a dimer of the heterotrimer EF1A:EF1Balpha:EF1Bgamma. In addition to its role Probab=99.77 E-value=5.3e-18 Score=122.67 Aligned_cols=115 Identities=13% Similarity=0.228 Sum_probs=92.4 Q ss_pred HHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHH Q ss_conf 99999999999861234257789999887766421221023343999999999999998556545347984266750289 Q gi|254780594|r 91 QRAETRRMIEWFMYQMEQDVTRPLAHERVYKLHMTTEQGGGSPDSKVLRIARNNMREHIKYITWLIKSRSWIAGHHMSYA 170 (230) Q Consensus 91 ~ra~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~l~~le~~l~~~~fl~Gd~~s~A 170 (230) +||++.+|+.|..+.+.+.+...... .+ +....+...++.+.+++...++.+|++|++++||+||++|+| T Consensus 1 EraqV~qWl~~~~~~l~p~~~~~~~~-~~---------~~~~~~~~~~~~~~~~~~~~l~~le~~L~~~~yl~Gd~~TlA 70 (123) T cd03181 1 EEAQVLQWVSFANTELLPAVAAWFLP-LL---------GIAPYNKKSVEAALEELDRVLGVLEERLLKRTYLVGERLTLA 70 (123) T ss_pred CHHHHHHHHHHHHCCCCHHHHHHHHH-HC---------CCCCCCHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCHH T ss_conf 99899999999967881999999999-82---------788788699999999999999999999823985167876599 Q ss_pred HHHHHHHHHHHHHC--CCCCHHHCHHHHHHHHHHHCCHHHHHHHHHH Q ss_conf 99999999999961--8999456768899999997099889841333 Q gi|254780594|r 171 DFAASATISILDYL--GEIDWDTAPIVKEWYQRMKSRPSFRPLLSER 215 (230) Q Consensus 171 Di~~~~~l~~~~~~--~~~~~~~~p~l~~w~~r~~~rps~~~~l~e~ 215 (230) ||++++.+.+.... +....++||+|.+|++|+.+||+||++++|- T Consensus 71 Di~~~~~l~~~~~~~~~~~~~~~~Pnl~rw~~~i~~rP~~k~~~~~~ 117 (123) T cd03181 71 DIFVAGALLLGFTYVFDKEWRAKYPNVTRWFNTVVNQPIFKAVFGEV 117 (123) T ss_pred HHHHHHHHHHHHHHCCCHHHHHCCHHHHHHHHHHHCCHHHHHHHCCC T ss_conf 99999999999996278876612779999999998399999996798 No 36 >cd03178 GST_C_Ure2p_like GST_C family, Ure2p-like subfamily; composed of the Saccharomyces cerevisiae Ure2p and related GSTs. Ure2p is a regulator for nitrogen catabolism in yeast. It represses the expression of several gene products involved in the use of poor nitrogen sources when rich sources are available. A transmissible conformational change of Ure2p results in a prion called [Ure3], an inactive, self-propagating and infectious amyloid. Ure2p displays a GST fold containing an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain. The N-terminal thioredoxin-fold domain is sufficient to induce the [Ure3] phenotype and is also called the prion domain of Ure2p. In addition to its role in nitrogen regulation, Ure2p confers protection to cells against heavy metal ion and oxidant toxicity, and shows glutathione (GSH) peroxidase activity. GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of GSH with a wide range of en Probab=99.77 E-value=7.6e-18 Score=121.74 Aligned_cols=113 Identities=20% Similarity=0.267 Sum_probs=91.1 Q ss_pred HHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHH Q ss_conf 99999999999861234257789999887766421221023343999999999999998556545347984266750289 Q gi|254780594|r 91 QRAETRRMIEWFMYQMEQDVTRPLAHERVYKLHMTTEQGGGSPDSKVLRIARNNMREHIKYITWLIKSRSWIAGHHMSYA 170 (230) Q Consensus 91 ~ra~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~l~~le~~l~~~~fl~Gd~~s~A 170 (230) +|+++.+|+.|..+.+.+..+...... . ... ..+...++...+++.+.+..+|.+|++++|++||++|+| T Consensus 1 ERa~v~~Wl~wq~s~l~P~~g~~~~~~----~-~~p-----~~~p~~i~r~~~e~~r~~~vLd~~L~~~~yl~Gd~~TiA 70 (113) T cd03178 1 ERYEVLQWLFFQMGGLGPMFGQAGHFS----R-YAP-----EKIPYAIERYTNEAKRLYGVLDKRLAGRDYLAGDEYSIA 70 (113) T ss_pred CHHHHHHHHHHHHHCCCHHHHHHHHHH----C-CCC-----CCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHH T ss_conf 989999999999845667799998723----1-389-----998899999999999999999998750987467998899 Q ss_pred HHHHHHHHHHHHHCCCCCHHHCHHHHHHHHHHHCCHHHHHHHH Q ss_conf 9999999999996189994567688999999970998898413 Q gi|254780594|r 171 DFAASATISILDYLGEIDWDTAPIVKEWYQRMKSRPSFRPLLS 213 (230) Q Consensus 171 Di~~~~~l~~~~~~~~~~~~~~p~l~~w~~r~~~rps~~~~l~ 213 (230) ||++++++......+..+.+++|+|.+|++|+.+||+||++++ T Consensus 71 Di~~~~~~~~~~~~~~~~~~~~p~l~~W~~r~~~RPa~qkala 113 (113) T cd03178 71 DIAIFPWVRRLEWIGIDDLDDFPNVKRWLDRIAARPAVQRGLA 113 (113) T ss_pred HHHHHHHHHHHHHCCCCCHHHCHHHHHHHHHHHHCHHHHHHHC T ss_conf 9999999999997386431027799999999982999999729 No 37 >cd03044 GST_N_EF1Bgamma GST_N family, Gamma subunit of Elongation Factor 1B (EFB1gamma) subfamily; EF1Bgamma is part of the eukaryotic translation elongation factor-1 (EF1) complex which plays a central role in the elongation cycle during protein biosynthesis. EF1 consists of two functionally distinct units, EF1A and EF1B. EF1A catalyzes the GTP-dependent binding of aminoacyl-tRNA to the ribosomal A site concomitant with the hydrolysis of GTP. The resulting inactive EF1A:GDP complex is recycled to the active GTP form by the guanine-nucleotide exchange factor EF1B, a complex composed of at least two subunits, alpha and gamma. Metazoan EFB1 contain a third subunit, beta. The EF1B gamma subunit contains a GST fold consisting of an N-terminal TRX-fold domain and a C-terminal alpha helical domain. The GST-like domain of EF1Bgamma is believed to mediate the dimerization of the EF1 complex, which in yeast is a dimer of the heterotrimer EF1A:EF1Balpha:EF1Bgamma. In addition to its role in prot Probab=99.76 E-value=6.3e-19 Score=127.95 Aligned_cols=71 Identities=24% Similarity=0.315 Sum_probs=66.4 Q ss_pred CCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCC--CCCHHHHHHCCCCCCCHHHHCCHHHHCCCCHHHHHHHH Q ss_conf 213589986899999999986998299988867--77889998556877520111000111012103567775 Q gi|254780594|r 3 ILYHYPLSSSSRFIRLILSEYEFTTDMIEEFPW--GKRREFLELNPSGTLPVYIDEHMQALCGVFVISEYLDE 73 (230) Q Consensus 3 ~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~--~~~~~~~~lnP~g~vP~L~~~dg~~l~eS~aI~~yL~~ 73 (230) +||+++.||+++||+++++++|++++++.++.. +++++|+++||.|+||+|+++||.+|+||.||++||++ T Consensus 2 kLY~~~~spr~~~v~~~a~~~gl~~e~~~~~~~~e~~~pefl~~nP~gkvP~L~~~dG~~l~Es~AI~~YLa~ 74 (75) T cd03044 2 TLYTYPGNPRSLKILAAAKYNGLDVEIVDFQPGKENKTPEFLKKFPLGKVPAFEGADGFCLFESNAIAYYVAN 74 (75) T ss_pred EEECCCCCCCHHHHHHHHHHCCCCEEEEECCCCCCCCCHHHHHHCCCCCCCEEECCCCCEEECHHHHHHHHHH T ss_conf 2726898934799999999739964887246887659989998696999998980899898819999999975 No 38 >KOG4244 consensus Probab=99.76 E-value=9.3e-18 Score=121.24 Aligned_cols=187 Identities=16% Similarity=0.248 Sum_probs=120.2 Q ss_pred CCCCCC-------CCHHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHCCCCCCCHHHHCCHHHHCCCCHHHHHHHHHC Q ss_conf 213589-------9868999999999869982999888677788999855687752011100011101210356777528 Q gi|254780594|r 3 ILYHYP-------LSSSSRFIRLILSEYEFTTDMIEEFPWGKRREFLELNPSGTLPVYIDEHMQALCGVFVISEYLDETY 75 (230) Q Consensus 3 ~LY~~~-------~sp~~~kvr~~L~e~gi~y~~~~~~~~~~~~~~~~lnP~g~vP~L~~~dg~~l~eS~aI~~yL~~~~ 75 (230) -||+++ .||||.||...|...+||||.+.-.. ...+..|++|..+ =||+.+.||..|..+|.+.| T Consensus 47 YLyQF~R~~~~PnLSPfClKvEt~lR~~~IpYE~~~~~~-------~~rSr~G~lPFIE-LNGe~iaDS~~I~~~L~~hf 118 (281) T KOG4244 47 YLYQFPRTKTCPNLSPFCLKVETFLRAYDIPYEIVDCSL-------KRRSRNGTLPFIE-LNGEHIADSDLIEDRLRKHF 118 (281) T ss_pred EEEECCCCCCCCCCCHHHHHHHHHHHHHCCCCEECCCCC-------EEECCCCCCCEEE-ECCEECCCCHHHHHHHHHHC T ss_conf 999646567899987089999999998089733226633-------2415689766397-48856466199999999972 Q ss_pred CCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCC---------------------------HHHHHHHHHHHHHHHHHHHHH Q ss_conf 87233433567999999999999999861234---------------------------257789999887766421221 Q gi|254780594|r 76 GAIAQKNRLLSTDPLQRAETRRMIEWFMYQME---------------------------QDVTRPLAHERVYKLHMTTEQ 128 (230) Q Consensus 76 ~~~~~~~~L~p~~~~~ra~~~~~~~~~~~~~~---------------------------~~~~~~~~~~~~~~~~~~~~~ 128 (230) .-.. .| .+.+||+.+.+....++.+. .....+.+...++........ T Consensus 119 ~~~~---~L---~~e~~a~s~Al~rm~dnhL~~~l~y~k~~~~~~~~~~~~~~~l~~~l~~~l~~~~~~~~f~~kv~~r~ 192 (281) T KOG4244 119 KIPD---DL---SAEQRAQSRALSRMADNHLFWILLYYKGADDAWLNTDRKLIGLPGFLFPLLLPLFWKAIFGKKVYKRS 192 (281) T ss_pred CCCC---CC---CHHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHHHHHHHHH T ss_conf 9987---79---97888999999999987789898875502548999988503764210689999999999889999876 Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHCC------CCCHHHCHHHHHHHHHH Q ss_conf 023343999999999999998556545347984266750289999999999999618------99945676889999999 Q gi|254780594|r 129 GGGSPDSKVLRIARNNMREHIKYITWLIKSRSWIAGHHMSYADFAASATISILDYLG------EIDWDTAPIVKEWYQRM 202 (230) Q Consensus 129 ~~~~~~~~~~~~~~~~l~~~l~~le~~l~~~~fl~Gd~~s~ADi~~~~~l~~~~~~~------~~~~~~~p~l~~w~~r~ 202 (230) .+.-.+.+. ++..+-+++-++.++..|++++||.||++|-+|+++++.|..+.+ + +..-+++|+|.+|++|+ T Consensus 193 ~g~IG~f~~-~Ei~ell~rDlr~i~~~Lg~KkflfGdkit~~DatvFgqLa~v~Y-P~~~~i~d~le~d~p~l~eYceRI 270 (281) T KOG4244 193 TGAIGDFES-AEIDELLHRDLRAISDYLGDKKFLFGDKITPADATVFGQLAQVYY-PFRSHISDLLEGDFPNLLEYCERI 270 (281) T ss_pred HCCCCCCCH-HHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCCEEEEHHHHHHEEC-CCCCCHHHHHHHHCHHHHHHHHHH T ss_conf 413367578-999999877788877875787422589777400010132353004-577768898761354899999999 Q ss_pred HCC Q ss_conf 709 Q gi|254780594|r 203 KSR 205 (230) Q Consensus 203 ~~r 205 (230) ++. T Consensus 271 r~~ 273 (281) T KOG4244 271 RKE 273 (281) T ss_pred HHH T ss_conf 887 No 39 >cd03180 GST_C_2 GST_C family, unknown subfamily 2; composed of uncharacterized bacterial proteins, with similarity to GSTs. GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins and products of oxidative stress. GSTs also show GSH peroxidase activity and are involved in the synthesis of prostaglandins and leukotrienes. The GST fold contains an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. GSH binds to the N-terminal domain while the hydrophobic substrate occupies a pocket in the C-terminal domain. Probab=99.74 E-value=2.3e-17 Score=119.01 Aligned_cols=110 Identities=18% Similarity=0.337 Sum_probs=87.3 Q ss_pred HHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCH Q ss_conf 99999999999986123425778999988776642122102334399999999999999855654534798426675028 Q gi|254780594|r 90 LQRAETRRMIEWFMYQMEQDVTRPLAHERVYKLHMTTEQGGGSPDSKVLRIARNNMREHIKYITWLIKSRSWIAGHHMSY 169 (230) Q Consensus 90 ~~ra~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~l~~le~~l~~~~fl~Gd~~s~ 169 (230) ++||++.+|+.|..+.+.+.+.....+ .+..+ ....+...++.+..++++.++.+|.+|++++||+||++|+ T Consensus 1 a~RA~v~qWl~~~~~~l~p~~~~~~~~-~~~~p-------~~~rd~~~i~~~~~~~~~~l~vLe~~L~~~~yl~G~~~Ti 72 (110) T cd03180 1 AARARADRWMDWQTSTLNPAFRYAFWG-LVRTP-------PEQRDPAAIAASLAAWAKLMAILDAQLAGRPYLAGDRFTL 72 (110) T ss_pred CCHHHHHHHHHHHHCCCCHHHHHHHHH-HHCCC-------HHHCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCH T ss_conf 918999999999970551669999999-95388-------4234999999999999999999999885399827898879 Q ss_pred HHHHHHHHHHHHHHCCCCCHHHCHHHHHHHHHHHCCHHH Q ss_conf 999999999999961899945676889999999709988 Q gi|254780594|r 170 ADFAASATISILDYLGEIDWDTAPIVKEWYQRMKSRPSF 208 (230) Q Consensus 170 ADi~~~~~l~~~~~~~~~~~~~~p~l~~w~~r~~~rps~ 208 (230) |||++++.+......+ .+..++|+|.+|++|+.+||+| T Consensus 73 ADi~~~~~l~~~~~~~-~~~~~~p~v~~w~~rl~~RPaf 110 (110) T cd03180 73 ADIPLGCSAYRWFELP-IERPPLPHLERWYARLRARPAF 110 (110) T ss_pred HHHHHHHHHHHHHHCC-CCCCCCHHHHHHHHHHHHCCCC T ss_conf 9999999999998588-7544798999999999868898 No 40 >cd03182 GST_C_GTT2_like GST_C family, Saccharomyces cerevisiae GTT2-like subfamily; composed of predominantly uncharacterized proteins with similarity to the S. cerevisiae GST protein, GTT2. GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins, and products of oxidative stress. GSTs also show GSH peroxidase activity and are involved in the synthesis of prostaglandins and leukotrienes. The GST fold contains an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. GSH binds to the N-terminal domain while the hydrophobic substrate occupies a pocket in the C-terminal domain. GTT2, a homodimer, exhibits GST activity with standard substrates. Strains with deleted GTT2 genes are viable but exhibit increased sensiti Probab=99.72 E-value=8.3e-17 Score=115.79 Aligned_cols=116 Identities=16% Similarity=0.246 Sum_probs=88.1 Q ss_pred CHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCC Q ss_conf 99999999999999861234257789999887766421221023343999999999999998556545347984266750 Q gi|254780594|r 88 DPLQRAETRRMIEWFMYQMEQDVTRPLAHERVYKLHMTTEQGGGSPDSKVLRIARNNMREHIKYITWLIKSRSWIAGHHM 167 (230) Q Consensus 88 ~~~~ra~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~l~~le~~l~~~~fl~Gd~~ 167 (230) ||.+||.+++|...+...+...+.....+. . +... .........++.+.+++.+...+..+|++|++++|++||++ T Consensus 1 Tp~ErA~i~mw~Rr~e~~~~~~~~~~fr~~--~-~~~~-~~~~~~q~pe~~e~~~~~~~~~l~~ld~~L~~~~flaGd~f 76 (117) T cd03182 1 TPLERAQIEMWQRRAELQGLYPIGQAFRHA--T-PGLK-PPDREEQVPEWGERSKARAADFLAYLDTRLAGSPYVAGDRF 76 (117) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHCC--C-CCCC-CCCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCEECCCCC T ss_conf 989999999999999998699999998268--7-6557-76655449999999999999999999998777984158877 Q ss_pred CHHHHHHHHHHHHHHHCCCCCHHHCHHHHHHHHHHHCCHH Q ss_conf 2899999999999996189994567688999999970998 Q gi|254780594|r 168 SYADFAASATISILDYLGEIDWDTAPIVKEWYQRMKSRPS 207 (230) Q Consensus 168 s~ADi~~~~~l~~~~~~~~~~~~~~p~l~~w~~r~~~rps 207 (230) |+|||++++++.+....+....+++|+|.+|++|+.+||| T Consensus 77 TiADI~~~~~l~f~~~~~~~~~~~~p~l~~W~~ri~~RPs 116 (117) T cd03182 77 TIADITAFVGLDFAKVVKLRVPEELTHLRAWYDRMAARPS 116 (117) T ss_pred CHHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHCCC T ss_conf 7999999999999998089851028899999999983899 No 41 >cd03038 GST_N_etherase_LigE GST_N family, Beta etherase LigE subfamily; composed of proteins similar to Sphingomonas paucimobilis beta etherase, LigE, a GST-like protein that catalyzes the cleavage of the beta-aryl ether linkages present in low-moleculer weight lignins using GSH as the hydrogen donor. This reaction is an essential step in the degradation of lignin, a complex phenolic polymer that is the most abundant aromatic material in the biosphere. The beta etherase activity of LigE is enantioselective and it complements the activity of the other GST family beta etherase, LigF. Probab=99.71 E-value=3.6e-18 Score=123.61 Aligned_cols=75 Identities=24% Similarity=0.237 Sum_probs=63.2 Q ss_pred CCCCCCC-------CCHHHHHHHHHHHHCCCCEEEEECCCCCCCH--HHHHHCCCCCCCHHHHCCHHHHCCCCHHHHHHH Q ss_conf 5213589-------9868999999999869982999888677788--999855687752011100011101210356777 Q gi|254780594|r 2 SILYHYP-------LSSSSRFIRLILSEYEFTTDMIEEFPWGKRR--EFLELNPSGTLPVYIDEHMQALCGVFVISEYLD 72 (230) Q Consensus 2 ~~LY~~~-------~sp~~~kvr~~L~e~gi~y~~~~~~~~~~~~--~~~~lnP~g~vP~L~~~dg~~l~eS~aI~~yL~ 72 (230) ++||..+ +||||+|||++|++|||+|+.+.+...+..+ .....||.++||+|+++||.+|.||.+|++||| T Consensus 1 i~lyDl~~~~~~~~~SP~~wkvRlaL~~Kgi~y~~~~v~~~~~~~~~~~~~~~~~~~VPvl~d~~g~~i~dS~~Ia~YLe 80 (84) T cd03038 1 ITLYDLAGKDPVRAFSPNVWKTRLALNHKGLEYKTVPVEFPDIPPILGELTSGGFYTVPVIVDGSGEVIGDSFAIAEYLE 80 (84) T ss_pred CEEEECCCCCCCCEECCCHHHHHHHHHHCCCCCEEEEECCCCCCCCHHHCCCCCCCEEEEEEECCCCEEECHHHHHHHHH T ss_conf 98885689999970386269999999827997479887167864210002589854266899299999877999999998 Q ss_pred HHCC Q ss_conf 5288 Q gi|254780594|r 73 ETYG 76 (230) Q Consensus 73 ~~~~ 76 (230) ++|| T Consensus 81 e~yP 84 (84) T cd03038 81 EAYP 84 (84) T ss_pred HHCC T ss_conf 5683 No 42 >TIGR02182 GRXB Glutaredoxin, GrxB family; InterPro: IPR011901 Glutaredoxins , , , also known as thioltransferases (disulphide reductases, are small proteins of approximately one hundred amino-acid residues which utilise glutathione and NADPH as cofactors. Oxidized glutathione is regenerated by glutathione reductase. Together these components compose the glutathione system . Glutaredoxin functions as an electron carrier in the glutathione-dependent synthesis of deoxyribonucleotides by the enzyme ribonucleotide reductase. Like thioredoxin, which functions in a similar way, glutaredoxin possesses an active centre disulphide bond . It exists in either a reduced or an oxidized form where the two cysteine residues are linked in an intramolecular disulphide bond. Glutaredoxin has been sequenced in a variety of species. On the basis of extensive sequence similarity, it has been proposed that Vaccinia virus protein O2L is most probably a glutaredoxin. Finally, it must be noted that Bacteriophage T4 thioredoxin seems also to be evolutionary related. In position 5 of the pattern T4 thioredoxin has Val instead of Pro. This family includes the highly abundant Escherichia coli GrxB (Grx2) glutaredoxin which is notably longer than either GrxA or GrxC. Unlike the other two E. coli glutaredoxins, GrxB appears to be unable to reduce ribonucleotide reductase , and may have more to do with resistance to redox stress .; GO: 0009055 electron carrier activity, 0015035 protein disulfide oxidoreductase activity. Probab=99.69 E-value=2.5e-16 Score=113.07 Aligned_cols=193 Identities=16% Similarity=0.183 Sum_probs=123.8 Q ss_pred CCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCCC-CCHHHHHHCCCCCCCHHHHCCHHHHCCCCHHHHHHHHHCCCCCCC Q ss_conf 2135899868999999999869982999888677-788999855687752011100011101210356777528872334 Q gi|254780594|r 3 ILYHYPLSSSSRFIRLILSEYEFTTDMIEEFPWG-KRREFLELNPSGTLPVYIDEHMQALCGVFVISEYLDETYGAIAQK 81 (230) Q Consensus 3 ~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~~-~~~~~~~lnP~g~vP~L~~~dg~~l~eS~aI~~yL~~~~~~~~~~ 81 (230) +||-|.+||||-|||++...+|||+|.+..--.+ .+| ..+-...+||+|+-+||+.+.||..|+.|+++...... T Consensus 1 KLYIYdHCPfCvrARmifGL~NIp~E~~VLlNDDe~TP--irmiG~K~VPiL~K~DG~~m~ESlDIV~y~D~l~~Pi~-- 76 (212) T TIGR02182 1 KLYIYDHCPFCVRARMIFGLKNIPVEKIVLLNDDETTP--IRMIGKKQVPILQKDDGRFMAESLDIVAYIDELAGPIE-- 76 (212) T ss_pred CCCCCCCCCHHHHHHHHHCCCCCCHHHHHHCCCCCCCC--CCCCCCCCCCEEECCCCCHHHHHHHHHHHHHHCCCCHH-- T ss_conf 97202567478999987403587277774237886670--01235431653203631010226789999975178512-- Q ss_pred CCCCCCCHHHHHHHHHHHHHH----HHCCCHHHHHHHHHHH----HHHHHHHHHHCCCCCHHHHH---HHHHHHHHHHHH Q ss_conf 335679999999999999998----6123425778999988----77664212210233439999---999999999985 Q gi|254780594|r 82 NRLLSTDPLQRAETRRMIEWF----MYQMEQDVTRPLAHER----VYKLHMTTEQGGGSPDSKVL---RIARNNMREHIK 150 (230) Q Consensus 82 ~~L~p~~~~~ra~~~~~~~~~----~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~~---~~~~~~l~~~l~ 150 (230) +.+.. .-+.++..|+.-. ...+.+.+...-..|+ ..+...........-...++ ....+++..-|+ T Consensus 77 ~~l~~---~~s~~I~~WL~~v~~y~~kL~~PR~~k~~l~EFAT~~A~~YF~~KKE~~~G~F~~~L~~t~~~~~~~~~dLe 153 (212) T TIGR02182 77 PLLTG---KVSPEIEAWLRKVTGYLNKLLLPRFAKLDLPEFATQSARKYFTDKKEASAGNFSALLNDTPNLLEEINADLE 153 (212) T ss_pred HHHCC---CCCHHHHHHHHHHCCHHHHHCCCCHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHHH T ss_conf 65438---987789999987311232211551101688656768899999842263053599986134889999999999 Q ss_pred HHHHHHCCCCC-CCCCCCCHHHHHHHHHHHHHHHCCCCCHHHCHHHHHHHHHHHCC Q ss_conf 56545347984-26675028999999999999961899945676889999999709 Q gi|254780594|r 151 YITWLIKSRSW-IAGHHMSYADFAASATISILDYLGEIDWDTAPIVKEWYQRMKSR 205 (230) Q Consensus 151 ~le~~l~~~~f-l~Gd~~s~ADi~~~~~l~~~~~~~~~~~~~~p~l~~w~~r~~~r 205 (230) .++..+.+... +-| +.|.-|+.++|.|..+.....+.|.. .+..|.++|.+. T Consensus 154 ~L~~L~~~~~~Gvng-~ls~dDi~vFp~LRnLt~V~gi~wP~--~V~~Y~~~mS~~ 206 (212) T TIGR02182 154 ELKSLIDGDSAGVNG-ELSVDDILVFPLLRNLTVVKGIQWPS--RVADYLESMSEK 206 (212) T ss_pred HHHHHHCCCCCCCCC-CHHHHHHHHHHHHHHHHHHHCCCCCH--HHHHHHHHHHHH T ss_conf 999873224531011-11045888875776334551375718--899999998885 No 43 >cd03177 GST_C_Delta_Epsilon GST_C family, Class Delta and Epsilon subfamily; GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins and products of oxidative stress. GSTs also show GSH peroxidase activity and are involved in the synthesis of prostaglandins and leukotrienes. The GST fold contains an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. GSH binds to the N-terminal domain while the hydrophobic substrate occupies a pocket in the C-terminal domain. The class Delta and Epsilon subfamily is made up primarily of insect GSTs, which play major roles in insecticide resistance by facilitating reductive dehydrochlorination of insecticides or conjugating them with GSH to produce water-soluble metabolites th Probab=99.68 E-value=3.5e-16 Score=112.21 Aligned_cols=113 Identities=20% Similarity=0.404 Sum_probs=85.0 Q ss_pred HHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHH Q ss_conf 99999999999861234257789999887766421221023343999999999999998556545347984266750289 Q gi|254780594|r 91 QRAETRRMIEWFMYQMEQDVTRPLAHERVYKLHMTTEQGGGSPDSKVLRIARNNMREHIKYITWLIKSRSWIAGHHMSYA 170 (230) Q Consensus 91 ~ra~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~l~~le~~l~~~~fl~Gd~~s~A 170 (230) +||.|++++.+-.+.+.+.+... ....... +..+. +...+++++.++.+|..|++++|++||++|+| T Consensus 2 ~RA~Vdq~L~fd~~~l~~~~~~~-----~~~~~~~----~~~~~----~~~~~~~~~~l~~le~~L~~~~~laGd~lT~A 68 (118) T cd03177 2 KRAIVNQRLHFDSGTLYQRLRDY-----YYPILFG----GAEPP----EEKLDKLEEALDFLETFLEGSDYVAGDQLTIA 68 (118) T ss_pred CHHHHHHHHHHHHHHHHHHHHHH-----HHHHHHC----CCCCC----HHHHHHHHHHHHHHHHHHCCCCEECCCCCCHH T ss_conf 08999899999998799999999-----9999984----99999----99999999999999999756980068988599 Q ss_pred HHHHHHHHHHHHHCCCCCHHHCHHHHHHHHHHHCCHHHHHHHHHHH Q ss_conf 9999999999996189994567688999999970998898413332 Q gi|254780594|r 171 DFAASATISILDYLGEIDWDTAPIVKEWYQRMKSRPSFRPLLSERI 216 (230) Q Consensus 171 Di~~~~~l~~~~~~~~~~~~~~p~l~~w~~r~~~rps~~~~l~e~~ 216 (230) ||++++.+..+.....++.+.||+|.+|++||+++|+|+++-.... T Consensus 69 Di~~~~~l~~l~~~~~~d~~~yP~l~~W~~r~~~~p~~~e~n~~G~ 114 (118) T cd03177 69 DLSLVATVSTLEALLPLDLSKYPNVRAWLERLKALPPYEEANGKGA 114 (118) T ss_pred HHHHHHHHHHHHHHCCCCHHHCHHHHHHHHHHHHCCCHHHHHHHHH T ss_conf 9999999999998834885467799999999980796788889899 No 44 >cd03183 GST_C_Theta GST_C family, Class Theta subfamily; composed of eukaryotic class Theta GSTs and bacterial dichloromethane (DCM) dehalogenase. GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins and products of oxidative stress. The GST fold contains an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. GSH binds to the N-terminal domain while the hydrophobic substrate occupies a pocket in the C-terminal domain. Mammalian class Theta GSTs show poor GSH conjugating activity towards the standard substrates, CDNB and ethacrynic acid, differentiating them from other mammalian GSTs. GSTT1-1 shows similar cataytic activity as bacterial DCM dehalogenase, catalyzing the GSH-dependent hydrolytic dehalogenatio Probab=99.68 E-value=5.8e-16 Score=110.92 Aligned_cols=117 Identities=21% Similarity=0.394 Sum_probs=94.0 Q ss_pred HHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHH-HCCCCCCCCCCCCH Q ss_conf 99999999999861234257789999887766421221023343999999999999998556545-34798426675028 Q gi|254780594|r 91 QRAETRRMIEWFMYQMEQDVTRPLAHERVYKLHMTTEQGGGSPDSKVLRIARNNMREHIKYITWL-IKSRSWIAGHHMSY 169 (230) Q Consensus 91 ~ra~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~l~~le~~-l~~~~fl~Gd~~s~ 169 (230) +||++++|++|....+....+. +....+..+.. .+...+.+.++.+++.+...|+.+|+. |++++|++||++|+ T Consensus 1 ~RA~ID~yL~Whh~n~r~~~~~-~f~~~vl~P~~----~~~~~~~~~v~~~~~~l~~~l~~le~~~L~~~~yl~G~~~si 75 (126) T cd03183 1 KRARVDEYLAWQHTNLRLGCAK-YFWQKVLLPLL----GGKPVSPEKVKKAEENLEESLDLLENYFLKDKPFLAGDEISI 75 (126) T ss_pred CCCHHHHHHHHHHHCHHHHHHH-HHHHHHHHHHC----CCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCEECCCCCCH T ss_conf 9303889999998730075899-99999873230----489999999999999999999999998607997625898879 Q ss_pred HHHHHHHHHHHHHHCCCCCHHHCHHHHHHHHHHHC--CHHHHHHH Q ss_conf 99999999999996189994567688999999970--99889841 Q gi|254780594|r 170 ADFAASATISILDYLGEIDWDTAPIVKEWYQRMKS--RPSFRPLL 212 (230) Q Consensus 170 ADi~~~~~l~~~~~~~~~~~~~~p~l~~w~~r~~~--rps~~~~l 212 (230) ||++++|.+..+...+...+..||+|.+|++||++ .|-|.++. T Consensus 76 ADi~~~~ei~ql~~~~~d~~~~~P~v~aW~~rv~~~~~p~~~e~h 120 (126) T cd03183 76 ADLSAVCEIMQPEAAGYDVFEGRPKLAAWRKRVKEAGNPLFDEAH 120 (126) T ss_pred HHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHCCCHHHHHH T ss_conf 999899876766651775566881899999999987597499999 No 45 >cd03179 GST_C_1 GST_C family, unknown subfamily 1; composed of uncharacterized bacterial proteins, with similarity to GSTs. GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins and products of oxidative stress. GSTs also show GSH peroxidase activity and are involved in the synthesis of prostaglandins and leukotrienes. The GST fold contains an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. GSH binds to the N-terminal domain while the hydrophobic substrate occupies a pocket in the C-terminal domain. Probab=99.66 E-value=1.7e-15 Score=108.22 Aligned_cols=104 Identities=25% Similarity=0.410 Sum_probs=85.2 Q ss_pred HHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHH Q ss_conf 99999999999861234257789999887766421221023343999999999999998556545347984266750289 Q gi|254780594|r 91 QRAETRRMIEWFMYQMEQDVTRPLAHERVYKLHMTTEQGGGSPDSKVLRIARNNMREHIKYITWLIKSRSWIAGHHMSYA 170 (230) Q Consensus 91 ~ra~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~l~~le~~l~~~~fl~Gd~~s~A 170 (230) +|+++.+|+.|..+.+.+.+...... .+. ......+...++.+.+++.+.++.+|++|++++||+||++|+| T Consensus 2 eRa~v~qWl~w~~~~~~p~~~~~~~~-~~~-------~~~~~~~~~~i~~~~~~~~~~l~vLd~~L~~~~yL~Gd~~TlA 73 (105) T cd03179 2 ERAQVLRWLFFEQYSHEPYIATLRFL-RVY-------LGLGEADAEVLAFLRERGHAALAVLEAHLAGRDFLVGDALTIA 73 (105) T ss_pred CHHHHHHHHHHHHHCCCHHHHHHHHH-HHC-------CCCCCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHH T ss_conf 09999999999965755168888899-971-------7866348999999999999999999999865985588998099 Q ss_pred HHHHHHHHHHHHHCCCCCHHHCHHHHHHHHHHH Q ss_conf 999999999999618999456768899999997 Q gi|254780594|r 171 DFAASATISILDYLGEIDWDTAPIVKEWYQRMK 203 (230) Q Consensus 171 Di~~~~~l~~~~~~~~~~~~~~p~l~~w~~r~~ 203 (230) ||++++++.+....+ ++.+++|+|.+|++|++ T Consensus 74 Di~~~~~~~~~~~~~-~~~~~~p~l~~W~~Rie 105 (105) T cd03179 74 DIALAAYTHVADEGG-FDLADYPAIRAWLARIE 105 (105) T ss_pred HHHHHHHHHHHHHCC-CCCCCCHHHHHHHHHHC T ss_conf 999999999998549-98002948999999869 No 46 >cd00570 GST_N_family Glutathione S-transferase (GST) family, N-terminal domain; a large, diverse group of cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins and products of oxidative stress. In addition, GSTs also show GSH peroxidase activity and are involved in the synthesis of prostaglandins and leukotrienes. This family, also referred to as soluble GSTs, is the largest family of GSH transferases and is only distantly related to the mitochondrial GSTs (GSTK subfamily, a member of the DsbA family). Soluble GSTs bear no structural similarity to microsomal GSTs (MAPEG family) and display additional activities unique to their group, such as catalyzing thiolysis, reduction and isomerization of certain compounds. The GST fold contains an N-terminal TRX-fold domain and a C-terminal alpha helical doma Probab=99.59 E-value=9.1e-16 Score=109.81 Aligned_cols=70 Identities=34% Similarity=0.421 Sum_probs=63.1 Q ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCCCC-CHHHHHHCCCCCCCHHHHCCHHHHCCCCHHHHHHH Q ss_conf 521358998689999999998699829998886777-88999855687752011100011101210356777 Q gi|254780594|r 2 SILYHYPLSSSSRFIRLILSEYEFTTDMIEEFPWGK-RREFLELNPSGTLPVYIDEHMQALCGVFVISEYLD 72 (230) Q Consensus 2 ~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~~~-~~~~~~lnP~g~vP~L~~~dg~~l~eS~aI~~yL~ 72 (230) ++||+++.||+|+|||++|+++|++|+.+.++.... .++|.++||.|+||+|++ ||.+++||.+|++||+ T Consensus 1 ~~Ly~~~~s~~~~~v~~~l~~~gi~~~~~~v~~~~~~~~~~~~~~p~~~vP~l~~-~~~~i~eS~~I~~yL~ 71 (71) T cd00570 1 LKLYYFPGSPRSLRVRLALEEKGLPYELVPVDLGEGEQEEFLALNPLGKVPVLED-GGLVLTESLAILEYLA 71 (71) T ss_pred CEEEECCCCHHHHHHHHHHHHCCCCCEEEECCCCCCCCHHHHHHCCCCCCCEEEE-CCEEEECHHHHHHHHC T ss_conf 9898079993799999999981998569873455437899996486667888998-9979977899999639 No 47 >cd03206 GST_C_7 GST_C family, unknown subfamily 7; composed of uncharacterized proteins with similarity to GSTs. GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins, and products of oxidative stress. GSTs also show GSH peroxidase activity and are involved in the synthesis of prostaglandins and leukotrienes. The GST fold contains an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. GSH binds to the N-terminal domain while the hydrophobic substrate occupies a pocket in the C-terminal domain. Probab=99.59 E-value=5.3e-15 Score=105.45 Aligned_cols=100 Identities=17% Similarity=0.310 Sum_probs=75.6 Q ss_pred HHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHH Q ss_conf 99999998612342577899998877664212210233439999999999999985565453479842667502899999 Q gi|254780594|r 95 TRRMIEWFMYQMEQDVTRPLAHERVYKLHMTTEQGGGSPDSKVLRIARNNMREHIKYITWLIKSRSWIAGHHMSYADFAA 174 (230) Q Consensus 95 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~l~~le~~l~~~~fl~Gd~~s~ADi~~ 174 (230) |.+|+.|..+.+.+..+..... .... .+ ...+.+.++..+.+..||++|++++||+||++|+|||++ T Consensus 1 V~qWL~fq~~~l~p~~~~~r~~----~~~~-------~p--~~~~~~~~~a~r~l~vLe~~L~~~~~l~Gd~~TiADia~ 67 (100) T cd03206 1 VQRWLSVAAGEIANGPAAARLI----TLFG-------AP--LDKETAIARAHRLLRLLEEHLAGRDWLAGDRPTIADVAV 67 (100) T ss_pred CHHHHHHHHCCCCHHHHHHHHH----HHHC-------CC--CCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHH T ss_conf 9424979965812526899999----9868-------96--459999999999999999998549820589986689988 Q ss_pred HHHHHHHHHCCCCCHHHCHHHHHHHHHHHCCHHH Q ss_conf 9999999961899945676889999999709988 Q gi|254780594|r 175 SATISILDYLGEIDWDTAPIVKEWYQRMKSRPSF 208 (230) Q Consensus 175 ~~~l~~~~~~~~~~~~~~p~l~~w~~r~~~rps~ 208 (230) ++++...... .++.++||+|.+|++|+++||+| T Consensus 68 ~~~~~~~~~~-~~d~~~yPnv~~W~~Ri~~rPgf 100 (100) T cd03206 68 YPYVALAPEG-GVDLEDYPAIRRWLARIEALPGF 100 (100) T ss_pred HHHHHHHHCC-CCCHHHCHHHHHHHHHHHHCCCC T ss_conf 6999983115-36710198999999999857898 No 48 >cd03043 GST_N_1 GST_N family, unknown subfamily 1; composed of uncharacterized proteins, predominantly from bacteria, with similarity to GSTs. GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins and products of oxidative stress. GSTs also show GSH peroxidase activity and are involved in the synthesis of prostaglandins and leukotrienes. The GST fold contains an N-terminal TRX-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. Probab=99.58 E-value=1.1e-15 Score=109.37 Aligned_cols=64 Identities=28% Similarity=0.332 Sum_probs=56.6 Q ss_pred CCCHHHHHHHHHHHHCCCCEEEEECCCCC--CCHHHHHHCCCCCCCHHHHCCHHHHCCCCHHHHHHH Q ss_conf 99868999999999869982999888677--788999855687752011100011101210356777 Q gi|254780594|r 8 PLSSSSRFIRLILSEYEFTTDMIEEFPWG--KRREFLELNPSGTLPVYIDEHMQALCGVFVISEYLD 72 (230) Q Consensus 8 ~~sp~~~kvr~~L~e~gi~y~~~~~~~~~--~~~~~~~lnP~g~vP~L~~~dg~~l~eS~aI~~yL~ 72 (230) .-|.+|.|++++|+|+|||||.+.++... ..++|+++||.||||+|+| ||.+|+||.||++||+ T Consensus 8 ~YSswSlR~~l~L~e~glpfe~~~v~l~~~~~~~e~~~inP~gkVP~L~d-~~~~i~ES~AI~eYLA 73 (73) T cd03043 8 NYSSWSLRPWLLLKAAGIPFEEILVPLYTPDTRARILEFSPTGKVPVLVD-GGIVVWDSLAICEYLA 73 (73) T ss_pred CCCHHHHHHHHHHHHCCCCCEEEEECCCCCHHHHHHHHHCCCCCCCEEEE-CCEEEECHHHHHHHHC T ss_conf 73618999999999869985799978999125699997597988888998-9989974999999759 No 49 >cd03061 GST_N_CLIC GST_N family, Chloride Intracellular Channel (CLIC) subfamily; composed of CLIC1-5, p64, parchorin and similar proteins. They are auto-inserting, self-assembling intracellular anion channels involved in a wide variety of functions including regulated secretion, cell division and apoptosis. They can exist in both water-soluble and membrane-bound states, and are found in various vesicles and membranes. Biochemical studies of the C. elegans homolog, EXC-4, show that the membrane localization domain is present in the N-terminal part of the protein. The structure of soluble human CLIC1 reveals that it is monomeric and it adopts a fold similar to GSTs, containing an N-terminal domain with a TRX fold and a C-terminal alpha helical domain. Upon oxidation, the N-terminal domain of CLIC1 undergoes a structural change to form a non-covalent dimer stabilized by the formation of an intramolecular disulfide bond between two cysteines that are far apart in the reduced form. The CLI Probab=99.56 E-value=3.5e-15 Score=106.46 Aligned_cols=69 Identities=14% Similarity=0.088 Sum_probs=64.9 Q ss_pred CCCHHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHCCCCCCCHHHHCCHHHHCCCCHHHHHHHHHCCC Q ss_conf 9986899999999986998299988867778899985568775201110001110121035677752887 Q gi|254780594|r 8 PLSSSSRFIRLILSEYEFTTDMIEEFPWGKRREFLELNPSGTLPVYIDEHMQALCGVFVISEYLDETYGA 77 (230) Q Consensus 8 ~~sp~~~kvr~~L~e~gi~y~~~~~~~~~~~~~~~~lnP~g~vP~L~~~dg~~l~eS~aI~~yL~~~~~~ 77 (230) ..||||+|+++.|.+|||+|++..++..+++++|++++|.|++|+|++ ||.+++||..|.+||+++++. T Consensus 20 G~CPF~QR~~mvL~~Kgv~f~vt~VDl~~kPe~f~~~~p~~~~PvL~~-~g~~~~ds~~I~e~lEe~~~p 88 (91) T cd03061 20 GNCPFCQRLFMVLWLKGVVFNVTTVDMKRKPEDLKDLAPGTQPPFLLY-NGEVKTDNNKIEEFLEETLCP 88 (91) T ss_pred CCCHHHHHHHHHHHHCCCCCEEEEEECCCCHHHHHHHCCCCCCCEEEE-CCEEEECHHHHHHHHHHHCCC T ss_conf 577778999999997499745898554669299996598999999988-999963389999999854499 No 50 >cd03189 GST_C_GTT1_like GST_C family, Saccharomyces cerevisiae GTT1-like subfamily; composed of predominantly uncharacterized proteins with similarity to the S. cerevisiae GST protein, GTT1, and the Schizosaccharomyces pombe GST-III. GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins, and products of oxidative stress. GSTs also show GSH peroxidase activity and are involved in the synthesis of prostaglandins and leukotrienes. The GST fold contains an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. GSH binds to the N-terminal domain while the hydrophobic substrate occupies a pocket in the C-terminal domain. GTT1, a homodimer, exhibits GST activity with standard substrates and associates with the endopl Probab=99.55 E-value=6.7e-14 Score=99.10 Aligned_cols=118 Identities=13% Similarity=0.206 Sum_probs=82.1 Q ss_pred CCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCC Q ss_conf 79999999999999998612342577899998877664212210233439999999999999985565453479842667 Q gi|254780594|r 86 STDPLQRAETRRMIEWFMYQMEQDVTRPLAHERVYKLHMTTEQGGGSPDSKVLRIARNNMREHIKYITWLIKSRSWIAGH 165 (230) Q Consensus 86 p~~~~~ra~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~l~~le~~l~~~~fl~Gd 165 (230) +.+..+++...+|+.+..+.+.+.+...+............... ..+.........++..+++.+|++|++++||+|+ T Consensus 2 ~~~~~~~a~yl~Wlhfaegsl~p~l~~~~~~~~~~~~~~~~~~~--~~~~~~~~~~~~~l~~~l~~le~~L~~~~yl~G~ 79 (119) T cd03189 2 PPDTAEYADYLYWLHFAEGSLMPPLLLKLVLSRIGSAPPPIANK--IADKVLAGFINPELKKHLDFLEDRLAKKGYFVGD 79 (119) T ss_pred CCCCHHHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCHHH--HHHHHHHHHHHHHHHHHHHHHHHHHCCCCEECCC T ss_conf 98977799999999998231257999999986323456740278--8878889999999999999999986669810478 Q ss_pred CCCHHHHHHHHHHHHHHHCCCCCHHHCHHHHHHHHHHHCCH Q ss_conf 50289999999999999618999456768899999997099 Q gi|254780594|r 166 HMSYADFAASATISILDYLGEIDWDTAPIVKEWYQRMKSRP 206 (230) Q Consensus 166 ~~s~ADi~~~~~l~~~~~~~~~~~~~~p~l~~w~~r~~~rp 206 (230) +||+|||++...+.+...++ .+.+++|+|.+|++|+.+|| T Consensus 80 ~fs~ADi~~~~~l~~~~~~~-~~~~~~P~l~ay~~r~~~RP 119 (119) T cd03189 80 KLTAADIMMSFPLEAALARG-PLLEKYPNIAAYLERIEARP 119 (119) T ss_pred CCCHHHHHHHHHHHHHHHCC-CCCCCCHHHHHHHHHHHCCC T ss_conf 75299999999999999739-88444857999999988188 No 51 >COG2999 GrxB Glutaredoxin 2 [Posttranslational modification, protein turnover, chaperones] Probab=99.53 E-value=1e-13 Score=98.11 Aligned_cols=195 Identities=15% Similarity=0.183 Sum_probs=119.4 Q ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCC-CCCHHHHHHCCCCCCCHHHHCCHHHHCCCCHHHHHHHHHCCCCCC Q ss_conf 5213589986899999999986998299988867-778899985568775201110001110121035677752887233 Q gi|254780594|r 2 SILYHYPLSSSSRFIRLILSEYEFTTDMIEEFPW-GKRREFLELNPSGTLPVYIDEHMQALCGVFVISEYLDETYGAIAQ 80 (230) Q Consensus 2 ~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~-~~~~~~~~lnP~g~vP~L~~~dg~~l~eS~aI~~yL~~~~~~~~~ 80 (230) ++||-|.+||||-|+|++...++||++....... +.+| ..+-...+||+|+-+||+.+.||..|++|+++..+.. T Consensus 1 MkLYIYdHCPfcvrarmi~Gl~nipve~~vL~nDDe~Tp--~rmiG~KqVPiL~Kedg~~m~ESlDIV~y~d~~~~~~-- 76 (215) T COG2999 1 MKLYIYDHCPFCVRARMIFGLKNIPVELHVLLNDDEETP--IRMIGQKQVPILQKEDGRAMPESLDIVHYVDELDGKP-- 76 (215) T ss_pred CCEEEECCCHHHHHHHHHHHCCCCCHHHHEECCCCCCCH--HHHHCCCCCCEEECCCCCCCHHHHHHHHHHHHHCCCH-- T ss_conf 945674357078999998500388744410226764373--5431444353587446331413547999999853752-- Q ss_pred CCCCCCCCHHHHHHHHHHHHHHH----HCCCHHHHHHHHHHHH----HHHHHHHHHCCCCCHHHHH---HHHHHHHHHHH Q ss_conf 43356799999999999999986----1234257789999887----7664212210233439999---99999999998 Q gi|254780594|r 81 KNRLLSTDPLQRAETRRMIEWFM----YQMEQDVTRPLAHERV----YKLHMTTEQGGGSPDSKVL---RIARNNMREHI 149 (230) Q Consensus 81 ~~~L~p~~~~~ra~~~~~~~~~~----~~~~~~~~~~~~~~~~----~~~~~~~~~~~~~~~~~~~---~~~~~~l~~~l 149 (230) ++ +..-+.+++.|..-.. ....+........|+. .+.............+... ....+++..-+ T Consensus 77 ---~l--t~~~~pai~~wlrkv~~y~nkll~PR~~k~~l~EF~T~sA~~yf~~KKe~s~g~F~~~l~~t~~~~~~i~~dl 151 (215) T COG2999 77 ---LL--TGKVRPAIEAWLRKVNGYLNKLLLPRFAKSALPEFATPSARKYFTDKKEASEGSFESLLNHTAQYLKRIQADL 151 (215) T ss_pred ---HH--CCCCCHHHHHHHHHHCCHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHHCCCCHHHHHHCHHHHHHHHHHHH T ss_conf ---21--1676878999999831647555101676417710058899999986103113459998734399999999999 Q ss_pred HHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCHHHCHHHHHHHHHHHCCHHH Q ss_conf 55654534798426675028999999999999961899945676889999999709988 Q gi|254780594|r 150 KYITWLIKSRSWIAGHHMSYADFAASATISILDYLGEIDWDTAPIVKEWYQRMKSRPSF 208 (230) Q Consensus 150 ~~le~~l~~~~fl~Gd~~s~ADi~~~~~l~~~~~~~~~~~~~~p~l~~w~~r~~~rps~ 208 (230) +.+++.+.+..-+-| ..|.-|+.+++.|..+.....+.|.. ++..|..+|.+.-.+ T Consensus 152 ~~l~~Li~~~s~~n~-~l~~ddi~vFplLRnlt~v~gi~wps--~v~dy~~~msektqV 207 (215) T COG2999 152 RALDKLIVGPSAVNG-ELSEDDILVFPLLRNLTLVAGIQWPS--RVADYRDNMSEKTQV 207 (215) T ss_pred HHHHHHHCCCCHHCC-CCCHHHHHHHHHHCCCEECCCCCCCH--HHHHHHHHHHHHHCC T ss_conf 999998617320045-43432436448761440214677848--999999988776376 No 52 >pfam02798 GST_N Glutathione S-transferase, N-terminal domain. Function: conjugation of reduced glutathione to a variety of targets. Also included in the alignment, but are not GSTs: * S-crystallins from squid. Similarity to GST previously noted. * Eukaryotic elongation factors 1-gamma. Not known to have GST activity; similarity not previously recognized. * HSP26 family of stress-related proteins. including auxin-regulated proteins in plants and stringent starvation proteins in E. coli. Not known to have GST activity. Similarity not previously recognized. The glutathione molecule binds in a cleft between N and C-terminal domains - the catalytically important residues are proposed to reside in the N-terminal domain. Probab=99.53 E-value=7.5e-15 Score=104.57 Aligned_cols=71 Identities=24% Similarity=0.296 Sum_probs=64.7 Q ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHCCC-CCCCHHHHCCHHHHCCCCHHHHHHHH Q ss_conf 5213589986899999999986998299988867778899985568-77520111000111012103567775 Q gi|254780594|r 2 SILYHYPLSSSSRFIRLILSEYEFTTDMIEEFPWGKRREFLELNPS-GTLPVYIDEHMQALCGVFVISEYLDE 73 (230) Q Consensus 2 ~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~~~~~~~~~lnP~-g~vP~L~~~dg~~l~eS~aI~~yL~~ 73 (230) .+|++++...+|+++|++|+++||+|+.+.++..++++++.+.||. |+||+|++ ||.+|+||.||++||++ T Consensus 1 ~~l~yf~~rg~a~~~R~~L~~~gi~ye~~~~~~~~~~~~~~~~~p~~g~vP~l~~-~g~~l~eS~AI~~YLar 72 (72) T pfam02798 1 STLTLFNGRGRAERCRILLAAKGVEYEDVRVDFSSKSPEWLKLNPLMGQVPALED-GGKKLTESLAIARYLAR 72 (72) T ss_pred CCEEECCCCCHHHHHHHHHHHCCCCCEEEECCCCCCCHHHHHHCCCCCCCCEEEE-CCEEEECHHHHHHHHCC T ss_conf 9307638853579999999985998689865522178878733888885798998-99899739999998677 No 53 >cd03041 GST_N_2GST_N GST_N family, 2 repeats of the N-terminal domain of soluble GSTs (2 GST_N) subfamily; composed of uncharacterized proteins. GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins, and products of oxidative stress. GSTs also show GSH peroxidase activity and are involved in the synthesis of prostaglandins and leukotrienes. The GST fold contains an N-terminal TRX-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. Probab=99.52 E-value=1.3e-14 Score=103.27 Aligned_cols=75 Identities=27% Similarity=0.303 Sum_probs=66.8 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCCC-CCHHHHHHCCCCCCCHHHHCC-HHHHCCCCHHHHHHHHHC Q ss_conf 952135899868999999999869982999888677-788999855687752011100-011101210356777528 Q gi|254780594|r 1 MSILYHYPLSSSSRFIRLILSEYEFTTDMIEEFPWG-KRREFLELNPSGTLPVYIDEH-MQALCGVFVISEYLDETY 75 (230) Q Consensus 1 M~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~~-~~~~~~~lnP~g~vP~L~~~d-g~~l~eS~aI~~yL~~~~ 75 (230) +++||++..||||+|||-+|.+++|+|+++.+.... +..++.+.++.++||+|+|.| |..+.||.+|++||+++| T Consensus 1 pL~LY~~e~cPfC~kVR~~L~~L~L~~~~~~~~~~~~~r~~l~~~gG~~qVP~L~D~ntgv~myES~dIi~YL~~~Y 77 (77) T cd03041 1 PLELYEFEGSPFCRLVREVLTELELDVILYPCPKGSPKRDKFLEKGGKVQVPYLVDPNTGVQMFESADIVKYLFKTY 77 (77) T ss_pred CCEEECCCCCCCHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHCCCCCCCEEEECCCCEEEECHHHHHHHHHHHC T ss_conf 97487169995249999999986997799989999666899998489967737996899848777999999999659 No 54 >cd03185 GST_C_Tau GST_C family, Class Tau subfamily; GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins and products of oxidative stress. The GST fold contains an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. GSH binds to the N-terminal domain while the hydrophobic substrate occupies a pocket in the C-terminal domain. The plant-specific class Tau GST subfamily has undergone extensive gene duplication. The Arabidopsis and Oryza genomes contain 28 and 40 Tau GSTs, respectively. They are primarily responsible for herbicide detoxification together with class Phi GSTs, showing class specificity in substrate preference. Tau enzymes are highly efficient in detoxifying diphenylether and aryloxyphenoxypropi Probab=99.51 E-value=2.4e-13 Score=95.96 Aligned_cols=112 Identities=16% Similarity=0.313 Sum_probs=87.6 Q ss_pred HHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCC Q ss_conf 99999999999998612342577899998877664212210233439999999999999985565453479842667502 Q gi|254780594|r 89 PLQRAETRRMIEWFMYQMEQDVTRPLAHERVYKLHMTTEQGGGSPDSKVLRIARNNMREHIKYITWLIKSRSWIAGHHMS 168 (230) Q Consensus 89 ~~~ra~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~l~~le~~l~~~~fl~Gd~~s 168 (230) |.+||++|.|.++++..+.+..... ... ..+..+.+.+++.+.|+.+|..+++++|+.|+++| T Consensus 1 Py~RA~ARFWa~fid~k~~~~~~~~---------~~~--------~~ee~~~a~~e~~e~L~~lE~~l~~k~ffgG~~~g 63 (126) T cd03185 1 PYERAVARFWAAFIDDKLFPAGRKV---------LAA--------KGEEREKAKEEALEALKVLEEELGGKPFFGGDTIG 63 (126) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHH---------HHC--------CHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCC T ss_conf 9279999996999980628999999---------965--------80999999999999999999985779998998765 Q ss_pred HHHHHHHHHHHHHHH---CCC---CCHHHCHHHHHHHHHHHCCHHHHHHHHHHHC Q ss_conf 899999999999996---189---9945676889999999709988984133321 Q gi|254780594|r 169 YADFAASATISILDY---LGE---IDWDTAPIVKEWYQRMKSRPSFRPLLSERIR 217 (230) Q Consensus 169 ~ADi~~~~~l~~~~~---~~~---~~~~~~p~l~~w~~r~~~rps~~~~l~e~~~ 217 (230) +.||++++++.++.. ++. ++.+++|.|.+|.+|+.++|+++.++.+..+ T Consensus 64 ~vDi~l~~~~~w~~~~ee~~g~~~~~~~k~P~L~~W~~~~~~~~~Vk~~lP~~~k 118 (126) T cd03185 64 YVDIALGSFLGWFRAYEEVGGVKLLDEEKTPLLAAWAERFLELEAVKEVLPDRDK 118 (126) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCHHHCCHHHHHHHHHHCCCHHHHHCCCHHH T ss_conf 3899999999999999998298236801171999999998446288887879999 No 55 >cd03207 GST_C_8 GST_C family, unknown subfamily 8; composed of uncharacterized bacterial proteins with similarity to GSTs. GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins, and products of oxidative stress. GSTs also show GSH peroxidase activity and are involved in the synthesis of prostaglandins and leukotrienes. The GST fold contains an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. GSH binds to the N-terminal domain while the hydrophobic substrate occupies a pocket in the C-terminal domain. Probab=99.50 E-value=6.1e-14 Score=99.34 Aligned_cols=75 Identities=12% Similarity=0.224 Sum_probs=64.5 Q ss_pred HHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCHHHCHHHHHHHHHHHCCHHHHHHHH Q ss_conf 99999999999985565453479842667502899999999999996189994567688999999970998898413 Q gi|254780594|r 137 VLRIARNNMREHIKYITWLIKSRSWIAGHHMSYADFAASATISILDYLGEIDWDTAPIVKEWYQRMKSRPSFRPLLS 213 (230) Q Consensus 137 ~~~~~~~~l~~~l~~le~~l~~~~fl~Gd~~s~ADi~~~~~l~~~~~~~~~~~~~~p~l~~w~~r~~~rps~~~~l~ 213 (230) ....+.....+.++.+|+.|++++||+||+||+|||++...+.+....+. .+.+|+|.+|++|+.+||+||++.+ T Consensus 27 ~~~~~~~~~~~~l~~Le~~L~~~~yl~Gd~FT~ADI~~~~~l~~~~~~~~--l~~~p~l~~y~~rl~aRPA~qrA~a 101 (103) T cd03207 27 ARMAGFGSYDDVLAALEQALAKGPYLLGERFTAADVLVGSPLGWGLQFGL--LPERPAFDAYIARITDRPAFQRAAA 101 (103) T ss_pred HHHCCCCCHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHCCC--CCCCHHHHHHHHHHHCCHHHHHHHH T ss_conf 55513267999999999998669810478652999999999999997388--8998799999999981999999986 No 56 >cd03191 GST_C_Zeta GST_C family, Class Zeta subfamily; GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins, and products of oxidative stress. The GST fold contains an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. GSH binds to the N-terminal domain while the hydrophobic substrate occupies a pocket in the C-terminal domain. Class Zeta GSTs, also known as maleylacetoacetate (MAA) isomerases, catalyze the isomerization of MAA to fumarylacetoacetate, the penultimate step in tyrosine/phenylalanine catabolism, using GSH as a cofactor. They show little GSH-conjugating activity towards traditional GST substrates, but display modest GSH peroxidase activity. They are also implicated in the detoxification of th Probab=99.48 E-value=4.8e-13 Score=94.21 Aligned_cols=118 Identities=15% Similarity=0.184 Sum_probs=87.6 Q ss_pred HHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHCC--CCCCCCCC Q ss_conf 9999999999999861234257789999887766421221023343999999999999998556545347--98426675 Q gi|254780594|r 89 PLQRAETRRMIEWFMYQMEQDVTRPLAHERVYKLHMTTEQGGGSPDSKVLRIARNNMREHIKYITWLIKS--RSWIAGHH 166 (230) Q Consensus 89 ~~~ra~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~l~~le~~l~~--~~fl~Gd~ 166 (230) |.+||+++.+...+...+++-. +-++.+...... +..+.+.....+.-+.++|+.+|+.|++ ++|.+||+ T Consensus 1 p~~RA~vR~la~~Ia~dihPl~-----NlrVl~~l~~~~---~~~~~~~~~W~~hwi~~Gf~AlE~~L~~~~G~fc~Gd~ 72 (121) T cd03191 1 PKKRARVRALALIIACDIHPLN-----NLRVLKYLTEEL---GLDEEAKNAWYRHWIARGFAALEKLLAQTAGKFCFGDE 72 (121) T ss_pred CHHHHHHHHHHHHHHHCCCCCC-----CHHHHHHHHHHC---CCCHHHHHHHHHHHHHHHHHHHHHHHHHCCCCEECCCC T ss_conf 9778999999999994677761-----599999999880---89889999999999997899999998708999521898 Q ss_pred CCHHHHHHHHHHHHHHHCCCCCHHHCHHHHHHHHHHHCCHHHHHHHHHH Q ss_conf 0289999999999999618999456768899999997099889841333 Q gi|254780594|r 167 MSYADFAASATISILDYLGEIDWDTAPIVKEWYQRMKSRPSFRPLLSER 215 (230) Q Consensus 167 ~s~ADi~~~~~l~~~~~~~~~~~~~~p~l~~w~~r~~~rps~~~~l~e~ 215 (230) +|+|||+++|.++....|+ +|.+.||+|.+.++++.+.|+|+++..+. T Consensus 73 ~tlADicLvPQv~nA~Rf~-vDl~~yP~i~~i~~~c~~lpAF~~A~P~~ 120 (121) T cd03191 73 PTLADICLVPQVYNARRFG-VDLSPYPTIARINEACLELPAFQAAHPDN 120 (121) T ss_pred CCHHHHHHHHHHHHHHHCC-CCHHHCHHHHHHHHHHHCCHHHHHCCCCC T ss_conf 6099999999999999839-88113438999999986477898709255 No 57 >pfam00043 GST_C Glutathione S-transferase, C-terminal domain. GST conjugates reduced glutathione to a variety of targets including S-crystallin from squid, the eukaryotic elongation factor 1-gamma, the HSP26 family of stress-related proteins and auxin-regulated proteins in plants. Stringent starvation proteins in E. coli are also included in the alignment but are not known to have GST activity. The glutathione molecule binds in a cleft between N and C-terminal domains. The catalytically important residues are proposed to reside in the N-terminal domain. In plants, GSTs are encoded by a large gene family (48 GST genes in Arabidopsis) and can be divided into the phi, tau, theta, zeta, and lambda classes. Probab=99.47 E-value=2.2e-13 Score=96.15 Aligned_cols=74 Identities=20% Similarity=0.383 Sum_probs=67.1 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCHHHCHHHHHHHHHHHCCH Q ss_conf 43999999999999998556545347984266750289999999999999618999456768899999997099 Q gi|254780594|r 133 PDSKVLRIARNNMREHIKYITWLIKSRSWIAGHHMSYADFAASATISILDYLGEIDWDTAPIVKEWYQRMKSRP 206 (230) Q Consensus 133 ~~~~~~~~~~~~l~~~l~~le~~l~~~~fl~Gd~~s~ADi~~~~~l~~~~~~~~~~~~~~p~l~~w~~r~~~rp 206 (230) .++..++.+..++...++.+|..|++++||+||++|+||+++++.+.++..++....++||+|.+|++||++|| T Consensus 19 ~d~~~~~~a~~~~~~~l~~le~~L~~~~~l~Gd~~tlADi~~~~~~~~~~~~~~~~~~~~P~l~~w~~r~~~rP 92 (92) T pfam00043 19 KDEPEVEEALEKLLKVLEALEEVLKGKTYLVGDKLTLADIALYPALDWLYMLDPDCLEKFPNLKAWRKRVAARP 92 (92) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHCCCCEECCCCCCHHHHHHHHHHHHHHHHCCCCCCCCHHHHHHHHHHHHCC T ss_conf 39999999999999999999999879985468987698999999999999949876310889999999998188 No 58 >cd03037 GST_N_GRX2 GST_N family, Glutaredoxin 2 (GRX2) subfamily; composed of bacterial proteins similar to E. coli GRX2, an atypical GRX with a molecular mass of about 24kD, compared with other GRXs which are 9-12kD in size. GRX2 adopts a GST fold containing an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain. It contains a redox active CXXC motif located in the N-terminal domain but is not able to reduce ribonucleotide reductase like other GRXs. However, it catalyzes GSH-dependent protein disulfide reduction of other substrates efficiently. GRX2 is thought to function primarily in catalyzing the reversible glutathionylation of proteins in cellular redox regulation including stress responses. Probab=99.44 E-value=7.9e-14 Score=98.68 Aligned_cols=71 Identities=14% Similarity=0.134 Sum_probs=62.3 Q ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHCCCCCCCHHHHCCHHHHCCCCHHHHHHHH Q ss_conf 521358998689999999998699829998886777889998556877520111000111012103567775 Q gi|254780594|r 2 SILYHYPLSSSSRFIRLILSEYEFTTDMIEEFPWGKRREFLELNPSGTLPVYIDEHMQALCGVFVISEYLDE 73 (230) Q Consensus 2 ~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~~~~~~~~~lnP~g~vP~L~~~dg~~l~eS~aI~~yL~~ 73 (230) ++||.|..||||.|||+++..||+++|.+.+... ..+....+...++||+|+.+||+++.||..|+.|+++ T Consensus 1 mkLY~y~hCPfc~raRm~~g~kni~~e~~~l~~d-d~~t~~~~iG~k~VPiL~~~DG~~m~ESlDIi~y~de 71 (71) T cd03037 1 MKLYIYEHCPFCVKARMIAGLKNIPVEQIILQND-DEATPIRMIGAKQVPILEKDDGSFMAESLDIVAFIDE 71 (71) T ss_pred CCEEEECCCHHHHHHHHHHHHCCCCEEEEEEECC-CCHHHHHHCCCCCCCEEEECCCCCCHHHHHHHHHHCC T ss_conf 9515834483889999998762998489983059-8441476538822237982898780589999998609 No 59 >cd03080 GST_N_Metaxin_like GST_N family, Metaxin subfamily, Metaxin-like proteins; a heterogenous group of proteins, predominantly uncharacterized, with similarity to metaxins and GSTs. Metaxin 1 is a component of a preprotein import complex of the mitochondrial outer membrane. It extends to the cytosol and is anchored to the mitochondrial membrane through its C-terminal domain. In mice, metaxin is required for embryonic development. In humans, alterations in the metaxin gene may be associated with Gaucher disease. One characterized member of this subgroup is a novel GST from Rhodococcus with toluene o-monooxygenase and gamma-glutamylcysteine synthetase activities. Also members are the cadmium-inducible lysosomal protein CDR-1 and its homologs from C. elegans, and the failed axon connections (fax) protein from Drosophila. CDR-1 is an integral membrane protein that functions to protect against cadmium toxicity and may also have a role in osmoregulation to maintain salt balance in C. ele Probab=99.43 E-value=8.3e-14 Score=98.56 Aligned_cols=68 Identities=24% Similarity=0.272 Sum_probs=58.8 Q ss_pred CCCCCCCC-------CCHHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHCCCCCCCHHHHCCHHHHCCCCHHHHHHHH Q ss_conf 95213589-------98689999999998699829998886777889998556877520111000111012103567775 Q gi|254780594|r 1 MSILYHYP-------LSSSSRFIRLILSEYEFTTDMIEEFPWGKRREFLELNPSGTLPVYIDEHMQALCGVFVISEYLDE 73 (230) Q Consensus 1 M~~LY~~~-------~sp~~~kvr~~L~e~gi~y~~~~~~~~~~~~~~~~lnP~g~vP~L~~~dg~~l~eS~aI~~yL~~ 73 (230) |.+||+++ .||||.||+..|..+||+|+.+..... ..+|.||+|.+++ ||+.|.||..|++||++ T Consensus 1 mv~Ly~fp~~~~~ps~SPfc~Kve~~Lr~~~i~ye~~~~~~~-------~~sp~gKlP~I~~-~g~~i~DS~~Ii~~L~~ 72 (75) T cd03080 1 MITLYQFPRAFGVPSLSPFCLKVETFLRMAGIPYENKFGGLA-------KRSPKGKLPFIEL-NGEKIADSELIIDHLEE 72 (75) T ss_pred CEEEEECCCCCCCCCCCHHHHHHHHHHHHCCCCCEEECCCCC-------CCCCCCCCCEEEE-CCEEECCHHHHHHHHHH T ss_conf 989996799889888785999999999975998477347665-------5587786687998-99894599999999997 Q ss_pred HCC Q ss_conf 288 Q gi|254780594|r 74 TYG 76 (230) Q Consensus 74 ~~~ 76 (230) +|+ T Consensus 73 ~fg 75 (75) T cd03080 73 KYG 75 (75) T ss_pred HCC T ss_conf 609 No 60 >cd03040 GST_N_mPGES2 GST_N family; microsomal Prostaglandin E synthase Type 2 (mPGES2) subfamily; mPGES2 is a membrane-anchored dimeric protein containing a CXXC motif which catalyzes the isomerization of PGH2 to PGE2. Unlike cytosolic PGE synthase (cPGES) and microsomal PGES Type 1 (mPGES1), mPGES2 does not require glutathione (GSH) for its activity, although its catalytic rate is increased two- to four-fold in the presence of DTT, GSH or other thiol compounds. PGE2 is widely distributed in various tissues and is implicated in the sleep/wake cycle, relaxation/contraction of smooth muscle, excretion of sodium ions, maintenance of body temperature and mediation of inflammation. mPGES2 contains an N-terminal hydrophobic domain which is membrane associated, and a C-terminal soluble domain with a GST-like structure. Probab=99.39 E-value=2e-13 Score=96.36 Aligned_cols=71 Identities=18% Similarity=0.231 Sum_probs=62.3 Q ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHCCCCCCCHHH---HCCHHHHCCCCHHHHHHHHH Q ss_conf 52135899868999999999869982999888677788999855687752011---10001110121035677752 Q gi|254780594|r 2 SILYHYPLSSSSRFIRLILSEYEFTTDMIEEFPWGKRREFLELNPSGTLPVYI---DEHMQALCGVFVISEYLDET 74 (230) Q Consensus 2 ~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~~~~~~~~~lnP~g~vP~L~---~~dg~~l~eS~aI~~yL~~~ 74 (230) ++||.|..||||.|||.+|...||+|+.+.+++..+.. ++.+..+|||+|. ++++..+.||+.|+.||+.. T Consensus 2 ltLYqy~~CPFC~KVR~~L~~~~ipy~~veV~p~~k~e--i~~s~~~KVP~l~i~~~~~~~~l~DSs~II~~L~~~ 75 (77) T cd03040 2 ITLYQYKTCPFCCKVRAFLDYHGIPYEVVEVNPVSRKE--IKWSSYKKVPILRVESGGDGQQLVDSSVIISTLKTY 75 (77) T ss_pred EEEEEECCCCCHHHHHHHHHHCCCCCEEEECCCCCCCC--CCCCCCCCCCEEEEECCCCEEEEECHHHHHHHHHHH T ss_conf 07998678961899999998759972799817301141--346897788799997289806986029999999865 No 61 >COG0435 ECM4 Predicted glutathione S-transferase [Posttranslational modification, protein turnover, chaperones] Probab=99.38 E-value=5.2e-13 Score=93.99 Aligned_cols=194 Identities=15% Similarity=0.239 Sum_probs=136.3 Q ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCE----EEEEC----CCCC---------CCHH---------HHHHCC--CC--CCC Q ss_conf 52135899868999999999869982----99988----8677---------7889---------998556--87--752 Q gi|254780594|r 2 SILYHYPLSSSSRFIRLILSEYEFTT----DMIEE----FPWG---------KRRE---------FLELNP--SG--TLP 51 (230) Q Consensus 2 ~~LY~~~~sp~~~kvr~~L~e~gi~y----~~~~~----~~~~---------~~~~---------~~~lnP--~g--~vP 51 (230) ..||..-.||++.|..++=+.||+.= .++.- .-|. ..+. |.+-.| .| +|| T Consensus 52 YhLYvslaCPWAHRTLI~R~LkgLE~~Isvsvv~~~m~~~GW~F~~~~~g~t~dpl~g~~~L~~~Y~~adP~YsgRvTVP 131 (324) T COG0435 52 YHLYVSLACPWAHRTLIFRALKGLEPVISVSVVHPLMDENGWTFDPEFPGATGDPLYGIERLSQLYTRADPDYSGRVTVP 131 (324) T ss_pred EEEEEECCCCHHHHHHHHHHHHCCCCCCEEEEECCCCCCCCCEECCCCCCCCCCCCCCHHHHHHHHHHCCCCCCCCEEEE T ss_conf 89999704855788999999846466435887156446897467688889878722362589999850488998855687 Q ss_pred HHHHCCH--HHHCCCCHHHHHHHHHCCCCC-CCCCCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHH Q ss_conf 0111000--111012103567775288723-3433567999999999999999861234257789999887766421221 Q gi|254780594|r 52 VYIDEHM--QALCGVFVISEYLDETYGAIA-QKNRLLSTDPLQRAETRRMIEWFMYQMEQDVTRPLAHERVYKLHMTTEQ 128 (230) Q Consensus 52 ~L~~~dg--~~l~eS~aI~~yL~~~~~~~~-~~~~L~p~~~~~ra~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 128 (230) +|.|... .|-.||..|++-+...|.... ...-|+|. ..|.++..+.+|+...+...+ ++- T Consensus 132 VLwDk~~~tIVnNES~eIirm~N~aFde~~~~~~dlyP~--~Lr~eId~~n~~Iy~~vNNGV---------Yk~------ 194 (324) T COG0435 132 VLWDKKTQTIVNNESAEIIRMFNSAFDEFGASAVDLYPE--ALRTEIDELNKWIYDTVNNGV---------YKA------ 194 (324) T ss_pred EEEECCCCEEECCCCHHHHHHHHHHHHHHHHCCCCCCCH--HHHHHHHHHHHHHCCCCCCCE---------EEE------ T ss_conf 898357772543775999999999999986303443869--999999998745110225845---------540------ Q ss_pred CCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHH-----HCC--CCCHHHCHHHHHHHHH Q ss_conf 023343999999999999998556545347984266750289999999999999-----618--9994567688999999 Q gi|254780594|r 129 GGGSPDSKVLRIARNNMREHIKYITWLIKSRSWIAGHHMSYADFAASATISILD-----YLG--EIDWDTAPIVKEWYQR 201 (230) Q Consensus 129 ~~~~~~~~~~~~~~~~l~~~l~~le~~l~~~~fl~Gd~~s~ADi~~~~~l~~~~-----~~~--~~~~~~~p~l~~w~~r 201 (230) +....++..+++.+++.+.|+.+|..|+++.||+||++|-||+.+++.|.+++ +|+ .--..+||+|..|... T Consensus 195 -GFA~tq~aYeea~~~lF~~Ld~lE~~L~~~ryl~Gd~lTEAD~RLftTlvRFD~VYvgHFKCN~~rI~dypnL~~yLr~ 273 (324) T COG0435 195 -GFATTQEAYEEAVKKLFEALDKLEQILSERRYLTGDQLTEADIRLFTTLVRFDPVYVGHFKCNLRRIRDYPNLWGYLRD 273 (324) T ss_pred -CCCCHHHHHHHHHHHHHHHHHHHHHHHHHCEEECCCCCHHHHHHHHHEEEEEEEEEEEEEECCCCHHCCCCHHHHHHHH T ss_conf -5422689999999999999999999864073412560107666353015762005773454261122037408899999 Q ss_pred HHCCHHHHHHHH Q ss_conf 970998898413 Q gi|254780594|r 202 MKSRPSFRPLLS 213 (230) Q Consensus 202 ~~~rps~~~~l~ 213 (230) +.+.|+|+.+.. T Consensus 274 LYq~pg~~~T~d 285 (324) T COG0435 274 LYQLPGFAETVD 285 (324) T ss_pred HHCCCCCCCCCC T ss_conf 864767122246 No 62 >cd03186 GST_C_SspA GST_N family, Stringent starvation protein A (SspA) subfamily; SspA is a RNA polymerase (RNAP)-associated protein required for the lytic development of phage P1 and for stationary phase-induced acid tolerance of E. coli. It is implicated in survival during nutrient starvation. SspA adopts the GST fold with an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain, but it does not bind glutathione (GSH) and lacks GST activity. SspA is highly conserved among gram-negative bacteria. Related proteins found in Neisseria (called RegF), Francisella and Vibrio regulate the expression of virulence factors necessary for pathogenesis. Probab=99.37 E-value=6.2e-12 Score=87.80 Aligned_cols=107 Identities=21% Similarity=0.260 Sum_probs=82.5 Q ss_pred HHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCC Q ss_conf 99999999999998612342577899998877664212210233439999999999999985565453479842667502 Q gi|254780594|r 89 PLQRAETRRMIEWFMYQMEQDVTRPLAHERVYKLHMTTEQGGGSPDSKVLRIARNNMREHIKYITWLIKSRSWIAGHHMS 168 (230) Q Consensus 89 ~~~ra~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~l~~le~~l~~~~fl~Gd~~s 168 (230) |..||++|.|+.-+...... .+.... . .+.+..+.+++++.+.|..++..+++++|+.||+|| T Consensus 1 Pv~RA~~Rl~~~RIe~dwy~---------l~~~i~----~----~~~k~~~~ark~L~e~L~~l~p~F~~~pyFmsdEFs 63 (107) T cd03186 1 PVARARSRLLMHRIEQDWYP---------LVDTIE----K----GRKKEAEKARKELRESLLALAPVFAHKPYFMSEEFS 63 (107) T ss_pred CCHHHHHHHHHHHHHHHHHH---------HHHHHH----C----CCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCH T ss_conf 93789999999999999999---------999998----4----986789999999999999999998048440226520 Q ss_pred HHHHHHHHHHHHHHHCCCCCHHHCHHHHHHHHHHHCCHHHHHHH Q ss_conf 89999999999999618999456768899999997099889841 Q gi|254780594|r 169 YADFAASATISILDYLGEIDWDTAPIVKEWYQRMKSRPSFRPLL 212 (230) Q Consensus 169 ~ADi~~~~~l~~~~~~~~~~~~~~p~l~~w~~r~~~rps~~~~l 212 (230) +.||+++|.||++.++|.........|.+|.+|+=+||||++.+ T Consensus 64 lvDc~laPlLWRL~~lgI~Lp~~~k~l~~Y~~RiF~R~sF~~SL 107 (107) T cd03186 64 LVDCALAPLLWRLPALGIELPKQAKPLKDYMERVFARDSFQKSL 107 (107) T ss_pred HHHHHHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHCCHHHHHHC T ss_conf 88999989999701227237975438999999985809889629 No 63 >cd03204 GST_C_GDAP1 GST_C family, Ganglioside-induced differentiation-associated protein 1 (GDAP1) subfamily; GDAP1 was originally identified as a highly expressed gene at the differentiated stage of GD3 synthase-transfected cells. More recently, mutations in GDAP1 have been reported to cause both axonal and demyelinating autosomal-recessive Charcot-Marie-Tooth (CMT) type 4A neuropathy. CMT is characterized by slow and progressive weakness and atrophy of muscles. Sequence analysis of GDAP1 shows similarities and differences with GSTs; it appears to contain both N-terminal thioredoxin-fold and C-terminal alpha helical domains of GSTs, however, it also contains additional C-terminal transmembrane domains unlike GSTs. GDAP1 is mainly expressed in neuronal cells and is localized in the mitochondria through its transmembrane domains. It does not exhibit GST activity using standard substrates. Probab=99.24 E-value=2.7e-11 Score=84.15 Aligned_cols=75 Identities=21% Similarity=0.522 Sum_probs=64.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCC----------CCCCCCCCHHHHHHHHHHHHHHHCCCCC--HH--HCHHHHHHH Q ss_conf 399999999999999855654534798----------4266750289999999999999618999--45--676889999 Q gi|254780594|r 134 DSKVLRIARNNMREHIKYITWLIKSRS----------WIAGHHMSYADFAASATISILDYLGEID--WD--TAPIVKEWY 199 (230) Q Consensus 134 ~~~~~~~~~~~l~~~l~~le~~l~~~~----------fl~Gd~~s~ADi~~~~~l~~~~~~~~~~--~~--~~p~l~~w~ 199 (230) +.+.++....++...|+.+|+.|..+. ||+||.||+|||++.+.|+++..+|... |. .+|+|.+|+ T Consensus 23 ~~~~l~K~L~el~~vLD~vE~~L~~~~~e~~~~~~~~wL~gd~fTlADi~l~~~L~RL~~LGl~~~~w~~g~rPni~~Yy 102 (111) T cd03204 23 NVEYLKKILDELEMVLDQVEQELQRRKEETEEQKCQLWLCGDTFTLADISLGVTLHRLKFLGLSRRYWGNGKRPNLEAYF 102 (111) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCEECCCCCCHHHHHHHHHHHHHHHCCCCHHHHHCCCCCCHHHHH T ss_conf 58999999999999999999999876411455656512026854099999999999999928748887338986499999 Q ss_pred HHHHCCHHH Q ss_conf 999709988 Q gi|254780594|r 200 QRMKSRPSF 208 (230) Q Consensus 200 ~r~~~rps~ 208 (230) +|+.+|||| T Consensus 103 ~r~~qR~sf 111 (111) T cd03204 103 ERVLQRESF 111 (111) T ss_pred HHHHCCCCC T ss_conf 999753788 No 64 >KOG2903 consensus Probab=99.21 E-value=2.9e-11 Score=84.00 Aligned_cols=192 Identities=18% Similarity=0.245 Sum_probs=129.8 Q ss_pred CCCCCCCCHHHHHHHHHHHHCCCC----EEEEEC--------CCC-----CCC----------------HHHHHH-CC-- Q ss_conf 213589986899999999986998----299988--------867-----778----------------899985-56-- Q gi|254780594|r 3 ILYHYPLSSSSRFIRLILSEYEFT----TDMIEE--------FPW-----GKR----------------REFLEL-NP-- 46 (230) Q Consensus 3 ~LY~~~~sp~~~kvr~~L~e~gi~----y~~~~~--------~~~-----~~~----------------~~~~~l-nP-- 46 (230) -||..-.||++.|..++++.||++ +..+.- .+. +.+ .++-.+ +| T Consensus 39 hLYvslaCPWAhRtLi~r~LKGL~~~i~~s~v~~~~d~~gW~F~~~~~~~nDs~~l~~~~d~~~g~k~l~elY~~~~p~Y 118 (319) T KOG2903 39 HLYVSLACPWAHRTLIVRALKGLEPAIGVSVVHWHLDDKGWRFLDEHIIINDSERLGVTPDPLNGAKRLRELYYIASPNY 118 (319) T ss_pred EEEEECCCCHHHHHHHHHHHCCCCCCCEEEEECCCCCCCCCCCCCCCCCCCCHHCCCCCCCCCCCCHHHHHHHHHCCCCC T ss_conf 99996168188889999987075655416772113688862677756677741004677750125036888882248998 Q ss_pred CC--CCCHHHHCCH--HHHCCCCHHHHHHHHHCCC---CC--CCCCCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHH Q ss_conf 87--7520111000--1110121035677752887---23--34335679999999999999998612342577899998 Q gi|254780594|r 47 SG--TLPVYIDEHM--QALCGVFVISEYLDETYGA---IA--QKNRLLSTDPLQRAETRRMIEWFMYQMEQDVTRPLAHE 117 (230) Q Consensus 47 ~g--~vP~L~~~dg--~~l~eS~aI~~yL~~~~~~---~~--~~~~L~p~~~~~ra~~~~~~~~~~~~~~~~~~~~~~~~ 117 (230) .| +||+|-|-.- .|=.||..|++.+...|.+ .. +.-.|+|. ..++++..+..|....+...+ T Consensus 119 ~grfTVPVLWD~k~ktIVnNES~eIIr~fNs~f~ef~~~~e~~~lDL~P~--~L~~~Ide~N~wvy~~INNGV------- 189 (319) T KOG2903 119 TGRFTVPVLWDLKTKTIVNNESSEIIRMFNSAFDEFNGIAENPVLDLYPS--SLRAQIDETNSWVYDKINNGV------- 189 (319) T ss_pred CCEEEEEEEECCCCCEEECCCHHHHHHHHHHHHHHHHCCCCCCCCCCCCH--HHHHHHHHHHCEECCCCCCCE------- T ss_conf 85277777871321512337639999998630154420024775453778--999998641111403634745------- Q ss_pred HHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCC--CCCCCCCCHHHHHHHHHHHHHH-----HCC---CC Q ss_conf 8776642122102334399999999999999855654534798--4266750289999999999999-----618---99 Q gi|254780594|r 118 RVYKLHMTTEQGGGSPDSKVLRIARNNMREHIKYITWLIKSRS--WIAGHHMSYADFAASATISILD-----YLG---EI 187 (230) Q Consensus 118 ~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~l~~le~~l~~~~--fl~Gd~~s~ADi~~~~~l~~~~-----~~~---~~ 187 (230) ++- +....++..+.+..++.++|+.+|..|+++. |++|+++|.|||.+++.+.+++ +|+ .. T Consensus 190 --Yk~-------GFA~~~e~Ye~~V~~lfe~LDr~E~vL~~~~~~f~~G~~LTeaDirLy~TiIRFD~VY~~hFKCn~~~ 260 (319) T KOG2903 190 --YKC-------GFAEKQEAYEEEVNQLFEALDRCEDVLGKNRKYFLCGDTLTEADIRLYTTIIRFDEVYVQHFKCNKKT 260 (319) T ss_pred --EEE-------CCCCCCCHHHHHHHHHHHHHHHHHHHHHCCCCEEEECCCCCHHHEEEEEEEEEEHHHHHEEEECCHHH T ss_conf --663-------34122206889999999999999999751563476445233210355555786101201113334324 Q ss_pred CHHHCHHHHHHHHHHHC-CHHHHHHH Q ss_conf 94567688999999970-99889841 Q gi|254780594|r 188 DWDTAPIVKEWYQRMKS-RPSFRPLL 212 (230) Q Consensus 188 ~~~~~p~l~~w~~r~~~-rps~~~~l 212 (230) ..++||+|..|..++.+ -|+|+.+. T Consensus 261 ir~~Yp~l~~~lk~iY~~~~~~~~Tt 286 (319) T KOG2903 261 IRDEYPNLHNWLKNIYWNIPGFSSTT 286 (319) T ss_pred HHCCCCHHHHHHHHHHHHCCCHHHCC T ss_conf 33157489999999976160022105 No 65 >cd00299 GST_C_family Glutathione S-transferase (GST) family, C-terminal alpha helical domain; a large, diverse group of cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins and products of oxidative stress. In addition, GSTs also show GSH peroxidase activity and are involved in the synthesis of prostaglandins and leukotrienes. This family, also referred to as soluble GSTs, is the largest family of GSH transferases and is only distantly related to the mitochondrial GSTs (GSTK). Soluble GSTs bear no structural similarity to microsomal GSTs (MAPEG family) and display additional activities unique to their group, such as catalyzing thiolysis, reduction and isomerization of certain compounds. The GST fold contains an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain, with an activ Probab=99.17 E-value=2.3e-10 Score=78.82 Aligned_cols=97 Identities=29% Similarity=0.452 Sum_probs=72.7 Q ss_pred HHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHH Q ss_conf 99999986123425778999988776642122102334399999999999999855654534798426675028999999 Q gi|254780594|r 96 RRMIEWFMYQMEQDVTRPLAHERVYKLHMTTEQGGGSPDSKVLRIARNNMREHIKYITWLIKSRSWIAGHHMSYADFAAS 175 (230) Q Consensus 96 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~l~~le~~l~~~~fl~Gd~~s~ADi~~~ 175 (230) ++|..|.+..+...+........... ..+....+...+.+...++.+|+.|+++.|++|+++|+|||+++ T Consensus 2 ~~~~~~~~~~~~~~~~~~~~~~~~~~----------~~~~~~~~~~~~~~~~~l~~le~~L~~~~~l~G~~~s~ADi~~~ 71 (100) T cd00299 2 RAWEEWADTTLEPAARRLLLLAFVGP----------EVDEAALEEAREELAAALAALEKLLAGRPYLAGDRFSLADIALA 71 (100) T ss_pred HHHHHHHHCCCCHHHHHHHHHHCCCC----------CCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHH T ss_conf 76898896873099999999861689----------99999999999999999999999846698667899859999999 Q ss_pred HHHHHHHHCCCC--CHHHCHHHHHHHHHH Q ss_conf 999999961899--945676889999999 Q gi|254780594|r 176 ATISILDYLGEI--DWDTAPIVKEWYQRM 202 (230) Q Consensus 176 ~~l~~~~~~~~~--~~~~~p~l~~w~~r~ 202 (230) +.|.++...... ..+.+|+|.+|++|+ T Consensus 72 ~~l~~~~~~~~~~~~~~~~p~l~~w~~r~ 100 (100) T cd00299 72 PVLARLDLLGPLLGLLDEYPRLAAWYDRL 100 (100) T ss_pred HHHHHHHHCCCCCCCCCCCCHHHHHHHHC T ss_conf 99999998174313432481699999869 No 66 >cd03190 GST_C_ECM4_like GST_C family, ECM4-like subfamily; composed of predominantly uncharacterized and taxonomically diverse proteins with similarity to the translation product of the Saccharomyces cerevisiae gene ECM4. ECM4, a gene of unknown function, is involved in cell surface biosynthesis and architecture. S. cerevisiae ECM4 mutants show increased amounts of the cell wall hexose, N-acetylglucosamine. More recently, global gene expression analysis shows that ECM4 is upregulated during genotoxic conditions and together with the expression profiles of 18 other genes could potentially differentiate between genotoxic and cytotoxic insults in yeast. Probab=99.16 E-value=3.7e-10 Score=77.58 Aligned_cols=108 Identities=18% Similarity=0.264 Sum_probs=83.4 Q ss_pred HHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCH Q ss_conf 99999999999986123425778999988776642122102334399999999999999855654534798426675028 Q gi|254780594|r 90 LQRAETRRMIEWFMYQMEQDVTRPLAHERVYKLHMTTEQGGGSPDSKVLRIARNNMREHIKYITWLIKSRSWIAGHHMSY 169 (230) Q Consensus 90 ~~ra~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~l~~le~~l~~~~fl~Gd~~s~ 169 (230) ..|.++..|..|+...+.. .+++.. ....++..+.+...+.++|+.+|..|++++||+||++|. T Consensus 3 ~L~~eId~~n~~iy~~iNn---------GVYk~G-------FA~tQ~aYd~a~~~lF~aLd~lE~~L~~~ryL~Gd~lTe 66 (142) T cd03190 3 ELRSEIDELNEWIYDNINN---------GVYKAG-------FATTQEAYDEAVDELFEALDRLEELLSDRRYLLGDRLTE 66 (142) T ss_pred HHHHHHHHHHHHHHHHHHH---------HHHCCC-------CCCCHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCH T ss_conf 8999999999999886650---------652536-------679999999999999999999999984397003885539 Q ss_pred HHHHHHHHHHHHHH-----CC--CCCHHHCHHHHHHHHHHHCCHHHHHHHH Q ss_conf 99999999999996-----18--9994567688999999970998898413 Q gi|254780594|r 170 ADFAASATISILDY-----LG--EIDWDTAPIVKEWYQRMKSRPSFRPLLS 213 (230) Q Consensus 170 ADi~~~~~l~~~~~-----~~--~~~~~~~p~l~~w~~r~~~rps~~~~l~ 213 (230) ||+.+++.|.+++. |+ .-...+||+|.+|..++.+.|+++++.. T Consensus 67 aDirLf~TLiRFD~vY~~~Fkcn~k~i~dYpnL~~y~r~Ly~~p~~~~T~~ 117 (142) T cd03190 67 ADIRLFTTLIRFDAVYVQHFKCNLKRIRDYPNLWNYLRRLYQNPGVAETTN 117 (142) T ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCHHCCCHHHHHHHHHHHCCCCCHHHCC T ss_conf 999999999850389998850561501218379999999976866000079 No 67 >cd03209 GST_C_Mu GST_C family, Class Mu subfamily; GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins, and products of oxidative stress. The GST fold contains an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. GSH binds to the N-terminal domain while the hydrophobic substrate occupies a pocket in the C-terminal domain. The class Mu subfamily is composed of eukaryotic GSTs. In rats, at least six distinct class Mu subunits have been identified, with homologous genes in humans for five of these subunits. Class Mu GSTs can form homodimers and heterodimers, giving a large number of possible isoenzymes that can be formed, all with overlapping activities but different substrate specificities. They are the m Probab=99.11 E-value=1.8e-09 Score=73.68 Aligned_cols=81 Identities=17% Similarity=0.352 Sum_probs=68.8 Q ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCHHHCHHHHHHHHHHHCCHHHHHHHHHH Q ss_conf 99999999999998556545347984266750289999999999999618999456768899999997099889841333 Q gi|254780594|r 136 KVLRIARNNMREHIKYITWLIKSRSWIAGHHMSYADFAASATISILDYLGEIDWDTAPIVKEWYQRMKSRPSFRPLLSER 215 (230) Q Consensus 136 ~~~~~~~~~l~~~l~~le~~l~~~~fl~Gd~~s~ADi~~~~~l~~~~~~~~~~~~~~p~l~~w~~r~~~rps~~~~l~e~ 215 (230) +..+...+.+...|..++..|++++|++|+++|+||+.++..|.....+...-++.+|+|+++++|++++|.+++.++.. T Consensus 31 ~~K~~~~~~lp~~L~~~e~~L~~~~~~~G~~lTyaDf~~~e~ld~~~~~~p~~l~~~p~L~~~~~r~~~lP~I~~Yl~S~ 110 (121) T cd03209 31 KLKPDYLAKLPDKLKLFSDFLGDRPWFAGDKITYVDFLLYEALDQHRIFEPDCLDAFPNLKDFLERFEALPKISAYMKSD 110 (121) T ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHCHHHHHCCCHHHHHHHHHHCCHHHHHHHCCH T ss_conf 88999999968999999999578974358997444699999999999879152102627999999998094289987697 Q ss_pred H Q ss_conf 2 Q gi|254780594|r 216 I 216 (230) Q Consensus 216 ~ 216 (230) . T Consensus 111 ~ 111 (121) T cd03209 111 R 111 (121) T ss_pred H T ss_conf 2 No 68 >cd03184 GST_C_Omega GST_C family, Class Omega subfamily; GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins and products of oxidative stress. The GST fold contains an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. GSH binds to the N-terminal domain while the hydrophobic substrate occupies a pocket in the C-terminal domain. Class Omega GSTs show little or no GSH-conjugating activity towards standard GST substrates. Instead, they catalyze the GSH dependent reduction of protein disulfides, dehydroascorbate and monomethylarsonate, activities which are more characteristic of glutaredoxins. They contain a conserved cysteine equivalent to the first cysteine in the CXXC motif of glutaredoxins, which is a re Probab=99.09 E-value=3.7e-10 Score=77.59 Aligned_cols=77 Identities=17% Similarity=0.343 Sum_probs=63.2 Q ss_pred HHHHHHHHHHHHHHHHHCC--CCCCCCCCCCHHHHHHHHHHHHHHHCC-----CCCHHHCHHHHHHHHHHHCCHHHHHHH Q ss_conf 9999999998556545347--984266750289999999999999618-----999456768899999997099889841 Q gi|254780594|r 140 IARNNMREHIKYITWLIKS--RSWIAGHHMSYADFAASATISILDYLG-----EIDWDTAPIVKEWYQRMKSRPSFRPLL 212 (230) Q Consensus 140 ~~~~~l~~~l~~le~~l~~--~~fl~Gd~~s~ADi~~~~~l~~~~~~~-----~~~~~~~p~l~~w~~r~~~rps~~~~l 212 (230) ...+++.+.|+.+|..|+. ++|+.|+++|++|++++|+..++..++ .++.+++|+|.+|+++|.++|+|+++. T Consensus 32 ~~~~e~~~~L~~lE~eL~~~~t~FFgG~~~g~vD~~iwPwfeR~~~~~~~~~~~~~~~~fP~l~~W~~~m~~~p~Vk~~~ 111 (124) T cd03184 32 EKKAELRSALENLEEELTKRGTPFFGGDSPGMVDYMIWPWFERLEALKLLLGYEFPLDRFPKLKKWMDAMKEDPAVQAFY 111 (124) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCHHHCHHHHHHHHHHHCCCHHHHHC T ss_conf 89999999999999999746998767898765699999999999999762244578422828999999870694577538 Q ss_pred HHHH Q ss_conf 3332 Q gi|254780594|r 213 SERI 216 (230) Q Consensus 213 ~e~~ 216 (230) .+.+ T Consensus 112 ~~~e 115 (124) T cd03184 112 TDTE 115 (124) T ss_pred CCHH T ss_conf 9999 No 69 >TIGR00862 O-ClC intracellular chloride channel protein; InterPro: IPR002946 Chloride (Cl^-) channels are known to be present in the plasma and intracellular membranes of many cell types. Whilst the molecular identity of plasma membrane Cl^- channels has been known for some time, only recently have intracellular Cl^- channels been characterised. Initially, a 64 kDa protein was purified from solubilised kidney cortical membranes, which, when reconstituted into phospholipid vesicles, gave rise to chloride channel activity . An antibody generated against this protein (p64) was subsequently used to identify and clone it. The predicted amino acid sequence for p64 shows that it contains two, or possibly four, putative transmembrane (TM) domains and a number of possible phosphorylation sites. Expression studies have revealed that it is targeted to intracellular organelles, not the plasma membrane . Subsequently, a number of homologous cDNAs have been identified, which, when expressed, give rise to Cl^- channels in the membranes of both cell nuclei and endoplasmic reticulum , . They do not show significant sequence similarity to the CLC family of plasma membrane voltage-gated Cl^- channels.; GO: 0005247 voltage-gated chloride channel activity, 0006821 chloride transport, 0016020 membrane. Probab=99.06 E-value=4.4e-09 Score=71.44 Aligned_cols=184 Identities=14% Similarity=0.129 Sum_probs=116.1 Q ss_pred CCCHHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHCCCCCCCHHHHCCHHHHCCCCHHHHHHHHHCCCCCCCCCCCCC Q ss_conf 99868999999999869982999888677788999855687752011100011101210356777528872334335679 Q gi|254780594|r 8 PLSSSSRFIRLILSEYEFTTDMIEEFPWGKRREFLELNPSGTLPVYIDEHMQALCGVFVISEYLDETYGAIAQKNRLLST 87 (230) Q Consensus 8 ~~sp~~~kvr~~L~e~gi~y~~~~~~~~~~~~~~~~lnP~g~vP~L~~~dg~~l~eS~aI~~yL~~~~~~~~~~~~L~p~ 87 (230) ..||||+|.-++|..||+.|.+..++...++.++..+.|-+..|.|.. +..+-+|...|-++|++..... ..++|-+. T Consensus 17 GnCPf~qrlfm~lWlkGv~f~vttvd~~r~~~~~~~l~PG~~~Pfl~~-~~~~~td~~kieefle~~l~P~-~y~~l~~~ 94 (237) T TIGR00862 17 GNCPFSQRLFMILWLKGVVFNVTTVDLKRKPEELKNLAPGTHPPFLVY-NKEVKTDVNKIEEFLEEVLAPP-KYPKLAAK 94 (237) T ss_pred CCCCHHHHHHHHHHHCCEEEEEEEEECCCCHHHHHHHCCCCCCCEEEE-CCHHHHHHHHHHHHHHHHHCCH-HHHHHHCC T ss_conf 577246788888764040466675421125688985068988752555-3214567999999998750761-12443002 Q ss_pred CHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHH-------------- Q ss_conf 999999999999998612342577899998877664212210233439999999999999985565-------------- Q gi|254780594|r 88 DPLQRAETRRMIEWFMYQMEQDVTRPLAHERVYKLHMTTEQGGGSPDSKVLRIARNNMREHIKYIT-------------- 153 (230) Q Consensus 88 ~~~~ra~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~l~~le-------------- 153 (230) .......-..+..-+.. ..+..... .+....+...+.++..-++++ T Consensus 95 ~~es~~~G~~~f~kfsa--------------yikn~~~~------~n~~le~~l~k~l~~ld~yl~~Pl~~e~d~~~~~~ 154 (237) T TIGR00862 95 HKESNTAGLDIFAKFSA--------------YIKNSKKE------ANEALEKGLLKALKKLDDYLNSPLPEEIDEDSAED 154 (237) T ss_pred CCHHHHHHHHHHHHHHH--------------HHHHHHHH------HHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHCCH T ss_conf 31124556789888889--------------87510588------89899889999999988874066502420010000 Q ss_pred HHHCCCCCCCCCCCCHHHHHHHHHHHHHHH----CCCCC-HHHCHHHHHHHHHHHCCHHHHHHHH Q ss_conf 453479842667502899999999999996----18999-4567688999999970998898413 Q gi|254780594|r 154 WLIKSRSWIAGHHMSYADFAASATISILDY----LGEID-WDTAPIVKEWYQRMKSRPSFRPLLS 213 (230) Q Consensus 154 ~~l~~~~fl~Gd~~s~ADi~~~~~l~~~~~----~~~~~-~~~~p~l~~w~~r~~~rps~~~~l~ 213 (230) .....+.||.|+++|+|||.+.|-|..+.. +..++ .....-+.+|.....+|..|..+-+ T Consensus 155 ~~~s~r~fldG~~ltladC~llPkl~~~~v~~kkyr~f~~P~~~~G~~ryl~nay~r~ef~~tC~ 219 (237) T TIGR00862 155 EKVSRRKFLDGDELTLADCNLLPKLHIVKVVAKKYRNFDIPAEFTGVWRYLKNAYAREEFTNTCP 219 (237) T ss_pred HHHHHHHCCCCCCHHHHHHHHCCHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHHCCC T ss_conf 11112210265410233232144056777656652265650557899999876543556763066 No 70 >cd03210 GST_C_Pi GST_C family, Class Pi subfamily; GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins, and products of oxidative stress. The GST fold contains an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. GSH binds to the N-terminal domain while the hydrophobic substrate occupies a pocket in the C-terminal domain. Class Pi GST is a homodimeric eukaryotic protein. The human GSTP1 is mainly found in erythrocytes, kidney, placenta and fetal liver. It is involved in stress responses and in cellular proliferation pathways as an inhibitor of JNK (c-Jun N-terminal kinase). Following oxidative stress, monomeric GSTP1 dissociates from JNK and dimerizes, losing its ability to bind JNK and causing an incre Probab=99.03 E-value=6.5e-09 Score=70.46 Aligned_cols=79 Identities=22% Similarity=0.375 Sum_probs=64.8 Q ss_pred HHHHHHHHHHHHHHHHHHHCC---CCCCCCCCCCHHHHHHHHHHHHHHHCCCCCHHHCHHHHHHHHHHHCCHHHHHHHHH Q ss_conf 999999999998556545347---98426675028999999999999961899945676889999999709988984133 Q gi|254780594|r 138 LRIARNNMREHIKYITWLIKS---RSWIAGHHMSYADFAASATISILDYLGEIDWDTAPIVKEWYQRMKSRPSFRPLLSE 214 (230) Q Consensus 138 ~~~~~~~l~~~l~~le~~l~~---~~fl~Gd~~s~ADi~~~~~l~~~~~~~~~~~~~~p~l~~w~~r~~~rps~~~~l~e 214 (230) .+...+.+...|..+|..|+. +.|++|+++|+||++++..|..+..+....++.||+|+++++|+.++|.+++.++. T Consensus 33 K~~~~~~lp~~L~~fe~~L~~~~g~~f~vG~~lT~aD~~~f~~ld~~~~~~p~~L~~yP~L~af~~rv~~~P~Ik~Yl~S 112 (126) T cd03210 33 KDDYIKDLPEQLKPFEKLLSKNNGKGFIVGDKISFADYNLFDLLDIHLVLAPGCLDAFPLLKAFVERLSARPKLKAYLES 112 (126) T ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCCEEECCCCCHHHHHHHHHHHHHHHHCCHHHHHCHHHHHHHHHHHHCHHHHHHHCC T ss_conf 99999998899999999999769998675787537769999999999998912655487899999999818468998589 Q ss_pred HH Q ss_conf 32 Q gi|254780594|r 215 RI 216 (230) Q Consensus 215 ~~ 216 (230) .- T Consensus 113 ~~ 114 (126) T cd03210 113 DA 114 (126) T ss_pred CC T ss_conf 64 No 71 >KOG3028 consensus Probab=99.03 E-value=2.9e-08 Score=66.75 Aligned_cols=187 Identities=14% Similarity=0.095 Sum_probs=115.9 Q ss_pred CCHHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHCCCCCCCHHHHCCHHHHCCCCHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 98689999999998699829998886777889998556877520111000111012103567775288723343356799 Q gi|254780594|r 9 LSSSSRFIRLILSEYEFTTDMIEEFPWGKRREFLELNPSGTLPVYIDEHMQALCGVFVISEYLDETYGAIAQKNRLLSTD 88 (230) Q Consensus 9 ~sp~~~kvr~~L~e~gi~y~~~~~~~~~~~~~~~~lnP~g~vP~L~~~dg~~l~eS~aI~~yL~~~~~~~~~~~~L~p~~ 88 (230) .++-|..|.+++...+-+.+++...-.. .+|.|++|.|+.++|..+.+=.-|+.+|...-.....+.. .. T Consensus 16 id~~sL~~l~y~kl~~~~l~v~~ssN~~-------~s~sg~LP~l~~~ng~~va~~~~iv~~L~k~~~ky~~d~d---l~ 85 (313) T KOG3028 16 IDPDSLAALIYLKLAGAPLKVVVSSNPW-------RSPSGKLPYLITDNGTKVAGPVKIVQFLKKNTKKYNLDAD---LS 85 (313) T ss_pred CCHHHHHHHHHHHHHCCCCEEEEECCCC-------CCCCCCCCEEEECCCCEECCHHHHHHHHHHHCCCCCCCCC---HH T ss_conf 6763899999999718971168504888-------8977888758825885423689989999984005776812---77 Q ss_pred HHHHHHHHHHHHHHHHCCCHHHHHHHHHH---------HHHH-----------------HHHHHHHCCCCCHHHHHHHHH Q ss_conf 99999999999998612342577899998---------8776-----------------642122102334399999999 Q gi|254780594|r 89 PLQRAETRRMIEWFMYQMEQDVTRPLAHE---------RVYK-----------------LHMTTEQGGGSPDSKVLRIAR 142 (230) Q Consensus 89 ~~~ra~~~~~~~~~~~~~~~~~~~~~~~~---------~~~~-----------------~~~~~~~~~~~~~~~~~~~~~ 142 (230) ..+++....|..+....+++.+...++.+ .++. ...................-. T Consensus 86 ~kq~a~~~a~~sll~~~l~~a~~~t~~v~~~Ny~e~Tkk~yak~l~fP~n~~~p~~l~~qAk~rl~l~~g~~~~~e~~i~ 165 (313) T KOG3028 86 AKQLADTLAFMSLLEENLEPALLYTFWVDTENYNEVTKKWYAKALPFPLNYILPGKLQRQAKERLQLTLGELTEREDQIY 165 (313) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHCCCCCHHHCCHHHHHHHHHHHHHHHHCCCHHHHHHHH T ss_conf 88777789999999988899999998706553356767878863887322206136679999999998488605678999 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHCCC--C----CHHHCHHHHHHHHHHHCC Q ss_conf 99999985565453479842667502899999999999996189--9----945676889999999709 Q gi|254780594|r 143 NNMREHIKYITWLIKSRSWIAGHHMSYADFAASATISILDYLGE--I----DWDTAPIVKEWYQRMKSR 205 (230) Q Consensus 143 ~~l~~~l~~le~~l~~~~fl~Gd~~s~ADi~~~~~l~~~~~~~~--~----~~~~~p~l~~w~~r~~~r 205 (230) ....+++..++..|+++.|++||++|--|..+++.+..+-.... . -+..+++|.+|.+++++. T Consensus 166 ~~Aska~~~LS~~Lgs~kffFgd~psslDa~lfs~la~~~~~~Lp~~~Lq~~l~~~~NL~~~~~~i~s~ 234 (313) T KOG3028 166 KDASKALNLLSTLLGSKKFFFGDKPSSLDALLFSYLAILLQVALPNDSLQVHLLAHKNLVRYVERIRSL 234 (313) T ss_pred HHHHHHHHHHHHHHCCCEEEECCCCCHHHHHHHHHHHHHHHCCCCCHHHHHHHHHCCHHHHHHHHHHHH T ss_conf 999999999999846761763798736889999999999744487125899998554499999999988 No 72 >cd03039 GST_N_Sigma_like GST_N family, Class Sigma_like; composed of GSTs belonging to class Sigma and similar proteins, including GSTs from class Mu, Pi and Alpha. GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins and products of oxidative stress. The GST fold contains an N-terminal TRX-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. Vertebrate class Sigma GSTs are characterized as GSH-dependent hematopoietic prostaglandin (PG) D synthases and are responsible for the production of PGD2 by catalyzing the isomerization of PGH2. The functions of PGD2 include the maintenance of body temperature, inhibition of platelet aggregation, bronchoconstriction, vasodilation and mediation of allergy and inflammation. Other class Sigma Probab=98.99 E-value=2.5e-10 Score=78.62 Aligned_cols=70 Identities=20% Similarity=0.150 Sum_probs=58.5 Q ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCCCC-CHHHHHHCCCCCCCHHHHCCHHHHCCCCHHHHHHH Q ss_conf 521358998689999999998699829998886777-88999855687752011100011101210356777 Q gi|254780594|r 2 SILYHYPLSSSSRFIRLILSEYEFTTDMIEEFPWGK-RREFLELNPSGTLPVYIDEHMQALCGVFVISEYLD 72 (230) Q Consensus 2 ~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~~~-~~~~~~lnP~g~vP~L~~~dg~~l~eS~aI~~yL~ 72 (230) .+|++++....++.+|++|++.||+|+.+.++..+. ..+.....|.|++|+|++ ||..|++|.+|++||. T Consensus 1 ~~L~Yf~~rGrae~iRl~L~~~gi~yed~~~~~~~~~~~~~k~~~pfg~lP~l~~-~g~~i~qS~aI~ryla 71 (72) T cd03039 1 YKLTYFNIRGRGEPIRLLLADAGVEYEDVRITYEEWPELDLKPTLPFGQLPVLEI-DGKKLTQSNAILRYLA 71 (72) T ss_pred CEEEEECCCHHHHHHHHHHHHCCCCCEEEEECHHHHHHHHHCCCCCCCCCCEEEE-CCEEEEEHHHHHHHHC T ss_conf 9899808863679999999985998489860743416676435885788778999-9999860999999867 No 73 >KOG3027 consensus Probab=98.98 E-value=1.1e-07 Score=63.47 Aligned_cols=184 Identities=13% Similarity=0.169 Sum_probs=118.9 Q ss_pred CCHHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHCCCCCCCHHHHCCHHHHCCCCHHHHHHHHHCCCCCCCCCCCCCC Q ss_conf 98689999999998699829998886777889998556877520111000111012103567775288723343356799 Q gi|254780594|r 9 LSSSSRFIRLILSEYEFTTDMIEEFPWGKRREFLELNPSGTLPVYIDEHMQALCGVFVISEYLDETYGAIAQKNRLLSTD 88 (230) Q Consensus 9 ~sp~~~kvr~~L~e~gi~y~~~~~~~~~~~~~~~~lnP~g~vP~L~~~dg~~l~eS~aI~~yL~~~~~~~~~~~~L~p~~ 88 (230) -..-|..|..+|..-++||..+.- .+.|| ++|.|+||.|.. +.+++.|-..|+++.+.+--.. . + +-+ T Consensus 33 d~ascLAVqtfLrMcnLPf~v~~~----~Naef--mSP~G~vPllr~-g~~~~aef~pIV~fVeak~~~l--~-s--~ls 100 (257) T KOG3027 33 DNASCLAVQTFLRMCNLPFNVRQR----ANAEF--MSPGGKVPLLRI-GKTLFAEFEPIVDFVEAKGVTL--T-S--WLS 100 (257) T ss_pred CCHHHHHHHHHHHHCCCCCEEEEC----CCCCC--CCCCCCCCEEEE-CCHHHHHHHHHHHHHHHHCCCH--H-H--HHH T ss_conf 622579999999973897425655----87441--189987744665-6216540247999999853420--0-0--013 Q ss_pred HHHHHHHHHHHHHHHHCCCHH-HHHHHHHHHHHHHHHHHHHCCCCC------------------------HHHHHHHHHH Q ss_conf 999999999999986123425-778999988776642122102334------------------------3999999999 Q gi|254780594|r 89 PLQRAETRRMIEWFMYQMEQD-VTRPLAHERVYKLHMTTEQGGGSP------------------------DSKVLRIARN 143 (230) Q Consensus 89 ~~~ra~~~~~~~~~~~~~~~~-~~~~~~~~~~~~~~~~~~~~~~~~------------------------~~~~~~~~~~ 143 (230) ..+++.++..++...+.+... .......+..+........+...| +....+...+ T Consensus 101 E~qkadmra~vslVen~~t~aEl~~s~~de~ty~~vT~~R~gs~ypWPLs~i~~f~Krr~~~r~lk~~~W~~~~~DqVie 180 (257) T KOG3027 101 EDQKADMRAYVSLVENLLTTAELYVSWNDEETYDEVTALRYGSVYPWPLSHILPFVKRRKALRELKVYDWDDKTMDQVIE 180 (257) T ss_pred HHHHHHHHHHHHHHHHHHHHHHHHHHHCCHHHHHHHHHHCCCCCCCCCHHHHHHHHHHHHHHHHHHHCCCCCCCHHHHHH T ss_conf 67898899999999999999999998420777878755404887777289899888899999997653746353889999 Q ss_pred HHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHH--CCCC----CHHHCHHHHHHHHHHHC Q ss_conf 9999985565453479842667502899999999999996--1899----94567688999999970 Q gi|254780594|r 144 NMREHIKYITWLIKSRSWIAGHHMSYADFAASATISILDY--LGEI----DWDTAPIVKEWYQRMKS 204 (230) Q Consensus 144 ~l~~~l~~le~~l~~~~fl~Gd~~s~ADi~~~~~l~~~~~--~~~~----~~~~~p~l~~w~~r~~~ 204 (230) +...-.+.++.+|+.++|+.||++|-+|.-+++++..+-- +... ...+|++|.++.+|+.+ T Consensus 181 ~vdkc~~aLsa~L~~q~yf~g~~P~elDAlvFGHlytilTt~Lpn~ela~~lkkys~LlefcrrIeq 247 (257) T KOG3027 181 QVDKCCRALSAQLGSQPYFTGDQPTELDALVFGHLYTILTTRLPNMELANILKKYSNLLEFCRRIEQ 247 (257) T ss_pred HHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCCHHHHHHHHHHHHHHHHHHHHHH T ss_conf 9999999999986678765789961788899866688653037728999999986899999999999 No 74 >cd03076 GST_N_Pi GST_N family, Class Pi subfamily; GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins and products of oxidative stress. The GST fold contains an N-terminal TRX-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. Class Pi GST is a homodimeric eukaryotic protein. The human GSTP1 is mainly found in erythrocytes, kidney, placenta and fetal liver. It is involved in stress responses and in cellular proliferation pathways as an inhibitor of JNK (c-Jun N-terminal kinase). Following oxidative stress, monomeric GSTP1 dissociates from JNK and dimerizes, losing its ability to bind JNK and causing an increase in JNK activity, thereby promoting apoptosis. GSTP1 is expressed in various tumors and is the predominant GST in a w Probab=98.95 E-value=3.6e-10 Score=77.70 Aligned_cols=72 Identities=18% Similarity=0.140 Sum_probs=58.5 Q ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHCCCCCCCHHHHCCHHHHCCCCHHHHHHHHH Q ss_conf 5213589986899999999986998299988867778899985568775201110001110121035677752 Q gi|254780594|r 2 SILYHYPLSSSSRFIRLILSEYEFTTDMIEEFPWGKRREFLELNPSGTLPVYIDEHMQALCGVFVISEYLDET 74 (230) Q Consensus 2 ~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~~~~~~~~~lnP~g~vP~L~~~dg~~l~eS~aI~~yL~~~ 74 (230) ++|++++.-..+..+|++|+..|++|+-+.+...+-.+.+....|.|++|+|++ ||..|++|.||++||..+ T Consensus 2 ykL~YF~~rGraE~iRl~l~~~gv~fed~r~~~~~w~~~~k~~~pfgqlP~Le~-dg~~l~QS~AI~RYlarK 73 (73) T cd03076 2 YTLTYFPVRGRAEAIRLLLADQGISWEEERVTYEEWQESLKPKMLFGQLPCFKD-GDLTLVQSNAILRHLGRK 73 (73) T ss_pred EEEEEECCCHHHHHHHHHHHHCCCCCEEEEECHHHHHHHHCCCCCCCCCCEEEE-CCEEEEHHHHHHHHHHCC T ss_conf 299997984467999999998499873898473666677416997899988999-999987299999997379 No 75 >cd03077 GST_N_Alpha GST_N family, Class Alpha subfamily; GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins and products of oxidative stress. The GST fold contains an N-terminal TRX-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. The class Alpha subfamily is composed of eukaryotic GSTs which can form homodimer and heterodimers. There are at least six types of class Alpha GST subunits in rats, four of which have human counterparts, resulting in many possible isoenzymes with different activities, tissue distribution and substrate specificities. Human GSTA1-1 and GSTA2-2 show high GSH peroxidase activity. GSTA3-3 catalyzes the isomerization of intermediates in steroid hormone biosynthesis. GSTA4-4 preferentially catalyzes the Probab=98.87 E-value=1.3e-09 Score=74.56 Aligned_cols=74 Identities=12% Similarity=0.016 Sum_probs=58.1 Q ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHH--HCCCCCCCHHHHCCHHHHCCCCHHHHHHHHHCC Q ss_conf 521358998689999999998699829998886777889998--556877520111000111012103567775288 Q gi|254780594|r 2 SILYHYPLSSSSRFIRLILSEYEFTTDMIEEFPWGKRREFLE--LNPSGTLPVYIDEHMQALCGVFVISEYLDETYG 76 (230) Q Consensus 2 ~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~~~~~~~~~--lnP~g~vP~L~~~dg~~l~eS~aI~~yL~~~~~ 76 (230) .+||+++.-..+..+|++|+..||+|+-+.+...+.-+...+ .-|.|++|+|.+ ||..+++|.||++||+.+|+ T Consensus 2 PkL~YF~~rGraE~iRllla~aGvefeD~r~~~~e~~~~~k~~~~~pfgQ~P~lei-DG~~l~QS~AI~RYlark~g 77 (79) T cd03077 2 PVLHYFNGRGRMESIRWLLAAAGVEFEEKFIESAEDLEKLKKDGSLMFQQVPMVEI-DGMKLVQTRAILNYIAGKYN 77 (79) T ss_pred CEEEEECCCHHHHHHHHHHHHCCCCEEEECCCCHHHHHHHCCCCCCCCCCCCEEEE-CCEEEEHHHHHHHHHHHHCC T ss_conf 98999799415699999999819983473358567689854268798899888998-99897679999999998718 No 76 >cd03208 GST_C_Alpha GST_C family, Class Alpha subfamily; GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins, and products of oxidative stress. The GST fold contains an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. GSH binds to the N-terminal domain while the hydrophobic substrate occupies a pocket in the C-terminal domain. The class Alpha subfamily is composed of vertebrate GSTs which can form homodimer and heterodimers. There are at least six types of class Alpha GST subunits in rats, four of which have human counterparts, resulting in many possible isoenzymes with different activities, tissue distribution and substrate specificities. Human GSTA1-1 and GSTA2-2 show high GSH peroxidase activity. GS Probab=98.86 E-value=4.7e-08 Score=65.51 Aligned_cols=72 Identities=17% Similarity=0.328 Sum_probs=60.2 Q ss_pred HHHHHHHHHHHHHHHCC--CCCCCCCCCCHHHHHHHHHHHHHHHCCCCCHHHCHHHHHHHHHHHCCHHHHHHHH Q ss_conf 99999998556545347--9842667502899999999999996189994567688999999970998898413 Q gi|254780594|r 142 RNNMREHIKYITWLIKS--RSWIAGHHMSYADFAASATISILDYLGEIDWDTAPIVKEWYQRMKSRPSFRPLLS 213 (230) Q Consensus 142 ~~~l~~~l~~le~~l~~--~~fl~Gd~~s~ADi~~~~~l~~~~~~~~~~~~~~p~l~~w~~r~~~rps~~~~l~ 213 (230) .+...+.+..+|+.|++ +.|++|+++|+||+.++..+.....+....++++|+|+++.+||.++|.+++-+. T Consensus 41 ~ka~~~yl~~fEk~l~~~g~~flVG~kls~AD~~l~e~l~~~~~~~p~~L~~fP~Lkaf~~ri~~~P~IKkyLq 114 (137) T cd03208 41 EKAKNRYFPVFEKVLKSHGQDFLVGNKLSRADIHLLEAILMVEELDPSLLSDFPLLQAFKTRISNLPTIKKFLQ 114 (137) T ss_pred HHHHHHHHHHHHHHHHHCCCCEEECCCCHHHHHHHHHHHHHHHHHCHHHHHCCCHHHHHHHHHHCCHHHHHHHC T ss_conf 50036778999999997799745358531889999999999998673676348248999999981856899848 No 77 >cd03196 GST_C_5 GST_C family, unknown subfamily 5; composed of uncharacterized bacterial proteins with similarity to GSTs. GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins, and products of oxidative stress. GSTs also show GSH peroxidase activity and are involved in the synthesis of prostaglandins and leukotrienes. The GST fold contains an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. GSH binds to the N-terminal domain while the hydrophobic substrate occupies a pocket in the C-terminal domain. Probab=98.84 E-value=2.8e-08 Score=66.84 Aligned_cols=110 Identities=19% Similarity=0.309 Sum_probs=77.2 Q ss_pred CHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCC Q ss_conf 99999999999999861234257789999887766421221023343999999999999998556545347984266750 Q gi|254780594|r 88 DPLQRAETRRMIEWFMYQMEQDVTRPLAHERVYKLHMTTEQGGGSPDSKVLRIARNNMREHIKYITWLIKSRSWIAGHHM 167 (230) Q Consensus 88 ~~~~ra~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~l~~le~~l~~~~fl~Gd~~ 167 (230) ++...+++..|+...++.|-..+-. ++.. .. .+ ....+..+.+....|..+|..|++++||+|+++ T Consensus 3 ~~~~~~~~l~LI~~nD~~FK~~LDr-------YKY~--~R----y~-~~~~~~~R~~~~~fL~~Le~~L~~~~~L~g~~~ 68 (115) T cd03196 3 DPAALKEMLALIAENDNEFKHHLDR-------YKYA--DR----YP-EESEEEYRQQAEAFLKDLEARLQQHSYLLGDKP 68 (115) T ss_pred CHHHHHHHHHHHHHHHHHHHHHHHH-------CCCC--CC----CC-CCCHHHHHHHHHHHHHHHHHHHCCCCCCCCCCC T ss_conf 8078999999999973878999861-------4799--88----99-989999999999999999998750867778986 Q ss_pred CHHHHHHHHHHHHHHHCCC--CCHHHCHHHHHHHHHHHCCHHHHHH Q ss_conf 2899999999999996189--9945676889999999709988984 Q gi|254780594|r 168 SYADFAASATISILDYLGE--IDWDTAPIVKEWYQRMKSRPSFRPL 211 (230) Q Consensus 168 s~ADi~~~~~l~~~~~~~~--~~~~~~p~l~~w~~r~~~rps~~~~ 211 (230) |+||++++|++..+..... ++-..+|+|.+|.++..+.+-|..+ T Consensus 69 slaD~AI~PFVRQFa~vD~~WF~~~~~~~l~~WL~~~l~s~lF~~v 114 (115) T cd03196 69 SLADWAIFPFVRQFAHVDPKWFDQSPYPRLRRWLNGFLASPLFSKI 114 (115) T ss_pred CHHHHHHHHHHHHHHCCCHHHHHCCCCHHHHHHHHHHHCCHHHHHH T ss_conf 6336884328999882798887618979999999998878998976 No 78 >cd03195 GST_C_4 GST_C family, unknown subfamily 4; composed of uncharacterized proteins with similarity to GSTs. GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins and products of oxidative stress. GSTs also show GSH peroxidase activity and are involved in the synthesis of prostaglandins and leukotrienes. The GST fold contains an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. GSH binds to the N-terminal domain while the hydrophobic substrate occupies a pocket in the C-terminal domain. Probab=98.79 E-value=5.2e-08 Score=65.26 Aligned_cols=111 Identities=21% Similarity=0.297 Sum_probs=72.4 Q ss_pred HHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHCC-CCCCCCCCC Q ss_conf 9999999999999861234257789999887766421221023343999999999999998556545347-984266750 Q gi|254780594|r 89 PLQRAETRRMIEWFMYQMEQDVTRPLAHERVYKLHMTTEQGGGSPDSKVLRIARNNMREHIKYITWLIKS-RSWIAGHHM 167 (230) Q Consensus 89 ~~~ra~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~l~~le~~l~~-~~fl~Gd~~ 167 (230) |.+||+++++.+|..+.|.+ +-.+.-..-... .....+-. ..++..+.+.+...+..+.+ ++||+| ++ T Consensus 1 p~~RArAR~vqA~m~S~f~~-----lR~e~p~~vvf~--~~~~~pLs---~a~~ad~~rl~~~a~~ll~~g~p~LFG-~w 69 (114) T cd03195 1 PRQRARARQVQAWLRSDLLP-----IRVERSTEVVFA--GAKAEPLS---EAAQAAAEKLIAVAEALLPPGAANLFG-EW 69 (114) T ss_pred CHHHHHHHHHHHHHHCCHHH-----HHHHCCHHHHHC--CCCCCCCC---HHHHHHHHHHHHHHHHHHHCCCCCCCC-CC T ss_conf 93669999999999864499-----885398256752--57889999---999999999999999987468986425-34 Q ss_pred CHHHHHHHHHHHHHHHCCCCCHHHCHHHHHHHHHHHCCHHHHHHHH Q ss_conf 2899999999999996189994567688999999970998898413 Q gi|254780594|r 168 SYADFAASATISILDYLGEIDWDTAPIVKEWYQRMKSRPSFRPLLS 213 (230) Q Consensus 168 s~ADi~~~~~l~~~~~~~~~~~~~~p~l~~w~~r~~~rps~~~~l~ 213 (230) |+||..+++++..+..+|. + --+.+.+|.++..+|||+|++++ T Consensus 70 sIADa~~Apv~~Rl~~yg~-~--vp~~~~~Ya~~~~~~psvq~Wia 112 (114) T cd03195 70 CIADTDLALMLNRLVLNGD-P--VPERLRDYARRQWQRPSVQAWLA 112 (114) T ss_pred HHHHHHHHHHHHHHHHCCC-C--CCHHHHHHHHHHHCCHHHHHHHH T ss_conf 3998999999999988389-9--99899999999987889999983 No 79 >cd03054 GST_N_Metaxin GST_N family, Metaxin subfamily; composed of metaxins and related proteins. Metaxin 1 is a component of a preprotein import complex of the mitochondrial outer membrane. It extends to the cytosol and is anchored to the mitochondrial membrane through its C-terminal domain. In mice, metaxin is required for embryonic development. In humans, alterations in the metaxin gene may be associated with Gaucher disease. Metaxin 2 binds to metaxin 1 and may also play a role in protein translocation into the mitochondria. Genome sequencing shows that a third metaxin gene also exists in zebrafish, Xenopus, chicken and mammals. Sequence analysis suggests that all three metaxins share a common ancestry and that they possess similarity to GSTs. Also included in the subfamily are uncharacterized proteins with similarity to metaxins, including a novel GST from Rhodococcus with toluene o-monooxygenase and glutamylcysteine synthetase activities. Probab=98.79 E-value=3.8e-09 Score=71.80 Aligned_cols=65 Identities=20% Similarity=0.192 Sum_probs=53.3 Q ss_pred CCCCCCC-------CCHHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHCCCCCCCHHHHCCHHHHCCCCHHHHHHHHH Q ss_conf 5213589-------986899999999986998299988867778899985568775201110001110121035677752 Q gi|254780594|r 2 SILYHYP-------LSSSSRFIRLILSEYEFTTDMIEEFPWGKRREFLELNPSGTLPVYIDEHMQALCGVFVISEYLDET 74 (230) Q Consensus 2 ~~LY~~~-------~sp~~~kvr~~L~e~gi~y~~~~~~~~~~~~~~~~lnP~g~vP~L~~~dg~~l~eS~aI~~yL~~~ 74 (230) ++||.++ .||+|.||.+.|...|++|+++..... ..+|.|++|++++ +|+.+.||..|++||+++ T Consensus 1 ~~L~~~~~~~glps~sp~clkv~~~Lr~~~ip~~v~~~~~~-------~~sp~GklP~i~~-~g~~i~ds~~Ii~~L~~~ 72 (72) T cd03054 1 LELYQWGRAFGLPSLSPECLKVETYLRMAGIPYEVVFSSNP-------WRSPTGKLPFLEL-NGEKIADSEKIIEYLKKK 72 (72) T ss_pred CEEEECCCCCCCCCCCHHHHHHHHHHHHCCCCEEEECCCCC-------CCCCCCCCCEEEE-CCEEECCHHHHHHHHHCC T ss_conf 97898189889988887899999999975997450005674-------4486774388998-998986899999999609 No 80 >cd03194 GST_C_3 GST_C family, unknown subfamily 3; composed of uncharacterized proteins with similarity to GSTs. GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins, and products of oxidative stress. GSTs also show GSH peroxidase activity and are involved in the synthesis of prostaglandins and leukotrienes. The GST fold contains an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. GSH binds to the N-terminal domain while the hydrophobic substrate occupies a pocket in the C-terminal domain. Probab=98.74 E-value=1.7e-07 Score=62.25 Aligned_cols=113 Identities=12% Similarity=0.109 Sum_probs=70.6 Q ss_pred HHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCC Q ss_conf 99999999999998612342577899998877664212210233439999999999999985565453479842667502 Q gi|254780594|r 89 PLQRAETRRMIEWFMYQMEQDVTRPLAHERVYKLHMTTEQGGGSPDSKVLRIARNNMREHIKYITWLIKSRSWIAGHHMS 168 (230) Q Consensus 89 ~~~ra~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~l~~le~~l~~~~fl~Gd~~s 168 (230) +..||.++...++..+.|.. +... .+..........+..+.++.-..++.+.....-..-++++||+| +|| T Consensus 1 pa~RA~ARs~~AEMHSGF~a-LR~~-------~pMn~~~~~~~~~~~~~~~~di~Ri~~iw~~~~~~~~~GpfLFG-~fs 71 (114) T cd03194 1 PAARAWARSAAAEMHSGFAA-LRSE-------CPMNLRARVPGFELSEAVQADIARIEAIWAECLARFQGGPFLFG-DFS 71 (114) T ss_pred CHHHHHHHHHHHHHHCCHHH-HHHH-------CCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCCCC-CCH T ss_conf 93789999999999875899-9876-------87643468899998999999999999999999986289986567-551 Q ss_pred HHHHHHHHHHHHHHHCCCCCHHHCHHHHHHHHHHHCCHHHHHHHH Q ss_conf 899999999999996189994567688999999970998898413 Q gi|254780594|r 169 YADFAASATISILDYLGEIDWDTAPIVKEWYQRMKSRPSFRPLLS 213 (230) Q Consensus 169 ~ADi~~~~~l~~~~~~~~~~~~~~p~l~~w~~r~~~rps~~~~l~ 213 (230) +||..++|.+..+..++. + -.+...+|.+++.+.|++|++++ T Consensus 72 iADaffAPvv~Rf~tY~~-~--l~~~a~~Y~~~~l~~P~~~~W~~ 113 (114) T cd03194 72 IADAFFAPVVTRFRTYGL-P--LSPAAQAYVDALLAHPAMQEWIA 113 (114) T ss_pred HHHHHHHHHHHHHHHCCC-C--CCHHHHHHHHHHHCCHHHHHHHH T ss_conf 999999999999989179-9--99999999999988979999974 No 81 >cd03203 GST_C_Lambda GST_C family, Class Lambda subfamily; composed of plant-specific class Lambda GSTs. GSTs are cytosolic, usually dimeric, proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins, and products of oxidative stress. The GST fold contains an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. GSH binds to the N-terminal domain while the hydrophobic substrate occupies a pocket in the C-terminal domain. The class Lambda subfamily was recently discovered, together with dehydroascorbate reductases (DHARs), as two outlying groups of the GST superfamily in Arabidopsis thaliana, which contain conserved active site cysteines. Characterization of recombinant A. thaliana proteins show that Lambda class GSTs are monomeric, similar Probab=98.71 E-value=5.2e-08 Score=65.26 Aligned_cols=104 Identities=17% Similarity=0.291 Sum_probs=66.6 Q ss_pred CHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHHHHHHHHHHH---CCCCCCCC Q ss_conf 999999999999998612342577899998877664212210233439999999999999985565453---47984266 Q gi|254780594|r 88 DPLQRAETRRMIEWFMYQMEQDVTRPLAHERVYKLHMTTEQGGGSPDSKVLRIARNNMREHIKYITWLI---KSRSWIAG 164 (230) Q Consensus 88 ~~~~ra~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~l~~le~~l---~~~~fl~G 164 (230) |+..|.-++.|+++.+... ..+... ... +.. .... ...|+.+|+.| +++||++| T Consensus 1 Dpakr~faeeLl~y~dtf~-~~~~~~---------l~~-----g~~-~~e~-------~aald~lE~aL~Kf~dGPfFLG 57 (120) T cd03203 1 DPAKREFADELLAYTDAFT-KALYSS---------LIK-----GDP-SAEA-------AAALDYIENALSKFDDGPFFLG 57 (120) T ss_pred CCHHHHHHHHHHHHHHHHH-HHHHHH---------HCC-----CCH-HHHH-------HHHHHHHHHHHHCCCCCCEEEC T ss_conf 9178999999999999998-999999---------806-----852-7788-------9999999999863589984333 Q ss_pred CCCCHHHHHHHHHHHHHH----HCCCCC-HHHCHHHHHHHHHHHCCHHHHHHHHHH Q ss_conf 750289999999999999----618999-456768899999997099889841333 Q gi|254780594|r 165 HHMSYADFAASATISILD----YLGEID-WDTAPIVKEWYQRMKSRPSFRPLLSER 215 (230) Q Consensus 165 d~~s~ADi~~~~~l~~~~----~~~~~~-~~~~p~l~~w~~r~~~rps~~~~l~e~ 215 (230) ++|++||+.+|++-++. .+...+ .+++|+|.+|++.|.+|++++++-.|. T Consensus 58 -qfSLVDIay~PfvER~~~~l~~~k~ydi~~~rP~L~~W~eam~~reaY~~Tk~Dp 112 (120) T cd03203 58 -QFSLVDIAYVPFIERFQIFLSELFNYDITEGRPNLAAWIEEMNKIEAYTQTKQDP 112 (120) T ss_pred -CCEEEEEEECHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHCCHHHHHHCCCH T ss_conf -4203320100199999999999728416689912999999986665576444799 No 82 >cd03201 GST_C_DHAR GST_C family, Dehydroascorbate Reductase (DHAR) subfamily; composed of plant-specific DHARs, monomeric enzymes catalyzing the reduction of DHA into ascorbic acid (AsA) using glutathione as the reductant. DHAR allows plants to recycle oxidized AsA before it is lost. AsA serves as a cofactor of violaxanthin de-epoxidase in the xanthophyll cycle and as an antioxidant in the detoxification of reactive oxygen species. Because AsA is the major reductant in plants, DHAR serves to regulate their redox state. It has been suggested that a significant portion of DHAR activity is plastidic, acting to reduce the large amounts of ascorbate oxidized during hydrogen peroxide scavenging by ascorbate peroxidase. DHAR contains a conserved cysteine in its active site and in addition to its reductase activity, shows thiol transferase activity similar to glutaredoxins. Probab=98.64 E-value=1.1e-07 Score=63.48 Aligned_cols=71 Identities=17% Similarity=0.248 Sum_probs=57.3 Q ss_pred HHHHHHHHHHHHHHCC-CCCCCCCCCCHHHHHHHHHHHHHH----HCCCCC-HHHCHHHHHHHHHHHCCHHHHHHHH Q ss_conf 9999998556545347-984266750289999999999999----618999-4567688999999970998898413 Q gi|254780594|r 143 NNMREHIKYITWLIKS-RSWIAGHHMSYADFAASATISILD----YLGEID-WDTAPIVKEWYQRMKSRPSFRPLLS 213 (230) Q Consensus 143 ~~l~~~l~~le~~l~~-~~fl~Gd~~s~ADi~~~~~l~~~~----~~~~~~-~~~~p~l~~w~~r~~~rps~~~~l~ 213 (230) +.+...|+.++..|++ +||+.|+++|+|||.++|.|..+. ++.... .++++.|.+|++.+.+|++|+.+-+ T Consensus 33 ~~Ll~eL~~ld~~L~~~gPfl~G~~lt~aDc~L~PKL~hl~valk~yk~~~IPe~l~~v~~Yl~~~~~~e~F~~T~p 109 (121) T cd03201 33 QALLDELEALEDHLKENGPFINGEKISAVDLSLAPKLYHLEIALGHYKNWSVPESLTSVKSYMKALFSRESFVKTKA 109 (121) T ss_pred HHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 99999999999998457997688945655364412799999999986289981342799999999875687862699 No 83 >cd03075 GST_N_Mu GST_N family, Class Mu subfamily; GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins and products of oxidative stress. The GST fold contains an N-terminal TRX-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. The class Mu subfamily is composed of eukaryotic GSTs. In rats, at least six distinct class Mu subunits have been identified, with homologous genes in humans for five of these subunits. Class Mu GSTs can form homodimers and heterodimers, giving a large number of possible isoenzymes that can be formed, all with overlapping activities but different substrate specificities. They are the most abundant GSTs in human liver, skeletal muscle and brain, and are believed to provide protection against diseases inc Probab=98.57 E-value=6.3e-08 Score=64.79 Aligned_cols=73 Identities=21% Similarity=0.115 Sum_probs=56.5 Q ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCC----CCCHHHHH-----HCCCCCCCHHHHCCHHHHCCCCHHHHHHH Q ss_conf 5213589986899999999986998299988867----77889998-----55687752011100011101210356777 Q gi|254780594|r 2 SILYHYPLSSSSRFIRLILSEYEFTTDMIEEFPW----GKRREFLE-----LNPSGTLPVYIDEHMQALCGVFVISEYLD 72 (230) Q Consensus 2 ~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~----~~~~~~~~-----lnP~g~vP~L~~~dg~~l~eS~aI~~yL~ 72 (230) .||.+++.-..++-+|++|+..|++|+-+..... ...+++.+ ..|.+++|.|++ |+..|++|.||++||. T Consensus 1 p~L~Y~~~RG~ae~iR~lL~y~g~~yed~ry~~~~~~~~~~~~w~~~K~~~~~~f~nLP~l~d-g~~~l~QS~AI~rYla 79 (82) T cd03075 1 PTLGYWDIRGLAQPIRLLLEYTGEKYEEKRYELGDAPDYDRSQWLNEKFKLGLDFPNLPYYID-GDVKLTQSNAILRYIA 79 (82) T ss_pred CEEEEECCCHHHHHHHHHHHHCCCCCEEEEECCCCCCCCCHHHHHHHCHHCCCCCCCCCEEEE-CCEEEEEHHHHHHHHH T ss_conf 988887773351799999998299846888537886645778987402222895788998998-9999864999999986 Q ss_pred HHC Q ss_conf 528 Q gi|254780594|r 73 ETY 75 (230) Q Consensus 73 ~~~ 75 (230) .+| T Consensus 80 rK~ 82 (82) T cd03075 80 RKH 82 (82) T ss_pred HCC T ss_conf 019 No 84 >cd03192 GST_C_Sigma_like GST_C family, Class Sigma_like; composed of GSTs belonging to class Sigma and similar proteins, including GSTs from class Mu, Pi, and Alpha. GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins, and products of oxidative stress. The GST fold contains an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. GSH binds to the N-terminal domain while the hydrophobic substrate occupies a pocket in the C-terminal domain. Vertebrate class Sigma GSTs are characterized as GSH-dependent hematopoietic prostaglandin (PG) D synthases and are responsible for the production of PGD2 by catalyzing the isomerization of PGH2. The functions of PGD2 include the maintenance of body temperature, inhibition Probab=98.56 E-value=4.9e-07 Score=59.68 Aligned_cols=62 Identities=18% Similarity=0.351 Sum_probs=52.0 Q ss_pred HHHHHHHHHHHHHHHHCC--CCCCCCCCCCHHHHHHHHHHHHHHHCCCCC-HHHCHHHHHHHHHH Q ss_conf 999999998556545347--984266750289999999999999618999-45676889999999 Q gi|254780594|r 141 ARNNMREHIKYITWLIKS--RSWIAGHHMSYADFAASATISILDYLGEID-WDTAPIVKEWYQRM 202 (230) Q Consensus 141 ~~~~l~~~l~~le~~l~~--~~fl~Gd~~s~ADi~~~~~l~~~~~~~~~~-~~~~p~l~~w~~r~ 202 (230) ....+...|..+|+.|++ +.|++|+++|+||++++..+..+...+... .+.||+|+++++|+ T Consensus 40 ~~~~~p~~l~~~e~~L~~~~~~f~vG~~lT~aDl~~~~~ld~~~~~~~~~~l~~yP~L~~~~~rv 104 (104) T cd03192 40 LKEAIPKYLKKLEKILKENGGGYLVGDKLTWADLVVFDVLDYLLYLDPKLLLKKYPKLKALRERV 104 (104) T ss_pred HHHHHHHHHHHHHHHHHHCCCCEEECCCCCHHHHHHHHHHHHHHHHCCCCHHHCCHHHHHHHHHC T ss_conf 87445799999999999779986468987698999999999999868512021296689998629 No 85 >cd03205 GST_C_6 GST_C family, unknown subfamily 6; composed of uncharacterized bacterial proteins with similarity to GSTs. GSTs are cytosolic dimeric proteins involved in cellular detoxification by catalyzing the conjugation of glutathione (GSH) with a wide range of endogenous and xenobiotic alkylating agents, including carcinogens, therapeutic drugs, environmental toxins, and products of oxidative stress. GSTs also show GSH peroxidase activity and are involved in the synthesis of prostaglandins and leukotrienes. The GST fold contains an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. GSH binds to the N-terminal domain while the hydrophobic substrate occupies a pocket in the C-terminal domain. Probab=98.48 E-value=1.6e-06 Score=56.78 Aligned_cols=67 Identities=19% Similarity=0.443 Sum_probs=56.2 Q ss_pred CHHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHH-CCCCCH-HHCHHHHHHHHHH Q ss_conf 439999999999999985565453479842667502899999999999996-189994-5676889999999 Q gi|254780594|r 133 PDSKVLRIARNNMREHIKYITWLIKSRSWIAGHHMSYADFAASATISILDY-LGEIDW-DTAPIVKEWYQRM 202 (230) Q Consensus 133 ~~~~~~~~~~~~l~~~l~~le~~l~~~~fl~Gd~~s~ADi~~~~~l~~~~~-~~~~~~-~~~p~l~~w~~r~ 202 (230) .+..+++....++.+.|+.||+.+.+. .|+.+|++||+++|.|.++++ ++..+| ..+|+|.+|++++ T Consensus 30 ~~~~w~~r~~~ki~r~l~~LE~~~~~~---~~~~~~~~~Ia~a~aL~YldfR~p~~~Wr~~~P~L~~W~~~f 98 (98) T cd03205 30 RSQPWLERQRGKIERALDALEAELAKL---PLDPLDLADIAVACALGYLDFRHPDLDWRAAHPALAAWYARF 98 (98) T ss_pred CCHHHHHHHHHHHHHHHHHHHHHHHHC---CCCCCCHHHHHHHHHHHHHHHHCCCCCCCCCCHHHHHHHHHC T ss_conf 379999999999999999999776433---789888999999999999998776767410295899998709 No 86 >cd03202 GST_C_etherase_LigE GST_C family, Beta etherase LigE subfamily; composed of proteins similar to Sphingomonas paucimobilis beta etherase, LigE, a GST-like protein that catalyzes the cleavage of the beta-aryl ether linkages present in low-moleculer weight lignins using GSH as the hydrogen donor. This reaction is an essential step in the degradation of lignin, a complex phenolic polymer that is the most abundant aromatic material in the biosphere. The beta etherase activity of LigE is enantioselective and it complements the activity of the other GST family beta etherase, LigF. The GST fold contains an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain, with an active site located in a cleft between the two domains. Probab=98.46 E-value=2.1e-06 Score=56.09 Aligned_cols=66 Identities=17% Similarity=0.338 Sum_probs=55.3 Q ss_pred HHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCC-HHHCHHHHHHHHHHHC Q ss_conf 99999999998556545347984266750289999999999999618999-4567688999999970 Q gi|254780594|r 139 RIARNNMREHIKYITWLIKSRSWIAGHHMSYADFAASATISILDYLGEID-WDTAPIVKEWYQRMKS 204 (230) Q Consensus 139 ~~~~~~l~~~l~~le~~l~~~~fl~Gd~~s~ADi~~~~~l~~~~~~~~~~-~~~~p~l~~w~~r~~~ 204 (230) +....++.+.|+-+.+.|++++||.|+.+++||+.+++.+.|+...+... ++.-..+.+|++||-. T Consensus 57 ~~~~~~~~~~L~PlR~~L~~q~fl~G~~P~~aDyivFG~fqWar~~S~~~ll~~dDpV~~W~er~Ld 123 (124) T cd03202 57 EAALANFRAALEPLRATLKGQPFLGGAAPNYADYIVFGGFQWARIVSPFPLLEEDDPVYDWFERCLD 123 (124) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHHCCCCCCCCCCCHHHHHHHHHHC T ss_conf 8799999999989999985399889899980169998599999982865358999819999998758 No 87 >cd03200 GST_C_JTV1 GST_C family, JTV-1 subfamily; composed of uncharacterized proteins with similarity to the translation product of the human JTV-1 gene. Human JTV-1, a gene of unknown function, initiates within the human PMS2 gene promoter, but is transcribed from the opposite strand. PMS2 encodes a protein involved in DNA mismatch repair and is mutated in a subset of patients with hereditary nonpolyposis colon cancer. It is unknown whether the expression of JTV-1 affects that of PMS2, or vice versa, as a result of their juxtaposition. JTV-1 is up-regulated while PMS2 is down-regulated in tumor cell spheroids that show increased resistance to anticancer cytotoxic drugs compared with tumor cell monolayers indicating that suppressed DNA mismatch repair may be a mechanism for multicellular resistance to alkylating agents. Probab=98.44 E-value=1.3e-06 Score=57.28 Aligned_cols=96 Identities=10% Similarity=0.268 Sum_probs=63.0 Q ss_pred HHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCCCHHHHHHHHHHHHHH Q ss_conf 56777528872334335679999999999999998612342577899998877664212210233439999999999999 Q gi|254780594|r 68 SEYLDETYGAIAQKNRLLSTDPLQRAETRRMIEWFMYQMEQDVTRPLAHERVYKLHMTTEQGGGSPDSKVLRIARNNMRE 147 (230) Q Consensus 68 ~~yL~~~~~~~~~~~~L~p~~~~~ra~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~ 147 (230) ++||...++. .+.++...-+++.|++.+...+.. ...++-.. T Consensus 1 aR~L~rl~~~-------~~~~~~~~t~ID~wlD~A~~ql~~-------------------------------GsskE~~a 42 (96) T cd03200 1 ARFLYRLLGP-------APNAPNAATNIDSWVDTAIFQLAE-------------------------------GSSKEKAA 42 (96) T ss_pred CCHHHHHHCC-------CCCCHHHHHHHHHHHHHHHHHHHC-------------------------------CCCHHHHH T ss_conf 9158875088-------878718899999999999999971-------------------------------77176879 Q ss_pred HHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCHHHCHHHHHHHHHHHCC Q ss_conf 9855654534798426675028999999999999961899945676889999999709 Q gi|254780594|r 148 HIKYITWLIKSRSWIAGHHMSYADFAASATISILDYLGEIDWDTAPIVKEWYQRMKSR 205 (230) Q Consensus 148 ~l~~le~~l~~~~fl~Gd~~s~ADi~~~~~l~~~~~~~~~~~~~~p~l~~w~~r~~~r 205 (230) .++.++..|+.++||+|+++|+|||++++.+....... .--.|+++|++.|++. T Consensus 43 ~lr~LN~~LGks~wLvG~~lslADi~~wsal~qtg~~s----~~P~NVqrW~ksc~nl 96 (96) T cd03200 43 VLRALNSALGRSPWLVGSEFTVADIVSWCALLQTGLAS----AAPANVQRWLKSCENL 96 (96) T ss_pred HHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHCCCCC----CCCHHHHHHHHHHCCC T ss_conf 99999998688971037857799999999999607756----4757899999983469 No 88 >cd03197 GST_C_mPGES2 GST_C family; microsomal Prostaglandin E synthase Type 2 (mPGES2) subfamily; mPGES2 is a membrane-anchored dimeric protein containing a CXXC motif which catalyzes the isomerization of PGH2 to PGE2. Unlike cytosolic PGE synthase (cPGES) and microsomal PGES Type 1 (mPGES1), mPGES2 does not require glutathione (GSH) for its activity, although its catalytic rate is increased two- to four-fold in the presence of DTT, GSH, or other thiol compounds. PGE2 is widely distributed in various tissues and is implicated in the sleep/wake cycle, relaxation/contraction of smooth muscle, excretion of sodium ions, maintenance of body temperature, and mediation of inflammation. mPGES2 contains an N-terminal hydrophobic domain which is membrane associated and a C-terminal soluble domain with a GST-like structure. The C-terminus contains two structural domains a N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain. The GST active site is located in a cleft between t Probab=98.31 E-value=5.4e-06 Score=53.70 Aligned_cols=60 Identities=20% Similarity=0.359 Sum_probs=45.2 Q ss_pred HHHHHHHHHHHH-CCCCCCCCCCCCHHHHHHHHHHHHHHHCCCC-CHHHCHHHHHHHHHHHC Q ss_conf 999985565453-4798426675028999999999999961899-94567688999999970 Q gi|254780594|r 145 MREHIKYITWLI-KSRSWIAGHHMSYADFAASATISILDYLGEI-DWDTAPIVKEWYQRMKS 204 (230) Q Consensus 145 l~~~l~~le~~l-~~~~fl~Gd~~s~ADi~~~~~l~~~~~~~~~-~~~~~p~l~~w~~r~~~ 204 (230) +...++..-..+ ++++|+.|++|.+||+++++.|..+..+..+ +..+.+++..||+||++ T Consensus 84 L~~~l~~w~~~~~~~~~F~GG~~PnLADlavyGvlrsieg~~~f~d~~~~t~i~~Wy~rM~~ 145 (149) T cd03197 84 LYDALNTWVAALGKDRQFHGGSKPNLADLAVYGVLRSVEGHPAFKDMVEETKIGEWYERMDA 145 (149) T ss_pred HHHHHHHHHHHHCCCCCCCCCCCCCHHHHHHHHHHHHCCCCHHHHHHHHCCCHHHHHHHHHH T ss_conf 99999999998369998789999874489886265740364379999985856799999999 No 89 >cd03212 GST_C_Metaxin1_3 GST_C family, Metaxin subfamily, Metaxin 1-like proteins; composed of metaxins 1 and 3, and similar proteins. Mammalian metaxin (or metaxin 1) is a component of the preprotein import complex of the mitochondrial outer membrane. Metaxin extends to the cytosol and is anchored to the mitochondrial membrane through its C-terminal domain. In mice, metaxin is required for embryonic development. Like the murine gene, the human metaxin gene is located downstream to the glucocerebrosidase (GBA) pseudogene and is convergently transcribed. Inherited deficiency of GBA results in Gaucher disease, which presents many diverse clinical phenotypes. Alterations in the metaxin gene, in addition to GBA mutations, may be associated with Gaucher disease. Genome sequencing shows that a third metaxin gene also exists in zebrafish, Xenopus, chicken, and mammals. Probab=98.24 E-value=2.1e-05 Score=50.31 Aligned_cols=68 Identities=10% Similarity=0.052 Sum_probs=53.5 Q ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCC------CHHHCHHHHHHHHHHHCC Q ss_conf 99999999999855654534798426675028999999999999961899------945676889999999709 Q gi|254780594|r 138 LRIARNNMREHIKYITWLIKSRSWIAGHHMSYADFAASATISILDYLGEI------DWDTAPIVKEWYQRMKSR 205 (230) Q Consensus 138 ~~~~~~~l~~~l~~le~~l~~~~fl~Gd~~s~ADi~~~~~l~~~~~~~~~------~~~~~p~l~~w~~r~~~r 205 (230) .+.-.+...+.++.+...|++++|+.||++|-.|..++++|..+-..... -..++|||.++.+||+++ T Consensus 62 ~~ei~~~a~~~~~aLS~~Lg~~~fFfGd~Pt~lDA~vfg~l~~il~~~l~~~~L~~~l~~~~NLv~~~~Ri~~~ 135 (137) T cd03212 62 EAEIYRDAKECLNLLSQRLGESQFFFGDTPTSLDALVFGYLAPLLKAPLPNNKLQNHLKQCPNLCRFCDRILSL 135 (137) T ss_pred HHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCHHHHHHHHHCCCHHHHHHHHHHH T ss_conf 99999999999999999967798268999729999999999999816899869999999794199999999875 No 90 >cd03193 GST_C_Metaxin GST_C family, Metaxin subfamily; composed of metaxins and related proteins. Metaxin 1 is a component of a preprotein import complex of the mitochondrial outer membrane. It extends to the cytosol and is anchored to the mitochondrial membrane through its C-terminal domain. In mice, metaxin is required for embryonic development. In humans, alterations in the metaxin gene may be associated with Gaucher disease. Metaxin 2 binds to metaxin 1 and may also play a role in protein translocation into the mitochondria. Genome sequencing shows that a third metaxin gene also exists in zebrafish, Xenopus, chicken, and mammals. Sequence analysis suggests that all three metaxins share a common ancestry and that they possess similarity to GSTs. Also included in the subfamily are uncharacterized proteins with similarity to metaxins, including a novel GST from Rhodococcus with toluene o-monooxygenase and glutamylcysteine synthetase activities. Other members are the cadmium-inducible Probab=98.24 E-value=4.9e-06 Score=53.92 Aligned_cols=61 Identities=13% Similarity=0.231 Sum_probs=48.6 Q ss_pred HHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCC-C-----HHHCHHHHHHHHHHH Q ss_conf 999999855654534798426675028999999999999961899-9-----456768899999997 Q gi|254780594|r 143 NNMREHIKYITWLIKSRSWIAGHHMSYADFAASATISILDYLGEI-D-----WDTAPIVKEWYQRMK 203 (230) Q Consensus 143 ~~l~~~l~~le~~l~~~~fl~Gd~~s~ADi~~~~~l~~~~~~~~~-~-----~~~~p~l~~w~~r~~ 203 (230) ...++.++.++..|++++|+.||+||.+|++++++|..+-..... . ..++|+|.+|.+||+ T Consensus 22 ~~~~~dl~aLs~~Lg~~~yffGd~Pt~~Da~vfg~La~i~~~~~~~~~l~~~~~~~~nL~~y~~Ri~ 88 (88) T cd03193 22 SLAKKDLKALSDLLGDKKFFFGDKPTSLDATVFGHLASILYAPLPNSALQLILKEYPNLVEYCERIR 88 (88) T ss_pred HHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCCHHHHHHHHCCHHHHHHHHHC T ss_conf 9999999999999777980379998488899999999998268999299999987939999998709 No 91 >cd03198 GST_C_CLIC GST_C family, Chloride Intracellular Channel (CLIC) subfamily; composed of CLIC1-5, p64, parchorin, and similar proteins. They are auto-inserting, self-assembling intracellular anion channels involved in a wide variety of functions including regulated secretion, cell division, and apoptosis. They can exist in both water-soluble and membrane-bound states and are found in various vesicles and membranes. Biochemical studies of the C. elegans homolog, EXC-4, show that the membrane localization domain is present in the N-terminal part of the protein. The structure of soluble human CLIC1 reveals that it is monomeric and adopts a fold similar to GSTs, containing an N-terminal domain with a thioredoxin fold and a C-terminal alpha helical domain. Upon oxidation, the N-terminal domain of CLIC1 undergoes a structural change to form a non-covalent dimer stabilized by the formation of an intramolecular disulfide bond between two cysteines that are far apart in the reduced form. T Probab=98.16 E-value=1e-05 Score=52.17 Aligned_cols=57 Identities=12% Similarity=0.209 Sum_probs=46.5 Q ss_pred CCCCCCCCCCCCHHHHHHHHHHHHHH----HCCCCC-HHHCHHHHHHHHHHHCCHHHHHHHH Q ss_conf 47984266750289999999999999----618999-4567688999999970998898413 Q gi|254780594|r 157 KSRSWIAGHHMSYADFAASATISILD----YLGEID-WDTAPIVKEWYQRMKSRPSFRPLLS 213 (230) Q Consensus 157 ~~~~fl~Gd~~s~ADi~~~~~l~~~~----~~~~~~-~~~~p~l~~w~~r~~~rps~~~~l~ 213 (230) +.++||.|+++|+|||.+.|-|+.+. ++..++ .++.+.|.+|++.+.+|++|+.+-+ T Consensus 62 s~r~FLdGd~ltlaDc~LlPKL~~v~V~~~~yk~f~IP~~l~~v~rYl~~ay~~e~F~~Tcp 123 (134) T cd03198 62 SQRKFLDGDELTLADCNLLPKLHIVKVVAKKYRNFEIPADLTGLWRYLKNAYQREEFTNTCP 123 (134) T ss_pred CCCCCCCCCCCCHHHHHHHHHHHHHHHHHHHHCCCCCCHHHHHHHHHHHHHHHHHHHHCCCC T ss_conf 56863178877576565635789999999987089980550799999999863677873499 No 92 >cd02976 NrdH NrdH-redoxin (NrdH) family; NrdH is a small monomeric protein with a conserved redox active CXXC motif within a TRX fold, characterized by a glutaredoxin (GRX)-like sequence and TRX-like activity profile. In vitro, it displays protein disulfide reductase activity that is dependent on TRX reductase, not glutathione (GSH). It is part of the NrdHIEF operon, where NrdEF codes for class Ib ribonucleotide reductase (RNR-Ib), an efficient enzyme at low oxygen levels. Under these conditions when GSH is mostly conjugated to spermidine, NrdH can still function and act as a hydrogen donor for RNR-Ib. It has been suggested that the NrdHEF system may be the oldest RNR reducing system, capable of functioning in a microaerophilic environment, where GSH was not yet available. NrdH from Corynebacterium ammoniagenes can form domain-swapped dimers, although it is unknown if this happens in vivo. Domain-swapped dimerization, which results in the blocking of the TRX reductase binding site, cou Probab=98.09 E-value=3.4e-06 Score=54.86 Aligned_cols=63 Identities=17% Similarity=0.182 Sum_probs=50.4 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCCC-CCHHHHHHCCCCCCCHHHHCCHHHHCCC Q ss_conf 952135899868999999999869982999888677-7889998556877520111000111012 Q gi|254780594|r 1 MSILYHYPLSSSSRFIRLILSEYEFTTDMIEEFPWG-KRREFLELNPSGTLPVYIDEHMQALCGV 64 (230) Q Consensus 1 M~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~~-~~~~~~~lnP~g~vP~L~~~dg~~l~eS 64 (230) .+++|+.+.||||.+++-.|.++||+|+.+.++... ...++.+++...+||+++.+| +.+.+- T Consensus 1 kI~iYt~~~C~~C~~ak~~L~~~gi~y~~~di~~d~~~~~~l~~~~g~~~vP~i~i~~-~~i~Gf 64 (73) T cd02976 1 EVTVYTKPDCPYCKATKRFLDERGIPFEEVDVDEDPEALEELKKLNGYRSVPVVVIGD-EHLSGF 64 (73) T ss_pred CEEEECCCCCHHHHHHHHHHHHCCCEEEEEECCCCHHHHHHHHHHCCCCCCCEEEECC-EEEECC T ss_conf 9999918999779999999998399049998799999999999865999618899999-999387 No 93 >cd03211 GST_C_Metaxin2 GST_C family, Metaxin subfamily, Metaxin 2; a metaxin 1 binding protein identified through a yeast two-hybrid system using metaxin 1 as the bait. Metaxin 2 shares sequence similarity with metaxin 1 but does not contain a C-terminal mitochondrial outer membrane signal-anchor domain. It associates with mitochondrial membranes through its interaction with metaxin 1, which is a component of the mitochondrial preprotein import complex of the outer membrane. The biological function of metaxin 2 is unknown. It is likely that it also plays a role in protein translocation into the mitochondria. However, this has not been experimentally validated. In a recent proteomics study, it has been shown that metaxin 2 is overexpressed in response to lipopolysaccharide-induced liver injury. Probab=98.08 E-value=2.7e-05 Score=49.66 Aligned_cols=64 Identities=5% Similarity=0.007 Sum_probs=50.1 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHCCC--C----CHHHCHHHHHHHHHHH Q ss_conf 99999999985565453479842667502899999999999996189--9----9456768899999997 Q gi|254780594|r 140 IARNNMREHIKYITWLIKSRSWIAGHHMSYADFAASATISILDYLGE--I----DWDTAPIVKEWYQRMK 203 (230) Q Consensus 140 ~~~~~l~~~l~~le~~l~~~~fl~Gd~~s~ADi~~~~~l~~~~~~~~--~----~~~~~p~l~~w~~r~~ 203 (230) ...+...+.++.+...|++++|+.||+||-.|..++++|..+-.... - -..++|||.+|.+||+ T Consensus 57 evy~~a~~~l~aLS~~Lg~~~yffGd~Pt~lDA~vfg~la~il~~~lp~~~L~~~l~~~~NLv~fc~Ri~ 126 (126) T cd03211 57 QVIEEVDQCCQALSQRLGTQPYFFGDQPTELDALVFGHLFTILTTQLPNDELAEKVKKYSNLLAFCRRIE 126 (126) T ss_pred HHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCHHHHHHHHHCCHHHHHHHHCC T ss_conf 9999999999999999777970579996099999999999998536998089999997866999997659 No 94 >cd03078 GST_N_Metaxin1_like GST_N family, Metaxin subfamily, Metaxin 1-like proteins; composed of metaxins 1 and 3, and similar proteins including Tom37 from fungi. Mammalian metaxin (or metaxin 1) and the fungal protein Tom37 are components of preprotein import complexes of the mitochondrial outer membrane. Metaxin extends to the cytosol and is anchored to the mitochondrial membrane through its C-terminal domain. In mice, metaxin is required for embryonic development. Like the murine gene, the human metaxin gene is located downstream to the glucocerebrosidase (GBA) pseudogene and is convergently transcribed. Inherited deficiency of GBA results in Gaucher disease, which presents many diverse clinical phenotypes. Alterations in the metaxin gene, in addition to GBA mutations, may be associated with Gaucher disease. Genome sequencing shows that a third metaxin gene also exists in zebrafish, Xenopus, chicken and mammals. Probab=97.98 E-value=6.3e-06 Score=53.33 Aligned_cols=58 Identities=17% Similarity=0.165 Sum_probs=49.0 Q ss_pred CCHHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHCCCCCCCHHHHCCHHHHCCCCHHHHHHHHH Q ss_conf 986899999999986998299988867778899985568775201110001110121035677752 Q gi|254780594|r 9 LSSSSRFIRLILSEYEFTTDMIEEFPWGKRREFLELNPSGTLPVYIDEHMQALCGVFVISEYLDET 74 (230) Q Consensus 9 ~sp~~~kvr~~L~e~gi~y~~~~~~~~~~~~~~~~lnP~g~vP~L~~~dg~~l~eS~aI~~yL~~~ 74 (230) .+|.|.++..+|...|++|+++..+-. ..+|.|++|+|++ +++.+.++..|++||.++ T Consensus 15 i~p~CL~~~aylk~~~~~~~v~~~~n~-------~~sP~G~LP~L~~-~~~~i~~~~~Ii~~L~~~ 72 (73) T cd03078 15 VDPECLAVLAYLKFAGAPLKVVPSNNP-------WRSPTGKLPALLT-SGTKISGPEKIIEYLRKQ 72 (73) T ss_pred CCHHHHHHHHHHHHCCCCEEEEECCCC-------CCCCCCCCCEEEE-CCEEECCHHHHHHHHHHC T ss_conf 899999999999605996399974898-------7697874888998-997985879999999864 No 95 >TIGR02196 GlrX_YruB Glutaredoxin-like protein, YruB-family; InterPro: IPR011911 Glutaredoxins , , , also known as thioltransferases (disulphide reductases, are small proteins of approximately one hundred amino-acid residues which utilise glutathione and NADPH as cofactors. Oxidized glutathione is regenerated by glutathione reductase. Together these components compose the glutathione system . Glutaredoxin functions as an electron carrier in the glutathione-dependent synthesis of deoxyribonucleotides by the enzyme ribonucleotide reductase. Like thioredoxin, which functions in a similar way, glutaredoxin possesses an active centre disulphide bond . It exists in either a reduced or an oxidized form where the two cysteine residues are linked in an intramolecular disulphide bond. Glutaredoxin has been sequenced in a variety of species. On the basis of extensive sequence similarity, it has been proposed that Vaccinia virus protein O2L is most probably a glutaredoxin. Finally, it must be noted that Bacteriophage T4 thioredoxin seems also to be evolutionary related. In position 5 of the pattern T4 thioredoxin has Val instead of Pro. This glutaredoxin-like protein family contains the conserved CxxC motif and includes the Clostridium pasteurianum protein YruB which has been cloned from a rubredoxin operon . Somewhat related to NrdH, it is unknown whether this protein actually interacts with glutathione/glutathione reducatase, or, like NrdH, some other reductant system.. Probab=97.93 E-value=8.3e-06 Score=52.63 Aligned_cols=63 Identities=16% Similarity=0.172 Sum_probs=51.0 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCCCC-CHHHHHHCC-CCCCCHHH---HCCHHHHCC Q ss_conf 9521358998689999999998699829998886777-889998556-87752011---100011101 Q gi|254780594|r 1 MSILYHYPLSSSSRFIRLILSEYEFTTDMIEEFPWGK-RREFLELNP-SGTLPVYI---DEHMQALCG 63 (230) Q Consensus 1 M~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~~~-~~~~~~lnP-~g~vP~L~---~~dg~~l~e 63 (230) |.++|..+.||+|.++.-.|..+||+|+.+.+..... .+++++.+- ..-||+++ .+++++|.+ T Consensus 1 ~V~vYt~~~Cp~C~~aK~yL~~~gv~f~~~dV~~d~~A~~~~~k~~gG~~gVPv~vsGdi~~~~i~~G 68 (79) T TIGR02196 1 KVKVYTTPWCPPCKKAKEYLTSKGVAFEEIDVEKDSAAREELLKKLGGQRGVPVIVSGDIGTHKIIVG 68 (79) T ss_pred CEEEEEECCCCCHHHHHHHHHHCCCCEEEECCCCCHHHHHHHHHHHCCCCCCCEEEECCCCCCEEEEE T ss_conf 90687408982078999999954994576343369788999999838985455576342379724870 No 96 >cd03029 GRX_hybridPRX5 Glutaredoxin (GRX) family, PRX5 hybrid subfamily; composed of hybrid proteins containing peroxiredoxin (PRX) and GRX domains, which is found in some pathogenic bacteria and cyanobacteria. PRXs are thiol-specific antioxidant (TSA) proteins that confer a protective antioxidant role in cells through their peroxidase activity in which hydrogen peroxide, peroxynitrate, and organic hydroperoxides are reduced and detoxified using reducing equivalents derived from either thioredoxin, glutathione, trypanothione and AhpF. GRX is a glutathione (GSH) dependent reductase, catalyzing the disulfide reduction of target proteins. PRX-GRX hybrid proteins from Haemophilus influenza and Neisseria meningitis exhibit GSH-dependent peroxidase activity. The flow of reducing equivalents in the catalytic cycle of the hybrid protein goes from NADPH - GSH reductase - GSH - GRX domain of hybrid - PRX domain of hybrid - peroxide substrate. Probab=97.78 E-value=2.8e-05 Score=49.64 Aligned_cols=69 Identities=14% Similarity=0.061 Sum_probs=55.9 Q ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHCCCCCCCHHHHCCHHHHCCCCHHHHHH Q ss_conf 5213589986899999999986998299988867778899985568775201110001110121035677 Q gi|254780594|r 2 SILYHYPLSSSSRFIRLILSEYEFTTDMIEEFPWGKRREFLELNPSGTLPVYIDEHMQALCGVFVISEYL 71 (230) Q Consensus 2 ~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~~~~~~~~~lnP~g~vP~L~~~dg~~l~eS~aI~~yL 71 (230) .++|+-+.||||.++.-+|.++|++|+.+.++-.....++.+.+...+||-+.. ||+.|.+.....+|. T Consensus 3 V~Iysk~~CpyC~~AK~lL~~~gi~y~ei~l~~~~~~~~l~~~~g~~TvPQIfi-~g~~IGG~~dL~~~f 71 (72) T cd03029 3 VSLFTKPGCPFCARAKAALQENGISYEEIPLGKDITGRSLRAVTGAMTVPQVFI-DGELIGGSDDLEKYF 71 (72) T ss_pred EEEECCCCCHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHCCCCEECEEEE-CCEEEECHHHHHHHH T ss_conf 999908999679999999998599659998887767999987439972298999-999997899999975 No 97 >COG0695 GrxC Glutaredoxin and related proteins [Posttranslational modification, protein turnover, chaperones] Probab=97.74 E-value=4.6e-05 Score=48.38 Aligned_cols=69 Identities=13% Similarity=0.075 Sum_probs=51.2 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCCC--CCHHHHH-HCCCCCCCHHHHCCHHHHCCCCHHHHH Q ss_conf 952135899868999999999869982999888677--7889998-556877520111000111012103567 Q gi|254780594|r 1 MSILYHYPLSSSSRFIRLILSEYEFTTDMIEEFPWG--KRREFLE-LNPSGTLPVYIDEHMQALCGVFVISEY 70 (230) Q Consensus 1 M~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~~--~~~~~~~-lnP~g~vP~L~~~dg~~l~eS~aI~~y 70 (230) +.++|..+.||||.++.-+|..+|++|+.+.+.-.. ...++.+ .++..+||+++.++. .+......-.+ T Consensus 2 ~v~iyt~~~CPyC~~ak~~L~~~g~~~~~i~~~~~~~~~~~~~~~~~~g~~tvP~I~i~~~-~igg~~d~~~~ 73 (80) T COG0695 2 NVTIYTKPGCPYCKRAKRLLDRKGVDYEEIDVDDDEPEEAREMVKRGKGQRTVPQIFIGGK-HVGGCDDLDAL 73 (80) T ss_pred CEEEEECCCCCHHHHHHHHHHHCCCCCEEEECCCCCHHHHHHHHHHHCCCCCCCEEEECCE-EEECCCCHHHH T ss_conf 4799938999645999999997799709998378867899999997189857698999999-87165898999 No 98 >pfam00462 Glutaredoxin Glutaredoxin. Probab=97.55 E-value=7.5e-05 Score=47.15 Aligned_cols=58 Identities=17% Similarity=0.137 Sum_probs=45.9 Q ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCCC-CCHHHHHHCCCCCCCHHHHCCHHH Q ss_conf 52135899868999999999869982999888677-788999855687752011100011 Q gi|254780594|r 2 SILYHYPLSSSSRFIRLILSEYEFTTDMIEEFPWG-KRREFLELNPSGTLPVYIDEHMQA 60 (230) Q Consensus 2 ~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~~-~~~~~~~lnP~g~vP~L~~~dg~~ 60 (230) .++|+.+.||||.+++-+|..+|++|+.+.++... ...++.+.+...+||.+.. ||+. T Consensus 1 v~iys~~~C~~C~~ak~~L~~~~i~y~~~~v~~~~~~~~~l~~~~g~~tvPqI~i-~g~~ 59 (60) T pfam00462 1 VVLFTKPTCPFCKRAKRLLKSLGVKFEEIDVDEDPEIREELKELSGWRTVPQVFI-DGEH 59 (60) T ss_pred CEEEECCCCHHHHHHHHHHHHCCCCCEEEECCCCHHHHHHHHHHHCCCCCCEEEE-CCEE T ss_conf 9899389897399999999980997533215899999999999829998197988-9999 No 99 >PRK10638 glutaredoxin 3; Provisional Probab=97.43 E-value=0.00015 Score=45.49 Aligned_cols=69 Identities=17% Similarity=0.115 Sum_probs=52.2 Q ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCCC-CCHHHHHHCCCCCCCHHHHCCHHHHCCCCHHHHHHH Q ss_conf 52135899868999999999869982999888677-788999855687752011100011101210356777 Q gi|254780594|r 2 SILYHYPLSSSSRFIRLILSEYEFTTDMIEEFPWG-KRREFLELNPSGTLPVYIDEHMQALCGVFVISEYLD 72 (230) Q Consensus 2 ~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~~-~~~~~~~lnP~g~vP~L~~~dg~~l~eS~aI~~yL~ 72 (230) .+.|+.+.||||.|+.-+|..+|++|+.+.++... ...++.+.....+||-+.. ||+.|.+...... |. T Consensus 4 V~Iysk~~CpyC~~Ak~lL~~~gi~y~ei~v~~~~~~~~~l~~~tg~~TVPqIfI-~g~~IGG~~dl~~-L~ 73 (83) T PRK10638 4 IEIYTKATCPFCHRAKALLNSKGVSFQEIPIDGDAAKREEMIKRSGRTTVPQIFI-DAQHIGGCDDLYA-LD 73 (83) T ss_pred EEEEECCCCHHHHHHHHHHHHCCCCCEEEECCCCHHHHHHHHHHCCCCCCCEEEE-CCEEEECHHHHHH-HH T ss_conf 8999899996799999999975998769986799999999997369996196999-9999957899999-99 No 100 >TIGR02181 GRX_bact glutaredoxin 3; InterPro: IPR011900 Glutaredoxins , , , also known as thioltransferases (disulphide reductases, are small proteins of approximately one hundred amino-acid residues which utilise glutathione and NADPH as cofactors. Oxidized glutathione is regenerated by glutathione reductase. Together these components compose the glutathione system . Glutaredoxin functions as an electron carrier in the glutathione-dependent synthesis of deoxyribonucleotides by the enzyme ribonucleotide reductase. Like thioredoxin, which functions in a similar way, glutaredoxin possesses an active centre disulphide bond . It exists in either a reduced or an oxidized form where the two cysteine residues are linked in an intramolecular disulphide bond. Glutaredoxin has been sequenced in a variety of species. On the basis of extensive sequence similarity, it has been proposed that Vaccinia virus protein O2L is most probably a glutaredoxin. Finally, it must be noted that Bacteriophage T4 thioredoxin seems also to be evolutionary related. In position 5 of the pattern T4 thioredoxin has Val instead of Pro. This family of glutaredoxins includes the E. coli protein GrxC (Grx3) which appears to have a secondary role in reducing ribonucleotide reductase (in the absence of GrxA) possibly indicating a role in the reduction of other protein disulphides .; GO: 0006118 electron transport, 0045454 cell redox homeostasis. Probab=97.41 E-value=0.00018 Score=44.90 Aligned_cols=67 Identities=15% Similarity=0.083 Sum_probs=53.1 Q ss_pred CCCCCCCCCHHHHHHHHHHHHCCC-CEEEEECCCCC-C-CHHHHHHC-CCCCCCHHHHCCHHHHCCCCHHHH Q ss_conf 521358998689999999998699-82999888677-7-88999855-687752011100011101210356 Q gi|254780594|r 2 SILYHYPLSSSSRFIRLILSEYEF-TTDMIEEFPWG-K-RREFLELN-PSGTLPVYIDEHMQALCGVFVISE 69 (230) Q Consensus 2 ~~LY~~~~sp~~~kvr~~L~e~gi-~y~~~~~~~~~-~-~~~~~~ln-P~g~vP~L~~~dg~~l~eS~aI~~ 69 (230) .++|..+.||||.++.-+|..||| .|+-+.++... . ..++.+.+ -..+||=+..++ ..|.++-.... T Consensus 1 V~iYT~~~CPYC~rAK~LL~~kGv~~~~Ei~vd~dpD~~r~eM~~Rs~Gr~TVPQIFIg~-~HVGGcDDL~~ 71 (82) T TIGR02181 1 VTIYTKPYCPYCSRAKALLESKGVGTFTEIRVDGDPDALRDEMMQRSGGRRTVPQIFIGD-VHVGGCDDLYA 71 (82) T ss_pred CEEECCCCCCCCHHHHHHHHHCCCCCCEEEECCCCCCHHHHHHHHHHCCCCCCCEEEECC-CCCCCCHHHHH T ss_conf 946315998802688999985698863034457981068899999828994043376299-02454379999 No 101 >cd02066 GRX_family Glutaredoxin (GRX) family; composed of GRX, approximately 10 kDa in size, and proteins containing a GRX or GRX-like domain. GRX is a glutathione (GSH) dependent reductase, catalyzing the disulfide reduction of target proteins such as ribonucleotide reductase. It contains a redox active CXXC motif in a TRX fold and uses a similar dithiol mechanism employed by TRXs for intramolecular disulfide bond reduction of protein substrates. Unlike TRX, GRX has preference for mixed GSH disulfide substrates, in which it uses a monothiol mechanism where only the N-terminal cysteine is required. The flow of reducing equivalents in the GRX system goes from NADPH - GSH reductase - GSH - GRX - protein substrates. By altering the redox state of target proteins, GRX is involved in many cellular functions including DNA synthesis, signal transduction and the defense against oxidative stress. Different classes are known including human GRX1 and GRX2, as well as E. coli GRX1 and GRX3, which Probab=97.38 E-value=0.00018 Score=44.91 Aligned_cols=66 Identities=15% Similarity=0.174 Sum_probs=50.9 Q ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCCCC-CHHHHHHCCCCCCCHHHHCCHHHHCCCCHHH Q ss_conf 521358998689999999998699829998886777-8899985568775201110001110121035 Q gi|254780594|r 2 SILYHYPLSSSSRFIRLILSEYEFTTDMIEEFPWGK-RREFLELNPSGTLPVYIDEHMQALCGVFVIS 68 (230) Q Consensus 2 ~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~~~-~~~~~~lnP~g~vP~L~~~dg~~l~eS~aI~ 68 (230) .++|+-+.||||.+++-+|..+|++|+.+.+..... ..++.+.+...++|.+.. ||+.|.+..... T Consensus 2 V~iYs~~~C~~C~~ak~~L~~~~i~y~e~di~~~~~~~~~l~~~~g~~tvPqIfi-~g~~IGG~~~l~ 68 (72) T cd02066 2 VVVFSKSTCPYCKRAKRLLESLGIEFEEIDILEDGELREELKELSGWPTVPQIFI-NGEFIGGYDDLK 68 (72) T ss_pred EEEEECCCCHHHHHHHHHHHHCCCCCEEEEECCCHHHHHHHHHHHCCCCCCEEEE-CCEEECCHHHHH T ss_conf 9999779997699999999956997789980598889999999849988995999-998981689999 No 102 >TIGR02200 GlrX_actino Glutaredoxin-like protein; InterPro: IPR011915 Glutaredoxins , , , also known as thioltransferases (disulphide reductases, are small proteins of approximately one hundred amino-acid residues which utilise glutathione and NADPH as cofactors. Oxidized glutathione is regenerated by glutathione reductase. Together these components compose the glutathione system . Glutaredoxin functions as an electron carrier in the glutathione-dependent synthesis of deoxyribonucleotides by the enzyme ribonucleotide reductase. Like thioredoxin, which functions in a similar way, glutaredoxin possesses an active centre disulphide bond . It exists in either a reduced or an oxidized form where the two cysteine residues are linked in an intramolecular disulphide bond. Glutaredoxin has been sequenced in a variety of species. On the basis of extensive sequence similarity, it has been proposed that Vaccinia virus protein O2L is most probably a glutaredoxin. Finally, it must be noted that Bacteriophage T4 thioredoxin seems also to be evolutionary related. In position 5 of the pattern T4 thioredoxin has Val instead of Pro. This family of glutaredoxin-like proteins is limited to the Actinobacteria and contains the conserved CxxC motif.. Probab=97.31 E-value=0.0001 Score=46.41 Aligned_cols=62 Identities=10% Similarity=0.059 Sum_probs=53.0 Q ss_pred CCCCCCCCCHHHHHHHHHH-HHCCCCEEEEECCCCCCCHHHH-HHC-CCCCCCHHHHCCHHHHCC Q ss_conf 5213589986899999999-9869982999888677788999-855-687752011100011101 Q gi|254780594|r 2 SILYHYPLSSSSRFIRLIL-SEYEFTTDMIEEFPWGKRREFL-ELN-PSGTLPVYIDEHMQALCG 63 (230) Q Consensus 2 ~~LY~~~~sp~~~kvr~~L-~e~gi~y~~~~~~~~~~~~~~~-~ln-P~g~vP~L~~~dg~~l~e 63 (230) ++.|..+.||||++.+-.| ...|++||.+.+.-.+...+|. ++| ....||+..-+||..+++ T Consensus 2 ~~vY~~~WCgyC~~l~~~Lk~~~G~~Ye~~di~~d~~AA~~v~svn~Gn~~vPtV~~~DGs~~tn 66 (78) T TIGR02200 2 VTVYATTWCGYCAQLMRTLKDKDGAAYEVVDIEEDEEAADYVKSVNDGNKTVPTVKFADGSFLTN 66 (78) T ss_pred EEEEECCCCCHHHHHHHHHHHCCCCCCEEEECCCCCCCCCEEEEECCCCCEEEEEEECCCCCCCC T ss_conf 04640699806789987534307995114524548475528665308871301687479944438 No 103 >cd03027 GRX_DEP Glutaredoxin (GRX) family, Dishevelled, Egl-10, and Pleckstrin (DEP) subfamily; composed of uncharacterized proteins containing a GRX domain and additional domains DEP and DUF547, both of which have unknown functions. GRX is a glutathione (GSH) dependent reductase containing a redox active CXXC motif in a TRX fold. It has preference for mixed GSH disulfide substrates, in which it uses a monothiol mechanism where only the N-terminal cysteine is required. By altering the redox state of target proteins, GRX is involved in many cellular functions. Probab=97.22 E-value=0.00046 Score=42.59 Aligned_cols=64 Identities=17% Similarity=0.133 Sum_probs=48.8 Q ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCCC-CCHHHHHHCCCCCCCHHHHCCHHHHCCCCH Q ss_conf 52135899868999999999869982999888677-788999855687752011100011101210 Q gi|254780594|r 2 SILYHYPLSSSSRFIRLILSEYEFTTDMIEEFPWG-KRREFLELNPSGTLPVYIDEHMQALCGVFV 66 (230) Q Consensus 2 ~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~~-~~~~~~~lnP~g~vP~L~~~dg~~l~eS~a 66 (230) .++|+-+.||||.+++-+|..+|++|+.+.++... ...++.+.+...+||-+.. |+..|.+... T Consensus 3 V~IYsk~~CpyC~~AK~lL~~kgi~y~ei~l~~~~~~~~~l~~~tg~~TVPQIFi-~~~~IGG~~d 67 (73) T cd03027 3 VTIYSRLGCEDCTAVRLFLREKGLPYVEINIDIFPERKAELEERTGSSVVPQIFF-NEKLVGGLTD 67 (73) T ss_pred EEEEECCCCHHHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHHHCCCCCCEEEE-CCEEEECHHH T ss_conf 9999189996799999999987996499988899899999999719997597999-9999828999 No 104 >cd03419 GRX_GRXh_1_2_like Glutaredoxin (GRX) family, GRX human class 1 and 2 (h_1_2)-like subfamily; composed of proteins similar to human GRXs, approximately 10 kDa in size, and proteins containing a GRX or GRX-like domain. GRX is a glutathione (GSH) dependent reductase, catalyzing the disulfide reduction of target proteins such as ribonucleotide reductase. It contains a redox active CXXC motif in a TRX fold and uses a similar dithiol mechanism employed by TRXs for intramolecular disulfide bond reduction of protein substrates. Unlike TRX, GRX has preference for mixed GSH disulfide substrates, in which it uses a monothiol mechanism where only the N-terminal cysteine is required. The flow of reducing equivalents in the GRX system goes from NADPH - GSH reductase - GSH - GRX - protein substrates. By altering the redox state of target proteins, GRX is involved in many cellular functions including DNA synthesis, signal transduction and the defense against oxidative stress. Different classes Probab=97.19 E-value=0.00067 Score=41.67 Aligned_cols=70 Identities=11% Similarity=0.082 Sum_probs=52.5 Q ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCCCCCH----HHHHHCCCCCCCHHHHCCHHHHCCCCHHHHHHH Q ss_conf 52135899868999999999869982999888677788----999855687752011100011101210356777 Q gi|254780594|r 2 SILYHYPLSSSSRFIRLILSEYEFTTDMIEEFPWGKRR----EFLELNPSGTLPVYIDEHMQALCGVFVISEYLD 72 (230) Q Consensus 2 ~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~~~~~----~~~~lnP~g~vP~L~~~dg~~l~eS~aI~~yL~ 72 (230) .++|+-+.||||.+++-+|..+|++|+.+.++...... .+.++....+||.+.. ||+.|.+...+.++.. T Consensus 2 VvIysk~~Cp~C~~ak~lL~~~~i~~~~~~~d~~~~~~~~~~~l~~~~g~~TvPqIfi-~g~~IGG~~~l~~l~~ 75 (82) T cd03419 2 VVVFSKSYCPYCKRAKSLLKELGVKPAVVELDQHEDGSEIQDYLQELTGQRTVPNVFI-GGKFIGGCDDLMALHK 75 (82) T ss_pred EEEEECCCCHHHHHHHHHHHHCCCCEEEEECCCCCCHHHHHHHHHHHCCCCCCCEEEE-CCEEECCHHHHHHHHH T ss_conf 8999738887699999999984998069964656439999999998439998886999-9989847899999998 No 105 >cd03418 GRX_GRXb_1_3_like Glutaredoxin (GRX) family, GRX bacterial class 1 and 3 (b_1_3)-like subfamily; composed of bacterial GRXs, approximately 10 kDa in size, and proteins containing a GRX or GRX-like domain. GRX is a glutathione (GSH) dependent reductase, catalyzing the disulfide reduction of target proteins such as ribonucleotide reductase. It contains a redox active CXXC motif in a TRX fold and uses a similar dithiol mechanism employed by TRXs for intramolecular disulfide bond reduction of protein substrates. Unlike TRX, GRX has preference for mixed GSH disulfide substrates, in which it uses a monothiol mechanism where only the N-terminal cysteine is required. The flow of reducing equivalents in the GRX system goes from NADPH - GSH reductase - GSH - GRX - protein substrates. By altering the redox state of target proteins, GRX is involved in many cellular functions including DNA synthesis, signal transduction and the defense against oxidative stress. Different classes are known i Probab=97.09 E-value=0.00055 Score=42.15 Aligned_cols=67 Identities=13% Similarity=0.045 Sum_probs=49.1 Q ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCCC-CCHHHHHHCC-CCCCCHHHHCCHHHHCCCCHHHH Q ss_conf 52135899868999999999869982999888677-7889998556-87752011100011101210356 Q gi|254780594|r 2 SILYHYPLSSSSRFIRLILSEYEFTTDMIEEFPWG-KRREFLELNP-SGTLPVYIDEHMQALCGVFVISE 69 (230) Q Consensus 2 ~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~~-~~~~~~~lnP-~g~vP~L~~~dg~~l~eS~aI~~ 69 (230) .++|+-+.||||.++.-+|..+|++|+.+.++... ...++.+.+. ..+||.+.. ||+.|.+...... T Consensus 2 V~iysk~~C~~C~~aK~lL~~~~i~y~ei~v~~~~~~~~~~~~~~~~~~TvPqIfi-~g~~IGG~~~l~~ 70 (75) T cd03418 2 VEIYTKPNCPYCVRAKALLDKKGVDYEEIDVDGDPALREEMINRSGGRRTVPQIFI-GDVHIGGCDDLYA 70 (75) T ss_pred EEEEECCCCHHHHHHHHHHHHHCCCCEEEECCCCHHHHHHHHHHCCCCCEECEEEE-CCEEEECHHHHHH T ss_conf 99996799977999999999819986898627999999999998089935397999-9979837899999 No 106 >PRK10853 hypothetical protein; Provisional Probab=96.99 E-value=0.0016 Score=39.57 Aligned_cols=40 Identities=20% Similarity=0.082 Sum_probs=34.4 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCCCCCHH Q ss_conf 9521358998689999999998699829998886777889 Q gi|254780594|r 1 MSILYHYPLSSSSRFIRLILSEYEFTTDMIEEFPWGKRRE 40 (230) Q Consensus 1 M~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~~~~~~ 40 (230) |+++|+++.|.-|+|++-.|.+.||+|+++.+.-...+.+ T Consensus 1 MitiYgiknCdTckKA~kwL~~~~i~y~f~D~k~~~~~~~ 40 (118) T PRK10853 1 MVTLYGIKNCDTIKKARRWLEENGIDYRFHDYRVDGLDSE 40 (118) T ss_pred CEEEECCCCCHHHHHHHHHHHHCCCCCEEEECCCCCCCHH T ss_conf 9668778998899999999998799818963114698999 No 107 >TIGR02190 GlrX-dom Glutaredoxin-family domain; InterPro: IPR011906 Glutaredoxins , , , also known as thioltransferases (disulphide reductases, are small proteins of approximately one hundred amino-acid residues which utilise glutathione and NADPH as cofactors. Oxidized glutathione is regenerated by glutathione reductase. Together these components compose the glutathione system . Glutaredoxin functions as an electron carrier in the glutathione-dependent synthesis of deoxyribonucleotides by the enzyme ribonucleotide reductase. Like thioredoxin, which functions in a similar way, glutaredoxin possesses an active centre disulphide bond . It exists in either a reduced or an oxidized form where the two cysteine residues are linked in an intramolecular disulphide bond. Glutaredoxin has been sequenced in a variety of species. On the basis of extensive sequence similarity, it has been proposed that Vaccinia virus protein O2L is most probably a glutaredoxin. Finally, it must be noted that Bacteriophage T4 thioredoxin seems also to be evolutionary related. In position 5 of the pattern T4 thioredoxin has Val instead of Pro. This C-terminal domain with homology to glutaredoxin is fused to an N-terminal peroxiredoxin-like domain.. Probab=96.91 E-value=0.00036 Score=43.21 Aligned_cols=69 Identities=14% Similarity=0.113 Sum_probs=58.0 Q ss_pred CCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHCCCCCCCHHHHCCHHHHCCCCHHHHHHH Q ss_conf 2135899868999999999869982999888677788999855687752011100011101210356777 Q gi|254780594|r 3 ILYHYPLSSSSRFIRLILSEYEFTTDMIEEFPWGKRREFLELNPSGTLPVYIDEHMQALCGVFVISEYLD 72 (230) Q Consensus 3 ~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~~~~~~~~~lnP~g~vP~L~~~dg~~l~eS~aI~~yL~ 72 (230) +++.=|-||||.|+.-.|.++|++|+-|.++-.-....+.+++-.-+||=.. -||+.|..|.+.-.||. T Consensus 11 ~vfTKpGCPFC~~AK~~L~~~G~~~~Ei~Lg~~~~~~slrAvtG~~TvPQVF-I~G~lIGGsdeL~~Yla 79 (79) T TIGR02190 11 SVFTKPGCPFCAKAKAVLKEKGYDYEEIPLGKDITTRSLRAVTGATTVPQVF-INGKLIGGSDELEAYLA 79 (79) T ss_pred EEEECCCCCHHHHHHHHHHHCCCCEEEEECCCCCEEEEEEEECCCCCCCEEE-ECCEEECCHHHHHHHCC T ss_conf 8984598964466788988389864888639960068874451898578376-49856328689764319 No 108 >cd03032 ArsC_Spx Arsenate Reductase (ArsC) family, Spx subfamily; Spx is a unique RNA polymerase (RNAP)-binding protein present in bacilli and some mollicutes. It inhibits transcription by binding to the C-terminal domain of the alpha subunit of RNAP, disrupting complex formation between RNAP and certain transcriptional activator proteins like ResD and ComA. In response to oxidative stress, Spx can also activate transcription, making it a general regulator that exerts both positive and negative control over transcription initiation. Spx has been shown to exert redox-sensitive transcriptional control over genes like trxA (TRX) and trxB (TRX reductase), genes that function in thiol homeostasis. This redox-sensitive activity is dependent on the presence of a CXXC motif, present in some members of the Spx subfamily, that acts as a thiol/disulfide switch. Spx has also been shown to repress genes in a sulfate-dependent manner independent of the presence of the CXXC motif. Probab=96.76 E-value=0.003 Score=37.92 Aligned_cols=38 Identities=26% Similarity=0.198 Sum_probs=32.9 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCCCCC Q ss_conf 95213589986899999999986998299988867778 Q gi|254780594|r 1 MSILYHYPLSSSSRFIRLILSEYEFTTDMIEEFPWGKR 38 (230) Q Consensus 1 M~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~~~~ 38 (230) |+++||.+.|.=|++++-.|.+.|++|+++...-...+ T Consensus 1 Mi~iY~~~~Cst~rkA~~~L~~~~i~~~~~d~~k~~~s 38 (115) T cd03032 1 MIKLYTSPSCSSCRKAKQWLEEHQIPFEERNLFKQPLT 38 (115) T ss_pred CEEEECCCCCHHHHHHHHHHHHCCCCEEEEEEECCCCC T ss_conf 97895699988999999999986997189974258969 No 109 >PRK12559 transcriptional regulator Spx; Provisional Probab=96.73 E-value=0.0029 Score=38.01 Aligned_cols=35 Identities=20% Similarity=0.156 Sum_probs=31.8 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCC Q ss_conf 95213589986899999999986998299988867 Q gi|254780594|r 1 MSILYHYPLSSSSRFIRLILSEYEFTTDMIEEFPW 35 (230) Q Consensus 1 M~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~ 35 (230) |+++||++.|.=|+|++-.|.+.||+|+.+...-. T Consensus 1 Mi~iY~~~~CstcRKA~~wL~~~~I~y~~~d~~~~ 35 (131) T PRK12559 1 MVVLYTTASCASCRKAKAWLEENQIDYTEKNIVSN 35 (131) T ss_pred CEEEECCCCCHHHHHHHHHHHHCCCCCEEECCCCC T ss_conf 97896699987799999999987998187210028 No 110 >PRK10329 glutaredoxin-like protein; Provisional Probab=96.62 E-value=0.0024 Score=38.48 Aligned_cols=56 Identities=16% Similarity=0.121 Sum_probs=43.0 Q ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHCCCCCCCHHHHCC Q ss_conf 52135899868999999999869982999888677788999855687752011100 Q gi|254780594|r 2 SILYHYPLSSSSRFIRLILSEYEFTTDMIEEFPWGKRREFLELNPSGTLPVYIDEH 57 (230) Q Consensus 2 ~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~~~~~~~~~lnP~g~vP~L~~~d 57 (230) +|+|+-|.|+.|....-+|..+||+|+.+.++-....-++++--...++|+.+.++ T Consensus 3 ITVYTkp~C~qC~aTKr~L~~~gI~y~~vdi~~dpea~~~vk~~G~~q~PVV~~~~ 58 (81) T PRK10329 3 ITIYTRNDCVQCHATKRAMESRGFDFEMINVDRVPEAADTLRAQGFRQLPVVIAGD 58 (81) T ss_pred EEEECCCCCCCHHHHHHHHHHCCCCEEEEECCCCHHHHHHHHHCCCCCCCEEEECC T ss_conf 99976999824798999999879942998589999999999976985599899699 No 111 >PRK13344 spxA transcriptional regulator Spx; Reviewed Probab=96.53 E-value=0.0051 Score=36.60 Aligned_cols=35 Identities=11% Similarity=0.003 Sum_probs=32.0 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCC Q ss_conf 95213589986899999999986998299988867 Q gi|254780594|r 1 MSILYHYPLSSSSRFIRLILSEYEFTTDMIEEFPW 35 (230) Q Consensus 1 M~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~ 35 (230) |+++|+++.|.-|+|++--|.+.||+|+.+.+.-. T Consensus 1 MI~iY~~~~C~TcrKA~kWL~~~~I~y~~~di~k~ 35 (132) T PRK13344 1 MIKIYTISSCTSCKKAKTWLNAHQLAYKEQNLGKE 35 (132) T ss_pred CEEEECCCCCHHHHHHHHHHHHCCCCEEEEEEECC T ss_conf 96896679986789999999987996388541227 No 112 >PRK11200 grxA glutaredoxin 1; Provisional Probab=96.43 E-value=0.0052 Score=36.59 Aligned_cols=74 Identities=12% Similarity=0.213 Sum_probs=52.1 Q ss_pred CCCCCCCCCHHHHHHHHHHHHC-----CCCEEEEECCCCCC-CHHHHHHCC--CCCCCHHHHCCHHHHCCCCHHHHHHHH Q ss_conf 5213589986899999999986-----99829998886777-889998556--877520111000111012103567775 Q gi|254780594|r 2 SILYHYPLSSSSRFIRLILSEY-----EFTTDMIEEFPWGK-RREFLELNP--SGTLPVYIDEHMQALCGVFVISEYLDE 73 (230) Q Consensus 2 ~~LY~~~~sp~~~kvr~~L~e~-----gi~y~~~~~~~~~~-~~~~~~lnP--~g~vP~L~~~dg~~l~eS~aI~~yL~~ 73 (230) .+.|+-+.||||.|+.-+|..+ +++|+.+.+..... ..++..... ..+||-+.. ||+.|.+...-..|+.+ T Consensus 3 V~Iysk~~CpyC~~AK~Ll~~l~~~~~~~~~~~idi~~~~~~~~~l~~~~g~~~~TVPQIfI-~g~~IGGy~dL~~~~~e 81 (87) T PRK11200 3 VVIFGRPGCPYCVRAKELAEKLSEERDDFDYRYVDIHAEGISKADLEKTVGKPVETVPQIFV-DQKHIGGCTDFEAYVKE 81 (87) T ss_pred EEEEECCCCHHHHHHHHHHHHHCCCCCCCCEEEEECCCCHHHHHHHHHHHCCCCCCCCEEEE-CCEEECCHHHHHHHHHH T ss_conf 99981899866999999999825335762079972677767899999984899874387989-99985399999999985 Q ss_pred HCC Q ss_conf 288 Q gi|254780594|r 74 TYG 76 (230) Q Consensus 74 ~~~ 76 (230) .++ T Consensus 82 n~~ 84 (87) T PRK11200 82 NLG 84 (87) T ss_pred CCC T ss_conf 447 No 113 >KOG1147 consensus Probab=96.42 E-value=0.003 Score=37.96 Aligned_cols=120 Identities=14% Similarity=0.184 Sum_probs=74.2 Q ss_pred HHCCHHHHCCCCHHHHHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHHHHHHHHHCCCCC Q ss_conf 11000111012103567775288723343356799999999999999986123425778999988776642122102334 Q gi|254780594|r 54 IDEHMQALCGVFVISEYLDETYGAIAQKNRLLSTDPLQRAETRRMIEWFMYQMEQDVTRPLAHERVYKLHMTTEQGGGSP 133 (230) Q Consensus 54 ~~~dg~~l~eS~aI~~yL~~~~~~~~~~~~L~p~~~~~ra~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 133 (230) +..||..+.++..+..|.... ......|++.+ .+++++..|..+.... . T Consensus 41 ~~~d~~~l~~a~~~~~~~~~~---~~~~~~lf~~~-~d~~~vd~w~~~s~~~--~------------------------- 89 (712) T KOG1147 41 QFLDGRKLNGATEPVVYSAAL---AKADPKLFGNN-IDRSQVDHWVSFSSTF--S------------------------- 89 (712) T ss_pred HCCCCCCCCCCCCCHHHHHHH---CCCCHHHCCCC-CCHHHHHHHHHHHHHC--C------------------------- T ss_conf 164430026986531333432---23588671785-5278899999876411--2------------------------- Q ss_pred HHHHHHHHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCC-C-HHHCHHHHHHHHHHHCCHHHHHH Q ss_conf 399999999999999855654534798426675028999999999999961899-9-45676889999999709988984 Q gi|254780594|r 134 DSKVLRIARNNMREHIKYITWLIKSRSWIAGHHMSYADFAASATISILDYLGEI-D-WDTAPIVKEWYQRMKSRPSFRPL 211 (230) Q Consensus 134 ~~~~~~~~~~~l~~~l~~le~~l~~~~fl~Gd~~s~ADi~~~~~l~~~~~~~~~-~-~~~~p~l~~w~~r~~~rps~~~~ 211 (230) ... +...+..++..|.-+.||+|.+.|+||+++++.|..-...... - -..+-++.+|+.-...-++.+.+ T Consensus 90 -~~~-------~s~~~~~ld~~l~~~t~lvg~sls~Ad~aiw~~l~~n~~~~~~lk~~k~~~~v~Rw~~~~~~~~a~~~v 161 (712) T KOG1147 90 -FDE-------ISSSLSELDKFLVLRTFLVGNSLSIADFAIWGALHSNGMRQEQLKAKKDYQNVERWYDLPEFQEAHNKV 161 (712) T ss_pred -HHH-------HHHHHHHHHHHHHHHHHHHCCCHHHHHHHHHHHHHCCCCHHHHHHHHCCCHHHHHHCCCHHHHHHHHHH T ss_conf -688-------888898777655577776515034778999998732520478887507850034441757689998899 Q ss_pred H Q ss_conf 1 Q gi|254780594|r 212 L 212 (230) Q Consensus 212 l 212 (230) + T Consensus 162 ~ 162 (712) T KOG1147 162 L 162 (712) T ss_pred H T ss_conf 8 No 114 >pfam10568 Tom37 Outer mitochondrial membrane transport complex protein. The TOM37 protein is one of the outer membrane proteins that make up the TOM complex for guiding cytosolic mitochondrial beta-barrel proteins from the cytosol across the outer mitochondrial membrane into the intramembrane space. In conjunction with TOM70 it guides peptides without an MTS into TOM40, the protein that forms the passage through the outer membrane. It has homology with Metaxin-1, also part of the outer mitochondrial membrane beta-barrel protein transport complex. Probab=96.38 E-value=0.0036 Score=37.46 Aligned_cols=57 Identities=23% Similarity=0.272 Sum_probs=47.3 Q ss_pred CCHHHHHHHHHHHHCCCC--EEEEECCCCCCCHHHHHHCCCCCCCHHHHCCHHHHCCCCHHHHHHH Q ss_conf 986899999999986998--2999888677788999855687752011100011101210356777 Q gi|254780594|r 9 LSSSSRFIRLILSEYEFT--TDMIEEFPWGKRREFLELNPSGTLPVYIDEHMQALCGVFVISEYLD 72 (230) Q Consensus 9 ~sp~~~kvr~~L~e~gi~--y~~~~~~~~~~~~~~~~lnP~g~vP~L~~~dg~~l~eS~aI~~yL~ 72 (230) .+|-|..+.+.+...+.+ ++++...-. .++|.|++|+|++.|++.+.+-..|++||. T Consensus 13 IdpeCLA~~~y~~~~~~~~~~~iv~s~N~-------~~SPtg~LP~L~~~~~~~isgf~~Iv~yL~ 71 (71) T pfam10568 13 ISPECLALIWYLKLCLPPEAFEVVFSSNT-------NLSPTGELPALITENGEKISGFRNIVDYLL 71 (71) T ss_pred CCHHHHHHHHHHHHCCCCCCEEEEECCCC-------CCCCCCCCCEEEECCCCEEECHHHHHHHHC T ss_conf 39999999999984489654899986899-------958688666699289729955689999769 No 115 >COG1393 ArsC Arsenate reductase and related proteins, glutaredoxin family [Inorganic ion transport and metabolism] Probab=96.29 E-value=0.0077 Score=35.60 Aligned_cols=37 Identities=22% Similarity=0.170 Sum_probs=32.1 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCCCC Q ss_conf 9521358998689999999998699829998886777 Q gi|254780594|r 1 MSILYHYPLSSSSRFIRLILSEYEFTTDMIEEFPWGK 37 (230) Q Consensus 1 M~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~~~ 37 (230) |+++||.|.|.-|+|++-.|.+.||+|+.+...-... T Consensus 2 ~itiy~~p~C~t~rka~~~L~~~gi~~~~~~~~~~~~ 38 (117) T COG1393 2 MITIYGNPNCSTCRKALAWLEEHGIEYTFIDYLKTPP 38 (117) T ss_pred EEEEEECCCCHHHHHHHHHHHHCCCCCEEEEEECCCC T ss_conf 4899727987589999999998699967988206999 No 116 >PRK01655 spxA transcriptional regulator Spx; Reviewed Probab=96.18 E-value=0.0038 Score=37.33 Aligned_cols=33 Identities=27% Similarity=0.286 Sum_probs=30.5 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECC Q ss_conf 952135899868999999999869982999888 Q gi|254780594|r 1 MSILYHYPLSSSSRFIRLILSEYEFTTDMIEEF 33 (230) Q Consensus 1 M~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~ 33 (230) |+++||+|.|.=|+|++-.|.+.||+|+.+... T Consensus 1 MitiY~~p~Cst~rkA~~wL~~~~i~~~~~d~~ 33 (131) T PRK01655 1 MVTLFTSPSCTSCRKAKAWLEEHEIPFTERNIF 33 (131) T ss_pred CEEEECCCCCHHHHHHHHHHHHCCCCCHHHHHH T ss_conf 978974899878999999999769981156787 No 117 >cd03033 ArsC_15kD Arsenate Reductase (ArsC) family, 15kD protein subfamily; composed of proteins of unknown function with similarity to thioredoxin-fold arsenic reductases, ArsC. It is encoded by an ORF present in a gene cluster associated with nitrogen fixation that also encodes dinitrogenase reductase ADP-ribosyltransferase (DRAT) and dinitrogenase reductase activating glycohydrolase (DRAG). ArsC catalyzes the reduction of arsenate [As(V)] to arsenite [As(III)], using reducing equivalents derived from glutathione via glutaredoxin, through a single catalytic cysteine. Probab=96.16 E-value=0.0098 Score=35.00 Aligned_cols=37 Identities=11% Similarity=0.096 Sum_probs=31.7 Q ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCCCCC Q ss_conf 5213589986899999999986998299988867778 Q gi|254780594|r 2 SILYHYPLSSSSRFIRLILSEYEFTTDMIEEFPWGKR 38 (230) Q Consensus 2 ~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~~~~ 38 (230) +++||+|.|.-|+|++-.|+++|++|+++...-...+ T Consensus 2 it~Y~~P~C~t~rka~~~Le~~gi~~~~~d~~~~p~t 38 (113) T cd03033 2 IIFYEKPGCANNARQKALLEAAGHEVEVRDLLTEPWT 38 (113) T ss_pred EEEEECCCCHHHHHHHHHHHHCCCCCEEEEEECCCCC T ss_conf 7999789987579999999987999479861068969 No 118 >cd03028 GRX_PICOT_like Glutaredoxin (GRX) family, PKC-interacting cousin of TRX (PICOT)-like subfamily; composed of PICOT and GRX-PICOT-like proteins. The non-PICOT members of this family contain only the GRX-like domain, whereas PICOT contains an N-terminal TRX-like domain followed by one to three GRX-like domains. It is interesting to note that PICOT from plants contain three repeats of the GRX-like domain, metazoan proteins (except for insect) have two repeats, while fungal sequences contain only one copy of the domain. PICOT is a protein that interacts with protein kinase C (PKC) theta, a calcium independent PKC isoform selectively expressed in skeletal muscle and T lymphocytes. PICOT inhibits the activation of c-Jun N-terminal kinase and the transcription factors, AP-1 and NF-kB, induced by PKC theta or T-cell activating stimuli. Both GRX and TRX domains of PICOT are required for its activity. Characterized non-PICOT members of this family include CXIP1, a CAX-interacting protein Probab=95.75 E-value=0.011 Score=34.72 Aligned_cols=64 Identities=17% Similarity=0.129 Sum_probs=46.9 Q ss_pred CCCCHHHHHHHHHHHHCCCCEEEEECCCCC-CCHHHHHHCCCCCCCHHHHCCHHHHCCCCHHHHHH Q ss_conf 899868999999999869982999888677-78899985568775201110001110121035677 Q gi|254780594|r 7 YPLSSSSRFIRLILSEYEFTTDMIEEFPWG-KRREFLELNPSGTLPVYIDEHMQALCGVFVISEYL 71 (230) Q Consensus 7 ~~~sp~~~kvr~~L~e~gi~y~~~~~~~~~-~~~~~~~lnP~g~vP~L~~~dg~~l~eS~aI~~yL 71 (230) .|.||||.++.-+|.++|++|+.+.+.-.. ....+.+++...+||-+.. +|+.|.++..+.+.- T Consensus 20 ~P~C~f~~~a~~lL~~~~v~~~~~dv~~d~~~r~~l~~~s~~~TvPqifi-~g~~IGG~D~l~~l~ 84 (90) T cd03028 20 EPRCGFSRKVVQILNQLGVDFGTFDILEDEEVRQGLKEYSNWPTFPQLYV-NGELVGGCDIVKEMH 84 (90) T ss_pred CCCCHHHHHHHHHHHHHCCCCEECCHHCCHHHHHHHHHHCCCCCCCCEEE-CCEEECCHHHHHHHH T ss_conf 98887799999999996899741330058999999999716898883779-998786669999999 No 119 >cd03079 GST_N_Metaxin2 GST_N family, Metaxin subfamily, Metaxin 2; a metaxin 1 binding protein identified through a yeast two-hybrid system using metaxin 1 as the bait. Metaxin 2 shares sequence similarity with metaxin 1 but does not contain a C-terminal mitochondrial outer membrane signal-anchor domain. It associates with mitochondrial membranes through its interaction with metaxin 1, which is a component of the mitochondrial preprotein import complex of the outer membrane. The biological function of metaxin 2 is unknown. It is likely that it also plays a role in protein translocation into the mitochondria. However, this has not been experimentally validated. In a recent proteomics study, it has been shown that metaxin 2 is overexpressed in response to lipopolysaccharide-induced liver injury. Probab=95.58 E-value=0.01 Score=34.86 Aligned_cols=58 Identities=17% Similarity=0.234 Sum_probs=46.1 Q ss_pred CCHHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHCCCCCCCHHHHCCHHHHCCCCHHHHHHHH Q ss_conf 98689999999998699829998886777889998556877520111000111012103567775 Q gi|254780594|r 9 LSSSSRFIRLILSEYEFTTDMIEEFPWGKRREFLELNPSGTLPVYIDEHMQALCGVFVISEYLDE 73 (230) Q Consensus 9 ~sp~~~kvr~~L~e~gi~y~~~~~~~~~~~~~~~~lnP~g~vP~L~~~dg~~l~eS~aI~~yL~~ 73 (230) -+..|.-|...|..-++||+.+.-. +.|| ++|.||||.|..+ ..++.|-..|+.+.+. T Consensus 16 d~AscLAVqafL~mc~Lp~~v~~r~----NAE~--MSPsGkvPfi~~g-~~vvaef~pIv~fv~~ 73 (74) T cd03079 16 DNASCLAVQTFLKMCNLPFNVRCRA----NAEF--MSPSGKVPFIRVG-NQIVSEFGPIVQFVEA 73 (74) T ss_pred CCCCCHHHHHHHHHCCCCCEEEECC----CCCC--CCCCCCCCEEEEC-CEEEECHHHHHHHHHC T ss_conf 4341199999999839974788748----8551--1899877668757-8856143579888615 No 120 >COG4545 Glutaredoxin-related protein [Posttranslational modification, protein turnover, chaperones] Probab=95.58 E-value=0.012 Score=34.51 Aligned_cols=62 Identities=18% Similarity=0.142 Sum_probs=47.6 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCCC-----------CCHHHHHH--CCCCCCCHHHHCCHHHHC Q ss_conf 952135899868999999999869982999888677-----------78899985--568775201110001110 Q gi|254780594|r 1 MSILYHYPLSSSSRFIRLILSEYEFTTDMIEEFPWG-----------KRREFLEL--NPSGTLPVYIDEHMQALC 62 (230) Q Consensus 1 M~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~~-----------~~~~~~~l--nP~g~vP~L~~~dg~~l~ 62 (230) |.+||+..+||-|--..-.|+-.+++|+.|.+.-.. ..++|-.. |.+--||+|..+||+++. T Consensus 3 kp~lfgsn~Cpdca~a~eyl~rl~v~yd~VeIt~Sm~NlKrFl~lRDs~~~Fd~vk~~gyiGIPall~~d~~vVl 77 (85) T COG4545 3 KPKLFGSNLCPDCAPAVEYLERLNVDYDFVEITESMANLKRFLHLRDSRPEFDEVKSNGYIGIPALLTDDGKVVL 77 (85) T ss_pred CCEEECCCCCCCHHHHHHHHHHCCCCCEEEEHHHHHHHHHHHHHHHCCCHHHHHHHHCCCCCCEEEEECCCCEEE T ss_conf 754621356864388999999818982243253246639988866115466775420682164278717986887 No 121 >cd03199 GST_C_GRX2 GST_C family, Glutaredoxin 2 (GRX2) subfamily; composed of bacterial proteins similar to E. coli GRX2, an atypical GRX with a molecular mass of about 24kD (most GRXs range from 9-12kD). GRX2 adopts a GST fold containing an N-terminal thioredoxin-fold domain and a C-terminal alpha helical domain. It contains a redox active CXXC motif located in the N-terminal domain, but is not able to reduce ribonucleotide reductase like other GRXs. However, it catalyzes GSH-dependent protein disulfide reduction of other substrates efficiently. GRX2 is thought to function primarily in catalyzing the reversible glutathionylation of proteins in cellular redox regulation including stress responses. Probab=95.29 E-value=0.093 Score=29.39 Aligned_cols=64 Identities=19% Similarity=0.232 Sum_probs=48.1 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCHHHCHHHHHHHHHHHCCH Q ss_conf 9999999998556545347984266750289999999999999618999456768899999997099 Q gi|254780594|r 140 IARNNMREHIKYITWLIKSRSWIAGHHMSYADFAASATISILDYLGEIDWDTAPIVKEWYQRMKSRP 206 (230) Q Consensus 140 ~~~~~l~~~l~~le~~l~~~~fl~Gd~~s~ADi~~~~~l~~~~~~~~~~~~~~p~l~~w~~r~~~rp 206 (230) ...++++..|..++..+......-| .+|+-||.+++.|..+.....+.|. |++.+|+++|.++- T Consensus 60 ~~i~~l~~~L~~L~~ll~~~~~~n~-~ls~DDi~lFp~LR~ltiVkgi~~p--~~v~~Y~~~~s~~t 123 (128) T cd03199 60 QYIAALNALLEELDPLILSSEAVNG-QLSTDDIILFPILRNLTLVKGLVFP--PKVKAYLERMSALT 123 (128) T ss_pred HHHHHHHHHHHHHHHHHCCCCCCCC-CCCHHHHHHHHHHHHHHHHCCCCCC--HHHHHHHHHHHHHH T ss_conf 9999999999999998556455587-6779999999998434641688688--89999999999984 No 122 >pfam04399 Glutaredoxin2_C Glutaredoxin 2, C terminal domain. Glutaredoxins are a multifunctional family of glutathione-dependent disulphide oxidoreductases. Unlike other glutaredoxins, glutaredoxin 2 (Grx2) cannot reduce ribonucleotide reductase. Grx2 has significantly higher catalytic activity in the reduction of mixed disulphides with glutathione (GSH) compared with other glutaredoxins. The active site residues (Cys9-Pro10-Tyr11-Cys12, in Escherichia coli Grx2), which are found at the interface between the N- and C-terminal domains are identical to other glutaredoxins, but there is no other similarity between glutaredoxin 2 and other glutaredoxins. Grx2 is structurally similar to glutathione-S-transferases (GST), but there is no obvious sequence similarity. The inter-domain contacts are mainly hydrophobic, suggesting that the two domains are unlikely to be stable on their own. Both domains are needed for correct folding and activity of Grx2. It is thought that the primary function of Probab=95.03 E-value=0.13 Score=28.59 Aligned_cols=66 Identities=14% Similarity=0.238 Sum_probs=49.1 Q ss_pred HHHHHHHHHHHHHHHHHCCCCCCCCCCCCHHHHHHHHHHHHHHHCCCCCHHHCHHHHHHHHHHHCCHHH Q ss_conf 999999999855654534798426675028999999999999961899945676889999999709988 Q gi|254780594|r 140 IARNNMREHIKYITWLIKSRSWIAGHHMSYADFAASATISILDYLGEIDWDTAPIVKEWYQRMKSRPSF 208 (230) Q Consensus 140 ~~~~~l~~~l~~le~~l~~~~fl~Gd~~s~ADi~~~~~l~~~~~~~~~~~~~~p~l~~w~~r~~~rps~ 208 (230) ...++++..|..++..+.......| ++|+-||.+++.|..+...+.+.|. |++.+|.++|.++-.+ T Consensus 59 ~~i~~l~~~L~~L~~ll~~~~~~n~-~Ls~DDi~lFp~LR~ltiVkgi~~p--~~v~~Y~~~~s~~t~V 124 (132) T pfam04399 59 QYIAELNADLEELDPLLLPSEAVNG-QLSTDDIILFPILRNLTIVKGLQFP--PRVRAYLDRMSALTGV 124 (132) T ss_pred HHHHHHHHHHHHHHHHHCCCCCCCC-CCCHHHHHHHHHHHHHHHCCCCCCC--HHHHHHHHHHHHHHCC T ss_conf 9999999999999998476335688-6679999999998533520788588--8999999999987498 No 123 >cd03034 ArsC_ArsC Arsenate Reductase (ArsC) family, ArsC subfamily; arsenic reductases similar to that encoded by arsC on the R733 plasmid of Escherichia coli. E. coli ArsC catalyzes the reduction of arsenate [As(V)] to arsenite [As(III)], the first step in the detoxification of arsenic, using reducing equivalents derived from glutathione (GSH) via glutaredoxin (GRX). ArsC contains a single catalytic cysteine, within a thioredoxin fold, that forms a covalent thiolate-As(V) intermediate, which is reduced by GRX through a mixed GSH-arsenate intermediate. This family of predominantly bacterial enzymes is unrelated to two other families of arsenate reductases which show similarity to low-molecular-weight acid phosphatases and phosphotyrosyl phosphatases. Probab=95.02 E-value=0.053 Score=30.77 Aligned_cols=33 Identities=27% Similarity=0.399 Sum_probs=30.0 Q ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCC Q ss_conf 521358998689999999998699829998886 Q gi|254780594|r 2 SILYHYPLSSSSRFIRLILSEYEFTTDMIEEFP 34 (230) Q Consensus 2 ~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~ 34 (230) +++||.|.|.=|++++-.|.+.|++|+++...- T Consensus 1 i~iY~~p~Cst~rka~~~L~~~~i~~~~idy~~ 33 (112) T cd03034 1 ITIYHNPRCSKSRNALALLEEAGIEPEIVEYLK 33 (112) T ss_pred CEEEECCCCHHHHHHHHHHHHCCCCCEEEECCC T ss_conf 989867998889999999998799848985014 No 124 >cd03035 ArsC_Yffb Arsenate Reductase (ArsC) family, Yffb subfamily; Yffb is an uncharacterized bacterial protein encoded by the yffb gene, related to the thioredoxin-fold arsenic reductases, ArsC. The structure of Yffb and the conservation of the catalytic cysteine suggest that it is likely to function as a glutathione (GSH)-dependent thiol reductase. ArsC catalyzes the reduction of arsenate [As(V)] to arsenite [As(III)], using reducing equivalents derived from GSH via glutaredoxin, through a single catalytic cysteine. Probab=94.95 E-value=0.055 Score=30.69 Aligned_cols=33 Identities=12% Similarity=0.004 Sum_probs=29.9 Q ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCC Q ss_conf 521358998689999999998699829998886 Q gi|254780594|r 2 SILYHYPLSSSSRFIRLILSEYEFTTDMIEEFP 34 (230) Q Consensus 2 ~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~ 34 (230) +++||++.|.=|+|++-.|.+.|++|+.+.+.- T Consensus 1 i~iY~~~~CstcrkA~k~L~~~~i~~~~~d~~~ 33 (105) T cd03035 1 ITLYGIKNCDTVKKARKWLEARGVAYTFHDYRK 33 (105) T ss_pred CEEECCCCCHHHHHHHHHHHHCCCCCEEEEEEC T ss_conf 989717998789999999998799808997567 No 125 >TIGR02180 GRX_euk Glutaredoxin; InterPro: IPR011899 Glutaredoxins , , , also known as thioltransferases (disulphide reductases, are small proteins of approximately one hundred amino-acid residues which utilise glutathione and NADPH as cofactors. Oxidized glutathione is regenerated by glutathione reductase. Together these components compose the glutathione system . Glutaredoxin functions as an electron carrier in the glutathione-dependent synthesis of deoxyribonucleotides by the enzyme ribonucleotide reductase. Like thioredoxin, which functions in a similar way, glutaredoxin possesses an active centre disulphide bond . It exists in either a reduced or an oxidized form where the two cysteine residues are linked in an intramolecular disulphide bond. Glutaredoxin has been sequenced in a variety of species. On the basis of extensive sequence similarity, it has been proposed that Vaccinia virus protein O2L is most probably a glutaredoxin. Finally, it must be noted that Bacteriophage T4 thioredoxin seems also to be evolutionary related. In position 5 of the pattern T4 thioredoxin has Val instead of Pro. This entry is found in eukaryotic glutaredoxins and includes sequences from fungi, plants and metazoans as well as viruses .; GO: 0006118 electron transport, 0045454 cell redox homeostasis. Probab=94.86 E-value=0.053 Score=30.79 Aligned_cols=67 Identities=10% Similarity=0.110 Sum_probs=53.1 Q ss_pred CCCCCCCCHHHHHHHHHHHHCCCC---EEEEECCCCCCCH----HHHHHCCCCCCCHHHHCCHHHHCCCCHHHHH Q ss_conf 213589986899999999986998---2999888677788----9998556877520111000111012103567 Q gi|254780594|r 3 ILYHYPLSSSSRFIRLILSEYEFT---TDMIEEFPWGKRR----EFLELNPSGTLPVYIDEHMQALCGVFVISEY 70 (230) Q Consensus 3 ~LY~~~~sp~~~kvr~~L~e~gi~---y~~~~~~~~~~~~----~~~~lnP~g~vP~L~~~dg~~l~eS~aI~~y 70 (230) +.|+=+.||||.++.=+|....++ +++++++...... .+.++.-..+||=+.. +|+-|.++..+... T Consensus 2 ~vFSKs~CPYC~~ak~~L~~~~~~~~~~~v~ElD~~~~g~~~Q~~L~~~TG~~TVP~iFi-~g~~iGG~~D~~~~ 75 (85) T TIGR02180 2 VVFSKSYCPYCKKAKEILKKLNVKPAKYKVVELDQLSNGSEIQDYLKEITGQRTVPNIFI-NGKFIGGCSDLLAL 75 (85) T ss_pred EEEECCCCCHHHHHHHHHHHCCCCCCCCEEEEECCCCCHHHHHHHHHHHCCCCCCCCEEE-CCEEECCCHHHHHH T ss_conf 577438984688999998744898676406770257885789999998448923882656-88576782455654 No 126 >cd03036 ArsC_like Arsenate Reductase (ArsC) family, unknown subfamily; uncharacterized proteins containing a CXXC motif with similarity to thioredoxin (TRX)-fold arsenic reductases, ArsC. Proteins containing a redox active CXXC motif like TRX and glutaredoxin (GRX) function as protein disulfide oxidoreductases, altering the redox state of target proteins via the reversible oxidation of the active site dithiol. ArsC catalyzes the reduction of arsenate [As(V)] to arsenite [As(III)], using reducing equivalents derived from glutathione via GRX, through a single catalytic cysteine. Probab=94.62 E-value=0.086 Score=29.58 Aligned_cols=33 Identities=24% Similarity=0.292 Sum_probs=30.1 Q ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCC Q ss_conf 521358998689999999998699829998886 Q gi|254780594|r 2 SILYHYPLSSSSRFIRLILSEYEFTTDMIEEFP 34 (230) Q Consensus 2 ~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~ 34 (230) +++||.+.|.-|+|++-.|.+.|++|+++...- T Consensus 1 i~iY~~~~C~t~rka~~~L~~~~i~~~~id~~k 33 (111) T cd03036 1 LKFYEYPKCSTCRKAKKWLDEHGVDYTAIDIVE 33 (111) T ss_pred CEEEECCCCHHHHHHHHHHHHCCCCEEEEECCC T ss_conf 969837998889999999998599839996147 No 127 >cd02977 ArsC_family Arsenate Reductase (ArsC) family; composed of TRX-fold arsenic reductases and similar proteins including the transcriptional regulator, Spx. ArsC catalyzes the reduction of arsenate [As(V)] to arsenite [As(III)], using reducing equivalents derived from glutathione (GSH) via glutaredoxin (GRX), through a single catalytic cysteine. This family of predominantly bacterial enzymes is unrelated to two other families of arsenate reductases which show similarity to low-molecular-weight acid phosphatases and phosphotyrosyl phosphatases. Spx is a general regulator that exerts negative and positive control over transcription initiation by binding to the C-terminal domain of the alpha subunit of RNA polymerase. Probab=94.06 E-value=0.092 Score=29.40 Aligned_cols=33 Identities=24% Similarity=0.253 Sum_probs=29.7 Q ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCC Q ss_conf 521358998689999999998699829998886 Q gi|254780594|r 2 SILYHYPLSSSSRFIRLILSEYEFTTDMIEEFP 34 (230) Q Consensus 2 ~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~ 34 (230) +++||.|.|.-|++++-.|.+.|++|+.+...- T Consensus 1 i~iy~~~~Cs~~rka~~~L~~~~i~~~~~d~~k 33 (105) T cd02977 1 ITIYGNPNCSTSRKALAWLEEHGIEYEFIDYLK 33 (105) T ss_pred CEEEECCCCHHHHHHHHHHHHCCCCCEEEHHHH T ss_conf 989727998789999999985799738860432 No 128 >PRK10026 arsenate reductase; Provisional Probab=94.06 E-value=0.057 Score=30.59 Aligned_cols=32 Identities=19% Similarity=0.204 Sum_probs=29.0 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHCCCCEEEEEC Q ss_conf 95213589986899999999986998299988 Q gi|254780594|r 1 MSILYHYPLSSSSRFIRLILSEYEFTTDMIEE 32 (230) Q Consensus 1 M~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~ 32 (230) ++|+||.|.|.=||.+.-+|.++|++++++.- T Consensus 3 ~itIYHNPrCSKSR~aL~lL~e~g~e~~vv~Y 34 (141) T PRK10026 3 NITIYHNPACGTSRNTLEMIRNSGTEPTIIYY 34 (141) T ss_pred CEEEEECCCCHHHHHHHHHHHHCCCCCEEEHH T ss_conf 42898898978899999999986999587156 No 129 >KOG1752 consensus Probab=93.97 E-value=0.12 Score=28.72 Aligned_cols=68 Identities=12% Similarity=0.080 Sum_probs=51.6 Q ss_pred CCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCCCCCHHHH----HHCCCCCCCHHHHCCHHHHCCCCHHHHHH Q ss_conf 2135899868999999999869982999888677788999----85568775201110001110121035677 Q gi|254780594|r 3 ILYHYPLSSSSRFIRLILSEYEFTTDMIEEFPWGKRREFL----ELNPSGTLPVYIDEHMQALCGVFVISEYL 71 (230) Q Consensus 3 ~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~~~~~~~~----~lnP~g~vP~L~~~dg~~l~eS~aI~~yL 71 (230) +.|+-..||||.++.-+|...|+.+.++.++......++. ++.-..+||.+.. +|+-+..+..+..+- T Consensus 17 VifSKs~C~~c~~~k~ll~~~~v~~~vvELD~~~~g~eiq~~l~~~tg~~tvP~vFI-~Gk~iGG~~dl~~lh 88 (104) T KOG1752 17 VIFSKSSCPYCHRAKELLSDLGVNPKVVELDEDEDGSEIQKALKKLTGQRTVPNVFI-GGKFIGGASDLMALH 88 (104) T ss_pred EEEECCCCCHHHHHHHHHHHCCCCCEEEECCCCCCCHHHHHHHHHHCCCCCCCEEEE-CCEEECCHHHHHHHH T ss_conf 999778573689999999847998679982589871899999997338788997999-997886788999999 No 130 >TIGR01617 arsC_related conserved hypothetical protein; InterPro: IPR006504 These sequences are a part of the Arsenate reductase family of sequences. The family includes a glutaredoxin-dependent arsenate reductase that works together with an arsenite exporter. It also includes an uncharacterised family associated with nitrogenase system genes in a number of phylogenetically distant species. The function of this group of sequences seems unlikely to be arsenate reductase, although that assignment has been given to a number of members of this family. . Probab=93.81 E-value=0.1 Score=29.10 Aligned_cols=39 Identities=21% Similarity=0.138 Sum_probs=33.1 Q ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCCCCCHH Q ss_conf 521358998689999999998699829998886777889 Q gi|254780594|r 2 SILYHYPLSSSSRFIRLILSEYEFTTDMIEEFPWGKRRE 40 (230) Q Consensus 2 ~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~~~~~~ 40 (230) +.||++|.|.-|+|++-.|.+.||+|+.+.+.....+.+ T Consensus 1 i~~Y~~~nC~tckKa~~wL~~~~i~y~~i~~~~~~~~~e 39 (122) T TIGR01617 1 IKLYGIPNCTTCKKARRWLEANGIEYEEIDLKEDTPTRE 39 (122) T ss_pred CEEECCCCCHHHHHHHHHHHHCCCCCCEEECCCCCCCHH T ss_conf 901026875126989999985398532000345897878 No 131 >cd01659 TRX_superfamily Thioredoxin (TRX) superfamily; a large, diverse group of proteins containing a TRX-fold. Many members contain a classic TRX domain with a redox active CXXC motif. They function as protein disulfide oxidoreductases (PDOs), altering the redox state of target proteins via the reversible oxidation of their active site dithiol. The PDO members of this superfamily include TRX, protein disulfide isomerase (PDI), tlpA-like, glutaredoxin, NrdH redoxin, and the bacterial Dsb (DsbA, DsbC, DsbG, DsbE, DsbDgamma) protein families. Members of the superfamily that do not function as PDOs but contain a TRX-fold domain include phosducins, peroxiredoxins and glutathione (GSH) peroxidases, SCO proteins, GSH transferases (GST, N-terminal domain), arsenic reductases, TRX-like ferredoxins and calsequestrin, among others. Probab=93.69 E-value=0.089 Score=29.50 Aligned_cols=57 Identities=12% Similarity=0.088 Sum_probs=43.4 Q ss_pred CCCCCCCCCHHHHHHHHHHH-----HCCCCEEEEECCCCCCCHHHHHHCCCCCCCHHHHCCH Q ss_conf 52135899868999999999-----8699829998886777889998556877520111000 Q gi|254780594|r 2 SILYHYPLSSSSRFIRLILS-----EYEFTTDMIEEFPWGKRREFLELNPSGTLPVYIDEHM 58 (230) Q Consensus 2 ~~LY~~~~sp~~~kvr~~L~-----e~gi~y~~~~~~~~~~~~~~~~lnP~g~vP~L~~~dg 58 (230) +++|+.+.||+|++++..+. ..++.+..+.++...........++.+++|+++..++ T Consensus 1 l~~f~~~~~~~c~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~P~~~~~~~ 62 (69) T cd01659 1 LVLFYAPWCPFCQALRPVLAELALLNKGVKFEAVDVDEDPALEKELKRYGVGGVPTLVVFGP 62 (69) T ss_pred CCEEECCCCHHHHHHHHHHHHHHHHCCCCEEEEEECCCCHHHHHHHHHCCCEECCEEEEEEC T ss_conf 94799988878897679999999777998799998999867888888679563278999989 No 132 >PRK10824 hypothetical protein; Provisional Probab=92.07 E-value=0.24 Score=27.04 Aligned_cols=64 Identities=16% Similarity=0.062 Sum_probs=45.0 Q ss_pred CCCCCHHHHHHHHHHHHCCCCEEEEECCCCCC-CHHHHHHCCCCCCCHHHHCCHHHHCCCCHHHHH Q ss_conf 58998689999999998699829998886777-889998556877520111000111012103567 Q gi|254780594|r 6 HYPLSSSSRFIRLILSEYEFTTDMIEEFPWGK-RREFLELNPSGTLPVYIDEHMQALCGVFVISEY 70 (230) Q Consensus 6 ~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~~~-~~~~~~lnP~g~vP~L~~~dg~~l~eS~aI~~y 70 (230) ..|.|.||.+|--+|...|++|..+.+..... ...+...+-.-++|=|.. +|+-|.++-.|.+- T Consensus 26 ~~P~CGFS~~~v~iL~~~gv~f~~vnVl~d~~ir~~lk~~s~WPT~PQlyi-~Ge~IGG~DI~~em 90 (115) T PRK10824 26 KLPSCGFSAQAVQALSACGERFAYVDILQNPDIRAELPKYANWPTFPQLWV-DGELVGGCDIVIEM 90 (115) T ss_pred CCCCCCCHHHHHHHHHHHCCCEEEEECCCCHHHHHHHHHHCCCCCCCCEEE-CCEEECCHHHHHHH T ss_conf 999780289999999992998068867889889999899819975561118-89766408999999 No 133 >TIGR02194 GlrX_NrdH Glutaredoxin-like protein NrdH; InterPro: IPR011909 Glutaredoxins , , , also known as thioltransferases (disulphide reductases, are small proteins of approximately one hundred amino-acid residues which utilise glutathione and NADPH as cofactors. Oxidized glutathione is regenerated by glutathione reductase. Together these components compose the glutathione system . Glutaredoxin functions as an electron carrier in the glutathione-dependent synthesis of deoxyribonucleotides by the enzyme ribonucleotide reductase. Like thioredoxin, which functions in a similar way, glutaredoxin possesses an active centre disulphide bond . It exists in either a reduced or an oxidized form where the two cysteine residues are linked in an intramolecular disulphide bond. Glutaredoxin has been sequenced in a variety of species. On the basis of extensive sequence similarity, it has been proposed that Vaccinia virus protein O2L is most probably a glutaredoxin. Finally, it must be noted that Bacteriophage T4 thioredoxin seems also to be evolutionary related. In position 5 of the pattern T4 thioredoxin has Val instead of Pro. NrdH-redoxin is a representative of a class of small redox proteins that contain a conserved CXXC motif and are characterised by a glutaredoxin-like amino acid sequence and thioredoxin-like activity profile. Unlike other the glutaredoxins to which it is most closely related, NrdH aparrently does not interact with glutathione/glutathione reductase, but rather with thioredoxin reductase to catalyze the reduction of ribonucleotide reductase .; GO: 0006118 electron transport, 0045454 cell redox homeostasis. Probab=91.18 E-value=0.24 Score=26.98 Aligned_cols=62 Identities=13% Similarity=0.133 Sum_probs=46.1 Q ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHCCCCCCCHHHHCCHHHHCC Q ss_conf 52135899868999999999869982999888677788999855687752011100011101 Q gi|254780594|r 2 SILYHYPLSSSSRFIRLILSEYEFTTDMIEEFPWGKRREFLELNPSGTLPVYIDEHMQALCG 63 (230) Q Consensus 2 ~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~~~~~~~~~lnP~g~vP~L~~~dg~~l~e 63 (230) +|.|+=+-|-=|.-+.=+|.+.||+||.+.++.....-|+++-=....+||...+|..++.+ T Consensus 1 itvysKNnCvQCkmTKk~L~~~~i~Fe~inid~~pea~d~vk~lGF~~~PVi~~~g~~~~SG 62 (72) T TIGR02194 1 ITVYSKNNCVQCKMTKKALEEHGIAFEEINIDEQPEAVDYVKALGFRQVPVIVAEGDESWSG 62 (72) T ss_pred CEEECCCCCCCCCHHHHHHHHCCCCEEEEECCCCHHHHHHHHHCCCEECCEEEECCCEEEEC T ss_conf 95531798505215776676379961576358884378999874863076267679700213 No 134 >pfam11801 Tom37_C Tom37 C-terminal domain. The TOM37 protein is one of the outer membrane proteins that make up the TOM complex for guiding cytosolic mitochondrial beta-barrel proteins from the cytosol across the outer mitochondrial membrane into the intramembrane space. In conjunction with TOM70 it guides peptides without an MTS into TOM40, the protein that forms the passage through the outer membrane. It has homology with Metaxin-1, also part of the outer mitochondrial membrane beta-barrel protein transport complex. Probab=90.71 E-value=0.9 Score=23.73 Aligned_cols=38 Identities=16% Similarity=0.121 Sum_probs=32.0 Q ss_pred HHHHHHHHHHHHCCCCCCCCC-CCCHHHHHHHHHHHHHH Q ss_conf 999985565453479842667-50289999999999999 Q gi|254780594|r 145 MREHIKYITWLIKSRSWIAGH-HMSYADFAASATISILD 182 (230) Q Consensus 145 l~~~l~~le~~l~~~~fl~Gd-~~s~ADi~~~~~l~~~~ 182 (230) -.+-+..++..|++..|+.|| .+|-.|+.+++.|..+- T Consensus 112 a~~cl~~L~~~Lg~~~~~fgd~~psslD~l~~ayLalll 150 (164) T pfam11801 112 NLECLTLLEEYLGQWGYLFGDSIPSSSDLLFLAYLYLLL 150 (164) T ss_pred HHHHHHHHHHHHCCCCEECCCCCCCHHHHHHHHHHHHHH T ss_conf 999999999996025533589998739999999999994 No 135 >pfam03960 ArsC ArsC family. This family is related to glutaredoxins pfam00462. Probab=90.13 E-value=0.82 Score=23.96 Aligned_cols=30 Identities=20% Similarity=0.167 Sum_probs=27.5 Q ss_pred CCCCCCHHHHHHHHHHHHCCCCEEEEECCC Q ss_conf 358998689999999998699829998886 Q gi|254780594|r 5 YHYPLSSSSRFIRLILSEYEFTTDMIEEFP 34 (230) Q Consensus 5 Y~~~~sp~~~kvr~~L~e~gi~y~~~~~~~ 34 (230) ||.|.|.=|+++.-.|.+.|++|+.+...- T Consensus 1 y~~p~Cst~rka~~~L~~~~i~~~~~d~~~ 30 (111) T pfam03960 1 YGSPNCDTCRKALAWLEEHGIEYQEIDYLE 30 (111) T ss_pred CCCCCCHHHHHHHHHHHHCCCCEEEEECCC T ss_conf 979987899999999998799808973246 No 136 >TIGR00411 redox_disulf_1 redox-active disulfide protein 1; InterPro: IPR004502 This group of proteins includes thioredoxins, glutaredoxins, protein-disulphide isomerases, and others, some of which have several such domains. The sequence of proteins in this group at the redox-active disufide site, CPYC, matches glutaredoxins rather than thioredoxins, although overall the sequence seems closer to thioredoxins. Proteins may be involved in a ribonucleotide-reducing system component distinct from thioredoxin or glutaredoxin.; GO: 0009055 electron carrier activity, 0015035 protein disulfide oxidoreductase activity, 0045454 cell redox homeostasis. Probab=89.00 E-value=0.23 Score=27.16 Aligned_cols=57 Identities=11% Similarity=-0.019 Sum_probs=36.7 Q ss_pred CCCCCCCCCCHHHHHHHHH----HHHCCCCEEEEECCCCCCCHHHHHHCCCCCCCHHHHCCH Q ss_conf 9521358998689999999----998699829998886777889998556877520111000 Q gi|254780594|r 1 MSILYHYPLSSSSRFIRLI----LSEYEFTTDMIEEFPWGKRREFLELNPSGTLPVYIDEHM 58 (230) Q Consensus 1 M~~LY~~~~sp~~~kvr~~----L~e~gi~y~~~~~~~~~~~~~~~~lnP~g~vP~L~~~dg 58 (230) |+.||.+|+||||=++.-+ ..|.|...+...++......+-++- ..--||+|+.+|. T Consensus 2 ~i~lFTSPtCPyCP~A~~VV~~vakE~gd~~~v~~in~~e~~~kA~~y-Gi~aVPaivINg~ 62 (82) T TIGR00411 2 KIELFTSPTCPYCPKAEKVVEEVAKEEGDLVEVEKINVMEDLKKALEY-GIMAVPAIVINGK 62 (82) T ss_pred EEEEECCCCCCCCCCHHHHHHHHHCCCCCEEEEEEECCCCCHHHHHHC-CCCCCCEEEECCE T ss_conf 135511887688854368898874058973432221245484788751-6352684787790 No 137 >PTZ00062 glutaredoxin; Provisional Probab=86.52 E-value=0.67 Score=24.44 Aligned_cols=20 Identities=15% Similarity=0.313 Sum_probs=11.1 Q ss_pred HCHHHHHHHHHHHCCHHHHH Q ss_conf 67688999999970998898 Q gi|254780594|r 191 TAPIVKEWYQRMKSRPSFRP 210 (230) Q Consensus 191 ~~p~l~~w~~r~~~rps~~~ 210 (230) ..+.++.......+.|.+=+ T Consensus 166 ~D~~vR~~LK~ySnWPTiPQ 185 (219) T PTZ00062 166 DDPDLREELKIYSNWPTYPQ 185 (219) T ss_pred CCHHHHHHHHHHCCCCCCCE T ss_conf 98899973355459999783 No 138 >pfam11287 DUF3088 Protein of unknown function (DUF3088). This family of proteins with unknown function appears to be restricted to Proteobacteria. Probab=85.75 E-value=1.1 Score=23.25 Aligned_cols=69 Identities=20% Similarity=0.183 Sum_probs=46.0 Q ss_pred CCHHHHHHHHHHHHC---CCCEEEEECCCCCCCHHHHHHC--CCCCCCHHHHCCHH-------------HHCCCCHHHHH Q ss_conf 986899999999986---9982999888677788999855--68775201110001-------------11012103567 Q gi|254780594|r 9 LSSSSRFIRLILSEY---EFTTDMIEEFPWGKRREFLELN--PSGTLPVYIDEHMQ-------------ALCGVFVISEY 70 (230) Q Consensus 9 ~sp~~~kvr~~L~e~---gi~y~~~~~~~~~~~~~~~~ln--P~g~vP~L~~~dg~-------------~l~eS~aI~~y 70 (230) .||.|..+.=+|+.. .-..+++-++......+..++- -....|+|+..+|. -|.|...|++| T Consensus 22 ~Cp~c~~iEGlLa~~P~l~~~ldV~rV~f~RPR~~vv~llGe~~Qs~PvLVL~~~~~~~~~~~~~~g~rfi~d~~~I~~~ 101 (111) T pfam11287 22 YCPDCAAIEGLLASFPALAERLDVERVDFPRPRAAVVALLGEENQSLPVLVLADGDPAPDDAATAGGRRFITDPRAILRY 101 (111) T ss_pred ECCCHHHHHHHHHHCCCHHHCCCEEEECCCCCHHHHHHHHCCCCCCCCEEEECCCCCCCCCCCCCCCCEECCCHHHHHHH T ss_conf 67753766608762810253111798178886499998748001599889968999884212245880770898999999 Q ss_pred HHHHCCC Q ss_conf 7752887 Q gi|254780594|r 71 LDETYGA 77 (230) Q Consensus 71 L~~~~~~ 77 (230) |.++|+- T Consensus 102 La~r~g~ 108 (111) T pfam11287 102 LAERYGF 108 (111) T ss_pred HHHHCCC T ss_conf 9997088 No 139 >TIGR00014 arsC arsenate reductase; InterPro: IPR006659 This entry describes a distinct clade, including ArsC itself, of the broader family of ArsC and related proteins. This clade is almost completely restricted to the proteobacteria. An anion-translocating ATPase has been identified as the product of the arsenical resistance operon of resistance plasmid R773 . When expressed in Escherichia coli this ATP-driven oxyanion pump catalyses extrusion of the oxyanions arsenite, antimonite and arsenate. The pump is composed of two polypeptides, the products of the arsA and arsB genes. The pump alone produces resistance to arsenite and antimonite. This protein, ArsC, catalyzes the reduction of arsenate to arsenite, and thus extends resistance to include arsenate. ; GO: 0008794 arsenate reductase (glutaredoxin) activity, 0006118 electron transport. Probab=78.04 E-value=2.4 Score=21.24 Aligned_cols=30 Identities=23% Similarity=0.424 Sum_probs=27.0 Q ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCEEEEE Q ss_conf 521358998689999999998699829998 Q gi|254780594|r 2 SILYHYPLSSSSRFIRLILSEYEFTTDMIE 31 (230) Q Consensus 2 ~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~ 31 (230) .|.||.|.|.=||.+..+|+.+||.=++|. T Consensus 1 vTIyHNPRCSkSR~tLalL~dkgi~P~vV~ 30 (114) T TIGR00014 1 VTIYHNPRCSKSRQTLALLEDKGIEPEVVK 30 (114) T ss_pred CCEEECCCCCHHHHHHHHHHHCCCCCEEEE T ss_conf 911226987246899999985699954741 No 140 >PRK12759 bifunctional gluaredoxin/ribonucleoside-diphosphate reductase subunit beta; Provisional Probab=74.35 E-value=5.1 Score=19.40 Aligned_cols=69 Identities=14% Similarity=0.041 Sum_probs=45.9 Q ss_pred CCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCCCCCHH-HHHHC--------CCCCCCHHHHCCHHHHCCCCHHHHHH Q ss_conf 521358998689999999998699829998886777889-99855--------68775201110001110121035677 Q gi|254780594|r 2 SILYHYPLSSSSRFIRLILSEYEFTTDMIEEFPWGKRRE-FLELN--------PSGTLPVYIDEHMQALCGVFVISEYL 71 (230) Q Consensus 2 ~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~~~~~~-~~~ln--------P~g~vP~L~~~dg~~l~eS~aI~~yL 71 (230) .++|+-..||||-++.-.+.+.|++|+-..++-.....+ +..+| +..++|=+.. ||..|.+=..-..+. T Consensus 4 v~i~sk~~c~~c~~a~~~~~~~~~~~~~~~~d~~~~~~~~~~~~~~~~~~~~~~~~t~pqifi-d~~~ig~y~~~~~~~ 81 (410) T PRK12759 4 VRIYTKTNCPFCDLAKSWFGANDIPFTQISLDDDVKRAEFYAEVNKNILLVEEHIRTVPQIFV-GDVHIGGYDNLMARA 81 (410) T ss_pred EEEEECCCCCHHHHHHHHHHHCCCCCEEEECCCHHHHHHHHHHHHHHHHHHCCCCCCCCEEEE-CCEEECCHHHHHHHH T ss_conf 899805999658999999997499723421277798999998765555651577787861778-883424799999999 No 141 >COG0278 Glutaredoxin-related protein [Posttranslational modification, protein turnover, chaperones] Probab=74.21 E-value=3.1 Score=20.60 Aligned_cols=66 Identities=17% Similarity=0.139 Sum_probs=44.4 Q ss_pred CCCCCHHHHHHHHHHHHCC-CCEEEEECCCCC-CCHHHHHHCCCCCCCHHHHCCHHHHCCCCHHHHHHH Q ss_conf 5899868999999999869-982999888677-788999855687752011100011101210356777 Q gi|254780594|r 6 HYPLSSSSRFIRLILSEYE-FTTDMIEEFPWG-KRREFLELNPSGTLPVYIDEHMQALCGVFVISEYLD 72 (230) Q Consensus 6 ~~~~sp~~~kvr~~L~e~g-i~y~~~~~~~~~-~~~~~~~lnP~g~vP~L~~~dg~~l~eS~aI~~yL~ 72 (230) ..|.|.||.++-=+|...| ++|..+.+-... -...+.+.+-.-++|-|-. +|+-+.+|..|.+-.+ T Consensus 26 ~~P~CGFS~~~vqiL~~~g~v~~~~vnVL~d~eiR~~lk~~s~WPT~PQLyi-~GEfvGG~DIv~Em~q 93 (105) T COG0278 26 EFPQCGFSAQAVQILSACGVVDFAYVDVLQDPEIRQGLKEYSNWPTFPQLYV-NGEFVGGCDIVREMYQ 93 (105) T ss_pred CCCCCCCCHHHHHHHHHCCCCCEEEEEECCCHHHHHCCHHHCCCCCCCEEEE-CCEEECCHHHHHHHHH T ss_conf 9987786399999999749863367642169989852464459998641458-7787640799999997 No 142 >pfam05768 DUF836 Glutaredoxin-like domain (DUF836). These proteins are related to the pfam00462 family. Probab=72.02 E-value=5.2 Score=19.33 Aligned_cols=52 Identities=19% Similarity=0.147 Sum_probs=35.4 Q ss_pred CCCCCCCCCHHHHHHHHHHHHC----CCCEEEEECCCCCCCHHHHHHCCCCCCCHHHHCC Q ss_conf 5213589986899999999986----9982999888677788999855687752011100 Q gi|254780594|r 2 SILYHYPLSSSSRFIRLILSEY----EFTTDMIEEFPWGKRREFLELNPSGTLPVYIDEH 57 (230) Q Consensus 2 ~~LY~~~~sp~~~kvr~~L~e~----gi~y~~~~~~~~~~~~~~~~lnP~g~vP~L~~~d 57 (230) .|||+-+.|+-|..++-+|... ++.++.+.+. .++++.+. -.-+||||..++ T Consensus 2 l~Ly~r~gChLCe~a~~~L~~~~~~~~~~~~~vdI~---~d~~L~~~-Y~~~IPVl~~~~ 57 (76) T pfam05768 2 LTLYGKPGCHLCEGAKEVLAELEAALGFDLERIDID---DDEELFAR-YGLEIPVLALVG 57 (76) T ss_pred EEEEECCCCCHHHHHHHHHHHHHHCCCCEEEEEECC---CCHHHHHH-HCCCCCEEEECC T ss_conf 899918995738999999998753459559999888---99899998-488789899879 No 143 >PHA02125 thioredoxin-like protein Probab=64.80 E-value=3.4 Score=20.42 Aligned_cols=55 Identities=13% Similarity=0.211 Sum_probs=36.4 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHCCCCEEEEECCCCCCCHHHHHHCCCCCCCHHHHCCHHH Q ss_conf 952135899868999999999869982999888677788999855687752011100011 Q gi|254780594|r 1 MSILYHYPLSSSSRFIRLILSEYEFTTDMIEEFPWGKRREFLELNPSGTLPVYIDEHMQA 60 (230) Q Consensus 1 M~~LY~~~~sp~~~kvr~~L~e~gi~y~~~~~~~~~~~~~~~~lnP~g~vP~L~~~dg~~ 60 (230) |+-+|+...|+.|+-..=+|.+.+ |+.+.++.. ...++.+..--..+|+|+ ||+. T Consensus 1 vi~~F~A~WCgpCk~l~P~l~~ie--~~~~~VD~D-~~~~la~k~~Ir~IPT~i--nG~e 55 (75) T PHA02125 1 MIYLFGAEWCANCKMVKPMLANVE--YTYVDVDTD-EGVELTAKHHIRSLPTLV--NTST 55 (75) T ss_pred CEEEEECCCCCCHHHCCCCCHHCC--CEEEEECCC-CCHHHHHHCCCCCCCEEE--CCEE T ss_conf 989984787753211366721116--357983455-377689874961598178--9989 No 144 >cd02973 TRX_GRX_like Thioredoxin (TRX)-Glutaredoxin (GRX)-like family; composed of archaeal and bacterial proteins that show similarity to both TRX and GRX, including the C-terminal TRX-fold subdomain of Pyrococcus furiosus protein disulfide oxidoreductase (PfPDO). All members contain a redox-active CXXC motif and may function as PDOs. The archaeal proteins Mj0307 and Mt807 show structures more similar to GRX, but activities more similar to TRX. Some members of the family are similar to PfPDO in that they contain a second CXXC motif located in a second TRX-fold subdomain at the N-terminus; the superimposable N- and C-terminal TRX subdomains form a compact structure. PfPDO is postulated to be the archaeal counterpart of bacterial DsbA and eukaryotic protein disulfide isomerase (PDI). The C-terminal CXXC motif of PfPDO is required for its oxidase, reductase and isomerase activities. Also included in the family is the C-terminal TRX-fold subdomain of the N-terminal domain (NTD) of bacteri Probab=64.32 E-value=7.3 Score=18.49 Aligned_cols=53 Identities=8% Similarity=0.053 Sum_probs=33.8 Q ss_pred CCCCCCCCCHHHHHHHHHHHHC-----CCCEEEEECCCCCCCHHHHHHCCCCCCCHHHHCC Q ss_conf 5213589986899999999986-----9982999888677788999855687752011100 Q gi|254780594|r 2 SILYHYPLSSSSRFIRLILSEY-----EFTTDMIEEFPWGKRREFLELNPSGTLPVYIDEH 57 (230) Q Consensus 2 ~~LY~~~~sp~~~kvr~~L~e~-----gi~y~~~~~~~~~~~~~~~~lnP~g~vP~L~~~d 57 (230) +..|..+.||+|-.+-.+++.. +|..+.++... .+++.+.---..||.++.+| T Consensus 3 i~vfvS~sCp~CP~~v~~~~~~a~~n~~I~~~~id~~~---~~~~~~~y~V~~VP~ivin~ 60 (67) T cd02973 3 IEVFVSPTCPYCPDAVQAANRIAALNPNISAEMIDAAE---FPDLADEYGVMSVPAIVING 60 (67) T ss_pred EEEEECCCCCCCHHHHHHHHHHHHHCCCEEEEEEECCC---CHHHHHHCCCCCCCEEEECC T ss_conf 99997699999789999999999768980899999954---98799976987689899999 No 145 >TIGR02183 GRXA Glutaredoxin, GrxA family; InterPro: IPR011902 Glutaredoxins , , , also known as thioltransferases (disulphide reductases, are small proteins of approximately one hundred amino-acid residues which utilise glutathione and NADPH as cofactors. Oxidized glutathione is regenerated by glutathione reductase. Together these components compose the glutathione system . Glutaredoxin functions as an electron carrier in the glutathione-dependent synthesis of deoxyribonucleotides by the enzyme ribonucleotide reductase. Like thioredoxin, which functions in a similar way, glutaredoxin possesses an active centre disulphide bond . It exists in either a reduced or an oxidized form where the two cysteine residues are linked in an intramolecular disulphide bond. Glutaredoxin has been sequenced in a variety of species. On the basis of extensive sequence similarity, it has been proposed that Vaccinia virus protein O2L is most probably a glutaredoxin. Finally, it must be noted that Bacteriophage T4 thioredoxin seems also to be evolutionary related. In position 5 of the pattern T4 thioredoxin has Val instead of Pro. This entry includes the Escherichia coli glyutaredoxin GrxA which appears to have primary responsibility for the reduction of ribonucleotide reductase .; GO: 0009055 electron carrier activity, 0015035 protein disulfide oxidoreductase activity, 0045454 cell redox homeostasis. Probab=62.26 E-value=9.9 Score=17.74 Aligned_cols=73 Identities=12% Similarity=0.230 Sum_probs=47.3 Q ss_pred CCCCCCCCCHHHHHHHHHHHHCC-----CCEEEEECCCCCCC-HHHHHH--CCCCCCCHHHHCCHHHHCCCCHHHHHHHH Q ss_conf 52135899868999999999869-----98299988867778-899985--56877520111000111012103567775 Q gi|254780594|r 2 SILYHYPLSSSSRFIRLILSEYE-----FTTDMIEEFPWGKR-REFLEL--NPSGTLPVYIDEHMQALCGVFVISEYLDE 73 (230) Q Consensus 2 ~~LY~~~~sp~~~kvr~~L~e~g-----i~y~~~~~~~~~~~-~~~~~l--nP~g~vP~L~~~dg~~l~eS~aI~~yL~~ 73 (230) .+.|+-|.||||-|+.-+.+-+. -.|..+.+...+-+ .++.+. -|--+||=+..|+ +.+.+.+.--+|.-+ T Consensus 2 v~IfGr~gC~YCVRA~~LaEkl~~~~~DF~fry~Di~AEGI~K~DL~~~VGKpVeTVPQIF~D~-~~vGGCTdF~~~~k~ 80 (86) T TIGR02183 2 VVIFGRPGCPYCVRAKQLAEKLAEELADFEFRYIDIHAEGISKADLEKTVGKPVETVPQIFVDE-KHVGGCTDFEQLVKE 80 (86) T ss_pred EEEECCCCCCHHHHHHHHHHHHHHHCCCCCCEEEEEEECCCCHHHHCCCCCCCCCCCCCEEEEC-CCCCCCCCHHHHHHH T ss_conf 5773279984023368889998864035562058887458871221033788415667415407-776885217889887 Q ss_pred HC Q ss_conf 28 Q gi|254780594|r 74 TY 75 (230) Q Consensus 74 ~~ 75 (230) ++ T Consensus 81 ~~ 82 (86) T TIGR02183 81 KF 82 (86) T ss_pred HH T ss_conf 40 No 146 >TIGR00365 TIGR00365 glutaredoxin homolog; InterPro: IPR004480 Glutaredoxins , , , also known as thioltransferases (disulphide reductases, are small proteins of approximately one hundred amino-acid residues which utilise glutathione and NADPH as cofactors. Oxidized glutathione is regenerated by glutathione reductase. Together these components compose the glutathione system . Glutaredoxin functions as an electron carrier in the glutathione-dependent synthesis of deoxyribonucleotides by the enzyme ribonucleotide reductase. Like thioredoxin, which functions in a similar way, glutaredoxin possesses an active centre disulphide bond . It exists in either a reduced or an oxidized form where the two cysteine residues are linked in an intramolecular disulphide bond. Glutaredoxin has been sequenced in a variety of species. On the basis of extensive sequence similarity, it has been proposed that Vaccinia virus protein O2L is most probably a glutaredoxin. Finally, it must be noted that Bacteriophage T4 thioredoxin seems also to be evolutionary related. In position 5 of the pattern T4 thioredoxin has Val instead of Pro. This family groups a number of hypothetical proteins from different organisms which are related to glutaredoxin proteins.. Probab=60.26 E-value=8.2 Score=18.19 Aligned_cols=66 Identities=20% Similarity=0.127 Sum_probs=46.5 Q ss_pred CCCCCHHHHHHHHHHHHCCCC---EEEEECCCCC-CCHHHHHHCCCCCCCHHHHCCHHHHCCCCHHHHHHH Q ss_conf 589986899999999986998---2999888677-788999855687752011100011101210356777 Q gi|254780594|r 6 HYPLSSSSRFIRLILSEYEFT---TDMIEEFPWG-KRREFLELNPSGTLPVYIDEHMQALCGVFVISEYLD 72 (230) Q Consensus 6 ~~~~sp~~~kvr~~L~e~gi~---y~~~~~~~~~-~~~~~~~lnP~g~vP~L~~~dg~~l~eS~aI~~yL~ 72 (230) ..|-|.||+++-=+|.+.||+ |..+++-... -+.++.+-+=.=++|=|-. ||+-+.++-.|.+-.. T Consensus 23 ~~P~CGFS~~~v~~L~~~~~~PakF~~~DiL~d~~iR~~lk~ysnWPT~PQLyV-nGEf~GG~Dii~e~~~ 92 (103) T TIGR00365 23 KLPKCGFSRQAVGILRECGIRPAKFAYVDILQDPEIRQGLKEYSNWPTFPQLYV-NGEFVGGCDIIIEMYQ 92 (103) T ss_pred CCCCCCCHHHHHHHHHHCCCCCCCCCCCCCCCCCHHHHHCHHCCCCCCCCCEEE-CCEEECHHHHHHHHHH T ss_conf 988998817899999854898211320002478133220100035799983566-5535561589999976 No 147 >cd03031 GRX_GRX_like Glutaredoxin (GRX) family, GRX-like domain containing protein subfamily; composed of uncharacterized eukaryotic proteins containing a GRX-like domain having only one conserved cysteine, aligning to the C-terminal cysteine of the CXXC motif of GRXs. This subfamily is predominantly composed of plant proteins. GRX is a glutathione (GSH) dependent reductase, catalyzing the disulfide reduction of target proteins via a redox active CXXC motif using a similar dithiol mechanism employed by TRXs. GRX has preference for mixed GSH disulfide substrates, in which it uses a monothiol mechanism where only the N-terminal cysteine is required. Proteins containing only the C-terminal cysteine are generally redox inactive. Probab=59.52 E-value=10 Score=17.63 Aligned_cols=66 Identities=21% Similarity=0.160 Sum_probs=42.9 Q ss_pred CCCCCC------CCHHHHHHHHHHHHCCCCEEEEECCCCC-CCHHHHHHCC----CCCCCHHHHCCHHHHCCCCHHHH Q ss_conf 213589------9868999999999869982999888677-7889998556----87752011100011101210356 Q gi|254780594|r 3 ILYHYP------LSSSSRFIRLILSEYEFTTDMIEEFPWG-KRREFLELNP----SGTLPVYIDEHMQALCGVFVISE 69 (230) Q Consensus 3 ~LY~~~------~sp~~~kvr~~L~e~gi~y~~~~~~~~~-~~~~~~~lnP----~g~vP~L~~~dg~~l~eS~aI~~ 69 (230) +||... +=--|.+||.+|.-+++.|+-+++.... ..+|+.++-. ..++|-+.. +|..|.+...+.+ T Consensus 3 VlYtTSlrgvR~TfedC~~vr~iL~~~~V~~~ERDVsm~~~~r~EL~~~lg~~~~~~tvPqvFi-~g~~vGG~dev~~ 79 (147) T cd03031 3 VLYTTSLRGVRKTFEDCNNVRAILESFRVKFDERDVSMDSGFREELRELLGAELKAVSLPRVFV-DGRYLGGAEEVLR 79 (147) T ss_pred EEEECCCCHHHHHHHHHHHHHHHHHHCCCEEEEEECCCCHHHHHHHHHHHCCCCCCCCCCEEEE-CCEEECCHHHHHH T ss_conf 9998887202678999999999999789079998345898799999999714578999996988-9999878999999 No 148 >KOG0911 consensus Probab=54.89 E-value=15 Score=16.65 Aligned_cols=14 Identities=14% Similarity=0.496 Sum_probs=4.4 Q ss_pred HHHHHHHHHCCHHH Q ss_conf 89999999709988 Q gi|254780594|r 195 VKEWYQRMKSRPSF 208 (230) Q Consensus 195 l~~w~~r~~~rps~ 208 (230) |+.=.+...+.|.| T Consensus 182 lRqglK~fSdWPTf 195 (227) T KOG0911 182 LRQGLKEFSDWPTF 195 (227) T ss_pred HHHHHHHHCCCCCC T ss_conf 99876652379986 No 149 >pfam09635 MetRS-N MetRS-N binding domain. The MetRS-N domain binds an Arc1-P domain in a tetrameric complex resembling a classical GST homo-dimer. Domain-swapping between symmetrically related MetRS-N and Arc1p-N domains generates a 2:2 tetramer held together by van der Waals forces. This domain is necessary for formation of the aminoacyl-tRNA synthetase complex necessary for tRNA nuclear export and shuttling as part of the translational apparatus. The domain is associated with pfam09334. Probab=51.37 E-value=6.7 Score=18.71 Aligned_cols=32 Identities=19% Similarity=0.208 Sum_probs=23.3 Q ss_pred CCCCCCCHHHH-CCHHHHCCCCHHHHHHHHHCC Q ss_conf 56877520111-000111012103567775288 Q gi|254780594|r 45 NPSGTLPVYID-EHMQALCGVFVISEYLDETYG 76 (230) Q Consensus 45 nP~g~vP~L~~-~dg~~l~eS~aI~~yL~~~~~ 76 (230) |+.-.-|.|++ ..|.-+.|++||++|+.+-|- T Consensus 31 ned~s~~~L~~~ksgF~L~e~NAIlrYl~~DF~ 63 (122) T pfam09635 31 NEDESNPQLKDTKSGFRLFDANAILRYVLDDFE 63 (122) T ss_pred CCCCCCCEEEECCCCCEEECCCHHHHHHHHHCC T ss_conf 776676545436987177363199999987546 No 150 >TIGR01246 dapE_proteo succinyl-diaminopimelate desuccinylase; InterPro: IPR005941 The lysine/diaminopimelic acid branch of the aspartate pathway produces the essential amino acid lysine via the intermediate meso-diaminopimelic acid (meso-DAP), which is also a vital cell wall component in Gram-negative bacteria . The production of dihydropicolinate from aspartate-semialdehyde controls flux into the lysine/diaminopimelic acid pathway. Three variants of this pathway exist, differing in how tetrahydropicolinate (formed by reduction of dihydropicolinate) is metabolised to meso-DAP. One variant, the most commonly found one in archaea and bacteria, uses primarily succinyl intermediates, while a second variant, found only in Bacillus, utilises primarily acetyl intermediates. In the third variant, found in some Gram-positive bacteria, a dehydrogenase converts tetrahydropicolinate directly to meso-DAP. In all variants meso-DAP is subsequently converted to lysine by a decarboxylase, or, in Gram-negative bacteria, assimilated into the cell wall. Evidence exists that a fourth, currently unknown, variant of this pathway may function in plants . Succinyl-diaminopimelate desuccinylase (3.5.1.18 from EC) hydrolyses N-succinyl-L,L-diaminopimelic acid which is required for the bacterial synthesis of lysine and meso-diaminopimelic acid. This group of bacterial sequences belong to the MEROPS peptidase family M20 (clan MH), subfamily M20A (non-peptidase homologs). ; GO: 0009014 succinyl-diaminopimelate desuccinylase activity, 0009089 lysine biosynthetic process via diaminopimelate. Probab=50.63 E-value=10 Score=17.71 Aligned_cols=36 Identities=25% Similarity=0.415 Sum_probs=29.6 Q ss_pred HHHHHHCCC--C-CCCHHHHC--CHHHHCCCCHHHHHHHHH Q ss_conf 899985568--7-75201110--001110121035677752 Q gi|254780594|r 39 REFLELNPS--G-TLPVYIDE--HMQALCGVFVISEYLDET 74 (230) Q Consensus 39 ~~~~~lnP~--g-~vP~L~~~--dg~~l~eS~aI~~yL~~~ 74 (230) +.|++.||. | ++=.|++. +|+.+.++..|+|||-+| T Consensus 115 erFv~~~pdhkGa~islLiTSDEEG~A~dGT~~vve~L~~r 155 (383) T TIGR01246 115 ERFVKKNPDHKGASISLLITSDEEGEAIDGTKKVVETLMAR 155 (383) T ss_pred HHHHHHCCCCCCCHHHHHHHHHCCCCCHHHHHHHHHHHHHH T ss_conf 99998477778405344333000133113688999999972 No 151 >cd03020 DsbA_DsbC_DsbG DsbA family, DsbC and DsbG subfamily; V-shaped homodimeric proteins containing a redox active CXXC motif imbedded in a TRX fold. They function as protein disulfide isomerases and chaperones in the bacterial periplasm to correct non-native disulfide bonds formed by DsbA and prevent aggregation of incorrectly folded proteins. DsbC and DsbG are kept in their reduced state by the cytoplasmic membrane protein DsbD, which utilizes the TRX/TRX reductase system in the cytosol as a source of reducing equivalents. DsbG differ from DsbC in that it has a more limited substrate specificity, and it may preferentially act later in the folding process to catalyze disulfide rearrangements in folded or partially folded proteins. Also included in the alignment is the predicted protein TrbB, whose gene was sequenced from the enterohemorrhagic E. coli type IV pilus gene cluster, which is required for efficient plasmid transfer. Probab=50.18 E-value=18 Score=16.21 Aligned_cols=19 Identities=16% Similarity=0.168 Sum_probs=10.9 Q ss_pred CCCCCCCHHHHHHHHHHHH Q ss_conf 1358998689999999998 Q gi|254780594|r 4 LYHYPLSSSSRFIRLILSE 22 (230) Q Consensus 4 LY~~~~sp~~~kvr~~L~e 22 (230) .|..|.||||+|..=-+.. T Consensus 83 vFtDp~CpyC~kl~~~l~~ 101 (197) T cd03020 83 VFTDPDCPYCRKLEKELKP 101 (197) T ss_pred EEECCCCHHHHHHHHHHHH T ss_conf 9979898789999999885 No 152 >cd03030 GRX_SH3BGR Glutaredoxin (GRX) family, SH3BGR (SH3 domain binding glutamic acid-rich protein) subfamily; a recently-identified subfamily composed of SH3BGR and similar proteins possessing significant sequence similarity to GRX, but without a redox active CXXC motif. The SH3BGR gene was cloned in an effort to identify genes mapping to chromosome 21, which could be involved in the pathogenesis of congenital heart disease affecting Down syndrome newborns. Several human SH3BGR-like (SH3BGRL) genes have been identified since, mapping to different locations in the chromosome. Of these, SH3BGRL3 was identified as a tumor necrosis factor (TNF) alpha inhibitory protein and was also named TIP-B1. Upregulation of expression of SH3BGRL3 is associated with differentiation. It has been suggested that it functions as a regulator of differentiation-related signal transduction pathways. Probab=49.21 E-value=19 Score=16.12 Aligned_cols=60 Identities=10% Similarity=-0.023 Sum_probs=34.4 Q ss_pred CCCCCCCCH------HHHHHHHHHHHCCCCEEEEECCCCCCC-HHHHHHCCC----CCCCHHHHCCHHHHCC Q ss_conf 213589986------899999999986998299988867778-899985568----7752011100011101 Q gi|254780594|r 3 ILYHYPLSS------SSRFIRLILSEYEFTTDMIEEFPWGKR-REFLELNPS----GTLPVYIDEHMQALCG 63 (230) Q Consensus 3 ~LY~~~~sp------~~~kvr~~L~e~gi~y~~~~~~~~~~~-~~~~~lnP~----g~vP~L~~~dg~~l~e 63 (230) ++|....|. -.++|..+|.-++|+|+.+++...+.. .+..+..|. ...|-+.- ++..+.+ T Consensus 3 kVy~ss~sg~~~iKk~Qq~v~~iL~~~kI~yeeiDIs~~ee~r~~Mr~~~~~~~~~~lpPQiF~-~~~y~Gd 73 (92) T cd03030 3 KVYIASSSGSTEIKKRQQEVLGFLEAKKIEFEEVDISMNEENRQWMRENVPNENGKPLPPQIFN-GDEYCGD 73 (92) T ss_pred EEEEECCCCCHHHHHHHHHHHHHHHHCCCCCEEEECCCCHHHHHHHHHHCCCCCCCCCCCEEEC-CCEECCC T ss_conf 9999226277899999999999999869983898466897899999997656689989986724-9853167 No 153 >pfam04908 SH3BGR SH3-binding, glutamic acid-rich protein. Probab=45.57 E-value=22 Score=15.79 Aligned_cols=47 Identities=19% Similarity=0.229 Sum_probs=28.2 Q ss_pred HHHHHHHHHHCCCCEEEEECCCCCCCHHH-HHHCCCC--CCCHHHHCCHHH Q ss_conf 99999999986998299988867778899-9855687--752011100011 Q gi|254780594|r 13 SRFIRLILSEYEFTTDMIEEFPWGKRREF-LELNPSG--TLPVYIDEHMQA 60 (230) Q Consensus 13 ~~kvr~~L~e~gi~y~~~~~~~~~~~~~~-~~lnP~g--~vP~L~~~dg~~ 60 (230) -++|..+|.-++|+|+.+++...+...++ .+..+.| ..|-+. .|+.. T Consensus 20 Qq~v~~iL~s~kI~ye~vDIt~~ee~r~~Mr~~~~~~~~lpPQIF-n~d~Y 69 (92) T pfam04908 20 QQRVLMILDANKIPFDEVDITKDEEQRRWMRENPPNGAPLPPQIF-NEDQY 69 (92) T ss_pred HHHHHHHHHHCCCCCEEEECCCCHHHHHHHHHHCCCCCCCCCEEE-CCCEE T ss_conf 999999999859973898555898999999984699998899072-19863 No 154 >TIGR01162 purE phosphoribosylaminoimidazole carboxylase, catalytic subunit; InterPro: IPR000031 Phosphoribosylaminoimidazole carboxylase is a fusion protein in plants and fungi, but consists of two non-interacting proteins in bacteria, PurK and PurE. PurK, N5-carboxyaminoimidazole ribonucleotide (N5_CAIR) synthetase, catalyzes the conversion of 5-aminoimidazole ribonucleotide (AIR), ATP, and bicarbonate to N5-CAIR, ADP, and Pi. PurE converts N5-CAIR to CAIR, the sixth step of de novo purine biosynthesis. In the presence of high concentrations of bicarbonate, PurE is reported able to convert AIR to CAIR directly and without ATP. Some members of this family contain two copies of this domain . The crystal structure of PurE indicates a unique quaternary structure that confirms the octameric nature of the enzyme .; GO: 0004638 phosphoribosylaminoimidazole carboxylase activity, 0006189 'de novo' IMP biosynthetic process, 0009320 phosphoribosylaminoimidazole carboxylase complex. Probab=45.45 E-value=22 Score=15.78 Aligned_cols=32 Identities=13% Similarity=0.149 Sum_probs=25.5 Q ss_pred CHHHHHHHHHHHHCCCCEEEEECCCCCCCHHHH Q ss_conf 868999999999869982999888677788999 Q gi|254780594|r 10 SSSSRFIRLILSEYEFTTDMIEEFPWGKRREFL 42 (230) Q Consensus 10 sp~~~kvr~~L~e~gi~y~~~~~~~~~~~~~~~ 42 (230) -+...++--.|++.||+||..-+.. +|+|+.. T Consensus 11 ~~~m~~a~~~L~~fgi~~e~~V~SA-HRTP~~~ 42 (159) T TIGR01162 11 LETMKKAAEILEEFGIPYELRVVSA-HRTPELM 42 (159) T ss_pred HHHHHHHHHHHHHCCCCEEEEEECC-CCCHHHH T ss_conf 7899999999985599667898606-7780889 No 155 >TIGR00679 hpr-ser HPr(Ser) kinase/phosphatase; InterPro: IPR003755 HPr(Ser) kinase is the sensor in a multicomponent phosphorelay system that controls catabolite repression, sugar transport and carbon metabolism in Gram-positive bacteria. These are all phosphorylation-dependent carbon control mechanisms. This kinase in unusual in that it recognises the tertiary structure of its target protein, HPr, a phosphocarrier protein of the bacterial phosphotransferase system and a transcriptional cofactor controlling the phenomenon of catabolite repression .; GO: 0000155 two-component sensor activity, 0004672 protein kinase activity, 0005524 ATP binding, 0000160 two-component signal transduction system (phosphorelay), 0006109 regulation of carbohydrate metabolic process. Probab=31.15 E-value=37 Score=14.44 Aligned_cols=47 Identities=15% Similarity=0.340 Sum_probs=35.6 Q ss_pred ECCCCCCCHHHHHHCCCCCCCHHHHCCHHHHCC-CCHHHHHHHHHCCCCC Q ss_conf 888677788999855687752011100011101-2103567775288723 Q gi|254780594|r 31 EEFPWGKRREFLELNPSGTLPVYIDEHMQALCG-VFVISEYLDETYGAIA 79 (230) Q Consensus 31 ~~~~~~~~~~~~~lnP~g~vP~L~~~dg~~l~e-S~aI~~yL~~~~~~~~ 79 (230) ...-...+..++++|-...||+|+. +-.. +| |..+..||+++|.+.. T Consensus 90 ls~~~~~P~~ll~~~~~~~vPiL~t-~L~s-teLS~~l~~~l~~~fAp~~ 137 (309) T TIGR00679 90 LSKSFTDPEVLLDVNETYQVPILKT-RLFS-TELSFRLETYLNEQFAPTT 137 (309) T ss_pred EECCCCCHHHHHHHHCCCCCCEEEC-CCCC-HHHHHHHHHHHHHHHHHHH T ss_conf 8458897679998422378876624-6641-3677899999863111124 No 156 >cd04911 ACT_AKiii-YclM-BS_1 ACT domains located C-terminal to the catalytic domain of the lysine plus threonine-sensitive aspartokinase isoenzyme AKIII. This CD includes the first of two ACT domains located C-terminal to the catalytic domain of the lysine plus threonine-sensitive aspartokinase isoenzyme AKIII, a monofunctional class enzyme found in Bacilli (Bacillus subtilis (BS) YclM) and Clostridia species. Aspartokinase is the first enzyme in the aspartate metabolic pathway and catalyzes the conversion of aspartate and ATP to aspartylphosphate and ADP. Bacillus subtilis YclM is reported to be a single polypeptide of 50 kD. AKIII from Bacillus subtilis strain 168 is induced by lysine and repressed by threonine and it is synergistically inhibited by lysine and threonine. Members of this CD belong to the superfamily of ACT regulatory domains. Probab=30.96 E-value=37 Score=14.42 Aligned_cols=23 Identities=13% Similarity=0.083 Sum_probs=19.9 Q ss_pred HHHHHHHHHHHHCCCCEEEEECC Q ss_conf 68999999999869982999888 Q gi|254780594|r 11 SSSRFIRLILSEYEFTTDMIEEF 33 (230) Q Consensus 11 p~~~kvr~~L~e~gi~y~~~~~~ 33 (230) .|+||+.-+|+..||+||..+.. T Consensus 16 GF~rKvL~IlE~~~IS~EHmPSG 38 (76) T cd04911 16 GFGRKLLSILEDNGISYEHMPSG 38 (76) T ss_pred HHHHHHHHHHHHCCCCEEECCCC T ss_conf 48999999999839986455888 No 157 >TIGR02661 MauD methylamine dehydrogenase accessory protein MauD; InterPro: IPR013478 MauD appears critical to proper formation of the small subunit of methylamine dehydrogenase, which has both an unusual tryptophan tryptophylquinone cofactor and multiple disulphide bonds. MauD shares sequence similarity, including a CPxC motif, with a number of thiol:disulphide interchange proteins. In MauD mutants, the small subunit apparently does not form properly and is rapidly degraded .; GO: 0030416 methylamine metabolic process. Probab=30.82 E-value=38 Score=14.41 Aligned_cols=12 Identities=8% Similarity=0.193 Sum_probs=6.1 Q ss_pred CCCCCCHHHHHH Q ss_conf 358998689999 Q gi|254780594|r 5 YHYPLSSSSRFI 16 (230) Q Consensus 5 Y~~~~sp~~~kv 16 (230) |..|.||-|.|. T Consensus 81 FTaPsCPvC~KL 92 (189) T TIGR02661 81 FTAPSCPVCDKL 92 (189) T ss_pred ECCCCCCHHHHH T ss_conf 438888532243 No 158 >cd03026 AhpF_NTD_C TRX-GRX-like family, Alkyl hydroperoxide reductase F subunit (AhpF) N-terminal domain (NTD) subfamily, C-terminal TRX-fold subdomain; AhpF is a homodimeric flavoenzyme which catalyzes the NADH-dependent reduction of the peroxiredoxin AhpC, which then reduces hydrogen peroxide and organic hydroperoxides. AhpF contains an NTD containing two contiguous TRX-fold subdomains similar to Pyrococcus furiosus protein disulfide oxidoreductase (PfPDO). It also contains a catalytic core similar to TRX reductase containing FAD and NADH binding domains with an active site disulfide. The proposed mechanism of action of AhpF is similar to a TRX/TRX reductase system. The flow of reducing equivalents goes from NADH - catalytic core of AhpF - NTD of AhpF - AhpC - peroxide substrates. The catalytic CXXC motif of the NTD of AhpF is contained in its C-terminal TRX subdomain. Probab=29.60 E-value=28 Score=15.10 Aligned_cols=67 Identities=13% Similarity=0.007 Sum_probs=36.7 Q ss_pred CCCCCCCCHHHHHHHHHHHH-----CCCCEEEEECCCCCCCHHHHHHCCCCCCCHHHHCCHHHHCCCCHHHHHHH Q ss_conf 21358998689999999998-----69982999888677788999855687752011100011101210356777 Q gi|254780594|r 3 ILYHYPLSSSSRFIRLILSE-----YEFTTDMIEEFPWGKRREFLELNPSGTLPVYIDEHMQALCGVFVISEYLD 72 (230) Q Consensus 3 ~LY~~~~sp~~~kvr~~L~e-----~gi~y~~~~~~~~~~~~~~~~lnP~g~vP~L~~~dg~~l~eS~aI~~yL~ 72 (230) ..|-.+.||+|-.|-.+++- -+|..+.++.. .- ++..+.----.||+++.+|...-.+...|.+.|+ T Consensus 17 ~vfVSlsC~~CP~vV~a~~~~A~~Np~I~~~~iD~~--~f-~d~~~~~~ImsVP~vviN~e~~~~G~~~~~ell~ 88 (89) T cd03026 17 ETYVSLSCHNCPDVVQALNLMAVLNPNIEHEMIDGA--LF-QDEVEERGIMSVPAIFLNGELFGFGRMTLEEILA 88 (89) T ss_pred EEEECCCCCCCHHHHHHHHHHHHHCCCCEEEEEEHH--HC-HHHHHHCCCCCCCEEEECCEEEEECCCCHHHHHC T ss_conf 999828999978999999999986998169999806--47-8899975964489799999677406515998942 No 159 >cd02999 PDI_a_ERp44_like PDIa family, endoplasmic reticulum protein 44 (ERp44)-like subfamily; composed of uncharacterized PDI-like eukaryotic proteins containing only one redox active TRX (a) domain with a CXXS motif, similar to ERp44. CXXS is still a redox active motif; however, the mixed disulfide formed with the substrate is more stable than those formed by CXXC motif proteins. PDI-related proteins are usually involved in the oxidative protein folding in the ER by acting as catalysts and folding assistants. ERp44 is involved in thiol-mediated retention in the ER. Probab=28.38 E-value=34 Score=14.65 Aligned_cols=57 Identities=9% Similarity=-0.030 Sum_probs=30.3 Q ss_pred CCCCCCCCCCHHHHHHHHHHHHCCCCEE---EEECCCCCCCHHHHHHCCCCCCCHHHHCC Q ss_conf 9521358998689999999998699829---99888677788999855687752011100 Q gi|254780594|r 1 MSILYHYPLSSSSRFIRLILSEYEFTTD---MIEEFPWGKRREFLELNPSGTLPVYIDEH 57 (230) Q Consensus 1 M~~LY~~~~sp~~~kvr~~L~e~gi~y~---~~~~~~~~~~~~~~~lnP~g~vP~L~~~d 57 (230) +.++|+.|+|++|.+..=..++..-.+. ...++-....+++.+.---..+|+|..-+ T Consensus 21 vlV~FYApWC~hCk~l~P~~~~la~~~~~~~~~a~~~~~~~~~l~~~ygV~g~PTi~lf~ 80 (100) T cd02999 21 TAVLFYASWCPFSASFRPHFNALSSMFPQIRHLAIEESSIKPSLLSRYGVVGFPTILLFN 80 (100) T ss_pred EEEEEECCCCHHHHHHHHHHHHHHHHCCCCCEEEEEECCCCHHHHHHCCCCCCCEEEEEC T ss_conf 999999788888898718999999878998489999545786678746754365799967 No 160 >KOG3425 consensus Probab=28.23 E-value=42 Score=14.14 Aligned_cols=66 Identities=12% Similarity=0.036 Sum_probs=33.5 Q ss_pred CCCCHHHHHHH----HHHHHCCCCEEEEECCCCCCCHHHHHHCCCC-------CCCHHHHCC--HHHHCCCCHHHHHHH Q ss_conf 89986899999----9999869982999888677788999855687-------752011100--011101210356777 Q gi|254780594|r 7 YPLSSSSRFIR----LILSEYEFTTDMIEEFPWGKRREFLELNPSG-------TLPVYIDEH--MQALCGVFVISEYLD 72 (230) Q Consensus 7 ~~~sp~~~kvr----~~L~e~gi~y~~~~~~~~~~~~~~~~lnP~g-------~vP~L~~~d--g~~l~eS~aI~~yL~ 72 (230) -+.||.|.++. =+|.+..-+..++.++.+++.-+--.-||.. -||+|.-=+ +.-+.+...-...|. T Consensus 42 qSWCPdCV~AEPvi~~alk~ap~~~~~v~v~VG~rp~Wk~p~n~FR~d~~~lt~vPTLlrw~~~~~rL~~~q~~~~~Lv 120 (128) T KOG3425 42 QSWCPDCVAAEPVINEALKHAPEDVHFVHVYVGNRPYWKDPANPFRKDPGILTAVPTLLRWKRQPQRLDGLQCLNDHLV 120 (128) T ss_pred CCCCCHHHHHHHHHHHHHHHCCCCEEEEEEEECCCCCCCCCCCCCCCCCCCEEECCEEEEECCCCCCCHHHHHHHHHHH T ss_conf 8378218776389999997387761799997068774148998543498741403343687375433137676278888 No 161 >COG2900 SlyX Uncharacterized protein conserved in bacteria [Function unknown] Probab=27.68 E-value=41 Score=14.17 Aligned_cols=30 Identities=20% Similarity=0.255 Sum_probs=16.9 Q ss_pred CHHHHHHHHHHHCC-HHHHHHHHHHHCCCCCCCCC Q ss_conf 76889999999709-98898413332189984324 Q gi|254780594|r 192 APIVKEWYQRMKSR-PSFRPLLSERIRGLLPVSHY 225 (230) Q Consensus 192 ~p~l~~w~~r~~~r-ps~~~~l~e~~~~~~p~~~~ 225 (230) .-.+..-.+|+.++ |+.. +....-+||||| T Consensus 42 q~qlr~L~~kl~~~~~~~~----~~~~~etpPPHY 72 (72) T COG2900 42 QAQLRLLTEKLKDLQPSAI----ASPAEETPPPHY 72 (72) T ss_pred HHHHHHHHHHHHHHCCCCC----CCCCCCCCCCCC T ss_conf 9999999999985132015----787666898888 No 162 >TIGR01967 DEAH_box_HrpA ATP-dependent helicase HrpA; InterPro: IPR010222 This entry represents HrpA, one of two related but uncharacterised DEAH-box ATP-dependent helicases in many Proteobacteria and a few high-GC Gram-positive bacteria. HrpA is about 1300 amino acids long, while its paralog HrpB, also uncharacterised, is about 800 amino acids long. Related characterised eukaryotic proteins are RNA helicases associated with pre-mRNA processing.; GO: 0005524 ATP binding, 0008026 ATP-dependent helicase activity. Probab=24.02 E-value=50 Score=13.69 Aligned_cols=56 Identities=9% Similarity=0.105 Sum_probs=20.2 Q ss_pred HHHHHHHHHCCCCCCCCCCCCHHHHHH-HHHHHHHHHCCCCCHHHCHHHHHHHHHHHC Q ss_conf 985565453479842667502899999-999999996189994567688999999970 Q gi|254780594|r 148 HIKYITWLIKSRSWIAGHHMSYADFAA-SATISILDYLGEIDWDTAPIVKEWYQRMKS 204 (230) Q Consensus 148 ~l~~le~~l~~~~fl~Gd~~s~ADi~~-~~~l~~~~~~~~~~~~~~p~l~~w~~r~~~ 204 (230) .+.-|.+.|+++-.+.- .+.+-||.. ...|.+-.+...-.+....+|-+|.+.|.- T Consensus 1185 ~~~~i~k~lKG~~~~~~-a~a~~DI~aQL~~Lv~~GFv~~~g~~rL~~lpRYLkA~~~ 1241 (1320) T TIGR01967 1185 LFANIKKRLKGKIDLTQ-ALALSDIKAQLDKLVYKGFVRETGAKRLKDLPRYLKAIEL 1241 (1320) T ss_pred HHHHHHHHHCCCCCHHH-HHHHHHHHHHHHHHHHCCHHHCCCCHHHHHHHHHHHHHHH T ss_conf 98756876069857779-9999999998500001001104753134655669999999 No 163 >cd02951 SoxW SoxW family; SoxW is a bacterial periplasmic TRX, containing a redox active CXXC motif, encoded by a genetic locus (sox operon) involved in thiosulfate oxidation. Sulfur bacteria oxidize sulfur compounds to provide reducing equivalents for carbon dioxide fixation during autotrophic growth and the respiratory electron transport chain. It is unclear what the role of SoxW is, since it has been found to be dispensable in the oxidation of thiosulfate to sulfate. SoxW is specifically kept in the reduced state by SoxV, which is essential in thiosulfate oxidation. Probab=24.00 E-value=35 Score=14.62 Aligned_cols=17 Identities=12% Similarity=0.286 Sum_probs=13.8 Q ss_pred CCCCCCCCCCHHHHHHH Q ss_conf 95213589986899999 Q gi|254780594|r 1 MSILYHYPLSSSSRFIR 17 (230) Q Consensus 1 M~~LY~~~~sp~~~kvr 17 (230) ++++++.+.||||.+.+ T Consensus 17 vlv~f~~~~C~~C~~m~ 33 (125) T cd02951 17 LLLLFSQPGCPYCDKLK 33 (125) T ss_pred EEEEEECCCCHHHHHHH T ss_conf 99999789897089999 No 164 >cd02960 AGR Anterior Gradient (AGR) family; members of this family are similar to secreted proteins encoded by the cement gland-specific genes XAG-1 and XAG-2, expressed in the anterior region of dorsal ectoderm of Xenopus. They are implicated in the formation of the cement gland and the induction of forebrain fate. The human homologs, hAG-2 and hAG-3, are secreted proteins associated with estrogen-positive breast tumors. Yeast two-hybrid studies identified the metastasis-associated C4.4a protein and dystroglycan as binding partners, indicating possible roles in the development and progression of breast cancer. hAG-2 has also been implicated in prostate cancer. Its gene was cloned as an androgen-inducible gene and it was shown to be overexpressed in prostate cancer cells at the mRNA and protein levels. AGR proteins contain one conserved cysteine corresponding to the first cysteine in the CXXC motif of TRX. They show high sequence similarity to ERp19. Probab=23.71 E-value=38 Score=14.37 Aligned_cols=80 Identities=16% Similarity=0.216 Sum_probs=42.1 Q ss_pred CCCCCCCCCHHHHHHHHHHHHC-CC---C-EEEEECCCCCCCHHHHHHCCCCC-CCHH--HHCCHHHHCCCCHHHHHHHH Q ss_conf 5213589986899999999986-99---8-29998886777889998556877-5201--11000111012103567775 Q gi|254780594|r 2 SILYHYPLSSSSRFIRLILSEY-EF---T-TDMIEEFPWGKRREFLELNPSGT-LPVY--IDEHMQALCGVFVISEYLDE 73 (230) Q Consensus 2 ~~LY~~~~sp~~~kvr~~L~e~-gi---~-y~~~~~~~~~~~~~~~~lnP~g~-vP~L--~~~dg~~l~eS~aI~~yL~~ 73 (230) +++.|-..||.|+...=.+++. .| . .++|.++..+.+.+ .+.+|-|. ||-+ +|.+|.+..|- + . T Consensus 27 MvIiH~~~C~~c~aLKk~Fa~s~eI~~la~e~FVMvNl~~E~~d-~~~~pDG~YvPRIlFlDp~g~V~~di--~-----~ 98 (130) T cd02960 27 MVIHHLEDCPHSQALKKAFAEHKEIQKLAQEDFIMLNLVHETTD-KNLSPDGQYVPRIMFVDPSLTVRADI--T-----G 98 (130) T ss_pred EEEEECCCCHHHHHHHHHHHCCHHHHHHHHCCEEEEECCCCCCC-CCCCCCCCEECCEEEECCCCCCCHHC--C-----C T ss_conf 99986023867899999860579899873557689965657874-34499986364389989998581310--2-----6 Q ss_pred HCCCCCCCCCCCCCCHHH Q ss_conf 288723343356799999 Q gi|254780594|r 74 TYGAIAQKNRLLSTDPLQ 91 (230) Q Consensus 74 ~~~~~~~~~~L~p~~~~~ 91 (230) +|+... -...|.|... T Consensus 99 ~y~n~~--Y~Y~~~d~~~ 114 (130) T cd02960 99 RYSNRL--YTYEPADIPL 114 (130) T ss_pred CCCCCC--EEECCCCHHH T ss_conf 778854--1453676999 No 165 >PRK11657 dsbG disulfide isomerase/thiol-disulfide oxidase; Provisional Probab=21.91 E-value=55 Score=13.44 Aligned_cols=16 Identities=19% Similarity=0.046 Sum_probs=7.1 Q ss_pred HHHHCCCCEEEEECCC Q ss_conf 9998699829998886 Q gi|254780594|r 19 ILSEYEFTTDMIEEFP 34 (230) Q Consensus 19 ~L~e~gi~y~~~~~~~ 34 (230) +|+..|+.+.-....+ T Consensus 31 ~l~~~g~~v~~~f~aP 46 (253) T PRK11657 31 ALEKQGITIIKSFDAP 46 (253) T ss_pred HHHHCCCEEEEEECCC T ss_conf 9985796899874489 No 166 >TIGR02630 xylose_isom_A xylose isomerase; InterPro: IPR013452 Xylose isomerase (5.3.1.5 from EC) is an enzyme which interconverts D-xylose and D-xylulose.; GO: 0009045 xylose isomerase activity. Probab=21.50 E-value=49 Score=13.76 Aligned_cols=13 Identities=8% Similarity=-0.176 Sum_probs=6.2 Q ss_pred CHHHHCCCCHHHH Q ss_conf 0011101210356 Q gi|254780594|r 57 HMQALCGVFVISE 69 (230) Q Consensus 57 dg~~l~eS~aI~~ 69 (230) +|..|-||.+++. T Consensus 106 EG~~L~E~~~nLd 118 (437) T TIGR02630 106 EGASLKETNANLD 118 (437) T ss_pred CCCCHHHHHHHHH T ss_conf 8877899999999 No 167 >TIGR01811 sdhA_Bsu succinate dehydrogenase or fumarate reductase, flavoprotein subunit; InterPro: IPR011280 This entry represents the succinate dehydrogenase flavoprotein subunit as found in the low-GC Gram-positive bacteria and a few other lineages. This enzyme may act in a complete or partial TCA cycle, or act in the opposite direction as fumarate reductase. In some but not all species, succinate dehydrogenase and fumarate reductase may be encoded as separate isozymes.. Probab=20.90 E-value=58 Score=13.32 Aligned_cols=29 Identities=24% Similarity=0.288 Sum_probs=23.1 Q ss_pred HHHHHHCCCCCCCCCCCCCCHHHHHHHHHHHH Q ss_conf 67775288723343356799999999999999 Q gi|254780594|r 69 EYLDETYGAIAQKNRLLSTDPLQRAETRRMIE 100 (230) Q Consensus 69 ~yL~~~~~~~~~~~~L~p~~~~~ra~~~~~~~ 100 (230) -|||++||.- +-|.|.|.+.|+-...+.+ T Consensus 292 YfLEr~YP~f---GNLVPRDiAsR~i~~~c~~ 320 (620) T TIGR01811 292 YFLERRYPAF---GNLVPRDIASRAIKEVCDA 320 (620) T ss_pred CHHHCCCCCC---CCCCCHHHHHHHHHHHHHH T ss_conf 1022007777---7886224777999998864 No 168 >PRK02119 hypothetical protein; Provisional Probab=20.70 E-value=53 Score=13.54 Aligned_cols=28 Identities=21% Similarity=0.398 Sum_probs=12.9 Q ss_pred HHHHHHHHHHCC-HHHHHHHHHHHCCCCCCCCC Q ss_conf 889999999709-98898413332189984324 Q gi|254780594|r 194 IVKEWYQRMKSR-PSFRPLLSERIRGLLPVSHY 225 (230) Q Consensus 194 ~l~~w~~r~~~r-ps~~~~l~e~~~~~~p~~~~ 225 (230) .++.-.+|+++- ||- ++.. .+-+||||| T Consensus 45 ql~~L~~rlk~~~~s~---~a~~-~eE~PPPHY 73 (73) T PRK02119 45 QLRHMANKLKDFQPSN---IASQ-AEETPPPHY 73 (73) T ss_pred HHHHHHHHHHHCCCCC---CCCC-CCCCCCCCC T ss_conf 9999999998526667---8987-779884787 No 169 >cd02998 PDI_a_ERp38 PDIa family, endoplasmic reticulum protein 38 (ERp38) subfamily; composed of proteins similar to the P5-like protein first isolated from alfalfa, which contains two redox active TRX (a) domains at the N-terminus, like human P5, and a C-terminal domain with homology to the C-terminal domain of ERp29, unlike human P5. The cDNA clone of this protein (named G1) was isolated from an alfalfa cDNA library by screening with human protein disulfide isomerase (PDI) cDNA. The G1 protein is constitutively expressed in all major organs of the plant and its expression is induced by treatment with tunicamycin, indicating that it may be a glucose-regulated protein. The G1 homolog in the eukaryotic social amoeba Dictyostelium discoideum is also described as a P5-like protein, which is located in the endoplasmic reticulum (ER) despite the absence of an ER-retrieval signal. G1 homologs from Aspergillus niger and Neurospora crassa have also been characterized, and are named TIGA and ER Probab=20.54 E-value=39 Score=14.29 Aligned_cols=54 Identities=4% Similarity=-0.047 Sum_probs=28.5 Q ss_pred CCCCCCCCCHHHHHHHHHHHHCC-----C-CEEEEECCCCCCCHHHHHHCCCCCCCHHHH Q ss_conf 52135899868999999999869-----9-829998886777889998556877520111 Q gi|254780594|r 2 SILYHYPLSSSSRFIRLILSEYE-----F-TTDMIEEFPWGKRREFLELNPSGTLPVYID 55 (230) Q Consensus 2 ~~LY~~~~sp~~~kvr~~L~e~g-----i-~y~~~~~~~~~~~~~~~~lnP~g~vP~L~~ 55 (230) .++|+.|.|++|++..=.+.+.. - .+.+..++-....+++.+..--..+|+++. T Consensus 22 lV~Fya~wC~~Ck~~~p~~~~la~~~~~~~~v~i~~vd~d~~~~~l~~~~~I~~~Pti~~ 81 (105) T cd02998 22 LVEFYAPWCGHCKNLAPEYEKLAAVFANEDDVVIAKVDADEANKDLAKKYGVSGFPTLKF 81 (105) T ss_pred EEEEECCCCCCCCCCCCHHHHHHHHHHCCCCCEEEECCCCCCCHHHHHHCCCCCCCEEEE T ss_conf 999958988887554508999999985687613761366743334553157777888999 Done!