Query psy12933
Match_columns 65
No_of_seqs 159 out of 1015
Neff 6.2
Searched_HMMs 46136
Date Fri Aug 16 23:28:27 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy12933.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/12933hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF13621 Cupin_8: Cupin-like d 99.8 1.1E-20 2.4E-25 123.5 5.3 56 8-65 187-245 (251)
2 KOG2132|consensus 99.2 4.5E-13 9.7E-18 95.6 -5.2 54 6-61 302-355 (355)
3 PF08007 Cupin_4: Cupin superf 99.0 1.3E-09 2.8E-14 76.3 5.3 35 29-63 173-207 (319)
4 KOG2130|consensus 98.9 1.3E-09 2.8E-14 78.3 5.0 44 20-63 250-294 (407)
5 PF02373 JmjC: JmjC domain, hy 98.9 6.4E-09 1.4E-13 61.6 5.3 37 27-63 76-112 (114)
6 KOG2131|consensus 98.8 1.6E-09 3.4E-14 78.7 2.0 38 28-65 264-301 (427)
7 COG2850 Uncharacterized conser 98.3 1.4E-07 3E-12 68.4 1.1 36 28-63 175-212 (383)
8 KOG2508|consensus 97.5 6.3E-05 1.4E-09 55.1 2.0 47 19-65 256-307 (437)
9 COG2140 Thermophilic glucose-6 97.2 0.0015 3.3E-08 44.4 5.7 29 29-57 123-151 (209)
10 KOG3706|consensus 97.0 0.00031 6.6E-09 53.3 1.5 28 29-56 378-405 (629)
11 PF06560 GPI: Glucose-6-phosph 97.0 0.0017 3.8E-08 43.2 4.9 36 30-65 108-144 (182)
12 PF00190 Cupin_1: Cupin; Inte 96.9 0.0033 7.3E-08 39.1 5.3 28 35-62 87-116 (144)
13 PRK04190 glucose-6-phosphate i 96.9 0.0041 8.8E-08 41.5 5.9 34 31-64 119-154 (191)
14 COG0662 {ManC} Mannose-6-phosp 96.8 0.0019 4.1E-08 39.9 3.6 27 31-57 74-100 (127)
15 PF07883 Cupin_2: Cupin domain 96.8 0.0036 7.9E-08 33.7 4.1 26 32-57 37-62 (71)
16 KOG2508|consensus 96.4 0.0071 1.5E-07 44.6 5.0 61 5-65 333-401 (437)
17 COG4101 Predicted mannose-6-ph 95.9 0.011 2.3E-07 37.9 3.3 29 28-56 84-112 (142)
18 KOG1633|consensus 95.8 0.01 2.2E-07 46.8 3.5 31 29-59 193-223 (776)
19 smart00835 Cupin_1 Cupin. This 95.7 0.035 7.6E-07 34.5 4.9 26 31-56 74-99 (146)
20 TIGR03404 bicupin_oxalic bicup 95.5 0.032 7E-07 40.3 4.8 26 31-56 288-313 (367)
21 KOG2132|consensus 95.5 0.024 5.2E-07 41.2 4.0 45 12-56 231-277 (355)
22 PRK13290 ectC L-ectoine syntha 95.4 0.05 1.1E-06 33.9 5.0 27 31-57 74-100 (125)
23 PF09313 DUF1971: Domain of un 95.2 0.057 1.2E-06 31.7 4.4 29 29-57 47-75 (82)
24 COG1917 Uncharacterized conser 95.1 0.051 1.1E-06 33.1 4.3 28 31-58 81-108 (131)
25 PF05523 FdtA: WxcM-like, C-te 95.0 0.067 1.5E-06 33.4 4.6 26 32-57 76-102 (131)
26 TIGR01221 rmlC dTDP-4-dehydror 95.0 0.081 1.7E-06 34.9 5.2 44 15-59 82-125 (176)
27 TIGR03404 bicupin_oxalic bicup 94.6 0.11 2.3E-06 37.6 5.4 25 32-56 110-134 (367)
28 PF13759 2OG-FeII_Oxy_5: Putat 94.3 0.12 2.6E-06 30.3 4.3 36 28-63 62-100 (101)
29 PF05899 Cupin_3: Protein of u 94.2 0.03 6.5E-07 31.7 1.5 19 30-48 42-60 (74)
30 KOG2107|consensus 94.2 0.092 2E-06 35.0 4.0 30 27-56 111-140 (179)
31 PRK09943 DNA-binding transcrip 94.1 0.13 2.8E-06 33.2 4.5 27 31-57 145-171 (185)
32 PF03079 ARD: ARD/ARD' family; 93.9 0.13 2.7E-06 33.4 4.2 30 27-56 110-139 (157)
33 TIGR03037 anthran_nbaC 3-hydro 93.5 0.14 3E-06 33.6 3.8 30 32-61 70-99 (159)
34 COG3450 Predicted enzyme of th 93.5 0.084 1.8E-06 33.0 2.7 28 29-56 79-106 (116)
35 KOG1356|consensus 93.4 0.011 2.3E-07 47.1 -1.7 32 29-60 796-827 (889)
36 PF01050 MannoseP_isomer: Mann 93.4 0.15 3.2E-06 32.8 3.8 26 31-56 101-126 (151)
37 PF02311 AraC_binding: AraC-li 92.9 0.43 9.4E-06 27.5 5.0 27 31-57 40-66 (136)
38 COG3615 TehB Uncharacterized p 92.7 0.21 4.6E-06 30.5 3.6 33 31-64 61-95 (99)
39 COG1898 RfbC dTDP-4-dehydrorha 92.7 0.52 1.1E-05 31.2 5.6 47 15-64 82-130 (173)
40 PF00908 dTDP_sugar_isom: dTDP 92.5 0.44 9.5E-06 31.3 5.2 42 15-59 82-125 (176)
41 PRK13264 3-hydroxyanthranilate 92.2 0.27 5.8E-06 32.8 3.9 31 32-62 76-106 (177)
42 TIGR02466 conserved hypothetic 91.9 0.28 6E-06 32.9 3.7 32 33-64 163-199 (201)
43 PF13640 2OG-FeII_Oxy_3: 2OG-F 91.9 0.19 4.1E-06 28.8 2.6 30 36-65 66-99 (100)
44 TIGR03214 ura-cupin putative a 91.7 0.29 6.2E-06 33.7 3.7 27 31-57 217-243 (260)
45 PRK15457 ethanolamine utilizat 91.2 0.26 5.7E-06 34.1 3.1 24 31-54 192-215 (233)
46 TIGR03214 ura-cupin putative a 90.8 0.67 1.4E-05 31.9 4.8 24 33-56 99-122 (260)
47 PRK15131 mannose-6-phosphate i 90.8 0.25 5.4E-06 36.1 2.8 24 31-54 236-259 (389)
48 PF06249 EutQ: Ethanolamine ut 90.8 0.19 4.2E-06 32.7 2.0 18 31-48 112-129 (152)
49 TIGR02272 gentisate_1_2 gentis 90.7 0.45 9.7E-06 34.4 4.0 30 28-57 116-145 (335)
50 PF15138 Syncollin: Syncollin 90.5 0.35 7.5E-06 30.1 2.9 36 29-64 32-70 (112)
51 PLN00212 glutelin; Provisional 90.3 0.92 2E-05 34.4 5.5 33 29-61 390-422 (493)
52 PRK05467 Fe(II)-dependent oxyg 89.6 0.9 2E-05 31.1 4.6 34 31-64 140-175 (226)
53 TIGR01479 GMP_PMI mannose-1-ph 89.2 0.86 1.9E-05 33.7 4.6 27 31-57 414-440 (468)
54 TIGR00218 manA mannose-6-phosp 88.8 0.27 5.8E-06 34.3 1.6 22 31-52 150-171 (302)
55 PRK11171 hypothetical protein; 88.2 1.4 3E-05 30.3 4.9 26 31-56 100-125 (266)
56 PLN02288 mannose-6-phosphate i 87.9 0.46 1E-05 34.9 2.4 22 32-53 251-272 (394)
57 PRK13502 transcriptional activ 87.4 1.1 2.5E-05 29.9 3.9 26 31-56 55-80 (282)
58 TIGR03027 pepcterm_export puta 86.4 0.52 1.1E-05 30.1 1.8 18 32-49 148-165 (165)
59 PF12852 Cupin_6: Cupin 86.3 0.99 2.2E-05 28.7 3.1 25 32-56 55-79 (186)
60 COG4766 EutQ Ethanolamine util 85.9 0.66 1.4E-05 30.8 2.1 16 33-48 137-152 (176)
61 COG1482 ManA Phosphomannose is 85.8 0.77 1.7E-05 33.0 2.6 24 31-54 157-180 (312)
62 PRK11171 hypothetical protein; 84.8 1.5 3.3E-05 30.2 3.6 26 31-56 222-247 (266)
63 PRK13501 transcriptional activ 84.5 1.8 3.9E-05 29.2 3.8 27 30-56 54-80 (290)
64 PF07385 DUF1498: Protein of u 84.0 2.5 5.5E-05 29.2 4.4 28 30-57 152-179 (225)
65 PF05721 PhyH: Phytanoyl-CoA d 83.9 1.5 3.3E-05 27.0 3.1 26 29-54 177-202 (211)
66 PRK15460 cpsB mannose-1-phosph 83.8 2.6 5.6E-05 31.7 4.6 26 32-57 424-449 (478)
67 COG4297 Uncharacterized protei 82.8 1.8 4E-05 28.3 3.1 33 25-57 78-110 (163)
68 smart00702 P4Hc Prolyl 4-hydro 82.3 2.9 6.3E-05 26.3 3.9 43 23-65 130-177 (178)
69 PRK13500 transcriptional activ 81.4 3 6.5E-05 28.8 4.0 27 31-57 85-111 (312)
70 PRK10296 DNA-binding transcrip 79.7 2.5 5.4E-05 28.2 3.1 24 31-54 60-83 (278)
71 PF12973 Cupin_7: ChrR Cupin-l 79.6 2.8 6.1E-05 24.0 2.9 25 35-59 61-85 (91)
72 COG1791 Uncharacterized conser 79.6 3.6 7.8E-05 27.6 3.7 29 28-56 114-142 (181)
73 TIGR03028 EpsE polysaccharide 79.2 1.3 2.7E-05 29.9 1.5 15 35-49 225-239 (239)
74 PF13532 2OG-FeII_Oxy_2: 2OG-F 78.9 2.3 5E-05 26.9 2.6 29 28-57 144-176 (194)
75 COG5285 Protein involved in bi 77.5 3.9 8.5E-05 29.4 3.6 31 26-56 187-217 (299)
76 PF01238 PMI_typeI: Phosphoman 74.9 2.4 5.2E-05 30.6 2.1 23 32-54 250-272 (373)
77 PF06052 3-HAO: 3-hydroxyanthr 74.7 10 0.00022 24.8 4.7 33 29-61 72-104 (151)
78 PRK15401 alpha-ketoglutarate-d 73.2 8 0.00017 26.3 4.2 36 28-63 163-210 (213)
79 PRK13503 transcriptional activ 73.0 3.2 7E-05 27.5 2.2 25 32-56 53-77 (278)
80 TIGR02297 HpaA 4-hydroxyphenyl 73.0 6.7 0.00014 26.1 3.7 26 31-56 61-86 (287)
81 PLN00212 glutelin; Provisional 71.9 9.5 0.00021 29.0 4.7 32 26-57 142-173 (493)
82 PRK12335 tellurite resistance 68.1 17 0.00036 24.8 4.9 31 34-64 60-92 (287)
83 PF05995 CDO_I: Cysteine dioxy 67.3 27 0.00059 22.5 5.6 33 32-64 127-162 (175)
84 COG3822 ABC-type sugar transpo 64.2 15 0.00033 25.3 4.0 28 29-56 150-177 (225)
85 PRK10371 DNA-binding transcrip 62.8 7.1 0.00015 26.9 2.3 27 30-56 62-88 (302)
86 PF12851 Tet_JBP: Oxygenase do 62.2 22 0.00048 23.0 4.4 36 30-65 126-169 (171)
87 PF06251 Caps_synth_GfcC: Caps 62.2 6 0.00013 26.5 1.8 13 34-46 191-203 (229)
88 COG3435 Gentisate 1,2-dioxygen 61.4 5.1 0.00011 29.3 1.4 31 26-56 125-155 (351)
89 cd02786 MopB_CT_3 The MopB_CT_ 59.7 6.7 0.00014 22.9 1.5 12 40-51 73-84 (116)
90 PRK10572 DNA-binding transcrip 57.5 15 0.00032 24.6 3.1 25 31-55 66-90 (290)
91 TIGR02408 ectoine_ThpD ectoine 56.8 8.3 0.00018 26.4 1.8 25 31-55 210-234 (277)
92 TIGR01762 chlorin-enz chlorina 55.7 15 0.00032 25.6 2.9 27 29-55 206-232 (288)
93 COG3128 PiuC Uncharacterized i 54.9 25 0.00055 24.2 3.8 34 31-64 143-178 (229)
94 PF01476 LysM: LysM domain; I 53.5 6.3 0.00014 19.1 0.6 10 36-45 35-44 (44)
95 TIGR02272 gentisate_1_2 gentis 52.8 36 0.00078 24.7 4.6 27 32-58 288-314 (335)
96 PRK09685 DNA-binding transcrip 52.7 33 0.00071 23.0 4.2 26 31-56 88-113 (302)
97 cd02782 MopB_CT_1 The MopB_CT_ 52.5 8.5 0.00019 23.0 1.2 13 40-52 75-87 (129)
98 KOG0162|consensus 52.4 11 0.00023 30.9 1.9 23 29-51 1063-1089(1106)
99 cd02794 MopB_CT_DmsA-EC The Mo 51.8 9.1 0.0002 22.7 1.2 13 40-52 72-84 (121)
100 PF14604 SH3_9: Variant SH3 do 51.6 6.6 0.00014 20.2 0.5 19 33-51 12-34 (49)
101 PRK15078 polysaccharide export 51.5 9.9 0.00022 27.6 1.6 26 31-56 236-261 (379)
102 PF04970 LRAT: Lecithin retino 50.6 6.7 0.00014 23.6 0.5 23 34-56 5-28 (125)
103 KOG2757|consensus 50.2 22 0.00048 26.6 3.2 27 31-57 371-397 (411)
104 PF02080 TrkA_C: TrkA-C domain 49.9 8.5 0.00018 20.4 0.8 12 33-44 45-56 (71)
105 PF15340 COPR5: Cooperator of 49.7 7 0.00015 25.4 0.5 31 26-58 106-136 (152)
106 COG3837 Uncharacterized conser 49.6 38 0.00082 22.4 3.9 27 31-57 81-109 (161)
107 TIGR02899 spore_safA spore coa 48.9 11 0.00025 17.7 1.1 12 35-46 33-44 (44)
108 PF10014 2OG-Fe_Oxy_2: 2OG-Fe 48.2 15 0.00032 24.2 1.9 30 27-56 148-178 (195)
109 COG3257 GlxB Uncharacterized p 46.7 42 0.00091 23.7 4.0 26 31-56 100-125 (264)
110 PRK15175 Vi polysaccharide exp 46.4 14 0.0003 26.9 1.6 16 32-47 222-237 (355)
111 KOG2757|consensus 45.1 21 0.00045 26.8 2.4 24 33-56 249-272 (411)
112 PRK13498 chemoreceptor glutami 45.0 76 0.0016 20.7 4.8 34 30-64 6-39 (167)
113 cd02781 MopB_CT_Acetylene-hydr 44.6 15 0.00032 21.9 1.3 13 40-52 75-87 (130)
114 PF01427 Peptidase_M15: D-ala- 43.7 11 0.00023 25.2 0.7 16 40-55 170-185 (198)
115 PF00877 NLPC_P60: NlpC/P60 fa 43.2 13 0.00028 21.4 1.0 16 31-46 47-62 (105)
116 PF02041 Auxin_BP: Auxin bindi 42.8 40 0.00086 22.4 3.2 28 29-56 88-115 (167)
117 TIGR01015 hmgA homogentisate 1 42.1 33 0.00072 25.9 3.1 23 34-56 168-190 (429)
118 PRK05341 homogentisate 1,2-dio 42.1 34 0.00073 25.9 3.1 23 34-56 174-196 (438)
119 PRK00924 5-keto-4-deoxyuronate 41.9 24 0.00052 25.1 2.2 18 32-49 92-109 (276)
120 TIGR02451 anti_sig_ChrR anti-s 41.6 33 0.00071 22.9 2.8 28 35-62 164-193 (215)
121 TIGR02219 phage_NlpC_fam putat 41.6 15 0.00033 22.6 1.1 11 35-45 76-86 (134)
122 PF11543 UN_NPL4: Nuclear pore 41.5 15 0.00031 21.1 0.9 19 27-45 60-78 (80)
123 TIGR03805 beta_helix_1 paralle 41.3 38 0.00081 23.9 3.2 15 36-50 8-22 (314)
124 PF04209 HgmA: homogentisate 1 41.2 32 0.0007 25.8 2.9 24 34-57 166-189 (424)
125 cd02792 MopB_CT_Formate-Dh-Na- 40.8 23 0.00049 20.7 1.7 12 40-51 77-88 (122)
126 PF07653 SH3_2: Variant SH3 do 40.5 8.9 0.00019 19.9 -0.1 21 32-52 14-39 (55)
127 TIGR03505 FimV_core FimV N-ter 39.8 19 0.00041 20.5 1.2 13 35-47 40-52 (74)
128 PF11699 CENP-C_C: Mif2/CENP-C 39.7 78 0.0017 18.4 4.5 27 31-57 50-76 (85)
129 PF11142 DUF2917: Protein of u 39.6 32 0.0007 18.8 2.1 18 30-47 34-51 (63)
130 PLN02658 homogentisate 1,2-dio 39.6 39 0.00084 25.6 3.1 23 34-56 167-189 (435)
131 COG1935 Uncharacterized conser 39.6 18 0.00039 22.9 1.2 12 35-46 38-49 (122)
132 smart00306 HintN Hint (Hedgeho 38.7 21 0.00046 19.7 1.3 10 36-45 90-99 (100)
133 PF14578 GTP_EFTU_D4: Elongati 38.7 18 0.0004 21.1 1.1 10 35-44 72-81 (81)
134 cd02785 MopB_CT_4 The MopB_CT_ 38.2 20 0.00043 21.3 1.2 12 40-51 74-85 (124)
135 cd02793 MopB_CT_DMSOR-BSOR-TMA 36.7 24 0.00052 21.2 1.4 12 40-51 75-86 (129)
136 KOG3905|consensus 36.7 18 0.0004 27.1 1.0 11 38-48 302-312 (473)
137 COG2164 Uncharacterized conser 36.2 17 0.00036 22.9 0.6 16 34-49 63-78 (126)
138 TIGR00568 alkb DNA alkylation 35.6 58 0.0013 21.1 3.2 20 27-46 141-160 (169)
139 PF05118 Asp_Arg_Hydrox: Aspar 35.0 1.1E+02 0.0024 19.3 4.4 27 31-57 123-149 (163)
140 cd02791 MopB_CT_Nitrate-R-NapA 34.8 47 0.001 19.3 2.5 20 32-51 66-88 (122)
141 cd00118 LysM Lysin domain, fou 34.5 27 0.00058 15.6 1.1 12 34-45 35-46 (46)
142 PF02814 UreE_N: UreE urease a 34.1 27 0.00058 19.0 1.2 12 36-47 51-62 (65)
143 PF05708 DUF830: Orthopoxvirus 33.9 24 0.00051 21.6 1.1 9 36-44 2-10 (158)
144 cd00250 CAS_like Clavaminic ac 33.7 42 0.0009 22.4 2.3 15 30-44 218-232 (262)
145 PRK00794 flbT flagellar biosyn 33.7 42 0.00091 21.2 2.2 17 31-47 3-19 (132)
146 PF07228 SpoIIE: Stage II spor 33.2 37 0.00079 21.0 1.9 17 28-44 113-129 (193)
147 cd00508 MopB_CT_Fdh-Nap-like T 33.1 52 0.0011 18.9 2.4 12 40-51 77-88 (120)
148 PF05783 DLIC: Dynein light in 32.4 19 0.00041 27.2 0.5 14 35-48 273-286 (472)
149 PRK10178 D-alanyl-D-alanine di 32.2 16 0.00035 24.4 0.2 13 41-53 156-168 (184)
150 PRK10838 spr outer membrane li 31.6 31 0.00068 23.0 1.5 14 34-47 127-140 (190)
151 PRK06933 type III secretion sy 31.1 32 0.00069 24.6 1.5 14 35-48 175-188 (308)
152 COG2173 DdpX D-alanyl-D-alanin 30.8 16 0.00035 25.1 -0.1 24 31-54 175-198 (211)
153 cd00174 SH3 Src homology 3 dom 30.3 25 0.00054 16.9 0.6 23 32-55 14-40 (54)
154 PF02668 TauD: Taurine catabol 29.9 46 0.00099 21.5 2.0 25 29-53 218-243 (258)
155 cd02777 MopB_CT_DMSOR-like The 29.5 34 0.00074 20.3 1.2 12 40-51 76-87 (127)
156 PRK12791 flbT flagellar biosyn 29.4 53 0.0011 20.8 2.1 17 31-47 2-18 (131)
157 PRK13488 chemoreceptor glutami 29.2 1.6E+02 0.0035 18.9 4.8 30 33-64 3-34 (157)
158 PRK13494 chemoreceptor glutami 29.0 1.5E+02 0.0033 19.3 4.3 32 30-64 10-42 (163)
159 PF07378 FlbT: Flagellar prote 28.7 44 0.00095 20.9 1.7 15 33-47 2-16 (126)
160 PF03171 2OG-FeII_Oxy: 2OG-Fe( 28.5 96 0.0021 17.2 3.0 33 33-65 57-96 (98)
161 smart00326 SH3 Src homology 3 28.2 26 0.00057 16.9 0.5 22 30-51 15-40 (58)
162 PF00122 E1-E2_ATPase: E1-E2 A 28.1 27 0.00059 22.5 0.7 13 35-47 51-63 (230)
163 COG2255 RuvB Holliday junction 28.0 31 0.00067 25.2 1.0 14 32-45 97-110 (332)
164 PF05726 Pirin_C: Pirin C-term 28.0 1.2E+02 0.0026 17.6 3.5 30 32-61 2-33 (104)
165 PRK10202 ebgC cryptic beta-D-g 27.7 64 0.0014 20.4 2.3 22 32-53 106-127 (149)
166 PF04322 DUF473: Protein of un 27.6 37 0.00079 21.3 1.2 12 35-46 38-49 (119)
167 COG0490 Putative regulatory, l 27.5 33 0.00071 22.7 1.0 12 33-44 132-143 (162)
168 PRK13495 chemoreceptor glutami 27.2 1.8E+02 0.0039 18.8 4.7 31 32-64 2-34 (159)
169 COG3508 HmgA Homogentisate 1,2 26.9 1E+02 0.0022 23.3 3.5 25 33-57 165-189 (427)
170 cd02779 MopB_CT_Arsenite-Ox Th 26.8 41 0.00088 19.7 1.2 11 40-50 75-85 (115)
171 PF00018 SH3_1: SH3 domain; I 26.5 14 0.0003 18.6 -0.8 29 26-55 6-38 (48)
172 PRK14125 cell division suppres 26.3 40 0.00088 20.2 1.2 13 35-47 79-91 (103)
173 cd02780 MopB_CT_Tetrathionate_ 25.9 43 0.00094 20.4 1.3 12 40-51 72-83 (143)
174 cd02789 MopB_CT_FmdC-FwdD The 25.5 42 0.00091 19.7 1.1 10 40-49 73-82 (106)
175 TIGR02410 carnitine_TMLD trime 25.4 68 0.0015 22.9 2.4 15 30-44 311-325 (362)
176 PF10162 G8: G8 domain; Inter 25.1 64 0.0014 19.6 1.9 19 37-55 10-28 (125)
177 PF07591 PT-HINT: Pretoxin HIN 25.0 72 0.0016 19.6 2.1 22 35-56 76-97 (130)
178 cd02790 MopB_CT_Formate-Dh_H F 24.7 46 0.001 19.1 1.2 12 40-51 77-88 (116)
179 KOG0572|consensus 24.5 62 0.0013 24.7 2.0 25 29-56 217-241 (474)
180 PF02938 GAD: GAD domain; Int 24.5 39 0.00084 19.6 0.8 17 34-50 73-89 (95)
181 PF00856 SET: SET domain; Int 24.2 34 0.00074 19.7 0.6 13 35-47 9-21 (162)
182 PRK11198 LysM domain/BON super 24.1 48 0.001 20.8 1.3 11 36-46 137-147 (147)
183 PF14525 AraC_binding_2: AraC- 23.9 1.7E+02 0.0036 17.3 4.8 27 31-57 72-98 (172)
184 PF03737 Methyltransf_6: Demet 23.7 64 0.0014 20.4 1.8 10 34-43 137-146 (154)
185 PRK13491 chemoreceptor glutami 23.6 2.4E+02 0.0053 19.1 5.6 35 28-64 10-46 (199)
186 KOG2414|consensus 23.5 57 0.0012 25.0 1.7 19 28-48 424-442 (488)
187 PF08857 ParBc_2: Putative Par 23.4 1E+02 0.0022 20.1 2.7 25 22-46 36-60 (163)
188 PF11649 T4_neck-protein: Viru 23.4 49 0.0011 23.0 1.3 11 37-47 115-125 (230)
189 cd02784 MopB_CT_PHLH The MopB_ 23.3 43 0.00094 20.9 0.9 14 40-53 80-93 (137)
190 PF01426 BAH: BAH domain; Int 22.9 64 0.0014 18.5 1.6 15 34-48 1-15 (119)
191 PRK11479 hypothetical protein; 22.5 93 0.002 22.0 2.5 13 33-45 62-74 (274)
192 cd02775 MopB_CT Molybdopterin- 22.3 1.3E+02 0.0028 16.5 2.7 12 40-51 65-76 (101)
193 PF07915 PRKCSH: Glucosidase I 22.1 50 0.0011 18.1 0.9 16 39-54 1-16 (81)
194 KOG1633|consensus 22.0 48 0.001 26.8 1.1 31 25-55 541-571 (776)
195 TIGR00022 uncharacterized prot 21.8 1.4E+02 0.003 18.5 3.0 26 28-53 109-134 (142)
196 PF02563 Poly_export: Polysacc 21.7 55 0.0012 18.3 1.1 13 32-44 9-21 (82)
197 cd04721 BAH_plant_1 BAH, or Br 21.5 74 0.0016 19.7 1.7 11 34-44 6-16 (130)
198 PF13533 Biotin_lipoyl_2: Biot 21.4 85 0.0018 16.0 1.7 8 36-43 24-31 (50)
199 PF06525 SoxE: Sulfocyanin (So 21.4 50 0.0011 22.4 1.0 16 41-56 87-102 (196)
200 COG4340 Uncharacterized protei 21.3 97 0.0021 21.4 2.4 30 26-55 169-199 (226)
201 PF04074 DUF386: Domain of unk 21.2 1.5E+02 0.0032 18.5 3.1 25 29-53 110-134 (153)
202 TIGR02998 RraA_entero regulato 21.1 79 0.0017 20.6 1.8 11 33-43 135-145 (161)
203 PRK09372 ribonuclease activity 21.0 80 0.0017 20.4 1.8 10 34-43 136-145 (159)
204 PF13550 Phage-tail_3: Putativ 20.7 57 0.0012 19.7 1.1 14 34-47 138-151 (164)
205 cd02778 MopB_CT_Thiosulfate-R- 20.5 57 0.0012 19.0 1.0 12 40-51 72-83 (123)
206 PF01568 Molydop_binding: Moly 20.5 30 0.00065 19.7 -0.2 11 40-50 72-82 (110)
207 PF00568 WH1: WH1 domain; Int 20.4 2E+02 0.0043 16.9 5.3 37 27-63 58-94 (111)
208 PF08255 Leader_Trp: Trp-opero 20.1 53 0.0012 13.3 0.6 10 42-51 4-13 (14)
No 1
>PF13621 Cupin_8: Cupin-like domain; PDB: 3AL6_C 3AL5_C 2XUM_A 2Y0I_A 1MZE_A 3KCY_A 1MZF_A 1YCI_A 2ILM_A 1H2L_A ....
Probab=99.82 E-value=1.1e-20 Score=123.52 Aligned_cols=56 Identities=34% Similarity=0.634 Sum_probs=41.1
Q ss_pred eecccCCccccCccCCCCCCCCceEEEEECCCCEEEeCCCCeEEEEec--CC-eEEEEEeC
Q psy12933 8 ETSQSNSKEEAQSSAPHNQTPGHLIECTLNPGDMLYIPPKVWHHVRSL--ST-SISVSFWF 65 (65)
Q Consensus 8 ~s~~~~~~~d~~~~~~p~~~~~~~~~~~l~pGd~LyiP~~WwH~V~s~--~~-sisvn~w~ 65 (65)
+|.++.+..|. +.+|.++++++++++|+|||+||||+||||+|+++ ++ +||||+||
T Consensus 187 ~~~~d~~~~d~--~~~p~~~~~~~~~~~l~pGD~LfiP~gWwH~V~~~~~~~~sisvn~w~ 245 (251)
T PF13621_consen 187 FSWVDPDNPDL--ERFPKFRKAPPYEVVLEPGDVLFIPPGWWHQVENLSDDDLSISVNYWF 245 (251)
T ss_dssp BBSS-TTS--T--TT-CGGGG--EEEEEEETT-EEEE-TT-EEEEEESTTSSCEEEEEEEE
T ss_pred eeeeeccChhh--hhhhhhccCceeEEEECCCeEEEECCCCeEEEEEcCCCCeEEEEEEEe
Confidence 45555544444 56999999999999999999999999999999999 65 99999997
No 2
>KOG2132|consensus
Probab=99.16 E-value=4.5e-13 Score=95.60 Aligned_cols=54 Identities=35% Similarity=0.622 Sum_probs=46.5
Q ss_pred cceecccCCccccCccCCCCCCCCceEEEEECCCCEEEeCCCCeEEEEecCCeEEE
Q psy12933 6 KCETSQSNSKEEAQSSAPHNQTPGHLIECTLNPGDMLYIPPKVWHHVRSLSTSISV 61 (65)
Q Consensus 6 ~~~s~~~~~~~d~~~~~~p~~~~~~~~~~~l~pGd~LyiP~~WwH~V~s~~~sisv 61 (65)
.|+|++++...| .+.||++++++.++++|+|||+||+|..|||.|++++.++|+
T Consensus 302 ~~tsqvdvenPd--lk~fp~~~k~~~l~~lL~pGe~L~iP~kwwhyvrs~d~s~s~ 355 (355)
T KOG2132|consen 302 LETSQVDVENPD--LKAFPKFAKARFLDCLLEPGEALFIPPKWWHYVRSLDKSFSV 355 (355)
T ss_pred hcccccccCCCC--hhhhhHHHHHHHHHHhcCCchhccccHHHhhhhhhccccccC
Confidence 457777775544 456999999999999999999999999999999999988764
No 3
>PF08007 Cupin_4: Cupin superfamily protein; InterPro: IPR022777 This signature represents primarily the cupin fold found in JmjC transcription factors. The fold is also found in lysine-specific demethylase NO66.; PDB: 2XDV_A 1VRB_B 4DIQ_B.
Probab=98.95 E-value=1.3e-09 Score=76.34 Aligned_cols=35 Identities=31% Similarity=0.689 Sum_probs=26.6
Q ss_pred CceEEEEECCCCEEEeCCCCeEEEEecCCeEEEEE
Q psy12933 29 GHLIECTLNPGDMLYIPPKVWHHVRSLSTSISVSF 63 (65)
Q Consensus 29 ~~~~~~~l~pGd~LyiP~~WwH~V~s~~~sisvn~ 63 (65)
.+..+++|+|||+||||+||||++.+.+.|+++++
T Consensus 173 ~~~~~~~L~pGD~LYlPrG~~H~~~~~~~S~hltv 207 (319)
T PF08007_consen 173 EPVEEVVLEPGDVLYLPRGWWHQAVTTDPSLHLTV 207 (319)
T ss_dssp STSEEEEE-TT-EEEE-TT-EEEEEESS-EEEEEE
T ss_pred ceeEEEEECCCCEEEECCCccCCCCCCCCceEEEE
Confidence 56889999999999999999999999997766665
No 4
>KOG2130|consensus
Probab=98.94 E-value=1.3e-09 Score=78.30 Aligned_cols=44 Identities=25% Similarity=0.542 Sum_probs=37.5
Q ss_pred ccCCCCCCC-CceEEEEECCCCEEEeCCCCeEEEEecCCeEEEEE
Q psy12933 20 SSAPHNQTP-GHLIECTLNPGDMLYIPPKVWHHVRSLSTSISVSF 63 (65)
Q Consensus 20 ~~~~p~~~~-~~~~~~~l~pGd~LyiP~~WwH~V~s~~~sisvn~ 63 (65)
+.+.|.... -+++||+++|||++|+|.||||-|.+++++|+|.-
T Consensus 250 rt~~Pswp~E~kPIEc~q~pGEt~fVP~GWWHvVlNle~TIAiTq 294 (407)
T KOG2130|consen 250 RTQLPSWPDEYKPIECLQKPGETMFVPSGWWHVVLNLEPTIAITQ 294 (407)
T ss_pred cccCCCCccccCCceeeecCCceEEecCCeEEEEeccCceeeeee
Confidence 345566554 47999999999999999999999999999999863
No 5
>PF02373 JmjC: JmjC domain, hydroxylase; InterPro: IPR013129 Jumonji protein is required for neural tube formation in mice [].There is evidence of domain swapping within the jumonji family of transcription factors []. This domain is often associated with jmjN (see IPR003349 from INTERPRO) and belongs to the Cupin superfamily [].; PDB: 2YU2_A 2YU1_A 3AVR_A 3AVS_A 2OX0_B 2OQ6_B 2WWJ_A 2Q8D_A 3PDQ_A 2YBK_A ....
Probab=98.86 E-value=6.4e-09 Score=61.60 Aligned_cols=37 Identities=27% Similarity=0.542 Sum_probs=28.1
Q ss_pred CCCceEEEEECCCCEEEeCCCCeEEEEecCCeEEEEE
Q psy12933 27 TPGHLIECTLNPGDMLYIPPKVWHHVRSLSTSISVSF 63 (65)
Q Consensus 27 ~~~~~~~~~l~pGd~LyiP~~WwH~V~s~~~sisvn~ 63 (65)
+..+.++++++|||.+++|+|++|+|.+++.+++++.
T Consensus 76 ~gi~~~~~~Q~~Ge~V~i~pg~~H~v~n~g~~i~~a~ 112 (114)
T PF02373_consen 76 AGIPVYRFVQKPGEFVFIPPGAYHQVFNLGDNISEAV 112 (114)
T ss_dssp TTS--EEEEEETT-EEEE-TT-EEEEEESSSEEEEEE
T ss_pred cCcccccceECCCCEEEECCCceEEEEeCCceEEEEe
Confidence 3568999999999999999999999999998776654
No 6
>KOG2131|consensus
Probab=98.82 E-value=1.6e-09 Score=78.73 Aligned_cols=38 Identities=29% Similarity=0.580 Sum_probs=35.1
Q ss_pred CCceEEEEECCCCEEEeCCCCeEEEEecCCeEEEEEeC
Q psy12933 28 PGHLIECTLNPGDMLYIPPKVWHHVRSLSTSISVSFWF 65 (65)
Q Consensus 28 ~~~~~~~~l~pGd~LyiP~~WwH~V~s~~~sisvn~w~ 65 (65)
..+.++++++|||++|+|+||.|||-+++++||||.-|
T Consensus 264 ~~~~lei~Qepge~VFvPsGW~hQV~NL~dTISINHNW 301 (427)
T KOG2131|consen 264 RGPLLEIFQEPGETVFVPSGWHHQVLNLGDTISINHNW 301 (427)
T ss_pred ccchhhhhccCCceeeccCccccccccccceeeecccc
Confidence 45669999999999999999999999999999999866
No 7
>COG2850 Uncharacterized conserved protein [Function unknown]
Probab=98.35 E-value=1.4e-07 Score=68.45 Aligned_cols=36 Identities=39% Similarity=0.523 Sum_probs=30.9
Q ss_pred CCceEEEEECCCCEEEeCCCCeEEEEecCC--eEEEEE
Q psy12933 28 PGHLIECTLNPGDMLYIPPKVWHHVRSLST--SISVSF 63 (65)
Q Consensus 28 ~~~~~~~~l~pGd~LyiP~~WwH~V~s~~~--sisvn~ 63 (65)
....++.+|+|||+||||++|||+..+.+. ++||.+
T Consensus 175 f~~~~d~vlepGDiLYiPp~~~H~gvae~dc~tySvG~ 212 (383)
T COG2850 175 FEPDIDEVLEPGDILYIPPGFPHYGVAEDDCMTYSVGF 212 (383)
T ss_pred CCchhhhhcCCCceeecCCCCCcCCcccccccceeeec
Confidence 456788999999999999999999999974 566655
No 8
>KOG2508|consensus
Probab=97.47 E-value=6.3e-05 Score=55.15 Aligned_cols=47 Identities=21% Similarity=0.388 Sum_probs=41.0
Q ss_pred CccCCCCCCC-CceEEEEECCCCEEEeCCCCeEEEEecC----CeEEEEEeC
Q psy12933 19 QSSAPHNQTP-GHLIECTLNPGDMLYIPPKVWHHVRSLS----TSISVSFWF 65 (65)
Q Consensus 19 ~~~~~p~~~~-~~~~~~~l~pGd~LyiP~~WwH~V~s~~----~sisvn~w~ 65 (65)
.++++|.+.. ..+..+.+.+|+++|.|..|.|+|-..+ .+|+||+|+
T Consensus 256 ~~~k~~l~~~~~k~~~~rv~a~~i~~lp~~w~h~~sq~~gd~~~~Iavn~~~ 307 (437)
T KOG2508|consen 256 PQEKIPLLEAEYKPEMDRVLAGQILYLPEDWFHHVSQVGGDPESDIAVNEWY 307 (437)
T ss_pred chhhCccccccccceeeeeeccccccChhhhhhhccccCCCccceeEEeeec
Confidence 4556777765 7899999999999999999999999984 589999996
No 9
>COG2140 Thermophilic glucose-6-phosphate isomerase and related metalloenzymes [Carbohydrate transport and metabolism / General function prediction only]
Probab=97.15 E-value=0.0015 Score=44.43 Aligned_cols=29 Identities=24% Similarity=0.465 Sum_probs=26.0
Q ss_pred CceEEEEECCCCEEEeCCCCeEEEEecCC
Q psy12933 29 GHLIECTLNPGDMLYIPPKVWHHVRSLST 57 (65)
Q Consensus 29 ~~~~~~~l~pGd~LyiP~~WwH~V~s~~~ 57 (65)
.....+.+++||++|||++|-|.+.|.+.
T Consensus 123 G~~~v~~~~~Gd~iyVPp~~gH~t~N~Gd 151 (209)
T COG2140 123 GEARVIAVRAGDVIYVPPGYGHYTINTGD 151 (209)
T ss_pred CcEEEEEecCCcEEEeCCCcceEeecCCC
Confidence 34788899999999999999999999973
No 10
>KOG3706|consensus
Probab=97.01 E-value=0.00031 Score=53.28 Aligned_cols=28 Identities=32% Similarity=0.608 Sum_probs=25.5
Q ss_pred CceEEEEECCCCEEEeCCCCeEEEEecC
Q psy12933 29 GHLIECTLNPGDMLYIPPKVWHHVRSLS 56 (65)
Q Consensus 29 ~~~~~~~l~pGd~LyiP~~WwH~V~s~~ 56 (65)
.+.++.+|+|||+||+|.|.-||.+.-+
T Consensus 378 ePV~e~vle~GDllYfPRG~IHQA~t~~ 405 (629)
T KOG3706|consen 378 EPVHEFVLEPGDLLYFPRGTIHQADTPA 405 (629)
T ss_pred CchHHhhcCCCcEEEecCcceeeccccc
Confidence 5789999999999999999999998765
No 11
>PF06560 GPI: Glucose-6-phosphate isomerase (GPI); InterPro: IPR010551 This entry consists of several bacterial and archaeal glucose-6-phosphate isomerase (GPI) proteins (5.3.1.9 from EC), which are involved in glycolysis and in gluconeogenesis and catalyse the conversion of D-glucose 6-phosphate to D-fructose 6-phosphate. The deduced amino acid sequence of the first archaeal PGI isolated from Pyrococcus furiosus revealed that it is not related to its eukaryotic and many of its bacterial counterparts. In contrast, this archaeal PGI shares similarity with the cupin superfamily that consists of a variety of proteins that are generally involved in sugar metabolism in both prokaryotes and eukaryotes [].; GO: 0004347 glucose-6-phosphate isomerase activity, 0006094 gluconeogenesis, 0006096 glycolysis, 0005737 cytoplasm; PDB: 1J3Q_B 1J3R_B 1J3P_A 2GC0_A 1X8E_A 1X82_A 1QY4_B 2GC2_B 1QXJ_A 1QXR_B ....
Probab=97.01 E-value=0.0017 Score=43.19 Aligned_cols=36 Identities=28% Similarity=0.637 Sum_probs=24.6
Q ss_pred ceEEEEECCCCEEEeCCCCeEEEEecCC-eEEEEEeC
Q psy12933 30 HLIECTLNPGDMLYIPPKVWHHVRSLST-SISVSFWF 65 (65)
Q Consensus 30 ~~~~~~l~pGd~LyiP~~WwH~V~s~~~-sisvn~w~ 65 (65)
..+-+.++|||+++||++|-|...|.+. .+.+..||
T Consensus 108 ~~~~v~~~~G~~v~IPp~yaH~tIN~g~~~L~~~~~~ 144 (182)
T PF06560_consen 108 DVIAVEAKPGDVVYIPPGYAHRTINTGDEPLVFAAWV 144 (182)
T ss_dssp -EEEEEE-TTEEEEE-TT-EEEEEE-SSS-EEEEEEE
T ss_pred eEEEEEeCCCCEEEECCCceEEEEECCCCcEEEEEEE
Confidence 4667889999999999999999999873 55555554
No 12
>PF00190 Cupin_1: Cupin; InterPro: IPR006045 This family represents the conserved barrel domain of the 'cupin' superfamily ('cupa' is the Latin term for a small barrel). This family contains 11S and 7S plant seed storage proteins, and germins. Plant seed storage proteins provide the major nitrogen source for the developing plant. ; GO: 0045735 nutrient reservoir activity; PDB: 2E9Q_A 2EVX_A 1OD5_A 1UCX_A 1UD1_C 1FXZ_C 3KGL_C 3KSC_D 1UIJ_F 1IPK_B ....
Probab=96.90 E-value=0.0033 Score=39.11 Aligned_cols=28 Identities=18% Similarity=0.330 Sum_probs=21.4
Q ss_pred EECCCCEEEeCCCCeEEEEecC--CeEEEE
Q psy12933 35 TLNPGDMLYIPPKVWHHVRSLS--TSISVS 62 (65)
Q Consensus 35 ~l~pGd~LyiP~~WwH~V~s~~--~sisvn 62 (65)
.+++||++++|+|+.|.+.+.+ ....+.
T Consensus 87 ~l~~Gdv~~vP~G~~h~~~n~~~~~~~~~~ 116 (144)
T PF00190_consen 87 RLKAGDVFVVPAGHPHWIINDGDDEALVLI 116 (144)
T ss_dssp EEETTEEEEE-TT-EEEEEECSSSSEEEEE
T ss_pred eeecccceeeccceeEEEEcCCCCCCEEEE
Confidence 4999999999999999999994 444443
No 13
>PRK04190 glucose-6-phosphate isomerase; Provisional
Probab=96.88 E-value=0.0041 Score=41.52 Aligned_cols=34 Identities=18% Similarity=0.375 Sum_probs=27.0
Q ss_pred eEEEEECCCCEEEeCCCCeEEEEecC--CeEEEEEe
Q psy12933 31 LIECTLNPGDMLYIPPKVWHHVRSLS--TSISVSFW 64 (65)
Q Consensus 31 ~~~~~l~pGd~LyiP~~WwH~V~s~~--~sisvn~w 64 (65)
.....++|||+++||++|-|++.+.+ +-..++.|
T Consensus 119 ~~~~~v~pGd~v~IPpg~~H~~iN~G~epl~fl~v~ 154 (191)
T PRK04190 119 ARWIEMEPGTVVYVPPYWAHRSVNTGDEPLVFLACY 154 (191)
T ss_pred EEEEEECCCCEEEECCCCcEEeEECCCCCEEEEEEE
Confidence 44678999999999999999999976 34444444
No 14
>COG0662 {ManC} Mannose-6-phosphate isomerase [Carbohydrate transport and metabolism]
Probab=96.81 E-value=0.0019 Score=39.86 Aligned_cols=27 Identities=33% Similarity=0.487 Sum_probs=24.4
Q ss_pred eEEEEECCCCEEEeCCCCeEEEEecCC
Q psy12933 31 LIECTLNPGDMLYIPPKVWHHVRSLST 57 (65)
Q Consensus 31 ~~~~~l~pGd~LyiP~~WwH~V~s~~~ 57 (65)
.-++.|++||++|||+|-.|++++.+.
T Consensus 74 ~~~~~v~~gd~~~iP~g~~H~~~N~G~ 100 (127)
T COG0662 74 GEEVEVKAGDSVYIPAGTPHRVRNTGK 100 (127)
T ss_pred CEEEEecCCCEEEECCCCcEEEEcCCC
Confidence 457889999999999999999999874
No 15
>PF07883 Cupin_2: Cupin domain; InterPro: IPR013096 This family represents the conserved barrel domain of the cupin superfamily [] (cupa is the Latin term for a small barrel). ; PDB: 2OPK_C 3BU7_B 2PHD_D 3NVC_A 3NKT_A 3NJZ_A 3NW4_A 3NST_A 3NL1_A 2H0V_A ....
Probab=96.76 E-value=0.0036 Score=33.73 Aligned_cols=26 Identities=46% Similarity=0.795 Sum_probs=23.9
Q ss_pred EEEEECCCCEEEeCCCCeEEEEecCC
Q psy12933 32 IECTLNPGDMLYIPPKVWHHVRSLST 57 (65)
Q Consensus 32 ~~~~l~pGd~LyiP~~WwH~V~s~~~ 57 (65)
-...+++||++++|++=+|++.+.+.
T Consensus 37 ~~~~l~~Gd~~~i~~~~~H~~~n~~~ 62 (71)
T PF07883_consen 37 ERVELKPGDAIYIPPGVPHQVRNPGD 62 (71)
T ss_dssp EEEEEETTEEEEEETTSEEEEEEESS
T ss_pred EEeEccCCEEEEECCCCeEEEEECCC
Confidence 47899999999999999999999874
No 16
>KOG2508|consensus
Probab=96.44 E-value=0.0071 Score=44.62 Aligned_cols=61 Identities=28% Similarity=0.498 Sum_probs=45.3
Q ss_pred CcceecccCCc-cccCccCCCCCCCCceEEEEECCCCEEEeCCCCeEEEEecCC-------eEEEEEeC
Q psy12933 5 LKCETSQSNSK-EEAQSSAPHNQTPGHLIECTLNPGDMLYIPPKVWHHVRSLST-------SISVSFWF 65 (65)
Q Consensus 5 ~~~~s~~~~~~-~d~~~~~~p~~~~~~~~~~~l~pGd~LyiP~~WwH~V~s~~~-------sisvn~w~ 65 (65)
++.|+..+..+ .|.+...+-..+..-.+-+++++|+++|+|+-|.|.+++-.. .|.+|+|+
T Consensus 333 ~~sf~~~~d~e~~d~~~~~l~~lA~~~~l~v~~~~~~~~~l~~~~~~~~~~~~~ssg~~~v~l~~~y~~ 401 (437)
T KOG2508|consen 333 LHKFMGFDDAKNVDVDKDELSALAGTVPLVVDLEPGDMLYLPASWFHEVTSSSASSGGSDVHIAFNYWF 401 (437)
T ss_pred hhhhccCCcccccccchhhccccccceeEEEecccCceeeechhheeeeeccccccCCcceeEEeeccc
Confidence 45555543333 455555565666778899999999999999999999998652 58888885
No 17
>COG4101 Predicted mannose-6-phosphate isomerase [Carbohydrate transport and metabolism]
Probab=95.94 E-value=0.011 Score=37.89 Aligned_cols=29 Identities=38% Similarity=0.603 Sum_probs=24.8
Q ss_pred CCceEEEEECCCCEEEeCCCCeEEEEecC
Q psy12933 28 PGHLIECTLNPGDMLYIPPKVWHHVRSLS 56 (65)
Q Consensus 28 ~~~~~~~~l~pGd~LyiP~~WwH~V~s~~ 56 (65)
+--...++..|||++|||+|--|+-.++.
T Consensus 84 ~rLE~ha~~~pGDf~YiPpgVPHqp~N~S 112 (142)
T COG4101 84 NRLEEHAEVGPGDFFYIPPGVPHQPANLS 112 (142)
T ss_pred cceeeeEEecCCCeEEcCCCCCCcccccC
Confidence 33456789999999999999999998876
No 18
>KOG1633|consensus
Probab=95.83 E-value=0.01 Score=46.78 Aligned_cols=31 Identities=26% Similarity=0.514 Sum_probs=27.2
Q ss_pred CceEEEEECCCCEEEeCCCCeEEEEecCCeE
Q psy12933 29 GHLIECTLNPGDMLYIPPKVWHHVRSLSTSI 59 (65)
Q Consensus 29 ~~~~~~~l~pGd~LyiP~~WwH~V~s~~~si 59 (65)
...+.|+|++|+.+|||.||-|.|..-..++
T Consensus 193 dkC~~~~l~~g~T~~iPsGwIhAV~Tp~d~l 223 (776)
T KOG1633|consen 193 DKCYKCILKQGQTLFIPSGWIHAVLTPTDCL 223 (776)
T ss_pred ceeEEEEeccCceEecccceeEeeecCcchh
Confidence 4688999999999999999999999876543
No 19
>smart00835 Cupin_1 Cupin. This family represents the conserved barrel domain of the 'cupin' superfamily ('cupa' is the Latin term for a small barrel). This family contains 11S and 7S plant seed storage proteins, and germins. Plant seed storage proteins provide the major nitrogen source for the developing plant.
Probab=95.65 E-value=0.035 Score=34.54 Aligned_cols=26 Identities=19% Similarity=0.331 Sum_probs=23.1
Q ss_pred eEEEEECCCCEEEeCCCCeEEEEecC
Q psy12933 31 LIECTLNPGDMLYIPPKVWHHVRSLS 56 (65)
Q Consensus 31 ~~~~~l~pGd~LyiP~~WwH~V~s~~ 56 (65)
.....+++||+++||++..|...+.+
T Consensus 74 ~~~~~l~~GD~~~ip~g~~H~~~n~~ 99 (146)
T smart00835 74 VYDARLREGDVFVVPQGHPHFQVNSG 99 (146)
T ss_pred EEEEEecCCCEEEECCCCEEEEEcCC
Confidence 34778999999999999999999875
No 20
>TIGR03404 bicupin_oxalic bicupin, oxalate decarboxylase family. Members of this protein family are defined as bicupins as they have two copies of the cupin domain (pfam00190). Two different known activities for members of this family are oxalate decarboxylase (EC 4.1.1.2) and oxalate oxidase (EC 1.2.3.4), although the latter activity has more often been found in distantly related monocupin (germin) proteins.
Probab=95.49 E-value=0.032 Score=40.28 Aligned_cols=26 Identities=23% Similarity=0.368 Sum_probs=23.3
Q ss_pred eEEEEECCCCEEEeCCCCeEEEEecC
Q psy12933 31 LIECTLNPGDMLYIPPKVWHHVRSLS 56 (65)
Q Consensus 31 ~~~~~l~pGd~LyiP~~WwH~V~s~~ 56 (65)
..+..+++||++|+|+|..|++++.+
T Consensus 288 ~~~~~l~~GD~~~iP~g~~H~i~N~G 313 (367)
T TIGR03404 288 ARTFDYQAGDVGYVPRNMGHYVENTG 313 (367)
T ss_pred EEEEEECCCCEEEECCCCeEEEEECC
Confidence 44567999999999999999999986
No 21
>KOG2132|consensus
Probab=95.47 E-value=0.024 Score=41.22 Aligned_cols=45 Identities=16% Similarity=0.072 Sum_probs=34.0
Q ss_pred cCCccccCccCCCCCCC--CceEEEEECCCCEEEeCCCCeEEEEecC
Q psy12933 12 SNSKEEAQSSAPHNQTP--GHLIECTLNPGDMLYIPPKVWHHVRSLS 56 (65)
Q Consensus 12 ~~~~~d~~~~~~p~~~~--~~~~~~~l~pGd~LyiP~~WwH~V~s~~ 56 (65)
..|...++..++|.|.. +....++..+|++++||..|||.+.+.-
T Consensus 231 k~Di~iPDyc~~~~f~~~~v~~~~w~GpaGtV~pih~dp~hNi~~qv 277 (355)
T KOG2132|consen 231 KFDISIPDYCSFPNFENEVVDINAWIGPAGTVLPIHMDPWHNILSQV 277 (355)
T ss_pred hhccCCCceeecCCCCccccceeEEeccCCceeccccccccceeeee
Confidence 33444456667899988 5555566666999999999999999875
No 22
>PRK13290 ectC L-ectoine synthase; Reviewed
Probab=95.42 E-value=0.05 Score=33.88 Aligned_cols=27 Identities=19% Similarity=0.175 Sum_probs=23.6
Q ss_pred eEEEEECCCCEEEeCCCCeEEEEecCC
Q psy12933 31 LIECTLNPGDMLYIPPKVWHHVRSLST 57 (65)
Q Consensus 31 ~~~~~l~pGd~LyiP~~WwH~V~s~~~ 57 (65)
..+..|+|||++|+|++=.|+.++.++
T Consensus 74 g~~~~L~aGD~i~~~~~~~H~~~N~e~ 100 (125)
T PRK13290 74 GEVHPIRPGTMYALDKHDRHYLRAGED 100 (125)
T ss_pred CEEEEeCCCeEEEECCCCcEEEEcCCC
Confidence 455789999999999999999999754
No 23
>PF09313 DUF1971: Domain of unknown function (DUF1971); InterPro: IPR015392 This uncharacterised domain is predominantly found in bacterial Tellurite resistance proteins. ; PDB: 3BB6_C 3M70_A 3DL3_I.
Probab=95.17 E-value=0.057 Score=31.72 Aligned_cols=29 Identities=31% Similarity=0.463 Sum_probs=24.6
Q ss_pred CceEEEEECCCCEEEeCCCCeEEEEecCC
Q psy12933 29 GHLIECTLNPGDMLYIPPKVWHHVRSLST 57 (65)
Q Consensus 29 ~~~~~~~l~pGd~LyiP~~WwH~V~s~~~ 57 (65)
....++.+.+|+.-+|||..||+|+.+++
T Consensus 47 ~~~~~~~~~~~~~~~i~Pq~wH~V~p~s~ 75 (82)
T PF09313_consen 47 EPEEEVFIPAGQPPVIEPQQWHRVEPLSD 75 (82)
T ss_dssp SESEEEEEETTEEEEE-TT-EEEEEESST
T ss_pred ceeEEEEeCCCCCceeCCCceEEEEECCC
Confidence 56789999999999999999999999875
No 24
>COG1917 Uncharacterized conserved protein, contains double-stranded beta-helix domain [Function unknown]
Probab=95.13 E-value=0.051 Score=33.06 Aligned_cols=28 Identities=29% Similarity=0.508 Sum_probs=24.2
Q ss_pred eEEEEECCCCEEEeCCCCeEEEEecCCe
Q psy12933 31 LIECTLNPGDMLYIPPKVWHHVRSLSTS 58 (65)
Q Consensus 31 ~~~~~l~pGd~LyiP~~WwH~V~s~~~s 58 (65)
.-..++++||++++|++=.|.+.+.+.+
T Consensus 81 g~~~~l~~Gd~i~ip~g~~H~~~a~~~~ 108 (131)
T COG1917 81 GEKKELKAGDVIIIPPGVVHGLKAVEDE 108 (131)
T ss_pred CCceEecCCCEEEECCCCeeeeccCCCC
Confidence 4456899999999999999999998754
No 25
>PF05523 FdtA: WxcM-like, C-terminal ; InterPro: IPR008894 This entry includes FdtA (Q6T1W8 from SWISSPROT) from Aneurinibacillus thermoaerophilus, which has been characterised as a dTDP-6-deoxy-3,4-keto-hexulose isomerase []. It also includes WxcM (Q93S92 from SWISSPROT) from Xanthomonas campestris pv campestris) []. ; PDB: 2PAK_A 2PAE_A 2PA7_B 2PAM_A.
Probab=95.00 E-value=0.067 Score=33.39 Aligned_cols=26 Identities=42% Similarity=0.814 Sum_probs=15.6
Q ss_pred EEEEECCC-CEEEeCCCCeEEEEecCC
Q psy12933 32 IECTLNPG-DMLYIPPKVWHHVRSLST 57 (65)
Q Consensus 32 ~~~~l~pG-d~LyiP~~WwH~V~s~~~ 57 (65)
.+++|... ..|+||+|+||...+.+.
T Consensus 76 ~~~~L~~~~~~L~Ippg~w~~~~~~s~ 102 (131)
T PF05523_consen 76 EEFILDEPNKGLYIPPGVWHGIKNFSE 102 (131)
T ss_dssp EEEEE--TTEEEEE-TT-EEEEE---T
T ss_pred EEEEECCCCeEEEECCchhhHhhccCC
Confidence 56666665 499999999999998874
No 26
>TIGR01221 rmlC dTDP-4-dehydrorhamnose 3,5-epimerase. This enzyme participates in the biosynthesis of dTDP-L-rhamnose, often as a precursor to LPS O-antigen
Probab=95.00 E-value=0.081 Score=34.86 Aligned_cols=44 Identities=14% Similarity=0.046 Sum_probs=30.0
Q ss_pred ccccCccCCCCCCCCceEEEEECCCCEEEeCCCCeEEEEecCCeE
Q psy12933 15 KEEAQSSAPHNQTPGHLIECTLNPGDMLYIPPKVWHHVRSLSTSI 59 (65)
Q Consensus 15 ~~d~~~~~~p~~~~~~~~~~~l~pGd~LyiP~~WwH~V~s~~~si 59 (65)
.+|.+... |.|.+-..++..-+.+.+||||+|.+|-..+++...
T Consensus 82 ~VDlR~~S-pTfG~~~~~~L~~~~~~~l~IP~G~aHGF~~L~d~a 125 (176)
T TIGR01221 82 AVDLRRNS-PTFGKWVGVLLSAENKRQLWIPEGFAHGFVVLSDEA 125 (176)
T ss_pred EEECCCCc-CCCCeEEEEEECCCCCCEEEeCCcceeEEEEcCCCe
Confidence 45666544 455554444444444779999999999999987643
No 27
>TIGR03404 bicupin_oxalic bicupin, oxalate decarboxylase family. Members of this protein family are defined as bicupins as they have two copies of the cupin domain (pfam00190). Two different known activities for members of this family are oxalate decarboxylase (EC 4.1.1.2) and oxalate oxidase (EC 1.2.3.4), although the latter activity has more often been found in distantly related monocupin (germin) proteins.
Probab=94.60 E-value=0.11 Score=37.62 Aligned_cols=25 Identities=28% Similarity=0.591 Sum_probs=21.9
Q ss_pred EEEEECCCCEEEeCCCCeEEEEecC
Q psy12933 32 IECTLNPGDMLYIPPKVWHHVRSLS 56 (65)
Q Consensus 32 ~~~~l~pGd~LyiP~~WwH~V~s~~ 56 (65)
+...|++||++|+|++..|..++.+
T Consensus 110 ~~~~L~~GD~~~fP~g~~H~~~n~~ 134 (367)
T TIGR03404 110 YIDDVGAGDLWYFPPGIPHSLQGLD 134 (367)
T ss_pred EEeEECCCCEEEECCCCeEEEEECC
Confidence 3346999999999999999999985
No 28
>PF13759 2OG-FeII_Oxy_5: Putative 2OG-Fe(II) oxygenase; PDB: 3BVC_B 2RG4_A.
Probab=94.26 E-value=0.12 Score=30.31 Aligned_cols=36 Identities=28% Similarity=0.501 Sum_probs=23.2
Q ss_pred CCceEEEEECCCCEEEeCCCCeEEEEecC---CeEEEEE
Q psy12933 28 PGHLIECTLNPGDMLYIPPKVWHHVRSLS---TSISVSF 63 (65)
Q Consensus 28 ~~~~~~~~l~pGd~LyiP~~WwH~V~s~~---~sisvn~ 63 (65)
......+..++||++..|+.-+|.|.... .-|+|+|
T Consensus 62 ~~~~~~~~p~~G~lvlFPs~l~H~v~p~~~~~~Risisf 100 (101)
T PF13759_consen 62 NSPYYIVEPEEGDLVLFPSWLWHGVPPNNSDEERISISF 100 (101)
T ss_dssp C-SEEEE---TTEEEEEETTSEEEE----SSS-EEEEEE
T ss_pred cCceEEeCCCCCEEEEeCCCCEEeccCcCCCCCEEEEEc
Confidence 45678889999999999999999998765 3566665
No 29
>PF05899 Cupin_3: Protein of unknown function (DUF861); InterPro: IPR008579 The function of the proteins in this entry are unknown. They contain the conserved barrel domain of the 'cupin' superfamily and members are specific to plants and bacteria.; PDB: 1RC6_A 3MYX_A 1O5U_A 2K9Z_A 1LKN_A 3ES4_A 1SFN_B 3BCW_A.
Probab=94.20 E-value=0.03 Score=31.68 Aligned_cols=19 Identities=21% Similarity=0.398 Sum_probs=15.8
Q ss_pred ceEEEEECCCCEEEeCCCC
Q psy12933 30 HLIECTLNPGDMLYIPPKV 48 (65)
Q Consensus 30 ~~~~~~l~pGd~LyiP~~W 48 (65)
..-..+++|||++++|+||
T Consensus 42 ~G~~~~~~aGD~~~~p~G~ 60 (74)
T PF05899_consen 42 DGETVTFKAGDAFFLPKGW 60 (74)
T ss_dssp TTEEEEEETTEEEEE-TTE
T ss_pred CCCEEEEcCCcEEEECCCC
Confidence 4567899999999999998
No 30
>KOG2107|consensus
Probab=94.20 E-value=0.092 Score=35.04 Aligned_cols=30 Identities=17% Similarity=0.478 Sum_probs=26.3
Q ss_pred CCCceEEEEECCCCEEEeCCCCeEEEEecC
Q psy12933 27 TPGHLIECTLNPGDMLYIPPKVWHHVRSLS 56 (65)
Q Consensus 27 ~~~~~~~~~l~pGd~LyiP~~WwH~V~s~~ 56 (65)
++..++.+-++.||++++|+|-.|+.....
T Consensus 111 ~dd~WIRi~vekGDlivlPaGiyHRFTtt~ 140 (179)
T KOG2107|consen 111 KDDQWIRIFVEKGDLIVLPAGIYHRFTTTP 140 (179)
T ss_pred CCCCEEEEEEecCCEEEecCcceeeeecCc
Confidence 356889999999999999999999987654
No 31
>PRK09943 DNA-binding transcriptional repressor PuuR; Provisional
Probab=94.07 E-value=0.13 Score=33.15 Aligned_cols=27 Identities=19% Similarity=0.124 Sum_probs=23.3
Q ss_pred eEEEEECCCCEEEeCCCCeEEEEecCC
Q psy12933 31 LIECTLNPGDMLYIPPKVWHHVRSLST 57 (65)
Q Consensus 31 ~~~~~l~pGd~LyiP~~WwH~V~s~~~ 57 (65)
.-+..|++||++|+|++-.|...+.+.
T Consensus 145 ~~~~~l~~Gd~~~~~~~~~H~~~n~~~ 171 (185)
T PRK09943 145 GQDYHLVAGQSYAINTGIPHSFSNTSA 171 (185)
T ss_pred CEEEEecCCCEEEEcCCCCeeeeCCCC
Confidence 346789999999999999999999764
No 32
>PF03079 ARD: ARD/ARD' family; InterPro: IPR004313 The two acireductone dioxygenase enzymes (ARD and ARD', previously known as E-2 and E-2') from Klebsiella pneumoniae share the same amino acid sequence Q9ZFE7 from SWISSPROT, but bind different metal ions: ARD binds Ni2+, ARD' binds Fe2+ []. ARD and ARD' can be experimentally interconverted by removal of the bound metal ion and reconstitution with the appropriate metal ion. The two enzymes share the same substrate, 1,2-dihydroxy-3-keto-5-(methylthio)pentene, but yield different products. ARD' yields the alpha-keto precursor of methionine (and formate), thus forming part of the ubiquitous methionine salvage pathway that converts 5'-methylthioadenosine (MTA) to methionine. This pathway is responsible for the tight control of the concentration of MTA, which is a powerful inhibitor of polyamine biosynthesis and transmethylation reactions []. ARD yields methylthiopropanoate, carbon monoxide and formate, and thus prevents the conversion of MTA to methionine. The role of the ARD catalysed reaction is unclear: methylthiopropanoate is cytotoxic, and carbon monoxide can activate guanylyl cyclase, leading to increased intracellular cGMP levels [, ]. This family also contains other proteins, whose functions are not well characterised.; GO: 0010309 acireductone dioxygenase [iron(II)-requiring] activity, 0055114 oxidation-reduction process; PDB: 1VR3_A 1ZRR_A 2HJI_A.
Probab=93.95 E-value=0.13 Score=33.44 Aligned_cols=30 Identities=17% Similarity=0.295 Sum_probs=22.5
Q ss_pred CCCceEEEEECCCCEEEeCCCCeEEEEecC
Q psy12933 27 TPGHLIECTLNPGDMLYIPPKVWHHVRSLS 56 (65)
Q Consensus 27 ~~~~~~~~~l~pGd~LyiP~~WwH~V~s~~ 56 (65)
++...+++.+++||.|.||+|-.|.....+
T Consensus 110 ~~~~wiri~~e~GDli~vP~g~~HrF~~~~ 139 (157)
T PF03079_consen 110 GDDVWIRILCEKGDLIVVPAGTYHRFTLGE 139 (157)
T ss_dssp TTCEEEEEEEETTCEEEE-TT--EEEEEST
T ss_pred CCCEEEEEEEcCCCEEecCCCCceeEEcCC
Confidence 345677899999999999999999998544
No 33
>TIGR03037 anthran_nbaC 3-hydroxyanthranilate 3,4-dioxygenase. Members of this protein family, from both bacteria and eukaryotes, are the enzyme 3-hydroxyanthranilate 3,4-dioxygenase. This enzyme acts on the tryptophan metabolite 3-hydroxyanthranilate and produces 2-amino-3-carboxymuconate semialdehyde, which can rearrange spontaneously to quinolinic acid and feed into nicotinamide biosynthesis, or undergo further enzymatic degradation.
Probab=93.53 E-value=0.14 Score=33.62 Aligned_cols=30 Identities=27% Similarity=0.522 Sum_probs=26.0
Q ss_pred EEEEECCCCEEEeCCCCeEEEEecCCeEEE
Q psy12933 32 IECTLNPGDMLYIPPKVWHHVRSLSTSISV 61 (65)
Q Consensus 32 ~~~~l~pGd~LyiP~~WwH~V~s~~~sisv 61 (65)
-.+.|++||++++|+|--|.......++.+
T Consensus 70 ~~v~L~eGd~flvP~gvpHsP~r~~~t~~L 99 (159)
T TIGR03037 70 EDVPIREGDIFLLPPHVPHSPQRPAGSIGL 99 (159)
T ss_pred EEEEECCCCEEEeCCCCCcccccCCCcEEE
Confidence 478999999999999999999887766655
No 34
>COG3450 Predicted enzyme of the cupin superfamily [General function prediction only]
Probab=93.48 E-value=0.084 Score=32.99 Aligned_cols=28 Identities=14% Similarity=0.096 Sum_probs=21.0
Q ss_pred CceEEEEECCCCEEEeCCCCeEEEEecC
Q psy12933 29 GHLIECTLNPGDMLYIPPKVWHHVRSLS 56 (65)
Q Consensus 29 ~~~~~~~l~pGd~LyiP~~WwH~V~s~~ 56 (65)
...-.+.++|||.+|+|+||.=-=+-++
T Consensus 79 d~Ge~v~~~aGD~~~~~~G~~g~W~V~E 106 (116)
T COG3450 79 DGGEPVEVRAGDSFVFPAGFKGTWEVLE 106 (116)
T ss_pred CCCeEEEEcCCCEEEECCCCeEEEEEee
Confidence 3456688999999999999975444433
No 35
>KOG1356|consensus
Probab=93.44 E-value=0.011 Score=47.09 Aligned_cols=32 Identities=28% Similarity=0.361 Sum_probs=28.4
Q ss_pred CceEEEEECCCCEEEeCCCCeEEEEecCCeEE
Q psy12933 29 GHLIECTLNPGDMLYIPPKVWHHVRSLSTSIS 60 (65)
Q Consensus 29 ~~~~~~~l~pGd~LyiP~~WwH~V~s~~~sis 60 (65)
+.+..+++..||.+|||+|--|||+++..++.
T Consensus 796 Ve~WtfvQ~LGdAVfIPAGaPHQVrNLkSCik 827 (889)
T KOG1356|consen 796 VEPWTFVQFLGDAVFIPAGAPHQVRNLKSCIK 827 (889)
T ss_pred CCccchhhcccceEEecCCCcHHhhhhhhHHH
Confidence 67889999999999999999999999976543
No 36
>PF01050 MannoseP_isomer: Mannose-6-phosphate isomerase; InterPro: IPR001538 Mannose-6-phosphate isomerase or phosphomannose isomerase (5.3.1.8 from EC) (PMI) is the enzyme that catalyses the interconversion of mannose-6-phosphate and fructose-6-phosphate. In eukaryotes PMI is involved in the synthesis of GDP-mannose, a constituent of N- and O-linked glycans and GPI anchors and in prokaryotes it participates in a variety of pathways, including capsular polysaccharide biosynthesis and D-mannose metabolism. PMI's belong to the cupin superfamily whose functions range from isomerase and epimerase activities involved in the modification of cell wall carbohydrates in bacteria and plants, to non-enzymatic storage proteins in plant seeds, and transcription factors linked to congenital baldness in mammals []. Three classes of PMI have been defined []. The type II phosphomannose isomerases are bifunctional enzymes 5.3.1.8 from EC. This entry covers the isomerase region of the protein []. The guanosine diphospho-D-mannose pyrophosphorylase region is described in another InterPro entry (see IPR005836 from INTERPRO).; GO: 0016779 nucleotidyltransferase activity, 0005976 polysaccharide metabolic process
Probab=93.39 E-value=0.15 Score=32.85 Aligned_cols=26 Identities=35% Similarity=0.484 Sum_probs=22.7
Q ss_pred eEEEEECCCCEEEeCCCCeEEEEecC
Q psy12933 31 LIECTLNPGDMLYIPPKVWHHVRSLS 56 (65)
Q Consensus 31 ~~~~~l~pGd~LyiP~~WwH~V~s~~ 56 (65)
.-...+.+||.+|||+|-.|++++.+
T Consensus 101 ~~~~~~~~g~sv~Ip~g~~H~i~n~g 126 (151)
T PF01050_consen 101 DEEFTLKEGDSVYIPRGAKHRIENPG 126 (151)
T ss_pred CEEEEEcCCCEEEECCCCEEEEECCC
Confidence 34567999999999999999999875
No 37
>PF02311 AraC_binding: AraC-like ligand binding domain; InterPro: IPR003313 This entry defines the arabinose-binding and dimerisation domain of the bacterial gene regulatory protein AraC. The crystal structure of the arabinose-binding and dimerization domain of the Escherichia coli gene regulatory protein AraC was determined in the presence and absence of L-arabinose. The arabinose-bound molecule shows that the protein adopts an unusual fold, binding sugar within a beta barrel and completely burying the arabinose with the amino-terminal arm of the protein. Dimer contacts in the presence of arabinose are mediated by an antiparallel coiled-coil. In the uncomplexed protein, the amino-terminal arm is disordered, uncovering the sugar-binding pocket and allowing it to serve as an oligomerization interface [].; GO: 0006355 regulation of transcription, DNA-dependent; PDB: 1XJA_B 2ARA_A 2AAC_B 2ARC_A.
Probab=92.87 E-value=0.43 Score=27.48 Aligned_cols=27 Identities=37% Similarity=0.482 Sum_probs=19.0
Q ss_pred eEEEEECCCCEEEeCCCCeEEEEecCC
Q psy12933 31 LIECTLNPGDMLYIPPKVWHHVRSLST 57 (65)
Q Consensus 31 ~~~~~l~pGd~LyiP~~WwH~V~s~~~ 57 (65)
.-+..++|||++++|++-.|.....+.
T Consensus 40 ~~~~~l~~g~~~li~p~~~H~~~~~~~ 66 (136)
T PF02311_consen 40 GQEYPLKPGDLFLIPPGQPHSYYPDSN 66 (136)
T ss_dssp TEEEEE-TT-EEEE-TTS-EEEEE-TT
T ss_pred CEEEEEECCEEEEecCCccEEEecCCC
Confidence 456789999999999999999998873
No 38
>COG3615 TehB Uncharacterized protein/domain, possibly involved in tellurite resistance [Inorganic ion transport and metabolism]
Probab=92.73 E-value=0.21 Score=30.54 Aligned_cols=33 Identities=21% Similarity=0.632 Sum_probs=24.4
Q ss_pred eEEEEECCCCEEEeCCCCeEEEEecC--CeEEEEEe
Q psy12933 31 LIECTLNPGDMLYIPPKVWHHVRSLS--TSISVSFW 64 (65)
Q Consensus 31 ~~~~~l~pGd~LyiP~~WwH~V~s~~--~sisvn~w 64 (65)
...++.++ +.-++|+.-||+|+.++ .-+.++||
T Consensus 61 e~~~~~ea-~~~~~~PQ~WHrVea~tDD~e~~l~Fy 95 (99)
T COG3615 61 EHVFSIEA-QFPVFPPQAWHRVEAMTDDAEFNLSFY 95 (99)
T ss_pred eEEEeecC-CCCccChhHeeeeeecccccEEEEEEE
Confidence 44455555 55778888999999876 47888886
No 39
>COG1898 RfbC dTDP-4-dehydrorhamnose 3,5-epimerase and related enzymes [Cell envelope biogenesis, outer membrane]
Probab=92.66 E-value=0.52 Score=31.22 Aligned_cols=47 Identities=23% Similarity=0.382 Sum_probs=32.3
Q ss_pred ccccCccCCCCCCCCceEEEEECCC--CEEEeCCCCeEEEEecCCeEEEEEe
Q psy12933 15 KEEAQSSAPHNQTPGHLIECTLNPG--DMLYIPPKVWHHVRSLSTSISVSFW 64 (65)
Q Consensus 15 ~~d~~~~~~p~~~~~~~~~~~l~pG--d~LyiP~~WwH~V~s~~~sisvn~w 64 (65)
.+|.+... |.| .+...++|.+- .+|+||+|+.|-..+++.+..+.++
T Consensus 82 ~vDlR~~S-pTy--g~~~~~~ls~~N~~~l~IP~G~AHGf~~L~d~~~~~y~ 130 (173)
T COG1898 82 AVDLRKDS-PTY--GKWVGVVLSAENKRQLYIPPGFAHGFQVLSDDAEVVYK 130 (173)
T ss_pred EEEccCCC-CCc--ceEEEEEecCCCceEEEeCCcccceeEEccCceEEEEE
Confidence 34555533 333 44555555554 9999999999999999976665553
No 40
>PF00908 dTDP_sugar_isom: dTDP-4-dehydrorhamnose 3,5-epimerase; InterPro: IPR000888 Deoxythymidine diphosphate (dTDP)-4-keto-6-deoxy-d-hexulose 3, 5-epimerase (RmlC, 5.1.3.13 from EC) is involved in the biosynthesis of dTDP-l-rhamnose, which is an essential component of the bacterial cell wall, converting dTDP-4-keto-6-deoxy-D-glucose to dTDP-4-keto-L-rhamnose. The crystal structure of RmlC from Methanobacterium thermoautotrophicum was determined in the presence and absence of a substrate analogue. RmlC is a homodimer comprising a central jelly roll motif, which extends in two directions into longer beta-sheets. Binding of dTDP is stabilised by ionic interactions to the phosphate group and by a combination of ionic and hydrophobic interactions with the base. The active site, which is located in the centre of the jelly roll, is formed by residues that are conserved in all known RmlC sequence homologues. The active site is lined with a number of charged residues and a number of residues with hydrogen-bonding potentials, which together comprise a potential network for substrate binding and catalysis. The active site is also lined with aromatic residues which provide favorable environments for the base moiety of dTDP and potentially for the sugar moiety of the substrate [].; GO: 0008830 dTDP-4-dehydrorhamnose 3,5-epimerase activity, 0009103 lipopolysaccharide biosynthetic process; PDB: 1EPZ_A 1EP0_A 1NXM_A 1NZC_D 2IXL_C 1NYW_B 2IXC_D 1PM7_B 1UPI_A 3RYK_B ....
Probab=92.51 E-value=0.44 Score=31.31 Aligned_cols=42 Identities=24% Similarity=0.423 Sum_probs=26.7
Q ss_pred ccccCccCCCCCCCCceEEEEECCC--CEEEeCCCCeEEEEecCCeE
Q psy12933 15 KEEAQSSAPHNQTPGHLIECTLNPG--DMLYIPPKVWHHVRSLSTSI 59 (65)
Q Consensus 15 ~~d~~~~~~p~~~~~~~~~~~l~pG--d~LyiP~~WwH~V~s~~~si 59 (65)
.+|.++.. |.| .+...+.|.++ -.||||+|.+|-..+++..-
T Consensus 82 ~vDlR~~S-pTf--g~~~~~~Ls~~n~~~l~IP~G~aHGf~~l~d~a 125 (176)
T PF00908_consen 82 AVDLRKGS-PTF--GKWVSVELSAENPRQLYIPPGVAHGFQTLEDDA 125 (176)
T ss_dssp EEE-BTTS-TTT--T-EEEEEEETTT--EEEE-TTEEEEEEESSSEE
T ss_pred EEECCCCC-CCC--CEEEEEEeCccccCEEEeCCcceeeEEeccCce
Confidence 34666533 444 45555666555 58999999999999998653
No 41
>PRK13264 3-hydroxyanthranilate 3,4-dioxygenase; Provisional
Probab=92.22 E-value=0.27 Score=32.80 Aligned_cols=31 Identities=26% Similarity=0.490 Sum_probs=26.4
Q ss_pred EEEEECCCCEEEeCCCCeEEEEecCCeEEEE
Q psy12933 32 IECTLNPGDMLYIPPKVWHHVRSLSTSISVS 62 (65)
Q Consensus 32 ~~~~l~pGd~LyiP~~WwH~V~s~~~sisvn 62 (65)
..+.|++||++++|++--|..+....++.+-
T Consensus 76 ~~v~L~eGd~fllP~gvpHsP~r~~~tv~Lv 106 (177)
T PRK13264 76 RDVPIREGEMFLLPPHVPHSPQREAGSIGLV 106 (177)
T ss_pred eeEEECCCCEEEeCCCCCcCCccCCCeEEEE
Confidence 4788999999999999999998877666553
No 42
>TIGR02466 conserved hypothetical protein. This family consists of uncharacterized proteins in Caulobacter crescentus CB15, Bdellovibrio bacteriovorus HD100, Synechococcus sp. WH 8102 (2), Silicibacter pomeroyi DSS-3 (2), and Hyphomonas neptunium ATCC 15444. The context of nearby genes differs substantially between members and does point to any specific biological role.
Probab=91.92 E-value=0.28 Score=32.91 Aligned_cols=32 Identities=19% Similarity=0.223 Sum_probs=24.7
Q ss_pred EEEECCCCEEEeCCCCeEEEEecC---C--eEEEEEe
Q psy12933 33 ECTLNPGDMLYIPPKVWHHVRSLS---T--SISVSFW 64 (65)
Q Consensus 33 ~~~l~pGd~LyiP~~WwH~V~s~~---~--sisvn~w 64 (65)
.+.-++|++|..|+.-||.|.... . |||.|+-
T Consensus 163 ~v~P~~G~lvlFPS~L~H~v~p~~~~~~RISiSFNl~ 199 (201)
T TIGR02466 163 YVPPQEGRVLLFESWLRHEVPPNESEEERISVSFNYA 199 (201)
T ss_pred EECCCCCeEEEECCCCceecCCCCCCCCEEEEEEeeE
Confidence 455599999999999999998865 2 5555553
No 43
>PF13640 2OG-FeII_Oxy_3: 2OG-Fe(II) oxygenase superfamily; PDB: 3DKQ_B 3GZE_D 3HQR_A 2Y34_A 2G1M_A 2G19_A 3OUI_A 3OUJ_A 2HBU_A 2Y33_A ....
Probab=91.88 E-value=0.19 Score=28.80 Aligned_cols=30 Identities=30% Similarity=0.747 Sum_probs=25.5
Q ss_pred ECCCCEEEeCC-CCeEEEEec-C--CeEEEEEeC
Q psy12933 36 LNPGDMLYIPP-KVWHHVRSL-S--TSISVSFWF 65 (65)
Q Consensus 36 l~pGd~LyiP~-~WwH~V~s~-~--~sisvn~w~ 65 (65)
-++|+++++|+ ..||.|... . .-++|++||
T Consensus 66 p~~g~~v~F~~~~~~H~v~~v~~~~~R~~l~~~~ 99 (100)
T PF13640_consen 66 PKPGRLVIFPSDNSLHGVTPVGEGGRRYSLTFWF 99 (100)
T ss_dssp -BTTEEEEEESCTCEEEEEEE-EESEEEEEEEEE
T ss_pred CCCCEEEEEeCCCCeecCcccCCCCCEEEEEEEE
Confidence 89999999999 999999998 3 368888885
No 44
>TIGR03214 ura-cupin putative allantoin catabolism protein. This model represents a protein containing a tandem arrangement of cupin domains (N-terminal part of pfam07883 and C-terminal more distantly related to pfam00190). This protein is found in the vicinity of genes involved in the catabolism of allantoin, a breakdown product of urate and sometimes of urate iteslf. The distribution of pathway components in the genomes in which this family is observed suggests that the function is linked to the allantoate catabolism to glyoxylate pathway (GenProp0686) since it is sometimes found in genomes lacking any elements of the xanthine-to-allantoin pathways (e.g. in Enterococcus faecalis).
Probab=91.71 E-value=0.29 Score=33.67 Aligned_cols=27 Identities=7% Similarity=0.010 Sum_probs=23.6
Q ss_pred eEEEEECCCCEEEeCCCCeEEEEecCC
Q psy12933 31 LIECTLNPGDMLYIPPKVWHHVRSLST 57 (65)
Q Consensus 31 ~~~~~l~pGd~LyiP~~WwH~V~s~~~ 57 (65)
.-...+++||++|||++--|+..+.+.
T Consensus 217 g~~~~V~~GD~i~i~~~~~h~~~~~G~ 243 (260)
T TIGR03214 217 NNWVPVEAGDYIWMGAYCPQACYAGGR 243 (260)
T ss_pred CEEEEecCCCEEEECCCCCEEEEecCC
Confidence 446789999999999999999999863
No 45
>PRK15457 ethanolamine utilization protein EutQ; Provisional
Probab=91.17 E-value=0.26 Score=34.14 Aligned_cols=24 Identities=17% Similarity=0.229 Sum_probs=19.7
Q ss_pred eEEEEECCCCEEEeCCCCeEEEEe
Q psy12933 31 LIECTLNPGDMLYIPPKVWHHVRS 54 (65)
Q Consensus 31 ~~~~~l~pGd~LyiP~~WwH~V~s 54 (65)
.-+.+++|||++|||+|-.|+-.+
T Consensus 192 G~t~~l~pGDvlfIPkGs~~hf~t 215 (233)
T PRK15457 192 GETMIAKAGDVMFIPKGSSIEFGT 215 (233)
T ss_pred CEEEEeCCCcEEEECCCCeEEecC
Confidence 467789999999999999955543
No 46
>TIGR03214 ura-cupin putative allantoin catabolism protein. This model represents a protein containing a tandem arrangement of cupin domains (N-terminal part of pfam07883 and C-terminal more distantly related to pfam00190). This protein is found in the vicinity of genes involved in the catabolism of allantoin, a breakdown product of urate and sometimes of urate iteslf. The distribution of pathway components in the genomes in which this family is observed suggests that the function is linked to the allantoate catabolism to glyoxylate pathway (GenProp0686) since it is sometimes found in genomes lacking any elements of the xanthine-to-allantoin pathways (e.g. in Enterococcus faecalis).
Probab=90.84 E-value=0.67 Score=31.87 Aligned_cols=24 Identities=21% Similarity=0.387 Sum_probs=21.6
Q ss_pred EEEECCCCEEEeCCCCeEEEEecC
Q psy12933 33 ECTLNPGDMLYIPPKVWHHVRSLS 56 (65)
Q Consensus 33 ~~~l~pGd~LyiP~~WwH~V~s~~ 56 (65)
..+|++||.+|+|++=-|+.++.+
T Consensus 99 ~~~L~~Gd~~y~pa~~~H~~~N~~ 122 (260)
T TIGR03214 99 THELREGGYAYLPPGSKWTLANAQ 122 (260)
T ss_pred EEEECCCCEEEECCCCCEEEEECC
Confidence 458999999999999999999876
No 47
>PRK15131 mannose-6-phosphate isomerase; Provisional
Probab=90.80 E-value=0.25 Score=36.06 Aligned_cols=24 Identities=25% Similarity=0.402 Sum_probs=20.9
Q ss_pred eEEEEECCCCEEEeCCCCeEEEEe
Q psy12933 31 LIECTLNPGDMLYIPPKVWHHVRS 54 (65)
Q Consensus 31 ~~~~~l~pGd~LyiP~~WwH~V~s 54 (65)
--.+.|+|||++|+|+|--|..-+
T Consensus 236 LN~v~l~pGeaifipAg~~HAyl~ 259 (389)
T PRK15131 236 LNVVKLNPGEAMFLFAETPHAYLQ 259 (389)
T ss_pred eeEEEeCCCCEEEeCCCCCeEEcC
Confidence 457899999999999999998654
No 48
>PF06249 EutQ: Ethanolamine utilisation protein EutQ; InterPro: IPR010424 The eut operon of Salmonella typhimurium encodes proteins involved in the cobalamin-dependent degradation of ethanolamine. The role of EutQ in this process is unclear [].; PDB: 2PYT_B 3LWC_A.
Probab=90.79 E-value=0.19 Score=32.67 Aligned_cols=18 Identities=33% Similarity=0.612 Sum_probs=12.6
Q ss_pred eEEEEECCCCEEEeCCCC
Q psy12933 31 LIECTLNPGDMLYIPPKV 48 (65)
Q Consensus 31 ~~~~~l~pGd~LyiP~~W 48 (65)
.-..+-+|||+||||.|-
T Consensus 112 G~~~~A~~GDvi~iPkGs 129 (152)
T PF06249_consen 112 GQTVTAKPGDVIFIPKGS 129 (152)
T ss_dssp TEEEEEETT-EEEE-TT-
T ss_pred CEEEEEcCCcEEEECCCC
Confidence 456778999999999993
No 49
>TIGR02272 gentisate_1_2 gentisate 1,2-dioxygenase. This family consists of gentisate 1,2-dioxygenases. This ring-opening enzyme acts in salicylate degradation that goes via gentisate rather than via catechol. It converts gentisate to maleylpyruvate. Some putative gentisate 1,2-dioxygenases are excluded by a relatively high trusted cutoff score because they are too closely related to known examples of 1-hydroxy-2-naphthoate dioxygenase. Therefore some homologs may be bona fide gentisate 1,2-dioxygenases even if they score below the given cutoffs.
Probab=90.70 E-value=0.45 Score=34.36 Aligned_cols=30 Identities=23% Similarity=0.443 Sum_probs=25.3
Q ss_pred CCceEEEEECCCCEEEeCCCCeEEEEecCC
Q psy12933 28 PGHLIECTLNPGDMLYIPPKVWHHVRSLST 57 (65)
Q Consensus 28 ~~~~~~~~l~pGd~LyiP~~WwH~V~s~~~ 57 (65)
.+...++.+++||++.+|++.||.-.+.+.
T Consensus 116 ~V~g~~~~~~~gD~~~tP~w~wH~H~n~~d 145 (335)
T TIGR02272 116 AVDGERTTMHPGDFIITPSWTWHDHGNPGD 145 (335)
T ss_pred EECCEEEeeeCCCEEEeCCCeeEecccCCC
Confidence 356778899999999999999999877653
No 50
>PF15138 Syncollin: Syncollin
Probab=90.50 E-value=0.35 Score=30.12 Aligned_cols=36 Identities=25% Similarity=0.509 Sum_probs=28.6
Q ss_pred CceEEEEECCC-CEEEeCCCCeEEEEecC--CeEEEEEe
Q psy12933 29 GHLIECTLNPG-DMLYIPPKVWHHVRSLS--TSISVSFW 64 (65)
Q Consensus 29 ~~~~~~~l~pG-d~LyiP~~WwH~V~s~~--~sisvn~w 64 (65)
...-+..++|| |.-|+|.+|=+.+.|+- .-..+..|
T Consensus 32 C~G~~l~v~pg~DlPylPs~w~n~iSSlvV~~RCeLtVW 70 (112)
T PF15138_consen 32 CGGAELSVEPGDDLPYLPSGWANKISSLVVAPRCELTVW 70 (112)
T ss_pred cCCcEEeecCCCCCCcCCccccCeeeeEEEcCceEEEEE
Confidence 35678899999 78899999999999874 45666655
No 51
>PLN00212 glutelin; Provisional
Probab=90.33 E-value=0.92 Score=34.37 Aligned_cols=33 Identities=18% Similarity=0.270 Sum_probs=26.5
Q ss_pred CceEEEEECCCCEEEeCCCCeEEEEecCCeEEE
Q psy12933 29 GHLIECTLNPGDMLYIPPKVWHHVRSLSTSISV 61 (65)
Q Consensus 29 ~~~~~~~l~pGd~LyiP~~WwH~V~s~~~sisv 61 (65)
-..+...|++||+++||.+..|..++....+.+
T Consensus 390 ~~vf~~~L~~GdvfVVPqg~~v~~~A~~egfe~ 422 (493)
T PLN00212 390 KTVFNGVLRPGQLLIIPQHYAVLKKAEREGCQY 422 (493)
T ss_pred CEEEEEEEcCCCEEEECCCCeEEEeecCCceEE
Confidence 356888999999999999999988776544433
No 52
>PRK05467 Fe(II)-dependent oxygenase superfamily protein; Provisional
Probab=89.62 E-value=0.9 Score=31.05 Aligned_cols=34 Identities=24% Similarity=0.544 Sum_probs=29.1
Q ss_pred eEEEEECCCCEEEeCCCCeEEEEecC--CeEEEEEe
Q psy12933 31 LIECTLNPGDMLYIPPKVWHHVRSLS--TSISVSFW 64 (65)
Q Consensus 31 ~~~~~l~pGd~LyiP~~WwH~V~s~~--~sisvn~w 64 (65)
...+.+++|++|..|+.-+|+|...+ .-+++.+|
T Consensus 140 ~~~Vkp~aG~~vlfps~~lH~v~pVt~G~R~~~~~W 175 (226)
T PRK05467 140 EHRVKLPAGDLVLYPSTSLHRVTPVTRGVRVASFFW 175 (226)
T ss_pred cEEEecCCCeEEEECCCCceeeeeccCccEEEEEec
Confidence 36788999999999999999999865 46777777
No 53
>TIGR01479 GMP_PMI mannose-1-phosphate guanylyltransferase/mannose-6-phosphate isomerase. This enzyme is known to be bifunctional, as both mannose-6-phosphate isomerase (EC 5.3.1.8) (PMI) and mannose-1-phosphate guanylyltransferase (EC 2.7.7.22) in Pseudomonas aeruginosa, Xanthomonas campestris, and Gluconacetobacter xylinus. The literature on the enzyme from E. coli attributes mannose-6-phosphate isomerase activity to an adjacent gene, but the present sequence has not been shown to lack the activity. The PMI domain is C-terminal.
Probab=89.25 E-value=0.86 Score=33.66 Aligned_cols=27 Identities=22% Similarity=0.252 Sum_probs=23.6
Q ss_pred eEEEEECCCCEEEeCCCCeEEEEecCC
Q psy12933 31 LIECTLNPGDMLYIPPKVWHHVRSLST 57 (65)
Q Consensus 31 ~~~~~l~pGd~LyiP~~WwH~V~s~~~ 57 (65)
.-+..|++||++|+|++--|+.++.+.
T Consensus 414 g~~~~l~~GDsi~ip~~~~H~~~N~g~ 440 (468)
T TIGR01479 414 DETLLLTENESTYIPLGVIHRLENPGK 440 (468)
T ss_pred CEEEEecCCCEEEECCCCcEEEEcCCC
Confidence 346789999999999999999999763
No 54
>TIGR00218 manA mannose-6-phosphate isomerase, class I. The names phosphomannose isomerase and mannose-6-phosphate isomerase are synonomous. This family contains two rather deeply branched groups. One group contains an experimentally determined phosphomannose isomerase of Streptococcus mutans as well as three uncharacterized paralogous proteins of Bacillus subtilis, all at more than 50 % identity to each other, plus a more distant homolog from Archaeoglobus fulgidus. The other group contains members from E. coli, budding yeast, Borrelia burgdorferi, etc.
Probab=88.76 E-value=0.27 Score=34.28 Aligned_cols=22 Identities=36% Similarity=0.679 Sum_probs=19.3
Q ss_pred eEEEEECCCCEEEeCCCCeEEE
Q psy12933 31 LIECTLNPGDMLYIPPKVWHHV 52 (65)
Q Consensus 31 ~~~~~l~pGd~LyiP~~WwH~V 52 (65)
--.+.++|||++|||+|--|..
T Consensus 150 ln~v~v~~Gd~i~ipaGt~HA~ 171 (302)
T TIGR00218 150 LNRIKLKPGDFFYVPSGTPHAY 171 (302)
T ss_pred hcccccCCCCEEEeCCCCcccc
Confidence 4567899999999999999984
No 55
>PRK11171 hypothetical protein; Provisional
Probab=88.22 E-value=1.4 Score=30.33 Aligned_cols=26 Identities=23% Similarity=0.340 Sum_probs=22.9
Q ss_pred eEEEEECCCCEEEeCCCCeEEEEecC
Q psy12933 31 LIECTLNPGDMLYIPPKVWHHVRSLS 56 (65)
Q Consensus 31 ~~~~~l~pGd~LyiP~~WwH~V~s~~ 56 (65)
.-+.+|++||++|+|++=-|+.++.+
T Consensus 100 g~~~~L~~GDsi~~p~~~~H~~~N~g 125 (266)
T PRK11171 100 GKTHALSEGGYAYLPPGSDWTLRNAG 125 (266)
T ss_pred CEEEEECCCCEEEECCCCCEEEEECC
Confidence 34668999999999999999999876
No 56
>PLN02288 mannose-6-phosphate isomerase
Probab=87.91 E-value=0.46 Score=34.89 Aligned_cols=22 Identities=32% Similarity=0.421 Sum_probs=19.2
Q ss_pred EEEEECCCCEEEeCCCCeEEEE
Q psy12933 32 IECTLNPGDMLYIPPKVWHHVR 53 (65)
Q Consensus 32 ~~~~l~pGd~LyiP~~WwH~V~ 53 (65)
-.+.|+|||.+|+|+|--|.--
T Consensus 251 N~v~L~PGeaifl~ag~~HAYl 272 (394)
T PLN02288 251 NYVKLNPGEALYLGANEPHAYL 272 (394)
T ss_pred ceEecCCCCEEEecCCCCceec
Confidence 3688999999999999999743
No 57
>PRK13502 transcriptional activator RhaR; Provisional
Probab=87.40 E-value=1.1 Score=29.92 Aligned_cols=26 Identities=23% Similarity=0.354 Sum_probs=22.0
Q ss_pred eEEEEECCCCEEEeCCCCeEEEEecC
Q psy12933 31 LIECTLNPGDMLYIPPKVWHHVRSLS 56 (65)
Q Consensus 31 ~~~~~l~pGd~LyiP~~WwH~V~s~~ 56 (65)
.-+..++|||+++||++=.|.....+
T Consensus 55 ~~~~~l~~g~l~li~~~~~H~~~~~~ 80 (282)
T PRK13502 55 ERPYRITRGDLFYIRAEDKHSYTSVN 80 (282)
T ss_pred CEEEeecCCcEEEECCCCcccccccC
Confidence 45678999999999999999887644
No 58
>TIGR03027 pepcterm_export putative polysaccharide export protein, PEP-CTERM sytem-associated. This protein family belongs to the larger set of polysaccharide biosynthesis/export proteins described by Pfam model pfam02563. Members of this family are variable in either containing of lacking a 78-residue insert, but appear to fall within a single clade, nevertheless, where the regions in which the gene is found encode components of the PEP-CTERM/EpsH proposed exosortase protein sorting system.
Probab=86.40 E-value=0.52 Score=30.05 Aligned_cols=18 Identities=39% Similarity=0.756 Sum_probs=15.2
Q ss_pred EEEEECCCCEEEeCCCCe
Q psy12933 32 IECTLNPGDMLYIPPKVW 49 (65)
Q Consensus 32 ~~~~l~pGd~LyiP~~Ww 49 (65)
....|+|||.+|+|..++
T Consensus 148 ~n~~L~~gD~I~Vp~~~f 165 (165)
T TIGR03027 148 ANVELKPGDVLIIPESWF 165 (165)
T ss_pred CCceeCCCCEEEEecccC
Confidence 557899999999998753
No 59
>PF12852 Cupin_6: Cupin
Probab=86.29 E-value=0.99 Score=28.74 Aligned_cols=25 Identities=24% Similarity=0.339 Sum_probs=21.9
Q ss_pred EEEEECCCCEEEeCCCCeEEEEecC
Q psy12933 32 IECTLNPGDMLYIPPKVWHHVRSLS 56 (65)
Q Consensus 32 ~~~~l~pGd~LyiP~~WwH~V~s~~ 56 (65)
-.+.|++||++++|.+=-|...+..
T Consensus 55 ~~~~L~~GDivllp~g~~H~l~~~~ 79 (186)
T PF12852_consen 55 EPIRLEAGDIVLLPRGTAHVLSSDP 79 (186)
T ss_pred CeEEecCCCEEEEcCCCCeEeCCCC
Confidence 5688999999999999999997654
No 60
>COG4766 EutQ Ethanolamine utilization protein [Amino acid transport and metabolism]
Probab=85.88 E-value=0.66 Score=30.81 Aligned_cols=16 Identities=25% Similarity=0.594 Sum_probs=13.2
Q ss_pred EEEECCCCEEEeCCCC
Q psy12933 33 ECTLNPGDMLYIPPKV 48 (65)
Q Consensus 33 ~~~l~pGd~LyiP~~W 48 (65)
.++-+|||++|||.|-
T Consensus 137 tv~a~aGDvifiPKgs 152 (176)
T COG4766 137 TVIAGAGDVIFIPKGS 152 (176)
T ss_pred eEecCCCcEEEecCCC
Confidence 4556899999999984
No 61
>COG1482 ManA Phosphomannose isomerase [Carbohydrate transport and metabolism]
Probab=85.82 E-value=0.77 Score=32.98 Aligned_cols=24 Identities=25% Similarity=0.506 Sum_probs=20.4
Q ss_pred eEEEEECCCCEEEeCCCCeEEEEe
Q psy12933 31 LIECTLNPGDMLYIPPKVWHHVRS 54 (65)
Q Consensus 31 ~~~~~l~pGd~LyiP~~WwH~V~s 54 (65)
--.+.|+|||.+|+|+|--|....
T Consensus 157 Ln~v~lkpGe~~fl~Agt~HA~~~ 180 (312)
T COG1482 157 LNRVKLKPGEAFFLPAGTPHAYLK 180 (312)
T ss_pred hcEEecCCCCEEEecCCCceeecc
Confidence 346889999999999999998654
No 62
>PRK11171 hypothetical protein; Provisional
Probab=84.83 E-value=1.5 Score=30.15 Aligned_cols=26 Identities=8% Similarity=0.030 Sum_probs=22.5
Q ss_pred eEEEEECCCCEEEeCCCCeEEEEecC
Q psy12933 31 LIECTLNPGDMLYIPPKVWHHVRSLS 56 (65)
Q Consensus 31 ~~~~~l~pGd~LyiP~~WwH~V~s~~ 56 (65)
.-+..|++||+|++|+.--|..++.+
T Consensus 222 ~~~~~l~~GD~i~~~~~~~h~~~N~g 247 (266)
T PRK11171 222 NDWVEVEAGDFIWMRAYCPQACYAGG 247 (266)
T ss_pred CEEEEeCCCCEEEECCCCCEEEECCC
Confidence 34567999999999999999999875
No 63
>PRK13501 transcriptional activator RhaR; Provisional
Probab=84.55 E-value=1.8 Score=29.24 Aligned_cols=27 Identities=22% Similarity=0.317 Sum_probs=22.9
Q ss_pred ceEEEEECCCCEEEeCCCCeEEEEecC
Q psy12933 30 HLIECTLNPGDMLYIPPKVWHHVRSLS 56 (65)
Q Consensus 30 ~~~~~~l~pGd~LyiP~~WwH~V~s~~ 56 (65)
......+++||+++||++=.|..++.+
T Consensus 54 ~~~~~~l~~g~~~~I~p~~~H~~~~~~ 80 (290)
T PRK13501 54 NDHPYRITCGDVFYIQAADHHSYESVH 80 (290)
T ss_pred CCeeeeecCCeEEEEcCCCcccccccC
Confidence 355678999999999999999988654
No 64
>PF07385 DUF1498: Protein of unknown function (DUF1498); InterPro: IPR010864 This family consists of several hypothetical bacterial proteins of around 225 residues in length. The function of this family is unknown.; PDB: 3MPB_B 3KMH_A.
Probab=84.02 E-value=2.5 Score=29.22 Aligned_cols=28 Identities=25% Similarity=0.514 Sum_probs=20.0
Q ss_pred ceEEEEECCCCEEEeCCCCeEEEEecCC
Q psy12933 30 HLIECTLNPGDMLYIPPKVWHHVRSLST 57 (65)
Q Consensus 30 ~~~~~~l~pGd~LyiP~~WwH~V~s~~~ 57 (65)
..-+++|+||+.+-+|++-||+-.....
T Consensus 152 aG~~l~L~PGESiTL~Pg~yH~Fw~e~g 179 (225)
T PF07385_consen 152 AGTQLRLNPGESITLPPGIYHWFWGEGG 179 (225)
T ss_dssp TT-EEEE-TT-EEEE-TTEEEEEEE-TT
T ss_pred CCceEEeCCCCeEeeCCCCeeeEEecCC
Confidence 4567899999999999999999887664
No 65
>PF05721 PhyH: Phytanoyl-CoA dioxygenase (PhyH); InterPro: IPR008775 This family is made up of several eukaryotic phytanoyl-CoA dioxygenase (PhyH) proteins as well as a number of bacterial deoxygenases. PhyH is a peroxisomal enzyme catalysing the first step of phytanic acid alpha-oxidation. PhyH deficiency causes Refsum's disease (RD) which is an inherited neurological syndrome biochemically characterised by the accumulation of phytanic acid in plasma and tissues [].; PDB: 3GJA_A 3EMR_A 3OBZ_A 2OPW_A 3NNL_B 3NNF_A 3NNM_B 3NNJ_A 2FCV_B 2FCU_A ....
Probab=83.94 E-value=1.5 Score=26.96 Aligned_cols=26 Identities=15% Similarity=0.442 Sum_probs=22.0
Q ss_pred CceEEEEECCCCEEEeCCCCeEEEEe
Q psy12933 29 GHLIECTLNPGDMLYIPPKVWHHVRS 54 (65)
Q Consensus 29 ~~~~~~~l~pGd~LyiP~~WwH~V~s 54 (65)
.....+.+++||+|+.-+.-||....
T Consensus 177 ~~~~~~~~~~Gdvl~~~~~~~H~s~~ 202 (211)
T PF05721_consen 177 DEWVPVPMKAGDVLFFHSRLIHGSGP 202 (211)
T ss_dssp SGCEEE-BSTTEEEEEETTSEEEEE-
T ss_pred CceEEeecCCCeEEEEcCCccccCCC
Confidence 56789999999999999999999875
No 66
>PRK15460 cpsB mannose-1-phosphate guanyltransferase; Provisional
Probab=83.79 E-value=2.6 Score=31.67 Aligned_cols=26 Identities=19% Similarity=0.212 Sum_probs=23.0
Q ss_pred EEEEECCCCEEEeCCCCeEEEEecCC
Q psy12933 32 IECTLNPGDMLYIPPKVWHHVRSLST 57 (65)
Q Consensus 32 ~~~~l~pGd~LyiP~~WwH~V~s~~~ 57 (65)
.+..|+|||.+|+|++--|+.++.+.
T Consensus 424 ~~~~L~~GDSi~ip~g~~H~~~N~g~ 449 (478)
T PRK15460 424 DIKLLGENESIYIPLGATHCLENPGK 449 (478)
T ss_pred EEEEecCCCEEEECCCCcEEEEcCCC
Confidence 45789999999999999999999863
No 67
>COG4297 Uncharacterized protein containing double-stranded beta helix domain [Function unknown]
Probab=82.81 E-value=1.8 Score=28.35 Aligned_cols=33 Identities=27% Similarity=0.229 Sum_probs=26.5
Q ss_pred CCCCCceEEEEECCCCEEEeCCCCeEEEEecCC
Q psy12933 25 NQTPGHLIECTLNPGDMLYIPPKVWHHVRSLST 57 (65)
Q Consensus 25 ~~~~~~~~~~~l~pGd~LyiP~~WwH~V~s~~~ 57 (65)
.+..+...++.+..||+|.||+|-=|.-.....
T Consensus 78 ~iGG~~G~el~v~~GDvlliPAGvGH~rl~sS~ 110 (163)
T COG4297 78 QIGGADGQELEVGEGDVLLIPAGVGHCRLHSSA 110 (163)
T ss_pred EecCCCCceeeecCCCEEEEecCcccccccCCC
Confidence 355677888999999999999999887655543
No 68
>smart00702 P4Hc Prolyl 4-hydroxylase alpha subunit homologues. Mammalian enzymes catalyse hydroxylation of collagen, for example. Prokaryotic enzymes might catalyse hydroxylation of antibiotic peptides. These are 2-oxoglutarate-dependent dioxygenases, requiring 2-oxoglutarate and dioxygen as cosubstrates and ferrous iron as a cofactor.
Probab=82.32 E-value=2.9 Score=26.33 Aligned_cols=43 Identities=16% Similarity=0.271 Sum_probs=32.5
Q ss_pred CCCCCCCceEEEEECCCCEEEeCCC---CeEEEEecC--CeEEEEEeC
Q psy12933 23 PHNQTPGHLIECTLNPGDMLYIPPK---VWHHVRSLS--TSISVSFWF 65 (65)
Q Consensus 23 ~p~~~~~~~~~~~l~pGd~LyiP~~---WwH~V~s~~--~sisvn~w~ 65 (65)
||.........+.-++|++|+.|.+ -+|.|.... .-++++.|+
T Consensus 130 f~~~~~~~~~~v~P~~G~~v~f~~~~~~~~H~v~pv~~G~r~~~~~W~ 177 (178)
T smart00702 130 FPGLGLMVCATVKPKKGDLLFFPSGRGRSLHGVCPVTRGSRWAITGWI 177 (178)
T ss_pred ecCCCCccceEEeCCCCcEEEEeCCCCCccccCCcceeCCEEEEEEEE
Confidence 3333323456788899999999985 899999876 578899885
No 69
>PRK13500 transcriptional activator RhaR; Provisional
Probab=81.40 E-value=3 Score=28.77 Aligned_cols=27 Identities=22% Similarity=0.310 Sum_probs=22.5
Q ss_pred eEEEEECCCCEEEeCCCCeEEEEecCC
Q psy12933 31 LIECTLNPGDMLYIPPKVWHHVRSLST 57 (65)
Q Consensus 31 ~~~~~l~pGd~LyiP~~WwH~V~s~~~ 57 (65)
..+..+++||+++||++=-|...+.+.
T Consensus 85 ~~~~~l~~Gdl~~I~~~~~H~~~~~~~ 111 (312)
T PRK13500 85 DRPYRITRGDLFYIHADDKHSYASVND 111 (312)
T ss_pred CEEEeecCCeEEEECCCCeecccccCC
Confidence 456788999999999999999876553
No 70
>PRK10296 DNA-binding transcriptional regulator ChbR; Provisional
Probab=79.68 E-value=2.5 Score=28.25 Aligned_cols=24 Identities=25% Similarity=0.333 Sum_probs=20.6
Q ss_pred eEEEEECCCCEEEeCCCCeEEEEe
Q psy12933 31 LIECTLNPGDMLYIPPKVWHHVRS 54 (65)
Q Consensus 31 ~~~~~l~pGd~LyiP~~WwH~V~s 54 (65)
.....++|||+++||+|-.|+...
T Consensus 60 ~~~~~l~~g~l~~i~p~~~H~~~~ 83 (278)
T PRK10296 60 GKRVLLERGDFVFIPLGSHHQSFY 83 (278)
T ss_pred CEEEEECCCcEEEeCCCCccceee
Confidence 456799999999999999997753
No 71
>PF12973 Cupin_7: ChrR Cupin-like domain; PDB: 3O14_B 2Z2S_F 2Q1Z_B 3EBR_A.
Probab=79.62 E-value=2.8 Score=23.97 Aligned_cols=25 Identities=32% Similarity=0.460 Sum_probs=20.1
Q ss_pred EECCCCEEEeCCCCeEEEEecCCeE
Q psy12933 35 TLNPGDMLYIPPKVWHHVRSLSTSI 59 (65)
Q Consensus 35 ~l~pGd~LyiP~~WwH~V~s~~~si 59 (65)
...+||.++.|++--|...+.+.++
T Consensus 61 ~~~~G~~~~~p~g~~h~~~s~~gc~ 85 (91)
T PF12973_consen 61 RYGAGDWLRLPPGSSHTPRSDEGCL 85 (91)
T ss_dssp EEETTEEEEE-TTEEEEEEESSCEE
T ss_pred cCCCCeEEEeCCCCccccCcCCCEE
Confidence 3479999999999999999876643
No 72
>COG1791 Uncharacterized conserved protein, contains double-stranded beta-helix domain [Function unknown]
Probab=79.57 E-value=3.6 Score=27.63 Aligned_cols=29 Identities=21% Similarity=0.554 Sum_probs=24.7
Q ss_pred CCceEEEEECCCCEEEeCCCCeEEEEecC
Q psy12933 28 PGHLIECTLNPGDMLYIPPKVWHHVRSLS 56 (65)
Q Consensus 28 ~~~~~~~~l~pGd~LyiP~~WwH~V~s~~ 56 (65)
..+.+.+.+++||.|-+|+|-.|.-.-.+
T Consensus 114 d~~~~~i~c~~gDLI~vP~gi~HwFtlt~ 142 (181)
T COG1791 114 DGKVYQIRCEKGDLISVPPGIYHWFTLTE 142 (181)
T ss_pred CCcEEEEEEccCCEEecCCCceEEEEccC
Confidence 34788999999999999999999887444
No 73
>TIGR03028 EpsE polysaccharide export protein EpsE. Sequences in this family of proteins are members of a polysaccharide export protein family (pfam02563) which includes the wza protein from E.coli. This family of proteins are homologous to the EpsE protein of the methanolan biosynthesis operon of Methylobacillus species strain 12S. The distribution of this protein appears to be restricted to a subset of exopolysaccharide operons containing a syntenic grouping of genes including a variant of the EpsH exosortase protein. Exosortase has been proposed to be involved in the targetting and processing of proteins containing the PEP-CTERM domain to the exopolysaccharide layer.
Probab=79.24 E-value=1.3 Score=29.93 Aligned_cols=15 Identities=20% Similarity=0.733 Sum_probs=13.2
Q ss_pred EECCCCEEEeCCCCe
Q psy12933 35 TLNPGDMLYIPPKVW 49 (65)
Q Consensus 35 ~l~pGd~LyiP~~Ww 49 (65)
.|+|||++|+|..|+
T Consensus 225 ~l~~gDii~V~~s~~ 239 (239)
T TIGR03028 225 LVQPDDVIYVRESLF 239 (239)
T ss_pred ccCCCCEEEEeCccC
Confidence 499999999998874
No 74
>PF13532 2OG-FeII_Oxy_2: 2OG-Fe(II) oxygenase superfamily; PDB: 2IUW_A 3BTZ_A 3RZL_A 3RZH_A 3S5A_A 3RZG_A 3RZJ_A 3BUC_A 3H8X_A 3H8R_A ....
Probab=78.92 E-value=2.3 Score=26.93 Aligned_cols=29 Identities=24% Similarity=0.410 Sum_probs=20.7
Q ss_pred CCceEEEEECCCCEEEeC----CCCeEEEEecCC
Q psy12933 28 PGHLIECTLNPGDMLYIP----PKVWHHVRSLST 57 (65)
Q Consensus 28 ~~~~~~~~l~pGd~LyiP----~~WwH~V~s~~~ 57 (65)
....+++.|.+||++++- ..| |.|.....
T Consensus 144 ~~~~~~~~L~~gsl~vm~g~~r~~~-H~I~~~~~ 176 (194)
T PF13532_consen 144 DDEPIEVPLPPGSLLVMSGEARYDW-HGIPPVKK 176 (194)
T ss_dssp TS-EEEEEE-TTEEEEEETTHHHHE-EEE-S-SC
T ss_pred CCccEEEEcCCCCEEEeChHHhhhe-eEcccccC
Confidence 347899999999999998 457 99988764
No 75
>COG5285 Protein involved in biosynthesis of mitomycin antibiotics/polyketide fumonisin [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=77.53 E-value=3.9 Score=29.39 Aligned_cols=31 Identities=19% Similarity=0.364 Sum_probs=25.9
Q ss_pred CCCCceEEEEECCCCEEEeCCCCeEEEEecC
Q psy12933 26 QTPGHLIECTLNPGDMLYIPPKVWHHVRSLS 56 (65)
Q Consensus 26 ~~~~~~~~~~l~pGd~LyiP~~WwH~V~s~~ 56 (65)
|-+.....+.|++||+|+.-+.-||...+-.
T Consensus 187 y~~~~~~pv~lekGDallF~~~L~HaA~aNr 217 (299)
T COG5285 187 YLERNAVPVELEKGDALLFNGSLWHAAGANR 217 (299)
T ss_pred hhhhcceeeeecCCCEEEEcchhhhhhhcCC
Confidence 3344588999999999999999999988765
No 76
>PF01238 PMI_typeI: Phosphomannose isomerase type I; InterPro: IPR001250 Mannose-6-phosphate isomerase or phosphomannose isomerase (5.3.1.8 from EC) (PMI) is the enzyme that catalyses the interconversion of mannose-6-phosphate and fructose-6-phosphate. In eukaryotes PMI is involved in the synthesis of GDP-mannose, a constituent of N- and O-linked glycans and GPI anchors and in prokaryotes it participates in a variety of pathways, including capsular polysaccharide biosynthesis and D-mannose metabolism. PMI's belong to the cupin superfamily whose functions range from isomerase and epimerase activities involved in the modification of cell wall carbohydrates in bacteria and plants, to non-enzymatic storage proteins in plant seeds, and transcription factors linked to congenital baldness in mammals []. Three classes of PMI have been defined []. Type I includes eukaryotic PMI and the enzyme encoded by the manA gene in enterobacteria. PMI has a bound zinc ion, which is essential for activity. A crystal structure of PMI from Candida albicans shows that the enzyme has three distinct domains []. The active site lies in the central domain, contains a single essential zinc atom, and forms a deep, open cavity of suitable dimensions to contain M6P or F6P The central domain is flanked by a helical domain on one side and a jelly-roll like domain on the other.; GO: 0004476 mannose-6-phosphate isomerase activity, 0008270 zinc ion binding, 0005975 carbohydrate metabolic process; PDB: 1PMI_A 1QWR_B 1ZX5_A 3H1Y_A 2WFP_A 3H1M_A 3H1W_A.
Probab=74.95 E-value=2.4 Score=30.62 Aligned_cols=23 Identities=26% Similarity=0.436 Sum_probs=18.0
Q ss_pred EEEEECCCCEEEeCCCCeEEEEe
Q psy12933 32 IECTLNPGDMLYIPPKVWHHVRS 54 (65)
Q Consensus 32 ~~~~l~pGd~LyiP~~WwH~V~s 54 (65)
-.+.|+|||.+|+|++--|.-.+
T Consensus 250 N~v~L~pGeaifl~a~~~HAYl~ 272 (373)
T PF01238_consen 250 NYVELQPGEAIFLPAGEPHAYLS 272 (373)
T ss_dssp EEEEE-TT-EEEEHTTHHEEEEE
T ss_pred EEEEecCCceEEecCCCcccccc
Confidence 45699999999999999998665
No 77
>PF06052 3-HAO: 3-hydroxyanthranilic acid dioxygenase; InterPro: IPR010329 Members of this protein family, from both bacteria and eukaryotes, are the enzyme 3-hydroxyanthranilate 3,4-dioxygenase (1.13.11.6 from EC). It is part of the kynurenine pathway for the degradation of tryptophan and the biosynthesis of nicotinic acid [].The prokaryotic homologue is involved in the 2-nitrobenzoate degradation pathway []. The enzyme acts on the tryptophan metabolite 3-hydroxyanthranilate and produces 2-amino-3-carboxymuconate semialdehyde, which can rearrange spontaneously to quinolinic acid and feed into nicotinamide biosynthesis, or undergo further enzymatic degradation.; GO: 0000334 3-hydroxyanthranilate 3,4-dioxygenase activity, 0005506 iron ion binding, 0008152 metabolic process, 0055114 oxidation-reduction process; PDB: 1ZVF_A 1YFX_A 1YFW_A 1YFY_A 1YFU_A 2QNK_A 3FE5_A.
Probab=74.70 E-value=10 Score=24.81 Aligned_cols=33 Identities=21% Similarity=0.493 Sum_probs=22.4
Q ss_pred CceEEEEECCCCEEEeCCCCeEEEEecCCeEEE
Q psy12933 29 GHLIECTLNPGDMLYIPPKVWHHVRSLSTSISV 61 (65)
Q Consensus 29 ~~~~~~~l~pGd~LyiP~~WwH~V~s~~~sisv 61 (65)
...-++.+++||+-++|++--|.-+-...||.+
T Consensus 72 g~~kdi~I~EGe~fLLP~~vpHsP~R~~~tiGL 104 (151)
T PF06052_consen 72 GKFKDIPIREGEMFLLPANVPHSPQRPADTIGL 104 (151)
T ss_dssp TEEEEEEE-TTEEEEE-TT--EEEEE-TT-EEE
T ss_pred CceEEEEeCCCcEEecCCCCCCCCcCCCCcEEE
Confidence 455689999999999999999998877777654
No 78
>PRK15401 alpha-ketoglutarate-dependent dioxygenase AlkB; Provisional
Probab=73.16 E-value=8 Score=26.28 Aligned_cols=36 Identities=22% Similarity=0.345 Sum_probs=27.1
Q ss_pred CCceEEEEECCCCEEEeC---CCCeEEEEecC---------CeEEEEE
Q psy12933 28 PGHLIECTLNPGDMLYIP---PKVWHHVRSLS---------TSISVSF 63 (65)
Q Consensus 28 ~~~~~~~~l~pGd~LyiP---~~WwH~V~s~~---------~sisvn~ 63 (65)
..+...+.|+.||+|++- ..|||.|.... .-|+++|
T Consensus 163 ~~~~~~l~L~~Gdllvm~G~sr~~~HgVp~~~~~~~p~~g~~RINLTF 210 (213)
T PRK15401 163 SDPLQRILLEHGDVVVWGGPSRLRYHGILPLKAGEHPLTGECRINLTF 210 (213)
T ss_pred CCceEEEEeCCCCEEEECchHhheeccCCcCCCCcCCCCCCCeEEEEe
Confidence 455789999999999975 35899996553 3577776
No 79
>PRK13503 transcriptional activator RhaS; Provisional
Probab=73.04 E-value=3.2 Score=27.46 Aligned_cols=25 Identities=16% Similarity=0.198 Sum_probs=20.9
Q ss_pred EEEEECCCCEEEeCCCCeEEEEecC
Q psy12933 32 IECTLNPGDMLYIPPKVWHHVRSLS 56 (65)
Q Consensus 32 ~~~~l~pGd~LyiP~~WwH~V~s~~ 56 (65)
....+++||+++||++=.|..+..+
T Consensus 53 ~~~~l~~g~~~~i~~~~~h~~~~~~ 77 (278)
T PRK13503 53 QPYTLSGGTVCFVRDHDRHLYEHTD 77 (278)
T ss_pred CcccccCCcEEEECCCccchhhhcc
Confidence 3567899999999999999876654
No 80
>TIGR02297 HpaA 4-hydroxyphenylacetate catabolism regulatory protein HpaA. This putative transcriptional regulator, which contains both the substrate-binding, dimerization domain (pfam02311) and the helix-turn-helix DNA-binding domain (pfam00165) of the AraC famil, is located proximal to genes of the 4-hydroxyphenylacetate catabolism pathway.
Probab=72.96 E-value=6.7 Score=26.14 Aligned_cols=26 Identities=19% Similarity=0.075 Sum_probs=21.8
Q ss_pred eEEEEECCCCEEEeCCCCeEEEEecC
Q psy12933 31 LIECTLNPGDMLYIPPKVWHHVRSLS 56 (65)
Q Consensus 31 ~~~~~l~pGd~LyiP~~WwH~V~s~~ 56 (65)
.....+++||++++|++--|......
T Consensus 61 ~~~~~l~~g~~~ii~~~~~H~~~~~~ 86 (287)
T TIGR02297 61 EHEYSEYAPCFFLTPPSVPHGFVTDL 86 (287)
T ss_pred CEEEEecCCeEEEeCCCCccccccCC
Confidence 34678999999999999999987654
No 81
>PLN00212 glutelin; Provisional
Probab=71.88 E-value=9.5 Score=29.02 Aligned_cols=32 Identities=19% Similarity=0.198 Sum_probs=25.6
Q ss_pred CCCCceEEEEECCCCEEEeCCCCeEEEEecCC
Q psy12933 26 QTPGHLIECTLNPGDMLYIPPKVWHHVRSLST 57 (65)
Q Consensus 26 ~~~~~~~~~~l~pGd~LyiP~~WwH~V~s~~~ 57 (65)
+++.+--.-.++.||++.||+|--|...+.+.
T Consensus 142 ~~d~hqkv~~lr~GDViaiPaG~~hw~yN~Gd 173 (493)
T PLN00212 142 FRDEHQKIHQFRQGDVVALPAGVAHWFYNDGD 173 (493)
T ss_pred cccccccceEeccCCEEEECCCCeEEEEeCCC
Confidence 33444444689999999999999999999874
No 82
>PRK12335 tellurite resistance protein TehB; Provisional
Probab=68.10 E-value=17 Score=24.85 Aligned_cols=31 Identities=23% Similarity=0.493 Sum_probs=22.5
Q ss_pred EEECCCCEEEeCCCCeEEEEecC--CeEEEEEe
Q psy12933 34 CTLNPGDMLYIPPKVWHHVRSLS--TSISVSFW 64 (65)
Q Consensus 34 ~~l~pGd~LyiP~~WwH~V~s~~--~sisvn~w 64 (65)
.....++..++|++-||++...+ ..+.+.|+
T Consensus 60 ~l~~~~~~~~i~p~~wh~v~~~s~d~~~~l~fy 92 (287)
T PRK12335 60 IFDAENQPPFIEPQAWHRIEAASDDLECQLSFY 92 (287)
T ss_pred EEecCCCCceeCCcceEEEEEcCCCcEEEEEEE
Confidence 33444566679999999999985 35666664
No 83
>PF05995 CDO_I: Cysteine dioxygenase type I; InterPro: IPR010300 Cysteine dioxygenase type I (1.13.11.20 from EC) converts cysteine to cysteinesulphinic acid and is the rate-limiting step in sulphate production.; GO: 0005506 iron ion binding, 0017172 cysteine dioxygenase activity, 0046439 L-cysteine metabolic process, 0055114 oxidation-reduction process; PDB: 2IC1_A 3EQE_B 3ELN_A 2B5H_A 2GH2_A 2Q4S_A 2ATF_A 2GM6_A 3USS_B.
Probab=67.28 E-value=27 Score=22.52 Aligned_cols=33 Identities=24% Similarity=0.289 Sum_probs=23.8
Q ss_pred EEEEECCCCEEEeCCCCeEEEEecC---CeEEEEEe
Q psy12933 32 IECTLNPGDMLYIPPKVWHHVRSLS---TSISVSFW 64 (65)
Q Consensus 32 ~~~~l~pGd~LyiP~~WwH~V~s~~---~sisvn~w 64 (65)
....+.+|.+.+.++.-.|+|++.. .++|+..+
T Consensus 127 ~~~~~~~g~~~~~~~~~iH~v~n~s~~~~avSLHvY 162 (175)
T PF05995_consen 127 RERLLPGGVTYIFDPHGIHRVENPSGDEPAVSLHVY 162 (175)
T ss_dssp EEEEETTTEEEEBTTTBEEEEEES-SSS-EEEEEEE
T ss_pred ceEecCCCeEEecCCCCeEEeccCCCCCCEEEEEEc
Confidence 4456788888888999999997653 57777654
No 84
>COG3822 ABC-type sugar transport system, auxiliary component [General function prediction only]
Probab=64.22 E-value=15 Score=25.26 Aligned_cols=28 Identities=21% Similarity=0.467 Sum_probs=23.9
Q ss_pred CceEEEEECCCCEEEeCCCCeEEEEecC
Q psy12933 29 GHLIECTLNPGDMLYIPPKVWHHVRSLS 56 (65)
Q Consensus 29 ~~~~~~~l~pGd~LyiP~~WwH~V~s~~ 56 (65)
...-+..|+||+.+-+|||-||..-+.+
T Consensus 150 ~ag~~lkL~PGesitL~Pg~~HsFwae~ 177 (225)
T COG3822 150 TAGSQLKLSPGESITLPPGLYHSFWAEE 177 (225)
T ss_pred ccceeEEECCCCcEecCCCceeeeeecC
Confidence 3467789999999999999999887665
No 85
>PRK10371 DNA-binding transcriptional regulator MelR; Provisional
Probab=62.79 E-value=7.1 Score=26.92 Aligned_cols=27 Identities=11% Similarity=0.091 Sum_probs=22.9
Q ss_pred ceEEEEECCCCEEEeCCCCeEEEEecC
Q psy12933 30 HLIECTLNPGDMLYIPPKVWHHVRSLS 56 (65)
Q Consensus 30 ~~~~~~l~pGd~LyiP~~WwH~V~s~~ 56 (65)
..-+.+++|||+++||++--|.....+
T Consensus 62 ~g~~~~l~~Gd~ili~s~~~H~~~~~~ 88 (302)
T PRK10371 62 NNEKVQINQGHITLFWACTPHQLTDPG 88 (302)
T ss_pred CCEEEEEcCCcEEEEecCCcccccccC
Confidence 356788999999999999999887654
No 86
>PF12851 Tet_JBP: Oxygenase domain of the 2OGFeDO superfamily ; InterPro: IPR024779 TETs are 2OG- and Fe(II)-dependent oxygenases that catalyse the conversion of 5 methyl-Cytosine (5-MC) to 5-hydroxymethyl-cytosine (hmC) in cultured cells and in vitro []. Interestingly TET2 is considered as an oncogene, as it is found mutated in some types of cancer []. This entry represents the double-stranded beta helix (DSBH) fold of the 2-oxoglutarate (2OG) - Fe(II) oxygenases. DSBH comprises a part of the catalytic domain in TETS. It is found in many organisms including fruit fly, African malaria mosquito, zebrafish, mouse and human.
Probab=62.23 E-value=22 Score=23.01 Aligned_cols=36 Identities=28% Similarity=0.401 Sum_probs=29.7
Q ss_pred ceEEEEECCCCEEEeCCC-CeEEEEecC-------CeEEEEEeC
Q psy12933 30 HLIECTLNPGDMLYIPPK-VWHHVRSLS-------TSISVSFWF 65 (65)
Q Consensus 30 ~~~~~~l~pGd~LyiP~~-WwH~V~s~~-------~sisvn~w~ 65 (65)
..+.+.+.+|++|++-+. .-|.|..++ .-+++.|++
T Consensus 126 ~g~~~~~~~GtVl~~~~~~~~Hgvtpv~~~~~~~~~R~slvfy~ 169 (171)
T PF12851_consen 126 LGVAFAYQPGTVLIFCAKRELHGVTPVESPNRNHGTRISLVFYQ 169 (171)
T ss_pred CCEEEecCCCcEEEEcccceeeecCcccCCCCCCCeEEEEEEEe
Confidence 577889999999999999 888888776 258887764
No 87
>PF06251 Caps_synth_GfcC: Capsule biosynthesis GfcC; InterPro: IPR010425 This entry represents uncharacterised bacterial proteins that contain a central beta-grasp like domain related to the SLBB domain [].; PDB: 3P42_B.
Probab=62.22 E-value=6 Score=26.45 Aligned_cols=13 Identities=38% Similarity=0.820 Sum_probs=7.8
Q ss_pred EEECCCCEEEeCC
Q psy12933 34 CTLNPGDMLYIPP 46 (65)
Q Consensus 34 ~~l~pGd~LyiP~ 46 (65)
..+.||++||||-
T Consensus 191 ~~l~PG~~I~Vp~ 203 (229)
T PF06251_consen 191 QELAPGATIYVPF 203 (229)
T ss_dssp EE--TT-EEEE-B
T ss_pred CCCCCCCEEEEcC
Confidence 4699999999997
No 88
>COG3435 Gentisate 1,2-dioxygenase [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=61.44 E-value=5.1 Score=29.29 Aligned_cols=31 Identities=19% Similarity=0.309 Sum_probs=24.8
Q ss_pred CCCCceEEEEECCCCEEEeCCCCeEEEEecC
Q psy12933 26 QTPGHLIECTLNPGDMLYIPPKVWHHVRSLS 56 (65)
Q Consensus 26 ~~~~~~~~~~l~pGd~LyiP~~WwH~V~s~~ 56 (65)
+.-+..-...+++||.|..|.+-||.--+.+
T Consensus 125 ~T~VdGer~~M~~GDfilTP~w~wHdHgn~g 155 (351)
T COG3435 125 YTVVDGERTPMEAGDFILTPAWTWHDHGNEG 155 (351)
T ss_pred eEeecCceeeccCCCEEEccCceeccCCCCC
Confidence 4455667788999999999999999765543
No 89
>cd02786 MopB_CT_3 The MopB_CT_3 CD includes a group of related uncharacterized bacterial molybdopterin-binding oxidoreductase-like domains with a putative N-terminal iron-sulfur [4Fe-4S] cluster binding site and molybdopterin cofactor binding site. This CD is of the conserved molybdopterin_binding C-terminal (MopB_CT) region present in many, but not all, MopB homologs.
Probab=59.74 E-value=6.7 Score=22.92 Aligned_cols=12 Identities=8% Similarity=0.307 Sum_probs=10.6
Q ss_pred CEEEeCCCCeEE
Q psy12933 40 DMLYIPPKVWHH 51 (65)
Q Consensus 40 d~LyiP~~WwH~ 51 (65)
+++++|.||||.
T Consensus 73 g~v~~~~g~~~~ 84 (116)
T cd02786 73 GVVVAEGGWWRE 84 (116)
T ss_pred CEEEeecccccc
Confidence 678999999995
No 90
>PRK10572 DNA-binding transcriptional regulator AraC; Provisional
Probab=57.50 E-value=15 Score=24.64 Aligned_cols=25 Identities=32% Similarity=0.482 Sum_probs=20.7
Q ss_pred eEEEEECCCCEEEeCCCCeEEEEec
Q psy12933 31 LIECTLNPGDMLYIPPKVWHHVRSL 55 (65)
Q Consensus 31 ~~~~~l~pGd~LyiP~~WwH~V~s~ 55 (65)
.-+..++|||++++|++--|.....
T Consensus 66 ~~~~~~~~g~~i~i~p~~~h~~~~~ 90 (290)
T PRK10572 66 GRAFVCRPGDLLLFPPGEIHHYGRH 90 (290)
T ss_pred CeeEecCCCCEEEECCCCceeeccC
Confidence 4467899999999999999986544
No 91
>TIGR02408 ectoine_ThpD ectoine hydroxylase. Both ectoine and hydroxyectoine are compatible solvents that serve as protectants against osmotic and thermal stresses. A number of genomes synthesize ectoine. This enzyme allows conversion of ectoine to hydroxyectoine, which may be more effective for some purposes, and is found in a subset of ectoine-producing organisms.
Probab=56.83 E-value=8.3 Score=26.41 Aligned_cols=25 Identities=16% Similarity=0.141 Sum_probs=21.1
Q ss_pred eEEEEECCCCEEEeCCCCeEEEEec
Q psy12933 31 LIECTLNPGDMLYIPPKVWHHVRSL 55 (65)
Q Consensus 31 ~~~~~l~pGd~LyiP~~WwH~V~s~ 55 (65)
...+.++|||+|++-+.-||....-
T Consensus 210 ~v~~~~~aGDvl~f~~~~~H~S~~N 234 (277)
T TIGR02408 210 ISTFTGKAGSAVWFDCNTMHGSGSN 234 (277)
T ss_pred ceeeccCCceEEEEccccccCCCCC
Confidence 4567789999999999999987654
No 92
>TIGR01762 chlorin-enz chlorinating enzymes. This model represents a a group of highly homologous enzymes related to dioxygenases which chlorinate amino acid methyl groups. BarB1 and BarB2 are proposed to trichlorinate one of the methyl groups of a leucine residue in the biosynthesis of barbamide in the cyanobacterium Lyngbya majuscula. SyrB2 is proposed to chlorinate the methyl group of threonine in the biosynthesis of syringomycin in Pseudomonas syringae. CmaB is proposed to chlorinate the beta-methyl group of alloisoleucine in the process of ring closure in the biosynthesis of coronamic acid, a component of coronatine also in Pseudomonas syringae.
Probab=55.73 E-value=15 Score=25.60 Aligned_cols=27 Identities=7% Similarity=0.162 Sum_probs=22.9
Q ss_pred CceEEEEECCCCEEEeCCCCeEEEEec
Q psy12933 29 GHLIECTLNPGDMLYIPPKVWHHVRSL 55 (65)
Q Consensus 29 ~~~~~~~l~pGd~LyiP~~WwH~V~s~ 55 (65)
.....++++|||+++.=..-||....-
T Consensus 206 ~~~v~~~lkaGd~~~f~~~t~HgS~~N 232 (288)
T TIGR01762 206 ASAVPMQMKAGQFIIFWSTLMHASYPN 232 (288)
T ss_pred cceeeeeeCCceEEEECCCceecCCCC
Confidence 356899999999999999999986543
No 93
>COG3128 PiuC Uncharacterized iron-regulated protein [Function unknown]
Probab=54.86 E-value=25 Score=24.25 Aligned_cols=34 Identities=24% Similarity=0.517 Sum_probs=28.0
Q ss_pred eEEEEECCCCEEEeCCCCeEEEEecCC--eEEEEEe
Q psy12933 31 LIECTLNPGDMLYIPPKVWHHVRSLST--SISVSFW 64 (65)
Q Consensus 31 ~~~~~l~pGd~LyiP~~WwH~V~s~~~--sisvn~w 64 (65)
-.++.|-+||.+..|+.--|.|...+. -++.=||
T Consensus 143 ~h~VklPAGdLVlypStSlH~VtPVTRg~R~asffW 178 (229)
T COG3128 143 NHRVKLPAGDLVLYPSTSLHEVTPVTRGERFASFFW 178 (229)
T ss_pred ceEEeccCCCEEEcccccceeccccccCceEEEeee
Confidence 578899999999999999999999874 4554444
No 94
>PF01476 LysM: LysM domain; InterPro: IPR018392 This domain is about 40 residues long and is found in a variety of enzymes involved in bacterial cell wall degradation []. This domain may have a general peptidoglycan binding function.; GO: 0016998 cell wall macromolecule catabolic process; PDB: 2DJP_A 3ZQD_A 1Y7M_B 4A52_A 2L9Y_A 1E0G_A.
Probab=53.54 E-value=6.3 Score=19.13 Aligned_cols=10 Identities=60% Similarity=1.115 Sum_probs=9.0
Q ss_pred ECCCCEEEeC
Q psy12933 36 LNPGDMLYIP 45 (65)
Q Consensus 36 l~pGd~LyiP 45 (65)
|.+|+.|.||
T Consensus 35 l~~G~~l~iP 44 (44)
T PF01476_consen 35 LQPGQKLCIP 44 (44)
T ss_dssp GGTTEEEEEC
T ss_pred CCCCCEEEeC
Confidence 8899999988
No 95
>TIGR02272 gentisate_1_2 gentisate 1,2-dioxygenase. This family consists of gentisate 1,2-dioxygenases. This ring-opening enzyme acts in salicylate degradation that goes via gentisate rather than via catechol. It converts gentisate to maleylpyruvate. Some putative gentisate 1,2-dioxygenases are excluded by a relatively high trusted cutoff score because they are too closely related to known examples of 1-hydroxy-2-naphthoate dioxygenase. Therefore some homologs may be bona fide gentisate 1,2-dioxygenases even if they score below the given cutoffs.
Probab=52.78 E-value=36 Score=24.66 Aligned_cols=27 Identities=11% Similarity=0.165 Sum_probs=20.2
Q ss_pred EEEEECCCCEEEeCCCCeEEEEecCCe
Q psy12933 32 IECTLNPGDMLYIPPKVWHHVRSLSTS 58 (65)
Q Consensus 32 ~~~~l~pGd~LyiP~~WwH~V~s~~~s 58 (65)
..+..++||++.+|+.-+|+-.+.+..
T Consensus 288 ~~~~W~~gD~f~vPsW~~~~h~a~~da 314 (335)
T TIGR02272 288 AVFRFSPKDVFVVPSWHPVRFEASDDA 314 (335)
T ss_pred EEEEecCCCEEEECCCCcEecccCCCe
Confidence 355789999999999767766655543
No 96
>PRK09685 DNA-binding transcriptional activator FeaR; Provisional
Probab=52.73 E-value=33 Score=23.02 Aligned_cols=26 Identities=15% Similarity=0.098 Sum_probs=21.1
Q ss_pred eEEEEECCCCEEEeCCCCeEEEEecC
Q psy12933 31 LIECTLNPGDMLYIPPKVWHHVRSLS 56 (65)
Q Consensus 31 ~~~~~l~pGd~LyiP~~WwH~V~s~~ 56 (65)
.-+..++|||++++|++--|.....+
T Consensus 88 g~~~~l~~G~~~l~~~~~p~~~~~~~ 113 (302)
T PRK09685 88 DRQVQLAAGDITLIDASRPCSIYPQG 113 (302)
T ss_pred CeEEEEcCCCEEEEECCCCcEeecCC
Confidence 45678999999999999888776544
No 97
>cd02782 MopB_CT_1 The MopB_CT_1 CD includes a group of related uncharacterized bacterial molybdopterin-binding oxidoreductase-like domains with a putative N-terminal iron-sulfur [4Fe-4S] cluster binding site and molybdopterin cofactor binding site. This CD is of the conserved molybdopterin_binding C-terminal (MopB_CT) region present in many, but not all, MopB homologs.
Probab=52.50 E-value=8.5 Score=22.98 Aligned_cols=13 Identities=15% Similarity=0.220 Sum_probs=11.3
Q ss_pred CEEEeCCCCeEEE
Q psy12933 40 DMLYIPPKVWHHV 52 (65)
Q Consensus 40 d~LyiP~~WwH~V 52 (65)
+++++|.+|||..
T Consensus 75 g~v~~~~g~~~~~ 87 (129)
T cd02782 75 GVVSLPHGWGHDY 87 (129)
T ss_pred CeEEeecCcCCCc
Confidence 6899999999964
No 98
>KOG0162|consensus
Probab=52.39 E-value=11 Score=30.93 Aligned_cols=23 Identities=26% Similarity=0.527 Sum_probs=19.3
Q ss_pred CceEEEEECCCCEEEe----CCCCeEE
Q psy12933 29 GHLIECTLNPGDMLYI----PPKVWHH 51 (65)
Q Consensus 29 ~~~~~~~l~pGd~Lyi----P~~WwH~ 51 (65)
...-|..++.||+++| |+|||--
T Consensus 1063 q~~dEls~~~~diIei~~edpSGWw~g 1089 (1106)
T KOG0162|consen 1063 QDVDELSFKKGDIIEIMREDPSGWWLG 1089 (1106)
T ss_pred CCcccccccCCCEEEEeccCCCcchhh
Confidence 3566889999999998 9999964
No 99
>cd02794 MopB_CT_DmsA-EC The MopB_CT_DmsA-EC CD includes the DmsA enzyme of the dmsABC operon encoding the anaerobic dimethylsulfoxide reductase (DMSOR) of Escherichia coli and other related DMSOR-like enzymes. Unlike other DMSOR-like enzymes, this group has a predicted N-terminal iron-sulfur [4Fe-4S] cluster binding site. This CD is of the conserved molybdopterin_binding C-terminal (MopB_CT) region present in many, but not all, MopB homologs.
Probab=51.82 E-value=9.1 Score=22.71 Aligned_cols=13 Identities=15% Similarity=0.542 Sum_probs=11.2
Q ss_pred CEEEeCCCCeEEE
Q psy12933 40 DMLYIPPKVWHHV 52 (65)
Q Consensus 40 d~LyiP~~WwH~V 52 (65)
+++|+|.|||+..
T Consensus 72 g~v~~~~g~~~~~ 84 (121)
T cd02794 72 GVVALPQGAWYEP 84 (121)
T ss_pred CEEEecCccccCC
Confidence 6799999999864
No 100
>PF14604 SH3_9: Variant SH3 domain; PDB: 2CRE_A 2E5K_A 2CT3_A 2DE0_X 2D8H_A 2DA9_A 2X3X_E 2X3W_D 2KRN_A 2ED0_A ....
Probab=51.61 E-value=6.6 Score=20.24 Aligned_cols=19 Identities=21% Similarity=0.496 Sum_probs=13.0
Q ss_pred EEEECCCCEEEe----CCCCeEE
Q psy12933 33 ECTLNPGDMLYI----PPKVWHH 51 (65)
Q Consensus 33 ~~~l~pGd~Lyi----P~~WwH~ 51 (65)
+..+++||+|.+ ..+||.-
T Consensus 12 ELs~~~Gd~i~v~~~~~~~W~~g 34 (49)
T PF14604_consen 12 ELSFKKGDVITVLEKSDDGWWYG 34 (49)
T ss_dssp B-EB-TTEEEEEEEESSTSEEEE
T ss_pred EeeEcCCCEEEEEEeCCCCEEEE
Confidence 788889988885 6777764
No 101
>PRK15078 polysaccharide export protein Wza; Provisional
Probab=51.47 E-value=9.9 Score=27.62 Aligned_cols=26 Identities=31% Similarity=0.422 Sum_probs=18.3
Q ss_pred eEEEEECCCCEEEeCCCCeEEEEecC
Q psy12933 31 LIECTLNPGDMLYIPPKVWHHVRSLS 56 (65)
Q Consensus 31 ~~~~~l~pGd~LyiP~~WwH~V~s~~ 56 (65)
.....|+|||++|+|+.==..|.-++
T Consensus 236 ~~ni~L~~GDvI~Vp~~~~~~v~V~G 261 (379)
T PRK15078 236 SQNRLLYPGDILYVPRNDDLKVFVMG 261 (379)
T ss_pred ccCceeCCCCEEEECCCCCcEEEEee
Confidence 35677999999999985444454443
No 102
>PF04970 LRAT: Lecithin retinol acyltransferase; InterPro: IPR007053 This entry represents a conserved sequence region found in proteins from viruses, bacteria and eukaryotes. It contains a well-conserved NCEHF motif, though its function in these proteins is unknown.; PDB: 2KYT_A 4DOT_A 4FA0_A.
Probab=50.62 E-value=6.7 Score=23.64 Aligned_cols=23 Identities=26% Similarity=0.315 Sum_probs=14.0
Q ss_pred EEECCCCEEEeCC-CCeEEEEecC
Q psy12933 34 CTLNPGDMLYIPP-KVWHHVRSLS 56 (65)
Q Consensus 34 ~~l~pGd~LyiP~-~WwH~V~s~~ 56 (65)
..++|||+|+++. .++|..-..+
T Consensus 5 ~~~~~GD~I~~~r~~y~H~gIYvG 28 (125)
T PF04970_consen 5 KRLKPGDHIEVPRGLYEHWGIYVG 28 (125)
T ss_dssp -S--TT-EEEEEETTEEEEEEEEE
T ss_pred cCCCCCCEEEEecCCccEEEEEec
Confidence 4589999999775 5677776655
No 103
>KOG2757|consensus
Probab=50.19 E-value=22 Score=26.61 Aligned_cols=27 Identities=33% Similarity=0.410 Sum_probs=19.5
Q ss_pred eEEEEECCCCEEEeCCCCeEEEEecCC
Q psy12933 31 LIECTLNPGDMLYIPPKVWHHVRSLST 57 (65)
Q Consensus 31 ~~~~~l~pGd~LyiP~~WwH~V~s~~~ 57 (65)
.-...+++||++|||+.--=+..+.+.
T Consensus 371 ~~~~~v~rG~V~fI~a~~~i~~~~~sd 397 (411)
T KOG2757|consen 371 DSKILVNRGDVLFIPANHPIHLSSSSD 397 (411)
T ss_pred CCceeeccCcEEEEcCCCCceeeccCc
Confidence 445678999999999986555555443
No 104
>PF02080 TrkA_C: TrkA-C domain; InterPro: IPR006037 The regulator of K+ conductance (RCK) domain is found in many ligand-gated K+ channels, most often attached to the intracellular carboxy terminus. The domain is prevalent among prokaryotic K+ channels, and also found in eukaryotic, high-conductance Ca2+-activated K+ channels (BK channels) [, , ]. Largely involved in redox-linked regulation of potassium channels, the N-terminal part of the RCK domain is predicted to be an active dehydrogenase at least in some cases []. Some have a conserved sequence motif (G-x-G-x-x-G-x(n)-[DE]) for NAD+ binding [], but others do not, reflecting the diversity of ligands for RCK domains. The C-terminal part is less conserved, being absent in some channels, such as the kefC antiporter from Escherichia coli. It is predicted to bind unidentified ligands and to regulate sulphate, sodium and other transporters. The X-ray structure of several RCK domains has been solved [, , ]. It reveals an alpha-beta fold similar to dehydrogenase enzymes. The domain forms a homodimer, producing a cleft between two lobes. It has a composite structure, with an N-terminal (RCK-N), and a C-terminal (RCK-C) subdomain. The RCK-N subdomain forms a Rossmann fold with two alpha helices on one side of a six stranded parallel beta sheet and three alpha helices on the other side. The RCK-C subdomain is an all-beta-strand fold. It forms an extention of the dimer interface and further stabilises the RCK homodimer [, , ]. Ca2+ is a ligand that opens the channel in a concentration-dependent manner. Two Ca2+ ions are located at the base of a cleft between two RCK domains, coordinated by the carboxylate groups of two glutamate residues, and by an aspartate residue [, , ]. RCK domains occur in at least five different contexts: As a single domain on the C terminus of some K+ channels (for example, many prokaryotic K+ channels). As two tandem RCK domains on the C terminus of some transporters that form gating rings (for example, eukaryotic BK channels). The gating ring has an arrangement of eight identical RCK domains, one from each of the four pore-forming subunits and four from the intracellular solution. As two domains, one at the N terminus and another at the C terminus of transporter (for example, the prokaryotic trk system potassium uptake protein A). As a soluble protein (not part of a K+ channel) consisting of two tandem RCK domains. As a soluble protein consisting of a single RCK domain. This entry represents the C-terminal subdomain of RCK.; GO: 0008324 cation transmembrane transporter activity, 0006813 potassium ion transport; PDB: 2BKP_A 1VCT_A 2BKO_A 2BKN_A 3L4B_C 2FY8_D 2AEF_A 1LNQ_E 3RBX_C 3KXD_A ....
Probab=49.89 E-value=8.5 Score=20.38 Aligned_cols=12 Identities=33% Similarity=0.761 Sum_probs=8.7
Q ss_pred EEEECCCCEEEe
Q psy12933 33 ECTLNPGDMLYI 44 (65)
Q Consensus 33 ~~~l~pGd~Lyi 44 (65)
+.+|++||.|++
T Consensus 45 ~~~l~~gD~l~v 56 (71)
T PF02080_consen 45 DTVLQAGDILIV 56 (71)
T ss_dssp T-BE-TTEEEEE
T ss_pred CCEECCCCEEEE
Confidence 678999999986
No 105
>PF15340 COPR5: Cooperator of PRMT5 family
Probab=49.70 E-value=7 Score=25.38 Aligned_cols=31 Identities=32% Similarity=0.604 Sum_probs=23.9
Q ss_pred CCCCceEEEEECCCCEEEeCCCCeEEEEecCCe
Q psy12933 26 QTPGHLIECTLNPGDMLYIPPKVWHHVRSLSTS 58 (65)
Q Consensus 26 ~~~~~~~~~~l~pGd~LyiP~~WwH~V~s~~~s 58 (65)
....++..|.---|||+|=|+ ||+-..|.+.
T Consensus 106 sqE~kpWvccapqGDmIYDPS--WHHPpPLiPH 136 (152)
T PF15340_consen 106 SQEVKPWVCCAPQGDMIYDPS--WHHPPPLIPH 136 (152)
T ss_pred hhhcccceeecCCCCcccCCc--cCCCCCCccc
Confidence 445667888888999999997 8887666543
No 106
>COG3837 Uncharacterized conserved protein, contains double-stranded beta-helix domain [Function unknown]
Probab=49.56 E-value=38 Score=22.41 Aligned_cols=27 Identities=33% Similarity=0.309 Sum_probs=23.0
Q ss_pred eEEEEECCCCEEEeCCC--CeEEEEecCC
Q psy12933 31 LIECTLNPGDMLYIPPK--VWHHVRSLST 57 (65)
Q Consensus 31 ~~~~~l~pGd~LyiP~~--WwH~V~s~~~ 57 (65)
.-+..|+|||++-+|+| --|+-.+-+.
T Consensus 81 ~~e~~lrpGD~~gFpAG~~~aHhliN~s~ 109 (161)
T COG3837 81 GGETRLRPGDSAGFPAGVGNAHHLINRSD 109 (161)
T ss_pred CeeEEecCCceeeccCCCcceeEEeecCC
Confidence 45788999999999999 8898888764
No 107
>TIGR02899 spore_safA spore coat assembly protein SafA. in which one of which is found in most examples of endospore-forming bacteria. Lysin motifs are repeated in many proteins.
Probab=48.90 E-value=11 Score=17.67 Aligned_cols=12 Identities=33% Similarity=0.628 Sum_probs=10.2
Q ss_pred EECCCCEEEeCC
Q psy12933 35 TLNPGDMLYIPP 46 (65)
Q Consensus 35 ~l~pGd~LyiP~ 46 (65)
.+.+|+.|.+|.
T Consensus 33 ~~~~g~~l~ip~ 44 (44)
T TIGR02899 33 LIYPGMKIKIPS 44 (44)
T ss_pred CcCCCCEEecCC
Confidence 588999999984
No 108
>PF10014 2OG-Fe_Oxy_2: 2OG-Fe dioxygenase; InterPro: IPR018724 Members of this family of hypothetical bacterial proteins have no known function. Some are described as putative biofilm formation or putative agglutination proteins. ; PDB: 3PL0_B.
Probab=48.16 E-value=15 Score=24.23 Aligned_cols=30 Identities=30% Similarity=0.551 Sum_probs=21.2
Q ss_pred CCCceEEEEECCCCEEEeC-CCCeEEEEecC
Q psy12933 27 TPGHLIECTLNPGDMLYIP-PKVWHHVRSLS 56 (65)
Q Consensus 27 ~~~~~~~~~l~pGd~LyiP-~~WwH~V~s~~ 56 (65)
+........++|||+|++- ..-||.|....
T Consensus 148 ~~~~~~~~l~~p~d~l~~~D~~~~H~vtpI~ 178 (195)
T PF10014_consen 148 KEILFFFTLLEPGDTLLVDDRRVWHYVTPIR 178 (195)
T ss_dssp SSEEEEE---STTEEEEEETTTEEEEE--EE
T ss_pred CCcceEEEecCCCCEEEEeCCcceECCCcee
Confidence 4556778899999999999 99999998765
No 109
>COG3257 GlxB Uncharacterized protein, possibly involved in glyoxylate utilization [General function prediction only]
Probab=46.68 E-value=42 Score=23.68 Aligned_cols=26 Identities=23% Similarity=0.345 Sum_probs=22.4
Q ss_pred eEEEEECCCCEEEeCCCCeEEEEecC
Q psy12933 31 LIECTLNPGDMLYIPPKVWHHVRSLS 56 (65)
Q Consensus 31 ~~~~~l~pGd~LyiP~~WwH~V~s~~ 56 (65)
.-+..|.+|+-+|+|+|-=|.+++..
T Consensus 100 G~th~l~eggyaylPpgs~~~~~N~~ 125 (264)
T COG3257 100 GKTHALREGGYAYLPPGSGWTLRNAQ 125 (264)
T ss_pred CeEEEeccCCeEEeCCCCcceEeecc
Confidence 44567899999999999999999765
No 110
>PRK15175 Vi polysaccharide export protein VexA; Provisional
Probab=46.35 E-value=14 Score=26.91 Aligned_cols=16 Identities=19% Similarity=0.445 Sum_probs=14.1
Q ss_pred EEEEECCCCEEEeCCC
Q psy12933 32 IECTLNPGDMLYIPPK 47 (65)
Q Consensus 32 ~~~~l~pGd~LyiP~~ 47 (65)
..+.|+|||++++|..
T Consensus 222 ~ni~L~~GDvI~V~~~ 237 (355)
T PRK15175 222 LDIALQPDDRITLRQV 237 (355)
T ss_pred CCcEeCCCCEEEEccC
Confidence 4789999999999974
No 111
>KOG2757|consensus
Probab=45.07 E-value=21 Score=26.76 Aligned_cols=24 Identities=29% Similarity=0.396 Sum_probs=19.8
Q ss_pred EEEECCCCEEEeCCCCeEEEEecC
Q psy12933 33 ECTLNPGDMLYIPPKVWHHVRSLS 56 (65)
Q Consensus 33 ~~~l~pGd~LyiP~~WwH~V~s~~ 56 (65)
.+.|+||+++|+-+.--|.--+-+
T Consensus 249 ~~~L~PGEA~yL~AnepHAYlsGd 272 (411)
T KOG2757|consen 249 YVRLNPGEAIYLEANEPHAYLSGD 272 (411)
T ss_pred heecCCCceeeecCCCcceeecCc
Confidence 578999999999999999755433
No 112
>PRK13498 chemoreceptor glutamine deamidase CheD; Provisional
Probab=45.04 E-value=76 Score=20.71 Aligned_cols=34 Identities=21% Similarity=0.550 Sum_probs=20.8
Q ss_pred ceEEEEECCCCEEEeCCCCeEEEEecCCeEEEEEe
Q psy12933 30 HLIECTLNPGDMLYIPPKVWHHVRSLSTSISVSFW 64 (65)
Q Consensus 30 ~~~~~~l~pGd~LyiP~~WwH~V~s~~~sisvn~w 64 (65)
...++.|.|||+-+ ...==-=...|+.|++|-+|
T Consensus 6 ~~~~v~v~~Ge~~v-~~~p~~i~T~LGSCVav~l~ 39 (167)
T PRK13498 6 GTKEIVLQPGEVVF-ATRPTRLRTLLGSCVAITFW 39 (167)
T ss_pred CceEEEEeCCcEEE-ECCCCEEEEECcccEEEEEE
Confidence 33588899999544 33311112226789999887
No 113
>cd02781 MopB_CT_Acetylene-hydratase The MopB_CT_Acetylene-hydratase CD contains acetylene hydratase (Ahy) and other related proteins. The acetylene hydratase of Pelobacter acetylenicus is a tungsten iron-sulfur protein involved in the fermentation of acetylene to ethanol and acetate. This CD is of the conserved molybdopterin_binding C-terminal (MopB_CT) region present in many, but not all, MopB homologs.
Probab=44.58 E-value=15 Score=21.85 Aligned_cols=13 Identities=8% Similarity=0.120 Sum_probs=11.4
Q ss_pred CEEEeCCCCeEEE
Q psy12933 40 DMLYIPPKVWHHV 52 (65)
Q Consensus 40 d~LyiP~~WwH~V 52 (65)
+++++|.|||+..
T Consensus 75 g~v~~~~g~~~~~ 87 (130)
T cd02781 75 GVVRAEHGWWYPE 87 (130)
T ss_pred CEEEEeccccccc
Confidence 7799999999874
No 114
>PF01427 Peptidase_M15: D-ala-D-ala dipeptidase This is family M15 in the peptidase classification. ; InterPro: IPR000755 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. This group of metallopeptidases belong to MEROPS peptidase family M15 (clan MD), subfamily M15D (vanX D-Ala-D-Ala dipeptidase). The D-alanyl-D-alanine dipeptidase enzyme from Enterococcus faecalis is also known as the vancomycin resistance protein VanX, and hydrolyses D-ala-D-ala. It has a 250-fold differential in catalytic efficiency for hydrolysis of D-ala-D-ala versus D-ala-D-lactate. The latter therefore remains intact for subsequent incorporation into peptidoglycan precursors that terminate in the dipeptide D-ala-D-lactate rather than the dipeptide D-ala-D-ala, thereby preventing vancomycin from binding. The enzyme requires a metal cofactor, and is induced by vancomycin through regulation by VanS and VanR.; GO: 0008237 metallopeptidase activity, 0016805 dipeptidase activity, 0006508 proteolysis, 0005618 cell wall; PDB: 1R44_B.
Probab=43.73 E-value=11 Score=25.17 Aligned_cols=16 Identities=19% Similarity=0.447 Sum_probs=10.4
Q ss_pred CEEEeCCCCeEEEEec
Q psy12933 40 DMLYIPPKVWHHVRSL 55 (65)
Q Consensus 40 d~LyiP~~WwH~V~s~ 55 (65)
-..=+|..|||.--..
T Consensus 170 GF~~~p~EWWHf~~~d 185 (198)
T PF01427_consen 170 GFVNYPTEWWHFSYGD 185 (198)
T ss_dssp TEEE-TT-TTEEEESS
T ss_pred CCccCCchhcCcCCCC
Confidence 3667999999987654
No 115
>PF00877 NLPC_P60: NlpC/P60 family; InterPro: IPR000064 The Escherichia coli NLPC/Listeria P60 domain occurs at the C terminus of a number of different bacterial and viral proteins. The viral proteins are either described as tail assembly proteins or Gp19. In bacteria, the proteins are variously described as being putative tail component of prophage, invasin, invasion associated protein, putative lipoprotein, cell wall hydrolase, or putative endopeptidase. The E. coli NLPC/Listeria P60 domain is contained within the boundaries of the cysteine peptidase domain that defines the MEROPS peptidase family C40 (clan C-). A type example being dipeptidyl-peptidase VI from Bacillus sphaericus and gamma-glutamyl-diamino acid-endopeptidase precursor from Lactococcus lactis 3.4.19.11 from EC. This group also contains proteins classified as non-peptidase homologues in that they either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity of peptidases in the C40 family. ; PDB: 3PVQ_B 3GT2_A 3NPF_B 2K1G_A 3I86_A 3S0Q_A 2XIV_A 3PBC_A 3NE0_A 3M1U_B ....
Probab=43.23 E-value=13 Score=21.39 Aligned_cols=16 Identities=25% Similarity=0.617 Sum_probs=12.1
Q ss_pred eEEEEECCCCEEEeCC
Q psy12933 31 LIECTLNPGDMLYIPP 46 (65)
Q Consensus 31 ~~~~~l~pGd~LyiP~ 46 (65)
.-...++|||+||+..
T Consensus 47 ~~~~~~~pGDlif~~~ 62 (105)
T PF00877_consen 47 VPISELQPGDLIFFKG 62 (105)
T ss_dssp EEGGG-TTTEEEEEEG
T ss_pred cchhcCCcccEEEEeC
Confidence 4456789999999887
No 116
>PF02041 Auxin_BP: Auxin binding protein; InterPro: IPR000526 Auxin binding protein is located in the lumen of the endoplasmic reticulum (ER). The primary structure contains an N-terminal hydrophobic leader sequence of 30-40 amino acids, which could represent a signal for translocation of the protein to the ER [, ]. The mature protein comprises around 165 residues, and contains a number of potential N-glycosylation sites. In vitro transport studies have demonstrated co-translational glycosylation []. Retention within the lumen of the ER correlates with an additional signal located at the C terminus, represented by the sequence Lys-Asp-Glu-Leu, known to be responsible for preventing secretion of proteins from the lumen of the ER in eukaryotic cells [, ].; GO: 0004872 receptor activity, 0005788 endoplasmic reticulum lumen; PDB: 1LR5_D 1LRH_D.
Probab=42.76 E-value=40 Score=22.38 Aligned_cols=28 Identities=25% Similarity=0.322 Sum_probs=18.1
Q ss_pred CceEEEEECCCCEEEeCCCCeEEEEecC
Q psy12933 29 GHLIECTLNPGDMLYIPPKVWHHVRSLS 56 (65)
Q Consensus 29 ~~~~~~~l~pGd~LyiP~~WwH~V~s~~ 56 (65)
..+.+....|++.++||.+=-||+-+.+
T Consensus 88 G~pqef~~~pnSTf~IPvn~~HQv~NT~ 115 (167)
T PF02041_consen 88 GKPQEFPIFPNSTFHIPVNDAHQVWNTN 115 (167)
T ss_dssp -S-EEEEE-TTEEEEE-TT--EEEE---
T ss_pred CCceEEEecCCCeEEeCCCCcceeecCC
Confidence 4678999999999999999999999987
No 117
>TIGR01015 hmgA homogentisate 1,2-dioxygenase. Missing in human disease alkaptonuria.
Probab=42.13 E-value=33 Score=25.87 Aligned_cols=23 Identities=26% Similarity=0.419 Sum_probs=19.8
Q ss_pred EEECCCCEEEeCCCCeEEEEecC
Q psy12933 34 CTLNPGDMLYIPPKVWHHVRSLS 56 (65)
Q Consensus 34 ~~l~pGd~LyiP~~WwH~V~s~~ 56 (65)
..++|||+++||.|-=++|+-.+
T Consensus 168 L~v~pgei~VIPRG~~frv~l~g 190 (429)
T TIGR01015 168 LLVEPNEICVIPRGVRFRVTVLE 190 (429)
T ss_pred eEecCCCEEEecCccEEEEeeCC
Confidence 37899999999999999998544
No 118
>PRK05341 homogentisate 1,2-dioxygenase; Provisional
Probab=42.06 E-value=34 Score=25.92 Aligned_cols=23 Identities=26% Similarity=0.410 Sum_probs=19.4
Q ss_pred EEECCCCEEEeCCCCeEEEEecC
Q psy12933 34 CTLNPGDMLYIPPKVWHHVRSLS 56 (65)
Q Consensus 34 ~~l~pGd~LyiP~~WwH~V~s~~ 56 (65)
..++|||+++||.|-=++|+-.+
T Consensus 174 L~v~pgei~VIPRG~~frv~l~~ 196 (438)
T PRK05341 174 LDVEPGEIAVIPRGVKFRVELPD 196 (438)
T ss_pred eEecCCCEEEEcCccEEEEecCC
Confidence 46789999999999999998433
No 119
>PRK00924 5-keto-4-deoxyuronate isomerase; Provisional
Probab=41.88 E-value=24 Score=25.05 Aligned_cols=18 Identities=22% Similarity=0.208 Sum_probs=14.2
Q ss_pred EEEEECCCCEEEeCCCCe
Q psy12933 32 IECTLNPGDMLYIPPKVW 49 (65)
Q Consensus 32 ~~~~l~pGd~LyiP~~Ww 49 (65)
.+..|++.|+||+|.|==
T Consensus 92 ~~~~l~~~d~LYVp~G~~ 109 (276)
T PRK00924 92 ETYELGHRDALYVGKGAK 109 (276)
T ss_pred EEEecCCCcEEEECCCCc
Confidence 455688999999999943
No 120
>TIGR02451 anti_sig_ChrR anti-sigma factor, putative, ChrR family. The member of this family from Rhodobacter sphaeroides has been shown both to form a complex with sigma(E) and to negatively regulate tetrapyrrole biosynthesis. This protein likely contains (at least) two distinct functional domains; several smaller homologs (excluded by the model) show homology only to the C-terminal, including a motif PxHxHxGxE.
Probab=41.64 E-value=33 Score=22.93 Aligned_cols=28 Identities=21% Similarity=0.288 Sum_probs=23.0
Q ss_pred EECCCCEEEeCCCCeEEEEecCC--eEEEE
Q psy12933 35 TLNPGDMLYIPPKVWHHVRSLST--SISVS 62 (65)
Q Consensus 35 ~l~pGd~LyiP~~WwH~V~s~~~--sisvn 62 (65)
...+||.+.+|++--|+..+.+. ++-+.
T Consensus 164 ~y~~Gd~i~~p~~~~H~p~a~~~~~Cicl~ 193 (215)
T TIGR02451 164 VYGVGDFEEADGSVQHQPRTVSGGDCLCLA 193 (215)
T ss_pred ccCCCeEEECCCCCCcCcccCCCCCeEEEE
Confidence 45799999999999999999964 65543
No 121
>TIGR02219 phage_NlpC_fam putative phage cell wall peptidase, NlpC/P60 family. Members of this family show sequence similarity to members of the NlpC/P60 family described by Pfam model pfam00877 and by Anantharaman and Aravind (PubMed:12620121). The NlpC/P60 family includes a number of characterized bacterial cell wall hydrolases. Members of this related family are all found in prophage regions of bacterial genomes.
Probab=41.60 E-value=15 Score=22.58 Aligned_cols=11 Identities=36% Similarity=0.715 Sum_probs=9.5
Q ss_pred EECCCCEEEeC
Q psy12933 35 TLNPGDMLYIP 45 (65)
Q Consensus 35 ~l~pGd~LyiP 45 (65)
.++|||++|+-
T Consensus 76 ~~qpGDlvff~ 86 (134)
T TIGR02219 76 AAQPGDVLVFR 86 (134)
T ss_pred cCCCCCEEEEe
Confidence 58999999983
No 122
>PF11543 UN_NPL4: Nuclear pore localisation protein NPL4; InterPro: IPR024682 Npl4, along with Ufd1, forms the heterodimer adaptor complex UN, which is involved in the recruitment of p97, an AAA ATPase, for tasks involving the ubiquitin pathway. Npl4 has a N-terminal ubiquitin-like domain which has within its structure a beta-grasp fold with a helical insert []. This entry represents the ubiquitin-like domain.; PDB: 2PJH_A 1WF9_A.
Probab=41.55 E-value=15 Score=21.12 Aligned_cols=19 Identities=37% Similarity=0.564 Sum_probs=8.9
Q ss_pred CCCceEEEEECCCCEEEeC
Q psy12933 27 TPGHLIECTLNPGDMLYIP 45 (65)
Q Consensus 27 ~~~~~~~~~l~pGd~LyiP 45 (65)
.....-+.-|+-||+||+=
T Consensus 60 ~~~tl~~lglkHGdmlyL~ 78 (80)
T PF11543_consen 60 DSKTLSSLGLKHGDMLYLK 78 (80)
T ss_dssp TT-CCCCT---TT-EEE--
T ss_pred CcCCHHHcCCCCccEEEEe
Confidence 3455666789999999974
No 123
>TIGR03805 beta_helix_1 parallel beta-helix repeat-containing protein. Members of this protein family contain a tandem pair of beta-helix repeats (see TIGR03804). Each repeat is expected to consist of three beta strands that form a single turn as they form a right-handed helix of stacked beta-structure. Member proteinsa occur regularly in two-gene pairs along with another uncharacterized protein family; both protein families exhibit either lipoprotein or regular signal peptides, suggesting transit through the plasma membrane, and the two may be fused. The function of the pair is unknown.
Probab=41.29 E-value=38 Score=23.88 Aligned_cols=15 Identities=33% Similarity=0.811 Sum_probs=13.8
Q ss_pred ECCCCEEEeCCCCeE
Q psy12933 36 LNPGDMLYIPPKVWH 50 (65)
Q Consensus 36 l~pGd~LyiP~~WwH 50 (65)
.+|||+|.+|+|-++
T Consensus 8 A~~GDtI~l~~G~Y~ 22 (314)
T TIGR03805 8 AQPGDTIVLPEGVFQ 22 (314)
T ss_pred CCCCCEEEECCCEEE
Confidence 479999999999998
No 124
>PF04209 HgmA: homogentisate 1,2-dioxygenase; InterPro: IPR005708 Alkaptonuria (AKU), a rare hereditary disorder, was the first disease to be interpreted as an inborn error of metabolism. The deficiency causes homogentisic aciduria, ochronosis, and arthritis. AKU patients are deficient for homogentisate 1,2 dioxygenase (1.13.11.5 from EC), the enzyme that mediates the conversion of homogentisate to maleylacetoacetate; a step in the catabolism of both tyrosine and phenylalanine. Homogentisate + O(2) = 4-maleylacetoacetate. ; GO: 0004411 homogentisate 1,2-dioxygenase activity, 0006559 L-phenylalanine catabolic process, 0006570 tyrosine metabolic process, 0055114 oxidation-reduction process; PDB: 1EY2_A 1EYB_A.
Probab=41.15 E-value=32 Score=25.82 Aligned_cols=24 Identities=25% Similarity=0.396 Sum_probs=16.1
Q ss_pred EEECCCCEEEeCCCCeEEEEecCC
Q psy12933 34 CTLNPGDMLYIPPKVWHHVRSLST 57 (65)
Q Consensus 34 ~~l~pGd~LyiP~~WwH~V~s~~~ 57 (65)
..++|||+++||.|-=++|+-.++
T Consensus 166 L~v~pGd~~VIPRG~~~rv~l~~p 189 (424)
T PF04209_consen 166 LDVRPGDYVVIPRGTRFRVELPGP 189 (424)
T ss_dssp EEE-TTEEEEE-TT--EEEE-SSS
T ss_pred EEEcCCeEEEECCeeEEEEEeCCC
Confidence 458999999999999999987654
No 125
>cd02792 MopB_CT_Formate-Dh-Na-like Formate dehydrogenase N, alpha subunit (Formate-Dh-Na) is a major component of nitrate respiration in bacteria such as in the E. coli formate dehydrogenase N (Fdh-N). Fdh-N is a membrane protein that is a complex of three different subunits and is the major electron donor to the nitrate respiratory chain. Also included in this CD is the Desulfovibrio gigas tungsten formate dehydrogenase, DgW-FDH. In contrast to Fdh-N, which is a functional heterotrimer, DgW-FDH is a heterodimer. The DgW-FDH complex is composed of a large subunit carrying the W active site and one [4Fe-4S] center, and a small subunit that harbors a series of three [4Fe-4S] clusters as well as a putative vacant binding site for a fourth cluster. The smaller subunit is not included in this alignment. This CD (MopB_CT_Formate-Dh-Na-like) is of the conserved molybdopterin_binding C-terminal (MopB_CT) region present in many, but not all, MopB homologs.
Probab=40.76 E-value=23 Score=20.67 Aligned_cols=12 Identities=17% Similarity=-0.108 Sum_probs=10.7
Q ss_pred CEEEeCCCCeEE
Q psy12933 40 DMLYIPPKVWHH 51 (65)
Q Consensus 40 d~LyiP~~WwH~ 51 (65)
+++++|.+||+.
T Consensus 77 g~v~~~~g~~~~ 88 (122)
T cd02792 77 HEVGIPYHWGGM 88 (122)
T ss_pred CEEEEecccCcc
Confidence 689999999985
No 126
>PF07653 SH3_2: Variant SH3 domain; InterPro: IPR011511 SH3 (src Homology-3) domains are small protein modules containing approximately 50 amino acid residues [, ]. They are found in a great variety of intracellular or membrane-associated proteins [, , ] for example, in a variety of proteins with enzymatic activity, in adaptor proteins that lack catalytic sequences and in cytoskeletal proteins, such as fodrin and yeast actin binding protein ABP-1. The SH3 domain has a characteristic fold which consists of five or six beta-strands arranged as two tightly packed anti-parallel beta sheets. The linker regions may contain short helices []. The surface of the SH3-domain bears a flat, hydrophobic ligand-binding pocket which consists of three shallow grooves defined by conservative aromatic residues in which the ligand adopts an extended left-handed helical arrangement. The ligand binds with low affinity but this may be enhanced by multiple interactions. The region bound by the SH3 domain is in all cases proline-rich and contains PXXP as a core-conserved binding motif. The function of the SH3 domain is not well understood but they may mediate many diverse processes such as increasing local concentration of proteins, altering their subcellular location and mediating the assembly of large multiprotein complexes []. This entry represents a variant of the SH3 domain.; PDB: 1I1J_B 1K0X_A 1HJD_A 2KEA_A 1KJW_A 1JXM_A 1JXO_B 2EBP_A 2DL3_A 2EYX_A ....
Probab=40.55 E-value=8.9 Score=19.86 Aligned_cols=21 Identities=19% Similarity=0.351 Sum_probs=12.5
Q ss_pred EEEEECCCCEEEe-----CCCCeEEE
Q psy12933 32 IECTLNPGDMLYI-----PPKVWHHV 52 (65)
Q Consensus 32 ~~~~l~pGd~Lyi-----P~~WwH~V 52 (65)
-+..++.||++.+ +.+||--.
T Consensus 14 ~~Ls~~~Gd~i~v~~~~~~~~ww~~~ 39 (55)
T PF07653_consen 14 DELSFKKGDVIEVLGEKDDDGWWLGE 39 (55)
T ss_dssp TB-EB-TTEEEEEEEEECSTSEEEEE
T ss_pred CceEEecCCEEEEEEeecCCCEEEEE
Confidence 3477888887765 57776543
No 127
>TIGR03505 FimV_core FimV N-terminal domain. This region is found at, or about 200 amino acids from, the N-terminus of FimV from Pseudomonas aeruginosa, TspA of Neisseria meningitidis, and related proteins. Disruption of FimV blocks twitching motility from type IV pili; Semmler, et al. suggest a role for this family in peptidoglycan layer remodelling required by type IV fimbrial systems. Most but not all members of this protein family have a C-terminal region recognized by TIGR03504. In between is a highly variable, often repeat-filled region rich in the negatively charged amino acids Asp and Glu.
Probab=39.84 E-value=19 Score=20.52 Aligned_cols=13 Identities=38% Similarity=0.723 Sum_probs=11.3
Q ss_pred EECCCCEEEeCCC
Q psy12933 35 TLNPGDMLYIPPK 47 (65)
Q Consensus 35 ~l~pGd~LyiP~~ 47 (65)
-|++|..|.||..
T Consensus 40 ~L~~G~~L~iP~~ 52 (74)
T TIGR03505 40 RLKVGQILRIPSE 52 (74)
T ss_pred hcCCCCEEeCCCH
Confidence 5799999999974
No 128
>PF11699 CENP-C_C: Mif2/CENP-C like; PDB: 2VPV_B.
Probab=39.67 E-value=78 Score=18.42 Aligned_cols=27 Identities=7% Similarity=0.272 Sum_probs=18.6
Q ss_pred eEEEEECCCCEEEeCCCCeEEEEecCC
Q psy12933 31 LIECTLNPGDMLYIPPKVWHHVRSLST 57 (65)
Q Consensus 31 ~~~~~l~pGd~LyiP~~WwH~V~s~~~ 57 (65)
.-+.++.+|++.+||+|=--..+++..
T Consensus 50 ~~~f~v~~G~~F~VP~gN~Y~i~N~~~ 76 (85)
T PF11699_consen 50 ETSFVVTKGGSFQVPRGNYYSIKNIGN 76 (85)
T ss_dssp TEEEEEETT-EEEE-TT-EEEEEE-SS
T ss_pred CcEEEEeCCCEEEECCCCEEEEEECCC
Confidence 346678899999999998888888764
No 129
>PF11142 DUF2917: Protein of unknown function (DUF2917); InterPro: IPR021317 This bacterial family of proteins appears to be restricted to Proteobacteria.
Probab=39.62 E-value=32 Score=18.82 Aligned_cols=18 Identities=22% Similarity=0.187 Sum_probs=14.7
Q ss_pred ceEEEEECCCCEEEeCCC
Q psy12933 30 HLIECTLNPGDMLYIPPK 47 (65)
Q Consensus 30 ~~~~~~l~pGd~LyiP~~ 47 (65)
..-+..|+|||.+-++++
T Consensus 34 ~~~D~~L~~G~~l~l~~g 51 (63)
T PF11142_consen 34 DPDDYWLQAGDSLRLRRG 51 (63)
T ss_pred CCCCEEECCCCEEEeCCC
Confidence 456778999999988887
No 130
>PLN02658 homogentisate 1,2-dioxygenase
Probab=39.61 E-value=39 Score=25.57 Aligned_cols=23 Identities=22% Similarity=0.403 Sum_probs=19.2
Q ss_pred EEECCCCEEEeCCCCeEEEEecC
Q psy12933 34 CTLNPGDMLYIPPKVWHHVRSLS 56 (65)
Q Consensus 34 ~~l~pGd~LyiP~~WwH~V~s~~ 56 (65)
..++|||+++||.|-=++|+-.+
T Consensus 167 L~v~pgei~VIPRG~~frv~l~~ 189 (435)
T PLN02658 167 LQVSPGEIVVIPRGFRFAVDLPD 189 (435)
T ss_pred eEecCCCEEEecCccEEEEecCC
Confidence 45789999999999999998433
No 131
>COG1935 Uncharacterized conserved protein [Function unknown]
Probab=39.60 E-value=18 Score=22.90 Aligned_cols=12 Identities=25% Similarity=0.800 Sum_probs=9.8
Q ss_pred EECCCCEEEeCC
Q psy12933 35 TLNPGDMLYIPP 46 (65)
Q Consensus 35 ~l~pGd~LyiP~ 46 (65)
-|+|||++|+-+
T Consensus 38 rl~~GDlVFlT~ 49 (122)
T COG1935 38 RLHEGDLVFLTS 49 (122)
T ss_pred cCCCCCEEEEeh
Confidence 478999999754
No 132
>smart00306 HintN Hint (Hedgehog/Intein) domain N-terminal region. Hedgehog/Intein domain, N-terminal region. Domain has been split to accommodate large insertions of endonucleases.
Probab=38.71 E-value=21 Score=19.70 Aligned_cols=10 Identities=50% Similarity=1.198 Sum_probs=5.4
Q ss_pred ECCCCEEEeC
Q psy12933 36 LNPGDMLYIP 45 (65)
Q Consensus 36 l~pGd~LyiP 45 (65)
|++||.|++|
T Consensus 90 l~~gd~v~~~ 99 (100)
T smart00306 90 LKPGDYVLVP 99 (100)
T ss_pred CCCCCEEEec
Confidence 4455555554
No 133
>PF14578 GTP_EFTU_D4: Elongation factor Tu domain 4; PDB: 1G7R_A 1G7S_A 1G7T_A 1XE1_A.
Probab=38.65 E-value=18 Score=21.11 Aligned_cols=10 Identities=40% Similarity=1.059 Sum_probs=6.5
Q ss_pred EECCCCEEEe
Q psy12933 35 TLNPGDMLYI 44 (65)
Q Consensus 35 ~l~pGd~Lyi 44 (65)
.+++||+||+
T Consensus 72 ~i~eGDiLyV 81 (81)
T PF14578_consen 72 QIKEGDILYV 81 (81)
T ss_dssp TB-TT-EEEE
T ss_pred cCCCCCEEeC
Confidence 5789999986
No 134
>cd02785 MopB_CT_4 The MopB_CT_4 CD includes a group of related uncharacterized bacterial and archaeal molybdopterin-binding oxidoreductase-like domains with a putative N-terminal iron-sulfur [4Fe-4S] cluster binding site and molybdopterin cofactor binding site. This CD is of the conserved molybdopterin_binding C-terminal (MopB_CT) region present in many, but not all, MopB homologs.
Probab=38.16 E-value=20 Score=21.28 Aligned_cols=12 Identities=8% Similarity=0.307 Sum_probs=10.5
Q ss_pred CEEEeCCCCeEE
Q psy12933 40 DMLYIPPKVWHH 51 (65)
Q Consensus 40 d~LyiP~~WwH~ 51 (65)
+++++|.|||..
T Consensus 74 g~v~~~~g~~~~ 85 (124)
T cd02785 74 GVVTAEQGWWSR 85 (124)
T ss_pred CEEEeecccCcc
Confidence 679999999975
No 135
>cd02793 MopB_CT_DMSOR-BSOR-TMAOR The MopB_DMSOR-BSOR-TMAOR CD contains dimethylsulfoxide reductase (DMSOR), biotin sulfoxide reductase (BSOR), trimethylamine N-oxide reductase (TMAOR) and other related proteins. DMSOR always catalyzes the reduction of DMSO to dimethylsulfide, but its cellular location and oligomerization state are organism-dependent. For example, in Rhodobacter sphaeriodes and Rhodobacter capsulatus, it is an 82-kDa monomeric soluble protein found in the periplasmic space; in E. coli, it is membrane-bound and exists as a heterotrimer. BSOR catalyzes the reduction of biotin sulfixode to biotin, and is unique among Mo enzymes because no additional auxiliary proteins or cofactors are required. TMAOR is similar to DMSOR, but its only natural substrate is TMAO.This CD is of the conserved molybdopterin_binding C-terminal (MopB_CT) region present in many, but not all, MopB homologs.
Probab=36.70 E-value=24 Score=21.23 Aligned_cols=12 Identities=17% Similarity=0.659 Sum_probs=10.3
Q ss_pred CEEEeCCCCeEE
Q psy12933 40 DMLYIPPKVWHH 51 (65)
Q Consensus 40 d~LyiP~~WwH~ 51 (65)
+++++|.|||..
T Consensus 75 g~v~~~~g~w~~ 86 (129)
T cd02793 75 GVVQLPTGAWYD 86 (129)
T ss_pred CEEEEccccCcC
Confidence 578999999974
No 136
>KOG3905|consensus
Probab=36.66 E-value=18 Score=27.13 Aligned_cols=11 Identities=27% Similarity=0.531 Sum_probs=9.7
Q ss_pred CCCEEEeCCCC
Q psy12933 38 PGDMLYIPPKV 48 (65)
Q Consensus 38 pGd~LyiP~~W 48 (65)
.-|.+|||+||
T Consensus 302 EkdaVfIPAGW 312 (473)
T KOG3905|consen 302 EKDAVFIPAGW 312 (473)
T ss_pred ecceeEeccCC
Confidence 46899999998
No 137
>COG2164 Uncharacterized conserved protein [Function unknown]
Probab=36.19 E-value=17 Score=22.86 Aligned_cols=16 Identities=38% Similarity=0.681 Sum_probs=13.3
Q ss_pred EEECCCCEEEeCCCCe
Q psy12933 34 CTLNPGDMLYIPPKVW 49 (65)
Q Consensus 34 ~~l~pGd~LyiP~~Ww 49 (65)
-++++||+-|-|+|.-
T Consensus 63 dvve~GDv~YWpPGkA 78 (126)
T COG2164 63 DVVEPGDVSYWPPGKA 78 (126)
T ss_pred cccCcccccccCCCcE
Confidence 4679999999999853
No 138
>TIGR00568 alkb DNA alkylation damage repair protein AlkB. Proteins in this family have an as of yet undetermined function in the repair of alkylation damage to DNA. Alignment and family designation based on phylogenomic analysis of Jonathan A. Eisen (PhD Thesis, Stanford University, 1999).
Probab=35.61 E-value=58 Score=21.14 Aligned_cols=20 Identities=15% Similarity=0.187 Sum_probs=15.9
Q ss_pred CCCceEEEEECCCCEEEeCC
Q psy12933 27 TPGHLIECTLNPGDMLYIPP 46 (65)
Q Consensus 27 ~~~~~~~~~l~pGd~LyiP~ 46 (65)
+..+...+.|++||+|++-.
T Consensus 141 ~~~~~~~l~L~sGsllvM~G 160 (169)
T TIGR00568 141 RNDPPKRLRLHSGDVVIMGG 160 (169)
T ss_pred CCCceEEEEeCCCCEEEECC
Confidence 34557899999999999853
No 139
>PF05118 Asp_Arg_Hydrox: Aspartyl/Asparaginyl beta-hydroxylase; InterPro: IPR007803 The alpha-ketoglutarate-dependent dioxygenase aspartyl (asparaginyl) beta-hydroxylase (1.14.11.16 from EC) specifically hydroxylates one aspartic or asparagine residue in certain epidermal growth factor-like domains of a number of proteins. Its action may be due to histidine-675, which, when mutated to an alanine residue, causes the loss of enzymatic activity in the protein []. An invertebrate alpha-ketoglutarate-dependent aspartyl/asparaginyl beta-hydroxylase, which posttranslationally hydroxylates specific aspartyl or asparaginyl residues within epidermal growth factor-like modules [], activity was found to be similar to that of the purified mammalian aspartyl/asparaginyl beta-hydroxylase with respect to cofactor requirements, stereochemistry and substrate sequence specificity []. This enzyme requires Fe2+ as a cofactor. Some vitamin K-dependent coagulation factors, as well as synthetic peptides based on the structure of the first epidermal growth factor domain of human coagulation factor IX or X, can act as acceptors.; GO: 0018193 peptidyl-amino acid modification, 0030176 integral to endoplasmic reticulum membrane; PDB: 3RCQ_A 1E5S_A 1E5R_B.
Probab=35.04 E-value=1.1e+02 Score=19.31 Aligned_cols=27 Identities=11% Similarity=-0.022 Sum_probs=19.6
Q ss_pred eEEEEECCCCEEEeCCCCeEEEEecCC
Q psy12933 31 LIECTLNPGDMLYIPPKVWHHVRSLST 57 (65)
Q Consensus 31 ~~~~~l~pGd~LyiP~~WwH~V~s~~~ 57 (65)
......++|+++++=...-|.|.+.+.
T Consensus 123 ~~~~~w~~G~~~~fD~s~~H~~~N~~~ 149 (163)
T PF05118_consen 123 GETRHWREGECWVFDDSFEHEVWNNGD 149 (163)
T ss_dssp TEEEB--CTEEEEE-TTS-EEEEESSS
T ss_pred CeEEEeccCcEEEEeCCEEEEEEeCCC
Confidence 345677999999999999999999875
No 140
>cd02791 MopB_CT_Nitrate-R-NapA-like Nitrate reductases, NapA (Nitrate-R-NapA), NasA, and NarB catalyze the reduction of nitrate to nitrite. Monomeric Nas is located in the cytoplasm and participates in nitrogen assimilation. Dimeric Nap is located in the periplasm and is coupled to quinol oxidation via a membrane-anchored tetraheme cytochrome. This CD (MopB_CT_Nitrate-R-Nap) is of the conserved molybdopterin_binding C-terminal (MopB_CT) region present in many, but not all, MopB homologs
Probab=34.84 E-value=47 Score=19.27 Aligned_cols=20 Identities=5% Similarity=0.210 Sum_probs=14.7
Q ss_pred EEEEECCC---CEEEeCCCCeEE
Q psy12933 32 IECTLNPG---DMLYIPPKVWHH 51 (65)
Q Consensus 32 ~~~~l~pG---d~LyiP~~WwH~ 51 (65)
..+.+.++ +++++|.+||+.
T Consensus 66 ~~v~~~~~i~~g~v~~~~g~~~~ 88 (122)
T cd02791 66 LRVRVTDRVRPGEVFVPMHWGDQ 88 (122)
T ss_pred EEEEECCCcCCCeEEEecccCcc
Confidence 45555554 789999999975
No 141
>cd00118 LysM Lysin domain, found in a variety of enzymes involved in bacterial cell wall degradation. This domain may have a general peptidoglycan binding function.
Probab=34.47 E-value=27 Score=15.55 Aligned_cols=12 Identities=42% Similarity=0.551 Sum_probs=9.4
Q ss_pred EEECCCCEEEeC
Q psy12933 34 CTLNPGDMLYIP 45 (65)
Q Consensus 34 ~~l~pGd~LyiP 45 (65)
..+.+|+.|.+|
T Consensus 35 ~~~~~g~~l~ip 46 (46)
T cd00118 35 DNLQVGQKLKIP 46 (46)
T ss_pred cccCCCCEEecC
Confidence 357889988887
No 142
>PF02814 UreE_N: UreE urease accessory protein, N-terminal domain; InterPro: IPR004029 Urease and other nickel metalloenzymes are synthesised as precursors devoid of the metalloenzyme active site. These precursors then undergo a complex post-translational maturation process that requires a number of accessory proteins. Members of this group are nickel-binding proteins required for urease metallocentre assembly []. They are believed to function as metallochaperones to deliver nickel to urease apoprotein [, ]. It has been shown by yeast two-hybrid analysis that UreE forms a dimeric complex with UreG in Helicobacter pylori []. The UreDFG-apoenzyme complex has also been shown to exist [, ] and is believed to be, with the addition of UreE, the assembly system for active urease []. The complexes, rather than the individual proteins, presumably bind to UreB via UreE/H recognition sites. The structure of Klebsiella aerogenes UreE reveals a unique two-domain architecture.The N-terminal domain is structurally related to a heat shock protein, while the C-terminal domain shows homology to the Atx1 copper metallochaperone [, ]. Significantly, the metal-binding sites in UreE and Atx1 are distinct in location and types of residues despite the relationship between these proteins and the mechanism for UreE activation of urease is proposed to be different from the thiol ligand exchange mechanism used by the copper metallochaperones. The N-terminal domain is termed the peptide-binding domain. Deletion of this domain does not eliminate enzymatic activity, and the truncated protein can still activate urease [].; GO: 0016151 nickel ion binding, 0006461 protein complex assembly, 0019627 urea metabolic process; PDB: 3NXZ_B 3TJA_B 3LA0_B 3TJ9_B 3NY0_A 3L9Z_A 3TJ8_A 1EAR_A 1EB0_A 1GMU_B ....
Probab=34.05 E-value=27 Score=18.95 Aligned_cols=12 Identities=42% Similarity=0.794 Sum_probs=7.5
Q ss_pred ECCCCEEEeCCC
Q psy12933 36 LNPGDMLYIPPK 47 (65)
Q Consensus 36 l~pGd~LyiP~~ 47 (65)
|+.||+|+.-.+
T Consensus 51 L~~GDvL~~d~g 62 (65)
T PF02814_consen 51 LRDGDVLYLDDG 62 (65)
T ss_dssp --TTEEEEECTS
T ss_pred cCCCCEEEeCCC
Confidence 788899987544
No 143
>PF05708 DUF830: Orthopoxvirus protein of unknown function (DUF830); PDB: 2IF6_B 3KW0_C.
Probab=33.92 E-value=24 Score=21.62 Aligned_cols=9 Identities=33% Similarity=0.770 Sum_probs=5.2
Q ss_pred ECCCCEEEe
Q psy12933 36 LNPGDMLYI 44 (65)
Q Consensus 36 l~pGd~Lyi 44 (65)
|++||+|+.
T Consensus 2 l~~GDIil~ 10 (158)
T PF05708_consen 2 LQTGDIILT 10 (158)
T ss_dssp --TT-EEEE
T ss_pred CCCeeEEEE
Confidence 689999986
No 144
>cd00250 CAS_like Clavaminic acid synthetase (CAS) -like; CAS is a trifunctional Fe(II)/ 2-oxoglutarate (2OG) oxygenase carrying out three reactions in the biosynthesis of clavulanic acid, an inhibitor of class A serine beta-lactamases. In general, Fe(II)-2OG oxygenases catalyze a hydroxylation reaction, which leads to the incorporation of an oxygen atom from dioxygen into a hydroxyl group and conversion of 2OG to succinate and CO2
Probab=33.66 E-value=42 Score=22.37 Aligned_cols=15 Identities=33% Similarity=0.740 Sum_probs=13.1
Q ss_pred ceEEEEECCCCEEEe
Q psy12933 30 HLIECTLNPGDMLYI 44 (65)
Q Consensus 30 ~~~~~~l~pGd~Lyi 44 (65)
..+...++|||+|++
T Consensus 218 ~~~~~~l~~Gdivi~ 232 (262)
T cd00250 218 NQLTVKLEPGDLLIF 232 (262)
T ss_pred hEEEEEcCCCCEEEE
Confidence 577899999999986
No 145
>PRK00794 flbT flagellar biosynthesis repressor FlbT; Reviewed
Probab=33.65 E-value=42 Score=21.23 Aligned_cols=17 Identities=24% Similarity=0.561 Sum_probs=14.4
Q ss_pred eEEEEECCCCEEEeCCC
Q psy12933 31 LIECTLNPGDMLYIPPK 47 (65)
Q Consensus 31 ~~~~~l~pGd~LyiP~~ 47 (65)
++...|+|||-++|-..
T Consensus 3 ~Lkl~LKp~ERi~INGA 19 (132)
T PRK00794 3 GLKLSLKPGERIFINGA 19 (132)
T ss_pred CeeEEecCCCEEEEcCe
Confidence 57899999999998754
No 146
>PF07228 SpoIIE: Stage II sporulation protein E (SpoIIE); InterPro: IPR001932 This domain is found in protein phosphatase 2C, as well as other proteins eg. pyruvate dehydrogenase (lipoamide)-phosphatase (3.1.3.43 from EC), adenylate cyclase (4.6.1.1 from EC) and some bacterial stage II sporulation E proteins (3.1.3.16 from EC). Protein phosphatase 2C (PP2C) is one of the four major classes of mammalian serine/threonine specific protein phosphatases (3.1.3.16 from EC). PP2C [] is a monomeric enzyme of about 42 Kd which shows broad substrate specificity and is dependent on divalent cations (mainly manganese and magnesium) for its activity. Its exact physiological role is still unclear. Three isozymes are currently known in mammals: PP2C-alpha, -beta and -gamma. In yeast, there are at least four PP2C homologs: phosphatase PTC1 [], which has weak tyrosine phosphatase activity in addition to its activity on serines, phosphatases PTC2 and PTC3, and hypothetical protein YBR125c. Isozymes of PP2C are also known from Arabidopsis thaliana (ABI1, PPH1), Caenorhabditis elegans (FEM-2, F42G9.1, T23F11.1), Leishmania chagasi and Paramecium tetraurelia. In A. thaliana, the kinase associated protein phosphatase (KAPP) [] is an enzyme that dephosphorylates the Ser/Thr receptor-like kinase RLK5 and which contains a C-terminal PP2C domain. PP2C does not seem to be evolutionary related to the main family of serine/ threonine phosphatases: PP1, PP2A and PP2B. However, it is significantly similar to the catalytic subunit of pyruvate dehydrogenase phosphatase 3.1.3.43 from EC (PDPC) [], which catalyzes dephosphorylation and concomitant reactivation of the alpha subunit of the E1 component of the pyruvate dehydrogenase complex. PDPC is a mitochondrial enzyme and, like PP2C, is magnesium-dependent.; GO: 0003824 catalytic activity; PDB: 3KE6_B 3ZT9_A 3RNR_A 3EQ2_A 3F7A_B 3F79_A 3ES2_B 3PU9_B 3T91_B 3T9Q_B ....
Probab=33.18 E-value=37 Score=20.99 Aligned_cols=17 Identities=35% Similarity=0.456 Sum_probs=12.4
Q ss_pred CCceEEEEECCCCEEEe
Q psy12933 28 PGHLIECTLNPGDMLYI 44 (65)
Q Consensus 28 ~~~~~~~~l~pGd~Lyi 44 (65)
..+..++.+++||.|++
T Consensus 113 ~~~~~~~~l~~gd~l~l 129 (193)
T PF07228_consen 113 DYQEQEIQLEPGDRLLL 129 (193)
T ss_dssp CEEEEEEE--TTEEEEE
T ss_pred cccceEEEeccccEEEE
Confidence 45678999999999986
No 147
>cd00508 MopB_CT_Fdh-Nap-like This CD includes formate dehydrogenases (Fdh) H and N; nitrate reductases, Nap and Nas; and other related proteins. Formate dehydrogenase H is a component of the anaerobic formate hydrogen lyase complex and catalyzes the reversible oxidation of formate to CO2 with the release of a proton and two electrons. Formate dehydrogenase N (alpha subunit) is the major electron donor to the bacterial nitrate respiratory chain and nitrate reductases, Nap and Nas, catalyze the reduction of nitrate to nitrite. This CD (MopB_CT_Fdh-Nap-like) is of the conserved molybdopterin_binding C-terminal (MopB_CT) region present in many, but not all, MopB homologs.
Probab=33.14 E-value=52 Score=18.88 Aligned_cols=12 Identities=8% Similarity=0.160 Sum_probs=10.5
Q ss_pred CEEEeCCCCeEE
Q psy12933 40 DMLYIPPKVWHH 51 (65)
Q Consensus 40 d~LyiP~~WwH~ 51 (65)
+++++|.+|||.
T Consensus 77 g~v~~~~~~~~~ 88 (120)
T cd00508 77 GTVFMPFHWGGE 88 (120)
T ss_pred CEEEEecccCCc
Confidence 689999999975
No 148
>PF05783 DLIC: Dynein light intermediate chain (DLIC); InterPro: IPR022780 This entry consists of several eukaryotic dynein light intermediate chain proteins. The light intermediate chains (LICs) of cytoplasmic dynein consist of multiple isoforms, which undergo post-translational modification to produce a large number of species. DLIC1 is known to be involved in assembly, organisation, and function of centrosomes and mitotic spindles when bound to pericentrin [, ]. DLIC2 is a subunit of cytoplasmic dynein 2 that may play a role in maintaining Golgi organisation by binding cytoplasmic dynein 2 to its Golgi-associated cargo [].
Probab=32.39 E-value=19 Score=27.16 Aligned_cols=14 Identities=21% Similarity=0.434 Sum_probs=11.2
Q ss_pred EECCCCEEEeCCCC
Q psy12933 35 TLNPGDMLYIPPKV 48 (65)
Q Consensus 35 ~l~pGd~LyiP~~W 48 (65)
.+-.-|.+|||+||
T Consensus 273 ~vv~~d~ifIP~Gw 286 (472)
T PF05783_consen 273 QVVERDAIFIPAGW 286 (472)
T ss_pred eeecccccccCCCC
Confidence 34556899999998
No 149
>PRK10178 D-alanyl-D-alanine dipeptidase; Provisional
Probab=32.23 E-value=16 Score=24.37 Aligned_cols=13 Identities=23% Similarity=0.473 Sum_probs=10.1
Q ss_pred EEEeCCCCeEEEE
Q psy12933 41 MLYIPPKVWHHVR 53 (65)
Q Consensus 41 ~LyiP~~WwH~V~ 53 (65)
..-+|..|||.--
T Consensus 156 F~~~~~EWWHf~~ 168 (184)
T PRK10178 156 FVHIASEWWHYEL 168 (184)
T ss_pred CCCCCCcccCcCC
Confidence 4558999999854
No 150
>PRK10838 spr outer membrane lipoprotein; Provisional
Probab=31.63 E-value=31 Score=22.96 Aligned_cols=14 Identities=21% Similarity=0.472 Sum_probs=11.0
Q ss_pred EEECCCCEEEeCCC
Q psy12933 34 CTLNPGDMLYIPPK 47 (65)
Q Consensus 34 ~~l~pGd~LyiP~~ 47 (65)
-.++|||++|+..+
T Consensus 127 ~~lqpGDLVfF~~~ 140 (190)
T PRK10838 127 SKLRTGDLVLFRAG 140 (190)
T ss_pred CCCCCCcEEEECCC
Confidence 45899999998743
No 151
>PRK06933 type III secretion system protein; Validated
Probab=31.09 E-value=32 Score=24.64 Aligned_cols=14 Identities=36% Similarity=0.600 Sum_probs=11.6
Q ss_pred EECCCCEEEeCCCC
Q psy12933 35 TLNPGDMLYIPPKV 48 (65)
Q Consensus 35 ~l~pGd~LyiP~~W 48 (65)
.|++||+|.+|+.+
T Consensus 175 ~L~~GDvLli~~~~ 188 (308)
T PRK06933 175 SLELGDVLLAPEGS 188 (308)
T ss_pred ccCCCCEEEecccc
Confidence 46899999998764
No 152
>COG2173 DdpX D-alanyl-D-alanine dipeptidase [Cell envelope biogenesis, outer membrane]
Probab=30.77 E-value=16 Score=25.11 Aligned_cols=24 Identities=17% Similarity=0.429 Sum_probs=19.3
Q ss_pred eEEEEECCCCEEEeCCCCeEEEEe
Q psy12933 31 LIECTLNPGDMLYIPPKVWHHVRS 54 (65)
Q Consensus 31 ~~~~~l~pGd~LyiP~~WwH~V~s 54 (65)
-+...++.|-..-.|..|||.-..
T Consensus 175 lL~~~M~a~GF~~~~~EWWHF~~~ 198 (211)
T COG2173 175 LLNAAMEAGGFTVYSGEWWHFDLP 198 (211)
T ss_pred HHHHHHHhCCCccCCccceecccc
Confidence 345677889999999999998654
No 153
>cd00174 SH3 Src homology 3 domains; SH3 domains bind to proline-rich ligands with moderate affinity and selectivity, preferentially to PxxP motifs; they play a role in the regulation of enzymes by intramolecular interactions, changing the subcellular localization of signal pathway components and mediate multiprotein complex assemblies.
Probab=30.33 E-value=25 Score=16.86 Aligned_cols=23 Identities=26% Similarity=0.439 Sum_probs=14.0
Q ss_pred EEEEECCCCEEEeC----CCCeEEEEec
Q psy12933 32 IECTLNPGDMLYIP----PKVWHHVRSL 55 (65)
Q Consensus 32 ~~~~l~pGd~LyiP----~~WwH~V~s~ 55 (65)
-+..+.+||++.+- .+|| .++..
T Consensus 14 ~~l~~~~Gd~v~v~~~~~~~w~-~~~~~ 40 (54)
T cd00174 14 DELSFKKGDIIEVLEKSDDGWW-EGRLL 40 (54)
T ss_pred CCCCCCCCCEEEEEEcCCCCeE-EEEEC
Confidence 46677788877753 5555 44433
No 154
>PF02668 TauD: Taurine catabolism dioxygenase TauD, TfdA family; InterPro: IPR003819 This family consists of TauD/TfdA taurine catabolism dioxygenases. The Escherichia coli tauD gene is required for the utilization of taurine (2-aminoethanesulphonic acid) as a sulphur source and is expressed only under conditions of sulphate starvation. TauD is an alpha-ketoglutarate-dependent dioxygenase catalyzing the oxygenolytic release of sulphite from taurine []. The 2,4-dichlorophenoxyacetic acid/alpha-ketoglutarate dioxygenase from Burkholderia sp. (strain RASC) also belongs to this family []. TfdA from Ralstonia eutropha (Alcaligenes eutrophus) is a 2,4-D monooxygenase [].; GO: 0016491 oxidoreductase activity, 0055114 oxidation-reduction process; PDB: 3SWT_B 3R1J_A 1GVG_A 1DRT_A 1DS1_A 1DS0_A 1DRY_A 3V15_A 3PVJ_D 3V17_A ....
Probab=29.85 E-value=46 Score=21.45 Aligned_cols=25 Identities=24% Similarity=0.284 Sum_probs=15.7
Q ss_pred CceEEEEECCCCEEEeCCC-CeEEEE
Q psy12933 29 GHLIECTLNPGDMLYIPPK-VWHHVR 53 (65)
Q Consensus 29 ~~~~~~~l~pGd~LyiP~~-WwH~V~ 53 (65)
...+...+++||+|++=-. ..|.-+
T Consensus 218 ~~~~~~~~~~GDlli~dN~~~lHgR~ 243 (258)
T PF02668_consen 218 EYTYRHRWQPGDLLIWDNHRVLHGRT 243 (258)
T ss_dssp GGEEEEE--TTEEEEEETTTEEEEE-
T ss_pred hhcccccCCCceEEEEcCCeeEecCC
Confidence 5688999999999998543 334333
No 155
>cd02777 MopB_CT_DMSOR-like The MopB_CT_DMSOR-like CD contains dimethylsulfoxide reductase (DMSOR), biotin sulfoxide reductase (BSOR), trimethylamine N-oxide reductase (TMAOR) and other related proteins. DMSOR always catalyzes the reduction of DMSO to dimethylsulfide, but its cellular location and oligomerization state are organism-dependent. For example, in Rhodobacter sphaeriodes and Rhodobacter capsulatus, it is an 82-kDa monomeric soluble protein found in the periplasmic space; in E. coli, it is membrane-bound and exists as a heterotrimer. BSOR catalyzes the reduction of biotin sulfixode to biotin, and is unique among Mo enzymes because no additional auxiliary proteins or cofactors are required. TMAOR is similar to DMSOR, but its only natural substrate is TMAO. Also included in this group is the pyrogallol-phloroglucinol transhydroxylase from Pelobacter acidigallici. This CD is of the conserved molybdopterin_binding C-terminal (MopB_CT) region present in many, but not all, MopB hom
Probab=29.46 E-value=34 Score=20.33 Aligned_cols=12 Identities=17% Similarity=0.628 Sum_probs=9.7
Q ss_pred CEEEeCCCCeEE
Q psy12933 40 DMLYIPPKVWHH 51 (65)
Q Consensus 40 d~LyiP~~WwH~ 51 (65)
+++++|.|||..
T Consensus 76 g~v~~~~g~~~~ 87 (127)
T cd02777 76 GVVALPEGAWYD 87 (127)
T ss_pred CEEEeCcccccC
Confidence 568999999953
No 156
>PRK12791 flbT flagellar biosynthesis repressor FlbT; Reviewed
Probab=29.41 E-value=53 Score=20.81 Aligned_cols=17 Identities=18% Similarity=0.257 Sum_probs=14.0
Q ss_pred eEEEEECCCCEEEeCCC
Q psy12933 31 LIECTLNPGDMLYIPPK 47 (65)
Q Consensus 31 ~~~~~l~pGd~LyiP~~ 47 (65)
++.++|+|||-++|-..
T Consensus 2 ~Lkl~LKP~ERi~INGa 18 (131)
T PRK12791 2 PLRVELKPFERIVIGQS 18 (131)
T ss_pred CeeEEeCCCCEEEEcCE
Confidence 46789999999998654
No 157
>PRK13488 chemoreceptor glutamine deamidase CheD; Provisional
Probab=29.24 E-value=1.6e+02 Score=18.93 Aligned_cols=30 Identities=7% Similarity=0.207 Sum_probs=20.3
Q ss_pred EEEECCCCEEEeCCCCeEEEEe--cCCeEEEEEe
Q psy12933 33 ECTLNPGDMLYIPPKVWHHVRS--LSTSISVSFW 64 (65)
Q Consensus 33 ~~~l~pGd~LyiP~~WwH~V~s--~~~sisvn~w 64 (65)
++.|.+||.-+--.. ..+.+ |+.|++|.+|
T Consensus 3 ~v~v~~Ge~~v~~~p--~~i~T~~LGSCVav~l~ 34 (157)
T PRK13488 3 VIKVGIGEYAVGKSP--EVIETSGLGSCVGIILY 34 (157)
T ss_pred cEEEecCcEEEECCC--CEEEEEcCcCeEEEEEE
Confidence 467899996655543 23433 5679999887
No 158
>PRK13494 chemoreceptor glutamine deamidase CheD; Provisional
Probab=29.03 E-value=1.5e+02 Score=19.32 Aligned_cols=32 Identities=13% Similarity=0.395 Sum_probs=20.9
Q ss_pred ceEEEEECCCCEEEeCCCCeEEEEe-cCCeEEEEEe
Q psy12933 30 HLIECTLNPGDMLYIPPKVWHHVRS-LSTSISVSFW 64 (65)
Q Consensus 30 ~~~~~~l~pGd~LyiP~~WwH~V~s-~~~sisvn~w 64 (65)
....+.|.||| +++... ..+.. ++.|++|.+|
T Consensus 10 ~~~~~~l~~Ge-~~v~~~--~~i~T~LGSCVaV~l~ 42 (163)
T PRK13494 10 KRDVTIIVPGE-AFVSND--RVISTILGSCVAVVLY 42 (163)
T ss_pred CccEEEEecCe-EEEeCC--cEEEEeCcccEEEEEE
Confidence 34556799999 555554 33332 5678999887
No 159
>PF07378 FlbT: Flagellar protein FlbT; InterPro: IPR009967 This family consists of several FlbT proteins. FlbT is a post-transcriptional repressor function in flagellum biogenesis. FlbT is associated with the 5' untranslated region (UTR) of fljK (25 kDa flagellin) mRNA and that this association requires a predicted loop structure in the transcript. Mutations within this loop abolish FlbT association and result in increased mRNA stability. It is therefore thought that FlbT promotes the degradation of flagellin mRNA by associating with the 5' UTR [].; GO: 0048027 mRNA 5'-UTR binding, 0006402 mRNA catabolic process, 0045718 negative regulation of flagellum assembly
Probab=28.74 E-value=44 Score=20.95 Aligned_cols=15 Identities=27% Similarity=0.543 Sum_probs=12.6
Q ss_pred EEEECCCCEEEeCCC
Q psy12933 33 ECTLNPGDMLYIPPK 47 (65)
Q Consensus 33 ~~~l~pGd~LyiP~~ 47 (65)
..+|+|||-++|-..
T Consensus 2 kl~LkpgERi~INGA 16 (126)
T PF07378_consen 2 KLSLKPGERIIINGA 16 (126)
T ss_pred eeeecCCCEEEEcCe
Confidence 578999999998764
No 160
>PF03171 2OG-FeII_Oxy: 2OG-Fe(II) oxygenase superfamily Entry for Lysyl hydrolases This Prosite entry is a sub-family of the Pfam entry; InterPro: IPR005123 This domain is found in members of the 2-oxoglutarate (2OG) and Fe(II)-dependent oxygenase superfamily [], as well as the C-terminal of prolyl 4-hydroxylase alpha subunit. The holoenzyme has the activity (1.14.11.2 from EC) catalysing the reaction: Procollagen L-proline + 2-oxoglutarate + O2 = procollagen trans-4-hydroxy-L-proline + succinate + CO2. The full enzyme consists of a alpha2 beta2 complex with the alpha subunit contributing most of the parts of the active site []. The family also includes lysyl hydrolases, isopenicillin synthases and AlkB. ; GO: 0016491 oxidoreductase activity, 0016706 oxidoreductase activity, acting on paired donors, with incorporation or reduction of molecular oxygen, 2-oxoglutarate as one donor, and incorporation of one atom each of oxygen into both donors, 0055114 oxidation-reduction process; PDB: 3ON7_D 1BK0_A 1IPS_B 1QIQ_A 1OC1_A 2Y86_A 2Y60_A 1W03_A 2VE1_A 1QJF_A ....
Probab=28.50 E-value=96 Score=17.22 Aligned_cols=33 Identities=27% Similarity=0.462 Sum_probs=16.7
Q ss_pred EEEECCCCEEE-eC----CCCeEEEEecC--CeEEEEEeC
Q psy12933 33 ECTLNPGDMLY-IP----PKVWHHVRSLS--TSISVSFWF 65 (65)
Q Consensus 33 ~~~l~pGd~Ly-iP----~~WwH~V~s~~--~sisvn~w~ 65 (65)
.+++..||+|- +. ..+.|+|.... .-+|+.+|+
T Consensus 57 ~~~v~~G~~l~~~t~g~~~~~~HrV~~~~~~~R~s~~~f~ 96 (98)
T PF03171_consen 57 GFIVNFGDALEILTNGRYPATLHRVVPPTEGERYSLTFFL 96 (98)
T ss_dssp CEEEEEBHHHHHHTTTSS----EEEE--STS-EEEEEEEE
T ss_pred eeeeeceeeeecccCCccCCceeeeEcCCCCCEEEEEEEE
Confidence 45566666442 11 23689999885 578888874
No 161
>smart00326 SH3 Src homology 3 domains. Src homology 3 (SH3) domains bind to target proteins through sequences containing proline and hydrophobic amino acids. Pro-containing polypeptides may bind to SH3 domains in 2 different binding orientations.
Probab=28.21 E-value=26 Score=16.92 Aligned_cols=22 Identities=18% Similarity=0.398 Sum_probs=14.3
Q ss_pred ceEEEEECCCCEEEe----CCCCeEE
Q psy12933 30 HLIECTLNPGDMLYI----PPKVWHH 51 (65)
Q Consensus 30 ~~~~~~l~pGd~Lyi----P~~WwH~ 51 (65)
..-+..+++||.+.+ ..+||.-
T Consensus 15 ~~~~l~~~~Gd~v~v~~~~~~~w~~~ 40 (58)
T smart00326 15 DPDELSFKKGDIITVLEKSDDGWWKG 40 (58)
T ss_pred CCCCCCCCCCCEEEEEEcCCCCeEEE
Confidence 344566778887765 5777764
No 162
>PF00122 E1-E2_ATPase: E1-E2 ATPase p-type cation-transporting ATPase superfamily signature H+-transporting ATPase (proton pump) signature sodium/potassium-transporting ATPase signature; InterPro: IPR008250 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (e.g., F-, V- and A-ATPases, which contain rotary motors) and in the type of ions they transport [, ]. The different types include: F-ATPases (F1F0-ATPases), which are found in mitochondria, chloroplasts and bacterial plasma membranes where they are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases), which are primarily found in eukaryotic vacuoles and catalyse ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases), which are found in Archaea and function like F-ATPases (though with respect to their structure and some inhibitor responses, A-ATPases are more closely related to the V-ATPases). P-ATPases (E1E2-ATPases), which are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases, which are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP. P-ATPases (sometime known as E1-E2 ATPases) (3.6.3.- from EC) are found in bacteria and in a number of eukaryotic plasma membranes and organelles []. P-ATPases function to transport a variety of different compounds, including ions and phospholipids, across a membrane using ATP hydrolysis for energy. There are many different classes of P-ATPases, each of which transports a specific type of ion: H+, Na+, K+, Mg2+, Ca2+, Ag+ and Ag2+, Zn2+, Co2+, Pb2+, Ni2+, Cd2+, Cu+ and Cu2+. P-ATPases can be composed of one or two polypeptides, and can usually assume two main conformations called E1 and E2. This entry represents the actuator (A) domain, and some transmembrane helices found in P-type ATPases []. It contains the TGES-loop which is essential for the metal ion binding which results in tight association between the A and P (phosphorylation) domains []. It does not contain the phosphorylation site. It is thought that the large movement of the actuator domain, which is transmitted to the transmembrane helices, is essential to the long distance coupling between formation/decomposition of the acyl phosphate in the cytoplasmic P-domain and the changes in the ion-binding sites buried deep in the membranous region []. This domain has a modulatory effect on the phosphoenzyme processing steps through its nucleotide binding [],[]. P-type (or E1-E2-type) ATPases that form an aspartyl phosphate intermediate in the course of ATP hydrolysis, can be divided into 4 major groups []: (1) Ca2+-transporting ATPases; (2) Na+/K+- and gastric H+/K+-transporting ATPases; (3) plasma membrane H+-transporting ATPases (proton pumps) of plants, fungi and lower eukaryotes; and (4) all bacterial P-type ATPases, except the g2+-ATPase of Salmonella typhimurium, which is more similar to the eukaryotic sequences. However, great variety of sequence analysis methods results in diversity of classification. More information about this protein can be found at Protein of the Month: ATP Synthases [].; GO: 0000166 nucleotide binding, 0046872 metal ion binding; PDB: 2XZB_A 1MHS_B 3TLM_A 3A3Y_A 2ZXE_A 3NAL_A 3NAM_A 3NAN_A 2YJ6_B 2IYE_A ....
Probab=28.12 E-value=27 Score=22.47 Aligned_cols=13 Identities=31% Similarity=0.682 Sum_probs=9.7
Q ss_pred EECCCCEEEeCCC
Q psy12933 35 TLNPGDMLYIPPK 47 (65)
Q Consensus 35 ~l~pGd~LyiP~~ 47 (65)
.|.|||++.+.+|
T Consensus 51 ~L~~GDiI~l~~g 63 (230)
T PF00122_consen 51 ELVPGDIIILKAG 63 (230)
T ss_dssp GT-TTSEEEEETT
T ss_pred hccceeeeecccc
Confidence 4789999988776
No 163
>COG2255 RuvB Holliday junction resolvasome, helicase subunit [DNA replication, recombination, and repair]
Probab=28.02 E-value=31 Score=25.21 Aligned_cols=14 Identities=43% Similarity=0.736 Sum_probs=10.8
Q ss_pred EEEEECCCCEEEeC
Q psy12933 32 IECTLNPGDMLYIP 45 (65)
Q Consensus 32 ~~~~l~pGd~LyiP 45 (65)
+-..|++||+|||=
T Consensus 97 iLt~Le~~DVLFID 110 (332)
T COG2255 97 ILTNLEEGDVLFID 110 (332)
T ss_pred HHhcCCcCCeEEEe
Confidence 34568999999973
No 164
>PF05726 Pirin_C: Pirin C-terminal cupin domain; InterPro: IPR008778 This entry represents C-terminal domain of Pirin proteins from both eukaryotes and prokaryotes. The function of Pirin is unknown but the gene coding for this protein is known to be expressed in all tissues in the human body although it is expressed most strongly in the liver and heart. Pirin is known to be a nuclear protein, exclusively localised within the nucleoplasma and predominantly concentrated within dot-like subnuclear structures []. Pirin is composed of two structurally similar domains arranged face to face. The N-terminal domain additionally features four beta-strands, and the C-terminal domain also includes four additional -strands and a short alpha-helix. Although the two domains are similar, the C-terminal domain of Pirin differs from the N-terminal domain as it does not contain a metal binding site and its sequence does not contain the conserved metal-coordinating residues []. Pirin is confirmed to be a member of the cupin superfamily on the basis of primary sequence and structural similarity. The presence of a metal binding site in the N-terminal beta-barrel of Pirin, may be significant in its role in regulating NFI DNA replication and NF-kappaB transcription factor activity []. Pirin structure has been found to closely resemble members of the cupin superfamily. Pirin contains the two characteristic sequences of the cupin superfamily, namely PG-(X)5-HXH-(X)4-E-(X)6-G and G-(X)5-PXG-(X)2-H-(X)3-N separated by a variable stretch of 15-50 amino acids. These motifs are best conserved in the N-terminal where the conserved histidine and glutamic acid residues correspond to the metal-coordinating residues. The C-terminal domain motifs lack the metal binding residues normally associated with the cupin fold []. Pirin was identified to be a metal-binding protein [], and was found that the metal-binding residues of Pirins are highly conserved across mammals, plants, fungi, and prokaryotic organisms. Pirin acts as a cofactor for the transcription factor NFI, the regulatory mechanism of which is generally believed to require the assistance of a metal ion []. Structural data supports the hypothesis that the bound iron of Pirin may participate in this transcriptional regulation by enhancing and stabilising the formation of the p50,Bcl3,DNA complex []. Metals have been implicated directly or indirectly in the NF-kappaB family of transcription factors that control expression of a number of early response genes associated with inflammatory responses, cell growth, cell cycle progression, and neoplastic transformation []. However, most metal-dependent transcription factors are DNA-binding proteins that bind to specific sequences when the metal binds to the protein. Pirin, on the other hand, appears to function differently and bind to the transcription factor DNA complex [].; PDB: 1J1L_A 3ACL_A 2P17_A.
Probab=28.02 E-value=1.2e+02 Score=17.59 Aligned_cols=30 Identities=27% Similarity=0.498 Sum_probs=19.2
Q ss_pred EEEEECCCCEEE--eCCCCeEEEEecCCeEEE
Q psy12933 32 IECTLNPGDMLY--IPPKVWHHVRSLSTSISV 61 (65)
Q Consensus 32 ~~~~l~pGd~Ly--iP~~WwH~V~s~~~sisv 61 (65)
+++.|+||-.+- +|++|-+.+..++.++.|
T Consensus 2 ~di~l~~g~~~~~~~~~~~~~~iyv~~G~~~v 33 (104)
T PF05726_consen 2 LDIKLEPGASFTLPLPPGHNAFIYVLEGSVEV 33 (104)
T ss_dssp EEEEE-TT-EEEEEEETT-EEEEEEEESEEEE
T ss_pred EEEEECCCCEEEeecCCCCEEEEEEEECcEEE
Confidence 578888887766 458887777777666554
No 165
>PRK10202 ebgC cryptic beta-D-galactosidase subunit beta; Reviewed
Probab=27.66 E-value=64 Score=20.41 Aligned_cols=22 Identities=5% Similarity=-0.035 Sum_probs=19.5
Q ss_pred EEEEECCCCEEEeCCCCeEEEE
Q psy12933 32 IECTLNPGDMLYIPPKVWHHVR 53 (65)
Q Consensus 32 ~~~~l~pGd~LyiP~~WwH~V~ 53 (65)
..++|+||+.+.+-|+=.|+..
T Consensus 106 ~~v~l~~G~F~iffP~daH~P~ 127 (149)
T PRK10202 106 ETVEVHEGQIVICDIHEAYRFI 127 (149)
T ss_pred cEEEeCCCeEEEECCcccccCC
Confidence 3899999999999999899875
No 166
>PF04322 DUF473: Protein of unknown function (DUF473); InterPro: IPR007417 This is a family of uncharacterised archaeal proteins.
Probab=27.63 E-value=37 Score=21.29 Aligned_cols=12 Identities=33% Similarity=0.974 Sum_probs=9.3
Q ss_pred EECCCCEEEeCC
Q psy12933 35 TLNPGDMLYIPP 46 (65)
Q Consensus 35 ~l~pGd~LyiP~ 46 (65)
.++|||++|+=.
T Consensus 38 ~~~~Gd~VFlT~ 49 (119)
T PF04322_consen 38 SLDPGDRVFLTS 49 (119)
T ss_pred cCCCCCEEEEec
Confidence 458999999743
No 167
>COG0490 Putative regulatory, ligand-binding protein related to C-terminal domains of K+ channels [Inorganic ion transport and metabolism]
Probab=27.52 E-value=33 Score=22.66 Aligned_cols=12 Identities=33% Similarity=0.520 Sum_probs=10.2
Q ss_pred EEEECCCCEEEe
Q psy12933 33 ECTLNPGDMLYI 44 (65)
Q Consensus 33 ~~~l~pGd~Lyi 44 (65)
+.+|++||.||+
T Consensus 132 y~vle~gDtlvv 143 (162)
T COG0490 132 YTVLEAGDTLVV 143 (162)
T ss_pred hhhhcCCCEEEE
Confidence 378999999986
No 168
>PRK13495 chemoreceptor glutamine deamidase CheD; Provisional
Probab=27.18 E-value=1.8e+02 Score=18.81 Aligned_cols=31 Identities=10% Similarity=0.227 Sum_probs=20.6
Q ss_pred EEEEECCCCEEEeCCCCeEEEEe--cCCeEEEEEe
Q psy12933 32 IECTLNPGDMLYIPPKVWHHVRS--LSTSISVSFW 64 (65)
Q Consensus 32 ~~~~l~pGd~LyiP~~WwH~V~s--~~~sisvn~w 64 (65)
+.++|.+||.-+--.. ..+.+ |+.|++|.+|
T Consensus 2 ~~v~v~~Ge~~v~~~p--~~l~T~~LGSCVaV~l~ 34 (159)
T PRK13495 2 MKKVIGIGEYAVMKNP--GVIVTLGLGSCVGVCMR 34 (159)
T ss_pred ccEEEecCcEEEECCC--CEEEEecCcCeEEEEEE
Confidence 4678999996554332 23443 6789999887
No 169
>COG3508 HmgA Homogentisate 1,2-dioxygenase [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=26.91 E-value=1e+02 Score=23.27 Aligned_cols=25 Identities=20% Similarity=0.294 Sum_probs=20.8
Q ss_pred EEEECCCCEEEeCCCCeEEEEecCC
Q psy12933 33 ECTLNPGDMLYIPPKVWHHVRSLST 57 (65)
Q Consensus 33 ~~~l~pGd~LyiP~~WwH~V~s~~~ 57 (65)
+..++|||++.||.|-=-+|+-.+.
T Consensus 165 ~l~v~pgeiavIPRG~~frve~~~~ 189 (427)
T COG3508 165 VLEVEPGEIAVIPRGTTFRVELKDG 189 (427)
T ss_pred eEEecCCcEEEeeCCceEEEEecCC
Confidence 4678999999999998888877653
No 170
>cd02779 MopB_CT_Arsenite-Ox This CD contains the molybdopterin_binding C-terminal (MopB_CT) region of Arsenite oxidase (Arsenite-Ox) and related proteins. Arsenite oxidase oxidizes arsenite to the less toxic arsenate; it transfers the electrons obtained from the oxidation of arsenite towards the soluble periplasmic electron carriers cytochrome c and/or amicyanin.
Probab=26.81 E-value=41 Score=19.74 Aligned_cols=11 Identities=0% Similarity=-0.182 Sum_probs=9.4
Q ss_pred CEEEeCCCCeE
Q psy12933 40 DMLYIPPKVWH 50 (65)
Q Consensus 40 d~LyiP~~WwH 50 (65)
+++++|.|||.
T Consensus 75 g~v~~p~g~~~ 85 (115)
T cd02779 75 GQTFMLMAHPR 85 (115)
T ss_pred CeEEEEccCCc
Confidence 67999999994
No 171
>PF00018 SH3_1: SH3 domain; InterPro: IPR001452 SH3 (src Homology-3) domains are small protein modules containing approximately 50 amino acid residues [, ]. They are found in a great variety of intracellular or membrane-associated proteins [, , ] for example, in a variety of proteins with enzymatic activity, in adaptor proteins that lack catalytic sequences and in cytoskeletal proteins, such as fodrin and yeast actin binding protein ABP-1. The SH3 domain has a characteristic fold which consists of five or six beta-strands arranged as two tightly packed anti-parallel beta sheets. The linker regions may contain short helices []. The surface of the SH3-domain bears a flat, hydrophobic ligand-binding pocket which consists of three shallow grooves defined by conservative aromatic residues in which the ligand adopts an extended left-handed helical arrangement. The ligand binds with low affinity but this may be enhanced by multiple interactions. The region bound by the SH3 domain is in all cases proline-rich and contains PXXP as a core-conserved binding motif. The function of the SH3 domain is not well understood but they may mediate many diverse processes such as increasing local concentration of proteins, altering their subcellular location and mediating the assembly of large multiprotein complexes []. The crystal structure of the SH3 domain of the cytoskeletal protein spectrin, and the solution structures of SH3 domains of phospholipase C (PLC-y) and phosphatidylinositol 3-kinase p85 alpha-subunit, have been determined [, , ]. In spite of relatively limited sequence similarity, their overall structures are similar. The domains belong to the alpha+beta structural class, with 5 to 8 beta-strands forming 2 tightly-packed, anti-parallel beta-sheets arranged in a barrel-like structure, and intervening loops sometimes forming helices. Conserved aliphatic and aromatic residues form a hydrophobic core (A11, L23, A29, V34, W42, L52 and V59 in PLC-y []) and a hydrophobic pocket on the molecular surface (L12, F13, W53 and P55 in PLC-y). The conserved core is believed to stabilise the fold, while the pocket is thought to serve as a binding site for target proteins. Conserved carboxylic amino acids located in the loops, on the periphery of the pocket (D14 and E22), may be involved in protein-protein interactions via proline-rich regions. The N- and C-termini are packed in close proximity, indicating that they are independent structural modules.; GO: 0005515 protein binding; PDB: 1UHF_A 1W1F_A 1WA7_A 1SEM_A 1KFZ_A 2SEM_B 1K76_A 3SEM_B 1X2Q_A 2J06_B ....
Probab=26.51 E-value=14 Score=18.56 Aligned_cols=29 Identities=21% Similarity=0.345 Sum_probs=16.7
Q ss_pred CCCCceEEEEECCCCEEEe----CCCCeEEEEec
Q psy12933 26 QTPGHLIECTLNPGDMLYI----PPKVWHHVRSL 55 (65)
Q Consensus 26 ~~~~~~~~~~l~pGd~Lyi----P~~WwH~V~s~ 55 (65)
|.....-+..++.||.|+| ..+||. ++..
T Consensus 6 f~~~~~~eLs~~~Gd~i~v~~~~~~~Ww~-~~~~ 38 (48)
T PF00018_consen 6 FDAEDPDELSFKKGDIIEVLEKSDDGWWK-VRNE 38 (48)
T ss_dssp BETSSTTBSEB-TTEEEEEEEESSSSEEE-EEET
T ss_pred eCCCCCCEEeEECCCEEEEEEecCCCEEE-EEEC
Confidence 3344455677889988874 345554 4444
No 172
>PRK14125 cell division suppressor protein YneA; Provisional
Probab=26.26 E-value=40 Score=20.24 Aligned_cols=13 Identities=38% Similarity=0.618 Sum_probs=10.4
Q ss_pred EECCCCEEEeCCC
Q psy12933 35 TLNPGDMLYIPPK 47 (65)
Q Consensus 35 ~l~pGd~LyiP~~ 47 (65)
.+.||+.|.||..
T Consensus 79 ~I~~Gq~L~IP~~ 91 (103)
T PRK14125 79 HIKAGDKLVIPVL 91 (103)
T ss_pred cCCCCCEEEEecC
Confidence 3889999999853
No 173
>cd02780 MopB_CT_Tetrathionate_Arsenate-R This CD contains the molybdopterin_binding C-terminal (MopB_CT) region of tetrathionate reductase, subunit A, (TtrA); respiratory arsenate As(V) reductase, catalytic subunit (ArrA); and other related proteins.
Probab=25.93 E-value=43 Score=20.39 Aligned_cols=12 Identities=17% Similarity=0.077 Sum_probs=10.4
Q ss_pred CEEEeCCCCeEE
Q psy12933 40 DMLYIPPKVWHH 51 (65)
Q Consensus 40 d~LyiP~~WwH~ 51 (65)
+++++|.+|+|.
T Consensus 72 g~V~~p~g~g~~ 83 (143)
T cd02780 72 GVVAIEHGYGHW 83 (143)
T ss_pred CEEEEecccCcc
Confidence 689999999775
No 174
>cd02789 MopB_CT_FmdC-FwdD The MopB_FmdC-FwdD CD includes the C-terminus of subunit C of molybdenum formylmethanofuran dehydrogenase (FmdC) and subunit D of tungsten formylmethanofuran dehydrogenase (FwdD), and other related proteins. Formylmethanofuran dehydrogenase catalyzes the first step in methane formation from CO2 in methanogenic archaea and some eubacteria. Members of this CD belong to the molybdopterin_binding superfamily of proteins. This CD is of the conserved molybdopterin_binding C-terminal (MopB_CT) region present in many, but not all, MopB homologs.
Probab=25.46 E-value=42 Score=19.69 Aligned_cols=10 Identities=40% Similarity=1.059 Sum_probs=9.0
Q ss_pred CEEEeCCCCe
Q psy12933 40 DMLYIPPKVW 49 (65)
Q Consensus 40 d~LyiP~~Ww 49 (65)
+++++|.+||
T Consensus 73 g~v~~~~g~~ 82 (106)
T cd02789 73 GMVFIPMGPW 82 (106)
T ss_pred CEEEEecccc
Confidence 6789999998
No 175
>TIGR02410 carnitine_TMLD trimethyllysine dioxygenase. Members of this family with known function act as trimethyllysine dioxygenase, an enzyme in the pathway for carnitine biosynthesis from lysine. This enzyme is homologous to gamma-butyrobetaine,2-oxoglutarate dioxygenase, which catalyzes the last step in carnitine biosynthesis. Members of this family appear to be eukaryotic only.
Probab=25.44 E-value=68 Score=22.86 Aligned_cols=15 Identities=40% Similarity=0.660 Sum_probs=12.9
Q ss_pred ceEEEEECCCCEEEe
Q psy12933 30 HLIECTLNPGDMLYI 44 (65)
Q Consensus 30 ~~~~~~l~pGd~Lyi 44 (65)
-.++..|+|||++++
T Consensus 311 ~~~~~~l~pGd~vi~ 325 (362)
T TIGR02410 311 NEIEFKLRPGTVLIF 325 (362)
T ss_pred cEEEEEcCCccEEEE
Confidence 478899999999985
No 176
>PF10162 G8: G8 domain; InterPro: IPR019316 This entry represents a domain found in disease proteins PKHD1 and KIAA1199 and is named G8 after its 8 conserved glycines. It is predicted to contain 10 beta strands and an alpha helix [].
Probab=25.10 E-value=64 Score=19.60 Aligned_cols=19 Identities=21% Similarity=0.192 Sum_probs=13.9
Q ss_pred CCCCEEEeCCCCeEEEEec
Q psy12933 37 NPGDMLYIPPKVWHHVRSL 55 (65)
Q Consensus 37 ~pGd~LyiP~~WwH~V~s~ 55 (65)
.+||.++||++++=-+...
T Consensus 10 ~~g~~V~I~~g~~v~lD~~ 28 (125)
T PF10162_consen 10 GAGDNVVIPAGQTVLLDVS 28 (125)
T ss_pred CCCCEEEECCCCEEEEcCC
Confidence 4789999999976554443
No 177
>PF07591 PT-HINT: Pretoxin HINT domain; InterPro: IPR011451 This entry represents a cluster of homologous proteins identified in Leptospira interrogans. One member (Q8EZX6 from SWISSPROT) has been predicted to be a phenazine biosynthesis family protein.; PDB: 2JNQ_A 2JMZ_A.
Probab=24.99 E-value=72 Score=19.60 Aligned_cols=22 Identities=32% Similarity=0.507 Sum_probs=14.5
Q ss_pred EECCCCEEEeCCCCeEEEEecC
Q psy12933 35 TLNPGDMLYIPPKVWHHVRSLS 56 (65)
Q Consensus 35 ~l~pGd~LyiP~~WwH~V~s~~ 56 (65)
.|++||.|.-+.|-+-.+.+..
T Consensus 76 ~L~~GD~L~~~~G~~~~v~~i~ 97 (130)
T PF07591_consen 76 DLKVGDRLLTADGSWVTVTSIR 97 (130)
T ss_dssp G--TTSEEEEE-SSEEEEE---
T ss_pred hCCCCCEEEcCCCCEEEEEEEE
Confidence 5899999999999998887754
No 178
>cd02790 MopB_CT_Formate-Dh_H Formate dehydrogenase H (Formate-Dh-H) catalyzes the reversible oxidation of formate to CO2 with the release of a proton and two electrons. It is a component of the anaerobic formate hydrogen lyase complex. The E. coli formate dehydrogenase H (Fdh-H) is a monomer composed of a single polypeptide chain with a Mo active site region and a [4Fe-4S] center. This CD (MopB_CT_Formate-Dh_H) is of the conserved molybdopterin_binding C-terminal (MopB_CT) region present in many, but not all, MopB homologs.
Probab=24.67 E-value=46 Score=19.07 Aligned_cols=12 Identities=8% Similarity=0.324 Sum_probs=10.3
Q ss_pred CEEEeCCCCeEE
Q psy12933 40 DMLYIPPKVWHH 51 (65)
Q Consensus 40 d~LyiP~~WwH~ 51 (65)
+++++|.+||+.
T Consensus 77 g~v~~~~g~~~~ 88 (116)
T cd02790 77 GVVFMPFHFAEA 88 (116)
T ss_pred CEEEEecCcCCC
Confidence 679999999964
No 179
>KOG0572|consensus
Probab=24.48 E-value=62 Score=24.67 Aligned_cols=25 Identities=28% Similarity=0.517 Sum_probs=19.6
Q ss_pred CceEEEEECCCCEEEeCCCCeEEEEecC
Q psy12933 29 GHLIECTLNPGDMLYIPPKVWHHVRSLS 56 (65)
Q Consensus 29 ~~~~~~~l~pGd~LyiP~~WwH~V~s~~ 56 (65)
.-.++..++|||++.|+. |.++++.
T Consensus 217 ~a~y~Rev~PGEiV~i~r---~g~~s~~ 241 (474)
T KOG0572|consen 217 GARYEREVRPGEIVEISR---NGVKSVD 241 (474)
T ss_pred ccEEEEeecCceEEEEec---CCceeee
Confidence 356788999999999998 6666553
No 180
>PF02938 GAD: GAD domain; InterPro: IPR004115 This entry represetns an 2 layer alpha/beta insertion domain found in some glutamyl-tRNA amidotransferases and aspartyl tRNA synthetases [, ]. The function of this domain is not yet known.; GO: 0004812 aminoacyl-tRNA ligase activity, 0005524 ATP binding, 0005737 cytoplasm; PDB: 1ZQ1_D 1EQR_B 1IL2_B 1C0A_A 1L0W_A 1G51_B 1EFW_B 2D6F_D.
Probab=24.45 E-value=39 Score=19.55 Aligned_cols=17 Identities=24% Similarity=0.321 Sum_probs=10.2
Q ss_pred EEECCCCEEEeCCCCeE
Q psy12933 34 CTLNPGDMLYIPPKVWH 50 (65)
Q Consensus 34 ~~l~pGd~LyiP~~WwH 50 (65)
+-+++||+|++=++=|+
T Consensus 73 ~~a~~GD~ll~~Ag~~~ 89 (95)
T PF02938_consen 73 LGAKPGDLLLFVAGKKE 89 (95)
T ss_dssp TT--TTEEEEEEEESHH
T ss_pred hCCCCCCEEEEECCCHH
Confidence 34788999887665443
No 181
>PF00856 SET: SET domain; InterPro: IPR001214 The SET domain appears generally as one part of a larger multidomain protein, and recently there were described three structures of very different proteins with distinct domain compositions: Neurospora crassa DIM-5, a member of the Su(var) family of HKMTs which methylate histone H3 on lysine 9,human SET7 (also called SET9), which methylates H3 on lysine 4 and garden pea Rubisco LSMT, an enzyme that does not modify histones, but instead methylates lysine 14 in the flexible tail of the large subunit of the enzyme Rubisco. The SET domain itself turned out to be an uncommon structure. Although in all three studies, electron density maps revealed the location of the AdoMet or AdoHcy cofactor, the SET domain bears no similarity at all to the canonical/AdoMet-dependent methyltransferase fold. Strictly conserved in the C-terminal motif of the SET domain tyrosine could be involved in abstracting a proton from the protonated amino group of the substrate lysine, promoting its nucleophilic attack on the sulphonium methyl group of the AdoMet cofactor. In contrast to the AdoMet-dependent protein methyltranferases of the classical type, which tend to bind their polypeptide substrates on top of the cofactor, it is noted from the Rubisco LSMT structure that the AdoMet seems to bind in a separate cleft, suggesting how a polypeptide substrate could be subjected to multiple rounds of methylation without having to be released from the enzyme. In contrast, SET7/9 is able to add only a single methyl group to its substrate. It has been demonstrated that association of SET domain and myotubularin-related proteins modulates growth control []. The SET domain-containing Drosophila melanogaster (Fruit fly) protein, enhancer of zeste, has a function in segment determination and the mammalian homologue may be involved in the regulation of gene transcription and chromatin structure. Histone lysine methylation is part of the histone code that regulated chromatin function and epigenetic control of gene function. Histone lysine methyltransferases (HMTase) differ both in their substrate specificity for the various acceptor lysines as well as in their product specificity for the number of methyl groups (one, two, or three) they transfer. With just one exception [], the HMTases belong to SET family that can be classified according to the sequences surrounding the SET domain [, ]. Structural studies on the human SET7/9, a mono-methylase, have revealed the molecular basis for the specificity of the enzyme for the histone-target and the roles of the invariant residues in the SET domain in determining the methylation specificities []. The pre-SET domain, as found in the SUV39 SET family, contains nine invariant cysteine residues that are grouped into two segments separated by a region of variable length. These 9 cysteines coordinate 3 zinc ions to form to form a triangular cluster, where each of the zinc ions is coordinated by 4 four cysteines to give a tetrahedral configuration. The function of this domain is structural, holding together 2 long segments of random coils. The C-terminal region including the post-SET domain is disordered when not interacting with a histone tail and in the absence of zinc. The three conserved cysteines in the post-SET domain form a zinc-binding site when coupled to a fourth conserved cysteine in the knot-like structure close to the SET domain active site []. The structured post-SET region brings in the C-terminal residues that participate in S-adenosylmethine-binding and histone tail interactions. The three conserved cysteine residues are essential for HMTase activity, as replacement with serine abolishes HMTase activity [], []. ; GO: 0005515 protein binding; PDB: 3TG5_A 3S7F_A 3RIB_B 3TG4_A 3S7J_A 3S7D_A 3S7B_A 3H6L_A 3SMT_A 3K5K_A ....
Probab=24.18 E-value=34 Score=19.72 Aligned_cols=13 Identities=23% Similarity=0.710 Sum_probs=8.5
Q ss_pred EECCCCEEEeCCC
Q psy12933 35 TLNPGDMLYIPPK 47 (65)
Q Consensus 35 ~l~pGd~LyiP~~ 47 (65)
.+++||+|++|..
T Consensus 9 dI~~Ge~I~~p~~ 21 (162)
T PF00856_consen 9 DIKAGEVILIPRP 21 (162)
T ss_dssp -B-TTEEEEEESE
T ss_pred cCCCCCEEEEECc
Confidence 5788998887764
No 182
>PRK11198 LysM domain/BON superfamily protein; Provisional
Probab=24.10 E-value=48 Score=20.81 Aligned_cols=11 Identities=45% Similarity=0.951 Sum_probs=7.7
Q ss_pred ECCCCEEEeCC
Q psy12933 36 LNPGDMLYIPP 46 (65)
Q Consensus 36 l~pGd~LyiP~ 46 (65)
+.||+.|.||+
T Consensus 137 I~pGq~L~IP~ 147 (147)
T PRK11198 137 IYPGQVLRIPE 147 (147)
T ss_pred cCcCCEEecCC
Confidence 66777777774
No 183
>PF14525 AraC_binding_2: AraC-binding-like domain
Probab=23.94 E-value=1.7e+02 Score=17.32 Aligned_cols=27 Identities=22% Similarity=0.255 Sum_probs=22.3
Q ss_pred eEEEEECCCCEEEeCCCCeEEEEecCC
Q psy12933 31 LIECTLNPGDMLYIPPKVWHHVRSLST 57 (65)
Q Consensus 31 ~~~~~l~pGd~LyiP~~WwH~V~s~~~ 57 (65)
.-++.+.|||+++++++=-...+..+.
T Consensus 72 g~~~~~~pg~~~l~d~~~~~~~~~~~~ 98 (172)
T PF14525_consen 72 GREVELAPGDVVLLDPGQPYRLEFSAG 98 (172)
T ss_pred CEEEEEcCCeEEEEcCCCCEEEEECCC
Confidence 568899999999999998877776653
No 184
>PF03737 Methyltransf_6: Demethylmenaquinone methyltransferase; InterPro: IPR005493 This entry represents a structural motif found in demethylmenaquinone methyltransferases and in the regulator of ribonuclease E activity A (RraA). These proteins contain a swivelling 3-layer beta/beta/alpha domain that appears to be mobile in most multi-domain proteins known to contain it. These proteins are structurally similar, and may have distant homology, to the phosphohistidine domain of pyruvate phosphate dikinase. The RraA fold is an ancient platform that has been adapted for a wide range of functions. RraA had been identified as a putative demethylmenaquinone methyltransferase and was annotated as MenG, but further analysis showed that RraA lacked the structural motifs usually required for methylases. The Escherichia coli protein regulator RraA acts as a trans-acting modulator of RNA turnover, binding essential endonuclease RNase E and inhibiting RNA processing []. RNase E forms the core of a large RNA-catalysis machine termed the degradosomes. RraA (and RraB) causes remodelling of degradosome composition, which is associated with alterations in RNA decay and global transcript abundance and as such is a bacterial mechanism for the regulation of RNA cleavage. Demethylmenaquinone methyltransferases convert dimethylmenaquinone (DMK) to menaquinone (MK) in the final step of menaquinone biosynthesis. This region is also found at the C terminus of the DlpA protein Q48806 from SWISSPROT.; PDB: 3C8O_A 2PCN_A 3K4I_B 2YJV_A 1Q5X_B 2C5Q_D 2YJT_B 1NXJ_B 3NOJ_A 1J3L_A ....
Probab=23.68 E-value=64 Score=20.37 Aligned_cols=10 Identities=50% Similarity=1.039 Sum_probs=6.6
Q ss_pred EEECCCCEEE
Q psy12933 34 CTLNPGDMLY 43 (65)
Q Consensus 34 ~~l~pGd~Ly 43 (65)
++++|||++|
T Consensus 137 v~V~pGD~I~ 146 (154)
T PF03737_consen 137 VTVNPGDIIV 146 (154)
T ss_dssp EEE-TTSEEE
T ss_pred EEECCCCEEE
Confidence 4678888776
No 185
>PRK13491 chemoreceptor glutamine deamidase CheD; Provisional
Probab=23.57 E-value=2.4e+02 Score=19.08 Aligned_cols=35 Identities=11% Similarity=0.177 Sum_probs=23.7
Q ss_pred CCceEEEEECCCCEEEeCCCCeEEE--EecCCeEEEEEe
Q psy12933 28 PGHLIECTLNPGDMLYIPPKVWHHV--RSLSTSISVSFW 64 (65)
Q Consensus 28 ~~~~~~~~l~pGd~LyiP~~WwH~V--~s~~~sisvn~w 64 (65)
+.+..++.|.+||+.+--.. ..+ ..|+.|++|-+|
T Consensus 10 ~~~~~~v~V~~Ge~~vs~~p--~~litT~LGSCVaV~Ly 46 (199)
T PRK13491 10 ASQISITHVTQGSCVASDSP--NEVYATILGSCICTCMC 46 (199)
T ss_pred cceEEEEEEecCcEEEECCC--CEEEEEeccccEEEEEE
Confidence 45566888999996665442 132 336789999887
No 186
>KOG2414|consensus
Probab=23.48 E-value=57 Score=25.00 Aligned_cols=19 Identities=32% Similarity=0.517 Sum_probs=16.1
Q ss_pred CCceEEEEECCCCEEEeCCCC
Q psy12933 28 PGHLIECTLNPGDMLYIPPKV 48 (65)
Q Consensus 28 ~~~~~~~~l~pGd~LyiP~~W 48 (65)
-.+.+.+++||| ||||..|
T Consensus 424 L~pg~ViTIEPG--vYIP~d~ 442 (488)
T KOG2414|consen 424 LQPGMVITIEPG--VYIPEDD 442 (488)
T ss_pred CCCCceEEecCc--eecCccC
Confidence 356788999999 8999987
No 187
>PF08857 ParBc_2: Putative ParB-like nuclease; InterPro: IPR014956 These proteins are probably distantly related to IPR003115 from INTERPRO. Suggesting these, uncharacterised proteins have a nuclease function. ; PDB: 2HWJ_E.
Probab=23.42 E-value=1e+02 Score=20.14 Aligned_cols=25 Identities=20% Similarity=0.409 Sum_probs=14.3
Q ss_pred CCCCCCCCceEEEEECCCCEEEeCC
Q psy12933 22 APHNQTPGHLIECTLNPGDMLYIPP 46 (65)
Q Consensus 22 ~~p~~~~~~~~~~~l~pGd~LyiP~ 46 (65)
.+-.+-+.+++.++++||.-+|+=-
T Consensus 36 ~~~~~l~~~~vpvViGP~g~lyl~D 60 (163)
T PF08857_consen 36 KFDDYLEKHPVPVVIGPGGQLYLTD 60 (163)
T ss_dssp -----TT-BEEEEEE-STT-EEE-S
T ss_pred HHHHHHhcCCCCEEECCCCCeEEEC
Confidence 4455778899999999999999743
No 188
>PF11649 T4_neck-protein: Virus neck protein; InterPro: IPR021674 This family of protein is represented by bacteriophage T4 Gp14, a major component of the neck along with Gp13. When added to completed heads, Gp13 and Gp14 structural proteins activate the heads for tail joining in vitro complementation mixtures. Gp14 is rich is beta-sheets. The formation of the neck to the head of the bacteriophage is crucial for the tail attachment [].
Probab=23.37 E-value=49 Score=23.00 Aligned_cols=11 Identities=36% Similarity=0.948 Sum_probs=9.5
Q ss_pred CCCCEEEeCCC
Q psy12933 37 NPGDMLYIPPK 47 (65)
Q Consensus 37 ~pGd~LyiP~~ 47 (65)
+.||+||+|.+
T Consensus 115 ~eGDLIYfP~~ 125 (230)
T PF11649_consen 115 KEGDLIYFPMT 125 (230)
T ss_pred CcCCEEEEEcC
Confidence 67999999975
No 189
>cd02784 MopB_CT_PHLH The MopB_CT_PHLH CD includes a group of related uncharacterized putative hydrogenase-like homologs (PHLH) of molybdopterin binding proteins. This CD is of the PHLH region homologous to the conserved molybdopterin-binding C-terminal (MopB_CT) region present in many, but not all, MopB homologs.
Probab=23.31 E-value=43 Score=20.93 Aligned_cols=14 Identities=0% Similarity=-0.097 Sum_probs=12.0
Q ss_pred CEEEeCCCCeEEEE
Q psy12933 40 DMLYIPPKVWHHVR 53 (65)
Q Consensus 40 d~LyiP~~WwH~V~ 53 (65)
+++++|.||.|...
T Consensus 80 gvV~i~~G~G~~~~ 93 (137)
T cd02784 80 GVVLLALGYGRTHA 93 (137)
T ss_pred CEEEEecccCcccC
Confidence 78999999999644
No 190
>PF01426 BAH: BAH domain; InterPro: IPR001025 The BAH (bromo-adjacent homology) family contains proteins such as eukaryotic DNA (cytosine-5) methyltransferases IPR001525 from INTERPRO, the origin recognition complex 1 (Orc1) proteins, as well as several proteins involved in transcriptional regulation. The BAH domain appears to act as a protein-protein interaction module specialised in gene silencing, as suggested for example by its interaction within yeast Orc1p with the silent information regulator Sir1p. The BAH module might therefore play an important role by linking DNA methylation, replication and transcriptional regulation [].; GO: 0003677 DNA binding; PDB: 4DA4_A 3PT6_B 3AV6_A 3AV5_A 3AV4_A 3PT9_A 3SWR_A 3PTA_A 1M4Z_A 1ZBX_A ....
Probab=22.92 E-value=64 Score=18.49 Aligned_cols=15 Identities=33% Similarity=0.705 Sum_probs=11.8
Q ss_pred EEECCCCEEEeCCCC
Q psy12933 34 CTLNPGDMLYIPPKV 48 (65)
Q Consensus 34 ~~l~pGd~LyiP~~W 48 (65)
++++.||++|+-+.-
T Consensus 1 ~~~~vGD~V~v~~~~ 15 (119)
T PF01426_consen 1 VTYKVGDFVYVKPDD 15 (119)
T ss_dssp EEEETTSEEEEECTS
T ss_pred CEEeCCCEEEEeCCC
Confidence 467889999987765
No 191
>PRK11479 hypothetical protein; Provisional
Probab=22.49 E-value=93 Score=22.05 Aligned_cols=13 Identities=38% Similarity=0.797 Sum_probs=10.4
Q ss_pred EEEECCCCEEEeC
Q psy12933 33 ECTLNPGDMLYIP 45 (65)
Q Consensus 33 ~~~l~pGd~LyiP 45 (65)
.-.|+|||+||.-
T Consensus 62 ~~~LqpGDLVFfs 74 (274)
T PRK11479 62 APDLKPGDLLFSS 74 (274)
T ss_pred hhhCCCCCEEEEe
Confidence 3578999999964
No 192
>cd02775 MopB_CT Molybdopterin-Binding, C-terminal (MopB_CT) domain of the MopB superfamily of proteins, a large, diverse, heterogeneous superfamily of enzymes that, in general, bind molybdopterin as a cofactor. The MopB domain is found in a wide variety of molybdenum- and tungsten-containing enzymes, including formate dehydrogenase-H (Fdh-H) and -N (Fdh-N), several forms of nitrate reductase (Nap, Nas, NarG), dimethylsulfoxide reductase (DMSOR), thiosulfate reductase, formylmethanofuran dehydrogenase, and arsenite oxidase. Molybdenum is present in most of these enzymes in the form of molybdopterin, a modified pterin ring with a dithiolene side chain, which is responsible for ligating the Mo. In many bacterial and archaeal species, molybdopterin is in the form of a dinucleotide, with two molybdopterin dinucleotide units per molybdenum. These proteins can function as monomers, heterodimers, or heterotrimers, depending on the protein and organism. Also included in the MopB superfamily is
Probab=22.28 E-value=1.3e+02 Score=16.54 Aligned_cols=12 Identities=17% Similarity=0.468 Sum_probs=10.6
Q ss_pred CEEEeCCCCeEE
Q psy12933 40 DMLYIPPKVWHH 51 (65)
Q Consensus 40 d~LyiP~~WwH~ 51 (65)
+++++|.+|||.
T Consensus 65 g~v~~~~~~~~~ 76 (101)
T cd02775 65 GVVFLPHGWGHR 76 (101)
T ss_pred CEEEeeCccCcc
Confidence 589999999985
No 193
>PF07915 PRKCSH: Glucosidase II beta subunit-like protein; InterPro: IPR012913 The sequences found in this family are similar to a region found in the beta-subunit of glucosidase II (P14314 from SWISSPROT), which is also known as protein kinase C substrate 80K-H (PRKCSH). The enzyme catalyses the sequential removal of two alpha-1,3-linked glucose residues in the second step of N-linked oligosaccharide processing []. The beta subunit is required for the solubility and stability of the heterodimeric enzyme, and is involved in retaining the enzyme within the endoplasmic reticulum []. Mutations in the gene coding for PRKCSH have been found to be involved in the development of autosomal dominant polycystic liver disease (ADPLD), but the precise role the protein has in the pathogenesis of this disease is unknown []. ; PDB: 3AIH_A.
Probab=22.12 E-value=50 Score=18.08 Aligned_cols=16 Identities=25% Similarity=0.395 Sum_probs=10.7
Q ss_pred CCEEEeCCCCeEEEEe
Q psy12933 39 GDMLYIPPKVWHHVRS 54 (65)
Q Consensus 39 Gd~LyiP~~WwH~V~s 54 (65)
|.|++...|||..--.
T Consensus 1 ~~C~~~~~g~WtYe~C 16 (81)
T PF07915_consen 1 GQCLYRSEGWWTYEFC 16 (81)
T ss_dssp --EEEEEETTEEEEEE
T ss_pred CcceecCCCEEEEEEC
Confidence 6788888999975433
No 194
>KOG1633|consensus
Probab=22.00 E-value=48 Score=26.79 Aligned_cols=31 Identities=19% Similarity=0.142 Sum_probs=25.7
Q ss_pred CCCCCceEEEEECCCCEEEeCCCCeEEEEec
Q psy12933 25 NQTPGHLIECTLNPGDMLYIPPKVWHHVRSL 55 (65)
Q Consensus 25 ~~~~~~~~~~~l~pGd~LyiP~~WwH~V~s~ 55 (65)
...........+++|+.+.+|++|-|++..-
T Consensus 541 ~~~~~~~~~~~~~~g~~~~~P~~~i~~~~~p 571 (776)
T KOG1633|consen 541 DSANCDDKRDVLKEGETGLIPAGPIHAVLTP 571 (776)
T ss_pred ccccCCCccccccCCCcccCCCCcccccccc
Confidence 3344567788999999999999999999865
No 195
>TIGR00022 uncharacterized protein, YhcH/YjgK/YiaL family. This family consists of conserved hypothetical proteins, about 150 amino acids in length. Members with limited information include YhcH, a possible sugar isomerase of sialic acid catabolism, and YjgK.
Probab=21.75 E-value=1.4e+02 Score=18.48 Aligned_cols=26 Identities=19% Similarity=0.119 Sum_probs=21.9
Q ss_pred CCceEEEEECCCCEEEeCCCCeEEEE
Q psy12933 28 PGHLIECTLNPGDMLYIPPKVWHHVR 53 (65)
Q Consensus 28 ~~~~~~~~l~pGd~LyiP~~WwH~V~ 53 (65)
......+.|.||+.+.+-|+=.|+..
T Consensus 109 ~~~~~~i~l~~G~faiffP~D~H~P~ 134 (142)
T TIGR00022 109 IDDEQTVILKPGMFAVFYPGEPHKPG 134 (142)
T ss_pred CCCceEEEeCCCcEEEECCCCccccc
Confidence 35678999999999998888899864
No 196
>PF02563 Poly_export: Polysaccharide biosynthesis/export protein; InterPro: IPR003715 The extracellular polysaccharide colanic acid (CA) is produced by species of the family Enterobacteriaceae. In Escherichia coli (strain K12) the CA cluster comprises 19 genes. The wzx gene encodes a protein with multiple transmembrane segments that may function in export of the CA repeat unit from the cytoplasm into the periplasm in a process analogous to O-unit export. The CA gene clusters may be involved in the export of polysaccharide from the cell [].; GO: 0015159 polysaccharide transmembrane transporter activity, 0015774 polysaccharide transport, 0016020 membrane; PDB: 2W8I_E 2W8H_E 2J58_D.
Probab=21.71 E-value=55 Score=18.29 Aligned_cols=13 Identities=54% Similarity=0.805 Sum_probs=4.8
Q ss_pred EEEEECCCCEEEe
Q psy12933 32 IECTLNPGDMLYI 44 (65)
Q Consensus 32 ~~~~l~pGd~Lyi 44 (65)
-+-.|.|||.|-|
T Consensus 9 ~~y~l~pGD~l~i 21 (82)
T PF02563_consen 9 PEYRLGPGDVLRI 21 (82)
T ss_dssp ------TT-EEEE
T ss_pred CCCEECCCCEEEE
Confidence 3457889998875
No 197
>cd04721 BAH_plant_1 BAH, or Bromo Adjacent Homology domain, plant-specific sub-family with unknown function. BAH domains are found in a variety of proteins playing roles in transcriptional silencing and the remodeling of chromatin. It is assumed that in most or all of these instances the BAH domain mediates protein-protein interactions.
Probab=21.45 E-value=74 Score=19.70 Aligned_cols=11 Identities=27% Similarity=0.667 Sum_probs=9.5
Q ss_pred EEECCCCEEEe
Q psy12933 34 CTLNPGDMLYI 44 (65)
Q Consensus 34 ~~l~pGd~Lyi 44 (65)
.++++||++||
T Consensus 6 ~~i~vGD~V~v 16 (130)
T cd04721 6 VTISVHDFVYV 16 (130)
T ss_pred EEEECCCEEEE
Confidence 45899999999
No 198
>PF13533 Biotin_lipoyl_2: Biotin-lipoyl like
Probab=21.44 E-value=85 Score=15.99 Aligned_cols=8 Identities=38% Similarity=0.787 Sum_probs=5.0
Q ss_pred ECCCCEEE
Q psy12933 36 LNPGDMLY 43 (65)
Q Consensus 36 l~pGd~Ly 43 (65)
++.||.|+
T Consensus 24 VkkGd~L~ 31 (50)
T PF13533_consen 24 VKKGDVLL 31 (50)
T ss_pred EcCCCEEE
Confidence 45667665
No 199
>PF06525 SoxE: Sulfocyanin (SoxE); InterPro: IPR010532 Members of this family are blue-copper redox proteins designated sulfocyanin, from the archaeal genera Sulfolobus, Ferroplasma, and Picrophilus. The most closely related proteins characterised as functionally different are the rusticyanins.
Probab=21.41 E-value=50 Score=22.40 Aligned_cols=16 Identities=13% Similarity=0.264 Sum_probs=12.0
Q ss_pred EEEeCCCCeEEEEecC
Q psy12933 41 MLYIPPKVWHHVRSLS 56 (65)
Q Consensus 41 ~LyiP~~WwH~V~s~~ 56 (65)
.||+|+||==+++-..
T Consensus 87 ~i~VPAGw~V~i~f~N 102 (196)
T PF06525_consen 87 TIYVPAGWNVQITFTN 102 (196)
T ss_pred EEEEcCCCEEEEEEEc
Confidence 6999999976665443
No 200
>COG4340 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=21.34 E-value=97 Score=21.40 Aligned_cols=30 Identities=20% Similarity=0.374 Sum_probs=23.2
Q ss_pred CCCCceEEEEECCCCEEEeC-CCCeEEEEec
Q psy12933 26 QTPGHLIECTLNPGDMLYIP-PKVWHHVRSL 55 (65)
Q Consensus 26 ~~~~~~~~~~l~pGd~LyiP-~~WwH~V~s~ 55 (65)
-..++...-+|+|||++++- ..-||.+...
T Consensus 169 ~~~~p~f~kvl~pGe~~~l~Dh~~~H~~tpi 199 (226)
T COG4340 169 DGASPGFFKVLAPGEAVFLDDHRVLHGVTPI 199 (226)
T ss_pred CCCCcceEEeccCCcEEEeccchhcccccce
Confidence 45677888999999999985 4678877654
No 201
>PF04074 DUF386: Domain of unknown function (DUF386); InterPro: IPR004375 This family consists of conserved hypothetical proteins, about 150 amino acids in length, with no known function. The family is restricted to the bacteria. It includes three members in Escherichia coli (strain K12) and three in Streptococcus pneumoniae.; PDB: 1S4C_B 1JOP_B.
Probab=21.24 E-value=1.5e+02 Score=18.46 Aligned_cols=25 Identities=28% Similarity=0.264 Sum_probs=16.9
Q ss_pred CceEEEEECCCCEEEeCCCCeEEEE
Q psy12933 29 GHLIECTLNPGDMLYIPPKVWHHVR 53 (65)
Q Consensus 29 ~~~~~~~l~pGd~LyiP~~WwH~V~ 53 (65)
.....+.|+||+.+.+=|+=.|+..
T Consensus 110 ~~~~~i~l~~g~f~iffP~d~H~p~ 134 (153)
T PF04074_consen 110 KNESFITLKPGDFAIFFPEDAHRPG 134 (153)
T ss_dssp TTEEEEEE-TTEEEEE-TT--EEEE
T ss_pred CCceEEEEcCCEEEEECCCcccccc
Confidence 4456788999999998888899965
No 202
>TIGR02998 RraA_entero regulator of ribonuclease activity A. THIS PROTEIN IS _NOT_ MenG, AKA S-adenosylmethionine: 2-demethylmenaquinone methyltransferase (EC 2.1.-.-). See the references characterizing this as a case of transitive annotation error .
Probab=21.10 E-value=79 Score=20.55 Aligned_cols=11 Identities=36% Similarity=0.709 Sum_probs=8.4
Q ss_pred EEEECCCCEEE
Q psy12933 33 ECTLNPGDMLY 43 (65)
Q Consensus 33 ~~~l~pGd~Ly 43 (65)
-++++|||.++
T Consensus 135 gv~V~PGD~Iv 145 (161)
T TIGR02998 135 GVTFFPDDYIY 145 (161)
T ss_pred CEEECCCCEEE
Confidence 46788998776
No 203
>PRK09372 ribonuclease activity regulator protein RraA; Provisional
Probab=21.00 E-value=80 Score=20.37 Aligned_cols=10 Identities=60% Similarity=1.062 Sum_probs=8.1
Q ss_pred EEECCCCEEE
Q psy12933 34 CTLNPGDMLY 43 (65)
Q Consensus 34 ~~l~pGd~Ly 43 (65)
++++|||+++
T Consensus 136 v~V~PGD~I~ 145 (159)
T PRK09372 136 VTFFPGDYLY 145 (159)
T ss_pred EEECCCCEEE
Confidence 6888998876
No 204
>PF13550 Phage-tail_3: Putative phage tail protein
Probab=20.66 E-value=57 Score=19.71 Aligned_cols=14 Identities=29% Similarity=0.669 Sum_probs=11.2
Q ss_pred EEECCCCEEEeCCC
Q psy12933 34 CTLNPGDMLYIPPK 47 (65)
Q Consensus 34 ~~l~pGd~LyiP~~ 47 (65)
..|+|||++-++..
T Consensus 138 ~~l~pGDvi~l~~~ 151 (164)
T PF13550_consen 138 LALEPGDVIALSDD 151 (164)
T ss_pred ccCCCCCEEEEEeC
Confidence 55899999988754
No 205
>cd02778 MopB_CT_Thiosulfate-R-like The MopB_CT_Thiosulfate-R-like CD contains thiosulfate-, sulfur-, and polysulfide-reductases, and other related proteins. Thiosulfate reductase catalyzes the cleavage of sulfur-sulfur bonds in thiosulfate. Polysulfide reductase is a membrane-bound enzyme that catalyzes the reduction of polysulfide using either hydrogen or formate as the electron donor. Also included in this CD is the phenylacetyl-CoA:acceptor oxidoreductase, large subunit (PadB2), which has been characterized as a membrane-bound molybdenum-iron-sulfur enzyme involved in anaerobic metabolism of phenylalanine in the denitrifying bacterium Thauera aromatica. The MopB_CT_Thiosulfate-R-like CD is of the conserved molybdopterin_binding C-terminal (MopB_CT) region present in many, but not all, MopB homologs.
Probab=20.52 E-value=57 Score=18.97 Aligned_cols=12 Identities=25% Similarity=0.562 Sum_probs=9.8
Q ss_pred CEEEeCCCCeEE
Q psy12933 40 DMLYIPPKVWHH 51 (65)
Q Consensus 40 d~LyiP~~WwH~ 51 (65)
.++++|.+|+|.
T Consensus 72 g~v~~~~g~g~~ 83 (123)
T cd02778 72 DTVFMPHGFGHW 83 (123)
T ss_pred CEEEEeccCCcC
Confidence 579999999764
No 206
>PF01568 Molydop_binding: Molydopterin dinucleotide binding domain; InterPro: IPR006657 A domain in this entry corresponds to the C-terminal domain IV in dimethyl sulphoxide (DMSO)reductase which interacts with the 2-amino pyrimidone ring of both molybdopterin guanine dinucleotide molecules [].; GO: 0016491 oxidoreductase activity, 0030151 molybdenum ion binding, 0055114 oxidation-reduction process; PDB: 2IVF_A 1OGY_G 3ML1_A 3O5A_A 1TI2_G 1VLE_M 1VLD_U 1VLF_O 1TI4_I 1TI6_E ....
Probab=20.47 E-value=30 Score=19.74 Aligned_cols=11 Identities=9% Similarity=0.259 Sum_probs=10.2
Q ss_pred CEEEeCCCCeE
Q psy12933 40 DMLYIPPKVWH 50 (65)
Q Consensus 40 d~LyiP~~WwH 50 (65)
+++++|.+||+
T Consensus 72 g~v~~~~~~~~ 82 (110)
T PF01568_consen 72 GVVFMPHGWGG 82 (110)
T ss_dssp TEEEEESTHTT
T ss_pred CEEEEeccccc
Confidence 78999999998
No 207
>PF00568 WH1: WH1 domain; InterPro: IPR000697 The EVH1 (WH1, RanBP1-WASP) domain is found in multi-domain proteins implicated in a diverse range of signalling, nuclear transport and cytoskeletal events. This domain of around 115 amino acids is present in species ranging from yeast to mammals. Many EVH1-containing proteins associate with actin-based structures and play a role in cytoskeletal organisation. EVH1 domains recognise and bind the proline-rich motif FPPPP with low-affinity, further interactions then form between flanking residues [][]. WASP family proteins contain a EVH1 (WH1) in their N-terminals which bind proline-rich sequences in the WASP interacting protein. Proteins of the RanBP1 family contain a WH1 domain in their N-terminal region, which seems to bind a different sequence motif present in the C-terminal part of RanGTP protein [,]. Tertiary structure of the WH1 domain of the Mena protein revealed structure similarities with the pleckstrin homology (PH) domain. The overall fold consists of a compact parallel beta-sandwich, closed along one edge by a long alpha-helix. A highly conserved cluster of three surface-exposed aromatic side-chains forms the recognition site for the molecules target ligands. [].; GO: 0005515 protein binding; PDB: 1I2H_A 1DDV_A 1DDW_A 1EGX_A 3SYX_A 1TJ6_B 1XOD_B 1EVH_A 1I7A_B 2JP2_A ....
Probab=20.41 E-value=2e+02 Score=16.92 Aligned_cols=37 Identities=16% Similarity=0.195 Sum_probs=30.2
Q ss_pred CCCceEEEEECCCCEEEeCCCCeEEEEecCCeEEEEE
Q psy12933 27 TPGHLIECTLNPGDMLYIPPKVWHHVRSLSTSISVSF 63 (65)
Q Consensus 27 ~~~~~~~~~l~pGd~LyiP~~WwH~V~s~~~sisvn~ 63 (65)
.+....++.|.+|=.-.-+...+|+-+..+..+.+||
T Consensus 58 ~~~~v~e~~l~~~~~Y~~~~~~Fh~f~~~~~~~GLnF 94 (111)
T PF00568_consen 58 DGKVVWEQELYPGFVYTKARPFFHQFEDDDCVYGLNF 94 (111)
T ss_dssp TTEEEEEEEESTT-EEEEESSSEEEEEETTCEEEEEE
T ss_pred ccEEEEEeEecCCCEEEeCCCcEEEEEeCCeEEEEec
Confidence 4556888899998766678999999999888888887
No 208
>PF08255 Leader_Trp: Trp-operon Leader Peptide; InterPro: IPR013205 The tryptophan operon regulatory region of Citrobacter freundii (leader transcript) encodes a 14-residue peptide containing characteristic tandem tryptophan residues. It is about 10 nucleotides shorter than those of Escherichia coli and Salmonella typhimurium [].
Probab=20.10 E-value=53 Score=13.28 Aligned_cols=10 Identities=10% Similarity=0.315 Sum_probs=6.4
Q ss_pred EEeCCCCeEE
Q psy12933 42 LYIPPKVWHH 51 (65)
Q Consensus 42 LyiP~~WwH~ 51 (65)
+|.=.+|||-
T Consensus 4 ~~~L~~WWrt 13 (14)
T PF08255_consen 4 TFSLHGWWRT 13 (14)
T ss_pred EEEEeeEEEc
Confidence 4555678873
Done!