Query psy840
Match_columns 178
No_of_seqs 193 out of 838
Neff 5.7
Searched_HMMs 46136
Date Fri Aug 16 17:45:58 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy840.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/840hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG2107|consensus 100.0 3.5E-66 7.6E-71 412.8 12.5 172 1-173 1-178 (179)
2 PF03079 ARD: ARD/ARD' family; 100.0 2.1E-49 4.6E-54 316.7 13.0 150 3-153 1-157 (157)
3 COG1791 Uncharacterized conser 100.0 6.1E-46 1.3E-50 298.1 15.8 168 1-173 1-180 (181)
4 COG0662 {ManC} Mannose-6-phosp 99.1 3.8E-10 8.2E-15 86.6 7.3 61 82-147 50-111 (127)
5 COG1917 Uncharacterized conser 99.1 5.8E-10 1.3E-14 85.0 8.0 63 80-147 55-118 (131)
6 PF07883 Cupin_2: Cupin domain 99.1 4.5E-10 9.7E-15 75.9 6.5 61 80-145 10-71 (71)
7 PRK04190 glucose-6-phosphate i 98.9 2.8E-08 6E-13 82.0 11.5 85 58-148 68-157 (191)
8 TIGR03037 anthran_nbaC 3-hydro 98.8 1.4E-08 2.9E-13 81.8 7.8 59 81-140 41-99 (159)
9 smart00835 Cupin_1 Cupin. This 98.8 5.4E-08 1.2E-12 75.7 10.1 69 80-148 42-112 (146)
10 TIGR03404 bicupin_oxalic bicup 98.8 7.6E-08 1.6E-12 86.3 11.3 69 80-148 257-326 (367)
11 PF00190 Cupin_1: Cupin; Inte 98.8 7.4E-08 1.6E-12 74.8 9.5 69 79-147 45-120 (144)
12 PRK13264 3-hydroxyanthranilate 98.7 3.5E-08 7.6E-13 80.6 7.5 55 81-136 47-101 (177)
13 PF02311 AraC_binding: AraC-li 98.6 8.5E-08 1.8E-12 70.2 6.3 59 79-142 14-72 (136)
14 TIGR03214 ura-cupin putative a 98.6 1.9E-07 4.2E-12 79.9 8.9 55 83-142 195-249 (260)
15 PRK13290 ectC L-ectoine syntha 98.6 1.7E-07 3.7E-12 72.4 7.5 61 81-147 48-109 (125)
16 COG2140 Thermophilic glucose-6 98.6 4E-07 8.7E-12 76.1 9.2 69 82-150 94-165 (209)
17 PRK09943 DNA-binding transcrip 98.6 2.2E-07 4.8E-12 75.0 7.4 61 82-147 122-182 (185)
18 PF06560 GPI: Glucose-6-phosph 98.5 1.2E-06 2.6E-11 72.0 9.1 92 50-148 43-147 (182)
19 TIGR03404 bicupin_oxalic bicup 98.4 1.6E-06 3.4E-11 77.9 9.2 66 81-147 80-145 (367)
20 TIGR01479 GMP_PMI mannose-1-ph 98.3 2.2E-06 4.8E-11 78.8 7.6 63 81-148 389-452 (468)
21 PRK13500 transcriptional activ 98.2 2.5E-06 5.4E-11 73.9 6.6 54 80-138 60-113 (312)
22 PRK11171 hypothetical protein; 98.2 7E-06 1.5E-10 70.5 9.2 52 84-140 201-252 (266)
23 PRK13501 transcriptional activ 98.2 2.8E-06 6E-11 72.3 6.1 51 81-136 31-81 (290)
24 PLN00212 glutelin; Provisional 98.2 7.4E-06 1.6E-10 76.3 8.4 69 79-147 91-184 (493)
25 PF02041 Auxin_BP: Auxin bindi 98.2 1.3E-05 2.8E-10 64.3 8.4 74 81-154 57-135 (167)
26 PRK15460 cpsB mannose-1-phosph 98.1 1.1E-05 2.4E-10 74.8 8.8 61 82-147 399-460 (478)
27 PRK15457 ethanolamine utilizat 98.1 7.7E-06 1.7E-10 69.5 6.8 44 84-132 171-214 (233)
28 PRK10296 DNA-binding transcrip 98.1 7.6E-06 1.7E-10 68.9 6.8 49 80-133 35-83 (278)
29 PRK13503 transcriptional activ 98.1 4.2E-06 9.2E-11 70.0 5.1 52 80-136 27-78 (278)
30 COG4297 Uncharacterized protei 98.1 2E-05 4.3E-10 62.5 8.3 95 45-157 34-129 (163)
31 PRK10371 DNA-binding transcrip 98.1 6.2E-06 1.4E-10 71.3 6.1 51 81-136 39-89 (302)
32 PRK13502 transcriptional activ 98.1 8.1E-06 1.8E-10 68.8 6.4 52 81-137 31-82 (282)
33 COG4101 Predicted mannose-6-ph 98.1 2.9E-05 6.3E-10 60.3 8.7 74 59-144 47-121 (142)
34 PF01050 MannoseP_isomer: Mann 98.0 2.7E-05 5.9E-10 62.0 8.3 80 53-148 59-139 (151)
35 COG3837 Uncharacterized conser 98.0 1.5E-05 3.3E-10 64.1 6.6 65 81-150 56-123 (161)
36 TIGR02297 HpaA 4-hydroxyphenyl 98.0 1.5E-05 3.3E-10 67.0 6.2 58 81-143 36-94 (287)
37 TIGR03214 ura-cupin putative a 98.0 3.9E-05 8.5E-10 65.7 8.7 63 83-150 74-140 (260)
38 PRK11171 hypothetical protein; 98.0 4.3E-05 9.4E-10 65.6 8.9 51 87-142 82-132 (266)
39 TIGR02451 anti_sig_ChrR anti-s 97.8 4.3E-05 9.3E-10 63.9 5.7 72 58-148 127-198 (215)
40 COG3435 Gentisate 1,2-dioxygen 97.8 2.7E-05 5.8E-10 68.6 4.3 73 60-136 72-156 (351)
41 PF12973 Cupin_7: ChrR Cupin-l 97.7 7.1E-05 1.5E-09 53.9 5.6 59 59-136 25-83 (91)
42 TIGR02272 gentisate_1_2 gentis 97.7 0.0001 2.2E-09 65.8 6.7 56 81-140 94-149 (335)
43 PLN00212 glutelin; Provisional 97.6 0.00083 1.8E-08 62.7 11.1 86 62-148 332-429 (493)
44 PF05899 Cupin_3: Protein of u 97.6 0.00013 2.7E-09 51.2 4.4 49 82-135 20-68 (74)
45 COG4766 EutQ Ethanolamine util 97.5 0.00045 9.8E-09 55.7 6.9 74 56-136 85-161 (176)
46 TIGR02272 gentisate_1_2 gentis 97.4 0.0013 2.8E-08 58.7 10.1 88 44-136 214-313 (335)
47 PF06052 3-HAO: 3-hydroxyanthr 97.4 0.00099 2.2E-08 53.3 8.2 54 82-136 47-100 (151)
48 PF05523 FdtA: WxcM-like, C-te 97.1 0.0026 5.6E-08 49.3 7.8 57 81-138 46-104 (131)
49 PF06249 EutQ: Ethanolamine ut 97.1 0.00073 1.6E-08 54.2 4.7 45 86-135 93-137 (152)
50 PF12852 Cupin_6: Cupin 96.8 0.003 6.4E-08 50.5 5.8 44 90-136 37-80 (186)
51 COG3450 Predicted enzyme of th 96.6 0.0038 8.3E-08 48.0 4.8 50 82-135 56-106 (116)
52 PRK10572 DNA-binding transcrip 96.6 0.005 1.1E-07 52.1 6.0 51 81-136 42-92 (290)
53 PF04209 HgmA: homogentisate 1 96.6 0.0044 9.6E-08 57.0 5.9 57 85-146 143-199 (424)
54 TIGR01015 hmgA homogentisate 1 96.3 0.015 3.3E-07 53.5 7.4 56 86-146 146-201 (429)
55 PRK05341 homogentisate 1,2-dio 96.0 0.024 5.1E-07 52.4 7.5 56 86-146 152-208 (438)
56 PF02373 JmjC: JmjC domain, hy 96.0 0.015 3.3E-07 42.2 5.1 29 108-136 78-106 (114)
57 PLN02658 homogentisate 1,2-dio 95.7 0.039 8.4E-07 51.0 7.5 55 86-145 145-200 (435)
58 PRK09685 DNA-binding transcrip 95.2 0.058 1.3E-06 45.7 6.3 49 89-142 72-120 (302)
59 PF06339 Ectoine_synth: Ectoin 95.1 0.12 2.6E-06 40.3 7.3 58 87-149 54-111 (126)
60 COG3435 Gentisate 1,2-dioxygen 94.6 0.061 1.3E-06 47.8 5.1 95 44-149 225-335 (351)
61 PF13621 Cupin_8: Cupin-like d 94.5 0.12 2.5E-06 42.0 6.2 68 80-147 143-247 (251)
62 PF14525 AraC_binding_2: AraC- 94.4 0.14 3.1E-06 38.9 6.2 50 90-144 57-106 (172)
63 KOG3995|consensus 94.3 0.068 1.5E-06 45.5 4.4 49 82-131 47-95 (279)
64 PF11699 CENP-C_C: Mif2/CENP-C 93.7 0.28 6E-06 35.6 6.2 68 57-138 11-78 (85)
65 COG3257 GlxB Uncharacterized p 93.5 0.23 5E-06 42.5 6.2 43 89-136 84-126 (264)
66 PF00908 dTDP_sugar_isom: dTDP 93.5 0.73 1.6E-05 37.6 9.0 56 82-137 57-124 (176)
67 COG3822 ABC-type sugar transpo 93.2 0.13 2.8E-06 43.1 4.2 59 79-137 97-179 (225)
68 PF08007 Cupin_4: Cupin superf 93.1 0.51 1.1E-05 41.4 8.1 56 80-135 127-200 (319)
69 PF02678 Pirin: Pirin; InterP 93.1 0.75 1.6E-05 34.6 7.8 63 79-145 40-106 (107)
70 PF14499 DUF4437: Domain of un 92.8 0.16 3.5E-06 43.8 4.4 52 81-136 49-100 (251)
71 COG3257 GlxB Uncharacterized p 92.5 0.27 5.9E-06 42.1 5.2 52 85-141 200-251 (264)
72 PRK12335 tellurite resistance 92.4 0.88 1.9E-05 39.0 8.4 76 74-149 17-96 (287)
73 COG3508 HmgA Homogentisate 1,2 92.0 0.55 1.2E-05 42.7 6.9 45 88-137 146-190 (427)
74 KOG2757|consensus 91.8 0.37 7.9E-06 43.9 5.6 57 87-148 352-408 (411)
75 PF04962 KduI: KduI/IolB famil 90.8 0.44 9.6E-06 41.1 4.9 48 80-130 165-228 (261)
76 TIGR01221 rmlC dTDP-4-dehydror 90.5 2.7 5.8E-05 34.3 9.0 58 80-137 56-124 (176)
77 PF07385 DUF1498: Protein of u 90.1 0.59 1.3E-05 39.8 4.9 59 79-137 98-180 (225)
78 PF09313 DUF1971: Domain of un 89.3 3.6 7.8E-05 29.6 7.9 59 78-136 13-75 (82)
79 COG1482 ManA Phosphomannose is 86.0 0.53 1.2E-05 41.9 2.3 23 112-134 159-181 (312)
80 COG1898 RfbC dTDP-4-dehydrorha 85.9 6.6 0.00014 32.1 8.4 58 81-138 58-125 (173)
81 TIGR00218 manA mannose-6-phosp 85.8 0.41 8.8E-06 41.7 1.5 20 112-131 152-171 (302)
82 PF13759 2OG-FeII_Oxy_5: Putat 85.4 1.1 2.5E-05 32.4 3.4 58 78-135 10-90 (101)
83 COG1741 Pirin-related protein 85.0 3.6 7.7E-05 36.0 6.9 67 79-149 55-126 (276)
84 PF14499 DUF4437: Domain of un 83.7 0.96 2.1E-05 39.1 2.8 60 81-143 184-243 (251)
85 PF06865 DUF1255: Protein of u 83.1 4.5 9.8E-05 30.0 5.8 45 89-136 42-86 (94)
86 TIGR00218 manA mannose-6-phosp 82.4 2.4 5.2E-05 36.8 4.8 38 88-130 253-290 (302)
87 PRK10579 hypothetical protein; 81.6 6.5 0.00014 29.3 6.1 45 89-136 42-86 (94)
88 PF06172 Cupin_5: Cupin superf 79.0 16 0.00035 28.7 8.0 73 80-153 53-132 (139)
89 PRK15131 mannose-6-phosphate i 79.0 4.2 9.1E-05 37.1 5.3 38 88-130 339-376 (389)
90 PRK15131 mannose-6-phosphate i 77.7 1.8 3.8E-05 39.5 2.5 23 112-134 238-260 (389)
91 COG2850 Uncharacterized conser 71.8 2.2 4.8E-05 38.9 1.6 25 112-136 180-204 (383)
92 COG1482 ManA Phosphomannose is 69.8 9.6 0.00021 34.0 5.1 42 88-134 260-301 (312)
93 PRK00924 5-keto-4-deoxyuronate 67.5 15 0.00032 32.3 5.7 57 80-136 190-250 (276)
94 cd00038 CAP_ED effector domain 66.7 24 0.00053 23.7 5.8 37 88-124 35-72 (115)
95 PRK15186 AraC family transcrip 64.1 17 0.00037 31.7 5.5 43 90-136 40-82 (291)
96 PRK11753 DNA-binding transcrip 62.9 43 0.00094 26.3 7.3 36 88-123 38-74 (211)
97 KOG3706|consensus 62.6 7 0.00015 37.2 2.9 40 108-147 378-417 (629)
98 COG3123 Uncharacterized protei 62.6 20 0.00044 26.4 4.7 46 88-136 41-86 (94)
99 PRK09391 fixK transcriptional 58.7 53 0.0011 26.8 7.3 46 89-134 57-103 (230)
100 PF00027 cNMP_binding: Cyclic 58.2 36 0.00078 22.3 5.3 36 88-123 17-53 (91)
101 PLN02288 mannose-6-phosphate i 57.0 18 0.00038 33.2 4.5 40 86-128 352-391 (394)
102 KOG3416|consensus 56.8 20 0.00044 28.2 4.2 56 87-153 34-93 (134)
103 PRK13918 CRP/FNR family transc 55.7 46 0.001 26.0 6.3 36 89-124 27-63 (202)
104 PF11142 DUF2917: Protein of u 54.2 31 0.00068 23.3 4.3 39 92-133 20-58 (63)
105 PF06719 AraC_N: AraC-type tra 52.3 84 0.0018 24.4 7.1 58 84-146 19-79 (155)
106 smart00100 cNMP Cyclic nucleot 50.4 49 0.0011 22.2 5.0 37 89-125 36-73 (120)
107 PRK10402 DNA-binding transcrip 49.2 1.4E+02 0.0031 24.1 8.3 36 89-124 50-86 (226)
108 PLN02288 mannose-6-phosphate i 49.0 16 0.00034 33.5 2.9 23 112-134 252-274 (394)
109 cd00214 Calpain_III Calpain, s 48.8 37 0.0008 26.3 4.6 34 111-149 111-144 (150)
110 KOG1417|consensus 48.7 57 0.0012 29.5 6.2 51 92-145 157-207 (446)
111 cd06919 Asp_decarbox Aspartate 46.9 10 0.00022 29.1 1.1 58 45-125 28-88 (111)
112 PF05726 Pirin_C: Pirin C-term 46.4 1E+02 0.0022 22.3 6.4 53 88-149 20-72 (104)
113 TIGR02466 conserved hypothetic 46.0 79 0.0017 26.2 6.4 87 45-134 74-185 (201)
114 PF05721 PhyH: Phytanoyl-CoA d 45.9 22 0.00047 27.2 2.9 28 108-135 177-204 (211)
115 PF05995 CDO_I: Cysteine dioxy 44.0 1.7E+02 0.0036 23.4 10.3 70 79-148 86-165 (175)
116 COG3097 Uncharacterized protei 43.2 49 0.0011 24.8 4.2 45 94-149 16-60 (106)
117 PRK05449 aspartate alpha-decar 41.5 13 0.00029 29.0 1.0 58 45-125 29-89 (126)
118 TIGR00223 panD L-aspartate-alp 41.4 13 0.00029 29.0 1.0 59 45-126 29-90 (126)
119 PRK09392 ftrB transcriptional 41.4 64 0.0014 26.1 5.1 34 89-122 49-82 (236)
120 PF07653 SH3_2: Variant SH3 do 41.1 35 0.00075 21.7 2.8 34 113-154 16-49 (55)
121 PF05962 HutD: HutD; InterPro 40.1 36 0.00077 27.7 3.4 49 88-143 135-183 (184)
122 COG3806 ChrR Transcriptional a 38.7 49 0.0011 28.0 4.0 55 79-142 139-193 (216)
123 PF13640 2OG-FeII_Oxy_3: 2OG-F 38.5 63 0.0014 22.5 4.1 57 78-134 9-86 (100)
124 COG0361 InfA Translation initi 37.6 67 0.0015 22.9 4.0 12 112-123 44-55 (75)
125 PRK10202 ebgC cryptic beta-D-g 37.6 2E+02 0.0043 22.5 7.9 45 88-132 65-127 (149)
126 PF01987 AIM24: Mitochondrial 37.5 50 0.0011 26.8 3.9 34 93-127 134-167 (215)
127 PF04622 ERG2_Sigma1R: ERG2 an 36.9 1.2E+02 0.0026 25.6 6.2 61 85-153 116-176 (216)
128 TIGR03697 NtcA_cyano global ni 36.3 86 0.0019 24.1 5.0 34 89-122 12-46 (193)
129 COG0664 Crp cAMP-binding prote 36.1 69 0.0015 24.5 4.4 39 89-127 42-81 (214)
130 TIGR03805 beta_helix_1 paralle 35.2 22 0.00047 31.3 1.5 16 114-129 7-22 (314)
131 TIGR02408 ectoine_ThpD ectoine 34.7 68 0.0015 27.4 4.4 36 111-146 211-249 (277)
132 smart00720 calpain_III calpain 34.7 80 0.0017 23.9 4.4 33 112-149 107-139 (143)
133 PHA02890 hypothetical protein; 34.2 2.2E+02 0.0047 25.1 7.3 58 90-149 92-151 (278)
134 PHA02984 hypothetical protein; 33.1 2.3E+02 0.0049 25.1 7.3 58 89-147 92-152 (286)
135 PF04773 FecR: FecR protein; 32.9 1.6E+02 0.0035 20.1 6.2 57 89-147 39-95 (98)
136 PRK11161 fumarate/nitrate redu 32.8 1.8E+02 0.0039 23.3 6.5 35 89-123 56-91 (235)
137 TIGR03027 pepcterm_export puta 32.8 29 0.00063 27.3 1.7 15 113-127 150-164 (165)
138 cd05793 S1_IF1A S1_IF1A: Trans 32.2 95 0.0021 21.8 4.1 12 112-123 36-47 (77)
139 PRK00924 5-keto-4-deoxyuronate 31.6 65 0.0014 28.3 3.8 45 83-132 68-114 (276)
140 smart00652 eIF1a eukaryotic tr 30.4 1E+02 0.0022 22.0 4.1 28 96-123 15-52 (83)
141 PLN02868 acyl-CoA thioesterase 30.4 1.1E+02 0.0024 27.5 5.3 35 89-123 50-84 (413)
142 PF01067 Calpain_III: Calpain 29.7 75 0.0016 23.7 3.5 33 112-149 111-143 (147)
143 TIGR00568 alkb DNA alkylation 28.2 1.3E+02 0.0029 24.1 4.9 40 92-131 125-166 (169)
144 PRK03606 ureidoglycolate hydro 28.0 3.2E+02 0.007 22.0 7.5 57 83-139 73-137 (162)
145 KOG1029|consensus 27.7 31 0.00067 34.9 1.2 28 95-124 694-721 (1118)
146 PF10983 DUF2793: Protein of u 27.1 1.3E+02 0.0027 21.9 4.1 41 116-157 29-74 (87)
147 PF04962 KduI: KduI/IolB famil 27.1 1.9E+02 0.0041 24.9 5.9 48 85-136 44-99 (261)
148 COG0853 PanD Aspartate 1-decar 27.0 24 0.00053 27.5 0.4 58 45-125 28-88 (126)
149 TIGR00156 conserved hypothetic 26.5 47 0.001 25.9 1.9 45 88-132 71-115 (126)
150 PF01238 PMI_typeI: Phosphoman 26.4 36 0.00078 30.7 1.4 21 113-133 252-272 (373)
151 PLN02156 gibberellin 2-beta-di 25.7 2.6E+02 0.0056 24.8 6.6 67 81-147 197-277 (335)
152 PF13464 DUF4115: Domain of un 25.5 2.2E+02 0.0048 19.3 7.0 44 94-137 4-48 (77)
153 PHA00672 hypothetical protein 24.7 2.1E+02 0.0046 22.7 5.2 55 80-140 59-113 (152)
154 PRK15401 alpha-ketoglutarate-d 24.7 1.5E+02 0.0033 24.9 4.7 41 92-132 146-188 (213)
155 PF02080 TrkA_C: TrkA-C domain 24.5 1E+02 0.0022 20.0 3.1 30 88-123 27-56 (71)
156 cd00248 Mth938-like Mth938-lik 24.1 85 0.0018 23.2 2.8 27 97-131 6-32 (109)
157 PF13532 2OG-FeII_Oxy_2: 2OG-F 23.8 1.8E+02 0.0039 22.6 4.9 36 93-128 128-165 (194)
158 PTZ00273 oxidase reductase; Pr 23.4 1.8E+02 0.0038 25.3 5.1 64 82-146 196-274 (320)
159 PLN02904 oxidoreductase 23.2 2.5E+02 0.0054 25.1 6.1 67 81-148 225-305 (357)
160 PF02261 Asp_decarbox: Asparta 22.6 18 0.00039 27.9 -1.1 57 45-124 29-88 (116)
161 TIGR00192 urease_beta urease, 22.3 80 0.0017 23.8 2.3 30 115-144 55-84 (101)
162 KOG0498|consensus 22.2 1.2E+02 0.0026 30.2 4.2 35 88-122 460-494 (727)
163 TIGR00523 eIF-1A eukaryotic/ar 22.1 1.7E+02 0.0036 21.7 4.0 28 96-123 29-66 (99)
164 PRK13201 ureB urease subunit b 21.9 78 0.0017 25.0 2.3 29 116-144 56-84 (136)
165 PF04076 BOF: Bacterial OB fol 21.9 45 0.00097 24.9 0.9 44 88-131 48-91 (103)
166 PF08903 DUF1846: Domain of un 21.7 85 0.0018 29.6 2.8 56 30-87 126-196 (491)
167 PRK10053 hypothetical protein; 21.5 57 0.0012 25.5 1.5 37 88-124 75-111 (130)
168 PF04967 HTH_10: HTH DNA bindi 21.5 66 0.0014 21.2 1.6 19 22-40 19-37 (53)
169 COG3145 AlkB Alkylated DNA rep 21.3 2E+02 0.0043 24.0 4.7 40 93-132 137-178 (194)
170 KOG2130|consensus 21.2 75 0.0016 29.0 2.3 29 107-135 259-287 (407)
171 CHL00010 infA translation init 20.9 1.3E+02 0.0028 21.1 3.1 11 113-123 45-55 (78)
172 PRK04012 translation initiatio 20.6 1.8E+02 0.0038 21.7 3.9 28 96-123 31-68 (100)
173 PLN02403 aminocyclopropanecarb 20.4 3.2E+02 0.007 23.8 6.2 64 82-147 171-251 (303)
174 COG3111 Periplasmic protein wi 20.3 1.2E+02 0.0025 23.9 2.9 38 86-123 69-106 (128)
175 cd04456 S1_IF1A_like S1_IF1A_l 20.0 1.5E+02 0.0033 20.8 3.3 11 113-123 37-47 (78)
No 1
>KOG2107|consensus
Probab=100.00 E-value=3.5e-66 Score=412.80 Aligned_cols=172 Identities=55% Similarity=0.989 Sum_probs=168.5
Q ss_pred CeEEEEecCCCCCCCCCCCcCCCCCcCCHhhhhhhcCeeEEee------ChHHHHHHHHhcCCCeeeEEEEcCCCCCCHH
Q psy840 1 MVRVWYMDTDVKSDQRLEHHRTPKEYLDLQTLNKLTGVEYFNV------QDEVLTKLRKDRNYNYEDEITCSKDCLPNYE 74 (178)
Q Consensus 1 m~~aw~~d~~~~~d~r~ph~~~p~~~v~~~~L~~~lGv~~~~~------~~~~l~~l~~~~gy~~~Dvv~l~~~~~p~~e 74 (178)
||+|||||+.+|||||+|||.+|++.||+++| +++||+||++ .+++|++|++++||+.+|+++++++++|||+
T Consensus 1 m~qaw~mdd~~~~D~RlPhh~~p~~~vs~d~L-~~lGVly~kld~D~~e~~~~L~~lr~e~~~~~~d~~~~~~e~~~nfd 79 (179)
T KOG2107|consen 1 MMQAWYMDDSPCEDQRLPHHKDPKKEVSLDEL-ARLGVLYWKLDADNYELDEELDRLREERGYSYMDICTVCPETLPNFD 79 (179)
T ss_pred CeeEEEcCCCCcccccCCCCCCCcccCCHHHH-HhhCcEEEEecCchHHHHHHHHHHHHHcCCceeeEEEEchhhcccHH
Confidence 89999999998899999999999999999999 9999999999 4688999999999999999999999999999
Q ss_pred HHhhhhccceecCcceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCCCCcEEEEEEeecCCceeec
Q psy840 75 EKLKSFYTEHLHTDEEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDINNYVKAKRYFIGEPVWTPH 154 (178)
Q Consensus 75 ~~~~~f~~eH~H~~dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~~~~~~alrlF~~~~gW~~~ 154 (178)
+|+++||+||.|.+||||||++|+|+|+|++++|.||||.|++||+|+|||||+||||+++++++||||||.++|.|+|+
T Consensus 80 eKvk~FfEEhlh~deeiR~il~GtgYfDVrd~dd~WIRi~vekGDlivlPaGiyHRFTtt~~n~vkamRlF~~~p~wta~ 159 (179)
T KOG2107|consen 80 EKVKSFFEEHLHEDEEIRYILEGTGYFDVRDKDDQWIRIFVEKGDLIVLPAGIYHRFTTTPSNYVKAMRLFVGEPKWTAY 159 (179)
T ss_pred HHHHHHHHHhcCchhheEEEeecceEEeeccCCCCEEEEEEecCCEEEecCcceeeeecCchHHHHHHHHhcCCcccccC
Confidence 99999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred CCCCCchHHHHHHHHHHhh
Q psy840 155 NRPADDMECRLEYLKKIQN 173 (178)
Q Consensus 155 ~r~~d~~~~r~~yl~~~~~ 173 (178)
|||+|..++|+.||..|..
T Consensus 160 nR~~d~l~~r~~yl~~i~~ 178 (179)
T KOG2107|consen 160 NRPHDELPARKQYLNFISQ 178 (179)
T ss_pred CCccccchhHHHHHhhccc
Confidence 9999999999999998864
No 2
>PF03079 ARD: ARD/ARD' family; InterPro: IPR004313 The two acireductone dioxygenase enzymes (ARD and ARD', previously known as E-2 and E-2') from Klebsiella pneumoniae share the same amino acid sequence Q9ZFE7 from SWISSPROT, but bind different metal ions: ARD binds Ni2+, ARD' binds Fe2+ []. ARD and ARD' can be experimentally interconverted by removal of the bound metal ion and reconstitution with the appropriate metal ion. The two enzymes share the same substrate, 1,2-dihydroxy-3-keto-5-(methylthio)pentene, but yield different products. ARD' yields the alpha-keto precursor of methionine (and formate), thus forming part of the ubiquitous methionine salvage pathway that converts 5'-methylthioadenosine (MTA) to methionine. This pathway is responsible for the tight control of the concentration of MTA, which is a powerful inhibitor of polyamine biosynthesis and transmethylation reactions []. ARD yields methylthiopropanoate, carbon monoxide and formate, and thus prevents the conversion of MTA to methionine. The role of the ARD catalysed reaction is unclear: methylthiopropanoate is cytotoxic, and carbon monoxide can activate guanylyl cyclase, leading to increased intracellular cGMP levels [, ]. This family also contains other proteins, whose functions are not well characterised.; GO: 0010309 acireductone dioxygenase [iron(II)-requiring] activity, 0055114 oxidation-reduction process; PDB: 1VR3_A 1ZRR_A 2HJI_A.
Probab=100.00 E-value=2.1e-49 Score=316.66 Aligned_cols=150 Identities=50% Similarity=0.856 Sum_probs=129.3
Q ss_pred EEEEecCCCCCCCCCCCcCCCCCcCCHhhhhhhcCeeEEee------ChHHHHHHHHhcCCCeeeE-EEEcCCCCCCHHH
Q psy840 3 RVWYMDTDVKSDQRLEHHRTPKEYLDLQTLNKLTGVEYFNV------QDEVLTKLRKDRNYNYEDE-ITCSKDCLPNYEE 75 (178)
Q Consensus 3 ~aw~~d~~~~~d~r~ph~~~p~~~v~~~~L~~~lGv~~~~~------~~~~l~~l~~~~gy~~~Dv-v~l~~~~~p~~e~ 75 (178)
||||||+.+++|+|+||+++|++.+|+++| +++||++|++ ..++++.+++.++|...++ +...++++|++++
T Consensus 1 ~~~~~d~~~~~d~~~~~~~~p~~~~s~~~l-~~~~v~~w~~~~~~~~~~~~~~~~~~~~~~~~~~~dv~~~~~~~p~~~~ 79 (157)
T PF03079_consen 1 RAWYYDEEDPGDQRLPHHSDPDKIVSLLQL-AGLGVLYWKLDADDPEDAEELQIIRAYRNYIDRDIDVVSLHPDHPNYEA 79 (157)
T ss_dssp EEEEB-S--S-STCCEEE-SCHHCHHHHHC-CCTCEEEEE-SCGGTTS-HHHHHHHHCHCHHCCCCEEEESTTTSTCHHH
T ss_pred CEEEECCCCcccCCCcccCCcccccCHHHh-hCceEEEeecCCCccCCccHHHHHHHHcCCceEEEEEEecCCCCcchhH
Confidence 799999987799999999999999999999 9999999999 4578899999999988887 5555445799999
Q ss_pred HhhhhccceecCcceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCCCCcEEEEEEeecCCceee
Q psy840 76 KLKSFYTEHLHTDEEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDINNYVKAKRYFIGEPVWTP 153 (178)
Q Consensus 76 ~~~~f~~eH~H~~dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~~~~~~alrlF~~~~gW~~ 153 (178)
++++|+.||+|+++|||||++|+|+|+|++.+|.|+||.|++||+|+||+||+|||+++++++++|||||+++|||+|
T Consensus 80 ~~~~f~~EH~H~deEvR~i~~G~g~Fdvr~~~~~wiri~~e~GDli~vP~g~~HrF~~~~~~~i~aiRlF~~~~gWva 157 (157)
T PF03079_consen 80 KLKKFFEEHTHEDEEVRYIVDGSGYFDVRDGDDVWIRILCEKGDLIVVPAGTYHRFTLGESPYIKAIRLFKDEPGWVA 157 (157)
T ss_dssp HHHHHCS-EEESS-EEEEEEECEEEEEEE-TTCEEEEEEEETTCEEEE-TT--EEEEESTTSSEEEEEEESSCGGEES
T ss_pred HhhhhheeEecChheEEEEeCcEEEEEEEcCCCEEEEEEEcCCCEEecCCCCceeEEcCCCCcEEEEEeecCCCCccC
Confidence 999999999999999999999999999999999999999999999999999999999999999999999999999997
No 3
>COG1791 Uncharacterized conserved protein, contains double-stranded beta-helix domain [Function unknown]
Probab=100.00 E-value=6.1e-46 Score=298.13 Aligned_cols=168 Identities=31% Similarity=0.517 Sum_probs=150.5
Q ss_pred CeEEEEecCC---CCCCCCCCCcCCCCCcCCHhhhhh-----hcCeeEEee---ChHHHHHHHHhcCCCeeeEEEEcCCC
Q psy840 1 MVRVWYMDTD---VKSDQRLEHHRTPKEYLDLQTLNK-----LTGVEYFNV---QDEVLTKLRKDRNYNYEDEITCSKDC 69 (178)
Q Consensus 1 m~~aw~~d~~---~~~d~r~ph~~~p~~~v~~~~L~~-----~lGv~~~~~---~~~~l~~l~~~~gy~~~Dvv~l~~~~ 69 (178)
|++++.+|+. +..|++.|+- ++.-|+.+++ . +.|...-++ +..++++|++++||+++|||+++|++
T Consensus 1 Ms~l~I~d~~~~~~~~deia~~l--~~i~v~~e~w-e~~~~~~~~~~~~~i~~a~~~eid~l~~e~Gyk~~Dvvsv~~~~ 77 (181)
T COG1791 1 MSRLRIHDETKIITNQDEIAPEL--SKIEVSFERW-EATALIKHGAEKEHIIDAYETEIDRLIRERGYKNRDVVSVSPSN 77 (181)
T ss_pred CceEEEecCcccccCHhHhhhhc--ccceeEhhhh-hhccccccCcchhhhHhhHHHHHHHHHHhhCCceeeEEEeCCCC
Confidence 8999999986 4458899984 6788999999 6 555444444 78999999999999999999999986
Q ss_pred CCCHHHHhhhhccceecCcceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCCCCcEEEEEEeecCC
Q psy840 70 LPNYEEKLKSFYTEHLHTDEEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDINNYVKAKRYFIGEP 149 (178)
Q Consensus 70 ~p~~e~~~~~f~~eH~H~~dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~~~~~~alrlF~~~~ 149 (178)
|++++++++|+++|+|.++||||||+|+|+|+|+..+|+|++|.|++||+|+||+||+|||+++++++|+|||||+.++
T Consensus 78 -pk~del~akF~~EH~H~d~EvRy~vaG~GiF~v~~~d~~~~~i~c~~gDLI~vP~gi~HwFtlt~~~~f~AvRlF~~~~ 156 (181)
T COG1791 78 -PKLDELRAKFLQEHLHTDDEVRYFVAGEGIFDVHSPDGKVYQIRCEKGDLISVPPGIYHWFTLTESPNFKAVRLFTEPE 156 (181)
T ss_pred -ccHHHHHHHHHHHhccCCceEEEEEecceEEEEECCCCcEEEEEEccCCEEecCCCceEEEEccCCCcEEEEEEeeCCC
Confidence 9999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred ceeecCCCCCchHHHHH-HHHHHhh
Q psy840 150 VWTPHNRPADDMECRLE-YLKKIQN 173 (178)
Q Consensus 150 gW~~~~r~~d~~~~r~~-yl~~~~~ 173 (178)
||+|+++ +|+++.|++ |+..+.+
T Consensus 157 gWVa~yt-g~di~~~~~~y~~~i~~ 180 (181)
T COG1791 157 GWVAIYT-GDDIADRFPKYIEEINQ 180 (181)
T ss_pred Cceeeec-CchhHHHHHHHHHHhhc
Confidence 9999887 556655555 9998864
No 4
>COG0662 {ManC} Mannose-6-phosphate isomerase [Carbohydrate transport and metabolism]
Probab=99.08 E-value=3.8e-10 Score=86.59 Aligned_cols=61 Identities=28% Similarity=0.330 Sum_probs=52.5
Q ss_pred cceecCc-ceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCCCCcEEEEEEeec
Q psy840 82 TEHLHTD-EEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDINNYVKAKRYFIG 147 (178)
Q Consensus 82 ~eH~H~~-dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~~~~~~alrlF~~ 147 (178)
.+|.|.. +|++||++|+|.+.++++. +.+++||.++||+|+.|++...++..++.+-+..+
T Consensus 50 ~~~~H~~~dE~~~Vl~G~g~v~~~~~~-----~~v~~gd~~~iP~g~~H~~~N~G~~~L~liei~~p 111 (127)
T COG0662 50 SLHHHHHRDEHWYVLEGTGKVTIGGEE-----VEVKAGDSVYIPAGTPHRVRNTGKIPLVLIEVQSP 111 (127)
T ss_pred CcccccCcceEEEEEeeEEEEEECCEE-----EEecCCCEEEECCCCcEEEEcCCCcceEEEEEecC
Confidence 5666666 9999999999999998643 99999999999999999999988877777766554
No 5
>COG1917 Uncharacterized conserved protein, contains double-stranded beta-helix domain [Function unknown]
Probab=99.07 E-value=5.8e-10 Score=85.04 Aligned_cols=63 Identities=27% Similarity=0.368 Sum_probs=53.7
Q ss_pred hccceecC-cceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCCCCcEEEEEEeec
Q psy840 80 FYTEHLHT-DEEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDINNYVKAKRYFIG 147 (178)
Q Consensus 80 f~~eH~H~-~dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~~~~~~alrlF~~ 147 (178)
...+|+|+ .+++.||++|++.|+++++. ..+++||+|++|+|+.||+.+.++..+..|-++..
T Consensus 55 ~~~~H~hp~~~~~~~Vl~G~~~~~~~g~~-----~~l~~Gd~i~ip~g~~H~~~a~~~~~~~~l~v~~~ 118 (131)
T COG1917 55 VIPWHTHPLGEQTIYVLEGEGTVQLEGEK-----KELKAGDVIIIPPGVVHGLKAVEDEPMVLLLVFPL 118 (131)
T ss_pred ccccccCCCcceEEEEEecEEEEEecCCc-----eEecCCCEEEECCCCeeeeccCCCCceeEEEEeee
Confidence 46899998 79999999999999998533 89999999999999999999888765555555554
No 6
>PF07883 Cupin_2: Cupin domain; InterPro: IPR013096 This family represents the conserved barrel domain of the cupin superfamily [] (cupa is the Latin term for a small barrel). ; PDB: 2OPK_C 3BU7_B 2PHD_D 3NVC_A 3NKT_A 3NJZ_A 3NW4_A 3NST_A 3NL1_A 2H0V_A ....
Probab=99.07 E-value=4.5e-10 Score=75.93 Aligned_cols=61 Identities=30% Similarity=0.452 Sum_probs=53.4
Q ss_pred hccceecCcc-eEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCCCCcEEEEEEe
Q psy840 80 FYTEHLHTDE-EIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDINNYVKAKRYF 145 (178)
Q Consensus 80 f~~eH~H~~d-Ei~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~~~~~~alrlF 145 (178)
....|.|+.. |+.||++|++.+.+.+ + ++.+++||.+.+|+|+.|++...++..++.+.+|
T Consensus 10 ~~~~h~H~~~~e~~~vl~G~~~~~~~~---~--~~~l~~Gd~~~i~~~~~H~~~n~~~~~~~~l~V~ 71 (71)
T PF07883_consen 10 SIPPHRHPGEDEFFYVLSGEGTLTVDG---E--RVELKPGDAIYIPPGVPHQVRNPGDEPARFLVVY 71 (71)
T ss_dssp EEEEEEESSEEEEEEEEESEEEEEETT---E--EEEEETTEEEEEETTSEEEEEEESSSEEEEEEEE
T ss_pred CCCCEECCCCCEEEEEEECCEEEEEcc---E--EeEccCCEEEEECCCCeEEEEECCCCCEEEEEEC
Confidence 4679999997 9999999999999743 3 4899999999999999999999888888777765
No 7
>PRK04190 glucose-6-phosphate isomerase; Provisional
Probab=98.88 E-value=2.8e-08 Score=82.04 Aligned_cols=85 Identities=16% Similarity=0.195 Sum_probs=65.3
Q ss_pred CeeeEEEEcCCCCCCHHHHhhhhc--cceecCc---ceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeee
Q psy840 58 NYEDEITCSKDCLPNYEEKLKSFY--TEHLHTD---EEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFT 132 (178)
Q Consensus 58 ~~~Dvv~l~~~~~p~~e~~~~~f~--~eH~H~~---dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~ 132 (178)
-..++.++.|.... +.|+ .-|.|.. .|++||++|+|.+.+.+.++....+.++|||++.||+|+.|++.
T Consensus 68 L~~g~t~l~PG~~g------~e~~mt~gH~H~~~~~~EiyyvlsG~g~~~l~~~~G~~~~~~v~pGd~v~IPpg~~H~~i 141 (191)
T PRK04190 68 LNFGTTRLYPGKVG------DEYFMTKGHFHAKADRAEIYYGLKGKGLMLLQDPEGEARWIEMEPGTVVYVPPYWAHRSV 141 (191)
T ss_pred eEEEEEEECCCcEe------cccccCCCeEcCCCCCCEEEEEEeCEEEEEEecCCCcEEEEEECCCCEEEECCCCcEEeE
Confidence 45688888887422 2222 2477753 69999999999999988766666799999999999999999998
Q ss_pred cCCCCcEEEEEEeecC
Q psy840 133 LDINNYVKAKRYFIGE 148 (178)
Q Consensus 133 ~~~~~~~~alrlF~~~ 148 (178)
..++..++.+-++...
T Consensus 142 N~G~epl~fl~v~p~~ 157 (191)
T PRK04190 142 NTGDEPLVFLACYPAD 157 (191)
T ss_pred ECCCCCEEEEEEEcCC
Confidence 8777777766665443
No 8
>TIGR03037 anthran_nbaC 3-hydroxyanthranilate 3,4-dioxygenase. Members of this protein family, from both bacteria and eukaryotes, are the enzyme 3-hydroxyanthranilate 3,4-dioxygenase. This enzyme acts on the tryptophan metabolite 3-hydroxyanthranilate and produces 2-amino-3-carboxymuconate semialdehyde, which can rearrange spontaneously to quinolinic acid and feed into nicotinamide biosynthesis, or undergo further enzymatic degradation.
Probab=98.83 E-value=1.4e-08 Score=81.80 Aligned_cols=59 Identities=19% Similarity=0.278 Sum_probs=49.6
Q ss_pred ccceecCcceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCCCCcEE
Q psy840 81 YTEHLHTDEEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDINNYVK 140 (178)
Q Consensus 81 ~~eH~H~~dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~~~~~~ 140 (178)
+.+|.|+.||.+|+++|+..+.+.+. ++.-.+.+++||+++||+|++|++...++..+.
T Consensus 41 ~d~H~~~tdE~FyqleG~~~l~v~d~-g~~~~v~L~eGd~flvP~gvpHsP~r~~~t~~L 99 (159)
T TIGR03037 41 TDFHDDPGEEFFYQLKGEMYLKVTEE-GKREDVPIREGDIFLLPPHVPHSPQRPAGSIGL 99 (159)
T ss_pred cccccCCCceEEEEEcceEEEEEEcC-CcEEEEEECCCCEEEeCCCCCcccccCCCcEEE
Confidence 57899999999999999999999874 443359999999999999999999886654433
No 9
>smart00835 Cupin_1 Cupin. This family represents the conserved barrel domain of the 'cupin' superfamily ('cupa' is the Latin term for a small barrel). This family contains 11S and 7S plant seed storage proteins, and germins. Plant seed storage proteins provide the major nitrogen source for the developing plant.
Probab=98.79 E-value=5.4e-08 Score=75.74 Aligned_cols=69 Identities=19% Similarity=0.313 Sum_probs=56.2
Q ss_pred hccceecC-cceEEEEecceEEEEEEeCC-CcEEEEEEeCCcEEEeCCCCeeeeecCCCCcEEEEEEeecC
Q psy840 80 FYTEHLHT-DEEIRLVLDGSGYFDVRDKF-DDWIRIEVTKGDLIIIPAGIYHRFTLDINNYVKAKRYFIGE 148 (178)
Q Consensus 80 f~~eH~H~-~dEi~~il~G~g~f~v~~~~-d~~~ri~~~~GDli~vPaG~~H~f~~~~~~~~~alrlF~~~ 148 (178)
....|.|+ .+|+.||++|++.+.+.+++ +......+++||++.||+|+.|++...++..+..+-+....
T Consensus 42 ~~~~h~H~~~~e~~~Vl~G~~~~~~~~~~~~~~~~~~l~~GD~~~ip~g~~H~~~n~~~~~~~~l~~~~~~ 112 (146)
T smart00835 42 MLPPHYHPRATELLYVVRGEGRVGVVDPNGNKVYDARLREGDVFVVPQGHPHFQVNSGDENLEFVAFNTND 112 (146)
T ss_pred CcCCeeCCCCCEEEEEEeCeEEEEEEeCCCCeEEEEEecCCCEEEECCCCEEEEEcCCCCCEEEEEEecCC
Confidence 36799997 58999999999999998753 35567999999999999999999988776667777444433
No 10
>TIGR03404 bicupin_oxalic bicupin, oxalate decarboxylase family. Members of this protein family are defined as bicupins as they have two copies of the cupin domain (pfam00190). Two different known activities for members of this family are oxalate decarboxylase (EC 4.1.1.2) and oxalate oxidase (EC 1.2.3.4), although the latter activity has more often been found in distantly related monocupin (germin) proteins.
Probab=98.76 E-value=7.6e-08 Score=86.30 Aligned_cols=69 Identities=14% Similarity=0.185 Sum_probs=60.2
Q ss_pred hccceecCc-ceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCCCCcEEEEEEeecC
Q psy840 80 FYTEHLHTD-EEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDINNYVKAKRYFIGE 148 (178)
Q Consensus 80 f~~eH~H~~-dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~~~~~~alrlF~~~ 148 (178)
....|.|+. +|+.||++|++.+.+.+.++.-....+++||+++||+|..|++...++..++.+-+|+++
T Consensus 257 ~~~~H~H~~~~E~~yvl~G~~~~~v~d~~g~~~~~~l~~GD~~~iP~g~~H~i~N~G~e~l~fL~if~s~ 326 (367)
T TIGR03404 257 MRELHWHPNADEWQYFIQGQARMTVFAAGGNARTFDYQAGDVGYVPRNMGHYVENTGDETLVFLEVFKAD 326 (367)
T ss_pred ccCCeeCcCCCeEEEEEEEEEEEEEEecCCcEEEEEECCCCEEEECCCCeEEEEECCCCCEEEEEEECCC
Confidence 357999996 899999999999999765555445789999999999999999998887789999999884
No 11
>PF00190 Cupin_1: Cupin; InterPro: IPR006045 This family represents the conserved barrel domain of the 'cupin' superfamily ('cupa' is the Latin term for a small barrel). This family contains 11S and 7S plant seed storage proteins, and germins. Plant seed storage proteins provide the major nitrogen source for the developing plant. ; GO: 0045735 nutrient reservoir activity; PDB: 2E9Q_A 2EVX_A 1OD5_A 1UCX_A 1UD1_C 1FXZ_C 3KGL_C 3KSC_D 1UIJ_F 1IPK_B ....
Probab=98.75 E-value=7.4e-08 Score=74.83 Aligned_cols=69 Identities=23% Similarity=0.309 Sum_probs=52.2
Q ss_pred hhccceecCcceEEEEecceEEEEEEeCCC-----cEEEEE--EeCCcEEEeCCCCeeeeecCCCCcEEEEEEeec
Q psy840 79 SFYTEHLHTDEEIRLVLDGSGYFDVRDKFD-----DWIRIE--VTKGDLIIIPAGIYHRFTLDINNYVKAKRYFIG 147 (178)
Q Consensus 79 ~f~~eH~H~~dEi~~il~G~g~f~v~~~~d-----~~~ri~--~~~GDli~vPaG~~H~f~~~~~~~~~alrlF~~ 147 (178)
.+...|.|...|+.||++|+|.+.+-..++ +...-. +++||+++||+|..||...+++.....+.+|..
T Consensus 45 ~~~~Ph~h~a~~i~~V~~G~~~~~~v~~~~~~~~~~~~~~~v~l~~Gdv~~vP~G~~h~~~n~~~~~~~~~~~f~~ 120 (144)
T PF00190_consen 45 GLRAPHYHNADEIVYVIEGRGRVGVVGPGGPQEEFRDFSQKVRLKAGDVFVVPAGHPHWIINDGDDEALVLIIFDT 120 (144)
T ss_dssp EEEEEEEESSEEEEEEEESEEEEEEEETTCSSSEEEEEEEEEEEETTEEEEE-TT-EEEEEECSSSSEEEEEEEEE
T ss_pred CccceeEeeeeEEeeeeccceEEEEEecCCccccceeeeceeeeecccceeeccceeEEEEcCCCCCCEEEEEEEC
Confidence 357899996699999999999999888654 233344 999999999999999999887444455555543
No 12
>PRK13264 3-hydroxyanthranilate 3,4-dioxygenase; Provisional
Probab=98.73 E-value=3.5e-08 Score=80.63 Aligned_cols=55 Identities=18% Similarity=0.355 Sum_probs=47.8
Q ss_pred ccceecCcceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCCC
Q psy840 81 YTEHLHTDEEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDIN 136 (178)
Q Consensus 81 ~~eH~H~~dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~~ 136 (178)
+.+|.|+.||.+|+++|++...+++ +|+.-.+.+++||+++||+|++|+....++
T Consensus 47 ~d~H~~~tdE~FyqleG~~~l~v~d-~g~~~~v~L~eGd~fllP~gvpHsP~r~~~ 101 (177)
T PRK13264 47 TDFHYDPGEEFFYQLEGDMYLKVQE-DGKRRDVPIREGEMFLLPPHVPHSPQREAG 101 (177)
T ss_pred cccccCCCceEEEEECCeEEEEEEc-CCceeeEEECCCCEEEeCCCCCcCCccCCC
Confidence 5789999999999999999999987 344335999999999999999999977554
No 13
>PF02311 AraC_binding: AraC-like ligand binding domain; InterPro: IPR003313 This entry defines the arabinose-binding and dimerisation domain of the bacterial gene regulatory protein AraC. The crystal structure of the arabinose-binding and dimerization domain of the Escherichia coli gene regulatory protein AraC was determined in the presence and absence of L-arabinose. The arabinose-bound molecule shows that the protein adopts an unusual fold, binding sugar within a beta barrel and completely burying the arabinose with the amino-terminal arm of the protein. Dimer contacts in the presence of arabinose are mediated by an antiparallel coiled-coil. In the uncomplexed protein, the amino-terminal arm is disordered, uncovering the sugar-binding pocket and allowing it to serve as an oligomerization interface [].; GO: 0006355 regulation of transcription, DNA-dependent; PDB: 1XJA_B 2ARA_A 2AAC_B 2ARC_A.
Probab=98.63 E-value=8.5e-08 Score=70.23 Aligned_cols=59 Identities=29% Similarity=0.384 Sum_probs=43.0
Q ss_pred hhccceecCcceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCCCCcEEEE
Q psy840 79 SFYTEHLHTDEEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDINNYVKAK 142 (178)
Q Consensus 79 ~f~~eH~H~~dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~~~~~~al 142 (178)
..+..|.|+.-|+.||++|+|.+.++++. ..++|||++++|+|..|.+...++..+...
T Consensus 14 ~~~~~h~h~~~~i~~v~~G~~~~~~~~~~-----~~l~~g~~~li~p~~~H~~~~~~~~~~~~~ 72 (136)
T PF02311_consen 14 FEFPPHWHDFYEIIYVLSGEGTLHIDGQE-----YPLKPGDLFLIPPGQPHSYYPDSNEPWEYY 72 (136)
T ss_dssp -SEEEETT-SEEEEEEEEE-EEEEETTEE-----EEE-TT-EEEE-TTS-EEEEE-TTSEEEEE
T ss_pred CccCCEECCCEEEEEEeCCEEEEEECCEE-----EEEECCEEEEecCCccEEEecCCCCCEEEE
Confidence 34688999999999999999999996533 899999999999999999998886444433
No 14
>TIGR03214 ura-cupin putative allantoin catabolism protein. This model represents a protein containing a tandem arrangement of cupin domains (N-terminal part of pfam07883 and C-terminal more distantly related to pfam00190). This protein is found in the vicinity of genes involved in the catabolism of allantoin, a breakdown product of urate and sometimes of urate iteslf. The distribution of pathway components in the genomes in which this family is observed suggests that the function is linked to the allantoate catabolism to glyoxylate pathway (GenProp0686) since it is sometimes found in genomes lacking any elements of the xanthine-to-allantoin pathways (e.g. in Enterococcus faecalis).
Probab=98.61 E-value=1.9e-07 Score=79.89 Aligned_cols=55 Identities=22% Similarity=0.300 Sum_probs=47.8
Q ss_pred ceecCcceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCCCCcEEEE
Q psy840 83 EHLHTDEEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDINNYVKAK 142 (178)
Q Consensus 83 eH~H~~dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~~~~~~al 142 (178)
-|+|..+|..|||+|+|.+.+. |+| +.+++||+|.||||..||+.+.++..++-|
T Consensus 195 ~~~H~~eh~~yiL~G~G~~~~~---g~~--~~V~~GD~i~i~~~~~h~~~~~G~~~~~~l 249 (260)
T TIGR03214 195 IETHVMEHGLYVLEGKGVYNLD---NNW--VPVEAGDYIWMGAYCPQACYAGGRGEFRYL 249 (260)
T ss_pred cccccceeEEEEEeceEEEEEC---CEE--EEecCCCEEEECCCCCEEEEecCCCcEEEE
Confidence 4888889999999999999884 566 899999999999999999998877666544
No 15
>PRK13290 ectC L-ectoine synthase; Reviewed
Probab=98.60 E-value=1.7e-07 Score=72.36 Aligned_cols=61 Identities=11% Similarity=0.152 Sum_probs=49.7
Q ss_pred ccceecCcceEEEEecceEEEE-EEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCCCCcEEEEEEeec
Q psy840 81 YTEHLHTDEEIRLVLDGSGYFD-VRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDINNYVKAKRYFIG 147 (178)
Q Consensus 81 ~~eH~H~~dEi~~il~G~g~f~-v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~~~~~~alrlF~~ 147 (178)
...|+|..+|+.||++|++.+. +.+ |+ ...+++||.+.+|+|..|++... ..+..|-+++.
T Consensus 48 ~~~h~h~~~E~~yVL~G~~~~~~i~~--g~--~~~L~aGD~i~~~~~~~H~~~N~--e~~~~l~v~tP 109 (125)
T PRK13290 48 THLHYKNHLEAVYCIEGEGEVEDLAT--GE--VHPIRPGTMYALDKHDRHYLRAG--EDMRLVCVFNP 109 (125)
T ss_pred ccceeCCCEEEEEEEeCEEEEEEcCC--CE--EEEeCCCeEEEECCCCcEEEEcC--CCEEEEEEECC
Confidence 4678887789999999999999 642 22 28999999999999999999987 34666777774
No 16
>COG2140 Thermophilic glucose-6-phosphate isomerase and related metalloenzymes [Carbohydrate transport and metabolism / General function prediction only]
Probab=98.56 E-value=4e-07 Score=76.11 Aligned_cols=69 Identities=25% Similarity=0.293 Sum_probs=62.1
Q ss_pred cceecCc-ce--EEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCCCCcEEEEEEeecCCc
Q psy840 82 TEHLHTD-EE--IRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDINNYVKAKRYFIGEPV 150 (178)
Q Consensus 82 ~eH~H~~-dE--i~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~~~~~~alrlF~~~~g 150 (178)
..|.|+. || +.|+++|+|.+.|....++.+.+.+++||+|.||+|--|+-..+++..+..+-+|....+
T Consensus 94 ~~H~Hp~ade~E~y~vi~G~g~m~v~~~~G~~~v~~~~~Gd~iyVPp~~gH~t~N~Gd~pLvf~~v~~~~~~ 165 (209)
T COG2140 94 ELHYHPNADEPEIYYVLKGEGRMLVQKPEGEARVIAVRAGDVIYVPPGYGHYTINTGDEPLVFLNVYPADAG 165 (209)
T ss_pred ccccCCCCCcccEEEEEeccEEEEEEcCCCcEEEEEecCCcEEEeCCCcceEeecCCCCCEEEEEEEeCCCC
Confidence 4599987 66 999999999999999889988999999999999999999999999888988888877543
No 17
>PRK09943 DNA-binding transcriptional repressor PuuR; Provisional
Probab=98.56 E-value=2.2e-07 Score=74.96 Aligned_cols=61 Identities=21% Similarity=0.248 Sum_probs=52.1
Q ss_pred cceecCcceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCCCCcEEEEEEeec
Q psy840 82 TEHLHTDEEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDINNYVKAKRYFIG 147 (178)
Q Consensus 82 ~eH~H~~dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~~~~~~alrlF~~ 147 (178)
..|.|+.+|+.||++|++.+.+.++. +.+++||.+.+|+++.|++...++..++++-++..
T Consensus 122 ~~~~h~~~E~~~Vl~G~~~~~~~~~~-----~~l~~Gd~~~~~~~~~H~~~n~~~~~~~~l~~~~p 182 (185)
T PRK09943 122 ERIKHQGEEIGTVLEGEIVLTINGQD-----YHLVAGQSYAINTGIPHSFSNTSAGICRIISAHTP 182 (185)
T ss_pred cccccCCcEEEEEEEeEEEEEECCEE-----EEecCCCEEEEcCCCCeeeeCCCCCCeEEEEEeCC
Confidence 46778899999999999999996533 89999999999999999999877777777776653
No 18
>PF06560 GPI: Glucose-6-phosphate isomerase (GPI); InterPro: IPR010551 This entry consists of several bacterial and archaeal glucose-6-phosphate isomerase (GPI) proteins (5.3.1.9 from EC), which are involved in glycolysis and in gluconeogenesis and catalyse the conversion of D-glucose 6-phosphate to D-fructose 6-phosphate. The deduced amino acid sequence of the first archaeal PGI isolated from Pyrococcus furiosus revealed that it is not related to its eukaryotic and many of its bacterial counterparts. In contrast, this archaeal PGI shares similarity with the cupin superfamily that consists of a variety of proteins that are generally involved in sugar metabolism in both prokaryotes and eukaryotes [].; GO: 0004347 glucose-6-phosphate isomerase activity, 0006094 gluconeogenesis, 0006096 glycolysis, 0005737 cytoplasm; PDB: 1J3Q_B 1J3R_B 1J3P_A 2GC0_A 1X8E_A 1X82_A 1QY4_B 2GC2_B 1QXJ_A 1QXR_B ....
Probab=98.45 E-value=1.2e-06 Score=72.00 Aligned_cols=92 Identities=21% Similarity=0.300 Sum_probs=57.1
Q ss_pred HHHHhcCCCeeeEEEEcCCCCCCHHHHhhhhc--cceecCc-------ceEEEEecceEEEEEEeCCC----cEEEEEEe
Q psy840 50 KLRKDRNYNYEDEITCSKDCLPNYEEKLKSFY--TEHLHTD-------EEIRLVLDGSGYFDVRDKFD----DWIRIEVT 116 (178)
Q Consensus 50 ~l~~~~gy~~~Dvv~l~~~~~p~~e~~~~~f~--~eH~H~~-------dEi~~il~G~g~f~v~~~~d----~~~ri~~~ 116 (178)
++.++++.. .|+..+.|.++.+ .|+ .-|.|+. .|++++++|+|.|-+...++ .++-+.++
T Consensus 43 ~~~~~~~L~-ygiTvi~Pg~vG~------E~~~T~GH~H~~~~~~~~~pEvY~vl~G~g~~lLq~~~~~~~~~~~~v~~~ 115 (182)
T PF06560_consen 43 EWLQKRNLR-YGITVIPPGKVGG------EYFMTKGHYHPISPCGLSYPEVYEVLSGEGLILLQKEEGDDVGDVIAVEAK 115 (182)
T ss_dssp -------EE-EEEEEE---EETT------EE-B---BB-SS----TT--EEEEEEESSEEEEEE-TTS-----EEEEEE-
T ss_pred ccceeeeEE-eeeEEEcCcccCC------ccccCCCccCCccccCCCCCcEEEEEeCEEEEEEEecCCCcceeEEEEEeC
Confidence 355666664 5889999886553 222 3577764 89999999999999999887 78889999
Q ss_pred CCcEEEeCCCCeeeeecCCCCcEEEEEEeecC
Q psy840 117 KGDLIIIPAGIYHRFTLDINNYVKAKRYFIGE 148 (178)
Q Consensus 117 ~GDli~vPaG~~H~f~~~~~~~~~alrlF~~~ 148 (178)
+||+++||+|..|+-...++..+++.-++...
T Consensus 116 ~G~~v~IPp~yaH~tIN~g~~~L~~~~~~~~~ 147 (182)
T PF06560_consen 116 PGDVVYIPPGYAHRTINTGDEPLVFAAWVPRD 147 (182)
T ss_dssp TTEEEEE-TT-EEEEEE-SSS-EEEEEEEETT
T ss_pred CCCEEEECCCceEEEEECCCCcEEEEEEEecC
Confidence 99999999999999988777778776665543
No 19
>TIGR03404 bicupin_oxalic bicupin, oxalate decarboxylase family. Members of this protein family are defined as bicupins as they have two copies of the cupin domain (pfam00190). Two different known activities for members of this family are oxalate decarboxylase (EC 4.1.1.2) and oxalate oxidase (EC 1.2.3.4), although the latter activity has more often been found in distantly related monocupin (germin) proteins.
Probab=98.39 E-value=1.6e-06 Score=77.88 Aligned_cols=66 Identities=26% Similarity=0.176 Sum_probs=54.9
Q ss_pred ccceecCcceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCCCCcEEEEEEeec
Q psy840 81 YTEHLHTDEEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDINNYVKAKRYFIG 147 (178)
Q Consensus 81 ~~eH~H~~dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~~~~~~alrlF~~ 147 (178)
...|.|..+|+.||++|++.+.+.+.+++.+...+++||++++|+|..|.+...++ ....+-+|..
T Consensus 80 ~~~HwH~~~E~~yVl~G~~~v~~~d~~g~~~~~~L~~GD~~~fP~g~~H~~~n~~~-~~~~l~vf~~ 145 (367)
T TIGR03404 80 RELHWHKEAEWAYVLYGSCRITAVDENGRNYIDDVGAGDLWYFPPGIPHSLQGLDE-GCEFLLVFDD 145 (367)
T ss_pred CCcccCCCceEEEEEeeEEEEEEEcCCCcEEEeEECCCCEEEECCCCeEEEEECCC-CeEEEEEeCC
Confidence 57999999999999999999999876677665689999999999999999987743 3455555655
No 20
>TIGR01479 GMP_PMI mannose-1-phosphate guanylyltransferase/mannose-6-phosphate isomerase. This enzyme is known to be bifunctional, as both mannose-6-phosphate isomerase (EC 5.3.1.8) (PMI) and mannose-1-phosphate guanylyltransferase (EC 2.7.7.22) in Pseudomonas aeruginosa, Xanthomonas campestris, and Gluconacetobacter xylinus. The literature on the enzyme from E. coli attributes mannose-6-phosphate isomerase activity to an adjacent gene, but the present sequence has not been shown to lack the activity. The PMI domain is C-terminal.
Probab=98.28 E-value=2.2e-06 Score=78.77 Aligned_cols=63 Identities=19% Similarity=0.293 Sum_probs=52.6
Q ss_pred ccceecCc-ceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCCCCcEEEEEEeecC
Q psy840 81 YTEHLHTD-EEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDINNYVKAKRYFIGE 148 (178)
Q Consensus 81 ~~eH~H~~-dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~~~~~~alrlF~~~ 148 (178)
...|.|.. +|..||++|++.+.+.++. +.+++||.+.+|+|++|++...++..++.+-++.++
T Consensus 389 ~~~h~H~~~~E~~~Vl~G~~~v~~dg~~-----~~l~~GDsi~ip~~~~H~~~N~g~~~~~~i~v~~~~ 452 (468)
T TIGR01479 389 LSLQMHHHRAEHWIVVSGTARVTIGDET-----LLLTENESTYIPLGVIHRLENPGKIPLELIEVQSGS 452 (468)
T ss_pred cCccccCCCceEEEEEeeEEEEEECCEE-----EEecCCCEEEECCCCcEEEEcCCCCCEEEEEEEcCC
Confidence 35566654 6788999999999997533 899999999999999999999888888888887754
No 21
>PRK13500 transcriptional activator RhaR; Provisional
Probab=98.23 E-value=2.5e-06 Score=73.85 Aligned_cols=54 Identities=28% Similarity=0.396 Sum_probs=47.7
Q ss_pred hccceecCcceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCCCCc
Q psy840 80 FYTEHLHTDEEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDINNY 138 (178)
Q Consensus 80 f~~eH~H~~dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~~~~ 138 (178)
.+.+|.|+.-|+.||++|+|.+.+++.. ..+++||+++||+|..|.+...++..
T Consensus 60 ~~~~H~H~~~el~~v~~G~g~~~v~~~~-----~~l~~Gdl~~I~~~~~H~~~~~~~~~ 113 (312)
T PRK13500 60 VFAEHTHDFCELVIVWRGNGLHVLNDRP-----YRITRGDLFYIHADDKHSYASVNDLV 113 (312)
T ss_pred CCCccccceEEEEEEEcCeEEEEECCEE-----EeecCCeEEEECCCCeecccccCCce
Confidence 3689999999999999999999997643 89999999999999999998766543
No 22
>PRK11171 hypothetical protein; Provisional
Probab=98.23 E-value=7e-06 Score=70.47 Aligned_cols=52 Identities=27% Similarity=0.399 Sum_probs=44.7
Q ss_pred eecCcceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCCCCcEE
Q psy840 84 HLHTDEEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDINNYVK 140 (178)
Q Consensus 84 H~H~~dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~~~~~~ 140 (178)
|.|..+|..||++|++.+.+. |+| ..+++||.|.+|++..|+|.+.++..++
T Consensus 201 ~~~~~ee~i~Vl~G~~~~~~~---~~~--~~l~~GD~i~~~~~~~h~~~N~g~~~~~ 252 (266)
T PRK11171 201 ETHVMEHGLYVLEGKGVYRLN---NDW--VEVEAGDFIWMRAYCPQACYAGGPGPFR 252 (266)
T ss_pred cCCCceEEEEEEeCEEEEEEC---CEE--EEeCCCCEEEECCCCCEEEECCCCCcEE
Confidence 568889999999999999984 454 8999999999999999999987665443
No 23
>PRK13501 transcriptional activator RhaR; Provisional
Probab=98.20 E-value=2.8e-06 Score=72.26 Aligned_cols=51 Identities=29% Similarity=0.411 Sum_probs=45.6
Q ss_pred ccceecCcceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCCC
Q psy840 81 YTEHLHTDEEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDIN 136 (178)
Q Consensus 81 ~~eH~H~~dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~~ 136 (178)
..+|.|+.-|+.||++|+|.+.|++. .+.+++||+++||+|..|.+...++
T Consensus 31 ~~~H~H~~~ei~~i~~G~~~~~i~~~-----~~~l~~g~~~~I~p~~~H~~~~~~~ 81 (290)
T PRK13501 31 FVEHTHQFCEIVIVWRGNGLHVLNDH-----PYRITCGDVFYIQAADHHSYESVHD 81 (290)
T ss_pred CccccccceeEEEEecCceEEEECCe-----eeeecCCeEEEEcCCCcccccccCC
Confidence 56899999999999999999999763 3899999999999999999986544
No 24
>PLN00212 glutelin; Provisional
Probab=98.16 E-value=7.4e-06 Score=76.25 Aligned_cols=69 Identities=17% Similarity=0.195 Sum_probs=56.1
Q ss_pred hhccceecCcceEEEEecceEEEEEEeCCC-------------------------cEEEEEEeCCcEEEeCCCCeeeeec
Q psy840 79 SFYTEHLHTDEEIRLVLDGSGYFDVRDKFD-------------------------DWIRIEVTKGDLIIIPAGIYHRFTL 133 (178)
Q Consensus 79 ~f~~eH~H~~dEi~~il~G~g~f~v~~~~d-------------------------~~~ri~~~~GDli~vPaG~~H~f~~ 133 (178)
.++..|.|...++.||+.|+|.+.+-.++- ..--..+++||+|.||||+.||...
T Consensus 91 gL~lP~y~na~~liyV~qG~G~~G~v~pGcpeT~~~~~~~~~~~~~~~~~~~~d~hqkv~~lr~GDViaiPaG~~hw~yN 170 (493)
T PLN00212 91 GLLLPRYSNTPGLVYIIQGRGSMGLTFPGCPATYQQQFQQFLTEGQSQSQKFRDEHQKIHQFRQGDVVALPAGVAHWFYN 170 (493)
T ss_pred cccCccccCCCeEEEEEeCeEEEEEEeCCCcchhhhhcccccccccccccccccccccceEeccCCEEEECCCCeEEEEe
Confidence 468899998899999999999998874210 0001489999999999999999999
Q ss_pred CCCCcEEEEEEeec
Q psy840 134 DINNYVKAKRYFIG 147 (178)
Q Consensus 134 ~~~~~~~alrlF~~ 147 (178)
+++..+.++.++..
T Consensus 171 ~Gd~~~v~v~~~d~ 184 (493)
T PLN00212 171 DGDAPVVALYVYDI 184 (493)
T ss_pred CCCCcEEEEEEEec
Confidence 98888888877754
No 25
>PF02041 Auxin_BP: Auxin binding protein; InterPro: IPR000526 Auxin binding protein is located in the lumen of the endoplasmic reticulum (ER). The primary structure contains an N-terminal hydrophobic leader sequence of 30-40 amino acids, which could represent a signal for translocation of the protein to the ER [, ]. The mature protein comprises around 165 residues, and contains a number of potential N-glycosylation sites. In vitro transport studies have demonstrated co-translational glycosylation []. Retention within the lumen of the ER correlates with an additional signal located at the C terminus, represented by the sequence Lys-Asp-Glu-Leu, known to be responsible for preventing secretion of proteins from the lumen of the ER in eukaryotic cells [, ].; GO: 0004872 receptor activity, 0005788 endoplasmic reticulum lumen; PDB: 1LR5_D 1LRH_D.
Probab=98.15 E-value=1.3e-05 Score=64.27 Aligned_cols=74 Identities=19% Similarity=0.155 Sum_probs=47.9
Q ss_pred ccceecCcceEEEEecceEEEEEEeC----CCcEEEEEEeCCcEEEeCCCCeeee-ecCCCCcEEEEEEeecCCceeec
Q psy840 81 YTEHLHTDEEIRLVLDGSGYFDVRDK----FDDWIRIEVTKGDLIIIPAGIYHRF-TLDINNYVKAKRYFIGEPVWTPH 154 (178)
Q Consensus 81 ~~eH~H~~dEi~~il~G~g~f~v~~~----~d~~~ri~~~~GDli~vPaG~~H~f-~~~~~~~~~alrlF~~~~gW~~~ 154 (178)
-+.|.|+-|||++|++|+|+..+... .|+.-.+...+++.+.||.|-.|.. ..++...+.++-+.+.+|.=+-+
T Consensus 57 TPiHRHsCEEVFvVLkG~GTl~l~~~~~~~pG~pqef~~~pnSTf~IPvn~~HQv~NT~e~eDlqvlViiSrpPvkvf~ 135 (167)
T PF02041_consen 57 TPIHRHSCEEVFVVLKGSGTLYLASSHEKYPGKPQEFPIFPNSTFHIPVNDAHQVWNTNEHEDLQVLVIISRPPVKVFI 135 (167)
T ss_dssp --EEEESS-EEEEEEE--EEEEE--SSSSS--S-EEEEE-TTEEEEE-TT--EEEE---SSS-EEEEEEEESSS--EEE
T ss_pred CCCccccccEEEEEEecceEEEEecccccCCCCceEEEecCCCeEEeCCCCcceeecCCCCcceEEEEEecCCCeEEEE
Confidence 37999999999999999999999865 3566679999999999999999995 55556889999999988865443
No 26
>PRK15460 cpsB mannose-1-phosphate guanyltransferase; Provisional
Probab=98.13 E-value=1.1e-05 Score=74.78 Aligned_cols=61 Identities=16% Similarity=0.194 Sum_probs=50.8
Q ss_pred cceecCc-ceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCCCCcEEEEEEeec
Q psy840 82 TEHLHTD-EEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDINNYVKAKRYFIG 147 (178)
Q Consensus 82 ~eH~H~~-dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~~~~~~alrlF~~ 147 (178)
..|.|.. +|..||++|++.+.+.++. ..+.+||.+.+|+|+.|++...++..++.|-+..+
T Consensus 399 ~~~~H~~~~E~~~VlsG~~~v~idg~~-----~~L~~GDSi~ip~g~~H~~~N~g~~~l~iI~V~~g 460 (478)
T PRK15460 399 SVQMHHHRAEHWVVVAGTAKVTIDGDI-----KLLGENESIYIPLGATHCLENPGKIPLDLIEVRSG 460 (478)
T ss_pred CcCCCCCCceEEEEEeeEEEEEECCEE-----EEecCCCEEEECCCCcEEEEcCCCCCEEEEEEEcC
Confidence 3455543 6999999999999997543 89999999999999999999988877877777655
No 27
>PRK15457 ethanolamine utilization protein EutQ; Provisional
Probab=98.11 E-value=7.7e-06 Score=69.45 Aligned_cols=44 Identities=25% Similarity=0.384 Sum_probs=37.9
Q ss_pred eecCcceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeee
Q psy840 84 HLHTDEEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFT 132 (178)
Q Consensus 84 H~H~~dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~ 132 (178)
+++..+|+.||++|+..|.+.++ ...++|||.++||+|..|.|.
T Consensus 171 wtl~~dEi~YVLEGe~~l~IdG~-----t~~l~pGDvlfIPkGs~~hf~ 214 (233)
T PRK15457 171 WTLNYDEIDMVLEGELHVRHEGE-----TMIAKAGDVMFIPKGSSIEFG 214 (233)
T ss_pred eeccceEEEEEEEeEEEEEECCE-----EEEeCCCcEEEECCCCeEEec
Confidence 55677999999999999999653 399999999999999996663
No 28
>PRK10296 DNA-binding transcriptional regulator ChbR; Provisional
Probab=98.11 E-value=7.6e-06 Score=68.93 Aligned_cols=49 Identities=33% Similarity=0.506 Sum_probs=43.5
Q ss_pred hccceecCcceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeec
Q psy840 80 FYTEHLHTDEEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTL 133 (178)
Q Consensus 80 f~~eH~H~~dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~ 133 (178)
....|.|+.-|+.||++|++.+.+.++ .+.+.+||+++||+|..|.+..
T Consensus 35 ~~~~H~H~~~ei~~v~~G~~~~~i~~~-----~~~l~~g~l~~i~p~~~H~~~~ 83 (278)
T PRK10296 35 VSGLHQHDYYEFTLVLTGRYYQEINGK-----RVLLERGDFVFIPLGSHHQSFY 83 (278)
T ss_pred CCCCcccccEEEEEEEeceEEEEECCE-----EEEECCCcEEEeCCCCccceee
Confidence 457999999999999999999999653 3899999999999999997643
No 29
>PRK13503 transcriptional activator RhaS; Provisional
Probab=98.10 E-value=4.2e-06 Score=69.98 Aligned_cols=52 Identities=21% Similarity=0.275 Sum_probs=46.5
Q ss_pred hccceecCcceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCCC
Q psy840 80 FYTEHLHTDEEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDIN 136 (178)
Q Consensus 80 f~~eH~H~~dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~~ 136 (178)
.+.+|.|+.-|+.||++|+|.+.+++.. ..+++||+++||+|+.|.+...++
T Consensus 27 ~~~~H~H~~~ei~~v~~G~~~~~i~~~~-----~~l~~g~~~~i~~~~~h~~~~~~~ 78 (278)
T PRK13503 27 AFPEHHHDFHEIVIVEHGTGIHVFNGQP-----YTLSGGTVCFVRDHDRHLYEHTDN 78 (278)
T ss_pred cccccccCceeEEEEecCceeeEecCCc-----ccccCCcEEEECCCccchhhhccC
Confidence 4679999999999999999999998744 899999999999999999876654
No 30
>COG4297 Uncharacterized protein containing double-stranded beta helix domain [Function unknown]
Probab=98.09 E-value=2e-05 Score=62.54 Aligned_cols=95 Identities=19% Similarity=0.214 Sum_probs=71.1
Q ss_pred hHHHHHHHHhcCCCeeeEEEEcCCCCCCHHHHhhhhccceecCc-ceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEe
Q psy840 45 DEVLTKLRKDRNYNYEDEITCSKDCLPNYEEKLKSFYTEHLHTD-EEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIII 123 (178)
Q Consensus 45 ~~~l~~l~~~~gy~~~Dvv~l~~~~~p~~e~~~~~f~~eH~H~~-dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~v 123 (178)
..+++++.+.+|+.. ++.. .-|---|.|+. -|+.-+++|++...+.+.++. .+.++.||.|+|
T Consensus 34 a~~~e~~~~~~gW~g----sW~g----------~Vf~yHHYHs~aHEVl~vlrgqA~l~iGG~~G~--el~v~~GDvlli 97 (163)
T COG4297 34 AAQVEDHFKANGWFG----SWRG----------GVFNYHHYHSGAHEVLGVLRGQAGLQIGGADGQ--ELEVGEGDVLLI 97 (163)
T ss_pred HHHHHHHHhhcCCcc----cccc----------cccccccccCCcceEEEEecceeEEEecCCCCc--eeeecCCCEEEE
Confidence 356888999999862 1111 12333578887 799999999999999998887 599999999999
Q ss_pred CCCCeeeeecCCCCcEEEEEEeecCCceeecCCC
Q psy840 124 PAGIYHRFTLDINNYVKAKRYFIGEPVWTPHNRP 157 (178)
Q Consensus 124 PaG~~H~f~~~~~~~~~alrlF~~~~gW~~~~r~ 157 (178)
|||+-|+- +..+..|..|--|.+...|.- .++
T Consensus 98 PAGvGH~r-l~sS~DF~VvGaYp~G~q~di-qtg 129 (163)
T COG4297 98 PAGVGHCR-LHSSADFQVVGAYPPGQQADI-QTG 129 (163)
T ss_pred ecCccccc-ccCCCCeEEEcccCCcccccc-cCC
Confidence 99999974 444566888877766665553 444
No 31
>PRK10371 DNA-binding transcriptional regulator MelR; Provisional
Probab=98.09 E-value=6.2e-06 Score=71.34 Aligned_cols=51 Identities=22% Similarity=0.324 Sum_probs=45.7
Q ss_pred ccceecCcceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCCC
Q psy840 81 YTEHLHTDEEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDIN 136 (178)
Q Consensus 81 ~~eH~H~~dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~~ 136 (178)
..+|.|.+-|+.|+++|++.|.++++. +.+.+||+++||+|+.|.+...++
T Consensus 39 ~~~HwH~e~Ei~yv~~G~~~~~i~g~~-----~~l~~Gd~ili~s~~~H~~~~~~~ 89 (302)
T PRK10371 39 PTSHWHGQVEVNVPFDGDVEYLINNEK-----VQINQGHITLFWACTPHQLTDPGN 89 (302)
T ss_pred CCCCccccEEEEEecCCcEEEEECCEE-----EEEcCCcEEEEecCCcccccccCC
Confidence 589999999999999999999997643 899999999999999999876554
No 32
>PRK13502 transcriptional activator RhaR; Provisional
Probab=98.08 E-value=8.1e-06 Score=68.79 Aligned_cols=52 Identities=29% Similarity=0.411 Sum_probs=46.0
Q ss_pred ccceecCcceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCCCC
Q psy840 81 YTEHLHTDEEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDINN 137 (178)
Q Consensus 81 ~~eH~H~~dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~~~ 137 (178)
+..|.|+.-|+.|+++|+|.+.++++. ..+++||+++||+|..|.+...++.
T Consensus 31 ~~~H~h~~~~l~~v~~G~~~~~i~~~~-----~~l~~g~l~li~~~~~H~~~~~~~~ 82 (282)
T PRK13502 31 FAEHTHEFCELVMVWRGNGLHVLNERP-----YRITRGDLFYIRAEDKHSYTSVNDL 82 (282)
T ss_pred CCccccceEEEEEEecCcEEEEECCEE-----EeecCCcEEEECCCCcccccccCCc
Confidence 678999999999999999999996533 8999999999999999999765543
No 33
>COG4101 Predicted mannose-6-phosphate isomerase [Carbohydrate transport and metabolism]
Probab=98.07 E-value=2.9e-05 Score=60.29 Aligned_cols=74 Identities=24% Similarity=0.269 Sum_probs=56.4
Q ss_pred eeeEEEEcCCCCCCHHHHhhhhccceecCc-ceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCCCC
Q psy840 59 YEDEITCSKDCLPNYEEKLKSFYTEHLHTD-EEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDINN 137 (178)
Q Consensus 59 ~~Dvv~l~~~~~p~~e~~~~~f~~eH~H~~-dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~~~ 137 (178)
++-++++.|+. | -..|.|.. |-+.|+++|+.....++.=. .-+.+++||++.||+|++|--...++.
T Consensus 47 ~~~~vTi~pgA------k----akaH~H~~hEtaIYvlsG~ah~w~G~rLE--~ha~~~pGDf~YiPpgVPHqp~N~S~e 114 (142)
T COG4101 47 CMHLVTIPPGA------K----AKAHLHEEHETAIYVLSGEAHTWYGNRLE--EHAEVGPGDFFYIPPGVPHQPANLSTE 114 (142)
T ss_pred eEEEEeeCCCc------c----ccccccccccEEEEEEeceeeeeecccee--eeEEecCCCeEEcCCCCCCcccccCCC
Confidence 67889988863 1 26799998 88899999999988765322 248999999999999999997665555
Q ss_pred cEEEEEE
Q psy840 138 YVKAKRY 144 (178)
Q Consensus 138 ~~~alrl 144 (178)
.+.|+-.
T Consensus 115 p~s~vIa 121 (142)
T COG4101 115 PLSAVIA 121 (142)
T ss_pred CeEEEEE
Confidence 5555433
No 34
>PF01050 MannoseP_isomer: Mannose-6-phosphate isomerase; InterPro: IPR001538 Mannose-6-phosphate isomerase or phosphomannose isomerase (5.3.1.8 from EC) (PMI) is the enzyme that catalyses the interconversion of mannose-6-phosphate and fructose-6-phosphate. In eukaryotes PMI is involved in the synthesis of GDP-mannose, a constituent of N- and O-linked glycans and GPI anchors and in prokaryotes it participates in a variety of pathways, including capsular polysaccharide biosynthesis and D-mannose metabolism. PMI's belong to the cupin superfamily whose functions range from isomerase and epimerase activities involved in the modification of cell wall carbohydrates in bacteria and plants, to non-enzymatic storage proteins in plant seeds, and transcription factors linked to congenital baldness in mammals []. Three classes of PMI have been defined []. The type II phosphomannose isomerases are bifunctional enzymes 5.3.1.8 from EC. This entry covers the isomerase region of the protein []. The guanosine diphospho-D-mannose pyrophosphorylase region is described in another InterPro entry (see IPR005836 from INTERPRO).; GO: 0016779 nucleotidyltransferase activity, 0005976 polysaccharide metabolic process
Probab=98.04 E-value=2.7e-05 Score=62.03 Aligned_cols=80 Identities=21% Similarity=0.304 Sum_probs=61.5
Q ss_pred HhcCCCeeeEEEEcCCCCCCHHHHhhhhccceecCc-ceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeee
Q psy840 53 KDRNYNYEDEITCSKDCLPNYEEKLKSFYTEHLHTD-EEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRF 131 (178)
Q Consensus 53 ~~~gy~~~Dvv~l~~~~~p~~e~~~~~f~~eH~H~~-dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f 131 (178)
...+|+. -.+++.|+ +-+..|.|.. .|.++|++|+|.+.+.+.. ..+.+||.+.||+|..|+.
T Consensus 59 ~~~~~~v-kri~V~pG----------~~lSlq~H~~R~E~W~Vv~G~a~v~~~~~~-----~~~~~g~sv~Ip~g~~H~i 122 (151)
T PF01050_consen 59 EGEGYKV-KRITVNPG----------KRLSLQYHHHRSEHWTVVSGTAEVTLDDEE-----FTLKEGDSVYIPRGAKHRI 122 (151)
T ss_pred ccCCEEE-EEEEEcCC----------CccceeeecccccEEEEEeCeEEEEECCEE-----EEEcCCCEEEECCCCEEEE
Confidence 3445653 44555554 2367888887 9999999999999996432 7899999999999999999
Q ss_pred ecCCCCcEEEEEEeecC
Q psy840 132 TLDINNYVKAKRYFIGE 148 (178)
Q Consensus 132 ~~~~~~~~~alrlF~~~ 148 (178)
.+.++..+..|-+-.++
T Consensus 123 ~n~g~~~L~~IEVq~G~ 139 (151)
T PF01050_consen 123 ENPGKTPLEIIEVQTGE 139 (151)
T ss_pred ECCCCcCcEEEEEecCC
Confidence 88777667777766553
No 35
>COG3837 Uncharacterized conserved protein, contains double-stranded beta-helix domain [Function unknown]
Probab=98.02 E-value=1.5e-05 Score=64.08 Aligned_cols=65 Identities=22% Similarity=0.287 Sum_probs=54.0
Q ss_pred ccceecCc-ceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCC--CeeeeecCCCCcEEEEEEeecCCc
Q psy840 81 YTEHLHTD-EEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAG--IYHRFTLDINNYVKAKRYFIGEPV 150 (178)
Q Consensus 81 ~~eH~H~~-dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG--~~H~f~~~~~~~~~alrlF~~~~g 150 (178)
-..|.|.. ||.+|||+|++.+.+.+. +..++|||.+--||| +-|-|...++.-++.|.+-+.++-
T Consensus 56 s~~H~Hs~edEfv~ILeGE~~l~~d~~-----e~~lrpGD~~gFpAG~~~aHhliN~s~~~~~yL~vG~r~~~ 123 (161)
T COG3837 56 SLRHWHSAEDEFVYILEGEGTLREDGG-----ETRLRPGDSAGFPAGVGNAHHLINRSDVILRYLEVGTREPD 123 (161)
T ss_pred ccccccccCceEEEEEcCceEEEECCe-----eEEecCCceeeccCCCcceeEEeecCCceEEEEEecccccc
Confidence 35677765 899999999999988653 389999999999999 999999988877877777666553
No 36
>TIGR02297 HpaA 4-hydroxyphenylacetate catabolism regulatory protein HpaA. This putative transcriptional regulator, which contains both the substrate-binding, dimerization domain (pfam02311) and the helix-turn-helix DNA-binding domain (pfam00165) of the AraC famil, is located proximal to genes of the 4-hydroxyphenylacetate catabolism pathway.
Probab=97.98 E-value=1.5e-05 Score=67.04 Aligned_cols=58 Identities=14% Similarity=0.240 Sum_probs=48.4
Q ss_pred ccceecC-cceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCCCCcEEEEE
Q psy840 81 YTEHLHT-DEEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDINNYVKAKR 143 (178)
Q Consensus 81 ~~eH~H~-~dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~~~~~~alr 143 (178)
...|.|+ .-|+.|+++|++.+.+.+. ...+++||+++||+|+.|.+...++....++.
T Consensus 36 ~~~H~H~~~~~l~~~~~G~~~~~~~~~-----~~~l~~g~~~ii~~~~~H~~~~~~~~~~~~i~ 94 (287)
T TIGR02297 36 MPVHFHDRYYQLHYLTEGSIALQLDEH-----EYSEYAPCFFLTPPSVPHGFVTDLDADGHVLT 94 (287)
T ss_pred CCCcccccceeEEEEeeCceEEEECCE-----EEEecCCeEEEeCCCCccccccCCCcceEEEE
Confidence 5789998 6999999999999999653 38999999999999999999876654444444
No 37
>TIGR03214 ura-cupin putative allantoin catabolism protein. This model represents a protein containing a tandem arrangement of cupin domains (N-terminal part of pfam07883 and C-terminal more distantly related to pfam00190). This protein is found in the vicinity of genes involved in the catabolism of allantoin, a breakdown product of urate and sometimes of urate iteslf. The distribution of pathway components in the genomes in which this family is observed suggests that the function is linked to the allantoate catabolism to glyoxylate pathway (GenProp0686) since it is sometimes found in genomes lacking any elements of the xanthine-to-allantoin pathways (e.g. in Enterococcus faecalis).
Probab=97.97 E-value=3.9e-05 Score=65.70 Aligned_cols=63 Identities=11% Similarity=0.111 Sum_probs=47.6
Q ss_pred ceecCc-ceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCCCCcEEEE---EEeecCCc
Q psy840 83 EHLHTD-EEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDINNYVKAK---RYFIGEPV 150 (178)
Q Consensus 83 eH~H~~-dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~~~~~~al---rlF~~~~g 150 (178)
.|.|+. +|..||++|++.+.+.+ + ...+++||.+.+|+|..|+|...++..++.+ +-|..-+|
T Consensus 74 ~~~~~g~ee~iyVl~G~l~v~~~g---~--~~~L~~Gd~~y~pa~~~H~~~N~~~~~a~~l~v~k~y~~~~g 140 (260)
T TIGR03214 74 GFGGEGIETFLFVISGEVNVTAEG---E--THELREGGYAYLPPGSKWTLANAQAEDARFFLYKKRYQPVEG 140 (260)
T ss_pred CCCCCceEEEEEEEeCEEEEEECC---E--EEEECCCCEEEECCCCCEEEEECCCCCEEEEEEEeeeEEcCC
Confidence 455776 89999999999988854 3 2799999999999999999987665444433 34444444
No 38
>PRK11171 hypothetical protein; Provisional
Probab=97.97 E-value=4.3e-05 Score=65.59 Aligned_cols=51 Identities=14% Similarity=0.202 Sum_probs=41.7
Q ss_pred CcceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCCCCcEEEE
Q psy840 87 TDEEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDINNYVKAK 142 (178)
Q Consensus 87 ~~dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~~~~~~al 142 (178)
..+|+.||++|++.+.+.++ ...+++||.+.+|+|+.|+|....+..+..+
T Consensus 82 ~~eE~~~VlsG~l~v~~~g~-----~~~L~~GDsi~~p~~~~H~~~N~g~~~a~~l 132 (266)
T PRK11171 82 GAETFLFVVEGEITLTLEGK-----THALSEGGYAYLPPGSDWTLRNAGAEDARFH 132 (266)
T ss_pred CceEEEEEEeCEEEEEECCE-----EEEECCCCEEEECCCCCEEEEECCCCCEEEE
Confidence 45899999999999998643 3899999999999999999987555444444
No 39
>TIGR02451 anti_sig_ChrR anti-sigma factor, putative, ChrR family. The member of this family from Rhodobacter sphaeroides has been shown both to form a complex with sigma(E) and to negatively regulate tetrapyrrole biosynthesis. This protein likely contains (at least) two distinct functional domains; several smaller homologs (excluded by the model) show homology only to the C-terminal, including a motif PxHxHxGxE.
Probab=97.79 E-value=4.3e-05 Score=63.86 Aligned_cols=72 Identities=15% Similarity=0.031 Sum_probs=56.6
Q ss_pred CeeeEEEEcCCCCCCHHHHhhhhccceecCcceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCCCC
Q psy840 58 NYEDEITCSKDCLPNYEEKLKSFYTEHLHTDEEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDINN 137 (178)
Q Consensus 58 ~~~Dvv~l~~~~~p~~e~~~~~f~~eH~H~~dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~~~ 137 (178)
...-++.+.|+. -+..|+|...|+.+||+|+. .+++ -...+||+|.+|+|..|.+.+.++.
T Consensus 127 ~~v~Ll~i~pG~----------~~p~H~H~G~E~tlVLeG~f----~de~-----g~y~~Gd~i~~p~~~~H~p~a~~~~ 187 (215)
T TIGR02451 127 ARVRLLYIEAGQ----------SIPQHTHKGFELTLVLHGAF----SDET-----GVYGVGDFEEADGSVQHQPRTVSGG 187 (215)
T ss_pred cEEEEEEECCCC----------ccCCCcCCCcEEEEEEEEEE----EcCC-----CccCCCeEEECCCCCCcCcccCCCC
Confidence 455677777652 37899999999999999993 3434 4689999999999999999998877
Q ss_pred cEEEEEEeecC
Q psy840 138 YVKAKRYFIGE 148 (178)
Q Consensus 138 ~~~alrlF~~~ 148 (178)
.+.++-+..++
T Consensus 188 ~Cicl~v~dap 198 (215)
T TIGR02451 188 DCLCLAVLDAP 198 (215)
T ss_pred CeEEEEEecCC
Confidence 67777665554
No 40
>COG3435 Gentisate 1,2-dioxygenase [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=97.77 E-value=2.7e-05 Score=68.63 Aligned_cols=73 Identities=23% Similarity=0.337 Sum_probs=55.3
Q ss_pred eeEEEEcCCCCCCHHHHhh------------hhccceecCcceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCC
Q psy840 60 EDEITCSKDCLPNYEEKLK------------SFYTEHLHTDEEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGI 127 (178)
Q Consensus 60 ~Dvv~l~~~~~p~~e~~~~------------~f~~eH~H~~dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~ 127 (178)
+-++.+...++++...+-. ..-+.|.|...-+|||++|.|.|++.+.+ ++.+++||+|+.|+++
T Consensus 72 RRvi~L~NP~l~g~ssiT~TLyAglQlilPGEvApsHrHsqsAlRFvveG~Ga~T~VdGe----r~~M~~GDfilTP~w~ 147 (351)
T COG3435 72 RRVIYLENPGLRGRSSITPTLYAGLQLILPGEVAPSHRHNQSALRFVVEGKGAYTVVDGE----RTPMEAGDFILTPAWT 147 (351)
T ss_pred eEEEEecCCCCCCcccccHHHHhhhheecCcccCCcccccccceEEEEeccceeEeecCc----eeeccCCCEEEccCce
Confidence 4566665554565422222 24689999999999999999999998743 5999999999999999
Q ss_pred eeeeecCCC
Q psy840 128 YHRFTLDIN 136 (178)
Q Consensus 128 ~H~f~~~~~ 136 (178)
.|-.-....
T Consensus 148 wHdHgn~g~ 156 (351)
T COG3435 148 WHDHGNEGT 156 (351)
T ss_pred eccCCCCCC
Confidence 998755433
No 41
>PF12973 Cupin_7: ChrR Cupin-like domain; PDB: 3O14_B 2Z2S_F 2Q1Z_B 3EBR_A.
Probab=97.75 E-value=7.1e-05 Score=53.94 Aligned_cols=59 Identities=24% Similarity=0.225 Sum_probs=43.3
Q ss_pred eeeEEEEcCCCCCCHHHHhhhhccceecCcceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCCC
Q psy840 59 YEDEITCSKDCLPNYEEKLKSFYTEHLHTDEEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDIN 136 (178)
Q Consensus 59 ~~Dvv~l~~~~~p~~e~~~~~f~~eH~H~~dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~~ 136 (178)
..-++.+.|+ ..++.|.|+..|-.|||+|+.. +.++ .+.+||++..|+|+.|.+...+.
T Consensus 25 ~~~L~r~~pG----------~~~p~H~H~g~ee~~VLeG~~~----d~~~-----~~~~G~~~~~p~g~~h~~~s~~g 83 (91)
T PF12973_consen 25 RVSLLRLEPG----------ASLPRHRHPGGEEILVLEGELS----DGDG-----RYGAGDWLRLPPGSSHTPRSDEG 83 (91)
T ss_dssp EEEEEEE-TT----------EEEEEEEESS-EEEEEEECEEE----ETTC-----EEETTEEEEE-TTEEEEEEESSC
T ss_pred EEEEEEECCC----------CCcCccCCCCcEEEEEEEEEEE----ECCc-----cCCCCeEEEeCCCCccccCcCCC
Confidence 4566666664 3478999999888899999965 2232 45999999999999999996543
No 42
>TIGR02272 gentisate_1_2 gentisate 1,2-dioxygenase. This family consists of gentisate 1,2-dioxygenases. This ring-opening enzyme acts in salicylate degradation that goes via gentisate rather than via catechol. It converts gentisate to maleylpyruvate. Some putative gentisate 1,2-dioxygenases are excluded by a relatively high trusted cutoff score because they are too closely related to known examples of 1-hydroxy-2-naphthoate dioxygenase. Therefore some homologs may be bona fide gentisate 1,2-dioxygenases even if they score below the given cutoffs.
Probab=97.69 E-value=0.0001 Score=65.79 Aligned_cols=56 Identities=27% Similarity=0.487 Sum_probs=47.2
Q ss_pred ccceecCcceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCCCCcEE
Q psy840 81 YTEHLHTDEEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDINNYVK 140 (178)
Q Consensus 81 ~~eH~H~~dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~~~~~~ 140 (178)
-..|.|...-++||++|+|.|.+-+ |+ ++.+++||++++|++..|.....++..+.
T Consensus 94 ~~~HRht~sAl~~vveG~G~~t~V~--g~--~~~~~~gD~~~tP~w~wH~H~n~~d~~~~ 149 (335)
T TIGR02272 94 APSHRHTQSALRFIVEGKGAFTAVD--GE--RTTMHPGDFIITPSWTWHDHGNPGDEPMI 149 (335)
T ss_pred CCccccccceEEEEEEcCceEEEEC--CE--EEeeeCCCEEEeCCCeeEecccCCCCcEE
Confidence 5689999999999999999877754 33 69999999999999999998877664443
No 43
>PLN00212 glutelin; Provisional
Probab=97.55 E-value=0.00083 Score=62.74 Aligned_cols=86 Identities=9% Similarity=0.093 Sum_probs=64.9
Q ss_pred EEEEcCCCCCCHHHHh----------hhhccceecCc-ceEEEEecceEEEEEEeCC-CcEEEEEEeCCcEEEeCCCCee
Q psy840 62 EITCSKDCLPNYEEKL----------KSFYTEHLHTD-EEIRLVLDGSGYFDVRDKF-DDWIRIEVTKGDLIIIPAGIYH 129 (178)
Q Consensus 62 vv~l~~~~~p~~e~~~----------~~f~~eH~H~~-dEi~~il~G~g~f~v~~~~-d~~~ri~~~~GDli~vPaG~~H 129 (178)
+.++++.++|.+..+- ...+..|.|+. .+|.||++|+|...|.+.. ..++.-.+++||+++||+|-.|
T Consensus 332 it~v~~~~~P~L~~L~LSa~rv~L~~gam~~PHwn~nA~eI~yV~rG~g~vqvV~~~g~~vf~~~L~~GdvfVVPqg~~v 411 (493)
T PLN00212 332 ITRLNSQKFPILNLIQMSATRVNLYQNALLSPFWNVNAHSVVYITQGRARVQVVSNNGKTVFNGVLRPGQLLIIPQHYAV 411 (493)
T ss_pred EEEechhhCccccccCeeEEEEEEcCCcccCCeecCCCCEEEEEeecceEEEEEcCCCCEEEEEEEcCCCEEEECCCCeE
Confidence 3455565566542221 23589999998 8999999999999998755 4688889999999999999999
Q ss_pred eeecCCCCcEEEEEEeecC
Q psy840 130 RFTLDINNYVKAKRYFIGE 148 (178)
Q Consensus 130 ~f~~~~~~~~~alrlF~~~ 148 (178)
--.++.++ +..+-+.+..
T Consensus 412 ~~~A~~eg-fe~v~F~tna 429 (493)
T PLN00212 412 LKKAEREG-CQYIAFKTNA 429 (493)
T ss_pred EEeecCCc-eEEEEeecCC
Confidence 87777554 6666555554
No 44
>PF05899 Cupin_3: Protein of unknown function (DUF861); InterPro: IPR008579 The function of the proteins in this entry are unknown. They contain the conserved barrel domain of the 'cupin' superfamily and members are specific to plants and bacteria.; PDB: 1RC6_A 3MYX_A 1O5U_A 2K9Z_A 1LKN_A 3ES4_A 1SFN_B 3BCW_A.
Probab=97.55 E-value=0.00013 Score=51.23 Aligned_cols=49 Identities=16% Similarity=0.236 Sum_probs=37.5
Q ss_pred cceecCcceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCC
Q psy840 82 TEHLHTDEEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDI 135 (178)
Q Consensus 82 ~eH~H~~dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~ 135 (178)
..+... +|..|||+|++.+... ++. ++.++|||++++|+|..-..+..+
T Consensus 20 ~~~~~~-~E~~~vleG~v~it~~--~G~--~~~~~aGD~~~~p~G~~~~w~v~~ 68 (74)
T PF05899_consen 20 PWPYPE-DEFFYVLEGEVTITDE--DGE--TVTFKAGDAFFLPKGWTGTWEVRE 68 (74)
T ss_dssp EEEESS-EEEEEEEEEEEEEEET--TTE--EEEEETTEEEEE-TTEEEEEEEEE
T ss_pred EeeCCC-CEEEEEEEeEEEEEEC--CCC--EEEEcCCcEEEECCCCEEEEEECe
Confidence 344433 9999999999987764 454 499999999999999987776554
No 45
>COG4766 EutQ Ethanolamine utilization protein [Amino acid transport and metabolism]
Probab=97.46 E-value=0.00045 Score=55.66 Aligned_cols=74 Identities=20% Similarity=0.227 Sum_probs=55.4
Q ss_pred CCCeeeEEEEcCCCCC---CHHHHhhhhccceecCcceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeee
Q psy840 56 NYNYEDEITCSKDCLP---NYEEKLKSFYTEHLHTDEEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFT 132 (178)
Q Consensus 56 gy~~~Dvv~l~~~~~p---~~e~~~~~f~~eH~H~~dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~ 132 (178)
+.-..|+++...++ + +|-+....+|+|- -..||+-||++|+....+.++. +..+|||+|.||.|-.--|+
T Consensus 85 ~V~~tdLvt~~~g~-~l~aG~m~~~~~tf~wt-l~yDe~d~VlEGrL~V~~~g~t-----v~a~aGDvifiPKgssIefs 157 (176)
T COG4766 85 CVYTTDLVTEQEGS-RLGAGLMEMKNTTFPWT-LNYDEIDYVLEGRLHVRIDGRT-----VIAGAGDVIFIPKGSSIEFS 157 (176)
T ss_pred eEEeeceeecccCC-ccccceeeeccccCcce-ecccceeEEEeeeEEEEEcCCe-----EecCCCcEEEecCCCeEEEe
Confidence 33455777765542 2 2555555778774 4679999999999887776543 89999999999999999998
Q ss_pred cCCC
Q psy840 133 LDIN 136 (178)
Q Consensus 133 ~~~~ 136 (178)
...+
T Consensus 158 t~ge 161 (176)
T COG4766 158 TTGE 161 (176)
T ss_pred ccce
Confidence 8776
No 46
>TIGR02272 gentisate_1_2 gentisate 1,2-dioxygenase. This family consists of gentisate 1,2-dioxygenases. This ring-opening enzyme acts in salicylate degradation that goes via gentisate rather than via catechol. It converts gentisate to maleylpyruvate. Some putative gentisate 1,2-dioxygenases are excluded by a relatively high trusted cutoff score because they are too closely related to known examples of 1-hydroxy-2-naphthoate dioxygenase. Therefore some homologs may be bona fide gentisate 1,2-dioxygenases even if they score below the given cutoffs.
Probab=97.42 E-value=0.0013 Score=58.74 Aligned_cols=88 Identities=15% Similarity=0.119 Sum_probs=63.2
Q ss_pred ChHHHHHHHH---hcCCCeeeEEEEcCCCCCCHHHHhhh---------hccceecCcceEEEEecceEEEEEEeCCCcEE
Q psy840 44 QDEVLTKLRK---DRNYNYEDEITCSKDCLPNYEEKLKS---------FYTEHLHTDEEIRLVLDGSGYFDVRDKFDDWI 111 (178)
Q Consensus 44 ~~~~l~~l~~---~~gy~~~Dvv~l~~~~~p~~e~~~~~---------f~~eH~H~~dEi~~il~G~g~f~v~~~~d~~~ 111 (178)
..+.|+++.+ ..+|...-+--++|.+-+.....++. --..|.|+...|++|++|+|+-.|+++
T Consensus 214 ~~~aL~~~~~~~~~~~~~g~~l~y~NP~TG~~~~pti~~~~q~L~~G~~t~~~r~T~s~Vf~VieG~G~s~ig~~----- 288 (335)
T TIGR02272 214 SREALDDLTRTGEWDPWHGLKLRYVNPATGGYPMPTIGAFIQLLPKGFRTATYRSTDATVFCVVEGRGQVRIGDA----- 288 (335)
T ss_pred HHHHHHHHHhccCCCCCceEEEEEeCCCCCCCcchhHHHHHhccCCCCCCCCccccccEEEEEEeCeEEEEECCE-----
Confidence 4566777654 34565555566677654443222222 255799999999999999999999652
Q ss_pred EEEEeCCcEEEeCCCCeeeeecCCC
Q psy840 112 RIEVTKGDLIIIPAGIYHRFTLDIN 136 (178)
Q Consensus 112 ri~~~~GDli~vPaG~~H~f~~~~~ 136 (178)
++..++||+++||+...|.+..+++
T Consensus 289 ~~~W~~gD~f~vPsW~~~~h~a~~d 313 (335)
T TIGR02272 289 VFRFSPKDVFVVPSWHPVRFEASDD 313 (335)
T ss_pred EEEecCCCEEEECCCCcEecccCCC
Confidence 4999999999999998888877654
No 47
>PF06052 3-HAO: 3-hydroxyanthranilic acid dioxygenase; InterPro: IPR010329 Members of this protein family, from both bacteria and eukaryotes, are the enzyme 3-hydroxyanthranilate 3,4-dioxygenase (1.13.11.6 from EC). It is part of the kynurenine pathway for the degradation of tryptophan and the biosynthesis of nicotinic acid [].The prokaryotic homologue is involved in the 2-nitrobenzoate degradation pathway []. The enzyme acts on the tryptophan metabolite 3-hydroxyanthranilate and produces 2-amino-3-carboxymuconate semialdehyde, which can rearrange spontaneously to quinolinic acid and feed into nicotinamide biosynthesis, or undergo further enzymatic degradation.; GO: 0000334 3-hydroxyanthranilate 3,4-dioxygenase activity, 0005506 iron ion binding, 0008152 metabolic process, 0055114 oxidation-reduction process; PDB: 1ZVF_A 1YFX_A 1YFW_A 1YFY_A 1YFU_A 2QNK_A 3FE5_A.
Probab=97.40 E-value=0.00099 Score=53.29 Aligned_cols=54 Identities=20% Similarity=0.368 Sum_probs=40.8
Q ss_pred cceecCcceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCCC
Q psy840 82 TEHLHTDEEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDIN 136 (178)
Q Consensus 82 ~eH~H~~dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~~ 136 (178)
..|.-+.+|.+|-++|.....+.+ +|+.-.|.++.||+..+|++++|...-.++
T Consensus 47 DyHine~eE~FyQ~kG~m~Lkv~e-~g~~kdi~I~EGe~fLLP~~vpHsP~R~~~ 100 (151)
T PF06052_consen 47 DYHINETEEFFYQLKGDMCLKVVE-DGKFKDIPIREGEMFLLPANVPHSPQRPAD 100 (151)
T ss_dssp SEEE-SS-EEEEEEES-EEEEEEE-TTEEEEEEE-TTEEEEE-TT--EEEEE-TT
T ss_pred ccccCCcceEEEEEeCcEEEEEEe-CCceEEEEeCCCcEEecCCCCCCCCcCCCC
Confidence 789999999999999999999987 567778999999999999999999876553
No 48
>PF05523 FdtA: WxcM-like, C-terminal ; InterPro: IPR008894 This entry includes FdtA (Q6T1W8 from SWISSPROT) from Aneurinibacillus thermoaerophilus, which has been characterised as a dTDP-6-deoxy-3,4-keto-hexulose isomerase []. It also includes WxcM (Q93S92 from SWISSPROT) from Xanthomonas campestris pv campestris) []. ; PDB: 2PAK_A 2PAE_A 2PA7_B 2PAM_A.
Probab=97.13 E-value=0.0026 Score=49.28 Aligned_cols=57 Identities=19% Similarity=0.227 Sum_probs=36.5
Q ss_pred ccceecCc-ceEEEEecceEEEEEEeCCCcEEEEEEeCCc-EEEeCCCCeeeeecCCCCc
Q psy840 81 YTEHLHTD-EEIRLVLDGSGYFDVRDKFDDWIRIEVTKGD-LIIIPAGIYHRFTLDINNY 138 (178)
Q Consensus 81 ~~eH~H~~-dEi~~il~G~g~f~v~~~~d~~~ri~~~~GD-li~vPaG~~H~f~~~~~~~ 138 (178)
-.+|.|.. .|.+++++|+..+.+.+.... -.+.+...+ .|.||+|+.|.+..-+++.
T Consensus 46 RG~H~Hk~~~~~~~~l~Gs~~v~~~d~~~~-~~~~L~~~~~~L~Ippg~w~~~~~~s~~s 104 (131)
T PF05523_consen 46 RGWHAHKKTTQWFIVLSGSFKVVLDDGREE-EEFILDEPNKGLYIPPGVWHGIKNFSEDS 104 (131)
T ss_dssp EEEEEESS--EEEEEEES-EEEEEE-SS-E-EEEEE--TTEEEEE-TT-EEEEE---TT-
T ss_pred ccccccccccEEEEEEeCEEEEEEecCCCc-EEEEECCCCeEEEECCchhhHhhccCCCc
Confidence 36999977 999999999999999874433 456666554 7999999999997666553
No 49
>PF06249 EutQ: Ethanolamine utilisation protein EutQ; InterPro: IPR010424 The eut operon of Salmonella typhimurium encodes proteins involved in the cobalamin-dependent degradation of ethanolamine. The role of EutQ in this process is unclear [].; PDB: 2PYT_B 3LWC_A.
Probab=97.12 E-value=0.00073 Score=54.19 Aligned_cols=45 Identities=29% Similarity=0.379 Sum_probs=32.9
Q ss_pred cCcceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCC
Q psy840 86 HTDEEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDI 135 (178)
Q Consensus 86 H~~dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~ 135 (178)
=..||+-||++|+.... . +|+ .+..+|||+|.||+|+.=.|....
T Consensus 93 l~YDEi~~VlEG~L~i~--~-~G~--~~~A~~GDvi~iPkGs~I~fst~~ 137 (152)
T PF06249_consen 93 LTYDEIKYVLEGTLEIS--I-DGQ--TVTAKPGDVIFIPKGSTITFSTPD 137 (152)
T ss_dssp -SSEEEEEEEEEEEEEE--E-TTE--EEEEETT-EEEE-TT-EEEEEEEE
T ss_pred eecceEEEEEEeEEEEE--E-CCE--EEEEcCCcEEEECCCCEEEEecCC
Confidence 35699999999975554 4 355 389999999999999999997654
No 50
>PF12852 Cupin_6: Cupin
Probab=96.82 E-value=0.003 Score=50.53 Aligned_cols=44 Identities=27% Similarity=0.534 Sum_probs=35.4
Q ss_pred eEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCCC
Q psy840 90 EIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDIN 136 (178)
Q Consensus 90 Ei~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~~ 136 (178)
-+.+|++|++++.+.+.+ . .+.+++||++++|.|..|++.-+++
T Consensus 37 ~fh~V~~G~~~l~~~~~~-~--~~~L~~GDivllp~g~~H~l~~~~~ 80 (186)
T PF12852_consen 37 SFHVVLRGSCWLRVPGGG-E--PIRLEAGDIVLLPRGTAHVLSSDPD 80 (186)
T ss_pred EEEEEECCeEEEEEcCCC-C--eEEecCCCEEEEcCCCCeEeCCCCC
Confidence 457899999999986522 2 3999999999999999999954443
No 51
>COG3450 Predicted enzyme of the cupin superfamily [General function prediction only]
Probab=96.61 E-value=0.0038 Score=48.00 Aligned_cols=50 Identities=22% Similarity=0.340 Sum_probs=39.3
Q ss_pred cceecCc-ceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCC
Q psy840 82 TEHLHTD-EEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDI 135 (178)
Q Consensus 82 ~eH~H~~-dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~ 135 (178)
.++.+-. +|..+||+|.+.+.-++ |+ .+.+++||.+++|+|..=.-+..+
T Consensus 56 ~~r~~y~~~E~chil~G~v~~T~d~--Ge--~v~~~aGD~~~~~~G~~g~W~V~E 106 (116)
T COG3450 56 KFRVTYDEDEFCHILEGRVEVTPDG--GE--PVEVRAGDSFVFPAGFKGTWEVLE 106 (116)
T ss_pred cceEEcccceEEEEEeeEEEEECCC--Ce--EEEEcCCCEEEECCCCeEEEEEee
Confidence 3555533 89999999999988765 45 389999999999999876655544
No 52
>PRK10572 DNA-binding transcriptional regulator AraC; Provisional
Probab=96.60 E-value=0.005 Score=52.14 Aligned_cols=51 Identities=18% Similarity=0.268 Sum_probs=41.6
Q ss_pred ccceecCcceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCCC
Q psy840 81 YTEHLHTDEEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDIN 136 (178)
Q Consensus 81 ~~eH~H~~dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~~ 136 (178)
...+....-++.++++|+|.+.+.++ +..+++||+|++|+|+.|.+...++
T Consensus 42 ~r~~~~~~~~i~~~~~G~~~~~~~~~-----~~~~~~g~~i~i~p~~~h~~~~~~~ 92 (290)
T PRK10572 42 DRPLGMKGYILNLTIRGQGVIFNGGR-----AFVCRPGDLLLFPPGEIHHYGRHPD 92 (290)
T ss_pred ecCCCccceEEEEEEeccEEEecCCe-----eEecCCCCEEEECCCCceeeccCCC
Confidence 35566666788999999999987543 3899999999999999999876544
No 53
>PF04209 HgmA: homogentisate 1,2-dioxygenase; InterPro: IPR005708 Alkaptonuria (AKU), a rare hereditary disorder, was the first disease to be interpreted as an inborn error of metabolism. The deficiency causes homogentisic aciduria, ochronosis, and arthritis. AKU patients are deficient for homogentisate 1,2 dioxygenase (1.13.11.5 from EC), the enzyme that mediates the conversion of homogentisate to maleylacetoacetate; a step in the catabolism of both tyrosine and phenylalanine. Homogentisate + O(2) = 4-maleylacetoacetate. ; GO: 0004411 homogentisate 1,2-dioxygenase activity, 0006559 L-phenylalanine catabolic process, 0006570 tyrosine metabolic process, 0055114 oxidation-reduction process; PDB: 1EY2_A 1EYB_A.
Probab=96.59 E-value=0.0044 Score=56.96 Aligned_cols=57 Identities=23% Similarity=0.264 Sum_probs=39.7
Q ss_pred ecCcceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCCCCcEEEEEEee
Q psy840 85 LHTDEEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDINNYVKAKRYFI 146 (178)
Q Consensus 85 ~H~~dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~~~~~~alrlF~ 146 (178)
-.+.||+.++-+|++.+..+- | .+.+++||+++||.||.++..+.+....-++..|.
T Consensus 143 NaDGD~Li~~q~G~l~l~Te~--G---~L~v~pGd~~VIPRG~~~rv~l~~p~rgyi~E~~~ 199 (424)
T PF04209_consen 143 NADGDELIFPQQGSLRLETEF--G---RLDVRPGDYVVIPRGTRFRVELPGPARGYIIENFG 199 (424)
T ss_dssp ESSEEEEEEEEES-EEEEETT--E---EEEE-TTEEEEE-TT--EEEE-SSSEEEEEEEEES
T ss_pred cCCCCEEEEEEECCEEEEecC--e---eEEEcCCeEEEECCeeEEEEEeCCCceEEEEEcCC
Confidence 345599999999999887754 3 38999999999999999999999655555555553
No 54
>TIGR01015 hmgA homogentisate 1,2-dioxygenase. Missing in human disease alkaptonuria.
Probab=96.27 E-value=0.015 Score=53.46 Aligned_cols=56 Identities=13% Similarity=0.106 Sum_probs=45.3
Q ss_pred cCcceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCCCCcEEEEEEee
Q psy840 86 HTDEEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDINNYVKAKRYFI 146 (178)
Q Consensus 86 H~~dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~~~~~~alrlF~ 146 (178)
.+.|++.++-+|++.+..+- |. +.++|||+++||.||.++.++.+....-++..|.
T Consensus 146 aDGD~Livpq~G~l~i~TEf--G~---L~v~pgei~VIPRG~~frv~l~gp~rgyi~E~~g 201 (429)
T TIGR01015 146 ADGDFLIVPQQGALLITTEF--GR---LLVEPNEICVIPRGVRFRVTVLEPARGYICEVYG 201 (429)
T ss_pred cCCCEEEEEEeCcEEEEEec--cc---eEecCCCEEEecCccEEEEeeCCCceEEEEeccC
Confidence 45599999999999988876 33 9999999999999999999987554444555554
No 55
>PRK05341 homogentisate 1,2-dioxygenase; Provisional
Probab=96.04 E-value=0.024 Score=52.38 Aligned_cols=56 Identities=16% Similarity=0.175 Sum_probs=43.8
Q ss_pred cCcceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCC-CCcEEEEEEee
Q psy840 86 HTDEEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDI-NNYVKAKRYFI 146 (178)
Q Consensus 86 H~~dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~-~~~~~alrlF~ 146 (178)
.+.|++.++-+|++.+..+- |. +.+++||+++||.||.++..+.+ ....-++..|.
T Consensus 152 aDGD~Livpq~G~l~i~TEf--G~---L~v~pgei~VIPRG~~frv~l~~gp~rgyi~E~~g 208 (438)
T PRK05341 152 ADGELLIVPQQGRLRLATEL--GV---LDVEPGEIAVIPRGVKFRVELPDGPARGYVCENYG 208 (438)
T ss_pred CCCCEEEEEEeCCEEEEEec--cc---eEecCCCEEEEcCccEEEEecCCCCeeEEEEEecC
Confidence 44599999999999998876 33 99999999999999999999754 33343444444
No 56
>PF02373 JmjC: JmjC domain, hydroxylase; InterPro: IPR013129 Jumonji protein is required for neural tube formation in mice [].There is evidence of domain swapping within the jumonji family of transcription factors []. This domain is often associated with jmjN (see IPR003349 from INTERPRO) and belongs to the Cupin superfamily [].; PDB: 2YU2_A 2YU1_A 3AVR_A 3AVS_A 2OX0_B 2OQ6_B 2WWJ_A 2Q8D_A 3PDQ_A 2YBK_A ....
Probab=96.01 E-value=0.015 Score=42.18 Aligned_cols=29 Identities=24% Similarity=0.358 Sum_probs=19.9
Q ss_pred CcEEEEEEeCCcEEEeCCCCeeeeecCCC
Q psy840 108 DDWIRIEVTKGDLIIIPAGIYHRFTLDIN 136 (178)
Q Consensus 108 d~~~ri~~~~GDli~vPaG~~H~f~~~~~ 136 (178)
-+.+++..+|||+|+||||++|+.-..+.
T Consensus 78 i~~~~~~Q~~Ge~V~i~pg~~H~v~n~g~ 106 (114)
T PF02373_consen 78 IPVYRFVQKPGEFVFIPPGAYHQVFNLGD 106 (114)
T ss_dssp S--EEEEEETT-EEEE-TT-EEEEEESSS
T ss_pred cccccceECCCCEEEECCCceEEEEeCCc
Confidence 35677999999999999999999755554
No 57
>PLN02658 homogentisate 1,2-dioxygenase
Probab=95.72 E-value=0.039 Score=50.96 Aligned_cols=55 Identities=15% Similarity=0.260 Sum_probs=43.7
Q ss_pred cCcceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCC-CCcEEEEEEe
Q psy840 86 HTDEEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDI-NNYVKAKRYF 145 (178)
Q Consensus 86 H~~dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~-~~~~~alrlF 145 (178)
.+.|++.++-+|++.+..+- |. +.+++||+++||.||.++..+.+ ....-++..|
T Consensus 145 aDGD~Livpq~G~l~i~TEf--G~---L~v~pgei~VIPRG~~frv~l~~gp~rgyv~E~~ 200 (435)
T PLN02658 145 ADGDFLIVPQQGRLWIKTEL--GK---LQVSPGEIVVIPRGFRFAVDLPDGPSRGYVLEIF 200 (435)
T ss_pred CCCCEEEEEEeCCEEEEEec--cc---eEecCCCEEEecCccEEEEecCCCCeeEEEEeec
Confidence 45599999999999988876 33 99999999999999999999744 3344444445
No 58
>PRK09685 DNA-binding transcriptional activator FeaR; Provisional
Probab=95.17 E-value=0.058 Score=45.68 Aligned_cols=49 Identities=14% Similarity=0.094 Sum_probs=37.9
Q ss_pred ceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCCCCcEEEE
Q psy840 89 EEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDINNYVKAK 142 (178)
Q Consensus 89 dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~~~~~~al 142 (178)
--+.++++|++.+.+++.. +.+.|||++++|++.+|.+....+.....+
T Consensus 72 ~~l~~~~~G~~~~~~~g~~-----~~l~~G~~~l~~~~~p~~~~~~~~~~~~~l 120 (302)
T PRK09685 72 FFTVFQLSGHAIIEQDDRQ-----VQLAAGDITLIDASRPCSIYPQGLSEQISL 120 (302)
T ss_pred EEEEEEecceEEEEECCeE-----EEEcCCCEEEEECCCCcEeecCCCceeEEE
Confidence 4467789999999886533 899999999999999999877655333333
No 59
>PF06339 Ectoine_synth: Ectoine synthase; InterPro: IPR010462 This family consists of several bacterial ectoine synthase proteins. The ectABC genes encode the diaminobutyric acid acetyltransferase (EctA), the diaminobutyric acid aminotransferase (EctB), and the ectoine synthase (EctC). Together these proteins constitute the ectoine biosynthetic pathway [].; GO: 0016836 hydro-lyase activity, 0006596 polyamine biosynthetic process
Probab=95.11 E-value=0.12 Score=40.34 Aligned_cols=58 Identities=12% Similarity=0.104 Sum_probs=49.5
Q ss_pred CcceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCCCCcEEEEEEeecCC
Q psy840 87 TDEEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDINNYVKAKRYFIGEP 149 (178)
Q Consensus 87 ~~dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~~~~~~alrlF~~~~ 149 (178)
..-|..|+++|+|...... ++++ ..++||.+..+-..=.|..++.+ .+.++.+|.++-
T Consensus 54 nHlEAvyci~G~Gev~~~~-~G~~--~~i~pGt~YaLd~hD~H~lra~~--dm~~vCVFnPpl 111 (126)
T PF06339_consen 54 NHLEAVYCIEGEGEVEDLD-TGEV--HPIKPGTMYALDKHDRHYLRAKT--DMRLVCVFNPPL 111 (126)
T ss_pred CceEEEEEEeceEEEEEcc-CCcE--EEcCCCeEEecCCCccEEEEecC--CEEEEEEcCCCC
Confidence 4479999999999986654 5665 78999999999999999999988 589999999873
No 60
>COG3435 Gentisate 1,2-dioxygenase [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=94.62 E-value=0.061 Score=47.83 Aligned_cols=95 Identities=23% Similarity=0.361 Sum_probs=65.3
Q ss_pred ChHHHHHHHHhc------CCCeeeEEEEcCCCCCC-----HHHHh----hhh-ccceecCcceEEEEecceEEEEEEeCC
Q psy840 44 QDEVLTKLRKDR------NYNYEDEITCSKDCLPN-----YEEKL----KSF-YTEHLHTDEEIRLVLDGSGYFDVRDKF 107 (178)
Q Consensus 44 ~~~~l~~l~~~~------gy~~~Dvv~l~~~~~p~-----~e~~~----~~f-~~eH~H~~dEi~~il~G~g~f~v~~~~ 107 (178)
+++.|++|.... ||+ +=.++|.+ .+ .-+.+ .-| -..|.|.+.-|+-|++|+|+-.|.++
T Consensus 225 t~eAL~~la~~e~~dp~dG~~---~ryvNP~T-Gg~~mptI~a~mqlL~~Gf~~~~~r~t~s~iy~V~eGsg~~~Ig~~- 299 (351)
T COG3435 225 TREALERLARLEEPDPFDGYK---MRYVNPVT-GGYAMPTIGAFMQLLPPGFHGKAHRHTDSTIYHVVEGSGYTIIGGE- 299 (351)
T ss_pred HHHHHHHHHhccCCCCCCcce---EEEecCCC-CCCcCchHHHHHHhcCCcccCCceeccCCEEEEEEecceeEEECCE-
Confidence 467788888776 654 22334432 22 11111 123 45899999999999999999999763
Q ss_pred CcEEEEEEeCCcEEEeCCCCeeeeecCCCCcEEEEEEeecCC
Q psy840 108 DDWIRIEVTKGDLIIIPAGIYHRFTLDINNYVKAKRYFIGEP 149 (178)
Q Consensus 108 d~~~ri~~~~GDli~vPaG~~H~f~~~~~~~~~alrlF~~~~ 149 (178)
|....+||+++||.--.|.+..+++.. ++--|++-|
T Consensus 300 ----rf~~~~~D~fvVPsW~~~~~~~gs~da--~LFsfsD~P 335 (351)
T COG3435 300 ----RFDWSAGDIFVVPSWAWHEHVNGSEDA--VLFSFSDRP 335 (351)
T ss_pred ----EeeccCCCEEEccCcceeecccCCcce--EEEecCCcH
Confidence 599999999999999999999875433 333344444
No 61
>PF13621 Cupin_8: Cupin-like domain; PDB: 3AL6_C 3AL5_C 2XUM_A 2Y0I_A 1MZE_A 3KCY_A 1MZF_A 1YCI_A 2ILM_A 1H2L_A ....
Probab=94.49 E-value=0.12 Score=42.04 Aligned_cols=68 Identities=15% Similarity=0.250 Sum_probs=43.8
Q ss_pred hccceecCcceEEEEecceEEEEEEeC---------C--------------------------CcEEEEEEeCCcEEEeC
Q psy840 80 FYTEHLHTDEEIRLVLDGSGYFDVRDK---------F--------------------------DDWIRIEVTKGDLIIIP 124 (178)
Q Consensus 80 f~~eH~H~~dEi~~il~G~g~f~v~~~---------~--------------------------d~~~ri~~~~GDli~vP 124 (178)
.-..|....+-+..++.|+=.|.+-.. . -..+.+.++|||+|.||
T Consensus 143 ~t~lH~D~~~n~~~~i~G~K~~~L~pP~~~~~l~~~~~~~~~~~~~~~d~~~~d~~~~p~~~~~~~~~~~l~pGD~LfiP 222 (251)
T PF13621_consen 143 FTPLHYDPSHNLLAQIRGRKRWILFPPDDSPNLYPRPDSHGGTVFSWVDPDNPDLERFPKFRKAPPYEVVLEPGDVLFIP 222 (251)
T ss_dssp EEEEEE-SSEEEEEEEESEEEEEEE-GGGGGGCTBETTTST-TCBBSS-TTS--TTT-CGGGG--EEEEEEETT-EEEE-
T ss_pred eeeeeECchhhhhhccCCCEEEEEECCccccccccceecccccceeeeeccChhhhhhhhhccCceeEEEECCCeEEEEC
Confidence 345666666778888888877777421 0 14688999999999999
Q ss_pred CCCeeeeecC-CCC-cEEEEEEeec
Q psy840 125 AGIYHRFTLD-INN-YVKAKRYFIG 147 (178)
Q Consensus 125 aG~~H~f~~~-~~~-~~~alrlF~~ 147 (178)
+|-.|..... +++ .|..=.-|..
T Consensus 223 ~gWwH~V~~~~~~~~sisvn~w~~~ 247 (251)
T PF13621_consen 223 PGWWHQVENLSDDDLSISVNYWFRT 247 (251)
T ss_dssp TT-EEEEEESTTSSCEEEEEEEEES
T ss_pred CCCeEEEEEcCCCCeEEEEEEEecc
Confidence 9999999877 333 4444444443
No 62
>PF14525 AraC_binding_2: AraC-binding-like domain
Probab=94.41 E-value=0.14 Score=38.90 Aligned_cols=50 Identities=20% Similarity=0.337 Sum_probs=40.2
Q ss_pred eEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCCCCcEEEEEE
Q psy840 90 EIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDINNYVKAKRY 144 (178)
Q Consensus 90 Ei~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~~~~~~alrl 144 (178)
-+.+.++|++.+..++. ++.+.|||+++++++-+.++...++.....+++
T Consensus 57 ~l~~~~~G~~~~~~~g~-----~~~~~pg~~~l~d~~~~~~~~~~~~~~~~~l~i 106 (172)
T PF14525_consen 57 LLVLPLSGSARIEQGGR-----EVELAPGDVVLLDPGQPYRLEFSAGCRQLSLRI 106 (172)
T ss_pred EEEEEccCCEEEEECCE-----EEEEcCCeEEEEcCCCCEEEEECCCccEEEEEE
Confidence 45677888888777653 399999999999999999999887766666655
No 63
>KOG3995|consensus
Probab=94.26 E-value=0.068 Score=45.53 Aligned_cols=49 Identities=20% Similarity=0.405 Sum_probs=43.2
Q ss_pred cceecCcceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeee
Q psy840 82 TEHLHTDEEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRF 131 (178)
Q Consensus 82 ~eH~H~~dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f 131 (178)
..|.-+.+|.+|-..|+....|-++ ++.-.+.++.||+..+|++++|..
T Consensus 47 dyHieegeE~FyQ~KGdMvLKVie~-g~~rDivI~qGe~flLParVpHSP 95 (279)
T KOG3995|consen 47 DYHIEEGEEVFYQLKGDMVLKVLEQ-GKHRDVVIRQGEIFLLPARVPHSP 95 (279)
T ss_pred ccccCCcchhheeecCceEEeeecc-CcceeeEEecCcEEEeccCCCCCh
Confidence 5788899999999999999999774 445579999999999999999975
No 64
>PF11699 CENP-C_C: Mif2/CENP-C like; PDB: 2VPV_B.
Probab=93.71 E-value=0.28 Score=35.63 Aligned_cols=68 Identities=18% Similarity=0.192 Sum_probs=44.7
Q ss_pred CCeeeEEEEcCCCCCCHHHHhhhhccceecCcceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCCC
Q psy840 57 YNYEDEITCSKDCLPNYEEKLKSFYTEHLHTDEEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDIN 136 (178)
Q Consensus 57 y~~~Dvv~l~~~~~p~~e~~~~~f~~eH~H~~dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~~ 136 (178)
+-..-++.|.|+. .| -..++..+--++||++|.....+++. ...+.+||...||+|-.=.+....+
T Consensus 11 ~fa~G~l~Lpp~~-----~K----~~k~s~~~~~vF~V~~G~v~Vti~~~-----~f~v~~G~~F~VP~gN~Y~i~N~~~ 76 (85)
T PF11699_consen 11 FFASGMLELPPGG-----EK----PPKNSRDNTMVFYVIKGKVEVTIHET-----SFVVTKGGSFQVPRGNYYSIKNIGN 76 (85)
T ss_dssp S-EEEEEEE-TCC-----CE----EEEE--SEEEEEEEEESEEEEEETTE-----EEEEETT-EEEE-TT-EEEEEE-SS
T ss_pred CceeEEEEeCCCC-----cc----CCcccCCcEEEEEEEeCEEEEEEcCc-----EEEEeCCCEEEECCCCEEEEEECCC
Confidence 4455677777653 11 14566667889999999999999763 3899999999999998888876655
Q ss_pred Cc
Q psy840 137 NY 138 (178)
Q Consensus 137 ~~ 138 (178)
..
T Consensus 77 ~~ 78 (85)
T PF11699_consen 77 EE 78 (85)
T ss_dssp S-
T ss_pred Cc
Confidence 44
No 65
>COG3257 GlxB Uncharacterized protein, possibly involved in glyoxylate utilization [General function prediction only]
Probab=93.52 E-value=0.23 Score=42.52 Aligned_cols=43 Identities=16% Similarity=0.206 Sum_probs=36.9
Q ss_pred ceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCCC
Q psy840 89 EEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDIN 136 (178)
Q Consensus 89 dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~~ 136 (178)
+-+.||++|+....+.++. ..+++|++..+|||.-|.++....
T Consensus 84 e~~lfVv~Ge~tv~~~G~t-----h~l~eggyaylPpgs~~~~~N~~~ 126 (264)
T COG3257 84 ETFLFVVSGEITVKAEGKT-----HALREGGYAYLPPGSGWTLRNAQK 126 (264)
T ss_pred eEEEEEEeeeEEEEEcCeE-----EEeccCCeEEeCCCCcceEeeccC
Confidence 4578999999998887654 899999999999999999986543
No 66
>PF00908 dTDP_sugar_isom: dTDP-4-dehydrorhamnose 3,5-epimerase; InterPro: IPR000888 Deoxythymidine diphosphate (dTDP)-4-keto-6-deoxy-d-hexulose 3, 5-epimerase (RmlC, 5.1.3.13 from EC) is involved in the biosynthesis of dTDP-l-rhamnose, which is an essential component of the bacterial cell wall, converting dTDP-4-keto-6-deoxy-D-glucose to dTDP-4-keto-L-rhamnose. The crystal structure of RmlC from Methanobacterium thermoautotrophicum was determined in the presence and absence of a substrate analogue. RmlC is a homodimer comprising a central jelly roll motif, which extends in two directions into longer beta-sheets. Binding of dTDP is stabilised by ionic interactions to the phosphate group and by a combination of ionic and hydrophobic interactions with the base. The active site, which is located in the centre of the jelly roll, is formed by residues that are conserved in all known RmlC sequence homologues. The active site is lined with a number of charged residues and a number of residues with hydrogen-bonding potentials, which together comprise a potential network for substrate binding and catalysis. The active site is also lined with aromatic residues which provide favorable environments for the base moiety of dTDP and potentially for the sugar moiety of the substrate [].; GO: 0008830 dTDP-4-dehydrorhamnose 3,5-epimerase activity, 0009103 lipopolysaccharide biosynthetic process; PDB: 1EPZ_A 1EP0_A 1NXM_A 1NZC_D 2IXL_C 1NYW_B 2IXC_D 1PM7_B 1UPI_A 3RYK_B ....
Probab=93.51 E-value=0.73 Score=37.55 Aligned_cols=56 Identities=23% Similarity=0.472 Sum_probs=41.3
Q ss_pred cceecC----cceEEEEecceEEEEEEe--C----CCcEEEEEEeCCc--EEEeCCCCeeeeecCCCC
Q psy840 82 TEHLHT----DEEIRLVLDGSGYFDVRD--K----FDDWIRIEVTKGD--LIIIPAGIYHRFTLDINN 137 (178)
Q Consensus 82 ~eH~H~----~dEi~~il~G~g~f~v~~--~----~d~~~ri~~~~GD--li~vPaG~~H~f~~~~~~ 137 (178)
..|.|. ...+..++.|+..--+-| + =++|..+.+.+++ .|.||+|..|-|.+-+++
T Consensus 57 GlH~q~~~~~q~Klv~~~~G~i~dV~vDlR~~SpTfg~~~~~~Ls~~n~~~l~IP~G~aHGf~~l~d~ 124 (176)
T PF00908_consen 57 GLHYQSPPYAQAKLVRCLRGEIFDVAVDLRKGSPTFGKWVSVELSAENPRQLYIPPGVAHGFQTLEDD 124 (176)
T ss_dssp EEEEESTTT-EEEEEEEEESEEEEEEEE-BTTSTTTT-EEEEEEETTT--EEEE-TTEEEEEEESSSE
T ss_pred EEEEecCCCCCCcEEEEecCeEEEEEEECCCCCCCCCEEEEEEeCccccCEEEeCCcceeeEEeccCc
Confidence 455554 468899999998664443 2 2789999998887 699999999999877664
No 67
>COG3822 ABC-type sugar transport system, auxiliary component [General function prediction only]
Probab=93.21 E-value=0.13 Score=43.09 Aligned_cols=59 Identities=20% Similarity=0.312 Sum_probs=44.0
Q ss_pred hhccceecCcceEEEEecceEEEEEEeC------------------CC------cEEEEEEeCCcEEEeCCCCeeeeecC
Q psy840 79 SFYTEHLHTDEEIRLVLDGSGYFDVRDK------------------FD------DWIRIEVTKGDLIIIPAGIYHRFTLD 134 (178)
Q Consensus 79 ~f~~eH~H~~dEi~~il~G~g~f~v~~~------------------~d------~~~ri~~~~GDli~vPaG~~H~f~~~ 134 (178)
+.-++|.|.-.-.-.|-.|.|.+-++.- ++ .|-.+.++||.-|.+|+|++|+|-+.
T Consensus 97 QvtPmHrH~~k~eDiinrgggtlv~el~~~d~~~~~~~ks~vtv~~dg~r~~~~ag~~lkL~PGesitL~Pg~~HsFwae 176 (225)
T COG3822 97 QVTPMHRHWRKPEDIINRGGGTLVVELWNVDLVEGQDEKSDVTVPVDGCRQTHTAGSQLKLSPGESITLPPGLYHSFWAE 176 (225)
T ss_pred CcCcccccccchhhhhhcCCceEEEEEeccccccCcCCCCCeEecCCCcEEEeccceeEEECCCCcEecCCCceeeeeec
Confidence 3578999996555567788888877620 11 12357899999999999999999987
Q ss_pred CCC
Q psy840 135 INN 137 (178)
Q Consensus 135 ~~~ 137 (178)
+..
T Consensus 177 ~g~ 179 (225)
T COG3822 177 EGG 179 (225)
T ss_pred CCc
Confidence 764
No 68
>PF08007 Cupin_4: Cupin superfamily protein; InterPro: IPR022777 This signature represents primarily the cupin fold found in JmjC transcription factors. The fold is also found in lysine-specific demethylase NO66.; PDB: 2XDV_A 1VRB_B 4DIQ_B.
Probab=93.12 E-value=0.51 Score=41.44 Aligned_cols=56 Identities=18% Similarity=0.358 Sum_probs=39.0
Q ss_pred hccceecCcceEEEEecceEEEEEEeC------------------CCcEEEEEEeCCcEEEeCCCCeeeeecCC
Q psy840 80 FYTEHLHTDEEIRLVLDGSGYFDVRDK------------------FDDWIRIEVTKGDLIIIPAGIYHRFTLDI 135 (178)
Q Consensus 80 f~~eH~H~~dEi~~il~G~g~f~v~~~------------------~d~~~ri~~~~GDli~vPaG~~H~f~~~~ 135 (178)
=|..|.=..|=+..-++|+=.+.|... ......+.++|||+|.||+|+.|...+.+
T Consensus 127 g~~~H~D~~dvfvlQ~~G~K~W~l~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~pGD~LYlPrG~~H~~~~~~ 200 (319)
T PF08007_consen 127 GFGPHYDDHDVFVLQLEGRKRWRLYPPPDEPAPLYSDQPFKQLEEFEPVEEVVLEPGDVLYLPRGWWHQAVTTD 200 (319)
T ss_dssp ESECEE-SSEEEEEEEES-EEEEEE-SCCCTTTSSCE--TTTCG--STSEEEEE-TT-EEEE-TT-EEEEEESS
T ss_pred CccCEECCcccEEEECCceeEEEECCCCcccccccCCCCccccccCceeEEEEECCCCEEEECCCccCCCCCCC
Confidence 367777777878888999998888751 02345799999999999999999998877
No 69
>PF02678 Pirin: Pirin; InterPro: IPR003829 This entry represents N-terminal domain of Pirin proteins from both eukaryotes and prokaryotes. The function of Pirin is unknown but the gene coding for this protein is known to be expressed in all tissues in the human body although it is expressed most strongly in the liver and heart. Pirin is known to be a nuclear protein, exclusively localised within the nucleoplasma and predominantly concentrated within dot-like subnuclear structures []. Pirin is composed of two structurally similar domains arranged face to face. The N-terminal domain additionally features four beta-strands, and the C-terminal domain also includes four additional -strands and a short alpha-helix. Although the two domains are similar, the C-terminal domain of Pirin differs from the N-terminal domain as it does not contain a metal binding site and its sequence does not contain the conserved metal-coordinating residues []. Pirin is confirmed to be a member of the cupin superfamily on the basis of primary sequence and structural similarity. The presence of a metal binding site in the N-terminal beta-barrel of Pirin, may be significant in its role in regulating NFI DNA replication and NF-kappaB transcription factor activity []. Pirin structure has been found to closely resemble members of the cupin superfamily. Pirin contains the two characteristic sequences of the cupin superfamily, namely PG-(X)5-HXH-(X)4-E-(X)6-G and G-(X)5-PXG-(X)2-H-(X)3-N separated by a variable stretch of 15-50 amino acids. These motifs are best conserved in the N-terminal where the conserved histidine and glutamic acid residues correspond to the metal-coordinating residues. The C-terminal domain motifs lack the metal binding residues normally associated with the cupin fold []. Pirin was identified to be a metal-binding protein [], and was found that the metal-binding residues of Pirins are highly conserved across mammals, plants, fungi, and prokaryotic organisms. Pirin acts as a cofactor for the transcription factor NFI, the regulatory mechanism of which is generally believed to require the assistance of a metal ion []. Structural data supports the hypothesis that the bound iron of Pirin may participate in this transcriptional regulation by enhancing and stabilising the formation of the p50,Bcl3,DNA complex []. Metals have been implicated directly or indirectly in the NF-kappaB family of transcription factors that control expression of a number of early response genes associated with inflammatory responses, cell growth, cell cycle progression, and neoplastic transformation []. However, most metal-dependent transcription factors are DNA-binding proteins that bind to specific sequences when the metal binds to the protein. Pirin, on the other hand, appears to function differently and bind to the transcription factor DNA complex [].; PDB: 2VEC_A 1J1L_A 3ACL_A 2P17_A 1TQ5_A.
Probab=93.05 E-value=0.75 Score=34.64 Aligned_cols=63 Identities=22% Similarity=0.305 Sum_probs=43.4
Q ss_pred hhccceecCc-ceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCC--CCeeeeecCCC-CcEEEEEEe
Q psy840 79 SFYTEHLHTD-EEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPA--GIYHRFTLDIN-NYVKAKRYF 145 (178)
Q Consensus 79 ~f~~eH~H~~-dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPa--G~~H~f~~~~~-~~~~alrlF 145 (178)
.=|..|-|.+ |-|.|+++|+.... |..+. +..+++||+-.+=| |+.|.=...++ ..+..++|+
T Consensus 40 ~gf~~HPH~g~eivTyv~~G~~~H~--Ds~G~--~~~l~~G~vq~m~AG~Gi~H~E~~~~~~~~~~~lQlW 106 (107)
T PF02678_consen 40 AGFPMHPHRGFEIVTYVLEGELRHR--DSLGN--RGVLRAGDVQWMTAGSGIVHSERNASDGGPLHGLQLW 106 (107)
T ss_dssp TEEEEEEECSEEEEEEEEESEEEEE--ETTSE--EEEEETTEEEEEE-TTTEEEEEEE-TSSS-EEEEEEE
T ss_pred CCCCCcCCCCceEEEEEecCEEEEE--CCCCC--eeEeCCCeEEEEeCCCCceEEEecCCCCCeEEEEEEc
Confidence 3379999999 77899999987653 32333 37899999966555 58887655554 667777765
No 70
>PF14499 DUF4437: Domain of unknown function (DUF4437); PDB: 2QDR_A.
Probab=92.78 E-value=0.16 Score=43.80 Aligned_cols=52 Identities=17% Similarity=0.120 Sum_probs=31.7
Q ss_pred ccceecCcceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCCC
Q psy840 81 YTEHLHTDEEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDIN 136 (178)
Q Consensus 81 ~~eH~H~~dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~~ 136 (178)
-+.|+|..++-.||++|....+=.. --...+.+|.+...|+|..|.--..++
T Consensus 49 ~pph~H~~~~~~~Vi~G~~~~~~~~----a~~~~l~~Gsy~~~PaG~~h~~~~~~~ 100 (251)
T PF14499_consen 49 SPPHIHNADYRGTVISGELHNGDPK----AAAMWLPAGSYWFQPAGEPHITAAEGE 100 (251)
T ss_dssp E--BEESS-EEEEEEESEEEETTEE---------E-TTEEEEE-TT-EEEETTS-E
T ss_pred CCCcceeeeEEEEEEEeEEEcCCCc----ccceecCCCceEeccCCCceeeeccCc
Confidence 4799999999999999986653111 112569999999999998887655554
No 71
>COG3257 GlxB Uncharacterized protein, possibly involved in glyoxylate utilization [General function prediction only]
Probab=92.47 E-value=0.27 Score=42.06 Aligned_cols=52 Identities=23% Similarity=0.337 Sum_probs=40.2
Q ss_pred ecCcceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCCCCcEEE
Q psy840 85 LHTDEEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDINNYVKA 141 (178)
Q Consensus 85 ~H~~dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~~~~~~a 141 (178)
+|--|--.|||+|.+.|.++ ..| +.+++||+|-+-|--+-+...++.+.++-
T Consensus 200 tHvmEHGlyvLeGk~vYrLn---~dw--v~V~aGD~mwm~A~cpQacyagG~g~frY 251 (264)
T COG3257 200 THVMEHGLYVLEGKGVYRLN---NNW--VPVEAGDYIWMGAYCPQACYAGGRGAFRY 251 (264)
T ss_pred hhhhhcceEEEecceEEeec---Cce--EEeecccEEEeeccChhhhccCCCCceEE
Confidence 34445568999999999995 457 89999999999888888777766555543
No 72
>PRK12335 tellurite resistance protein TehB; Provisional
Probab=92.36 E-value=0.88 Score=38.98 Aligned_cols=76 Identities=12% Similarity=0.059 Sum_probs=54.9
Q ss_pred HHHhhhhccceecCc--ceEEEEecceEEEEEEeCCCcE-EEEEEeC-CcEEEeCCCCeeeeecCCCCcEEEEEEeecCC
Q psy840 74 EEKLKSFYTEHLHTD--EEIRLVLDGSGYFDVRDKFDDW-IRIEVTK-GDLIIIPAGIYHRFTLDINNYVKAKRYFIGEP 149 (178)
Q Consensus 74 e~~~~~f~~eH~H~~--dEi~~il~G~g~f~v~~~~d~~-~ri~~~~-GDli~vPaG~~H~f~~~~~~~~~alrlF~~~~ 149 (178)
+++-+.|...|.|.. .|..-|++|+..|.+-++++.. .+..+.+ +..-+||++..|+....+++--.-+.+|..+.
T Consensus 17 ~~~p~~~~~~H~t~~g~~~~~~vl~G~l~~~~~de~g~~~~~~~l~~~~~~~~i~p~~wh~v~~~s~d~~~~l~fy~~~~ 96 (287)
T PRK12335 17 DTLPEMFQEKHNTKEGTWAKLTVLKGELKFYELTEDGEELSEHIFDAENQPPFIEPQAWHRIEAASDDLECQLSFYCKPE 96 (287)
T ss_pred hhchHHHHhccCCCCCcceEEEEEeeeEEEEEECCCCCeeeEEEEecCCCCceeCCcceEEEEEcCCCcEEEEEEEEcch
Confidence 455588999999953 8999999999999887766542 2233444 45557999999999988665555566666543
No 73
>COG3508 HmgA Homogentisate 1,2-dioxygenase [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=92.03 E-value=0.55 Score=42.75 Aligned_cols=45 Identities=22% Similarity=0.323 Sum_probs=39.0
Q ss_pred cceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCCCC
Q psy840 88 DEEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDINN 137 (178)
Q Consensus 88 ~dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~~~ 137 (178)
.|++.++-.|+..|..+- | .+.+++||+.+||.|+.-+.++-++.
T Consensus 146 ge~Livpq~G~l~l~te~--G---~l~v~pgeiavIPRG~~frve~~~~~ 190 (427)
T COG3508 146 GELLIVPQQGELRLKTEL--G---VLEVEPGEIAVIPRGTTFRVELKDGE 190 (427)
T ss_pred CCEEEEeecceEEEEEee--c---eEEecCCcEEEeeCCceEEEEecCCc
Confidence 499999999999998876 2 39999999999999999999886554
No 74
>KOG2757|consensus
Probab=91.83 E-value=0.37 Score=43.93 Aligned_cols=57 Identities=28% Similarity=0.395 Sum_probs=44.9
Q ss_pred CcceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCCCCcEEEEEEeecC
Q psy840 87 TDEEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDINNYVKAKRYFIGE 148 (178)
Q Consensus 87 ~~dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~~~~~~alrlF~~~ 148 (178)
+..-|..+++|+|+..... ++ .+.+++||++.|||...=.+...+++ ++..|-|...
T Consensus 352 ~~~SIllv~~G~g~l~~~t-~~---~~~v~rG~V~fI~a~~~i~~~~~sd~-~~~yrAf~~~ 408 (411)
T KOG2757|consen 352 DGPSILLVLKGSGILKTDT-DS---KILVNRGDVLFIPANHPIHLSSSSDP-FLGYRAFSNS 408 (411)
T ss_pred CCceEEEEEecceEEecCC-CC---ceeeccCcEEEEcCCCCceeeccCcc-eeeeeccccc
Confidence 3457899999999987762 22 28999999999999999988888765 6677776553
No 75
>PF04962 KduI: KduI/IolB family; InterPro: IPR021120 The KduI/IolB family of enzymes includes 5-keto 4-deoxyuronate isomerase (KduI) and 5-deoxy-glucuronate isomerase (IolB). KduI is involved in pectin degradation by free-living soil bacteria that use pectin as a carbon source, breaking it down to 2-keto-3-deoxygluconate, which can ultimately be converted to pyruvate. KduI catalyses the fourth step in pectin degradation, namely the interconversion of 5-keto-4-deoxyuronate and 2,5-diketo-3-dexoygluconate []. KduI has a TIM-barrel fold []. IolB is one of several bacterial proteins encoded by the inositol operon (iolABCDEFGHIJ) in Bacillus subtilis that are involved in myo-inositol catabolism. The enzyme is responsible for isomerization of 5-deoxy-D-glucuronic acid by IolB to produce 2-deoxy-5-keto-D-gluconic acid []. IolBs possess a cupin-like structure.; GO: 0016861 intramolecular oxidoreductase activity, interconverting aldoses and ketoses, 0008152 metabolic process; PDB: 1YWK_B 2QJV_B 1X8M_A 1XRU_A.
Probab=90.76 E-value=0.44 Score=41.12 Aligned_cols=48 Identities=38% Similarity=0.681 Sum_probs=30.5
Q ss_pred hccceecCc---------ceEEEEe-c---ceEEEEEEeC---CCcEEEEEEeCCcEEEeCCCCeee
Q psy840 80 FYTEHLHTD---------EEIRLVL-D---GSGYFDVRDK---FDDWIRIEVTKGDLIIIPAGIYHR 130 (178)
Q Consensus 80 f~~eH~H~~---------dEi~~il-~---G~g~f~v~~~---~d~~~ri~~~~GDli~vPaG~~H~ 130 (178)
-++.|.|+. +|++|+. . |-|...+-.. .|+ -+.++.||.++||.| +|-
T Consensus 165 SyPPH~Hd~~~~~~e~~leEiYyf~~~p~~Gfg~q~~y~~~~~~d~--~~~V~~~d~V~iP~g-yHp 228 (261)
T PF04962_consen 165 SYPPHKHDRRMEPDETELEEIYYFRFNPPQGFGFQRVYTDDPQLDE--HYVVRNGDAVLIPSG-YHP 228 (261)
T ss_dssp S-SEEE-CCEEEESEECTEEEEEEESSTTS-EEEEEEE-TTSSSEE--EEEEETTEEEEESTT-B-S
T ss_pred CcCCccCCCcCCCccccceeEEEEEccCcccEEEEEEECCCCCCcE--EEEEECCCEEEeCCC-CCC
Confidence 389999999 8999883 2 4444222221 122 378999999999999 773
No 76
>TIGR01221 rmlC dTDP-4-dehydrorhamnose 3,5-epimerase. This enzyme participates in the biosynthesis of dTDP-L-rhamnose, often as a precursor to LPS O-antigen
Probab=90.53 E-value=2.7 Score=34.29 Aligned_cols=58 Identities=21% Similarity=0.395 Sum_probs=43.6
Q ss_pred hccceec---CcceEEEEecceEEEEEEe---C---CCcEEEEEEeC--CcEEEeCCCCeeeeecCCCC
Q psy840 80 FYTEHLH---TDEEIRLVLDGSGYFDVRD---K---FDDWIRIEVTK--GDLIIIPAGIYHRFTLDINN 137 (178)
Q Consensus 80 f~~eH~H---~~dEi~~il~G~g~f~v~~---~---~d~~~ri~~~~--GDli~vPaG~~H~f~~~~~~ 137 (178)
.-..|.| ....+..++.|+.+--+-| . =++|..+.+.+ +-.|.||+|..|-|.+-+++
T Consensus 56 lRGlH~q~~~~q~Klv~c~~G~i~dV~VDlR~~SpTfG~~~~~~L~~~~~~~l~IP~G~aHGF~~L~d~ 124 (176)
T TIGR01221 56 LRGLHYQRPHPQGKLVRVLRGEVFDVAVDLRRNSPTFGKWVGVLLSAENKRQLWIPEGFAHGFVVLSDE 124 (176)
T ss_pred EEEEEECCCCCCceEEEEccCCEEEEEEECCCCcCCCCeEEEEEECCCCCCEEEeCCcceeEEEEcCCC
Confidence 3456665 5799999999998764443 1 15788888877 55999999999999876554
No 77
>PF07385 DUF1498: Protein of unknown function (DUF1498); InterPro: IPR010864 This family consists of several hypothetical bacterial proteins of around 225 residues in length. The function of this family is unknown.; PDB: 3MPB_B 3KMH_A.
Probab=90.11 E-value=0.59 Score=39.81 Aligned_cols=59 Identities=20% Similarity=0.360 Sum_probs=33.1
Q ss_pred hhccceecCcceEEEEecceEEEEEE--eC----------------CCcE------EEEEEeCCcEEEeCCCCeeeeecC
Q psy840 79 SFYTEHLHTDEEIRLVLDGSGYFDVR--DK----------------FDDW------IRIEVTKGDLIIIPAGIYHRFTLD 134 (178)
Q Consensus 79 ~f~~eH~H~~dEi~~il~G~g~f~v~--~~----------------~d~~------~ri~~~~GDli~vPaG~~H~f~~~ 134 (178)
+..+.|.|..-.=-.|-.|.|.+.|+ .. ||.+ -.+.+.||.-|.||+|++|+|-..
T Consensus 98 Q~tP~H~H~~K~EDIINRGGG~L~i~l~~s~~~~~~~~~~~v~V~~DG~~~t~~aG~~l~L~PGESiTL~Pg~yH~Fw~e 177 (225)
T PF07385_consen 98 QVTPMHFHWKKMEDIINRGGGNLVIELYNSDPDGELDADTDVTVPVDGIRRTVPAGTQLRLNPGESITLPPGIYHWFWGE 177 (225)
T ss_dssp -EEEEEEESS--EEEEEEEES-EEEEEEEB--TTSSB-SS-EEEEETTEEEEE-TT-EEEE-TT-EEEE-TTEEEEEEE-
T ss_pred CcCCcccCcchhhheeecCCceEEEEEEeccCCCccccCCCeEEecCCcEEEecCCceEEeCCCCeEeeCCCCeeeEEec
Confidence 45789999873333455554443332 10 2211 146889999999999999999986
Q ss_pred CCC
Q psy840 135 INN 137 (178)
Q Consensus 135 ~~~ 137 (178)
+..
T Consensus 178 ~g~ 180 (225)
T PF07385_consen 178 GGD 180 (225)
T ss_dssp TTS
T ss_pred CCC
Confidence 654
No 78
>PF09313 DUF1971: Domain of unknown function (DUF1971); InterPro: IPR015392 This uncharacterised domain is predominantly found in bacterial Tellurite resistance proteins. ; PDB: 3BB6_C 3M70_A 3DL3_I.
Probab=89.33 E-value=3.6 Score=29.64 Aligned_cols=59 Identities=17% Similarity=0.108 Sum_probs=42.0
Q ss_pred hhhccceecCc--ceEEEEecceEEEEEEeCCCc--EEEEEEeCCcEEEeCCCCeeeeecCCC
Q psy840 78 KSFYTEHLHTD--EEIRLVLDGSGYFDVRDKFDD--WIRIEVTKGDLIIIPAGIYHRFTLDIN 136 (178)
Q Consensus 78 ~~f~~eH~H~~--dEi~~il~G~g~f~v~~~~d~--~~ri~~~~GDli~vPaG~~H~f~~~~~ 136 (178)
+.|...|.=.. -...-|++|+..|..-++++. -..+.+.+|+.-+||+...|+....++
T Consensus 13 ~~l~~~H~TK~GtWg~l~Vl~G~L~f~~~~~~~~~~~~~~~~~~~~~~~i~Pq~wH~V~p~s~ 75 (82)
T PF09313_consen 13 AALLERHNTKAGTWGKLRVLEGELKFYGLDEEGEEPEEEVFIPAGQPPVIEPQQWHRVEPLSD 75 (82)
T ss_dssp GGGGSSBCCSTTEEEEEEEEESEEEEEEESSTT-SESEEEEEETTEEEEE-TT-EEEEEESST
T ss_pred HHHHhhcCCCCCeEEEEEEEeeEEEEEEECCCCCceeEEEEeCCCCCceeCCCceEEEEECCC
Confidence 55666664433 456779999999988875432 135889999999999999999986554
No 79
>COG1482 ManA Phosphomannose isomerase [Carbohydrate transport and metabolism]
Probab=86.03 E-value=0.53 Score=41.88 Aligned_cols=23 Identities=26% Similarity=0.628 Sum_probs=20.3
Q ss_pred EEEEeCCcEEEeCCCCeeeeecC
Q psy840 112 RIEVTKGDLIIIPAGIYHRFTLD 134 (178)
Q Consensus 112 ri~~~~GDli~vPaG~~H~f~~~ 134 (178)
+|.++|||.+.|||||.|-.--+
T Consensus 159 ~v~lkpGe~~fl~Agt~HA~~~G 181 (312)
T COG1482 159 RVKLKPGEAFFLPAGTPHAYLKG 181 (312)
T ss_pred EEecCCCCEEEecCCCceeeccc
Confidence 69999999999999999976544
No 80
>COG1898 RfbC dTDP-4-dehydrorhamnose 3,5-epimerase and related enzymes [Cell envelope biogenesis, outer membrane]
Probab=85.86 E-value=6.6 Score=32.13 Aligned_cols=58 Identities=19% Similarity=0.359 Sum_probs=43.6
Q ss_pred ccceecCc--ceEEEEecceEEEEEEe-CCC-----cEEEEEEeCC--cEEEeCCCCeeeeecCCCCc
Q psy840 81 YTEHLHTD--EEIRLVLDGSGYFDVRD-KFD-----DWIRIEVTKG--DLIIIPAGIYHRFTLDINNY 138 (178)
Q Consensus 81 ~~eH~H~~--dEi~~il~G~g~f~v~~-~~d-----~~~ri~~~~G--Dli~vPaG~~H~f~~~~~~~ 138 (178)
=..|.|.. .+...++.|+...-+.+ ..| +|.-+.+.+- -+|.||+|..|-|.+.++..
T Consensus 58 RGlHyq~~~q~klv~~v~G~v~dv~vDlR~~SpTyg~~~~~~ls~~N~~~l~IP~G~AHGf~~L~d~~ 125 (173)
T COG1898 58 RGLHYQHKPQGKLVRVVSGKVFDVAVDLRKDSPTYGKWVGVVLSAENKRQLYIPPGFAHGFQVLSDDA 125 (173)
T ss_pred EEEEcccCCCCeEEEEecCeEEEEEEEccCCCCCcceEEEEEecCCCceEEEeCCcccceeEEccCce
Confidence 35777754 68899999999764443 223 4877777765 78999999999998877654
No 81
>TIGR00218 manA mannose-6-phosphate isomerase, class I. The names phosphomannose isomerase and mannose-6-phosphate isomerase are synonomous. This family contains two rather deeply branched groups. One group contains an experimentally determined phosphomannose isomerase of Streptococcus mutans as well as three uncharacterized paralogous proteins of Bacillus subtilis, all at more than 50 % identity to each other, plus a more distant homolog from Archaeoglobus fulgidus. The other group contains members from E. coli, budding yeast, Borrelia burgdorferi, etc.
Probab=85.81 E-value=0.41 Score=41.67 Aligned_cols=20 Identities=35% Similarity=0.833 Sum_probs=18.4
Q ss_pred EEEEeCCcEEEeCCCCeeee
Q psy840 112 RIEVTKGDLIIIPAGIYHRF 131 (178)
Q Consensus 112 ri~~~~GDli~vPaG~~H~f 131 (178)
.+.++|||.|.||||+.|-.
T Consensus 152 ~v~v~~Gd~i~ipaGt~HA~ 171 (302)
T TIGR00218 152 RIKLKPGDFFYVPSGTPHAY 171 (302)
T ss_pred ccccCCCCEEEeCCCCcccc
Confidence 48999999999999999983
No 82
>PF13759 2OG-FeII_Oxy_5: Putative 2OG-Fe(II) oxygenase; PDB: 3BVC_B 2RG4_A.
Probab=85.41 E-value=1.1 Score=32.40 Aligned_cols=58 Identities=19% Similarity=0.338 Sum_probs=29.1
Q ss_pred hhhccceecCcc---eEEEEe----cceEEEEEEe----------------CCCcEEEEEEeCCcEEEeCCCCeeeeecC
Q psy840 78 KSFYTEHLHTDE---EIRLVL----DGSGYFDVRD----------------KFDDWIRIEVTKGDLIIIPAGIYHRFTLD 134 (178)
Q Consensus 78 ~~f~~eH~H~~d---Ei~~il----~G~g~f~v~~----------------~~d~~~ri~~~~GDli~vPaG~~H~f~~~ 134 (178)
+.+...|.|+.. =|+||- .|...|.-.. ....++.+..++||||+-|+-+.|.....
T Consensus 10 g~~~~~H~H~~s~~SgVyYv~~p~~~~~l~f~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~G~lvlFPs~l~H~v~p~ 89 (101)
T PF13759_consen 10 GGYNEPHNHPNSWLSGVYYVQVPEGSGPLRFHDPRGSFSFGAPFDNYDQNDLNSPYYIVEPEEGDLVLFPSWLWHGVPPN 89 (101)
T ss_dssp T--EEEE--TT-SEEEEEECE--TTS-SEEEE-TTCCCGTTS----TTTTCCC-SEEEE---TTEEEEEETTSEEEE---
T ss_pred CCccCceECCCcCEEEEEEEECCCCCCceeeeCCCccceecccccccccCcccCceEEeCCCCCEEEEeCCCCEEeccCc
Confidence 356789999884 345553 2333333110 11246789999999999999999997654
Q ss_pred C
Q psy840 135 I 135 (178)
Q Consensus 135 ~ 135 (178)
.
T Consensus 90 ~ 90 (101)
T PF13759_consen 90 N 90 (101)
T ss_dssp -
T ss_pred C
Confidence 3
No 83
>COG1741 Pirin-related protein [General function prediction only]
Probab=84.99 E-value=3.6 Score=35.99 Aligned_cols=67 Identities=21% Similarity=0.304 Sum_probs=48.5
Q ss_pred hhccceecCc-ceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCC--Ceeeeec--CCCCcEEEEEEeecCC
Q psy840 79 SFYTEHLHTD-EEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAG--IYHRFTL--DINNYVKAKRYFIGEP 149 (178)
Q Consensus 79 ~f~~eH~H~~-dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG--~~H~f~~--~~~~~~~alrlF~~~~ 149 (178)
+.|..|.|.+ +=|.|+++|+....=. .|. .-.+.|||+-..=|| |.|.=.. .++..+..+.++...|
T Consensus 55 ~~f~pHPHrg~etvTyvl~G~i~HrDS--~Gn--~~~i~pGdvqwMTAG~GI~HSE~~~~~~~~~l~~~QlWv~lP 126 (276)
T COG1741 55 RGFPPHPHRGLETVTYVLDGEIEHRDS--LGN--KGVIRPGDVQWMTAGSGIVHSEMNPPSTGKPLHGLQLWVNLP 126 (276)
T ss_pred CcCCCCCCCCcEEEEEEEccEEEEeec--CCc--eeeecccceeEEcCCCceeecccCCccCCCccceeeeecCCc
Confidence 4799999999 6779999999776433 233 378999999766654 7887443 3445688888876653
No 84
>PF14499 DUF4437: Domain of unknown function (DUF4437); PDB: 2QDR_A.
Probab=83.68 E-value=0.96 Score=39.07 Aligned_cols=60 Identities=18% Similarity=0.156 Sum_probs=34.6
Q ss_pred ccceecCcceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCCCCcEEEEE
Q psy840 81 YTEHLHTDEEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDINNYVKAKR 143 (178)
Q Consensus 81 ~~eH~H~~dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~~~~~~alr 143 (178)
...|+|+..|=-|+++|+..+......+. -.+.+|.++..|+++.|....+++..+.-||
T Consensus 184 g~i~~h~~~eraVvI~G~~~~~~~~~~~~---~~L~~GSYf~s~~~~~H~~~~~e~~~vlyIR 243 (251)
T PF14499_consen 184 GRIHTHASNERAVVISGELDYQSYGASNF---GTLDPGSYFGSPGHITHGIFITEDECVLYIR 243 (251)
T ss_dssp -SEEE--S-EEEEEEEEEEEETTEEEETT---EEEEE-TT-EE--E------EESS-EEEEEE
T ss_pred CceeccCCceEEEEEEeEEEEeecccCCC---ccccCCcccccCCcccccccccCCCEEEEEE
Confidence 46899999999999999999866443333 6899999999999999997666666665555
No 85
>PF06865 DUF1255: Protein of unknown function (DUF1255); InterPro: IPR009664 This family consists of several conserved hypothetical bacterial proteins of around 95 residues in length. The function of this family is unknown; PDB: 2OYZ_A 3HQX_A.
Probab=83.14 E-value=4.5 Score=30.05 Aligned_cols=45 Identities=18% Similarity=0.244 Sum_probs=34.2
Q ss_pred ceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCCC
Q psy840 89 EEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDIN 136 (178)
Q Consensus 89 dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~~ 136 (178)
.|+.-|++|++...+.+ .+.| ....+|+-..|||+..-.....+.
T Consensus 42 ~E~M~vvsG~l~V~lpg-~~ew--~~~~aGesF~VpanssF~v~v~~~ 86 (94)
T PF06865_consen 42 PERMEVVSGELEVKLPG-EDEW--QTYSAGESFEVPANSSFDVKVKEP 86 (94)
T ss_dssp -EEEEEEESEEEEEETT--SS---EEEETT-EEEE-TTEEEEEEESS-
T ss_pred CEEEEEEEeEEEEEcCC-Cccc--EEeCCCCeEEECCCCeEEEEECcc
Confidence 68888999999999877 3468 789999999999999988877664
No 86
>TIGR00218 manA mannose-6-phosphate isomerase, class I. The names phosphomannose isomerase and mannose-6-phosphate isomerase are synonomous. This family contains two rather deeply branched groups. One group contains an experimentally determined phosphomannose isomerase of Streptococcus mutans as well as three uncharacterized paralogous proteins of Bacillus subtilis, all at more than 50 % identity to each other, plus a more distant homolog from Archaeoglobus fulgidus. The other group contains members from E. coli, budding yeast, Borrelia burgdorferi, etc.
Probab=82.41 E-value=2.4 Score=36.84 Aligned_cols=38 Identities=29% Similarity=0.393 Sum_probs=29.9
Q ss_pred cceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeee
Q psy840 88 DEEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHR 130 (178)
Q Consensus 88 ~dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~ 130 (178)
.-.+..+++|++.....+ + .+.+++|+.++|||+....
T Consensus 253 ~~~il~v~~G~~~i~~~~--~---~~~l~~G~~~~ipa~~~~~ 290 (302)
T TIGR00218 253 SALILSVLEGSGRIKSGG--K---TLPLKKGESFFIPAHLGPF 290 (302)
T ss_pred CcEEEEEEcceEEEEECC--E---EEEEecccEEEEccCCccE
Confidence 467889999999876532 2 3889999999999998553
No 87
>PRK10579 hypothetical protein; Provisional
Probab=81.61 E-value=6.5 Score=29.26 Aligned_cols=45 Identities=16% Similarity=0.219 Sum_probs=38.2
Q ss_pred ceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCCC
Q psy840 89 EEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDIN 136 (178)
Q Consensus 89 dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~~ 136 (178)
.|+.=|++|++...+.+ .+.| ....+|+-..||++..-.....+.
T Consensus 42 ~E~MeivsG~l~V~Lpg-~~ew--~~~~aG~sF~VpanssF~l~v~~~ 86 (94)
T PRK10579 42 PEEMTVISGALNVLLPG-ATDW--QVYEAGEVFNVPGHSEFHLQVAEP 86 (94)
T ss_pred cEEEEEEeeEEEEECCC-Cccc--EEeCCCCEEEECCCCeEEEEECcc
Confidence 68888999999998877 4567 789999999999999888776654
No 88
>PF06172 Cupin_5: Cupin superfamily (DUF985); InterPro: IPR009327 This is a family of uncharacterised proteins found in bacteria and eukaryotes.; PDB: 1ZNP_G 1XE8_B 1XE7_A 3M3I_F 3LOI_A 3LZZ_B 1YUD_D.
Probab=79.05 E-value=16 Score=28.66 Aligned_cols=73 Identities=16% Similarity=0.203 Sum_probs=46.3
Q ss_pred hccceecCcceEEEEecc-eEEEEEEeCCCcEEEEEEeC----Cc--EEEeCCCCeeeeecCCCCcEEEEEEeecCCcee
Q psy840 80 FYTEHLHTDEEIRLVLDG-SGYFDVRDKFDDWIRIEVTK----GD--LIIIPAGIYHRFTLDINNYVKAKRYFIGEPVWT 152 (178)
Q Consensus 80 f~~eH~H~~dEi~~il~G-~g~f~v~~~~d~~~ri~~~~----GD--li~vPaG~~H~f~~~~~~~~~alrlF~~~~gW~ 152 (178)
+-.+|.-..||+.+...| .+.+.+-++++.+.++.+.+ |. .++||+|+.....+.+.... ++-=.+=.||+.
T Consensus 53 ~S~~Hrv~sdEiw~~~~G~pl~l~~i~~dg~~~~~~LG~d~~~g~~~q~vVp~G~W~aa~l~~~~~y-~Lvsc~VaPGF~ 131 (139)
T PF06172_consen 53 FSAWHRVDSDEIWHFHAGDPLELHLIDPDGSYETVVLGPDLAAGERPQVVVPAGTWQAAELEPEGDY-SLVSCTVAPGFD 131 (139)
T ss_dssp EEEEEEESSEEEEEEEEES-EEEEEECTTSTEEEEEESSTTCTTEBSEEEE-TTSEEEEEECESSSE-EEEEEEESSC--
T ss_pred CCccEEcCCCEEEEEEcCCCEEEEEEcCCCCeEEEEECCCCCCCceEEEEECCCEEEEccccCCCCE-EEEEEEEcCCCc
Confidence 456777777999999999 55666666788887777754 33 48999999988865433322 222233345554
Q ss_pred e
Q psy840 153 P 153 (178)
Q Consensus 153 ~ 153 (178)
.
T Consensus 132 f 132 (139)
T PF06172_consen 132 F 132 (139)
T ss_dssp G
T ss_pred h
Confidence 3
No 89
>PRK15131 mannose-6-phosphate isomerase; Provisional
Probab=79.05 E-value=4.2 Score=37.06 Aligned_cols=38 Identities=13% Similarity=0.163 Sum_probs=30.3
Q ss_pred cceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeee
Q psy840 88 DEEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHR 130 (178)
Q Consensus 88 ~dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~ 130 (178)
.-.|.++++|++.+... ++ .+.+++|+.+.|||+....
T Consensus 339 ~~~Illv~~G~~~i~~~--~~---~~~l~~G~~~fipa~~~~~ 376 (389)
T PRK15131 339 SAAILFCVEGEAVLWKG--EQ---QLTLKPGESAFIAANESPV 376 (389)
T ss_pred CcEEEEEEcceEEEEeC--Ce---EEEECCCCEEEEeCCCccE
Confidence 35899999999997643 22 3789999999999987654
No 90
>PRK15131 mannose-6-phosphate isomerase; Provisional
Probab=77.71 E-value=1.8 Score=39.50 Aligned_cols=23 Identities=13% Similarity=0.294 Sum_probs=20.1
Q ss_pred EEEEeCCcEEEeCCCCeeeeecC
Q psy840 112 RIEVTKGDLIIIPAGIYHRFTLD 134 (178)
Q Consensus 112 ri~~~~GDli~vPaG~~H~f~~~ 134 (178)
.|.++|||.|.||||+.|-.--+
T Consensus 238 ~v~l~pGeaifipAg~~HAyl~G 260 (389)
T PRK15131 238 VVKLNPGEAMFLFAETPHAYLQG 260 (389)
T ss_pred EEEeCCCCEEEeCCCCCeEEcCC
Confidence 49999999999999999986443
No 91
>COG2850 Uncharacterized conserved protein [Function unknown]
Probab=71.80 E-value=2.2 Score=38.89 Aligned_cols=25 Identities=24% Similarity=0.371 Sum_probs=21.4
Q ss_pred EEEEeCCcEEEeCCCCeeeeecCCC
Q psy840 112 RIEVTKGDLIIIPAGIYHRFTLDIN 136 (178)
Q Consensus 112 ri~~~~GDli~vPaG~~H~f~~~~~ 136 (178)
...++|||++.||+|..|.-....+
T Consensus 180 d~vlepGDiLYiPp~~~H~gvae~d 204 (383)
T COG2850 180 DEVLEPGDILYIPPGFPHYGVAEDD 204 (383)
T ss_pred hhhcCCCceeecCCCCCcCCccccc
Confidence 4689999999999999999876643
No 92
>COG1482 ManA Phosphomannose isomerase [Carbohydrate transport and metabolism]
Probab=69.79 E-value=9.6 Score=34.01 Aligned_cols=42 Identities=26% Similarity=0.283 Sum_probs=33.3
Q ss_pred cceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecC
Q psy840 88 DEEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLD 134 (178)
Q Consensus 88 ~dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~ 134 (178)
.-.|.++++|++.....+ + ...+++|+-++||+...=+.--+
T Consensus 260 ~~~il~v~eG~~~l~~~~---~--~~~l~~G~s~~ipa~~~~~~i~g 301 (312)
T COG1482 260 SFSILLVLEGEGTLIGGG---Q--TLKLKKGESFFIPANDGPYTIEG 301 (312)
T ss_pred CcEEEEEEcCeEEEecCC---E--EEEEcCCcEEEEEcCCCcEEEEe
Confidence 568899999999987753 2 38999999999999976554333
No 93
>PRK00924 5-keto-4-deoxyuronate isomerase; Provisional
Probab=67.49 E-value=15 Score=32.32 Aligned_cols=57 Identities=18% Similarity=0.120 Sum_probs=34.9
Q ss_pred hccceecCc-ceEEE-Ee--cceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCCC
Q psy840 80 FYTEHLHTD-EEIRL-VL--DGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDIN 136 (178)
Q Consensus 80 f~~eH~H~~-dEi~~-il--~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~~ 136 (178)
-++.|+|+- .|++| +- ++.-.|..-++-|+-=-+.++.||.+++|+---|.-.-+.+
T Consensus 190 SyPPHkHDrr~E~YlYf~l~~~qrV~h~mG~pdETrh~~v~n~~aVisP~wsih~g~gt~~ 250 (276)
T PRK00924 190 TMPCHTHDRRMEVYFYFDMPEDARVFHFMGEPQETRHIVVHNEQAVISPSWSIHSGVGTSN 250 (276)
T ss_pred CCCCccCCCCcceEEEEEcCCCceEEecCCCccceeeEEEECCCEEECCCcceecCcCccc
Confidence 388999994 56544 21 22222222222333222889999999999998887655543
No 94
>cd00038 CAP_ED effector domain of the CAP family of transcription factors; members include CAP (or cAMP receptor protein (CRP)), which binds cAMP, FNR (fumarate and nitrate reduction), which uses an iron-sulfur cluster to sense oxygen) and CooA, a heme containing CO sensor. In all cases binding of the effector leads to conformational changes and the ability to activate transcription. Cyclic nucleotide-binding domain similar to CAP are also present in cAMP- and cGMP-dependent protein kinases (cAPK and cGPK) and vertebrate cyclic nucleotide-gated ion-channels. Cyclic nucleotide-monophosphate binding domain; proteins that bind cyclic nucleotides (cAMP or cGMP) share a structural domain of about 120 residues; the best studied is the prokaryotic catabolite gene activator, CAP, where such a domain is known to be composed of three alpha-helices and a distinctive eight-stranded, antiparallel beta-barrel structure; three conserved glycine residues are thought to be essential for maintenance of
Probab=66.69 E-value=24 Score=23.73 Aligned_cols=37 Identities=24% Similarity=0.088 Sum_probs=28.1
Q ss_pred cceEEEEecceEEEEEEeCCCc-EEEEEEeCCcEEEeC
Q psy840 88 DEEIRLVLDGSGYFDVRDKFDD-WIRIEVTKGDLIIIP 124 (178)
Q Consensus 88 ~dEi~~il~G~g~f~v~~~~d~-~~ri~~~~GDli~vP 124 (178)
.+.+++|++|.........+++ .+...+.+||++-.+
T Consensus 35 ~~~~~~i~~G~v~~~~~~~~g~~~~~~~~~~g~~~g~~ 72 (115)
T cd00038 35 ADSLYIVLSGSVEVYKLDEDGREQIVGFLGPGDLFGEL 72 (115)
T ss_pred CCeEEEEEeCEEEEEEECCCCcEEEEEecCCccCcChH
Confidence 3779999999999888775543 455678899987553
No 95
>PRK15186 AraC family transcriptional regulator; Provisional
Probab=64.13 E-value=17 Score=31.70 Aligned_cols=43 Identities=12% Similarity=0.119 Sum_probs=33.1
Q ss_pred eEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCCC
Q psy840 90 EIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDIN 136 (178)
Q Consensus 90 Ei~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~~ 136 (178)
=+.++.+|.....-. ++.+ +.|.+|.+|++|.+..|.+...+.
T Consensus 40 ~li~v~~G~~~i~~~--~g~~--l~i~~p~~~~~p~~~~~~~~~~~~ 82 (291)
T PRK15186 40 VLIKLTTGKISITTS--SGEY--ITASGPMLIFLAKDQTIHITMEET 82 (291)
T ss_pred EEEEeccceEEEEeC--CCce--EEeCCCeEEEEeCCcEEEEEeccc
Confidence 356778777665443 3443 899999999999999999987764
No 96
>PRK11753 DNA-binding transcriptional dual regulator Crp; Provisional
Probab=62.85 E-value=43 Score=26.31 Aligned_cols=36 Identities=22% Similarity=0.257 Sum_probs=26.8
Q ss_pred cceEEEEecceEEEEEEeCCCc-EEEEEEeCCcEEEe
Q psy840 88 DEEIRLVLDGSGYFDVRDKFDD-WIRIEVTKGDLIII 123 (178)
Q Consensus 88 ~dEi~~il~G~g~f~v~~~~d~-~~ri~~~~GDli~v 123 (178)
.+.+++|++|.......+.+|+ .+--.+.+||++-.
T Consensus 38 ~~~~y~V~~G~v~~~~~~~~g~~~~~~~~~~g~~~g~ 74 (211)
T PRK11753 38 AETLYYIVKGSVAVLIKDEEGKEMILSYLNQGDFIGE 74 (211)
T ss_pred CCeEEEEEeCEEEEEEECCCCCEEEEEEcCCCCEEee
Confidence 3678999999998876665554 44457899999744
No 97
>KOG3706|consensus
Probab=62.65 E-value=7 Score=37.19 Aligned_cols=40 Identities=15% Similarity=0.088 Sum_probs=30.6
Q ss_pred CcEEEEEEeCCcEEEeCCCCeeeeecCCCCcEEEEEEeec
Q psy840 108 DDWIRIEVTKGDLIIIPAGIYHRFTLDINNYVKAKRYFIG 147 (178)
Q Consensus 108 d~~~ri~~~~GDli~vPaG~~H~f~~~~~~~~~alrlF~~ 147 (178)
..++...++|||+|.+|.|+-|-....+.-+-..+-+.+.
T Consensus 378 ePV~e~vle~GDllYfPRG~IHQA~t~~~vHSlHvTlSty 417 (629)
T KOG3706|consen 378 EPVHEFVLEPGDLLYFPRGTIHQADTPALVHSLHVTLSTY 417 (629)
T ss_pred CchHHhhcCCCcEEEecCcceeeccccchhceeEEEeehh
Confidence 3467789999999999999999998887654444444443
No 98
>COG3123 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=62.57 E-value=20 Score=26.41 Aligned_cols=46 Identities=20% Similarity=0.261 Sum_probs=37.3
Q ss_pred cceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCCC
Q psy840 88 DEEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDIN 136 (178)
Q Consensus 88 ~dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~~ 136 (178)
..|+.-++.|.+.+-+-+ .+.| ....+|....||++-..-....+.
T Consensus 41 ~~E~Mtvv~Gal~v~lpg-s~dW--q~~~~Ge~F~VpgnS~F~lqVaea 86 (94)
T COG3123 41 APEEMTVVSGALTVLLPG-SDDW--QVYTAGEVFNVPGNSEFDLQVAEA 86 (94)
T ss_pred CceEEEEEeeEEEEEcCC-Cccc--EEecCCceEEcCCCCeEEEEEeee
Confidence 368888999999988876 5678 789999999999998766655543
No 99
>PRK09391 fixK transcriptional regulator FixK; Provisional
Probab=58.68 E-value=53 Score=26.83 Aligned_cols=46 Identities=22% Similarity=0.110 Sum_probs=32.3
Q ss_pred ceEEEEecceEEEEEEeCCCc-EEEEEEeCCcEEEeCCCCeeeeecC
Q psy840 89 EEIRLVLDGSGYFDVRDKFDD-WIRIEVTKGDLIIIPAGIYHRFTLD 134 (178)
Q Consensus 89 dEi~~il~G~g~f~v~~~~d~-~~ri~~~~GDli~vPaG~~H~f~~~ 134 (178)
+-+++|++|.......+.+|+ .+--.+.+||++-...+..+.+++.
T Consensus 57 ~~ly~I~~G~vkl~~~~~~G~e~i~~~~~~Gd~fG~~~~~~~~~~~~ 103 (230)
T PRK09391 57 DYVYQVESGAVRTYRLLSDGRRQIGAFHLPGDVFGLESGSTHRFTAE 103 (230)
T ss_pred CeEEEEEeCEEEEEEECCCCcEEEEEEecCCceecccCCCcCCeEEE
Confidence 568899999998877665665 3444568999987666655555443
No 100
>PF00027 cNMP_binding: Cyclic nucleotide-binding domain; InterPro: IPR000595 Proteins that bind cyclic nucleotides (cAMP or cGMP) share a structural domain of about 120 residues [, , ]. The best studied of these proteins is the prokaryotic catabolite gene activator (also known as the cAMP receptor protein) (gene crp) where such a domain is known to be composed of three alpha-helices and a distinctive eight-stranded, antiparallel beta-barrel structure. There are six invariant amino acids in this domain, three of which are glycine residues that are thought to be essential for maintenance of the structural integrity of the beta-barrel. cAMP- and cGMP-dependent protein kinases (cAPK and cGPK) contain two tandem copies of the cyclic nucleotide-binding domain. The cAPK's are composed of two different subunits, a catalytic chain and a regulatory chain, which contains both copies of the domain. The cGPK's are single chain enzymes that include the two copies of the domain in their N-terminal section. Vertebrate cyclic nucleotide-gated ion-channels also contain this domain. Two such cations channels have been fully characterised, one is found in rod cells where it plays a role in visual signal transduction.; PDB: 1O7F_A 2BYV_E 3E97_A 3U10_A 2H6B_A 3SHR_A 2OZ6_A 1WGP_A 3LA2_A 3LA3_B ....
Probab=58.20 E-value=36 Score=22.34 Aligned_cols=36 Identities=17% Similarity=0.137 Sum_probs=28.3
Q ss_pred cceEEEEecceEEEEEEeCCCc-EEEEEEeCCcEEEe
Q psy840 88 DEEIRLVLDGSGYFDVRDKFDD-WIRIEVTKGDLIII 123 (178)
Q Consensus 88 ~dEi~~il~G~g~f~v~~~~d~-~~ri~~~~GDli~v 123 (178)
.+.+++|++|.+.......+++ -+-..+.+||++-.
T Consensus 17 ~~~~~~i~~G~v~~~~~~~~~~~~~~~~~~~g~~~g~ 53 (91)
T PF00027_consen 17 CDHIYIILSGEVKVSSINEDGKEQIIFFLGPGDIFGE 53 (91)
T ss_dssp ESEEEEEEESEEEEEEETTTSEEEEEEEEETTEEESG
T ss_pred CCEEEEEEECceEEEeceecceeeeecceeeeccccc
Confidence 5889999999999988876665 34567889998644
No 101
>PLN02288 mannose-6-phosphate isomerase
Probab=57.00 E-value=18 Score=33.22 Aligned_cols=40 Identities=23% Similarity=0.478 Sum_probs=30.0
Q ss_pred cCcceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCe
Q psy840 86 HTDEEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIY 128 (178)
Q Consensus 86 H~~dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~ 128 (178)
.....|.++++|++...... +. ..+.+++|+.+.||++..
T Consensus 352 ~~gp~Illv~~G~~~i~~~~--~~-~~~~l~~G~~~fv~a~~~ 391 (394)
T PLN02288 352 VPGPSVFLVIEGEGVLSTGS--SE-DGTAAKRGDVFFVPAGTE 391 (394)
T ss_pred CCCCEEEEEEcCEEEEecCC--cc-ceEEEeceeEEEEeCCCc
Confidence 34568999999999876543 21 136799999999999754
No 102
>KOG3416|consensus
Probab=56.85 E-value=20 Score=28.19 Aligned_cols=56 Identities=21% Similarity=0.253 Sum_probs=39.4
Q ss_pred CcceEEEEe----cceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCCCCcEEEEEEeecCCceee
Q psy840 87 TDEEIRLVL----DGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDINNYVKAKRYFIGEPVWTP 153 (178)
Q Consensus 87 ~~dEi~~il----~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~~~~~~alrlF~~~~gW~~ 153 (178)
+..|++-.+ .|+..+.|-++.+ -.++|||+|-+-.|..--|.- +++||.+..||..
T Consensus 34 dg~~v~~~kVaD~TgsI~isvW~e~~----~~~~PGDIirLt~Gy~Si~qg-------~LtL~~GK~Ge~~ 93 (134)
T KOG3416|consen 34 DGHEVRSCKVADETGSINISVWDEEG----CLIQPGDIIRLTGGYASIFQG-------CLTLYVGKGGEVQ 93 (134)
T ss_pred CCCEEEEEEEecccceEEEEEecCcC----cccCCccEEEecccchhhhcC-------ceEEEecCCceEe
Confidence 346777665 5778888877544 488999999987776544431 5677777777764
No 103
>PRK13918 CRP/FNR family transcriptional regulator; Provisional
Probab=55.73 E-value=46 Score=26.01 Aligned_cols=36 Identities=14% Similarity=0.014 Sum_probs=27.3
Q ss_pred ceEEEEecceEEEEEEeCCCc-EEEEEEeCCcEEEeC
Q psy840 89 EEIRLVLDGSGYFDVRDKFDD-WIRIEVTKGDLIIIP 124 (178)
Q Consensus 89 dEi~~il~G~g~f~v~~~~d~-~~ri~~~~GDli~vP 124 (178)
+.+++|++|.......+.+|+ .+--.+.|||++--+
T Consensus 27 ~~~y~I~~G~vr~~~~~~~G~e~~l~~~~~Gd~~G~~ 63 (202)
T PRK13918 27 DMLYRVRSGLVRLHTVDDEGNALTLRYVRPGEYFGEE 63 (202)
T ss_pred CeEEEEEeeEEEEEEECCCCCEEEEEEecCCCeechH
Confidence 568999999999887776666 444456999987543
No 104
>PF11142 DUF2917: Protein of unknown function (DUF2917); InterPro: IPR021317 This bacterial family of proteins appears to be restricted to Proteobacteria.
Probab=54.18 E-value=31 Score=23.33 Aligned_cols=39 Identities=18% Similarity=0.142 Sum_probs=30.6
Q ss_pred EEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeec
Q psy840 92 RLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTL 133 (178)
Q Consensus 92 ~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~ 133 (178)
.-+.+|..-.+..+..+. +.+++||-+.+|+|-.=+...
T Consensus 20 l~v~~G~vWlT~~g~~~D---~~L~~G~~l~l~~g~~vvl~a 58 (63)
T PF11142_consen 20 LRVESGRVWLTREGDPDD---YWLQAGDSLRLRRGGRVVLSA 58 (63)
T ss_pred EEEccccEEEECCCCCCC---EEECCCCEEEeCCCCEEEEEe
Confidence 667899988888763332 889999999999998766655
No 105
>PF06719 AraC_N: AraC-type transcriptional regulator N-terminus; InterPro: IPR009594 This entry represents the N terminus of bacterial ARAC-type transcriptional regulators. In Escherichia coli these regulate the L-arabinose operon through sensing the presence of arabinose, and when the sugar is present, transmitting this information from the arabinose-binding domains to the protein s DNA-binding domains []. This family might represent the N-terminal arm of the protein, which binds to the C-terminal DNA binding domains to hold them in a state where the protein prefers to loop and remain non-activating []. This domain is associated with the IPR000005 from INTERPRO domain.
Probab=52.29 E-value=84 Score=24.45 Aligned_cols=58 Identities=16% Similarity=0.140 Sum_probs=43.8
Q ss_pred eecCcceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeee---cCCCCcEEEEEEee
Q psy840 84 HLHTDEEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFT---LDINNYVKAKRYFI 146 (178)
Q Consensus 84 H~H~~dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~---~~~~~~~~alrlF~ 146 (178)
|.--.-=+.+|+.|+=...++++. +...+|+++++|.+++=... ++++..+.++.+.-
T Consensus 19 ~~~y~p~i~~vlQG~K~~~~g~~~-----~~Y~~g~~lv~~~~lPv~~~v~~AS~~~P~l~l~l~l 79 (155)
T PF06719_consen 19 PCVYEPSICIVLQGSKRVHLGDQV-----FEYDAGQYLVSSVDLPVESEVVEASPEEPYLALSLEL 79 (155)
T ss_pred ceecCCeEEEEEeeeEEEEECCce-----EEecCCcEEEecCCCcEEEEEeeccCCCCEEEEEEEc
Confidence 333345678899999988886532 89999999999999876654 45666788888753
No 106
>smart00100 cNMP Cyclic nucleotide-monophosphate binding domain. Catabolite gene activator protein (CAP) is a prokaryotic homologue of eukaryotic cNMP-binding domains, present in ion channels, and cNMP-dependent kinases.
Probab=50.36 E-value=49 Score=22.20 Aligned_cols=37 Identities=16% Similarity=0.025 Sum_probs=27.0
Q ss_pred ceEEEEecceEEEEEEeCCC-cEEEEEEeCCcEEEeCC
Q psy840 89 EEIRLVLDGSGYFDVRDKFD-DWIRIEVTKGDLIIIPA 125 (178)
Q Consensus 89 dEi~~il~G~g~f~v~~~~d-~~~ri~~~~GDli~vPa 125 (178)
+.+++|++|.......+.++ ..+.-.+.+||++-...
T Consensus 36 ~~~y~v~~G~v~~~~~~~~g~~~~~~~~~~g~~~g~~~ 73 (120)
T smart00100 36 DSFYIILSGEVRVYKVLEDGREQILGILGPGDFFGELA 73 (120)
T ss_pred CcEEEEEeeEEEEEEECCCCceEEEEeecCCceechhh
Confidence 67899999999887765444 34556778999875543
No 107
>PRK10402 DNA-binding transcriptional activator YeiL; Provisional
Probab=49.17 E-value=1.4e+02 Score=24.11 Aligned_cols=36 Identities=6% Similarity=-0.225 Sum_probs=27.0
Q ss_pred ceEEEEecceEEEEEEeCCCc-EEEEEEeCCcEEEeC
Q psy840 89 EEIRLVLDGSGYFDVRDKFDD-WIRIEVTKGDLIIIP 124 (178)
Q Consensus 89 dEi~~il~G~g~f~v~~~~d~-~~ri~~~~GDli~vP 124 (178)
+.+++|++|.......+.+|+ .+--.+.+||++-..
T Consensus 50 ~~~y~V~~G~v~v~~~~~~G~e~~~~~~~~g~~~G~~ 86 (226)
T PRK10402 50 SYLFYLTRGRAKLYATLANGKVSLIDFFAAPCFIGEI 86 (226)
T ss_pred ceEEEEEeCEEEEEEECCCCCEeeeeecCCCCeEEee
Confidence 678999999998877766665 333467899987643
No 108
>PLN02288 mannose-6-phosphate isomerase
Probab=49.00 E-value=16 Score=33.53 Aligned_cols=23 Identities=13% Similarity=0.288 Sum_probs=19.9
Q ss_pred EEEEeCCcEEEeCCCCeeeeecC
Q psy840 112 RIEVTKGDLIIIPAGIYHRFTLD 134 (178)
Q Consensus 112 ri~~~~GDli~vPaG~~H~f~~~ 134 (178)
.+.++||+.|.+|||+.|-.--+
T Consensus 252 ~v~L~PGeaifl~ag~~HAYl~G 274 (394)
T PLN02288 252 YVKLNPGEALYLGANEPHAYLSG 274 (394)
T ss_pred eEecCCCCEEEecCCCCceecCC
Confidence 48999999999999999986443
No 109
>cd00214 Calpain_III Calpain, subdomain III. Calpains are calcium-activated cytoplasmic cysteine proteinases, participate in cytoskeletal remodeling processes, cell differentiation, apoptosis and signal transduction. Catalytic domain and the two calmodulin-like domains are separated by C2-like domain III. Domain III plays an important role in calcium-induced activation of calpain involving electrostatic interactions with subdomain II. Proposed to mediate calpain's interaction with phospholipids and translocation to cytoplasmic/nuclear membranes. CD includes subdomain III of typical and atypical calpains.
Probab=48.81 E-value=37 Score=26.29 Aligned_cols=34 Identities=21% Similarity=0.444 Sum_probs=25.1
Q ss_pred EEEEEeCCcEEEeCCCCeeeeecCCCCcEEEEEEeecCC
Q psy840 111 IRIEVTKGDLIIIPAGIYHRFTLDINNYVKAKRYFIGEP 149 (178)
Q Consensus 111 ~ri~~~~GDli~vPaG~~H~f~~~~~~~~~alrlF~~~~ 149 (178)
.++.+.||.+++||. .|..+.+..+ .||+|+..+
T Consensus 111 ~~~~L~pG~YvIIPs----T~~p~~~g~F-~LrVfs~~~ 144 (150)
T cd00214 111 LRFRLPPGEYVIVPS----TFEPGEEGEF-LLRVFSEKS 144 (150)
T ss_pred EEEEcCCCCEEEEee----ecCCCCcccE-EEEEEecCC
Confidence 457899999999984 5666665555 688998764
No 110
>KOG1417|consensus
Probab=48.68 E-value=57 Score=29.54 Aligned_cols=51 Identities=20% Similarity=0.309 Sum_probs=33.3
Q ss_pred EEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCCCCcEEEEEEe
Q psy840 92 RLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDINNYVKAKRYF 145 (178)
Q Consensus 92 ~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~~~~~~alrlF 145 (178)
+.|+--+|...|..+-| |+.+.|+++-+||.|++....+.+...--.+.+|
T Consensus 157 FLiVPQ~G~L~I~TEfG---rllV~P~EI~VIpqG~RFsi~v~~~sRGYilEvY 207 (446)
T KOG1417|consen 157 FLIVPQQGRLWITTEFG---RLLVTPNEIAVIPQGIRFSIDVPGPSRGYILEVY 207 (446)
T ss_pred EEEecccCcEEEEeecc---ceeecccceEEeecccEEEEecCCCCcceEEEEe
Confidence 33444445555544444 4889999999999999998887665433334443
No 111
>cd06919 Asp_decarbox Aspartate alpha-decarboxylase or L-aspartate 1-decarboxylase, a pyruvoyl group-dependent decarboxylase in beta-alanine production. Decarboxylation of aspartate is the major route of beta-alanine production in bacteria, and is catalyzed by the enzyme L-aspartate decarboxylase (ADC), EC:4.1.1.11 which requires a pyruvoyl group for its activity. The pyruvoyl cofactor is covalently bound to the enzyme. The protein is synthesized as a proenzyme and cleaved via self-processing at Gly23-Ser24 to yield an alpha chain (C-terminal fragment) and beta chain (N-terminal fragment), and the pyruvoyl group. Beta-alanine is required for the biosynthesis of pantothenate, in which the enzyme plays a critical regulatory role. The active site of the tetrameric enzyme is located at the interface of two subunits, with a Lysine and a Histidine from the beta chain of one subunit forming the active site with residues from the alpha chain of the adjacent subunit. This alignment
Probab=46.90 E-value=10 Score=29.05 Aligned_cols=58 Identities=24% Similarity=0.202 Sum_probs=37.9
Q ss_pred hHHHHHHHHhcCCCeeeEEEEcCCCCCCHHHHhhhhccceecCcceEEEEecc---eEEEEEEeCCCcEEEEEEeCCcEE
Q psy840 45 DEVLTKLRKDRNYNYEDEITCSKDCLPNYEEKLKSFYTEHLHTDEEIRLVLDG---SGYFDVRDKFDDWIRIEVTKGDLI 121 (178)
Q Consensus 45 ~~~l~~l~~~~gy~~~Dvv~l~~~~~p~~e~~~~~f~~eH~H~~dEi~~il~G---~g~f~v~~~~d~~~ri~~~~GDli 121 (178)
|++|=+.+.=.-|...+|++++.+ ++|- .|++.| ||...+.+.. ...+++||.|
T Consensus 28 D~~Ll~aagi~~~E~V~I~Nv~NG---------~Rf~----------TYvI~g~~gSg~I~lNGAA----Ar~~~~GD~v 84 (111)
T cd06919 28 DEDLLEAAGILPYEKVLVVNVNNG---------ARFE----------TYVIPGERGSGVICLNGAA----ARLGQPGDRV 84 (111)
T ss_pred CHHHHHhcCCCCCCEEEEEECCCC---------cEEE----------EEEEEcCCCCCEEEeCCHH----HhcCCCCCEE
Confidence 444444444455666777777765 2222 567766 4999998754 3689999999
Q ss_pred EeCC
Q psy840 122 IIPA 125 (178)
Q Consensus 122 ~vPa 125 (178)
+|-+
T Consensus 85 II~s 88 (111)
T cd06919 85 IIMA 88 (111)
T ss_pred EEEE
Confidence 9854
No 112
>PF05726 Pirin_C: Pirin C-terminal cupin domain; InterPro: IPR008778 This entry represents C-terminal domain of Pirin proteins from both eukaryotes and prokaryotes. The function of Pirin is unknown but the gene coding for this protein is known to be expressed in all tissues in the human body although it is expressed most strongly in the liver and heart. Pirin is known to be a nuclear protein, exclusively localised within the nucleoplasma and predominantly concentrated within dot-like subnuclear structures []. Pirin is composed of two structurally similar domains arranged face to face. The N-terminal domain additionally features four beta-strands, and the C-terminal domain also includes four additional -strands and a short alpha-helix. Although the two domains are similar, the C-terminal domain of Pirin differs from the N-terminal domain as it does not contain a metal binding site and its sequence does not contain the conserved metal-coordinating residues []. Pirin is confirmed to be a member of the cupin superfamily on the basis of primary sequence and structural similarity. The presence of a metal binding site in the N-terminal beta-barrel of Pirin, may be significant in its role in regulating NFI DNA replication and NF-kappaB transcription factor activity []. Pirin structure has been found to closely resemble members of the cupin superfamily. Pirin contains the two characteristic sequences of the cupin superfamily, namely PG-(X)5-HXH-(X)4-E-(X)6-G and G-(X)5-PXG-(X)2-H-(X)3-N separated by a variable stretch of 15-50 amino acids. These motifs are best conserved in the N-terminal where the conserved histidine and glutamic acid residues correspond to the metal-coordinating residues. The C-terminal domain motifs lack the metal binding residues normally associated with the cupin fold []. Pirin was identified to be a metal-binding protein [], and was found that the metal-binding residues of Pirins are highly conserved across mammals, plants, fungi, and prokaryotic organisms. Pirin acts as a cofactor for the transcription factor NFI, the regulatory mechanism of which is generally believed to require the assistance of a metal ion []. Structural data supports the hypothesis that the bound iron of Pirin may participate in this transcriptional regulation by enhancing and stabilising the formation of the p50,Bcl3,DNA complex []. Metals have been implicated directly or indirectly in the NF-kappaB family of transcription factors that control expression of a number of early response genes associated with inflammatory responses, cell growth, cell cycle progression, and neoplastic transformation []. However, most metal-dependent transcription factors are DNA-binding proteins that bind to specific sequences when the metal binds to the protein. Pirin, on the other hand, appears to function differently and bind to the transcription factor DNA complex [].; PDB: 1J1L_A 3ACL_A 2P17_A.
Probab=46.45 E-value=1e+02 Score=22.29 Aligned_cols=53 Identities=23% Similarity=0.240 Sum_probs=34.5
Q ss_pred cceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCCCCcEEEEEEeecCC
Q psy840 88 DEEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDINNYVKAKRYFIGEP 149 (178)
Q Consensus 88 ~dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~~~~~~alrlF~~~~ 149 (178)
.+=..|+++|++. +.+.. ..+.+|+++++..|..=.+++.++ ... +-|+.++|
T Consensus 20 ~~~~iyv~~G~~~--v~~~~-----~~~~~~~~~~l~~g~~i~~~a~~~-~a~-~lll~GeP 72 (104)
T PF05726_consen 20 HNAFIYVLEGSVE--VGGEE-----DPLEAGQLVVLEDGDEIELTAGEE-GAR-FLLLGGEP 72 (104)
T ss_dssp -EEEEEEEESEEE--ETTTT-----EEEETTEEEEE-SECEEEEEESSS-SEE-EEEEEE--
T ss_pred CEEEEEEEECcEE--ECCCc-----ceECCCcEEEECCCceEEEEECCC-CcE-EEEEEccC
Confidence 3667999999975 43322 689999999999888888888742 222 34555544
No 113
>TIGR02466 conserved hypothetical protein. This family consists of uncharacterized proteins in Caulobacter crescentus CB15, Bdellovibrio bacteriovorus HD100, Synechococcus sp. WH 8102 (2), Silicibacter pomeroyi DSS-3 (2), and Hyphomonas neptunium ATCC 15444. The context of nearby genes differs substantially between members and does point to any specific biological role.
Probab=45.98 E-value=79 Score=26.20 Aligned_cols=87 Identities=15% Similarity=0.192 Sum_probs=49.6
Q ss_pred hHHHHHHHHhcCCCe--eeEEEEcCCCCCCHHHHhhhhccceecCcc---eEEEEe----cceEEEEEEe----------
Q psy840 45 DEVLTKLRKDRNYNY--EDEITCSKDCLPNYEEKLKSFYTEHLHTDE---EIRLVL----DGSGYFDVRD---------- 105 (178)
Q Consensus 45 ~~~l~~l~~~~gy~~--~Dvv~l~~~~~p~~e~~~~~f~~eH~H~~d---Ei~~il----~G~g~f~v~~---------- 105 (178)
++.+.+..++.|+.. .++ .+.. ..+|... .+.+...|.|+.- =++|+- .|.+.|.--.
T Consensus 74 ~~~v~~~~~~l~~d~~~~~l-~i~~-~W~ni~~-~Gg~h~~H~Hp~~~lSgvyYl~~p~~~g~~~f~~p~~~~~~~~~~~ 150 (201)
T TIGR02466 74 DKHVAKFARDLEGDNDGLEL-RIQK-AWVNILP-QGGTHSPHLHPGSVISGTYYVQTPENCGAIKFEDPRLDDMMAAPMR 150 (201)
T ss_pred HHHHHHHHHHcCCCccccce-EEee-EeEEEcC-CCCccCceECCCceEEEEEEEeCCCCCCceeEecCcchhhhccccc
Confidence 455667777777731 121 1111 1222111 2457889999984 456665 3344442100
Q ss_pred ------CCCcEEEEEEeCCcEEEeCCCCeeeeecC
Q psy840 106 ------KFDDWIRIEVTKGDLIIIPAGIYHRFTLD 134 (178)
Q Consensus 106 ------~~d~~~ri~~~~GDli~vPaG~~H~f~~~ 134 (178)
....++.+.-++||+|+-|+-+.|.....
T Consensus 151 ~~~~~~~~~~~~~v~P~~G~lvlFPS~L~H~v~p~ 185 (201)
T TIGR02466 151 IPNAKRAVQRFVYVPPQEGRVLLFESWLRHEVPPN 185 (201)
T ss_pred cCccccccCccEEECCCCCeEEEECCCCceecCCC
Confidence 01125557779999999999999997543
No 114
>PF05721 PhyH: Phytanoyl-CoA dioxygenase (PhyH); InterPro: IPR008775 This family is made up of several eukaryotic phytanoyl-CoA dioxygenase (PhyH) proteins as well as a number of bacterial deoxygenases. PhyH is a peroxisomal enzyme catalysing the first step of phytanic acid alpha-oxidation. PhyH deficiency causes Refsum's disease (RD) which is an inherited neurological syndrome biochemically characterised by the accumulation of phytanic acid in plasma and tissues [].; PDB: 3GJA_A 3EMR_A 3OBZ_A 2OPW_A 3NNL_B 3NNF_A 3NNM_B 3NNJ_A 2FCV_B 2FCU_A ....
Probab=45.87 E-value=22 Score=27.22 Aligned_cols=28 Identities=18% Similarity=0.561 Sum_probs=22.0
Q ss_pred CcEEEEEEeCCcEEEeCCCCeeeeecCC
Q psy840 108 DDWIRIEVTKGDLIIIPAGIYHRFTLDI 135 (178)
Q Consensus 108 d~~~ri~~~~GDli~vPaG~~H~f~~~~ 135 (178)
..++.+.+++||+|+.-..+.|+-....
T Consensus 177 ~~~~~~~~~~Gdvl~~~~~~~H~s~~N~ 204 (211)
T PF05721_consen 177 DEWVPVPMKAGDVLFFHSRLIHGSGPNT 204 (211)
T ss_dssp SGCEEE-BSTTEEEEEETTSEEEEE-B-
T ss_pred CceEEeecCCCeEEEEcCCccccCCCCC
Confidence 3568899999999999999999986543
No 115
>PF05995 CDO_I: Cysteine dioxygenase type I; InterPro: IPR010300 Cysteine dioxygenase type I (1.13.11.20 from EC) converts cysteine to cysteinesulphinic acid and is the rate-limiting step in sulphate production.; GO: 0005506 iron ion binding, 0017172 cysteine dioxygenase activity, 0046439 L-cysteine metabolic process, 0055114 oxidation-reduction process; PDB: 2IC1_A 3EQE_B 3ELN_A 2B5H_A 2GH2_A 2Q4S_A 2ATF_A 2GM6_A 3USS_B.
Probab=44.00 E-value=1.7e+02 Score=23.43 Aligned_cols=70 Identities=14% Similarity=-0.089 Sum_probs=43.3
Q ss_pred hhccceecCc-ceEEEEecceEEEEEEeC-CC--cE-----EEEEEeCCcEEEeCCCCeeeeec-CCCCcEEEEEEeecC
Q psy840 79 SFYTEHLHTD-EEIRLVLDGSGYFDVRDK-FD--DW-----IRIEVTKGDLIIIPAGIYHRFTL-DINNYVKAKRYFIGE 148 (178)
Q Consensus 79 ~f~~eH~H~~-dEi~~il~G~g~f~v~~~-~d--~~-----~ri~~~~GDli~vPaG~~H~f~~-~~~~~~~alrlF~~~ 148 (178)
++-..|-|.. .=+..|++|+..-..-.. ++ .. .......|...+.+.+--|+... ..+.....|-+|.++
T Consensus 86 q~S~IHDH~~s~g~~~vl~G~l~e~~y~~~~~~~~~~~~~~~~~~~~~g~~~~~~~~~iH~v~n~s~~~~avSLHvYspP 165 (175)
T PF05995_consen 86 QRSPIHDHGGSWGWVKVLSGELEETRYRRPDDGGAPLELVGRERLLPGGVTYIFDPHGIHRVENPSGDEPAVSLHVYSPP 165 (175)
T ss_dssp -B--EEE-TTSEEEEEEEESEEEEEEEEESTSSS-EEEECEEEEEETTTEEEEBTTTBEEEEEES-SSS-EEEEEEEES-
T ss_pred CcCCCCCCCCceEEEEEecceEEEEEeccCCcccCcccccCceEecCCCeEEecCCCCeEEeccCCCCCCEEEEEEcCCC
Confidence 4568999987 457889999976543221 22 11 22345677777889999999954 446778888888874
No 116
>COG3097 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=43.24 E-value=49 Score=24.84 Aligned_cols=45 Identities=20% Similarity=0.358 Sum_probs=37.1
Q ss_pred EecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCCCCcEEEEEEeecCC
Q psy840 94 VLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDINNYVKAKRYFIGEP 149 (178)
Q Consensus 94 il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~~~~~~alrlF~~~~ 149 (178)
|+.|+-...++++.. =..++||.+-| ++...+.+++.+.+-+-+|
T Consensus 16 ilagrKTITIRD~SE----Shf~~g~vlrV-------~r~Ed~~~fc~I~vl~vsp 60 (106)
T COG3097 16 ILAGRKTITIRDKSE----SHFKPGDVLRV-------GRFEDDRYFCTIEVLAVSP 60 (106)
T ss_pred HhCCCceEEEeccch----hcCCCCCEEEE-------EEecCCcEEEEEEEEEecc
Confidence 789999999998764 58899999987 6677778888888877665
No 117
>PRK05449 aspartate alpha-decarboxylase; Provisional
Probab=41.53 E-value=13 Score=28.99 Aligned_cols=58 Identities=26% Similarity=0.264 Sum_probs=37.7
Q ss_pred hHHHHHHHHhcCCCeeeEEEEcCCCCCCHHHHhhhhccceecCcceEEEEecc---eEEEEEEeCCCcEEEEEEeCCcEE
Q psy840 45 DEVLTKLRKDRNYNYEDEITCSKDCLPNYEEKLKSFYTEHLHTDEEIRLVLDG---SGYFDVRDKFDDWIRIEVTKGDLI 121 (178)
Q Consensus 45 ~~~l~~l~~~~gy~~~Dvv~l~~~~~p~~e~~~~~f~~eH~H~~dEi~~il~G---~g~f~v~~~~d~~~ri~~~~GDli 121 (178)
|++|=+.+.=.-|...+|++++.+ ++|- .|++.| ||.+.+.+.. ...+++||.|
T Consensus 29 D~~Ll~aagi~p~E~V~V~Nv~NG---------~Rf~----------TYvI~g~~GSg~I~lNGAA----Ar~~~~GD~v 85 (126)
T PRK05449 29 DEDLLDAAGILENEKVQIVNVNNG---------ARFE----------TYVIAGERGSGVICLNGAA----ARLVQVGDLV 85 (126)
T ss_pred CHHHHHhcCCCCCCEEEEEECCCC---------cEEE----------EEEEEcCCCCCEEEeCCHH----HhcCCCCCEE
Confidence 344433444445667777777765 1221 467765 5999998754 3689999999
Q ss_pred EeCC
Q psy840 122 IIPA 125 (178)
Q Consensus 122 ~vPa 125 (178)
+|-+
T Consensus 86 II~a 89 (126)
T PRK05449 86 IIAA 89 (126)
T ss_pred EEEE
Confidence 9854
No 118
>TIGR00223 panD L-aspartate-alpha-decarboxylase. Members of this family are aspartate 1-decarboxylase, the enzyme that makes beta-alanine and C02 from aspartate. Beta-alanine is then used to make the vitamin pantothenate, from which coenzyme A is made. Aspartate 1-decarboxylase is synthesized as a proenzyme, then cleaved to an alpha (C-terminal) and beta (N-terminal) subunit with a pyruvoyl group.
Probab=41.43 E-value=13 Score=29.01 Aligned_cols=59 Identities=24% Similarity=0.227 Sum_probs=37.3
Q ss_pred hHHHHHHHHhcCCCeeeEEEEcCCCCCCHHHHhhhhccceecCcceEEEEecc---eEEEEEEeCCCcEEEEEEeCCcEE
Q psy840 45 DEVLTKLRKDRNYNYEDEITCSKDCLPNYEEKLKSFYTEHLHTDEEIRLVLDG---SGYFDVRDKFDDWIRIEVTKGDLI 121 (178)
Q Consensus 45 ~~~l~~l~~~~gy~~~Dvv~l~~~~~p~~e~~~~~f~~eH~H~~dEi~~il~G---~g~f~v~~~~d~~~ri~~~~GDli 121 (178)
|++|=+.+.=.-|...+|++++.+ ++|- .|++.| ||.+.+.+.. ...+++||.|
T Consensus 29 D~~Lm~aagi~p~E~V~V~Nv~NG---------~Rf~----------TYvI~G~~GSg~I~lNGAA----Arl~~~GD~V 85 (126)
T TIGR00223 29 DEDLLDAAGILENEKVDIVNVNNG---------KRFS----------TYAIAGKRGSRIICVNGAA----ARCVSVGDIV 85 (126)
T ss_pred CHHHHHhcCCCCCCEEEEEECCCC---------cEEE----------EEEEEcCCCCCEEEeCCHH----HhcCCCCCEE
Confidence 343333334444666677777665 1221 467765 5999998754 3689999999
Q ss_pred EeCCC
Q psy840 122 IIPAG 126 (178)
Q Consensus 122 ~vPaG 126 (178)
+|=+-
T Consensus 86 II~sy 90 (126)
T TIGR00223 86 IIASY 90 (126)
T ss_pred EEEEC
Confidence 98553
No 119
>PRK09392 ftrB transcriptional activator FtrB; Provisional
Probab=41.41 E-value=64 Score=26.07 Aligned_cols=34 Identities=15% Similarity=0.053 Sum_probs=24.8
Q ss_pred ceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEE
Q psy840 89 EEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLII 122 (178)
Q Consensus 89 dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~ 122 (178)
+.+++|++|.........++...-..+.+||++-
T Consensus 49 ~~~~~v~~G~v~~~~~~~~~~~~i~~~~~g~~~g 82 (236)
T PRK09392 49 DFLFVVLDGLVELSASSQDRETTLAILRPVSTFI 82 (236)
T ss_pred ceEEEEEeCEEEEEEcCCCceEEEEEeCCCchhh
Confidence 6789999999988765544444445788899764
No 120
>PF07653 SH3_2: Variant SH3 domain; InterPro: IPR011511 SH3 (src Homology-3) domains are small protein modules containing approximately 50 amino acid residues [, ]. They are found in a great variety of intracellular or membrane-associated proteins [, , ] for example, in a variety of proteins with enzymatic activity, in adaptor proteins that lack catalytic sequences and in cytoskeletal proteins, such as fodrin and yeast actin binding protein ABP-1. The SH3 domain has a characteristic fold which consists of five or six beta-strands arranged as two tightly packed anti-parallel beta sheets. The linker regions may contain short helices []. The surface of the SH3-domain bears a flat, hydrophobic ligand-binding pocket which consists of three shallow grooves defined by conservative aromatic residues in which the ligand adopts an extended left-handed helical arrangement. The ligand binds with low affinity but this may be enhanced by multiple interactions. The region bound by the SH3 domain is in all cases proline-rich and contains PXXP as a core-conserved binding motif. The function of the SH3 domain is not well understood but they may mediate many diverse processes such as increasing local concentration of proteins, altering their subcellular location and mediating the assembly of large multiprotein complexes []. This entry represents a variant of the SH3 domain.; PDB: 1I1J_B 1K0X_A 1HJD_A 2KEA_A 1KJW_A 1JXM_A 1JXO_B 2EBP_A 2DL3_A 2EYX_A ....
Probab=41.10 E-value=35 Score=21.74 Aligned_cols=34 Identities=24% Similarity=0.379 Sum_probs=19.2
Q ss_pred EEEeCCcEEEeCCCCeeeeecCCCCcEEEEEEeecCCceeec
Q psy840 113 IEVTKGDLIIIPAGIYHRFTLDINNYVKAKRYFIGEPVWTPH 154 (178)
Q Consensus 113 i~~~~GDli~vPaG~~H~f~~~~~~~~~alrlF~~~~gW~~~ 154 (178)
+.+++||+|.|= -.-..+....+.. .+..||.|.
T Consensus 16 Ls~~~Gd~i~v~------~~~~~~~ww~~~~--~g~~G~~P~ 49 (55)
T PF07653_consen 16 LSFKKGDVIEVL------GEKDDDGWWLGEN--NGRRGWFPS 49 (55)
T ss_dssp -EB-TTEEEEEE------EEECSTSEEEEEE--TTEEEEEEG
T ss_pred eEEecCCEEEEE------EeecCCCEEEEEE--CCcEEEEcH
Confidence 889999988774 0122234444433 667788883
No 121
>PF05962 HutD: HutD; InterPro: IPR010282 This entry contains proteins of unknown function, which include HutD from Pseudomonas fluorescens and Ves from Escherichia coli K12. HutD from P. fluorescens is a component of the histidine uptake and utilisation operon. HutD is operonic with the well characterised repressor protein HutC. Genetic analysis using transcriptional fusions (lacZ) and deletion mutants shows that hutD is necessary to maintain fitness in environments replete with histidine. HutD probably sets an upper bound on the level of hut operon transcription []. The mechanistic basis is unknown, but in silico molecular docking studies based on the crystal structure of HutD from Pseudomonas aeruginosa show that urocanate (the first breakdown product of histidine) docks with the active site of HutD.; PDB: 3ESG_A 1YLL_D.
Probab=40.14 E-value=36 Score=27.67 Aligned_cols=49 Identities=18% Similarity=0.236 Sum_probs=28.6
Q ss_pred cceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCCCCcEEEEE
Q psy840 88 DEEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDINNYVKAKR 143 (178)
Q Consensus 88 ~dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~~~~~~alr 143 (178)
..-+.|+++|+....... + .+.+.+||.+++-. .+...+..+..+..++
T Consensus 135 ~~~l~~~~~G~~~i~~~~---~--~~~L~~~d~l~~~~--~~~~~l~~~g~ll~v~ 183 (184)
T PF05962_consen 135 STVLVYVLEGAWSITEGG---N--CISLSAGDLLLIDD--EEDLPLTGDGQLLWVS 183 (184)
T ss_dssp SEEEEEESSS-EEECCCE---E--EEEE-TT-EEEEES--EECEEEEEECCEEEEE
T ss_pred CEEEEEEeeCcEEEecCC---C--ceEcCCCCEEEEeC--CCceEecCCeeEEEEe
Confidence 355678999975543322 1 38999999998866 4444555555555544
No 122
>COG3806 ChrR Transcriptional activator [Transcription]
Probab=38.71 E-value=49 Score=28.03 Aligned_cols=55 Identities=20% Similarity=0.224 Sum_probs=40.1
Q ss_pred hhccceecCcceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCCCCcEEEE
Q psy840 79 SFYTEHLHTDEEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDINNYVKAK 142 (178)
Q Consensus 79 ~f~~eH~H~~dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~~~~~~al 142 (178)
..++.|+|..-|-..|++|. |+-+ .| ....||++.-+.++.|.-....+..+.++
T Consensus 139 ~s~P~HtH~G~E~t~vl~G~--~sde--~G-----~y~vgD~~~~d~~v~H~piv~~~~eClcl 193 (216)
T COG3806 139 RSFPDHTHVGIERTAVLEGA--FSDE--NG-----EYLVGDFTLADGTVQHSPIVLPPGECLCL 193 (216)
T ss_pred cccccccccceEEEEEEeec--cccC--CC-----ccccCceeecCCccccccccCCCCCceEE
Confidence 45889999999998888775 4433 33 56789999999999999755444333333
No 123
>PF13640 2OG-FeII_Oxy_3: 2OG-Fe(II) oxygenase superfamily; PDB: 3DKQ_B 3GZE_D 3HQR_A 2Y34_A 2G1M_A 2G19_A 3OUI_A 3OUJ_A 2HBU_A 2Y33_A ....
Probab=38.52 E-value=63 Score=22.50 Aligned_cols=57 Identities=23% Similarity=0.352 Sum_probs=33.6
Q ss_pred hhhccceecC---c-ceEEEE--ec-c-----eEEEEEEeC---CCcEEEEE-----EeCCcEEEeCC-CCeeeeecC
Q psy840 78 KSFYTEHLHT---D-EEIRLV--LD-G-----SGYFDVRDK---FDDWIRIE-----VTKGDLIIIPA-GIYHRFTLD 134 (178)
Q Consensus 78 ~~f~~eH~H~---~-dEi~~i--l~-G-----~g~f~v~~~---~d~~~ri~-----~~~GDli~vPa-G~~H~f~~~ 134 (178)
+.++.+|+.. . ..+.++ |. . .|.+.+... ++....+. .++|++|+.|. .+.|..+..
T Consensus 9 G~~~~~H~D~~~~~~~~~t~llyL~~~~~~~~GG~l~~~~~~~~~~~~~~~~~~~~~p~~g~~v~F~~~~~~H~v~~v 86 (100)
T PF13640_consen 9 GGFFGPHTDNSYDPHRRVTLLLYLNDPEWEFEGGELEFYPSKDSDDVSREVEDFDIVPKPGRLVIFPSDNSLHGVTPV 86 (100)
T ss_dssp TEEEEEEESSSCCCSEEEEEEEESS-CS-HCEE--EEETTTS-TSSTCEEEGGGSEE-BTTEEEEEESCTCEEEEEEE
T ss_pred CCEEeeeECCCCCCcceEEEEEEECCCCcccCCCEEEEeccccCCCcceEEEeccccCCCCEEEEEeCCCCeecCccc
Confidence 3567788876 3 333333 44 2 255555431 22333344 89999999999 999998766
No 124
>COG0361 InfA Translation initiation factor 1 (IF-1) [Translation, ribosomal structure and biogenesis]
Probab=37.63 E-value=67 Score=22.89 Aligned_cols=12 Identities=33% Similarity=0.899 Sum_probs=9.8
Q ss_pred EEEEeCCcEEEe
Q psy840 112 RIEVTKGDLIII 123 (178)
Q Consensus 112 ri~~~~GDli~v 123 (178)
+|.+.|||.++|
T Consensus 44 ~i~I~~GD~V~V 55 (75)
T COG0361 44 RIRILPGDVVLV 55 (75)
T ss_pred eEEeCCCCEEEE
Confidence 578888998876
No 125
>PRK10202 ebgC cryptic beta-D-galactosidase subunit beta; Reviewed
Probab=37.61 E-value=2e+02 Score=22.52 Aligned_cols=45 Identities=22% Similarity=0.215 Sum_probs=33.1
Q ss_pred cceEEEEecceEEEEEEeC------------C------CcEEEEEEeCCcEEEeCCCCeeeee
Q psy840 88 DEEIRLVLDGSGYFDVRDK------------F------DDWIRIEVTKGDLIIIPAGIYHRFT 132 (178)
Q Consensus 88 ~dEi~~il~G~g~f~v~~~------------~------d~~~ri~~~~GDli~vPaG~~H~f~ 132 (178)
.-.|-|+++|+=.+.+... . +..-.+.|.||++++..|+=.|+..
T Consensus 65 YiDIq~~l~G~E~i~~~~~~~~~~~~~y~~e~D~~f~~~~~~~v~l~~G~F~iffP~daH~P~ 127 (149)
T PRK10202 65 YFEVHYYLQGQQKIEYAPKETLQVVEYYRDETDREYLKGCGETVEVHEGQIVICDIHEAYRFI 127 (149)
T ss_pred EEEEEEEEeCeEEEEEEEcccCccccccCcccCeeeccCCCcEEEeCCCeEEEECCcccccCC
Confidence 4778888888877776531 1 1112588999999999999999987
No 126
>PF01987 AIM24: Mitochondrial biogenesis AIM24; InterPro: IPR002838 The proteins in this family have no known function.; PDB: 1PG6_A 1YOX_D.
Probab=37.48 E-value=50 Score=26.80 Aligned_cols=34 Identities=26% Similarity=0.473 Sum_probs=27.4
Q ss_pred EEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCC
Q psy840 93 LVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGI 127 (178)
Q Consensus 93 ~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~ 127 (178)
.-++|+|...+.. .+..+.+.+.+|+-++|.++-
T Consensus 134 ~~l~G~G~v~l~~-~G~i~~i~L~~ge~~~Vd~~~ 167 (215)
T PF01987_consen 134 LKLSGRGTVFLSG-YGAIYEIDLAPGEEIIVDPGH 167 (215)
T ss_dssp EEEESSCEEEEEE-CCSEEEEEEE-EEEEEEEGGG
T ss_pred EEEEEEEEEEEEe-CCcEEEEEccCCceEEEcCCC
Confidence 4588999998888 578889999999999987763
No 127
>PF04622 ERG2_Sigma1R: ERG2 and Sigma1 receptor like protein; InterPro: IPR006716 This family consists of the fungal C-8 sterol isomerase and mammalian sigma1 receptor. C-8 sterol isomerase (delta-8--delta-7 sterol isomerase), catalyses a reaction in ergosterol biosynthesis, which results in unsaturation at C-7 in the B ring of sterols []. Sigma 1 receptor is a low molecular mass mammalian protein located in the endoplasmic reticulum [], which interacts with endogenous steroid hormones, such as progesterone and testosterone []. It also binds the sigma ligands, which are a set of chemically unrelated drugs including haloperidol, pentazocine, and ditolylguanidine []. Sigma1 effectors are not well understood, but sigma1 agonists have been observed to affect NMDA receptor function, the alpha-adrenergic system and opioid analgesia.; GO: 0000247 C-8 sterol isomerase activity, 0006696 ergosterol biosynthetic process, 0005783 endoplasmic reticulum
Probab=36.91 E-value=1.2e+02 Score=25.65 Aligned_cols=61 Identities=16% Similarity=0.310 Sum_probs=45.0
Q ss_pred ecCcceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCCCCcEEEEEEeecCCceee
Q psy840 85 LHTDEEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDINNYVKAKRYFIGEPVWTP 153 (178)
Q Consensus 85 ~H~~dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~~~~~~alrlF~~~~gW~~ 153 (178)
.|..|.-+.||+|+..=...+..+ ....+|||....|.|...-..+.++..+ +.+ ..||+|
T Consensus 116 rh~ad~y~tIL~G~~~~~~~g~~~---~evy~pGd~~~l~rg~a~~y~m~~~tw~--LEY---~RG~IP 176 (216)
T PF04622_consen 116 RHWADDYFTILSGEQWAWSPGSLE---PEVYKPGDSHHLPRGEAKQYQMPPGTWA--LEY---GRGWIP 176 (216)
T ss_pred ceEeeeEEEEEEEEEEEEcCCCCC---ceEeccCCEEEecCceEEEEEeCCCeEE--EEe---cCCchh
Confidence 355688899999997665544333 4788999999999999999988877543 332 257776
No 128
>TIGR03697 NtcA_cyano global nitrogen regulator NtcA, cyanobacterial. Members of this protein family, found in the cyanobacteria, are the global nitrogen regulator NtcA. This DNA-binding transcriptional regulator is required for expressing many different ammonia-repressible genes. The consensus NtcA-binding site is G T A N(8)T A C.
Probab=36.30 E-value=86 Score=24.13 Aligned_cols=34 Identities=6% Similarity=0.008 Sum_probs=26.1
Q ss_pred ceEEEEecceEEEEEEeCCCc-EEEEEEeCCcEEE
Q psy840 89 EEIRLVLDGSGYFDVRDKFDD-WIRIEVTKGDLII 122 (178)
Q Consensus 89 dEi~~il~G~g~f~v~~~~d~-~~ri~~~~GDli~ 122 (178)
+-+++|++|........++|+ .+--.+.+||++-
T Consensus 12 ~~~~~i~~G~v~~~~~~~~G~e~~l~~~~~g~~~G 46 (193)
T TIGR03697 12 EKVYFLRRGAVKLSRVYESGEEITVALLRENSVFG 46 (193)
T ss_pred CcEEEEEecEEEEEEeCCCCcEeeeEEccCCCEee
Confidence 568999999999887766665 3345689999864
No 129
>COG0664 Crp cAMP-binding proteins - catabolite gene activator and regulatory subunit of cAMP-dependent protein kinases [Signal transduction mechanisms]
Probab=36.10 E-value=69 Score=24.47 Aligned_cols=39 Identities=15% Similarity=0.102 Sum_probs=29.5
Q ss_pred ceEEEEecceEEEEEEeCCCcEEEE-EEeCCcEEEeCCCC
Q psy840 89 EEIRLVLDGSGYFDVRDKFDDWIRI-EVTKGDLIIIPAGI 127 (178)
Q Consensus 89 dEi~~il~G~g~f~v~~~~d~~~ri-~~~~GDli~vPaG~ 127 (178)
+-+++|++|.........+|+..-+ .+.+||++-..+=.
T Consensus 42 ~~~y~v~~G~v~~~~~~~~G~~~~~~~~~~g~~fg~~~l~ 81 (214)
T COG0664 42 DSLYIILSGIVKLYANTEDGREIILGFLGPGDFFGELALL 81 (214)
T ss_pred ceEEEEEEeEEEEEEECCCCcEEEEEEecCCchhhhHHHh
Confidence 3489999999999888877764334 58899998776533
No 130
>TIGR03805 beta_helix_1 parallel beta-helix repeat-containing protein. Members of this protein family contain a tandem pair of beta-helix repeats (see TIGR03804). Each repeat is expected to consist of three beta strands that form a single turn as they form a right-handed helix of stacked beta-structure. Member proteinsa occur regularly in two-gene pairs along with another uncharacterized protein family; both protein families exhibit either lipoprotein or regular signal peptides, suggesting transit through the plasma membrane, and the two may be fused. The function of the pair is unknown.
Probab=35.21 E-value=22 Score=31.30 Aligned_cols=16 Identities=31% Similarity=0.895 Sum_probs=14.7
Q ss_pred EEeCCcEEEeCCCCee
Q psy840 114 EVTKGDLIIIPAGIYH 129 (178)
Q Consensus 114 ~~~~GDli~vPaG~~H 129 (178)
..+|||.|.||+|+++
T Consensus 7 ~A~~GDtI~l~~G~Y~ 22 (314)
T TIGR03805 7 AAQPGDTIVLPEGVFQ 22 (314)
T ss_pred hCCCCCEEEECCCEEE
Confidence 4689999999999998
No 131
>TIGR02408 ectoine_ThpD ectoine hydroxylase. Both ectoine and hydroxyectoine are compatible solvents that serve as protectants against osmotic and thermal stresses. A number of genomes synthesize ectoine. This enzyme allows conversion of ectoine to hydroxyectoine, which may be more effective for some purposes, and is found in a subset of ectoine-producing organisms.
Probab=34.74 E-value=68 Score=27.36 Aligned_cols=36 Identities=8% Similarity=-0.020 Sum_probs=26.4
Q ss_pred EEEEEeCCcEEEeCCCCeeeeecCCC---CcEEEEEEee
Q psy840 111 IRIEVTKGDLIIIPAGIYHRFTLDIN---NYVKAKRYFI 146 (178)
Q Consensus 111 ~ri~~~~GDli~vPaG~~H~f~~~~~---~~~~alrlF~ 146 (178)
+.+.+++||+++.-..+.|+.....+ ..+..+++..
T Consensus 211 v~~~~~aGDvl~f~~~~~H~S~~N~s~~~R~~l~l~y~~ 249 (277)
T TIGR02408 211 STFTGKAGSAVWFDCNTMHGSGSNITPWPRSNVFMVFNS 249 (277)
T ss_pred eeeccCCceEEEEccccccCCCCCCCCCcceeEEEEEec
Confidence 34778999999999999999865433 3355566653
No 132
>smart00720 calpain_III calpain_III.
Probab=34.67 E-value=80 Score=23.87 Aligned_cols=33 Identities=24% Similarity=0.510 Sum_probs=23.5
Q ss_pred EEEEeCCcEEEeCCCCeeeeecCCCCcEEEEEEeecCC
Q psy840 112 RIEVTKGDLIIIPAGIYHRFTLDINNYVKAKRYFIGEP 149 (178)
Q Consensus 112 ri~~~~GDli~vPaG~~H~f~~~~~~~~~alrlF~~~~ 149 (178)
+..+.||.+++||. .|..+....+ .||+|+..+
T Consensus 107 ~~~L~~G~Y~iVPs----T~~p~~~g~F-~LrV~s~~~ 139 (143)
T smart00720 107 RFRLPPGEYVIVPS----TFEPNQEGDF-LLRVFSEGP 139 (143)
T ss_pred EEEcCCCCEEEEEe----ecCCCCccCE-EEEEEecCc
Confidence 46788999999874 4555655555 678888754
No 133
>PHA02890 hypothetical protein; Provisional
Probab=34.18 E-value=2.2e+02 Score=25.10 Aligned_cols=58 Identities=17% Similarity=0.153 Sum_probs=43.4
Q ss_pred eEEE--EecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCCCCcEEEEEEeecCC
Q psy840 90 EIRL--VLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDINNYVKAKRYFIGEP 149 (178)
Q Consensus 90 Ei~~--il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~~~~~~alrlF~~~~ 149 (178)
|-+| .++|++..-+.. +|+-+...+.+||-+++--++.|.-.. .+-.+..+++-.+-|
T Consensus 92 Ey~FVlCL~Gs~~In~~~-~d~~iS~~I~kGeaF~mdv~t~H~i~T-Knl~L~Viky~vd~p 151 (278)
T PHA02890 92 ECFFVACIEGSCKINVNI-GDREISDHIHENQGFIMDVGLDHAIDS-DNVGLFITKFEVDAH 151 (278)
T ss_pred cEEEEEEeCCeEEEEEec-CCceeeeeeecCceEEEEccceEEEEc-cceeEEEEEEEecce
Confidence 4444 479999988876 567788999999999999999999877 444454555444433
No 134
>PHA02984 hypothetical protein; Provisional
Probab=33.09 E-value=2.3e+02 Score=25.14 Aligned_cols=58 Identities=16% Similarity=0.098 Sum_probs=42.8
Q ss_pred ceEEE--EecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCC-CCcEEEEEEeec
Q psy840 89 EEIRL--VLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDI-NNYVKAKRYFIG 147 (178)
Q Consensus 89 dEi~~--il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~-~~~~~alrlF~~ 147 (178)
-|-+| .++|++...+.. +|+-+...+++|+-+.+--++.|.-+... +-.+..+++-.+
T Consensus 92 nEy~FvlCl~G~~~I~~~~-~~~~is~~I~kGeaf~md~~t~h~i~T~~knl~L~Vi~y~v~ 152 (286)
T PHA02984 92 NEYMFVLCLNGKTSIECFN-KGSKITNTIKKGEAFTLNLKTKYVTTTKDKNLHLAVITYTSN 152 (286)
T ss_pred ccEEEEEEcCCeEEEEEec-CCceeeeEEecCceEEEEccceEEEEeCCCceEEEEEEEEec
Confidence 34444 479999988877 56778899999999999999999986653 444444444333
No 135
>PF04773 FecR: FecR protein; InterPro: IPR006860 FecR is involved in regulation of iron dicitrate transport. In the absence of citrate FecR inactivates FecI. FecR is probably a sensor that recognises iron dicitrate in the periplasm.
Probab=32.91 E-value=1.6e+02 Score=20.08 Aligned_cols=57 Identities=19% Similarity=0.104 Sum_probs=37.8
Q ss_pred ceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCCCCcEEEEEEeec
Q psy840 89 EEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDINNYVKAKRYFIG 147 (178)
Q Consensus 89 dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~~~~~~alrlF~~ 147 (178)
.-...+..|++++.+....... +.++-+...+...|+.-+.....+..-..+.++.+
T Consensus 39 ~~~~~L~~G~~~~~~~~~~~~~--~~V~T~~~~i~v~GT~f~v~v~~~~~~~~v~v~~G 95 (98)
T PF04773_consen 39 PTRLRLLSGEILFDVSPGKKRP--FEVRTPTATIGVRGTRFSVRVDAEDGSTRVAVLEG 95 (98)
T ss_pred ceEEEEcCCCEEEEEcccCCCC--EEEEeCCEEEEEecCEEEEEEECCCCcEEEEEEee
Confidence 3357789999999998644332 67777788888899877556544434445555543
No 136
>PRK11161 fumarate/nitrate reduction transcriptional regulator; Provisional
Probab=32.79 E-value=1.8e+02 Score=23.32 Aligned_cols=35 Identities=17% Similarity=0.035 Sum_probs=25.8
Q ss_pred ceEEEEecceEEEEEEeCCCc-EEEEEEeCCcEEEe
Q psy840 89 EEIRLVLDGSGYFDVRDKFDD-WIRIEVTKGDLIII 123 (178)
Q Consensus 89 dEi~~il~G~g~f~v~~~~d~-~~ri~~~~GDli~v 123 (178)
+.+++|++|.......+.+|+ .+--.+.|||++-.
T Consensus 56 ~~ly~v~~G~v~~~~~~~~G~e~i~~~~~~gd~~g~ 91 (235)
T PRK11161 56 KSLYAIRSGTIKSYTITEQGDEQITGFHLAGDLVGF 91 (235)
T ss_pred ceEEEEeeceEEEEEECCCCCEEEEEeccCCceecc
Confidence 568999999998877766554 34445689999854
No 137
>TIGR03027 pepcterm_export putative polysaccharide export protein, PEP-CTERM sytem-associated. This protein family belongs to the larger set of polysaccharide biosynthesis/export proteins described by Pfam model pfam02563. Members of this family are variable in either containing of lacking a 78-residue insert, but appear to fall within a single clade, nevertheless, where the regions in which the gene is found encode components of the PEP-CTERM/EpsH proposed exosortase protein sorting system.
Probab=32.78 E-value=29 Score=27.33 Aligned_cols=15 Identities=40% Similarity=0.846 Sum_probs=13.6
Q ss_pred EEEeCCcEEEeCCCC
Q psy840 113 IEVTKGDLIIIPAGI 127 (178)
Q Consensus 113 i~~~~GDli~vPaG~ 127 (178)
+.+++||.|+||..+
T Consensus 150 ~~L~~gD~I~Vp~~~ 164 (165)
T TIGR03027 150 VELKPGDVLIIPESW 164 (165)
T ss_pred ceeCCCCEEEEeccc
Confidence 889999999999864
No 138
>cd05793 S1_IF1A S1_IF1A: Translation initiation factor IF1A, also referred to as eIF1A in eukaryotes and aIF1A in archaea, S1-like RNA-binding domain. S1-like RNA-binding domains are found in a wide variety of RNA-associated proteins. IF1A is essential for translation initiation. eIF1A acts synergistically with eIF1 to mediate assembly of ribosomal initiation complexes at the initiation codon and maintain the accuracy of this process by recognizing and destabilizing aberrant preinitiation complexes from the mRNA. Without eIF1A and eIF1, 43S ribosomal preinitiation complexes can bind to the cap-proximal region, but are unable to reach the initiation codon. eIF1a also enhances the formation of 5'-terminal complexes in the presence of other translation initiation factors. This protein family is only found in eukaryotes and archaea.
Probab=32.23 E-value=95 Score=21.84 Aligned_cols=12 Identities=25% Similarity=0.872 Sum_probs=9.0
Q ss_pred EEEEeCCcEEEe
Q psy840 112 RIEVTKGDLIII 123 (178)
Q Consensus 112 ri~~~~GDli~v 123 (178)
++.+.+||.++|
T Consensus 36 ~iwI~~GD~V~V 47 (77)
T cd05793 36 RVWINEGDIVLV 47 (77)
T ss_pred cEEEcCCCEEEE
Confidence 367778888777
No 139
>PRK00924 5-keto-4-deoxyuronate isomerase; Provisional
Probab=31.56 E-value=65 Score=28.31 Aligned_cols=45 Identities=18% Similarity=0.245 Sum_probs=31.9
Q ss_pred ceecCcce-EEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCee-eee
Q psy840 83 EHLHTDEE-IRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYH-RFT 132 (178)
Q Consensus 83 eH~H~~dE-i~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H-~f~ 132 (178)
+..=...| ....|.|.|...+.+ +. ..+.+.|.+.||.|..- .|.
T Consensus 68 ~~fl~rrE~giV~lgG~~~V~vdG---~~--~~l~~~d~LYVp~G~~~v~~a 114 (276)
T PRK00924 68 SYFLERRELGIINIGGAGTVTVDG---ET--YELGHRDALYVGKGAKEVVFA 114 (276)
T ss_pred eeecCCcEEEEEEccceEEEEECC---EE--EecCCCcEEEECCCCcEEEEE
Confidence 33334555 467788999888854 32 56889999999999874 453
No 140
>smart00652 eIF1a eukaryotic translation initiation factor 1A.
Probab=30.37 E-value=1e+02 Score=22.02 Aligned_cols=28 Identities=18% Similarity=0.367 Sum_probs=16.1
Q ss_pred cceEEEEEEeCCCcEE----------EEEEeCCcEEEe
Q psy840 96 DGSGYFDVRDKFDDWI----------RIEVTKGDLIII 123 (178)
Q Consensus 96 ~G~g~f~v~~~~d~~~----------ri~~~~GDli~v 123 (178)
-|.+.|.|...+|..+ +|.+.+||+++|
T Consensus 15 lG~~~~~V~~~dG~~~la~ipgK~Rk~iwI~~GD~VlV 52 (83)
T smart00652 15 LGNGRLEVMCADGKERLARIPGKMRKKVWIRRGDIVLV 52 (83)
T ss_pred cCCCEEEEEECCCCEEEEEEchhhcccEEEcCCCEEEE
Confidence 3455555555444321 366777888777
No 141
>PLN02868 acyl-CoA thioesterase family protein
Probab=30.35 E-value=1.1e+02 Score=27.52 Aligned_cols=35 Identities=6% Similarity=0.045 Sum_probs=26.7
Q ss_pred ceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEe
Q psy840 89 EEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIII 123 (178)
Q Consensus 89 dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~v 123 (178)
+.+++|++|+......+.+++.+-..+++||++-.
T Consensus 50 ~~lyiI~~G~V~v~~~~~~ge~~l~~l~~Gd~fG~ 84 (413)
T PLN02868 50 DGLYFIWKGEAEVSGPAEEESRPEFLLKRYDYFGY 84 (413)
T ss_pred ceEEEEEeCEEEEEEECCCCcEEEEEeCCCCEeeh
Confidence 57899999999887766555544456799998764
No 142
>PF01067 Calpain_III: Calpain large subunit, domain III; InterPro: IPR022682 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of cysteine peptidases belong to the MEROPS peptidase family C2 (calpain family, clan CA). A type example is calpain, which is an intracellular protease involved in many important cellular functions that are regulated by calcium []. The protein is a complex of 2 polypeptide chains (light and heavy), with three known forms in mammals [, ]: a highly calcium-sensitive (i.e., micro-molar range) form known as mu-calpain, mu-CANP or calpain I; a form sensitive to calcium in the milli-molar range, known as m-calpain, m-CANP or calpain II; and a third form, known as p94, which is found in skeletal muscle only []. All forms have identical light but different heavy chains. Both mu- and m-calpain are heterodimers containing an identical 28kDa subunit and an 80kDa subunit that shares 55-65% sequence homology between the two proteases [, ]. The crystallographic structure of m-calpain reveals six "domains" in the 80kDa subunit: A 19-amino acid NH2-terminal sequence; Active site domain IIa; Active site domain IIb. Domain 2 shows low levels of sequence similarity to papain; although the catalytic His has not been located by biochemical means, it is likely that calpain and papain are related []. Domain III; An 18-amino acid extended sequence linking domain III to domain IV; Domain IV, which resembles the penta EF-hand family of polypeptides, binds calcium and regulates activity []. />]. Ca2+-binding causes a rearrangement of the protein backbone, the net effect of which is that a Trp side chain, which acts as a wedge between catalytic domains IIa and IIb in the apo state, moves away from the active site cleft allowing for the proper formation of the catalytic triad []. Calpain-like mRNAs have been identified in other organisms including bacteria, but the molecules encoded by these mRNAs have not been isolated, so little is known about their properties. How calpain activity is regulated in these organisms cells is still unclear In metazoans, the activity of calpain is controlled by a single proteinase inhibitor, calpastatin (IPR001259 from INTERPRO). The calpastatin gene can produce eight or more calpastatin polypeptides ranging from 17 to 85 kDa by use of different promoters and alternative splicing events. The physiological significance of these different calpastatins is unclear, although all bind to three different places on the calpain molecule; binding to at least two of the sites is Ca2+ dependent. The calpains ostensibly participate in a variety of cellular processes including remodelling of cytoskeletal/membrane attachments, different signal transduction pathways, and apoptosis. Deregulated calpain activity following loss of Ca2+ homeostasis results in tissue damage in response to events such as myocardial infarcts, stroke, and brain trauma []. This entry represents domain III. It is found in association with PF00648 from PFAM. The function of the domain III and I are currently unknown. Domain II is a cysteine protease and domain IV is a calcium binding domain. Calpains are believed to participate in intracellular signaling pathways mediated by calcium ions. ; PDB: 1QXP_B 2QFE_A 1DF0_A 1U5I_A 3DF0_A 3BOW_A 1KFU_L 1KFX_L.
Probab=29.71 E-value=75 Score=23.72 Aligned_cols=33 Identities=18% Similarity=0.432 Sum_probs=21.0
Q ss_pred EEEEeCCcEEEeCCCCeeeeecCCCCcEEEEEEeecCC
Q psy840 112 RIEVTKGDLIIIPAGIYHRFTLDINNYVKAKRYFIGEP 149 (178)
Q Consensus 112 ri~~~~GDli~vPaG~~H~f~~~~~~~~~alrlF~~~~ 149 (178)
.+.+.||++++||. .+..+....+ .||+|+..+
T Consensus 111 ~~~L~~G~YvIVPs----T~~~~~~g~F-~L~v~s~~~ 143 (147)
T PF01067_consen 111 EFTLPPGTYVIVPS----TYEPGQEGEF-TLRVFSDSP 143 (147)
T ss_dssp EEEE-SEEEEEEEE----ESSTT--EEE-EEEEEESSS
T ss_pred EEEcCCCCEEEEEe----cCCCCCeeeE-EEEEEECCC
Confidence 36889999999984 3344444445 678888765
No 143
>TIGR00568 alkb DNA alkylation damage repair protein AlkB. Proteins in this family have an as of yet undetermined function in the repair of alkylation damage to DNA. Alignment and family designation based on phylogenomic analysis of Jonathan A. Eisen (PhD Thesis, Stanford University, 1999).
Probab=28.24 E-value=1.3e+02 Score=24.15 Aligned_cols=40 Identities=20% Similarity=0.240 Sum_probs=30.5
Q ss_pred EEEecceEEEEEEeC--CCcEEEEEEeCCcEEEeCCCCeeee
Q psy840 92 RLVLDGSGYFDVRDK--FDDWIRIEVTKGDLIIIPAGIYHRF 131 (178)
Q Consensus 92 ~~il~G~g~f~v~~~--~d~~~ri~~~~GDli~vPaG~~H~f 131 (178)
.+=+--+..|.++.+ ++....+.++.||+++.-...+++|
T Consensus 125 SvSLG~~r~F~~~~~~~~~~~~~l~L~sGsllvM~G~sR~~~ 166 (169)
T TIGR00568 125 SVSLGLPAIFLIGGLKRNDPPKRLRLHSGDVVIMGGESRLAF 166 (169)
T ss_pred EEeCCCCEEEEecCCcCCCceEEEEeCCCCEEEECCchhccc
Confidence 344566788888764 3446889999999999988887765
No 144
>PRK03606 ureidoglycolate hydrolase; Provisional
Probab=28.02 E-value=3.2e+02 Score=21.98 Aligned_cols=57 Identities=12% Similarity=0.039 Sum_probs=41.5
Q ss_pred ceecCc-ceEEEEecceEEEEEEeCC-----CcEEEEEEeCCcEEEeCCCCeeee--ecCCCCcE
Q psy840 83 EHLHTD-EEIRLVLDGSGYFDVRDKF-----DDWIRIEVTKGDLIIIPAGIYHRF--TLDINNYV 139 (178)
Q Consensus 83 eH~H~~-dEi~~il~G~g~f~v~~~~-----d~~~ri~~~~GDli~vPaG~~H~f--~~~~~~~~ 139 (178)
.=.|+. .|.+.-+.|+-++-+--.+ +..-.+.+.+|.-|.+-+||.|.. .++....+
T Consensus 73 mERHp~~sQafiPl~~~~~lvvVA~~~~~~~~~~raF~~~~~qgV~y~~G~WH~pl~~l~~~~dF 137 (162)
T PRK03606 73 LERHPLGSQAFIPLNGRPFLVVVAPDGDGDPGTPRAFVTNGRQGVNYHRGVWHHPLLALGEVSDF 137 (162)
T ss_pred EEeCCCceEEEEECCCCEEEEEEeCCCCCCccceEEEEecCCcEEEeCCCcccccccccCCCceE
Confidence 335664 7889999999988776532 235568889999999999999974 44444444
No 145
>KOG1029|consensus
Probab=27.66 E-value=31 Score=34.87 Aligned_cols=28 Identities=21% Similarity=0.401 Sum_probs=18.0
Q ss_pred ecceEEEEEEeCCCcEEEEEEeCCcEEEeC
Q psy840 95 LDGSGYFDVRDKFDDWIRIEVTKGDLIIIP 124 (178)
Q Consensus 95 l~G~g~f~v~~~~d~~~ri~~~~GDli~vP 124 (178)
+.=++.|-++-..-. .|.++|||+|+|+
T Consensus 694 vkyrAly~FeaRs~d--Eisf~pGDII~V~ 721 (1118)
T KOG1029|consen 694 VKYRALYPFEARSHD--EISFEPGDIIIVF 721 (1118)
T ss_pred EEEeeecccccCCcc--cccccCCCEEEEe
Confidence 333555555432211 3999999999998
No 146
>PF10983 DUF2793: Protein of unknown function (DUF2793); InterPro: IPR021251 This entry is represented by Bacteriophage D3112, Orf54. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches.
Probab=27.09 E-value=1.3e+02 Score=21.88 Aligned_cols=41 Identities=24% Similarity=0.458 Sum_probs=26.3
Q ss_pred eCCcEEEeCCCCeeeeecCCCCcEEE-----EEEeecCCceeecCCC
Q psy840 116 TKGDLIIIPAGIYHRFTLDINNYVKA-----KRYFIGEPVWTPHNRP 157 (178)
Q Consensus 116 ~~GDli~vPaG~~H~f~~~~~~~~~a-----lrlF~~~~gW~~~~r~ 157 (178)
..||..+||+|-.=-+ .+-+..+.+ -+|+...+||.++...
T Consensus 29 ~~Gd~yiv~~~atGaW-aG~~g~iA~~~~g~W~f~~P~~GW~a~v~~ 74 (87)
T PF10983_consen 29 AEGDRYIVPAGATGAW-AGQDGKIAAWQDGAWRFLTPRPGWRAWVAD 74 (87)
T ss_pred CCCCEEEECCCCCccc-ccCCCCEEEEECCeEEEeCCCCCcEEEEeC
Confidence 4588888888842222 122244443 6889999999987664
No 147
>PF04962 KduI: KduI/IolB family; InterPro: IPR021120 The KduI/IolB family of enzymes includes 5-keto 4-deoxyuronate isomerase (KduI) and 5-deoxy-glucuronate isomerase (IolB). KduI is involved in pectin degradation by free-living soil bacteria that use pectin as a carbon source, breaking it down to 2-keto-3-deoxygluconate, which can ultimately be converted to pyruvate. KduI catalyses the fourth step in pectin degradation, namely the interconversion of 5-keto-4-deoxyuronate and 2,5-diketo-3-dexoygluconate []. KduI has a TIM-barrel fold []. IolB is one of several bacterial proteins encoded by the inositol operon (iolABCDEFGHIJ) in Bacillus subtilis that are involved in myo-inositol catabolism. The enzyme is responsible for isomerization of 5-deoxy-D-glucuronic acid by IolB to produce 2-deoxy-5-keto-D-gluconic acid []. IolBs possess a cupin-like structure.; GO: 0016861 intramolecular oxidoreductase activity, interconverting aldoses and ketoses, 0008152 metabolic process; PDB: 1YWK_B 2QJV_B 1X8M_A 1XRU_A.
Probab=27.05 E-value=1.9e+02 Score=24.89 Aligned_cols=48 Identities=19% Similarity=0.243 Sum_probs=30.8
Q ss_pred ecCcceEEEEecceEEEEEEeCCCcEEEEEEeCC--------cEEEeCCCCeeeeecCCC
Q psy840 85 LHTDEEIRLVLDGSGYFDVRDKFDDWIRIEVTKG--------DLIIIPAGIYHRFTLDIN 136 (178)
Q Consensus 85 ~H~~dEi~~il~G~g~f~v~~~~d~~~ri~~~~G--------Dli~vPaG~~H~f~~~~~ 136 (178)
+-..|-...+|+|.+.+.+.+. .+ ..+... |.+.||.|+.=..++.++
T Consensus 44 ~~~~E~~vv~l~G~~~v~~~g~--~~--~~l~~R~~vF~~~~d~lYvp~g~~~~i~a~~~ 99 (261)
T PF04962_consen 44 LERRELGVVNLGGKATVTVDGE--EF--YELGGRESVFDGPPDALYVPRGTKVVIFASTD 99 (261)
T ss_dssp CCSEEEEEEEESSSEEEEETTE--EE--EEE-TTSSGGGS--EEEEE-TT--EEEEESST
T ss_pred CCCcEEEEEEeCCEEEEEeCCc--eE--EEecccccccCCCCcEEEeCCCCeEEEEEcCC
Confidence 4444667888999999998431 33 455555 999999999977666544
No 148
>COG0853 PanD Aspartate 1-decarboxylase [Coenzyme metabolism]
Probab=27.02 E-value=24 Score=27.53 Aligned_cols=58 Identities=26% Similarity=0.268 Sum_probs=38.2
Q ss_pred hHHHHHHHHhcCCCeeeEEEEcCCCCCCHHHHhhhhccceecCcceEEEEe---cceEEEEEEeCCCcEEEEEEeCCcEE
Q psy840 45 DEVLTKLRKDRNYNYEDEITCSKDCLPNYEEKLKSFYTEHLHTDEEIRLVL---DGSGYFDVRDKFDDWIRIEVTKGDLI 121 (178)
Q Consensus 45 ~~~l~~l~~~~gy~~~Dvv~l~~~~~p~~e~~~~~f~~eH~H~~dEi~~il---~G~g~f~v~~~~d~~~ri~~~~GDli 121 (178)
|+.+=..+.=.-|-..|+++++.+ ++|- .|++ .|||...+.+.. ...|++||++
T Consensus 28 D~dlldaagile~EkV~I~N~nNG---------aRf~----------TYvI~g~rGSg~I~lNGAA----Arl~~~GD~V 84 (126)
T COG0853 28 DEDLLDAAGILENEKVDIVNVNNG---------ARFS----------TYVIAGERGSGVICLNGAA----ARLVQVGDLV 84 (126)
T ss_pred CHHHHhhcCCCCCceEEEEECCCC---------cEEE----------EEEEEccCCCcEEEechHH----HhhCCCCCEE
Confidence 444444555555667788887765 2332 4555 468888887754 3689999999
Q ss_pred EeCC
Q psy840 122 IIPA 125 (178)
Q Consensus 122 ~vPa 125 (178)
+|-+
T Consensus 85 II~s 88 (126)
T COG0853 85 IIMS 88 (126)
T ss_pred EEEE
Confidence 8854
No 149
>TIGR00156 conserved hypothetical protein TIGR00156. As of the last revision, this family consists only of two proteins from Escherichia coli and one from the related species Haemophilus influenzae.
Probab=26.52 E-value=47 Score=25.87 Aligned_cols=45 Identities=20% Similarity=0.245 Sum_probs=33.2
Q ss_pred cceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeee
Q psy840 88 DEEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFT 132 (178)
Q Consensus 88 ~dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~ 132 (178)
.+|-+.+-+++|...|+-.++.|--..+.|+|.|.|=.-+-|.+.
T Consensus 71 ~~d~Y~F~D~TG~I~VeId~~~w~G~~v~p~d~V~I~GeVDk~~~ 115 (126)
T TIGR00156 71 GDDRYVFRDKSGEINVVIPAAVWNGREVQPKDMVNISGSLDKKSA 115 (126)
T ss_pred CCceEEEECCCCCEEEEECHHHcCCCcCCCCCEEEEEEEECCCCC
Confidence 467788888888888877556677778888888888666666543
No 150
>PF01238 PMI_typeI: Phosphomannose isomerase type I; InterPro: IPR001250 Mannose-6-phosphate isomerase or phosphomannose isomerase (5.3.1.8 from EC) (PMI) is the enzyme that catalyses the interconversion of mannose-6-phosphate and fructose-6-phosphate. In eukaryotes PMI is involved in the synthesis of GDP-mannose, a constituent of N- and O-linked glycans and GPI anchors and in prokaryotes it participates in a variety of pathways, including capsular polysaccharide biosynthesis and D-mannose metabolism. PMI's belong to the cupin superfamily whose functions range from isomerase and epimerase activities involved in the modification of cell wall carbohydrates in bacteria and plants, to non-enzymatic storage proteins in plant seeds, and transcription factors linked to congenital baldness in mammals []. Three classes of PMI have been defined []. Type I includes eukaryotic PMI and the enzyme encoded by the manA gene in enterobacteria. PMI has a bound zinc ion, which is essential for activity. A crystal structure of PMI from Candida albicans shows that the enzyme has three distinct domains []. The active site lies in the central domain, contains a single essential zinc atom, and forms a deep, open cavity of suitable dimensions to contain M6P or F6P The central domain is flanked by a helical domain on one side and a jelly-roll like domain on the other.; GO: 0004476 mannose-6-phosphate isomerase activity, 0008270 zinc ion binding, 0005975 carbohydrate metabolic process; PDB: 1PMI_A 1QWR_B 1ZX5_A 3H1Y_A 2WFP_A 3H1M_A 3H1W_A.
Probab=26.36 E-value=36 Score=30.66 Aligned_cols=21 Identities=33% Similarity=0.662 Sum_probs=17.1
Q ss_pred EEEeCCcEEEeCCCCeeeeec
Q psy840 113 IEVTKGDLIIIPAGIYHRFTL 133 (178)
Q Consensus 113 i~~~~GDli~vPaG~~H~f~~ 133 (178)
+.++||+-|++|||+.|-.--
T Consensus 252 v~L~pGeaifl~a~~~HAYl~ 272 (373)
T PF01238_consen 252 VELQPGEAIFLPAGEPHAYLS 272 (373)
T ss_dssp EEE-TT-EEEEHTTHHEEEEE
T ss_pred EEecCCceEEecCCCcccccc
Confidence 899999999999999998644
No 151
>PLN02156 gibberellin 2-beta-dioxygenase
Probab=25.68 E-value=2.6e+02 Score=24.84 Aligned_cols=67 Identities=16% Similarity=0.305 Sum_probs=40.1
Q ss_pred ccceecCcceEEEEe--cceEEEEEEeCCCcEEEEEEeCCcEEEeCC------------CCeeeeecCCCCcEEEEEEee
Q psy840 81 YTEHLHTDEEIRLVL--DGSGYFDVRDKFDDWIRIEVTKGDLIIIPA------------GIYHRFTLDINNYVKAKRYFI 146 (178)
Q Consensus 81 ~~eH~H~~dEi~~il--~G~g~f~v~~~~d~~~ri~~~~GDli~vPa------------G~~H~f~~~~~~~~~alrlF~ 146 (178)
+..-.|.|-...-+| ++.+=..|..++|+|+.|...+|-+|+-=. -+.||........=..|-+|-
T Consensus 197 ~g~~~HTD~g~lTlL~Qd~v~GLQV~~~~g~Wi~Vpp~pga~VVNiGD~l~~wTNg~~kSt~HRVv~~~~~~R~SiafF~ 276 (335)
T PLN02156 197 IGFGEHTDPQLISLLRSNDTAGLQICVKDGTWVDVPPDHSSFFVLVGDTLQVMTNGRFKSVKHRVVTNTKRSRISMIYFA 276 (335)
T ss_pred cCCCCccCCCceEEEEeCCCCceEEEeCCCCEEEccCCCCcEEEEhHHHHHHHhCCeeeccceeeecCCCCCEEEEEEee
Confidence 345578885554433 354446676567899999888877665422 278998755433223444454
Q ss_pred c
Q psy840 147 G 147 (178)
Q Consensus 147 ~ 147 (178)
.
T Consensus 277 ~ 277 (335)
T PLN02156 277 G 277 (335)
T ss_pred c
Confidence 4
No 152
>PF13464 DUF4115: Domain of unknown function (DUF4115)
Probab=25.50 E-value=2.2e+02 Score=19.29 Aligned_cols=44 Identities=11% Similarity=0.145 Sum_probs=33.4
Q ss_pred EecceEEEEEEeCCCc-EEEEEEeCCcEEEeCCCCeeeeecCCCC
Q psy840 94 VLDGSGYFDVRDKFDD-WIRIEVTKGDLIIIPAGIYHRFTLDINN 137 (178)
Q Consensus 94 il~G~g~f~v~~~~d~-~~ri~~~~GDli~vPaG~~H~f~~~~~~ 137 (178)
-..|.+.+.|.+.+|+ .+.-.+++||-+.++.+-+=.+.++...
T Consensus 4 ~a~~~sWv~V~d~dG~~~~~~~l~~G~~~~~~~~~~~~i~iGna~ 48 (77)
T PF13464_consen 4 TATGDSWVEVTDADGKVLFSGTLKAGETKTFEGKEPFRIRIGNAG 48 (77)
T ss_pred EEeCCeEEEEEeCCCcEeeeeeeCCCcEEEEeCCCCEEEEEeCCC
Confidence 3458888888876774 5777889999988887777777777654
No 153
>PHA00672 hypothetical protein
Probab=24.75 E-value=2.1e+02 Score=22.72 Aligned_cols=55 Identities=16% Similarity=0.190 Sum_probs=42.0
Q ss_pred hccceecCcceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeeeecCCCCcEE
Q psy840 80 FYTEHLHTDEEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRFTLDINNYVK 140 (178)
Q Consensus 80 f~~eH~H~~dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f~~~~~~~~~ 140 (178)
.+.--+|..+-+ .|.+|.....+.+ .. +.+..=-.|.-|||+.--|.+.++....
T Consensus 59 ~LtG~~hkf~~~-ii~sG~itV~tdg---e~--~rl~g~~~i~~~aG~KragyAHeDT~wt 113 (152)
T PHA00672 59 ALTGALIKVSTV-LIFSGHATVFIGG---EA--VELRGYHVIPASAGRKQAFVAHADTDLT 113 (152)
T ss_pred eeeeeeeEeeEE-EEecccEEEEeCC---cE--EEEecceeeecCCCcccceeeeccceEE
Confidence 355667777777 8999999888853 32 5677777889999999999988885543
No 154
>PRK15401 alpha-ketoglutarate-dependent dioxygenase AlkB; Provisional
Probab=24.72 E-value=1.5e+02 Score=24.93 Aligned_cols=41 Identities=20% Similarity=0.290 Sum_probs=31.3
Q ss_pred EEEecceEEEEEEe--CCCcEEEEEEeCCcEEEeCCCCeeeee
Q psy840 92 RLVLDGSGYFDVRD--KFDDWIRIEVTKGDLIIIPAGIYHRFT 132 (178)
Q Consensus 92 ~~il~G~g~f~v~~--~~d~~~ri~~~~GDli~vPaG~~H~f~ 132 (178)
.+=+-.+..|.++. ..++...+.++.||+|+.-...++|+-
T Consensus 146 SvSLG~~~~F~~~~~~~~~~~~~l~L~~Gdllvm~G~sr~~~H 188 (213)
T PRK15401 146 SVSLGLPAVFQFGGLKRSDPLQRILLEHGDVVVWGGPSRLRYH 188 (213)
T ss_pred EEeCCCCeEEEecccCCCCceEEEEeCCCCEEEECchHhheec
Confidence 44456778888864 345678999999999999777777763
No 155
>PF02080 TrkA_C: TrkA-C domain; InterPro: IPR006037 The regulator of K+ conductance (RCK) domain is found in many ligand-gated K+ channels, most often attached to the intracellular carboxy terminus. The domain is prevalent among prokaryotic K+ channels, and also found in eukaryotic, high-conductance Ca2+-activated K+ channels (BK channels) [, , ]. Largely involved in redox-linked regulation of potassium channels, the N-terminal part of the RCK domain is predicted to be an active dehydrogenase at least in some cases []. Some have a conserved sequence motif (G-x-G-x-x-G-x(n)-[DE]) for NAD+ binding [], but others do not, reflecting the diversity of ligands for RCK domains. The C-terminal part is less conserved, being absent in some channels, such as the kefC antiporter from Escherichia coli. It is predicted to bind unidentified ligands and to regulate sulphate, sodium and other transporters. The X-ray structure of several RCK domains has been solved [, , ]. It reveals an alpha-beta fold similar to dehydrogenase enzymes. The domain forms a homodimer, producing a cleft between two lobes. It has a composite structure, with an N-terminal (RCK-N), and a C-terminal (RCK-C) subdomain. The RCK-N subdomain forms a Rossmann fold with two alpha helices on one side of a six stranded parallel beta sheet and three alpha helices on the other side. The RCK-C subdomain is an all-beta-strand fold. It forms an extention of the dimer interface and further stabilises the RCK homodimer [, , ]. Ca2+ is a ligand that opens the channel in a concentration-dependent manner. Two Ca2+ ions are located at the base of a cleft between two RCK domains, coordinated by the carboxylate groups of two glutamate residues, and by an aspartate residue [, , ]. RCK domains occur in at least five different contexts: As a single domain on the C terminus of some K+ channels (for example, many prokaryotic K+ channels). As two tandem RCK domains on the C terminus of some transporters that form gating rings (for example, eukaryotic BK channels). The gating ring has an arrangement of eight identical RCK domains, one from each of the four pore-forming subunits and four from the intracellular solution. As two domains, one at the N terminus and another at the C terminus of transporter (for example, the prokaryotic trk system potassium uptake protein A). As a soluble protein (not part of a K+ channel) consisting of two tandem RCK domains. As a soluble protein consisting of a single RCK domain. This entry represents the C-terminal subdomain of RCK.; GO: 0008324 cation transmembrane transporter activity, 0006813 potassium ion transport; PDB: 2BKP_A 1VCT_A 2BKO_A 2BKN_A 3L4B_C 2FY8_D 2AEF_A 1LNQ_E 3RBX_C 3KXD_A ....
Probab=24.52 E-value=1e+02 Score=19.96 Aligned_cols=30 Identities=23% Similarity=0.357 Sum_probs=18.2
Q ss_pred cceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEe
Q psy840 88 DEEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIII 123 (178)
Q Consensus 88 ~dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~v 123 (178)
.--|..|..| +....-..+ ..+++||.|+|
T Consensus 27 ~~~i~~i~R~-~~~~~p~~~-----~~l~~gD~l~v 56 (71)
T PF02080_consen 27 GVRIVAIKRG-GEIIIPDGD-----TVLQAGDILIV 56 (71)
T ss_dssp TEEEEEEEET-EEEES--TT------BE-TTEEEEE
T ss_pred CEEEEEEEEC-CEEECCCCC-----CEECCCCEEEE
Confidence 5567777888 444444333 78999999887
No 156
>cd00248 Mth938-like Mth938-like domain. The members of this family include: Mth938, 2P1, Xcr35, Rpa2829, and several uncharacterized sequences. Mth938 is a hypothetical protein encoded by the Methanobacterium thermoautotrophicum (Mth) genome. This protein crystallizes as a dimer, although it is monomeric in solution, with one disulfide bond in each monomer. 2P1 is a partially characterized nuclear protein which is homologous to E3-3 from rat and known to be alternately spliced. Xcr35 and Rpa2829 are hypothetical proteins of unknown function from the Xanthomonas campestris and Rhodopseudomonas palustris genomes, respectively, for which the crystal structures have been determined.
Probab=24.07 E-value=85 Score=23.25 Aligned_cols=27 Identities=26% Similarity=0.575 Sum_probs=19.3
Q ss_pred ceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeee
Q psy840 97 GSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRF 131 (178)
Q Consensus 97 G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f 131 (178)
|.|.|.+. |..+ .|++++.|.++..|-
T Consensus 6 ~~g~~~i~---g~~y-----~~~viv~p~~~~~w~ 32 (109)
T cd00248 6 GPGGFRIA---GQVY-----RGPLLVLPDGVVPWD 32 (109)
T ss_pred cCCEEEEC---CEEE-----eeCEEEeCCceeecC
Confidence 45566664 3333 499999999999883
No 157
>PF13532 2OG-FeII_Oxy_2: 2OG-Fe(II) oxygenase superfamily; PDB: 2IUW_A 3BTZ_A 3RZL_A 3RZH_A 3S5A_A 3RZG_A 3RZJ_A 3BUC_A 3H8X_A 3H8R_A ....
Probab=23.77 E-value=1.8e+02 Score=22.63 Aligned_cols=36 Identities=28% Similarity=0.443 Sum_probs=25.5
Q ss_pred EEecceEEEEEEeC--CCcEEEEEEeCCcEEEeCCCCe
Q psy840 93 LVLDGSGYFDVRDK--FDDWIRIEVTKGDLIIIPAGIY 128 (178)
Q Consensus 93 ~il~G~g~f~v~~~--~d~~~ri~~~~GDli~vPaG~~ 128 (178)
+=+-++..|.++.+ .+..+.+.+.+||+++.-...+
T Consensus 128 lSLG~~~~~~f~~~~~~~~~~~~~L~~gsl~vm~g~~r 165 (194)
T PF13532_consen 128 LSLGSSRVFRFRNKSDDDEPIEVPLPPGSLLVMSGEAR 165 (194)
T ss_dssp EEEES-EEEEEEECGGTS-EEEEEE-TTEEEEEETTHH
T ss_pred EEEccCceEEEeeccCCCccEEEEcCCCCEEEeChHHh
Confidence 34556788888875 3578999999999999887654
No 158
>PTZ00273 oxidase reductase; Provisional
Probab=23.37 E-value=1.8e+02 Score=25.33 Aligned_cols=64 Identities=22% Similarity=0.285 Sum_probs=39.8
Q ss_pred cceecCcceEEEEe--cceEEEEEEeCCCcEEEEEEeCCcEEEeCCC-------------CeeeeecCCCCcEEEEEEee
Q psy840 82 TEHLHTDEEIRLVL--DGSGYFDVRDKFDDWIRIEVTKGDLIIIPAG-------------IYHRFTLDINNYVKAKRYFI 146 (178)
Q Consensus 82 ~eH~H~~dEi~~il--~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG-------------~~H~f~~~~~~~~~alrlF~ 146 (178)
....|.|--..-+| ++.+=..|...+|+|+.|...+|-+|+- -| +.||....+...+-..-|+.
T Consensus 196 g~~~HTD~g~lTlL~qd~~~GLqV~~~~g~Wi~V~p~pg~lvVN-vGD~l~~~TnG~~kSt~HRVv~~~~~R~Si~~F~~ 274 (320)
T PTZ00273 196 VCGEHTDYGIITLLYQDSVGGLQVRNLSGEWMDVPPLEGSFVVN-IGDMMEMWSNGRYRSTPHRVVNTGVERYSMPFFCE 274 (320)
T ss_pred ccccccCCCeEEEEecCCCCceEEECCCCCEEeCCCCCCeEEEE-HHHHHHHHHCCeeeCCCccccCCCCCeEEEEEEEc
Confidence 34567775444333 4444466766678999999998876654 44 88998755444444444443
No 159
>PLN02904 oxidoreductase
Probab=23.20 E-value=2.5e+02 Score=25.08 Aligned_cols=67 Identities=18% Similarity=0.275 Sum_probs=39.8
Q ss_pred ccceecCcceEE-EEe-cceEEEEEEeCCCcEEEEEEeCCcEEEeCC------------CCeeeeecCCCCcEEEEEEee
Q psy840 81 YTEHLHTDEEIR-LVL-DGSGYFDVRDKFDDWIRIEVTKGDLIIIPA------------GIYHRFTLDINNYVKAKRYFI 146 (178)
Q Consensus 81 ~~eH~H~~dEi~-~il-~G~g~f~v~~~~d~~~ri~~~~GDli~vPa------------G~~H~f~~~~~~~~~alrlF~ 146 (178)
+....|.|--.. ++. ++.| ..|...+|+|+.|...+|-+|+-=. -+.||....++..=..|-+|-
T Consensus 225 ~g~~~HtD~g~lTlL~qd~~G-LQV~~~~g~Wi~V~p~pgalVVNiGD~Le~~TNG~~kSt~HRVv~~~~~~R~Si~~F~ 303 (357)
T PLN02904 225 LGMPPHSDFGSLTILLQSSQG-LQIMDCNKNWVCVPYIEGALIVQLGDQVEVMSNGIYKSVVHRVTVNKDYKRLSFASLH 303 (357)
T ss_pred cCCcCccCCCceEEEecCCCe-eeEEeCCCCEEECCCCCCeEEEEccHHHHHHhCCeeeccCCcccCCCCCCEEEEEEee
Confidence 456677774333 332 4433 5676667899999888887765432 278998654333233555554
Q ss_pred cC
Q psy840 147 GE 148 (178)
Q Consensus 147 ~~ 148 (178)
.+
T Consensus 304 ~p 305 (357)
T PLN02904 304 SL 305 (357)
T ss_pred cC
Confidence 43
No 160
>PF02261 Asp_decarbox: Aspartate decarboxylase; InterPro: IPR003190 Decarboxylation of aspartate is the major route of alanine production in bacteria, and is catalysed by the enzyme aspartate decarboxylase. The enzyme is translated as an inactive proenzyme of two chains, A and B. This family contains both chains of aspartate decarboxylase.; GO: 0004068 aspartate 1-decarboxylase activity, 0006523 alanine biosynthetic process; PDB: 1PYU_C 1AW8_A 1PYQ_B 3TM7_C 1PT1_A 1PQH_A 1PPY_B 1PT0_B 1PQF_A 1PQE_A ....
Probab=22.63 E-value=18 Score=27.90 Aligned_cols=57 Identities=25% Similarity=0.260 Sum_probs=30.9
Q ss_pred hHHHHHHHHhcCCCeeeEEEEcCCCCCCHHHHhhhhccceecCcceEEEEecc---eEEEEEEeCCCcEEEEEEeCCcEE
Q psy840 45 DEVLTKLRKDRNYNYEDEITCSKDCLPNYEEKLKSFYTEHLHTDEEIRLVLDG---SGYFDVRDKFDDWIRIEVTKGDLI 121 (178)
Q Consensus 45 ~~~l~~l~~~~gy~~~Dvv~l~~~~~p~~e~~~~~f~~eH~H~~dEi~~il~G---~g~f~v~~~~d~~~ri~~~~GDli 121 (178)
|++|=+.+.=.-|...+|++++.+ ++|- .|++.| ||...+.+.. ...+++||.|
T Consensus 29 D~~Ll~aagi~p~E~V~V~Nv~nG---------~Rf~----------TYvI~g~~GSg~I~lNGaA----Arl~~~GD~v 85 (116)
T PF02261_consen 29 DEDLLDAAGILPYEQVQVVNVNNG---------ERFE----------TYVIPGERGSGVICLNGAA----ARLVQVGDRV 85 (116)
T ss_dssp EHHHHHHCT--TTBEEEEEETTT-----------EEE----------EEEEEESTTTT-EEEEGGG----GGCS-TT-EE
T ss_pred CHHHHHHcCCCcCCEEEEEECCCC---------cEEE----------EEEEEccCCCcEEEECCHH----HhccCCCCEE
Confidence 344444444445666677776654 1221 466654 5889998855 2589999999
Q ss_pred EeC
Q psy840 122 IIP 124 (178)
Q Consensus 122 ~vP 124 (178)
+|=
T Consensus 86 II~ 88 (116)
T PF02261_consen 86 IIM 88 (116)
T ss_dssp EEE
T ss_pred EEE
Confidence 884
No 161
>TIGR00192 urease_beta urease, beta subunit. In a number of species, including B.subtilis, Synechocystis, and Haemophilus influenzae, urease subunits beta and gamma are encoded as separate polypeptides. In Helicobacter pylori UreA and in the fission yeast Schizosaccharomyces pombe, beta subunit-like sequence follows gamma subunit-like sequence in a single chain; the fission yeast protein contains additional C-terminal regions.
Probab=22.29 E-value=80 Score=23.81 Aligned_cols=30 Identities=23% Similarity=0.234 Sum_probs=23.5
Q ss_pred EeCCcEEEeCCCCeeeeecCCCCcEEEEEE
Q psy840 115 VTKGDLIIIPAGIYHRFTLDINNYVKAKRY 144 (178)
Q Consensus 115 ~~~GDli~vPaG~~H~f~~~~~~~~~alrl 144 (178)
..-|=-+-|||||.-||..++...+..+.|
T Consensus 55 ~A~G~RLdIpaGTavRFEPG~~k~V~LV~~ 84 (101)
T TIGR00192 55 LAFGMRLDIPSGTAVRFEPGEEKSVELVAI 84 (101)
T ss_pred hhcCcccccCCCCeEeECCCCeEEEEEEEc
Confidence 345888999999999999999866655533
No 162
>KOG0498|consensus
Probab=22.18 E-value=1.2e+02 Score=30.18 Aligned_cols=35 Identities=14% Similarity=0.069 Sum_probs=27.9
Q ss_pred cceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEE
Q psy840 88 DEEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLII 122 (178)
Q Consensus 88 ~dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~ 122 (178)
-++++||+.|+....-.+.++..+...+++||++=
T Consensus 460 v~~myFI~rG~le~~~~~~g~~~~~~~L~~Gd~~G 494 (727)
T KOG0498|consen 460 VTDMYFIVRGSLESITTDGGGFFVVAILGPGDFFG 494 (727)
T ss_pred cceeEEEEeeeEEEEEccCCceEEEEEecCCCccc
Confidence 47899999999987776644556778999999864
No 163
>TIGR00523 eIF-1A eukaryotic/archaeal initiation factor 1A. Recommended nomenclature: eIF-1A for eukaryotes, aIF-1A for Archaea. Also called eIF-4C
Probab=22.09 E-value=1.7e+02 Score=21.71 Aligned_cols=28 Identities=32% Similarity=0.408 Sum_probs=16.4
Q ss_pred cceEEEEEEeCCCcE----------EEEEEeCCcEEEe
Q psy840 96 DGSGYFDVRDKFDDW----------IRIEVTKGDLIII 123 (178)
Q Consensus 96 ~G~g~f~v~~~~d~~----------~ri~~~~GDli~v 123 (178)
-|.+.|.|...++.. -+|.+.+||+++|
T Consensus 29 lG~~~~~V~~~dG~~~la~i~GK~Rk~iwI~~GD~VlV 66 (99)
T TIGR00523 29 LGAGRVKVRCLDGKTRLGRIPGKLKKRIWIREGDVVIV 66 (99)
T ss_pred cCCCEEEEEeCCCCEEEEEEchhhcccEEecCCCEEEE
Confidence 355566665444332 1466778888777
No 164
>PRK13201 ureB urease subunit beta; Reviewed
Probab=21.93 E-value=78 Score=25.05 Aligned_cols=29 Identities=31% Similarity=0.381 Sum_probs=23.2
Q ss_pred eCCcEEEeCCCCeeeeecCCCCcEEEEEE
Q psy840 116 TKGDLIIIPAGIYHRFTLDINNYVKAKRY 144 (178)
Q Consensus 116 ~~GDli~vPaG~~H~f~~~~~~~~~alrl 144 (178)
.-|=-|-|||||.-||..++...+..+.|
T Consensus 56 A~G~RLdIPAGTAVRFEPG~~k~V~LV~i 84 (136)
T PRK13201 56 AYGKHLDIPAGAAVRFEPGDKKEVQLVEY 84 (136)
T ss_pred hcCcccccCCCCeEeECCCCeEEEEEEEc
Confidence 44788999999999999998876665533
No 165
>PF04076 BOF: Bacterial OB fold (BOF) protein; InterPro: IPR005220 Proteins in this entry have an OB-fold fold (oligonucleotide/oligosaccharide binding motif). Analysis of the predicted nucleotide-binding site of the OB-fold suggests that they lack nucleic acid-binding properties. They contain an predicted N-terminal signal peptide which indicates that they localise to the periplasm where they may function to bind proteins, small molecules, or other typical OB-fold ligands. As hypothesised for the distantly related OB-fold containing bacterial enterotoxins, the loss of nucleotide-binding function and the rapid evolution of the OB-fold ligand-binding site may be associated with the presence of members in mobile genetic elements and their potential role in bacterial pathogenicity [].; PDB: 1NNX_A.
Probab=21.90 E-value=45 Score=24.91 Aligned_cols=44 Identities=23% Similarity=0.315 Sum_probs=26.3
Q ss_pred cceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeCCCCeeee
Q psy840 88 DEEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIPAGIYHRF 131 (178)
Q Consensus 88 ~dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vPaG~~H~f 131 (178)
.+|-+.+-+++|...|+-.++.|--..+.|+|.|.|=.-+-+.+
T Consensus 48 ~~d~Y~F~D~TG~I~VeId~~~w~g~~vt~~~~Vri~GeVDk~~ 91 (103)
T PF04076_consen 48 GDDKYLFRDATGEIEVEIDDDVWRGQTVTPDDKVRISGEVDKDW 91 (103)
T ss_dssp ETTEEEEEETTEEEEEE--GGGSTT----TTSEEEEEEEEEEET
T ss_pred cCCEEEEECCCCcEEEEEChhhcCCcccCCCCEEEEEEEEeCCC
Confidence 56677777888877777655556667778888877766565554
No 166
>PF08903 DUF1846: Domain of unknown function (DUF1846); InterPro: IPR014999 This group of proteins are functionally uncharacterised. ; PDB: 3BH1_B.
Probab=21.65 E-value=85 Score=29.64 Aligned_cols=56 Identities=16% Similarity=0.283 Sum_probs=37.8
Q ss_pred hhhhhhcCeeEEee-----ChHHHHHHHHhcCCCee-------eEEEEcCCCCCC---HHHHhhhhccceecC
Q psy840 30 QTLNKLTGVEYFNV-----QDEVLTKLRKDRNYNYE-------DEITCSKDCLPN---YEEKLKSFYTEHLHT 87 (178)
Q Consensus 30 ~~L~~~lGv~~~~~-----~~~~l~~l~~~~gy~~~-------Dvv~l~~~~~p~---~e~~~~~f~~eH~H~ 87 (178)
..| +++||..+.= +...++.|..+.||..- .+|.+.... |+ ..+=+.+.|.+|.+-
T Consensus 126 ~kL-e~~gIkvy~Hy~i~GYP~dv~~IvSdeGyGkN~YIeTtrpLVVVTAPG-PGSGKlATCLSQlYHe~krG 196 (491)
T PF08903_consen 126 NKL-ERLGIKVYRHYPIEGYPTDVDLIVSDEGYGKNDYIETTRPLVVVTAPG-PGSGKLATCLSQLYHEHKRG 196 (491)
T ss_dssp HHH-HTTT-EEEEEE--TTTTT-HHHHSSTTTGGGS-----SSSEEEEE-SS-TTS-HHHHHHHHHHHHHCTT
T ss_pred HHH-HHCCCcEEEecCcCCCCCCccceeCcccCCCCCceeccCCeEEEeCCC-CCccHHHHHHHHHHHHHhcC
Confidence 567 8889987443 77889999999999754 455554432 54 566667788888765
No 167
>PRK10053 hypothetical protein; Provisional
Probab=21.50 E-value=57 Score=25.53 Aligned_cols=37 Identities=22% Similarity=0.145 Sum_probs=23.9
Q ss_pred cceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEeC
Q psy840 88 DEEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIIIP 124 (178)
Q Consensus 88 ~dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~vP 124 (178)
.+|-+.|-+|+|...|+-.++.|--..+.|.|.|.|=
T Consensus 75 g~d~Y~F~D~tG~I~VeID~~~w~G~~v~p~~kV~I~ 111 (130)
T PRK10053 75 GDDRYVFRDKSGEINVIIPAAVFDGREVQPDQMININ 111 (130)
T ss_pred CCceEEEECCCCcEEEEeCHHHcCCCcCCCCCEEEEE
Confidence 3566667777777777664455666666777776653
No 168
>PF04967 HTH_10: HTH DNA binding domain; InterPro: IPR007050 Numerous bacterial transcription regulatory proteins bind DNA via a helix-turn-helix (HTH) motif. This entry represents the HTH DNA binding domain found in Halobacterium salinarium (Halobacterium halobium) and described as a putative bacterio-opsin activator.
Probab=21.46 E-value=66 Score=21.21 Aligned_cols=19 Identities=16% Similarity=0.233 Sum_probs=14.9
Q ss_pred CCCCcCCHhhhhhhcCeeE
Q psy840 22 TPKEYLDLQTLNKLTGVEY 40 (178)
Q Consensus 22 ~p~~~v~~~~L~~~lGv~~ 40 (178)
+.++.+|+++|.+.|||--
T Consensus 19 d~PR~~tl~elA~~lgis~ 37 (53)
T PF04967_consen 19 DVPRRITLEELAEELGISK 37 (53)
T ss_pred CCCCcCCHHHHHHHhCCCH
Confidence 4567899999977888843
No 169
>COG3145 AlkB Alkylated DNA repair protein [DNA replication, recombination, and repair]
Probab=21.34 E-value=2e+02 Score=23.99 Aligned_cols=40 Identities=18% Similarity=0.358 Sum_probs=30.6
Q ss_pred EEecceEEEEEEeCC--CcEEEEEEeCCcEEEeCCCCeeeee
Q psy840 93 LVLDGSGYFDVRDKF--DDWIRIEVTKGDLIIIPAGIYHRFT 132 (178)
Q Consensus 93 ~il~G~g~f~v~~~~--d~~~ri~~~~GDli~vPaG~~H~f~ 132 (178)
.=+-..+.|.++... +.+.++.++.||+++.=...++-|.
T Consensus 137 lSLg~~~~F~~~~~~r~~~~~~~~L~~Gdvvvm~G~~r~~~~ 178 (194)
T COG3145 137 LSLGAPCIFRLRGRRRRGPGLRLRLEHGDVVVMGGPSRLAWH 178 (194)
T ss_pred EecCCCeEEEeccccCCCCceeEEecCCCEEEecCCcccccc
Confidence 335567888888765 7789999999999988666665554
No 170
>KOG2130|consensus
Probab=21.16 E-value=75 Score=28.98 Aligned_cols=29 Identities=17% Similarity=0.147 Sum_probs=24.2
Q ss_pred CCcEEEEEEeCCcEEEeCCCCeeeeecCC
Q psy840 107 FDDWIRIEVTKGDLIIIPAGIYHRFTLDI 135 (178)
Q Consensus 107 ~d~~~ri~~~~GDli~vPaG~~H~f~~~~ 135 (178)
+.+-|.+.++||..+.||.|=.|-.-.-+
T Consensus 259 E~kPIEc~q~pGEt~fVP~GWWHvVlNle 287 (407)
T KOG2130|consen 259 EYKPIECLQKPGETMFVPSGWWHVVLNLE 287 (407)
T ss_pred ccCCceeeecCCceEEecCCeEEEEeccC
Confidence 45678999999999999999999865444
No 171
>CHL00010 infA translation initiation factor 1
Probab=20.92 E-value=1.3e+02 Score=21.11 Aligned_cols=11 Identities=27% Similarity=0.570 Sum_probs=6.5
Q ss_pred EEEeCCcEEEe
Q psy840 113 IEVTKGDLIII 123 (178)
Q Consensus 113 i~~~~GDli~v 123 (178)
+.+..||.+.+
T Consensus 45 i~~~vGD~V~v 55 (78)
T CHL00010 45 IRILPGDRVKV 55 (78)
T ss_pred cccCCCCEEEE
Confidence 44556777655
No 172
>PRK04012 translation initiation factor IF-1A; Provisional
Probab=20.62 E-value=1.8e+02 Score=21.66 Aligned_cols=28 Identities=25% Similarity=0.266 Sum_probs=16.5
Q ss_pred cceEEEEEEeCCCcE----------EEEEEeCCcEEEe
Q psy840 96 DGSGYFDVRDKFDDW----------IRIEVTKGDLIII 123 (178)
Q Consensus 96 ~G~g~f~v~~~~d~~----------~ri~~~~GDli~v 123 (178)
-|.+.|.|...+|.. -+|.+.+||.++|
T Consensus 31 lG~~~~~V~~~dG~~~la~i~GK~Rk~IwI~~GD~VlV 68 (100)
T PRK04012 31 LGANRVRVRCMDGVERMGRIPGKMKKRMWIREGDVVIV 68 (100)
T ss_pred cCCCEEEEEeCCCCEEEEEEchhhcccEEecCCCEEEE
Confidence 355666665444332 1467778888777
No 173
>PLN02403 aminocyclopropanecarboxylate oxidase
Probab=20.43 E-value=3.2e+02 Score=23.80 Aligned_cols=64 Identities=23% Similarity=0.323 Sum_probs=38.3
Q ss_pred cceecCcceEEEEe-c-c-eEEEEEEeCCCcEEEEEEeCCcEEEeCCC-------------CeeeeecCCC-CcEEEEEE
Q psy840 82 TEHLHTDEEIRLVL-D-G-SGYFDVRDKFDDWIRIEVTKGDLIIIPAG-------------IYHRFTLDIN-NYVKAKRY 144 (178)
Q Consensus 82 ~eH~H~~dEi~~il-~-G-~g~f~v~~~~d~~~ri~~~~GDli~vPaG-------------~~H~f~~~~~-~~~~alrl 144 (178)
....|+|--..-+| . + .+=..|. .+|+|+.|...+|+-|+|--| +.||.....+ ..+ .+-+
T Consensus 171 G~~~HtD~g~lTlL~q~~~v~GLqV~-~~g~Wi~V~p~p~~~lvVNvGD~L~~~Tng~~~S~~HRVv~~~~~~R~-Si~~ 248 (303)
T PLN02403 171 GLREHTDAGGIILLLQDDQVPGLEFL-KDGKWVPIPPSKNNTIFVNTGDQLEVLSNGRYKSTLHRVMADKNGSRL-SIAT 248 (303)
T ss_pred CccCccCCCeEEEEEecCCCCceEec-cCCeEEECCCCCCCEEEEEehHHHHHHhCCeeecccceeecCCCCCEE-EEEE
Confidence 34578885544333 3 2 3334553 478998888888776667666 7899875433 344 4444
Q ss_pred eec
Q psy840 145 FIG 147 (178)
Q Consensus 145 F~~ 147 (178)
|-.
T Consensus 249 F~~ 251 (303)
T PLN02403 249 FYN 251 (303)
T ss_pred EEc
Confidence 444
No 174
>COG3111 Periplasmic protein with OB-fold [Function unknown]
Probab=20.28 E-value=1.2e+02 Score=23.87 Aligned_cols=38 Identities=24% Similarity=0.255 Sum_probs=21.3
Q ss_pred cCcceEEEEecceEEEEEEeCCCcEEEEEEeCCcEEEe
Q psy840 86 HTDEEIRLVLDGSGYFDVRDKFDDWIRIEVTKGDLIII 123 (178)
Q Consensus 86 H~~dEi~~il~G~g~f~v~~~~d~~~ri~~~~GDli~v 123 (178)
|-.++.+.+-+|+|...|.-.+..|--..+.|-|.|.|
T Consensus 69 qi~~D~y~FrD~sGeI~VeIdd~~w~g~tv~P~dkV~I 106 (128)
T COG3111 69 QIGDDRYVFRDASGEINVDIDDKVWNGQTVTPKDKVRI 106 (128)
T ss_pred eeCCceEEEEcCCccEEEEecccccCCcccCcccEEEE
Confidence 34455556666666665555444455555666666554
No 175
>cd04456 S1_IF1A_like S1_IF1A_like: Translation initiation factor IF1A-like, S1-like RNA-binding domain. IF1A is also referred to as eIF1A in eukaryotes and aIF1A in archaea. S1-like RNA-binding domains are found in a wide variety of RNA-associated proteins. IF1A is essential for translation initiation. eIF1A acts synergistically with eIF1 to mediate assembly of ribosomal initiation complexes at the initiation codon and maintain the accuracy of this process by recognizing and destabilizing aberrant preinitiation complexes from the mRNA. Without eIF1A and eIF1, 43S ribosomal preinitiation complexes can bind to the cap-proximal region, but are unable to reach the initiation codon. eIF1a also enhances the formation of 5'-terminal complexes in the presence of other translation initiation factors. This protein family is only found in eukaryotes and archaea.
Probab=20.05 E-value=1.5e+02 Score=20.85 Aligned_cols=11 Identities=36% Similarity=0.872 Sum_probs=9.1
Q ss_pred EEEeCCcEEEe
Q psy840 113 IEVTKGDLIII 123 (178)
Q Consensus 113 i~~~~GDli~v 123 (178)
+.+.+||+++|
T Consensus 37 iwI~~GD~VlV 47 (78)
T cd04456 37 IWIKRGDFLIV 47 (78)
T ss_pred EEEcCCCEEEE
Confidence 67888998887
Done!