Query psy12281
Match_columns 456
No_of_seqs 432 out of 1676
Neff 7.3
Searched_HMMs 46136
Date Fri Aug 16 23:19:01 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy12281.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/12281hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG2131|consensus 100.0 2.1E-61 4.5E-66 468.8 8.1 340 83-441 25-400 (427)
2 KOG2130|consensus 100.0 2E-48 4.3E-53 371.5 13.1 242 86-347 52-310 (407)
3 PF13621 Cupin_8: Cupin-like d 100.0 1.3E-40 2.8E-45 321.8 14.1 225 97-337 1-249 (251)
4 KOG2132|consensus 99.9 4.3E-28 9.2E-33 234.5 3.8 214 94-329 134-355 (355)
5 KOG2508|consensus 99.9 1.7E-23 3.6E-28 204.0 13.4 234 84-337 20-311 (437)
6 KOG2923|consensus 99.9 5.4E-23 1.2E-27 151.4 5.4 61 3-63 2-62 (67)
7 PF05207 zf-CSL: CSL zinc fing 99.8 7.9E-21 1.7E-25 141.0 6.6 55 6-60 1-55 (55)
8 COG5216 Uncharacterized conser 99.8 1.8E-19 3.8E-24 130.7 5.5 58 4-61 3-60 (67)
9 PF08007 Cupin_4: Cupin superf 99.6 1.7E-15 3.7E-20 152.7 10.4 102 228-341 113-217 (319)
10 PF02373 JmjC: JmjC domain, hy 99.6 7.2E-16 1.6E-20 131.6 4.8 102 232-333 1-114 (114)
11 COG2850 Uncharacterized conser 99.6 6.3E-15 1.4E-19 145.9 12.0 262 93-396 13-286 (383)
12 KOG1633|consensus 99.3 6.1E-12 1.3E-16 137.8 9.8 192 137-346 42-242 (776)
13 KOG1356|consensus 99.0 2.7E-11 5.8E-16 130.2 -2.6 241 98-353 531-852 (889)
14 KOG3706|consensus 98.8 3.5E-09 7.7E-14 107.7 4.4 96 227-331 315-414 (629)
15 COG2140 Thermophilic glucose-6 97.4 0.0011 2.4E-08 62.2 9.6 75 237-337 88-164 (209)
16 smart00558 JmjC A domain famil 96.8 0.0011 2.5E-08 49.3 3.1 45 212-260 11-55 (57)
17 PF00190 Cupin_1: Cupin; Inte 96.5 0.01 2.2E-07 52.9 7.8 80 233-336 39-122 (144)
18 PF06560 GPI: Glucose-6-phosph 96.0 0.024 5.3E-07 52.5 7.3 59 255-335 87-145 (182)
19 TIGR03037 anthran_nbaC 3-hydro 95.8 0.044 9.5E-07 49.6 8.2 75 229-332 28-102 (159)
20 PRK04190 glucose-6-phosphate i 95.6 0.094 2E-06 49.2 9.8 67 242-334 87-156 (191)
21 PF07883 Cupin_2: Cupin domain 95.6 0.024 5.2E-07 43.4 4.9 57 238-326 7-63 (71)
22 PRK13264 3-hydroxyanthranilate 95.6 0.054 1.2E-06 49.8 7.8 74 229-331 34-107 (177)
23 KOG2132|consensus 95.5 0.0064 1.4E-07 60.4 1.7 88 236-325 181-278 (355)
24 TIGR03404 bicupin_oxalic bicup 95.0 0.11 2.4E-06 53.7 9.1 65 233-325 250-314 (367)
25 TIGR02098 MJ0042_CXXC MJ0042 f 94.6 0.067 1.4E-06 36.3 4.3 34 24-57 3-37 (38)
26 COG0662 {ManC} Mannose-6-phosp 93.9 0.31 6.7E-06 42.4 8.2 61 231-325 39-100 (127)
27 TIGR02272 gentisate_1_2 gentis 93.4 0.18 3.9E-06 51.3 6.5 66 237-334 89-154 (335)
28 PF06052 3-HAO: 3-hydroxyanthr 93.1 0.34 7.4E-06 43.2 6.9 76 227-331 31-106 (151)
29 COG1917 Uncharacterized conser 92.9 0.56 1.2E-05 40.7 8.2 66 233-331 47-115 (131)
30 smart00835 Cupin_1 Cupin. This 92.7 0.97 2.1E-05 40.1 9.5 66 233-325 35-100 (146)
31 TIGR03404 bicupin_oxalic bicup 91.6 1.4 3E-05 45.6 10.4 65 233-326 72-136 (367)
32 COG1996 RPC10 DNA-directed RNA 91.3 0.28 6E-06 35.4 3.5 34 20-58 3-37 (49)
33 PF02041 Auxin_BP: Auxin bindi 91.0 0.77 1.7E-05 40.9 6.6 65 237-325 52-116 (167)
34 PF05523 FdtA: WxcM-like, C-te 90.6 1.2 2.6E-05 39.1 7.6 77 231-336 33-112 (131)
35 KOG0958|consensus 90.3 0.11 2.5E-06 56.2 1.0 114 227-340 171-299 (690)
36 COG4101 Predicted mannose-6-ph 90.2 0.3 6.6E-06 41.7 3.3 26 300-325 88-113 (142)
37 PF13717 zinc_ribbon_4: zinc-r 89.9 0.33 7.2E-06 32.7 2.7 31 25-55 4-35 (36)
38 PF13719 zinc_ribbon_5: zinc-r 89.6 0.47 1E-05 32.1 3.2 32 25-56 4-36 (37)
39 PF05995 CDO_I: Cysteine dioxy 89.5 1.6 3.5E-05 40.2 7.9 74 237-331 83-161 (175)
40 PRK13290 ectC L-ectoine syntha 88.8 2.3 5E-05 37.0 7.9 30 301-330 76-105 (125)
41 COG5285 Protein involved in bi 88.6 0.97 2.1E-05 44.8 5.9 98 238-337 129-233 (299)
42 PF01050 MannoseP_isomer: Mann 87.5 1.4 3E-05 39.8 5.9 62 231-325 66-127 (151)
43 PF05180 zf-DNL: DNL zinc fing 87.0 0.69 1.5E-05 35.6 3.1 33 22-54 3-38 (66)
44 TIGR01221 rmlC dTDP-4-dehydror 85.8 4.9 0.00011 37.2 8.7 78 232-331 47-129 (176)
45 COG1898 RfbC dTDP-4-dehydrorha 85.4 6.8 0.00015 36.1 9.3 89 230-338 46-136 (173)
46 PF13759 2OG-FeII_Oxy_5: Putat 85.3 5.7 0.00012 32.8 8.1 36 297-332 63-101 (101)
47 PF09723 Zn-ribbon_8: Zinc rib 84.7 0.65 1.4E-05 32.4 1.8 31 21-53 3-34 (42)
48 smart00834 CxxC_CXXC_SSSS Puta 84.1 0.83 1.8E-05 31.2 2.1 34 21-56 3-37 (41)
49 PLN00212 glutelin; Provisional 83.1 7.2 0.00016 41.9 9.6 74 238-337 357-430 (493)
50 TIGR02466 conserved hypothetic 82.8 8.1 0.00018 36.5 8.9 35 299-333 161-198 (201)
51 PF09313 DUF1971: Domain of un 82.4 13 0.00027 30.0 8.6 57 250-330 24-80 (82)
52 PRK09943 DNA-binding transcrip 82.4 8.3 0.00018 35.6 8.8 34 301-334 147-180 (185)
53 PF00908 dTDP_sugar_isom: dTDP 82.3 9.1 0.0002 35.4 8.9 82 232-335 46-133 (176)
54 TIGR01479 GMP_PMI mannose-1-ph 80.8 5.4 0.00012 42.6 7.8 26 300-325 415-440 (468)
55 KOG3277|consensus 80.8 1.7 3.8E-05 38.8 3.3 43 12-54 68-113 (165)
56 TIGR01206 lysW lysine biosynth 78.4 2.5 5.4E-05 31.3 3.0 32 25-59 4-36 (54)
57 TIGR02605 CxxC_CxxC_SSSS putat 75.6 2.1 4.5E-05 31.0 2.0 34 21-56 3-38 (52)
58 PF05899 Cupin_3: Protein of u 74.9 1.9 4.2E-05 33.7 1.8 18 300-317 44-61 (74)
59 KOG2107|consensus 74.7 3.9 8.4E-05 37.1 3.8 28 297-324 113-140 (179)
60 PF02311 AraC_binding: AraC-li 74.0 17 0.00037 30.3 7.7 57 236-325 10-66 (136)
61 PF12852 Cupin_6: Cupin 73.5 14 0.0003 33.9 7.4 25 301-325 56-80 (186)
62 PRK15457 ethanolamine utilizat 72.8 13 0.00028 35.8 7.0 52 240-325 167-218 (233)
63 smart00659 RPOLCX RNA polymera 71.9 6.8 0.00015 27.6 3.7 28 23-56 2-30 (44)
64 TIGR03214 ura-cupin putative a 69.9 24 0.00052 34.6 8.6 33 301-333 99-131 (260)
65 TIGR03214 ura-cupin putative a 69.0 5.6 0.00012 39.1 3.9 28 298-325 216-243 (260)
66 PF03079 ARD: ARD/ARD' family; 68.7 41 0.00089 30.5 9.1 38 297-334 112-150 (157)
67 PRK15460 cpsB mannose-1-phosph 66.1 21 0.00046 38.3 7.8 26 300-325 424-449 (478)
68 COG3450 Predicted enzyme of th 65.9 5.2 0.00011 34.4 2.6 25 300-324 82-106 (116)
69 PLN00212 glutelin; Provisional 64.0 31 0.00068 37.1 8.5 90 232-326 84-174 (493)
70 PRK13502 transcriptional activ 63.2 29 0.00062 33.9 7.7 30 300-329 56-85 (282)
71 PF07385 DUF1498: Protein of u 62.9 21 0.00046 34.1 6.2 91 232-329 90-183 (225)
72 KOG3995|consensus 61.9 21 0.00045 33.8 5.9 76 227-331 31-106 (279)
73 PRK00398 rpoP DNA-directed RNA 60.0 7.9 0.00017 27.3 2.2 29 23-56 3-32 (46)
74 PF06249 EutQ: Ethanolamine ut 59.9 6.7 0.00015 35.4 2.3 19 299-317 112-130 (152)
75 KOG3130|consensus 59.0 33 0.00071 35.6 7.1 67 293-362 38-104 (514)
76 COG4766 EutQ Ethanolamine util 58.3 7.2 0.00016 35.0 2.1 17 301-317 137-153 (176)
77 PRK15131 mannose-6-phosphate i 57.8 8.2 0.00018 40.3 2.8 22 301-322 238-259 (389)
78 KOG2508|consensus 56.5 20 0.00044 36.7 5.1 51 297-347 358-416 (437)
79 PRK11171 hypothetical protein; 55.2 74 0.0016 31.3 9.0 25 301-325 102-126 (266)
80 KOG1633|consensus 54.2 6.6 0.00014 44.4 1.5 93 237-335 481-583 (776)
81 COG1482 ManA Phosphomannose is 54.1 15 0.00032 37.2 3.8 24 300-323 158-181 (312)
82 COG1791 Uncharacterized conser 54.0 43 0.00093 30.8 6.4 45 255-325 99-143 (181)
83 PRK05467 Fe(II)-dependent oxyg 53.5 52 0.0011 31.7 7.3 37 299-335 140-178 (226)
84 COG1579 Zn-ribbon protein, pos 53.2 7.9 0.00017 37.6 1.7 28 27-54 199-230 (239)
85 TIGR02272 gentisate_1_2 gentis 52.9 59 0.0013 33.2 8.0 59 236-327 257-315 (335)
86 PF12662 cEGF: Complement Clr- 52.8 4.6 9.9E-05 24.7 0.0 15 23-37 1-15 (24)
87 PF03604 DNA_RNApol_7kD: DNA d 52.5 5.8 0.00013 26.0 0.5 27 24-56 1-28 (32)
88 TIGR00218 manA mannose-6-phosp 51.8 7.9 0.00017 38.8 1.5 20 300-319 151-170 (302)
89 PRK10296 DNA-binding transcrip 50.8 52 0.0011 32.0 7.2 50 241-323 35-84 (278)
90 TIGR03027 pepcterm_export puta 50.7 9.7 0.00021 34.5 1.8 17 301-317 149-165 (165)
91 COG3435 Gentisate 1,2-dioxygen 50.4 24 0.00051 35.4 4.5 69 238-343 101-171 (351)
92 PLN02288 mannose-6-phosphate i 47.1 13 0.00028 38.9 2.2 22 301-322 252-273 (394)
93 COG2331 Uncharacterized protei 47.1 4.9 0.00011 31.7 -0.6 35 20-56 9-44 (82)
94 KOG2131|consensus 45.6 5.1 0.00011 40.9 -0.9 109 300-428 285-393 (427)
95 PRK00464 nrdR transcriptional 43.5 14 0.0003 33.5 1.6 33 24-56 1-39 (154)
96 PF05280 FlhC: Flagellar trans 40.9 14 0.00029 34.3 1.1 31 21-54 132-163 (175)
97 PF10955 DUF2757: Protein of u 40.2 22 0.00047 28.2 2.0 8 22-29 3-11 (76)
98 COG4306 Uncharacterized protei 39.7 5.9 0.00013 34.2 -1.3 55 44-100 38-103 (160)
99 smart00531 TFIIE Transcription 39.1 39 0.00084 30.1 3.8 38 20-57 96-135 (147)
100 PRK11171 hypothetical protein; 39.0 49 0.0011 32.5 4.9 29 298-326 221-249 (266)
101 PF02591 DUF164: Putative zinc 38.8 29 0.00063 25.5 2.5 26 27-52 24-53 (56)
102 KOG1779|consensus 37.9 36 0.00078 27.1 2.9 40 19-59 28-67 (84)
103 PF14354 Lar_restr_allev: Rest 36.4 24 0.00053 26.1 1.7 30 24-53 4-37 (61)
104 PF13640 2OG-FeII_Oxy_3: 2OG-F 36.1 31 0.00066 27.9 2.5 30 304-333 66-99 (100)
105 PF09845 DUF2072: Zn-ribbon co 35.7 16 0.00035 32.0 0.7 24 24-52 2-26 (131)
106 COG3364 Zn-ribbon containing p 34.7 18 0.00039 30.2 0.8 29 24-57 3-34 (112)
107 KOG3053|consensus 33.5 15 0.00032 35.8 0.2 39 14-52 28-78 (293)
108 PRK12860 transcriptional activ 33.4 23 0.0005 33.1 1.4 28 22-52 133-161 (189)
109 KOG1246|consensus 33.3 56 0.0012 38.0 4.9 112 229-340 329-455 (904)
110 COG4297 Uncharacterized protei 32.8 2.9E+02 0.0063 24.6 7.9 40 297-336 82-121 (163)
111 PF05605 zf-Di19: Drought indu 32.8 22 0.00048 25.9 1.0 29 23-53 2-39 (54)
112 PRK12722 transcriptional activ 32.7 24 0.00053 32.9 1.4 28 23-53 134-162 (187)
113 PF09862 DUF2089: Protein of u 32.0 56 0.0012 28.0 3.4 25 26-59 1-26 (113)
114 TIGR03028 EpsE polysaccharide 31.8 26 0.00056 33.9 1.5 15 303-317 225-239 (239)
115 COG3822 ABC-type sugar transpo 31.4 1.7E+02 0.0037 27.4 6.6 33 297-329 150-182 (225)
116 TIGR03504 FimV_Cterm FimV C-te 31.4 53 0.0012 23.1 2.7 31 422-452 14-44 (44)
117 PF14353 CpXC: CpXC protein 30.9 28 0.0006 30.1 1.5 42 25-66 3-59 (128)
118 PRK12335 tellurite resistance 30.2 1.9E+02 0.0041 28.5 7.5 57 250-330 32-88 (287)
119 PRK13501 transcriptional activ 29.2 64 0.0014 31.6 3.9 26 300-325 56-81 (290)
120 COG5553 Predicted metal-depend 29.2 1.3E+02 0.0028 27.5 5.3 27 297-331 126-154 (191)
121 PF15138 Syncollin: Syncollin 29.2 46 0.001 28.0 2.3 40 297-336 32-74 (112)
122 TIGR02297 HpaA 4-hydroxyphenyl 28.9 1.5E+02 0.0033 28.7 6.5 25 301-325 63-87 (287)
123 PF11331 DUF3133: Protein of u 28.7 41 0.00089 24.0 1.7 17 43-59 29-45 (46)
124 PHA00616 hypothetical protein 28.7 13 0.00027 26.4 -0.9 14 24-37 2-16 (44)
125 PF13831 PHD_2: PHD-finger; PD 26.6 50 0.0011 22.1 1.8 18 43-60 2-19 (36)
126 KOG0162|consensus 26.6 65 0.0014 36.1 3.5 29 294-322 1060-1092(1106)
127 PF08271 TF_Zn_Ribbon: TFIIB z 26.0 81 0.0017 21.7 2.8 30 24-57 1-31 (43)
128 PF08273 Prim_Zn_Ribbon: Zinc- 25.5 57 0.0012 22.5 1.9 28 22-51 2-32 (40)
129 COG5175 MOT2 Transcriptional r 25.4 13 0.00027 37.6 -1.9 18 9-29 20-37 (480)
130 PF05876 Terminase_GpA: Phage 24.0 49 0.0011 36.3 2.2 39 18-56 195-240 (557)
131 PRK15401 alpha-ketoglutarate-d 24.0 2.6E+02 0.0056 26.7 6.8 28 297-324 164-194 (213)
132 PF01238 PMI_typeI: Phosphoman 24.0 47 0.001 34.5 1.9 22 301-322 251-272 (373)
133 PRK00762 hypA hydrogenase nick 23.7 34 0.00074 29.7 0.7 31 22-52 69-99 (124)
134 PF00675 Peptidase_M16: Insuli 23.5 1E+02 0.0022 26.7 3.8 35 420-454 81-118 (149)
135 PRK13500 transcriptional activ 23.5 1E+02 0.0023 30.7 4.3 27 299-325 85-111 (312)
136 PRK10371 DNA-binding transcrip 23.2 1.2E+02 0.0027 30.1 4.7 27 299-325 63-89 (302)
137 COG3435 Gentisate 1,2-dioxygen 22.1 2.6E+02 0.0057 28.2 6.5 32 300-331 299-331 (351)
138 COG5525 Bacteriophage tail ass 21.9 50 0.0011 36.1 1.6 39 19-57 223-271 (611)
139 PF12973 Cupin_7: ChrR Cupin-l 21.8 1.2E+02 0.0026 24.3 3.6 25 304-328 62-86 (91)
140 PF13532 2OG-FeII_Oxy_2: 2OG-F 21.8 1.8E+02 0.004 26.4 5.3 28 297-325 145-176 (194)
141 PRK06260 threonine synthase; V 21.3 58 0.0013 33.9 2.0 33 22-61 2-35 (397)
142 PF12851 Tet_JBP: Oxygenase do 21.1 3.4E+02 0.0073 24.8 6.8 28 297-324 125-153 (171)
143 KOG1816|consensus 20.8 83 0.0018 31.4 2.8 83 265-349 40-142 (308)
144 KOG1379|consensus 20.5 1.6E+02 0.0034 29.9 4.7 55 253-313 191-252 (330)
145 PF11699 CENP-C_C: Mif2/CENP-C 20.3 1.8E+02 0.004 23.5 4.2 27 300-326 51-77 (85)
146 PRK03824 hypA hydrogenase nick 20.2 65 0.0014 28.4 1.8 14 22-35 69-83 (135)
No 1
>KOG2131|consensus
Probab=100.00 E-value=2.1e-61 Score=468.78 Aligned_cols=340 Identities=35% Similarity=0.597 Sum_probs=279.3
Q ss_pred cCCCCceeeCC-CCCHHHHHHhhcCCCCcEEEecCCCccccc----ccccc-CCC-------------------------
Q psy12281 83 ALPQVDYNIDP-SHIYPYCFNNFLSKNKPFILKSCVNEWNSS----LHWVQ-NKQ------------------------- 131 (456)
Q Consensus 83 ~~p~~I~ri~~-~ls~~~F~~~y~~~~~PvVi~g~~~~Wpa~----~~Wt~-~~~------------------------- 131 (456)
+.+..|+|+.. .+++.+|..+|...+.||+|.+...+|+++ ..|.. .+.
T Consensus 25 qlr~~v~r~~g~~~d~~~~a~~~~~k~~~v~is~v~~d~~~~~~~~~s~~dl~~~~~l~n~~~~f~k~~v~~~~~~v~~~ 104 (427)
T KOG2131|consen 25 QLRPCVLRLIGPTADYGDFAERDTHKRPGVLISYVLEDWPCVSRDNESGADLNNNPDLKNFARSFAKRIVQNGDKNVFLC 104 (427)
T ss_pred hhhhhhhhhcccccCcccchhhhcccCCCccccccccCcccccCCCcchhhhccCcchhHHHHHHhHHHHhcCCcccccc
Confidence 45577888865 699999999999999999999988999998 44433 222
Q ss_pred -----CChhHHHHHhCCceEEEEecCcccccCCceEEeeHHHHHHHHHhcccCCCCCCCCeeeeeccccccCCCCCcccc
Q psy12281 132 -----PNFEYLSNHYGDIEVPVANCNSYYFNAHEKTNMTLKEYTIYWQNKIDGKLSETEPLYYLKDWHFTRDFKTEDIYR 206 (456)
Q Consensus 132 -----~~~d~L~~~~Gd~~V~V~~~~~~~y~~~~~~~m~l~efl~~~~~~~~~~~~~~~~~~Yl~d~~~~~~~p~l~~y~ 206 (456)
++.++|.+.||+..|+++.|+..++.+ |+++++.-+.+...... ...+++|++|||+...+|....|.
T Consensus 105 ~~s~f~~~~~Le~ny~d~~v~~a~~~e~e~~~-----l~lad~~~~e~~~~~~~--~q~p~ly~kDwHL~~~~d~~~~~~ 177 (427)
T KOG2131|consen 105 DLSLFPDYEYLERNYGDFVVPVANCNEKERYS-----LKLADWEFREEQVQLAC--PQAPSLYGKDWHLYRFLDNDFPYW 177 (427)
T ss_pred ccccCccHHHHHhhhhhcchhhccccchhhhh-----hhhccchhHHHhHhhhc--CCCccccccchheeeecCcccccc
Confidence 355555556666666665555433222 44444444333322111 235689999999999999888999
Q ss_pred cCcccCcchhhHhhhhhccCCCccceeeeCCCCCcccccccccCccceeEEEEeEEEEEEeCCCCCcccccCCCCccccC
Q psy12281 207 VPNVFSSDWLNEYYSEHLEHKDDYRFVYMGPKETWTPLHADVFHSYSWSVNICGRKQWLLLAPGNEKYFKDSMGNLISDM 286 (456)
Q Consensus 207 ~P~~f~~D~ln~~~~~~~~~~~~~~~l~iG~~gs~T~lH~D~~~~~sW~~~i~G~Krw~L~pP~~~~~Ly~~~g~~p~d~ 286 (456)
+|.+|.+||+|+++.... .++|||+||||+|||||+|.|+++++||+++|||+|+|+|+||+++..|+++.|++|+++
T Consensus 178 ~pd~F~~dwlne~~i~~~--~ddyrFvy~Gp~gSwtp~HaDVf~s~swS~nicG~KrWl~~pP~qe~~l~dr~gnlp~~~ 255 (427)
T KOG2131|consen 178 TPDLFAKDWLNEFVIDGE--SDDYRFVYAGPAGSWTPFHADVFHSPSWSVNICGRKRWLLYPPEQEQTLADRYGNLPLPS 255 (427)
T ss_pred cchhhhhcccchhhccCC--CCceeEEEeccCCCCCccchhhhcCCcceeeeecceeEEEeChHHhhhhhhhccCcCCcc
Confidence 999999999999998766 899999999999999999999999999999999999999999999999999999999988
Q ss_pred CCcccCCCCCCceEEEEEcCCCEEEeCCCCeEEEEeCCCcEEEEeecCCCCCHHHHHHHHHHHHHHHHHhhhhcccchhH
Q psy12281 287 RSVDWSTLPRDTVIIVEQEAGDSIFVPSGWHHQVTNLEHTISINHNWINGTNIDHVYHEMVSHLEAVKKEIDDCKDMDDW 366 (456)
Q Consensus 287 ~~pd~~~fp~~~~~~~~l~pGD~LfIPsgW~H~V~nl~~sIsvn~~~~~~~nl~~~~~~l~~~~~~~~~~~~~~~~~~~~ 366 (456)
..+.+..| +.+.++++|+|||+||||+||+|||.||++||||||||+|++||.+||.+|++++.++.++|+| ..|++|
T Consensus 256 ~~~~ld~~-~~~~lei~Qepge~VFvPsGW~hQV~NL~dTISINHNW~N~~nl~~~w~~Lk~~y~a~~esisd-d~md~~ 333 (427)
T KOG2131|consen 256 WITKLDLF-RGPLLEIFQEPGETVFVPSGWHHQVLNLGDTISINHNWCNATNLAWMWDALKEDYPALAESISD-DLMDDF 333 (427)
T ss_pred cccccccc-ccchhhhhccCCceeeccCccccccccccceeeecccccccccHHHHHHHHHhhhhhhhhhhhh-hccccc
Confidence 88766543 3667999999999999999999999999999999999999999999999999999999999999 669999
Q ss_pred HHHHHHHHhhhcCCCHHHHHHHHHHHHHhhhhhcccCCccccccccccCccchhccHHHHHHHHHHhhcCCccch
Q psy12281 367 TSHCQLMLQVSFGINFRQFFDMLKFICQKRLTSLKNGTTIRLYGNWILSLDHTMFDLRKASDVLKLCAQHEDWDL 441 (456)
Q Consensus 367 ~~~~~~~l~~~~g~~~~~f~~~l~~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 441 (456)
+.+||.+|++++|||+.+|+.|+++++ ++..-....+ ..++...+++|+..+-++|..|+.+.+++.
T Consensus 334 ~~~cq~~lra~~gin~~ef~~f~~~~a--~l~h~~~~~~------a~l~~s~~~~~~~~q~evl~~l~~~~~~t~ 400 (427)
T KOG2131|consen 334 PLECQFSLRATEGINANEFIYFPDFWA--MLKHATLNRD------ASLFISTFLSDWPIQPEVLPPLSWEYSITK 400 (427)
T ss_pred hhhhhhhhhhhccccccceEeehHHHH--HHHHHHhccc------hhhhHHhhhcccccchhhhcceeecCceec
Confidence 999999999999999999999999998 3333222222 235555567788888888999998888764
No 2
>KOG2130|consensus
Probab=100.00 E-value=2e-48 Score=371.47 Aligned_cols=242 Identities=26% Similarity=0.487 Sum_probs=205.4
Q ss_pred CCceeeCC-CCCHHHHHHhhcCCCCcEEEecCCCccccccccccCCCCChhHHHHHhCCceEEEEecCcccccCCceEEe
Q psy12281 86 QVDYNIDP-SHIYPYCFNNFLSKNKPFILKSCVNEWNSSLHWVQNKQPNFEYLSNHYGDIEVPVANCNSYYFNAHEKTNM 164 (456)
Q Consensus 86 ~~I~ri~~-~ls~~~F~~~y~~~~~PvVi~g~~~~Wpa~~~Wt~~~~~~~d~L~~~~Gd~~V~V~~~~~~~y~~~~~~~m 164 (456)
++++|++. ++|.++|.++|..+++||||+|++++|||..+|| +++|.++||++++.+...+ .+...+|
T Consensus 52 dn~~r~~~~~isveeF~~~ferp~kPVll~g~~DgW~A~ekWT------~drLskkyrnq~Fkcged~-----~gnsv~M 120 (407)
T KOG2130|consen 52 DNGNRERRRNISVEEFIERFERPYKPVLLSGLTDGWPASEKWT------LDRLSKKYRNQKFKCGEDN-----NGNSVKM 120 (407)
T ss_pred ccchhhhhcCCCHHHHHHHhcccCCceEeeccccCCchhhhhh------HHHHHHHhcCccccccccC-----CCcceee
Confidence 56666655 9999999999999999999999999999999997 6999999999999886432 3447899
Q ss_pred eHHHHHHHHHhcccCCCCCCCCeeeeeccccccCCC--CC-cccccCcccCcchhhHhhhhhccCCCccceeeeCCCCCc
Q psy12281 165 TLKEYTIYWQNKIDGKLSETEPLYYLKDWHFTRDFK--TE-DIYRVPNVFSSDWLNEYYSEHLEHKDDYRFVYMGPKETW 241 (456)
Q Consensus 165 ~l~efl~~~~~~~~~~~~~~~~~~Yl~d~~~~~~~p--~l-~~y~~P~~f~~D~ln~~~~~~~~~~~~~~~l~iG~~gs~ 241 (456)
+|+.|++|++... ++.++|++|-.|.++.| .| .+|.+|.||.+||+. |++... +++|||+.|||++|+
T Consensus 121 KmkyY~~Ym~~~R------ddsPLYiFDssFgE~~~~rkLl~dY~VPk~F~dDlF~-y~g~e~--RPpyRWfvmGParSG 191 (407)
T KOG2130|consen 121 KMKYYIEYMKSTR------DDSPLYIFDSSFGEHAPRRKLLEDYSVPKYFRDDLFQ-YLGEER--RPPYRWFVMGPARSG 191 (407)
T ss_pred eHHHHHHHHhccc------cCCCeEEecchhhcccchhhhhhhcCcchhhhHHHHH-hcCccc--CCCceeEEecCCCCC
Confidence 9999999999765 47899999999999888 55 699999999999985 665554 899999999999999
Q ss_pred ccccccccCccceeEEEEeEEEEEEeCCCCCcccccC----CCCccccCCCc--------ccCCCCC-CceEEEEEcCCC
Q psy12281 242 TPLHADVFHSYSWSVNICGRKQWLLLAPGNEKYFKDS----MGNLISDMRSV--------DWSTLPR-DTVIIVEQEAGD 308 (456)
Q Consensus 242 T~lH~D~~~~~sW~~~i~G~Krw~L~pP~~~~~Ly~~----~g~~p~d~~~p--------d~~~fp~-~~~~~~~l~pGD 308 (456)
|.+|+|+.++.+|+++|+|+|||.||||+..+.+-.. .|..|..+..+ .+..+|. .++++|+|.|||
T Consensus 192 tsiHIDPlgTSAWNtll~GhKrW~LfPp~~p~~lvkv~~~e~g~~~de~itwf~~~y~rt~~Pswp~E~kPIEc~q~pGE 271 (407)
T KOG2130|consen 192 TSIHIDPLGTSAWNTLLQGHKRWVLFPPGTPPELVKVTVDEGGKQPDEIITWFSTIYPRTQLPSWPDEYKPIECLQKPGE 271 (407)
T ss_pred ceeEECCcchHHHHHHhhccceeEEcCCCCCCCceeecccccCCCCcceechhhhccccccCCCCccccCCceeeecCCc
Confidence 9999999999999999999999999999997755421 22333222111 0111233 799999999999
Q ss_pred EEEeCCCCeEEEEeCCCcEEEEeecCCCCCHHHHHHHHH
Q psy12281 309 SIFVPSGWHHQVTNLEHTISINHNWINGTNIDHVYHEMV 347 (456)
Q Consensus 309 ~LfIPsgW~H~V~nl~~sIsvn~~~~~~~nl~~~~~~l~ 347 (456)
++|||+||||-|.|+++||+|++||++..|+..||+.+.
T Consensus 272 t~fVP~GWWHvVlNle~TIAiTqNf~s~eNf~~Vw~ktv 310 (407)
T KOG2130|consen 272 TMFVPSGWWHVVLNLEPTIAITQNFASKENFPFVWHKTV 310 (407)
T ss_pred eEEecCCeEEEEeccCceeeeeeccccccCCceeeeccc
Confidence 999999999999999999999999999999999998763
No 3
>PF13621 Cupin_8: Cupin-like domain; PDB: 3AL6_C 3AL5_C 2XUM_A 2Y0I_A 1MZE_A 3KCY_A 1MZF_A 1YCI_A 2ILM_A 1H2L_A ....
Probab=100.00 E-value=1.3e-40 Score=321.77 Aligned_cols=225 Identities=27% Similarity=0.569 Sum_probs=161.7
Q ss_pred HHHHHHhhcCCCCcEEEecCCCccccccccccCCCCChhHHHHHhCCceEEEEecCc---cccc--------CCceEEee
Q psy12281 97 YPYCFNNFLSKNKPFILKSCVNEWNSSLHWVQNKQPNFEYLSNHYGDIEVPVANCNS---YYFN--------AHEKTNMT 165 (456)
Q Consensus 97 ~~~F~~~y~~~~~PvVi~g~~~~Wpa~~~Wt~~~~~~~d~L~~~~Gd~~V~V~~~~~---~~y~--------~~~~~~m~ 165 (456)
|++|+++|+.+++||||+|++.+|+|+++|+. ....++||++++|+..|.+..... ..+. ......|+
T Consensus 1 p~eF~~~~~~~~~PvVi~g~~~~w~a~~~W~~-~~~~~~~l~~~~g~~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 79 (251)
T PF13621_consen 1 PEEFFERYVKPNKPVVIRGAAADWPAFQKWTD-DEYLLDYLKEKYGDVPVPVRDSPNGISDSFGYNDDRKFVNFERERMP 79 (251)
T ss_dssp -HHHHHHTCCTT--EEEESSSTTCCHHHH-S------HHHHHHHHTSSEEEEEEEST--SSBTS-B-GCCCTCEEEEEEE
T ss_pred CHHHHHHHhCCCCCEEEeCCccCCchhhhccc-hhHHHHHHHHHcCCeEEEEEecCCcccccccccccccccCCeeEECc
Confidence 68999999999999999999999999999973 223479999999999999975431 1111 23468999
Q ss_pred HHHHHHHHHhcccCCCCCCCCeeeeeccccccCCCCCccc-ccCcccCcchhhHhhhhhccCCCccceeeeCCCCCcccc
Q psy12281 166 LKEYTIYWQNKIDGKLSETEPLYYLKDWHFTRDFKTEDIY-RVPNVFSSDWLNEYYSEHLEHKDDYRFVYMGPKETWTPL 244 (456)
Q Consensus 166 l~efl~~~~~~~~~~~~~~~~~~Yl~d~~~~~~~p~l~~y-~~P~~f~~D~ln~~~~~~~~~~~~~~~l~iG~~gs~T~l 244 (456)
|++|++.+..... ......+|+.++.+...+|.+... .+| +..++++ . ......+|||++|+.|++
T Consensus 80 ~~~f~~~~~~~~~---~~~~~~~y~~~~~~~~~~~~l~~~~~~~--~~~~~~~----~----~~~~~~l~ig~~gs~t~l 146 (251)
T PF13621_consen 80 FRDFLDRLRANRD---QRDKPYYYLQNWNLFEDFPELREDNDLP--FPPELFG----K----EPQSSNLWIGPPGSFTPL 146 (251)
T ss_dssp HHHHHHHHHHSCH---STSSSEEEEEEETHHHHSHHHHCCS-CH--HHHCHSC----C----HCCEEEEEEE-TTEEEEE
T ss_pred HHHHHHHHHhccc---ccCCceEEEecCchHHHhhhhhhccccc--cchhhcc----c----CccccEEEEeCCCceeee
Confidence 9999999987653 123468999999888888877422 222 2222221 1 124567999999999999
Q ss_pred cccccCccceeEEEEeEEEEEEeCCCCCcccccCC------CCccccCCCcccCCCCC---CceEEEEEcCCCEEEeCCC
Q psy12281 245 HADVFHSYSWSVNICGRKQWLLLAPGNEKYFKDSM------GNLISDMRSVDWSTLPR---DTVIIVEQEAGDSIFVPSG 315 (456)
Q Consensus 245 H~D~~~~~sW~~~i~G~Krw~L~pP~~~~~Ly~~~------g~~p~d~~~pd~~~fp~---~~~~~~~l~pGD~LfIPsg 315 (456)
|+|..+ +|++||.|+|+|+||||.+...||+.. ....+|+.+++..+||+ +++++|+|+|||+||||+|
T Consensus 147 H~D~~~--n~~~~i~G~K~~~L~pP~~~~~l~~~~~~~~~~~~~~~d~~~~d~~~~p~~~~~~~~~~~l~pGD~LfiP~g 224 (251)
T PF13621_consen 147 HYDPSH--NLLAQIRGRKRWILFPPDDSPNLYPRPDSHGGTVFSWVDPDNPDLERFPKFRKAPPYEVVLEPGDVLFIPPG 224 (251)
T ss_dssp EE-SSE--EEEEEEESEEEEEEE-GGGGGGCTBETTTST-TCBBSS-TTS--TTT-CGGGG--EEEEEEETT-EEEE-TT
T ss_pred eECchh--hhhhccCCCEEEEEECCccccccccceecccccceeeeeccChhhhhhhhhccCceeEEEECCCeEEEECCC
Confidence 999965 559999999999999999999999764 11247888888888875 7899999999999999999
Q ss_pred CeEEEEeC--CC-cEEEEeecCCCC
Q psy12281 316 WHHQVTNL--EH-TISINHNWINGT 337 (456)
Q Consensus 316 W~H~V~nl--~~-sIsvn~~~~~~~ 337 (456)
|||+|+|+ ++ |||||+||....
T Consensus 225 WwH~V~~~~~~~~sisvn~w~~~~~ 249 (251)
T PF13621_consen 225 WWHQVENLSDDDLSISVNYWFRTPF 249 (251)
T ss_dssp -EEEEEESTTSSCEEEEEEEEESS-
T ss_pred CeEEEEEcCCCCeEEEEEEEecccc
Confidence 99999999 77 999999998753
No 4
>KOG2132|consensus
Probab=99.94 E-value=4.3e-28 Score=234.54 Aligned_cols=214 Identities=21% Similarity=0.320 Sum_probs=178.1
Q ss_pred CCCHHHHHHhhcCCCCcEEEecCCCccccccccccCCCCChhHHHHHhCCceEEEEecCcccccCCceEEeeHHHHHHHH
Q psy12281 94 SHIYPYCFNNFLSKNKPFILKSCVNEWNSSLHWVQNKQPNFEYLSNHYGDIEVPVANCNSYYFNAHEKTNMTLKEYTIYW 173 (456)
Q Consensus 94 ~ls~~~F~~~y~~~~~PvVi~g~~~~Wpa~~~Wt~~~~~~~d~L~~~~Gd~~V~V~~~~~~~y~~~~~~~m~l~efl~~~ 173 (456)
.++..+|+.+++.+++|++++|...+|||.++|. ++||.+..|.+.|+|.-..+..-......-|+..+|+.+.
T Consensus 134 ~~~l~~~l~d~~~~~~p~~~l~~~~hwpa~~kw~------l~~l~~va~~rTvpvEvg~~y~~~~w~q~l~t~~~g~~R~ 207 (355)
T KOG2132|consen 134 RLSLQEFLNDTVGRGTPVILLGFNWHWPAIQKWS------LGYLQQVAGNRTVPVEVGSTYADEDWSQNLMTQIKGIKRC 207 (355)
T ss_pred chHHHHHHHHhhcCCCceEEecccccchHhhhcc------hhHHHhcccceeeeeecccccchhHHHhhhHHHhhhhhhh
Confidence 6888999999999999999999999999999994 8999999999999997544322222345678888888877
Q ss_pred HhcccCCCCCCCCeeeeeccccccCCCCCc-ccccCcccCcchhhHhhhhhccCCCccceeeeCCCCCcccccccccCcc
Q psy12281 174 QNKIDGKLSETEPLYYLKDWHFTRDFKTED-IYRVPNVFSSDWLNEYYSEHLEHKDDYRFVYMGPKETWTPLHADVFHSY 252 (456)
Q Consensus 174 ~~~~~~~~~~~~~~~Yl~d~~~~~~~p~l~-~y~~P~~f~~D~ln~~~~~~~~~~~~~~~l~iG~~gs~T~lH~D~~~~~ 252 (456)
.... .++++|+.+.++....|+++ ++..|.+-..--++ . +...+..|+||+|+.+|+|.|+.+|+
T Consensus 208 ~~~~------p~~~~YlAqh~lfdqi~elk~Di~iPDyc~~~~f~------~--~~v~~~~w~GpaGtV~pih~dp~hNi 273 (355)
T KOG2132|consen 208 ILFE------PRECLYLAQHHLFDQIPELKFDISIPDYCSFPNFE------N--EVVDINAWIGPAGTVLPIHMDPWHNI 273 (355)
T ss_pred hcCC------ccccchhhhhhhhccchhhhhccCCCceeecCCCC------c--cccceeEEeccCCceeccccccccce
Confidence 6543 24688999999999999884 77778753321111 0 23557899999999999999999999
Q ss_pred ceeEEEEeEEEEEEeCCCCCcccccCCCCc----cccCCCcccCCCCC---CceEEEEEcCCCEEEeCCCCeEEEEeCCC
Q psy12281 253 SWSVNICGRKQWLLLAPGNEKYFKDSMGNL----ISDMRSVDWSTLPR---DTVIIVEQEAGDSIFVPSGWHHQVTNLEH 325 (456)
Q Consensus 253 sW~~~i~G~Krw~L~pP~~~~~Ly~~~g~~----p~d~~~pd~~~fp~---~~~~~~~l~pGD~LfIPsgW~H~V~nl~~ 325 (456)
.+||.|+|+|+|+||.....||+..+.+ .+|+.+||+..||+ ++.+.++++|||+||||.-|||.|++++.
T Consensus 274 --~~qv~G~k~i~l~~p~~s~~lyP~d~~~~~tsqvdvenPdlk~fp~~~k~~~l~~lL~pGe~L~iP~kwwhyvrs~d~ 351 (355)
T KOG2132|consen 274 --LSQVFGRKRIRLYPPEDSGALYPTDTYLLETSQVDVENPDLKAFPKFAKARFLDCLLEPGEALFIPPKWWHYVRSLDK 351 (355)
T ss_pred --eeeeecceEEEEecCcccCCCCCccchhhcccccccCCCChhhhhHHHHHHHHHHhcCCchhccccHHHhhhhhhccc
Confidence 9999999999999999999999875542 47899999999986 89999999999999999999999999998
Q ss_pred cEEE
Q psy12281 326 TISI 329 (456)
Q Consensus 326 sIsv 329 (456)
++|+
T Consensus 352 s~s~ 355 (355)
T KOG2132|consen 352 SFSV 355 (355)
T ss_pred cccC
Confidence 7764
No 5
>KOG2508|consensus
Probab=99.90 E-value=1.7e-23 Score=204.01 Aligned_cols=234 Identities=21% Similarity=0.362 Sum_probs=169.2
Q ss_pred CCCCceeeCCCCCHHHHHHhhcCCCCcEEEecCCCccccccccccCCCCChhHHHHHhCCceEEEEecCcccccC-----
Q psy12281 84 LPQVDYNIDPSHIYPYCFNNFLSKNKPFILKSCVNEWNSSLHWVQNKQPNFEYLSNHYGDIEVPVANCNSYYFNA----- 158 (456)
Q Consensus 84 ~p~~I~ri~~~ls~~~F~~~y~~~~~PvVi~g~~~~Wpa~~~Wt~~~~~~~d~L~~~~Gd~~V~V~~~~~~~y~~----- 158 (456)
+|..|.+.|..+|+.+|.++|+.+|+||||+.++.+|||.++|+. .+||....||..|.|+-.+. .|.+
T Consensus 20 vp~~v~~lD~~Ptpl~Fyr~fvs~n~PvIIrkAL~hWpal~lWs~-----p~Yl~~algd~vvsvaitPn-gyadgav~~ 93 (437)
T KOG2508|consen 20 VPVLVNLLDLTPTPLDFYRKFVSTNTPVIIRKALPHWPALKLWSQ-----PDYLLSALGDIVVSVAITPN-GYADGAVMS 93 (437)
T ss_pred cccccccccCCCchHHHHHhhhcCCCcEEEecccccCchhhccCc-----hHHHHHhccCeEEEEEeCCC-Ccccccccc
Confidence 456788998889999999999999999999999999999999984 78999999999998873221 1111
Q ss_pred -------CceEEeeHHHHHHHHHhcccCCCCCCCCeeeeecc--ccccCCCCCc-ccccCcccCcchhhHhhhhhccCCC
Q psy12281 159 -------HEKTNMTLKEYTIYWQNKIDGKLSETEPLYYLKDW--HFTRDFKTED-IYRVPNVFSSDWLNEYYSEHLEHKD 228 (456)
Q Consensus 159 -------~~~~~m~l~efl~~~~~~~~~~~~~~~~~~Yl~d~--~~~~~~p~l~-~y~~P~~f~~D~ln~~~~~~~~~~~ 228 (456)
....+|++++.+..+..+.+ ....+|+.+. .+..++|.+. +-..-. -+|-.+... ..+
T Consensus 94 g~e~f~~pae~KlklsevL~vl~~~~~-----~~~Vly~qkQ~snl~~elp~la~Dl~~~d---~~wa~eafg----k~P 161 (437)
T KOG2508|consen 94 GNEMFIKPAEQKLKLSEVLYVLTQYDE-----SNEVLYGQKQNSNLVKELPPLATDLLLTD---FSWAPEAFG----KVP 161 (437)
T ss_pred CcceeechhhhhccchhhheehhcccC-----CCceeehhhhccccccccccccccccccc---ccccHHHhC----CCh
Confidence 11346777777776665532 3578898764 3455666542 111100 023222221 256
Q ss_pred ccceeeeCCCCCcccccccccCccceeEEEEeEEEEEEeCCCCCcccccC---------C---CCccc------------
Q psy12281 229 DYRFVYMGPKETWTPLHADVFHSYSWSVNICGRKQWLLLAPGNEKYFKDS---------M---GNLIS------------ 284 (456)
Q Consensus 229 ~~~~l~iG~~gs~T~lH~D~~~~~sW~~~i~G~Krw~L~pP~~~~~Ly~~---------~---g~~p~------------ 284 (456)
+..++|||...++|.||.|+++|+ +|+|.|.|+|+|.||++.+.|+-. . |.+-.
T Consensus 162 davNlWiG~~~avTSlHkDhyENl--YaViSGeKhFvlipP~d~~~lp~eiyp~g~ysy~~e~g~fk~e~~dEe~~~~a~ 239 (437)
T KOG2508|consen 162 DAVNLWIGKSEAVTSLHKDHYENL--YAVISGEKHFVLIPPDDANQLPLEIYPNGLYSYEGEDGPFKSEGLDEEDAAIAK 239 (437)
T ss_pred hheeeeecccccccccccccccce--EEEEeccceEEEeCcccccccceeecCCcceeeecCCCceeecccchhhhhhcc
Confidence 788999999999999999999999 999999999999999998765421 1 11100
Q ss_pred -----cCC-Cc-----ccCCCC---C-CceEEEEEcCCCEEEeCCCCeEEEEeC----CCcEEEEeecCCCC
Q psy12281 285 -----DMR-SV-----DWSTLP---R-DTVIIVEQEAGDSIFVPSGWHHQVTNL----EHTISINHNWINGT 337 (456)
Q Consensus 285 -----d~~-~p-----d~~~fp---~-~~~~~~~l~pGD~LfIPsgW~H~V~nl----~~sIsvn~~~~~~~ 337 (456)
|.. +| +..+|| . .++..+.+.+|+++|.|..|.|+|-.. +.+|++|.|+-...
T Consensus 240 ~~~svd~~~s~~~~a~~~~k~~l~~~~~k~~~~rv~a~~i~~lp~~w~h~~sq~~gd~~~~Iavn~~~d~~~ 311 (437)
T KOG2508|consen 240 TWFSVDPIGSPKELAVPQEKIPLLEAEYKPEMDRVLAGQILYLPEDWFHHVSQVGGDPESDIAVNEWYDIEA 311 (437)
T ss_pred eeeeecccCCHHHhccchhhCccccccccceeeeeeccccccChhhhhhhccccCCCccceeEEeeeccccc
Confidence 100 11 122233 3 788899999999999999999999888 45999999987664
No 6
>KOG2923|consensus
Probab=99.88 E-value=5.4e-23 Score=151.43 Aligned_cols=61 Identities=66% Similarity=1.227 Sum_probs=58.1
Q ss_pred eeeeEEeCCCccccCCCceEEEecCCCCcEEeehhhhhcCCcEEeCCCCcceEEEeeeehh
Q psy12281 3 VYHDEIEIEDFEYDEDDETYYYPCPCGDRFQITKEQLKAGEDVATCPTCSLIIKPSVIKSV 63 (456)
Q Consensus 3 ~~~~~~~l~d~~~~~~~~~~~~~CrCg~~f~i~~~~l~~~~~~~~C~~CSl~i~~~~~~~~ 63 (456)
++||+|.|+||+|+++++.|+|||||||.|+|++++|+.|+.+++|+||||.|+|.|+..+
T Consensus 2 ~~ydeVeiedfe~~~e~~~y~yPCpCGDrf~It~edL~~ge~Va~CpsCSL~I~ViYd~ed 62 (67)
T KOG2923|consen 2 SFYDEVEIEDFEFDEENQTYYYPCPCGDRFQITLEDLENGEDVARCPSCSLIIRVIYDKED 62 (67)
T ss_pred cccceEEeecceeccCCCeEEcCCCCCCeeeecHHHHhCCCeeecCCCceEEEEEEeCHHH
Confidence 5799999999999999999999999999999999999999999999999999999997653
No 7
>PF05207 zf-CSL: CSL zinc finger; InterPro: IPR007872 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a probable zinc binding motif that contains four cysteines and may chelate zinc, known as the DPH-type after the diphthamide (DPH) biosynthesis protein in which it was first characterised, including the proteins DPH3 and DPH4. This domain is also found associated with N-terminal domain of heat shock protein DnaJ IPR001623 from INTERPRO domain. Diphthamide is a unique post-translationally modified histidine residue found only in translation elongation factor 2 (eEF-2). It is conserved from archaea to humans and serves as the target for diphteria toxin and Pseudomonas exotoxin A. These two toxins catalyse the transfer of ADP-ribose to diphtamide on eEF-2, thus inactivating eEF-2, halting cellular protein synthesis, and causing cell death []. The biosynthesis of diphtamide is dependent on at least five proteins, DPH1 to -5, and a still unidentified amidating enzyme. DPH3 and DPH4 share a conserved region, which encode a putative zinc finger, the DPH-type or CSL-type (after the conserved motif of the final cysteine) zinc finger [, ]. The function of this motif is unknown. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; PDB: 2L6L_A 1WGE_A 2JR7_A 1YOP_A 1YWS_A.
Probab=99.83 E-value=7.9e-21 Score=140.97 Aligned_cols=55 Identities=42% Similarity=0.970 Sum_probs=49.3
Q ss_pred eEEeCCCccccCCCceEEEecCCCCcEEeehhhhhcCCcEEeCCCCcceEEEeee
Q psy12281 6 DEIEIEDFEYDEDDETYYYPCPCGDRFQITKEQLKAGEDVATCPTCSLIIKPSVI 60 (456)
Q Consensus 6 ~~~~l~d~~~~~~~~~~~~~CrCg~~f~i~~~~l~~~~~~~~C~~CSl~i~~~~~ 60 (456)
|+|+|+||+++++++.|+|+|||||.|.|++++|++++.+|+|++|||+|+|.|+
T Consensus 1 dev~l~d~~~~~~~~~~~y~CRCG~~f~i~e~~l~~~~~iv~C~sCSL~I~V~~~ 55 (55)
T PF05207_consen 1 DEVSLDDMEFDEEEGVYSYPCRCGGEFEISEEDLEEGEVIVQCDSCSLWIRVNYD 55 (55)
T ss_dssp -EEETTTSEEETTTTEEEEEETTSSEEEEEHHHHHCT--EEEETTTTEEEEEE--
T ss_pred CeEEhhhceecCCCCEEEEcCCCCCEEEEcchhccCcCEEEECCCCccEEEEEeC
Confidence 6899999999999999999999999999999999999999999999999999984
No 8
>COG5216 Uncharacterized conserved protein [Function unknown]
Probab=99.78 E-value=1.8e-19 Score=130.74 Aligned_cols=58 Identities=53% Similarity=1.083 Sum_probs=55.8
Q ss_pred eeeEEeCCCccccCCCceEEEecCCCCcEEeehhhhhcCCcEEeCCCCcceEEEeeee
Q psy12281 4 YHDEIEIEDFEYDEDDETYYYPCPCGDRFQITKEQLKAGEDVATCPTCSLIIKPSVIK 61 (456)
Q Consensus 4 ~~~~~~l~d~~~~~~~~~~~~~CrCg~~f~i~~~~l~~~~~~~~C~~CSl~i~~~~~~ 61 (456)
+||+|.++||+++.++..|+||||||+.|+|+.++|.+|+.++.|+||||.|.|.|+.
T Consensus 3 fYDEieiedftf~~e~~~ftyPCPCGDRFeIsLeDl~~GE~VArCPSCSLiv~vvyd~ 60 (67)
T COG5216 3 FYDEIEIEDFTFSREEKTFTYPCPCGDRFEISLEDLRNGEVVARCPSCSLIVCVVYDA 60 (67)
T ss_pred ccceeEeeeeEEcCCCceEEecCCCCCEeEEEHHHhhCCceEEEcCCceEEEEEEecH
Confidence 5799999999999999999999999999999999999999999999999999999954
No 9
>PF08007 Cupin_4: Cupin superfamily protein; InterPro: IPR022777 This signature represents primarily the cupin fold found in JmjC transcription factors. The fold is also found in lysine-specific demethylase NO66.; PDB: 2XDV_A 1VRB_B 4DIQ_B.
Probab=99.61 E-value=1.7e-15 Score=152.72 Aligned_cols=102 Identities=24% Similarity=0.382 Sum_probs=69.6
Q ss_pred CccceeeeCCCC-CcccccccccCccceeEEEEeEEEEEEeCCCCCc-ccccCCCCccccCCCcccCCCC-CCceEEEEE
Q psy12281 228 DDYRFVYMGPKE-TWTPLHADVFHSYSWSVNICGRKQWLLLAPGNEK-YFKDSMGNLISDMRSVDWSTLP-RDTVIIVEQ 304 (456)
Q Consensus 228 ~~~~~l~iG~~g-s~T~lH~D~~~~~sW~~~i~G~Krw~L~pP~~~~-~Ly~~~g~~p~d~~~pd~~~fp-~~~~~~~~l 304 (456)
....++|++|+| .+.+.|+|..+++ .+|+.|+|+|.|++|.... ..+. ..++.... ..+..+++|
T Consensus 113 ~~~~n~Y~tp~g~~g~~~H~D~~dvf--vlQ~~G~K~W~l~~~~~~~~~~~~----------~~~~~~~~~~~~~~~~~L 180 (319)
T PF08007_consen 113 PVGANAYLTPPGSQGFGPHYDDHDVF--VLQLEGRKRWRLYPPPDEPAPLYS----------DQPFKQLEEFEPVEEVVL 180 (319)
T ss_dssp -EEEEEEEETSSBEESECEE-SSEEE--EEEEES-EEEEEE-SCCCTTTSSC----------E--TTTCG--STSEEEEE
T ss_pred ccceEEEecCCCCCCccCEECCcccE--EEECCceeEEEECCCCcccccccC----------CCCccccccCceeEEEEE
Confidence 456789999998 5799999999999 9999999999999844322 1111 01111111 146779999
Q ss_pred cCCCEEEeCCCCeEEEEeCCCcEEEEeecCCCCCHHH
Q psy12281 305 EAGDSIFVPSGWHHQVTNLEHTISINHNWINGTNIDH 341 (456)
Q Consensus 305 ~pGD~LfIPsgW~H~V~nl~~sIsvn~~~~~~~nl~~ 341 (456)
+|||+||||+||||++.+.+.|++++..+...+..+.
T Consensus 181 ~pGD~LYlPrG~~H~~~~~~~S~hltv~~~~~t~~dl 217 (319)
T PF08007_consen 181 EPGDVLYLPRGWWHQAVTTDPSLHLTVGFRAPTWADL 217 (319)
T ss_dssp -TT-EEEE-TT-EEEEEESS-EEEEEEEECCEBHHHH
T ss_pred CCCCEEEECCCccCCCCCCCCceEEEEeeeCCchhhH
Confidence 9999999999999999999999999999777654443
No 10
>PF02373 JmjC: JmjC domain, hydroxylase; InterPro: IPR013129 Jumonji protein is required for neural tube formation in mice [].There is evidence of domain swapping within the jumonji family of transcription factors []. This domain is often associated with jmjN (see IPR003349 from INTERPRO) and belongs to the Cupin superfamily [].; PDB: 2YU2_A 2YU1_A 3AVR_A 3AVS_A 2OX0_B 2OQ6_B 2WWJ_A 2Q8D_A 3PDQ_A 2YBK_A ....
Probab=99.60 E-value=7.2e-16 Score=131.56 Aligned_cols=102 Identities=26% Similarity=0.377 Sum_probs=70.0
Q ss_pred eeeeCCCCCcccccccccCccc-eeEEEEeEEEEEEeCCCCCcccccCCCCc-----cccCC------CcccCCCCCCce
Q psy12281 232 FVYMGPKETWTPLHADVFHSYS-WSVNICGRKQWLLLAPGNEKYFKDSMGNL-----ISDMR------SVDWSTLPRDTV 299 (456)
Q Consensus 232 ~l~iG~~gs~T~lH~D~~~~~s-W~~~i~G~Krw~L~pP~~~~~Ly~~~g~~-----p~d~~------~pd~~~fp~~~~ 299 (456)
+++||.++|+|++|.|.....+ |+....|.|.|.++||.....+....... |.... .|+.-.=...+.
T Consensus 1 ~~~ig~~~s~t~~H~e~~~~~sv~~~~~g~~k~W~~v~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~p~~l~~~gi~~ 80 (114)
T PF02373_consen 1 WLYIGMKGSYTPWHIEDNGLSSVNYHHFGGSKVWYIVPPEDADKFEKFLRSKESQNCPQFLDHKNIFVSPEQLKKAGIPV 80 (114)
T ss_dssp EEEEE-TTEEEEEEE-GGG-EEEEEEEEESEEEEEEE-GGGHHHHHHHHHHHHHHHSTTGGCTGGEEEGHHHHHHTTS--
T ss_pred CEEEeCCCcCCCcEecCCCCceeeeeccCcceEeEEechhhhhhHHHHHhhcccccccccccccccccceeeeeccCccc
Confidence 5899999999999999999984 88888999999999999877655321110 00000 010000013789
Q ss_pred EEEEEcCCCEEEeCCCCeEEEEeCCCcEEEEeec
Q psy12281 300 IIVEQEAGDSIFVPSGWHHQVTNLEHTISINHNW 333 (456)
Q Consensus 300 ~~~~l~pGD~LfIPsgW~H~V~nl~~sIsvn~~~ 333 (456)
++++|+|||+||||+||||+|.|+++||+++.||
T Consensus 81 ~~~~Q~~Ge~V~i~pg~~H~v~n~g~~i~~a~Nf 114 (114)
T PF02373_consen 81 YRFVQKPGEFVFIPPGAYHQVFNLGDNISEAVNF 114 (114)
T ss_dssp EEEEEETT-EEEE-TT-EEEEEESSSEEEEEEEE
T ss_pred ccceECCCCEEEECCCceEEEEeCCceEEEEecC
Confidence 9999999999999999999999999999999987
No 11
>COG2850 Uncharacterized conserved protein [Function unknown]
Probab=99.59 E-value=6.3e-15 Score=145.92 Aligned_cols=262 Identities=13% Similarity=0.122 Sum_probs=143.0
Q ss_pred CCCCHHHHHHhhcCCCCcEEEecCCCccccccccccCCCCChhHHHHHhCCceEEEEecCcccccCCceEEeeHHHHHHH
Q psy12281 93 PSHIYPYCFNNFLSKNKPFILKSCVNEWNSSLHWVQNKQPNFEYLSNHYGDIEVPVANCNSYYFNAHEKTNMTLKEYTIY 172 (456)
Q Consensus 93 ~~ls~~~F~~~y~~~~~PvVi~g~~~~Wpa~~~Wt~~~~~~~d~L~~~~Gd~~V~V~~~~~~~y~~~~~~~m~l~efl~~ 172 (456)
..+|+++|+++|+ ..||+|||+++.+--. .. .++.|++-.-...|...--....-+......-+|.++=-+
T Consensus 13 ~glt~~~FL~~YW-qkKPlliR~a~p~~~~-p~-------~pdeLa~La~~edV~srLV~~~~~~~w~~~~gPfe~~d~~ 83 (383)
T COG2850 13 LGLTPEDFLRDYW-QKKPLLIRNAFPEFHS-PL-------SPDELAGLAQEEDVTSRLVSHESDGTWQVSHGPFEEEDFL 83 (383)
T ss_pred CCCCHHHHHHHHh-hhcchHHhhccccccc-CC-------CHHHHHHHhccccccchhhhhccCCceeEeeCccchhccc
Confidence 4799999999999 6799999998854211 12 3466776665544432100000000111122222222111
Q ss_pred HHhcccCCCCCCCCeeeeeccccccCCCCCcccccCcccCcchhhHhhhhhccCCCccceeeeCCCCCcccccccccCcc
Q psy12281 173 WQNKIDGKLSETEPLYYLKDWHFTRDFKTEDIYRVPNVFSSDWLNEYYSEHLEHKDDYRFVYMGPKETWTPLHADVFHSY 252 (456)
Q Consensus 173 ~~~~~~~~~~~~~~~~Yl~d~~~~~~~p~l~~y~~P~~f~~D~ln~~~~~~~~~~~~~~~l~iG~~gs~T~lH~D~~~~~ 252 (456)
. ....+-|.-.......|++...--|--|-.+| +-+...|...+.|.+...|+|.++++
T Consensus 84 -~---------~p~~wsllvq~vd~w~p~v~~l~~~FrflP~w-----------r~ddiMIS~a~~GGgvg~H~D~YDVf 142 (383)
T COG2850 84 -G---------LPRNWSLLVQAVDHWHPEVAALMEPFRFLPDW-----------RIDDIMISFAAPGGGVGPHFDQYDVF 142 (383)
T ss_pred -c---------CCcCceEEEehhhhcCHHHHHHHHHhccCccc-----------cccceEEEEecCCCccCccccchhee
Confidence 0 00112222111112223322111111122233 33445566666677799999999999
Q ss_pred ceeEEEEeEEEEEEeCCCCCcccccCCCCccccCCCcccCCCCCCceEEEEEcCCCEEEeCCCCeEEEEeCCCcEEEEee
Q psy12281 253 SWSVNICGRKQWLLLAPGNEKYFKDSMGNLISDMRSVDWSTLPRDTVIIVEQEAGDSIFVPSGWHHQVTNLEHTISINHN 332 (456)
Q Consensus 253 sW~~~i~G~Krw~L~pP~~~~~Ly~~~g~~p~d~~~pd~~~fp~~~~~~~~l~pGD~LfIPsgW~H~V~nl~~sIsvn~~ 332 (456)
..|..|+|||.+-.+.....+++..-. .+.. |-...++.+++|||+||||+||||.-...++|+..+..
T Consensus 143 --liQg~G~RRW~v~~~~~~~~~~~~~d~---~~~~------~f~~~~d~vlepGDiLYiPp~~~H~gvae~dc~tySvG 211 (383)
T COG2850 143 --LIQGQGRRRWRVGKKCNMSTLCPHPDL---LILA------PFEPDIDEVLEPGDILYIPPGFPHYGVAEDDCMTYSVG 211 (383)
T ss_pred --EEeecccceeecCCcccccCcCCCcch---hhcC------CCCchhhhhcCCCceeecCCCCCcCCcccccccceeee
Confidence 999999999999877665544431100 0000 11455678999999999999999999999999999998
Q ss_pred cCCCCCHHHHHHHHHHHHH----HHHHhhhhccc-------chhHHHHHHHHHhhhcCCCHHHHHHHHH-HHHHhh
Q psy12281 333 WINGTNIDHVYHEMVSHLE----AVKKEIDDCKD-------MDDWTSHCQLMLQVSFGINFRQFFDMLK-FICQKR 396 (456)
Q Consensus 333 ~~~~~nl~~~~~~l~~~~~----~~~~~~~~~~~-------~~~~~~~~~~~l~~~~g~~~~~f~~~l~-~~~~~r 396 (456)
+..++..+.+=+++..-.. ..+-.-.++.. ...+-+....||.+. -++++.|..+-. ++++.|
T Consensus 212 ~r~Pn~~ell~~~~~~~~~~~~~~~rysd~~~~~~~~p~~~~~~~~d~l~~~l~~~-~~~~e~f~~~Fg~~~~t~~ 286 (383)
T COG2850 212 FRAPNTRELLSGFLDFFLQRESDELRYSDADLPPRGHPGELLPAMVDALAAMLLDL-LNQPEAFDNWFGEFITTPR 286 (383)
T ss_pred ccCCcHHHHHHHHHhhccccccccccccccCCCCCCCcccccHHHHHHHHHHHHHH-hcCHHHHHHHHHHHhcCcc
Confidence 9888765544444332211 22223333321 224444444455542 233666554433 666555
No 12
>KOG1633|consensus
Probab=99.30 E-value=6.1e-12 Score=137.83 Aligned_cols=192 Identities=21% Similarity=0.418 Sum_probs=135.4
Q ss_pred HHHHhC-CceEEEEecCcccccCCceEEeeHHHHHHHHHhcccCCCCCCCCeeeeeccccccCCCCC-cccccCcccC-c
Q psy12281 137 LSNHYG-DIEVPVANCNSYYFNAHEKTNMTLKEYTIYWQNKIDGKLSETEPLYYLKDWHFTRDFKTE-DIYRVPNVFS-S 213 (456)
Q Consensus 137 L~~~~G-d~~V~V~~~~~~~y~~~~~~~m~l~efl~~~~~~~~~~~~~~~~~~Yl~d~~~~~~~p~l-~~y~~P~~f~-~ 213 (456)
.+...| ++.+.|.+.+ .+...+|+|.+|++|+..... +..|-+.+..|...- | ...+.|.... -
T Consensus 42 v~~~vg~~r~~~v~dv~-----~q~~~km~~~~~~~yy~~~~~------~R~yNVisLEfS~Tr--l~~~V~~P~ivr~l 108 (776)
T KOG1633|consen 42 VKELVGSDRMIDVVDVN-----TQKDCKMTLKEFVKYYSSPQR------KRLYNVISLEFSDTR--LSNLVESPEIVRKL 108 (776)
T ss_pred hHHhhCCCccceeeeee-----ccccccccHHHHhhhhcCcch------hhhhheeccccCcch--HHhcCCCchhhhhh
Confidence 334445 4566666543 455779999999999987643 222335555443211 1 1223343322 2
Q ss_pred chhhHhhhhhc--cCCCccceeeeCCCCCcccccccccCccceeEEEEeEEEEEEeCCCCCcc-cccCCCCccccCCCcc
Q psy12281 214 DWLNEYYSEHL--EHKDDYRFVYMGPKETWTPLHADVFHSYSWSVNICGRKQWLLLAPGNEKY-FKDSMGNLISDMRSVD 290 (456)
Q Consensus 214 D~ln~~~~~~~--~~~~~~~~l~iG~~gs~T~lH~D~~~~~sW~~~i~G~Krw~L~pP~~~~~-Ly~~~g~~p~d~~~pd 290 (456)
||.+.+|...- .......++.|+.+++.|.+|.|..++.+|+..+.|.|.++|.||..... +|.... ...+.
T Consensus 109 dwV~n~wP~~~~~~~P~vqkyclmsv~~~Ytdfhidfggtsvwyhil~G~K~f~lI~pt~~nl~~ye~w~-----~s~~q 183 (776)
T KOG1633|consen 109 DWVDNQWPDDLKMEYPKVQKYCLMSVKDSYTDFHIDFGGTSVWYHILAGEKTFYLIPPTCENLELYECWE-----SSTPQ 183 (776)
T ss_pred hchhccCCchhcccccccccceeeeccccccccccCCCCcchhhhhhccccceeeeCCcccchhhhhhhh-----hcccc
Confidence 67777765422 13455678889999999999999999999999999999999999998764 333211 11111
Q ss_pred cC-CCCC--CceEEEEEcCCCEEEeCCCCeEEEEeCCCcEEEEeecCCCCCHHHHHHHH
Q psy12281 291 WS-TLPR--DTVIIVEQEAGDSIFVPSGWHHQVTNLEHTISINHNWINGTNIDHVYHEM 346 (456)
Q Consensus 291 ~~-~fp~--~~~~~~~l~pGD~LfIPsgW~H~V~nl~~sIsvn~~~~~~~nl~~~~~~l 346 (456)
-+ -|.. ..++.|++++|+++|||+||.|.|++..+|+++..||....++...++.-
T Consensus 184 ~~~ffGd~VdkC~~~~l~~g~T~~iPsGwIhAV~Tp~d~l~fgGnflhsl~i~~QLr~y 242 (776)
T KOG1633|consen 184 DEIFFGDCVDKCYKCILKQGQTLFIPSGWIHAVLTPTDCLVFGGNFLHSLTIEMQLRIY 242 (776)
T ss_pred cccccCCccceeEEEEeccCceEecccceeEeeecCcchheeccchhhhhhhHHHHHHH
Confidence 11 1443 78999999999999999999999999999999999999999988877643
No 13
>KOG1356|consensus
Probab=99.00 E-value=2.7e-11 Score=130.24 Aligned_cols=241 Identities=21% Similarity=0.303 Sum_probs=139.6
Q ss_pred HHHHHhhcCCCCcEEEecCCCccccccccccCCCCChhHHHHHhCCceEEEEecCcccccCCceEEeeHHHHHHHHHhcc
Q psy12281 98 PYCFNNFLSKNKPFILKSCVNEWNSSLHWVQNKQPNFEYLSNHYGDIEVPVANCNSYYFNAHEKTNMTLKEYTIYWQNKI 177 (456)
Q Consensus 98 ~~F~~~y~~~~~PvVi~g~~~~Wpa~~~Wt~~~~~~~d~L~~~~Gd~~V~V~~~~~~~y~~~~~~~m~l~efl~~~~~~~ 177 (456)
.-| ++++..++||||+|....... ..|- ++.|...||+..+.+.+|+.. +....-.+++|.+-+....
T Consensus 531 ~~F-QEhWkqGqPViVs~V~~~l~g-~lW~------P~a~~~~~g~q~~~l~n~~~~----~i~s~d~~~~fwegFe~~~ 598 (889)
T KOG1356|consen 531 KHF-QEHWKQGQPVIVSGVHKKLNG-LLWK------PEALSRAFGDQVVDLSNCNNS----QIISNDCVDNFWEGFEGYS 598 (889)
T ss_pred HHH-HHHHhcCCcEEehHhhhhccc-cccc------hHHHHHHhccchhhhhcCCCC----CccccchhhhHHHhhcccc
Confidence 345 677889999999998865544 3684 689999999998888777532 2222334445554444332
Q ss_pred cCCCC--CCCCeeeeeccccccCCCCC---------cccccCcccCcc-hhhHhhhhhcc--CCCcc-ceee--------
Q psy12281 178 DGKLS--ETEPLYYLKDWHFTRDFKTE---------DIYRVPNVFSSD-WLNEYYSEHLE--HKDDY-RFVY-------- 234 (456)
Q Consensus 178 ~~~~~--~~~~~~Yl~d~~~~~~~p~l---------~~y~~P~~f~~D-~ln~~~~~~~~--~~~~~-~~l~-------- 234 (456)
.+-.. .....+-|+||.-.+.|.++ ....+|.|...+ -|| +....+. .++|. .-+|
T Consensus 599 kr~~~~~g~p~vLKLKDWpp~~~Fkd~lP~r~eell~sLPlpEYt~r~G~LN-lAs~LP~~fv~PDLGPk~y~AYG~~~e 677 (889)
T KOG1356|consen 599 KRLKSENGWPEVLKLKDWPPGEDFKDMLPRRFEELLASLPLPEYTDRDGKLN-LASKLPEGFVRPDLGPKLYNAYGVSTE 677 (889)
T ss_pred cCcccccCCeeEEeecCCCchHhHhhhhhHHHHHHHHcCCchhhhcCCCccc-hHhhCcccccCCCCCchhhhhcccccc
Confidence 21111 12345778999644333221 122233332221 111 0000000 01111 1122
Q ss_pred eCCCCCcccccccccCccceeEEEE---------------------------------eE----EEEEEeCCCCCccccc
Q psy12281 235 MGPKETWTPLHADVFHSYSWSVNIC---------------------------------GR----KQWLLLAPGNEKYFKD 277 (456)
Q Consensus 235 iG~~gs~T~lH~D~~~~~sW~~~i~---------------------------------G~----Krw~L~pP~~~~~Ly~ 277 (456)
.|.+.+.|.||.|+.+.. +.++. +. -.|-+|+..+.+.|..
T Consensus 678 ~gr~~gtTnLH~dvSDaV--NILvyv~e~~~~~~~~~~~~k~~~~~~~de~~~~~~~~~~e~~GALWhIF~~~Dv~Kire 755 (889)
T KOG1356|consen 678 LGRGDGTTNLHLDVSDAV--NILVYVGEPPGQIEQIAKVLKKIQEGDLDEITRSRISSVSETPGALWHIFRAQDVPKIRE 755 (889)
T ss_pred ccCCCCceeeceehhhhh--hheeeeccCCchHHhHHHHHHhhhhcchhhhhhhhccccccCCcchhhhhhhcchHHHHH
Confidence 233345699999999977 44441 11 3455555555444433
Q ss_pred CC-------CC-cc--ccCCCc---ccCCC------C--CCceEEEEEcCCCEEEeCCCCeEEEEeCCCcEEEEeecCCC
Q psy12281 278 SM-------GN-LI--SDMRSV---DWSTL------P--RDTVIIVEQEAGDSIFVPSGWHHQVTNLEHTISINHNWING 336 (456)
Q Consensus 278 ~~-------g~-~p--~d~~~p---d~~~f------p--~~~~~~~~l~pGD~LfIPsgW~H~V~nl~~sIsvn~~~~~~ 336 (456)
.. |. .+ ++.-++ -++++ . ...+++++|..||+||||+|-.|||+||-.||.|+..|+++
T Consensus 756 yL~k~~~E~~~~~~~v~hPIhDQS~YLd~~lr~RLkeEyGVe~WtfvQ~LGdAVfIPAGaPHQVrNLkSCikVa~DFVSP 835 (889)
T KOG1356|consen 756 YLRKVCKEQGHEVPKVHHPIHDQSWYLDRYLRRRLKEEYGVEPWTFVQFLGDAVFIPAGAPHQVRNLKSCIKVAEDFVSP 835 (889)
T ss_pred HHHHhhHHhcCCCCcccCCCcccceeccHHHHHHHHHHhCCCccchhhcccceEEecCCCcHHhhhhhhHHHHHHhhCCh
Confidence 21 11 00 111110 11111 1 28999999999999999999999999999999999999999
Q ss_pred CCHHHHHHHHHHHHHHH
Q psy12281 337 TNIDHVYHEMVSHLEAV 353 (456)
Q Consensus 337 ~nl~~~~~~l~~~~~~~ 353 (456)
.|+...++..+..+.++
T Consensus 836 E~v~ec~rLT~EfR~Lp 852 (889)
T KOG1356|consen 836 EHVSECFRLTQEFRQLP 852 (889)
T ss_pred hhHHHHHHHHHHHhhCC
Confidence 99999999887555444
No 14
>KOG3706|consensus
Probab=98.80 E-value=3.5e-09 Score=107.74 Aligned_cols=96 Identities=21% Similarity=0.288 Sum_probs=69.1
Q ss_pred CCccceeeeCCCCC-cccccccccCccceeEEEEeEEEEEEeCCCCC-cccccCCCCccccCCCcccCCCCCCceEEEEE
Q psy12281 227 KDDYRFVYMGPKET-WTPLHADVFHSYSWSVNICGRKQWLLLAPGNE-KYFKDSMGNLISDMRSVDWSTLPRDTVIIVEQ 304 (456)
Q Consensus 227 ~~~~~~l~iG~~gs-~T~lH~D~~~~~sW~~~i~G~Krw~L~pP~~~-~~Ly~~~g~~p~d~~~pd~~~fp~~~~~~~~l 304 (456)
...-.++|+-|+|| +-+.|+|--+.| ..||.|||+|+||.|... ..|+-... -+++..++ +.+.++.++
T Consensus 315 c~vGaNvYLTPagSqGfaPHyDdIeaF--vlQvEGrK~Wrly~P~~~~eel~l~sS---~Nf~eedl----gePV~e~vl 385 (629)
T KOG3706|consen 315 CLVGANVYLTPAGSQGFAPHYDDIEAF--VLQVEGRKHWRLYHPTVPLEELALVSS---DNFTEEDL----GEPVHEFVL 385 (629)
T ss_pred cccccceeecCCCCCCCCCchhhhhhh--hheeccceeeEeecCCCcHhhhhhccC---CCCChhHh----CCchHHhhc
Confidence 34456799999887 688999999988 999999999999977653 33332100 01111222 257789999
Q ss_pred cCCCEEEeCCCCeEEEEeCCC--cEEEEe
Q psy12281 305 EAGDSIFVPSGWHHQVTNLEH--TISINH 331 (456)
Q Consensus 305 ~pGD~LfIPsgW~H~V~nl~~--sIsvn~ 331 (456)
+|||+||+|.|..||..+.+- |+.|+.
T Consensus 386 e~GDllYfPRG~IHQA~t~~~vHSlHvTl 414 (629)
T KOG3706|consen 386 EPGDLLYFPRGTIHQADTPALVHSLHVTL 414 (629)
T ss_pred CCCcEEEecCcceeeccccchhceeEEEe
Confidence 999999999999999877654 444443
No 15
>COG2140 Thermophilic glucose-6-phosphate isomerase and related metalloenzymes [Carbohydrate transport and metabolism / General function prediction only]
Probab=97.39 E-value=0.0011 Score=62.19 Aligned_cols=75 Identities=19% Similarity=0.268 Sum_probs=57.6
Q ss_pred CCCCcccccccccCc-cceeEEEEeEEEEEEeCCCCCcccccCCCCccccCCCcccCCCCCCceEEEEEcCCCEEEeCCC
Q psy12281 237 PKETWTPLHADVFHS-YSWSVNICGRKQWLLLAPGNEKYFKDSMGNLISDMRSVDWSTLPRDTVIIVEQEAGDSIFVPSG 315 (456)
Q Consensus 237 ~~gs~T~lH~D~~~~-~sW~~~i~G~Krw~L~pP~~~~~Ly~~~g~~p~d~~~pd~~~fp~~~~~~~~l~pGD~LfIPsg 315 (456)
..|+...+|+.+... --|...|.|+-+.+|+.|.- +..++.++|||++|||++
T Consensus 88 t~G~~~~~H~Hp~ade~E~y~vi~G~g~m~v~~~~G--------------------------~~~v~~~~~Gd~iyVPp~ 141 (209)
T COG2140 88 TPGAMRELHYHPNADEPEIYYVLKGEGRMLVQKPEG--------------------------EARVIAVRAGDVIYVPPG 141 (209)
T ss_pred cCCcccccccCCCCCcccEEEEEeccEEEEEEcCCC--------------------------cEEEEEecCCcEEEeCCC
Confidence 345557778877643 13899999999999987752 356899999999999999
Q ss_pred CeEEEEeCCC-cEEEEeecCCCC
Q psy12281 316 WHHQVTNLEH-TISINHNWINGT 337 (456)
Q Consensus 316 W~H~V~nl~~-sIsvn~~~~~~~ 337 (456)
|-|.+.|.++ .+-+..+|....
T Consensus 142 ~gH~t~N~Gd~pLvf~~v~~~~~ 164 (209)
T COG2140 142 YGHYTINTGDEPLVFLNVYPADA 164 (209)
T ss_pred cceEeecCCCCCEEEEEEEeCCC
Confidence 9999999998 555555555443
No 16
>smart00558 JmjC A domain family that is part of the cupin metalloenzyme superfamily. Probable enzymes, but of unknown functions, that regulate chromatin reorganisation processes (Clissold and Ponting, in press).
Probab=96.80 E-value=0.0011 Score=49.27 Aligned_cols=45 Identities=24% Similarity=0.357 Sum_probs=30.9
Q ss_pred CcchhhHhhhhhccCCCccceeeeCCCCCcccccccccCccceeEEEEe
Q psy12281 212 SSDWLNEYYSEHLEHKDDYRFVYMGPKETWTPLHADVFHSYSWSVNICG 260 (456)
Q Consensus 212 ~~D~ln~~~~~~~~~~~~~~~l~iG~~gs~T~lH~D~~~~~sW~~~i~G 260 (456)
..++++.+..... ..+.+++|||+.+|.|++|.|.++.. +..+.|
T Consensus 11 ~~~ll~~~~~~~~--~~~~~~~~~G~~~s~t~~H~d~~~~~--n~~~~~ 55 (57)
T smart00558 11 KLNLLSDLPEDIL--GPDVPYLYMGMAGSVTPWHIDDYDLV--NYLHQG 55 (57)
T ss_pred cchHHHHCCcccC--CCCcceEEEeCCCCccceeEcCCCeE--EEEEec
Confidence 3455554332222 44559999999999999999999974 445554
No 17
>PF00190 Cupin_1: Cupin; InterPro: IPR006045 This family represents the conserved barrel domain of the 'cupin' superfamily ('cupa' is the Latin term for a small barrel). This family contains 11S and 7S plant seed storage proteins, and germins. Plant seed storage proteins provide the major nitrogen source for the developing plant. ; GO: 0045735 nutrient reservoir activity; PDB: 2E9Q_A 2EVX_A 1OD5_A 1UCX_A 1UD1_C 1FXZ_C 3KGL_C 3KSC_D 1UIJ_F 1IPK_B ....
Probab=96.52 E-value=0.01 Score=52.87 Aligned_cols=80 Identities=21% Similarity=0.211 Sum_probs=53.8
Q ss_pred eeeCCCCCcccccccccCccceeEEEEeEEEEEEeCCCCCcccccCCCCccccCCCcccCCCCCCceEEE--EEcCCCEE
Q psy12281 233 VYMGPKETWTPLHADVFHSYSWSVNICGRKQWLLLAPGNEKYFKDSMGNLISDMRSVDWSTLPRDTVIIV--EQEAGDSI 310 (456)
Q Consensus 233 l~iG~~gs~T~lH~D~~~~~sW~~~i~G~Krw~L~pP~~~~~Ly~~~g~~p~d~~~pd~~~fp~~~~~~~--~l~pGD~L 310 (456)
+.|.| |+...+|+. +...|..++.|+=+..++.|+.... +.+...- .+++||++
T Consensus 39 ~~i~p-g~~~~Ph~h--~a~~i~~V~~G~~~~~~v~~~~~~~---------------------~~~~~~~~v~l~~Gdv~ 94 (144)
T PF00190_consen 39 VLIEP-GGLRAPHYH--NADEIVYVIEGRGRVGVVGPGGPQE---------------------EFRDFSQKVRLKAGDVF 94 (144)
T ss_dssp EEEET-TEEEEEEEE--SSEEEEEEEESEEEEEEEETTCSSS---------------------EEEEEEEEEEEETTEEE
T ss_pred eehhc-CCccceeEe--eeeEEeeeeccceEEEEEecCCccc---------------------cceeeeceeeeecccce
Confidence 44455 555888888 5556799999999988887765310 0112222 69999999
Q ss_pred EeCCCCeEEEEeCC--CcEEEEeecCCC
Q psy12281 311 FVPSGWHHQVTNLE--HTISINHNWING 336 (456)
Q Consensus 311 fIPsgW~H~V~nl~--~sIsvn~~~~~~ 336 (456)
+||+||.|...|.+ +.+.+.....+.
T Consensus 95 ~vP~G~~h~~~n~~~~~~~~~~~f~~~~ 122 (144)
T PF00190_consen 95 VVPAGHPHWIINDGDDEALVLIIFDTNN 122 (144)
T ss_dssp EE-TT-EEEEEECSSSSEEEEEEEEESS
T ss_pred eeccceeEEEEcCCCCCCEEEEEEECCC
Confidence 99999999999995 455555544443
No 18
>PF06560 GPI: Glucose-6-phosphate isomerase (GPI); InterPro: IPR010551 This entry consists of several bacterial and archaeal glucose-6-phosphate isomerase (GPI) proteins (5.3.1.9 from EC), which are involved in glycolysis and in gluconeogenesis and catalyse the conversion of D-glucose 6-phosphate to D-fructose 6-phosphate. The deduced amino acid sequence of the first archaeal PGI isolated from Pyrococcus furiosus revealed that it is not related to its eukaryotic and many of its bacterial counterparts. In contrast, this archaeal PGI shares similarity with the cupin superfamily that consists of a variety of proteins that are generally involved in sugar metabolism in both prokaryotes and eukaryotes [].; GO: 0004347 glucose-6-phosphate isomerase activity, 0006094 gluconeogenesis, 0006096 glycolysis, 0005737 cytoplasm; PDB: 1J3Q_B 1J3R_B 1J3P_A 2GC0_A 1X8E_A 1X82_A 1QY4_B 2GC2_B 1QXJ_A 1QXR_B ....
Probab=95.95 E-value=0.024 Score=52.53 Aligned_cols=59 Identities=24% Similarity=0.451 Sum_probs=38.5
Q ss_pred eEEEEeEEEEEEeCCCCCcccccCCCCccccCCCcccCCCCCCceEEEEEcCCCEEEeCCCCeEEEEeCCCcEEEEeecC
Q psy12281 255 SVNICGRKQWLLLAPGNEKYFKDSMGNLISDMRSVDWSTLPRDTVIIVEQEAGDSIFVPSGWHHQVTNLEHTISINHNWI 334 (456)
Q Consensus 255 ~~~i~G~Krw~L~pP~~~~~Ly~~~g~~p~d~~~pd~~~fp~~~~~~~~l~pGD~LfIPsgW~H~V~nl~~sIsvn~~~~ 334 (456)
+..+.|+-..+|..+.... -...+.+...|||+++||++|.|.+.|.++.--+..+|+
T Consensus 87 Y~vl~G~g~~lLq~~~~~~----------------------~~~~~~v~~~~G~~v~IPp~yaH~tIN~g~~~L~~~~~~ 144 (182)
T PF06560_consen 87 YEVLSGEGLILLQKEEGDD----------------------VGDVIAVEAKPGDVVYIPPGYAHRTINTGDEPLVFAAWV 144 (182)
T ss_dssp EEEEESSEEEEEE-TTS---------------------------EEEEEE-TTEEEEE-TT-EEEEEE-SSS-EEEEEEE
T ss_pred EEEEeCEEEEEEEecCCCc----------------------ceeEEEEEeCCCCEEEECCCceEEEEECCCCcEEEEEEE
Confidence 5788999888887655411 035678999999999999999999999999444444444
Q ss_pred C
Q psy12281 335 N 335 (456)
Q Consensus 335 ~ 335 (456)
+
T Consensus 145 ~ 145 (182)
T PF06560_consen 145 P 145 (182)
T ss_dssp E
T ss_pred e
Confidence 4
No 19
>TIGR03037 anthran_nbaC 3-hydroxyanthranilate 3,4-dioxygenase. Members of this protein family, from both bacteria and eukaryotes, are the enzyme 3-hydroxyanthranilate 3,4-dioxygenase. This enzyme acts on the tryptophan metabolite 3-hydroxyanthranilate and produces 2-amino-3-carboxymuconate semialdehyde, which can rearrange spontaneously to quinolinic acid and feed into nicotinamide biosynthesis, or undergo further enzymatic degradation.
Probab=95.83 E-value=0.044 Score=49.61 Aligned_cols=75 Identities=16% Similarity=0.194 Sum_probs=58.8
Q ss_pred ccceeeeCCCCCcccccccccCccceeEEEEeEEEEEEeCCCCCcccccCCCCccccCCCcccCCCCCCceEEEEEcCCC
Q psy12281 229 DYRFVYMGPKETWTPLHADVFHSYSWSVNICGRKQWLLLAPGNEKYFKDSMGNLISDMRSVDWSTLPRDTVIIVEQEAGD 308 (456)
Q Consensus 229 ~~~~l~iG~~gs~T~lH~D~~~~~sW~~~i~G~Krw~L~pP~~~~~Ly~~~g~~p~d~~~pd~~~fp~~~~~~~~l~pGD 308 (456)
++.-+.+|+.+..+.+|.+..+. |+.|+.|.=...+...+ +...+.+++||
T Consensus 28 ~~~v~~vgGpn~R~d~H~~~tdE--~FyqleG~~~l~v~d~g---------------------------~~~~v~L~eGd 78 (159)
T TIGR03037 28 EFMVTVVGGPNARTDFHDDPGEE--FFYQLKGEMYLKVTEEG---------------------------KREDVPIREGD 78 (159)
T ss_pred cEEEEEeCCCCCCcccccCCCce--EEEEEcceEEEEEEcCC---------------------------cEEEEEECCCC
Confidence 55666788888889999977554 59999999776654332 23469999999
Q ss_pred EEEeCCCCeEEEEeCCCcEEEEee
Q psy12281 309 SIFVPSGWHHQVTNLEHTISINHN 332 (456)
Q Consensus 309 ~LfIPsgW~H~V~nl~~sIsvn~~ 332 (456)
+++||+|--|.....++|+.+-.-
T Consensus 79 ~flvP~gvpHsP~r~~~t~~LvIE 102 (159)
T TIGR03037 79 IFLLPPHVPHSPQRPAGSIGLVIE 102 (159)
T ss_pred EEEeCCCCCcccccCCCcEEEEEE
Confidence 999999999999998888777653
No 20
>PRK04190 glucose-6-phosphate isomerase; Provisional
Probab=95.61 E-value=0.094 Score=49.15 Aligned_cols=67 Identities=21% Similarity=0.256 Sum_probs=46.4
Q ss_pred ccccccccC-ccceeEEEEeEEEEEEeCCCCCcccccCCCCccccCCCcccCCCCCCceEEEEEcCCCEEEeCCCCeEEE
Q psy12281 242 TPLHADVFH-SYSWSVNICGRKQWLLLAPGNEKYFKDSMGNLISDMRSVDWSTLPRDTVIIVEQEAGDSIFVPSGWHHQV 320 (456)
Q Consensus 242 T~lH~D~~~-~~sW~~~i~G~Krw~L~pP~~~~~Ly~~~g~~p~d~~~pd~~~fp~~~~~~~~l~pGD~LfIPsgW~H~V 320 (456)
|+.|+.... ..=++..+.|+-.++|-.++ .......++|||+++||+||-|.+
T Consensus 87 t~gH~H~~~~~~EiyyvlsG~g~~~l~~~~--------------------------G~~~~~~v~pGd~v~IPpg~~H~~ 140 (191)
T PRK04190 87 TKGHFHAKADRAEIYYGLKGKGLMLLQDPE--------------------------GEARWIEMEPGTVVYVPPYWAHRS 140 (191)
T ss_pred CCCeEcCCCCCCEEEEEEeCEEEEEEecCC--------------------------CcEEEEEECCCCEEEECCCCcEEe
Confidence 666765432 12237789999888874321 134578999999999999999999
Q ss_pred EeCCC--cEEEEeecC
Q psy12281 321 TNLEH--TISINHNWI 334 (456)
Q Consensus 321 ~nl~~--sIsvn~~~~ 334 (456)
.|.++ -.-++.++.
T Consensus 141 iN~G~epl~fl~v~p~ 156 (191)
T PRK04190 141 VNTGDEPLVFLACYPA 156 (191)
T ss_pred EECCCCCEEEEEEEcC
Confidence 99887 344444333
No 21
>PF07883 Cupin_2: Cupin domain; InterPro: IPR013096 This family represents the conserved barrel domain of the cupin superfamily [] (cupa is the Latin term for a small barrel). ; PDB: 2OPK_C 3BU7_B 2PHD_D 3NVC_A 3NKT_A 3NJZ_A 3NW4_A 3NST_A 3NL1_A 2H0V_A ....
Probab=95.59 E-value=0.024 Score=43.37 Aligned_cols=57 Identities=25% Similarity=0.303 Sum_probs=44.8
Q ss_pred CCCcccccccccCccceeEEEEeEEEEEEeCCCCCcccccCCCCccccCCCcccCCCCCCceEEEEEcCCCEEEeCCCCe
Q psy12281 238 KETWTPLHADVFHSYSWSVNICGRKQWLLLAPGNEKYFKDSMGNLISDMRSVDWSTLPRDTVIIVEQEAGDSIFVPSGWH 317 (456)
Q Consensus 238 ~gs~T~lH~D~~~~~sW~~~i~G~Krw~L~pP~~~~~Ly~~~g~~p~d~~~pd~~~fp~~~~~~~~l~pGD~LfIPsgW~ 317 (456)
+|..++.|.+.... -+..++.|+-.+.+ . .-...++|||++++|++=+
T Consensus 7 pG~~~~~h~H~~~~-e~~~vl~G~~~~~~-~------------------------------~~~~~l~~Gd~~~i~~~~~ 54 (71)
T PF07883_consen 7 PGGSIPPHRHPGED-EFFYVLSGEGTLTV-D------------------------------GERVELKPGDAIYIPPGVP 54 (71)
T ss_dssp TTEEEEEEEESSEE-EEEEEEESEEEEEE-T------------------------------TEEEEEETTEEEEEETTSE
T ss_pred CCCCCCCEECCCCC-EEEEEEECCEEEEE-c------------------------------cEEeEccCCEEEEECCCCe
Confidence 44568999976652 34889999977774 1 1158999999999999999
Q ss_pred EEEEeCCCc
Q psy12281 318 HQVTNLEHT 326 (456)
Q Consensus 318 H~V~nl~~s 326 (456)
|++.|.++.
T Consensus 55 H~~~n~~~~ 63 (71)
T PF07883_consen 55 HQVRNPGDE 63 (71)
T ss_dssp EEEEEESSS
T ss_pred EEEEECCCC
Confidence 999999873
No 22
>PRK13264 3-hydroxyanthranilate 3,4-dioxygenase; Provisional
Probab=95.55 E-value=0.054 Score=49.84 Aligned_cols=74 Identities=16% Similarity=0.190 Sum_probs=59.5
Q ss_pred ccceeeeCCCCCcccccccccCccceeEEEEeEEEEEEeCCCCCcccccCCCCccccCCCcccCCCCCCceEEEEEcCCC
Q psy12281 229 DYRFVYMGPKETWTPLHADVFHSYSWSVNICGRKQWLLLAPGNEKYFKDSMGNLISDMRSVDWSTLPRDTVIIVEQEAGD 308 (456)
Q Consensus 229 ~~~~l~iG~~gs~T~lH~D~~~~~sW~~~i~G~Krw~L~pP~~~~~Ly~~~g~~p~d~~~pd~~~fp~~~~~~~~l~pGD 308 (456)
++....+|.+|..+.+|.|....+ +.|+.|.=+..+...+ +...+.+.+||
T Consensus 34 d~~VmvvgGpn~r~d~H~~~tdE~--FyqleG~~~l~v~d~g---------------------------~~~~v~L~eGd 84 (177)
T PRK13264 34 DFIVMVVGGPNARTDFHYDPGEEF--FYQLEGDMYLKVQEDG---------------------------KRRDVPIREGE 84 (177)
T ss_pred CEEEEEEccCCcccccccCCCceE--EEEECCeEEEEEEcCC---------------------------ceeeEEECCCC
Confidence 555667788888899999887665 8999999777765422 22368999999
Q ss_pred EEEeCCCCeEEEEeCCCcEEEEe
Q psy12281 309 SIFVPSGWHHQVTNLEHTISINH 331 (456)
Q Consensus 309 ~LfIPsgW~H~V~nl~~sIsvn~ 331 (456)
+++||+|=-|.....++|+.+-.
T Consensus 85 ~fllP~gvpHsP~r~~~tv~Lvi 107 (177)
T PRK13264 85 MFLLPPHVPHSPQREAGSIGLVI 107 (177)
T ss_pred EEEeCCCCCcCCccCCCeEEEEE
Confidence 99999999999999888887776
No 23
>KOG2132|consensus
Probab=95.52 E-value=0.0064 Score=60.43 Aligned_cols=88 Identities=16% Similarity=0.164 Sum_probs=65.9
Q ss_pred CCCCCcccccccccCccceeEEEEeEEEEEEeCCCCCcccccC--CCCc---cccCCCcccCCCC---C--CceEEEEEc
Q psy12281 236 GPKETWTPLHADVFHSYSWSVNICGRKQWLLLAPGNEKYFKDS--MGNL---ISDMRSVDWSTLP---R--DTVIIVEQE 305 (456)
Q Consensus 236 G~~gs~T~lH~D~~~~~sW~~~i~G~Krw~L~pP~~~~~Ly~~--~g~~---p~d~~~pd~~~fp---~--~~~~~~~l~ 305 (456)
+.....|..|.|-..++ ..++.|-||-++++|.+...+..+ .-.+ -.|+..||+..++ . +....++..
T Consensus 181 pvEvg~~y~~~~w~q~l--~t~~~g~~R~~~~~p~~~~YlAqh~lfdqi~elk~Di~iPDyc~~~~f~~~~v~~~~w~Gp 258 (355)
T KOG2132|consen 181 PVEVGSTYADEDWSQNL--MTQIKGIKRCILFEPRECLYLAQHHLFDQIPELKFDISIPDYCSFPNFENEVVDINAWIGP 258 (355)
T ss_pred eeecccccchhHHHhhh--HHHhhhhhhhhcCCccccchhhhhhhhccchhhhhccCCCceeecCCCCccccceeEEecc
Confidence 33333489999999999 999999999999999933322111 1122 2688888887754 3 566677777
Q ss_pred CCCEEEeCCCCeEEEEeCCC
Q psy12281 306 AGDSIFVPSGWHHQVTNLEH 325 (456)
Q Consensus 306 pGD~LfIPsgW~H~V~nl~~ 325 (456)
+|++++||..|||.+....-
T Consensus 259 aGtV~pih~dp~hNi~~qv~ 278 (355)
T KOG2132|consen 259 AGTVLPIHMDPWHNILSQVF 278 (355)
T ss_pred CCceeccccccccceeeeee
Confidence 79999999999999998776
No 24
>TIGR03404 bicupin_oxalic bicupin, oxalate decarboxylase family. Members of this protein family are defined as bicupins as they have two copies of the cupin domain (pfam00190). Two different known activities for members of this family are oxalate decarboxylase (EC 4.1.1.2) and oxalate oxidase (EC 1.2.3.4), although the latter activity has more often been found in distantly related monocupin (germin) proteins.
Probab=95.05 E-value=0.11 Score=53.70 Aligned_cols=65 Identities=26% Similarity=0.365 Sum_probs=50.8
Q ss_pred eeeCCCCCcccccccccCccceeEEEEeEEEEEEeCCCCCcccccCCCCccccCCCcccCCCCCCceEEEEEcCCCEEEe
Q psy12281 233 VYMGPKETWTPLHADVFHSYSWSVNICGRKQWLLLAPGNEKYFKDSMGNLISDMRSVDWSTLPRDTVIIVEQEAGDSIFV 312 (456)
Q Consensus 233 l~iG~~gs~T~lH~D~~~~~sW~~~i~G~Krw~L~pP~~~~~Ly~~~g~~p~d~~~pd~~~fp~~~~~~~~l~pGD~LfI 312 (456)
+-+.| |..+++|..+.. --|...+.|+=+..++.+.. ...+..++|||++||
T Consensus 250 ~~l~P-G~~~~~H~H~~~-~E~~yvl~G~~~~~v~d~~g--------------------------~~~~~~l~~GD~~~i 301 (367)
T TIGR03404 250 VTVEP-GAMRELHWHPNA-DEWQYFIQGQARMTVFAAGG--------------------------NARTFDYQAGDVGYV 301 (367)
T ss_pred EEECC-CCccCCeeCcCC-CeEEEEEEEEEEEEEEecCC--------------------------cEEEEEECCCCEEEE
Confidence 44455 445899997654 35789999999999876542 233567999999999
Q ss_pred CCCCeEEEEeCCC
Q psy12281 313 PSGWHHQVTNLEH 325 (456)
Q Consensus 313 PsgW~H~V~nl~~ 325 (456)
|.|..|.++|.++
T Consensus 302 P~g~~H~i~N~G~ 314 (367)
T TIGR03404 302 PRNMGHYVENTGD 314 (367)
T ss_pred CCCCeEEEEECCC
Confidence 9999999999986
No 25
>TIGR02098 MJ0042_CXXC MJ0042 family finger-like domain. This domain contains a CXXCX(19)CXXC motif suggestive of both zinc fingers and thioredoxin, usually found at the N-terminus of prokaryotic proteins. One partially characterized gene, agmX, is among a large set in Myxococcus whose interruption affects adventurous gliding motility.
Probab=94.61 E-value=0.067 Score=36.31 Aligned_cols=34 Identities=24% Similarity=0.565 Sum_probs=29.5
Q ss_pred Eec-CCCCcEEeehhhhhcCCcEEeCCCCcceEEE
Q psy12281 24 YPC-PCGDRFQITKEQLKAGEDVATCPTCSLIIKP 57 (456)
Q Consensus 24 ~~C-rCg~~f~i~~~~l~~~~~~~~C~~CSl~i~~ 57 (456)
..| +||..|.|..+.+.+....+.|+.|...+.+
T Consensus 3 ~~CP~C~~~~~v~~~~~~~~~~~v~C~~C~~~~~~ 37 (38)
T TIGR02098 3 IQCPNCKTSFRVVDSQLGANGGKVRCGKCGHVWYA 37 (38)
T ss_pred EECCCCCCEEEeCHHHcCCCCCEEECCCCCCEEEe
Confidence 569 7999999999999876568999999988875
No 26
>COG0662 {ManC} Mannose-6-phosphate isomerase [Carbohydrate transport and metabolism]
Probab=93.95 E-value=0.31 Score=42.42 Aligned_cols=61 Identities=25% Similarity=0.255 Sum_probs=45.5
Q ss_pred ceeeeCCCCC-cccccccccCccceeEEEEeEEEEEEeCCCCCcccccCCCCccccCCCcccCCCCCCceEEEEEcCCCE
Q psy12281 231 RFVYMGPKET-WTPLHADVFHSYSWSVNICGRKQWLLLAPGNEKYFKDSMGNLISDMRSVDWSTLPRDTVIIVEQEAGDS 309 (456)
Q Consensus 231 ~~l~iG~~gs-~T~lH~D~~~~~sW~~~i~G~Krw~L~pP~~~~~Ly~~~g~~p~d~~~pd~~~fp~~~~~~~~l~pGD~ 309 (456)
..+.+.+.+. ..+.|.+ .+-+ ..++.|+=++.+= .-++.+.+||+
T Consensus 39 ~~~~v~pg~~~~~~~H~~-~dE~--~~Vl~G~g~v~~~-------------------------------~~~~~v~~gd~ 84 (127)
T COG0662 39 ARILVKPGEEISLHHHHH-RDEH--WYVLEGTGKVTIG-------------------------------GEEVEVKAGDS 84 (127)
T ss_pred EEEEECCCcccCcccccC-cceE--EEEEeeEEEEEEC-------------------------------CEEEEecCCCE
Confidence 3445555444 3667776 4445 6789999877761 23689999999
Q ss_pred EEeCCCCeEEEEeCCC
Q psy12281 310 IFVPSGWHHQVTNLEH 325 (456)
Q Consensus 310 LfIPsgW~H~V~nl~~ 325 (456)
+|||+|=.|.++|.+.
T Consensus 85 ~~iP~g~~H~~~N~G~ 100 (127)
T COG0662 85 VYIPAGTPHRVRNTGK 100 (127)
T ss_pred EEECCCCcEEEEcCCC
Confidence 9999999999999987
No 27
>TIGR02272 gentisate_1_2 gentisate 1,2-dioxygenase. This family consists of gentisate 1,2-dioxygenases. This ring-opening enzyme acts in salicylate degradation that goes via gentisate rather than via catechol. It converts gentisate to maleylpyruvate. Some putative gentisate 1,2-dioxygenases are excluded by a relatively high trusted cutoff score because they are too closely related to known examples of 1-hydroxy-2-naphthoate dioxygenase. Therefore some homologs may be bona fide gentisate 1,2-dioxygenases even if they score below the given cutoffs.
Probab=93.41 E-value=0.18 Score=51.32 Aligned_cols=66 Identities=17% Similarity=0.095 Sum_probs=48.6
Q ss_pred CCCCcccccccccCccceeEEEEeEEEEEEeCCCCCcccccCCCCccccCCCcccCCCCCCceEEEEEcCCCEEEeCCCC
Q psy12281 237 PKETWTPLHADVFHSYSWSVNICGRKQWLLLAPGNEKYFKDSMGNLISDMRSVDWSTLPRDTVIIVEQEAGDSIFVPSGW 316 (456)
Q Consensus 237 ~~gs~T~lH~D~~~~~sW~~~i~G~Krw~L~pP~~~~~Ly~~~g~~p~d~~~pd~~~fp~~~~~~~~l~pGD~LfIPsgW 316 (456)
++|..++.|......+ ..+|.|.-.++.+ ...++..++||++.+|++.
T Consensus 89 ~pGe~~~~HRht~sAl--~~vveG~G~~t~V------------------------------~g~~~~~~~gD~~~tP~w~ 136 (335)
T TIGR02272 89 LPGEVAPSHRHTQSAL--RFIVEGKGAFTAV------------------------------DGERTTMHPGDFIITPSWT 136 (335)
T ss_pred CCCCCCCccccccceE--EEEEEcCceEEEE------------------------------CCEEEeeeCCCEEEeCCCe
Confidence 4566688888777666 6777777555552 2347899999999999999
Q ss_pred eEEEEeCCCcEEEEeecC
Q psy12281 317 HHQVTNLEHTISINHNWI 334 (456)
Q Consensus 317 ~H~V~nl~~sIsvn~~~~ 334 (456)
||+-.|.++...|...+.
T Consensus 137 wH~H~n~~d~~~~wld~l 154 (335)
T TIGR02272 137 WHDHGNPGDEPMIWLDGL 154 (335)
T ss_pred eEecccCCCCcEEEEecC
Confidence 999999988664443333
No 28
>PF06052 3-HAO: 3-hydroxyanthranilic acid dioxygenase; InterPro: IPR010329 Members of this protein family, from both bacteria and eukaryotes, are the enzyme 3-hydroxyanthranilate 3,4-dioxygenase (1.13.11.6 from EC). It is part of the kynurenine pathway for the degradation of tryptophan and the biosynthesis of nicotinic acid [].The prokaryotic homologue is involved in the 2-nitrobenzoate degradation pathway []. The enzyme acts on the tryptophan metabolite 3-hydroxyanthranilate and produces 2-amino-3-carboxymuconate semialdehyde, which can rearrange spontaneously to quinolinic acid and feed into nicotinamide biosynthesis, or undergo further enzymatic degradation.; GO: 0000334 3-hydroxyanthranilate 3,4-dioxygenase activity, 0005506 iron ion binding, 0008152 metabolic process, 0055114 oxidation-reduction process; PDB: 1ZVF_A 1YFX_A 1YFW_A 1YFY_A 1YFU_A 2QNK_A 3FE5_A.
Probab=93.06 E-value=0.34 Score=43.20 Aligned_cols=76 Identities=17% Similarity=0.263 Sum_probs=51.9
Q ss_pred CCccceeeeCCCCCcccccccccCccceeEEEEeEEEEEEeCCCCCcccccCCCCccccCCCcccCCCCCCceEEEEEcC
Q psy12281 227 KDDYRFVYMGPKETWTPLHADVFHSYSWSVNICGRKQWLLLAPGNEKYFKDSMGNLISDMRSVDWSTLPRDTVIIVEQEA 306 (456)
Q Consensus 227 ~~~~~~l~iG~~gs~T~lH~D~~~~~sW~~~i~G~Krw~L~pP~~~~~Ly~~~g~~p~d~~~pd~~~fp~~~~~~~~l~p 306 (456)
..++.-+.+|++.+.|.+|.|...-+ +.|+.|.=.-.+...+ +.-.+.+++
T Consensus 31 ~~~f~VmvVGGPN~R~DyHine~eE~--FyQ~kG~m~Lkv~e~g---------------------------~~kdi~I~E 81 (151)
T PF06052_consen 31 KDDFIVMVVGGPNQRTDYHINETEEF--FYQLKGDMCLKVVEDG---------------------------KFKDIPIRE 81 (151)
T ss_dssp ESSEEEEEEESSB--SSEEE-SS-EE--EEEEES-EEEEEEETT---------------------------EEEEEEE-T
T ss_pred cCCeEEEEEcCCCCCCccccCCcceE--EEEEeCcEEEEEEeCC---------------------------ceEEEEeCC
Confidence 34667778899999999999998866 9999998666654433 344799999
Q ss_pred CCEEEeCCCCeEEEEeCCCcEEEEe
Q psy12281 307 GDSIFVPSGWHHQVTNLEHTISINH 331 (456)
Q Consensus 307 GD~LfIPsgW~H~V~nl~~sIsvn~ 331 (456)
||+-++|++-.|...-.++||.+-.
T Consensus 82 Ge~fLLP~~vpHsP~R~~~tiGLVi 106 (151)
T PF06052_consen 82 GEMFLLPANVPHSPQRPADTIGLVI 106 (151)
T ss_dssp TEEEEE-TT--EEEEE-TT-EEEEE
T ss_pred CcEEecCCCCCCCCcCCCCcEEEEE
Confidence 9999999999999999888888754
No 29
>COG1917 Uncharacterized conserved protein, contains double-stranded beta-helix domain [Function unknown]
Probab=92.92 E-value=0.56 Score=40.72 Aligned_cols=66 Identities=20% Similarity=0.148 Sum_probs=51.2
Q ss_pred eeeCCCCCcccccccc-cCccceeEEEEeEEEEEEeCCCCCcccccCCCCccccCCCcccCCCCCCceEEEEEcCCCEEE
Q psy12281 233 VYMGPKETWTPLHADV-FHSYSWSVNICGRKQWLLLAPGNEKYFKDSMGNLISDMRSVDWSTLPRDTVIIVEQEAGDSIF 311 (456)
Q Consensus 233 l~iG~~gs~T~lH~D~-~~~~sW~~~i~G~Krw~L~pP~~~~~Ly~~~g~~p~d~~~pd~~~fp~~~~~~~~l~pGD~Lf 311 (456)
.+-..+|..++.|..+ .... ..++.|+=++.+- .-..++++||+++
T Consensus 47 ~v~~~~G~~~~~H~hp~~~~~--~~Vl~G~~~~~~~-------------------------------g~~~~l~~Gd~i~ 93 (131)
T COG1917 47 LVTFEPGAVIPWHTHPLGEQT--IYVLEGEGTVQLE-------------------------------GEKKELKAGDVII 93 (131)
T ss_pred EEEECCCcccccccCCCcceE--EEEEecEEEEEec-------------------------------CCceEecCCCEEE
Confidence 4555677789999998 4344 7889998777773 1146889999999
Q ss_pred eCCCCeEEEEeCCCc--EEEEe
Q psy12281 312 VPSGWHHQVTNLEHT--ISINH 331 (456)
Q Consensus 312 IPsgW~H~V~nl~~s--Isvn~ 331 (456)
||+|=.|.+.+.+++ +-|..
T Consensus 94 ip~g~~H~~~a~~~~~~~~l~v 115 (131)
T COG1917 94 IPPGVVHGLKAVEDEPMVLLLV 115 (131)
T ss_pred ECCCCeeeeccCCCCceeEEEE
Confidence 999999999999996 44444
No 30
>smart00835 Cupin_1 Cupin. This family represents the conserved barrel domain of the 'cupin' superfamily ('cupa' is the Latin term for a small barrel). This family contains 11S and 7S plant seed storage proteins, and germins. Plant seed storage proteins provide the major nitrogen source for the developing plant.
Probab=92.67 E-value=0.97 Score=40.11 Aligned_cols=66 Identities=21% Similarity=0.241 Sum_probs=46.3
Q ss_pred eeeCCCCCcccccccccCccceeEEEEeEEEEEEeCCCCCcccccCCCCccccCCCcccCCCCCCceEEEEEcCCCEEEe
Q psy12281 233 VYMGPKETWTPLHADVFHSYSWSVNICGRKQWLLLAPGNEKYFKDSMGNLISDMRSVDWSTLPRDTVIIVEQEAGDSIFV 312 (456)
Q Consensus 233 l~iG~~gs~T~lH~D~~~~~sW~~~i~G~Krw~L~pP~~~~~Ly~~~g~~p~d~~~pd~~~fp~~~~~~~~l~pGD~LfI 312 (456)
+-+.+ |+..+.|+.... .-+..++.|+=+..+--+.. .+..+..+++||+++|
T Consensus 35 ~~i~p-g~~~~~h~H~~~-~e~~~Vl~G~~~~~~~~~~~-------------------------~~~~~~~l~~GD~~~i 87 (146)
T smart00835 35 VNLEP-GGMLPPHYHPRA-TELLYVVRGEGRVGVVDPNG-------------------------NKVYDARLREGDVFVV 87 (146)
T ss_pred EEecC-CcCcCCeeCCCC-CEEEEEEeCeEEEEEEeCCC-------------------------CeEEEEEecCCCEEEE
Confidence 44555 444778875422 23377899987776643311 1355789999999999
Q ss_pred CCCCeEEEEeCCC
Q psy12281 313 PSGWHHQVTNLEH 325 (456)
Q Consensus 313 PsgW~H~V~nl~~ 325 (456)
|+|..|...|.++
T Consensus 88 p~g~~H~~~n~~~ 100 (146)
T smart00835 88 PQGHPHFQVNSGD 100 (146)
T ss_pred CCCCEEEEEcCCC
Confidence 9999999999876
No 31
>TIGR03404 bicupin_oxalic bicupin, oxalate decarboxylase family. Members of this protein family are defined as bicupins as they have two copies of the cupin domain (pfam00190). Two different known activities for members of this family are oxalate decarboxylase (EC 4.1.1.2) and oxalate oxidase (EC 1.2.3.4), although the latter activity has more often been found in distantly related monocupin (germin) proteins.
Probab=91.59 E-value=1.4 Score=45.55 Aligned_cols=65 Identities=20% Similarity=0.315 Sum_probs=48.6
Q ss_pred eeeCCCCCcccccccccCccceeEEEEeEEEEEEeCCCCCcccccCCCCccccCCCcccCCCCCCceEEEEEcCCCEEEe
Q psy12281 233 VYMGPKETWTPLHADVFHSYSWSVNICGRKQWLLLAPGNEKYFKDSMGNLISDMRSVDWSTLPRDTVIIVEQEAGDSIFV 312 (456)
Q Consensus 233 l~iG~~gs~T~lH~D~~~~~sW~~~i~G~Krw~L~pP~~~~~Ly~~~g~~p~d~~~pd~~~fp~~~~~~~~l~pGD~LfI 312 (456)
+-+.+.+ ..++|.... .-|..++.|+=+..+.... .+.+...+++||++||
T Consensus 72 ~~l~pG~-~~~~HwH~~--~E~~yVl~G~~~v~~~d~~--------------------------g~~~~~~L~~GD~~~f 122 (367)
T TIGR03404 72 MRLEPGA-IRELHWHKE--AEWAYVLYGSCRITAVDEN--------------------------GRNYIDDVGAGDLWYF 122 (367)
T ss_pred EEEcCCC-CCCcccCCC--ceEEEEEeeEEEEEEEcCC--------------------------CcEEEeEECCCCEEEE
Confidence 4455544 477888753 3579999999998886432 1234448999999999
Q ss_pred CCCCeEEEEeCCCc
Q psy12281 313 PSGWHHQVTNLEHT 326 (456)
Q Consensus 313 PsgW~H~V~nl~~s 326 (456)
|+|..|...|.++.
T Consensus 123 P~g~~H~~~n~~~~ 136 (367)
T TIGR03404 123 PPGIPHSLQGLDEG 136 (367)
T ss_pred CCCCeEEEEECCCC
Confidence 99999999999764
No 32
>COG1996 RPC10 DNA-directed RNA polymerase, subunit RPC10 (contains C4-type Zn-finger) [Transcription]
Probab=91.34 E-value=0.28 Score=35.41 Aligned_cols=34 Identities=24% Similarity=0.537 Sum_probs=25.4
Q ss_pred ceEEEec-CCCCcEEeehhhhhcCCcEEeCCCCcceEEEe
Q psy12281 20 ETYYYPC-PCGDRFQITKEQLKAGEDVATCPTCSLIIKPS 58 (456)
Q Consensus 20 ~~~~~~C-rCg~~f~i~~~~l~~~~~~~~C~~CSl~i~~~ 58 (456)
....|-| |||..|.+ +++ ...+.|+.|+-.|-+.
T Consensus 3 ~~~~Y~C~~Cg~~~~~----~~~-~~~irCp~Cg~rIl~K 37 (49)
T COG1996 3 AMMEYKCARCGREVEL----DQE-TRGIRCPYCGSRILVK 37 (49)
T ss_pred ceEEEEhhhcCCeeeh----hhc-cCceeCCCCCcEEEEe
Confidence 4578999 99999832 222 3578999999988764
No 33
>PF02041 Auxin_BP: Auxin binding protein; InterPro: IPR000526 Auxin binding protein is located in the lumen of the endoplasmic reticulum (ER). The primary structure contains an N-terminal hydrophobic leader sequence of 30-40 amino acids, which could represent a signal for translocation of the protein to the ER [, ]. The mature protein comprises around 165 residues, and contains a number of potential N-glycosylation sites. In vitro transport studies have demonstrated co-translational glycosylation []. Retention within the lumen of the ER correlates with an additional signal located at the C terminus, represented by the sequence Lys-Asp-Glu-Leu, known to be responsible for preventing secretion of proteins from the lumen of the ER in eukaryotic cells [, ].; GO: 0004872 receptor activity, 0005788 endoplasmic reticulum lumen; PDB: 1LR5_D 1LRH_D.
Probab=91.00 E-value=0.77 Score=40.93 Aligned_cols=65 Identities=17% Similarity=0.127 Sum_probs=38.2
Q ss_pred CCCCcccccccccCccceeEEEEeEEEEEEeCCCCCcccccCCCCccccCCCcccCCCCCCceEEEEEcCCCEEEeCCCC
Q psy12281 237 PKETWTPLHADVFHSYSWSVNICGRKQWLLLAPGNEKYFKDSMGNLISDMRSVDWSTLPRDTVIIVEQEAGDSIFVPSGW 316 (456)
Q Consensus 237 ~~gs~T~lH~D~~~~~sW~~~i~G~Krw~L~pP~~~~~Ly~~~g~~p~d~~~pd~~~fp~~~~~~~~l~pGD~LfIPsgW 316 (456)
.+|+.||.|..-+.-. ++++.|+=.-.|...... | + .++.++..-|++++.||.+=
T Consensus 52 APG~~TPiHRHsCEEV--FvVLkG~GTl~l~~~~~~---~--p-----------------G~pqef~~~pnSTf~IPvn~ 107 (167)
T PF02041_consen 52 APGSATPIHRHSCEEV--FVVLKGSGTLYLASSHEK---Y--P-----------------GKPQEFPIFPNSTFHIPVND 107 (167)
T ss_dssp -TT-B--EEEESS-EE--EEEEE--EEEEE--SSSS---S---------------------S-EEEEE-TTEEEEE-TT-
T ss_pred cCCCCCCCccccccEE--EEEEecceEEEEeccccc---C--C-----------------CCceEEEecCCCeEEeCCCC
Confidence 5678899999999998 899999988888622210 1 1 46788999999999999999
Q ss_pred eEEEEeCCC
Q psy12281 317 HHQVTNLEH 325 (456)
Q Consensus 317 ~H~V~nl~~ 325 (456)
-|||.|.+.
T Consensus 108 ~HQv~NT~e 116 (167)
T PF02041_consen 108 AHQVWNTNE 116 (167)
T ss_dssp -EEEE---S
T ss_pred cceeecCCC
Confidence 999999874
No 34
>PF05523 FdtA: WxcM-like, C-terminal ; InterPro: IPR008894 This entry includes FdtA (Q6T1W8 from SWISSPROT) from Aneurinibacillus thermoaerophilus, which has been characterised as a dTDP-6-deoxy-3,4-keto-hexulose isomerase []. It also includes WxcM (Q93S92 from SWISSPROT) from Xanthomonas campestris pv campestris) []. ; PDB: 2PAK_A 2PAE_A 2PA7_B 2PAM_A.
Probab=90.64 E-value=1.2 Score=39.08 Aligned_cols=77 Identities=17% Similarity=0.257 Sum_probs=40.4
Q ss_pred ceeee--CCCCCcccccccccCccceeEEEEeEEEEEEeCCCCCcccccCCCCccccCCCcccCCCCCCceEEEEEcCC-
Q psy12281 231 RFVYM--GPKETWTPLHADVFHSYSWSVNICGRKQWLLLAPGNEKYFKDSMGNLISDMRSVDWSTLPRDTVIIVEQEAG- 307 (456)
Q Consensus 231 ~~l~i--G~~gs~T~lH~D~~~~~sW~~~i~G~Krw~L~pP~~~~~Ly~~~g~~p~d~~~pd~~~fp~~~~~~~~l~pG- 307 (456)
|..|+ -+.|..-.+|.....+ -|.+.+.|+=++.+..... ..+++|...
T Consensus 33 rvy~i~~~~~~~~RG~H~Hk~~~-~~~~~l~Gs~~v~~~d~~~---------------------------~~~~~L~~~~ 84 (131)
T PF05523_consen 33 RVYYIYNVPPGVIRGWHAHKKTT-QWFIVLSGSFKVVLDDGRE---------------------------EEEFILDEPN 84 (131)
T ss_dssp EEEEEES--SS--EEEEEESS---EEEEEEES-EEEEEE-SS----------------------------EEEEEE--TT
T ss_pred EEEEEEcCCCCCccccccccccc-EEEEEEeCEEEEEEecCCC---------------------------cEEEEECCCC
Confidence 44555 4555556788776544 4789999998888643222 134555444
Q ss_pred CEEEeCCCCeEEEEeCCCcEEEEeecCCC
Q psy12281 308 DSIFVPSGWHHQVTNLEHTISINHNWING 336 (456)
Q Consensus 308 D~LfIPsgW~H~V~nl~~sIsvn~~~~~~ 336 (456)
..|+||+|+||...++++. ||-..|.+.
T Consensus 85 ~~L~Ippg~w~~~~~~s~~-svlLv~as~ 112 (131)
T PF05523_consen 85 KGLYIPPGVWHGIKNFSED-SVLLVLASE 112 (131)
T ss_dssp EEEEE-TT-EEEEE---TT--EEEEEESS
T ss_pred eEEEECCchhhHhhccCCC-cEEEEEcCC
Confidence 4899999999999999885 444445543
No 35
>KOG0958|consensus
Probab=90.29 E-value=0.11 Score=56.23 Aligned_cols=114 Identities=20% Similarity=0.209 Sum_probs=85.3
Q ss_pred CCccceeeeCCCCCcccccccccCccceeEEEEeE-EEEEEeCCCCCcccccCCCCc-cccC-CCcccCCC------C--
Q psy12281 227 KDDYRFVYMGPKETWTPLHADVFHSYSWSVNICGR-KQWLLLAPGNEKYFKDSMGNL-ISDM-RSVDWSTL------P-- 295 (456)
Q Consensus 227 ~~~~~~l~iG~~gs~T~lH~D~~~~~sW~~~i~G~-Krw~L~pP~~~~~Ly~~~g~~-p~d~-~~pd~~~f------p-- 295 (456)
....+.||.|.=-+.-++|.+-++-+|-+..--|. |.|+.+||.....|....+.+ |... ..+++-++ |
T Consensus 171 gvNt~yLyfGmwKttFaWHtEdmDLySINyLHFGaPK~WYaIP~eh~~rfekla~~~fp~~~~~C~aFLRHK~~LiSP~~ 250 (690)
T KOG0958|consen 171 GVNTPYLYFGMWKTTFAWHTEDMDLYSINYLHFGAPKQWYAIPPEHGDRFEKLASELFPDSSQGCPAFLRHKMTLISPSV 250 (690)
T ss_pred ccCccceeeeeeecccccccCCccceeeeeeecCCCcceeecCHHHHHHHHHHHHhhCCccccCCHHHHhhcccccCHHH
Confidence 35667899998888899999999999777777776 999999999987765432222 1110 01122121 1
Q ss_pred ----CCceEEEEEcCCCEEEeCCCCeEEEEeCCCcEEEEeecCCCCCHH
Q psy12281 296 ----RDTVIIVEQEAGDSIFVPSGWHHQVTNLEHTISINHNWINGTNID 340 (456)
Q Consensus 296 ----~~~~~~~~l~pGD~LfIPsgW~H~V~nl~~sIsvn~~~~~~~nl~ 340 (456)
..+..+++|++||.|.-=++=||.--|++.+++=+.||....-+.
T Consensus 251 LkqnGIpfn~ivqeagEFmITFPygyHaGFN~GfN~aES~nFat~Rwi~ 299 (690)
T KOG0958|consen 251 LKQNGIPFNRIVQEAGEFMITFPYGYHAGFNHGFNCAESTNFATPRWID 299 (690)
T ss_pred HHHcCCCcceeeecCCcEEEecCcccccccccchhhhhhhcccchhhhh
Confidence 277889999999999988888999999999999888888775443
No 36
>COG4101 Predicted mannose-6-phosphate isomerase [Carbohydrate transport and metabolism]
Probab=90.19 E-value=0.3 Score=41.73 Aligned_cols=26 Identities=35% Similarity=0.521 Sum_probs=24.0
Q ss_pred EEEEEcCCCEEEeCCCCeEEEEeCCC
Q psy12281 300 IIVEQEAGDSIFVPSGWHHQVTNLEH 325 (456)
Q Consensus 300 ~~~~l~pGD~LfIPsgW~H~V~nl~~ 325 (456)
..++..|||.+|||+|-.||-.|+.+
T Consensus 88 ~ha~~~pGDf~YiPpgVPHqp~N~S~ 113 (142)
T COG4101 88 EHAEVGPGDFFYIPPGVPHQPANLST 113 (142)
T ss_pred eeEEecCCCeEEcCCCCCCcccccCC
Confidence 36889999999999999999999987
No 37
>PF13717 zinc_ribbon_4: zinc-ribbon domain
Probab=89.94 E-value=0.33 Score=32.73 Aligned_cols=31 Identities=19% Similarity=0.695 Sum_probs=27.0
Q ss_pred ec-CCCCcEEeehhhhhcCCcEEeCCCCcceE
Q psy12281 25 PC-PCGDRFQITKEQLKAGEDVATCPTCSLII 55 (456)
Q Consensus 25 ~C-rCg~~f~i~~~~l~~~~~~~~C~~CSl~i 55 (456)
.| +|+-.|.|.++.|-.+...++|+.|+-.+
T Consensus 4 ~Cp~C~~~y~i~d~~ip~~g~~v~C~~C~~~f 35 (36)
T PF13717_consen 4 TCPNCQAKYEIDDEKIPPKGRKVRCSKCGHVF 35 (36)
T ss_pred ECCCCCCEEeCCHHHCCCCCcEEECCCCCCEe
Confidence 58 69999999999998877899999998654
No 38
>PF13719 zinc_ribbon_5: zinc-ribbon domain
Probab=89.56 E-value=0.47 Score=32.15 Aligned_cols=32 Identities=31% Similarity=0.914 Sum_probs=27.9
Q ss_pred ec-CCCCcEEeehhhhhcCCcEEeCCCCcceEE
Q psy12281 25 PC-PCGDRFQITKEQLKAGEDVATCPTCSLIIK 56 (456)
Q Consensus 25 ~C-rCg~~f~i~~~~l~~~~~~~~C~~CSl~i~ 56 (456)
.| .|+..|.|+++.|.++...|.|+.|.-...
T Consensus 4 ~CP~C~~~f~v~~~~l~~~~~~vrC~~C~~~f~ 36 (37)
T PF13719_consen 4 TCPNCQTRFRVPDDKLPAGGRKVRCPKCGHVFR 36 (37)
T ss_pred ECCCCCceEEcCHHHcccCCcEEECCCCCcEee
Confidence 58 599999999999988778999999986654
No 39
>PF05995 CDO_I: Cysteine dioxygenase type I; InterPro: IPR010300 Cysteine dioxygenase type I (1.13.11.20 from EC) converts cysteine to cysteinesulphinic acid and is the rate-limiting step in sulphate production.; GO: 0005506 iron ion binding, 0017172 cysteine dioxygenase activity, 0046439 L-cysteine metabolic process, 0055114 oxidation-reduction process; PDB: 2IC1_A 3EQE_B 3ELN_A 2B5H_A 2GH2_A 2Q4S_A 2ATF_A 2GM6_A 3USS_B.
Probab=89.50 E-value=1.6 Score=40.23 Aligned_cols=74 Identities=16% Similarity=0.171 Sum_probs=46.3
Q ss_pred CCCCcccccccccCccceeEEEEeEEEEEEeCCCCC--cccccCCCCccccCCCcccCCCCCCceEEEEEcCCCEEEeCC
Q psy12281 237 PKETWTPLHADVFHSYSWSVNICGRKQWLLLAPGNE--KYFKDSMGNLISDMRSVDWSTLPRDTVIIVEQEAGDSIFVPS 314 (456)
Q Consensus 237 ~~gs~T~lH~D~~~~~sW~~~i~G~Krw~L~pP~~~--~~Ly~~~g~~p~d~~~pd~~~fp~~~~~~~~l~pGD~LfIPs 314 (456)
++|..|+.| |..++..|..++.|.=+=..|-.... ..+. .......+.+|.+.+.|.
T Consensus 83 ~pGq~S~IH-DH~~s~g~~~vl~G~l~e~~y~~~~~~~~~~~--------------------~~~~~~~~~~g~~~~~~~ 141 (175)
T PF05995_consen 83 PPGQRSPIH-DHGGSWGWVKVLSGELEETRYRRPDDGGAPLE--------------------LVGRERLLPGGVTYIFDP 141 (175)
T ss_dssp -TT-B--EE-E-TTSEEEEEEEESEEEEEEEEESTSSS-EEE--------------------ECEEEEEETTTEEEEBTT
T ss_pred CCCCcCCCC-CCCCceEEEEEecceEEEEEeccCCcccCccc--------------------ccCceEecCCCeEEecCC
Confidence 567789999 77777788999999966665533222 1000 223445677888888899
Q ss_pred CCeEEEEeCC-C--cEEEEe
Q psy12281 315 GWHHQVTNLE-H--TISINH 331 (456)
Q Consensus 315 gW~H~V~nl~-~--sIsvn~ 331 (456)
+..|+|.|.. + +||+..
T Consensus 142 ~~iH~v~n~s~~~~avSLHv 161 (175)
T PF05995_consen 142 HGIHRVENPSGDEPAVSLHV 161 (175)
T ss_dssp TBEEEEEES-SSS-EEEEEE
T ss_pred CCeEEeccCCCCCCEEEEEE
Confidence 9999999875 4 666654
No 40
>PRK13290 ectC L-ectoine synthase; Reviewed
Probab=88.78 E-value=2.3 Score=37.00 Aligned_cols=30 Identities=10% Similarity=-0.017 Sum_probs=25.8
Q ss_pred EEEEcCCCEEEeCCCCeEEEEeCCCcEEEE
Q psy12281 301 IVEQEAGDSIFVPSGWHHQVTNLEHTISIN 330 (456)
Q Consensus 301 ~~~l~pGD~LfIPsgW~H~V~nl~~sIsvn 330 (456)
+..++|||++|+|++=.|+..|.++...|.
T Consensus 76 ~~~L~aGD~i~~~~~~~H~~~N~e~~~~l~ 105 (125)
T PRK13290 76 VHPIRPGTMYALDKHDRHYLRAGEDMRLVC 105 (125)
T ss_pred EEEeCCCeEEEECCCCcEEEEcCCCEEEEE
Confidence 688999999999999999999997654444
No 41
>COG5285 Protein involved in biosynthesis of mitomycin antibiotics/polyketide fumonisin [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=88.59 E-value=0.97 Score=44.77 Aligned_cols=98 Identities=16% Similarity=0.127 Sum_probs=59.1
Q ss_pred CCCcccccccccCccceeEEEEeEEEEEEeCCCCCcc--cccCCCCcccc--CCCcccCCCCCCceEEEEEcCCCEEEeC
Q psy12281 238 KETWTPLHADVFHSYSWSVNICGRKQWLLLAPGNEKY--FKDSMGNLISD--MRSVDWSTLPRDTVIIVEQEAGDSIFVP 313 (456)
Q Consensus 238 ~gs~T~lH~D~~~~~sW~~~i~G~Krw~L~pP~~~~~--Ly~~~g~~p~d--~~~pd~~~fp~~~~~~~~l~pGD~LfIP 313 (456)
.+..|++|.|...-..=.-+.++ -|+-+-|-...+ +.--+|..-+| +..||-+.|.+.....+.++|||+||.=
T Consensus 129 ~~~~t~~HqD~~~~~~~~~~lV~--~wiAl~d~~~dnGat~vvPgSH~~~~~~~r~d~~~y~~~~~~pv~lekGDallF~ 206 (299)
T COG5285 129 GAVATRWHQDYPLVSPGYPALVN--AWIALCDFTEDNGATLVVPGSHKWDVIPERPDHETYLERNAVPVELEKGDALLFN 206 (299)
T ss_pred cccccccccccccccCCccceEE--EEEeccccccccCceEEEecccccccCCCCCCccchhhhcceeeeecCCCEEEEc
Confidence 34568899994332100111111 455555544443 22223444344 4445555566666888999999999998
Q ss_pred CCCeEEEEeCCC---cEEEEeecCCCC
Q psy12281 314 SGWHHQVTNLEH---TISINHNWINGT 337 (456)
Q Consensus 314 sgW~H~V~nl~~---sIsvn~~~~~~~ 337 (456)
..=||....-.. -++++++|+...
T Consensus 207 ~~L~HaA~aNrT~~~R~A~~~~~~~~~ 233 (299)
T COG5285 207 GSLWHAAGANRTSADRVALTLQFTVSF 233 (299)
T ss_pred chhhhhhhcCCCCcccceEEEEEeecc
Confidence 888887755443 588888888776
No 42
>PF01050 MannoseP_isomer: Mannose-6-phosphate isomerase; InterPro: IPR001538 Mannose-6-phosphate isomerase or phosphomannose isomerase (5.3.1.8 from EC) (PMI) is the enzyme that catalyses the interconversion of mannose-6-phosphate and fructose-6-phosphate. In eukaryotes PMI is involved in the synthesis of GDP-mannose, a constituent of N- and O-linked glycans and GPI anchors and in prokaryotes it participates in a variety of pathways, including capsular polysaccharide biosynthesis and D-mannose metabolism. PMI's belong to the cupin superfamily whose functions range from isomerase and epimerase activities involved in the modification of cell wall carbohydrates in bacteria and plants, to non-enzymatic storage proteins in plant seeds, and transcription factors linked to congenital baldness in mammals []. Three classes of PMI have been defined []. The type II phosphomannose isomerases are bifunctional enzymes 5.3.1.8 from EC. This entry covers the isomerase region of the protein []. The guanosine diphospho-D-mannose pyrophosphorylase region is described in another InterPro entry (see IPR005836 from INTERPRO).; GO: 0016779 nucleotidyltransferase activity, 0005976 polysaccharide metabolic process
Probab=87.55 E-value=1.4 Score=39.78 Aligned_cols=62 Identities=21% Similarity=0.390 Sum_probs=45.5
Q ss_pred ceeeeCCCCCcccccccccCccceeEEEEeEEEEEEeCCCCCcccccCCCCccccCCCcccCCCCCCceEEEEEcCCCEE
Q psy12281 231 RFVYMGPKETWTPLHADVFHSYSWSVNICGRKQWLLLAPGNEKYFKDSMGNLISDMRSVDWSTLPRDTVIIVEQEAGDSI 310 (456)
Q Consensus 231 ~~l~iG~~gs~T~lH~D~~~~~sW~~~i~G~Krw~L~pP~~~~~Ly~~~g~~p~d~~~pd~~~fp~~~~~~~~l~pGD~L 310 (456)
+.+.+-|..+ .++|+....+=.| .+|.|.-..++ ..-...+.+||.+
T Consensus 66 kri~V~pG~~-lSlq~H~~R~E~W-~Vv~G~a~v~~-------------------------------~~~~~~~~~g~sv 112 (151)
T PF01050_consen 66 KRITVNPGKR-LSLQYHHHRSEHW-TVVSGTAEVTL-------------------------------DDEEFTLKEGDSV 112 (151)
T ss_pred EEEEEcCCCc-cceeeecccccEE-EEEeCeEEEEE-------------------------------CCEEEEEcCCCEE
Confidence 4466666554 7777777666555 35777776666 1125679999999
Q ss_pred EeCCCCeEEEEeCCC
Q psy12281 311 FVPSGWHHQVTNLEH 325 (456)
Q Consensus 311 fIPsgW~H~V~nl~~ 325 (456)
|||.|-.|++.|.+.
T Consensus 113 ~Ip~g~~H~i~n~g~ 127 (151)
T PF01050_consen 113 YIPRGAKHRIENPGK 127 (151)
T ss_pred EECCCCEEEEECCCC
Confidence 999999999999875
No 43
>PF05180 zf-DNL: DNL zinc finger; InterPro: IPR007853 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. The DNL-type zinc finger is found in Tim15, a zinc finger protein essential for protein import into mitochondria. Mitochondrial functions rely on the correct transport of resident proteins synthesized in the cytosol to mitochondria. Protein import into mitochondria is mediated by membrane protein complexes, protein translocators, in the outer and inner mitochondrial membranes, in cooperation with their assistant proteins in the cytosol, intermembrane space and matrix. Proteins destined to the mitochondrial matrix cross the outer membrane with the aid of the outer membrane translocator, the tOM40 complex, and then the inner membrane with the aid of the inner membrane translocator, the TIM23 complex, and mitochondrial motor and chaperone (MMC) proteins including mitochondrial heat- shock protein 70 (mtHsp70), and translocase in the inner mitochondrial membrane (Tim)15. Tim15 is also known as zinc finger motif (Zim)17 or mtHsp70 escort protein (Hep)1. Tim15 contains a zinc-finger motif (CXXC and CXXC) of ~100 residues, which has been named DNL after a short C-terminal motif of D(N/H)L [, , ]. The DNL-type zinc finger is an L-shaped molecule. The two CXXC motifs are located at the end of the L, and are sandwiched by two- stranded antiparallel beta-sheets. Two short alpha-helices constitute another leg of the L. The outer (convex) face of the L has a large acidic groove, which is lined with five acidic residues, whereas the inner (concave) face of the L has two positively charged residues, next to the CXXC motifs []. This entry represents the DNL-type zinc finger.; GO: 0008270 zinc ion binding; PDB: 2E2Z_A.
Probab=87.02 E-value=0.69 Score=35.63 Aligned_cols=33 Identities=24% Similarity=0.652 Sum_probs=26.0
Q ss_pred EEEec-CCC--CcEEeehhhhhcCCcEEeCCCCcce
Q psy12281 22 YYYPC-PCG--DRFQITKEQLKAGEDVATCPTCSLI 54 (456)
Q Consensus 22 ~~~~C-rCg--~~f~i~~~~l~~~~~~~~C~~CSl~ 54 (456)
..|.| .|| ..-.|++..-+.|..+|+|++|.-+
T Consensus 3 l~FTC~~C~~Rs~~~~sk~aY~~GvViv~C~gC~~~ 38 (66)
T PF05180_consen 3 LTFTCNKCGTRSAKMFSKQAYHKGVVIVQCPGCKNR 38 (66)
T ss_dssp EEEEETTTTEEEEEEEEHHHHHTSEEEEE-TTS--E
T ss_pred EEEEcCCCCCccceeeCHHHHhCCeEEEECCCCcce
Confidence 46789 699 4788999999999999999999865
No 44
>TIGR01221 rmlC dTDP-4-dehydrorhamnose 3,5-epimerase. This enzyme participates in the biosynthesis of dTDP-L-rhamnose, often as a precursor to LPS O-antigen
Probab=85.85 E-value=4.9 Score=37.15 Aligned_cols=78 Identities=14% Similarity=0.052 Sum_probs=49.0
Q ss_pred eeeeCCCCCccccccccc---CccceeEEEEeEEEEEEeCCCCCcccccCCCCccccCCCcccCCCCCCceEEEEEcC--
Q psy12281 232 FVYMGPKETWTPLHADVF---HSYSWSVNICGRKQWLLLAPGNEKYFKDSMGNLISDMRSVDWSTLPRDTVIIVEQEA-- 306 (456)
Q Consensus 232 ~l~iG~~gs~T~lH~D~~---~~~sW~~~i~G~Krw~L~pP~~~~~Ly~~~g~~p~d~~~pd~~~fp~~~~~~~~l~p-- 306 (456)
++-...+|+.-.+|+... .-+ ...+.|+=.-+++..... +|. | .+...++|.+
T Consensus 47 n~S~S~~gvlRGlH~q~~~~q~Kl--v~c~~G~i~dV~VDlR~~---------------SpT---f--G~~~~~~L~~~~ 104 (176)
T TIGR01221 47 NHSKSYKGVLRGLHYQRPHPQGKL--VRVLRGEVFDVAVDLRRN---------------SPT---F--GKWVGVLLSAEN 104 (176)
T ss_pred EEEEecCCEEEEEEECCCCCCceE--EEEccCCEEEEEEECCCC---------------cCC---C--CeEEEEEECCCC
Confidence 344456688888888642 223 556666655555433221 111 1 3455666666
Q ss_pred CCEEEeCCCCeEEEEeCCCcEEEEe
Q psy12281 307 GDSIFVPSGWHHQVTNLEHTISINH 331 (456)
Q Consensus 307 GD~LfIPsgW~H~V~nl~~sIsvn~ 331 (456)
+-+||||+|.+|.-.+|++...+.+
T Consensus 105 ~~~l~IP~G~aHGF~~L~d~a~v~Y 129 (176)
T TIGR01221 105 KRQLWIPEGFAHGFVVLSDEAEFLY 129 (176)
T ss_pred CCEEEeCCcceeEEEEcCCCeEEEE
Confidence 6699999999999999998744444
No 45
>COG1898 RfbC dTDP-4-dehydrorhamnose 3,5-epimerase and related enzymes [Cell envelope biogenesis, outer membrane]
Probab=85.39 E-value=6.8 Score=36.12 Aligned_cols=89 Identities=12% Similarity=0.065 Sum_probs=56.2
Q ss_pred cceeeeCCCCCcccccccccCccceeEEEEeEEEEEEeCCCCCcccccCCCCccccCCCcccCCCCCCceEEEEEcCC--
Q psy12281 230 YRFVYMGPKETWTPLHADVFHSYSWSVNICGRKQWLLLAPGNEKYFKDSMGNLISDMRSVDWSTLPRDTVIIVEQEAG-- 307 (456)
Q Consensus 230 ~~~l~iG~~gs~T~lH~D~~~~~sW~~~i~G~Krw~L~pP~~~~~Ly~~~g~~p~d~~~pd~~~fp~~~~~~~~l~pG-- 307 (456)
..++.++.+|+.-.+|+..-.-.-..+.+.|+=.-.++..... +| .| .+...+++.+-
T Consensus 46 Q~n~S~S~~GvlRGlHyq~~~q~klv~~v~G~v~dv~vDlR~~---------------Sp---Ty--g~~~~~~ls~~N~ 105 (173)
T COG1898 46 QDNHSFSYPGVLRGLHYQHKPQGKLVRVVSGKVFDVAVDLRKD---------------SP---TY--GKWVGVVLSAENK 105 (173)
T ss_pred cceEEEecCCeeEEEEcccCCCCeEEEEecCeEEEEEEEccCC---------------CC---Cc--ceEEEEEecCCCc
Confidence 3456667788888999987662112445566655555433221 11 11 23445566554
Q ss_pred CEEEeCCCCeEEEEeCCCcEEEEeecCCCCC
Q psy12281 308 DSIFVPSGWHHQVTNLEHTISINHNWINGTN 338 (456)
Q Consensus 308 D~LfIPsgW~H~V~nl~~sIsvn~~~~~~~n 338 (456)
.+|+||.|+.|--.+|+++..+.+.-.+.++
T Consensus 106 ~~l~IP~G~AHGf~~L~d~~~~~y~~~~~Y~ 136 (173)
T COG1898 106 RQLYIPPGFAHGFQVLSDDAEVVYKVTEEYD 136 (173)
T ss_pred eEEEeCCcccceeEEccCceEEEEEecceeC
Confidence 8999999999999999998766665444443
No 46
>PF13759 2OG-FeII_Oxy_5: Putative 2OG-Fe(II) oxygenase; PDB: 3BVC_B 2RG4_A.
Probab=85.26 E-value=5.7 Score=32.83 Aligned_cols=36 Identities=39% Similarity=0.480 Sum_probs=23.7
Q ss_pred CceEEEEEcCCCEEEeCCCCeEEEEeCCC---cEEEEee
Q psy12281 297 DTVIIVEQEAGDSIFVPSGWHHQVTNLEH---TISINHN 332 (456)
Q Consensus 297 ~~~~~~~l~pGD~LfIPsgW~H~V~nl~~---sIsvn~~ 332 (456)
...+.+..++|++|+.|+.-+|.|..... -|||++|
T Consensus 63 ~~~~~~~p~~G~lvlFPs~l~H~v~p~~~~~~RisisfN 101 (101)
T PF13759_consen 63 SPYYIVEPEEGDLVLFPSWLWHGVPPNNSDEERISISFN 101 (101)
T ss_dssp -SEEEE---TTEEEEEETTSEEEE----SSS-EEEEEEE
T ss_pred CceEEeCCCCCEEEEeCCCCEEeccCcCCCCCEEEEEcC
Confidence 67788999999999999999999976543 5666664
No 47
>PF09723 Zn-ribbon_8: Zinc ribbon domain; InterPro: IPR013429 This entry represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB []. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=84.66 E-value=0.65 Score=32.37 Aligned_cols=31 Identities=32% Similarity=0.927 Sum_probs=24.0
Q ss_pred eEEEec-CCCCcEEeehhhhhcCCcEEeCCCCcc
Q psy12281 21 TYYYPC-PCGDRFQITKEQLKAGEDVATCPTCSL 53 (456)
Q Consensus 21 ~~~~~C-rCg~~f~i~~~~l~~~~~~~~C~~CSl 53 (456)
.|.|.| .||..|++-..- .+ ...+.|+.|.-
T Consensus 3 ~Yey~C~~Cg~~fe~~~~~-~~-~~~~~CP~Cg~ 34 (42)
T PF09723_consen 3 IYEYRCEECGHEFEVLQSI-SE-DDPVPCPECGS 34 (42)
T ss_pred CEEEEeCCCCCEEEEEEEc-CC-CCCCcCCCCCC
Confidence 489999 599999986543 33 46899999976
No 48
>smart00834 CxxC_CXXC_SSSS Putative regulatory protein. CxxC_CXXC_SSSS represents a region of about 41 amino acids found in a number of small proteins in a wide range of bacteria. The region usually begins with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One protein in this entry has been noted as a putative regulatory protein, designated FmdB. Most proteins in this entry have a C-terminal region containing highly degenerate sequence.
Probab=84.11 E-value=0.83 Score=31.16 Aligned_cols=34 Identities=32% Similarity=0.849 Sum_probs=25.0
Q ss_pred eEEEec-CCCCcEEeehhhhhcCCcEEeCCCCcceEE
Q psy12281 21 TYYYPC-PCGDRFQITKEQLKAGEDVATCPTCSLIIK 56 (456)
Q Consensus 21 ~~~~~C-rCg~~f~i~~~~l~~~~~~~~C~~CSl~i~ 56 (456)
.|.|.| -||..|++....- + ...+.|+.|.-.++
T Consensus 3 ~Y~y~C~~Cg~~fe~~~~~~-~-~~~~~CP~Cg~~~~ 37 (41)
T smart00834 3 IYEYRCEDCGHTFEVLQKIS-D-DPLATCPECGGDVR 37 (41)
T ss_pred CEEEEcCCCCCEEEEEEecC-C-CCCCCCCCCCCcce
Confidence 489999 4999998865432 2 35788999987543
No 49
>PLN00212 glutelin; Provisional
Probab=83.14 E-value=7.2 Score=41.86 Aligned_cols=74 Identities=14% Similarity=0.056 Sum_probs=55.7
Q ss_pred CCCcccccccccCccceeEEEEeEEEEEEeCCCCCcccccCCCCccccCCCcccCCCCCCceEEEEEcCCCEEEeCCCCe
Q psy12281 238 KETWTPLHADVFHSYSWSVNICGRKQWLLLAPGNEKYFKDSMGNLISDMRSVDWSTLPRDTVIIVEQEAGDSIFVPSGWH 317 (456)
Q Consensus 238 ~gs~T~lH~D~~~~~sW~~~i~G~Krw~L~pP~~~~~Ly~~~g~~p~d~~~pd~~~fp~~~~~~~~l~pGD~LfIPsgW~ 317 (456)
+|+..+.|+.+..+ .-..++.|+=+.-++.+. | .+.+...|++||++.||.|+.
T Consensus 357 ~gam~~PHwn~nA~-eI~yV~rG~g~vqvV~~~---------g----------------~~vf~~~L~~GdvfVVPqg~~ 410 (493)
T PLN00212 357 QNALLSPFWNVNAH-SVVYITQGRARVQVVSNN---------G----------------KTVFNGVLRPGQLLIIPQHYA 410 (493)
T ss_pred CCcccCCeecCCCC-EEEEEeecceEEEEEcCC---------C----------------CEEEEEEEcCCCEEEECCCCe
Confidence 45568889887744 347788898888887643 1 245778999999999999999
Q ss_pred EEEEeCCCcEEEEeecCCCC
Q psy12281 318 HQVTNLEHTISINHNWINGT 337 (456)
Q Consensus 318 H~V~nl~~sIsvn~~~~~~~ 337 (456)
|...+-.+.+-++....++.
T Consensus 411 v~~~A~~egfe~v~F~tna~ 430 (493)
T PLN00212 411 VLKKAEREGCQYIAFKTNAN 430 (493)
T ss_pred EEEeecCCceEEEEeecCCC
Confidence 99888777776666555553
No 50
>TIGR02466 conserved hypothetical protein. This family consists of uncharacterized proteins in Caulobacter crescentus CB15, Bdellovibrio bacteriovorus HD100, Synechococcus sp. WH 8102 (2), Silicibacter pomeroyi DSS-3 (2), and Hyphomonas neptunium ATCC 15444. The context of nearby genes differs substantially between members and does point to any specific biological role.
Probab=82.84 E-value=8.1 Score=36.50 Aligned_cols=35 Identities=26% Similarity=0.412 Sum_probs=25.6
Q ss_pred eEEEEEcCCCEEEeCCCCeEEEEeCCC---cEEEEeec
Q psy12281 299 VIIVEQEAGDSIFVPSGWHHQVTNLEH---TISINHNW 333 (456)
Q Consensus 299 ~~~~~l~pGD~LfIPsgW~H~V~nl~~---sIsvn~~~ 333 (456)
.+.+.-++|++|+.||.-+|.|..... -|||++|+
T Consensus 161 ~~~v~P~~G~lvlFPS~L~H~v~p~~~~~~RISiSFNl 198 (201)
T TIGR02466 161 FVYVPPQEGRVLLFESWLRHEVPPNESEEERISVSFNY 198 (201)
T ss_pred cEEECCCCCeEEEECCCCceecCCCCCCCCEEEEEEee
Confidence 344556999999999999999987643 45555443
No 51
>PF09313 DUF1971: Domain of unknown function (DUF1971); InterPro: IPR015392 This uncharacterised domain is predominantly found in bacterial Tellurite resistance proteins. ; PDB: 3BB6_C 3M70_A 3DL3_I.
Probab=82.43 E-value=13 Score=30.02 Aligned_cols=57 Identities=16% Similarity=0.144 Sum_probs=42.3
Q ss_pred CccceeEEEEeEEEEEEeCCCCCcccccCCCCccccCCCcccCCCCCCceEEEEEcCCCEEEeCCCCeEEEEeCCCcEEE
Q psy12281 250 HSYSWSVNICGRKQWLLLAPGNEKYFKDSMGNLISDMRSVDWSTLPRDTVIIVEQEAGDSIFVPSGWHHQVTNLEHTISI 329 (456)
Q Consensus 250 ~~~sW~~~i~G~Krw~L~pP~~~~~Ly~~~g~~p~d~~~pd~~~fp~~~~~~~~l~pGD~LfIPsgW~H~V~nl~~sIsv 329 (456)
++.+-..++.|+=+|+.+.+... ....++...+|+.-+||+.-||.|..+++-+.+
T Consensus 24 GtWg~l~Vl~G~L~f~~~~~~~~------------------------~~~~~~~~~~~~~~~i~Pq~wH~V~p~s~D~~f 79 (82)
T PF09313_consen 24 GTWGKLRVLEGELKFYGLDEEGE------------------------EPEEEVFIPAGQPPVIEPQQWHRVEPLSDDLRF 79 (82)
T ss_dssp TEEEEEEEEESEEEEEEESSTT-------------------------SESEEEEEETTEEEEE-TT-EEEEEESSTT-EE
T ss_pred CeEEEEEEEeeEEEEEEECCCCC------------------------ceeEEEEeCCCCCceeCCCceEEEEECCCCEEE
Confidence 44555778999999999887543 124478999999999999999999999985544
Q ss_pred E
Q psy12281 330 N 330 (456)
Q Consensus 330 n 330 (456)
.
T Consensus 80 ~ 80 (82)
T PF09313_consen 80 Q 80 (82)
T ss_dssp E
T ss_pred E
Confidence 3
No 52
>PRK09943 DNA-binding transcriptional repressor PuuR; Provisional
Probab=82.35 E-value=8.3 Score=35.57 Aligned_cols=34 Identities=21% Similarity=0.090 Sum_probs=27.1
Q ss_pred EEEEcCCCEEEeCCCCeEEEEeCCCcEEEEeecC
Q psy12281 301 IVEQEAGDSIFVPSGWHHQVTNLEHTISINHNWI 334 (456)
Q Consensus 301 ~~~l~pGD~LfIPsgW~H~V~nl~~sIsvn~~~~ 334 (456)
+..+++||.+++|++=.|...|.++.-+.-.++.
T Consensus 147 ~~~l~~Gd~~~~~~~~~H~~~n~~~~~~~~l~~~ 180 (185)
T PRK09943 147 DYHLVAGQSYAINTGIPHSFSNTSAGICRIISAH 180 (185)
T ss_pred EEEecCCCEEEEcCCCCeeeeCCCCCCeEEEEEe
Confidence 5789999999999999999999877444444333
No 53
>PF00908 dTDP_sugar_isom: dTDP-4-dehydrorhamnose 3,5-epimerase; InterPro: IPR000888 Deoxythymidine diphosphate (dTDP)-4-keto-6-deoxy-d-hexulose 3, 5-epimerase (RmlC, 5.1.3.13 from EC) is involved in the biosynthesis of dTDP-l-rhamnose, which is an essential component of the bacterial cell wall, converting dTDP-4-keto-6-deoxy-D-glucose to dTDP-4-keto-L-rhamnose. The crystal structure of RmlC from Methanobacterium thermoautotrophicum was determined in the presence and absence of a substrate analogue. RmlC is a homodimer comprising a central jelly roll motif, which extends in two directions into longer beta-sheets. Binding of dTDP is stabilised by ionic interactions to the phosphate group and by a combination of ionic and hydrophobic interactions with the base. The active site, which is located in the centre of the jelly roll, is formed by residues that are conserved in all known RmlC sequence homologues. The active site is lined with a number of charged residues and a number of residues with hydrogen-bonding potentials, which together comprise a potential network for substrate binding and catalysis. The active site is also lined with aromatic residues which provide favorable environments for the base moiety of dTDP and potentially for the sugar moiety of the substrate [].; GO: 0008830 dTDP-4-dehydrorhamnose 3,5-epimerase activity, 0009103 lipopolysaccharide biosynthetic process; PDB: 1EPZ_A 1EP0_A 1NXM_A 1NZC_D 2IXL_C 1NYW_B 2IXC_D 1PM7_B 1UPI_A 3RYK_B ....
Probab=82.28 E-value=9.1 Score=35.39 Aligned_cols=82 Identities=17% Similarity=0.169 Sum_probs=52.0
Q ss_pred eeeeCCCCCcccccccccC----ccceeEEEEeEEEEEEeCCCCCcccccCCCCccccCCCcccCCCCCCceEEEEEcCC
Q psy12281 232 FVYMGPKETWTPLHADVFH----SYSWSVNICGRKQWLLLAPGNEKYFKDSMGNLISDMRSVDWSTLPRDTVIIVEQEAG 307 (456)
Q Consensus 232 ~l~iG~~gs~T~lH~D~~~----~~sW~~~i~G~Krw~L~pP~~~~~Ly~~~g~~p~d~~~pd~~~fp~~~~~~~~l~pG 307 (456)
++-...+|+.-.+|+.... -+ ..++.|+=.-+++..... +|.+ .+...++|.++
T Consensus 46 n~S~s~~gvlRGlH~q~~~~~q~Kl--v~~~~G~i~dV~vDlR~~---------------SpTf-----g~~~~~~Ls~~ 103 (176)
T PF00908_consen 46 NISVSKKGVLRGLHYQSPPYAQAKL--VRCLRGEIFDVAVDLRKG---------------SPTF-----GKWVSVELSAE 103 (176)
T ss_dssp EEEEEETTBEEEEEEESTTT-EEEE--EEEEESEEEEEEEE-BTT---------------STTT-----T-EEEEEEETT
T ss_pred EEEEccccEEEEEEEecCCCCCCcE--EEEecCeEEEEEEECCCC---------------CCCC-----CEEEEEEeCcc
Confidence 4455566888889987664 23 556677655555422111 1211 46777888777
Q ss_pred C--EEEeCCCCeEEEEeCCCcEEEEeecCC
Q psy12281 308 D--SIFVPSGWHHQVTNLEHTISINHNWIN 335 (456)
Q Consensus 308 D--~LfIPsgW~H~V~nl~~sIsvn~~~~~ 335 (456)
+ .||||+|.+|...++++.-.|.+.--+
T Consensus 104 n~~~l~IP~G~aHGf~~l~d~a~v~Y~~t~ 133 (176)
T PF00908_consen 104 NPRQLYIPPGVAHGFQTLEDDAEVLYKVTN 133 (176)
T ss_dssp T--EEEE-TTEEEEEEESSSEEEEEEEESS
T ss_pred ccCEEEeCCcceeeEEeccCceEEEEecCC
Confidence 6 799999999999999996555554433
No 54
>TIGR01479 GMP_PMI mannose-1-phosphate guanylyltransferase/mannose-6-phosphate isomerase. This enzyme is known to be bifunctional, as both mannose-6-phosphate isomerase (EC 5.3.1.8) (PMI) and mannose-1-phosphate guanylyltransferase (EC 2.7.7.22) in Pseudomonas aeruginosa, Xanthomonas campestris, and Gluconacetobacter xylinus. The literature on the enzyme from E. coli attributes mannose-6-phosphate isomerase activity to an adjacent gene, but the present sequence has not been shown to lack the activity. The PMI domain is C-terminal.
Probab=80.82 E-value=5.4 Score=42.63 Aligned_cols=26 Identities=19% Similarity=0.318 Sum_probs=24.0
Q ss_pred EEEEEcCCCEEEeCCCCeEEEEeCCC
Q psy12281 300 IIVEQEAGDSIFVPSGWHHQVTNLEH 325 (456)
Q Consensus 300 ~~~~l~pGD~LfIPsgW~H~V~nl~~ 325 (456)
-+..++|||.+|||++=.|+..|.++
T Consensus 415 ~~~~l~~GDsi~ip~~~~H~~~N~g~ 440 (468)
T TIGR01479 415 ETLLLTENESTYIPLGVIHRLENPGK 440 (468)
T ss_pred EEEEecCCCEEEECCCCcEEEEcCCC
Confidence 36899999999999999999999887
No 55
>KOG3277|consensus
Probab=80.80 E-value=1.7 Score=38.83 Aligned_cols=43 Identities=23% Similarity=0.503 Sum_probs=34.0
Q ss_pred CccccCCCceEEEec-CCC--CcEEeehhhhhcCCcEEeCCCCcce
Q psy12281 12 DFEYDEDDETYYYPC-PCG--DRFQITKEQLKAGEDVATCPTCSLI 54 (456)
Q Consensus 12 d~~~~~~~~~~~~~C-rCg--~~f~i~~~~l~~~~~~~~C~~CSl~ 54 (456)
+|-...-.....|.| -|| ..=.|++..-+.|..+|+|++|.-.
T Consensus 68 ~~~~~kp~m~l~yTCkvCntRs~ktisk~AY~~GvVivqC~gC~~~ 113 (165)
T KOG3277|consen 68 FFKVPKPRMQLAYTCKVCNTRSTKTISKQAYEKGVVIVQCPGCKNH 113 (165)
T ss_pred cccCCCcceEEEEEeeccCCccccccChhhhhCceEEEECCCCccc
Confidence 344444455689999 599 4788999999999999999999854
No 56
>TIGR01206 lysW lysine biosynthesis protein LysW. This very small, poorly characterized protein has been shown essential in Thermus thermophilus for an unusual pathway of Lys biosynthesis from aspartate by way of alpha-aminoadipate (AAA) rather than diaminopimelate. It is found also in Deinococcus radiodurans and Pyrococcus horikoshii, which appear to share the AAA pathway.
Probab=78.40 E-value=2.5 Score=31.27 Aligned_cols=32 Identities=22% Similarity=0.723 Sum_probs=25.6
Q ss_pred ec-CCCCcEEeehhhhhcCCcEEeCCCCcceEEEee
Q psy12281 25 PC-PCGDRFQITKEQLKAGEDVATCPTCSLIIKPSV 59 (456)
Q Consensus 25 ~C-rCg~~f~i~~~~l~~~~~~~~C~~CSl~i~~~~ 59 (456)
.| .||..+.+....+ | .++.|+.|...+.|.-
T Consensus 4 ~CP~CG~~iev~~~~~--G-eiV~Cp~CGaeleVv~ 36 (54)
T TIGR01206 4 ECPDCGAEIELENPEL--G-ELVICDECGAELEVVS 36 (54)
T ss_pred CCCCCCCEEecCCCcc--C-CEEeCCCCCCEEEEEe
Confidence 58 5999999965554 4 4889999999998864
No 57
>TIGR02605 CxxC_CxxC_SSSS putative regulatory protein, FmdB family. This model represents a region of about 50 amino acids found in a number of small proteins in a wide range of bacteria. The region begins usually with the initiator Met and contains two CxxC motifs separated by 17 amino acids. One member of this family is has been noted as a putative regulatory protein, designated FmdB (PubMed:8841393). Most members of this family have a C-terminal region containing highly degenerate sequence, such as SSTSESTKSSGSSGSSGSSESKASGSTEKSTSSTTAAAAV in Mycobacterium tuberculosis and VAVGGSAPAPSPAPRAGGGGGGCCGGGCCG in Streptomyces avermitilis. These low complexity regions, which are not included in the model, resemble low-complexity C-terminal regions of some heterocycle-containing bacteriocin precursors.
Probab=75.64 E-value=2.1 Score=30.97 Aligned_cols=34 Identities=35% Similarity=0.950 Sum_probs=24.0
Q ss_pred eEEEec-CCCCcEEeehhhhhcCCcEEeCCCCcc-eEE
Q psy12281 21 TYYYPC-PCGDRFQITKEQLKAGEDVATCPTCSL-IIK 56 (456)
Q Consensus 21 ~~~~~C-rCg~~f~i~~~~l~~~~~~~~C~~CSl-~i~ 56 (456)
.|.|.| .||..|++.... .+ ...+.|+.|.- .++
T Consensus 3 ~Yey~C~~Cg~~fe~~~~~-~~-~~~~~CP~Cg~~~~~ 38 (52)
T TIGR02605 3 IYEYRCTACGHRFEVLQKM-SD-DPLATCPECGGEKLR 38 (52)
T ss_pred CEEEEeCCCCCEeEEEEec-CC-CCCCCCCCCCCCcee
Confidence 489999 599999986422 22 35678999986 443
No 58
>PF05899 Cupin_3: Protein of unknown function (DUF861); InterPro: IPR008579 The function of the proteins in this entry are unknown. They contain the conserved barrel domain of the 'cupin' superfamily and members are specific to plants and bacteria.; PDB: 1RC6_A 3MYX_A 1O5U_A 2K9Z_A 1LKN_A 3ES4_A 1SFN_B 3BCW_A.
Probab=74.86 E-value=1.9 Score=33.74 Aligned_cols=18 Identities=44% Similarity=0.951 Sum_probs=14.9
Q ss_pred EEEEEcCCCEEEeCCCCe
Q psy12281 300 IIVEQEAGDSIFVPSGWH 317 (456)
Q Consensus 300 ~~~~l~pGD~LfIPsgW~ 317 (456)
-..++.|||++++|+||-
T Consensus 44 ~~~~~~aGD~~~~p~G~~ 61 (74)
T PF05899_consen 44 ETVTFKAGDAFFLPKGWT 61 (74)
T ss_dssp EEEEEETTEEEEE-TTEE
T ss_pred CEEEEcCCcEEEECCCCE
Confidence 358999999999999993
No 59
>KOG2107|consensus
Probab=74.74 E-value=3.9 Score=37.15 Aligned_cols=28 Identities=36% Similarity=0.475 Sum_probs=24.9
Q ss_pred CceEEEEEcCCCEEEeCCCCeEEEEeCC
Q psy12281 297 DTVIIVEQEAGDSIFVPSGWHHQVTNLE 324 (456)
Q Consensus 297 ~~~~~~~l~pGD~LfIPsgW~H~V~nl~ 324 (456)
..++.|-++.||.|++|+|-+|.-....
T Consensus 113 d~WIRi~vekGDlivlPaGiyHRFTtt~ 140 (179)
T KOG2107|consen 113 DQWIRIFVEKGDLIVLPAGIYHRFTTTP 140 (179)
T ss_pred CCEEEEEEecCCEEEecCcceeeeecCc
Confidence 5789999999999999999999976654
No 60
>PF02311 AraC_binding: AraC-like ligand binding domain; InterPro: IPR003313 This entry defines the arabinose-binding and dimerisation domain of the bacterial gene regulatory protein AraC. The crystal structure of the arabinose-binding and dimerization domain of the Escherichia coli gene regulatory protein AraC was determined in the presence and absence of L-arabinose. The arabinose-bound molecule shows that the protein adopts an unusual fold, binding sugar within a beta barrel and completely burying the arabinose with the amino-terminal arm of the protein. Dimer contacts in the presence of arabinose are mediated by an antiparallel coiled-coil. In the uncomplexed protein, the amino-terminal arm is disordered, uncovering the sugar-binding pocket and allowing it to serve as an oligomerization interface [].; GO: 0006355 regulation of transcription, DNA-dependent; PDB: 1XJA_B 2ARA_A 2AAC_B 2ARC_A.
Probab=73.99 E-value=17 Score=30.29 Aligned_cols=57 Identities=16% Similarity=0.123 Sum_probs=35.9
Q ss_pred CCCCCcccccccccCccceeEEEEeEEEEEEeCCCCCcccccCCCCccccCCCcccCCCCCCceEEEEEcCCCEEEeCCC
Q psy12281 236 GPKETWTPLHADVFHSYSWSVNICGRKQWLLLAPGNEKYFKDSMGNLISDMRSVDWSTLPRDTVIIVEQEAGDSIFVPSG 315 (456)
Q Consensus 236 G~~gs~T~lH~D~~~~~sW~~~i~G~Krw~L~pP~~~~~Ly~~~g~~p~d~~~pd~~~fp~~~~~~~~l~pGD~LfIPsg 315 (456)
+..+..++.|...+..+ ..++.|.-.+.+ ..-+..++|||+++||+|
T Consensus 10 ~~~~~~~~~h~h~~~~i--~~v~~G~~~~~~-------------------------------~~~~~~l~~g~~~li~p~ 56 (136)
T PF02311_consen 10 RSPNFEFPPHWHDFYEI--IYVLSGEGTLHI-------------------------------DGQEYPLKPGDLFLIPPG 56 (136)
T ss_dssp TSTT-SEEEETT-SEEE--EEEEEE-EEEEE-------------------------------TTEEEEE-TT-EEEE-TT
T ss_pred cCCCCccCCEECCCEEE--EEEeCCEEEEEE-------------------------------CCEEEEEECCEEEEecCC
Confidence 33444566777665555 778888876654 123588999999999999
Q ss_pred CeEEEEeCCC
Q psy12281 316 WHHQVTNLEH 325 (456)
Q Consensus 316 W~H~V~nl~~ 325 (456)
=.|.....++
T Consensus 57 ~~H~~~~~~~ 66 (136)
T PF02311_consen 57 QPHSYYPDSN 66 (136)
T ss_dssp S-EEEEE-TT
T ss_pred ccEEEecCCC
Confidence 9999999884
No 61
>PF12852 Cupin_6: Cupin
Probab=73.47 E-value=14 Score=33.93 Aligned_cols=25 Identities=28% Similarity=0.481 Sum_probs=21.3
Q ss_pred EEEEcCCCEEEeCCCCeEEEEeCCC
Q psy12281 301 IVEQEAGDSIFVPSGWHHQVTNLEH 325 (456)
Q Consensus 301 ~~~l~pGD~LfIPsgW~H~V~nl~~ 325 (456)
.+.+++||++++|.|=.|....-.+
T Consensus 56 ~~~L~~GDivllp~g~~H~l~~~~~ 80 (186)
T PF12852_consen 56 PIRLEAGDIVLLPRGTAHVLSSDPD 80 (186)
T ss_pred eEEecCCCEEEEcCCCCeEeCCCCC
Confidence 3889999999999999999865544
No 62
>PRK15457 ethanolamine utilization protein EutQ; Provisional
Probab=72.77 E-value=13 Score=35.84 Aligned_cols=52 Identities=15% Similarity=0.179 Sum_probs=38.0
Q ss_pred CcccccccccCccceeEEEEeEEEEEEeCCCCCcccccCCCCccccCCCcccCCCCCCceEEEEEcCCCEEEeCCCCeEE
Q psy12281 240 TWTPLHADVFHSYSWSVNICGRKQWLLLAPGNEKYFKDSMGNLISDMRSVDWSTLPRDTVIIVEQEAGDSIFVPSGWHHQ 319 (456)
Q Consensus 240 s~T~lH~D~~~~~sW~~~i~G~Krw~L~pP~~~~~Ly~~~g~~p~d~~~pd~~~fp~~~~~~~~l~pGD~LfIPsgW~H~ 319 (456)
+--++|++.... ..++.|+=++.+ ..-+.++.|||++|||+|=.|+
T Consensus 167 ~sf~wtl~~dEi---~YVLEGe~~l~I-------------------------------dG~t~~l~pGDvlfIPkGs~~h 212 (233)
T PRK15457 167 AFFPWTLNYDEI---DMVLEGELHVRH-------------------------------EGETMIAKAGDVMFIPKGSSIE 212 (233)
T ss_pred CccceeccceEE---EEEEEeEEEEEE-------------------------------CCEEEEeCCCcEEEECCCCeEE
Confidence 446678777654 477888877666 1236889999999999999966
Q ss_pred EEeCCC
Q psy12281 320 VTNLEH 325 (456)
Q Consensus 320 V~nl~~ 325 (456)
-.+.+.
T Consensus 213 f~tp~~ 218 (233)
T PRK15457 213 FGTPSS 218 (233)
T ss_pred ecCCCC
Confidence 655443
No 63
>smart00659 RPOLCX RNA polymerase subunit CX. present in RNA polymerase I, II and III
Probab=71.86 E-value=6.8 Score=27.64 Aligned_cols=28 Identities=36% Similarity=0.808 Sum_probs=22.4
Q ss_pred EEec-CCCCcEEeehhhhhcCCcEEeCCCCcceEE
Q psy12281 23 YYPC-PCGDRFQITKEQLKAGEDVATCPTCSLIIK 56 (456)
Q Consensus 23 ~~~C-rCg~~f~i~~~~l~~~~~~~~C~~CSl~i~ 56 (456)
.|-| +||..|.+. ....+.|+.|.--|-
T Consensus 2 ~Y~C~~Cg~~~~~~------~~~~irC~~CG~rIl 30 (44)
T smart00659 2 IYICGECGRENEIK------SKDVVRCRECGYRIL 30 (44)
T ss_pred EEECCCCCCEeecC------CCCceECCCCCceEE
Confidence 5889 899999886 135799999998775
No 64
>TIGR03214 ura-cupin putative allantoin catabolism protein. This model represents a protein containing a tandem arrangement of cupin domains (N-terminal part of pfam07883 and C-terminal more distantly related to pfam00190). This protein is found in the vicinity of genes involved in the catabolism of allantoin, a breakdown product of urate and sometimes of urate iteslf. The distribution of pathway components in the genomes in which this family is observed suggests that the function is linked to the allantoate catabolism to glyoxylate pathway (GenProp0686) since it is sometimes found in genomes lacking any elements of the xanthine-to-allantoin pathways (e.g. in Enterococcus faecalis).
Probab=69.88 E-value=24 Score=34.60 Aligned_cols=33 Identities=15% Similarity=0.208 Sum_probs=26.2
Q ss_pred EEEEcCCCEEEeCCCCeEEEEeCCCcEEEEeec
Q psy12281 301 IVEQEAGDSIFVPSGWHHQVTNLEHTISINHNW 333 (456)
Q Consensus 301 ~~~l~pGD~LfIPsgW~H~V~nl~~sIsvn~~~ 333 (456)
+.+|++||.+|+|++=-|...|.++.-+.-.++
T Consensus 99 ~~~L~~Gd~~y~pa~~~H~~~N~~~~~a~~l~v 131 (260)
T TIGR03214 99 THELREGGYAYLPPGSKWTLANAQAEDARFFLY 131 (260)
T ss_pred EEEECCCCEEEECCCCCEEEEECCCCCEEEEEE
Confidence 469999999999999999999988733333333
No 65
>TIGR03214 ura-cupin putative allantoin catabolism protein. This model represents a protein containing a tandem arrangement of cupin domains (N-terminal part of pfam07883 and C-terminal more distantly related to pfam00190). This protein is found in the vicinity of genes involved in the catabolism of allantoin, a breakdown product of urate and sometimes of urate iteslf. The distribution of pathway components in the genomes in which this family is observed suggests that the function is linked to the allantoate catabolism to glyoxylate pathway (GenProp0686) since it is sometimes found in genomes lacking any elements of the xanthine-to-allantoin pathways (e.g. in Enterococcus faecalis).
Probab=69.03 E-value=5.6 Score=39.06 Aligned_cols=28 Identities=21% Similarity=0.083 Sum_probs=25.0
Q ss_pred ceEEEEEcCCCEEEeCCCCeEEEEeCCC
Q psy12281 298 TVIIVEQEAGDSIFVPSGWHHQVTNLEH 325 (456)
Q Consensus 298 ~~~~~~l~pGD~LfIPsgW~H~V~nl~~ 325 (456)
..-...+++||++|||++--|+..|.++
T Consensus 216 ~g~~~~V~~GD~i~i~~~~~h~~~~~G~ 243 (260)
T TIGR03214 216 DNNWVPVEAGDYIWMGAYCPQACYAGGR 243 (260)
T ss_pred CCEEEEecCCCEEEECCCCCEEEEecCC
Confidence 3457889999999999999999999986
No 66
>PF03079 ARD: ARD/ARD' family; InterPro: IPR004313 The two acireductone dioxygenase enzymes (ARD and ARD', previously known as E-2 and E-2') from Klebsiella pneumoniae share the same amino acid sequence Q9ZFE7 from SWISSPROT, but bind different metal ions: ARD binds Ni2+, ARD' binds Fe2+ []. ARD and ARD' can be experimentally interconverted by removal of the bound metal ion and reconstitution with the appropriate metal ion. The two enzymes share the same substrate, 1,2-dihydroxy-3-keto-5-(methylthio)pentene, but yield different products. ARD' yields the alpha-keto precursor of methionine (and formate), thus forming part of the ubiquitous methionine salvage pathway that converts 5'-methylthioadenosine (MTA) to methionine. This pathway is responsible for the tight control of the concentration of MTA, which is a powerful inhibitor of polyamine biosynthesis and transmethylation reactions []. ARD yields methylthiopropanoate, carbon monoxide and formate, and thus prevents the conversion of MTA to methionine. The role of the ARD catalysed reaction is unclear: methylthiopropanoate is cytotoxic, and carbon monoxide can activate guanylyl cyclase, leading to increased intracellular cGMP levels [, ]. This family also contains other proteins, whose functions are not well characterised.; GO: 0010309 acireductone dioxygenase [iron(II)-requiring] activity, 0055114 oxidation-reduction process; PDB: 1VR3_A 1ZRR_A 2HJI_A.
Probab=68.70 E-value=41 Score=30.50 Aligned_cols=38 Identities=34% Similarity=0.308 Sum_probs=25.0
Q ss_pred CceEEEEEcCCCEEEeCCCCeEEEEeCCC-cEEEEeecC
Q psy12281 297 DTVIIVEQEAGDSIFVPSGWHHQVTNLEH-TISINHNWI 334 (456)
Q Consensus 297 ~~~~~~~l~pGD~LfIPsgW~H~V~nl~~-sIsvn~~~~ 334 (456)
...+.+.+++||.|.||+|-+|...--+. .|..-.-|.
T Consensus 112 ~~wiri~~e~GDli~vP~g~~HrF~~~~~~~i~aiRlF~ 150 (157)
T PF03079_consen 112 DVWIRILCEKGDLIVVPAGTYHRFTLGESPYIKAIRLFK 150 (157)
T ss_dssp CEEEEEEEETTCEEEE-TT--EEEEESTTSSEEEEEEES
T ss_pred CEEEEEEEcCCCEEecCCCCceeEEcCCCCcEEEEEeec
Confidence 34667999999999999999999875443 554443343
No 67
>PRK15460 cpsB mannose-1-phosphate guanyltransferase; Provisional
Probab=66.10 E-value=21 Score=38.27 Aligned_cols=26 Identities=27% Similarity=0.330 Sum_probs=24.0
Q ss_pred EEEEEcCCCEEEeCCCCeEEEEeCCC
Q psy12281 300 IIVEQEAGDSIFVPSGWHHQVTNLEH 325 (456)
Q Consensus 300 ~~~~l~pGD~LfIPsgW~H~V~nl~~ 325 (456)
.+..++|||.+|||+|=-|...|.++
T Consensus 424 ~~~~L~~GDSi~ip~g~~H~~~N~g~ 449 (478)
T PRK15460 424 DIKLLGENESIYIPLGATHCLENPGK 449 (478)
T ss_pred EEEEecCCCEEEECCCCcEEEEcCCC
Confidence 36899999999999999999999877
No 68
>COG3450 Predicted enzyme of the cupin superfamily [General function prediction only]
Probab=65.93 E-value=5.2 Score=34.42 Aligned_cols=25 Identities=40% Similarity=0.514 Sum_probs=19.1
Q ss_pred EEEEEcCCCEEEeCCCCeEEEEeCC
Q psy12281 300 IIVEQEAGDSIFVPSGWHHQVTNLE 324 (456)
Q Consensus 300 ~~~~l~pGD~LfIPsgW~H~V~nl~ 324 (456)
-.++++|||.+|+|+||.=-=.-++
T Consensus 82 e~v~~~aGD~~~~~~G~~g~W~V~E 106 (116)
T COG3450 82 EPVEVRAGDSFVFPAGFKGTWEVLE 106 (116)
T ss_pred eEEEEcCCCEEEECCCCeEEEEEee
Confidence 3589999999999999975443333
No 69
>PLN00212 glutelin; Provisional
Probab=63.97 E-value=31 Score=37.09 Aligned_cols=90 Identities=17% Similarity=0.132 Sum_probs=52.6
Q ss_pred eeeeCCCCCcccccccccCccceeEEEEeEEEEEEeCCCCCcccccCCCCccccCC-CcccCCCCCCceEEEEEcCCCEE
Q psy12281 232 FVYMGPKETWTPLHADVFHSYSWSVNICGRKQWLLLAPGNEKYFKDSMGNLISDMR-SVDWSTLPRDTVIIVEQEAGDSI 310 (456)
Q Consensus 232 ~l~iG~~gs~T~lH~D~~~~~sW~~~i~G~Krw~L~pP~~~~~Ly~~~g~~p~d~~-~pd~~~fp~~~~~~~~l~pGD~L 310 (456)
.+-|.|.|-.-|-|. ....+ ..++.|+=..-+.-|+.........- ++.-. ...-.++...+--.-.+++||++
T Consensus 84 R~~i~p~gL~lP~y~-na~~l--iyV~qG~G~~G~v~pGcpeT~~~~~~--~~~~~~~~~~~~~~d~hqkv~~lr~GDVi 158 (493)
T PLN00212 84 RRVIEPQGLLLPRYS-NTPGL--VYIIQGRGSMGLTFPGCPATYQQQFQ--QFLTEGQSQSQKFRDEHQKIHQFRQGDVV 158 (493)
T ss_pred EEEecCCcccCcccc-CCCeE--EEEEeCeEEEEEEeCCCcchhhhhcc--cccccccccccccccccccceEeccCCEE
Confidence 356677676444444 44444 68899999999887776443110000 00000 00000111122223578999999
Q ss_pred EeCCCCeEEEEeCCCc
Q psy12281 311 FVPSGWHHQVTNLEHT 326 (456)
Q Consensus 311 fIPsgW~H~V~nl~~s 326 (456)
.||+|--|..+|.+++
T Consensus 159 aiPaG~~hw~yN~Gd~ 174 (493)
T PLN00212 159 ALPAGVAHWFYNDGDA 174 (493)
T ss_pred EECCCCeEEEEeCCCC
Confidence 9999999999999985
No 70
>PRK13502 transcriptional activator RhaR; Provisional
Probab=63.21 E-value=29 Score=33.85 Aligned_cols=30 Identities=13% Similarity=0.161 Sum_probs=24.0
Q ss_pred EEEEEcCCCEEEeCCCCeEEEEeCCCcEEE
Q psy12281 300 IIVEQEAGDSIFVPSGWHHQVTNLEHTISI 329 (456)
Q Consensus 300 ~~~~l~pGD~LfIPsgW~H~V~nl~~sIsv 329 (456)
-+..+.|||+++||++=.|.....++....
T Consensus 56 ~~~~l~~g~l~li~~~~~H~~~~~~~~~~~ 85 (282)
T PRK13502 56 RPYRITRGDLFYIRAEDKHSYTSVNDLVLQ 85 (282)
T ss_pred EEEeecCCcEEEECCCCcccccccCCceEE
Confidence 368899999999999999998776654333
No 71
>PF07385 DUF1498: Protein of unknown function (DUF1498); InterPro: IPR010864 This family consists of several hypothetical bacterial proteins of around 225 residues in length. The function of this family is unknown.; PDB: 3MPB_B 3KMH_A.
Probab=62.86 E-value=21 Score=34.13 Aligned_cols=91 Identities=15% Similarity=0.066 Sum_probs=43.8
Q ss_pred eeeeCCCCCcccccccccCccceeEEEE--eEEEEEEeCCCCCcccc-cCCCCccccCCCcccCCCCCCceEEEEEcCCC
Q psy12281 232 FVYMGPKETWTPLHADVFHSYSWSVNIC--GRKQWLLLAPGNEKYFK-DSMGNLISDMRSVDWSTLPRDTVIIVEQEAGD 308 (456)
Q Consensus 232 ~l~iG~~gs~T~lH~D~~~~~sW~~~i~--G~Krw~L~pP~~~~~Ly-~~~g~~p~d~~~pd~~~fp~~~~~~~~l~pGD 308 (456)
-+.|...|..||+|+.....= -.--+ |.=...|+-......+. .....++.|-. .+.-...-+++|.||+
T Consensus 90 Kim~~~~~Q~tP~H~H~~K~E--DIINRGGG~L~i~l~~s~~~~~~~~~~~v~V~~DG~-----~~t~~aG~~l~L~PGE 162 (225)
T PF07385_consen 90 KIMIVREGQVTPMHFHWKKME--DIINRGGGNLVIELYNSDPDGELDADTDVTVPVDGI-----RRTVPAGTQLRLNPGE 162 (225)
T ss_dssp EEEEE-BT-EEEEEEESS--E--EEEEEEES-EEEEEEEB--TTSSB-SS-EEEEETTE-----EEEE-TT-EEEE-TT-
T ss_pred hheeccCCCcCCcccCcchhh--heeecCCceEEEEEEeccCCCccccCCCeEEecCCc-----EEEecCCceEEeCCCC
Confidence 356667788899999876432 11222 34445666444322111 10001111100 0001456689999999
Q ss_pred EEEeCCCCeEEEEeCCCcEEE
Q psy12281 309 SIFVPSGWHHQVTNLEHTISI 329 (456)
Q Consensus 309 ~LfIPsgW~H~V~nl~~sIsv 329 (456)
-|-+|+|-||+-..-+..+-|
T Consensus 163 SiTL~Pg~yH~Fw~e~g~vLi 183 (225)
T PF07385_consen 163 SITLPPGIYHWFWGEGGDVLI 183 (225)
T ss_dssp EEEE-TTEEEEEEE-TTSEEE
T ss_pred eEeeCCCCeeeEEecCCCEEE
Confidence 999999999999987776433
No 72
>KOG3995|consensus
Probab=61.88 E-value=21 Score=33.77 Aligned_cols=76 Identities=14% Similarity=0.234 Sum_probs=59.3
Q ss_pred CCccceeeeCCCCCcccccccccCccceeEEEEeEEEEEEeCCCCCcccccCCCCccccCCCcccCCCCCCceEEEEEcC
Q psy12281 227 KDDYRFVYMGPKETWTPLHADVFHSYSWSVNICGRKQWLLLAPGNEKYFKDSMGNLISDMRSVDWSTLPRDTVIIVEQEA 306 (456)
Q Consensus 227 ~~~~~~l~iG~~gs~T~lH~D~~~~~sW~~~i~G~Krw~L~pP~~~~~Ly~~~g~~p~d~~~pd~~~fp~~~~~~~~l~p 306 (456)
.....-+|+|.+.+.+.+|.....-+ +.|..|.-.-.++.-++ .-.++.+.
T Consensus 31 ~~qlkVm~VGGPN~RkdyHieegeE~--FyQ~KGdMvLKVie~g~---------------------------~rDivI~q 81 (279)
T KOG3995|consen 31 QEQLKVMFVGGPNTRKDYHIEEGEEV--FYQLKGDMVLKVLEQGK---------------------------HRDVVIRQ 81 (279)
T ss_pred ccCeEEEEecCCCcccccccCCcchh--heeecCceEEeeeccCc---------------------------ceeeEEec
Confidence 44567789999999999999987765 88888875555443332 23689999
Q ss_pred CCEEEeCCCCeEEEEeCCCcEEEEe
Q psy12281 307 GDSIFVPSGWHHQVTNLEHTISINH 331 (456)
Q Consensus 307 GD~LfIPsgW~H~V~nl~~sIsvn~ 331 (456)
||+..+|+.--|...-..+||.+-.
T Consensus 82 Ge~flLParVpHSPqRFantvGlVV 106 (279)
T KOG3995|consen 82 GEIFLLPARVPHSPQRFANTVGLVV 106 (279)
T ss_pred CcEEEeccCCCCChhhhccceeEEE
Confidence 9999999999999888888777643
No 73
>PRK00398 rpoP DNA-directed RNA polymerase subunit P; Provisional
Probab=60.03 E-value=7.9 Score=27.29 Aligned_cols=29 Identities=28% Similarity=0.697 Sum_probs=21.4
Q ss_pred EEec-CCCCcEEeehhhhhcCCcEEeCCCCcceEE
Q psy12281 23 YYPC-PCGDRFQITKEQLKAGEDVATCPTCSLIIK 56 (456)
Q Consensus 23 ~~~C-rCg~~f~i~~~~l~~~~~~~~C~~CSl~i~ 56 (456)
.|.| .||..|.+.+.. ..+.|+.|.--+-
T Consensus 3 ~y~C~~CG~~~~~~~~~-----~~~~Cp~CG~~~~ 32 (46)
T PRK00398 3 EYKCARCGREVELDEYG-----TGVRCPYCGYRIL 32 (46)
T ss_pred EEECCCCCCEEEECCCC-----CceECCCCCCeEE
Confidence 5889 699999986544 2689999975443
No 74
>PF06249 EutQ: Ethanolamine utilisation protein EutQ; InterPro: IPR010424 The eut operon of Salmonella typhimurium encodes proteins involved in the cobalamin-dependent degradation of ethanolamine. The role of EutQ in this process is unclear [].; PDB: 2PYT_B 3LWC_A.
Probab=59.91 E-value=6.7 Score=35.37 Aligned_cols=19 Identities=37% Similarity=0.595 Sum_probs=13.5
Q ss_pred eEEEEEcCCCEEEeCCCCe
Q psy12281 299 VIIVEQEAGDSIFVPSGWH 317 (456)
Q Consensus 299 ~~~~~l~pGD~LfIPsgW~ 317 (456)
..+++-+|||+||||.|==
T Consensus 112 G~~~~A~~GDvi~iPkGs~ 130 (152)
T PF06249_consen 112 GQTVTAKPGDVIFIPKGST 130 (152)
T ss_dssp TEEEEEETT-EEEE-TT-E
T ss_pred CEEEEEcCCcEEEECCCCE
Confidence 4478999999999999853
No 75
>KOG3130|consensus
Probab=59.00 E-value=33 Score=35.57 Aligned_cols=67 Identities=25% Similarity=0.340 Sum_probs=58.5
Q ss_pred CCCCCceEEEEEcCCCEEEeCCCCeEEEEeCCCcEEEEeecCCCCCHHHHHHHHHHHHHHHHHhhhhccc
Q psy12281 293 TLPRDTVIIVEQEAGDSIFVPSGWHHQVTNLEHTISINHNWINGTNIDHVYHEMVSHLEAVKKEIDDCKD 362 (456)
Q Consensus 293 ~fp~~~~~~~~l~pGD~LfIPsgW~H~V~nl~~sIsvn~~~~~~~nl~~~~~~l~~~~~~~~~~~~~~~~ 362 (456)
.||+.-.+.+.+.-|..-|+|. |-|++-+-++-+.-||+..+..+..-..+..++..+++++.+..+
T Consensus 38 ~~~kkLs~~Imvpig~~a~mpG---~lVhTNevtv~~g~nyf~~CS~h~A~~I~~~R~~~~r~q~~~l~~ 104 (514)
T KOG3130|consen 38 TLPKKLSYNIMVPIGPFAFMPG---KLVHTNEVTVLLGDNYFAKCSAHQAVGIVEHRKEHVRKQIDDLKK 104 (514)
T ss_pred Hhhhhcccceeeeccccccccc---ceeeechhhhhhccchHhhhhHHHHHHHHHHHHHHHHHHHHHHHH
Confidence 3666666788888999999998 999999999999999999999999888888888889998888754
No 76
>COG4766 EutQ Ethanolamine utilization protein [Amino acid transport and metabolism]
Probab=58.29 E-value=7.2 Score=34.99 Aligned_cols=17 Identities=47% Similarity=0.720 Sum_probs=14.9
Q ss_pred EEEEcCCCEEEeCCCCe
Q psy12281 301 IVEQEAGDSIFVPSGWH 317 (456)
Q Consensus 301 ~~~l~pGD~LfIPsgW~ 317 (456)
+++-+|||++|||.|=-
T Consensus 137 tv~a~aGDvifiPKgss 153 (176)
T COG4766 137 TVIAGAGDVIFIPKGSS 153 (176)
T ss_pred eEecCCCcEEEecCCCe
Confidence 68899999999999853
No 77
>PRK15131 mannose-6-phosphate isomerase; Provisional
Probab=57.82 E-value=8.2 Score=40.27 Aligned_cols=22 Identities=18% Similarity=0.380 Sum_probs=20.2
Q ss_pred EEEEcCCCEEEeCCCCeEEEEe
Q psy12281 301 IVEQEAGDSIFVPSGWHHQVTN 322 (456)
Q Consensus 301 ~~~l~pGD~LfIPsgW~H~V~n 322 (456)
.+.++|||++|||+|-.|....
T Consensus 238 ~v~l~pGeaifipAg~~HAyl~ 259 (389)
T PRK15131 238 VVKLNPGEAMFLFAETPHAYLQ 259 (389)
T ss_pred EEEeCCCCEEEeCCCCCeEEcC
Confidence 6899999999999999999765
No 78
>KOG2508|consensus
Probab=56.49 E-value=20 Score=36.66 Aligned_cols=51 Identities=22% Similarity=0.432 Sum_probs=38.4
Q ss_pred CceEEEEEcCCCEEEeCCCCeEEEEeCCC-------cEEEEeecCCC-CCHHHHHHHHH
Q psy12281 297 DTVIIVEQEAGDSIFVPSGWHHQVTNLEH-------TISINHNWING-TNIDHVYHEMV 347 (456)
Q Consensus 297 ~~~~~~~l~pGD~LfIPsgW~H~V~nl~~-------sIsvn~~~~~~-~nl~~~~~~l~ 347 (456)
.-++.+.++||+++|.|+-|.|.|.+-.. .|.+|+|+-.. .+-+.+.+.+.
T Consensus 358 ~~~l~v~~~~~~~~~l~~~~~~~~~~~~~ssg~~~v~l~~~y~~~~~~~~~~~~~r~~~ 416 (437)
T KOG2508|consen 358 TVPLVVDLEPGDMLYLPASWFHEVTSSSASSGGSDVHIAFNYWFHPEDFDHPYLDRNIW 416 (437)
T ss_pred ceeEEEecccCceeeechhheeeeeccccccCCcceeEEeeccccccccCchhhhcCHh
Confidence 56789999999999999999999987543 57888876653 34455555443
No 79
>PRK11171 hypothetical protein; Provisional
Probab=55.23 E-value=74 Score=31.27 Aligned_cols=25 Identities=16% Similarity=0.274 Sum_probs=23.6
Q ss_pred EEEEcCCCEEEeCCCCeEEEEeCCC
Q psy12281 301 IVEQEAGDSIFVPSGWHHQVTNLEH 325 (456)
Q Consensus 301 ~~~l~pGD~LfIPsgW~H~V~nl~~ 325 (456)
+.+|++||.+++|++=-|+..|.++
T Consensus 102 ~~~L~~GDsi~~p~~~~H~~~N~g~ 126 (266)
T PRK11171 102 THALSEGGYAYLPPGSDWTLRNAGA 126 (266)
T ss_pred EEEECCCCEEEECCCCCEEEEECCC
Confidence 6899999999999999999999886
No 80
>KOG1633|consensus
Probab=54.20 E-value=6.6 Score=44.43 Aligned_cols=93 Identities=18% Similarity=0.191 Sum_probs=62.9
Q ss_pred CCCCcccccccccCccceeEEEEeEE-----E-EEEeCCCCCcccccCCCCccccCCCcccCCC-C---CCceEEEEEcC
Q psy12281 237 PKETWTPLHADVFHSYSWSVNICGRK-----Q-WLLLAPGNEKYFKDSMGNLISDMRSVDWSTL-P---RDTVIIVEQEA 306 (456)
Q Consensus 237 ~~gs~T~lH~D~~~~~sW~~~i~G~K-----r-w~L~pP~~~~~Ly~~~g~~p~d~~~pd~~~f-p---~~~~~~~~l~p 306 (456)
.++..|.+|.|...+.+-+-++.|.| . |+-.+.++...-+. + ....++.... . ........+++
T Consensus 481 ~~~~~tdf~~~~~~~s~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~--~----~~~~~~~~~~~~~~~~~~~~~~~~~~ 554 (776)
T KOG1633|consen 481 MKGRYTDFHVDFSGTSVYLEGIKGEKPSGKEIFALPSPTETNEASRQ--A----TESKKDFSRVRGDSANCDDKRDVLKE 554 (776)
T ss_pred cccccceeEeccCCcccccccccccccccchhhcccccCcccchhhh--h----hhccccchhcccccccCCCccccccC
Confidence 46888999999999876677787777 3 44444443322111 0 0011111111 1 15667889999
Q ss_pred CCEEEeCCCCeEEEEeCCCcEEEEeecCC
Q psy12281 307 GDSIFVPSGWHHQVTNLEHTISINHNWIN 335 (456)
Q Consensus 307 GD~LfIPsgW~H~V~nl~~sIsvn~~~~~ 335 (456)
|+..++|+||.|+|...-+.++...+++.
T Consensus 555 g~~~~~P~~~i~~~~~p~~~~~~~~~~~~ 583 (776)
T KOG1633|consen 555 GETGLIPAGPIHAVLTPVDSLSFGGNFLT 583 (776)
T ss_pred CCcccCCCCccccccccccccccccchhh
Confidence 99999999999999999888888888874
No 81
>COG1482 ManA Phosphomannose isomerase [Carbohydrate transport and metabolism]
Probab=54.13 E-value=15 Score=37.15 Aligned_cols=24 Identities=25% Similarity=0.418 Sum_probs=21.2
Q ss_pred EEEEEcCCCEEEeCCCCeEEEEeC
Q psy12281 300 IIVEQEAGDSIFVPSGWHHQVTNL 323 (456)
Q Consensus 300 ~~~~l~pGD~LfIPsgW~H~V~nl 323 (456)
-.+.++|||.+|||+|-.|....-
T Consensus 158 n~v~lkpGe~~fl~Agt~HA~~~G 181 (312)
T COG1482 158 NRVKLKPGEAFFLPAGTPHAYLKG 181 (312)
T ss_pred cEEecCCCCEEEecCCCceeeccc
Confidence 368999999999999999997664
No 82
>COG1791 Uncharacterized conserved protein, contains double-stranded beta-helix domain [Function unknown]
Probab=54.03 E-value=43 Score=30.77 Aligned_cols=45 Identities=29% Similarity=0.344 Sum_probs=34.4
Q ss_pred eEEEEeEEEEEEeCCCCCcccccCCCCccccCCCcccCCCCCCceEEEEEcCCCEEEeCCCCeEEEEeCCC
Q psy12281 255 SVNICGRKQWLLLAPGNEKYFKDSMGNLISDMRSVDWSTLPRDTVIIVEQEAGDSIFVPSGWHHQVTNLEH 325 (456)
Q Consensus 255 ~~~i~G~Krw~L~pP~~~~~Ly~~~g~~p~d~~~pd~~~fp~~~~~~~~l~pGD~LfIPsgW~H~V~nl~~ 325 (456)
-..|.|.=.|.+-+|+ .+.+.+.+++||.|-||+|-+|.-.--++
T Consensus 99 Ry~vaG~GiF~v~~~d--------------------------~~~~~i~c~~gDLI~vP~gi~HwFtlt~~ 143 (181)
T COG1791 99 RYFVAGEGIFDVHSPD--------------------------GKVYQIRCEKGDLISVPPGIYHWFTLTES 143 (181)
T ss_pred EEEEecceEEEEECCC--------------------------CcEEEEEEccCCEEecCCCceEEEEccCC
Confidence 3467777777776654 24678899999999999999999866555
No 83
>PRK05467 Fe(II)-dependent oxygenase superfamily protein; Provisional
Probab=53.49 E-value=52 Score=31.71 Aligned_cols=37 Identities=24% Similarity=0.205 Sum_probs=29.8
Q ss_pred eEEEEEcCCCEEEeCCCCeEEEEeCCC--cEEEEeecCC
Q psy12281 299 VIIVEQEAGDSIFVPSGWHHQVTNLEH--TISINHNWIN 335 (456)
Q Consensus 299 ~~~~~l~pGD~LfIPsgW~H~V~nl~~--sIsvn~~~~~ 335 (456)
...+...+|++|+.|+.-+|+|..... -+++..|..+
T Consensus 140 ~~~Vkp~aG~~vlfps~~lH~v~pVt~G~R~~~~~Wi~S 178 (226)
T PRK05467 140 EHRVKLPAGDLVLYPSTSLHRVTPVTRGVRVASFFWIQS 178 (226)
T ss_pred cEEEecCCCeEEEECCCCceeeeeccCccEEEEEecHHH
Confidence 357889999999999999999999776 5666665544
No 84
>COG1579 Zn-ribbon protein, possibly nucleic acid-binding [General function prediction only]
Probab=53.23 E-value=7.9 Score=37.56 Aligned_cols=28 Identities=21% Similarity=0.692 Sum_probs=19.1
Q ss_pred CCCCcEE-eehhh---hhcCCcEEeCCCCcce
Q psy12281 27 PCGDRFQ-ITKEQ---LKAGEDVATCPTCSLI 54 (456)
Q Consensus 27 rCg~~f~-i~~~~---l~~~~~~~~C~~CSl~ 54 (456)
+|||-+. |+... +..++.+|.||+||=.
T Consensus 199 ~C~GC~m~l~~~~~~~V~~~d~iv~CP~CgRI 230 (239)
T COG1579 199 VCGGCHMKLPSQTLSKVRKKDEIVFCPYCGRI 230 (239)
T ss_pred cccCCeeeecHHHHHHHhcCCCCccCCccchH
Confidence 6999554 44333 3346789999999843
No 85
>TIGR02272 gentisate_1_2 gentisate 1,2-dioxygenase. This family consists of gentisate 1,2-dioxygenases. This ring-opening enzyme acts in salicylate degradation that goes via gentisate rather than via catechol. It converts gentisate to maleylpyruvate. Some putative gentisate 1,2-dioxygenases are excluded by a relatively high trusted cutoff score because they are too closely related to known examples of 1-hydroxy-2-naphthoate dioxygenase. Therefore some homologs may be bona fide gentisate 1,2-dioxygenases even if they score below the given cutoffs.
Probab=52.91 E-value=59 Score=33.25 Aligned_cols=59 Identities=17% Similarity=0.080 Sum_probs=39.7
Q ss_pred CCCCCcccccccccCccceeEEEEeEEEEEEeCCCCCcccccCCCCccccCCCcccCCCCCCceEEEEEcCCCEEEeCCC
Q psy12281 236 GPKETWTPLHADVFHSYSWSVNICGRKQWLLLAPGNEKYFKDSMGNLISDMRSVDWSTLPRDTVIIVEQEAGDSIFVPSG 315 (456)
Q Consensus 236 G~~gs~T~lH~D~~~~~sW~~~i~G~Krw~L~pP~~~~~Ly~~~g~~p~d~~~pd~~~fp~~~~~~~~l~pGD~LfIPsg 315 (456)
=++|..|..|....... +.+|.|+=+-.+ ...++..++||++.||+.
T Consensus 257 L~~G~~t~~~r~T~s~V--f~VieG~G~s~i-------------------------------g~~~~~W~~gD~f~vPsW 303 (335)
T TIGR02272 257 LPKGFRTATYRSTDATV--FCVVEGRGQVRI-------------------------------GDAVFRFSPKDVFVVPSW 303 (335)
T ss_pred cCCCCCCCCccccccEE--EEEEeCeEEEEE-------------------------------CCEEEEecCCCEEEECCC
Confidence 34555666777665555 666666644333 123688999999999998
Q ss_pred CeEEEEeCCCcE
Q psy12281 316 WHHQVTNLEHTI 327 (456)
Q Consensus 316 W~H~V~nl~~sI 327 (456)
-+|+-.+.++++
T Consensus 304 ~~~~h~a~~da~ 315 (335)
T TIGR02272 304 HPVRFEASDDAV 315 (335)
T ss_pred CcEecccCCCeE
Confidence 777777766654
No 86
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=52.84 E-value=4.6 Score=24.74 Aligned_cols=15 Identities=27% Similarity=0.895 Sum_probs=12.6
Q ss_pred EEecCCCCcEEeehh
Q psy12281 23 YYPCPCGDRFQITKE 37 (456)
Q Consensus 23 ~~~CrCg~~f~i~~~ 37 (456)
+|.|.|..+|.++.+
T Consensus 1 sy~C~C~~Gy~l~~d 15 (24)
T PF12662_consen 1 SYTCSCPPGYQLSPD 15 (24)
T ss_pred CEEeeCCCCCcCCCC
Confidence 488999999998754
No 87
>PF03604 DNA_RNApol_7kD: DNA directed RNA polymerase, 7 kDa subunit; InterPro: IPR006591 DNA-dependent RNA polymerase catalyzes the transcription of DNA into RNA using the four ribonucleoside triphosphates as substrates. Each class of RNA polymerase is assembled from 9 to 15 different polypeptides. Rbp10 (RNA polymerase CX) is a domain found in RNA polymerase subunit 10; present in RNA polymerase I, II and III.; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent; PDB: 2PMZ_Z 3HKZ_X 2NVX_L 3S1Q_L 2JA6_L 3S17_L 3HOW_L 3HOV_L 3PO2_L 3HOZ_L ....
Probab=52.52 E-value=5.8 Score=26.03 Aligned_cols=27 Identities=33% Similarity=0.840 Sum_probs=18.4
Q ss_pred Eec-CCCCcEEeehhhhhcCCcEEeCCCCcceEE
Q psy12281 24 YPC-PCGDRFQITKEQLKAGEDVATCPTCSLIIK 56 (456)
Q Consensus 24 ~~C-rCg~~f~i~~~~l~~~~~~~~C~~CSl~i~ 56 (456)
|-| +||..+.+... +.+.|+.|.-.|-
T Consensus 1 Y~C~~Cg~~~~~~~~------~~irC~~CG~RIl 28 (32)
T PF03604_consen 1 YICGECGAEVELKPG------DPIRCPECGHRIL 28 (32)
T ss_dssp EBESSSSSSE-BSTS------STSSBSSSS-SEE
T ss_pred CCCCcCCCeeEcCCC------CcEECCcCCCeEE
Confidence 668 89988886432 3579999987764
No 88
>TIGR00218 manA mannose-6-phosphate isomerase, class I. The names phosphomannose isomerase and mannose-6-phosphate isomerase are synonomous. This family contains two rather deeply branched groups. One group contains an experimentally determined phosphomannose isomerase of Streptococcus mutans as well as three uncharacterized paralogous proteins of Bacillus subtilis, all at more than 50 % identity to each other, plus a more distant homolog from Archaeoglobus fulgidus. The other group contains members from E. coli, budding yeast, Borrelia burgdorferi, etc.
Probab=51.76 E-value=7.9 Score=38.81 Aligned_cols=20 Identities=35% Similarity=0.577 Sum_probs=18.4
Q ss_pred EEEEEcCCCEEEeCCCCeEE
Q psy12281 300 IIVEQEAGDSIFVPSGWHHQ 319 (456)
Q Consensus 300 ~~~~l~pGD~LfIPsgW~H~ 319 (456)
-.+.++|||++|||+|-.|.
T Consensus 151 n~v~v~~Gd~i~ipaGt~HA 170 (302)
T TIGR00218 151 NRIKLKPGDFFYVPSGTPHA 170 (302)
T ss_pred cccccCCCCEEEeCCCCccc
Confidence 36789999999999999998
No 89
>PRK10296 DNA-binding transcriptional regulator ChbR; Provisional
Probab=50.83 E-value=52 Score=31.97 Aligned_cols=50 Identities=28% Similarity=0.358 Sum_probs=35.6
Q ss_pred cccccccccCccceeEEEEeEEEEEEeCCCCCcccccCCCCccccCCCcccCCCCCCceEEEEEcCCCEEEeCCCCeEEE
Q psy12281 241 WTPLHADVFHSYSWSVNICGRKQWLLLAPGNEKYFKDSMGNLISDMRSVDWSTLPRDTVIIVEQEAGDSIFVPSGWHHQV 320 (456)
Q Consensus 241 ~T~lH~D~~~~~sW~~~i~G~Krw~L~pP~~~~~Ly~~~g~~p~d~~~pd~~~fp~~~~~~~~l~pGD~LfIPsgW~H~V 320 (456)
..+.|...+.-+ ...+.|.-...+ ..-...+.|||+++||+|=.|+.
T Consensus 35 ~~~~H~H~~~ei--~~v~~G~~~~~i-------------------------------~~~~~~l~~g~l~~i~p~~~H~~ 81 (278)
T PRK10296 35 VSGLHQHDYYEF--TLVLTGRYYQEI-------------------------------NGKRVLLERGDFVFIPLGSHHQS 81 (278)
T ss_pred CCCCcccccEEE--EEEEeceEEEEE-------------------------------CCEEEEECCCcEEEeCCCCccce
Confidence 457887665555 666777654443 12357999999999999999987
Q ss_pred EeC
Q psy12281 321 TNL 323 (456)
Q Consensus 321 ~nl 323 (456)
...
T Consensus 82 ~~~ 84 (278)
T PRK10296 82 FYE 84 (278)
T ss_pred eee
Confidence 543
No 90
>TIGR03027 pepcterm_export putative polysaccharide export protein, PEP-CTERM sytem-associated. This protein family belongs to the larger set of polysaccharide biosynthesis/export proteins described by Pfam model pfam02563. Members of this family are variable in either containing of lacking a 78-residue insert, but appear to fall within a single clade, nevertheless, where the regions in which the gene is found encode components of the PEP-CTERM/EpsH proposed exosortase protein sorting system.
Probab=50.67 E-value=9.7 Score=34.54 Aligned_cols=17 Identities=35% Similarity=0.855 Sum_probs=14.8
Q ss_pred EEEEcCCCEEEeCCCCe
Q psy12281 301 IVEQEAGDSIFVPSGWH 317 (456)
Q Consensus 301 ~~~l~pGD~LfIPsgW~ 317 (456)
.+.|+|||+||||..++
T Consensus 149 n~~L~~gD~I~Vp~~~f 165 (165)
T TIGR03027 149 NVELKPGDVLIIPESWF 165 (165)
T ss_pred CceeCCCCEEEEecccC
Confidence 47899999999998774
No 91
>COG3435 Gentisate 1,2-dioxygenase [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=50.44 E-value=24 Score=35.36 Aligned_cols=69 Identities=23% Similarity=0.311 Sum_probs=52.4
Q ss_pred CCCcccccccccCccceeEEEEeEEEEEEeCCCCCcccccCCCCccccCCCcccCCCCCCceEEEEEcCCCEEEeCCCCe
Q psy12281 238 KETWTPLHADVFHSYSWSVNICGRKQWLLLAPGNEKYFKDSMGNLISDMRSVDWSTLPRDTVIIVEQEAGDSIFVPSGWH 317 (456)
Q Consensus 238 ~gs~T~lH~D~~~~~sW~~~i~G~Krw~L~pP~~~~~Ly~~~g~~p~d~~~pd~~~fp~~~~~~~~l~pGD~LfIPsgW~ 317 (456)
+|-.+|-|......+ -.+|.|+--|+.+. .-...+++||.|..|++=|
T Consensus 101 PGEvApsHrHsqsAl--RFvveG~Ga~T~Vd------------------------------Ger~~M~~GDfilTP~w~w 148 (351)
T COG3435 101 PGEVAPSHRHNQSAL--RFVVEGKGAYTVVD------------------------------GERTPMEAGDFILTPAWTW 148 (351)
T ss_pred CcccCCcccccccce--EEEEeccceeEeec------------------------------CceeeccCCCEEEccCcee
Confidence 355699999887766 88899988777642 2247899999999999999
Q ss_pred EEEEeCCC--cEEEEeecCCCCCHHHHH
Q psy12281 318 HQVTNLEH--TISINHNWINGTNIDHVY 343 (456)
Q Consensus 318 H~V~nl~~--sIsvn~~~~~~~nl~~~~ 343 (456)
|.-.|.++ +| |.+.-.+..+.
T Consensus 149 HdHgn~g~eP~i-----WlDgLDiplv~ 171 (351)
T COG3435 149 HDHGNEGTEPCI-----WLDGLDIPLVN 171 (351)
T ss_pred ccCCCCCCCceE-----EEcccchHHHH
Confidence 99888765 44 66666666543
No 92
>PLN02288 mannose-6-phosphate isomerase
Probab=47.10 E-value=13 Score=38.90 Aligned_cols=22 Identities=14% Similarity=0.341 Sum_probs=19.8
Q ss_pred EEEEcCCCEEEeCCCCeEEEEe
Q psy12281 301 IVEQEAGDSIFVPSGWHHQVTN 322 (456)
Q Consensus 301 ~~~l~pGD~LfIPsgW~H~V~n 322 (456)
.+.|+|||+||+|+|=.|.-..
T Consensus 252 ~v~L~PGeaifl~ag~~HAYl~ 273 (394)
T PLN02288 252 YVKLNPGEALYLGANEPHAYLS 273 (394)
T ss_pred eEecCCCCEEEecCCCCceecC
Confidence 5799999999999999998655
No 93
>COG2331 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=47.09 E-value=4.9 Score=31.67 Aligned_cols=35 Identities=34% Similarity=0.900 Sum_probs=25.0
Q ss_pred ceEEEec-CCCCcEEeehhhhhcCCcEEeCCCCcceEE
Q psy12281 20 ETYYYPC-PCGDRFQITKEQLKAGEDVATCPTCSLIIK 56 (456)
Q Consensus 20 ~~~~~~C-rCg~~f~i~~~~l~~~~~~~~C~~CSl~i~ 56 (456)
-+|.|.| -||..|.+-..- .+ +.+..|+.|+-.++
T Consensus 9 PtY~Y~c~~cg~~~dvvq~~-~d-dplt~ce~c~a~~k 44 (82)
T COG2331 9 PTYSYECTECGNRFDVVQAM-TD-DPLTTCEECGARLK 44 (82)
T ss_pred cceEEeecccchHHHHHHhc-cc-CccccChhhChHHH
Confidence 4699999 799988763322 22 46899999986443
No 94
>KOG2131|consensus
Probab=45.63 E-value=5.1 Score=40.87 Aligned_cols=109 Identities=17% Similarity=0.196 Sum_probs=57.2
Q ss_pred EEEEEcCCCEEEeCCCCeEEEEeCCCcEEEEeecCCCCCHHHHHHHHHHHHHHHHHhhhhcccchhHHHHHHHHHhhhcC
Q psy12281 300 IIVEQEAGDSIFVPSGWHHQVTNLEHTISINHNWINGTNIDHVYHEMVSHLEAVKKEIDDCKDMDDWTSHCQLMLQVSFG 379 (456)
Q Consensus 300 ~~~~l~pGD~LfIPsgW~H~V~nl~~sIsvn~~~~~~~nl~~~~~~l~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~g 379 (456)
+.-+.+-||+|=|=-.|.-.. +|+ |+- .+|..-+..+...+.+ ..-.++..-..+....-.-|.++-+
T Consensus 285 ~hQV~NL~dTISINHNW~N~~-----nl~----~~w-~~Lk~~y~a~~esisd--d~md~~~~~cq~~lra~~gin~~ef 352 (427)
T KOG2131|consen 285 HHQVLNLGDTISINHNWCNAT-----NLA----WMW-DALKEDYPALAESISD--DLMDDFPLECQFSLRATEGINANEF 352 (427)
T ss_pred ccccccccceeeecccccccc-----cHH----HHH-HHHHhhhhhhhhhhhh--hccccchhhhhhhhhhhccccccce
Confidence 445677788888877775321 111 110 0111122222322222 2334555444455555566777778
Q ss_pred CCHHHHHHHHHHHHHhhhhhcccCCccccccccccCccchhccHHHHHH
Q psy12281 380 INFRQFFDMLKFICQKRLTSLKNGTTIRLYGNWILSLDHTMFDLRKASD 428 (456)
Q Consensus 380 ~~~~~f~~~l~~~~~~r~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 428 (456)
+-|.+|++++++-..+|-..+... +-++++|..+++.....
T Consensus 353 ~~f~~~~a~l~h~~~~~~a~l~~s--------~~~~~~~~q~evl~~l~ 393 (427)
T KOG2131|consen 353 IYFPDFWAMLKHATLNRDASLFIS--------TFLSDWPIQPEVLPPLS 393 (427)
T ss_pred EeehHHHHHHHHHHhccchhhhHH--------hhhcccccchhhhccee
Confidence 888888888887666665554332 33555666655554433
No 95
>PRK00464 nrdR transcriptional regulator NrdR; Validated
Probab=43.48 E-value=14 Score=33.48 Aligned_cols=33 Identities=24% Similarity=0.635 Sum_probs=23.4
Q ss_pred EecC-CCCc--EEeehhhhhcCCcE---EeCCCCcceEE
Q psy12281 24 YPCP-CGDR--FQITKEQLKAGEDV---ATCPTCSLIIK 56 (456)
Q Consensus 24 ~~Cr-Cg~~--f~i~~~~l~~~~~~---~~C~~CSl~i~ 56 (456)
+-|| ||+- =+++...+.+|..+ .+|+.|+-.-.
T Consensus 1 m~cp~c~~~~~~~~~s~~~~~~~~~~~~~~c~~c~~~f~ 39 (154)
T PRK00464 1 MRCPFCGHPDTRVIDSRPAEDGNAIRRRRECLACGKRFT 39 (154)
T ss_pred CcCCCCCCCCCEeEeccccCCCCceeeeeeccccCCcce
Confidence 3587 9964 46777777777533 79999987654
No 96
>PF05280 FlhC: Flagellar transcriptional activator (FlhC); InterPro: IPR007944 This family consists of several bacterial flagellar transcriptional activator (FlhC) proteins. FlhC combines with FlhD to form a regulatory complex in Escherichia coli, this complex has been shown to be a global regulator involved in many cellular processes as well as a flagellar transcriptional activator [].; GO: 0003677 DNA binding, 0030092 regulation of flagellum assembly, 0045893 positive regulation of transcription, DNA-dependent; PDB: 2AVU_E.
Probab=40.86 E-value=14 Score=34.25 Aligned_cols=31 Identities=26% Similarity=0.616 Sum_probs=13.4
Q ss_pred eEEEec-CCCCcEEeehhhhhcCCcEEeCCCCcce
Q psy12281 21 TYYYPC-PCGDRFQITKEQLKAGEDVATCPTCSLI 54 (456)
Q Consensus 21 ~~~~~C-rCg~~f~i~~~~l~~~~~~~~C~~CSl~ 54 (456)
.=.-+| +||+.|+....++.. -+.|+-|++-
T Consensus 132 l~l~~C~~C~~~fv~~~~~~~~---~~~Cp~C~~p 163 (175)
T PF05280_consen 132 LQLAPCRRCGGHFVTHAHDPRH---SFVCPFCQPP 163 (175)
T ss_dssp EEEEE-TTT--EEEEESS--SS-------TT----
T ss_pred ccccCCCCCCCCeECcCCCCCc---CcCCCCCCCc
Confidence 456678 599999998766543 3689999863
No 97
>PF10955 DUF2757: Protein of unknown function (DUF2757); InterPro: IPR020115 This entry contains proteins with no known function.
Probab=40.19 E-value=22 Score=28.22 Aligned_cols=8 Identities=50% Similarity=1.269 Sum_probs=6.6
Q ss_pred EEEecC-CC
Q psy12281 22 YYYPCP-CG 29 (456)
Q Consensus 22 ~~~~Cr-Cg 29 (456)
..|.|| ||
T Consensus 3 i~Y~CRHCg 11 (76)
T PF10955_consen 3 IHYYCRHCG 11 (76)
T ss_pred eEEEecCCC
Confidence 579997 88
No 98
>COG4306 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=39.73 E-value=5.9 Score=34.24 Aligned_cols=55 Identities=13% Similarity=0.105 Sum_probs=36.9
Q ss_pred cEEeCCCCcceEEEeeeeh--------hhhhhcccccCccccccccC-cCCCCceeeCC--CCCHHHH
Q psy12281 44 DVATCPTCSLIIKPSVIKS--------VMLMLSDSCNSNEQINVTTT-ALPQVDYNIDP--SHIYPYC 100 (456)
Q Consensus 44 ~~~~C~~CSl~i~~~~~~~--------~~~~~~~~c~~~~~~~~~~~-~~p~~I~ri~~--~ls~~~F 100 (456)
.+.+|+.||-.||..|... ++++ .-|+.|..-.+|.. .+.+.++-+.+ ++++++-
T Consensus 38 ti~qcp~csasirgd~~vegvlglg~dye~p--sfchncgs~fpwterkiaga~elvea~~~l~pdev 103 (160)
T COG4306 38 TITQCPICSASIRGDYYVEGVLGLGGDYEPP--SFCHNCGSRFPWTERKIAGAVELVEAGENLNPDEV 103 (160)
T ss_pred HHhcCCccCCcccccceeeeeeccCCCCCCc--chhhcCCCCCCcHHHHHhHHHHHHHccccCCHHHH
Confidence 4889999999999888653 4454 56776655555544 33355555544 7888864
No 99
>smart00531 TFIIE Transcription initiation factor IIE.
Probab=39.09 E-value=39 Score=30.11 Aligned_cols=38 Identities=21% Similarity=0.610 Sum_probs=26.1
Q ss_pred ceEEEec-CCCCcEEeehhhhh-cCCcEEeCCCCcceEEE
Q psy12281 20 ETYYYPC-PCGDRFQITKEQLK-AGEDVATCPTCSLIIKP 57 (456)
Q Consensus 20 ~~~~~~C-rCg~~f~i~~~~l~-~~~~~~~C~~CSl~i~~ 57 (456)
+.-.|-| +||..|...+.... +......|+.|.-.+..
T Consensus 96 ~~~~Y~Cp~C~~~y~~~ea~~~~d~~~~f~Cp~Cg~~l~~ 135 (147)
T smart00531 96 NNAYYKCPNCQSKYTFLEANQLLDMDGTFTCPRCGEELEE 135 (147)
T ss_pred CCcEEECcCCCCEeeHHHHHHhcCCCCcEECCCCCCEEEE
Confidence 4468999 69999997544322 22334899999887764
No 100
>PRK11171 hypothetical protein; Provisional
Probab=39.04 E-value=49 Score=32.52 Aligned_cols=29 Identities=24% Similarity=0.087 Sum_probs=25.4
Q ss_pred ceEEEEEcCCCEEEeCCCCeEEEEeCCCc
Q psy12281 298 TVIIVEQEAGDSIFVPSGWHHQVTNLEHT 326 (456)
Q Consensus 298 ~~~~~~l~pGD~LfIPsgW~H~V~nl~~s 326 (456)
..-+..+++||+|++|++=-|...|.++.
T Consensus 221 ~~~~~~l~~GD~i~~~~~~~h~~~N~g~~ 249 (266)
T PRK11171 221 NNDWVEVEAGDFIWMRAYCPQACYAGGPG 249 (266)
T ss_pred CCEEEEeCCCCEEEECCCCCEEEECCCCC
Confidence 44577899999999999999999998773
No 101
>PF02591 DUF164: Putative zinc ribbon domain; InterPro: IPR003743 This entry describes proteins of unknown function.
Probab=38.79 E-value=29 Score=25.45 Aligned_cols=26 Identities=23% Similarity=0.910 Sum_probs=19.8
Q ss_pred CCCC-cEEeehhhhhc---CCcEEeCCCCc
Q psy12281 27 PCGD-RFQITKEQLKA---GEDVATCPTCS 52 (456)
Q Consensus 27 rCg~-~f~i~~~~l~~---~~~~~~C~~CS 52 (456)
.|++ ...|+...+.+ ++.++.|++|.
T Consensus 24 ~C~gC~~~l~~~~~~~i~~~~~i~~Cp~Cg 53 (56)
T PF02591_consen 24 TCSGCHMELPPQELNEIRKGDEIVFCPNCG 53 (56)
T ss_pred ccCCCCEEcCHHHHHHHHcCCCeEECcCCC
Confidence 6776 67787776554 56799999996
No 102
>KOG1779|consensus
Probab=37.86 E-value=36 Score=27.05 Aligned_cols=40 Identities=28% Similarity=0.535 Sum_probs=30.1
Q ss_pred CceEEEecCCCCcEEeehhhhhcCCcEEeCCCCcceEEEee
Q psy12281 19 DETYYYPCPCGDRFQITKEQLKAGEDVATCPTCSLIIKPSV 59 (456)
Q Consensus 19 ~~~~~~~CrCg~~f~i~~~~l~~~~~~~~C~~CSl~i~~~~ 59 (456)
...|+..=+|+|-|.||----.. ..+|.|.+|+..+--.-
T Consensus 28 ~nsyFm~VkC~gc~~iT~vfSHa-qtvVvc~~c~~il~~~t 67 (84)
T KOG1779|consen 28 PNSYFMDVKCPGCFKITTVFSHA-QTVVVCEGCSTILCQPT 67 (84)
T ss_pred CCceEEEEEcCCceEEEEEeecC-ceEEEcCCCceEEEEec
Confidence 45699999999999986443222 35899999999887554
No 103
>PF14354 Lar_restr_allev: Restriction alleviation protein Lar
Probab=36.40 E-value=24 Score=26.12 Aligned_cols=30 Identities=27% Similarity=0.677 Sum_probs=17.4
Q ss_pred EecC-CCCcEEeehhhhhc--CC-cEEeCCCCcc
Q psy12281 24 YPCP-CGDRFQITKEQLKA--GE-DVATCPTCSL 53 (456)
Q Consensus 24 ~~Cr-Cg~~f~i~~~~l~~--~~-~~~~C~~CSl 53 (456)
+||| ||..-+..+..... +. ..|.|..|-.
T Consensus 4 kPCPFCG~~~~~~~~~~~~~~~~~~~V~C~~Cga 37 (61)
T PF14354_consen 4 KPCPFCGSADVLIRQDEGFDYGMYYYVECTDCGA 37 (61)
T ss_pred cCCCCCCCcceEeecccCCCCCCEEEEEcCCCCC
Confidence 6898 98544443332221 11 5688998866
No 104
>PF13640 2OG-FeII_Oxy_3: 2OG-Fe(II) oxygenase superfamily; PDB: 3DKQ_B 3GZE_D 3HQR_A 2Y34_A 2G1M_A 2G19_A 3OUI_A 3OUJ_A 2HBU_A 2Y33_A ....
Probab=36.08 E-value=31 Score=27.89 Aligned_cols=30 Identities=27% Similarity=0.370 Sum_probs=24.4
Q ss_pred EcCCCEEEeCC-CCeEEEEeC-CC--cEEEEeec
Q psy12281 304 QEAGDSIFVPS-GWHHQVTNL-EH--TISINHNW 333 (456)
Q Consensus 304 l~pGD~LfIPs-gW~H~V~nl-~~--sIsvn~~~ 333 (456)
-++|++|+.|+ ..||.|... .. -++|+.||
T Consensus 66 p~~g~~v~F~~~~~~H~v~~v~~~~~R~~l~~~~ 99 (100)
T PF13640_consen 66 PKPGRLVIFPSDNSLHGVTPVGEGGRRYSLTFWF 99 (100)
T ss_dssp -BTTEEEEEESCTCEEEEEEE-EESEEEEEEEEE
T ss_pred CCCCEEEEEeCCCCeecCcccCCCCCEEEEEEEE
Confidence 99999999999 999999998 33 57777665
No 105
>PF09845 DUF2072: Zn-ribbon containing protein (DUF2072); InterPro: IPR018645 This archaeal Zinc-ribbon containing proteins have no known function.
Probab=35.66 E-value=16 Score=31.97 Aligned_cols=24 Identities=29% Similarity=0.924 Sum_probs=16.7
Q ss_pred Eec-CCCCcEEeehhhhhcCCcEEeCCCCc
Q psy12281 24 YPC-PCGDRFQITKEQLKAGEDVATCPTCS 52 (456)
Q Consensus 24 ~~C-rCg~~f~i~~~~l~~~~~~~~C~~CS 52 (456)
+.| +||..|.=-..+| +-+|+.|-
T Consensus 2 H~Ct~Cg~~f~dgs~ei-----l~GCP~CG 26 (131)
T PF09845_consen 2 HQCTKCGRVFEDGSKEI-----LSGCPECG 26 (131)
T ss_pred cccCcCCCCcCCCcHHH-----HccCcccC
Confidence 468 8998887555554 44798774
No 106
>COG3364 Zn-ribbon containing protein [General function prediction only]
Probab=34.74 E-value=18 Score=30.24 Aligned_cols=29 Identities=28% Similarity=0.798 Sum_probs=17.8
Q ss_pred Eec-CCCCcEEeehhhhhcCCcEEeCCCC--cceEEE
Q psy12281 24 YPC-PCGDRFQITKEQLKAGEDVATCPTC--SLIIKP 57 (456)
Q Consensus 24 ~~C-rCg~~f~i~~~~l~~~~~~~~C~~C--Sl~i~~ 57 (456)
+.| |||..|.=-.+.| +-+|+.| -+...|
T Consensus 3 H~CtrCG~vf~~g~~~i-----l~GCp~CG~nkF~yv 34 (112)
T COG3364 3 HQCTRCGEVFDDGSEEI-----LSGCPKCGCNKFLYV 34 (112)
T ss_pred ceecccccccccccHHH-----HccCccccchheEec
Confidence 568 9998887644443 4478744 444443
No 107
>KOG3053|consensus
Probab=33.51 E-value=15 Score=35.79 Aligned_cols=39 Identities=28% Similarity=0.601 Sum_probs=25.8
Q ss_pred cccCCCce--EEEecCCCCc----------EEeehhhhhcCCcEEeCCCCc
Q psy12281 14 EYDEDDET--YYYPCPCGDR----------FQITKEQLKAGEDVATCPTCS 52 (456)
Q Consensus 14 ~~~~~~~~--~~~~CrCg~~----------f~i~~~~l~~~~~~~~C~~CS 52 (456)
.-|+|+.. |.+||||-|. --|+|.+..+....|.|+-|-
T Consensus 28 ~TdeDn~~a~WV~PCrCRGt~KWVHqsCL~rWiDEK~~~n~~q~V~C~QCq 78 (293)
T KOG3053|consen 28 ATDEDNRLAAWVHPCRCRGTTKWVHQSCLSRWIDEKQRGNPLQTVSCPQCQ 78 (293)
T ss_pred ccCcccchhhhcccccccCccHHHHHHHHHHHHhHHhcCCCCceeechhhc
Confidence 34566543 9999999873 235666653333478999985
No 108
>PRK12860 transcriptional activator FlhC; Provisional
Probab=33.39 E-value=23 Score=33.09 Aligned_cols=28 Identities=25% Similarity=0.647 Sum_probs=18.5
Q ss_pred EEEecC-CCCcEEeehhhhhcCCcEEeCCCCc
Q psy12281 22 YYYPCP-CGDRFQITKEQLKAGEDVATCPTCS 52 (456)
Q Consensus 22 ~~~~Cr-Cg~~f~i~~~~l~~~~~~~~C~~CS 52 (456)
=.-+|+ ||+.|+..-.++..+ ..|+-|.
T Consensus 133 ~l~~C~~Cgg~fv~~~~e~~~~---f~CplC~ 161 (189)
T PRK12860 133 QLARCCRCGGKFVTHAHDLRHN---FVCGLCQ 161 (189)
T ss_pred eeccCCCCCCCeeccccccCCC---CcCCCCC
Confidence 455674 888888766654443 4777777
No 109
>KOG1246|consensus
Probab=33.31 E-value=56 Score=38.01 Aligned_cols=112 Identities=17% Similarity=0.143 Sum_probs=78.4
Q ss_pred ccceeeeCCCCCcccccccccCccceeEEEEe-EEEEEEeCCCCCcccccC-----CCC--ccccC-------CCcccCC
Q psy12281 229 DYRFVYMGPKETWTPLHADVFHSYSWSVNICG-RKQWLLLAPGNEKYFKDS-----MGN--LISDM-------RSVDWST 293 (456)
Q Consensus 229 ~~~~l~iG~~gs~T~lH~D~~~~~sW~~~i~G-~Krw~L~pP~~~~~Ly~~-----~g~--~p~d~-------~~pd~~~ 293 (456)
.-.++|+|..-+.-.+|.+....++-+.+-.| .|.|.-+|+.....+-.. +|. .+.|. .+|.+-.
T Consensus 329 ~~p~l~~gm~fs~~~wh~ed~~~~slny~h~g~pk~wy~v~~~~ae~~e~~~~~~~p~~~~~~pd~~~~~~~~~~p~~l~ 408 (904)
T KOG1246|consen 329 TVPWLYIGMCFSTFCWHVEDHSLYSLNYLHLGEPKTWYSVPGSAAEKFEKAMNKLSPGLFIEQPDLLHALVTLMSPNFLT 408 (904)
T ss_pred ccccccccccccccccccCCccccccchhhcCCceEEEecCcchHHHHHHHHHhhCCcccccCcccccccccccCcchhh
Confidence 34678999998989999998888766544444 599999999987644321 111 11111 1121111
Q ss_pred CCCCceEEEEEcCCCEEEeCCCCeEEEEeCCCcEEEEeecCCCCCHH
Q psy12281 294 LPRDTVIIVEQEAGDSIFVPSGWHHQVTNLEHTISINHNWINGTNID 340 (456)
Q Consensus 294 fp~~~~~~~~l~pGD~LfIPsgW~H~V~nl~~sIsvn~~~~~~~nl~ 340 (456)
....+.+.++|+||+.++-=+.=||...+-+.+.+-..||.+...|.
T Consensus 409 ~~gvpv~~~~q~~ge~vitfP~~Y~~g~~~gf~~~e~vn~ap~dwl~ 455 (904)
T KOG1246|consen 409 DEGVPVYRTVQNPGEFVITFPRAYHAGFNCGFNFAEAVNFAPSDWLP 455 (904)
T ss_pred cCCCCceecccCCCCEeecCCCeeeecccccccHHHhcccCCcchhH
Confidence 23489999999999999999999999999998877777777665433
No 110
>COG4297 Uncharacterized protein containing double-stranded beta helix domain [Function unknown]
Probab=32.78 E-value=2.9e+02 Score=24.61 Aligned_cols=40 Identities=18% Similarity=0.258 Sum_probs=30.0
Q ss_pred CceEEEEEcCCCEEEeCCCCeEEEEeCCCcEEEEeecCCC
Q psy12281 297 DTVIIVEQEAGDSIFVPSGWHHQVTNLEHTISINHNWING 336 (456)
Q Consensus 297 ~~~~~~~l~pGD~LfIPsgW~H~V~nl~~sIsvn~~~~~~ 336 (456)
+...++.+..||+|.||+|--|.-..-..-+.|-..|-..
T Consensus 82 ~~G~el~v~~GDvlliPAGvGH~rl~sS~DF~VvGaYp~G 121 (163)
T COG4297 82 ADGQELEVGEGDVLLIPAGVGHCRLHSSADFQVVGAYPPG 121 (163)
T ss_pred CCCceeeecCCCEEEEecCcccccccCCCCeEEEcccCCc
Confidence 4456788999999999999999877666656665555443
No 111
>PF05605 zf-Di19: Drought induced 19 protein (Di19), zinc-binding; InterPro: IPR008598 This entry consists of several drought induced 19 (Di19) like and RING finger 114 proteins. Di19 has been found to be strongly expressed in both the roots and leaves of Arabidopsis thaliana during progressive drought [], whilst RING finger proteins are thought to play a role in spermatogenesis. The precise function is unknown.
Probab=32.77 E-value=22 Score=25.86 Aligned_cols=29 Identities=31% Similarity=0.854 Sum_probs=20.8
Q ss_pred EEecC-CCCcEEeehhhhhc--------CCcEEeCCCCcc
Q psy12281 23 YYPCP-CGDRFQITKEQLKA--------GEDVATCPTCSL 53 (456)
Q Consensus 23 ~~~Cr-Cg~~f~i~~~~l~~--------~~~~~~C~~CSl 53 (456)
+|.|| ||. .++...|.+ ....+.|+-|+-
T Consensus 2 ~f~CP~C~~--~~~~~~L~~H~~~~H~~~~~~v~CPiC~~ 39 (54)
T PF05605_consen 2 SFTCPYCGK--GFSESSLVEHCEDEHRSESKNVVCPICSS 39 (54)
T ss_pred CcCCCCCCC--ccCHHHHHHHHHhHCcCCCCCccCCCchh
Confidence 47897 998 366777754 123689999985
No 112
>PRK12722 transcriptional activator FlhC; Provisional
Probab=32.67 E-value=24 Score=32.92 Aligned_cols=28 Identities=21% Similarity=0.548 Sum_probs=17.5
Q ss_pred EEecC-CCCcEEeehhhhhcCCcEEeCCCCcc
Q psy12281 23 YYPCP-CGDRFQITKEQLKAGEDVATCPTCSL 53 (456)
Q Consensus 23 ~~~Cr-Cg~~f~i~~~~l~~~~~~~~C~~CSl 53 (456)
.-+|+ ||+.|+..-.++..+ ..|+-|..
T Consensus 134 l~~C~~Cgg~fv~~~~e~~~~---f~CplC~~ 162 (187)
T PRK12722 134 LSSCNCCGGHFVTHAHDPVGS---FVCGLCQP 162 (187)
T ss_pred eccCCCCCCCeeccccccCCC---CcCCCCCC
Confidence 44574 888887766654332 46777766
No 113
>PF09862 DUF2089: Protein of unknown function (DUF2089); InterPro: IPR018658 This family consists of various hypothetical prokaryotic proteins.
Probab=32.03 E-value=56 Score=27.98 Aligned_cols=25 Identities=36% Similarity=0.974 Sum_probs=20.8
Q ss_pred cC-CCCcEEeehhhhhcCCcEEeCCCCcceEEEee
Q psy12281 26 CP-CGDRFQITKEQLKAGEDVATCPTCSLIIKPSV 59 (456)
Q Consensus 26 Cr-Cg~~f~i~~~~l~~~~~~~~C~~CSl~i~~~~ 59 (456)
|| ||+...|| -..|++|--.|...|
T Consensus 1 CPvCg~~l~vt---------~l~C~~C~t~i~G~F 26 (113)
T PF09862_consen 1 CPVCGGELVVT---------RLKCPSCGTEIEGEF 26 (113)
T ss_pred CCCCCCceEEE---------EEEcCCCCCEEEeee
Confidence 75 99888885 368999999999888
No 114
>TIGR03028 EpsE polysaccharide export protein EpsE. Sequences in this family of proteins are members of a polysaccharide export protein family (pfam02563) which includes the wza protein from E.coli. This family of proteins are homologous to the EpsE protein of the methanolan biosynthesis operon of Methylobacillus species strain 12S. The distribution of this protein appears to be restricted to a subset of exopolysaccharide operons containing a syntenic grouping of genes including a variant of the EpsH exosortase protein. Exosortase has been proposed to be involved in the targetting and processing of proteins containing the PEP-CTERM domain to the exopolysaccharide layer.
Probab=31.76 E-value=26 Score=33.85 Aligned_cols=15 Identities=20% Similarity=0.288 Sum_probs=13.3
Q ss_pred EEcCCCEEEeCCCCe
Q psy12281 303 EQEAGDSIFVPSGWH 317 (456)
Q Consensus 303 ~l~pGD~LfIPsgW~ 317 (456)
.++|||+||||..|+
T Consensus 225 ~l~~gDii~V~~s~~ 239 (239)
T TIGR03028 225 LVQPDDVIYVRESLF 239 (239)
T ss_pred ccCCCCEEEEeCccC
Confidence 489999999999885
No 115
>COG3822 ABC-type sugar transport system, auxiliary component [General function prediction only]
Probab=31.44 E-value=1.7e+02 Score=27.44 Aligned_cols=33 Identities=21% Similarity=0.276 Sum_probs=27.3
Q ss_pred CceEEEEEcCCCEEEeCCCCeEEEEeCCCcEEE
Q psy12281 297 DTVIIVEQEAGDSIFVPSGWHHQVTNLEHTISI 329 (456)
Q Consensus 297 ~~~~~~~l~pGD~LfIPsgW~H~V~nl~~sIsv 329 (456)
...-.+.|+||+-+=+|+|-||.-..-+.-+-|
T Consensus 150 ~ag~~lkL~PGesitL~Pg~~HsFwae~g~vlv 182 (225)
T COG3822 150 TAGSQLKLSPGESITLPPGLYHSFWAEEGGVLV 182 (225)
T ss_pred ccceeEEECCCCcEecCCCceeeeeecCCcEEE
Confidence 455678999999999999999999987765443
No 116
>TIGR03504 FimV_Cterm FimV C-terminal domain. This protein is found at the extreme C-terminus of FimV from Pseudomonas aeruginosa, and of TspA of Neisseria meningitidis. Disruption of the former blocks twitching motility from type IV pili; Semmler, et al. suggest a role in peptidoglycan layer remodelling required by type IV fimbrial systems.
Probab=31.42 E-value=53 Score=23.10 Aligned_cols=31 Identities=26% Similarity=0.356 Sum_probs=27.7
Q ss_pred cHHHHHHHHHHhhcCCccchhhhHHHHHHHh
Q psy12281 422 DLRKASDVLKLCAQHEDWDLKLETNKLIEDL 452 (456)
Q Consensus 422 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 452 (456)
|-..-+++|.+++.+.++..+.|-.++++.|
T Consensus 14 d~e~Ar~lL~evl~~~~~~q~~eA~~LL~~l 44 (44)
T TIGR03504 14 DLEGARELLEEVIEEGDEAQRQEARALLAQL 44 (44)
T ss_pred ChHHHHHHHHHHHHcCCHHHHHHHHHHHhcC
Confidence 7778899999999999999999999998865
No 117
>PF14353 CpXC: CpXC protein
Probab=30.85 E-value=28 Score=30.06 Aligned_cols=42 Identities=24% Similarity=0.453 Sum_probs=29.7
Q ss_pred ecC-CCCcEEeehhh-------------hhcCC-cEEeCCCCcceEEEeeeehhhhh
Q psy12281 25 PCP-CGDRFQITKEQ-------------LKAGE-DVATCPTCSLIIKPSVIKSVMLM 66 (456)
Q Consensus 25 ~Cr-Cg~~f~i~~~~-------------l~~~~-~~~~C~~CSl~i~~~~~~~~~~~ 66 (456)
.|+ ||..|.++-.. |.+|+ -.+.|+.|--..++.|.-.+..+
T Consensus 3 tCP~C~~~~~~~v~~~I~~~~~p~l~e~il~g~l~~~~CP~Cg~~~~~~~p~lY~D~ 59 (128)
T PF14353_consen 3 TCPHCGHEFEFEVWTSINADEDPELKEKILDGSLFSFTCPSCGHKFRLEYPLLYHDP 59 (128)
T ss_pred CCCCCCCeeEEEEEeEEcCcCCHHHHHHHHcCCcCEEECCCCCCceecCCCEEEEcC
Confidence 597 99877764332 22343 37999999999998886666655
No 118
>PRK12335 tellurite resistance protein TehB; Provisional
Probab=30.17 E-value=1.9e+02 Score=28.48 Aligned_cols=57 Identities=9% Similarity=0.136 Sum_probs=38.8
Q ss_pred CccceeEEEEeEEEEEEeCCCCCcccccCCCCccccCCCcccCCCCCCceEEEEEcCCCEEEeCCCCeEEEEeCCCcEEE
Q psy12281 250 HSYSWSVNICGRKQWLLLAPGNEKYFKDSMGNLISDMRSVDWSTLPRDTVIIVEQEAGDSIFVPSGWHHQVTNLEHTISI 329 (456)
Q Consensus 250 ~~~sW~~~i~G~Krw~L~pP~~~~~Ly~~~g~~p~d~~~pd~~~fp~~~~~~~~l~pGD~LfIPsgW~H~V~nl~~sIsv 329 (456)
++...+.+++|+=.++++.+.... ...+.....++.-+||++=||.+..+++.+..
T Consensus 32 g~~~~~~vl~G~l~~~~~de~g~~------------------------~~~~~l~~~~~~~~i~p~~wh~v~~~s~d~~~ 87 (287)
T PRK12335 32 GTWAKLTVLKGELKFYELTEDGEE------------------------LSEHIFDAENQPPFIEPQAWHRIEAASDDLEC 87 (287)
T ss_pred CcceEEEEEeeeEEEEEECCCCCe------------------------eeEEEEecCCCCceeCCcceEEEEEcCCCcEE
Confidence 344557899999999999654422 12233444456667999999999999774443
Q ss_pred E
Q psy12281 330 N 330 (456)
Q Consensus 330 n 330 (456)
.
T Consensus 88 ~ 88 (287)
T PRK12335 88 Q 88 (287)
T ss_pred E
Confidence 3
No 119
>PRK13501 transcriptional activator RhaR; Provisional
Probab=29.23 E-value=64 Score=31.62 Aligned_cols=26 Identities=15% Similarity=0.299 Sum_probs=22.7
Q ss_pred EEEEEcCCCEEEeCCCCeEEEEeCCC
Q psy12281 300 IIVEQEAGDSIFVPSGWHHQVTNLEH 325 (456)
Q Consensus 300 ~~~~l~pGD~LfIPsgW~H~V~nl~~ 325 (456)
....+++||++|||+|=.|++...++
T Consensus 56 ~~~~l~~g~~~~I~p~~~H~~~~~~~ 81 (290)
T PRK13501 56 HPYRITCGDVFYIQAADHHSYESVHD 81 (290)
T ss_pred eeeeecCCeEEEEcCCCcccccccCC
Confidence 46889999999999999999887655
No 120
>COG5553 Predicted metal-dependent enzyme of the double-stranded beta helix superfamily [General function prediction only]
Probab=29.20 E-value=1.3e+02 Score=27.50 Aligned_cols=27 Identities=26% Similarity=0.206 Sum_probs=17.1
Q ss_pred CceEEEEEcCCCEEEeCCCCeEEEEeCCC--cEEEEe
Q psy12281 297 DTVIIVEQEAGDSIFVPSGWHHQVTNLEH--TISINH 331 (456)
Q Consensus 297 ~~~~~~~l~pGD~LfIPsgW~H~V~nl~~--sIsvn~ 331 (456)
+.+=+++|.|||+ |+|.|... |+||..
T Consensus 126 ~apgeV~lSpgdi--------hsv~n~~sdrs~aiHv 154 (191)
T COG5553 126 AAPGEVHLSPGDI--------HSVANTGSDRSGAIHV 154 (191)
T ss_pred cCcceEeeCCCCe--------eeecccCCCccceEEE
Confidence 3444566777666 78887654 666654
No 121
>PF15138 Syncollin: Syncollin
Probab=29.16 E-value=46 Score=27.98 Aligned_cols=40 Identities=23% Similarity=0.362 Sum_probs=30.8
Q ss_pred CceEEEEEcCCC-EEEeCCCCeEEEEeCCC--cEEEEeecCCC
Q psy12281 297 DTVIIVEQEAGD-SIFVPSGWHHQVTNLEH--TISINHNWING 336 (456)
Q Consensus 297 ~~~~~~~l~pGD-~LfIPsgW~H~V~nl~~--sIsvn~~~~~~ 336 (456)
...-+..++||| .=|+|++|-|.+.+|-- -..++.|=...
T Consensus 32 C~G~~l~v~pg~DlPylPs~w~n~iSSlvV~~RCeLtVWS~~g 74 (112)
T PF15138_consen 32 CGGAELSVEPGDDLPYLPSGWANKISSLVVAPRCELTVWSRSG 74 (112)
T ss_pred cCCcEEeecCCCCCCcCCccccCeeeeEEEcCceEEEEEeccC
Confidence 677788999996 55999999999988753 66777765443
No 122
>TIGR02297 HpaA 4-hydroxyphenylacetate catabolism regulatory protein HpaA. This putative transcriptional regulator, which contains both the substrate-binding, dimerization domain (pfam02311) and the helix-turn-helix DNA-binding domain (pfam00165) of the AraC famil, is located proximal to genes of the 4-hydroxyphenylacetate catabolism pathway.
Probab=28.88 E-value=1.5e+02 Score=28.69 Aligned_cols=25 Identities=12% Similarity=-0.036 Sum_probs=21.5
Q ss_pred EEEEcCCCEEEeCCCCeEEEEeCCC
Q psy12281 301 IVEQEAGDSIFVPSGWHHQVTNLEH 325 (456)
Q Consensus 301 ~~~l~pGD~LfIPsgW~H~V~nl~~ 325 (456)
...++|||+++||+|=.|......+
T Consensus 63 ~~~l~~g~~~ii~~~~~H~~~~~~~ 87 (287)
T TIGR02297 63 EYSEYAPCFFLTPPSVPHGFVTDLD 87 (287)
T ss_pred EEEecCCeEEEeCCCCccccccCCC
Confidence 5788999999999999999876554
No 123
>PF11331 DUF3133: Protein of unknown function (DUF3133); InterPro: IPR021480 This eukaryotic family of proteins has no known function.
Probab=28.74 E-value=41 Score=24.02 Aligned_cols=17 Identities=24% Similarity=0.532 Sum_probs=14.4
Q ss_pred CcEEeCCCCcceEEEee
Q psy12281 43 EDVATCPTCSLIIKPSV 59 (456)
Q Consensus 43 ~~~~~C~~CSl~i~~~~ 59 (456)
..-++|.+||-+|.+.+
T Consensus 29 ~~klrCGaCs~vl~~s~ 45 (46)
T PF11331_consen 29 QQKLRCGACSEVLSFSL 45 (46)
T ss_pred eeEEeCCCCceeEEEec
Confidence 45799999999998875
No 124
>PHA00616 hypothetical protein
Probab=28.66 E-value=13 Score=26.37 Aligned_cols=14 Identities=50% Similarity=1.125 Sum_probs=8.7
Q ss_pred Eec-CCCCcEEeehh
Q psy12281 24 YPC-PCGDRFQITKE 37 (456)
Q Consensus 24 ~~C-rCg~~f~i~~~ 37 (456)
|.| |||..|.-..+
T Consensus 2 YqC~~CG~~F~~~s~ 16 (44)
T PHA00616 2 YQCLRCGGIFRKKKE 16 (44)
T ss_pred CccchhhHHHhhHHH
Confidence 567 67777765433
No 125
>PF13831 PHD_2: PHD-finger; PDB: 2L43_A 2KU3_A.
Probab=26.57 E-value=50 Score=22.13 Aligned_cols=18 Identities=17% Similarity=0.525 Sum_probs=11.3
Q ss_pred CcEEeCCCCcceEEEeee
Q psy12281 43 EDVATCPTCSLIIKPSVI 60 (456)
Q Consensus 43 ~~~~~C~~CSl~i~~~~~ 60 (456)
+.++.|++|++.|+-.-.
T Consensus 2 n~ll~C~~C~v~VH~~CY 19 (36)
T PF13831_consen 2 NPLLFCDNCNVAVHQSCY 19 (36)
T ss_dssp CEEEE-SSS--EEEHHHH
T ss_pred CceEEeCCCCCcCChhhC
Confidence 468999999999985543
No 126
>KOG0162|consensus
Probab=26.55 E-value=65 Score=36.13 Aligned_cols=29 Identities=34% Similarity=0.518 Sum_probs=22.9
Q ss_pred CCCCceEEEEEcCCCEEEe----CCCCeEEEEe
Q psy12281 294 LPRDTVIIVEQEAGDSIFV----PSGWHHQVTN 322 (456)
Q Consensus 294 fp~~~~~~~~l~pGD~LfI----PsgW~H~V~n 322 (456)
|+...+-+..++.||+|+| |||||-.-.+
T Consensus 1060 y~gq~~dEls~~~~diIei~~edpSGWw~gk~~ 1092 (1106)
T KOG0162|consen 1060 YPGQDVDELSFKKGDIIEIMREDPSGWWLGKLN 1092 (1106)
T ss_pred CCCCCcccccccCCCEEEEeccCCCcchhhccC
Confidence 4555667888999999999 9999976533
No 127
>PF08271 TF_Zn_Ribbon: TFIIB zinc-binding; InterPro: IPR013137 Zinc finger (Znf) domains are relatively small protein motifs which contain multiple finger-like protrusions that make tandem contacts with their target molecule. Some of these domains bind zinc, but many do not; instead binding other metals such as iron, or no metal at all. For example, some family members form salt bridges to stabilise the finger-like folds. They were first identified as a DNA-binding motif in transcription factor TFIIIA from Xenopus laevis (African clawed frog), however they are now recognised to bind DNA, RNA, protein and/or lipid substrates [, , , , ]. Their binding properties depend on the amino acid sequence of the finger domains and of the linker between fingers, as well as on the higher-order structures and the number of fingers. Znf domains are often found in clusters, where fingers can have different binding specificities. There are many superfamilies of Znf motifs, varying in both sequence and structure. They display considerable versatility in binding modes, even between members of the same class (e.g. some bind DNA, others protein), suggesting that Znf motifs are stable scaffolds that have evolved specialised functions. For example, Znf-containing proteins function in gene transcription, translation, mRNA trafficking, cytoskeleton organisation, epithelial development, cell adhesion, protein folding, chromatin remodelling and zinc sensing, to name but a few []. Zinc-binding motifs are stable structures, and they rarely undergo conformational changes upon binding their target. This entry represents a zinc finger motif found in transcription factor IIB (TFIIB). In eukaryotes the initiation of transcription of protein encoding genes by the polymerase II complexe (Pol II) is modulated by general and specific transcription factors. The general transcription factors operate through common promoters elements (such as the TATA box). At least seven different proteins associate to form the general transcription factors: TFIIA, -IIB, -IID, -IIE, -IIF, -IIG, and -IIH []. TFIIB and TFIID are responsible for promoter recognition and interaction with pol II; together with Pol II, they form a minimal initiation complex capable of transcription under certain conditions. The TATA box of a Pol II promoter is bound in the initiation complex by the TBP subunit of TFIID, which bends the DNA around the C-terminal domain of TFIIB whereas the N-terminal zinc finger of TFIIB interacts with Pol II [, ]. The TFIIB zinc finger adopts a zinc ribbon fold characterised by two beta-hairpins forming two structurally similar zinc-binding sub-sites []. The zinc finger contacts the rbp1 subunit of Pol II through its dock domain, a conserved region of about 70 amino acids located close to the polymerase active site []. In the Pol II complex this surface is located near the RNA exit groove. Interestingly this sequence is best conserved in the three polymerases that utilise a TFIIB-like general transcription factor (Pol II, Pol III, and archaeal RNA polymerase) but not in Pol I []. More information about these proteins can be found at Protein of the Month: Zinc Fingers [].; GO: 0008270 zinc ion binding, 0006355 regulation of transcription, DNA-dependent; PDB: 1VD4_A 1PFT_A 3K1F_M 3K7A_M 1RO4_A 1RLY_A 1DL6_A.
Probab=25.96 E-value=81 Score=21.72 Aligned_cols=30 Identities=27% Similarity=0.810 Sum_probs=18.5
Q ss_pred Eec-CCCCcEEeehhhhhcCCcEEeCCCCcceEEE
Q psy12281 24 YPC-PCGDRFQITKEQLKAGEDVATCPTCSLIIKP 57 (456)
Q Consensus 24 ~~C-rCg~~f~i~~~~l~~~~~~~~C~~CSl~i~~ 57 (456)
|.| .||....+.... .| -.-|..|-++|.-
T Consensus 1 m~Cp~Cg~~~~~~D~~--~g--~~vC~~CG~Vl~e 31 (43)
T PF08271_consen 1 MKCPNCGSKEIVFDPE--RG--ELVCPNCGLVLEE 31 (43)
T ss_dssp ESBTTTSSSEEEEETT--TT--EEEETTT-BBEE-
T ss_pred CCCcCCcCCceEEcCC--CC--eEECCCCCCEeec
Confidence 568 499877543322 23 4589999988863
No 128
>PF08273 Prim_Zn_Ribbon: Zinc-binding domain of primase-helicase; InterPro: IPR013237 This entry is represented by bacteriophage T7 Gp4. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This entry represents a zinc binding domain found in the N-terminal region of the bacteriophage T7 Gp4 and P4 alpha protein. P4 is a multifunctional protein with origin recognition, helicase and primase activities [, , ].; GO: 0003896 DNA primase activity, 0004386 helicase activity, 0008270 zinc ion binding; PDB: 1NUI_B.
Probab=25.47 E-value=57 Score=22.54 Aligned_cols=28 Identities=39% Similarity=0.871 Sum_probs=14.0
Q ss_pred EEEecC-CC--CcEEeehhhhhcCCcEEeCCCC
Q psy12281 22 YYYPCP-CG--DRFQITKEQLKAGEDVATCPTC 51 (456)
Q Consensus 22 ~~~~Cr-Cg--~~f~i~~~~l~~~~~~~~C~~C 51 (456)
+--||| || |.|.|..+.-.. -.--|.+|
T Consensus 2 ~h~pCP~CGG~DrFri~~d~~~~--G~~~C~~C 32 (40)
T PF08273_consen 2 KHGPCPICGGKDRFRIFDDKDGR--GTWICRQC 32 (40)
T ss_dssp EEE--TTTT-TTTEEEETT------S-EEETTT
T ss_pred CCCCCCCCcCccccccCcCcccC--CCEECCCC
Confidence 345886 88 689966654222 34567777
No 129
>COG5175 MOT2 Transcriptional repressor [Transcription]
Probab=25.35 E-value=13 Score=37.64 Aligned_cols=18 Identities=39% Similarity=1.189 Sum_probs=10.7
Q ss_pred eCCCccccCCCceEEEecCCC
Q psy12281 9 EIEDFEYDEDDETYYYPCPCG 29 (456)
Q Consensus 9 ~l~d~~~~~~~~~~~~~CrCg 29 (456)
.+++|+..+. -+.||+||
T Consensus 20 cie~mditdk---nf~pc~cg 37 (480)
T COG5175 20 CIEPMDITDK---NFFPCPCG 37 (480)
T ss_pred cccccccccC---CcccCCcc
Confidence 3444444433 37788888
No 130
>PF05876 Terminase_GpA: Phage terminase large subunit (GpA); InterPro: IPR008866 This entry is represented by Bacteriophage lambda, GpA. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This entry consists of several phage terminase large subunit proteins as well as related sequences from several bacterial species. The DNA packaging enzyme of bacteriophage lambda, terminase, is a heteromultimer composed of a small subunit, gpNu1, and a large subunit, gpA, products of the Nu1 and A genes, respectively. Terminase is involved in the site-specific binding and cutting of the DNA in the initial stages of packaging. It is now known that gpA is actively involved in late stages of packaging, including DNA translocation, and that this enzyme contains separate functional domains for its early and late packaging activities [].
Probab=24.04 E-value=49 Score=36.26 Aligned_cols=39 Identities=38% Similarity=0.866 Sum_probs=27.9
Q ss_pred CCceEEEecC-CCCcEEeehhhhh------cCCcEEeCCCCcceEE
Q psy12281 18 DDETYYYPCP-CGDRFQITKEQLK------AGEDVATCPTCSLIIK 56 (456)
Q Consensus 18 ~~~~~~~~Cr-Cg~~f~i~~~~l~------~~~~~~~C~~CSl~i~ 56 (456)
+...|.-+|| ||..+.+.=+.|. .......|+.|--.|.
T Consensus 195 dqr~~~vpCPhCg~~~~l~~~~l~w~~~~~~~~a~y~C~~Cg~~i~ 240 (557)
T PF05876_consen 195 DQRRYYVPCPHCGEEQVLEWENLKWDKGEAPETARYVCPHCGCEIE 240 (557)
T ss_pred CceEEEccCCCCCCCccccccceeecCCCCccceEEECCCCcCCCC
Confidence 3456999995 9998888755552 1234789999987775
No 131
>PRK15401 alpha-ketoglutarate-dependent dioxygenase AlkB; Provisional
Probab=24.03 E-value=2.6e+02 Score=26.70 Aligned_cols=28 Identities=21% Similarity=0.190 Sum_probs=22.6
Q ss_pred CceEEEEEcCCCEEEeC---CCCeEEEEeCC
Q psy12281 297 DTVIIVEQEAGDSIFVP---SGWHHQVTNLE 324 (456)
Q Consensus 297 ~~~~~~~l~pGD~LfIP---sgW~H~V~nl~ 324 (456)
.+...+.|+.||+|.+- ..|||.|-...
T Consensus 164 ~~~~~l~L~~Gdllvm~G~sr~~~HgVp~~~ 194 (213)
T PRK15401 164 DPLQRILLEHGDVVVWGGPSRLRYHGILPLK 194 (213)
T ss_pred CceEEEEeCCCCEEEECchHhheeccCCcCC
Confidence 45678999999999995 56999996554
No 132
>PF01238 PMI_typeI: Phosphomannose isomerase type I; InterPro: IPR001250 Mannose-6-phosphate isomerase or phosphomannose isomerase (5.3.1.8 from EC) (PMI) is the enzyme that catalyses the interconversion of mannose-6-phosphate and fructose-6-phosphate. In eukaryotes PMI is involved in the synthesis of GDP-mannose, a constituent of N- and O-linked glycans and GPI anchors and in prokaryotes it participates in a variety of pathways, including capsular polysaccharide biosynthesis and D-mannose metabolism. PMI's belong to the cupin superfamily whose functions range from isomerase and epimerase activities involved in the modification of cell wall carbohydrates in bacteria and plants, to non-enzymatic storage proteins in plant seeds, and transcription factors linked to congenital baldness in mammals []. Three classes of PMI have been defined []. Type I includes eukaryotic PMI and the enzyme encoded by the manA gene in enterobacteria. PMI has a bound zinc ion, which is essential for activity. A crystal structure of PMI from Candida albicans shows that the enzyme has three distinct domains []. The active site lies in the central domain, contains a single essential zinc atom, and forms a deep, open cavity of suitable dimensions to contain M6P or F6P The central domain is flanked by a helical domain on one side and a jelly-roll like domain on the other.; GO: 0004476 mannose-6-phosphate isomerase activity, 0008270 zinc ion binding, 0005975 carbohydrate metabolic process; PDB: 1PMI_A 1QWR_B 1ZX5_A 3H1Y_A 2WFP_A 3H1M_A 3H1W_A.
Probab=23.98 E-value=47 Score=34.45 Aligned_cols=22 Identities=36% Similarity=0.669 Sum_probs=18.1
Q ss_pred EEEEcCCCEEEeCCCCeEEEEe
Q psy12281 301 IVEQEAGDSIFVPSGWHHQVTN 322 (456)
Q Consensus 301 ~~~l~pGD~LfIPsgW~H~V~n 322 (456)
.+.|+|||.+|+|+|=.|....
T Consensus 251 ~v~L~pGeaifl~a~~~HAYl~ 272 (373)
T PF01238_consen 251 YVELQPGEAIFLPAGEPHAYLS 272 (373)
T ss_dssp EEEE-TT-EEEEHTTHHEEEEE
T ss_pred EEEecCCceEEecCCCcccccc
Confidence 4699999999999999999776
No 133
>PRK00762 hypA hydrogenase nickel incorporation protein; Provisional
Probab=23.69 E-value=34 Score=29.65 Aligned_cols=31 Identities=23% Similarity=0.596 Sum_probs=19.3
Q ss_pred EEEecCCCCcEEeehhhhhcCCcEEeCCCCc
Q psy12281 22 YYYPCPCGDRFQITKEQLKAGEDVATCPTCS 52 (456)
Q Consensus 22 ~~~~CrCg~~f~i~~~~l~~~~~~~~C~~CS 52 (456)
-...|.||..|.+....+........||.|-
T Consensus 69 ~~~~C~Cg~~~~~~~~~~~~~~~~~~CP~Cg 99 (124)
T PRK00762 69 VEIECECGYEGVVDEDEIDHYAAVIECPVCG 99 (124)
T ss_pred eeEEeeCcCcccccccchhccccCCcCcCCC
Confidence 5666889988877654333221235799884
No 134
>PF00675 Peptidase_M16: Insulinase (Peptidase family M16) This is family M16 in the peptidase classification. ; InterPro: IPR011765 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Metalloproteases are the most diverse of the four main types of protease, with more than 50 families identified to date. In these enzymes, a divalent cation, usually zinc, activates the water molecule. The metal ion is held in place by amino acid ligands, usually three in number. The known metal ligands are His, Glu, Asp or Lys and at least one other residue is required for catalysis, which may play an electrophillic role. Of the known metalloproteases, around half contain an HEXXH motif, which has been shown in crystallographic studies to form part of the metal-binding site []. The HEXXH motif is relatively common, but can be more stringently defined for metalloproteases as 'abXHEbbHbc', where 'a' is most often valine or threonine and forms part of the S1' subsite in thermolysin and neprilysin, 'b' is an uncharged residue, and 'c' a hydrophobic residue. Proline is never found in this site, possibly because it would break the helical structure adopted by this motif in metalloproteases []. The majority of the sequences in this entry are metallopeptidases and non-peptidase homologs belong to MEROPS peptidase family M16 (clan ME), subfamilies M16A, M16B and M16C; they include: Insulinase, insulin-degrading enzyme (3.4.24.56 from EC) Mitochondrial processing peptidase alpha subunit, (Alpha-MPP, 3.4.24.64 from EC) Pitrlysin, Protease III precursor (3.4.24.55 from EC) Nardilysin, (3.4.24.61 from EC) Ubiquinol-cytochrome C reductase complex core protein I,mitochondrial precursor (1.10.2.2 from EC) Coenzyme PQQ synthesis protein F (3.4.99 from EC) These proteins do not share many regions of sequence similarity; the most noticeable is in the N-terminal section. This region includes a conserved histidine followed, two residues later by a glutamate and another histidine. In pitrilysin, it has been shown [] that this H-x-x-E-H motif is involved in enzymatic activity; the two histidines bind zinc and the glutamate is necessary for catalytic activity. The proteins classified as non-peptidase homologues either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity. ; GO: 0004222 metalloendopeptidase activity, 0006508 proteolysis; PDB: 3P7L_A 3P7O_A 3TUV_A 3GO9_A 1BE3_B 1PP9_B 2A06_B 1SQB_B 1SQP_B 1L0N_B ....
Probab=23.50 E-value=1e+02 Score=26.65 Aligned_cols=35 Identities=20% Similarity=0.307 Sum_probs=29.1
Q ss_pred hccHHHHHHHHHHhhcCCccchh---hhHHHHHHHhhc
Q psy12281 420 MFDLRKASDVLKLCAQHEDWDLK---LETNKLIEDLNS 454 (456)
Q Consensus 420 ~~~~~~~~~~~~~~~~~~~~~~~---~~~~~~~~~~~~ 454 (456)
.-|+..+.++|.+++.+|.|..+ .|++.++.||++
T Consensus 81 ~~~~~~~l~~l~~~~~~P~f~~~~~~~~r~~~~~ei~~ 118 (149)
T PF00675_consen 81 SEDLEKALELLADMLFNPSFDEEEFEREREQILQEIEE 118 (149)
T ss_dssp GGGHHHHHHHHHHHHHSBGGCHHHHHHHHHHHHHHHHH
T ss_pred cccchhHHHHHHHHHhCCCCCHHHHHHHHHHHHHHHHH
Confidence 44699999999999999999655 778888888875
No 135
>PRK13500 transcriptional activator RhaR; Provisional
Probab=23.46 E-value=1e+02 Score=30.67 Aligned_cols=27 Identities=11% Similarity=0.155 Sum_probs=23.3
Q ss_pred eEEEEEcCCCEEEeCCCCeEEEEeCCC
Q psy12281 299 VIIVEQEAGDSIFVPSGWHHQVTNLEH 325 (456)
Q Consensus 299 ~~~~~l~pGD~LfIPsgW~H~V~nl~~ 325 (456)
.....+.|||++|||++=.|...+.++
T Consensus 85 ~~~~~l~~Gdl~~I~~~~~H~~~~~~~ 111 (312)
T PRK13500 85 DRPYRITRGDLFYIHADDKHSYASVND 111 (312)
T ss_pred CEEEeecCCeEEEECCCCeecccccCC
Confidence 346889999999999999999887666
No 136
>PRK10371 DNA-binding transcriptional regulator MelR; Provisional
Probab=23.19 E-value=1.2e+02 Score=30.06 Aligned_cols=27 Identities=19% Similarity=0.081 Sum_probs=23.1
Q ss_pred eEEEEEcCCCEEEeCCCCeEEEEeCCC
Q psy12281 299 VIIVEQEAGDSIFVPSGWHHQVTNLEH 325 (456)
Q Consensus 299 ~~~~~l~pGD~LfIPsgW~H~V~nl~~ 325 (456)
.-++++.|||+++||++=.|.....++
T Consensus 63 g~~~~l~~Gd~ili~s~~~H~~~~~~~ 89 (302)
T PRK10371 63 NEKVQINQGHITLFWACTPHQLTDPGN 89 (302)
T ss_pred CEEEEEcCCcEEEEecCCcccccccCC
Confidence 346889999999999999999876665
No 137
>COG3435 Gentisate 1,2-dioxygenase [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=22.15 E-value=2.6e+02 Score=28.22 Aligned_cols=32 Identities=28% Similarity=0.267 Sum_probs=25.2
Q ss_pred EEEEEcCCCEEEeCCCCeEEEEeC-CCcEEEEe
Q psy12281 300 IIVEQEAGDSIFVPSGWHHQVTNL-EHTISINH 331 (456)
Q Consensus 300 ~~~~l~pGD~LfIPsgW~H~V~nl-~~sIsvn~ 331 (456)
.+..-.+||+..||+.-||...|. ++++-.++
T Consensus 299 ~rf~~~~~D~fvVPsW~~~~~~~gs~da~LFsf 331 (351)
T COG3435 299 ERFDWSAGDIFVVPSWAWHEHVNGSEDAVLFSF 331 (351)
T ss_pred EEeeccCCCEEEccCcceeecccCCcceEEEec
Confidence 356678999999999999999994 44666554
No 138
>COG5525 Bacteriophage tail assembly protein [General function prediction only]
Probab=21.89 E-value=50 Score=36.11 Aligned_cols=39 Identities=31% Similarity=0.888 Sum_probs=29.1
Q ss_pred CceEEEecC-CCCcEEeehhhhhc--C-------CcEEeCCCCcceEEE
Q psy12281 19 DETYYYPCP-CGDRFQITKEQLKA--G-------EDVATCPTCSLIIKP 57 (456)
Q Consensus 19 ~~~~~~~Cr-Cg~~f~i~~~~l~~--~-------~~~~~C~~CSl~i~~ 57 (456)
...|+-+|| ||..+.+.-++... | ...++|+.|...|+.
T Consensus 223 ~rr~yvpCPHCGe~q~l~~~e~~~~~g~~~~~~~~~~~~c~h~~~~i~~ 271 (611)
T COG5525 223 QRRFYVPCPHCGEEQQLKFGEKSGPRGLKDTPAEAAFIQCEHCGCVIRP 271 (611)
T ss_pred ceeEEeeCCCCCchhhccccccCCCcCcccchhhhhhhhccccCceeee
Confidence 345999994 99988886655432 1 236799999999997
No 139
>PF12973 Cupin_7: ChrR Cupin-like domain; PDB: 3O14_B 2Z2S_F 2Q1Z_B 3EBR_A.
Probab=21.84 E-value=1.2e+02 Score=24.28 Aligned_cols=25 Identities=28% Similarity=0.321 Sum_probs=20.7
Q ss_pred EcCCCEEEeCCCCeEEEEeCCCcEE
Q psy12281 304 QEAGDSIFVPSGWHHQVTNLEHTIS 328 (456)
Q Consensus 304 l~pGD~LfIPsgW~H~V~nl~~sIs 328 (456)
..+||.++.|+|--|+..+.++|+-
T Consensus 62 ~~~G~~~~~p~g~~h~~~s~~gc~~ 86 (91)
T PF12973_consen 62 YGAGDWLRLPPGSSHTPRSDEGCLI 86 (91)
T ss_dssp EETTEEEEE-TTEEEEEEESSCEEE
T ss_pred CCCCeEEEeCCCCccccCcCCCEEE
Confidence 4899999999999999999777543
No 140
>PF13532 2OG-FeII_Oxy_2: 2OG-Fe(II) oxygenase superfamily; PDB: 2IUW_A 3BTZ_A 3RZL_A 3RZH_A 3S5A_A 3RZG_A 3RZJ_A 3BUC_A 3H8X_A 3H8R_A ....
Probab=21.83 E-value=1.8e+02 Score=26.36 Aligned_cols=28 Identities=21% Similarity=0.458 Sum_probs=20.6
Q ss_pred CceEEEEEcCCCEEEeC----CCCeEEEEeCCC
Q psy12281 297 DTVIIVEQEAGDSIFVP----SGWHHQVTNLEH 325 (456)
Q Consensus 297 ~~~~~~~l~pGD~LfIP----sgW~H~V~nl~~ 325 (456)
...+.+.|.+|+++.+- ..| |.|.....
T Consensus 145 ~~~~~~~L~~gsl~vm~g~~r~~~-H~I~~~~~ 176 (194)
T PF13532_consen 145 DEPIEVPLPPGSLLVMSGEARYDW-HGIPPVKK 176 (194)
T ss_dssp S-EEEEEE-TTEEEEEETTHHHHE-EEE-S-SC
T ss_pred CccEEEEcCCCCEEEeChHHhhhe-eEcccccC
Confidence 46789999999999998 568 99987765
No 141
>PRK06260 threonine synthase; Validated
Probab=21.31 E-value=58 Score=33.89 Aligned_cols=33 Identities=24% Similarity=0.689 Sum_probs=24.4
Q ss_pred EEEec-CCCCcEEeehhhhhcCCcEEeCCCCcceEEEeeee
Q psy12281 22 YYYPC-PCGDRFQITKEQLKAGEDVATCPTCSLIIKPSVIK 61 (456)
Q Consensus 22 ~~~~C-rCg~~f~i~~~~l~~~~~~~~C~~CSl~i~~~~~~ 61 (456)
|.+.| +||..|..+. ....|+.|.-.+.+.|+.
T Consensus 2 ~~~~C~~cg~~~~~~~-------~~~~Cp~cg~~l~~~y~~ 35 (397)
T PRK06260 2 YWLKCIECGKEYDPDE-------IIYTCPECGGLLEVIYDL 35 (397)
T ss_pred CEEEECCCCCCCCCCC-------ccccCCCCCCeEEEEecc
Confidence 56889 8998875421 245799998888888863
No 142
>PF12851 Tet_JBP: Oxygenase domain of the 2OGFeDO superfamily ; InterPro: IPR024779 TETs are 2OG- and Fe(II)-dependent oxygenases that catalyse the conversion of 5 methyl-Cytosine (5-MC) to 5-hydroxymethyl-cytosine (hmC) in cultured cells and in vitro []. Interestingly TET2 is considered as an oncogene, as it is found mutated in some types of cancer []. This entry represents the double-stranded beta helix (DSBH) fold of the 2-oxoglutarate (2OG) - Fe(II) oxygenases. DSBH comprises a part of the catalytic domain in TETS. It is found in many organisms including fruit fly, African malaria mosquito, zebrafish, mouse and human.
Probab=21.05 E-value=3.4e+02 Score=24.79 Aligned_cols=28 Identities=18% Similarity=0.120 Sum_probs=24.5
Q ss_pred CceEEEEEcCCCEEEeCCC-CeEEEEeCC
Q psy12281 297 DTVIIVEQEAGDSIFVPSG-WHHQVTNLE 324 (456)
Q Consensus 297 ~~~~~~~l~pGD~LfIPsg-W~H~V~nl~ 324 (456)
.-.+.+...||++|+.-+. --|.|..++
T Consensus 125 ~~g~~~~~~~GtVl~~~~~~~~Hgvtpv~ 153 (171)
T PF12851_consen 125 ILGVAFAYQPGTVLIFCAKRELHGVTPVE 153 (171)
T ss_pred cCCEEEecCCCcEEEEcccceeeecCccc
Confidence 3577899999999999988 889988877
No 143
>KOG1816|consensus
Probab=20.85 E-value=83 Score=31.35 Aligned_cols=83 Identities=19% Similarity=0.328 Sum_probs=55.8
Q ss_pred EEeCCCCCcccccCC--CCccccCCCcccCCCCCCceEEEEEcCCCEEEeCCCCeEEEEeCCC--cEEE-----------
Q psy12281 265 LLLAPGNEKYFKDSM--GNLISDMRSVDWSTLPRDTVIIVEQEAGDSIFVPSGWHHQVTNLEH--TISI----------- 329 (456)
Q Consensus 265 ~L~pP~~~~~Ly~~~--g~~p~d~~~pd~~~fp~~~~~~~~l~pGD~LfIPsgW~H~V~nl~~--sIsv----------- 329 (456)
++.||+.-..|.... ++.-|.+++.+.++..++-.+|++-+.|. +|+| .|--|-..+++ -|-|
T Consensus 40 IilPPSaL~~Ls~lnI~yPMlFkLtn~~~~r~THcGVLEF~AeEG~-vyLP-~WMmq~L~le~gdlv~i~~v~lpkgtyv 117 (308)
T KOG1816|consen 40 IILPPSALDRLSSLNITYPMLFKLTNVDVDRVTHCGVLEFTAEEGR-VYLP-YWMMQNLLLEEGDLVRIRSVTLPKGTYV 117 (308)
T ss_pred EEeCHHHHHHHHHhcCCCceEEEEeccccceeeeeeEEEEEecCce-EEee-hHhhhhccCCCCCeEEEEEeecccccee
Confidence 456887766655422 22226666666655556888999999996 5888 88888888876 2211
Q ss_pred -----EeecCCCCCHHHHHHHHHHH
Q psy12281 330 -----NHNWINGTNIDHVYHEMVSH 349 (456)
Q Consensus 330 -----n~~~~~~~nl~~~~~~l~~~ 349 (456)
+..|.+-+|-.++++...+.
T Consensus 118 KLqP~s~dFLDItNpKAvLE~~LRn 142 (308)
T KOG1816|consen 118 KLQPHSVDFLDITNPKAVLENALRN 142 (308)
T ss_pred eeccCCCCccccCChHHHHHHHHhh
Confidence 45677888877877766544
No 144
>KOG1379|consensus
Probab=20.47 E-value=1.6e+02 Score=29.88 Aligned_cols=55 Identities=18% Similarity=0.195 Sum_probs=34.5
Q ss_pred ceeEEEEeEEEEEEeCCCCCcccccCCCCccccCCCc--cc----CCCCC-CceEEEEEcCCCEEEeC
Q psy12281 253 SWSVNICGRKQWLLLAPGNEKYFKDSMGNLISDMRSV--DW----STLPR-DTVIIVEQEAGDSIFVP 313 (456)
Q Consensus 253 sW~~~i~G~Krw~L~pP~~~~~Ly~~~g~~p~d~~~p--d~----~~fp~-~~~~~~~l~pGD~LfIP 313 (456)
||+.+|+.-|...=-|+.+..+ +.||.+..+ .. .+.|+ +..+.+.+++||+|.+-
T Consensus 191 SGF~VvR~G~vv~~S~~Q~H~F------N~PyQLs~~p~~~~~~~~d~p~~ad~~~~~v~~GDvIilA 252 (330)
T KOG1379|consen 191 SGFLVVREGKVVFRSPEQQHYF------NTPYQLSSPPEGYSSYISDVPDSADVTSFDVQKGDVIILA 252 (330)
T ss_pred cceEEEECCEEEEcCchheecc------CCceeeccCCccccccccCCccccceEEEeccCCCEEEEe
Confidence 5677777776555444444332 234555443 22 23466 89999999999998764
No 145
>PF11699 CENP-C_C: Mif2/CENP-C like; PDB: 2VPV_B.
Probab=20.33 E-value=1.8e+02 Score=23.46 Aligned_cols=27 Identities=22% Similarity=0.259 Sum_probs=20.2
Q ss_pred EEEEEcCCCEEEeCCCCeEEEEeCCCc
Q psy12281 300 IIVEQEAGDSIFVPSGWHHQVTNLEHT 326 (456)
Q Consensus 300 ~~~~l~pGD~LfIPsgW~H~V~nl~~s 326 (456)
-+.++.+|+..+||+|=.....|+.+.
T Consensus 51 ~~f~v~~G~~F~VP~gN~Y~i~N~~~~ 77 (85)
T PF11699_consen 51 TSFVVTKGGSFQVPRGNYYSIKNIGNE 77 (85)
T ss_dssp EEEEEETT-EEEE-TT-EEEEEE-SSS
T ss_pred cEEEEeCCCEEEECCCCEEEEEECCCC
Confidence 367899999999999999999998873
No 146
>PRK03824 hypA hydrogenase nickel incorporation protein; Provisional
Probab=20.20 E-value=65 Score=28.36 Aligned_cols=14 Identities=21% Similarity=0.812 Sum_probs=11.7
Q ss_pred EEEecC-CCCcEEee
Q psy12281 22 YYYPCP-CGDRFQIT 35 (456)
Q Consensus 22 ~~~~Cr-Cg~~f~i~ 35 (456)
-.+.|+ ||..|.+.
T Consensus 69 ~~~~C~~CG~~~~~~ 83 (135)
T PRK03824 69 AVLKCRNCGNEWSLK 83 (135)
T ss_pred eEEECCCCCCEEecc
Confidence 678895 99888886
Done!