Query psy15537
Match_columns 80
No_of_seqs 100 out of 586
Neff 5.9
Searched_HMMs 46136
Date Fri Aug 16 23:51:58 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy15537.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/15537hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF03727 Hexokinase_2: Hexokin 99.9 9.3E-25 2E-29 156.3 5.7 73 1-73 163-243 (243)
2 PLN02596 hexokinase-like 99.9 5.1E-24 1.1E-28 166.1 6.5 74 1-74 408-487 (490)
3 PLN02405 hexokinase 99.9 5.9E-24 1.3E-28 166.0 6.8 74 1-74 408-492 (497)
4 PLN02362 hexokinase 99.9 1.4E-23 3.1E-28 164.2 7.0 76 1-76 407-503 (509)
5 PLN02914 hexokinase 99.9 1.2E-23 2.7E-28 164.0 6.4 73 1-73 406-489 (490)
6 PTZ00107 hexokinase; Provision 99.9 1.5E-23 3.4E-28 162.5 6.6 74 1-74 386-463 (464)
7 KOG1369|consensus 99.9 3.9E-23 8.4E-28 160.7 5.8 76 1-76 393-472 (474)
8 COG5026 Hexokinase [Carbohydra 99.6 9.2E-16 2E-20 118.9 5.3 76 1-76 386-464 (466)
9 PF02782 FGGY_C: FGGY family o 95.8 0.031 6.8E-07 37.5 5.6 51 15-71 147-197 (198)
10 PRK13310 N-acetyl-D-glucosamin 84.5 3.7 8E-05 29.5 5.7 62 4-68 234-300 (303)
11 PF01869 BcrAD_BadFG: BadF/Bad 84.0 0.48 1E-05 33.7 0.9 61 7-68 210-271 (271)
12 KOG1794|consensus 80.1 4.4 9.6E-05 31.1 4.8 66 7-72 248-318 (336)
13 TIGR00744 ROK_glcA_fam ROK fam 78.9 8.6 0.00019 27.7 5.9 62 5-68 241-308 (318)
14 PRK04123 ribulokinase; Provisi 78.2 5.6 0.00012 31.3 5.1 49 19-72 440-488 (548)
15 TIGR01312 XylB D-xylulose kina 74.3 7.7 0.00017 29.6 4.8 48 19-72 392-439 (481)
16 PRK09698 D-allose kinase; Prov 73.7 19 0.00041 25.8 6.5 63 4-68 225-294 (302)
17 PLN02295 glycerol kinase 73.0 6.9 0.00015 30.6 4.3 47 20-72 415-461 (512)
18 PRK05082 N-acetylmannosamine k 72.4 13 0.00029 26.5 5.4 61 4-67 222-285 (291)
19 TIGR01314 gntK_FGGY gluconate 72.2 13 0.00029 28.9 5.8 47 20-72 404-450 (505)
20 TIGR01234 L-ribulokinase L-rib 71.9 12 0.00027 29.4 5.5 49 19-72 437-485 (536)
21 PTZ00281 actin; Provisional 70.6 2.3 5.1E-05 32.0 1.2 41 30-70 306-350 (376)
22 PTZ00466 actin-like protein; P 70.1 3 6.5E-05 31.7 1.7 41 30-70 310-354 (380)
23 PRK00047 glpK glycerol kinase; 68.8 11 0.00023 29.4 4.6 46 21-72 407-452 (498)
24 PTZ00452 actin; Provisional 68.1 4.4 9.5E-05 30.7 2.2 49 21-70 297-349 (375)
25 PF00022 Actin: Actin; InterP 67.6 13 0.00029 27.4 4.7 43 29-71 321-368 (393)
26 TIGR01311 glycerol_kin glycero 67.4 12 0.00026 29.1 4.5 47 20-72 402-448 (493)
27 PF00752 XPG_N: XPG N-terminal 67.3 3.4 7.3E-05 25.3 1.3 43 18-61 26-83 (101)
28 PRK15027 xylulokinase; Provisi 66.8 21 0.00046 27.6 5.8 47 20-72 389-436 (484)
29 TIGR01315 5C_CHO_kinase FGGY-f 66.6 14 0.0003 29.2 4.9 47 20-72 446-492 (541)
30 COG1069 AraB Ribulose kinase [ 66.2 17 0.00038 29.7 5.3 61 6-72 417-480 (544)
31 PRK10331 L-fuculokinase; Provi 66.1 13 0.00027 28.7 4.5 47 20-72 392-438 (470)
32 PTZ00004 actin-2; Provisional 66.1 3.8 8.2E-05 30.9 1.6 50 21-71 300-353 (378)
33 PRK10939 autoinducer-2 (AI-2) 65.6 17 0.00036 28.5 5.1 47 20-72 412-458 (520)
34 PTZ00294 glycerol kinase-like 65.1 15 0.00032 28.6 4.7 47 20-72 409-455 (504)
35 PF01764 Lipase_3: Lipase (cla 63.1 29 0.00064 21.5 5.1 41 33-77 45-88 (140)
36 TIGR02628 fuculo_kin_coli L-fu 60.8 18 0.00039 27.9 4.4 47 20-72 396-442 (465)
37 PF06935 DUF1284: Protein of u 59.6 18 0.00038 22.9 3.6 29 30-58 2-32 (103)
38 PRK05571 ribose-5-phosphate is 59.5 11 0.00024 25.6 2.7 35 30-68 42-76 (148)
39 TIGR02065 ECX1 archaeal exosom 59.4 35 0.00077 23.9 5.5 38 34-71 93-138 (230)
40 KOG0680|consensus 57.0 14 0.00031 28.9 3.3 32 21-54 322-354 (400)
41 smart00268 ACTIN Actin. ACTIN 55.2 6.4 0.00014 29.1 1.1 50 21-71 295-348 (373)
42 smart00485 XPGN Xeroderma pigm 54.4 13 0.00028 22.6 2.2 44 18-62 24-82 (99)
43 TIGR01118 lacA galactose-6-pho 53.8 14 0.00031 25.0 2.5 36 29-68 38-73 (141)
44 PRK12615 galactose-6-phosphate 53.6 16 0.00035 25.5 2.8 35 30-68 41-75 (171)
45 TIGR01120 rpiB ribose 5-phosph 52.7 24 0.00052 23.9 3.5 36 29-68 39-74 (143)
46 PRK09273 hypothetical protein; 51.6 19 0.00042 26.0 3.1 35 30-68 47-81 (211)
47 PRK03983 exosome complex exonu 51.2 59 0.0013 23.0 5.5 38 34-71 99-144 (244)
48 PRK12613 galactose-6-phosphate 51.1 16 0.00035 24.7 2.5 36 29-68 37-72 (141)
49 PTZ00280 Actin-related protein 50.2 15 0.00032 28.0 2.4 42 30-71 324-385 (414)
50 PF00538 Linker_histone: linke 49.4 14 0.0003 21.8 1.8 16 31-46 2-17 (77)
51 KOG2517|consensus 49.4 25 0.00054 28.6 3.6 50 19-74 416-465 (516)
52 PRK08622 galactose-6-phosphate 49.0 21 0.00045 25.0 2.8 35 30-68 41-75 (171)
53 TIGR00689 rpiB_lacA_lacB sugar 48.7 29 0.00063 23.5 3.4 35 30-68 39-73 (144)
54 TIGR02886 spore_II_AA anti-sig 48.7 58 0.0012 19.5 5.4 49 18-67 9-62 (106)
55 TIGR00071 hisT_truA pseudourid 48.7 46 0.001 23.6 4.6 30 18-47 4-39 (227)
56 PRK08621 galactose-6-phosphate 47.9 23 0.00049 24.0 2.8 35 30-68 39-73 (142)
57 smart00526 H15 Domain in histo 47.0 17 0.00037 20.7 1.8 17 30-46 3-19 (66)
58 TIGR02627 rhamnulo_kin rhamnul 44.5 36 0.00078 26.1 3.8 46 20-72 390-435 (454)
59 PRK01158 phosphoglycolate phos 43.3 20 0.00044 24.2 2.0 35 21-56 8-42 (230)
60 PRK13317 pantothenate kinase; 42.7 65 0.0014 23.7 4.8 23 20-42 225-247 (277)
61 PRK14588 tRNA pseudouridine sy 42.5 61 0.0013 23.8 4.6 30 18-47 5-40 (272)
62 PF02017 CIDE-N: CIDE-N domain 42.3 52 0.0011 20.2 3.5 30 33-62 22-52 (78)
63 cd02643 R3H_NF-X1 R3H domain o 41.7 16 0.00036 21.7 1.2 20 27-46 3-22 (74)
64 cd00073 H15 linker histone 1 a 41.7 22 0.00047 21.6 1.8 17 30-46 3-19 (88)
65 PRK05659 sulfur carrier protei 41.1 8.9 0.00019 21.7 0.0 48 5-59 16-65 (66)
66 cd06535 CIDE_N_CAD CIDE_N doma 41.0 58 0.0013 20.0 3.6 30 33-62 22-52 (77)
67 PLN02669 xylulokinase 40.7 1E+02 0.0022 24.8 5.8 47 20-73 449-495 (556)
68 cd00741 Lipase Lipase. Lipase 40.1 52 0.0011 21.0 3.6 38 35-76 11-51 (153)
69 TIGR01119 lacB galactose-6-pho 39.9 48 0.0011 23.1 3.5 35 30-68 41-75 (171)
70 cd01615 CIDE_N CIDE_N domain, 39.7 64 0.0014 19.8 3.7 30 33-62 22-52 (78)
71 PTZ00215 ribose 5-phosphate is 39.6 48 0.001 22.6 3.4 36 30-69 45-80 (151)
72 PLN02423 phosphomannomutase 39.6 26 0.00056 24.8 2.2 37 21-59 12-48 (245)
73 COG0603 Predicted PP-loop supe 39.5 33 0.00071 25.0 2.7 38 9-46 109-146 (222)
74 PRK09557 fructokinase; Reviewe 39.4 1E+02 0.0022 22.0 5.3 60 5-67 234-298 (301)
75 PRK12434 tRNA pseudouridine sy 38.9 80 0.0017 22.7 4.7 30 18-47 5-41 (245)
76 PRK10640 rhaB rhamnulokinase; 38.1 54 0.0012 25.5 3.9 46 20-72 378-423 (471)
77 TIGR02133 RPI_actino ribose 5- 38.0 19 0.0004 24.5 1.2 35 30-68 42-76 (148)
78 COG4892 Predicted heme/steroid 37.9 19 0.00041 22.3 1.1 16 18-33 18-33 (81)
79 PF07318 DUF1464: Protein of u 37.8 43 0.00093 25.9 3.2 47 21-69 264-314 (343)
80 PF01321 Creatinase_N: Creatin 37.7 61 0.0013 19.6 3.5 22 6-27 80-101 (132)
81 smart00266 CAD Domains present 37.2 74 0.0016 19.4 3.6 30 33-62 20-50 (74)
82 COG0698 RpiB Ribose 5-phosphat 37.1 36 0.00078 23.4 2.5 36 29-68 41-76 (151)
83 PF06519 TolA: TolA C-terminal 37.0 56 0.0012 20.5 3.2 28 33-60 15-50 (96)
84 TIGR01487 SPP-like sucrose-pho 36.3 31 0.00068 23.3 2.1 36 21-57 6-41 (215)
85 PF00535 Glycos_transf_2: Glyc 36.1 98 0.0021 18.5 6.4 59 7-72 17-76 (169)
86 TIGR01548 HAD-SF-IA-hyp1 haloa 35.6 34 0.00073 22.8 2.2 25 22-46 6-30 (197)
87 TIGR01966 RNasePH ribonuclease 35.5 1.5E+02 0.0032 20.9 5.5 39 34-72 89-135 (236)
88 KOG1069|consensus 35.3 56 0.0012 23.8 3.3 50 19-71 59-116 (217)
89 PRK14494 putative molybdopteri 34.8 90 0.0019 22.5 4.4 38 18-55 183-227 (229)
90 PTZ00174 phosphomannomutase; P 34.4 32 0.00069 24.2 2.0 41 19-60 8-48 (247)
91 PF02502 LacAB_rpiB: Ribose/Ga 33.6 48 0.001 22.2 2.6 46 20-69 30-75 (140)
92 cd02570 PseudoU_synth_EcTruA P 33.1 1E+02 0.0022 21.8 4.4 29 19-47 1-35 (239)
93 PRK00173 rph ribonuclease PH; 32.8 1.7E+02 0.0036 20.7 5.5 40 34-73 90-137 (238)
94 cd01989 STK_N The N-terminal d 32.5 1.2E+02 0.0027 18.7 4.5 35 19-58 2-37 (146)
95 cd06539 CIDE_N_A CIDE_N domain 32.1 99 0.0021 19.0 3.6 30 33-62 22-52 (78)
96 cd04195 GT2_AmsE_like GT2_AmsE 31.8 1.4E+02 0.0031 19.1 5.8 59 6-71 18-77 (201)
97 TIGR02261 benz_CoA_red_D benzo 31.6 70 0.0015 23.7 3.4 35 8-43 202-236 (262)
98 TIGR01482 SPP-subfamily Sucros 31.4 40 0.00086 22.6 2.0 33 23-56 5-37 (225)
99 PRK00021 truA tRNA pseudouridi 31.1 1.2E+02 0.0027 21.5 4.6 30 18-47 5-40 (244)
100 PHA00649 hypothetical protein 30.9 44 0.00095 20.6 1.9 15 60-74 66-80 (83)
101 PRK03011 butyrate kinase; Prov 30.4 2.5E+02 0.0054 21.5 6.6 57 4-64 282-338 (358)
102 TIGR01484 HAD-SF-IIB HAD-super 29.9 46 0.001 22.1 2.1 38 21-59 4-42 (204)
103 PRK10018 putative glycosyl tra 29.7 2.2E+02 0.0048 20.6 6.9 59 6-71 23-82 (279)
104 cd00012 ACTIN Actin; An ubiqui 29.7 35 0.00076 25.2 1.6 50 21-71 293-348 (371)
105 cd02569 PseudoU_synth_ScPus3 P 29.6 1.3E+02 0.0029 21.8 4.6 28 19-46 1-35 (256)
106 TIGR02364 dha_pts dihydroxyace 29.4 1.6E+02 0.0036 19.1 5.8 58 4-62 13-72 (125)
107 PF12539 Csm1: Chromosome segr 29.3 40 0.00088 21.2 1.6 36 19-54 43-87 (90)
108 cd06844 STAS Sulphate Transpor 29.0 1.3E+02 0.0027 17.9 3.8 49 18-67 9-62 (100)
109 PLN02408 phospholipase A1 28.7 1.5E+02 0.0033 23.0 4.9 39 34-74 180-221 (365)
110 PF12301 CD99L2: CD99 antigen 28.7 53 0.0012 22.9 2.3 21 60-80 122-142 (169)
111 TIGR01549 HAD-SF-IA-v1 haloaci 28.7 36 0.00078 21.5 1.3 23 23-45 6-28 (154)
112 PRK10530 pyridoxal phosphate ( 28.1 53 0.0011 22.7 2.2 35 21-56 8-42 (272)
113 TIGR03351 PhnX-like phosphonat 27.3 55 0.0012 22.0 2.1 27 20-46 5-31 (220)
114 PLN02698 Probable pectinestera 27.1 43 0.00094 26.9 1.8 37 33-69 194-236 (497)
115 PLN02488 probable pectinestera 27.1 21 0.00046 28.9 0.1 38 33-70 173-220 (509)
116 COG3195 Uncharacterized protei 26.8 24 0.00052 24.9 0.3 14 24-37 23-36 (176)
117 TIGR00336 pyrE orotate phospho 26.5 1.3E+02 0.0029 20.1 3.9 44 19-62 22-66 (173)
118 PF13180 PDZ_2: PDZ domain; PD 26.3 1.1E+02 0.0025 17.4 3.2 40 17-60 35-74 (82)
119 PRK10976 putative hydrolase; P 26.2 53 0.0011 22.9 1.9 26 21-46 7-32 (266)
120 COG0101 TruA Pseudouridylate s 26.2 97 0.0021 22.9 3.4 29 18-46 5-39 (266)
121 PRK06437 hypothetical protein; 26.1 1.1E+02 0.0023 17.7 2.9 43 5-59 21-66 (67)
122 COG0561 Cof Predicted hydrolas 25.9 55 0.0012 22.8 2.0 41 19-60 6-46 (264)
123 PRK10725 fructose-1-P/6-phosph 25.8 42 0.00091 21.8 1.3 27 19-45 8-34 (188)
124 cd07211 Pat_PNPLA8 Patatin-lik 25.5 94 0.002 22.6 3.2 30 21-52 190-219 (308)
125 PRK14586 tRNA pseudouridine sy 25.2 1.9E+02 0.004 20.8 4.6 30 18-47 5-40 (245)
126 PF00733 Asn_synthase: Asparag 24.5 2.2E+02 0.0049 19.0 5.2 38 33-73 1-39 (255)
127 TIGR02629 L_rham_iso_rhiz L-rh 24.0 1.4E+02 0.003 23.8 4.0 36 10-46 158-198 (412)
128 COG1017 Hmp Hemoglobin-like fl 23.8 61 0.0013 22.4 1.8 37 9-48 76-112 (150)
129 PF01138 RNase_PH: 3' exoribon 23.7 1.9E+02 0.0041 17.9 5.9 39 34-72 78-125 (132)
130 PF09048 Cro: Cro; InterPro: 23.7 1.1E+02 0.0024 18.0 2.6 22 40-61 27-49 (59)
131 PRK13835 conjugal transfer pro 23.7 2.3E+02 0.005 19.4 4.6 30 41-70 49-78 (145)
132 PF11889 DUF3409: Domain of un 23.5 63 0.0014 18.8 1.5 14 16-29 35-48 (56)
133 PRK08053 sulfur carrier protei 23.5 78 0.0017 17.9 2.0 48 5-59 16-65 (66)
134 TIGR01454 AHBA_synth_RP 3-amin 23.5 51 0.0011 22.0 1.4 24 23-46 5-28 (205)
135 TIGR02252 DREG-2 REG-2-like, H 23.1 87 0.0019 20.7 2.5 26 21-46 5-30 (203)
136 PF09376 NurA: NurA domain; I 22.9 44 0.00096 22.3 1.0 8 58-65 89-96 (247)
137 KOG0369|consensus 22.9 1.9E+02 0.0041 25.4 4.7 41 31-72 745-785 (1176)
138 PF10923 DUF2791: P-loop Domai 22.7 1.4E+02 0.0029 23.6 3.8 48 9-64 16-64 (416)
139 PRK00767 transcriptional regul 22.6 40 0.00086 22.0 0.7 36 5-46 30-65 (197)
140 PRK07440 hypothetical protein; 22.5 80 0.0017 18.4 1.9 48 5-59 20-69 (70)
141 PRK10513 sugar phosphate phosp 22.4 70 0.0015 22.2 2.0 35 21-56 8-42 (270)
142 PF06523 DUF1106: Protein of u 22.4 1.2E+02 0.0025 18.9 2.7 21 35-55 67-87 (91)
143 PRK10073 putative glycosyl tra 22.3 1.1E+02 0.0025 22.5 3.2 29 36-64 19-49 (328)
144 PF08758 Cadherin_pro: Cadheri 22.3 42 0.00092 20.8 0.7 16 18-33 47-62 (90)
145 TIGR01993 Pyr-5-nucltdase pyri 22.2 64 0.0014 21.1 1.7 24 21-44 5-28 (184)
146 TIGR02259 benz_CoA_red_A benzo 22.1 87 0.0019 25.1 2.6 47 18-67 382-431 (432)
147 PRK14089 ipid-A-disaccharide s 22.1 2E+02 0.0043 21.9 4.4 33 16-54 76-108 (347)
148 cd00565 ThiS ThiaminS ubiquiti 22.1 83 0.0018 17.6 1.9 26 5-30 15-42 (65)
149 COG0554 GlpK Glycerol kinase [ 22.0 2.8E+02 0.006 22.7 5.4 49 19-73 404-452 (499)
150 cd00497 PseudoU_synth_TruA_lik 21.9 1.6E+02 0.0035 20.8 3.8 27 21-47 1-33 (224)
151 smart00489 DEXDc3 DEAD-like he 21.9 3.1E+02 0.0067 20.0 5.3 42 28-70 7-48 (289)
152 smart00488 DEXDc2 DEAD-like he 21.9 3.1E+02 0.0067 20.0 5.3 42 28-70 7-48 (289)
153 PLN02990 Probable pectinestera 21.8 50 0.0011 27.0 1.2 38 32-69 237-281 (572)
154 COG0253 DapF Diaminopimelate e 21.8 36 0.00078 25.4 0.4 46 14-59 153-208 (272)
155 cd06538 CIDE_N_FSP27 CIDE_N do 21.6 1.9E+02 0.0042 17.8 3.6 30 33-62 22-51 (79)
156 KOG3935|consensus 21.5 69 0.0015 25.3 1.9 33 30-64 309-346 (446)
157 PF02035 Coagulin: Coagulin; 21.5 42 0.00092 23.1 0.6 15 26-40 47-61 (174)
158 TIGR00368 Mg chelatase-related 21.4 1.8E+02 0.0039 23.3 4.2 39 35-73 41-87 (499)
159 PF13186 SPASM: Iron-sulfur cl 21.4 65 0.0014 17.1 1.3 14 17-30 6-19 (64)
160 COG2605 Predicted kinase relat 21.2 1.5E+02 0.0033 22.9 3.6 38 28-65 67-107 (333)
161 TIGR00241 CoA_E_activ CoA-subs 21.2 1.3E+02 0.0029 21.1 3.2 22 20-42 206-227 (248)
162 PF00173 Cyt-b5: Cytochrome b5 21.2 71 0.0015 17.9 1.5 16 15-30 12-27 (76)
163 TIGR01990 bPGM beta-phosphoglu 21.1 63 0.0014 20.8 1.4 24 22-45 5-28 (185)
164 PRK09417 mogA molybdenum cofac 21.1 80 0.0017 22.1 2.0 37 28-64 44-80 (193)
165 cd04196 GT_2_like_d Subfamily 21.1 1.4E+02 0.0031 19.1 3.2 19 51-69 56-74 (214)
166 cd06059 Tubulin The tubulin su 21.0 1.5E+02 0.0033 22.4 3.6 40 31-70 68-114 (382)
167 PF08282 Hydrolase_3: haloacid 20.9 61 0.0013 21.4 1.4 24 23-46 5-28 (254)
168 PF00091 Tubulin: Tubulin/FtsZ 20.7 2.9E+02 0.0064 19.0 5.0 44 34-77 106-152 (216)
169 TIGR02009 PGMB-YQAB-SF beta-ph 20.6 63 0.0014 20.8 1.3 26 21-46 6-31 (185)
170 COG3630 OadG Na+-transporting 20.4 1.2E+02 0.0026 19.0 2.5 23 58-80 61-83 (84)
171 PF10278 Med19: Mediator of RN 20.3 26 0.00056 24.8 -0.6 31 36-66 37-69 (178)
172 cd02188 gamma_tubulin Gamma-tu 20.2 1.7E+02 0.0037 22.9 3.9 46 28-73 106-158 (431)
173 PTZ00260 dolichyl-phosphate be 20.1 3.7E+02 0.008 19.9 6.2 51 18-72 109-160 (333)
174 COG3543 Uncharacterized conser 20.1 1.7E+02 0.0038 19.8 3.4 31 28-58 19-49 (135)
No 1
>PF03727 Hexokinase_2: Hexokinase; InterPro: IPR022673 Hexokinase is an important enzyme that catalyses the ATP-dependent conversion of aldo- and keto-hexose sugars to the hexose-6-phosphate (H6P). The enzyme can catalyse this reaction on glucose, fructose, sorbitol and glucosamine, and as such is the first step in a number of metabolic pathways []. The addition of a phosphate group to the sugar acts to trap it in a cell, since the negatively charged phosphate cannot easily traverse the plasma membrane. The enzyme is widely distributed in eukaryotes. There are three isozymes of hexokinase in yeast (PI, PII and glucokinase): isozymes PI and PII phosphorylate both aldo- and keto-sugars; glucokinase is specific for aldo-hexoses. All three isozymes contain two domains []. Structural studies of yeast hexokinase reveal a well-defined catalytic pocket that binds ATP and hexose, allowing easy transfer of the phosphate from ATP to the sugar []. Vertebrates contain four hexokinase isozymes, designated I to IV, where types I to III contain a duplication of the two-domain yeast-type hexokinases. Both the N- and C-terminal halves bind hexose and H6P, though in types I an III only the C-terminal half supports catalysis, while both halves support catalysis in type II. The N-terminal half is the regulatory region. Type IV hexokinase is similar to the yeast enzyme in containing only the two domains, and is sometimes incorrectly referred to as glucokinase. The different vertebrate isozymes differ in their catalysis, localisation and regulation, thereby contributing to the different patterns of glucose metabolism in different tissues []. Whereas types I to III can phosphorylate a variety of hexose sugars and are inhibited by glucose-6-phosphate (G6P), type IV is specific for glucose and shows no G6P inhibition. Type I enzyme may have a catabolic function, producing H6P for energy production in glycolysis; it is bound to the mitochondrial membrane, which enables the coordination of glycolysis with the TCA cycle. Types II and III enzyme may have anabolic functions, providing H6P for glycogen or lipid synthesis. Type IV enzyme is found in the liver and pancreatic beta-cells, where it is controlled by insulin (activation) and glucagon (inhibition). In pancreatic beta-cells, type IV enzyme acts as a glucose sensor to modify insulin secretion. Mutations in type IV hexokinase have been associated with diabetes mellitus. Hexokinase (2.7.1.1 from EC), a fructose and glucose phosphorylating enzyme, contains two structurally similar domains represented by this family and PF00349 from PFAM. Some members of the family have two copies of each of these domains. This entry represents the more C-terminal domain.; GO: 0005524 ATP binding, 0016773 phosphotransferase activity, alcohol group as acceptor, 0005975 carbohydrate metabolic process; PDB: 4DHY_A 3ID8_A 4DCH_A 3FGU_A 3QIC_A 3A0I_X 3VEY_A 3IDH_A 3VEV_A 3VF6_A ....
Probab=99.91 E-value=9.3e-25 Score=156.31 Aligned_cols=73 Identities=37% Similarity=0.640 Sum_probs=66.3
Q ss_pred Ceehhhhhhhhhhhc----CC--ceEEeEeCceeeeCccHHHHHHHHHHhhCCC--CceEEEEeCCcchHHHHHHHHHHh
Q psy15537 1 MTTAQGTAVLVRRID----RD--DITIAVDGSLYKYHPRLKHWLQKYIQLLVPN--KTFRLLLAEDGSGKGAGLASAIAL 72 (80)
Q Consensus 1 ~L~aa~iaai~~~~~----~~--~~~IavDGsvye~~p~f~~~l~~~l~~l~~~--~~v~~~~a~DGSg~GAAl~aA~a~ 72 (80)
+|+|++|+||++|++ .+ +++||||||+|+|||.|+++|++++++|+++ ++|+|.+++||||+|||++||++.
T Consensus 163 ~L~Aa~iaail~~~~~~~~~~~~~v~VavDGSv~~~~p~f~~~l~~~l~~L~~~~~~~v~~~~~~dgsg~GAAi~AA~a~ 242 (243)
T PF03727_consen 163 RLVAAAIAAILNKIRENKGRPRREVTVAVDGSVYEKYPNFRERLQEALDELLPEEGCKVEFVLSEDGSGVGAAIAAAVAC 242 (243)
T ss_dssp HHHHHHHHHHHHHHHHHCTCSSEEEEEEEESHHHHHSTTHHHHHHHHHHHHSTT-CEEEEEEE-SSTHHHHHHHHHHHHH
T ss_pred HHHHHHHHHHHHhhhccccccCCceEEEEeCcceeeCHHHHHHHHHHHHHhcccccceEEEEEecCchHHHHHHHHHHhc
Confidence 489999999999953 23 6999999999999999999999999999976 799999999999999999999987
Q ss_pred H
Q psy15537 73 K 73 (80)
Q Consensus 73 ~ 73 (80)
|
T Consensus 243 r 243 (243)
T PF03727_consen 243 R 243 (243)
T ss_dssp H
T ss_pred C
Confidence 5
No 2
>PLN02596 hexokinase-like
Probab=99.90 E-value=5.1e-24 Score=166.14 Aligned_cols=74 Identities=18% Similarity=0.328 Sum_probs=67.6
Q ss_pred CeehhhhhhhhhhhcC---CceEEeEeCceeeeCccHHHHHHHHHHhhCCC---CceEEEEeCCcchHHHHHHHHHHhHH
Q psy15537 1 MTTAQGTAVLVRRIDR---DDITIAVDGSLYKYHPRLKHWLQKYIQLLVPN---KTFRLLLAEDGSGKGAGLASAIALKL 74 (80)
Q Consensus 1 ~L~aa~iaai~~~~~~---~~~~IavDGsvye~~p~f~~~l~~~l~~l~~~---~~v~~~~a~DGSg~GAAl~aA~a~~~ 74 (80)
+|+||+|+||++|+++ .+++||||||||||||.|+++|++++++|+++ .+|++.+++||||+||||+||+.+..
T Consensus 408 rL~Aa~iaail~k~g~~~~~~~~VavDGSvye~~p~f~~~l~~al~ellg~~~~~~i~~~~s~DGSG~GAAl~AA~~~~~ 487 (490)
T PLN02596 408 RLAGAGIVGIIKKLGRIENKKSVVTVEGGLYEHYRVFRNYLHSSVWEMLGSELSDNVVIEHSHGGSGAGALFLAACQTGE 487 (490)
T ss_pred HHHHHHHHHHHHhcCCCCCCceEEEEeCcceeeCcCHHHHHHHHHHHHhCcccCCcEEEEEccCchhHHHHHHHHhhccC
Confidence 5899999999999985 36999999999999999999999999999942 68999999999999999999997653
No 3
>PLN02405 hexokinase
Probab=99.90 E-value=5.9e-24 Score=165.98 Aligned_cols=74 Identities=23% Similarity=0.387 Sum_probs=68.3
Q ss_pred CeehhhhhhhhhhhcC--------CceEEeEeCceeeeCccHHHHHHHHHHhhCCC---CceEEEEeCCcchHHHHHHHH
Q psy15537 1 MTTAQGTAVLVRRIDR--------DDITIAVDGSLYKYHPRLKHWLQKYIQLLVPN---KTFRLLLAEDGSGKGAGLASA 69 (80)
Q Consensus 1 ~L~aa~iaai~~~~~~--------~~~~IavDGsvye~~p~f~~~l~~~l~~l~~~---~~v~~~~a~DGSg~GAAl~aA 69 (80)
||+||+|+||++|+++ .+++||||||||+|||.|+++|++++++|+++ ++|++.+++||||+|||++||
T Consensus 408 rL~Aa~iaail~k~~~~~~~~~~~~~~~VavDGsvye~yp~f~~~~~~~l~ell~~~~~~~v~l~~a~DGSGvGAAl~AA 487 (497)
T PLN02405 408 RLSAAGIYGILKKLGRDTVKDGEKQKSVIAMDGGLFEHYTEFSKCMESTLKELLGEEVSESIEVEHSNDGSGIGAALLAA 487 (497)
T ss_pred HHHHHHHHHHHHHhCccccccCCCcceEEEEeCchhhcCcCHHHHHHHHHHHHhCcccCceEEEEEecCchHHHHHHHHH
Confidence 5899999999999997 25899999999999999999999999999943 689999999999999999999
Q ss_pred HHhHH
Q psy15537 70 IALKL 74 (80)
Q Consensus 70 ~a~~~ 74 (80)
++++.
T Consensus 488 ~~~~~ 492 (497)
T PLN02405 488 SHSLY 492 (497)
T ss_pred HHhhh
Confidence 98764
No 4
>PLN02362 hexokinase
Probab=99.89 E-value=1.4e-23 Score=164.20 Aligned_cols=76 Identities=25% Similarity=0.415 Sum_probs=68.5
Q ss_pred CeehhhhhhhhhhhcC------------------CceEEeEeCceeeeCccHHHHHHHHHHhhCCC---CceEEEEeCCc
Q psy15537 1 MTTAQGTAVLVRRIDR------------------DDITIAVDGSLYKYHPRLKHWLQKYIQLLVPN---KTFRLLLAEDG 59 (80)
Q Consensus 1 ~L~aa~iaai~~~~~~------------------~~~~IavDGsvye~~p~f~~~l~~~l~~l~~~---~~v~~~~a~DG 59 (80)
||+||+|+||++|+++ ++++||||||||||||.|+++|++++++|+++ .+|++.+++||
T Consensus 407 rL~Aa~iaail~k~~~~~~~~~~~~~~~~~~~~~~~~~VavDGsvye~yp~f~~~~~~~l~ell~~~~~~~v~i~~a~DG 486 (509)
T PLN02362 407 RLAAAGIVGILKKIGRDGSGGITSGRSRSDIQIMRRTVVAVEGGLYTNYTMFREYLHEALNEILGEDVAQHVILKATEDG 486 (509)
T ss_pred HHHHHHHHHHHHHhcccccccccccccccccCCCceEEEEEeCchhhcCcCHHHHHHHHHHHHhCcccCceEEEEEccCc
Confidence 5899999999999982 25899999999999999999999999999943 58999999999
Q ss_pred chHHHHHHHHHHhHHhh
Q psy15537 60 SGKGAGLASAIALKLGA 76 (80)
Q Consensus 60 Sg~GAAl~aA~a~~~~~ 76 (80)
||+|||++||++.+...
T Consensus 487 SgvGAAl~AA~~~~~~~ 503 (509)
T PLN02362 487 SGIGSALLAASYSSYSV 503 (509)
T ss_pred hHHHHHHHHHHHHhhhh
Confidence 99999999999877543
No 5
>PLN02914 hexokinase
Probab=99.89 E-value=1.2e-23 Score=164.00 Aligned_cols=73 Identities=27% Similarity=0.499 Sum_probs=67.1
Q ss_pred CeehhhhhhhhhhhcCC--------ceEEeEeCceeeeCccHHHHHHHHHHhhCC---CCceEEEEeCCcchHHHHHHHH
Q psy15537 1 MTTAQGTAVLVRRIDRD--------DITIAVDGSLYKYHPRLKHWLQKYIQLLVP---NKTFRLLLAEDGSGKGAGLASA 69 (80)
Q Consensus 1 ~L~aa~iaai~~~~~~~--------~~~IavDGsvye~~p~f~~~l~~~l~~l~~---~~~v~~~~a~DGSg~GAAl~aA 69 (80)
||+||+|+||++|+++. +++||||||||+|||.|+++|++++++|++ .++|++.+++||||+|||++||
T Consensus 406 rL~Aa~iaail~k~~~~~~~~~~~~~~~VavDGSv~~~~p~f~~~l~~~l~ellg~~~~~~i~i~~a~DGSGvGAAl~AA 485 (490)
T PLN02914 406 RLAGAGIVGILEKMEEDSKGMIFGKRTVVAMDGGLYEKYPQYRRYMQDAVTELLGLELSKNIAIEHTKDGSGIGAALLAA 485 (490)
T ss_pred HHHHHHHHHHHHHhcccccccCCCceEEEEEeCchhhcCccHHHHHHHHHHHHhCcccCCcEEEEEccCchHHHHHHHHH
Confidence 58999999999999873 599999999999999999999999999993 2589999999999999999999
Q ss_pred HHhH
Q psy15537 70 IALK 73 (80)
Q Consensus 70 ~a~~ 73 (80)
+++.
T Consensus 486 ~~s~ 489 (490)
T PLN02914 486 TNSK 489 (490)
T ss_pred Hhhc
Confidence 8763
No 6
>PTZ00107 hexokinase; Provisional
Probab=99.89 E-value=1.5e-23 Score=162.45 Aligned_cols=74 Identities=32% Similarity=0.537 Sum_probs=69.1
Q ss_pred CeehhhhhhhhhhhcC--CceEEeEeCceeeeCccHHHHHHHHHHhhC-CC-CceEEEEeCCcchHHHHHHHHHHhHH
Q psy15537 1 MTTAQGTAVLVRRIDR--DDITIAVDGSLYKYHPRLKHWLQKYIQLLV-PN-KTFRLLLAEDGSGKGAGLASAIALKL 74 (80)
Q Consensus 1 ~L~aa~iaai~~~~~~--~~~~IavDGsvye~~p~f~~~l~~~l~~l~-~~-~~v~~~~a~DGSg~GAAl~aA~a~~~ 74 (80)
+|+||+|+||++|++. .+++||||||+|+|||.|+++|++++++|+ |+ ++|+|.+++||||+|||++||++.+.
T Consensus 386 ~L~Aa~iaail~k~~~~~~~~~VgvDGSv~~~~p~f~~~~~~~l~~ll~~~~~~v~l~~a~DGSg~GAAl~AA~~~~~ 463 (464)
T PTZ00107 386 QLAAAFIAAPAKKTRTVQGKATVAIDGSVYVKNPWFRRLLQEYINSILGPDAGNVVFYLADDGSGKGAAIIAAMVAND 463 (464)
T ss_pred HHHHHHHHHHHHHhCCCCCceEEEEeCcceecCccHHHHHHHHHHHHhCCCCCcEEEEEccCchHHHHHHHHHHhccC
Confidence 5899999999999998 679999999999999999999999999999 54 79999999999999999999998653
No 7
>KOG1369|consensus
Probab=99.88 E-value=3.9e-23 Score=160.70 Aligned_cols=76 Identities=38% Similarity=0.694 Sum_probs=70.9
Q ss_pred CeehhhhhhhhhhhcC---CceEEeEeCceeeeCccHHHHHHHHHHhhC-CCCceEEEEeCCcchHHHHHHHHHHhHHhh
Q psy15537 1 MTTAQGTAVLVRRIDR---DDITIAVDGSLYKYHPRLKHWLQKYIQLLV-PNKTFRLLLAEDGSGKGAGLASAIALKLGA 76 (80)
Q Consensus 1 ~L~aa~iaai~~~~~~---~~~~IavDGsvye~~p~f~~~l~~~l~~l~-~~~~v~~~~a~DGSg~GAAl~aA~a~~~~~ 76 (80)
||+||+|+||++|++. .+++||+|||||++||.|+++|+++++||+ +++.|.+.+++||||+||||+||++++.+.
T Consensus 393 ~L~aagIaail~k~~~~~~~~~~VgvdGsly~~yP~f~~~m~~~l~eLlg~~~~v~i~~s~dgSg~GAAL~Aav~~~~~~ 472 (474)
T KOG1369|consen 393 RLAAAGIAAILNKTGELSRKRVTVGVDGSLYKNHPFFREYLKEALRELLGPSIHVKLVLSEDGSGRGAALIAAVASRLKQ 472 (474)
T ss_pred HHHHhHHHHHHHHhcccccCceEEEeccchhHcCchHHHHHHHHHHHHhCCCceEEEEECCCCccccHHHHHHHHhhhhc
Confidence 5899999999999984 479999999999999999999999999999 678999999999999999999999988654
No 8
>COG5026 Hexokinase [Carbohydrate transport and metabolism]
Probab=99.60 E-value=9.2e-16 Score=118.87 Aligned_cols=76 Identities=28% Similarity=0.383 Sum_probs=69.5
Q ss_pred Ceehhhhhhhhhhhc-CCceEEeEeCceeeeCccHHHHHHHHHHhhCC--CCceEEEEeCCcchHHHHHHHHHHhHHhh
Q psy15537 1 MTTAQGTAVLVRRID-RDDITIAVDGSLYKYHPRLKHWLQKYIQLLVP--NKTFRLLLAEDGSGKGAGLASAIALKLGA 76 (80)
Q Consensus 1 ~L~aa~iaai~~~~~-~~~~~IavDGsvye~~p~f~~~l~~~l~~l~~--~~~v~~~~a~DGSg~GAAl~aA~a~~~~~ 76 (80)
||+|+.|+||+.+++ .+...|+.|||+|++||+|++++.+++++++. ..+|++..++||||+|||++++++.+.++
T Consensus 386 rlaa~~iaAi~~ktG~~k~~~v~~dGsv~e~yp~f~~~~~~~l~~~~g~~~~~i~i~~a~dgsglGAAl~a~~~~k~~~ 464 (466)
T COG5026 386 RLAAVPIAAIVIKTGAYKAYHVGADGSVIERYPGFRSMLREALKALLGEEGEKIKIKPAEDGSGLGAALCALLAQKPKA 464 (466)
T ss_pred HHHhhhHHHhhhhcCCCccceeeeecchhhhchhHHHHHHHHHHHhhcccCceeeEEecccCcchHHHHHHHHhccccc
Confidence 578999999999999 66778999999999999999999999999995 46999999999999999999999877554
No 9
>PF02782 FGGY_C: FGGY family of carbohydrate kinases, C-terminal domain; InterPro: IPR018485 It has been shown [] that four different type of carbohydrate kinases seem to be evolutionary related. These enzymes include L-fucolokinase (2.7.1.51 from EC) (gene fucK); gluconokinase (2.7.1.12 from EC) (gene gntK); glycerol kinase (2.7.1.30 from EC) (gene glpK); xylulokinase (2.7.1.17 from EC) (gene xylB); and L-xylulose kinase (2.7.1.53 from EC) (gene lyxK). These enzymes are proteins of from 480 to 520 amino acid residues. This entry represents the C-terminal domain of these proteins. It adopts a ribonuclease H-like fold and is structurally related to the N-terminal domain [, ].; GO: 0016773 phosphotransferase activity, alcohol group as acceptor, 0005975 carbohydrate metabolic process; PDB: 4E1J_B 2W40_C 2W41_A 2UYT_A 2CGK_B 2CGL_A 2CGJ_A 3GBT_A 3LL3_B 3HZ6_A ....
Probab=95.84 E-value=0.031 Score=37.51 Aligned_cols=51 Identities=24% Similarity=0.191 Sum_probs=41.7
Q ss_pred cCCceEEeEeCceeeeCccHHHHHHHHHHhhCCCCceEEEEeCCcchHHHHHHHHHH
Q psy15537 15 DRDDITIAVDGSLYKYHPRLKHWLQKYIQLLVPNKTFRLLLAEDGSGKGAGLASAIA 71 (80)
Q Consensus 15 ~~~~~~IavDGsvye~~p~f~~~l~~~l~~l~~~~~v~~~~a~DGSg~GAAl~aA~a 71 (80)
+.+-..|-++|+.. ++|.+.+++-+.+ +.+|.....++++.+|||++|+++
T Consensus 147 ~~~~~~i~~~GG~~-~n~~~~q~~Advl-----~~~V~~~~~~e~~a~GaA~~A~~a 197 (198)
T PF02782_consen 147 GIPIRRIRVSGGGA-KNPLWMQILADVL-----GRPVVRPEVEEASALGAALLAAVA 197 (198)
T ss_dssp TSCESEEEEESGGG-GSHHHHHHHHHHH-----TSEEEEESSSTHHHHHHHHHHHHH
T ss_pred cccceeeEeccccc-cChHHHHHHHHHh-----CCceEeCCCCchHHHHHHHHHHhh
Confidence 33445688899998 8999999888777 357877777999999999999986
No 10
>PRK13310 N-acetyl-D-glucosamine kinase; Provisional
Probab=84.47 E-value=3.7 Score=29.55 Aligned_cols=62 Identities=19% Similarity=0.243 Sum_probs=40.1
Q ss_pred hhhhhhhhhhhcCCceEEeEeCceeeeCccHHHHHHHHHHhhC-CC-CceEEEEeCCc---chHHHHHHH
Q psy15537 4 AQGTAVLVRRIDRDDITIAVDGSLYKYHPRLKHWLQKYIQLLV-PN-KTFRLLLAEDG---SGKGAGLAS 68 (80)
Q Consensus 4 aa~iaai~~~~~~~~~~IavDGsvye~~p~f~~~l~~~l~~l~-~~-~~v~~~~a~DG---Sg~GAAl~a 68 (80)
+.+++.++.-++++. |-+.|.+.+ .|.|.+.+++.+++.. |. .++.+..++-| .-+|||..+
T Consensus 234 a~~l~n~~~~ldP~~--IvlgG~~~~-~~~~~~~l~~~~~~~~~~~~~~~~i~~s~~~~~a~~~GAa~~~ 300 (303)
T PRK13310 234 AICLGNILTIVDPHL--VVLGGGLSN-FDAIYEQLPKRLPRHLLPVARVPRIEKARHGDAGGVRGAAFLH 300 (303)
T ss_pred HHHHHHHHHHcCCCE--EEECCcccC-hHHHHHHHHHHHHHHhcccccCceEEEcccCchHHHHhHHHHh
Confidence 445566666666544 445788877 6888899999998755 32 35566666555 455666543
No 11
>PF01869 BcrAD_BadFG: BadF/BadG/BcrA/BcrD ATPase family; InterPro: IPR002731 This domain is found in the BadF (O07462 from SWISSPROT) and BadG (O07463 from SWISSPROT) proteins that are two subunits of Benzoyl-CoA reductase, that may be involved in ATP hydrolysis. The family also includes an activase subunit from the enzyme 2-hydroxyglutaryl-CoA dehydratase (P11568 from SWISSPROT). The hypothetical protein AQ_278 from Aquifex aeolicus O66634 from SWISSPROT contains two copies of this region suggesting that the family may structurally dimerise.; PDB: 2E2N_B 2E2Q_A 2E2P_B 2E2O_A 1ZBS_A 2CH6_A 2CH5_D 1ZC6_A 1HUX_A.
Probab=84.00 E-value=0.48 Score=33.69 Aligned_cols=61 Identities=16% Similarity=0.321 Sum_probs=39.2
Q ss_pred hhhhhhhhcCCceEEeEeCceeeeCccHHHHHHHHHHhhCCCC-ceEEEEeCCcchHHHHHHH
Q psy15537 7 TAVLVRRIDRDDITIAVDGSLYKYHPRLKHWLQKYIQLLVPNK-TFRLLLAEDGSGKGAGLAS 68 (80)
Q Consensus 7 iaai~~~~~~~~~~IavDGsvye~~p~f~~~l~~~l~~l~~~~-~v~~~~a~DGSg~GAAl~a 68 (80)
+.+++++....+..|..=||++++.| +.+.+++.|++-+++. ...+...+.-.-.|||+.|
T Consensus 210 i~~~~~~~~~~~~~v~l~GGv~~~~~-~~~~l~~~l~~~~~~~~~~~~~~~~~~~a~GAallA 271 (271)
T PF01869_consen 210 IKAVLKRLGPEKEPVVLSGGVFKNSP-LVKALRDALKEKLPKVPIIIPVEPQYDPAYGAALLA 271 (271)
T ss_dssp HHHHHHTCTCCCCSEEEESGGGGCHH-HHHHHGGGS-HHHHCCTCECECCGSSHHHHHHHHHH
T ss_pred HHHHHHhcCCCCCeEEEECCccCchH-HHHHHHHHHHHhcCCCceEECCCCCccHHHHHHHhC
Confidence 34555566554445999999998766 4455666666666543 3344566667778888875
No 12
>KOG1794|consensus
Probab=80.07 E-value=4.4 Score=31.13 Aligned_cols=66 Identities=12% Similarity=0.138 Sum_probs=46.2
Q ss_pred hhhhhhhhcCC-----ceEEeEeCceeeeCccHHHHHHHHHHhhCCCCceEEEEeCCcchHHHHHHHHHHh
Q psy15537 7 TAVLVRRIDRD-----DITIAVDGSLYKYHPRLKHWLQKYIQLLVPNKTFRLLLAEDGSGKGAGLASAIAL 72 (80)
Q Consensus 7 iaai~~~~~~~-----~~~IavDGsvye~~p~f~~~l~~~l~~l~~~~~v~~~~a~DGSg~GAAl~aA~a~ 72 (80)
+.++++++.+. ...|-+=|+||.-+...++-....++..-.-.++++..-++.|.+|||+.||-..
T Consensus 248 V~aVl~~l~~~~k~g~~l~Iv~vG~V~~Sw~~l~~Gfl~sls~~~~f~~~~l~~~k~ssAvgAA~laa~~~ 318 (336)
T KOG1794|consen 248 VVAVLPQLPPTLKKGKTLPIVCVGGVFDSWDLLQEGFLDSLSDTRGFERVELYRPKESSAVGAAILAASLD 318 (336)
T ss_pred HHHHHhhcCchhcccCcceEEEEcchhhHHHHHHHHHHHHhhcccCccceEEEeecccchHHHHHHhhhhc
Confidence 55677777652 3567788999988655555444444332123579999999999999999998643
No 13
>TIGR00744 ROK_glcA_fam ROK family protein (putative glucokinase). This alignment models one branch of the ROK superfamily of proteins. The three members of the seed alignment for this model all have experimental evidence for activity as glucokinase, but the set of related proteins is crowded with paralogs of different or unknown function. Proteins scoring above the trusted_cutoff will show strong similarity to at least one known glucokinase and may be designated as putative glucokinases. However, definitive identification of glucokinases should be done only with extreme caution.
Probab=78.93 E-value=8.6 Score=27.66 Aligned_cols=62 Identities=19% Similarity=0.162 Sum_probs=41.6
Q ss_pred hhhhhhhhhhcCCceEEeEeCceeeeCccHHHHHHHHHHhhC-C--CCceEEEEeCCc---chHHHHHHH
Q psy15537 5 QGTAVLVRRIDRDDITIAVDGSLYKYHPRLKHWLQKYIQLLV-P--NKTFRLLLAEDG---SGKGAGLAS 68 (80)
Q Consensus 5 a~iaai~~~~~~~~~~IavDGsvye~~p~f~~~l~~~l~~l~-~--~~~v~~~~a~DG---Sg~GAAl~a 68 (80)
.+++.++.-++++ .|-+.|++.+..|.|.+.+++.+++.. + ..++.+..++-| .-+|||..+
T Consensus 241 ~~i~~~~~~~dP~--~IvlgG~~~~~~~~~~~~i~~~~~~~~~~~~~~~~~i~~s~~~~~~~~~Gaa~~~ 308 (318)
T TIGR00744 241 AGLADLASLFNPS--AIVLGGGLSDAGDLLLDPIRKSYKRWLFGGARQVADIIAAQLGNDAGLVGAADLA 308 (318)
T ss_pred HHHHHHHHHhCCC--EEEECChhhhCcHHHHHHHHHHHHHHhhhcccCCcEEEEcccCCchhhHHHHHHH
Confidence 3455555555443 477899999989999999999998865 3 234555555544 456777553
No 14
>PRK04123 ribulokinase; Provisional
Probab=78.19 E-value=5.6 Score=31.29 Aligned_cols=49 Identities=16% Similarity=0.335 Sum_probs=38.2
Q ss_pred eEEeEeCceeeeCccHHHHHHHHHHhhCCCCceEEEEeCCcchHHHHHHHHHHh
Q psy15537 19 ITIAVDGSLYKYHPRLKHWLQKYIQLLVPNKTFRLLLAEDGSGKGAGLASAIAL 72 (80)
Q Consensus 19 ~~IavDGsvye~~p~f~~~l~~~l~~l~~~~~v~~~~a~DGSg~GAAl~aA~a~ 72 (80)
..|-+.|+..-++|.+.+++-+.+ +.+|.....++++.+|||++|+++.
T Consensus 440 ~~i~~~GGg~s~s~~w~Qi~ADv~-----g~pV~~~~~~e~~alGaA~lA~~~~ 488 (548)
T PRK04123 440 EEVIAAGGIARKNPVLMQIYADVL-----NRPIQVVASDQCPALGAAIFAAVAA 488 (548)
T ss_pred ceEEEeCCCcccCHHHHHHHHHhc-----CCceEecCccccchHHHHHHHHHHh
Confidence 457788988778888888776665 4567666678899999999999864
No 15
>TIGR01312 XylB D-xylulose kinase. D-xylulose kinase (XylB) generally is found with xylose isomerase (XylA) and acts in xylose utilization.
Probab=74.34 E-value=7.7 Score=29.56 Aligned_cols=48 Identities=23% Similarity=0.302 Sum_probs=36.7
Q ss_pred eEEeEeCceeeeCccHHHHHHHHHHhhCCCCceEEEEeCCcchHHHHHHHHHHh
Q psy15537 19 ITIAVDGSLYKYHPRLKHWLQKYIQLLVPNKTFRLLLAEDGSGKGAGLASAIAL 72 (80)
Q Consensus 19 ~~IavDGsvye~~p~f~~~l~~~l~~l~~~~~v~~~~a~DGSg~GAAl~aA~a~ 72 (80)
-.|-+.|+. -+++.+.+++-+.+ +.+|.....++++.+|||++|+.+.
T Consensus 392 ~~i~~~GG~-s~s~~~~Q~~Adv~-----g~pv~~~~~~e~~a~GaA~~a~~~~ 439 (481)
T TIGR01312 392 QSIRLIGGG-AKSPAWRQMLADIF-----GTPVDVPEGEEGPALGAAILAAWAL 439 (481)
T ss_pred ceEEEeccc-cCCHHHHHHHHHHh-----CCceeecCCCcchHHHHHHHHHHhc
Confidence 346777775 47888887777665 3467776788899999999999875
No 16
>PRK09698 D-allose kinase; Provisional
Probab=73.67 E-value=19 Score=25.79 Aligned_cols=63 Identities=22% Similarity=0.349 Sum_probs=42.0
Q ss_pred hhhhhhhhhhhcCCceEEeEeCceeeeCccHHHHHHHHHHhhC--C--CCceEEEEeCCc---chHHHHHHH
Q psy15537 4 AQGTAVLVRRIDRDDITIAVDGSLYKYHPRLKHWLQKYIQLLV--P--NKTFRLLLAEDG---SGKGAGLAS 68 (80)
Q Consensus 4 aa~iaai~~~~~~~~~~IavDGsvye~~p~f~~~l~~~l~~l~--~--~~~v~~~~a~DG---Sg~GAAl~a 68 (80)
+.+++.++.-++++ .|-+.|++.+..+.|.+.+.+.+++.+ | ..++.+..+.-| .-+|||..+
T Consensus 225 a~~l~~li~~ldP~--~IvlgG~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~i~~~~~~~~a~~~GAa~~~ 294 (302)
T PRK09698 225 ARAIATSINLFDPD--AIILGGGVMDMPAFPRETLIAMIQKYLRKPLPYEVVRFIYASSSDFNGAQGAAILA 294 (302)
T ss_pred HHHHHHHHHHhCCC--EEEEcCccccCchhHHHHHHHHHHHHccCccccCCcEEEECCcCCcccHHhHHHHH
Confidence 34455555555543 466789998888888899999998866 3 235666655544 456877664
No 17
>PLN02295 glycerol kinase
Probab=73.04 E-value=6.9 Score=30.62 Aligned_cols=47 Identities=17% Similarity=0.188 Sum_probs=36.5
Q ss_pred EEeEeCceeeeCccHHHHHHHHHHhhCCCCceEEEEeCCcchHHHHHHHHHHh
Q psy15537 20 TIAVDGSLYKYHPRLKHWLQKYIQLLVPNKTFRLLLAEDGSGKGAGLASAIAL 72 (80)
Q Consensus 20 ~IavDGsvye~~p~f~~~l~~~l~~l~~~~~v~~~~a~DGSg~GAAl~aA~a~ 72 (80)
.|-++|+. -++|.+.|++-+.+ +.+|.....++++.+|||++|+++.
T Consensus 415 ~i~~~GGg-a~s~~w~Qi~ADv~-----g~pV~~~~~~e~~alGaA~~A~~~~ 461 (512)
T PLN02295 415 LLRVDGGA-TANNLLMQIQADLL-----GSPVVRPADIETTALGAAYAAGLAV 461 (512)
T ss_pred eEEEeccc-hhCHHHHHHHHHhc-----CCceEecCccccHHHHHHHHHHhhc
Confidence 47777774 68888888777666 4567766778899999999998874
No 18
>PRK05082 N-acetylmannosamine kinase; Provisional
Probab=72.39 E-value=13 Score=26.48 Aligned_cols=61 Identities=23% Similarity=0.111 Sum_probs=38.3
Q ss_pred hhhhhhhhhhhcCCceEEeEeCceeeeCccHHHHHHHHHHhhCCCCceEEEEeCCc---chHHHHHH
Q psy15537 4 AQGTAVLVRRIDRDDITIAVDGSLYKYHPRLKHWLQKYIQLLVPNKTFRLLLAEDG---SGKGAGLA 67 (80)
Q Consensus 4 aa~iaai~~~~~~~~~~IavDGsvye~~p~f~~~l~~~l~~l~~~~~v~~~~a~DG---Sg~GAAl~ 67 (80)
+.+++.++.-++++. |-+.|++.. .+.|.+.+++.+++.....++.+..+..| +-+|||..
T Consensus 222 a~~l~~l~~~~dpe~--IvlgG~~~~-~~~~~~~i~~~l~~~~~~~~~~i~~s~~~~~~~~~GAa~~ 285 (291)
T PRK05082 222 ARLIADLKATLDCQC--VVLGGSVGL-AEGYLELVQAYLAQEPAIYHVPLLAAHYRHDAGLLGAALW 285 (291)
T ss_pred HHHHHHHHHHhCCCE--EEEcCcccc-HHHHHHHHHHHHHhcccccCCeEEECccCCchhhhhHHHH
Confidence 345555666666544 555788654 67788889999988632235666666655 45566654
No 19
>TIGR01314 gntK_FGGY gluconate kinase, FGGY type. Gluconate is derived from glucose in two steps. This model describes one form of gluconate kinase, belonging to the FGGY family of carbohydrate kinases. Gluconate kinase phosphoryates gluconate for entry into the Entner-Douderoff pathway.
Probab=72.15 E-value=13 Score=28.85 Aligned_cols=47 Identities=15% Similarity=0.165 Sum_probs=35.7
Q ss_pred EEeEeCceeeeCccHHHHHHHHHHhhCCCCceEEEEeCCcchHHHHHHHHHHh
Q psy15537 20 TIAVDGSLYKYHPRLKHWLQKYIQLLVPNKTFRLLLAEDGSGKGAGLASAIAL 72 (80)
Q Consensus 20 ~IavDGsvye~~p~f~~~l~~~l~~l~~~~~v~~~~a~DGSg~GAAl~aA~a~ 72 (80)
.|-+.|+. -++|.+.+++-+.+ +.+|.....++++.+|||++|+++.
T Consensus 404 ~i~~~GGg-a~s~~w~Qi~Adv~-----g~pv~~~~~~e~~a~GaA~la~~~~ 450 (505)
T TIGR01314 404 MIQATGGF-ASSEVWRQMMSDIF-----EQEIVVPESYESSCLGACILGLKAL 450 (505)
T ss_pred EEEEecCc-ccCHHHHHHHHHHc-----CCeeEecCCCCcchHHHHHHHHHhc
Confidence 35566664 47788888877666 4578777788899999999998864
No 20
>TIGR01234 L-ribulokinase L-ribulokinase. This enzyme catalyzes the second step in arabinose catabolism. The most closely related protein subfamily outside the scope of this model includes ribitol kinase from E. coli.
Probab=71.88 E-value=12 Score=29.38 Aligned_cols=49 Identities=14% Similarity=0.333 Sum_probs=36.6
Q ss_pred eEEeEeCceeeeCccHHHHHHHHHHhhCCCCceEEEEeCCcchHHHHHHHHHHh
Q psy15537 19 ITIAVDGSLYKYHPRLKHWLQKYIQLLVPNKTFRLLLAEDGSGKGAGLASAIAL 72 (80)
Q Consensus 19 ~~IavDGsvye~~p~f~~~l~~~l~~l~~~~~v~~~~a~DGSg~GAAl~aA~a~ 72 (80)
-.|-+.|+.--+++.+.+++-+.+ +.+|+.....+++.+|||++|+++.
T Consensus 437 ~~i~~~GGg~a~s~~w~Qi~Adv~-----g~pV~~~~~~e~~a~GaA~lA~~~~ 485 (536)
T TIGR01234 437 EELMAAGGIARKNPVIMQIYADVT-----NRPLQIVASDQAPALGAAIFAAVAA 485 (536)
T ss_pred ceEEEeCCccccCHHHHHHHHHhh-----CCeeEeccCCcchhHHHHHHHHHHc
Confidence 347788887556677777666555 4578777778899999999999875
No 21
>PTZ00281 actin; Provisional
Probab=70.57 E-value=2.3 Score=32.00 Aligned_cols=41 Identities=12% Similarity=0.346 Sum_probs=33.0
Q ss_pred eCccHHHHHHHHHHhhCCC-CceEEEEeCC---cchHHHHHHHHH
Q psy15537 30 YHPRLKHWLQKYIQLLVPN-KTFRLLLAED---GSGKGAGLASAI 70 (80)
Q Consensus 30 ~~p~f~~~l~~~l~~l~~~-~~v~~~~a~D---GSg~GAAl~aA~ 70 (80)
..|+|.+||++-|+.++|. .+|++....| .+.+|+++.|.+
T Consensus 306 ~~~Gf~~RL~~El~~~~p~~~~v~v~~~~~r~~~aW~Ggsilasl 350 (376)
T PTZ00281 306 MFPGIADRMNKELTALAPSTMKIKIIAPPERKYSVWIGGSILASL 350 (376)
T ss_pred cCcCHHHHHHHHHHHhCCCCcceEEecCCCCceeEEECcccccCc
Confidence 5689999999999999975 5677765544 678899988875
No 22
>PTZ00466 actin-like protein; Provisional
Probab=70.15 E-value=3 Score=31.68 Aligned_cols=41 Identities=15% Similarity=0.233 Sum_probs=32.6
Q ss_pred eCccHHHHHHHHHHhhCCC-CceEEEEeCC---cchHHHHHHHHH
Q psy15537 30 YHPRLKHWLQKYIQLLVPN-KTFRLLLAED---GSGKGAGLASAI 70 (80)
Q Consensus 30 ~~p~f~~~l~~~l~~l~~~-~~v~~~~a~D---GSg~GAAl~aA~ 70 (80)
..|+|.+||++-|++++|. .+|++....+ .+.+|+++.|.+
T Consensus 310 l~~Gf~~RL~~EL~~l~p~~~~v~v~~~~~r~~~aW~GgSilasl 354 (380)
T PTZ00466 310 MFHGFGDRLLNEIRKFAPKDITIRISAPPERKFSTFIGGSILASL 354 (380)
T ss_pred ccCCHHHHHHHHHHHhCCCCceEEEecCCCCceeEEECchhhcCc
Confidence 3789999999999999974 5677664443 488999998875
No 23
>PRK00047 glpK glycerol kinase; Provisional
Probab=68.81 E-value=11 Score=29.36 Aligned_cols=46 Identities=15% Similarity=0.231 Sum_probs=35.1
Q ss_pred EeEeCceeeeCccHHHHHHHHHHhhCCCCceEEEEeCCcchHHHHHHHHHHh
Q psy15537 21 IAVDGSLYKYHPRLKHWLQKYIQLLVPNKTFRLLLAEDGSGKGAGLASAIAL 72 (80)
Q Consensus 21 IavDGsvye~~p~f~~~l~~~l~~l~~~~~v~~~~a~DGSg~GAAl~aA~a~ 72 (80)
|-+-|+ --++|.+.+++-+.+ +.+|.....++++.+|||++|+++.
T Consensus 407 i~~~GG-ga~s~~w~Qi~ADvl-----g~pV~~~~~~e~~a~GaA~~A~~~~ 452 (498)
T PRK00047 407 LRVDGG-AVANNFLMQFQADIL-----GVPVERPVVAETTALGAAYLAGLAV 452 (498)
T ss_pred EEEecC-cccCHHHHHHHHHhh-----CCeeEecCcccchHHHHHHHHhhhc
Confidence 555555 468888888877666 4577776678899999999999874
No 24
>PTZ00452 actin; Provisional
Probab=68.08 E-value=4.4 Score=30.67 Aligned_cols=49 Identities=18% Similarity=0.352 Sum_probs=36.3
Q ss_pred EeEeCceeeeCccHHHHHHHHHHhhCCC-CceEEEEeCC---cchHHHHHHHHH
Q psy15537 21 IAVDGSLYKYHPRLKHWLQKYIQLLVPN-KTFRLLLAED---GSGKGAGLASAI 70 (80)
Q Consensus 21 IavDGsvye~~p~f~~~l~~~l~~l~~~-~~v~~~~a~D---GSg~GAAl~aA~ 70 (80)
|=+-|+ -...|+|.+||++-|++++|. .++++....| .+.+|+++.|.+
T Consensus 297 IvL~GG-~Sl~~Gf~~RL~~El~~~~p~~~~v~v~~~~~r~~~aW~GgSilasl 349 (375)
T PTZ00452 297 IVLSGG-TTLFPGIANRLSNELTNLVPSQLKIQVAAPPDRRFSAWIGGSIQCTL 349 (375)
T ss_pred EEEecc-cccccCHHHHHHHHHHHhCCCCceeEEecCCCcceeEEECchhhcCc
Confidence 333344 346789999999999999974 5677665444 578899998865
No 25
>PF00022 Actin: Actin; InterPro: IPR004000 Actin [, ] is a ubiquitous protein involved in the formation of filaments that are major components of the cytoskeleton. These filaments interact with myosin to produce a sliding effect, which is the basis of muscular contraction and many aspects of cell motility, including cytokinesis. Each actin protomer binds one molecule of ATP and has one high affinity site for either calcium or magnesium ions, as well as several low affinity sites. Actin exists as a monomer in low salt concentrations, but filaments form rapidly as salt concentration rises, with the consequent hydrolysis of ATP. Actin from many sources forms a tight complex with deoxyribonuclease (DNase I) although the significance of this is still unknown. The formation of this complex results in the inhibition of DNase I activity, and actin loses its ability to polymerise. It has been shown that an ATPase domain of actin shares similarity with ATPase domains of hexokinase and hsp70 proteins [, ]. In vertebrates there are three groups of actin isoforms: alpha, beta and gamma. The alpha actins are found in muscle tissues and are a major constituent of the contractile apparatus. The beta and gamma actins co-exists in most cell types as components of the cytoskeleton and as mediators of internal cell motility. In plants there are many isoforms which are probably involved in a variety of functions such as cytoplasmic streaming, cell shape determination, tip growth, graviperception, cell wall deposition, etc. Recently some divergent actin-like proteins have been identified in several species. These proteins include centractin (actin-RPV) from mammals, fungi yeast ACT5, Neurospora crassa ro-4) and Pneumocystis carinii, which seems to be a component of a multi-subunit centrosomal complex involved in microtubule based vesicle motility (this subfamily is known as ARP1); ARP2 subfamily, which includes chicken ACTL, Saccharomyces cerevisiae ACT2, Drosophila melanogaster 14D and Caenorhabditis elegans actC; ARP3 subfamily, which includes actin 2 from mammals, Drosophila 66B, yeast ACT4 and Schizosaccharomyces pombe act2; and ARP4 subfamily, which includes yeast ACT3 and Drosophila 13E.; PDB: 2OAN_B 1HLU_A 2BTF_A 3UB5_A 3U4L_A 4EFH_A 1YVN_A 1YAG_A 1D4X_A 1MDU_B ....
Probab=67.58 E-value=13 Score=27.43 Aligned_cols=43 Identities=16% Similarity=0.334 Sum_probs=33.4
Q ss_pred eeCccHHHHHHHHHHhhCC-CCceEEEEeC----CcchHHHHHHHHHH
Q psy15537 29 KYHPRLKHWLQKYIQLLVP-NKTFRLLLAE----DGSGKGAGLASAIA 71 (80)
Q Consensus 29 e~~p~f~~~l~~~l~~l~~-~~~v~~~~a~----DGSg~GAAl~aA~a 71 (80)
-..|+|.+++++-|..+.| ..++++...+ ..+-+||++.|.+.
T Consensus 321 S~i~G~~eRL~~eL~~~~~~~~~~~v~~~~~~~~~~aW~Ggsilasl~ 368 (393)
T PF00022_consen 321 SLIPGFKERLQQELRSLLPSSTKVKVIAPPSDRQFAAWIGGSILASLS 368 (393)
T ss_dssp GGSTTHHHHHHHHHHHHSGTTSTEEEE--T-TTTSHHHHHHHHHHTSG
T ss_pred ccccchHHHHHHHhhhhhhccccceeccCchhhhhcccccceeeeccc
Confidence 5779999999999999985 4677777665 45889999988764
No 26
>TIGR01311 glycerol_kin glycerol kinase. This model describes glycerol kinase, a member of the FGGY family of carbohydrate kinases.
Probab=67.39 E-value=12 Score=29.06 Aligned_cols=47 Identities=17% Similarity=0.268 Sum_probs=35.4
Q ss_pred EEeEeCceeeeCccHHHHHHHHHHhhCCCCceEEEEeCCcchHHHHHHHHHHh
Q psy15537 20 TIAVDGSLYKYHPRLKHWLQKYIQLLVPNKTFRLLLAEDGSGKGAGLASAIAL 72 (80)
Q Consensus 20 ~IavDGsvye~~p~f~~~l~~~l~~l~~~~~v~~~~a~DGSg~GAAl~aA~a~ 72 (80)
.|-+.|+. -++|.+.+++-+.+ +..|.....++++.+|||++|+++.
T Consensus 402 ~i~~~GGg-a~s~~w~Qi~ADv~-----g~pv~~~~~~e~~alGaA~~a~~~~ 448 (493)
T TIGR01311 402 KLRVDGGM-TNNNLLMQFQADIL-----GVPVVRPKVTETTALGAAYAAGLAV 448 (493)
T ss_pred eEEEeccc-ccCHHHHHHHHHhc-----CCeeEecCCCcchHHHHHHHHHhhc
Confidence 46667763 48888888777665 4467766778899999999998874
No 27
>PF00752 XPG_N: XPG N-terminal domain; InterPro: IPR006085 Xeroderma pigmentosum (XP) [] is a human autosomal recessive disease, characterised by a high incidence of sunlight-induced skin cancer. People's skin cells with this condition are hypersensitive to ultraviolet light, due to defects in the incision step of DNA excision repair. There are a minimum of seven genetic complementation groups involved in this pathway: XP-A to XP-G. XP-G is one of the most rare and phenotypically heterogeneous of XP, showing anything from slight to extreme dysfunction in DNA excision repair [, ]. XP-G can be corrected by a 133 Kd nuclear protein, XPGC []. XPGC is an acidic protein that confers normal UV resistance in expressing cells []. It is a magnesium-dependent, single-strand DNA endonuclease that makes structure-specific endonucleolytic incisions in a DNA substrate containing a duplex region and single-stranded arms [, ]. XPGC cleaves one strand of the duplex at the border with the single-stranded region []. XPG belongs to a family of proteins that includes RAD2 from Saccharomyces cerevisiae (Baker's yeast) and rad13 from Schizosaccharomyces pombe (Fission yeast), which are single-stranded DNA endonucleases [, ]; mouse and human FEN-1, a structure-specific endonuclease; RAD2 from fission yeast and RAD27 from budding yeast; fission yeast exo1, a 5'-3' double-stranded DNA exonuclease that may act in a pathway that corrects mismatched base pairs; yeast DHS1, and yeast DIN7. Sequence alignment of this family of proteins reveals that similarities are largely confined to two regions. The first is located at the N-terminal extremity (N-region) and corresponds to the first 95 to 105 amino acids. The second region is internal (I-region) and found towards the C terminus; it spans about 140 residues and contains a highly conserved core of 27 amino acids that includes a conserved pentapeptide (E-A-[DE]-A-[QS]). It is possible that the conserved acidic residues are involved in the catalytic mechanism of DNA excision repair in XPG. The amino acids linking the N- and I-regions are not conserved. This entry represents the N-terminal of XPG.; GO: 0004518 nuclease activity, 0006281 DNA repair; PDB: 1A77_A 1A76_A 1MC8_B 3QEB_Z 3QEA_Z 3QE9_Y 1UL1_Z 3Q8K_A 3Q8M_A 3Q8L_A ....
Probab=67.29 E-value=3.4 Score=25.28 Aligned_cols=43 Identities=16% Similarity=0.167 Sum_probs=24.5
Q ss_pred ceEEeEeCceeeeC---------------ccHHHHHHHHHHhhCCCCceEEEEeCCcch
Q psy15537 18 DITIAVDGSLYKYH---------------PRLKHWLQKYIQLLVPNKTFRLLLAEDGSG 61 (80)
Q Consensus 18 ~~~IavDGsvye~~---------------p~f~~~l~~~l~~l~~~~~v~~~~a~DGSg 61 (80)
..+||||||.|-+. +.+...+.+-+..|+ +.+|++...-||.-
T Consensus 26 g~~vaID~s~wl~~~~~~~~~~~~~~~~~~~~~~~~~~r~~~L~-~~gI~PifVFDG~~ 83 (101)
T PF00752_consen 26 GKRVAIDASCWLHQFLFSCREELGQGVGTDSHLRGLFSRLCRLL-EHGIKPIFVFDGKP 83 (101)
T ss_dssp TCEEEEEHHHHHHHHHHHSBCTTSCB-BS-HHHHHHHHHHHHHH-HTTEEEEEEE--ST
T ss_pred CCEEEEEcHHHHHHHHHHhHHHhccccchHHHHHHHHHHHHHHH-HCCCEEEEEECCCC
Confidence 47899999965331 123334444443332 56888888888853
No 28
>PRK15027 xylulokinase; Provisional
Probab=66.77 E-value=21 Score=27.62 Aligned_cols=47 Identities=15% Similarity=0.166 Sum_probs=32.2
Q ss_pred EEeEeCceeeeCccHHHHHHHHHHhhCCCCceEEEE-eCCcchHHHHHHHHHHh
Q psy15537 20 TIAVDGSLYKYHPRLKHWLQKYIQLLVPNKTFRLLL-AEDGSGKGAGLASAIAL 72 (80)
Q Consensus 20 ~IavDGsvye~~p~f~~~l~~~l~~l~~~~~v~~~~-a~DGSg~GAAl~aA~a~ 72 (80)
.|=+.|+. -+++.+.+++.+.+ +.+|.... .++++.+|||++|+++.
T Consensus 389 ~i~~~GGg-a~s~~w~Qi~Adv~-----g~pv~~~~~~~~~~a~GaA~lA~~~~ 436 (484)
T PRK15027 389 SVTLIGGG-ARSEYWRQMLADIS-----GQQLDYRTGGDVGPALGAARLAQIAA 436 (484)
T ss_pred EEEEeCcc-cCCHHHHHHHHHHh-----CCeEEeecCCCcchHHHHHHHHHHhc
Confidence 35555663 57888888777666 44564433 34588999999999874
No 29
>TIGR01315 5C_CHO_kinase FGGY-family pentulose kinase. This model represents a subfamily of the FGGY family of carbohydrate kinases. This subfamily is closely related to a set of ribulose kinases, and many members are designated ribitol kinase. However, the member from Klebsiella pneumoniae, from a ribitol catabolism operon, accepts D-ribulose and to a lesser extent D-arabinitol and ribitol (PubMed:9639934 and JW Lengeler, personal communication); its annotation in GenBank as ribitol kinase is imprecise and may have affected public annotation of related proteins.
Probab=66.62 E-value=14 Score=29.22 Aligned_cols=47 Identities=17% Similarity=0.196 Sum_probs=36.6
Q ss_pred EEeEeCceeeeCccHHHHHHHHHHhhCCCCceEEEEeCCcchHHHHHHHHHHh
Q psy15537 20 TIAVDGSLYKYHPRLKHWLQKYIQLLVPNKTFRLLLAEDGSGKGAGLASAIAL 72 (80)
Q Consensus 20 ~IavDGsvye~~p~f~~~l~~~l~~l~~~~~v~~~~a~DGSg~GAAl~aA~a~ 72 (80)
.|-++|+. -+.|.+.+++-+.+ +.+|.....++++.+|||++|+++.
T Consensus 446 ~i~~~GGg-a~s~~w~Qi~ADvl-----g~pV~~~~~~e~~alGaA~lA~~~~ 492 (541)
T TIGR01315 446 SIFMSGGQ-CQNPLLMQLIADAC-----DMPVLIPYVNEAVLHGAAMLGAKAA 492 (541)
T ss_pred EEEEecCc-ccCHHHHHHHHHHH-----CCeeEecChhHHHHHHHHHHHHHhc
Confidence 47788885 57788877777655 4577777788899999999998764
No 30
>COG1069 AraB Ribulose kinase [Energy production and conversion]
Probab=66.15 E-value=17 Score=29.72 Aligned_cols=61 Identities=16% Similarity=0.295 Sum_probs=46.1
Q ss_pred hhhhhhhhhcCCc---eEEeEeCceeeeCccHHHHHHHHHHhhCCCCceEEEEeCCcchHHHHHHHHHHh
Q psy15537 6 GTAVLVRRIDRDD---ITIAVDGSLYKYHPRLKHWLQKYIQLLVPNKTFRLLLAEDGSGKGAGLASAIAL 72 (80)
Q Consensus 6 ~iaai~~~~~~~~---~~IavDGsvye~~p~f~~~l~~~l~~l~~~~~v~~~~a~DGSg~GAAl~aA~a~ 72 (80)
+.-.|+.-+..+. -+|=+-|+. .|+|.+.+.+-+.+ ++.|.+..+++-.-+|+|+.+|+|.
T Consensus 417 GtR~Iie~~~~~g~~Id~l~~sGG~-~KN~llmql~aDvt-----g~~v~i~~s~~a~llGsAm~~avAa 480 (544)
T COG1069 417 GTRAIIETFEDQGIAIDTLFASGGI-RKNPLLMQLYADVT-----GRPVVIPASDQAVLLGAAMFAAVAA 480 (544)
T ss_pred hHHHHHHHHHHcCCeeeEEEecCCc-ccCHHHHHHHHHhc-----CCeEEeecccchhhhHHHHHHHHHh
Confidence 3344555555432 457778888 99998877766555 5689999999999999999999985
No 31
>PRK10331 L-fuculokinase; Provisional
Probab=66.12 E-value=13 Score=28.73 Aligned_cols=47 Identities=9% Similarity=0.104 Sum_probs=35.2
Q ss_pred EEeEeCceeeeCccHHHHHHHHHHhhCCCCceEEEEeCCcchHHHHHHHHHHh
Q psy15537 20 TIAVDGSLYKYHPRLKHWLQKYIQLLVPNKTFRLLLAEDGSGKGAGLASAIAL 72 (80)
Q Consensus 20 ~IavDGsvye~~p~f~~~l~~~l~~l~~~~~v~~~~a~DGSg~GAAl~aA~a~ 72 (80)
.|-+.|+- -++|.+.+++-+.+ +..|+....++++.+|||++|+++.
T Consensus 392 ~i~~~GGg-a~s~~w~Qi~Advl-----g~pV~~~~~~e~~a~GaA~la~~~~ 438 (470)
T PRK10331 392 ELLLVGGG-SRNALWNQIKANML-----DIPIKVLDDAETTVAGAAMFGWYGV 438 (470)
T ss_pred eEEEEccc-ccCHHHHHHHHHhc-----CCeeEecCcccchHHHHHHHHHHhc
Confidence 35566664 58888888777666 4567777778899999999999874
No 32
>PTZ00004 actin-2; Provisional
Probab=66.08 E-value=3.8 Score=30.86 Aligned_cols=50 Identities=18% Similarity=0.269 Sum_probs=36.0
Q ss_pred EeEeCceeeeCccHHHHHHHHHHhhCCC-CceEEEE---eCCcchHHHHHHHHHH
Q psy15537 21 IAVDGSLYKYHPRLKHWLQKYIQLLVPN-KTFRLLL---AEDGSGKGAGLASAIA 71 (80)
Q Consensus 21 IavDGsvye~~p~f~~~l~~~l~~l~~~-~~v~~~~---a~DGSg~GAAl~aA~a 71 (80)
|=+=|+. -..|+|.+|+++-|+.++|. .++++.. ....+.+|+++++.+.
T Consensus 300 Ivl~GG~-s~~~Gf~~RL~~EL~~~~p~~~~~~v~~~~~~~~~aW~Ggsilas~~ 353 (378)
T PTZ00004 300 IVLSGGT-TMYRGLPERLTKELTTLAPSTMKIKVVAPPERKYSVWIGGSILSSLP 353 (378)
T ss_pred EEeccch-hcCcCHHHHHHHHHHHhCCCCccEEEecCCCCceeEEECcccccCcc
Confidence 3333443 34899999999999999975 4666554 3466888999988653
No 33
>PRK10939 autoinducer-2 (AI-2) kinase; Provisional
Probab=65.63 E-value=17 Score=28.49 Aligned_cols=47 Identities=9% Similarity=0.187 Sum_probs=35.5
Q ss_pred EEeEeCceeeeCccHHHHHHHHHHhhCCCCceEEEEeCCcchHHHHHHHHHHh
Q psy15537 20 TIAVDGSLYKYHPRLKHWLQKYIQLLVPNKTFRLLLAEDGSGKGAGLASAIAL 72 (80)
Q Consensus 20 ~IavDGsvye~~p~f~~~l~~~l~~l~~~~~v~~~~a~DGSg~GAAl~aA~a~ 72 (80)
.|-+.|+ .-+.|.+.+++-+.+ +.+|.....++++.+|||++|+++.
T Consensus 412 ~i~~~GG-ga~s~~w~Qi~ADvl-----g~pV~~~~~~e~~alGaA~lA~~~~ 458 (520)
T PRK10939 412 SLVFAGG-GSKGKLWSQILADVT-----GLPVKVPVVKEATALGCAIAAGVGA 458 (520)
T ss_pred EEEEeCC-cccCHHHHHHHHHhc-----CCeeEEecccCchHHHHHHHHHHHh
Confidence 3556666 467888877776655 4577777788999999999998874
No 34
>PTZ00294 glycerol kinase-like protein; Provisional
Probab=65.08 E-value=15 Score=28.65 Aligned_cols=47 Identities=21% Similarity=0.330 Sum_probs=33.7
Q ss_pred EEeEeCceeeeCccHHHHHHHHHHhhCCCCceEEEEeCCcchHHHHHHHHHHh
Q psy15537 20 TIAVDGSLYKYHPRLKHWLQKYIQLLVPNKTFRLLLAEDGSGKGAGLASAIAL 72 (80)
Q Consensus 20 ~IavDGsvye~~p~f~~~l~~~l~~l~~~~~v~~~~a~DGSg~GAAl~aA~a~ 72 (80)
.|-+.|+.- +++.+.+++-+.+ +.+|.....++.+.+|||++|+++.
T Consensus 409 ~i~~~GG~a-~s~~w~Qi~Adv~-----g~pV~~~~~~e~~alGaAl~aa~a~ 455 (504)
T PTZ00294 409 SLRVDGGLT-KNKLLMQFQADIL-----GKDIVVPEMAETTALGAALLAGLAV 455 (504)
T ss_pred eEEEecccc-cCHHHHHHHHHHh-----CCceEecCcccchHHHHHHHHHhhc
Confidence 355666555 6677776666555 4567766777889999999999874
No 35
>PF01764 Lipase_3: Lipase (class 3); InterPro: IPR002921 Triglyceride lipases are lipolytic enzymes that hydrolyse ester linkages of triglycerides []. Lipases are widely distributed in animals, plants and prokaryotes. This family of lipases have been called Class 3 as they are not closely related to other lipase families.; GO: 0004806 triglyceride lipase activity, 0006629 lipid metabolic process; PDB: 1LGY_A 1DTE_A 1DT5_F 4DYH_B 1DU4_C 4EA6_B 1GT6_B 1EIN_A 1DT3_A 1TIB_A ....
Probab=63.13 E-value=29 Score=21.46 Aligned_cols=41 Identities=27% Similarity=0.366 Sum_probs=24.9
Q ss_pred cHHHHHHHHHHhhC---CCCceEEEEeCCcchHHHHHHHHHHhHHhhh
Q psy15537 33 RLKHWLQKYIQLLV---PNKTFRLLLAEDGSGKGAGLASAIALKLGAF 77 (80)
Q Consensus 33 ~f~~~l~~~l~~l~---~~~~v~~~~a~DGSg~GAAl~aA~a~~~~~~ 77 (80)
.+.+.+.+.++++. |+.+|.+ =|-++|+|++..++.....+
T Consensus 45 ~~~~~~~~~l~~~~~~~~~~~i~i----tGHSLGGalA~l~a~~l~~~ 88 (140)
T PF01764_consen 45 SLYDQILDALKELVEKYPDYSIVI----TGHSLGGALASLAAADLASH 88 (140)
T ss_dssp HHHHHHHHHHHHHHHHSTTSEEEE----EEETHHHHHHHHHHHHHHHC
T ss_pred HHHHHHHHHHHHHHhcccCccchh----hccchHHHHHHHHHHhhhhc
Confidence 33444556666666 3234444 48888999888777665443
No 36
>TIGR02628 fuculo_kin_coli L-fuculokinase. Members of this family are L-fuculokinase, from the clade that includes the L-fuculokinase of Escherichia coli. This enzyme catalyzes the second step in fucose catabolism. This family belongs to FGGY family of carbohydrate kinases (pfam02782, pfam00370). It is encoded by the kinase (K) gene of the fucose (fuc) operon.
Probab=60.80 E-value=18 Score=27.91 Aligned_cols=47 Identities=6% Similarity=0.087 Sum_probs=34.1
Q ss_pred EEeEeCceeeeCccHHHHHHHHHHhhCCCCceEEEEeCCcchHHHHHHHHHHh
Q psy15537 20 TIAVDGSLYKYHPRLKHWLQKYIQLLVPNKTFRLLLAEDGSGKGAGLASAIAL 72 (80)
Q Consensus 20 ~IavDGsvye~~p~f~~~l~~~l~~l~~~~~v~~~~a~DGSg~GAAl~aA~a~ 72 (80)
.|-+.|+- -++|.+.+++-+.+ +.+|.....++++.+|||++|+++.
T Consensus 396 ~i~~~GGg-a~s~~w~Qi~Adv~-----g~pV~~~~~~e~~~lGaA~~a~~a~ 442 (465)
T TIGR02628 396 ELLLVGGG-SKNTLWNQIRANML-----DIPVKVVDDAETTVAGAAMFGFYGV 442 (465)
T ss_pred eEEEecCc-cCCHHHHHHhhhhc-----CCeeEeccCCcchHHHHHHHHHHhc
Confidence 35555553 57888877776555 3567767777899999999999874
No 37
>PF06935 DUF1284: Protein of unknown function (DUF1284); InterPro: IPR009702 This family consists of several hypothetical bacterial and archaeal proteins of around 130 residues in length. The function of this family is unknown, although it is thought that they may be iron-sulphur binding proteins.
Probab=59.59 E-value=18 Score=22.89 Aligned_cols=29 Identities=21% Similarity=0.452 Sum_probs=24.1
Q ss_pred eCccHHHHHHHHHHhh--CCCCceEEEEeCC
Q psy15537 30 YHPRLKHWLQKYIQLL--VPNKTFRLLLAED 58 (80)
Q Consensus 30 ~~p~f~~~l~~~l~~l--~~~~~v~~~~a~D 58 (80)
+.|.|-++|++.+.+| -|+..|.++...|
T Consensus 2 YS~~Fv~Nm~~Iv~~l~~~~~~~I~iv~~~D 32 (103)
T PF06935_consen 2 YSPEFVENMKKIVERLRNDPGEPIEIVDGPD 32 (103)
T ss_pred CCHHHHHHHHHHHHHHHHCCCCCEEEEECcC
Confidence 4689999999999999 4778888887665
No 38
>PRK05571 ribose-5-phosphate isomerase B; Provisional
Probab=59.45 E-value=11 Score=25.64 Aligned_cols=35 Identities=14% Similarity=0.111 Sum_probs=27.2
Q ss_pred eCccHHHHHHHHHHhhCCCCceEEEEeCCcchHHHHHHH
Q psy15537 30 YHPRLKHWLQKYIQLLVPNKTFRLLLAEDGSGKGAGLAS 68 (80)
Q Consensus 30 ~~p~f~~~l~~~l~~l~~~~~v~~~~a~DGSg~GAAl~a 68 (80)
.||.|...+-+.+. +.+..+.+.=.|||+|.+++|
T Consensus 42 dYpd~a~~va~~V~----~g~~~~GIliCGtGiG~siaA 76 (148)
T PRK05571 42 DYPDYAKKVAEAVV----AGEADRGILICGTGIGMSIAA 76 (148)
T ss_pred CHHHHHHHHHHHHH----cCCCCEEEEEcCCcHHHHHHH
Confidence 67888877777763 345777888899999999876
No 39
>TIGR02065 ECX1 archaeal exosome-like complex exonuclease 1. This family contains the archaeal protein orthologous to the eukaryotic exosome protein Rrp41. It is somewhat more distantly related to the bacterial protein ribonuclease PH. An exosome-like complex has been demonstrated experimentally for the Archaea in Sulfolobus solfataricus, so members of this family are designated exosome complex exonuclease 1, after usage in SwissProt.
Probab=59.42 E-value=35 Score=23.94 Aligned_cols=38 Identities=18% Similarity=0.390 Sum_probs=26.0
Q ss_pred HHHHHHHHHHhhC-----CCCceEE---EEeCCcchHHHHHHHHHH
Q psy15537 34 LKHWLQKYIQLLV-----PNKTFRL---LLAEDGSGKGAGLASAIA 71 (80)
Q Consensus 34 f~~~l~~~l~~l~-----~~~~v~~---~~a~DGSg~GAAl~aA~a 71 (80)
+...++++++.++ |...|.+ .+..||+-.-|++.||..
T Consensus 93 ~s~~l~~~l~~~i~~~~~p~~~i~i~v~vl~~DG~~~~aai~aa~l 138 (230)
T TIGR02065 93 ISKVIREALEPAILLEQFPRTAIDVFIEVLQADAGTRCAGLTAASL 138 (230)
T ss_pred HHHHHHHHHHHHhChhhcCCeEEEEEEEEEEcCCCHHHHHHHHHHH
Confidence 5556677776655 2333443 689999999888888764
No 40
>KOG0680|consensus
Probab=56.99 E-value=14 Score=28.91 Aligned_cols=32 Identities=16% Similarity=0.384 Sum_probs=23.3
Q ss_pred EeEeCceeeeCccHHHHHHHHHHhhCCC-CceEEE
Q psy15537 21 IAVDGSLYKYHPRLKHWLQKYIQLLVPN-KTFRLL 54 (80)
Q Consensus 21 IavDGsvye~~p~f~~~l~~~l~~l~~~-~~v~~~ 54 (80)
|-+-|+- ..|+|++|+.+-++.|+|. ..+++.
T Consensus 322 v~iGGn~--~fPgF~~RL~~Elr~l~P~d~~v~V~ 354 (400)
T KOG0680|consen 322 VCIGGNS--NFPGFRQRLARELRSLLPADWEVSVS 354 (400)
T ss_pred EEecCcc--CCcchHHHHHHHHHhhCCccceEEEe
Confidence 4455552 5799999999999999984 344443
No 41
>smart00268 ACTIN Actin. ACTIN subfamily of ACTIN/mreB/sugarkinase/Hsp70 superfamily
Probab=55.17 E-value=6.4 Score=29.10 Aligned_cols=50 Identities=16% Similarity=0.285 Sum_probs=36.9
Q ss_pred EeEeCceeeeCccHHHHHHHHHHhhCCC-CceEEEEeC---CcchHHHHHHHHHH
Q psy15537 21 IAVDGSLYKYHPRLKHWLQKYIQLLVPN-KTFRLLLAE---DGSGKGAGLASAIA 71 (80)
Q Consensus 21 IavDGsvye~~p~f~~~l~~~l~~l~~~-~~v~~~~a~---DGSg~GAAl~aA~a 71 (80)
|=+=|+.-. .|+|.+++++.+++++|. .++++.... ..+.+|+++.|.+.
T Consensus 295 IvltGG~s~-i~Gl~~RL~~el~~~~p~~~~v~v~~~~~~~~~~W~G~silas~~ 348 (373)
T smart00268 295 IVLSGGSTL-IPGFGERLEKELKQLAPKKLKVKVIAPPERKYSVWLGGSILASLS 348 (373)
T ss_pred eEeeccccc-CcCHHHHHHHHHHHhCCCCceeEEecCCCCccceEeCcccccCcc
Confidence 555566544 699999999999999974 466665443 56888999887653
No 42
>smart00485 XPGN Xeroderma pigmentosum G N-region. domain in nucleases
Probab=54.38 E-value=13 Score=22.64 Aligned_cols=44 Identities=23% Similarity=0.240 Sum_probs=25.1
Q ss_pred ceEEeEeCceeee-------------CccH--HHHHHHHHHhhCCCCceEEEEeCCcchH
Q psy15537 18 DITIAVDGSLYKY-------------HPRL--KHWLQKYIQLLVPNKTFRLLLAEDGSGK 62 (80)
Q Consensus 18 ~~~IavDGsvye~-------------~p~f--~~~l~~~l~~l~~~~~v~~~~a~DGSg~ 62 (80)
..+||||+|.|-+ +|.. -..+..-+..|+ +.+|++...-||-..
T Consensus 24 g~~vaIDa~~wl~~~~~~~~~~~~~~~~~~~~l~~~~~rl~~L~-~~~I~PifVFDG~~~ 82 (99)
T smart00485 24 GKTLAIDASIWLYQFLTACREKLGTPLPNSKHLMGLFYRTCRLL-EFGIKPIFVFDGKPP 82 (99)
T ss_pred CceEeccHHHHHHHHHHHHhhhhcCCCCchHHHHHHHHHHHHHH-HCCCeEEEEECCCCc
Confidence 4789999999632 1111 111222222232 567888888898643
No 43
>TIGR01118 lacA galactose-6-phosphate isomerase, LacA subunit. This family contains members from low GC gram-positive bacteria. Galactose-6-phosphate isomerase is involved in lactose catabolism by the tagatose-6-phosphate pathway.
Probab=53.85 E-value=14 Score=24.99 Aligned_cols=36 Identities=14% Similarity=0.090 Sum_probs=28.0
Q ss_pred eeCccHHHHHHHHHHhhCCCCceEEEEeCCcchHHHHHHH
Q psy15537 29 KYHPRLKHWLQKYIQLLVPNKTFRLLLAEDGSGKGAGLAS 68 (80)
Q Consensus 29 e~~p~f~~~l~~~l~~l~~~~~v~~~~a~DGSg~GAAl~a 68 (80)
..||.|...+-+.+.+ .+....+.=.|+|+|.+++|
T Consensus 38 ~dYpd~a~~va~~V~~----~e~~~GIliCGtGiG~siaA 73 (141)
T TIGR01118 38 QDFVDVTLAVASEVQK----DEQNLGIVIDAYGAGSFMVA 73 (141)
T ss_pred CCcHHHHHHHHHHHHc----CCCceEEEEcCCCHhHhhhh
Confidence 4688888888877743 35667778899999999876
No 44
>PRK12615 galactose-6-phosphate isomerase subunit LacB; Reviewed
Probab=53.61 E-value=16 Score=25.54 Aligned_cols=35 Identities=20% Similarity=0.134 Sum_probs=27.4
Q ss_pred eCccHHHHHHHHHHhhCCCCceEEEEeCCcchHHHHHHH
Q psy15537 30 YHPRLKHWLQKYIQLLVPNKTFRLLLAEDGSGKGAGLAS 68 (80)
Q Consensus 30 ~~p~f~~~l~~~l~~l~~~~~v~~~~a~DGSg~GAAl~a 68 (80)
.||.|...+-+.+. +.+..+.+.=.|||+|.+++|
T Consensus 41 dYpd~a~~va~~V~----~g~~d~GIliCGTGiG~siaA 75 (171)
T PRK12615 41 HYPIFGKKVGEAVV----NGQADLGVCICGTGVGINNAV 75 (171)
T ss_pred ChHHHHHHHHHHHH----cCCCCEEEEEcCCcHHHHHHH
Confidence 58888888777773 335677888899999999876
No 45
>TIGR01120 rpiB ribose 5-phosphate isomerase B. Involved in the non-oxidative branch of the pentose phospate pathway.
Probab=52.65 E-value=24 Score=23.86 Aligned_cols=36 Identities=14% Similarity=0.114 Sum_probs=28.3
Q ss_pred eeCccHHHHHHHHHHhhCCCCceEEEEeCCcchHHHHHHH
Q psy15537 29 KYHPRLKHWLQKYIQLLVPNKTFRLLLAEDGSGKGAGLAS 68 (80)
Q Consensus 29 e~~p~f~~~l~~~l~~l~~~~~v~~~~a~DGSg~GAAl~a 68 (80)
..||.|...+-+.+. +.+..+.+.-.|||+|.+++|
T Consensus 39 ~dYpd~a~~va~~V~----~~~~~~GIliCGtGiG~siaA 74 (143)
T TIGR01120 39 TDYPHYAKQVALAVA----GGEVDGGILICGTGIGMSIAA 74 (143)
T ss_pred CCHHHHHHHHHHHHH----CCCCceEEEEcCCcHHHHHHH
Confidence 368888888877773 346778888899999999876
No 46
>PRK09273 hypothetical protein; Provisional
Probab=51.61 E-value=19 Score=26.02 Aligned_cols=35 Identities=20% Similarity=0.190 Sum_probs=26.5
Q ss_pred eCccHHHHHHHHHHhhCCCCceEEEEeCCcchHHHHHHH
Q psy15537 30 YHPRLKHWLQKYIQLLVPNKTFRLLLAEDGSGKGAGLAS 68 (80)
Q Consensus 30 ~~p~f~~~l~~~l~~l~~~~~v~~~~a~DGSg~GAAl~a 68 (80)
.||.|...+-+.+ .+....+.+.-.|||.|+.++|
T Consensus 47 dYpd~a~~vA~~V----~~g~~d~GIliCGTGiG~siAA 81 (211)
T PRK09273 47 TYVQNGIMASILL----NSKAVDFVVTGCGTGQGAMLAL 81 (211)
T ss_pred ChHHHHHHHHHHH----HcCCCCEEEEEcCcHHHHHHHH
Confidence 5777776666555 3346788888999999999876
No 47
>PRK03983 exosome complex exonuclease Rrp41; Provisional
Probab=51.23 E-value=59 Score=23.02 Aligned_cols=38 Identities=16% Similarity=0.363 Sum_probs=26.3
Q ss_pred HHHHHHHHHHhhC-----CCCceEE---EEeCCcchHHHHHHHHHH
Q psy15537 34 LKHWLQKYIQLLV-----PNKTFRL---LLAEDGSGKGAGLASAIA 71 (80)
Q Consensus 34 f~~~l~~~l~~l~-----~~~~v~~---~~a~DGSg~GAAl~aA~a 71 (80)
+...+.++++..+ |...|.+ .+..||+-.-|++.|+..
T Consensus 99 ~s~~l~~~l~~~i~~~~~p~~~I~I~i~VL~~DG~~~~aai~Aa~l 144 (244)
T PRK03983 99 ISKVIREALEPAIMLELFPRTVIDVFIEVLQADAGTRVAGITAASL 144 (244)
T ss_pred HHHHHHHHHHHhccHHhCCCeEEEEEEEEEECCCCHHHHHHHHHHH
Confidence 5567777777755 3333554 489999988888887654
No 48
>PRK12613 galactose-6-phosphate isomerase subunit LacA; Provisional
Probab=51.10 E-value=16 Score=24.69 Aligned_cols=36 Identities=17% Similarity=0.069 Sum_probs=28.0
Q ss_pred eeCccHHHHHHHHHHhhCCCCceEEEEeCCcchHHHHHHH
Q psy15537 29 KYHPRLKHWLQKYIQLLVPNKTFRLLLAEDGSGKGAGLAS 68 (80)
Q Consensus 29 e~~p~f~~~l~~~l~~l~~~~~v~~~~a~DGSg~GAAl~a 68 (80)
..||.|...+-+.+. +.+..+.+.=+|||+|.+++|
T Consensus 37 ~dypd~a~~va~~V~----~~e~~~GIliCGtGiG~siaA 72 (141)
T PRK12613 37 SDFIDNTLAVAKAVN----EAEGRLGIMVDAYGAGPFMVA 72 (141)
T ss_pred CChHHHHHHHHHHHH----cCCCceEEEEcCCCHhHhhhh
Confidence 368888888877773 345777888899999999876
No 49
>PTZ00280 Actin-related protein 3; Provisional
Probab=50.23 E-value=15 Score=27.99 Aligned_cols=42 Identities=10% Similarity=0.069 Sum_probs=30.9
Q ss_pred eCccHHHHHHHHHHhhCCC-----------------CceEEEEeC---CcchHHHHHHHHHH
Q psy15537 30 YHPRLKHWLQKYIQLLVPN-----------------KTFRLLLAE---DGSGKGAGLASAIA 71 (80)
Q Consensus 30 ~~p~f~~~l~~~l~~l~~~-----------------~~v~~~~a~---DGSg~GAAl~aA~a 71 (80)
..|+|.+||++-+++++|. .+|++...+ ..+.+|+++.|...
T Consensus 324 ~~~Gf~eRL~~El~~~~~~~~~~~~~~~~~~~~~~~~~v~v~~~~~~~~~~W~GgSilas~~ 385 (414)
T PTZ00280 324 MFKGFDKRLQRDVRKRVDRRLKKAEELSGGKLKPIPIDVNVVSHPRQRYAVWYGGSMLASSP 385 (414)
T ss_pred cCcCHHHHHHHHHHHhccccccccccccccccCCCCceEEEecCCccceeEEEChhhcccCc
Confidence 5689999999999999742 245554333 56799999988653
No 50
>PF00538 Linker_histone: linker histone H1 and H5 family; InterPro: IPR005818 Histone proteins have central roles in both chromatin organisation (as structural units of the nucleosome) and gene regulation (as dynamic components that have a direct impact on DNA transcription and replication). Eukaryotic DNA wraps around a histone octamer to form a nucleosome, the first order of compaction of eukaryotic chromatin. The core histone octamer is composed of a central H3-H4 tetramer and two flanking H2A-H2B dimers. Each of the core histone contains a common structural motif, called the histone fold, which facilitates the interactions between the individual core histones. In addition to the core histones, there is a "linker histone" called H1 (or H5 in avian species). The linker histones present in all multicellular eukaryotes are the most divergent group of histones, with numerous cell type- and stage-specific variant. Linker histone H1 is an essential component of chromatin structure. H1 links nucleosomes into higher order structures. Histone H5 performs the same function as histone H1, and replaces H1 in certain cells. The structure of GH5, the globular domain of the linker histone H5 is known [, ]. The fold is similar to the DNA-binding domain of the catabolite gene activator protein, CAP, thus providing a possible model for the binding of GH5 to DNA. The linker histones, which do not contain the histone fold motif, are critical to the higher-order compaction of chromatin, because they bind to internucleosomal DNA and facilitate interactions between individual nucleosomes. In addition, H1 variants have been shown to be involved in the regulation of developmental genes. A common feature of this protein family is a tripartite structure in which a globular (H15) domain of about 80 amino acids is flanked by two less structured N- and C-terminal tails. The H15 domain is also characterised by high sequence homology among the family of linker histones. The highly conserved H15 domain is essential for the binding of H1 or H5 to the nucleosome. It consists of a three helix bundle (I-III), with a beta-hairpin at the C terminus. There is also a short three-residue stretch between helices I and II that is in the beta-strand conformation. Together with the C-terminal beta-hairpin, this strand forms the third strand of an antiparallel beta-sheet [, , , ]. Proteins known to contain a H15 domain are: - Eukaryotic histone H1. The histones H1 constitute a family with many variants, differing in their affinity for chromatin. Several variants are simultaneously present in a single cell. For example, the nucleated erythrocytes of birds contain both H1 and H5, the latter being an extreme variant of H1. - Eukaryotic MHYST family of histone acetyltransferase. Histone acetyltransferases transfer an acetyl group from acetyl-CoA to the epsylon- amino group of lysine within the basic NH2-termini of histones, which bind the acidic phosphates of DNA []. This entry represents the H15 domain.; GO: 0003677 DNA binding, 0006334 nucleosome assembly, 0000786 nucleosome, 0005634 nucleus; PDB: 2LSO_A 2RQP_A 1UHM_A 1UST_A 1GHC_A 1HST_A 1YQA_A 1USS_A.
Probab=49.44 E-value=14 Score=21.80 Aligned_cols=16 Identities=25% Similarity=0.397 Sum_probs=14.4
Q ss_pred CccHHHHHHHHHHhhC
Q psy15537 31 HPRLKHWLQKYIQLLV 46 (80)
Q Consensus 31 ~p~f~~~l~~~l~~l~ 46 (80)
||.|.+++.++|.+|-
T Consensus 2 hP~y~~mI~eAI~~l~ 17 (77)
T PF00538_consen 2 HPPYSDMILEAIKALK 17 (77)
T ss_dssp SSCHHHHHHHHHHHCC
T ss_pred CCCHHHHHHHHHHHcC
Confidence 8999999999998884
No 51
>KOG2517|consensus
Probab=49.38 E-value=25 Score=28.61 Aligned_cols=50 Identities=18% Similarity=0.111 Sum_probs=36.0
Q ss_pred eEEeEeCceeeeCccHHHHHHHHHHhhCCCCceEEEEeCCcchHHHHHHHHHHhHH
Q psy15537 19 ITIAVDGSLYKYHPRLKHWLQKYIQLLVPNKTFRLLLAEDGSGKGAGLASAIALKL 74 (80)
Q Consensus 19 ~~IavDGsvye~~p~f~~~l~~~l~~l~~~~~v~~~~a~DGSg~GAAl~aA~a~~~ 74 (80)
.+..+||++- ++|.|.+...+.+ + ..|.+-...|-.++|||++++.+...
T Consensus 416 ~~L~~~GG~s-~N~ll~Q~~ADi~----g-~pv~~p~~~e~~~~GaA~l~~~a~~~ 465 (516)
T KOG2517|consen 416 STLRVCGGLS-KNPLLMQLQADIL----G-LPVVRPQDVEAVALGAAMLAGAASGK 465 (516)
T ss_pred ceeeeccccc-cCHHHHHHHHHHh----C-CccccccchhHHHHHHHHHHHhhcCC
Confidence 5689999975 5677766666544 3 45555555666999999999988654
No 52
>PRK08622 galactose-6-phosphate isomerase subunit LacB; Reviewed
Probab=48.98 E-value=21 Score=24.99 Aligned_cols=35 Identities=17% Similarity=0.118 Sum_probs=27.0
Q ss_pred eCccHHHHHHHHHHhhCCCCceEEEEeCCcchHHHHHHH
Q psy15537 30 YHPRLKHWLQKYIQLLVPNKTFRLLLAEDGSGKGAGLAS 68 (80)
Q Consensus 30 ~~p~f~~~l~~~l~~l~~~~~v~~~~a~DGSg~GAAl~a 68 (80)
.||.|...+-+.+. +.+..+.+.-.|||+|.+++|
T Consensus 41 dYpd~a~~va~~V~----~g~~d~GIliCGTGiG~siaA 75 (171)
T PRK08622 41 HYPIFGKKVGEAVA----SGEADLGVCICGTGVGISNAV 75 (171)
T ss_pred ChHHHHHHHHHHHH----cCCCcEEEEEcCCcHHHHHHH
Confidence 47888877777773 345677888899999999876
No 53
>TIGR00689 rpiB_lacA_lacB sugar-phosphate isomerases, RpiB/LacA/LacB family. Proteins of known function in this family act as sugar (pentose and/or hexose)-phosphate isomerases, including the LacA and LacB subunits of galactose-6-phosphate isomerases from Gram-positive bacteria and RpiB. RpiB is the second ribose phosphate isomerase of E. coli. It lacks homology to RpiA, its inducer is unknown (but is not ribose), and it can be replaced by the homologous galactose-6-phosphate isomerase of Streptococcus mutans, all of which suggests that the ribose phosphate isomerase activity of RpiB is a secondary function. On the other hand, there appear to be a significant number of species which contain rpiB, lack rpiA and seem to require rpi activity in order to copplete the pentose phosphate pathway.
Probab=48.70 E-value=29 Score=23.48 Aligned_cols=35 Identities=17% Similarity=0.214 Sum_probs=27.5
Q ss_pred eCccHHHHHHHHHHhhCCCCceEEEEeCCcchHHHHHHH
Q psy15537 30 YHPRLKHWLQKYIQLLVPNKTFRLLLAEDGSGKGAGLAS 68 (80)
Q Consensus 30 ~~p~f~~~l~~~l~~l~~~~~v~~~~a~DGSg~GAAl~a 68 (80)
.||.|...+-+.+. +.+..+.+.=.|||+|.+++|
T Consensus 39 dYpd~a~~va~~V~----~g~~~~GIliCGtGiG~siaA 73 (144)
T TIGR00689 39 DYPDYAKLVADKVV----AGEVSLGILICGTGIGMSIAA 73 (144)
T ss_pred ChHHHHHHHHHHHH----cCCCceEEEEcCCcHHHHHHH
Confidence 57888887777763 346777888899999999876
No 54
>TIGR02886 spore_II_AA anti-sigma F factor antagonist. The anti-sigma F factor antagonist, also called stage II sporulation protein AA, is a protein universal among endospore-forming bacteria, all of which belong to the Firmcutes
Probab=48.68 E-value=58 Score=19.48 Aligned_cols=49 Identities=27% Similarity=0.329 Sum_probs=31.0
Q ss_pred ceEEeEeCce-eeeCccHHHHHHHHHHhhCCCCceEEEEe----CCcchHHHHHH
Q psy15537 18 DITIAVDGSL-YKYHPRLKHWLQKYIQLLVPNKTFRLLLA----EDGSGKGAGLA 67 (80)
Q Consensus 18 ~~~IavDGsv-ye~~p~f~~~l~~~l~~l~~~~~v~~~~a----~DGSg~GAAl~ 67 (80)
-.++-++|.+ |..-+.|++.+++.+.+- +-..+.+-++ =|.||+|+-+.
T Consensus 9 ~~vi~l~G~L~f~~~~~~~~~l~~~~~~~-~~~~vilDls~v~~iDssgi~~L~~ 62 (106)
T TIGR02886 9 VLIVRLSGELDHHTAERVRRKIDDAIERR-PIKHLILNLKNVTFMDSSGLGVILG 62 (106)
T ss_pred EEEEEEecccchhhHHHHHHHHHHHHHhC-CCCEEEEECCCCcEecchHHHHHHH
Confidence 4678999998 556677777777766431 1134444333 38888876554
No 55
>TIGR00071 hisT_truA pseudouridylate synthase I. universal so far, single copy in all prokaryotes, 3 in yeast. Trusted cutoff for orthology is about 100 based on 1 match only in complete prokaryote with length 200.
Probab=48.65 E-value=46 Score=23.59 Aligned_cols=30 Identities=27% Similarity=0.362 Sum_probs=23.2
Q ss_pred ceEEeEeCcee---eeC---ccHHHHHHHHHHhhCC
Q psy15537 18 DITIAVDGSLY---KYH---PRLKHWLQKYIQLLVP 47 (80)
Q Consensus 18 ~~~IavDGsvy---e~~---p~f~~~l~~~l~~l~~ 47 (80)
.++|+.||+-| ++. +.-++.++++|..+..
T Consensus 4 ~l~i~YdGt~f~G~Q~Q~~~~TVq~~le~aL~~~~~ 39 (227)
T TIGR00071 4 ALKIAYDGSNYHGWQRQPNKRTVQGELEKALEAIGK 39 (227)
T ss_pred EEEEEEcCCCeeEEeECcCCCCHHHHHHHHHHHHhC
Confidence 36899999987 333 5688899999988873
No 56
>PRK08621 galactose-6-phosphate isomerase subunit LacA; Reviewed
Probab=47.93 E-value=23 Score=24.02 Aligned_cols=35 Identities=14% Similarity=0.030 Sum_probs=27.5
Q ss_pred eCccHHHHHHHHHHhhCCCCceEEEEeCCcchHHHHHHH
Q psy15537 30 YHPRLKHWLQKYIQLLVPNKTFRLLLAEDGSGKGAGLAS 68 (80)
Q Consensus 30 ~~p~f~~~l~~~l~~l~~~~~v~~~~a~DGSg~GAAl~a 68 (80)
-||.|...+-+.+.+ .+....+.=.|||+|.+++|
T Consensus 39 dYpd~a~~va~~V~~----~~~~~GIliCGTGiG~siaA 73 (142)
T PRK08621 39 DFVDSTLAVAKEVNK----SEDNLGIVIDAYGAGSFMVA 73 (142)
T ss_pred CcHHHHHHHHHHHHc----CCCceEEEEcCCChhhhhhh
Confidence 688888888877733 45677778899999999876
No 57
>smart00526 H15 Domain in histone families 1 and 5.
Probab=47.00 E-value=17 Score=20.68 Aligned_cols=17 Identities=24% Similarity=0.342 Sum_probs=15.5
Q ss_pred eCccHHHHHHHHHHhhC
Q psy15537 30 YHPRLKHWLQKYIQLLV 46 (80)
Q Consensus 30 ~~p~f~~~l~~~l~~l~ 46 (80)
.||.|.+++.++|.+|-
T Consensus 3 ~hP~~~~mI~eAI~~l~ 19 (66)
T smart00526 3 SHPPYSEMITEAISALK 19 (66)
T ss_pred CCCCHHHHHHHHHHHcC
Confidence 58999999999999985
No 58
>TIGR02627 rhamnulo_kin rhamnulokinase. This model describes rhamnulokinase, an enzyme that catalyzes the second step in rhamnose catabolism.
Probab=44.47 E-value=36 Score=26.12 Aligned_cols=46 Identities=13% Similarity=-0.008 Sum_probs=31.3
Q ss_pred EEeEeCceeeeCccHHHHHHHHHHhhCCCCceEEEEeCCcchHHHHHHHHHHh
Q psy15537 20 TIAVDGSLYKYHPRLKHWLQKYIQLLVPNKTFRLLLAEDGSGKGAGLASAIAL 72 (80)
Q Consensus 20 ~IavDGsvye~~p~f~~~l~~~l~~l~~~~~v~~~~a~DGSg~GAAl~aA~a~ 72 (80)
.|=+.|+- -+.|.+.+++-+.+ +.+|... .++++.+|||++|+++.
T Consensus 390 ~i~~~GGg-a~s~~w~Qi~ADvl-----g~pV~~~-~~e~~a~GaA~~a~~~~ 435 (454)
T TIGR02627 390 QLHIVGGG-SQNAFLNQLCADAC-----GIRVIAG-PVEASTLGNIGVQLMAL 435 (454)
T ss_pred EEEEECCh-hhhHHHHHHHHHHh-----CCceEcC-CchHHHHHHHHHHHHhc
Confidence 35555554 47777777766655 3456433 36688999999999875
No 59
>PRK01158 phosphoglycolate phosphatase; Provisional
Probab=43.31 E-value=20 Score=24.24 Aligned_cols=35 Identities=23% Similarity=0.260 Sum_probs=25.0
Q ss_pred EeEeCceeeeCccHHHHHHHHHHhhCCCCceEEEEe
Q psy15537 21 IAVDGSLYKYHPRLKHWLQKYIQLLVPNKTFRLLLA 56 (80)
Q Consensus 21 IavDGsvye~~p~f~~~l~~~l~~l~~~~~v~~~~a 56 (80)
.=+|||++.......+...++|+++- +..+.+.++
T Consensus 8 ~DlDGTLl~~~~~i~~~~~~al~~l~-~~G~~~~ia 42 (230)
T PRK01158 8 IDIDGTITDKDRRLSLKAVEAIRKAE-KLGIPVILA 42 (230)
T ss_pred EecCCCcCCCCCccCHHHHHHHHHHH-HCCCEEEEE
Confidence 34799999888778888888998875 223444444
No 60
>PRK13317 pantothenate kinase; Provisional
Probab=42.68 E-value=65 Score=23.67 Aligned_cols=23 Identities=26% Similarity=0.413 Sum_probs=19.6
Q ss_pred EEeEeCceeeeCccHHHHHHHHH
Q psy15537 20 TIAVDGSLYKYHPRLKHWLQKYI 42 (80)
Q Consensus 20 ~IavDGsvye~~p~f~~~l~~~l 42 (80)
.|..-|+-..++|.+++.+.+.+
T Consensus 225 ~Ivf~G~gla~n~~l~~~l~~~l 247 (277)
T PRK13317 225 NIVYIGSTLTNNPLLQEIIESYT 247 (277)
T ss_pred eEEEECcccccCHHHHHHHHHHH
Confidence 79999988889999988888765
No 61
>PRK14588 tRNA pseudouridine synthase ACD; Provisional
Probab=42.54 E-value=61 Score=23.78 Aligned_cols=30 Identities=17% Similarity=0.256 Sum_probs=23.6
Q ss_pred ceEEeEeCceee---e---CccHHHHHHHHHHhhCC
Q psy15537 18 DITIAVDGSLYK---Y---HPRLKHWLQKYIQLLVP 47 (80)
Q Consensus 18 ~~~IavDGsvye---~---~p~f~~~l~~~l~~l~~ 47 (80)
.++|+.||+-|. + .|..++.++++|..+.+
T Consensus 5 ~l~iaYdGt~f~G~Q~Q~~~~TVq~~Le~aL~~l~~ 40 (272)
T PRK14588 5 ALLLEYDGTDFAGSQWQTDGRTVQGALEAAWQALTQ 40 (272)
T ss_pred EEEEEEcCCceeeeEECCCCCCHHHHHHHHHHHhhC
Confidence 478999999972 2 25689999999988874
No 62
>PF02017 CIDE-N: CIDE-N domain; InterPro: IPR003508 This domain consists of caspase-activated (CAD) nucleases, which induce DNA fragmentation and chromatin condensation during apoptosis, and the cell death activator proteins CIDE-A and CIDE-B, which are inhibitors of CAD nuclease. The two proteins interact through the region defined by the method signatures.; GO: 0006915 apoptosis, 0005622 intracellular; PDB: 1IBX_A 2EEL_A 1F2R_I 1C9F_A 1D4B_A.
Probab=42.25 E-value=52 Score=20.18 Aligned_cols=30 Identities=37% Similarity=0.555 Sum_probs=23.6
Q ss_pred cHHHHHHHHHHhhC-CCCceEEEEeCCcchH
Q psy15537 33 RLKHWLQKYIQLLV-PNKTFRLLLAEDGSGK 62 (80)
Q Consensus 33 ~f~~~l~~~l~~l~-~~~~v~~~~a~DGSg~ 62 (80)
.+.+.++++-+.+. +...+++.+.+||..+
T Consensus 22 sL~eL~~K~~~~l~~~~~~~~lvL~eDGT~V 52 (78)
T PF02017_consen 22 SLEELLEKACDKLQLPEEPVRLVLEEDGTEV 52 (78)
T ss_dssp SHHHHHHHHHHHHT-SSSTCEEEETTTTCBE
T ss_pred CHHHHHHHHHHHhCCCCcCcEEEEeCCCcEE
Confidence 56778888887776 6678999999999753
No 63
>cd02643 R3H_NF-X1 R3H domain of the X1 box binding protein (NF-X1) and related proteins. Human NF-X1 is a transcription factor that regulates the expression of class II major histocompatibility complex (MHC) genes. The Drosophila homolog shuttle craft (STC) has been shown to be a DNA- or RNA-binding protein required for proper axon guidance in the central nervous system and, the yeast homolog FAP1 encodes a dosage suppressor of rapamycin toxicity. The name of the R3H domain comes from the characteristic spacing of the most conserved arginine and histidine residues. The function of the domain is predicted to bind ssDNA or ssRNA in a sequence-specific manner.
Probab=41.74 E-value=16 Score=21.73 Aligned_cols=20 Identities=25% Similarity=0.293 Sum_probs=18.3
Q ss_pred eeeeCccHHHHHHHHHHhhC
Q psy15537 27 LYKYHPRLKHWLQKYIQLLV 46 (80)
Q Consensus 27 vye~~p~f~~~l~~~l~~l~ 46 (80)
.|.++|.|...+++.|++|.
T Consensus 3 ~~~~~~~~~~~vE~~l~~la 22 (74)
T cd02643 3 MAKKDPKFVKDVEKDLIELV 22 (74)
T ss_pred hHHHCHHHHHHHHHHHHHHH
Confidence 57899999999999999987
No 64
>cd00073 H15 linker histone 1 and histone 5 domains; the basic subunit of chromatin is the nucleosome, consisting of an octamer of core histones, two full turns of DNA, a linker histone (H1 or H5) and a variable length of linker DNA; H1/H5 are chromatin-associated proteins that bind to the exterior of nucleosomes and dramatically stabilize the highly condensed states of chromatin fibers; stabilization of higher order folding occurs through electrostatic neutralization of the linker DNA segments, through a highly positively charged carboxy- terminal domain known as the AKP helix (Ala, Lys, Pro); thought to be involved in specific protein-protein and protein-DNA interactions and play a role in suppressing core histone tail domain acetylation in the chromatin fiber
Probab=41.70 E-value=22 Score=21.61 Aligned_cols=17 Identities=24% Similarity=0.348 Sum_probs=15.2
Q ss_pred eCccHHHHHHHHHHhhC
Q psy15537 30 YHPRLKHWLQKYIQLLV 46 (80)
Q Consensus 30 ~~p~f~~~l~~~l~~l~ 46 (80)
.||.|.+++.++|.+|-
T Consensus 3 ~hP~y~~MI~eAI~~l~ 19 (88)
T cd00073 3 SHPPYSEMVTEAIKALK 19 (88)
T ss_pred CCCCHHHHHHHHHHHcC
Confidence 48999999999999885
No 65
>PRK05659 sulfur carrier protein ThiS; Validated
Probab=41.13 E-value=8.9 Score=21.66 Aligned_cols=48 Identities=17% Similarity=0.243 Sum_probs=30.0
Q ss_pred hhhhhhhhhhcCC--ceEEeEeCceeeeCccHHHHHHHHHHhhCCCCceEEEEeCCc
Q psy15537 5 QGTAVLVRRIDRD--DITIAVDGSLYKYHPRLKHWLQKYIQLLVPNKTFRLLLAEDG 59 (80)
Q Consensus 5 a~iaai~~~~~~~--~~~IavDGsvye~~p~f~~~l~~~l~~l~~~~~v~~~~a~DG 59 (80)
..+..++..++.+ ++.|++||.++.+.-+-... |-++.+|++...-.|
T Consensus 16 ~tl~~lL~~l~~~~~~vav~vNg~iv~r~~~~~~~-------l~~gD~vei~~~vgG 65 (66)
T PRK05659 16 ESVAALLAREGLAGRRVAVEVNGEIVPRSQHASTA-------LREGDVVEIVHALGG 65 (66)
T ss_pred CCHHHHHHhcCCCCCeEEEEECCeEeCHHHcCccc-------CCCCCEEEEEEEecC
Confidence 4677788877753 68899999998874322221 114466766655444
No 66
>cd06535 CIDE_N_CAD CIDE_N domain of CAD nuclease. The CIDE_N (cell death-inducing DFF45-like effector, N-terminal) domain is found at the N-terminus of CAD nuclease (caspase-activated DNase/DNA fragmentation factor, DFF40) and its inhibitor, ICAD(DFF45). These proteins are associated with the chromatin condensation and DNA fragmentation events of apoptosis; the CIDE_N domain is thought to regulate the activity of CAD/DFF40 and ICAD/DFF45 during apoptosis. In normal cells, DFF exists in the nucleus as a heterodimer composed of CAD/DFF40 as a latent nuclease and its chaperone and inhibitor subunit ICAD/DFF45. Apoptotic activation of caspase-3 results in the cleavage of DFF45/ICAD and the release of active DFF40/CAD nuclease.
Probab=41.01 E-value=58 Score=20.02 Aligned_cols=30 Identities=43% Similarity=0.481 Sum_probs=22.9
Q ss_pred cHHHHHHHHHHhhC-CCCceEEEEeCCcchH
Q psy15537 33 RLKHWLQKYIQLLV-PNKTFRLLLAEDGSGK 62 (80)
Q Consensus 33 ~f~~~l~~~l~~l~-~~~~v~~~~a~DGSg~ 62 (80)
.+++.+.++-+.+. +...+++.+.+||..+
T Consensus 22 sL~eL~~K~~~~l~l~~~~~~l~L~eDGTeV 52 (77)
T cd06535 22 NLKELLRKGCRLLQLPCAGSRLCLYEDGTEV 52 (77)
T ss_pred CHHHHHHHHHHHhCCCCCCcEEEEecCCcEe
Confidence 56777777776665 5567899999999765
No 67
>PLN02669 xylulokinase
Probab=40.74 E-value=1e+02 Score=24.77 Aligned_cols=47 Identities=17% Similarity=0.186 Sum_probs=33.1
Q ss_pred EEeEeCceeeeCccHHHHHHHHHHhhCCCCceEEEEeCCcchHHHHHHHHHHhH
Q psy15537 20 TIAVDGSLYKYHPRLKHWLQKYIQLLVPNKTFRLLLAEDGSGKGAGLASAIALK 73 (80)
Q Consensus 20 ~IavDGsvye~~p~f~~~l~~~l~~l~~~~~v~~~~a~DGSg~GAAl~aA~a~~ 73 (80)
.+.+-|. -++|.+.+.+-+.+ +..|......+.+.+|||+.|+.+-.
T Consensus 449 i~~~GGg--s~s~~w~Qi~ADVl-----g~pV~~~~~~ea~alGAA~~A~~~~~ 495 (556)
T PLN02669 449 IIATGGA--SANQSILKLIASIF-----GCDVYTVQRPDSASLGAALRAAHGWL 495 (556)
T ss_pred EEEEcCh--hcCHHHHHHHHHHc-----CCCeEecCCCCchHHHHHHHHHHHHh
Confidence 3444443 38888887777665 34676666778889999999987543
No 68
>cd00741 Lipase Lipase. Lipases are esterases that can hydrolyze long-chain acyl-triglycerides into di- and monoglycerides, glycerol, and free fatty acids at a water/lipid interface. A typical feature of lipases is "interfacial activation", the process of becoming active at the lipid/water interface, although several examples of lipases have been identified that do not undergo interfacial activation . The active site of a lipase contains a catalytic triad consisting of Ser - His - Asp/Glu, but unlike most serine proteases, the active site is buried inside the structure. A "lid" or "flap" covers the active site, making it inaccessible to solvent and substrates. The lid opens during the process of interfacial activation, allowing the lipid substrate access to the active site.
Probab=40.10 E-value=52 Score=21.00 Aligned_cols=38 Identities=18% Similarity=0.335 Sum_probs=23.7
Q ss_pred HHHHHHHHHhhC---CCCceEEEEeCCcchHHHHHHHHHHhHHhh
Q psy15537 35 KHWLQKYIQLLV---PNKTFRLLLAEDGSGKGAGLASAIALKLGA 76 (80)
Q Consensus 35 ~~~l~~~l~~l~---~~~~v~~~~a~DGSg~GAAl~aA~a~~~~~ 76 (80)
.+.+.+.+++.. |+.+|.+. |-++|++++..++.....
T Consensus 11 ~~~i~~~~~~~~~~~p~~~i~v~----GHSlGg~lA~l~a~~~~~ 51 (153)
T cd00741 11 ANLVLPLLKSALAQYPDYKIHVT----GHSLGGALAGLAGLDLRG 51 (153)
T ss_pred HHHHHHHHHHHHHHCCCCeEEEE----EcCHHHHHHHHHHHHHHh
Confidence 334444444444 66677665 788899888776665544
No 69
>TIGR01119 lacB galactose-6-phosphate isomerase, LacB subunit. This family contains four members from low GC gram-positive bacteria. Galactose-6-phosphate isomerase is involved in lactose catabolism by the tagatose-6-phosphate pathway.
Probab=39.94 E-value=48 Score=23.13 Aligned_cols=35 Identities=17% Similarity=0.106 Sum_probs=27.3
Q ss_pred eCccHHHHHHHHHHhhCCCCceEEEEeCCcchHHHHHHH
Q psy15537 30 YHPRLKHWLQKYIQLLVPNKTFRLLLAEDGSGKGAGLAS 68 (80)
Q Consensus 30 ~~p~f~~~l~~~l~~l~~~~~v~~~~a~DGSg~GAAl~a 68 (80)
.||.|...+-+++. +.+..+.+.-.|||+|.+++|
T Consensus 41 dYpd~a~~va~~V~----~g~~~~GIliCGTGiG~siaA 75 (171)
T TIGR01119 41 HYPIFGKKVGEAVV----SGEADLGVCICGTGVGINNAV 75 (171)
T ss_pred ChHHHHHHHHHHHH----cCCCCEEEEEcCCcHHHHHHH
Confidence 58888888877773 335667888899999999876
No 70
>cd01615 CIDE_N CIDE_N domain, found at the N-terminus of the CIDE (cell death-inducing DFF45-like effector) proteins, as well as CAD nuclease (caspase-activated DNase/DNA fragmentation factor, DFF40) and its inhibitor, ICAD(DFF45). These proteins are associated with the chromatin condensation and DNA fragmentation events of apoptosis; the CIDE_N domain is thought to regulate the activity of ICAD/DFF45, and the CAD/DFF40 and CIDE nucleases during apoptosis. The CIDE-N domain is also found in the FSP27/CIDE-C protein.
Probab=39.69 E-value=64 Score=19.80 Aligned_cols=30 Identities=33% Similarity=0.473 Sum_probs=23.0
Q ss_pred cHHHHHHHHHHhhC-CCCceEEEEeCCcchH
Q psy15537 33 RLKHWLQKYIQLLV-PNKTFRLLLAEDGSGK 62 (80)
Q Consensus 33 ~f~~~l~~~l~~l~-~~~~v~~~~a~DGSg~ 62 (80)
.+.+.+.++-+.+. |...+++.+.+||.-+
T Consensus 22 sL~eL~~K~~~~l~l~~~~~~lvL~eDGTeV 52 (78)
T cd01615 22 SLEELLSKACEKLKLPSAPVTLVLEEDGTEV 52 (78)
T ss_pred CHHHHHHHHHHHcCCCCCCeEEEEeCCCcEE
Confidence 56777777776665 6678999999999754
No 71
>PTZ00215 ribose 5-phosphate isomerase; Provisional
Probab=39.62 E-value=48 Score=22.62 Aligned_cols=36 Identities=17% Similarity=0.127 Sum_probs=26.5
Q ss_pred eCccHHHHHHHHHHhhCCCCceEEEEeCCcchHHHHHHHH
Q psy15537 30 YHPRLKHWLQKYIQLLVPNKTFRLLLAEDGSGKGAGLASA 69 (80)
Q Consensus 30 ~~p~f~~~l~~~l~~l~~~~~v~~~~a~DGSg~GAAl~aA 69 (80)
.||.|...+-+.+. +.+..+.+.-+|||+|.+++|-
T Consensus 45 dYp~~a~~va~~V~----~~~~~~GIliCGtGiG~siaAN 80 (151)
T PTZ00215 45 DYPDFAEKVCEEVL----KGEADTGILVCGSGIGISIAAN 80 (151)
T ss_pred CHHHHHHHHHHHHh----cCCCcEEEEEcCCcHHHHHHHh
Confidence 57777777766663 3356678888999999998763
No 72
>PLN02423 phosphomannomutase
Probab=39.60 E-value=26 Score=24.82 Aligned_cols=37 Identities=14% Similarity=0.165 Sum_probs=27.9
Q ss_pred EeEeCceeeeCccHHHHHHHHHHhhCCCCceEEEEeCCc
Q psy15537 21 IAVDGSLYKYHPRLKHWLQKYIQLLVPNKTFRLLLAEDG 59 (80)
Q Consensus 21 IavDGsvye~~p~f~~~l~~~l~~l~~~~~v~~~~a~DG 59 (80)
.=+||++....-.+.+...++++++. +. +.|.++.+-
T Consensus 12 ~D~DGTLl~~~~~i~~~~~~ai~~l~-~~-i~fviaTGR 48 (245)
T PLN02423 12 FDVDGTLTAPRKEATPEMLEFMKELR-KV-VTVGVVGGS 48 (245)
T ss_pred EeccCCCcCCCCcCCHHHHHHHHHHH-hC-CEEEEECCc
Confidence 45899999766667788888998886 33 777777644
No 73
>COG0603 Predicted PP-loop superfamily ATPase [General function prediction only]
Probab=39.50 E-value=33 Score=24.96 Aligned_cols=38 Identities=13% Similarity=0.161 Sum_probs=29.3
Q ss_pred hhhhhhcCCceEEeEeCceeeeCccHHHHHHHHHHhhC
Q psy15537 9 VLVRRIDRDDITIAVDGSLYKYHPRLKHWLQKYIQLLV 46 (80)
Q Consensus 9 ai~~~~~~~~~~IavDGsvye~~p~f~~~l~~~l~~l~ 46 (80)
+...+++.+.+.+||+.+=|..||-.+.-.-+++++.+
T Consensus 109 ~~Ae~~g~~~I~~Gv~~~D~sgYPDcrpefi~a~~~~~ 146 (222)
T COG0603 109 AYAEALGADAIIIGVNEEDFSGYPDCRPEFIEALNEAL 146 (222)
T ss_pred HHHHHcCCCeEEEEecccccCCCCCCCHHHHHHHHHHH
Confidence 34566677789999999999999877766666666655
No 74
>PRK09557 fructokinase; Reviewed
Probab=39.40 E-value=1e+02 Score=22.03 Aligned_cols=60 Identities=20% Similarity=0.211 Sum_probs=36.0
Q ss_pred hhhhhhhhhhcCCceEEeEeCceeeeCccHHHHHHHHHHhhC-CC-CceEEEEeC---CcchHHHHHH
Q psy15537 5 QGTAVLVRRIDRDDITIAVDGSLYKYHPRLKHWLQKYIQLLV-PN-KTFRLLLAE---DGSGKGAGLA 67 (80)
Q Consensus 5 a~iaai~~~~~~~~~~IavDGsvye~~p~f~~~l~~~l~~l~-~~-~~v~~~~a~---DGSg~GAAl~ 67 (80)
.+++.++.-++++ .|-+.|++.+. +.|.+.+++.+++.. +. .++.+..+. |..-+|||..
T Consensus 234 ~~l~~l~~~ldP~--~IvlgG~~~~~-~~~~~~l~~~~~~~~~~~~~~~~i~~s~~~~~a~~~GAa~~ 298 (301)
T PRK09557 234 KSLAHVINILDPD--VIVLGGGMSNV-DRLYPTLPALLKQYVFGGECETPVRKALHGDSSGVRGAAWL 298 (301)
T ss_pred HHHHHHHHHhCCC--EEEEcCcccch-HHHHHHHHHHHHHHhcccccCCeEEEcccCCchhhhhhhHh
Confidence 3455555545543 36667888874 667778888888765 22 245555544 4345677654
No 75
>PRK12434 tRNA pseudouridine synthase A; Reviewed
Probab=38.87 E-value=80 Score=22.66 Aligned_cols=30 Identities=23% Similarity=0.360 Sum_probs=23.8
Q ss_pred ceEEeEeCceee---e----CccHHHHHHHHHHhhCC
Q psy15537 18 DITIAVDGSLYK---Y----HPRLKHWLQKYIQLLVP 47 (80)
Q Consensus 18 ~~~IavDGsvye---~----~p~f~~~l~~~l~~l~~ 47 (80)
.++|+.||+-|. + .+.-++.++++|.++.+
T Consensus 5 ~l~i~YdGt~y~G~Q~Q~~~~~TVq~~le~aL~~~~~ 41 (245)
T PRK12434 5 KLTIQYDGSRYKGWQKLGNNDNTIQGKIESVLSEMTG 41 (245)
T ss_pred EEEEEECCCccceEeeCCCCCCCHHHHHHHHHHHHhC
Confidence 478999999864 3 34689999999998874
No 76
>PRK10640 rhaB rhamnulokinase; Provisional
Probab=38.07 E-value=54 Score=25.49 Aligned_cols=46 Identities=11% Similarity=-0.060 Sum_probs=31.3
Q ss_pred EEeEeCceeeeCccHHHHHHHHHHhhCCCCceEEEEeCCcchHHHHHHHHHHh
Q psy15537 20 TIAVDGSLYKYHPRLKHWLQKYIQLLVPNKTFRLLLAEDGSGKGAGLASAIAL 72 (80)
Q Consensus 20 ~IavDGsvye~~p~f~~~l~~~l~~l~~~~~v~~~~a~DGSg~GAAl~aA~a~ 72 (80)
.|-+.|+. -++|.+.+++-+.+ +.+|.... .+++.+|||++|+++.
T Consensus 378 ~i~~~GGg-a~s~~w~Qi~ADvl-----g~pV~~~~-~ea~alGaa~~a~~a~ 423 (471)
T PRK10640 378 QLHIVGGG-CQNALLNQLCADAC-----GIRVIAGP-VEASTLGNIGIQLMTL 423 (471)
T ss_pred eEEEECCh-hhhHHHHHHHHHHh-----CCCeeeCC-hhHHHHHHHHHHHHHc
Confidence 45566664 66777777776666 34564333 4689999999999874
No 77
>TIGR02133 RPI_actino ribose 5-phosphate isomerase. This family is a member of the RpiB/LacA/LacB subfamily (TIGR00689) but lies outside the RpiB equivalog (TIGR01120) which is also a member of that subfamily. Ribose 5-phosphate isomerase is an essential enzyme of the pentose phosphate pathway; a pathway that appears to be present in the actinobacteria. The only candidates for ribose 5-phosphate isomerase in the Actinobacteria are members of this family.
Probab=37.99 E-value=19 Score=24.46 Aligned_cols=35 Identities=20% Similarity=0.193 Sum_probs=26.1
Q ss_pred eCccHHHHHHHHHHhhCCCCceEEEEeCCcchHHHHHHH
Q psy15537 30 YHPRLKHWLQKYIQLLVPNKTFRLLLAEDGSGKGAGLAS 68 (80)
Q Consensus 30 ~~p~f~~~l~~~l~~l~~~~~v~~~~a~DGSg~GAAl~a 68 (80)
.||-|...+-+.+.+ .+..+.+.=.|||+|.+++|
T Consensus 42 dYpd~a~~va~~V~~----~~~~~GIliCGtGiG~siaA 76 (148)
T TIGR02133 42 DYPCFCIAAAEAVAR----DAADLGIVIGGSGNGEAIAA 76 (148)
T ss_pred CchHHHHHHHHHHhc----CCCceEEEEcCCChhheeee
Confidence 578888777777732 34666777899999998865
No 78
>COG4892 Predicted heme/steroid binding protein [General function prediction only]
Probab=37.88 E-value=19 Score=22.27 Aligned_cols=16 Identities=31% Similarity=0.480 Sum_probs=12.5
Q ss_pred ceEEeEeCceeeeCcc
Q psy15537 18 DITIAVDGSLYKYHPR 33 (80)
Q Consensus 18 ~~~IavDGsvye~~p~ 33 (80)
..-||+||.||.-.-.
T Consensus 18 paYiA~~G~VYDvS~s 33 (81)
T COG4892 18 PAYIAVNGTVYDVSLS 33 (81)
T ss_pred CeEEEECCEEEeeccC
Confidence 4779999999976433
No 79
>PF07318 DUF1464: Protein of unknown function (DUF1464); InterPro: IPR009927 This family consists of several hypothetical archaeal proteins of around 350 residues in length. The function of this family is unknown.
Probab=37.84 E-value=43 Score=25.89 Aligned_cols=47 Identities=21% Similarity=0.372 Sum_probs=32.7
Q ss_pred EeEeCceeeeCccHHHHHHHHHHhhCCCCceEE----EEeCCcchHHHHHHHH
Q psy15537 21 IAVDGSLYKYHPRLKHWLQKYIQLLVPNKTFRL----LLAEDGSGKGAGLASA 69 (80)
Q Consensus 21 IavDGsvye~~p~f~~~l~~~l~~l~~~~~v~~----~~a~DGSg~GAAl~aA 69 (80)
|=+-|++. .+|.|.+.+.+.+.++++-....+ ..+++ +..|+|++|-
T Consensus 264 IilSGr~~-~~~~~~~~l~~~l~~~~~~~v~~l~~~~~~aKe-aA~GaAiIA~ 314 (343)
T PF07318_consen 264 IILSGRFS-RIPEFRKKLEDRLEDYFPVKVRKLEGLARKAKE-AAQGAAIIAN 314 (343)
T ss_pred EEEecccc-ccHHHHHHHHHHHHhhcccceeecccccccchh-hhhhHHHHhh
Confidence 66778766 789999999999999886211111 12344 7788998873
No 80
>PF01321 Creatinase_N: Creatinase/Prolidase N-terminal domain; InterPro: IPR000587 Creatinase or creatine amidinohydrolase (3.5.3.3 from EC) catalyses the conversion of creatine and water to sarcosine and urea. The enzyme works as a homodimer, and is induced by choline chloride. Each monomer of creatinase has two clearly defined domains, a small N-terminal domain, and a large C-terminal domain. The structure of the C-terminal region represents the "pita-bread" fold. The fold contains both alpha helices and an anti-parallel beta sheet within two structurally similar domains that are thought to be derived from an ancient gene duplication. The active site, where conserved, is located between the two domains. The fold is common to methionine aminopeptidase (3.4.11.18 from EC), aminopeptidase P (3.4.11.9 from EC), prolidase (3.4.13.9 from EC), agropine synthase and creatinase (3.5.3.3 from EC). Though many of these peptidases require a divalent cation, creatinase is not a metal-dependent enzyme [, , ]. ; GO: 0016787 hydrolase activity; PDB: 1PV9_A 3CTZ_A 3IL0_B 3PN9_A 2HOW_A 1WN1_B 3I7M_A 1CHM_B 3QOC_D 1KP0_B ....
Probab=37.70 E-value=61 Score=19.57 Aligned_cols=22 Identities=14% Similarity=0.249 Sum_probs=16.1
Q ss_pred hhhhhhhhhcCCceEEeEeCce
Q psy15537 6 GTAVLVRRIDRDDITIAVDGSL 27 (80)
Q Consensus 6 ~iaai~~~~~~~~~~IavDGsv 27 (80)
.++.++++.+.+.-+||+|.+.
T Consensus 80 ~~~~~l~~~~~~~~~igve~~~ 101 (132)
T PF01321_consen 80 AIAEALKKLGPEGKRIGVEPDS 101 (132)
T ss_dssp HHHHHHHHHTTTTSEEEEETTT
T ss_pred hHHHHHHHhCCCCCEEEEcCCc
Confidence 4566777776666788999876
No 81
>smart00266 CAD Domains present in proteins implicated in post-mortem DNA fragmentation.
Probab=37.21 E-value=74 Score=19.37 Aligned_cols=30 Identities=33% Similarity=0.475 Sum_probs=22.0
Q ss_pred cHHHHHHHHHHhhC-CCCceEEEEeCCcchH
Q psy15537 33 RLKHWLQKYIQLLV-PNKTFRLLLAEDGSGK 62 (80)
Q Consensus 33 ~f~~~l~~~l~~l~-~~~~v~~~~a~DGSg~ 62 (80)
.+.+.+.++-+.+. |...+++.+.+||..+
T Consensus 20 sL~eL~~K~~~~l~l~~~~~~l~L~eDGT~V 50 (74)
T smart00266 20 SLEELLSKVCDKLALPDSPVTLVLEEDGTIV 50 (74)
T ss_pred CHHHHHHHHHHHhCCCCCCcEEEEecCCcEE
Confidence 56677777766655 5568999999999754
No 82
>COG0698 RpiB Ribose 5-phosphate isomerase RpiB [Carbohydrate transport and metabolism]
Probab=37.10 E-value=36 Score=23.40 Aligned_cols=36 Identities=17% Similarity=0.123 Sum_probs=27.3
Q ss_pred eeCccHHHHHHHHHHhhCCCCceEEEEeCCcchHHHHHHH
Q psy15537 29 KYHPRLKHWLQKYIQLLVPNKTFRLLLAEDGSGKGAGLAS 68 (80)
Q Consensus 29 e~~p~f~~~l~~~l~~l~~~~~v~~~~a~DGSg~GAAl~a 68 (80)
..||.|...+-+.+.. .+..+.+.-+|+|+|..++|
T Consensus 41 ~dyp~~a~~va~~v~~----~~~d~GIliCGTGiG~~iaA 76 (151)
T COG0698 41 VDYPDYAKKVAEAVLN----GEADLGILICGTGIGMSIAA 76 (151)
T ss_pred cchHHHHHHHHHHHHc----CCCCeeEEEecCChhHHHHh
Confidence 4688888777766632 26777888899999998876
No 83
>PF06519 TolA: TolA C-terminal; InterPro: IPR014161 TolA couples the inner membrane complex of itself with TolQ and TolR to the outer membrane complex of TolB and OprL (also called Pal). Most of the length of the protein consists of low-complexity sequence that may differ in both length and composition from one species to another, complicating efforts to discriminate TolA (the most divergent gene in the tol-pal system) from paralogs such as TonB. Selection of members of the seed alignment and criteria for setting scoring cut-offs are based largely on conserved operon structure. The Tol-Pal complex is required for maintaining outer membrane integrity, and is also involved in transport (uptake) of colicins and filamentous DNA, and implicated in pathogenesis. Transport is energized by the proton motive force. TolA is an inner membrane protein that interacts with periplasmic TolB and with outer membrane porins OmpC, PhoE and LamB.; GO: 0005215 transporter activity, 0006810 transport, 0016020 membrane; PDB: 2X9A_D 3QDP_A 3QDR_A 1TOL_A 1S62_A.
Probab=36.99 E-value=56 Score=20.47 Aligned_cols=28 Identities=29% Similarity=0.275 Sum_probs=21.5
Q ss_pred cHHHHHHHHHHhhCC--------CCceEEEEeCCcc
Q psy15537 33 RLKHWLQKYIQLLVP--------NKTFRLLLAEDGS 60 (80)
Q Consensus 33 ~f~~~l~~~l~~l~~--------~~~v~~~~a~DGS 60 (80)
.|...++++|.+=+. .|.+++.++.||.
T Consensus 15 ~Y~~~I~~~Iq~~l~~~~~y~GK~C~v~i~l~~dG~ 50 (96)
T PF06519_consen 15 RYAAQIKQAIQRNLYDDESYKGKECRVRIRLAPDGL 50 (96)
T ss_dssp HHHHHHHHHHHTTTTTGGGGTT--EEEEEEEETTSE
T ss_pred HHHHHHHHHHHHhcCCccccCCCEEEEEEEECCCCc
Confidence 577788888877552 3789999999994
No 84
>TIGR01487 SPP-like sucrose-phosphate phosphatase-like hydrolase, Archaeal. TIGR01482, in turn, is a member of the IIB subfamily (TIGR01484) of the Haloacid Dehalogenase (HAD) superfamily of aspartate-nucleophile hydrolases.
Probab=36.28 E-value=31 Score=23.32 Aligned_cols=36 Identities=19% Similarity=0.215 Sum_probs=25.0
Q ss_pred EeEeCceeeeCccHHHHHHHHHHhhCCCCceEEEEeC
Q psy15537 21 IAVDGSLYKYHPRLKHWLQKYIQLLVPNKTFRLLLAE 57 (80)
Q Consensus 21 IavDGsvye~~p~f~~~l~~~l~~l~~~~~v~~~~a~ 57 (80)
.=+||+++..+..+.+...++++++- +..+.+.++.
T Consensus 6 ~DlDGTLl~~~~~i~~~~~~~i~~l~-~~g~~~~~~T 41 (215)
T TIGR01487 6 IDIDGTLTEPNRMISERAIEAIRKAE-KKGIPVSLVT 41 (215)
T ss_pred EecCCCcCCCCcccCHHHHHHHHHHH-HCCCEEEEEc
Confidence 34799999877778888888898885 2234444443
No 85
>PF00535 Glycos_transf_2: Glycosyl transferase family 2; InterPro: IPR001173 The biosynthesis of disaccharides, oligosaccharides and polysaccharides involves the action of hundreds of different glycosyltransferases. These enzymes catalyse the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. A classification of glycosyltransferases using nucleotide diphospho-sugar, nucleotide monophospho-sugar and sugar phosphates (2.4.1.- from EC) and related proteins into distinct sequence based families has been described []. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. The same three-dimensional fold is expected to occur within each of the families. Because 3-D structures are better conserved than sequences, several of the families defined on the basis of sequence similarities may have similar 3-D structures and therefore form 'clans'. This domain is found in a diverse family of glycosyl transferases that transfer the sugar from UDP-glucose, UDP-N-acetyl-galactosamine, GDP-mannose or CDP-abequose, to a range of substrates including cellulose, dolichol phosphate and teichoic acids.; PDB: 2Z87_A 2Z86_B 2D7R_A 2D7I_A 3CKN_A 3CKQ_A 3CKJ_A 3CKV_A 3CKO_A 2FFU_A ....
Probab=36.07 E-value=98 Score=18.53 Aligned_cols=59 Identities=20% Similarity=0.312 Sum_probs=37.3
Q ss_pred hhhhhhhhcC-CceEEeEeCceeeeCccHHHHHHHHHHhhCCCCceEEEEeCCcchHHHHHHHHHHh
Q psy15537 7 TAVLVRRIDR-DDITIAVDGSLYKYHPRLKHWLQKYIQLLVPNKTFRLLLAEDGSGKGAGLASAIAL 72 (80)
Q Consensus 7 iaai~~~~~~-~~~~IavDGsvye~~p~f~~~l~~~l~~l~~~~~v~~~~a~DGSg~GAAl~aA~a~ 72 (80)
+..+.++... .++.|.-||| .+.-.+.+++..+ ...++++...+...|.+.++..++..
T Consensus 17 l~sl~~q~~~~~eiivvdd~s----~d~~~~~~~~~~~---~~~~i~~i~~~~n~g~~~~~n~~~~~ 76 (169)
T PF00535_consen 17 LESLLKQTDPDFEIIVVDDGS----TDETEEILEEYAE---SDPNIRYIRNPENLGFSAARNRGIKH 76 (169)
T ss_dssp HHHHHHHSGCEEEEEEEECS-----SSSHHHHHHHHHC---CSTTEEEEEHCCCSHHHHHHHHHHHH
T ss_pred HHHHhhccCCCEEEEEecccc----ccccccccccccc---cccccccccccccccccccccccccc
Confidence 4445555222 2577888888 2334555555443 35688999888888999998877643
No 86
>TIGR01548 HAD-SF-IA-hyp1 haloacid dehalogenase superfamily, subfamily IA hydrolase, TIGR01548. All but the Halobacterium sequence currently found are annotated as "Imidazoleglycerol-phosphate dehydratase", however, the source of the annotation could not be traced and significant homology could not be found between any of these sequences and known IGPD's.
Probab=35.57 E-value=34 Score=22.82 Aligned_cols=25 Identities=8% Similarity=0.159 Sum_probs=22.6
Q ss_pred eEeCceeeeCccHHHHHHHHHHhhC
Q psy15537 22 AVDGSLYKYHPRLKHWLQKYIQLLV 46 (80)
Q Consensus 22 avDGsvye~~p~f~~~l~~~l~~l~ 46 (80)
=+||+++...|.+.+.+++++++.-
T Consensus 6 D~DGTLiDs~~~~~~a~~~~~~~~g 30 (197)
T TIGR01548 6 DMDGVMADVSQSYRRAIIDTVEHFG 30 (197)
T ss_pred ecCceEEechHHHHHHHHHHHHHHc
Confidence 4799999999999999999999874
No 87
>TIGR01966 RNasePH ribonuclease PH. This bacterial enzyme, ribonuclease PH, performs the final 3'-trimming and modification of tRNA precursors. This model is restricted absolutely to bacteria. Related families outside the model include proteins described as probable exosome complex exonucleases (rRNA processing) and polyribonucleotide nucleotidyltransferases (mRNA degradation). The most divergent member within the family is RNase PH from Deinococcus radiodurans.
Probab=35.48 E-value=1.5e+02 Score=20.93 Aligned_cols=39 Identities=15% Similarity=0.301 Sum_probs=26.1
Q ss_pred HHHHHHHHHHhhCC-----CCc---eEEEEeCCcchHHHHHHHHHHh
Q psy15537 34 LKHWLQKYIQLLVP-----NKT---FRLLLAEDGSGKGAGLASAIAL 72 (80)
Q Consensus 34 f~~~l~~~l~~l~~-----~~~---v~~~~a~DGSg~GAAl~aA~a~ 72 (80)
+...+.++++.+++ ... ....+..||+-.-||+.||.+.
T Consensus 89 ~~~~i~r~lr~~i~l~~l~~~~i~I~v~VL~~DG~~~~aai~Aa~aA 135 (236)
T TIGR01966 89 IQRLIGRALRAVVDLEALGERTIWIDCDVIQADGGTRTASITGAFVA 135 (236)
T ss_pred HHHHHHHHHHHhcCHhhcCCeEEEEEEEEEeCCCCHHHHHHHHHHHH
Confidence 44566666666553 331 2336889999999999887754
No 88
>KOG1069|consensus
Probab=35.28 E-value=56 Score=23.81 Aligned_cols=50 Identities=20% Similarity=0.331 Sum_probs=33.2
Q ss_pred eEEeEeCceeeeCccHHHHHHHHHHhhC-----CC--CceEEE-EeCCcchHHHHHHHHHH
Q psy15537 19 ITIAVDGSLYKYHPRLKHWLQKYIQLLV-----PN--KTFRLL-LAEDGSGKGAGLASAIA 71 (80)
Q Consensus 19 ~~IavDGsvye~~p~f~~~l~~~l~~l~-----~~--~~v~~~-~a~DGSg~GAAl~aA~a 71 (80)
-.+++.|.+++ .|...++.++..+. |. ..|.+. +.+|||-.-+|+-||+.
T Consensus 59 p~~G~~~~~eK---~~e~iI~~tl~~~I~l~l~Prt~iqVsiqvv~ddgs~LacaINaAcl 116 (217)
T KOG1069|consen 59 PKSGVNGTVEK---VLERIIRKTLSKAIILELYPRTTIQVSIQVVEDDGSTLACAINAACL 116 (217)
T ss_pred cccCcchHHHH---HHHHHHHHHHHHhheeeecCCceEEEEEEEEecCCcchHHHHHHHHH
Confidence 45788999883 57777777776654 54 356665 55677766666666653
No 89
>PRK14494 putative molybdopterin-guanine dinucleotide biosynthesis protein MobB/FeS domain-containing protein protein; Provisional
Probab=34.82 E-value=90 Score=22.47 Aligned_cols=38 Identities=16% Similarity=0.434 Sum_probs=31.5
Q ss_pred ceEEeEeCceeeeCccHHHHHHHHHHhhCC-------CCceEEEE
Q psy15537 18 DITIAVDGSLYKYHPRLKHWLQKYIQLLVP-------NKTFRLLL 55 (80)
Q Consensus 18 ~~~IavDGsvye~~p~f~~~l~~~l~~l~~-------~~~v~~~~ 55 (80)
++.+-|||-....+|+-++.+.++++-++. ..+|++.+
T Consensus 183 ~~~~~v~g~~i~ln~fv~~~~~~~i~gmi~~lkg~~~~~~i~i~i 227 (229)
T PRK14494 183 RVKLIVDGKIIPLNPFVQEIVENTIKGMLSALKGYENPKKIEIII 227 (229)
T ss_pred eeEEEECCeeecCCHHHHHHHHHHHHHHHHHhcCcCCCceEEEEe
Confidence 488999999999999999999999988872 24666654
No 90
>PTZ00174 phosphomannomutase; Provisional
Probab=34.40 E-value=32 Score=24.17 Aligned_cols=41 Identities=20% Similarity=0.311 Sum_probs=28.9
Q ss_pred eEEeEeCceeeeCccHHHHHHHHHHhhCCCCceEEEEeCCcc
Q psy15537 19 ITIAVDGSLYKYHPRLKHWLQKYIQLLVPNKTFRLLLAEDGS 60 (80)
Q Consensus 19 ~~IavDGsvye~~p~f~~~l~~~l~~l~~~~~v~~~~a~DGS 60 (80)
+..=+||+++...-.+.+...++++++. +..++|.++.+-+
T Consensus 8 ia~DlDGTLL~~~~~is~~~~~ai~~l~-~~Gi~~viaTGR~ 48 (247)
T PTZ00174 8 LLFDVDGTLTKPRNPITQEMKDTLAKLK-SKGFKIGVVGGSD 48 (247)
T ss_pred EEEECcCCCcCCCCCCCHHHHHHHHHHH-HCCCEEEEEcCCC
Confidence 3345899999887778888888888875 3346666665443
No 91
>PF02502 LacAB_rpiB: Ribose/Galactose Isomerase; InterPro: IPR003500 This entry represents the sugar isomerase enzymes ribose 5-phosphate isomerase B (rpiB), galactose isomerase subunit A (LacA) and galactose isomerase subunit B (LacB). Galactose-6-phosphate isomerase (5.3.1.26 from EC) is a heteromultimeric protein consisting of subunits LacA and LacB, and catalyses the conversion of D-galactose 6-phosphate to D-tagatose and 6-phosphate in the tagatose 6-phosphate pathway of lactose catabolism []. Galactose-6-phosphate isomerase is induced by galactose or lactose. This entry represents the LacB subunit. Ribose 5-phosphate isomerase (5.3.1.6 from EC) forms a homodimer and catalyses the interconversion of D-ribose 5-phosphate and D-ribulose 5-phosphate in the non-oxidative branch of the pentose phosphate pathway. This reaction permits the synthesis of ribose from other sugars, as well as the recycling of sugars from nucleotide breakdown. Two unrelated enzymes can catalyse this reaction: RpiA (found in most organisms) and RpiB (found in some bacteria and eukaryotes). RpiB is also involved in metabolism of the rare sugar, allose, in addition to ribose sugars. The structures of RpiA and RpiB are distinct, RpiB having a Rossmann-type alpha/beta/alpha sandwich topology [].; GO: 0005975 carbohydrate metabolic process; PDB: 3HEE_A 3HE8_A 3PH3_B 3PH4_B 3ONO_A 4EM8_B 3S5P_B 1O1X_A 2BES_D 2VVP_D ....
Probab=33.64 E-value=48 Score=22.17 Aligned_cols=46 Identities=11% Similarity=0.032 Sum_probs=28.2
Q ss_pred EEeEeCceeeeCccHHHHHHHHHHhhCCCCceEEEEeCCcchHHHHHHHH
Q psy15537 20 TIAVDGSLYKYHPRLKHWLQKYIQLLVPNKTFRLLLAEDGSGKGAGLASA 69 (80)
Q Consensus 20 ~IavDGsvye~~p~f~~~l~~~l~~l~~~~~v~~~~a~DGSg~GAAl~aA 69 (80)
.++.+..-=..||.|...+-+.+.+ .+..+.+.-.|+|+|.+++|-
T Consensus 30 D~G~~~~~~~dy~~~a~~va~~V~~----~~~d~GIliCgtGiG~~iaAN 75 (140)
T PF02502_consen 30 DFGTYSEDSVDYPDFAEKVAEAVAS----GEADRGILICGTGIGMSIAAN 75 (140)
T ss_dssp EESESSTST--HHHHHHHHHHHHHT----TSSSEEEEEESSSHHHHHHHH
T ss_pred EeCCCCCCCCCHHHHHHHHHHHHHc----ccCCeEEEEcCCChhhhhHhh
Confidence 3454443333566666666665542 345567778899999998763
No 92
>cd02570 PseudoU_synth_EcTruA PseudoU_synth_EcTruA: Pseudouridine synthase, Escherichia coli TruA like. This group consists of eukaryotic and bacterial pseudouridine synthases similar to E. coli TruA, Pseudomonas aeruginosa truA and human pseudouridine synthase-like 1 (PUSL1). Pseudouridine synthases catalyze the isomerization of specific uridines in an RNA molecule to pseudouridines (5-ribosyluracil, psi). No cofactors are required. E. coli TruA makes psi38/39 and/or 40 in tRNA. psi38 and psi39 in tRNAs are highly phylogenetically conserved. P. aeruginosa truA is required for induction of type III secretory genes and may act through modifying tRNAs critical for the expression of type III genes or their regulators.
Probab=33.07 E-value=1e+02 Score=21.82 Aligned_cols=29 Identities=21% Similarity=0.298 Sum_probs=22.1
Q ss_pred eEEeEeCceeee---C---ccHHHHHHHHHHhhCC
Q psy15537 19 ITIAVDGSLYKY---H---PRLKHWLQKYIQLLVP 47 (80)
Q Consensus 19 ~~IavDGsvye~---~---p~f~~~l~~~l~~l~~ 47 (80)
++|+.||+-|.- . +.-+..++++|..+..
T Consensus 1 l~i~Y~Gt~f~G~q~q~~~~TVq~~le~aL~~~~~ 35 (239)
T cd02570 1 LTIEYDGTNFSGWQRQPNGRTVQGELEKALSKIAG 35 (239)
T ss_pred CEEEEeCCccceeeeCCCCCCHHHHHHHHHHHHhC
Confidence 368999998752 2 4688889999988863
No 93
>PRK00173 rph ribonuclease PH; Reviewed
Probab=32.80 E-value=1.7e+02 Score=20.67 Aligned_cols=40 Identities=15% Similarity=0.264 Sum_probs=26.9
Q ss_pred HHHHHHHHHHhhCC-----CCc---eEEEEeCCcchHHHHHHHHHHhH
Q psy15537 34 LKHWLQKYIQLLVP-----NKT---FRLLLAEDGSGKGAGLASAIALK 73 (80)
Q Consensus 34 f~~~l~~~l~~l~~-----~~~---v~~~~a~DGSg~GAAl~aA~a~~ 73 (80)
+...+.++++.+++ ... ....+..||+-.-||+.||.+.-
T Consensus 90 ~sr~i~r~lr~~i~l~~l~~~~i~v~v~VL~~DG~~~~aai~Aa~~AL 137 (238)
T PRK00173 90 IQRLIGRSLRAVVDLKALGERTITIDCDVIQADGGTRTASITGAYVAL 137 (238)
T ss_pred HHHHHHHHHHHhcCHHHcCCeEEEEEEEEEeCCCCHHHHHHHHHHHHH
Confidence 55567777776664 211 22238999999999998887643
No 94
>cd01989 STK_N The N-terminal domain of Eukaryotic Serine Threonine kinases. The Serine Threonine kinases are enzymes that belong to a very extensive family of proteins which share a conserved catalytic core common with both serine/threonine and tyrosine protein kinases. The N-terminal domain is homologous to the USP family which has a ATP binding fold. The N-terminal domain is predicted to be involved in ATP binding.
Probab=32.53 E-value=1.2e+02 Score=18.66 Aligned_cols=35 Identities=23% Similarity=0.260 Sum_probs=22.5
Q ss_pred eEEeEeCceeeeCccHHHHHHHHHHhhC-CCCceEEEEeCC
Q psy15537 19 ITIAVDGSLYKYHPRLKHWLQKYIQLLV-PNKTFRLLLAED 58 (80)
Q Consensus 19 ~~IavDGsvye~~p~f~~~l~~~l~~l~-~~~~v~~~~a~D 58 (80)
+.|++|||- .=...++.+++... .+.++++....+
T Consensus 2 ILVavD~S~-----~s~~al~~a~~~a~~~~~~l~ll~v~~ 37 (146)
T cd01989 2 VAVAVDKDK-----KSKNALKWALDNLATKGQTIVLVHVHP 37 (146)
T ss_pred EEEEecCcc-----ccHHHHHHHHHhccCCCCcEEEEEecc
Confidence 569999984 44566676764433 356777766544
No 95
>cd06539 CIDE_N_A CIDE_N domain of CIDE-A proteins. The CIDE_N (cell death-inducing DFF45-like effector, N-terminal) domain is found at the N-terminus of the CIDE (cell death-inducing DFF45-like effector) proteins. These proteins are associated with the chromatin condensation and DNA fragmentation events of apoptosis; the CIDE_N domain is thought to regulate the activity of the CAD/DFF40, ICAD/DFF45, and CIDE nucleases during apoptosis. The CIDE protein family includes 3 members: CIDE-A, CIDE-B, and FSP27(CIDE-C). Based on sequence similarity with DFF40 and DFF45, the CIDE proteins were initially characterized as mitochondrial activators of apoptosis. However, strong metabolic phenotypes of mice lacking CIDE-A and CIDE-B indicated that this family may play critical roles in energy balance.
Probab=32.09 E-value=99 Score=19.04 Aligned_cols=30 Identities=30% Similarity=0.433 Sum_probs=22.5
Q ss_pred cHHHHHHHHHHhhC-CCCceEEEEeCCcchH
Q psy15537 33 RLKHWLQKYIQLLV-PNKTFRLLLAEDGSGK 62 (80)
Q Consensus 33 ~f~~~l~~~l~~l~-~~~~v~~~~a~DGSg~ 62 (80)
.+.+.+.+.-+.+. +...+++.+.+||..+
T Consensus 22 sL~EL~~K~~~~l~~~~~~~~lvL~eDGT~V 52 (78)
T cd06539 22 SLQELISKTLDALVITSGLVTLVLEEDGTVV 52 (78)
T ss_pred CHHHHHHHHHHHhCCCCCCcEEEEeCCCCEE
Confidence 56677777776665 5668999999999754
No 96
>cd04195 GT2_AmsE_like GT2_AmsE_like is involved in exopolysaccharide amylovora biosynthesis. AmsE is a glycosyltransferase involved in exopolysaccharide amylovora biosynthesis in Erwinia amylovora. Amylovara is one of the three exopolysaccharide produced by E. amylovora. Amylovara-deficient mutants are non-pathogenic. It is a subfamily of Glycosyltransferase Family GT2, which includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds.
Probab=31.79 E-value=1.4e+02 Score=19.10 Aligned_cols=59 Identities=15% Similarity=0.360 Sum_probs=34.5
Q ss_pred hhhhhhhhhcC-CceEEeEeCceeeeCccHHHHHHHHHHhhCCCCceEEEEeCCcchHHHHHHHHHH
Q psy15537 6 GTAVLVRRIDR-DDITIAVDGSLYKYHPRLKHWLQKYIQLLVPNKTFRLLLAEDGSGKGAGLASAIA 71 (80)
Q Consensus 6 ~iaai~~~~~~-~~~~IavDGsvye~~p~f~~~l~~~l~~l~~~~~v~~~~a~DGSg~GAAl~aA~a 71 (80)
++-.++.+... .++.|-.|||- .+...+.+++.. .+.++.+...+...|.+.|.-.++.
T Consensus 18 ~l~Sl~~q~~~~~eiiivdd~ss---~d~t~~~~~~~~----~~~~i~~i~~~~n~G~~~a~N~g~~ 77 (201)
T cd04195 18 ALESILKQTLPPDEVVLVKDGPV---TQSLNEVLEEFK----RKLPLKVVPLEKNRGLGKALNEGLK 77 (201)
T ss_pred HHHHHHhcCCCCcEEEEEECCCC---chhHHHHHHHHH----hcCCeEEEEcCccccHHHHHHHHHH
Confidence 34445554433 35777789973 233333343332 2234777777778899988876664
No 97
>TIGR02261 benz_CoA_red_D benzoyl-CoA reductase, bcr type, subunit D. This model describes the D subunit of benzoyl-CoA reductase, a 4-subunit enzyme. Many aromatic compounds are metabolized by way of benzoyl-CoA. This family shows sequence similarity to the A subunit (TIGR02259) and to the 2-hydroxyglutaryl-CoA dehydratase alpha chain.
Probab=31.57 E-value=70 Score=23.65 Aligned_cols=35 Identities=29% Similarity=0.301 Sum_probs=22.3
Q ss_pred hhhhhhhcCCceEEeEeCceeeeCccHHHHHHHHHH
Q psy15537 8 AVLVRRIDRDDITIAVDGSLYKYHPRLKHWLQKYIQ 43 (80)
Q Consensus 8 aai~~~~~~~~~~IavDGsvye~~p~f~~~l~~~l~ 43 (80)
+.++++++..+-.|..-|++ -++|.+.+.+++.+.
T Consensus 202 ~~~~~~~~~~~~~v~~~GGv-a~n~~~~~~le~~l~ 236 (262)
T TIGR02261 202 AKLLKSLGALDGTVLCTGGL-ALDAGLLEALKDAIQ 236 (262)
T ss_pred HHHHhccCCCCCcEEEECcc-cccHHHHHHHHHHhc
Confidence 34455554334468888984 466788777777663
No 98
>TIGR01482 SPP-subfamily Sucrose-phosphate phosphatase subfamily. catalyze the same reaction as SPP.
Probab=31.40 E-value=40 Score=22.64 Aligned_cols=33 Identities=15% Similarity=0.186 Sum_probs=20.7
Q ss_pred EeCceeeeCccHHHHHHHHHHhhCCCCceEEEEe
Q psy15537 23 VDGSLYKYHPRLKHWLQKYIQLLVPNKTFRLLLA 56 (80)
Q Consensus 23 vDGsvye~~p~f~~~l~~~l~~l~~~~~v~~~~a 56 (80)
+||+++.....+.+...++++++- +..+.+.++
T Consensus 5 lDGTLl~~~~~i~~~~~~al~~l~-~~Gi~~~~a 37 (225)
T TIGR01482 5 IDGTLTDPNRAINESALEAIRKAE-SVGIPVVLV 37 (225)
T ss_pred ccCccCCCCcccCHHHHHHHHHHH-HCCCEEEEE
Confidence 699999776666666667776653 223444444
No 99
>PRK00021 truA tRNA pseudouridine synthase A; Validated
Probab=31.09 E-value=1.2e+02 Score=21.54 Aligned_cols=30 Identities=23% Similarity=0.328 Sum_probs=23.0
Q ss_pred ceEEeEeCceee---eC---ccHHHHHHHHHHhhCC
Q psy15537 18 DITIAVDGSLYK---YH---PRLKHWLQKYIQLLVP 47 (80)
Q Consensus 18 ~~~IavDGsvye---~~---p~f~~~l~~~l~~l~~ 47 (80)
.++|+.||+-|. +. +.-++.++++|..+..
T Consensus 5 ~l~i~YdGt~y~G~q~q~~~~TVq~~le~aL~~~~~ 40 (244)
T PRK00021 5 ALTIEYDGTNFHGWQRQPNGRTVQGELEKALSKLAG 40 (244)
T ss_pred EEEEEECCCccceeeeCCCCCCHHHHHHHHHHHHhC
Confidence 478999999874 22 4688888999888863
No 100
>PHA00649 hypothetical protein
Probab=30.88 E-value=44 Score=20.59 Aligned_cols=15 Identities=33% Similarity=0.366 Sum_probs=12.2
Q ss_pred chHHHHHHHHHHhHH
Q psy15537 60 SGKGAGLASAIALKL 74 (80)
Q Consensus 60 Sg~GAAl~aA~a~~~ 74 (80)
||+|||+++......
T Consensus 66 ~~iGAA~W~G~~s~L 80 (83)
T PHA00649 66 VALGAAVWAGITSKL 80 (83)
T ss_pred HHHHHHHHHHHHHHh
Confidence 899999999876654
No 101
>PRK03011 butyrate kinase; Provisional
Probab=30.38 E-value=2.5e+02 Score=21.45 Aligned_cols=57 Identities=16% Similarity=0.176 Sum_probs=41.2
Q ss_pred hhhhhhhhhhhcCCceEEeEeCceeeeCccHHHHHHHHHHhhCCCCceEEEEeCCcchHHH
Q psy15537 4 AQGTAVLVRRIDRDDITIAVDGSLYKYHPRLKHWLQKYIQLLVPNKTFRLLLAEDGSGKGA 64 (80)
Q Consensus 4 aa~iaai~~~~~~~~~~IavDGsvye~~p~f~~~l~~~l~~l~~~~~v~~~~a~DGSg~GA 64 (80)
+-.|+.+..-++.+.-.|-+-|++-+ .+.|.+.+.+.++.+ .++.+..++++.--++
T Consensus 282 ak~I~~l~~~L~gdpD~IVlgGGI~~-~~~l~~~I~~~l~~~---~pv~i~p~~~e~~A~a 338 (358)
T PRK03011 282 AKEIGAMAAVLKGKVDAIVLTGGLAY-SKRLVERIKERVSFI---APVIVYPGEDEMEALA 338 (358)
T ss_pred HHHHHHHHHHhCCCCCEEEEeCcccc-CHHHHHHHHHHHHhh---CCeEEEeCCCHHHHHH
Confidence 34566666666322334677888887 899999999999877 3789999999854443
No 102
>TIGR01484 HAD-SF-IIB HAD-superfamily hydrolase, subfamily IIB. The IIB subfamily consists of Trehalose-6-phosphatase (TIGR00685), plant and cyanobacterial Sucrose-phosphatase and a closely related group of bacterial and archaeal sequences, eukaryotic phosphomannomutase (pfam03332), a large subfamily ("Cof-like hydrolases", TIGR00099) containing many closely related bacterial sequences, a hypothetical equivalog containing the E. coli YedP protein, as well as two small clusters containing sequences whose relationship to the other groups is unclear.
Probab=29.89 E-value=46 Score=22.12 Aligned_cols=38 Identities=16% Similarity=0.209 Sum_probs=25.3
Q ss_pred EeEeCceeeeC-ccHHHHHHHHHHhhCCCCceEEEEeCCc
Q psy15537 21 IAVDGSLYKYH-PRLKHWLQKYIQLLVPNKTFRLLLAEDG 59 (80)
Q Consensus 21 IavDGsvye~~-p~f~~~l~~~l~~l~~~~~v~~~~a~DG 59 (80)
.=+||++.... ..+.+.+.+++++|.... +.+.++.+-
T Consensus 4 ~D~DgTL~~~~~~~~~~~~~~~l~~l~~~g-~~~~i~TGR 42 (204)
T TIGR01484 4 FDLDGTLLDPNAHELSPETIEALERLREAG-VKVVLVTGR 42 (204)
T ss_pred EeCcCCCcCCCCCcCCHHHHHHHHHHHHCC-CEEEEECCC
Confidence 34799999765 567788888888886332 455544433
No 103
>PRK10018 putative glycosyl transferase; Provisional
Probab=29.69 E-value=2.2e+02 Score=20.63 Aligned_cols=59 Identities=22% Similarity=0.223 Sum_probs=38.9
Q ss_pred hhhhhhhhhcCC-ceEEeEeCceeeeCccHHHHHHHHHHhhCCCCceEEEEeCCcchHHHHHHHHHH
Q psy15537 6 GTAVLVRRIDRD-DITIAVDGSLYKYHPRLKHWLQKYIQLLVPNKTFRLLLAEDGSGKGAGLASAIA 71 (80)
Q Consensus 6 ~iaai~~~~~~~-~~~IavDGsvye~~p~f~~~l~~~l~~l~~~~~v~~~~a~DGSg~GAAl~aA~a 71 (80)
+|..++.+...+ ++.|.=|||. . .+.+++.+.++ .+..|++...+...|.++|.-.++.
T Consensus 23 ~l~Svl~Qt~~~~EiIVVDDgS~-----~-~~~~~~~~~~~-~~~ri~~i~~~~n~G~~~a~N~gi~ 82 (279)
T PRK10018 23 AIKSVLRQDYSNWEMIIVDDCST-----S-WEQLQQYVTAL-NDPRITYIHNDINSGACAVRNQAIM 82 (279)
T ss_pred HHHHHHhCCCCCeEEEEEECCCC-----C-HHHHHHHHHHc-CCCCEEEEECCCCCCHHHHHHHHHH
Confidence 455666665543 6778888885 1 24455555443 3457888888888899988876654
No 104
>cd00012 ACTIN Actin; An ubiquitous protein involved in the formation of filaments that are a major component of the cytoskeleton. Interaction with myosin provides the basis of muscular contraction and many aspects of cell motility. Each actin protomer binds one molecule of ATP and either calcium or magnesium ions. Actin exists as a monomer in low salt concentrations, but filaments form rapidly as salt concentration rises, with the consequent hydrolysis of ATP. Polymerization is regulated by so-called capping proteins. The ATPase domain of actin shares similarity with ATPase domains of hexokinase and hsp70 proteins.
Probab=29.67 E-value=35 Score=25.19 Aligned_cols=50 Identities=18% Similarity=0.296 Sum_probs=34.6
Q ss_pred EeEeCceeeeCccHHHHHHHHHHhhCCC---CceEEEEe---CCcchHHHHHHHHHH
Q psy15537 21 IAVDGSLYKYHPRLKHWLQKYIQLLVPN---KTFRLLLA---EDGSGKGAGLASAIA 71 (80)
Q Consensus 21 IavDGsvye~~p~f~~~l~~~l~~l~~~---~~v~~~~a---~DGSg~GAAl~aA~a 71 (80)
|-+=|+.-. .|+|.+++++.++.++|. ..+.+... ...+.+|+++.|.+.
T Consensus 293 Ivl~GG~s~-~~gl~~rl~~el~~~~~~~~~~~~~~~~~~~~~~~aw~G~si~as~~ 348 (371)
T cd00012 293 IVLSGGSTL-FPGFGERLQKELLKLAPPSKDTKVKVIAPPERKYSVWLGGSILASLS 348 (371)
T ss_pred EEEeCCccC-CcCHHHHHHHHHHHhCCcccceEEEEccCCCccccEEeCchhhcCch
Confidence 444454433 589999999999999975 23444443 345788999888764
No 105
>cd02569 PseudoU_synth_ScPus3 PseudoU_synth_ScPus3-like: Pseudouridine synthase, Saccharomyces cerevisiae Pus3 like. This group consists of eukaryotic pseudouridine synthases similar to S. cerevisiae Pus3p, mouse Pus3p and, human PUS2. Pseudouridine synthases catalyze the isomerization of specific uridines in an RNA molecule to pseudouridines (5-ribosyluracil, psi). No cofactors are required. S. cerevisiae Pus3p makes psi38 and psi39 in tRNAs. Mouse Pus3p has been shown to makes psi38 and, possibly also psi 39, in tRNAs. Psi38 and psi39 are highly conserved in tRNAs from eubacteria, archea and eukarya.
Probab=29.59 E-value=1.3e+02 Score=21.78 Aligned_cols=28 Identities=14% Similarity=0.063 Sum_probs=22.3
Q ss_pred eEEeEeCceee-----e--CccHHHHHHHHHHhhC
Q psy15537 19 ITIAVDGSLYK-----Y--HPRLKHWLQKYIQLLV 46 (80)
Q Consensus 19 ~~IavDGsvye-----~--~p~f~~~l~~~l~~l~ 46 (80)
.+|+.||+-|. . .|..+..++++|..+.
T Consensus 1 l~i~YdGt~y~G~Q~Q~~~~~TVq~~Le~aL~~~~ 35 (256)
T cd02569 1 LRFAYLGWNYNGFAVQEETTNTVEETLFEALEKTR 35 (256)
T ss_pred CEEEEeCCcceeeeECCCCCCCHHHHHHHHHHHhc
Confidence 36899999875 2 3678999999998885
No 106
>TIGR02364 dha_pts dihydroxyacetone kinase, phosphotransfer subunit. In E. coli and many other bacteria, unlike the yeasts and a few bacteria such as Citrobacter freundii, the dihydroxyacetone kinase (also called glycerone kinase) transfers a phosphate from a phosphoprotein rather than from ATP and contains multiple subunits. This protein, which resembles proteins of PTS transport systems, is found with its gene adjacent to
Probab=29.36 E-value=1.6e+02 Score=19.06 Aligned_cols=58 Identities=16% Similarity=0.117 Sum_probs=40.7
Q ss_pred hhhhhhhhhhhcC-CceEEeEeCceeeeCccHHHHHHHHHHhhCC-CCceEEEEeCCcchH
Q psy15537 4 AQGTAVLVRRIDR-DDITIAVDGSLYKYHPRLKHWLQKYIQLLVP-NKTFRLLLAEDGSGK 62 (80)
Q Consensus 4 aa~iaai~~~~~~-~~~~IavDGsvye~~p~f~~~l~~~l~~l~~-~~~v~~~~a~DGSg~ 62 (80)
|-++...++.|.. +.-..+++|.-....-.|.++++++++++.+ +..|-+...= ||..
T Consensus 13 A~Gi~~~~~~~~g~~~~i~~~gg~~d~~~gt~~~~I~~ai~~~~~~~dgVlvl~DL-Ggs~ 72 (125)
T TIGR02364 13 AEGIKELIKQMAGDDVTIISAGGTDDGRLGTSPDKIIEAIEKADNEADGVLIFYDL-GSAV 72 (125)
T ss_pred HHHHHHHHHHHcCCCccEEEEecCCCCCccchHHHHHHHHHHhcCCCCCEEEEEcC-CCcH
Confidence 4566777888854 4556788888888878899999999999975 4444433333 5554
No 107
>PF12539 Csm1: Chromosome segregation protein Csm1/Pcs1; InterPro: IPR020981 Saccharomyces cerevisiae Csm1 is part of the monopolin complex. Csm1 forms a complex with Mde4 and promotes monoorientation during meiosis []. It plays a mitotic role in DNA replication [] and is required for condensin recruitment to the replication fork barrier site and rDNA repeat segregation []. This entry also contains the Schizosaccharomyces pombe homologue to Csm1, Pcs1. Pcs1 forms a complex with Mde4 and acts in the central kinetochore domain to clamp microtubule binding sites together[]. The two complexes (Csm1/Lrs4 and Pcs1/Mde4) contribute to the prevention of merotelic attachment [].; GO: 0005515 protein binding; PDB: 3N4S_C 3N4R_B 3N7N_A 3N4X_B.
Probab=29.32 E-value=40 Score=21.20 Aligned_cols=36 Identities=19% Similarity=0.308 Sum_probs=23.8
Q ss_pred eEEeEeCc---eeeeCccHHHHHHH---HHHhhCCC---CceEEE
Q psy15537 19 ITIAVDGS---LYKYHPRLKHWLQK---YIQLLVPN---KTFRLL 54 (80)
Q Consensus 19 ~~IavDGs---vye~~p~f~~~l~~---~l~~l~~~---~~v~~~ 54 (80)
..|.=|++ -+.+.|.|.+...+ .+.++||+ .++.|-
T Consensus 43 L~i~~~~~~~~e~~Y~P~l~~~~d~~~~~L~~~LPdYL~d~itFp 87 (90)
T PF12539_consen 43 LGIDKDFSEDTEFTYVPLLDENRDRNLKELIKILPDYLCDEITFP 87 (90)
T ss_dssp EEEE-STTT--EEEEEEE-SSS-HHHHHHHHHHS-GGGGS-EEEE
T ss_pred EEeccCCCCCceEEEEeccCcccchhHHHHHHHCcHhhEEEEEec
Confidence 44555554 89999999999999 88888897 466663
No 108
>cd06844 STAS Sulphate Transporter and Anti-Sigma factor antagonist domain found in the C-terminal region of sulphate transporters as well as in bacterial and archaeal proteins involved in the regulation of sigma factors. The STAS (Sulphate Transporter and Anti-Sigma factor antagonist) domain is found in the C-terminal region of sulphate transporters as well as in bacterial and archaeal proteins involved in the regulation of sigma factors, like anti-anti-sigma factors and "stressosome" components. The sigma factor regulators are involved in protein-protein interaction which is regulated by phosphorylation.
Probab=29.00 E-value=1.3e+02 Score=17.91 Aligned_cols=49 Identities=24% Similarity=0.279 Sum_probs=27.6
Q ss_pred ceEEeEeCce-eeeCccHHHHHHHHHHhhCCCCceEEEEe----CCcchHHHHHH
Q psy15537 18 DITIAVDGSL-YKYHPRLKHWLQKYIQLLVPNKTFRLLLA----EDGSGKGAGLA 67 (80)
Q Consensus 18 ~~~IavDGsv-ye~~p~f~~~l~~~l~~l~~~~~v~~~~a----~DGSg~GAAl~ 67 (80)
-.++-++|.+ |...+.|++.+.+.+.+- +...+.+.++ =|.||.|+=+-
T Consensus 9 v~ii~~~G~l~f~~~~~~~~~l~~~~~~~-~~~~vilDls~v~~iDssgl~~L~~ 62 (100)
T cd06844 9 YWVVRLEGELDHHSVEQFKEELLHNITNV-AGKTIVIDISALEFMDSSGTGVLLE 62 (100)
T ss_pred EEEEEEEEEecHhhHHHHHHHHHHHHHhC-CCCEEEEECCCCcEEcHHHHHHHHH
Confidence 3568999998 445566777666555321 1133333332 27777765443
No 109
>PLN02408 phospholipase A1
Probab=28.74 E-value=1.5e+02 Score=23.03 Aligned_cols=39 Identities=26% Similarity=0.392 Sum_probs=22.3
Q ss_pred HHHHHHHHHHhhC---CCCceEEEEeCCcchHHHHHHHHHHhHH
Q psy15537 34 LKHWLQKYIQLLV---PNKTFRLLLAEDGSGKGAGLASAIALKL 74 (80)
Q Consensus 34 f~~~l~~~l~~l~---~~~~v~~~~a~DGSg~GAAl~aA~a~~~ 74 (80)
.++.+.+.|++++ |+.++.+ .--|-.+|+||+.-.+...
T Consensus 180 ~r~qVl~eI~~ll~~y~~~~~sI--~vTGHSLGGALAtLaA~dl 221 (365)
T PLN02408 180 LQEMVREEIARLLQSYGDEPLSL--TITGHSLGAALATLTAYDI 221 (365)
T ss_pred HHHHHHHHHHHHHHhcCCCCceE--EEeccchHHHHHHHHHHHH
Confidence 4445556666666 3433322 2348888999876555443
No 110
>PF12301 CD99L2: CD99 antigen like protein 2; InterPro: IPR022078 This family of proteins is found in eukaryotes. Proteins in this family are typically between 165 and 237 amino acids in length. CD99L2 and CD99 are involved in trans-endothelial migration of neutrophils in vitro and in the recruitment of neutrophils into inflamed peritoneum.
Probab=28.68 E-value=53 Score=22.86 Aligned_cols=21 Identities=19% Similarity=0.137 Sum_probs=16.1
Q ss_pred chHHHHHHHHHHhHHhhhhCC
Q psy15537 60 SGKGAGLASAIALKLGAFQSK 80 (80)
Q Consensus 60 Sg~GAAl~aA~a~~~~~~~~~ 80 (80)
|.++.|++-|++.-..-|++|
T Consensus 122 sav~valvGAvsSyiaYqkKK 142 (169)
T PF12301_consen 122 SAVVVALVGAVSSYIAYQKKK 142 (169)
T ss_pred HHHHHHHHHHHHHHHHHHhhc
Confidence 677888888888777777765
No 111
>TIGR01549 HAD-SF-IA-v1 haloacid dehalogenase superfamily, subfamily IA, variant 1 with third motif having Dx(3-4)D or Dx(3-4)E. HAD subfamilies caused by an overly broad single model.
Probab=28.66 E-value=36 Score=21.46 Aligned_cols=23 Identities=13% Similarity=0.320 Sum_probs=19.9
Q ss_pred EeCceeeeCccHHHHHHHHHHhh
Q psy15537 23 VDGSLYKYHPRLKHWLQKYIQLL 45 (80)
Q Consensus 23 vDGsvye~~p~f~~~l~~~l~~l 45 (80)
+||+++-..|.+...+++.+++.
T Consensus 6 ~DGTL~d~~~~~~~~~~~~~~~~ 28 (154)
T TIGR01549 6 IDGTLVDSSFAIRRAFEETLEEF 28 (154)
T ss_pred CCCcccccHHHHHHHHHHHHHHh
Confidence 69999999999888888888765
No 112
>PRK10530 pyridoxal phosphate (PLP) phosphatase; Provisional
Probab=28.08 E-value=53 Score=22.72 Aligned_cols=35 Identities=9% Similarity=0.183 Sum_probs=23.4
Q ss_pred EeEeCceeeeCccHHHHHHHHHHhhCCCCceEEEEe
Q psy15537 21 IAVDGSLYKYHPRLKHWLQKYIQLLVPNKTFRLLLA 56 (80)
Q Consensus 21 IavDGsvye~~p~f~~~l~~~l~~l~~~~~v~~~~a 56 (80)
.=+|||++.......+.-.++|+++.. ..+.|.++
T Consensus 8 ~DlDGTLl~~~~~i~~~~~~ai~~~~~-~G~~~~ia 42 (272)
T PRK10530 8 LDLDGTLLTPKKTILPESLEALARARE-AGYKVIIV 42 (272)
T ss_pred EeCCCceECCCCccCHHHHHHHHHHHH-CCCEEEEE
Confidence 447999997666677777888888752 23444444
No 113
>TIGR03351 PhnX-like phosphonatase-like hydrolase. This clade of sequences are the closest homologs to the PhnX enzyme, phosphonoacetaldehyde (Pald) hydrolase (phosphonatase, TIGR01422). This phosphonatase-like enzyme and PhnX itself are members of the haloacid dehalogenase (HAD) superfamily (pfam00702) having a a number of distinctive features that set them apart from typical HAD enzymes. The typical HAD N-terminal motif DxDx(T/V) here is DxAGT and the usual conserved lysine prior to the C-terminal motif is instead an arginine. Also distinctive of phosphonatase, and particular to its bi-catalytic mechanism is a conserved lysine in the variable "cap" domain. This lysine forms a Schiff base with the aldehyde of phosphonoacetaldehyde, providing, through the resulting positive charge, a polarization of the C-P bond necesary for cleavage as well as a route to the initial product of cleavage, an ene-amine. The conservation of these elements in this phosphonatase-like enzyme suggests that the
Probab=27.31 E-value=55 Score=21.96 Aligned_cols=27 Identities=7% Similarity=0.037 Sum_probs=22.9
Q ss_pred EEeEeCceeeeCccHHHHHHHHHHhhC
Q psy15537 20 TIAVDGSLYKYHPRLKHWLQKYIQLLV 46 (80)
Q Consensus 20 ~IavDGsvye~~p~f~~~l~~~l~~l~ 46 (80)
..=+||+++...|.+.+.+++++++.-
T Consensus 5 iFD~DGTL~ds~~~~~~~~~~~~~~~g 31 (220)
T TIGR03351 5 VLDMAGTTVDEDGLVYRALRQAVTAAG 31 (220)
T ss_pred EEecCCCeeccCchHHHHHHHHHHHcC
Confidence 345899999999999999999998753
No 114
>PLN02698 Probable pectinesterase/pectinesterase inhibitor
Probab=27.14 E-value=43 Score=26.92 Aligned_cols=37 Identities=19% Similarity=0.371 Sum_probs=27.0
Q ss_pred cHHHHHHHHHHhhCC--CCceEEEEeCCcch----HHHHHHHH
Q psy15537 33 RLKHWLQKYIQLLVP--NKTFRLLLAEDGSG----KGAGLASA 69 (80)
Q Consensus 33 ~f~~~l~~~l~~l~~--~~~v~~~~a~DGSg----~GAAl~aA 69 (80)
.|+.+|...=++||. ..+..+..++|||| +.+|+.++
T Consensus 194 ~~p~w~~~~~r~ll~~~~~~~~~~Va~dGsG~f~tiq~Ai~a~ 236 (497)
T PLN02698 194 VFPRWVSAGDRKLLQGRTIKANAVVAKDGTGNYETVSEAITAA 236 (497)
T ss_pred CCCcccchhhHhhhccCCCCceEEEcCCCCCCcccHHHHHHhh
Confidence 577788888888883 34567789999998 56666544
No 115
>PLN02488 probable pectinesterase/pectinesterase inhibitor
Probab=27.11 E-value=21 Score=28.93 Aligned_cols=38 Identities=26% Similarity=0.336 Sum_probs=27.4
Q ss_pred cHHHHHHHHHHhhCC--C--C--ceEEEEeCCcch----HHHHHHHHH
Q psy15537 33 RLKHWLQKYIQLLVP--N--K--TFRLLLAEDGSG----KGAGLASAI 70 (80)
Q Consensus 33 ~f~~~l~~~l~~l~~--~--~--~v~~~~a~DGSg----~GAAl~aA~ 70 (80)
.|+.++...-++||. . . +..+..++|||| +.+|+.++=
T Consensus 173 ~~P~W~~~~dR~lL~~~~~~~~~~~~vvVa~dGsG~f~TIq~AI~a~P 220 (509)
T PLN02488 173 NGPSWLSNVDKKYLYLNPEVLKKIADVVVAKDGSGKYNTVNAAIAAAP 220 (509)
T ss_pred CCCCCCCccchhhhhcCcccccccccEEECCCCCCCccCHHHHHHhch
Confidence 566777777777772 2 1 357889999999 888886654
No 116
>COG3195 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=26.81 E-value=24 Score=24.94 Aligned_cols=14 Identities=14% Similarity=0.368 Sum_probs=12.0
Q ss_pred eCceeeeCccHHHH
Q psy15537 24 DGSLYKYHPRLKHW 37 (80)
Q Consensus 24 DGsvye~~p~f~~~ 37 (80)
=|++|||.|+..++
T Consensus 23 fg~ifEhSpWvaEr 36 (176)
T COG3195 23 FGGIFEHSPWVAER 36 (176)
T ss_pred hhhHhhcCchHHHH
Confidence 38999999999876
No 117
>TIGR00336 pyrE orotate phosphoribosyltransferase. The conserved Lys (K) residue at position 101 of the seed alignment has been proposed as the active site for the enzyme.
Probab=26.49 E-value=1.3e+02 Score=20.12 Aligned_cols=44 Identities=14% Similarity=0.283 Sum_probs=33.8
Q ss_pred eEEeEeCceeeeCccHHHHHHHHHHhhCCC-CceEEEEeCCcchH
Q psy15537 19 ITIAVDGSLYKYHPRLKHWLQKYIQLLVPN-KTFRLLLAEDGSGK 62 (80)
Q Consensus 19 ~~IavDGsvye~~p~f~~~l~~~l~~l~~~-~~v~~~~a~DGSg~ 62 (80)
..+=+|-+.+-.+|.+.+.+.+.+.+++++ .++......+-+|.
T Consensus 22 s~~y~d~~~i~~~p~~~~~v~~~~~~~~~~~~~~d~Ivg~~~gG~ 66 (173)
T TIGR00336 22 SPYYFNIKLFNTGPELANLIARYAAAIIKSHLEFDVIAGPALGGI 66 (173)
T ss_pred CCEEEECeecCChHHHHHHHHHHHHHHHHhcCCCCEEEccccChH
Confidence 457789999999999999999999998854 45555665555554
No 118
>PF13180 PDZ_2: PDZ domain; PDB: 2L97_A 1Y8T_A 2Z9I_A 1LCY_A 2PZD_B 2P3W_A 1VCW_C 1TE0_B 1SOZ_C 1SOT_C ....
Probab=26.33 E-value=1.1e+02 Score=17.44 Aligned_cols=40 Identities=25% Similarity=0.361 Sum_probs=27.8
Q ss_pred CceEEeEeCceeeeCccHHHHHHHHHHhhCCCCceEEEEeCCcc
Q psy15537 17 DDITIAVDGSLYKYHPRLKHWLQKYIQLLVPNKTFRLLLAEDGS 60 (80)
Q Consensus 17 ~~~~IavDGsvye~~p~f~~~l~~~l~~l~~~~~v~~~~a~DGS 60 (80)
..+.++|||.-......|. +.+...-++.++.+...++|.
T Consensus 35 GD~I~~ing~~v~~~~~~~----~~l~~~~~g~~v~l~v~R~g~ 74 (82)
T PF13180_consen 35 GDIILAINGKPVNSSEDLV----NILSKGKPGDTVTLTVLRDGE 74 (82)
T ss_dssp TEEEEEETTEESSSHHHHH----HHHHCSSTTSEEEEEEEETTE
T ss_pred CcEEEEECCEEcCCHHHHH----HHHHhCCCCCEEEEEEEECCE
Confidence 3688999999885543333 444444477888888888773
No 119
>PRK10976 putative hydrolase; Provisional
Probab=26.19 E-value=53 Score=22.86 Aligned_cols=26 Identities=23% Similarity=0.406 Sum_probs=20.6
Q ss_pred EeEeCceeeeCccHHHHHHHHHHhhC
Q psy15537 21 IAVDGSLYKYHPRLKHWLQKYIQLLV 46 (80)
Q Consensus 21 IavDGsvye~~p~f~~~l~~~l~~l~ 46 (80)
+=+|||++.......+.-.++|+++.
T Consensus 7 ~DlDGTLl~~~~~is~~~~~ai~~l~ 32 (266)
T PRK10976 7 SDLDGTLLSPDHTLSPYAKETLKLLT 32 (266)
T ss_pred EeCCCCCcCCCCcCCHHHHHHHHHHH
Confidence 44799999876677788888888875
No 120
>COG0101 TruA Pseudouridylate synthase [Translation, ribosomal structure and biogenesis]
Probab=26.18 E-value=97 Score=22.91 Aligned_cols=29 Identities=21% Similarity=0.314 Sum_probs=22.5
Q ss_pred ceEEeEeCceee---eC---ccHHHHHHHHHHhhC
Q psy15537 18 DITIAVDGSLYK---YH---PRLKHWLQKYIQLLV 46 (80)
Q Consensus 18 ~~~IavDGsvye---~~---p~f~~~l~~~l~~l~ 46 (80)
...|+.|||-|- .. |..++.++++|.++.
T Consensus 5 ~l~iaYdGt~f~G~Q~Qp~~~TVQ~~le~aL~~i~ 39 (266)
T COG0101 5 ALKIAYDGTRFHGWQRQPNVRTVQGELEKALSKIG 39 (266)
T ss_pred EEEEEEcCCceeeeccCCCCCCHHHHHHHHHHHhc
Confidence 478999999863 33 678888888888876
No 121
>PRK06437 hypothetical protein; Provisional
Probab=26.12 E-value=1.1e+02 Score=17.66 Aligned_cols=43 Identities=16% Similarity=0.186 Sum_probs=28.4
Q ss_pred hhhhhhhhhhcCC--ceEEeEeCceeeeCccHHHHHHHHHHhhC-CCCceEEEEeCCc
Q psy15537 5 QGTAVLVRRIDRD--DITIAVDGSLYKYHPRLKHWLQKYIQLLV-PNKTFRLLLAEDG 59 (80)
Q Consensus 5 a~iaai~~~~~~~--~~~IavDGsvye~~p~f~~~l~~~l~~l~-~~~~v~~~~a~DG 59 (80)
+.+.-++++++.+ .+.|.+||.+.. -+..+ ++.+|.+...-.|
T Consensus 21 ~tv~dLL~~Lgi~~~~vaV~vNg~iv~------------~~~~L~dgD~Veiv~~V~G 66 (67)
T PRK06437 21 LTVNDIIKDLGLDEEEYVVIVNGSPVL------------EDHNVKKEDDVLILEVFSG 66 (67)
T ss_pred CcHHHHHHHcCCCCccEEEEECCEECC------------CceEcCCCCEEEEEecccC
Confidence 4567778877764 588899999885 12233 5577777765544
No 122
>COG0561 Cof Predicted hydrolases of the HAD superfamily [General function prediction only]
Probab=25.92 E-value=55 Score=22.79 Aligned_cols=41 Identities=17% Similarity=0.294 Sum_probs=30.2
Q ss_pred eEEeEeCceeeeCccHHHHHHHHHHhhCCCCceEEEEeCCcc
Q psy15537 19 ITIAVDGSLYKYHPRLKHWLQKYIQLLVPNKTFRLLLAEDGS 60 (80)
Q Consensus 19 ~~IavDGsvye~~p~f~~~l~~~l~~l~~~~~v~~~~a~DGS 60 (80)
+.+=+||++........+...++++++. +..+.+.++.+-+
T Consensus 6 i~~DlDGTLl~~~~~i~~~~~~al~~~~-~~g~~v~iaTGR~ 46 (264)
T COG0561 6 LAFDLDGTLLDSNKTISPETKEALARLR-EKGVKVVLATGRP 46 (264)
T ss_pred EEEcCCCCccCCCCccCHHHHHHHHHHH-HCCCEEEEECCCC
Confidence 4566899999999999999999998664 3345555555443
No 123
>PRK10725 fructose-1-P/6-phosphogluconate phosphatase; Provisional
Probab=25.79 E-value=42 Score=21.85 Aligned_cols=27 Identities=11% Similarity=0.308 Sum_probs=22.5
Q ss_pred eEEeEeCceeeeCccHHHHHHHHHHhh
Q psy15537 19 ITIAVDGSLYKYHPRLKHWLQKYIQLL 45 (80)
Q Consensus 19 ~~IavDGsvye~~p~f~~~l~~~l~~l 45 (80)
+..=+||+++...+.+.+..++++++.
T Consensus 8 viFD~DGTLiDs~~~~~~a~~~~~~~~ 34 (188)
T PRK10725 8 LIFDMDGTILDTEPTHRKAWREVLGRY 34 (188)
T ss_pred EEEcCCCcCccCHHHHHHHHHHHHHHc
Confidence 344589999999999999999888875
No 124
>cd07211 Pat_PNPLA8 Patatin-like phospholipase domain containing protein 8. PNPLA8 is a Ca-independent myocardial phospholipase which maintains mitochondrial integrity. PNPLA8 is also known as iPLA2-gamma. In humans, it is predominantly expressed in heart tissue. iPLA2-gamma can catalyze both phospholipase A1 and A2 reactions (PLA1 and PLA2 respectively). This family includes PNPLA8 (iPLA2-gamma) from Homo sapiens and iPLA2-2 from Mus musculus.
Probab=25.53 E-value=94 Score=22.57 Aligned_cols=30 Identities=23% Similarity=0.381 Sum_probs=20.0
Q ss_pred EeEeCceeeeCccHHHHHHHHHHhhCCCCceE
Q psy15537 21 IAVDGSLYKYHPRLKHWLQKYIQLLVPNKTFR 52 (80)
Q Consensus 21 IavDGsvye~~p~f~~~l~~~l~~l~~~~~v~ 52 (80)
.=|||+++..+|...-. .|+. .+.|+.++.
T Consensus 190 ~~vDGGv~aNnP~~~a~-~ea~-~~~~~~~i~ 219 (308)
T cd07211 190 LHQDGGLLANNPTALAL-HEAK-LLWPDTPIQ 219 (308)
T ss_pred eEEECCcccCCcHHHHH-HHHH-HhCCCCCCc
Confidence 56899999999986654 4443 445654443
No 125
>PRK14586 tRNA pseudouridine synthase ACD; Provisional
Probab=25.23 E-value=1.9e+02 Score=20.78 Aligned_cols=30 Identities=10% Similarity=0.183 Sum_probs=23.1
Q ss_pred ceEEeEeCceee---eC---ccHHHHHHHHHHhhCC
Q psy15537 18 DITIAVDGSLYK---YH---PRLKHWLQKYIQLLVP 47 (80)
Q Consensus 18 ~~~IavDGsvye---~~---p~f~~~l~~~l~~l~~ 47 (80)
.++|+.||+-|. +. +.-++.++++|..+.+
T Consensus 5 ~l~i~YdGt~f~G~Q~Q~~~~TVq~~le~aL~~~~~ 40 (245)
T PRK14586 5 AAVVSYDGSNFFGYQGQPDVRTVQGVFEDALERIFK 40 (245)
T ss_pred EEEEEEcCCceeeEEECCCCCCHHHHHHHHHHHHhC
Confidence 478999999763 33 4688899999988874
No 126
>PF00733 Asn_synthase: Asparagine synthase; InterPro: IPR001962 This domain is always found associated with (IPR000583 from INTERPRO). Family members that contain this domain catalyse the conversion of aspartate to asparagine. Asparagine synthetase B (6.3.5.4 from EC) catalyzes the assembly of asparagine from aspartate, Mg(2+)ATP, and glutamine. The three-dimensional architecture of the N-terminal domain of asparagine synthetase B is similar to that observed for glutamine phosphoribosylpyrophosphate amidotransferase while the molecular motif of the C-domain is reminiscent to that observed for GMP synthetase [].; GO: 0004066 asparagine synthase (glutamine-hydrolyzing) activity, 0006529 asparagine biosynthetic process; PDB: 1JGT_A 1M1Z_B 1MB9_B 1MBZ_B 1MC1_A 1Q15_D 1Q19_C 1CT9_C 3K32_F.
Probab=24.46 E-value=2.2e+02 Score=18.99 Aligned_cols=38 Identities=21% Similarity=0.390 Sum_probs=26.6
Q ss_pred cHHHHHHHHHHhhC-CCCceEEEEeCCcchHHHHHHHHHHhH
Q psy15537 33 RLKHWLQKYIQLLV-PNKTFRLLLAEDGSGKGAGLASAIALK 73 (80)
Q Consensus 33 ~f~~~l~~~l~~l~-~~~~v~~~~a~DGSg~GAAl~aA~a~~ 73 (80)
.|++.+++++++.+ .+.+|-+.+| .|+-..++++++.+
T Consensus 1 ~~r~~l~~av~~rl~~~~~i~~~LS---GGlDSs~i~~~~~~ 39 (255)
T PF00733_consen 1 ELRELLEEAVARRLRSDKPIGILLS---GGLDSSAIAALAAR 39 (255)
T ss_dssp HHHHHHHHHHHHHCGCTSEEEEE-----SSHHHHHHHHHHHH
T ss_pred CHHHHHHHHHHHHHhcCCCEEEECC---CChhHHHHHHHHHH
Confidence 37899999999988 4677776665 45666677766655
No 127
>TIGR02629 L_rham_iso_rhiz L-rhamnose catabolism isomerase, Pseudomonas stutzeri subtype. Members of this family are isomerases in the pathway of L-rhamnose catabolism as found in Pseudomonas stutzeri and in a number of the Rhizobiales. This family differs from the L-rhamnose isomerases of Escherichia coli (see TIGR01748). This enzyme catalyzes the isomerization step in rhamnose catabolism. Genetic evidence in Rhizobium leguminosarum bv. trifolii suggests phosphorylation occurs first, then isomerization of the the phosphorylated sugar, but characterization of the recombinant enzyme from Pseudomonas stutzeri does show L-rhamnose isomerase activity. The name given is deliberately vague because the relative order of phosphorylation and isomerization is unclear.
Probab=23.97 E-value=1.4e+02 Score=23.77 Aligned_cols=36 Identities=14% Similarity=0.232 Sum_probs=26.9
Q ss_pred hhhhhcCCceE--EeEeCceeeeCccH---HHHHHHHHHhhC
Q psy15537 10 LVRRIDRDDIT--IAVDGSLYKYHPRL---KHWLQKYIQLLV 46 (80)
Q Consensus 10 i~~~~~~~~~~--IavDGsvye~~p~f---~~~l~~~l~~l~ 46 (80)
|.+.++.+.++ ++ ||+=|-.-..| ++++.++++++.
T Consensus 158 Ig~elGs~~v~IW~g-DG~~yP~Q~~~~~~~~rl~esL~eI~ 198 (412)
T TIGR02629 158 IGKALGSKALTVWIG-DGSNFPGQSNFTRAFERYLDAMKAVY 198 (412)
T ss_pred HHHHhCCCeeEEECC-CCCCCcCccchHHHHHHHHHHHHHHH
Confidence 35556655555 58 99998776666 788999999988
No 128
>COG1017 Hmp Hemoglobin-like flavoprotein [Energy production and conversion]
Probab=23.78 E-value=61 Score=22.37 Aligned_cols=37 Identities=16% Similarity=0.252 Sum_probs=26.1
Q ss_pred hhhhhhcCCceEEeEeCceeeeCccHHHHHHHHHHhhCCC
Q psy15537 9 VLVRRIDRDDITIAVDGSLYKYHPRLKHWLQKYIQLLVPN 48 (80)
Q Consensus 9 ai~~~~~~~~~~IavDGsvye~~p~f~~~l~~~l~~l~~~ 48 (80)
.++.++...++..+| .=|+||-.-++|-.+++|++++
T Consensus 76 ~~v~rIa~KHvsl~I---~pEhYpIVge~LL~aI~evlgd 112 (150)
T COG1017 76 PVVERIAHKHVSLQI---KPEHYPIVGEHLLAAIKEVLGD 112 (150)
T ss_pred HHHHHHHHHHHhcCC---ChhhccHHHHHHHHHHHHHhCc
Confidence 344444444444444 3489999999999999999964
No 129
>PF01138 RNase_PH: 3' exoribonuclease family, domain 1 This Prosite family only includes Ribonuclease PH; InterPro: IPR001247 The PH (phosphorolytic) domain is responsible for 3'-5' exoribonuclease activity, although in some proteins this domain has lost its catalytic function. An active PH domain uses inorganic phosphate as a nucleophile, adding it across the phosphodiester bond between the end two nucleotides in order to release ribonucleoside 5'-diphosphate (rNDP) from the 3' end of the RNA substrate. PH domains can be found in bacterial/organelle RNases and PNPases (polynucleotide phosphorylases) [], as well as in archaeal and eukaryotic RNA exosomes [, ], the later acting as nano-compartments for the degradation or processing of RNA (including mRNA, rRNA, snRNA and snoRNA). Bacterial/organelle PNPases share a common barrel structure with RNA exosomes, consisting of a hexameric ring of PH domains that act as a degradation chamber, and an S1-domain/KH-domain containing cap that binds the RNA substrate (and sometimes accessory proteins) in order to regulate and restrict entry into the degradation chamber []. Unstructured RNA substrates feed in through the pore made by the S1 domains, are degraded by the PH domain ring, and exit as nucleotides via the PH pore at the opposite end of the barrel [, ]. This entry represents the phosphorolytic (PH) domain 1, which has a core 2-layer alpha/beta structure with a left-handed crossover, similar to that found in ribosomal protein S5. This domain is found in bacterial/organelle PNPases and in archaeal/eukaryotic exosomes []. More information about these proteins can be found at Protein of the Month: RNA Exosomes [].; PDB: 2C38_G 2BR2_O 2C37_M 3L7Z_A 2JEB_A 2C39_A 2JEA_A 2JE6_A 3U1K_A 4AM3_B ....
Probab=23.71 E-value=1.9e+02 Score=17.87 Aligned_cols=39 Identities=18% Similarity=0.236 Sum_probs=23.3
Q ss_pred HHHHHHHHHHhhC--CCC-----ce-EEEEeCCc-chHHHHHHHHHHh
Q psy15537 34 LKHWLQKYIQLLV--PNK-----TF-RLLLAEDG-SGKGAGLASAIAL 72 (80)
Q Consensus 34 f~~~l~~~l~~l~--~~~-----~v-~~~~a~DG-Sg~GAAl~aA~a~ 72 (80)
+...+++.++... +.. .| ...+..|| +-.-+++.|++..
T Consensus 78 l~~~l~~~l~~~~~~~~~~~~~i~v~v~vl~~dG~~~~~a~~~A~~~A 125 (132)
T PF01138_consen 78 LSSLLERALRSSILLEGYPRWQIHVDVQVLSDDGGNLLDAAINAACLA 125 (132)
T ss_dssp HHHHHHHHHHHTBSTTTTSSEEEEEEEEEEECSSSSHHHHHHHHHHHH
T ss_pred HHHHHhhhccccccccccCceEEEEEEEEEecCCCCHHHHHHHHHHHH
Confidence 4455555555544 222 22 23478899 8888888777654
No 130
>PF09048 Cro: Cro; InterPro: IPR000655 Bacteriophage lambda encodes two repressors: the Cro repressor that acts to turn off early gene transcription during the lytic cycle, and the lambda or cI repressor that is required to maintain lysogenic growth. Together the Cro and cI repressors form a helix-turn-helix (HTH) superfamily. The lambda Cro repressor binds to DNA as a highly flexible dimer. The crystal structure of the lambda Cro repressor [] reveals a HTH DNA-binding protein with an alpha/beta fold that differs from other Cro family members, possibly by an evolutionary fold change []. Most Cro proteins, such as Enterobacteria phage P22 Cro and Bacteriophage 434 Cro, have an all-alpha structure that is thought to be ancestral to lambda Cro, where the fourth and fifth helices are replaced by a beta-sheet, possibly as a result of secondary structure switching rather than by nonhomologous replacement []. This entry represents the lambda-type Cro repressor with an alpha/beta topology.; GO: 0003700 sequence-specific DNA binding transcription factor activity, 0006355 regulation of transcription, DNA-dependent; PDB: 5CRO_A 2ECS_A 2OVG_A 6CRO_A 1D1L_A 2ORC_A 1D1M_B 3ORC_A 1ORC_A 2A63_A ....
Probab=23.67 E-value=1.1e+02 Score=17.96 Aligned_cols=22 Identities=18% Similarity=0.223 Sum_probs=14.7
Q ss_pred HHHHhhC-CCCceEEEEeCCcch
Q psy15537 40 KYIQLLV-PNKTFRLLLAEDGSG 61 (80)
Q Consensus 40 ~~l~~l~-~~~~v~~~~a~DGSg 61 (80)
-++...+ .+.+|.+..-+|||-
T Consensus 27 ~AIsKAlr~gR~I~v~~~~dGs~ 49 (59)
T PF09048_consen 27 SAISKALRAGRNIFVTIMPDGSV 49 (59)
T ss_dssp HHHHHHHHCT-EEEEEEETTSEE
T ss_pred HHHHHHHHcCCcEEEEEcCCCeE
Confidence 3444444 468999999999973
No 131
>PRK13835 conjugal transfer protein TrbH; Provisional
Probab=23.66 E-value=2.3e+02 Score=19.39 Aligned_cols=30 Identities=23% Similarity=0.220 Sum_probs=20.5
Q ss_pred HHHhhCCCCceEEEEeCCcchHHHHHHHHH
Q psy15537 41 YIQLLVPNKTFRLLLAEDGSGKGAGLASAI 70 (80)
Q Consensus 41 ~l~~l~~~~~v~~~~a~DGSg~GAAl~aA~ 70 (80)
-|.+++|-.+=+|.+-+|.|..|.||++++
T Consensus 49 qLae~~pPa~tt~~l~q~~d~Fg~aL~~aL 78 (145)
T PRK13835 49 RLAEQIGPGTTTIKLKKDTSPFGQALEAAL 78 (145)
T ss_pred HHHHhcCCCceEEEEeecCcHHHHHHHHHH
Confidence 344445433456666777799999999886
No 132
>PF11889 DUF3409: Domain of unknown function (DUF3409); InterPro: IPR021824 This domain is functionally uncharacterised. This domain is found in viruses. This presumed domain is about 60 amino acids in length. This domain is found associated with PF00271 from PFAM, PF05550 from PFAM, PF05578 from PFAM. ; GO: 0003968 RNA-directed RNA polymerase activity, 0004197 cysteine-type endopeptidase activity, 0004252 serine-type endopeptidase activity, 0016817 hydrolase activity, acting on acid anhydrides, 0017111 nucleoside-triphosphatase activity, 0070008 serine-type exopeptidase activity
Probab=23.50 E-value=63 Score=18.76 Aligned_cols=14 Identities=50% Similarity=0.710 Sum_probs=11.4
Q ss_pred CCceEEeEeCceee
Q psy15537 16 RDDITIAVDGSLYK 29 (80)
Q Consensus 16 ~~~~~IavDGsvye 29 (80)
++..||-|||--|+
T Consensus 35 PPdativv~GvKYQ 48 (56)
T PF11889_consen 35 PPDATIVVDGVKYQ 48 (56)
T ss_pred CCCceEEEeeeEEE
Confidence 34689999998886
No 133
>PRK08053 sulfur carrier protein ThiS; Provisional
Probab=23.48 E-value=78 Score=17.92 Aligned_cols=48 Identities=10% Similarity=0.178 Sum_probs=29.4
Q ss_pred hhhhhhhhhhcCC--ceEEeEeCceeeeCccHHHHHHHHHHhhCCCCceEEEEeCCc
Q psy15537 5 QGTAVLVRRIDRD--DITIAVDGSLYKYHPRLKHWLQKYIQLLVPNKTFRLLLAEDG 59 (80)
Q Consensus 5 a~iaai~~~~~~~--~~~IavDGsvye~~p~f~~~l~~~l~~l~~~~~v~~~~a~DG 59 (80)
..+..++.+++.+ .+.|++||.+..+.-+=. + .|-++.+|.+...-.|
T Consensus 16 ~tl~~ll~~l~~~~~~vaVavN~~iv~r~~w~~-----~--~L~~gD~Ieii~~v~G 65 (66)
T PRK08053 16 QTVHELLEQLNQLQPGAALAINQQIIPREQWAQ-----H--IVQDGDQILLFQVIAG 65 (66)
T ss_pred CCHHHHHHHcCCCCCcEEEEECCEEeChHHcCc-----c--ccCCCCEEEEEEEccC
Confidence 4577788887754 588999999977521100 0 1224567776655444
No 134
>TIGR01454 AHBA_synth_RP 3-amino-5-hydroxybenoic acid synthesis related protein. The most closely related enzyme below the noise cutoff is IndB which is involved in the biosynthesis of Indigoidine in Pectobacterium (Erwinia) chrysanthemi, a gamma proteobacter. This enzyme is similarly related to PGP. In this case, too it is unclear what role would be be played by a PGPase activity.
Probab=23.47 E-value=51 Score=21.98 Aligned_cols=24 Identities=17% Similarity=0.207 Sum_probs=21.5
Q ss_pred EeCceeeeCccHHHHHHHHHHhhC
Q psy15537 23 VDGSLYKYHPRLKHWLQKYIQLLV 46 (80)
Q Consensus 23 vDGsvye~~p~f~~~l~~~l~~l~ 46 (80)
+||+++-..|.+.+.+++++.+..
T Consensus 5 lDGTL~Ds~~~~~~~~~~~~~~~~ 28 (205)
T TIGR01454 5 LDGVLVDSFAVMREAFAIAYREVV 28 (205)
T ss_pred CcCccccCHHHHHHHHHHHHHHhc
Confidence 699999999999999999998863
No 135
>TIGR02252 DREG-2 REG-2-like, HAD superfamily (subfamily IA) hydrolase. Most likely, these sequences, like the vast majority of HAD sequences, represent phosphatase enzymes.
Probab=23.07 E-value=87 Score=20.69 Aligned_cols=26 Identities=8% Similarity=0.011 Sum_probs=21.4
Q ss_pred EeEeCceeeeCccHHHHHHHHHHhhC
Q psy15537 21 IAVDGSLYKYHPRLKHWLQKYIQLLV 46 (80)
Q Consensus 21 IavDGsvye~~p~f~~~l~~~l~~l~ 46 (80)
.=+||+++...+.+.+.+++.+++.-
T Consensus 5 FDlDGTL~d~~~~~~~a~~~~~~~~g 30 (203)
T TIGR02252 5 FDAVGTLLALKEPVGEVYCEIARKYG 30 (203)
T ss_pred EecCCceeeeCCCHHHHHHHHHHHhC
Confidence 35799999999999988888887653
No 136
>PF09376 NurA: NurA domain; InterPro: IPR018977 This domain is found in several uncharacterised proteins and in NurA, a nuclease exhibiting both single-stranded endonuclease activity and 5'-3' exonuclease activity on single-stranded and double-stranded DNA from the hyperthermophilic archaeon Sulfolobus acidocaldarius []. ; PDB: 1ZUP_B 2YGK_A 3TAZ_B 3TAL_A 3TAI_B.
Probab=22.94 E-value=44 Score=22.33 Aligned_cols=8 Identities=38% Similarity=0.090 Sum_probs=3.8
Q ss_pred CcchHHHH
Q psy15537 58 DGSGKGAG 65 (80)
Q Consensus 58 DGSg~GAA 65 (80)
|||-....
T Consensus 89 DGsl~~~~ 96 (247)
T PF09376_consen 89 DGSLYPSL 96 (247)
T ss_dssp ESSHHHHH
T ss_pred cCCchhhh
Confidence 55554433
No 137
>KOG0369|consensus
Probab=22.88 E-value=1.9e+02 Score=25.39 Aligned_cols=41 Identities=24% Similarity=0.297 Sum_probs=29.6
Q ss_pred CccHHHHHHHHHHhhCCCCceEEEEeCCcchHHHHHHHHHHh
Q psy15537 31 HPRLKHWLQKYIQLLVPNKTFRLLLAEDGSGKGAGLASAIAL 72 (80)
Q Consensus 31 ~p~f~~~l~~~l~~l~~~~~v~~~~a~DGSg~GAAl~aA~a~ 72 (80)
-|.-..++-.++++-.|+-.|++ +++|-||-|.|-..|++.
T Consensus 745 KP~aa~lLi~alRdk~PdlPiHv-HtHDtsGagVAsMlaca~ 785 (1176)
T KOG0369|consen 745 KPEAAKLLIGALRDKFPDLPIHV-HTHDTSGAGVASMLACAL 785 (1176)
T ss_pred CHHHHHHHHHHHHhhCCCCceEE-eccCCccHHHHHHHHHHH
Confidence 46667777777888778878875 689999887666555543
No 138
>PF10923 DUF2791: P-loop Domain of unknown function (DUF2791); InterPro: IPR021228 This is a family of proteins found in archaea and bacteria. Some of the proteins in this family are annotated as being methyl-accepting chemotaxis proteins and ATP/GTP binding proteins.
Probab=22.72 E-value=1.4e+02 Score=23.64 Aligned_cols=48 Identities=27% Similarity=0.387 Sum_probs=33.4
Q ss_pred hhhhhhcCCceEEeEeCceeeeCccHHHHHHHHHHhhC-CCCceEEEEeCCcchHHH
Q psy15537 9 VLVRRIDRDDITIAVDGSLYKYHPRLKHWLQKYIQLLV-PNKTFRLLLAEDGSGKGA 64 (80)
Q Consensus 9 ai~~~~~~~~~~IavDGsvye~~p~f~~~l~~~l~~l~-~~~~v~~~~a~DGSg~GA 64 (80)
|++.+.+-+.++||-| .+.+.+.+.++.+. ....+.|..-+-|||+==
T Consensus 16 GvVP~~Gl~~~~VGr~--------~e~~~l~~~l~~v~~G~s~~kfi~G~YGsGKTf 64 (416)
T PF10923_consen 16 GVVPRIGLDHIAVGRE--------REIEALDRDLDRVADGGSSFKFIRGEYGSGKTF 64 (416)
T ss_pred CCCCcccCcceeechH--------HHHHHHHHHHHHHhCCCCeEEEEEeCCCCcHHH
Confidence 3344444455677655 56677777777765 457899999999999943
No 139
>PRK00767 transcriptional regulator BetI; Validated
Probab=22.63 E-value=40 Score=22.01 Aligned_cols=36 Identities=19% Similarity=0.262 Sum_probs=23.9
Q ss_pred hhhhhhhhhhcCCceEEeEeCceeeeCccHHHHHHHHHHhhC
Q psy15537 5 QGTAVLVRRIDRDDITIAVDGSLYKYHPRLKHWLQKYIQLLV 46 (80)
Q Consensus 5 a~iaai~~~~~~~~~~IavDGsvye~~p~f~~~l~~~l~~l~ 46 (80)
+++-.|.++.+-+ -||+|.++|+=.+.+.+.++...
T Consensus 30 ~s~~~Ia~~aGvs------~gslY~~F~~Ke~L~~~~l~~~~ 65 (197)
T PRK00767 30 ATIAQIARRAGVS------TGIISHYFGGKDGLLEATMRHLL 65 (197)
T ss_pred CCHHHHHHHhCCC------HHHHHHHhCCHHHHHHHHHHHHH
Confidence 3444455544433 28999999998888777776543
No 140
>PRK07440 hypothetical protein; Provisional
Probab=22.51 E-value=80 Score=18.38 Aligned_cols=48 Identities=8% Similarity=0.213 Sum_probs=29.5
Q ss_pred hhhhhhhhhhcCC--ceEEeEeCceeeeCccHHHHHHHHHHhhCCCCceEEEEeCCc
Q psy15537 5 QGTAVLVRRIDRD--DITIAVDGSLYKYHPRLKHWLQKYIQLLVPNKTFRLLLAEDG 59 (80)
Q Consensus 5 a~iaai~~~~~~~--~~~IavDGsvye~~p~f~~~l~~~l~~l~~~~~v~~~~a~DG 59 (80)
..+..+++.++.+ .+.|++||.+..+.-+=.. .|-++.+|++...-.|
T Consensus 20 ~tl~~lL~~l~~~~~~vav~~N~~iv~r~~w~~~-------~L~~gD~IEIv~~v~G 69 (70)
T PRK07440 20 TSLPDLLQQLGFNPRLVAVEYNGEILHRQFWEQT-------QVQPGDRLEIVTIVGG 69 (70)
T ss_pred CCHHHHHHHcCCCCCeEEEEECCEEeCHHHcCce-------ecCCCCEEEEEEEecC
Confidence 4577788877654 5889999999886322111 1224566766655444
No 141
>PRK10513 sugar phosphate phosphatase; Provisional
Probab=22.45 E-value=70 Score=22.23 Aligned_cols=35 Identities=20% Similarity=0.255 Sum_probs=24.4
Q ss_pred EeEeCceeeeCccHHHHHHHHHHhhCCCCceEEEEe
Q psy15537 21 IAVDGSLYKYHPRLKHWLQKYIQLLVPNKTFRLLLA 56 (80)
Q Consensus 21 IavDGsvye~~p~f~~~l~~~l~~l~~~~~v~~~~a 56 (80)
+=+|||++...-...+...++|+++. +..+.+.++
T Consensus 8 ~DlDGTLl~~~~~i~~~~~~ai~~l~-~~G~~~~ia 42 (270)
T PRK10513 8 IDMDGTLLLPDHTISPAVKQAIAAAR-AKGVNVVLT 42 (270)
T ss_pred EecCCcCcCCCCccCHHHHHHHHHHH-HCCCEEEEe
Confidence 45799999776667777788888875 233555555
No 142
>PF06523 DUF1106: Protein of unknown function (DUF1106); InterPro: IPR009490 This family consists of several hypothetical bacterial proteins found in Escherichia coli and Citrobacter rodentium. The function of this family is unknown.
Probab=22.44 E-value=1.2e+02 Score=18.91 Aligned_cols=21 Identities=10% Similarity=0.433 Sum_probs=17.2
Q ss_pred HHHHHHHHHhhCCCCceEEEE
Q psy15537 35 KHWLQKYIQLLVPNKTFRLLL 55 (80)
Q Consensus 35 ~~~l~~~l~~l~~~~~v~~~~ 55 (80)
++.+-+.++|++|+..|+|-+
T Consensus 67 k~yi~dklkeifpde~iefsi 87 (91)
T PF06523_consen 67 KNYIFDKLKEIFPDEAIEFSI 87 (91)
T ss_pred chhHHHHHHHhCCCCceEEEE
Confidence 577889999999998777754
No 143
>PRK10073 putative glycosyl transferase; Provisional
Probab=22.32 E-value=1.1e+02 Score=22.45 Aligned_cols=29 Identities=14% Similarity=0.469 Sum_probs=18.8
Q ss_pred HHHHHHHHhhCCC--CceEEEEeCCcchHHH
Q psy15537 36 HWLQKYIQLLVPN--KTFRLLLAEDGSGKGA 64 (80)
Q Consensus 36 ~~l~~~l~~l~~~--~~v~~~~a~DGSg~GA 64 (80)
+.+.++++.++.+ .++++.+..|||-=+.
T Consensus 19 ~~L~~~l~Sl~~Qt~~~~EIIiVdDgStD~t 49 (328)
T PRK10073 19 KDFRAFMESLIAQTWTALEIIIVNDGSTDNS 49 (328)
T ss_pred HHHHHHHHHHHhCCCCCeEEEEEeCCCCccH
Confidence 4566666666633 4677888888876443
No 144
>PF08758 Cadherin_pro: Cadherin prodomain like; InterPro: IPR014868 Cadherins are a group of proteins that mediate calcium dependent cell-cell adhesion. They are activated through cleavage of a prosequence in the late Golgi. This protein corresponds to the folded region of the prosequence, and is termed the prodomain. The prodomain shows structural resemblance to the cadherin domain, but lacks all the features known to be important for cadherin-cadherin interactions []. ; GO: 0007155 cell adhesion, 0016021 integral to membrane; PDB: 1OP4_A.
Probab=22.26 E-value=42 Score=20.77 Aligned_cols=16 Identities=38% Similarity=0.673 Sum_probs=12.3
Q ss_pred ceEEeEeCceeeeCcc
Q psy15537 18 DITIAVDGSLYKYHPR 33 (80)
Q Consensus 18 ~~~IavDGsvye~~p~ 33 (80)
+..|.=|||||-..|.
T Consensus 47 dF~V~~DGsVy~~r~v 62 (90)
T PF08758_consen 47 DFRVLEDGSVYAKRPV 62 (90)
T ss_dssp EEEEETTTEEEEES--
T ss_pred CEEEcCCCeEEEeeeE
Confidence 5778999999999764
No 145
>TIGR01993 Pyr-5-nucltdase pyrimidine 5'-nucleotidase. These enzymes are members of the haloacid dehalogenase (HAD) superfamily of hydrolases, specifically the IA subfamily (variant 3, TIGR01509).
Probab=22.24 E-value=64 Score=21.10 Aligned_cols=24 Identities=17% Similarity=0.256 Sum_probs=19.1
Q ss_pred EeEeCceeeeCccHHHHHHHHHHh
Q psy15537 21 IAVDGSLYKYHPRLKHWLQKYIQL 44 (80)
Q Consensus 21 IavDGsvye~~p~f~~~l~~~l~~ 44 (80)
.=.||+++...+.+...+++++.+
T Consensus 5 FDlDGTL~ds~~~~~~~~~~~~~~ 28 (184)
T TIGR01993 5 FDLDNTLYPHSAGIFLQIDRNITE 28 (184)
T ss_pred EeCCCCCCCCcccHHHHHHHHHHH
Confidence 347999999988888888777654
No 146
>TIGR02259 benz_CoA_red_A benzoyl-CoA reductase, bcr type, subunit A. This model describes A, or gamma, subunit of the bcr type of benzoyl-CoA reductase, a 4-subunit enzyme. Many aromatic compounds are metabolized by way of benzoyl-CoA. This family shows strong sequence similarity to the 2-hydroxyglutaryl-CoA dehydratase alpha chain and to subunits of different types of benzoyl-CoA reductase (such as the bzd type).
Probab=22.11 E-value=87 Score=25.08 Aligned_cols=47 Identities=15% Similarity=0.051 Sum_probs=27.3
Q ss_pred ceEEeEeCceeeeCccHHHHHHHHHHhhCCCCceEEEEeCCcc---hHHHHHH
Q psy15537 18 DITIAVDGSLYKYHPRLKHWLQKYIQLLVPNKTFRLLLAEDGS---GKGAGLA 67 (80)
Q Consensus 18 ~~~IavDGsvye~~p~f~~~l~~~l~~l~~~~~v~~~~a~DGS---g~GAAl~ 67 (80)
+-.|..-|++- ++|.+.+.+.+.+.+=.++.+| ...++.. .+|||++
T Consensus 382 ~~~VvftGGvA-~N~gvv~aLe~~L~~~~~~~~V--~Vp~~pq~~GALGAAL~ 431 (432)
T TIGR02259 382 TDQFTFTGGVA-KNEAAVKELRKLIKENYGEVQI--NIDPDSIYTGALGASEF 431 (432)
T ss_pred CCCEEEECCcc-ccHHHHHHHHHHHccccCCCeE--ecCCCccHHHHHHHHHh
Confidence 44688889976 6778887777777432233333 3344433 3455554
No 147
>PRK14089 ipid-A-disaccharide synthase; Provisional
Probab=22.09 E-value=2e+02 Score=21.86 Aligned_cols=33 Identities=21% Similarity=0.238 Sum_probs=25.0
Q ss_pred CCceEEeEeCceeeeCccHHHHHHHHHHhhCCCCceEEE
Q psy15537 16 RDDITIAVDGSLYKYHPRLKHWLQKYIQLLVPNKTFRLL 54 (80)
Q Consensus 16 ~~~~~IavDGsvye~~p~f~~~l~~~l~~l~~~~~v~~~ 54 (80)
++++.|.|| ||.|--++.+.+++..|+.++...
T Consensus 76 ~pd~~i~iD------~p~Fnl~lak~~k~~~~~i~viyy 108 (347)
T PRK14089 76 QADKVLLMD------SSSFNIPLAKKIKKAYPKKEIIYY 108 (347)
T ss_pred CCCEEEEeC------CCCCCHHHHHHHHhcCCCCCEEEE
Confidence 346888888 899999999999988666544433
No 148
>cd00565 ThiS ThiaminS ubiquitin-like sulfur carrier protein. ThiS (ThiaminS) is a sulfur carrier protein involved in thiamin biosynthesis in bacteria. The ThiS fold, like those of two closely related proteins MoaD and Urm1, is similar to that of ubiquitin although there is little or no sequence similarity.
Probab=22.08 E-value=83 Score=17.63 Aligned_cols=26 Identities=19% Similarity=0.377 Sum_probs=19.9
Q ss_pred hhhhhhhhhhcCC--ceEEeEeCceeee
Q psy15537 5 QGTAVLVRRIDRD--DITIAVDGSLYKY 30 (80)
Q Consensus 5 a~iaai~~~~~~~--~~~IavDGsvye~ 30 (80)
+.+..++.+++.+ .+.|++||.+..+
T Consensus 15 ~tv~~ll~~l~~~~~~i~V~vNg~~v~~ 42 (65)
T cd00565 15 ATLAELLEELGLDPRGVAVALNGEIVPR 42 (65)
T ss_pred CCHHHHHHHcCCCCCcEEEEECCEEcCH
Confidence 4567788887654 5889999998766
No 149
>COG0554 GlpK Glycerol kinase [Energy production and conversion]
Probab=21.95 E-value=2.8e+02 Score=22.74 Aligned_cols=49 Identities=14% Similarity=0.266 Sum_probs=34.8
Q ss_pred eEEeEeCceeeeCccHHHHHHHHHHhhCCCCceEEEEeCCcchHHHHHHHHHHhH
Q psy15537 19 ITIAVDGSLYKYHPRLKHWLQKYIQLLVPNKTFRLLLAEDGSGKGAGLASAIALK 73 (80)
Q Consensus 19 ~~IavDGsvye~~p~f~~~l~~~l~~l~~~~~v~~~~a~DGSg~GAAl~aA~a~~ 73 (80)
..+-|||+.-+. +.+++..-+++ +.+|.--...+-+.+|||++|.++.-
T Consensus 404 ~~LrvDGG~s~n-----~~lmQfqADil-g~~V~Rp~~~EtTAlGaA~lAGla~G 452 (499)
T COG0554 404 TRLRVDGGASRN-----NFLMQFQADIL-GVPVERPVVLETTALGAAYLAGLAVG 452 (499)
T ss_pred eeEEEcCccccc-----hhHHHHHHHHh-CCeeeccccchhhHHHHHHHHhhhhC
Confidence 568999999776 34444555665 23555555678899999999988753
No 150
>cd00497 PseudoU_synth_TruA_like PseudoU_synth_TruA: Pseudouridine synthase, TruA family. This group consists of eukaryotic, bacterial and archeal pseudouridine synthases similar to Escherichia coli TruA, Saccharomyces cerevisiae Pus1p, S. cerevisiae Pus3p Caenorhabditis elegans Pus1p and human PUS1. Pseudouridine synthases catalyze the isomerization of specific uridines in an RNA molecule to pseudouridines (5-ribosyluracil, psi). No cofactors are required. S. cerevisiae PUS1 catalyzes the formation of psi34 and psi36 in the intron containing tRNAIle, psi35 in the intron containing tRNATyr, psi27 and/or psi28 in several yeast cytoplasmic tRNAs and, psi44 in U2 small nuclear RNA (U2 snRNA). The presence of the intron is required for the formation of psi 34, 35 and 36. In addition S. cerevisiae PUS1 makes psi 26, 65 and 67. C. elegans Pus1p does not modify psi44 in U2 snRNA. S. cerevisiae Pus3p makes psi38 and psi39 in tRNAs. Psi44 in U2 snRNA and, psi38 and psi39 in tRNAs are highly ph
Probab=21.93 E-value=1.6e+02 Score=20.77 Aligned_cols=27 Identities=30% Similarity=0.239 Sum_probs=20.1
Q ss_pred EeEeCcee---eeC---ccHHHHHHHHHHhhCC
Q psy15537 21 IAVDGSLY---KYH---PRLKHWLQKYIQLLVP 47 (80)
Q Consensus 21 IavDGsvy---e~~---p~f~~~l~~~l~~l~~ 47 (80)
|+.||+-| ++. +.-++.++++|..+..
T Consensus 1 i~YdGt~f~G~Q~Q~~~~TVq~~le~aL~~~~~ 33 (224)
T cd00497 1 IAYDGWKYHGFQRQPDVPTVEGELFKALLKTGL 33 (224)
T ss_pred CEEcCCceeeEeeCcCCCCHHHHHHHHHHHhhC
Confidence 57899987 333 3588899999988763
No 151
>smart00489 DEXDc3 DEAD-like helicases superfamily.
Probab=21.85 E-value=3.1e+02 Score=19.97 Aligned_cols=42 Identities=19% Similarity=0.305 Sum_probs=28.8
Q ss_pred eeeCccHHHHHHHHHHhhCCCCceEEEEeCCcchHHHHHHHHH
Q psy15537 28 YKYHPRLKHWLQKYIQLLVPNKTFRLLLAEDGSGKGAGLASAI 70 (80)
Q Consensus 28 ye~~p~f~~~l~~~l~~l~~~~~v~~~~a~DGSg~GAAl~aA~ 70 (80)
|+.+|.=.++|+...+.+. +...-+..++-|+|+=.+++.+.
T Consensus 7 y~~r~~Q~~~m~~v~~~~~-~~~~~~~eapTGtGKTl~~L~~a 48 (289)
T smart00489 7 YEPYPIQYEFMEELKRVLD-RGKIGILESPTGTGKTLSLLCLT 48 (289)
T ss_pred CCCCHHHHHHHHHHHHHHH-cCCcEEEECCCCcchhHHHHHHH
Confidence 5666666666666555554 34556788999999997776544
No 152
>smart00488 DEXDc2 DEAD-like helicases superfamily.
Probab=21.85 E-value=3.1e+02 Score=19.97 Aligned_cols=42 Identities=19% Similarity=0.305 Sum_probs=28.8
Q ss_pred eeeCccHHHHHHHHHHhhCCCCceEEEEeCCcchHHHHHHHHH
Q psy15537 28 YKYHPRLKHWLQKYIQLLVPNKTFRLLLAEDGSGKGAGLASAI 70 (80)
Q Consensus 28 ye~~p~f~~~l~~~l~~l~~~~~v~~~~a~DGSg~GAAl~aA~ 70 (80)
|+.+|.=.++|+...+.+. +...-+..++-|+|+=.+++.+.
T Consensus 7 y~~r~~Q~~~m~~v~~~~~-~~~~~~~eapTGtGKTl~~L~~a 48 (289)
T smart00488 7 YEPYPIQYEFMEELKRVLD-RGKIGILESPTGTGKTLSLLCLT 48 (289)
T ss_pred CCCCHHHHHHHHHHHHHHH-cCCcEEEECCCCcchhHHHHHHH
Confidence 5666666666666555554 34556788999999997776544
No 153
>PLN02990 Probable pectinesterase/pectinesterase inhibitor
Probab=21.82 E-value=50 Score=27.04 Aligned_cols=38 Identities=21% Similarity=0.422 Sum_probs=28.1
Q ss_pred ccHHHHHHHHHHhhCC---CCceEEEEeCCcch----HHHHHHHH
Q psy15537 32 PRLKHWLQKYIQLLVP---NKTFRLLLAEDGSG----KGAGLASA 69 (80)
Q Consensus 32 p~f~~~l~~~l~~l~~---~~~v~~~~a~DGSg----~GAAl~aA 69 (80)
..|+.+|...-++||. ..+-.+..+.|||| +.+|+.++
T Consensus 237 ~~~p~w~~~~drrll~~~~~~~~~~~Va~dGsG~f~TIq~Av~a~ 281 (572)
T PLN02990 237 DGIPSWVGPNTRRLMATKGGVKANVVVAQDGSGQYKTINEALNAV 281 (572)
T ss_pred cCCCccCChhhhhhhhcccCCCceEEECCCCCCCCcCHHHHHhhC
Confidence 4688888888888882 24557789999998 66676554
No 154
>COG0253 DapF Diaminopimelate epimerase [Amino acid transport and metabolism]
Probab=21.75 E-value=36 Score=25.38 Aligned_cols=46 Identities=22% Similarity=0.318 Sum_probs=27.0
Q ss_pred hcCCceEEeEe----------CceeeeCccHHHHHHHHHHhhCCCCceEEEEeCCc
Q psy15537 14 IDRDDITIAVD----------GSLYKYHPRLKHWLQKYIQLLVPNKTFRLLLAEDG 59 (80)
Q Consensus 14 ~~~~~~~IavD----------Gsvye~~p~f~~~l~~~l~~l~~~~~v~~~~a~DG 59 (80)
|+-++.++=|| |.+++.||.|.+...=.+.+.+....+.++.=|-|
T Consensus 153 ~GnPH~V~~Vddv~~~~~~~~g~~l~~h~~Fp~~vNV~F~~v~~~~~i~vrv~ERG 208 (272)
T COG0253 153 MGNPHLVIFVDDVETANLEELGPLLESHELFPEGVNVGFVQVLSRDAIRLRVYERG 208 (272)
T ss_pred cCCCeEEEEeCCcccchhhhhhhhhhcCccCCCceEEEEEEeCCCCcEEEEEeecC
Confidence 44445555555 77777788777666555545544445555554444
No 155
>cd06538 CIDE_N_FSP27 CIDE_N domain of FSP27 proteins. The CIDE-N (cell death-inducing DFF45-like effector, N-terminal) domain is found in the FSP27/CIDE-C protein, which has been identified as a n adipocyte lipid droplet protein that negatively regulates lipolysis and promotes triglyceride accumulation. The CIDE protein family includes 3 members: CIDE-A, CIDE-B, and FSP27(CIDE-C). Based on sequence similarity with DFF40 and DFF45, CIDE proteins were initially characterized as mitochondrial activators of apoptosis. The CIDE-N domain of FSP27 is sufficient to increase apoptosis in vitro when overexpressed.
Probab=21.58 E-value=1.9e+02 Score=17.76 Aligned_cols=30 Identities=33% Similarity=0.491 Sum_probs=20.9
Q ss_pred cHHHHHHHHHHhhCCCCceEEEEeCCcchH
Q psy15537 33 RLKHWLQKYIQLLVPNKTFRLLLAEDGSGK 62 (80)
Q Consensus 33 ~f~~~l~~~l~~l~~~~~v~~~~a~DGSg~ 62 (80)
.+.+.+.++-+.+.-.+.+++.+.+||..+
T Consensus 22 sL~eL~~K~~~~l~l~~~~~lvL~eDGT~V 51 (79)
T cd06538 22 SLEDLLNKVLDALLLDCISSLVLDEDGTGV 51 (79)
T ss_pred CHHHHHHHHHHHcCCCCccEEEEecCCcEE
Confidence 466677766655553346899999999754
No 156
>KOG3935|consensus
Probab=21.53 E-value=69 Score=25.33 Aligned_cols=33 Identities=24% Similarity=0.343 Sum_probs=17.9
Q ss_pred eCccHHHHHHHHHHhhCC-----CCceEEEEeCCcchHHH
Q psy15537 30 YHPRLKHWLQKYIQLLVP-----NKTFRLLLAEDGSGKGA 64 (80)
Q Consensus 30 ~~p~f~~~l~~~l~~l~~-----~~~v~~~~a~DGSg~GA 64 (80)
.||.|++-++....+-.| +.+=++.++ |+|+|-
T Consensus 309 s~Pt~~~Ale~~~~~~~Pi~Ll~GGEptv~ls--g~G~GG 346 (446)
T KOG3935|consen 309 SYPTFRRALENLTIENYPIALLFGGEPTVHLS--GPGKGG 346 (446)
T ss_pred cchHHHHHHHhhhhccCCeEEEeCCCceEEec--CCCCCc
Confidence 367777777666655333 122223344 777763
No 157
>PF02035 Coagulin: Coagulin; InterPro: IPR000275 Coagulogen is a gel-forming protein of hemolymph that hinders the spread of invaders by immobilising them [, ]. The protein contains a single 175- residue polypeptide chain; this is cleaved after Arg-18 and Arg-46 by a clotting enzyme contained in the hemocyte and activated by a bacterial endotoxin (lipopolysaccharide). Cleavage releases two chains of coagulin, A and B, linked by two disulphide bonds, together with the peptide C [, ]. Gel formation results from interlinking of coagulin molecules. Secondary structure prediction suggests the C peptide forms an alpha- helix, which is released during the proteolytic conversion of coagulogen to coagulin gel []. The beta-sheet structure and 16 half-cystines found in the molecule appear to yield a compact protein stable to acid and heat. Mammalian blood coagulation is based on the proteolytically induced polymerisation of fibrinogens. Initially, fibrin monomers noncovalently interact with each other. The resulting homopolymers are further stabilised when the plasma transglutaminase (TGase) intermolecularly cross-links epsilon-(gamma-glutamyl)lysine bonds. In crustaceans, hemolymph coagulation depends on the TGase-mediated cross-linking of specific plasma-clotting proteins, but without the proteolytic cascade. In horseshoe crabs, the proteolytic coagulation cascade triggered by lipopolysaccharides and beta-1,3-glucans leads to the conversion of coagulogen into coagulin, resulting in noncovalent coagulin homopolymers through head-to-tail interaction. Horseshoe crab TGase, however, does not cross-link coagulins intermolecularly. Recently, we found that coagulins are cross-linked on hemocyte cell surface proteins called proxins. This indicates that a cross-linking reaction at the final stage of hemolymph coagulation is an important innate immune system of horseshoe crabs [].; GO: 0042381 hemolymph coagulation, 0005576 extracellular region; PDB: 1AOC_A.
Probab=21.45 E-value=42 Score=23.11 Aligned_cols=15 Identities=20% Similarity=0.416 Sum_probs=11.2
Q ss_pred ceeeeCccHHHHHHH
Q psy15537 26 SLYKYHPRLKHWLQK 40 (80)
Q Consensus 26 svye~~p~f~~~l~~ 40 (80)
|+|.+||.|++.-.-
T Consensus 47 s~f~~hpvfrecgky 61 (174)
T PF02035_consen 47 SIFSHHPVFRECGKY 61 (174)
T ss_dssp GGCCCSHHHHHH---
T ss_pred eeecCChhHhhccce
Confidence 899999999987653
No 158
>TIGR00368 Mg chelatase-related protein. The N-terminal end matches very strongly a pfam Mg_chelatase domain.
Probab=21.37 E-value=1.8e+02 Score=23.29 Aligned_cols=39 Identities=21% Similarity=0.225 Sum_probs=29.1
Q ss_pred HHHHHHHHHhhC---CCCceEEEEe-----CCcchHHHHHHHHHHhH
Q psy15537 35 KHWLQKYIQLLV---PNKTFRLLLA-----EDGSGKGAGLASAIALK 73 (80)
Q Consensus 35 ~~~l~~~l~~l~---~~~~v~~~~a-----~DGSg~GAAl~aA~a~~ 73 (80)
++++..+++.+= |..+|++-++ ++|++..-|+++|+...
T Consensus 41 ~erVr~al~n~g~~~p~~~I~VNlaPggl~k~g~~~DLaIA~ailsa 87 (499)
T TIGR00368 41 RERVKSAIKNSGFHFPAKRITINLAPADLPKEGGRFDLPIAIGILAA 87 (499)
T ss_pred HHHHHHHHHhcCCCCCCeeEEEEecCCCeeccCccccHHHHHHHHHh
Confidence 556666665543 5678888887 79999999999888655
No 159
>PF13186 SPASM: Iron-sulfur cluster-binding domain
Probab=21.37 E-value=65 Score=17.13 Aligned_cols=14 Identities=29% Similarity=0.615 Sum_probs=10.8
Q ss_pred CceEEeEeCceeee
Q psy15537 17 DDITIAVDGSLYKY 30 (80)
Q Consensus 17 ~~~~IavDGsvye~ 30 (80)
..++|.-||.||-=
T Consensus 6 ~~~~I~~dG~v~pC 19 (64)
T PF13186_consen 6 NSLYIDPDGDVYPC 19 (64)
T ss_pred eEEEEeeCccEEeC
Confidence 35788999999763
No 160
>COG2605 Predicted kinase related to galactokinase and mevalonate kinase [General function prediction only]
Probab=21.25 E-value=1.5e+02 Score=22.95 Aligned_cols=38 Identities=18% Similarity=0.211 Sum_probs=23.2
Q ss_pred eeeCccHHHHHHHHHHhhCCCCceEEEEe---CCcchHHHH
Q psy15537 28 YKYHPRLKHWLQKYIQLLVPNKTFRLLLA---EDGSGKGAG 65 (80)
Q Consensus 28 ye~~p~f~~~l~~~l~~l~~~~~v~~~~a---~DGSg~GAA 65 (80)
++|-|.|++.+..-+-++-...++++... +-|||+|.-
T Consensus 67 ~~h~~~~~~~l~r~~l~~~g~~~~el~~~~D~P~GSGLGSS 107 (333)
T COG2605 67 NEHKPLVVESLKRDFLEFNGGTPIELHTQSDAPPGSGLGSS 107 (333)
T ss_pred hcCchHHHHHHHHHHHhhcCCCceEEEEecCCCCCCCCCch
Confidence 34447777777766656654444555544 448999853
No 161
>TIGR00241 CoA_E_activ CoA-substrate-specific enzyme activase, putative. This domain may be involved in generating or regenerating the active sites of enzymes related to (R)-2-hydroxyglutaryl-CoA dehydratase and benzoyl-CoA reductase.
Probab=21.24 E-value=1.3e+02 Score=21.08 Aligned_cols=22 Identities=27% Similarity=0.348 Sum_probs=17.2
Q ss_pred EEeEeCceeeeCccHHHHHHHHH
Q psy15537 20 TIAVDGSLYKYHPRLKHWLQKYI 42 (80)
Q Consensus 20 ~IavDGsvye~~p~f~~~l~~~l 42 (80)
.|.+.|++.. +|.+.+.+.+.+
T Consensus 206 ~Vvl~GGva~-n~~l~~~l~~~l 227 (248)
T TIGR00241 206 PIVFTGGVSK-NKGLVKALEKKL 227 (248)
T ss_pred CEEEECcccc-CHHHHHHHHHHh
Confidence 5888999876 777887777766
No 162
>PF00173 Cyt-b5: Cytochrome b5-like Heme/Steroid binding domain This prints entry is a subset of the Pfam entry; InterPro: IPR001199 Cytochromes b5 are ubiquitous electron transport proteins found in animals, plants and yeasts []. The microsomal and mitochondrial variants are membrane-bound, while those from erythrocytes and other animal tissues are water-soluble [, ]. The 3D structure of bovine cyt b5 is known, the fold belonging to the alpha+beta class, with 5 strands and 5 short helices forming a framework for supporting a central haem group []. The cytochrome b5 domain is similar to that of a number of oxidoreductases, such as plant and fungal nitrate reductases, sulphite oxidase, yeast flavocytochrome b2 (L-lactate dehydrogenase) and plant cyt b5/acyl lipid desaturase fusion protein.; GO: 0020037 heme binding; PDB: 2I96_A 3KS0_A 1KBI_B 1KBJ_B 1LTD_A 1SZG_B 1SZF_A 1LDC_B 2OZ0_B 1LCO_A ....
Probab=21.20 E-value=71 Score=17.92 Aligned_cols=16 Identities=25% Similarity=0.609 Sum_probs=12.5
Q ss_pred cCCceEEeEeCceeee
Q psy15537 15 DRDDITIAVDGSLYKY 30 (80)
Q Consensus 15 ~~~~~~IavDGsvye~ 30 (80)
...++-|++||.||--
T Consensus 12 ~~~~~~v~i~g~VYDv 27 (76)
T PF00173_consen 12 KKGDCWVIIDGKVYDV 27 (76)
T ss_dssp ETTEEEEEETTEEEEC
T ss_pred CCCCEEEEECCEEccc
Confidence 3456889999999864
No 163
>TIGR01990 bPGM beta-phosphoglucomutase. The enzyme from L. lactis has been extensively characterized including a remarkable crystal structure which traps the pentacoordinate transition state.
Probab=21.13 E-value=63 Score=20.81 Aligned_cols=24 Identities=13% Similarity=0.008 Sum_probs=19.3
Q ss_pred eEeCceeeeCccHHHHHHHHHHhh
Q psy15537 22 AVDGSLYKYHPRLKHWLQKYIQLL 45 (80)
Q Consensus 22 avDGsvye~~p~f~~~l~~~l~~l 45 (80)
=+||+++..++.+.+.+++.+.+.
T Consensus 5 D~DGTL~ds~~~~~~~~~~~~~~~ 28 (185)
T TIGR01990 5 DLDGVITDTAEYHYLAWKALADEL 28 (185)
T ss_pred cCCCccccChHHHHHHHHHHHHHc
Confidence 379999999998888888766554
No 164
>PRK09417 mogA molybdenum cofactor biosynthesis protein MogA; Provisional
Probab=21.10 E-value=80 Score=22.12 Aligned_cols=37 Identities=19% Similarity=0.316 Sum_probs=26.5
Q ss_pred eeeCccHHHHHHHHHHhhCCCCceEEEEeCCcchHHH
Q psy15537 28 YKYHPRLKHWLQKYIQLLVPNKTFRLLLAEDGSGKGA 64 (80)
Q Consensus 28 ye~~p~f~~~l~~~l~~l~~~~~v~~~~a~DGSg~GA 64 (80)
|..-|-=.+.+++++++++.+....+.+...|+|+|-
T Consensus 44 ~~iVpDd~~~I~~aL~~a~~~~~~DlIITTGGtg~g~ 80 (193)
T PRK09417 44 TRLIPDEQDLIEQTLIELVDEMGCDLVLTTGGTGPAR 80 (193)
T ss_pred EEECCCCHHHHHHHHHHHhhcCCCCEEEECCCCCCCC
Confidence 3455666677888888887544567788889988763
No 165
>cd04196 GT_2_like_d Subfamily of Glycosyltransferase Family GT2 of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families.
Probab=21.07 E-value=1.4e+02 Score=19.06 Aligned_cols=19 Identities=16% Similarity=0.162 Sum_probs=8.9
Q ss_pred eEEEEeCCcchHHHHHHHH
Q psy15537 51 FRLLLAEDGSGKGAGLASA 69 (80)
Q Consensus 51 v~~~~a~DGSg~GAAl~aA 69 (80)
+.+...+.+.|.+.++..+
T Consensus 56 ~~~~~~~~~~G~~~~~n~g 74 (214)
T cd04196 56 IILIRNGKNLGVARNFESL 74 (214)
T ss_pred EEEEeCCCCccHHHHHHHH
Confidence 4444444445555554433
No 166
>cd06059 Tubulin The tubulin superfamily includes five distinct families, the alpha-, beta-, gamma-, delta-, and epsilon-tubulins and a sixth family (zeta-tubulin) which is present only in kinetoplastid protozoa. The alpha- and beta-tubulins are the major components of microtubules, while gamma-tubulin plays a major role in the nucleation of microtubule assembly. The delta- and epsilon-tubulins are widespread but unlike the alpha, beta, and gamma-tubulins they are not ubiquitous among eukaryotes. The alpha/beta-tubulin heterodimer is the structural subunit of microtubules. The alpha- and beta-tubulins share 40% amino-acid sequence identity, exist in several isotype forms, and undergo a variety of posttranslational modifications. The structures of alpha- and beta-tubulin are basically identical: each monomer is formed by a core of two beta-sheets surrounded by alpha-helices. The monomer structure is very compact, but can be divided into three regions based on function: the amino-termi
Probab=21.04 E-value=1.5e+02 Score=22.44 Aligned_cols=40 Identities=25% Similarity=0.283 Sum_probs=23.3
Q ss_pred CccHHHHHHHHHHhhCCCC----ceEEEEeCC---cchHHHHHHHHH
Q psy15537 31 HPRLKHWLQKYIQLLVPNK----TFRLLLAED---GSGKGAGLASAI 70 (80)
Q Consensus 31 ~p~f~~~l~~~l~~l~~~~----~v~~~~a~D---GSg~GAAl~aA~ 70 (80)
-+.+.+.+.+.++..+.+| .|.+..+-+ |||.|+-++-.+
T Consensus 68 g~~~~e~~~d~ir~~~E~cD~l~gf~i~~sl~GGTGSG~gs~l~e~l 114 (382)
T cd06059 68 GPELIDEILDRIRKQVEKCDSLQGFQITHSLGGGTGSGLGSLLLELL 114 (382)
T ss_pred CHHHHHHHHHHHHHHHHhCCCcCceEEEEecCCCcchhHHHHHHHHH
Confidence 3567777777777777543 245554444 356665554433
No 167
>PF08282 Hydrolase_3: haloacid dehalogenase-like hydrolase; InterPro: IPR013200 The Haloacid Dehydrogenase (HAD) superfamily includes phosphatases, phosphonatases, P-type ATPases, beta-phosphoglucomutases, phosphomannomutases, and dehalogenases, which are involved in a variety of cellular processes ranging from amino acid biosynthesis to detoxification []. This HAD domain is found in several distinct enzymes including: Phospholipid-transporting ATPase 1 (3.6.3.1 from EC), a putative lipid-flipping enzyme involved in cold tolerance in Arabidopsis [] 3-deoxy-D-manno-octulosonate (KDO) 8-phosphate phosphatase (3.1.3.45 from EC), which catalyses the final step in the biosynthesis of KDO - a component of lipopolysaccharide in Gram-negative bacteria [] Mannosyl-3-phosphoglycerate phosphatase (3.1.3.70 from EC), which hydrolyzes mannosyl-3-phosphoglycerate to form the osmolyte mannosylglycerate [] Phosphoglycolate phopshatase (3.1.3.18 from EC), which catalyses the dephosphorylation of 2-phosphoglycolate [] ; PDB: 2B30_B 3R4C_A 1XVI_B 3IJ5_B 3MMZ_C 3L7Y_A 1XPJ_C 1RLT_B 1RLM_B 2HF2_A ....
Probab=20.90 E-value=61 Score=21.36 Aligned_cols=24 Identities=17% Similarity=0.228 Sum_probs=19.4
Q ss_pred EeCceeeeCccHHHHHHHHHHhhC
Q psy15537 23 VDGSLYKYHPRLKHWLQKYIQLLV 46 (80)
Q Consensus 23 vDGsvye~~p~f~~~l~~~l~~l~ 46 (80)
+|||+....-...+...++++++.
T Consensus 5 lDGTLl~~~~~i~~~~~~al~~l~ 28 (254)
T PF08282_consen 5 LDGTLLNSDGKISPETIEALKELQ 28 (254)
T ss_dssp CCTTTCSTTSSSCHHHHHHHHHHH
T ss_pred ECCceecCCCeeCHHHHHHHHhhc
Confidence 699998876667788888888876
No 168
>PF00091 Tubulin: Tubulin/FtsZ family, GTPase domain; InterPro: IPR003008 This domain is found in all tubulin chains, as well as the bacterial FtsZ family of proteins. These proteins are involved in polymer formation. Tubulin is the major component of microtubules, while FtsZ is the polymer-forming protein of bacterial cell division, it is part of a ring in the middle of the dividing cell that is required for constriction of cell membrane and cell envelope to yield two daughter cells. FtsZ and tubulin are GTPases, this entry is the GTPase domain. FtsZ can polymerise into tubes, sheets, and rings in vitro and is ubiquitous in bacteria and archaea.; GO: 0051258 protein polymerization, 0043234 protein complex; PDB: 3E22_B 1SA1_D 3DU7_B 2P4N_B 3DCO_B 1Z2B_D 1SA0_B 2XRP_C 1TVK_B 2BTQ_B ....
Probab=20.71 E-value=2.9e+02 Score=18.95 Aligned_cols=44 Identities=18% Similarity=0.169 Sum_probs=22.7
Q ss_pred HHHHHHHHHHhhC---CCCceEEEEeCCcchHHHHHHHHHHhHHhhh
Q psy15537 34 LKHWLQKYIQLLV---PNKTFRLLLAEDGSGKGAGLASAIALKLGAF 77 (80)
Q Consensus 34 f~~~l~~~l~~l~---~~~~v~~~~a~DGSg~GAAl~aA~a~~~~~~ 77 (80)
..+...+.|++.+ +.....+...-=|-|.|..+...++...++.
T Consensus 106 ~~~~~~~~ir~~~e~~d~~~~~~i~~slgGGTGSG~~~~l~~~l~~~ 152 (216)
T PF00091_consen 106 ALEEILEQIRKEIEKCDSLDGFFIVHSLGGGTGSGLGPVLAEMLREE 152 (216)
T ss_dssp HHHHHHHHHHHHHHTSTTESEEEEEEESSSSHHHHHHHHHHHHHHHT
T ss_pred cccccccccchhhccccccccceecccccceeccccccccchhhhcc
Confidence 4455556666666 3334444444445455555555555554444
No 169
>TIGR02009 PGMB-YQAB-SF beta-phosphoglucomutase family hydrolase. All of these are members of the larger Haloacid dehalogenase (HAD) subfamily IA and include the "variant 3" glu-asp version of the third conserved HAD domain (TIGR01509).
Probab=20.63 E-value=63 Score=20.83 Aligned_cols=26 Identities=12% Similarity=0.116 Sum_probs=21.0
Q ss_pred EeEeCceeeeCccHHHHHHHHHHhhC
Q psy15537 21 IAVDGSLYKYHPRLKHWLQKYIQLLV 46 (80)
Q Consensus 21 IavDGsvye~~p~f~~~l~~~l~~l~ 46 (80)
.=+||+++...+.....+++++++.-
T Consensus 6 FD~DGTL~ds~~~~~~~~~~~~~~~g 31 (185)
T TIGR02009 6 FDMDGVIVDTAPLHAQAWKHLADKYG 31 (185)
T ss_pred EcCCCcccCChHHHHHHHHHHHHHcC
Confidence 34799999999988888888887653
No 170
>COG3630 OadG Na+-transporting methylmalonyl-CoA/oxaloacetate decarboxylase, gamma subunit [Energy production and conversion]
Probab=20.45 E-value=1.2e+02 Score=19.01 Aligned_cols=23 Identities=13% Similarity=0.010 Sum_probs=14.8
Q ss_pred CcchHHHHHHHHHHhHHhhhhCC
Q psy15537 58 DGSGKGAGLASAIALKLGAFQSK 80 (80)
Q Consensus 58 DGSg~GAAl~aA~a~~~~~~~~~ 80 (80)
|-.+..++++||.....+.+++|
T Consensus 61 ~~~~~l~~viaaavh~h~~~~~k 83 (84)
T COG3630 61 EDFERLAAVIAAAVHHHRRRTAK 83 (84)
T ss_pred chhHHHHHHHHHHHHHHHhhccc
Confidence 56667777777665555666654
No 171
>PF10278 Med19: Mediator of RNA pol II transcription subunit 19 ; InterPro: IPR019403 The Mediator complex is a coactivator involved in the regulated transcription of nearly all RNA polymerase II-dependent genes. Mediator functions as a bridge to convey information from gene-specific regulatory proteins to the basal RNA polymerase II transcription machinery. The Mediator complex, having a compact conformation in its free form, is recruited to promoters by direct interactions with regulatory proteins and serves for the assembly of a functional preinitiation complex with RNA polymerase II and the general transcription factors. On recruitment the Mediator complex unfolds to an extended conformation and partially surrounds RNA polymerase II, specifically interacting with the unphosphorylated form of the C-terminal domain (CTD) of RNA polymerase II. The Mediator complex dissociates from the RNA polymerase II holoenzyme and stays at the promoter when transcriptional elongation begins. The Mediator complex is composed of at least 31 subunits: MED1, MED4, MED6, MED7, MED8, MED9, MED10, MED11, MED12, MED13, MED13L, MED14, MED15, MED16, MED17, MED18, MED19, MED20, MED21, MED22, MED23, MED24, MED25, MED26, MED27, MED29, MED30, MED31, CCNC, CDK8 and CDC2L6/CDK11. The subunits form at least three structurally distinct submodules. The head and the middle modules interact directly with RNA polymerase II, whereas the elongated tail module interacts with gene-specific regulatory proteins. Mediator containing the CDK8 module is less active than Mediator lacking this module in supporting transcriptional activation. The head module contains: MED6, MED8, MED11, SRB4/MED17, SRB5/MED18, ROX3/MED19, SRB2/MED20 and SRB6/MED22. The middle module contains: MED1, MED4, NUT1/MED5, MED7, CSE2/MED9, NUT2/MED10, SRB7/MED21 and SOH1/MED31. CSE2/MED9 interacts directly with MED4. The tail module contains: MED2, PGD1/MED3, RGR1/MED14, GAL11/MED15 and SIN4/MED16. The CDK8 module contains: MED12, MED13, CCNC and CDK8. Individual preparations of the Mediator complex lacking one or more distinct subunits have been variously termed ARC, CRSP, DRIP, PC2, SMCC and TRAP. Med19 represents a family of conserved proteins which are members of the multi-protein co-activator Mediator complex []. ; GO: 0001104 RNA polymerase II transcription cofactor activity, 0006357 regulation of transcription from RNA polymerase II promoter, 0016592 mediator complex
Probab=20.35 E-value=26 Score=24.78 Aligned_cols=31 Identities=16% Similarity=0.214 Sum_probs=24.2
Q ss_pred HHHHHHHHhhCCC--CceEEEEeCCcchHHHHH
Q psy15537 36 HWLQKYIQLLVPN--KTFRLLLAEDGSGKGAGL 66 (80)
Q Consensus 36 ~~l~~~l~~l~~~--~~v~~~~a~DGSg~GAAl 66 (80)
..|.+.+...||. .+|.+..+.|+|++..-|
T Consensus 37 kklke~lSsFLp~lpG~~~~~~~~d~ssLrslI 69 (178)
T PF10278_consen 37 KKLKESLSSFLPHLPGNIDLPGSQDNSSLRSLI 69 (178)
T ss_pred CcchhhHHHHccCCCCcccCCCCcCcchHHHHh
Confidence 5567777777776 588899999999987654
No 172
>cd02188 gamma_tubulin Gamma-tubulin is a ubiquitous phylogenetically conserved member of tubulin superfamily. Gamma is a low abundance protein present within the cells in both various types of microtubule-organizing centers and cytoplasmic protein complexes. Gamma-tubulin recruits the alpha/beta-tubulin dimers that form the minus ends of microtubules and is thought to be involved in microtubule nucleation and capping.
Probab=20.21 E-value=1.7e+02 Score=22.86 Aligned_cols=46 Identities=20% Similarity=0.218 Sum_probs=31.8
Q ss_pred eeeCccHHHHHHHHHHhhCCC----CceEEEEeCCc---chHHHHHHHHHHhH
Q psy15537 28 YKYHPRLKHWLQKYIQLLVPN----KTFRLLLAEDG---SGKGAGLASAIALK 73 (80)
Q Consensus 28 ye~~p~f~~~l~~~l~~l~~~----~~v~~~~a~DG---Sg~GAAl~aA~a~~ 73 (80)
|..-+.+.+.+.+.|+..+.+ ..|.+..+-.| ||.|+-++-.+...
T Consensus 106 y~~g~~~~d~i~d~ir~~~E~cd~l~gf~i~~SlgGGTGSG~gs~l~e~L~d~ 158 (431)
T cd02188 106 YSQGEEVQEEILDIIDREADGSDSLEGFVLCHSIAGGTGSGMGSYLLERLNDR 158 (431)
T ss_pred HHHHHHHHHHHHHHHHHHHhcCCCcceeEEEecCCCCcchhHHHHHHHHHHhH
Confidence 445567778888888887743 25777777764 88888777666443
No 173
>PTZ00260 dolichyl-phosphate beta-glucosyltransferase; Provisional
Probab=20.08 E-value=3.7e+02 Score=19.86 Aligned_cols=51 Identities=25% Similarity=0.358 Sum_probs=32.8
Q ss_pred ceEEeEeCceeeeCccHHHHHHHHHHhhC-CCCceEEEEeCCcchHHHHHHHHHHh
Q psy15537 18 DITIAVDGSLYKYHPRLKHWLQKYIQLLV-PNKTFRLLLAEDGSGKGAGLASAIAL 72 (80)
Q Consensus 18 ~~~IavDGsvye~~p~f~~~l~~~l~~l~-~~~~v~~~~a~DGSg~GAAl~aA~a~ 72 (80)
++.|--|||. ..-.+.+++...+-. |+.++++...+-..|+|+|+.+++..
T Consensus 109 EIIVVDDgSt----D~T~~i~~~~~~~~~~~~~~i~vi~~~~N~G~~~A~~~Gi~~ 160 (333)
T PTZ00260 109 EIIIVNDGSK----DKTLKVAKDFWRQNINPNIDIRLLSLLRNKGKGGAVRIGMLA 160 (333)
T ss_pred EEEEEeCCCC----CchHHHHHHHHHhcCCCCCcEEEEEcCCCCChHHHHHHHHHH
Confidence 5777788883 344445554444332 34567777777778999999887653
No 174
>COG3543 Uncharacterized conserved protein [Function unknown]
Probab=20.08 E-value=1.7e+02 Score=19.82 Aligned_cols=31 Identities=16% Similarity=0.232 Sum_probs=24.3
Q ss_pred eeeCccHHHHHHHHHHhhCCCCceEEEEeCC
Q psy15537 28 YKYHPRLKHWLQKYIQLLVPNKTFRLLLAED 58 (80)
Q Consensus 28 ye~~p~f~~~l~~~l~~l~~~~~v~~~~a~D 58 (80)
|-+.|.|-++|.....+|-...+|.++-..|
T Consensus 19 kGYS~~FveN~d~I~~rL~~ge~i~lV~g~D 49 (135)
T COG3543 19 KGYSPAFVENYDAIAERLKAGEDIKLVDGPD 49 (135)
T ss_pred ccCCHHHHHHHHHHHHHhhcCCCeEEEeccc
Confidence 5578999999999999997555577776655
Done!