Query psy12446
Match_columns 77
No_of_seqs 101 out of 180
Neff 4.4
Searched_HMMs 46136
Date Fri Aug 16 19:30:26 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy12446.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/12446hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG2836|consensus 100.0 1.9E-33 4E-38 198.9 6.8 69 9-77 10-79 (173)
2 PTZ00393 protein tyrosine phos 99.8 2.6E-21 5.6E-26 143.9 5.6 69 8-76 84-153 (241)
3 PTZ00242 protein tyrosine phos 99.7 3.6E-18 7.8E-23 118.7 6.4 68 9-76 9-77 (166)
4 KOG1720|consensus 99.3 1.8E-12 4E-17 96.1 3.7 50 27-76 82-131 (225)
5 PF05706 CDKN3: Cyclin-depende 98.5 1.9E-07 4E-12 66.9 4.1 51 25-75 57-115 (168)
6 cd00127 DSPc Dual specificity 97.2 0.00069 1.5E-08 43.4 4.6 44 31-75 19-62 (139)
7 smart00195 DSPc Dual specifici 95.0 0.077 1.7E-06 34.2 5.0 38 32-72 19-56 (138)
8 PF00782 DSPc: Dual specificit 93.9 0.11 2.4E-06 33.1 3.9 39 32-70 11-49 (133)
9 PF00102 Y_phosphatase: Protei 89.9 0.93 2E-05 30.8 4.9 37 13-49 38-76 (235)
10 COG2453 CDC14 Predicted protei 86.1 0.78 1.7E-05 31.9 2.7 32 45-76 57-88 (180)
11 cd00047 PTPc Protein tyrosine 85.9 1.3 2.9E-05 30.9 3.8 35 15-49 40-75 (231)
12 PRK00726 murG undecaprenyldiph 83.1 4.5 9.8E-05 29.4 5.7 54 16-69 2-59 (357)
13 smart00194 PTPc Protein tyrosi 81.8 2.4 5.1E-05 30.2 3.7 36 14-49 67-103 (258)
14 KOG3201|consensus 78.1 2 4.4E-05 31.8 2.4 39 18-60 136-176 (201)
15 PF05771 Pox_A31: Poxvirus A31 76.2 2.9 6.3E-05 28.7 2.6 24 10-33 28-51 (114)
16 PF04273 DUF442: Putative phos 75.5 5.4 0.00012 26.3 3.7 45 29-73 17-70 (110)
17 PLN02727 NAD kinase 75.1 4.6 9.9E-05 36.0 4.1 58 15-76 260-326 (986)
18 PRK09545 znuA high-affinity zi 74.4 7.2 0.00016 29.4 4.6 50 23-74 236-290 (311)
19 PF00762 Ferrochelatase: Ferro 70.9 3.5 7.6E-05 31.4 2.2 49 24-72 239-301 (316)
20 PRK12361 hypothetical protein; 68.9 11 0.00024 30.3 4.7 53 18-75 102-157 (547)
21 PHA02746 protein tyrosine phos 68.7 8.9 0.00019 29.2 4.0 33 16-48 113-146 (323)
22 PF04343 DUF488: Protein of un 66.3 12 0.00027 24.2 3.8 41 27-67 1-51 (122)
23 PF12122 DUF3582: Protein of u 65.9 10 0.00022 24.8 3.4 31 16-46 1-31 (101)
24 TIGR02571 ComEB ComE operon pr 64.5 14 0.0003 25.6 4.0 54 12-69 84-141 (151)
25 PHA02742 protein tyrosine phos 64.1 12 0.00026 28.1 3.9 33 16-48 93-126 (303)
26 PF10369 ALS_ss_C: Small subun 62.5 6.2 0.00013 24.2 1.8 38 9-46 29-67 (75)
27 PHA02738 hypothetical protein; 61.8 12 0.00027 28.4 3.6 33 16-48 90-123 (320)
28 PHA02747 protein tyrosine phos 61.7 15 0.00032 27.8 4.0 34 16-49 93-127 (312)
29 PF08257 Sulfakinin: Sulfakini 60.8 2.5 5.4E-05 17.2 -0.1 7 12-18 3-9 (9)
30 cd01137 PsaA Metal binding pro 60.5 13 0.00028 27.5 3.5 48 23-72 210-261 (287)
31 PF13911 AhpC-TSA_2: AhpC/TSA 60.0 9.4 0.0002 23.9 2.3 30 28-57 2-31 (115)
32 cd01017 AdcA Metal binding pro 58.6 18 0.00039 26.4 4.0 43 23-67 204-250 (282)
33 cd01019 ZnuA Zinc binding prot 58.4 24 0.00051 26.1 4.6 47 23-71 212-263 (286)
34 PF13579 Glyco_trans_4_4: Glyc 57.3 29 0.00062 21.1 4.2 44 28-72 7-51 (160)
35 PF03162 Y_phosphatase2: Tyros 56.4 14 0.0003 25.7 2.9 48 22-73 19-69 (164)
36 TIGR01244 conserved hypothetic 56.3 25 0.00053 23.3 4.0 44 30-73 18-70 (135)
37 cd04876 ACT_RelA-SpoT ACT dom 56.1 21 0.00044 18.4 3.0 33 13-45 38-71 (71)
38 PF12646 DUF3783: Domain of un 56.1 14 0.00031 21.5 2.5 28 17-44 2-30 (58)
39 PHA02588 cd deoxycytidylate de 55.3 25 0.00054 24.6 4.1 51 11-67 97-154 (168)
40 cd03819 GT1_WavL_like This fam 53.3 38 0.00082 23.7 4.7 46 27-73 15-60 (355)
41 COG5603 TRS20 Subunit of TRAPP 51.6 24 0.00053 24.7 3.4 52 6-60 69-123 (136)
42 PF13344 Hydrolase_6: Haloacid 49.3 31 0.00067 21.7 3.5 34 29-62 20-57 (101)
43 PF14814 UB2H: Bifunctional tr 48.6 10 0.00022 23.6 1.1 48 25-75 6-56 (85)
44 PF10740 DUF2529: Protein of u 48.5 40 0.00086 24.5 4.3 59 17-76 84-144 (172)
45 KOG2781|consensus 46.7 47 0.001 25.9 4.7 50 14-68 102-156 (290)
46 PRK13809 orotate phosphoribosy 46.6 60 0.0013 23.4 5.0 53 14-66 117-175 (206)
47 PHA02740 protein tyrosine phos 45.7 34 0.00073 25.9 3.7 31 18-48 93-124 (298)
48 cd01018 ZntC Metal binding pro 45.6 39 0.00084 24.5 3.9 43 22-66 200-246 (266)
49 PF11121 DUF2639: Protein of u 43.4 16 0.00034 20.9 1.2 13 30-42 7-19 (40)
50 TIGR01133 murG undecaprenyldip 43.3 97 0.0021 22.1 5.6 52 17-69 2-58 (348)
51 TIGR00109 hemH ferrochelatase. 42.9 35 0.00076 25.9 3.5 45 24-68 244-302 (322)
52 PF01297 TroA: Periplasmic sol 42.4 63 0.0014 22.9 4.5 45 22-68 182-230 (256)
53 cd01020 TroA_b Metal binding p 42.2 62 0.0013 23.5 4.5 42 23-66 190-238 (264)
54 PF09827 CRISPR_Cas2: CRISPR a 41.7 33 0.00072 20.4 2.6 26 17-43 3-30 (78)
55 PF00571 CBS: CBS domain CBS d 41.5 34 0.00074 18.2 2.5 28 22-49 12-39 (57)
56 PF14044 NETI: NETI protein 40.5 39 0.00084 20.6 2.7 31 23-53 5-35 (57)
57 PF07905 PucR: Purine cataboli 40.5 79 0.0017 20.4 4.5 43 26-68 59-107 (123)
58 PF07166 DUF1398: Protein of u 40.4 11 0.00025 25.7 0.4 21 24-44 14-34 (125)
59 PF00156 Pribosyltran: Phospho 40.0 53 0.0011 20.2 3.4 32 13-44 86-120 (125)
60 PF10653 Phage-A118_gp45: Prot 39.5 15 0.00032 22.5 0.7 20 26-45 43-62 (62)
61 KOG0117|consensus 39.4 1.1E+02 0.0025 25.6 6.0 53 10-65 159-214 (506)
62 TIGR00119 acolac_sm acetolacta 38.9 62 0.0013 22.6 3.9 36 11-46 112-148 (157)
63 PLN02404 6,7-dimethyl-8-ribity 38.7 39 0.00085 23.5 2.9 36 11-46 3-48 (141)
64 PRK07114 keto-hydroxyglutarate 37.4 42 0.0009 24.7 3.0 43 28-70 5-50 (222)
65 cd03808 GT1_cap1E_like This fa 37.0 1.2E+02 0.0027 20.3 5.1 45 25-70 13-57 (359)
66 COG1040 ComFC Predicted amidop 36.4 53 0.0012 23.8 3.4 30 18-47 188-219 (225)
67 PRK05476 S-adenosyl-L-homocyst 36.3 1.4E+02 0.003 24.1 6.0 54 12-66 45-103 (425)
68 PRK14465 ribosomal RNA large s 36.2 37 0.00079 26.5 2.7 25 22-46 301-325 (342)
69 PRK00002 aroB 3-dehydroquinate 36.2 92 0.002 23.7 4.8 47 28-74 19-72 (358)
70 PRK07455 keto-hydroxyglutarate 36.1 44 0.00096 23.4 2.9 43 29-71 3-48 (187)
71 PF09673 TrbC_Ftype: Type-F co 36.0 39 0.00085 22.0 2.5 24 24-47 9-32 (113)
72 PRK13811 orotate phosphoribosy 36.0 1.2E+02 0.0026 20.9 5.0 54 13-66 102-161 (170)
73 PRK10907 intramembrane serine 35.8 35 0.00075 25.8 2.5 30 16-45 1-30 (276)
74 PRK13812 orotate phosphoribosy 35.5 1.3E+02 0.0028 21.0 5.2 53 14-66 106-164 (176)
75 cd03784 GT1_Gtf_like This fami 35.4 74 0.0016 23.5 4.1 49 16-66 1-53 (401)
76 PF15632 ATPgrasp_Ter: ATP-gra 35.4 58 0.0013 25.3 3.7 30 31-64 58-94 (329)
77 cd02071 MM_CoA_mut_B12_BD meth 34.9 29 0.00064 22.3 1.7 48 24-74 63-113 (122)
78 cd06578 HemD Uroporphyrinogen- 34.7 1.2E+02 0.0027 20.2 4.8 45 23-67 103-152 (239)
79 PF12804 NTP_transf_3: MobA-li 34.6 38 0.00083 21.7 2.2 45 23-67 22-66 (160)
80 COG1149 MinD superfamily P-loo 34.4 1.2E+02 0.0026 23.7 5.2 55 16-70 188-250 (284)
81 cd08197 DOIS 2-deoxy-scyllo-in 34.3 97 0.0021 23.9 4.7 48 28-75 11-65 (355)
82 PF01548 DEDD_Tnp_IS110: Trans 34.1 52 0.0011 21.1 2.8 41 22-64 29-74 (144)
83 cd00851 MTH1175 This uncharact 33.8 70 0.0015 19.2 3.2 37 29-67 53-91 (103)
84 PF05037 DUF669: Protein of un 33.8 63 0.0014 21.6 3.3 50 25-76 91-140 (141)
85 TIGR02990 ectoine_eutA ectoine 33.7 95 0.0021 22.8 4.4 43 22-66 102-151 (239)
86 PF02449 Glyco_hydro_42: Beta- 33.0 33 0.00071 26.0 1.9 50 24-74 8-73 (374)
87 cd04955 GT1_like_6 This family 33.0 1.3E+02 0.0027 21.1 4.8 46 25-70 18-63 (363)
88 PRK11895 ilvH acetolactate syn 32.9 39 0.00085 23.8 2.2 25 23-47 126-150 (161)
89 COG5324 Uncharacterized conser 32.8 53 0.0011 28.4 3.2 31 24-54 161-191 (758)
90 TIGR02765 crypto_DASH cryptoch 32.8 1E+02 0.0022 23.8 4.7 58 15-74 74-150 (429)
91 COG0120 RpiA Ribose 5-phosphat 32.7 36 0.00079 25.6 2.1 46 20-66 24-72 (227)
92 TIGR00640 acid_CoA_mut_C methy 32.3 41 0.00089 22.5 2.1 34 28-61 70-106 (132)
93 PRK14463 ribosomal RNA large s 32.1 46 0.00099 25.8 2.6 23 23-45 298-320 (349)
94 PF10662 PduV-EutP: Ethanolami 32.0 51 0.0011 22.8 2.6 27 21-47 100-127 (143)
95 PRK14457 ribosomal RNA large s 31.9 46 0.001 25.8 2.6 25 22-46 302-326 (345)
96 COG3185 4-hydroxyphenylpyruvat 31.5 38 0.00082 27.3 2.1 51 15-65 219-272 (363)
97 cd02072 Glm_B12_BD B12 binding 31.3 31 0.00068 23.5 1.5 49 24-75 63-120 (128)
98 CHL00100 ilvH acetohydroxyacid 31.2 90 0.002 22.2 3.8 38 11-48 113-151 (174)
99 PRK00979 tetrahydromethanopter 31.0 96 0.0021 24.3 4.2 59 15-73 96-160 (308)
100 TIGR00201 comF comF family pro 30.9 77 0.0017 21.9 3.4 34 14-47 151-187 (190)
101 PRK12435 ferrochelatase; Provi 30.7 51 0.0011 25.2 2.6 42 24-65 230-285 (311)
102 PRK12419 riboflavin synthase s 30.3 63 0.0014 22.9 2.9 33 14-46 9-51 (158)
103 PF13117 Cag12: Cag pathogenic 30.2 94 0.002 20.9 3.6 25 21-45 86-110 (113)
104 PF12123 Amidase02_C: N-acetyl 30.1 65 0.0014 18.5 2.4 27 14-40 14-41 (45)
105 PF13477 Glyco_trans_4_2: Glyc 30.1 1.4E+02 0.003 18.4 5.6 41 27-68 12-52 (139)
106 PRK13810 orotate phosphoribosy 29.9 1.9E+02 0.0042 20.4 5.3 53 14-66 121-179 (187)
107 PF02602 HEM4: Uroporphyrinoge 29.7 1.3E+02 0.0029 20.4 4.4 46 22-67 97-147 (231)
108 PF00752 XPG_N: XPG N-terminal 29.6 67 0.0015 19.7 2.7 22 27-48 60-81 (101)
109 PF00975 Thioesterase: Thioest 29.6 20 0.00043 24.1 0.2 49 27-75 51-106 (229)
110 TIGR00114 lumazine-synth 6,7-d 29.1 65 0.0014 22.1 2.7 31 16-46 1-41 (138)
111 PF05724 TPMT: Thiopurine S-me 28.9 67 0.0015 23.1 2.9 45 24-68 22-67 (218)
112 TIGR00511 ribulose_e2b2 ribose 28.7 1.6E+02 0.0034 22.4 5.0 48 16-64 117-170 (301)
113 PF11032 ApoM: Apolipoprotein 28.7 1.1E+02 0.0024 22.2 3.9 32 7-38 133-164 (186)
114 PF02120 Flg_hook: Flagellar h 28.2 36 0.00078 20.1 1.2 22 54-75 62-83 (85)
115 PF00583 Acetyltransf_1: Acety 28.2 81 0.0017 17.5 2.7 36 27-62 46-83 (83)
116 COG3981 Predicted acetyltransf 28.2 1.3E+02 0.0029 21.9 4.3 45 28-72 119-165 (174)
117 PF14871 GHL6: Hypothetical gl 27.8 1E+02 0.0022 20.7 3.5 45 24-73 42-88 (132)
118 TIGR01091 upp uracil phosphori 27.8 1.1E+02 0.0023 21.7 3.8 35 13-47 120-157 (207)
119 PRK14462 ribosomal RNA large s 27.6 56 0.0012 25.7 2.5 24 22-45 310-333 (356)
120 cd03785 GT1_MurG MurG is an N- 27.5 1.8E+02 0.004 20.6 4.9 40 30-69 18-57 (350)
121 PF01705 CX: CX module; Inter 27.4 28 0.00061 20.6 0.6 13 63-75 28-40 (61)
122 COG3816 Uncharacterized protei 27.4 16 0.00034 27.1 -0.6 47 14-69 52-102 (205)
123 TIGR00336 pyrE orotate phospho 27.4 1.7E+02 0.0037 20.0 4.6 52 15-66 108-167 (173)
124 PF02579 Nitro_FeMo-Co: Dinitr 27.1 67 0.0015 18.9 2.3 39 28-66 42-80 (94)
125 PF13727 CoA_binding_3: CoA-bi 27.0 59 0.0013 20.7 2.1 41 26-66 128-174 (175)
126 cd04618 CBS_pair_5 The CBS dom 26.9 1.5E+02 0.0032 17.7 3.9 28 20-47 4-31 (98)
127 PRK14467 ribosomal RNA large s 26.8 62 0.0013 25.2 2.6 24 22-45 301-324 (348)
128 TIGR02742 TrbC_Ftype type-F co 26.6 1.2E+02 0.0027 20.6 3.7 22 24-45 10-31 (130)
129 PF12996 DUF3880: DUF based on 26.6 56 0.0012 19.8 1.9 16 29-44 27-42 (79)
130 cd03409 Chelatase_Class_II Cla 26.5 1.1E+02 0.0024 18.2 3.2 25 24-48 43-67 (101)
131 PF13086 AAA_11: AAA domain; P 26.4 81 0.0017 20.7 2.7 37 13-49 51-94 (236)
132 cd04908 ACT_Bt0572_1 N-termina 26.3 1.3E+02 0.0028 16.9 4.1 25 17-41 3-28 (66)
133 TIGR01718 Uridine-psphlse urid 26.1 2.1E+02 0.0045 20.7 5.1 44 4-47 41-86 (245)
134 COG0095 LplA Lipoate-protein l 26.1 55 0.0012 23.9 2.0 45 2-46 19-72 (248)
135 PF06676 DUF1178: Protein of u 25.9 33 0.00072 24.1 0.8 17 54-70 128-144 (148)
136 TIGR01694 MTAP 5'-deoxy-5'-met 25.9 62 0.0013 23.3 2.3 23 26-48 62-84 (241)
137 PF06557 DUF1122: Protein of u 25.7 48 0.001 24.1 1.7 19 54-72 109-128 (170)
138 smart00481 POLIIIAc DNA polyme 25.5 1.4E+02 0.003 16.9 4.5 41 24-64 13-58 (67)
139 COG0803 LraI ABC-type metal io 25.5 1.2E+02 0.0026 22.7 3.8 47 23-71 225-275 (303)
140 PRK06552 keto-hydroxyglutarate 25.5 78 0.0017 22.9 2.7 39 31-69 6-47 (213)
141 PF08279 HTH_11: HTH domain; 25.4 47 0.001 18.1 1.3 16 24-39 28-43 (55)
142 cd02169 Citrate_lyase_ligase C 25.1 1.6E+02 0.0034 22.5 4.4 38 27-65 46-83 (297)
143 PF00885 DMRL_synthase: 6,7-di 25.0 1.2E+02 0.0027 20.7 3.5 35 14-48 2-46 (144)
144 PRK15454 ethanol dehydrogenase 24.9 2E+02 0.0044 22.4 5.1 39 26-64 35-81 (395)
145 PF08825 E2_bind: E2 binding d 24.7 60 0.0013 20.5 1.8 19 20-38 1-19 (84)
146 KOG4549|consensus 24.7 38 0.00082 24.1 0.9 31 28-58 49-81 (144)
147 PF11210 DUF2996: Protein of u 24.6 46 0.001 22.9 1.3 21 54-74 21-41 (119)
148 cd04440 DEP_2_P-Rex DEP (Dishe 24.5 45 0.00098 21.7 1.2 19 54-72 64-85 (93)
149 PF10775 ATP_sub_h: ATP syntha 24.4 39 0.00085 21.0 0.9 20 24-43 10-29 (67)
150 TIGR00010 hydrolase, TatD fami 24.3 81 0.0018 21.6 2.5 20 27-46 16-35 (252)
151 cd01311 PDC_hydrolase 2-pyrone 24.3 1.2E+02 0.0025 21.8 3.4 26 24-49 26-51 (263)
152 cd00419 Ferrochelatase_C Ferro 24.2 1.2E+02 0.0027 20.2 3.3 37 24-60 75-124 (135)
153 PF13730 HTH_36: Helix-turn-he 24.0 55 0.0012 17.8 1.4 17 24-40 38-54 (55)
154 TIGR00048 radical SAM enzyme, 23.9 81 0.0018 24.5 2.7 24 23-46 306-329 (355)
155 PF09587 PGA_cap: Bacterial ca 23.9 1.2E+02 0.0026 21.6 3.4 40 26-65 62-110 (250)
156 cd08190 HOT Hydroxyacid-oxoaci 23.9 1.9E+02 0.0041 22.6 4.8 44 27-72 10-61 (414)
157 PRK08564 5'-methylthioadenosin 23.7 2.1E+02 0.0045 21.5 4.8 48 6-53 43-102 (267)
158 cd08175 G1PDH Glycerol-1-phosp 23.7 1.7E+02 0.0036 22.0 4.3 45 28-72 11-62 (348)
159 PF06230 DUF1009: Protein of u 23.7 59 0.0013 24.1 1.8 24 29-52 2-28 (214)
160 cd00401 AdoHcyase S-adenosyl-L 23.6 2.8E+02 0.0061 22.2 5.8 57 10-67 31-92 (413)
161 TIGR02638 lactal_redase lactal 23.6 2.4E+02 0.0053 21.6 5.2 44 26-71 15-66 (379)
162 PF02735 Ku: Ku70/Ku80 beta-ba 23.6 1.4E+02 0.0031 20.5 3.7 57 14-70 103-175 (200)
163 smart00292 BRCT breast cancer 23.6 1.3E+02 0.0028 16.0 5.9 35 13-47 3-38 (80)
164 PF13670 PepSY_2: Peptidase pr 23.5 59 0.0013 19.6 1.6 18 54-71 36-53 (83)
165 PRK14468 ribosomal RNA large s 23.4 81 0.0017 24.4 2.6 24 22-45 293-316 (343)
166 COG0514 RecQ Superfamily II DN 23.4 1.2E+02 0.0027 25.7 3.9 30 11-40 294-323 (590)
167 PRK14466 ribosomal RNA large s 23.3 78 0.0017 24.9 2.5 24 22-45 297-320 (345)
168 PRK14460 ribosomal RNA large s 23.3 83 0.0018 24.5 2.7 25 22-46 304-328 (354)
169 TIGR00639 PurN phosphoribosylg 23.3 2.6E+02 0.0055 19.8 5.0 53 16-68 1-58 (190)
170 COG1167 ARO8 Transcriptional r 23.3 1.3E+02 0.0027 23.9 3.7 55 17-73 157-213 (459)
171 PRK05718 keto-hydroxyglutarate 23.1 99 0.0022 22.4 2.9 43 27-69 4-49 (212)
172 PLN02449 ferrochelatase 22.9 1E+02 0.0022 25.5 3.2 48 24-71 336-397 (485)
173 cd00673 AlaRS_core Alanyl-tRNA 22.8 74 0.0016 24.0 2.2 47 28-75 5-53 (232)
174 TIGR01697 PNPH-PUNA-XAPA inosi 22.7 1.8E+02 0.0038 21.1 4.2 25 24-48 67-91 (248)
175 KOG2915|consensus 22.7 62 0.0013 25.7 1.8 44 30-74 145-193 (314)
176 COG4750 LicC CTP:phosphocholin 22.7 67 0.0015 24.4 2.0 40 8-47 14-53 (231)
177 TIGR03556 photolyase_8HDF deox 22.6 1.5E+02 0.0033 23.6 4.1 32 33-65 84-124 (471)
178 PF12993 DUF3877: Domain of un 22.6 50 0.0011 24.1 1.3 65 8-74 65-147 (175)
179 PRK08005 epimerase; Validated 22.4 1.3E+02 0.0029 21.8 3.5 48 25-73 67-118 (210)
180 PRK14456 ribosomal RNA large s 22.4 81 0.0017 24.8 2.5 25 22-46 325-349 (368)
181 KOG3985|consensus 22.4 96 0.0021 24.2 2.8 25 22-46 68-92 (283)
182 PF09383 NIL: NIL domain; Int 22.2 76 0.0016 18.6 1.8 23 19-41 49-72 (76)
183 PRK12560 adenine phosphoribosy 22.2 2.8E+02 0.0061 19.4 5.0 54 13-66 112-173 (187)
184 KOG3349|consensus 22.1 1.6E+02 0.0034 21.5 3.7 41 29-70 25-72 (170)
185 PRK14719 bifunctional RNAse/5- 22.0 1.6E+02 0.0035 23.1 4.0 39 26-64 53-99 (360)
186 TIGR01573 cas2 CRISPR-associat 21.9 1.2E+02 0.0026 19.0 2.8 24 16-39 1-29 (95)
187 PF15533 Toxin_54: Putative to 21.9 16 0.00034 22.9 -1.2 18 50-67 10-27 (66)
188 PRK14454 ribosomal RNA large s 21.8 84 0.0018 24.3 2.4 24 22-45 298-321 (342)
189 PRK14461 ribosomal RNA large s 21.8 91 0.002 24.9 2.7 24 23-46 325-348 (371)
190 PRK14469 ribosomal RNA large s 21.7 92 0.002 23.8 2.6 25 22-46 297-321 (343)
191 PF14403 CP_ATPgrasp_2: Circul 21.6 1.1E+02 0.0023 25.1 3.0 43 22-67 196-241 (445)
192 PRK14453 chloramphenicol/florf 21.6 90 0.0019 24.3 2.5 25 22-46 302-326 (347)
193 cd03742 SOCS_Rab40 SOCS (suppr 21.5 16 0.00034 20.9 -1.2 29 4-35 11-39 (43)
194 PRK13256 thiopurine S-methyltr 21.4 1.1E+02 0.0024 22.5 2.9 45 24-68 28-73 (226)
195 cd08191 HHD 6-hydroxyhexanoate 21.2 2.5E+02 0.0055 21.5 4.9 42 28-72 11-60 (386)
196 PRK05647 purN phosphoribosylgl 21.2 3.1E+02 0.0068 19.4 5.1 53 16-68 2-59 (200)
197 TIGR00126 deoC deoxyribose-pho 21.1 52 0.0011 23.9 1.1 52 24-75 16-69 (211)
198 PRK14464 ribosomal RNA large s 21.0 1E+02 0.0022 24.2 2.7 25 22-46 289-313 (344)
199 cd04441 DEP_2_DEP6 DEP (Dishev 20.9 56 0.0012 20.8 1.1 19 54-72 59-80 (85)
200 PF03683 UPF0175: Uncharacteri 20.8 72 0.0016 19.4 1.6 25 26-57 46-70 (76)
201 PRK13111 trpA tryptophan synth 20.7 1.4E+02 0.0031 22.2 3.4 26 27-53 130-155 (258)
202 TIGR00936 ahcY adenosylhomocys 20.7 3.6E+02 0.0077 21.6 5.8 56 11-67 28-88 (406)
203 cd08195 DHQS Dehydroquinate sy 20.6 2.7E+02 0.0059 20.9 4.9 37 39-75 23-66 (345)
204 cd00562 NifX_NifB This CD repr 20.6 1.8E+02 0.0038 17.3 3.3 35 29-65 51-87 (102)
205 PF00532 Peripla_BP_1: Peripla 20.6 2.7E+02 0.0059 20.0 4.8 48 16-64 32-83 (279)
206 PRK13902 alaS alanyl-tRNA synt 20.5 73 0.0016 28.1 2.0 51 26-76 63-115 (900)
207 PF02635 DrsE: DsrE/DsrF-like 20.4 1.9E+02 0.004 17.3 3.4 24 24-49 59-83 (122)
208 PF07997 DUF1694: Protein of u 20.3 1.3E+02 0.0028 20.1 2.8 41 12-53 60-101 (120)
209 PF05378 Hydant_A_N: Hydantoin 20.3 1.9E+02 0.0041 20.1 3.8 30 24-53 132-161 (176)
210 PF04007 DUF354: Protein of un 20.2 1.7E+02 0.0037 22.7 3.8 49 16-64 1-52 (335)
211 cd04878 ACT_AHAS N-terminal AC 20.2 1.6E+02 0.0034 15.7 2.9 22 16-38 44-65 (72)
212 cd07856 STKc_Sty1_Hog1 Catalyt 20.2 44 0.00095 24.1 0.6 30 42-76 290-319 (328)
213 COG0735 Fur Fe2+/Zn2+ uptake r 20.0 54 0.0012 22.2 1.0 48 20-72 31-80 (145)
214 TIGR01744 XPRTase xanthine pho 20.0 3E+02 0.0066 19.4 4.8 58 14-71 116-184 (191)
No 1
>KOG2836|consensus
Probab=100.00 E-value=1.9e-33 Score=198.87 Aligned_cols=69 Identities=38% Similarity=0.611 Sum_probs=66.4
Q ss_pred eeeeccceeEEeec-CCcccHHHHHHHHHhCCCeEEEEeCCCCcCchhhhcCCceEEEeecCCCCCCCCC
Q psy12446 9 RTLDYKTVTLYHGV-NLENHLHLAYHELKKHQVKNVVRVCEPTYKVEDLKTEGINVKDLAYEDGTSPSPE 77 (77)
Q Consensus 9 ~~~~y~~~~fli~~-pt~~tl~~yi~elk~~~Vt~VVRvCe~tYd~~~~~~~GI~vhdl~F~DG~~Pp~~ 77 (77)
-+|+||+|||||++ |||+||+.||+|||+|||+.||||||||||+..+|++||.|.||+|+||++||.+
T Consensus 10 veIsy~~MrFLIThnPtnaTln~fieELkKygvttvVRVCe~TYdt~~lek~GI~Vldw~f~dg~ppp~q 79 (173)
T KOG2836|consen 10 VEISYKNMRFLITHNPTNATLNKFIEELKKYGVTTVVRVCEPTYDTTPLEKEGITVLDWPFDDGAPPPNQ 79 (173)
T ss_pred eeeeccceEEEEecCCCchhHHHHHHHHHhcCCeEEEEecccccCCchhhhcCceEeecccccCCCCchH
Confidence 47999999999987 6999999999999999999999999999999999999999999999999999963
No 2
>PTZ00393 protein tyrosine phosphatase; Provisional
Probab=99.84 E-value=2.6e-21 Score=143.86 Aligned_cols=69 Identities=28% Similarity=0.503 Sum_probs=65.9
Q ss_pred eeeeeccceeEEee-cCCcccHHHHHHHHHhCCCeEEEEeCCCCcCchhhhcCCceEEEeecCCCCCCCC
Q psy12446 8 CRTLDYKTVTLYHG-VNLENHLHLAYHELKKHQVKNVVRVCEPTYKVEDLKTEGINVKDLAYEDGTSPSP 76 (77)
Q Consensus 8 ~~~~~y~~~~fli~-~pt~~tl~~yi~elk~~~Vt~VVRvCe~tYd~~~~~~~GI~vhdl~F~DG~~Pp~ 76 (77)
-+.|+|+.|||||+ .|++++++.||++||++||++|||+|+++|+.+.++++||++++|||+||++|+.
T Consensus 84 ~~~ie~~~~rfLi~~~P~~~~~~~yl~eLk~~gV~~lVrlcE~~Yd~~~~~~~GI~~~~lpipDg~aPs~ 153 (241)
T PTZ00393 84 PTKIEHGKIKILILDAPTNDLLPLYIKEMKNYNVTDLVRTCERTYNDGEITSAGINVHELIFPDGDAPTV 153 (241)
T ss_pred chhhccCceeEEEeCCCCHHHHHHHHHHHHHcCCCEEEECCCCCCCHHHHHHcCCeEEEeecCCCCCCCH
Confidence 35799999999995 7999999999999999999999999999999999999999999999999999985
No 3
>PTZ00242 protein tyrosine phosphatase; Provisional
Probab=99.74 E-value=3.6e-18 Score=118.73 Aligned_cols=68 Identities=34% Similarity=0.611 Sum_probs=65.2
Q ss_pred eeeeccceeEEee-cCCcccHHHHHHHHHhCCCeEEEEeCCCCcCchhhhcCCceEEEeecCCCCCCCC
Q psy12446 9 RTLDYKTVTLYHG-VNLENHLHLAYHELKKHQVKNVVRVCEPTYKVEDLKTEGINVKDLAYEDGTSPSP 76 (77)
Q Consensus 9 ~~~~y~~~~fli~-~pt~~tl~~yi~elk~~~Vt~VVRvCe~tYd~~~~~~~GI~vhdl~F~DG~~Pp~ 76 (77)
|.|+|+++|||+. .|+++++++|+++|+++||++||++|++.|+.+.++..||++++||++||++|+.
T Consensus 9 ~~~~~~~~r~~~~~~P~~~~~~~~l~~L~~~gI~~Iv~l~~~~~~~~~~~~~gi~~~~~p~~D~~~P~~ 77 (166)
T PTZ00242 9 RQIEYVLFKFLILDAPSPSNLPLYIKELQRYNVTHLVRVCGPTYDAELLEKNGIEVHDWPFDDGAPPPK 77 (166)
T ss_pred cceeeeceEEEEecCCCcccHHHHHHHHHhCCCeEEEecCCCCCCHHHHHHCCCEEEecCCCCCCCCCH
Confidence 6799999999997 5999999999999999999999999999999999999999999999999999975
No 4
>KOG1720|consensus
Probab=99.29 E-value=1.8e-12 Score=96.06 Aligned_cols=50 Identities=24% Similarity=0.466 Sum_probs=47.9
Q ss_pred cHHHHHHHHHhCCCeEEEEeCCCCcCchhhhcCCceEEEeecCCCCCCCC
Q psy12446 27 HLHLAYHELKKHQVKNVVRVCEPTYKVEDLKTEGINVKDLAYEDGTSPSP 76 (77)
Q Consensus 27 tl~~yi~elk~~~Vt~VVRvCe~tYd~~~~~~~GI~vhdl~F~DG~~Pp~ 76 (77)
.++.|+..|++++|+.+||+++..||.+.|+++||.|||++|+||++|++
T Consensus 82 ~~~~~~~~~~~~~v~s~vrln~~~yd~~~f~~~Gi~h~~l~f~Dg~tP~~ 131 (225)
T KOG1720|consen 82 LPQPYIQYFKNNNVTSIVRLNKRLYDAKRFTDAGIDHHDLFFADGSTPTD 131 (225)
T ss_pred CChhHHHHhhhcccceEEEcCCCCCChHHhcccCceeeeeecCCCCCCCH
Confidence 46789999999999999999999999999999999999999999999985
No 5
>PF05706 CDKN3: Cyclin-dependent kinase inhibitor 3 (CDKN3); InterPro: IPR022778 This entry represents a domain found in cyclin-dependent kinase inhibitor 3 or kinase associated phosphatase proteins from several mammalian species. The cyclin-dependent kinase (Cdk)-associated protein phosphatase (KAP) is a human dual specificity protein phosphatase that dephosphorylates Cdk2 on threonine 160 in a cyclin-dependent manner [], []. This domain is also found in MAP kinase phosphatase and esterases. This entry contains both eukaryotic and bacterial proteins.; GO: 0004721 phosphoprotein phosphatase activity, 0004725 protein tyrosine phosphatase activity; PDB: 1FQ1_A 1FPZ_F.
Probab=98.46 E-value=1.9e-07 Score=66.90 Aligned_cols=51 Identities=25% Similarity=0.412 Sum_probs=34.3
Q ss_pred cccHHHHHHHHHhCCCeEEEEeCC----CCcCc----hhhhcCCceEEEeecCCCCCCC
Q psy12446 25 ENHLHLAYHELKKHQVKNVVRVCE----PTYKV----EDLKTEGINVKDLAYEDGTSPS 75 (77)
Q Consensus 25 ~~tl~~yi~elk~~~Vt~VVRvCe----~tYd~----~~~~~~GI~vhdl~F~DG~~Pp 75 (77)
.++|+..++.+|.+||++||-+|+ ..|.. +.+++.||.+|++|++||++|+
T Consensus 57 ~RdL~~DL~~Lk~~G~~~Vvtl~~~~EL~~l~Vp~L~~~~~~~Gi~~~h~PI~D~~aPd 115 (168)
T PF05706_consen 57 RRDLQADLERLKDWGAQDVVTLLTDHELARLGVPDLGEAAQARGIAWHHLPIPDGSAPD 115 (168)
T ss_dssp EB-HHHHHHHHHHTT--EEEE-S-HHHHHHTT-TTHHHHHHHTT-EEEE----TTS---
T ss_pred cchHHHHHHHHHHCCCCEEEEeCcHHHHHHcCCccHHHHHHHcCCEEEecCccCCCCCC
Confidence 789999999999999999999998 45665 8889999999999999999995
No 6
>cd00127 DSPc Dual specificity phosphatases (DSP); Ser/Thr and Tyr protein phosphatases. Structurally similar to tyrosine-specific phosphatases but with a shallower active site cleft and a distinctive active site signature motif, HCxxGxxR. Characterized as VHR- or Cdc25-like.
Probab=97.24 E-value=0.00069 Score=43.40 Aligned_cols=44 Identities=23% Similarity=0.159 Sum_probs=37.7
Q ss_pred HHHHHHhCCCeEEEEeCCCCcCchhhhcCCceEEEeecCCCCCCC
Q psy12446 31 AYHELKKHQVKNVVRVCEPTYKVEDLKTEGINVKDLAYEDGTSPS 75 (77)
Q Consensus 31 yi~elk~~~Vt~VVRvCe~tYd~~~~~~~GI~vhdl~F~DG~~Pp 75 (77)
.++.|+++||+.||-+|...+. ......|++++++|+.|+..++
T Consensus 19 d~~~L~~~gi~~VI~l~~~~~~-~~~~~~~~~~~~~~~~D~~~~~ 62 (139)
T cd00127 19 DKELLKKLGITHVLNVAKEVPN-ENLFLSDFNYLYVPILDLPSQD 62 (139)
T ss_pred CHHHHHHcCCCEEEEcccCCCC-cccCCCCceEEEEEceeCCCCC
Confidence 4678899999999999998876 6677899999999999988543
No 7
>smart00195 DSPc Dual specificity phosphatase, catalytic domain.
Probab=94.99 E-value=0.077 Score=34.24 Aligned_cols=38 Identities=24% Similarity=0.295 Sum_probs=31.6
Q ss_pred HHHHHhCCCeEEEEeCCCCcCchhhhcCCceEEEeecCCCC
Q psy12446 32 YHELKKHQVKNVVRVCEPTYKVEDLKTEGINVKDLAYEDGT 72 (77)
Q Consensus 32 i~elk~~~Vt~VVRvCe~tYd~~~~~~~GI~vhdl~F~DG~ 72 (77)
++.++++||+.||-+++..+. ....||+.+.+|.+|..
T Consensus 19 ~~~l~~~gi~~Vi~l~~~~~~---~~~~~~~~~~ipi~D~~ 56 (138)
T smart00195 19 LALLKKLGITHVINVTNEVPN---LNKKGFTYLGVPILDNT 56 (138)
T ss_pred HHHHHHcCCCEEEEccCCCCC---CCCCCCEEEEEECCCCC
Confidence 578899999999999986553 24789999999999943
No 8
>PF00782 DSPc: Dual specificity phosphatase, catalytic domain; InterPro: IPR000340 Protein tyrosine (pTyr) phosphorylation is a common post-translational modification which can create novel recognition motifs for protein interactions and cellular localisation, affect protein stability, and regulate enzyme activity. Consequently, maintaining an appropriate level of protein tyrosine phosphorylation is essential for many cellular functions. Tyrosine-specific protein phosphatases (PTPase; 3.1.3.48 from EC) catalyse the removal of a phosphate group attached to a tyrosine residue, using a cysteinyl-phosphate enzyme intermediate. These enzymes are key regulatory components in signal transduction pathways (such as the MAP kinase pathway) and cell cycle control, and are important in the control of cell growth, proliferation, differentiation and transformation [, ]. The PTP superfamily can be divided into four subfamilies []: (1) pTyr-specific phosphatases (2) dual specificity phosphatases (dTyr and dSer/dThr) (3) Cdc25 phosphatases (dTyr and/or dThr) (4) LMW (low molecular weight) phosphatases Based on their cellular localisation, PTPases are also classified as: Receptor-like, which are transmembrane receptors that contain PTPase domains [] Non-receptor (intracellular) PTPases [] All PTPases carry the highly conserved active site motif C(X)5R (PTP signature motif), employ a common catalytic mechanism, and share a similar core structure made of a central parallel beta-sheet with flanking alpha-helices containing a beta-loop-alpha-loop that encompasses the PTP signature motif []. Functional diversity between PTPases is endowed by regulatory domains and subunits. This entry represents dual specificity protein-tyrosine phosphatases. Ser/Thr and Tyr dual specificity phosphatases are a group of enzymes with both Ser/Thr (3.1.3.16 from EC) and tyrosine specific protein phosphatase (3.1.3.48 from EC) activity able to remove both the serine/threonine or tyrosine-bound phosphate group from a wide range of phosphoproteins, including a number of enzymes which have been phosphorylated under the action of a kinase. Dual specificity protein phosphatases (DSPs) regulate mitogenic signal transduction and control the cell cycle. The crystal structure of a human DSP, vaccinia H1-related phosphatase (or VHR), has been determined at 2.1 angstrom resolution []. A shallow active site pocket in VHR allows for the hydrolysis of phosphorylated serine, threonine, or tyrosine protein residues, whereas the deeper active site of protein tyrosine phosphatases (PTPs) restricts substrate specificity to only phosphotyrosine. Positively charged crevices near the active site may explain the enzyme's preference for substrates with two phosphorylated residues. The VHR structure defines a conserved structural scaffold for both DSPs and PTPs. A "recognition region" connecting helix alpha1 to strand beta1, may determine differences in substrate specificity between VHR, the PTPs, and other DSPs. These proteins may also have inactive phosphatase domains, and dependent on the domain composition this loss of catalytic activity has different effects on protein function. Inactive single domain phosphatases can still specifically bind substrates, and protect again dephosphorylation, while the inactive domains of tandem phosphatases can be further subdivided into two classes. Those which bind phosphorylated tyrosine residues may recruit multi-phosphorylated substrates for the adjacent active domains and are more conserved, while the other class have accumulated several variable amino acid substitutions and have a complete loss of tyrosine binding capability. The second class shows a release of evolutionary constraint for the sites around the catalytic centre, which emphasises a difference in function from the first group. There is a region of higher conservation common to both classes, suggesting a new regulatory centre [].; GO: 0008138 protein tyrosine/serine/threonine phosphatase activity, 0006470 protein dephosphorylation; PDB: 2G6Z_A 1MKP_A 1YZ4_A 2P4D_A 1M3G_A 1ZZW_A 2OUD_A 2HXP_A 3LJ8_A 1OHD_A ....
Probab=93.89 E-value=0.11 Score=33.13 Aligned_cols=39 Identities=23% Similarity=0.226 Sum_probs=34.5
Q ss_pred HHHHHhCCCeEEEEeCCCCcCchhhhcCCceEEEeecCC
Q psy12446 32 YHELKKHQVKNVVRVCEPTYKVEDLKTEGINVKDLAYED 70 (77)
Q Consensus 32 i~elk~~~Vt~VVRvCe~tYd~~~~~~~GI~vhdl~F~D 70 (77)
++.+++.||+.|+-++.........+..|+..+.+|.+|
T Consensus 11 ~~~l~~~~I~~Vin~~~~~~~~~~~~~~~~~~~~i~~~D 49 (133)
T PF00782_consen 11 IAFLKNLGITHVINLQEECPNPYFYKPEGIEYLRIPIDD 49 (133)
T ss_dssp HHHHHHTTEEEEEECSSSSSTSHHHTTTTSEEEEEEEES
T ss_pred HHHHHHCCCCEEEEccCCCcCchhcccCCCEEEEEEecC
Confidence 678899999999999987666577888999999999999
No 9
>PF00102 Y_phosphatase: Protein-tyrosine phosphatase; InterPro: IPR000242 Protein tyrosine (pTyr) phosphorylation is a common post-translational modification which can create novel recognition motifs for protein interactions and cellular localisation, affect protein stability, and regulate enzyme activity. Consequently, maintaining an appropriate level of protein tyrosine phosphorylation is essential for many cellular functions. Tyrosine-specific protein phosphatases (PTPase; 3.1.3.48 from EC) catalyse the removal of a phosphate group attached to a tyrosine residue, using a cysteinyl-phosphate enzyme intermediate. These enzymes are key regulatory components in signal transduction pathways (such as the MAP kinase pathway) and cell cycle control, and are important in the control of cell growth, proliferation, differentiation and transformation [, ]. The PTP superfamily can be divided into four subfamilies []: (1) pTyr-specific phosphatases (2) dual specificity phosphatases (dTyr and dSer/dThr) (3) Cdc25 phosphatases (dTyr and/or dThr) (4) LMW (low molecular weight) phosphatases Based on their cellular localisation, PTPases are also classified as: Receptor-like, which are transmembrane receptors that contain PTPase domains [] Non-receptor (intracellular) PTPases [] All PTPases carry the highly conserved active site motif C(X)5R (PTP signature motif), employ a common catalytic mechanism, and share a similar core structure made of a central parallel beta-sheet with flanking alpha-helices containing a beta-loop-alpha-loop that encompasses the PTP signature motif []. Functional diversity between PTPases is endowed by regulatory domains and subunits. This entry repesents several receptor and non-receptor protein-tyrosine phosphatases. Structurally, all known receptor PTPases, are made up of a variable length extracellular domain, followed by a transmembrane region and a C-terminal catalytic cytoplasmic domain. Some of the receptor PTPases contain fibronectin type III (FN-III) repeats, immunoglobulin-like domains, MAM domains or carbonic anhydrase-like domains in their extracellular region. The cytoplasmic region generally contains two copies of the PTPase domain. The first seems to have enzymatic activity, while the second is inactive. The inactive domains of tandem phosphatases can be divided into two classes. Those which bind phosphorylated tyrosine residues may recruit multi-phosphorylated substrates for the adjacent active domains and are more conserved, while the other class have accumulated several variable amino acid substitutions and have a complete loss of tyrosine binding capability. The second class shows a release of evolutionary constraint for the sites around the catalytic centre, which emphasises a difference in function from the first group. There is a region of higher conservation common to both classes, suggesting a new regulatory centre []. PTPase domains consist of about 300 amino acids. There are two conserved cysteines, the second one has been shown to be absolutely required for activity. Furthermore, a number of conserved residues in its immediate vicinity have also been shown to be important.; GO: 0004725 protein tyrosine phosphatase activity, 0006470 protein dephosphorylation; PDB: 3O4T_A 3O4S_A 3O4U_A 2A3K_A 2QDP_A 2QDC_A 2QDM_A 2HVL_A 1ZC0_A 3D44_A ....
Probab=89.89 E-value=0.93 Score=30.79 Aligned_cols=37 Identities=11% Similarity=0.240 Sum_probs=30.0
Q ss_pred c-cceeEEee-cCCcccHHHHHHHHHhCCCeEEEEeCCC
Q psy12446 13 Y-KTVTLYHG-VNLENHLHLAYHELKKHQVKNVVRVCEP 49 (77)
Q Consensus 13 y-~~~~fli~-~pt~~tl~~yi~elk~~~Vt~VVRvCe~ 49 (77)
| ++-+|.+. +|+.+|+++|..-.-+++|+.||-+|..
T Consensus 38 ~~~~~~~I~~q~P~~~t~~~FW~mv~~~~~~~Iv~L~~~ 76 (235)
T PF00102_consen 38 YKNGKKFIVTQAPMPDTIEDFWQMVWEQKVQIIVMLCSF 76 (235)
T ss_dssp SSSTEEEEEEES-SGGGHHHHHHHHHHTTBSEEEEESBS
T ss_pred ccchhhheeecccccccccceehheeeccccceeccccc
Confidence 5 34455554 7999999999999999999999999973
No 10
>COG2453 CDC14 Predicted protein-tyrosine phosphatase [Signal transduction mechanisms]
Probab=86.14 E-value=0.78 Score=31.89 Aligned_cols=32 Identities=31% Similarity=0.429 Sum_probs=29.1
Q ss_pred EeCCCCcCchhhhcCCceEEEeecCCCCCCCC
Q psy12446 45 RVCEPTYKVEDLKTEGINVKDLAYEDGTSPSP 76 (77)
Q Consensus 45 RvCe~tYd~~~~~~~GI~vhdl~F~DG~~Pp~ 76 (77)
+.-.+.|.....+..|+++.++|+.||.+|+.
T Consensus 57 ~~~~~~~~~~~~~~~~~~~~~~~~~D~~~p~~ 88 (180)
T COG2453 57 ELLGALYNVAIEENDGIQVLHLPILDGTVPDL 88 (180)
T ss_pred eccccceecceeccCCceeeeeeecCCCCCcH
Confidence 67778899999999999999999999999973
No 11
>cd00047 PTPc Protein tyrosine phosphatases (PTP) catalyze the dephosphorylation of phosphotyrosine peptides; they regulate phosphotyrosine levels in signal transduction pathways. The depth of the active site cleft renders the enzyme specific for phosphorylated Tyr (pTyr) residues, instead of pSer or pThr. This family has a distinctive active site signature motif, HCSAGxGRxG. Characterized as either transmembrane, receptor-like or non-transmembrane (soluble) PTPs. Receptor-like PTP domains tend to occur in two copies in the cytoplasmic region of the transmembrane proteins, only one copy may be active.
Probab=85.86 E-value=1.3 Score=30.87 Aligned_cols=35 Identities=14% Similarity=0.078 Sum_probs=30.9
Q ss_pred ceeEEee-cCCcccHHHHHHHHHhCCCeEEEEeCCC
Q psy12446 15 TVTLYHG-VNLENHLHLAYHELKKHQVKNVVRVCEP 49 (77)
Q Consensus 15 ~~~fli~-~pt~~tl~~yi~elk~~~Vt~VVRvCe~ 49 (77)
..+|.++ +|...|+++|..-+-+++++.||.+|+.
T Consensus 40 ~~~fI~tQ~P~~~t~~~FW~mv~~~~~~~IV~l~~~ 75 (231)
T cd00047 40 PKAYIATQGPLPNTVEDFWRMVWEQKVPVIVMLTEL 75 (231)
T ss_pred CcceEEcCCCChhhHHHHHHHHHhcCCCEEEEcccc
Confidence 4677775 7999999999999999999999999863
No 12
>PRK00726 murG undecaprenyldiphospho-muramoylpentapeptide beta-N- acetylglucosaminyltransferase; Provisional
Probab=83.06 E-value=4.5 Score=29.41 Aligned_cols=54 Identities=13% Similarity=0.054 Sum_probs=40.1
Q ss_pred eeEEeec-CCcccHH---HHHHHHHhCCCeEEEEeCCCCcCchhhhcCCceEEEeecC
Q psy12446 16 VTLYHGV-NLENHLH---LAYHELKKHQVKNVVRVCEPTYKVEDLKTEGINVKDLAYE 69 (77)
Q Consensus 16 ~~fli~~-pt~~tl~---~yi~elk~~~Vt~VVRvCe~tYd~~~~~~~GI~vhdl~F~ 69 (77)
||.+|.. .+..... +..++|++.|..-.|-.+...++.+.+++.|+.++.+++.
T Consensus 2 ~~i~i~~~g~gG~~~~~~~la~~L~~~g~ev~vv~~~~~~~~~~~~~~g~~~~~~~~~ 59 (357)
T PRK00726 2 KKILLAGGGTGGHVFPALALAEELKKRGWEVLYLGTARGMEARLVPKAGIEFHFIPSG 59 (357)
T ss_pred cEEEEEcCcchHhhhHHHHHHHHHHhCCCEEEEEECCCchhhhccccCCCcEEEEecc
Confidence 7888875 3555555 6788899988877776665556567777789999988864
No 13
>smart00194 PTPc Protein tyrosine phosphatase, catalytic domain.
Probab=81.79 E-value=2.4 Score=30.23 Aligned_cols=36 Identities=11% Similarity=0.063 Sum_probs=31.2
Q ss_pred cceeEEee-cCCcccHHHHHHHHHhCCCeEEEEeCCC
Q psy12446 14 KTVTLYHG-VNLENHLHLAYHELKKHQVKNVVRVCEP 49 (77)
Q Consensus 14 ~~~~fli~-~pt~~tl~~yi~elk~~~Vt~VVRvCe~ 49 (77)
..-+|.++ +|...|+.+|..-.-+++|..||-+|+.
T Consensus 67 ~~~~fI~tQ~P~~~t~~dFW~mv~~~~~~~IVmL~~~ 103 (258)
T smart00194 67 GPKAYIATQGPLPSTVEDFWRMVWEEKVTTIVMLTEL 103 (258)
T ss_pred CccceEEeCCCchHHHHHHHHHHHhcCCCEEEEeccc
Confidence 34567675 7999999999999999999999999873
No 14
>KOG3201|consensus
Probab=78.05 E-value=2 Score=31.83 Aligned_cols=39 Identities=18% Similarity=0.160 Sum_probs=31.8
Q ss_pred EEeecC-CcccHHHHHHHHHhCCCeEEEEeC-CCCcCchhhhcCC
Q psy12446 18 LYHGVN-LENHLHLAYHELKKHQVKNVVRVC-EPTYKVEDLKTEG 60 (77)
Q Consensus 18 fli~~p-t~~tl~~yi~elk~~~Vt~VVRvC-e~tYd~~~~~~~G 60 (77)
=|+.+| ...+|+.|.+|.+.-| +++| ++.||...-+++|
T Consensus 136 Al~fsPRRg~sL~kF~de~~~~g----f~v~l~enyde~iwqrh~ 176 (201)
T KOG3201|consen 136 ALLFSPRRGQSLQKFLDEVGTVG----FTVCLEENYDEAIWQRHG 176 (201)
T ss_pred eeEecCcccchHHHHHHHHHhce----eEEEecccHhHHHHHHHH
Confidence 467888 9999999999999999 7888 5889986554443
No 15
>PF05771 Pox_A31: Poxvirus A31 protein; InterPro: IPR008786 This family contains the vaccinia virus A31R protein, the function of which is not known.
Probab=76.21 E-value=2.9 Score=28.67 Aligned_cols=24 Identities=13% Similarity=0.090 Sum_probs=22.6
Q ss_pred eeeccceeEEeecCCcccHHHHHH
Q psy12446 10 TLDYKTVTLYHGVNLENHLHLAYH 33 (77)
Q Consensus 10 ~~~y~~~~fli~~pt~~tl~~yi~ 33 (77)
.++||++.|.--.|.+++++.|+.
T Consensus 28 ~~~hk~~~FiFy~Pk~~~v~rYl~ 51 (114)
T PF05771_consen 28 VYDHKNMTFIFYAPKDSTVQRYLS 51 (114)
T ss_pred EEeeCCcEEEEECCcHHHHHHHHH
Confidence 689999999999999999999987
No 16
>PF04273 DUF442: Putative phosphatase (DUF442); InterPro: IPR005939 Although this domain is uncharacterised it seems likely that it performs a phosphatase function.; GO: 0016787 hydrolase activity; PDB: 2F46_A 3GXH_B 3GXG_B.
Probab=75.54 E-value=5.4 Score=26.25 Aligned_cols=45 Identities=13% Similarity=0.225 Sum_probs=26.2
Q ss_pred HHHHHHHHhCCCeEEEEeCC-------CCcCc--hhhhcCCceEEEeecCCCCC
Q psy12446 29 HLAYHELKKHQVKNVVRVCE-------PTYKV--EDLKTEGINVKDLAYEDGTS 73 (77)
Q Consensus 29 ~~yi~elk~~~Vt~VVRvCe-------~tYd~--~~~~~~GI~vhdl~F~DG~~ 73 (77)
+..++.+++.|++.||-+.- |.... +..++.|+..+++|..-|..
T Consensus 17 ~~d~~~la~~GfktVInlRpd~E~~~qp~~~~~~~~a~~~Gl~y~~iPv~~~~~ 70 (110)
T PF04273_consen 17 PEDLAQLAAQGFKTVINLRPDGEEPGQPSSAEEAAAAEALGLQYVHIPVDGGAI 70 (110)
T ss_dssp HHHHHHHHHCT--EEEE-S-TTSTTT-T-HHCHHHHHHHCT-EEEE----TTT-
T ss_pred HHHHHHHHHCCCcEEEECCCCCCCCCCCCHHHHHHHHHHcCCeEEEeecCCCCC
Confidence 34568999999999997652 33322 67899999999999997764
No 17
>PLN02727 NAD kinase
Probab=75.07 E-value=4.6 Score=36.05 Aligned_cols=58 Identities=22% Similarity=0.303 Sum_probs=39.3
Q ss_pred ceeEEee-cCCcccHHHHHHHHHhCCCeEEEEeCCC-----Cc---CchhhhcCCceEEEeecCCCCCCCC
Q psy12446 15 TVTLYHG-VNLENHLHLAYHELKKHQVKNVVRVCEP-----TY---KVEDLKTEGINVKDLAYEDGTSPSP 76 (77)
Q Consensus 15 ~~~fli~-~pt~~tl~~yi~elk~~~Vt~VVRvCe~-----tY---d~~~~~~~GI~vhdl~F~DG~~Pp~ 76 (77)
+..|... .|+...+ +.+.++|++.||=+... -| -.++-+++||+++++|+.++.+|+.
T Consensus 260 ~~~~~rsgQpspe~l----a~LA~~GfKTIINLRpd~E~~q~~~~ee~eAae~~GL~yVhIPVs~~~apt~ 326 (986)
T PLN02727 260 EAAFWRGGQVTEEGL----KWLLEKGFKTIVDLRAEIVKDNFYQAAVDDAISSGKIEVVKIPVEVRTAPSA 326 (986)
T ss_pred eeeEEEeCCCCHHHH----HHHHHCCCeEEEECCCCCcCCCchhHHHHHHHHHcCCeEEEeecCCCCCCCH
Confidence 3444442 3454443 55778999999977641 22 1245567999999999999888864
No 18
>PRK09545 znuA high-affinity zinc transporter periplasmic component; Reviewed
Probab=74.42 E-value=7.2 Score=29.41 Aligned_cols=50 Identities=16% Similarity=0.209 Sum_probs=39.3
Q ss_pred CCcccHHHHHHHHHhCCCeEEEEeCCCCcCch----hhhcCCceEEEe-ecCCCCCC
Q psy12446 23 NLENHLHLAYHELKKHQVKNVVRVCEPTYKVE----DLKTEGINVKDL-AYEDGTSP 74 (77)
Q Consensus 23 pt~~tl~~yi~elk~~~Vt~VVRvCe~tYd~~----~~~~~GI~vhdl-~F~DG~~P 74 (77)
||.+.+..-++.+|++||+.|+ +++.++.+ .-++-|+++..+ |+.+|..+
T Consensus 236 ps~~~l~~l~~~ik~~~v~~If--~e~~~~~~~~~~la~e~g~~v~~ldpl~~~~~~ 290 (311)
T PRK09545 236 PGAQRLHEIRTQLVEQKATCVF--AEPQFRPAVIESVAKGTSVRMGTLDPLGTNIKL 290 (311)
T ss_pred CCHHHHHHHHHHHHHcCCCEEE--ecCCCChHHHHHHHHhcCCeEEEeccccccccC
Confidence 6889999999999999999987 66777763 336678888776 77766544
No 19
>PF00762 Ferrochelatase: Ferrochelatase; InterPro: IPR001015 Synonym(s): Protohaem ferro-lyase, Iron chelatase, etc. Ferrochelatase catalyses the last step in haem biosynthesis: the chelation of a ferrous ion to proto-porphyrin IX, to form protohaem [, ]. In eukaryotic cells, it binds to the mitochondrial inner membrane with its active site on the matrix side of the membrane. The X-ray structure of Bacillus subtilis and human ferrochelatase have been solved [, ]. The human enzyme exists as a homodimer. Each subunit contains one [2Fe-2S] cluster. The monomer is folded into two similar domains, each with a four-stranded parallel beta-sheet flanked by an alpha-helix in a beta-alpha-beta motif that is reminiscent of the fold found in the periplasmic binding proteins. The topological similarity between the domains suggests that they have arisen from a gene duplication event. However, significant differences exist between the two domains, including an N-terminal section (residues 80-130) that forms part of the active site pocket, and a C-terminal extension (residues 390-423) that is involved in coordination of the [2Fe-2S] cluster and in stabilisation of the homodimer. Ferrochelatase seems to have a structurally conserved core region that is common to the enzyme from bacteria, plants and mammals. Porphyrin binds in the identified cleft; this cleft also includes the metal-binding site of the enzyme. It is likely that the structure of the cleft region will have different conformations upon substrate binding and release [].; GO: 0004325 ferrochelatase activity, 0006783 heme biosynthetic process; PDB: 2QD3_B 2HRE_C 3HCN_B 2PNJ_A 2QD1_C 1HRK_A 2QD4_B 3AQI_B 2HRC_B 3HCO_B ....
Probab=70.90 E-value=3.5 Score=31.43 Aligned_cols=49 Identities=27% Similarity=0.262 Sum_probs=36.1
Q ss_pred CcccHHHHHHHHHhCCCeEEEEeC--------CCCcCc-----hhhhcCCc-eEEEeecCCCC
Q psy12446 24 LENHLHLAYHELKKHQVKNVVRVC--------EPTYKV-----EDLKTEGI-NVKDLAYEDGT 72 (77)
Q Consensus 24 t~~tl~~yi~elk~~~Vt~VVRvC--------e~tYd~-----~~~~~~GI-~vhdl~F~DG~ 72 (77)
..-+.++-+++|++.||+.|+-+| |..|+- +.++++|+ ..+-.|-.+.+
T Consensus 239 L~P~~~~~l~~l~~~G~~~V~v~p~gFv~D~lETl~eidie~re~~~~~G~~~~~~ip~lN~~ 301 (316)
T PF00762_consen 239 LGPSTEDVLEELAKEGVKRVVVVPPGFVSDCLETLYEIDIEYRELAEEAGGEEFVRIPCLNDS 301 (316)
T ss_dssp SSSBHHHHHHHHHHCT-SEEEEEETT-SSSSHHHHCCCCCHHHHHHHHHTCCEEEE---STT-
T ss_pred ccccHHHHHHHHHhcCCCeEEEECCccccccHhHHHHHHHHHHHHHHHcCCceEEEeCCCCCC
Confidence 667889999999999999999998 355665 88899999 66666666554
No 20
>PRK12361 hypothetical protein; Provisional
Probab=68.88 E-value=11 Score=30.28 Aligned_cols=53 Identities=21% Similarity=0.369 Sum_probs=36.6
Q ss_pred EEee-cCCcccHHHHHHHHHhCCCeEEEEeCCCCcCc--hhhhcCCceEEEeecCCCCCCC
Q psy12446 18 LYHG-VNLENHLHLAYHELKKHQVKNVVRVCEPTYKV--EDLKTEGINVKDLAYEDGTSPS 75 (77)
Q Consensus 18 fli~-~pt~~tl~~yi~elk~~~Vt~VVRvCe~tYd~--~~~~~~GI~vhdl~F~DG~~Pp 75 (77)
.++. .|+.++ ++.|+++||+.||=+|.. ++. ......|++.+.+|-.|...|+
T Consensus 102 l~lG~~~~a~d----~~~L~~~gI~~Vldlt~E-~~~~~~~~~~~~i~yl~iPi~D~~~p~ 157 (547)
T PRK12361 102 LYLGCRLFPAD----LEKLKSNKITAILDVTAE-FDGLDWSLTEEDIDYLNIPILDHSVPT 157 (547)
T ss_pred EEECCCCCccc----HHHHHHcCCCEEEEcccc-cccccccccccCceEEEeecCCCCCCc
Confidence 4453 355443 567889999999977631 221 2334578999999999988875
No 21
>PHA02746 protein tyrosine phosphatase; Provisional
Probab=68.66 E-value=8.9 Score=29.17 Aligned_cols=33 Identities=9% Similarity=0.132 Sum_probs=29.0
Q ss_pred eeEEee-cCCcccHHHHHHHHHhCCCeEEEEeCC
Q psy12446 16 VTLYHG-VNLENHLHLAYHELKKHQVKNVVRVCE 48 (77)
Q Consensus 16 ~~fli~-~pt~~tl~~yi~elk~~~Vt~VVRvCe 48 (77)
-+|..+ +|.+.|+.+|..-.=+++|..||-+|+
T Consensus 113 ~~fIaTQ~Pl~~T~~dFW~MIwe~~v~vIVMLt~ 146 (323)
T PHA02746 113 NKFICAQGPKEDTSEDFFKLISEHESQVIVSLTD 146 (323)
T ss_pred CeEEEeCCCChhhHHHHHHHHHhhCCCEEEEecc
Confidence 456556 799999999999999999999999986
No 22
>PF04343 DUF488: Protein of unknown function, DUF488; InterPro: IPR007438 This family includes several proteins of uncharacterised function.
Probab=66.33 E-value=12 Score=24.15 Aligned_cols=41 Identities=27% Similarity=0.388 Sum_probs=29.6
Q ss_pred cHHHHHHHHHhCCCeEEE--EeCC----CCcCc----hhhhcCCceEEEee
Q psy12446 27 HLHLAYHELKKHQVKNVV--RVCE----PTYKV----EDLKTEGINVKDLA 67 (77)
Q Consensus 27 tl~~yi~elk~~~Vt~VV--RvCe----~tYd~----~~~~~~GI~vhdl~ 67 (77)
.++.|++.++++||+.|| |..= |-++. ..++++||.-..++
T Consensus 1 ~~e~f~~~l~~~~i~~lVDVR~~P~S~~~~~~k~~l~~~l~~~gi~Y~~~~ 51 (122)
T PF04343_consen 1 SIERFYDLLKKNGIRVLVDVRLWPRSRKPGFNKEDLASFLEEAGIEYVWLP 51 (122)
T ss_pred CHHHHHHHHHHCCCeEEEEECCCCCCCCCCCCHHHHHHHHHHCCceEeech
Confidence 478899999999999998 4442 34565 45677899865443
No 23
>PF12122 DUF3582: Protein of unknown function (DUF3582); InterPro: IPR022732 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This entry represents the N-terminal domain of membrane-bound serine endopeptidases belonging to MEROPS peptidase family S54 (rhomboid-1, clan ST). This domain contains a conserved ASW sequence motif and a single completely conserved residue F that may be functionally important. The tertiary structure of the GlpG protein from Escherichia coli has been determined []. The GlpG protein has six transmembrane domains (other members of the family are predicted to have seven), with the N- and C-terminal ends anchored in the cytoplasm. One transmembrane domain is shorter than the rest, creating an internal, aqueous cavity just below the membrane surface and it is here were proteolysis occurs. There is also a membrane-embedded loop between the first and second transmembrane domains which is postulated to act as a gate controlling substrate access to the active site. No other family of serine peptidases is known to have active site residues within transmembrane domains (although transmembrane active sites are known for aspartic peptidase and metallopeptidases), and the GlpG protein has the type structure for clan ST.; GO: 0004252 serine-type endopeptidase activity, 0016021 integral to membrane; PDB: 3UBB_A 3B45_A 3B44_A 2NRF_A 3TXT_A 2O7L_A 2XTU_A 2IRV_A 2XOW_A 2XTV_A ....
Probab=65.95 E-value=10 Score=24.84 Aligned_cols=31 Identities=6% Similarity=-0.016 Sum_probs=24.8
Q ss_pred eeEEeecCCcccHHHHHHHHHhCCCeEEEEe
Q psy12446 16 VTLYHGVNLENHLHLAYHELKKHQVKNVVRV 46 (77)
Q Consensus 16 ~~fli~~pt~~tl~~yi~elk~~~Vt~VVRv 46 (77)
|+.|+..+..+--+.|+.+|+..||..-++.
T Consensus 1 M~~l~~~~n~r~AqaF~DYl~sqgI~~~i~~ 31 (101)
T PF12122_consen 1 MIRLGSLNNPRAAQAFIDYLASQGIELQIEP 31 (101)
T ss_dssp -EEEEEESSHHHHHHHHHHHHHTT--EEEE-
T ss_pred CeEEEecCCHHHHHHHHHHHHHCCCeEEEEE
Confidence 6777888899999999999999999999986
No 24
>TIGR02571 ComEB ComE operon protein 2. This protein is found in the ComE operon for "late competence" as characterized in B. subtilis. Proteins in this family contain homology to a cytidine/deoxycytidine deaminase domain family (pfam00383), and may carry out this activity.
Probab=64.50 E-value=14 Score=25.58 Aligned_cols=54 Identities=11% Similarity=0.256 Sum_probs=39.3
Q ss_pred eccceeEEeec-CCcccHHHHHHHHHhCCCeEEEEeCCC---CcCchhhhcCCceEEEeecC
Q psy12446 12 DYKTVTLYHGV-NLENHLHLAYHELKKHQVKNVVRVCEP---TYKVEDLKTEGINVKDLAYE 69 (77)
Q Consensus 12 ~y~~~~fli~~-pt~~tl~~yi~elk~~~Vt~VVRvCe~---tYd~~~~~~~GI~vhdl~F~ 69 (77)
+-++.+.+.+. |=. .+...+.+.||+.||-.-.. .+..+.|+++||+|.+++++
T Consensus 84 ~l~g~tlYvT~ePC~----~Ca~ai~~agI~~Vvy~~~~~~~~~~~~~l~~~gi~v~~~~~~ 141 (151)
T TIGR02571 84 STEGAEIYVTHFPCL----QCTKSIIQAGIKKIYYAQDYHNHPYAIELFEQAGVELKKVPFD 141 (151)
T ss_pred CcCCcEEEEeCCCcH----HHHHHHHHhCCCEEEEccCCCCcHHHHHHHHHCCCEEEEeCcc
Confidence 45677788864 522 58888899999999865321 12347999999999998754
No 25
>PHA02742 protein tyrosine phosphatase; Provisional
Probab=64.09 E-value=12 Score=28.10 Aligned_cols=33 Identities=15% Similarity=0.157 Sum_probs=29.2
Q ss_pred eeEEee-cCCcccHHHHHHHHHhCCCeEEEEeCC
Q psy12446 16 VTLYHG-VNLENHLHLAYHELKKHQVKNVVRVCE 48 (77)
Q Consensus 16 ~~fli~-~pt~~tl~~yi~elk~~~Vt~VVRvCe 48 (77)
-+|..+ +|..+|+.+|..-.-+++|..||-+|.
T Consensus 93 ~~fIatQ~Pl~~T~~dFW~MVw~~~~~vIVmLt~ 126 (303)
T PHA02742 93 GRFICTQAPLEETALDFWQAIFQDQVRVIVMITK 126 (303)
T ss_pred CeEEEECCCCcccHHHHHHHHhhcCCCeEEEcch
Confidence 466666 799999999999999999999999985
No 26
>PF10369 ALS_ss_C: Small subunit of acetolactate synthase; InterPro: IPR019455 This entry represents the C-terminal domain of the small subunit of acetolactate synthase (the N-terminal domain being an ACT domain). Acetolactate synthase is a tetrameric enzyme, composed of two large and two small subunits, which catalyses the first step in branched-chain amino acid biosynthesis. This reaction is sensitive to certain herbicides []. ; PDB: 2F1F_B 2FGC_A 2PC6_A.
Probab=62.46 E-value=6.2 Score=24.16 Aligned_cols=38 Identities=13% Similarity=0.174 Sum_probs=28.2
Q ss_pred eeeeccceeEEee-cCCcccHHHHHHHHHhCCCeEEEEe
Q psy12446 9 RTLDYKTVTLYHG-VNLENHLHLAYHELKKHQVKNVVRV 46 (77)
Q Consensus 9 ~~~~y~~~~fli~-~pt~~tl~~yi~elk~~~Vt~VVRv 46 (77)
+-||+..=.+.|- .-+...+++|++.|+++|+..++|.
T Consensus 29 ~ivd~~~~~~iie~tG~~~kid~fi~~l~~~gi~Ei~Rt 67 (75)
T PF10369_consen 29 RIVDVSPDSIIIELTGTPEKIDAFIKLLKPFGILEIART 67 (75)
T ss_dssp EEEEEETTEEEEEEEE-HHHHHHHHHHSTGGGEEEEEEE
T ss_pred EEEEECCCEEEEEEcCCHHHHHHHHHHhhhcCCEEEEcc
Confidence 4455554445553 3577889999999999999999985
No 27
>PHA02738 hypothetical protein; Provisional
Probab=61.76 E-value=12 Score=28.36 Aligned_cols=33 Identities=18% Similarity=0.226 Sum_probs=29.2
Q ss_pred eeEEee-cCCcccHHHHHHHHHhCCCeEEEEeCC
Q psy12446 16 VTLYHG-VNLENHLHLAYHELKKHQVKNVVRVCE 48 (77)
Q Consensus 16 ~~fli~-~pt~~tl~~yi~elk~~~Vt~VVRvCe 48 (77)
-+|..+ +|...|..+|+.-.=+++|..||-+|+
T Consensus 90 ~kfI~TQ~Pl~~T~~dFW~MVwe~~v~~IVmL~~ 123 (320)
T PHA02738 90 KKFICGQAPTRQTCYDFYRMLWMEHVQIIVMLCK 123 (320)
T ss_pred ceeEEecCCChHHHHHHHHHHHHhCCcEEEEeee
Confidence 356666 799999999999999999999999985
No 28
>PHA02747 protein tyrosine phosphatase; Provisional
Probab=61.71 E-value=15 Score=27.81 Aligned_cols=34 Identities=3% Similarity=-0.030 Sum_probs=29.3
Q ss_pred eeEEee-cCCcccHHHHHHHHHhCCCeEEEEeCCC
Q psy12446 16 VTLYHG-VNLENHLHLAYHELKKHQVKNVVRVCEP 49 (77)
Q Consensus 16 ~~fli~-~pt~~tl~~yi~elk~~~Vt~VVRvCe~ 49 (77)
-+|..+ .|...|+.+|..-.=+++|..||-+|+.
T Consensus 93 ~~yIaTQ~Pl~~T~~dFW~MVwe~~~~~IVMLt~~ 127 (312)
T PHA02747 93 KKFIATQGPFAETCADFWKAVWQEHCSIIVMLTPT 127 (312)
T ss_pred CeEEEeCCCcchhHHHHHHHHHhccCCEEEEcccc
Confidence 356666 7999999999999999999999998864
No 29
>PF08257 Sulfakinin: Sulfakinin family; InterPro: IPR013259 The sulfakinin (SK) family of neuropeptides have only been identified in crustaceans and insects. For most species there is the potential for producing two sulfakinin peptides, one has a short sulfakinin sequence. The function of the sulfakinins is difficult to assess. For the Periplaneta americana (American cockroach), various forms of the endogenous sulfakinins have been shown to be active on the hindgut, and also on the heart. In Calliphora vomitoria (Blue blowfly) the peptides act as neurotransmitters or neuromodulators, linking the brain with all thoracic and abdominal ganglia. In adults of Penaeus monodon (Penoeid shrimp) they appear to be restricted to a few neurones in the brain with a neural pathway extending along to the ventral thoracic and abdominal ganglia [].
Probab=60.84 E-value=2.5 Score=17.25 Aligned_cols=7 Identities=29% Similarity=0.691 Sum_probs=5.9
Q ss_pred eccceeE
Q psy12446 12 DYKTVTL 18 (77)
Q Consensus 12 ~y~~~~f 18 (77)
+|-+|||
T Consensus 3 dyghmrf 9 (9)
T PF08257_consen 3 DYGHMRF 9 (9)
T ss_pred ccccccC
Confidence 6888987
No 30
>cd01137 PsaA Metal binding protein PsaA. These proteins have been shown to function as initial receptors in ABC transport of Mn2+ and as surface adhesins in some eubacterial species. They belong to the TroA superfamily of periplasmic metal binding proteins that share a distinct fold and ligand binding mechanism. A typical TroA protein is comprised of two globular subdomains connected by a single helix and can bind the metal ion in the cleft between these domains. In addition, these proteins sometimes have a low complexity region containing a metal-binding histidine-rich motif (repetitive HDH sequence).
Probab=60.49 E-value=13 Score=27.51 Aligned_cols=48 Identities=21% Similarity=0.265 Sum_probs=37.4
Q ss_pred CCcccHHHHHHHHHhCCCeEEEEeCCCCcCc----hhhhcCCceEEEeecCCCC
Q psy12446 23 NLENHLHLAYHELKKHQVKNVVRVCEPTYKV----EDLKTEGINVKDLAYEDGT 72 (77)
Q Consensus 23 pt~~tl~~yi~elk~~~Vt~VVRvCe~tYd~----~~~~~~GI~vhdl~F~DG~ 72 (77)
|+.+.|...++.+|++||+.|+ .++.++. ..-++.|+.+....+.|+-
T Consensus 210 ps~~~l~~l~~~ik~~~v~~if--~e~~~~~~~~~~ia~~~gv~v~~~~~~d~l 261 (287)
T cd01137 210 GTPKQVATLIEQVKKEKVPAVF--VESTVNDRLMKQVAKETGAKIGGQLYTDSL 261 (287)
T ss_pred CCHHHHHHHHHHHHHhCCCEEE--EeCCCChHHHHHHHHHhCCccccccccccC
Confidence 7889999999999999999888 6777666 4445679987555666654
No 31
>PF13911 AhpC-TSA_2: AhpC/TSA antioxidant enzyme
Probab=59.99 E-value=9.4 Score=23.95 Aligned_cols=30 Identities=30% Similarity=0.256 Sum_probs=23.2
Q ss_pred HHHHHHHHHhCCCeEEEEeCCCCcCchhhh
Q psy12446 28 LHLAYHELKKHQVKNVVRVCEPTYKVEDLK 57 (77)
Q Consensus 28 l~~yi~elk~~~Vt~VVRvCe~tYd~~~~~ 57 (77)
|..+.++|++.||+.|+-.|+..=..+.|.
T Consensus 2 L~~~~~~l~~~gv~lv~I~~g~~~~~~~f~ 31 (115)
T PF13911_consen 2 LSRRKPELEAAGVKLVVIGCGSPEGIEKFC 31 (115)
T ss_pred hhHhHHHHHHcCCeEEEEEcCCHHHHHHHH
Confidence 567789999999999999998552255555
No 32
>cd01017 AdcA Metal binding protein AcdA. These proteins have been shown to function in the ABC uptake of Zn2+ and Mn2+ and in competence for genetic transformation and adhesion. The AcdA proteins belong to the TroA superfamily of helical backbone metal receptor proteins that share a distinct fold and ligand binding mechanism. They are comprised of two globular subdomains connected by a long alpha helix and they bind their ligand in the cleft between these domains. In addition, many of these proteins have a low complexity region containing metal binding histidine-rich motif (repetitive HDH sequence).
Probab=58.55 E-value=18 Score=26.40 Aligned_cols=43 Identities=21% Similarity=0.128 Sum_probs=34.7
Q ss_pred CCcccHHHHHHHHHhCCCeEEEEeCCCCcCch----hhhcCCceEEEee
Q psy12446 23 NLENHLHLAYHELKKHQVKNVVRVCEPTYKVE----DLKTEGINVKDLA 67 (77)
Q Consensus 23 pt~~tl~~yi~elk~~~Vt~VVRvCe~tYd~~----~~~~~GI~vhdl~ 67 (77)
|+.+.+...++.+|++||..|+ +++.++.+ .-++.|+.+..+.
T Consensus 204 ps~~~l~~l~~~ik~~~v~~if--~e~~~~~~~~~~la~~~g~~v~~ld 250 (282)
T cd01017 204 PSPKQLAELVEFVKKSDVKYIF--FEENASSKIAETLAKETGAKLLVLN 250 (282)
T ss_pred CCHHHHHHHHHHHHHcCCCEEE--EeCCCChHHHHHHHHHcCCcEEEec
Confidence 7899999999999999999776 77777763 3467899887653
No 33
>cd01019 ZnuA Zinc binding protein ZnuA. These proteins have been shown to function as initial receptors in the ABC uptake of Zn2+. They belong to the TroA superfamily of periplasmic metal binding proteins that share a distinct fold and ligand binding mechanism. They are comprised of two globular subdomains connected by a single helix and bind their specific ligands in the cleft between these domains. A typical TroA protein is comprised of two globular subdomains connected by a single helix and can bind the metal ion in the cleft between these domains. In addition, these proteins sometimes have a low complexity region containing a metal-binding histidine-rich motif (repetitive HDH sequence).
Probab=58.41 E-value=24 Score=26.08 Aligned_cols=47 Identities=19% Similarity=0.205 Sum_probs=36.3
Q ss_pred CCcccHHHHHHHHHhCCCeEEEEeCCCCcCc----hhhhcCCceEEEe-ecCCC
Q psy12446 23 NLENHLHLAYHELKKHQVKNVVRVCEPTYKV----EDLKTEGINVKDL-AYEDG 71 (77)
Q Consensus 23 pt~~tl~~yi~elk~~~Vt~VVRvCe~tYd~----~~~~~~GI~vhdl-~F~DG 71 (77)
|+.+.|...++.+|++||..|+ +++..+. ..-++.|+.+..+ ++..+
T Consensus 212 ps~~~l~~l~~~ik~~~v~~If--~e~~~~~~~~~~ia~~~g~~v~~l~~l~~~ 263 (286)
T cd01019 212 PGAKRLAKIRKEIKEKGATCVF--AEPQFHPKIAETLAEGTGAKVGELDPLGGL 263 (286)
T ss_pred CCHHHHHHHHHHHHHcCCcEEE--ecCCCChHHHHHHHHhcCceEEEecccccc
Confidence 6889999999999999999996 7777665 4455678887654 55443
No 34
>PF13579 Glyco_trans_4_4: Glycosyl transferase 4-like domain; PDB: 3C4Q_B 3C4V_A 3C48_B 1Z2T_A.
Probab=57.27 E-value=29 Score=21.06 Aligned_cols=44 Identities=16% Similarity=0.213 Sum_probs=25.9
Q ss_pred HHHHHHHHHhCCCeEEEEeCC-CCcCchhhhcCCceEEEeecCCCC
Q psy12446 28 LHLAYHELKKHQVKNVVRVCE-PTYKVEDLKTEGINVKDLAYEDGT 72 (77)
Q Consensus 28 l~~yi~elk~~~Vt~VVRvCe-~tYd~~~~~~~GI~vhdl~F~DG~ 72 (77)
+....+.|.+.|.+-.| +|- +..........|+.++.++++...
T Consensus 7 ~~~l~~~L~~~G~~V~v-~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 51 (160)
T PF13579_consen 7 VRELARALAARGHEVTV-VTPQPDPEDDEEEEDGVRVHRLPLPRRP 51 (160)
T ss_dssp HHHHHHHHHHTT-EEEE-EEE---GGG-SEEETTEEEEEE--S-SS
T ss_pred HHHHHHHHHHCCCEEEE-EecCCCCcccccccCCceEEeccCCccc
Confidence 46678899999975443 442 223333477899999999887664
No 35
>PF03162 Y_phosphatase2: Tyrosine phosphatase family; InterPro: IPR004861 Protein tyrosine (pTyr) phosphorylation is a common post-translational modification which can create novel recognition motifs for protein interactions and cellular localisation, affect protein stability, and regulate enzyme activity. Consequently, maintaining an appropriate level of protein tyrosine phosphorylation is essential for many cellular functions. Tyrosine-specific protein phosphatases (PTPase; 3.1.3.48 from EC) catalyse the removal of a phosphate group attached to a tyrosine residue, using a cysteinyl-phosphate enzyme intermediate. These enzymes are key regulatory components in signal transduction pathways (such as the MAP kinase pathway) and cell cycle control, and are important in the control of cell growth, proliferation, differentiation and transformation [, ]. The PTP superfamily can be divided into four subfamilies []: (1) pTyr-specific phosphatases (2) dual specificity phosphatases (dTyr and dSer/dThr) (3) Cdc25 phosphatases (dTyr and/or dThr) (4) LMW (low molecular weight) phosphatases Based on their cellular localisation, PTPases are also classified as: Receptor-like, which are transmembrane receptors that contain PTPase domains [] Non-receptor (intracellular) PTPases [] All PTPases carry the highly conserved active site motif C(X)5R (PTP signature motif), employ a common catalytic mechanism, and share a similar core structure made of a central parallel beta-sheet with flanking alpha-helices containing a beta-loop-alpha-loop that encompasses the PTP signature motif []. Functional diversity between PTPases is endowed by regulatory domains and subunits. This entry represents protein-tyrosine phosphatases predominantly from fungi, plants and bacteria, several of which are putative enzymes. These proteins are closely related to the Y-phosphatase and DSPc families. This entry includes the PTPase SIW14 from Saccharomyces cerevisiae (Baker's yeast), which plays a role in actin filament organisation and endocytosis.; PDB: 2Q47_A 1XRI_A.
Probab=56.42 E-value=14 Score=25.65 Aligned_cols=48 Identities=8% Similarity=0.103 Sum_probs=23.3
Q ss_pred cCCcccHHHHHHHHHhCCCeEEEEeCCCCcCc--h-hhhcCCceEEEeecCCCCC
Q psy12446 22 VNLENHLHLAYHELKKHQVKNVVRVCEPTYKV--E-DLKTEGINVKDLAYEDGTS 73 (77)
Q Consensus 22 ~pt~~tl~~yi~elk~~~Vt~VVRvCe~tYd~--~-~~~~~GI~vhdl~F~DG~~ 73 (77)
.|+..++ +.|+++|+++||=++...+.. . ..+++||++++++......
T Consensus 19 ~P~~~n~----~fL~~L~LKTII~L~~e~~~~~~~~f~~~~~I~l~~~~~~~~~~ 69 (164)
T PF03162_consen 19 QPTPANF----PFLERLGLKTIINLRPEPPSQDFLEFAEENGIKLIHIPMSSSKD 69 (164)
T ss_dssp S--HHHH----HHHHHHT-SEEEE--SS---HHHHHHHHHTT-EEEE-------G
T ss_pred CCChhhH----HHHHHCCCceEEEecCCCCCHHHHHHHhhcCceEEEeccccccC
Confidence 3554443 567889999999988654443 2 3489999999998875544
No 36
>TIGR01244 conserved hypothetical protein TIGR01244. No member of this family is characterized. The member from Xylella fastidiosa is a longer protein with an N-terminal region described by this model, followed by a metallo-beta-lactamase family domain and an additional C-terminal region. Members scoring above the trusted cutoff are limited to the proteobacteria.
Probab=56.34 E-value=25 Score=23.27 Aligned_cols=44 Identities=16% Similarity=0.237 Sum_probs=29.6
Q ss_pred HHHHHHHhCCCeEEE--EeC-C----CCcCc--hhhhcCCceEEEeecCCCCC
Q psy12446 30 LAYHELKKHQVKNVV--RVC-E----PTYKV--EDLKTEGINVKDLAYEDGTS 73 (77)
Q Consensus 30 ~yi~elk~~~Vt~VV--RvC-e----~tYd~--~~~~~~GI~vhdl~F~DG~~ 73 (77)
..++.|++.|++.|+ |-= | |.... +..+..|++.+++|+..+..
T Consensus 18 ~d~~~L~~~GiktVIdlR~~~E~~~~p~~~~~~~~a~~~gl~y~~iPv~~~~~ 70 (135)
T TIGR01244 18 ADAAQAAQLGFKTVINNRPDREEESQPDFAQIKAAAEAAGVTYHHQPVTAGDI 70 (135)
T ss_pred HHHHHHHHCCCcEEEECCCCCCCCCCCCHHHHHHHHHHCCCeEEEeecCCCCC
Confidence 344678999999999 322 2 22211 24467899999999887653
No 37
>cd04876 ACT_RelA-SpoT ACT domain found C-terminal of the RelA/SpoT domains. ACT_RelA-SpoT: the ACT domain found C-terminal of the RelA/SpoT domains. Enzymes of the Rel/Spo family enable bacteria to survive prolonged periods of nutrient limitation by controlling guanosine-3'-diphosphate-5'-(tri)diphosphate ((p)ppGpp) production and subsequent rRNA repression (stringent response). Both the synthesis of (p)ppGpp from ATP and GDP(GTP), and its hydrolysis to GDP(GTP) and pyrophosphate, are catalyzed by Rel/Spo proteins. In Escherichia coli and its close relatives, the metabolism of (p)ppGpp is governed by two homologous proteins, RelA and SpoT. The RelA protein catalyzes (p)ppGpp synthesis in a reaction requiring its binding to ribosomes bearing codon-specified uncharged tRNA. The major role of the SpoT protein is the breakdown of (p)ppGpp by a manganese-dependent (p)ppGpp pyrophosphohydrolase activity. Although the stringent response appears to be tightly regulated by these two enzymes i
Probab=56.11 E-value=21 Score=18.40 Aligned_cols=33 Identities=27% Similarity=0.250 Sum_probs=23.8
Q ss_pred ccceeEEeecCCcccHHHHHHHHHhC-CCeEEEE
Q psy12446 13 YKTVTLYHGVNLENHLHLAYHELKKH-QVKNVVR 45 (77)
Q Consensus 13 y~~~~fli~~pt~~tl~~yi~elk~~-~Vt~VVR 45 (77)
+-.+.|-+..++...+...++.|++. ||..|-|
T Consensus 38 ~~~~~~~~~~~~~~~~~~~~~~l~~~~~v~~v~r 71 (71)
T cd04876 38 LATIRLTLEVRDLEHLARIMRKLRQIPGVIDVRR 71 (71)
T ss_pred EEEEEEEEEECCHHHHHHHHHHHhCCCCcEEEEC
Confidence 33456656678888899999999975 7766654
No 38
>PF12646 DUF3783: Domain of unknown function (DUF3783); InterPro: IPR016621 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=56.10 E-value=14 Score=21.54 Aligned_cols=28 Identities=11% Similarity=0.012 Sum_probs=23.1
Q ss_pred eEEeecC-CcccHHHHHHHHHhCCCeEEE
Q psy12446 17 TLYHGVN-LENHLHLAYHELKKHQVKNVV 44 (77)
Q Consensus 17 ~fli~~p-t~~tl~~yi~elk~~~Vt~VV 44 (77)
+|++.+. ++.-+..++..+|+.|+...+
T Consensus 2 ~~ll~~g~~~~el~~~l~~~r~~~~~~~~ 30 (58)
T PF12646_consen 2 EFLLFSGFSGEELDKFLDALRKAGIPIPL 30 (58)
T ss_pred CEEEECCCCHHHHHHHHHHHHHcCCCcce
Confidence 4777777 889999999999999994443
No 39
>PHA02588 cd deoxycytidylate deaminase; Provisional
Probab=55.34 E-value=25 Score=24.65 Aligned_cols=51 Identities=18% Similarity=0.288 Sum_probs=39.0
Q ss_pred eeccceeEEeec-CCcccHHHHHHHHHhCCCeEEEEeCCCCcC------chhhhcCCceEEEee
Q psy12446 11 LDYKTVTLYHGV-NLENHLHLAYHELKKHQVKNVVRVCEPTYK------VEDLKTEGINVKDLA 67 (77)
Q Consensus 11 ~~y~~~~fli~~-pt~~tl~~yi~elk~~~Vt~VVRvCe~tYd------~~~~~~~GI~vhdl~ 67 (77)
.+.++.+++++. |= ..+..-+...||+.||--+. |+ .+.|+++||+|..++
T Consensus 97 ~~~~g~tLYvTlePC----~~Ca~aI~~~gI~rVvy~~~--~~~~~~~~~~~L~~~Gi~v~~~~ 154 (168)
T PHA02588 97 ISIEGATMYVTASPC----PDCAKAIAQSGIKKLVYCEK--YDRNGPGWDDILRKSGIEVIQIP 154 (168)
T ss_pred CCCCCcEEEEeCCCc----HHHHHHHHHhCCCEEEEeec--cCCCcHHHHHHHHHCCCEEEEeC
Confidence 456778888864 54 46888889999999997764 54 268999999998753
No 40
>cd03819 GT1_WavL_like This family is most closely related to the GT1 family of glycosyltransferases. WavL in Vibrio cholerae has been shown to be involved in the biosynthesis of the lipopolysaccharide core.
Probab=53.26 E-value=38 Score=23.71 Aligned_cols=46 Identities=15% Similarity=0.127 Sum_probs=32.8
Q ss_pred cHHHHHHHHHhCCCeEEEEeCCCCcCchhhhcCCceEEEeecCCCCC
Q psy12446 27 HLHLAYHELKKHQVKNVVRVCEPTYKVEDLKTEGINVKDLAYEDGTS 73 (77)
Q Consensus 27 tl~~yi~elk~~~Vt~VVRvCe~tYd~~~~~~~GI~vhdl~F~DG~~ 73 (77)
-+....++|++.|..-.| ++...+..+.++..|+.++.+++..+.+
T Consensus 15 ~~~~l~~~L~~~g~~v~v-~~~~~~~~~~~~~~~~~~~~~~~~~~~~ 60 (355)
T cd03819 15 GTLELARALVERGHRSLV-ASAGGRLVAELEAEGSRHIKLPFISKNP 60 (355)
T ss_pred HHHHHHHHHHHcCCEEEE-EcCCCchHHHHHhcCCeEEEccccccch
Confidence 445566788999977555 4445556677888999999888766543
No 41
>COG5603 TRS20 Subunit of TRAPP, an ER-Golgi tethering complex [Cell motility and secretion]
Probab=51.57 E-value=24 Score=24.73 Aligned_cols=52 Identities=13% Similarity=0.176 Sum_probs=38.0
Q ss_pred eeeeeeeccceeEEeec-C-CcccHHHHHHHHHhCCCeEEEEeCCCCcC-chhhhcCC
Q psy12446 6 LICRTLDYKTVTLYHGV-N-LENHLHLAYHELKKHQVKNVVRVCEPTYK-VEDLKTEG 60 (77)
Q Consensus 6 ~~~~~~~y~~~~fli~~-p-t~~tl~~yi~elk~~~Vt~VVRvCe~tYd-~~~~~~~G 60 (77)
.|..-|+|.+|+|+.-+ | +..|+..|.+|+.+-=++.+. .|.|. .+.+...|
T Consensus 69 ~IsAYi~pSgmkf~~iH~n~s~~N~rsF~qevHely~ktLm---spfy~~~~pirsqa 123 (136)
T COG5603 69 RISAYIMPSGMKFLFIHQNQSRKNARSFLQEVHELYAKTLM---SPFYEPDEPIRSQA 123 (136)
T ss_pred EEEEEEccCCceEEEEeccchhhhHHHHHHHHHHHHHHHhh---CcccCCCccccchh
Confidence 46667899999999976 5 888999999998877666655 35666 34444443
No 42
>PF13344 Hydrolase_6: Haloacid dehalogenase-like hydrolase; PDB: 2HO4_B 1YV9_A 1WVI_B 3EPR_A 2P27_A 2OYC_A 2CFT_A 2P69_A 2CFS_A 2CFR_A ....
Probab=49.32 E-value=31 Score=21.74 Aligned_cols=34 Identities=21% Similarity=0.154 Sum_probs=16.6
Q ss_pred HHHHHHHHhCCCeEEEEeCCCCcCc----hhhhcCCce
Q psy12446 29 HLAYHELKKHQVKNVVRVCEPTYKV----EDLKTEGIN 62 (77)
Q Consensus 29 ~~yi~elk~~~Vt~VVRvCe~tYd~----~~~~~~GI~ 62 (77)
..+|+.|++.|..-++-.|.++-.. +.|++.||.
T Consensus 20 ~e~l~~L~~~g~~~~~lTNns~~s~~~~~~~L~~~Gi~ 57 (101)
T PF13344_consen 20 VEALDALRERGKPVVFLTNNSSRSREEYAKKLKKLGIP 57 (101)
T ss_dssp HHHHHHHHHTTSEEEEEES-SSS-HHHHHHHHHHTTTT
T ss_pred HHHHHHHHHcCCCEEEEeCCCCCCHHHHHHHHHhcCcC
Confidence 3455555555555555555554442 334555554
No 43
>PF14814 UB2H: Bifunctional transglycosylase second domain; PDB: 3FWL_A 3VMA_A.
Probab=48.57 E-value=10 Score=23.62 Aligned_cols=48 Identities=19% Similarity=0.266 Sum_probs=26.6
Q ss_pred cccHHHHHHHHHhCCCeEEEEeCCC-CcCchhhhcCCceEE--EeecCCCCCCC
Q psy12446 25 ENHLHLAYHELKKHQVKNVVRVCEP-TYKVEDLKTEGINVK--DLAYEDGTSPS 75 (77)
Q Consensus 25 ~~tl~~yi~elk~~~Vt~VVRvCe~-tYd~~~~~~~GI~vh--dl~F~DG~~Pp 75 (77)
.=+.+..++||+.-|=..|-.+-+| +|.. ....|.++ .+.|+||.-|+
T Consensus 6 ~ls~~~l~~eL~~LgYR~v~~~~~pG~y~~---~g~~i~i~~R~F~F~Dg~e~~ 56 (85)
T PF14814_consen 6 PLSPAQLEQELELLGYRKVSNPDRPGEYSR---SGNRIEIYTRGFDFPDGQEPA 56 (85)
T ss_dssp S--HHHHHHHHHHTT-EE-SS--STTEEEE---ETTEEEEEE--EEETTCEE--
T ss_pred ccCHHHHHHHHHHcCCCcCCCCCCCeEEEE---ECCEEEEEECCCCCCCCCccC
Confidence 3467889999999998888543332 2332 22445555 78899998775
No 44
>PF10740 DUF2529: Protein of unknown function (DUF2529); InterPro: IPR019676 This entry represents a protein family conserved in the Bacillales. Their function is not known. ; PDB: 3JX9_A.
Probab=48.49 E-value=40 Score=24.46 Aligned_cols=59 Identities=14% Similarity=-0.009 Sum_probs=33.1
Q ss_pred eEEeecC--CcccHHHHHHHHHhCCCeEEEEeCCCCcCchhhhcCCceEEEeecCCCCCCCC
Q psy12446 17 TLYHGVN--LENHLHLAYHELKKHQVKNVVRVCEPTYKVEDLKTEGINVKDLAYEDGTSPSP 76 (77)
Q Consensus 17 ~fli~~p--t~~tl~~yi~elk~~~Vt~VVRvCe~tYd~~~~~~~GI~vhdl~F~DG~~Pp~ 76 (77)
|.||.+| ++.......+.|...|+.-|+.. ...=+...+++.=..+.|+-.+-|-.|.+
T Consensus 84 RVllfs~~~~~~e~~~~a~~L~~~gi~~v~Vs-~~~~~~~~l~~~~~~~Idl~~~~~LvP~E 144 (172)
T PF10740_consen 84 RVLLFSPFSTDEEAVALAKQLIEQGIPFVGVS-PNKPDEEDLEDLADVHIDLKLPKPLVPTE 144 (172)
T ss_dssp EEEEEES-S--HHHHHHHHHHHHHT--EEEEE--SS---TTGGG-SSS-EE----S-SEE-T
T ss_pred eEEEEeCCCCCHHHHHHHHHHHHCCCCEEEEE-ecCCCCCchhhhhhheeecccCCCcccCC
Confidence 7888887 55589999999999999877655 55555667777666667777777766654
No 45
>KOG2781|consensus
Probab=46.71 E-value=47 Score=25.94 Aligned_cols=50 Identities=18% Similarity=0.202 Sum_probs=34.9
Q ss_pred cceeEEe-ec---C-CcccHHHHHHHHHhCCCeEEEEeCCCCcCchhhhcCCceEEEeec
Q psy12446 14 KTVTLYH-GV---N-LENHLHLAYHELKKHQVKNVVRVCEPTYKVEDLKTEGINVKDLAY 68 (77)
Q Consensus 14 ~~~~fli-~~---p-t~~tl~~yi~elk~~~Vt~VVRvCe~tYd~~~~~~~GI~vhdl~F 68 (77)
|+|+|++ .+ | -+--+..-++-.+++|+|+|+-|-|..=.. .|.-|.++||
T Consensus 102 KelkLvfPNaqr~nRG~~~~~~lv~a~ra~~~Td~iivHEhRG~P-----dgL~vshlPf 156 (290)
T KOG2781|consen 102 KELKLVFPNAQRLNRGNYVVGELVDAARANGVTDLIIVHEHRGIP-----DGLVVSHLPF 156 (290)
T ss_pred HhheEeccChhhhcccceeHHHHHHHHHHCCCceEEEEeccCCCC-----CceEEEecCC
Confidence 6788888 22 3 455677888889999999999888743221 3455666666
No 46
>PRK13809 orotate phosphoribosyltransferase; Provisional
Probab=46.58 E-value=60 Score=23.40 Aligned_cols=53 Identities=9% Similarity=0.065 Sum_probs=38.5
Q ss_pred cceeEEe--ec-CCcccHHHHHHHHHhCCCeEEEEeC--CCC-cCchhhhcCCceEEEe
Q psy12446 14 KTVTLYH--GV-NLENHLHLAYHELKKHQVKNVVRVC--EPT-YKVEDLKTEGINVKDL 66 (77)
Q Consensus 14 ~~~~fli--~~-pt~~tl~~yi~elk~~~Vt~VVRvC--e~t-Yd~~~~~~~GI~vhdl 66 (77)
++-|.+| |. -|..|+..-++.+++.|..-+.-+| +.. -..+.+++.|+.++-+
T Consensus 117 ~g~~VlIVDDViTTG~Ti~~a~~~L~~~G~~vv~v~vlvdr~~~~~~~l~~~gi~v~sl 175 (206)
T PRK13809 117 PGQTCLVINDMVSSGKSIIETAVALEEEGLVVREALVFLDRQKGACQPLGPQGIKLSSV 175 (206)
T ss_pred CCCEEEEEEeccccCHHHHHHHHHHHHCCCEEEEEEEEEECcccHHHHHHhcCCCEEEE
Confidence 5667888 44 6999999999999999966544444 433 2346777889988754
No 47
>PHA02740 protein tyrosine phosphatase; Provisional
Probab=45.71 E-value=34 Score=25.89 Aligned_cols=31 Identities=13% Similarity=0.075 Sum_probs=27.5
Q ss_pred EEee-cCCcccHHHHHHHHHhCCCeEEEEeCC
Q psy12446 18 LYHG-VNLENHLHLAYHELKKHQVKNVVRVCE 48 (77)
Q Consensus 18 fli~-~pt~~tl~~yi~elk~~~Vt~VVRvCe 48 (77)
|..+ .|..+|+.+|..-.=++++..||-+|.
T Consensus 93 fIaTQ~Pl~~T~~dFW~MVwe~~~~iIVMLt~ 124 (298)
T PHA02740 93 FICIINLCEDACDKFLQALSDNKVQIIVLISR 124 (298)
T ss_pred EEEecCCchhhHHHHHHHHHhcCCCEEEEccc
Confidence 4445 699999999999999999999999886
No 48
>cd01018 ZntC Metal binding protein ZntC. These proteins are predicted to function as initial receptors in ABC transport of metal ions. They belong to the TroA superfamily of helical backbone metal receptor proteins that share a distinct fold and ligand binding mechanism. They are comprised of two globular subdomains connected by a long alpha helix and bind their specific ligands in the cleft between these domains. In addition, many of these proteins possess a metal-binding histidine-rich motif (repetitive HDH sequence).
Probab=45.56 E-value=39 Score=24.52 Aligned_cols=43 Identities=16% Similarity=0.184 Sum_probs=33.5
Q ss_pred cCCcccHHHHHHHHHhCCCeEEEEeCCCCcCc----hhhhcCCceEEEe
Q psy12446 22 VNLENHLHLAYHELKKHQVKNVVRVCEPTYKV----EDLKTEGINVKDL 66 (77)
Q Consensus 22 ~pt~~tl~~yi~elk~~~Vt~VVRvCe~tYd~----~~~~~~GI~vhdl 66 (77)
.|+.+.+...++.+|++||+.|+ +++..+. ..-++.|+.+..+
T Consensus 200 eps~~~l~~l~~~ik~~~v~~if--~e~~~~~~~~~~la~~~g~~v~~l 246 (266)
T cd01018 200 EPSPADLKRLIDLAKEKGVRVVF--VQPQFSTKSAEAIAREIGAKVVTI 246 (266)
T ss_pred CCCHHHHHHHHHHHHHcCCCEEE--EcCCCCcHHHHHHHHHcCCeEEEe
Confidence 36888999999999999999776 6666666 3346788888654
No 49
>PF11121 DUF2639: Protein of unknown function (DUF2639); InterPro: IPR022580 This family is conserved in the Bacillaceae family. Several members are named as being YflJ, but the function is not known.
Probab=43.43 E-value=16 Score=20.90 Aligned_cols=13 Identities=31% Similarity=0.355 Sum_probs=10.1
Q ss_pred HHHHHHHhCCCeE
Q psy12446 30 LAYHELKKHQVKN 42 (77)
Q Consensus 30 ~yi~elk~~~Vt~ 42 (77)
-|+++||+.||+.
T Consensus 7 w~V~eLKk~GI~~ 19 (40)
T PF11121_consen 7 WYVKELKKLGIRR 19 (40)
T ss_pred HHHHHHHHhCccc
Confidence 4788888888864
No 50
>TIGR01133 murG undecaprenyldiphospho-muramoylpentapeptide beta-N-acetylglucosaminyltransferase. RL J Bacteriol 1993 Mar;175(6):1841-3
Probab=43.28 E-value=97 Score=22.05 Aligned_cols=52 Identities=13% Similarity=0.077 Sum_probs=31.4
Q ss_pred eEEee-cCCcccHH---HHHHHHHhCCCeEEEEeCC-CCcCchhhhcCCceEEEeecC
Q psy12446 17 TLYHG-VNLENHLH---LAYHELKKHQVKNVVRVCE-PTYKVEDLKTEGINVKDLAYE 69 (77)
Q Consensus 17 ~fli~-~pt~~tl~---~yi~elk~~~Vt~VVRvCe-~tYd~~~~~~~GI~vhdl~F~ 69 (77)
+.+|. .=+...+. ..+++|++.|..-.| +|. ..+..+.+++.|+.++.+++.
T Consensus 2 ~i~~~~g~~~g~~~~~~~La~~L~~~g~eV~v-v~~~~~~~~~~~~~~g~~~~~i~~~ 58 (348)
T TIGR01133 2 KVVLAAGGTGGHIFPALAVAEELIKRGVEVLW-LGTKRGLEKRLVPKAGIEFYFIPVG 58 (348)
T ss_pred eEEEEeCccHHHHhHHHHHHHHHHhCCCEEEE-EeCCCcchhcccccCCCceEEEecc
Confidence 44553 33445553 678999998865433 443 223334446689999988764
No 51
>TIGR00109 hemH ferrochelatase. Human ferrochelatase, found at the mitochondrial inner membrane inner surface, was shown in an active recombinant form to be a homodimer. This contrasts to an earlier finding by gel filtration that overexpressed E. coli ferrochelatase runs as a monomer.
Probab=42.94 E-value=35 Score=25.93 Aligned_cols=45 Identities=20% Similarity=0.211 Sum_probs=35.4
Q ss_pred CcccHHHHHHHHHhCCCeEEEEeC--------CCCcCc-----hhhhcCCce-EEEeec
Q psy12446 24 LENHLHLAYHELKKHQVKNVVRVC--------EPTYKV-----EDLKTEGIN-VKDLAY 68 (77)
Q Consensus 24 t~~tl~~yi~elk~~~Vt~VVRvC--------e~tYd~-----~~~~~~GI~-vhdl~F 68 (77)
..-++++-++++.+.|++.|+-+| |..|+- +.+++.|++ .+-.|.
T Consensus 244 l~P~~~~~l~~l~~~G~k~V~vvP~gFv~D~lETl~ei~~e~~~~~~~~G~~~~~~vp~ 302 (322)
T TIGR00109 244 LGPYTEELLEKLGEQGVQHIVVVPIGFTADHLETLYEIDEEYREVAEDAGGDKYQRCPA 302 (322)
T ss_pred CCCCHHHHHHHHHHcCCceEEEECCcccccchhHHHhhhHHHHHHHHHcCCCeEEECCC
Confidence 567788889999999999999998 456765 578889998 544444
No 52
>PF01297 TroA: Periplasmic solute binding protein family; InterPro: IPR006127 This is a family of ABC transporter metal-binding lipoproteins. An example is the periplasmic zinc-binding protein TroA P96116 from SWISSPROT that interacts with an ATP-binding cassette transport system in Treponema pallidum and plays a role in the transport of zinc across the cytoplasmic membrane. Related proteins are found in both Gram-positive and Gram-negative bacteria. ; GO: 0046872 metal ion binding, 0030001 metal ion transport; PDB: 2PS9_A 2PS0_A 2OSV_A 2OGW_A 2PS3_A 2PRS_B 3MFQ_C 3GI1_B 2OV3_A 1PQ4_A ....
Probab=42.38 E-value=63 Score=22.95 Aligned_cols=45 Identities=22% Similarity=0.246 Sum_probs=30.8
Q ss_pred cCCcccHHHHHHHHHhCCCeEEEEeCCCCcCc----hhhhcCCceEEEeec
Q psy12446 22 VNLENHLHLAYHELKKHQVKNVVRVCEPTYKV----EDLKTEGINVKDLAY 68 (77)
Q Consensus 22 ~pt~~tl~~yi~elk~~~Vt~VVRvCe~tYd~----~~~~~~GI~vhdl~F 68 (77)
-|+.+.+..-++.+|++||..|+- ++..+. ..-++.|+.+..+.-
T Consensus 182 ~ps~~~l~~l~~~ik~~~v~~i~~--e~~~~~~~~~~la~~~g~~vv~ld~ 230 (256)
T PF01297_consen 182 EPSPKDLAELIKLIKENKVKCIFT--EPQFSSKLAEALAKETGVKVVYLDP 230 (256)
T ss_dssp SS-HHHHHHHHHHHHHTT-SEEEE--ETTS-THHHHHHHHCCT-EEEESST
T ss_pred CCCHHHHHHHHHHhhhcCCcEEEe--cCCCChHHHHHHHHHcCCcEEEeCC
Confidence 379999999999999999999985 444444 334678999865433
No 53
>cd01020 TroA_b Metal binding protein TroA_b. These proteins are predicted to function as initial receptors in ABC transport of metal ions. They belong to the TroA superfamily of helical backbone metal receptor proteins that share a distinct fold and ligand binding mechanism. A typical TroA protein is comprised of two globular subdomains connected by a single helix and can bind the metal ion in the cleft between these domains. In addition, these proteins sometimes have a low complexity region containing a metal-binding histidine-rich motif (repetitive HDH sequence).
Probab=42.21 E-value=62 Score=23.48 Aligned_cols=42 Identities=14% Similarity=0.281 Sum_probs=33.8
Q ss_pred CCcccHHHHHHHHHhCCCeEEEEeCCCCcCchh-------hhcCCceEEEe
Q psy12446 23 NLENHLHLAYHELKKHQVKNVVRVCEPTYKVED-------LKTEGINVKDL 66 (77)
Q Consensus 23 pt~~tl~~yi~elk~~~Vt~VVRvCe~tYd~~~-------~~~~GI~vhdl 66 (77)
|+.+.+..-++.+|+++|..|+ +++.++.+. -++.|+.+..+
T Consensus 190 ps~~~l~~l~~~ik~~~v~~if--~e~~~~~k~~~~l~~la~~~~~~v~~l 238 (264)
T cd01020 190 PSPADIAAFQNAIKNRQIDALI--VNPQQASSATTNITGLAKRSGVPVVEV 238 (264)
T ss_pred CCHHHHHHHHHHHHhCCCCEEE--eCCCCCcHHHHHHHHHHHHcCCCEEee
Confidence 6888889999999999999776 687777644 36789998765
No 54
>PF09827 CRISPR_Cas2: CRISPR associated protein Cas2; InterPro: IPR019199 Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) are a family of DNA direct repeats separated by regularly sized non-repetitive spacer sequences that are found in most bacterial and archaeal genomes []. CRISPRs appear to provide acquired resistance against bacteriophages, possibly acting with an RNA interference-like mechanism to inhibit gene functions of invasive DNA elements [, ]. Differences in the number and type of spacers between CRISPR repeats correlate with phage sensitivity. It is thought that following phage infection, bacteria integrate new spacers derived from phage genomic sequences, and that the removal or addition of particular spacers modifies the phage-resistance phenotype of the cell. Therefore, the specificity of CRISPRs may be determined by spacer-phage sequence similarity. In addition, there are many protein families known as CRISPR-associated sequences (Cas), which are encoded in the vicinity of CRISPR loci []. CRISPR/cas gene regions can be quite large, with up to 20 different, tandem-arranged cas genes next to a CRISPR cluster or filling the region between two repeat clusters. Cas genes and CRISPRs are found on mobile genetic elements such as plasmids, and have undergone extensive horizontal transfer. Cas proteins are thought to be involved in the propagation and functioning of CRISPRs. Some Cas proteins show similarity to helicases and repair proteins, although the functions of most are unknown. Cas families can be divided into subtypes according to operon organisation and phylogeny. Members of this family of bacterial proteins comprise various hypothetical proteins, as well as CRISPR (clustered regularly interspaced short palindromic repeats) associated proteins, conferring resistance to infection by certain bacteriophages. ; PDB: 3EXC_X 2I0X_A 3OQ2_B 3UI3_A 1ZPW_X 2I8E_A 2IVY_A.
Probab=41.70 E-value=33 Score=20.36 Aligned_cols=26 Identities=8% Similarity=-0.000 Sum_probs=19.7
Q ss_pred eEEe--ecCCcccHHHHHHHHHhCCCeEE
Q psy12446 17 TLYH--GVNLENHLHLAYHELKKHQVKNV 43 (77)
Q Consensus 17 ~fli--~~pt~~tl~~yi~elk~~~Vt~V 43 (77)
.++| |-..+.......+.|+++| +.+
T Consensus 3 ~~lv~YDi~~~k~~~kv~k~L~~~g-~~i 30 (78)
T PF09827_consen 3 LYLVAYDISDNKRRNKVRKILKSYG-TRI 30 (78)
T ss_dssp EEEEEEEEHSHHHHHHHHHHHHHTT-EEE
T ss_pred EEEEEEECCCcHHHHHHHHHHHHhC-ccc
Confidence 5666 4456777899999999999 443
No 55
>PF00571 CBS: CBS domain CBS domain web page. Mutations in the CBS domain of Swiss:P35520 lead to homocystinuria.; InterPro: IPR000644 CBS (cystathionine-beta-synthase) domains are small intracellular modules, mostly found in two or four copies within a protein, that occur in a variety of proteins in bacteria, archaea, and eukaryotes [, ]. Tandem pairs of CBS domains can act as binding domains for adenosine derivatives and may regulate the activity of attached enzymatic or other domains []. In some cases, CBS domains may act as sensors of cellular energy status by being activated by AMP and inhibited by ATP []. In chloride ion channels, the CBS domains have been implicated in intracellular targeting and trafficking, as well as in protein-protein interactions, but results vary with different channels: in the CLC-5 channel, the CBS domain was shown to be required for trafficking [], while in the CLC-1 channel, the CBS domain was shown to be critical for channel function, but not necessary for trafficking []. Recent experiments revealing that CBS domains can bind adenosine-containing ligands such ATP, AMP, or S-adenosylmethionine have led to the hypothesis that CBS domains function as sensors of intracellular metabolites [, ]. Crystallographic studies of CBS domains have shown that pairs of CBS sequences form a globular domain where each CBS unit adopts a beta-alpha-beta-beta-alpha pattern []. Crystal structure of the CBS domains of the AMP-activated protein kinase in complexes with AMP and ATP shows that the phosphate groups of AMP/ATP lie in a surface pocket at the interface of two CBS domains, which is lined with basic residues, many of which are associated with disease-causing mutations []. In humans, mutations in conserved residues within CBS domains cause a variety of human hereditary diseases, including (with the gene mutated in parentheses): homocystinuria (cystathionine beta-synthase); Wolff-Parkinson-White syndrome (gamma 2 subunit of AMP-activated protein kinase); retinitis pigmentosa (IMP dehydrogenase-1); congenital myotonia, idiopathic generalized epilepsy, hypercalciuric nephrolithiasis, and classic Bartter syndrome (CLC chloride channel family members).; GO: 0005515 protein binding; PDB: 3JTF_A 3TE5_C 3TDH_C 3T4N_C 2QLV_C 3OI8_A 3LV9_A 2QH1_B 1PVM_B 3LQN_A ....
Probab=41.55 E-value=34 Score=18.21 Aligned_cols=28 Identities=21% Similarity=0.262 Sum_probs=23.9
Q ss_pred cCCcccHHHHHHHHHhCCCeEEEEeCCC
Q psy12446 22 VNLENHLHLAYHELKKHQVKNVVRVCEP 49 (77)
Q Consensus 22 ~pt~~tl~~yi~elk~~~Vt~VVRvCe~ 49 (77)
.+.+.++.+-++.|+++|+..+..+.+.
T Consensus 12 v~~~~~l~~~~~~~~~~~~~~~~V~d~~ 39 (57)
T PF00571_consen 12 VSPDDSLEEALEIMRKNGISRLPVVDED 39 (57)
T ss_dssp EETTSBHHHHHHHHHHHTSSEEEEESTT
T ss_pred EcCcCcHHHHHHHHHHcCCcEEEEEecC
Confidence 4567899999999999999999887654
No 56
>PF14044 NETI: NETI protein
Probab=40.54 E-value=39 Score=20.58 Aligned_cols=31 Identities=19% Similarity=0.263 Sum_probs=28.2
Q ss_pred CCcccHHHHHHHHHhCCCeEEEEeCCCCcCc
Q psy12446 23 NLENHLHLAYHELKKHQVKNVVRVCEPTYKV 53 (77)
Q Consensus 23 pt~~tl~~yi~elk~~~Vt~VVRvCe~tYd~ 53 (77)
.-|.|++++++-|++-|-..|-|.=+|-|..
T Consensus 5 ~enETI~~CL~RM~~eGY~PvrR~EkPiF~e 35 (57)
T PF14044_consen 5 EENETISDCLARMKKEGYMPVRRIEKPIFKE 35 (57)
T ss_pred cCCCcHHHHHHHHHHcCCCceeeccccceEE
Confidence 4578999999999999999999999998865
No 57
>PF07905 PucR: Purine catabolism regulatory protein-like family; InterPro: IPR012914 This domain is found in the purine catabolism regulatory protein expressed by Bacillus subtilis (PucR, O32138 from SWISSPROT). PucR is thought to be a transcriptional regulator of genes involved in the purine degradation pathway, and may contain a LysR-like DNA-binding domain. It is similar to LysR-type regulators in that it represses its own expression []. The other members of this family are also putative regulatory proteins.
Probab=40.50 E-value=79 Score=20.42 Aligned_cols=43 Identities=12% Similarity=0.094 Sum_probs=34.2
Q ss_pred ccHHHHHHHHHhCCCeEEEEeCCCCcCc------hhhhcCCceEEEeec
Q psy12446 26 NHLHLAYHELKKHQVKNVVRVCEPTYKV------EDLKTEGINVKDLAY 68 (77)
Q Consensus 26 ~tl~~yi~elk~~~Vt~VVRvCe~tYd~------~~~~~~GI~vhdl~F 68 (77)
..+.+||++|.+.|+..++--..+.++. +.-.+.|+-+..+|+
T Consensus 59 ~~~~~~i~~L~~~~~agL~i~~~~~~~~iP~~~i~~A~~~~lPli~ip~ 107 (123)
T PF07905_consen 59 EELREFIRELAEKGAAGLGIKTGRYLDEIPEEIIELADELGLPLIEIPW 107 (123)
T ss_pred HHHHHHHHHHHHCCCeEEEEeccCccccCCHHHHHHHHHcCCCEEEeCC
Confidence 4599999999999999999766544444 667788888887776
No 58
>PF07166 DUF1398: Protein of unknown function (DUF1398); InterPro: IPR009833 This entry is represented by Bacteriophage DE3, Orf: ECD_10033. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of several hypothetical Enterobacterial proteins of around 130 residues in length. Members of this family seem to be found exclusively in Escherichia coli and Salmonella species. The function of this family is unknown.; PDB: 2HH8_A.
Probab=40.43 E-value=11 Score=25.72 Aligned_cols=21 Identities=19% Similarity=0.202 Sum_probs=14.4
Q ss_pred CcccHHHHHHHHHhCCCeEEE
Q psy12446 24 LENHLHLAYHELKKHQVKNVV 44 (77)
Q Consensus 24 t~~tl~~yi~elk~~~Vt~VV 44 (77)
++++.+.|++|||++||+.=.
T Consensus 14 s~aDFp~~~~elk~~gV~~Y~ 34 (125)
T PF07166_consen 14 SGADFPQFIQELKRMGVSHYI 34 (125)
T ss_dssp HH---HHHHHHHHHH-SSEEE
T ss_pred ccCChHHHHHHHHhcCceEEE
Confidence 567889999999999998644
No 59
>PF00156 Pribosyltran: Phosphoribosyl transferase domain; InterPro: IPR000836 The name PRT comes from phosphoribosyltransferase (PRTase) enzymes, which carry out phosphoryl transfer reactions on 5-phosphoribosyl-alpha1-pyrophosphate PRPP, an activated form of ribose-5-phosphate. Members of Phosphoribosyltransferase (PRT) are catalytic and are regulatory proteins involved in nucleotide synthesis and salvage []. This includes a range of diverse phosphoribosyl transferase enzymes including adenine phosphoribosyltransferase (2.4.2.7 from EC); hypoxanthine-guanine-xanthine phosphoribosyltransferase; hypoxanthine phosphoribosyltransferase (2.4.2.8 from EC); ribose-phosphate pyrophosphokinase (2.7.6.1 from EC); amidophosphoribosyltransferase (2.4.2.14 from EC); orotate phosphoribosyltransferase (2.4.2.10 from EC);uracil phosphoribosyltransferase (2.4.2.9 from EC); and xanthine-guanine phosphoribosyltransferase (2.4.2.22 from EC). Not all PRT proteins are enzymes. For example, in some bacteria PRT proteins regulate the expression of purine and pyrimidine synthetic genes. Members of PRT are defined by the protein fold and by a short 13-residue sequence motif, The motif consists of four hydrophobic amino acids, two acidic amino acids and seven amino acids of variable character, usually including glycine and threonine. The motif has been predicted to be a PRPP-binding site in advance of structural information [, ]. Apart of this motif, different PRT proteins have a low level of sequence identity, less than 15%. The PRT sequence motif is only found in PRTases from the nucleotide synthesis and salvage pathways. Other PRTases, from the tryptophan, histidine and nicotinamide synthetic and salvage pathways, lack the PRT sequence motif and appear to be unrelated to each other and unrelated to the PRT family.; GO: 0009116 nucleoside metabolic process; PDB: 2JBH_A 1Y0B_D 2FXV_B 1GPH_1 1AO0_D 1ORO_B 1VCH_C 2WNS_A 2PRZ_B 2PS1_A ....
Probab=40.05 E-value=53 Score=20.20 Aligned_cols=32 Identities=22% Similarity=0.121 Sum_probs=25.6
Q ss_pred ccceeEEe--e-cCCcccHHHHHHHHHhCCCeEEE
Q psy12446 13 YKTVTLYH--G-VNLENHLHLAYHELKKHQVKNVV 44 (77)
Q Consensus 13 y~~~~fli--~-~pt~~tl~~yi~elk~~~Vt~VV 44 (77)
.++-+.|| | .-|..|+...++.|++.|+..|.
T Consensus 86 ~~gk~vliVDDvi~tG~Tl~~~~~~L~~~g~~~v~ 120 (125)
T PF00156_consen 86 IKGKRVLIVDDVIDTGGTLKEAIELLKEAGAKVVG 120 (125)
T ss_dssp GTTSEEEEEEEEESSSHHHHHHHHHHHHTTBSEEE
T ss_pred ccceeEEEEeeeEcccHHHHHHHHHHHhCCCcEEE
Confidence 45566666 4 47999999999999999988554
No 60
>PF10653 Phage-A118_gp45: Protein gp45 of Bacteriophage A118; InterPro: IPR018915 The proteins in this entry represents Gp45 in Listeria phage A118 (Bacteriophage A118) and related proteins; Gp45 is thought to have a function in the phage tail-fibre system.
Probab=39.45 E-value=15 Score=22.54 Aligned_cols=20 Identities=25% Similarity=0.250 Sum_probs=15.3
Q ss_pred ccHHHHHHHHHhCCCeEEEE
Q psy12446 26 NHLHLAYHELKKHQVKNVVR 45 (77)
Q Consensus 26 ~tl~~yi~elk~~~Vt~VVR 45 (77)
..+.+||+|||-.|-..|-|
T Consensus 43 ekm~dyieelklkgyeevtr 62 (62)
T PF10653_consen 43 EKMTDYIEELKLKGYEEVTR 62 (62)
T ss_pred HHHHHHHHHHhhcchhhhcC
Confidence 35789999999888765543
No 61
>KOG0117|consensus
Probab=39.37 E-value=1.1e+02 Score=25.64 Aligned_cols=53 Identities=19% Similarity=0.296 Sum_probs=40.2
Q ss_pred eeeccceeEEee-cCCcccHHHHHHHHHhC--CCeEEEEeCCCCcCchhhhcCCceEEE
Q psy12446 10 TLDYKTVTLYHG-VNLENHLHLAYHELKKH--QVKNVVRVCEPTYKVEDLKTEGINVKD 65 (77)
Q Consensus 10 ~~~y~~~~fli~-~pt~~tl~~yi~elk~~--~Vt~VVRvCe~tYd~~~~~~~GI~vhd 65 (77)
-+|=-|-|++|. -|-+.+=+..++|+++. ||.+|+.-=.|+ ..-++.|+...+
T Consensus 159 c~Svan~RLFiG~IPK~k~keeIlee~~kVteGVvdVivy~~p~---dk~KNRGFaFve 214 (506)
T KOG0117|consen 159 CVSVANCRLFIGNIPKTKKKEEILEEMKKVTEGVVDVIVYPSPD---DKTKNRGFAFVE 214 (506)
T ss_pred EEeeecceeEeccCCccccHHHHHHHHHhhCCCeeEEEEecCcc---ccccccceEEEE
Confidence 355568899996 59999999999999986 799998554443 355667776654
No 62
>TIGR00119 acolac_sm acetolactate synthase, small subunit. acetohydroxyacid synthase is a synonym.
Probab=38.86 E-value=62 Score=22.63 Aligned_cols=36 Identities=14% Similarity=0.156 Sum_probs=26.4
Q ss_pred eeccceeEEee-cCCcccHHHHHHHHHhCCCeEEEEe
Q psy12446 11 LDYKTVTLYHG-VNLENHLHLAYHELKKHQVKNVVRV 46 (77)
Q Consensus 11 ~~y~~~~fli~-~pt~~tl~~yi~elk~~~Vt~VVRv 46 (77)
||...=.+.|- .-+..-++.|++.|+.+|+..++|.
T Consensus 112 vdv~~~~~~ie~tG~~~ki~~~~~~l~~~gi~e~~Rt 148 (157)
T TIGR00119 112 VDVSPDSYTVEVTGDSDKIDAFLELLRPFGIKEVART 148 (157)
T ss_pred EEecCCEEEEEEcCCHHHHHHHHHHhhhcCCEEEEcc
Confidence 44433333343 3477889999999999999999985
No 63
>PLN02404 6,7-dimethyl-8-ribityllumazine synthase
Probab=38.65 E-value=39 Score=23.45 Aligned_cols=36 Identities=33% Similarity=0.287 Sum_probs=28.5
Q ss_pred eeccceeEEee-c--C---CcccHHHHHHHHHhCCCe----EEEEe
Q psy12446 11 LDYKTVTLYHG-V--N---LENHLHLAYHELKKHQVK----NVVRV 46 (77)
Q Consensus 11 ~~y~~~~fli~-~--p---t~~tl~~yi~elk~~~Vt----~VVRv 46 (77)
++-.++||.|= + | ++.-++..++.|+++||. ++++|
T Consensus 3 ~~~~~~ri~IV~s~fn~~I~~~Ll~ga~~~l~~~gv~~~~i~v~~V 48 (141)
T PLN02404 3 LDGEGLRFGVVVARFNEIITKNLLEGALETFKRYSVKEENIDVVWV 48 (141)
T ss_pred cCCCCCEEEEEEecCcHHHHHHHHHHHHHHHHHcCCCccceEEEEc
Confidence 45567899992 2 3 788899999999999996 67776
No 64
>PRK07114 keto-hydroxyglutarate-aldolase/keto-deoxy-phosphogluconate aldolase; Provisional
Probab=37.42 E-value=42 Score=24.73 Aligned_cols=43 Identities=9% Similarity=0.021 Sum_probs=34.2
Q ss_pred HHHHHHHHHhCCCeEEEEeCCCCcCc---hhhhcCCceEEEeecCC
Q psy12446 28 LHLAYHELKKHQVKNVVRVCEPTYKV---EDLKTEGINVKDLAYED 70 (77)
Q Consensus 28 l~~yi~elk~~~Vt~VVRvCe~tYd~---~~~~~~GI~vhdl~F~D 70 (77)
....++.|++++|-.|+|...+..-. +.+.+.||.+.|..|..
T Consensus 5 ~~~~~~~l~~~~vi~Vvr~~~~~~a~~~~~al~~gGi~~iEiT~~t 50 (222)
T PRK07114 5 RIAVLTAMKATGMVPVFYHADVEVAKKVIKACYDGGARVFEFTNRG 50 (222)
T ss_pred HHHHHHHHHhCCEEEEEEcCCHHHHHHHHHHHHHCCCCEEEEeCCC
Confidence 34677899999999999988643222 78899999999999854
No 65
>cd03808 GT1_cap1E_like This family is most closely related to the GT1 family of glycosyltransferases. cap1E in Streptococcus pneumoniae is required for the synthesis of type 1 capsular polysaccharides.
Probab=37.01 E-value=1.2e+02 Score=20.28 Aligned_cols=45 Identities=11% Similarity=0.022 Sum_probs=32.2
Q ss_pred cccHHHHHHHHHhCCCeEEEEeCCCCcCchhhhcCCceEEEeecCC
Q psy12446 25 ENHLHLAYHELKKHQVKNVVRVCEPTYKVEDLKTEGINVKDLAYED 70 (77)
Q Consensus 25 ~~tl~~yi~elk~~~Vt~VVRvCe~tYd~~~~~~~GI~vhdl~F~D 70 (77)
..-+...+++|++.|.+-.+-........ .+...|+.++.+++..
T Consensus 13 ~~~~~~l~~~L~~~g~~v~~~~~~~~~~~-~~~~~~~~~~~~~~~~ 57 (359)
T cd03808 13 YSFRLPLIKALRAAGYEVHVVAPPGDELE-ELEALGVKVIPIPLDR 57 (359)
T ss_pred HHHHHHHHHHHHhcCCeeEEEecCCCccc-ccccCCceEEeccccc
Confidence 45566788899888877666554444433 6788999999888775
No 66
>COG1040 ComFC Predicted amidophosphoribosyltransferases [General function prediction only]
Probab=36.45 E-value=53 Score=23.85 Aligned_cols=30 Identities=20% Similarity=0.125 Sum_probs=25.1
Q ss_pred EEe-e-cCCcccHHHHHHHHHhCCCeEEEEeC
Q psy12446 18 LYH-G-VNLENHLHLAYHELKKHQVKNVVRVC 47 (77)
Q Consensus 18 fli-~-~pt~~tl~~yi~elk~~~Vt~VVRvC 47 (77)
+|| | .=|.+|+..--+.|++.|++.|--++
T Consensus 188 lLvDDV~TTGaTl~~~~~~L~~~Ga~~v~~~~ 219 (225)
T COG1040 188 LLVDDVYTTGATLKEAAKLLREAGAKRVFVLT 219 (225)
T ss_pred EEEecccccHHHHHHHHHHHHHcCCceEEEEE
Confidence 455 4 46999999999999999999987654
No 67
>PRK05476 S-adenosyl-L-homocysteine hydrolase; Provisional
Probab=36.33 E-value=1.4e+02 Score=24.11 Aligned_cols=54 Identities=17% Similarity=0.175 Sum_probs=40.3
Q ss_pred eccceeEEeecCCcccHHHHHHHHHhCCCeEEEEeCCCCcCc-----hhhhcCCceEEEe
Q psy12446 12 DYKTVTLYHGVNLENHLHLAYHELKKHQVKNVVRVCEPTYKV-----EDLKTEGINVKDL 66 (77)
Q Consensus 12 ~y~~~~fli~~pt~~tl~~yi~elk~~~Vt~VVRvCe~tYd~-----~~~~~~GI~vhdl 66 (77)
-.+++|..+-.+-...-...++-|+..|..-.+-.|.|. ++ ..+.+.||.|+.|
T Consensus 45 pl~G~~i~~~~Hl~~~Ta~l~~~L~~~GA~v~~~~~np~-Stqd~vaaaL~~~gi~v~a~ 103 (425)
T PRK05476 45 PLKGARIAGCLHMTIQTAVLIETLKALGAEVRWASCNPF-STQDDVAAALAAAGIPVFAW 103 (425)
T ss_pred CCCCCEEEEEEeccccHHHHHHHHHHcCCEEEEEeCCCc-ccCHHHHHHHHHCCceEEec
Confidence 456777777555444455778999999999777777774 44 6788889999988
No 68
>PRK14465 ribosomal RNA large subunit methyltransferase N; Provisional
Probab=36.22 E-value=37 Score=26.54 Aligned_cols=25 Identities=12% Similarity=-0.033 Sum_probs=22.8
Q ss_pred cCCcccHHHHHHHHHhCCCeEEEEe
Q psy12446 22 VNLENHLHLAYHELKKHQVKNVVRV 46 (77)
Q Consensus 22 ~pt~~tl~~yi~elk~~~Vt~VVRv 46 (77)
.|+...++.|.+.|+++||...||-
T Consensus 301 ~ps~e~i~~F~~~L~~~Gi~v~~R~ 325 (342)
T PRK14465 301 RPTDDEVAEFIMLLEPAGVPILNRR 325 (342)
T ss_pred CCCHHHHHHHHHHHHHCCCeEEEeC
Confidence 3799999999999999999999983
No 69
>PRK00002 aroB 3-dehydroquinate synthase; Reviewed
Probab=36.19 E-value=92 Score=23.68 Aligned_cols=47 Identities=23% Similarity=0.198 Sum_probs=30.7
Q ss_pred HHHHHHHHHhCCCeEEEEeCCCCcCc-------hhhhcCCceEEEeecCCCCCC
Q psy12446 28 LHLAYHELKKHQVKNVVRVCEPTYKV-------EDLKTEGINVKDLAYEDGTSP 74 (77)
Q Consensus 28 l~~yi~elk~~~Vt~VVRvCe~tYd~-------~~~~~~GI~vhdl~F~DG~~P 74 (77)
++.--+++++++.+.++-++.+.... +.++++||.+....|++|.+-
T Consensus 19 ~~~l~~~l~~~~~~~~livtd~~~~~~~~~~v~~~L~~~gi~~~~~~~~~~e~~ 72 (358)
T PRK00002 19 LSELGELLAPLKGKKVAIVTDETVAPLYLEKLRASLEAAGFEVDVVVLPDGEQY 72 (358)
T ss_pred HHHHHHHHHhcCCCeEEEEECCchHHHHHHHHHHHHHhcCCceEEEEeCCCCCC
Confidence 33334455566667777777654322 457788998876678998773
No 70
>PRK07455 keto-hydroxyglutarate-aldolase/keto-deoxy-phosphogluconate aldolase; Provisional
Probab=36.10 E-value=44 Score=23.39 Aligned_cols=43 Identities=14% Similarity=0.308 Sum_probs=33.7
Q ss_pred HHHHHHHHhCCCeEEEEeCCCCcC---chhhhcCCceEEEeecCCC
Q psy12446 29 HLAYHELKKHQVKNVVRVCEPTYK---VEDLKTEGINVKDLAYEDG 71 (77)
Q Consensus 29 ~~yi~elk~~~Vt~VVRvCe~tYd---~~~~~~~GI~vhdl~F~DG 71 (77)
.+.++.|++++|-.|+|...+..- .+.+-+.|+.+.++.+.+-
T Consensus 3 ~~~~~~l~~~~~~~v~r~~~~~~~~~~~~~~~~~Gv~~vqlr~k~~ 48 (187)
T PRK07455 3 QDWLAQLQQHRAIAVIRAPDLELGLQMAEAVAAGGMRLIEITWNSD 48 (187)
T ss_pred hHHHHHHHhCCEEEEEEcCCHHHHHHHHHHHHHCCCCEEEEeCCCC
Confidence 457889999999999999864322 2677788999999888764
No 71
>PF09673 TrbC_Ftype: Type-F conjugative transfer system pilin assembly protein; InterPro: IPR019106 This entry represents TrbC, a protein that is an essential component of the F-type conjugative pilus assembly system (aka type 4 secretion system) for the transfer of plasmid DNA [, ]. The N-terminal portion of these proteins is heterogeneous.
Probab=35.97 E-value=39 Score=22.03 Aligned_cols=24 Identities=17% Similarity=0.101 Sum_probs=21.3
Q ss_pred CcccHHHHHHHHHhCCCeEEEEeC
Q psy12446 24 LENHLHLAYHELKKHQVKNVVRVC 47 (77)
Q Consensus 24 t~~tl~~yi~elk~~~Vt~VVRvC 47 (77)
.+++|.+++++..+.|+.-|+|==
T Consensus 9 P~~~L~~l~~~a~~~~~~~V~RG~ 32 (113)
T PF09673_consen 9 PDASLRNLLKQAERAGVVVVFRGF 32 (113)
T ss_pred CHHHHHHHHHHHHhCCcEEEEECC
Confidence 578899999999999999999843
No 72
>PRK13811 orotate phosphoribosyltransferase; Provisional
Probab=35.97 E-value=1.2e+02 Score=20.90 Aligned_cols=54 Identities=15% Similarity=0.034 Sum_probs=40.3
Q ss_pred ccceeEEe--e-cCCcccHHHHHHHHHhCCCeEEEEeC--CCCcC-chhhhcCCceEEEe
Q psy12446 13 YKTVTLYH--G-VNLENHLHLAYHELKKHQVKNVVRVC--EPTYK-VEDLKTEGINVKDL 66 (77)
Q Consensus 13 y~~~~fli--~-~pt~~tl~~yi~elk~~~Vt~VVRvC--e~tYd-~~~~~~~GI~vhdl 66 (77)
.++-|.+| | .-|..|+..-++.+++.|.+-+--+| +.... .+.+++.||.++-+
T Consensus 102 ~~g~~VlIVDDvi~TG~T~~~~~~~l~~~Ga~v~~~~~~vdr~~g~~~~l~~~gv~~~sl 161 (170)
T PRK13811 102 VKGKRVLLVEDVTTSGGSALYGIEQLRAAGAVVDDVVTVVDREQGAEELLAELGITLTPL 161 (170)
T ss_pred cCCCEEEEEEecccccHHHHHHHHHHHHCCCeEEEEEEEEECCccHHHHHHhcCCcEEEE
Confidence 57788888 4 46999999999999999876554444 44333 46677899998854
No 73
>PRK10907 intramembrane serine protease GlpG; Provisional
Probab=35.82 E-value=35 Score=25.85 Aligned_cols=30 Identities=3% Similarity=-0.030 Sum_probs=25.7
Q ss_pred eeEEeecCCcccHHHHHHHHHhCCCeEEEE
Q psy12446 16 VTLYHGVNLENHLHLAYHELKKHQVKNVVR 45 (77)
Q Consensus 16 ~~fli~~pt~~tl~~yi~elk~~~Vt~VVR 45 (77)
|+-++.-|.++..+.|+++|+.+||..-++
T Consensus 1 M~~~~~~~~~~~a~~f~dyl~~~~i~~~~~ 30 (276)
T PRK10907 1 MLMITSFSNPRLAQAFVDYMATQGVILTIQ 30 (276)
T ss_pred CcchhcCCCHHHHHHHHHHHHHCCCcEEEe
Confidence 444667799999999999999999999775
No 74
>PRK13812 orotate phosphoribosyltransferase; Provisional
Probab=35.50 E-value=1.3e+02 Score=20.97 Aligned_cols=53 Identities=9% Similarity=0.044 Sum_probs=39.0
Q ss_pred cceeEEe--e-cCCcccHHHHHHHHHhCCCeEEEEe--CCCCcC-chhhhcCCceEEEe
Q psy12446 14 KTVTLYH--G-VNLENHLHLAYHELKKHQVKNVVRV--CEPTYK-VEDLKTEGINVKDL 66 (77)
Q Consensus 14 ~~~~fli--~-~pt~~tl~~yi~elk~~~Vt~VVRv--Ce~tYd-~~~~~~~GI~vhdl 66 (77)
++-+.+| | .-|..|+..-++.+++.|..-+--+ .+..-. .+.+++.|+.++-+
T Consensus 106 ~g~~VlIVDDvitTG~Tl~~~~~~l~~~Ga~vv~~~vlvdr~~~~~~~l~~~g~~v~sL 164 (176)
T PRK13812 106 EGEEVVVLEDIATTGQSAVDAVEALREAGATVNRVLVVVDREEGARENLADHDVELEAL 164 (176)
T ss_pred CcCEEEEEEEeeCCCHHHHHHHHHHHHCCCeEEEEEEEEECCcchHHHHHhcCCcEEEE
Confidence 6778888 4 4799999999999999997644433 344333 36778899988754
No 75
>cd03784 GT1_Gtf_like This family includes the Gtfs, a group of homologous glycosyltransferases involved in the final stages of the biosynthesis of antibiotics vancomycin and related chloroeremomycin. Gtfs transfer sugar moieties from an activated NDP-sugar donor to the oxidatively cross-linked heptapeptide core of vancomycin group antibiotics. The core structure is important for the bioactivity of the antibiotics.
Probab=35.42 E-value=74 Score=23.47 Aligned_cols=49 Identities=8% Similarity=0.110 Sum_probs=33.4
Q ss_pred eeEEe-ecCCcccHHHHHH---HHHhCCCeEEEEeCCCCcCchhhhcCCceEEEe
Q psy12446 16 VTLYH-GVNLENHLHLAYH---ELKKHQVKNVVRVCEPTYKVEDLKTEGINVKDL 66 (77)
Q Consensus 16 ~~fli-~~pt~~tl~~yi~---elk~~~Vt~VVRvCe~tYd~~~~~~~GI~vhdl 66 (77)
||+|+ ..|+...+.-++. +|++.|- .|.=+|.+ .-+..++..|+++..+
T Consensus 1 mrIl~~~~p~~GHv~P~l~la~~L~~rGh-~V~~~t~~-~~~~~v~~~G~~~~~~ 53 (401)
T cd03784 1 MRVLITTIGSRGDVQPLVALAWALRAAGH-EVRVATPP-EFADLVEAAGLEFVPV 53 (401)
T ss_pred CeEEEEeCCCcchHHHHHHHHHHHHHCCC-eEEEeeCH-hHHHHHHHcCCceeeC
Confidence 78888 4677777777664 7888993 33333333 4467888999987644
No 76
>PF15632 ATPgrasp_Ter: ATP-grasp in the biosynthetic pathway with Ter operon
Probab=35.36 E-value=58 Score=25.32 Aligned_cols=30 Identities=17% Similarity=0.334 Sum_probs=23.1
Q ss_pred HHHHHHhCCCeEEEEeCCCCcCc-------hhhhcCCceEE
Q psy12446 31 AYHELKKHQVKNVVRVCEPTYKV-------EDLKTEGINVK 64 (77)
Q Consensus 31 yi~elk~~~Vt~VVRvCe~tYd~-------~~~~~~GI~vh 64 (77)
.++-.++|+|..++ |.++. +.|++.|+++.
T Consensus 58 ~l~~C~~~~Idv~~----P~~~~~~l~~~r~~F~a~Gv~l~ 94 (329)
T PF15632_consen 58 CLDFCKEHGIDVFV----PGRNRELLAAHRDEFEALGVKLL 94 (329)
T ss_pred HHHHHHHhCCeEEE----cCccHHHHHHHHHHHHHhCCEEE
Confidence 34467889999998 77766 67888888875
No 77
>cd02071 MM_CoA_mut_B12_BD methylmalonyl CoA mutase B12 binding domain. This domain binds to B12 (adenosylcobamide), which initiates the conversion of succinyl CoA and methylmalonyl CoA by forming an adenosyl radical, which then undergoes a rearrangement exchanging a hydrogen atom with a group attached to a neighboring carbon atom. This family is present in both mammals and bacteria. Bacterial members are heterodimers and involved in the fermentation of pyruvate to propionate. Mammalian members are homodimers and responsible for the conversion of odd-chain fatty acids and branched-chain amino acids via propionyl CoA to succinyl CoA for further degradation.
Probab=34.92 E-value=29 Score=22.34 Aligned_cols=48 Identities=19% Similarity=0.128 Sum_probs=32.3
Q ss_pred CcccHHHHHHHHHhCCCeEEE-EeC--CCCcCchhhhcCCceEEEeecCCCCCC
Q psy12446 24 LENHLHLAYHELKKHQVKNVV-RVC--EPTYKVEDLKTEGINVKDLAYEDGTSP 74 (77)
Q Consensus 24 t~~tl~~yi~elk~~~Vt~VV-RvC--e~tYd~~~~~~~GI~vhdl~F~DG~~P 74 (77)
+...+...++.+++.|...+. =+. .+.-+.+++++.|| |-.|.=|+++
T Consensus 63 ~~~~~~~~~~~L~~~~~~~i~i~~GG~~~~~~~~~~~~~G~---d~~~~~~~~~ 113 (122)
T cd02071 63 HMTLFPEVIELLRELGAGDILVVGGGIIPPEDYELLKEMGV---AEIFGPGTSI 113 (122)
T ss_pred hHHHHHHHHHHHHhcCCCCCEEEEECCCCHHHHHHHHHCCC---CEEECCCCCH
Confidence 455678888999998664333 344 25555678889997 5667666654
No 78
>cd06578 HemD Uroporphyrinogen-III synthase (HemD) catalyzes the asymmetrical cyclization of tetrapyrrole (linear) to uroporphyrinogen-III, the fourth step in the biosynthesis of heme. This ubiquitous enzyme is present in eukaryotes, bacteria and archaea. Mutations in the human uroporphyrinogen-III synthase gene cause congenital erythropoietic porphyria, a recessive inborn error of metabolism also known as Gunther disease.
Probab=34.70 E-value=1.2e+02 Score=20.21 Aligned_cols=45 Identities=18% Similarity=0.195 Sum_probs=29.3
Q ss_pred CCcccHHHHHHHHHhC--CCeEEEEeCCCCcC---chhhhcCCceEEEee
Q psy12446 23 NLENHLHLAYHELKKH--QVKNVVRVCEPTYK---VEDLKTEGINVKDLA 67 (77)
Q Consensus 23 pt~~tl~~yi~elk~~--~Vt~VVRvCe~tYd---~~~~~~~GI~vhdl~ 67 (77)
|...+.+..++.++++ .-..|+.+|...-+ .+.++++|+.|.+++
T Consensus 103 ~~~~~~~~L~~~i~~~~~~~~~il~~~g~~~~~~l~~~L~~~g~~v~~~~ 152 (239)
T cd06578 103 PEEGDSEGLLELLELQDGKGKRILRPRGGRAREDLAEALRERGAEVDEVE 152 (239)
T ss_pred CCccCHHHHHHHHHhcCCCCCEEEEEcCcchhHHHHHHHHHCCCEEEEEE
Confidence 3455666677777775 66777777754422 256677788887765
No 79
>PF12804 NTP_transf_3: MobA-like NTP transferase domain; PDB: 3FWW_A 2XME_D 2XMH_C 2DPW_A 2WAW_A 2OI5_B 1HV9_B 1FWY_A 2OI6_A 2OI7_B ....
Probab=34.57 E-value=38 Score=21.71 Aligned_cols=45 Identities=20% Similarity=0.134 Sum_probs=31.5
Q ss_pred CCcccHHHHHHHHHhCCCeEEEEeCCCCcCchhhhcCCceEEEee
Q psy12446 23 NLENHLHLAYHELKKHQVKNVVRVCEPTYKVEDLKTEGINVKDLA 67 (77)
Q Consensus 23 pt~~tl~~yi~elk~~~Vt~VVRvCe~tYd~~~~~~~GI~vhdl~ 67 (77)
.-.+-++.-++.+++.|+.+|+.++...--.+.+.+.|+.+..-+
T Consensus 22 ~g~~li~~~l~~l~~~~~~~Ivvv~~~~~~~~~~~~~~~~~v~~~ 66 (160)
T PF12804_consen 22 GGKPLIERVLEALREAGVDDIVVVTGEEEIYEYLERYGIKVVVDP 66 (160)
T ss_dssp TTEEHHHHHHHHHHHHTESEEEEEESTHHHHHHHTTTTSEEEE-S
T ss_pred CCccHHHHHHHHhhccCCceEEEecChHHHHHHHhccCceEEEec
Confidence 344567888899999999999999887322234567788775443
No 80
>COG1149 MinD superfamily P-loop ATPase containing an inserted ferredoxin domain [Energy production and conversion]
Probab=34.36 E-value=1.2e+02 Score=23.71 Aligned_cols=55 Identities=13% Similarity=0.155 Sum_probs=39.4
Q ss_pred eeEEeecCCcccHHHH---HHHHHhCCCeEEEEeCCCCcCc----hhhhcCCceEE-EeecCC
Q psy12446 16 VTLYHGVNLENHLHLA---YHELKKHQVKNVVRVCEPTYKV----EDLKTEGINVK-DLAYED 70 (77)
Q Consensus 16 ~~fli~~pt~~tl~~y---i~elk~~~Vt~VVRvCe~tYd~----~~~~~~GI~vh-dl~F~D 70 (77)
+-++++-||.+.+++. ++..+..|+...+.+|...-.. +.-+++||.+. ..||++
T Consensus 188 ~ai~VTEPTp~glhD~kr~~el~~~f~ip~~iViNr~~~g~s~ie~~~~e~gi~il~~IPyd~ 250 (284)
T COG1149 188 LAILVTEPTPFGLHDLKRALELVEHFGIPTGIVINRYNLGDSEIEEYCEEEGIPILGEIPYDK 250 (284)
T ss_pred EEEEEecCCccchhHHHHHHHHHHHhCCceEEEEecCCCCchHHHHHHHHcCCCeeEECCcch
Confidence 4466788999888776 5678899999999999762111 23456788765 677765
No 81
>cd08197 DOIS 2-deoxy-scyllo-inosose synthase (DOIS) catalyzes carbocycle formation from D-glucose-6-phosphate to 2-deoxy-scyllo-inosose through a multistep reaction in the biosynthesis of aminoglycoside antibiotics. 2-deoxy-scyllo-inosose synthase (DOIS) catalyzes carbocycle formation from D-glucose-6-phosphate to 2-deoxy-scyllo-inosose through a multistep reaction in the biosynthesis of aminoglycoside antibiotics. 2-Deoxystreptamine (DOS)-containing aminoglycoside antibiotics includes neomycin, kanamycin, gentamicin, and ribostamycin. They are important antibacterial agents. DOIS is a homologue of the dehydroquinate synthase which catalyzes the cyclization of 3-deoxy-D-arabino-heputulosonate-7-phosphate to dehydroquinate (DHQ) in the shikimate pathway.
Probab=34.33 E-value=97 Score=23.92 Aligned_cols=48 Identities=19% Similarity=0.212 Sum_probs=30.7
Q ss_pred HHHHHHHHHhCCCeEEEEeCCCC-cC------chhhhcCCceEEEeecCCCCCCC
Q psy12446 28 LHLAYHELKKHQVKNVVRVCEPT-YK------VEDLKTEGINVKDLAYEDGTSPS 75 (77)
Q Consensus 28 l~~yi~elk~~~Vt~VVRvCe~t-Yd------~~~~~~~GI~vhdl~F~DG~~Pp 75 (77)
+...-++++++|.+.++-++.+. ++ .+.++++|+.+.-..|++|.+-|
T Consensus 11 l~~l~~~l~~~g~~rvlvVtd~~v~~~~~~~l~~~L~~~g~~~~~~~~~~~e~~k 65 (355)
T cd08197 11 LDSVLGYLPELNADKYLLVTDSNVEDLYGHRLLEYLREAGAPVELLSVPSGEEHK 65 (355)
T ss_pred HHHHHHHHHhcCCCeEEEEECccHHHHHHHHHHHHHHhcCCceEEEEeCCCCCCC
Confidence 33334456667777777777643 22 16677889887656788887643
No 82
>PF01548 DEDD_Tnp_IS110: Transposase; InterPro: IPR002525 Transposase proteins are necessary for efficient DNA transposition. This entry represents the N-terminal region of the pilin gene inverting protein (PIVML) and members of the IS111A/IS1328/IS1533 family of transposases [, ]. More information about these proteins can be found at Protein of the Month: Transposase [].; GO: 0003677 DNA binding, 0004803 transposase activity, 0006313 transposition, DNA-mediated
Probab=34.14 E-value=52 Score=21.09 Aligned_cols=41 Identities=27% Similarity=0.102 Sum_probs=32.2
Q ss_pred cCCcccHHHHHHHHHhCCCeEEEEeCCCCc--Cc---hhhhcCCceEE
Q psy12446 22 VNLENHLHLAYHELKKHQVKNVVRVCEPTY--KV---EDLKTEGINVK 64 (77)
Q Consensus 22 ~pt~~tl~~yi~elk~~~Vt~VVRvCe~tY--d~---~~~~~~GI~vh 64 (77)
.++.+.+..+++-+++++ .++-++|+|- .. ..+.++|+.|+
T Consensus 29 ~~~~~~~~~l~~~l~~~~--~~~v~~E~tg~y~~~l~~~L~~~g~~v~ 74 (144)
T PF01548_consen 29 ENDPAGLEKLLDWLASLG--PVLVVMEATGGYWRPLADFLQDAGIEVV 74 (144)
T ss_pred eccccchhHHhhhhcccc--ccccccccccccchhhhhheeccccccc
Confidence 467888999999999997 6666688763 43 67788899886
No 83
>cd00851 MTH1175 This uncharacterized conserved protein belongs to a family of iron-molybdenum cluster-binding proteins that includes NifX, NifB, and NifY, all of which are involved in the synthesis of an iron-molybdenum cofactor (FeMo-co) that binds the active site of the dinitrogenase enzyme. This domain is a predicted small-molecule-binding domain (SMBD) with an alpha/beta fold that is present either as a stand-alone domain (e.g. NifX and NifY) or fused to another conserved domain (e.g. NifB) however, its function is still undetermined.The SCOP database suggests that this domain is most similar to structures within the ribonuclease H superfamily. This conserved domain is represented in two of the three major divisions of life (bacteria and archaea).
Probab=33.80 E-value=70 Score=19.17 Aligned_cols=37 Identities=24% Similarity=0.209 Sum_probs=25.4
Q ss_pred HHHHHHHHhCCCeEEEE--eCCCCcCchhhhcCCceEEEee
Q psy12446 29 HLAYHELKKHQVKNVVR--VCEPTYKVEDLKTEGINVKDLA 67 (77)
Q Consensus 29 ~~yi~elk~~~Vt~VVR--vCe~tYd~~~~~~~GI~vhdl~ 67 (77)
..-.+.|+++||..|+- .+++.+ ..|+++||++....
T Consensus 53 ~~~~~~l~~~~v~~vi~~~iG~~~~--~~l~~~gI~v~~~~ 91 (103)
T cd00851 53 GKAAEFLADEGVDVVIVGGIGPRAL--NKLRNAGIKVYKGA 91 (103)
T ss_pred hHHHHHHHHcCCCEEEeCCCCcCHH--HHHHHCCCEEEEcC
Confidence 45666677799999883 233322 67899999997443
No 84
>PF05037 DUF669: Protein of unknown function (DUF669); InterPro: IPR007731 This entry is represented by Streptococcus phage Sfi11, Gp151. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches.
Probab=33.76 E-value=63 Score=21.59 Aligned_cols=50 Identities=14% Similarity=0.171 Sum_probs=36.6
Q ss_pred cccHHHHHHHHHhCCCeEEEEeCCCCcCchhhhcCCceEEEeecCCCCCCCC
Q psy12446 25 ENHLHLAYHELKKHQVKNVVRVCEPTYKVEDLKTEGINVKDLAYEDGTSPSP 76 (77)
Q Consensus 25 ~~tl~~yi~elk~~~Vt~VVRvCe~tYd~~~~~~~GI~vhdl~F~DG~~Pp~ 76 (77)
-++++.++..|.-.-++-.|-.-+..|+.+.+....+ ..|.+.|++.|.-
T Consensus 91 ~~sl~~~~~~l~gk~l~V~v~~~~~e~nGk~y~~~~v--~~~~~~~~~~P~~ 140 (141)
T PF05037_consen 91 FESLEQFLNQLLGKPLRVTVKWEENEYNGKTYPNNKV--KAWNPSKTSAPNQ 140 (141)
T ss_pred cccHHHHHHHHcCCeeEEEecccccCCCCcEeecCce--eccCccccCCCCC
Confidence 3467888888877777766666567788877776665 4588899999864
No 85
>TIGR02990 ectoine_eutA ectoine utilization protein EutA. Members of this protein family are EutA, a predicted arylmalonate decarboxylase found in a conserved ectoine utilization operon of species that include Sinorhizobium meliloti 1021 (where it is known to be induced by ectoine), Mesorhizobium loti and Silicibacter pomeroyi. It is missing from two other species with the other ectoine transport and utilization genes: Pseudomonas putida and Agrobacterium tumefaciens.
Probab=33.66 E-value=95 Score=22.84 Aligned_cols=43 Identities=12% Similarity=0.190 Sum_probs=35.9
Q ss_pred cCCcccHHHHHHHHHhCCCeEEEEeCCCCcCc-------hhhhcCCceEEEe
Q psy12446 22 VNLENHLHLAYHELKKHQVKNVVRVCEPTYKV-------EDLKTEGINVKDL 66 (77)
Q Consensus 22 ~pt~~tl~~yi~elk~~~Vt~VVRvCe~tYd~-------~~~~~~GI~vhdl 66 (77)
.|.-.+....++-|+..|++.|.-+.= |.. +-|+++||+|..+
T Consensus 102 ~p~tt~~~A~~~AL~alg~~RIalvTP--Y~~~v~~~~~~~l~~~G~eV~~~ 151 (239)
T TIGR02990 102 TPVVTPSSAAVDGLAALGVRRISLLTP--YTPETSRPMAQYFAVRGFEIVNF 151 (239)
T ss_pred CCeeCHHHHHHHHHHHcCCCEEEEECC--CcHHHHHHHHHHHHhCCcEEeee
Confidence 577788899999999999999987753 554 6799999999876
No 86
>PF02449 Glyco_hydro_42: Beta-galactosidase; InterPro: IPR013529 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. This group of beta-galactosidase enzymes (3.2.1.23 from EC) belong to the glycosyl hydrolase 42 family GH42 from CAZY. The enzyme catalyses the hydrolysis of terminal, non-reducing terminal beta-D-galactosidase residues.; GO: 0004565 beta-galactosidase activity, 0005975 carbohydrate metabolic process, 0009341 beta-galactosidase complex; PDB: 1KWK_A 1KWG_A 3U7V_A.
Probab=33.03 E-value=33 Score=26.03 Aligned_cols=50 Identities=22% Similarity=0.314 Sum_probs=30.5
Q ss_pred CcccHHHHHHHHHhCCCeEEE--EeC----CC---CcCc-------hhhhcCCceEEEeecCCCCCC
Q psy12446 24 LENHLHLAYHELKKHQVKNVV--RVC----EP---TYKV-------EDLKTEGINVKDLAYEDGTSP 74 (77)
Q Consensus 24 t~~tl~~yi~elk~~~Vt~VV--RvC----e~---tYd~-------~~~~~~GI~vhdl~F~DG~~P 74 (77)
....++.+++.||+.|+..|- .+. || .||= +.+.++||+|. |-++-+++|
T Consensus 8 ~~e~~~~d~~~m~~~G~n~vri~~~~W~~lEP~eG~ydF~~lD~~l~~a~~~Gi~vi-L~~~~~~~P 73 (374)
T PF02449_consen 8 PEEEWEEDLRLMKEAGFNTVRIGEFSWSWLEPEEGQYDFSWLDRVLDLAAKHGIKVI-LGTPTAAPP 73 (374)
T ss_dssp -CCHHHHHHHHHHHHT-SEEEE-CCEHHHH-SBTTB---HHHHHHHHHHHCTT-EEE-EEECTTTS-
T ss_pred CHHHHHHHHHHHHHcCCCEEEEEEechhhccCCCCeeecHHHHHHHHHHHhccCeEE-EEecccccc
Confidence 457788899999999987653 344 23 2332 67788999885 766666655
No 87
>cd04955 GT1_like_6 This family is most closely related to the GT1 family of glycosyltransferases. Glycosyltransferases catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. The acceptor molecule can be a lipid, a protein, a heterocyclic compound, or another carbohydrate residue. This group of glycosyltransferases is most closely related to the previously defined glycosyltransferase family 1 (GT1). The members of this family may transfer UDP, ADP, GDP, or CMP linked sugars. The diverse enzymatic activities among members of this family reflect a wide range of biological functions. The protein structure available for this family has the GTB topology, one of the two protein topologies observed for nucleotide-sugar-dependent glycosyltransferases. GTB proteins have distinct N- and C- terminal domains each containing a typical Rossmann fold. The two domains have high structural homology despite minimal sequence homolog
Probab=32.97 E-value=1.3e+02 Score=21.07 Aligned_cols=46 Identities=13% Similarity=0.130 Sum_probs=30.9
Q ss_pred cccHHHHHHHHHhCCCeEEEEeCCCCcCchhhhcCCceEEEeecCC
Q psy12446 25 ENHLHLAYHELKKHQVKNVVRVCEPTYKVEDLKTEGINVKDLAYED 70 (77)
Q Consensus 25 ~~tl~~yi~elk~~~Vt~VVRvCe~tYd~~~~~~~GI~vhdl~F~D 70 (77)
..-+.+..++|.++|..-.|-+..........+..||.++.++.+.
T Consensus 18 ~~~~~~la~~L~~~g~~v~v~~~~~~~~~~~~~~~~i~~~~~~~~~ 63 (363)
T cd04955 18 ETFVEELAPRLVARGHEVTVYCRSPYPKQKETEYNGVRLIHIPAPE 63 (363)
T ss_pred HHHHHHHHHHHHhcCCCEEEEEccCCCCCcccccCCceEEEcCCCC
Confidence 3445666778999998766655554444435667899998877653
No 88
>PRK11895 ilvH acetolactate synthase 3 regulatory subunit; Reviewed
Probab=32.90 E-value=39 Score=23.76 Aligned_cols=25 Identities=16% Similarity=0.206 Sum_probs=21.8
Q ss_pred CCcccHHHHHHHHHhCCCeEEEEeC
Q psy12446 23 NLENHLHLAYHELKKHQVKNVVRVC 47 (77)
Q Consensus 23 pt~~tl~~yi~elk~~~Vt~VVRvC 47 (77)
-+..-++.|++.|+.+|+..++|.=
T Consensus 126 G~~~ki~~~~~~l~~~gi~e~~RTG 150 (161)
T PRK11895 126 GDSDKIDAFIDLLRPYGIKEIVRTG 150 (161)
T ss_pred CCHHHHHHHHHHhhhcCCEEEEccC
Confidence 4667899999999999999999863
No 89
>COG5324 Uncharacterized conserved protein [Function unknown]
Probab=32.84 E-value=53 Score=28.41 Aligned_cols=31 Identities=19% Similarity=0.332 Sum_probs=27.4
Q ss_pred CcccHHHHHHHHHhCCCeEEEEeCCCCcCch
Q psy12446 24 LENHLHLAYHELKKHQVKNVVRVCEPTYKVE 54 (77)
Q Consensus 24 t~~tl~~yi~elk~~~Vt~VVRvCe~tYd~~ 54 (77)
++.|-+++-.+|-.+|+|.|.-+|.+.|+.-
T Consensus 161 ~~~~k~e~a~~l~~~n~TaVaE~cddefeeH 191 (758)
T COG5324 161 TGRTKQELAHELIENNCTAVAEVCDDEFEEH 191 (758)
T ss_pred hCccHHHHHHHHHhcCCeEeeeecCccchhh
Confidence 5677888999999999999999999888873
No 90
>TIGR02765 crypto_DASH cryptochrome, DASH family. Photolyases and cryptochromes are related flavoproteins. Photolyases harness the energy of blue light to repair DNA damage by removing pyrimidine dimers. Cryptochromes do not repair DNA and are presumed to act instead in some other (possibly unknown) process such as entraining circadian rhythms. This model describes the cryptochrome DASH subfamily, one of at least five major subfamilies, which is found in plants, animals, marine bacteria, etc. Members of this family bind both folate and FAD. They may show weak photolyase activity in vitro but have not been shown to affect DNA repair in vivo. Rather, DASH family cryptochromes have been shown to bind RNA (Vibrio cholerae VC1814), or DNA, and seem likely to act in light-responsive regulatory processes.
Probab=32.76 E-value=1e+02 Score=23.81 Aligned_cols=58 Identities=21% Similarity=0.182 Sum_probs=30.1
Q ss_pred ceeEEeecCCcccHHHHHHHHHhCCCeEEEEeCCCCcCc--------hhhhcCCceEE-----------EeecCCCCCC
Q psy12446 15 TVTLYHGVNLENHLHLAYHELKKHQVKNVVRVCEPTYKV--------EDLKTEGINVK-----------DLAYEDGTSP 74 (77)
Q Consensus 15 ~~~fli~~pt~~tl~~yi~elk~~~Vt~VVRvCe~tYd~--------~~~~~~GI~vh-----------dl~F~DG~~P 74 (77)
+.+|++.. ....+...+..+++||+.|+-==++.... +.+++.||.++ +++|.+|++|
T Consensus 74 g~~L~v~~--G~~~~vl~~L~~~~~~~~V~~~~~~~~~~~~rd~~v~~~l~~~~i~~~~~~~~~l~~p~~v~~~~~~~~ 150 (429)
T TIGR02765 74 GSDLLVRS--GKPEDVLPELIKELGVRTVFLHQEVGSEEKSVERLLQQALARLGIHVEQHWGSTLYHEDDLPFDLEDLP 150 (429)
T ss_pred CCCeEEEe--CCHHHHHHHHHHHhCCCEEEEeccCCHHHHHHHHHHHHHHHhcCceEEEecCCEeECHHhcCCCCCCCC
Confidence 34555532 12233333444667787777322222111 34678899865 3566666665
No 91
>COG0120 RpiA Ribose 5-phosphate isomerase [Carbohydrate transport and metabolism]
Probab=32.66 E-value=36 Score=25.59 Aligned_cols=46 Identities=17% Similarity=0.119 Sum_probs=35.5
Q ss_pred eecCCcccHHHHHHHHHh---CCCeEEEEeCCCCcCchhhhcCCceEEEe
Q psy12446 20 HGVNLENHLHLAYHELKK---HQVKNVVRVCEPTYKVEDLKTEGINVKDL 66 (77)
Q Consensus 20 i~~pt~~tl~~yi~elk~---~~Vt~VVRvCe~tYd~~~~~~~GI~vhdl 66 (77)
|.--|.+|...||++|-+ .+ ..++=|+...=...+.++.||.+.++
T Consensus 24 iGlGTGST~~~fI~~Lg~~~~~e-~~i~~V~TS~~t~~l~~~~GI~v~~l 72 (227)
T COG0120 24 IGLGTGSTAAYFIEALGRRVKGE-LDIGGVPTSFQTEELARELGIPVSSL 72 (227)
T ss_pred EEEcCcHHHHHHHHHHHHhhccC-ccEEEEeCCHHHHHHHHHcCCeecCc
Confidence 455689999999999963 23 46777777666668899999998754
No 92
>TIGR00640 acid_CoA_mut_C methylmalonyl-CoA mutase C-terminal domain. Methylmalonyl-CoA mutase (EC 5.4.99.2) catalyzes a reversible isomerization between L-methylmalonyl-CoA and succinyl-CoA. The enzyme uses an adenosylcobalamin cofactor. It may be a homodimer, as in mitochondrion, or a heterodimer with partially homologous beta chain that does not bind the adenosylcobalamin cofactor, as in Propionibacterium freudenreichii. The most similar archaeal sequences are separate chains, such as AF2215 and AF2219 of Archaeoglobus fulgidus, that correspond roughly to the first 500 and last 130 residues, respectively of known methylmalonyl-CoA mutases. This model describes the C-terminal domain subfamily. In a neighbor-joining tree (methylaspartate mutase S chain as the outgroup), AF2219 branches with a coenzyme B12-dependent enzyme known not to be 5.4.99.2.
Probab=32.30 E-value=41 Score=22.53 Aligned_cols=34 Identities=24% Similarity=0.368 Sum_probs=14.8
Q ss_pred HHHHHHHHHhCCCeEEEEeCC---CCcCchhhhcCCc
Q psy12446 28 LHLAYHELKKHQVKNVVRVCE---PTYKVEDLKTEGI 61 (77)
Q Consensus 28 l~~yi~elk~~~Vt~VVRvCe---~tYd~~~~~~~GI 61 (77)
++.-++.|++.|..++.-++- |.-+.+.+++.|+
T Consensus 70 ~~~~~~~L~~~g~~~i~vivGG~~~~~~~~~l~~~Gv 106 (132)
T TIGR00640 70 VPALRKELDKLGRPDILVVVGGVIPPQDFDELKEMGV 106 (132)
T ss_pred HHHHHHHHHhcCCCCCEEEEeCCCChHhHHHHHHCCC
Confidence 444444555554433222222 4444455555555
No 93
>PRK14463 ribosomal RNA large subunit methyltransferase N; Provisional
Probab=32.10 E-value=46 Score=25.83 Aligned_cols=23 Identities=13% Similarity=0.108 Sum_probs=21.6
Q ss_pred CCcccHHHHHHHHHhCCCeEEEE
Q psy12446 23 NLENHLHLAYHELKKHQVKNVVR 45 (77)
Q Consensus 23 pt~~tl~~yi~elk~~~Vt~VVR 45 (77)
|+...+..|.+.|+++|++..||
T Consensus 298 ps~e~i~~f~~~L~~~gi~v~vR 320 (349)
T PRK14463 298 PTQEAIDRFHKYLLDKHVTVITR 320 (349)
T ss_pred CCHHHHHHHHHHHHHCCceEEEe
Confidence 68888999999999999999999
No 94
>PF10662 PduV-EutP: Ethanolamine utilisation - propanediol utilisation; InterPro: IPR012381 Members of this family function in ethanolamine [] and propanediol [] degradation pathways. Both pathways require coenzyme B12 (adenosylcobalamin, AdoCbl). Bacteria that harbour these pathways can use ethanolamine as a source of carbon and nitrogen, or propanediol as a sole carbon and energy source, respectively. The exact roles of the EutP and PduV proteins in these respective pathways are not yet determined. Members of this family contain P-loop consensus motifs in the N-terminal part, and are distantly related to various GTPases and ATPases, including ATPase components of transport systems. Propanediol degradation is thought to be important for the natural Salmonella populations, since propanediol is produced by the fermentation of the common plant sugars rhamnose and fucose [, ]. More than 1% of the Salmonella enterica genome is devoted to the utilisation of propanediol and cobalamin biosynthesis. In vivo expression technology has indicated that propanediol utilisation (pdu) genes may be important for growth in host tissues, and competitive index studies with mice have shown that pdu mutations confer a virulence defect [, ]. The pdu operon is contiguous and co-regulated with the cobalamin (B12) biosynthesis cob operon, indicating that propanediol catabolism may be the primary reason for de novo B12 synthesis in Salmonella [, , ]. Please see IPR003207 from INTERPRO, IPR003208 from INTERPRO, IPR009204 from INTERPRO, IPR009191 from INTERPRO, IPR009192 from INTERPRO for more details on the propanediol utilisation pathway and the pdu operon.; GO: 0005524 ATP binding, 0006576 cellular biogenic amine metabolic process
Probab=32.01 E-value=51 Score=22.82 Aligned_cols=27 Identities=22% Similarity=0.236 Sum_probs=20.6
Q ss_pred ecC-CcccHHHHHHHHHhCCCeEEEEeC
Q psy12446 21 GVN-LENHLHLAYHELKKHQVKNVVRVC 47 (77)
Q Consensus 21 ~~p-t~~tl~~yi~elk~~~Vt~VVRvC 47 (77)
|.+ ++++++.-.+.|+..|++.++++.
T Consensus 100 Dl~~~~~~i~~a~~~L~~aG~~~if~vS 127 (143)
T PF10662_consen 100 DLPSDDANIERAKKWLKNAGVKEIFEVS 127 (143)
T ss_pred cCccchhhHHHHHHHHHHcCCCCeEEEE
Confidence 444 678888888888888888776654
No 95
>PRK14457 ribosomal RNA large subunit methyltransferase N; Provisional
Probab=31.92 E-value=46 Score=25.84 Aligned_cols=25 Identities=16% Similarity=0.105 Sum_probs=22.4
Q ss_pred cCCcccHHHHHHHHHhCCCeEEEEe
Q psy12446 22 VNLENHLHLAYHELKKHQVKNVVRV 46 (77)
Q Consensus 22 ~pt~~tl~~yi~elk~~~Vt~VVRv 46 (77)
.|+...++.|.+.|+++|+...||-
T Consensus 302 ~ps~e~i~~f~~~L~~~Gi~vtvR~ 326 (345)
T PRK14457 302 RPSPKRIQAFQRVLEQRGVAVSVRA 326 (345)
T ss_pred CCCHHHHHHHHHHHHHCCCeEEEeC
Confidence 4688999999999999999999983
No 96
>COG3185 4-hydroxyphenylpyruvate dioxygenase and related hemolysins [Amino acid transport and metabolism / General function prediction only]
Probab=31.52 E-value=38 Score=27.30 Aligned_cols=51 Identities=12% Similarity=0.184 Sum_probs=41.8
Q ss_pred ceeEEeecC-Cccc-HHHHHHHHHhCCCeEEEEeCCCCcCc-hhhhcCCceEEE
Q psy12446 15 TVTLYHGVN-LENH-LHLAYHELKKHQVKNVVRVCEPTYKV-EDLKTEGINVKD 65 (77)
Q Consensus 15 ~~~fli~~p-t~~t-l~~yi~elk~~~Vt~VVRvCe~tYd~-~~~~~~GI~vhd 65 (77)
.+|+-|..+ ++.+ ++.||.+-+=-||++|-=-+..-|++ +.+.++|++..+
T Consensus 219 ~vrlplN~s~~~~sqi~efl~~y~G~GIQHIA~~T~dI~~tv~~lr~rG~~fl~ 272 (363)
T COG3185 219 KVRLPLNESADDKSQIGEFLREYRGEGIQHIAFGTDDIYATVAALRERGVKFLP 272 (363)
T ss_pred cEEeecccCCCchhHHHHHHHHhCCCcceEEEecccHHHHHHHHHHHcCCccCC
Confidence 378889643 4444 89999999999999999888888887 889999998754
No 97
>cd02072 Glm_B12_BD B12 binding domain of glutamate mutase (Glm). Glutamate mutase catalysis the conversion of (S)-glutamate with (2S,3S)-3-methylaspartate. The rearrangement reaction is initiated by the extraction of a hydrogen from the protein-bound substrate by a 5'-desoxyadenosyl radical, which is generated by the homolytic cleavage of the organometallic bond of the cofactor B12. Glm is a heterotetrameric molecule consisting of two alpha and two epsilon polypeptide chains.
Probab=31.34 E-value=31 Score=23.45 Aligned_cols=49 Identities=16% Similarity=0.167 Sum_probs=35.3
Q ss_pred CcccHHHHHHHHHhCCCeEEEEeCC-----C----CcCchhhhcCCceEEEeecCCCCCCC
Q psy12446 24 LENHLHLAYHELKKHQVKNVVRVCE-----P----TYKVEDLKTEGINVKDLAYEDGTSPS 75 (77)
Q Consensus 24 t~~tl~~yi~elk~~~Vt~VVRvCe-----~----tYd~~~~~~~GI~vhdl~F~DG~~Pp 75 (77)
+-..+..-++.|++.|+.++.-++- | -++.++|++.|+ +--|+-|++|.
T Consensus 63 ~~~~~~~~~~~l~~~gl~~v~vivGG~~~i~~~d~~~~~~~L~~~Gv---~~vf~pgt~~~ 120 (128)
T cd02072 63 GEIDCKGLREKCDEAGLKDILLYVGGNLVVGKQDFEDVEKRFKEMGF---DRVFAPGTPPE 120 (128)
T ss_pred CHHHHHHHHHHHHHCCCCCCeEEEECCCCCChhhhHHHHHHHHHcCC---CEEECcCCCHH
Confidence 4556788888999999866655553 1 235577999999 56788888664
No 98
>CHL00100 ilvH acetohydroxyacid synthase small subunit
Probab=31.24 E-value=90 Score=22.23 Aligned_cols=38 Identities=13% Similarity=0.128 Sum_probs=27.5
Q ss_pred eeccceeEEee-cCCcccHHHHHHHHHhCCCeEEEEeCC
Q psy12446 11 LDYKTVTLYHG-VNLENHLHLAYHELKKHQVKNVVRVCE 48 (77)
Q Consensus 11 ~~y~~~~fli~-~pt~~tl~~yi~elk~~~Vt~VVRvCe 48 (77)
||...=.+.|- .-+..-++.|++.|+.+|+..++|.=.
T Consensus 113 vdv~~~~~~ie~tG~~~ki~a~~~~l~~~gi~e~~RtG~ 151 (174)
T CHL00100 113 VDLSEESLILEVTGDPGKIVAIEQLLEKFGIIEIARTGK 151 (174)
T ss_pred EEecCCEEEEEEcCCHHHHHHHHHHhhhcCCEEEEccCc
Confidence 34433334443 347788999999999999999998643
No 99
>PRK00979 tetrahydromethanopterin S-methyltransferase subunit H; Provisional
Probab=30.98 E-value=96 Score=24.30 Aligned_cols=59 Identities=12% Similarity=0.026 Sum_probs=41.1
Q ss_pred ceeEEeecCCcccHHHHHHHHHhCCCeEEEEeCCCCcCc-----hhhhcCCce-EEEeecCCCCC
Q psy12446 15 TVTLYHGVNLENHLHLAYHELKKHQVKNVVRVCEPTYKV-----EDLKTEGIN-VKDLAYEDGTS 73 (77)
Q Consensus 15 ~~~fli~~pt~~tl~~yi~elk~~~Vt~VVRvCe~tYd~-----~~~~~~GI~-vhdl~F~DG~~ 73 (77)
++-|+||++....++.=+++.+..|...=.=.|--+.+. .++++.|+. +.=|.|++|.|
T Consensus 96 d~Pl~IDSt~p~a~eaaLk~~~e~G~~gR~IiNSIn~e~~~eel~llk~yg~aavIvLa~d~~~p 160 (308)
T PRK00979 96 DLPFLIDSTSPEARIAAAKYATELGLADRAIYNSINPSIEEEEIEALKESDIKAAIVLAFDPMDP 160 (308)
T ss_pred CCCEEEeCCCHHHHHHHHHHhhhcCCCCceEEEeccCCCCHHHHHHHHHhCCceEEEEEcCCCCC
Confidence 367899999888888888888776753333334333332 677888976 77799999844
No 100
>TIGR00201 comF comF family protein. This protein is found in species that do (Bacillus subtilis, Haemophilus influenzae) or do not (E. coli, Borrelia burgdorferi) have described systems for natural transformation with exogenous DNA. It is involved in competence for transformation in Bacillus subtilis.
Probab=30.92 E-value=77 Score=21.87 Aligned_cols=34 Identities=15% Similarity=-0.001 Sum_probs=26.7
Q ss_pred cceeEEe--e-cCCcccHHHHHHHHHhCCCeEEEEeC
Q psy12446 14 KTVTLYH--G-VNLENHLHLAYHELKKHQVKNVVRVC 47 (77)
Q Consensus 14 ~~~~fli--~-~pt~~tl~~yi~elk~~~Vt~VVRvC 47 (77)
++-+.|| | .-|.+|+....+.|++.|+..|.-+|
T Consensus 151 ~~~~vllvDDV~TTGaTl~~~~~~L~~~Ga~~V~~~~ 187 (190)
T TIGR00201 151 QGRNIVLVDDVVTTGATLHEIARLLLELGAASVQVWT 187 (190)
T ss_pred CCCEEEEEeeeeccHHHHHHHHHHHHHcCCCEEEEEE
Confidence 3445666 4 46999999999999999998776554
No 101
>PRK12435 ferrochelatase; Provisional
Probab=30.75 E-value=51 Score=25.17 Aligned_cols=42 Identities=14% Similarity=0.058 Sum_probs=32.6
Q ss_pred CcccHHHHHHHHHhC-CCeEEEEeC--------CCCcCc-----hhhhcCCceEEE
Q psy12446 24 LENHLHLAYHELKKH-QVKNVVRVC--------EPTYKV-----EDLKTEGINVKD 65 (77)
Q Consensus 24 t~~tl~~yi~elk~~-~Vt~VVRvC--------e~tYd~-----~~~~~~GI~vhd 65 (77)
..-...+-|++|.+. ||+.|+-+| |..|+- +.+++.|++..-
T Consensus 230 L~P~t~d~l~~l~~~~G~k~v~vvpigFvsDhlETl~Eldie~~e~a~~~G~~~~r 285 (311)
T PRK12435 230 LGPDVQDLTRDLYEEHGYKSFIYTPVGFVAEHLEVLYDNDYECKVVTDEIGAKYYR 285 (311)
T ss_pred CCCCHHHHHHHHHHhcCCceEEEECCchhhhhHHHHHHHHHHHHHHHHHcCCcEEe
Confidence 455677888889776 999999999 456766 578999997554
No 102
>PRK12419 riboflavin synthase subunit beta; Provisional
Probab=30.34 E-value=63 Score=22.95 Aligned_cols=33 Identities=6% Similarity=0.002 Sum_probs=25.7
Q ss_pred cceeEEee-c--C---CcccHHHHHHHHHhCCCe----EEEEe
Q psy12446 14 KTVTLYHG-V--N---LENHLHLAYHELKKHQVK----NVVRV 46 (77)
Q Consensus 14 ~~~~fli~-~--p---t~~tl~~yi~elk~~~Vt----~VVRv 46 (77)
+++||.|= + | |+.-++-.++.|+++||. +|+||
T Consensus 9 ~~~riaIV~srfn~~It~~Ll~gA~~~l~~~G~~~~~i~v~~V 51 (158)
T PRK12419 9 TPQRIAFIQARWHADIVDQARKGFVAEIAARGGAASQVDIFDV 51 (158)
T ss_pred CCCEEEEEEecCCHHHHHHHHHHHHHHHHHcCCCccceEEEEC
Confidence 56899992 2 3 788899999999999974 56655
No 103
>PF13117 Cag12: Cag pathogenicity island protein Cag12
Probab=30.21 E-value=94 Score=20.91 Aligned_cols=25 Identities=16% Similarity=0.041 Sum_probs=20.7
Q ss_pred ecCCcccHHHHHHHHHhCCCeEEEE
Q psy12446 21 GVNLENHLHLAYHELKKHQVKNVVR 45 (77)
Q Consensus 21 ~~pt~~tl~~yi~elk~~~Vt~VVR 45 (77)
..+.......|-..|+++|++.++.
T Consensus 86 ~~~~~~~~~~~K~wL~~nGa~avIe 110 (113)
T PF13117_consen 86 LTGDGNLFFQYKNWLRKNGATAVIE 110 (113)
T ss_pred EcCCHHHHHHHHHHHHHcCCceeEE
Confidence 3456677888999999999999885
No 104
>PF12123 Amidase02_C: N-acetylmuramoyl-l-alanine amidase; InterPro: IPR021976 This domain is found in bacteria and viruses. This domain is about 50 amino acids in length. This domain is classified with the enzyme classification code 3.5.1.28 from EC. This domain is the C-terminal of the enzyme which hydrolyses the link between N-acetylmuramoyl residues and L-amino acid residues in certain cell-wall glycopeptides. ; PDB: 2L48_B.
Probab=30.14 E-value=65 Score=18.51 Aligned_cols=27 Identities=7% Similarity=-0.048 Sum_probs=19.3
Q ss_pred cceeEEeecC-CcccHHHHHHHHHhCCC
Q psy12446 14 KTVTLYHGVN-LENHLHLAYHELKKHQV 40 (77)
Q Consensus 14 ~~~~fli~~p-t~~tl~~yi~elk~~~V 40 (77)
.++.++++-| +++.|..+-+.|.++|-
T Consensus 14 ~Gl~y~vT~~~s~~~L~k~~~wld~rgW 41 (45)
T PF12123_consen 14 DGLPYFVTDPLSDAELDKFTAWLDERGW 41 (45)
T ss_dssp TS-EEEEE----HHHHHHHHHHHHHTT-
T ss_pred CCcEEEEeCCCCHHHHHHHHHHHHhcCc
Confidence 5688899887 89999999999988874
No 105
>PF13477 Glyco_trans_4_2: Glycosyl transferase 4-like
Probab=30.08 E-value=1.4e+02 Score=18.39 Aligned_cols=41 Identities=24% Similarity=0.147 Sum_probs=28.3
Q ss_pred cHHHHHHHHHhCCCeEEEEeCCCCcCchhhhcCCceEEEeec
Q psy12446 27 HLHLAYHELKKHQVKNVVRVCEPTYKVEDLKTEGINVKDLAY 68 (77)
Q Consensus 27 tl~~yi~elk~~~Vt~VVRvCe~tYd~~~~~~~GI~vhdl~F 68 (77)
-.-+.+++|++.|..-.+=.-...+ .+....+||.++.++.
T Consensus 12 ~~~~~~~~L~~~g~~V~ii~~~~~~-~~~~~~~~i~~~~~~~ 52 (139)
T PF13477_consen 12 FIYNLAKELKKRGYDVHIITPRNDY-EKYEIIEGIKVIRLPS 52 (139)
T ss_pred HHHHHHHHHHHCCCEEEEEEcCCCc-hhhhHhCCeEEEEecC
Confidence 3567789999988765554444444 4455589999998863
No 106
>PRK13810 orotate phosphoribosyltransferase; Provisional
Probab=29.92 E-value=1.9e+02 Score=20.43 Aligned_cols=53 Identities=11% Similarity=0.066 Sum_probs=39.8
Q ss_pred cceeEEe--e-cCCcccHHHHHHHHHhCCCeEEE--EeCCCCcC-chhhhcCCceEEEe
Q psy12446 14 KTVTLYH--G-VNLENHLHLAYHELKKHQVKNVV--RVCEPTYK-VEDLKTEGINVKDL 66 (77)
Q Consensus 14 ~~~~fli--~-~pt~~tl~~yi~elk~~~Vt~VV--RvCe~tYd-~~~~~~~GI~vhdl 66 (77)
++-|.+| | .-|..|+..-++.+++.|.+-+- -+.+.... .+.+++.||.++-+
T Consensus 121 ~g~rVlIVDDVitTGgS~~~~i~~l~~~Ga~V~~v~vlvdr~~g~~~~l~~~gi~~~sl 179 (187)
T PRK13810 121 PEDRIVMLEDVTTSGGSVREAIEVVREAGAYIKYVITVVDREEGAEENLKEADVELVPL 179 (187)
T ss_pred CcCEEEEEEeccCCChHHHHHHHHHHHCCCEEEEEEEEEECCcChHHHHHHcCCcEEEE
Confidence 6778888 4 46999999999999999865444 44454444 36788899988754
No 107
>PF02602 HEM4: Uroporphyrinogen-III synthase HemD; InterPro: IPR003754 Tetrapyrroles are large macrocyclic compounds derived from a common biosynthetic pathway []. The end-product, uroporphyrinogen III, is used to synthesise a number of important molecules, including vitamin B12, haem, sirohaem, chlorophyll, coenzyme F430 and phytochromobilin []. The first stage in tetrapyrrole synthesis is the synthesis of 5-aminoaevulinic acid ALA via two possible routes: (1) condensation of succinyl CoA and glycine (C4 pathway) using ALA synthase (2.3.1.37 from EC), or (2) decarboxylation of glutamate (C5 pathway) via three different enzymes, glutamyl-tRNA synthetase (6.1.1.17 from EC) to charge a tRNA with glutamate, glutamyl-tRNA reductase (1.2.1.70 from EC) to reduce glutamyl-tRNA to glutamate-1-semialdehyde (GSA), and GSA aminotransferase (5.4.3.8 from EC) to catalyse a transamination reaction to produce ALA. The second stage is to convert ALA to uroporphyrinogen III, the first macrocyclic tetrapyrrolic structure in the pathway. This is achieved by the action of three enzymes in one common pathway: porphobilinogen (PBG) synthase (or ALA dehydratase, 4.2.1.24 from EC) to condense two ALA molecules to generate porphobilinogen; hydroxymethylbilane synthase (or PBG deaminase, 2.5.1.61 from EC) to polymerise four PBG molecules into preuroporphyrinogen (tetrapyrrole structure); and uroporphyrinogen III synthase (4.2.1.75 from EC) to link two pyrrole units together (rings A and D) to yield uroporphyrinogen III. Uroporphyrinogen III is the first branch point of the pathway. To synthesise cobalamin (vitamin B12), sirohaem, and coenzyme F430, uroporphyrinogen III needs to be converted into precorrin-2 by the action of uroporphyrinogen III methyltransferase (2.1.1.107 from EC). To synthesise haem and chlorophyll, uroporphyrinogen III needs to be decarboxylated into coproporphyrinogen III by the action of uroporphyrinogen III decarboxylase (4.1.1.37 from EC) []. This entry represents uroporphyrinogen III synthase (4.2.1.75 from EC) which functions during the second stage of tetrapyrrole biosynthesis. This enzyme catalyses the inversion of the final pyrrole unit (ring D) of the linear tetrapyrrole molecule, linking it to the first pyrrole unit (ring A), thereby generating a large macrocyclic structure called uroporphyrinogen III []. The enzyme folds into two alpha/beta domains connected by a beta-ladder, the active site being located between the two domains []. Congenital erythropoietic porphyria (CEP) is an autosomal recessive inborn error of metabolism that results from the markedly deficient activity of uroporphyrinogen III synthase []. ; GO: 0004852 uroporphyrinogen-III synthase activity, 0033014 tetrapyrrole biosynthetic process; PDB: 1WD7_B 1WCX_A 1WCW_A 3D8R_A 3D8T_B 3D8S_A 3D8N_A 3RE1_A 3MW8_A 3P9Z_A ....
Probab=29.73 E-value=1.3e+02 Score=20.42 Aligned_cols=46 Identities=24% Similarity=0.196 Sum_probs=32.0
Q ss_pred cCC-cccHHHHHHHHHh-CCCeEEEEeCCCCcCc---hhhhcCCceEEEee
Q psy12446 22 VNL-ENHLHLAYHELKK-HQVKNVVRVCEPTYKV---EDLKTEGINVKDLA 67 (77)
Q Consensus 22 ~pt-~~tl~~yi~elk~-~~Vt~VVRvCe~tYd~---~~~~~~GI~vhdl~ 67 (77)
+|. ..+-+..++.|++ ..-..|+-+|...-+. +.++++|++|.+|+
T Consensus 97 ~~~~~~~s~~L~~~l~~~~~~~~vl~~~g~~~~~~l~~~L~~~g~~v~~~~ 147 (231)
T PF02602_consen 97 VPSSEGSSEGLAELLKEQLRGKRVLILRGEGGRPDLPEKLREAGIEVTEVI 147 (231)
T ss_dssp E-TTSSSHHHHHGGHHHCCTTEEEEEEESSSSCHHHHHHHHHTTEEEEEEE
T ss_pred cCCCCCCHHHHHHHHHhhCCCCeEEEEcCCCccHHHHHHHHHCCCeEEEEE
Confidence 464 6677777888885 2226788888754444 77889999998775
No 108
>PF00752 XPG_N: XPG N-terminal domain; InterPro: IPR006085 Xeroderma pigmentosum (XP) [] is a human autosomal recessive disease, characterised by a high incidence of sunlight-induced skin cancer. People's skin cells with this condition are hypersensitive to ultraviolet light, due to defects in the incision step of DNA excision repair. There are a minimum of seven genetic complementation groups involved in this pathway: XP-A to XP-G. XP-G is one of the most rare and phenotypically heterogeneous of XP, showing anything from slight to extreme dysfunction in DNA excision repair [, ]. XP-G can be corrected by a 133 Kd nuclear protein, XPGC []. XPGC is an acidic protein that confers normal UV resistance in expressing cells []. It is a magnesium-dependent, single-strand DNA endonuclease that makes structure-specific endonucleolytic incisions in a DNA substrate containing a duplex region and single-stranded arms [, ]. XPGC cleaves one strand of the duplex at the border with the single-stranded region []. XPG belongs to a family of proteins that includes RAD2 from Saccharomyces cerevisiae (Baker's yeast) and rad13 from Schizosaccharomyces pombe (Fission yeast), which are single-stranded DNA endonucleases [, ]; mouse and human FEN-1, a structure-specific endonuclease; RAD2 from fission yeast and RAD27 from budding yeast; fission yeast exo1, a 5'-3' double-stranded DNA exonuclease that may act in a pathway that corrects mismatched base pairs; yeast DHS1, and yeast DIN7. Sequence alignment of this family of proteins reveals that similarities are largely confined to two regions. The first is located at the N-terminal extremity (N-region) and corresponds to the first 95 to 105 amino acids. The second region is internal (I-region) and found towards the C terminus; it spans about 140 residues and contains a highly conserved core of 27 amino acids that includes a conserved pentapeptide (E-A-[DE]-A-[QS]). It is possible that the conserved acidic residues are involved in the catalytic mechanism of DNA excision repair in XPG. The amino acids linking the N- and I-regions are not conserved. This entry represents the N-terminal of XPG.; GO: 0004518 nuclease activity, 0006281 DNA repair; PDB: 1A77_A 1A76_A 1MC8_B 3QEB_Z 3QEA_Z 3QE9_Y 1UL1_Z 3Q8K_A 3Q8M_A 3Q8L_A ....
Probab=29.64 E-value=67 Score=19.71 Aligned_cols=22 Identities=18% Similarity=0.149 Sum_probs=15.0
Q ss_pred cHHHHHHHHHhCCCeEEEEeCC
Q psy12446 27 HLHLAYHELKKHQVKNVVRVCE 48 (77)
Q Consensus 27 tl~~yi~elk~~~Vt~VVRvCe 48 (77)
++-..+..|+++||+.|+.+=.
T Consensus 60 ~~~~r~~~L~~~gI~PifVFDG 81 (101)
T PF00752_consen 60 GLFSRLCRLLEHGIKPIFVFDG 81 (101)
T ss_dssp HHHHHHHHHHHTTEEEEEEE--
T ss_pred HHHHHHHHHHHCCCEEEEEECC
Confidence 4445666788999999987754
No 109
>PF00975 Thioesterase: Thioesterase domain; InterPro: IPR001031 Thioesterase domains often occur integrated in or associated with peptide synthetases which are involved in the non-ribosomal synthesis of peptide antibiotics []. Thioesterases are required for the addition of the last amino acid to the peptide antibiotic, thereby forming a cyclic antibiotic. Next to the operons encoding these enzymes, in almost all cases, are genes that encode proteins that have similarity to the type II fatty acid thioesterases of vertebrates.; GO: 0016788 hydrolase activity, acting on ester bonds, 0009058 biosynthetic process; PDB: 2RON_A 2K2Q_B 3LCR_B 2HFJ_B 1MNQ_A 1MN6_B 1MNA_B 2HFK_B 2H7Y_B 2H7X_A ....
Probab=29.57 E-value=20 Score=24.11 Aligned_cols=49 Identities=16% Similarity=0.157 Sum_probs=35.9
Q ss_pred cHHHHHHHHHhCCCeEEEEeCCCCcCc-------hhhhcCCceEEEeecCCCCCCC
Q psy12446 27 HLHLAYHELKKHQVKNVVRVCEPTYKV-------EDLKTEGINVKDLAYEDGTSPS 75 (77)
Q Consensus 27 tl~~yi~elk~~~Vt~VVRvCe~tYd~-------~~~~~~GI~vhdl~F~DG~~Pp 75 (77)
-.+.|++.+++..-+-=++++.-++-. ..++++|.+|..+..-|+.+|.
T Consensus 51 la~~y~~~I~~~~~~gp~~L~G~S~Gg~lA~E~A~~Le~~G~~v~~l~liD~~~p~ 106 (229)
T PF00975_consen 51 LASRYAEAIRARQPEGPYVLAGWSFGGILAFEMARQLEEAGEEVSRLILIDSPPPS 106 (229)
T ss_dssp HHHHHHHHHHHHTSSSSEEEEEETHHHHHHHHHHHHHHHTT-SESEEEEESCSSTT
T ss_pred HHHHHHHHhhhhCCCCCeeehccCccHHHHHHHHHHHHHhhhccCceEEecCCCCC
Confidence 367888888887766455555544443 7899999999999999987774
No 110
>TIGR00114 lumazine-synth 6,7-dimethyl-8-ribityllumazine synthase. Archaeal members of this family are considered putative, although included in the seed and scoring above the trusted cutoff.
Probab=29.08 E-value=65 Score=22.13 Aligned_cols=31 Identities=26% Similarity=0.181 Sum_probs=22.7
Q ss_pred eeEEe-ec--C---CcccHHHHHHHHHhCCCe----EEEEe
Q psy12446 16 VTLYH-GV--N---LENHLHLAYHELKKHQVK----NVVRV 46 (77)
Q Consensus 16 ~~fli-~~--p---t~~tl~~yi~elk~~~Vt----~VVRv 46 (77)
|||.| -+ + ++.-++..++.|+++|+. ++++|
T Consensus 1 ~ri~IV~s~~n~~i~~~L~~ga~~~l~~~g~~~~~i~v~~V 41 (138)
T TIGR00114 1 VRVGIVIARFNRDITDMLLKGAIDALKRLGAEVDNIDVIWV 41 (138)
T ss_pred CEEEEEEecCCHHHHHHHHHHHHHHHHHcCCCccceEEEEC
Confidence 46666 22 2 778889999999999987 45554
No 111
>PF05724 TPMT: Thiopurine S-methyltransferase (TPMT); InterPro: IPR008854 This family consists of thiopurine S-methyltransferase proteins from both eukaryotes and prokaryotes. Thiopurine S-methyltransferase (TPMT) is a cytosolic enzyme that catalyses S-methylation of aromatic and heterocyclic sulphydryl compounds, including anticancer and immunosuppressive thiopurines [].; GO: 0008119 thiopurine S-methyltransferase activity, 0008152 metabolic process, 0005737 cytoplasm; PDB: 1PJZ_A 2H11_A 2BZG_A 3LCC_A 3BGD_A 2GB4_A 3BGI_B.
Probab=28.93 E-value=67 Score=23.12 Aligned_cols=45 Identities=18% Similarity=0.136 Sum_probs=35.2
Q ss_pred CcccHHHHHHHHHh-CCCeEEEEeCCCCcCchhhhcCCceEEEeec
Q psy12446 24 LENHLHLAYHELKK-HQVKNVVRVCEPTYKVEDLKTEGINVKDLAY 68 (77)
Q Consensus 24 t~~tl~~yi~elk~-~~Vt~VVRvCe~tYd~~~~~~~GI~vhdl~F 68 (77)
.+..|..|++.+.. .+-+.+|=.|...||...|-++|.+|.-+-|
T Consensus 22 ~~p~L~~~~~~l~~~~~~rvLvPgCG~g~D~~~La~~G~~VvGvDl 67 (218)
T PF05724_consen 22 PNPALVEYLDSLALKPGGRVLVPGCGKGYDMLWLAEQGHDVVGVDL 67 (218)
T ss_dssp STHHHHHHHHHHTTSTSEEEEETTTTTSCHHHHHHHTTEEEEEEES
T ss_pred CCHHHHHHHHhcCCCCCCeEEEeCCCChHHHHHHHHCCCeEEEEec
Confidence 45567777777554 3447888899999999999999999986654
No 112
>TIGR00511 ribulose_e2b2 ribose-1,5-bisphosphate isomerase, e2b2 family. The delineation of this family was based originally, in part, on a discussion and neighbor-joining phylogenetic study by Kyrpides and Woese of archaeal and other proteins homologous to the alpha, beta, and delta subunits of eukaryotic initiation factor 2B (eIF-2B), a five-subunit molecule that catalyzes GTP recycling for eIF-2. Recently, Sato, et al. assigned the function ribulose-1,5 bisphosphate isomerase.
Probab=28.70 E-value=1.6e+02 Score=22.37 Aligned_cols=48 Identities=17% Similarity=0.118 Sum_probs=33.9
Q ss_pred eeEEeecCCcccHHHHHHHHHhCCCeEEEEeCC--CCcC----chhhhcCCceEE
Q psy12446 16 VTLYHGVNLENHLHLAYHELKKHQVKNVVRVCE--PTYK----VEDLKTEGINVK 64 (77)
Q Consensus 16 ~~fli~~pt~~tl~~yi~elk~~~Vt~VVRvCe--~tYd----~~~~~~~GI~vh 64 (77)
.+.|+-+++ ++...++....+.|..-=|-++| |.+. ...+.++||.+.
T Consensus 117 ~~ILT~~~S-~tv~~~l~~a~~~~~~f~V~v~EsrP~~~G~~~a~~L~~~gI~vt 170 (301)
T TIGR00511 117 DVVMTHCNS-EAALSVIKTAFEQGKDIEVIATETRPRKQGHITAKELRDYGIPVT 170 (301)
T ss_pred CEEEEECCc-HHHHHHHHHHHHcCCcEEEEEecCCCcchHHHHHHHHHHCCCCEE
Confidence 344444444 78889999988887766666777 4443 377889999987
No 113
>PF11032 ApoM: Apolipoprotein M (ApoM); InterPro: IPR022734 ApoM is a 25 kDa plasma protein associated with high-density lipoproteins (HDLs). ApoM is important in the formation of pre-ss-HDL and also in increasing cholesterol efflux from macrophage foam cells []. Lipoproteins consist of lipids solubilized by apolipoproteins. ApoM lacks an external amphipathic motif and is uniquely secreted to plasma without cleavage of its terminal signal peptide []. ; PDB: 2XKL_A 2WEX_A 2YG2_B 2WEW_A.
Probab=28.65 E-value=1.1e+02 Score=22.21 Aligned_cols=32 Identities=13% Similarity=-0.038 Sum_probs=21.4
Q ss_pred eeeeeeccceeEEeecCCcccHHHHHHHHHhC
Q psy12446 7 ICRTLDYKTVTLYHGVNLENHLHLAYHELKKH 38 (77)
Q Consensus 7 ~~~~~~y~~~~fli~~pt~~tl~~yi~elk~~ 38 (77)
|-.+.+=..-|||+.+.+....+.++||||++
T Consensus 133 il~e~~~~~~rllLysR~~~~~~~~lEeFk~q 164 (186)
T PF11032_consen 133 ILKETGDSYQRLLLYSRSPKLEEEELEEFKAQ 164 (186)
T ss_dssp EEEEEETTEEEEEEEESSSS--HHHHHHHHHH
T ss_pred EEEEcCCCceEEEEEeCCCCCCHHHHHHHHHh
Confidence 33344333447898888888899999999874
No 114
>PF02120 Flg_hook: Flagellar hook-length control protein FliK; InterPro: IPR021136 This entry represents the C-terminal domain of the flagellar hook-length control protein FliK. This entry also includes YscP of the Yersinia type III secretion system, and equivalent proteins in other pathogenic bacterial type III secretion systems. During flagellar morphogenesis in Salmonella typhimurium and Escherichia coli, flagellar hook-length control protein (FliK) controls the length of the hook by directly measuring the hook length [, ]. It is considered unlikely that FliK functions as a molecular ruler for determining hook length, but that it is more likely to be employing a novel mechanism. The deduced amino acid sequences of FliK proteins from S. typhimurium and E. coli have molecular masses of 41,748 and 39,246 Da, respectively, and are fairly hydrophilic []. Sequence comparison reveals around 50% identity, with greatest conservation in the C-terminal region, with 71% identity in the last 154 amino acids - mutagenesis of this conserved region completely abolishes motility. The central and C-terminal regions are rich in proline and glutamine respectively; it is thought that they may constitute distinct domains [].; PDB: 2RRL_A.
Probab=28.22 E-value=36 Score=20.11 Aligned_cols=22 Identities=32% Similarity=0.617 Sum_probs=14.9
Q ss_pred hhhhcCCceEEEeecCCCCCCC
Q psy12446 54 EDLKTEGINVKDLAYEDGTSPS 75 (77)
Q Consensus 54 ~~~~~~GI~vhdl~F~DG~~Pp 75 (77)
+.|.+.||++.++.+..|.+|.
T Consensus 62 ~~L~~~G~~~~~~~v~~~~~~~ 83 (85)
T PF02120_consen 62 ERLQAQGLEVVNLSVSQGSSQQ 83 (85)
T ss_dssp HHHHTTT-EEEEEEEESS----
T ss_pred HHHHHCCCCeEEEEEEECCCCC
Confidence 5788999999999998888764
No 115
>PF00583 Acetyltransf_1: Acetyltransferase (GNAT) family; InterPro: IPR000182 The N-acetyltransferases (NAT) (EC 2.3.1.-) are enzymes that use acetyl coenzyme A (CoA) to transfer an acetyl group to a substrate, a reaction implicated in various functions from bacterial antibiotic resistance to mammalian circadian rhythm and chromatin remodeling. The Gcn5-related N-acetyltransferases (GNAT) catalyze the transfer of the acetyl from the CoA donor to a primary amine of the acceptor. The GNAT proteins share a domain composed of four conserved sequence motifs A-D [, ]. This GNAT domain is named after yeast GCN5 (from General Control Nonrepressed) and related histone acetyltransferases (HATs) like Hat1 and PCAF. HATs acetylate lysine residues of amino terminal histone tails, resulting in transcription activation. Another category of GNAT, the aminoglycoside N-acetyltransferases, confer antibiotic resistance by catalyzing the acetylation of amino groups in aminoglycoside antibiotics []. GNAT proteins can also have anabolic and catabolic functions in both prokaryotes and eukaryotes [, , , , ]. The acetyltransferase/GNAT domain forms a structurally conserved fold of 6 to 7 beta strands (B) and 4 helices (H) in the topology B1-H1-H2-B2-B3-B4-H3-B5-H4-B6, followed by a C-terminal strand which may be from the same monomer or contributed by another [, ]. Motifs D (B2-B3), A (B4-H3) and B (B5-H4) are collectively called the HAT core [, , ], while the N-terminal motif C (B1-H1) is less conserved. Some proteins known to contain a GNAT domain: Yeast GCN5 and Hat1, which are histone acetyltransferases (EC 2.3.1.48). Human PCAF, a histone acetyltransferase. Mammalian serotonin N-acetyltransferase (SNAT) or arylalkylamine NAT (AANAT), which acetylates serotonin into a circadian neurohormone that may participate in light-dark rhythms, and human mood and behavior. Mammalian glucosamine 6-phosphate N-acetyltransferase (GNA1) (EC 2.3.1.4). Escherichia coli rimI and rimJ, which acetylate the N-terminal alanine of ribosomal proteins S18 and S5, respectively (EC 2.3.1.128). Mycobacterium tuberculosis aminoglycoside 2'-N-acetyltransferase (aac), which acetylates the 2' hydroxyl or amino group of a broad spectrum of aminoglycoside antibiotics. Bacillus subtilis bltD and paiA, which acetylate spermine and spermidine. This entry represents the entire GNAT domain.; GO: 0008080 N-acetyltransferase activity, 0008152 metabolic process; PDB: 3T9Y_A 2R7H_B 2OZH_A 1Y9W_B 1VKC_B 2OH1_C 3R9E_B 3R9G_B 3R9F_A 3R96_A ....
Probab=28.20 E-value=81 Score=17.47 Aligned_cols=36 Identities=17% Similarity=0.111 Sum_probs=25.5
Q ss_pred cHHHHHHHHHhCCCeEEEEeCCCC--cCchhhhcCCce
Q psy12446 27 HLHLAYHELKKHQVKNVVRVCEPT--YKVEDLKTEGIN 62 (77)
Q Consensus 27 tl~~yi~elk~~~Vt~VVRvCe~t--Yd~~~~~~~GI~ 62 (77)
-++..++.+++.|++.|.-.|.+. .....+++.|++
T Consensus 46 L~~~~~~~~~~~g~~~i~~~~~~~n~~~~~~~~k~Gf~ 83 (83)
T PF00583_consen 46 LLQAAEEWARKRGIKRIYLDVSPDNPAARRFYEKLGFE 83 (83)
T ss_dssp HHHHHHHHHHHTTESEEEEEEETTGHHHHHHHHHTTEE
T ss_pred hhhhhhhhHHhcCccEEEEEEeCCCHHHHHHHHHcCCC
Confidence 466777888899999998877644 223667777763
No 116
>COG3981 Predicted acetyltransferase [General function prediction only]
Probab=28.16 E-value=1.3e+02 Score=21.87 Aligned_cols=45 Identities=18% Similarity=0.365 Sum_probs=33.5
Q ss_pred HHHHHHHHHhCCCeEEEEeCCCC--cCchhhhcCCceEEEeecCCCC
Q psy12446 28 LHLAYHELKKHQVKNVVRVCEPT--YKVEDLKTEGINVKDLAYEDGT 72 (77)
Q Consensus 28 l~~yi~elk~~~Vt~VVRvCe~t--Yd~~~~~~~GI~vhdl~F~DG~ 72 (77)
|..=+++-+..|++.|..+|.+. =++...+++|=...+-.|.||.
T Consensus 119 Lkl~L~~ar~lgi~~Vlvtcd~dN~ASrkvI~~NGGile~~~~~~g~ 165 (174)
T COG3981 119 LKLALEKARELGIKKVLVTCDKDNIASRKVIEANGGILENEFFGEGK 165 (174)
T ss_pred HHHHHHHHHHcCCCeEEEEeCCCCchhhHHHHhcCCEEeEEEccCCc
Confidence 55567888999999999999854 3456677777657667777654
No 117
>PF14871 GHL6: Hypothetical glycosyl hydrolase 6
Probab=27.79 E-value=1e+02 Score=20.69 Aligned_cols=45 Identities=7% Similarity=0.027 Sum_probs=35.0
Q ss_pred CcccHHHHHHHHHhCCCeEEEEeCCCCcCchhhhcCCceEEEeec--CCCCC
Q psy12446 24 LENHLHLAYHELKKHQVKNVVRVCEPTYKVEDLKTEGINVKDLAY--EDGTS 73 (77)
Q Consensus 24 t~~tl~~yi~elk~~~Vt~VVRvCe~tYd~~~~~~~GI~vhdl~F--~DG~~ 73 (77)
...-|...++..++.|+.-++|++=. +|....++ |=||.. +||.+
T Consensus 42 ~~Dllge~v~a~h~~Girv~ay~~~~-~d~~~~~~----HPeW~~~~~~G~~ 88 (132)
T PF14871_consen 42 KRDLLGEQVEACHERGIRVPAYFDFS-WDEDAAER----HPEWFVRDADGRP 88 (132)
T ss_pred CcCHHHHHHHHHHHCCCEEEEEEeee-cChHHHHh----CCceeeECCCCCC
Confidence 45667899999999999999999977 88877753 456765 55653
No 118
>TIGR01091 upp uracil phosphoribosyltransferase. that includes uracil phosphoribosyltransferase, uridine kinases, and other, uncharacterized proteins.
Probab=27.79 E-value=1.1e+02 Score=21.73 Aligned_cols=35 Identities=17% Similarity=0.131 Sum_probs=26.4
Q ss_pred ccceeEEe-e-c-CCcccHHHHHHHHHhCCCeEEEEeC
Q psy12446 13 YKTVTLYH-G-V-NLENHLHLAYHELKKHQVKNVVRVC 47 (77)
Q Consensus 13 y~~~~fli-~-~-pt~~tl~~yi~elk~~~Vt~VVRvC 47 (77)
.++-.-|| | + -|.+|+..-++.|+++|+..|.-+|
T Consensus 120 i~~~~VllvDd~laTG~Tl~~ai~~L~~~G~~~I~v~~ 157 (207)
T TIGR01091 120 IDERTVIVLDPMLATGGTMIAALDLLKKRGAKKIKVLS 157 (207)
T ss_pred CCCCEEEEECCCccchHHHHHHHHHHHHcCCCEEEEEE
Confidence 34445555 4 3 5999999999999999999855444
No 119
>PRK14462 ribosomal RNA large subunit methyltransferase N; Provisional
Probab=27.58 E-value=56 Score=25.65 Aligned_cols=24 Identities=8% Similarity=-0.054 Sum_probs=22.1
Q ss_pred cCCcccHHHHHHHHHhCCCeEEEE
Q psy12446 22 VNLENHLHLAYHELKKHQVKNVVR 45 (77)
Q Consensus 22 ~pt~~tl~~yi~elk~~~Vt~VVR 45 (77)
.|+...++.|.+.|+++|+...||
T Consensus 310 ~ps~e~i~~f~~~l~~~gi~vtvR 333 (356)
T PRK14462 310 RPSLEDMIKFQDYLNSKGLLCTIR 333 (356)
T ss_pred CCCHHHHHHHHHHHHHCCCcEEEe
Confidence 368899999999999999999998
No 120
>cd03785 GT1_MurG MurG is an N-acetylglucosaminyltransferase, the last enzyme involved in the intracellular phase of peptidoglycan biosynthesis. It transfers N-acetyl-D-glucosamine (GlcNAc) from UDP-GlcNAc to the C4 hydroxyl of a lipid-linked N-acetylmuramoyl pentapeptide (NAM). The resulting disaccharide is then transported across the cell membrane, where it is polymerized into NAG-NAM cell-wall repeat structure. MurG belongs to the GT-B structural superfamily of glycoslytransferases, which have characteristic N- and C-terminal domains, each containing a typical Rossmann fold. The two domains have high structural homology despite minimal sequence homology. The large cleft that separates the two domains includes the catalytic center and permits a high degree of flexibility.
Probab=27.45 E-value=1.8e+02 Score=20.63 Aligned_cols=40 Identities=10% Similarity=0.114 Sum_probs=28.0
Q ss_pred HHHHHHHhCCCeEEEEeCCCCcCchhhhcCCceEEEeecC
Q psy12446 30 LAYHELKKHQVKNVVRVCEPTYKVEDLKTEGINVKDLAYE 69 (77)
Q Consensus 30 ~yi~elk~~~Vt~VVRvCe~tYd~~~~~~~GI~vhdl~F~ 69 (77)
...+.|.+.|..-.|-.+........++..|++++.+++.
T Consensus 18 ~la~~l~~~G~ev~v~~~~~~~~~~~~~~~~~~~~~~~~~ 57 (350)
T cd03785 18 ALAEELRERGAEVLFLGTKRGLEARLVPKAGIPLHTIPVG 57 (350)
T ss_pred HHHHHHHhCCCEEEEEECCCcchhhcccccCCceEEEEec
Confidence 6678889888766555555444445555679999999874
No 121
>PF01705 CX: CX module; InterPro: IPR002619 This domain has no known function. It is found in several Caenorhabditis elegans proteins. The domain contains 6 conserved cysteines that probably form three disulphide bridges.
Probab=27.42 E-value=28 Score=20.62 Aligned_cols=13 Identities=23% Similarity=0.736 Sum_probs=10.3
Q ss_pred EEEeecCCCCCCC
Q psy12446 63 VKDLAYEDGTSPS 75 (77)
Q Consensus 63 vhdl~F~DG~~Pp 75 (77)
.....|+||+.|.
T Consensus 28 l~~v~F~nGt~p~ 40 (61)
T PF01705_consen 28 LGNVTFPNGTRPK 40 (61)
T ss_pred ccceEccCCCcce
Confidence 4467899999985
No 122
>COG3816 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=27.39 E-value=16 Score=27.12 Aligned_cols=47 Identities=28% Similarity=0.357 Sum_probs=32.7
Q ss_pred cceeEEeecC--CcccHHHHHHHHHhCCCeEEEEeCC--CCcCchhhhcCCceEEEeecC
Q psy12446 14 KTVTLYHGVN--LENHLHLAYHELKKHQVKNVVRVCE--PTYKVEDLKTEGINVKDLAYE 69 (77)
Q Consensus 14 ~~~~fli~~p--t~~tl~~yi~elk~~~Vt~VVRvCe--~tYd~~~~~~~GI~vhdl~F~ 69 (77)
.+.-|+..-| ...-...|- .|+|--+ .+|=...+|+-||.|.|-||-
T Consensus 52 DGtWFylGTPIGR~plVrLFS---------tvlRkD~Dg~tyLVTPVEKvgIrV~DAPFi 102 (205)
T COG3816 52 DGTWFYLGTPIGRKPLVRLFS---------TVLRKDEDGKTYLVTPVEKVGIRVEDAPFI 102 (205)
T ss_pred CceEEEecCccchHHHHHHHH---------HHheECCCCCEEEEeehHhcceEeccCceE
Confidence 4556777777 333334443 3556554 688889999999999999994
No 123
>TIGR00336 pyrE orotate phosphoribosyltransferase. The conserved Lys (K) residue at position 101 of the seed alignment has been proposed as the active site for the enzyme.
Probab=27.36 E-value=1.7e+02 Score=19.98 Aligned_cols=52 Identities=10% Similarity=0.065 Sum_probs=36.6
Q ss_pred ceeEEe--e-cCCcccHHHHHHHHHhCCCeEEEEeC--CCCc--CchhhhcC-CceEEEe
Q psy12446 15 TVTLYH--G-VNLENHLHLAYHELKKHQVKNVVRVC--EPTY--KVEDLKTE-GINVKDL 66 (77)
Q Consensus 15 ~~~fli--~-~pt~~tl~~yi~elk~~~Vt~VVRvC--e~tY--d~~~~~~~-GI~vhdl 66 (77)
+=+-|| | ..|..|+..-++.+++.|..-+--+| +... ..+.++++ ||.++-+
T Consensus 108 g~~VlIVDDvi~TG~Tl~~a~~~l~~~Ga~v~~~~vlvdr~~~~~~~~l~~~~gv~~~sl 167 (173)
T TIGR00336 108 GDKVVVVEDVITTGTSILEAVEIIQAAGGQVAGVIIAVDRQERSAGQEFEKEYGLPVISL 167 (173)
T ss_pred CCEEEEEeccccChHHHHHHHHHHHHcCCeEEEEEEEEecCchhHHHHHHHhcCCeEEEE
Confidence 345566 4 47999999999999999976655555 4444 24666655 9988754
No 124
>PF02579 Nitro_FeMo-Co: Dinitrogenase iron-molybdenum cofactor; InterPro: IPR003731 This entry represents several Nif (B, X and Y) proteins, which are involved in the biosynthesis of the iron-molybdenum cofactor (FeMo-co) found in the dinitrogenase enzyme of the nitrogenase complex in nitrogen-fixing bacteria. The nitrogenase complex catalyses the reduction of atmospheric dinitrogen to ammonia, and is composed of an iron metalloprotein (dinitrogenase reductase; homodimer of NifH; IPR000392 from INTERPRO) and a Fe-Mo metalloprotein (dinitrogenase; heterotetramer of NifD and NifK; IPR000318 from INTERPRO). The pathway for the synthesis of the Fe-Mo cofactor involves several proteins, including NifB, NifE, NifH, NifN, NifQ, NifV and NifX. NifB appears to be an iron-sulphur source for FeMo-co biosynthesis, while NifX may be associated with the mature FeMo-co, in particular with the addition of homocitrate during the last step of biosynthesis []. The NifX protein shows sequence similarity with the C terminus of NifB [], as well as to the conserved protein MTH1175 from the archaeon Methanobacterium thermoautotrophicum, which displays a ribonuclease H-like motif of three layers, alpha/beta/alpha, with a single mixed beta-sheet [].; PDB: 2QTD_A 2KLA_A 1EO1_A 1P90_A 1RDU_A 2YX6_D 1O13_A 1T3V_A 2RE2_B 2WFB_A.
Probab=27.08 E-value=67 Score=18.92 Aligned_cols=39 Identities=18% Similarity=-0.009 Sum_probs=24.3
Q ss_pred HHHHHHHHHhCCCeEEEEeCCCCcCchhhhcCCceEEEe
Q psy12446 28 LHLAYHELKKHQVKNVVRVCEPTYKVEDLKTEGINVKDL 66 (77)
Q Consensus 28 l~~yi~elk~~~Vt~VVRvCe~tYd~~~~~~~GI~vhdl 66 (77)
-....+.|..+||..|+---=-.=-...|+++||+|...
T Consensus 42 ~~~~~~~l~~~~v~~li~~~iG~~~~~~L~~~gI~v~~~ 80 (94)
T PF02579_consen 42 GDKIAKFLAEEGVDVLICGGIGEGAFRALKEAGIKVYQG 80 (94)
T ss_dssp STHHHHHHHHTTESEEEESCSCHHHHHHHHHTTSEEEES
T ss_pred chhHHHHHHHcCCCEEEEeCCCHHHHHHHHHCCCEEEEc
Confidence 334445556699998883221111237889999999754
No 125
>PF13727 CoA_binding_3: CoA-binding domain; PDB: 3NKL_B.
Probab=27.02 E-value=59 Score=20.73 Aligned_cols=41 Identities=17% Similarity=0.280 Sum_probs=23.9
Q ss_pred ccHHHHHHHHHhCCCeEEEEeCCCCcCc------hhhhcCCceEEEe
Q psy12446 26 NHLHLAYHELKKHQVKNVVRVCEPTYKV------EDLKTEGINVKDL 66 (77)
Q Consensus 26 ~tl~~yi~elk~~~Vt~VVRvCe~tYd~------~~~~~~GI~vhdl 66 (77)
.++++..+.+++++|..|+-.....=.. +.+++.|++|+=.
T Consensus 128 g~~~~l~~~~~~~~id~v~ial~~~~~~~i~~ii~~~~~~~v~v~~v 174 (175)
T PF13727_consen 128 GDLDDLPELVREHDIDEVIIALPWSEEEQIKRIIEELENHGVRVRVV 174 (175)
T ss_dssp --GGGHHHHHHHHT--EEEE--TTS-HHHHHHHHHHHHTTT-EEEE-
T ss_pred cCHHHHHHHHHhCCCCEEEEEcCccCHHHHHHHHHHHHhCCCEEEEe
Confidence 3678888899999999988665432222 6778888888643
No 126
>cd04618 CBS_pair_5 The CBS domain, named after human CBS, is a small domain originally identified in cystathionine beta-synthase and is subsequently found in a wide range of different proteins. CBS domains usually occur in tandem repeats. They associate to form a so-called Bateman domain or a CBS pair based on crystallographic studies in bacteria. The CBS pair was used as a basis for this cd hierarchy since the human CBS proteins can adopt the typical core structure and form an intramolecular CBS pair. The interface between the two CBS domains forms a cleft that is a potential ligand binding site. The CBS pair coexists with a variety of other functional domains and this has been used to help in its classification here. It has been proposed that the CBS domain may play a regulatory role, although its exact function is unknown. Mutations of conserved residues within this domain are associated with a variety of human hereditary diseases, including congenital myotonia, idiopathic genera
Probab=26.92 E-value=1.5e+02 Score=17.67 Aligned_cols=28 Identities=7% Similarity=0.032 Sum_probs=23.4
Q ss_pred eecCCcccHHHHHHHHHhCCCeEEEEeC
Q psy12446 20 HGVNLENHLHLAYHELKKHQVKNVVRVC 47 (77)
Q Consensus 20 i~~pt~~tl~~yi~elk~~~Vt~VVRvC 47 (77)
++.+.+.++..-++.|.++++..++.+-
T Consensus 4 ~~v~~~~~l~~a~~~~~~~~~~~~~Vvd 31 (98)
T cd04618 4 VVFDTKLPVKKAFNALVENGIRSAPLWD 31 (98)
T ss_pred EEECCCCcHHHHHHHHHHcCCceEEEEe
Confidence 4567789999999999999998888664
No 127
>PRK14467 ribosomal RNA large subunit methyltransferase N; Provisional
Probab=26.78 E-value=62 Score=25.21 Aligned_cols=24 Identities=13% Similarity=-0.032 Sum_probs=21.8
Q ss_pred cCCcccHHHHHHHHHhCCCeEEEE
Q psy12446 22 VNLENHLHLAYHELKKHQVKNVVR 45 (77)
Q Consensus 22 ~pt~~tl~~yi~elk~~~Vt~VVR 45 (77)
.|+...++.|.+.|+++|+...||
T Consensus 301 ~ps~e~i~~f~~~L~~~gi~v~vR 324 (348)
T PRK14467 301 RPELERVYKFQKILWDNGISTFVR 324 (348)
T ss_pred CCCHHHHHHHHHHHHHCCCcEEEe
Confidence 358889999999999999999999
No 128
>TIGR02742 TrbC_Ftype type-F conjugative transfer system pilin assembly protein TrbC. This protein is an essential component of the F-type conjugative pilus assembly system for the transfer of plasmid DNA. The N-terminal portion of these proteins are heterogeneous and are not covered by this model.
Probab=26.59 E-value=1.2e+02 Score=20.58 Aligned_cols=22 Identities=18% Similarity=0.105 Sum_probs=20.2
Q ss_pred CcccHHHHHHHHHhCCCeEEEE
Q psy12446 24 LENHLHLAYHELKKHQVKNVVR 45 (77)
Q Consensus 24 t~~tl~~yi~elk~~~Vt~VVR 45 (77)
..++|.+++.+..+.|+..|+|
T Consensus 10 P~~~Lk~l~~~a~~~g~~~VlR 31 (130)
T TIGR02742 10 PEPLLKQLLDQAEALGAPLVIR 31 (130)
T ss_pred CHHHHHHHHHHHHHhCCeEEEe
Confidence 5788999999999999999998
No 129
>PF12996 DUF3880: DUF based on E. rectale Gene description (DUF3880); InterPro: IPR024542 This entry represents proteins of unknown function. The Eubacterium rectale gene appears to be upregulated in the presence of Bacteroides thetaiotaomicron compared to growth in pure culture [].
Probab=26.58 E-value=56 Score=19.77 Aligned_cols=16 Identities=19% Similarity=0.175 Sum_probs=13.2
Q ss_pred HHHHHHHHhCCCeEEE
Q psy12446 29 HLAYHELKKHQVKNVV 44 (77)
Q Consensus 29 ~~yi~elk~~~Vt~VV 44 (77)
..+++++|+.|+..|.
T Consensus 27 ~~~~~~~~~~G~~~V~ 42 (79)
T PF12996_consen 27 RSFVEEYRNLGAENVF 42 (79)
T ss_pred HHHHHHHHHcCCCCEE
Confidence 3578999999988876
No 130
>cd03409 Chelatase_Class_II Class II Chelatase: a family of ATP-independent monomeric or homodimeric enzymes that catalyze the insertion of metal into protoporphyrin rings. This family includes protoporphyrin IX ferrochelatase (HemH), sirohydrochlorin ferrochelatase (SirB) and the cobaltochelatases, CbiK and CbiX. HemH and SirB are involved in heme and siroheme biosynthesis, respectively, while the cobaltochelatases are associated with cobalamin biosynthesis. Excluded from this family are the ATP-dependent heterotrimeric chelatases (class I) and the multifunctional homodimeric enzymes with dehydrogenase and chelatase activities (class III).
Probab=26.46 E-value=1.1e+02 Score=18.16 Aligned_cols=25 Identities=28% Similarity=0.158 Sum_probs=21.0
Q ss_pred CcccHHHHHHHHHhCCCeEEEEeCC
Q psy12446 24 LENHLHLAYHELKKHQVKNVVRVCE 48 (77)
Q Consensus 24 t~~tl~~yi~elk~~~Vt~VVRvCe 48 (77)
...++++-+++|++.|++.|+-+.-
T Consensus 43 ~~P~i~~~l~~l~~~g~~~vvvvPl 67 (101)
T cd03409 43 LGPDTEEAIRELAEEGYQRVVIVPL 67 (101)
T ss_pred CCCCHHHHHHHHHHcCCCeEEEEeC
Confidence 3667889999999999999987763
No 131
>PF13086 AAA_11: AAA domain; PDB: 2XZL_A 2XZO_A 2WJY_A 2WJV_A 2XZP_A 2GK6_A 2GK7_A 2GJK_A.
Probab=26.38 E-value=81 Score=20.75 Aligned_cols=37 Identities=11% Similarity=0.050 Sum_probs=24.1
Q ss_pred ccceeEEeecCCcccHHHHHHHHHh-------CCCeEEEEeCCC
Q psy12446 13 YKTVTLYHGVNLENHLHLAYHELKK-------HQVKNVVRVCEP 49 (77)
Q Consensus 13 y~~~~fli~~pt~~tl~~yi~elk~-------~~Vt~VVRvCe~ 49 (77)
-..-+.|+.+|+|..+.+.++.|.+ ..-..++|+-.+
T Consensus 51 ~~~~~il~~~~sN~avd~~~~~l~~~~~~~~~~~~~~~ir~~~~ 94 (236)
T PF13086_consen 51 DRGKKILVVSPSNAAVDNILERLKKLLDEDGKVYKPKIIRLGSE 94 (236)
T ss_dssp CCSS-EEEEESSHHHHHHHHHHHHC--------TT--EEE---G
T ss_pred hccccceeecCCchhHHHHHHHHHhhccccccccccchhhhccc
Confidence 4456778899999999999999998 333456666543
No 132
>cd04908 ACT_Bt0572_1 N-terminal ACT domain of a novel protein composed almost entirely of two tandem ACT domains. Included in this CD is the N-terminal ACT domain of a novel protein composed almost entirely of two tandem ACT domains as seen in the uncharacterized structure (pdb 2F06) of the Bt0572 protein from Bacteroides thetaiotaomicron and related ACT domains. These tandem ACT domain proteins belong to the superfamily of ACT regulatory domains.
Probab=26.33 E-value=1.3e+02 Score=16.90 Aligned_cols=25 Identities=12% Similarity=0.020 Sum_probs=18.5
Q ss_pred eEEeecC-CcccHHHHHHHHHhCCCe
Q psy12446 17 TLYHGVN-LENHLHLAYHELKKHQVK 41 (77)
Q Consensus 17 ~fli~~p-t~~tl~~yi~elk~~~Vt 41 (77)
|+.|..| ....|.+-.+.|.++|+.
T Consensus 3 ri~v~v~d~pG~La~v~~~l~~~~in 28 (66)
T cd04908 3 QLSVFLENKPGRLAAVTEILSEAGIN 28 (66)
T ss_pred EEEEEEcCCCChHHHHHHHHHHCCCC
Confidence 4555554 677888888999888874
No 133
>TIGR01718 Uridine-psphlse uridine phosphorylase. Sequences from Clostridium, Streptomyces, Treponema, Halobacterium and Pyrobaculum were included above trusted on the basis of sequence homology and a PAM-based neighbor-joining tree. A clade including second sequences from Halobacterium and Vibrio was somewhat more distantly related and may represent a slightly different substrate specificity - these were placed below the noise cutoff. More distantly related is a clade of archaeal sequences which as related to the DeoD family of inosine phosphorylases (TIGR00107) as they are to these uridine phosphorylases. This clade includes a characterized protein from Sulfolobus solfataricus which has been mis-named as a methylthioadenosine phosphorylase, but which acts on inosine and guanosine - it is unclear whether uridine has been evaluated as a substrate.
Probab=26.07 E-value=2.1e+02 Score=20.72 Aligned_cols=44 Identities=14% Similarity=0.119 Sum_probs=31.5
Q ss_pred eeeeeeeeeccceeEEee-cC-CcccHHHHHHHHHhCCCeEEEEeC
Q psy12446 4 RALICRTLDYKTVTLYHG-VN-LENHLHLAYHELKKHQVKNVVRVC 47 (77)
Q Consensus 4 ~~~~~~~~~y~~~~fli~-~p-t~~tl~~yi~elk~~~Vt~VVRvC 47 (77)
|....-+-.|++.++.+- +. -..+..-.++||.+.|+..|+|+=
T Consensus 41 r~~~~~~G~~~g~~v~v~~~GiG~~~aai~~~eLi~~g~~~iIr~G 86 (245)
T TIGR01718 41 REFVTYRGELDGKPVIVCSTGIGGPSTAIAVEELAQLGARTFIRVG 86 (245)
T ss_pred CCEEEEEEEECCEEEEEEcCCCCHHHHHHHHHHHHHhCCCEEEEee
Confidence 334444567899998884 33 444555678899999999999974
No 134
>COG0095 LplA Lipoate-protein ligase A [Coenzyme metabolism]
Probab=26.05 E-value=55 Score=23.89 Aligned_cols=45 Identities=18% Similarity=0.142 Sum_probs=29.2
Q ss_pred cceeeeeeeeeccc-eeEEeecC--------CcccHHHHHHHHHhCCCeEEEEe
Q psy12446 2 AYRALICRTLDYKT-VTLYHGVN--------LENHLHLAYHELKKHQVKNVVRV 46 (77)
Q Consensus 2 ~~~~~~~~~~~y~~-~~fli~~p--------t~~tl~~yi~elk~~~Vt~VVRv 46 (77)
|.-..+|+.+..+. ++|+.-+| .+.-.+-.+++++++||.-|.|-
T Consensus 19 A~de~ll~~~~~~~~i~~~~~~~~~v~lG~~q~~~~Ev~~~~~~~~~i~vvRR~ 72 (248)
T COG0095 19 ALDEALLRSLSEGGTLRLYWWNPPTVVLGRFQNTLPEVNLEYVKEDGIPVVRRP 72 (248)
T ss_pred HHHHHHHHhCCCCCeEEEEEECCCeEEECCccchHhHhhHHHHHHcCCcEEEEc
Confidence 44445566666444 66665332 34445789999999997777774
No 135
>PF06676 DUF1178: Protein of unknown function (DUF1178); InterPro: IPR009562 This family consists of several hypothetical bacterial proteins of around 150 residues in length. The function of this family is unknown.
Probab=25.92 E-value=33 Score=24.14 Aligned_cols=17 Identities=35% Similarity=0.561 Sum_probs=14.5
Q ss_pred hhhhcCCceEEEeecCC
Q psy12446 54 EDLKTEGINVKDLAYED 70 (77)
Q Consensus 54 ~~~~~~GI~vhdl~F~D 70 (77)
+.|.++||.|.-+|++.
T Consensus 128 ~~L~eEGI~v~pLP~~~ 144 (148)
T PF06676_consen 128 KELIEEGIEVLPLPWPP 144 (148)
T ss_pred HHHHHcCCeEeecCCCC
Confidence 78999999999888653
No 136
>TIGR01694 MTAP 5'-deoxy-5'-methylthioadenosine phosphorylase. In between the trusted and noise cutoffs are: 1) several archaeal sequences which appear to contain several residues characteristic of phosphorylases which act on guanosine or inosine (according to the crystal structure of MTAP and alignments). In any case, these residues are not conserved. 2) sequences from Mycobacterium tuberculosis and Streptomyces coelicolor which have better, although not perfect retention of the active site residues, but considering the general observation that bacteria utilize the MTA/SAH nucleotidase enzyme and a kinase to do this reaction, these have been excluded pending stronger evidence of their function, and 3) a sequence from Drosophila which appears to be a recent divergence (long branch in neighbor-joining trees) and lacks some of the conserved active site residues.
Probab=25.85 E-value=62 Score=23.32 Aligned_cols=23 Identities=26% Similarity=0.206 Sum_probs=18.9
Q ss_pred ccHHHHHHHHHhCCCeEEEEeCC
Q psy12446 26 NHLHLAYHELKKHQVKNVVRVCE 48 (77)
Q Consensus 26 ~tl~~yi~elk~~~Vt~VVRvCe 48 (77)
-|...-+++|+++||+.|+|+..
T Consensus 62 vn~~a~~~~L~~~Gv~~iI~~Gs 84 (241)
T TIGR01694 62 VNYRANIWALKSLGVKYVISVNA 84 (241)
T ss_pred CCcHHHHHHHHHcCCCEEEEecc
Confidence 44444689999999999999985
No 137
>PF06557 DUF1122: Protein of unknown function (DUF1122); InterPro: IPR008304 There are currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.; PDB: 2ARH_C.
Probab=25.72 E-value=48 Score=24.13 Aligned_cols=19 Identities=21% Similarity=0.272 Sum_probs=11.8
Q ss_pred hhhhcCCceEE-EeecCCCC
Q psy12446 54 EDLKTEGINVK-DLAYEDGT 72 (77)
Q Consensus 54 ~~~~~~GI~vh-dl~F~DG~ 72 (77)
-.+.++|+... ||+||.|.
T Consensus 109 ~~Ll~~GFtwfKdWYfPEG~ 128 (170)
T PF06557_consen 109 FSLLKAGFTWFKDWYFPEGG 128 (170)
T ss_dssp HHHHTTT--EEEEEE--TTT
T ss_pred HHHHhCCcEEEeeeeccCcc
Confidence 45788999865 99999985
No 138
>smart00481 POLIIIAc DNA polymerase alpha chain like domain. DNA polymerase alpha chain like domain, incl. family of hypothetical proteins
Probab=25.54 E-value=1.4e+02 Score=16.87 Aligned_cols=41 Identities=12% Similarity=-0.061 Sum_probs=27.6
Q ss_pred CcccHHHHHHHHHhCCCeEEEEeCCCCcCc-----hhhhcCCceEE
Q psy12446 24 LENHLHLAYHELKKHQVKNVVRVCEPTYKV-----EDLKTEGINVK 64 (77)
Q Consensus 24 t~~tl~~yi~elk~~~Vt~VVRvCe~tYd~-----~~~~~~GI~vh 64 (77)
...+++++++..++.|...|.=.-..+... +..++.||++.
T Consensus 13 ~~~~~~~~~~~a~~~g~~~v~iTDh~~~~~~~~~~~~~~~~gi~~i 58 (67)
T smart00481 13 GALSPEELVKRAKELGLKAIAITDHGNLFGAVEFYKAAKKAGIKPI 58 (67)
T ss_pred ccCCHHHHHHHHHHcCCCEEEEeeCCcccCHHHHHHHHHHcCCeEE
Confidence 456789999999999998887544332221 44456777764
No 139
>COG0803 LraI ABC-type metal ion transport system, periplasmic component/surface adhesin [Inorganic ion transport and metabolism]
Probab=25.47 E-value=1.2e+02 Score=22.72 Aligned_cols=47 Identities=21% Similarity=0.261 Sum_probs=36.3
Q ss_pred CCcccHHHHHHHHHhCCCeEEEEeCCCCcCc----hhhhcCCceEEEeecCCC
Q psy12446 23 NLENHLHLAYHELKKHQVKNVVRVCEPTYKV----EDLKTEGINVKDLAYEDG 71 (77)
Q Consensus 23 pt~~tl~~yi~elk~~~Vt~VVRvCe~tYd~----~~~~~~GI~vhdl~F~DG 71 (77)
||.+.+.+-++.+|+++|..|+. +.+.+. ..-++.|+.+.-+.+-|+
T Consensus 225 ~s~~~l~~l~~~ik~~~i~~If~--e~~~~~~~~~~la~e~g~~~~~~~~~d~ 275 (303)
T COG0803 225 PSPKDLAKLVDLIKKKNIKAIFV--ESNVSSKSAETLAKETGVKILGLLYLDS 275 (303)
T ss_pred CCHHHHHHHHHHHHHcCCCEEEE--eCCCChHHHHHHHHHhCCcccceeeecc
Confidence 68899999999999999999997 555544 566677877665555554
No 140
>PRK06552 keto-hydroxyglutarate-aldolase/keto-deoxy-phosphogluconate aldolase; Provisional
Probab=25.46 E-value=78 Score=22.90 Aligned_cols=39 Identities=23% Similarity=0.220 Sum_probs=32.1
Q ss_pred HHHHHHhCCCeEEEEeCCCCc---CchhhhcCCceEEEeecC
Q psy12446 31 AYHELKKHQVKNVVRVCEPTY---KVEDLKTEGINVKDLAYE 69 (77)
Q Consensus 31 yi~elk~~~Vt~VVRvCe~tY---d~~~~~~~GI~vhdl~F~ 69 (77)
.++.|.+++|-.|+|...+.. =.+.+.+.||.+.|..|.
T Consensus 6 ~~~~l~~~~vi~vir~~~~~~a~~~~~al~~~Gi~~iEit~~ 47 (213)
T PRK06552 6 ILTKLKANGVVAVVRGESKEEALKISLAVIKGGIKAIEVTYT 47 (213)
T ss_pred HHHHHHHCCEEEEEECCCHHHHHHHHHHHHHCCCCEEEEECC
Confidence 568999999999999985332 228889999999999886
No 141
>PF08279 HTH_11: HTH domain; InterPro: IPR013196 Winged helix DNA-binding proteins share a related winged helix-turn-helix DNA-binding motif, where the "wings", or loops, are small beta-sheets. The winged helix motif consists of two wings (W1, W2), three alpha helices (H1, H2, H3) and three beta-sheets (S1, S2, S3) arranged in the order H1-S1-H2-H3-S2-W1-S3-W2 []. The DNA-recognition helix makes sequence-specific DNA contacts with the major groove of DNA, while the wings make different DNA contacts, often with the minor groove or the backbone of DNA. Several winged-helix proteins display an exposed patch of hydrophobic residues thought to mediate protein-protein interactions. This entry represents a subset of the winged helix domain superfamily which is predominantly found in bacterial proteins, though there are also some archaeal and eukaryotic examples. This domain is commonly found in the biotin (vitamin H) repressor protein BirA which regulates transcription of the biotin operon []. It is also found in other proteins including regulators of amino acid biosynthsis such as LysM [], and regulators of carbohydrate metabolisms such as LicR and FrvR [, ].; PDB: 1HXD_B 2EWN_B 1BIA_A 1BIB_A 1J5Y_A 3V7S_A 3V7C_A 3RKW_A 3RIR_A 3RKX_A ....
Probab=25.38 E-value=47 Score=18.11 Aligned_cols=16 Identities=13% Similarity=0.185 Sum_probs=14.3
Q ss_pred CcccHHHHHHHHHhCC
Q psy12446 24 LENHLHLAYHELKKHQ 39 (77)
Q Consensus 24 t~~tl~~yi~elk~~~ 39 (77)
|.+|+.++|++|++.|
T Consensus 28 S~rTi~~~i~~L~~~~ 43 (55)
T PF08279_consen 28 SRRTIRRDIKELREWG 43 (55)
T ss_dssp -HHHHHHHHHHHHHTT
T ss_pred CHHHHHHHHHHHHHCC
Confidence 7789999999999999
No 142
>cd02169 Citrate_lyase_ligase Citrate lyase ligase. Citrate lyase ligase, also known as [Citrate (pro-3S)-lyase] ligase, is responsible for acetylation of the (2-(5''-phosphoribosyl)-3'-dephosphocoenzyme-A) prosthetic group of the gamma subunit of citrate lyase, converting the inactive thiol form of this enzyme to the active form. The acetylation of 1 molecule of deacetyl-citrate lyase to enzymatically active citrate lyase requires 6 molecules of ATP. The Adenylylyltranferase activity of the enzyme involves the formation of AMP and and pyrophosphate in the acetylation reaction.
Probab=25.07 E-value=1.6e+02 Score=22.46 Aligned_cols=38 Identities=5% Similarity=0.056 Sum_probs=27.9
Q ss_pred cHHHHHHHHHhCCCeEEEEeCCCCcCchhhhcCCceEEE
Q psy12446 27 HLHLAYHELKKHQVKNVVRVCEPTYKVEDLKTEGINVKD 65 (77)
Q Consensus 27 tl~~yi~elk~~~Vt~VVRvCe~tYd~~~~~~~GI~vhd 65 (77)
-++..++++++.|++.+.-.|... ....+++.|++...
T Consensus 46 Ll~~l~~~a~~~g~~~i~L~t~~~-~~~fYek~GF~~~~ 83 (297)
T cd02169 46 IVSELINKAYEEGIFHLFLFTKPK-NAKFFRGLGFKELA 83 (297)
T ss_pred HHHHHHHHHHHCCCCEEEEEEccc-HHHHHHHCCCEEec
Confidence 466777889999999877666432 35788889987665
No 143
>PF00885 DMRL_synthase: 6,7-dimethyl-8-ribityllumazine synthase; InterPro: IPR002180 6,7-dimethyl-8-ribityllumazine synthase (riboflavin synthase) catalyses the biosynthesis of riboflavin according to the reaction: 2 6,7-dimethyl-8-(1-D-ribityl)lumazine = riboflavin + 4-(1-D-ribitylamino)-5-amino-2,6-dihydroxypyrimidine. The biosynthesis of one riboflavin molecule requires one molecule of GTP and two molecules of ribulose 5-phosphate as substrates. The final step in the biosynthesis of the vitamin involves the dismutation of 6,7-dimethyl-8-ribityllumazine catalyzed by riboflavin synthase. The second product, 5-amino-6-ribitylamino-2,4(1H,3H)-pyrimidinedione, is recycled in the biosynthetic pathway by 6,7-dimethyl-8-ribityllumazine synthase []. N-[2,4-dioxo-6-d-ribitylamino-1,2,3,4-tetrahydropyrimidin-5-yl]oxalamic acid derivatives inhibit riboflavin synthase []. This family includes the beta chain of 6,7-dimethyl-8-ribityllumazine synthase 2.5.1.9 from EC. The family also includes a subfamily of distant archaebacterial proteins that may also have the same function for example O28856 from SWISSPROT.; GO: 0009231 riboflavin biosynthetic process, 0009349 riboflavin synthase complex; PDB: 2O6H_D 1C41_C 2OBX_H 1VSX_H 1VSW_3 3JV8_C 3MK3_r 3NQ4_G 2A58_A 2A57_D ....
Probab=24.98 E-value=1.2e+02 Score=20.71 Aligned_cols=35 Identities=31% Similarity=0.311 Sum_probs=27.1
Q ss_pred cceeEEe-ec--C---CcccHHHHHHHHHhCCC----eEEEEeCC
Q psy12446 14 KTVTLYH-GV--N---LENHLHLAYHELKKHQV----KNVVRVCE 48 (77)
Q Consensus 14 ~~~~fli-~~--p---t~~tl~~yi~elk~~~V----t~VVRvCe 48 (77)
+++|+.| .+ + ++.-++..++.|+++|+ -++++|-.
T Consensus 2 ~~~ri~IV~s~~n~~i~~~ll~~a~~~l~~~g~~~~~i~~~~VPG 46 (144)
T PF00885_consen 2 SGLRIAIVVSRFNEEITDRLLEGALEELKRHGVAEENIEVIRVPG 46 (144)
T ss_dssp TTEEEEEEEESTTHHHHHHHHHHHHHHHHHTTTTGGCEEEEEESS
T ss_pred CCCEEEEEEEeccHHHHHHHHHHHHHHHHHcCCCccceEEEEcCC
Confidence 4688888 33 3 67778889999999998 47777765
No 144
>PRK15454 ethanol dehydrogenase EutG; Provisional
Probab=24.93 E-value=2e+02 Score=22.38 Aligned_cols=39 Identities=10% Similarity=0.223 Sum_probs=29.6
Q ss_pred ccHHHHHHHHHhCCCeEEEEeCCCCcC--------chhhhcCCceEE
Q psy12446 26 NHLHLAYHELKKHQVKNVVRVCEPTYK--------VEDLKTEGINVK 64 (77)
Q Consensus 26 ~tl~~yi~elk~~~Vt~VVRvCe~tYd--------~~~~~~~GI~vh 64 (77)
..+++--++++++|.+.++-++.+.-. .+.|+++||++.
T Consensus 35 g~~~~l~~~~~~~g~~~~lvv~~~~~~~~g~~~~v~~~L~~~gi~~~ 81 (395)
T PRK15454 35 GAVSSCGQQAQTRGLKHLFVMADSFLHQAGMTAGLTRSLAVKGIAMT 81 (395)
T ss_pred CHHHHHHHHHHhcCCCEEEEEcCcchhhCccHHHHHHHHHHcCCeEE
Confidence 456666678899999999988875432 267899999875
No 145
>PF08825 E2_bind: E2 binding domain; InterPro: IPR014929 E1 and E2 enzymes play a central role in ubiquitin and ubiquitin-like protein transfer cascades. This is an E2 binding domain that is found on NEDD8 activating E1 enzyme. The protein resembles ubiquitin, and recruits the catalytic core of the E2 enzyme Ubc12 in a similar manner to that in which ubiquitin interacts with ubiquitin binding domains []. ; GO: 0005524 ATP binding, 0016881 acid-amino acid ligase activity, 0045116 protein neddylation; PDB: 3GZN_D 3DBL_F 1R4N_H 1R4M_D 2NVU_B 1TT5_D 3DBR_D 3DBH_H 1YOV_B 3FN1_A ....
Probab=24.74 E-value=60 Score=20.52 Aligned_cols=19 Identities=16% Similarity=-0.029 Sum_probs=16.1
Q ss_pred eecCCcccHHHHHHHHHhC
Q psy12446 20 HGVNLENHLHLAYHELKKH 38 (77)
Q Consensus 20 i~~pt~~tl~~yi~elk~~ 38 (77)
|..+.+.||+++|+.|...
T Consensus 1 i~v~~~~TL~~lid~L~~~ 19 (84)
T PF08825_consen 1 IEVSPSWTLQDLIDSLCEK 19 (84)
T ss_dssp EEESTTSBSHHHHHHHHHS
T ss_pred CCcCccchHHHHHHHHHhC
Confidence 4567889999999999876
No 146
>KOG4549|consensus
Probab=24.73 E-value=38 Score=24.08 Aligned_cols=31 Identities=26% Similarity=0.313 Sum_probs=23.9
Q ss_pred HHHHHHHHHhCCCeEEE--EeCCCCcCchhhhc
Q psy12446 28 LHLAYHELKKHQVKNVV--RVCEPTYKVEDLKT 58 (77)
Q Consensus 28 l~~yi~elk~~~Vt~VV--RvCe~tYd~~~~~~ 58 (77)
...-|.+||+.||+.++ |.|.|.|-.+.|+.
T Consensus 49 i~rIL~dLk~~GVtl~~ASRt~ap~iA~q~L~~ 81 (144)
T KOG4549|consen 49 IRRILVDLKKLGVTLIHASRTMAPQIASQGLET 81 (144)
T ss_pred hhHHHHHHHhcCcEEEEecCCCCHHHHHHHHHH
Confidence 34567899999999998 88888776655544
No 147
>PF11210 DUF2996: Protein of unknown function (DUF2996); InterPro: IPR021374 This family of proteins has no known function.
Probab=24.59 E-value=46 Score=22.90 Aligned_cols=21 Identities=33% Similarity=0.661 Sum_probs=16.5
Q ss_pred hhhhcCCceEEEeecCCCCCC
Q psy12446 54 EDLKTEGINVKDLAYEDGTSP 74 (77)
Q Consensus 54 ~~~~~~GI~vhdl~F~DG~~P 74 (77)
+.++++||.-.++.|.++.-|
T Consensus 21 ~~l~~~Gi~d~~L~f~~~~~p 41 (119)
T PF11210_consen 21 KALEKEGISDVELSFEKNKRP 41 (119)
T ss_pred HHHHHcCCCcceEEeccCCcc
Confidence 567778888888888888665
No 148
>cd04440 DEP_2_P-Rex DEP (Dishevelled, Egl-10, and Pleckstrin) domain 2 found in P-Rex-like proteins. The P-Rex family is the guanine-nucleotide exchange factor (GEF) for the small GTPase Rac that contains an N-terminal RhoGEF domain, two DEP and PDZ domains. Rac-GEF activity is stimulated by phosphatidylinositol (3,4,5)-trisphosphate (PtdIns(3,4,5)P3), a lipid second messenger, and the G beta-gamma subunits of heterotrimeric G proteins. The DEP domains are not involved in mediating these stimuli, but may be of importance for basal and stimulated levels Rac-GEF activity.
Probab=24.45 E-value=45 Score=21.74 Aligned_cols=19 Identities=16% Similarity=0.239 Sum_probs=15.5
Q ss_pred hhhhcCCceEE---EeecCCCC
Q psy12446 54 EDLKTEGINVK---DLAYEDGT 72 (77)
Q Consensus 54 ~~~~~~GI~vh---dl~F~DG~ 72 (77)
..+.+.||-+| +..|.||.
T Consensus 64 q~Ll~~gii~HV~~~h~F~D~~ 85 (93)
T cd04440 64 VGLCNNGFMHHVLEKSEFKDEP 85 (93)
T ss_pred HHHHhCCCEEecCCCcCcCCcC
Confidence 45678899888 89999984
No 149
>PF10775 ATP_sub_h: ATP synthase complex subunit h; InterPro: IPR019711 ATPases (or ATP synthases) are membrane-bound enzyme complexes/ion transporters that combine ATP synthesis and/or hydrolysis with the transport of protons across a membrane. ATPases can harness the energy from a proton gradient, using the flux of ions across the membrane via the ATPase proton channel to drive the synthesis of ATP. Some ATPases work in reverse, using the energy from the hydrolysis of ATP to create a proton gradient. There are different types of ATPases, which can differ in function (ATP synthesis and/or hydrolysis), structure (e.g., F-, V- and A-ATPases, which contain rotary motors) and in the type of ions they transport [, ]. The different types include: F-ATPases (F1F0-ATPases), which are found in mitochondria, chloroplasts and bacterial plasma membranes where they are the prime producers of ATP, using the proton gradient generated by oxidative phosphorylation (mitochondria) or photosynthesis (chloroplasts). V-ATPases (V1V0-ATPases), which are primarily found in eukaryotic vacuoles and catalyse ATP hydrolysis to transport solutes and lower pH in organelles. A-ATPases (A1A0-ATPases), which are found in Archaea and function like F-ATPases (though with respect to their structure and some inhibitor responses, A-ATPases are more closely related to the V-ATPases). P-ATPases (E1E2-ATPases), which are found in bacteria and in eukaryotic plasma membranes and organelles, and function to transport a variety of different ions across membranes. E-ATPases, which are cell-surface enzymes that hydrolyse a range of NTPs, including extracellular ATP. F-ATPases (also known as F1F0-ATPase, or H(+)-transporting two-sector ATPase) (3.6.3.14 from EC) are composed of two linked complexes: the F1 ATPase complex is the catalytic core and is composed of 5 subunits (alpha, beta, gamma, delta, epsilon), while the F0 ATPase complex is the membrane-embedded proton channel that is composed of at least 3 subunits (A-C), nine in mitochondria (A-G, F6, F8). Both the F1 and F0 complexes are rotary motors that are coupled back-to-back. In the F1 complex, the central gamma subunit forms the rotor inside the cylinder made of the alpha(3)beta(3) subunits, while in the F0 complex, the ring-shaped C subunits forms the rotor. The two rotors rotate in opposite directions, but the F0 rotor is usually stronger, using the force from the proton gradient to push the F1 rotor in reverse in order to drive ATP synthesis []. These ATPases can also work in reverse to hydrolyse ATP to create a proton gradient. This entry represents subunit H found in the F0 complex of F-ATPases from fungal mitochondria. Subunit H is homologous to the mammalian factor F6, and is essential for the correct assembly and/or functioning of F-ATPases, since yeast cells lacking it are not able to grow on non-fermentable carbon sources. Subunit H occupies a central place in the peripheral stalk between the F1 sector and the membrane [].
Probab=24.42 E-value=39 Score=20.95 Aligned_cols=20 Identities=20% Similarity=0.122 Sum_probs=15.8
Q ss_pred CcccHHHHHHHHHhCCCeEE
Q psy12446 24 LENHLHLAYHELKKHQVKNV 43 (77)
Q Consensus 24 t~~tl~~yi~elk~~~Vt~V 43 (77)
.|---+.||+|||.+..+.+
T Consensus 10 ~d~VQDLYLkELKayKp~p~ 29 (67)
T PF10775_consen 10 ADLVQDLYLKELKAYKPPPI 29 (67)
T ss_pred ccHHHHHHHHHHHhcCCCCC
Confidence 45556789999999988765
No 150
>TIGR00010 hydrolase, TatD family. Several genomes have multiple paralogs related to this family. However, a set of 17 proteins can be found, one each from 17 of the first 20 genomes, such that each member forms a bidirectional best hit across genomes with all other members of the set. This core set (and one other near-perfect member), but not the other paralogs, form the seed for this model. Additionally, members of the seed alignment and all trusted hits, but not all paralogs, have a conserved motif DxHxH near the amino end. The member from E. coli was recently shown to have DNase activity.
Probab=24.26 E-value=81 Score=21.56 Aligned_cols=20 Identities=25% Similarity=0.212 Sum_probs=17.9
Q ss_pred cHHHHHHHHHhCCCeEEEEe
Q psy12446 27 HLHLAYHELKKHQVKNVVRV 46 (77)
Q Consensus 27 tl~~yi~elk~~~Vt~VVRv 46 (77)
+++.++++++++||+.+|-+
T Consensus 16 ~~~~~~~~~~~~Gv~~~v~~ 35 (252)
T TIGR00010 16 DVEEVIERAKAAGVTAVVAV 35 (252)
T ss_pred CHHHHHHHHHHcCCCEEEEe
Confidence 69999999999999999844
No 151
>cd01311 PDC_hydrolase 2-pyrone-4,6-dicarboxylic acid (PDC) hydrolase hydrolyzes PDC to yield 4-oxalomesaconic acid (OMA) or its tautomer, 4-carboxy-2-hydroxymuconic acid (CHM). This reaction is part of the protocatechuate (PCA) 4,5-cleavage pathway. PCA is one of the most important intermediate metabolites in the bacterial pathways for various phenolic compounds, including lignin, which is the most abundant aromatic material in nature.
Probab=24.26 E-value=1.2e+02 Score=21.76 Aligned_cols=26 Identities=12% Similarity=-0.026 Sum_probs=22.6
Q ss_pred CcccHHHHHHHHHhCCCeEEEEeCCC
Q psy12446 24 LENHLHLAYHELKKHQVKNVVRVCEP 49 (77)
Q Consensus 24 t~~tl~~yi~elk~~~Vt~VVRvCe~ 49 (77)
.+.++++|++.++..||...|-+...
T Consensus 26 ~~~~~e~l~~~m~~~gV~~aV~vq~~ 51 (263)
T cd01311 26 YDPGIDDLRALRSTLGIDRVVIVQAS 51 (263)
T ss_pred CCCCHHHHHHHHHHhCCCcEEEeCcc
Confidence 47899999999999999999977643
No 152
>cd00419 Ferrochelatase_C Ferrochelatase, C-terminal domain: Ferrochelatase (protoheme ferrolyase or HemH) is the terminal enzyme of the heme biosynthetic pathway. It catalyzes the insertion of ferrous iron into the protoporphyrin IX ring yielding protoheme. This enzyme is ubiquitous in nature and widely distributed in bacteria and eukaryotes. Recently, some archaeal members have been identified. The oligomeric state of these enzymes varies depending on the presence of a dimerization motif at the C-terminus.
Probab=24.15 E-value=1.2e+02 Score=20.18 Aligned_cols=37 Identities=41% Similarity=0.386 Sum_probs=29.4
Q ss_pred CcccHHHHHHHHHhCCCeEEEEeCC--------CCcCc-----hhhhcCC
Q psy12446 24 LENHLHLAYHELKKHQVKNVVRVCE--------PTYKV-----EDLKTEG 60 (77)
Q Consensus 24 t~~tl~~yi~elk~~~Vt~VVRvCe--------~tYd~-----~~~~~~G 60 (77)
..-++++-|+++.+.|++.|+-+|= ..||- +.+++.|
T Consensus 75 l~P~~~~~l~~l~~~G~~~i~v~p~gF~~D~~Etl~di~~e~~~~~~~~G 124 (135)
T cd00419 75 LEPSTDDALEELAKEGVKNVVVVPIGFVSDHLETLYELDIEYRELAEEAG 124 (135)
T ss_pred CCCCHHHHHHHHHHcCCCeEEEECCccccccHHHHHHHHHHHHHHHHHcC
Confidence 4778999999999999999999884 44554 5577777
No 153
>PF13730 HTH_36: Helix-turn-helix domain
Probab=24.04 E-value=55 Score=17.83 Aligned_cols=17 Identities=18% Similarity=0.253 Sum_probs=14.0
Q ss_pred CcccHHHHHHHHHhCCC
Q psy12446 24 LENHLHLAYHELKKHQV 40 (77)
Q Consensus 24 t~~tl~~yi~elk~~~V 40 (77)
+.+|+.+.|++|++.|.
T Consensus 38 s~~Tv~~~i~~L~~~G~ 54 (55)
T PF13730_consen 38 SRRTVQRAIKELEEKGL 54 (55)
T ss_pred CHHHHHHHHHHHHHCcC
Confidence 57788999999998873
No 154
>TIGR00048 radical SAM enzyme, Cfr family. A Staphylococcus sciuri plasmid-borne member of this family, Cfr, has been identified as essential to transferrable resistance to chloramphenicol and florfenicol by an unknown mechanism. A 14-15 residue cluster with four perfectly conserved Cys residues suggests this protein may be an enzyme with an iron-sulfur cluster. The Cys cluster is part of the radical SAM domain, suggested to provide a general mechanism by which the Fe-S center cleaves S-adenosylmethionine to initiate radical-based catalysis. Members of this family lack apparent transmembrane domains.
Probab=23.95 E-value=81 Score=24.48 Aligned_cols=24 Identities=8% Similarity=0.011 Sum_probs=21.3
Q ss_pred CCcccHHHHHHHHHhCCCeEEEEe
Q psy12446 23 NLENHLHLAYHELKKHQVKNVVRV 46 (77)
Q Consensus 23 pt~~tl~~yi~elk~~~Vt~VVRv 46 (77)
|+...++.|.+.|+++|++..||-
T Consensus 306 ps~e~i~~f~~~L~~~gi~v~iR~ 329 (355)
T TIGR00048 306 PSNEQIDRFAKTLMSYGFTVTIRK 329 (355)
T ss_pred CCHHHHHHHHHHHHHCCCeEEEeC
Confidence 577889999999999999999984
No 155
>PF09587 PGA_cap: Bacterial capsule synthesis protein PGA_cap; InterPro: IPR019079 CapA is a putative poly-gamma-glutamate capsule biosynthesis protein found in bacteria. Poly-gamma-glutamate is a natural polymer that may be involved in virulence and may help bacteria survive in high salt concentrations. It is a surface-associated protein [].
Probab=23.92 E-value=1.2e+02 Score=21.62 Aligned_cols=40 Identities=20% Similarity=0.150 Sum_probs=29.7
Q ss_pred ccHHHHHHHHHhCCCeEEEEeCCCCcCc---------hhhhcCCceEEE
Q psy12446 26 NHLHLAYHELKKHQVKNVVRVCEPTYKV---------EDLKTEGINVKD 65 (77)
Q Consensus 26 ~tl~~yi~elk~~~Vt~VVRvCe~tYd~---------~~~~~~GI~vhd 65 (77)
.+.+..++.|+..|+..|---+.+.+|- +.++++||.+.-
T Consensus 62 ~~~~~~~~~L~~~G~d~vslANNH~~D~G~~gl~~Tl~~L~~~gi~~~G 110 (250)
T PF09587_consen 62 NAPPEILDALKDAGFDVVSLANNHIFDYGEEGLLDTLEALDKAGIPYVG 110 (250)
T ss_pred cCCHHHHHHHHHcCCCEEEecCCCCccccHHHHHHHHHHHHHCCCcEeE
Confidence 4456678888888888777777777775 677788887653
No 156
>cd08190 HOT Hydroxyacid-oxoacid transhydrogenase (HOT) involved in gamma-hydroxybutyrate metabolism. Hydroxyacid-oxoacid transhydrogenase (HOT), also known as D-2-hydroxyglutarate transhydrogenase. It catalyzes the conversion of gamma-hydroxybutyrate (GHB) to succinic semialdehyde (SSA), coupled to the stoichiometric conversion of alpha-ketoglutarate to D-2-hydroxyglutarate in gamma-Hydroxybutyrate catabolism. Unlike many other alcohols, which are oxidized by NAD-linked dehydrogenases, gamma-hydroxybutyrate is metabolized to succinate semialdehyde by hydroxyacid-oxoacid transhydrogenase which does not require free NAD or NADP, but instead using alpha -ketoglutarate as an acceptor, converting it to d-2-hydroxyglutarate. Alpha-ketoglutarate serves as an intermediate acceptor to regenerate NAD(P) required for the oxidation of GHB. HOT also catalyzes the reversible oxidation of a hydroxyacid obligatorily coupled to the reduction of an oxoacid, and requires no cofactor. In mammals, the HOT
Probab=23.86 E-value=1.9e+02 Score=22.56 Aligned_cols=44 Identities=25% Similarity=0.367 Sum_probs=29.5
Q ss_pred cHHHHHHHHHhCCCeEEEEeCCCCcCc--------hhhhcCCceEEEeecCCCC
Q psy12446 27 HLHLAYHELKKHQVKNVVRVCEPTYKV--------EDLKTEGINVKDLAYEDGT 72 (77)
Q Consensus 27 tl~~yi~elk~~~Vt~VVRvCe~tYd~--------~~~~~~GI~vhdl~F~DG~ 72 (77)
.++..-++++++|.+.++-++.+.... +.|+++||++. .|++..
T Consensus 10 ~~~~l~~~l~~~g~~~vlivt~~~~~~~g~~~~v~~~L~~~gi~~~--~f~~v~ 61 (414)
T cd08190 10 VTAEVGMDLKNLGARRVCLVTDPNLAQLPPVKVVLDSLEAAGINFE--VYDDVR 61 (414)
T ss_pred HHHHHHHHHHHcCCCeEEEEECcchhhcchHHHHHHHHHHcCCcEE--EeCCCC
Confidence 456666778888888888888764322 34778888875 355433
No 157
>PRK08564 5'-methylthioadenosine phosphorylase II; Reviewed
Probab=23.75 E-value=2.1e+02 Score=21.49 Aligned_cols=48 Identities=21% Similarity=0.261 Sum_probs=32.5
Q ss_pred eeeeeeeccceeEE--e----ec-CCcccHHHHHHHHHhCCCeEEEEeCC-----CCcCc
Q psy12446 6 LICRTLDYKTVTLY--H----GV-NLENHLHLAYHELKKHQVKNVVRVCE-----PTYKV 53 (77)
Q Consensus 6 ~~~~~~~y~~~~fl--i----~~-pt~~tl~~yi~elk~~~Vt~VVRvCe-----~tYd~ 53 (77)
++.-+++=+.+=|+ + .. |.+-+...-|..||+-||+.|++++. +.|..
T Consensus 43 l~~g~l~g~~v~~l~RhGr~H~y~~~~i~~~a~i~aLk~LGvk~iI~tnavGsl~~~~~p 102 (267)
T PRK08564 43 IIIGEIEGVEVAFLPRHGRGHRIPPHKINYRANIWALKELGVEWVIAVSAVGSLREDYKP 102 (267)
T ss_pred EEEEEECCEEEEEEeCCCCCcccCCccCcchHHHHHHHHCCCcEEEEeccccccCCCCCC
Confidence 34444544556666 3 22 35555677789999999999999995 56654
No 158
>cd08175 G1PDH Glycerol-1-phosphate dehydrogenase (G1PDH) catalyzes the reversible reduction of dihydroxyacetone phosphate (DHAP) to glycerol-1-phosphate (G1P) in an NADH-dependent manner. Glycerol-1-phosphate dehydrogenase (G1PDH) plays a role in the synthesis of phosphoglycerolipids in Gram-positive bacterial species. It catalyzes the reversibly reduction of dihydroxyacetone phosphate (DHAP) to glycerol-1-phosphate (G1P) in a NADH-dependent manner. Its activity requires a Ni++ ion. In Bacillus subtilis, it has been described as AraM gene in L-arabinose (ara) operon. AraM protein forms homodimer. This family is bacteria specific.
Probab=23.70 E-value=1.7e+02 Score=22.02 Aligned_cols=45 Identities=24% Similarity=0.253 Sum_probs=28.4
Q ss_pred HHHHHHHHHhCCCeEEEEeCCCC-cC------chhhhcCCceEEEeecCCCC
Q psy12446 28 LHLAYHELKKHQVKNVVRVCEPT-YK------VEDLKTEGINVKDLAYEDGT 72 (77)
Q Consensus 28 l~~yi~elk~~~Vt~VVRvCe~t-Yd------~~~~~~~GI~vhdl~F~DG~ 72 (77)
+++--++++++|.+.++-++.+. ++ .+.|+++||+++-+.+.+|.
T Consensus 11 l~~l~~~~~~~~~~~~livtd~~~~~~~~~~v~~~l~~~~i~~~~~~~~~~~ 62 (348)
T cd08175 11 LERLPEILKEFGYKKALIVADENTYAAAGKKVEALLKRAGVVVLLIVLPAGD 62 (348)
T ss_pred HHHHHHHHHhcCCCcEEEEECCcHHHHHHHHHHHHHHHCCCeeEEeecCCCc
Confidence 44444556667877777777642 22 15688899988655556554
No 159
>PF06230 DUF1009: Protein of unknown function (DUF1009); InterPro: IPR010415 This is a family of uncharacterised bacterial proteins.
Probab=23.70 E-value=59 Score=24.06 Aligned_cols=24 Identities=29% Similarity=0.383 Sum_probs=17.8
Q ss_pred HHHHHHHHhCCCeEEE---EeCCCCcC
Q psy12446 29 HLAYHELKKHQVKNVV---RVCEPTYK 52 (77)
Q Consensus 29 ~~yi~elk~~~Vt~VV---RvCe~tYd 52 (77)
...++.||++||+.|| .+-.|.++
T Consensus 2 G~~i~~lk~~gv~~vvmaG~v~rp~~~ 28 (214)
T PF06230_consen 2 GKIIKFLKREGVTRVVMAGKVKRPIFS 28 (214)
T ss_pred hHHHHHHHHcCCCEEEEeecccCcccc
Confidence 4578999999999998 44445543
No 160
>cd00401 AdoHcyase S-adenosyl-L-homocysteine hydrolase (AdoHycase) catalyzes the hydrolysis of S-adenosyl-L-homocysteine (AdoHyc) to form adenosine (Ado) and homocysteine (Hcy). The equilibrium lies far on the side of AdoHyc synthesis, but in nature the removal of Ado and Hyc is sufficiently fast, so that the net reaction is in the direction of hydrolysis. Since AdoHyc is a potent inhibitor of S-adenosyl-L-methionine dependent methyltransferases, AdoHycase plays a critical role in the modulation of the activity of various methyltransferases. The enzyme forms homooligomers of 45-50kDa subunits, each binding one molecule of NAD+.
Probab=23.61 E-value=2.8e+02 Score=22.22 Aligned_cols=57 Identities=11% Similarity=0.114 Sum_probs=41.0
Q ss_pred eeeccceeEEeecCCcccHHHHHHHHHhCCCeEEEEeCCCCcCc-----hhhhcCCceEEEee
Q psy12446 10 TLDYKTVTLYHGVNLENHLHLAYHELKKHQVKNVVRVCEPTYKV-----EDLKTEGINVKDLA 67 (77)
Q Consensus 10 ~~~y~~~~fli~~pt~~tl~~yi~elk~~~Vt~VVRvCe~tYd~-----~~~~~~GI~vhdl~ 67 (77)
+--.|++|..+-.+-...-...++-|+.-|..-.+--|.| +++ ..+.+.||.|+-|.
T Consensus 31 ~~p~~g~~i~~~~hl~~~ta~l~~~L~~~GA~v~~~~~np-~stqd~vaa~l~~~gi~v~a~~ 92 (413)
T cd00401 31 SKPLKGARIAGCLHMTVQTAVLIETLVALGAEVRWSSCNI-FSTQDHAAAAIAAAGIPVFAWK 92 (413)
T ss_pred cCCCCCCEEEEEEcchHHHHHHHHHHHHcCCEEEEEcCCC-ccchHHHHHHHHhcCceEEEEc
Confidence 3446777877765655555667788999999887777776 333 67788899998764
No 161
>TIGR02638 lactal_redase lactaldehyde reductase. This clade of genes encoding iron-containing alcohol dehydrogenase (pfam00465) proteins is generally found in apparent operons for the catabolism of rhamnose or fucose. Catabolism of both of these monosaccharides results in lactaldehyde which is reduced by this enzyme to 1,2 propanediol. This protein is alternatively known by the name 1,2 propanediol oxidoreductase. This enzyme is active under anaerobic conditions in E. coli while being inactivated by reactive oxygen species under aerobic conditions. Under aerobic conditions the lactaldehyde product of rhamnose and fucose catabolism is believed to be oxidized to lactate by a separate enzyme, lactaldehyde dehydrogenase.
Probab=23.61 E-value=2.4e+02 Score=21.57 Aligned_cols=44 Identities=16% Similarity=0.251 Sum_probs=30.2
Q ss_pred ccHHHHHHHHHhCCCeEEEEeCCCCcC------c--hhhhcCCceEEEeecCCC
Q psy12446 26 NHLHLAYHELKKHQVKNVVRVCEPTYK------V--EDLKTEGINVKDLAYEDG 71 (77)
Q Consensus 26 ~tl~~yi~elk~~~Vt~VVRvCe~tYd------~--~~~~~~GI~vhdl~F~DG 71 (77)
..++.--++++++|.+.++-++.+... . +.|+++||++. .|++.
T Consensus 15 g~l~~l~~~l~~~g~~r~lvvt~~~~~~~g~~~~v~~~L~~~~i~~~--~~~~v 66 (379)
T TIGR02638 15 GAIEDIVDEVKRRGFKKALVVTDKDLIKFGVADKVTDLLDEAGIAYE--LFDEV 66 (379)
T ss_pred CHHHHHHHHHHhcCCCEEEEEcCcchhhccchHHHHHHHHHCCCeEE--EECCC
Confidence 456666678888998888888865322 1 56788899875 35443
No 162
>PF02735 Ku: Ku70/Ku80 beta-barrel domain; InterPro: IPR006164 The Ku heterodimer is composed of Ku70 and Ku80 (or Ku86), 70 kDa and 80 kDa subunits of an ATP-dependent DNA helicase, which contributes to genomic integrity through its ability to bind DNA double-stranded breaks and facilitate repair by the non-homologous end-joining pathway. This is the central DNA-binding beta-barrel domain and is found in both the Ku70 and Ku80 proteins. Ku makes only a few contacts with the sugar-phosphate backbone, and none with the DNA bases, but it fits sterically to major and minor groove contours forming a ring that encircles duplex DNA, cradling two full turns of the DNA molecule. By forming a bridge between the broken DNA ends, Ku acts to structurally support and align the DNA ends, to protect them from degradation, and to prevent promiscuous binding to unbroken DNA. Ku effectively aligns the DNA, while still allowing access of polymerases, nucleases and ligases to the broken DNA ends to promote end joining [].; GO: 0003677 DNA binding, 0004003 ATP-dependent DNA helicase activity, 0006303 double-strand break repair via nonhomologous end joining; PDB: 1JEY_B 1JEQ_B.
Probab=23.60 E-value=1.4e+02 Score=20.52 Aligned_cols=57 Identities=11% Similarity=0.086 Sum_probs=37.3
Q ss_pred cceeEEeecC-C----cccHHHHHHHHHhCCCeEEEEeCCCCcCc-----------hhhhcCCceEEEeecCC
Q psy12446 14 KTVTLYHGVN-L----ENHLHLAYHELKKHQVKNVVRVCEPTYKV-----------EDLKTEGINVKDLAYED 70 (77)
Q Consensus 14 ~~~~fli~~p-t----~~tl~~yi~elk~~~Vt~VVRvCe~tYd~-----------~~~~~~GI~vhdl~F~D 70 (77)
+.-.|++-.+ . ...+...++-|.+.|.-.|+|.+-..-.. +.....|+.++.|||.|
T Consensus 103 ~~~~~i~p~~~~~~~s~~~f~aL~~am~~~~~vaI~r~v~r~~~~p~l~aL~P~~~~~~~~~gl~~~~Lpf~d 175 (200)
T PF02735_consen 103 GESYFIVPDEKSGEGSAKAFSALVQAMLEKNKVAIARYVLRSNSRPRLVALIPQIEESDTPEGLVLIRLPFAD 175 (200)
T ss_dssp CEEEEEEEETTTECCHHHHHHHHHHHHHHCTEEEEEEEESSTTS--EEEEEEEEE-CEEC-CEEEEEE---GG
T ss_pred CCcEEEEECCCCcchhHHHHHHHHHHHHhcCcEEEEEEEEcCCCCcEEEEEEEeccccCCCCeEEEEEcCChh
Confidence 4455666333 2 25788999999999999999999755222 11123899999999976
No 163
>smart00292 BRCT breast cancer carboxy-terminal domain.
Probab=23.59 E-value=1.3e+02 Score=15.97 Aligned_cols=35 Identities=9% Similarity=-0.122 Sum_probs=27.9
Q ss_pred ccceeEEeec-CCcccHHHHHHHHHhCCCeEEEEeC
Q psy12446 13 YKTVTLYHGV-NLENHLHLAYHELKKHQVKNVVRVC 47 (77)
Q Consensus 13 y~~~~fli~~-pt~~tl~~yi~elk~~~Vt~VVRvC 47 (77)
.++++|++.. .+....+...+.++++|-+.+..+.
T Consensus 3 f~g~~~~~~g~~~~~~~~~l~~~i~~~Gg~~~~~~~ 38 (80)
T smart00292 3 FKGKVFVITGKFDKNERDELKELIEALGGKVTSSLS 38 (80)
T ss_pred cCCeEEEEeCCCCCccHHHHHHHHHHcCCEEecccC
Confidence 4678999975 6778888888888999988777655
No 164
>PF13670 PepSY_2: Peptidase propeptide and YPEB domain This Prosite motif covers only the active site. This is family M4 in the peptidase classification.
Probab=23.50 E-value=59 Score=19.58 Aligned_cols=18 Identities=22% Similarity=0.700 Sum_probs=12.8
Q ss_pred hhhhcCCceEEEeecCCC
Q psy12446 54 EDLKTEGINVKDLAYEDG 71 (77)
Q Consensus 54 ~~~~~~GI~vhdl~F~DG 71 (77)
..++++|.+|+++.|+|+
T Consensus 36 ~~l~~~G~~v~~ve~~~~ 53 (83)
T PF13670_consen 36 AKLEAQGYQVREVEFDDD 53 (83)
T ss_pred HHHHhcCCceEEEEEcCC
Confidence 566777888888888443
No 165
>PRK14468 ribosomal RNA large subunit methyltransferase N; Provisional
Probab=23.41 E-value=81 Score=24.37 Aligned_cols=24 Identities=17% Similarity=0.110 Sum_probs=21.6
Q ss_pred cCCcccHHHHHHHHHhCCCeEEEE
Q psy12446 22 VNLENHLHLAYHELKKHQVKNVVR 45 (77)
Q Consensus 22 ~pt~~tl~~yi~elk~~~Vt~VVR 45 (77)
.|+...++.|.+.|+++|+...+|
T Consensus 293 ~ps~e~i~~f~~~L~~~Gi~vtiR 316 (343)
T PRK14468 293 SSPRAQILAFADVLERRGVPVSVR 316 (343)
T ss_pred CCCHHHHHHHHHHHHHCCCeEEEe
Confidence 368889999999999999999998
No 166
>COG0514 RecQ Superfamily II DNA helicase [DNA replication, recombination, and repair]
Probab=23.39 E-value=1.2e+02 Score=25.68 Aligned_cols=30 Identities=17% Similarity=0.203 Sum_probs=25.7
Q ss_pred eeccceeEEeecCCcccHHHHHHHHHhCCC
Q psy12446 11 LDYKTVTLYHGVNLENHLHLAYHELKKHQV 40 (77)
Q Consensus 11 ~~y~~~~fli~~pt~~tl~~yi~elk~~~V 40 (77)
||..++||.|-..-..+++.|.+|.=+-|=
T Consensus 294 IdKpdVRfViH~~lP~s~EsYyQE~GRAGR 323 (590)
T COG0514 294 IDKPDVRFVIHYDLPGSIESYYQETGRAGR 323 (590)
T ss_pred cCCCCceEEEEecCCCCHHHHHHHHhhccC
Confidence 799999999987777899999999876653
No 167
>PRK14466 ribosomal RNA large subunit methyltransferase N; Provisional
Probab=23.33 E-value=78 Score=24.87 Aligned_cols=24 Identities=17% Similarity=0.073 Sum_probs=21.9
Q ss_pred cCCcccHHHHHHHHHhCCCeEEEE
Q psy12446 22 VNLENHLHLAYHELKKHQVKNVVR 45 (77)
Q Consensus 22 ~pt~~tl~~yi~elk~~~Vt~VVR 45 (77)
.|+...++.|.+.|+++|+..-||
T Consensus 297 ~~s~~~~~~F~~~L~~~gi~~tvR 320 (345)
T PRK14466 297 GSDMARMEAFRDYLTSHGVFTTIR 320 (345)
T ss_pred CCCHHHHHHHHHHHHHCCCcEEEe
Confidence 367889999999999999999998
No 168
>PRK14460 ribosomal RNA large subunit methyltransferase N; Provisional
Probab=23.32 E-value=83 Score=24.46 Aligned_cols=25 Identities=12% Similarity=0.108 Sum_probs=22.1
Q ss_pred cCCcccHHHHHHHHHhCCCeEEEEe
Q psy12446 22 VNLENHLHLAYHELKKHQVKNVVRV 46 (77)
Q Consensus 22 ~pt~~tl~~yi~elk~~~Vt~VVRv 46 (77)
.|+...++.|.+.|+++|+...+|-
T Consensus 304 ~p~~e~v~~f~~~l~~~Gi~vtir~ 328 (354)
T PRK14460 304 APTEERILAFEKYLWSKGITAIIRK 328 (354)
T ss_pred CCCHHHHHHHHHHHHHCCCeEEEeC
Confidence 4688899999999999999999983
No 169
>TIGR00639 PurN phosphoribosylglycinamide formyltransferase, formyltetrahydrofolate-dependent. In phylogenetic analyses, the member from Saccharomyces cerevisiae shows a long branch length but membership in the family, while the formyltetrahydrofolate deformylases form a closely related outgroup.
Probab=23.32 E-value=2.6e+02 Score=19.76 Aligned_cols=53 Identities=15% Similarity=0.179 Sum_probs=34.8
Q ss_pred eeEEeec-CCcccHHHHHHHHHhCCC--eEEEEeCCC--CcCchhhhcCCceEEEeec
Q psy12446 16 VTLYHGV-NLENHLHLAYHELKKHQV--KNVVRVCEP--TYKVEDLKTEGINVKDLAY 68 (77)
Q Consensus 16 ~~fli~~-pt~~tl~~yi~elk~~~V--t~VVRvCe~--tYd~~~~~~~GI~vhdl~F 68 (77)
||.-|.+ -+.+++...++.+.+.+. .-++.+|.. .+-.+..++.||.++-+..
T Consensus 1 ~riail~sg~gs~~~~ll~~~~~~~l~~~I~~vi~~~~~~~~~~~A~~~gip~~~~~~ 58 (190)
T TIGR00639 1 KRIVVLISGNGSNLQAIIDACKEGKIPASVVLVISNKPDAYGLERAAQAGIPTFVLSL 58 (190)
T ss_pred CeEEEEEcCCChhHHHHHHHHHcCCCCceEEEEEECCccchHHHHHHHcCCCEEEECc
Confidence 3555654 488899999998887664 333333442 3445778889999876544
No 170
>COG1167 ARO8 Transcriptional regulators containing a DNA-binding HTH domain and an aminotransferase domain (MocR family) and their eukaryotic orthologs [Transcription / Amino acid transport and metabolism]
Probab=23.27 E-value=1.3e+02 Score=23.87 Aligned_cols=55 Identities=18% Similarity=0.196 Sum_probs=44.6
Q ss_pred eEEeecCCcccHHHHHHHHHhCCCeEEEEeCCCCcCc--hhhhcCCceEEEeecCCCCC
Q psy12446 17 TLYHGVNLENHLHLAYHELKKHQVKNVVRVCEPTYKV--EDLKTEGINVKDLAYEDGTS 73 (77)
Q Consensus 17 ~fli~~pt~~tl~~yi~elk~~~Vt~VVRvCe~tYd~--~~~~~~GI~vhdl~F~DG~~ 73 (77)
..+|++=+...|..-..-|.+-|.+-+ +=.|+|.. ..|+..|+++...|-+|.+.
T Consensus 157 ~IiiT~G~q~al~l~~~~l~~pGd~v~--vE~PtY~~~~~~~~~~g~~~~~vp~d~~G~ 213 (459)
T COG1167 157 QIVITSGAQQALDLLLRLLLDPGDTVL--VEDPTYPGALQALEALGARVIPVPVDEDGI 213 (459)
T ss_pred eEEEeCCHHHHHHHHHHHhCCCCCEEE--EcCCCcHHHHHHHHHcCCcEEecCCCCCCC
Confidence 678888888889988888888665544 35599998 89999999999999877654
No 171
>PRK05718 keto-hydroxyglutarate-aldolase/keto-deoxy-phosphogluconate aldolase; Provisional
Probab=23.07 E-value=99 Score=22.41 Aligned_cols=43 Identities=14% Similarity=0.099 Sum_probs=32.7
Q ss_pred cHHHHHHHHHhCCCeEEEEeCCCCc---CchhhhcCCceEEEeecC
Q psy12446 27 HLHLAYHELKKHQVKNVVRVCEPTY---KVEDLKTEGINVKDLAYE 69 (77)
Q Consensus 27 tl~~yi~elk~~~Vt~VVRvCe~tY---d~~~~~~~GI~vhdl~F~ 69 (77)
+....++.|+++++-.|+|...+.- =.+.+.+.||.+.|..+.
T Consensus 4 ~~~~~~~~l~~~~~iaV~r~~~~~~a~~i~~al~~~Gi~~iEitl~ 49 (212)
T PRK05718 4 WKTSIEEILRAGPVVPVIVINKLEDAVPLAKALVAGGLPVLEVTLR 49 (212)
T ss_pred hHHHHHHHHHHCCEEEEEEcCCHHHHHHHHHHHHHcCCCEEEEecC
Confidence 3445678999999999999765322 227888899999988764
No 172
>PLN02449 ferrochelatase
Probab=22.95 E-value=1e+02 Score=25.48 Aligned_cols=48 Identities=21% Similarity=0.094 Sum_probs=34.6
Q ss_pred CcccHHHHHHHHHhCCCeEEEEeCC--------CCcCc-----hhhhcCCce-EEEeecCCC
Q psy12446 24 LENHLHLAYHELKKHQVKNVVRVCE--------PTYKV-----EDLKTEGIN-VKDLAYEDG 71 (77)
Q Consensus 24 t~~tl~~yi~elk~~~Vt~VVRvCe--------~tYd~-----~~~~~~GI~-vhdl~F~DG 71 (77)
..-..++-|++|.+.||+.|+-+|= ..|+- +.+++.|++ .+-.|-.+.
T Consensus 336 L~P~t~d~L~~L~~~Gvk~VlvvPigFvSDhiETL~EiDiE~re~a~e~G~~~~~rVP~LN~ 397 (485)
T PLN02449 336 LKPYTDETIVELGKKGVKSLLAVPISFVSEHIETLEEIDMEYRELALESGIENWGRVPALGC 397 (485)
T ss_pred CCCCHHHHHHHHHHcCCCeEEEECCcccccchHHHHHHHHHHHHHHHHcCCceEEEcCCCCC
Confidence 4567778889999999999999983 44543 678889995 444554443
No 173
>cd00673 AlaRS_core Alanyl-tRNA synthetase (AlaRS) class II core catalytic domain. AlaRS is a homodimer. It is responsible for the attachment of alanine to the 3' OH group of ribose of the appropriate tRNA. This domain is primarily responsible for ATP-dependent formation of the enzyme bound aminoacyl-adenylate. Class II assignment is based upon its predicted structure and the presence of three characteristic sequence motifs.
Probab=22.78 E-value=74 Score=24.03 Aligned_cols=47 Identities=13% Similarity=0.005 Sum_probs=30.8
Q ss_pred HHHHHHHHHhCCCeEEEEeCCCCcCc--hhhhcCCceEEEeecCCCCCCC
Q psy12446 28 LHLAYHELKKHQVKNVVRVCEPTYKV--EDLKTEGINVKDLAYEDGTSPS 75 (77)
Q Consensus 28 l~~yi~elk~~~Vt~VVRvCe~tYd~--~~~~~~GI~vhdl~F~DG~~Pp 75 (77)
-+.|++.|+++|-+.|=+---.-.|. -+|..+|+.-.-=.| -|..+|
T Consensus 5 R~~fl~FF~~kgH~~v~s~slvp~dDptllFtnAGM~~Fkp~f-~G~~~p 53 (232)
T cd00673 5 RETFLSFFEKKGHTRVPSSPVVPRDDPTLLFTNAGMNQFKPIF-LGEVPP 53 (232)
T ss_pred HHHHHHHHHhCCCEEeCCCCcCCCCCCchheeccchhhhhHHh-cCCCCC
Confidence 46899999999977664333222342 388999997765555 665544
No 174
>TIGR01697 PNPH-PUNA-XAPA inosine guanosine and xanthosine phosphorylase family. Sequences from Clostridium and Thermotoga fall between these last two clades and are uncharacterized with respect to substrate range and operon.
Probab=22.74 E-value=1.8e+02 Score=21.14 Aligned_cols=25 Identities=12% Similarity=-0.011 Sum_probs=21.4
Q ss_pred CcccHHHHHHHHHhCCCeEEEEeCC
Q psy12446 24 LENHLHLAYHELKKHQVKNVVRVCE 48 (77)
Q Consensus 24 t~~tl~~yi~elk~~~Vt~VVRvCe 48 (77)
.++.+..-+++|+++||+.|+|+..
T Consensus 67 ~~~a~~~~~~~l~~~Gv~~II~~Gs 91 (248)
T TIGR01697 67 DMATVTFPVRVMKLLGVEILVVTNA 91 (248)
T ss_pred CHHHHHHHHHHHHHcCCCEEEEecc
Confidence 5667777799999999999999875
No 175
>KOG2915|consensus
Probab=22.73 E-value=62 Score=25.66 Aligned_cols=44 Identities=25% Similarity=0.458 Sum_probs=33.3
Q ss_pred HHHHHHHhCCCeEEE-----EeCCCCcCchhhhcCCceEEEeecCCCCCC
Q psy12446 30 LAYHELKKHQVKNVV-----RVCEPTYKVEDLKTEGINVKDLAYEDGTSP 74 (77)
Q Consensus 30 ~yi~elk~~~Vt~VV-----RvCe~tYd~~~~~~~GI~vhdl~F~DG~~P 74 (77)
.=.+||++||+-+.| =+|......+.+...++ ..|+|=|.-+.|
T Consensus 145 ka~eeFr~hgi~~~vt~~hrDVc~~GF~~ks~~aDaV-FLDlPaPw~AiP 193 (314)
T KOG2915|consen 145 KALEEFREHGIGDNVTVTHRDVCGSGFLIKSLKADAV-FLDLPAPWEAIP 193 (314)
T ss_pred HHHHHHHHhCCCcceEEEEeecccCCccccccccceE-EEcCCChhhhhh
Confidence 345899999987766 37999998887777777 448887776666
No 176
>COG4750 LicC CTP:phosphocholine cytidylyltransferase involved in choline phosphorylation for cell surface LPS epitopes [Cell envelope biogenesis, outer membrane]
Probab=22.67 E-value=67 Score=24.40 Aligned_cols=40 Identities=18% Similarity=0.145 Sum_probs=34.5
Q ss_pred eeeeeccceeEEeecCCcccHHHHHHHHHhCCCeEEEEeC
Q psy12446 8 CRTLDYKTVTLYHGVNLENHLHLAYHELKKHQVKNVVRVC 47 (77)
Q Consensus 8 ~~~~~y~~~~fli~~pt~~tl~~yi~elk~~~Vt~VVRvC 47 (77)
|..+++.+=+=|+..+-...++.=|+.|++-|+.+++-|-
T Consensus 14 ~~plT~~tpK~LlkV~g~plIErqI~~L~e~gI~dI~IVv 53 (231)
T COG4750 14 FVPLTQSTPKSLLKVNGEPLIERQIEQLREAGIDDITIVV 53 (231)
T ss_pred cccccccCChHHHHhcCcccHHHHHHHHHHCCCceEEEEe
Confidence 6677787777888888888999999999999999998775
No 177
>TIGR03556 photolyase_8HDF deoxyribodipyrimidine photo-lyase, 8-HDF type. This model describes a narrow clade of cyanobacterial deoxyribodipyrimidine photo-lyase. This group, in contrast to several closely related proteins, uses a chromophore that, in other lineages is modified further to become coenzyme F420. This chromophore is called 8-HDF in most articles on the DNA photolyase and FO in most literature on coenzyme F420.
Probab=22.60 E-value=1.5e+02 Score=23.56 Aligned_cols=32 Identities=31% Similarity=0.337 Sum_probs=19.6
Q ss_pred HHHHhCCCeEEE--EeCCCCcCc-------hhhhcCCceEEE
Q psy12446 33 HELKKHQVKNVV--RVCEPTYKV-------EDLKTEGINVKD 65 (77)
Q Consensus 33 ~elk~~~Vt~VV--RvCe~tYd~-------~~~~~~GI~vhd 65 (77)
+..+++||+.|+ +-.++ |.. +.+.+.||+++.
T Consensus 84 ~l~~~~~~~~V~~~~~~~~-~~~~rd~~v~~~l~~~~i~~~~ 124 (471)
T TIGR03556 84 QLAQQLGAKAVYWNLDVEP-YGRKRDRAVAAALKEAGIAVVT 124 (471)
T ss_pred HHHHHcCCCEEEEecccCH-HHHHHHHHHHHHHHHCCCEEEE
Confidence 444567888888 33332 222 456778998864
No 178
>PF12993 DUF3877: Domain of unknown function, E. rectale Gene description (DUF3877); InterPro: IPR024539 This entry represents proteins of unknown function found primarily in Firmicutes. The Eubacterium rectale gene appears to be upregulated in the presence of Bacteroides thetaiotaomicron compared to growth in pure culture [].
Probab=22.55 E-value=50 Score=24.13 Aligned_cols=65 Identities=12% Similarity=0.197 Sum_probs=43.5
Q ss_pred eeeeeccceeEEeecCC------------cccHHHHHHHHHhCCCeE--EEEeCCCCcCc----hhhhcCCceEEEeecC
Q psy12446 8 CRTLDYKTVTLYHGVNL------------ENHLHLAYHELKKHQVKN--VVRVCEPTYKV----EDLKTEGINVKDLAYE 69 (77)
Q Consensus 8 ~~~~~y~~~~fli~~pt------------~~tl~~yi~elk~~~Vt~--VVRvCe~tYd~----~~~~~~GI~vhdl~F~ 69 (77)
-.+|+.|+=||-+.-|- +.=|..+|+-+-+||+|. |.++. ..|+. +...+.++... ..|+
T Consensus 65 ~v~vs~k~eRFCf~IP~~g~~YVhe~~~~~eFik~lIe~v~~hgcT~e~I~~~F-~~ys~~~~~e~~~~~eFD~~-i~Fe 142 (175)
T PF12993_consen 65 EVEVSHKGERFCFHIPEEGSEYVHEHTKENEFIKELIELVGKHGCTLEDILELF-HKYSDNVHCEEMDNGEFDYL-IYFE 142 (175)
T ss_pred cEEEEecCcEEEEEcCcHHHHHHHhcCCCCHHHHHHHHHHhcCCcCHHHHHHHH-HHhcCCeEEEeecCCCCCEE-EEec
Confidence 45789999999886552 333667777777888762 33333 24444 66777788775 8999
Q ss_pred CCCCC
Q psy12446 70 DGTSP 74 (77)
Q Consensus 70 DG~~P 74 (77)
||.+=
T Consensus 143 d~~~D 147 (175)
T PF12993_consen 143 DGNPD 147 (175)
T ss_pred CCCCC
Confidence 98763
No 179
>PRK08005 epimerase; Validated
Probab=22.44 E-value=1.3e+02 Score=21.81 Aligned_cols=48 Identities=15% Similarity=0.058 Sum_probs=33.6
Q ss_pred cccHHHHHHHHHhCCCeEEEEeCCCCcCc----hhhhcCCceEEEeecCCCCC
Q psy12446 25 ENHLHLAYHELKKHQVKNVVRVCEPTYKV----EDLKTEGINVKDLAYEDGTS 73 (77)
Q Consensus 25 ~~tl~~yi~elk~~~Vt~VVRvCe~tYd~----~~~~~~GI~vhdl~F~DG~~ 73 (77)
-.++++||+.|.+.|+..|.-=-|.+-+. +..+++|++. .+.+.=+++
T Consensus 67 v~~P~~~i~~~~~~gad~It~H~Ea~~~~~~~l~~Ik~~G~k~-GlAlnP~Tp 118 (210)
T PRK08005 67 VSSPQRWLPWLAAIRPGWIFIHAESVQNPSEILADIRAIGAKA-GLALNPATP 118 (210)
T ss_pred cCCHHHHHHHHHHhCCCEEEEcccCccCHHHHHHHHHHcCCcE-EEEECCCCC
Confidence 34689999999999998887666654332 6677888876 455554443
No 180
>PRK14456 ribosomal RNA large subunit methyltransferase N; Provisional
Probab=22.38 E-value=81 Score=24.78 Aligned_cols=25 Identities=12% Similarity=-0.049 Sum_probs=21.5
Q ss_pred cCCcccHHHHHHHHHhCCCeEEEEe
Q psy12446 22 VNLENHLHLAYHELKKHQVKNVVRV 46 (77)
Q Consensus 22 ~pt~~tl~~yi~elk~~~Vt~VVRv 46 (77)
.|+...++.|.+.|+++|+...||-
T Consensus 325 ~ps~e~i~~F~~~L~~~Gi~vtvR~ 349 (368)
T PRK14456 325 PVCSSTRERFRDRLLDAGLQVTVRK 349 (368)
T ss_pred CCCHHHHHHHHHHHHHCCCcEEeeC
Confidence 3567789999999999999999983
No 181
>KOG3985|consensus
Probab=22.37 E-value=96 Score=24.17 Aligned_cols=25 Identities=8% Similarity=0.129 Sum_probs=22.7
Q ss_pred cCCcccHHHHHHHHHhCCCeEEEEe
Q psy12446 22 VNLENHLHLAYHELKKHQVKNVVRV 46 (77)
Q Consensus 22 ~pt~~tl~~yi~elk~~~Vt~VVRv 46 (77)
+|++-+...-|.-||+.|++.|+-+
T Consensus 68 mPt~Vn~rANiwALk~~gc~~ii~~ 92 (283)
T KOG3985|consen 68 MPTKVNFRANIWALKSLGCTAIISF 92 (283)
T ss_pred CCCcCchhHhHHHHHhCCCcEEEEe
Confidence 5899999999999999999999854
No 182
>PF09383 NIL: NIL domain; InterPro: IPR018449 This domain is found at the C terminus of ABC transporter proteins involved in D-methionine transport as well as a number of ferredoxin-like proteins. This domain is likely to act as a substrate binding domain. The domain has been named after a conserved sequence in some members of the family. ; PDB: 2QRR_A 3CED_A 2QSW_A 3TUZ_D 3TUJ_D 3DHX_B 3TUI_H 3DHW_D.
Probab=22.18 E-value=76 Score=18.61 Aligned_cols=23 Identities=13% Similarity=0.140 Sum_probs=16.7
Q ss_pred EeecC-CcccHHHHHHHHHhCCCe
Q psy12446 19 YHGVN-LENHLHLAYHELKKHQVK 41 (77)
Q Consensus 19 li~~p-t~~tl~~yi~elk~~~Vt 41 (77)
++..+ .+..++..+++|+++||.
T Consensus 49 ~l~l~g~~~~~~~a~~~L~~~~v~ 72 (76)
T PF09383_consen 49 ILELPGDDEEIEKAIAYLREQGVE 72 (76)
T ss_dssp EEEEES-HHHHHHHHHHHHHTTEE
T ss_pred EEEEECCHHHHHHHHHHHHHCCCe
Confidence 33444 556689999999999975
No 183
>PRK12560 adenine phosphoribosyltransferase; Provisional
Probab=22.15 E-value=2.8e+02 Score=19.41 Aligned_cols=54 Identities=20% Similarity=0.188 Sum_probs=38.8
Q ss_pred ccceeEEe--e-cCCcccHHHHHHHHHhCCCeEEEEeC--CCCc-C-chhh-hcCCceEEEe
Q psy12446 13 YKTVTLYH--G-VNLENHLHLAYHELKKHQVKNVVRVC--EPTY-K-VEDL-KTEGINVKDL 66 (77)
Q Consensus 13 y~~~~fli--~-~pt~~tl~~yi~elk~~~Vt~VVRvC--e~tY-d-~~~~-~~~GI~vhdl 66 (77)
.++-|.|| | .-|..|+..-++.+++.|.+-+-.+| +... . .+.+ ++.||.++-+
T Consensus 112 ~~G~rVlIVDDvitTG~T~~~ai~ll~~aGa~vv~v~~vvd~~~~~g~~~l~~~~gv~v~sl 173 (187)
T PRK12560 112 EKGDRVAIIDDTLSTGGTVIALIKAIENSGGIVSDVICVIEKTQNNGRKKLFTQTGINVKSL 173 (187)
T ss_pred CCcCEEEEEEeccccCHHHHHHHHHHHHCCCEEEEEEEEEEecccchHHHHhhccCCcEEEE
Confidence 46778888 4 47999999999999999987655554 4332 2 2444 6789988743
No 184
>KOG3349|consensus
Probab=22.09 E-value=1.6e+02 Score=21.52 Aligned_cols=41 Identities=12% Similarity=0.239 Sum_probs=27.4
Q ss_pred HHHHHHHHhCCCeEEEE-------eCCCCcCchhhhcCCceEEEeecCC
Q psy12446 29 HLAYHELKKHQVKNVVR-------VCEPTYKVEDLKTEGINVKDLAYED 70 (77)
Q Consensus 29 ~~yi~elk~~~Vt~VVR-------vCe~tYd~~~~~~~GI~vhdl~F~D 70 (77)
+.+.++|+++|++.++- +|.-.-+.-. +..|+.+.-.-|.+
T Consensus 25 ~~~~~~L~k~G~~kLiiQ~Grg~~~~~d~~~~~~-k~~gl~id~y~f~p 72 (170)
T KOG3349|consen 25 EEFLQELQKRGFTKLIIQIGRGQPFFGDPIDLIR-KNGGLTIDGYDFSP 72 (170)
T ss_pred HHHHHHHHHcCccEEEEEecCCccCCCCHHHhhc-ccCCeEEEEEecCc
Confidence 67889999999999882 3333333222 57888887666643
No 185
>PRK14719 bifunctional RNAse/5-amino-6-(5-phosphoribosylamino)uracil reductase; Provisional
Probab=22.01 E-value=1.6e+02 Score=23.11 Aligned_cols=39 Identities=15% Similarity=0.322 Sum_probs=29.6
Q ss_pred ccHHHHHHHHHhCCCeEEEEeCCCCcC--------chhhhcCCceEE
Q psy12446 26 NHLHLAYHELKKHQVKNVVRVCEPTYK--------VEDLKTEGINVK 64 (77)
Q Consensus 26 ~tl~~yi~elk~~~Vt~VVRvCe~tYd--------~~~~~~~GI~vh 64 (77)
..+.++.+.+.+.||..||-.+.|.=. .+.|+++||+|.
T Consensus 53 ~p~~~cad~ii~~gi~rVVi~~D~d~~G~~~~~~~~~~L~~aGi~V~ 99 (360)
T PRK14719 53 TPVFQIADDLIAENISEVILLTDFDRAGRVYAKNIMEEFQSRGIKVN 99 (360)
T ss_pred CchHHHHHHHHHcCCCEEEEEECCCCCCCccchHHHHHHHHCCCEEE
Confidence 344558899999999999988853322 378999999993
No 186
>TIGR01573 cas2 CRISPR-associated endoribonuclease Cas2. This model describes most members of the family of Cas2, one of the first four protein families found to mark prokaryotic genomes that contain multiple CRISPR elements. It is an endoribonuclease, capable of cleaving single-stranded RNA. CRISPR is an acronym for Clustered Regularly Interspaced Short Palindromic Repeats. The cas genes are found near the repeats. A distinct branch of the Cas2 family shows a very low level of sequence identity and is modeled by TIGR01873 instead.
Probab=21.92 E-value=1.2e+02 Score=19.01 Aligned_cols=24 Identities=8% Similarity=0.023 Sum_probs=16.7
Q ss_pred eeEEe--ecCCc---ccHHHHHHHHHhCC
Q psy12446 16 VTLYH--GVNLE---NHLHLAYHELKKHQ 39 (77)
Q Consensus 16 ~~fli--~~pt~---~tl~~yi~elk~~~ 39 (77)
|..|| |-|.+ .....+.+.|+++|
T Consensus 1 M~~lV~YDI~~~~~~k~r~kv~k~L~~~G 29 (95)
T TIGR01573 1 MLVLVVYDIPTDGERKRRRKLRKLLEKYG 29 (95)
T ss_pred CEEEEEEECCCCchHHHHHHHHHHHHHcc
Confidence 44555 55666 57778888888888
No 187
>PF15533 Toxin_54: Putative toxin 54
Probab=21.92 E-value=16 Score=22.91 Aligned_cols=18 Identities=33% Similarity=0.589 Sum_probs=13.8
Q ss_pred CcCchhhhcCCceEEEee
Q psy12446 50 TYKVEDLKTEGINVKDLA 67 (77)
Q Consensus 50 tYd~~~~~~~GI~vhdl~ 67 (77)
.+|.+.|+++||..|++-
T Consensus 10 ~~di~~Lk~~G~d~H~lK 27 (66)
T PF15533_consen 10 KGDIEKLKKNGIDAHELK 27 (66)
T ss_pred HhHHHHHHHcCCcHHHHH
Confidence 356688899999888764
No 188
>PRK14454 ribosomal RNA large subunit methyltransferase N; Provisional
Probab=21.79 E-value=84 Score=24.30 Aligned_cols=24 Identities=17% Similarity=0.257 Sum_probs=21.7
Q ss_pred cCCcccHHHHHHHHHhCCCeEEEE
Q psy12446 22 VNLENHLHLAYHELKKHQVKNVVR 45 (77)
Q Consensus 22 ~pt~~tl~~yi~elk~~~Vt~VVR 45 (77)
.|+...++.|.+.|+++|+...||
T Consensus 298 ~ps~e~l~~f~~~l~~~gi~v~iR 321 (342)
T PRK14454 298 KSSKEKIKKFKNILKKNGIETTIR 321 (342)
T ss_pred CCCHHHHHHHHHHHHHCCCcEEEe
Confidence 357889999999999999999998
No 189
>PRK14461 ribosomal RNA large subunit methyltransferase N; Provisional
Probab=21.79 E-value=91 Score=24.93 Aligned_cols=24 Identities=17% Similarity=0.109 Sum_probs=21.5
Q ss_pred CCcccHHHHHHHHHhCCCeEEEEe
Q psy12446 23 NLENHLHLAYHELKKHQVKNVVRV 46 (77)
Q Consensus 23 pt~~tl~~yi~elk~~~Vt~VVRv 46 (77)
|+...+..|.+.|+++||...||-
T Consensus 325 ps~~~i~~F~~~L~~~gi~vtiR~ 348 (371)
T PRK14461 325 SERERVTTFQRILTDYGIPCTVRV 348 (371)
T ss_pred CCHHHHHHHHHHHHHCCceEEEeC
Confidence 467789999999999999999985
No 190
>PRK14469 ribosomal RNA large subunit methyltransferase N; Provisional
Probab=21.73 E-value=92 Score=23.79 Aligned_cols=25 Identities=12% Similarity=0.065 Sum_probs=21.9
Q ss_pred cCCcccHHHHHHHHHhCCCeEEEEe
Q psy12446 22 VNLENHLHLAYHELKKHQVKNVVRV 46 (77)
Q Consensus 22 ~pt~~tl~~yi~elk~~~Vt~VVRv 46 (77)
.|+...++.|.+.|+++|+...+|-
T Consensus 297 ~ps~e~l~~f~~~l~~~gi~vtvr~ 321 (343)
T PRK14469 297 KPSRERIERFKEILLKNGIEAEIRR 321 (343)
T ss_pred CCCHHHHHHHHHHHHHCCCeEEEeC
Confidence 3578889999999999999999983
No 191
>PF14403 CP_ATPgrasp_2: Circularly permuted ATP-grasp type 2
Probab=21.63 E-value=1.1e+02 Score=25.11 Aligned_cols=43 Identities=16% Similarity=0.286 Sum_probs=34.7
Q ss_pred cCCcccHHHHHHHHHhCCCeEEEEeCC---CCcCchhhhcCCceEEEee
Q psy12446 22 VNLENHLHLAYHELKKHQVKNVVRVCE---PTYKVEDLKTEGINVKDLA 67 (77)
Q Consensus 22 ~pt~~tl~~yi~elk~~~Vt~VVRvCe---~tYd~~~~~~~GI~vhdl~ 67 (77)
.|+.+.++.|.+.|.++|+..+| |. -+|+...|-..|..| |+-
T Consensus 196 ~~~~~Ef~~f~~~f~~~G~~~vI--~d~~~L~y~~g~L~~~~~~I-D~V 241 (445)
T PF14403_consen 196 YPTLSEFEVFQRLFEEHGYDCVI--CDPRDLEYRDGRLYAGGRPI-DAV 241 (445)
T ss_pred CCccchHHHHHHHHHHcCCceEe--cChHHceecCCEEEECCEee-ehh
Confidence 68999999999999999999987 64 468888887777654 443
No 192
>PRK14453 chloramphenicol/florfenicol resistance protein; Provisional
Probab=21.59 E-value=90 Score=24.29 Aligned_cols=25 Identities=16% Similarity=0.171 Sum_probs=21.5
Q ss_pred cCCcccHHHHHHHHHhCCCeEEEEe
Q psy12446 22 VNLENHLHLAYHELKKHQVKNVVRV 46 (77)
Q Consensus 22 ~pt~~tl~~yi~elk~~~Vt~VVRv 46 (77)
.|+...+..|.+.|+++|+...||-
T Consensus 302 ~ps~e~v~~f~~~L~~~Gi~vtiR~ 326 (347)
T PRK14453 302 SSSAGQIKQFCSTLKSAGISVTVRT 326 (347)
T ss_pred CCCHHHHHHHHHHHHHCCCcEEEeC
Confidence 3567779999999999999999883
No 193
>cd03742 SOCS_Rab40 SOCS (suppressors of cytokine signaling) box of Rab40-like proteins. Rab40 is part of the Rab family of small GTP-binding proteins that form the largest family within the Ras superfamily. Rab proteins regulate vesicular trafficking pathways, behaving as membrane-associated molecular switches. Rab40 is characterized by a SOCS box c-terminal to the GTPase domain. The SOCS boxes interact with Elongins B and C, Cullin-5 or Cullin-2, Rbx-1, and E2. Therefore, SOCS-box-containing proteins probably function as E3 ubiquitin ligases and mediate the degradation of proteins associated through their N-terminal regions.
Probab=21.53 E-value=16 Score=20.94 Aligned_cols=29 Identities=7% Similarity=-0.028 Sum_probs=19.1
Q ss_pred eeeeeeeeeccceeEEeecCCcccHHHHHHHH
Q psy12446 4 RALICRTLDYKTVTLYHGVNLENHLHLAYHEL 35 (77)
Q Consensus 4 ~~~~~~~~~y~~~~fli~~pt~~tl~~yi~el 35 (77)
|+.|++-++|-++.=| |-...|.+|++++
T Consensus 11 R~~I~~~t~~~~I~~L---PLP~~Lk~yLkey 39 (43)
T cd03742 11 CRAIVSCTPVYLIDKL---PLPVSIKSHLKSF 39 (43)
T ss_pred HHHHHHhCCcchhhhC---CCCHHHHHHHHhc
Confidence 4556666666544333 7888899997765
No 194
>PRK13256 thiopurine S-methyltransferase; Reviewed
Probab=21.39 E-value=1.1e+02 Score=22.49 Aligned_cols=45 Identities=11% Similarity=0.242 Sum_probs=34.8
Q ss_pred CcccHHHHHHHHH-hCCCeEEEEeCCCCcCchhhhcCCceEEEeec
Q psy12446 24 LENHLHLAYHELK-KHQVKNVVRVCEPTYKVEDLKTEGINVKDLAY 68 (77)
Q Consensus 24 t~~tl~~yi~elk-~~~Vt~VVRvCe~tYd~~~~~~~GI~vhdl~F 68 (77)
.+..|..++..+. ..|-+-+|=.|...+|...|-+.|..|.-+-|
T Consensus 28 pnp~L~~~~~~l~~~~~~rvLvPgCGkg~D~~~LA~~G~~V~GvDl 73 (226)
T PRK13256 28 PNEFLVKHFSKLNINDSSVCLIPMCGCSIDMLFFLSKGVKVIGIEL 73 (226)
T ss_pred CCHHHHHHHHhcCCCCCCeEEEeCCCChHHHHHHHhCCCcEEEEec
Confidence 3455666666665 34568899999999999999999999986544
No 195
>cd08191 HHD 6-hydroxyhexanoate dehydrogenase (HHD) catalyzes the oxidation of 6-hydroxyhexanoate to 6-oxohexanoate. 6-hydroxyhexanoate dehydrogenase (HHD). The 6-hydroxyhexanoate dehydrogenase catalyzes the oxidation of 6-hydroxyhexanoate to 6-oxohexanoate. Some bacteria can grow on cyclic ketones, cyclohexylamine, and alcohols as sole carbon source. Cyclohexylamine is an insecticide and antiseptic in various industries and is considered a possible environmental pollutant. The degradation of these chemical compounds are through the cyclohexanol and cyclohexanone biological oxidation pathway. The intermediates of this pathway include cyclohexanol, cyclohexanone, e-caprolactone, 6-hydroxyhexanoate, 6-oxohexanoate and adipate. The 6-hydroxyhexanoate dehydrogenase catalyzes the oxidation of 6-hydroxyhexanoate to 6-oxohexanoate.
Probab=21.16 E-value=2.5e+02 Score=21.54 Aligned_cols=42 Identities=10% Similarity=0.115 Sum_probs=26.8
Q ss_pred HHHHHHHHHhCCCeEEEEeCCCC-cCc-------hhhhcCCceEEEeecCCCC
Q psy12446 28 LHLAYHELKKHQVKNVVRVCEPT-YKV-------EDLKTEGINVKDLAYEDGT 72 (77)
Q Consensus 28 l~~yi~elk~~~Vt~VVRvCe~t-Yd~-------~~~~~~GI~vhdl~F~DG~ 72 (77)
++.--++++++| +.++-++.+. ++. +.++++||++. .|++..
T Consensus 11 l~~l~~~~~~~g-~~~livt~~~~~~~~~~~~v~~~L~~~~~~~~--~f~~v~ 60 (386)
T cd08191 11 RRQLPRLAARLG-SRALIVTDERMAGTPVFAELVQALAAAGVEVE--VFDGVL 60 (386)
T ss_pred HHHHHHHHHHcC-CeEEEEECcchhhcchHHHHHHHHHHcCCeEE--EECCCC
Confidence 455556677778 7777777643 331 45778899874 366655
No 196
>PRK05647 purN phosphoribosylglycinamide formyltransferase; Reviewed
Probab=21.15 E-value=3.1e+02 Score=19.43 Aligned_cols=53 Identities=15% Similarity=0.113 Sum_probs=35.7
Q ss_pred eeEEeecC-CcccHHHHHHHHHhCC--CeEEEEeCCC--CcCchhhhcCCceEEEeec
Q psy12446 16 VTLYHGVN-LENHLHLAYHELKKHQ--VKNVVRVCEP--TYKVEDLKTEGINVKDLAY 68 (77)
Q Consensus 16 ~~fli~~p-t~~tl~~yi~elk~~~--Vt~VVRvCe~--tYd~~~~~~~GI~vhdl~F 68 (77)
||..|.+. +.++++..++.+.+.+ +.-++-++.. .+-.+..++.||.++.+..
T Consensus 2 ~ki~vl~sg~gs~~~~ll~~~~~~~~~~~I~~vvs~~~~~~~~~~a~~~gIp~~~~~~ 59 (200)
T PRK05647 2 KRIVVLASGNGSNLQAIIDACAAGQLPAEIVAVISDRPDAYGLERAEAAGIPTFVLDH 59 (200)
T ss_pred ceEEEEEcCCChhHHHHHHHHHcCCCCcEEEEEEecCccchHHHHHHHcCCCEEEECc
Confidence 45566544 7888999999888765 4455445553 3445677888999876543
No 197
>TIGR00126 deoC deoxyribose-phosphate aldolase. Deoxyribose-phosphate aldolase is involved in the catabolism of nucleotides and deoxyriibonucleotides. The catalytic process is as follows: 2-deoxy-D-ribose 5-phosphate = D-glyceraldehyde 3-phosphate + acetaldehyde. It is found in both gram-postive and gram-negative bacteria.
Probab=21.11 E-value=52 Score=23.91 Aligned_cols=52 Identities=15% Similarity=0.117 Sum_probs=36.8
Q ss_pred CcccHHHHHHHHHhCCCeEEE-EeCCCCcCchhhhcCCceEE-EeecCCCCCCC
Q psy12446 24 LENHLHLAYHELKKHQVKNVV-RVCEPTYKVEDLKTEGINVK-DLAYEDGTSPS 75 (77)
Q Consensus 24 t~~tl~~yi~elk~~~Vt~VV-RvCe~tYd~~~~~~~GI~vh-dl~F~DG~~Pp 75 (77)
|...+.++.++-+++|+-.|. .-|.-.+=++.++..++++- =.-||=|..|+
T Consensus 16 t~~~i~~lc~~A~~~~~~avcv~p~~v~~a~~~l~~~~v~v~tVigFP~G~~~~ 69 (211)
T TIGR00126 16 TEEDIITLCAQAKTYKFAAVCVNPSYVPLAKELLKGTEVRICTVVGFPLGASTT 69 (211)
T ss_pred CHHHHHHHHHHHHhhCCcEEEeCHHHHHHHHHHcCCCCCeEEEEeCCCCCCCcH
Confidence 888899999999999987763 44444455566766667654 24566687765
No 198
>PRK14464 ribosomal RNA large subunit methyltransferase N; Provisional
Probab=21.00 E-value=1e+02 Score=24.23 Aligned_cols=25 Identities=16% Similarity=0.045 Sum_probs=22.5
Q ss_pred cCCcccHHHHHHHHHhCCCeEEEEe
Q psy12446 22 VNLENHLHLAYHELKKHQVKNVVRV 46 (77)
Q Consensus 22 ~pt~~tl~~yi~elk~~~Vt~VVRv 46 (77)
.|+...+..|.+.|+++||..-||-
T Consensus 289 rp~~~~i~~f~~~L~~~gi~~tiR~ 313 (344)
T PRK14464 289 RPSGERIVAMARYLHRRGVLTKVRN 313 (344)
T ss_pred CCCHHHHHHHHHHHHHCCceEEEEC
Confidence 4788899999999999999999984
No 199
>cd04441 DEP_2_DEP6 DEP (Dishevelled, Egl-10, and Pleckstrin) domain 2 found in DEP6-like proteins. DEP6 proteins contain two DEP and a PDZ domain. Their function is unknown.
Probab=20.93 E-value=56 Score=20.80 Aligned_cols=19 Identities=21% Similarity=0.244 Sum_probs=15.3
Q ss_pred hhhhcCCceEE---EeecCCCC
Q psy12446 54 EDLKTEGINVK---DLAYEDGT 72 (77)
Q Consensus 54 ~~~~~~GI~vh---dl~F~DG~ 72 (77)
+.|.++||-+| +..|.||.
T Consensus 59 q~Ll~~gii~HV~~~h~F~D~~ 80 (85)
T cd04441 59 RRLLEHGIIQHVSNKHHFFDSN 80 (85)
T ss_pred HHHHHCCCEEecCCCCCccCCC
Confidence 55778888777 88999985
No 200
>PF03683 UPF0175: Uncharacterised protein family (UPF0175); InterPro: IPR005368 This entry contains small proteins of unknown function.
Probab=20.80 E-value=72 Score=19.36 Aligned_cols=25 Identities=20% Similarity=0.256 Sum_probs=18.9
Q ss_pred ccHHHHHHHHHhCCCeEEEEeCCCCcCchhhh
Q psy12446 26 NHLHLAYHELKKHQVKNVVRVCEPTYKVEDLK 57 (77)
Q Consensus 26 ~tl~~yi~elk~~~Vt~VVRvCe~tYd~~~~~ 57 (77)
=+...|++.|+++||. + .|+.+.++
T Consensus 46 ~s~~eF~~~L~~~gI~--~-----~~~~eel~ 70 (76)
T PF03683_consen 46 MSRWEFLELLKERGIP--I-----NYDEEELE 70 (76)
T ss_pred CCHHHHHHHHHHCCCC--C-----CCCHHHHH
Confidence 3677899999999998 3 27766554
No 201
>PRK13111 trpA tryptophan synthase subunit alpha; Provisional
Probab=20.71 E-value=1.4e+02 Score=22.16 Aligned_cols=26 Identities=19% Similarity=0.125 Sum_probs=12.5
Q ss_pred cHHHHHHHHHhCCCeEEEEeCCCCcCc
Q psy12446 27 HLHLAYHELKKHQVKNVVRVCEPTYKV 53 (77)
Q Consensus 27 tl~~yi~elk~~~Vt~VVRvCe~tYd~ 53 (77)
..+.|++.++++|+..| -+|.|+...
T Consensus 130 e~~~~~~~~~~~gl~~I-~lvap~t~~ 155 (258)
T PRK13111 130 EAEELRAAAKKHGLDLI-FLVAPTTTD 155 (258)
T ss_pred HHHHHHHHHHHcCCcEE-EEeCCCCCH
Confidence 34444555555555444 345555543
No 202
>TIGR00936 ahcY adenosylhomocysteinase. This enzyme hydrolyzes adenosylhomocysteine as part of a cycle for the regeneration of the methyl donor S-adenosylmethionine. Species that lack this enzyme are likely to have adenosylhomocysteine nucleosidase (EC 3.2.2.9), an enzyme which also acts as 5'-methyladenosine nucleosidase (see TIGR01704).
Probab=20.71 E-value=3.6e+02 Score=21.64 Aligned_cols=56 Identities=14% Similarity=0.141 Sum_probs=39.7
Q ss_pred eeccceeEEeecCCcccHHHHHHHHHhCCCeEEEEeCCCCcCc-----hhhhcCCceEEEee
Q psy12446 11 LDYKTVTLYHGVNLENHLHLAYHELKKHQVKNVVRVCEPTYKV-----EDLKTEGINVKDLA 67 (77)
Q Consensus 11 ~~y~~~~fli~~pt~~tl~~yi~elk~~~Vt~VVRvCe~tYd~-----~~~~~~GI~vhdl~ 67 (77)
=-.|++|..+-.+-...--..++-|+.-|..-.+--|.|. ++ ..+.+.||.|+-|.
T Consensus 28 ~pl~G~~i~~~~hl~~~Ta~l~~~L~~~GA~v~~~~~np~-stqd~vaaaL~~~gi~v~a~~ 88 (406)
T TIGR00936 28 KPLKGARIAACLHVTVETAVLIETLVAGGAEVAWTSCNPL-STQDDVAAALAKAGIPVFAWR 88 (406)
T ss_pred CCCCCCEEEEEEechHHHHHHHHHHHHcCCEEEEEccCCc-cccHHHHHHHHhCCceEEEec
Confidence 3467777777555444445667889999998877777874 44 67788899998553
No 203
>cd08195 DHQS Dehydroquinate synthase (DHQS) catalyzes the conversion of DAHP to DHQ in shikimate pathway for aromatic compounds synthesis. Dehydroquinate synthase (DHQS) catalyzes the conversion of 3-deoxy-D-arabino-heptulosonate-7-phosphate (DAHP) to dehydroquinate (DHQ) in the second step of the shikimate pathway. This pathway, which involves seven sequential enzymatic steps in the conversion of erythrose 4-phosphate and phosphoenolpyruvate into chorismate for subsequent synthesis of aromatic compounds, is found in bacteria, microbial eukaryotes, and plants, but not in mammals. Therefore, enzymes of this pathway are attractive targets for the development of non-toxic antimicrobial compounds, herbicides and anti-parasitic agents. The activity of DHQS requires nicotinamide adenine dinucleotide (NAD) as cofactor. A single active site in DHQS catalyzes five sequential reactions involving alcohol oxidation, phosphate elimination, carbonyl reduction, ring opening, and intramolecular aldol
Probab=20.64 E-value=2.7e+02 Score=20.94 Aligned_cols=37 Identities=14% Similarity=0.264 Sum_probs=24.6
Q ss_pred CCeEEEEeCCCCcCc-------hhhhcCCceEEEeecCCCCCCC
Q psy12446 39 QVKNVVRVCEPTYKV-------EDLKTEGINVKDLAYEDGTSPS 75 (77)
Q Consensus 39 ~Vt~VVRvCe~tYd~-------~~~~~~GI~vhdl~F~DG~~Pp 75 (77)
+.+.++-++.+.... +.++++|+.+.-+.|++|.+-|
T Consensus 23 ~~~~~livtd~~~~~~~~~~l~~~L~~~g~~~~~~~~~~~e~~~ 66 (345)
T cd08195 23 KGSKILIVTDENVAPLYLEKLKAALEAAGFEVEVIVIPAGEASK 66 (345)
T ss_pred CCCeEEEEECCchHHHHHHHHHHHHHhcCCceEEEEeCCCCCcC
Confidence 335666666654322 5577889888878888887643
No 204
>cd00562 NifX_NifB This CD represents a family of iron-molybdenum cluster-binding proteins that includes NifB, NifX, and NifY, all of which are involved in the synthesis of an iron-molybdenum cofactor (FeMo-co) that binds the active site of the dinitrogenase enzyme. This domain is a predicted small-molecule-binding domain (SMBD) with an alpha/beta fold that is present either as a stand-alone domain (e.g. NifX and NifY) or fused to another conserved domain (e.g. NifB) however, its function is still undetermined.The SCOP database suggests that this domain is most similar to structures within the ribonuclease H superfamily. This conserved domain is represented in two of the three major divisions of life (bacteria and archaea).
Probab=20.58 E-value=1.8e+02 Score=17.27 Aligned_cols=35 Identities=23% Similarity=0.273 Sum_probs=24.4
Q ss_pred HHHHHHHHhCCCeEEE--EeCCCCcCchhhhcCCceEEE
Q psy12446 29 HLAYHELKKHQVKNVV--RVCEPTYKVEDLKTEGINVKD 65 (77)
Q Consensus 29 ~~yi~elk~~~Vt~VV--RvCe~tYd~~~~~~~GI~vhd 65 (77)
..-.+.+..+||.-|+ +.++..+ ..|+++||++..
T Consensus 51 ~~~~~~l~~~~v~~vi~~~iG~~a~--~~l~~~gI~v~~ 87 (102)
T cd00562 51 KLAARLLALEGCDAVLVGGIGGPAA--AKLEAAGIKPIK 87 (102)
T ss_pred hHHHHHHHHCCCcEEEEcccCccHH--HHHHHcCCEEEE
Confidence 3445667789999888 3333333 678999999864
No 205
>PF00532 Peripla_BP_1: Periplasmic binding proteins and sugar binding domain of LacI family; InterPro: IPR001761 This family includes the periplasmic binding proteins, and the LacI family transcriptional regulators. The periplasmic binding proteins are the primary receptors for chemotaxis and transport of many sugar based solutes. The LacI family of proteins consist of transcriptional regulators related to the lac repressor. In this case, generally the sugar binding domain binds a sugar which changes the DNA binding activity of the repressor domain (lacI) [, ].; PDB: 1BAP_A 7ABP_A 6ABP_A 1ABF_A 5ABP_A 2WRZ_B 9ABP_A 1APB_A 1ABE_A 8ABP_A ....
Probab=20.57 E-value=2.7e+02 Score=20.04 Aligned_cols=48 Identities=15% Similarity=0.122 Sum_probs=33.3
Q ss_pred eeEEe-ecCCcccHHHHHHHHHhCCCeEEEEeCCCCcC-c--hhhhcCCceEE
Q psy12446 16 VTLYH-GVNLENHLHLAYHELKKHQVKNVVRVCEPTYK-V--EDLKTEGINVK 64 (77)
Q Consensus 16 ~~fli-~~pt~~tl~~yi~elk~~~Vt~VVRvCe~tYd-~--~~~~~~GI~vh 64 (77)
...++ ....+...+++++.|.+++|.-++=+ ....+ . ..+.+.||-+.
T Consensus 32 y~l~l~~t~~~~~~e~~i~~l~~~~vDGiI~~-s~~~~~~~l~~~~~~~iPvV 83 (279)
T PF00532_consen 32 YQLLLCNTGDDEEKEEYIELLLQRRVDGIILA-SSENDDEELRRLIKSGIPVV 83 (279)
T ss_dssp CEEEEEEETTTHHHHHHHHHHHHTTSSEEEEE-SSSCTCHHHHHHHHTTSEEE
T ss_pred CEEEEecCCCchHHHHHHHHHHhcCCCEEEEe-cccCChHHHHHHHHcCCCEE
Confidence 34444 55666666699999999999999976 43344 2 45566688765
No 206
>PRK13902 alaS alanyl-tRNA synthetase; Provisional
Probab=20.51 E-value=73 Score=28.10 Aligned_cols=51 Identities=16% Similarity=0.071 Sum_probs=35.7
Q ss_pred ccHHHHHHHHHhCCCeEEEEeCC-CCc-CchhhhcCCceEEEeecCCCCCCCC
Q psy12446 26 NHLHLAYHELKKHQVKNVVRVCE-PTY-KVEDLKTEGINVKDLAYEDGTSPSP 76 (77)
Q Consensus 26 ~tl~~yi~elk~~~Vt~VVRvCe-~tY-d~~~~~~~GI~vhdl~F~DG~~Pp~ 76 (77)
.--+.|++.|+++|-+.|=|-=- |.+ |.-.|+.+||.-.-=.|..|..||+
T Consensus 63 eiR~~Fl~FF~~~gH~~v~s~pvvprw~dDllft~Agm~~Fkp~f~~G~~~pp 115 (900)
T PRK13902 63 EMREKFLSFFEKHGHTRIERYPVVARWRDDVYLTIASIYDFQPWVTSGLVPPP 115 (900)
T ss_pred HHHHHHHHHHHhCCCEEcCCcCcCCCCCCCeeeeecchhhhhHHhcCCCCCCC
Confidence 34578999999999766533221 343 3377899999887777777886664
No 207
>PF02635 DrsE: DsrE/DsrF-like family; InterPro: IPR003787 Four small, soluble proteins (DsrE, DsrF, DsrH and DsrC) are encoded in the dsr gene region of the phototrophic sulphur bacterium Chromatium vinosum D. The dsrAB genes encoding dissimilatory sulphite reductase are part of the gene cluster, dsrABEFHCMK. The remaining proteins that are encoded are a transmembrane protein (DsrM) with similarity to haem-b-binding polypeptides and a soluble protein (DsrK) resembling [4Fe-4S]-cluster-containing heterodisulphide reductase from methanogenic archaea. DsrE is a small soluble protein involved in intracellular sulphur reduction [].; PDB: 1L1S_A 2HYB_B 2HY5_B 2PD2_B 3MC3_A 2D1P_H 1JX7_B 2FB6_A.
Probab=20.40 E-value=1.9e+02 Score=17.29 Aligned_cols=24 Identities=38% Similarity=0.412 Sum_probs=17.9
Q ss_pred CcccHHHHHHHHHhCC-CeEEEEeCCC
Q psy12446 24 LENHLHLAYHELKKHQ-VKNVVRVCEP 49 (77)
Q Consensus 24 t~~tl~~yi~elk~~~-Vt~VVRvCe~ 49 (77)
...++...+++|+++| |+-. +|+.
T Consensus 59 ~~~~~~~~l~~l~~~g~v~i~--~C~~ 83 (122)
T PF02635_consen 59 GDPPLQELLKELKEAGGVKIY--VCET 83 (122)
T ss_dssp TSHCHHHHHHHHHHTTT-EEE--EEHH
T ss_pred ccccHHHHHHHHHhcCCcEEE--EcHH
Confidence 4677999999999997 7744 5663
No 208
>PF07997 DUF1694: Protein of unknown function (DUF1694); InterPro: IPR012543 This family contains many hypothetical proteins.; PDB: 2OHW_A.
Probab=20.30 E-value=1.3e+02 Score=20.13 Aligned_cols=41 Identities=22% Similarity=0.218 Sum_probs=25.3
Q ss_pred eccceeEEeecC-CcccHHHHHHHHHhCCCeEEEEeCCCCcCc
Q psy12446 12 DYKTVTLYHGVN-LENHLHLAYHELKKHQVKNVVRVCEPTYKV 53 (77)
Q Consensus 12 ~y~~~~fli~~p-t~~tl~~yi~elk~~~Vt~VVRvCe~tYd~ 53 (77)
+|.+.+.+|--. ..+.+..||+.-+++|+.-=+ +.++.+..
T Consensus 60 ~~~~~~l~ing~l~~~~~~~YiklA~~~~~~fTi-v~~~~~~~ 101 (120)
T PF07997_consen 60 DYPNYKLKINGNLDYSFQSKYIKLANKHGIPFTI-VNDPEYHS 101 (120)
T ss_dssp C-SSEEEEEETTS-HHHHHHHHHHHHHTT--EEE-E---SS--
T ss_pred hCCCeEEEEcCCCCHHHHHHHHHHHHHcCCCEEE-eCCCCccC
Confidence 478888999655 777899999999999997654 34444443
No 209
>PF05378 Hydant_A_N: Hydantoinase/oxoprolinase N-terminal region; InterPro: IPR008040 This domain is found at the N terminus of the hydantoinase/oxoprolinase IPR002821 from INTERPRO family.
Probab=20.30 E-value=1.9e+02 Score=20.08 Aligned_cols=30 Identities=27% Similarity=0.248 Sum_probs=25.4
Q ss_pred CcccHHHHHHHHHhCCCeEEEEeCCCCcCc
Q psy12446 24 LENHLHLAYHELKKHQVKNVVRVCEPTYKV 53 (77)
Q Consensus 24 t~~tl~~yi~elk~~~Vt~VVRvCe~tYd~ 53 (77)
....+...+++++..||..|+-+..-.|.-
T Consensus 132 d~~~v~~~~~~l~~~gv~avAV~~~fS~~n 161 (176)
T PF05378_consen 132 DEDEVREALRELKDKGVEAVAVSLLFSYRN 161 (176)
T ss_pred CHHHHHHHHHHHHhCCCCEEEEECccCCCC
Confidence 567799999999999999999888766554
No 210
>PF04007 DUF354: Protein of unknown function (DUF354); InterPro: IPR007152 Members of this family are around 350 amino acids in length. They are found in archaea and some bacteria and have no known function.
Probab=20.23 E-value=1.7e+02 Score=22.69 Aligned_cols=49 Identities=14% Similarity=0.179 Sum_probs=33.4
Q ss_pred eeEEeecCCc---ccHHHHHHHHHhCCCeEEEEeCCCCcCchhhhcCCceEE
Q psy12446 16 VTLYHGVNLE---NHLHLAYHELKKHQVKNVVRVCEPTYKVEDLKTEGINVK 64 (77)
Q Consensus 16 ~~fli~~pt~---~tl~~yi~elk~~~Vt~VVRvCe~tYd~~~~~~~GI~vh 64 (77)
||-+||..+. .-....|++|++.|..-+|=.-+..--.++++..||...
T Consensus 1 MkIwiDi~~p~hvhfFk~~I~eL~~~GheV~it~R~~~~~~~LL~~yg~~y~ 52 (335)
T PF04007_consen 1 MKIWIDITHPAHVHFFKNIIRELEKRGHEVLITARDKDETEELLDLYGIDYI 52 (335)
T ss_pred CeEEEECCCchHHHHHHHHHHHHHhCCCEEEEEEeccchHHHHHHHcCCCeE
Confidence 6778875433 344567889999988777644443334488888898765
No 211
>cd04878 ACT_AHAS N-terminal ACT domain of the Escherichia coli IlvH-like regulatory subunit of acetohydroxyacid synthase (AHAS). ACT_AHAS: N-terminal ACT domain of the Escherichia coli IlvH-like regulatory subunit of acetohydroxyacid synthase (AHAS). AHAS catalyses the first common step in the biosynthesis of the three branched-chain amino acids. The first step involves the condensation of either pyruvate or 2-ketobutyrate with the two-carbon hydroxyethyl fragment derived from another pyruvate molecule, covalently bound to the coenzyme thiamine diphosphate. Bacterial AHASs generally consist of regulatory and catalytic subunits. The effector (valine) binding sites are proposed to be located in two symmetrically related positions in the interface between a pair of N-terminal ACT domains with the C-terminal domain of IlvH contacting the catalytic dimer. Plants Arabidopsis and Oryza have tandem IlvH subunits; both the first and second ACT domain sequences are present in this CD. Members of
Probab=20.19 E-value=1.6e+02 Score=15.65 Aligned_cols=22 Identities=14% Similarity=0.119 Sum_probs=16.5
Q ss_pred eeEEeecCCcccHHHHHHHHHhC
Q psy12446 16 VTLYHGVNLENHLHLAYHELKKH 38 (77)
Q Consensus 16 ~~fli~~pt~~tl~~yi~elk~~ 38 (77)
+.|.+..++ ..++..++.|++.
T Consensus 44 ~~~~~~~~~-~~~~~l~~~l~~~ 65 (72)
T cd04878 44 ITIVVEGDD-DVIEQIVKQLNKL 65 (72)
T ss_pred EEEEEECCH-HHHHHHHHHHhCC
Confidence 555556566 8999999999863
No 212
>cd07856 STKc_Sty1_Hog1 Catalytic domain of the Serine/Threonine Kinases, Fungal Mitogen-Activated Protein Kinases Sty1 and Hog1. Serine/Threonine Kinases (STKs), Fungal Mitogen-Activated Protein Kinase (MAPK) Sty1/Hog1 subfamily, catalytic (c) domain. STKs catalyze the transfer of the gamma-phosphoryl group from ATP to serine/threonine residues on protein substrates. The Sty1/Hog1 subfamily is part of a larger superfamily that includes the catalytic domains of other protein STKs, protein tyrosine kinases, RIO kinases, aminoglycoside phosphotransferase, choline kinase, and phosphoinositide 3-kinase. This subfamily is composed of the MAPKs Sty1 from Schizosaccharomyces pombe, Hog1 from Saccharomyces cerevisiae, and similar proteins. MAPKs are important mediators of cellular responses to extracellular signals. Sty1 and Hog1 are stress-activated MAPKs that partipate in transcriptional regulation in response to stress. Sty1 is activated in response to oxidative stress, osmotic stress, and U
Probab=20.18 E-value=44 Score=24.14 Aligned_cols=30 Identities=20% Similarity=0.310 Sum_probs=21.9
Q ss_pred EEEEeCCCCcCchhhhcCCceEEEeecCCCCCCCC
Q psy12446 42 NVVRVCEPTYKVEDLKTEGINVKDLAYEDGTSPSP 76 (77)
Q Consensus 42 ~VVRvCe~tYd~~~~~~~GI~vhdl~F~DG~~Pp~ 76 (77)
-+++.+.|+++.- ..+|| ||.|+|..-|.+
T Consensus 290 ~~~~~~~~~~~~~--~~~~~---~~~~~~~~~~~~ 319 (328)
T cd07856 290 YLAPYHDPTDEPV--AEEKF---DWSFNDADLPVD 319 (328)
T ss_pred ccccccCCccccC--chhhc---CCchhcccCCHH
Confidence 5677888888854 34455 899999988753
No 213
>COG0735 Fur Fe2+/Zn2+ uptake regulation proteins [Inorganic ion transport and metabolism]
Probab=20.04 E-value=54 Score=22.17 Aligned_cols=48 Identities=23% Similarity=0.349 Sum_probs=29.8
Q ss_pred eecCCcccHHHHHHHHHhCCCeEEEEeCCCC-cCc-hhhhcCCceEEEeecCCCC
Q psy12446 20 HGVNLENHLHLAYHELKKHQVKNVVRVCEPT-YKV-EDLKTEGINVKDLAYEDGT 72 (77)
Q Consensus 20 i~~pt~~tl~~yi~elk~~~Vt~VVRvCe~t-Yd~-~~~~~~GI~vhdl~F~DG~ 72 (77)
..++..-+.++=.+++++.+.+.=+ +| |++ +.|+++|| |+.+.|.+|.
T Consensus 31 ~~~~~~~sAeei~~~l~~~~p~isl----aTVYr~L~~l~e~Gl-v~~~~~~~~~ 80 (145)
T COG0735 31 LEADGHLSAEELYEELREEGPGISL----ATVYRTLKLLEEAGL-VHRLEFEGGK 80 (145)
T ss_pred HhcCCCCCHHHHHHHHHHhCCCCCH----hHHHHHHHHHHHCCC-EEEEEeCCCE
Confidence 3333334555555666664433211 33 777 88999999 8889888873
No 214
>TIGR01744 XPRTase xanthine phosphoribosyltransferase. This model represent a xanthine-specific phosphoribosyltransferase of Bacillus subtilis and closely related proteins from other species, mostly from other Gram-positive bacteria. The adjacent gene is a xanthine transporter; B. subtilis can import xanthine for the purine salvage pathway or for catabolism to obtain nitrogen.
Probab=20.00 E-value=3e+02 Score=19.44 Aligned_cols=58 Identities=17% Similarity=0.130 Sum_probs=40.4
Q ss_pred cceeEEe--e-cCCcccHHHHHHHHHhCCCeEEEEeC--CCCc--CchhhhcCCceEEEe----ecCCC
Q psy12446 14 KTVTLYH--G-VNLENHLHLAYHELKKHQVKNVVRVC--EPTY--KVEDLKTEGINVKDL----AYEDG 71 (77)
Q Consensus 14 ~~~~fli--~-~pt~~tl~~yi~elk~~~Vt~VVRvC--e~tY--d~~~~~~~GI~vhdl----~F~DG 71 (77)
++=|.|| | .-|..|+..-++.+++.|.+-+--+| +... ..+.+++.|+.++-+ .++||
T Consensus 116 ~G~rVLIVDDvvtTGgT~~a~~~ll~~aGa~Vvgv~~lvd~~~~~g~~~l~~~gvpv~sL~~~~~l~~g 184 (191)
T TIGR01744 116 DQDRVLIIDDFLANGQAAHGLVDIAKQAGAKIAGIGIVIEKSFQNGRQELVELGYRVESLARIQSLEEG 184 (191)
T ss_pred CcCEEEEEEehhccChHHHHHHHHHHHCCCEEEEEEEEEEecCccHHHHHHhcCCcEEEEEEEeeEeCC
Confidence 4556777 4 46999999999999999976655544 3321 247778889887743 25555
Done!