Query psy1868
Match_columns 108
No_of_seqs 138 out of 1073
Neff 5.9
Searched_HMMs 46136
Date Fri Aug 16 18:21:59 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy1868.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/1868hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PLN02229 alpha-galactosidase 100.0 1.5E-32 3.3E-37 224.8 7.8 85 14-108 51-135 (427)
2 KOG2366|consensus 100.0 1.2E-32 2.6E-37 222.0 6.9 95 14-108 21-115 (414)
3 PLN02808 alpha-galactosidase 100.0 2.6E-32 5.7E-37 221.2 6.8 85 14-108 20-104 (386)
4 PLN02692 alpha-galactosidase 100.0 1.3E-31 2.7E-36 218.6 7.1 83 16-108 46-128 (412)
5 PLN02899 alpha-galactosidase 100.0 7.8E-31 1.7E-35 221.5 7.4 87 11-108 16-120 (633)
6 PLN03231 putative alpha-galact 100.0 4.3E-29 9.4E-34 200.8 5.3 72 26-108 1-93 (357)
7 PF02065 Melibiase: Melibiase; 99.8 3.8E-21 8.2E-26 156.4 4.9 73 20-108 35-111 (394)
8 COG3345 GalA Alpha-galactosida 99.6 1.3E-15 2.8E-20 128.5 6.0 72 20-107 286-361 (687)
9 PLN02219 probable galactinol-- 99.2 8.1E-12 1.8E-16 108.4 5.8 72 23-107 191-291 (775)
10 PLN02355 probable galactinol-- 99.2 1.4E-11 2.9E-16 107.0 5.8 51 24-87 196-251 (758)
11 PLN02711 Probable galactinol-- 99.2 4.8E-11 1.1E-15 103.6 5.8 44 28-84 215-258 (777)
12 PLN02684 Probable galactinol-- 99.1 1.1E-10 2.4E-15 101.2 5.2 66 28-107 204-290 (750)
13 cd06592 GH31_glucosidase_KIAA1 99.1 1.7E-10 3.6E-15 90.6 4.9 65 24-107 12-76 (303)
14 PF05691 Raffinose_syn: Raffin 99.0 4.9E-10 1.1E-14 97.5 4.5 47 28-87 197-243 (747)
15 PLN02982 galactinol-raffinose 98.4 3.5E-07 7.6E-12 80.4 4.8 46 29-87 225-270 (865)
16 cd06593 GH31_xylosidase_YicI Y 97.8 4.1E-05 8.8E-10 59.8 4.7 52 50-107 21-72 (308)
17 cd06589 GH31 The enzymes of gl 97.6 9E-05 1.9E-09 57.0 4.5 50 50-107 21-72 (265)
18 cd06595 GH31_xylosidase_XylS-l 97.4 0.00027 5.8E-09 55.3 4.7 54 50-107 22-80 (292)
19 cd06599 GH31_glycosidase_Aec37 97.2 0.00046 9.9E-09 54.6 4.5 54 50-107 26-79 (317)
20 cd06598 GH31_transferase_CtsZ 97.1 0.0007 1.5E-08 53.5 4.6 53 51-107 22-76 (317)
21 cd06602 GH31_MGAM_SI_GAA This 96.9 0.0014 2.9E-08 52.5 4.5 50 50-107 21-72 (339)
22 cd06601 GH31_lyase_GLase GLase 96.9 0.0015 3.2E-08 52.5 4.6 50 50-107 21-70 (332)
23 cd06603 GH31_GANC_GANAB_alpha 96.9 0.0014 3.1E-08 52.1 4.3 50 50-107 21-70 (339)
24 cd06591 GH31_xylosidase_XylS X 96.8 0.0016 3.4E-08 51.6 4.3 52 50-107 21-72 (319)
25 cd06604 GH31_glucosidase_II_Ma 96.8 0.0017 3.7E-08 51.6 4.4 50 50-107 21-70 (339)
26 cd06600 GH31_MGAM-like This fa 96.6 0.0025 5.4E-08 50.5 4.1 49 51-107 22-70 (317)
27 PF01055 Glyco_hydro_31: Glyco 96.6 0.0036 7.7E-08 50.9 4.7 50 50-107 40-89 (441)
28 cd06594 GH31_glucosidase_YihQ 96.6 0.0041 9E-08 49.3 4.9 52 51-107 21-77 (317)
29 PRK10658 putative alpha-glucos 96.5 0.0051 1.1E-07 53.6 5.7 52 50-107 280-331 (665)
30 cd06597 GH31_transferase_CtsY 95.7 0.024 5.2E-07 45.4 5.2 30 50-83 21-50 (340)
31 PRK10426 alpha-glucosidase; Pr 95.1 0.036 7.8E-07 48.1 4.7 52 51-107 219-275 (635)
32 PLN02763 hydrolase, hydrolyzin 94.2 0.069 1.5E-06 48.7 4.6 49 51-107 199-247 (978)
33 COG1501 Alpha-glucosidases, fa 93.8 0.088 1.9E-06 46.9 4.3 49 50-106 277-326 (772)
34 PF10566 Glyco_hydro_97: Glyco 92.9 0.15 3.3E-06 40.3 4.0 29 50-83 29-57 (273)
35 KOG1065|consensus 90.6 0.34 7.4E-06 43.5 4.0 64 27-107 293-357 (805)
36 PF13670 PepSY_2: Peptidase pr 88.9 2 4.3E-05 27.3 5.7 41 50-96 29-74 (83)
37 PF07485 DUF1529: Domain of Un 69.6 3.9 8.4E-05 28.7 2.1 59 20-83 33-93 (123)
38 PF15240 Pro-rich: Proline-ric 66.1 4.6 0.0001 30.3 2.0 23 1-23 1-24 (179)
39 COG2894 MinD Septum formation 66.1 7 0.00015 30.9 3.1 25 50-79 97-121 (272)
40 PRK09121 5-methyltetrahydropte 57.9 8.1 0.00018 31.0 2.2 32 51-82 147-180 (339)
41 PRK06233 hypothetical protein; 56.0 9.4 0.0002 31.0 2.3 32 51-82 162-195 (372)
42 COG0436 Aspartate/tyrosine/aro 53.8 30 0.00064 28.2 4.9 38 41-84 171-208 (393)
43 PF08784 RPA_C: Replication pr 51.6 15 0.00033 24.0 2.4 25 50-79 77-101 (102)
44 PF01043 SecA_PP_bind: SecA pr 51.0 46 0.001 22.8 4.8 53 51-106 58-111 (113)
45 PRK06520 5-methyltetrahydropte 47.9 15 0.00033 29.8 2.3 32 51-82 161-194 (368)
46 PF08479 POTRA_2: POTRA domain 38.7 27 0.00058 21.5 1.9 26 47-73 28-53 (76)
47 TIGR01264 tyr_amTase_E tyrosin 37.9 79 0.0017 25.0 5.0 36 42-83 177-212 (401)
48 PLN02376 1-aminocyclopropane-1 37.0 89 0.0019 26.2 5.3 37 41-83 207-243 (496)
49 TIGR00268 conserved hypothetic 35.0 34 0.00074 26.0 2.4 14 66-79 233-246 (252)
50 PRK08960 hypothetical protein; 33.6 81 0.0018 24.8 4.4 36 42-83 174-209 (387)
51 PRK06855 aminotransferase; Val 33.6 1.2E+02 0.0026 24.6 5.5 37 41-83 179-215 (433)
52 cd02931 ER_like_FMN Enoate red 33.1 33 0.00071 27.9 2.1 53 48-100 138-204 (382)
53 PLN00145 tyrosine/nicotianamin 32.3 1.2E+02 0.0027 24.6 5.3 36 42-83 199-234 (430)
54 PLN02397 aspartate transaminas 31.9 92 0.002 25.2 4.5 38 41-84 202-239 (423)
55 PF02113 Peptidase_S13: D-Ala- 31.6 92 0.002 26.0 4.5 39 47-86 91-129 (444)
56 PF02606 LpxK: Tetraacyldisacc 31.0 65 0.0014 25.8 3.4 17 69-85 126-142 (326)
57 PRK00652 lpxK tetraacyldisacch 30.9 68 0.0015 25.8 3.6 17 69-85 140-156 (325)
58 cd06433 GT_2_WfgS_like WfgS an 30.9 83 0.0018 21.1 3.6 30 52-83 10-39 (202)
59 PF04079 DUF387: Putative tran 30.8 72 0.0016 23.1 3.4 35 51-86 25-61 (159)
60 PRK06207 aspartate aminotransf 30.4 1.4E+02 0.003 23.9 5.3 36 42-83 187-222 (405)
61 PLN02607 1-aminocyclopropane-1 30.1 1.2E+02 0.0026 25.0 5.0 36 42-83 209-244 (447)
62 cd03312 CIMS_N_terminal_like C 29.9 54 0.0012 26.5 2.9 32 52-83 174-207 (360)
63 cd06438 EpsO_like EpsO protein 29.6 93 0.002 21.4 3.7 30 52-81 9-38 (183)
64 PRK15458 tagatose 6-phosphate 29.2 59 0.0013 27.5 3.0 32 50-82 100-131 (426)
65 TIGR00666 PBP4 D-alanyl-D-alan 28.6 1.3E+02 0.0028 24.1 4.8 37 46-84 63-100 (345)
66 PF07302 AroM: AroM protein; 28.1 64 0.0014 24.9 2.8 32 51-87 163-195 (221)
67 TIGR02810 agaZ_gatZ D-tagatose 27.9 64 0.0014 27.3 3.0 31 50-81 96-126 (420)
68 TIGR03540 DapC_direct LL-diami 27.7 1.4E+02 0.003 23.4 4.8 36 42-83 173-208 (383)
69 PRK05942 aspartate aminotransf 27.6 1.5E+02 0.0032 23.5 4.9 35 42-82 179-213 (394)
70 PRK08354 putative aminotransfe 27.5 1.1E+02 0.0024 23.4 4.1 36 42-83 126-161 (311)
71 COG0463 WcaA Glycosyltransfera 27.2 92 0.002 19.6 3.1 28 54-83 17-44 (291)
72 TIGR01265 tyr_nico_aTase tyros 27.2 1.7E+02 0.0036 23.3 5.2 36 42-83 178-213 (403)
73 PLN02368 alanine transaminase 27.1 1.2E+02 0.0025 24.8 4.4 37 41-83 218-254 (407)
74 smart00674 CENPB Putative DNA- 27.1 29 0.00062 20.6 0.6 30 50-83 25-54 (66)
75 cd02933 OYE_like_FMN Old yello 26.8 59 0.0013 26.0 2.5 53 49-101 141-206 (338)
76 PRK06959 putative threonine-ph 26.8 1.4E+02 0.0029 23.4 4.6 40 36-83 135-174 (339)
77 PRK07337 aminotransferase; Val 26.5 1.3E+02 0.0028 23.6 4.4 36 42-83 172-207 (388)
78 PRK01906 tetraacyldisaccharide 26.3 96 0.0021 25.1 3.7 15 70-84 148-162 (338)
79 cd04192 GT_2_like_e Subfamily 26.3 1.2E+02 0.0026 21.0 3.8 29 52-80 9-37 (229)
80 PLN02656 tyrosine transaminase 26.3 1.4E+02 0.0031 23.9 4.7 36 42-83 178-213 (409)
81 PRK14582 pgaB outer membrane N 26.2 1.3E+02 0.0028 26.9 4.7 36 48-83 67-119 (671)
82 PRK06425 histidinol-phosphate 25.8 1.3E+02 0.0029 23.2 4.3 37 41-83 130-166 (332)
83 PF00155 Aminotran_1_2: Aminot 25.8 1.2E+02 0.0025 23.3 4.0 37 42-84 157-193 (363)
84 PRK09276 LL-diaminopimelate am 25.7 1.9E+02 0.0042 22.6 5.3 36 42-83 175-210 (385)
85 PTZ00433 tyrosine aminotransfe 25.7 1.6E+02 0.0036 23.5 4.9 36 42-83 186-221 (412)
86 PRK08636 aspartate aminotransf 25.5 1.6E+02 0.0035 23.4 4.9 36 42-83 184-219 (403)
87 PF09992 DUF2233: Predicted pe 25.3 59 0.0013 22.7 2.1 27 53-83 116-142 (170)
88 cd02932 OYE_YqiM_FMN Old yello 25.3 52 0.0011 26.0 1.9 52 49-100 143-207 (336)
89 PRK07568 aspartate aminotransf 25.2 1.7E+02 0.0037 22.9 4.8 36 42-83 171-206 (397)
90 cd06378 PBP1_iGluR_NMDA_NR2 N- 25.0 1.4E+02 0.0031 23.8 4.5 46 51-104 202-247 (362)
91 PF14871 GHL6: Hypothetical gl 25.0 1E+02 0.0022 21.5 3.2 43 50-92 41-88 (132)
92 TIGR00682 lpxK tetraacyldisacc 24.9 1.1E+02 0.0024 24.4 3.7 16 69-84 119-134 (311)
93 PRK08056 threonine-phosphate d 24.7 1.4E+02 0.003 23.2 4.3 37 41-83 150-186 (356)
94 COG1060 ThiH Thiamine biosynth 24.7 87 0.0019 25.7 3.2 31 46-81 86-116 (370)
95 PF08013 Tagatose_6_P_K: Tagat 24.4 89 0.0019 26.5 3.2 31 50-81 100-130 (424)
96 COG4874 Uncharacterized protei 24.3 1.4E+02 0.003 24.0 4.1 20 60-80 60-79 (318)
97 PLN00143 tyrosine/nicotianamin 24.3 1.5E+02 0.0031 23.8 4.4 36 42-83 179-214 (409)
98 PF03065 Glyco_hydro_57: Glyco 24.1 20 0.00044 28.5 -0.6 14 67-80 183-196 (360)
99 PRK05664 threonine-phosphate d 23.9 1.6E+02 0.0035 22.7 4.5 40 36-83 129-168 (330)
100 PRK05839 hypothetical protein; 23.5 1.3E+02 0.0029 23.6 4.0 37 41-83 163-199 (374)
101 PRK07550 hypothetical protein; 23.4 1.4E+02 0.0031 23.4 4.1 36 42-83 172-207 (386)
102 PRK07309 aromatic amino acid a 23.3 2.3E+02 0.0049 22.4 5.3 36 42-83 175-210 (391)
103 cd06913 beta3GnTL1_like Beta 1 23.3 1.4E+02 0.0031 21.0 3.8 30 51-81 8-37 (219)
104 PF08373 RAP: RAP domain; Int 23.2 50 0.0011 19.0 1.2 19 61-79 21-39 (58)
105 PLN02475 5-methyltetrahydropte 23.2 70 0.0015 28.9 2.5 32 51-82 577-610 (766)
106 PF01717 Meth_synt_2: Cobalami 23.1 78 0.0017 24.7 2.6 32 51-82 145-178 (324)
107 PRK08175 aminotransferase; Val 23.1 1.8E+02 0.0039 23.0 4.7 36 42-83 173-208 (395)
108 PRK09200 preprotein translocas 23.0 84 0.0018 28.6 3.0 49 56-106 287-335 (790)
109 PF06955 XET_C: Xyloglucan end 23.0 36 0.00079 20.0 0.5 9 95-103 39-47 (51)
110 COG0112 GlyA Glycine/serine hy 22.9 73 0.0016 26.9 2.4 23 53-76 287-309 (413)
111 PF13479 AAA_24: AAA domain 22.7 1.2E+02 0.0025 22.3 3.3 26 51-79 51-76 (213)
112 PRK15052 D-tagatose-1,6-bispho 22.5 91 0.002 26.4 2.9 31 50-81 97-127 (421)
113 PRK06348 aspartate aminotransf 22.5 1.7E+02 0.0038 23.0 4.5 37 41-83 170-206 (384)
114 KOG2366|consensus 22.4 78 0.0017 26.7 2.5 48 26-84 122-169 (414)
115 PF07244 Surf_Ag_VNR: Surface 22.2 1.4E+02 0.0031 17.5 3.1 29 48-77 30-58 (78)
116 PTZ00376 aspartate aminotransf 22.1 1.7E+02 0.0036 23.3 4.3 37 41-83 184-220 (404)
117 PF06543 Lac_bphage_repr: Lact 22.1 73 0.0016 19.1 1.7 29 25-63 15-43 (49)
118 COG4451 RbcS Ribulose bisphosp 21.9 1.1E+02 0.0024 21.7 2.9 25 50-75 19-43 (127)
119 PRK07366 succinyldiaminopimela 21.9 1.6E+02 0.0035 23.1 4.1 36 42-83 174-209 (388)
120 TIGR03815 CpaE_hom_Actino heli 21.8 1.1E+02 0.0025 23.6 3.3 27 50-82 189-215 (322)
121 PRK07865 N-succinyldiaminopime 21.8 1.7E+02 0.0036 22.8 4.2 37 41-83 156-192 (364)
122 PRK09257 aromatic amino acid a 21.7 1.8E+02 0.0039 23.0 4.4 37 41-83 180-216 (396)
123 PF13614 AAA_31: AAA domain; P 21.6 64 0.0014 21.7 1.6 28 50-83 102-129 (157)
124 PLN02450 1-aminocyclopropane-1 21.4 1.6E+02 0.0036 24.3 4.3 36 42-83 200-235 (468)
125 KOG0186|consensus 21.3 98 0.0021 26.8 2.9 41 23-75 313-353 (506)
126 PRK11113 D-alanyl-D-alanine ca 21.1 2E+02 0.0044 24.2 4.8 38 46-84 107-144 (477)
127 PRK06107 aspartate aminotransf 21.1 2.4E+02 0.0052 22.4 5.0 37 42-83 175-211 (402)
128 PF11148 DUF2922: Protein of u 20.8 1.6E+02 0.0034 18.0 3.2 23 47-69 23-45 (69)
129 KOG3858|consensus 20.6 74 0.0016 24.8 1.9 13 24-36 25-37 (233)
130 cd04747 OYE_like_5_FMN Old yel 20.6 73 0.0016 25.9 2.0 52 49-100 133-197 (361)
131 PF12419 DUF3670: SNF2 Helicas 20.6 1.2E+02 0.0025 21.2 2.8 32 47-80 81-112 (141)
132 PF07631 PSD4: Protein of unkn 20.5 21 0.00045 25.0 -1.1 45 51-101 34-80 (128)
133 cd02930 DCR_FMN 2,4-dienoyl-Co 20.5 72 0.0016 25.5 1.9 31 49-79 126-158 (353)
134 cd02525 Succinoglycan_BP_ExoA 20.4 1.7E+02 0.0037 20.5 3.7 31 51-81 11-41 (249)
135 TIGR01140 L_thr_O3P_dcar L-thr 20.4 1.9E+02 0.0042 22.2 4.2 36 42-83 134-169 (330)
136 TIGR03537 DapC succinyldiamino 20.3 1.9E+02 0.0042 22.4 4.2 36 42-83 145-180 (350)
137 PF01656 CbiA: CobQ/CobB/MinD/ 20.3 1.3E+02 0.0028 20.7 3.0 10 71-80 94-103 (195)
No 1
>PLN02229 alpha-galactosidase
Probab=99.97 E-value=1.5e-32 Score=224.78 Aligned_cols=85 Identities=46% Similarity=0.862 Sum_probs=80.8
Q ss_pred HhhhccCCCCCCCceeeechhhhhccCCCCCCCCCCCCHHHHHHHHHHHHHccccccCceEEEeccccccCCCCCCCCee
Q psy1868 14 VSKALDNGVALTPPMGWLAWERFRCNTDCKNDPENCISERLFRTMADLVVSEGYAAVGYEYINIDDCWLDKTRSFNGRLQ 93 (108)
Q Consensus 14 ~~~~~~~~~~~~PpmGWnSW~~~~~~~~~~~~~~~~i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~~~rd~~G~~~ 93 (108)
.+..++|++++|||||||||+.|+|+ |||+.|+++||+|++.||+++||+||+|||||+...||+.|+++
T Consensus 51 ~~~~~~ngla~tPpmGWnSWn~~~~~----------i~E~~i~~~ad~~v~~Gl~~~Gy~yv~iDDgW~~~~rd~~G~l~ 120 (427)
T PLN02229 51 GRLQLNNGLARTPQMGWNSWNFFACN----------INETVIKETADALVSTGLADLGYIHVNIDDCWSNLKRDSKGQLV 120 (427)
T ss_pred hhhhccCCccCCCCceEEchhhhCcc----------cCHHHHHHHHHHHHHhHHHhCCCEEEEEcCCcCCCCcCCCCCEE
Confidence 44566899999999999999999999 99999999999999999999999999999999988899999999
Q ss_pred ccCCCCCCChhhhhC
Q psy1868 94 ADAKRFPRGIADLSN 108 (108)
Q Consensus 94 ~d~~rFP~G~k~Lad 108 (108)
||++|||+|||+|+|
T Consensus 121 ~d~~rFP~G~k~lad 135 (427)
T PLN02229 121 PDPKTFPSGIKLLAD 135 (427)
T ss_pred EChhhcCCcHHHHHH
Confidence 999999999999986
No 2
>KOG2366|consensus
Probab=99.97 E-value=1.2e-32 Score=221.98 Aligned_cols=95 Identities=60% Similarity=1.119 Sum_probs=92.6
Q ss_pred HhhhccCCCCCCCceeeechhhhhccCCCCCCCCCCCCHHHHHHHHHHHHHccccccCceEEEeccccccCCCCCCCCee
Q psy1868 14 VSKALDNGVALTPPMGWLAWERFRCNTDCKNDPENCISERLFRTMADLVVSEGYAAVGYEYINIDDCWLDKTRSFNGRLQ 93 (108)
Q Consensus 14 ~~~~~~~~~~~~PpmGWnSW~~~~~~~~~~~~~~~~i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~~~rd~~G~~~ 93 (108)
.+.+++||++++|||||+||+.|.|+.+|-.+|+.||+|+.++++||+|++.|+++.||+||+|||||.+.+||.+|+++
T Consensus 21 ~~~~l~NGLg~tP~MGw~sW~~f~cniDCv~~pd~cIsE~l~~~~ad~mvseG~~~vGY~yi~iDDCW~e~~Rd~~grLv 100 (414)
T KOG2366|consen 21 GRMSLNNGLGRTPQMGWNSWERFRCNIDCVFGPDFCISEQLFKEMADAMVSEGLADVGYEYINIDDCWSEVTRDSDGRLV 100 (414)
T ss_pred hheeeccccccCCCcccccccceeeecccccCCccchhHHHHHHHHHHHHHhHHHhcCcEEEechhhhhhhccCCccccc
Confidence 47889999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred ccCCCCCCChhhhhC
Q psy1868 94 ADAKRFPRGIADLSN 108 (108)
Q Consensus 94 ~d~~rFP~G~k~Lad 108 (108)
+|++|||+||++|+|
T Consensus 101 a~~~rFP~Gi~~lad 115 (414)
T KOG2366|consen 101 ADPSRFPSGIKALAD 115 (414)
T ss_pred cChhhcccchhhhhh
Confidence 999999999999986
No 3
>PLN02808 alpha-galactosidase
Probab=99.97 E-value=2.6e-32 Score=221.24 Aligned_cols=85 Identities=49% Similarity=0.996 Sum_probs=81.2
Q ss_pred HhhhccCCCCCCCceeeechhhhhccCCCCCCCCCCCCHHHHHHHHHHHHHccccccCceEEEeccccccCCCCCCCCee
Q psy1868 14 VSKALDNGVALTPPMGWLAWERFRCNTDCKNDPENCISERLFRTMADLVVSEGYAAVGYEYINIDDCWLDKTRSFNGRLQ 93 (108)
Q Consensus 14 ~~~~~~~~~~~~PpmGWnSW~~~~~~~~~~~~~~~~i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~~~rd~~G~~~ 93 (108)
.+.+++|+++++||||||||+.|+|+ |||++|+++||+|++.||+++||+||+|||||+...||++|+++
T Consensus 20 ~~~~~~ngla~tPpmGWnsW~~~~~~----------i~e~~i~~~a~~mv~~Gl~~~Gy~yv~iDd~W~~~~rd~~G~~~ 89 (386)
T PLN02808 20 SRNLLDNGLGLTPQMGWNSWNHFQCN----------INETLIKQTADAMVSSGLAALGYKYINLDDCWAELKRDSQGNLV 89 (386)
T ss_pred hhhcccCcccCCCcceEEchHHHCCC----------CCHHHHHHHHHHHHHcchHHhCCEEEEEcCCcCCCCcCCCCCEe
Confidence 34567999999999999999999999 99999999999999999999999999999999988899999999
Q ss_pred ccCCCCCCChhhhhC
Q psy1868 94 ADAKRFPRGIADLSN 108 (108)
Q Consensus 94 ~d~~rFP~G~k~Lad 108 (108)
||++|||+|||+|+|
T Consensus 90 ~d~~rFP~G~~~lad 104 (386)
T PLN02808 90 PKASTFPSGIKALAD 104 (386)
T ss_pred eChhhcCccHHHHHH
Confidence 999999999999986
No 4
>PLN02692 alpha-galactosidase
Probab=99.97 E-value=1.3e-31 Score=218.60 Aligned_cols=83 Identities=43% Similarity=0.983 Sum_probs=79.6
Q ss_pred hhccCCCCCCCceeeechhhhhccCCCCCCCCCCCCHHHHHHHHHHHHHccccccCceEEEeccccccCCCCCCCCeecc
Q psy1868 16 KALDNGVALTPPMGWLAWERFRCNTDCKNDPENCISERLFRTMADLVVSEGYAAVGYEYINIDDCWLDKTRSFNGRLQAD 95 (108)
Q Consensus 16 ~~~~~~~~~~PpmGWnSW~~~~~~~~~~~~~~~~i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~~~rd~~G~~~~d 95 (108)
.+++|++++|||||||||+.|+|+ |||+.|+++||+|++.||+++||+||+|||||+...||++|+++||
T Consensus 46 ~~~~ngla~tPpmGWnSW~~~~~~----------i~E~~i~~~ad~~~~~gl~~~Gy~yv~iDDgW~~~~rd~~G~~~~d 115 (412)
T PLN02692 46 NLLANGLGITPPMGWNSWNHFSCK----------IDEKMIKETADALVSTGLSKLGYTYVNIDDCWAEIARDEKGNLVPK 115 (412)
T ss_pred hcccCcCcCCCcceEEchhhhCcc----------cCHHHHHHHHHHHHhccchhcCcEEEEEcCCcCCCCCCCCCCeeeC
Confidence 445899999999999999999999 9999999999999999999999999999999998899999999999
Q ss_pred CCCCCCChhhhhC
Q psy1868 96 AKRFPRGIADLSN 108 (108)
Q Consensus 96 ~~rFP~G~k~Lad 108 (108)
++|||+|||+|+|
T Consensus 116 ~~kFP~G~k~lad 128 (412)
T PLN02692 116 KSTFPSGIKALAD 128 (412)
T ss_pred hhhcCCcHHHHHH
Confidence 9999999999986
No 5
>PLN02899 alpha-galactosidase
Probab=99.97 E-value=7.8e-31 Score=221.48 Aligned_cols=87 Identities=29% Similarity=0.548 Sum_probs=79.0
Q ss_pred HHHHhhhccCCCCCCCceeeechhhhhccCCCCCCCCCCCCHHHHHHHHHHHHHccccccCceEEEeccccccCC-----
Q psy1868 11 LVAVSKALDNGVALTPPMGWLAWERFRCNTDCKNDPENCISERLFRTMADLVVSEGYAAVGYEYINIDDCWLDKT----- 85 (108)
Q Consensus 11 ~~~~~~~~~~~~~~~PpmGWnSW~~~~~~~~~~~~~~~~i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~~~----- 85 (108)
.+-.+.+.++++++|||||||||+.|.|+ |||+.|+++||. ++.||+++||+||+|||||+.+.
T Consensus 16 ~~~~~~~~~~glA~TPPMGWNSWn~f~~~----------I~E~~i~~~Ad~-vs~GLk~~GY~YVnIDDcW~~~~~~g~~ 84 (633)
T PLN02899 16 SLWIGASSQQQLASFPPRGWNSYDSFSWI----------VSEEEFLQNAEI-VSQRLLPFGYEYVVVDYLWYRKKVEGAY 84 (633)
T ss_pred hhhccccccCcccCCCCCCCcchhhhccC----------CCHHHHHHHHHH-HHcchHhhCCeEEEEccccccccccccc
Confidence 33466778999999999999999999999 999999999995 68899999999999999998653
Q ss_pred --------CCCCCCeeccCCCCCC-----ChhhhhC
Q psy1868 86 --------RSFNGRLQADAKRFPR-----GIADLSN 108 (108)
Q Consensus 86 --------rd~~G~~~~d~~rFP~-----G~k~Lad 108 (108)
||++|+++||++|||+ |||+|||
T Consensus 85 ~~s~g~~~~D~~GrLvPDp~RFPSs~~g~GmK~LAD 120 (633)
T PLN02899 85 VDSLGFDVIDEWGRPIPDPGRWPSSRGGKGFTEVAE 120 (633)
T ss_pred cccccccccCCCCCCccCcccCCCCccCCCcHHHHH
Confidence 6889999999999998 9999986
No 6
>PLN03231 putative alpha-galactosidase; Provisional
Probab=99.95 E-value=4.3e-29 Score=200.84 Aligned_cols=72 Identities=40% Similarity=0.700 Sum_probs=68.0
Q ss_pred CceeeechhhhhccCCCCCCCCCCCCHHHHHHHHHHHHHccccccCceEEEeccccccC----------------CCCCC
Q psy1868 26 PPMGWLAWERFRCNTDCKNDPENCISERLFRTMADLVVSEGYAAVGYEYINIDDCWLDK----------------TRSFN 89 (108)
Q Consensus 26 PpmGWnSW~~~~~~~~~~~~~~~~i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~~----------------~rd~~ 89 (108)
||||||||+.|+|+ |||+.|+++|| |+++||+++||+||+|||||+.+ .||++
T Consensus 1 PpMGWNSWn~f~~~----------i~E~~i~~~Ad-~v~~gL~~~GY~Yv~iDd~W~~~~~~g~~~~~~~~~~~~~~d~~ 69 (357)
T PLN03231 1 PPRGWNSYDSFSFT----------ISEEQFLENAK-IVSETLKPHGYEYVVIDYLWYRKLKHGWFKTSAKSPGYDLIDKW 69 (357)
T ss_pred CCCCccchhccCcC----------cCHHHHHHHHH-HHHcchHHhCCEEEEECCcccccccccccccccccccccccCCC
Confidence 89999999999999 99999999999 78999999999999999999864 26889
Q ss_pred CCeeccCCCCCC-----ChhhhhC
Q psy1868 90 GRLQADAKRFPR-----GIADLSN 108 (108)
Q Consensus 90 G~~~~d~~rFP~-----G~k~Lad 108 (108)
|+++||++|||+ |||+|+|
T Consensus 70 G~l~pd~~rFPs~~~~~G~k~lAD 93 (357)
T PLN03231 70 GRPLPDPKRWPSTTGGKGFAPIAA 93 (357)
T ss_pred CCcccCcccCCCCccccCcHHHHH
Confidence 999999999999 9999986
No 7
>PF02065 Melibiase: Melibiase; InterPro: IPR000111 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycosyl hydrolase family 27, family 31 and family 36 alpha-galactosidases form the glycosyl hydrolase clan GH-D (acc_GH from CAZY), a superfamily of alpha-galactosidases, alpha-N-acetylgalactosaminidases, and isomaltodextranases which are likely to share a common catalytic mechanism and structural topology. Alpha-galactosidase (3.2.1.22 from EC) (melibiase) [] catalyzes the hydrolysis of melibiose into galactose and glucose. In man, the deficiency of this enzyme is the cause of Fabry's disease (X-linked sphingolipidosis). Alpha-galactosidase is present in a variety of organisms. There is a considerable degree of similarity in the sequence of alpha-galactosidase from various eukaryotic species. Escherichia coli alpha-galactosidase (gene melA), which requires NAD and magnesium as cofactors, is not structurally related to the eukaryotic enzymes; by contrast, an Escherichia coli plasmid encoded alpha-galactosidase (gene rafA P16551 from SWISSPROT) [] contains a region of about 50 amino acids which is similar to a domain of the eukaryotic alpha-galactosidases. Alpha-N-acetylgalactosaminidase (3.2.1.49 from EC) [] catalyzes the hydrolysis of terminal non-reducing N-acetyl-D-galactosamine residues in N-acetyl-alpha-D- galactosaminides. In man, the deficiency of this enzyme is the cause of Schindler and Kanzaki diseases. The sequence of this enzyme is highly related to that of the eukaryotic alpha-galactosidases.; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 1KTC_A 1KTB_A 1UAS_A 3H55_A 3H53_A 3IGU_B 3H54_A 3LRM_A 3LRL_A 3LRK_A ....
Probab=99.83 E-value=3.8e-21 Score=156.45 Aligned_cols=73 Identities=37% Similarity=0.779 Sum_probs=59.0
Q ss_pred CCCCCCCceeeechhhhhccCCCCCCCCCCCCHHHHHHHHHHHHHccccccCceEEEeccccccCCCCCC----CCeecc
Q psy1868 20 NGVALTPPMGWLAWERFRCNTDCKNDPENCISERLFRTMADLVVSEGYAAVGYEYINIDDCWLDKTRSFN----GRLQAD 95 (108)
Q Consensus 20 ~~~~~~PpmGWnSW~~~~~~~~~~~~~~~~i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~~~rd~~----G~~~~d 95 (108)
+...++||||||||++++|+ +||+.|+++|+.+ +++||++|+|||||+.. |+.+ |+|.+|
T Consensus 35 ~~~~~~~pv~~nsW~~~~~d----------~~e~~i~~~a~~~-----~~~G~e~fviDDGW~~~-r~~d~~~~GdW~~~ 98 (394)
T PF02065_consen 35 PWRDKPPPVGWNSWEAYYFD----------ITEEKILELADAA-----AELGYEYFVIDDGWFGG-RDDDNAGLGDWEPD 98 (394)
T ss_dssp TTTTSS--EEEESHHHHTTG------------HHHHHHHHHHH-----HHHT-SEEEE-SSSBCT-ESTTTSTTSBECBB
T ss_pred ccCCCCCceEEEcccccCcC----------CCHHHHHHHHHHH-----HHhCCEEEEEcCccccc-cCCCcccCCceeEC
Confidence 34468899999999999999 9999999999998 47889999999999974 5444 999999
Q ss_pred CCCCCCChhhhhC
Q psy1868 96 AKRFPRGIADLSN 108 (108)
Q Consensus 96 ~~rFP~G~k~Lad 108 (108)
++|||+||++|++
T Consensus 99 ~~kFP~Gl~~l~~ 111 (394)
T PF02065_consen 99 PKKFPNGLKPLAD 111 (394)
T ss_dssp TTTSTTHHHHHHH
T ss_pred hhhhCCcHHHHHH
Confidence 9999999999974
No 8
>COG3345 GalA Alpha-galactosidase [Carbohydrate transport and metabolism]
Probab=99.59 E-value=1.3e-15 Score=128.52 Aligned_cols=72 Identities=26% Similarity=0.576 Sum_probs=62.6
Q ss_pred CCCCCCCceeeechhhhhccCCCCCCCCCCCCHHHHHHHHHHHHHccccccCceEEEeccccccCCCCCC----CCeecc
Q psy1868 20 NGVALTPPMGWLAWERFRCNTDCKNDPENCISERLFRTMADLVVSEGYAAVGYEYINIDDCWLDKTRSFN----GRLQAD 95 (108)
Q Consensus 20 ~~~~~~PpmGWnSW~~~~~~~~~~~~~~~~i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~~~rd~~----G~~~~d 95 (108)
+-+.+..|||||||++|+.+ ++++.|+++++.. ++.|.+.|+|||||++ .|+.+ |+|.+|
T Consensus 286 ~~~~kprPi~~nsWea~Yfd----------~t~e~ile~vk~a-----kk~gvE~FvlDDGwfg-~rndd~~slGDWlv~ 349 (687)
T COG3345 286 PRVKKPRPIGWNSWEAYYFD----------FTEEEILENVKEA-----KKFGVELFVLDDGWFG-GRNDDLKSLGDWLVN 349 (687)
T ss_pred cccCCCCcceeeceeeeeec----------CCHHHHHHHHHHH-----hhcCeEEEEEcccccc-ccCcchhhhhceecc
Confidence 34455679999999999999 9999999999986 4678999999999996 56544 999999
Q ss_pred CCCCCCChhhhh
Q psy1868 96 AKRFPRGIADLS 107 (108)
Q Consensus 96 ~~rFP~G~k~La 107 (108)
.+|||+||..|+
T Consensus 350 seKfPsgiE~li 361 (687)
T COG3345 350 SEKFPSGIEELI 361 (687)
T ss_pred hhhccccHHHHH
Confidence 999999999886
No 9
>PLN02219 probable galactinol--sucrose galactosyltransferase 2
Probab=99.25 E-value=8.1e-12 Score=108.37 Aligned_cols=72 Identities=18% Similarity=0.399 Sum_probs=58.0
Q ss_pred CCCCc-----eeeechhhhhccCCCCCCCCCCCCHHHHHHHHHHHHHccccccCceEEEeccccccCCCCCC--------
Q psy1868 23 ALTPP-----MGWLAWERFRCNTDCKNDPENCISERLFRTMADLVVSEGYAAVGYEYINIDDCWLDKTRSFN-------- 89 (108)
Q Consensus 23 ~~~Pp-----mGWnSW~~~~~~~~~~~~~~~~i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~~~rd~~-------- 89 (108)
.++.| .|||||++|+.+ |||+.|++..+.|.+.|.. .++++||||||....+..
T Consensus 191 ~K~~p~~~D~~GWCTWdafy~d----------Vt~~~I~~~l~~l~e~gip---~~~viIDDGwQsi~~~~~~~~~~~~~ 257 (775)
T PLN02219 191 KKKLPSFLDWFGWCTWDAFYTD----------VTAEGVDEGLKSLSEGGTP---PKFLIIDDGWQQIENKEKDENCVVQE 257 (775)
T ss_pred cccCccccceeeEEEhhHhhcc----------CCHHHHHHHHHHHHhCCCC---ceEEEEccCccccccccccccccccc
Confidence 45678 899999999998 9999999999999998887 789999999997544321
Q ss_pred --------CCeeccCC--------CCCCChhhhh
Q psy1868 90 --------GRLQADAK--------RFPRGIADLS 107 (108)
Q Consensus 90 --------G~~~~d~~--------rFP~G~k~La 107 (108)
-++..|++ .||.|||+++
T Consensus 258 g~qf~~rL~~f~en~KF~~~~~~~~fp~Glk~~V 291 (775)
T PLN02219 258 GAQFATRLTGIKENAKFQKNDQKNEQVSGLKHVV 291 (775)
T ss_pred cchhhhhhccccccccccccccccCCCCcHHHHH
Confidence 12344543 5899999876
No 10
>PLN02355 probable galactinol--sucrose galactosyltransferase 1
Probab=99.22 E-value=1.4e-11 Score=106.96 Aligned_cols=51 Identities=22% Similarity=0.511 Sum_probs=45.1
Q ss_pred CCCc-----eeeechhhhhccCCCCCCCCCCCCHHHHHHHHHHHHHccccccCceEEEeccccccCCCC
Q psy1868 24 LTPP-----MGWLAWERFRCNTDCKNDPENCISERLFRTMADLVVSEGYAAVGYEYINIDDCWLDKTRS 87 (108)
Q Consensus 24 ~~Pp-----mGWnSW~~~~~~~~~~~~~~~~i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~~~rd 87 (108)
++.| .|||||++|+.+ +||+.|++..+.|++.|.. .++++||||||....|
T Consensus 196 K~~P~~ld~~GWCTW~afy~~----------Vt~~~I~~~l~~l~~~g~p---~~~viIDDGwQs~~~d 251 (758)
T PLN02355 196 KKMPDMLNWFGWCTWDAFYTN----------VTAEGVKQGLESLEKGGVT---PKFVIIDDGWQSVGMD 251 (758)
T ss_pred ccCCcccceeeEEehhHhhcc----------CCHHHHHHHHHHHHhCCCC---ccEEEEeccccccccc
Confidence 4677 899999999998 9999999999999998777 7899999999975433
No 11
>PLN02711 Probable galactinol--sucrose galactosyltransferase
Probab=99.15 E-value=4.8e-11 Score=103.63 Aligned_cols=44 Identities=23% Similarity=0.474 Sum_probs=40.9
Q ss_pred eeeechhhhhccCCCCCCCCCCCCHHHHHHHHHHHHHccccccCceEEEeccccccC
Q psy1868 28 MGWLAWERFRCNTDCKNDPENCISERLFRTMADLVVSEGYAAVGYEYINIDDCWLDK 84 (108)
Q Consensus 28 mGWnSW~~~~~~~~~~~~~~~~i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~~ 84 (108)
.|||||++|+.+ |||+.|++-.+.|.+.|.. .++++||||||+-
T Consensus 215 fGWCTWdAfy~~----------Vt~egI~~gl~~L~~~Gip---~~~vIIDDGWQsi 258 (777)
T PLN02711 215 FGWCTWDAFYLT----------VHPQGVWEGVKGLVDGGCP---PGLVLIDDGWQSI 258 (777)
T ss_pred ceEEehhHhccc----------CCHHHHHHHHHHHHhCCCC---ccEEEEcCCcccc
Confidence 499999999998 9999999999999998888 6899999999974
No 12
>PLN02684 Probable galactinol--sucrose galactosyltransferase
Probab=99.09 E-value=1.1e-10 Score=101.18 Aligned_cols=66 Identities=20% Similarity=0.517 Sum_probs=53.0
Q ss_pred eeeechhhhhccCCCCCCCCCCCCHHHHHHHHHHHHHccccccCceEEEeccccccCCCCCCC------------C---e
Q psy1868 28 MGWLAWERFRCNTDCKNDPENCISERLFRTMADLVVSEGYAAVGYEYINIDDCWLDKTRSFNG------------R---L 92 (108)
Q Consensus 28 mGWnSW~~~~~~~~~~~~~~~~i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~~~rd~~G------------~---~ 92 (108)
.|||||++|+.+ |||+.|++..+.|.+.|.. .++++||||||....|... + +
T Consensus 204 fGWCTWdafy~d----------Vt~~~I~~~l~~l~~~g~p---~~~vIIDDGwQs~~~d~~~~~~~~~~~q~~~rL~~f 270 (750)
T PLN02684 204 FGWCTWDAFYQE----------VTQEGVEAGLKSLAAGGTP---PKFVIIDDGWQSVGGDPTVEAGDEKKEQPLLRLTGI 270 (750)
T ss_pred eeEEEhhHhhcc----------CCHHHHHHHHHHHHhCCCC---ceEEEEecccccccccccccccccccchhhhhhccC
Confidence 599999999998 9999999999999998877 7899999999975443211 2 3
Q ss_pred eccCCCCCC------Chhhhh
Q psy1868 93 QADAKRFPR------GIADLS 107 (108)
Q Consensus 93 ~~d~~rFP~------G~k~La 107 (108)
..| .|||+ |||+++
T Consensus 271 ~en-~KF~~~~~p~~Glk~~V 290 (750)
T PLN02684 271 KEN-EKFKKKDDPNVGIKNIV 290 (750)
T ss_pred ccc-cccccccCCCccHHHHH
Confidence 344 67875 998875
No 13
>cd06592 GH31_glucosidase_KIAA1161 KIAA1161 is an uncharacterized Homo sapiens protein with a glycosyl hydrolase family 31 (GH31) domain that is homologous to the Escherichia coli YihQ glucosidase. Orthologs of KIA1161 are found in eukaryotes and prokaryotes. In bacteria, YihQ (along with YihO) is important for bacterial O-antigen capsule assembly and translocation. Enzymes of the GH31 family possess a wide range of different hydrolytic activities including alpha-glucosidase (glucoamylase and sucrase-isomaltase), alpha-xylosidase, 6-alpha-glucosyltransferase, 3-alpha-isomaltosyltransferase and alpha-1,4-glucan lyase. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein.
Probab=99.06 E-value=1.7e-10 Score=90.57 Aligned_cols=65 Identities=25% Similarity=0.643 Sum_probs=56.5
Q ss_pred CCCceeeechhhhhccCCCCCCCCCCCCHHHHHHHHHHHHHccccccCceEEEeccccccCCCCCCCCeeccCCCCCCCh
Q psy1868 24 LTPPMGWLAWERFRCNTDCKNDPENCISERLFRTMADLVVSEGYAAVGYEYINIDDCWLDKTRSFNGRLQADAKRFPRGI 103 (108)
Q Consensus 24 ~~PpmGWnSW~~~~~~~~~~~~~~~~i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~~~rd~~G~~~~d~~rFP~G~ 103 (108)
.++|. ||+|+++.+. +||+.|++.++.+++.|+. ++.|+||++|+.. .|++..|++|||+ +
T Consensus 12 ~~~p~-W~~W~~~~~~----------~s~~~v~~~~~~~~~~~iP---~d~i~iD~~w~~~----~g~f~~d~~~FPd-p 72 (303)
T cd06592 12 FRSPI-WSTWARYKAD----------INQETVLNYAQEIIDNGFP---NGQIEIDDNWETC----YGDFDFDPTKFPD-P 72 (303)
T ss_pred hCCCc-cCChhhhccC----------cCHHHHHHHHHHHHHcCCC---CCeEEeCCCcccc----CCccccChhhCCC-H
Confidence 45665 9999999777 8999999999999988764 8999999999864 6899999999998 7
Q ss_pred hhhh
Q psy1868 104 ADLS 107 (108)
Q Consensus 104 k~La 107 (108)
+.++
T Consensus 73 ~~mi 76 (303)
T cd06592 73 KGMI 76 (303)
T ss_pred HHHH
Confidence 7765
No 14
>PF05691 Raffinose_syn: Raffinose synthase or seed imbibition protein Sip1; InterPro: IPR008811 This family consists of several raffinose synthase proteins, also known as seed imbibition (Sip1) proteins. Raffinose (O-alpha- D-galactopyranosyl- (1-->6)- O-alpha- D-glucopyranosyl-(1-->2)- O-beta- D-fructofuranoside) is a widespread oligosaccharide in plant seeds and other tissues. Raffinose synthase (2.4.1.82 from EC) is the key enzyme that channels sucrose into the raffinose oligosaccharide pathway [].
Probab=98.97 E-value=4.9e-10 Score=97.51 Aligned_cols=47 Identities=21% Similarity=0.514 Sum_probs=42.0
Q ss_pred eeeechhhhhccCCCCCCCCCCCCHHHHHHHHHHHHHccccccCceEEEeccccccCCCC
Q psy1868 28 MGWLAWERFRCNTDCKNDPENCISERLFRTMADLVVSEGYAAVGYEYINIDDCWLDKTRS 87 (108)
Q Consensus 28 mGWnSW~~~~~~~~~~~~~~~~i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~~~rd 87 (108)
.|||||++|+.+ +|++.|++..+.|.+.|.. .++++||||||...++
T Consensus 197 lGwCTWdaf~~~----------v~~~~i~~~l~~L~~~gi~---~~~viIDDGWQ~~~~~ 243 (747)
T PF05691_consen 197 LGWCTWDAFYQD----------VTEEGILEGLKSLEEGGIP---PRFVIIDDGWQSVDND 243 (747)
T ss_pred hccccHHHhccc----------cCHHHHHHHHHHHHhCCCC---ceEEEEecchhccccc
Confidence 499999999998 9999999999999988887 6799999999976543
No 15
>PLN02982 galactinol-raffinose galactosyltransferase/ghydrolase, hydrolyzing O-glycosyl compounds
Probab=98.39 E-value=3.5e-07 Score=80.36 Aligned_cols=46 Identities=20% Similarity=0.419 Sum_probs=40.7
Q ss_pred eeechhhhhccCCCCCCCCCCCCHHHHHHHHHHHHHccccccCceEEEeccccccCCCC
Q psy1868 29 GWLAWERFRCNTDCKNDPENCISERLFRTMADLVVSEGYAAVGYEYINIDDCWLDKTRS 87 (108)
Q Consensus 29 GWnSW~~~~~~~~~~~~~~~~i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~~~rd 87 (108)
|||||++||.+ ++.+.|.+-.+.|++-|.. -++++||||||.-..|
T Consensus 225 GWCTWDAFY~~----------V~p~GV~~Gv~~l~~gG~p---prfvIIDDGWQsi~~d 270 (865)
T PLN02982 225 GWCTWDAFYLT----------VDPVGVWHGVKEFAEGGVP---PRFLIIDDGWQSINFD 270 (865)
T ss_pred eEEeechhhcc----------cCHHHHHHHHHHHhcCCCC---ccEEEEecchhhcccc
Confidence 99999999998 8999999999999887777 4799999999976544
No 16
>cd06593 GH31_xylosidase_YicI YicI alpha-xylosidase is a glycosyl hydrolase family 31 (GH31) enzyme that catalyzes the release of an alpha-xylosyl residue from the non-reducing end of alpha-xyloside substrates such as alpha-xylosyl fluoride and isoprimeverose. YicI forms a homohexamer (a trimer of dimers). All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein. The YicI family corresponds to subgroup 4 in the Ernst et al classification of GH31 enzymes.
Probab=97.75 E-value=4.1e-05 Score=59.85 Aligned_cols=52 Identities=19% Similarity=0.308 Sum_probs=43.3
Q ss_pred CCHHHHHHHHHHHHHccccccCceEEEeccccccCCCCCCCCeeccCCCCCCChhhhh
Q psy1868 50 ISERLFRTMADLVVSEGYAAVGYEYINIDDCWLDKTRSFNGRLQADAKRFPRGIADLS 107 (108)
Q Consensus 50 i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~~~rd~~G~~~~d~~rFP~G~k~La 107 (108)
.|++.|++.++.+++.|+. ++.|.||++|+.. ...|++.+|++|||+ ++.++
T Consensus 21 ~~~~~v~~~~~~~~~~~iP---~d~~~lD~~w~~~--~~~~~f~~d~~~FPd-~~~~i 72 (308)
T cd06593 21 YDEEEVNEFADGMRERNLP---CDVIHLDCFWMKE--FQWCDFEFDPDRFPD-PEGML 72 (308)
T ss_pred CCHHHHHHHHHHHHHcCCC---eeEEEEecccccC--CcceeeEECcccCCC-HHHHH
Confidence 6899999999999887766 8999999999953 223589999999998 67664
No 17
>cd06589 GH31 The enzymes of glycosyl hydrolase family 31 (GH31) occur in prokaryotes, eukaryotes, and archaea with a wide range of hydrolytic activities, including alpha-glucosidase (glucoamylase and sucrase-isomaltase), alpha-xylosidase, 6-alpha-glucosyltransferase, 3-alpha-isomaltosyltransferase and alpha-1,4-glucan lyase. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein. In most cases, the pyranose moiety recognized in subsite -1 of the substrate binding site is an alpha-D-glucose, though some GH31 family members show a preference for alpha-D-xylose. Several GH31 enzymes can accommodate both glucose and xylose and different levels of discrimination between the two have been observed. Most characterized GH31 enzymes are alpha-glucosidases. In mammals, GH31 members with alpha-glucosidase activity are implicated in at least three distinct biological processes
Probab=97.60 E-value=9e-05 Score=57.00 Aligned_cols=50 Identities=18% Similarity=0.227 Sum_probs=42.9
Q ss_pred CCHHHHHHHHHHHHHccccccCceEEEeccccccCCCCCCCCe--eccCCCCCCChhhhh
Q psy1868 50 ISERLFRTMADLVVSEGYAAVGYEYINIDDCWLDKTRSFNGRL--QADAKRFPRGIADLS 107 (108)
Q Consensus 50 i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~~~rd~~G~~--~~d~~rFP~G~k~La 107 (108)
.|++.|++.++.+.+.++. ++.|.||++|+.. .|+. ..|++|||+ ++.++
T Consensus 21 ~~~~~v~~~~~~~~~~~iP---~d~~~lD~~~~~~----~~~f~~~~d~~~Fpd-p~~~i 72 (265)
T cd06589 21 GDQDKVLEVIDGMRENDIP---LDGFVLDDDYTDG----YGDFTFDWDAGKFPN-PKSMI 72 (265)
T ss_pred CCHHHHHHHHHHHHHcCCC---ccEEEECcccccC----CceeeeecChhhCCC-HHHHH
Confidence 5999999999999998776 8999999999974 4555 999999998 77664
No 18
>cd06595 GH31_xylosidase_XylS-like This family represents an uncharacterized glycosyl hydrolase family 31 (GH31) enzyme found in bacteria and eukaryotes that is related to the XylS xylosidase of Sulfolobus solfataricus. Alpha-xylosidases catalyze the release of an alpha-xylose residue from the non-reducing end of alpha-xyloside substrates. Enzymes of the GH31 family possess a wide range of different hydrolytic activities including alpha-glucosidase (glucoamylase and sucrase-isomaltase), alpha-xylosidase, 6-alpha-glucosyltransferase, 3-alpha-isomaltosyltransferase and alpha-1,4-glucan lyase. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein.
Probab=97.38 E-value=0.00027 Score=55.28 Aligned_cols=54 Identities=19% Similarity=0.166 Sum_probs=44.8
Q ss_pred CCHHHHHHHHHHHHHccccccCceEEEeccccccCC-----CCCCCCeeccCCCCCCChhhhh
Q psy1868 50 ISERLFRTMADLVVSEGYAAVGYEYINIDDCWLDKT-----RSFNGRLQADAKRFPRGIADLS 107 (108)
Q Consensus 50 i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~~~-----rd~~G~~~~d~~rFP~G~k~La 107 (108)
.|++.|++.|+.+.+.++. ++.|.||..|+... .+..+++..|+++||+ .+.++
T Consensus 22 ~s~~ev~~v~~~~r~~~iP---~D~i~lD~dw~~~~~~~~~~~~~~~ft~d~~~FPd-p~~mi 80 (292)
T cd06595 22 YSDEEYLALMDRFKKHNIP---LDVLVIDMDWHVTDIPSKYGSGWTGYSWNRKLFPD-PEKLL 80 (292)
T ss_pred CCHHHHHHHHHHHHHhCCC---ccEEEEecccccccccccccCCcceeEEChhcCCC-HHHHH
Confidence 4899999999999998777 89999999998532 2456889999999998 66654
No 19
>cd06599 GH31_glycosidase_Aec37 Glycosyl hydrolase family 31 (GH31) domain of a bacterial protein family represented by Escherichia coli protein Aec37. The gene encoding Aec37 (aec-37) is located within a genomic island (AGI-3) isolated from the extraintestinal avian pathogenic Escherichia coli strain BEN2908. The function of Aec37 and its orthologs is unknown; however, deletion of a region of the genome that includes aec-37 affects the assimilation of seven carbohydrates, decreases growth rate of the strain in minimal medium containing galacturonate or trehalose, and attenuates the virulence of E. coli BEN2908 in chickens. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein.
Probab=97.23 E-value=0.00046 Score=54.57 Aligned_cols=54 Identities=9% Similarity=0.103 Sum_probs=43.4
Q ss_pred CCHHHHHHHHHHHHHccccccCceEEEeccccccCCCCCCCCeeccCCCCCCChhhhh
Q psy1868 50 ISERLFRTMADLVVSEGYAAVGYEYINIDDCWLDKTRSFNGRLQADAKRFPRGIADLS 107 (108)
Q Consensus 50 i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~~~rd~~G~~~~d~~rFP~G~k~La 107 (108)
.+++.|++.++.+.+.++. ++.|.||.+|+...-.....+..|+++||+ .+.++
T Consensus 26 ~~q~~v~~~~~~~r~~~iP---~d~i~ld~~~~~~~~~~~~~f~~d~~~FPd-p~~mi 79 (317)
T cd06599 26 DAQEALLEFIDKCREHDIP---CDSFHLSSGYTSIEGGKRYVFNWNKDRFPD-PAAFV 79 (317)
T ss_pred cHHHHHHHHHHHHHHcCCC---eeEEEEeccccccCCCceeeeecCcccCCC-HHHHH
Confidence 4689999999999998777 899999999986422234568999999998 76664
No 20
>cd06598 GH31_transferase_CtsZ CtsZ (cyclic tetrasaccharide-synthesizing enzyme Z) is a bacterial 6-alpha-glucosyltransferase, first identified in Arthrobacter globiformis, that produces cyclic tetrasaccharides together with a closely related enzyme CtsY. CtsZ and CtsY both have a glycosyl hydrolase family 31 (GH31) catalytic domain. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein.
Probab=97.13 E-value=0.0007 Score=53.51 Aligned_cols=53 Identities=19% Similarity=0.283 Sum_probs=44.1
Q ss_pred CHHHHHHHHHHHHHccccccCceEEEeccccccCCC--CCCCCeeccCCCCCCChhhhh
Q psy1868 51 SERLFRTMADLVVSEGYAAVGYEYINIDDCWLDKTR--SFNGRLQADAKRFPRGIADLS 107 (108)
Q Consensus 51 ~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~~~r--d~~G~~~~d~~rFP~G~k~La 107 (108)
+++.|++.++.+.+.++. .+.|.||.+|..... ...|++..|++|||+ .+.++
T Consensus 22 ~~~~v~~~~~~~~~~~iP---~d~i~lD~~w~~~~~~~~~~~~f~wd~~~FPd-p~~mi 76 (317)
T cd06598 22 NWQEVDDTIKTLREKDFP---LDAAILDLYWFGKDIDKGHMGNLDWDRKAFPD-PAGMI 76 (317)
T ss_pred CHHHHHHHHHHHHHhCCC---ceEEEEechhhcCcccCCceeeeEeccccCCC-HHHHH
Confidence 789999999999998776 899999999986432 245899999999998 66654
No 21
>cd06602 GH31_MGAM_SI_GAA This family includes the following three closely related glycosyl hydrolase family 31 (GH31) enzymes: maltase-glucoamylase (MGAM), sucrase-isomaltase (SI), and lysosomal acid alpha-glucosidase (GAA), also known as acid-maltase. MGAM is one of the two enzymes responsible for catalyzing the last glucose-releasing step in starch digestion. SI is implicated in the digestion of dietary starch and major disaccharides such as sucrose and isomaltose, while GAA degrades glycogen in the lysosome, cleaving both alpha-1,4 and alpha-1,6 glucosidic linkages. MGAM and SI are anchored to small-intestinal brush-border epithelial cells. The absence of SI from the brush border membrane or its malfunction is associated with malabsorption disorders such as congenital sucrase-isomaltase deficiency (CSID). The domain architectures of MGAM and SI include two tandem GH31 catalytic domains, an N-terminal domain found near the membrane-bound end, and a C-terminal luminal domain. Both of
Probab=96.91 E-value=0.0014 Score=52.50 Aligned_cols=50 Identities=14% Similarity=0.257 Sum_probs=42.6
Q ss_pred CCHHHHHHHHHHHHHccccccCceEEEeccccccCCCCCCCCeeccCCCCCCCh--hhhh
Q psy1868 50 ISERLFRTMADLVVSEGYAAVGYEYINIDDCWLDKTRSFNGRLQADAKRFPRGI--ADLS 107 (108)
Q Consensus 50 i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~~~rd~~G~~~~d~~rFP~G~--k~La 107 (108)
-+++.|++.++.+.+.++. ++.|.||..|+.. .+++..|+++||+ . +.++
T Consensus 21 ~~~~~v~~~~~~~r~~~iP---~d~i~lD~~~~~~----~~~f~~d~~~FPd-p~~~~mi 72 (339)
T cd06602 21 KNVDEVKEVVENMRAAGIP---LDVQWNDIDYMDR----RRDFTLDPVRFPG-LKMPEFV 72 (339)
T ss_pred CCHHHHHHHHHHHHHhCCC---cceEEECcccccC----ccceecccccCCC-ccHHHHH
Confidence 3889999999999998777 8899999999864 6789999999998 4 5443
No 22
>cd06601 GH31_lyase_GLase GLases (alpha-1,4-glucan lyases) are glycosyl hydrolase family 31 (GH31) enzymes that degrade alpha-1,4-glucans and maltooligosaccharides via a nonhydrolytic pathway to yield 1,5-D-anhydrofructose from the nonreducing end. GLases cleave the bond between C1 and O1 of the nonreducing sugar residue of alpha-glucans to generate a monosaccharide product with a double bond between C1 and C2. This family corresponds to subgroup 2 in the Ernst et al classification of GH31 enzymes.
Probab=96.90 E-value=0.0015 Score=52.48 Aligned_cols=50 Identities=8% Similarity=0.155 Sum_probs=42.9
Q ss_pred CCHHHHHHHHHHHHHccccccCceEEEeccccccCCCCCCCCeeccCCCCCCChhhhh
Q psy1868 50 ISERLFRTMADLVVSEGYAAVGYEYINIDDCWLDKTRSFNGRLQADAKRFPRGIADLS 107 (108)
Q Consensus 50 i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~~~rd~~G~~~~d~~rFP~G~k~La 107 (108)
-|++.|++.++.+.+.++. ++.|.||..|... .+.+..|+++||+ .+.++
T Consensus 21 ~~~~ev~~v~~~~r~~~IP---~D~i~lDidy~~~----~~~Ft~d~~~FPd-p~~mv 70 (332)
T cd06601 21 SNRSDLEEVVEGYRDNNIP---LDGLHVDVDFQDN----YRTFTTNGGGFPN-PKEMF 70 (332)
T ss_pred CCHHHHHHHHHHHHHcCCC---CceEEEcCchhcC----CCceeecCCCCCC-HHHHH
Confidence 3889999999999998766 8999999999853 6789999999998 66554
No 23
>cd06603 GH31_GANC_GANAB_alpha This family includes the closely related glycosyl hydrolase family 31 (GH31) isozymes, neutral alpha-glucosidase C (GANC) and the alpha subunit of heterodimeric neutral alpha-glucosidase AB (GANAB). Initially distinguished on the basis of differences in electrophoretic mobility in starch gel, GANC and GANAB have been shown to have other differences, including those of substrate specificity. GANC and GANAB are key enzymes in glycogen metabolism that hydrolyze terminal, non-reducing 1,4-linked alpha-D-glucose residues from glycogen in the endoplasmic reticulum. The GANC/GANAB family includes the alpha-glucosidase II (ModA) from Dictyostelium discoideum as well as the alpha-glucosidase II (GLS2, or ROT2 - Reversal of TOR2 lethality protein 2) from Saccharomyces cerevisiae.
Probab=96.87 E-value=0.0014 Score=52.14 Aligned_cols=50 Identities=16% Similarity=0.183 Sum_probs=42.4
Q ss_pred CCHHHHHHHHHHHHHccccccCceEEEeccccccCCCCCCCCeeccCCCCCCChhhhh
Q psy1868 50 ISERLFRTMADLVVSEGYAAVGYEYINIDDCWLDKTRSFNGRLQADAKRFPRGIADLS 107 (108)
Q Consensus 50 i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~~~rd~~G~~~~d~~rFP~G~k~La 107 (108)
-|++.|++.++.+.+.++. ++.|.||..|... .+.+..|++|||+ .+.++
T Consensus 21 ~~~~ev~~~~~~~~~~~iP---~d~i~lD~~~~~~----~~~f~~d~~~FPd-p~~mi 70 (339)
T cd06603 21 KDQEDVKEVDAGFDEHDIP---YDVIWLDIEHTDG----KRYFTWDKKKFPD-PEKMQ 70 (339)
T ss_pred CCHHHHHHHHHHHHHcCCC---ceEEEEChHHhCC----CCceEeCcccCCC-HHHHH
Confidence 4899999999999998777 8999999999753 5679999999998 66554
No 24
>cd06591 GH31_xylosidase_XylS XylS is a glycosyl hydrolase family 31 (GH31) alpha-xylosidase found in prokaryotes, eukaryotes, and archaea, that catalyzes the release of alpha-xylose from the non-reducing terminal side of the alpha-xyloside substrate. XylS has been characterized in Sulfolobus solfataricus where it hydrolyzes isoprimeverose, the p-nitrophenyl-beta derivative of isoprimeverose, and xyloglucan oligosaccharides, and has transxylosidic activity. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein. The XylS family corresponds to subgroup 3 in the Ernst et al classification of GH31 enzymes.
Probab=96.84 E-value=0.0016 Score=51.58 Aligned_cols=52 Identities=19% Similarity=0.223 Sum_probs=42.2
Q ss_pred CCHHHHHHHHHHHHHccccccCceEEEeccccccCCCCCCCCeeccCCCCCCChhhhh
Q psy1868 50 ISERLFRTMADLVVSEGYAAVGYEYINIDDCWLDKTRSFNGRLQADAKRFPRGIADLS 107 (108)
Q Consensus 50 i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~~~rd~~G~~~~d~~rFP~G~k~La 107 (108)
-+++.|++.++.+.+.++. ++.|.||..|....+ .+++..|++|||+ ++.++
T Consensus 21 ~~~~ev~~~~~~~~~~~iP---~d~i~lD~~~~~~~~--~~~f~~d~~~FPd-p~~mi 72 (319)
T cd06591 21 KTQEELLDVAKEYRKRGIP---LDVIVQDWFYWPKQG--WGEWKFDPERFPD-PKAMV 72 (319)
T ss_pred CCHHHHHHHHHHHHHhCCC---ccEEEEechhhcCCC--ceeEEEChhhCCC-HHHHH
Confidence 3889999999999987666 899999998875322 3589999999998 66554
No 25
>cd06604 GH31_glucosidase_II_MalA Alpha-glucosidase II (alpha-D-glucoside glucohydrolase) is a glycosyl hydrolase family 31 (GH31) enzyme, found in bacteria and plants, which has exo-alpha-1,4-glucosidase and oligo-1,6-glucosidase activities. Alpha-glucosidase II has been characterized in Bacillus thermoamyloliquefaciens where it forms a homohexamer. This family also includes the MalA alpha-glucosidase from Sulfolobus sulfataricus and the AglA alpha-glucosidase from Picrophilus torridus. MalA is part of the carbohydrate-metabolizing machinery that allows this organism to utilize carbohydrates, such as maltose, as the sole carbon and energy source.
Probab=96.82 E-value=0.0017 Score=51.61 Aligned_cols=50 Identities=24% Similarity=0.300 Sum_probs=42.4
Q ss_pred CCHHHHHHHHHHHHHccccccCceEEEeccccccCCCCCCCCeeccCCCCCCChhhhh
Q psy1868 50 ISERLFRTMADLVVSEGYAAVGYEYINIDDCWLDKTRSFNGRLQADAKRFPRGIADLS 107 (108)
Q Consensus 50 i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~~~rd~~G~~~~d~~rFP~G~k~La 107 (108)
-|++.|++.++.+.+.|+. ++.|.||..|+.. .+.+..|++|||+ .+.+.
T Consensus 21 ~~~~~v~~~~~~~~~~~iP---~d~i~lD~~~~~~----~~~f~~d~~~fPd-p~~m~ 70 (339)
T cd06604 21 YPEEEVREIADEFRERDIP---CDAIYLDIDYMDG----YRVFTWDKERFPD-PKELI 70 (339)
T ss_pred CCHHHHHHHHHHHHHhCCC---cceEEECchhhCC----CCceeeccccCCC-HHHHH
Confidence 3889999999999998777 8999999999853 6779999999997 55543
No 26
>cd06600 GH31_MGAM-like This family includes the following closely related glycosyl hydrolase family 31 (GH31) enzymes: maltase-glucoamylase (MGAM), sucrase-isomaltase (SI), lysosomal acid alpha-glucosidase (GAA), neutral alpha-glucosidase C (GANC), the alpha subunit of neutral alpha-glucosidase AB (GANAB), and alpha-glucosidase II. MGAM is one of the two enzymes responsible for catalyzing the last glucose-releasing step in starch digestion. SI is implicated in the digestion of dietary starch and major disaccharides such as sucrose and isomaltose, while GAA degrades glycogen in the lysosome, cleaving both alpha-1,4 and alpha-1,6 glucosidic linkages. MGAM and SI are anchored to small-intestinal brush-border epithelial cells. The absence of SI from the brush border membrane or its malfunction is associated with malabsorption disorders such as congenital sucrase-isomaltase deficiency (CSID). The domain architectures of MGAM and SI include two tandem GH31 catalytic domains, an N-terminal do
Probab=96.65 E-value=0.0025 Score=50.46 Aligned_cols=49 Identities=22% Similarity=0.425 Sum_probs=41.8
Q ss_pred CHHHHHHHHHHHHHccccccCceEEEeccccccCCCCCCCCeeccCCCCCCChhhhh
Q psy1868 51 SERLFRTMADLVVSEGYAAVGYEYINIDDCWLDKTRSFNGRLQADAKRFPRGIADLS 107 (108)
Q Consensus 51 ~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~~~rd~~G~~~~d~~rFP~G~k~La 107 (108)
|++.|++.++.+.+.++. .+.|.||..|+.. .+.+..|+++||+ .+.++
T Consensus 22 ~~~~v~~~~~~~~~~~iP---~d~i~lD~~~~~~----~~~f~~d~~~FPd-p~~~i 70 (317)
T cd06600 22 PQDKVVEVVDIMQKEGFP---YDVVFLDIHYMDS----YRLFTWDPYRFPE-PKKLI 70 (317)
T ss_pred CHHHHHHHHHHHHHcCCC---cceEEEChhhhCC----CCceeechhcCCC-HHHHH
Confidence 889999999999988766 8999999999853 5779999999998 55554
No 27
>PF01055 Glyco_hydro_31: Glycosyl hydrolases family 31 ; InterPro: IPR000322 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 31 GH31 from CAZY comprises enzymes with several known activities; alpha-glucosidase (3.2.1.20 from EC), alpha-galactosidase (3.2.1.22 from EC); glucoamylase (3.2.1.3 from EC), sucrase-isomaltase (3.2.1.48 from EC); isomaltase (3.2.1.10 from EC); alpha-xylosidase (3.2.1 from EC); alpha-glucan lyase (4.2.2.13 from EC). Glycoside hydrolase family 31 groups a number of glycosyl hydrolases on the basis of sequence similarities [, , ] An aspartic acid has been implicated [] in the catalytic activity of sucrase, isomaltase, and lysosomal alpha-glucosidase.; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 3L4U_A 3L4X_A 3L4W_A 3L4V_A 3CTT_A 2QMJ_A 2QLY_A 3L4Z_A 3L4Y_A 3L4T_A ....
Probab=96.57 E-value=0.0036 Score=50.92 Aligned_cols=50 Identities=28% Similarity=0.393 Sum_probs=38.3
Q ss_pred CCHHHHHHHHHHHHHccccccCceEEEeccccccCCCCCCCCeeccCCCCCCChhhhh
Q psy1868 50 ISERLFRTMADLVVSEGYAAVGYEYINIDDCWLDKTRSFNGRLQADAKRFPRGIADLS 107 (108)
Q Consensus 50 i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~~~rd~~G~~~~d~~rFP~G~k~La 107 (108)
-|++.|++.++.+.+.++. ++.+.||+.|+.. .+++..|+++||+ ++.++
T Consensus 40 ~~~~~v~~~i~~~~~~~iP---~d~~~iD~~~~~~----~~~f~~d~~~FPd-~~~~~ 89 (441)
T PF01055_consen 40 YNQDEVREVIDRYRSNGIP---LDVIWIDDDYQDG----YGDFTWDPERFPD-PKQMI 89 (441)
T ss_dssp TSHHHHHHHHHHHHHTT-----EEEEEE-GGGSBT----TBTT-B-TTTTTT-HHHHH
T ss_pred CCHHHHHHHHHHHHHcCCC---ccceecccccccc----ccccccccccccc-hHHHH
Confidence 3789999999999998666 8999999999864 5689999999996 66654
No 28
>cd06594 GH31_glucosidase_YihQ YihQ is a bacterial alpha-glucosidase with a conserved glycosyl hydrolase family 31 (GH31) domain that catalyzes the release of an alpha-glucosyl residue from the non-reducing end of alpha-glucoside substrates such as alpha-glucosyl fluoride. Orthologs of YihQ that have not yet been functionally characterized are present in plants and fungi. YihQ has sequence similarity to other GH31 enzymes such as CtsZ, a 6-alpha-glucosyltransferase from Bacillus globisporus, and YicI, an alpha-xylosidase from Echerichia coli. In bacteria, YihQ (along with YihO) is important for bacterial O-antigen capsule assembly and translocation.
Probab=96.56 E-value=0.0041 Score=49.31 Aligned_cols=52 Identities=13% Similarity=0.252 Sum_probs=40.4
Q ss_pred CHHHHHHHHHHHHHccccccCceEEEeccccccCCCCCCC-----CeeccCCCCCCChhhhh
Q psy1868 51 SERLFRTMADLVVSEGYAAVGYEYINIDDCWLDKTRSFNG-----RLQADAKRFPRGIADLS 107 (108)
Q Consensus 51 ~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~~~rd~~G-----~~~~d~~rFP~G~k~La 107 (108)
+++.|++.++.+.+.++. ++.|.|| .|+...-+..| ++..|++|||+ .+.++
T Consensus 21 s~~~v~~~~~~~~~~~iP---~d~i~ld-dw~~~~~~~~g~~~~~~f~~d~~~FPd-p~~mi 77 (317)
T cd06594 21 GTDKVLEALEKARAAGVK---VAGLWLQ-DWTGRRETSFGDRLWWNWEWDPERYPG-LDELI 77 (317)
T ss_pred CHHHHHHHHHHHHHcCCC---eeEEEEc-cccCcccccccceeeeeeEEChhhCCC-HHHHH
Confidence 789999999999998777 8999999 58642222234 47999999998 66664
No 29
>PRK10658 putative alpha-glucosidase; Provisional
Probab=96.54 E-value=0.0051 Score=53.58 Aligned_cols=52 Identities=13% Similarity=0.249 Sum_probs=42.4
Q ss_pred CCHHHHHHHHHHHHHccccccCceEEEeccccccCCCCCCCCeeccCCCCCCChhhhh
Q psy1868 50 ISERLFRTMADLVVSEGYAAVGYEYINIDDCWLDKTRSFNGRLQADAKRFPRGIADLS 107 (108)
Q Consensus 50 i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~~~rd~~G~~~~d~~rFP~G~k~La 107 (108)
.+|+.|++.++.+.+.++. .+.|.||+.|.... ..+++..|+++||+ .+.++
T Consensus 280 ~~e~~v~~~~~~~r~~~iP---~d~i~lD~~w~~~~--~~~~f~wd~~~FPd-p~~mi 331 (665)
T PRK10658 280 YDEATVNSFIDGMAERDLP---LHVFHFDCFWMKEF--QWCDFEWDPRTFPD-PEGML 331 (665)
T ss_pred CCHHHHHHHHHHHHHcCCC---ceEEEEchhhhcCC--ceeeeEEChhhCCC-HHHHH
Confidence 5799999999999998776 78999999998531 24689999999997 55543
No 30
>cd06597 GH31_transferase_CtsY CtsY (cyclic tetrasaccharide-synthesizing enzyme Y) is a bacterial 3-alpha-isomaltosyltransferase, first identified in Arthrobacter globiformis, that produces cyclic tetrasaccharides together with a closely related enzyme CtsZ. CtsY and CtsZ both have a glycosyl hydrolase family 31 (GH31) catalytic domain. All GH31 enzymes cleave a terminal carbohydrate moiety from a substrate that varies considerably in size, depending on the enzyme, and may be either a starch or a glycoprotein.
Probab=95.66 E-value=0.024 Score=45.44 Aligned_cols=30 Identities=17% Similarity=0.245 Sum_probs=26.5
Q ss_pred CCHHHHHHHHHHHHHccccccCceEEEecccccc
Q psy1868 50 ISERLFRTMADLVVSEGYAAVGYEYINIDDCWLD 83 (108)
Q Consensus 50 i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~ 83 (108)
-|++.|++.++.+.+.|+. ++.|.||+ |++
T Consensus 21 ~~~~ev~~v~~~~~~~~iP---~d~i~lD~-W~~ 50 (340)
T cd06597 21 DTQAEVMRQMDAHEEHGIP---VTVVVIEQ-WSD 50 (340)
T ss_pred CCHHHHHHHHHHHHHcCCC---eeEEEEec-ccC
Confidence 4899999999999998777 89999995 885
No 31
>PRK10426 alpha-glucosidase; Provisional
Probab=95.07 E-value=0.036 Score=48.12 Aligned_cols=52 Identities=13% Similarity=0.168 Sum_probs=39.4
Q ss_pred CHHHHHHHHHHHHHccccccCceEEEeccccccCCCCCC-----CCeeccCCCCCCChhhhh
Q psy1868 51 SERLFRTMADLVVSEGYAAVGYEYINIDDCWLDKTRSFN-----GRLQADAKRFPRGIADLS 107 (108)
Q Consensus 51 ~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~~~rd~~-----G~~~~d~~rFP~G~k~La 107 (108)
+++.|++.++.+.+.|+. .+.|.|| .|++...... +++..|+++||+ .+.++
T Consensus 219 ~~~~v~~v~~~~r~~~IP---~d~i~ld-dw~~~~~~~~g~~~~~~~~~d~~~FPd-p~~mi 275 (635)
T PRK10426 219 GTEVVQKKLDTMRNAGVK---VNGIWAQ-DWSGIRMTSFGKRLMWNWKWDSERYPQ-LDSRI 275 (635)
T ss_pred CHHHHHHHHHHHHHcCCC---eeEEEEe-cccccccccccccccccceEChhhCCC-HHHHH
Confidence 567899999999988776 7889998 4986432222 457899999998 66554
No 32
>PLN02763 hydrolase, hydrolyzing O-glycosyl compounds
Probab=94.23 E-value=0.069 Score=48.71 Aligned_cols=49 Identities=18% Similarity=0.238 Sum_probs=41.5
Q ss_pred CHHHHHHHHHHHHHccccccCceEEEeccccccCCCCCCCCeeccCCCCCCChhhhh
Q psy1868 51 SERLFRTMADLVVSEGYAAVGYEYINIDDCWLDKTRSFNGRLQADAKRFPRGIADLS 107 (108)
Q Consensus 51 ~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~~~rd~~G~~~~d~~rFP~G~k~La 107 (108)
+++.|++.++.+.+.++. ++.|.+|..|... .+.+..|+++||+ .+.++
T Consensus 199 sq~eV~eva~~fre~~IP---~DvIwlDidYm~g----~~~FTwD~~rFPd-P~~mv 247 (978)
T PLN02763 199 SAKRVAEIARTFREKKIP---CDVVWMDIDYMDG----FRCFTFDKERFPD-PKGLA 247 (978)
T ss_pred CHHHHHHHHHHHHHcCCC---ceEEEEehhhhcC----CCceeECcccCCC-HHHHH
Confidence 889999999999998776 8999999999753 4569999999998 66554
No 33
>COG1501 Alpha-glucosidases, family 31 of glycosyl hydrolases [Carbohydrate transport and metabolism]
Probab=93.76 E-value=0.088 Score=46.88 Aligned_cols=49 Identities=20% Similarity=0.328 Sum_probs=40.1
Q ss_pred CCHHHHHHHHHHHHHccccccCceEEEeccc-cccCCCCCCCCeeccCCCCCCChhhh
Q psy1868 50 ISERLFRTMADLVVSEGYAAVGYEYINIDDC-WLDKTRSFNGRLQADAKRFPRGIADL 106 (108)
Q Consensus 50 i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdg-W~~~~rd~~G~~~~d~~rFP~G~k~L 106 (108)
-+|+.+.+.++.+.+. +.-++.|.+|.. |.. ..+.+..|+.+||+ .|.|
T Consensus 277 ~~e~~v~~~i~~~~~~---~IP~d~~~lD~~~~~~----~~~~F~wd~~~FP~-pk~m 326 (772)
T COG1501 277 YDEDEVLEFIDEMRER---DIPLDVFVLDIDFWMD----NWGDFTWDPDRFPD-PKQM 326 (772)
T ss_pred ccHHHHHHHHhhcccc---cCcceEEEEeehhhhc----cccceEECcccCCC-HHHH
Confidence 3799999999999765 455999999997 875 46789999999998 4444
No 34
>PF10566 Glyco_hydro_97: Glycoside hydrolase 97 ; InterPro: IPR019563 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. This is the 97th family of glycosidases, in this case bacterial. The central part of the GH97 family protein sequences represents a typical and complete (beta/alpha)8-barrel or catalytic TIM-barrel type domain. The N- and C-terminal parts of the sequences, mainly consisting of beta-strands, most probably form two additional non-catalytic domains with as yet unknown functions. The non-catalytic domains of glycosidases from the alpha-galactosidase and alpha-glucosidase superfamilies are also predominantly composed of beta-strands, and at least some of these domains are involved in oligomerisation and carbohydrate binding. In all known glycosidases with the (beta-alpha)8-barrel fold, the amino acid residues at the active site are located on the C-termini of the beta-strands []. ; PDB: 2JKP_A 2JKE_A 2D73_B 2ZQ0_B 2JKA_A 3A24_A.
Probab=92.88 E-value=0.15 Score=40.25 Aligned_cols=29 Identities=21% Similarity=0.510 Sum_probs=24.3
Q ss_pred CCHHHHHHHHHHHHHccccccCceEEEecccccc
Q psy1868 50 ISERLFRTMADLVVSEGYAAVGYEYINIDDCWLD 83 (108)
Q Consensus 50 i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~ 83 (108)
+|-+..++-+|..+ ++|++|+.||+||..
T Consensus 29 ~~t~~~k~yIDfAa-----~~G~eYvlvD~GW~~ 57 (273)
T PF10566_consen 29 ATTETQKRYIDFAA-----EMGIEYVLVDAGWYG 57 (273)
T ss_dssp SSHHHHHHHHHHHH-----HTT-SEEEEBTTCCG
T ss_pred CCHHHHHHHHHHHH-----HcCCCEEEecccccc
Confidence 78899999999865 678889999999985
No 35
>KOG1065|consensus
Probab=90.61 E-value=0.34 Score=43.48 Aligned_cols=64 Identities=19% Similarity=0.252 Sum_probs=49.0
Q ss_pred ceeeechhhhhc-cCCCCCCCCCCCCHHHHHHHHHHHHHccccccCceEEEeccccccCCCCCCCCeeccCCCCCCChhh
Q psy1868 27 PMGWLAWERFRC-NTDCKNDPENCISERLFRTMADLVVSEGYAAVGYEYINIDDCWLDKTRSFNGRLQADAKRFPRGIAD 105 (108)
Q Consensus 27 pmGWnSW~~~~~-~~~~~~~~~~~i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~~~rd~~G~~~~d~~rFP~G~k~ 105 (108)
|--|+-+..+.- .. .|...+++.++.+.+.|+. ++.+.+|.-|++. +++...|+.+||. ++.
T Consensus 293 ~pYWslGf~~~RwgY---------~nls~~~dvv~~~~~agiP---ld~~~~DiDyMd~----ykDFTvd~~~fp~-~~~ 355 (805)
T KOG1065|consen 293 PPYWSLGFQLCRWGY---------KNLSVVRDVVENYRAAGIP---LDVIVIDIDYMDG----YKDFTVDKVWFPD-LKD 355 (805)
T ss_pred Cchhhccceeccccc---------ccHHHHHHHHHHHHHcCCC---cceeeeehhhhhc----ccceeeccccCcc-hHH
Confidence 335877765321 21 5889999999999876554 8999999999975 6779999999998 665
Q ss_pred hh
Q psy1868 106 LS 107 (108)
Q Consensus 106 La 107 (108)
..
T Consensus 356 fv 357 (805)
T KOG1065|consen 356 FV 357 (805)
T ss_pred HH
Confidence 43
No 36
>PF13670 PepSY_2: Peptidase propeptide and YPEB domain This Prosite motif covers only the active site. This is family M4 in the peptidase classification.
Probab=88.90 E-value=2 Score=27.35 Aligned_cols=41 Identities=22% Similarity=0.455 Sum_probs=22.9
Q ss_pred CCHHHHHHHHHHHHHccccccCceEEEe-cc-ccccCCCCCCCC---eeccC
Q psy1868 50 ISERLFRTMADLVVSEGYAAVGYEYINI-DD-CWLDKTRSFNGR---LQADA 96 (108)
Q Consensus 50 i~e~~i~~~ad~l~~~Gl~~~Gy~yv~I-Dd-gW~~~~rd~~G~---~~~d~ 96 (108)
++.+.+++.+. ..|+. .+-|-+ |+ +|.-.-++.+|+ +..|+
T Consensus 29 ~~~~~~~~~l~---~~G~~---v~~ve~~~~g~yev~~~~~dG~~~ev~vD~ 74 (83)
T PF13670_consen 29 LSIEQAVAKLE---AQGYQ---VREVEFDDDGCYEVEARDKDGKKVEVYVDP 74 (83)
T ss_pred CCHHHHHHHHH---hcCCc---eEEEEEcCCCEEEEEEEECCCCEEEEEEcC
Confidence 34445554333 33442 456777 66 887654777776 34454
No 37
>PF07485 DUF1529: Domain of Unknown Function (DUF1259); InterPro: IPR011094 This family is the lppY/lpqO homologue family. They are related to 'probable conserved lipoproteins' LppY and LpqO from Mycobacterium bovis.
Probab=69.64 E-value=3.9 Score=28.74 Aligned_cols=59 Identities=25% Similarity=0.427 Sum_probs=42.3
Q ss_pred CCCCCCCceeeechhhhhccCCCC--CCCCCCCCHHHHHHHHHHHHHccccccCceEEEecccccc
Q psy1868 20 NGVALTPPMGWLAWERFRCNTDCK--NDPENCISERLFRTMADLVVSEGYAAVGYEYINIDDCWLD 83 (108)
Q Consensus 20 ~~~~~~PpmGWnSW~~~~~~~~~~--~~~~~~i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~ 83 (108)
.|...+||||-++|-+|.-..+++ ..++.-+.|+.|....++|.+ .|.+..-|=.-|..
T Consensus 33 ~G~~~~~~~~l~~~~aF~~~~~G~A~~~Gd~vll~~EV~pvi~aL~~-----~GI~vtAlHNH~l~ 93 (123)
T PF07485_consen 33 DGRRLPPPMGLTSWIAFEPDGDGKAMVMGDFVLLEDEVNPVISALRK-----NGIEVTALHNHWLF 93 (123)
T ss_pred CCEEcCCccccceeEEEEECCCCcEEEeecEEecHHHHHHHHHHHHH-----CCceEEEEeccccc
Confidence 355668999999999987642212 156777899999999999764 55567777666653
No 38
>PF15240 Pro-rich: Proline-rich
Probab=66.08 E-value=4.6 Score=30.27 Aligned_cols=23 Identities=22% Similarity=0.280 Sum_probs=16.3
Q ss_pred ChHHHH-HHHHHHHHhhhccCCCC
Q psy1868 1 MWLAGL-LVASLVAVSKALDNGVA 23 (108)
Q Consensus 1 ~~~~~~-~~~~~~~~~~~~~~~~~ 23 (108)
||+.+| |+||+|++|..+++.+.
T Consensus 1 MLlVLLSvALLALSSAQ~~dEdv~ 24 (179)
T PF15240_consen 1 MLLVLLSVALLALSSAQSTDEDVS 24 (179)
T ss_pred ChhHHHHHHHHHhhhccccccccc
Confidence 888544 67777778877777663
No 39
>COG2894 MinD Septum formation inhibitor-activating ATPase [Cell division and chromosome partitioning]
Probab=66.06 E-value=7 Score=30.91 Aligned_cols=25 Identities=20% Similarity=0.472 Sum_probs=21.1
Q ss_pred CCHHHHHHHHHHHHHccccccCceEEEecc
Q psy1868 50 ISERLFRTMADLVVSEGYAAVGYEYINIDD 79 (108)
Q Consensus 50 i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDd 79 (108)
++.+.++...+.| ++.+|+||++|+
T Consensus 97 lt~E~v~~vv~eL-----~~~~fDyIi~Ds 121 (272)
T COG2894 97 LTPEGVKKVVNEL-----KAMDFDYIIIDS 121 (272)
T ss_pred CCHHHHHHHHHHH-----HhcCCCEEEecC
Confidence 7888888888876 457899999998
No 40
>PRK09121 5-methyltetrahydropteroyltriglutamate--homocysteine methyltransferase; Provisional
Probab=57.87 E-value=8.1 Score=31.01 Aligned_cols=32 Identities=22% Similarity=0.335 Sum_probs=22.9
Q ss_pred CHHHHHHHHHHHHH--ccccccCceEEEeccccc
Q psy1868 51 SERLFRTMADLVVS--EGYAAVGYEYINIDDCWL 82 (108)
Q Consensus 51 ~e~~i~~~ad~l~~--~Gl~~~Gy~yv~IDdgW~ 82 (108)
.|+.+.+.|+++.+ ..|.++|.+||+||+.-.
T Consensus 147 ~~el~~dlA~al~~Ei~~L~~aG~~~IQiDeP~l 180 (339)
T PRK09121 147 REKLAWEFAKILNQEAKELEAAGVDIIQFDEPAF 180 (339)
T ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCEEEecccHH
Confidence 46666667765542 347789999999999754
No 41
>PRK06233 hypothetical protein; Provisional
Probab=56.01 E-value=9.4 Score=31.01 Aligned_cols=32 Identities=19% Similarity=0.341 Sum_probs=23.3
Q ss_pred CHHHHHHHHHHHHH--ccccccCceEEEeccccc
Q psy1868 51 SERLFRTMADLVVS--EGYAAVGYEYINIDDCWL 82 (108)
Q Consensus 51 ~e~~i~~~ad~l~~--~Gl~~~Gy~yv~IDdgW~ 82 (108)
.|+.+.+.|+++.+ .-|.++|.+||+|||.-.
T Consensus 162 ~eel~~dlA~a~~~Ei~~L~~aG~~~IQiDeP~~ 195 (372)
T PRK06233 162 WDDYLDDLAQAYHDTIQHFYDLGARYIQLDDTTW 195 (372)
T ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCEEEEcCCCH
Confidence 46667777765543 347789999999999753
No 42
>COG0436 Aspartate/tyrosine/aromatic aminotransferase [Amino acid transport and metabolism]
Probab=53.75 E-value=30 Score=28.21 Aligned_cols=38 Identities=13% Similarity=0.179 Sum_probs=30.7
Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHccccccCceEEEeccccccC
Q psy1868 41 DCKNDPENCISERLFRTMADLVVSEGYAAVGYEYINIDDCWLDK 84 (108)
Q Consensus 41 ~~~~~~~~~i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~~ 84 (108)
+|.|+.+...+++.+++.++...+.++ +|+.|+.+..-
T Consensus 171 ~P~NPTGav~~~~~l~~i~~~a~~~~i------~ii~DEiY~~l 208 (393)
T COG0436 171 SPNNPTGAVYSKEELKAIVELAREHDI------IIISDEIYEEL 208 (393)
T ss_pred CCCCCcCcCCCHHHHHHHHHHHHHcCe------EEEEehhhhhc
Confidence 467777888899999999999877543 58999999853
No 43
>PF08784 RPA_C: Replication protein A C terminal; InterPro: IPR014892 This protein corresponds to the C-terminal of the single stranded DNA binding protein RPA (replication protein A). RPA is involved in many DNA metabolic pathways including DNA replication, DNA repair, recombination, cell cycle and DNA damage checkpoints. ; PDB: 1QUQ_C 2PQA_C 3KDF_B 2Z6K_B 2PI2_B 1L1O_E 1DPU_A 1Z1D_A.
Probab=51.65 E-value=15 Score=23.96 Aligned_cols=25 Identities=40% Similarity=0.591 Sum_probs=19.1
Q ss_pred CCHHHHHHHHHHHHHccccccCceEEEecc
Q psy1868 50 ISERLFRTMADLVVSEGYAAVGYEYINIDD 79 (108)
Q Consensus 50 i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDd 79 (108)
++++.|+..++.|.+.|+. |=.|||
T Consensus 77 ~~~~~v~~al~~L~~eG~I-----YsTiDd 101 (102)
T PF08784_consen 77 MSENEVRKALDFLSNEGHI-----YSTIDD 101 (102)
T ss_dssp S-HHHHHHHHHHHHHTTSE-----EESSST
T ss_pred cCHHHHHHHHHHHHhCCeE-----ecccCC
Confidence 7899999999998765554 777886
No 44
>PF01043 SecA_PP_bind: SecA preprotein cross-linking domain; InterPro: IPR011130 The SecA ATPase is involved in the insertion and retraction of preproteins through the plasma membrane. This domain has been found to cross-link to preproteins, thought to indicate a role in preprotein binding. The pre-protein cross-linking domain is comprised of two sub domains that are inserted within the ATPase domain [].; GO: 0017038 protein import, 0016020 membrane; PDB: 3DIN_B 3JUX_A 3IQY_A 2IBM_A 3DL8_A 3JV2_B 3IQM_A 1TF2_A 1TF5_A 1M74_A ....
Probab=51.01 E-value=46 Score=22.80 Aligned_cols=53 Identities=15% Similarity=0.248 Sum_probs=37.3
Q ss_pred CHHHHHHHHHHHHHccccccCceEEEeccccccCCCCC-CCCeeccCCCCCCChhhh
Q psy1868 51 SERLFRTMADLVVSEGYAAVGYEYINIDDCWLDKTRSF-NGRLQADAKRFPRGIADL 106 (108)
Q Consensus 51 ~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~~~rd~-~G~~~~d~~rFP~G~k~L 106 (108)
+...+...-.+|.+.-|-..+-+||+-|+-=.- -|+ .||..++. ||.+|+...
T Consensus 58 ~~~~~~~i~~AL~A~~l~~rd~dYiV~dg~V~I--VDe~TGR~m~gR-rws~GLHQa 111 (113)
T PF01043_consen 58 NSELYHHINQALKAHHLFKRDVDYIVRDGEVVI--VDEFTGRIMPGR-RWSDGLHQA 111 (113)
T ss_dssp CHHHHHHHHHHHHHHHCSTTTTSEEEETTEEEE--BCTTTTSEBTT---STTTHHHH
T ss_pred cHHHHHHHHHHHHHHHHHhCCcceEEEcCEEEE--EECCCCCcCCCC-cCCchhhHh
Confidence 435566666777777788889999997765432 343 59999996 999999764
No 45
>PRK06520 5-methyltetrahydropteroyltriglutamate--homocysteine methyltransferase; Provisional
Probab=47.91 E-value=15 Score=29.76 Aligned_cols=32 Identities=19% Similarity=0.396 Sum_probs=22.3
Q ss_pred CHHHHHHHHHHHHH--ccccccCceEEEeccccc
Q psy1868 51 SERLFRTMADLVVS--EGYAAVGYEYINIDDCWL 82 (108)
Q Consensus 51 ~e~~i~~~ad~l~~--~Gl~~~Gy~yv~IDdgW~ 82 (108)
.|+.+.+.|+++.+ .-|.++|.+||+||+.-.
T Consensus 161 ~~~~~~dlA~al~~Ei~~L~~aG~~~IQiDep~l 194 (368)
T PRK06520 161 LDDYFDDLAKTWRDAIKAFYDAGCRYLQLDDTVW 194 (368)
T ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCEEEecCcch
Confidence 34556666665542 247789999999999753
No 46
>PF08479 POTRA_2: POTRA domain, ShlB-type; InterPro: IPR013686 The POTRA domain (for polypeptide-transport-associated domain) is found towards the N terminus of ShlB family proteins (IPR005565 from INTERPRO). ShlB is important in the secretion and activation of the haemolysin ShlA. It has been postulated that the POTRA domain has a chaperone-like function over ShlA; it may fold back into the C-terminal beta-barrel channel []. ; PDB: 2X8X_X 2QDZ_A 3NJT_A 3MC8_A 3MC9_B.
Probab=38.67 E-value=27 Score=21.51 Aligned_cols=26 Identities=23% Similarity=0.479 Sum_probs=19.1
Q ss_pred CCCCCHHHHHHHHHHHHHccccccCce
Q psy1868 47 ENCISERLFRTMADLVVSEGYAAVGYE 73 (108)
Q Consensus 47 ~~~i~e~~i~~~ad~l~~~Gl~~~Gy~ 73 (108)
+.|++.+.+.+.++.+... +++.||-
T Consensus 28 g~~l~~~~l~~~~~~l~~~-y~~~GY~ 53 (76)
T PF08479_consen 28 GRCLTLADLQQLADALTNY-YREKGYI 53 (76)
T ss_dssp TSBB-HHHHHHHHHHHHHH-HHHTT-T
T ss_pred CCCcCHHHHHHHHHHHHHH-HHHcCce
Confidence 4679999999999988654 7788884
No 47
>TIGR01264 tyr_amTase_E tyrosine aminotransferase, eukaryotic. This model describes tyrosine aminotransferase as found in animals and Trypanosoma cruzi. It is the first enzyme of a pathway of tyrosine degradation via homogentisate. Several plant enzyme designated as probable tyrosine aminotransferases are very closely related to an experimentally demonstrated nicotianamine aminotransferase, an enzyme in a siderophore (iron uptake chelator) biosynthesis pathway. These plant sequences are excluded from the model seed and score between the trusted an noise cutoffs.
Probab=37.91 E-value=79 Score=25.04 Aligned_cols=36 Identities=14% Similarity=0.105 Sum_probs=27.1
Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHccccccCceEEEecccccc
Q psy1868 42 CKNDPENCISERLFRTMADLVVSEGYAAVGYEYINIDDCWLD 83 (108)
Q Consensus 42 ~~~~~~~~i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~ 83 (108)
+.|+.+..++.+.+++.++...+.|+ +|++|+.+..
T Consensus 177 p~NPtG~~~~~~~~~~l~~~a~~~~~------~ii~De~y~~ 212 (401)
T TIGR01264 177 PSNPCGSVFSRQHLEEILAVAERQCL------PIIADEIYGD 212 (401)
T ss_pred CCCCCCCCCCHHHHHHHHHHHHHCCC------EEEEEhhhhh
Confidence 55666677888889998888765443 5899999874
No 48
>PLN02376 1-aminocyclopropane-1-carboxylate synthase
Probab=36.97 E-value=89 Score=26.22 Aligned_cols=37 Identities=8% Similarity=0.210 Sum_probs=28.9
Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHccccccCceEEEecccccc
Q psy1868 41 DCKNDPENCISERLFRTMADLVVSEGYAAVGYEYINIDDCWLD 83 (108)
Q Consensus 41 ~~~~~~~~~i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~ 83 (108)
+|.|+.+.+++++.+++.++...+.++ +|++|+-+..
T Consensus 207 nP~NPTG~~~s~e~l~~L~~~a~~~~i------~lI~DEiY~~ 243 (496)
T PLN02376 207 NPSNPLGTMLDKDTLTNLVRFVTRKNI------HLVVDEIYAA 243 (496)
T ss_pred CCCCCCCccCCHHHHHHHHHHHHHcCC------EEEEEcCccc
Confidence 367788888999999999888755433 5899998874
No 49
>TIGR00268 conserved hypothetical protein TIGR00268. The N-terminal region of the model shows similarity to Argininosuccinate synthase proteins using PSI-blast and using the recognize protein identification server.
Probab=35.00 E-value=34 Score=25.98 Aligned_cols=14 Identities=21% Similarity=0.506 Sum_probs=9.5
Q ss_pred cccccCceEEEecc
Q psy1868 66 GYAAVGYEYINIDD 79 (108)
Q Consensus 66 Gl~~~Gy~yv~IDd 79 (108)
.|+++||+||.+|-
T Consensus 233 ~~~~~gf~~v~ldl 246 (252)
T TIGR00268 233 KFKDIGFRKVLIDL 246 (252)
T ss_pred HHHHcCCCeEEEcc
Confidence 46667777777763
No 50
>PRK08960 hypothetical protein; Provisional
Probab=33.64 E-value=81 Score=24.85 Aligned_cols=36 Identities=11% Similarity=0.185 Sum_probs=28.1
Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHccccccCceEEEecccccc
Q psy1868 42 CKNDPENCISERLFRTMADLVVSEGYAAVGYEYINIDDCWLD 83 (108)
Q Consensus 42 ~~~~~~~~i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~ 83 (108)
+.|+.+..++.+.+.+.++..++.|+ ++++|+.+..
T Consensus 174 p~NPtG~~~~~~~~~~l~~~~~~~~~------~li~De~Y~~ 209 (387)
T PRK08960 174 PANPTGTLLSRDELAALSQALRARGG------HLVVDEIYHG 209 (387)
T ss_pred CCCCCCcCcCHHHHHHHHHHHHHcCC------EEEEEccccc
Confidence 56777888899999999988765433 4889998864
No 51
>PRK06855 aminotransferase; Validated
Probab=33.64 E-value=1.2e+02 Score=24.59 Aligned_cols=37 Identities=14% Similarity=0.299 Sum_probs=28.2
Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHccccccCceEEEecccccc
Q psy1868 41 DCKNDPENCISERLFRTMADLVVSEGYAAVGYEYINIDDCWLD 83 (108)
Q Consensus 41 ~~~~~~~~~i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~ 83 (108)
++.|+.+..++.+.+++.++...+.++ +|+.||-+..
T Consensus 179 ~P~NPTG~~~s~~~~~~l~~~a~~~~~------~II~De~Y~~ 215 (433)
T PRK06855 179 NPDNPTGAVYPKEILREIVDIAREYDL------FIICDEIYNN 215 (433)
T ss_pred CCCCCCCcCCCHHHHHHHHHHHHHcCC------EEEEeccccc
Confidence 367788888899999999888765432 5888998864
No 52
>cd02931 ER_like_FMN Enoate reductase (ER)-like FMN-binding domain. Enoate reductase catalyzes the NADH-dependent reduction of carbon-carbon double bonds of several molecules, including nonactivated 2-enoates, alpha,beta-unsaturated aldehydes, cyclic ketones, and methylketones. ERs are similar to 2,4-dienoyl-CoA reductase from E. coli and to the old yellow enzyme from Saccharomyces cerevisiae.
Probab=33.09 E-value=33 Score=27.93 Aligned_cols=53 Identities=15% Similarity=0.188 Sum_probs=36.1
Q ss_pred CCCCHHHHHHHHHHHH--HccccccCceEEEecc---ccc---------cCCCCCCCCeeccCCCCC
Q psy1868 48 NCISERLFRTMADLVV--SEGYAAVGYEYINIDD---CWL---------DKTRSFNGRLQADAKRFP 100 (108)
Q Consensus 48 ~~i~e~~i~~~ad~l~--~~Gl~~~Gy~yv~IDd---gW~---------~~~rd~~G~~~~d~~rFP 100 (108)
..+|++.|.+..+.++ +.-.+++||+-|.|=. |+- .+..|++|--..|..||+
T Consensus 138 ~~mt~~eI~~ii~~f~~AA~ra~~AGfDgVEih~ah~GyLl~qFLSp~~N~RtDeyGGslenR~rf~ 204 (382)
T cd02931 138 RELTTEEVETFVGKFGESAVIAKEAGFDGVEIHAVHEGYLLDQFTISLFNKRTDKYGGSLENRLRFA 204 (382)
T ss_pred CcCCHHHHHHHHHHHHHHHHHHHHcCCCEEEEeccccChHHHHhcCCccCCCCCcCCCCHHHHhHHH
Confidence 3479999999888665 2335578999999987 772 223466775555655654
No 53
>PLN00145 tyrosine/nicotianamine aminotransferase; Provisional
Probab=32.32 E-value=1.2e+02 Score=24.61 Aligned_cols=36 Identities=14% Similarity=0.110 Sum_probs=26.5
Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHccccccCceEEEecccccc
Q psy1868 42 CKNDPENCISERLFRTMADLVVSEGYAAVGYEYINIDDCWLD 83 (108)
Q Consensus 42 ~~~~~~~~i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~ 83 (108)
++|+.+..++++.+++.++...+.|+ +|++|+-+..
T Consensus 199 P~NPtG~v~~~~~l~~i~~~a~~~~i------~ii~De~Y~~ 234 (430)
T PLN00145 199 PNNPCGSVYSYEHLAKIAETARKLGI------LVIADEVYDH 234 (430)
T ss_pred CCCCCCCCCCHHHHHHHHHHHHHcCC------EEEEeccchh
Confidence 55666777899999998887655443 5888998864
No 54
>PLN02397 aspartate transaminase
Probab=31.88 E-value=92 Score=25.24 Aligned_cols=38 Identities=13% Similarity=0.105 Sum_probs=29.8
Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHccccccCceEEEeccccccC
Q psy1868 41 DCKNDPENCISERLFRTMADLVVSEGYAAVGYEYINIDDCWLDK 84 (108)
Q Consensus 41 ~~~~~~~~~i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~~ 84 (108)
+|.|+.+..++.+.+++.++...+.|+ +|+.|+-+..-
T Consensus 202 ~P~NPTG~v~s~e~l~~i~~~a~~~~~------~vI~De~Y~~l 239 (423)
T PLN02397 202 CAHNPTGVDPTPEQWEQISDLIKSKNH------LPFFDSAYQGF 239 (423)
T ss_pred CCCCCCCCCCCHHHHHHHHHHHHhCCc------EEEEecccCCc
Confidence 467788888999999999998765433 68999998753
No 55
>PF02113 Peptidase_S13: D-Ala-D-Ala carboxypeptidase 3 (S13) family; InterPro: IPR000667 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This family of serine peptidases belong to MEROPS peptidase family S13 (D-Ala-D-Ala carboxypeptidase C, clan SE). The predicted active site residues for members of this family and family S12 occur in the motif SXXK. D-Ala-D-Ala carboxypeptidase C is involved in the metabolism of cell components []; it is synthesised with a leader peptide to target it to the cell membrane []. After cleavage of the leader peptide, the enzyme is retained in the membrane by a C-terminal anchor []. There are three families of serine-type D-Ala-D-Ala peptidase (designated S11, S12 and S13), which are also known as low molecular weight penicillin-binding proteins []. Family S13 comprises D-Ala-D-Ala peptidases that have sufficient sequence similarity around their active sites to assume a distant evolutionary relationship to other clan members; members of the S13 family also bind penicillin and have D-amino-peptidase activity. Proteases of family S11 have exclusive D-Ala-D-Ala peptidase activity, while some members of S12 are C beta-lactamases [].; GO: 0004185 serine-type carboxypeptidase activity, 0006508 proteolysis; PDB: 3A3F_B 3A3E_B 3A3D_A 3A3I_B 2Y59_C 1W8Q_A 3ZVT_B 3ZVW_B 2VGJ_B 1W79_D ....
Probab=31.56 E-value=92 Score=25.97 Aligned_cols=39 Identities=18% Similarity=0.292 Sum_probs=26.9
Q ss_pred CCCCCHHHHHHHHHHHHHccccccCceEEEeccccccCCC
Q psy1868 47 ENCISERLFRTMADLVVSEGYAAVGYEYINIDDCWLDKTR 86 (108)
Q Consensus 47 ~~~i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~~~r 86 (108)
+..++.+.+.+.++.|.+.|++.. -..|++|+.++...+
T Consensus 91 DP~l~~~~l~~la~~l~~~Gi~~I-~g~i~~D~s~f~~~~ 129 (444)
T PF02113_consen 91 DPSLTSEDLWALAAQLKAAGIKRI-TGDIILDDSLFDGPR 129 (444)
T ss_dssp BTTBCHHHHHHHHHHHHHTT-SEE-SSEEEEETTTC-S-S
T ss_pred CCccCHHHHHHHHHHHHHcCCCeE-eeEEEEECcccCCCC
Confidence 334899999999999887777653 135699999986433
No 56
>PF02606 LpxK: Tetraacyldisaccharide-1-P 4'-kinase; InterPro: IPR003758 Tetraacyldisaccharide 4'-kinase phosphorylates the 4'-position of a tetraacyldisaccharide 1-phosphate precursor (DS-1-P) of lipid A, but the enzyme has not yet been purified because of instability []. This enzyme is involved in the synthesis of lipid A portion of the bacterial lipopolysaccharide layer (LPS).; GO: 0005524 ATP binding, 0009029 tetraacyldisaccharide 4'-kinase activity, 0009245 lipid A biosynthetic process
Probab=30.99 E-value=65 Score=25.82 Aligned_cols=17 Identities=18% Similarity=0.280 Sum_probs=13.7
Q ss_pred ccCceEEEeccccccCC
Q psy1868 69 AVGYEYINIDDCWLDKT 85 (108)
Q Consensus 69 ~~Gy~yv~IDdgW~~~~ 85 (108)
+.+.+.|++|||-|...
T Consensus 126 ~~~~dviilDDGfQh~~ 142 (326)
T PF02606_consen 126 EFPADVIILDDGFQHRR 142 (326)
T ss_pred HCCCCEEEEcCCccccc
Confidence 45689999999999543
No 57
>PRK00652 lpxK tetraacyldisaccharide 4'-kinase; Reviewed
Probab=30.92 E-value=68 Score=25.79 Aligned_cols=17 Identities=29% Similarity=0.276 Sum_probs=13.8
Q ss_pred ccCceEEEeccccccCC
Q psy1868 69 AVGYEYINIDDCWLDKT 85 (108)
Q Consensus 69 ~~Gy~yv~IDdgW~~~~ 85 (108)
..|.+.|++|||.|...
T Consensus 140 ~~~~dviilDDGfQh~~ 156 (325)
T PRK00652 140 AHGADIIILDDGLQHYR 156 (325)
T ss_pred cCCCCEEEEcCCccCcc
Confidence 34889999999999543
No 58
>cd06433 GT_2_WfgS_like WfgS and WfeV are involved in O-antigen biosynthesis. Escherichia coli WfgS and Shigella dysenteriae WfeV are glycosyltransferase 2 family enzymes involved in O-antigen biosynthesis. GT-2 enzymes have GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families.
Probab=30.92 E-value=83 Score=21.14 Aligned_cols=30 Identities=33% Similarity=0.421 Sum_probs=21.3
Q ss_pred HHHHHHHHHHHHHccccccCceEEEecccccc
Q psy1868 52 ERLFRTMADLVVSEGYAAVGYEYINIDDCWLD 83 (108)
Q Consensus 52 e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~ 83 (108)
++.+.+..+.+.+....+ ++.+++|||=.+
T Consensus 10 ~~~l~~~l~sl~~q~~~~--~evivvDd~s~d 39 (202)
T cd06433 10 AETLEETIDSVLSQTYPN--IEYIVIDGGSTD 39 (202)
T ss_pred HHHHHHHHHHHHhCCCCC--ceEEEEeCCCCc
Confidence 455667777776655544 899999999754
No 59
>PF04079 DUF387: Putative transcriptional regulators (Ypuh-like); InterPro: IPR005234 This family represents ScpB, which along with ScpA (IPR003768 from INTERPRO) interacts with SMC in vivo forming a complex that is required for chromosome condensation and segregation [, ]. The SMC-Scp complex appears to be similar to the MukB-MukE-Muk-F complex in Escherichia coli [], where MukB (IPR007406 from INTERPRO) is the homologue of SMC. ScpA and ScpB have little sequence similarity to MukE (IPR007385 from INTERPRO) or MukF (IPR005582 from INTERPRO), they are predicted to be structurally similar, being predominantly alpha-helical with coiled coil regions. In general scpA and scpB form an operon in most bacterial genomes. Flanking genes are highly variable suggesting that the operon has moved throughout evolution. Bacteria containing an smc gene also contain scpA or scpB but not necessarily both. An exception is found in Deinococcus radiodurans, which contains scpB but neither smc nor scpA. In the archaea the gene order SMC-ScpA is conserved in nearly all species, as is the very short distance between the two genes, indicating co-transcription of the both in different archaeal genera and arguing that interaction of the gene products is not confined to the homologues in Bacillus subtilis. It would seem probable that, in light of all the studies, SMC, ScpA and ScpB proteins or homologues act together in chromosome condensation and segregation in all prokaryotes []. ; GO: 0051304 chromosome separation; PDB: 1T6S_B 2Z99_A.
Probab=30.76 E-value=72 Score=23.08 Aligned_cols=35 Identities=20% Similarity=0.295 Sum_probs=23.9
Q ss_pred CHHHHHHHHHHHHHccc--cccCceEEEeccccccCCC
Q psy1868 51 SERLFRTMADLVVSEGY--AAVGYEYINIDDCWLDKTR 86 (108)
Q Consensus 51 ~e~~i~~~ad~l~~~Gl--~~~Gy~yv~IDdgW~~~~r 86 (108)
+++.|.+..+.|.+. + .+.|++.+.+++||+-..+
T Consensus 25 ~~~~v~~~l~~L~~~-y~~~~~gl~l~~~~~~y~l~tk 61 (159)
T PF04079_consen 25 SEDEVEEALEELQEE-YNEEDRGLELVEVGGGYRLQTK 61 (159)
T ss_dssp -HHHHHHHHHHHHHH-HHHCT-SEEEEEETTEEEEEE-
T ss_pred CHHHHHHHHHHHHHH-hccCCCCEEEEEECCEEEEEEh
Confidence 455677777766553 5 6789999999999975433
No 60
>PRK06207 aspartate aminotransferase; Provisional
Probab=30.35 E-value=1.4e+02 Score=23.92 Aligned_cols=36 Identities=17% Similarity=0.144 Sum_probs=28.3
Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHccccccCceEEEecccccc
Q psy1868 42 CKNDPENCISERLFRTMADLVVSEGYAAVGYEYINIDDCWLD 83 (108)
Q Consensus 42 ~~~~~~~~i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~ 83 (108)
+.|+.+..++.+.+++.++...+.+ -+|++|+-+..
T Consensus 187 P~NPTG~~~s~e~l~~l~~~a~~~~------~~iI~De~Y~~ 222 (405)
T PRK06207 187 PNNPAGVVYSAEEIAQIAALARRYG------ATVIVDQLYSR 222 (405)
T ss_pred CCCCCCcCCCHHHHHHHHHHHHHcC------CEEEEeccccc
Confidence 6777888889999999998876433 26999999874
No 61
>PLN02607 1-aminocyclopropane-1-carboxylate synthase
Probab=30.06 E-value=1.2e+02 Score=24.98 Aligned_cols=36 Identities=14% Similarity=0.228 Sum_probs=28.1
Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHccccccCceEEEecccccc
Q psy1868 42 CKNDPENCISERLFRTMADLVVSEGYAAVGYEYINIDDCWLD 83 (108)
Q Consensus 42 ~~~~~~~~i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~ 83 (108)
|.|+.+.+++++.+++.++...+.++ +++.|+-+..
T Consensus 209 P~NPtG~~~s~e~l~~l~~~~~~~~i------~lI~DEiYa~ 244 (447)
T PLN02607 209 PSNPLGATVQRSVLEDILDFVVRKNI------HLVSDEIYSG 244 (447)
T ss_pred CCCCcCcccCHHHHHHHHHHHHHCCC------EEEEeccccc
Confidence 67788888999999998887765433 5888998864
No 62
>cd03312 CIMS_N_terminal_like CIMS - Cobalamine-independent methonine synthase, or MetE, N-terminal domain_like. Many members have been characterized as 5-methyltetrahydropteroyltriglutamate-homocysteine methyltransferases, EC:2.1.1.14, mostly from bacteria and plants. This enzyme catalyses the last step in the production of methionine by transferring a methyl group from 5-methyltetrahydrofolate to L-homocysteine without using an intermediate methyl carrier. The active enzyme has a dual (beta-alpha)8-barrel structure, and this model covers the N-terminal barrel, and a few single-barrel sequences most similar to the N-terminal barrel. It is assumed that the homologous N-terminal barrel has evolved from the C-terminus via gene duplication and has subsequently lost binding sites, and it seems as if the two barrels forming the active enzyme may sometimes reside on different polypeptides. The C-terminal domain incorporates the Zinc ion, which binds and activates homocysteine. Side chains fro
Probab=29.92 E-value=54 Score=26.50 Aligned_cols=32 Identities=25% Similarity=0.310 Sum_probs=20.4
Q ss_pred HHHHHHHHHHHHH--ccccccCceEEEecccccc
Q psy1868 52 ERLFRTMADLVVS--EGYAAVGYEYINIDDCWLD 83 (108)
Q Consensus 52 e~~i~~~ad~l~~--~Gl~~~Gy~yv~IDdgW~~ 83 (108)
++.+.+.+.+..+ ..|.++|.+||+||+.-..
T Consensus 174 ~el~~dla~~y~~el~~L~~aG~~~IQiDEP~l~ 207 (360)
T cd03312 174 LSLLDKLLPVYKELLKKLAAAGAEWVQIDEPALV 207 (360)
T ss_pred HHHHHHHHHHHHHHHHHHHHCCCCEEEeeCChhh
Confidence 4455555543211 2378899999999996543
No 63
>cd06438 EpsO_like EpsO protein participates in the methanolan synthesis. The Methylobacillus sp EpsO protein is predicted to participate in the methanolan synthesis. Methanolan is an exopolysaccharide (EPS), composed of glucose, mannose and galactose. A 21 genes cluster was predicted to participate in the methanolan synthesis. Gene disruption analysis revealed that EpsO is one of the glycosyltransferase enzymes involved in the synthesis of repeating sugar units onto the lipid carrier.
Probab=29.61 E-value=93 Score=21.43 Aligned_cols=30 Identities=13% Similarity=0.197 Sum_probs=19.4
Q ss_pred HHHHHHHHHHHHHccccccCceEEEecccc
Q psy1868 52 ERLFRTMADLVVSEGYAAVGYEYINIDDCW 81 (108)
Q Consensus 52 e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW 81 (108)
++.|.+..+.+.+......-|+.|++|||=
T Consensus 9 ~~~i~~~l~sl~~~~~p~~~~eiivvdd~s 38 (183)
T cd06438 9 EAVIGNTVRSLKAQDYPRELYRIFVVADNC 38 (183)
T ss_pred HHHHHHHHHHHHhcCCCCcccEEEEEeCCC
Confidence 345666677776543322348899999983
No 64
>PRK15458 tagatose 6-phosphate aldolase subunit KbaZ; Provisional
Probab=29.16 E-value=59 Score=27.53 Aligned_cols=32 Identities=22% Similarity=0.329 Sum_probs=26.7
Q ss_pred CCHHHHHHHHHHHHHccccccCceEEEeccccc
Q psy1868 50 ISERLFRTMADLVVSEGYAAVGYEYINIDDCWL 82 (108)
Q Consensus 50 i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~ 82 (108)
.+++.-++.|+.|+.. .-++||+-|.||+-=.
T Consensus 100 lpa~eAM~~A~~li~a-yV~AGF~kIHLD~Sm~ 131 (426)
T PRK15458 100 LPAAQAMANADDLIKS-YVAAGFKKIHLDCSMS 131 (426)
T ss_pred CCHHHHHHHHHHHHHH-HHHcCCceEEecCCCC
Confidence 6788889999999875 6779999999998553
No 65
>TIGR00666 PBP4 D-alanyl-D-alanine carboxypeptidase, serine-type, PBP4 family. In E. coli, this protein is known as penicillin binding protein 4 (dacB). A signal sequence is cleaved from a precursor form. The protein is described as periplasmic in E. coli (Gram-negative) and extracellular in Actinomadura R39 (Gram-positive). Unlike some other proteins with similar activity, it does not form transpeptidation. It is not essential for viability. This family is related to class A beta-lactamases.
Probab=28.55 E-value=1.3e+02 Score=24.11 Aligned_cols=37 Identities=8% Similarity=0.224 Sum_probs=28.3
Q ss_pred CCCCCCHHHHHHHHHHHHHccccccCce-EEEeccccccC
Q psy1868 46 PENCISERLFRTMADLVVSEGYAAVGYE-YINIDDCWLDK 84 (108)
Q Consensus 46 ~~~~i~e~~i~~~ad~l~~~Gl~~~Gy~-yv~IDdgW~~~ 84 (108)
++..++.+.+.+.++.+.+.|++.. + -|++|+.++..
T Consensus 63 GDP~L~~~~L~~la~~l~~~Gi~~i--~G~v~~D~s~f~~ 100 (345)
T TIGR00666 63 GDPTLKRQDIRNLVATLKKSGVKQI--DGNVLVDTSAFSS 100 (345)
T ss_pred cCCCcCHHHHHHHHHHHHHcCCcEE--EeeEEEEcccccC
Confidence 3444888999999999988887753 2 38899988864
No 66
>PF07302 AroM: AroM protein; InterPro: IPR010843 This family consists of several bacterial and archaeal AroM proteins. In Escherichia coli the aroM gene is cotranscribed with aroL []. The function of this family is unknown.
Probab=28.09 E-value=64 Score=24.86 Aligned_cols=32 Identities=22% Similarity=0.269 Sum_probs=24.7
Q ss_pred CHHHHHHHHHHHHHccccccCceEEEecc-ccccCCCC
Q psy1868 51 SERLFRTMADLVVSEGYAAVGYEYINIDD-CWLDKTRS 87 (108)
Q Consensus 51 ~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDd-gW~~~~rd 87 (108)
+++.+.+.|..| ++.|-++|++|| |+...+|+
T Consensus 163 ~~~~l~~Aa~~L-----~~~gadlIvLDCmGYt~~~r~ 195 (221)
T PF07302_consen 163 DEEELAAAAREL-----AEQGADLIVLDCMGYTQEMRD 195 (221)
T ss_pred CHHHHHHHHHHH-----HhcCCCEEEEECCCCCHHHHH
Confidence 678888888886 467888999999 77654443
No 67
>TIGR02810 agaZ_gatZ D-tagatose-bisphosphate aldolase, class II, non-catalytic subunit. Aldolases specific for D-tagatose-bisphosphate occur in distinct pathways in Escherichia coli and other bacteria, one for the degradation of galactitol (formerly dulcitol) and one for degradation of N-acetyl-galactosamine and D-galactosamine. This family represents a protein of both systems that behaves as a non-catalytic subunit of D-tagatose-bisphosphate aldolase, required both for full activity and for good stability of the aldolase. Note that members of this protein family appear in public databases annotated as putative tagatose 6-phosphate kinases, possibly in error.
Probab=27.95 E-value=64 Score=27.28 Aligned_cols=31 Identities=19% Similarity=0.334 Sum_probs=26.2
Q ss_pred CCHHHHHHHHHHHHHccccccCceEEEecccc
Q psy1868 50 ISERLFRTMADLVVSEGYAAVGYEYINIDDCW 81 (108)
Q Consensus 50 i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW 81 (108)
.+++.-++.|+.|+.. .-++||+-|.||+-=
T Consensus 96 lpa~eAM~~A~~li~a-yV~AGF~kIHLD~Sm 126 (420)
T TIGR02810 96 LPADEAMAKAAALVDA-YVEAGFTKIHLDASM 126 (420)
T ss_pred CCHHHHHHHHHHHHHH-HHHcCCceEEecCCC
Confidence 6888899999999875 667999999999754
No 68
>TIGR03540 DapC_direct LL-diaminopimelate aminotransferase. This clade of the pfam00155 superfamily of aminotransferases includes several which are adjacent to elements of the lysine biosynthesis via diaminopimelate pathway (GenProp0125). Every member of this clade is from a genome which possesses most of the lysine biosynthesis pathway but lacks any of the known aminotransferases, succinylases, desuccinylases, acetylases or deacetylases typical of the acylated versions of this pathway nor do they have the direct, NADPH-dependent enzyme (ddh). Although there is no experimental characterization of any of the sequences in this clade, a direct pathway is known in plants and Chlamydia, so it seems quite reasonable that these enzymes catalyze the same transformation.
Probab=27.71 E-value=1.4e+02 Score=23.38 Aligned_cols=36 Identities=8% Similarity=0.211 Sum_probs=27.4
Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHccccccCceEEEecccccc
Q psy1868 42 CKNDPENCISERLFRTMADLVVSEGYAAVGYEYINIDDCWLD 83 (108)
Q Consensus 42 ~~~~~~~~i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~ 83 (108)
+.|+.+..++.+.+++.++...+.+ -++++|+-+..
T Consensus 173 P~NPtG~~~~~~~~~~i~~~a~~~~------~~ii~De~y~~ 208 (383)
T TIGR03540 173 PNNPTGAVAPLKFFKELVEFAKEYN------IIVCHDNAYSE 208 (383)
T ss_pred CCCCcCccCCHHHHHHHHHHHHHcC------EEEEEecchhh
Confidence 6677788889999999998876432 26889998863
No 69
>PRK05942 aspartate aminotransferase; Provisional
Probab=27.60 E-value=1.5e+02 Score=23.49 Aligned_cols=35 Identities=11% Similarity=0.198 Sum_probs=26.7
Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHccccccCceEEEeccccc
Q psy1868 42 CKNDPENCISERLFRTMADLVVSEGYAAVGYEYINIDDCWL 82 (108)
Q Consensus 42 ~~~~~~~~i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~ 82 (108)
+.|+.+..++.+.+++.++...+.++ +|++|+.+.
T Consensus 179 P~NPtG~~~s~~~~~~i~~~a~~~~~------~iI~De~y~ 213 (394)
T PRK05942 179 PSNPTTATAPREFFEEIVAFARKYEI------MLVHDLCYA 213 (394)
T ss_pred CCCCCCCcCCHHHHHHHHHHHHHcCe------EEEEeccch
Confidence 66777788899999998888755332 588899875
No 70
>PRK08354 putative aminotransferase; Provisional
Probab=27.50 E-value=1.1e+02 Score=23.40 Aligned_cols=36 Identities=14% Similarity=0.279 Sum_probs=27.2
Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHccccccCceEEEecccccc
Q psy1868 42 CKNDPENCISERLFRTMADLVVSEGYAAVGYEYINIDDCWLD 83 (108)
Q Consensus 42 ~~~~~~~~i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~ 83 (108)
|.|+.+..++.+.+++.++...+.| -++++|+-+..
T Consensus 126 P~NPTG~~~~~~~l~~l~~~a~~~~------~~li~De~y~~ 161 (311)
T PRK08354 126 PNNPDGKFYNFKELKPLLDAVEDRN------ALLILDEAFID 161 (311)
T ss_pred CCCCCCCccCHHHHHHHHHHhhhcC------cEEEEeCcchh
Confidence 6677777889999999888764332 26999998864
No 71
>COG0463 WcaA Glycosyltransferases involved in cell wall biogenesis [Cell envelope biogenesis, outer membrane]
Probab=27.18 E-value=92 Score=19.62 Aligned_cols=28 Identities=21% Similarity=0.386 Sum_probs=16.6
Q ss_pred HHHHHHHHHHHccccccCceEEEecccccc
Q psy1868 54 LFRTMADLVVSEGYAAVGYEYINIDDCWLD 83 (108)
Q Consensus 54 ~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~ 83 (108)
.+....+.+.+..... ++.|++|||=.+
T Consensus 17 ~l~~~l~s~~~q~~~~--~eiivvddgs~d 44 (291)
T COG0463 17 YLPEALESLLNQTYKD--FEIIVVDDGSTD 44 (291)
T ss_pred hHHHHHHHHHhhhhcc--eEEEEEeCCCCC
Confidence 3444444444433333 789999998654
No 72
>TIGR01265 tyr_nico_aTase tyrosine/nicotianamine aminotransferases. This subfamily of pyridoxal phosphate-dependent enzymes includes known examples of both tyrosine aminotransferase from animals and nicotianamine aminotransferase from barley.
Probab=27.16 E-value=1.7e+02 Score=23.30 Aligned_cols=36 Identities=17% Similarity=0.155 Sum_probs=26.0
Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHccccccCceEEEecccccc
Q psy1868 42 CKNDPENCISERLFRTMADLVVSEGYAAVGYEYINIDDCWLD 83 (108)
Q Consensus 42 ~~~~~~~~i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~ 83 (108)
+.|+.+..++.+.+++.++...+.|+ +|++|+.+..
T Consensus 178 p~NPtG~~~~~~~~~~i~~~a~~~~~------~ii~De~y~~ 213 (403)
T TIGR01265 178 PSNPCGSVFSRDHLQKIAEVARKLGI------PIIADEIYGH 213 (403)
T ss_pred CCCCCCCCCCHHHHHHHHHHHHHCCC------EEEEEccccc
Confidence 44556667788888888887765433 5999998764
No 73
>PLN02368 alanine transaminase
Probab=27.14 E-value=1.2e+02 Score=24.78 Aligned_cols=37 Identities=19% Similarity=0.314 Sum_probs=29.3
Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHccccccCceEEEecccccc
Q psy1868 41 DCKNDPENCISERLFRTMADLVVSEGYAAVGYEYINIDDCWLD 83 (108)
Q Consensus 41 ~~~~~~~~~i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~ 83 (108)
+|.|+.+..++++.+++.++...+.++ +|+.|+-+..
T Consensus 218 nP~NPTG~v~s~e~l~~l~~~a~~~~~------~II~DE~Y~~ 254 (407)
T PLN02368 218 NPGNPTGQCLSEANLREILKFCYQERL------VLLGDEVYQQ 254 (407)
T ss_pred CCCCCCCccCCHHHHHHHHHHHHHcCC------EEEEEccccc
Confidence 367788888899999999998765433 6999998865
No 74
>smart00674 CENPB Putative DNA-binding domain in centromere protein B, mouse jerky and transposases.
Probab=27.08 E-value=29 Score=20.57 Aligned_cols=30 Identities=27% Similarity=0.434 Sum_probs=23.1
Q ss_pred CCHHHHHHHHHHHHHccccccCceEEEecccccc
Q psy1868 50 ISERLFRTMADLVVSEGYAAVGYEYINIDDCWLD 83 (108)
Q Consensus 50 i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~ 83 (108)
+|.+.|++.|..|.. ..|.+.+.+..||-.
T Consensus 25 it~~~i~~~A~~i~~----~~~~~~f~~s~~Wl~ 54 (66)
T smart00674 25 ISGEQIREKALEILQ----RLGLENFKASNGWLT 54 (66)
T ss_pred CCHHHHHHHHHHHHH----HcCCCCCCCCHHHHH
Confidence 799999999999864 345556778888853
No 75
>cd02933 OYE_like_FMN Old yellow enzyme (OYE)-like FMN binding domain. OYE was the first flavin-dependent enzyme identified, however its true physiological role remains elusive to this day. Each monomer of OYE contains FMN as a non-covalently bound cofactor, uses NADPH as a reducing agent with oxygens, quinones, and alpha,beta-unsaturated aldehydes and ketones, and can act as electron acceptors in the catalytic reaction. Members of OYE family include 12-oxophytodienoate reductase, pentaerythritol tetranitrate reductase, morphinone reductase, and related enzymes.
Probab=26.82 E-value=59 Score=26.04 Aligned_cols=53 Identities=11% Similarity=0.132 Sum_probs=35.8
Q ss_pred CCCHHHHHHHHHHHH--HccccccCceEEEecccc-----------ccCCCCCCCCeeccCCCCCC
Q psy1868 49 CISERLFRTMADLVV--SEGYAAVGYEYINIDDCW-----------LDKTRSFNGRLQADAKRFPR 101 (108)
Q Consensus 49 ~i~e~~i~~~ad~l~--~~Gl~~~Gy~yv~IDdgW-----------~~~~rd~~G~~~~d~~rFP~ 101 (108)
.+|++.|.+..+.++ ..-.+++||+-|.|-.+- ..+..|++|--..|..||+.
T Consensus 141 ~mt~~eI~~ii~~f~~aA~~a~~aGfDgVeih~ahGyLl~qFlSp~~N~R~D~yGGslenR~rf~~ 206 (338)
T cd02933 141 ALTTEEIPGIVADFRQAARNAIEAGFDGVEIHGANGYLIDQFLRDGSNKRTDEYGGSIENRARFLL 206 (338)
T ss_pred CCCHHHHHHHHHHHHHHHHHHHHcCCCEEEEccccchhHHHhcCCccCCCCCcCCCcHHHhhhHHH
Confidence 379999998888665 234667899999997654 22223566544667777753
No 76
>PRK06959 putative threonine-phosphate decarboxylase; Provisional
Probab=26.78 E-value=1.4e+02 Score=23.37 Aligned_cols=40 Identities=10% Similarity=0.115 Sum_probs=29.7
Q ss_pred hhccCCCCCCCCCCCCHHHHHHHHHHHHHccccccCceEEEecccccc
Q psy1868 36 FRCNTDCKNDPENCISERLFRTMADLVVSEGYAAVGYEYINIDDCWLD 83 (108)
Q Consensus 36 ~~~~~~~~~~~~~~i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~ 83 (108)
+.|+ |.|+.+..++.+.+++.++.+.+. ..++++|+-+..
T Consensus 135 ~l~n--PnNPTG~~~s~~~l~~l~~~~~~~------~~~vI~DEay~~ 174 (339)
T PRK06959 135 IVVN--PNNPTAERLPAARLLRWHAQLAAR------GGTLIVDEAFAD 174 (339)
T ss_pred EEeC--CCCCCCCCCCHHHHHHHHHHHHHc------CCEEEEECCCcc
Confidence 4565 678888888999998888776432 236888998864
No 77
>PRK07337 aminotransferase; Validated
Probab=26.46 E-value=1.3e+02 Score=23.61 Aligned_cols=36 Identities=17% Similarity=0.291 Sum_probs=27.0
Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHccccccCceEEEecccccc
Q psy1868 42 CKNDPENCISERLFRTMADLVVSEGYAAVGYEYINIDDCWLD 83 (108)
Q Consensus 42 ~~~~~~~~i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~ 83 (108)
+.|+.+..++.+.+++.++...+.|+ ++++||.++.
T Consensus 172 p~NPtG~~~~~~~~~~i~~~a~~~~~------~ii~De~y~~ 207 (388)
T PRK07337 172 PSNPTGTSIAPDELRRIVEAVRARGG------FTIVDEIYQG 207 (388)
T ss_pred CCCCCCcCcCHHHHHHHHHHHHHCCC------EEEEeccccc
Confidence 55666777889999998888755432 6899998864
No 78
>PRK01906 tetraacyldisaccharide 4'-kinase; Provisional
Probab=26.33 E-value=96 Score=25.13 Aligned_cols=15 Identities=27% Similarity=0.195 Sum_probs=12.8
Q ss_pred cCceEEEeccccccC
Q psy1868 70 VGYEYINIDDCWLDK 84 (108)
Q Consensus 70 ~Gy~yv~IDdgW~~~ 84 (108)
.|.+.|++|||-|..
T Consensus 148 ~~~dviIlDDGfQH~ 162 (338)
T PRK01906 148 PGVDVIVSDDGLQHY 162 (338)
T ss_pred CCCCEEEECCCCccc
Confidence 478999999999954
No 79
>cd04192 GT_2_like_e Subfamily of Glycosyltransferase Family GT2 of unknown function. GT-2 includes diverse families of glycosyltransferases with a common GT-A type structural fold, which has two tightly associated beta/alpha/beta domains that tend to form a continuous central sheet of at least eight beta-strands. These are enzymes that catalyze the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. Glycosyltransferases have been classified into more than 90 distinct sequence based families.
Probab=26.27 E-value=1.2e+02 Score=21.04 Aligned_cols=29 Identities=17% Similarity=0.259 Sum_probs=19.5
Q ss_pred HHHHHHHHHHHHHccccccCceEEEeccc
Q psy1868 52 ERLFRTMADLVVSEGYAAVGYEYINIDDC 80 (108)
Q Consensus 52 e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdg 80 (108)
++.|.+..+.+.+.-...--++.|+||||
T Consensus 9 ~~~l~~~l~sl~~q~~~~~~~eiivvdd~ 37 (229)
T cd04192 9 AENLPRLLQSLSALDYPKEKFEVILVDDH 37 (229)
T ss_pred HHHHHHHHHHHHhCCCCCCceEEEEEcCC
Confidence 45577777777655443323799999998
No 80
>PLN02656 tyrosine transaminase
Probab=26.26 E-value=1.4e+02 Score=23.86 Aligned_cols=36 Identities=14% Similarity=0.086 Sum_probs=26.5
Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHccccccCceEEEecccccc
Q psy1868 42 CKNDPENCISERLFRTMADLVVSEGYAAVGYEYINIDDCWLD 83 (108)
Q Consensus 42 ~~~~~~~~i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~ 83 (108)
+.|+.+..++.+.+++.++...+.|+ +|++|+-+..
T Consensus 178 P~NPtG~~~s~~~~~~i~~~a~~~~~------~ii~De~y~~ 213 (409)
T PLN02656 178 PGNPCGNVYSYQHLKKIAETAEKLKI------LVIADEVYGH 213 (409)
T ss_pred CCCCCCCCCCHHHHHHHHHHHHHcCC------EEEEehhhhh
Confidence 45666677788889988887755432 6899999874
No 81
>PRK14582 pgaB outer membrane N-deacetylase; Provisional
Probab=26.20 E-value=1.3e+02 Score=26.91 Aligned_cols=36 Identities=14% Similarity=0.217 Sum_probs=25.7
Q ss_pred CCCCHHHHHHHHHHHHHcccc-----------c------cCceEEEecccccc
Q psy1868 48 NCISERLFRTMADLVVSEGYA-----------A------VGYEYINIDDCWLD 83 (108)
Q Consensus 48 ~~i~e~~i~~~ad~l~~~Gl~-----------~------~Gy~yv~IDdgW~~ 83 (108)
.+|+.+.+.++.+.|.+.|++ . -..-.+.+||||.+
T Consensus 67 ~~Vspe~Fe~qL~~Lk~nGY~~ISl~el~~~~~g~~~LP~K~VaLTFDDGy~s 119 (671)
T PRK14582 67 MSVRTSALREQFAWLRENGYQPVSVAQILEAHRGGKPLPEKAVLLTFDDGYSS 119 (671)
T ss_pred cccCHHHHHHHHHHHHHCcCEEccHHHHHHHHhcCCCCCCCeEEEEEEcCCCc
Confidence 468899999999998776653 1 12356778888864
No 82
>PRK06425 histidinol-phosphate aminotransferase; Validated
Probab=25.80 E-value=1.3e+02 Score=23.19 Aligned_cols=37 Identities=24% Similarity=0.381 Sum_probs=28.6
Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHccccccCceEEEecccccc
Q psy1868 41 DCKNDPENCISERLFRTMADLVVSEGYAAVGYEYINIDDCWLD 83 (108)
Q Consensus 41 ~~~~~~~~~i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~ 83 (108)
++.|+.+..++.+.+++.++...+.+. +|++|+-+..
T Consensus 130 nP~NPTG~~~s~~~~~~l~~~a~~~~~------~iI~DE~Y~~ 166 (332)
T PRK06425 130 SPDNPLGNLISRDSLLTISEICRKKGA------LLFIDEAFID 166 (332)
T ss_pred CCCCCcCCccCHHHHHHHHHHHHHcCC------EEEEecchhc
Confidence 477788888899999998887654332 6999998864
No 83
>PF00155 Aminotran_1_2: Aminotransferase class I and II 1-aminocyclopropane-1-carboxylate synthase signature aspartate aminotransferase signature; InterPro: IPR004839 Aminotransferases share certain mechanistic features with other pyridoxal-phosphate dependent enzymes, such as the covalent binding of the pyridoxal-phosphate group to a lysine residue. On the basis of sequence similarity, these various enzymes can be grouped [] into class I and class II. This entry includes proteins from both subfamilies.; GO: 0016769 transferase activity, transferring nitrogenous groups, 0030170 pyridoxal phosphate binding, 0009058 biosynthetic process; PDB: 3NRA_B 3P6K_B 3OP7_A 3ASB_A 3ASA_A 1W7M_A 3FVX_A 1W7N_A 3FVU_B 3FVS_A ....
Probab=25.76 E-value=1.2e+02 Score=23.27 Aligned_cols=37 Identities=22% Similarity=0.278 Sum_probs=26.4
Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHccccccCceEEEeccccccC
Q psy1868 42 CKNDPENCISERLFRTMADLVVSEGYAAVGYEYINIDDCWLDK 84 (108)
Q Consensus 42 ~~~~~~~~i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~~ 84 (108)
+.|+.+..++.+.+++.++...+.++ +|++|+.+..-
T Consensus 157 p~nPtG~~~~~~~l~~l~~~~~~~~~------~ii~De~y~~~ 193 (363)
T PF00155_consen 157 PNNPTGSVLSLEELRELAELAREYNI------IIIVDEAYSDL 193 (363)
T ss_dssp SBTTTTBB--HHHHHHHHHHHHHTTS------EEEEEETTTTG
T ss_pred cccccccccccccccchhhhhccccc------ceeeeeceecc
Confidence 44556667899999999998876533 59999998753
No 84
>PRK09276 LL-diaminopimelate aminotransferase; Provisional
Probab=25.75 E-value=1.9e+02 Score=22.59 Aligned_cols=36 Identities=11% Similarity=0.169 Sum_probs=27.4
Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHccccccCceEEEecccccc
Q psy1868 42 CKNDPENCISERLFRTMADLVVSEGYAAVGYEYINIDDCWLD 83 (108)
Q Consensus 42 ~~~~~~~~i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~ 83 (108)
|.|+.+..++.+.+++.++...+.++ +|++|+-+..
T Consensus 175 p~NPtG~~~~~~~~~~l~~~~~~~~~------~ii~De~y~~ 210 (385)
T PRK09276 175 PNNPTGAVADLEFFEEVVDFAKKYDI------IVCHDAAYSE 210 (385)
T ss_pred CCCCCCCCCCHHHHHHHHHHHHHCCc------EEEEecchhh
Confidence 66777788899999998887765332 6889998864
No 85
>PTZ00433 tyrosine aminotransferase; Provisional
Probab=25.68 E-value=1.6e+02 Score=23.46 Aligned_cols=36 Identities=11% Similarity=0.044 Sum_probs=26.4
Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHccccccCceEEEecccccc
Q psy1868 42 CKNDPENCISERLFRTMADLVVSEGYAAVGYEYINIDDCWLD 83 (108)
Q Consensus 42 ~~~~~~~~i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~ 83 (108)
+.|+.+..++.+.+++.++...+.++ +|++|+.+..
T Consensus 186 p~NPtG~~~s~~~~~~l~~~a~~~~~------~ii~De~y~~ 221 (412)
T PTZ00433 186 PSNPCGSNFSRKHVEDIIRLCEELRL------PLISDEIYAG 221 (412)
T ss_pred CCCCCCcccCHHHHHHHHHHHHHcCC------eEEEeccccc
Confidence 55666677789999888887654332 6899998864
No 86
>PRK08636 aspartate aminotransferase; Provisional
Probab=25.51 E-value=1.6e+02 Score=23.35 Aligned_cols=36 Identities=19% Similarity=0.375 Sum_probs=26.2
Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHccccccCceEEEecccccc
Q psy1868 42 CKNDPENCISERLFRTMADLVVSEGYAAVGYEYINIDDCWLD 83 (108)
Q Consensus 42 ~~~~~~~~i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~ 83 (108)
|.|+.+..++++..++.++...+.++ +|+.||-+..
T Consensus 184 P~NPTG~~~s~~~~~~l~~~a~~~~~------~II~De~Y~~ 219 (403)
T PRK08636 184 PHNPTTATVEKSFYERLVALAKKERF------YIISDIAYAD 219 (403)
T ss_pred CCCCCCccCCHHHHHHHHHHHHHcCc------EEEEeccchh
Confidence 45677777899999999987654322 5888887763
No 87
>PF09992 DUF2233: Predicted periplasmic protein (DUF2233); InterPro: IPR018711 This entry contains proteins that catalyze the second step in the formation of the mannose 6-phosphate targeting signal on lysosomal enzyme oligosaccharides, this is achieved by removing GlcNAc residues from GlcNAc-alpha-P-mannose moieties, which are formed in the first step.; PDB: 3OHG_A.
Probab=25.34 E-value=59 Score=22.75 Aligned_cols=27 Identities=26% Similarity=0.273 Sum_probs=16.5
Q ss_pred HHHHHHHHHHHHccccccCceEEEecccccc
Q psy1868 53 RLFRTMADLVVSEGYAAVGYEYINIDDCWLD 83 (108)
Q Consensus 53 ~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~ 83 (108)
-.+.+.++.|++.|..++ +++|.|=+.
T Consensus 116 ~tl~ela~~l~~lG~~~A----inLDGGgSs 142 (170)
T PF09992_consen 116 MTLDELAQLLKSLGCVDA----INLDGGGSS 142 (170)
T ss_dssp B-HHHHHHHHHHHT-SEE----EE---GGG-
T ss_pred CCHHHHHHHHHHcCcCeE----EEecCCcce
Confidence 357788888888888764 999999765
No 88
>cd02932 OYE_YqiM_FMN Old yellow enzyme (OYE) YqjM-like FMN binding domain. YqjM is involved in the oxidative stress response of Bacillus subtilis. Like the other OYE members, each monomer of YqjM contains FMN as a non-covalently bound cofactor and uses NADPH as a reducing agent. The YqjM enzyme exists as a homotetramer that is assembled as a dimer of catalytically dependent dimers, while other OYE members exist only as monomers or dimers. Moreover, the protein displays a shared active site architecture where an arginine finger at the COOH terminus of one monomer extends into the active site of the adjacent monomer and is directly involved in substrate recognition. Another remarkable difference in the binding of the ligand in YqjM is represented by the contribution of the NH2-terminal tyrosine instead of a COOH-terminal tyrosine in OYE and its homologs.
Probab=25.28 E-value=52 Score=26.00 Aligned_cols=52 Identities=17% Similarity=0.279 Sum_probs=33.8
Q ss_pred CCCHHHHHHHHHHHHH--ccccccCceEEEeccc--c---------ccCCCCCCCCeeccCCCCC
Q psy1868 49 CISERLFRTMADLVVS--EGYAAVGYEYINIDDC--W---------LDKTRSFNGRLQADAKRFP 100 (108)
Q Consensus 49 ~i~e~~i~~~ad~l~~--~Gl~~~Gy~yv~IDdg--W---------~~~~rd~~G~~~~d~~rFP 100 (108)
.+|++.|.+..+.+++ .-.+++||+-|.|-.+ + ..+..|++|--..|..||+
T Consensus 143 ~mt~~eI~~ii~~~~~aA~~a~~aGfDgVei~~~~gyLl~qFlsp~~N~R~D~yGgsl~nr~rf~ 207 (336)
T cd02932 143 ELTREEIAEVVDAFVAAARRAVEAGFDVIEIHAAHGYLLHQFLSPLSNKRTDEYGGSLENRMRFL 207 (336)
T ss_pred cCCHHHHHHHHHHHHHHHHHHHHcCCCEEEEccccccHHHHhcCCccCCCCcccCCCHHHHhHHH
Confidence 3899999888876653 3356789999999864 3 2233456665555554553
No 89
>PRK07568 aspartate aminotransferase; Provisional
Probab=25.20 E-value=1.7e+02 Score=22.92 Aligned_cols=36 Identities=8% Similarity=0.183 Sum_probs=25.8
Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHccccccCceEEEecccccc
Q psy1868 42 CKNDPENCISERLFRTMADLVVSEGYAAVGYEYINIDDCWLD 83 (108)
Q Consensus 42 ~~~~~~~~i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~ 83 (108)
+.|+.+..++.+.+.+.++...+.+ -+|++|+-+..
T Consensus 171 p~NPtG~~~~~~~~~~i~~~~~~~~------~~ii~De~y~~ 206 (397)
T PRK07568 171 PGNPTGVVYTKEELEMLAEIAKKHD------LFLISDEVYRE 206 (397)
T ss_pred CCCCCCccCCHHHHHHHHHHHHHCC------cEEEEeccchh
Confidence 4555666678888888888776543 36899998864
No 90
>cd06378 PBP1_iGluR_NMDA_NR2 N-terminal leucine/isoleucine/valine-binding protein (LIVBP)-like domain of the NR2 subunit of NMDA receptor family. N-terminal leucine/isoleucine/valine-binding protein (LIVBP)-like domain of the NR2 subunit of NMDA receptor family. The ionotropic N-methyl-d-asparate (NMDA) subtype of glutamate receptor serves critical functions in neuronal development, functioning, and degeneration in the mammalian central nervous system. The functional NMDA receptor is a heterotetramer composed of two NR1 and two NR2 (A, B, C, and D) or of NR3 (A and B) subunits. The receptor controls a cation channel that is highly permeable to monovalent ions and calcium and exhibits voltage-dependent inhibition by magnesium. Dual agonists, glutamate and glycine, are required for efficient activation of the NMDA receptor. Among NMDA receptor subtypes, the NR2B subunit containing receptors appear particularly important for pain perception; thus NR2B-selective antagonists may be useful in
Probab=25.02 E-value=1.4e+02 Score=23.79 Aligned_cols=46 Identities=20% Similarity=0.234 Sum_probs=29.7
Q ss_pred CHHHHHHHHHHHHHccccccCceEEEeccccccCCCCCCCCeeccCCCCCCChh
Q psy1868 51 SERLFRTMADLVVSEGYAAVGYEYINIDDCWLDKTRSFNGRLQADAKRFPRGIA 104 (108)
Q Consensus 51 ~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~~~rd~~G~~~~d~~rFP~G~k 104 (108)
+++.+....+...+.|+-..||.||.=|-.-.. ...+++.||.|+-
T Consensus 202 s~~~a~~if~~A~~~gm~g~~yvWI~t~~~~~~--------~~~~~~~~~~G~i 247 (362)
T cd06378 202 SKEEAEYIFRAARSAGLTGPGYVWIVPSLVLGN--------TDLGPSEFPVGLI 247 (362)
T ss_pred CHHHHHHHHHHHHHcCCcCCCeEEEecccccCC--------CccccccCCcceE
Confidence 455566666667777888888998887743332 1234457777754
No 91
>PF14871 GHL6: Hypothetical glycosyl hydrolase 6
Probab=24.96 E-value=1e+02 Score=21.49 Aligned_cols=43 Identities=16% Similarity=0.242 Sum_probs=34.1
Q ss_pred CCHHHHHHHHHHHHHccccccCceEEEeccccccCC-----CCCCCCe
Q psy1868 50 ISERLFRTMADLVVSEGYAAVGYEYINIDDCWLDKT-----RSFNGRL 92 (108)
Q Consensus 50 i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~~~-----rd~~G~~ 92 (108)
+..+.+.++.++.++.|++-..|--+..|.-....+ |+.+|++
T Consensus 41 L~~Dllge~v~a~h~~Girv~ay~~~~~d~~~~~~HPeW~~~~~~G~~ 88 (132)
T PF14871_consen 41 LKRDLLGEQVEACHERGIRVPAYFDFSWDEDAAERHPEWFVRDADGRP 88 (132)
T ss_pred CCcCHHHHHHHHHHHCCCEEEEEEeeecChHHHHhCCceeeECCCCCC
Confidence 677889999999999999988777777777665433 6777875
No 92
>TIGR00682 lpxK tetraacyldisaccharide 4'-kinase. Also called lipid-A 4'-kinase. This essential gene encodes an enzyme in the pathway of lipid A biosynthesis in Gram-negative organisms. A single copy of this protein is found in Gram-negative bacteria. PSI-BLAST converges on this set of apparent orthologs without identifying any other homologs.
Probab=24.89 E-value=1.1e+02 Score=24.41 Aligned_cols=16 Identities=19% Similarity=0.260 Sum_probs=13.4
Q ss_pred ccCceEEEeccccccC
Q psy1868 69 AVGYEYINIDDCWLDK 84 (108)
Q Consensus 69 ~~Gy~yv~IDdgW~~~ 84 (108)
..+.+.|++|||-|..
T Consensus 119 ~~~~dviilDDGfQh~ 134 (311)
T TIGR00682 119 QLDPDVIILDDGLQHR 134 (311)
T ss_pred cCCCCEEEECCCCcCc
Confidence 4579999999999954
No 93
>PRK08056 threonine-phosphate decarboxylase; Provisional
Probab=24.71 E-value=1.4e+02 Score=23.23 Aligned_cols=37 Identities=16% Similarity=0.214 Sum_probs=27.5
Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHccccccCceEEEecccccc
Q psy1868 41 DCKNDPENCISERLFRTMADLVVSEGYAAVGYEYINIDDCWLD 83 (108)
Q Consensus 41 ~~~~~~~~~i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~ 83 (108)
++.|+.+..++++.+++.++...+.+ -+|++|+-+.+
T Consensus 150 ~p~NPTG~~~~~~~~~~i~~~a~~~~------~~ii~De~y~~ 186 (356)
T PRK08056 150 TPNNPTGLLPERQLLQAIAERCKSLN------IALILDEAFID 186 (356)
T ss_pred CCcCCCCCCCCHHHHHHHHHHHHhcC------CEEEEecchhc
Confidence 36667777789998998888875433 36899998864
No 94
>COG1060 ThiH Thiamine biosynthesis enzyme ThiH and related uncharacterized enzymes [Coenzyme metabolism / General function prediction only]
Probab=24.66 E-value=87 Score=25.74 Aligned_cols=31 Identities=13% Similarity=-0.039 Sum_probs=22.4
Q ss_pred CCCCCCHHHHHHHHHHHHHccccccCceEEEecccc
Q psy1868 46 PENCISERLFRTMADLVVSEGYAAVGYEYINIDDCW 81 (108)
Q Consensus 46 ~~~~i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW 81 (108)
-.+.++++.|++.++.+++.|.+ -|.|=.|=
T Consensus 86 ~~y~Ls~eeI~~~~~~~~~~G~~-----Evli~gG~ 116 (370)
T COG1060 86 KAYTLSPEEILEEVREAVKRGIT-----EVLIVGGE 116 (370)
T ss_pred cccccCHHHHHHHHHHHHHcCCe-----EEEEecCc
Confidence 45678999999999998765544 56665443
No 95
>PF08013 Tagatose_6_P_K: Tagatose 6 phosphate kinase; InterPro: IPR012062 Escherichia coli and other enteric bacteria contain two closely related D-tagatose 1,6-bisphosphate (TagBP)-specific aldolases involved in catabolism of galactitol (genes gatY gatZ) and of N-acetyl-galactosamine and D-galactosamine (genes kbaY, kbaZ, also called agaY, agaZ). The catalytic subunits GatY/KbaY alone are sufficient to show aldolase activity and contain most or all of the residues that have been identified as essential in substrate/product recognition and catalysis for class II aldolases [, ]. However, these aldolases differ from other Class II aldolases (which are homodimeric enzymes) in that they require subunits GatZ/KbaZ for full activity and for good in vivo and in vitro stability. The Z subunits alone do not show any aldolase activity []. It should be noted that the previous suggestion of a tagatose 6P-kinase function for AgaZ [] and other members of this family turned out to be erroneous [, ].; GO: 0019402 galactitol metabolic process; PDB: 2FIQ_A 3TXV_A.
Probab=24.39 E-value=89 Score=26.47 Aligned_cols=31 Identities=16% Similarity=0.291 Sum_probs=22.4
Q ss_pred CCHHHHHHHHHHHHHccccccCceEEEecccc
Q psy1868 50 ISERLFRTMADLVVSEGYAAVGYEYINIDDCW 81 (108)
Q Consensus 50 i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW 81 (108)
..++.-++.|+.|+.. ..++||+-|.||+-=
T Consensus 100 lpaeeAM~~A~~li~a-yv~AGF~KIHLD~Sm 130 (424)
T PF08013_consen 100 LPAEEAMAKAKELIRA-YVEAGFTKIHLDCSM 130 (424)
T ss_dssp SBHHHHHHHHHHHHHH-HHCTT--EEEE---C
T ss_pred CCHHHHHHHHHHHHHH-HHHcCCceEeecCCC
Confidence 6788899999999875 778999999999865
No 96
>COG4874 Uncharacterized protein conserved in bacteria containing a pentein-type domain [Function unknown]
Probab=24.35 E-value=1.4e+02 Score=24.04 Aligned_cols=20 Identities=25% Similarity=0.491 Sum_probs=17.0
Q ss_pred HHHHHccccccCceEEEeccc
Q psy1868 60 DLVVSEGYAAVGYEYINIDDC 80 (108)
Q Consensus 60 d~l~~~Gl~~~Gy~yv~IDdg 80 (108)
++|++ ||++.|.+.+++||-
T Consensus 60 ~amve-~L~~~GvdV~ifddt 79 (318)
T COG4874 60 NAMVE-GLRQAGVDVVIFDDT 79 (318)
T ss_pred HHHHH-HHHhcCceEEEeecC
Confidence 56765 799999999999993
No 97
>PLN00143 tyrosine/nicotianamine aminotransferase; Provisional
Probab=24.32 E-value=1.5e+02 Score=23.83 Aligned_cols=36 Identities=14% Similarity=0.097 Sum_probs=28.4
Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHccccccCceEEEecccccc
Q psy1868 42 CKNDPENCISERLFRTMADLVVSEGYAAVGYEYINIDDCWLD 83 (108)
Q Consensus 42 ~~~~~~~~i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~ 83 (108)
|.|+.+..++++.+++.++...+.++ +|++|+-+..
T Consensus 179 P~NPTG~~~s~~~~~~l~~~a~~~~~------~ii~De~Y~~ 214 (409)
T PLN00143 179 PGNPCGSVYSYEHLNKIAETARKLGI------LVIADEVYGH 214 (409)
T ss_pred CCCCCCCccCHHHHHHHHHHHHHcCC------eEEEEccccc
Confidence 67788888899999999988765332 6999998874
No 98
>PF03065 Glyco_hydro_57: Glycosyl hydrolase family 57; InterPro: IPR004300 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 57 GH57 from CAZY comprises enzymes with two known activities; alpha-amylase (3.2.1.1 from EC) and 4-alpha-glucanotransferase (2.4.1 from EC).; GO: 0003824 catalytic activity, 0005975 carbohydrate metabolic process; PDB: 1K1X_B 1K1W_A 1K1Y_A 3N98_A 3N8T_A 3N92_A 1UFA_A 3P0B_A.
Probab=24.06 E-value=20 Score=28.54 Aligned_cols=14 Identities=29% Similarity=0.465 Sum_probs=9.5
Q ss_pred ccccCceEEEeccc
Q psy1868 67 YAAVGYEYINIDDC 80 (108)
Q Consensus 67 l~~~Gy~yv~IDdg 80 (108)
|+++||+|+++|.-
T Consensus 183 l~~~Gi~~~i~d~~ 196 (360)
T PF03065_consen 183 LAEAGIRYTILDGH 196 (360)
T ss_dssp HHHTT--EEEEECH
T ss_pred HHHcCCEEEEECcH
Confidence 45789999999963
No 99
>PRK05664 threonine-phosphate decarboxylase; Reviewed
Probab=23.91 E-value=1.6e+02 Score=22.69 Aligned_cols=40 Identities=13% Similarity=0.255 Sum_probs=29.3
Q ss_pred hhccCCCCCCCCCCCCHHHHHHHHHHHHHccccccCceEEEecccccc
Q psy1868 36 FRCNTDCKNDPENCISERLFRTMADLVVSEGYAAVGYEYINIDDCWLD 83 (108)
Q Consensus 36 ~~~~~~~~~~~~~~i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~ 83 (108)
+.|+ |.|+.+..++.+.+++.++...+. | -+|++|+.+..
T Consensus 129 ~l~n--P~NPTG~~~s~~~l~~l~~~~~~~-----~-~~iI~DE~y~~ 168 (330)
T PRK05664 129 VVVN--PNNPTGRRFDPARLLAWHARLAAR-----G-GWLVVDEAFMD 168 (330)
T ss_pred EEeC--CcCCCCCccCHHHHHHHHHHHHhc-----C-CEEEEECCccc
Confidence 4454 667778888999999888776532 3 27999998853
No 100
>PRK05839 hypothetical protein; Provisional
Probab=23.46 E-value=1.3e+02 Score=23.63 Aligned_cols=37 Identities=14% Similarity=0.281 Sum_probs=28.5
Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHccccccCceEEEecccccc
Q psy1868 41 DCKNDPENCISERLFRTMADLVVSEGYAAVGYEYINIDDCWLD 83 (108)
Q Consensus 41 ~~~~~~~~~i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~ 83 (108)
++.|+.+..++.+.+++.++...+.++ +|++|+.+..
T Consensus 163 nP~NPTG~~~s~~~l~~i~~~~~~~~~------~ii~DE~Y~~ 199 (374)
T PRK05839 163 SPNNPTGRTLSLEELIEWVKLALKHDF------ILINDECYSE 199 (374)
T ss_pred CCCCCcCcccCHHHHHHHHHHHHHcCC------EEEeccchhh
Confidence 367777888899999998888754322 6999999875
No 101
>PRK07550 hypothetical protein; Provisional
Probab=23.35 E-value=1.4e+02 Score=23.37 Aligned_cols=36 Identities=19% Similarity=0.294 Sum_probs=26.9
Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHccccccCceEEEecccccc
Q psy1868 42 CKNDPENCISERLFRTMADLVVSEGYAAVGYEYINIDDCWLD 83 (108)
Q Consensus 42 ~~~~~~~~i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~ 83 (108)
+.|+.+.+++.+.+++.++...+.|+ +|++||.+..
T Consensus 172 P~NPtG~~~~~~~~~~i~~~~~~~~~------~iI~Dd~y~~ 207 (386)
T PRK07550 172 PNNPTGVVYPPELLHELYDLARRHGI------ALILDETYRD 207 (386)
T ss_pred CCCCCCcccCHHHHHHHHHHHHHcCe------EEEEeccchh
Confidence 55666777898888888888765443 6899998863
No 102
>PRK07309 aromatic amino acid aminotransferase; Validated
Probab=23.32 E-value=2.3e+02 Score=22.42 Aligned_cols=36 Identities=14% Similarity=0.238 Sum_probs=26.0
Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHccccccCceEEEecccccc
Q psy1868 42 CKNDPENCISERLFRTMADLVVSEGYAAVGYEYINIDDCWLD 83 (108)
Q Consensus 42 ~~~~~~~~i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~ 83 (108)
+.|+.+..++.+.+++.++...+.++ +|++|+-+..
T Consensus 175 P~NPtG~~~s~~~~~~l~~~~~~~~~------~ii~D~~y~~ 210 (391)
T PRK07309 175 PANPTGVTYSREQIKALADVLKKYDI------FVISDEVYSE 210 (391)
T ss_pred CCCCCCcCcCHHHHHHHHHHHHHcCc------EEEEEccccc
Confidence 45566667788889998887754322 6889998864
No 103
>cd06913 beta3GnTL1_like Beta 1, 3-N-acetylglucosaminyltransferase is essential for the formation of poly-N-acetyllactosamine . This family includes human Beta3GnTL1 and related eukaryotic proteins. Human Beta3GnTL1 is a putative beta-1,3-N-acetylglucosaminyltransferase. Beta3GnTL1 is expressed at various levels in most of tissues examined. Beta 1, 3-N-acetylglucosaminyltransferase has been found to be essential for the formation of poly-N-acetyllactosamine. Poly-N-acetyllactosamine is a unique carbohydrate composed of N-acetyllactosamine repeats. It is often an important part of cell-type-specific oligosaccharide structures and some functional oligosaccharides. It has been shown that the structure and biosynthesis of poly-N-acetyllactosamine display a dramatic change during development and oncogenesis. Several members of beta-1, 3-N-acetylglucosaminyltransferase have been identified.
Probab=23.32 E-value=1.4e+02 Score=20.98 Aligned_cols=30 Identities=13% Similarity=0.196 Sum_probs=20.7
Q ss_pred CHHHHHHHHHHHHHccccccCceEEEecccc
Q psy1868 51 SERLFRTMADLVVSEGYAAVGYEYINIDDCW 81 (108)
Q Consensus 51 ~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW 81 (108)
.++.+.+..+.+...-+. ..++.|++|||=
T Consensus 8 ~~~~l~~~l~sl~~q~~~-~~~eiiVvDd~S 37 (219)
T cd06913 8 GEQWLDECLESVLQQDFE-GTLELSVFNDAS 37 (219)
T ss_pred cHHHHHHHHHHHHhCCCC-CCEEEEEEeCCC
Confidence 456677777777654333 248999999984
No 104
>PF08373 RAP: RAP domain; InterPro: IPR013584 The ~60-residue RAP (an acronym for RNA-binding domain abundant in Apicomplexans) domain is found in various proteins in eukaryotes. It is particularly abundant in apicomplexans and might mediate a range of cellular functions through its potential interactions with RNA []. The RAP domain consists of multiple blocks of charged and aromatics residues and is predicted to be composed of alpha helical and beta strand structures. Two predicted loop regions that are dominated by glycine and tryptophan residues are found before and after the central beta sheet []. Some proteins known to contain a RAP domain are listed below: Human hypothetical protein MGC5297, Mammalian FAST kinase domain-containing proteins (FASTKDs), Chlamydomonas reinhardtii chloroplastic trans-splicing factor Raa3.
Probab=23.24 E-value=50 Score=19.03 Aligned_cols=19 Identities=26% Similarity=0.361 Sum_probs=14.1
Q ss_pred HHHHccccccCceEEEecc
Q psy1868 61 LVVSEGYAAVGYEYINIDD 79 (108)
Q Consensus 61 ~l~~~Gl~~~Gy~yv~IDd 79 (108)
.|...-|+.+||+.|.|--
T Consensus 21 ~lk~r~L~~~G~~Vi~Ip~ 39 (58)
T PF08373_consen 21 KLKHRHLKALGYKVISIPY 39 (58)
T ss_pred HHHHHHHHHCCCEEEEecH
Confidence 4445557899999999864
No 105
>PLN02475 5-methyltetrahydropteroyltriglutamate--homocysteine methyltransferase
Probab=23.18 E-value=70 Score=28.85 Aligned_cols=32 Identities=22% Similarity=0.245 Sum_probs=22.8
Q ss_pred CHHHHHHHHHHHHH--ccccccCceEEEeccccc
Q psy1868 51 SERLFRTMADLVVS--EGYAAVGYEYINIDDCWL 82 (108)
Q Consensus 51 ~e~~i~~~ad~l~~--~Gl~~~Gy~yv~IDdgW~ 82 (108)
.++.+.+.|+++.+ .-|.++|.++|+||+.-.
T Consensus 577 ~~e~~~~iA~alr~Ev~~L~~aG~~~IQIDEPal 610 (766)
T PLN02475 577 RHETCYQIALAIKDEVEDLEKAGITVIQIDEAAL 610 (766)
T ss_pred HHHHHHHHHHHHHHHHHHHHHcCCCEEEEeCcch
Confidence 45666666665542 347789999999999654
No 106
>PF01717 Meth_synt_2: Cobalamin-independent synthase, Catalytic domain; InterPro: IPR002629 This is a domain of vitamin-B12 independent methionine synthases or 5-methyltetrahydropteroyltriglutamate--homocysteine methyltransferases, 2.1.1.14 from EC from bacteria and plants. Plants are the only higher eukaryotes that have the required enzymes for methionine synthesis []. This enzyme catalyses the last step in the production of methionine by transferring a methyl group from 5-methyltetrahydrofolate to homocysteine []. The aligned region makes up the carboxy region of the approximately 750 amino acid protein except in some hypothetical archaeal proteins present in the family, where this region corresponds to the entire length.; GO: 0003871 5-methyltetrahydropteroyltriglutamate-homocysteine S-methyltransferase activity, 0009086 methionine biosynthetic process; PDB: 1U22_A 1U1H_A 1U1U_A 1U1J_A 3BQ5_A 3BQ6_A 1XDJ_B 1XR2_B 1T7L_B 1XPG_B ....
Probab=23.14 E-value=78 Score=24.72 Aligned_cols=32 Identities=28% Similarity=0.420 Sum_probs=20.6
Q ss_pred CHHHHHHHHHHHHH--ccccccCceEEEeccccc
Q psy1868 51 SERLFRTMADLVVS--EGYAAVGYEYINIDDCWL 82 (108)
Q Consensus 51 ~e~~i~~~ad~l~~--~Gl~~~Gy~yv~IDdgW~ 82 (108)
-++.+.+.|+++.+ ..|.++|.+||+||+--.
T Consensus 145 ~~~~~~dla~a~~~ei~~l~~~G~~~iQiDeP~l 178 (324)
T PF01717_consen 145 REELLEDLAEAYREEIRALYDAGCRYIQIDEPAL 178 (324)
T ss_dssp HHHHHHHHHHHHHHHHHHHHHTT-SEEEEEETCH
T ss_pred HHHHHHHHHHHHHHHHHHHHHCCCCEEEecchHh
Confidence 34555666654442 247789999999999853
No 107
>PRK08175 aminotransferase; Validated
Probab=23.10 E-value=1.8e+02 Score=22.98 Aligned_cols=36 Identities=17% Similarity=0.281 Sum_probs=25.4
Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHccccccCceEEEecccccc
Q psy1868 42 CKNDPENCISERLFRTMADLVVSEGYAAVGYEYINIDDCWLD 83 (108)
Q Consensus 42 ~~~~~~~~i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~ 83 (108)
+.|+.+..++.+.+++.++...+.|+ ++++|+....
T Consensus 173 p~NPtG~~~~~~~~~~i~~~a~~~~i------~ii~De~y~~ 208 (395)
T PRK08175 173 PSNPTAQCVELEFFEKVVALAKRYDV------LVVHDLAYAD 208 (395)
T ss_pred CCCCCCCCCCHHHHHHHHHHHHHcCc------EEEEecchHh
Confidence 45566677799888888877654332 6888987753
No 108
>PRK09200 preprotein translocase subunit SecA; Reviewed
Probab=23.04 E-value=84 Score=28.57 Aligned_cols=49 Identities=18% Similarity=0.253 Sum_probs=35.0
Q ss_pred HHHHHHHHHccccccCceEEEeccccccCCCCCCCCeeccCCCCCCChhhh
Q psy1868 56 RTMADLVVSEGYAAVGYEYINIDDCWLDKTRSFNGRLQADAKRFPRGIADL 106 (108)
Q Consensus 56 ~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~~~rd~~G~~~~d~~rFP~G~k~L 106 (108)
.....+|.+.-+-..+.+||+-|+--.--.. ..|++.++. +|.+|+...
T Consensus 287 ~~i~~Al~A~~~~~~d~dYiV~~~~v~ivD~-~TGr~~~gr-~~s~GlhQa 335 (790)
T PRK09200 287 RHIILALRAHVLFKRDVDYIVYDGEIVLVDR-FTGRVLPGR-KLQDGLHQA 335 (790)
T ss_pred HHHHHHHHHHHHhhcCCcEEEECCEEEEEEC-CCCcCCCCC-ccChHHHHH
Confidence 4444566665566778999999987653222 359988887 999999765
No 109
>PF06955 XET_C: Xyloglucan endo-transglycosylase (XET) C-terminus; InterPro: IPR010713 This entry represents the C terminus (approximately 60 residues) of plant xyloglucan endo-transglycosylase (XET). Xyloglucan is the predominant hemicellulose in the cell walls of most dicotyledons. With cellulose, it forms a network that strengthens the cell wall. XET catalyses the splitting of xyloglucan chains and the linking of the newly generated reducing end to the non-reducing end of another xyloglucan chain, thereby loosening the cell wall []. ; GO: 0016762 xyloglucan:xyloglucosyl transferase activity, 0006073 cellular glucan metabolic process, 0005618 cell wall, 0048046 apoplast; PDB: 1UMZ_A 1UN1_B 2VH9_B 2UWC_A 2UWB_B 2UWA_C.
Probab=22.97 E-value=36 Score=19.99 Aligned_cols=9 Identities=56% Similarity=1.084 Sum_probs=4.3
Q ss_pred cCCCCCCCh
Q psy1868 95 DAKRFPRGI 103 (108)
Q Consensus 95 d~~rFP~G~ 103 (108)
|+.|||.++
T Consensus 39 D~~Rfp~~~ 47 (51)
T PF06955_consen 39 DTKRFPNPL 47 (51)
T ss_dssp -TTT-SGCG
T ss_pred CCCcCCCCC
Confidence 456777663
No 110
>COG0112 GlyA Glycine/serine hydroxymethyltransferase [Amino acid transport and metabolism]
Probab=22.92 E-value=73 Score=26.89 Aligned_cols=23 Identities=9% Similarity=0.209 Sum_probs=19.0
Q ss_pred HHHHHHHHHHHHccccccCceEEE
Q psy1868 53 RLFRTMADLVVSEGYAAVGYEYIN 76 (108)
Q Consensus 53 ~~i~~~ad~l~~~Gl~~~Gy~yv~ 76 (108)
++|.++|++|++. |.+.||+++.
T Consensus 287 ~qVv~NAkaLAe~-l~~~G~~vvs 309 (413)
T COG0112 287 KQVVKNAKALAEA-LKERGFKVVS 309 (413)
T ss_pred HHHHHHHHHHHHH-HHHcCCeEec
Confidence 5688899999876 8889999875
No 111
>PF13479 AAA_24: AAA domain
Probab=22.75 E-value=1.2e+02 Score=22.28 Aligned_cols=26 Identities=27% Similarity=0.279 Sum_probs=18.5
Q ss_pred CHHHHHHHHHHHHHccccccCceEEEecc
Q psy1868 51 SERLFRTMADLVVSEGYAAVGYEYINIDD 79 (108)
Q Consensus 51 ~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDd 79 (108)
+-+.+++..+.+.+. ..+|++|+||.
T Consensus 51 s~~~~~~~~~~l~~~---~~~y~tiVIDs 76 (213)
T PF13479_consen 51 SWEDFLEALDELEED---EADYDTIVIDS 76 (213)
T ss_pred CHHHHHHHHHHHHhc---cCCCCEEEEEC
Confidence 556667666666543 57899999994
No 112
>PRK15052 D-tagatose-1,6-bisphosphate aldolase subunit GatZ; Provisional
Probab=22.50 E-value=91 Score=26.41 Aligned_cols=31 Identities=16% Similarity=0.218 Sum_probs=25.5
Q ss_pred CCHHHHHHHHHHHHHccccccCceEEEecccc
Q psy1868 50 ISERLFRTMADLVVSEGYAAVGYEYINIDDCW 81 (108)
Q Consensus 50 i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW 81 (108)
.+++.-++.|+.|+.. .-++||+-|.||+-=
T Consensus 97 ~pa~eAM~~A~~li~a-yV~AGF~kIHLD~Sm 127 (421)
T PRK15052 97 EPADAAMEKSVELVKA-YVRAGFSKIHLDASM 127 (421)
T ss_pred CCHHHHHHHHHHHHHH-HHHcCCceEEecCCC
Confidence 4677888999988875 667999999999754
No 113
>PRK06348 aspartate aminotransferase; Provisional
Probab=22.46 E-value=1.7e+02 Score=22.97 Aligned_cols=37 Identities=14% Similarity=0.247 Sum_probs=27.5
Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHccccccCceEEEecccccc
Q psy1868 41 DCKNDPENCISERLFRTMADLVVSEGYAAVGYEYINIDDCWLD 83 (108)
Q Consensus 41 ~~~~~~~~~i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~ 83 (108)
+|.|+.+..++.+.+++.++...+.++ +|++|+.+..
T Consensus 170 ~p~NPtG~~~s~~~~~~l~~~a~~~~~------~ii~De~y~~ 206 (384)
T PRK06348 170 SPNNPTGAVFSKETLEEIAKIAIEYDL------FIISDEVYDG 206 (384)
T ss_pred CCCCCCCcCCCHHHHHHHHHHHHHCCe------EEEEeccccc
Confidence 366677778899999988887654332 6889998864
No 114
>KOG2366|consensus
Probab=22.39 E-value=78 Score=26.70 Aligned_cols=48 Identities=21% Similarity=0.293 Sum_probs=31.9
Q ss_pred CceeeechhhhhccCCCCCCCCCCCCHHHHHHHHHHHHHccccccCceEEEeccccccC
Q psy1868 26 PPMGWLAWERFRCNTDCKNDPENCISERLFRTMADLVVSEGYAAVGYEYINIDDCWLDK 84 (108)
Q Consensus 26 PpmGWnSW~~~~~~~~~~~~~~~~i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~~ 84 (108)
-+.||=+=... ..|.++|+ +.......|+.+ .+-|.+|+.+|.|....
T Consensus 122 LKlGiYsD~G~---~TC~g~PG---S~~~e~~DA~tF-----A~WgvDylKlD~C~~~~ 169 (414)
T KOG2366|consen 122 LKLGIYSDAGN---FTCAGYPG---SLGHEESDAKTF-----ADWGVDYLKLDGCFNNL 169 (414)
T ss_pred CceeeeeccCc---hhhccCCc---ccchhhhhhhhh-----HhhCCcEEecccccccc
Confidence 37787665542 23667777 334445556665 57899999999999754
No 115
>PF07244 Surf_Ag_VNR: Surface antigen variable number repeat; InterPro: IPR010827 This motif is found primarily in bacterial surface antigens, normally as variable number repeats at the N terminus. The C terminus of these proteins is normally represented by IPR000184 from INTERPRO. There may also be a relationship to haemolysin activator HlyB (IPR005565 from INTERPRO). The alignment centres on a -GY- or -GF- motif. Some members of this family are found in the mitochondria. It is predicted to have a mixed alpha/beta secondary structure.; GO: 0019867 outer membrane; PDB: 2X8X_X 3MC8_A 3OG5_A 3MC9_B 2QCZ_B 3EFC_A 3Q6B_A 2QDF_A 2V9H_A.
Probab=22.17 E-value=1.4e+02 Score=17.55 Aligned_cols=29 Identities=10% Similarity=0.220 Sum_probs=19.8
Q ss_pred CCCCHHHHHHHHHHHHHccccccCceEEEe
Q psy1868 48 NCISERLFRTMADLVVSEGYAAVGYEYINI 77 (108)
Q Consensus 48 ~~i~e~~i~~~ad~l~~~Gl~~~Gy~yv~I 77 (108)
..++++.+.+..+.|.+. +.+.||-+..|
T Consensus 30 ~~~~~~~i~~~~~~l~~~-y~~~Gy~~~~V 58 (78)
T PF07244_consen 30 DPFNPEKIEEDIERLQDY-YKDKGYFFAKV 58 (78)
T ss_dssp SBECHHHHHHHHHHHHHH-HHTTSCSEEEE
T ss_pred CEeCHHHHHHHHHHHHHH-HHHcCCcceEE
Confidence 447889999988888743 55667744443
No 116
>PTZ00376 aspartate aminotransferase; Provisional
Probab=22.12 E-value=1.7e+02 Score=23.33 Aligned_cols=37 Identities=14% Similarity=0.133 Sum_probs=28.5
Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHccccccCceEEEecccccc
Q psy1868 41 DCKNDPENCISERLFRTMADLVVSEGYAAVGYEYINIDDCWLD 83 (108)
Q Consensus 41 ~~~~~~~~~i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~ 83 (108)
+|.|+.+..++++.+++.++...+.+ . +|+.|+.+..
T Consensus 184 ~p~NPTG~~~s~~~~~~l~~~a~~~~-----~-~ii~De~Y~~ 220 (404)
T PTZ00376 184 CAHNPTGVDPTEEQWKEIADVMKRKN-----L-IPFFDMAYQG 220 (404)
T ss_pred CCCCCCCCCCCHHHHHHHHHHHHhCC-----c-EEEEehhhcC
Confidence 47778888889999999998875432 2 6888998864
No 117
>PF06543 Lac_bphage_repr: Lactococcus bacteriophage repressor; InterPro: IPR009498 This entry represents the C terminus of various Lactococcus bacteriophage repressor proteins.
Probab=22.09 E-value=73 Score=19.06 Aligned_cols=29 Identities=17% Similarity=0.599 Sum_probs=20.1
Q ss_pred CCceeeechhhhhccCCCCCCCCCCCCHHHHHHHHHHHH
Q psy1868 25 TPPMGWLAWERFRCNTDCKNDPENCISERLFRTMADLVV 63 (108)
Q Consensus 25 ~PpmGWnSW~~~~~~~~~~~~~~~~i~e~~i~~~ad~l~ 63 (108)
.|-.-|+.|-+|.-.+ ++|+ +++....+-
T Consensus 15 d~kvdWd~wvSf~GrP---------ltde-vK~a~k~i~ 43 (49)
T PF06543_consen 15 DPKVDWDKWVSFDGRP---------LTDE-VKEAMKLIF 43 (49)
T ss_pred CcccchHHheeeCCee---------CCHH-HHHHHHHHH
Confidence 3456799999998663 6776 666666553
No 118
>COG4451 RbcS Ribulose bisphosphate carboxylase small subunit [Energy production and conversion]
Probab=21.93 E-value=1.1e+02 Score=21.70 Aligned_cols=25 Identities=24% Similarity=0.660 Sum_probs=22.0
Q ss_pred CCHHHHHHHHHHHHHccccccCceEE
Q psy1868 50 ISERLFRTMADLVVSEGYAAVGYEYI 75 (108)
Q Consensus 50 i~e~~i~~~ad~l~~~Gl~~~Gy~yv 75 (108)
++.+.+.+|...+.+.|++ -|.+|+
T Consensus 19 Ltd~qi~~QVrylL~QGyk-igvE~~ 43 (127)
T COG4451 19 LTDEQIAEQVRYLLSQGYK-IGVEYV 43 (127)
T ss_pred CcHHHHHHHHHHHHhCCcc-cceeec
Confidence 7889999999999999998 678875
No 119
>PRK07366 succinyldiaminopimelate transaminase; Validated
Probab=21.87 E-value=1.6e+02 Score=23.10 Aligned_cols=36 Identities=11% Similarity=0.083 Sum_probs=27.5
Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHccccccCceEEEecccccc
Q psy1868 42 CKNDPENCISERLFRTMADLVVSEGYAAVGYEYINIDDCWLD 83 (108)
Q Consensus 42 ~~~~~~~~i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~ 83 (108)
+.|+.+..++.+.+++.++...+. | -+|++|+-+..
T Consensus 174 p~NPTG~~~s~~~~~~l~~~a~~~-----~-~~ii~De~Y~~ 209 (388)
T PRK07366 174 PHNPTTAIAPLSFFQEAVAFCQQH-----D-LVLVHDFPYVD 209 (388)
T ss_pred CCCCCCccCCHHHHHHHHHHHHHc-----C-eEEEEecchhh
Confidence 667778888999999998877543 2 36889998864
No 120
>TIGR03815 CpaE_hom_Actino helicase/secretion neighborhood CpaE-like protein. Members of this protein family belong to the MinD/ParA family of P-loop NTPases, and in particular show homology to the CpaE family of pilus assembly proteins (see PubMed:12370432). Nearly all members are found, not only in a gene context consistent with pilus biogenesis or a pilus-like secretion apparatus, but also near a DEAD/DEAH-box helicase, suggesting an involvement in DNA transfer activity. The model describes a clade restricted to the Actinobacteria.
Probab=21.81 E-value=1.1e+02 Score=23.65 Aligned_cols=27 Identities=15% Similarity=0.118 Sum_probs=19.0
Q ss_pred CCHHHHHHHHHHHHHccccccCceEEEeccccc
Q psy1868 50 ISERLFRTMADLVVSEGYAAVGYEYINIDDCWL 82 (108)
Q Consensus 50 i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~ 82 (108)
++++.+++..+.+. ..|+||+||.+=.
T Consensus 189 ~~~~~l~~~l~~l~------~~~D~VIID~p~~ 215 (322)
T TIGR03815 189 LPPAAVRAVLDAAR------RGGDLVVVDLPRR 215 (322)
T ss_pred CCHHHHHHHHHHHH------hcCCEEEEeCCCC
Confidence 56666776666553 3499999999854
No 121
>PRK07865 N-succinyldiaminopimelate aminotransferase; Reviewed
Probab=21.80 E-value=1.7e+02 Score=22.79 Aligned_cols=37 Identities=16% Similarity=0.357 Sum_probs=27.3
Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHccccccCceEEEecccccc
Q psy1868 41 DCKNDPENCISERLFRTMADLVVSEGYAAVGYEYINIDDCWLD 83 (108)
Q Consensus 41 ~~~~~~~~~i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~ 83 (108)
++.|+.+..++.+.+++.++...+.|+ ++++|+.+..
T Consensus 156 ~p~NPtG~~~~~~~~~~i~~~a~~~~~------~ii~De~y~~ 192 (364)
T PRK07865 156 SPSNPTGRVLGVDHLRKVVAWARERGA------VVASDECYLE 192 (364)
T ss_pred CCCCCCCccCCHHHHHHHHHHHHHcCC------EEEEecchhh
Confidence 356667777898888888887654332 6999999874
No 122
>PRK09257 aromatic amino acid aminotransferase; Provisional
Probab=21.74 E-value=1.8e+02 Score=22.97 Aligned_cols=37 Identities=16% Similarity=0.301 Sum_probs=28.7
Q ss_pred CCCCCCCCCCCHHHHHHHHHHHHHccccccCceEEEecccccc
Q psy1868 41 DCKNDPENCISERLFRTMADLVVSEGYAAVGYEYINIDDCWLD 83 (108)
Q Consensus 41 ~~~~~~~~~i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~ 83 (108)
++.|+.+..++.+..++.++...+.++ +|+.|+.+..
T Consensus 180 ~p~NPTG~~~s~~~~~~l~~~a~~~~~------~ii~De~Y~~ 216 (396)
T PRK09257 180 CCHNPTGADLTPEQWDELAELLKERGL------IPFLDIAYQG 216 (396)
T ss_pred CCCCCCCCCCCHHHHHHHHHHHHhCCc------EEEEeccccc
Confidence 467788888899999999988765433 5888998864
No 123
>PF13614 AAA_31: AAA domain; PDB: 2VED_B 2PH1_A 3EA0_B 3FKQ_A 3KB1_B 1ION_A 3LA6_H 3BFV_B 3CIO_D.
Probab=21.61 E-value=64 Score=21.66 Aligned_cols=28 Identities=18% Similarity=0.190 Sum_probs=19.6
Q ss_pred CCHHHHHHHHHHHHHccccccCceEEEecccccc
Q psy1868 50 ISERLFRTMADLVVSEGYAAVGYEYINIDDCWLD 83 (108)
Q Consensus 50 i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~ 83 (108)
++.+.+.+..+.+.+ .|++|+||.+-..
T Consensus 102 ~~~~~~~~li~~l~~------~yd~IivD~~~~~ 129 (157)
T PF13614_consen 102 LTPEDVEELIDALKE------HYDYIIVDLPSSL 129 (157)
T ss_dssp HTSHHHHHHHHHHHH------HSSEEEEEEESTT
T ss_pred cCHHHHHHHHHHHHH------cCCEEEEECcCCc
Confidence 455566666666643 5999999998753
No 124
>PLN02450 1-aminocyclopropane-1-carboxylate synthase
Probab=21.44 E-value=1.6e+02 Score=24.29 Aligned_cols=36 Identities=8% Similarity=0.145 Sum_probs=28.7
Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHccccccCceEEEecccccc
Q psy1868 42 CKNDPENCISERLFRTMADLVVSEGYAAVGYEYINIDDCWLD 83 (108)
Q Consensus 42 ~~~~~~~~i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~ 83 (108)
|.|+.+.+++.+.+++.++...+.++ +|+.|+.+..
T Consensus 200 P~NPTG~~~s~e~l~~ll~~a~~~~~------~iI~DE~Y~~ 235 (468)
T PLN02450 200 PSNPLGTTTTRTELNLLVDFITAKNI------HLISDEIYSG 235 (468)
T ss_pred CCCCCCcccCHHHHHHHHHHHHHCCc------EEEEEccccc
Confidence 77788888999999999988765433 5899999874
No 125
>KOG0186|consensus
Probab=21.29 E-value=98 Score=26.79 Aligned_cols=41 Identities=12% Similarity=0.274 Sum_probs=30.9
Q ss_pred CCCCceeeechhhhhccCCCCCCCCCCCCHHHHHHHHHHHHHccccccCceEE
Q psy1868 23 ALTPPMGWLAWERFRCNTDCKNDPENCISERLFRTMADLVVSEGYAAVGYEYI 75 (108)
Q Consensus 23 ~~~PpmGWnSW~~~~~~~~~~~~~~~~i~e~~i~~~ad~l~~~Gl~~~Gy~yv 75 (108)
.+.-|.-||||.+|--+ +-+.+...++.-+..|+. .|.+.|
T Consensus 313 ~k~~p~v~NTyQaYLKd-----------a~e~l~~d~k~A~reG~~-~G~KLV 353 (506)
T KOG0186|consen 313 DKERPIVFNTYQAYLKD-----------AGERLHLDLKLARREGWP-FGAKLV 353 (506)
T ss_pred CCCCceEeeeHHHHHhh-----------HHHHHHHHHHHHHHcCCc-hhhhhh
Confidence 56779999999999876 667787778877777776 555543
No 126
>PRK11113 D-alanyl-D-alanine carboxypeptidase/endopeptidase; Provisional
Probab=21.11 E-value=2e+02 Score=24.20 Aligned_cols=38 Identities=13% Similarity=0.230 Sum_probs=26.5
Q ss_pred CCCCCCHHHHHHHHHHHHHccccccCceEEEeccccccC
Q psy1868 46 PENCISERLFRTMADLVVSEGYAAVGYEYINIDDCWLDK 84 (108)
Q Consensus 46 ~~~~i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~~ 84 (108)
++..++.+.+.+.++.|++.|++.-- --|++|+..+..
T Consensus 107 GDPtL~~~~L~~la~~l~~~GI~~I~-G~lv~D~s~f~~ 144 (477)
T PRK11113 107 GDPTLTRQDLRNMVATLKKSGVKQID-GNLLIDTSVFAS 144 (477)
T ss_pred cCCCCCHHHHHHHHHHHHHcCCcEEe-eeEEEECccccC
Confidence 34448888899999998877766420 146888888754
No 127
>PRK06107 aspartate aminotransferase; Provisional
Probab=21.06 E-value=2.4e+02 Score=22.40 Aligned_cols=37 Identities=19% Similarity=0.209 Sum_probs=26.0
Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHccccccCceEEEecccccc
Q psy1868 42 CKNDPENCISERLFRTMADLVVSEGYAAVGYEYINIDDCWLD 83 (108)
Q Consensus 42 ~~~~~~~~i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~ 83 (108)
+.|..+...+.+.+++.++...+. .--++++|+-+..
T Consensus 175 p~NPtG~~~s~~~~~~l~~~a~~~-----~~~~iI~De~y~~ 211 (402)
T PRK06107 175 PSNPTGAVYSRAELRALADVLLRH-----PHVLVLTDDIYDH 211 (402)
T ss_pred CCCCCCcCcCHHHHHHHHHHHHHc-----CCeEEEEehhccc
Confidence 456666777888899988877542 1126889998863
No 128
>PF11148 DUF2922: Protein of unknown function (DUF2922); InterPro: IPR021321 This bacterial family of proteins has no known function.
Probab=20.79 E-value=1.6e+02 Score=17.97 Aligned_cols=23 Identities=9% Similarity=0.169 Sum_probs=18.4
Q ss_pred CCCCCHHHHHHHHHHHHHccccc
Q psy1868 47 ENCISERLFRTMADLVVSEGYAA 69 (108)
Q Consensus 47 ~~~i~e~~i~~~ad~l~~~Gl~~ 69 (108)
-..++++.|++.++.|++.++-.
T Consensus 23 k~~lt~~~V~~~m~~ii~~~vf~ 45 (69)
T PF11148_consen 23 KEDLTEAEVKAAMQAIIAKKVFA 45 (69)
T ss_pred CCCCCHHHHHHHHHHHHHhCCee
Confidence 33489999999999999876543
No 129
>KOG3858|consensus
Probab=20.65 E-value=74 Score=24.83 Aligned_cols=13 Identities=15% Similarity=0.146 Sum_probs=10.9
Q ss_pred CCCceeeechhhh
Q psy1868 24 LTPPMGWLAWERF 36 (108)
Q Consensus 24 ~~PpmGWnSW~~~ 36 (108)
..+++-|||-|.-
T Consensus 25 ~~~~VyWNSSNp~ 37 (233)
T KOG3858|consen 25 NLHPVYWNSSNPR 37 (233)
T ss_pred cccceEecCCCcc
Confidence 3799999999973
No 130
>cd04747 OYE_like_5_FMN Old yellow enzyme (OYE)-related FMN binding domain, group 5. Each monomer of OYE contains FMN as a non-covalently bound cofactor, uses NADPH as a reducing agent with oxygens, quinones, and alpha,beta-unsaturated aldehydes and ketones, and can act as electron acceptors in the catalytic reaction. Other members of OYE family include trimethylamine dehydrogenase, 2,4-dienoyl-CoA reductase, enoate reductase, pentaerythriol tetranitrate reductase, xenobiotic reductase, and morphinone reductase.
Probab=20.65 E-value=73 Score=25.94 Aligned_cols=52 Identities=13% Similarity=0.188 Sum_probs=35.5
Q ss_pred CCCHHHHHHHHHHHHH--ccccccCceEEEecccc-----------ccCCCCCCCCeeccCCCCC
Q psy1868 49 CISERLFRTMADLVVS--EGYAAVGYEYINIDDCW-----------LDKTRSFNGRLQADAKRFP 100 (108)
Q Consensus 49 ~i~e~~i~~~ad~l~~--~Gl~~~Gy~yv~IDdgW-----------~~~~rd~~G~~~~d~~rFP 100 (108)
.+|++.|.+..+.++. .-.+++||+-|.|=.+- ..+..|++|-=..|..||+
T Consensus 133 ~mt~~eI~~ii~~f~~AA~~a~~aGfDgVeih~ahGyLl~qFLSp~~N~RtDeYGGslenR~Rf~ 197 (361)
T cd04747 133 EMTEADIDDVIAAFARAAADARRLGFDGIELHGAHGYLIDQFFWAGTNRRADGYGGSLAARSRFA 197 (361)
T ss_pred cCCHHHHHHHHHHHHHHHHHHHHcCCCEEEEecccchHHHHhcCCCCCCCCCCCCCCHHHHHHHH
Confidence 4899999988886652 34567899999998765 1112366776566666664
No 131
>PF12419 DUF3670: SNF2 Helicase protein ; InterPro: IPR022138 This domain family is found in bacteria, archaea and eukaryotes, and is approximately 140 amino acids in length. The family is found in association with PF00271 from PFAM, PF00176 from PFAM. Most of the proteins in this family are annotated as SNF2 helicases but there is little accompanying literature to confirm this.
Probab=20.58 E-value=1.2e+02 Score=21.24 Aligned_cols=32 Identities=13% Similarity=0.252 Sum_probs=21.7
Q ss_pred CCCCCHHHHHHHHHHHHHccccccCceEEEeccc
Q psy1868 47 ENCISERLFRTMADLVVSEGYAAVGYEYINIDDC 80 (108)
Q Consensus 47 ~~~i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdg 80 (108)
+..+|++.+.+.++. +.+|...+=+||.+|-.
T Consensus 81 d~~Ls~eEf~~L~~~--~~~LV~~rg~WV~ld~~ 112 (141)
T PF12419_consen 81 DEELSEEEFEQLVEQ--KRPLVRFRGRWVELDPE 112 (141)
T ss_pred CEECCHHHHHHHHHc--CCCeEEECCEEEEECHH
Confidence 345899998887764 45666666667766654
No 132
>PF07631 PSD4: Protein of unknown function (DUF1592); InterPro: IPR013042 A region of similarity shared by several Rhodopirellula baltica cytochrome-like proteins that are predicted to be secreted. These proteins also contain IPR011478 from INTERPRO, IPR013036 from INTERPRO, IPR013039 from INTERPRO and IPR013043 from INTERPRO.
Probab=20.50 E-value=21 Score=24.96 Aligned_cols=45 Identities=18% Similarity=0.286 Sum_probs=28.5
Q ss_pred CHHHHHHHHHHHHHccccccCceEEEe--ccccccCCCCCCCCeeccCCCCCC
Q psy1868 51 SERLFRTMADLVVSEGYAAVGYEYINI--DDCWLDKTRSFNGRLQADAKRFPR 101 (108)
Q Consensus 51 ~e~~i~~~ad~l~~~Gl~~~Gy~yv~I--DdgW~~~~rd~~G~~~~d~~rFP~ 101 (108)
+++.+.++++.|.+. -..+.++- =.-|.. -+..+...+|+++||.
T Consensus 34 ~~~~l~~q~~RML~d----pr~~~~~~~F~~qWL~--l~~~~~~~~d~~~~p~ 80 (128)
T PF07631_consen 34 TPEQLRAQAERMLAD----PRARRFVERFFRQWLD--LDRLDSIVKDPEKFPE 80 (128)
T ss_pred CHHHHHHHHHHHHcC----ccHHHHHHHHHHHHhC--CCcccccCCChhhccc
Confidence 788999999998542 22222211 124542 3456788899999986
No 133
>cd02930 DCR_FMN 2,4-dienoyl-CoA reductase (DCR) FMN-binding domain. DCR in E. coli is an iron-sulfur flavoenzyme which contains FMN, FAD, and a 4Fe-4S cluster. It is also a monomer, unlike that of its eukaryotic counterparts which form homotetramers and lack the flavin and iron-sulfur cofactors. Metabolism of unsaturated fatty acids requires auxiliary enzymes in addition to those used in b-oxidation. After a given number of cycles through the b-oxidation pathway, those unsaturated fatty acyl-CoAs with double bonds at even-numbered carbon positions contain 2-trans, 4-cis double bonds that can not be modified by enoyl-CoA hydratase. DCR utilizes NADPH to remove the C4-C5 double bond. DCR can catalyze the reduction of both natural fatty acids with cis double bonds, as well as substrates containing trans double bonds. The reaction is initiated by hybrid transfer from NADPH to FAD, which in turn transfers electrons, one at a time, to FMN via the 4Fe-4S cluster. The fully reduced FMN provi
Probab=20.49 E-value=72 Score=25.46 Aligned_cols=31 Identities=16% Similarity=0.142 Sum_probs=23.5
Q ss_pred CCCHHHHHHHHHHHHH--ccccccCceEEEecc
Q psy1868 49 CISERLFRTMADLVVS--EGYAAVGYEYINIDD 79 (108)
Q Consensus 49 ~i~e~~i~~~ad~l~~--~Gl~~~Gy~yv~IDd 79 (108)
.+|++.|.+..+.++. .-.+++||+.|-|=.
T Consensus 126 ~mt~~eI~~i~~~f~~aA~~a~~aGfDgVeih~ 158 (353)
T cd02930 126 ELSEEEIEQTIEDFARCAALAREAGYDGVEIMG 158 (353)
T ss_pred CCCHHHHHHHHHHHHHHHHHHHHcCCCEEEEec
Confidence 3799999988886653 235679999999943
No 134
>cd02525 Succinoglycan_BP_ExoA ExoA is involved in the biosynthesis of succinoglycan. Succinoglycan Biosynthesis Protein ExoA catalyzes the formation of a beta-1,3 linkage of the second sugar (glucose) of the succinoglycan with the galactose on the lipid carrie. Succinoglycan is an acidic exopolysaccharide that is important for invasion of the nodules. Succinoglycan is a high-molecular-weight polymer composed of repeating octasaccharide units. These units are synthesized on membrane-bound isoprenoid lipid carriers, beginning with galactose followed by seven glucose molecules, and modified by the addition of acetate, succinate, and pyruvate. ExoA is a membrane protein with a transmembrance domain at c-terminus.
Probab=20.40 E-value=1.7e+02 Score=20.53 Aligned_cols=31 Identities=16% Similarity=0.255 Sum_probs=20.4
Q ss_pred CHHHHHHHHHHHHHccccccCceEEEecccc
Q psy1868 51 SERLFRTMADLVVSEGYAAVGYEYINIDDCW 81 (108)
Q Consensus 51 ~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW 81 (108)
.++.+.+..+.+.+......-|+.|++|++=
T Consensus 11 ~~~~l~~~l~sl~~q~~~~~~~evivvd~~s 41 (249)
T cd02525 11 EEKYIEELLESLLNQSYPKDLIEIIVVDGGS 41 (249)
T ss_pred chhhHHHHHHHHHhccCCCCccEEEEEeCCC
Confidence 3455666667776554433458999999883
No 135
>TIGR01140 L_thr_O3P_dcar L-threonine-O-3-phosphate decarboxylase. This family contains pyridoxal phosphate-binding class II aminotransferases (see PFAM:PF00222) closely related to, yet distinct from, histidinol-phosphate aminotransferase (HisC). It is found in cobalamin biosynthesis operons in Salmonella typhimurium and Bacillus halodurans (each of which also has HisC) and has been shown to have L-threonine-O-3-phosphate decarboxylase activity in Salmonella. Although the gene symbol cobD was assigned in Salmonella, cobD in other contexts refers to a different cobalamin biosynthesis enzyme, modeled by pfam03186 and called cbiB in Salmonella.
Probab=20.36 E-value=1.9e+02 Score=22.25 Aligned_cols=36 Identities=17% Similarity=0.332 Sum_probs=27.0
Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHccccccCceEEEecccccc
Q psy1868 42 CKNDPENCISERLFRTMADLVVSEGYAAVGYEYINIDDCWLD 83 (108)
Q Consensus 42 ~~~~~~~~i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~ 83 (108)
+.|+.+..++.+.+++.++...+.|+ +|++|+.+..
T Consensus 134 p~NPtG~~~~~~~~~~l~~~a~~~~~------~ii~De~y~~ 169 (330)
T TIGR01140 134 PNNPTGRLIPPETLLALAARLRARGG------WLVVDEAFID 169 (330)
T ss_pred CCCCCCCCCCHHHHHHHHHHhHhcCC------EEEEECcccc
Confidence 55666677899999988888765443 6999998853
No 136
>TIGR03537 DapC succinyldiaminopimelate transaminase. Note: the detailed information included in the EC:2.6.1.17 record includes the assertions that the enzyme uses the pyridoxal pyrophosphate cofactor, which is consistent with the pfam00155 family, and the assertion that the amino group donor is L-glutamate, which is undetermined for the sequences in this clade.
Probab=20.32 E-value=1.9e+02 Score=22.36 Aligned_cols=36 Identities=11% Similarity=0.298 Sum_probs=26.3
Q ss_pred CCCCCCCCCCHHHHHHHHHHHHHccccccCceEEEecccccc
Q psy1868 42 CKNDPENCISERLFRTMADLVVSEGYAAVGYEYINIDDCWLD 83 (108)
Q Consensus 42 ~~~~~~~~i~e~~i~~~ad~l~~~Gl~~~Gy~yv~IDdgW~~ 83 (108)
+.|+.+..++.+.+++.++...+.|+ +|++|+.+..
T Consensus 145 p~NPtG~~~~~~~~~~l~~~a~~~~~------~ii~De~y~~ 180 (350)
T TIGR03537 145 PHNPTGATAPRSYLKETIAMCREHGI------ILCSDECYTE 180 (350)
T ss_pred CCCCcCcccCHHHHHHHHHHHHHcCc------EEEEeccccc
Confidence 55666677898889998887755432 5899998753
No 137
>PF01656 CbiA: CobQ/CobB/MinD/ParA nucleotide binding domain; InterPro: IPR002586 This entry consists of various cobyrinic acid a,c-diamide synthases. These include CbiA and CbiP from Salmonella typhimurium []., and CobQ from Rhodobacter capsulatus []. These amidases catalyse amidations to various side chains of hydrogenobyrinic acid or cobyrinic acid a,c-diamide in the biosynthesis of cobalamin (vitamin B12) from uroporphyrinogen III. Vitamin B12 is an important cofactor and an essential nutrient for many plants and animals and is primarily produced by bacteria [].; PDB: 3K9G_A 3K9H_B 3EZ9_B 3EZF_A 3EZ2_B 3EZ6_A 3EZ7_A 1G3Q_A 1G3R_A 1DTS_A ....
Probab=20.32 E-value=1.3e+02 Score=20.71 Aligned_cols=10 Identities=50% Similarity=0.840 Sum_probs=8.8
Q ss_pred CceEEEeccc
Q psy1868 71 GYEYINIDDC 80 (108)
Q Consensus 71 Gy~yv~IDdg 80 (108)
.|+||+||.+
T Consensus 94 ~yD~iiiD~~ 103 (195)
T PF01656_consen 94 DYDYIIIDTP 103 (195)
T ss_dssp TSSEEEEEEC
T ss_pred cccceeeccc
Confidence 3999999986
Done!