Query psy15702
Match_columns 259
No_of_seqs 165 out of 723
Neff 7.8
Searched_HMMs 46136
Date Fri Aug 16 19:19:12 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy15702.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/15702hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG3861|consensus 100.0 1.1E-57 2.4E-62 394.7 13.8 256 2-257 45-308 (438)
2 TIGR03521 GldG gliding-associa 99.9 2.8E-27 6E-32 227.9 16.5 196 4-210 226-512 (552)
3 PF09822 ABC_transp_aux: ABC-t 99.6 3.5E-15 7.6E-20 132.0 8.2 72 3-74 191-271 (271)
4 COG3225 GldG ABC-type uncharac 99.6 1.1E-14 2.3E-19 134.7 11.1 198 2-209 222-489 (538)
5 PF03345 DDOST_48kD: Oligosacc 99.5 7.1E-13 1.5E-17 123.1 12.4 174 9-198 47-258 (423)
6 KOG2754|consensus 99.3 9.2E-12 2E-16 111.4 10.6 169 10-193 73-280 (443)
7 PF06283 ThuA: Trehalose utili 97.2 0.0046 1E-07 52.8 11.0 163 2-191 45-217 (217)
8 KOG4266|consensus 97.1 0.004 8.6E-08 60.1 10.4 63 8-70 678-763 (1033)
9 PF14258 DUF4350: Domain of un 97.0 0.0013 2.8E-08 45.9 4.9 37 7-43 32-69 (70)
10 COG5426 Uncharacterized membra 95.7 0.11 2.3E-06 43.7 9.4 40 154-193 211-253 (254)
11 PF09960 DUF2194: Uncharacteri 95.7 0.06 1.3E-06 52.7 9.3 56 5-69 97-154 (585)
12 TIGR02336 1,3-beta-galactosyl- 94.9 0.51 1.1E-05 46.6 12.4 157 3-193 494-659 (719)
13 cd03143 A4_beta-galactosidase_ 94.5 0.061 1.3E-06 43.2 4.6 36 6-42 50-85 (154)
14 cd03142 GATase1_ThuA Type 1 gl 94.5 0.08 1.7E-06 45.4 5.4 44 2-45 51-96 (215)
15 PF08532 Glyco_hydro_42M: Beta 94.3 0.037 7.9E-07 47.0 2.9 40 6-46 54-93 (207)
16 PF13709 DUF4159: Domain of un 94.1 0.16 3.5E-06 43.2 6.4 42 4-45 48-90 (207)
17 PF07090 DUF1355: Protein of u 93.8 0.047 1E-06 45.4 2.7 43 5-47 63-111 (177)
18 PF07685 GATase_3: CobB/CobQ-l 85.2 1.1 2.3E-05 36.4 3.5 43 4-46 2-51 (158)
19 PF09825 BPL_N: Biotin-protein 81.7 1.9 4.1E-05 40.0 4.1 41 6-46 46-91 (367)
20 PF09508 Lact_bio_phlase: Lact 71.9 11 0.00023 37.5 6.2 161 5-199 493-661 (716)
21 COG1245 Predicted ATPase, RNas 69.1 9.5 0.00021 36.6 5.1 67 5-71 469-564 (591)
22 PRK05569 flavodoxin; Provision 67.9 16 0.00036 28.3 5.7 65 6-70 45-117 (141)
23 KOG0257|consensus 62.9 13 0.00029 34.8 4.8 43 6-49 169-215 (420)
24 PHA03411 putative methyltransf 61.4 12 0.00027 33.3 4.2 72 8-79 125-221 (279)
25 TIGR00537 hemK_rel_arch HemK-r 60.6 25 0.00054 28.5 5.7 38 27-67 123-160 (179)
26 PF13380 CoA_binding_2: CoA bi 59.9 27 0.00059 26.6 5.4 58 6-71 52-110 (116)
27 PRK00107 gidB 16S rRNA methylt 58.9 20 0.00043 29.9 4.8 60 8-73 111-170 (187)
28 PF04412 DUF521: Protein of un 57.9 15 0.00032 34.6 4.2 64 5-70 285-354 (400)
29 PRK06242 flavodoxin; Provision 55.7 37 0.00081 26.5 5.8 65 6-70 40-108 (150)
30 cd03144 GATase1_ScBLP_like Typ 55.0 9.8 0.00021 29.3 2.1 36 8-43 43-83 (114)
31 cd06542 GH18_EndoS-like Endo-b 53.6 56 0.0012 28.1 7.0 43 5-47 23-74 (255)
32 TIGR01755 flav_wrbA NAD(P)H:qu 53.3 15 0.00032 30.8 3.2 64 4-69 63-141 (197)
33 PF04016 DUF364: Domain of unk 53.2 5.7 0.00012 31.8 0.6 62 4-73 57-118 (147)
34 PRK04266 fibrillarin; Provisio 52.2 36 0.00078 29.2 5.5 63 6-68 138-206 (226)
35 PF11303 DUF3105: Protein of u 51.7 59 0.0013 25.5 6.1 40 6-47 46-86 (130)
36 TIGR01753 flav_short flavodoxi 50.5 38 0.00082 25.8 5.0 63 6-68 42-114 (140)
37 COG1832 Predicted CoA-binding 50.0 59 0.0013 25.9 5.8 56 4-67 68-123 (140)
38 PRK05568 flavodoxin; Provision 50.0 42 0.00091 25.9 5.2 65 6-70 45-116 (142)
39 PF13344 Hydrolase_6: Haloacid 49.4 30 0.00064 25.6 4.0 51 23-75 16-66 (101)
40 TIGR02469 CbiT precorrin-6Y C5 48.8 30 0.00066 25.5 4.1 37 7-44 86-122 (124)
41 PRK14813 NADH dehydrogenase su 46.3 42 0.00092 28.1 4.7 39 5-46 63-103 (189)
42 cd01355 AcnX Putative Aconitas 45.9 42 0.0009 31.5 5.2 60 8-70 277-341 (389)
43 PRK14968 putative methyltransf 45.8 56 0.0012 26.2 5.5 39 27-68 131-169 (188)
44 PF08468 MTS_N: Methyltransfer 44.9 54 0.0012 26.5 5.1 51 8-64 68-122 (155)
45 TIGR01957 nuoB_fam NADH-quinon 44.9 50 0.0011 26.5 4.9 39 5-46 53-93 (145)
46 PF05846 Chordopox_A15: Chordo 44.5 60 0.0013 23.8 4.7 40 5-48 41-83 (90)
47 PF08282 Hydrolase_3: haloacid 42.6 36 0.00078 28.3 4.0 45 17-66 11-55 (254)
48 PF04270 Strep_his_triad: Stre 42.0 16 0.00035 24.1 1.3 19 18-36 34-52 (53)
49 PF01210 NAD_Gly3P_dh_N: NAD-d 41.0 12 0.00027 29.9 0.8 56 5-62 65-120 (157)
50 TIGR00138 gidB 16S rRNA methyl 40.9 46 0.001 27.4 4.3 55 7-67 107-164 (181)
51 TIGR01019 sucCoAalpha succinyl 40.8 58 0.0013 29.1 5.1 59 5-67 57-117 (286)
52 COG1492 CobQ Cobyric acid synt 40.5 44 0.00095 32.2 4.5 42 7-48 288-336 (486)
53 TIGR01457 HAD-SF-IIA-hyp2 HAD- 40.0 59 0.0013 28.1 5.0 63 9-75 7-69 (249)
54 COG2813 RsmC 16S RNA G1207 met 39.2 43 0.00093 30.2 4.0 63 4-69 32-95 (300)
55 PF02593 dTMP_synthase: Thymid 38.9 80 0.0017 27.1 5.5 60 7-69 49-108 (217)
56 TIGR03534 RF_mod_PrmC protein- 38.7 63 0.0014 27.3 5.0 36 27-66 200-235 (251)
57 PRK03767 NAD(P)H:quinone oxido 38.7 36 0.00079 28.3 3.4 64 4-69 64-142 (200)
58 cd02440 AdoMet_MTases S-adenos 37.8 41 0.00089 22.9 3.1 37 7-43 64-103 (107)
59 COG0677 WecC UDP-N-acetyl-D-ma 37.7 82 0.0018 29.8 5.7 66 4-71 79-155 (436)
60 PF10131 PTPS_related: 6-pyruv 37.4 54 0.0012 32.7 4.8 63 3-69 481-543 (616)
61 PF12847 Methyltransf_18: Meth 37.3 25 0.00055 25.6 2.0 40 4-43 65-110 (112)
62 PF03464 eRF1_2: eRF1 domain 2 37.1 75 0.0016 24.6 4.8 50 7-61 71-123 (133)
63 PRK10513 sugar phosphate phosp 36.7 97 0.0021 26.6 5.9 45 17-66 16-60 (270)
64 PF12123 Amidase02_C: N-acetyl 36.3 29 0.00062 22.1 1.8 24 12-38 17-40 (45)
65 TIGR01177 conserved hypothetic 35.9 65 0.0014 29.1 4.8 59 8-73 247-316 (329)
66 PRK13302 putative L-aspartate 35.0 73 0.0016 28.1 4.9 57 5-67 63-119 (271)
67 PRK08287 cobalt-precorrin-6Y C 34.9 65 0.0014 26.2 4.3 57 7-67 95-151 (187)
68 PRK05678 succinyl-CoA syntheta 34.7 78 0.0017 28.4 5.0 59 5-67 59-119 (291)
69 TIGR01458 HAD-SF-IIA-hyp3 HAD- 34.6 86 0.0019 27.2 5.2 66 8-75 6-73 (257)
70 PF11293 DUF3094: Protein of u 34.4 50 0.0011 21.8 2.7 20 20-39 2-21 (55)
71 cd06905 Peptidase_M14-like_8 A 33.9 52 0.0011 30.5 3.8 32 15-47 228-259 (360)
72 PRK14816 NADH dehydrogenase su 33.8 84 0.0018 26.3 4.6 41 5-46 77-117 (182)
73 PRK00121 trmB tRNA (guanine-N( 33.7 63 0.0014 26.9 4.0 58 8-68 110-177 (202)
74 PF00891 Methyltransf_2: O-met 33.5 37 0.0008 29.0 2.7 39 4-43 154-198 (241)
75 COG3845 ABC-type uncharacteriz 33.3 1.2E+02 0.0027 29.2 6.3 46 2-47 414-462 (501)
76 PRK14967 putative methyltransf 33.0 93 0.002 26.2 5.0 37 27-66 142-178 (223)
77 PRK06067 flagellar accessory p 33.0 1.1E+02 0.0024 25.9 5.5 64 2-69 18-81 (234)
78 PTZ00146 fibrillarin; Provisio 33.0 1.4E+02 0.003 26.9 6.3 59 7-68 200-267 (293)
79 COG1385 Uncharacterized protei 32.2 51 0.0011 28.8 3.3 24 9-32 191-214 (246)
80 PRK00170 azoreductase; Reviewe 32.0 48 0.001 27.3 3.0 18 7-24 84-101 (201)
81 PRK06703 flavodoxin; Provision 31.5 1.2E+02 0.0026 23.7 5.2 64 6-69 45-118 (151)
82 PRK10530 pyridoxal phosphate ( 31.5 1.1E+02 0.0023 26.2 5.3 45 17-66 16-60 (272)
83 cd01018 ZntC Metal binding pro 31.3 1.4E+02 0.003 26.0 6.0 48 17-67 196-243 (266)
84 PRK00103 rRNA large subunit me 31.2 83 0.0018 25.5 4.2 42 24-66 54-95 (157)
85 COG4019 Uncharacterized protei 31.1 49 0.0011 26.0 2.6 42 23-67 22-63 (156)
86 PRK11713 16S ribosomal RNA met 31.0 47 0.001 28.6 2.9 22 11-32 179-200 (234)
87 PF01297 TroA: Periplasmic sol 30.7 95 0.0021 26.7 4.8 49 16-67 177-225 (256)
88 PRK09328 N5-glutamine S-adenos 30.6 99 0.0022 26.6 5.0 35 28-66 222-256 (275)
89 TIGR02716 C20_methyl_CrtF C-20 30.6 69 0.0015 28.5 4.0 35 9-43 215-253 (306)
90 TIGR03704 PrmC_rel_meth putati 30.4 1.1E+02 0.0023 26.6 5.1 35 29-67 201-235 (251)
91 COG1236 YSH1 Predicted exonucl 30.1 72 0.0016 30.2 4.2 57 6-65 177-241 (427)
92 PRK15126 thiamin pyrimidine py 30.0 1E+02 0.0022 26.7 4.9 45 17-66 15-59 (272)
93 COG3845 ABC-type uncharacteriz 30.0 1.1E+02 0.0024 29.5 5.4 37 7-43 156-196 (501)
94 PRK09489 rsmC 16S ribosomal RN 29.9 70 0.0015 29.3 4.0 56 6-64 73-129 (342)
95 cd03238 ABC_UvrA The excision 29.8 1.4E+02 0.0029 24.5 5.3 41 7-47 103-148 (176)
96 PF08351 DUF1726: Domain of un 29.5 41 0.00089 24.7 1.9 40 8-48 10-49 (92)
97 PF09862 DUF2089: Protein of u 29.3 92 0.002 23.9 3.9 33 20-68 32-64 (113)
98 TIGR01752 flav_long flavodoxin 29.2 1.2E+02 0.0026 24.4 4.9 63 6-69 41-116 (167)
99 PF04452 Methyltrans_RNA: RNA 28.7 50 0.0011 28.1 2.6 22 11-32 176-197 (225)
100 PRK10976 putative hydrolase; P 28.6 1.2E+02 0.0027 26.0 5.2 45 17-66 15-59 (266)
101 TIGR00046 RNA methyltransferas 28.5 64 0.0014 27.9 3.3 22 11-32 186-207 (240)
102 TIGR00313 cobQ cobyric acid sy 28.3 58 0.0013 31.3 3.3 39 5-44 281-326 (475)
103 TIGR03567 FMN_reduc_SsuE FMN r 28.0 75 0.0016 25.7 3.5 64 7-70 63-134 (171)
104 PRK00377 cbiT cobalt-precorrin 27.6 87 0.0019 25.8 3.9 56 8-67 110-165 (198)
105 PF00919 UPF0004: Uncharacteri 27.6 2.5E+02 0.0055 20.6 6.3 55 6-64 33-91 (98)
106 PRK04017 hypothetical protein; 27.5 1.1E+02 0.0024 24.1 4.2 55 9-69 43-98 (132)
107 cd03130 GATase1_CobB Type 1 gl 27.2 93 0.002 25.9 4.0 39 7-45 38-83 (198)
108 PRK10444 UMP phosphatase; Prov 27.2 1.1E+02 0.0024 26.5 4.6 64 8-75 6-69 (248)
109 TIGR03801 asp_4_decarbox aspar 26.8 1.4E+02 0.003 29.1 5.6 41 8-48 239-284 (521)
110 TIGR00099 Cof-subfamily Cof su 26.7 1.4E+02 0.0031 25.4 5.2 44 18-66 13-56 (256)
111 PRK06411 NADH dehydrogenase su 26.6 86 0.0019 26.2 3.5 39 5-46 70-110 (183)
112 cd06228 Peptidase_M14-like_3 A 26.5 76 0.0016 29.1 3.5 46 14-66 157-202 (332)
113 KOG1970|consensus 26.4 1.1E+02 0.0024 30.3 4.6 41 27-68 91-139 (634)
114 PLN00125 Succinyl-CoA ligase [ 26.1 1.4E+02 0.003 27.0 5.1 56 5-67 63-124 (300)
115 cd01653 GATase1 Type 1 glutami 25.5 1.7E+02 0.0036 19.9 4.7 38 6-43 43-85 (115)
116 PF13847 Methyltransf_31: Meth 25.3 46 0.00099 26.1 1.7 39 8-46 72-112 (152)
117 cd01750 GATase1_CobQ Type 1 gl 25.1 93 0.002 25.8 3.6 36 7-43 35-78 (194)
118 TIGR01689 EcbF-BcbF capsule bi 25.0 1.8E+02 0.0039 22.6 5.0 47 22-68 25-81 (126)
119 cd01017 AdcA Metal binding pro 24.7 1.5E+02 0.0032 26.1 5.0 48 17-67 199-246 (282)
120 cd03110 Fer4_NifH_child This p 24.3 2.4E+02 0.0052 22.5 5.9 11 8-18 91-101 (179)
121 PF01963 TraB: TraB family; I 24.2 1.2E+02 0.0026 25.8 4.3 42 26-68 216-257 (259)
122 TIGR00080 pimt protein-L-isoas 24.0 1.2E+02 0.0025 25.4 4.0 35 8-46 145-179 (215)
123 cd03870 M14_CPA Peptidase M14 24.0 90 0.002 28.1 3.5 31 16-47 160-190 (301)
124 cd01137 PsaA Metal binding pro 23.9 1.5E+02 0.0032 26.3 4.9 48 17-67 205-252 (287)
125 PF09488 Osmo_MPGsynth: Mannos 23.7 1.3E+02 0.0029 27.9 4.5 57 5-66 72-131 (381)
126 TIGR00268 conserved hypothetic 23.7 79 0.0017 27.4 3.0 37 27-66 2-38 (252)
127 cd01748 GATase1_IGP_Synthase T 23.6 83 0.0018 26.0 3.0 37 6-43 33-77 (198)
128 TIGR03877 thermo_KaiC_1 KaiC d 23.4 1.9E+02 0.0041 24.6 5.3 62 2-67 14-75 (237)
129 COG2242 CobL Precorrin-6B meth 23.4 1.2E+02 0.0026 25.4 3.9 53 9-66 102-154 (187)
130 PRK07366 succinyldiaminopimela 23.3 1.7E+02 0.0036 26.7 5.3 30 9-38 165-198 (388)
131 TIGR03687 pupylate_cterm ubiqu 23.2 52 0.0011 19.4 1.2 12 27-38 21-32 (33)
132 PF14317 YcxB: YcxB-like prote 23.2 34 0.00074 22.0 0.5 18 18-35 45-62 (62)
133 TIGR01487 SPP-like sucrose-pho 23.0 2.1E+02 0.0045 23.6 5.4 45 17-66 14-58 (215)
134 PRK05928 hemD uroporphyrinogen 22.9 74 0.0016 26.7 2.7 53 18-74 106-158 (249)
135 CHL00023 ndhK NADH dehydrogena 22.7 1.7E+02 0.0036 25.3 4.6 41 5-46 67-107 (225)
136 PF01965 DJ-1_PfpI: DJ-1/PfpI 22.7 1.1E+02 0.0024 23.8 3.5 41 6-47 34-80 (147)
137 PF01408 GFO_IDH_MocA: Oxidore 22.4 1.3E+02 0.0028 22.1 3.7 54 9-67 62-116 (120)
138 PRK09189 uroporphyrinogen-III 22.4 1.4E+02 0.0031 25.3 4.4 51 22-76 102-153 (240)
139 TIGR02717 AcCoA-syn-alpha acet 22.3 2.8E+02 0.0062 26.3 6.7 64 4-71 59-129 (447)
140 PF11576 DUF3236: Protein of u 22.1 57 0.0012 26.1 1.6 41 23-66 21-61 (154)
141 COG1889 NOP1 Fibrillarin-like 22.0 1.1E+02 0.0024 26.3 3.3 39 8-48 144-184 (231)
142 cd03421 SirA_like_N SirA_like_ 21.8 66 0.0014 21.6 1.7 32 34-68 22-53 (67)
143 cd03128 GAT_1 Type 1 glutamine 21.6 2.3E+02 0.005 18.1 5.4 36 6-41 43-83 (92)
144 PRK09275 aspartate aminotransf 21.5 2E+02 0.0043 28.2 5.5 41 8-48 240-285 (527)
145 cd01076 NAD_bind_1_Glu_DH NAD( 21.5 1.6E+02 0.0034 25.3 4.4 58 8-74 105-162 (227)
146 TIGR02522 pilus_cpaD pilus (Ca 21.4 1.5E+02 0.0033 25.0 4.2 50 20-69 53-107 (198)
147 PRK12359 flavodoxin FldB; Prov 21.1 4.2E+02 0.0091 21.7 6.7 62 6-67 42-115 (172)
148 COG0803 LraI ABC-type metal io 21.0 2.3E+02 0.005 25.3 5.6 51 16-69 219-269 (303)
149 TIGR03880 KaiC_arch_3 KaiC dom 20.9 2.2E+02 0.0048 23.7 5.2 63 2-68 9-71 (224)
150 cd03237 ABC_RNaseL_inhibitor_d 20.8 2E+02 0.0043 24.7 5.0 42 5-46 129-174 (246)
151 PRK05452 anaerobic nitric oxid 20.8 1.4E+02 0.0031 28.7 4.3 60 9-70 304-370 (479)
152 COG1587 HemD Uroporphyrinogen- 20.7 73 0.0016 27.5 2.2 59 14-76 100-158 (248)
153 PF05939 Phage_min_tail: Phage 20.5 1.7E+02 0.0037 22.0 3.9 28 22-49 50-79 (109)
154 COG0436 Aspartate/tyrosine/aro 20.4 1.3E+02 0.0028 28.1 3.8 40 7-47 161-204 (393)
155 TIGR00379 cobB cobyrinic acid 20.3 86 0.0019 29.8 2.7 40 5-45 283-329 (449)
156 PRK14004 hisH imidazole glycer 20.3 1.2E+02 0.0026 25.7 3.3 37 6-43 34-78 (210)
157 TIGR03855 NAD_NadX aspartate d 20.2 2E+02 0.0043 24.8 4.7 55 8-67 36-92 (229)
158 PRK13896 cobyrinic acid a,c-di 20.0 1.6E+02 0.0035 28.0 4.5 41 5-46 271-317 (433)
No 1
>KOG3861|consensus
Probab=100.00 E-value=1.1e-57 Score=394.66 Aligned_cols=256 Identities=43% Similarity=0.761 Sum_probs=241.8
Q ss_pred CChhhcCCccEEEEcCCCCCCCHHHHHHHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHHhCcccCCceeeeccCCccc
Q psy15702 2 ITPELLRAASLFICTGPREQFTITELNSLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQYGVSVNSDCVVRCHFYKYF 81 (259)
Q Consensus 2 lt~~ip~~advLVI~~P~~~fs~~E~~~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~~Gi~~~~~~V~d~~~~~~~ 81 (259)
|+.+.++++.++|+++|++.|+++|+++|++|++-||.++|+++++||.+..+|+|.+|++|||.+|.|.|+|++|++||
T Consensus 45 i~~e~L~~Vk~~i~agP~~~Ft~~Efevlkkyve~GGsl~vllGEGGE~rf~tnvNf~le~YGI~vN~DtVvR~vy~KyF 124 (438)
T KOG3861|consen 45 IRMERLARVKIFILAGPQDRFTEDEFEVLKKYVEVGGSLVVLLGEGGEPRFNTNVNFFLEQYGIYVNGDTVVRPVYYKYF 124 (438)
T ss_pred hhhhhhcceeEEEecCcccccchhHHHHHHHHHhcCCeEEEEecCCCCccccccHHHHHHHhCeEecCCceeehhhhhcc
Confidence 56778999999999999999999999999999999999999999999988899999999999999999999999999999
Q ss_pred CCccccccCccchhhhhhhhcCCCc------cccccCcceEEecccceeeccCCcceEEeeCCcccccCcceEEEEeecC
Q psy15702 82 HPKECFIGNGVLNRGIAEALHRDVT------EDSLASSLHFVYPYGASLNVIKPSVPVFSTGSASCPLNRPVVSFYRDKT 155 (259)
Q Consensus 82 ~P~~~~i~~~i~~~~i~~~~~~~i~------~~~~~~~~~~~~p~~~~l~v~~~~~pll~ts~~s~p~~~pv~A~~~~~~ 155 (259)
||+++++++|++++.+.+.+.+.+. .+.+++.++|+||||++|+|.+|+..+|+||+.+||++||++|++..+.
T Consensus 125 hPKEalV~~GVvnr~i~raa~K~v~~~v~~~~~~n~qal~F~YPyGaTL~V~~panvvLstGsv~fP~nRP~~af~~~kN 204 (438)
T KOG3861|consen 125 HPKEALVGGGVVNRSIWRAALKLVIEKVYYDFSDNKQALHFQYPYGATLNVSEPANVVLSTGSVVFPFNRPLVAFFTNKN 204 (438)
T ss_pred ChHHhhhccceeeHHHHHHHHhhhHHHHHhhcccchheEEEecccCceeeccccceeEeccCceeccCCCcceeeeeccC
Confidence 9999999999999999887765322 1356788999999999999999999999999999999999999998655
Q ss_pred CCcEEEEEecCcccccccccccccHHHHHHHHHhhcCCCCCccccCCCCCCcccccccccHHHHHhhccccccc--cccC
Q psy15702 156 RGGKLVVLGSAALFSDQYLEREENDKMRQIILSFLLTDDINLNQIDADDADVTDYTLVPEIRYTAAQLKSTLME--FQEV 233 (259)
Q Consensus 156 ~~grvvViGs~~~~~d~~~~~~~N~~f~~n~i~wL~~~~~~l~~I~~k~~~~~~~~~v~d~~~~a~~~~~c~~~--~~~~ 233 (259)
++||++|+||+.||.|+|++++.|.+++.-+++||.+++..++.|++.++++.+|..+||+.-+|+++|.|||| ++++
T Consensus 205 ~gGki~vvGS~~mfhD~YldkeeN~kifd~~v~~L~~g~i~ln~iDa~dpEisDy~~~pd~~~ls~~~k~Cl~e~i~~~i 284 (438)
T KOG3861|consen 205 KGGKILVVGSGYMFHDKYLDKEENDKIFDYLVKLLGGGEITLNHIDANDPEISDYKHFPDIGFLSDMPKACLIESIGTDI 284 (438)
T ss_pred cCceEEEeeeeeeechhhccccccchHHHHHHHHhcCCceEeeccccCCCcccccccCcchhhhhhccHHHHhhcccccc
Confidence 77999999999999999999999999999999999998899999999999999999999999999999999999 6899
Q ss_pred CchhHHhhccccccccCCCCCccc
Q psy15702 234 LDDYTRFFSLDLSKVDMSTVPLDQ 257 (259)
Q Consensus 234 ~~d~~~~~~~~~~~~~~~~~~~~~ 257 (259)
|.||+++||.+|+++++.+||+|.
T Consensus 285 p~DF~~~fD~sl~~l~~~~~~~vi 308 (438)
T KOG3861|consen 285 PTDFKQMFDMSLCRLSNRLLKDVI 308 (438)
T ss_pred chHHHHHHHHHHHhhccchhHHHH
Confidence 999999999999999999999875
No 2
>TIGR03521 GldG gliding-associated putative ABC transporter substrate-binding component GldG. Members of this protein family are exclusive to the Bacteroidetes phylum (previously Cytophaga-Flavobacteria-Bacteroides). GldG is a protein linked to a type of rapid surface gliding motility found in certain Bacteroidetes, such as Flavobacterium johnsoniae and Cytophaga hutchinsonii. Knockouts of GldG abolish the gliding phenotype. GldG, along with GldA and GldF are believed to compose an ABC transporter and are observed as an operon. Gliding motility appears closely linked to chitin utilization in the model species Flavobacterium johnsoniae. Bacteroidetes with members of this protein family appear to have all of the genes associated with gliding motility.
Probab=99.95 E-value=2.8e-27 Score=227.93 Aligned_cols=196 Identities=18% Similarity=0.283 Sum_probs=145.4
Q ss_pred hhhcC---CccEEEEcCCCCCCCHHHHHHHHHHHHcCCcEEEEecCCC---C-----------CCCCccHHHHHHHhCcc
Q psy15702 4 PELLR---AASLFICTGPREQFTITELNSLKTYLDEGGSLLIALGEEG---E-----------RGSSTNINFLLEQYGVS 66 (259)
Q Consensus 4 ~~ip~---~advLVI~~P~~~fs~~E~~~L~~fl~~GG~llv~~~~~~---~-----------~~~~~nln~LL~~~Gi~ 66 (259)
.++|+ +||+|||++|+++|++.|+++|++|+++|||+++++||.. + ....+||+.||++|||+
T Consensus 226 ~~ip~~l~d~d~LvI~~P~~~ls~~e~~~Ldqfl~~GG~ll~~~dp~~~~~~~~~~~~g~~~~~~~~~~L~~Ll~~~Gi~ 305 (552)
T TIGR03521 226 AKTLADLKKFDLIVIAKPTEAFSEREKYILDQYIMNGGKALFLVDAVAMEMDSLYNGDGATFALPRDLNLDDLLFKYGIR 305 (552)
T ss_pred ccccccccCcCEEEEeCCCccCCHHHHHHHHHHHHcCCeEEEEecCcccccccccccCCccccCCCCCCHHHHHHHhCeE
Confidence 36776 8999999999999999999999999999999999999953 1 01237999999999999
Q ss_pred cCCceeeeccCCc-------------ccCCccccccCccchhhhhhhhcCCCccccccCcceEEecccceeeccCC---c
Q psy15702 67 VNSDCVVRCHFYK-------------YFHPKECFIGNGVLNRGIAEALHRDVTEDSLASSLHFVYPYGASLNVIKP---S 130 (259)
Q Consensus 67 ~~~~~V~d~~~~~-------------~~~P~~~~i~~~i~~~~i~~~~~~~i~~~~~~~~~~~~~p~~~~l~v~~~---~ 130 (259)
++++.|+|..... |.++...++ +.+..+..|++|.... .+++|++++|...+. .
T Consensus 306 ~~~~~V~D~~~~~~~~~~g~~g~~~~~~~~~~~~~------p~~~~~~~~piT~~l~----~v~~~~a~~i~~~~~~~~~ 375 (552)
T TIGR03521 306 INPDLVEDLNSAPIVLATGNQGNNTQYQPLPWPYY------PQVYSFNKHPITKNLD----AVKFEFASTIDTLKNGIKK 375 (552)
T ss_pred eCcCeEecCcCCceeeecCCcCCcceecccCcccc------cccccCCCCccccCcc----cEEEeccceeEecCCCeEE
Confidence 9999999855431 111111100 1122234566663332 347788888875432 7
Q ss_pred ceEEeeCCccccc-----------------------CcceEEEEee--------------------cCCCcEEEEEecCc
Q psy15702 131 VPVFSTGSASCPL-----------------------NRPVVSFYRD--------------------KTRGGKLVVLGSAA 167 (259)
Q Consensus 131 ~pll~ts~~s~p~-----------------------~~pv~A~~~~--------------------~~~~grvvViGs~~ 167 (259)
+|||+||+.||-. ..++++..+| +.+.+||||+||++
T Consensus 376 tpLL~TS~~s~~~~~~~~i~~~~~~~~~dp~~~~~g~~~la~~l~g~~~s~f~~~~~~~~~~~~~~~~~~~rvvvvgd~d 455 (552)
T TIGR03521 376 TPLLQTSPYSKIEGTPAQISLSEVTEEPDPESYNLGNLPLAVLLEGSFTSAYKNRILPFEIPFKRDQGKPTKMIVVADGD 455 (552)
T ss_pred EEeEEeChhhhccCCCccccHHHhhcCCChhHcCCCCeEEEEEEEeccccccCCCCCccccccccccCCCceEEEEechH
Confidence 9999999999921 1234444332 12568999999999
Q ss_pred ccccccccc---------------cccHHHHHHHHHhhcCCCCCccccCCCCCCcccc
Q psy15702 168 LFSDQYLER---------------EENDKMRQIILSFLLTDDINLNQIDADDADVTDY 210 (259)
Q Consensus 168 ~~~d~~~~~---------------~~N~~f~~n~i~wL~~~~~~l~~I~~k~~~~~~~ 210 (259)
|++|+++.. .+|.+|++|+++||+++ +++++||+|..+.+++
T Consensus 456 ~l~d~~~~~~~~~~~g~~~~~~~~~~N~df~lN~vd~L~~~-~~li~IR~k~~~~r~l 512 (552)
T TIGR03521 456 VIRNQLDNNGKPLELGYDRFTGNLYGNKEFLLNAVNYLLDD-TGLINIRSKEITLPPL 512 (552)
T ss_pred HhhhhHhhcCCccccccchhccccCccHHHHHHHHHHhcCC-chhhhccccCcccCCC
Confidence 999998832 37999999999999998 8899999999996544
No 3
>PF09822 ABC_transp_aux: ABC-type uncharacterized transport system; InterPro: IPR019196 This domain is found in various eukaryotic and prokaryotic intra-flagellar transport proteins involved in gliding motility, as well as in several hypothetical proteins.
Probab=99.58 E-value=3.5e-15 Score=132.02 Aligned_cols=72 Identities=35% Similarity=0.580 Sum_probs=65.5
Q ss_pred ChhhcCCccEEEEcCCCCCCCHHHHHHHHHHHHcCCcEEEEecCCCCC---------CCCccHHHHHHHhCcccCCceee
Q psy15702 3 TPELLRAASLFICTGPREQFTITELNSLKTYLDEGGSLLIALGEEGER---------GSSTNINFLLEQYGVSVNSDCVV 73 (259)
Q Consensus 3 t~~ip~~advLVI~~P~~~fs~~E~~~L~~fl~~GG~llv~~~~~~~~---------~~~~nln~LL~~~Gi~~~~~~V~ 73 (259)
+.+||++||+|||++|+++|++.|+++|++||++|||+|+++++.... ...++|+.||++|||+++++.|+
T Consensus 191 ~~~IP~~~d~Lvi~~P~~~ls~~e~~~l~~yl~~GG~ll~~~d~~~~~~~~~~~~~~~~~~~L~~lL~~~Gi~~~~~~V~ 270 (271)
T PF09822_consen 191 NEEIPDDADVLVIAGPKTDLSEEELYALDQYLMNGGKLLILLDPFSVELQGLWAGGAQRDSNLNDLLEEYGIRINPGLVV 270 (271)
T ss_pred ccccCCCCCEEEEECCCCCCCHHHHHHHHHHHHcCCeEEEEECCcccccccccccccccccCHHHHHHHcCCEeCCCEec
Confidence 468999999999999999999999999999999999999999997421 12679999999999999999998
Q ss_pred e
Q psy15702 74 R 74 (259)
Q Consensus 74 d 74 (259)
|
T Consensus 271 D 271 (271)
T PF09822_consen 271 D 271 (271)
T ss_pred C
Confidence 6
No 4
>COG3225 GldG ABC-type uncharacterized transport system involved in gliding motility, auxiliary component [Cell motility and secretion]
Probab=99.58 E-value=1.1e-14 Score=134.75 Aligned_cols=198 Identities=18% Similarity=0.228 Sum_probs=133.9
Q ss_pred CChhhcCCccEEEEcCCCCCCCHHHHHHHHHHHHcCCcEEEEecCCCC---------CCCCccHHHHHHHhCcccCCcee
Q psy15702 2 ITPELLRAASLFICTGPREQFTITELNSLKTYLDEGGSLLIALGEEGE---------RGSSTNINFLLEQYGVSVNSDCV 72 (259)
Q Consensus 2 lt~~ip~~advLVI~~P~~~fs~~E~~~L~~fl~~GG~llv~~~~~~~---------~~~~~nln~LL~~~Gi~~~~~~V 72 (259)
++.+||.-.+++++.+|.. +.+...++|++|+-+||+++.++++-.. .+..++++.||+.|||+++++.|
T Consensus 222 ~st~ipa~~~~l~~v~p~~-l~E~~~~~iD~fVl~GGkllafV~~ls~~~~~~~~v~pk~sdll~~Lle~wGv~l~p~li 300 (538)
T COG3225 222 ISTRIPALVNVLLIVGPLN-LDEQAAYDIDAFVLAGGKLLAFVSTLSYYLNALYMVGPKSSDLLPDLLESWGVRLNPRLI 300 (538)
T ss_pred ccccccccccceeEecccc-cchhhhhhhHHHhhcCCeeEEEecchhhccccccccCccccccHHHHHHHhCeecCCCee
Confidence 4567899999999999995 9999999999999999999999988432 34577899999999999999999
Q ss_pred eeccCC-cc------------cCCccccccCccchhhhhhhhcCCCccccccCcceEEecccceeeccCC----cceEEe
Q psy15702 73 VRCHFY-KY------------FHPKECFIGNGVLNRGIAEALHRDVTEDSLASSLHFVYPYGASLNVIKP----SVPVFS 135 (259)
Q Consensus 73 ~d~~~~-~~------------~~P~~~~i~~~i~~~~i~~~~~~~i~~~~~~~~~~~~~p~~~~l~v~~~----~~pll~ 135 (259)
+|.+-. .. .||....+.-..+.+.+. +. ..+.--..-+++.+.+..... ..|+..
T Consensus 301 ld~~~~a~~L~p~~~~~~~r~~~P~w~~i~~~pl~~~vd------~~--~~agiesltL~~~s~le~~~~a~~~flpln~ 372 (538)
T COG3225 301 LDDSGFAMSLGPNAPTRYDRGNHPIWLLIPKSPLVRPVD------MP--LQAGIESLTLAWPSSLELSKGASVEFLPLNP 372 (538)
T ss_pred eecccceeecCCCCcccccccCCCcceecccccccCCCC------cc--cccCceeeeccCchhhcccccceecccCCCh
Confidence 975422 11 234322222111111111 00 000000112334444444322 577888
Q ss_pred eCCcccccCcceEE------------------------------EEee--c-------CCCcEEEEEecCcccccccccc
Q psy15702 136 TGSASCPLNRPVVS------------------------------FYRD--K-------TRGGKLVVLGSAALFSDQYLER 176 (259)
Q Consensus 136 ts~~s~p~~~pv~A------------------------------~~~~--~-------~~~grvvViGs~~~~~d~~~~~ 176 (259)
+++.+||+..+++. +... + ..+..+++++|++++++..+..
T Consensus 373 ss~~~~Pf~~~~~~~~~~~l~~~~~Ps~~~~~~~ak~~gPApsaf~a~p~~~~~g~~~~~ns~~~lv~ds~~~s~~~~~q 452 (538)
T COG3225 373 SSDLSGPFDLPVALTRTISLNANLVPSGKPEELAAKIPGPAPSAFIASPENDQEGKSDRENSPAILVADSDLLSAYMANQ 452 (538)
T ss_pred hhhccCccchhhhhcccccccCcCCCCCCcccccccCCCCCCCccccCCCCCCCCcccccCCceeecCcccccchhhhhc
Confidence 88888887433211 1000 1 1244599999999999998765
Q ss_pred -----cccHHHHHHHHHhhcCCCCCccccCCCCCCccc
Q psy15702 177 -----EENDKMRQIILSFLLTDDINLNQIDADDADVTD 209 (259)
Q Consensus 177 -----~~N~~f~~n~i~wL~~~~~~l~~I~~k~~~~~~ 209 (259)
.+|.+|+.|+++||.+. +.++.+++|-...++
T Consensus 453 ~~~v~~~N~~fv~N~~d~l~g~-D~fln~~Sr~~~~~p 489 (538)
T COG3225 453 SQQVVAGNFEFVTNIFDYLSGG-DAFLNSKSRLEVRRP 489 (538)
T ss_pred CCceeeccHHHHHHHHHHHhCC-cceEeeecccccccc
Confidence 56999999999999998 889999998665444
No 5
>PF03345 DDOST_48kD: Oligosaccharyltransferase 48 kDa subunit beta; InterPro: IPR005013 During N-linked glycosylation of proteins, oligosaccharide chains are assembled on the carrier molecule dolichyl pyrophosphate in the following order: 2 molecules of N-acetylglucosamine (GlcNAc), 9 molecules of mannose, and 3 molecules of glucose. These 14-residue oligosaccharide cores are then transferred to asparagine residues on nascent polypeptide chains in the endoplasmic reticulum (ER). As proteins progress through the Golgi apparatus, the oligosaccharide cores are modified by trimming and extension to generate a diverse array of glycosylated proteins [, ]. The oligosaccharyl transferase complex (OST complex) 2.4.1.119 from EC transfers 14-sugar branched oligosaccharides from dolichyl pyrophosphate to asparagine residues []. The complex contains nine protein subunits: Ost1p, Ost2p, Ost3p, Ost4p, Ost5p, Ost6p, Stt3p, Swp1p, and Wbp1p, all of which are integral membrane proteins of the ER. The OST complex interacts with the Sec61p pore complex [] involved in protein import into the ER. This entry represents subunits OST3 and OST6. OST3 is homologous to OST6 [], and several lines of evidence indicate that they are alternative members of the OST complex. Disruption of both OST3 and OST6 causes severe underglycosylation of soluble and membrane-bound glycoproteins and a defect in the assembly of the complex. Hence, the function of these genes seems to be essential for recruiting a fully active complex necessary for efficient N-glycosylation []. This entry also includes the magnesium transporter protein 1, also known as OST3 homologue B, which might be involved in N-glycosylation through its association with the oligosaccharyl transferase (OST) complex. Wbp1p is the beta subunit of the OST complex, one of the original six subunits purified []. Wbp1 is essential [, ], but conditional mutants have decreased transferase activity [, ]. Wbp1p is homologous to mammalian OST48 [].; GO: 0004579 dolichyl-diphosphooligosaccharide-protein glycotransferase activity, 0018279 protein N-linked glycosylation via asparagine, 0005789 endoplasmic reticulum membrane
Probab=99.45 E-value=7.1e-13 Score=123.14 Aligned_cols=174 Identities=17% Similarity=0.342 Sum_probs=118.2
Q ss_pred CccEEEEcCCCC-CCCH-HHHHHHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHHhCcccCCc--eeeeccCCcccCCc
Q psy15702 9 AASLFICTGPRE-QFTI-TELNSLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQYGVSVNSD--CVVRCHFYKYFHPK 84 (259)
Q Consensus 9 ~advLVI~~P~~-~fs~-~E~~~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~~Gi~~~~~--~V~d~~~~~~~~P~ 84 (259)
-.|.|||..|+. .|.. -..+.|.+|+++||++|+++++.+ ..+.+..|+.++||.+.+. .|+|+..+.-....
T Consensus 47 ~YD~LIif~~~~k~~g~~ls~~~ll~Fvd~GgNilv~~s~~~---~~~~ir~~~~E~gi~~~~~~~~viDHf~~~~~~~~ 123 (423)
T PF03345_consen 47 LYDHLIIFPPSVKEFGGSLSPKTLLDFVDNGGNILVAGSSDA---IPDSIREFANELGIEFDPKGSKVIDHFNYDSSSDS 123 (423)
T ss_pred hcceEEEeCCcccccCCCCCHHHHHHHHhCCCcEEEEeCCCc---CcHHHHHHHHHCCeEECCCCCEEEcCCCCcccccc
Confidence 378999999974 3332 358999999999999999999983 3578999999999999865 67787654211010
Q ss_pred c--c-cc--cCccchhhhhhhhcCCCccccccCcceEEecccceeeccC-C-cceEEeeCCcccccCc---------c--
Q psy15702 85 E--C-FI--GNGVLNRGIAEALHRDVTEDSLASSLHFVYPYGASLNVIK-P-SVPVFSTGSASCPLNR---------P-- 146 (259)
Q Consensus 85 ~--~-~i--~~~i~~~~i~~~~~~~i~~~~~~~~~~~~~p~~~~l~v~~-~-~~pll~ts~~s~p~~~---------p-- 146 (259)
. . ++ .+..+.+.. +. ......++| .|.++.+.. | ..|+|+.+.+||..+. |
T Consensus 124 ~~h~~i~~~~~~~i~~~~-------~~---~~~~~pilf-~G~g~~l~~n~l~~pIL~a~~Tays~~~~~~~~~~~~~~~ 192 (423)
T PF03345_consen 124 EDHTVIVLSSNNLIKSSV-------IV---GFKTKPILF-RGVGHLLDNNPLVFPILRAPSTAYSYDPKEPIEEYDDPWA 192 (423)
T ss_pred cCcceEEecCCccccCcc-------cc---cCCCCcEEE-EeEEEEecCCCceeeeeccCCceeccccccccccccchhh
Confidence 0 0 00 111111000 00 011123344 777776653 3 8999999999993211 1
Q ss_pred ------eEEEEeecCCCcEEEEEecCcccccccccc----------cccHHHHHHHHHhhcCCCCCcc
Q psy15702 147 ------VVSFYRDKTRGGKLVVLGSAALFSDQYLER----------EENDKMRQIILSFLLTDDINLN 198 (259)
Q Consensus 147 ------v~A~~~~~~~~grvvViGs~~~~~d~~~~~----------~~N~~f~~n~i~wL~~~~~~l~ 198 (259)
+++..|. ++++|++++||.+||+|.++.. .+|++|+.+++.|-++. ...+
T Consensus 193 ~G~q~~LVsa~Qa-rNNARv~~~GS~d~fsd~~f~~~v~~~~~~~~~~N~~f~~~l~~WtF~e-~gvL 258 (423)
T PF03345_consen 193 AGSQISLVSAFQA-RNNARVVFSGSLDMFSDEFFDSKVQKAGKKSKSGNREFAKELSKWTFQE-KGVL 258 (423)
T ss_pred ccccceEEEEEec-ccCcEEEEEecHHHhccHHHhhhhhhcccccchhHHHHHHHHHHhHHhh-cCeE
Confidence 2333342 6899999999999999998874 68999999999999986 5443
No 6
>KOG2754|consensus
Probab=99.33 E-value=9.2e-12 Score=111.36 Aligned_cols=169 Identities=20% Similarity=0.354 Sum_probs=114.4
Q ss_pred ccEEEEcCCCCC-CCH-HHHHHHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHHhCcccCCc--eeeeccCCcccCCcc
Q psy15702 10 ASLFICTGPREQ-FTI-TELNSLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQYGVSVNSD--CVVRCHFYKYFHPKE 85 (259)
Q Consensus 10 advLVI~~P~~~-fs~-~E~~~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~~Gi~~~~~--~V~d~~~~~~~~P~~ 85 (259)
.|-|||.+|..+ |-. -..+.|-+|+++||++|++.++. ....+..++.++||.++++ .|+|++.+.+..+..
T Consensus 73 yDnlIifap~~e~fgg~is~k~l~~Fvd~gGnvlv~ass~----~~d~iRe~~~E~g~~~~e~~~~ViDH~~~d~s~~sg 148 (443)
T KOG2754|consen 73 YDNLIIFAPSVENFGGSISVKSLAKFVDSGGNVLVAASSA----IGDAIREFASECGIEFDEEGAAVIDHHNYDVSSDSG 148 (443)
T ss_pred hccEEEecCchhhcCCCCCHHHHHHHHhCCCcEEEEcCCc----ccHHHHHHHHHhCcccCcccceeeeeeeccccCCCC
Confidence 467888888532 222 24678899999999999999985 3578999999999999975 577988776554432
Q ss_pred c--cc-cCccchhh-hhhhhcCCCccccccCcceEEecccceeecc--CC-cceEEeeCCcccccC-------cce----
Q psy15702 86 C--FI-GNGVLNRG-IAEALHRDVTEDSLASSLHFVYPYGASLNVI--KP-SVPVFSTGSASCPLN-------RPV---- 147 (259)
Q Consensus 86 ~--~i-~~~i~~~~-i~~~~~~~i~~~~~~~~~~~~~p~~~~l~v~--~~-~~pll~ts~~s~p~~-------~pv---- 147 (259)
. ++ ++..+... |- ...+..-. ++..|.++... +| +.|+|+.+++||..+ .|.
T Consensus 149 dhtli~~~nl~~~~~Iv---------g~~~~~~p-iLfrgig~~l~~~n~lvl~IL~a~~TsYs~np~a~~s~np~a~Gs 218 (443)
T KOG2754|consen 149 DHTLIVADNLLKAPYIV---------GKSKRAAP-ILFRGIGHVLGPTNPLVLPILRASPTSYSYNPEAKVSCNPWAAGS 218 (443)
T ss_pred CeEEEeecccccCceee---------ccccccCC-eeeecchhhcCCCCcceeeeecCCCcceecCcccccccCcccccc
Confidence 1 11 11111110 10 01111122 44455554443 33 789999999999544 121
Q ss_pred ----EEEEeecCCCcEEEEEecCcccccccccc-------------cccHHHHHHHHHhhcCC
Q psy15702 148 ----VSFYRDKTRGGKLVVLGSAALFSDQYLER-------------EENDKMRQIILSFLLTD 193 (259)
Q Consensus 148 ----~A~~~~~~~~grvvViGs~~~~~d~~~~~-------------~~N~~f~~n~i~wL~~~ 193 (259)
.+..| .++++|||+.||.+||+|+++.. .+|.+|+..+.+|.++.
T Consensus 219 ~~~LV~~lQ-arNNARvv~sGS~d~fsd~~~~~~~~~~~~~~~~~~sgN~~fA~~lskWvF~E 280 (443)
T KOG2754|consen 219 QTLLVSGLQ-ARNNARVVFSGSSDFFSDEFFSSQVQSASSSVQYEQSGNQEFATELSKWVFKE 280 (443)
T ss_pred ceEEEEeee-ccCCceEEEeccHHhhhcccccccccccCCcchhhccccHHHHHHHHHhhhcc
Confidence 22233 36899999999999999998864 46999999999999986
No 7
>PF06283 ThuA: Trehalose utilisation; PDB: 4E5V_A 1T0B_A.
Probab=97.22 E-value=0.0046 Score=52.76 Aligned_cols=163 Identities=15% Similarity=0.145 Sum_probs=80.6
Q ss_pred CChhhcCCccEEEEcCCCC-CCCHHHHHHHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHHhCcccCCceeeeccCCcc
Q psy15702 2 ITPELLRAASLFICTGPRE-QFTITELNSLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQYGVSVNSDCVVRCHFYKY 80 (259)
Q Consensus 2 lt~~ip~~advLVI~~P~~-~fs~~E~~~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~~Gi~~~~~~V~d~~~~~~ 80 (259)
++.+.++++|+||+..-.. .++++..++|++|+++||.++.+=..... ...+...+.+=.|-.+...
T Consensus 45 ~~~~~L~~~Dvvv~~~~~~~~l~~~~~~al~~~v~~Ggglv~lH~~~~~--~~~~~~~~~~l~Gg~f~~h---------- 112 (217)
T PF06283_consen 45 LTPENLKGYDVVVFYNTGGDELTDEQRAALRDYVENGGGLVGLHGAATD--SFPDWPEYNELLGGYFKGH---------- 112 (217)
T ss_dssp TSHHCHCT-SEEEEE-SSCCGS-HHHHHHHHHHHHTT-EEEEEGGGGGC--CHTT-HHHHHHHS--SEEE----------
T ss_pred CChhHhcCCCEEEEECCCCCcCCHHHHHHHHHHHHcCCCEEEEcccccc--cchhHHHHHHeeCccccCC----------
Confidence 4566789999999988875 59999999999999999999888412211 1234566666666444221
Q ss_pred cCCccccccCccchhhhhhhhcCCCccccccCcceE---EecccceeeccCCcceEEeeCCcccc------cCcceEEEE
Q psy15702 81 FHPKECFIGNGVLNRGIAEALHRDVTEDSLASSLHF---VYPYGASLNVIKPSVPVFSTGSASCP------LNRPVVSFY 151 (259)
Q Consensus 81 ~~P~~~~i~~~i~~~~i~~~~~~~i~~~~~~~~~~~---~~p~~~~l~v~~~~~pll~ts~~s~p------~~~pv~A~~ 151 (259)
|... ...+. .....||++.... ..... .|-.-. ...+..++|++....++. ...|++=..
T Consensus 113 --~~~~---~~~v~---~~~~~HPi~~gl~-~~f~~~DE~Y~~~~--~~~~~~~vL~~~~~~~~~~~~~~~~~~Pv~W~~ 181 (217)
T PF06283_consen 113 --PPPQ---PFTVR---VEDPDHPITRGLP-ESFTIYDEWYYFLR--DPRPNVTVLLTADESSYDPEGGEGGDHPVAWTR 181 (217)
T ss_dssp --ECEE---EEEEE---ESSTTSCCCTTS--SEEEEEEEEEES-B--S---CEEEEEEEE--GGG--TTTSSEEEEEEEE
T ss_pred --CCCc---eEEEE---EcCCCChhhcCCC-CCceEccccccccc--CCCCCEEEEEEEEeccccccccCCCeEEEEEEE
Confidence 1000 00000 0011233331110 00000 111100 111125677777754331 233554433
Q ss_pred eecCCCcEEEEEecCcccccccccccccHHHHHHHHHhhc
Q psy15702 152 RDKTRGGKLVVLGSAALFSDQYLEREENDKMRQIILSFLL 191 (259)
Q Consensus 152 ~~~~~~grvvViGs~~~~~d~~~~~~~N~~f~~n~i~wL~ 191 (259)
+.++|||+...=++--. -+..++=++++.|.+.|.+
T Consensus 182 --~~GkGRvf~~~lGH~~~--~~~~~~~~~ll~ngi~WaA 217 (217)
T PF06283_consen 182 --EYGKGRVFYTTLGHDEE--TWEDPDFRRLLRNGIRWAA 217 (217)
T ss_dssp --ECTTEEEEEE----TTS--HHHBHHHHHHHHHHHHHCH
T ss_pred --EeCCeeEEEECCCCChh--hcCCHHHHHHHHHHHHhhC
Confidence 35889999998777322 2345677889999999974
No 8
>KOG4266|consensus
Probab=97.12 E-value=0.004 Score=60.13 Aligned_cols=63 Identities=13% Similarity=0.221 Sum_probs=50.1
Q ss_pred CCccEEEEcCCCCCCCHHHHHHHHHHHHc-CCcEEEEecCCCC----------------------CCCCccHHHHHHHhC
Q psy15702 8 RAASLFICTGPREQFTITELNSLKTYLDE-GGSLLIALGEEGE----------------------RGSSTNINFLLEQYG 64 (259)
Q Consensus 8 ~~advLVI~~P~~~fs~~E~~~L~~fl~~-GG~llv~~~~~~~----------------------~~~~~nln~LL~~~G 64 (259)
++.-.|++..|.++|-++|+++|++=+.+ |=++.++.|-..- ....+.||.||+.||
T Consensus 678 sqYGtLLmVD~E~~yfpEEI~kLr~dV~n~GL~lVvF~dWYNt~vM~K~kFfDeNTRqWWtPdTGGANIPALN~LL~~fg 757 (1033)
T KOG4266|consen 678 SQYGTLLMVDLEDDYFPEEIEKLRDDVINTGLGLVVFADWYNTDVMVKMKFFDENTRQWWTPDTGGANIPALNNLLASFG 757 (1033)
T ss_pred hHCceEEEEccccccCHHHHHHHHHHHHhcCceEEEEeccccceeeeeeeeeccCcceeccCCCCCcCcHhHHHHHhhhc
Confidence 35667889999999999999999999876 7788887765321 124578999999999
Q ss_pred cccCCc
Q psy15702 65 VSVNSD 70 (259)
Q Consensus 65 i~~~~~ 70 (259)
|.+.+.
T Consensus 758 iaFgD~ 763 (1033)
T KOG4266|consen 758 IAFGDK 763 (1033)
T ss_pred cccccc
Confidence 999754
No 9
>PF14258 DUF4350: Domain of unknown function (DUF4350)
Probab=97.04 E-value=0.0013 Score=45.91 Aligned_cols=37 Identities=19% Similarity=0.424 Sum_probs=32.2
Q ss_pred cCCccEEEEcCCCCCCCH-HHHHHHHHHHHcCCcEEEE
Q psy15702 7 LRAASLFICTGPREQFTI-TELNSLKTYLDEGGSLLIA 43 (259)
Q Consensus 7 p~~advLVI~~P~~~fs~-~E~~~L~~fl~~GG~llv~ 43 (259)
+..-..+|+++|...+++ .++++|.+|+++||+++++
T Consensus 32 ~~~~~tll~i~~~~~~~~~~~~~~l~~~v~~G~~lvl~ 69 (70)
T PF14258_consen 32 EADDGTLLVIGPDLRLSEPEEAEALLEWVEAGNTLVLA 69 (70)
T ss_pred CCCCCEEEEEeCCCCCCchHHHHHHHHHHHcCCEEEEe
Confidence 345567889999988996 9999999999999999986
No 10
>COG5426 Uncharacterized membrane protein [Function unknown]
Probab=95.74 E-value=0.11 Score=43.65 Aligned_cols=40 Identities=18% Similarity=0.241 Sum_probs=30.6
Q ss_pred cCCCcEEEEEec---CcccccccccccccHHHHHHHHHhhcCC
Q psy15702 154 KTRGGKLVVLGS---AALFSDQYLEREENDKMRQIILSFLLTD 193 (259)
Q Consensus 154 ~~~~grvvViGs---~~~~~d~~~~~~~N~~f~~n~i~wL~~~ 193 (259)
..+.||.+...| .++++-+++.=..=.+|+.|++.|++.+
T Consensus 211 ~y~qGrt~~~~SD~~PHW~sp~F~~W~~Y~~lw~n~L~~it~k 253 (254)
T COG5426 211 GYGQGRTAAWTSDCGPHWLSPQFCEWDGYGTLWKNILGWITEK 253 (254)
T ss_pred ccCCcceEEEeccCCcCcCChhhcccccHHHHHHHHHhHhhcC
Confidence 357788887764 4567777766678899999999999864
No 11
>PF09960 DUF2194: Uncharacterized protein conserved in bacteria (DUF2194); InterPro: IPR018695 This family of prokaryotic proteins has no known function; however it may be a membrane protein.
Probab=95.74 E-value=0.06 Score=52.73 Aligned_cols=56 Identities=16% Similarity=0.259 Sum_probs=39.9
Q ss_pred hhcC--CccEEEEcCCCCCCCHHHHHHHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHHhCcccCC
Q psy15702 5 ELLR--AASLFICTGPREQFTITELNSLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQYGVSVNS 69 (259)
Q Consensus 5 ~ip~--~advLVI~~P~~~fs~~E~~~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~~Gi~~~~ 69 (259)
..|+ +.+.+||..+..+.-+. .+|.+|+++||+++++..+.. .+.++.+ +||.-..
T Consensus 97 ~~p~l~~Y~~vII~~~~l~~l~~--~~i~~yV~~GG~vif~~~~~~----~~~~~~I---lGI~~~~ 154 (585)
T PF09960_consen 97 SIPSLSDYRGVIILTTDLDPLGN--EAIMNYVENGGTVIFATTPEK----TPWLNFI---LGIRSVG 154 (585)
T ss_pred cCCcccceeEEEEEeccccccCh--HHHHHHHHcCCeEEEEecccc----Ccchhhe---eeeeecc
Confidence 4454 78878887776433223 999999999999999887763 3456666 7777544
No 12
>TIGR02336 1,3-beta-galactosyl-N-acetylhexosamine phosphorylase. Members of this family are found in phylogenetically diverse bacteria, including Clostridium perfringens (in the Firmicutes), Bifidobacterium longum and Propionibacterium acnes (in the Actinobacteria), and Vibrio vulnificus (in the Proteobacteria), most of which occur as mammalian pathogens or commensals. The nominal activity, 1,3-beta-galactosyl-N-acetylhexosamine phosphorylase (EC 2.4.1.211), varies somewhat from instance to instance in relative rates for closely related substrates.
Probab=94.87 E-value=0.51 Score=46.61 Aligned_cols=157 Identities=14% Similarity=0.227 Sum_probs=89.7
Q ss_pred ChhhcCCccEEEEcCCC-CCC-------CHHHHHHHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHHhCcccCCceeee
Q psy15702 3 TPELLRAASLFICTGPR-EQF-------TITELNSLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQYGVSVNSDCVVR 74 (259)
Q Consensus 3 t~~ip~~advLVI~~P~-~~f-------s~~E~~~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~~Gi~~~~~~V~d 74 (259)
...||+|.||+|=+|.. +.| .++.+.+|++|+.+||.++=+.+|.+......-. .|..-+|+..+...-+.
T Consensus 494 ~~gi~~didViIN~G~a~ta~SGG~~W~d~~~~~aLr~fV~~GGglIGVgDpsa~v~~G~~P-qLadvLGV~~E~f~tL~ 572 (719)
T TIGR02336 494 EHGIDSDIDVIINGGDADTAWSGGDVWTNPKLVETVRAWVRGGGGFVGVGEPSAAPQNGRFF-QLADVIGVDKERYQTLS 572 (719)
T ss_pred hcCCCcCCcEEEecCcccccccCccccCCHHHHHHHHHHHHcCCeEEEEECCccCCcCCCCc-chhhhcceeeeeeeccc
Confidence 45799999999977764 333 3466899999999999999777775432222222 56666799887554332
Q ss_pred c-cCCcccCCccccccCccchhhhhhhhcCCCccccccCcceEEecccceeeccCCcceEEeeCCcccccCcceEEEEee
Q psy15702 75 C-HFYKYFHPKECFIGNGVLNRGIAEALHRDVTEDSLASSLHFVYPYGASLNVIKPSVPVFSTGSASCPLNRPVVSFYRD 153 (259)
Q Consensus 75 ~-~~~~~~~P~~~~i~~~i~~~~i~~~~~~~i~~~~~~~~~~~~~p~~~~l~v~~~~~pll~ts~~s~p~~~pv~A~~~~ 153 (259)
. .|..-.+|.. +|..++....++ .+ .+..-+|+.+.. +.||.-+.. .|..+..
T Consensus 573 ~dky~~~~~~~h-fi~~d~~~~~~d--fG---------e~~~~iy~~~~~-------~~il~~~~G-----~PALTvN-- 626 (719)
T TIGR02336 573 VDKYFPPVVPHH-FITADIPVDGID--FG---------EPVLNVYPVNEN-------VTVLRADGG-----QVQLATN-- 626 (719)
T ss_pred cccccccccccc-eeeecccccccc--cC---------CCcCceeecCCc-------eEEEeecCC-----eeEEEee--
Confidence 1 1221112221 111111110111 00 111114444433 335555532 3444433
Q ss_pred cCCCcEEEEEecCcccccccccccccHHHHHHHHHhhcCC
Q psy15702 154 KTRGGKLVVLGSAALFSDQYLEREENDKMRQIILSFLLTD 193 (259)
Q Consensus 154 ~~~~grvvViGs~~~~~d~~~~~~~N~~f~~n~i~wL~~~ 193 (259)
..++||.+-+|.-- ....|..|+..++.|+...
T Consensus 627 ~YGkGRAyYIAgrp-------~s~fnarLL~RiI~~Agvk 659 (719)
T TIGR02336 627 DYGKGRGVYISGLP-------YSAENARLLERVLFWASHN 659 (719)
T ss_pred cCCCeEEEEEeecC-------CHHHHHHHHHHHHHHhcCC
Confidence 36889999998421 3357999999999999876
No 13
>cd03143 A4_beta-galactosidase_middle_domain A4 beta-galactosidase middle domain: a type 1 glutamine amidotransferase (GATase1)-like domain. A4 beta-galactosidase middle domain: a type 1 glutamine amidotransferase (GATase1)-like domain. This group includes proteins similar to beta-galactosidase from Thermus thermophilus. Beta-Galactosidase hydrolyzes the beta-1,4-D-galactosidic linkage of lactose, as well as those of related chromogens, o-nitrophenyl-beta-D-galactopyranoside (ONP-Gal) and 5-bromo-4-chloro-3-indolyl-beta-D-galactoside (X-gal). This A4 beta-galactosidase middle domain lacks the catalytic triad of typical GATase1 domains. The reactive Cys residue found in the sharp turn between a beta strand and an alpha helix termed the nucleophile elbow in typical GATase1 domains is not conserved in this group.
Probab=94.53 E-value=0.061 Score=43.17 Aligned_cols=36 Identities=19% Similarity=0.346 Sum_probs=30.9
Q ss_pred hcCCccEEEEcCCCCCCCHHHHHHHHHHHHcCCcEEE
Q psy15702 6 LLRAASLFICTGPREQFTITELNSLKTYLDEGGSLLI 42 (259)
Q Consensus 6 ip~~advLVI~~P~~~fs~~E~~~L~~fl~~GG~llv 42 (259)
.+++.++||+..... +++++.++|++|+++||++++
T Consensus 50 ~l~~y~~vi~P~~~~-~~~~~~~~l~~~v~~GG~li~ 85 (154)
T cd03143 50 DLSGYKLVVLPDLYL-LSDATAAALRAYVENGGTLVA 85 (154)
T ss_pred CcccCCEEEECchhc-CCHHHHHHHHHHHHCCCEEEE
Confidence 456799999998874 899999999999999996555
No 14
>cd03142 GATase1_ThuA Type 1 glutamine amidotransferase (GATase1)-like domain found in Sinorhizobium meliloti Rm1021 ThuA (SmThuA). Type 1 glutamine amidotransferase (GATase1)-like domain found in Sinorhizobium meliloti Rm1021 ThuA (SmThuA). This group includes proteins similar to SmThuA which plays a role in a major pathway for trehalose catabolism. SmThuA is induced by trehalose but not by related structurally similar disaccharides like sucrose or maltose. Proteins in this group lack the catalytic triad of typical GATase1 domains: a His replaces the reactive Cys found in the sharp turn between a beta strand and an alpha helix termed the nucleophile elbow. S. meliloti Rm1021 thuA mutants are impaired in competitive colonization of Medicago sativa roots but are more competitive than the wild-type Rml021 in infecting alfalfa roots and forming nitrogen-fixing nodules.
Probab=94.49 E-value=0.08 Score=45.40 Aligned_cols=44 Identities=11% Similarity=0.186 Sum_probs=37.0
Q ss_pred CChhhcCCccEEEEc-CCC-CCCCHHHHHHHHHHHHcCCcEEEEec
Q psy15702 2 ITPELLRAASLFICT-GPR-EQFTITELNSLKTYLDEGGSLLIALG 45 (259)
Q Consensus 2 lt~~ip~~advLVI~-~P~-~~fs~~E~~~L~~fl~~GG~llv~~~ 45 (259)
++.+.++++|+||.. ... ..++++..+++.+|+++||.++.+=+
T Consensus 51 ~~~~~L~~~D~lV~~~~~~~~~l~~eq~~~l~~~V~~GgGlv~lHs 96 (215)
T cd03142 51 LTEEVLAETDVLLWWGHIAHDEVKDEIVERVHRRVLDGMGLIVLHS 96 (215)
T ss_pred CCHhHHhcCCEEEEeCCCCcCcCCHHHHHHHHHHHHcCCCEEEECC
Confidence 577789999999983 333 57999999999999999999998843
No 15
>PF08532 Glyco_hydro_42M: Beta-galactosidase trimerisation domain; InterPro: IPR013738 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. This is non catalytic domain B of beta-galactosidase enzymes belong to the glycosyl hydrolase 42 family. This domain is related to glutamine amidotransferase enzymes, but the catalytic residues are replaced by non functional amino acids. This domain is involved in trimerisation. ; GO: 0004565 beta-galactosidase activity, 0005975 carbohydrate metabolic process; PDB: 1KWK_A 1KWG_A.
Probab=94.26 E-value=0.037 Score=46.99 Aligned_cols=40 Identities=18% Similarity=0.317 Sum_probs=23.0
Q ss_pred hcCCccEEEEcCCCCCCCHHHHHHHHHHHHcCCcEEEEecC
Q psy15702 6 LLRAASLFICTGPREQFTITELNSLKTYLDEGGSLLIALGE 46 (259)
Q Consensus 6 ip~~advLVI~~P~~~fs~~E~~~L~~fl~~GG~llv~~~~ 46 (259)
-++++.+||+.+... +++++.++|++|+++||+|++..-.
T Consensus 54 dL~~Ykllv~P~~~~-l~~~~~~~L~~yV~~GG~li~~~~t 93 (207)
T PF08532_consen 54 DLSGYKLLVLPSLYI-LSPEFAERLRAYVENGGTLILTPRT 93 (207)
T ss_dssp --TT-SEEEES--SC---HHH---HHHHHT-SS-EEE-TTT
T ss_pred CcccCcEEEEeeEEE-EChHHHHHHHHHHHCCCEEEEEccc
Confidence 478899999988874 8999999999999999999875444
No 16
>PF13709 DUF4159: Domain of unknown function (DUF4159)
Probab=94.05 E-value=0.16 Score=43.22 Aligned_cols=42 Identities=24% Similarity=0.335 Sum_probs=32.5
Q ss_pred hhhcCCccEEEEcCC-CCCCCHHHHHHHHHHHHcCCcEEEEec
Q psy15702 4 PELLRAASLFICTGP-REQFTITELNSLKTYLDEGGSLLIALG 45 (259)
Q Consensus 4 ~~ip~~advLVI~~P-~~~fs~~E~~~L~~fl~~GG~llv~~~ 45 (259)
.+-+.++.+|.+.+. .-.||++|+++|++||++||-|++=..
T Consensus 48 ~~~L~~yP~ly~~g~~~~~~s~~e~~~Lr~Yl~~GGfl~~D~~ 90 (207)
T PF13709_consen 48 DDELFFYPFLYWPGHGDFPLSDEEIANLRRYLENGGFLLFDDR 90 (207)
T ss_pred chhHHhCCEEEEeCCCCCCCCHHHHHHHHHHHHcCCEEEEECC
Confidence 344556777777666 457999999999999999998777444
No 17
>PF07090 DUF1355: Protein of unknown function (DUF1355); InterPro: IPR010768 This entry is found in several hypothetical bacterial proteins of around 250 residues in length. The function of these proteins is unknown.; PDB: 2GK3_D 3SOZ_C 3RHT_D.
Probab=93.84 E-value=0.047 Score=45.37 Aligned_cols=43 Identities=28% Similarity=0.495 Sum_probs=33.0
Q ss_pred hhcCCccEEEEcC-CCCCC-----CHHHHHHHHHHHHcCCcEEEEecCC
Q psy15702 5 ELLRAASLFICTG-PREQF-----TITELNSLKTYLDEGGSLLIALGEE 47 (259)
Q Consensus 5 ~ip~~advLVI~~-P~~~f-----s~~E~~~L~~fl~~GG~llv~~~~~ 47 (259)
+-+...|++|+.. |...+ ++..++.|++|+++||.++++.++.
T Consensus 63 ~~L~~yD~vIl~dv~~~~ll~~~~~~~~~~~l~~yV~~GGgLlmigG~~ 111 (177)
T PF07090_consen 63 EELNRYDVVILSDVPANSLLKSRRSPNQLELLADYVRDGGGLLMIGGPR 111 (177)
T ss_dssp HHHCT-SEEEEES--HHHHHT----HHHHHHHHHHHHTT-EEEEE-STT
T ss_pred hHHhcCCEEEEeCCCchhcccccCCHHHHHHHHHHHHhCCEEEEEeChh
Confidence 6688999999887 55667 9999999999999999988887765
No 18
>PF07685 GATase_3: CobB/CobQ-like glutamine amidotransferase domain; InterPro: IPR011698 This group of enzymes was suggested to be related to the MinD family of ATPases involved in regulation of cell division in bacteria and archaea []. Further sequence analysis suggests a model for the interaction of CobB and CobQ with their respective substrates []. CobB and CobQ were also found to contain unusual Triad family (class I) glutamine amidotransferase domains with conserved Cys and His residues, but lacking the Glu residue of the catalytic triad []. ; GO: 0003824 catalytic activity, 0009236 cobalamin biosynthetic process
Probab=85.18 E-value=1.1 Score=36.36 Aligned_cols=43 Identities=19% Similarity=0.244 Sum_probs=34.7
Q ss_pred hhhcCCccEEEEcCCCCCCCHHH-------HHHHHHHHHcCCcEEEEecC
Q psy15702 4 PELLRAASLFICTGPREQFTITE-------LNSLKTYLDEGGSLLIALGE 46 (259)
Q Consensus 4 ~~ip~~advLVI~~P~~~fs~~E-------~~~L~~fl~~GG~llv~~~~ 46 (259)
.+.|++||+|+|.|-+....+.+ .++|++|+++|+.++-..+.
T Consensus 2 ~~~~~~~D~i~lpGg~pe~~~~~l~~~~~~~~~I~~~~~~G~pi~aeCGG 51 (158)
T PF07685_consen 2 EELPPDADGIYLPGGYPELFALELSRNRGLKEAIREAAEAGGPIYAECGG 51 (158)
T ss_pred CCCCCCCCEEEECCCcHHHHHHHHHHHhCHHHHHHHHHHcCCcEEEEchH
Confidence 57899999999999976555555 47899999999998776543
No 19
>PF09825 BPL_N: Biotin-protein ligase, N terminal; InterPro: IPR019197 The function of this structural domain is unknown. It is found to the N terminus of the biotin protein ligase catalytic domain []. Biotin protein ligase carries out the post-translational modification of specific proteins by the attachment of biotin. It acts on various carboxylases such as acetyl-CoA-carboxylase, pyruvate carboxylase, propionyl CoA carboxylase, and 3-methylcrotonyl CoA carboxylase.
Probab=81.65 E-value=1.9 Score=40.00 Aligned_cols=41 Identities=12% Similarity=0.267 Sum_probs=32.3
Q ss_pred hcCCccEEEEcCCC-----CCCCHHHHHHHHHHHHcCCcEEEEecC
Q psy15702 6 LLRAASLFICTGPR-----EQFTITELNSLKTYLDEGGSLLIALGE 46 (259)
Q Consensus 6 ip~~advLVI~~P~-----~~fs~~E~~~L~~fl~~GG~llv~~~~ 46 (259)
-.+++++||+.|-. +.+...-..+|++|+++||+-|=+...
T Consensus 46 w~~~~~LlV~PGG~d~~y~~~l~~~g~~~Ir~fV~~GG~YlGiCAG 91 (367)
T PF09825_consen 46 WQSKCALLVMPGGADLPYCRSLNGEGNRRIRQFVENGGGYLGICAG 91 (367)
T ss_pred cccCCcEEEECCCcchHHHHhhChHHHHHHHHHHHcCCcEEEECcc
Confidence 35789999999865 346666799999999999997766544
No 20
>PF09508 Lact_bio_phlase: Lacto-N-biose phosphorylase; InterPro: IPR012711 The gene which codes for this protein in gut-bacteria is located in a novel putative operon for galactose metabolism. The protein appears to be a carbohydrate-processing phosphorolytic enzyme (2.4.1.211 from EC), unlike either glycoside hydrolases or glycoside lyase. Intestinal colonisation by Bifidobacteria is important for human health, especially in paediatrics, because colonisation seems to prevent infection by some pathogenic bacteria that cause diarrhoea or other illnesses. The operon seems to be involved in intestinal colonisation by Bifidobacteria mediated by metabolism of mucin sugars. In addition, it may also resolve the question of the nature of the bifidus factor in human milk as the lacto-N-biose structure found in milk oligosaccharides. ; GO: 0016758 transferase activity, transferring hexosyl groups; PDB: 2ZUW_A 2ZUU_C 2ZUT_D 2ZUV_A 2ZUS_B.
Probab=71.95 E-value=11 Score=37.55 Aligned_cols=161 Identities=12% Similarity=0.188 Sum_probs=75.6
Q ss_pred hhcCCccEEEEcCCC-CCCCH-------HHHHHHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHHhCcccCCceeeecc
Q psy15702 5 ELLRAASLFICTGPR-EQFTI-------TELNSLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQYGVSVNSDCVVRCH 76 (259)
Q Consensus 5 ~ip~~advLVI~~P~-~~fs~-------~E~~~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~~Gi~~~~~~V~d~~ 76 (259)
.||+++||+|=+|.. +.||- .=+.+|++|+-+||.++=.-+|.+-. ....+=.|..-+|+.-+.+.-+.+
T Consensus 493 gi~~didViINaGdA~TA~SGG~~W~d~~iv~~lr~fV~~GGGfIGVGEPsA~~-~~g~~FqLadVLGVDkE~g~tl~~- 570 (716)
T PF09508_consen 493 GILEDIDVIINAGDAGTAWSGGENWKDPKIVTALREFVYNGGGFIGVGEPSAHQ-GQGRFFQLADVLGVDKETGFTLST- 570 (716)
T ss_dssp -S-TT--EEEEEESTTSTTT-GGGGG-HHHHHHHHHHHHTT-EEEEEESTEEEE-ETTEEETTHHHHSEEE--SS-TTB-
T ss_pred CCcccCCEEEecCcccccccCccccCCHHHHHHHHHHHHcCCCEEEcCCCcccc-CCCeEEeehhccCccccccccccc-
Confidence 689999999977764 45654 33899999999999988877774311 112233455556666654433321
Q ss_pred CCcccCCccccccCccchhhhhhhhcCCCccccccCcceEEecccceeeccCCcceEEeeCCcccccCcceEEEEeecCC
Q psy15702 77 FYKYFHPKECFIGNGVLNRGIAEALHRDVTEDSLASSLHFVYPYGASLNVIKPSVPVFSTGSASCPLNRPVVSFYRDKTR 156 (259)
Q Consensus 77 ~~~~~~P~~~~i~~~i~~~~i~~~~~~~i~~~~~~~~~~~~~p~~~~l~v~~~~~pll~ts~~s~p~~~pv~A~~~~~~~ 156 (259)
.+|..-... .|-++ ..+... +. ...+..-+|+-+. .+.+|.-.+.. +..|+.+ .+
T Consensus 571 -dky~~~~~~--~HFI~-~d~~~~----~d---fGe~~~~iy~~~~-------~t~vL~~~~~~-----v~la~n~--yG 625 (716)
T PF09508_consen 571 -DKYNWEVEP--EHFIT-EDIDGE----LD---FGEGKKNIYALSG-------DTEVLAQSDGE-----VQLAVNE--YG 625 (716)
T ss_dssp ---B--------S-TTT-TTS-------S------S--TTEEESST-------TSEEEE-GTTS------SEEEEE--ET
T ss_pred -CcCCCcCCC--Cceee-cCCCcC----cc---cCCCcCceEEcCC-------CeEEeeecCCe-----EEEEecc--cC
Confidence 122211111 12221 111100 00 0000001333322 23444444332 3344433 48
Q ss_pred CcEEEEEecCcccccccccccccHHHHHHHHHhhcCCCCCccc
Q psy15702 157 GGKLVVLGSAALFSDQYLEREENDKMRQIILSFLLTDDINLNQ 199 (259)
Q Consensus 157 ~grvvViGs~~~~~d~~~~~~~N~~f~~n~i~wL~~~~~~l~~ 199 (259)
+||-|-++.-- .+..|..++.+++-|.+++|..+..
T Consensus 626 kGR~VYlaGlp-------yS~~NtRlL~rai~~aa~~E~~~~~ 661 (716)
T PF09508_consen 626 KGRGVYLAGLP-------YSPENTRLLYRAIYWAAHKEDEMKK 661 (716)
T ss_dssp TEEEEEES------------HHHHHHHHHHHHHHTT-HHHCST
T ss_pred CccEEEeCCCC-------CCHHHHHHHHHHHHHHhCChhhhhh
Confidence 89999887532 2468999999999999997554443
No 21
>COG1245 Predicted ATPase, RNase L inhibitor (RLI) homolog [General function prediction only]
Probab=69.14 E-value=9.5 Score=36.63 Aligned_cols=67 Identities=18% Similarity=0.419 Sum_probs=50.7
Q ss_pred hhcCCccEEEEcCCCCCCCHHH----HHHHHHHHHcCCcEEEEecCC---------------CC----------CCCCcc
Q psy15702 5 ELLRAASLFICTGPREQFTITE----LNSLKTYLDEGGSLLIALGEE---------------GE----------RGSSTN 55 (259)
Q Consensus 5 ~ip~~advLVI~~P~~~fs~~E----~~~L~~fl~~GG~llv~~~~~---------------~~----------~~~~~n 55 (259)
....+||+.++=-|..-+.-++ .++|++|+++.++..+.+++. |+ ....++
T Consensus 469 ~L~reADlYllDEPSA~LDvEqR~~vakvIRR~~e~~~kta~vVdHDi~~~dyvsDr~ivF~GePg~~g~a~~P~~mr~G 548 (591)
T COG1245 469 ALSREADLYLLDEPSAYLDVEQRIIVAKVIRRFIENNEKTALVVDHDIYMIDYVSDRLIVFEGEPGKHGHASPPMSMREG 548 (591)
T ss_pred HhccccCEEEecCchhhccHHHHHHHHHHHHHHHhhcCceEEEEecceehhhhhhceEEEEecCCCccCcCCCCccHHHH
Confidence 4568999999999997555544 688999999988876666652 11 123478
Q ss_pred HHHHHHHhCcccCCce
Q psy15702 56 INFLLEQYGVSVNSDC 71 (259)
Q Consensus 56 ln~LL~~~Gi~~~~~~ 71 (259)
.|.||+..||+|..|.
T Consensus 549 MN~FLk~l~vTFRRD~ 564 (591)
T COG1245 549 MNRFLKNLGVTFRRDP 564 (591)
T ss_pred HHHHHHHcCcEEecCc
Confidence 9999999999998654
No 22
>PRK05569 flavodoxin; Provisional
Probab=67.90 E-value=16 Score=28.29 Aligned_cols=65 Identities=15% Similarity=0.117 Sum_probs=38.5
Q ss_pred hcCCccEEEEcCCCCCCC---HHHHHHHHHHHH---cCCcEEEEecCCCCC--CCCccHHHHHHHhCcccCCc
Q psy15702 6 LLRAASLFICTGPREQFT---ITELNSLKTYLD---EGGSLLIALGEEGER--GSSTNINFLLEQYGVSVNSD 70 (259)
Q Consensus 6 ip~~advLVI~~P~~~fs---~~E~~~L~~fl~---~GG~llv~~~~~~~~--~~~~nln~LL~~~Gi~~~~~ 70 (259)
...++|.||++.|..... +.++..+-+.+. ..|+..+++...+.. .....+..+++..|+.+-..
T Consensus 45 ~~~~~d~iilgsPty~~~~~~~~~~~~~~~~l~~~~~~~K~v~~f~t~g~~~~~~~~~~~~~l~~~g~~~~~~ 117 (141)
T PRK05569 45 DVLEADAVAFGSPSMDNNNIEQEEMAPFLDQFKLTPNENKKCILFGSYGWDNGEFMKLWKDRMKDYGFNVIGD 117 (141)
T ss_pred HHhhCCEEEEECCCcCCCcCChHHHHHHHHHhhccCcCCCEEEEEeCCCCCCCcHHHHHHHHHHHCCCeEeee
Confidence 456899999999985332 245555555553 235544444443321 12345778888889877443
No 23
>KOG0257|consensus
Probab=62.87 E-value=13 Score=34.83 Aligned_cols=43 Identities=23% Similarity=0.346 Sum_probs=35.6
Q ss_pred hcCCccEEEEcCCCCC----CCHHHHHHHHHHHHcCCcEEEEecCCCC
Q psy15702 6 LLRAASLFICTGPREQ----FTITELNSLKTYLDEGGSLLIALGEEGE 49 (259)
Q Consensus 6 ip~~advLVI~~P~~~----fs~~E~~~L~~fl~~GG~llv~~~~~~~ 49 (259)
+=+..+++||..|..+ ||.+|++.|-++.++.| ++++.|+..+
T Consensus 169 ~t~kTk~Ii~ntPhNPtGkvfsReeLe~ia~l~~k~~-~lvisDevYe 215 (420)
T KOG0257|consen 169 ITEKTKAIILNTPHNPTGKVFSREELERIAELCKKHG-LLVISDEVYE 215 (420)
T ss_pred ccCCccEEEEeCCCCCcCcccCHHHHHHHHHHHHHCC-EEEEEhhHhH
Confidence 3466899999999765 89999999999999999 7777787543
No 24
>PHA03411 putative methyltransferase; Provisional
Probab=61.45 E-value=12 Score=33.31 Aligned_cols=72 Identities=17% Similarity=0.204 Sum_probs=45.4
Q ss_pred CCccEEEEcCCCCCCCHHHHHH---------------HHHHHHc-------CCcEEEEecCCC---CCCCCccHHHHHHH
Q psy15702 8 RAASLFICTGPREQFTITELNS---------------LKTYLDE-------GGSLLIALGEEG---ERGSSTNINFLLEQ 62 (259)
Q Consensus 8 ~~advLVI~~P~~~fs~~E~~~---------------L~~fl~~-------GG~llv~~~~~~---~~~~~~nln~LL~~ 62 (259)
+..|++|...|-..+...+... +++|+.. +|.+.+..+... ..........+|++
T Consensus 125 ~kFDlIIsNPPF~~l~~~d~~~~~~~~GG~~g~~~l~~~~~l~~v~~~L~p~G~~~~~yss~~~y~~sl~~~~y~~~l~~ 204 (279)
T PHA03411 125 EKFDVVISNPPFGKINTTDTKDVFEYTGGEFEFKVMTLGQKFADVGYFIVPTGSAGFAYSGRPYYDGTMKSNKYLKWSKQ 204 (279)
T ss_pred CCCcEEEEcCCccccCchhhhhhhhhccCccccccccHHHHHhhhHheecCCceEEEEEeccccccccCCHHHHHHHHHh
Confidence 5689888877754443332211 3455544 455555544321 12234567799999
Q ss_pred hCcccCCceeeeccCCc
Q psy15702 63 YGVSVNSDCVVRCHFYK 79 (259)
Q Consensus 63 ~Gi~~~~~~V~d~~~~~ 79 (259)
.|+.+.+++=+|++++.
T Consensus 205 ~g~~~~~~~~~~~~~~~ 221 (279)
T PHA03411 205 TGLVTYAGCGIDTSIYR 221 (279)
T ss_pred cCcEecCCCCcccceeh
Confidence 99999999999998773
No 25
>TIGR00537 hemK_rel_arch HemK-related putative methylase. The gene hemK from E. coli was found to contribute to heme biosynthesis and originally suggested to be protoporphyrinogen oxidase (Medline 95189105). Functional analysis of the nearest homolog in Saccharomyces cerevisiae, YNL063w, finds it is not protoporphyrinogen oxidase and sequence analysis suggests that HemK homologs have S-adenosyl-methionine-dependent methyltransferase activity (Medline 99237242). Homologs are found, usually in a single copy, in nearly all completed genomes, but varying somewhat in apparent domain architecture. This model represents an archaeal and eukaryotic protein family that lacks an N-terminal domain found in HemK and its eubacterial homologs. It is found in a single copy in the first six completed archaeal and eukaryotic genomes.
Probab=60.61 E-value=25 Score=28.53 Aligned_cols=38 Identities=21% Similarity=0.295 Sum_probs=27.5
Q ss_pred HHHHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHHhCccc
Q psy15702 27 LNSLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQYGVSV 67 (259)
Q Consensus 27 ~~~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~~Gi~~ 67 (259)
+..+.+.|+.||++++...... ....+-.+|++.|+..
T Consensus 123 l~~~~~~Lk~gG~~~~~~~~~~---~~~~~~~~l~~~gf~~ 160 (179)
T TIGR00537 123 LDELPEILKEGGRVQLIQSSLN---GEPDTFDKLDERGFRY 160 (179)
T ss_pred HHhHHHhhCCCCEEEEEEeccC---ChHHHHHHHHhCCCeE
Confidence 6667788899999888754432 1356778888888765
No 26
>PF13380 CoA_binding_2: CoA binding domain; PDB: 3FF4_A 2D5A_A 2D59_A 2E6U_X 1IUL_A 1IUK_A 1Y81_A 2DUW_A.
Probab=59.94 E-value=27 Score=26.55 Aligned_cols=58 Identities=12% Similarity=0.080 Sum_probs=37.9
Q ss_pred hcCCccEEEEcCCCCCCCHHHHHHHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHHhCcccC-Cce
Q psy15702 6 LLRAASLFICTGPREQFTITELNSLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQYGVSVN-SDC 71 (259)
Q Consensus 6 ip~~advLVI~~P~~~fs~~E~~~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~~Gi~~~-~~~ 71 (259)
+|+..|+++|.-|. +.-.+.+++-.+.|=+-+++.... ....+..++++.|+++- +++
T Consensus 52 ~p~~iDlavv~~~~----~~~~~~v~~~~~~g~~~v~~~~g~----~~~~~~~~a~~~gi~vigp~C 110 (116)
T PF13380_consen 52 IPEPIDLAVVCVPP----DKVPEIVDEAAALGVKAVWLQPGA----ESEELIEAAREAGIRVIGPNC 110 (116)
T ss_dssp CSST-SEEEE-S-H----HHHHHHHHHHHHHT-SEEEE-TTS------HHHHHHHHHTT-EEEESS-
T ss_pred CCCCCCEEEEEcCH----HHHHHHHHHHHHcCCCEEEEEcch----HHHHHHHHHHHcCCEEEeCCc
Confidence 68999999999884 455677777888887766665553 35789999999999975 443
No 27
>PRK00107 gidB 16S rRNA methyltransferase GidB; Reviewed
Probab=58.85 E-value=20 Score=29.89 Aligned_cols=60 Identities=15% Similarity=0.133 Sum_probs=44.1
Q ss_pred CCccEEEEcCCCCCCCHHHHHHHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHHhCcccCCceee
Q psy15702 8 RAASLFICTGPREQFTITELNSLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQYGVSVNSDCVV 73 (259)
Q Consensus 8 ~~advLVI~~P~~~fs~~E~~~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~~Gi~~~~~~V~ 73 (259)
+..|+++..+ ... -+.-++.+.+.|+.||+++++..+.. ...+..+.+..|..++.-+++
T Consensus 111 ~~fDlV~~~~-~~~-~~~~l~~~~~~LkpGG~lv~~~~~~~----~~~l~~~~~~~~~~~~~~~~~ 170 (187)
T PRK00107 111 EKFDVVTSRA-VAS-LSDLVELCLPLLKPGGRFLALKGRDP----EEEIAELPKALGGKVEEVIEL 170 (187)
T ss_pred CCccEEEEcc-ccC-HHHHHHHHHHhcCCCeEEEEEeCCCh----HHHHHHHHHhcCceEeeeEEE
Confidence 4688877643 111 23457888999999999999876653 467888889999998876655
No 28
>PF04412 DUF521: Protein of unknown function (DUF521); InterPro: IPR007506 This is a group of hypothetical proteins.
Probab=57.91 E-value=15 Score=34.62 Aligned_cols=64 Identities=20% Similarity=0.215 Sum_probs=48.3
Q ss_pred hhcCCccEEEEcCCCCCCCHHHHHHHHHHHHcCC-----cEEEEecCCC-CCCCCccHHHHHHHhCcccCCc
Q psy15702 5 ELLRAASLFICTGPREQFTITELNSLKTYLDEGG-----SLLIALGEEG-ERGSSTNINFLLEQYGVSVNSD 70 (259)
Q Consensus 5 ~ip~~advLVI~~P~~~fs~~E~~~L~~fl~~GG-----~llv~~~~~~-~~~~~~nln~LL~~~Gi~~~~~ 70 (259)
.-.++.|++++..|. +|-+|+..|.+++++.+ .++|+.+..- +.-...++-..|+++|+.+-.|
T Consensus 285 ~~~~~~D~V~lGcPH--~S~~El~~ia~ll~gr~~~~~~~~~i~t~~~v~~~a~~~G~~~~le~~G~~iv~d 354 (400)
T PF04412_consen 285 AGDEKVDLVALGCPH--LSLEELREIAELLEGRKVHPNVPLWITTSRAVYELAERMGYVERLEKAGVQIVTD 354 (400)
T ss_pred CCCCCCCEEEECCCC--CCHHHHHHHHHHHhCCCCCCCceEEEECCHHHHHHHHhCCHHHHHHHcCCEEEcc
Confidence 345789999999995 89999999999998665 3677766532 1112347889999999987655
No 29
>PRK06242 flavodoxin; Provisional
Probab=55.67 E-value=37 Score=26.47 Aligned_cols=65 Identities=12% Similarity=0.033 Sum_probs=37.5
Q ss_pred hcCCccEEEEcCCCC--CCCHHHHHHHHHHHH-cCCcEEEEecCCCCC-CCCccHHHHHHHhCcccCCc
Q psy15702 6 LLRAASLFICTGPRE--QFTITELNSLKTYLD-EGGSLLIALGEEGER-GSSTNINFLLEQYGVSVNSD 70 (259)
Q Consensus 6 ip~~advLVI~~P~~--~fs~~E~~~L~~fl~-~GG~llv~~~~~~~~-~~~~nln~LL~~~Gi~~~~~ 70 (259)
.+.++|.||++.|.. .+.+.-..-|++.-. +|.++.++...+... .....+..+|+..|..+-..
T Consensus 40 ~~~~~d~ii~g~pvy~~~~~~~~~~fl~~~~~~~~k~~~~f~t~g~~~~~~~~~l~~~l~~~g~~~~~~ 108 (150)
T PRK06242 40 DLSEYDLIGFGSGIYFGKFHKSLLKLIEKLPPVSGKKAFIFSTSGLPFLKYHKALKKKLKEKGFEIVGE 108 (150)
T ss_pred cHhHCCEEEEeCchhcCCcCHHHHHHHHhhhhhcCCeEEEEECCCCCcchHHHHHHHHHHHCCCEEEEE
Confidence 357899999999964 233332222332212 455555554433221 12467788899999888544
No 30
>cd03144 GATase1_ScBLP_like Type 1 glutamine amidotransferase (GATase1)-like domain found in proteins similar to Saccharomyces cerevisiae biotin-apoprotein ligase (ScBLP). Type 1 glutamine amidotransferase (GATase1)-like domain found in proteins similar to Saccharomyces cerevisiae biotin-apoprotein ligase (ScBLP). Biotin-apoprotein ligase modifies proteins by covalently attaching biotin. ScBLP is known to biotinylate acety-CoA carboxylase and pyruvate carboxylase. The catalytic triad typical of GATase1 domains is not conserved in this GATase1-like domain. However, the Cys residue found in the sharp turn between a beta strand and an alpha helix termed the nucleophile elbow in a typical GATase1 domain is conserved.
Probab=55.02 E-value=9.8 Score=29.27 Aligned_cols=36 Identities=14% Similarity=0.331 Sum_probs=25.8
Q ss_pred CCccEEEEcCCC-----CCCCHHHHHHHHHHHHcCCcEEEE
Q psy15702 8 RAASLFICTGPR-----EQFTITELNSLKTYLDEGGSLLIA 43 (259)
Q Consensus 8 ~~advLVI~~P~-----~~fs~~E~~~L~~fl~~GG~llv~ 43 (259)
.++|.||+.|-. ..+.+...++|++|+++||.+|=.
T Consensus 43 ~~ad~lVlPGGa~~~~~~~L~~~g~~~i~~~v~~g~p~LGI 83 (114)
T cd03144 43 SKTALLVVPGGADLPYCRALNGKGNRRIRNFVRNGGNYLGI 83 (114)
T ss_pred hCCCEEEECCCChHHHHHHHHhhCcHHHHHHHHCCCcEEEE
Confidence 489999999832 113333478999999999987633
No 31
>cd06542 GH18_EndoS-like Endo-beta-N-acetylglucosaminidases are bacterial chitinases that hydrolyze the chitin core of various asparagine (N)-linked glycans and glycoproteins. The endo-beta-N-acetylglucosaminidases have a glycosyl hydrolase family 18 (GH18) catalytic domain. Some members also have an additional C-terminal glycosyl hydrolase family 20 (GH20) domain while others have an N-terminal domain of unknown function (pfam08522). Members of this family include endo-beta-N-acetylglucosaminidase S (EndoS) from Streptococcus pyogenes, EndoF1, EndoF2, EndoF3, and EndoH from Flavobacterium meningosepticum, and EndoE from Enterococcus faecalis. EndoS is a secreted endoglycosidase from Streptococcus pyogenes that specifically hydrolyzes the glycan on human IgG between two core N-acetylglucosamine residues. EndoE is a secreted endoglycosidase, encoded by the ndoE gene in Enterococcus faecalis, that hydrolyzes the glycan on human RNase B.
Probab=53.58 E-value=56 Score=28.13 Aligned_cols=43 Identities=12% Similarity=0.175 Sum_probs=29.1
Q ss_pred hhcCCccEEEEcCCCCCCC--------HHHHHHHHHHH-HcCCcEEEEecCC
Q psy15702 5 ELLRAASLFICTGPREQFT--------ITELNSLKTYL-DEGGSLLIALGEE 47 (259)
Q Consensus 5 ~ip~~advLVI~~P~~~fs--------~~E~~~L~~fl-~~GG~llv~~~~~ 47 (259)
++|...|++++.+....++ .++.....+.+ ++|-|+++.++..
T Consensus 23 ~~pds~D~v~lf~~~~~~~~~~~~~~~~~~~~~~i~~l~~kG~KVl~sigg~ 74 (255)
T cd06542 23 NLPDSVDMVSLFAANINLDAATAVQFLLTNKETYIRPLQAKGTKVLLSILGN 74 (255)
T ss_pred cCCCcceEEEEcccccCcccccchhhhhHHHHHHHHHHhhCCCEEEEEECCC
Confidence 5899999999965443344 34444444445 5888999888764
No 32
>TIGR01755 flav_wrbA NAD(P)H:quinone oxidoreductase, type IV. This model represents a protein, WrbA, related to and slightly larger than flavodoxin. It was just shown, in E. coli and Archaeoglobus fulgidus (and previously for some eukaryotic homologs) to act as fourth type of NAD(P)H:quinone oxidoreductase. In E. coli, this protein was earlier reported to be produced during stationary phase, bind to the trp repressor, and make trp operon repression more efficient. WrbA does not interact with the trp operator by itself. Members are found in species in which homologs of the E. coli trp operon repressor TrpR are not detected.
Probab=53.27 E-value=15 Score=30.78 Aligned_cols=64 Identities=16% Similarity=0.188 Sum_probs=35.3
Q ss_pred hhhcCCccEEEEcCCCCCCCHHHHHHHHHHHHcC----------CcEE-EEecCCCC-CCC---CccHHHHHHHhCcccC
Q psy15702 4 PELLRAASLFICTGPREQFTITELNSLKTYLDEG----------GSLL-IALGEEGE-RGS---STNINFLLEQYGVSVN 68 (259)
Q Consensus 4 ~~ip~~advLVI~~P~~~fs~~E~~~L~~fl~~G----------G~ll-v~~~~~~~-~~~---~~nln~LL~~~Gi~~~ 68 (259)
.+...+||.+|+++|... ..-...++.|+++= ||.. ++...+.. ... ...+..+|..+|+.+-
T Consensus 63 ~~~l~~aD~ii~GSPty~--g~~~~~lk~fld~~~~~~~~~~l~gK~~~~f~s~g~~~Gg~~~~l~~l~~~l~~~Gm~vv 140 (197)
T TIGR01755 63 PQELADYDAIIFGTPTRF--GNMASQMRNFLDQTGGLWASGALVGKVGSVFTSTGTQHGGQESTILSTWTTLLHHGMIIV 140 (197)
T ss_pred HHHHHHCCEEEEEecccc--cCccHHHHHHHHhccccccccccCCCEEEEEEECCCCCCCHHHHHHHHHHHHHHCCCEEe
Confidence 355678999999999842 22222344555432 5544 44333221 111 1345566688998875
Q ss_pred C
Q psy15702 69 S 69 (259)
Q Consensus 69 ~ 69 (259)
+
T Consensus 141 ~ 141 (197)
T TIGR01755 141 P 141 (197)
T ss_pred C
Confidence 4
No 33
>PF04016 DUF364: Domain of unknown function (DUF364); InterPro: IPR007161 This is a entry represents of bacterial and archaeal proteins of unknown function.; PDB: 3L5O_B 3NPG_A.
Probab=53.19 E-value=5.7 Score=31.81 Aligned_cols=62 Identities=21% Similarity=0.231 Sum_probs=45.5
Q ss_pred hhhcCCccEEEEcCCCCCCCHHHHHHHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHHhCcccCCceee
Q psy15702 4 PELLRAASLFICTGPREQFTITELNSLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQYGVSVNSDCVV 73 (259)
Q Consensus 4 ~~ip~~advLVI~~P~~~fs~~E~~~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~~Gi~~~~~~V~ 73 (259)
.+++++||+++|.|-+ +-...++.|.+..++ ++.+++++|.. +-..+.|..+|+..-.+..+
T Consensus 57 ~~~l~~aD~viiTGsT--lvN~Ti~~iL~~~~~-~~~vil~GpS~-----~~~P~~l~~~Gv~~v~g~~v 118 (147)
T PF04016_consen 57 EEILPWADVVIITGST--LVNGTIDDILELARN-AREVILYGPSA-----PLHPEALFDYGVTYVGGSRV 118 (147)
T ss_dssp HHHGGG-SEEEEECHH--CCTTTHHHHHHHTTT-SSEEEEESCCG-----GS-GGGGCCTT-SEEEEEEE
T ss_pred HHHHccCCEEEEEeee--eecCCHHHHHHhCcc-CCeEEEEecCc-----hhhHHHHHhCCCCEEEEEEE
Confidence 5688999999999986 566777888888774 67778889874 34555889999988776655
No 34
>PRK04266 fibrillarin; Provisional
Probab=52.16 E-value=36 Score=29.21 Aligned_cols=63 Identities=14% Similarity=0.189 Sum_probs=36.3
Q ss_pred hcCCccEEEEcCCCCCCCHHHHHHHHHHHHcCCcEEEEec-----CCCCC-CCCccHHHHHHHhCcccC
Q psy15702 6 LLRAASLFICTGPREQFTITELNSLKTYLDEGGSLLIALG-----EEGER-GSSTNINFLLEQYGVSVN 68 (259)
Q Consensus 6 ip~~advLVI~~P~~~fs~~E~~~L~~fl~~GG~llv~~~-----~~~~~-~~~~nln~LL~~~Gi~~~ 68 (259)
+++.+|+++.-.+..+-...-+..+.++|+.||++++.+. ...+. ........++++.|+++.
T Consensus 138 l~~~~D~i~~d~~~p~~~~~~L~~~~r~LKpGG~lvI~v~~~~~d~~~~~~~~~~~~~~~l~~aGF~~i 206 (226)
T PRK04266 138 VVEKVDVIYQDVAQPNQAEIAIDNAEFFLKDGGYLLLAIKARSIDVTKDPKEIFKEEIRKLEEGGFEIL 206 (226)
T ss_pred ccccCCEEEECCCChhHHHHHHHHHHHhcCCCcEEEEEEecccccCcCCHHHHHHHHHHHHHHcCCeEE
Confidence 3566888875433210001125788889999999999633 32210 011224577888887754
No 35
>PF11303 DUF3105: Protein of unknown function (DUF3105); InterPro: IPR021454 Some members in this family of proteins are annotated as membrane proteins however this cannot be confirmed. Currently no function is known.
Probab=51.65 E-value=59 Score=25.54 Aligned_cols=40 Identities=15% Similarity=0.152 Sum_probs=33.0
Q ss_pred hcCCccEEEEcCCCCCCCHHHHHHHHHHHHc-CCcEEEEecCC
Q psy15702 6 LLRAASLFICTGPREQFTITELNSLKTYLDE-GGSLLIALGEE 47 (259)
Q Consensus 6 ip~~advLVI~~P~~~fs~~E~~~L~~fl~~-GG~llv~~~~~ 47 (259)
.++.-.++|.-.|. ++++++++|++.+++ .+++++.-.++
T Consensus 46 ~LEHGaV~i~Y~p~--~~~~~v~~L~~l~~~~~~~~visP~~~ 86 (130)
T PF11303_consen 46 NLEHGAVWITYDPC--LPPDQVAKLKALAKSCLPYVVISPYPG 86 (130)
T ss_pred hhhcCcEEEEECCC--CCHHHHHHHHHHHhccCCcEEEecCCC
Confidence 36777899999997 899999999999998 77777765444
No 36
>TIGR01753 flav_short flavodoxin, short chain. Flavodoxins are small redox-active proteins with a flavin mononucleotide (FMN) prosthetic group. They can act in nitrogen fixation by nitrogenase, in sulfite reduction, and light-dependent NADP+ reduction in during photosynthesis, among other roles. This model describes the short chain type. Many of these are involved in sulfite reduction.
Probab=50.50 E-value=38 Score=25.84 Aligned_cols=63 Identities=11% Similarity=0.074 Sum_probs=38.3
Q ss_pred hcCCccEEEEcCCCCCC--C-HHHHHHHHHHHHc---CCcEEEEecCCCC-C---CCCccHHHHHHHhCcccC
Q psy15702 6 LLRAASLFICTGPREQF--T-ITELNSLKTYLDE---GGSLLIALGEEGE-R---GSSTNINFLLEQYGVSVN 68 (259)
Q Consensus 6 ip~~advLVI~~P~~~f--s-~~E~~~L~~fl~~---GG~llv~~~~~~~-~---~~~~nln~LL~~~Gi~~~ 68 (259)
-+.++|.+|++.|+... . +.++..+-+++.. .|+-+.+++.++. . .....+..+|++.|..+-
T Consensus 42 ~l~~~d~iilgspty~~g~~p~~~~~~f~~~l~~~~~~gk~~~vfgt~g~~~~f~~~~~~~~~~l~~~g~~~v 114 (140)
T TIGR01753 42 DLLSYDAVLLGCSTWGDEDLEQDDFEPFFEELEDIDLGGKKVALFGSGDWGYEFCEAVDDWEERLKEAGATII 114 (140)
T ss_pred HHhcCCEEEEEcCCCCCCCCCcchHHHHHHHhhhCCCCCCEEEEEecCCCCchhhHHHHHHHHHHHHCCCEEe
Confidence 34579999999999532 2 2566666666652 3554444444332 1 123457788888898863
No 37
>COG1832 Predicted CoA-binding protein [General function prediction only]
Probab=49.99 E-value=59 Score=25.93 Aligned_cols=56 Identities=7% Similarity=0.038 Sum_probs=44.2
Q ss_pred hhhcCCccEEEEcCCCCCCCHHHHHHHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHHhCccc
Q psy15702 4 PELLRAASLFICTGPREQFTITELNSLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQYGVSV 67 (259)
Q Consensus 4 ~~ip~~advLVI~~P~~~fs~~E~~~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~~Gi~~ 67 (259)
.+||+..|++.++-| ++.-.+.++++++.|-+++++--.- ....-...+++.|+.+
T Consensus 68 ~dIpe~IDiVdvFR~----~e~~~~i~~eal~~~~kv~W~QlGi----~n~ea~~~~~~aG~~v 123 (140)
T COG1832 68 ADIPEPIDIVDVFRR----SEAAPEVAREALEKGAKVVWLQLGI----RNEEAAEKARDAGLDV 123 (140)
T ss_pred HhCCCCCcEEEEecC----hhhhHHHHHHHHhhCCCeEEEecCc----CCHHHHHHHHHhCcHH
Confidence 378999999999977 5556788899999999999984333 2445788999999954
No 38
>PRK05568 flavodoxin; Provisional
Probab=49.98 E-value=42 Score=25.93 Aligned_cols=65 Identities=12% Similarity=0.094 Sum_probs=37.5
Q ss_pred hcCCccEEEEcCCCCCCC---HHHHHHHHHHHH---cCCcEEEEecCCCC-CCCCccHHHHHHHhCcccCCc
Q psy15702 6 LLRAASLFICTGPREQFT---ITELNSLKTYLD---EGGSLLIALGEEGE-RGSSTNINFLLEQYGVSVNSD 70 (259)
Q Consensus 6 ip~~advLVI~~P~~~fs---~~E~~~L~~fl~---~GG~llv~~~~~~~-~~~~~nln~LL~~~Gi~~~~~ 70 (259)
...++|.||++.|..... +..+..+-+.+. +|-++.++...+.. ......+...|+..|+.+-..
T Consensus 45 ~~~~~d~iilgsp~y~~~~~~~~~~~~f~~~~~~~~~~k~~~~f~t~G~~~~~~~~~~~~~l~~~g~~~~~~ 116 (142)
T PRK05568 45 DVKGADVVALGSPAMGDEVLEEGEMEPFVESISSLVKGKKLVLFGSYGWGDGEWMRDWVERMEGYGANLVNE 116 (142)
T ss_pred HHHhCCEEEEECCccCcccccchhHHHHHHHhhhhhCCCEEEEEEccCCCCChHHHHHHHHHHHCCCEEeCC
Confidence 357899999999985332 233433333332 44455555544322 112355677888888886554
No 39
>PF13344 Hydrolase_6: Haloacid dehalogenase-like hydrolase; PDB: 2HO4_B 1YV9_A 1WVI_B 3EPR_A 2P27_A 2OYC_A 2CFT_A 2P69_A 2CFS_A 2CFR_A ....
Probab=49.37 E-value=30 Score=25.62 Aligned_cols=51 Identities=14% Similarity=0.280 Sum_probs=35.0
Q ss_pred CHHHHHHHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHHhCcccCCceeeec
Q psy15702 23 TITELNSLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQYGVSVNSDCVVRC 75 (259)
Q Consensus 23 s~~E~~~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~~Gi~~~~~~V~d~ 75 (259)
-+...++|+..-++|.+++++.+.... ....+-.-|+.+|+.+..+.|+-+
T Consensus 16 ipga~e~l~~L~~~g~~~~~lTNns~~--s~~~~~~~L~~~Gi~~~~~~i~ts 66 (101)
T PF13344_consen 16 IPGAVEALDALRERGKPVVFLTNNSSR--SREEYAKKLKKLGIPVDEDEIITS 66 (101)
T ss_dssp -TTHHHHHHHHHHTTSEEEEEES-SSS---HHHHHHHHHHTTTT--GGGEEEH
T ss_pred CcCHHHHHHHHHHcCCCEEEEeCCCCC--CHHHHHHHHHhcCcCCCcCEEECh
Confidence 344578888888889999999887653 235677777999999988777743
No 40
>TIGR02469 CbiT precorrin-6Y C5,15-methyltransferase (decarboxylating), CbiT subunit. This model recognizes the CbiT methylase which is responsible, in part (along with CbiE), for methylating precorrin-6y (or cobalt-precorrin-6y) at both the 5 and 15 positions as well as the concomitant decarbozylation at C-12. In many organisms, this protein is fused to the CbiE subunit. The fused protein, when found in organisms catalyzing the oxidative version of the cobalamin biosynthesis pathway, is called CobL.
Probab=48.80 E-value=30 Score=25.48 Aligned_cols=37 Identities=16% Similarity=0.244 Sum_probs=28.3
Q ss_pred cCCccEEEEcCCCCCCCHHHHHHHHHHHHcCCcEEEEe
Q psy15702 7 LRAASLFICTGPREQFTITELNSLKTYLDEGGSLLIAL 44 (259)
Q Consensus 7 p~~advLVI~~P~~~fs~~E~~~L~~fl~~GG~llv~~ 44 (259)
++..|++++.++...+ +.-++.+.++|+.||.+++.+
T Consensus 86 ~~~~D~v~~~~~~~~~-~~~l~~~~~~Lk~gG~li~~~ 122 (124)
T TIGR02469 86 LPEPDRVFIGGSGGLL-QEILEAIWRRLRPGGRIVLNA 122 (124)
T ss_pred cCCCCEEEECCcchhH-HHHHHHHHHHcCCCCEEEEEe
Confidence 4678999987765433 455889999999999988753
No 41
>PRK14813 NADH dehydrogenase subunit B; Provisional
Probab=46.26 E-value=42 Score=28.15 Aligned_cols=39 Identities=15% Similarity=0.330 Sum_probs=32.5
Q ss_pred hhcCCccEEEEcCCCCCCCHHHHHHHHHHHHc--CCcEEEEecC
Q psy15702 5 ELLRAASLFICTGPREQFTITELNSLKTYLDE--GGSLLIALGE 46 (259)
Q Consensus 5 ~ip~~advLVI~~P~~~fs~~E~~~L~~fl~~--GG~llv~~~~ 46 (259)
..|+.||+++|.|| .+..+...+++-.++ .=|.+|.++.
T Consensus 63 asPR~ADvllVtG~---Vt~km~~~l~~~y~qmPePK~VIA~Ga 103 (189)
T PRK14813 63 SSPRQSDLMIVAGT---VTMKMAERVVRLYEQMPEPRYVLSMGS 103 (189)
T ss_pred CCcccceEEEEecc---CchhhHHHHHHHHHhCCCCCEEEEecc
Confidence 57999999999999 577777777777776 8889998886
No 42
>cd01355 AcnX Putative Aconitase X catalytic domain. Putative Aconitase X catalytic domain. It is predicted by comparative genomic analysis. The proteins are mainly found in archaea and proteobacteria. They are distantly related to Aconitase family of proteins by sequence similarity and seconary structure prediction. The functions have not yet been experimentally characterized. Thus, the prediction should be treated with caution.
Probab=45.88 E-value=42 Score=31.49 Aligned_cols=60 Identities=18% Similarity=0.248 Sum_probs=45.7
Q ss_pred CCccEEEEcCCCCCCCHHHHHHHHHHHHcC----C-cEEEEecCCCCCCCCccHHHHHHHhCcccCCc
Q psy15702 8 RAASLFICTGPREQFTITELNSLKTYLDEG----G-SLLIALGEEGERGSSTNINFLLEQYGVSVNSD 70 (259)
Q Consensus 8 ~~advLVI~~P~~~fs~~E~~~L~~fl~~G----G-~llv~~~~~~~~~~~~nln~LL~~~Gi~~~~~ 70 (259)
++.|+++|..|. +|-+|+..|-+.+++- + .++|..+..-- ....++-..+|.+|+++-.|
T Consensus 277 ~~~dlv~lGcPH--~Sl~E~~~la~ll~g~~~~~~~~~~v~t~r~v~-a~~~g~~~~ie~~G~~i~~D 341 (389)
T cd01355 277 SEPDLVVLGCPH--ASLEELRKLADLLAGRRVAPSVPLYVTTSRAVY-AKRMGYVDVIEKLGARVLTD 341 (389)
T ss_pred cCCCEEEecCCC--CCHHHHHHHHHHhcCCccCCCCCEEEEccHHHH-HhhccHHHHHHHcCCEEEee
Confidence 468999999995 8999999999999654 2 47777555420 13457889999999998665
No 43
>PRK14968 putative methyltransferase; Provisional
Probab=45.84 E-value=56 Score=26.17 Aligned_cols=39 Identities=23% Similarity=0.428 Sum_probs=29.1
Q ss_pred HHHHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHHhCcccC
Q psy15702 27 LNSLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQYGVSVN 68 (259)
Q Consensus 27 ~~~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~~Gi~~~ 68 (259)
++.+.++|+.||.+++...... ....+..++++.|.++.
T Consensus 131 i~~~~~~Lk~gG~~~~~~~~~~---~~~~l~~~~~~~g~~~~ 169 (188)
T PRK14968 131 LDEVGRYLKPGGRILLLQSSLT---GEDEVLEYLEKLGFEAE 169 (188)
T ss_pred HHHHHHhcCCCeEEEEEEcccC---CHHHHHHHHHHCCCeee
Confidence 6777888999999888775543 23468889999997653
No 44
>PF08468 MTS_N: Methyltransferase small domain N-terminal; InterPro: IPR013675 This domain is found to the N terminus of the methyltransferase small domain (IPR007848 from INTERPRO) in bacterial proteins []. ; GO: 0008990 rRNA (guanine-N2-)-methyltransferase activity, 0006364 rRNA processing; PDB: 2PJD_A.
Probab=44.92 E-value=54 Score=26.50 Aligned_cols=51 Identities=18% Similarity=0.246 Sum_probs=31.9
Q ss_pred CCccEEEEcCCCCCCCHHHHHHHHHH----HHcCCcEEEEecCCCCCCCCccHHHHHHHhC
Q psy15702 8 RAASLFICTGPREQFTITELNSLKTY----LDEGGSLLIALGEEGERGSSTNINFLLEQYG 64 (259)
Q Consensus 8 ~~advLVI~~P~~~fs~~E~~~L~~f----l~~GG~llv~~~~~~~~~~~~nln~LL~~~G 64 (259)
.++|.+|+.=|+ +.+|..-+-.. |..||.++|.-+-.++ -.....+|++||
T Consensus 68 ~~~D~vvly~PK---aK~e~~~lL~~l~~~L~~g~~i~vVGEnk~G---IkSa~K~L~~~~ 122 (155)
T PF08468_consen 68 QDFDTVVLYWPK---AKAEAQYLLANLLSHLPPGTEIFVVGENKGG---IKSAEKQLAPYG 122 (155)
T ss_dssp TT-SEEEEE--S---SHHHHHHHHHHHHTTS-TT-EEEEEEEGGGT---GGGHHHHHTTTS
T ss_pred cCCCEEEEEccC---cHHHHHHHHHHHHHhCCCCCEEEEEecCccc---HHHHHHHHHhhC
Confidence 679999999998 55665444444 4568998887554443 356788899996
No 45
>TIGR01957 nuoB_fam NADH-quinone oxidoreductase, B subunit. This model describes the B chain of complexes that resemble NADH-quinone oxidoreductases. The electron acceptor is a quinone, ubiquinone, in mitochondria and most bacteria, including Escherichia coli, where the recommended gene symbol is nuoB. The quinone is plastoquinone in Synechocystis (where the chain is designated K) and in chloroplast, where NADH may be replaced by NADPH. In the methanogenic archaeal genus Methanosarcina, NADH is replaced by F420H2.
Probab=44.89 E-value=50 Score=26.49 Aligned_cols=39 Identities=23% Similarity=0.338 Sum_probs=31.2
Q ss_pred hhcCCccEEEEcCCCCCCCHHHHHHHHHHHHc--CCcEEEEecC
Q psy15702 5 ELLRAASLFICTGPREQFTITELNSLKTYLDE--GGSLLIALGE 46 (259)
Q Consensus 5 ~ip~~advLVI~~P~~~fs~~E~~~L~~fl~~--GG~llv~~~~ 46 (259)
..|++||+++|.||- +..+..+|++=.++ .-+.+|+++.
T Consensus 53 ~sPr~aDvllVtG~v---t~~~~~~l~~~~e~~p~pk~VIA~Gs 93 (145)
T TIGR01957 53 ASPRQADVMIVAGTV---TKKMAPALRRLYDQMPEPKWVISMGA 93 (145)
T ss_pred CCCCcceEEEEecCC---cHHHHHHHHHHHHhccCCceEEEecc
Confidence 579999999999994 56677777776655 7888888776
No 46
>PF05846 Chordopox_A15: Chordopoxvirus A15 protein; InterPro: IPR008445 This family consists of several Chordopoxvirus A15 like sequences.
Probab=44.54 E-value=60 Score=23.85 Aligned_cols=40 Identities=15% Similarity=0.395 Sum_probs=33.0
Q ss_pred hhcCCccEEEEcCCCCCCCHHHHHHHHHHHHc---CCcEEEEecCCC
Q psy15702 5 ELLRAASLFICTGPREQFTITELNSLKTYLDE---GGSLLIALGEEG 48 (259)
Q Consensus 5 ~ip~~advLVI~~P~~~fs~~E~~~L~~fl~~---GG~llv~~~~~~ 48 (259)
.+.+++..+++..| +..|+-++--|+++ ||+++++.+...
T Consensus 41 ~i~~~~ksilLVNP----s~~~Llkicv~ikr~kW~G~i~iLFe~~n 83 (90)
T PF05846_consen 41 KIDKDFKSILLVNP----SYIQLLKICVYIKRNKWGGNIYILFERDN 83 (90)
T ss_pred ccccccceEEEECC----CHHHHHHHHHHHhcCCcCCeEEEEEecCC
Confidence 46788889999999 56888999888864 899999988754
No 47
>PF08282 Hydrolase_3: haloacid dehalogenase-like hydrolase; InterPro: IPR013200 The Haloacid Dehydrogenase (HAD) superfamily includes phosphatases, phosphonatases, P-type ATPases, beta-phosphoglucomutases, phosphomannomutases, and dehalogenases, which are involved in a variety of cellular processes ranging from amino acid biosynthesis to detoxification []. This HAD domain is found in several distinct enzymes including: Phospholipid-transporting ATPase 1 (3.6.3.1 from EC), a putative lipid-flipping enzyme involved in cold tolerance in Arabidopsis [] 3-deoxy-D-manno-octulosonate (KDO) 8-phosphate phosphatase (3.1.3.45 from EC), which catalyses the final step in the biosynthesis of KDO - a component of lipopolysaccharide in Gram-negative bacteria [] Mannosyl-3-phosphoglycerate phosphatase (3.1.3.70 from EC), which hydrolyzes mannosyl-3-phosphoglycerate to form the osmolyte mannosylglycerate [] Phosphoglycolate phopshatase (3.1.3.18 from EC), which catalyses the dephosphorylation of 2-phosphoglycolate [] ; PDB: 2B30_B 3R4C_A 1XVI_B 3IJ5_B 3MMZ_C 3L7Y_A 1XPJ_C 1RLT_B 1RLM_B 2HF2_A ....
Probab=42.59 E-value=36 Score=28.26 Aligned_cols=45 Identities=24% Similarity=0.435 Sum_probs=36.4
Q ss_pred CCCCCCCHHHHHHHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHHhCcc
Q psy15702 17 GPREQFTITELNSLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQYGVS 66 (259)
Q Consensus 17 ~P~~~fs~~E~~~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~~Gi~ 66 (259)
.+...+++..+++|+++.++|.++.+..+-. ...+..++.++|+.
T Consensus 11 ~~~~~i~~~~~~al~~l~~~g~~~~i~TGR~-----~~~~~~~~~~~~~~ 55 (254)
T PF08282_consen 11 NSDGKISPETIEALKELQEKGIKLVIATGRS-----YSSIKRLLKELGID 55 (254)
T ss_dssp STTSSSCHHHHHHHHHHHHTTCEEEEECSST-----HHHHHHHHHHTTHC
T ss_pred cCCCeeCHHHHHHHHhhcccceEEEEEccCc-----ccccccccccccch
Confidence 3556689999999999999999888876654 35688899988876
No 48
>PF04270 Strep_his_triad: Streptococcal histidine triad protein ; InterPro: IPR006270 This entry represents a repeated sequence region that includes a His-X-X-His-X-His (histidine triad) motif, which is found in family of Streptococcal proteins. Members of the family are suggested to cleave human complement component 3, and family member PhpA has been shown in vaccine studies to be a protective antigen in mice []. ; PDB: 2CS7_C.
Probab=42.02 E-value=16 Score=24.11 Aligned_cols=19 Identities=26% Similarity=0.644 Sum_probs=15.5
Q ss_pred CCCCCCHHHHHHHHHHHHc
Q psy15702 18 PREQFTITELNSLKTYLDE 36 (259)
Q Consensus 18 P~~~fs~~E~~~L~~fl~~ 36 (259)
|+.+||+.|+++-++|+++
T Consensus 34 ~k~dLs~~E~~aA~~~~~~ 52 (53)
T PF04270_consen 34 PKSDLSASELKAAQAYLAG 52 (53)
T ss_dssp EGGGS-HHHHHHHHHHHH-
T ss_pred chhhCCHHHHHHHHHHHhc
Confidence 5678999999999999975
No 49
>PF01210 NAD_Gly3P_dh_N: NAD-dependent glycerol-3-phosphate dehydrogenase N-terminus; InterPro: IPR011128 NAD-dependent glycerol-3-phosphate dehydrogenase (GPDH) catalyses the interconversion of dihydroxyacetone phosphate and L-glycerol-3-phosphate. This family represents the N-terminal NAD-binding domain [].; GO: 0016616 oxidoreductase activity, acting on the CH-OH group of donors, NAD or NADP as acceptor, 0051287 NAD binding, 0046168 glycerol-3-phosphate catabolic process, 0055114 oxidation-reduction process, 0005737 cytoplasm; PDB: 1YJ8_B 2PLA_A 1WPQ_B 1X0V_A 1X0X_A 1BG6_A 1TXG_B 1N1G_A 1M67_A 1JDJ_A ....
Probab=40.98 E-value=12 Score=29.90 Aligned_cols=56 Identities=23% Similarity=0.313 Sum_probs=32.6
Q ss_pred hhcCCccEEEEcCCCCCCCHHHHHHHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHH
Q psy15702 5 ELLRAASLFICTGPREQFTITELNSLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQ 62 (259)
Q Consensus 5 ~ip~~advLVI~~P~~~fs~~E~~~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~ 62 (259)
+..+++|++|++-|.. +-++=++.|..|+++|-.+++ +..+-+.....-+..++++
T Consensus 65 ~a~~~ad~IiiavPs~-~~~~~~~~l~~~l~~~~~ii~-~~KG~~~~~~~~~~~~i~~ 120 (157)
T PF01210_consen 65 EALEDADIIIIAVPSQ-AHREVLEQLAPYLKKGQIIIS-ATKGFEPGTLLLLSEVIEE 120 (157)
T ss_dssp HHHTT-SEEEE-S-GG-GHHHHHHHHTTTSHTT-EEEE-TS-SEETTEEEEHHHHHHH
T ss_pred HHhCcccEEEecccHH-HHHHHHHHHhhccCCCCEEEE-ecCCcccCCCccHHHHHHH
Confidence 4678999999999985 557779999999965555444 3333222223344555554
No 50
>TIGR00138 gidB 16S rRNA methyltransferase GidB. GidB (glucose-inhibited division protein B) appears to be present and in a single copy in nearly all complete eubacterial genomes. It is missing only from some obligate intracellular species of various lineages (Chlamydiae, Ehrlichia, Wolbachia, Anaplasma, Buchnera, etc.). GidB shows a methytransferase fold in its the crystal structure, and acts as a 7-methylguanosine (m(7)G) methyltransferase, apparently specific to 16S rRNA.
Probab=40.91 E-value=46 Score=27.37 Aligned_cols=55 Identities=16% Similarity=0.118 Sum_probs=36.2
Q ss_pred cCCccEEEEcCCCCCCCHHHHHHHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHH---hCccc
Q psy15702 7 LRAASLFICTGPREQFTITELNSLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQ---YGVSV 67 (259)
Q Consensus 7 p~~advLVI~~P~~~fs~~E~~~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~---~Gi~~ 67 (259)
...+|+++..+ -..+ +.-++.+.+.|+.||.+++..++.. ...+..+.+. +|+..
T Consensus 107 ~~~fD~I~s~~-~~~~-~~~~~~~~~~LkpgG~lvi~~~~~~----~~~~~~~~e~~~~~~~~~ 164 (181)
T TIGR00138 107 EEQFDVITSRA-LASL-NVLLELTLNLLKVGGYFLAYKGKKY----LDEIEEAKRKCQVLGVEP 164 (181)
T ss_pred cCCccEEEehh-hhCH-HHHHHHHHHhcCCCCEEEEEcCCCc----HHHHHHHHHhhhhcCceE
Confidence 35688766543 2222 2345677899999999998865543 4566677777 78775
No 51
>TIGR01019 sucCoAalpha succinyl-CoA synthetase, alpha subunit. ATP citrate lyases appear to form an outgroup.
Probab=40.78 E-value=58 Score=29.15 Aligned_cols=59 Identities=15% Similarity=0.182 Sum_probs=42.6
Q ss_pred hhcCC--ccEEEEcCCCCCCCHHHHHHHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHHhCccc
Q psy15702 5 ELLRA--ASLFICTGPREQFTITELNSLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQYGVSV 67 (259)
Q Consensus 5 ~ip~~--advLVI~~P~~~fs~~E~~~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~~Gi~~ 67 (259)
++|+. .|+.||.-|. +.=.+++++-.++|=+.++.+..+........+....+++|+++
T Consensus 57 dlp~~~~~Dlavi~vpa----~~v~~~l~e~~~~Gvk~avIis~Gf~e~~~~~l~~~a~~~giri 117 (286)
T TIGR01019 57 EAVEETGANASVIFVPA----PFAADAIFEAIDAGIELIVCITEGIPVHDMLKVKRYMEESGTRL 117 (286)
T ss_pred HHhhccCCCEEEEecCH----HHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHHHcCCEE
Confidence 57776 7999999883 45578889999999888777776643211235667777889886
No 52
>COG1492 CobQ Cobyric acid synthase [Coenzyme metabolism]
Probab=40.46 E-value=44 Score=32.22 Aligned_cols=42 Identities=24% Similarity=0.358 Sum_probs=34.1
Q ss_pred cCCccEEEEcCCCCCCCHHH-------HHHHHHHHHcCCcEEEEecCCC
Q psy15702 7 LRAASLFICTGPREQFTITE-------LNSLKTYLDEGGSLLIALGEEG 48 (259)
Q Consensus 7 p~~advLVI~~P~~~fs~~E-------~~~L~~fl~~GG~llv~~~~~~ 48 (259)
+.++|++||.++++.+++-+ -.+|.+|+++||.++-..+..+
T Consensus 288 l~~~dlvIlPGsk~t~~DL~~lr~~g~d~~i~~~~~~~~~viGICGG~Q 336 (486)
T COG1492 288 LRDADLVILPGSKNTIADLKILREGGMDEKILEYARKGGDVIGICGGYQ 336 (486)
T ss_pred CCCCCEEEeCCCcccHHHHHHHHHcCHHHHHHHHHhCCCCEEEEcchHH
Confidence 45699999999998777743 3489999999999999877653
No 53
>TIGR01457 HAD-SF-IIA-hyp2 HAD-superfamily subfamily IIA hydrolase, TIGR01457. This hypothetical equivalog is a member of the Class IIA subfamily of the haloacid dehalogenase superfamily of aspartate-nucleophile hydrolases. The sequences modelled by this equivalog are all gram positive (low-GC) bacteria. Sequences found in this model are annotated variously as related to NagD or 4-nitrophenyl phosphatase, and this hypothetical equivalog, of all of those within the Class IIA subfamily, is most closely related to the E. coli NagD enzyme and the PGP_euk equivalog (TIGR01452). However, there is presently no evidence that this hypothetical equivalog has the same function of either those.
Probab=40.00 E-value=59 Score=28.09 Aligned_cols=63 Identities=6% Similarity=0.120 Sum_probs=46.2
Q ss_pred CccEEEEcCCCCCCCHHHHHHHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHHhCcccCCceeeec
Q psy15702 9 AASLFICTGPREQFTITELNSLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQYGVSVNSDCVVRC 75 (259)
Q Consensus 9 ~advLVI~~P~~~fs~~E~~~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~~Gi~~~~~~V~d~ 75 (259)
|.|-.+..+.+ .++ ...++|++..++|.+++++.+-.+. ....+...|+..|+.+..+.|+-.
T Consensus 7 D~DGtl~~~~~-~i~-~a~~~l~~l~~~g~~~~~~Tnn~~r--~~~~~~~~l~~~g~~~~~~~iit~ 69 (249)
T TIGR01457 7 DLDGTMYKGKE-RIP-EAETFVHELQKRDIPYLFVTNNSTR--TPESVAEMLASFDIPATLETVFTA 69 (249)
T ss_pred eCCCceEcCCe-eCc-CHHHHHHHHHHCCCeEEEEeCCCCC--CHHHHHHHHHHcCCCCChhhEeeH
Confidence 45555555544 455 5789999999999999998874432 345688889999999988777754
No 54
>COG2813 RsmC 16S RNA G1207 methylase RsmC [Translation, ribosomal structure and biogenesis]
Probab=39.21 E-value=43 Score=30.24 Aligned_cols=63 Identities=16% Similarity=0.232 Sum_probs=43.1
Q ss_pred hhhcCCccEEEEcCCCCC-CCHHHHHHHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHHhCcccCC
Q psy15702 4 PELLRAASLFICTGPREQ-FTITELNSLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQYGVSVNS 69 (259)
Q Consensus 4 ~~ip~~advLVI~~P~~~-fs~~E~~~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~~Gi~~~~ 69 (259)
...+++||++++..|+.+ ..+..+..+..=+..||.+++.-+-... -..+..+|++||-.-..
T Consensus 32 ~~~~~~~d~~l~~~pK~~~e~e~qLa~ll~~~~~g~~i~v~g~~~~g---~~s~~k~l~~~~~~~~~ 95 (300)
T COG2813 32 ADAPDDFDAVLLYWPKHKAEAEFQLAQLLARLPPGGEIVVVGEKRDG---VRSAEKMLEKYGGPTKT 95 (300)
T ss_pred ccccCCCCEEEEEccCchHHHHHHHHHHHhhCCCCCeEEEEecccch---HHHHHHHHHHhcCcccc
Confidence 345779999999999852 3333466677777789998887554322 24577888888765543
No 55
>PF02593 dTMP_synthase: Thymidylate synthase; InterPro: IPR003745 This entry describes proteins of unknown function.
Probab=38.94 E-value=80 Score=27.14 Aligned_cols=60 Identities=23% Similarity=0.314 Sum_probs=39.3
Q ss_pred cCCccEEEEcCCCCCCCHHHHHHHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHHhCcccCC
Q psy15702 7 LRAASLFICTGPREQFTITELNSLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQYGVSVNS 69 (259)
Q Consensus 7 p~~advLVI~~P~~~fs~~E~~~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~~Gi~~~~ 69 (259)
+.+||++|..+=--|++..=...+++ +|.+.++.-...+.......+...++++|+.+.-
T Consensus 49 i~~~Dl~I~y~lHPDl~~~l~~~~~e---~g~kavIvp~~~~~~g~~~~lk~~~e~~gi~~~~ 108 (217)
T PF02593_consen 49 IPEADLLIAYGLHPDLTYELPEIAKE---AGVKAVIVPSESPKPGLRRQLKKQLEEFGIEVEF 108 (217)
T ss_pred CCCCCEEEEeccCchhHHHHHHHHHH---cCCCEEEEecCCCccchHHHHHHHHHhcCceeec
Confidence 57899988754333565544444443 7888777766554322345799999999988753
No 56
>TIGR03534 RF_mod_PrmC protein-(glutamine-N5) methyltransferase, release factor-specific. Members of this protein family are HemK (PrmC), a protein once thought to be involved in heme biosynthesis but now recognized to be a protein-glutamine methyltransferase that modifies the peptide chain release factors. All members of the seed alignment are encoded next to the release factor 1 gene (prfA) and confirmed by phylogenetic analysis. SIMBAL analysis (manuscript in prep.) shows the motif [LIV]PRx[DE]TE (in Escherichia coli, IPRPDTE) confers specificity for the release factors rather than for ribosomal protein L3.
Probab=38.74 E-value=63 Score=27.33 Aligned_cols=36 Identities=25% Similarity=0.463 Sum_probs=27.0
Q ss_pred HHHHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHHhCcc
Q psy15702 27 LNSLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQYGVS 66 (259)
Q Consensus 27 ~~~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~~Gi~ 66 (259)
+..+.++|+.||.+++..+... ...+..+++++|..
T Consensus 200 i~~~~~~L~~gG~~~~~~~~~~----~~~~~~~l~~~gf~ 235 (251)
T TIGR03534 200 IAQAPRLLKPGGWLLLEIGYDQ----GEAVRALFEAAGFA 235 (251)
T ss_pred HHHHHHhcccCCEEEEEECccH----HHHHHHHHHhCCCC
Confidence 4566778899999988766542 35688889998864
No 57
>PRK03767 NAD(P)H:quinone oxidoreductase; Provisional
Probab=38.66 E-value=36 Score=28.33 Aligned_cols=64 Identities=17% Similarity=0.244 Sum_probs=35.8
Q ss_pred hhhcCCccEEEEcCCCCCCCHHHHHHHHHHHHcC----------CcEEEEecCCC----CC-CCCccHHHHHHHhCcccC
Q psy15702 4 PELLRAASLFICTGPREQFTITELNSLKTYLDEG----------GSLLIALGEEG----ER-GSSTNINFLLEQYGVSVN 68 (259)
Q Consensus 4 ~~ip~~advLVI~~P~~~fs~~E~~~L~~fl~~G----------G~llv~~~~~~----~~-~~~~nln~LL~~~Gi~~~ 68 (259)
.+.+.+||.+|+..|....+ -...++.|+++- ||...++...+ +. .....+..++..+|+.+-
T Consensus 64 ~~~l~~aD~ii~gsPty~g~--~~~~lk~fld~~~~~~~~~~l~gK~~~~f~s~g~~~Gg~~~~l~~l~~~~~~~gm~vv 141 (200)
T PRK03767 64 PDELADYDAIIFGTPTRFGN--MAGQMRNFLDQTGGLWAKGALVGKVGSVFTSTGTQHGGQETTITSTHTTLLHHGMVIV 141 (200)
T ss_pred HHHHHhCCEEEEEecccCCC--chHHHHHHHHHhccccccCCccCCEEEEEEeCCCCCCChHHHHHHHHHHHHHcCCEEe
Confidence 34567899999999985332 223455555432 45444333322 11 122345555578998875
Q ss_pred C
Q psy15702 69 S 69 (259)
Q Consensus 69 ~ 69 (259)
+
T Consensus 142 ~ 142 (200)
T PRK03767 142 G 142 (200)
T ss_pred C
Confidence 4
No 58
>cd02440 AdoMet_MTases S-adenosylmethionine-dependent methyltransferases (SAM or AdoMet-MTase), class I; AdoMet-MTases are enzymes that use S-adenosyl-L-methionine (SAM or AdoMet) as a substrate for methyltransfer, creating the product S-adenosyl-L-homocysteine (AdoHcy). There are at least five structurally distinct families of AdoMet-MTases, class I being the largest and most diverse. Within this class enzymes can be classified by different substrate specificities (small molecules, lipids, nucleic acids, etc.) and different target atoms for methylation (nitrogen, oxygen, carbon, sulfur, etc.).
Probab=37.84 E-value=41 Score=22.94 Aligned_cols=37 Identities=19% Similarity=0.264 Sum_probs=26.1
Q ss_pred cCCccEEEEcCCCCC---CCHHHHHHHHHHHHcCCcEEEE
Q psy15702 7 LRAASLFICTGPREQ---FTITELNSLKTYLDEGGSLLIA 43 (259)
Q Consensus 7 p~~advLVI~~P~~~---fs~~E~~~L~~fl~~GG~llv~ 43 (259)
+..+|+++...+-.. ....-++.+.+.++.||.+++.
T Consensus 64 ~~~~d~i~~~~~~~~~~~~~~~~l~~~~~~l~~~g~~~~~ 103 (107)
T cd02440 64 DESFDVIISDPPLHHLVEDLARFLEEARRLLKPGGVLVLT 103 (107)
T ss_pred CCceEEEEEccceeehhhHHHHHHHHHHHHcCCCCEEEEE
Confidence 567888888877543 2344567777888899988764
No 59
>COG0677 WecC UDP-N-acetyl-D-mannosaminuronate dehydrogenase [Cell envelope biogenesis, outer membrane]
Probab=37.73 E-value=82 Score=29.76 Aligned_cols=66 Identities=24% Similarity=0.375 Sum_probs=40.9
Q ss_pred hhhcCCccEEEEcCCCCCCCH----------HHHHHHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHH-hCcccCCce
Q psy15702 4 PELLRAASLFICTGPREQFTI----------TELNSLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQ-YGVSVNSDC 71 (259)
Q Consensus 4 ~~ip~~advLVI~~P~~~fs~----------~E~~~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~-~Gi~~~~~~ 71 (259)
.+.++.||++||+-|+ +|+. ...+.|-.+|++| .+.++=+.........-...+++. =|+.+..|.
T Consensus 79 ~~~l~~~dv~iI~VPT-Pl~~~~~pDls~v~~aa~sIa~~L~kG-~LVIlEST~~PGTTe~v~~plle~~sgL~~~~Df 155 (436)
T COG0677 79 PEELKECDVFIICVPT-PLKKYREPDLSYVESAARSIAPVLKKG-DLVILESTTPPGTTEEVVKPLLEERSGLKFGEDF 155 (436)
T ss_pred hhhcccCCEEEEEecC-CcCCCCCCChHHHHHHHHHHHHhcCCC-CEEEEecCCCCCcHHHHHHHHHhhcCCCccccee
Confidence 4556799999999997 3443 4578888888765 455543332211123345566665 577776553
No 60
>PF10131 PTPS_related: 6-pyruvoyl-tetrahydropterin synthase related domain; membrane protein; InterPro: IPR018776 This entry is found in various bacterial and archaeal hypothetical membrane proteins, as well as in tetratricopeptide TPR_2 repeat protein. Its function has not yet been established, though it shows similarity to 6-pyruvoyl-tetrahydropterin synthase.
Probab=37.40 E-value=54 Score=32.69 Aligned_cols=63 Identities=19% Similarity=0.170 Sum_probs=41.1
Q ss_pred ChhhcCCccEEEEcCCCCCCCHHHHHHHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHHhCcccCC
Q psy15702 3 TPELLRAASLFICTGPREQFTITELNSLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQYGVSVNS 69 (259)
Q Consensus 3 t~~ip~~advLVI~~P~~~fs~~E~~~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~~Gi~~~~ 69 (259)
|.+-+++.+++++.+|+.+=.+..-+.+.+|.+.||+++|.+.+-.. .....--+.+||..+.
T Consensus 481 ~~~~l~~~~~v~~~~~~~~~~~~~e~~~~~~~~~G~~v~i~~~~~~~----~~~~~~~~~lgv~~~~ 543 (616)
T PF10131_consen 481 TLEELPKYKVVYLSGPSYKDESKAEKLVSKLARSGGKVVIDMPRIPD----DRIARQGEFLGVTGEP 543 (616)
T ss_pred CHhHhccceEEEecCCCccchhHHHHHHHHHHhCCCEEEEEcCCCCc----chhhcccccccceEEE
Confidence 34567788999999997422222235899999999999998777432 2223333446766543
No 61
>PF12847 Methyltransf_18: Methyltransferase domain; PDB: 3G2Q_A 3G2O_A 3G2M_B 3G2P_B 3D2L_B 1IM8_B 3NJR_A 3E05_H 3EVZ_A 3HM2_A ....
Probab=37.31 E-value=25 Score=25.60 Aligned_cols=40 Identities=30% Similarity=0.584 Sum_probs=28.3
Q ss_pred hhhcCCccEEEEcC-CCCCCCHH-----HHHHHHHHHHcCCcEEEE
Q psy15702 4 PELLRAASLFICTG-PREQFTIT-----ELNSLKTYLDEGGSLLIA 43 (259)
Q Consensus 4 ~~ip~~advLVI~~-P~~~fs~~-----E~~~L~~fl~~GG~llv~ 43 (259)
.+.++.+|++++.+ .-..+... -++.+.+.|+.||.+++-
T Consensus 65 ~~~~~~~D~v~~~~~~~~~~~~~~~~~~~l~~~~~~L~pgG~lvi~ 110 (112)
T PF12847_consen 65 PDFLEPFDLVICSGFTLHFLLPLDERRRVLERIRRLLKPGGRLVIN 110 (112)
T ss_dssp TTTSSCEEEEEECSGSGGGCCHHHHHHHHHHHHHHHEEEEEEEEEE
T ss_pred cccCCCCCEEEECCCccccccchhHHHHHHHHHHHhcCCCcEEEEE
Confidence 35567899988887 33223332 278889999999998874
No 62
>PF03464 eRF1_2: eRF1 domain 2; InterPro: IPR005141 This domain is found in the release factor eRF1 which terminates protein biosynthesis by recognizing stop codons at the A site of the ribosome and stimulating peptidyl-tRNA bond hydrolysis at the peptidyl transferase centre. The crystal structure of human eRF1 is known []. The overall shape and dimensions of eRF1 resemble a tRNA molecule with domains 1, 2, and 3 of eRF1 corresponding to the anticodon loop, aminoacyl acceptor stem, and T stem of a tRNA molecule, respectively. The position of the essential GGQ motif at an exposed tip of domain 2 suggests that the Gln residue coordinates a water molecule to mediate the hydrolytic activity at the peptidyl transferase centre. A conserved groove on domain 1, 80 A from the GGQ motif, is proposed to form the codon recognition site []. This domain is also found in other proteins which may also be involved in translation termination ; PDB: 3AGK_A 2VGN_A 2VGM_A 3J16_A 3IZQ 3IR9_A 3OBW_A 3MCA_B 2QI2_A 3E1Y_D ....
Probab=37.15 E-value=75 Score=24.64 Aligned_cols=50 Identities=18% Similarity=0.282 Sum_probs=29.5
Q ss_pred cCCccEEEEcCCCCCCCHHHH--HHHHHHHHcC-CcEEEEecCCCCCCCCccHHHHHH
Q psy15702 7 LRAASLFICTGPREQFTITEL--NSLKTYLDEG-GSLLIALGEEGERGSSTNINFLLE 61 (259)
Q Consensus 7 p~~advLVI~~P~~~fs~~E~--~~L~~fl~~G-G~llv~~~~~~~~~~~~nln~LL~ 61 (259)
-.+.+.|||+||. |..+++ +...++-... .++.+.--..+ ...+++++|+
T Consensus 71 ~~~~~~iIiaGPG--f~k~~f~~~l~~~~~~~~~~~i~~~~~s~~---~~~gl~Evl~ 123 (133)
T PF03464_consen 71 FDDVKCIIIAGPG--FTKEEFYKYLKAEARRKDKKKIVVVDTSSG---GESGLNEVLK 123 (133)
T ss_dssp TTTCSEEEEEEST--THHHHHHHHHHHHHHHHTCCEEEEEE-SSS---CHHHHHHHHH
T ss_pred cccccEEEEECCH--HHHHHHHHHHHHhhHhhcCCEEEEEECCCC---CHHHHHHHHH
Confidence 4678999999996 677765 3334444455 55554332332 2456777765
No 63
>PRK10513 sugar phosphate phosphatase; Provisional
Probab=36.68 E-value=97 Score=26.61 Aligned_cols=45 Identities=9% Similarity=0.187 Sum_probs=35.4
Q ss_pred CCCCCCCHHHHHHHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHHhCcc
Q psy15702 17 GPREQFTITELNSLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQYGVS 66 (259)
Q Consensus 17 ~P~~~fs~~E~~~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~~Gi~ 66 (259)
.+...+++.-.++|++..++|.++.+..+-. ...+..+++++|+.
T Consensus 16 ~~~~~i~~~~~~ai~~l~~~G~~~~iaTGR~-----~~~~~~~~~~l~~~ 60 (270)
T PRK10513 16 LPDHTISPAVKQAIAAARAKGVNVVLTTGRP-----YAGVHRYLKELHME 60 (270)
T ss_pred CCCCccCHHHHHHHHHHHHCCCEEEEecCCC-----hHHHHHHHHHhCCC
Confidence 4455799999999999999999888876554 34577888998874
No 64
>PF12123 Amidase02_C: N-acetylmuramoyl-l-alanine amidase; InterPro: IPR021976 This domain is found in bacteria and viruses. This domain is about 50 amino acids in length. This domain is classified with the enzyme classification code 3.5.1.28 from EC. This domain is the C-terminal of the enzyme which hydrolyses the link between N-acetylmuramoyl residues and L-amino acid residues in certain cell-wall glycopeptides. ; PDB: 2L48_B.
Probab=36.35 E-value=29 Score=22.10 Aligned_cols=24 Identities=33% Similarity=0.679 Sum_probs=16.1
Q ss_pred EEEEcCCCCCCCHHHHHHHHHHHHcCC
Q psy15702 12 LFICTGPREQFTITELNSLKTYLDEGG 38 (259)
Q Consensus 12 vLVI~~P~~~fs~~E~~~L~~fl~~GG 38 (259)
..++.+| +++.|+++++.|+++.|
T Consensus 17 ~y~vT~~---~s~~~L~k~~~wld~rg 40 (45)
T PF12123_consen 17 PYFVTDP---LSDAELDKFTAWLDERG 40 (45)
T ss_dssp EEEEE-------HHHHHHHHHHHHHTT
T ss_pred EEEEeCC---CCHHHHHHHHHHHHhcC
Confidence 3444444 79999999999998776
No 65
>TIGR01177 conserved hypothetical protein TIGR01177. This family is found exclusively in the Archaea.
Probab=35.91 E-value=65 Score=29.08 Aligned_cols=59 Identities=15% Similarity=0.138 Sum_probs=39.4
Q ss_pred CCccEEEEcCCCC---C--------CCHHHHHHHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHHhCcccCCceee
Q psy15702 8 RAASLFICTGPRE---Q--------FTITELNSLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQYGVSVNSDCVV 73 (259)
Q Consensus 8 ~~advLVI~~P~~---~--------fs~~E~~~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~~Gi~~~~~~V~ 73 (259)
...|++|...|=. . +-..-+..+.+.|+.||++++.+... ..+..+++..|. +......
T Consensus 247 ~~~D~Iv~dPPyg~~~~~~~~~~~~l~~~~l~~~~r~Lk~gG~lv~~~~~~------~~~~~~~~~~g~-i~~~~~~ 316 (329)
T TIGR01177 247 ESVDAIATDPPYGRSTTAAGDGLESLYERSLEEFHEVLKSEGWIVYAVPTR------IDLESLAEDAFR-VVKRFEV 316 (329)
T ss_pred CCCCEEEECCCCcCcccccCCchHHHHHHHHHHHHHHccCCcEEEEEEcCC------CCHHHHHhhcCc-chheeee
Confidence 4678887765511 0 11334666777888999998886554 357889999999 7655443
No 66
>PRK13302 putative L-aspartate dehydrogenase; Provisional
Probab=35.04 E-value=73 Score=28.06 Aligned_cols=57 Identities=14% Similarity=0.153 Sum_probs=38.3
Q ss_pred hhcCCccEEEEcCCCCCCCHHHHHHHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHHhCccc
Q psy15702 5 ELLRAASLFICTGPREQFTITELNSLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQYGVSV 67 (259)
Q Consensus 5 ~ip~~advLVI~~P~~~fs~~E~~~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~~Gi~~ 67 (259)
++.+++|++|+..|.. ...+...+.++.|..+++. +.+.- .....+-...++.|..+
T Consensus 63 ell~~~D~Vvi~tp~~----~h~e~~~~aL~aGk~Vi~~-s~gal-~~~~~L~~~A~~~g~~l 119 (271)
T PRK13302 63 QLATHADIVVEAAPAS----VLRAIVEPVLAAGKKAIVL-SVGAL-LRNEDLIDLARQNGGQI 119 (271)
T ss_pred HHhcCCCEEEECCCcH----HHHHHHHHHHHcCCcEEEe-cchhH-HhHHHHHHHHHHcCCEE
Confidence 4567899999999963 3466668888998888875 32210 01245666777888774
No 67
>PRK08287 cobalt-precorrin-6Y C(15)-methyltransferase; Validated
Probab=34.91 E-value=65 Score=26.22 Aligned_cols=57 Identities=19% Similarity=0.137 Sum_probs=37.3
Q ss_pred cCCccEEEEcCCCCCCCHHHHHHHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHHhCccc
Q psy15702 7 LRAASLFICTGPREQFTITELNSLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQYGVSV 67 (259)
Q Consensus 7 p~~advLVI~~P~~~fs~~E~~~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~~Gi~~ 67 (259)
+...|++++.+.... .+.-+..+.+.|+.||.+++-.-... ....+..++++.|...
T Consensus 95 ~~~~D~v~~~~~~~~-~~~~l~~~~~~Lk~gG~lv~~~~~~~---~~~~~~~~l~~~g~~~ 151 (187)
T PRK08287 95 PGKADAIFIGGSGGN-LTAIIDWSLAHLHPGGRLVLTFILLE---NLHSALAHLEKCGVSE 151 (187)
T ss_pred CcCCCEEEECCCccC-HHHHHHHHHHhcCCCeEEEEEEecHh---hHHHHHHHHHHCCCCc
Confidence 457898888776432 34456778889999999866321111 2345667888888653
No 68
>PRK05678 succinyl-CoA synthetase subunit alpha; Validated
Probab=34.72 E-value=78 Score=28.41 Aligned_cols=59 Identities=17% Similarity=0.154 Sum_probs=42.8
Q ss_pred hhcCC--ccEEEEcCCCCCCCHHHHHHHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHHhCccc
Q psy15702 5 ELLRA--ASLFICTGPREQFTITELNSLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQYGVSV 67 (259)
Q Consensus 5 ~ip~~--advLVI~~P~~~fs~~E~~~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~~Gi~~ 67 (259)
++|+. .|+.||.-|. +.-.++|++-.++|=+..+.+..+........+-.+..++|+++
T Consensus 59 dlp~~~~~DlAvi~vp~----~~v~~~l~e~~~~gvk~avI~s~Gf~~~~~~~l~~~a~~~girv 119 (291)
T PRK05678 59 EAVEATGANASVIYVPP----PFAADAILEAIDAGIDLIVCITEGIPVLDMLEVKAYLERKKTRL 119 (291)
T ss_pred HHhhccCCCEEEEEcCH----HHHHHHHHHHHHCCCCEEEEECCCCCHHHHHHHHHHHHHcCCEE
Confidence 57776 8999999883 45578899999999887777776543101115667888899886
No 69
>TIGR01458 HAD-SF-IIA-hyp3 HAD-superfamily subfamily IIA hydrolase, TIGR01458. This hypothetical equivalog is a member of the IIA subfamily (TIGR01460) of the haloacid dehalogenase superfamily of aspartate-nucleophile hydrolases. One sequence (GP|10716807) has been annotated as a "phospholysine phosphohistidine inorganic pyrophosphatase," probably in reference to studies on similarly described (but unsequenced) enzymes from bovine and rat tissues. However, the supporting information for this annotation has never been published.
Probab=34.57 E-value=86 Score=27.22 Aligned_cols=66 Identities=12% Similarity=0.045 Sum_probs=47.4
Q ss_pred CCccEEEEcCCC--CCCCHHHHHHHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHHhCcccCCceeeec
Q psy15702 8 RAASLFICTGPR--EQFTITELNSLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQYGVSVNSDCVVRC 75 (259)
Q Consensus 8 ~~advLVI~~P~--~~fs~~E~~~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~~Gi~~~~~~V~d~ 75 (259)
-|.|-.+..+-+ ....+...++|++.-++|.+++++.+-... ....+-..|+.+|+.+..+.|+-+
T Consensus 6 ~D~DGtl~~~~~~~~~~~~~a~~al~~l~~~G~~~~~~Tn~~~~--~~~~~~~~l~~~g~~~~~~~i~ts 73 (257)
T TIGR01458 6 LDISGVLYISDAKSGVAVPGSQEAVKRLRGASVKVRFVTNTTKE--SKQDLLERLQRLGFDISEDEVFTP 73 (257)
T ss_pred EeCCCeEEeCCCcccCcCCCHHHHHHHHHHCCCeEEEEECCCCC--CHHHHHHHHHHcCCCCCHHHeEcH
Confidence 355656655543 125668899999988899999998876542 223577888999999988877754
No 70
>PF11293 DUF3094: Protein of unknown function (DUF3094); InterPro: IPR021444 This family of proteins with unknown function appears to be restricted to Gammaproteobacteria.
Probab=34.39 E-value=50 Score=21.84 Aligned_cols=20 Identities=15% Similarity=0.371 Sum_probs=17.0
Q ss_pred CCCCHHHHHHHHHHHHcCCc
Q psy15702 20 EQFTITELNSLKTYLDEGGS 39 (259)
Q Consensus 20 ~~fs~~E~~~L~~fl~~GG~ 39 (259)
.++++++.+++++|++.|=+
T Consensus 2 ~rL~pEDQ~~Vd~yL~a~~~ 21 (55)
T PF11293_consen 2 SRLNPEDQQRVDEYLQAGVN 21 (55)
T ss_pred CCCCHHHHHHHHHHHhCCCC
Confidence 46899999999999998744
No 71
>cd06905 Peptidase_M14-like_8 A functionally uncharacterized subgroup of the M14 family of metallocarboxypeptidases (MCPs). The M14 family are zinc-binding carboxypeptidases (CPs) which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. Two major subfamilies of the M14 family, defined based on sequence and structural homology, are the A/B and N/E subfamilies. Enzymes belonging to the A/B subfamily are normally synthesized as inactive precursors containing preceding signal peptide, followed by an N-terminal pro-region linked to the enzyme; these proenzymes are called procarboxypeptidases. The A/B enzymes can be further divided based on their substrate specificity; Carboxypeptidase A-like (CPA-like) enzymes favor hydrophobic residues while carboxypeptidase B-like (CPB-like) enzymes only cleave the basic residues lysine or arginine. The A forms have slightly differe
Probab=33.87 E-value=52 Score=30.49 Aligned_cols=32 Identities=13% Similarity=0.188 Sum_probs=25.9
Q ss_pred EcCCCCCCCHHHHHHHHHHHHcCCcEEEEecCC
Q psy15702 15 CTGPREQFTITELNSLKTYLDEGGSLLIALGEE 47 (259)
Q Consensus 15 I~~P~~~fs~~E~~~L~~fl~~GG~llv~~~~~ 47 (259)
-+||. +||+.|..+|.+|+++.+++.++++-+
T Consensus 228 y~G~~-p~SEpEt~av~~~~~~~~~i~~~is~H 259 (360)
T cd06905 228 GAGPF-PFSEPETRAVVEFWTDHPNINGFISYH 259 (360)
T ss_pred CCCCC-CCChHHHHHHHHHHhcCCCeEEEEEec
Confidence 35666 799999999999999888876666654
No 72
>PRK14816 NADH dehydrogenase subunit B; Provisional
Probab=33.77 E-value=84 Score=26.25 Aligned_cols=41 Identities=20% Similarity=0.192 Sum_probs=28.0
Q ss_pred hhcCCccEEEEcCCCCCCCHHHHHHHHHHHHcCCcEEEEecC
Q psy15702 5 ELLRAASLFICTGPREQFTITELNSLKTYLDEGGSLLIALGE 46 (259)
Q Consensus 5 ~ip~~advLVI~~P~~~fs~~E~~~L~~fl~~GG~llv~~~~ 46 (259)
..|+.||+++|.||-+.-....+..+.+- +..-+.+|+++.
T Consensus 77 ~sPRhADvllVtG~VT~~m~~~l~~~~e~-~p~pK~VIAvGs 117 (182)
T PRK14816 77 ASPRQADMIMVCGTITNKMAPVLKRLYDQ-MADPKYVIAVGG 117 (182)
T ss_pred CCCCcceEEEEecCCcchhHHHHHHHHHh-cCCCCEEEEecc
Confidence 56999999999999643333334433333 377788888776
No 73
>PRK00121 trmB tRNA (guanine-N(7)-)-methyltransferase; Reviewed
Probab=33.74 E-value=63 Score=26.94 Aligned_cols=58 Identities=17% Similarity=0.042 Sum_probs=38.7
Q ss_pred CCccEEEEcCCCCCC----------CHHHHHHHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHHhCcccC
Q psy15702 8 RAASLFICTGPREQF----------TITELNSLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQYGVSVN 68 (259)
Q Consensus 8 ~~advLVI~~P~~~f----------s~~E~~~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~~Gi~~~ 68 (259)
...|++++..|.... .+.-+..+.+.|+.||.+++..+... ....+-..++++|....
T Consensus 110 ~~~D~V~~~~~~p~~~~~~~~~~~~~~~~l~~i~~~LkpgG~l~i~~~~~~---~~~~~~~~~~~~g~~~~ 177 (202)
T PRK00121 110 GSLDRIYLNFPDPWPKKRHHKRRLVQPEFLALYARKLKPGGEIHFATDWEG---YAEYMLEVLSAEGGFLV 177 (202)
T ss_pred cccceEEEECCCCCCCccccccccCCHHHHHHHHHHcCCCCEEEEEcCCHH---HHHHHHHHHHhCccccc
Confidence 457888876553211 24458899999999999998765432 22345566777887665
No 74
>PF00891 Methyltransf_2: O-methyltransferase; InterPro: IPR001077 Methyl transfer from the ubiquitous S-adenosyl-L-methionine (AdoMet) to either nitrogen, oxygen or carbon atoms is frequently employed in diverse organisms ranging from bacteria to plants and mammals. The reaction is catalysed by methyltransferases (Mtases) and modifies DNA, RNA, proteins and small molecules, such as catechol for regulatory purposes. The various aspects of the role of DNA methylation in prokaryotic restriction-modification systems and in a number of cellular processes in eukaryotes including gene regulation and differentiation is well documented. Three classes of DNA Mtases transfer the methyl group from AdoMet to the target base to form either N-6-methyladenine, or N-4-methylcytosine, or C-5- methylcytosine. In C-5-cytosine Mtases, ten conserved motifs are arranged in the same order []. Motif I (a glycine-rich or closely related consensus sequence; FAGxGG in M.HhaI []), shared by other AdoMet-Mtases [], is part of the cofactor binding site and motif IV (PCQ) is part of the catalytic site. In contrast, sequence comparison among N-6-adenine and N-4-cytosine Mtases indicated two of the conserved segments [], although more conserved segments may be present. One of them corresponds to motif I in C-5-cytosine Mtases, and the other is named (D/N/S)PP(Y/F). Crystal structures are known for a number of Mtases [, , , ]. The cofactor binding sites are almost identical and the essential catalytic amino acids coincide. The comparable protein folding and the existence of equivalent amino acids in similar secondary and tertiary positions indicate that many (if not all) AdoMet-Mtases have a common catalytic domain structure. This permits tertiary structure prediction of other DNA, RNA, protein, and small-molecule AdoMet-Mtases from their amino acid sequences []. This domain includes a range of O-methyltransferases some of which utilise S-adenosyl methionine as substrate []. In prokaryotes, the major role of DNA methylation is to protect host DNA against degradation by restriction enzymes. In eukaryotes, DNA methylation has been implicated in the control of several cellular processes, including differentiation, gene regulation, and embryonic development. O-methyltransferases have a common catalytic domain structure, which might be universal among S-adenosyl-L-methionine (AdoMet)-dependent methyltransferases []. Comparative analysis of the predicted amino acid sequences of a number of plant O-methyltransferase cDNA clones show that they share some 32-71% sequence identity, and can be grouped according to the different compounds they utilise as substrates [].; GO: 0008171 O-methyltransferase activity; PDB: 1FPQ_A 1FP1_D 3P9K_B 3P9I_D 3P9C_A 3I53_A 3I5U_A 3I64_A 3I58_A 1ZG3_A ....
Probab=33.54 E-value=37 Score=28.95 Aligned_cols=39 Identities=23% Similarity=0.338 Sum_probs=32.1
Q ss_pred hhhcCCccEEEEcCCCCCCCHHH----HHHHHHHHHcC--CcEEEE
Q psy15702 4 PELLRAASLFICTGPREQFTITE----LNSLKTYLDEG--GSLLIA 43 (259)
Q Consensus 4 ~~ip~~advLVI~~P~~~fs~~E----~~~L~~fl~~G--G~llv~ 43 (259)
..+|. +|++++.+==.++++++ ++++.+-+..| |+|+|.
T Consensus 154 ~~~P~-~D~~~l~~vLh~~~d~~~~~iL~~~~~al~pg~~g~llI~ 198 (241)
T PF00891_consen 154 DPLPV-ADVYLLRHVLHDWSDEDCVKILRNAAAALKPGKDGRLLII 198 (241)
T ss_dssp TCCSS-ESEEEEESSGGGS-HHHHHHHHHHHHHHSEECTTEEEEEE
T ss_pred hhhcc-ccceeeehhhhhcchHHHHHHHHHHHHHhCCCCCCeEEEE
Confidence 56788 99999998888999998 67777788888 998885
No 75
>COG3845 ABC-type uncharacterized transport systems, ATPase components [General function prediction only]
Probab=33.35 E-value=1.2e+02 Score=29.24 Aligned_cols=46 Identities=20% Similarity=0.453 Sum_probs=37.0
Q ss_pred CChhhcCCccEEEEcCCCCCCCHHHHHHHHHHH---HcCCcEEEEecCC
Q psy15702 2 ITPELLRAASLFICTGPREQFTITELNSLKTYL---DEGGSLLIALGEE 47 (259)
Q Consensus 2 lt~~ip~~advLVI~~P~~~fs~~E~~~L~~fl---~~GG~llv~~~~~ 47 (259)
+.-|+..+-++||++.|++-+.-...+.|.+-+ .+.|+.+++++..
T Consensus 414 laREl~~~p~lLI~~qPTrGLDvgA~~~I~~~l~e~r~~G~AVLLiS~d 462 (501)
T COG3845 414 LARELARRPDLLIAAQPTRGLDVGAIEFIHERLLELRDAGKAVLLISED 462 (501)
T ss_pred hhhhhccCCCEEEEcCCCccccHHHHHHHHHHHHHHHhcCCEEEEEehh
Confidence 445778899999999999999998888888877 4777777776654
No 76
>PRK14967 putative methyltransferase; Provisional
Probab=33.03 E-value=93 Score=26.21 Aligned_cols=37 Identities=24% Similarity=0.300 Sum_probs=23.1
Q ss_pred HHHHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHHhCcc
Q psy15702 27 LNSLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQYGVS 66 (259)
Q Consensus 27 ~~~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~~Gi~ 66 (259)
+..+.++|+.||++++....... ...+-.+++..|+.
T Consensus 142 l~~a~~~Lk~gG~l~~~~~~~~~---~~~~~~~l~~~g~~ 178 (223)
T PRK14967 142 CDAAPALLAPGGSLLLVQSELSG---VERTLTRLSEAGLD 178 (223)
T ss_pred HHHHHHhcCCCcEEEEEEecccC---HHHHHHHHHHCCCC
Confidence 34567889999998887555421 22344555555654
No 77
>PRK06067 flagellar accessory protein FlaH; Validated
Probab=32.98 E-value=1.1e+02 Score=25.86 Aligned_cols=64 Identities=9% Similarity=0.077 Sum_probs=43.0
Q ss_pred CChhhcCCccEEEEcCCCCCCCHHHHHHHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHHhCcccCC
Q psy15702 2 ITPELLRAASLFICTGPREQFTITELNSLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQYGVSVNS 69 (259)
Q Consensus 2 lt~~ip~~advLVI~~P~~~fs~~E~~~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~~Gi~~~~ 69 (259)
|...+|...-++|...|..-=|---.+.+...+++|++++++.-+.. ...+-.-++.+|+.+.+
T Consensus 18 l~gG~~~g~~~~i~G~~GsGKt~l~~~~~~~~~~~g~~~~y~~~e~~----~~~~~~~~~~~g~~~~~ 81 (234)
T PRK06067 18 LGGGIPFPSLILIEGDHGTGKSVLSQQFVYGALKQGKKVYVITTENT----SKSYLKQMESVKIDISD 81 (234)
T ss_pred hCCCCcCCcEEEEECCCCCChHHHHHHHHHHHHhCCCEEEEEEcCCC----HHHHHHHHHHCCCChhH
Confidence 45568888888888877655555556666778889999888766542 22344445678877653
No 78
>PTZ00146 fibrillarin; Provisional
Probab=32.98 E-value=1.4e+02 Score=26.89 Aligned_cols=59 Identities=12% Similarity=0.183 Sum_probs=34.9
Q ss_pred cCCccEEEEcCCCCCCCHHH--HHHHHHHHHcCCcEEEEecCCCCCCCCccH-------HHHHHHhCcccC
Q psy15702 7 LRAASLFICTGPREQFTITE--LNSLKTYLDEGGSLLIALGEEGERGSSTNI-------NFLLEQYGVSVN 68 (259)
Q Consensus 7 p~~advLVI~~P~~~fs~~E--~~~L~~fl~~GG~llv~~~~~~~~~~~~nl-------n~LL~~~Gi~~~ 68 (259)
++.+|+++.-..+ ..+.+ +..+..||+.||++++......-. ..... ..+|++.|++..
T Consensus 200 ~~~vDvV~~Dva~--pdq~~il~~na~r~LKpGG~~vI~ika~~id-~g~~pe~~f~~ev~~L~~~GF~~~ 267 (293)
T PTZ00146 200 VPMVDVIFADVAQ--PDQARIVALNAQYFLKNGGHFIISIKANCID-STAKPEVVFASEVQKLKKEGLKPK 267 (293)
T ss_pred cCCCCEEEEeCCC--cchHHHHHHHHHHhccCCCEEEEEEeccccc-cCCCHHHHHHHHHHHHHHcCCceE
Confidence 4568888775543 22222 345788999999999965543211 11111 255777787754
No 79
>COG1385 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=32.19 E-value=51 Score=28.81 Aligned_cols=24 Identities=21% Similarity=0.499 Sum_probs=14.8
Q ss_pred CccEEEEcCCCCCCCHHHHHHHHH
Q psy15702 9 AASLFICTGPREQFTITELNSLKT 32 (259)
Q Consensus 9 ~advLVI~~P~~~fs~~E~~~L~~ 32 (259)
...+++++||..=||++|++.+++
T Consensus 191 ~~~i~iiIGPEGGfs~~Ei~~l~~ 214 (246)
T COG1385 191 EGKVLLIIGPEGGFSEDEIELLRE 214 (246)
T ss_pred CCcEEEEECCCCCCCHHHHHHHHH
Confidence 455666666666666666665544
No 80
>PRK00170 azoreductase; Reviewed
Probab=31.98 E-value=48 Score=27.27 Aligned_cols=18 Identities=28% Similarity=0.436 Sum_probs=14.2
Q ss_pred cCCccEEEEcCCCCCCCH
Q psy15702 7 LRAASLFICTGPREQFTI 24 (259)
Q Consensus 7 p~~advLVI~~P~~~fs~ 24 (259)
...||.|||+.|-.-++-
T Consensus 84 i~~AD~iV~~sP~y~~~~ 101 (201)
T PRK00170 84 FLAADKIVIAAPMYNFSI 101 (201)
T ss_pred HHHCCEEEEeecccccCC
Confidence 567999999999765544
No 81
>PRK06703 flavodoxin; Provisional
Probab=31.54 E-value=1.2e+02 Score=23.69 Aligned_cols=64 Identities=16% Similarity=0.040 Sum_probs=37.7
Q ss_pred hcCCccEEEEcCCCCCC--CHHHHHHHHHHHHc---CCcEEEEecCCCCC-----CCCccHHHHHHHhCcccCC
Q psy15702 6 LLRAASLFICTGPREQF--TITELNSLKTYLDE---GGSLLIALGEEGER-----GSSTNINFLLEQYGVSVNS 69 (259)
Q Consensus 6 ip~~advLVI~~P~~~f--s~~E~~~L~~fl~~---GG~llv~~~~~~~~-----~~~~nln~LL~~~Gi~~~~ 69 (259)
.+.++|.|||+.|+... .+..+..+-+++++ .|+-+.+++.++.. .....+...|++.|..+-.
T Consensus 45 ~l~~~d~viigspt~~~g~~p~~~~~f~~~l~~~~l~~k~~~vfg~g~~~y~~~~~a~~~l~~~l~~~G~~~~~ 118 (151)
T PRK06703 45 ELLAYDGIILGSYTWGDGDLPYEAEDFHEDLENIDLSGKKVAVFGSGDTAYPLFCEAVTIFEERLVERGAELVQ 118 (151)
T ss_pred HHhcCCcEEEEECCCCCCcCcHHHHHHHHHHhcCCCCCCEEEEEccCCCChHHHHHHHHHHHHHHHHCCCEEcc
Confidence 46789999999998533 23445555555542 24444444443210 1123388899999997643
No 82
>PRK10530 pyridoxal phosphate (PLP) phosphatase; Provisional
Probab=31.46 E-value=1.1e+02 Score=26.23 Aligned_cols=45 Identities=16% Similarity=0.298 Sum_probs=35.1
Q ss_pred CCCCCCCHHHHHHHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHHhCcc
Q psy15702 17 GPREQFTITELNSLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQYGVS 66 (259)
Q Consensus 17 ~P~~~fs~~E~~~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~~Gi~ 66 (259)
.+...+++.-+++|+++.++|.++.+..+-. ...+..+++++|+.
T Consensus 16 ~~~~~i~~~~~~ai~~~~~~G~~~~iaTGR~-----~~~~~~~~~~l~~~ 60 (272)
T PRK10530 16 TPKKTILPESLEALARAREAGYKVIIVTGRH-----HVAIHPFYQALALD 60 (272)
T ss_pred CCCCccCHHHHHHHHHHHHCCCEEEEEcCCC-----hHHHHHHHHhcCCC
Confidence 4555799999999999999999888876554 24567788888765
No 83
>cd01018 ZntC Metal binding protein ZntC. These proteins are predicted to function as initial receptors in ABC transport of metal ions. They belong to the TroA superfamily of helical backbone metal receptor proteins that share a distinct fold and ligand binding mechanism. They are comprised of two globular subdomains connected by a long alpha helix and bind their specific ligands in the cleft between these domains. In addition, many of these proteins possess a metal-binding histidine-rich motif (repetitive HDH sequence).
Probab=31.27 E-value=1.4e+02 Score=26.03 Aligned_cols=48 Identities=13% Similarity=0.167 Sum_probs=39.7
Q ss_pred CCCCCCCHHHHHHHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHHhCccc
Q psy15702 17 GPREQFTITELNSLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQYGVSV 67 (259)
Q Consensus 17 ~P~~~fs~~E~~~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~~Gi~~ 67 (259)
.|...-|..++..|.+.+++.|--+++.++... ..-.+.+.++.|+.+
T Consensus 196 ~~~~eps~~~l~~l~~~ik~~~v~~if~e~~~~---~~~~~~la~~~g~~v 243 (266)
T cd01018 196 EEGKEPSPADLKRLIDLAKEKGVRVVFVQPQFS---TKSAEAIAREIGAKV 243 (266)
T ss_pred CCCCCCCHHHHHHHHHHHHHcCCCEEEEcCCCC---cHHHHHHHHHcCCeE
Confidence 477788999999999999988888888888753 456778889999865
No 84
>PRK00103 rRNA large subunit methyltransferase; Provisional
Probab=31.18 E-value=83 Score=25.50 Aligned_cols=42 Identities=29% Similarity=0.313 Sum_probs=31.2
Q ss_pred HHHHHHHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHHhCcc
Q psy15702 24 ITELNSLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQYGVS 66 (259)
Q Consensus 24 ~~E~~~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~~Gi~ 66 (259)
+.|-++|...+..|. .+|++|+.|....+..|..+++.|...
T Consensus 54 ~~E~~~il~~l~~~~-~~i~LDe~Gk~~sS~~fA~~l~~~~~~ 95 (157)
T PRK00103 54 AKEGERILAALPKGA-RVIALDERGKQLSSEEFAQELERWRDD 95 (157)
T ss_pred HHHHHHHHhhCCCCC-EEEEEcCCCCcCCHHHHHHHHHHHHhc
Confidence 567777888886555 677789988766777888889888433
No 85
>COG4019 Uncharacterized protein conserved in archaea [Function unknown]
Probab=31.12 E-value=49 Score=26.00 Aligned_cols=42 Identities=14% Similarity=0.312 Sum_probs=33.1
Q ss_pred CHHHHHHHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHHhCccc
Q psy15702 23 TITELNSLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQYGVSV 67 (259)
Q Consensus 23 s~~E~~~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~~Gi~~ 67 (259)
-+.|+++|++|+-+--++.+..+-.. .-.-+|..|..||..=
T Consensus 22 ~eeEve~ireyi~sA~r~vV~t~N~~---K~~aindvlrrf~l~E 63 (156)
T COG4019 22 KEEEVEKIREYIVSAKRIVVATNNQK---KFKAINDVLRRFCLAE 63 (156)
T ss_pred hHHHHHHHHHHHhccceEEEecCCHH---HHHHHHHHHHHhccch
Confidence 46789999999999999988755432 3467899999998754
No 86
>PRK11713 16S ribosomal RNA methyltransferase RsmE; Provisional
Probab=30.97 E-value=47 Score=28.57 Aligned_cols=22 Identities=23% Similarity=0.531 Sum_probs=19.4
Q ss_pred cEEEEcCCCCCCCHHHHHHHHH
Q psy15702 11 SLFICTGPREQFTITELNSLKT 32 (259)
Q Consensus 11 dvLVI~~P~~~fs~~E~~~L~~ 32 (259)
+++|++||.--||+.|++.+.+
T Consensus 179 ~i~viIGPEGGfs~~Ei~~l~~ 200 (234)
T PRK11713 179 KVLLLIGPEGGFSPEEIELLRE 200 (234)
T ss_pred eEEEEECCCCCCCHHHHHHHHH
Confidence 5889999999999999997664
No 87
>PF01297 TroA: Periplasmic solute binding protein family; InterPro: IPR006127 This is a family of ABC transporter metal-binding lipoproteins. An example is the periplasmic zinc-binding protein TroA P96116 from SWISSPROT that interacts with an ATP-binding cassette transport system in Treponema pallidum and plays a role in the transport of zinc across the cytoplasmic membrane. Related proteins are found in both Gram-positive and Gram-negative bacteria. ; GO: 0046872 metal ion binding, 0030001 metal ion transport; PDB: 2PS9_A 2PS0_A 2OSV_A 2OGW_A 2PS3_A 2PRS_B 3MFQ_C 3GI1_B 2OV3_A 1PQ4_A ....
Probab=30.72 E-value=95 Score=26.69 Aligned_cols=49 Identities=22% Similarity=0.188 Sum_probs=37.0
Q ss_pred cCCCCCCCHHHHHHHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHHhCccc
Q psy15702 16 TGPREQFTITELNSLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQYGVSV 67 (259)
Q Consensus 16 ~~P~~~fs~~E~~~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~~Gi~~ 67 (259)
.+|....|..++..|.+.++++|--+++.++... ..-.+.|.++.|+.+
T Consensus 177 ~~~~~~ps~~~l~~l~~~ik~~~v~~i~~e~~~~---~~~~~~la~~~g~~v 225 (256)
T PF01297_consen 177 ISPGEEPSPKDLAELIKLIKENKVKCIFTEPQFS---SKLAEALAKETGVKV 225 (256)
T ss_dssp SSSSSSS-HHHHHHHHHHHHHTT-SEEEEETTS----THHHHHHHHCCT-EE
T ss_pred cccccCCCHHHHHHHHHHhhhcCCcEEEecCCCC---hHHHHHHHHHcCCcE
Confidence 3677889999999999999988876777777653 456788888999887
No 88
>PRK09328 N5-glutamine S-adenosyl-L-methionine-dependent methyltransferase; Provisional
Probab=30.64 E-value=99 Score=26.65 Aligned_cols=35 Identities=29% Similarity=0.520 Sum_probs=25.8
Q ss_pred HHHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHHhCcc
Q psy15702 28 NSLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQYGVS 66 (259)
Q Consensus 28 ~~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~~Gi~ 66 (259)
..+.++|+.||.+++-.+... ...+..++++.|..
T Consensus 222 ~~~~~~Lk~gG~l~~e~g~~~----~~~~~~~l~~~gf~ 256 (275)
T PRK09328 222 EQAPRYLKPGGWLLLEIGYDQ----GEAVRALLAAAGFA 256 (275)
T ss_pred HHHHHhcccCCEEEEEECchH----HHHHHHHHHhCCCc
Confidence 344488999999998766543 35688899888864
No 89
>TIGR02716 C20_methyl_CrtF C-20 methyltransferase BchU. Members of this protein family are the S-adenosylmethionine-depenedent C-20 methyltransferase BchU, part of the pathway of bacteriochlorophyll c production in photosynthetic green sulfur bacteria. The position modified by this enzyme represents the difference between bacteriochlorophylls c and d; strains lacking this protein can only produced bacteriochlorophyll d.
Probab=30.64 E-value=69 Score=28.46 Aligned_cols=35 Identities=17% Similarity=0.068 Sum_probs=26.5
Q ss_pred CccEEEEcCCCCCCCHHH----HHHHHHHHHcCCcEEEE
Q psy15702 9 AASLFICTGPREQFTITE----LNSLKTYLDEGGSLLIA 43 (259)
Q Consensus 9 ~advLVI~~P~~~fs~~E----~~~L~~fl~~GG~llv~ 43 (259)
.+|++++..--..+++++ ++.+.+-|+.||++++.
T Consensus 215 ~~D~v~~~~~lh~~~~~~~~~il~~~~~~L~pgG~l~i~ 253 (306)
T TIGR02716 215 EADAVLFCRILYSANEQLSTIMCKKAFDAMRSGGRLLIL 253 (306)
T ss_pred CCCEEEeEhhhhcCChHHHHHHHHHHHHhcCCCCEEEEE
Confidence 478888777555666654 66778888999999886
No 90
>TIGR03704 PrmC_rel_meth putative protein-(glutamine-N5) methyltransferase, unknown substrate-specific. This protein family is closely related to two different families of protein-(glutamine-N5) methyltransferase. The first is PrmB, which modifies ribosomal protein L3 in some bacteria. The second is PrmC (HemK), which modifies peptide chain release factors 1 and 2 in most bacteria and also in eukaryotes. The glutamine side chain-binding motif NPPY shared by PrmB and PrmC is N[VAT]PY in this family. The protein substrate is unknown.
Probab=30.40 E-value=1.1e+02 Score=26.62 Aligned_cols=35 Identities=20% Similarity=0.289 Sum_probs=25.4
Q ss_pred HHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHHhCccc
Q psy15702 29 SLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQYGVSV 67 (259)
Q Consensus 29 ~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~~Gi~~ 67 (259)
...++|+.||.+++-.+... ...+-.+++++|+..
T Consensus 201 ~a~~~L~~gG~l~l~~~~~~----~~~v~~~l~~~g~~~ 235 (251)
T TIGR03704 201 GAPDWLAPGGHLLVETSERQ----APLAVEAFARAGLIA 235 (251)
T ss_pred HHHHhcCCCCEEEEEECcch----HHHHHHHHHHCCCCc
Confidence 33477889999998877653 356778888888654
No 91
>COG1236 YSH1 Predicted exonuclease of the beta-lactamase fold involved in RNA processing [Translation, ribosomal structure and biogenesis]
Probab=30.15 E-value=72 Score=30.21 Aligned_cols=57 Identities=23% Similarity=0.409 Sum_probs=38.8
Q ss_pred hcCCccEEEEcC-------CCCCCCHHH-HHHHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHHhCc
Q psy15702 6 LLRAASLFICTG-------PREQFTITE-LNSLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQYGV 65 (259)
Q Consensus 6 ip~~advLVI~~-------P~~~fs~~E-~~~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~~Gi 65 (259)
.+..+|+||+-+ |..+..+.+ ++.+++.+.+||++++=...-+ +.-.+--+|.++|.
T Consensus 177 ~~~~~DvLI~EsTYg~~~~~~r~~~e~~f~~~v~~~l~~GG~vlipafa~g---raQEll~~L~~~~~ 241 (427)
T COG1236 177 LPPCIDVLIVESTYGDRLHPNRDEVERRFIESVKAALERGGTVLIPAFALG---RAQELLLILRELGF 241 (427)
T ss_pred cCCCCcEEEEecccCCccCCCHHHHHHHHHHHHHHHHhCCCEEEEeccccc---HHHHHHHHHHHHhc
Confidence 344459888743 334444444 6799999999999999777665 23346666777775
No 92
>PRK15126 thiamin pyrimidine pyrophosphate hydrolase; Provisional
Probab=30.05 E-value=1e+02 Score=26.69 Aligned_cols=45 Identities=16% Similarity=0.252 Sum_probs=35.4
Q ss_pred CCCCCCCHHHHHHHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHHhCcc
Q psy15702 17 GPREQFTITELNSLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQYGVS 66 (259)
Q Consensus 17 ~P~~~fs~~E~~~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~~Gi~ 66 (259)
.+...+++.-+++|++..++|.++.+..+-. ...+..+++++|+.
T Consensus 15 ~~~~~i~~~~~~ai~~l~~~G~~~~iaTGR~-----~~~~~~~~~~l~~~ 59 (272)
T PRK15126 15 MPDHHLGEKTLSTLARLRERDITLTFATGRH-----VLEMQHILGALSLD 59 (272)
T ss_pred CCCCcCCHHHHHHHHHHHHCCCEEEEECCCC-----HHHHHHHHHHcCCC
Confidence 4556799999999999999998888875543 34677888888875
No 93
>COG3845 ABC-type uncharacterized transport systems, ATPase components [General function prediction only]
Probab=30.01 E-value=1.1e+02 Score=29.55 Aligned_cols=37 Identities=27% Similarity=0.443 Sum_probs=27.7
Q ss_pred cCCccEEEEcCCCCCCCHHHHHHH----HHHHHcCCcEEEE
Q psy15702 7 LRAASLFICTGPREQFTITELNSL----KTYLDEGGSLLIA 43 (259)
Q Consensus 7 p~~advLVI~~P~~~fs~~E~~~L----~~fl~~GG~llv~ 43 (259)
-.++++||+=-|+.-+|+.|++.| +++-+.|..+++.
T Consensus 156 yr~a~iLILDEPTaVLTP~E~~~lf~~l~~l~~~G~tIi~I 196 (501)
T COG3845 156 YRGARLLILDEPTAVLTPQEADELFEILRRLAAEGKTIIFI 196 (501)
T ss_pred hcCCCEEEEcCCcccCCHHHHHHHHHHHHHHHHCCCEEEEE
Confidence 478999999999999999996654 4444555555554
No 94
>PRK09489 rsmC 16S ribosomal RNA m2G1207 methyltransferase; Provisional
Probab=29.90 E-value=70 Score=29.31 Aligned_cols=56 Identities=14% Similarity=0.198 Sum_probs=37.5
Q ss_pred hcCCccEEEEcCCCCC-CCHHHHHHHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHHhC
Q psy15702 6 LLRAASLFICTGPREQ-FTITELNSLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQYG 64 (259)
Q Consensus 6 ip~~advLVI~~P~~~-fs~~E~~~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~~G 64 (259)
.+.++|++|+.-|+.+ ..+.-+..+..-|..||.+++.-.-... -..+..+++.|+
T Consensus 73 ~~~~~d~~~~~~pk~k~~~~~~l~~~~~~l~~g~~i~~~G~~~~g---~~s~~k~~~~~~ 129 (342)
T PRK09489 73 DVADCDTLIYYWPKNKQEAQFQLMNLLSLLPVGTDIFVVGENRSG---VRSAEKMLADYA 129 (342)
T ss_pred cCCCCCEEEEECCCCHHHHHHHHHHHHHhCCCCCEEEEEEecccc---HHHHHHHHHHhc
Confidence 3468999999999853 2223355566666779998887444332 245778888887
No 95
>cd03238 ABC_UvrA The excision repair protein UvrA; Nucleotide excision repair in eubacteria is a process that repairs DNA damage by the removal of a 12-13-mer oligonucleotide containing the lesion. Recognition and cleavage of the damaged DNA is a multistep ATP-dependent reaction that requires the UvrA, UvrB, and UvrC proteins. Both UvrA and UvrB are ATPases, with UvrA having two ATP binding sites, which have the characteristic signature of the family of ABC proteins, and UvrB having one ATP binding site that is structurally related to that of helicases.
Probab=29.75 E-value=1.4e+02 Score=24.53 Aligned_cols=41 Identities=20% Similarity=0.367 Sum_probs=29.8
Q ss_pred cCC--ccEEEEcCCCCCCCHHHHHHHHHHHHc---CCcEEEEecCC
Q psy15702 7 LRA--ASLFICTGPREQFTITELNSLKTYLDE---GGSLLIALGEE 47 (259)
Q Consensus 7 p~~--advLVI~~P~~~fs~~E~~~L~~fl~~---GG~llv~~~~~ 47 (259)
..+ ++++++=-|...+.....+.+.+++++ .|+.++++.+.
T Consensus 103 ~~~~~p~llLlDEPt~~LD~~~~~~l~~~l~~~~~~g~tvIivSH~ 148 (176)
T cd03238 103 FSEPPGTLFILDEPSTGLHQQDINQLLEVIKGLIDLGNTVILIEHN 148 (176)
T ss_pred hhCCCCCEEEEeCCcccCCHHHHHHHHHHHHHHHhCCCEEEEEeCC
Confidence 345 889999999999999888877777754 45555555543
No 96
>PF08351 DUF1726: Domain of unknown function (DUF1726); InterPro: IPR013562 This entry represents a protein of unknown function and is found towards the N terminus of putative ATPases (IPR007807 from INTERPRO). ; PDB: 2ZPA_B.
Probab=29.53 E-value=41 Score=24.66 Aligned_cols=40 Identities=25% Similarity=0.302 Sum_probs=25.4
Q ss_pred CCccEEEEcCCCCCCCHHHHHHHHHHHHcCCcEEEEecCCC
Q psy15702 8 RAASLFICTGPREQFTITELNSLKTYLDEGGSLLIALGEEG 48 (259)
Q Consensus 8 ~~advLVI~~P~~~fs~~E~~~L~~fl~~GG~llv~~~~~~ 48 (259)
..++++|+-. ...|+++.+.++-.=|+.||=+++++.+-.
T Consensus 10 ~e~~~~i~d~-~~g~~pnal~a~~gtv~gGGllill~p~~~ 49 (92)
T PF08351_consen 10 QEFDLLIFDA-FEGFDPNALAALAGTVRGGGLLILLLPPWE 49 (92)
T ss_dssp --BSSEEEE--SS---HHHHHHHHTTB-TT-EEEEEES-GG
T ss_pred CccCEEEEEc-cCCCCHHHHHHHhcceecCeEEEEEcCCHH
Confidence 4567787776 457999999999999988888888877743
No 97
>PF09862 DUF2089: Protein of unknown function (DUF2089); InterPro: IPR018658 This family consists of various hypothetical prokaryotic proteins.
Probab=29.31 E-value=92 Score=23.92 Aligned_cols=33 Identities=15% Similarity=0.396 Sum_probs=25.2
Q ss_pred CCCCHHHHHHHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHHhCcccC
Q psy15702 20 EQFTITELNSLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQYGVSVN 68 (259)
Q Consensus 20 ~~fs~~E~~~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~~Gi~~~ 68 (259)
..+++++++=|+.|+...|+ |.++-++|||++.
T Consensus 32 ~~L~~E~~~Fi~~Fi~~rGn----------------lKe~e~~lgiSYP 64 (113)
T PF09862_consen 32 ARLSPEQLEFIKLFIKNRGN----------------LKEMEKELGISYP 64 (113)
T ss_pred hcCCHHHHHHHHHHHHhcCC----------------HHHHHHHHCCCcH
Confidence 46899999999999998876 5566666666653
No 98
>TIGR01752 flav_long flavodoxin, long chain. Flavodoxins are small redox-active proteins with a flavin mononucleotide (FMN) prosthetic group. They can act in nitrogen fixation by nitrogenase, in sulfite reduction, and light-dependent NADP+ reduction in during photosynthesis, among other roles. This model describes the long chain type, typical for nitrogen fixation but associated with pyruvate formate-lyase activation and cobalamin-dependent methionine synthase activity in E. coli.
Probab=29.24 E-value=1.2e+02 Score=24.35 Aligned_cols=63 Identities=16% Similarity=0.137 Sum_probs=34.5
Q ss_pred hcCCccEEEEcCCCCC---CCHHHHHHHHHHHH---cCCcEEEEecCCCCCCC-------CccHHHHHHHhCcccCC
Q psy15702 6 LLRAASLFICTGPREQ---FTITELNSLKTYLD---EGGSLLIALGEEGERGS-------STNINFLLEQYGVSVNS 69 (259)
Q Consensus 6 ip~~advLVI~~P~~~---fs~~E~~~L~~fl~---~GG~llv~~~~~~~~~~-------~~nln~LL~~~Gi~~~~ 69 (259)
-+.++|+|||+.|+.. +. ..+...-+.+. -.|+-+.+++-++.... ...+..+|++.|..+-.
T Consensus 41 ~l~~~d~ii~gspty~~g~~p-~~~~~fl~~l~~~~l~gk~v~~fg~g~~~~y~~~f~~a~~~l~~~l~~~G~~~ig 116 (167)
T TIGR01752 41 DLNAYDKLILGTPTWGVGELQ-EDWEDFLPTLEELDFTGKTVALFGLGDQEGYSETFCDGMGILYDKIKARGAKVVG 116 (167)
T ss_pred HHhhCCEEEEEecCCCCCcCc-HHHHHHHHHhhcCCCCCCEEEEEecCCCCcccHHHHHHHHHHHHHHHHcCCeEEc
Confidence 3568899999999852 33 23332222221 23554445554321101 24677888888887643
No 99
>PF04452 Methyltrans_RNA: RNA methyltransferase; InterPro: IPR006700 Methyltransferases (Mtases) are responsible for the transfer of methyl groups between two molecules. The transfer of the methyl group from the ubiquitous S-adenosyl-L-methionine (AdoMet) to either nitrogen, oxygen or carbon atoms is frequently employed in diverse organisms. The reaction is catalyzed by Mtases and modifies DNA, RNA, proteins or small molecules, such as catechol, for regulatory purposes. Proteins in this entry belong to the RsmE family of Mtases, this is supported by crystal structural studying, which show a close structural homology to other known methyltransferases []. This entry contains RsmE of Escherichia coli, which specifically methylates the uridine in position 1498 of 16S rRNA in the fully assembled 30S ribosomal subunit [, ].; GO: 0008168 methyltransferase activity, 0006364 rRNA processing; PDB: 1NXZ_B 1VHY_B 2EGW_A 2EGV_A 2Z0Y_A 2CX8_A 3KW2_A 1VHK_D 1Z85_B 1V6Z_A ....
Probab=28.68 E-value=50 Score=28.14 Aligned_cols=22 Identities=36% Similarity=0.533 Sum_probs=15.6
Q ss_pred cEEEEcCCCCCCCHHHHHHHHH
Q psy15702 11 SLFICTGPREQFTITELNSLKT 32 (259)
Q Consensus 11 dvLVI~~P~~~fs~~E~~~L~~ 32 (259)
++.|++||.--||++|++.+++
T Consensus 176 ~i~i~IGPEGGfs~~E~~~~~~ 197 (225)
T PF04452_consen 176 SIAIIIGPEGGFSEEEIEFLKE 197 (225)
T ss_dssp EEEEEE--TT---HHHHHHHHH
T ss_pred cEEEEECCCCCCCHHHHHHHHH
Confidence 8999999999999999998876
No 100
>PRK10976 putative hydrolase; Provisional
Probab=28.56 E-value=1.2e+02 Score=25.96 Aligned_cols=45 Identities=13% Similarity=0.273 Sum_probs=35.6
Q ss_pred CCCCCCCHHHHHHHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHHhCcc
Q psy15702 17 GPREQFTITELNSLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQYGVS 66 (259)
Q Consensus 17 ~P~~~fs~~E~~~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~~Gi~ 66 (259)
.+...+++.-.++|++..++|.++.+..+-. ...+..+++.+|+.
T Consensus 15 ~~~~~is~~~~~ai~~l~~~G~~~~iaTGR~-----~~~~~~~~~~l~~~ 59 (266)
T PRK10976 15 SPDHTLSPYAKETLKLLTARGIHFVFATGRH-----HVDVGQIRDNLEIK 59 (266)
T ss_pred CCCCcCCHHHHHHHHHHHHCCCEEEEEcCCC-----hHHHHHHHHhcCCC
Confidence 4455699999999999999999988886654 34577788888865
No 101
>TIGR00046 RNA methyltransferase, RsmE family. Members of this protein family, previously called conserved hypothetical protein TIGR00046, include the YggJ protein of E. coli, which has now been shown to methylate U1498 in 16S rRNA.
Probab=28.48 E-value=64 Score=27.87 Aligned_cols=22 Identities=32% Similarity=0.560 Sum_probs=18.5
Q ss_pred cEEEEcCCCCCCCHHHHHHHHH
Q psy15702 11 SLFICTGPREQFTITELNSLKT 32 (259)
Q Consensus 11 dvLVI~~P~~~fs~~E~~~L~~ 32 (259)
++++++||.--||+.|++.+.+
T Consensus 186 ~v~~~IGPEGGfs~~Ei~~~~~ 207 (240)
T TIGR00046 186 NIVIIIGPEGGFSEKEIQLLKE 207 (240)
T ss_pred cEEEEECCCCCCCHHHHHHHHH
Confidence 5888999999999999887664
No 102
>TIGR00313 cobQ cobyric acid synthase CobQ.
Probab=28.32 E-value=58 Score=31.28 Aligned_cols=39 Identities=15% Similarity=0.239 Sum_probs=28.2
Q ss_pred hhcCCccEEEEcCCCCCCCHHH-------HHHHHHHHHcCCcEEEEe
Q psy15702 5 ELLRAASLFICTGPREQFTITE-------LNSLKTYLDEGGSLLIAL 44 (259)
Q Consensus 5 ~ip~~advLVI~~P~~~fs~~E-------~~~L~~fl~~GG~llv~~ 44 (259)
++| +||.|||.|-.....+.+ .++|++|+++|+.++-..
T Consensus 281 ~l~-~~d~lilpGg~~~~~~~~~l~~~~~~~~i~~~~~~G~pvlgiC 326 (475)
T TIGR00313 281 SLT-GCDAVIIPGSKSTIADLYALKQSGFAEEILDFAKEGGIVIGIC 326 (475)
T ss_pred ccc-cCCEEEECCcchHHHHHHHHHhcChHHHHHHHHHcCCcEEEEc
Confidence 444 899999999874333322 578999999988876643
No 103
>TIGR03567 FMN_reduc_SsuE FMN reductase, SsuE family. Members of this protein family use NAD(P)H to reduce FMN and regenerate FMNH2. Members include the homodimeric, NAD(P)H-dependent enzyme SsuE from Escherichia coli, which serves as a partner to an FMNH2-dependent alkanesulfonate monooxygenase. It is induced by sulfate starvation. The NADH-dependent enzyme MsuE from Pseudomonas aeruginosa is outside the scope of this model (see model TIGR03566).
Probab=28.01 E-value=75 Score=25.69 Aligned_cols=64 Identities=14% Similarity=0.186 Sum_probs=35.7
Q ss_pred cCCccEEEEcCCCCCCCHH-HHHHHHHHHH---cCCcEEEEecCCCCCCC----CccHHHHHHHhCcccCCc
Q psy15702 7 LRAASLFICTGPREQFTIT-ELNSLKTYLD---EGGSLLIALGEEGERGS----STNINFLLEQYGVSVNSD 70 (259)
Q Consensus 7 p~~advLVI~~P~~~fs~~-E~~~L~~fl~---~GG~llv~~~~~~~~~~----~~nln~LL~~~Gi~~~~~ 70 (259)
..+||.+||+.|...++-. -++..-+++. =.||...++..++.... ...+..+|..+|+.....
T Consensus 63 i~~AD~iI~~sP~Y~~sip~~LK~~iD~~~~~~l~~K~v~~~~~gg~~~~~~~~~~~l~~~l~~l~~~~~~~ 134 (171)
T TIGR03567 63 VAQADGVVVATPVYKASYSGVLKALLDLLPQRALRGKVVLPIATGGSIAHLLAIDYALKPVLSALGARHILP 134 (171)
T ss_pred HHHCCEEEEECCcccCCCCHHHHHHHHhCChhhhCCCEEEEEEcCCchhHHHHHHHHHHHHHHHcCCccccc
Confidence 4579999999998655543 2333333332 12454443333332111 135788999999953333
No 104
>PRK00377 cbiT cobalt-precorrin-6Y C(15)-methyltransferase; Provisional
Probab=27.63 E-value=87 Score=25.83 Aligned_cols=56 Identities=18% Similarity=0.152 Sum_probs=34.7
Q ss_pred CCccEEEEcCCCCCCCHHHHHHHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHHhCccc
Q psy15702 8 RAASLFICTGPREQFTITELNSLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQYGVSV 67 (259)
Q Consensus 8 ~~advLVI~~P~~~fs~~E~~~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~~Gi~~ 67 (259)
..+|.+++.+... -...-++.+.+.|+.||++++-.-.. + ....+-.+|++.|...
T Consensus 110 ~~~D~V~~~~~~~-~~~~~l~~~~~~LkpgG~lv~~~~~~-~--~~~~~~~~l~~~g~~~ 165 (198)
T PRK00377 110 EKFDRIFIGGGSE-KLKEIISASWEIIKKGGRIVIDAILL-E--TVNNALSALENIGFNL 165 (198)
T ss_pred CCCCEEEECCCcc-cHHHHHHHHHHHcCCCcEEEEEeecH-H--HHHHHHHHHHHcCCCe
Confidence 5688888765432 22445778888899999987632211 1 1234556677778643
No 105
>PF00919 UPF0004: Uncharacterized protein family UPF0004; InterPro: IPR013848 The methylthiotransferase (MTTase) or miaB-like family is named after the (dimethylallyl)adenosine tRNA MTTase miaB protein, which catalyses a C-H to C-S bond conversion in the methylthiolation of tRNA. A related bacterial enzyme rimO performs a similar methylthiolation, but on a protein substrate. RimO acts on the ribosomal protein S12 and forms a separate MTTase subfamily. The miaB-subfamily includes mammalian CDK5 regulatory subunit-associated proteins and similar proteins in other eukaryotes. Two other subfamilies, yqeV and CDKAL1, are named after a Bacillus subtilis and a human protein, respectively. While yqeV-like proteins are found in bacteria, CDKAL1 subfamily members occur in eukaryotes and in archaebacteria. The likely MTTases from these 4 subfamilies contain an N-terminal MTTase domain, a central radical generating fold and a C-terminal TRAM domain (see PDOC50926 from PROSITEDOC). The core forms a radical SAM fold (or AdoMet radical), containing a cysteine motif CxxxCxxC that binds a [4Fe-4S] cluster [, , ]. A reducing equivalent from the [4Fe-4S]+ cluster is used to cleave S-adenosylmethionine (SAM) to generate methionine and a 5'-deoxyadenosyl radical. The latter is thought to produce a reactive substrate radical that is amenable to sulphur insertion [, ]. The N-terminal MTTase domain contains 3 cysteines that bind a second [4Fe-4S] cluster, in addition to the radical-generating [4Fe-4S] cluster, which could be involved in the thiolation reaction. The C-terminal TRAM domain is not shared with other radical SAM proteins outside the MTTase family. The TRAM domain can bind to RNA substrate and seems to be important for substrate recognition. The tertiary structure of the central radical SAM fold has six beta/alpha motifs resembling a three-quarter TIM barrel core (see PDOC00155 from PROSITEDOC) []. The N-terminal MTTase domain might form an additional [beta/alpha]2 TIM barrel unit []. ; GO: 0003824 catalytic activity, 0051539 4 iron, 4 sulfur cluster binding, 0009451 RNA modification
Probab=27.61 E-value=2.5e+02 Score=20.63 Aligned_cols=55 Identities=11% Similarity=0.064 Sum_probs=33.6
Q ss_pred hcCCccEEEEcCCCCCCCHH--HHHHHHHHHHcC--CcEEEEecCCCCCCCCccHHHHHHHhC
Q psy15702 6 LLRAASLFICTGPREQFTIT--ELNSLKTYLDEG--GSLLIALGEEGERGSSTNINFLLEQYG 64 (259)
Q Consensus 6 ip~~advLVI~~P~~~fs~~--E~~~L~~fl~~G--G~llv~~~~~~~~~~~~nln~LL~~~G 64 (259)
-+++||++||..=.-.-+.+ -+..|+++.+.+ +..+++.+-..+ ..-+.+.+.++
T Consensus 33 ~~e~AD~iiiNTC~V~~~Ae~k~~~~i~~l~~~~~~~~~ivv~GC~aq----~~~~~l~~~~p 91 (98)
T PF00919_consen 33 DPEEADVIIINTCTVRESAEQKSRNRIRKLKKLKKPGAKIVVTGCMAQ----RYGEELKKEFP 91 (98)
T ss_pred ccccCCEEEEEcCCCCcHHHHHHHHHHHHHHHhcCCCCEEEEEeCccc----cChHHHHhhCC
Confidence 46899999999886322221 256666776654 556666676643 23355666554
No 106
>PRK04017 hypothetical protein; Provisional
Probab=27.47 E-value=1.1e+02 Score=24.10 Aligned_cols=55 Identities=11% Similarity=0.254 Sum_probs=35.0
Q ss_pred CccEEEEcCCCCCCCHHHHHHHHHHHHcCCcEEEEecCCCCC-CCCccHHHHHHHhCcccCC
Q psy15702 9 AASLFICTGPREQFTITELNSLKTYLDEGGSLLIALGEEGER-GSSTNINFLLEQYGVSVNS 69 (259)
Q Consensus 9 ~advLVI~~P~~~fs~~E~~~L~~fl~~GG~llv~~~~~~~~-~~~~nln~LL~~~Gi~~~~ 69 (259)
++++++..+- .+++.. +.+ ..++..++++.|+.... .-...+...+..+|+.++.
T Consensus 43 ~~~iI~t~g~--~~~~~~-e~i---a~~~r~VIILTD~D~~GekIr~~l~~~l~~~G~~vd~ 98 (132)
T PRK04017 43 EGEIIKVSRT--PLAEIA-ELI---ASRGKEVIILTDFDRKGEELAKKLSEYLQGYGIKVDT 98 (132)
T ss_pred CccEEEECCe--ecchHH-HHH---HhcCCeEEEEECCCcchHHHHHHHHHHHHhCCCCccH
Confidence 4555555543 466666 555 55777899999987531 1234566777788888864
No 107
>cd03130 GATase1_CobB Type 1 glutamine amidotransferase (GATase1) domain found in Cobyrinic Acid a,c-Diamide Synthase. Type 1 glutamine amidotransferase (GATase1) domain found in Cobyrinic Acid a,c-Diamide Synthase. CobB plays a role in cobalamin biosythesis catalyzing the conversion of cobyrinic acid to cobyrinic acid a,c-diamide. CobB belongs to the triad family of amidotransferases. Two of the three residues of the catalytic triad that are involved in glutamine binding, hydrolysis and transfer of the resulting ammonia to the acceptor substrate in other triad aminodotransferases are conserved in CobB.
Probab=27.25 E-value=93 Score=25.91 Aligned_cols=39 Identities=18% Similarity=0.344 Sum_probs=25.2
Q ss_pred cCCccEEEEcCCCC-----CCCHH--HHHHHHHHHHcCCcEEEEec
Q psy15702 7 LRAASLFICTGPRE-----QFTIT--ELNSLKTYLDEGGSLLIALG 45 (259)
Q Consensus 7 p~~advLVI~~P~~-----~fs~~--E~~~L~~fl~~GG~llv~~~ 45 (259)
+.++|.|||.|-.. .+... -.+.|++|+++|+.++-...
T Consensus 38 l~~~D~lilPGG~~~~~~~~L~~~~~~~~~i~~~~~~g~pilgICg 83 (198)
T cd03130 38 LPDADGLYLGGGYPELFAEELSANQSMRESIRAFAESGGPIYAECG 83 (198)
T ss_pred CCCCCEEEECCCchHHHHHHHHhhHHHHHHHHHHHHcCCCEEEEcc
Confidence 34599999999632 12111 25778899988887665543
No 108
>PRK10444 UMP phosphatase; Provisional
Probab=27.18 E-value=1.1e+02 Score=26.50 Aligned_cols=64 Identities=13% Similarity=0.100 Sum_probs=44.8
Q ss_pred CCccEEEEcCCCCCCCHHHHHHHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHHhCcccCCceeeec
Q psy15702 8 RAASLFICTGPREQFTITELNSLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQYGVSVNSDCVVRC 75 (259)
Q Consensus 8 ~~advLVI~~P~~~fs~~E~~~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~~Gi~~~~~~V~d~ 75 (259)
-|+|-.++-+- ...+...++|+..-++|.++.++.+.... ....+..-|+..|+.+..+.++-+
T Consensus 6 ~DlDGtL~~~~--~~~p~a~~~l~~L~~~g~~~~~~Tn~~~~--~~~~~~~~l~~~G~~~~~~~i~ts 69 (248)
T PRK10444 6 CDIDGVLMHDN--VAVPGAAEFLHRILDKGLPLVLLTNYPSQ--TGQDLANRFATAGVDVPDSVFYTS 69 (248)
T ss_pred EeCCCceEeCC--eeCccHHHHHHHHHHCCCeEEEEeCCCCC--CHHHHHHHHHHcCCCCCHhhEecH
Confidence 35555555554 35667778888777888888888777643 234566777889999988877643
No 109
>TIGR03801 asp_4_decarbox aspartate 4-decarboxylase. This enzyme, aspartate 4-decarboxylase (EC 4.1.1.12), removes the side-chain carboxylate from L-aspartate, converting it to L-alanine plus carbon dioxide. It is a PLP-dependent enzyme, homologous to aspartate aminotransferase (EC 2.6.1.1).
Probab=26.84 E-value=1.4e+02 Score=29.15 Aligned_cols=41 Identities=7% Similarity=0.156 Sum_probs=32.6
Q ss_pred CCccEEEEcCCCCC----CCHHHHHHHHHHHHc-CCcEEEEecCCC
Q psy15702 8 RAASLFICTGPREQ----FTITELNSLKTYLDE-GGSLLIALGEEG 48 (259)
Q Consensus 8 ~~advLVI~~P~~~----fs~~E~~~L~~fl~~-GG~llv~~~~~~ 48 (259)
+++.+++|..|..+ ++.++++.|.++.++ ..+++|+.|+..
T Consensus 239 ~~~kai~l~nP~NPTG~vls~e~l~~I~~ia~~~~~~l~II~DEvY 284 (521)
T TIGR03801 239 PSIKALFVVNPSNPPSVAMSDESIEKIVDIVANDRPDLMILTDDVY 284 (521)
T ss_pred CCCcEEEEeCCCCCCCCCCCHHHHHHHHHHHHhcCCCeEEEECCCc
Confidence 46788888889765 888999999999876 457888888754
No 110
>TIGR00099 Cof-subfamily Cof subfamily of IIB subfamily of haloacid dehalogenase superfamily. The members of this subfamily are restricted almost exclusively to bacteria (one sequences from S. pombe scores above trusted, while another is between trusted and noise). It is notable that no archaea are found in this group, the closest relations to the archaea found here being two Deinococcus sequences.
Probab=26.71 E-value=1.4e+02 Score=25.43 Aligned_cols=44 Identities=14% Similarity=0.310 Sum_probs=34.7
Q ss_pred CCCCCCHHHHHHHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHHhCcc
Q psy15702 18 PREQFTITELNSLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQYGVS 66 (259)
Q Consensus 18 P~~~fs~~E~~~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~~Gi~ 66 (259)
+...+++...++|++..++|.++.+..+-. ...+..+++++|+.
T Consensus 13 ~~~~i~~~~~~~i~~l~~~G~~~~iaTGR~-----~~~~~~~~~~~~~~ 56 (256)
T TIGR00099 13 DDHTISPSTKEALAKLREKGIKVVLATGRP-----YKEVKNILKELGLD 56 (256)
T ss_pred CCCccCHHHHHHHHHHHHCCCeEEEEeCCC-----HHHHHHHHHHcCCC
Confidence 445689999999999999988888876654 34677888888765
No 111
>PRK06411 NADH dehydrogenase subunit B; Validated
Probab=26.56 E-value=86 Score=26.22 Aligned_cols=39 Identities=26% Similarity=0.332 Sum_probs=28.7
Q ss_pred hhcCCccEEEEcCCCCCCCHHHHHHHHHHHH--cCCcEEEEecC
Q psy15702 5 ELLRAASLFICTGPREQFTITELNSLKTYLD--EGGSLLIALGE 46 (259)
Q Consensus 5 ~ip~~advLVI~~P~~~fs~~E~~~L~~fl~--~GG~llv~~~~ 46 (259)
..|++||+++|.|+- +..+..+|++-.+ ..-+.+|+++.
T Consensus 70 ~sPr~aDvllV~G~v---t~~~~~~l~~~~e~mp~pk~VIA~Ga 110 (183)
T PRK06411 70 ASPRQADLMIVAGTL---TNKMAPALRRLYDQMPEPKWVISMGS 110 (183)
T ss_pred CCCCceeEEEEEeCC---CccchHHHHHHHHHcCcCCeEEEEec
Confidence 679999999999996 3445566665554 47788888775
No 112
>cd06228 Peptidase_M14-like_3 A functionally uncharacterized subgroup of the M14 family of metallocarboxypeptidases (MCPs). The M14 family are zinc-binding carboxypeptidases (CPs) which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. Two major subfamilies of the M14 family, defined based on sequence and structural homology, are the A/B and N/E subfamilies. Enzymes belonging to the A/B subfamily are normally synthesized as inactive precursors containing preceding signal peptide, followed by an N-terminal pro-region linked to the enzyme; these proenzymes are called procarboxypeptidases. The A/B enzymes can be further divided based on their substrate specificity; Carboxypeptidase A-like (CPA-like) enzymes favor hydrophobic residues while carboxypeptidase B-like (CPB-like) enzymes only cleave the basic residues lysine or arginine. The A forms have slightly differe
Probab=26.50 E-value=76 Score=29.10 Aligned_cols=46 Identities=13% Similarity=0.068 Sum_probs=32.6
Q ss_pred EEcCCCCCCCHHHHHHHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHHhCcc
Q psy15702 14 ICTGPREQFTITELNSLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQYGVS 66 (259)
Q Consensus 14 VI~~P~~~fs~~E~~~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~~Gi~ 66 (259)
...||. +||+.|.++|++|+++.+++-+.++-+. --+-++-.||-+
T Consensus 157 ~Y~G~~-pfSEPET~av~~~~~~~~~~~~yls~HS------ygq~il~P~g~~ 202 (332)
T cd06228 157 TFHGTA-AFSEPETRNVVWVMDTFKNIRWFVDLHS------YGGDVLYSWGDD 202 (332)
T ss_pred ccCCCC-CCccHHHHHHHHHHhccCCeEEEEEecc------CCceEEecccCC
Confidence 344555 7999999999999999887777777653 123445555555
No 113
>KOG1970|consensus
Probab=26.40 E-value=1.1e+02 Score=30.25 Aligned_cols=41 Identities=24% Similarity=0.459 Sum_probs=34.0
Q ss_pred HHHHHHHHH--------cCCcEEEEecCCCCCCCCccHHHHHHHhCcccC
Q psy15702 27 LNSLKTYLD--------EGGSLLIALGEEGERGSSTNINFLLEQYGVSVN 68 (259)
Q Consensus 27 ~~~L~~fl~--------~GG~llv~~~~~~~~~~~~nln~LL~~~Gi~~~ 68 (259)
+..+++|++ -|+++|++.+|.|-. ..+.+.-|..++|+++.
T Consensus 91 I~eVk~WL~~~~~~~~~l~~~iLLltGPsGcG-KSTtvkvLskelg~~~~ 139 (634)
T KOG1970|consen 91 ISEVKQWLKQVAEFTPKLGSRILLLTGPSGCG-KSTTVKVLSKELGYQLI 139 (634)
T ss_pred HHHHHHHHHHHHHhccCCCceEEEEeCCCCCC-chhHHHHHHHhhCceee
Confidence 667888887 689999999997643 56789999999999985
No 114
>PLN00125 Succinyl-CoA ligase [GDP-forming] subunit alpha
Probab=26.09 E-value=1.4e+02 Score=26.97 Aligned_cols=56 Identities=11% Similarity=0.040 Sum_probs=38.6
Q ss_pred hhcCC--ccEEEEcCCCCCCCHHHHHHHHHHHHcCCcEEEEecCCCCCCCCccHHHH----HHHhCccc
Q psy15702 5 ELLRA--ASLFICTGPREQFTITELNSLKTYLDEGGSLLIALGEEGERGSSTNINFL----LEQYGVSV 67 (259)
Q Consensus 5 ~ip~~--advLVI~~P~~~fs~~E~~~L~~fl~~GG~llv~~~~~~~~~~~~nln~L----L~~~Gi~~ 67 (259)
++|+. .|+.||.-|. +.=..++++-.++|=+.++.+..+... ..+..+ .+++|+++
T Consensus 63 dlp~~~~~DlAvI~vPa----~~v~~al~e~~~~Gvk~~vIisaGf~e---~g~~~~~~~~ar~~girv 124 (300)
T PLN00125 63 EAKAETKANASVIYVPP----PFAAAAILEAMEAELDLVVCITEGIPQ---HDMVRVKAALNRQSKTRL 124 (300)
T ss_pred HHhhccCCCEEEEecCH----HHHHHHHHHHHHcCCCEEEEECCCCCc---ccHHHHHHHHHhhcCCEE
Confidence 56765 7999999983 455788999999998877776665421 223333 45677765
No 115
>cd01653 GATase1 Type 1 glutamine amidotransferase (GATase1)-like domain. Type 1 glutamine amidotransferase (GATase1)-like domain. This group includes proteins similar to Class I glutamine amidotransferases, the intracellular PH1704 from Pyrococcus horikoshii, the C-terminal of the large catalase: Escherichia coli HP-II, Sinorhizobium meliloti Rm1021 ThuA. and, the A4 beta-galactosidase middle domain. The majority of proteins in this group have a reactive Cys found in the sharp turn between a beta strand and an alpha helix termed the nucleophile elbow. For Class I glutamine amidotransferases proteins which transfer ammonia from the amide side chain of glutamine to an acceptor substrate, this Cys forms a Cys-His-Glu catalytic triad in the active site. Glutamine amidotransferases activity can be found in a range of biosynthetic enzymes included in this cd: glutamine amidotransferase, formylglycinamide ribonucleotide, GMP synthetase, anthranilate synthase component II, glutamine-depende
Probab=25.53 E-value=1.7e+02 Score=19.92 Aligned_cols=38 Identities=18% Similarity=0.041 Sum_probs=24.2
Q ss_pred hcCCccEEEEcCCCCCC-----CHHHHHHHHHHHHcCCcEEEE
Q psy15702 6 LLRAASLFICTGPREQF-----TITELNSLKTYLDEGGSLLIA 43 (259)
Q Consensus 6 ip~~advLVI~~P~~~f-----s~~E~~~L~~fl~~GG~llv~ 43 (259)
.+++.|++||.+....- ....++.++++..+|+.++-.
T Consensus 43 ~~~~~d~lii~g~~~~~~~~~~~~~~~~~i~~~~~~~~~i~~~ 85 (115)
T cd01653 43 DLDDYDGLILPGGPGTPDDLARDEALLALLREAAAAGKPILGI 85 (115)
T ss_pred ChhccCEEEECCCCCchhhhccCHHHHHHHHHHHHcCCEEEEE
Confidence 46789999999875322 244566666666665555543
No 116
>PF13847 Methyltransf_31: Methyltransferase domain; PDB: 3T0I_B 3SVZ_B 3SXJ_A 3F4K_A 3GU3_B 2GH1_A 1R8Y_E 1R8X_B 2B3T_A 1T43_A ....
Probab=25.34 E-value=46 Score=26.05 Aligned_cols=39 Identities=26% Similarity=0.399 Sum_probs=31.4
Q ss_pred CCccEEEEcCCCCCCCHHH--HHHHHHHHHcCCcEEEEecC
Q psy15702 8 RAASLFICTGPREQFTITE--LNSLKTYLDEGGSLLIALGE 46 (259)
Q Consensus 8 ~~advLVI~~P~~~fs~~E--~~~L~~fl~~GG~llv~~~~ 46 (259)
+..|+++..++-..+.+.+ ++.+.++++.||.+++..-.
T Consensus 72 ~~~D~I~~~~~l~~~~~~~~~l~~~~~~lk~~G~~i~~~~~ 112 (152)
T PF13847_consen 72 EKFDIIISNGVLHHFPDPEKVLKNIIRLLKPGGILIISDPN 112 (152)
T ss_dssp TTEEEEEEESTGGGTSHHHHHHHHHHHHEEEEEEEEEEEEE
T ss_pred CCeeEEEEcCchhhccCHHHHHHHHHHHcCCCcEEEEEECC
Confidence 6899999998876677664 88999999999998776433
No 117
>cd01750 GATase1_CobQ Type 1 glutamine amidotransferase (GATase1) domain found in Cobyric Acid Synthase (CobQ). Type 1 glutamine amidotransferase (GATase1) domain found in Cobyric Acid Synthase (CobQ). CobQ plays a role in cobalamin biosythesis. CobQ catalyses amidations at positions B, D, E, and G on adenosylcobyrinic A,C-diamide in the biosynthesis of cobalamin. CobQ belongs to the triad family of amidotransferases. Two of the three residues of the catalytic triad that are involved in glutamine binding, hydrolysis and transfer of the resulting ammonia to the acceptor substrate in other triad aminodotransferases are conserved in CobQ.
Probab=25.14 E-value=93 Score=25.79 Aligned_cols=36 Identities=28% Similarity=0.415 Sum_probs=25.6
Q ss_pred cCCccEEEEcCCCCCCCH--------HHHHHHHHHHHcCCcEEEE
Q psy15702 7 LRAASLFICTGPREQFTI--------TELNSLKTYLDEGGSLLIA 43 (259)
Q Consensus 7 p~~advLVI~~P~~~fs~--------~E~~~L~~fl~~GG~llv~ 43 (259)
++++|.|||.|... +.+ ...+.|++|+++|+-++-.
T Consensus 35 ~~~~d~lilpGg~~-~~~~~~~~~~~~~~~~i~~~~~~g~pvlgi 78 (194)
T cd01750 35 LGDADLIILPGSKD-TIQDLAWLRKRGLAEAIKNYARAGGPVLGI 78 (194)
T ss_pred CCCCCEEEECCCcc-hHHHHHHHHHcCHHHHHHHHHHCCCcEEEE
Confidence 67899999999862 221 2456788998887776553
No 118
>TIGR01689 EcbF-BcbF capsule biosynthesis phosphatase. Due to the likelihood that the substrates of these enzymes are different depending on the nature of the particular polysaccharides associated with each species, this model has been classified as a subfamily despite the close homology.
Probab=24.99 E-value=1.8e+02 Score=22.59 Aligned_cols=47 Identities=17% Similarity=0.168 Sum_probs=32.3
Q ss_pred CCHHHHHHHHHHHHcCCcEEEEecCCCCCC----------CCccHHHHHHHhCcccC
Q psy15702 22 FTITELNSLKTYLDEGGSLLIALGEEGERG----------SSTNINFLLEQYGVSVN 68 (259)
Q Consensus 22 fs~~E~~~L~~fl~~GG~llv~~~~~~~~~----------~~~nln~LL~~~Gi~~~ 68 (259)
..+.-+++|+++-++|-.++++.+-..... ..+-...+|++||+.++
T Consensus 25 ~~~~~ie~L~~l~~~G~~IiiaTGR~~~~~~~n~~~i~~~~~~~t~~wL~k~~ipYd 81 (126)
T TIGR01689 25 PILAVIEKLRHYKALGFEIVISSSRNMRTYEGNVGKINIHTLPIIILWLNQHNVPYD 81 (126)
T ss_pred cCHHHHHHHHHHHHCCCEEEEECCCCchhhhccccccchhhHHHHHHHHHHcCCCCc
Confidence 456778888888888888777765533210 01256789999999984
No 119
>cd01017 AdcA Metal binding protein AcdA. These proteins have been shown to function in the ABC uptake of Zn2+ and Mn2+ and in competence for genetic transformation and adhesion. The AcdA proteins belong to the TroA superfamily of helical backbone metal receptor proteins that share a distinct fold and ligand binding mechanism. They are comprised of two globular subdomains connected by a long alpha helix and they bind their ligand in the cleft between these domains. In addition, many of these proteins have a low complexity region containing metal binding histidine-rich motif (repetitive HDH sequence).
Probab=24.72 E-value=1.5e+02 Score=26.11 Aligned_cols=48 Identities=17% Similarity=0.246 Sum_probs=39.9
Q ss_pred CCCCCCCHHHHHHHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHHhCccc
Q psy15702 17 GPREQFTITELNSLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQYGVSV 67 (259)
Q Consensus 17 ~P~~~fs~~E~~~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~~Gi~~ 67 (259)
.|...-+..++..|.+.++++|--.++.++... ....+.+.++.|+.+
T Consensus 199 ~~~~eps~~~l~~l~~~ik~~~v~~if~e~~~~---~~~~~~la~~~g~~v 246 (282)
T cd01017 199 SPEVEPSPKQLAELVEFVKKSDVKYIFFEENAS---SKIAETLAKETGAKL 246 (282)
T ss_pred CCCCCCCHHHHHHHHHHHHHcCCCEEEEeCCCC---hHHHHHHHHHcCCcE
Confidence 466778999999999999988888888888753 456788888999876
No 120
>cd03110 Fer4_NifH_child This protein family's function is unkown. It contains nucleotide binding site. It uses NTP as energy source to transfer electron or ion.
Probab=24.31 E-value=2.4e+02 Score=22.46 Aligned_cols=11 Identities=45% Similarity=0.776 Sum_probs=5.3
Q ss_pred CCccEEEEcCC
Q psy15702 8 RAASLFICTGP 18 (259)
Q Consensus 8 ~~advLVI~~P 18 (259)
+++|.+||=.|
T Consensus 91 ~~~d~viiDtp 101 (179)
T cd03110 91 EGAELIIIDGP 101 (179)
T ss_pred cCCCEEEEECc
Confidence 34455555444
No 121
>PF01963 TraB: TraB family; InterPro: IPR002816 In prokaryotes, for example Enterococcus faecalis (Streptococcus faecalis), the conjugative transfer of certain plasmids is controlled by peptide pheromones []. Plasmid free recipient cells secret plasmid specific oligopeptides, termed sex pheromones. They induce bacterial clumping and specifically activate the conjugative transfer of the corresponding plasmid. Once recipient cells acquire the plasmid they start to produce a pheromone inhibitor to block the activity of the pheromone and to prevent plasmid containing cells from clumping; they also become donor cells able to transfer the plasmid to plasmid free recipient cells. Examples of such plasmid-pheromone systems are bacteriocin plasmid pPD1 [], haemolysin/bacteriocin plasmid, pAD1 [], tetracycline-resistance plasmid, pCF10 [], and the haemolysin/bacteriocin plasmid, pOB1 []. TraB in combination with another factor contributes to pheromone shutdown in cells that have acquired a plasmid. It exact function has not yet been determined [, ]. This entry also contains plant and mammalian proteins, suggesting that these Trab-related proteins may have a somewhat wider or different function in eukaryotes.
Probab=24.20 E-value=1.2e+02 Score=25.84 Aligned_cols=42 Identities=21% Similarity=0.378 Sum_probs=33.1
Q ss_pred HHHHHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHHhCcccC
Q psy15702 26 ELNSLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQYGVSVN 68 (259)
Q Consensus 26 E~~~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~~Gi~~~ 68 (259)
=..+|.++++++|+++++++.+.=. -..++-.+|++-|.++.
T Consensus 216 ~~~~i~~~l~~~~~~fvvVGa~HL~-G~~gvl~lLr~~Gy~V~ 257 (259)
T PF01963_consen 216 WAEKIEELLKEGGTVFVVVGAGHLP-GEDGVLDLLRKKGYTVE 257 (259)
T ss_pred HHHHHHHHHhcCCCEEEEEcchhcc-chhhHHHHHHhCCceee
Confidence 3688999999999999999985311 13578899999998875
No 122
>TIGR00080 pimt protein-L-isoaspartate(D-aspartate) O-methyltransferase. Among the prokaryotes, the gene name is pcm. Among eukaryotes, pimt.
Probab=24.00 E-value=1.2e+02 Score=25.43 Aligned_cols=35 Identities=26% Similarity=0.409 Sum_probs=25.8
Q ss_pred CCccEEEEcCCCCCCCHHHHHHHHHHHHcCCcEEEEecC
Q psy15702 8 RAASLFICTGPREQFTITELNSLKTYLDEGGSLLIALGE 46 (259)
Q Consensus 8 ~~advLVI~~P~~~fs~~E~~~L~~fl~~GG~llv~~~~ 46 (259)
...|++++.++...+. +.+.+.|+.||++++.+..
T Consensus 145 ~~fD~Ii~~~~~~~~~----~~~~~~L~~gG~lv~~~~~ 179 (215)
T TIGR00080 145 APYDRIYVTAAGPKIP----EALIDQLKEGGILVMPVGE 179 (215)
T ss_pred CCCCEEEEcCCccccc----HHHHHhcCcCcEEEEEEcC
Confidence 4689888877655443 4577889999998887654
No 123
>cd03870 M14_CPA Peptidase M14 Carboxypeptidase (CP) A (CPA) belongs to the A/B subfamily of the M14 family of metallocarboxypeptidases (MCPs). The M14 family are zinc-binding CPs which hydrolyze single, C-terminal amino acids from polypeptide chains, and have a recognition site for the free C-terminal carboxyl group, which is a key determinant of specificity. CPA enzymes generally favor hydrophobic residues. A/B subfamily enzymes are normally synthesized as inactive precursors containing preceding signal peptide, followed by a globular N-terminal pro-region linked to the enzyme; these proenzymes are called procarboxypeptidases. The procarboxypeptidase A (PCPA) is produced by the exocrine pancreas and stored as a stable zymogen in the pancreatic granules until secretion into the digestive tract occurs. This subfamily includes CPA1, CPA2 and CPA4 forms. Within these A forms, there are slightly different specificities, with CPA1 preferring aliphatic and small aromatic residues, and CPA2 p
Probab=23.99 E-value=90 Score=28.09 Aligned_cols=31 Identities=16% Similarity=0.308 Sum_probs=24.5
Q ss_pred cCCCCCCCHHHHHHHHHHHHcCCcEEEEecCC
Q psy15702 16 TGPREQFTITELNSLKTYLDEGGSLLIALGEE 47 (259)
Q Consensus 16 ~~P~~~fs~~E~~~L~~fl~~GG~llv~~~~~ 47 (259)
.||. +||+.|-++|++|++..+++..+++-+
T Consensus 160 ~G~~-pfSEpEt~av~~~~~~~~~~~~~l~lH 190 (301)
T cd03870 160 HGPY-ANSEVEVKSIVDFVKSHGNFKAFISIH 190 (301)
T ss_pred CCCC-CCccHHHHHHHHHHhhCCCeEEEEEec
Confidence 4555 799999999999999877776666654
No 124
>cd01137 PsaA Metal binding protein PsaA. These proteins have been shown to function as initial receptors in ABC transport of Mn2+ and as surface adhesins in some eubacterial species. They belong to the TroA superfamily of periplasmic metal binding proteins that share a distinct fold and ligand binding mechanism. A typical TroA protein is comprised of two globular subdomains connected by a single helix and can bind the metal ion in the cleft between these domains. In addition, these proteins sometimes have a low complexity region containing a metal-binding histidine-rich motif (repetitive HDH sequence).
Probab=23.87 E-value=1.5e+02 Score=26.27 Aligned_cols=48 Identities=8% Similarity=0.085 Sum_probs=39.7
Q ss_pred CCCCCCCHHHHHHHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHHhCccc
Q psy15702 17 GPREQFTITELNSLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQYGVSV 67 (259)
Q Consensus 17 ~P~~~fs~~E~~~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~~Gi~~ 67 (259)
+|...-|..++..|.+.+++.|--+++.++... ...++.+.++.|+.+
T Consensus 205 ~~~~eps~~~l~~l~~~ik~~~v~~if~e~~~~---~~~~~~ia~~~gv~v 252 (287)
T cd01137 205 NTEEEGTPKQVATLIEQVKKEKVPAVFVESTVN---DRLMKQVAKETGAKI 252 (287)
T ss_pred CCCCCCCHHHHHHHHHHHHHhCCCEEEEeCCCC---hHHHHHHHHHhCCcc
Confidence 466778999999999999988777777788753 456889999999986
No 125
>PF09488 Osmo_MPGsynth: Mannosyl-3-phosphoglycerate synthase (osmo_MPGsynth); InterPro: IPR012812 This family consists of examples of mannosyl-3-phosphoglycerate synthase (MPGS), which together with mannosyl-3-phosphoglycerate phosphatase (MPGP), comprises a two-step pathway for mannosylglycerate biosynthesis. Mannosylglycerate is a compatible solute that tends to be restricted to extreme thermophiles of archaea and bacteria. Note that in Rhodothermus marinus (Rhodothermus obamensis), this pathway is one of two; the other is condensation of GDP-mannose with D-glycerate by mannosylglycerate synthase.; GO: 0050504 mannosyl-3-phosphoglycerate synthase activity, 0051479 mannosylglycerate biosynthetic process, 0005737 cytoplasm; PDB: 2WVM_A 2WVL_A 2WVK_A 2ZU7_B 2ZU9_B 2ZU8_A.
Probab=23.74 E-value=1.3e+02 Score=27.87 Aligned_cols=57 Identities=9% Similarity=0.153 Sum_probs=39.2
Q ss_pred hhcCCccEEEEcCCCC---CCCHHHHHHHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHHhCcc
Q psy15702 5 ELLRAASLFICTGPRE---QFTITELNSLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQYGVS 66 (259)
Q Consensus 5 ~ip~~advLVI~~P~~---~fs~~E~~~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~~Gi~ 66 (259)
.||.+|-++|+-..++ +--..|.+.|++|-.-..+-++++... ++.+..-+.+.|..
T Consensus 72 gIPh~C~IIvVSNS~r~~~d~f~~E~d~l~~f~~~t~r~~~~vHQk-----Dp~lA~Af~~aGy~ 131 (381)
T PF09488_consen 72 GIPHDCLIIVVSNSSREPVDRFKMEVDLLKHFCRLTRRQIIIVHQK-----DPGLAEAFKEAGYP 131 (381)
T ss_dssp CS-TTSEEEEEE---CSSSCHHHHHHHHHHHHHHHCT--EEEEETT------HHHHHHHHHTT--
T ss_pred cCCCCCeEEEEECCCCCCccHHHHHHHHHHHHHHhhcCceEEEecC-----CHHHHHHHHHcCcH
Confidence 5899999999887776 566789999999999998877777664 46677777777765
No 126
>TIGR00268 conserved hypothetical protein TIGR00268. The N-terminal region of the model shows similarity to Argininosuccinate synthase proteins using PSI-blast and using the recognize protein identification server.
Probab=23.73 E-value=79 Score=27.43 Aligned_cols=37 Identities=24% Similarity=0.337 Sum_probs=27.6
Q ss_pred HHHHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHHhCcc
Q psy15702 27 LNSLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQYGVS 66 (259)
Q Consensus 27 ~~~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~~Gi~ 66 (259)
+++|+++++++++++++++.|-+ ++-+-.++.+.|..
T Consensus 2 ~~~l~~~l~~~~~vlVa~SGGvD---Ss~ll~la~~~g~~ 38 (252)
T TIGR00268 2 LENLRNFLKEFKKVLIAYSGGVD---SSLLAAVCSDAGTE 38 (252)
T ss_pred hHHHHHHHHhcCCEEEEecCcHH---HHHHHHHHHHhCCC
Confidence 46899999999999999998753 44555666666543
No 127
>cd01748 GATase1_IGP_Synthase Type 1 glutamine amidotransferase (GATase1) domain found in imidazole glycerol phosphate synthase (IGPS). Type 1 glutamine amidotransferase (GATase1) domain found in imidazole glycerol phosphate synthase (IGPS). IGPS incorporates ammonia derived from glutamine into N1-[(5'-phosphoribulosyl)-formimino]-5-aminoimidazole-4-carboxamide ribonucleotide (PRFAR) to form 5'-(5-aminoimidazole-4-carboxamide) ribonucleotide (AICAR) and imidazole glycerol phosphate (IGP). The glutamine amidotransferase domain generates the ammonia nucleophile which is channeled from the glutaminase active site to the PRFAR active site. IGPS belong to the triad family of amidotransferases having a conserved Cys-His-Glu catalytic triad in the glutaminase active site.
Probab=23.56 E-value=83 Score=25.98 Aligned_cols=37 Identities=22% Similarity=0.230 Sum_probs=25.8
Q ss_pred hcCCccEEEEcCCCCCCCH--------HHHHHHHHHHHcCCcEEEE
Q psy15702 6 LLRAASLFICTGPREQFTI--------TELNSLKTYLDEGGSLLIA 43 (259)
Q Consensus 6 ip~~advLVI~~P~~~fs~--------~E~~~L~~fl~~GG~llv~ 43 (259)
.++++|.+||.||.. .++ ...+.|+++.++|..+|-.
T Consensus 33 ~l~~~d~iiipG~~~-~~~~~~~~~~~~~~~~i~~~~~~~~pilGi 77 (198)
T cd01748 33 EILSADKLILPGVGA-FGDAMANLRERGLIEALKEAIASGKPFLGI 77 (198)
T ss_pred HhccCCEEEECCCCc-HHHHHHHHHHcChHHHHHHHHHCCCcEEEE
Confidence 357899999999853 222 2367888888887766544
No 128
>TIGR03877 thermo_KaiC_1 KaiC domain protein, Ph0284 family. Members of this family contain a single copy of the KaiC domain (pfam06745) that occurs in two copies of the circadian clock protein kinase KaiC itself. Members occur primarily in thermophilic archaea and in Thermotoga.
Probab=23.45 E-value=1.9e+02 Score=24.61 Aligned_cols=62 Identities=18% Similarity=0.186 Sum_probs=41.5
Q ss_pred CChhhcCCccEEEEcCCCCCCCHHHHHHHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHHhCccc
Q psy15702 2 ITPELLRAASLFICTGPREQFTITELNSLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQYGVSV 67 (259)
Q Consensus 2 lt~~ip~~advLVI~~P~~~fs~~E~~~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~~Gi~~ 67 (259)
+...+|.+.-+||.+.|-.-=|.--.+-+.+.+++|.+++++.-+.. ...+-.-++.+|+.+
T Consensus 14 l~GG~~~gs~~lI~G~pGsGKT~la~~~l~~~~~~ge~~lyvs~ee~----~~~i~~~~~~~g~~~ 75 (237)
T TIGR03877 14 LHGGIPERNVVLLSGGPGTGKSIFSQQFLWNGLQMGEPGIYVALEEH----PVQVRRNMAQFGWDV 75 (237)
T ss_pred hcCCCcCCeEEEEEcCCCCCHHHHHHHHHHHHHHcCCcEEEEEeeCC----HHHHHHHHHHhCCCH
Confidence 45678888888877777654444455667788888888877765432 234555566888765
No 129
>COG2242 CobL Precorrin-6B methylase 2 [Coenzyme metabolism]
Probab=23.41 E-value=1.2e+02 Score=25.41 Aligned_cols=53 Identities=19% Similarity=0.105 Sum_probs=37.3
Q ss_pred CccEEEEcCCCCCCCHHHHHHHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHHhCcc
Q psy15702 9 AASLFICTGPREQFTITELNSLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQYGVS 66 (259)
Q Consensus 9 ~advLVI~~P~~~fs~~E~~~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~~Gi~ 66 (259)
+.|.++|.|-.. -+.=+++..++|+.||++++-.---. ....+-..++++|..
T Consensus 102 ~~daiFIGGg~~--i~~ile~~~~~l~~ggrlV~naitlE---~~~~a~~~~~~~g~~ 154 (187)
T COG2242 102 SPDAIFIGGGGN--IEEILEAAWERLKPGGRLVANAITLE---TLAKALEALEQLGGR 154 (187)
T ss_pred CCCEEEECCCCC--HHHHHHHHHHHcCcCCeEEEEeecHH---HHHHHHHHHHHcCCc
Confidence 689999998842 44558899999999999887544321 122344667788885
No 130
>PRK07366 succinyldiaminopimelate transaminase; Validated
Probab=23.28 E-value=1.7e+02 Score=26.70 Aligned_cols=30 Identities=10% Similarity=0.216 Sum_probs=14.9
Q ss_pred CccEEEEcCCCCC----CCHHHHHHHHHHHHcCC
Q psy15702 9 AASLFICTGPREQ----FTITELNSLKTYLDEGG 38 (259)
Q Consensus 9 ~advLVI~~P~~~----fs~~E~~~L~~fl~~GG 38 (259)
++++++|..|..+ ++.++++.|-++.++-|
T Consensus 165 ~~k~i~l~~p~NPTG~~~s~~~~~~l~~~a~~~~ 198 (388)
T PRK07366 165 QARLMVLSYPHNPTTAIAPLSFFQEAVAFCQQHD 198 (388)
T ss_pred cceEEEEeCCCCCCCccCCHHHHHHHHHHHHHcC
Confidence 3445555554432 45555555555555443
No 131
>TIGR03687 pupylate_cterm ubiquitin-like protein Pup. Members of this protein family are Pup, a small protein whose ligation to target proteins steers them toward degradation. This protein family occurs in a number of bacteria, especially Actinobacteria such as Mycobacterium tuberculosis, that possess an archeal-type proteasome. All members of this protein family known during model construction end with the C-terminal motif [FY][VI]QKGG[QE]. Ligation is thought to occur between the C-terminal COOH of Pup and an epsilon-amino group of a Lys on the target protein. The N-terminal half of this protein is poorly conserved and not represented in the seed alignment.
Probab=23.23 E-value=52 Score=19.35 Aligned_cols=12 Identities=17% Similarity=0.581 Sum_probs=9.4
Q ss_pred HHHHHHHHHcCC
Q psy15702 27 LNSLKTYLDEGG 38 (259)
Q Consensus 27 ~~~L~~fl~~GG 38 (259)
-+-++.|+++||
T Consensus 21 e~FV~~fVQKGG 32 (33)
T TIGR03687 21 EEFVRGFVQKGG 32 (33)
T ss_pred HHHHHHHHHccC
Confidence 345788999998
No 132
>PF14317 YcxB: YcxB-like protein
Probab=23.19 E-value=34 Score=21.95 Aligned_cols=18 Identities=22% Similarity=0.612 Sum_probs=16.1
Q ss_pred CCCCCCHHHHHHHHHHHH
Q psy15702 18 PREQFTITELNSLKTYLD 35 (259)
Q Consensus 18 P~~~fs~~E~~~L~~fl~ 35 (259)
|++.|+++|.+.+.+|++
T Consensus 45 Pk~~f~~~e~~~f~~~lk 62 (62)
T PF14317_consen 45 PKRAFSEEEKEEFREFLK 62 (62)
T ss_pred EHHHCCHhHHHHHHHHhC
Confidence 888999999999999874
No 133
>TIGR01487 SPP-like sucrose-phosphate phosphatase-like hydrolase, Archaeal. TIGR01482, in turn, is a member of the IIB subfamily (TIGR01484) of the Haloacid Dehalogenase (HAD) superfamily of aspartate-nucleophile hydrolases.
Probab=22.97 E-value=2.1e+02 Score=23.64 Aligned_cols=45 Identities=13% Similarity=0.187 Sum_probs=35.4
Q ss_pred CCCCCCCHHHHHHHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHHhCcc
Q psy15702 17 GPREQFTITELNSLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQYGVS 66 (259)
Q Consensus 17 ~P~~~fs~~E~~~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~~Gi~ 66 (259)
.+...+++.-.++|++.-++|.++.+..+-. ...+..+++.+|+.
T Consensus 14 ~~~~~i~~~~~~~i~~l~~~g~~~~~~TGR~-----~~~~~~~~~~l~~~ 58 (215)
T TIGR01487 14 EPNRMISERAIEAIRKAEKKGIPVSLVTGNT-----VPFARALAVLIGTS 58 (215)
T ss_pred CCCcccCHHHHHHHHHHHHCCCEEEEEcCCc-----chhHHHHHHHhCCC
Confidence 4556799999999999999998888876654 34677888888765
No 134
>PRK05928 hemD uroporphyrinogen-III synthase; Reviewed
Probab=22.93 E-value=74 Score=26.74 Aligned_cols=53 Identities=25% Similarity=0.254 Sum_probs=35.0
Q ss_pred CCCCCCHHHHHHHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHHhCcccCCceeee
Q psy15702 18 PREQFTITELNSLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQYGVSVNSDCVVR 74 (259)
Q Consensus 18 P~~~fs~~E~~~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~~Gi~~~~~~V~d 74 (259)
|...-+++=+..|.+....|+++++.-+..+ ...+...|++.|..+..=.+.+
T Consensus 106 ~~~~~~~~l~~~l~~~~~~~~~ili~~~~~~----~~~l~~~L~~~G~~v~~~~~Y~ 158 (249)
T PRK05928 106 PEDGESSELLLELPELLLKGKRVLYLRGNGG----REVLGDTLEERGAEVDECEVYE 158 (249)
T ss_pred CCCCcChHHHHhChhhhcCCCEEEEECCCCC----HHHHHHHHHHCCCEEeEEEEEE
Confidence 4333334446666666457889999876654 4578899999999876544443
No 135
>CHL00023 ndhK NADH dehydrogenase subunit K
Probab=22.68 E-value=1.7e+02 Score=25.31 Aligned_cols=41 Identities=22% Similarity=0.195 Sum_probs=29.8
Q ss_pred hhcCCccEEEEcCCCCCCCHHHHHHHHHHHHcCCcEEEEecC
Q psy15702 5 ELLRAASLFICTGPREQFTITELNSLKTYLDEGGSLLIALGE 46 (259)
Q Consensus 5 ~ip~~advLVI~~P~~~fs~~E~~~L~~fl~~GG~llv~~~~ 46 (259)
..|+.||++||.||-+.-....+..+.+= +..-+.+|+++.
T Consensus 67 aSPRhADvliVtG~VT~km~~~L~rlyeq-mPePK~VIA~Ga 107 (225)
T CHL00023 67 SSPRQADLILTAGTVTMKMAPSLVRLYEQ-MPEPKYVIAMGA 107 (225)
T ss_pred CCcccceEEEEecCCccccHHHHHHHHHh-cCCCCeEEEEcc
Confidence 46999999999999765555555555443 466778888775
No 136
>PF01965 DJ-1_PfpI: DJ-1/PfpI family; InterPro: IPR002818 This signature defines a diverse group of protein families which include proteins involved in RNA-protein interaction regulation, thiamine biosynthesis, Ras-related signal transduction, and those with protease activity. Examples of annotation are: Catalase A, 1.11.1.6 from EC Catalase II ES-1 DJ-1 RNA-binding protein, regulatory subunit [] protease I FGAM I, 6.3.5.3 from EC Putative/Intracellular protease Chaperone hchA Transcriptional regulator, AraC family THiJ/PfpI family [,] ; PDB: 2FEX_C 1OY1_B 1VHQ_B 3UK7_B 1OI4_A 1QVW_B 1QVV_D 1QVZ_A 1RW7_A 1G2I_C ....
Probab=22.66 E-value=1.1e+02 Score=23.82 Aligned_cols=41 Identities=17% Similarity=0.232 Sum_probs=24.9
Q ss_pred hcCCccEEEEcCCCC---CCC-H-HHH-HHHHHHHHcCCcEEEEecCC
Q psy15702 6 LLRAASLFICTGPRE---QFT-I-TEL-NSLKTYLDEGGSLLIALGEE 47 (259)
Q Consensus 6 ip~~advLVI~~P~~---~fs-~-~E~-~~L~~fl~~GG~llv~~~~~ 47 (259)
.+++.|+|||.|-.. .|. + .++ ..|++|.++ |+.+.++..+
T Consensus 34 ~~~~yDalilpGG~~~~~~l~~~~~~l~~~~~~~~~~-~k~iaaIC~g 80 (147)
T PF01965_consen 34 DPSDYDALILPGGHGGADDLRTDSKDLLELLKEFYEA-GKPIAAICHG 80 (147)
T ss_dssp TGGGESEEEEE-BTHHHHHHTTCHHHHHHHHHHHHHT-T-EEEEETTC
T ss_pred ChhhCCEEEECCCCchhhhHhhHHHHHHHHHHHHHHc-CCeEEecCCC
Confidence 456899999998853 455 2 444 445555545 7777776664
No 137
>PF01408 GFO_IDH_MocA: Oxidoreductase family, NAD-binding Rossmann fold; InterPro: IPR000683 This group of enzymes utilise NADP or NAD, and is known as the GFO/IDH/MOCA family in UniProtKB/Swiss-Prot. GFO is a glucose--fructose oxidoreductase, which converts D-glucose and D-fructose into D-gluconolactone and D-glucitol in the sorbitol-gluconate pathway. MOCA is a rhizopine catabolism protein which may catalyse the NADH-dependent dehydrogenase reaction involved in rhizopine catabolism. Other proteins belonging to this family include Gal80, a negative regulator for the expression of lactose and galactose metabolic genes; and several hypothetical proteins from yeast, Escherichia coli and Bacillus subtilis. The oxidoreductase, N-terminal domain is almost always associated with the oxidoreductase, C-terminal domain (see IPR004104 from INTERPRO).; GO: 0016491 oxidoreductase activity; PDB: 1LC0_A 1LC3_A 1GCU_A 3IP3_E 3CEA_C 3EVN_A 3NTQ_A 3NTR_B 3NT5_A 3MZ0_A ....
Probab=22.40 E-value=1.3e+02 Score=22.06 Aligned_cols=54 Identities=17% Similarity=0.175 Sum_probs=34.5
Q ss_pred CccEEEEcCCCCCCCHHHHHHHHHHHHcCCcEEEEecCCCC-CCCCccHHHHHHHhCccc
Q psy15702 9 AASLFICTGPREQFTITELNSLKTYLDEGGSLLIALGEEGE-RGSSTNINFLLEQYGVSV 67 (259)
Q Consensus 9 ~advLVI~~P~~~fs~~E~~~L~~fl~~GG~llv~~~~~~~-~~~~~nln~LL~~~Gi~~ 67 (259)
+.|+++|+.|. ....+.+++.++.|-++++= -|... ......+-.+.++.|..+
T Consensus 62 ~~D~V~I~tp~----~~h~~~~~~~l~~g~~v~~E-KP~~~~~~~~~~l~~~a~~~~~~~ 116 (120)
T PF01408_consen 62 DVDAVIIATPP----SSHAEIAKKALEAGKHVLVE-KPLALTLEEAEELVEAAKEKGVKV 116 (120)
T ss_dssp TESEEEEESSG----GGHHHHHHHHHHTTSEEEEE-SSSSSSHHHHHHHHHHHHHHTSCE
T ss_pred cCCEEEEecCC----cchHHHHHHHHHcCCEEEEE-cCCcCCHHHHHHHHHHHHHhCCEE
Confidence 79999999995 56788889999999866652 33321 001223445555666554
No 138
>PRK09189 uroporphyrinogen-III synthase; Validated
Probab=22.38 E-value=1.4e+02 Score=25.27 Aligned_cols=51 Identities=12% Similarity=0.017 Sum_probs=36.5
Q ss_pred CCHHH-HHHHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHHhCcccCCceeeecc
Q psy15702 22 FTITE-LNSLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQYGVSVNSDCVVRCH 76 (259)
Q Consensus 22 fs~~E-~~~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~~Gi~~~~~~V~d~~ 76 (259)
++.+. ++.+.+.+..|++++++-+..+ ...+...|++.|+.++.=.|.++.
T Consensus 102 ~~~e~L~~~~~~~~~~~~~vL~~rg~~~----r~~l~~~L~~~G~~v~~~~vY~~~ 153 (240)
T PRK09189 102 GDGVRLAETVAAALAPTARLLYLAGRPR----APVFEDRLAAAGIPFRVAECYDML 153 (240)
T ss_pred CCHHHHHHHHHHhcCCCCcEEEeccCcc----cchhHHHHHhCCCeeEEEEEEEee
Confidence 44444 3445555568999999987764 467999999999998766666543
No 139
>TIGR02717 AcCoA-syn-alpha acetyl coenzyme A synthetase (ADP forming), alpha domain. Although technically reversible, it is believed that this group of ADP-dependent acetyl-CoA synthetases (ACS) act in the direction of acetate and ATP production in the organisms in which it has been characterized. In most species this protein exists as a fused alpha-beta domain polypeptide. In Pyrococcus and related species, however the domains exist as separate polypeptides. This model represents the alpha (N-terminal) domain. In Pyrococcus and related species there appears to have been the development of a paralogous family such that four other proteins are close relatives. In reference, one of these (along with its beta-domain partner) was characterized as ACS-II showing specificity for phenylacetyl-CoA. This model has been constructed to exclude these non-ACS-I paralogs. This may result in new, authentic ACS-I sequences falling below the trusted cutoff.
Probab=22.26 E-value=2.8e+02 Score=26.29 Aligned_cols=64 Identities=14% Similarity=0.207 Sum_probs=42.7
Q ss_pred hhhcCCccEEEEcCCCCCCCHHHHHHHHHHHHcCCcEEEEecCCC-CCC-----CCccHHHHHHHhCccc-CCce
Q psy15702 4 PELLRAASLFICTGPREQFTITELNSLKTYLDEGGSLLIALGEEG-ERG-----SSTNINFLLEQYGVSV-NSDC 71 (259)
Q Consensus 4 ~~ip~~advLVI~~P~~~fs~~E~~~L~~fl~~GG~llv~~~~~~-~~~-----~~~nln~LL~~~Gi~~-~~~~ 71 (259)
.++|+..|+.||.-|. +.-.+.+++-.+.|=+.++++..+. +.. ....+-...+++|+.+ .+++
T Consensus 59 ~~lp~~~Dlavi~vp~----~~~~~~l~e~~~~gv~~~vi~s~gf~e~g~~g~~~~~~l~~~a~~~girvlGPnc 129 (447)
T TIGR02717 59 LEIPDPVDLAVIVVPA----KYVPQVVEECGEKGVKGAVVITAGFKEVGEEGAELEQELVEIARKYGMRLLGPNC 129 (447)
T ss_pred HHCCCCCCEEEEecCH----HHHHHHHHHHHhcCCCEEEEECCCccccCcchHHHHHHHHHHHHHcCCEEEecCe
Confidence 3678899999999884 4446777888888877666655432 211 1234556777889995 3554
No 140
>PF11576 DUF3236: Protein of unknown function (DUF3236); InterPro: IPR012019 This family of proteins with unknown function appears to be restricted to Methanobacteria. ; PDB: 3BRC_B.
Probab=22.14 E-value=57 Score=26.13 Aligned_cols=41 Identities=17% Similarity=0.362 Sum_probs=26.4
Q ss_pred CHHHHHHHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHHhCcc
Q psy15702 23 TITELNSLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQYGVS 66 (259)
Q Consensus 23 s~~E~~~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~~Gi~ 66 (259)
..+|+++|++|+.+.-++.|...-.. .-.-++..+.++|+.
T Consensus 21 k~EEv~~Ir~~I~nakkIvV~t~N~k---Kf~vi~~il~~~~~~ 61 (154)
T PF11576_consen 21 KEEEVEAIREYILNAKKIVVATNNEK---KFKVINDILSKFNLP 61 (154)
T ss_dssp -HHHHHHHHHHHHH-S-EEE----HH---HHHHHHHHHHHTT--
T ss_pred cHHHHHHHHHHHhcCceEEEecCCch---HhHHHHHHHHHhcCC
Confidence 45899999999999999998754432 245688999999743
No 141
>COG1889 NOP1 Fibrillarin-like rRNA methylase [Translation, ribosomal structure and biogenesis]
Probab=21.96 E-value=1.1e+02 Score=26.30 Aligned_cols=39 Identities=15% Similarity=0.349 Sum_probs=26.0
Q ss_pred CCccEEE--EcCCCCCCCHHHHHHHHHHHHcCCcEEEEecCCC
Q psy15702 8 RAASLFI--CTGPREQFTITELNSLKTYLDEGGSLLIALGEEG 48 (259)
Q Consensus 8 ~~advLV--I~~P~~~fs~~E~~~L~~fl~~GG~llv~~~~~~ 48 (259)
+.+|++. |+.|.+ .+--+..-+.||++||.+++++-...
T Consensus 144 e~VDviy~DVAQp~Q--a~I~~~Na~~FLk~~G~~~i~iKArS 184 (231)
T COG1889 144 EKVDVIYQDVAQPNQ--AEILADNAEFFLKKGGYVVIAIKARS 184 (231)
T ss_pred ccccEEEEecCCchH--HHHHHHHHHHhcccCCeEEEEEEeec
Confidence 4466665 455532 33345566899999999999887654
No 142
>cd03421 SirA_like_N SirA_like_N, a protein of unknown function with an N-terminal SirA-like domain. The SirA, YedF, YeeD protein family is present in bacteria as well as archaea. SirA (also known as UvrY, and YhhP) belongs to a family of a two-component response regulators that controls secondary metabolism and virulence. The other member of this two-component system is a sensor kinase called BarA which phosphorylates SirA. A variety of microorganisms have similar proteins, all of which contain a common CPxP sequence motif in the N-terminal region. YhhP is suggested to be important for normal cell division and growth in rich nutrient medium. Moreover, despite a low primary sequence similarity, the YccP structure closely resembles the non-homologous C-terminal RNA-binding domain of E. coli translation initiation factor IF3. The signature CPxP motif serves to stabilize the N-terminal helix as part of the N-capping box and might be important in mRNA-binding.
Probab=21.84 E-value=66 Score=21.58 Aligned_cols=32 Identities=25% Similarity=0.455 Sum_probs=25.0
Q ss_pred HHcCCcEEEEecCCCCCCCCccHHHHHHHhCcccC
Q psy15702 34 LDEGGSLLIALGEEGERGSSTNINFLLEQYGVSVN 68 (259)
Q Consensus 34 l~~GG~llv~~~~~~~~~~~~nln~LL~~~Gi~~~ 68 (259)
++.|+.+.+.++... ...++..++++.|.++.
T Consensus 22 ~~~g~~l~v~~d~~~---s~~~i~~~~~~~G~~~~ 53 (67)
T cd03421 22 LEAGGEIEVLVDNEV---AKENVSRFAESRGYEVS 53 (67)
T ss_pred cCCCCEEEEEEcChh---HHHHHHHHHHHcCCEEE
Confidence 457888888877654 35799999999999883
No 143
>cd03128 GAT_1 Type 1 glutamine amidotransferase (GATase1)-like domain. Type 1 glutamine amidotransferase (GATase1)-like domain. This group contains proteins similar to Class I glutamine amidotransferases, the intracellular PH1704 from Pyrococcus horikoshii, the C-terminal of the large catalase: Escherichia coli HP-II, Sinorhizobium meliloti Rm1021 ThuA, the A4 beta-galactosidase middle domain and peptidase E. The majority of proteins in this group have a reactive Cys found in the sharp turn between a beta strand and an alpha helix termed the nucleophile elbow. For Class I glutamine amidotransferases proteins which transfer ammonia from the amide side chain of glutamine to an acceptor substrate, this Cys forms a Cys-His-Glu catalytic triad in the active site. Glutamine amidotransferases activity can be found in a range of biosynthetic enzymes included in this cd: glutamine amidotransferase, formylglycinamide ribonucleotide, GMP synthetase, anthranilate synthase component II, glutamin
Probab=21.56 E-value=2.3e+02 Score=18.07 Aligned_cols=36 Identities=19% Similarity=0.083 Sum_probs=22.7
Q ss_pred hcCCccEEEEcCCCCCC-----CHHHHHHHHHHHHcCCcEE
Q psy15702 6 LLRAASLFICTGPREQF-----TITELNSLKTYLDEGGSLL 41 (259)
Q Consensus 6 ip~~advLVI~~P~~~f-----s~~E~~~L~~fl~~GG~ll 41 (259)
...++|.+|+.+....- ....++.+.++..+|+.++
T Consensus 43 ~~~~~~~lii~g~~~~~~~~~~~~~~~~~~~~~~~~~~~i~ 83 (92)
T cd03128 43 DLDDYDGLILPGGPGTPDDLAWDEALLALLREAAAAGKPVL 83 (92)
T ss_pred CcccCCEEEECCCCcchhhhccCHHHHHHHHHHHHcCCEEE
Confidence 36788999998875422 2345666777776644433
No 144
>PRK09275 aspartate aminotransferase; Provisional
Probab=21.49 E-value=2e+02 Score=28.16 Aligned_cols=41 Identities=10% Similarity=0.190 Sum_probs=30.9
Q ss_pred CCccEEEEcCCCCC----CCHHHHHHHHHHHH-cCCcEEEEecCCC
Q psy15702 8 RAASLFICTGPREQ----FTITELNSLKTYLD-EGGSLLIALGEEG 48 (259)
Q Consensus 8 ~~advLVI~~P~~~----fs~~E~~~L~~fl~-~GG~llv~~~~~~ 48 (259)
+++.+++|..|..+ ++.++++.|-++.+ ...+++|+.|+..
T Consensus 240 ~~tkai~l~nP~NPTG~v~s~e~l~~I~~ia~~~~~~l~II~DEvY 285 (527)
T PRK09275 240 PSIKALFLVNPSNPPSVAMSDESLEKIADIVNEKRPDLMIITDDVY 285 (527)
T ss_pred CCCCEEEEeCCcCCcCCCCCHHHHHHHHHHHHhcCCCcEEEECCCC
Confidence 45778888888754 88888999988885 3556777778754
No 145
>cd01076 NAD_bind_1_Glu_DH NAD(P) binding domain of glutamate dehydrogenase, subgroup 1. Amino acid dehydrogenase (DH) is a widely distributed family of enzymes that catalyzes the oxidative deamination of an amino acid to its keto acid and ammonia with concomitant reduction of NADP+. Glutamate DH is a multidomain enzyme that catalyzes the reaction from glutamate to 2-oxyoglutarate and ammonia in the presence of NAD or NADP. It is present in all organisms. Enzymes involved in ammonia assimilation are typically NADP+-dependent, while those involved in glutamate catabolism are generally NAD+-dependent. Amino acid DH-like NAD(P)-binding domains are members of the Rossmann fold superfamily and include glutamate, leucine, and phenylalanine DHs, methylene tetrahydrofolate DH, methylene-tetrahydromethanopterin DH, methylene-tetrahydropholate DH/cyclohydrolase, Shikimate DH-like proteins, malate oxidoreductases, and glutamyl tRNA reductase. Amino acid DHs catalyze the deamination of amino acids
Probab=21.47 E-value=1.6e+02 Score=25.29 Aligned_cols=58 Identities=17% Similarity=0.237 Sum_probs=44.2
Q ss_pred CCccEEEEcCCCCCCCHHHHHHHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHHhCcccCCceeee
Q psy15702 8 RAASLFICTGPREQFTITELNSLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQYGVSVNSDCVVR 74 (259)
Q Consensus 8 ~~advLVI~~P~~~fs~~E~~~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~~Gi~~~~~~V~d 74 (259)
.+||++|=++++.-.+++.+.+|+ .=+.++....+ ..+.-...|++-||.+-+|.+..
T Consensus 105 ~~~Dvlip~a~~~~i~~~~~~~l~--------a~~I~egAN~~-~t~~a~~~L~~rGi~~~PD~~aN 162 (227)
T cd01076 105 LDCDILIPAALENQITADNADRIK--------AKIIVEAANGP-TTPEADEILHERGVLVVPDILAN 162 (227)
T ss_pred ecccEEEecCccCccCHHHHhhce--------eeEEEeCCCCC-CCHHHHHHHHHCCCEEEChHHhc
Confidence 479999999999999999999985 33333433322 24677899999999999888774
No 146
>TIGR02522 pilus_cpaD pilus (Caulobacter type) biogenesis lipoprotein CpaD. This family consists of a pilus biogenesis protein, CpaD, from Caulobacter, and homologs in other bacteria, including three in the root nodule bacterium Bradyrhizobium japonicum. The molecular function is not known.
Probab=21.42 E-value=1.5e+02 Score=24.96 Aligned_cols=50 Identities=14% Similarity=0.228 Sum_probs=35.7
Q ss_pred CCCCHHHHHHHHHHHH----cCCcEEEEecCCC-CCCCCccHHHHHHHhCcccCC
Q psy15702 20 EQFTITELNSLKTYLD----EGGSLLIALGEEG-ERGSSTNINFLLEQYGVSVNS 69 (259)
Q Consensus 20 ~~fs~~E~~~L~~fl~----~GG~llv~~~~~~-~~~~~~nln~LL~~~Gi~~~~ 69 (259)
..|+..+...|+.|++ .|+..+.+..|.+ .......+..+|.++||...+
T Consensus 53 ~~Lt~~qr~~l~~f~~~~~~~g~~~i~i~~psgaa~~~~~~ir~~L~~~Gv~~~~ 107 (198)
T TIGR02522 53 RGLSASQQDQLLGFLKDWGRASAQTLTLIIPSAAAEAMAGEIRRVLAASGVGARN 107 (198)
T ss_pred CCCCHHHHHHHHHHHHHHhhcCCceEEEecChhHHHHHHHHHHHHHHHcCCChhc
Confidence 4499999999999885 3444555555552 222346788999999999877
No 147
>PRK12359 flavodoxin FldB; Provisional
Probab=21.13 E-value=4.2e+02 Score=21.65 Aligned_cols=62 Identities=19% Similarity=0.266 Sum_probs=33.4
Q ss_pred hcCCccEEEEcCCCCCCCH--HHHHHHHHHHHc---CCcEEEEecCCCCCCCCc-------cHHHHHHHhCccc
Q psy15702 6 LLRAASLFICTGPREQFTI--TELNSLKTYLDE---GGSLLIALGEEGERGSST-------NINFLLEQYGVSV 67 (259)
Q Consensus 6 ip~~advLVI~~P~~~fs~--~E~~~L~~fl~~---GG~llv~~~~~~~~~~~~-------nln~LL~~~Gi~~ 67 (259)
-+.++|++|+..|+-..-+ .+....-..+.+ .|+.+.+.+-+....... .+...|++.|..+
T Consensus 42 ~l~~yD~iIlG~pTw~~Gel~~d~~~~~~~l~~~dl~gK~vAlFG~Gd~~~y~~~f~~a~~~l~~~l~~~Ga~i 115 (172)
T PRK12359 42 LMEQYDVLILGIPTWDFGEIQEDWEAVWDQLDDLNLEGKIVALYGMGDQLGYGEWFLDALGMLHDKLAPKGVKF 115 (172)
T ss_pred HHccCCEEEEEecccCCCcCcHHHHHHHHHHhhCCCCCCEEEEEeCCCCccchHHHHHHHHHHHHHHHhCCCeE
Confidence 4568999999999953332 244443333432 254444555443211222 4666666677665
No 148
>COG0803 LraI ABC-type metal ion transport system, periplasmic component/surface adhesin [Inorganic ion transport and metabolism]
Probab=21.05 E-value=2.3e+02 Score=25.31 Aligned_cols=51 Identities=18% Similarity=0.201 Sum_probs=41.8
Q ss_pred cCCCCCCCHHHHHHHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHHhCcccCC
Q psy15702 16 TGPREQFTITELNSLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQYGVSVNS 69 (259)
Q Consensus 16 ~~P~~~fs~~E~~~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~~Gi~~~~ 69 (259)
.+|...-|..++..|.+++++++--.++..... .....+.+.++.|+.+..
T Consensus 219 ~~~~~e~s~~~l~~l~~~ik~~~i~~If~e~~~---~~~~~~~la~e~g~~~~~ 269 (303)
T COG0803 219 ISPEAEPSPKDLAKLVDLIKKKNIKAIFVESNV---SSKSAETLAKETGVKILG 269 (303)
T ss_pred cCcccCCCHHHHHHHHHHHHHcCCCEEEEeCCC---ChHHHHHHHHHhCCcccc
Confidence 356678999999999999999998888876654 356788999999977654
No 149
>TIGR03880 KaiC_arch_3 KaiC domain protein, AF_0351 family. This model represents a rather narrowly distributed archaeal protein family in which members have a single copy of the KaiC domain. This stands in contrast to the circadian clock protein KaiC itself, with two copies of the domain. Members are expected to have weak ATPase activity, by homology to the autokinase/autophosphorylase KaiC itself.
Probab=20.92 E-value=2.2e+02 Score=23.71 Aligned_cols=63 Identities=14% Similarity=0.028 Sum_probs=40.6
Q ss_pred CChhhcCCccEEEEcCCCCCCCHHHHHHHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHHhCcccC
Q psy15702 2 ITPELLRAASLFICTGPREQFTITELNSLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQYGVSVN 68 (259)
Q Consensus 2 lt~~ip~~advLVI~~P~~~fs~~E~~~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~~Gi~~~ 68 (259)
+...+|...-++|...|-.--|.--.+-+...+++|.+++++.-+.. ...+-.-++.+|..+.
T Consensus 9 l~gGi~~g~~~li~G~~G~GKt~~~~~~~~~~~~~g~~~~y~s~e~~----~~~l~~~~~~~~~~~~ 71 (224)
T TIGR03880 9 LGGGFPEGHVIVVIGEYGTGKTTFSLQFLYQGLKNGEKAMYISLEER----EERILGYAKSKGWDLE 71 (224)
T ss_pred hcCCCCCCeEEEEECCCCCCHHHHHHHHHHHHHhCCCeEEEEECCCC----HHHHHHHHHHcCCChH
Confidence 34567787777777766655555566777888889888877754432 3345555566676553
No 150
>cd03237 ABC_RNaseL_inhibitor_domain2 The ATPase domain 2 of RNase L inhibitor. The ABC ATPase, RNase L inhibitor (RLI), is a key enzyme in ribosomal biogenesis, formation of translation preinitiation complexes, and assembly of HIV capsids. RLI's are not transport proteins and thus cluster with a group of soluble proteins that lack the transmembrane components commonly found in other members of the family. Structurally, RLI's have an N-terminal Fe-S domain and two nucleotide-binding domains which are arranged to form two composite active sites in their interface cleft. RLI is one of the most conserved enzymes between archaea and eukaryotes with a sequence identity of more than 48%. The high degree of evolutionary conservation suggests that RLI performs a central role in archaeal and eukaryotic physiology.
Probab=20.84 E-value=2e+02 Score=24.75 Aligned_cols=42 Identities=7% Similarity=0.195 Sum_probs=29.2
Q ss_pred hhcCCccEEEEcCCCCCCCHHHHHHHH----HHHHcCCcEEEEecC
Q psy15702 5 ELLRAASLFICTGPREQFTITELNSLK----TYLDEGGSLLIALGE 46 (259)
Q Consensus 5 ~ip~~advLVI~~P~~~fs~~E~~~L~----~fl~~GG~llv~~~~ 46 (259)
.+..+++++++=-|...+.......+. ++.++.|+.++++.+
T Consensus 129 aL~~~p~llllDEPt~~LD~~~~~~l~~~l~~~~~~~~~tiiivsH 174 (246)
T cd03237 129 CLSKDADIYLLDEPSAYLDVEQRLMASKVIRRFAENNEKTAFVVEH 174 (246)
T ss_pred HHhcCCCEEEEeCCcccCCHHHHHHHHHHHHHHHHhcCCEEEEEeC
Confidence 356789999999999999887765544 444444555555554
No 151
>PRK05452 anaerobic nitric oxide reductase flavorubredoxin; Provisional
Probab=20.78 E-value=1.4e+02 Score=28.69 Aligned_cols=60 Identities=13% Similarity=0.178 Sum_probs=38.4
Q ss_pred CccEEEEcCCCCCCCHHHHHHHHHHHHc------CCcEEEEecCCCC-CCCCccHHHHHHHhCcccCCc
Q psy15702 9 AASLFICTGPREQFTITELNSLKTYLDE------GGSLLIALGEEGE-RGSSTNINFLLEQYGVSVNSD 70 (259)
Q Consensus 9 ~advLVI~~P~~~fs~~E~~~L~~fl~~------GG~llv~~~~~~~-~~~~~nln~LL~~~Gi~~~~~ 70 (259)
+||.+|+++|+. ...+...+..|+.. .||...+++..+- ......+...|+..|+.+-..
T Consensus 304 ~ad~vilGspT~--~~~~~p~~~~fl~~l~~~~l~gK~~~vFGSygw~g~a~~~~~~~l~~~g~~~~~~ 370 (479)
T PRK05452 304 RSKGVLVGSSTM--NNVMMPKIAGLLEEITGLRFRNKRASAFGSHGWSGGAVDRLSTRLQDAGFEMSLS 370 (479)
T ss_pred hCCEEEEECCcc--CCcchHHHHHHHHHhhccCcCCCEEEEEECCCcCcHHHHHHHHHHHHCCCEEecc
Confidence 699999999984 33444445555432 4666666665431 113467888888999998543
No 152
>COG1587 HemD Uroporphyrinogen-III synthase [Coenzyme metabolism]
Probab=20.66 E-value=73 Score=27.47 Aligned_cols=59 Identities=25% Similarity=0.255 Sum_probs=41.6
Q ss_pred EEcCCCCCCCHHHHHHHHHHHHcCCcEEEEecCCCCCCCCccHHHHHHHhCcccCCceeeecc
Q psy15702 14 ICTGPREQFTITELNSLKTYLDEGGSLLIALGEEGERGSSTNINFLLEQYGVSVNSDCVVRCH 76 (259)
Q Consensus 14 VI~~P~~~fs~~E~~~L~~fl~~GG~llv~~~~~~~~~~~~nln~LL~~~Gi~~~~~~V~d~~ 76 (259)
....|...-++.-+..+.++...|-+++++.+..+ .+.+..-|.+.|+.+..=.+.++.
T Consensus 100 ~~~~p~~~~~~~l~~~l~~~~~~~~~vl~~~~~~~----r~~l~~~L~~~G~~v~~~~~Y~~~ 158 (248)
T COG1587 100 VDFIPEDGDSEGLLEELPELLKGGKRVLILRGNGG----REVLEEKLEERGAEVREVEVYRTE 158 (248)
T ss_pred CCcCCCccchHHHHHHhhhhccCCCeEEEEcCCCc----hHHHHHHHHhCCCEEEEEeeeeec
Confidence 33444432333447788888878889999988875 467999999999999765555543
No 153
>PF05939 Phage_min_tail: Phage minor tail protein; InterPro: IPR010265 This entry is represented by Bacteriophage lambda, GpM, the minor tail protein. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of a series of phage minor tail proteins and related sequences from several bacterial species.
Probab=20.46 E-value=1.7e+02 Score=21.99 Aligned_cols=28 Identities=25% Similarity=0.385 Sum_probs=21.8
Q ss_pred CCHHHHHHHHHHHHcCC--cEEEEecCCCC
Q psy15702 22 FTITELNSLKTYLDEGG--SLLIALGEEGE 49 (259)
Q Consensus 22 fs~~E~~~L~~fl~~GG--~llv~~~~~~~ 49 (259)
.+.+++.+|++||++.| +-+....|.++
T Consensus 50 ~~~~~~~~I~~FL~~h~G~~sF~WtpP~~~ 79 (109)
T PF05939_consen 50 GTEAEIRAIEAFLDRHGGVKSFLWTPPGGE 79 (109)
T ss_pred ECHHHHHHHHHHHHHCCCceEEEEECCCCC
Confidence 35688999999997443 78888888764
No 154
>COG0436 Aspartate/tyrosine/aromatic aminotransferase [Amino acid transport and metabolism]
Probab=20.43 E-value=1.3e+02 Score=28.10 Aligned_cols=40 Identities=18% Similarity=0.241 Sum_probs=25.2
Q ss_pred cCCccEEEEcCCCCC----CCHHHHHHHHHHHHcCCcEEEEecCC
Q psy15702 7 LRAASLFICTGPREQ----FTITELNSLKTYLDEGGSLLIALGEE 47 (259)
Q Consensus 7 p~~advLVI~~P~~~----fs~~E~~~L~~fl~~GG~llv~~~~~ 47 (259)
-++..+++|..|.++ ++.+|+++|-++.++-|- +++.|+-
T Consensus 161 ~~ktk~i~ln~P~NPTGav~~~~~l~~i~~~a~~~~i-~ii~DEi 204 (393)
T COG0436 161 TPKTKAIILNSPNNPTGAVYSKEELKAIVELAREHDI-IIISDEI 204 (393)
T ss_pred CccceEEEEeCCCCCcCcCCCHHHHHHHHHHHHHcCe-EEEEehh
Confidence 345667777777644 677777777777777443 4444543
No 155
>TIGR00379 cobB cobyrinic acid a,c-diamide synthase. This model describes cobyrinic acid a,c-diamide synthase, the cobB (cbiA in Salmonella) protein of cobalamin biosynthesis. It is responsible for the amidation of carboxylic groups at positions A and C of either cobyrinic acid or hydrogenobrynic acid. NH(2) groups are provided by glutamine and one molecule of ATP hydrogenolyzed for each amidation.
Probab=20.33 E-value=86 Score=29.83 Aligned_cols=40 Identities=23% Similarity=0.283 Sum_probs=27.9
Q ss_pred hhcCCccEEEEcCCCCCCCHHH-------HHHHHHHHHcCCcEEEEec
Q psy15702 5 ELLRAASLFICTGPREQFTITE-------LNSLKTYLDEGGSLLIALG 45 (259)
Q Consensus 5 ~ip~~advLVI~~P~~~fs~~E-------~~~L~~fl~~GG~llv~~~ 45 (259)
++| ++|.|+|.|-....-..+ .++|++|+++|+.++-..+
T Consensus 283 ~l~-~~d~l~ipGG~~~~~~~~l~~~~~~~~~i~~~~~~G~pv~g~Cg 329 (449)
T TIGR00379 283 ELP-DVDAVYIGGGFPELFAEELSQNQALRDSIKTFIHQGLPIYGECG 329 (449)
T ss_pred CCC-CCCEEEeCCcHHHHHHHHHHhhhHHHHHHHHHHHcCCCEEEEcH
Confidence 445 999999999863222111 5789999999987765443
No 156
>PRK14004 hisH imidazole glycerol phosphate synthase subunit HisH; Provisional
Probab=20.29 E-value=1.2e+02 Score=25.72 Aligned_cols=37 Identities=14% Similarity=0.244 Sum_probs=26.5
Q ss_pred hcCCccEEEEcCCCCCCCHH--------HHHHHHHHHHcCCcEEEE
Q psy15702 6 LLRAASLFICTGPREQFTIT--------ELNSLKTYLDEGGSLLIA 43 (259)
Q Consensus 6 ip~~advLVI~~P~~~fs~~--------E~~~L~~fl~~GG~llv~ 43 (259)
-.+++|.||+.|+.. +++. -...|++++++|..+|=.
T Consensus 34 ~l~~~d~iIlPG~g~-~~~~~~~l~~~gl~~~i~~~~~~~~pilGi 78 (210)
T PRK14004 34 TIENSKALILPGDGH-FDKAMENLNSTGLRSTIDKHVESGKPLFGI 78 (210)
T ss_pred HhccCCEEEECCCCc-hHHHHHHHHHcCcHHHHHHHHHcCCCEEEE
Confidence 357999999999985 4442 256778888887775543
No 157
>TIGR03855 NAD_NadX aspartate dehydrogenase. Members of this protein family are L-aspartate dehydrogenase, as shown for the NADP-dependent enzyme TM_1643 of Thermotoga maritima. Members lack homology to NadB, the aspartate oxidase (EC 1.4.3.16) of most mesophilic bacteria (described by TIGR00551), which this enzyme replaces in the generation of oxaloacetate from aspartate for the NAD biosynthetic pathway. All members of the seed alignment are found adjacent to other genes of NAD biosynthesis, although other uses of L-aspartate dehydrogenase may occur.
Probab=20.17 E-value=2e+02 Score=24.79 Aligned_cols=55 Identities=15% Similarity=0.148 Sum_probs=39.2
Q ss_pred CCccEEEEcCCCCCCCHHHHHHHHHHHHcCCcEEEEecCCCC--CCCCccHHHHHHHhCccc
Q psy15702 8 RAASLFICTGPREQFTITELNSLKTYLDEGGSLLIALGEEGE--RGSSTNINFLLEQYGVSV 67 (259)
Q Consensus 8 ~~advLVI~~P~~~fs~~E~~~L~~fl~~GG~llv~~~~~~~--~~~~~nln~LL~~~Gi~~ 67 (259)
++.|+++++.| +....+...+.|+.|-.++++ .++.- ......+....++.|..+
T Consensus 36 ~~vDaVviatp----~~~H~e~a~~aL~aGkhVl~~-s~gAlad~e~~~~l~~aA~~~g~~l 92 (229)
T TIGR03855 36 EDVDIVVEAAS----QEAVKEYAEKILKNGKDLLIM-SVGALADRELRERLREVARSSGRKV 92 (229)
T ss_pred CCCCEEEECCC----hHHHHHHHHHHHHCCCCEEEE-CCcccCCHHHHHHHHHHHHhcCCEE
Confidence 57999999999 456678888999999999984 55321 011234566777888765
No 158
>PRK13896 cobyrinic acid a,c-diamide synthase; Provisional
Probab=20.04 E-value=1.6e+02 Score=28.00 Aligned_cols=41 Identities=17% Similarity=0.152 Sum_probs=29.4
Q ss_pred hhcCCccEEEEcCCCCCCCHHH------HHHHHHHHHcCCcEEEEecC
Q psy15702 5 ELLRAASLFICTGPREQFTITE------LNSLKTYLDEGGSLLIALGE 46 (259)
Q Consensus 5 ~ip~~advLVI~~P~~~fs~~E------~~~L~~fl~~GG~llv~~~~ 46 (259)
++| +||.|+|.|-...+-..+ .+.|++|.++|+.++-..+.
T Consensus 271 ~lp-~~D~l~lpGG~~e~~~~~L~~n~~~~~i~~~~~~G~pi~aeCGG 317 (433)
T PRK13896 271 PLP-DCDGVYLPGGYPELHADALADSPALDELADRAADGLPVLGECGG 317 (433)
T ss_pred CCC-CCCEEEeCCCchhhHHHHHHhCCcHHHHHHHHHCCCcEEEEehH
Confidence 455 999999999874433332 47899999999887665443
Done!