Query psy627
Match_columns 89
No_of_seqs 109 out of 354
Neff 4.9
Searched_HMMs 46136
Date Fri Aug 16 22:05:36 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy627.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/627hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF04800 ETC_C1_NDUFA4: ETC co 100.0 1E-49 2.2E-54 268.4 6.0 86 1-86 13-99 (101)
2 KOG3389|consensus 100.0 1.2E-43 2.7E-48 254.5 7.0 89 1-89 88-178 (178)
3 PF09954 DUF2188: Uncharacteri 91.5 0.14 3E-06 30.7 1.8 41 27-67 10-55 (62)
4 PRK05370 argininosuccinate syn 87.0 0.44 9.5E-06 39.6 2.1 22 40-61 164-185 (447)
5 PHA02552 4 head completion pro 85.9 0.75 1.6E-05 33.1 2.6 21 45-65 120-140 (151)
6 PF07045 DUF1330: Protein of u 81.7 1.5 3.2E-05 26.4 2.3 21 32-52 36-56 (65)
7 PF08727 P3A: Poliovirus 3A pr 81.0 1.1 2.3E-05 27.7 1.5 16 46-61 30-45 (57)
8 PF08722 Tn7_Tnp_TnsA_N: TnsA 79.0 2.5 5.4E-05 26.5 2.8 19 45-63 69-87 (88)
9 PF11360 DUF3110: Protein of u 78.4 2.1 4.5E-05 27.9 2.4 30 36-65 23-77 (86)
10 PF05573 NosL: NosL; InterPro 77.4 2.2 4.7E-05 29.7 2.4 47 9-56 77-135 (149)
11 PF06194 Phage_Orf51: Phage Co 77.2 2.1 4.5E-05 28.0 2.0 26 36-61 48-73 (80)
12 PF02844 GARS_N: Phosphoribosy 74.0 3.8 8.1E-05 27.5 2.7 30 37-66 43-72 (100)
13 cd01207 Ena-Vasp Enabled-VASP- 72.2 3.5 7.6E-05 28.2 2.3 22 34-55 82-103 (111)
14 PRK04527 argininosuccinate syn 71.4 2.8 6.1E-05 34.2 1.9 22 38-59 146-168 (400)
15 COG5470 Uncharacterized conser 69.9 5.2 0.00011 27.0 2.7 26 23-51 44-69 (96)
16 KOG2854|consensus 69.4 4.2 9.1E-05 32.9 2.5 23 38-60 215-237 (343)
17 PF05406 WGR: WGR domain; Int 69.1 3.6 7.8E-05 25.6 1.7 45 8-54 19-64 (81)
18 PF03462 PCRF: PCRF domain; I 68.4 4.1 8.8E-05 27.3 1.9 15 49-63 73-87 (115)
19 PF03108 DBD_Tnp_Mut: MuDR fam 66.4 7.2 0.00016 23.2 2.6 35 33-67 1-39 (67)
20 PF00568 WH1: WH1 domain; Int 66.2 6.4 0.00014 25.9 2.5 19 36-54 88-106 (111)
21 PF12518 DUF3721: Protein of u 65.6 6.5 0.00014 21.9 2.1 16 42-57 1-16 (34)
22 KOG4748|consensus 65.5 4.6 9.9E-05 32.9 2.0 22 44-65 127-148 (364)
23 cd00837 EVH1 EVH1 (Enabled, Va 63.7 7.8 0.00017 25.4 2.6 20 35-54 80-99 (104)
24 PRK05773 3,4-dihydroxy-2-butan 62.3 7.3 0.00016 29.5 2.5 18 43-60 191-208 (219)
25 smart00461 WH1 WASP homology r 61.7 8.7 0.00019 25.4 2.5 20 35-54 82-101 (106)
26 PF01693 Cauli_VI: Caulimoviru 61.5 8.4 0.00018 21.6 2.1 14 38-51 31-44 (44)
27 PF01071 GARS_A: Phosphoribosy 60.5 10 0.00022 28.0 3.0 26 39-64 20-45 (194)
28 PRK09437 bcp thioredoxin-depen 59.4 13 0.00029 24.7 3.1 27 38-64 68-94 (154)
29 COG0108 RibB 3,4-dihydroxy-2-b 59.3 8.6 0.00019 29.1 2.4 21 43-63 172-192 (203)
30 COG0137 ArgG Argininosuccinate 59.1 7.4 0.00016 32.2 2.1 22 40-62 150-171 (403)
31 PF13031 DUF3892: Protein of u 58.4 10 0.00022 23.9 2.3 20 43-62 31-50 (85)
32 PF07862 Nif11: Nitrogen fixat 57.7 5.4 0.00012 22.6 0.8 19 43-61 27-45 (49)
33 PF08803 ydhR: Putative mono-o 57.2 8.1 0.00018 26.0 1.8 28 28-55 38-66 (97)
34 PLN00139 hypothetical protein; 56.5 11 0.00023 29.6 2.6 25 40-64 195-219 (320)
35 PF10566 Glyco_hydro_97: Glyco 56.0 9.6 0.00021 29.6 2.2 26 39-64 28-53 (273)
36 smart00156 PP2Ac Protein phosp 55.9 21 0.00045 27.1 4.0 41 20-64 176-219 (271)
37 PF10075 PCI_Csn8: COP9 signal 55.5 7.8 0.00017 26.3 1.5 20 40-60 107-126 (143)
38 KOG1706|consensus 54.5 7.8 0.00017 31.8 1.6 23 41-63 154-176 (412)
39 PF03793 PASTA: PASTA domain; 53.4 17 0.00036 21.0 2.5 22 43-64 9-30 (63)
40 PF03682 UPF0158: Uncharacteri 52.6 11 0.00025 26.9 2.0 20 45-64 127-146 (163)
41 PF12745 HGTP_anticodon2: Anti 52.4 17 0.00036 28.1 3.1 23 43-65 44-66 (273)
42 PF09383 NIL: NIL domain; Int 51.2 18 0.00038 21.9 2.5 29 34-62 44-76 (76)
43 COG1225 Bcp Peroxiredoxin [Pos 50.6 13 0.00029 26.7 2.1 30 38-67 68-97 (157)
44 cd03017 PRX_BCP Peroxiredoxin 50.5 24 0.00052 22.7 3.2 28 38-65 61-88 (140)
45 COG3341 Predicted double-stran 49.9 9.8 0.00021 29.2 1.4 14 41-54 39-52 (225)
46 PF13911 AhpC-TSA_2: AhpC/TSA 49.6 18 0.0004 23.1 2.5 29 38-66 17-45 (115)
47 PLN00200 argininosuccinate syn 49.1 14 0.00029 30.3 2.2 18 42-59 153-170 (404)
48 TIGR00506 ribB 3,4-dihydroxy-2 48.7 17 0.00037 27.1 2.5 19 42-60 172-190 (199)
49 cd07996 WGR_MMR_like WGR domai 48.6 51 0.0011 19.7 4.3 50 6-55 12-62 (74)
50 PRK01792 ribB 3,4-dihydroxy-2- 48.4 17 0.00037 27.5 2.5 19 42-60 182-200 (214)
51 PF05198 IF3_N: Translation in 48.4 18 0.0004 22.9 2.3 26 42-67 27-52 (76)
52 PRK00923 sirohydrochlorin coba 47.0 31 0.00068 22.6 3.4 25 40-64 43-67 (126)
53 COG0290 InfC Translation initi 46.7 28 0.00061 25.8 3.3 33 43-75 34-68 (176)
54 PF14259 RRM_6: RNA recognitio 46.5 21 0.00046 20.4 2.2 20 37-56 41-60 (70)
55 cd00913 PCD_DCoH_subfamily_a P 45.5 32 0.00069 21.2 3.0 29 27-55 1-30 (76)
56 PRK13010 purU formyltetrahydro 45.4 32 0.0007 26.6 3.7 30 36-65 121-150 (289)
57 PF02786 CPSase_L_D2: Carbamoy 45.4 22 0.00048 26.1 2.6 21 43-63 25-45 (211)
58 cd01206 Homer Homer type EVH1 45.0 21 0.00045 24.7 2.3 23 32-54 78-102 (111)
59 KOG0081|consensus 44.6 19 0.00042 27.1 2.2 22 42-63 141-162 (219)
60 PF05637 Glyco_transf_34: gala 44.6 15 0.00033 27.4 1.7 18 45-62 30-47 (239)
61 PRK00910 ribB 3,4-dihydroxy-2- 44.5 21 0.00046 27.0 2.5 20 42-61 183-202 (218)
62 PF07014 Hs1pro-1_C: Hs1pro-1 44.3 9.1 0.0002 29.8 0.5 23 29-51 228-253 (261)
63 PF00926 DHBP_synthase: 3,4-di 44.2 22 0.00048 26.3 2.5 20 42-61 167-186 (194)
64 KOG0855|consensus 44.0 19 0.00042 27.2 2.1 30 38-67 128-157 (211)
65 PRK00823 phhB pterin-4-alpha-c 43.8 28 0.00062 22.5 2.7 34 23-56 16-50 (97)
66 PF00764 Arginosuc_synth: Argi 43.4 13 0.00029 30.3 1.3 17 45-61 147-163 (388)
67 PRK13820 argininosuccinate syn 42.6 17 0.00037 29.6 1.8 18 44-61 148-165 (394)
68 smart00757 CRA CT11-RanBPM. pr 42.3 20 0.00043 22.1 1.8 17 44-60 2-18 (99)
69 cd01999 Argininosuccinate_Synt 42.2 20 0.00044 29.0 2.2 20 42-61 146-165 (385)
70 smart00481 POLIIIAc DNA polyme 42.2 40 0.00087 19.6 3.0 24 43-66 15-38 (67)
71 PF00076 RRM_1: RNA recognitio 42.1 33 0.00071 19.0 2.5 20 38-57 42-62 (70)
72 cd03012 TlpA_like_DipZ_like Tl 42.1 27 0.00059 22.5 2.5 43 43-85 71-117 (126)
73 KOG2741|consensus 41.9 19 0.00042 29.2 2.0 21 38-58 36-56 (351)
74 PRK00014 ribB 3,4-dihydroxy-2- 41.8 25 0.00054 27.0 2.5 19 42-60 187-205 (230)
75 PRK06027 purU formyltetrahydro 41.5 36 0.00079 26.2 3.4 31 35-65 116-146 (286)
76 COG4314 NosL Predicted lipopro 41.5 21 0.00045 26.4 2.0 23 38-60 129-151 (176)
77 PRK11118 putative monooxygenas 41.3 36 0.00078 23.1 3.0 28 28-55 41-69 (100)
78 cd06576 PASTA_Pbp2x-like_1 PAS 39.6 51 0.0011 17.6 3.0 22 43-64 8-29 (55)
79 PF03960 ArsC: ArsC family; I 39.6 26 0.00057 22.6 2.1 21 44-64 8-28 (110)
80 PTZ00056 glutathione peroxidas 39.5 31 0.00067 24.8 2.6 21 43-63 89-109 (199)
81 cd00860 ThrRS_anticodon ThrRS 39.3 45 0.00097 19.8 3.0 24 41-64 36-59 (91)
82 PF03795 YCII: YCII-related do 39.2 21 0.00045 22.0 1.5 17 38-54 61-77 (95)
83 cd07202 cPLA2_Grp-IVC Group IV 39.0 15 0.00033 30.5 1.0 22 42-63 321-342 (430)
84 PF14657 Integrase_AP2: AP2-li 38.7 28 0.00062 19.4 1.9 19 36-54 17-36 (46)
85 PF00331 Glyco_hydro_10: Glyco 38.6 20 0.00043 27.7 1.6 22 39-60 55-76 (320)
86 PRK08179 prfH peptide chain re 38.6 25 0.00054 26.2 2.1 17 49-65 27-43 (200)
87 KOG0097|consensus 38.5 25 0.00053 26.1 2.0 17 44-60 135-151 (215)
88 cd03027 GRX_DEP Glutaredoxin ( 38.5 40 0.00086 19.8 2.6 24 42-65 11-34 (73)
89 COG5453 Uncharacterized conser 38.4 21 0.00046 24.1 1.5 18 41-58 68-85 (96)
90 cd02977 ArsC_family Arsenate R 38.4 29 0.00064 22.0 2.1 22 43-64 10-31 (105)
91 cd07238 Glo_EDI_BRP_like_5 Thi 37.7 76 0.0017 19.3 3.9 26 36-61 59-84 (112)
92 cd02970 PRX_like2 Peroxiredoxi 37.5 55 0.0012 21.0 3.4 28 38-65 61-88 (149)
93 cd07415 MPP_PP2A_PP4_PP6 PP2A, 37.4 67 0.0014 24.7 4.3 26 37-63 207-232 (285)
94 PF14542 Acetyltransf_CG: GCN5 37.0 33 0.00072 21.3 2.2 18 45-62 44-61 (78)
95 PF10009 DUF2252: Uncharacteri 36.9 25 0.00054 28.3 1.9 38 20-57 288-335 (385)
96 PF07611 DUF1574: Protein of u 36.7 20 0.00044 28.8 1.4 27 40-66 249-275 (345)
97 PF02811 PHP: PHP domain; Int 36.3 40 0.00086 22.2 2.6 23 43-65 16-38 (175)
98 TIGR03798 ocin_TIGR03798 bacte 36.2 17 0.00037 21.8 0.7 22 42-63 24-45 (64)
99 PF11823 DUF3343: Protein of u 36.1 48 0.001 20.1 2.8 24 39-62 46-69 (73)
100 cd04438 DEP_dishevelled DEP (D 35.9 45 0.00098 21.4 2.7 24 42-65 46-72 (84)
101 KOG0098|consensus 35.9 30 0.00065 26.4 2.1 18 43-60 129-146 (216)
102 PF01910 DUF77: Domain of unkn 35.9 34 0.00074 22.2 2.1 18 46-63 20-37 (92)
103 PRK12354 carbamate kinase; Rev 35.8 32 0.00069 27.2 2.3 22 41-62 125-146 (307)
104 cd04908 ACT_Bt0572_1 N-termina 35.4 58 0.0013 18.8 2.9 24 36-59 41-64 (66)
105 PF03129 HGTP_anticodon: Antic 35.3 55 0.0012 19.9 3.0 22 42-63 38-59 (94)
106 cd03412 CbiK_N Anaerobic cobal 35.2 49 0.0011 22.2 2.9 26 40-65 53-78 (127)
107 cd00859 HisRS_anticodon HisRS 35.2 56 0.0012 18.8 2.9 20 44-63 39-58 (91)
108 PRK06806 fructose-bisphosphate 34.9 47 0.001 25.6 3.1 26 37-62 147-172 (281)
109 cd07417 MPP_PP5_C PP5, C-termi 34.6 83 0.0018 24.7 4.5 26 38-64 227-252 (316)
110 PRK00509 argininosuccinate syn 34.5 32 0.00069 28.1 2.2 19 41-59 148-166 (399)
111 cd03018 PRX_AhpE_like Peroxire 34.5 61 0.0013 21.0 3.3 27 38-64 66-92 (149)
112 PRK03745 signal recognition pa 34.2 1E+02 0.0022 20.7 4.2 24 39-62 28-52 (100)
113 cd01992 PP-ATPase N-terminal d 34.0 44 0.00095 22.8 2.6 22 42-63 146-167 (185)
114 PF13669 Glyoxalase_4: Glyoxal 33.9 36 0.00078 21.3 2.0 32 32-63 66-97 (109)
115 cd07419 MPP_Bsu1_C Arabidopsis 33.8 84 0.0018 24.4 4.4 23 42-64 239-261 (311)
116 PF12123 Amidase02_C: N-acetyl 33.7 56 0.0012 19.0 2.6 18 44-61 28-45 (45)
117 cd07249 MMCE Methylmalonyl-CoA 33.6 66 0.0014 19.6 3.2 24 36-59 74-97 (128)
118 TIGR02432 lysidine_TilS_N tRNA 33.6 48 0.001 22.8 2.7 20 45-64 47-66 (189)
119 PF01329 Pterin_4a: Pterin 4 a 33.5 38 0.00082 21.8 2.1 32 23-54 15-46 (95)
120 TIGR03072 release_prfH putativ 33.2 34 0.00074 25.5 2.0 17 49-65 26-42 (200)
121 PF04895 DUF651: Archaeal prot 33.2 30 0.00065 23.6 1.6 22 34-55 62-83 (110)
122 KOG0369|consensus 32.9 40 0.00088 30.6 2.7 23 42-64 170-192 (1176)
123 PF05159 Capsule_synth: Capsul 32.7 45 0.00097 24.5 2.6 23 43-65 11-33 (269)
124 TIGR00655 PurU formyltetrahydr 32.6 54 0.0012 25.3 3.1 29 36-64 112-140 (280)
125 COG3584 Uncharacterized protei 32.4 25 0.00055 24.2 1.1 29 26-54 70-103 (109)
126 cd01205 WASP WASP-type EVH1 do 32.4 48 0.001 22.4 2.5 19 36-54 82-100 (105)
127 PF10686 DUF2493: Protein of u 32.4 53 0.0011 20.3 2.5 19 47-65 48-66 (71)
128 PF05389 MecA: Negative regula 32.2 31 0.00067 25.2 1.6 21 37-57 131-151 (220)
129 COG0216 PrfA Protein chain rel 32.1 38 0.00082 27.8 2.2 19 49-67 139-157 (363)
130 COG5079 SAC3 Nuclear protein e 32.0 27 0.00058 30.4 1.4 25 39-63 356-380 (646)
131 cd08344 MhqB_like_N N-terminal 32.0 77 0.0017 19.5 3.3 28 33-60 54-83 (112)
132 cd00858 GlyRS_anticodon GlyRS 31.9 60 0.0013 21.2 2.9 23 40-63 62-84 (121)
133 PLN03097 FHY3 Protein FAR-RED 31.8 59 0.0013 29.2 3.5 28 36-63 74-105 (846)
134 cd08349 BLMA_like Bleomycin bi 31.7 1E+02 0.0022 18.3 3.7 29 37-65 61-89 (112)
135 cd06575 PASTA_Pbp2x-like_2 PAS 31.6 75 0.0016 16.7 2.8 21 43-63 8-28 (54)
136 KOG1769|consensus 31.5 26 0.00056 23.7 1.1 13 49-61 48-60 (99)
137 PRK10291 glyoxalase I; Provisi 31.4 68 0.0015 20.3 3.0 27 35-61 66-92 (129)
138 TIGR00385 dsbE periplasmic pro 31.2 43 0.00093 23.1 2.2 47 39-85 97-149 (173)
139 TIGR00269 conserved hypothetic 31.1 47 0.001 21.7 2.2 24 40-63 7-30 (104)
140 COG4862 MecA Negative regulato 31.0 42 0.00092 25.8 2.2 22 37-58 131-152 (224)
141 cd00914 PCD_DCoH_subfamily_b P 31.0 60 0.0013 20.0 2.6 29 27-55 1-30 (76)
142 PF00462 Glutaredoxin: Glutare 31.0 63 0.0014 18.1 2.6 19 45-63 12-30 (60)
143 TIGR02540 gpx7 putative glutat 30.9 39 0.00084 22.6 1.9 21 43-63 72-93 (153)
144 PF00588 SpoU_methylase: SpoU 30.7 1.1E+02 0.0025 20.1 4.1 25 40-64 59-83 (142)
145 PRK03353 ribB 3,4-dihydroxy-2- 30.5 48 0.001 24.9 2.5 18 43-60 183-200 (217)
146 COG1453 Predicted oxidoreducta 30.5 43 0.00094 27.7 2.4 20 47-66 187-206 (391)
147 PF08057 Ery_res_leader2: Eryt 30.4 23 0.00051 16.0 0.5 10 36-45 3-12 (14)
148 cd03421 SirA_like_N SirA_like_ 30.4 58 0.0013 19.1 2.4 20 45-64 38-57 (67)
149 PF09840 DUF2067: Uncharacteri 30.4 47 0.001 24.4 2.4 17 38-54 1-17 (190)
150 cd07233 Glyoxalase_I Glyoxalas 30.4 92 0.002 18.9 3.4 28 34-61 70-97 (121)
151 cd03032 ArsC_Spx Arsenate Redu 30.3 51 0.0011 21.5 2.3 22 43-64 11-32 (115)
152 COG0656 ARA1 Aldo/keto reducta 30.2 59 0.0013 25.4 3.0 31 36-67 162-194 (280)
153 KOG4213|consensus 30.1 34 0.00074 25.9 1.6 24 37-60 152-175 (205)
154 PTZ00244 serine/threonine-prot 30.1 1.2E+02 0.0026 23.5 4.7 41 20-64 200-243 (294)
155 PF11387 DUF2795: Protein of u 30.0 58 0.0013 18.4 2.2 21 43-63 8-28 (44)
156 PHA02087 hypothetical protein 30.0 52 0.0011 21.4 2.2 21 37-57 3-23 (83)
157 PRK11478 putative lyase; Provi 29.9 76 0.0017 19.7 3.0 25 36-60 77-101 (129)
158 TIGR00552 nadE NAD+ synthetase 29.4 44 0.00095 24.7 2.1 21 43-63 64-84 (250)
159 smart00034 CLECT C-type lectin 29.4 44 0.00095 20.3 1.8 19 43-61 20-38 (126)
160 TIGR03748 conj_PilL conjugativ 29.3 40 0.00087 23.0 1.7 23 32-54 18-46 (105)
161 cd03593 CLECT_NK_receptors_lik 29.1 41 0.0009 20.9 1.7 21 43-63 20-40 (116)
162 cd03414 CbiX_SirB_C Sirohydroc 29.0 77 0.0017 20.2 3.0 26 40-65 42-67 (117)
163 COG1979 Uncharacterized oxidor 29.0 34 0.00075 28.1 1.5 22 41-62 70-91 (384)
164 cd00861 ProRS_anticodon_short 28.9 48 0.001 20.0 1.9 22 42-63 40-61 (94)
165 PRK00522 tpx lipid hydroperoxi 28.8 59 0.0013 22.4 2.5 27 38-64 80-107 (167)
166 cd00738 HGTP_anticodon HGTP an 28.5 81 0.0017 18.7 2.9 22 43-64 41-62 (94)
167 KOG0638|consensus 28.3 44 0.00095 27.4 2.0 38 23-60 78-117 (381)
168 cd00488 PCD_DCoH PCD_DCoH: The 28.2 94 0.002 18.9 3.1 29 27-55 1-29 (75)
169 PF13263 PHP_C: PHP-associated 28.1 64 0.0014 18.7 2.2 18 46-63 5-22 (56)
170 COG4668 MtlA Mannitol/fructose 28.1 37 0.0008 24.3 1.4 17 38-54 10-26 (142)
171 cd04882 ACT_Bt0572_2 C-termina 28.1 97 0.0021 17.1 3.0 24 36-59 41-64 (65)
172 TIGR01465 cobM_cbiF precorrin- 28.1 89 0.0019 22.2 3.4 30 30-64 78-107 (229)
173 PTZ00480 serine/threonine-prot 28.0 1.2E+02 0.0025 24.1 4.3 40 20-63 207-249 (320)
174 cd04448 DEP_PIKfyve DEP (Dishe 28.0 51 0.0011 20.9 1.9 24 42-65 44-70 (81)
175 PF11216 DUF3012: Protein of u 27.9 44 0.00096 18.3 1.4 10 44-53 19-28 (32)
176 PF00059 Lectin_C: Lectin C-ty 27.8 43 0.00093 19.7 1.5 20 44-63 4-23 (105)
177 cd04442 DEP_1_DEP6 DEP (Dishev 27.8 51 0.0011 21.2 1.9 21 43-63 45-68 (82)
178 PRK10696 tRNA 2-thiocytidine b 27.6 66 0.0014 23.9 2.7 21 44-64 75-95 (258)
179 COG0151 PurD Phosphoribosylami 27.5 72 0.0016 26.7 3.2 34 32-66 40-73 (428)
180 COG5171 YRB1 Ran GTPase-activa 27.5 43 0.00093 25.3 1.7 27 28-54 164-196 (211)
181 cd07416 MPP_PP2B PP2B, metallo 27.5 1.1E+02 0.0023 23.8 4.0 26 38-64 216-241 (305)
182 PRK12465 xylose isomerase; Pro 27.4 74 0.0016 26.8 3.2 30 45-74 221-251 (445)
183 PRK11235 bifunctional antitoxi 27.3 37 0.0008 21.9 1.2 22 37-58 5-26 (80)
184 PRK11370 YciI-like protein; Re 27.2 66 0.0014 20.6 2.4 18 37-54 60-77 (99)
185 cd03036 ArsC_like Arsenate Red 27.0 57 0.0012 21.2 2.1 21 43-63 10-30 (111)
186 COG2154 Pterin-4a-carbinolamin 26.9 80 0.0017 21.3 2.8 34 22-55 15-49 (101)
187 PF10827 DUF2552: Protein of u 26.9 52 0.0011 21.4 1.8 16 40-55 56-71 (79)
188 cd04449 DEP_DEPDC5-like DEP (D 26.7 55 0.0012 20.6 1.9 22 42-63 46-70 (83)
189 COG4545 Glutaredoxin-related p 26.7 71 0.0015 21.1 2.4 21 43-63 13-33 (85)
190 cd03409 Chelatase_Class_II Cla 26.4 95 0.0021 19.0 3.0 24 42-65 44-67 (101)
191 KOG1380|consensus 26.4 27 0.00058 28.9 0.5 12 22-33 234-245 (409)
192 smart00633 Glyco_10 Glycosyl h 26.4 43 0.00093 24.7 1.6 21 39-59 12-32 (254)
193 PLN03181 glycosyltransferase; 26.3 48 0.001 27.9 2.0 20 45-64 154-173 (453)
194 PRK09732 hypothetical protein; 26.2 67 0.0015 22.3 2.4 18 45-62 16-33 (134)
195 KOG3048|consensus 26.2 46 0.00099 24.2 1.6 13 43-55 97-109 (153)
196 PF09691 PulS_OutS: Bacterial 26.1 46 0.00099 22.7 1.5 13 47-59 54-66 (109)
197 COG0269 SgbH 3-hexulose-6-phos 26.0 73 0.0016 24.3 2.8 31 34-64 4-34 (217)
198 cd04440 DEP_2_P-Rex DEP (Dishe 26.0 56 0.0012 21.6 1.9 23 42-64 53-78 (93)
199 PF05172 Nup35_RRM: Nup53/35/4 25.9 56 0.0012 21.7 1.9 22 38-59 57-78 (100)
200 KOG2564|consensus 25.8 41 0.00088 27.3 1.4 22 38-59 199-220 (343)
201 KOG0092|consensus 25.8 56 0.0012 24.7 2.1 52 9-64 97-149 (200)
202 cd04441 DEP_2_DEP6 DEP (Dishev 25.8 58 0.0013 21.1 1.9 22 42-63 48-72 (85)
203 PRK10638 glutaredoxin 3; Provi 25.7 67 0.0015 19.4 2.1 22 44-65 14-35 (83)
204 cd03416 CbiX_SirB_N Sirohydroc 25.5 1.1E+02 0.0024 19.0 3.2 26 39-64 40-65 (101)
205 COG1393 ArsC Arsenate reductas 25.5 60 0.0013 21.9 2.0 21 43-63 12-32 (117)
206 PF07476 MAAL_C: Methylasparta 25.4 60 0.0013 25.3 2.2 17 45-61 178-194 (248)
207 PRK09314 bifunctional 3,4-dihy 25.3 65 0.0014 25.9 2.5 18 43-60 172-189 (339)
208 PF01171 ATP_bind_3: PP-loop f 25.3 46 0.001 23.1 1.5 24 40-63 144-167 (182)
209 cd03592 CLECT_selectins_like C 25.3 59 0.0013 20.4 1.9 19 43-61 10-28 (115)
210 PF15649 Tox-REase-7: Restrict 25.3 94 0.002 20.3 2.9 22 45-66 59-80 (87)
211 PF14555 UBA_4: UBA-like domai 25.2 73 0.0016 17.5 2.1 16 44-59 14-29 (43)
212 PLN02923 xylose isomerase 25.2 75 0.0016 27.0 2.9 30 45-74 255-285 (478)
213 PF02177 APP_N: Amyloid A4 N-t 25.1 61 0.0013 22.0 2.0 25 38-62 30-54 (102)
214 KOG2944|consensus 25.1 1.9E+02 0.004 21.4 4.6 42 30-72 109-152 (170)
215 PF03754 DUF313: Domain of unk 25.1 43 0.00092 22.9 1.2 34 34-67 43-84 (114)
216 TIGR00168 infC translation ini 24.9 1E+02 0.0023 22.1 3.3 33 43-75 23-57 (165)
217 TIGR00746 arcC carbamate kinas 24.9 81 0.0017 24.8 2.9 21 41-61 131-151 (310)
218 PRK02315 adaptor protein; Prov 24.9 80 0.0017 23.6 2.8 23 35-57 138-160 (233)
219 PRK06746 peptide chain release 24.7 57 0.0012 26.2 2.1 16 49-64 113-128 (326)
220 COG0007 CysG Uroporphyrinogen- 24.5 99 0.0022 23.8 3.3 25 41-65 94-118 (244)
221 PF14370 Topo_C_assoc: C-termi 24.4 50 0.0011 21.1 1.4 18 43-60 31-48 (71)
222 PRK11172 dkgB 2,5-diketo-D-glu 24.4 88 0.0019 23.1 2.9 23 44-66 160-182 (267)
223 KOG0091|consensus 24.3 70 0.0015 24.2 2.4 19 44-62 135-153 (213)
224 cd03413 CbiK_C Anaerobic cobal 24.3 1.1E+02 0.0023 20.0 3.0 25 41-65 40-64 (103)
225 PF04221 RelB: RelB antitoxin; 24.2 41 0.00088 21.1 1.0 20 38-57 6-25 (83)
226 PRK12454 carbamate kinase-like 24.2 73 0.0016 25.3 2.6 20 41-60 134-153 (313)
227 cd00037 CLECT C-type lectin (C 24.2 68 0.0015 18.7 1.9 15 42-56 9-23 (116)
228 PRK12865 YciI-like protein; Re 24.1 79 0.0017 20.2 2.3 17 38-54 55-71 (97)
229 PF01922 SRP19: SRP19 protein; 24.1 1.7E+02 0.0036 19.1 3.9 25 39-63 25-49 (95)
230 PF02407 Viral_Rep: Putative v 24.0 8.8 0.00019 24.8 -2.2 12 43-54 74-85 (86)
231 PF02244 Propep_M14: Carboxype 24.0 1E+02 0.0022 18.1 2.7 29 36-64 36-64 (74)
232 PF14277 DUF4364: Domain of un 23.9 76 0.0016 22.7 2.4 23 32-54 124-146 (163)
233 cd08914 START_STARD15-like Lip 23.9 78 0.0017 24.2 2.6 33 33-65 34-66 (236)
234 TIGR01617 arsC_related transcr 23.9 71 0.0015 20.8 2.1 22 43-64 10-31 (117)
235 PRK13789 phosphoribosylamine-- 23.8 89 0.0019 25.2 3.0 25 40-64 127-151 (426)
236 COG0151 PurD Phosphoribosylami 23.8 83 0.0018 26.3 2.9 24 40-63 122-145 (428)
237 PRK09318 bifunctional 3,4-dihy 23.8 71 0.0015 26.1 2.5 18 43-60 159-176 (387)
238 PRK00028 infC translation init 23.8 1.1E+02 0.0023 22.3 3.2 34 43-76 35-70 (177)
239 PRK02628 nadE NAD synthetase; 23.7 62 0.0013 27.8 2.2 20 44-63 409-428 (679)
240 PF11247 DUF2675: Protein of u 23.7 41 0.00089 22.8 0.9 21 43-63 56-76 (98)
241 PF13539 Peptidase_M15_4: D-al 23.6 1E+02 0.0022 18.5 2.6 38 13-58 15-52 (67)
242 PRK06186 hypothetical protein; 23.6 58 0.0013 24.7 1.8 18 43-60 69-86 (229)
243 KOG0781|consensus 23.5 81 0.0018 27.3 2.8 23 46-68 456-478 (587)
244 cd03418 GRX_GRXb_1_3_like Glut 23.4 94 0.002 17.8 2.4 22 43-64 11-32 (75)
245 PLN03199 delta6-acyl-lipid des 23.3 38 0.00083 27.9 0.9 40 24-63 407-455 (485)
246 PRK13307 bifunctional formalde 23.3 77 0.0017 25.8 2.6 37 20-56 156-195 (391)
247 PRK14019 bifunctional 3,4-dihy 23.1 67 0.0014 25.9 2.2 17 43-59 173-189 (367)
248 KOG2486|consensus 23.1 68 0.0015 25.9 2.2 23 43-65 235-257 (320)
249 cd03033 ArsC_15kD Arsenate Red 23.0 76 0.0016 21.1 2.1 20 44-63 12-31 (113)
250 cd05565 PTS_IIB_lactose PTS_II 22.9 72 0.0016 21.0 2.0 17 47-63 19-35 (99)
251 PF09633 DUF2023: Protein of u 22.8 99 0.0021 21.0 2.7 22 44-65 26-47 (101)
252 PRK00578 prfB peptide chain re 22.8 65 0.0014 26.1 2.1 16 49-64 151-166 (367)
253 PF00071 Ras: Ras family; Int 22.8 95 0.0021 20.0 2.6 20 44-63 123-142 (162)
254 PF06356 DUF1064: Protein of u 22.7 72 0.0016 21.9 2.0 43 39-81 13-71 (118)
255 cd08578 GDPD_NUC-2_fungi Putat 22.7 61 0.0013 25.3 1.9 24 40-63 226-249 (300)
256 TIGR02630 xylose_isom_A xylose 22.7 1E+02 0.0022 25.9 3.2 30 45-74 210-240 (434)
257 PTZ00175 diphthine synthase; P 22.7 1.4E+02 0.003 22.9 3.7 29 30-63 83-111 (270)
258 PTZ00323 NAD+ synthase; Provis 22.5 74 0.0016 24.8 2.3 21 44-64 93-113 (294)
259 PF12671 Amidase_6: Putative a 22.5 62 0.0013 22.4 1.7 16 44-59 3-18 (157)
260 PRK15412 thiol:disulfide inter 22.4 98 0.0021 21.6 2.7 49 38-86 101-155 (185)
261 PF12760 Zn_Tnp_IS1595: Transp 22.4 90 0.002 17.4 2.1 23 43-65 1-23 (46)
262 PRK09958 DNA-binding transcrip 22.4 70 0.0015 21.2 1.9 20 39-58 185-204 (204)
263 PF04343 DUF488: Protein of un 22.3 1.1E+02 0.0024 20.1 2.8 23 41-63 29-51 (122)
264 PF02540 NAD_synthase: NAD syn 22.3 67 0.0015 24.0 1.9 20 44-63 61-80 (242)
265 PRK12420 histidyl-tRNA synthet 22.3 99 0.0021 24.6 3.0 22 43-64 374-395 (423)
266 PRK12677 xylose isomerase; Pro 22.3 1.2E+02 0.0027 24.2 3.6 32 27-59 16-47 (384)
267 PRK00754 signal recognition pa 22.2 1.3E+02 0.0029 19.8 3.2 25 39-63 31-55 (95)
268 cd03588 CLECT_CSPGs C-type lec 22.2 73 0.0016 20.4 1.9 36 27-62 4-39 (124)
269 CHL00199 infC translation init 22.1 1.2E+02 0.0027 22.4 3.3 38 42-80 39-78 (182)
270 cd07414 MPP_PP1_PPKL PP1, PPKL 22.1 2.1E+02 0.0046 22.1 4.7 40 20-63 198-240 (293)
271 cd05564 PTS_IIB_chitobiose_lic 22.1 1.1E+02 0.0023 19.6 2.6 20 45-64 16-35 (96)
272 PRK03147 thiol-disulfide oxido 22.0 1.1E+02 0.0025 20.2 2.9 48 39-86 99-151 (173)
273 cd03034 ArsC_ArsC Arsenate Red 21.9 85 0.0018 20.5 2.2 21 43-63 10-30 (112)
274 PF09363 XFP_C: XFP C-terminal 21.9 93 0.002 23.5 2.6 18 37-54 2-19 (203)
275 TIGR03662 Chlor_Arch_YYY Chlor 21.8 97 0.0021 27.3 3.0 33 38-70 661-695 (723)
276 TIGR00032 argG argininosuccina 21.7 93 0.002 25.3 2.8 20 44-63 147-166 (394)
277 TIGR00268 conserved hypothetic 21.7 1E+02 0.0022 22.9 2.8 20 45-64 54-73 (252)
278 smart00740 PASTA PASTA domain. 21.6 1.4E+02 0.0031 16.1 2.9 20 43-62 13-32 (66)
279 TIGR03645 glyox_marine lactoyl 21.6 1.2E+02 0.0027 20.5 3.0 24 38-61 97-120 (162)
280 TIGR02190 GlrX-dom Glutaredoxi 21.5 99 0.0021 18.5 2.3 21 43-63 19-39 (79)
281 PRK05474 xylose isomerase; Pro 21.5 1.1E+02 0.0025 25.6 3.3 29 46-74 212-241 (437)
282 TIGR01004 PulS_OutS lipoprotei 21.5 64 0.0014 22.8 1.6 13 47-59 70-82 (128)
283 PRK13813 orotidine 5'-phosphat 21.5 1.1E+02 0.0024 21.7 2.8 34 32-65 2-35 (215)
284 TIGR00442 hisS histidyl-tRNA s 21.4 1.1E+02 0.0024 23.8 3.1 21 43-63 359-379 (397)
285 PRK13011 formyltetrahydrofolat 21.3 1.2E+02 0.0027 23.3 3.3 29 36-64 117-145 (286)
286 PRK05589 peptide chain release 21.2 74 0.0016 25.5 2.1 16 49-64 112-127 (325)
287 PRK04247 hypothetical protein; 21.2 1.1E+02 0.0023 23.6 2.9 22 45-66 216-237 (238)
288 COG0504 PyrG CTP synthase (UTP 21.0 65 0.0014 27.7 1.8 17 44-60 360-376 (533)
289 cd02971 PRX_family Peroxiredox 20.9 1.4E+02 0.0031 18.9 3.1 27 38-64 60-87 (140)
290 KOG1502|consensus 20.9 98 0.0021 24.8 2.7 21 45-65 172-192 (327)
291 cd07420 MPP_RdgC Drosophila me 20.9 1.9E+02 0.0042 22.8 4.3 25 38-63 247-271 (321)
292 PRK11070 ssDNA exonuclease Rec 20.9 1E+02 0.0023 26.2 3.0 21 45-65 140-160 (575)
293 PF13380 CoA_binding_2: CoA bi 20.8 90 0.002 20.5 2.2 17 43-59 89-105 (116)
294 TIGR00068 glyox_I lactoylgluta 20.8 1.2E+02 0.0026 19.9 2.7 26 35-60 87-112 (150)
295 PRK08057 cobalt-precorrin-6x r 20.7 1.4E+02 0.0031 22.6 3.4 27 40-66 49-75 (248)
296 cd04443 DEP_GPR155 DEP (Dishev 20.7 84 0.0018 20.1 1.9 22 42-63 46-70 (83)
297 PRK12485 bifunctional 3,4-dihy 20.6 88 0.0019 25.4 2.4 17 43-59 173-189 (369)
298 TIGR00011 YbaK_EbsC ybaK/ebsC 20.6 83 0.0018 21.3 2.0 17 47-63 2-18 (152)
299 PF14174 YycC: YycC-like prote 20.4 95 0.0021 18.8 1.9 17 44-60 7-23 (53)
300 cd03078 GST_N_Metaxin1_like GS 20.3 1.6E+02 0.0035 17.7 3.1 21 43-63 17-37 (73)
301 TIGR00020 prfB peptide chain r 20.3 79 0.0017 25.7 2.1 16 49-64 151-166 (364)
302 cd00340 GSH_Peroxidase Glutath 20.3 1.1E+02 0.0024 20.4 2.5 21 43-63 71-92 (152)
303 PF14057 GGGtGRT: GGGtGRT prot 20.2 84 0.0018 25.1 2.1 21 39-59 18-38 (328)
304 PRK08787 peptide chain release 20.2 80 0.0017 25.2 2.1 16 49-64 92-107 (313)
305 PRK07342 peptide chain release 20.1 80 0.0017 25.5 2.1 16 49-64 115-130 (339)
306 PRK00591 prfA peptide chain re 20.0 81 0.0017 25.6 2.1 17 49-65 138-154 (359)
No 1
>PF04800 ETC_C1_NDUFA4: ETC complex I subunit conserved region; InterPro: IPR006885 This entry represents prokaryotic NADH-ubiquinone oxidoreductase subunits (1.6.5.3 from EC, 1.6.99.3 from EC) from complex I of the electron transport chain initially identified in Neurospora crassa as a 21 kDa protein [].; GO: 0016651 oxidoreductase activity, acting on NADH or NADPH, 0022900 electron transport chain, 0005743 mitochondrial inner membrane; PDB: 2JYA_A 2LJU_A.
Probab=100.00 E-value=1e-49 Score=268.43 Aligned_cols=86 Identities=63% Similarity=1.152 Sum_probs=54.1
Q ss_pred CCcccCCCCceEEEecCCCCccCCCCCCCCCCCCCCc-eeeecCCHHHHHHHHHHcCccEEEeccCCCCCCcccccccCC
Q psy627 1 MQSGTHNANNWFIQFDTRERWENPLMGWCSTGDPLSN-LALNFSSKEDAIQYCQKNGWKFFVEEPKWKTPKVKSYAFNFS 79 (89)
Q Consensus 1 mQSG~~~t~~W~lefe~~~~~~dPLMGWtss~D~~~q-v~l~F~s~e~Aiayae~~G~~Y~V~~p~~~~~~~ksYadNF~ 79 (89)
||||++++++|+||||..++|+|||||||||+||++| |+|.|+|+|+||+|||+|||+|+|++|+.+++++|||||||+
T Consensus 13 mQSG~~~t~~W~lefe~~~~~~~PLMGWtss~D~~~q~v~l~F~skE~Ai~yaer~G~~Y~V~~p~~r~~~~ksY~dNF~ 92 (101)
T PF04800_consen 13 MQSGTANTKKWVLEFETRARWENPLMGWTSSGDPLSQSVRLKFDSKEDAIAYAERNGWDYEVEEPKKRKRRPKSYADNFS 92 (101)
T ss_dssp STTTT-SS--EEEEEEESSS---TTT-SSSS--SEEE-CEEEESSHHHHHHHHHHCT-EEEEE-STT-------------
T ss_pred CCCCCCCCCCEEEeecCcCCcCCCccCCCCCCChhhCeeEeeeCCHHHHHHHHHHcCCeEEEeCCCCCcCCcccHHHhCC
Confidence 8999999999999999889999999999999999998 999999999999999999999999999999999999999999
Q ss_pred cCCCcce
Q psy627 80 WNKRTRT 86 (89)
Q Consensus 80 ~~r~~~~ 86 (89)
|+|+++.
T Consensus 93 ~~r~~~~ 99 (101)
T PF04800_consen 93 WNRRTRW 99 (101)
T ss_dssp -------
T ss_pred cCCCCCC
Confidence 9999874
No 2
>KOG3389|consensus
Probab=100.00 E-value=1.2e-43 Score=254.48 Aligned_cols=89 Identities=62% Similarity=1.052 Sum_probs=87.4
Q ss_pred CCcccCCCCceEEEecCCCCccCCCCCCCCCCCCCCc--eeeecCCHHHHHHHHHHcCccEEEeccCCCCCCcccccccC
Q psy627 1 MQSGTHNANNWFIQFDTRERWENPLMGWCSTGDPLSN--LALNFSSKEDAIQYCQKNGWKFFVEEPKWKTPKVKSYAFNF 78 (89)
Q Consensus 1 mQSG~~~t~~W~lefe~~~~~~dPLMGWtss~D~~~q--v~l~F~s~e~Aiayae~~G~~Y~V~~p~~~~~~~ksYadNF 78 (89)
||||.++++.|.|||+.+.+|+||||||+|++||+++ +.|.|+|+||||+|||+|||+|.|++|+.+++++|||||||
T Consensus 88 tQSg~gntkkWkiefd~r~rWENPLMGWtsTaDPlsNvgm~L~F~tkEdA~sFaEkngW~ydveep~~pk~K~KsYg~NF 167 (178)
T KOG3389|consen 88 TQSGSGNTKKWKIEFDSRLRWENPLMGWTSTADPLSNVGMALAFDTKEDAKSFAEKNGWDYDVEEPNTPKLKVKSYGDNF 167 (178)
T ss_pred hhcccCCccceEEEecchhhccCccccccccCCcccccceeeeeccHHHHHHHHHHcCCcccccCCCCCccccccccccc
Confidence 7999999999999999999999999999999999999 58999999999999999999999999999999999999999
Q ss_pred CcCCCcceecC
Q psy627 79 SWNKRTRTSTK 89 (89)
Q Consensus 79 ~~~r~~~~~~k 89 (89)
+|++++|..||
T Consensus 168 sWn~rtr~~tk 178 (178)
T KOG3389|consen 168 SWNGRTRPETK 178 (178)
T ss_pred cccCCCCCCCC
Confidence 99999999987
No 3
>PF09954 DUF2188: Uncharacterized protein conserved in bacteria (DUF2188); InterPro: IPR018691 This family has no known function.
Probab=91.47 E-value=0.14 Score=30.71 Aligned_cols=41 Identities=20% Similarity=0.415 Sum_probs=26.8
Q ss_pred CCCCCCCCCCceeeecCCHHHHHHHHHH----c-CccEEEeccCCC
Q psy627 27 GWCSTGDPLSNLALNFSSKEDAIQYCQK----N-GWKFFVEEPKWK 67 (89)
Q Consensus 27 GWtss~D~~~qv~l~F~s~e~Aiayae~----~-G~~Y~V~~p~~~ 67 (89)
||.--.+.-......|+|+++||++|+. + +-+..|.....+
T Consensus 10 ~W~v~~eg~~ra~~~~~Tk~eAi~~Ar~~a~~~~~~el~Ih~~dG~ 55 (62)
T PF09954_consen 10 GWAVKKEGAKRASKTFDTKAEAIEAARELAKNQGGGELIIHGRDGK 55 (62)
T ss_pred CceEEeCCCcccccccCcHHHHHHHHHHHHHhCCCcEEEEECCCCe
Confidence 4554444445557899999999998764 3 566666554433
No 4
>PRK05370 argininosuccinate synthase; Validated
Probab=87.04 E-value=0.44 Score=39.59 Aligned_cols=22 Identities=5% Similarity=0.318 Sum_probs=19.3
Q ss_pred eecCCHHHHHHHHHHcCccEEE
Q psy627 40 LNFSSKEDAIQYCQKNGWKFFV 61 (89)
Q Consensus 40 l~F~s~e~Aiayae~~G~~Y~V 61 (89)
-.|+|+++.|+||++|||+..+
T Consensus 164 ~~f~sR~e~i~Ya~~hGIpv~~ 185 (447)
T PRK05370 164 DELGGRAEMSEFLIAHGFDYKM 185 (447)
T ss_pred cccCCHHHHHHHHHHcCCCCCc
Confidence 3689999999999999999754
No 5
>PHA02552 4 head completion protein; Provisional
Probab=85.91 E-value=0.75 Score=33.06 Aligned_cols=21 Identities=33% Similarity=0.673 Sum_probs=18.4
Q ss_pred HHHHHHHHHHcCccEEEeccC
Q psy627 45 KEDAIQYCQKNGWKFFVEEPK 65 (89)
Q Consensus 45 ~e~Aiayae~~G~~Y~V~~p~ 65 (89)
-++|.+||+++||.|.|....
T Consensus 120 ~~Aa~~~a~~~Gw~F~iiTE~ 140 (151)
T PHA02552 120 WKAARALCEKKGWKFKIITED 140 (151)
T ss_pred HHHHHHHHHHcCCEEEEEEHH
Confidence 478999999999999997754
No 6
>PF07045 DUF1330: Protein of unknown function (DUF1330); InterPro: IPR010753 This family consists of several hypothetical bacterial proteins of around 90 residues in length. The function of this family is unknown.; PDB: 2FIU_B 3HHL_A 3DCA_D 3LO3_I.
Probab=81.69 E-value=1.5 Score=26.38 Aligned_cols=21 Identities=29% Similarity=0.347 Sum_probs=16.2
Q ss_pred CCCCCceeeecCCHHHHHHHH
Q psy627 32 GDPLSNLALNFSSKEDAIQYC 52 (89)
Q Consensus 32 ~D~~~qv~l~F~s~e~Aiaya 52 (89)
.++-.-|-|.|+|.++|+++-
T Consensus 36 ~~~~~~viieFPs~~aa~~~~ 56 (65)
T PF07045_consen 36 WDPDRVVIIEFPSMEAAKAWY 56 (65)
T ss_dssp -SSSEEEEEEESSHHHHHHHH
T ss_pred CCCCeEEEEECCCHHHHHHHH
Confidence 444455899999999999873
No 7
>PF08727 P3A: Poliovirus 3A protein like; InterPro: IPR014838 The 3A protein is found in positive-strand RNA viruses. It is a critical component of the poliovirus replication complex, and is also an inhibitor of host cell ER to Golgi transport. ; GO: 0003968 RNA-directed RNA polymerase activity, 0004197 cysteine-type endopeptidase activity, 0017111 nucleoside-triphosphatase activity; PDB: 1NG7_A.
Probab=80.99 E-value=1.1 Score=27.70 Aligned_cols=16 Identities=25% Similarity=0.843 Sum_probs=10.7
Q ss_pred HHHHHHHHHcCccEEE
Q psy627 46 EDAIQYCQKNGWKFFV 61 (89)
Q Consensus 46 e~Aiayae~~G~~Y~V 61 (89)
++-+.||+++||-..+
T Consensus 30 ~eV~~YC~~~GWIip~ 45 (57)
T PF08727_consen 30 PEVREYCEEQGWIIPA 45 (57)
T ss_dssp HHHHHHHHHHT--TT-
T ss_pred HHHHHHHHHCCccccC
Confidence 4589999999996443
No 8
>PF08722 Tn7_Tnp_TnsA_N: TnsA endonuclease N terminal; InterPro: IPR014833 The Tn7 transposase is composed of proteins TnsA and TnsB. DNA breakage at the 5'-end of the transposon is carried out by TnsA, and breakage and joining at the 3'-end is carried out by TnsB. The N-terminal domain of TnsA is catalytic. ; PDB: 1F1Z_B 1T0F_B.
Probab=79.00 E-value=2.5 Score=26.50 Aligned_cols=19 Identities=21% Similarity=0.291 Sum_probs=13.7
Q ss_pred HHHHHHHHHHcCccEEEec
Q psy627 45 KEDAIQYCQKNGWKFFVEE 63 (89)
Q Consensus 45 ~e~Aiayae~~G~~Y~V~~ 63 (89)
.+.|-.||+++||+|.|..
T Consensus 69 ~~~~~~y~~~~g~~f~ivT 87 (88)
T PF08722_consen 69 LEIEREYWEEQGIPFRIVT 87 (88)
T ss_dssp HHHHHHHHHHCT--EEEE-
T ss_pred HHHHHHHHHHcCCeEEEEc
Confidence 4568899999999999864
No 9
>PF11360 DUF3110: Protein of unknown function (DUF3110); InterPro: IPR021503 This family of proteins has no known function.
Probab=78.40 E-value=2.1 Score=27.95 Aligned_cols=30 Identities=30% Similarity=0.730 Sum_probs=23.8
Q ss_pred CceeeecCCHHHHHHH-------------------------HHHcCccEEEeccC
Q psy627 36 SNLALNFSSKEDAIQY-------------------------CQKNGWKFFVEEPK 65 (89)
Q Consensus 36 ~qv~l~F~s~e~Aiay-------------------------ae~~G~~Y~V~~p~ 65 (89)
.++.|-|.+.+||.+| |+.+|+.|.|..+.
T Consensus 23 ~~~Vl~FE~edDA~RYa~lLEAqd~~~p~Ve~id~~~i~~fC~~~gy~~~iv~~g 77 (86)
T PF11360_consen 23 RNVVLMFEDEDDAERYAGLLEAQDFPDPTVEEIDPEEIEEFCRSAGYEYEIVPPG 77 (86)
T ss_pred CCEEEEEccHHHHHHHHHHHHhcCCCCCCeEEECHHHHHHHHHHCCceEEEECCC
Confidence 7889999999999776 56677777777665
No 10
>PF05573 NosL: NosL; InterPro: IPR008719 NosL is one of the accessory proteins of the nos (nitrous oxide reductase) gene cluster. NosL is a monomeric protein of 18,540 MW that specifically and stoichiometrically binds Cu(I). The copper ion in NosL is ligated by a Cys residue, and one Met and one His are thought to serve as the other ligands. It is possible that NosL is a copper chaperone involved in metallocentre assembly []. This entry also contains HTH-type transcriptional repressors, including YcnK. YcnK may act as a negative transcriptional regulator of YcnJ in the presence of copper and may use copper as a corepressor. The gene, ycnK, is significantly induced under copper-limiting conditions.; PDB: 2HQ3_A 2HPU_A.
Probab=77.44 E-value=2.2 Score=29.71 Aligned_cols=47 Identities=26% Similarity=0.596 Sum_probs=26.4
Q ss_pred CceEEEecCCCCccCCCC-CCCCC-----------CCCCCceeeecCCHHHHHHHHHHcC
Q psy627 9 NNWFIQFDTRERWENPLM-GWCST-----------GDPLSNLALNFSSKEDAIQYCQKNG 56 (89)
Q Consensus 9 ~~W~lefe~~~~~~dPLM-GWtss-----------~D~~~qv~l~F~s~e~Aiayae~~G 56 (89)
..||-+|.. ..|.+|-- -|... .-|+.---+-|.++++|.+|+++||
T Consensus 77 ~~~V~D~~~-~~~~~p~~~~~Ida~~A~yV~gs~~~~pMg~~~~aF~~~~~A~~F~~~~G 135 (149)
T PF05573_consen 77 AIYVTDYST-TDWDQPGDGEWIDAKKAYYVVGSDVRGPMGPDLIAFASKEDAEAFAKEHG 135 (149)
T ss_dssp EEEEEES-T-S-TT-S---EEEEGGGSEEEEESSSS-TTS--EEEES-HHHHHHHHHHTE
T ss_pred EEEEEeccc-ccccCCCCCceEEcceEEEEECCCcCCCCCCcccccCCHHHHHHHHHHcC
Confidence 457888876 44444432 23322 1233334688999999999999998
No 11
>PF06194 Phage_Orf51: Phage Conserved Open Reading Frame 51; InterPro: IPR009338 This entry is represented by the Staphylococcus phage PVL (bacteriophage phi-PVL), Orf51. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches.
Probab=77.16 E-value=2.1 Score=27.99 Aligned_cols=26 Identities=19% Similarity=0.294 Sum_probs=23.3
Q ss_pred CceeeecCCHHHHHHHHHHcCccEEE
Q psy627 36 SNLALNFSSKEDAIQYCQKNGWKFFV 61 (89)
Q Consensus 36 ~qv~l~F~s~e~Aiayae~~G~~Y~V 61 (89)
+.|+..|+|+++-..|-+.+++.|+=
T Consensus 48 ~gvKktFn~~~Ef~~Yi~~~~L~~ee 73 (80)
T PF06194_consen 48 KGVKKTFNNKEEFENYIKQHELYFEE 73 (80)
T ss_pred cchhhhcCcHHHHHHHHHHcCCceeh
Confidence 45899999999999999999999863
No 12
>PF02844 GARS_N: Phosphoribosylglycinamide synthetase, N domain; InterPro: IPR020562 Phosphoribosylglycinamide synthetase (6.3.4.13 from EC) (GARS) (phosphoribosylamine glycine ligase) [] catalyses the second step in the de novo biosynthesis of purine. The reaction catalysed by phosphoribosylglycinamide synthetase is the ATP-dependent addition of 5-phosphoribosylamine to glycine to form 5'phosphoribosylglycinamide: ATP + 5-phosphoribosylamine + glycine = ADP + Pi + 5'-phosphoribosylglycinamide In bacteria, GARS is a monofunctional enzyme (encoded by the purD gene). In yeast, GARS is part of a bifunctional enzyme (encoded by the ADE5/7 gene) in conjunction with phosphoribosylformylglycinamidine cyclo-ligase (AIRS) (IPR000728 from INTERPRO). In higher eukaryotes, GARS is part of a trifunctional enzyme in conjunction with AIRS (IPR000728 from INTERPRO) and with phosphoribosylglycinamide formyltransferase (GART) (), forming GARS-AIRS-GART. This entry represents the N-domain, which is related to the N-terminal domain of biotin carboxylase/carbamoyl phosphate synthetase (IPR005481 from INTERPRO).; GO: 0004637 phosphoribosylamine-glycine ligase activity, 0009113 purine base biosynthetic process; PDB: 3MJF_A 2XD4_A 2XCL_A 2IP4_A 2YW2_B 2YYA_A 3LP8_A 1VKZ_A 2YS6_A 2YRX_A ....
Probab=73.99 E-value=3.8 Score=27.48 Aligned_cols=30 Identities=17% Similarity=0.403 Sum_probs=23.8
Q ss_pred ceeeecCCHHHHHHHHHHcCccEEEeccCC
Q psy627 37 NLALNFSSKEDAIQYCQKNGWKFFVEEPKW 66 (89)
Q Consensus 37 qv~l~F~s~e~Aiayae~~G~~Y~V~~p~~ 66 (89)
.+.+...+.++-++||+++++++.|.-|..
T Consensus 43 ~~~~~~~d~~~l~~~a~~~~idlvvvGPE~ 72 (100)
T PF02844_consen 43 NVPIDITDPEELADFAKENKIDLVVVGPEA 72 (100)
T ss_dssp EE-S-TT-HHHHHHHHHHTTESEEEESSHH
T ss_pred ecCCCCCCHHHHHHHHHHcCCCEEEECChH
Confidence 356688999999999999999999998854
No 13
>cd01207 Ena-Vasp Enabled-VASP-type homology (EVH1) domain. Enabled-VASP-type homology (EVH1) domain. The EVH1 domain binds to other proteins at proline rich sequences. It is found in proteins involved in cytoskeletal reorganization such as Enabled and VASP. Ena-VASP type EVH1 domains specifically recognize FPPPP motifs in the focal adhesion proteins zyxin and vinculin, and the ActA surface protein of Listeria monocytogenes. It has a PH-like fold, despite having minimal sequence similarity to PH or PTB domains.
Probab=72.19 E-value=3.5 Score=28.22 Aligned_cols=22 Identities=36% Similarity=0.448 Sum_probs=18.6
Q ss_pred CCCceeeecCCHHHHHHHHHHc
Q psy627 34 PLSNLALNFSSKEDAIQYCQKN 55 (89)
Q Consensus 34 ~~~qv~l~F~s~e~Aiayae~~ 55 (89)
...+..|.|.|+|||-+|.+..
T Consensus 82 ~~~v~GLnF~Se~eA~~F~~~v 103 (111)
T cd01207 82 ARQVYGLNFGSKEDATMFASAM 103 (111)
T ss_pred CCeEEeeccCCHHHHHHHHHHH
Confidence 3478899999999999998753
No 14
>PRK04527 argininosuccinate synthase; Provisional
Probab=71.40 E-value=2.8 Score=34.23 Aligned_cols=22 Identities=9% Similarity=0.291 Sum_probs=18.1
Q ss_pred eeee-cCCHHHHHHHHHHcCccE
Q psy627 38 LALN-FSSKEDAIQYCQKNGWKF 59 (89)
Q Consensus 38 v~l~-F~s~e~Aiayae~~G~~Y 59 (89)
..|. ++++++.|+||++|||+.
T Consensus 146 ~~~~k~~~R~~~i~ya~~~gipv 168 (400)
T PRK04527 146 IQKEHTQTRAYEQKYLEERGFGV 168 (400)
T ss_pred hcCcccccHHHHHHHHHHcCCCC
Confidence 3443 469999999999999986
No 15
>COG5470 Uncharacterized conserved protein [Function unknown]
Probab=69.94 E-value=5.2 Score=27.03 Aligned_cols=26 Identities=31% Similarity=0.410 Sum_probs=20.3
Q ss_pred CCCCCCCCCCCCCCceeeecCCHHHHHHH
Q psy627 23 NPLMGWCSTGDPLSNLALNFSSKEDAIQY 51 (89)
Q Consensus 23 dPLMGWtss~D~~~qv~l~F~s~e~Aiay 51 (89)
++|=| ..+|-..|.+.|+|+|.|.++
T Consensus 44 ~~lEG---~w~ptr~vviEFps~~~ar~~ 69 (96)
T COG5470 44 ETLEG---EWRPTRNVVIEFPSLEAARDC 69 (96)
T ss_pred eeccC---CCCcccEEEEEcCCHHHHHHH
Confidence 44444 357778899999999999876
No 16
>KOG2854|consensus
Probab=69.43 E-value=4.2 Score=32.91 Aligned_cols=23 Identities=17% Similarity=0.569 Sum_probs=20.7
Q ss_pred eeeecCCHHHHHHHHHHcCccEE
Q psy627 38 LALNFSSKEDAIQYCQKNGWKFF 60 (89)
Q Consensus 38 v~l~F~s~e~Aiayae~~G~~Y~ 60 (89)
+-+-|.+.+||.+||+.+||+-.
T Consensus 215 ~DiifgNe~EA~af~~~~~~~t~ 237 (343)
T KOG2854|consen 215 ADIIFGNEDEAAAFARAHGWETK 237 (343)
T ss_pred ceEEEcCHHHHHHHHHhhCCccc
Confidence 57899999999999999999863
No 17
>PF05406 WGR: WGR domain; InterPro: IPR008893 This domain is named after the most conserved central motif of the domain. It is found in a variety of polyA polymerases as well as the Escherichia coli molybdate metabolism regulator P33345 from SWISSPROT and other proteins of unknown function.The domain is found in isolation in proteins such as Q9JN21 from SWISSPROT and is between 70 and 80 residues in length. ; PDB: 2EOC_A 2RA8_A 4DQY_C 2CR9_A.
Probab=69.08 E-value=3.6 Score=25.61 Aligned_cols=45 Identities=22% Similarity=0.262 Sum_probs=25.9
Q ss_pred CCceEEEecCCCCcc-CCCCCCCCCCCCCCceeeecCCHHHHHHHHHH
Q psy627 8 ANNWFIQFDTRERWE-NPLMGWCSTGDPLSNLALNFSSKEDAIQYCQK 54 (89)
Q Consensus 8 t~~W~lefe~~~~~~-dPLMGWtss~D~~~qv~l~F~s~e~Aiayae~ 54 (89)
.+.|.|+..+..-|. --==|-.|+. ...++ -.|+|.|+|++.+++
T Consensus 19 ~kfY~iql~~~~~~~v~~~wGRiG~~-gq~~~-~~f~s~~eA~~~f~~ 64 (81)
T PF05406_consen 19 NKFYRIQLLPDLEWVVFRRWGRIGSK-GQTRI-KPFDSEEEAIKEFEK 64 (81)
T ss_dssp EEEEEEEEEEETTEEEEEEEEETTSS-EEEEE-EEESSHHHHHHHHHH
T ss_pred EEEEEEEEEeCCCeEEEEEECCCCCc-CcEEE-EeCCCHHHHHHHHHH
Confidence 456777776432110 1111334554 33444 489999999998875
No 18
>PF03462 PCRF: PCRF domain; InterPro: IPR005139 This domain is found in peptide chain release factors. Peptide chain release factors are important for protein synthesis since they direct the termination of translation in response to the peptide chain termination codons UAG and UAA. These are structurally distinct but both contain the PCRF domain [].; GO: 0016149 translation release factor activity, codon specific, 0006415 translational termination, 0005737 cytoplasm; PDB: 3D5A_X 3D5C_X 3MR8_V 3MS0_V 3F1G_X 3F1E_X 1ZBT_A 2IHR_1 2X9R_Y 2X9T_Y ....
Probab=68.39 E-value=4.1 Score=27.31 Aligned_cols=15 Identities=33% Similarity=0.802 Sum_probs=12.7
Q ss_pred HHHHHHcCccEEEec
Q psy627 49 IQYCQKNGWKFFVEE 63 (89)
Q Consensus 49 iayae~~G~~Y~V~~ 63 (89)
+.||+++||.++|..
T Consensus 73 ~~~a~~~gw~~~~l~ 87 (115)
T PF03462_consen 73 QRYAERRGWKVEVLD 87 (115)
T ss_dssp HHHHHHTT-EEEEEE
T ss_pred HHHHHHcCCEEEEEe
Confidence 489999999999987
No 19
>PF03108 DBD_Tnp_Mut: MuDR family transposase; InterPro: IPR004332 The plant MuDR transposase domain is present in plant proteins that are presumed to be the transposases for Mutator transposable elements [, ]. The function of these proteins is unknown. More information about these proteins can be found at Protein of the Month: Transposase [].
Probab=66.45 E-value=7.2 Score=23.20 Aligned_cols=35 Identities=34% Similarity=0.553 Sum_probs=26.9
Q ss_pred CCCCceeeecCCHHHH----HHHHHHcCccEEEeccCCC
Q psy627 33 DPLSNLALNFSSKEDA----IQYCQKNGWKFFVEEPKWK 67 (89)
Q Consensus 33 D~~~qv~l~F~s~e~A----iayae~~G~~Y~V~~p~~~ 67 (89)
||.=.+.+.|+|+++. ..||-+++.+|.|..-...
T Consensus 1 n~~l~~G~~F~~~~e~k~av~~yai~~~~~~~v~ksd~~ 39 (67)
T PF03108_consen 1 NPELEVGQTFPSKEEFKEAVREYAIKNGFEFKVKKSDKK 39 (67)
T ss_pred CCccccCCEECCHHHHHHHHHHHHHhcCcEEEEeccCCE
Confidence 4555688999999876 4689999999998765443
No 20
>PF00568 WH1: WH1 domain; InterPro: IPR000697 The EVH1 (WH1, RanBP1-WASP) domain is found in multi-domain proteins implicated in a diverse range of signalling, nuclear transport and cytoskeletal events. This domain of around 115 amino acids is present in species ranging from yeast to mammals. Many EVH1-containing proteins associate with actin-based structures and play a role in cytoskeletal organisation. EVH1 domains recognise and bind the proline-rich motif FPPPP with low-affinity, further interactions then form between flanking residues [][]. WASP family proteins contain a EVH1 (WH1) in their N-terminals which bind proline-rich sequences in the WASP interacting protein. Proteins of the RanBP1 family contain a WH1 domain in their N-terminal region, which seems to bind a different sequence motif present in the C-terminal part of RanGTP protein [,]. Tertiary structure of the WH1 domain of the Mena protein revealed structure similarities with the pleckstrin homology (PH) domain. The overall fold consists of a compact parallel beta-sandwich, closed along one edge by a long alpha-helix. A highly conserved cluster of three surface-exposed aromatic side-chains forms the recognition site for the molecules target ligands. [].; GO: 0005515 protein binding; PDB: 1I2H_A 1DDV_A 1DDW_A 1EGX_A 3SYX_A 1TJ6_B 1XOD_B 1EVH_A 1I7A_B 2JP2_A ....
Probab=66.18 E-value=6.4 Score=25.93 Aligned_cols=19 Identities=42% Similarity=0.607 Sum_probs=17.2
Q ss_pred CceeeecCCHHHHHHHHHH
Q psy627 36 SNLALNFSSKEDAIQYCQK 54 (89)
Q Consensus 36 ~qv~l~F~s~e~Aiayae~ 54 (89)
..+.|.|.|.+||-.|+++
T Consensus 88 ~~~GLnF~se~eA~~F~~~ 106 (111)
T PF00568_consen 88 CVYGLNFASEEEADQFYKK 106 (111)
T ss_dssp CEEEEEESSHHHHHHHHHH
T ss_pred eEEEEecCCHHHHHHHHHH
Confidence 4889999999999999875
No 21
>PF12518 DUF3721: Protein of unknown function; InterPro: IPR022196 This domain family is found in bacteria and eukaryotes, and is approximately 30 amino acids in length. There is a conserved WMPC sequence motif. There are two completely conserved residues (A and C) that may be functionally important.
Probab=65.57 E-value=6.5 Score=21.86 Aligned_cols=16 Identities=19% Similarity=0.378 Sum_probs=14.1
Q ss_pred cCCHHHHHHHHHHcCc
Q psy627 42 FSSKEDAIQYCQKNGW 57 (89)
Q Consensus 42 F~s~e~Aiayae~~G~ 57 (89)
|+|+++|.+=|.+.|-
T Consensus 1 y~tk~eAe~~A~~~GC 16 (34)
T PF12518_consen 1 YPTKAEAEKRAKELGC 16 (34)
T ss_pred CCcHHHHHHHHHHcCC
Confidence 8999999999998774
No 22
>KOG4748|consensus
Probab=65.52 E-value=4.6 Score=32.89 Aligned_cols=22 Identities=23% Similarity=0.575 Sum_probs=18.5
Q ss_pred CHHHHHHHHHHcCccEEEeccC
Q psy627 44 SKEDAIQYCQKNGWKFFVEEPK 65 (89)
Q Consensus 44 s~e~Aiayae~~G~~Y~V~~p~ 65 (89)
+-+-=|+||++||+++++..-.
T Consensus 127 ~ikNridYA~rHgy~~~~~~~~ 148 (364)
T KOG4748|consen 127 SIKNRIDYARRHGYEFEYKNAT 148 (364)
T ss_pred HHHhHHHHHHHhCCeEEEEecc
Confidence 6677799999999999987654
No 23
>cd00837 EVH1 EVH1 (Enabled, Vasp-Homology) or WASP Homology (WH1) domain. EVH1 (Enabled, Vasp-Homology) or WASP Homology (WH1) domain. The EVH1 domain binds to other proteins at proline rich sequences in either FPPPP or PPXXF motifs. It is found in the cytoskeletal reorganization proteins Enabled VASP, and WASP, and in the synaptic scaffolding protein Homer. It has a PH-like fold, despite having minimal sequence similarity to PH or PTB domains.
Probab=63.74 E-value=7.8 Score=25.39 Aligned_cols=20 Identities=40% Similarity=0.549 Sum_probs=17.5
Q ss_pred CCceeeecCCHHHHHHHHHH
Q psy627 35 LSNLALNFSSKEDAIQYCQK 54 (89)
Q Consensus 35 ~~qv~l~F~s~e~Aiayae~ 54 (89)
...+.|.|.|.+||-+|.++
T Consensus 80 ~~~~GL~F~se~eA~~F~~~ 99 (104)
T cd00837 80 NCVYGLNFASEEEAAQFRKK 99 (104)
T ss_pred CcEEEEeeCCHHHHHHHHHH
Confidence 44789999999999999875
No 24
>PRK05773 3,4-dihydroxy-2-butanone 4-phosphate synthase; Validated
Probab=62.35 E-value=7.3 Score=29.50 Aligned_cols=18 Identities=28% Similarity=0.372 Sum_probs=16.2
Q ss_pred CCHHHHHHHHHHcCccEE
Q psy627 43 SSKEDAIQYCQKNGWKFF 60 (89)
Q Consensus 43 ~s~e~Aiayae~~G~~Y~ 60 (89)
-++++|++||++||+++.
T Consensus 191 ~~~~~~~~fA~~~~l~~i 208 (219)
T PRK05773 191 LSKEKAKKIAKNLGFPLV 208 (219)
T ss_pred cCHHHHHHHHHHcCCcEE
Confidence 479999999999999983
No 25
>smart00461 WH1 WASP homology region 1. Region of the Wiskott-Aldrich syndrome protein (WASp) that contains point mutations in the majority of patients with WAS. Unknown function. Ena-like WH1 domains bind polyproline-containing peptides, and that Homer contains a WH1 domain.
Probab=61.67 E-value=8.7 Score=25.36 Aligned_cols=20 Identities=35% Similarity=0.522 Sum_probs=17.4
Q ss_pred CCceeeecCCHHHHHHHHHH
Q psy627 35 LSNLALNFSSKEDAIQYCQK 54 (89)
Q Consensus 35 ~~qv~l~F~s~e~Aiayae~ 54 (89)
...+.|.|.|.+||-+|.++
T Consensus 82 ~~~~GLnF~se~EA~~F~~~ 101 (106)
T smart00461 82 KCVYGLNFASEEEAKKFRKK 101 (106)
T ss_pred CeEEEeecCCHHHHHHHHHH
Confidence 34689999999999999876
No 26
>PF01693 Cauli_VI: Caulimovirus viroplasmin; InterPro: IPR011320 This entry represents the N-terminal domain of RNase HI, which has a 3-layer alpha/beta/alpha structure []. This domain is lacking in retroviral and prokaryotic enzymes, but shows a striking structural similarity to the ribosomal protein L9 N-terminal domain, and may function as a regulatory RNA-binding module. However, the topology of this domain differs from structures of known RNA binding domains such as the double-stranded RNA binding domain (dsRBD), the hnRNP K homology (KH) domain and the RNP motif. Eukaryotic RNases HI possess either one or two copies of this small N-terminal domain, in addition to the well-conserved catalytic RNase H domain. RNase HI belongs to the family of ribonuclease H enzymes that recognise RNA:DNA hybrids and degrade the RNA component. ; PDB: 1QHK_A 3BSU_C.
Probab=61.46 E-value=8.4 Score=21.62 Aligned_cols=14 Identities=21% Similarity=0.560 Sum_probs=12.4
Q ss_pred eeeecCCHHHHHHH
Q psy627 38 LALNFSSKEDAIQY 51 (89)
Q Consensus 38 v~l~F~s~e~Aiay 51 (89)
+...|+|.|+|.+|
T Consensus 31 ~~k~F~t~~eA~~~ 44 (44)
T PF01693_consen 31 IYKSFKTREEAEEF 44 (44)
T ss_dssp EEEEESSHHHHHHH
T ss_pred eECCcCCHHHHhhC
Confidence 47899999999986
No 27
>PF01071 GARS_A: Phosphoribosylglycinamide synthetase, ATP-grasp (A) domain; InterPro: IPR020561 Phosphoribosylglycinamide synthetase (6.3.4.13 from EC) (GARS) (phosphoribosylamine glycine ligase) [] catalyses the second step in the de novo biosynthesis of purine. The reaction catalysed by phosphoribosylglycinamide synthetase is the ATP-dependent addition of 5-phosphoribosylamine to glycine to form 5'phosphoribosylglycinamide: ATP + 5-phosphoribosylamine + glycine = ADP + Pi + 5'-phosphoribosylglycinamide In bacteria, GARS is a monofunctional enzyme (encoded by the purD gene). In yeast, GARS is part of a bifunctional enzyme (encoded by the ADE5/7 gene) in conjunction with phosphoribosylformylglycinamidine cyclo-ligase (AIRS) (IPR000728 from INTERPRO). In higher eukaryotes, GARS is part of a trifunctional enzyme in conjunction with AIRS (IPR000728 from INTERPRO) and with phosphoribosylglycinamide formyltransferase (GART) (), forming GARS-AIRS-GART. This entry represents the A-domain of the enzyme, and is related to the ATP-grasp domain of biotin carboxylase/carbamoyl phosphate synthetase.; PDB: 1GSO_A 3LP8_A 2IP4_A 1VKZ_A 2QK4_A 2XD4_A 2XCL_A 3MJF_A 2YRW_A 2YS6_A ....
Probab=60.52 E-value=10 Score=28.02 Aligned_cols=26 Identities=23% Similarity=0.526 Sum_probs=21.7
Q ss_pred eeecCCHHHHHHHHHHcCccEEEecc
Q psy627 39 ALNFSSKEDAIQYCQKNGWKFFVEEP 64 (89)
Q Consensus 39 ~l~F~s~e~Aiayae~~G~~Y~V~~p 64 (89)
--.|.+.++|++|.+.++.++.|+.+
T Consensus 20 ~~~f~~~~~A~~~l~~~~~p~~ViKa 45 (194)
T PF01071_consen 20 YKVFTDYEEALEYLEEQGYPYVVIKA 45 (194)
T ss_dssp EEEESSHHHHHHHHHHHSSSEEEEEE
T ss_pred eeEECCHHHHHHHHHhcCCCceEEcc
Confidence 35799999999999999999955444
No 28
>PRK09437 bcp thioredoxin-dependent thiol peroxidase; Reviewed
Probab=59.39 E-value=13 Score=24.74 Aligned_cols=27 Identities=15% Similarity=0.236 Sum_probs=24.1
Q ss_pred eeeecCCHHHHHHHHHHcCccEEEecc
Q psy627 38 LALNFSSKEDAIQYCQKNGWKFFVEEP 64 (89)
Q Consensus 38 v~l~F~s~e~Aiayae~~G~~Y~V~~p 64 (89)
|.+..++.+++.+|++++|++|.+...
T Consensus 68 i~Is~d~~~~~~~~~~~~~~~~~~l~D 94 (154)
T PRK09437 68 LGISTDKPEKLSRFAEKELLNFTLLSD 94 (154)
T ss_pred EEEcCCCHHHHHHHHHHhCCCCeEEEC
Confidence 578889999999999999999988764
No 29
>COG0108 RibB 3,4-dihydroxy-2-butanone 4-phosphate synthase [Coenzyme metabolism]
Probab=59.31 E-value=8.6 Score=29.05 Aligned_cols=21 Identities=10% Similarity=0.249 Sum_probs=17.3
Q ss_pred CCHHHHHHHHHHcCccEEEec
Q psy627 43 SSKEDAIQYCQKNGWKFFVEE 63 (89)
Q Consensus 43 ~s~e~Aiayae~~G~~Y~V~~ 63 (89)
..++++++||++||+++.-.+
T Consensus 172 ar~~~~~~fa~~h~l~~iti~ 192 (203)
T COG0108 172 ARLPELEEFAKEHGLPVITIE 192 (203)
T ss_pred cChHHHHHHHHHcCCcEEEHH
Confidence 468999999999999986433
No 30
>COG0137 ArgG Argininosuccinate synthase [Amino acid transport and metabolism]
Probab=59.07 E-value=7.4 Score=32.16 Aligned_cols=22 Identities=23% Similarity=0.527 Sum_probs=19.2
Q ss_pred eecCCHHHHHHHHHHcCccEEEe
Q psy627 40 LNFSSKEDAIQYCQKNGWKFFVE 62 (89)
Q Consensus 40 l~F~s~e~Aiayae~~G~~Y~V~ 62 (89)
..|. +|+.|+||+++||++.+.
T Consensus 150 w~~~-R~~~i~Ya~~~gipv~~~ 171 (403)
T COG0137 150 WNLT-REEEIEYAEEHGIPVKAT 171 (403)
T ss_pred hccC-hHHHHHHHHHcCCCcccc
Confidence 4566 999999999999998776
No 31
>PF13031 DUF3892: Protein of unknown function (DUF3892)
Probab=58.43 E-value=10 Score=23.89 Aligned_cols=20 Identities=40% Similarity=0.895 Sum_probs=18.0
Q ss_pred CCHHHHHHHHHHcCccEEEe
Q psy627 43 SSKEDAIQYCQKNGWKFFVE 62 (89)
Q Consensus 43 ~s~e~Aiayae~~G~~Y~V~ 62 (89)
-|+++||+++|...+.|.|.
T Consensus 31 ~s~~~~i~~ie~g~~~~yv~ 50 (85)
T PF13031_consen 31 YSREEAIAWIENGKWSFYVE 50 (85)
T ss_pred ccHHHHHHHHHcCCceEEeC
Confidence 48999999999988888887
No 32
>PF07862 Nif11: Nitrogen fixation protein of unknown function; InterPro: IPR012903 This domain is found in the cyanobacteria, and the nitrogen-fixing proteobacterium Azotobacter vinelandii and may be involved in nitrogen fixation, but no role has been assigned [].
Probab=57.73 E-value=5.4 Score=22.64 Aligned_cols=19 Identities=16% Similarity=0.471 Sum_probs=17.0
Q ss_pred CCHHHHHHHHHHcCccEEE
Q psy627 43 SSKEDAIQYCQKNGWKFFV 61 (89)
Q Consensus 43 ~s~e~Aiayae~~G~~Y~V 61 (89)
.+.|+-|++|+.+|+.+..
T Consensus 27 ~~~~e~~~lA~~~Gy~ft~ 45 (49)
T PF07862_consen 27 QNPEEVVALAREAGYDFTE 45 (49)
T ss_pred CCHHHHHHHHHHcCCCCCH
Confidence 5999999999999998864
No 33
>PF08803 ydhR: Putative mono-oxygenase ydhR; InterPro: IPR014910 YdhR is a homodimeric protein that comprises of a central four-stranded beta sheet and four surrounding alpha helices []. It shows structural homology to the ActVA-Orf6 and YgiN proteins which indicates it could be a mono-oxygenase. ; PDB: 1WD6_B 2HIQ_B 2ASY_B.
Probab=57.21 E-value=8.1 Score=26.02 Aligned_cols=28 Identities=14% Similarity=0.245 Sum_probs=20.6
Q ss_pred CCCCCCCCCc-eeeecCCHHHHHHHHHHc
Q psy627 28 WCSTGDPLSN-LALNFSSKEDAIQYCQKN 55 (89)
Q Consensus 28 Wtss~D~~~q-v~l~F~s~e~Aiayae~~ 55 (89)
||.+.++..- =-.-|+|+++|.+|+++|
T Consensus 38 Wten~~t~eaGGiYLFe~e~~A~aY~~~h 66 (97)
T PF08803_consen 38 WTENEETGEAGGIYLFEDEASAEAYLEMH 66 (97)
T ss_dssp EEEETTTTEEEEEEEESSHHHHHHHHHHH
T ss_pred EEecCCCCccceEEEECCHHHHHHHHHHH
Confidence 6666665432 135589999999999987
No 34
>PLN00139 hypothetical protein; Provisional
Probab=56.52 E-value=11 Score=29.61 Aligned_cols=25 Identities=16% Similarity=0.053 Sum_probs=21.0
Q ss_pred eecCCHHHHHHHHHHcCccEEEecc
Q psy627 40 LNFSSKEDAIQYCQKNGWKFFVEEP 64 (89)
Q Consensus 40 l~F~s~e~Aiayae~~G~~Y~V~~p 64 (89)
..-++++++.+||.++|+.|+-.+.
T Consensus 195 F~t~d~~eve~~~~~~g~~~eW~~d 219 (320)
T PLN00139 195 FGTSDKAEAERRAKALGMDMEWLPN 219 (320)
T ss_pred hCCCCHHHHHHHHHHcCCeEEEcCC
Confidence 3347899999999999999987654
No 35
>PF10566 Glyco_hydro_97: Glycoside hydrolase 97 ; InterPro: IPR019563 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. This is the 97th family of glycosidases, in this case bacterial. The central part of the GH97 family protein sequences represents a typical and complete (beta/alpha)8-barrel or catalytic TIM-barrel type domain. The N- and C-terminal parts of the sequences, mainly consisting of beta-strands, most probably form two additional non-catalytic domains with as yet unknown functions. The non-catalytic domains of glycosidases from the alpha-galactosidase and alpha-glucosidase superfamilies are also predominantly composed of beta-strands, and at least some of these domains are involved in oligomerisation and carbohydrate binding. In all known glycosidases with the (beta-alpha)8-barrel fold, the amino acid residues at the active site are located on the C-termini of the beta-strands []. ; PDB: 2JKP_A 2JKE_A 2D73_B 2ZQ0_B 2JKA_A 3A24_A.
Probab=55.98 E-value=9.6 Score=29.63 Aligned_cols=26 Identities=8% Similarity=0.295 Sum_probs=21.0
Q ss_pred eeecCCHHHHHHHHHHcCccEEEecc
Q psy627 39 ALNFSSKEDAIQYCQKNGWKFFVEEP 64 (89)
Q Consensus 39 ~l~F~s~e~Aiayae~~G~~Y~V~~p 64 (89)
.+.+++-..=|+||.++||+|.+...
T Consensus 28 g~~t~~~k~yIDfAa~~G~eYvlvD~ 53 (273)
T PF10566_consen 28 GATTETQKRYIDFAAEMGIEYVLVDA 53 (273)
T ss_dssp SSSHHHHHHHHHHHHHTT-SEEEEBT
T ss_pred CCCHHHHHHHHHHHHHcCCCEEEecc
Confidence 45677888889999999999999853
No 36
>smart00156 PP2Ac Protein phosphatase 2A homologues, catalytic domain. Large family of serine/threonine phosphatases, that includes PP1, PP2A and PP2B (calcineurin) family members.
Probab=55.91 E-value=21 Score=27.14 Aligned_cols=41 Identities=17% Similarity=0.442 Sum_probs=28.4
Q ss_pred CccCC---CCCCCCCCCCCCceeeecCCHHHHHHHHHHcCccEEEecc
Q psy627 20 RWENP---LMGWCSTGDPLSNLALNFSSKEDAIQYCQKNGWKFFVEEP 64 (89)
Q Consensus 20 ~~~dP---LMGWtss~D~~~qv~l~F~s~e~Aiayae~~G~~Y~V~~p 64 (89)
-|.|| .-||..| + ......|. .+++.+|++++|+.+.|..-
T Consensus 176 lWsDP~~~~~~~~~~--~-Rg~g~~fg-~~~~~~Fl~~n~l~~iiR~H 219 (271)
T smart00156 176 LWSDPDQPVDGFQPS--I-RGASYYFG-PDAVDEFLKKNNLKLIIRAH 219 (271)
T ss_pred eecCCCcccCCCccC--C-CCCccccC-HHHHHHHHHHCCCeEEEecC
Confidence 46676 3455544 2 33556674 78888999999999998653
No 37
>PF10075 PCI_Csn8: COP9 signalosome, subunit CSN8; InterPro: IPR019280 The photomorphogenic 9 (COP9) signalosome or CSN complex is composed of eight subunits: Cops1/GPS1, Cops2, Cops3, Cops4, Cops5, Cop6, Cops7 (Cops7A or Cops7B) and Cops8. In the complex, Cops8, which is the smallest subunit, probably interacts directly with Cops3, Cops4 and Cops7 (Cops7A or Cops7B). This signalosome is homologous to the lid subcomplex of the 26S proteasome and regulates the ubiquitin-proteasome pathway. It functions as a structural scaffold for subunit-subunit interactions within the complex and is a key regulator of photomorphogenic development [].; PDB: 1RZ4_A.
Probab=55.49 E-value=7.8 Score=26.25 Aligned_cols=20 Identities=25% Similarity=0.602 Sum_probs=15.0
Q ss_pred eecCCHHHHHHHHHHcCccEE
Q psy627 40 LNFSSKEDAIQYCQKNGWKFF 60 (89)
Q Consensus 40 l~F~s~e~Aiayae~~G~~Y~ 60 (89)
|-|+ .+++++||+++||.++
T Consensus 107 Lg~~-~~el~~~~~~~gW~~d 126 (143)
T PF10075_consen 107 LGLS-EEELEKFIKSRGWTVD 126 (143)
T ss_dssp TTS--HHHHHHHHHHHT-EE-
T ss_pred hCCC-HHHHHHHHHHcCCEEC
Confidence 4556 8999999999999985
No 38
>KOG1706|consensus
Probab=54.53 E-value=7.8 Score=31.77 Aligned_cols=23 Identities=22% Similarity=0.515 Sum_probs=19.9
Q ss_pred ecCCHHHHHHHHHHcCccEEEec
Q psy627 41 NFSSKEDAIQYCQKNGWKFFVEE 63 (89)
Q Consensus 41 ~F~s~e~Aiayae~~G~~Y~V~~ 63 (89)
+|.-++|+++||++|||+.-|..
T Consensus 154 rf~Gr~Dl~eYakq~giPvpvT~ 176 (412)
T KOG1706|consen 154 RFKGRKDLLEYAKQHGIPVPVTP 176 (412)
T ss_pred hhcCchHHHHHHHhcCCCccccC
Confidence 58889999999999999976643
No 39
>PF03793 PASTA: PASTA domain; InterPro: IPR005543 The PASTA domain is found at the C-termini of several Penicillin-binding proteins (PBP) and bacterial serine/threonine kinases. It binds the beta-lactam stem, which implicates it in sensing D-alanyl-D-alanine - the PBP transpeptidase substrate. In PknB of Mycobacterium tuberculosis (P71584 from SWISSPROT), all of the extracellular portion is predicted to be made up of four PASTA domains, which strongly suggests that it is a signal-binding sensor domain. The domain has also been found in proteins involved in cell wall biosynthesis, where it is implicated in localizing the biosynthesis complex to unlinked peptidoglycan. PASTA is a small globular fold consisting of 3 beta-sheets and an alpha-helix, with a loop region of variable length between the first and second beta-strands. The name PASTA is derived from PBP and Serine/Threonine kinase Associated domain [].; GO: 0008658 penicillin binding; PDB: 2ZC3_C 1QME_A 1RP5_B 2Z2M_C 2Z2L_F 2ZC4_C 1QMF_A 3M9G_A 3PY9_A 1K25_B ....
Probab=53.36 E-value=17 Score=20.97 Aligned_cols=22 Identities=18% Similarity=0.158 Sum_probs=17.6
Q ss_pred CCHHHHHHHHHHcCccEEEecc
Q psy627 43 SSKEDAIQYCQKNGWKFFVEEP 64 (89)
Q Consensus 43 ~s~e~Aiayae~~G~~Y~V~~p 64 (89)
-|.++|.+..+..||.+.+...
T Consensus 9 ~~~~~a~~~l~~~g~~~~~~~~ 30 (63)
T PF03793_consen 9 MTYDEAKSILEAAGLTVNVVEE 30 (63)
T ss_dssp SBHHHHHHHHHHTT-EEEEEEE
T ss_pred CcHHHHHHHHHHCCCEEEEEEE
Confidence 4789999999999998877653
No 40
>PF03682 UPF0158: Uncharacterised protein family (UPF0158); InterPro: IPR005361 This is a small family of hypothetical bacterial proteins of unknown function.
Probab=52.60 E-value=11 Score=26.89 Aligned_cols=20 Identities=35% Similarity=0.539 Sum_probs=17.7
Q ss_pred HHHHHHHHHHcCccEEEecc
Q psy627 45 KEDAIQYCQKNGWKFFVEEP 64 (89)
Q Consensus 45 ~e~Aiayae~~G~~Y~V~~p 64 (89)
++.|+..|+.|||.|...++
T Consensus 127 r~~a~eWleen~I~~~~~~~ 146 (163)
T PF03682_consen 127 RERAIEWLEENGIEPVFEED 146 (163)
T ss_pred HHHHHHHHHHcCCCcccccc
Confidence 67899999999999988775
No 41
>PF12745 HGTP_anticodon2: Anticodon binding domain of tRNAs; InterPro: IPR024435 This is an anticodon binding domain, found largely on Gcn2 proteins which bind tRNA to down regulate translation in certain stress situations [].; GO: 0000049 tRNA binding
Probab=52.42 E-value=17 Score=28.09 Aligned_cols=23 Identities=26% Similarity=0.582 Sum_probs=20.3
Q ss_pred CCHHHHHHHHHHcCccEEEeccC
Q psy627 43 SSKEDAIQYCQKNGWKFFVEEPK 65 (89)
Q Consensus 43 ~s~e~Aiayae~~G~~Y~V~~p~ 65 (89)
.|.||.+.+|..+|+.|.|.--.
T Consensus 44 ~S~Eel~~~~~~~gi~wiViikq 66 (273)
T PF12745_consen 44 PSQEELQSYCREDGISWIVIIKQ 66 (273)
T ss_pred CCHHHHHHHHHHCCCCEEEEEec
Confidence 59999999999999999987633
No 42
>PF09383 NIL: NIL domain; InterPro: IPR018449 This domain is found at the C terminus of ABC transporter proteins involved in D-methionine transport as well as a number of ferredoxin-like proteins. This domain is likely to act as a substrate binding domain. The domain has been named after a conserved sequence in some members of the family. ; PDB: 2QRR_A 3CED_A 2QSW_A 3TUZ_D 3TUJ_D 3DHX_B 3TUI_H 3DHW_D.
Probab=51.24 E-value=18 Score=21.88 Aligned_cols=29 Identities=31% Similarity=0.436 Sum_probs=21.1
Q ss_pred CCCceeeecC----CHHHHHHHHHHcCccEEEe
Q psy627 34 PLSNLALNFS----SKEDAIQYCQKNGWKFFVE 62 (89)
Q Consensus 34 ~~~qv~l~F~----s~e~Aiayae~~G~~Y~V~ 62 (89)
|+..+.|.++ ..++|++|.+.+|+..+|.
T Consensus 44 ~~G~l~l~l~g~~~~~~~a~~~L~~~~v~vEvl 76 (76)
T PF09383_consen 44 PFGILILELPGDDEEIEKAIAYLREQGVEVEVL 76 (76)
T ss_dssp EEEEEEEEEES-HHHHHHHHHHHHHTTEEEEEE
T ss_pred eEEEEEEEEECCHHHHHHHHHHHHHCCCeEEEC
Confidence 3445566662 2468999999999998874
No 43
>COG1225 Bcp Peroxiredoxin [Posttranslational modification, protein turnover, chaperones]
Probab=50.62 E-value=13 Score=26.73 Aligned_cols=30 Identities=17% Similarity=0.236 Sum_probs=26.4
Q ss_pred eeeecCCHHHHHHHHHHcCccEEEeccCCC
Q psy627 38 LALNFSSKEDAIQYCQKNGWKFFVEEPKWK 67 (89)
Q Consensus 38 v~l~F~s~e~Aiayae~~G~~Y~V~~p~~~ 67 (89)
+.++-+|.++=-+||+++|++|.+.....+
T Consensus 68 ~GIS~Ds~~~~~~F~~k~~L~f~LLSD~~~ 97 (157)
T COG1225 68 LGISPDSPKSHKKFAEKHGLTFPLLSDEDG 97 (157)
T ss_pred EEEeCCCHHHHHHHHHHhCCCceeeECCcH
Confidence 479999999999999999999999876543
No 44
>cd03017 PRX_BCP Peroxiredoxin (PRX) family, Bacterioferritin comigratory protein (BCP) subfamily; composed of thioredoxin-dependent thiol peroxidases, widely expressed in pathogenic bacteria, that protect cells against toxicity from reactive oxygen species by reducing and detoxifying hydroperoxides. The protein was named BCP based on its electrophoretic mobility before its function was known. BCP shows substrate selectivity toward fatty acid hydroperoxides rather than hydrogen peroxide or alkyl hydroperoxides. BCP contains the peroxidatic cysteine but appears not to possess a resolving cysteine (some sequences, not all, contain a second cysteine but its role is still unknown). Unlike other PRXs, BCP exists as a monomer. The plant homolog of BCP is PRX Q, which is expressed only in leaves and is cellularly localized in the chloroplasts and the guard cells of stomata. Also included in this subfamily is the fungal nuclear protein, Dot5p (for disrupter of telomere silencing protein 5), w
Probab=50.51 E-value=24 Score=22.68 Aligned_cols=28 Identities=18% Similarity=0.272 Sum_probs=24.1
Q ss_pred eeeecCCHHHHHHHHHHcCccEEEeccC
Q psy627 38 LALNFSSKEDAIQYCQKNGWKFFVEEPK 65 (89)
Q Consensus 38 v~l~F~s~e~Aiayae~~G~~Y~V~~p~ 65 (89)
|.+.-++.+++.+|++++|+.|.|....
T Consensus 61 v~is~d~~~~~~~~~~~~~~~~~~l~D~ 88 (140)
T cd03017 61 IGVSPDSVESHAKFAEKYGLPFPLLSDP 88 (140)
T ss_pred EEEcCCCHHHHHHHHHHhCCCceEEECC
Confidence 5677889999999999999999887643
No 45
>COG3341 Predicted double-stranded RNA/RNA-DNA hybrid binding protein [General function prediction only]
Probab=49.87 E-value=9.8 Score=29.21 Aligned_cols=14 Identities=43% Similarity=0.805 Sum_probs=12.9
Q ss_pred ecCCHHHHHHHHHH
Q psy627 41 NFSSKEDAIQYCQK 54 (89)
Q Consensus 41 ~F~s~e~Aiayae~ 54 (89)
+|+|.|+|.+||+-
T Consensus 39 sf~s~EeA~a~~~g 52 (225)
T COG3341 39 SFKSYEEAEAYCEG 52 (225)
T ss_pred ccccHHHHHHHhcc
Confidence 49999999999995
No 46
>PF13911 AhpC-TSA_2: AhpC/TSA antioxidant enzyme
Probab=49.61 E-value=18 Score=23.12 Aligned_cols=29 Identities=17% Similarity=0.336 Sum_probs=24.2
Q ss_pred eeeecCCHHHHHHHHHHcCccEEEeccCC
Q psy627 38 LALNFSSKEDAIQYCQKNGWKFFVEEPKW 66 (89)
Q Consensus 38 v~l~F~s~e~Aiayae~~G~~Y~V~~p~~ 66 (89)
|-+...+.+.+-+||+..++++.|.....
T Consensus 17 v~I~~g~~~~~~~f~~~~~~p~~ly~D~~ 45 (115)
T PF13911_consen 17 VVIGCGSPEGIEKFCELTGFPFPLYVDPE 45 (115)
T ss_pred EEEEcCCHHHHHHHHhccCCCCcEEEeCc
Confidence 46778999889999999999999866444
No 47
>PLN00200 argininosuccinate synthase; Provisional
Probab=49.10 E-value=14 Score=30.28 Aligned_cols=18 Identities=28% Similarity=0.543 Sum_probs=16.2
Q ss_pred cCCHHHHHHHHHHcCccE
Q psy627 42 FSSKEDAIQYCQKNGWKF 59 (89)
Q Consensus 42 F~s~e~Aiayae~~G~~Y 59 (89)
|.|+++.++||+++|++.
T Consensus 153 ~~~r~e~~~~A~~~Gipv 170 (404)
T PLN00200 153 IKGREDLIEYAKKHNIPV 170 (404)
T ss_pred CCCHHHHHHHHHHcCCCC
Confidence 559999999999999974
No 48
>TIGR00506 ribB 3,4-dihydroxy-2-butanone 4-phosphate synthase. Several members of the family are bifunctional, involving both ribA and ribB function. In these cases, ribA tends to be on the C-terminal end of the protein and ribB tends to be on the N-terminal.
Probab=48.71 E-value=17 Score=27.10 Aligned_cols=19 Identities=21% Similarity=0.545 Sum_probs=16.7
Q ss_pred cCCHHHHHHHHHHcCccEE
Q psy627 42 FSSKEDAIQYCQKNGWKFF 60 (89)
Q Consensus 42 F~s~e~Aiayae~~G~~Y~ 60 (89)
-.+++++.+||++||+++.
T Consensus 172 m~~~~~~~~fA~~~~l~~i 190 (199)
T TIGR00506 172 MARKPELMEYAKKHNLKLI 190 (199)
T ss_pred ccCHHHHHHHHHHcCCcEE
Confidence 4678999999999999983
No 49
>cd07996 WGR_MMR_like WGR domain of molybdate metabolism regulator and related proteins. The WGR domain is found in the putative Escherichia coli molybdate metabolism regulator and related bacterial proteins, as well as in various other bacterial proteins of unknown function. It has been called WGR after the most conserved central motif of the domain. The domain appears to occur in single-domain proteins and in a variety of domain architectures, together with ATP-dependent DNA ligase domains, WD40 repeats, leucine-rich repeats, and other domains. It has been proposed to function as a nucleic acid binding domain.
Probab=48.55 E-value=51 Score=19.73 Aligned_cols=50 Identities=18% Similarity=0.197 Sum_probs=29.6
Q ss_pred CCCCceEEEecCCCCc-cCCCCCCCCCCCCCCceeeecCCHHHHHHHHHHc
Q psy627 6 HNANNWFIQFDTRERW-ENPLMGWCSTGDPLSNLALNFSSKEDAIQYCQKN 55 (89)
Q Consensus 6 ~~t~~W~lefe~~~~~-~dPLMGWtss~D~~~qv~l~F~s~e~Aiayae~~ 55 (89)
...+.|.|+..+...- ..-+.=|---+..-+.....|+|.++|++.+++.
T Consensus 12 n~~kfy~i~l~~~lfg~~~v~~~wGRiG~~Gq~~~~~~~s~~~A~~~~~k~ 62 (74)
T cd07996 12 NSARFYEIELEGDLFGEWSLVRRWGRIGTKGQSRTKTFDSEEEALKAAEKL 62 (74)
T ss_pred CCCcEEEEEEcccCCCCEEEEEEECCCCCCCceEEEECCCHHHHHHHHHHH
Confidence 3446788888763111 1223334333433233457799999999988763
No 50
>PRK01792 ribB 3,4-dihydroxy-2-butanone 4-phosphate synthase; Provisional
Probab=48.45 E-value=17 Score=27.47 Aligned_cols=19 Identities=11% Similarity=0.466 Sum_probs=16.9
Q ss_pred cCCHHHHHHHHHHcCccEE
Q psy627 42 FSSKEDAIQYCQKNGWKFF 60 (89)
Q Consensus 42 F~s~e~Aiayae~~G~~Y~ 60 (89)
..+++++++||++||+++.
T Consensus 182 ma~~~~~~~fA~~~~l~~i 200 (214)
T PRK01792 182 MARTPEIVEFAKKFGYAVV 200 (214)
T ss_pred ccCHHHHHHHHHHcCCcEE
Confidence 4679999999999999984
No 51
>PF05198 IF3_N: Translation initiation factor IF-3, N-terminal domain; InterPro: IPR019814 Initiation factor 3 (IF-3) (gene infC) is one of the three factors required for the initiation of protein biosynthesis in bacteria. IF-3 is thought to function as a fidelity factor during the assembly of the ternary initiation complex which consist of the 30S ribosomal subunit, the initiator tRNA and the messenger RNA. IF-3 is a basic protein that binds to the 30S ribosomal subunit []. The chloroplast initiation factor IF-3(chl) is a protein that enhances the poly(A,U,G)-dependent binding of the initiator tRNA to chloroplast ribosomal 30s subunits in which the central section is evolutionary related to the sequence of bacterial IF-3 []. ; GO: 0003743 translation initiation factor activity, 0006413 translational initiation; PDB: 1TIF_A.
Probab=48.45 E-value=18 Score=22.85 Aligned_cols=26 Identities=15% Similarity=0.294 Sum_probs=20.4
Q ss_pred cCCHHHHHHHHHHcCccEEEeccCCC
Q psy627 42 FSSKEDAIQYCQKNGWKFFVEEPKWK 67 (89)
Q Consensus 42 F~s~e~Aiayae~~G~~Y~V~~p~~~ 67 (89)
.=+.++|++-|++.|++-....|...
T Consensus 27 v~~~~eAl~~A~~~~lDLV~v~~~~~ 52 (76)
T PF05198_consen 27 VMSLREALRLAKEKGLDLVEVSPNAD 52 (76)
T ss_dssp EEEHHHHHHHHHHTT-EEEEEETTSS
T ss_pred eEEHHHHHHHHHHcCCcEEEEcCCCC
Confidence 34689999999999999988776543
No 52
>PRK00923 sirohydrochlorin cobaltochelatase; Reviewed
Probab=47.00 E-value=31 Score=22.61 Aligned_cols=25 Identities=16% Similarity=0.170 Sum_probs=21.9
Q ss_pred eecCCHHHHHHHHHHcCccEEEecc
Q psy627 40 LNFSSKEDAIQYCQKNGWKFFVEEP 64 (89)
Q Consensus 40 l~F~s~e~Aiayae~~G~~Y~V~~p 64 (89)
+..+|.+++|+-|.+.|..-.|..|
T Consensus 43 ~~~P~l~~~l~~l~~~g~~~v~vvP 67 (126)
T PRK00923 43 FNEPTIPEALKKLIGTGADKIIVVP 67 (126)
T ss_pred cCCCCHHHHHHHHHHcCCCEEEEEc
Confidence 4568999999999999999877776
No 53
>COG0290 InfC Translation initiation factor 3 (IF-3) [Translation, ribosomal structure and biogenesis]
Probab=46.72 E-value=28 Score=25.81 Aligned_cols=33 Identities=24% Similarity=0.384 Sum_probs=26.8
Q ss_pred CCHHHHHHHHHHcCccEEEeccCCC--CCCccccc
Q psy627 43 SSKEDAIQYCQKNGWKFFVEEPKWK--TPKVKSYA 75 (89)
Q Consensus 43 ~s~e~Aiayae~~G~~Y~V~~p~~~--~~~~ksYa 75 (89)
-|.++|+..|+..|++..+..|+.. .-+.-.||
T Consensus 34 v~~~eAL~lA~e~~LDLV~Ispna~PPVcKImDYG 68 (176)
T COG0290 34 VSIEEALKLAEEAGLDLVEISPNAKPPVCKIMDYG 68 (176)
T ss_pred eeHHHHHHHHHHcCCCEEEECCCCCCCeeEeeecc
Confidence 4789999999999999999998743 33556676
No 54
>PF14259 RRM_6: RNA recognition motif (a.k.a. RRM, RBD, or RNP domain); PDB: 2DNN_A 1WI6_A 2EVZ_A 2ADC_A 1QM9_A 2KG1_A 2HGN_A 1U2F_A 2G4B_A 2HZC_A ....
Probab=46.48 E-value=21 Score=20.38 Aligned_cols=20 Identities=30% Similarity=0.204 Sum_probs=17.5
Q ss_pred ceeeecCCHHHHHHHHHHcC
Q psy627 37 NLALNFSSKEDAIQYCQKNG 56 (89)
Q Consensus 37 qv~l~F~s~e~Aiayae~~G 56 (89)
..-+.|.|.|+|...++.++
T Consensus 41 ~a~v~f~~~~~a~~al~~~~ 60 (70)
T PF14259_consen 41 FAFVEFSSEEDAKRALELLN 60 (70)
T ss_dssp EEEEEESSHHHHHHHHHHHT
T ss_pred EEEEEeCCHHHHHHHHHHCC
Confidence 35699999999999999877
No 55
>cd00913 PCD_DCoH_subfamily_a PCD_DCoH: The bifunctional protein pterin-4alpha-carbinolamine dehydratase (PCD), also known as DCoH (dimerization cofactor of hepatocyte nuclear factor-1), is both a transcription activator and a metabolic enzyme. DCoH stimulates gene expression by associating with specific DNA binding proteins such as HNF-1alpha (hepatocyte nuclear factor-1) and Xenopus enhancer of rudimentary homologue (XERH). DCoH also catalyzes the dehydration of 4alpha- hydroxy- tetrahydrobiopterin (4alpha-OH-BH4) to quinoiddihydrobiopterin, a percursor of the phenylalanine hydroxylase cofactor BH4 (tetrahydrobiopterin). The DCoH homodimer has a saddle-shaped structure similar to that of TBP (TATA binding protein).
Probab=45.48 E-value=32 Score=21.22 Aligned_cols=29 Identities=21% Similarity=0.445 Sum_probs=22.3
Q ss_pred CCCCCCCCC-CceeeecCCHHHHHHHHHHc
Q psy627 27 GWCSTGDPL-SNLALNFSSKEDAIQYCQKN 55 (89)
Q Consensus 27 GWtss~D~~-~qv~l~F~s~e~Aiayae~~ 55 (89)
||.-..|+. -.-+..|++-.+|++|+.+-
T Consensus 1 gW~~~~~~~~l~r~f~f~~f~~a~~f~~~v 30 (76)
T cd00913 1 GWELADDGLKLERTFRFKNFVEALEFVNAV 30 (76)
T ss_pred CCcCcCCCCeEEEEEEcCCHHHHHHHHHHH
Confidence 788777753 34578899999999998763
No 56
>PRK13010 purU formyltetrahydrofolate deformylase; Reviewed
Probab=45.36 E-value=32 Score=26.65 Aligned_cols=30 Identities=10% Similarity=0.018 Sum_probs=25.8
Q ss_pred CceeeecCCHHHHHHHHHHcCccEEEeccC
Q psy627 36 SNLALNFSSKEDAIQYCQKNGWKFFVEEPK 65 (89)
Q Consensus 36 ~qv~l~F~s~e~Aiayae~~G~~Y~V~~p~ 65 (89)
..+.+-..++.+|.++|+++||++.+....
T Consensus 121 ~~i~~visn~~~~~~~A~~~gIp~~~~~~~ 150 (289)
T PRK13010 121 MDIVGIISNHPDLQPLAVQHDIPFHHLPVT 150 (289)
T ss_pred cEEEEEEECChhHHHHHHHcCCCEEEeCCC
Confidence 567888899999999999999999986543
No 57
>PF02786 CPSase_L_D2: Carbamoyl-phosphate synthase L chain, ATP binding domain; InterPro: IPR005479 Carbamoyl phosphate synthase (CPSase) is a heterodimeric enzyme composed of a small and a large subunit (with the exception of CPSase III, see below). CPSase catalyses the synthesis of carbamoyl phosphate from biocarbonate, ATP and glutamine (6.3.5.5 from EC) or ammonia (6.3.4.16 from EC), and represents the first committed step in pyrimidine and arginine biosynthesis in prokaryotes and eukaryotes, and in the urea cycle in most terrestrial vertebrates [, ]. CPSase has three active sites, one in the small subunit and two in the large subunit. The small subunit contains the glutamine binding site and catalyses the hydrolysis of glutamine to glutamate and ammonia. The large subunit has two homologous carboxy phosphate domains, both of which have ATP-binding sites; however, the N-terminal carboxy phosphate domain catalyses the phosphorylation of biocarbonate, while the C-terminal domain catalyses the phosphorylation of the carbamate intermediate []. The carboxy phosphate domain found duplicated in the large subunit of CPSase is also present as a single copy in the biotin-dependent enzymes acetyl-CoA carboxylase (6.4.1.2 from EC) (ACC), propionyl-CoA carboxylase (6.4.1.3 from EC) (PCCase), pyruvate carboxylase (6.4.1.1 from EC) (PC) and urea carboxylase (6.3.4.6 from EC). Most prokaryotes carry one form of CPSase that participates in both arginine and pyrimidine biosynthesis, however certain bacteria can have separate forms. The large subunit in bacterial CPSase has four structural domains: the carboxy phosphate domain 1, the oligomerisation domain, the carbamoyl phosphate domain 2 and the allosteric domain []. CPSase heterodimers from Escherichia coli contain two molecular tunnels: an ammonia tunnel and a carbamate tunnel. These inter-domain tunnels connect the three distinct active sites, and function as conduits for the transport of unstable reaction intermediates (ammonia and carbamate) between successive active sites []. The catalytic mechanism of CPSase involves the diffusion of carbamate through the interior of the enzyme from the site of synthesis within the N-terminal domain of the large subunit to the site of phosphorylation within the C-terminal domain. Eukaryotes have two distinct forms of CPSase: a mitochondrial enzyme (CPSase I) that participates in both arginine biosynthesis and the urea cycle; and a cytosolic enzyme (CPSase II) involved in pyrimidine biosynthesis. CPSase II occurs as part of a multi-enzyme complex along with aspartate transcarbamoylase and dihydroorotase; this complex is referred to as the CAD protein []. The hepatic expression of CPSase is transcriptionally regulated by glucocorticoids and/or cAMP []. There is a third form of the enzyme, CPSase III, found in fish, which uses glutamine as a nitrogen source instead of ammonia []. CPSase III is closely related to CPSase I, and is composed of a single polypeptide that may have arisen from gene fusion of the glutaminase and synthetase domains []. This entry represents the ATP-binding domain found in the large subunit of carbamoyl phosphate synthase, as well as in related proteins.; GO: 0003824 catalytic activity, 0005524 ATP binding, 0008152 metabolic process; PDB: 3U9S_A 3U9T_A 2C00_B 2VQD_A 1W96_B 1W93_A 1M6V_C 1CS0_C 1C30_E 1C3O_G ....
Probab=45.36 E-value=22 Score=26.08 Aligned_cols=21 Identities=24% Similarity=0.605 Sum_probs=17.7
Q ss_pred CCHHHHHHHHHHcCccEEEec
Q psy627 43 SSKEDAIQYCQKNGWKFFVEE 63 (89)
Q Consensus 43 ~s~e~Aiayae~~G~~Y~V~~ 63 (89)
.|.|+|+++|++.|++..|..
T Consensus 25 ~~~eea~~~a~~iGyPVliKa 45 (211)
T PF02786_consen 25 SSVEEALEFAEEIGYPVLIKA 45 (211)
T ss_dssp SSHHHHHHHHHHH-SSEEEEE
T ss_pred CCHHHHHHHHHhcCCceEEee
Confidence 689999999999999987754
No 58
>cd01206 Homer Homer type EVH1 domain. Homer type EVH1 domain. Homer is a synaptic scaffolding protein, involved in neuronal signaling. It contains an EVH1 domain, which binds to both neurotransmitter receptors, such as the metabotropic glutamate receptor (mGluR) and to other scaffolding proteins via PPXXF motifs, in order to target them to the synaptic junction. It has a PH-like fold, despite having minimal sequence similarity to PH or PTB domains.
Probab=44.96 E-value=21 Score=24.69 Aligned_cols=23 Identities=26% Similarity=0.538 Sum_probs=18.3
Q ss_pred CCCCC-ce-eeecCCHHHHHHHHHH
Q psy627 32 GDPLS-NL-ALNFSSKEDAIQYCQK 54 (89)
Q Consensus 32 ~D~~~-qv-~l~F~s~e~Aiayae~ 54 (89)
.|+.. +| .|.|.|+|||-.|+++
T Consensus 78 rD~R~~tVyGLnF~Sk~ea~~F~~~ 102 (111)
T cd01206 78 ADSRANTVYGLGFSSEQQLTKFAEK 102 (111)
T ss_pred cccccceeeecccCCHHHHHHHHHH
Confidence 36663 33 7999999999999875
No 59
>KOG0081|consensus
Probab=44.60 E-value=19 Score=27.14 Aligned_cols=22 Identities=23% Similarity=0.400 Sum_probs=18.2
Q ss_pred cCCHHHHHHHHHHcCccEEEec
Q psy627 42 FSSKEDAIQYCQKNGWKFFVEE 63 (89)
Q Consensus 42 F~s~e~Aiayae~~G~~Y~V~~ 63 (89)
--|.+.|++.|++.|++|.-..
T Consensus 141 ~Vs~~qa~~La~kyglPYfETS 162 (219)
T KOG0081|consen 141 VVSEDQAAALADKYGLPYFETS 162 (219)
T ss_pred hhhHHHHHHHHHHhCCCeeeec
Confidence 3478999999999999996443
No 60
>PF05637 Glyco_transf_34: galactosyl transferase GMA12/MNN10 family; InterPro: IPR008630 This family contains a number of glycosyltransferase enzymes that contain a DXD motif. This family includes a number of Caenorhabditis elegans homologues where the DXD is replaced by DXH. Some members of this family are included in glycosyltransferase family 34.; GO: 0016758 transferase activity, transferring hexosyl groups, 0016021 integral to membrane; PDB: 2P72_B 2P73_A 2P6W_A.
Probab=44.59 E-value=15 Score=27.41 Aligned_cols=18 Identities=11% Similarity=0.676 Sum_probs=12.1
Q ss_pred HHHHHHHHHHcCccEEEe
Q psy627 45 KEDAIQYCQKNGWKFFVE 62 (89)
Q Consensus 45 ~e~Aiayae~~G~~Y~V~ 62 (89)
.+-=+.||++||++|.+.
T Consensus 30 ~~Nr~~Ya~~HgY~~~~~ 47 (239)
T PF05637_consen 30 IQNRVDYARRHGYDLYYR 47 (239)
T ss_dssp ---HHHHHHHHT-EEEEE
T ss_pred HHHHHHHHHhcCCEEEEE
Confidence 334468999999999984
No 61
>PRK00910 ribB 3,4-dihydroxy-2-butanone 4-phosphate synthase; Provisional
Probab=44.47 E-value=21 Score=27.02 Aligned_cols=20 Identities=5% Similarity=0.214 Sum_probs=17.4
Q ss_pred cCCHHHHHHHHHHcCccEEE
Q psy627 42 FSSKEDAIQYCQKNGWKFFV 61 (89)
Q Consensus 42 F~s~e~Aiayae~~G~~Y~V 61 (89)
..+++++++||++||+++.-
T Consensus 183 ma~~~~l~~fA~~h~l~~is 202 (218)
T PRK00910 183 MAKTPEIIAFGKLHNMPVLT 202 (218)
T ss_pred cCCHHHHHHHHHHcCCcEEE
Confidence 56899999999999999843
No 62
>PF07014 Hs1pro-1_C: Hs1pro-1 protein C-terminus; InterPro: IPR009743 This entry represents the C terminus (approximately 270 residues) of a number of plant Hs1pro-1 proteins, which are believed to confer nematode resistance [].
Probab=44.33 E-value=9.1 Score=29.79 Aligned_cols=23 Identities=26% Similarity=0.381 Sum_probs=19.9
Q ss_pred CCCCCCCCcee---eecCCHHHHHHH
Q psy627 29 CSTGDPLSNLA---LNFSSKEDAIQY 51 (89)
Q Consensus 29 tss~D~~~qv~---l~F~s~e~Aiay 51 (89)
++++|+++|+- -+|+|+++|--|
T Consensus 228 ~~~~D~Lsq~flEPtYfPSLDaAKTF 253 (261)
T PF07014_consen 228 IESADSLSQIFLEPTYFPSLDAAKTF 253 (261)
T ss_pred ccccchHHHHhcCCCCCCchhHHHHH
Confidence 57799999985 579999999887
No 63
>PF00926 DHBP_synthase: 3,4-dihydroxy-2-butanone 4-phosphate synthase; InterPro: IPR000422 3,4-dihydroxy-2-butanone 4-phosphate synthase (4.1.99.12 from EC) (DHBP synthase) (RibB) catalyses the conversion of D-ribulose 5-phosphate to formate and 3,4-dihydroxy-2-butanone 4-phosphate, the latter serving as the biosynthetic precursor for the xylene ring of riboflavin []. In Photobacterium leiognathi, the riboflavin synthesis genes ribB (DHBP synthase), ribE (riboflavin synthase), ribH (lumazone synthase) and ribA (GTP cyclohydrolase II) all reside in the lux operon []. RibB is sometimes found as a bifunctional enzyme with GTP cyclohydrolase II that catalyses the first committed step in the biosynthesis of riboflavin (IPR000926 from INTERPRO). No sequences with significant homology to DHBP synthase are found in the metazoa.; GO: 0008686 3,4-dihydroxy-2-butanone-4-phosphate synthase activity, 0009231 riboflavin biosynthetic process; PDB: 1K4O_A 1K4L_A 1K4P_A 1K49_A 1K4I_A 1TKU_A 1TKS_B 2RIS_A 2RIU_A 3MIO_A ....
Probab=44.18 E-value=22 Score=26.32 Aligned_cols=20 Identities=10% Similarity=0.423 Sum_probs=16.0
Q ss_pred cCCHHHHHHHHHHcCccEEE
Q psy627 42 FSSKEDAIQYCQKNGWKFFV 61 (89)
Q Consensus 42 F~s~e~Aiayae~~G~~Y~V 61 (89)
-.+++++.+||++||+++.-
T Consensus 167 ~~~~~~~~~fA~~~~l~~vs 186 (194)
T PF00926_consen 167 MARRDELEEFAKKHGLPIVS 186 (194)
T ss_dssp BHCHHHHHHHHHHTT-EEEE
T ss_pred cCCHHHHHHHHHHcCCcEEE
Confidence 35789999999999998753
No 64
>KOG0855|consensus
Probab=43.99 E-value=19 Score=27.16 Aligned_cols=30 Identities=7% Similarity=0.125 Sum_probs=26.3
Q ss_pred eeeecCCHHHHHHHHHHcCccEEEeccCCC
Q psy627 38 LALNFSSKEDAIQYCQKNGWKFFVEEPKWK 67 (89)
Q Consensus 38 v~l~F~s~e~Aiayae~~G~~Y~V~~p~~~ 67 (89)
+.|+|++-.+--+|+-+++++|.+.....+
T Consensus 128 ~GlS~D~s~sqKaF~sKqnlPYhLLSDpk~ 157 (211)
T KOG0855|consen 128 IGLSGDDSASQKAFASKQNLPYHLLSDPKN 157 (211)
T ss_pred EeeccCchHHHHHhhhhccCCeeeecCcch
Confidence 579999999999999999999999875443
No 65
>PRK00823 phhB pterin-4-alpha-carbinolamine dehydratase; Validated
Probab=43.84 E-value=28 Score=22.49 Aligned_cols=34 Identities=21% Similarity=0.421 Sum_probs=25.7
Q ss_pred CCCCCCCCCCCCC-CceeeecCCHHHHHHHHHHcC
Q psy627 23 NPLMGWCSTGDPL-SNLALNFSSKEDAIQYCQKNG 56 (89)
Q Consensus 23 dPLMGWtss~D~~-~qv~l~F~s~e~Aiayae~~G 56 (89)
..|-||.-..|+. -.-...|++-.+|++|+.+-+
T Consensus 16 ~~l~gW~~~~~~~~l~r~f~f~~f~~a~~f~~~Va 50 (97)
T PRK00823 16 PQLPGWTLVGDRDAIERTFKFKNFNEAFAFMNRVA 50 (97)
T ss_pred hcCCCCeEeCCcCeEEEEEEeCCHHHHHHHHHHHH
Confidence 4577999887753 334688999999999987644
No 66
>PF00764 Arginosuc_synth: Arginosuccinate synthase; InterPro: IPR001518 Argininosuccinate synthase (6.3.4.5 from EC) (AS) is a urea cycle enzyme that catalyzes the penultimate step in arginine biosynthesis: the ATP-dependent ligation of citrulline to aspartate to form argininosuccinate, AMP and pyrophosphate [, ]. In humans, a defect in the AS gene causes citrullinemia, a genetic disease characterised by severe vomiting spells and mental retardation. AS is a homotetrameric enzyme of chains of about 400 amino-acid residues. An arginine seems to be important for the enzyme's catalytic mechanism. The sequences of AS from various prokaryotes, archaebacteria and eukaryotes show significant similarity.; GO: 0004055 argininosuccinate synthase activity, 0005524 ATP binding, 0006526 arginine biosynthetic process; PDB: 1K97_A 1KP2_A 1K92_A 1KP3_A 2NZ2_A 1VL2_A 1J1Z_D 1KOR_C 1J20_D 1KH2_C ....
Probab=43.37 E-value=13 Score=30.29 Aligned_cols=17 Identities=35% Similarity=0.643 Sum_probs=12.4
Q ss_pred HHHHHHHHHHcCccEEE
Q psy627 45 KEDAIQYCQKNGWKFFV 61 (89)
Q Consensus 45 ~e~Aiayae~~G~~Y~V 61 (89)
+|+.|+||+++||+..+
T Consensus 147 R~~~i~ya~~~gIpv~~ 163 (388)
T PF00764_consen 147 REEEIEYAKKHGIPVPV 163 (388)
T ss_dssp HHHHHHHHHHTT----S
T ss_pred HHHHHHHHHHcCCCCCC
Confidence 99999999999998654
No 67
>PRK13820 argininosuccinate synthase; Provisional
Probab=42.55 E-value=17 Score=29.57 Aligned_cols=18 Identities=28% Similarity=0.446 Sum_probs=16.5
Q ss_pred CHHHHHHHHHHcCccEEE
Q psy627 44 SKEDAIQYCQKNGWKFFV 61 (89)
Q Consensus 44 s~e~Aiayae~~G~~Y~V 61 (89)
|+++-++||+++|+++.+
T Consensus 148 tK~ei~~ya~~~gip~~~ 165 (394)
T PRK13820 148 TREWEIEYAKEKGIPVPV 165 (394)
T ss_pred CHHHHHHHHHHcCCCCCc
Confidence 999999999999999853
No 68
>smart00757 CRA CT11-RanBPM. protein-protein interaction domain present in crown eukaryotes (plants, animals, fungi)
Probab=42.28 E-value=20 Score=22.14 Aligned_cols=17 Identities=24% Similarity=0.264 Sum_probs=13.4
Q ss_pred CHHHHHHHHHHcCccEE
Q psy627 44 SKEDAIQYCQKNGWKFF 60 (89)
Q Consensus 44 s~e~Aiayae~~G~~Y~ 60 (89)
+.++||.||+++=-+|.
T Consensus 2 ~~~eAi~yar~~l~~~~ 18 (99)
T smart00757 2 KIEEALAYARELLAPFA 18 (99)
T ss_pred cHHHHHHHHHHHHhhhc
Confidence 46899999998866654
No 69
>cd01999 Argininosuccinate_Synthase Argininosuccinate synthase. The Argininosuccinate synthase is a urea cycle enzyme that catalyzes the penultimate step in arginine biosynthesis: the ATP-dependent ligation of citrulline to aspartate to form argininosuccinate, AMP and pyrophosphate . In humans, a defect in the AS gene causes citrullinemia, a genetic disease characterized by severe vomiting spells and mental retardation. AS is a homotetrameric enzyme of chains of about 400 amino-acid residues. An arginine seems to be important for the enzyme's catalytic mechanism. The sequences of AS from various prokaryotes, archaebacteria and eukaryotes show significant similarity
Probab=42.25 E-value=20 Score=28.96 Aligned_cols=20 Identities=35% Similarity=0.635 Sum_probs=17.8
Q ss_pred cCCHHHHHHHHHHcCccEEE
Q psy627 42 FSSKEDAIQYCQKNGWKFFV 61 (89)
Q Consensus 42 F~s~e~Aiayae~~G~~Y~V 61 (89)
+-|+++-++||+++|+++..
T Consensus 146 ~~sr~ev~~~A~~~Gip~~~ 165 (385)
T cd01999 146 FLSREEEIEYAEEHGIPVPV 165 (385)
T ss_pred cCCHHHHHHHHHHcCCCCcc
Confidence 56999999999999998754
No 70
>smart00481 POLIIIAc DNA polymerase alpha chain like domain. DNA polymerase alpha chain like domain, incl. family of hypothetical proteins
Probab=42.15 E-value=40 Score=19.58 Aligned_cols=24 Identities=17% Similarity=0.122 Sum_probs=20.7
Q ss_pred CCHHHHHHHHHHcCccEEEeccCC
Q psy627 43 SSKEDAIQYCQKNGWKFFVEEPKW 66 (89)
Q Consensus 43 ~s~e~Aiayae~~G~~Y~V~~p~~ 66 (89)
.+.++.|+.|+++|++......+.
T Consensus 15 ~~~~~~~~~a~~~g~~~v~iTDh~ 38 (67)
T smart00481 15 LSPEELVKRAKELGLKAIAITDHG 38 (67)
T ss_pred CCHHHHHHHHHHcCCCEEEEeeCC
Confidence 368999999999999998877664
No 71
>PF00076 RRM_1: RNA recognition motif. (a.k.a. RRM, RBD, or RNP domain); InterPro: IPR000504 Many eukaryotic proteins containing one or more copies of a putative RNA-binding domain of about 90 amino acids are known to bind single-stranded RNAs [, , ]. The largest group of single strand RNA-binding proteins is the eukaryotic RNA recognition motif (RRM) family that contains an eight amino acid RNP-1 consensus sequence [, ]. RRM proteins have a variety of RNA binding preferences and functions, and include heterogeneous nuclear ribonucleoproteins (hnRNPs), proteins implicated in regulation of alternative splicing (SR, U2AF, Sxl), protein components of small nuclear ribonucleoproteins (U1 and U2 snRNPs), and proteins that regulate RNA stability and translation (PABP, La, Hu) [, , ]. The RRM in heterodimeric splicing factor U2 snRNP auxiliary factor (U2AF) appears to have two RRM-like domains with specialised features for protein recognition []. The motif also appears in a few single stranded DNA binding proteins. The typical RRM consists of four anti-parallel beta-strands and two alpha-helices arranged in a beta-alpha-beta-beta-alpha-beta fold with side chains that stack with RNA bases. Specificity of RNA binding is determined by multiple contacts with surrounding amino acids. A third helix is present during RNA binding in some cases []. The RRM is reviewed in a number of publications [, , ].; GO: 0003676 nucleic acid binding; PDB: 2RNE_A 2DGO_A 2DO4_A 1YTY_B 2VOO_B 2VOP_A 2VON_B 1ZH5_B 2VOD_A 1S79_A ....
Probab=42.11 E-value=33 Score=19.03 Aligned_cols=20 Identities=35% Similarity=0.370 Sum_probs=16.4
Q ss_pred eeeecCCHHHHHHHHH-HcCc
Q psy627 38 LALNFSSKEDAIQYCQ-KNGW 57 (89)
Q Consensus 38 v~l~F~s~e~Aiayae-~~G~ 57 (89)
.-+.|.+.++|....+ -+|.
T Consensus 42 a~V~F~~~~~a~~a~~~l~g~ 62 (70)
T PF00076_consen 42 AFVEFESEEDAEKALEELNGK 62 (70)
T ss_dssp EEEEESSHHHHHHHHHHHTTE
T ss_pred EEEEEcCHHHHHHHHHHcCCC
Confidence 4699999999999888 5663
No 72
>cd03012 TlpA_like_DipZ_like TlpA-like family, DipZ-like subfamily; composed uncharacterized proteins containing a TlpA-like TRX domain. Some members show domain architectures similar to that of E. coli DipZ protein (also known as DsbD). The only eukaryotic members of the TlpA family belong to this subfamily. TlpA is a disulfide reductase known to have a crucial role in the biogenesis of cytochrome aa3.
Probab=42.08 E-value=27 Score=22.49 Aligned_cols=43 Identities=5% Similarity=-0.043 Sum_probs=27.5
Q ss_pred CCHHHHHHHHHHcCccEEEeccCCC----CCCcccccccCCcCCCcc
Q psy627 43 SSKEDAIQYCQKNGWKFFVEEPKWK----TPKVKSYAFNFSWNKRTR 85 (89)
Q Consensus 43 ~s~e~Aiayae~~G~~Y~V~~p~~~----~~~~ksYadNF~~~r~~~ 85 (89)
++.++..+|++++|+.|.+...... .......-..|..++..+
T Consensus 71 ~~~~~~~~~~~~~~~~~p~~~D~~~~~~~~~~v~~~P~~~vid~~G~ 117 (126)
T cd03012 71 RDLANVKSAVLRYGITYPVANDNDYATWRAYGNQYWPALYLIDPTGN 117 (126)
T ss_pred cCHHHHHHHHHHcCCCCCEEECCchHHHHHhCCCcCCeEEEECCCCc
Confidence 4688999999999999987664321 123333444555554443
No 73
>KOG2741|consensus
Probab=41.92 E-value=19 Score=29.25 Aligned_cols=21 Identities=24% Similarity=0.275 Sum_probs=17.6
Q ss_pred eeeecCCHHHHHHHHHHcCcc
Q psy627 38 LALNFSSKEDAIQYCQKNGWK 58 (89)
Q Consensus 38 v~l~F~s~e~Aiayae~~G~~ 58 (89)
|.+.=+|.|-|++||++|+|+
T Consensus 36 vava~~s~~~A~~fAq~~~~~ 56 (351)
T KOG2741|consen 36 VAVADPSLERAKEFAQRHNIP 56 (351)
T ss_pred EEEecccHHHHHHHHHhcCCC
Confidence 345567999999999999994
No 74
>PRK00014 ribB 3,4-dihydroxy-2-butanone 4-phosphate synthase; Provisional
Probab=41.76 E-value=25 Score=26.95 Aligned_cols=19 Identities=16% Similarity=0.232 Sum_probs=16.9
Q ss_pred cCCHHHHHHHHHHcCccEE
Q psy627 42 FSSKEDAIQYCQKNGWKFF 60 (89)
Q Consensus 42 F~s~e~Aiayae~~G~~Y~ 60 (89)
..+++++++||++||+++.
T Consensus 187 ma~~~~l~~fA~~~~l~ii 205 (230)
T PRK00014 187 MMRGASLERYAAKEGLVAL 205 (230)
T ss_pred ccCHHHHHHHHHHcCCcEE
Confidence 4679999999999999984
No 75
>PRK06027 purU formyltetrahydrofolate deformylase; Reviewed
Probab=41.51 E-value=36 Score=26.17 Aligned_cols=31 Identities=16% Similarity=0.157 Sum_probs=26.5
Q ss_pred CCceeeecCCHHHHHHHHHHcCccEEEeccC
Q psy627 35 LSNLALNFSSKEDAIQYCQKNGWKFFVEEPK 65 (89)
Q Consensus 35 ~~qv~l~F~s~e~Aiayae~~G~~Y~V~~p~ 65 (89)
-.+|.+-+.++++|-++|+++||++.+..++
T Consensus 116 ~~~i~~visn~~~~~~lA~~~gIp~~~~~~~ 146 (286)
T PRK06027 116 PVEIAAVISNHDDLRSLVERFGIPFHHVPVT 146 (286)
T ss_pred CcEEEEEEEcChhHHHHHHHhCCCEEEeccC
Confidence 3568888899999999999999999886654
No 76
>COG4314 NosL Predicted lipoprotein involved in nitrous oxide reduction [Energy production and conversion]
Probab=41.50 E-value=21 Score=26.44 Aligned_cols=23 Identities=35% Similarity=0.521 Sum_probs=19.4
Q ss_pred eeeecCCHHHHHHHHHHcCccEE
Q psy627 38 LALNFSSKEDAIQYCQKNGWKFF 60 (89)
Q Consensus 38 v~l~F~s~e~Aiayae~~G~~Y~ 60 (89)
.-+-|.++|+|.+||.++|=...
T Consensus 129 ~A~pF~~e~aA~~faa~~GGrvl 151 (176)
T COG4314 129 LASPFSDEEAAERFAADNGGRVL 151 (176)
T ss_pred hcccccCHHHHHHHHHhcCCeEE
Confidence 45669999999999999997653
No 77
>PRK11118 putative monooxygenase; Provisional
Probab=41.31 E-value=36 Score=23.11 Aligned_cols=28 Identities=14% Similarity=0.252 Sum_probs=21.3
Q ss_pred CCCCCCCCCc-eeeecCCHHHHHHHHHHc
Q psy627 28 WCSTGDPLSN-LALNFSSKEDAIQYCQKN 55 (89)
Q Consensus 28 Wtss~D~~~q-v~l~F~s~e~Aiayae~~ 55 (89)
||.+.++..- =-.-|++.++|-+|.+.|
T Consensus 41 WTen~~t~eaGGiYlF~~e~~a~aYl~mH 69 (100)
T PRK11118 41 WTENEKTQEAGGIYLFEDEASAEAYLEMH 69 (100)
T ss_pred eecCCCCcccceEEEECCHHHHHHHHHHH
Confidence 7777776533 245689999999999876
No 78
>cd06576 PASTA_Pbp2x-like_1 PASTA domain of PBP2x-like proteins, first repeat. Penicillin-binding proteins (PBPs) are the major targets for beta-lactam antibiotics, like penicillins and cephalosporins. Beta-lactam antibiotics specifically inhibit transpeptidase activity by acylating the active site serine. PBPs catalyze key steps in the synthesis of the peptidoglycan, such as the interconnecting of glycan chains (polymers of N-glucosamine and N-acetylmuramic acid residues) and the cross-linking (transpeptidation) of short stem peptides, which are connected to glycan chains. Peptidoglycan is essential in cell division and protects bacteria from osmotic shock and lysis. PBP2x is one of the two monofunctional high molecular mass PBPs in Streptococcus pneumoniae and has been seen as the primary PBP target in beta-lactam-resistant strains. The PASTA domain is found at the C-termini of several PBPs and bacterial serine/threonine kinases. The name PASTA is derived from PBP and Serine/Threonine
Probab=39.65 E-value=51 Score=17.64 Aligned_cols=22 Identities=23% Similarity=0.148 Sum_probs=18.6
Q ss_pred CCHHHHHHHHHHcCccEEEecc
Q psy627 43 SSKEDAIQYCQKNGWKFFVEEP 64 (89)
Q Consensus 43 ~s~e~Aiayae~~G~~Y~V~~p 64 (89)
.+.++|.+.+++.|+.+++..+
T Consensus 8 ~~~~~a~~~l~~~g~~~~~~~~ 29 (55)
T cd06576 8 KSVEEAKKELKEAGLQPVVIGN 29 (55)
T ss_pred CCHHHHHHHHHHCCCEEEEeCC
Confidence 4789999999999999977653
No 79
>PF03960 ArsC: ArsC family; InterPro: IPR006660 Several bacterial taxon have a chromosomal resistance system, encoded by the ars operon, for the detoxification of arsenate, arsenite, and antimonite []. This system transports arsenite and antimonite out of the cell. The pump is composed of two polypeptides, the products of the arsA and arsB genes. This two-subunit enzyme produces resistance to arsenite and antimonite. Arsenate, however, must first be reduced to arsenite before it is extruded. A third gene, arsC, expands the substrate specificity to allow for arsenate pumping and resistance. ArsC is an approximately 150-residue arsenate reductase that uses reduced glutathione (GSH) to convert arsenate to arsenite with a redox active cysteine residue in the active site. ArsC forms an active quaternary complex with GSH, arsenate, and glutaredoxin 1 (Grx1). The three ligands must be present simultaneously for reduction to occur []. The arsC family also comprises the Spx proteins which are GRAM-positive bacterial transcription factors that regulate the transcription of multiple genes in response to disulphide stress []. The arsC protein structure has been solved []. It belongs to the thioredoxin superfamily fold which is defined by a beta-sheet core surrounded by alpha-helices. The active cysteine residue of ArsC is located in the loop between the first beta-strand and the first helix, which is also conserved in the Spx protein and its homologues.; PDB: 2KOK_A 1SK1_A 1SK2_A 1JZW_A 1J9B_A 1S3C_A 1SD8_A 1SD9_A 1I9D_A 1SK0_A ....
Probab=39.63 E-value=26 Score=22.61 Aligned_cols=21 Identities=14% Similarity=0.316 Sum_probs=15.8
Q ss_pred CHHHHHHHHHHcCccEEEecc
Q psy627 44 SKEDAIQYCQKNGWKFFVEEP 64 (89)
Q Consensus 44 s~e~Aiayae~~G~~Y~V~~p 64 (89)
|...|+++.+.+|++|++..=
T Consensus 8 t~rka~~~L~~~gi~~~~~d~ 28 (110)
T PF03960_consen 8 TCRKALKWLEENGIEYEFIDY 28 (110)
T ss_dssp HHHHHHHHHHHTT--EEEEET
T ss_pred HHHHHHHHHHHcCCCeEeehh
Confidence 456799999999999998763
No 80
>PTZ00056 glutathione peroxidase; Provisional
Probab=39.49 E-value=31 Score=24.83 Aligned_cols=21 Identities=24% Similarity=0.302 Sum_probs=18.8
Q ss_pred CCHHHHHHHHHHcCccEEEec
Q psy627 43 SSKEDAIQYCQKNGWKFFVEE 63 (89)
Q Consensus 43 ~s~e~Aiayae~~G~~Y~V~~ 63 (89)
++.|++.+|++++|+.|.|..
T Consensus 89 d~~e~~~~f~~~~~~~fpvl~ 109 (199)
T PTZ00056 89 PNTKDIRKFNDKNKIKYNFFE 109 (199)
T ss_pred CCHHHHHHHHHHcCCCceeee
Confidence 688999999999999999853
No 81
>cd00860 ThrRS_anticodon ThrRS Threonyl-anticodon binding domain. ThrRS belongs to class II aminoacyl-tRNA synthetases (aaRS). This alignment contains the anticodon binding domain, which is responsible for specificity in tRNA-binding, so that the activated amino acid is transferred to a ribose 3' OH group of the appropriate tRNA only.
Probab=39.28 E-value=45 Score=19.81 Aligned_cols=24 Identities=8% Similarity=-0.021 Sum_probs=19.8
Q ss_pred ecCCHHHHHHHHHHcCccEEEecc
Q psy627 41 NFSSKEDAIQYCQKNGWKFFVEEP 64 (89)
Q Consensus 41 ~F~s~e~Aiayae~~G~~Y~V~~p 64 (89)
.=.+....+.||++.|++|.|.-.
T Consensus 36 ~~~~~~~~~~~a~~~g~~~~iiig 59 (91)
T cd00860 36 RNEKLGKKIREAQLQKIPYILVVG 59 (91)
T ss_pred CCCCHHHHHHHHHHcCCCEEEEEC
Confidence 346788999999999999988664
No 82
>PF03795 YCII: YCII-related domain; InterPro: IPR005545 The majority of proteins in this group contain a single copy of this domain, though it is also found as a repeat (e.g. in Q9AJZ7 from SWISSPROT). A strongly conserved histidine and a aspartate suggest that the domain has an enzymatic function. This entry also covers what was previously known as the DGPF domain (COG3795). Although its function is unknown it is found fused to a sigma-70 factor family domain in Q9A8M4 from SWISSPROT, suggesting that this domain may plays a role in transcription initiation. This domain is named after the most conserved motif in the alignment.; PDB: 1S7I_A 1MWQ_A.
Probab=39.24 E-value=21 Score=22.01 Aligned_cols=17 Identities=18% Similarity=0.348 Sum_probs=14.9
Q ss_pred eeeecCCHHHHHHHHHH
Q psy627 38 LALNFSSKEDAIQYCQK 54 (89)
Q Consensus 38 v~l~F~s~e~Aiayae~ 54 (89)
+-+..+|+|+|+++|++
T Consensus 61 ~i~~a~s~e~A~~~~~~ 77 (95)
T PF03795_consen 61 IIVEAESREEAEEIAKE 77 (95)
T ss_dssp EEEEESSHHHHHHHHCT
T ss_pred EEEEeCCHHHHHHHHHh
Confidence 47899999999999875
No 83
>cd07202 cPLA2_Grp-IVC Group IVC cytoplasmic phospholipase A2; catalytic domain; Ca-independent. Group IVC cPLA2, a small 61 kDa protein, is a single domain alpha/beta hydrolase. It lacks a C2 domain; therefore, it has no Ca-dependence. Group IVC cPLA2 is also referred to as cPLA2-gamma. The cPLA2-gamma enzyme is predominantly found in cardiac and skeletal muscles, and to a lesser extent in the brain. Human cPLA2-gamma is approximately 30% identical to cPLA2-alpha. The catalytic domain of cytosolic phospholipase A2 (PLA2; EC 3.1.1.4) hydrolyzes the sn-2-acyl ester bond of phospholipids to release arachidonic acid. At the active site, cPLA2 contains a serine nucleophile through which the catalytic mechanism is initiated. The active site is partially covered by a solvent-accessible flexible lid. cPLA2 displays interfacial activation as it exists in both "closed lid" and "open lid" forms. Movement of the cPLA2 lid possibly exposes a greater hydrophobic surface and the active site. cPLA2 be
Probab=39.02 E-value=15 Score=30.53 Aligned_cols=22 Identities=27% Similarity=0.602 Sum_probs=17.7
Q ss_pred cCCHHHHHHHHHHcCccEEEec
Q psy627 42 FSSKEDAIQYCQKNGWKFFVEE 63 (89)
Q Consensus 42 F~s~e~Aiayae~~G~~Y~V~~ 63 (89)
|.+...|-+||++|||+|-=++
T Consensus 321 f~~l~~~~~~c~~~~ipfp~i~ 342 (430)
T cd07202 321 FETIKDTAEYCRKHNIPFPQVD 342 (430)
T ss_pred hHHHHHHHHHHHHcCCCCCCcC
Confidence 5577899999999999994333
No 84
>PF14657 Integrase_AP2: AP2-like DNA-binding integrase domain
Probab=38.72 E-value=28 Score=19.42 Aligned_cols=19 Identities=21% Similarity=0.242 Sum_probs=15.1
Q ss_pred Cceeee-cCCHHHHHHHHHH
Q psy627 36 SNLALN-FSSKEDAIQYCQK 54 (89)
Q Consensus 36 ~qv~l~-F~s~e~Aiayae~ 54 (89)
.++.-+ |.|+.||.+++.+
T Consensus 17 k~~~k~GF~TkkeA~~~~~~ 36 (46)
T PF14657_consen 17 KQKTKRGFKTKKEAEKALAK 36 (46)
T ss_pred EEEEcCCCCcHHHHHHHHHH
Confidence 345556 9999999999875
No 85
>PF00331 Glyco_hydro_10: Glycosyl hydrolase family 10; InterPro: IPR001000 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 10 GH10 from CAZY comprises enzymes with a number of known activities; xylanase (3.2.1.8 from EC); endo-1,3-beta-xylanase (3.2.1.32 from EC); cellobiohydrolase (3.2.1.91 from EC). These enzymes were formerly known as cellulase family F. The microbial degradation of cellulose and xylans requires several types of enzymes such as endoglucanases (3.2.1.4 from EC), cellobiohydrolases (3.2.1.91 from EC) (exoglucanases), or xylanases (3.2.1.8 from EC) [, ]. Fungi and bacteria produces a spectrum of cellulolytic enzymes (cellulases) and xylanases which, on the basis of sequence similarities, can be classified into families. One of these families is known as the cellulase family F [] or as the glycosyl hydrolases family 10 []. ; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 1UQZ_A 1UQY_A 1UR2_A 1UR1_A 2CNC_A 1OD8_A 1E0W_A 1E0V_A 1V0M_A 1E0X_B ....
Probab=38.61 E-value=20 Score=27.72 Aligned_cols=22 Identities=27% Similarity=0.539 Sum_probs=18.7
Q ss_pred eeecCCHHHHHHHHHHcCccEE
Q psy627 39 ALNFSSKEDAIQYCQKNGWKFF 60 (89)
Q Consensus 39 ~l~F~s~e~Aiayae~~G~~Y~ 60 (89)
++.|.+.++.++||+++|+...
T Consensus 55 ~~~~~~~D~~~~~a~~~g~~vr 76 (320)
T PF00331_consen 55 RFNFESADAILDWARENGIKVR 76 (320)
T ss_dssp BEE-HHHHHHHHHHHHTT-EEE
T ss_pred ccCccchhHHHHHHHhcCccee
Confidence 7899999999999999999986
No 86
>PRK08179 prfH peptide chain release factor-like protein; Reviewed
Probab=38.60 E-value=25 Score=26.24 Aligned_cols=17 Identities=18% Similarity=0.303 Sum_probs=14.6
Q ss_pred HHHHHHcCccEEEeccC
Q psy627 49 IQYCQKNGWKFFVEEPK 65 (89)
Q Consensus 49 iayae~~G~~Y~V~~p~ 65 (89)
+.||+++||.++|.+-.
T Consensus 27 ~~~a~~~g~~~~ii~~~ 43 (200)
T PRK08179 27 LKEAARQGVRVTVLETE 43 (200)
T ss_pred HHHHHHcCCeEEEEeCC
Confidence 57999999999998743
No 87
>KOG0097|consensus
Probab=38.51 E-value=25 Score=26.13 Aligned_cols=17 Identities=29% Similarity=0.657 Sum_probs=15.0
Q ss_pred CHHHHHHHHHHcCccEE
Q psy627 44 SKEDAIQYCQKNGWKFF 60 (89)
Q Consensus 44 s~e~Aiayae~~G~~Y~ 60 (89)
+-|+|-+|||.||+-|-
T Consensus 135 ~yeeak~faeengl~fl 151 (215)
T KOG0097|consen 135 TYEEAKEFAEENGLMFL 151 (215)
T ss_pred cHHHHHHHHhhcCeEEE
Confidence 56899999999999874
No 88
>cd03027 GRX_DEP Glutaredoxin (GRX) family, Dishevelled, Egl-10, and Pleckstrin (DEP) subfamily; composed of uncharacterized proteins containing a GRX domain and additional domains DEP and DUF547, both of which have unknown functions. GRX is a glutathione (GSH) dependent reductase containing a redox active CXXC motif in a TRX fold. It has preference for mixed GSH disulfide substrates, in which it uses a monothiol mechanism where only the N-terminal cysteine is required. By altering the redox state of target proteins, GRX is involved in many cellular functions.
Probab=38.45 E-value=40 Score=19.79 Aligned_cols=24 Identities=4% Similarity=0.066 Sum_probs=20.0
Q ss_pred cCCHHHHHHHHHHcCccEEEeccC
Q psy627 42 FSSKEDAIQYCQKNGWKFFVEEPK 65 (89)
Q Consensus 42 F~s~e~Aiayae~~G~~Y~V~~p~ 65 (89)
.+....|..+.+++|++|++..-.
T Consensus 11 C~~C~ka~~~L~~~gi~~~~~di~ 34 (73)
T cd03027 11 CEDCTAVRLFLREKGLPYVEINID 34 (73)
T ss_pred ChhHHHHHHHHHHCCCceEEEECC
Confidence 356788999999999999987643
No 89
>COG5453 Uncharacterized conserved protein [Function unknown]
Probab=38.39 E-value=21 Score=24.06 Aligned_cols=18 Identities=33% Similarity=0.536 Sum_probs=16.0
Q ss_pred ecCCHHHHHHHHHHcCcc
Q psy627 41 NFSSKEDAIQYCQKNGWK 58 (89)
Q Consensus 41 ~F~s~e~Aiayae~~G~~ 58 (89)
.|.+++|||+++-+.|.-
T Consensus 68 ~ft~~~DAve~~lkKg~q 85 (96)
T COG5453 68 LFTSLDDAVEIALKKGRQ 85 (96)
T ss_pred hccchhHHHHHHHHHHHH
Confidence 589999999999998865
No 90
>cd02977 ArsC_family Arsenate Reductase (ArsC) family; composed of TRX-fold arsenic reductases and similar proteins including the transcriptional regulator, Spx. ArsC catalyzes the reduction of arsenate [As(V)] to arsenite [As(III)], using reducing equivalents derived from glutathione (GSH) via glutaredoxin (GRX), through a single catalytic cysteine. This family of predominantly bacterial enzymes is unrelated to two other families of arsenate reductases which show similarity to low-molecular-weight acid phosphatases and phosphotyrosyl phosphatases. Spx is a general regulator that exerts negative and positive control over transcription initiation by binding to the C-terminal domain of the alpha subunit of RNA polymerase.
Probab=38.38 E-value=29 Score=22.02 Aligned_cols=22 Identities=14% Similarity=0.314 Sum_probs=18.4
Q ss_pred CCHHHHHHHHHHcCccEEEecc
Q psy627 43 SSKEDAIQYCQKNGWKFFVEEP 64 (89)
Q Consensus 43 ~s~e~Aiayae~~G~~Y~V~~p 64 (89)
++...|.+|.+.+|++|++..-
T Consensus 10 ~~c~ka~~~L~~~~i~~~~idi 31 (105)
T cd02977 10 STSRKALAWLEEHGIEYEFIDY 31 (105)
T ss_pred HHHHHHHHHHHHcCCCcEEEee
Confidence 3567899999999999988764
No 91
>cd07238 Glo_EDI_BRP_like_5 This conserved domain belongs to a superfamily including the bleomycin resistance protein, glyoxalase I, and type I ring-cleaving dioxygenases. This protein family belongs to a conserved domain superfamily that is found in a variety of structurally related metalloproteins, including the bleomycin resistance protein, glyoxalase I, and type I ring-cleaving dioxygenases. A bound metal ion is required for protein activities for the members of this superfamily. A variety of metal ions have been found in the catalytic centers of these proteins including Fe(II), Mn(II), Zn(II), Ni(II) and Mg(II). The protein superfamily contains members with or without domain swapping. The structure of this family is a that of a strand-swapped dimer.
Probab=37.74 E-value=76 Score=19.29 Aligned_cols=26 Identities=12% Similarity=0.316 Sum_probs=22.3
Q ss_pred CceeeecCCHHHHHHHHHHcCccEEE
Q psy627 36 SNLALNFSSKEDAIQYCQKNGWKFFV 61 (89)
Q Consensus 36 ~qv~l~F~s~e~Aiayae~~G~~Y~V 61 (89)
..+.+.-++.+++++.+++.|+.+..
T Consensus 59 ~~i~~~v~d~~~~~~~l~~~G~~~~~ 84 (112)
T cd07238 59 PDLSIEVDDVDAALARAVAAGFAIVY 84 (112)
T ss_pred CEEEEEeCCHHHHHHHHHhcCCeEec
Confidence 46788889999999999999998653
No 92
>cd02970 PRX_like2 Peroxiredoxin (PRX)-like 2 family; hypothetical proteins that show sequence similarity to PRXs. Members of this group contain a CXXC motif, similar to TRX. The second cysteine in the motif corresponds to the peroxidatic cysteine of PRX, however, these proteins do not contain the other two residues of the catalytic triad of PRX. PRXs confer a protective antioxidant role in cells through their peroxidase activity in which hydrogen peroxide, peroxynitrate, and organic hydroperoxides are reduced and detoxified using reducing equivalents derived from either thioredoxin, glutathione, trypanothione and AhpF. TRXs alter the redox state of target proteins by catalyzing the reduction of their disulfide bonds via the CXXC motif using reducing equivalents derived from either NADPH or ferredoxins.
Probab=37.50 E-value=55 Score=21.00 Aligned_cols=28 Identities=18% Similarity=0.045 Sum_probs=23.4
Q ss_pred eeeecCCHHHHHHHHHHcCccEEEeccC
Q psy627 38 LALNFSSKEDAIQYCQKNGWKFFVEEPK 65 (89)
Q Consensus 38 v~l~F~s~e~Aiayae~~G~~Y~V~~p~ 65 (89)
|.+..++.+++.+|+++++++|.|....
T Consensus 61 v~V~~~~~~~~~~~~~~~~~~~p~~~D~ 88 (149)
T cd02970 61 VAVGPESPEKLEAFDKGKFLPFPVYADP 88 (149)
T ss_pred EEEeCCCHHHHHHHHHhcCCCCeEEECC
Confidence 4677888888999999999999887643
No 93
>cd07415 MPP_PP2A_PP4_PP6 PP2A, PP4, and PP6 phosphoprotein phosphatases, metallophosphatase domain. PP2A-like family of phosphoprotein phosphatases (PPP's) including PP4 and PP6. PP2A (Protein phosphatase 2A) is a critical regulator of many cellular activities. PP2A comprises about 1% of total cellular proteins. PP2A, together with protein phosphatase 1 (PP1), accounts for more than 90% of all serine/threonine phosphatase activities in most cells and tissues. The PP2A subunit in addition to having a catalytic domain homologous to PP1, has a unique C-terminal tail, containing a motif that is conserved in the catalytic subunits of all PP2A-like phosphatases including PP4 and PP6, and has an important role in PP2A regulation. The PP2A-like family of phosphatases all share a similar heterotrimeric architecture, that includes: a 65kDa scaffolding subunit (A), a 36kDa catalytic subunit (C), and one of 18 regulatory subunits (B). The PPP (phosphoprotein phosphatase) family, to which PP2
Probab=37.38 E-value=67 Score=24.74 Aligned_cols=26 Identities=12% Similarity=0.296 Sum_probs=21.1
Q ss_pred ceeeecCCHHHHHHHHHHcCccEEEec
Q psy627 37 NLALNFSSKEDAIQYCQKNGWKFFVEE 63 (89)
Q Consensus 37 qv~l~F~s~e~Aiayae~~G~~Y~V~~ 63 (89)
.....|. .+++.+|++++|+.+.|..
T Consensus 207 g~g~~fg-~~~~~~Fl~~n~l~~iiR~ 232 (285)
T cd07415 207 GAGYLFG-QDVVEEFNHNNGLTLICRA 232 (285)
T ss_pred CCccccC-HHHHHHHHHHCCCeEEEEc
Confidence 3455665 7899999999999999865
No 94
>PF14542 Acetyltransf_CG: GCN5-related N-acetyl-transferase; PDB: 2H5M_A 2Q44_A 1XMT_A 2Q4Y_A 2IL4_A 2EVN_A 1R57_A.
Probab=36.99 E-value=33 Score=21.25 Aligned_cols=18 Identities=33% Similarity=0.554 Sum_probs=13.8
Q ss_pred HHHHHHHHHHcCccEEEe
Q psy627 45 KEDAIQYCQKNGWKFFVE 62 (89)
Q Consensus 45 ~e~Aiayae~~G~~Y~V~ 62 (89)
.++|++||+++|+.....
T Consensus 44 ~~~~l~~a~~~~~kv~p~ 61 (78)
T PF14542_consen 44 VEAALDYARENGLKVVPT 61 (78)
T ss_dssp HHHHHHHHHHTT-EEEET
T ss_pred HHHHHHHHHHCCCEEEEE
Confidence 378999999999986543
No 95
>PF10009 DUF2252: Uncharacterized protein conserved in bacteria (DUF2252); InterPro: IPR018721 This domain has no known function.
Probab=36.87 E-value=25 Score=28.29 Aligned_cols=38 Identities=18% Similarity=0.430 Sum_probs=24.8
Q ss_pred CccCCCCCCCCCCCC------CCceeeecC----CHHHHHHHHHHcCc
Q psy627 20 RWENPLMGWCSTGDP------LSNLALNFS----SKEDAIQYCQKNGW 57 (89)
Q Consensus 20 ~~~dPLMGWtss~D~------~~qv~l~F~----s~e~Aiayae~~G~ 57 (89)
.-.||++||++.++- ++..+.+++ +.++.+.||+-.|+
T Consensus 288 ~~sD~~LG~~~~~g~~f~VRql~p~k~~~d~~~l~~~~~~~~a~~~g~ 335 (385)
T PF10009_consen 288 AASDPFLGWVTIDGRDFYVRQLRPMKGSVDLEDLDPEELQDYARLCGR 335 (385)
T ss_pred hcCCcccCceeeCCceEEEEECCCcCCCcChhhCCHHHHHHHHHHHHH
Confidence 345999999998762 122222222 67888888887765
No 96
>PF07611 DUF1574: Protein of unknown function (DUF1574); InterPro: IPR011468 This is a family of hypothetical proteins found in Leptospira interrogans and other bacteria.
Probab=36.72 E-value=20 Score=28.82 Aligned_cols=27 Identities=22% Similarity=0.219 Sum_probs=22.7
Q ss_pred eecCCHHHHHHHHHHcCccEEEeccCC
Q psy627 40 LNFSSKEDAIQYCQKNGWKFFVEEPKW 66 (89)
Q Consensus 40 l~F~s~e~Aiayae~~G~~Y~V~~p~~ 66 (89)
+.|.-.|++++.||++||++.|.-|.-
T Consensus 249 ~q~~F~e~~L~~ake~~I~~vl~~P~V 275 (345)
T PF07611_consen 249 TQFFFLEKFLKLAKENGIPVVLWWPKV 275 (345)
T ss_pred hHHHHHHHHHHHHHHcCCcEEEEEecc
Confidence 445568999999999999999988863
No 97
>PF02811 PHP: PHP domain; InterPro: IPR004013 The PHP (Polymerase and Histidinol Phosphatase) domain is a putative phosphoesterase domain. This family is often associated with an N-terminal region IPR003141 from INTERPRO.; GO: 0003824 catalytic activity; PDB: 2WJE_A 3QY8_A 2WJD_A 2WJF_A 1PB0_B 1M68_A 1M65_A 3E38_B 2W9M_A 3E0F_A ....
Probab=36.27 E-value=40 Score=22.24 Aligned_cols=23 Identities=13% Similarity=0.228 Sum_probs=21.0
Q ss_pred CCHHHHHHHHHHcCccEEEeccC
Q psy627 43 SSKEDAIQYCQKNGWKFFVEEPK 65 (89)
Q Consensus 43 ~s~e~Aiayae~~G~~Y~V~~p~ 65 (89)
.+.|+.|+.|+++|+......++
T Consensus 16 ~~~~e~v~~A~~~Gl~~i~iTDH 38 (175)
T PF02811_consen 16 DSPEEYVEQAKEKGLDAIAITDH 38 (175)
T ss_dssp SSHHHHHHHHHHTTESEEEEEEE
T ss_pred CCHHHHHHHHHHcCCCEEEEcCC
Confidence 48999999999999999888877
No 98
>TIGR03798 ocin_TIGR03798 bacteriocin propeptide, TIGR03798 family. This model describes a conserved, fairly long (about 65 residue) propeptide region for a family of putative microcins, that is, bacteriocins of small size. Members of the seed alignment tend to have the Gly-Gly motif as the last two residues of the matched region. This is a cleavage site for a combination processing/export ABC transporter with a peptidase domain.
Probab=36.20 E-value=17 Score=21.80 Aligned_cols=22 Identities=23% Similarity=0.453 Sum_probs=18.9
Q ss_pred cCCHHHHHHHHHHcCccEEEec
Q psy627 42 FSSKEDAIQYCQKNGWKFFVEE 63 (89)
Q Consensus 42 F~s~e~Aiayae~~G~~Y~V~~ 63 (89)
..+.++-|++|+.+|+.+...+
T Consensus 24 ~~~~e~~~~lA~~~Gf~ft~~e 45 (64)
T TIGR03798 24 AEDPEDRVAIAKEAGFEFTGED 45 (64)
T ss_pred cCCHHHHHHHHHHcCCCCCHHH
Confidence 4789999999999999987654
No 99
>PF11823 DUF3343: Protein of unknown function (DUF3343); InterPro: IPR021778 This family of proteins are functionally uncharacterised. This protein is found in bacteria and archaea. Proteins in this family are typically between 78 to 102 amino acids in length.
Probab=36.11 E-value=48 Score=20.05 Aligned_cols=24 Identities=13% Similarity=0.081 Sum_probs=19.1
Q ss_pred eeecCCHHHHHHHHHHcCccEEEe
Q psy627 39 ALNFSSKEDAIQYCQKNGWKFFVE 62 (89)
Q Consensus 39 ~l~F~s~e~Aiayae~~G~~Y~V~ 62 (89)
++.....|.+++..+++|+.|+=.
T Consensus 46 ~~~~~d~~~i~~~l~~~~i~~~~i 69 (73)
T PF11823_consen 46 RFEPEDLEKIKEILEENGIEYEGI 69 (73)
T ss_pred EEChhhHHHHHHHHHHCCCCeeEE
Confidence 444477999999999999998643
No 100
>cd04438 DEP_dishevelled DEP (Dishevelled, Egl-10, and Pleckstrin) domain found in dishevelled-like proteins. Dishevelled-like proteins play a key role in the transduction of the Wnt signal from the cell surface to the nucleus, which in turn is an important regulatory pathway for cellular development and growth. They contain an N-terminal DIX domain, a central PDZ domain, and a C-terminal DEP domain.
Probab=35.92 E-value=45 Score=21.35 Aligned_cols=24 Identities=17% Similarity=0.301 Sum_probs=19.4
Q ss_pred cCCHHHHHHHHH---HcCccEEEeccC
Q psy627 42 FSSKEDAIQYCQ---KNGWKFFVEEPK 65 (89)
Q Consensus 42 F~s~e~Aiayae---~~G~~Y~V~~p~ 65 (89)
..|+++|+.||. ++|+=+.|...+
T Consensus 46 ~~~R~eAv~~g~~Ll~~G~i~HV~~~h 72 (84)
T cd04438 46 LTDRREARKYASSLLKLGYIRHTVNKI 72 (84)
T ss_pred CCCHHHHHHHHHHHHHCCcEEecCCCc
Confidence 479999999998 688888886543
No 101
>KOG0098|consensus
Probab=35.89 E-value=30 Score=26.44 Aligned_cols=18 Identities=28% Similarity=0.560 Sum_probs=16.2
Q ss_pred CCHHHHHHHHHHcCccEE
Q psy627 43 SSKEDAIQYCQKNGWKFF 60 (89)
Q Consensus 43 ~s~e~Aiayae~~G~~Y~ 60 (89)
=|+||-.+||+.||+-|-
T Consensus 129 Vs~EEGeaFA~ehgLifm 146 (216)
T KOG0098|consen 129 VSKEEGEAFAREHGLIFM 146 (216)
T ss_pred ccHHHHHHHHHHcCceee
Confidence 478999999999999885
No 102
>PF01910 DUF77: Domain of unknown function DUF77; InterPro: IPR002767 This entry contains several hypothetical proteins of unknown function found in archaebacteria, eukaryotes and eubacteria. The structures of YBL001c from Saccharomyces cerevisiae and its homologue MTH1187 from the archaea Methanobacterium thermoautotrophicum have been determined []. These proteins have a ferredoxin-like alpha/beta sandwich structure with anti-parallel beta-sheets. Generally, they have two domains that form a single beta-sheet dimer, where two dimers pack sheet-to-sheet into a tetramer, some proteins having an extra C-terminal helix. ; PDB: 1LXJ_A 1YQH_A 2EKY_G 2EPI_A 1VK8_D 2IBO_C 1LXN_B.
Probab=35.87 E-value=34 Score=22.17 Aligned_cols=18 Identities=28% Similarity=0.457 Sum_probs=15.9
Q ss_pred HHHHHHHHHcCccEEEec
Q psy627 46 EDAIQYCQKNGWKFFVEE 63 (89)
Q Consensus 46 e~Aiayae~~G~~Y~V~~ 63 (89)
.+||++.++.|++|+|-.
T Consensus 20 ~~~i~~i~~sgl~y~v~p 37 (92)
T PF01910_consen 20 AEAIEVIKESGLKYEVGP 37 (92)
T ss_dssp HHHHHHHHTSSSEEEEET
T ss_pred HHHHHHHHHcCCceEEcC
Confidence 689999999999999853
No 103
>PRK12354 carbamate kinase; Reviewed
Probab=35.75 E-value=32 Score=27.22 Aligned_cols=22 Identities=14% Similarity=0.376 Sum_probs=19.0
Q ss_pred ecCCHHHHHHHHHHcCccEEEe
Q psy627 41 NFSSKEDAIQYCQKNGWKFFVE 62 (89)
Q Consensus 41 ~F~s~e~Aiayae~~G~~Y~V~ 62 (89)
.|-|+|+|.+.++++||.+...
T Consensus 125 ~~y~~~~a~~~~~e~g~~~~~d 146 (307)
T PRK12354 125 PVYDEAEAERLAAEKGWTIKPD 146 (307)
T ss_pred cccCHHHHHHHHHhcCCEEeec
Confidence 5889999999999999976554
No 104
>cd04908 ACT_Bt0572_1 N-terminal ACT domain of a novel protein composed almost entirely of two tandem ACT domains. Included in this CD is the N-terminal ACT domain of a novel protein composed almost entirely of two tandem ACT domains as seen in the uncharacterized structure (pdb 2F06) of the Bt0572 protein from Bacteroides thetaiotaomicron and related ACT domains. These tandem ACT domain proteins belong to the superfamily of ACT regulatory domains.
Probab=35.37 E-value=58 Score=18.82 Aligned_cols=24 Identities=21% Similarity=0.254 Sum_probs=20.6
Q ss_pred CceeeecCCHHHHHHHHHHcCccE
Q psy627 36 SNLALNFSSKEDAIQYCQKNGWKF 59 (89)
Q Consensus 36 ~qv~l~F~s~e~Aiayae~~G~~Y 59 (89)
.-++|.+++.|.|++--+++|+..
T Consensus 41 ~~~rl~~~~~~~~~~~L~~~G~~v 64 (66)
T cd04908 41 GILRLIVSDPDKAKEALKEAGFAV 64 (66)
T ss_pred CEEEEEECCHHHHHHHHHHCCCEE
Confidence 567899999999999999999764
No 105
>PF03129 HGTP_anticodon: Anticodon binding domain; InterPro: IPR004154 tRNA synthetases, or tRNA ligases are involved in protein synthesis. This domain is found in histidyl, glycyl, threonyl and prolyl tRNA synthetases [] it is probably the anticodon binding domain [].; GO: 0004812 aminoacyl-tRNA ligase activity, 0005524 ATP binding; PDB: 1KOG_B 1EVL_D 1EVK_B 1QF6_A 1FYF_B 2I4O_A 2I4M_A 2I4N_A 2I4L_A 1HC7_D ....
Probab=35.33 E-value=55 Score=19.89 Aligned_cols=22 Identities=27% Similarity=0.304 Sum_probs=18.6
Q ss_pred cCCHHHHHHHHHHcCccEEEec
Q psy627 42 FSSKEDAIQYCQKNGWKFFVEE 63 (89)
Q Consensus 42 F~s~e~Aiayae~~G~~Y~V~~ 63 (89)
=.+.-..+.+|++.|++|.|.-
T Consensus 38 ~~~~~k~~~~a~~~g~p~~iii 59 (94)
T PF03129_consen 38 DKSLGKQIKYADKLGIPFIIII 59 (94)
T ss_dssp SSTHHHHHHHHHHTTESEEEEE
T ss_pred CCchhHHHHHHhhcCCeEEEEE
Confidence 3567789999999999998865
No 106
>cd03412 CbiK_N Anaerobic cobalamin biosynthetic cobalt chelatase (CbiK), N-terminal domain. CbiK is part of the cobalt-early path for cobalamin biosynthesis. It catalyzes the insertion of cobalt into the oxidized form of precorrin-2, factor II (sirohydrochlorin), the second step of the anaerobic branch of vitamin B12 biosynthesis. CbiK belongs to the class II family of chelatases and is a homomeric enzyme that does not require ATP for its enzymatic activity.
Probab=35.16 E-value=49 Score=22.21 Aligned_cols=26 Identities=12% Similarity=0.293 Sum_probs=23.2
Q ss_pred eecCCHHHHHHHHHHcCccEEEeccC
Q psy627 40 LNFSSKEDAIQYCQKNGWKFFVEEPK 65 (89)
Q Consensus 40 l~F~s~e~Aiayae~~G~~Y~V~~p~ 65 (89)
+.++|.++|++-+.++|+.=.|..|-
T Consensus 53 ~~~p~~~eaL~~l~~~G~~~V~V~Pl 78 (127)
T cd03412 53 IEVDTPEEALAKLAADGYTEVIVQSL 78 (127)
T ss_pred CCCCCHHHHHHHHHHCCCCEEEEEeC
Confidence 56899999999999999999888874
No 107
>cd00859 HisRS_anticodon HisRS Histidyl-anticodon binding domain. HisRS belongs to class II aminoacyl-tRNA synthetases (aaRS). This alignment contains the anticodon binding domain, which is responsible for specificity in tRNA-binding, so that the activated amino acid is transferred to a ribose 3' OH group of the appropriate tRNA only.
Probab=35.16 E-value=56 Score=18.83 Aligned_cols=20 Identities=20% Similarity=0.386 Sum_probs=17.3
Q ss_pred CHHHHHHHHHHcCccEEEec
Q psy627 44 SKEDAIQYCQKNGWKFFVEE 63 (89)
Q Consensus 44 s~e~Aiayae~~G~~Y~V~~ 63 (89)
+....+.||++.|+++.|..
T Consensus 39 ~~~~~~~~a~~~~~~~~i~i 58 (91)
T cd00859 39 KLKKQFKYADRSGARFAVIL 58 (91)
T ss_pred CHHHHHHHHHHcCCCEEEEE
Confidence 67899999999999986654
No 108
>PRK06806 fructose-bisphosphate aldolase; Provisional
Probab=34.92 E-value=47 Score=25.59 Aligned_cols=26 Identities=8% Similarity=0.257 Sum_probs=23.0
Q ss_pred ceeeecCCHHHHHHHHHHcCccEEEe
Q psy627 37 NLALNFSSKEDAIQYCQKNGWKFFVE 62 (89)
Q Consensus 37 qv~l~F~s~e~Aiayae~~G~~Y~V~ 62 (89)
.+.+.+.|.|+|++|+++-|.+|.-.
T Consensus 147 ~~g~s~t~~eea~~f~~~tg~DyLAv 172 (281)
T PRK06806 147 DIEMLLTSTTEAKRFAEETDVDALAV 172 (281)
T ss_pred cccceeCCHHHHHHHHHhhCCCEEEE
Confidence 34677899999999999999999887
No 109
>cd07417 MPP_PP5_C PP5, C-terminal metallophosphatase domain. Serine/threonine protein phosphatase-5 (PP5) is a member of the PPP gene family of protein phosphatases that is highly conserved among eukaryotes and widely expressed in mammalian tissues. PP5 has a C-terminal phosphatase domain and an extended N-terminal TPR (tetratricopeptide repeat) domain containing three TPR motifs. The PPP (phosphoprotein phosphatase) family, to which PP5 belongs, is one of two known protein phosphatase families specific for serine and threonine. The PPP family also includes: PP1, PP2A, PP2B (calcineurin), PP4, PP6, PP7, Bsu1, RdgC, PrpE, PrpA/PrpB, and ApA4 hydrolase. The PPP catalytic domain is defined by three conserved motifs (-GDXHG-, -GDXVDRG- and -GNHE-). The PPP enzyme family is ancient with members found in all eukaryotes, and in most bacterial and archeal genomes. Dephosphorylation of phosphoserines and phosphothreonines on target proteins plays a central role in the regulation of many cel
Probab=34.57 E-value=83 Score=24.66 Aligned_cols=26 Identities=8% Similarity=0.360 Sum_probs=20.8
Q ss_pred eeeecCCHHHHHHHHHHcCccEEEecc
Q psy627 38 LALNFSSKEDAIQYCQKNGWKFFVEEP 64 (89)
Q Consensus 38 v~l~F~s~e~Aiayae~~G~~Y~V~~p 64 (89)
+...| ..+++.+|+++||+.+.|..-
T Consensus 227 ~g~~f-g~~~~~~Fl~~n~l~~iiR~H 252 (316)
T cd07417 227 VGCQF-GPDVTKRFLEENNLEYIIRSH 252 (316)
T ss_pred CceEe-CHHHHHHHHHHcCCcEEEECC
Confidence 45566 677788999999999998653
No 110
>PRK00509 argininosuccinate synthase; Provisional
Probab=34.49 E-value=32 Score=28.09 Aligned_cols=19 Identities=26% Similarity=0.627 Sum_probs=16.7
Q ss_pred ecCCHHHHHHHHHHcCccE
Q psy627 41 NFSSKEDAIQYCQKNGWKF 59 (89)
Q Consensus 41 ~F~s~e~Aiayae~~G~~Y 59 (89)
.|.|+++-++||+++|++.
T Consensus 148 ~~~tK~eir~~A~~~Gipv 166 (399)
T PRK00509 148 DLKSREELIAYAEEHGIPI 166 (399)
T ss_pred CCCCHHHHHHHHHHcCCCC
Confidence 3559999999999999975
No 111
>cd03018 PRX_AhpE_like Peroxiredoxin (PRX) family, AhpE-like subfamily; composed of proteins similar to Mycobacterium tuberculosis AhpE. AhpE is described as a 1-cys PRX because of the absence of a resolving cysteine. The structure and sequence of AhpE, however, show greater similarity to 2-cys PRXs than 1-cys PRXs. PRXs are thiol-specific antioxidant (TSA) proteins that confer a protective role in cells through their peroxidase activity in which hydrogen peroxide, peroxynitrate, and organic hydroperoxides are reduced and detoxified using reducing equivalents derived from either thioredoxin, glutathione, trypanothione and AhpF. The first step of catalysis is the nucleophilic attack by the peroxidatic cysteine on the peroxide leading to the formation of a cysteine sulfenic acid intermediate. The absence of a resolving cysteine suggests that functional AhpE is regenerated by an external reductant. The solution behavior and crystal structure of AhpE show that it forms dimers and octamers.
Probab=34.48 E-value=61 Score=21.01 Aligned_cols=27 Identities=19% Similarity=0.276 Sum_probs=23.0
Q ss_pred eeeecCCHHHHHHHHHHcCccEEEecc
Q psy627 38 LALNFSSKEDAIQYCQKNGWKFFVEEP 64 (89)
Q Consensus 38 v~l~F~s~e~Aiayae~~G~~Y~V~~p 64 (89)
+.+..++.+++.+|++++++.|.+...
T Consensus 66 i~vs~d~~~~~~~~~~~~~~~~~~~~D 92 (149)
T cd03018 66 LGISVDSPFSLRAWAEENGLTFPLLSD 92 (149)
T ss_pred EEecCCCHHHHHHHHHhcCCCceEecC
Confidence 356778999999999999999998764
No 112
>PRK03745 signal recognition particle protein Srp19; Provisional
Probab=34.16 E-value=1e+02 Score=20.74 Aligned_cols=24 Identities=21% Similarity=0.396 Sum_probs=19.3
Q ss_pred eeecCCHHHHHHHHHHcCccEE-Ee
Q psy627 39 ALNFSSKEDAIQYCQKNGWKFF-VE 62 (89)
Q Consensus 39 ~l~F~s~e~Aiayae~~G~~Y~-V~ 62 (89)
.+.=++.++=++-|++.|++|. |+
T Consensus 28 aV~~P~~~EI~~a~~~lgl~~~~~E 52 (100)
T PRK03745 28 AVERPTLEEIVDAAEALGFKVIEVD 52 (100)
T ss_pred cccCCCHHHHHHHHHHcCCCcEeec
Confidence 3445788999999999999995 54
No 113
>cd01992 PP-ATPase N-terminal domain of predicted ATPase of the PP-loop faimly implicated in cell cycle control [Cell division and chromosome partitioning]. This is a subfamily of Adenine nucleotide alpha hydrolases superfamily.Adeninosine nucleotide alpha hydrolases superfamily includes N type ATP PPases and ATP sulphurylases. It forms a apha/beta/apha fold which binds to Adenosine group. This domain has a strongly conserved motif SGGXD at the N terminus.
Probab=34.03 E-value=44 Score=22.76 Aligned_cols=22 Identities=14% Similarity=0.332 Sum_probs=18.7
Q ss_pred cCCHHHHHHHHHHcCccEEEec
Q psy627 42 FSSKEDAIQYCQKNGWKFFVEE 63 (89)
Q Consensus 42 F~s~e~Aiayae~~G~~Y~V~~ 63 (89)
.=+++|-++||+.+|+++....
T Consensus 146 ~~~k~eI~~~~~~~~l~~~~~~ 167 (185)
T cd01992 146 GITRAEIEAYLRENGLPWWEDP 167 (185)
T ss_pred CCCHHHHHHHHHHcCCCeEECC
Confidence 4489999999999999987643
No 114
>PF13669 Glyoxalase_4: Glyoxalase/Bleomycin resistance protein/Dioxygenase superfamily; PDB: 3RMU_B 3ISQ_A 1JC5_D 1JC4_D 3HDP_A 2QH0_A 3GM5_A 3OA4_A 3CT8_A.
Probab=33.95 E-value=36 Score=21.34 Aligned_cols=32 Identities=19% Similarity=0.399 Sum_probs=26.8
Q ss_pred CCCCCceeeecCCHHHHHHHHHHcCccEEEec
Q psy627 32 GDPLSNLALNFSSKEDAIQYCQKNGWKFFVEE 63 (89)
Q Consensus 32 ~D~~~qv~l~F~s~e~Aiayae~~G~~Y~V~~ 63 (89)
+.-+..|.+.-++.|+|++.++.+|+...-..
T Consensus 66 ~~gi~Hia~~v~D~d~~~~~l~~~G~~~~~~~ 97 (109)
T PF13669_consen 66 GGGIHHIAFEVDDLDAAIARLEAQGFRVLDEG 97 (109)
T ss_dssp SSEEEEEEEEESHHHHHHHHHHHTTECEEECE
T ss_pred CCCEEEEEEEeCCHHHHHHHHHHCCCEEcccC
Confidence 44466788999999999999999999987544
No 115
>cd07419 MPP_Bsu1_C Arabidopsis thaliana Bsu1 phosphatase and related proteins, C-terminal metallophosphatase domain. Bsu1 encodes a nuclear serine-threonine protein phosphatase found in plants and protozoans. Bsu1 has a C-terminal phosphatase domain and an N-terminal Kelch-repeat domain. Bsu1 is preferentially expressed in elongating plant cells. It modulates the phosphorylation state of Bes1, a transcriptional regulator phosphorylated by the glycogen synthase kinase Bin2, as part of a steroid hormone signal transduction pathway. The PPP (phosphoprotein phosphatase) family, to which Bsu1 belongs, is one of two known protein phosphatase families specific for serine and threonine. The PPP family also includes: PP1, PP2A, PP2B (calcineurin), PP4, PP5, PP6, PP7, Bsu1, RdgC, PrpE, PrpA/PrpB, and ApA4 hydrolase. The PPP catalytic domain is defined by three conserved motifs (-GDXHG-, -GDXVDRG- and -GNHE-). The PPP enzyme family is ancient with members found in all eukaryotes, and in most
Probab=33.85 E-value=84 Score=24.39 Aligned_cols=23 Identities=4% Similarity=0.172 Sum_probs=19.0
Q ss_pred cCCHHHHHHHHHHcCccEEEecc
Q psy627 42 FSSKEDAIQYCQKNGWKFFVEEP 64 (89)
Q Consensus 42 F~s~e~Aiayae~~G~~Y~V~~p 64 (89)
+=..+++-+||+++|+.+.|..-
T Consensus 239 ~fg~~~~~~Fl~~n~l~~iiRgH 261 (311)
T cd07419 239 KFGPDRVHRFLEENDLQMIIRAH 261 (311)
T ss_pred eECHHHHHHHHHHCCCeEEEEec
Confidence 34678889999999999998653
No 116
>PF12123 Amidase02_C: N-acetylmuramoyl-l-alanine amidase; InterPro: IPR021976 This domain is found in bacteria and viruses. This domain is about 50 amino acids in length. This domain is classified with the enzyme classification code 3.5.1.28 from EC. This domain is the C-terminal of the enzyme which hydrolyses the link between N-acetylmuramoyl residues and L-amino acid residues in certain cell-wall glycopeptides. ; PDB: 2L48_B.
Probab=33.73 E-value=56 Score=19.02 Aligned_cols=18 Identities=17% Similarity=0.466 Sum_probs=13.7
Q ss_pred CHHHHHHHHHHcCccEEE
Q psy627 44 SKEDAIQYCQKNGWKFFV 61 (89)
Q Consensus 44 s~e~Aiayae~~G~~Y~V 61 (89)
.++.+-+|-.++||-|+|
T Consensus 28 ~L~k~~~wld~rgWwYe~ 45 (45)
T PF12123_consen 28 ELDKFTAWLDERGWWYEV 45 (45)
T ss_dssp HHHHHHHHHHHTT--EEE
T ss_pred HHHHHHHHHHhcCcEEeC
Confidence 467888999999999987
No 117
>cd07249 MMCE Methylmalonyl-CoA epimerase (MMCE). MMCE, also called methylmalonyl-CoA racemase (EC 5.1.99.1) interconverts (2R)-methylmalonyl-CoA and (2S)-methylmalonyl-CoA. MMCE has been found in bacteria, archaea, and in animals. In eukaryotes, MMCE is an essential enzyme in a pathway that converts propionyl-CoA to succinyl-CoA, and is important in the breakdown of odd-chain length fatty acids, branched-chain amino acids, and other metabolites. In bacteria, MMCE participates in the reverse pathway for propionate fermentation, glyoxylate regeneration, and the biosynthesis of polyketide antibiotics. MMCE is closely related to glyoxalase I and type I extradiol dioxygenases.
Probab=33.57 E-value=66 Score=19.62 Aligned_cols=24 Identities=13% Similarity=0.317 Sum_probs=21.3
Q ss_pred CceeeecCCHHHHHHHHHHcCccE
Q psy627 36 SNLALNFSSKEDAIQYCQKNGWKF 59 (89)
Q Consensus 36 ~qv~l~F~s~e~Aiayae~~G~~Y 59 (89)
..+.+..++.+++++.+.+.|+.+
T Consensus 74 ~h~~f~v~d~~~~~~~l~~~G~~~ 97 (128)
T cd07249 74 HHIAFEVDDIDAALARLKAQGVRL 97 (128)
T ss_pred EEEEEEeCCHHHHHHHHHHCCCee
Confidence 447888999999999999999975
No 118
>TIGR02432 lysidine_TilS_N tRNA(Ile)-lysidine synthetase, N-terminal domain. The only examples in which the wobble position of a tRNA must discriminate between G and A of mRNA are AUA (Ile) vs. AUG (Met) and UGA (stop) vs. UGG (Trp). In all bacteria, the wobble position of the tRNA(Ile) recognizing AUA is lysidine, a lysine derivative of cytidine. This family describes a protein domain found, apparently, in all bacteria in a single copy. Eukaryotic sequences appear to be organellar. The domain archictecture of this protein family is variable; some, including characterized proteins of E. coli and B. subtilis known to be tRNA(Ile)-lysidine synthetase, include a conserved 50-residue domain that many other members lack. This protein belongs to the ATP-binding PP-loop family ( pfam01171). It appears in the literature and protein databases as TilS, YacA, and putative cell cycle protein MesJ (a misnomer).
Probab=33.56 E-value=48 Score=22.81 Aligned_cols=20 Identities=20% Similarity=0.296 Sum_probs=17.4
Q ss_pred HHHHHHHHHHcCccEEEecc
Q psy627 45 KEDAIQYCQKNGWKFFVEEP 64 (89)
Q Consensus 45 ~e~Aiayae~~G~~Y~V~~p 64 (89)
.+.|.++|+..|+++.+..-
T Consensus 47 ~~~~~~~~~~~gi~~~~~~~ 66 (189)
T TIGR02432 47 AEFVQQFCKKLNIPLEIKKV 66 (189)
T ss_pred HHHHHHHHHHcCCCEEEEEe
Confidence 68899999999999988763
No 119
>PF01329 Pterin_4a: Pterin 4 alpha carbinolamine dehydratase; InterPro: IPR001533 DCoH is the dimerisation cofactor of hepatocyte nuclear factor 1 (HNF-1) that functions as both a transcriptional coactivator and a pterin dehydratase []. X-ray crystallographic studies have shown that the ligand binds at four sites per tetrameric enzyme, with little apparent conformational change in the protein.; GO: 0008124 4-alpha-hydroxytetrahydrobiopterin dehydratase activity, 0006729 tetrahydrobiopterin biosynthetic process; PDB: 2V6T_B 2V6U_A 2V6S_B 2EBB_A 1USM_A 1F93_B 1DCP_C 1DCH_E 3HXA_E 1DCO_C ....
Probab=33.54 E-value=38 Score=21.78 Aligned_cols=32 Identities=19% Similarity=0.442 Sum_probs=22.5
Q ss_pred CCCCCCCCCCCCCCceeeecCCHHHHHHHHHH
Q psy627 23 NPLMGWCSTGDPLSNLALNFSSKEDAIQYCQK 54 (89)
Q Consensus 23 dPLMGWtss~D~~~qv~l~F~s~e~Aiayae~ 54 (89)
..|-||.-.++..-.-...|++-..|++|..+
T Consensus 15 ~~l~~W~~~~~~~l~r~f~f~~f~~a~~f~~~ 46 (95)
T PF01329_consen 15 AELPGWKLDGGGRLERTFKFKDFAEAVEFVNR 46 (95)
T ss_dssp HTSTTSEEETSSEEEEEEE-SSHHHHHHHHHH
T ss_pred hcCcCCEECCCCcEEEEEEeCCHHHHHHHHHH
Confidence 36789998873222347899999999998754
No 120
>TIGR03072 release_prfH putative peptide chain release factor H. Members of this protein family are bacterial proteins homologous to peptide chain release factors 1 (RF-1, product of the prfA gene), and 2 (RF-2, product of the prfB gene). The member from Escherichia coli K-12, designated prfH, appears to be a pseudogene. This class I release factor is always found as the downstream gene of a two-gene operon.
Probab=33.25 E-value=34 Score=25.51 Aligned_cols=17 Identities=18% Similarity=0.130 Sum_probs=14.6
Q ss_pred HHHHHHcCccEEEeccC
Q psy627 49 IQYCQKNGWKFFVEEPK 65 (89)
Q Consensus 49 iayae~~G~~Y~V~~p~ 65 (89)
+.||+++||.++|.+-.
T Consensus 26 ~~~a~~~g~~~eii~~~ 42 (200)
T TIGR03072 26 TREAAARGVRVEVLEQE 42 (200)
T ss_pred HHHHHHcCCeEEEEeCC
Confidence 57999999999998743
No 121
>PF04895 DUF651: Archaeal protein of unknown function (DUF651); InterPro: IPR006979 This conserved region is found in the C-terminal region of a number of conserved archaeal proteins of unknown function.
Probab=33.19 E-value=30 Score=23.63 Aligned_cols=22 Identities=18% Similarity=0.425 Sum_probs=17.5
Q ss_pred CCCceeeecCCHHHHHHHHHHc
Q psy627 34 PLSNLALNFSSKEDAIQYCQKN 55 (89)
Q Consensus 34 ~~~qv~l~F~s~e~Aiayae~~ 55 (89)
.+.+--..|+|.++|+.|+...
T Consensus 62 A~~~~p~~f~~l~eAl~~~~~~ 83 (110)
T PF04895_consen 62 AMKGKPEKFETLEEALEYVSSR 83 (110)
T ss_pred HHhCCCcccCCHHHHHHHHHHH
Confidence 3455568999999999998764
No 122
>KOG0369|consensus
Probab=32.85 E-value=40 Score=30.65 Aligned_cols=23 Identities=13% Similarity=0.478 Sum_probs=19.8
Q ss_pred cCCHHHHHHHHHHcCccEEEecc
Q psy627 42 FSSKEDAIQYCQKNGWKFFVEEP 64 (89)
Q Consensus 42 F~s~e~Aiayae~~G~~Y~V~~p 64 (89)
..|.|+|++||+++|++..+...
T Consensus 170 itt~~EA~eF~k~yG~PvI~KAA 192 (1176)
T KOG0369|consen 170 ITTVEEALEFVKEYGLPVIIKAA 192 (1176)
T ss_pred cccHHHHHHHHHhcCCcEEEeec
Confidence 36899999999999999888653
No 123
>PF05159 Capsule_synth: Capsule polysaccharide biosynthesis protein; InterPro: IPR007833 This family includes export proteins involved in capsule polysaccharide biosynthesis, such as KpsS P42218 from SWISSPROT and LipB P57038 from SWISSPROT. Capsule polysaccharide modification protein lipB/A is involved in the phospholipid modification of the capsular polysaccharide and is a strong requirement for its translocation to the cell surface. The capsule of Neisseria meningitidis serogroup B and of other meningococcal serogroups and other Gram-negative bacterial pathogens, are anchored in the outer membrane through a 1,2-diacylglycerol moiety. The lipA and lipB genes are located on the 3' end of the ctr operon. lipA and lipB do not encode proteins responsible for diacylglycerophosphatidic acid substitution of the meningococcal capsule polymer, but they are required for proper translocation and surface expression of the lipidated polymer []. KpsS is an unusual sulphate-modified form of the capsular polysaccharide in Rhizobium loti (Mesorhizobium loti). Many plants, including R. loti, enter into symbiotic relationships with bacteria that allow survival in nutrient-limiting environments. KpsS functions as a fucosyl sulphotransferase in vitro. The kpsS gene product shares no significant amino acid similarity with previously identified sulphotransferases []. Sulphated cell surface polysaccharides are required for optimum nodule formation but limit growth rate and nodule colonisation in M. loti [].; GO: 0000271 polysaccharide biosynthetic process, 0015774 polysaccharide transport
Probab=32.72 E-value=45 Score=24.50 Aligned_cols=23 Identities=17% Similarity=0.143 Sum_probs=18.8
Q ss_pred CCHHHHHHHHHHcCccEEEeccC
Q psy627 43 SSKEDAIQYCQKNGWKFFVEEPK 65 (89)
Q Consensus 43 ~s~e~Aiayae~~G~~Y~V~~p~ 65 (89)
.....|++.|+++|+++.+.|.-
T Consensus 11 ~~~~~~~~~a~~~~i~~~~~E~G 33 (269)
T PF05159_consen 11 PYHRAAIEVAKELGIPVIFFEDG 33 (269)
T ss_pred cHHHHHHHHHHHhCCCEEEEecC
Confidence 34567889999999999998853
No 124
>TIGR00655 PurU formyltetrahydrofolate deformylase. This model describes formyltetrahydrofolate deformylases. The enzyme is a homohexamer. Sequences from a related enzyme formyl tetrahydrofolate-specific enzyme, phosphoribosylglycinamide formyltransferase, serve as an outgroup for phylogenetic analysis. Putative members of this family, scoring below the trusted cutoff, include a sequence from Rhodobacter capsulatus that lacks an otherwise conserved C-terminal region.
Probab=32.57 E-value=54 Score=25.28 Aligned_cols=29 Identities=24% Similarity=0.312 Sum_probs=25.2
Q ss_pred CceeeecCCHHHHHHHHHHcCccEEEecc
Q psy627 36 SNLALNFSSKEDAIQYCQKNGWKFFVEEP 64 (89)
Q Consensus 36 ~qv~l~F~s~e~Aiayae~~G~~Y~V~~p 64 (89)
..|.+-+.++.+|-++|+++||+|.+...
T Consensus 112 ~~i~~visn~~~~~~~A~~~gIp~~~~~~ 140 (280)
T TIGR00655 112 AEIALVISNHEDLRSLVERFGIPFHYIPA 140 (280)
T ss_pred cEEEEEEEcChhHHHHHHHhCCCEEEcCC
Confidence 56788889999999999999999988654
No 125
>COG3584 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=32.42 E-value=25 Score=24.18 Aligned_cols=29 Identities=28% Similarity=0.400 Sum_probs=21.0
Q ss_pred CCCCCCCCCC-----CceeeecCCHHHHHHHHHH
Q psy627 26 MGWCSTGDPL-----SNLALNFSSKEDAIQYCQK 54 (89)
Q Consensus 26 MGWtss~D~~-----~qv~l~F~s~e~Aiayae~ 54 (89)
-||-=..||- .++-|.|+|.+||++|-.+
T Consensus 70 YG~ava~DTGgaIkGNkIDl~~~s~~da~~wG~k 103 (109)
T COG3584 70 YGVAVAEDTGGAIKGNKIDLYMPSERDAINWGRK 103 (109)
T ss_pred ccEEEEeecCCcccCcEEEEEcCcHHHHHhhCce
Confidence 4555556764 4578999999999977443
No 126
>cd01205 WASP WASP-type EVH1 domain. WASP-type EVH1 domain. Wiskott-Aldrich syndrome (WAS) is an X-linked recessive disease, characterized by eczema, immunodeficiency, and thrombocytopenia. The majority of patients with WAS, or a milder version of the disorder, X-linked thrombocytopenia (XLT), have point mutations in the EVH1 domain of WASP (Wiskott-Aldrich syndrome protein). WASP is an actin regulatory protein consisting of an N-terminal EVH1 domain, a basic region, a GTP binding domain, a proline rich region and a WH2 acidic region. Yeast members lack the GTP binding domain. WASP binds a 25 residue proline rich motif from the WASP Interacting Protein (WIP) via its N-terminal EVH1 domain.
Probab=32.41 E-value=48 Score=22.41 Aligned_cols=19 Identities=26% Similarity=0.550 Sum_probs=17.0
Q ss_pred CceeeecCCHHHHHHHHHH
Q psy627 36 SNLALNFSSKEDAIQYCQK 54 (89)
Q Consensus 36 ~qv~l~F~s~e~Aiayae~ 54 (89)
-++.|.|.+.+||-.|-++
T Consensus 82 c~~GL~Fade~EA~~F~k~ 100 (105)
T cd01205 82 CVVGLNFADETEAAEFRKK 100 (105)
T ss_pred cEEEEEECCHHHHHHHHHH
Confidence 5789999999999999776
No 127
>PF10686 DUF2493: Protein of unknown function (DUF2493); InterPro: IPR019627 This entry is represented by Mycobacteriophage D29, Gp61. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. Members of this family are mainly Proteobacteria. The function is not known.
Probab=32.35 E-value=53 Score=20.34 Aligned_cols=19 Identities=11% Similarity=0.104 Sum_probs=15.9
Q ss_pred HHHHHHHHcCccEEEeccC
Q psy627 47 DAIQYCQKNGWKFFVEEPK 65 (89)
Q Consensus 47 ~Aiayae~~G~~Y~V~~p~ 65 (89)
-|..+|+++|++.++..|.
T Consensus 48 iA~~wA~~~gv~~~~~~ad 66 (71)
T PF10686_consen 48 IAARWARERGVPVIRFPAD 66 (71)
T ss_pred HHHHHHHHCCCeeEEeCcC
Confidence 4778999999999887665
No 128
>PF05389 MecA: Negative regulator of genetic competence (MecA); InterPro: IPR008681 Competence is the ability of a cell to take up exogenous DNA from its environment, resulting in transformation. It is widespread among bacteria and is probably an important mechanism for the horizontal transfer of genes. Cells that take up DNA inevitably acquire the nucleotides the DNA consists of, and, because nucleotides are needed for DNA and RNA synthesis and are expensive to synthesise, these may make a significant contribution to the cell's energy budget []. The lateral gene transfer caused by competence also contributes to the genetic diversity that makes evolution possible. DNA usually becomes available by the death and lysis of other cells. Competent bacteria use components of extracellular filaments called type 4 pili to create pores in their membranes and pull DNA through the pores into the cytoplasm. This process, including the development of competence and the expression of the uptake machinery, is regulated in response to cell-cell signalling and/or nutritional conditions []. This family contains several bacterial MecA proteins. In complex media competence development is poor, and there is little or no expression of late competence genes. Overexpression of MecA inhibits comG transcription [, , ]. MecA enables the recognition and targeting of unfolded and aggregated proteins to the ClpC protease or to other proteins involved in proteolysis. Acts negatively in the development of competence by binding ComK and recruiting it to the ClpCP protease. When overexpressed, inhibits sporulation. Also involved in Spx degradation by ClpC. ; PDB: 3JTP_C 2Y1R_O 3PXI_c 3PXG_b 3JTO_D 3JTN_A.
Probab=32.22 E-value=31 Score=25.18 Aligned_cols=21 Identities=24% Similarity=0.265 Sum_probs=18.4
Q ss_pred ceeeecCCHHHHHHHHHHcCc
Q psy627 37 NLALNFSSKEDAIQYCQKNGW 57 (89)
Q Consensus 37 qv~l~F~s~e~Aiayae~~G~ 57 (89)
.+.+.|++.|++|++|+...-
T Consensus 131 ~~v~~F~dfedvI~la~~l~~ 151 (220)
T PF05389_consen 131 TRVFEFDDFEDVIQLAKRLQD 151 (220)
T ss_dssp EEEEEESSHHHHHHHHCHCCH
T ss_pred EEEEEcCCHHHHHHHHhhccc
Confidence 467999999999999998754
No 129
>COG0216 PrfA Protein chain release factor A [Translation, ribosomal structure and biogenesis]
Probab=32.14 E-value=38 Score=27.75 Aligned_cols=19 Identities=21% Similarity=0.484 Sum_probs=15.9
Q ss_pred HHHHHHcCccEEEeccCCC
Q psy627 49 IQYCQKNGWKFFVEEPKWK 67 (89)
Q Consensus 49 iayae~~G~~Y~V~~p~~~ 67 (89)
..||+.+||.++|....+.
T Consensus 139 ~rYAe~kgWk~ei~s~se~ 157 (363)
T COG0216 139 SRYAESKGWKVEILSASES 157 (363)
T ss_pred HHHHHhCCCEEEEeecCcc
Confidence 3899999999999886543
No 130
>COG5079 SAC3 Nuclear protein export factor [Intracellular trafficking and secretion / Cell division and chromosome partitioning]
Probab=32.01 E-value=27 Score=30.39 Aligned_cols=25 Identities=24% Similarity=0.618 Sum_probs=22.1
Q ss_pred eeecCCHHHHHHHHHHcCccEEEec
Q psy627 39 ALNFSSKEDAIQYCQKNGWKFFVEE 63 (89)
Q Consensus 39 ~l~F~s~e~Aiayae~~G~~Y~V~~ 63 (89)
.|.|+++++++.||+-+|++..+..
T Consensus 356 il~f~~~~e~~efckyy~lei~~ed 380 (646)
T COG5079 356 ILDFEEKGEGEEFCKYYGLEIRIED 380 (646)
T ss_pred hccccccchhHHHhhhcceeeeccc
Confidence 5889999999999999999986554
No 131
>cd08344 MhqB_like_N N-terminal domain of MhqB, a type I extradiol dioxygenase, and similar proteins. This subfamily contains the N-terminal, non-catalytic, domain of Burkholderia sp. NF100 MhqB and similar proteins. MhqB is a type I extradiol dioxygenase involved in the catabolism of methylhydroquinone, an intermediate in the degradation of fenitrothion. The purified enzyme has shown extradiol ring cleavage activity toward 3-methylcatechol. Fe2+ was suggested as a cofactor, the same as most other enzymes in the family. Burkholderia sp. NF100 MhqB is encoded on the plasmid pNF1. The type I family of extradiol dioxygenases contains two structurally homologous barrel-shaped domains at the N- and C-terminal. The active-site metal is located in the C-terminal barrel and plays an essential role in the catalytic mechanism.
Probab=31.97 E-value=77 Score=19.51 Aligned_cols=28 Identities=11% Similarity=0.100 Sum_probs=20.9
Q ss_pred CCCCceeeec--CCHHHHHHHHHHcCccEE
Q psy627 33 DPLSNLALNF--SSKEDAIQYCQKNGWKFF 60 (89)
Q Consensus 33 D~~~qv~l~F--~s~e~Aiayae~~G~~Y~ 60 (89)
+.+..+.+.| ++.+++++.++++|+++.
T Consensus 54 ~~~~~~~~~~~~~d~~~~~~~l~~~Gi~~~ 83 (112)
T cd08344 54 KRLAYLSFGIFEDDFAAFARHLEAAGVALA 83 (112)
T ss_pred CceeeEEEEeEhhhHHHHHHHHHHcCCcee
Confidence 3344455555 789999999999999954
No 132
>cd00858 GlyRS_anticodon GlyRS Glycyl-anticodon binding domain. GlyRS belongs to class II aminoacyl-tRNA synthetases (aaRS). This alignment contains the anticodon binding domain, which is responsible for specificity in tRNA-binding, so that the activated amino acid is transferred to a ribose 3' OH group of the appropriate tRNA only.
Probab=31.89 E-value=60 Score=21.20 Aligned_cols=23 Identities=17% Similarity=0.003 Sum_probs=19.3
Q ss_pred eecCCHHHHHHHHHHcCccEEEec
Q psy627 40 LNFSSKEDAIQYCQKNGWKFFVEE 63 (89)
Q Consensus 40 l~F~s~e~Aiayae~~G~~Y~V~~ 63 (89)
+. .+....+.+|++.|++|.|.-
T Consensus 62 ~~-~sl~kqlk~A~k~g~~~~iii 84 (121)
T cd00858 62 DS-GSIGRRYARQDEIGTPFCVTV 84 (121)
T ss_pred CC-CCHHHHHHHhHhcCCCEEEEE
Confidence 45 788999999999999988765
No 133
>PLN03097 FHY3 Protein FAR-RED ELONGATED HYPOCOTYL 3; Provisional
Probab=31.77 E-value=59 Score=29.23 Aligned_cols=28 Identities=18% Similarity=0.400 Sum_probs=24.0
Q ss_pred CceeeecCCHHHH----HHHHHHcCccEEEec
Q psy627 36 SNLALNFSSKEDA----IQYCQKNGWKFFVEE 63 (89)
Q Consensus 36 ~qv~l~F~s~e~A----iayae~~G~~Y~V~~ 63 (89)
..|.|.|+|.|+| .+||.+.|....|..
T Consensus 74 P~vGMeF~S~eeA~~FYn~YA~~~GFsVRi~~ 105 (846)
T PLN03097 74 PLSGMEFESHGEAYSFYQEYARSMGFNTAIQN 105 (846)
T ss_pred CcCCCeECCHHHHHHHHHHHHhhcCceEEeec
Confidence 4688999999999 578999999987754
No 134
>cd08349 BLMA_like Bleomycin binding protein (BLMA) and similar proteins; BLMA confers bleomycin (Bm) resistance by directly binding to Bm. BLMA also called Bleomycin resistance protein, confers Bm resistance by directly binding to Bm. Bm is a glycopeptide antibiotic produced naturally by actinomycetes. It is a potent anti-cancer drug, which acts as a strong DNA-cutting agent, thereby causing cell death. BLMA is produced by actinomycetes to protect themselves against their own lethal compound. BLMA has two identically-folded subdomains, with the same alpha/beta fold; these two halves have no sequence similarity. BLMAs are dimers and each dimer binds to two Bm molecules at the Bm-binding pockets formed at the dimer interface; two Bm molecules are bound per dimer. BLMA belongs to a conserved domain superfamily that is found in a variety of structurally related metalloproteins, including the bleomycin resistance protein, glyoxalase I, and type I ring-cleaving dioxygenases. As for the large
Probab=31.67 E-value=1e+02 Score=18.33 Aligned_cols=29 Identities=10% Similarity=0.195 Sum_probs=22.9
Q ss_pred ceeeecCCHHHHHHHHHHcCccEEEeccC
Q psy627 37 NLALNFSSKEDAIQYCQKNGWKFFVEEPK 65 (89)
Q Consensus 37 qv~l~F~s~e~Aiayae~~G~~Y~V~~p~ 65 (89)
.+.+.-++.+++.+.++++|+.+.+..+.
T Consensus 61 ~~~~~~~~~~~~~~~l~~~G~~~~~~~~~ 89 (112)
T cd08349 61 SVYIEVEDVDALYAELKAKGADLIVYPPE 89 (112)
T ss_pred EEEEEeCCHHHHHHHHHHcCCcceecCcc
Confidence 46677788999999999999986444443
No 135
>cd06575 PASTA_Pbp2x-like_2 PASTA domain of PBP2x-like proteins, second repeat. Penicillin-binding proteins (PBPs) are the major targets for beta-lactam antibiotics, like penicillins and cephalosporins. Beta-lactam antibiotics specifically inhibit transpeptidase activity by acylating the active site serine. PBPs catalyze key steps in the synthesis of the peptidoglycan, such as the interconnecting of glycan chains (polymers of N-glucosamine and N-acetylmuramic acid residues) and the cross-linking (transpeptidation) of short stem peptides, which are attached to glycan chains. Peptidoglycan is essential in cell division and protects bacteria from osmotic shock and lysis. PBP2x is one of the two monofunctional high molecular mass PBPs in Streptococcus pneumoniae and has been seen as the primary PBP target in beta-lactam-resistant strains. The PASTA domain is found at the C-termini of several PBPs and bacterial serine/threonine kinases. The name PASTA is derived from PBP and Serine/Threonine
Probab=31.64 E-value=75 Score=16.66 Aligned_cols=21 Identities=29% Similarity=0.230 Sum_probs=17.9
Q ss_pred CCHHHHHHHHHHcCccEEEec
Q psy627 43 SSKEDAIQYCQKNGWKFFVEE 63 (89)
Q Consensus 43 ~s~e~Aiayae~~G~~Y~V~~ 63 (89)
.+.++|.+..++.|+.+.+..
T Consensus 8 ~~~~~a~~~l~~~g~~~~~~~ 28 (54)
T cd06575 8 WSKRDALKLLELLGLKVKFSG 28 (54)
T ss_pred CCHHHHHHHHHHCCCeEEEee
Confidence 578999999999999987654
No 136
>KOG1769|consensus
Probab=31.47 E-value=26 Score=23.74 Aligned_cols=13 Identities=23% Similarity=0.657 Sum_probs=10.8
Q ss_pred HHHHHHcCccEEE
Q psy627 49 IQYCQKNGWKFFV 61 (89)
Q Consensus 49 iayae~~G~~Y~V 61 (89)
-+||+++|+++.-
T Consensus 48 ~aYc~r~Gl~~~s 60 (99)
T KOG1769|consen 48 KAYCERQGLSMNS 60 (99)
T ss_pred HHHHHHcCCccce
Confidence 4899999998743
No 137
>PRK10291 glyoxalase I; Provisional
Probab=31.40 E-value=68 Score=20.32 Aligned_cols=27 Identities=19% Similarity=0.266 Sum_probs=23.0
Q ss_pred CCceeeecCCHHHHHHHHHHcCccEEE
Q psy627 35 LSNLALNFSSKEDAIQYCQKNGWKFFV 61 (89)
Q Consensus 35 ~~qv~l~F~s~e~Aiayae~~G~~Y~V 61 (89)
...+.+..++.+++++.++.+|+.+..
T Consensus 66 ~~hlaf~V~d~~~~~~~l~~~G~~~~~ 92 (129)
T PRK10291 66 YGHIALSVDNAAEACEKIRQNGGNVTR 92 (129)
T ss_pred eeEEEEEeCCHHHHHHHHHHcCCcccc
Confidence 456888889999999999999998653
No 138
>TIGR00385 dsbE periplasmic protein thiol:disulfide oxidoreductases, DsbE subfamily. Involved in the biogenesis of c-type cytochromes as well as in disulfide bond formation in some periplasmic proteins.
Probab=31.22 E-value=43 Score=23.08 Aligned_cols=47 Identities=13% Similarity=0.239 Sum_probs=27.3
Q ss_pred eeec-CCHHHHHHHHHHcCccEE-E-eccCCC---CCCcccccccCCcCCCcc
Q psy627 39 ALNF-SSKEDAIQYCQKNGWKFF-V-EEPKWK---TPKVKSYAFNFSWNKRTR 85 (89)
Q Consensus 39 ~l~F-~s~e~Aiayae~~G~~Y~-V-~~p~~~---~~~~ksYadNF~~~r~~~ 85 (89)
.+.. ++.+++.+|++++|++|. | .++... .....+.-.+|-.++..+
T Consensus 97 ~V~~~~~~~~~~~~~~~~~~~f~~v~~D~~~~~~~~~~v~~~P~~~~id~~G~ 149 (173)
T TIGR00385 97 GVDYKDQSQNALKFLKELGNPYQAILIDPNGKLGLDLGVYGAPETFLVDGNGV 149 (173)
T ss_pred EEECCCChHHHHHHHHHcCCCCceEEECCCCchHHhcCCeeCCeEEEEcCCce
Confidence 4445 356778999999999986 3 234322 122233334566655444
No 139
>TIGR00269 conserved hypothetical protein TIGR00269.
Probab=31.06 E-value=47 Score=21.68 Aligned_cols=24 Identities=21% Similarity=0.444 Sum_probs=20.4
Q ss_pred eecCCHHHHHHHHHHcCccEEEec
Q psy627 40 LNFSSKEDAIQYCQKNGWKFFVEE 63 (89)
Q Consensus 40 l~F~s~e~Aiayae~~G~~Y~V~~ 63 (89)
|.|-+.++-+.||..+|++|.-.+
T Consensus 7 L~~v~E~ei~~ya~~~~lp~~~~~ 30 (104)
T TIGR00269 7 LRYIPEKEVVLYAFLNELKVHLDE 30 (104)
T ss_pred cccCCHHHHHHHHHHcCCCcCCCC
Confidence 677888999999999999987444
No 140
>COG4862 MecA Negative regulator of genetic competence, sporulation and motility [Posttranslational modification, protein turnover, chaperones / Signal transduction mechanisms / Cell motility and secretion]
Probab=31.03 E-value=42 Score=25.78 Aligned_cols=22 Identities=23% Similarity=0.351 Sum_probs=19.0
Q ss_pred ceeeecCCHHHHHHHHHHcCcc
Q psy627 37 NLALNFSSKEDAIQYCQKNGWK 58 (89)
Q Consensus 37 qv~l~F~s~e~Aiayae~~G~~ 58 (89)
.+.|.|.+.|++|++|+...+.
T Consensus 131 ~~il~F~d~eDvI~lsk~~~~~ 152 (224)
T COG4862 131 YYILRFADFEDVISLSKNLKFE 152 (224)
T ss_pred EEEEEeccHHHHHhHhhhcccc
Confidence 4789999999999999986654
No 141
>cd00914 PCD_DCoH_subfamily_b PCD_DCoH: The bifunctional protein pterin-4alpha-carbinolamine dehydratase (PCD), also known as DCoH (dimerization cofactor of hepatocyte nuclear factor-1), is both a transcription activator and a metabolic enzyme. DCoH stimulates gene expression by associating with specific DNA binding proteins such as HNF-1alpha (hepatocyte nuclear factor-1) and Xenopus enhancer of rudimentary homologue (XERH). DCoH also catalyzes the dehydration of 4alpha- hydroxy- tetrahydrobiopterin (4alpha-OH-BH4) to quinoiddihydrobiopterin, a percursor of the phenylalanine hydroxylase cofactor BH4 (tetrahydrobiopterin). The DCoH homodimer has a saddle-shaped structure similar to that of TBP (TATA binding protein). Two DCoH proteins have been identifed in humans: DCoH1 and DCoH2. Mutations in human DCoH1 cause hyperphenylalaninemia. Loss of enzymic activity of DCoH in humans is associated with the depigmentation disorder vitiligo. DCoH1 has been reported to be overexpessed in colon
Probab=30.99 E-value=60 Score=20.02 Aligned_cols=29 Identities=14% Similarity=0.343 Sum_probs=21.5
Q ss_pred CCCCCCCCC-CceeeecCCHHHHHHHHHHc
Q psy627 27 GWCSTGDPL-SNLALNFSSKEDAIQYCQKN 55 (89)
Q Consensus 27 GWtss~D~~-~qv~l~F~s~e~Aiayae~~ 55 (89)
||.-..++. -.-+..|++-.+|++|+.+-
T Consensus 1 gW~~~~~~~~l~r~f~f~~f~~a~~f~~~v 30 (76)
T cd00914 1 GWTLVDGRDAIHKSFKFKDFNEAFGFMTRV 30 (76)
T ss_pred CCeEeCCCCeEEEEEEeCCHHHHHHHHHHH
Confidence 677766654 23468899999999997663
No 142
>PF00462 Glutaredoxin: Glutaredoxin; InterPro: IPR002109 Glutaredoxins [, , ], also known as thioltransferases (disulphide reductases, are small proteins of approximately one hundred amino-acid residues which utilise glutathione and NADPH as cofactors. Oxidized glutathione is regenerated by glutathione reductase. Together these components compose the glutathione system []. Glutaredoxin functions as an electron carrier in the glutathione-dependent synthesis of deoxyribonucleotides by the enzyme ribonucleotide reductase. Like thioredoxin, which functions in a similar way, glutaredoxin possesses an active centre disulphide bond []. It exists in either a reduced or an oxidized form where the two cysteine residues are linked in an intramolecular disulphide bond. Glutaredoxin has been sequenced in a variety of species. On the basis of extensive sequence similarity, it has been proposed [] that Vaccinia virus protein O2L is most probably a glutaredoxin. Finally, it must be noted that Bacteriophage T4 thioredoxin seems also to be evolutionary related. In position 5 of the pattern T4 thioredoxin has Val instead of Pro. This entry represents Glutaredoxin.; GO: 0009055 electron carrier activity, 0015035 protein disulfide oxidoreductase activity, 0045454 cell redox homeostasis; PDB: 1QFN_A 1GRX_A 1EGO_A 1EGR_A 3RHC_A 3RHB_A 3IPZ_A 1NHO_A 3GX8_A 3D5J_A ....
Probab=30.98 E-value=63 Score=18.12 Aligned_cols=19 Identities=11% Similarity=0.207 Sum_probs=16.5
Q ss_pred HHHHHHHHHHcCccEEEec
Q psy627 45 KEDAIQYCQKNGWKFFVEE 63 (89)
Q Consensus 45 ~e~Aiayae~~G~~Y~V~~ 63 (89)
-..|.++-+++|++|++..
T Consensus 12 C~~~~~~L~~~~i~y~~~d 30 (60)
T PF00462_consen 12 CKKAKEFLDEKGIPYEEVD 30 (60)
T ss_dssp HHHHHHHHHHTTBEEEEEE
T ss_pred HHHHHHHHHHcCCeeeEcc
Confidence 4678999999999999876
No 143
>TIGR02540 gpx7 putative glutathione peroxidase Gpx7. This model represents one of several families of known and probable glutathione peroxidases. This family is restricted to animals and designated GPX7.
Probab=30.92 E-value=39 Score=22.64 Aligned_cols=21 Identities=14% Similarity=0.271 Sum_probs=18.1
Q ss_pred CCHHHHHHHHHH-cCccEEEec
Q psy627 43 SSKEDAIQYCQK-NGWKFFVEE 63 (89)
Q Consensus 43 ~s~e~Aiayae~-~G~~Y~V~~ 63 (89)
++.|++.+|+++ +|++|.|..
T Consensus 72 d~~~~~~~f~~~~~~~~fp~~~ 93 (153)
T TIGR02540 72 DSSKEIESFARRNYGVTFPMFS 93 (153)
T ss_pred CCHHHHHHHHHHhcCCCCCccc
Confidence 778999999986 899998854
No 144
>PF00588 SpoU_methylase: SpoU rRNA Methylase family; InterPro: IPR001537 The spoU gene of Escherichia coli codes for a protein that shows strong similarities to previously characterised 2'-O-methyltransferases [, ]. The Pet56 protein of Saccharomyces cerevisiae has been shown to be required for ribose methylation at a universally conserved nucleotide in the peptidyl transferase centre of the mitochondrial large ribosomal RNA (21S rRNA). Cells reduced in this activity were deficient in formation of functional large subunits of the mitochondrial ribosome. The Pet56 protein catalyzes the site-specific formation of 2'-O-methylguanosine on in vitro transcripts of both mitochondrial 21S rRNA and E. coli 23S rRNA providing evidence for an essential modified nucleotide in rRNA [].; GO: 0003723 RNA binding, 0008173 RNA methyltransferase activity, 0006396 RNA processing; PDB: 3N4J_A 3N4K_A 1IPA_A 3ONP_A 3NK6_A 3NK7_A 3IC6_A 1GZ0_D 1MXI_A 1J85_A ....
Probab=30.71 E-value=1.1e+02 Score=20.15 Aligned_cols=25 Identities=20% Similarity=0.428 Sum_probs=20.8
Q ss_pred eecCCHHHHHHHHHHcCccEEEecc
Q psy627 40 LNFSSKEDAIQYCQKNGWKFFVEEP 64 (89)
Q Consensus 40 l~F~s~e~Aiayae~~G~~Y~V~~p 64 (89)
..+++.+++++.++..|+.....++
T Consensus 59 ~~~~~~~~~l~~~~~~g~~i~~~~~ 83 (142)
T PF00588_consen 59 RRVDDLEEALKDLKENGYTIVATSP 83 (142)
T ss_dssp EEESSHHHHHHHHHHTTEEEEEEST
T ss_pred eeeehhhhhcccccccccccceeee
Confidence 6789999999999999877655555
No 145
>PRK03353 ribB 3,4-dihydroxy-2-butanone 4-phosphate synthase; Provisional
Probab=30.53 E-value=48 Score=24.85 Aligned_cols=18 Identities=6% Similarity=0.329 Sum_probs=16.0
Q ss_pred CCHHHHHHHHHHcCccEE
Q psy627 43 SSKEDAIQYCQKNGWKFF 60 (89)
Q Consensus 43 ~s~e~Aiayae~~G~~Y~ 60 (89)
...+++++||++||+...
T Consensus 183 ~~~~~~~~fA~~~~l~~v 200 (217)
T PRK03353 183 ARAPECIAFAKQHNMPVL 200 (217)
T ss_pred ccHHHHHHHHHHcCCcEE
Confidence 468999999999999983
No 146
>COG1453 Predicted oxidoreductases of the aldo/keto reductase family [General function prediction only]
Probab=30.47 E-value=43 Score=27.66 Aligned_cols=20 Identities=25% Similarity=0.572 Sum_probs=18.2
Q ss_pred HHHHHHHHcCccEEEeccCC
Q psy627 47 DAIQYCQKNGWKFFVEEPKW 66 (89)
Q Consensus 47 ~Aiayae~~G~~Y~V~~p~~ 66 (89)
+||+||.++|+...|-+|-.
T Consensus 187 ~~l~~A~~~~~gI~IMeP~~ 206 (391)
T COG1453 187 EGLKYAASKGLGIFIMEPLD 206 (391)
T ss_pred HHHHHHHhCCCcEEEEeeCC
Confidence 68999999999999999854
No 147
>PF08057 Ery_res_leader2: Erythromycin resistance leader peptide; InterPro: IPR012559 This family consists of erythromycin resistance gene leader peptides. These leader peptides are involved in the transcriptional attenuation control of the synthesis of the macrolide-lincosamide -streptogramin B resistance protein. It acts as a transcriptional attenuator, in contrast to other inducible erm genes. The mRNA leader sequence can fold in either of two mutually exclusive conformations, one of which is postulated to form in the absence of induction, and to contain two rho factor-independent terminators [].; GO: 0046677 response to antibiotic
Probab=30.43 E-value=23 Score=15.95 Aligned_cols=10 Identities=20% Similarity=0.468 Sum_probs=7.6
Q ss_pred CceeeecCCH
Q psy627 36 SNLALNFSSK 45 (89)
Q Consensus 36 ~qv~l~F~s~ 45 (89)
..++|+|+++
T Consensus 3 hsmrlrfptl 12 (14)
T PF08057_consen 3 HSMRLRFPTL 12 (14)
T ss_pred cceeeecccc
Confidence 3578999885
No 148
>cd03421 SirA_like_N SirA_like_N, a protein of unknown function with an N-terminal SirA-like domain. The SirA, YedF, YeeD protein family is present in bacteria as well as archaea. SirA (also known as UvrY, and YhhP) belongs to a family of a two-component response regulators that controls secondary metabolism and virulence. The other member of this two-component system is a sensor kinase called BarA which phosphorylates SirA. A variety of microorganisms have similar proteins, all of which contain a common CPxP sequence motif in the N-terminal region. YhhP is suggested to be important for normal cell division and growth in rich nutrient medium. Moreover, despite a low primary sequence similarity, the YccP structure closely resembles the non-homologous C-terminal RNA-binding domain of E. coli translation initiation factor IF3. The signature CPxP motif serves to stabilize the N-terminal helix as part of the N-capping box and might be important in mRNA-binding.
Probab=30.40 E-value=58 Score=19.08 Aligned_cols=20 Identities=30% Similarity=0.596 Sum_probs=16.4
Q ss_pred HHHHHHHHHHcCccEEEecc
Q psy627 45 KEDAIQYCQKNGWKFFVEEP 64 (89)
Q Consensus 45 ~e~Aiayae~~G~~Y~V~~p 64 (89)
.++-.+||+++|..+.+.+.
T Consensus 38 ~~~i~~~~~~~G~~~~~~~~ 57 (67)
T cd03421 38 KENVSRFAESRGYEVSVEEK 57 (67)
T ss_pred HHHHHHHHHHcCCEEEEEec
Confidence 46788999999999977653
No 149
>PF09840 DUF2067: Uncharacterized protein conserved in archaea (DUF2067); InterPro: IPR019202 This family of archaeal proteins, have no known function.
Probab=30.40 E-value=47 Score=24.44 Aligned_cols=17 Identities=12% Similarity=0.555 Sum_probs=15.1
Q ss_pred eeeecCCHHHHHHHHHH
Q psy627 38 LALNFSSKEDAIQYCQK 54 (89)
Q Consensus 38 v~l~F~s~e~Aiayae~ 54 (89)
+.++|++.++++.||++
T Consensus 1 i~~~~~~~~E~~~fle~ 17 (190)
T PF09840_consen 1 ISFKFRDDEECEEFLER 17 (190)
T ss_pred CeEEcCChHHHHHHHHH
Confidence 46899999999999986
No 150
>cd07233 Glyoxalase_I Glyoxalase I catalyzes the isomerization of the hemithioacetal, formed by a 2-oxoaldehyde and glutathione, to S-D-lactoylglutathione. Glyoxalase I (also known as lactoylglutathione lyase; EC 4.4.1.5) is part of the glyoxalase system, a two-step system for detoxifying methylglyoxal, a side product of glycolysis. This system is responsible for the conversion of reactive, acyclic alpha-oxoaldehydes into the corresponding alpha-hydroxyacids and involves 2 enzymes, glyoxalase I and II. Glyoxalase I catalyses an intramolecular redox reaction of the hemithioacetal (formed from methylglyoxal and glutathione) to form the thioester, S-D-lactoylglutathione. This reaction involves the transfer of two hydrogen atoms from C1 to C2 of the methylglyoxal, and proceeds via an ene-diol intermediate. Glyoxalase I has a requirement for bound metal ions for catalysis. Eukaryotic glyoxalase I prefers the divalent cation zinc as cofactor, whereas Escherichia coil and other prokaryotic gly
Probab=30.39 E-value=92 Score=18.87 Aligned_cols=28 Identities=14% Similarity=0.101 Sum_probs=23.3
Q ss_pred CCCceeeecCCHHHHHHHHHHcCccEEE
Q psy627 34 PLSNLALNFSSKEDAIQYCQKNGWKFFV 61 (89)
Q Consensus 34 ~~~qv~l~F~s~e~Aiayae~~G~~Y~V 61 (89)
....+.+.-++.+++++.++++|+...-
T Consensus 70 ~~~~i~~~v~did~~~~~l~~~G~~~~~ 97 (121)
T cd07233 70 GFGHLAFAVDDVYAACERLEEMGVEVTK 97 (121)
T ss_pred CeEEEEEEeCCHHHHHHHHHHCCCEEee
Confidence 3446778889999999999999999764
No 151
>cd03032 ArsC_Spx Arsenate Reductase (ArsC) family, Spx subfamily; Spx is a unique RNA polymerase (RNAP)-binding protein present in bacilli and some mollicutes. It inhibits transcription by binding to the C-terminal domain of the alpha subunit of RNAP, disrupting complex formation between RNAP and certain transcriptional activator proteins like ResD and ComA. In response to oxidative stress, Spx can also activate transcription, making it a general regulator that exerts both positive and negative control over transcription initiation. Spx has been shown to exert redox-sensitive transcriptional control over genes like trxA (TRX) and trxB (TRX reductase), genes that function in thiol homeostasis. This redox-sensitive activity is dependent on the presence of a CXXC motif, present in some members of the Spx subfamily, that acts as a thiol/disulfide switch. Spx has also been shown to repress genes in a sulfate-dependent manner independent of the presence of the CXXC motif.
Probab=30.34 E-value=51 Score=21.49 Aligned_cols=22 Identities=23% Similarity=0.138 Sum_probs=18.5
Q ss_pred CCHHHHHHHHHHcCccEEEecc
Q psy627 43 SSKEDAIQYCQKNGWKFFVEEP 64 (89)
Q Consensus 43 ~s~e~Aiayae~~G~~Y~V~~p 64 (89)
++...|+++.+.+|++|++..-
T Consensus 11 ~~c~ka~~~L~~~gi~~~~idi 32 (115)
T cd03032 11 SSCRKAKQWLEEHQIPFEERNL 32 (115)
T ss_pred HHHHHHHHHHHHCCCceEEEec
Confidence 4567899999999999998763
No 152
>COG0656 ARA1 Aldo/keto reductases, related to diketogulonate reductase [General function prediction only]
Probab=30.16 E-value=59 Score=25.40 Aligned_cols=31 Identities=19% Similarity=0.354 Sum_probs=24.1
Q ss_pred Cce--eeecCCHHHHHHHHHHcCccEEEeccCCC
Q psy627 36 SNL--ALNFSSKEDAIQYCQKNGWKFFVEEPKWK 67 (89)
Q Consensus 36 ~qv--~l~F~s~e~Aiayae~~G~~Y~V~~p~~~ 67 (89)
.|| +..+.-.| -++||++|||..+.-.|-.+
T Consensus 162 NQIe~hp~~~q~e-l~~~~~~~gI~v~AysPL~~ 194 (280)
T COG0656 162 NQIEYHPYLRQPE-LLPFCQRHGIAVEAYSPLAK 194 (280)
T ss_pred EEEEeccCCCcHH-HHHHHHHcCCEEEEECCccc
Confidence 455 45555555 99999999999999888764
No 153
>KOG4213|consensus
Probab=30.12 E-value=34 Score=25.89 Aligned_cols=24 Identities=21% Similarity=0.174 Sum_probs=21.8
Q ss_pred ceeeecCCHHHHHHHHHHcCccEE
Q psy627 37 NLALNFSSKEDAIQYCQKNGWKFF 60 (89)
Q Consensus 37 qv~l~F~s~e~Aiayae~~G~~Y~ 60 (89)
.|.+.|.+++.|-+|++.++..|-
T Consensus 152 svkv~f~tk~qa~a~~~~~e~~~~ 175 (205)
T KOG4213|consen 152 SVKVTFQTKEQAFANDDTHEEKGA 175 (205)
T ss_pred ceEEEeecHHHHHhhhhhhhhhcc
Confidence 589999999999999999988874
No 154
>PTZ00244 serine/threonine-protein phosphatase PP1; Provisional
Probab=30.10 E-value=1.2e+02 Score=23.49 Aligned_cols=41 Identities=12% Similarity=0.292 Sum_probs=28.7
Q ss_pred CccCCC---CCCCCCCCCCCceeeecCCHHHHHHHHHHcCccEEEecc
Q psy627 20 RWENPL---MGWCSTGDPLSNLALNFSSKEDAIQYCQKNGWKFFVEEP 64 (89)
Q Consensus 20 ~~~dPL---MGWtss~D~~~qv~l~F~s~e~Aiayae~~G~~Y~V~~p 64 (89)
-|-||. -||..| .......| ..++..+||++||+.+.|..-
T Consensus 200 lWsDP~~~~~~~~~~---~Rg~g~~f-g~~~~~~Fl~~n~l~~iiR~H 243 (294)
T PTZ00244 200 LWADPEDEVRGFLES---DRGVSYLF-GEDIVNDFLDMVDMDLIVRAH 243 (294)
T ss_pred eecCcccccCCCCcC---CCCCcccc-CHHHHHHHHHHcCCcEEEEcC
Confidence 577774 355432 13455666 677899999999999999653
No 155
>PF11387 DUF2795: Protein of unknown function (DUF2795); InterPro: IPR021527 This family of proteins has no known function.
Probab=30.00 E-value=58 Score=18.40 Aligned_cols=21 Identities=19% Similarity=0.265 Sum_probs=17.6
Q ss_pred CCHHHHHHHHHHcCccEEEec
Q psy627 43 SSKEDAIQYCQKNGWKFFVEE 63 (89)
Q Consensus 43 ~s~e~Aiayae~~G~~Y~V~~ 63 (89)
.|+++=|++|+++|.+=.|.+
T Consensus 8 a~k~~Lv~~A~~~gA~~~vl~ 28 (44)
T PF11387_consen 8 ADKDELVRHARRNGAPDDVLD 28 (44)
T ss_pred CCHHHHHHHHHHcCCCHHHHH
Confidence 479999999999998876654
No 156
>PHA02087 hypothetical protein
Probab=29.95 E-value=52 Score=21.35 Aligned_cols=21 Identities=14% Similarity=0.306 Sum_probs=17.6
Q ss_pred ceeeecCCHHHHHHHHHHcCc
Q psy627 37 NLALNFSSKEDAIQYCQKNGW 57 (89)
Q Consensus 37 qv~l~F~s~e~Aiayae~~G~ 57 (89)
-+|++|.|.|---+||+-.-+
T Consensus 3 i~rvkfks~ehrk~fa~s~~y 23 (83)
T PHA02087 3 IVRVKFKSPEHRKAFAESEWY 23 (83)
T ss_pred eEEEEcCCHHHHHHHHhhhHH
Confidence 479999999999999986543
No 157
>PRK11478 putative lyase; Provisional
Probab=29.86 E-value=76 Score=19.69 Aligned_cols=25 Identities=16% Similarity=0.170 Sum_probs=21.9
Q ss_pred CceeeecCCHHHHHHHHHHcCccEE
Q psy627 36 SNLALNFSSKEDAIQYCQKNGWKFF 60 (89)
Q Consensus 36 ~qv~l~F~s~e~Aiayae~~G~~Y~ 60 (89)
..+.+..++.+++++.++++|+++.
T Consensus 77 ~hi~f~v~d~~~~~~~l~~~G~~~~ 101 (129)
T PRK11478 77 RHLAFSVDDIDAAVAHLESHNVKCE 101 (129)
T ss_pred eEEEEEeCCHHHHHHHHHHcCCeee
Confidence 4677888999999999999999864
No 158
>TIGR00552 nadE NAD+ synthetase. NAD+ synthetase is a nearly ubiquitous enzyme for the final step in the biosynthesis of the essensial cofactor NAD. The member of this family from Bacillus subtilis is a strictly NH(3)-dependent NAD(+) synthetase of 272 amino acids. Proteins consisting only of the domain modeled here may be named as NH3-dependent NAD+ synthetase. Amidotransferase activity may reside in a separate protein, or not be present. Some other members of the family, such as from Mycobacterium tuberculosis, are considerably longer, contain an apparent amidotransferase domain, and show glutamine-dependent as well as NH(3)-dependent activity.
Probab=29.42 E-value=44 Score=24.71 Aligned_cols=21 Identities=14% Similarity=0.227 Sum_probs=18.4
Q ss_pred CCHHHHHHHHHHcCccEEEec
Q psy627 43 SSKEDAIQYCQKNGWKFFVEE 63 (89)
Q Consensus 43 ~s~e~Aiayae~~G~~Y~V~~ 63 (89)
.+.++|.++|++.|+++.+..
T Consensus 64 ~e~~~a~~~a~~lgi~~~~i~ 84 (250)
T TIGR00552 64 QDVQDALALAEPLGINYKNID 84 (250)
T ss_pred HHHHHHHHHHHHhCCeEEEEc
Confidence 357899999999999999875
No 159
>smart00034 CLECT C-type lectin (CTL) or carbohydrate-recognition domain (CRD). Many of these domains function as calcium-dependent carbohydrate binding modules.
Probab=29.40 E-value=44 Score=20.25 Aligned_cols=19 Identities=26% Similarity=0.384 Sum_probs=14.9
Q ss_pred CCHHHHHHHHHHcCccEEE
Q psy627 43 SSKEDAIQYCQKNGWKFFV 61 (89)
Q Consensus 43 ~s~e~Aiayae~~G~~Y~V 61 (89)
.+-++|.++|+.+|=....
T Consensus 20 ~~~~~A~~~C~~~~~~La~ 38 (126)
T smart00034 20 KTWADAQAFCQSLGAHLAS 38 (126)
T ss_pred cCHHHHHHHHHhcCCEEcc
Confidence 6789999999999854433
No 160
>TIGR03748 conj_PilL conjugative transfer region protein, TIGR03748 family. This model describes the conserved N-terminal region of a variable length protein family associated with laterally transfered regions flanked by markers of conjugative plasmid integration and/or transposition. Most members of the family have the lipoprotein signal peptide motif. A member of the family from a pathogenicity island in Salmonella enterica serovar Dublin strain was designated PilL for nomenclature consistency with a neighboring gene for the pilin structural protein PilS. However, the species distribution of this protein family tracks much better with markers of conjugal transfer than with markers of PilS-like pilin structure.
Probab=29.32 E-value=40 Score=22.97 Aligned_cols=23 Identities=22% Similarity=0.465 Sum_probs=18.6
Q ss_pred CCCCCc-eeeecC-----CHHHHHHHHHH
Q psy627 32 GDPLSN-LALNFS-----SKEDAIQYCQK 54 (89)
Q Consensus 32 ~D~~~q-v~l~F~-----s~e~Aiayae~ 54 (89)
-||+.| +.+.|+ |-.||+.|.-+
T Consensus 18 ~dpL~qvv~v~~P~~~~~TVGdAl~~~L~ 46 (105)
T TIGR03748 18 RDLLNQVIDITIPPSLVATVGDALRYLLR 46 (105)
T ss_pred hChHhheeEEECCccccCCHHHHHHHHHH
Confidence 466777 589999 99999999854
No 161
>cd03593 CLECT_NK_receptors_like C-type lectin-like domain (CTLD) of the type found in natural killer cell receptors (NKRs). CLECT_NK_receptors_like: C-type lectin-like domain (CTLD) of the type found in natural killer cell receptors (NKRs), including proteins similar to oxidized low density lipoprotein (OxLDL) receptor (LOX-1), CD94, CD69, NKG2-A and -D, osteoclast inhibitory lectin (OCIL), dendritic cell-associated C-type lectin-1 (dectin-1), human myeloid inhibitory C-type lectin-like receptor (MICL), mast cell-associated functional antigen (MAFA), killer cell lectin-like receptors: subfamily F, member 1 (KLRF1) and subfamily B, member 1 (KLRB1), and lys49 receptors. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. NKRs are variously associated with activation or inhibition of natural killer (NK) cells. Activating NKRs stimulate cytolysis by NK cells of virally infected or transformed cells; inhibitory NKRs block cytolysis up
Probab=29.15 E-value=41 Score=20.86 Aligned_cols=21 Identities=5% Similarity=0.197 Sum_probs=16.3
Q ss_pred CCHHHHHHHHHHcCccEEEec
Q psy627 43 SSKEDAIQYCQKNGWKFFVEE 63 (89)
Q Consensus 43 ~s~e~Aiayae~~G~~Y~V~~ 63 (89)
.|=++|..+|+.+|=...++.
T Consensus 20 ~~w~~A~~~C~~~g~~La~i~ 40 (116)
T cd03593 20 KTWNESKEACSSKNSSLLKID 40 (116)
T ss_pred CCHHHHHHHHHhCCCcEEEEC
Confidence 567899999999997765544
No 162
>cd03414 CbiX_SirB_C Sirohydrochlorin cobalt chelatase (CbiX) and sirohydrochlorin iron chelatase (SirB), C-terminal domain. SirB catalyzes the ferro-chelation of sirohydrochlorin to siroheme, the prosthetic group of sulfite and nitrite reductases. CbiX is a cobaltochelatase, responsible for the chelation of Co2+ into sirohydrochlorin, an important step in the vitamin B12 biosynthetic pathway. CbiX often contains a C-terminal histidine-rich region that may be important for metal delivery and/or storage, and may also contain an iron-sulfur center. Both CbiX and SirB are found in a wide range of bacteria.
Probab=29.00 E-value=77 Score=20.22 Aligned_cols=26 Identities=19% Similarity=0.096 Sum_probs=21.5
Q ss_pred eecCCHHHHHHHHHHcCccEEEeccC
Q psy627 40 LNFSSKEDAIQYCQKNGWKFFVEEPK 65 (89)
Q Consensus 40 l~F~s~e~Aiayae~~G~~Y~V~~p~ 65 (89)
+..++.++||+-|.++|..-.|..|-
T Consensus 42 ~~~P~~~~~l~~l~~~g~~~i~vvP~ 67 (117)
T cd03414 42 ATRPSLPEALERLRALGARRVVVLPY 67 (117)
T ss_pred CCCCCHHHHHHHHHHcCCCEEEEEec
Confidence 33578899999999999998877773
No 163
>COG1979 Uncharacterized oxidoreductases, Fe-dependent alcohol dehydrogenase family [Energy production and conversion]
Probab=28.97 E-value=34 Score=28.13 Aligned_cols=22 Identities=14% Similarity=0.484 Sum_probs=19.3
Q ss_pred ecCCHHHHHHHHHHcCccEEEe
Q psy627 41 NFSSKEDAIQYCQKNGWKFFVE 62 (89)
Q Consensus 41 ~F~s~e~Aiayae~~G~~Y~V~ 62 (89)
++.|...||..|+++||+|.+-
T Consensus 70 ~~~Tv~kaV~i~kee~idflLA 91 (384)
T COG1979 70 RLETLMKAVEICKEENIDFLLA 91 (384)
T ss_pred hHHHHHHHHHHHHHcCceEEEE
Confidence 4678999999999999999764
No 164
>cd00861 ProRS_anticodon_short ProRS Prolyl-anticodon binding domain, short version found predominantly in bacteria. ProRS belongs to class II aminoacyl-tRNA synthetases (aaRS). This alignment contains the anticodon binding domain, which is responsible for specificity in tRNA-binding, so that the activated amino acid is transferred to a ribose 3' OH group of the appropriate tRNA only.
Probab=28.91 E-value=48 Score=19.97 Aligned_cols=22 Identities=5% Similarity=-0.117 Sum_probs=16.3
Q ss_pred cCCHHHHHHHHHHcCccEEEec
Q psy627 42 FSSKEDAIQYCQKNGWKFFVEE 63 (89)
Q Consensus 42 F~s~e~Aiayae~~G~~Y~V~~ 63 (89)
-.+...-+++|++.|++|.|.-
T Consensus 40 ~~~l~k~i~~a~~~g~~~~iii 61 (94)
T cd00861 40 NERPGVKFADADLIGIPYRIVV 61 (94)
T ss_pred CCCcccchhHHHhcCCCEEEEE
Confidence 3566777888888888887755
No 165
>PRK00522 tpx lipid hydroperoxide peroxidase; Provisional
Probab=28.78 E-value=59 Score=22.39 Aligned_cols=27 Identities=11% Similarity=0.245 Sum_probs=21.8
Q ss_pred eeeecCCHHHHHHHHHHcCcc-EEEecc
Q psy627 38 LALNFSSKEDAIQYCQKNGWK-FFVEEP 64 (89)
Q Consensus 38 v~l~F~s~e~Aiayae~~G~~-Y~V~~p 64 (89)
+.++-++.+++-+|++++|+. +.+...
T Consensus 80 v~vs~D~~~~~~~f~~~~~~~~~~~lsD 107 (167)
T PRK00522 80 LCISADLPFAQKRFCGAEGLENVITLSD 107 (167)
T ss_pred EEEeCCCHHHHHHHHHhCCCCCceEeec
Confidence 568889999999999999998 555543
No 166
>cd00738 HGTP_anticodon HGTP anticodon binding domain, as found at the C-terminus of histidyl, glycyl, threonyl and prolyl tRNA synthetases, which are classified as a group of class II aminoacyl-tRNA synthetases (aaRS). In aaRSs, the anticodon binding domain is responsible for specificity in tRNA-binding, so that the activated amino acid is transferred to a ribose 3' OH group of the appropriate tRNA only. This domain is also found in the accessory subunit of mitochondrial polymerase gamma (Pol gamma b).
Probab=28.46 E-value=81 Score=18.71 Aligned_cols=22 Identities=14% Similarity=0.042 Sum_probs=18.5
Q ss_pred CCHHHHHHHHHHcCccEEEecc
Q psy627 43 SSKEDAIQYCQKNGWKFFVEEP 64 (89)
Q Consensus 43 ~s~e~Aiayae~~G~~Y~V~~p 64 (89)
.+....+++|++.|++|.|.-.
T Consensus 41 ~~~~k~~~~a~~~g~~~~iiig 62 (94)
T cd00738 41 RKIGKKFREADLRGVPFAVVVG 62 (94)
T ss_pred cCHhHHHHHHHhCCCCEEEEEC
Confidence 4688899999999999977653
No 167
>KOG0638|consensus
Probab=28.33 E-value=44 Score=27.43 Aligned_cols=38 Identities=24% Similarity=0.349 Sum_probs=31.9
Q ss_pred CCCCC--CCCCCCCCCceeeecCCHHHHHHHHHHcCccEE
Q psy627 23 NPLMG--WCSTGDPLSNLALNFSSKEDAIQYCQKNGWKFF 60 (89)
Q Consensus 23 dPLMG--Wtss~D~~~qv~l~F~s~e~Aiayae~~G~~Y~ 60 (89)
||++| =..-+|+...|....++.++|+.-+..+|....
T Consensus 78 ~~~~G~~l~~HgdgvkdvafeVeD~da~~~~~va~Ga~v~ 117 (381)
T KOG0638|consen 78 NSEYGDHLVKHGDGVKDVAFEVEDADAIFQEAVANGAKVV 117 (381)
T ss_pred chhhhhhhhhcccchhceEEEecchHHHHHHHHHcCCccc
Confidence 66777 334588888999999999999999999999853
No 168
>cd00488 PCD_DCoH PCD_DCoH: The bifunctional protein pterin-4alpha-carbinolamine dehydratase (PCD), also known as DCoH (dimerization cofactor of hepatocyte nuclear factor-1), is both a transcription activator and a metabolic enzyme. DCoH stimulates gene expression by associating with specific DNA binding proteins such as HNF-1alpha (hepatocyte nuclear factor-1) and Xenopus enhancer of rudimentary homologue (XERH). DCoH also catalyzes the dehydration of 4alpha- hydroxy- tetrahydrobiopterin (4alpha-OH-BH4) to quinoiddihydrobiopterin, a percursor of the phenylalanine hydroxylase cofactor BH4 (tetrahydrobiopterin). The DCoH homodimer has a saddle-shaped structure similar to that of TBP (TATA binding protein). Two DCoH proteins have been identifed in humans: DCoH1 and DCoH2. Mutations in human DCoH1 cause hyperphenylalaninemia. Loss of enzymic activity of DCoH in humans is associated with the depigmentation disorder vitiligo. DCoH1 has been reported to be overexpessed in colon cancer carc
Probab=28.19 E-value=94 Score=18.93 Aligned_cols=29 Identities=17% Similarity=0.407 Sum_probs=19.9
Q ss_pred CCCCCCCCCCceeeecCCHHHHHHHHHHc
Q psy627 27 GWCSTGDPLSNLALNFSSKEDAIQYCQKN 55 (89)
Q Consensus 27 GWtss~D~~~qv~l~F~s~e~Aiayae~~ 55 (89)
||.=..+..-.-+..|++-.+|++|+.+-
T Consensus 1 gW~~~~~~~l~r~f~f~~f~~a~~f~~~v 29 (75)
T cd00488 1 GWELADGDALERTFKFKDFKEAIAFVNRV 29 (75)
T ss_pred CCcCCCCCcEEEEEEcCCHHHHHHHHHHH
Confidence 56554412234578899999999997763
No 169
>PF13263 PHP_C: PHP-associated; PDB: 2Z4G_B 2YXO_B 2YZ5_A 3DCP_B.
Probab=28.08 E-value=64 Score=18.66 Aligned_cols=18 Identities=22% Similarity=0.254 Sum_probs=12.5
Q ss_pred HHHHHHHHHcCccEEEec
Q psy627 46 EDAIQYCQKNGWKFFVEE 63 (89)
Q Consensus 46 e~Aiayae~~G~~Y~V~~ 63 (89)
+.|..||+++|++...-.
T Consensus 5 ~~A~~~A~~~~lp~~~gS 22 (56)
T PF13263_consen 5 RRAAELAEKYGLPFTGGS 22 (56)
T ss_dssp -HHHHHHHHTT--EEEE-
T ss_pred HHHHHHHHHcCCCeEeEE
Confidence 579999999999988755
No 170
>COG4668 MtlA Mannitol/fructose-specific phosphotransferase system, IIA domain [Carbohydrate transport and metabolism]
Probab=28.07 E-value=37 Score=24.35 Aligned_cols=17 Identities=29% Similarity=0.511 Sum_probs=14.3
Q ss_pred eeeecCCHHHHHHHHHH
Q psy627 38 LALNFSSKEDAIQYCQK 54 (89)
Q Consensus 38 v~l~F~s~e~Aiayae~ 54 (89)
+..++.+||+||.+|=+
T Consensus 10 l~~~~~nK~eAi~~~g~ 26 (142)
T COG4668 10 LNQKAANKEEAIEFAGE 26 (142)
T ss_pred hhhcccCHHHHHHHHHH
Confidence 46789999999999854
No 171
>cd04882 ACT_Bt0572_2 C-terminal ACT domain of a novel protein composed of just two ACT domains. Included in this CD is the C-terminal ACT domain of a novel protein composed of just two ACT domains, as seen in the yet uncharacterized structure (pdb 2F06) of the Bt0572 protein from Bacteroides thetaiotaomicron and related proteins. Members of this CD belong to the superfamily of ACT regulatory domains.
Probab=28.06 E-value=97 Score=17.09 Aligned_cols=24 Identities=25% Similarity=0.293 Sum_probs=20.3
Q ss_pred CceeeecCCHHHHHHHHHHcCccE
Q psy627 36 SNLALNFSSKEDAIQYCQKNGWKF 59 (89)
Q Consensus 36 ~qv~l~F~s~e~Aiayae~~G~~Y 59 (89)
..+.+..++.+++.+-.+++||.+
T Consensus 41 ~~v~~~ve~~~~~~~~L~~~G~~v 64 (65)
T cd04882 41 ALLIFRTEDIEKAIEVLQERGVEL 64 (65)
T ss_pred EEEEEEeCCHHHHHHHHHHCCceE
Confidence 457788889999999999999853
No 172
>TIGR01465 cobM_cbiF precorrin-4 C11-methyltransferase. This model represents precorrin-4 C11-methyltransferase, one of two methyltransferases commonly referred to as precorrin-3 methylase (the other is precorrin-3B C17-methyltransferase, EC 2.1.1.131). This enzyme participates in the pathway toward the biosynthesis of cobalamin and related products.
Probab=28.06 E-value=89 Score=22.25 Aligned_cols=30 Identities=23% Similarity=0.336 Sum_probs=22.2
Q ss_pred CCCCCCCceeeecCCHHHHHHHHHHcCccEEEecc
Q psy627 30 STGDPLSNLALNFSSKEDAIQYCQKNGWKFFVEEP 64 (89)
Q Consensus 30 ss~D~~~qv~l~F~s~e~Aiayae~~G~~Y~V~~p 64 (89)
.+|||. -|.+--.-+..++++|++++|..-
T Consensus 78 ~~GDP~-----~~~~~~~l~~~~~~~g~~veviPG 107 (229)
T TIGR01465 78 HTGDPS-----IYGAIAEQMQLLEALGIPYEVVPG 107 (229)
T ss_pred eCcCcc-----ccccHHHHHHHHHHCCCCEEEECC
Confidence 457773 467777777888889999988753
No 173
>PTZ00480 serine/threonine-protein phosphatase; Provisional
Probab=28.03 E-value=1.2e+02 Score=24.07 Aligned_cols=40 Identities=20% Similarity=0.488 Sum_probs=27.1
Q ss_pred CccCCC---CCCCCCCCCCCceeeecCCHHHHHHHHHHcCccEEEec
Q psy627 20 RWENPL---MGWCSTGDPLSNLALNFSSKEDAIQYCQKNGWKFFVEE 63 (89)
Q Consensus 20 ~~~dPL---MGWtss~D~~~qv~l~F~s~e~Aiayae~~G~~Y~V~~ 63 (89)
-|-||. .||.-|. ..+...| ..++..+||+++|+++.|..
T Consensus 207 lWSDP~~~~~~~~~s~---RG~g~~F-G~~~~~~Fl~~n~l~~IiR~ 249 (320)
T PTZ00480 207 LWSDPDKDVQGWADNE---RGVSYVF-SQEIVQVFLKKHELDLICRA 249 (320)
T ss_pred eecCcccccCCCccCC---CCCcccc-CHHHHHHHHHhCCCcEEEEc
Confidence 466774 3554331 2234444 57889999999999998865
No 174
>cd04448 DEP_PIKfyve DEP (Dishevelled, Egl-10, and Pleckstrin) domain found in fungal RhoGEF (GDP/GTP exchange factor) PIKfyve-like proteins. PIKfyve contains N-terminal Fyve finger and DEP domains, a central chaperonin-like domain and a C-terminal PIPK (phosphatidylinositol phosphate kinase) domain. PIKfyve-like proteins are important phosphatidylinositol (3)-monophosphate (PtdIns(3)P)-5-kinases, producing PtdIns(3,5)P2, which plays a major role in multivesicular body (MVB) sorting and control of retrograde traffic from the vacuole back to the endosome and/or Golgi. PIKfyve itself has been shown to be play a role in regulating early-endosome-to-trans-Golgi network (TGN) retrograde trafficking.
Probab=28.02 E-value=51 Score=20.86 Aligned_cols=24 Identities=25% Similarity=0.397 Sum_probs=18.5
Q ss_pred cCCHHHHHHHHH---HcCccEEEeccC
Q psy627 42 FSSKEDAIQYCQ---KNGWKFFVEEPK 65 (89)
Q Consensus 42 F~s~e~Aiayae---~~G~~Y~V~~p~ 65 (89)
..|+++||++|. .+|+=..|...+
T Consensus 44 ~~~R~eAv~~gq~Ll~~g~i~hV~~~~ 70 (81)
T cd04448 44 AATRVQAIAIGQALLDAGWIECVSDDD 70 (81)
T ss_pred CCCHHHHHHHHHHHHHCCCEEecCCCC
Confidence 599999999998 567776665543
No 175
>PF11216 DUF3012: Protein of unknown function (DUF3012); InterPro: IPR021379 This family of proteins with unknown function is restricted to Gammaproteobacteria.
Probab=27.86 E-value=44 Score=18.34 Aligned_cols=10 Identities=10% Similarity=0.478 Sum_probs=8.1
Q ss_pred CHHHHHHHHH
Q psy627 44 SKEDAIQYCQ 53 (89)
Q Consensus 44 s~e~Aiayae 53 (89)
|.++|.+||+
T Consensus 19 tanea~~fAK 28 (32)
T PF11216_consen 19 TANEAADFAK 28 (32)
T ss_pred cHhHHHHHHH
Confidence 5678999987
No 176
>PF00059 Lectin_C: Lectin C-type domain; InterPro: IPR001304 Lectins occur in plants, animals, bacteria and viruses. Initially described for their carbohydrate-binding activity [], they are now recognised as a more diverse group of proteins, some of which are involved in protein-protein, protein-lipid or protein-nucleic acid interactions []. There are at least twelve structural families of lectins: C-type lectins, which are Ca+-dependent. S-type (galectins), a widespread family of glycan-binding proteins []. I-type, which have an immunoglobulin-like fold and can recognise sialic acids, other sugars and glycosaminoglycans []. P-type, which bind phosphomannosyl receptors []. Pentraxins []. (Trout) egg lectins. Calreticulin and calnexin, which act as molecular chaperones of the endoplasmic reticulum []. ERGIC-53 and VIP-36 []. Discoidins []. Eel aggutinins (fucolectins) []. Annexin lectins []. Fibrinogen-type lectins, which includes ficolins, tachylectins 5A and 5B, and Limax flavus (Spotted garden slug) agglutinin (these proteins have clear distinctions from one another, but they share a homologous fibrinogen-like domain used for carbohydrate binding). Also unclassified orphan lectins, including amphoterin, Cel-II, complement factor H, thrombospondin, sailic acid-binding lectins, adherence lectin, and cytokins (such as tumour necrosis factor and several interleukins). C-type lectins can be further divided into seven subgroups based on additional non-lectin domains and gene structure: (I) hyalectans, (II) asialoglycoprotein receptors, (III) collectins, (IV) selectins, (V) NK group transmembrane receptors, (VI) macrophage mannose receptors, and (VII) simple (single domain) lectins []. Therefore, lectins are a diverse group of proteins, both in terms of structure and activity. Carbohydrate binding ability may have evolved independently and sporadically in numerous unrelated families, where each evolved a structure that was conserved to fulfil some other activity and function. In general, animal lectins act as recognition molecules within the immune system, their functions involving defence against pathogens, cell trafficking, immune regulation and the prevention of autoimmunity [].; GO: 0005488 binding; PDB: 1T8D_A 2H2T_B 1T8C_A 2H2R_A 1TN3_A 1RJH_A 1HTN_A 3G8K_B 2E3X_B 1UMR_D ....
Probab=27.83 E-value=43 Score=19.71 Aligned_cols=20 Identities=35% Similarity=0.466 Sum_probs=13.8
Q ss_pred CHHHHHHHHHHcCccEEEec
Q psy627 44 SKEDAIQYCQKNGWKFFVEE 63 (89)
Q Consensus 44 s~e~Aiayae~~G~~Y~V~~ 63 (89)
|-++|..+|+++|=...+..
T Consensus 4 ~~~~A~~~C~~~~~~L~~i~ 23 (105)
T PF00059_consen 4 TWEEAQQYCQSMGAHLASIN 23 (105)
T ss_dssp EHHHHHHHHHHTTSEEB-GS
T ss_pred CHHHHHHHHhcCCCEEeEeC
Confidence 35789999999876654433
No 177
>cd04442 DEP_1_DEP6 DEP (Dishevelled, Egl-10, and Pleckstrin) domain 1 found in DEP6-like proteins. DEP6 proteins contain two DEP and a PDZ domain. Their function is unknown.
Probab=27.77 E-value=51 Score=21.15 Aligned_cols=21 Identities=29% Similarity=0.305 Sum_probs=16.1
Q ss_pred CCHHHHHHHHH---HcCccEEEec
Q psy627 43 SSKEDAIQYCQ---KNGWKFFVEE 63 (89)
Q Consensus 43 ~s~e~Aiayae---~~G~~Y~V~~ 63 (89)
.|+++||.+|. .+|+=-.|..
T Consensus 45 ~sR~eAv~lgq~Ll~~gvi~HV~~ 68 (82)
T cd04442 45 SDRETAIKIMQKLLDHSIIHHVCD 68 (82)
T ss_pred CCHHHHHHHHHHHHHCCCEEeccC
Confidence 79999999998 4676655543
No 178
>PRK10696 tRNA 2-thiocytidine biosynthesis protein TtcA; Provisional
Probab=27.56 E-value=66 Score=23.87 Aligned_cols=21 Identities=19% Similarity=0.328 Sum_probs=17.0
Q ss_pred CHHHHHHHHHHcCccEEEecc
Q psy627 44 SKEDAIQYCQKNGWKFFVEEP 64 (89)
Q Consensus 44 s~e~Aiayae~~G~~Y~V~~p 64 (89)
+.+.+.++|++.||++.|...
T Consensus 75 ~~~~~~~~~~~lgI~~~v~~~ 95 (258)
T PRK10696 75 PEHVLPEYLESLGVPYHIEEQ 95 (258)
T ss_pred CHHHHHHHHHHhCCCEEEEEe
Confidence 345678999999999998763
No 179
>COG0151 PurD Phosphoribosylamine-glycine ligase [Nucleotide transport and metabolism]
Probab=27.49 E-value=72 Score=26.67 Aligned_cols=34 Identities=15% Similarity=0.329 Sum_probs=28.4
Q ss_pred CCCCCceeeecCCHHHHHHHHHHcCccEEEeccCC
Q psy627 32 GDPLSNLALNFSSKEDAIQYCQKNGWKFFVEEPKW 66 (89)
Q Consensus 32 ~D~~~qv~l~F~s~e~Aiayae~~G~~Y~V~~p~~ 66 (89)
.+.+.++.+. .+.|+-++||++++++..|.-|..
T Consensus 40 ~~~~~~~~~~-~~~~~lv~fA~~~~idl~vVGPE~ 73 (428)
T COG0151 40 EAYLVNIEID-TDHEALVAFAKEKNVDLVVVGPEA 73 (428)
T ss_pred hhhhccCccc-cCHHHHHHHHHHcCCCEEEECCcH
Confidence 4556666666 789999999999999999998865
No 180
>COG5171 YRB1 Ran GTPase-activating protein (Ran-binding protein) [Intracellular trafficking and secretion]
Probab=27.49 E-value=43 Score=25.31 Aligned_cols=27 Identities=33% Similarity=0.688 Sum_probs=21.4
Q ss_pred CCCCCCCC-----Cc-eeeecCCHHHHHHHHHH
Q psy627 28 WCSTGDPL-----SN-LALNFSSKEDAIQYCQK 54 (89)
Q Consensus 28 Wtss~D~~-----~q-v~l~F~s~e~Aiayae~ 54 (89)
|++++|+. .| ..++|.|.|.|-.|-|.
T Consensus 164 w~~taD~~Egea~a~tFairF~s~Ena~~FkEe 196 (211)
T COG5171 164 WMSTADTVEGEAKAQTFAIRFYSEENAKRFKEE 196 (211)
T ss_pred EEeecccccCcceeeEEEEeeccHHHHHHHHHH
Confidence 78887764 34 48999999999998765
No 181
>cd07416 MPP_PP2B PP2B, metallophosphatase domain. PP2B (calcineurin) is a unique serine/threonine protein phosphatase in its regulation by a second messenger (calcium and calmodulin). PP2B is involved in many biological processes including immune responses, the second messenger cAMP pathway, sodium/potassium ion transport in the nephron, cell cycle progression in lower eukaryotes, cardiac hypertrophy, and memory formation. PP2B is highly conserved from yeast to humans, but is absent from plants. PP2B is a heterodimer consisting of a catalytic subunit (CnA) and a regulatory subunit (CnB); CnB contains four Ca2+ binding motifs referred to as EF hands. The PPP (phosphoprotein phosphatase) family, to which PP2B belongs, is one of two known protein phosphatase families specific for serine and threonine. The PPP family also includes: PP1, PP2A, PP4, PP5, PP6, PP7, Bsu1, RdgC, PrpE, PrpA/PrpB, and ApA4 hydrolase. The PPP catalytic domain is defined by three conserved motifs (-GDXHG-, -G
Probab=27.45 E-value=1.1e+02 Score=23.82 Aligned_cols=26 Identities=15% Similarity=0.181 Sum_probs=20.3
Q ss_pred eeeecCCHHHHHHHHHHcCccEEEecc
Q psy627 38 LALNFSSKEDAIQYCQKNGWKFFVEEP 64 (89)
Q Consensus 38 v~l~F~s~e~Aiayae~~G~~Y~V~~p 64 (89)
+...|. .+++.+||++||+.+.|..-
T Consensus 216 ~g~~fG-~~~~~~Fl~~n~l~~iiR~H 241 (305)
T cd07416 216 CSYFYS-YRAVCEFLQKNNLLSIIRAH 241 (305)
T ss_pred CceecC-HHHHHHHHHHcCCeEEEEec
Confidence 455665 47888999999999998653
No 182
>PRK12465 xylose isomerase; Provisional
Probab=27.43 E-value=74 Score=26.78 Aligned_cols=30 Identities=30% Similarity=0.255 Sum_probs=21.5
Q ss_pred HHHHHHHHHHcCccEEE-eccCCCCCCcccc
Q psy627 45 KEDAIQYCQKNGWKFFV-EEPKWKTPKVKSY 74 (89)
Q Consensus 45 ~e~Aiayae~~G~~Y~V-~~p~~~~~~~ksY 74 (89)
.+-|++|+.+.|++..+ +||++.-++.-+|
T Consensus 221 l~ma~dY~~~iGf~g~f~IEPKP~EPr~hqy 251 (445)
T PRK12465 221 LTLARDYGRSIGFKGNFLIEPKPMEPMKHQY 251 (445)
T ss_pred HHHHHHHhHhcCCCceEEeccCCCCCCcccc
Confidence 45677899999996654 5777766666665
No 183
>PRK11235 bifunctional antitoxin/transcriptional repressor RelB; Provisional
Probab=27.32 E-value=37 Score=21.89 Aligned_cols=22 Identities=23% Similarity=0.160 Sum_probs=17.4
Q ss_pred ceeeecCCHHHHHHHHHHcCcc
Q psy627 37 NLALNFSSKEDAIQYCQKNGWK 58 (89)
Q Consensus 37 qv~l~F~s~e~Aiayae~~G~~ 58 (89)
++++.=+.|++|.+-|++.|+.
T Consensus 5 ~vRiD~~lK~~A~~vl~~lGls 26 (80)
T PRK11235 5 NVRVDDELKARAYAVLEKLGVT 26 (80)
T ss_pred EEEeCHHHHHHHHHHHHHhCCC
Confidence 5667777788999988888875
No 184
>PRK11370 YciI-like protein; Reviewed
Probab=27.16 E-value=66 Score=20.57 Aligned_cols=18 Identities=22% Similarity=0.320 Sum_probs=15.1
Q ss_pred ceeeecCCHHHHHHHHHH
Q psy627 37 NLALNFSSKEDAIQYCQK 54 (89)
Q Consensus 37 qv~l~F~s~e~Aiayae~ 54 (89)
=+-+.|+|.|+|.++.+.
T Consensus 60 ~ii~ea~s~~~a~~~~~~ 77 (99)
T PRK11370 60 TVIAEFESLEAAQAWADA 77 (99)
T ss_pred EEEEEECCHHHHHHHHHC
Confidence 456799999999999764
No 185
>cd03036 ArsC_like Arsenate Reductase (ArsC) family, unknown subfamily; uncharacterized proteins containing a CXXC motif with similarity to thioredoxin (TRX)-fold arsenic reductases, ArsC. Proteins containing a redox active CXXC motif like TRX and glutaredoxin (GRX) function as protein disulfide oxidoreductases, altering the redox state of target proteins via the reversible oxidation of the active site dithiol. ArsC catalyzes the reduction of arsenate [As(V)] to arsenite [As(III)], using reducing equivalents derived from glutathione via GRX, through a single catalytic cysteine.
Probab=27.02 E-value=57 Score=21.23 Aligned_cols=21 Identities=14% Similarity=0.271 Sum_probs=17.8
Q ss_pred CCHHHHHHHHHHcCccEEEec
Q psy627 43 SSKEDAIQYCQKNGWKFFVEE 63 (89)
Q Consensus 43 ~s~e~Aiayae~~G~~Y~V~~ 63 (89)
++...|.++-+.+|++|++..
T Consensus 10 ~~c~ka~~~L~~~~i~~~~id 30 (111)
T cd03036 10 STCRKAKKWLDEHGVDYTAID 30 (111)
T ss_pred HHHHHHHHHHHHcCCceEEec
Confidence 345689999999999999876
No 186
>COG2154 Pterin-4a-carbinolamine dehydratase [Coenzyme metabolism]
Probab=26.92 E-value=80 Score=21.32 Aligned_cols=34 Identities=21% Similarity=0.370 Sum_probs=26.7
Q ss_pred cCCCCCCCCCCC-CCCceeeecCCHHHHHHHHHHc
Q psy627 22 ENPLMGWCSTGD-PLSNLALNFSSKEDAIQYCQKN 55 (89)
Q Consensus 22 ~dPLMGWtss~D-~~~qv~l~F~s~e~Aiayae~~ 55 (89)
...|-||.-..| ..-.-..+|.+-.+||+|.-+-
T Consensus 15 l~~l~gW~l~~~~~~l~r~f~FknF~~a~~F~~~v 49 (101)
T COG2154 15 LRALPGWELADDGAKLTRTFKFKNFKQAIAFVNRV 49 (101)
T ss_pred hcCCCCCEEecCcceEEEEEEcCCHHHHHHHHHHH
Confidence 368999999988 3334479999999999997653
No 187
>PF10827 DUF2552: Protein of unknown function (DUF2552) ; InterPro: IPR020157 This entry contains proteins with no known function.
Probab=26.92 E-value=52 Score=21.40 Aligned_cols=16 Identities=19% Similarity=0.322 Sum_probs=12.6
Q ss_pred eecCCHHHHHHHHHHc
Q psy627 40 LNFSSKEDAIQYCQKN 55 (89)
Q Consensus 40 l~F~s~e~Aiayae~~ 55 (89)
-.|++.|+||+.-..|
T Consensus 56 ~eF~tld~Ai~Wi~e~ 71 (79)
T PF10827_consen 56 QEFPTLDLAIAWIGEH 71 (79)
T ss_pred cccccHHHHHHHHHhc
Confidence 3578899999987655
No 188
>cd04449 DEP_DEPDC5-like DEP (Dishevelled, Egl-10, and Pleckstrin) domain found in DEPDC5-like proteins. DEPDC5, in human also known as KIAA0645, is a DEP domain containing protein of unknown function.
Probab=26.74 E-value=55 Score=20.58 Aligned_cols=22 Identities=23% Similarity=0.354 Sum_probs=16.9
Q ss_pred cCCHHHHHHHHH---HcCccEEEec
Q psy627 42 FSSKEDAIQYCQ---KNGWKFFVEE 63 (89)
Q Consensus 42 F~s~e~Aiayae---~~G~~Y~V~~ 63 (89)
..|+++||++|+ .+|+=..|..
T Consensus 46 ~~~r~eAv~lgq~Ll~~g~I~hv~~ 70 (83)
T cd04449 46 VDTREEAVELGQELMNEGLIEHVSG 70 (83)
T ss_pred CCCHHHHHHHHHHHHHCCCEEecCC
Confidence 579999999998 4677666543
No 189
>COG4545 Glutaredoxin-related protein [Posttranslational modification, protein turnover, chaperones]
Probab=26.67 E-value=71 Score=21.07 Aligned_cols=21 Identities=14% Similarity=0.208 Sum_probs=18.2
Q ss_pred CCHHHHHHHHHHcCccEEEec
Q psy627 43 SSKEDAIQYCQKNGWKFFVEE 63 (89)
Q Consensus 43 ~s~e~Aiayae~~G~~Y~V~~ 63 (89)
+....|++|-++.+++|+-++
T Consensus 13 pdca~a~eyl~rl~v~yd~Ve 33 (85)
T COG4545 13 PDCAPAVEYLERLNVDYDFVE 33 (85)
T ss_pred cchHHHHHHHHHcCCCceeee
Confidence 556889999999999998766
No 190
>cd03409 Chelatase_Class_II Class II Chelatase: a family of ATP-independent monomeric or homodimeric enzymes that catalyze the insertion of metal into protoporphyrin rings. This family includes protoporphyrin IX ferrochelatase (HemH), sirohydrochlorin ferrochelatase (SirB) and the cobaltochelatases, CbiK and CbiX. HemH and SirB are involved in heme and siroheme biosynthesis, respectively, while the cobaltochelatases are associated with cobalamin biosynthesis. Excluded from this family are the ATP-dependent heterotrimeric chelatases (class I) and the multifunctional homodimeric enzymes with dehydrogenase and chelatase activities (class III).
Probab=26.44 E-value=95 Score=18.95 Aligned_cols=24 Identities=25% Similarity=0.295 Sum_probs=21.3
Q ss_pred cCCHHHHHHHHHHcCccEEEeccC
Q psy627 42 FSSKEDAIQYCQKNGWKFFVEEPK 65 (89)
Q Consensus 42 F~s~e~Aiayae~~G~~Y~V~~p~ 65 (89)
.++.++|++-+.+.|++-.|.-|-
T Consensus 44 ~P~i~~~l~~l~~~g~~~vvvvPl 67 (101)
T cd03409 44 GPDTEEAIRELAEEGYQRVVIVPL 67 (101)
T ss_pred CCCHHHHHHHHHHcCCCeEEEEeC
Confidence 589999999999999998888774
No 191
>KOG1380|consensus
Probab=26.44 E-value=27 Score=28.85 Aligned_cols=12 Identities=42% Similarity=0.994 Sum_probs=9.2
Q ss_pred cCCCCCCCCCCC
Q psy627 22 ENPLMGWCSTGD 33 (89)
Q Consensus 22 ~dPLMGWtss~D 33 (89)
+-|||||...+-
T Consensus 234 IPPLMGwAAasg 245 (409)
T KOG1380|consen 234 IPPLMGWAAASG 245 (409)
T ss_pred CCccchhhhhcc
Confidence 579999987643
No 192
>smart00633 Glyco_10 Glycosyl hydrolase family 10.
Probab=26.40 E-value=43 Score=24.70 Aligned_cols=21 Identities=29% Similarity=0.642 Sum_probs=18.8
Q ss_pred eeecCCHHHHHHHHHHcCccE
Q psy627 39 ALNFSSKEDAIQYCQKNGWKF 59 (89)
Q Consensus 39 ~l~F~s~e~Aiayae~~G~~Y 59 (89)
+..|...+.+|++|+++|+..
T Consensus 12 ~~n~~~~D~~~~~a~~~gi~v 32 (254)
T smart00633 12 QFNFSGADAIVNFAKENGIKV 32 (254)
T ss_pred ccChHHHHHHHHHHHHCCCEE
Confidence 467899999999999999995
No 193
>PLN03181 glycosyltransferase; Provisional
Probab=26.33 E-value=48 Score=27.89 Aligned_cols=20 Identities=20% Similarity=0.730 Sum_probs=15.4
Q ss_pred HHHHHHHHHHcCccEEEecc
Q psy627 45 KEDAIQYCQKNGWKFFVEEP 64 (89)
Q Consensus 45 ~e~Aiayae~~G~~Y~V~~p 64 (89)
.+-=++||++||+++.+...
T Consensus 154 ikNR~dYArrHGY~lf~~~a 173 (453)
T PLN03181 154 FKNKVDYCRIHGYDIFYNNA 173 (453)
T ss_pred HHHHHHHHHHhCCcEEEecc
Confidence 34457899999999987554
No 194
>PRK09732 hypothetical protein; Provisional
Probab=26.20 E-value=67 Score=22.34 Aligned_cols=18 Identities=22% Similarity=0.381 Sum_probs=14.5
Q ss_pred HHHHHHHHHHcCccEEEe
Q psy627 45 KEDAIQYCQKNGWKFFVE 62 (89)
Q Consensus 45 ~e~Aiayae~~G~~Y~V~ 62 (89)
.++|++.|+++|++..|-
T Consensus 16 ~~aA~~~A~~~g~~v~ia 33 (134)
T PRK09732 16 IAAGQEEAQKNNWSVSIA 33 (134)
T ss_pred HHHHHHHHHHhCCCEEEE
Confidence 367899999999987653
No 195
>KOG3048|consensus
Probab=26.20 E-value=46 Score=24.22 Aligned_cols=13 Identities=31% Similarity=0.557 Sum_probs=11.5
Q ss_pred CCHHHHHHHHHHc
Q psy627 43 SSKEDAIQYCQKN 55 (89)
Q Consensus 43 ~s~e~Aiayae~~ 55 (89)
.|.|+|.+||+|.
T Consensus 97 K~~e~akdyfkRK 109 (153)
T KOG3048|consen 97 KDAEDAKDYFKRK 109 (153)
T ss_pred echHHHHHHHHHH
Confidence 6899999999984
No 196
>PF09691 PulS_OutS: Bacterial chaperone lipoprotein (PulS_OutS); InterPro: IPR019114 This family comprises lipoproteins from gamma proteobacterial species: pullulanase secretion protein PulS protein of Klebsiella pneumoniae (P20440 from SWISSPROT), the lipoprotein OutS protein of Erwinia chrysanthemi (Q01567 from SWISSPROT) and the functionally uncharacterised type II secretion protein EtpO (Q7BSV3 from SWISSPROT) from Escherichia coli O157:H7. PulS and OutS have been shown to interact with and facilitate insertion of secretins into the outer membrane, suggesting a chaperone-like, or piloting function for members of this family. In addition to the PulS/OutS proteins this entry also includes other functionally uncharacterised proteins, such as YacC from Escherischia coli (P0AA95 from SWISSPROT).; PDB: 3SOL_A 3UYM_A 3UTK_A 4A56_A.
Probab=26.14 E-value=46 Score=22.68 Aligned_cols=13 Identities=31% Similarity=0.833 Sum_probs=9.9
Q ss_pred HHHHHHHHcCccE
Q psy627 47 DAIQYCQKNGWKF 59 (89)
Q Consensus 47 ~Aiayae~~G~~Y 59 (89)
.||.+|+++||+.
T Consensus 54 ~a~~~A~~~gWd~ 66 (109)
T PF09691_consen 54 AAVIFAQQRGWDT 66 (109)
T ss_dssp HHHHHHHHTT--C
T ss_pred HHHHHHHHcCCCc
Confidence 6999999999984
No 197
>COG0269 SgbH 3-hexulose-6-phosphate synthase and related proteins [Carbohydrate transport and metabolism]
Probab=26.02 E-value=73 Score=24.27 Aligned_cols=31 Identities=26% Similarity=0.268 Sum_probs=25.6
Q ss_pred CCCceeeecCCHHHHHHHHHHcCccEEEecc
Q psy627 34 PLSNLALNFSSKEDAIQYCQKNGWKFFVEEP 64 (89)
Q Consensus 34 ~~~qv~l~F~s~e~Aiayae~~G~~Y~V~~p 64 (89)
|+-||-|-..|+|+||..+++-+=..+|.|-
T Consensus 4 p~LQvALD~~~l~~Ai~~a~~v~~~~diiEv 34 (217)
T COG0269 4 PLLQVALDLLDLEEAIEIAEEVADYVDIIEV 34 (217)
T ss_pred cceEeeecccCHHHHHHHHHHhhhcceEEEe
Confidence 6789999999999999999987655556553
No 198
>cd04440 DEP_2_P-Rex DEP (Dishevelled, Egl-10, and Pleckstrin) domain 2 found in P-Rex-like proteins. The P-Rex family is the guanine-nucleotide exchange factor (GEF) for the small GTPase Rac that contains an N-terminal RhoGEF domain, two DEP and PDZ domains. Rac-GEF activity is stimulated by phosphatidylinositol (3,4,5)-trisphosphate (PtdIns(3,4,5)P3), a lipid second messenger, and the G beta-gamma subunits of heterotrimeric G proteins. The DEP domains are not involved in mediating these stimuli, but may be of importance for basal and stimulated levels Rac-GEF activity.
Probab=25.98 E-value=56 Score=21.62 Aligned_cols=23 Identities=26% Similarity=0.384 Sum_probs=18.2
Q ss_pred cCCHHHHHHHHH---HcCccEEEecc
Q psy627 42 FSSKEDAIQYCQ---KNGWKFFVEEP 64 (89)
Q Consensus 42 F~s~e~Aiayae---~~G~~Y~V~~p 64 (89)
-.|+|+||.|++ .+||=-.|...
T Consensus 53 ~~tR~eAv~~gq~Ll~~gii~HV~~~ 78 (93)
T cd04440 53 CRTREEAVILGVGLCNNGFMHHVLEK 78 (93)
T ss_pred CCCHHHHHHHHHHHHhCCCEEecCCC
Confidence 469999999998 57887777643
No 199
>PF05172 Nup35_RRM: Nup53/35/40-type RNA recognition motif; InterPro: IPR007846 The MPPN (Mitotic PhosphoProtein N end) family is uncharacterised however it probably plays a role in the cell cycle because the family includes mitotic phosphoproteins O13026 from SWISSPROT []. This family also includes Q05166 from SWISSPROT a suppressor of thermosensitive mutations in the DNA polymerase delta gene, Pol III []. The conserved central region appears to be distantly related to the RNA-binding region RNP-1 (RNA recognition motif, IPR000504 from INTERPRO), suggesting an RNA binding function for this protein.; PDB: 1WWH_C 3P3D_A.
Probab=25.87 E-value=56 Score=21.67 Aligned_cols=22 Identities=27% Similarity=0.379 Sum_probs=19.7
Q ss_pred eeeecCCHHHHHHHHHHcCccE
Q psy627 38 LALNFSSKEDAIQYCQKNGWKF 59 (89)
Q Consensus 38 v~l~F~s~e~Aiayae~~G~~Y 59 (89)
|+|+|+++.+|..--.+||--+
T Consensus 57 i~I~Y~~~~~A~rAL~~NG~i~ 78 (100)
T PF05172_consen 57 IHITYDNPLSAQRALQKNGTIF 78 (100)
T ss_dssp EEEEESSHHHHHHHHTTTTEEE
T ss_pred EEEECCCHHHHHHHHHhCCeEE
Confidence 8999999999999999998643
No 200
>KOG2564|consensus
Probab=25.84 E-value=41 Score=27.31 Aligned_cols=22 Identities=32% Similarity=0.365 Sum_probs=18.2
Q ss_pred eeeecCCHHHHHHHHHHcCccE
Q psy627 38 LALNFSSKEDAIQYCQKNGWKF 59 (89)
Q Consensus 38 v~l~F~s~e~Aiayae~~G~~Y 59 (89)
---.|.|.|+||..|-+.|.-=
T Consensus 199 rP~~F~Si~~Ai~W~v~sg~~R 220 (343)
T KOG2564|consen 199 RPKSFKSIEDAIEWHVRSGQLR 220 (343)
T ss_pred CCccccchhhHHHHHhcccccc
Confidence 3457999999999999998643
No 201
>KOG0092|consensus
Probab=25.82 E-value=56 Score=24.69 Aligned_cols=52 Identities=21% Similarity=0.423 Sum_probs=33.8
Q ss_pred CceEEEecCCCCccCCCCCCCCC-CCCCCceeeecCCHHHHHHHHHHcCccEEEecc
Q psy627 9 NNWFIQFDTRERWENPLMGWCST-GDPLSNLALNFSSKEDAIQYCQKNGWKFFVEEP 64 (89)
Q Consensus 9 ~~W~lefe~~~~~~dPLMGWtss-~D~~~qv~l~F~s~e~Aiayae~~G~~Y~V~~p 64 (89)
+.|+.|...... .|-+|-=.|+ .|-.+...+ ..+||.+||+.+|+.|.-...
T Consensus 97 K~WvkeL~~~~~-~~~vialvGNK~DL~~~R~V---~~~ea~~yAe~~gll~~ETSA 149 (200)
T KOG0092|consen 97 KNWVKELQRQAS-PNIVIALVGNKADLLERREV---EFEEAQAYAESQGLLFFETSA 149 (200)
T ss_pred HHHHHHHHhhCC-CCeEEEEecchhhhhhcccc---cHHHHHHHHHhcCCEEEEEec
Confidence 556666665544 5555555555 455443333 468999999999999965543
No 202
>cd04441 DEP_2_DEP6 DEP (Dishevelled, Egl-10, and Pleckstrin) domain 2 found in DEP6-like proteins. DEP6 proteins contain two DEP and a PDZ domain. Their function is unknown.
Probab=25.82 E-value=58 Score=21.10 Aligned_cols=22 Identities=27% Similarity=0.516 Sum_probs=17.1
Q ss_pred cCCHHHHHHHHH---HcCccEEEec
Q psy627 42 FSSKEDAIQYCQ---KNGWKFFVEE 63 (89)
Q Consensus 42 F~s~e~Aiayae---~~G~~Y~V~~ 63 (89)
..|+++||.+|+ .+|+=-.|..
T Consensus 48 ~~sR~eAv~lgq~Ll~~gii~HV~~ 72 (85)
T cd04441 48 AESRREAVQLCRRLLEHGIIQHVSN 72 (85)
T ss_pred CCCHHHHHHHHHHHHHCCCEEecCC
Confidence 589999999998 4677666653
No 203
>PRK10638 glutaredoxin 3; Provisional
Probab=25.71 E-value=67 Score=19.36 Aligned_cols=22 Identities=14% Similarity=-0.058 Sum_probs=18.2
Q ss_pred CHHHHHHHHHHcCccEEEeccC
Q psy627 44 SKEDAIQYCQKNGWKFFVEEPK 65 (89)
Q Consensus 44 s~e~Aiayae~~G~~Y~V~~p~ 65 (89)
-...|..+.+++|++|++.+-.
T Consensus 14 ~C~~a~~~L~~~gi~y~~~dv~ 35 (83)
T PRK10638 14 FCHRAKALLNSKGVSFQEIPID 35 (83)
T ss_pred hHHHHHHHHHHcCCCcEEEECC
Confidence 4578999999999999987643
No 204
>cd03416 CbiX_SirB_N Sirohydrochlorin cobalt chelatase (CbiX) and sirohydrochlorin iron chelatase (SirB), N-terminal domain. SirB catalyzes the ferro-chelation of sirohydrochlorin to siroheme, the prosthetic group of sulfite and nitrite reductases. CbiX is a cobaltochelatase, responsible for the chelation of Co2+ into sirohydrochlorin, an important step in the vitamin B12 biosynthetic pathway. CbiX often contains a C-terminal histidine-rich region that may be important for metal delivery and/or storage, and may also contain an iron-sulfur center. Both are found in a wide range of bacteria. This subgroup also contains single domain proteins from archaea and bacteria which may represent the ancestral form of class II chelatases before domain duplication occurred.
Probab=25.55 E-value=1.1e+02 Score=18.98 Aligned_cols=26 Identities=23% Similarity=0.170 Sum_probs=21.9
Q ss_pred eeecCCHHHHHHHHHHcCccEEEecc
Q psy627 39 ALNFSSKEDAIQYCQKNGWKFFVEEP 64 (89)
Q Consensus 39 ~l~F~s~e~Aiayae~~G~~Y~V~~p 64 (89)
....++.++|++-+.+.|.+-.+..|
T Consensus 40 e~~~p~~~~~l~~l~~~g~~~v~vvP 65 (101)
T cd03416 40 ELAEPSLAEALDELAAQGATRIVVVP 65 (101)
T ss_pred EcCCCCHHHHHHHHHHcCCCEEEEEe
Confidence 34568999999999999999877776
No 205
>COG1393 ArsC Arsenate reductase and related proteins, glutaredoxin family [Inorganic ion transport and metabolism]
Probab=25.49 E-value=60 Score=21.91 Aligned_cols=21 Identities=14% Similarity=0.358 Sum_probs=17.5
Q ss_pred CCHHHHHHHHHHcCccEEEec
Q psy627 43 SSKEDAIQYCQKNGWKFFVEE 63 (89)
Q Consensus 43 ~s~e~Aiayae~~G~~Y~V~~ 63 (89)
.|...|.++-+.||++|++..
T Consensus 12 ~t~rka~~~L~~~gi~~~~~~ 32 (117)
T COG1393 12 STCRKALAWLEEHGIEYTFID 32 (117)
T ss_pred hHHHHHHHHHHHcCCCcEEEE
Confidence 466789999999999997653
No 206
>PF07476 MAAL_C: Methylaspartate ammonia-lyase C-terminus; InterPro: IPR022662 Methylaspartate ammonia-lyase 4.3.1.2 from EC catalyses the second step of fermentation of glutamate. It is a homodimer. This domain represents the C-terminal region of methylaspartate ammonia-lyase and contains a TIM barrel fold similar to the PF01188 from PFAM. This domain represents the catalytic domain and contains a metal binding site []. ; PDB: 1KKO_B 1KKR_A 3ZVI_A 1KD0_B 1KCZ_B 3ZVH_A.
Probab=25.42 E-value=60 Score=25.31 Aligned_cols=17 Identities=24% Similarity=0.681 Sum_probs=13.2
Q ss_pred HHHHHHHHHHcCccEEE
Q psy627 45 KEDAIQYCQKNGWKFFV 61 (89)
Q Consensus 45 ~e~Aiayae~~G~~Y~V 61 (89)
--+||-||+++|+.-.+
T Consensus 178 tieAvlyCk~~gvgaY~ 194 (248)
T PF07476_consen 178 TIEAVLYCKEHGVGAYL 194 (248)
T ss_dssp HHHHHHHHHHTT-EEEE
T ss_pred HHHHHHHHHhcCCceee
Confidence 35899999999998665
No 207
>PRK09314 bifunctional 3,4-dihydroxy-2-butanone 4-phosphate synthase/GTP cyclohydrolase II protein; Provisional
Probab=25.32 E-value=65 Score=25.95 Aligned_cols=18 Identities=17% Similarity=0.464 Sum_probs=15.8
Q ss_pred CCHHHHHHHHHHcCccEE
Q psy627 43 SSKEDAIQYCQKNGWKFF 60 (89)
Q Consensus 43 ~s~e~Aiayae~~G~~Y~ 60 (89)
..++++++||++||++..
T Consensus 172 a~~~~l~~fA~~h~l~~i 189 (339)
T PRK09314 172 ARRDDLEDFAKKHNLKMI 189 (339)
T ss_pred ccHHHHHHHHHHcCCcEE
Confidence 468999999999999983
No 208
>PF01171 ATP_bind_3: PP-loop family; InterPro: IPR011063 This entry represents the PP-loop motif superfamily [,]. The PP-loop motif appears to be a modified version of the P-loop of nucleotide binding domain that is involved in phosphate binding []. Named PP-motif, since it appears to be a part of a previously uncharacterised ATP pyrophophatase domain. ATP sulfurylases, Escherichia coli NtrL, and Bacillus subtilis OutB consist of this domain alone. In other proteins, the pyrophosphatase domain is associated with amidotransferase domains (type I or type II), a putative citrulline-aspartate ligase domain or a nitrilase/amidase domain.; PDB: 3A2K_A 2E89_B 2E21_D 1WY5_B 1NI5_A.
Probab=25.28 E-value=46 Score=23.09 Aligned_cols=24 Identities=25% Similarity=0.515 Sum_probs=18.4
Q ss_pred eecCCHHHHHHHHHHcCccEEEec
Q psy627 40 LNFSSKEDAIQYCQKNGWKFFVEE 63 (89)
Q Consensus 40 l~F~s~e~Aiayae~~G~~Y~V~~ 63 (89)
|-.-++++-++||+.+||+|....
T Consensus 144 Ll~~~k~ei~~~~~~~~i~~~~D~ 167 (182)
T PF01171_consen 144 LLYVSKDEIRAYAKENGIPYVEDP 167 (182)
T ss_dssp GGCS-HHHHHHHHHHTT-SSBS-C
T ss_pred chhCCHHHHHHHHHHCCCcEEECc
Confidence 667899999999999999987643
No 209
>cd03592 CLECT_selectins_like C-type lectin-like domain (CTLD) of the type found in the type 1 transmembrane proteins: P(platlet)-, E(endothelial)-, and L(leukocyte)- selectins (sels). CLECT_selectins_like: C-type lectin-like domain (CTLD) of the type found in the type 1 transmembrane proteins: P(platlet)-, E(endothelial)-, and L(leukocyte)- selectins (sels). CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. P- E- and L-sels are cell adhesion receptors that mediate the initial attachment, tethering, and rolling of lymphocytes on inflamed vascular walls enabling subsequent lymphocyte adhesion and transmigration. L- sel is expressed constitutively on most leukocytes. P-sel is stored in the Weibel-Palade bodies of endothelial cells and in the alpha granules of platlets. E- sels are present on endothelial cells. Following platelet and/or endothelial cell activation P- sel is rapidly translocated to the cell surface and E-sel exp
Probab=25.28 E-value=59 Score=20.40 Aligned_cols=19 Identities=21% Similarity=0.604 Sum_probs=14.5
Q ss_pred CCHHHHHHHHHHcCccEEE
Q psy627 43 SSKEDAIQYCQKNGWKFFV 61 (89)
Q Consensus 43 ~s~e~Aiayae~~G~~Y~V 61 (89)
.|=++|.+||+++|-....
T Consensus 10 ~~w~~A~~~C~~~g~~La~ 28 (115)
T cd03592 10 MTFNEAVKYCKSRGTDLVA 28 (115)
T ss_pred cCHHHHHHHHHHcCCeEee
Confidence 3558999999999965444
No 210
>PF15649 Tox-REase-7: Restriction endonuclease fold toxin 7
Probab=25.27 E-value=94 Score=20.28 Aligned_cols=22 Identities=23% Similarity=0.510 Sum_probs=17.6
Q ss_pred HHHHHHHHHHcCccEEEeccCC
Q psy627 45 KEDAIQYCQKNGWKFFVEEPKW 66 (89)
Q Consensus 45 ~e~Aiayae~~G~~Y~V~~p~~ 66 (89)
.-+-++||+.+|.+|.+.-.+.
T Consensus 59 lr~~~~~A~~~G~~~~Lvv~~~ 80 (87)
T PF15649_consen 59 LRDYVKYAKENGYRFNLVVNHR 80 (87)
T ss_pred HHHHHHHHHHcCCcEEEEEcCC
Confidence 3578999999999998876543
No 211
>PF14555 UBA_4: UBA-like domain; PDB: 2DAL_A 3BQ3_A 2L4E_A 2L4F_A 2DZL_A 2L2D_A 2DAM_A 1V92_A 3E21_A.
Probab=25.23 E-value=73 Score=17.46 Aligned_cols=16 Identities=38% Similarity=0.814 Sum_probs=13.5
Q ss_pred CHHHHHHHHHHcCccE
Q psy627 44 SKEDAIQYCQKNGWKF 59 (89)
Q Consensus 44 s~e~Aiayae~~G~~Y 59 (89)
+.+.|+.|-++++|+.
T Consensus 14 ~~~~A~~~L~~~~wdl 29 (43)
T PF14555_consen 14 DEDVAIQYLEANNWDL 29 (43)
T ss_dssp SHHHHHHHHHHTTT-H
T ss_pred CHHHHHHHHHHcCCCH
Confidence 6799999999999975
No 212
>PLN02923 xylose isomerase
Probab=25.19 E-value=75 Score=26.96 Aligned_cols=30 Identities=33% Similarity=0.462 Sum_probs=22.5
Q ss_pred HHHHHHHHHHcCccEEE-eccCCCCCCcccc
Q psy627 45 KEDAIQYCQKNGWKFFV-EEPKWKTPKVKSY 74 (89)
Q Consensus 45 ~e~Aiayae~~G~~Y~V-~~p~~~~~~~ksY 74 (89)
..-|++|+++.|.+..+ +||++.-+++-.|
T Consensus 255 l~ma~dY~~~iGf~g~flIEPKP~EPtkHqY 285 (478)
T PLN02923 255 LEAAVAYKKKIGFNGTLLIEPKPQEPTKHQY 285 (478)
T ss_pred HHHHHHHHHhcCCCceEEecCCCCCCCCCcc
Confidence 35689999999999865 6788766665555
No 213
>PF02177 APP_N: Amyloid A4 N-terminal heparin-binding; InterPro: IPR015849 Amyloid-beta precursor protein (APP, or A4) is associated with Alzheimer's disease (AD), because one of its breakdown products, amyloid-beta (A-beta), aggregates to form amyloid or senile plaques [, ]. Mutations in APP or in proteins that process APP have been linked with early-onset, familial AD. Individuals with Down's syndrome carry an extra copy of chromosome 21, which contains the APP gene, and almost invariably develop amyloid plaques and Alzheimer's symptoms. APP is important for the neurogenesis and neuronal regeneration, either through the intact protein, or through its many breakdown products []. APP consists of a large N-terminal extracellular region containing heparin-binding and copper-binding sites, a short hydrophobic transmembrane domain, and a short C-terminal intracellular domain. The N-terminal region is similar in structure to cysteine-rich growth factors and appears to function as a cell surface receptor, contributing to neurite growth, neuronal adhesion, axonogenesis and cell mobility []. APP acts as a kinesin I membrane receptor to mediate the axonal transport of beta-secretase and presenilin 1. The N-terminal domain can regulate neurite outgrowth through its binding to heparin and collagen I and IV, which are components of the extracellular matrix. APP is also coupled to apoptosis-inducing pathways, and is involved in copper homeostasis/oxidative stress through copper ion reduction, where copper-metallated APP induces neuronal death []. The C-terminal intracellular domain appears to be involved in transcription regulation through protein-protein interactions. APP can promote transcription activation through binding to APBB1/Tip60, and may bind to the adaptor protein FE65 to transactivate a wide variety of different promoters. APP can be processed by different sets of enzymes: In the non-amyloidogenic (non-plaque-forming) pathway, APP is cleaved by alpha-secretase to yield a soluble N-terminal sAPP-alpha (neuroprotective) and a membrane-bound CTF-alpha. CTF-alpha is broken-down by presenilin-containing gamma-secretase to yield soluble p3 and membrane-bound AICD (nuclear signalling). In the amyloidogenic pathway (plaque-forming), APP is broken down by beta-secretase to yield soluble sAPP-beta and membrane-bound CTF-beta. CTF-beta is broken down by gamma-secretase to yield soluble amyloid-beta and membrane-bound AICD. Amyloid-beta is required for neuronal function, but can aggregate to form amyloid plaques that seem to disrupt brain cells by clogging points of cell-cell contact. This entry represents a heparin-binding domain found at the N-terminal of the extracellular domain, which is itself found at the N-terminal of amyloidogenic glycoproteins such as amyloid-beta precursor protein (APP, or A4). The core of the heparin-binding domain has an unusual disulphide-rich fold, consisting of a beta-x-alpha-beta-loop-beta topology []. More information about these protein can be found at Protein of the Month: Amyloid-beta Precursor Protein [].; GO: 0005488 binding, 0016021 integral to membrane; PDB: 3KTM_E 1MWP_A 2FKL_A 1OWT_A.
Probab=25.13 E-value=61 Score=22.00 Aligned_cols=25 Identities=24% Similarity=0.380 Sum_probs=16.1
Q ss_pred eeeecCCHHHHHHHHHHcCccEEEe
Q psy627 38 LALNFSSKEDAIQYCQKNGWKFFVE 62 (89)
Q Consensus 38 v~l~F~s~e~Aiayae~~G~~Y~V~ 62 (89)
..-.|.++++.++||++.--+..|.
T Consensus 30 ~~~C~~~k~eIL~YCrkvYP~l~It 54 (102)
T PF02177_consen 30 SASCLKDKEEILKYCRKVYPELQIT 54 (102)
T ss_dssp --B---SHHHHHHHHHHHSTTS-EE
T ss_pred CccccCChHHHHHHHHHhCCCCcee
Confidence 3667899999999999987766553
No 214
>KOG2944|consensus
Probab=25.10 E-value=1.9e+02 Score=21.44 Aligned_cols=42 Identities=14% Similarity=0.242 Sum_probs=32.6
Q ss_pred CCCCC--CCceeeecCCHHHHHHHHHHcCccEEEeccCCCCCCcc
Q psy627 30 STGDP--LSNLALNFSSKEDAIQYCQKNGWKFFVEEPKWKTPKVK 72 (89)
Q Consensus 30 ss~D~--~~qv~l~F~s~e~Aiayae~~G~~Y~V~~p~~~~~~~k 72 (89)
++.|| ...+.+..++-++|+..-|..|+. .+..|..-++++.
T Consensus 109 gN~~prGfgHIci~V~di~sac~~lkekGV~-f~Kk~~dGk~K~i 152 (170)
T KOG2944|consen 109 GNKEPRGFGHICIEVDDINSACERLKEKGVR-FKKKLKDGKMKPI 152 (170)
T ss_pred CCCCCCccceEEEEeCCHHHHHHHHHHhCce-eeecCCCccccce
Confidence 45666 467899999999999999999999 5666666555443
No 215
>PF03754 DUF313: Domain of unknown function (DUF313) ; InterPro: IPR005508 This is a family of proteins from Arabidopsis thaliana (Mouse-ear cress) with uncharacterised function.
Probab=25.09 E-value=43 Score=22.88 Aligned_cols=34 Identities=18% Similarity=0.416 Sum_probs=28.3
Q ss_pred CCCce-eeecCCHHHHHHH-------HHHcCccEEEeccCCC
Q psy627 34 PLSNL-ALNFSSKEDAIQY-------CQKNGWKFFVEEPKWK 67 (89)
Q Consensus 34 ~~~qv-~l~F~s~e~Aiay-------ae~~G~~Y~V~~p~~~ 67 (89)
|.+|| ...|-|.||--+. .+..|++..|..|...
T Consensus 43 P~~qi~~~dFLt~eE~~~i~~~~~~~~~~~Gv~V~lvdp~~~ 84 (114)
T PF03754_consen 43 PFNQIIDNDFLTEEEKRIIKEEKKNNDKKKGVEVILVDPSLR 84 (114)
T ss_pred cHHHhcccccCCHHHHHHHHHhhccCcccCCceEEEECCcCc
Confidence 45566 7889999999888 6788999999998764
No 216
>TIGR00168 infC translation initiation factor IF-3. render its expression particularly sensitive to excess of its gene product IF-3 thereby regulating its own expression
Probab=24.95 E-value=1e+02 Score=22.11 Aligned_cols=33 Identities=24% Similarity=0.465 Sum_probs=24.2
Q ss_pred CCHHHHHHHHHHcCccEEEeccCCC--CCCccccc
Q psy627 43 SSKEDAIQYCQKNGWKFFVEEPKWK--TPKVKSYA 75 (89)
Q Consensus 43 ~s~e~Aiayae~~G~~Y~V~~p~~~--~~~~ksYa 75 (89)
=+..+|++.|++.|++-....|... .-+.-.|+
T Consensus 23 ~~~~eAl~~A~~~~lDLVev~~~a~PPVckImdy~ 57 (165)
T TIGR00168 23 VSREEALEIAEEAGLDLVLISPNAKPPVCKIMDYG 57 (165)
T ss_pred ccHHHHHHHHHHcCCcEEEECCCCCCCEEEEeeHH
Confidence 3688999999999999877776532 22444555
No 217
>TIGR00746 arcC carbamate kinase. The seed alignment for this model includes experimentally confirmed examples from a set of phylogenetically distinct species. In a neighbor-joining tree constructed from an alignment of candidate carbamate kinases and several acetylglutamate kinases, the latter group forms a clear outgroup which roots the tree of carbamate kinase-like proteins. This analysis suggests that in E. coli, the ArcC paralog YqeA may be a second isozyme, while the paralog YahI branches as an outlier and is less likely to be an authentic carbamate kinase. The homolog from Mycoplasma pneumoniae likewise branches outside the set containing known carbamate kinases and also scores below the trusted cutoff.
Probab=24.88 E-value=81 Score=24.79 Aligned_cols=21 Identities=24% Similarity=0.491 Sum_probs=17.9
Q ss_pred ecCCHHHHHHHHHHcCccEEE
Q psy627 41 NFSSKEDAIQYCQKNGWKFFV 61 (89)
Q Consensus 41 ~F~s~e~Aiayae~~G~~Y~V 61 (89)
.|-|+|+|.+.+++.||.|.-
T Consensus 131 ~~y~~~~a~~~~~~~~~~~~~ 151 (310)
T TIGR00746 131 PFYTEEEAKRLAAEKGWIVKE 151 (310)
T ss_pred CCcCHHHHHHHHHHcCCeEee
Confidence 478999999999999996544
No 218
>PRK02315 adaptor protein; Provisional
Probab=24.87 E-value=80 Score=23.62 Aligned_cols=23 Identities=22% Similarity=0.210 Sum_probs=19.5
Q ss_pred CCceeeecCCHHHHHHHHHHcCc
Q psy627 35 LSNLALNFSSKEDAIQYCQKNGW 57 (89)
Q Consensus 35 ~~qv~l~F~s~e~Aiayae~~G~ 57 (89)
..++.+.|++.|+.|++|+...+
T Consensus 138 ~~~~i~~F~d~EdvI~la~~l~~ 160 (233)
T PRK02315 138 QLTYVLKFDDFEDVISLAKTLKV 160 (233)
T ss_pred ceEEEEEcCCHHHHHHHHHhcCc
Confidence 35678999999999999998843
No 219
>PRK06746 peptide chain release factor 2; Provisional
Probab=24.72 E-value=57 Score=26.16 Aligned_cols=16 Identities=19% Similarity=0.376 Sum_probs=13.8
Q ss_pred HHHHHHcCccEEEecc
Q psy627 49 IQYCQKNGWKFFVEEP 64 (89)
Q Consensus 49 iayae~~G~~Y~V~~p 64 (89)
..||+++||.++|.+-
T Consensus 113 ~r~a~~~g~~~evi~~ 128 (326)
T PRK06746 113 TRWAEKRGFKVETVDY 128 (326)
T ss_pred HHHHHHcCCEEEEEec
Confidence 4699999999999863
No 220
>COG0007 CysG Uroporphyrinogen-III methylase [Coenzyme metabolism]
Probab=24.53 E-value=99 Score=23.83 Aligned_cols=25 Identities=20% Similarity=0.193 Sum_probs=20.1
Q ss_pred ecCCHHHHHHHHHHcCccEEEeccC
Q psy627 41 NFSSKEDAIQYCQKNGWKFFVEEPK 65 (89)
Q Consensus 41 ~F~s~e~Aiayae~~G~~Y~V~~p~ 65 (89)
-|.=--+=++++.++||+|+|..-.
T Consensus 94 iFGRggEE~~~l~~~gI~~eVVPGi 118 (244)
T COG0007 94 IFGRGGEEIEALAEAGIEFEVVPGI 118 (244)
T ss_pred eecCcHHHHHHHHHcCCceEEeCcc
Confidence 4666667788999999999998744
No 221
>PF14370 Topo_C_assoc: C-terminal topoisomerase domain; PDB: 1TL8_A 1K4T_A 1A36_A 1RR8_C 1T8I_A 1SC7_A 1EJ9_A 1LPQ_A 1RRJ_A 1A31_A ....
Probab=24.43 E-value=50 Score=21.12 Aligned_cols=18 Identities=6% Similarity=0.224 Sum_probs=11.2
Q ss_pred CCHHHHHHHHHHcCccEE
Q psy627 43 SSKEDAIQYCQKNGWKFF 60 (89)
Q Consensus 43 ~s~e~Aiayae~~G~~Y~ 60 (89)
-.+-=.||+|++++++.+
T Consensus 31 lDPRItvAWck~~~VPie 48 (71)
T PF14370_consen 31 LDPRITVAWCKRHEVPIE 48 (71)
T ss_dssp S-HHHHHHHHHHTT--GG
T ss_pred cCcHHHHHHHHHhCCcHH
Confidence 344457999999998743
No 222
>PRK11172 dkgB 2,5-diketo-D-gluconate reductase B; Provisional
Probab=24.43 E-value=88 Score=23.10 Aligned_cols=23 Identities=4% Similarity=0.095 Sum_probs=19.4
Q ss_pred CHHHHHHHHHHcCccEEEeccCC
Q psy627 44 SKEDAIQYCQKNGWKFFVEEPKW 66 (89)
Q Consensus 44 s~e~Aiayae~~G~~Y~V~~p~~ 66 (89)
..++.+.+|+++|+.+..-.|-.
T Consensus 160 ~~~~ll~~~~~~gi~v~a~spl~ 182 (267)
T PRK11172 160 QNRKVVAFAKEHGIHVTSYMTLA 182 (267)
T ss_pred CcHHHHHHHHHCCCEEEEECCCC
Confidence 34689999999999999888764
No 223
>KOG0091|consensus
Probab=24.35 E-value=70 Score=24.21 Aligned_cols=19 Identities=21% Similarity=0.361 Sum_probs=16.7
Q ss_pred CHHHHHHHHHHcCccEEEe
Q psy627 44 SKEDAIQYCQKNGWKFFVE 62 (89)
Q Consensus 44 s~e~Aiayae~~G~~Y~V~ 62 (89)
|.|+|..||+-+|..|.=.
T Consensus 135 t~EEaEklAa~hgM~FVET 153 (213)
T KOG0091|consen 135 TAEEAEKLAASHGMAFVET 153 (213)
T ss_pred cHHHHHHHHHhcCceEEEe
Confidence 6899999999999998643
No 224
>cd03413 CbiK_C Anaerobic cobalamin biosynthetic cobalt chelatase (CbiK), C-terminal domain. CbiK is part of the cobalt-early path for cobalamin biosynthesis. It catalyzes the insertion of cobalt into the oxidized form of precorrin-2, factor II (sirohydrochlorin), the second step of the anaerobic branch of vitamin B12 biosynthesis. CbiK belongs to the class II family of chelatases, and is a homomeric enzyme that does not require ATP for its enzymatic activity.
Probab=24.26 E-value=1.1e+02 Score=20.01 Aligned_cols=25 Identities=20% Similarity=0.276 Sum_probs=21.7
Q ss_pred ecCCHHHHHHHHHHcCccEEEeccC
Q psy627 41 NFSSKEDAIQYCQKNGWKFFVEEPK 65 (89)
Q Consensus 41 ~F~s~e~Aiayae~~G~~Y~V~~p~ 65 (89)
.+++.+++|+-+.+.|..-.+.-|-
T Consensus 40 ~~P~i~~~l~~l~~~G~~~i~lvPl 64 (103)
T cd03413 40 GYPGLDDVLAKLKKAGIKKVTLMPL 64 (103)
T ss_pred CCCCHHHHHHHHHHcCCCEEEEEeh
Confidence 5689999999999999998777773
No 225
>PF04221 RelB: RelB antitoxin; InterPro: IPR007337 Plasmids may be maintained stably in bacterial populations through the action of addiction modules, in which a toxin and antidote are encoded in a cassette on the plasmid. In any daughter cell that lacks the plasmid, the toxin persists and is lethal after the antidote protein is depleted. Toxin/antitoxin pairs are also found on main chromosomes, and likely represent selfish DNA. Sequences in the seed for this alignment all were found adjacent to toxin genes. Several toxin/antitoxin pairs may occur in a single species. RelE and RelB form a toxin-antitoxin system; RelE represses translation, probably through binding ribosomes [, ]. RelB stably binds RelE, presumably deactivating it.; PDB: 2KC8_B 2K29_A.
Probab=24.22 E-value=41 Score=21.08 Aligned_cols=20 Identities=20% Similarity=0.218 Sum_probs=10.3
Q ss_pred eeeecCCHHHHHHHHHHcCc
Q psy627 38 LALNFSSKEDAIQYCQKNGW 57 (89)
Q Consensus 38 v~l~F~s~e~Aiayae~~G~ 57 (89)
+|+-=+.+++|.+.|+++|+
T Consensus 6 ~Rid~~lK~~a~~il~~~Gl 25 (83)
T PF04221_consen 6 VRIDEELKEEAEAILEELGL 25 (83)
T ss_dssp EEE-HHHHHHHHHHHHHTT-
T ss_pred EEcCHHHHHHHHHHHHHcCC
Confidence 44444556666666665543
No 226
>PRK12454 carbamate kinase-like carbamoyl phosphate synthetase; Reviewed
Probab=24.21 E-value=73 Score=25.29 Aligned_cols=20 Identities=25% Similarity=0.559 Sum_probs=17.5
Q ss_pred ecCCHHHHHHHHHHcCccEE
Q psy627 41 NFSSKEDAIQYCQKNGWKFF 60 (89)
Q Consensus 41 ~F~s~e~Aiayae~~G~~Y~ 60 (89)
.|-|+|+|.+.++++||.|.
T Consensus 134 ~~y~~~~a~~~~~~~g~~~~ 153 (313)
T PRK12454 134 PFYDEEEAKKLAKEKGWIVK 153 (313)
T ss_pred CCcCHHHHHHHHHHcCCEEE
Confidence 38899999999999999654
No 227
>cd00037 CLECT C-type lectin (CTL)/C-type lectin-like (CTLD) domain. CLECT: C-type lectin (CTL)/C-type lectin-like (CTLD) domain; protein domains homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. This group is chiefly comprised of eukaryotic CTLDs, but contains some, as yet functionally uncharacterized, bacterial CTLDs. Many CTLDs are calcium-dependent carbohydrate binding modules; other CTLDs bind protein ligands, lipids, and inorganic surfaces, including CaCO3 and ice. Animal C-type lectins are involved in such functions as extracellular matrix organization, endocytosis, complement activation, pathogen recognition, and cell-cell interactions. For example: mannose-binding lectin and lung surfactant proteins A and D bind carbohydrates on surfaces (e.g. pathogens, allergens, necrotic, and apoptotic cells) and mediate functions associated with killing and phagocytosis; P (platlet)-, E (endothelial)-, and L (leukocyte)- selectins (sels) mediate the initia
Probab=24.15 E-value=68 Score=18.71 Aligned_cols=15 Identities=33% Similarity=0.570 Sum_probs=13.3
Q ss_pred cCCHHHHHHHHHHcC
Q psy627 42 FSSKEDAIQYCQKNG 56 (89)
Q Consensus 42 F~s~e~Aiayae~~G 56 (89)
..+-++|.++|+.+|
T Consensus 9 ~~~~~~A~~~C~~~~ 23 (116)
T cd00037 9 KLTWEEAQEYCRSLG 23 (116)
T ss_pred ccCHHHHHHHHHHcC
Confidence 467899999999999
No 228
>PRK12865 YciI-like protein; Reviewed
Probab=24.14 E-value=79 Score=20.20 Aligned_cols=17 Identities=18% Similarity=0.210 Sum_probs=14.5
Q ss_pred eeeecCCHHHHHHHHHH
Q psy627 38 LALNFSSKEDAIQYCQK 54 (89)
Q Consensus 38 v~l~F~s~e~Aiayae~ 54 (89)
+-+.++|+|+|.++++.
T Consensus 55 ~i~~a~s~e~a~~~~~~ 71 (97)
T PRK12865 55 VIVKAETKEAAKALADA 71 (97)
T ss_pred EEEEcCCHHHHHHHHHc
Confidence 45799999999999874
No 229
>PF01922 SRP19: SRP19 protein; InterPro: IPR002778 The signal recognition particle (SRP) is a multimeric protein, which along with its conjugate receptor (SR), is involved in targeting secretory proteins to the rough endoplasmic reticulum (RER) membrane in eukaryotes, or to the plasma membrane in prokaryotes [, ]. SRP recognises the signal sequence of the nascent polypeptide on the ribosome, retards its elongation, and docks the SRP-ribosome-polypeptide complex to the RER membrane via the SR receptor. Eukaryotic SRP consists of six polypeptides (SRP9, SRP14, SRP19, SRP54, SRP68 and SRP72) and a single 300 nucleotide 7S RNA molecule. The RNA component catalyses the interaction of SRP with its SR receptor []. In higher eukaryotes, the SRP complex consists of the Alu domain and the S domain linked by the SRP RNA. The Alu domain consists of a heterodimer of SRP9 and SRP14 bound to the 5' and 3' terminal sequences of SRP RNA. This domain is necessary for retarding the elongation of the nascent polypeptide chain, which gives SRP time to dock the ribosome-polypeptide complex to the RER membrane. In archaea, the SRP complex contains 7S RNA like its eukaryotic counterpart, yet only includes two of the six protein subunits found in the eukarytic complex: SRP19 and SRP54 []. This entry represents the SRP19 subunit. The SRP19 protein is unstructured but forms a compact core domain and two extended RNA-binding loops upon binding the signal recognition particle (SRP) RNA [].; GO: 0008312 7S RNA binding, 0006614 SRP-dependent cotranslational protein targeting to membrane, 0048500 signal recognition particle; PDB: 3DLU_A 3DLV_B 2J37_B 1MFQ_B 3KTV_D 1RY1_B 1JID_A 1KVV_A 1KVN_A 3KTW_B ....
Probab=24.14 E-value=1.7e+02 Score=19.08 Aligned_cols=25 Identities=28% Similarity=0.395 Sum_probs=19.4
Q ss_pred eeecCCHHHHHHHHHHcCccEEEec
Q psy627 39 ALNFSSKEDAIQYCQKNGWKFFVEE 63 (89)
Q Consensus 39 ~l~F~s~e~Aiayae~~G~~Y~V~~ 63 (89)
.+.-++.+|=..-|++.|++++|+.
T Consensus 25 aV~~P~~~EI~~a~~~Lgl~~~~E~ 49 (95)
T PF01922_consen 25 AVENPTLEEIADACKKLGLPCVVEP 49 (95)
T ss_dssp SBSS--HHHHHHHHHHTTSEEEEET
T ss_pred cCCCCCHHHHHHHHHHcCCCEEEcc
Confidence 3456899999999999999999763
No 230
>PF02407 Viral_Rep: Putative viral replication protein; InterPro: IPR003365 Proteins in this entry are essential for the replication of viral ssDNA. The closed circular ssDNA genome is first converted to a superhelical dsDNA. Rep and/or Rep' binds a specific hairpin at the genome origin of replication introducing an endonucleolytic nick within the conserved sequence 5'-AGTATTAC-3'. This initiates rolling circle replication (RCR). Following cleavage, the protein binds covalently to the 5'-phosphate of DNA as a tyrosyl ester. The cleavage gives rise to a free 3'-OH that serves as a primer for the cellular DNA polymerase. The polymerase synthesizes the (+) strand DNA by rolling circle mechanism. After one round of replication, a Rep-catalyzed nucleotidyl transfer reaction releases a circular single-stranded virus genome, thereby terminating the replication.; GO: 0016779 nucleotidyltransferase activity, 0016888 endodeoxyribonuclease activity, producing 5'-phosphomonoesters, 0042624 ATPase activity, uncoupled, 0006260 DNA replication, 0018142 protein-DNA covalent cross-linking; PDB: 2HW0_A 2HWT_A.
Probab=24.02 E-value=8.8 Score=24.85 Aligned_cols=12 Identities=50% Similarity=0.888 Sum_probs=10.3
Q ss_pred CCHHHHHHHHHH
Q psy627 43 SSKEDAIQYCQK 54 (89)
Q Consensus 43 ~s~e~Aiayae~ 54 (89)
.|.++|++||.|
T Consensus 74 Gs~~qa~~YC~K 85 (86)
T PF02407_consen 74 GSDEQAREYCSK 85 (86)
T ss_dssp TTCCHHHCCHCT
T ss_pred CCHHHHHHHccC
Confidence 579999999976
No 231
>PF02244 Propep_M14: Carboxypeptidase activation peptide; InterPro: IPR003146 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties. The peptidases are synthesised as inactive molecules, zymogens, with propeptides that must be removed by proteolytic cleavage to activate the enzyme. Structural studies of carboxypeptidases A and B reveal the propeptide to exist as a globular domain, followed by an extended alpha-helix; this shields the catalytic site, without specifically binding to it, while the substrate-binding site is blocked by making specific contacts [, ]. Members of this propeptide family are found in the metallocarboxypeptidases: A1, A2 [], A3, A4, A5, A6, U, insect gut carboxypeptidase and B [], and and are associated with peptidases belonging to MEROPS peptidase family M14A. Carboxypeptidases are found in abundance in pancreatic secretions. The pro-segment moiety (activation peptide) accounts for up to a quarter of the total length of the peptidase.; GO: 0004180 carboxypeptidase activity, 0006508 proteolysis; PDB: 3D68_C 3D67_B 3D66_A 1PBA_A 3GLJ_A 1NSA_A 1KWM_A 2BOA_B 1PYT_A 1JQG_A ....
Probab=24.01 E-value=1e+02 Score=18.15 Aligned_cols=29 Identities=10% Similarity=0.090 Sum_probs=23.2
Q ss_pred CceeeecCCHHHHHHHHHHcCccEEEecc
Q psy627 36 SNLALNFSSKEDAIQYCQKNGWKFFVEEP 64 (89)
Q Consensus 36 ~qv~l~F~s~e~Aiayae~~G~~Y~V~~p 64 (89)
..|.+.=...+....+.+++|+.|+|.-.
T Consensus 36 ~dv~V~p~~~~~f~~~L~~~~i~~~v~i~ 64 (74)
T PF02244_consen 36 VDVMVPPEKLEEFEELLKEHGIEYEVLIE 64 (74)
T ss_dssp EEEEEEGGGHHHHHHHHHHTT-EEEEEES
T ss_pred EEEEECHHHHHHHHHHHHHCCCcEEEEeH
Confidence 45677777888999999999999999653
No 232
>PF14277 DUF4364: Domain of unknown function (DUF4364)
Probab=23.89 E-value=76 Score=22.68 Aligned_cols=23 Identities=35% Similarity=0.521 Sum_probs=19.0
Q ss_pred CCCCCceeeecCCHHHHHHHHHH
Q psy627 32 GDPLSNLALNFSSKEDAIQYCQK 54 (89)
Q Consensus 32 ~D~~~qv~l~F~s~e~Aiayae~ 54 (89)
+.++=.+.|.-||+|+|.+-|++
T Consensus 124 ~~~Li~l~l~vp~~~qA~~ic~n 146 (163)
T PF14277_consen 124 DSDLIDLKLNVPSEEQAEAICNN 146 (163)
T ss_pred CceEEEEEEeCCCHHHHHHHHHH
Confidence 34455689999999999999985
No 233
>cd08914 START_STARD15-like Lipid-binding START domain of mammalian STARD15 and related proteins. This subgroup includes the steroidogenic acute regulatory protein (StAR)-related lipid transfer (START) domains of STARD15/ACOT12 (also known as cytoplasmic acetyl-CoA hydrolase/CACH, THEAL, and MGC105114) and related domains. It belongs to the START domain family, and in turn to the SRPBCC (START/RHO_alpha_C/PITP/Bet_v1/CoxG/CalC) domain superfamily of proteins that bind hydrophobic ligands. SRPBCC domains have a deep hydrophobic ligand-binding pocket. STARD15/ACOT12 is a type II acetyl-CoA thioesterase; it catalyzes the hydrolysis of acyl-CoAs to free fatty acid and CoASH. Rat CACH hydrolyzes acetyl-CoA to acetate and CoA. In addition to having a START domain, most proteins in this subgroup have two tandem copies of the hotdog domain. Human STARD15/ACOT12 may have roles in cholesterol metabolism and in beta-oxidation.
Probab=23.88 E-value=78 Score=24.15 Aligned_cols=33 Identities=24% Similarity=0.269 Sum_probs=29.9
Q ss_pred CCCCceeeecCCHHHHHHHHHHcCccEEEeccC
Q psy627 33 DPLSNLALNFSSKEDAIQYCQKNGWKFFVEEPK 65 (89)
Q Consensus 33 D~~~qv~l~F~s~e~Aiayae~~G~~Y~V~~p~ 65 (89)
||-.|+-|+..+-++-...+..+||........
T Consensus 34 ~~~~~~~~~~~~~~~~~~~a~~~~W~l~~dkdg 66 (236)
T cd08914 34 DIGNQASLSDSNVEALKKLAAKSGWEVTSTVEK 66 (236)
T ss_pred CCCceEEEeeCCHHHhhhhcccCCCEEEEccCC
Confidence 788899999999999999999999999887654
No 234
>TIGR01617 arsC_related transcriptional regulator, Spx/MgsR family. This model represents a portion of the proteins within the larger set covered by Pfam model pfam03960. That larger family includes a glutaredoxin-dependent arsenate reductase (TIGR00014). Characterized members of this family include Spx and MgsR from Bacillus subtili. Spx is a global regulator for response to thiol-specific oxidative stress. It interacts with RNA polymerase. MgsR (modulator of the general stress response, also called YqgZ) provides a second level of regulation for more than a third of the proteins in the B. subtilis general stress regulon controlled by Sigma-B.
Probab=23.85 E-value=71 Score=20.82 Aligned_cols=22 Identities=14% Similarity=0.261 Sum_probs=18.8
Q ss_pred CCHHHHHHHHHHcCccEEEecc
Q psy627 43 SSKEDAIQYCQKNGWKFFVEEP 64 (89)
Q Consensus 43 ~s~e~Aiayae~~G~~Y~V~~p 64 (89)
++...|+++.+++|++|++..-
T Consensus 10 ~~c~ka~~~L~~~~i~~~~idi 31 (117)
T TIGR01617 10 TTCKKARRWLEANGIEYQFIDI 31 (117)
T ss_pred HHHHHHHHHHHHcCCceEEEec
Confidence 5667899999999999998764
No 235
>PRK13789 phosphoribosylamine--glycine ligase; Provisional
Probab=23.84 E-value=89 Score=25.16 Aligned_cols=25 Identities=8% Similarity=0.251 Sum_probs=21.3
Q ss_pred eecCCHHHHHHHHHHcCccEEEecc
Q psy627 40 LNFSSKEDAIQYCQKNGWKFFVEEP 64 (89)
Q Consensus 40 l~F~s~e~Aiayae~~G~~Y~V~~p 64 (89)
..|.+.++|++|+++.|+++.|-..
T Consensus 127 ~~~~~~~ea~~~~~~~~~PvVVKp~ 151 (426)
T PRK13789 127 KTFTEYSSSLSYLESEMLPIVIKAD 151 (426)
T ss_pred EeeCCHHHHHHHHHhcCCCEEEEeC
Confidence 5788999999999999999887554
No 236
>COG0151 PurD Phosphoribosylamine-glycine ligase [Nucleotide transport and metabolism]
Probab=23.81 E-value=83 Score=26.32 Aligned_cols=24 Identities=25% Similarity=0.383 Sum_probs=20.3
Q ss_pred eecCCHHHHHHHHHHcCccEEEec
Q psy627 40 LNFSSKEDAIQYCQKNGWKFFVEE 63 (89)
Q Consensus 40 l~F~s~e~Aiayae~~G~~Y~V~~ 63 (89)
-.|.+.|+|.+|.++.|-++.|-.
T Consensus 122 ~~f~~~e~a~ayi~~~g~piVVKa 145 (428)
T COG0151 122 EVFTDPEEAKAYIDEKGAPIVVKA 145 (428)
T ss_pred cccCCHHHHHHHHHHcCCCEEEec
Confidence 468899999999999999976644
No 237
>PRK09318 bifunctional 3,4-dihydroxy-2-butanone 4-phosphate synthase/GTP cyclohydrolase II protein; Provisional
Probab=23.79 E-value=71 Score=26.11 Aligned_cols=18 Identities=6% Similarity=0.113 Sum_probs=15.7
Q ss_pred CCHHHHHHHHHHcCccEE
Q psy627 43 SSKEDAIQYCQKNGWKFF 60 (89)
Q Consensus 43 ~s~e~Aiayae~~G~~Y~ 60 (89)
..++++++||++||++..
T Consensus 159 ~~~~~~~~fA~~h~l~~i 176 (387)
T PRK09318 159 HDLDYVLKLAEKFSLPVL 176 (387)
T ss_pred ccHHHHHHHHHHCCCcEE
Confidence 368999999999999984
No 238
>PRK00028 infC translation initiation factor IF-3; Reviewed
Probab=23.78 E-value=1.1e+02 Score=22.26 Aligned_cols=34 Identities=21% Similarity=0.333 Sum_probs=25.4
Q ss_pred CCHHHHHHHHHHcCccEEEeccCCC--CCCcccccc
Q psy627 43 SSKEDAIQYCQKNGWKFFVEEPKWK--TPKVKSYAF 76 (89)
Q Consensus 43 ~s~e~Aiayae~~G~~Y~V~~p~~~--~~~~ksYad 76 (89)
-+.++|++-|++.|++-....|... .-+.-.|+.
T Consensus 35 ~~~~eAl~~A~~~~lDLV~v~~~~~PPVckI~dy~k 70 (177)
T PRK00028 35 VSTREALELAEEAGLDLVEISPNAKPPVCKIMDYGK 70 (177)
T ss_pred eeHHHHHHHHHHcCCCEEEECCCCCCCEEEEEeHHH
Confidence 4688999999999999988877543 334556654
No 239
>PRK02628 nadE NAD synthetase; Reviewed
Probab=23.72 E-value=62 Score=27.79 Aligned_cols=20 Identities=15% Similarity=0.119 Sum_probs=18.3
Q ss_pred CHHHHHHHHHHcCccEEEec
Q psy627 44 SKEDAIQYCQKNGWKFFVEE 63 (89)
Q Consensus 44 s~e~Aiayae~~G~~Y~V~~ 63 (89)
|.++|.++|+..|++|.+..
T Consensus 409 s~~~a~~la~~LGi~~~~i~ 428 (679)
T PRK02628 409 TKNNAVALMKALGVTAREID 428 (679)
T ss_pred HHHHHHHHHHHhCCeEEEEE
Confidence 67899999999999999875
No 240
>PF11247 DUF2675: Protein of unknown function (DUF2675) ; InterPro: IPR022611 Members in this family of proteins include Bacteriophage T7 gene 5.5; they have no known function.
Probab=23.72 E-value=41 Score=22.75 Aligned_cols=21 Identities=24% Similarity=0.403 Sum_probs=16.5
Q ss_pred CCHHHHHHHHHHcCccEEEec
Q psy627 43 SSKEDAIQYCQKNGWKFFVEE 63 (89)
Q Consensus 43 ~s~e~Aiayae~~G~~Y~V~~ 63 (89)
.-.|.||+||-|+|+-=.|.+
T Consensus 56 ~G~egav~f~~k~g~R~~IKe 76 (98)
T PF11247_consen 56 HGPEGAVAFVVKQGIREAIKE 76 (98)
T ss_pred cChHHHHHHHHHHHHHHHHHH
Confidence 347999999999998655544
No 241
>PF13539 Peptidase_M15_4: D-alanyl-D-alanine carboxypeptidase
Probab=23.60 E-value=1e+02 Score=18.48 Aligned_cols=38 Identities=21% Similarity=0.514 Sum_probs=24.8
Q ss_pred EEecCCCCccCCCCCCCCCCCCCCceeeecCCHHHHHHHHHHcCcc
Q psy627 13 IQFDTRERWENPLMGWCSTGDPLSNLALNFSSKEDAIQYCQKNGWK 58 (89)
Q Consensus 13 lefe~~~~~~dPLMGWtss~D~~~qv~l~F~s~e~Aiayae~~G~~ 58 (89)
|++.|. .||+..|.+.-+. -....-+..++.+++.||.
T Consensus 15 iDin~~---~np~~~~~~~~~~-----~~~~~~~~~~~~~~~~G~~ 52 (67)
T PF13539_consen 15 IDINPD---ENPYIQWNGDVDG-----YKIADYKEVVAIFEKLGFR 52 (67)
T ss_pred EEEccc---cCCeeeeCCccch-----hhhhhHHHHHHHHHhCCCE
Confidence 455433 6787777776542 2234556788999999984
No 242
>PRK06186 hypothetical protein; Validated
Probab=23.57 E-value=58 Score=24.70 Aligned_cols=18 Identities=28% Similarity=0.573 Sum_probs=15.2
Q ss_pred CCHHHHHHHHHHcCccEE
Q psy627 43 SSKEDAIQYCQKNGWKFF 60 (89)
Q Consensus 43 ~s~e~Aiayae~~G~~Y~ 60 (89)
+-+=.||.||..++++|-
T Consensus 69 ~Gki~ai~~Are~~iP~L 86 (229)
T PRK06186 69 DGALTAIRFARENGIPFL 86 (229)
T ss_pred hHHHHHHHHHHHcCCCeE
Confidence 456789999999999983
No 243
>KOG0781|consensus
Probab=23.51 E-value=81 Score=27.34 Aligned_cols=23 Identities=17% Similarity=0.354 Sum_probs=18.9
Q ss_pred HHHHHHHHHcCccEEEeccCCCC
Q psy627 46 EDAIQYCQKNGWKFFVEEPKWKT 68 (89)
Q Consensus 46 e~Aiayae~~G~~Y~V~~p~~~~ 68 (89)
-+||.||+++|.+..+..-..|.
T Consensus 456 k~AI~~a~~~gfDVvLiDTAGR~ 478 (587)
T KOG0781|consen 456 KEAIQEARNQGFDVVLIDTAGRM 478 (587)
T ss_pred HHHHHHHHhcCCCEEEEeccccc
Confidence 57999999999999888765543
No 244
>cd03418 GRX_GRXb_1_3_like Glutaredoxin (GRX) family, GRX bacterial class 1 and 3 (b_1_3)-like subfamily; composed of bacterial GRXs, approximately 10 kDa in size, and proteins containing a GRX or GRX-like domain. GRX is a glutathione (GSH) dependent reductase, catalyzing the disulfide reduction of target proteins such as ribonucleotide reductase. It contains a redox active CXXC motif in a TRX fold and uses a similar dithiol mechanism employed by TRXs for intramolecular disulfide bond reduction of protein substrates. Unlike TRX, GRX has preference for mixed GSH disulfide substrates, in which it uses a monothiol mechanism where only the N-terminal cysteine is required. The flow of reducing equivalents in the GRX system goes from NADPH - GSH reductase - GSH - GRX - protein substrates. By altering the redox state of target proteins, GRX is involved in many cellular functions including DNA synthesis, signal transduction and the defense against oxidative stress. Different classes are known i
Probab=23.41 E-value=94 Score=17.80 Aligned_cols=22 Identities=14% Similarity=-0.013 Sum_probs=18.0
Q ss_pred CCHHHHHHHHHHcCccEEEecc
Q psy627 43 SSKEDAIQYCQKNGWKFFVEEP 64 (89)
Q Consensus 43 ~s~e~Aiayae~~G~~Y~V~~p 64 (89)
+....|.++.+++|++|+...-
T Consensus 11 p~C~~ak~~L~~~~i~~~~i~i 32 (75)
T cd03418 11 PYCVRAKALLDKKGVDYEEIDV 32 (75)
T ss_pred hHHHHHHHHHHHCCCcEEEEEC
Confidence 4567899999999999988653
No 245
>PLN03199 delta6-acyl-lipid desaturase-like protein; Provisional
Probab=23.27 E-value=38 Score=27.91 Aligned_cols=40 Identities=15% Similarity=0.302 Sum_probs=23.1
Q ss_pred CCCCCCCCCCCCCceeeecCC---------HHHHHHHHHHcCccEEEec
Q psy627 24 PLMGWCSTGDPLSNLALNFSS---------KEDAIQYCQKNGWKFFVEE 63 (89)
Q Consensus 24 PLMGWtss~D~~~qv~l~F~s---------~e~Aiayae~~G~~Y~V~~ 63 (89)
+++.|-+.+=-+|=..--||+ ..--.++|+++|++|....
T Consensus 407 ~~~~wf~GGLN~QIEHHLFP~mp~~~y~~i~piVk~~C~k~glpY~~~~ 455 (485)
T PLN03199 407 AFVDWFCGGLQYQVDHHLFPMLPRHNIAKCHALVESFCKEWGVKYHEAD 455 (485)
T ss_pred hHHHHHhccchhhhhhhcCCCCchhhHHhhhHHHHHHHHHhCCCccccC
Confidence 466665544222222344555 2233589999999998643
No 246
>PRK13307 bifunctional formaldehyde-activating enzyme/3-hexulose-6-phosphate synthase; Provisional
Probab=23.27 E-value=77 Score=25.84 Aligned_cols=37 Identities=24% Similarity=0.358 Sum_probs=30.2
Q ss_pred CccCCCCCCCCC---CCCCCceeeecCCHHHHHHHHHHcC
Q psy627 20 RWENPLMGWCST---GDPLSNLALNFSSKEDAIQYCQKNG 56 (89)
Q Consensus 20 ~~~dPLMGWtss---~D~~~qv~l~F~s~e~Aiayae~~G 56 (89)
+-.+|+||---. .=|+-||-|-+.|.|+|++.+++-+
T Consensus 156 ~~~h~~~~~~~~~~~~~p~L~vALD~~~~~~A~~i~~~l~ 195 (391)
T PRK13307 156 RALHPIMGFKVTRLWDPPYLQVALDLPDLEEVERVLSQLP 195 (391)
T ss_pred cccCCccccchhhhcccceEEEecCCCCHHHHHHHHHhcc
Confidence 445899997655 2257899999999999999999875
No 247
>PRK14019 bifunctional 3,4-dihydroxy-2-butanone 4-phosphate synthase/GTP cyclohydrolase II-like protein; Provisional
Probab=23.15 E-value=67 Score=25.94 Aligned_cols=17 Identities=18% Similarity=0.401 Sum_probs=15.4
Q ss_pred CCHHHHHHHHHHcCccE
Q psy627 43 SSKEDAIQYCQKNGWKF 59 (89)
Q Consensus 43 ~s~e~Aiayae~~G~~Y 59 (89)
.+.++|++||++||+++
T Consensus 173 ~~~~~~~~fA~~h~L~~ 189 (367)
T PRK14019 173 ARLPDLEEFAKEHGLKI 189 (367)
T ss_pred cchHHHHHHHHHcCCcE
Confidence 46899999999999998
No 248
>KOG2486|consensus
Probab=23.11 E-value=68 Score=25.89 Aligned_cols=23 Identities=13% Similarity=0.091 Sum_probs=19.0
Q ss_pred CCHHHHHHHHHHcCccEEEeccC
Q psy627 43 SSKEDAIQYCQKNGWKFFVEEPK 65 (89)
Q Consensus 43 ~s~e~Aiayae~~G~~Y~V~~p~ 65 (89)
++--.+|+||.+++.+|+++--+
T Consensus 235 ~~D~~~i~~~ge~~VP~t~vfTK 257 (320)
T KOG2486|consen 235 PTDNPEIAWLGENNVPMTSVFTK 257 (320)
T ss_pred CCChHHHHHHhhcCCCeEEeeeh
Confidence 34557999999999999998754
No 249
>cd03033 ArsC_15kD Arsenate Reductase (ArsC) family, 15kD protein subfamily; composed of proteins of unknown function with similarity to thioredoxin-fold arsenic reductases, ArsC. It is encoded by an ORF present in a gene cluster associated with nitrogen fixation that also encodes dinitrogenase reductase ADP-ribosyltransferase (DRAT) and dinitrogenase reductase activating glycohydrolase (DRAG). ArsC catalyzes the reduction of arsenate [As(V)] to arsenite [As(III)], using reducing equivalents derived from glutathione via glutaredoxin, through a single catalytic cysteine.
Probab=22.97 E-value=76 Score=21.07 Aligned_cols=20 Identities=10% Similarity=0.137 Sum_probs=17.4
Q ss_pred CHHHHHHHHHHcCccEEEec
Q psy627 44 SKEDAIQYCQKNGWKFFVEE 63 (89)
Q Consensus 44 s~e~Aiayae~~G~~Y~V~~ 63 (89)
+.-.|.++-+.+|++|++..
T Consensus 12 ~crkA~~~L~~~gi~~~~~d 31 (113)
T cd03033 12 NNARQKALLEAAGHEVEVRD 31 (113)
T ss_pred HHHHHHHHHHHcCCCcEEee
Confidence 45689999999999999876
No 250
>cd05565 PTS_IIB_lactose PTS_IIB_lactose: subunit IIB of enzyme II (EII) of the lactose-specific phosphoenolpyruvate:carbohydrate phosphotransferase system (PTS) found in Firmicutes as well as Actinobacteria. In this system, EII is a lactose-specific permease with two cytoplasmic domains (IIA and IIB) and a transmembrane channel IIC domain. The IIC and IIB domains are expressed as a single protein from the lac operon. The IIB domain fold includes a central four-stranded parallel open twisted beta-sheet flanked by alpha-helices on both sides. The seven major PTS systems with this IIB fold include lactose, chitobiose/lichenan, ascorbate, galactitol, mannitol, fructose, and a sensory system with similarity to the bacterial bgl system.
Probab=22.90 E-value=72 Score=21.02 Aligned_cols=17 Identities=6% Similarity=-0.031 Sum_probs=14.0
Q ss_pred HHHHHHHHcCccEEEec
Q psy627 47 DAIQYCQKNGWKFFVEE 63 (89)
Q Consensus 47 ~Aiayae~~G~~Y~V~~ 63 (89)
.--++|+++|++++|..
T Consensus 19 km~~~a~~~gi~~~i~a 35 (99)
T cd05565 19 ALNKGAKERGVPLEAAA 35 (99)
T ss_pred HHHHHHHHCCCcEEEEE
Confidence 34579999999999875
No 251
>PF09633 DUF2023: Protein of unknown function (DUF2023); InterPro: IPR018594 This protein of approx.120 residues consists of three beta strands and five alpha helices, thought to fold into a homo-dimer. ; PDB: 2GUK_B.
Probab=22.80 E-value=99 Score=21.03 Aligned_cols=22 Identities=23% Similarity=0.383 Sum_probs=15.5
Q ss_pred CHHHHHHHHHHcCccEEEeccC
Q psy627 44 SKEDAIQYCQKNGWKFFVEEPK 65 (89)
Q Consensus 44 s~e~Aiayae~~G~~Y~V~~p~ 65 (89)
..+.|++=-+++||+|.|++-.
T Consensus 26 ~~~~~~~rL~~~~I~y~iq~v~ 47 (101)
T PF09633_consen 26 YEEFAIARLERQGIDYFIQPVG 47 (101)
T ss_dssp GHHHHHHHHHHTT--EEEEE-T
T ss_pred hHHHHHHHHHHCCCCEEEEEcC
Confidence 4567888889999999998753
No 252
>PRK00578 prfB peptide chain release factor 2; Validated
Probab=22.80 E-value=65 Score=26.14 Aligned_cols=16 Identities=19% Similarity=0.507 Sum_probs=13.9
Q ss_pred HHHHHHcCccEEEecc
Q psy627 49 IQYCQKNGWKFFVEEP 64 (89)
Q Consensus 49 iayae~~G~~Y~V~~p 64 (89)
..||+++||.++|.+-
T Consensus 151 ~~~a~~~g~~~evi~~ 166 (367)
T PRK00578 151 LRWAERHGFKVEVLDY 166 (367)
T ss_pred HHHHHHcCCEEEEEec
Confidence 4699999999999874
No 253
>PF00071 Ras: Ras family; InterPro: IPR001806 Small GTPases form an independent superfamily within the larger class of regulatory GTP hydrolases. This superfamily contains proteins that control a vast number of important processes and possess a common, structurally preserved GTP-binding domain [, ]. Sequence comparisons of small G proteins from various species have revealed that they are conserved in primary structures at the level of 30-55% similarity []. Crystallographic analysis of various small G proteins revealed the presence of a 20 kDa catalytic domain that is unique for the whole superfamily [, ]. The domain is built of five alpha helices (A1-A5), six beta-strands (B1-B6) and five polypeptide loops (G1-G5). A structural comparison of the GTP- and GDP-bound form, allows one to distinguish two functional loop regions: switch I and switch II that surround the gamma-phosphate group of the nucleotide. The G1 loop (also called the P-loop) that connects the B1 strand and the A1 helix is responsible for the binding of the phosphate groups. The G3 loop provides residues for Mg(2+) and phosphate binding and is located at the N terminus of the A2 helix. The G1 and G3 loops are sequentially similar to Walker A and Walker B boxes that are found in other nucleotide binding motifs. The G2 loop connects the A1 helix and the B2 strand and contains a conserved Thr residue responsible for Mg(2+) binding. The guanine base is recognised by the G4 and G5 loops. The consensus sequence NKXD of the G4 loop contains Lys and Asp residues directly interacting with the nucleotide. Part of the G5 loop located between B6 and A5 acts as a recognition site for the guanine base []. The small GTPase superfamily can be divided into at least 8 different families, including: Arf small GTPases. GTP-binding proteins involved in protein trafficking by modulating vesicle budding and uncoating within the Golgi apparatus. Ran small GTPases. GTP-binding proteins involved in nucleocytoplasmic transport. Required for the import of proteins into the nucleus and also for RNA export. Rab small GTPases. GTP-binding proteins involved in vesicular traffic. Rho small GTPases. GTP-binding proteins that control cytoskeleton reorganisation. Ras small GTPases. GTP-binding proteins involved in signalling pathways. Sar1 small GTPases. Small GTPase component of the coat protein complex II (COPII) which promotes the formation of transport vesicles from the endoplasmic reticulum (ER). Mitochondrial Rho (Miro). Small GTPase domain found in mitochondrial proteins involved in mitochondrial trafficking. Roc small GTPases domain. Small GTPase domain always found associated with the COR domain. ; GO: 0005525 GTP binding, 0007264 small GTPase mediated signal transduction; PDB: 1M7B_A 2V55_B 3EG5_C 3LAW_A 1YHN_A 1T91_B 1HE8_B 3SEA_B 3T5G_A 1XTS_A ....
Probab=22.76 E-value=95 Score=20.01 Aligned_cols=20 Identities=25% Similarity=0.438 Sum_probs=15.6
Q ss_pred CHHHHHHHHHHcCccEEEec
Q psy627 44 SKEDAIQYCQKNGWKFFVEE 63 (89)
Q Consensus 44 s~e~Aiayae~~G~~Y~V~~ 63 (89)
+.++|.++|++++..|.-..
T Consensus 123 ~~~~~~~~~~~~~~~~~e~S 142 (162)
T PF00071_consen 123 SVEEAQEFAKELGVPYFEVS 142 (162)
T ss_dssp CHHHHHHHHHHTTSEEEEEB
T ss_pred hhhHHHHHHHHhCCEEEEEE
Confidence 46799999999998774433
No 254
>PF06356 DUF1064: Protein of unknown function (DUF1064); InterPro: IPR009414 This entry is represented by Bacteriophage 92, Orf34. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This family consists of several phage and bacterial proteins whose functions have not been experimentally determined. Computational analysis involving sequence, predicted strucutre and genomic context suggests that these proteins may be endonucleases which function in phage genome segregation, or the repair of double-stranded breaks introduced during either this process or DNA replication [].
Probab=22.73 E-value=72 Score=21.89 Aligned_cols=43 Identities=21% Similarity=0.339 Sum_probs=26.4
Q ss_pred eeecCCHHHHHHHHH-----HcC--ccEEEeccC---------CCCCCcccccccCCcC
Q psy627 39 ALNFSSKEDAIQYCQ-----KNG--WKFFVEEPK---------WKTPKVKSYAFNFSWN 81 (89)
Q Consensus 39 ~l~F~s~e~Aiayae-----~~G--~~Y~V~~p~---------~~~~~~ksYadNF~~~ 81 (89)
.+.|+|+-||-.|++ +.| ..++++.+= ....++..|-.-|.+.
T Consensus 13 Gi~FDSk~Ea~~Y~~Lk~~~~~g~I~~l~lQ~~f~L~~~~~~~Gk~~r~i~Y~aDF~~~ 71 (118)
T PF06356_consen 13 GIKFDSKKEAEYYQELKLLERAGEISDLELQPRFELQPKFKKNGKKIRAITYIADFVYY 71 (118)
T ss_pred CeEEccHHHHHHHHHHHHhHhcCceEEEEEeCCEEEecccccCCcEecceEEeccEEEE
Confidence 478999999998874 334 244444321 1233566787777663
No 255
>cd08578 GDPD_NUC-2_fungi Putative glycerophosphodiester phosphodiesterase domain of ankyrin repeat protein NUC-2 and similar proteins. This subfamily corresponds to a putative glycerophosphodiester phosphodiesterase domain (GDPD) present in Neurospora crassa ankyrin repeat protein NUC-2 and its Saccharomyces cerevisiae counterpart, Phosphate system positive regulatory protein PHO81. Some uncharecaterized NUC-2 sequence homologs are also included in this family. NUC-2 plays an important role in the phosphate-regulated signal transduction pathway in Neurospora crassa. It shows high similarity to a cyclin-dependent kinase inhibitory protein PHO81, which is part of the phosphate regulatory cascade in S. cerevisiae. Both NUC-2 and PHO81 have multi-domain architecture, including an SPX N-terminal domain following by several ankyrin repeats and a putative C-terminal GDPD domain with unknown function. Although the putative GDPD domain displays sequence homology to that of bacterial glycerophos
Probab=22.73 E-value=61 Score=25.30 Aligned_cols=24 Identities=13% Similarity=0.167 Sum_probs=19.0
Q ss_pred eecCCHHHHHHHHHHcCccEEEec
Q psy627 40 LNFSSKEDAIQYCQKNGWKFFVEE 63 (89)
Q Consensus 40 l~F~s~e~Aiayae~~G~~Y~V~~ 63 (89)
.+=.|.++||.||..+|+-=.|..
T Consensus 226 ~r~~Si~~Av~fA~~~nL~Giv~~ 249 (300)
T cd08578 226 PRSRSIKEAVRFAKNNNLLGLILP 249 (300)
T ss_pred chhhhHHHHHHHHHHcCCcEEEec
Confidence 444689999999999998766654
No 256
>TIGR02630 xylose_isom_A xylose isomerase. Members of this family are the enzyme xylose isomerase (5.3.1.5), which interconverts D-xylose and D-xylulose.
Probab=22.70 E-value=1e+02 Score=25.87 Aligned_cols=30 Identities=37% Similarity=0.468 Sum_probs=22.0
Q ss_pred HHHHHHHHHHcCccEEE-eccCCCCCCcccc
Q psy627 45 KEDAIQYCQKNGWKFFV-EEPKWKTPKVKSY 74 (89)
Q Consensus 45 ~e~Aiayae~~G~~Y~V-~~p~~~~~~~ksY 74 (89)
...|++|+.+.|++..| +||++.-++..+|
T Consensus 210 l~~~~dYa~~iGf~~~f~IEPKP~EPr~hqy 240 (434)
T TIGR02630 210 LHMAVDYAKKIGFKGQFLIEPKPKEPTKHQY 240 (434)
T ss_pred HHHHHHHhhhcCCCceEEeccCCCCcccccc
Confidence 45689999999996654 5777766666655
No 257
>PTZ00175 diphthine synthase; Provisional
Probab=22.70 E-value=1.4e+02 Score=22.92 Aligned_cols=29 Identities=24% Similarity=0.323 Sum_probs=23.3
Q ss_pred CCCCCCCceeeecCCHHHHHHHHHHcCccEEEec
Q psy627 30 STGDPLSNLALNFSSKEDAIQYCQKNGWKFFVEE 63 (89)
Q Consensus 30 ss~D~~~qv~l~F~s~e~Aiayae~~G~~Y~V~~ 63 (89)
.+|||. -|.+--+=+..|+++|++++|..
T Consensus 83 ~~GDP~-----i~~t~~~l~~~~~~~gi~vevIP 111 (270)
T PTZ00175 83 VVGDPF-----CATTHTDLYLRAKKKGIEVEVIH 111 (270)
T ss_pred ECCCCC-----ccCCHHHHHHHHHHCCCcEEEEC
Confidence 357774 67777777889999999999986
No 258
>PTZ00323 NAD+ synthase; Provisional
Probab=22.51 E-value=74 Score=24.82 Aligned_cols=21 Identities=10% Similarity=-0.040 Sum_probs=18.2
Q ss_pred CHHHHHHHHHHcCccEEEecc
Q psy627 44 SKEDAIQYCQKNGWKFFVEEP 64 (89)
Q Consensus 44 s~e~Aiayae~~G~~Y~V~~p 64 (89)
+.++|..+|+..|+++.+..-
T Consensus 93 ~~~~A~~la~~lGi~~~~idi 113 (294)
T PTZ00323 93 ALNRGRENIQACGATEVTVDQ 113 (294)
T ss_pred HHHHHHHHHHHhCCcEEEEEC
Confidence 469999999999999988763
No 259
>PF12671 Amidase_6: Putative amidase domain
Probab=22.48 E-value=62 Score=22.38 Aligned_cols=16 Identities=19% Similarity=0.283 Sum_probs=13.4
Q ss_pred CHHHHHHHHHHcCccE
Q psy627 44 SKEDAIQYCQKNGWKF 59 (89)
Q Consensus 44 s~e~Aiayae~~G~~Y 59 (89)
+++.||+||++.-..+
T Consensus 3 nr~~Av~YA~k~~~~~ 18 (157)
T PF12671_consen 3 NRSAAVAYANKYANNP 18 (157)
T ss_pred CHHHHHHHHHHHhcCC
Confidence 5789999999987765
No 260
>PRK15412 thiol:disulfide interchange protein DsbE; Provisional
Probab=22.39 E-value=98 Score=21.61 Aligned_cols=49 Identities=12% Similarity=0.156 Sum_probs=30.5
Q ss_pred eeeec-CCHHHHHHHHHHcCccEEE-e-ccCCC---CCCcccccccCCcCCCcce
Q psy627 38 LALNF-SSKEDAIQYCQKNGWKFFV-E-EPKWK---TPKVKSYAFNFSWNKRTRT 86 (89)
Q Consensus 38 v~l~F-~s~e~Aiayae~~G~~Y~V-~-~p~~~---~~~~ksYadNF~~~r~~~~ 86 (89)
+.+.+ ++.+++.+|.+++|++|.+ . .+... ......+-..|..++..++
T Consensus 101 i~v~~~~~~~~~~~~~~~~~~~~~~~~~D~~~~~~~~~gv~~~P~t~vid~~G~i 155 (185)
T PRK15412 101 VGMNYKDDRQKAISWLKELGNPYALSLFDGDGMLGLDLGVYGAPETFLIDGNGII 155 (185)
T ss_pred EEEECCCCHHHHHHHHHHcCCCCceEEEcCCccHHHhcCCCcCCeEEEECCCceE
Confidence 35555 5678899999999999974 2 23221 2234445566777665544
No 261
>PF12760 Zn_Tnp_IS1595: Transposase zinc-ribbon domain; InterPro: IPR024442 This zinc binding domain is found in a range of transposase proteins such as ISSPO8, ISSOD11, ISRSSP2 etc. It may be a zinc-binding beta ribbon domain that could bind DNA.
Probab=22.38 E-value=90 Score=17.39 Aligned_cols=23 Identities=17% Similarity=0.261 Sum_probs=18.0
Q ss_pred CCHHHHHHHHHHcCccEEEeccC
Q psy627 43 SSKEDAIQYCQKNGWKFFVEEPK 65 (89)
Q Consensus 43 ~s~e~Aiayae~~G~~Y~V~~p~ 65 (89)
+|.|+|++|-+..-|+=-+.=|+
T Consensus 1 p~e~~c~~~l~~~RW~~g~~CP~ 23 (46)
T PF12760_consen 1 PDEEACREYLEEIRWPDGFVCPH 23 (46)
T ss_pred CCHHHHHHHHHHhcCCCCCCCCC
Confidence 57899999999999986554454
No 262
>PRK09958 DNA-binding transcriptional activator EvgA; Provisional
Probab=22.37 E-value=70 Score=21.18 Aligned_cols=20 Identities=25% Similarity=0.260 Sum_probs=16.9
Q ss_pred eeecCCHHHHHHHHHHcCcc
Q psy627 39 ALNFSSKEDAIQYCQKNGWK 58 (89)
Q Consensus 39 ~l~F~s~e~Aiayae~~G~~ 58 (89)
+|...++.+++.+|.+.|+.
T Consensus 185 Kl~~~~~~~l~~~~~~~~~~ 204 (204)
T PRK09958 185 KLECKSLMDLYTFAQRNKIG 204 (204)
T ss_pred HcCCCCHHHHHHHHHHhCCC
Confidence 57789999999999998863
No 263
>PF04343 DUF488: Protein of unknown function, DUF488; InterPro: IPR007438 This family includes several proteins of uncharacterised function.
Probab=22.29 E-value=1.1e+02 Score=20.06 Aligned_cols=23 Identities=17% Similarity=0.193 Sum_probs=18.9
Q ss_pred ecCCHHHHHHHHHHcCccEEEec
Q psy627 41 NFSSKEDAIQYCQKNGWKFFVEE 63 (89)
Q Consensus 41 ~F~s~e~Aiayae~~G~~Y~V~~ 63 (89)
.+-++++.-.+++.+||.|.=..
T Consensus 29 ~~~~k~~l~~~l~~~gi~Y~~~~ 51 (122)
T PF04343_consen 29 PGFNKEDLASFLEEAGIEYVWLP 51 (122)
T ss_pred CCCCHHHHHHHHHHCCceEeech
Confidence 35689999999999999996544
No 264
>PF02540 NAD_synthase: NAD synthase; InterPro: IPR022310 NAD+ synthase (6.3.5.1 from EC) catalyzes the last step in the biosynthesis of nicotinamide adenine dinucleotide and is induced by stress factors such as heat shock and glucose limitation. The three-dimensional structure of NH3-dependent NAD+ synthetase from Bacillus subtilis, in its free form and in complex with ATP shows that the enzyme consists of a tight homodimer with alpha/beta subunit topology []. Catalyzes the synthesis of GMP from XMP. The protein is a homodimer, but in the archaea it is a heterodimer composed of a glutamine amidotransferase subunit (A) and a GMP-binding subunit (B). This entry contains the GMP-binding subunit (B). ; PDB: 2VXO_A 3UOW_B 3N05_A 2DPL_B 3A4I_A 3SEQ_D 3SZG_A 3SYT_A 3SDB_A 3SEZ_C ....
Probab=22.29 E-value=67 Score=24.01 Aligned_cols=20 Identities=30% Similarity=0.444 Sum_probs=17.6
Q ss_pred CHHHHHHHHHHcCccEEEec
Q psy627 44 SKEDAIQYCQKNGWKFFVEE 63 (89)
Q Consensus 44 s~e~Aiayae~~G~~Y~V~~ 63 (89)
+.++|...|++-|++|.+..
T Consensus 61 ~~~~A~~la~~lgi~~~~i~ 80 (242)
T PF02540_consen 61 DIEDAKELAEKLGIEYIVID 80 (242)
T ss_dssp HHHHHHHHHHHHTSEEEEEE
T ss_pred HHHHHHHHHHHhCCCeeccc
Confidence 46799999999999998875
No 265
>PRK12420 histidyl-tRNA synthetase; Provisional
Probab=22.29 E-value=99 Score=24.60 Aligned_cols=22 Identities=14% Similarity=0.252 Sum_probs=18.6
Q ss_pred CCHHHHHHHHHHcCccEEEecc
Q psy627 43 SSKEDAIQYCQKNGWKFFVEEP 64 (89)
Q Consensus 43 ~s~e~Aiayae~~G~~Y~V~~p 64 (89)
.+.+.+++||.+.|++|.|.--
T Consensus 374 ~~l~~~i~~A~~~g~~~iviig 395 (423)
T PRK12420 374 RKLKKALNYANKENIPYVLIIG 395 (423)
T ss_pred cCHHHHHHHHHHcCCCEEEEEC
Confidence 3678999999999999988653
No 266
>PRK12677 xylose isomerase; Provisional
Probab=22.28 E-value=1.2e+02 Score=24.22 Aligned_cols=32 Identities=19% Similarity=0.245 Sum_probs=19.7
Q ss_pred CCCCCCCCCCceeeecCCHHHHHHHHHHcCccE
Q psy627 27 GWCSTGDPLSNLALNFSSKEDAIQYCQKNGWKF 59 (89)
Q Consensus 27 GWtss~D~~~qv~l~F~s~e~Aiayae~~G~~Y 59 (89)
||.+. ||+..-...=-+.+++|+.+.+.|+..
T Consensus 16 ~~~~~-~~~g~~~~~~~~~~E~v~~~a~~Gf~g 47 (384)
T PRK12677 16 GWQGR-DPFGDATRPPLDPVEAVHKLAELGAYG 47 (384)
T ss_pred cCCCC-CCCCCCCCCCCCHHHHHHHHHHhCCCE
Confidence 55555 776554333346777777777777553
No 267
>PRK00754 signal recognition particle protein Srp19; Provisional
Probab=22.17 E-value=1.3e+02 Score=19.82 Aligned_cols=25 Identities=28% Similarity=0.217 Sum_probs=21.7
Q ss_pred eeecCCHHHHHHHHHHcCccEEEec
Q psy627 39 ALNFSSKEDAIQYCQKNGWKFFVEE 63 (89)
Q Consensus 39 ~l~F~s~e~Aiayae~~G~~Y~V~~ 63 (89)
.+.-++.+|=.+-|++.|++++|+.
T Consensus 31 aV~~P~~~EI~~a~~~lgl~~~~e~ 55 (95)
T PRK00754 31 AVKEPRLEEIIEAAEKLGLNPEVEP 55 (95)
T ss_pred hccCCCHHHHHHHHHHcCCCeEEee
Confidence 4567899999999999999998864
No 268
>cd03588 CLECT_CSPGs C-type lectin-like domain (CTLD) of the type found in chondroitin sulfate proteoglycan core proteins. CLECT_CSPGs: C-type lectin-like domain (CTLD) of the type found in chondroitin sulfate proteoglycan core proteins (CSPGs) in human and chicken aggrecan, frog brevican, and zebra fish dermacan. CTLD refers to a domain homologous to the carbohydrate-recognition domains (CRDs) of the C-type lectins. In cartilage, aggrecan forms cartilage link protein stabilized aggregates with hyaluronan (HA). These aggregates contribute to the tissue's load bearing properties. Aggregates having other CSPGs substituting for aggrecan may contribute to the structural integrity of many different tissues. Xenopus brevican is expressed in the notochord and the brain during early embryogenesis. Zebra fish dermacan is expressed in dermal bones and may play a role in dermal bone development. CSPGs do contain LINK domain(s) which bind HA. These LINK domains are considered by one classif
Probab=22.16 E-value=73 Score=20.44 Aligned_cols=36 Identities=17% Similarity=0.139 Sum_probs=20.9
Q ss_pred CCCCCCCCCCceeeecCCHHHHHHHHHHcCccEEEe
Q psy627 27 GWCSTGDPLSNLALNFSSKEDAIQYCQKNGWKFFVE 62 (89)
Q Consensus 27 GWtss~D~~~qv~l~F~s~e~Aiayae~~G~~Y~V~ 62 (89)
||+.-.+--=++...-.|-.+|..+|+.+|-...++
T Consensus 4 gw~~~~~~Cy~~~~~~~sw~~A~~~C~~~gg~La~i 39 (124)
T cd03588 4 GWDKFQGHCYRHFPDRETWEDAERRCREQQGHLSSI 39 (124)
T ss_pred cceecCCEEEEEECCccCHHHHHHHHHhcCCEEecc
Confidence 665544321112222356789999999999665443
No 269
>CHL00199 infC translation initiation factor 3; Provisional
Probab=22.15 E-value=1.2e+02 Score=22.37 Aligned_cols=38 Identities=18% Similarity=0.330 Sum_probs=28.2
Q ss_pred cCCHHHHHHHHHHcCccEEEeccCC--CCCCcccccccCCc
Q psy627 42 FSSKEDAIQYCQKNGWKFFVEEPKW--KTPKVKSYAFNFSW 80 (89)
Q Consensus 42 F~s~e~Aiayae~~G~~Y~V~~p~~--~~~~~ksYadNF~~ 80 (89)
--+..+|++-|++.|++.....|.. +.-+.-.||. |.|
T Consensus 39 v~~~~eAl~~A~~~~lDLVeVs~~a~PPVCKImdygK-~ky 78 (182)
T CHL00199 39 IFTSEQAIQLAANQGLDLVLVSEKSDPPVCRIIDYGK-YKF 78 (182)
T ss_pred ceeHHHHHHHHHHcCCCEEEECCCCCCCeEEEeehHH-HHH
Confidence 3478899999999999998888764 3335566765 443
No 270
>cd07414 MPP_PP1_PPKL PP1, PPKL (PP1 and kelch-like) enzymes, and related proteins, metallophosphatase domain. PP1 (protein phosphatase type 1) is a serine/threonine phosphatase that regulates many cellular processes including: cell-cycle progression, protein synthesis, muscle contraction, carbohydrate metabolism, transcription and neuronal signaling, through its interaction with at least 180 known targeting proteins. PP1 occurs in all tissues and regulates many pathways, ranging from cell-cycle progression to carbohydrate metabolism. Also included here are the PPKL (PP1 and kelch-like) enzymes including the PPQ, PPZ1, and PPZ2 fungal phosphatases. These PPKLs have a large N-terminal kelch repeat in addition to a C-terminal phosphoesterase domain. The PPP (phosphoprotein phosphatase) family, to which PP1 belongs, is one of two known protein phosphatase families specific for serine and threonine. The PPP family also includes: PP2A, PP2B (calcineurin), PP4, PP5, PP6, PP7, Bsu1, Rdg
Probab=22.14 E-value=2.1e+02 Score=22.07 Aligned_cols=40 Identities=18% Similarity=0.546 Sum_probs=27.0
Q ss_pred CccCCC---CCCCCCCCCCCceeeecCCHHHHHHHHHHcCccEEEec
Q psy627 20 RWENPL---MGWCSTGDPLSNLALNFSSKEDAIQYCQKNGWKFFVEE 63 (89)
Q Consensus 20 ~~~dPL---MGWtss~D~~~qv~l~F~s~e~Aiayae~~G~~Y~V~~ 63 (89)
-|-||. -||..|. ......| ..+++.+|++++|+.+.|..
T Consensus 198 lWsDP~~~~~~~~~~~---Rg~g~~f-g~~~~~~Fl~~n~l~~iiR~ 240 (293)
T cd07414 198 LWSDPDKDVQGWGEND---RGVSFTF-GKDVVAKFLNKHDLDLICRA 240 (293)
T ss_pred hccCcccccCCCccCC---CCcceec-CHHHHHHHHHHcCCeEEEEC
Confidence 466664 3444331 2334556 56899999999999998865
No 271
>cd05564 PTS_IIB_chitobiose_lichenan PTS_IIB_chitobiose_lichenan: subunit IIB of enzyme II (EII) of the N,N-diacetylchitobiose-specific and lichenan-specific phosphoenolpyruvate:carbohydrate phosphotransferase system (PTS). In these systems, EII is either a lichenan- or an N,N-diacetylchitobiose-specific permease with two cytoplasmic domains (IIA and IIB) and a transmembrane channel IIC domain. In the chitobiose system, these subunits are expressed as separate proteins from chbA, chbB, and chbC of the chb operon (formerly the cel (cellulose) operon). In the lichenan system, these subunits are expressed from licA, licB, and licC of the lic operon. The lic operon of Bacillus subtilis is required for the transport and degradation of oligomeric beta-glucosides, which are produced by extracellular enzymes on substrates such as lichenan or barley glucan. The lic operon is transcribed from a gammaA-dependent promoter and is inducible by lichenan, lichenan hydrolysate, and cellobiose. The IIB d
Probab=22.14 E-value=1.1e+02 Score=19.60 Aligned_cols=20 Identities=15% Similarity=0.106 Sum_probs=16.3
Q ss_pred HHHHHHHHHHcCccEEEecc
Q psy627 45 KEDAIQYCQKNGWKFFVEEP 64 (89)
Q Consensus 45 ~e~Aiayae~~G~~Y~V~~p 64 (89)
.+...++|+++|++|+|..-
T Consensus 16 ~~ki~~~~~~~~~~~~v~~~ 35 (96)
T cd05564 16 VKKMKKAAEKRGIDAEIEAV 35 (96)
T ss_pred HHHHHHHHHHCCCceEEEEe
Confidence 34677899999999999763
No 272
>PRK03147 thiol-disulfide oxidoreductase; Provisional
Probab=21.99 E-value=1.1e+02 Score=20.23 Aligned_cols=48 Identities=19% Similarity=0.211 Sum_probs=31.2
Q ss_pred eeec-CCHHHHHHHHHHcCccEEEeccCC----CCCCcccccccCCcCCCcce
Q psy627 39 ALNF-SSKEDAIQYCQKNGWKFFVEEPKW----KTPKVKSYAFNFSWNKRTRT 86 (89)
Q Consensus 39 ~l~F-~s~e~Aiayae~~G~~Y~V~~p~~----~~~~~ksYadNF~~~r~~~~ 86 (89)
.+.. ++.++...|+++++++|.|..... +......+-.+|..++..++
T Consensus 99 ~i~~d~~~~~~~~~~~~~~~~~~~~~d~~~~~~~~~~v~~~P~~~lid~~g~i 151 (173)
T PRK03147 99 AVNVDETELAVKNFVNRYGLTFPVAIDKGRQVIDAYGVGPLPTTFLIDKDGKV 151 (173)
T ss_pred EEEcCCCHHHHHHHHHHhCCCceEEECCcchHHHHcCCCCcCeEEEECCCCcE
Confidence 4444 356788999999999998865432 23344555667777655443
No 273
>cd03034 ArsC_ArsC Arsenate Reductase (ArsC) family, ArsC subfamily; arsenic reductases similar to that encoded by arsC on the R733 plasmid of Escherichia coli. E. coli ArsC catalyzes the reduction of arsenate [As(V)] to arsenite [As(III)], the first step in the detoxification of arsenic, using reducing equivalents derived from glutathione (GSH) via glutaredoxin (GRX). ArsC contains a single catalytic cysteine, within a thioredoxin fold, that forms a covalent thiolate-As(V) intermediate, which is reduced by GRX through a mixed GSH-arsenate intermediate. This family of predominantly bacterial enzymes is unrelated to two other families of arsenate reductases which show similarity to low-molecular-weight acid phosphatases and phosphotyrosyl phosphatases.
Probab=21.89 E-value=85 Score=20.47 Aligned_cols=21 Identities=19% Similarity=0.227 Sum_probs=17.5
Q ss_pred CCHHHHHHHHHHcCccEEEec
Q psy627 43 SSKEDAIQYCQKNGWKFFVEE 63 (89)
Q Consensus 43 ~s~e~Aiayae~~G~~Y~V~~ 63 (89)
+|.-.|+++.+.+|++|++..
T Consensus 10 ~t~rkA~~~L~~~~i~~~~~d 30 (112)
T cd03034 10 SKSRNALALLEEAGIEPEIVE 30 (112)
T ss_pred HHHHHHHHHHHHCCCCeEEEe
Confidence 355679999999999998875
No 274
>PF09363 XFP_C: XFP C-terminal domain; InterPro: IPR018969 Phosphoketolases (PK) are key enzymes of the pentose phosphate pathway of heterofermentative and facultative homofermentative lactic acid bacteria and of the D-fructose 6-phosphate shunt of bifidobacteria. PK activity has been sporadically reported in other microorganisms including eukaryotic yeasts. Xylulose-5-phosphate/fructose-6-phosphate phosphoketolase is a thiamine diphosphate (ThdP)-dependent enzyme found in bacteria such as Bifidobacterium sp [, ]. This enzyme has dual-specificity with the following catalytic activities: 4.1.2.9 from EC: xylose 5-P + Pi = acetyl-P + glyeraldehyde-3-P 4.1.2.22 from EC: fructose-6-P + Pi = acetyl-P + erythrose-4-P Phosphoketolases are distantly related to transketolases, e.g. IPR005475 from INTERPRO.; GO: 0016832 aldehyde-lyase activity, 0005975 carbohydrate metabolic process; PDB: 3AI7_B 3AHC_A 3AHJ_A 3AHG_A 3AHE_A 3AHI_A 3AHD_A 3AHF_A 3AHH_A.
Probab=21.85 E-value=93 Score=23.49 Aligned_cols=18 Identities=22% Similarity=0.248 Sum_probs=13.9
Q ss_pred ceeeecCCHHHHHHHHHH
Q psy627 37 NLALNFSSKEDAIQYCQK 54 (89)
Q Consensus 37 qv~l~F~s~e~Aiayae~ 54 (89)
|-...|-|.|+|++.|++
T Consensus 2 qp~pqwls~eeA~~hc~~ 19 (203)
T PF09363_consen 2 QPRPQWLSMEEARAHCAA 19 (203)
T ss_dssp SEEEE-S-HHHHHHHHHH
T ss_pred CCCCccCCHHHHHHHHHc
Confidence 457889999999999986
No 275
>TIGR03662 Chlor_Arch_YYY Chlor_Arch_YYY domain. Members of this highly hydrophobic probable integral membrane family belong to two classes. In one, a single copy of the region modeled by This model represents essentially the full length of a strongly hydrophobic protein of about 700 to 900 residues (variable because of long inserts in some). The domain architecture of the other class consists of an additional N-terminal region, two copies of the region represented by this model, and three to four repeats of TPR, or tetratricopeptide repeat. The unusual species range includes several Archaea, several Chloroflexi, and Clostridium phytofermentans. An unusual motif YYYxG is present, and we suggest the name Chlor_Arch_YYY protein. The function is unknown.
Probab=21.77 E-value=97 Score=27.33 Aligned_cols=33 Identities=9% Similarity=0.172 Sum_probs=27.5
Q ss_pred eeeecC--CHHHHHHHHHHcCccEEEeccCCCCCC
Q psy627 38 LALNFS--SKEDAIQYCQKNGWKFFVEEPKWKTPK 70 (89)
Q Consensus 38 v~l~F~--s~e~Aiayae~~G~~Y~V~~p~~~~~~ 70 (89)
|+--+. +.+++.+..+++|++|.+..|.+|.+-
T Consensus 661 V~~IY~s~d~~~~~~Ll~kY~V~YVyVG~~ER~~Y 695 (723)
T TIGR03662 661 VETIYTSGDPTEARELLERYGVDYVYVGPLERQRY 695 (723)
T ss_pred HHHHHcCCCHHHHHHHHHHcCCeEEEECchhhhhc
Confidence 555555 899999999999999999999888543
No 276
>TIGR00032 argG argininosuccinate synthase. argG in bacteria, ARG1 in Saccharomyces cerevisiae. There is a very unusual clustering in the alignment, with a deep split between one cohort of E. coli, H. influenzae, and Streptomyces, and the other cohort of eukaryotes, archaea, and the rest of the eubacteria.
Probab=21.74 E-value=93 Score=25.28 Aligned_cols=20 Identities=20% Similarity=0.514 Sum_probs=18.0
Q ss_pred CHHHHHHHHHHcCccEEEec
Q psy627 44 SKEDAIQYCQKNGWKFFVEE 63 (89)
Q Consensus 44 s~e~Aiayae~~G~~Y~V~~ 63 (89)
++++-++||+++|+++.+..
T Consensus 147 ~r~ei~~ya~~~Gip~~~~~ 166 (394)
T TIGR00032 147 TREEEIEYAIQCGIPYPMSK 166 (394)
T ss_pred CHHHHHHHHHHcCCCeeEec
Confidence 69999999999999998754
No 277
>TIGR00268 conserved hypothetical protein TIGR00268. The N-terminal region of the model shows similarity to Argininosuccinate synthase proteins using PSI-blast and using the recognize protein identification server.
Probab=21.68 E-value=1e+02 Score=22.86 Aligned_cols=20 Identities=25% Similarity=0.135 Sum_probs=17.2
Q ss_pred HHHHHHHHHHcCccEEEecc
Q psy627 45 KEDAIQYCQKNGWKFFVEEP 64 (89)
Q Consensus 45 ~e~Aiayae~~G~~Y~V~~p 64 (89)
.++|.++|++.|++++|..-
T Consensus 54 ~~~a~~~a~~lgi~~~ii~~ 73 (252)
T TIGR00268 54 LEDAIIIAKEIGVNHEFVKI 73 (252)
T ss_pred HHHHHHHHHHcCCCEEEEEc
Confidence 47789999999999998763
No 278
>smart00740 PASTA PASTA domain.
Probab=21.64 E-value=1.4e+02 Score=16.12 Aligned_cols=20 Identities=35% Similarity=0.248 Sum_probs=17.2
Q ss_pred CCHHHHHHHHHHcCccEEEe
Q psy627 43 SSKEDAIQYCQKNGWKFFVE 62 (89)
Q Consensus 43 ~s~e~Aiayae~~G~~Y~V~ 62 (89)
.+.++|.+.++..|+...+.
T Consensus 13 ~~~~~a~~~l~~~g~~~~~~ 32 (66)
T smart00740 13 LSKEEAKKLLKALGLKVEVV 32 (66)
T ss_pred CCHHHHHHHHHHCCCEEEEE
Confidence 57889999999999988776
No 279
>TIGR03645 glyox_marine lactoylglutathione lyase family protein. Members of this protein family share homology with lactoylglutathione lyase (glyoxalase I) and are found mainly in marine members of the gammaproteobacteria, including CPS_0532 from Colwellia psychrerythraea 34H. This family excludes a well-separated, more narrowly distributed paralogous family, exemplified by CPS_3492 from C. psychrerythraea. The function is of this protein family is unknown.
Probab=21.61 E-value=1.2e+02 Score=20.51 Aligned_cols=24 Identities=13% Similarity=-0.013 Sum_probs=19.0
Q ss_pred eeeecCCHHHHHHHHHHcCccEEE
Q psy627 38 LALNFSSKEDAIQYCQKNGWKFFV 61 (89)
Q Consensus 38 v~l~F~s~e~Aiayae~~G~~Y~V 61 (89)
+.+.-++.|++++-++++|+...+
T Consensus 97 la~~v~dida~~~~l~~~G~~~~~ 120 (162)
T TIGR03645 97 FCVQDPDVEGLAERIVAAGGKKRM 120 (162)
T ss_pred EEEEcCCHHHHHHHHHHcCCcccC
Confidence 444448899999999999988544
No 280
>TIGR02190 GlrX-dom Glutaredoxin-family domain. This C-terminal domain with homology to glutaredoxin is fused to an N-terminal peroxiredoxin-like domain.
Probab=21.54 E-value=99 Score=18.52 Aligned_cols=21 Identities=14% Similarity=0.077 Sum_probs=17.8
Q ss_pred CCHHHHHHHHHHcCccEEEec
Q psy627 43 SSKEDAIQYCQKNGWKFFVEE 63 (89)
Q Consensus 43 ~s~e~Aiayae~~G~~Y~V~~ 63 (89)
+-...|.++.+++|++|++..
T Consensus 19 p~C~~ak~~L~~~gi~y~~id 39 (79)
T TIGR02190 19 PFCAKAKATLKEKGYDFEEIP 39 (79)
T ss_pred HhHHHHHHHHHHcCCCcEEEE
Confidence 346789999999999999865
No 281
>PRK05474 xylose isomerase; Provisional
Probab=21.50 E-value=1.1e+02 Score=25.58 Aligned_cols=29 Identities=31% Similarity=0.440 Sum_probs=22.2
Q ss_pred HHHHHHHHHcCccEEE-eccCCCCCCcccc
Q psy627 46 EDAIQYCQKNGWKFFV-EEPKWKTPKVKSY 74 (89)
Q Consensus 46 e~Aiayae~~G~~Y~V-~~p~~~~~~~ksY 74 (89)
..+++|+.+.|++..+ +||++.-++.-+|
T Consensus 212 ~~v~dYa~~iGf~~~f~IEPKP~EPr~hqy 241 (437)
T PRK05474 212 QMVVDYKHKIGFKGTFLIEPKPQEPTKHQY 241 (437)
T ss_pred HHHHHHhhhcCCCceEEeccCCCCCCCccc
Confidence 4679999999996654 6788777777776
No 282
>TIGR01004 PulS_OutS lipoprotein, PulS/OutS family. This family comprises lipoproteins from four gamma proteobacterial species: PulS protein of Klebsiella pneumoniae, the OutS protein of Erwinia chrysanthemi and Pectobacterium chrysanthemi, and the functionally uncharacterized E. coli protein EtpO. PulS and OutS have been shown to interact with and facilitate insertion of secretins into the outer membrane, suggesting a chaperone-like, or piloting function for members of this family.
Probab=21.47 E-value=64 Score=22.76 Aligned_cols=13 Identities=23% Similarity=0.412 Sum_probs=11.6
Q ss_pred HHHHHHHHcCccE
Q psy627 47 DAIQYCQKNGWKF 59 (89)
Q Consensus 47 ~Aiayae~~G~~Y 59 (89)
.||.||+.+||+-
T Consensus 70 aai~~A~~~gWd~ 82 (128)
T TIGR01004 70 TANNVAIGKGWDS 82 (128)
T ss_pred HHHHHHHHcCCch
Confidence 7999999999964
No 283
>PRK13813 orotidine 5'-phosphate decarboxylase; Provisional
Probab=21.46 E-value=1.1e+02 Score=21.75 Aligned_cols=34 Identities=18% Similarity=0.085 Sum_probs=25.3
Q ss_pred CCCCCceeeecCCHHHHHHHHHHcCccEEEeccC
Q psy627 32 GDPLSNLALNFSSKEDAIQYCQKNGWKFFVEEPK 65 (89)
Q Consensus 32 ~D~~~qv~l~F~s~e~Aiayae~~G~~Y~V~~p~ 65 (89)
.||.-||-|-+.|.++|++.+++-+=...+.++.
T Consensus 2 ~~~~l~~alD~~~~~~~~~~~~~~~~~~~~vk~g 35 (215)
T PRK13813 2 KDSRIILALDVTDRERALKIAEELDDYVDAIKVG 35 (215)
T ss_pred CCCCEEEEeCCCCHHHHHHHHHhccccCCEEEEc
Confidence 4677789999999999999999875332344443
No 284
>TIGR00442 hisS histidyl-tRNA synthetase. This model finds a histidyl-tRNA synthetase in every completed genome. Apparent second copies from Bacillus subtilis, Synechocystis sp., and Aquifex aeolicus are slightly shorter, more closely related to each other than to other hisS proteins, and actually serve as regulatory subunits for an enzyme of histidine biosynthesis. They were excluded from the seed alignment and score much lower than do single copy histidyl-tRNA synthetases of other genomes not included in the seed alignment. These putative second copies of HisS score below the trusted cutoff. The regulatory protein kinase GCN2 of Saccharomyces cerevisiae (YDR283c), and related proteins from other species designated eIF-2 alpha kinase, have a domain closely related to histidyl-tRNA synthetase that may serve to detect and respond to uncharged tRNA(his), an indicator of amino acid starvation; these regulatory proteins are not orthologous and so score below the noise cutoff.
Probab=21.41 E-value=1.1e+02 Score=23.80 Aligned_cols=21 Identities=24% Similarity=0.339 Sum_probs=18.3
Q ss_pred CCHHHHHHHHHHcCccEEEec
Q psy627 43 SSKEDAIQYCQKNGWKFFVEE 63 (89)
Q Consensus 43 ~s~e~Aiayae~~G~~Y~V~~ 63 (89)
.+.+++++||++.|++|.|.-
T Consensus 359 ~~l~k~~~~a~~~g~~~~i~i 379 (397)
T TIGR00442 359 RKLKKQLKYADKLGARFAVIL 379 (397)
T ss_pred CCHHHHHHHHHHcCCCEEEEE
Confidence 357899999999999998865
No 285
>PRK13011 formyltetrahydrofolate deformylase; Reviewed
Probab=21.34 E-value=1.2e+02 Score=23.35 Aligned_cols=29 Identities=14% Similarity=0.015 Sum_probs=24.2
Q ss_pred CceeeecCCHHHHHHHHHHcCccEEEecc
Q psy627 36 SNLALNFSSKEDAIQYCQKNGWKFFVEEP 64 (89)
Q Consensus 36 ~qv~l~F~s~e~Aiayae~~G~~Y~V~~p 64 (89)
..|.+-+.++.+|-+.|+++||++.+...
T Consensus 117 ~~i~~visn~~~~~~lA~~~gIp~~~~~~ 145 (286)
T PRK13011 117 MDIVGVVSNHPDLEPLAAWHGIPFHHFPI 145 (286)
T ss_pred cEEEEEEECCccHHHHHHHhCCCEEEeCC
Confidence 45777788888999999999999987643
No 286
>PRK05589 peptide chain release factor 2; Provisional
Probab=21.19 E-value=74 Score=25.47 Aligned_cols=16 Identities=19% Similarity=0.461 Sum_probs=14.0
Q ss_pred HHHHHHcCccEEEecc
Q psy627 49 IQYCQKNGWKFFVEEP 64 (89)
Q Consensus 49 iayae~~G~~Y~V~~p 64 (89)
..||+++||.++|.+-
T Consensus 112 ~~~a~~~g~~~~vi~~ 127 (325)
T PRK05589 112 TRWAEKKGYKVEIIDL 127 (325)
T ss_pred HHHHHHcCCEEEEEec
Confidence 4699999999999874
No 287
>PRK04247 hypothetical protein; Provisional
Probab=21.19 E-value=1.1e+02 Score=23.58 Aligned_cols=22 Identities=27% Similarity=0.382 Sum_probs=19.4
Q ss_pred HHHHHHHHHHcCccEEEeccCC
Q psy627 45 KEDAIQYCQKNGWKFFVEEPKW 66 (89)
Q Consensus 45 ~e~Aiayae~~G~~Y~V~~p~~ 66 (89)
.+.|..+|+..|++|...+|..
T Consensus 216 ~~~A~~ll~~~Gle~~~l~p~~ 237 (238)
T PRK04247 216 TDRARRLLEKEGLEFVKLEPPK 237 (238)
T ss_pred CHHHHHHHHHcCCeEEEecCCC
Confidence 5789999999999999988853
No 288
>COG0504 PyrG CTP synthase (UTP-ammonia lyase) [Nucleotide transport and metabolism]
Probab=21.00 E-value=65 Score=27.66 Aligned_cols=17 Identities=35% Similarity=0.595 Sum_probs=14.6
Q ss_pred CHHHHHHHHHHcCccEE
Q psy627 44 SKEDAIQYCQKNGWKFF 60 (89)
Q Consensus 44 s~e~Aiayae~~G~~Y~ 60 (89)
-+=.||+||..|+++|-
T Consensus 360 GkI~Ai~yAREn~iP~l 376 (533)
T COG0504 360 GKIAAIRYARENNIPFL 376 (533)
T ss_pred HHHHHHHHHHhcCCCEE
Confidence 35689999999999984
No 289
>cd02971 PRX_family Peroxiredoxin (PRX) family; composed of the different classes of PRXs including many proteins originally known as bacterioferritin comigratory proteins (BCP), based on their electrophoretic mobility before their function was identified. PRXs are thiol-specific antioxidant (TSA) proteins also known as TRX peroxidases and alkyl hydroperoxide reductase C22 (AhpC) proteins. They confer a protective antioxidant role in cells through their peroxidase activity in which hydrogen peroxide, peroxynitrate, and organic hydroperoxides are reduced and detoxified using reducing equivalents derived from either TRX, glutathione, trypanothione and AhpF. They are distinct from other peroxidases in that they have no cofactors such as metals or prosthetic groups. The first step of catalysis, common to all PRXs, is the nucleophilic attack by the catalytic cysteine (also known as the peroxidatic cysteine) on the peroxide leading to cleavage of the oxygen-oxygen bond and the formation of a
Probab=20.92 E-value=1.4e+02 Score=18.89 Aligned_cols=27 Identities=19% Similarity=0.270 Sum_probs=23.2
Q ss_pred eeeecCCHHHHHHHHHHc-CccEEEecc
Q psy627 38 LALNFSSKEDAIQYCQKN-GWKFFVEEP 64 (89)
Q Consensus 38 v~l~F~s~e~Aiayae~~-G~~Y~V~~p 64 (89)
+.+..++.+++-+|++++ +..|.+...
T Consensus 60 i~is~d~~~~~~~~~~~~~~~~~~~l~D 87 (140)
T cd02971 60 LGVSVDSPFSHKAWAEKEGGLNFPLLSD 87 (140)
T ss_pred EEEeCCCHHHHHHHHhcccCCCceEEEC
Confidence 567788999999999999 999988764
No 290
>KOG1502|consensus
Probab=20.91 E-value=98 Score=24.83 Aligned_cols=21 Identities=19% Similarity=0.472 Sum_probs=18.5
Q ss_pred HHHHHHHHHHcCccEEEeccC
Q psy627 45 KEDAIQYCQKNGWKFFVEEPK 65 (89)
Q Consensus 45 ~e~Aiayae~~G~~Y~V~~p~ 65 (89)
..+|-+||+++|++..+.-|.
T Consensus 172 EkaAw~fa~e~~~~lv~inP~ 192 (327)
T KOG1502|consen 172 EKAAWEFAKENGLDLVTINPG 192 (327)
T ss_pred HHHHHHHHHhCCccEEEecCC
Confidence 478999999999999988874
No 291
>cd07420 MPP_RdgC Drosophila melanogaster RdgC and related proteins, metallophosphatase domain. RdgC (retinal degeneration C) is a vertebrate serine-threonine protein phosphatase that is required to prevent light-induced retinal degeneration. In addition to its catalytic domain, RdgC has two C-terminal EF hands. Homologs of RdgC include the human phosphatases protein phosphatase with EF hands 1 and -2 (PPEF-1 and -2). PPEF-1 transcripts are present at low levels in the retina, PPEF-2 transcripts and PPEF-2 protein are present at high levels in photoreceptors. The PPP (phosphoprotein phosphatase) family, to which RdgC belongs, is one of two known protein phosphatase families specific for serine and threonine. The PPP family also includes: PP1, PP2A, PP2B (calcineurin), PP4, PP5, PP6, PP7, Bsu1, PrpE, PrpA/PrpB, and ApA4 hydrolase. The PPP catalytic domain is defined by three conserved motifs (-GDXHG-, -GDXVDRG- and -GNHE-). The PPP enzyme family is ancient with members found in all
Probab=20.89 E-value=1.9e+02 Score=22.82 Aligned_cols=25 Identities=16% Similarity=0.246 Sum_probs=20.0
Q ss_pred eeeecCCHHHHHHHHHHcCccEEEec
Q psy627 38 LALNFSSKEDAIQYCQKNGWKFFVEE 63 (89)
Q Consensus 38 v~l~F~s~e~Aiayae~~G~~Y~V~~ 63 (89)
....|. .++..+||++||+.+.|..
T Consensus 247 ~g~~FG-~~~~~~Fl~~n~l~~IIR~ 271 (321)
T cd07420 247 GGCYFG-PDVTSKVLQKHGLSLLIRS 271 (321)
T ss_pred CccccC-HHHHHHHHHHCCCcEEEEc
Confidence 455565 7777799999999999865
No 292
>PRK11070 ssDNA exonuclease RecJ; Provisional
Probab=20.88 E-value=1e+02 Score=26.23 Aligned_cols=21 Identities=10% Similarity=0.184 Sum_probs=18.0
Q ss_pred HHHHHHHHHHcCccEEEeccC
Q psy627 45 KEDAIQYCQKNGWKFFVEEPK 65 (89)
Q Consensus 45 ~e~Aiayae~~G~~Y~V~~p~ 65 (89)
--++|+||+.+|++..|..-|
T Consensus 140 ~~e~i~~a~~~gidvIVtDHH 160 (575)
T PRK11070 140 SHAGVAHAHALGIPVLVTDHH 160 (575)
T ss_pred CHHHHHHHHHCCCCEEEECCC
Confidence 467999999999999997744
No 293
>PF13380 CoA_binding_2: CoA binding domain; PDB: 3FF4_A 2D5A_A 2D59_A 2E6U_X 1IUL_A 1IUK_A 1Y81_A 2DUW_A.
Probab=20.78 E-value=90 Score=20.52 Aligned_cols=17 Identities=18% Similarity=0.403 Sum_probs=11.9
Q ss_pred CCHHHHHHHHHHcCccE
Q psy627 43 SSKEDAIQYCQKNGWKF 59 (89)
Q Consensus 43 ~s~e~Aiayae~~G~~Y 59 (89)
...++++++|+++|+.+
T Consensus 89 ~~~~~~~~~a~~~gi~v 105 (116)
T PF13380_consen 89 AESEELIEAAREAGIRV 105 (116)
T ss_dssp S--HHHHHHHHHTT-EE
T ss_pred hHHHHHHHHHHHcCCEE
Confidence 45688999999999874
No 294
>TIGR00068 glyox_I lactoylglutathione lyase. Glyoxylase I is a homodimer in many species. In some eukaryotes, including yeasts and plants, the orthologous protein carries a tandem duplication, is twice as long, and hits this model twice.
Probab=20.76 E-value=1.2e+02 Score=19.95 Aligned_cols=26 Identities=12% Similarity=0.157 Sum_probs=21.9
Q ss_pred CCceeeecCCHHHHHHHHHHcCccEE
Q psy627 35 LSNLALNFSSKEDAIQYCQKNGWKFF 60 (89)
Q Consensus 35 ~~qv~l~F~s~e~Aiayae~~G~~Y~ 60 (89)
..++.+..++.+++++.++++|+...
T Consensus 87 ~~hi~f~v~dld~~~~~l~~~G~~~~ 112 (150)
T TIGR00068 87 FGHIAIGVDDVYKACERVRALGGNVV 112 (150)
T ss_pred eeEEEEecCCHHHHHHHHHHcCCccc
Confidence 34577888999999999999999753
No 295
>PRK08057 cobalt-precorrin-6x reductase; Reviewed
Probab=20.73 E-value=1.4e+02 Score=22.55 Aligned_cols=27 Identities=15% Similarity=0.294 Sum_probs=22.4
Q ss_pred eecCCHHHHHHHHHHcCccEEEeccCC
Q psy627 40 LNFSSKEDAIQYCQKNGWKFFVEEPKW 66 (89)
Q Consensus 40 l~F~s~e~Aiayae~~G~~Y~V~~p~~ 66 (89)
=.|.+.|+=.+||+++++...|..-|+
T Consensus 49 G~l~~~~~l~~~l~~~~i~~VIDATHP 75 (248)
T PRK08057 49 GGFGGAEGLAAYLREEGIDLVIDATHP 75 (248)
T ss_pred CCCCCHHHHHHHHHHCCCCEEEECCCc
Confidence 357788999999999999998887665
No 296
>cd04443 DEP_GPR155 DEP (Dishevelled, Egl-10, and Pleckstrin) domain found in GPR155-like proteins. GRP155-like proteins, also known as PGR22, contain an N-terminal permease domain, a central transmembrane region and a C-terminal DEP domain. They are orphan receptors of the class B G protein-coupled receptors. Their function is unknown.
Probab=20.71 E-value=84 Score=20.07 Aligned_cols=22 Identities=14% Similarity=0.293 Sum_probs=18.0
Q ss_pred cCCHHHHHHHHH---HcCccEEEec
Q psy627 42 FSSKEDAIQYCQ---KNGWKFFVEE 63 (89)
Q Consensus 42 F~s~e~Aiayae---~~G~~Y~V~~ 63 (89)
..|+++||.+|. .+|+=..|..
T Consensus 46 ~~sR~eAv~lg~~Ll~~G~i~HV~~ 70 (83)
T cd04443 46 AQDRGEAVLYGRRLLQGGVLQHITN 70 (83)
T ss_pred CCCHHHHHHHHHHHHHCCCEEecCC
Confidence 589999999998 5788777754
No 297
>PRK12485 bifunctional 3,4-dihydroxy-2-butanone 4-phosphate synthase/GTP cyclohydrolase II-like protein; Provisional
Probab=20.65 E-value=88 Score=25.39 Aligned_cols=17 Identities=24% Similarity=0.417 Sum_probs=15.4
Q ss_pred CCHHHHHHHHHHcCccE
Q psy627 43 SSKEDAIQYCQKNGWKF 59 (89)
Q Consensus 43 ~s~e~Aiayae~~G~~Y 59 (89)
.+.+++++||++||++.
T Consensus 173 ~~~~~~~~fA~~h~l~~ 189 (369)
T PRK12485 173 ARRPDLEVFAAKHGIKI 189 (369)
T ss_pred cChHHHHHHHHHcCCcE
Confidence 36899999999999997
No 298
>TIGR00011 YbaK_EbsC ybaK/ebsC protein. This model represents the YbaK family, bacterial proteins whose full length sequence is homologous to an insertion domain in proline--tRNA ligases. The domain deacylates mischarged tRNAs. The YbaK protein of Haemophilus influenzae (HI1434), although still considered undefined in its role in vivo, likewise deacylates Ala-tRNA(Pro), but not the correctly charged Pro-tRNA(Pro). A crystallographic study of HI1434 suggests a nucleotide binding function. Previously, a member of this family was described as EbsC and was thought to be involved in cell wall metabolism.
Probab=20.57 E-value=83 Score=21.25 Aligned_cols=17 Identities=29% Similarity=0.380 Sum_probs=15.1
Q ss_pred HHHHHHHHcCccEEEec
Q psy627 47 DAIQYCQKNGWKFFVEE 63 (89)
Q Consensus 47 ~Aiayae~~G~~Y~V~~ 63 (89)
-|+++-+++||+|++.+
T Consensus 2 ~~~~~L~~~~i~~~~~~ 18 (152)
T TIGR00011 2 NAIRLLDKAKIEYEVHE 18 (152)
T ss_pred HHHHHHHHcCCCcEEEE
Confidence 37899999999999986
No 299
>PF14174 YycC: YycC-like protein
Probab=20.35 E-value=95 Score=18.83 Aligned_cols=17 Identities=29% Similarity=0.305 Sum_probs=15.0
Q ss_pred CHHHHHHHHHHcCccEE
Q psy627 44 SKEDAIQYCQKNGWKFF 60 (89)
Q Consensus 44 s~e~Aiayae~~G~~Y~ 60 (89)
|.|-|+..|++-|.+.+
T Consensus 7 S~eTA~kLs~~L~vPlE 23 (53)
T PF14174_consen 7 SPETAVKLSKKLGVPLE 23 (53)
T ss_pred CHHHHHHHHHHHCCcHH
Confidence 78999999999998754
No 300
>cd03078 GST_N_Metaxin1_like GST_N family, Metaxin subfamily, Metaxin 1-like proteins; composed of metaxins 1 and 3, and similar proteins including Tom37 from fungi. Mammalian metaxin (or metaxin 1) and the fungal protein Tom37 are components of preprotein import complexes of the mitochondrial outer membrane. Metaxin extends to the cytosol and is anchored to the mitochondrial membrane through its C-terminal domain. In mice, metaxin is required for embryonic development. Like the murine gene, the human metaxin gene is located downstream to the glucocerebrosidase (GBA) pseudogene and is convergently transcribed. Inherited deficiency of GBA results in Gaucher disease, which presents many diverse clinical phenotypes. Alterations in the metaxin gene, in addition to GBA mutations, may be associated with Gaucher disease. Genome sequencing shows that a third metaxin gene also exists in zebrafish, Xenopus, chicken and mammals.
Probab=20.33 E-value=1.6e+02 Score=17.67 Aligned_cols=21 Identities=14% Similarity=0.061 Sum_probs=17.0
Q ss_pred CCHHHHHHHHHHcCccEEEec
Q psy627 43 SSKEDAIQYCQKNGWKFFVEE 63 (89)
Q Consensus 43 ~s~e~Aiayae~~G~~Y~V~~ 63 (89)
+..=.++.|++--|++|+|..
T Consensus 17 p~clk~~~~Lr~~~~~~~v~~ 37 (73)
T cd03078 17 PECLAVLAYLKFAGAPLKVVP 37 (73)
T ss_pred HHHHHHHHHHHcCCCCEEEEe
Confidence 345568889999999999975
No 301
>TIGR00020 prfB peptide chain release factor 2. In many but not all taxa, there is a conserved real translational frameshift at a TGA codon. RF-2 helps terminate translation at TGA codons and can therefore regulate its own production by readthrough when RF-2 is insufficient. There is a Pfam model called "RF-1" for the superfamily of RF-1, RF-2, mitochondrial, RF-H, etc.
Probab=20.31 E-value=79 Score=25.67 Aligned_cols=16 Identities=13% Similarity=0.484 Sum_probs=13.8
Q ss_pred HHHHHHcCccEEEecc
Q psy627 49 IQYCQKNGWKFFVEEP 64 (89)
Q Consensus 49 iayae~~G~~Y~V~~p 64 (89)
..||+++||.++|.+-
T Consensus 151 ~~~a~~~g~~~evi~~ 166 (364)
T TIGR00020 151 LRWAERRGFKVEIIDY 166 (364)
T ss_pred HHHHHHcCCEEEEEec
Confidence 4699999999999873
No 302
>cd00340 GSH_Peroxidase Glutathione (GSH) peroxidase family; tetrameric selenoenzymes that catalyze the reduction of a variety of hydroperoxides including lipid peroxidases, using GSH as a specific electron donor substrate. GSH peroxidase contains one selenocysteine residue per subunit, which is involved in catalysis. Different isoenzymes are known in mammals,which are involved in protection against reactive oxygen species, redox regulation of many metabolic processes, peroxinitrite scavenging, and modulation of inflammatory processes.
Probab=20.28 E-value=1.1e+02 Score=20.42 Aligned_cols=21 Identities=24% Similarity=0.485 Sum_probs=17.6
Q ss_pred CCHHHHHHHHHH-cCccEEEec
Q psy627 43 SSKEDAIQYCQK-NGWKFFVEE 63 (89)
Q Consensus 43 ~s~e~Aiayae~-~G~~Y~V~~ 63 (89)
++.|++.+|+++ +|++|-|..
T Consensus 71 ~~~~~~~~f~~~~~~~~fp~~~ 92 (152)
T cd00340 71 GSNEEIKEFCETNYGVTFPMFA 92 (152)
T ss_pred CCHHHHHHHHHHhcCCCceeee
Confidence 567899999997 899998864
No 303
>PF14057 GGGtGRT: GGGtGRT protein
Probab=20.23 E-value=84 Score=25.14 Aligned_cols=21 Identities=24% Similarity=0.427 Sum_probs=18.4
Q ss_pred eeecCCHHHHHHHHHHcCccE
Q psy627 39 ALNFSSKEDAIQYCQKNGWKF 59 (89)
Q Consensus 39 ~l~F~s~e~Aiayae~~G~~Y 59 (89)
..-|.|+|+|-++|...|++.
T Consensus 18 ~~gi~sleea~~~c~~~gid~ 38 (328)
T PF14057_consen 18 EYGIASLEEAKEICLDKGIDV 38 (328)
T ss_pred HhCcccHHHHHHHHHHcCCCH
Confidence 356899999999999999974
No 304
>PRK08787 peptide chain release factor 2; Provisional
Probab=20.16 E-value=80 Score=25.25 Aligned_cols=16 Identities=25% Similarity=0.636 Sum_probs=13.9
Q ss_pred HHHHHHcCccEEEecc
Q psy627 49 IQYCQKNGWKFFVEEP 64 (89)
Q Consensus 49 iayae~~G~~Y~V~~p 64 (89)
..||+++||.++|.+-
T Consensus 92 ~r~A~~~g~~~evi~~ 107 (313)
T PRK08787 92 LRWAESRGWKTELMEV 107 (313)
T ss_pred HHHHHHcCCeEEEEec
Confidence 4699999999999774
No 305
>PRK07342 peptide chain release factor 2; Provisional
Probab=20.12 E-value=80 Score=25.48 Aligned_cols=16 Identities=25% Similarity=0.364 Sum_probs=13.9
Q ss_pred HHHHHHcCccEEEecc
Q psy627 49 IQYCQKNGWKFFVEEP 64 (89)
Q Consensus 49 iayae~~G~~Y~V~~p 64 (89)
..||+++||.++|.+-
T Consensus 115 ~~~a~~~g~~~~vi~~ 130 (339)
T PRK07342 115 TRWAERQGRKVEVLEV 130 (339)
T ss_pred HHHHHHcCCEEEEEec
Confidence 4699999999999774
No 306
>PRK00591 prfA peptide chain release factor 1; Validated
Probab=20.04 E-value=81 Score=25.56 Aligned_cols=17 Identities=24% Similarity=0.671 Sum_probs=14.3
Q ss_pred HHHHHHcCccEEEeccC
Q psy627 49 IQYCQKNGWKFFVEEPK 65 (89)
Q Consensus 49 iayae~~G~~Y~V~~p~ 65 (89)
+.||+++||.++|.+-.
T Consensus 138 ~~~a~~~g~~~~vi~~~ 154 (359)
T PRK00591 138 SRYAERQGWKVEILSAS 154 (359)
T ss_pred HHHHHHCCCEEEEEecC
Confidence 56999999999998743
Done!