Query psy17993
Match_columns 191
No_of_seqs 176 out of 1099
Neff 8.5
Searched_HMMs 46136
Date Fri Aug 16 19:34:21 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy17993.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/17993hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 cd09020 D-hex-6-P-epi_like D-h 100.0 4.7E-47 1E-51 308.6 21.8 182 8-191 86-269 (269)
2 KOG1594|consensus 100.0 3.6E-47 7.8E-52 297.2 18.7 182 8-190 111-294 (305)
3 cd09025 Aldose_epim_Slr1438 Al 100.0 1.6E-33 3.5E-38 229.4 15.9 165 10-191 97-271 (271)
4 COG0676 Uncharacterized enzyme 100.0 4.9E-33 1.1E-37 221.0 15.1 175 3-190 105-282 (287)
5 cd09021 Aldose_epim_Ec_YphB al 100.0 1.6E-29 3.4E-34 206.0 18.0 161 10-191 91-273 (273)
6 cd09024 Aldose_epim_lacX Aldos 100.0 1.8E-28 3.9E-33 201.3 20.0 171 8-191 84-287 (288)
7 cd01081 Aldose_epim aldose 1-e 99.9 1.8E-25 3.9E-30 182.0 17.9 164 10-188 93-283 (284)
8 cd09022 Aldose_epim_Ec_YihR Al 99.9 6.3E-25 1.4E-29 179.9 19.5 170 9-190 87-284 (284)
9 PF01263 Aldose_epim: Aldose 1 99.9 1E-25 2.2E-30 185.2 12.2 179 11-191 103-300 (300)
10 PRK15172 putative aldose-1-epi 99.9 4E-24 8.7E-29 176.5 19.3 167 9-191 106-296 (300)
11 cd09019 galactose_mutarotase_l 99.9 8.8E-22 1.9E-26 164.3 19.9 178 8-190 104-324 (326)
12 COG2017 GalM Galactose mutarot 99.9 5.4E-21 1.2E-25 158.3 18.0 163 12-191 116-304 (308)
13 PLN00194 aldose 1-epimerase; P 99.8 2.9E-19 6.4E-24 149.7 19.8 175 11-190 119-333 (337)
14 TIGR02636 galM_Leloir galactos 99.7 2.2E-16 4.8E-21 132.3 19.2 175 8-190 109-332 (335)
15 cd09023 Aldose_epim_Ec_c4013 A 99.7 4.4E-17 9.5E-22 133.6 12.7 162 12-188 96-283 (284)
16 PRK11055 galM galactose-1-epim 99.6 1.5E-13 3.2E-18 115.4 19.1 173 9-189 115-336 (342)
17 PTZ00485 aldolase 1-epimerase; 99.4 2.2E-11 4.9E-16 103.1 19.9 172 11-186 125-362 (376)
18 cd09269 deoxyribose_mutarotase 99.1 8.8E-10 1.9E-14 90.7 12.3 163 14-190 88-292 (293)
19 PF14486 DUF4432: Domain of un 98.6 8E-07 1.7E-11 73.6 11.8 153 25-190 123-300 (302)
20 KOG1604|consensus 98.1 0.00027 5.9E-09 58.5 15.8 169 13-189 132-346 (353)
21 COG0832 UreB Urea amidohydrola 90.1 0.56 1.2E-05 32.2 3.8 32 35-69 12-43 (106)
22 cd00407 Urease_beta Urease bet 89.6 0.52 1.1E-05 32.5 3.4 32 35-69 12-43 (101)
23 TIGR00192 urease_beta urease, 89.4 0.56 1.2E-05 32.3 3.5 31 36-69 13-43 (101)
24 PRK13203 ureB urease subunit b 89.1 0.59 1.3E-05 32.3 3.4 31 36-69 13-43 (102)
25 PF00699 Urease_beta: Urease b 88.3 0.48 1E-05 32.6 2.6 31 36-69 12-42 (100)
26 PRK13202 ureB urease subunit b 87.3 0.93 2E-05 31.4 3.5 31 36-69 13-44 (104)
27 PRK13201 ureB urease subunit b 86.6 0.98 2.1E-05 32.7 3.4 31 36-69 13-43 (136)
28 PRK13205 ureB urease subunit b 86.5 0.99 2.1E-05 33.4 3.5 31 36-69 13-43 (162)
29 PRK13204 ureB urease subunit b 85.5 1.2 2.5E-05 33.1 3.4 32 35-69 35-66 (159)
30 PF14315 DUF4380: Domain of un 85.5 18 0.0004 29.4 15.9 34 28-62 109-144 (274)
31 PRK13198 ureB urease subunit b 85.3 1.2 2.6E-05 33.0 3.4 32 35-69 40-71 (158)
32 PRK13192 bifunctional urease s 79.4 2.4 5.2E-05 33.0 3.3 42 25-69 110-152 (208)
33 PRK13986 urease subunit alpha; 78.2 2.5 5.5E-05 33.3 3.1 32 35-69 117-148 (225)
34 PF04744 Monooxygenase_B: Mono 72.7 4.6 0.0001 34.4 3.5 29 33-62 255-283 (381)
35 TIGR03079 CH4_NH3mon_ox_B meth 68.5 6.8 0.00015 33.4 3.6 27 34-61 275-301 (399)
36 PF05506 DUF756: Domain of unk 67.9 15 0.00033 24.4 4.7 38 27-67 6-43 (89)
37 PF12690 BsuPI: Intracellular 67.1 9 0.00019 25.4 3.4 20 42-62 1-20 (82)
38 PF09095 DUF1926: Domain of un 53.0 27 0.00059 28.6 4.6 29 26-55 144-172 (278)
39 PF06030 DUF916: Bacterial pro 51.2 42 0.00092 23.9 4.8 29 40-69 26-54 (121)
40 PF14796 AP3B1_C: Clathrin-ada 50.7 63 0.0014 23.9 5.7 15 176-190 124-138 (145)
41 PLN02303 urease 43.7 25 0.00054 33.4 3.3 32 35-69 142-173 (837)
42 PF00345 PapD_N: Pili and flag 42.0 71 0.0015 22.2 4.8 27 36-63 9-35 (122)
43 PF03351 DOMON: DOMON domain; 41.5 79 0.0017 21.7 5.0 35 31-66 4-38 (124)
44 PF14310 Fn3-like: Fibronectin 41.3 33 0.00072 21.6 2.7 16 175-190 26-41 (71)
45 TIGR01451 B_ant_repeat conserv 38.6 44 0.00096 19.9 2.8 16 40-55 11-26 (53)
46 PF14849 YidC_periplas: YidC p 38.4 1.6E+02 0.0036 23.4 7.0 44 9-60 86-131 (270)
47 TIGR03593 yidC_nterm membrane 37.9 1.7E+02 0.0037 24.6 7.4 30 29-59 172-203 (366)
48 PF01345 DUF11: Domain of unkn 36.5 50 0.0011 20.9 3.1 16 40-55 40-55 (76)
49 PF05543 Peptidase_C47: Stapho 36.2 20 0.00043 27.4 1.2 35 137-186 134-168 (175)
50 PRK01318 membrane protein inse 36.0 1.8E+02 0.0039 26.3 7.3 32 29-61 138-171 (521)
51 PF10409 PTEN_C2: C2 domain of 34.7 82 0.0018 22.3 4.3 52 25-78 53-106 (134)
52 PF10065 DUF2303: Uncharacteri 34.0 60 0.0013 26.6 3.8 27 27-53 221-247 (276)
53 PF14874 PapD-like: Flagellar- 33.5 1.4E+02 0.003 19.8 5.1 24 39-63 18-41 (102)
54 PF03072 DUF237: MG032/MG096/M 32.5 46 0.00099 24.3 2.5 20 21-40 46-65 (137)
55 cd01176 IPT_RBP-Jkappa IPT dom 31.5 63 0.0014 22.0 2.9 37 130-170 21-58 (97)
56 PF11611 DUF4352: Domain of un 31.3 65 0.0014 22.1 3.2 22 41-63 36-57 (123)
57 cd06535 CIDE_N_CAD CIDE_N doma 30.9 16 0.00034 24.0 -0.1 21 164-186 57-77 (77)
58 PHA02310 hypothetical protein 29.4 62 0.0013 23.3 2.7 41 131-171 80-122 (130)
59 PF14100 PmoA: Methane oxygena 28.1 73 0.0016 26.0 3.3 28 163-190 224-252 (271)
60 PF02120 Flg_hook: Flagellar h 28.1 97 0.0021 19.9 3.4 38 9-53 11-48 (85)
61 PF06586 TraK: TraK protein; 27.8 1.6E+02 0.0034 23.1 5.2 36 26-62 170-206 (234)
62 PRK15098 beta-D-glucoside gluc 27.7 44 0.00096 31.5 2.3 33 23-55 637-681 (765)
63 KOG3150|consensus 26.0 38 0.00083 25.4 1.2 20 128-147 16-35 (182)
64 PF11325 DUF3127: Domain of un 25.6 1.1E+02 0.0024 20.4 3.3 31 160-190 33-65 (84)
65 cd06536 CIDE_N_ICAD CIDE_N dom 24.7 26 0.00056 23.2 0.1 21 163-185 59-79 (80)
66 PF14742 GDE_N_bis: N-terminal 24.7 3.2E+02 0.0069 20.9 7.8 39 25-65 76-114 (194)
67 PF00207 A2M: Alpha-2-macroglo 24.6 2.1E+02 0.0046 18.8 4.6 14 7-20 12-25 (92)
68 PF06165 Glyco_transf_36: Glyc 24.5 2E+02 0.0043 19.8 4.6 34 28-62 48-83 (110)
69 PF10633 NPCBM_assoc: NPCBM-as 24.5 61 0.0013 20.7 1.9 16 40-55 4-19 (78)
70 PF07610 DUF1573: Protein of u 24.4 77 0.0017 18.1 2.1 12 47-59 2-13 (45)
71 PF07705 CARDB: CARDB; InterP 23.8 91 0.002 20.2 2.7 17 39-55 17-33 (101)
72 PF13739 DUF4163: Domain of un 23.4 2.1E+02 0.0046 18.4 5.1 35 25-67 64-99 (101)
73 PF05753 TRAP_beta: Translocon 23.2 85 0.0018 24.1 2.7 20 36-55 33-52 (181)
74 cd06539 CIDE_N_A CIDE_N domain 23.2 29 0.00062 22.9 0.1 21 163-185 57-77 (78)
75 KOG2143|consensus 22.3 69 0.0015 29.7 2.3 14 134-147 793-806 (854)
76 PF09624 DUF2393: Protein of u 22.2 2.3E+02 0.0051 20.5 4.9 32 26-58 46-78 (149)
77 cd06537 CIDE_N_B CIDE_N domain 21.7 33 0.00071 22.8 0.1 22 163-186 56-77 (81)
78 PF05618 Zn_protease: Putative 21.2 1.7E+02 0.0037 21.3 3.9 34 29-63 83-116 (138)
79 cd03780 MATH_TRAF5 Tumor Necro 21.2 2.7E+02 0.0058 20.4 5.0 31 8-40 53-85 (148)
80 cd01615 CIDE_N CIDE_N domain, 21.1 36 0.00078 22.4 0.2 21 163-185 57-77 (78)
81 PLN03080 Probable beta-xylosid 21.0 84 0.0018 29.8 2.7 13 43-55 686-698 (779)
82 PF06205 GT36_AF: Glycosyltran 20.5 1.2E+02 0.0026 20.2 2.7 16 175-190 67-82 (90)
83 PF02883 Alpha_adaptinC2: Adap 20.2 2.9E+02 0.0063 18.7 4.9 48 30-81 11-60 (115)
No 1
>cd09020 D-hex-6-P-epi_like D-hexose-6-phosphate epimerase-like. D-Hexose-6-phosphate epimerase Ymr099c from Saccharomyces cerevisiae belongs to the large superfamily of aldose-1-epimerases. Its active site is very similar to the catalytic site of galactose mutarotase, the best studied member of the superfamily. It also contains the conserved glutamate and histidine residues that have been shown in galactose mutarotase to be critical for catalysis, the glutamate serving as the active site base to initiate the reaction by removing the proton from the C-1 hydroxyl group of the sugar substrate, and the histidine as the active site acid to protonate the C-5 ring oxygen. In addition Ymr099c contains 2 conserved arginine residues which are involved in phosphate binding, and exhibits hexose-6-phosphate mutarotase activity on glucose-6-P, galactose-6-P and mannose-6-P.
Probab=100.00 E-value=4.7e-47 Score=308.59 Aligned_cols=182 Identities=32% Similarity=0.543 Sum_probs=166.7
Q ss_pred CCCeEEEEEccCC-CcccCCCcceEEEEEEEEeCCceEEEEEEEeCCCCceeEEeeccceeEEeCCCCceEEeeCCCCee
Q psy17993 8 TLDAFCIFPYLDI-FCIFPYFARFRLTYRLILREKELHFNIGVYNPSNDLAFNFNLLLHTYLKVPDVRRCQVNGLYGCHY 86 (191)
Q Consensus 8 ~~~~~~~~~l~~~-~t~~~yP~~f~l~~~~~L~~~~L~~~l~v~N~~~~~~m~f~~g~HpyF~v~~~~~~~~~gl~~~~~ 86 (191)
.+.++++|.|.++ ++++.|||+|+++++|+|.+++|+++++|+|+|++ +|||++|+||||++++++++.+.|+.+..|
T Consensus 86 ~~~~~l~l~l~~~~~~~~~~P~~f~l~~~~~L~~~~L~~~l~v~N~g~~-~~p~~~g~HpYf~v~d~~~~~v~gl~~~~y 164 (269)
T cd09020 86 EDGVTVSLELDDTDETRAIWPHAFELRLTVTLGFDTLELELTVTNTGDK-PFSFTAALHTYFRVSDIEQVRVEGLEGATY 164 (269)
T ss_pred CCceEEEEEeCCChhhhhcCCCceEEEEEEEEcCCcEEEEEEEECCCCC-CeEehhccCeeEecCCccccEEeCCCCCce
Confidence 3467899999998 99999999999999999999999999999999999 999999999999999999999999999899
Q ss_pred eeccCCcceeeeecceEEeCCCCCeeEeCCCCeEEEEEeCCCceEEEEEeCCCcEEEeCCcc-CCCCCCCCCCCCCCceE
Q psy17993 87 IDKVLDNQLFRETRETVSVSSLTDRIYQNTCQEHVVTNVVSGRKMRLQKYNFPDTVIWNPWD-TGRDMKDFGDDEYPNMI 165 (191)
Q Consensus 87 ~d~~~~~~~~~~~~~~~~~~~~~D~iy~~~~~~~~l~~~~~~~~i~v~~~~~~~~vvWnp~~-~~~~~~d~~~~~~~~fv 165 (191)
+|++.+.... ...+.+.+.+++||||.++++.+.|.+...++.|+|++.+++++||||||. +++.|+||++++|++||
T Consensus 165 ~d~~~~~~~~-~~~~~~~~~~~~Drvy~~~~~~~~i~d~~~~~~i~i~~~~~~~~VVWNP~~~~~~~~~d~~~~~~~~fv 243 (269)
T cd09020 165 LDKLTDQREK-VQGGAVTFDGEVDRVYLNTPAPLTIDDPAWGRRIRIEKSGSPSAVVWNPWIEKAARMADFPDDGYRRMV 243 (269)
T ss_pred EEcCCCcccc-ccCCceEECCccceEEeCCCCCEEEEcCCCCcEEEEEecCCCCEEEeCcchhhccccccCCccccceEE
Confidence 9987653222 224578899999999998888899999888999999999999999999999 99999999999999999
Q ss_pred EecceecCCCeEeCCCCEEEEEEEeC
Q psy17993 166 CVEAGHVTSPVLLHPGTTFEASQILQ 191 (191)
Q Consensus 166 CiEP~~~~~~~~L~pGe~~~~~~~i~ 191 (191)
|||||++.+.+.|+||++|+++++|+
T Consensus 244 CvEp~~~~~~~~L~pG~~~~~~~~i~ 269 (269)
T cd09020 244 CVEAANVADPVTLAPGESHTLSQTIS 269 (269)
T ss_pred EECeeecCCCEEECCCCCEEEEEEEC
Confidence 99999988899999999999999985
No 2
>KOG1594|consensus
Probab=100.00 E-value=3.6e-47 Score=297.25 Aligned_cols=182 Identities=34% Similarity=0.614 Sum_probs=172.3
Q ss_pred CCCeEEEEEccCC-CcccCCCcceEEEEEEEEeCCceEEEEEEEeCCCCceeEEeeccceeEEeCCCCceEEeeCCCCee
Q psy17993 8 TLDAFCIFPYLDI-FCIFPYFARFRLTYRLILREKELHFNIGVYNPSNDLAFNFNLLLHTYLKVPDVRRCQVNGLYGCHY 86 (191)
Q Consensus 8 ~~~~~~~~~l~~~-~t~~~yP~~f~l~~~~~L~~~~L~~~l~v~N~~~~~~m~f~~g~HpyF~v~~~~~~~~~gl~~~~~ 86 (191)
.+...|.|.|.++ +++++|||+|++++++.|.++.|++...|+|++++ ++.|++++|+||+|++++.++++||+++.|
T Consensus 111 ~~~a~Vdl~Lk~~~~~~kiWp~~Fe~~lrv~l~~g~Lt~~~rV~Ntd~K-pFsF~~alHtYf~vsdisevrveGL~tldy 189 (305)
T KOG1594|consen 111 LGKATVDLILKSSEDDLKIWPHSFELRLRVSLGDGELTLTSRVRNTDSK-PFSFSFALHTYFRVSDISEVRVEGLETLDY 189 (305)
T ss_pred CCceeEEEEecCChhhhhhCCcceEEEEEEEEcCCceEEEEEeecCCCC-ceEEEeEeeeeEeecccceEEEeccccccc
Confidence 4568999999999 99999999999999999999999999999999999 999999999999999999999999999999
Q ss_pred eeccCCcceeeeecceEEeCCCCCeeEeCCCCeEEEEEeCCCceEEEEEeCCCcEEEeCCcc-CCCCCCCCCCCCCCceE
Q psy17993 87 IDKVLDNQLFRETRETVSVSSLTDRIYQNTCQEHVVTNVVSGRKMRLQKYNFPDTVIWNPWD-TGRDMKDFGDDEYPNMI 165 (191)
Q Consensus 87 ~d~~~~~~~~~~~~~~~~~~~~~D~iy~~~~~~~~l~~~~~~~~i~v~~~~~~~~vvWnp~~-~~~~~~d~~~~~~~~fv 165 (191)
+|++.....+.+..+.++|++++|+||.+.+.++.|.+-..+++|.|.++|+|+.||||||. +++.|+||++++|++|+
T Consensus 190 lD~~~~~~~~tE~~davTF~~e~DrvYl~tp~e~aI~dh~~krti~l~k~g~pDaVVWNPW~kksk~maD~gde~Y~~ml 269 (305)
T KOG1594|consen 190 LDNLKNRERFTEQRDAVTFNSEVDRVYLNTPTELAIFDHEKKRTIVLKKEGLPDAVVWNPWDKKSKTMADFGDEDYKHML 269 (305)
T ss_pred cccccchhhccccCceEeeccceeeEEecCCceEEEEEeccccEEEEeccCCCceEEeChhHhhhhhhhhccccccceeE
Confidence 99887665566777789999999999999999999999899999999999999999999999 99999999999999999
Q ss_pred EecceecCCCeEeCCCCEEEEEEEe
Q psy17993 166 CVEAGHVTSPVLLHPGTTFEASQIL 190 (191)
Q Consensus 166 CiEP~~~~~~~~L~pGe~~~~~~~i 190 (191)
|||++.+..++.|+||++|++.+.+
T Consensus 270 CVe~a~v~~pI~L~PG~eW~g~q~L 294 (305)
T KOG1594|consen 270 CVESAAVESPITLKPGEEWKGRQLL 294 (305)
T ss_pred EecccccCCceeecCCccceeEEEE
Confidence 9999999999999999999999876
No 3
>cd09025 Aldose_epim_Slr1438 Aldose 1-epimerase, similar to Synechocystis Slr1438. Proteins similar to Synechocystis Slr1438 are uncharacterized members of aldose-1-epimerase superfamily. Aldose 1-epimerases or mutarotases are key enzymes of carbohydrate metabolism, catalyzing the interconversion of the alpha- and beta-anomers of hexose sugars such as glucose and galactose. This interconversion is an important step that allows anomer specific metabolic conversion of sugars. Studies of the catalytic mechanism of the best known member of the family, galactose mutarotase, have shown a glutamate and a histidine residue to be critical for catalysis; the glutamate serves as the active site base to initiate the reaction by removing the proton from the C-1 hydroxyl group of the sugar substrate, and the histidine as the active site acid to protonate the C-5 ring oxygen.
Probab=100.00 E-value=1.6e-33 Score=229.43 Aligned_cols=165 Identities=22% Similarity=0.264 Sum_probs=136.7
Q ss_pred CeEEEEEccCC-CcccCCCcceEEEEEEEEeCCceEEEEEEEeCCCCceeEEeeccceeEEeCCCCceEEeeCCCCeeee
Q psy17993 10 DAFCIFPYLDI-FCIFPYFARFRLTYRLILREKELHFNIGVYNPSNDLAFNFNLLLHTYLKVPDVRRCQVNGLYGCHYID 88 (191)
Q Consensus 10 ~~~~~~~l~~~-~t~~~yP~~f~l~~~~~L~~~~L~~~l~v~N~~~~~~m~f~~g~HpyF~v~~~~~~~~~gl~~~~~~d 88 (191)
...++|+|.++ ++++.|||+|+++++|+|.+++|+++++|+|+|++ +|||++|+||||++++.+++.+.++ +..|++
T Consensus 97 ~~~v~l~l~~~~~~~~~~P~~~~~~~~y~L~~~~L~i~~~v~N~~~~-~~p~~~g~HpYF~~~~~~~~~l~~~-~~~~~~ 174 (271)
T cd09025 97 GAGLTLTLRDNEATRAVYPFDFELELTYRLAGNTLEIAQRVHNLGDQ-PMPFSFGFHPYFAVPDKAKLSLDLP-PTRCFD 174 (271)
T ss_pred CcEEEEEEeCCHHHHhhCCceEEEEEEEEEeCCEEEEEEEEEECCCC-cEEEEEecCceeeCCchhccEEEcC-HHHHhh
Confidence 56899999888 78899999999999999999999999999999999 9999999999999998889999877 456777
Q ss_pred ccCCcceeeeecceEEeCCCCCeeEeCCCCeEEEEEeCCCceEEEEEeC-CCcEEEeCCccCCCCCCCCCCCCCCceEEe
Q psy17993 89 KVLDNQLFRETRETVSVSSLTDRIYQNTCQEHVVTNVVSGRKMRLQKYN-FPDTVIWNPWDTGRDMKDFGDDEYPNMICV 167 (191)
Q Consensus 89 ~~~~~~~~~~~~~~~~~~~~~D~iy~~~~~~~~l~~~~~~~~i~v~~~~-~~~~vvWnp~~~~~~~~d~~~~~~~~fvCi 167 (191)
+......... .....+..++|++|.+++ .+.|.+...++.|.|..++ ++++|||||+ +++||||
T Consensus 175 ~~~~~~~~~~-~~~~~~~~~~D~~y~~~~-~~~l~~~~~~~~i~l~~~~~~~~~vvw~p~-------------~~~~vCv 239 (271)
T cd09025 175 QKTDEEANTP-GQFDETEEGVDLLFRPLG-PASLTDGARGLKITLDHDEPFSNLVVWTDK-------------GKDFVCL 239 (271)
T ss_pred hccCCccCCc-ccccccccccchhhccCC-ceEEEecCCCEEEEEecCCCcceEEEecCC-------------CCcEEEE
Confidence 6543221111 223344567999998775 7888887668899998874 8999999994 5799999
Q ss_pred cceec--------CCCeEeCCCCEEEEEEEeC
Q psy17993 168 EAGHV--------TSPVLLHPGTTFEASQILQ 191 (191)
Q Consensus 168 EP~~~--------~~~~~L~pGe~~~~~~~i~ 191 (191)
||++. .+++.|+|||+++++++|.
T Consensus 240 Ep~t~~~nA~n~~~~~~~L~PGe~~~~~~~i~ 271 (271)
T cd09025 240 EPWTGPRNALNTGERLLLLPPGETEEASVRIQ 271 (271)
T ss_pred ecCCCCccccCcCCccEEECCCCEEEEEEEEC
Confidence 99841 2468999999999999985
No 4
>COG0676 Uncharacterized enzymes related to aldose 1-epimerase [Carbohydrate transport and metabolism]
Probab=100.00 E-value=4.9e-33 Score=221.02 Aligned_cols=175 Identities=25% Similarity=0.360 Sum_probs=146.5
Q ss_pred cccccCCCeEEEEEccCC-CcccCCCcceEEEEEEEEeCCceEEEEEEEeCCCCceeEEeeccceeEEeCCCCceEEeeC
Q psy17993 3 DETTATLDAFCIFPYLDI-FCIFPYFARFRLTYRLILREKELHFNIGVYNPSNDLAFNFNLLLHTYLKVPDVRRCQVNGL 81 (191)
Q Consensus 3 ~~~~~~~~~~~~~~l~~~-~t~~~yP~~f~l~~~~~L~~~~L~~~l~v~N~~~~~~m~f~~g~HpyF~v~~~~~~~~~gl 81 (191)
|++.+++.+.++|.|.++ + |+.|.++++++++ .+|++++++.|.. . |+.+|||||+|.|++++.+.||
T Consensus 105 ~~~~~~~~v~v~f~L~~~~~-----p~~~~lr~~~~~g-~~le~~l~~~~~~---s--~~~AlHtYF~VgDi~qv~V~GL 173 (287)
T COG0676 105 EHDEDEDGVRVTFGLDLEDE-----PHDFTLRLTFRFG-ETLELELESYGEE---S--FQAALHTYFRVGDIEQVEVSGL 173 (287)
T ss_pred ehhcccCceEEEEEeCCCcc-----ccceEEEEEeecc-ceEEEEEEecChh---H--HHHhhcceEEecchhheEeccC
Confidence 456678888999999887 4 9999999999997 6899999888765 3 8999999999999999999999
Q ss_pred CCCeeeeccCCcceeeeecceEEeCCCCCeeEeCCCCeEEEEEeCCCceEEEEEeCCCcEEEeCCcc-CCCCCCCCCCCC
Q psy17993 82 YGCHYIDKVLDNQLFRETRETVSVSSLTDRIYQNTCQEHVVTNVVSGRKMRLQKYNFPDTVIWNPWD-TGRDMKDFGDDE 160 (191)
Q Consensus 82 ~~~~~~d~~~~~~~~~~~~~~~~~~~~~D~iy~~~~~~~~l~~~~~~~~i~v~~~~~~~~vvWnp~~-~~~~~~d~~~~~ 160 (191)
.+..+.+.... .......+..+|+++.||||.+......|.|+..+|+|+|+..+...+||||||. ++.+|.||++++
T Consensus 174 ~~~~~~~~~~~-~~~v~~~g~~~~~~~~DriY~~~~~~~~I~D~~~~R~I~l~~~~~~~~VvWNP~~~~s~~M~d~~d~g 252 (287)
T COG0676 174 GGVCIDKVLNA-EEEVTQHGIVTFPGETDRIYLNPEPCSVIVDPALNRIITLEHQGHSSTVVWNPGHAKSSSMADMPDDG 252 (287)
T ss_pred Cceehhhhhhc-eeeccCCCceeeCCCccEEEEcCCCceEEecCcceEEEEEEecCCCCeEEECCCcccccccccccccc
Confidence 87655443332 2212223358899999999999888888999988999999999999999999999 999999999999
Q ss_pred CCceEEecceecCC-CeEeCCCCEEEEEEEe
Q psy17993 161 YPNMICVEAGHVTS-PVLLHPGTTFEASQIL 190 (191)
Q Consensus 161 ~~~fvCiEP~~~~~-~~~L~pGe~~~~~~~i 190 (191)
|++||||||+.+.. ...++|| ..++.++|
T Consensus 253 yq~mlCvEta~~~~~l~~~~~~-~~~l~~~i 282 (287)
T COG0676 253 YQTMLCVETARVGPDLKVGEPG-PVRLALKI 282 (287)
T ss_pred ceEEEEEeecccCcchhhcCCc-ceeeeeee
Confidence 99999999998875 4566666 66677766
No 5
>cd09021 Aldose_epim_Ec_YphB aldose 1-epimerase, similar to Escherichia coli YphB. Proteins similar to Escherichia coli YphB are uncharacterized members of the aldose-1-epimerase superfamily. Aldose 1-epimerases or mutarotases are key enzymes of carbohydrate metabolism, catalyzing the interconversion of the alpha- and beta-anomers of hexose sugars such as glucose and galactose. This interconversion is an important step that allows anomer specific metabolic conversion of sugars. Studies of the catalytic mechanism of the best known member of the family, galactose mutarotase, have shown a glutamate and a histidine residue to be critical for catalysis; the glutamate serves as the active site base to initiate the reaction by removing the proton from the C-1 hydroxyl group of the sugar substrate, and the histidine as the active site acid to protonate the C-5 ring oxygen.
Probab=99.97 E-value=1.6e-29 Score=206.05 Aligned_cols=161 Identities=15% Similarity=0.142 Sum_probs=123.8
Q ss_pred CeEEEEEccCC-CcccCCCcceEEEEEEEEeCCceEEEEEEEeCCCCceeEEeeccceeEEeCCCCceEEeeCCCCeee-
Q psy17993 10 DAFCIFPYLDI-FCIFPYFARFRLTYRLILREKELHFNIGVYNPSNDLAFNFNLLLHTYLKVPDVRRCQVNGLYGCHYI- 87 (191)
Q Consensus 10 ~~~~~~~l~~~-~t~~~yP~~f~l~~~~~L~~~~L~~~l~v~N~~~~~~m~f~~g~HpyF~v~~~~~~~~~gl~~~~~~- 87 (191)
+.+++|+|..+ .+ +||+|+++++|+|.+++|+++++|+|++++ +|||++|+||||++++...+++.+.. .+.
T Consensus 91 ~~~v~l~l~~~~~~---~P~~~~~~~~y~L~~~~L~i~~~~~N~~~~-~~~~~~g~H~YF~~~~~~~l~v~~~~--~~~~ 164 (273)
T cd09021 91 ADSAELQLDHEADD---PPWAYRAEQRFHLAGDGLSITLSVTNRGDR-PMPAGLGFHPYFPRTPDTRLQADADG--VWLE 164 (273)
T ss_pred CCeEEEEEecCCCC---CCEeEEEEEEEEEcCCCEEEEEEEEECCCC-CceeeeecCccEecCCCCEEEEecce--EEec
Confidence 35778888766 43 399999999999998999999999999999 99999999999999986677776542 122
Q ss_pred --eccCCcceeeeecceEEeC-------CCCCeeEeCCCCeEEEEEeCCCceEEEEEeC-CCcEEEeCCccCCCCCCCCC
Q psy17993 88 --DKVLDNQLFRETRETVSVS-------SLTDRIYQNTCQEHVVTNVVSGRKMRLQKYN-FPDTVIWNPWDTGRDMKDFG 157 (191)
Q Consensus 88 --d~~~~~~~~~~~~~~~~~~-------~~~D~iy~~~~~~~~l~~~~~~~~i~v~~~~-~~~~vvWnp~~~~~~~~d~~ 157 (191)
+.++.+.... ...++|. ..+|++|......+.+.++..++.|+|..++ ++++|||||+
T Consensus 165 d~~~lp~~~~~~--~~~~df~~~~~l~~~~~D~~f~~~~~~~~l~~~~~~~~i~v~~~~~~~~~vvwtp~---------- 232 (273)
T cd09021 165 DEDHLPTGLRPH--PPDWDFSQPRPLPDRWIDNCFTGWDGAALIWPPERGLALTIEADAPFSHLVVYRPP---------- 232 (273)
T ss_pred CCCcCCCcccCC--CcchhhcCCCcCCcccccccccCCCcceEEecCCCCcEEEEecCCCCCEEEEEcCC----------
Confidence 3333221111 1112222 3589999865556778777778999999998 9999999995
Q ss_pred CCCCCceEEecceec----------CCCeEeCCCCEEEEEEEeC
Q psy17993 158 DDEYPNMICVEAGHV----------TSPVLLHPGTTFEASQILQ 191 (191)
Q Consensus 158 ~~~~~~fvCiEP~~~----------~~~~~L~pGe~~~~~~~i~ 191 (191)
+++|||||||.. .+.+.|+|||+++.+++|+
T Consensus 233 ---~~~~vCvEP~~~~~da~~~~~~~~~~~L~pge~~~~~~~i~ 273 (273)
T cd09021 233 ---GEDFFCLEPVSHAPDAHHGPGDPGLRVLAPGESLSLSMRIT 273 (273)
T ss_pred ---CCCeEEEccccCCCCccCCcCCCCcEEeCCCCcEEEEEEEC
Confidence 469999999951 1248999999999999985
No 6
>cd09024 Aldose_epim_lacX Aldose 1-epimerase, similar to Lactococcus lactis lacX. Proteins similar to Lactococcus lactis lacX are uncharacterized members of aldose-1-epimerase superfamily. Aldose 1-epimerases or mutarotases are key enzymes of carbohydrate metabolism, catalyzing the interconversion of the alpha- and beta-anomers of hexose sugars such as glucose and galactose. This interconversion is an important step that allows anomer specific metabolic conversion of sugars. Studies of the catalytic mechanism of the best known member of the family, galactose mutarotase, have shown a glutamate and a histidine residue to be critical for catalysis; the glutamate serves as the active site base to initiate the reaction by removing the proton from the C-1 hydroxyl group of the sugar substrate, and the histidine as the active site acid to protonate the C-5 ring oxygen.
Probab=99.96 E-value=1.8e-28 Score=201.31 Aligned_cols=171 Identities=20% Similarity=0.213 Sum_probs=123.8
Q ss_pred CCCeEEEEEccCC-CcccCCCcceEEEEEEEEeCCceEEEEEEEeCCCCceeEEeeccceeEEeCCCC-----ceEEeeC
Q psy17993 8 TLDAFCIFPYLDI-FCIFPYFARFRLTYRLILREKELHFNIGVYNPSNDLAFNFNLLLHTYLKVPDVR-----RCQVNGL 81 (191)
Q Consensus 8 ~~~~~~~~~l~~~-~t~~~yP~~f~l~~~~~L~~~~L~~~l~v~N~~~~~~m~f~~g~HpyF~v~~~~-----~~~~~gl 81 (191)
.++.+++|++.++ ++++.|||+|+++++|+|.+++|+++++|+|++++ +|||++|+||||++++.. ++.+.--
T Consensus 84 ~~~~~v~l~l~~~~~~~~~~P~~~~~~~~y~L~~~~L~i~~~v~N~~~~-~~p~~~g~HpYF~~~~~~~~~~~~~~l~~~ 162 (288)
T cd09024 84 QSDDSVTFELTDNEETLKVYPFDFELRVTYTLEGNTLKVTYEVKNPDDK-TMPFSIGGHPAFNCPLDEGEKFEDYYLEFE 162 (288)
T ss_pred ccCCEEEEEEccCcchhhcCCeEEEEEEEEEEeCCEEEEEEEEEcCCCC-ceEEEEeCCceEECCCCCCCcccceEEEEC
Confidence 3456899999888 78899999999999999999999999999999999 999999999999998643 4444311
Q ss_pred CC--Ceee--ec---cCCccee-eeecceEEeCCC---CCeeE-eCCC-CeEEEEEeCCCceEEEEEeCCCcEEEeCCcc
Q psy17993 82 YG--CHYI--DK---VLDNQLF-RETRETVSVSSL---TDRIY-QNTC-QEHVVTNVVSGRKMRLQKYNFPDTVIWNPWD 148 (191)
Q Consensus 82 ~~--~~~~--d~---~~~~~~~-~~~~~~~~~~~~---~D~iy-~~~~-~~~~l~~~~~~~~i~v~~~~~~~~vvWnp~~ 148 (191)
.. ...+ +. ++.+... ......+.+... .|..+ .++. +.+.|.+++.++.|+|..+++++++||+|..
T Consensus 163 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~~~~~~~~~~~~~~l~~~~~~~~v~l~~~~~~~l~vwt~~~ 242 (288)
T cd09024 163 PKEELERIPLVGPLGLLGEKKPLLLNEGTLPLTHDLFDDDALIFDNLKSREVTLKSKKTGHGVTVDFDDFPYLGIWSKPN 242 (288)
T ss_pred CcccceEEecCCccccccccccccCCCCeecCCHHHhcCCEEEEcCCCccEEEEEcCCCCCEEEEEeCCCCEEEEeCCCC
Confidence 10 1221 10 1111100 011223444432 24433 3443 6788988766889999988899999999841
Q ss_pred CCCCCCCCCCCCCCceEEecceec------------C--CCeEeCCCCEEEEEEEeC
Q psy17993 149 TGRDMKDFGDDEYPNMICVEAGHV------------T--SPVLLHPGTTFEASQILQ 191 (191)
Q Consensus 149 ~~~~~~d~~~~~~~~fvCiEP~~~------------~--~~~~L~pGe~~~~~~~i~ 191 (191)
.++|||||||+. . +++.|+|||+++.+++|+
T Consensus 243 ------------~~~~iciEP~t~~~da~~~~~~~~~~~gl~~L~PGe~~~~~~~i~ 287 (288)
T cd09024 243 ------------GAPFVCIEPWYGLADSVGFDGDLEDKEGINKLEPGESFEASYSIT 287 (288)
T ss_pred ------------CCCEEEEcCCCCCCCcCCCCccccccCCCEEeCCCCEEEEEEEEE
Confidence 258999999952 0 368999999999999984
No 7
>cd01081 Aldose_epim aldose 1-epimerase superfamily. Aldose 1-epimerases or mutarotases are key enzymes of carbohydrate metabolism; they catalyze the interconversion of the alpha- and beta-anomers of hexose sugars such as glucose and galactose. This interconversion is an important step that allows anomer specific metabolic conversion of sugars. Studies of the catalytic mechanism of the best known member of the family, galactose mutarotase, have shown a glutamate and a histidine residue to be critical for catalysis; the glutamate serves as the active site base to initiate the reaction by removing the proton from the C-1 hydroxyl group of the sugar substrate and the histidine as the active site acid to protonate the C-5 ring oxygen.
Probab=99.94 E-value=1.8e-25 Score=181.96 Aligned_cols=164 Identities=16% Similarity=0.101 Sum_probs=120.1
Q ss_pred CeEEEEEccCC-CcccCCCcceEEEEEEEEeCCceEEEEEEEeCCCCceeEEeeccceeEEeCCC--CceEEeeCCCCee
Q psy17993 10 DAFCIFPYLDI-FCIFPYFARFRLTYRLILREKELHFNIGVYNPSNDLAFNFNLLLHTYLKVPDV--RRCQVNGLYGCHY 86 (191)
Q Consensus 10 ~~~~~~~l~~~-~t~~~yP~~f~l~~~~~L~~~~L~~~l~v~N~~~~~~m~f~~g~HpyF~v~~~--~~~~~~gl~~~~~ 86 (191)
+.+|+|.+..+ .+. .|||+|+++++|+|.+++|+++++|+|++++ +|||++|+||||+++.. +++++.... ..+
T Consensus 93 ~~~v~l~~~~~~~~~-~~P~~~~l~~ty~L~~~~L~i~~~v~N~~~~-~~p~~~g~HpyF~~~~~~~~~~~l~~~~-~~~ 169 (284)
T cd01081 93 EASVTLSYDLNDGPG-GYPFPLELTVTYTLDADTLTITFTVTNLGDE-PMPFGLGWHPYFGLPGVAIEDLRLRVPA-SKV 169 (284)
T ss_pred CcEEEEEEEeCCCCC-CCCEEEEEEEEEEEeCCeEEEEEEEEeCCCC-CcceeeecCceEecCCCcccceEEEecC-CEE
Confidence 56888888877 666 8999999999999999999999999999999 99999999999999874 677776433 344
Q ss_pred eecc----CCcceee----eecceEEeC-CCCCeeEeCCC-----CeEEEEEeCCCceEEEEEeCCCcEEEeCCccCCCC
Q psy17993 87 IDKV----LDNQLFR----ETRETVSVS-SLTDRIYQNTC-----QEHVVTNVVSGRKMRLQKYNFPDTVIWNPWDTGRD 152 (191)
Q Consensus 87 ~d~~----~~~~~~~----~~~~~~~~~-~~~D~iy~~~~-----~~~~l~~~~~~~~i~v~~~~~~~~vvWnp~~~~~~ 152 (191)
+... +.+.... .......+. ...|.+|.... ..+.|.++..++.|++.... ++++||+|+.
T Consensus 170 ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~D~~~~~~~~~~~~~~~~l~~~~~~~~l~~~~~~-~~~~v~t~~~---- 244 (284)
T cd01081 170 LPLDDLLPPTGELEVPGEEDFRLGRPLGGGELDDCFLLLGNDAGTAEARLEDPDSRISVEFETGW-PFWQVYTGDG---- 244 (284)
T ss_pred EecCCccCCCCccCCcccccccCCcCcCCcccccccccCcCCCCeeEEEEECCCCeEEEEEeCCC-CEEEEECCCC----
Confidence 4211 1110000 001112222 35788886543 26788887777788887776 9999999852
Q ss_pred CCCCCCCCCCceEEecceecC---------CCeEeC-CCCEEEEEE
Q psy17993 153 MKDFGDDEYPNMICVEAGHVT---------SPVLLH-PGTTFEASQ 188 (191)
Q Consensus 153 ~~d~~~~~~~~fvCiEP~~~~---------~~~~L~-pGe~~~~~~ 188 (191)
+.++|||||||+.. +.+.|+ ||++++.++
T Consensus 245 -------~~~~~~ciEP~t~~~da~~~~~~~~~~L~~pge~~~~~~ 283 (284)
T cd01081 245 -------GRRGSVAIEPMTSAPDAFFNNNGGLITLKPPGETRTFSI 283 (284)
T ss_pred -------CcCCEEEEccccCCCCCCCCCCCceEEeCCCCceEEEEe
Confidence 24689999999631 369999 999988765
No 8
>cd09022 Aldose_epim_Ec_YihR Aldose 1-epimerase, similar to Escherichia coli YihR. Proteins similar to Escherichia coli YihR are uncharacterized members of aldose-1-epimerase superfamily. Aldose 1-epimerases or mutarotases are key enzymes of carbohydrate metabolism, catalyzing the interconversion of the alpha- and beta-anomers of hexose sugars such as glucose and galactose. This interconversion is an important step that allows anomer specific metabolic conversion of sugars. Studies of the catalytic mechanism of the best known member of the family, galactose mutarotase, have shown a glutamate and a histidine residue to be critical for catalysis; the glutamate serves as the active site base to initiate the reaction by removing the proton from the C-1 hydroxyl group of the sugar substrate, and the histidine as the active site acid to protonate the C-5 ring oxygen.
Probab=99.94 E-value=6.3e-25 Score=179.92 Aligned_cols=170 Identities=15% Similarity=0.134 Sum_probs=119.6
Q ss_pred CCeEEEEEccCCCcccCCCcceEEEEEEEEeCCceEEEEEEEeCCCCceeEEeeccceeEEeCCC--CceEEeeCCCCee
Q psy17993 9 LDAFCIFPYLDIFCIFPYFARFRLTYRLILREKELHFNIGVYNPSNDLAFNFNLLLHTYLKVPDV--RRCQVNGLYGCHY 86 (191)
Q Consensus 9 ~~~~~~~~l~~~~t~~~yP~~f~l~~~~~L~~~~L~~~l~v~N~~~~~~m~f~~g~HpyF~v~~~--~~~~~~gl~~~~~ 86 (191)
++.+++|+|.+. +.+.|||.|+++++|+|.+++|+++++|+|++++ +|||++|+||||++++. .+..+.- ....+
T Consensus 87 ~~~~v~l~l~~~-~~~~yP~~~~~~~~y~L~~~~L~i~~~v~N~~~~-~~p~~~g~HpyF~l~~~~~~~~~L~~-~a~~~ 163 (284)
T cd09022 87 TDSSVTLRTRIP-PQPGYPFTLELTVTYELDDDGLTVTLTATNVGDE-PAPFGVGFHPYLSAGGAPLDECTLTL-PADTW 163 (284)
T ss_pred cCCeEEEEEEeC-CccCCCceEEEEEEEEEcCCcEEEEEEEEeCCCC-CeEeeeEecceEecCCCCcccEEEEE-ECceE
Confidence 345788888664 4578999999999999999999999999999999 99999999999999853 5665542 12223
Q ss_pred eec----cCCcceeeeecceEEeCC-------CCCeeEeCCC------CeEEEEEeCCCceEEEEE-eCCCcEEEeCCcc
Q psy17993 87 IDK----VLDNQLFRETRETVSVSS-------LTDRIYQNTC------QEHVVTNVVSGRKMRLQK-YNFPDTVIWNPWD 148 (191)
Q Consensus 87 ~d~----~~~~~~~~~~~~~~~~~~-------~~D~iy~~~~------~~~~l~~~~~~~~i~v~~-~~~~~~vvWnp~~ 148 (191)
+.. ++.+... ..+..++|.. ..|..|..+. ..++|.++.. +.|+|.. .++++++||+|..
T Consensus 164 ~~~d~~~lptg~~~-~~~~~~df~~~~~l~~~~~D~~f~~~~~~~~~~~~~~l~~~~~-~~l~l~~~~~~~~~~vyt~~~ 241 (284)
T cd09022 164 LPVDERLLPTGTEP-VAGTPYDFRTGRRLGGTALDTAFTDLTRDADGRARARLTGPDG-RGVELWADESFPWVQVFTADT 241 (284)
T ss_pred EecCCccCCCcCcC-CCCCCccCcCCccccCccccccccccccCCCCcEEEEEECCCC-CEEEEEECCCCCEEEEECCCC
Confidence 321 1211110 0111233321 4677776432 2578887765 8899987 5689999999842
Q ss_pred CCCCCCCCCCCCCCceEEecceecC--------CCeEeCCCCEEEEEEEe
Q psy17993 149 TGRDMKDFGDDEYPNMICVEAGHVT--------SPVLLHPGTTFEASQIL 190 (191)
Q Consensus 149 ~~~~~~d~~~~~~~~fvCiEP~~~~--------~~~~L~pGe~~~~~~~i 190 (191)
++....++|||||||+.. +.+.|+|||+++.+++|
T Consensus 242 -------~~~~~~~~~iclEP~~~~~da~n~~~~~~~L~pge~~~~~~~i 284 (284)
T cd09022 242 -------LPPPGRRRGLAVEPMTCPPNAFNSGTDLIVLAPGETHTASWGI 284 (284)
T ss_pred -------CCCCCCCCEEEEccCCCCCCcCcCCCCcEEECCCCEEEEEEeC
Confidence 111113689999999631 36899999999999986
No 9
>PF01263 Aldose_epim: Aldose 1-epimerase; InterPro: IPR008183 Aldose 1-epimerase (5.1.3.3 from EC) (mutarotase) is the enzyme responsible for the anomeric interconversion of D-glucose and other aldoses between their alpha- and beta-forms. The sequence of mutarotase from two bacteria, Acinetobacter calcoaceticus and Streptococcus thermophilus is available []. It has also been shown that, on the basis of extensive sequence similarities, a mutarotase domain seems to be present in the C-terminal half of the fungal GAL10 protein which encodes, in the N-terminal part, UDP-glucose 4-epimerase.; GO: 0016853 isomerase activity, 0005975 carbohydrate metabolic process; PDB: 1YGA_A 3DCD_A 2CIQ_A 2CIS_A 2CIR_A 2HTB_C 2HTA_B 3Q1N_A 1NSZ_B 1NSR_B ....
Probab=99.93 E-value=1e-25 Score=185.22 Aligned_cols=179 Identities=21% Similarity=0.203 Sum_probs=118.7
Q ss_pred eEEEEEccCC-CcccCCCcceEEEEEEEEeCCc-eEEEEEEEeCCCCceeEEeeccceeEEeC----CCCceEEeeCCCC
Q psy17993 11 AFCIFPYLDI-FCIFPYFARFRLTYRLILREKE-LHFNIGVYNPSNDLAFNFNLLLHTYLKVP----DVRRCQVNGLYGC 84 (191)
Q Consensus 11 ~~~~~~l~~~-~t~~~yP~~f~l~~~~~L~~~~-L~~~l~v~N~~~~~~m~f~~g~HpyF~v~----~~~~~~~~gl~~~ 84 (191)
++++|++..+ .+.+.|||+|+++++|+|.+++ |+++++|+|.| + +|||++|+||||+++ +...+.+.+....
T Consensus 103 ~~~~~~~~~~~~~~~~yP~~~~l~~~y~L~~~~~L~i~~~v~n~~-~-~~p~~~g~HpyF~l~~~~~~~~~~~~~~~~~~ 180 (300)
T PF01263_consen 103 DSVSFTLVSDPDGEEGYPFDFRLRITYTLDENGKLTITYEVTNDG-K-PMPFNLGFHPYFNLPGEDIDDHQLQVPADEYL 180 (300)
T ss_dssp TEEEEEEEEEETTHHHSSSEEEEEEEEEEETTEEEEEEEEEEESS-S-EEEEBEEEEEEEETTCTSGTTGEEEEEEEEEE
T ss_pred ceEEEEEEecCccceeeccceeeEEEEEECCCCeEEEEEEEEecC-c-cEEeeccccceEEcCCcceeeeEEEeccceee
Confidence 5677777764 4447799999999999999999 99999999999 9 999999999999999 4445555542211
Q ss_pred eeee-ccCCccee------eeecceEEeCC---CCCeeEeCCC-CeEEEEEeCCCceEEEEEe-CCCcEEEeCCccCCCC
Q psy17993 85 HYID-KVLDNQLF------RETRETVSVSS---LTDRIYQNTC-QEHVVTNVVSGRKMRLQKY-NFPDTVIWNPWDTGRD 152 (191)
Q Consensus 85 ~~~d-~~~~~~~~------~~~~~~~~~~~---~~D~iy~~~~-~~~~l~~~~~~~~i~v~~~-~~~~~vvWnp~~~~~~ 152 (191)
.+.+ .++.+... ........+.. .+|++|.... ....+.....++.|.|+.. ++|+++||+|+...+.
T Consensus 181 ~~~~~~lptg~~~~~~~~~~d~~~~~~i~~~~~~~D~~f~~~~~~~~~~~~~~~~~~i~l~~~~~~p~~~v~t~~~~~~~ 260 (300)
T PF01263_consen 181 ELDERLLPTGELAPVKGTPFDFRQGRPIGDDIPGLDHCFVLDPPGRPVIRLRSPGSGISLEFSTSFPWLVVWTPGFDPKY 260 (300)
T ss_dssp EEETTSSEEEEEEETTTCETCGSSSEEECSSCTCEEEEEEESSSSEEEEEEETTTEEEEEEEEESSSEEEEEEETTSSTE
T ss_pred ecccccCCceeeecccCCccccccCeeecccccccceEEEcCCCccEEEEeecccceEEEEecCCCCcEEEECCCCcccc
Confidence 1111 11111000 00011122222 4799998765 5666666777888888864 5789999999741111
Q ss_pred CCCCCCCCCCceEEecceecC-CCeEeCCCCEEEEEEEeC
Q psy17993 153 MKDFGDDEYPNMICVEAGHVT-SPVLLHPGTTFEASQILQ 191 (191)
Q Consensus 153 ~~d~~~~~~~~fvCiEP~~~~-~~~~L~pGe~~~~~~~i~ 191 (191)
+...+...+..|+|+|+++.. +.+.|+|||+++++++|+
T Consensus 261 ~~~~~~~~~~~~~~~~~~~~~~~~~~L~pGe~~~~~~~~~ 300 (300)
T PF01263_consen 261 ICIEPIAIEPDFVPDEANNPGKGLIVLAPGESYSFSTRYT 300 (300)
T ss_dssp ETTCEEECEEESECECCCECCHEGEEEETTSEEEEEEEEE
T ss_pred cCCcccccCCCeeecccccCCCCCEEECCCCEEEEEEEEC
Confidence 112222223345555555443 369999999999999984
No 10
>PRK15172 putative aldose-1-epimerase; Provisional
Probab=99.93 E-value=4e-24 Score=176.50 Aligned_cols=167 Identities=14% Similarity=0.078 Sum_probs=119.5
Q ss_pred CCeEEEEEccCCCcccCCCcceEEEEEEEEeC-CceEEEEEEEeCCCCceeEEeeccceeEEeCC--CCceEEeeCCCCe
Q psy17993 9 LDAFCIFPYLDIFCIFPYFARFRLTYRLILRE-KELHFNIGVYNPSNDLAFNFNLLLHTYLKVPD--VRRCQVNGLYGCH 85 (191)
Q Consensus 9 ~~~~~~~~l~~~~t~~~yP~~f~l~~~~~L~~-~~L~~~l~v~N~~~~~~m~f~~g~HpyF~v~~--~~~~~~~gl~~~~ 85 (191)
++++++|.+.... ...|||.|+++++|+|.+ ++|+++++|+|++++ +|||++|+||||+++. +.+..+. +....
T Consensus 106 ~~~~v~l~~~~~~-~~gyP~~~~~~v~y~L~~~~~L~i~~~~~n~~~~-~~P~~~g~HpYFnl~~~~~~~~~L~-~~a~~ 182 (300)
T PRK15172 106 TATSVTLTAFLPP-SYGYPFMLASQVIYSLDAATGLSVEIASQNIGDV-PAPYGVGIHPYLTCNLTSVDEYLLQ-LPANQ 182 (300)
T ss_pred cCCEEEEEEEcCC-CCCCCEEEEEEEEEEEccCCeEEEEEEEEECCCC-ceeeEEecCceEecCCCChhceEEE-EeCCe
Confidence 3446778876543 257999999999999984 799999999999999 9999999999999973 5566555 22333
Q ss_pred eee----ccCCcceeeeecceEEeCC-------CCCeeEeCCCC--eEEEEEeCCCceEEEEEeCCCcEEEeCCccCCCC
Q psy17993 86 YID----KVLDNQLFRETRETVSVSS-------LTDRIYQNTCQ--EHVVTNVVSGRKMRLQKYNFPDTVIWNPWDTGRD 152 (191)
Q Consensus 86 ~~d----~~~~~~~~~~~~~~~~~~~-------~~D~iy~~~~~--~~~l~~~~~~~~i~v~~~~~~~~vvWnp~~~~~~ 152 (191)
++. .++.+.. ...+..++|.. .+|..|..+.+ .++|.++..++.|.+..+ .+.++||+|...
T Consensus 183 ~~~~d~~~iPtg~~-~~~~~~~df~~~~~l~~~~~D~~f~~~~~~~~~~l~~~~~g~~l~~~~~-~~~~~vyt~~~~--- 257 (300)
T PRK15172 183 VLAVDEHANPTTLH-HVDELDLDFSQAKKIAATKIDHTFKTANDLWEVRITHPQQALSVSLCSD-QPWLQIYSGEKL--- 257 (300)
T ss_pred EEecCCCcCCCccc-cCCCCCcCCCCCeECCCCCccCEEEcCCCceEEEEEeCCCCeEEEEEcC-CCEEEEECCCCC---
Confidence 442 1222110 00111233322 48999987654 678988877777777654 488999998410
Q ss_pred CCCCCCCCCCceEEecceecC--------CCeEeCCCCEEEEEEEeC
Q psy17993 153 MKDFGDDEYPNMICVEAGHVT--------SPVLLHPGTTFEASQILQ 191 (191)
Q Consensus 153 ~~d~~~~~~~~fvCiEP~~~~--------~~~~L~pGe~~~~~~~i~ 191 (191)
.+.||||||+... +.+.|+|||+++++++|+
T Consensus 258 --------~~~~~ciEp~t~p~dA~n~~~g~~~L~pge~~~~~~~i~ 296 (300)
T PRK15172 258 --------QRQGLAVEPMSCPPNAFNSGIDLLLLEPGKTHRLFFNIG 296 (300)
T ss_pred --------CCCEEEEeCCCCCCCCCCCCCCCEEECCCCEEEEEEEEE
Confidence 2589999999632 368999999999999884
No 11
>cd09019 galactose_mutarotase_like galactose mutarotase_like. Galactose mutarotase catalyzes the conversion of beta-D-galactose to alpha-D-galactose. Beta-D-galactose is produced by the degradation of lactose, a disaccharide composed of beta-D-glucose and beta-D-galactose. This epimerization reaction is the first step in the four-step Leloir pathway, which converts galactose into metabolically important glucose. This epimerization step is followed by the phosophorylation of alpha-D-galactose by galactokinase, an enzyme which can only act on the alpha anomer. A glutamate and a histidine residue of the galactose mutarotase have been shown to be critical for catalysis, the glutamate serves as the active site base to initiate the reaction by removing the proton from the C-1 hydroxyl group of the sugar substrate, and the histidine as the active site acid to protonate the C-5 ring oxygen. Galactose mutarotase is a member of the aldose-1-epimerase superfamily.
Probab=99.89 E-value=8.8e-22 Score=164.33 Aligned_cols=178 Identities=14% Similarity=0.205 Sum_probs=118.0
Q ss_pred CCCeEEEEEccCCCcccCCCcceEEEEEEEEeC-CceEEEEEEEeCCCCceeEEeeccceeEEeCC-----CCceEEeeC
Q psy17993 8 TLDAFCIFPYLDIFCIFPYFARFRLTYRLILRE-KELHFNIGVYNPSNDLAFNFNLLLHTYLKVPD-----VRRCQVNGL 81 (191)
Q Consensus 8 ~~~~~~~~~l~~~~t~~~yP~~f~l~~~~~L~~-~~L~~~l~v~N~~~~~~m~f~~g~HpyF~v~~-----~~~~~~~gl 81 (191)
.++.+|+|++.+.+....|||.|+++++|+|.+ ++|+++++++| ++ +|||++|+||||+++. +.+..+. +
T Consensus 104 ~~~~~v~l~~~~~~~~~gyP~~~~~~v~y~L~~~~~L~i~~~~~~--~~-~~p~~~g~HpyFnl~~~~~~~~~~~~L~-~ 179 (326)
T cd09019 104 VEENSVTFSLVSPDGEEGFPGNLTVTVTYTLTDDNELTIEYEATT--DK-PTPVNLTNHSYFNLAGEGSGDILDHELQ-I 179 (326)
T ss_pred ccCCEEEEEEECCcccCCCCeEEEEEEEEEECCCCEEEEEEEEEe--CC-CeEecccceeeEecCCCCCCCccceEEE-E
Confidence 346689999987655689999999999999998 89999999998 47 9999999999999984 3455554 2
Q ss_pred CCCeeee----ccCCcceeeeecceEEe------------------CCCCCeeEe--CC----CCeEEEEEeCCCceEEE
Q psy17993 82 YGCHYID----KVLDNQLFRETRETVSV------------------SSLTDRIYQ--NT----CQEHVVTNVVSGRKMRL 133 (191)
Q Consensus 82 ~~~~~~d----~~~~~~~~~~~~~~~~~------------------~~~~D~iy~--~~----~~~~~l~~~~~~~~i~v 133 (191)
....++. .++.+.........++| ...+|.+|. .+ ...++|.++..++.|.|
T Consensus 180 ~a~~~~~~d~~~iPtG~~~~~~~~~~df~~~~~l~~~~~~~~~~~~~~~~D~~f~l~~~~~~~~~~a~l~~~~sg~~l~v 259 (326)
T cd09019 180 NADRYLPVDEELIPTGEILPVAGTPFDFRKPKPIGRIDLDDEQLKLGGGYDHNFVLDKGGGKLRPAARLTSPESGRKLEV 259 (326)
T ss_pred ecCcEEeeCCCCCcCCCceecCCCCccccCccCccccccchhhcccCCCcceEEEECCCCCccceeEEEEcCCCCcEEEE
Confidence 2223431 22222110000101111 235888886 33 13578888777888888
Q ss_pred EEeCCCcEEEeCCcc-CCCCCCCCCCCCCCceEEecceecC--------CCeEeCCCCEEEEEEEe
Q psy17993 134 QKYNFPDTVIWNPWD-TGRDMKDFGDDEYPNMICVEAGHVT--------SPVLLHPGTTFEASQIL 190 (191)
Q Consensus 134 ~~~~~~~~vvWnp~~-~~~~~~d~~~~~~~~fvCiEP~~~~--------~~~~L~pGe~~~~~~~i 190 (191)
.. ..+.++||++.. ......+-..-..+.+|||||+... +.+.|+||++++...++
T Consensus 260 ~t-~~p~~~vyT~~~~~~~~~~~~~~~~~~~~iclEpq~~pdA~n~~~~g~~~L~pge~~~~~~~y 324 (326)
T cd09019 260 YT-TQPGVQFYTGNFLDGTPGGGGKVYGKRSGFCLETQHFPDAPNHPNFPSIILRPGETYRHTTVY 324 (326)
T ss_pred ec-CCCEEEEEeCCCCCcccCCCCcEeCCCCEEEEeccCCCCcccccCCCCeEECCCCEEEEEEEE
Confidence 64 468999999853 1100000000023579999999521 25899999999988775
No 12
>COG2017 GalM Galactose mutarotase and related enzymes [Carbohydrate transport and metabolism]
Probab=99.87 E-value=5.4e-21 Score=158.31 Aligned_cols=163 Identities=17% Similarity=0.153 Sum_probs=108.3
Q ss_pred EEEEEccCCCcccCCCcceEEEEEEEEeCCceEEEEEEEeCCCCceeEEeeccceeEEeCCCCc---eEEeeCCCCeeee
Q psy17993 12 FCIFPYLDIFCIFPYFARFRLTYRLILREKELHFNIGVYNPSNDLAFNFNLLLHTYLKVPDVRR---CQVNGLYGCHYID 88 (191)
Q Consensus 12 ~~~~~l~~~~t~~~yP~~f~l~~~~~L~~~~L~~~l~v~N~~~~~~m~f~~g~HpyF~v~~~~~---~~~~gl~~~~~~d 88 (191)
.+++.+.+. ...|||+|+++++|+|.+++|+|+++++|.+++ +|||++|+||||+++...+ .... . ...|+.
T Consensus 116 ~~~l~~~~~--~~gyP~~l~~~vtY~L~~~~L~v~~~~~n~~~~-~~p~~~g~HpYFnl~~~~~~~~~~~~-~-~~~~l~ 190 (308)
T COG2017 116 EFSLVLRDG--EDGYPGNLEATVTYTLNEDGLTVTYEVTNDGDE-PTPFNLGNHPYFNLPGDGRLEHILAI-A-SDYYLP 190 (308)
T ss_pred EEEEEeccc--CCCCCceEEEEEEEEEcCCCEEEEEEEEeCCCC-cceecccccceEecCCCCCcccEEEe-c-CCceEE
Confidence 566666655 234999999999999999999999999999999 9999999999999986543 2111 1 112332
Q ss_pred ccC-Ccceee---eecce--EEeC--CCCCeeEeCC-----CCeEEEEEeCCCceEEEEEe-CCCcEEEeCCccCCCCCC
Q psy17993 89 KVL-DNQLFR---ETRET--VSVS--SLTDRIYQNT-----CQEHVVTNVVSGRKMRLQKY-NFPDTVIWNPWDTGRDMK 154 (191)
Q Consensus 89 ~~~-~~~~~~---~~~~~--~~~~--~~~D~iy~~~-----~~~~~l~~~~~~~~i~v~~~-~~~~~vvWnp~~~~~~~~ 154 (191)
..+ ...+.. ..... ..+. ...|..|... ...+.|.++..+.+|.|..+ .+..++.||+..
T Consensus 191 ~~~~~~ip~~~~~~~~~~~~~~~~~~~~~d~~f~~~~~~~~~~~a~l~~~~~~~~l~v~t~~p~~~~yt~~~~~------ 264 (308)
T COG2017 191 VDDEEPIPTGDIKREPKPLEDDFAADDPYDHAFLLNGARGLKPAARLYDPDSGLSLEVETDEPFVQLYTGNFLA------ 264 (308)
T ss_pred cccCCCCCcccccccccccccccccccccccceeeccccCCcceEEEEcCCCCeEEEEEeCCCeEEEEeCCCCC------
Confidence 111 000000 00000 1111 1236555422 34688998888888888865 566777776531
Q ss_pred CCCCCCCCceEEeccee-cC--------CCeEeCCCCEEEEEEEeC
Q psy17993 155 DFGDDEYPNMICVEAGH-VT--------SPVLLHPGTTFEASQILQ 191 (191)
Q Consensus 155 d~~~~~~~~fvCiEP~~-~~--------~~~~L~pGe~~~~~~~i~ 191 (191)
.++|+|||||. .. ....|+|||+++..++++
T Consensus 265 ------~~~~~clEp~~~~pdA~n~~~~~~~~L~pGe~~~~~~~~~ 304 (308)
T COG2017 265 ------GRDGLCLEPQSGLPDAFNHPGFGLIVLEPGETYSAETRFR 304 (308)
T ss_pred ------cCCeEEeeeccCCCCcCCCCCcccceeCCCCEEEEEEEEE
Confidence 35899999996 31 246799999999988763
No 13
>PLN00194 aldose 1-epimerase; Provisional
Probab=99.84 E-value=2.9e-19 Score=149.70 Aligned_cols=175 Identities=16% Similarity=0.237 Sum_probs=114.8
Q ss_pred eEEEEEccCCCcccCCCcceEEEEEEEEeC-CceEEEEEEEeCCCCceeEEeeccceeEEeCC-----CCceEEeeCCCC
Q psy17993 11 AFCIFPYLDIFCIFPYFARFRLTYRLILRE-KELHFNIGVYNPSNDLAFNFNLLLHTYLKVPD-----VRRCQVNGLYGC 84 (191)
Q Consensus 11 ~~~~~~l~~~~t~~~yP~~f~l~~~~~L~~-~~L~~~l~v~N~~~~~~m~f~~g~HpyF~v~~-----~~~~~~~gl~~~ 84 (191)
.+|+|.|.+.+....|||.|+++++|+|.+ ++|+++++++|. ++ +|||++|+||||+++. +.+..+. +...
T Consensus 119 ~~v~~~l~~~~~~~gyP~~~~~~v~Y~L~~~~~L~i~~~~~n~-~~-~~p~~~g~HpYFnL~~~~~~~i~~~~L~-i~a~ 195 (337)
T PLN00194 119 PSITFKYHSFDGEEGFPGDLSVTVTYTLLSSNTLRLDMEAKPL-NK-ATPVNLAQHTYWNLAGHNSGDILSHKIQ-IFGS 195 (337)
T ss_pred cEEEEEEECCCcCCCCCEEEEEEEEEEECCCCeEEEEEEEEEC-CC-CeEEEccccceEEcCCCCCCCcCCeEEE-EecC
Confidence 689999987645679999999999999975 899999999999 88 9999999999999973 3444444 2223
Q ss_pred eeee----ccCCcceeeeecceEEeC-------------CCCCeeEeC-C------CCeEEEEEeCCCceEEEEEeCCCc
Q psy17993 85 HYID----KVLDNQLFRETRETVSVS-------------SLTDRIYQN-T------CQEHVVTNVVSGRKMRLQKYNFPD 140 (191)
Q Consensus 85 ~~~d----~~~~~~~~~~~~~~~~~~-------------~~~D~iy~~-~------~~~~~l~~~~~~~~i~v~~~~~~~ 140 (191)
.++. .++.++.....+..++|+ ..+|..|.- . ...++|.++..++.++|..+. |.
T Consensus 196 ~~~~~d~~~iPtG~~~~v~~t~~Df~~~~~l~~~~~~~~~g~Dh~f~l~~~~~~~~~~~a~l~~~~sg~~l~v~t~~-p~ 274 (337)
T PLN00194 196 HITPVDENLIPTGEILPVKGTPFDFTTPKKIGSRINELPKGYDHNYVLDGEEKEGLKKAAKVKDPKSGRVLELWTNA-PG 274 (337)
T ss_pred CEEEeCCCcCcCCceeeCCCCCcccCCCcCcchhhccccCCcceEEEECCCCcccceeeEEEEcCCCCCEEEEEeCC-CE
Confidence 3432 122221100001112221 257887762 1 124678887778899998765 89
Q ss_pred EEEeCCcc-CC-CCCCCCCCCCCCceEEecceecC--------CCeEeCCCCEEEEEEEe
Q psy17993 141 TVIWNPWD-TG-RDMKDFGDDEYPNMICVEAGHVT--------SPVLLHPGTTFEASQIL 190 (191)
Q Consensus 141 ~vvWnp~~-~~-~~~~d~~~~~~~~fvCiEP~~~~--------~~~~L~pGe~~~~~~~i 190 (191)
++|+++.. .. .+..... -....+|||||+... +.+.|.|||.+....++
T Consensus 275 v~vyT~n~~~~~~~~~~~~-~~~~~~i~lEpq~~pda~n~~~~~~~~L~pge~~~~~t~y 333 (337)
T PLN00194 275 MQFYTSNYVNGVKGKGGAV-YGKHAGLCLETQGFPDAVNQPNFPSVVVNPGEKYKHTMLF 333 (337)
T ss_pred EEEECCCCCCCccCCCCCE-eCCCCEEEEeccCCCCcccCCCCCCeEECCCCEEEEEEEE
Confidence 99998642 11 0000000 011479999999521 25899999988766554
No 14
>TIGR02636 galM_Leloir galactose mutarotase. Members of this protein family act as galactose mutarotase (D-galactose 1-epimerase) and participate in the Leloir pathway for galactose/glucose interconversion. All members of the seed alignment for this model are found in gene clusters with other enzymes of the Leloir pathway. This enzyme family belongs to the aldose 1-epimerase family, described by pfam model pfam01263. However, the enzyme described as aldose 1-epimerase itself (EC 5.1.3.3) is called broadly specific for D-glucose, L-arabinose, D-xylose, D-galactose, maltose and lactose. The restricted genome context for genes in this family suggests members should act primarily on D-galactose.
Probab=99.75 E-value=2.2e-16 Score=132.27 Aligned_cols=175 Identities=13% Similarity=0.126 Sum_probs=111.2
Q ss_pred CCCeEEEEEccCCCcccCCCcceEEEEEEEE-eCCceEEEEEEEeCCCCceeEEeeccceeEEeCCC------CceEEee
Q psy17993 8 TLDAFCIFPYLDIFCIFPYFARFRLTYRLIL-REKELHFNIGVYNPSNDLAFNFNLLLHTYLKVPDV------RRCQVNG 80 (191)
Q Consensus 8 ~~~~~~~~~l~~~~t~~~yP~~f~l~~~~~L-~~~~L~~~l~v~N~~~~~~m~f~~g~HpyF~v~~~------~~~~~~g 80 (191)
.+..+|+|.+.+......||+.++++++|+| .+++|+++++++ +++ +|||++|.||||+++.. .+..+.
T Consensus 109 ~~~~~v~l~~~~~~~~~gyPg~l~~~vtY~L~~~~~L~i~~~a~--~d~-~tp~nlt~H~YFnL~g~~~~~~i~~~~L~- 184 (335)
T TIGR02636 109 QEEVQVKFSLESPDGDQGFPGNLTVSVTYTLTDDNELTIEYEAT--TDK-ATPFNLTNHVYFNLDGADAGSDVLSHELQ- 184 (335)
T ss_pred cCCCEEEEEEECCCcCCCCCeEEEEEEEEEECCCCEEEEEEEEE--ECC-ceEEeccccceEEcCCCCCCCChhceEEE-
Confidence 3455889999765346789999999999999 568899999998 788 99999999999999742 233332
Q ss_pred CCCCeeee----ccCCcceeeeecceEEeC------------------CCCCeeEeC---C---CCeEEEEEeCCCceEE
Q psy17993 81 LYGCHYID----KVLDNQLFRETRETVSVS------------------SLTDRIYQN---T---CQEHVVTNVVSGRKMR 132 (191)
Q Consensus 81 l~~~~~~d----~~~~~~~~~~~~~~~~~~------------------~~~D~iy~~---~---~~~~~l~~~~~~~~i~ 132 (191)
+....++. .++.++.....+..++|. ...|..|.- . ...++|.++..++.+.
T Consensus 185 i~a~~~~~~d~~~iPtG~~~~v~~t~~Dfr~~~~l~~~~~~~~~~~~~~g~D~~f~l~~~~~~~~~~a~l~~~~sg~~l~ 264 (335)
T TIGR02636 185 LNADRYLPLDEEGIPLGQLKPVDGTSFDFRKEKAIGQDFLANDQQQLAKGYDHAFLLNGERLDGKPAAILTSPDEDLSLE 264 (335)
T ss_pred EECCcEEEeCCCcCcCCceecCCCCccccCCCcCcCcccccccccccCCCcceEEEECCCCCCCceeEEEEcCCCCcEEE
Confidence 22223332 122221100001112221 147877762 1 1246788877777777
Q ss_pred EEEeCCCcEEEeCCccCCCCCCCCCCC----CCCceEEecceecC--------CCe--EeCCCCEEEEEEEe
Q psy17993 133 LQKYNFPDTVIWNPWDTGRDMKDFGDD----EYPNMICVEAGHVT--------SPV--LLHPGTTFEASQIL 190 (191)
Q Consensus 133 v~~~~~~~~vvWnp~~~~~~~~d~~~~----~~~~fvCiEP~~~~--------~~~--~L~pGe~~~~~~~i 190 (191)
|.. +.|.++||++..... +.+.. ..+..|||||+... ..+ .|.|||+++....+
T Consensus 265 v~t-~~p~~~vyT~~~~~~---~~~~~g~~~~~~~gialE~q~~pd~~n~~~~~~~~~~L~pge~~~~~t~y 332 (335)
T TIGR02636 265 VFT-TQPALQIYTGNFLAG---TPNRGGKKYVDHAGLALETQFLPDSPNHPEWGDISCILSPGQEYQHQTRY 332 (335)
T ss_pred Eec-CCCEEEEecCCCcCC---ccCCCCcEeCCCcEEEEecccCCCcccccCCCCCceEECCCCEEEEEEEE
Confidence 765 458999999742110 00101 12579999999631 123 59999998876654
No 15
>cd09023 Aldose_epim_Ec_c4013 Aldose 1-epimerase, similar to Escherichia coli c4013. Proteins, similar to Escherichia coli c4013, are uncharacterized members of aldose-1-epimerase superfamily. Aldose 1-epimerases or mutarotases are key enzymes of carbohydrate metabolism, catalyzing the interconversion of the alpha- and beta-anomers of hexose sugars such as glucose and galactose. This interconversion is an important step that allows anomer specific metabolic conversion of sugars. Studies of the catalytic mechanism of the best known member of the family, galactose mutarotase, have shown a glutamate and a histidine residue to be critical for catalysis; the glutamate serves as the active site base to initiate the reaction by removing the proton from the C-1 hydroxyl group of the sugar substrate, and the histidine as the active site acid to protonate the C-5 ring oxygen.
Probab=99.73 E-value=4.4e-17 Score=133.64 Aligned_cols=162 Identities=12% Similarity=0.044 Sum_probs=103.8
Q ss_pred EEEEEccCC-CcccCCCcceEEEEEEEEeCCceEEEEEEEeCCCCceeEEeeccceeEEeCCCC-ceEEeeCCCCeeeec
Q psy17993 12 FCIFPYLDI-FCIFPYFARFRLTYRLILREKELHFNIGVYNPSNDLAFNFNLLLHTYLKVPDVR-RCQVNGLYGCHYIDK 89 (191)
Q Consensus 12 ~~~~~l~~~-~t~~~yP~~f~l~~~~~L~~~~L~~~l~v~N~~~~~~m~f~~g~HpyF~v~~~~-~~~~~gl~~~~~~d~ 89 (191)
.|++...-. .....|||.++.+++|+|.+++|+++++|+|+|++ +||+++|+||||..|... ++++... ...+...
T Consensus 96 ~v~l~~~~~~~~~~g~~~~l~~~i~~~l~~~~l~i~~~VtN~g~~-~~P~~~~~H~n~~~p~l~~~~~l~~p-~~~~~p~ 173 (284)
T cd09023 96 EIEVSGEVREAALFGENLRLERTIETDLGSNEIRLEDRVTNEGFR-PTPHMLLYHVNFGYPLLDEGARLEIP-SKEVTPR 173 (284)
T ss_pred EEEEEEEEEEeeeecCceEEEEEEEEecCCceEEEEEEEEeCCCC-CCcceEEeeEEcCCcccCCCCEEEec-ccccccC
Confidence 344444433 34567899999999999999999999999999999 999999999999887433 5555421 1122110
Q ss_pred cCCcceeeeecceEEeC----CCCCeeEeCC-------CCeEEEEEeCCCceEEEEEe--CCCcEEEeCCccCCCCCCCC
Q psy17993 90 VLDNQLFRETRETVSVS----SLTDRIYQNT-------CQEHVVTNVVSGRKMRLQKY--NFPDTVIWNPWDTGRDMKDF 156 (191)
Q Consensus 90 ~~~~~~~~~~~~~~~~~----~~~D~iy~~~-------~~~~~l~~~~~~~~i~v~~~--~~~~~vvWnp~~~~~~~~d~ 156 (191)
...... ....-..+. ...+.+|... ...+.|.++..+..+.|..+ .+|++++|+...
T Consensus 174 ~~~~~~--~~~~~~~~~~p~~~~~e~v~~~~~~~d~~g~~~~~l~~~~~g~~~~l~~~~~~~p~~~~W~~~~-------- 243 (284)
T cd09023 174 DAHAAE--GLASWNTYLAPTPGFAEQVYFHEPAADEDGRAPAALVNPRLGLGVEVRFDTDTLPYLTQWKNTG-------- 243 (284)
T ss_pred Chhhcc--cccccccccCCCCCCccEEEEEeccccCCCceeEEEEcCCCCcEEEEEEehhhCCHHHHHhccC--------
Confidence 000000 000000011 1123444321 13577887766767888775 499999999642
Q ss_pred CCCCCCceEEecceec-----------CCCeEeCCCCEEEEEE
Q psy17993 157 GDDEYPNMICVEAGHV-----------TSPVLLHPGTTFEASQ 188 (191)
Q Consensus 157 ~~~~~~~fvCiEP~~~-----------~~~~~L~pGe~~~~~~ 188 (191)
+.+..+||||++. ++++.|+|||+++.++
T Consensus 244 ---~~~y~lalEP~T~~p~~~~~~~~~g~l~~L~PGEs~~~~l 283 (284)
T cd09023 244 ---AGAYVLGLEPATNFPNGRAFEREQGELPTLAPGESRSYRL 283 (284)
T ss_pred ---CCccEEEEccccCCCcchHHHHHCCCcceECCCCeEEEee
Confidence 1134566999962 1369999999998765
No 16
>PRK11055 galM galactose-1-epimerase; Provisional
Probab=99.60 E-value=1.5e-13 Score=115.41 Aligned_cols=173 Identities=11% Similarity=0.108 Sum_probs=105.6
Q ss_pred CCeEEEEEccCCCcccCCCcceEEEEEEEEeC-CceEEEEEEEeCCCCceeEEeeccceeEEeCC------CCceEEeeC
Q psy17993 9 LDAFCIFPYLDIFCIFPYFARFRLTYRLILRE-KELHFNIGVYNPSNDLAFNFNLLLHTYLKVPD------VRRCQVNGL 81 (191)
Q Consensus 9 ~~~~~~~~l~~~~t~~~yP~~f~l~~~~~L~~-~~L~~~l~v~N~~~~~~m~f~~g~HpyF~v~~------~~~~~~~gl 81 (191)
+..+|+|.+.+......||+.++++++|+|.+ ++|+++++++ +++ ++||++|.||||+++. +.+..+. +
T Consensus 115 ~~~~v~l~~~~~~g~~GyPg~l~~~vtY~L~~~~~l~i~~~a~--~d~-~tp~nlt~H~YFnL~g~~~~~~i~~h~L~-i 190 (342)
T PRK11055 115 NDRQVTFSLSSPDGDQGFPGNLGATVTYRLTDDNRVSITYRAT--VDK-PCPVNLTNHAYFNLDGAEEGSDVRNHKLQ-I 190 (342)
T ss_pred cCCEEEEEEECCCcCCCCCeEEEEEEEEEEcCCCeEEEEEEEE--cCC-CeEEeccccceEECCCCCCCCCccceEEE-E
Confidence 34578888876545578999999999999986 4677777665 788 9999999999999974 2233332 2
Q ss_pred CCCeeee----ccCCcceeeeecceEEeC------------------CCCCeeEe-CC----CC-eEEEEEeCCCceEEE
Q psy17993 82 YGCHYID----KVLDNQLFRETRETVSVS------------------SLTDRIYQ-NT----CQ-EHVVTNVVSGRKMRL 133 (191)
Q Consensus 82 ~~~~~~d----~~~~~~~~~~~~~~~~~~------------------~~~D~iy~-~~----~~-~~~l~~~~~~~~i~v 133 (191)
....++. .++.++.....+..++|. ...|..|. .. .. .+.+.++..++.+.|
T Consensus 191 ~a~~~~~~d~~~iPTG~~~~v~~t~~DFr~~~~l~~~~~~~~~~~~~~g~D~~fvl~~~~~~~~~~a~l~~~~sg~~l~v 270 (342)
T PRK11055 191 NADEYLPVDEGGIPNGGLKSVAGTSFDFRQPKTIAQDFLADDDQQKVKGYDHAFLLQAKGDGKKPAAHLWSPDEKLQMKV 270 (342)
T ss_pred ecCCEEEECcccCccCcEeccCCCcccccCCcCcCcccccchhcccCCCcceEEEECCCCcccceeEEEEcCCCCeEEEE
Confidence 2223331 122221100001112221 24677774 21 12 356777767777777
Q ss_pred EEeCCCcEEEeCCccCCCCCCCCCCC----CCCceEEecceec------C----CCeEeCCCCEEEEEEE
Q psy17993 134 QKYNFPDTVIWNPWDTGRDMKDFGDD----EYPNMICVEAGHV------T----SPVLLHPGTTFEASQI 189 (191)
Q Consensus 134 ~~~~~~~~vvWnp~~~~~~~~d~~~~----~~~~fvCiEP~~~------~----~~~~L~pGe~~~~~~~ 189 (191)
.. ..|.++|+++...... .+.. ....-|||||... . ..+.|.||+.+.....
T Consensus 271 ~t-~~p~lqvYT~n~~~~~---~~~~g~~~~~~~gialE~q~~Pda~n~~~f~~~~~~L~pg~~y~~~t~ 336 (342)
T PRK11055 271 YT-TAPALQFYSGNFLAGT---PSRGGGPYADYAGLALESQFLPDSPNHPEWPQPDCILKPGEEYRSLTE 336 (342)
T ss_pred Ec-CCCEEEEecCCccCCc---cCCCCcEeCCCceEEEEcccCCCcccccCCCCCCeEECCCCEEEEEEE
Confidence 64 4699999996421100 0111 1135799999842 1 1579999998876554
No 17
>PTZ00485 aldolase 1-epimerase; Provisional
Probab=99.44 E-value=2.2e-11 Score=103.13 Aligned_cols=172 Identities=10% Similarity=0.063 Sum_probs=106.3
Q ss_pred eEEEEEccCCCcccCCCcceEEEEEEEEe---CCceEEE---EEEEeCCCCceeEEeeccceeEEeCCC-----------
Q psy17993 11 AFCIFPYLDIFCIFPYFARFRLTYRLILR---EKELHFN---IGVYNPSNDLAFNFNLLLHTYLKVPDV----------- 73 (191)
Q Consensus 11 ~~~~~~l~~~~t~~~yP~~f~l~~~~~L~---~~~L~~~---l~v~N~~~~~~m~f~~g~HpyF~v~~~----------- 73 (191)
.+|+|.+.+......||+.++++++|+|. +++|+++ ++++|++++ ++||+++.||||++...
T Consensus 125 ~~V~f~~~~~dg~~GfPG~l~v~vtYtL~~~~~~~L~i~y~a~~~~n~~d~-~Tp~nltnH~YFNL~g~~~~~~~~~~~~ 203 (376)
T PTZ00485 125 IGVRFNYTSPHMENGFPGELVSKVTYSIERSKPNVLKTIYDSYIPETSPAD-ATPVNIFNHAYWNLNGIPERNGKKNAVW 203 (376)
T ss_pred cEEEEEEECCCcCCCCCEEEEEEEEEEEecCCCCEEEEEEEEEeccccCCc-cceeeeccceeEEcCCCccccccccccc
Confidence 47888887654568999999999999996 5889999 888999999 99999999999999631
Q ss_pred ------CceEEeeCCCCeeee----ccCCcceeeeecceEEeC-------------------CCCCeeEe-CC--C---C
Q psy17993 74 ------RRCQVNGLYGCHYID----KVLDNQLFRETRETVSVS-------------------SLTDRIYQ-NT--C---Q 118 (191)
Q Consensus 74 ------~~~~~~gl~~~~~~d----~~~~~~~~~~~~~~~~~~-------------------~~~D~iy~-~~--~---~ 118 (191)
.+..+. +....++. .++.++.....+..++|. ...|..|. +. . .
T Consensus 204 ~~~~~i~~h~L~-i~a~~~l~~de~~IPTG~i~~v~~t~fDfr~~~~ig~~~~~~~~~~~~~~G~Dh~fvl~~~~~~~~~ 282 (376)
T PTZ00485 204 VQPESVRNHWLR-VPASRVAEADRMAIPTGEFLSVEGTGLDFRQGRVIGDCIDDVALLDRDPCGYDHPLAIDGWEKGKLM 282 (376)
T ss_pred cCCCcccceEEE-EecCcEEEeCcccCccCceeccCCCCccCcCCcccchhhhhhhhhcccCCCCCcceEECCCCCcccc
Confidence 121222 11222321 122221000001111111 23566554 22 1 1
Q ss_pred -eEEEEEeCCCceEEEEEeCCCcEEEeCCcc-CCCCCCCCCCC--CCCceEEecceec----C-----CCeEeCCCCE-E
Q psy17993 119 -EHVVTNVVSGRKMRLQKYNFPDTVIWNPWD-TGRDMKDFGDD--EYPNMICVEAGHV----T-----SPVLLHPGTT-F 184 (191)
Q Consensus 119 -~~~l~~~~~~~~i~v~~~~~~~~vvWnp~~-~~~~~~d~~~~--~~~~fvCiEP~~~----~-----~~~~L~pGe~-~ 184 (191)
.+.+.++..++.+.|.. ..|.++|++... ...... .... ....-||+||... . ..+.|.|||. +
T Consensus 283 ~~a~l~~p~sg~~l~v~T-~~P~~qiYT~n~l~~~~~~-~~g~~~~~~~giclE~Q~~PDa~n~~~~fp~~vl~pge~~~ 360 (376)
T PTZ00485 283 LHAEAKSPVTNICMKVYS-TFPCMWVYTANNKPLPASG-GPGQRYARWTGMGLEPQYFPDVANHYPKYPSCIVRRGERRF 360 (376)
T ss_pred EEEEEEcCCCCCEEEEEE-CCCEEEEECCCCCCccccC-CCCcCcCCCCEEEEeccCCCCcccccCCCCCeEECCCCeEE
Confidence 35788888888888875 469999998643 110000 0000 1125789999742 1 1478999999 4
Q ss_pred EE
Q psy17993 185 EA 186 (191)
Q Consensus 185 ~~ 186 (191)
..
T Consensus 361 ~~ 362 (376)
T PTZ00485 361 TE 362 (376)
T ss_pred EE
Confidence 43
No 18
>cd09269 deoxyribose_mutarotase deoxyribose mutarotase_like. Salmonella enterica serovar Typhi DeoM (earlier named as DeoX) is a mutarotase with high specificity for deoxyribose. It is encoded by one of four genes beonging to the deoK operon. This operon has also been found in Escherichia coli where it is more common in pathogenic than in commensal strains and is associated with pathogenicity. It has been found on a pathogenicity island from a human blood isolate AL863 and confers the ability to use deoxyribose as a carbon source; deoxyribose is not fermented by non-pathogenic E.coli K-12. Proteins in this family are members of the aldose-1-epimerase superfamily. Aldose 1-epimerases, or mutarotases, are key enzymes of carbohydrate metabolism, catalyzing the interconversion of the alpha- and beta-anomers of hexose sugars such as glucose and galactose. This interconversion is an important step that allows anomer specific metabolic conversion of sugars. Studies of the catalytic mechani
Probab=99.13 E-value=8.8e-10 Score=90.72 Aligned_cols=163 Identities=11% Similarity=0.055 Sum_probs=95.8
Q ss_pred EEEccCC-CcccCCCcceEEEEEEEEeC--CceEEEEEEEeCCCCceeEEeeccceeEEeCCCCceEEeeCC-CCeeeec
Q psy17993 14 IFPYLDI-FCIFPYFARFRLTYRLILRE--KELHFNIGVYNPSNDLAFNFNLLLHTYLKVPDVRRCQVNGLY-GCHYIDK 89 (191)
Q Consensus 14 ~~~l~~~-~t~~~yP~~f~l~~~~~L~~--~~L~~~l~v~N~~~~~~m~f~~g~HpyF~v~~~~~~~~~gl~-~~~~~d~ 89 (191)
.+.+... +-.+.||+.|+++.+|+|.. +.|+|+++|+|+++. +||+++++||+|.......+...-+. .......
T Consensus 88 ~l~l~g~~~~~~~fg~~y~a~~~i~L~~g~~~l~i~~~VtN~g~~-p~p~~~~~H~nfg~~~garl~~p~~~~~~~~~~~ 166 (293)
T cd09269 88 YLALTGEYEYVQGFGHHYLAQPSVTLRAGSALFDIGMDVTNLSAQ-PMPLMYMCHMNYAYVEGARIVQNLPDEAFVLRRS 166 (293)
T ss_pred EEEEEEEEEeeeccCccEEEEEEEEEeCCCCEEEEEEEEEECCCC-CChhhEecccccCCCCCCEEEccCCcccceeeec
Confidence 3555544 44468999999999999987 899999999999999 99999999999987322222211100 0000000
Q ss_pred cCC-------cce----eeeecc---eEEeCCCC--CeeEeCC--C-----CeEEEEEeCCCceEEEEE--eCCCcEEEe
Q psy17993 90 VLD-------NQL----FRETRE---TVSVSSLT--DRIYQNT--C-----QEHVVTNVVSGRKMRLQK--YNFPDTVIW 144 (191)
Q Consensus 90 ~~~-------~~~----~~~~~~---~~~~~~~~--D~iy~~~--~-----~~~~l~~~~~~~~i~v~~--~~~~~~vvW 144 (191)
++. ... ...... .+...... ..+|+.. . ..+.|.++.++ .+.+.+ +.||++..|
T Consensus 167 ~p~~v~p~~~w~~~~~~~~~~~~~~~~~~~P~~~~~E~V~~~~~~~~~~~~~~~~l~n~~g~-~~~~~f~~~~lP~~~~W 245 (293)
T cd09269 167 VPAHVKPTPAWLAYNEALVADPARGDVLDKPDLYDPEIVFFADDLGKYTGWAHFMMVHPDGD-AFYTRFSTAEFPYATRW 245 (293)
T ss_pred ccCccCCchhhccchHHHhhCchhhhhccCCCCCCccEEEEeecccccCCcEEEEEECCCCC-EEEEEEchhhCChhhee
Confidence 000 000 000000 01111112 2344421 1 35677776644 677765 579999999
Q ss_pred CCccCCCCCCCCCCCCCCceE--Eecceec-----------CCCeEeCCCCEEEEEEEe
Q psy17993 145 NPWDTGRDMKDFGDDEYPNMI--CVEAGHV-----------TSPVLLHPGTTFEASQIL 190 (191)
Q Consensus 145 np~~~~~~~~d~~~~~~~~fv--CiEP~~~-----------~~~~~L~pGe~~~~~~~i 190 (191)
--.. ...+| .+||++. +..+.|+|||+.+.++++
T Consensus 246 ~~~~------------~~~~v~~~~~PaT~~p~g~~~ar~~G~l~~L~pGe~~~f~l~~ 292 (293)
T cd09269 246 ILYN------------GDQQVAAFALPATCRPEGYLAAKEAGTLRTLAPGETRRFSVTT 292 (293)
T ss_pred hhcC------------CCcceEEEEccCCCCcCChHHHHHCCCcceeCCCCeEEEEEec
Confidence 6321 01233 2359863 126999999999988764
No 19
>PF14486 DUF4432: Domain of unknown function (DUF4432); PDB: 3TY1_A.
Probab=98.59 E-value=8e-07 Score=73.63 Aligned_cols=153 Identities=13% Similarity=0.146 Sum_probs=82.0
Q ss_pred CCCcceEEEEEEEE--eCCceEEEEEEEeCCCCceeEEeeccceeEEeCCC-CceEEeeCCCCeeeeccCCcceeeeecc
Q psy17993 25 PYFARFRLTYRLIL--REKELHFNIGVYNPSNDLAFNFNLLLHTYLKVPDV-RRCQVNGLYGCHYIDKVLDNQLFRETRE 101 (191)
Q Consensus 25 ~yP~~f~l~~~~~L--~~~~L~~~l~v~N~~~~~~m~f~~g~HpyF~v~~~-~~~~~~gl~~~~~~d~~~~~~~~~~~~~ 101 (191)
.+=..|+|+=++++ ..+.++++.+|+|.+.. +||+.+.+|.-|..|-. +..++. .....+....+........-.
T Consensus 123 ~fg~~l~l~r~i~~~~g~~~i~i~d~VtN~~~~-p~p~m~lyH~N~G~pll~eg~ri~-~p~~~~~~~~~~a~~~~~~~~ 200 (302)
T PF14486_consen 123 GFGENLRLERTIRLRAGSNTIRIEDRVTNLGFQ-PMPLMYLYHMNFGYPLLDEGARIV-APTKEVTPRDDRAAEGIADWD 200 (302)
T ss_dssp TTTEEEEEEEEEEE-TT-SEEEEEEEEEE-SSS--EEEEEEEEEEE-TTT-STT-EEE---EEEEEESSGGGGGGSTTTT
T ss_pred eccCcEEEEEEEEEECCCcEEEEEEEEEECCCC-CchhHHhhhhccCccccCCCcEEE-cCcccccCCchhhhcCCccce
Confidence 34455666666665 56889999999999999 99999999999998843 234443 111112211110000000000
Q ss_pred eEEeCC--CCCeeEeC-----CCCe--EEEEEeCCCceEEEEE--eCCCcEEEeCCccCCCCCCCCCCCCCCceEEecce
Q psy17993 102 TVSVSS--LTDRIYQN-----TCQE--HVVTNVVSGRKMRLQK--YNFPDTVIWNPWDTGRDMKDFGDDEYPNMICVEAG 170 (191)
Q Consensus 102 ~~~~~~--~~D~iy~~-----~~~~--~~l~~~~~~~~i~v~~--~~~~~~vvWnp~~~~~~~~d~~~~~~~~fvCiEP~ 170 (191)
.+.-.. .-+.+|+. ..+. +.|..+..+..+.|.+ +.+|++.+|-... + ...-+.|||+
T Consensus 201 ~~~~P~~~~~E~Vy~~~~~~d~~G~~~~~l~n~~~g~g~~v~f~~~~lP~~~~Wk~~~-~----------~~yv~gLEP~ 269 (302)
T PF14486_consen 201 RMPAPQPGFPEQVYFHDLLADEDGWAHAALVNPDGGLGFEVRFDTSQLPYLTQWKNYG-G----------GEYVLGLEPA 269 (302)
T ss_dssp B---S-TT---EEEEEEE---TTSEEEEEEE-SSSS-EEEEEEETTTS-EEEEEEEES------------T-EEEEEEEE
T ss_pred ecCCCCCCCCcEEEEecccccCCCcEEEEEECCCCCcEEEEEEchHHCChhHhheeCC-C----------CEeEEEEecc
Confidence 111111 13455542 1233 6788887778888887 4699999996532 1 1244589999
Q ss_pred ec-----------CCCeEeCCCCEEEEEEEe
Q psy17993 171 HV-----------TSPVLLHPGTTFEASQIL 190 (191)
Q Consensus 171 ~~-----------~~~~~L~pGe~~~~~~~i 190 (191)
+. +....|+|||+.+.++++
T Consensus 270 T~~p~g~~~~~~~G~l~~L~pge~~~~~l~~ 300 (302)
T PF14486_consen 270 TCRPEGRAAAREAGTLPMLAPGESREFSLEF 300 (302)
T ss_dssp SS-SS-HHHHHHTT--EEE-TT-EEEEEEEE
T ss_pred cCCCCCHHHHHhCCCcceECCCCeEEEEEEe
Confidence 63 136999999999998876
No 20
>KOG1604|consensus
Probab=98.13 E-value=0.00027 Score=58.50 Aligned_cols=169 Identities=16% Similarity=0.206 Sum_probs=97.6
Q ss_pred EEEEccCCCcccCCCcceEEEEEEEEeC-CceEEEEEEEeCCCCceeEEeeccceeEEeCCC-----CceEEeeCCCCee
Q psy17993 13 CIFPYLDIFCIFPYFARFRLTYRLILRE-KELHFNIGVYNPSNDLAFNFNLLLHTYLKVPDV-----RRCQVNGLYGCHY 86 (191)
Q Consensus 13 ~~~~l~~~~t~~~yP~~f~l~~~~~L~~-~~L~~~l~v~N~~~~~~m~f~~g~HpyF~v~~~-----~~~~~~gl~~~~~ 86 (191)
++|...+.+..+.||=+.++.++|+|.+ +.|.+.+..+=. ++ +-|..+.-|.||++... .+-.++ +.+..+
T Consensus 132 i~f~~~s~dg~eg~PG~l~V~vtYtLn~~n~l~i~~~A~~~-~~-~TPiNLtnHsYfNL~g~~s~~I~~heI~-i~a~~~ 208 (353)
T KOG1604|consen 132 IVFSHLSPDGDEGFPGDLKVTVTYTLNVANRLLIMMEATAL-DK-ATPINLTNHSYFNLAGHNSGGIEGHEIQ-IEASKI 208 (353)
T ss_pred EEEEEECCCCCCCCCccEEEEEEEEEccCCeeeeeehhhcc-CC-CcceeeccceeEeccCCCCCCccceEEE-Eeeccc
Confidence 5666655567778999999999999986 778888877655 66 79999999999999742 222222 112222
Q ss_pred ee----ccCCcceeeeecceEEeC---------------CCCCeeEe--CC-C-----CeEEEEEeCCCceEEEEEeCCC
Q psy17993 87 ID----KVLDNQLFRETRETVSVS---------------SLTDRIYQ--NT-C-----QEHVVTNVVSGRKMRLQKYNFP 139 (191)
Q Consensus 87 ~d----~~~~~~~~~~~~~~~~~~---------------~~~D~iy~--~~-~-----~~~~l~~~~~~~~i~v~~~~~~ 139 (191)
.+ .++.++...-....+++. ...|..|. +. + ..+.+.++..+|.+.|.. +.|
T Consensus 209 ~evd~~~iPTG~I~~v~~t~fD~r~p~~lgd~~~~~~~i~g~d~n~~~~~~~~~~~l~~v~k~~hp~Sgr~lEv~T-nqP 287 (353)
T KOG1604|consen 209 TEVDDTLIPTGEITPVKGTTFDFRKPTVLGDRIKQFECIVGYDINYVLDGSVPPNKLRKVAKAVHPKSGRKLEVST-NQP 287 (353)
T ss_pred EecCCccccccceEeccCccccccCCeeccccccccccccCCCcceEEccCCCCcccEEEEEEEcCccCcEEEEEe-CCC
Confidence 21 112221111111112221 12333332 21 1 135677788888888764 357
Q ss_pred cEEEeCCccCCCCCCCCC---C--CCCCceEEecceec----C----CCeEeCCCCEEEEEEE
Q psy17993 140 DTVIWNPWDTGRDMKDFG---D--DEYPNMICVEAGHV----T----SPVLLHPGTTFEASQI 189 (191)
Q Consensus 140 ~~vvWnp~~~~~~~~d~~---~--~~~~~fvCiEP~~~----~----~~~~L~pGe~~~~~~~ 189 (191)
-+-+++-. -|.|.. . -....++|+|+... . ..+.|.|||++..++.
T Consensus 288 gvqfYTgn----~~~~~~gk~g~~y~k~g~~cletq~~pda~n~~~fp~v~l~pGE~Y~h~~~ 346 (353)
T KOG1604|consen 288 GVQFYTGN----FLPDIKGKKGAVYPKHGGLCLETQYFPDAVNHPNFPSVILRPGETYTHETV 346 (353)
T ss_pred cEEEEecc----ccccccCCCceEeeccceEEeecccCccccccCCCCceEecCCCeeeeEEE
Confidence 77777621 011100 0 00236999999642 1 2499999998876553
No 21
>COG0832 UreB Urea amidohydrolase (urease) beta subunit [Amino acid transport and metabolism]
Probab=90.06 E-value=0.56 Score=32.24 Aligned_cols=32 Identities=16% Similarity=0.090 Sum_probs=25.2
Q ss_pred EEEEeCCceEEEEEEEeCCCCceeEEeeccceeEE
Q psy17993 35 RLILREKELHFNIGVYNPSNDLAFNFNLLLHTYLK 69 (191)
Q Consensus 35 ~~~L~~~~L~~~l~v~N~~~~~~m~f~~g~HpyF~ 69 (191)
.++|......++++|.|+|+. |.+.|-|-.|.
T Consensus 12 ~IelN~gr~~~~i~V~NtGDR---PIQVGSHfHF~ 43 (106)
T COG0832 12 DIELNAGRPTVTIEVANTGDR---PIQVGSHFHFF 43 (106)
T ss_pred cEEEeCCCcceEEEEeecCCC---ceEeecceeeh
Confidence 456777788999999999987 66777777664
No 22
>cd00407 Urease_beta Urease beta-subunit; Urease is a nickel-dependent metalloenzyme that catalyzes the hydrolysis of urea to form ammonia and carbon dioxide. Nickel-dependent ureases are found in bacteria, archaea, fungi and plants. Their primary role is to allow the use of external and internally-generated urea as a nitrogen source. The enzyme consists of three subunits, alpha, beta and gamma, which can exist as separate proteins or can be fused on a single protein chain. The alpha-beta-gamma heterotrimer forms multimers, mainly trimers. The large alpha subunit is the catalytic domain containing an active site with a bi-nickel center complexed by a carbamylated lysine. The beta and gamma subunits play a role in subunit association to form the higher order trimers.
Probab=89.57 E-value=0.52 Score=32.51 Aligned_cols=32 Identities=13% Similarity=0.068 Sum_probs=24.9
Q ss_pred EEEEeCCceEEEEEEEeCCCCceeEEeeccceeEE
Q psy17993 35 RLILREKELHFNIGVYNPSNDLAFNFNLLLHTYLK 69 (191)
Q Consensus 35 ~~~L~~~~L~~~l~v~N~~~~~~m~f~~g~HpyF~ 69 (191)
.+.|..+.=+++++|+|+|+. |.++|-|-.|.
T Consensus 12 ~I~lN~gr~~~~l~V~NtGDR---pIQVGSH~HF~ 43 (101)
T cd00407 12 DIELNAGREAVTLKVKNTGDR---PIQVGSHYHFF 43 (101)
T ss_pred CeEeCCCCCEEEEEEEeCCCc---ceEEccccchh
Confidence 355666777899999999977 67778887764
No 23
>TIGR00192 urease_beta urease, beta subunit. In a number of species, including B.subtilis, Synechocystis, and Haemophilus influenzae, urease subunits beta and gamma are encoded as separate polypeptides. In Helicobacter pylori UreA and in the fission yeast Schizosaccharomyces pombe, beta subunit-like sequence follows gamma subunit-like sequence in a single chain; the fission yeast protein contains additional C-terminal regions.
Probab=89.43 E-value=0.56 Score=32.30 Aligned_cols=31 Identities=13% Similarity=0.139 Sum_probs=24.6
Q ss_pred EEEeCCceEEEEEEEeCCCCceeEEeeccceeEE
Q psy17993 36 LILREKELHFNIGVYNPSNDLAFNFNLLLHTYLK 69 (191)
Q Consensus 36 ~~L~~~~L~~~l~v~N~~~~~~m~f~~g~HpyF~ 69 (191)
+.|..+.=++++.|+|+|+. |.++|-|-.|.
T Consensus 13 I~ln~gr~~~~l~V~NtGDR---PIQVGSHyHF~ 43 (101)
T TIGR00192 13 ITINEGRKTVSVKVKNTGDR---PIQVGSHFHFF 43 (101)
T ss_pred EEeCCCCcEEEEEEEeCCCc---ceEEccccchh
Confidence 55766777899999999977 67778787764
No 24
>PRK13203 ureB urease subunit beta; Reviewed
Probab=89.13 E-value=0.59 Score=32.27 Aligned_cols=31 Identities=13% Similarity=0.077 Sum_probs=24.3
Q ss_pred EEEeCCceEEEEEEEeCCCCceeEEeeccceeEE
Q psy17993 36 LILREKELHFNIGVYNPSNDLAFNFNLLLHTYLK 69 (191)
Q Consensus 36 ~~L~~~~L~~~l~v~N~~~~~~m~f~~g~HpyF~ 69 (191)
+.|..+.=+++++|+|+|+. |.++|-|-.|.
T Consensus 13 I~ln~gr~~~~l~V~NtGDR---PIQVGSH~HF~ 43 (102)
T PRK13203 13 IELNAGRETVTLTVANTGDR---PIQVGSHYHFF 43 (102)
T ss_pred EEeCCCCCEEEEEEEeCCCC---ceEEccccchh
Confidence 45666777899999999977 67777777664
No 25
>PF00699 Urease_beta: Urease beta subunit CAUTION: The Prosite patterns do not match this subunit of the enzyme; InterPro: IPR002019 Urease 3.5.1.5 from EC is a nickel-binding enzyme that catalyzes the hydrolysis of urea to carbon dioxide and ammonia []: Urea + H2O = CO2 + 2 NH3 Historically, it was the first enzyme to be crystallized (in 1926). It is mainly found in plant seeds and microorganisms. In plants, urease is a hexamer of identical chains. In bacteria [], it consists of either two or three different subunits (alpha IPR005847 from INTERPRO, beta, described in this entry, and gamma IPR002026 from INTERPRO). The structure of the urease complex is known []. This subunit does not appear to take part in the catalytic mechanism. This subunit is known (confusingly) as alpha in Helicobacter.; GO: 0009039 urease activity, 0016151 nickel ion binding, 0006807 nitrogen compound metabolic process; PDB: 1EJS_B 1EJW_B 1A5N_B 1A5K_B 1A5M_B 1EJR_B 1EJX_B 1A5L_B 1KRB_B 1FWA_B ....
Probab=88.35 E-value=0.48 Score=32.57 Aligned_cols=31 Identities=13% Similarity=0.103 Sum_probs=21.9
Q ss_pred EEEeCCceEEEEEEEeCCCCceeEEeeccceeEE
Q psy17993 36 LILREKELHFNIGVYNPSNDLAFNFNLLLHTYLK 69 (191)
Q Consensus 36 ~~L~~~~L~~~l~v~N~~~~~~m~f~~g~HpyF~ 69 (191)
+.|..+.=+++++|+|+|+. |.++|-|-.|.
T Consensus 12 I~lN~gr~~~~l~V~N~GDR---PIQVGSH~HF~ 42 (100)
T PF00699_consen 12 IELNAGRERITLEVTNTGDR---PIQVGSHYHFF 42 (100)
T ss_dssp EETTTTSEEEEEEEEE-SSS----EEEETTS-GG
T ss_pred EEecCCCcEEEEEEEeCCCc---ceEEccccCHH
Confidence 55666778999999999977 67777776663
No 26
>PRK13202 ureB urease subunit beta; Reviewed
Probab=87.30 E-value=0.93 Score=31.39 Aligned_cols=31 Identities=10% Similarity=0.181 Sum_probs=23.3
Q ss_pred EEEeCCc-eEEEEEEEeCCCCceeEEeeccceeEE
Q psy17993 36 LILREKE-LHFNIGVYNPSNDLAFNFNLLLHTYLK 69 (191)
Q Consensus 36 ~~L~~~~-L~~~l~v~N~~~~~~m~f~~g~HpyF~ 69 (191)
+.|..+. =+++++|+|+|+. |.++|-|-.|.
T Consensus 13 I~ln~grr~~~~l~V~NtGDR---PIQVGSHyHF~ 44 (104)
T PRK13202 13 IEMNAAALSRLQMRIINAGDR---PVQVGSHVHLP 44 (104)
T ss_pred EEeCCCCCceEEEEEEeCCCC---ceEEccccchh
Confidence 5566553 5889999999977 67777777664
No 27
>PRK13201 ureB urease subunit beta; Reviewed
Probab=86.61 E-value=0.98 Score=32.68 Aligned_cols=31 Identities=13% Similarity=0.061 Sum_probs=24.2
Q ss_pred EEEeCCceEEEEEEEeCCCCceeEEeeccceeEE
Q psy17993 36 LILREKELHFNIGVYNPSNDLAFNFNLLLHTYLK 69 (191)
Q Consensus 36 ~~L~~~~L~~~l~v~N~~~~~~m~f~~g~HpyF~ 69 (191)
+.|..+.=++++.|+|+|+. |.++|-|-.|.
T Consensus 13 I~lN~gr~~~~l~V~NtGDR---PIQVGSHyHF~ 43 (136)
T PRK13201 13 VEINNHHPETVIEVENTGDR---PIQVGSHFHFY 43 (136)
T ss_pred eEeCCCCCEEEEEEEeCCCc---ceEeccccchh
Confidence 55666777899999999977 67777776664
No 28
>PRK13205 ureB urease subunit beta; Reviewed
Probab=86.46 E-value=0.99 Score=33.42 Aligned_cols=31 Identities=13% Similarity=0.019 Sum_probs=24.5
Q ss_pred EEEeCCceEEEEEEEeCCCCceeEEeeccceeEE
Q psy17993 36 LILREKELHFNIGVYNPSNDLAFNFNLLLHTYLK 69 (191)
Q Consensus 36 ~~L~~~~L~~~l~v~N~~~~~~m~f~~g~HpyF~ 69 (191)
+.|..+.=+++++|+|+|+. |+++|-|-.|.
T Consensus 13 IelN~GR~~i~L~V~NtGDR---PIQVGSHyHF~ 43 (162)
T PRK13205 13 LTGNVGREAKTIEIINTGDR---PVQIGSHFHFA 43 (162)
T ss_pred eEeCCCCcEEEEEEEeCCCC---ceEeccccchh
Confidence 55666778899999999977 67777777664
No 29
>PRK13204 ureB urease subunit beta; Reviewed
Probab=85.49 E-value=1.2 Score=33.09 Aligned_cols=32 Identities=9% Similarity=0.073 Sum_probs=25.4
Q ss_pred EEEEeCCceEEEEEEEeCCCCceeEEeeccceeEE
Q psy17993 35 RLILREKELHFNIGVYNPSNDLAFNFNLLLHTYLK 69 (191)
Q Consensus 35 ~~~L~~~~L~~~l~v~N~~~~~~m~f~~g~HpyF~ 69 (191)
.+.|..+.=.+++.|+|+|+. |+++|-|-.|.
T Consensus 35 ~I~lN~gr~~~~l~V~NtGDR---PIQVGSHyHF~ 66 (159)
T PRK13204 35 PIEINQGRPRTTLTVRNTGDR---PIQIGSHFHFF 66 (159)
T ss_pred CeEeCCCCcEEEEEEEeCCCC---ceEeccccchh
Confidence 466777778999999999977 67777777664
No 30
>PF14315 DUF4380: Domain of unknown function (DUF4380)
Probab=85.46 E-value=18 Score=29.42 Aligned_cols=34 Identities=6% Similarity=0.075 Sum_probs=28.9
Q ss_pred cceEEEEEEEEeCCc--eEEEEEEEeCCCCceeEEee
Q psy17993 28 ARFRLTYRLILREKE--LHFNIGVYNPSNDLAFNFNL 62 (191)
Q Consensus 28 ~~f~l~~~~~L~~~~--L~~~l~v~N~~~~~~m~f~~ 62 (191)
...+++.+|+|.+++ ++++.+++|.++. ++++++
T Consensus 109 tgiq~~~~i~l~~~~~~i~v~~~i~N~~~~-~~~~a~ 144 (274)
T PF14315_consen 109 TGIQKERTITLDADRPSIEVTHRITNIGDW-PVEWAP 144 (274)
T ss_pred cCcEEEEEEEECCCCCEEEEEEEEEeCCCC-cceeee
Confidence 457899999998776 9999999999999 887554
No 31
>PRK13198 ureB urease subunit beta; Reviewed
Probab=85.30 E-value=1.2 Score=33.00 Aligned_cols=32 Identities=13% Similarity=0.103 Sum_probs=25.4
Q ss_pred EEEEeCCceEEEEEEEeCCCCceeEEeeccceeEE
Q psy17993 35 RLILREKELHFNIGVYNPSNDLAFNFNLLLHTYLK 69 (191)
Q Consensus 35 ~~~L~~~~L~~~l~v~N~~~~~~m~f~~g~HpyF~ 69 (191)
.+.|..+.=++++.|+|+|+. |.++|-|-.|.
T Consensus 40 ~I~lN~gr~~~~l~V~NtGDR---PIQVGSHyHF~ 71 (158)
T PRK13198 40 PITFNENKPVTKVKVRNTGDR---PIQVGSHFHFF 71 (158)
T ss_pred CeEeCCCCcEEEEEEEeCCCC---ceEeccccchh
Confidence 466777778999999999977 66777776664
No 32
>PRK13192 bifunctional urease subunit gamma/beta; Reviewed
Probab=79.42 E-value=2.4 Score=33.02 Aligned_cols=42 Identities=12% Similarity=0.023 Sum_probs=29.8
Q ss_pred CCCcceEEE-EEEEEeCCceEEEEEEEeCCCCceeEEeeccceeEE
Q psy17993 25 PYFARFRLT-YRLILREKELHFNIGVYNPSNDLAFNFNLLLHTYLK 69 (191)
Q Consensus 25 ~yP~~f~l~-~~~~L~~~~L~~~l~v~N~~~~~~m~f~~g~HpyF~ 69 (191)
.+|=.+.+. =.+.|..+.=+++++|+|+|+. |.++|-|-.|.
T Consensus 110 l~PGei~~~~~~I~lN~gr~~~~l~V~NtGDR---PIQVGSHyHF~ 152 (208)
T PRK13192 110 LYPGEILPGDGEIELNAGRPAVTLDVTNTGDR---PIQVGSHFHFF 152 (208)
T ss_pred cCCCEEEcCCCCeeeCCCCCEEEEEEEeCCCC---ceeeccccchh
Confidence 455554432 2466777778899999999977 67777777664
No 33
>PRK13986 urease subunit alpha; Provisional
Probab=78.18 E-value=2.5 Score=33.27 Aligned_cols=32 Identities=9% Similarity=0.068 Sum_probs=25.4
Q ss_pred EEEEeCCceEEEEEEEeCCCCceeEEeeccceeEE
Q psy17993 35 RLILREKELHFNIGVYNPSNDLAFNFNLLLHTYLK 69 (191)
Q Consensus 35 ~~~L~~~~L~~~l~v~N~~~~~~m~f~~g~HpyF~ 69 (191)
.+.|..+.=++++.|+|+|+. |.+.|-|-.|.
T Consensus 117 ~I~lN~gr~~~~l~V~NtGDR---PIQVGSHyHF~ 148 (225)
T PRK13986 117 DITINAGKKAVSVKVKNVGDR---PVQVGSHFHFF 148 (225)
T ss_pred CeecCCCCcEEEEEEEeCCCC---ceeeccccchh
Confidence 466777778999999999977 67777777764
No 34
>PF04744 Monooxygenase_B: Monooxygenase subunit B protein; InterPro: IPR006833 Ammonia monooxygenase and the particulate methane monooxygenase are both integral membrane proteins, occurring in ammonia oxidisers and methanotrophs respectively, which are thought to be evolutionarily related []. These enzymes have a relatively wide substrate specificity and can catalyse the oxidation of a range of substrates including ammonia, methane, halogenated hydrocarbons and aromatic molecules []. These enzymes are composed of 3 subunits - A (IPR003393 from INTERPRO), B (IPR006833 from INTERPRO) and C (IPR006980 from INTERPRO) - and contain various metal centres, including copper. Particulate methane monooxygenase from Methylococcus capsulatus str. Bath is an ABC homotrimer, which contains mononuclear and dinuclear copper metal centres, and a third metal centre containing a metal ion whose identity in vivo is not certain[]. The soluble regions of these enzymes derive primarily from the B subunit. This subunit forms two antiparallel beta-barrel-like structures and contains the mono- and di- nuclear copper metal centres [].; PDB: 3CHX_E 3RFR_A 3RGB_A 1YEW_A.
Probab=72.73 E-value=4.6 Score=34.36 Aligned_cols=29 Identities=10% Similarity=0.154 Sum_probs=23.8
Q ss_pred EEEEEEeCCceEEEEEEEeCCCCceeEEee
Q psy17993 33 TYRLILREKELHFNIGVYNPSNDLAFNFNL 62 (191)
Q Consensus 33 ~~~~~L~~~~L~~~l~v~N~~~~~~m~f~~ 62 (191)
..+|++-+.+|+++++|+|+|++ +...+-
T Consensus 255 ~A~Y~vpgR~l~~~l~VtN~g~~-pv~Lge 283 (381)
T PF04744_consen 255 DATYRVPGRTLTMTLTVTNNGDS-PVRLGE 283 (381)
T ss_dssp EEEEESSSSEEEEEEEEEEESSS--BEEEE
T ss_pred ccEEecCCcEEEEEEEEEcCCCC-ceEeee
Confidence 45788889999999999999999 776553
No 35
>TIGR03079 CH4_NH3mon_ox_B methane monooxygenase/ammonia monooxygenase, subunit B. Both ammonia oxidizers such as Nitrosomonas europaea and methanotrophs (obligate methane oxidizers) such as Methylococcus capsulatus each can grow only on their own characteristic substrate. However, both groups have the ability to oxidize both substrates, and so the relevant enzymes must be named here according to their ability to oxidze both. The protein family represented here reflects subunit B of both the particulate methane monooxygenase of methylotrophs and the ammonia monooxygenase of nitrifying bacteria.
Probab=68.48 E-value=6.8 Score=33.36 Aligned_cols=27 Identities=7% Similarity=0.187 Sum_probs=23.6
Q ss_pred EEEEEeCCceEEEEEEEeCCCCceeEEe
Q psy17993 34 YRLILREKELHFNIGVYNPSNDLAFNFN 61 (191)
Q Consensus 34 ~~~~L~~~~L~~~l~v~N~~~~~~m~f~ 61 (191)
.+|.+-+..|+++++|+|.|++ +...+
T Consensus 275 a~Y~VPGR~l~~~~~VTN~g~~-~vrlg 301 (399)
T TIGR03079 275 ANYDVPGRALRVTMEITNNGDQ-VISIG 301 (399)
T ss_pred cEEecCCcEEEEEEEEEcCCCC-ceEEE
Confidence 3788989999999999999999 77654
No 36
>PF05506 DUF756: Domain of unknown function (DUF756); InterPro: IPR008475 This domain is found, normally as a tandem repeat, at the C terminus of bacterial phospholipase C proteins.; GO: 0004629 phospholipase C activity, 0016042 lipid catabolic process
Probab=67.95 E-value=15 Score=24.36 Aligned_cols=38 Identities=18% Similarity=0.186 Sum_probs=28.2
Q ss_pred CcceEEEEEEEEeCCceEEEEEEEeCCCCceeEEeecccee
Q psy17993 27 FARFRLTYRLILREKELHFNIGVYNPSNDLAFNFNLLLHTY 67 (191)
Q Consensus 27 P~~f~l~~~~~L~~~~L~~~l~v~N~~~~~~m~f~~g~Hpy 67 (191)
|+.+++...|.-. +-.+.|++.|.|.. +..|.+--+.|
T Consensus 6 ~~~~~v~~~~~~~--~g~l~l~l~N~g~~-~~~~~v~~~~y 43 (89)
T PF05506_consen 6 PYAPEVTARYDPA--TGNLRLTLSNPGSA-AVTFTVYDNAY 43 (89)
T ss_pred CCCCEEEEEEECC--CCEEEEEEEeCCCC-cEEEEEEeCCc
Confidence 6677777776543 33777788999999 99988866555
No 37
>PF12690 BsuPI: Intracellular proteinase inhibitor; InterPro: IPR020481 BsuPI is a intracellular proteinase inhibitor that directly regulates the major intracellular proteinase (ISP-1) activity in vivo. It inhibits ISP-1 in the early stages of sporulation and then may be inactivated by a membrane-bound proteinase [].; PDB: 3ISY_A.
Probab=67.11 E-value=9 Score=25.35 Aligned_cols=20 Identities=20% Similarity=0.504 Sum_probs=14.4
Q ss_pred ceEEEEEEEeCCCCceeEEee
Q psy17993 42 ELHFNIGVYNPSNDLAFNFNL 62 (191)
Q Consensus 42 ~L~~~l~v~N~~~~~~m~f~~ 62 (191)
++.+.|+|+|.+++ +..+.+
T Consensus 1 ~v~~~l~v~N~s~~-~v~l~f 20 (82)
T PF12690_consen 1 QVEFTLTVTNNSDE-PVTLQF 20 (82)
T ss_dssp -EEEEEEEEE-SSS--EEEEE
T ss_pred CEEEEEEEEeCCCC-eEEEEe
Confidence 47899999999999 777664
No 38
>PF09095 DUF1926: Domain of unknown function (DUF1926); InterPro: IPR015179 Alpha-amylase is classified as family 13 of the glycosyl hydrolases and is present in archaea, bacteria, plants and animals. Alpha-amylase is an essential enzyme in alpha-glucan metabolism, acting to catalyse the hydrolysis of alpha-1,4-glucosidic bonds of glycogen, starch and related polysaccharides. Although all alpha-amylases possess the same catalytic function, they can vary with respect to sequence. In general, they are composed of three domains: a TIM barrel containing the active site residues and chloride ion-binding site (domain A), a long loop region inserted between the third beta strand and the alpha-helix of domain A that contains calcium-binding site(s) (domain B), and a C-terminal beta-sheet domain that appears to show some variability in sequence and length between amylases (domain C) []. Amylases have at least one conserved calcium-binding site, as calcium is essential for the stability of the enzyme. The chloride-binding functions to activate the enzyme, which acts by a two-step mechanism involving a catalytic nucleophile base (usually an Asp) and a catalytic proton donor (usually a Glu) that are responsible for the formation of the beta-linked glycosyl-enzyme intermediate. This entry represents a domain found in prokaryotic alpha-amylase (3.2.1.1 from EC) and 4-alpha-glucanotransferase (2.4.1.25 from EC). This domain adopts a beta-sandwich fold, in which two layers of anti-parallel beta-sheets are arranged in a nearly parallel fashion. The exact function of this domain is, as yet, unknown, however it has been proposed that it may play a role in transglycosylation reactions []. More information about this protein can be found at Protein of the Month: alpha-Amylase [].; GO: 0003824 catalytic activity, 0030246 carbohydrate binding, 0005975 carbohydrate metabolic process; PDB: 1K1X_B 1K1W_A 1K1Y_A.
Probab=53.02 E-value=27 Score=28.58 Aligned_cols=29 Identities=17% Similarity=0.111 Sum_probs=25.2
Q ss_pred CCcceEEEEEEEEeCCceEEEEEEEeCCCC
Q psy17993 26 YFARFRLTYRLILREKELHFNIGVYNPSND 55 (191)
Q Consensus 26 yP~~f~l~~~~~L~~~~L~~~l~v~N~~~~ 55 (191)
.+....|+=+|+|.+++|.++++++ .++.
T Consensus 144 ~~~~~~l~K~y~l~~~~l~V~Y~l~-~~~~ 172 (278)
T PF09095_consen 144 EGHPITLEKRYRLTKNGLQVDYRLT-ESPE 172 (278)
T ss_dssp SEEEEEEEEEEEEETTEEEEEEEEE--ESS
T ss_pred ccCceEEEEEEEEcCCEEEEEEEEE-ECCC
Confidence 6788999999999999999999999 5544
No 39
>PF06030 DUF916: Bacterial protein of unknown function (DUF916); InterPro: IPR010317 This family consists of putative cell surface proteins, from Firmicutes, of unknown function.
Probab=51.18 E-value=42 Score=23.86 Aligned_cols=29 Identities=10% Similarity=0.177 Sum_probs=25.8
Q ss_pred CCceEEEEEEEeCCCCceeEEeeccceeEE
Q psy17993 40 EKELHFNIGVYNPSNDLAFNFNLLLHTYLK 69 (191)
Q Consensus 40 ~~~L~~~l~v~N~~~~~~m~f~~g~HpyF~ 69 (191)
+..-++++.|.|.+++ ++-|.+..++++.
T Consensus 26 ~q~~~l~v~i~N~s~~-~~tv~v~~~~A~T 54 (121)
T PF06030_consen 26 GQKQTLEVRITNNSDK-EITVKVSANTATT 54 (121)
T ss_pred CCEEEEEEEEEeCCCC-CEEEEEEEeeeEe
Confidence 4678899999999999 9999999999874
No 40
>PF14796 AP3B1_C: Clathrin-adaptor complex-3 beta-1 subunit C-terminal
Probab=50.71 E-value=63 Score=23.92 Aligned_cols=15 Identities=27% Similarity=0.350 Sum_probs=12.7
Q ss_pred eEeCCCCEEEEEEEe
Q psy17993 176 VLLHPGTTFEASQIL 190 (191)
Q Consensus 176 ~~L~pGe~~~~~~~i 190 (191)
-.|+||++.+..+-|
T Consensus 124 ~~L~pg~s~t~~lgI 138 (145)
T PF14796_consen 124 ESLEPGASVTVSLGI 138 (145)
T ss_pred cccCCCCeEEEEEEE
Confidence 589999999988765
No 41
>PLN02303 urease
Probab=43.74 E-value=25 Score=33.41 Aligned_cols=32 Identities=13% Similarity=0.151 Sum_probs=24.0
Q ss_pred EEEEeCCceEEEEEEEeCCCCceeEEeeccceeEE
Q psy17993 35 RLILREKELHFNIGVYNPSNDLAFNFNLLLHTYLK 69 (191)
Q Consensus 35 ~~~L~~~~L~~~l~v~N~~~~~~m~f~~g~HpyF~ 69 (191)
.+.|..+.=+++++|+|+|++ |.++|-|-.|.
T Consensus 142 ~i~~n~gr~~~~l~v~n~gdr---piqvgSH~hf~ 173 (837)
T PLN02303 142 SIIINAGRKAVKLKVTNTGDR---PIQVGSHYHFI 173 (837)
T ss_pred CeeeCCCCCeEEEEEeeCCCC---ceEeccccchH
Confidence 456776777899999999977 56667676553
No 42
>PF00345 PapD_N: Pili and flagellar-assembly chaperone, PapD N-terminal domain; InterPro: IPR016147 Most Gram-negative bacteria possess a supramolecular structure - the pili - on their surface, which mediates attachment to specific receptors. Many interactive subunits are required to assemble pili, but their assembly only takes place after translocation across the cytoplasmic membrane. Periplasmic chaperones assist pili assembly by binding to the subunits, thereby preventing premature aggregation [, ]. Pili chaperones are structurally, and possibly evolutionarily, related to the immunoglobulin superfamily [, ]: they contain two globular domains, with a topology identical to an immunoglobulin fold. This entry represents the N-terminal domain of pili assembly chaperone, and has a beta-sandwich fold consisting of seven strands in two sheets with a Greek key topology.; GO: 0007047 cellular cell wall organization, 0030288 outer membrane-bounded periplasmic space; PDB: 2CO6_B 2CO7_B 1L4I_B 3GFU_A 3F65_F 3F6L_A 3F6I_A 3GEW_B 3DSN_D 2OS7_B ....
Probab=41.96 E-value=71 Score=22.19 Aligned_cols=27 Identities=22% Similarity=0.342 Sum_probs=20.7
Q ss_pred EEEeCCceEEEEEEEeCCCCceeEEeec
Q psy17993 36 LILREKELHFNIGVYNPSNDLAFNFNLL 63 (191)
Q Consensus 36 ~~L~~~~L~~~l~v~N~~~~~~m~f~~g 63 (191)
+.+.++.=..+++|.|.+++ ++.++.-
T Consensus 9 ii~~~~~~~~~i~v~N~~~~-~~~vq~~ 35 (122)
T PF00345_consen 9 IIFNESQRSASITVTNNSDQ-PYLVQVW 35 (122)
T ss_dssp EEEETTSSEEEEEEEESSSS-EEEEEEE
T ss_pred EEEeCCCCEEEEEEEcCCCC-cEEEEEE
Confidence 44555556889999999999 8887763
No 43
>PF03351 DOMON: DOMON domain; InterPro: IPR005018 The DOMON domain is an 110-125 residue long domain which has been identified in the physiologically important enzyme dopamine beta-monooxygenase and in several other secreted and transmembrane proteins from both plants and animals. It has been named after DOpamine beta-MOnooxygenase N-terminal domain. The DOMON domain can be found in one to four copies and in association with other domains, such as the Cu-ascorbate dependent monooxygenase domain, the epidermal growth factor domain, the trypsin inhibitor-like domain (TIL), the SEA domain and the Reelin domain. The architectures of the DOMON domain proteins strongly suggest a function in extracellular adhesion []. The sequence conservation is predominantly centred around patches of hydrophobic residues. The secondary structure prediction of the DOMON domain points to an all-beta-strand fold with seven or eight core strands supported by a buried core of conserved hydrophobic residues. There is a chraracteristic motif with two small positions (Gly or Ser) corresponding to a conserved turn immediately C-terminal to strand three. It has been proposed that the DOMON domain might form a beta-sandwich structure, with the strands distributed into two beta sheets as is seen in many extracellular adhesion domains such as the immunoglobulin, fibronectin type III, cadherin and PKD domains [].
Probab=41.50 E-value=79 Score=21.72 Aligned_cols=35 Identities=6% Similarity=0.068 Sum_probs=25.2
Q ss_pred EEEEEEEEeCCceEEEEEEEeCCCCceeEEeeccce
Q psy17993 31 RLTYRLILREKELHFNIGVYNPSNDLAFNFNLLLHT 66 (191)
Q Consensus 31 ~l~~~~~L~~~~L~~~l~v~N~~~~~~m~f~~g~Hp 66 (191)
.+.+++++.++.-.++++++..... ...+++||-+
T Consensus 4 ~~~l~w~~~~~~~~i~~~l~~~~~~-~~w~aiGfs~ 38 (124)
T PF03351_consen 4 NFSLSWTVDGDNNTIEFELTGPANT-NGWVAIGFSD 38 (124)
T ss_pred eEEEEEEEECCCCEEEEEEEeccCC-CCEEEEEEcc
Confidence 3456666666666777777876666 7888888887
No 44
>PF14310 Fn3-like: Fibronectin type III-like domain; PDB: 3ABZ_D 3AC0_D 2X40_A 2X41_A 2X42_A.
Probab=41.34 E-value=33 Score=21.60 Aligned_cols=16 Identities=38% Similarity=0.395 Sum_probs=12.3
Q ss_pred CeEeCCCCEEEEEEEe
Q psy17993 175 PVLLHPGTTFEASQIL 190 (191)
Q Consensus 175 ~~~L~pGe~~~~~~~i 190 (191)
.+.|+|||+.+.+++|
T Consensus 26 rv~l~pGes~~v~~~l 41 (71)
T PF14310_consen 26 RVSLAPGESKTVSFTL 41 (71)
T ss_dssp EEEE-TT-EEEEEEEE
T ss_pred EEEECCCCEEEEEEEE
Confidence 5789999999999887
No 45
>TIGR01451 B_ant_repeat conserved repeat domain. This model represents the conserved region of about 53 amino acids shared between regions, usually repeated, of proteins from a small number of phylogenetically distant prokaryotes. Examples include a 132-residue region found repeated in three of the five longest proteins of Bacillus anthracis, a 131-residue repeat in a cell wall-anchored protein of Enterococcus faecalis, and a 120-residue repeat in Methanobacterium thermoautotrophicum. A similar region is found in some Chlamydial outer membrane proteins.
Probab=38.60 E-value=44 Score=19.90 Aligned_cols=16 Identities=25% Similarity=0.341 Sum_probs=14.3
Q ss_pred CCceEEEEEEEeCCCC
Q psy17993 40 EKELHFNIGVYNPSND 55 (191)
Q Consensus 40 ~~~L~~~l~v~N~~~~ 55 (191)
|+.++.+++|+|.|..
T Consensus 11 Gd~v~Yti~v~N~g~~ 26 (53)
T TIGR01451 11 GDTITYTITVTNNGNV 26 (53)
T ss_pred CCEEEEEEEEEECCCC
Confidence 5789999999999987
No 46
>PF14849 YidC_periplas: YidC periplasmic domain; PDB: 3BS6_B 3BLC_B.
Probab=38.44 E-value=1.6e+02 Score=23.35 Aligned_cols=44 Identities=14% Similarity=0.123 Sum_probs=26.4
Q ss_pred CCeEEEEEccCCCcccCCCcceEEEEEEEEeCCc--eEEEEEEEeCCCCceeEE
Q psy17993 9 LDAFCIFPYLDIFCIFPYFARFRLTYRLILREKE--LHFNIGVYNPSNDLAFNF 60 (191)
Q Consensus 9 ~~~~~~~~l~~~~t~~~yP~~f~l~~~~~L~~~~--L~~~l~v~N~~~~~~m~f 60 (191)
+..+++|+... +-.+.++.+|+|.++. +.++++++|.+.. +...
T Consensus 86 ~~~~vtf~~~~-------~~g~~i~k~ytf~~~~Y~~~~~i~~~n~~~~-~~~~ 131 (270)
T PF14849_consen 86 DSQSVTFTAQL-------GNGLTITKTYTFKPDSYLVDLEISVTNLSDQ-PVSL 131 (270)
T ss_dssp SEEEEEEEEE--------TTS-EEEEEEEEETT--EEEEEEEEE--SSS--EEE
T ss_pred CceEEEEEEEC-------CCCEEEEEEEEEcCCcEEEEEEEEEECCCCC-cccc
Confidence 44566666654 2237899999998654 6677778898877 6655
No 47
>TIGR03593 yidC_nterm membrane protein insertase, YidC/Oxa1 family, N-terminal domain. Essentially all bacteria have a member of the YidC family, whose C-terminal domain is modeled by TIGR03592. The two copies are found in endospore-forming bacteria such as Bacillus subtilis appear redundant during vegetative growth, although the member designated spoIIIJ (stage III sporulation protein J) has a distinct role in spore formation. YidC, its mitochondrial homolog Oxa1, and its chloroplast homolog direct insertion into the bacterial/organellar inner (or only) membrane. This model describes an N-terminal sequence region, including a large periplasmic domain lacking in YidC members from Gram-positive species. The multifunctional YidC protein acts both with and independently of the Sec system.
Probab=37.86 E-value=1.7e+02 Score=24.64 Aligned_cols=30 Identities=10% Similarity=0.119 Sum_probs=24.0
Q ss_pred ceEEEEEEEEeCC--ceEEEEEEEeCCCCceeE
Q psy17993 29 RFRLTYRLILREK--ELHFNIGVYNPSNDLAFN 59 (191)
Q Consensus 29 ~f~l~~~~~L~~~--~L~~~l~v~N~~~~~~m~ 59 (191)
...++.+|++.++ .+.++++|+|.++. +..
T Consensus 172 G~~v~k~ytf~~~sY~i~v~~~v~N~~~~-~~~ 203 (366)
T TIGR03593 172 GVTVTKTYTFDRDSYLIDVEYKVTNNGDA-PVS 203 (366)
T ss_pred CeEEEEEEEEeCCeEEEEeEEEEEeCCCC-Cee
Confidence 4678889999876 58888999999987 554
No 48
>PF01345 DUF11: Domain of unknown function DUF11; InterPro: IPR001434 This group of sequences is represented by a conserved region of about 53 amino acids shared between regions, usually repeated, of proteins from a small number of phylogenetically distant prokaryotes. Examples include a 132-residue region found repeated in three of the five longest proteins of Bacillus anthracis, a 131-residue repeat in a cell wall-anchored protein of Enterococcus faecalis (Streptococcus faecalis), and a 120-residue repeat in Methanobacterium thermoautotrophicum. A similar region is found in some Chlamydia trachomatis outer membrane proteins. In C. trachomatis, three cysteine-rich proteins (also believed to be lipoproteins), MOMP, OMP6 and OMP3, make up the extracellular matrix of the outer membrane []. They are involved in the essential structural integrity of both the elementary body (EB) and recticulate body (RB) phase. They are thought to be involved in porin formation and, as these bacteria lack the peptidoglycan layer common to most Gram-negative microbes, such proteins are highly important in the pathogenicity of the organism.; GO: 0005727 extrachromosomal circular DNA
Probab=36.53 E-value=50 Score=20.87 Aligned_cols=16 Identities=19% Similarity=0.297 Sum_probs=14.5
Q ss_pred CCceEEEEEEEeCCCC
Q psy17993 40 EKELHFNIGVYNPSND 55 (191)
Q Consensus 40 ~~~L~~~l~v~N~~~~ 55 (191)
|+.++.+++|+|.|..
T Consensus 40 Gd~v~ytitvtN~G~~ 55 (76)
T PF01345_consen 40 GDTVTYTITVTNTGPA 55 (76)
T ss_pred CCEEEEEEEEEECCCC
Confidence 5789999999999988
No 49
>PF05543 Peptidase_C47: Staphopain peptidase C47; InterPro: IPR008750 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Cysteine peptidases have characteristic molecular topologies, which can be seen not only in their three-dimensional structures, but commonly also in the two-dimensional structures. These are peptidases in which the nucleophile is the sulphydryl group of a cysteine residue. Cysteine proteases are divided into clans (proteins which are evolutionary related), and further sub-divided into families, on the basis of the architecture of their catalytic dyad or triad []. This group of cysteine peptidases belong to the peptidase family C47 (staphopain family, clan CA). The type example are the staphopains, which are one of four major families of proteinases secreted by the Gram-positive Staphylococcus aureus. These staphylococcal cysteine proteases are secreted as preproenzymes that are proteolytically cleaved to generate the mature enzyme [, , ].; GO: 0008234 cysteine-type peptidase activity, 0006508 proteolysis; PDB: 1X9Y_D 1Y4H_B 1PXV_B 1CV8_A.
Probab=36.22 E-value=20 Score=27.39 Aligned_cols=35 Identities=17% Similarity=0.475 Sum_probs=19.2
Q ss_pred CCCcEEEeCCccCCCCCCCCCCCCCCceEEecceecCCCeEeCCCCEEEE
Q psy17993 137 NFPDTVIWNPWDTGRDMKDFGDDEYPNMICVEAGHVTSPVLLHPGTTFEA 186 (191)
Q Consensus 137 ~~~~~vvWnp~~~~~~~~d~~~~~~~~fvCiEP~~~~~~~~L~pGe~~~~ 186 (191)
|.+...+|||| |..+.|+-.-+ ..+.|.+|..++=
T Consensus 134 g~~~y~~WNPW-------------~~~~~~~sa~s--~~~~~~~g~~y~W 168 (175)
T PF05543_consen 134 GQKTYYFWNPW-------------WNDVMIQSAKS--NIIPVSNGYHYNW 168 (175)
T ss_dssp SEEEEEEE-TT--------------SS-EEEETT----EEEETTTEEEEE
T ss_pred CCeEEEEeCCc-------------cCCcEEEecCC--CceeecCCCeeEE
Confidence 46789999999 45666653311 2466666665543
No 50
>PRK01318 membrane protein insertase; Provisional
Probab=35.96 E-value=1.8e+02 Score=26.28 Aligned_cols=32 Identities=19% Similarity=0.260 Sum_probs=24.3
Q ss_pred ceEEEEEEEEeCCc--eEEEEEEEeCCCCceeEEe
Q psy17993 29 RFRLTYRLILREKE--LHFNIGVYNPSNDLAFNFN 61 (191)
Q Consensus 29 ~f~l~~~~~L~~~~--L~~~l~v~N~~~~~~m~f~ 61 (191)
...++.+|++.+++ ++++++|+|.+.. ++...
T Consensus 138 g~~~~k~yt~~~~~Y~~~~~~~v~n~~~~-~~~~~ 171 (521)
T PRK01318 138 GLTFTKTYTLDRGDYMFTVEYSVNNNSGA-PVNLS 171 (521)
T ss_pred CeEEEEEEEEcCCceEEEEEEEEEcCCCC-ceeee
Confidence 46778899997654 8888999998876 55543
No 51
>PF10409 PTEN_C2: C2 domain of PTEN tumour-suppressor protein; InterPro: IPR014020 Tensins constitute an eukaryotic family of lipid phosphatases that are defined by the presence of two adjacent domains: a lipid phosphatase domain and a C2-like domain. The tensin-type C2 domain has a structure similar to the classical C2 domain (see IPR000008 from INTERPRO) that mediates the Ca2+-dependent membrane recruitment of several signalling proteins. However the tensin-type C2 domain lacks two of the three conserved loops that bind Ca2+, and in this respect it is similar to the C2 domains of PKC-type [, ]. The tensin-type C2 domain can bind phopholipid membranes in a Ca2+ independent manner []. In the tumour suppressor protein PTEN, the best characterised member of the family, the lipid phosphatase domain was shown to specifically dephosphorylate the D3 position of the inositol ring of the lipid second messenger, phosphatydilinositol-3-4-5-triphosphate (PIP3). The lipid phosphatase domain contains the signature motif HCXXGXXR present in the active sites of protein tyrosine phosphatases (PTPs) and dual specificity phosphatases (DSPs). Furthermore, two invariant lysines are found only in the tensin-type phosphatase motif (HCKXGKXR) and are suspected to interact with the phosphate group at position D1 and D5 of the inositol ring [, ]. The C2 domain is found at the C terminus of the tumour suppressor protein PTEN (phosphatidyl-inositol triphosphate phosphatase). This domain may include a CBR3 loop, indicating a central role in membrane binding. This domain associates across an extensive interface with the N-terminal phosphatase domain DSPc suggesting that the C2 domain productively positions the catalytic part of the protein on the membrane. The crystal structure of the PTEN tumour suppressor has been solved []. The lipid phosphatase domain has a structure similar to the dual specificity phosphatase (see IPR000387 from INTERPRO). However, PTEN has a larger active site pocket that could be important to accommodate PI(3,4,5)P3. Proteins known to contain a phosphatase and a C2 tensin-type domain are listed below: Tensin, a focal-adhesion molecule that binds to actin filaments. It may be involved in cell migration, cartilage development and in linking signal transduction pathways to the cytoskeleton. Phosphatase and tensin homologue deleted on chromosome 10 protein (PTEN). It antagonizes PI 3-kinase signalling by dephosphorylating the 3-position of the inositol ring of PI(3,4,5)P3 and thus inactivates downstream signalling. It plays major roles both during development and in the adult to control cell size, growth, and survival. Auxilin. It binds clathrin heavy chain and promotes its assembly into regular cages. Cyclin G-associated kinase or auxilin-2. It is a potential regulator of clathrin-mediated membrane trafficking. ; GO: 0005515 protein binding; PDB: 3N0A_A 1D5R_A 3V0D_B 3V0H_B 3V0G_A 3V0F_B 3V0J_A 3V0I_A 3AWE_B 3AWG_C ....
Probab=34.75 E-value=82 Score=22.29 Aligned_cols=52 Identities=13% Similarity=0.063 Sum_probs=33.2
Q ss_pred CCCcceEEEEEEEEeCCceEEEEEEEe--CCCCceeEEeeccceeEEeCCCCceEE
Q psy17993 25 PYFARFRLTYRLILREKELHFNIGVYN--PSNDLAFNFNLLLHTYLKVPDVRRCQV 78 (191)
Q Consensus 25 ~yP~~f~l~~~~~L~~~~L~~~l~v~N--~~~~~~m~f~~g~HpyF~v~~~~~~~~ 78 (191)
.....+.+.....|.|+ +.+.+.-.+ ...+ .+-|.+-+|++|-......+.+
T Consensus 53 ~~~~~~~~~~~~~l~GD-V~i~~~~~~~~~~~~-~~~f~~~FnT~Fi~~~~~~l~~ 106 (134)
T PF10409_consen 53 QDSVIIELPKNLPLRGD-VLIKFYHKRSSSMSK-EKMFRFWFNTGFIEPNNNVLRF 106 (134)
T ss_dssp TTCEEEEEEEEEEEESE-EEEEEEECETTECCC-EEEEEEEEEGGGSBTTTCEEEE
T ss_pred ceeEEEEeCCCCeEeCC-EEEEEEeCCCccccc-CeEEEEEEeeeeeeccccEEEc
Confidence 34566777777888764 455543333 3466 7899999999996532333444
No 52
>PF10065 DUF2303: Uncharacterized conserved protein (DUF2303); InterPro: IPR019276 This entry is represented by Bacteriiophage VT2phi_272, P20. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches.
Probab=33.98 E-value=60 Score=26.64 Aligned_cols=27 Identities=19% Similarity=0.175 Sum_probs=21.3
Q ss_pred CcceEEEEEEEEeCCceEEEEEEEeCC
Q psy17993 27 FARFRLTYRLILREKELHFNIGVYNPS 53 (191)
Q Consensus 27 P~~f~l~~~~~L~~~~L~~~l~v~N~~ 53 (191)
++.+.++++|+..+++|.+.|++.+..
T Consensus 221 ~~~i~~Rlr~r~~~g~l~l~l~l~~~e 247 (276)
T PF10065_consen 221 EYRITARLRYRTRGGKLVLWLRLVRPE 247 (276)
T ss_pred eEEEEEEEEEEecCCcEEEEEEEcCHH
Confidence 567888888888888888888876543
No 53
>PF14874 PapD-like: Flagellar-associated PapD-like
Probab=33.52 E-value=1.4e+02 Score=19.77 Aligned_cols=24 Identities=13% Similarity=0.086 Sum_probs=20.3
Q ss_pred eCCceEEEEEEEeCCCCceeEEeec
Q psy17993 39 REKELHFNIGVYNPSNDLAFNFNLL 63 (191)
Q Consensus 39 ~~~~L~~~l~v~N~~~~~~m~f~~g 63 (191)
-+......++++|.+.. +..|.+-
T Consensus 18 ~g~~~~~~v~l~N~s~~-p~~f~v~ 41 (102)
T PF14874_consen 18 VGQTYSRTVTLTNTSSI-PARFRVR 41 (102)
T ss_pred cCCEEEEEEEEEECCCC-CEEEEEE
Confidence 46788999999999999 9888863
No 54
>PF03072 DUF237: MG032/MG096/MG288 family 1; InterPro: IPR004306 This domain is found entirely in Mycoplasma pneumoniae proteins of unknown function. Another related domain (IPR004319 from INTERPRO) is found entirely in mycoplasmal proteins of the MG032/MG096/MG288 family and both domains often occur together.
Probab=32.54 E-value=46 Score=24.33 Aligned_cols=20 Identities=10% Similarity=-0.001 Sum_probs=16.7
Q ss_pred CcccCCCcceEEEEEEEEeC
Q psy17993 21 FCIFPYFARFRLTYRLILRE 40 (191)
Q Consensus 21 ~t~~~yP~~f~l~~~~~L~~ 40 (191)
+-++.+||.|++++++.+++
T Consensus 46 e~~q~~py~f~FeI~~kY~G 65 (137)
T PF03072_consen 46 EFFQDKPYRFKFEISFKYEG 65 (137)
T ss_pred ceeccCceEEEEEEEEEEEE
Confidence 56778899999999988875
No 55
>cd01176 IPT_RBP-Jkappa IPT domain of the recombination signal Jkappa binding protein (RBP-Jkappa). RBP-J kappa, was initially considered to be involved in V(D)J recombination because of its DNA binding specificity and structural similarity to site-specific recombinases known as the integrase family. Further studies indicated that RBP-J kappa functions as a repressor of transcription, via destabilization of the general transcription factor IID and recruitment of histone deacetylase complexes.
Probab=31.54 E-value=63 Score=21.99 Aligned_cols=37 Identities=22% Similarity=0.377 Sum_probs=22.5
Q ss_pred eEEEEEeCC-CcEEEeCCccCCCCCCCCCCCCCCceEEecce
Q psy17993 130 KMRLQKYNF-PDTVIWNPWDTGRDMKDFGDDEYPNMICVEAG 170 (191)
Q Consensus 130 ~i~v~~~~~-~~~vvWnp~~~~~~~~d~~~~~~~~fvCiEP~ 170 (191)
.+.+..++| +++-||--...+..|-. ....++|+-|-
T Consensus 21 mlEl~GenF~pnLkVWFG~veaeTmyR----~~e~l~CvvPd 58 (97)
T cd01176 21 MLELHGENFTPNLKVWFGDVEAETMYR----CEESLLCVVPD 58 (97)
T ss_pred EEEEecCcCCCCceEEECCcceEEEEE----ccceeEEecCC
Confidence 466667776 89999973213322211 12379999995
No 56
>PF11611 DUF4352: Domain of unknown function (DUF4352); InterPro: IPR021652 This entry is represented by Bacteriophage A118, Gp32. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches. This entry represents a group of putative lipoproteins of unknown function.; PDB: 3CFU_A.
Probab=31.28 E-value=65 Score=22.11 Aligned_cols=22 Identities=18% Similarity=0.318 Sum_probs=16.7
Q ss_pred CceEEEEEEEeCCCCceeEEeec
Q psy17993 41 KELHFNIGVYNPSNDLAFNFNLL 63 (191)
Q Consensus 41 ~~L~~~l~v~N~~~~~~m~f~~g 63 (191)
.-+.++++|+|.+++ +..+...
T Consensus 36 ~fv~v~v~v~N~~~~-~~~~~~~ 57 (123)
T PF11611_consen 36 KFVVVDVTVKNNGDE-PLDFSPS 57 (123)
T ss_dssp EEEEEEEEEEE-SSS--EEEEGG
T ss_pred EEEEEEEEEEECCCC-cEEeccc
Confidence 348999999999999 8887754
No 57
>cd06535 CIDE_N_CAD CIDE_N domain of CAD nuclease. The CIDE_N (cell death-inducing DFF45-like effector, N-terminal) domain is found at the N-terminus of CAD nuclease (caspase-activated DNase/DNA fragmentation factor, DFF40) and its inhibitor, ICAD(DFF45). These proteins are associated with the chromatin condensation and DNA fragmentation events of apoptosis; the CIDE_N domain is thought to regulate the activity of CAD/DFF40 and ICAD/DFF45 during apoptosis. In normal cells, DFF exists in the nucleus as a heterodimer composed of CAD/DFF40 as a latent nuclease and its chaperone and inhibitor subunit ICAD/DFF45. Apoptotic activation of caspase-3 results in the cleavage of DFF45/ICAD and the release of active DFF40/CAD nuclease.
Probab=30.93 E-value=16 Score=24.03 Aligned_cols=21 Identities=24% Similarity=0.479 Sum_probs=13.9
Q ss_pred eEEecceecCCCeEeCCCCEEEE
Q psy17993 164 MICVEAGHVTSPVLLHPGTTFEA 186 (191)
Q Consensus 164 fvCiEP~~~~~~~~L~pGe~~~~ 186 (191)
|-|+++-. .++.|.|||+|++
T Consensus 57 F~tLp~nT--~lmvL~~gq~W~g 77 (77)
T cd06535 57 FPTLPDNT--ELVLLTPGQSWQG 77 (77)
T ss_pred HhcCCCCc--EEEEEcCCCccCC
Confidence 54555532 2688999999863
No 58
>PHA02310 hypothetical protein
Probab=29.43 E-value=62 Score=23.34 Aligned_cols=41 Identities=17% Similarity=0.250 Sum_probs=27.5
Q ss_pred EEEEEeCCCcEEEeCCcc-CCCCCCCC-CCCCCCceEEeccee
Q psy17993 131 MRLQKYNFPDTVIWNPWD-TGRDMKDF-GDDEYPNMICVEAGH 171 (191)
Q Consensus 131 i~v~~~~~~~~vvWnp~~-~~~~~~d~-~~~~~~~fvCiEP~~ 171 (191)
+.+.+.-.++..+||-.. +-.+-.|. .-|+|+--+|||-+.
T Consensus 80 ~~i~Rg~agD~lvwnGkry~v~g~~dW~~QDGW~~~lcve~G~ 122 (130)
T PHA02310 80 VDLDRDMAGDQFIWTGRVFQLESQGSWFYQDGWGVCLAVDIGA 122 (130)
T ss_pred eeccccCCCCEEEEcCcEEEEccccceeecCCceEEEEEEecc
Confidence 345555678999999653 33333453 347899999999864
No 59
>PF14100 PmoA: Methane oxygenase PmoA
Probab=28.14 E-value=73 Score=25.98 Aligned_cols=28 Identities=18% Similarity=0.349 Sum_probs=22.9
Q ss_pred ceEEecceec-CCCeEeCCCCEEEEEEEe
Q psy17993 163 NMICVEAGHV-TSPVLLHPGTTFEASQIL 190 (191)
Q Consensus 163 ~fvCiEP~~~-~~~~~L~pGe~~~~~~~i 190 (191)
.|+|+-|.-+ .....|+||++.+++++|
T Consensus 224 g~~~~~p~~~~~~~~~l~~G~~l~~rYr~ 252 (271)
T PF14100_consen 224 GLFGANPAPAFDGPLTLPPGETLTLRYRV 252 (271)
T ss_pred CcceecccccccCceecCCCCeEEEEEEE
Confidence 5778877654 357999999999999986
No 60
>PF02120 Flg_hook: Flagellar hook-length control protein FliK; InterPro: IPR021136 This entry represents the C-terminal domain of the flagellar hook-length control protein FliK. This entry also includes YscP of the Yersinia type III secretion system, and equivalent proteins in other pathogenic bacterial type III secretion systems. During flagellar morphogenesis in Salmonella typhimurium and Escherichia coli, flagellar hook-length control protein (FliK) controls the length of the hook by directly measuring the hook length [, ]. It is considered unlikely that FliK functions as a molecular ruler for determining hook length, but that it is more likely to be employing a novel mechanism. The deduced amino acid sequences of FliK proteins from S. typhimurium and E. coli have molecular masses of 41,748 and 39,246 Da, respectively, and are fairly hydrophilic []. Sequence comparison reveals around 50% identity, with greatest conservation in the C-terminal region, with 71% identity in the last 154 amino acids - mutagenesis of this conserved region completely abolishes motility. The central and C-terminal regions are rich in proline and glutamine respectively; it is thought that they may constitute distinct domains [].; PDB: 2RRL_A.
Probab=28.13 E-value=97 Score=19.86 Aligned_cols=38 Identities=13% Similarity=-0.040 Sum_probs=21.3
Q ss_pred CCeEEEEEccCCCcccCCCcceEEEEEEEEeCCceEEEEEEEeCC
Q psy17993 9 LDAFCIFPYLDIFCIFPYFARFRLTYRLILREKELHFNIGVYNPS 53 (191)
Q Consensus 9 ~~~~~~~~l~~~~t~~~yP~~f~l~~~~~L~~~~L~~~l~v~N~~ 53 (191)
+...+.|.|... .-=.+.+++++.++.|++.+.+.|..
T Consensus 11 ~~~~~~l~L~p~-------~LG~v~v~l~~~~~~l~v~~~~~~~~ 48 (85)
T PF02120_consen 11 GSWELSLQLDPP-------ELGSVEVKLRLQGGNLSVQFTAENPE 48 (85)
T ss_dssp T--EEEE--SSG-------GG--EEEEEEEETTEEEEEEE--SSH
T ss_pred CceEEEEEEccc-------ccCcEEEEEEEeCCEEEEEEEECCHH
Confidence 344566666543 22356777788888999999888764
No 61
>PF06586 TraK: TraK protein; InterPro: IPR010563 This family consists of several TraK proteins from Escherichia coli, Salmonella typhi and Salmonella typhimurium. TraK is known to be essential for pilus assembly but its exact role in this process is unknown [].
Probab=27.78 E-value=1.6e+02 Score=23.15 Aligned_cols=36 Identities=19% Similarity=0.093 Sum_probs=27.2
Q ss_pred CCcceEEEEEEEEeCCceEEE-EEEEeCCCCceeEEee
Q psy17993 26 YFARFRLTYRLILREKELHFN-IGVYNPSNDLAFNFNL 62 (191)
Q Consensus 26 yP~~f~l~~~~~L~~~~L~~~-l~v~N~~~~~~m~f~~ 62 (191)
.+-.++++......|..|++. ++|+|+++. ++-+.-
T Consensus 170 ~~~~l~~~~~~~~~G~~l~~~~y~v~N~~~~-~v~l~E 206 (234)
T PF06586_consen 170 LPKGLRLTPVQRYEGSGLRGEVYRVTNTSDQ-PVELDE 206 (234)
T ss_pred ccccceEeeeeEEecCceEEEEEEEEeCCCC-CEEecH
Confidence 345667777777778888876 889999998 776654
No 62
>PRK15098 beta-D-glucoside glucohydrolase; Provisional
Probab=27.71 E-value=44 Score=31.53 Aligned_cols=33 Identities=18% Similarity=0.226 Sum_probs=22.6
Q ss_pred ccCCCcceEEEEE-E-----EEe------CCceEEEEEEEeCCCC
Q psy17993 23 IFPYFARFRLTYR-L-----ILR------EKELHFNIGVYNPSND 55 (191)
Q Consensus 23 ~~~yP~~f~l~~~-~-----~L~------~~~L~~~l~v~N~~~~ 55 (191)
...|||-|=|.|+ | +++ ++.++++++|+|+|+.
T Consensus 637 ~plypFG~GLSYT~F~ys~l~v~~~~~~~~~~i~v~v~V~NtG~~ 681 (765)
T PRK15098 637 GPLYPFGYGLSYTTFTVSDVKLSSPTMKRDGKVTASVTVTNTGKR 681 (765)
T ss_pred CccccccCCCCCccEEeeccEeccccccCCCeEEEEEEEEECCCC
Confidence 3579998866541 1 222 4568899999999976
No 63
>KOG3150|consensus
Probab=26.01 E-value=38 Score=25.42 Aligned_cols=20 Identities=20% Similarity=0.660 Sum_probs=15.5
Q ss_pred CceEEEEEeCCCcEEEeCCc
Q psy17993 128 GRKMRLQKYNFPDTVIWNPW 147 (191)
Q Consensus 128 ~~~i~v~~~~~~~~vvWnp~ 147 (191)
.+.|.++.+-||+.+||+|=
T Consensus 16 ~~~id~k~~rfPyCIVWTPi 35 (182)
T KOG3150|consen 16 DLEIDPKRSRFPYCIVWTPI 35 (182)
T ss_pred ccccCcccCCCCeEEEecCh
Confidence 34566677789999999984
No 64
>PF11325 DUF3127: Domain of unknown function (DUF3127); InterPro: IPR021474 This bacterial family of proteins has no known function.
Probab=25.62 E-value=1.1e+02 Score=20.35 Aligned_cols=31 Identities=23% Similarity=0.320 Sum_probs=23.8
Q ss_pred CCCceEEeccee--cCCCeEeCCCCEEEEEEEe
Q psy17993 160 EYPNMICVEAGH--VTSPVLLHPGTTFEASQIL 190 (191)
Q Consensus 160 ~~~~fvCiEP~~--~~~~~~L~pGe~~~~~~~i 190 (191)
.|.+.+|+|-|. +..+-.+++|+..+.++.|
T Consensus 33 qYP~~i~f~~~~dk~~~l~~~~~Gd~V~Vsf~i 65 (84)
T PF11325_consen 33 QYPQKICFEFWGDKIDLLDNFQVGDEVKVSFNI 65 (84)
T ss_pred cCCceEEEEEEcchhhhhccCCCCCEEEEEEEe
Confidence 366899999986 2224688999999888876
No 65
>cd06536 CIDE_N_ICAD CIDE_N domain of ICAD. The CIDE_N (cell death-inducing DFF45-like effector, N-terminal) domain is found at the N-terminus of the CAD nuclease (caspase-activated DNase/DNA fragmentation factor, DFF40) and its inhibitor, ICAD (DFF45). These proteins are associated with the chromatin condensation and DNA fragmentation events of apoptosis; the CIDE_N domain is thought to regulate the activity of the CAD/DFF40 and ICAD/DFF45 during apoptosis. In normal cells, DFF exists in the nucleus as a heterodimer composed of CAD/DFF40 as a latent nuclease and its chaperone and inhibitor subunit ICAD/DFF45. Apoptotic activation of caspase-3 results in the cleavage of DFF45/ICAD and release of active DFF40/CAD nuclease.
Probab=24.73 E-value=26 Score=23.22 Aligned_cols=21 Identities=19% Similarity=0.395 Sum_probs=14.5
Q ss_pred ceEEecceecCCCeEeCCCCEEE
Q psy17993 163 NMICVEAGHVTSPVLLHPGTTFE 185 (191)
Q Consensus 163 ~fvCiEP~~~~~~~~L~pGe~~~ 185 (191)
-|-|+++-.. .+.|.|||+|.
T Consensus 59 yF~tLp~nT~--l~~L~~gq~W~ 79 (80)
T cd06536 59 YFLCLPPNTK--FVLLAENEKWA 79 (80)
T ss_pred HHhhCCCCcE--EEEECCCCccC
Confidence 4556666332 68999999985
No 66
>PF14742 GDE_N_bis: N-terminal domain of (some) glycogen debranching enzymes
Probab=24.71 E-value=3.2e+02 Score=20.85 Aligned_cols=39 Identities=15% Similarity=0.065 Sum_probs=31.1
Q ss_pred CCCcceEEEEEEEEeCCceEEEEEEEeCCCCceeEEeeccc
Q psy17993 25 PYFARFRLTYRLILREKELHFNIGVYNPSNDLAFNFNLLLH 65 (191)
Q Consensus 25 ~yP~~f~l~~~~~L~~~~L~~~l~v~N~~~~~~m~f~~g~H 65 (191)
...-...++-+-.|.++ |.-+++|+|.+.+ |..+.+.++
T Consensus 76 ~~~~~l~l~R~r~v~~~-~~E~l~l~N~~~~-pv~~~l~l~ 114 (194)
T PF14742_consen 76 IPDGTLHLRRERFVGGG-LYERLTLTNYSPE-PVELTLSLE 114 (194)
T ss_pred cCCCeEEEEEEEEECCC-CEEEEEEEeCCCC-CEEEEEEEE
Confidence 34566777777788777 9999999999999 888887544
No 67
>PF00207 A2M: Alpha-2-macroglobulin family; InterPro: IPR001599 This entry contains serum complement C3 and C4 precursors and alpha-macrogrobulins. The alpha-macroglobulin (aM) family of proteins includes protease inhibitors [], typified by the human tetrameric a2-macroglobulin (a2M); they belong to the MEROPS proteinase inhibitor family I39, clan IL. These protease inhibitors share several defining properties, which include (i) the ability to inhibit proteases from all catalytic classes, (ii) the presence of a 'bait region' and a thiol ester, (iii) a similar protease inhibitory mechanism and (iv) the inactivation of the inhibitory capacity by reaction of the thiol ester with small primary amines. aM protease inhibitors inhibit by steric hindrance []. The mechanism involves protease cleavage of the bait region, a segment of the aM that is particularly susceptible to proteolytic cleavage, which initiates a conformational change such that the aM collapses about the protease. In the resulting aM-protease complex, the active site of the protease is sterically shielded, thus substantially decreasing access to protein substrates. Two additional events occur as a consequence of bait region cleavage, namely (i) the h-cysteinyl-g-glutamyl thiol ester becomes highly reactive and (ii) a major conformational change exposes a conserved COOH-terminal receptor binding domain [] (RBD). RBD exposure allows the aM protease complex to bind to clearance receptors and be removed from circulation []. Tetrameric, dimeric, and, more recently, monomeric aM protease inhibitors have been identified [, ].; GO: 0004866 endopeptidase inhibitor activity; PDB: 3KLS_B 3PRX_C 3KM9_B 3PVM_C 3CU7_A 4E0S_A 4A5W_A 2PN5_A 3FRP_G 3HRZ_B ....
Probab=24.56 E-value=2.1e+02 Score=18.75 Aligned_cols=14 Identities=29% Similarity=0.104 Sum_probs=6.8
Q ss_pred cCCCeEEEEEccCC
Q psy17993 7 ATLDAFCIFPYLDI 20 (191)
Q Consensus 7 ~~~~~~~~~~l~~~ 20 (191)
+++...+.+.+-++
T Consensus 12 ~~G~~~~~~~lPd~ 25 (92)
T PF00207_consen 12 ANGKATFSFTLPDS 25 (92)
T ss_dssp TTEEEEEEEE-SSS
T ss_pred CCCEEEEEEECCCC
Confidence 34455566666554
No 68
>PF06165 Glyco_transf_36: Glycosyltransferase family 36; InterPro: IPR010383 The biosynthesis of disaccharides, oligosaccharides and polysaccharides involves the action of hundreds of different glycosyltransferases. These enzymes catalyse the transfer of sugar moieties from activated donor molecules to specific acceptor molecules, forming glycosidic bonds. A classification of glycosyltransferases using nucleotide diphospho-sugar, nucleotide monophospho-sugar and sugar phosphates (2.4.1.- from EC) and related proteins into distinct sequence based families has been described []. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. The same three-dimensional fold is expected to occur within each of the families. Because 3-D structures are better conserved than sequences, several of the families defined on the basis of sequence similarities may have similar 3-D structures and therefore form 'clans'. The glycosyltransferase family 36 includes cellobiose phosphorylase (2.4.1.20 from EC), cellodextrin phosphorylase (2.4.1.49 from EC), and chitobiose phosphorylase. Many members of this family contain two copies of the domain represented in this entry.; PDB: 3QDE_A 3RRS_B 1V7V_A 1V7W_A 1V7X_A 3ACT_B 2CQT_A 3QFY_B 3QFZ_A 2CQS_A ....
Probab=24.53 E-value=2e+02 Score=19.84 Aligned_cols=34 Identities=9% Similarity=0.192 Sum_probs=18.6
Q ss_pred cceEEEEEEEEeC-CceEE-EEEEEeCCCCceeEEee
Q psy17993 28 ARFRLTYRLILRE-KELHF-NIGVYNPSNDLAFNFNL 62 (191)
Q Consensus 28 ~~f~l~~~~~L~~-~~L~~-~l~v~N~~~~~~m~f~~ 62 (191)
...+.++++.+.. +.+++ .++++|++++ +.-.++
T Consensus 48 ~gi~~~~~v~V~~~~~vEi~~l~l~N~~~~-~r~L~v 83 (110)
T PF06165_consen 48 GGIETELTVFVPPDDPVEIRRLRLTNTSNR-PRRLSV 83 (110)
T ss_dssp TTEEEEEEEE--TTSSEEEEEEEEEE-SSS--EEEEE
T ss_pred CCEEEEEEEEEcCCCCEEEEEEEEEECcCC-cEEEEE
Confidence 4455566665542 33444 4889999988 766555
No 69
>PF10633 NPCBM_assoc: NPCBM-associated, NEW3 domain of alpha-galactosidase; InterPro: IPR018905 This domain has been named NEW3, but its function is not known. It is found on proteins which are bacterial galactosidases [].; PDB: 1EUT_A 2BZD_A 1WCQ_C 2BER_A 1W8O_A 1EUU_A 1W8N_A.
Probab=24.47 E-value=61 Score=20.65 Aligned_cols=16 Identities=13% Similarity=0.212 Sum_probs=11.4
Q ss_pred CCceEEEEEEEeCCCC
Q psy17993 40 EKELHFNIGVYNPSND 55 (191)
Q Consensus 40 ~~~L~~~l~v~N~~~~ 55 (191)
|...+++++|+|.|..
T Consensus 4 G~~~~~~~tv~N~g~~ 19 (78)
T PF10633_consen 4 GETVTVTLTVTNTGTA 19 (78)
T ss_dssp TEEEEEEEEEE--SSS
T ss_pred CCEEEEEEEEEECCCC
Confidence 4567899999999976
No 70
>PF07610 DUF1573: Protein of unknown function (DUF1573); InterPro: IPR011467 These hypothetical proteins from bacteria, such as Rhodopirellula baltica, Bacteroides thetaiotaomicron and Porphyromonas gingivalis, share a region of conserved sequence towards their N termini.
Probab=24.38 E-value=77 Score=18.08 Aligned_cols=12 Identities=8% Similarity=0.155 Sum_probs=7.4
Q ss_pred EEEEeCCCCceeE
Q psy17993 47 IGVYNPSNDLAFN 59 (191)
Q Consensus 47 l~v~N~~~~~~m~ 59 (191)
++++|+|++ ++-
T Consensus 2 F~~~N~g~~-~L~ 13 (45)
T PF07610_consen 2 FEFTNTGDS-PLV 13 (45)
T ss_pred EEEEECCCC-cEE
Confidence 566777766 443
No 71
>PF07705 CARDB: CARDB; InterPro: IPR011635 The APHP (acidic peptide-dependent hydrolases/peptidase) domain is found in a variety of different proteins.; PDB: 2KUT_A 2L0D_A 3IDU_A 2KL6_A.
Probab=23.84 E-value=91 Score=20.25 Aligned_cols=17 Identities=12% Similarity=0.175 Sum_probs=14.1
Q ss_pred eCCceEEEEEEEeCCCC
Q psy17993 39 REKELHFNIGVYNPSND 55 (191)
Q Consensus 39 ~~~~L~~~l~v~N~~~~ 55 (191)
.+..++++++|+|.|..
T Consensus 17 ~g~~~~i~~~V~N~G~~ 33 (101)
T PF07705_consen 17 PGEPVTITVTVKNNGTA 33 (101)
T ss_dssp TTSEEEEEEEEEE-SSS
T ss_pred CCCEEEEEEEEEECCCC
Confidence 46789999999999987
No 72
>PF13739 DUF4163: Domain of unknown function (DUF4163); PDB: 3CYG_B.
Probab=23.44 E-value=2.1e+02 Score=18.40 Aligned_cols=35 Identities=11% Similarity=0.235 Sum_probs=19.2
Q ss_pred CCCcceEEEEEEEEeC-CceEEEEEEEeCCCCceeEEeecccee
Q psy17993 25 PYFARFRLTYRLILRE-KELHFNIGVYNPSNDLAFNFNLLLHTY 67 (191)
Q Consensus 25 ~yP~~f~l~~~~~L~~-~~L~~~l~v~N~~~~~~m~f~~g~Hpy 67 (191)
.+||.+...+.+.... +-|++.+ . .--|..|.||-
T Consensus 64 ~~~~~~~~~~~v~~~~~~ilSi~~--~------~y~ytGGAHG~ 99 (101)
T PF13739_consen 64 EFPYELNIDYEVTYNSDNILSISI--N------YYSYTGGAHGM 99 (101)
T ss_dssp -S-EEEEEEEEEEEE-SSEEEEEE--E------EEEESSSSS-E
T ss_pred CCCeEEEEEEEEEecCCCEEEEEE--E------eEEEcCCCCCC
Confidence 3566666666666543 3344444 3 56788888874
No 73
>PF05753 TRAP_beta: Translocon-associated protein beta (TRAPB); InterPro: IPR008856 This family consists of several eukaryotic translocon-associated protein beta (TRAPB) or signal sequence receptor beta subunit (SSR-beta) proteins. The normal translocation of nascent polypeptides into the lumen of the endoplasmic reticulum (ER) is thought to be aided in part by a translocon-associated protein (TRAP) complex consisting of 4 protein subunits. The association of mature proteins with the ER and Golgi, or other intracellular locales, such as lysosomes, depends on the initial targeting of the nascent polypeptide to the ER membrane. A similar scenario must also exist for proteins destined for secretion [].; GO: 0005783 endoplasmic reticulum, 0016021 integral to membrane
Probab=23.21 E-value=85 Score=24.06 Aligned_cols=20 Identities=10% Similarity=0.386 Sum_probs=17.3
Q ss_pred EEEeCCceEEEEEEEeCCCC
Q psy17993 36 LILREKELHFNIGVYNPSND 55 (191)
Q Consensus 36 ~~L~~~~L~~~l~v~N~~~~ 55 (191)
|-+.+..+.++++|.|.|+.
T Consensus 33 ~~v~g~~v~V~~~iyN~G~~ 52 (181)
T PF05753_consen 33 YLVEGEDVTVTYTIYNVGSS 52 (181)
T ss_pred cccCCcEEEEEEEEEECCCC
Confidence 44568889999999999999
No 74
>cd06539 CIDE_N_A CIDE_N domain of CIDE-A proteins. The CIDE_N (cell death-inducing DFF45-like effector, N-terminal) domain is found at the N-terminus of the CIDE (cell death-inducing DFF45-like effector) proteins. These proteins are associated with the chromatin condensation and DNA fragmentation events of apoptosis; the CIDE_N domain is thought to regulate the activity of the CAD/DFF40, ICAD/DFF45, and CIDE nucleases during apoptosis. The CIDE protein family includes 3 members: CIDE-A, CIDE-B, and FSP27(CIDE-C). Based on sequence similarity with DFF40 and DFF45, the CIDE proteins were initially characterized as mitochondrial activators of apoptosis. However, strong metabolic phenotypes of mice lacking CIDE-A and CIDE-B indicated that this family may play critical roles in energy balance.
Probab=23.17 E-value=29 Score=22.88 Aligned_cols=21 Identities=10% Similarity=0.166 Sum_probs=14.7
Q ss_pred ceEEecceecCCCeEeCCCCEEE
Q psy17993 163 NMICVEAGHVTSPVLLHPGTTFE 185 (191)
Q Consensus 163 ~fvCiEP~~~~~~~~L~pGe~~~ 185 (191)
-|-|+++-. .++.|.|||+|.
T Consensus 57 yF~~LpdnT--~lm~L~~gq~W~ 77 (78)
T cd06539 57 FFQTLGDNT--HFMVLEKGQKWT 77 (78)
T ss_pred HHhhCCCCC--EEEEECCCCccC
Confidence 465666632 268999999985
No 75
>KOG2143|consensus
Probab=22.28 E-value=69 Score=29.74 Aligned_cols=14 Identities=43% Similarity=1.035 Sum_probs=12.0
Q ss_pred EEeCCCcEEEeCCc
Q psy17993 134 QKYNFPDTVIWNPW 147 (191)
Q Consensus 134 ~~~~~~~~vvWnp~ 147 (191)
...|+|+++||||.
T Consensus 793 srsG~PDL~lWNp~ 806 (854)
T KOG2143|consen 793 SRSGFPDLTLWNPE 806 (854)
T ss_pred ccCCCCceeeecCC
Confidence 35789999999995
No 76
>PF09624 DUF2393: Protein of unknown function (DUF2393); InterPro: IPR013417 The function of this protein is unknown. It is always found as part of a two-gene operon with IPR013416 from INTERPRO, a protein that appears to span the membrane seven times. It has so far been found in the bacteria Anabaena sp. (strain PCC 7120), Agrobacterium tumefaciens, Rhizobium meliloti, and Gloeobacter violaceus.
Probab=22.23 E-value=2.3e+02 Score=20.45 Aligned_cols=32 Identities=16% Similarity=0.053 Sum_probs=23.2
Q ss_pred CCcceEEEEEEEEe-CCceEEEEEEEeCCCCcee
Q psy17993 26 YFARFRLTYRLILR-EKELHFNIGVYNPSNDLAF 58 (191)
Q Consensus 26 yP~~f~l~~~~~L~-~~~L~~~l~v~N~~~~~~m 58 (191)
+|-...+....++. ++++-+..+|+|.|+. ++
T Consensus 46 ~~~~~~~~~~~~l~~~~~~~v~g~V~N~g~~-~i 78 (149)
T PF09624_consen 46 KKIELTLTSQKRLQYSESFYVDGTVTNTGKF-TI 78 (149)
T ss_pred CCceEEEeeeeeeeeccEEEEEEEEEECCCC-Ee
Confidence 34455555555554 7889999999999988 54
No 77
>cd06537 CIDE_N_B CIDE_N domain of CIDE-B proteins. The CIDE_N (cell death-inducing DFF45-like effector, N-terminal) domain is found at the N-terminus of the CIDE (cell death-inducing DFF45-like effector) proteins. These proteins are associated with the chromatin condensation and DNA fragmentation events of apoptosis; the CIDE_N domain is thought to regulate the activity of the CAD/DFF40, ICAD/DFF45 and CIDE nucleases during apoptosis. The CIDE protein family includes 3 members: CIDE-A, CIDE-B, and FSP27(CIDE-C). Based on sequence similarity with DFF40 and DFF45, CIDE proteins were initially characterized as mitochondrial activators of apoptosis. However, strong metabolic phenotypes of mice lacking CIDE-A and CIDE-B indicated that this family may play critical roles in energy balance.
Probab=21.69 E-value=33 Score=22.78 Aligned_cols=22 Identities=14% Similarity=0.256 Sum_probs=15.0
Q ss_pred ceEEecceecCCCeEeCCCCEEEE
Q psy17993 163 NMICVEAGHVTSPVLLHPGTTFEA 186 (191)
Q Consensus 163 ~fvCiEP~~~~~~~~L~pGe~~~~ 186 (191)
-|-|+++-. .++.|.|||+|.-
T Consensus 56 yF~tLpdnT--~lm~L~~gq~W~p 77 (81)
T cd06537 56 FFELLEDDT--CLMVLEQGQSWSP 77 (81)
T ss_pred HHhhCCCCC--EEEEECCCCccCC
Confidence 355666632 2689999999863
No 78
>PF05618 Zn_protease: Putative ATP-dependant zinc protease; InterPro: IPR008503 This family consists of several hypothetical proteins from different archaeal and bacterial species.; PDB: 2PMA_B.
Probab=21.22 E-value=1.7e+02 Score=21.27 Aligned_cols=34 Identities=21% Similarity=0.145 Sum_probs=21.0
Q ss_pred ceEEEEEEEEeCCceEEEEEEEeCCCCceeEEeec
Q psy17993 29 RFRLTYRLILREKELHFNIGVYNPSNDLAFNFNLL 63 (191)
Q Consensus 29 ~f~l~~~~~L~~~~L~~~l~v~N~~~~~~m~f~~g 63 (191)
.+.+++.+.|.+....++++++|.+.- ..|.=+|
T Consensus 83 R~VV~~~~~lg~~~~~~e~tL~dR~~m-~yp~LlG 116 (138)
T PF05618_consen 83 RPVVETTLCLGGKTWKIEFTLTDRSNM-KYPMLLG 116 (138)
T ss_dssp CEEEEEEEEETTEEEEEEEEEE-S--S-S-SEEE-
T ss_pred eeEEEEEEEECCEEEEEEEEEcCCCcC-cCCEEEE
Confidence 355677888888788888888888765 5555555
No 79
>cd03780 MATH_TRAF5 Tumor Necrosis Factor Receptor (TNFR)-Associated Factor (TRAF) family, TRAF5 subfamily, TRAF domain, C-terminal MATH subdomain; TRAF molecules serve as adapter proteins that link TNFRs and downstream kinase cascades resulting in the activation of transcription factors and the regulation of cell survival, proliferation and stress responses. TRAF5 was identified as an activator of nuclear factor-kappaB and a regulator of lymphotoxin-beta receptor and CD40 signaling. Its interaction with CD40 is indirect, involving hetero-oligomerization with TRAF3. In addition, TRAF5 has been shown to associate with other TNFRs including CD27, CD30, OX40 and GITR (glucocorticoid-induced TNFR). It plays a role in modulating Th2 immune responses (driven by OX40 costimulation) and T-cell activation (triggered by GITR). It is also involved in osteoclastogenesis. TRAF5 contains a RING finger domain, five zinc finger domains, and a TRAF domain. The TRAF domain can be divided into a more dive
Probab=21.19 E-value=2.7e+02 Score=20.40 Aligned_cols=31 Identities=16% Similarity=0.152 Sum_probs=17.2
Q ss_pred CCCeEEEEEccCC--CcccCCCcceEEEEEEEEeC
Q psy17993 8 TLDAFCIFPYLDI--FCIFPYFARFRLTYRLILRE 40 (191)
Q Consensus 8 ~~~~~~~~~l~~~--~t~~~yP~~f~l~~~~~L~~ 40 (191)
.+..++-|.|... .....|| |+.+++++|-+
T Consensus 53 ~~~iSv~l~l~~g~~D~~l~wp--~~~~~tfsLlD 85 (148)
T cd03780 53 GTHLSLYFVVMRGEFDSLLQWP--FRQRVTLMLLD 85 (148)
T ss_pred CCEEEEEEEEecCccccccCcc--eEEEEEEEEEC
Confidence 3345666666433 2333454 67777777754
No 80
>cd01615 CIDE_N CIDE_N domain, found at the N-terminus of the CIDE (cell death-inducing DFF45-like effector) proteins, as well as CAD nuclease (caspase-activated DNase/DNA fragmentation factor, DFF40) and its inhibitor, ICAD(DFF45). These proteins are associated with the chromatin condensation and DNA fragmentation events of apoptosis; the CIDE_N domain is thought to regulate the activity of ICAD/DFF45, and the CAD/DFF40 and CIDE nucleases during apoptosis. The CIDE-N domain is also found in the FSP27/CIDE-C protein.
Probab=21.07 E-value=36 Score=22.42 Aligned_cols=21 Identities=24% Similarity=0.442 Sum_probs=14.9
Q ss_pred ceEEecceecCCCeEeCCCCEEE
Q psy17993 163 NMICVEAGHVTSPVLLHPGTTFE 185 (191)
Q Consensus 163 ~fvCiEP~~~~~~~~L~pGe~~~ 185 (191)
-|-|+++-.. ++.|.|||.|.
T Consensus 57 YF~tLp~nT~--l~~l~~gq~W~ 77 (78)
T cd01615 57 YFQTLPDNTV--LMLLEPGQKWT 77 (78)
T ss_pred HHhcCCCCcE--EEEECCCCCcC
Confidence 4667766332 68999999985
No 81
>PLN03080 Probable beta-xylosidase; Provisional
Probab=21.05 E-value=84 Score=29.81 Aligned_cols=13 Identities=23% Similarity=0.276 Sum_probs=8.7
Q ss_pred eEEEEEEEeCCCC
Q psy17993 43 LHFNIGVYNPSND 55 (191)
Q Consensus 43 L~~~l~v~N~~~~ 55 (191)
++++++|+|+|+.
T Consensus 686 ~~v~v~VtNtG~~ 698 (779)
T PLN03080 686 FNVHISVSNVGEM 698 (779)
T ss_pred EEEEEEEEECCcc
Confidence 6666777777654
No 82
>PF06205 GT36_AF: Glycosyltransferase 36 associated family ; InterPro: IPR010403 This domain is found in the NvdB protein (P20471 from SWISSPROT), which is involved in the production of beta-(1-->2)-glucan.; PDB: 1V7V_A 1V7W_A 1V7X_A 3ACT_B 2CQT_A 3QFY_B 3QFZ_A 2CQS_A 3QG0_B 3AFJ_A ....
Probab=20.46 E-value=1.2e+02 Score=20.18 Aligned_cols=16 Identities=38% Similarity=0.484 Sum_probs=11.0
Q ss_pred CeEeCCCCEEEEEEEe
Q psy17993 175 PVLLHPGTTFEASQIL 190 (191)
Q Consensus 175 ~~~L~pGe~~~~~~~i 190 (191)
.+.|+||++.+..+-+
T Consensus 67 ~v~L~PGe~~~v~f~l 82 (90)
T PF06205_consen 67 RVTLEPGEEKEVVFLL 82 (90)
T ss_dssp EEEE-TT-EEEEEEEE
T ss_pred EEEECCCCEEEEEEEE
Confidence 5899999999877653
No 83
>PF02883 Alpha_adaptinC2: Adaptin C-terminal domain; InterPro: IPR008152 Proteins synthesized on the ribosome and processed in the endoplasmic reticulum are transported from the Golgi apparatus to the trans-Golgi network (TGN), and from there via small carrier vesicles to their final destination compartment. These vesicles have specific coat proteins (such as clathrin or coatomer) that are important for cargo selection and direction of transport []. Clathrin coats contain both clathrin (acts as a scaffold) and adaptor complexes that link clathrin to receptors in coated vesicles. Clathrin-associated protein complexes are believed to interact with the cytoplasmic tails of membrane proteins, leading to their selection and concentration. The two major types of clathrin adaptor complexes are the heterotetrameric adaptor protein (AP) complexes, and the monomeric GGA (Golgi-localising, Gamma-adaptin ear domain homology, ARF-binding proteins) adaptors [, ]. AP (adaptor protein) complexes are found in coated vesicles and clathrin-coated pits. AP complexes connect cargo proteins and lipids to clathrin at vesicle budding sites, as well as binding accessory proteins that regulate coat assembly and disassembly (such as AP180, epsins and auxilin). There are different AP complexes in mammals. AP1 is responsible for the transport of lysosomal hydrolases between the TGN and endosomes []. AP2 associates with the plasma membrane and is responsible for endocytosis []. AP3 is responsible for protein trafficking to lysosomes and other related organelles []. AP4 is less well characterised. AP complexes are heterotetramers composed of two large subunits (adaptins), a medium subunit (mu) and a small subunit (sigma). For example, in AP1 these subunits are gamma-1-adaptin, beta-1-adaptin, mu-1 and sigma-1, while in AP2 they are alpha-adaptin, beta-2-adaptin, mu-2 and sigma-2. Each subunit has a specific function. Adaptins recognise and bind to clathrin through their hinge region (clathrin box), and recruit accessory proteins that modulate AP function through their C-terminal ear (appendage) domains. Mu recognises tyrosine-based sorting signals within the cytoplasmic domains of transmembrane cargo proteins []. One function of clathrin and AP2 complex-mediated endocytosis is to regulate the number of GABA(A) receptors available at the cell surface []. GGAs (Golgi-localising, Gamma-adaptin ear domain homology, ARF-binding proteins) are a family of monomeric clathrin adaptor proteins that are conserved from yeasts to humans. GGAs regulate clathrin-mediated the transport of proteins (such as mannose 6-phosphate receptors) from the TGN to endosomes and lysosomes through interactions with TGN-sorting receptors, sometimes in conjunction with AP-1 [, ]. GGAs bind cargo, membranes, clathrin and accessory factors. GGA1, GGA2 and GGA3 all contain a domain homologous to the ear domain of gamma-adaptin. GGAs are composed of a single polypeptide with four domains: an N-terminal VHS (Vps27p/Hrs/Stam) domain, a GAT (GGA and Tom1) domain, a hinge region, and a C-terminal GAE (gamma-adaptin ear) domain. The VHS domain is responsible for endocytosis and signal transduction, recognising transmembrane cargo through the ACLL sequence in the cytoplasmic domains of sorting receptors []. The GAT domain (also found in Tom1 proteins) interacts with ARF (ADP-ribosylation factor) to regulate membrane trafficking [], and with ubiquitin for receptor sorting []. The hinge region contains a clathrin box for recognition and binding to clathrin, similar to that found in AP adaptins. The GAE domain is similar to the AP gamma-adaptin ear domain, and is responsible for the recruitment of accessory proteins that regulate clathrin-mediated endocytosis []. This entry represents a beta-sandwich structural motif found in the appendage (ear) domain of alpha-, beta- and gamma-adaptin from AP clathrin adaptor complexes, and the GAE (gamma-adaptin ear) domain of GGA adaptor proteins. These domains have an immunoglobulin-like beta-sandwich fold containing 7 or 8 strands in 2 beta-sheets in a Greek key topology [, ]. Although these domains share a similar fold, there is little sequence identity between the alpha/beta-adaptins and gamma-adaptin/GAE. More information about these proteins can be found at Protein of the Month: Clathrin [].; GO: 0006886 intracellular protein transport, 0016192 vesicle-mediated transport, 0030131 clathrin adaptor complex; PDB: 3MNM_B 3ZY7_B 1GYU_A 1GYW_B 2A7B_A 1GYV_A 2E9G_A 1E42_B 2G30_A 2IV9_B ....
Probab=20.25 E-value=2.9e+02 Score=18.71 Aligned_cols=48 Identities=10% Similarity=0.115 Sum_probs=29.5
Q ss_pred eEEEEEEEE--eCCceEEEEEEEeCCCCceeEEeeccceeEEeCCCCceEEeeC
Q psy17993 30 FRLTYRLIL--REKELHFNIGVYNPSNDLAFNFNLLLHTYLKVPDVRRCQVNGL 81 (191)
Q Consensus 30 f~l~~~~~L--~~~~L~~~l~v~N~~~~~~m~f~~g~HpyF~v~~~~~~~~~gl 81 (191)
+++.+..+- .++.+++.+...|+... +|. +|+--++++..-.+.+...
T Consensus 11 l~I~~~~~~~~~~~~~~i~~~f~N~s~~-~it---~f~~q~avpk~~~l~l~~~ 60 (115)
T PF02883_consen 11 LQIGFKSEKSPNPNQGRIKLTFGNKSSQ-PIT---NFSFQAAVPKSFKLQLQPP 60 (115)
T ss_dssp EEEEEEEEECCETTEEEEEEEEEE-SSS--BE---EEEEEEEEBTTSEEEEEES
T ss_pred EEEEEEEEecCCCCEEEEEEEEEECCCC-Ccc---eEEEEEEeccccEEEEeCC
Confidence 344444444 68899999999999877 654 4455577774444455433
Done!