Query psy9188
Match_columns 104
No_of_seqs 142 out of 1164
Neff 6.9
Searched_HMMs 46136
Date Fri Aug 16 21:09:09 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy9188.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/9188hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 COG0179 MhpD 2-keto-4-pentenoa 100.0 9.6E-39 2.1E-43 239.4 8.6 101 1-103 147-248 (266)
2 KOG1535|consensus 100.0 7.9E-38 1.7E-42 224.9 7.7 103 1-103 98-200 (217)
3 TIGR02303 HpaG-C-term 4-hydrox 100.0 7.7E-34 1.7E-38 210.7 8.1 100 1-103 128-228 (245)
4 PRK15203 4-hydroxyphenylacetat 100.0 9.3E-33 2E-37 218.3 8.3 95 1-103 95-189 (429)
5 PRK10691 hypothetical protein; 100.0 2.8E-32 6E-37 199.4 8.7 103 1-103 102-205 (219)
6 PRK15203 4-hydroxyphenylacetat 100.0 3.2E-29 6.8E-34 198.3 8.4 100 1-103 308-408 (429)
7 PRK12764 hypothetical protein; 100.0 3.8E-29 8.2E-34 200.9 8.3 99 1-103 106-205 (500)
8 TIGR02305 HpaG-N-term 4-hydrox 99.9 1.3E-27 2.9E-32 172.9 7.6 98 1-103 94-191 (205)
9 PLN02856 fumarylacetoacetase 99.9 1.9E-27 4.2E-32 187.4 9.0 99 1-102 241-369 (424)
10 TIGR01266 fum_ac_acetase fumar 99.9 4.9E-27 1.1E-31 184.8 8.4 99 1-102 233-361 (415)
11 PF01557 FAA_hydrolase: Fumary 99.9 2.2E-27 4.7E-32 172.2 3.5 102 1-103 94-202 (218)
12 TIGR03220 catechol_dmpE 2-oxop 99.7 5.8E-17 1.3E-21 120.8 7.2 88 15-103 147-241 (255)
13 PRK11342 mhpD 2-keto-4-penteno 99.6 4.8E-16 1E-20 116.3 5.9 76 27-103 164-245 (262)
14 TIGR02312 HpaH 2-oxo-hepta-3-e 99.4 3E-13 6.4E-18 101.6 5.4 75 28-103 172-252 (267)
15 KOG2843|consensus 99.4 6.4E-13 1.4E-17 101.1 4.3 93 1-96 235-357 (420)
16 TIGR03218 catechol_dmpH 4-oxal 98.9 5.7E-09 1.2E-13 78.3 7.5 76 27-103 168-249 (263)
17 COG3971 2-keto-4-pentenoate hy 98.7 4.5E-08 9.7E-13 73.4 5.8 76 27-103 166-247 (264)
18 COG3970 Fumarylacetoacetate (F 98.3 1.1E-06 2.3E-11 67.8 5.0 89 1-92 221-318 (379)
19 PF11010 DUF2848: Protein of u 97.2 0.00041 8.9E-09 50.3 3.9 76 27-103 95-177 (194)
20 PRK06488 sulfur carrier protei 86.1 2 4.4E-05 25.1 4.2 24 46-74 1-24 (65)
21 cd01617 DCX Ubiquitin-like dom 83.3 1.5 3.2E-05 27.1 2.8 51 45-95 2-56 (80)
22 PRK06083 sulfur carrier protei 78.5 8.7 0.00019 24.1 5.2 32 40-75 13-44 (84)
23 KOG1379|consensus 77.7 1.3 2.9E-05 34.6 1.5 53 40-92 187-255 (330)
24 PRK08053 sulfur carrier protei 75.1 8.2 0.00018 22.7 4.2 24 47-74 2-25 (66)
25 PRK06944 sulfur carrier protei 75.1 11 0.00024 21.7 4.7 24 47-74 2-25 (65)
26 PRK07440 hypothetical protein; 70.1 14 0.00031 22.1 4.5 26 45-74 4-29 (70)
27 PRK05863 sulfur carrier protei 66.5 15 0.00032 21.5 3.9 25 46-74 1-25 (65)
28 PRK07696 sulfur carrier protei 65.6 15 0.00032 21.8 3.9 26 46-74 1-26 (67)
29 PRK05659 sulfur carrier protei 64.9 17 0.00037 21.0 4.0 10 47-56 2-11 (66)
30 cd00565 ThiS ThiaminS ubiquiti 58.1 24 0.00052 20.4 3.8 15 78-92 48-62 (65)
31 PF03990 DUF348: Domain of unk 53.0 35 0.00077 18.3 4.8 37 46-87 2-38 (43)
32 PRK15450 signal transduction p 52.1 11 0.00023 23.9 1.6 21 67-87 27-47 (85)
33 PF00498 FHA: FHA domain; Int 51.3 14 0.0003 21.1 2.0 15 75-89 54-68 (68)
34 TIGR01683 thiS thiamine biosyn 50.7 32 0.0007 19.8 3.5 15 78-92 47-61 (64)
35 PRK15175 Vi polysaccharide exp 46.6 20 0.00043 28.4 2.7 39 44-90 198-236 (355)
36 PF02538 Hydantoinase_B: Hydan 43.9 19 0.00041 29.7 2.3 50 50-102 46-100 (527)
37 COG3802 GguC Uncharacterized p 43.4 36 0.00079 26.3 3.5 73 14-86 205-282 (333)
38 COG0146 HyuB N-methylhydantoin 40.6 20 0.00043 30.2 1.9 43 51-93 48-95 (563)
39 cd03701 IF2_IF5B_II IF2_IF5B_I 40.5 19 0.0004 22.9 1.4 19 76-94 23-41 (95)
40 PRK08364 sulfur carrier protei 40.5 78 0.0017 18.6 4.8 8 46-53 5-12 (70)
41 cd03703 aeIF5B_II aeIF5B_II: T 38.9 22 0.00047 23.6 1.6 19 76-94 23-41 (110)
42 PF03300 Tenui_NS4: Tenuivirus 38.0 18 0.00039 27.4 1.2 45 18-65 152-196 (283)
43 PF02938 GAD: GAD domain; Int 37.6 27 0.00059 21.9 1.8 41 46-89 44-84 (95)
44 PF11064 DUF2865: Protein of u 35.6 17 0.00037 24.3 0.7 18 71-88 96-113 (116)
45 PRK11479 hypothetical protein; 32.3 28 0.00062 26.6 1.5 37 49-91 40-76 (274)
46 KOG3276|consensus 32.1 28 0.00062 23.5 1.3 20 68-87 73-92 (125)
47 cd03702 IF2_mtIF2_II This fami 30.2 35 0.00076 21.7 1.5 18 76-93 23-40 (95)
48 PF12077 DUF3556: Transmembran 29.6 99 0.0022 26.1 4.3 44 45-88 475-522 (574)
49 PRK10030 hypothetical protein; 29.4 36 0.00078 24.5 1.6 16 75-90 16-31 (197)
50 PF00877 NLPC_P60: NlpC/P60 fa 29.3 30 0.00065 21.6 1.0 15 76-90 48-62 (105)
51 PF15656 Tox-HDC: Toxin with a 29.2 90 0.002 21.0 3.4 31 65-95 12-43 (119)
52 PF01052 SpoA: Surface present 28.8 27 0.00058 20.8 0.7 17 76-92 25-41 (77)
53 COG4301 Uncharacterized conser 27.8 57 0.0012 25.3 2.4 27 65-91 167-194 (321)
54 TIGR02219 phage_NlpC_fam putat 27.3 36 0.00078 22.7 1.2 13 78-90 75-87 (134)
55 PF13550 Phage-tail_3: Putativ 27.2 34 0.00073 22.7 1.0 15 77-91 137-151 (164)
56 PF14894 Lsm_C: Lsm C-terminal 26.4 59 0.0013 19.6 1.9 20 67-86 1-20 (64)
57 PRK06033 hypothetical protein; 25.8 56 0.0012 20.3 1.8 17 76-92 24-40 (83)
58 cd00060 FHA Forkhead associate 25.2 72 0.0016 19.0 2.2 15 76-90 79-93 (102)
59 COG3862 Uncharacterized protei 25.1 74 0.0016 21.2 2.3 62 26-87 29-98 (117)
60 PF01191 RNA_pol_Rpb5_C: RNA p 25.0 37 0.0008 20.9 0.8 14 74-87 43-56 (74)
61 PRK11433 aldehyde oxidoreducta 24.6 2.8E+02 0.0061 20.5 5.5 17 41-57 47-63 (217)
62 cd00754 MoaD Ubiquitin domain 24.6 1.5E+02 0.0033 17.1 3.6 30 44-92 48-77 (80)
63 PRK09570 rpoH DNA-directed RNA 24.5 43 0.00094 20.9 1.1 13 75-87 47-59 (79)
64 PF13799 DUF4183: Domain of un 24.3 38 0.00083 21.4 0.8 38 43-91 42-79 (84)
65 PF15092 UPF0728: Uncharacteri 23.7 91 0.002 20.0 2.4 26 39-66 47-72 (88)
66 PRK11507 ribosome-associated p 23.5 51 0.0011 20.1 1.3 14 76-89 49-62 (70)
67 smart00537 DCX Domain in the D 23.5 1.4E+02 0.0031 18.5 3.4 52 42-93 4-60 (89)
68 PRK15078 polysaccharide export 23.2 1.1E+02 0.0025 24.1 3.5 39 44-90 211-251 (379)
69 KOG2915|consensus 22.7 70 0.0015 25.0 2.1 29 63-93 88-117 (314)
70 PF13533 Biotin_lipoyl_2: Biot 22.4 35 0.00076 18.9 0.3 14 75-88 19-32 (50)
71 TIGR03882 cyclo_dehyd_2 bacter 22.4 62 0.0014 23.0 1.7 26 10-35 159-184 (193)
72 COG0309 HypE Hydrogenase matur 22.3 56 0.0012 25.9 1.6 16 77-92 158-174 (339)
73 PF02814 UreE_N: UreE urease a 22.2 46 0.00099 19.4 0.8 16 74-89 45-60 (65)
74 TIGR02754 sod_Ni_protease nick 22.0 55 0.0012 19.8 1.2 10 79-88 11-20 (90)
75 cd00986 PDZ_LON_protease PDZ d 21.7 57 0.0012 19.1 1.2 10 79-88 24-33 (79)
76 PF00717 Peptidase_S24: Peptid 21.7 42 0.00092 19.0 0.6 10 79-88 10-19 (70)
77 PF13180 PDZ_2: PDZ domain; PD 21.5 49 0.0011 19.6 0.9 12 77-88 29-40 (82)
78 PF04312 DUF460: Protein of un 21.4 1.3E+02 0.0028 20.8 3.1 29 47-77 47-75 (138)
79 KOG1939|consensus 21.3 1.1E+02 0.0025 27.5 3.3 45 51-95 771-818 (1247)
80 PF04571 Lipin_N: lipin, N-ter 21.2 1.4E+02 0.003 19.9 3.0 35 21-55 34-71 (110)
81 TIGR03439 methyl_EasF probable 21.0 73 0.0016 24.7 2.0 27 65-91 170-198 (319)
82 PF13621 Cupin_8: Cupin-like d 20.9 38 0.00082 23.9 0.3 13 76-88 209-221 (251)
83 cd01674 Homoaconitase_Swivel H 20.8 1.6E+02 0.0035 20.1 3.4 31 66-96 23-59 (129)
84 cd00991 PDZ_archaeal_metallopr 20.3 63 0.0014 19.1 1.2 12 77-88 25-36 (79)
No 1
>COG0179 MhpD 2-keto-4-pentenoate hydratase/2-oxohepta-3-ene-1,7-dioic acid hydratase (catechol pathway) [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=100.00 E-value=9.6e-39 Score=239.43 Aligned_cols=101 Identities=45% Similarity=0.718 Sum_probs=96.1
Q ss_pred CchhhhhhHHHhCCCCCcCcccCCCCcccCCcc-cCCCCCCCCCeeEEEEECCEEEeecccCccCCCHHHHHHHHHcCCc
Q psy9188 1 MLFLFLQKHAREHGLPWTVGKSFDTACPVSDFI-PEHEIKDPDDVPLWLKVNGELRQKSTTGDMLFKTGDLISYISQHMT 79 (104)
Q Consensus 1 ~TaRd~q~~~~~~~~~~~~~K~~d~~~p~Gp~i-~~~~~~~~~~l~i~~~vNG~~~q~~~t~~m~~~~~~lia~~s~~~t 79 (104)
+|+||+|.. +.+++|.++|+||+|||+|||+ +.+++.++.++.|+++||||++|+++|++|+|+|++||+|+|+++|
T Consensus 147 ~T~Rd~Q~~--~~~~~w~~aK~~d~~~Pigp~iv~~~e~~d~~~l~l~~~vNGe~~Q~g~t~~Mi~~i~~lI~~lS~~~t 224 (266)
T COG0179 147 VTARDLQME--EKGRPWTRAKGFDTFAPVGPWIVTKDEISDPQNLPLSLRVNGEVRQRGNTSDMIFSIPELIAYLSRFMT 224 (266)
T ss_pred cchhcchhh--hhcCCcccccccCCCCCceeEEeccccCCCCccceEEEEECCEEEecCcHHHcccCHHHHHHHHhCCcc
Confidence 699999975 3689999999999999999997 7888889999999999999999999999999999999999999999
Q ss_pred cCCCcEEEcCCCCCcccccCCCcc
Q psy9188 80 LEPYDLILTGKAGFILMFQQSPHS 103 (104)
Q Consensus 80 L~pGDvi~tGTp~g~~~~~~g~~~ 103 (104)
|+|||||+||||+|++++++||+.
T Consensus 225 L~pGDvI~TGTP~Gvg~l~~GD~v 248 (266)
T COG0179 225 LEPGDVILTGTPSGVGFLKPGDVV 248 (266)
T ss_pred cCCCCEEEeCCCCCcccCCCCCEE
Confidence 999999999999999999999964
No 2
>KOG1535|consensus
Probab=100.00 E-value=7.9e-38 Score=224.94 Aligned_cols=103 Identities=46% Similarity=0.697 Sum_probs=100.5
Q ss_pred CchhhhhhHHHhCCCCCcCcccCCCCcccCCcccCCCCCCCCCeeEEEEECCEEEeecccCccCCCHHHHHHHHHcCCcc
Q psy9188 1 MLFLFLQKHAREHGLPWTVGKSFDTACPVSDFIPEHEIKDPDDVPLWLKVNGELRQKSTTGDMLFKTGDLISYISQHMTL 80 (104)
Q Consensus 1 ~TaRd~q~~~~~~~~~~~~~K~~d~~~p~Gp~i~~~~~~~~~~l~i~~~vNG~~~q~~~t~~m~~~~~~lia~~s~~~tL 80 (104)
+||||+|.+++++|+||..+|+||+|+|+|++++.+.++||.++.+.++|||+++|+++|++|+|+++.+|+|+|+++||
T Consensus 98 mtARd~q~~ak~~g~pw~l~K~~Dtf~Pis~~vpk~~v~Dp~nl~L~l~VnG~~~Q~g~T~~mifkip~li~~is~~~tL 177 (217)
T KOG1535|consen 98 MTARDWQDEAKKKGLPWTLGKGFDTFTPISAIVPKEKVPDPHNLWLWLRVNGETRQTGNTSLMIFKIPDLISRLSQIMTL 177 (217)
T ss_pred ccchhhhhhhhhcCCCeeeccccCccCcccccccHHHCCCccceEEEEEEccEEEecCchhhheecHHHHHHHHhhheee
Confidence 69999999999999999999999999999999999999999999999999999999999999999999999999999999
Q ss_pred CCCcEEEcCCCCCcccccCCCcc
Q psy9188 81 EPYDLILTGKAGFILMFQQSPHS 103 (104)
Q Consensus 81 ~pGDvi~tGTp~g~~~~~~g~~~ 103 (104)
+|||||+||||.|++++++||..
T Consensus 178 ~~GDvILTGTP~GVg~v~~Gd~i 200 (217)
T KOG1535|consen 178 EPGDVILTGTPEGVGEVKPGDVI 200 (217)
T ss_pred cCCCEEEecCCCccccccCCCEE
Confidence 99999999999999999999963
No 3
>TIGR02303 HpaG-C-term 4-hydroxyphenylacetate degradation bifunctional isomerase/decarboxylase, C-terminal subunit. This model represents one of two subunits/domains of the bifunctional isomerase/decarboxylase involved in 4-hydroxyphenylacetate degradation. In E. coli and some other species this enzyme is encoded by a single polypeptide containing both this domain and the closely related N-terminal domain (TIGR02305). In other species such as Pasteurella multocida these domains are found as two separate proteins (usually as tandem genes). Together, these domains carry out the decarboxylation of 5-oxopent-3-ene-1,2,5-tricarboxylic acid (OPET) to 2-hydroxy-2,4-diene-1,7-dioate (HHDD) and the subsequent isomerization to 2-oxohept-3-ene-1,7-dioate (OHED).
Probab=100.00 E-value=7.7e-34 Score=210.72 Aligned_cols=100 Identities=33% Similarity=0.422 Sum_probs=94.8
Q ss_pred CchhhhhhHHHhCCCCCcCcccCCCCcccCCcc-cCCCCCCCCCeeEEEEECCEEEeecccCccCCCHHHHHHHHHcCCc
Q psy9188 1 MLFLFLQKHAREHGLPWTVGKSFDTACPVSDFI-PEHEIKDPDDVPLWLKVNGELRQKSTTGDMLFKTGDLISYISQHMT 79 (104)
Q Consensus 1 ~TaRd~q~~~~~~~~~~~~~K~~d~~~p~Gp~i-~~~~~~~~~~l~i~~~vNG~~~q~~~t~~m~~~~~~lia~~s~~~t 79 (104)
||+||+|.. .+.||.++|+||+|||+|||+ ++++++|+.++.+++++||+++|++++++|+|++.+||+++|++++
T Consensus 128 ~T~Rd~q~~---~~~~~~~aK~~D~~~plGp~i~t~~~~~d~~~l~i~l~vNGe~~q~g~t~~ml~~v~~Li~~ls~~~t 204 (245)
T TIGR02303 128 YAIRDYLEN---YYRPNLRVKNRDTFTPIGPWIVDKEDVEDPMNLWLRTYVNGELTQEGNTSDMIFSVAELIEYLSEFMT 204 (245)
T ss_pred cchHHHHhh---hcCCcccccCCCCCEeeCCcCCCHHHcCCccccEEEEEECCEEEEecCHHHhccCHHHHHHHHhcCCC
Confidence 699999964 468999999999999999997 6888888999999999999999999999999999999999999999
Q ss_pred cCCCcEEEcCCCCCcccccCCCcc
Q psy9188 80 LEPYDLILTGKAGFILMFQQSPHS 103 (104)
Q Consensus 80 L~pGDvi~tGTp~g~~~~~~g~~~ 103 (104)
|+|||||+||||.+++++++||+.
T Consensus 205 L~pGDvIlTGTp~g~~~l~~GD~v 228 (245)
T TIGR02303 205 LEPGDVILTGTPKGLSDVKPGDVV 228 (245)
T ss_pred cCCCCEEEcCCCCCCeEcCCCCEE
Confidence 999999999999999999999975
No 4
>PRK15203 4-hydroxyphenylacetate degradation bifunctional isomerase/decarboxylase; Provisional
Probab=99.98 E-value=9.3e-33 Score=218.32 Aligned_cols=95 Identities=22% Similarity=0.291 Sum_probs=88.8
Q ss_pred CchhhhhhHHHhCCCCCcCcccCCCCcccCCcccCCCCCCCCCeeEEEEECCEEEeecccCccCCCHHHHHHHHHcCCcc
Q psy9188 1 MLFLFLQKHAREHGLPWTVGKSFDTACPVSDFIPEHEIKDPDDVPLWLKVNGELRQKSTTGDMLFKTGDLISYISQHMTL 80 (104)
Q Consensus 1 ~TaRd~q~~~~~~~~~~~~~K~~d~~~p~Gp~i~~~~~~~~~~l~i~~~vNG~~~q~~~t~~m~~~~~~lia~~s~~~tL 80 (104)
+|+||+|. ++||.++|+||+|||+||||.. +++.+++|+++|||+++|+++|++|+|+++++|+|+|+++||
T Consensus 95 ~t~rd~q~-----~~~~~~~K~~D~~~p~Gp~i~~---~~~~~l~i~~~vNGe~~Q~~~t~~Mi~~~~~lis~lS~~~tL 166 (429)
T PRK15203 95 VSLPEESF-----YRPAIKAKCRDGFCPIGETVAL---SNVDNLTIYTEINGRPADHWNTADLQRNAAQLLSALSEFATL 166 (429)
T ss_pred eechhhcc-----cCCcccccCCCCCcccCCeEEC---CCccceEEEEEECCEEEecCCHHHcCCCHHHHHHHHhCCCCc
Confidence 58999885 4689999999999999999843 467899999999999999999999999999999999999999
Q ss_pred CCCcEEEcCCCCCcccccCCCcc
Q psy9188 81 EPYDLILTGKAGFILMFQQSPHS 103 (104)
Q Consensus 81 ~pGDvi~tGTp~g~~~~~~g~~~ 103 (104)
+|||||+||||+|++.+++||+.
T Consensus 167 ~pGDvI~TGTP~g~~~l~~GD~v 189 (429)
T PRK15203 167 NPGDAILLGTPQARVEIQPGDRV 189 (429)
T ss_pred CCCCEEEcCCCCCceECCCCCEE
Confidence 99999999999999999999975
No 5
>PRK10691 hypothetical protein; Provisional
Probab=99.97 E-value=2.8e-32 Score=199.45 Aligned_cols=103 Identities=38% Similarity=0.630 Sum_probs=95.8
Q ss_pred CchhhhhhHHHhCCCCCcCcccCCCCcccCCcccCCCC-CCCCCeeEEEEECCEEEeecccCccCCCHHHHHHHHHcCCc
Q psy9188 1 MLFLFLQKHAREHGLPWTVGKSFDTACPVSDFIPEHEI-KDPDDVPLWLKVNGELRQKSTTGDMLFKTGDLISYISQHMT 79 (104)
Q Consensus 1 ~TaRd~q~~~~~~~~~~~~~K~~d~~~p~Gp~i~~~~~-~~~~~l~i~~~vNG~~~q~~~t~~m~~~~~~lia~~s~~~t 79 (104)
+|+||+|.+.+..+.+|.++|+||+|+|+|||++.+++ .|+.+++++++|||+++|++++++|+|+++++|+++|++++
T Consensus 102 vt~r~~q~~~~~~~~~~~~~K~~D~~~~~gp~i~~~~~~~d~~~l~i~l~vNG~~~q~g~~~~mi~~~~~lia~ls~~~t 181 (219)
T PRK10691 102 LTLRDLQGKMKKAGQPWEKAKAFDNSCPISGFIPVAEFTGDPQNTTLGLSVNGEVRQQGNTADMIHPIVPLIAYMSRFFT 181 (219)
T ss_pred eEhhhhhhhhccccCCccccccCCCCcCcCCcEEchhccCCccccEEEEEECCEEEEecCHHHhccCHHHHHHHHhcCCc
Confidence 68999999876667899999999999999999854554 58899999999999999999999999999999999999999
Q ss_pred cCCCcEEEcCCCCCcccccCCCcc
Q psy9188 80 LEPYDLILTGKAGFILMFQQSPHS 103 (104)
Q Consensus 80 L~pGDvi~tGTp~g~~~~~~g~~~ 103 (104)
|+|||||+||||.+++++++||++
T Consensus 182 L~aGDvI~TGTp~g~~~l~~GD~v 205 (219)
T PRK10691 182 LRAGDVVLTGTPEGVGPLQSGDEL 205 (219)
T ss_pred cCCCCEEEcCCCCCCEECCCCCEE
Confidence 999999999999999999999975
No 6
>PRK15203 4-hydroxyphenylacetate degradation bifunctional isomerase/decarboxylase; Provisional
Probab=99.96 E-value=3.2e-29 Score=198.26 Aligned_cols=100 Identities=34% Similarity=0.401 Sum_probs=94.4
Q ss_pred CchhhhhhHHHhCCCCCcCcccCCCCcccCCcc-cCCCCCCCCCeeEEEEECCEEEeecccCccCCCHHHHHHHHHcCCc
Q psy9188 1 MLFLFLQKHAREHGLPWTVGKSFDTACPVSDFI-PEHEIKDPDDVPLWLKVNGELRQKSTTGDMLFKTGDLISYISQHMT 79 (104)
Q Consensus 1 ~TaRd~q~~~~~~~~~~~~~K~~d~~~p~Gp~i-~~~~~~~~~~l~i~~~vNG~~~q~~~t~~m~~~~~~lia~~s~~~t 79 (104)
+|+||+|.. ...+|.++|+||+|+|+|||+ +.+++.++.+++++++|||+++|++++++|+|+++++|+++|++++
T Consensus 308 ~t~rd~q~~---~~~~w~~~K~~d~~~plGp~~v~~d~~~d~~~l~i~l~vNG~~vq~g~t~~m~~~v~~li~~ls~~~t 384 (429)
T PRK15203 308 YAIRDYLEN---YYRPNLRVKSRDGLTPILSTIVPKEAIPDPHNLTLRTFVNGELRQQGTTADLIFSVPFLIAYLSEFMT 384 (429)
T ss_pred ccchhhhhh---hcCCceEeccCCCCcCCCCCEeChhhcCCccceEEEEEECCEEEEeeCHHHhccCHHHHHHHHhcCCC
Confidence 589999964 467899999999999999986 7888889999999999999999999999999999999999999999
Q ss_pred cCCCcEEEcCCCCCcccccCCCcc
Q psy9188 80 LEPYDLILTGKAGFILMFQQSPHS 103 (104)
Q Consensus 80 L~pGDvi~tGTp~g~~~~~~g~~~ 103 (104)
|+|||+|+||||.|++++++||++
T Consensus 385 L~aGDvI~TGTp~g~~~l~pGD~v 408 (429)
T PRK15203 385 LNPGDMIATGTPKGLSDVVPGDEV 408 (429)
T ss_pred cCCCCEEEeCCCCCCeECCCCCEE
Confidence 999999999999999999999975
No 7
>PRK12764 hypothetical protein; Provisional
Probab=99.96 E-value=3.8e-29 Score=200.86 Aligned_cols=99 Identities=28% Similarity=0.279 Sum_probs=92.2
Q ss_pred CchhhhhhHHHhCCCCCcCcccCCCCcccCCcc-cCCCCCCCCCeeEEEEECCEEEeecccCccCCCHHHHHHHHHcCCc
Q psy9188 1 MLFLFLQKHAREHGLPWTVGKSFDTACPVSDFI-PEHEIKDPDDVPLWLKVNGELRQKSTTGDMLFKTGDLISYISQHMT 79 (104)
Q Consensus 1 ~TaRd~q~~~~~~~~~~~~~K~~d~~~p~Gp~i-~~~~~~~~~~l~i~~~vNG~~~q~~~t~~m~~~~~~lia~~s~~~t 79 (104)
+|+||+|..+ ..+|.++|+||+|+|+|||+ +.+++ |+.+++++++|||+++|++++++|+|++++||+|+|+++|
T Consensus 106 vt~rD~~~~d---~~~~~~~K~~Dg~~plGp~iv~~~~~-d~~~l~i~~~vNGe~~Q~g~t~dmi~~v~~LI~~lS~~~t 181 (500)
T PRK12764 106 LGVYDLRYAD---KGSNLRSKGGDGFTPIGPALISARGV-DPAQLRVRTWVNGELVQDDTTEDLLFPFAQLVADLSQLLT 181 (500)
T ss_pred eeeehhhhhh---cCCcccccccCccEecCCCccCcccc-CccceEEEEEECCEEEEeccHHHhcCCHHHHHHHHhcCCC
Confidence 5889999864 24688999999999999996 77887 8999999999999999999999999999999999999999
Q ss_pred cCCCcEEEcCCCCCcccccCCCcc
Q psy9188 80 LEPYDLILTGKAGFILMFQQSPHS 103 (104)
Q Consensus 80 L~pGDvi~tGTp~g~~~~~~g~~~ 103 (104)
|+|||||+||||.|++++++||+.
T Consensus 182 L~pGDvIlTGTp~g~~~l~pGD~v 205 (500)
T PRK12764 182 LEEGDVILTGTPAGSSVAAPGDVV 205 (500)
T ss_pred cCCCCEEEeCCCCCCeecCCCCEE
Confidence 999999999999999999999974
No 8
>TIGR02305 HpaG-N-term 4-hydroxyphenylacetate degradation bifunctional isomerase/decarboxylase, N-terminal subunit. This model represents one of two subunits/domains of the bifunctional isomerase/decarboxylase involved in 4-hydroxyphenylacetate degradation. In E. coli and some other species this enzyme is encoded by a single polypeptide containing both this domain and the closely related C-terminal domain (TIGR02303). In other species such as Pasteurella multocida these domains are found as two separate proteins (usually as tandem genes). Together, these domains carry out the decarboxylation of 5-oxopent-3-ene-1,2,5-tricarboxylic acid (OPET) to 2-hydroxy-2,4-diene-1,7-dioate (HHDD) and the subsequent isomerization to 2-oxohept-3-ene-1,7-dioate (OHED).
Probab=99.94 E-value=1.3e-27 Score=172.86 Aligned_cols=98 Identities=24% Similarity=0.353 Sum_probs=90.1
Q ss_pred CchhhhhhHHHhCCCCCcCcccCCCCcccCCcccCCCCCCCCCeeEEEEECCEEEeecccCccCCCHHHHHHHHHcCCcc
Q psy9188 1 MLFLFLQKHAREHGLPWTVGKSFDTACPVSDFIPEHEIKDPDDVPLWLKVNGELRQKSTTGDMLFKTGDLISYISQHMTL 80 (104)
Q Consensus 1 ~TaRd~q~~~~~~~~~~~~~K~~d~~~p~Gp~i~~~~~~~~~~l~i~~~vNG~~~q~~~t~~m~~~~~~lia~~s~~~tL 80 (104)
+|+|+.+. ..+|.++|+||+++++|||++.+++.|+.++.+++++||+++|++++++|+|++.++|+++|++++|
T Consensus 94 it~~~~~~-----~~~~~~~k~~dg~~~lGp~v~~~~~~d~~~~~~~l~vng~~~~~g~~~~~~~~~~~li~~ls~~~~L 168 (205)
T TIGR02305 94 VSLPEDSY-----YRPAIKAKCRDGFCPIGPEVPLSAIGNPDELTIYTYINGKPAQSNNTSNLVRSAAQLISELSEFMTL 168 (205)
T ss_pred eehhhhhc-----cCcchhhcccCCccccCCcccHHHcCCccccEEEEEECCEEEEeeCHHHhCcCHHHHHHHHhCCCCc
Confidence 35666543 4579999999999999999887788889999999999999999999999999999999999999999
Q ss_pred CCCcEEEcCCCCCcccccCCCcc
Q psy9188 81 EPYDLILTGKAGFILMFQQSPHS 103 (104)
Q Consensus 81 ~pGDvi~tGTp~g~~~~~~g~~~ 103 (104)
+|||||+||||.++.++++||+.
T Consensus 169 ~aGdvI~TGT~~g~~~l~~Gd~v 191 (205)
T TIGR02305 169 NPGDVLLLGTPEARVEVGPGDRV 191 (205)
T ss_pred CCCCEEEeCCCCCCeecCCCCEE
Confidence 99999999999999999999975
No 9
>PLN02856 fumarylacetoacetase
Probab=99.94 E-value=1.9e-27 Score=187.43 Aligned_cols=99 Identities=18% Similarity=0.123 Sum_probs=84.4
Q ss_pred CchhhhhhHHHhCCCCCcCcccCCCCcccCCcc-cCCCCC--------------------CCCCeeEEEEE--------C
Q psy9188 1 MLFLFLQKHAREHGLPWTVGKSFDTACPVSDFI-PEHEIK--------------------DPDDVPLWLKV--------N 51 (104)
Q Consensus 1 ~TaRd~q~~~~~~~~~~~~~K~~d~~~p~Gp~i-~~~~~~--------------------~~~~l~i~~~v--------N 51 (104)
|||||+|.+++ ..++|+++|+||++ +|||| |.++++ +..+++|.++| |
T Consensus 241 ~SARDiQ~wE~-~plgpf~gKsF~t~--igPwIVt~dal~p~r~~~~~~dp~~l~yl~~~~~~~~~i~l~v~v~~nG~~n 317 (424)
T PLN02856 241 WSARDIQKWEY-VPLGPFLGKSFATT--ISPWIVTLDALEPFRCDAPAQDPPPLPYLAEKNRKSYDISLEVAIKPAGQSK 317 (424)
T ss_pred chhhhhhhhhc-ccCCcccccCCCCC--CcCeEEcccccccccccccccCcccccccccccccceeEEEEEEEeeCCccc
Confidence 79999999876 45667899999975 99997 778773 22356677777 6
Q ss_pred CEEEeecccCccCCCHHHHHHH-HHcCCccCCCcEEEcCCCCCcccccCCCc
Q psy9188 52 GELRQKSTTGDMLFKTGDLISY-ISQHMTLEPYDLILTGKAGFILMFQQSPH 102 (104)
Q Consensus 52 G~~~q~~~t~~m~~~~~~lia~-~s~~~tL~pGDvi~tGTp~g~~~~~~g~~ 102 (104)
|+++|++++++|+|+++|+|+| +|++++|+|||||+||||+|+++.+.|+.
T Consensus 318 g~~~q~~nt~~M~ws~~qlIah~~s~g~tL~pGDLi~TGTpsG~~~~~~G~l 369 (424)
T PLN02856 318 ASVVCRSNFKHLYWTLAQQLAHHTVNGCNLRPGDLLGSGTISGPEPGSLGCL 369 (424)
T ss_pred ceeEEcCCHHHcCCCHHHHHHHHHhCCeecCCCCEEEeCCCCCCccCCCCCE
Confidence 6899999999999999999997 58999999999999999999998888764
No 10
>TIGR01266 fum_ac_acetase fumarylacetoacetase. This enzyme catalyzes the final step in the breakdown of tyrosine or phenylalanine to fumarate and acetoacetate.
Probab=99.94 E-value=4.9e-27 Score=184.81 Aligned_cols=99 Identities=21% Similarity=0.158 Sum_probs=86.9
Q ss_pred CchhhhhhHHHhCCCCCcCcccCCCCcccCCcc-cCCCCCC---------C-------------CCeeEEEEECCE----
Q psy9188 1 MLFLFLQKHAREHGLPWTVGKSFDTACPVSDFI-PEHEIKD---------P-------------DDVPLWLKVNGE---- 53 (104)
Q Consensus 1 ~TaRd~q~~~~~~~~~~~~~K~~d~~~p~Gp~i-~~~~~~~---------~-------------~~l~i~~~vNG~---- 53 (104)
|||||+|.+++. .+.|+++|+||++ +|||| |++++++ + .+++++++|||+
T Consensus 233 ~SARDiQ~wE~~-plgpf~~KsF~ts--igPwIVT~daL~p~r~~~~~~dp~pl~yL~~~~~~~~~l~l~v~vnge~~~~ 309 (415)
T TIGR01266 233 WSARDIQAWEYV-PLGPFLAKSFGTT--ISPWVVPIDALEPFRVPNPKQDPKPLPYLCHDAPYTFDINLEVSLKGEGMSE 309 (415)
T ss_pred cchhhhhhhhcc-ccCccccccCCCC--CcCeEeccccccccccccccccccccccccccCCCcceeEEEEEEecCcCcc
Confidence 799999998875 4567899999996 99997 7888741 1 267888899987
Q ss_pred --EEeecccCccCCCHHHHHHHHH-cCCccCCCcEEEcCCCCCcccccCCCc
Q psy9188 54 --LRQKSTTGDMLFKTGDLISYIS-QHMTLEPYDLILTGKAGFILMFQQSPH 102 (104)
Q Consensus 54 --~~q~~~t~~m~~~~~~lia~~s-~~~tL~pGDvi~tGTp~g~~~~~~g~~ 102 (104)
++|++++++|+|+++|+|+|+| ++++|+|||||+||||+|+.+.+.|+.
T Consensus 310 ~~~~q~~~~~~M~ws~~qlIah~S~~g~tL~pGDLi~TGTpsG~~~~~~G~~ 361 (415)
T TIGR01266 310 PATICRSNFKHMYWTMLQQLAHHSVNGCNLRPGDLLGSGTISGSEPGSFGSM 361 (415)
T ss_pred cceEEcCCHHhcCcCHHHHHHHHhcCCcccCCCCEEEeCCCCCCcccCCCcE
Confidence 7999999999999999999999 689999999999999999999888874
No 11
>PF01557 FAA_hydrolase: Fumarylacetoacetate (FAA) hydrolase family Mutations in Swiss:P16930 cause inherited tyrosinemia type I.; InterPro: IPR002529 Fumarylacetoacetase (3.7.1.2 from EC; also known as fumarylacetoacetate hydrolase or FAH) catalyses the hydrolytic cleavage of a carbon-carbon bond in fumarylacetoacetate to yield fumarate and acetoacetate as the final step in phenylalanine and tyrosine degradation []. This is an essential metabolic function in humans, the lack of FAH causing type I tyrosinaemia, which is associated with liver and kidney abnormalities and neurological disorders [, ]. The enzyme mechanism involves a catalytic metal ion, a Glu/His catalytic dyad, and a charged oxyanion hole []. FAH folds into two domains: an N-terminal domain SH3-like beta-barrel, and a C-terminal with an unusual fold consisting of three layers of beta-sheet structures []. This entry represents the C-terminal domain of fumarylacetoacetase, as well as other domains that share a homologous sequence, including: 5-carboxymethyl-2-hydroxymuconate delta-isomerase (CHM isomerase; 5.3.3.10 from EC), which catalyses the conversion of 5-carboxymethyl-2-hydroxymuconate to 5-carboxy-2-oxohept-3-enedioate []. 5-oxopent-3-ene-1,2,5-tricarboxylate decarboxylase (OPET decarboxylase; 4.1.1.68 from EC), which catalyses the conversion of 5-oxopent-3-ene-1,2,5-tricarboxylate to 2-oxohept-3-enedioate and carbon dioxide. Bifunctional enzyme HpcE (OPET decarboxylase 4.1.1.68 from EC/HHDD isomerase 5.3.3.10 from EC), which is a duplication consisting of a tandem repeat of two FAH C-terminal-like domains. This enzyme is responsible for the degradation of 4-hydroxyphenylacetate, a product of tyrosine and phenylalanine metabolism also released by lignin catabolism []. ; GO: 0003824 catalytic activity, 0008152 metabolic process; PDB: 1SAW_B 3LZK_B 3S52_B 2WQT_Q 1SV6_C 2DFU_B 1WZO_D 3QDF_A 1GTT_B 1I7O_C ....
Probab=99.93 E-value=2.2e-27 Score=172.16 Aligned_cols=102 Identities=35% Similarity=0.524 Sum_probs=91.3
Q ss_pred CchhhhhhHHHhCCCCCcCcccCCCCcccCCcc-cCCCCCCCCCeeEEEEECCEEEeecccCccCCCHHHHHHHHHcCCc
Q psy9188 1 MLFLFLQKHAREHGLPWTVGKSFDTACPVSDFI-PEHEIKDPDDVPLWLKVNGELRQKSTTGDMLFKTGDLISYISQHMT 79 (104)
Q Consensus 1 ~TaRd~q~~~~~~~~~~~~~K~~d~~~p~Gp~i-~~~~~~~~~~l~i~~~vNG~~~q~~~t~~m~~~~~~lia~~s~~~t 79 (104)
+|+|++|...+ .+.+|..+|+||+++++|||+ +++++.++.++++++++||+++|++++++|+|++.++|+|+|+.++
T Consensus 94 ~~~r~~~~~~~-~~~~~~~~k~~~~~~~~Gp~~v~~~~~~~~~~~~~~l~vnG~~~~~~~~~~~~~~~~~ll~~ls~~~~ 172 (218)
T PF01557_consen 94 VTARDLQWRER-PGLPWIADKSFDGSLVLGPWVVPPDELPDLRDLRLRLRVNGEVVQSGSTSDMLGDPAELLAWLSRGLT 172 (218)
T ss_dssp EEEHHHHHHHH-HTHSSHHHHSSTTCEEEEEEEEEHSSHSGTTSEEEEEEETTEEEEEEEGGGBSSSHHHHHHHHHTTS-
T ss_pred cchhhhhhhhh-cccchhhccCcCcceeecccccccccccCcceEEEEEEECCEEEEeccchhHHhhHHHHHHHHhCCCC
Confidence 47899998876 566899999999999999997 8888888899999999999999999999999999999999999999
Q ss_pred cCCCcEEEcCCCCCcc------cccCCCcc
Q psy9188 80 LEPYDLILTGKAGFIL------MFQQSPHS 103 (104)
Q Consensus 80 L~pGDvi~tGTp~g~~------~~~~g~~~ 103 (104)
|+|||+|+||||.|+. .+++||++
T Consensus 173 L~aGdvI~TGt~~G~~~~~~~~~l~~Gd~v 202 (218)
T PF01557_consen 173 LRAGDVILTGTPTGVGARPPPVPLQPGDRV 202 (218)
T ss_dssp B-TTEEEEEEESSTSEGSSCCEEEBTT-EE
T ss_pred CCcceEEEcCCcCCCCcccccccCCCCcEE
Confidence 9999999999998754 89999974
No 12
>TIGR03220 catechol_dmpE 2-oxopent-4-enoate hydratase. Members of this protein family are 2-oxopent-4-enoate hydratase, which is also called 2-hydroxypent-2,4-dienoate hydratase. It is closely related to another gene found in the same operon, 4-oxalocrotonate decarboxylase, with which it interacts closely.
Probab=99.69 E-value=5.8e-17 Score=120.78 Aligned_cols=88 Identities=15% Similarity=0.205 Sum_probs=68.3
Q ss_pred CCCcCcccCCCC-cccCCcc-cCCCCCCCCCeeEEEEECCEEEeecccCccCCCHHHHHHHHHc-----CCccCCCcEEE
Q psy9188 15 LPWTVGKSFDTA-CPVSDFI-PEHEIKDPDDVPLWLKVNGELRQKSTTGDMLFKTGDLISYISQ-----HMTLEPYDLIL 87 (104)
Q Consensus 15 ~~~~~~K~~d~~-~p~Gp~i-~~~~~~~~~~l~i~~~vNG~~~q~~~t~~m~~~~~~lia~~s~-----~~tL~pGDvi~ 87 (104)
.+|..+|+.-+. ..+|+.+ .++.+ +..++.+++++||+++|++++++|+++|.++|+|+++ +++|+|||+|+
T Consensus 147 ~~~~~Ad~~~~~~~V~g~~~~~~~~~-~l~~~~~~l~vnG~~~~~g~~~~~lg~p~~~l~~L~~~l~~~g~~L~aGdiV~ 225 (255)
T TIGR03220 147 IQDTVADNASCGVFVLGDTRVDPRKL-DLALCGMVLEKNGEIVSTGAGAAALGSPVNAVAWLANTLGRLGIPLKAGEVIL 225 (255)
T ss_pred ccceeeecCCcceEEECCCcCCcccc-ChhhCceEEEECCEEEeecchhhccCCHHHHHHHHHHHHHHcCCCCCCCCEEE
Confidence 356677762111 2223332 12222 4455667899999999999999999999999999997 88999999999
Q ss_pred cCCCCCcccccCCCcc
Q psy9188 88 TGKAGFILMFQQSPHS 103 (104)
Q Consensus 88 tGTp~g~~~~~~g~~~ 103 (104)
||||.++.++++||++
T Consensus 226 TGt~~g~~~v~~Gd~v 241 (255)
T TIGR03220 226 SGSLAALVPVKAGDNL 241 (255)
T ss_pred CCCCCCCeeCCCCCEE
Confidence 9999999999999975
No 13
>PRK11342 mhpD 2-keto-4-pentenoate hydratase; Provisional
Probab=99.63 E-value=4.8e-16 Score=116.26 Aligned_cols=76 Identities=14% Similarity=0.166 Sum_probs=67.1
Q ss_pred cccCCcc-cCCCCCCCCCeeEEEEECCEEEeecccCccCCCHHHHHHHHHcC-----CccCCCcEEEcCCCCCcccccCC
Q psy9188 27 CPVSDFI-PEHEIKDPDDVPLWLKVNGELRQKSTTGDMLFKTGDLISYISQH-----MTLEPYDLILTGKAGFILMFQQS 100 (104)
Q Consensus 27 ~p~Gp~i-~~~~~~~~~~l~i~~~vNG~~~q~~~t~~m~~~~~~lia~~s~~-----~tL~pGDvi~tGTp~g~~~~~~g 100 (104)
..+|+.+ ..+++ |+.++.+++++||+++|++++++|+++|.++|+|+++. ++|+|||||+||||.++.++++|
T Consensus 164 ~VlG~~~~~~~~~-d~~~~~~~l~vng~~~q~g~~~~~lg~p~~~l~~L~~~l~~~g~~L~aGdvV~TGt~~~~~~l~~G 242 (262)
T PRK11342 164 YVIGGPAQRPAGL-DLKNCAMKMTRNNEEVSSGRGSECLGHPLNAAVWLARKMASLGEPLRAGDIILTGALGPMVAVNAG 242 (262)
T ss_pred EEECCCcCCcccC-ChhhCEEEEEECCEEEEEEcHHHhccCHHHHHHHHHHHHHHcCCCcCCCCEEEcCCCCCCeeCCCC
Confidence 3567654 44555 78899999999999999999999999999999999875 69999999999999999999999
Q ss_pred Ccc
Q psy9188 101 PHS 103 (104)
Q Consensus 101 ~~~ 103 (104)
|++
T Consensus 243 d~v 245 (262)
T PRK11342 243 DRF 245 (262)
T ss_pred CEE
Confidence 975
No 14
>TIGR02312 HpaH 2-oxo-hepta-3-ene-1,7-dioic acid hydratase. This model represents the enzyme which hydrates the double bond of 2-oxo-hepta-3-ene-1,7-dioic acid to form 4-hydroxy-2-oxo-heptane-1,7-dioic acid in the catabolism of 4-hydroxyphenylacetic acid. The gene for this enzyme is generally found adjacent to other genes of this pathway in an apparent operon.
Probab=99.41 E-value=3e-13 Score=101.62 Aligned_cols=75 Identities=15% Similarity=0.124 Sum_probs=66.4
Q ss_pred ccCCcc-cCCCCCCCCCeeEEEEECCEEEeecccCccCCCHHHHHHHHHc-----CCccCCCcEEEcCCCCCcccccCCC
Q psy9188 28 PVSDFI-PEHEIKDPDDVPLWLKVNGELRQKSTTGDMLFKTGDLISYISQ-----HMTLEPYDLILTGKAGFILMFQQSP 101 (104)
Q Consensus 28 p~Gp~i-~~~~~~~~~~l~i~~~vNG~~~q~~~t~~m~~~~~~lia~~s~-----~~tL~pGDvi~tGTp~g~~~~~~g~ 101 (104)
.+|+.+ .++++ |+.++.+++++||+++++|++++|+.+|.+.++|+++ ..+|+|||+|+|||+.++.++++||
T Consensus 172 v~G~~~~~~~~~-dl~~~~~~l~~nG~~~~~g~~~~~lg~P~~al~wL~~~l~~~G~~L~aGdiV~TGs~~~~~~v~~G~ 250 (267)
T TIGR02312 172 VLGGRPVRPDAL-DLRWVGAILYRNGVVEETGLAAGVLNHPANGVAWLANKLAPWGETLEAGQVVLAGSFTRPVAARSGD 250 (267)
T ss_pred EECCCCCCcccc-ChhhcccEEEECCEEEEEechhhhcCCHHHHHHHHHHHHHHcCCCCCCCCEEECCCCCCceecCCCC
Confidence 678754 45555 7788899999999999999999999999999999975 4599999999999999999999999
Q ss_pred cc
Q psy9188 102 HS 103 (104)
Q Consensus 102 ~~ 103 (104)
++
T Consensus 251 ~~ 252 (267)
T TIGR02312 251 TF 252 (267)
T ss_pred EE
Confidence 75
No 15
>KOG2843|consensus
Probab=99.36 E-value=6.4e-13 Score=101.11 Aligned_cols=93 Identities=25% Similarity=0.217 Sum_probs=73.8
Q ss_pred CchhhhhhHHHhCCCCCcCcccCCCCcccCCcc-cCCCCC--------------------CCCCeeEEEEEC----C---
Q psy9188 1 MLFLFLQKHAREHGLPWTVGKSFDTACPVSDFI-PEHEIK--------------------DPDDVPLWLKVN----G--- 52 (104)
Q Consensus 1 ~TaRd~q~~~~~~~~~~~~~K~~d~~~p~Gp~i-~~~~~~--------------------~~~~l~i~~~vN----G--- 52 (104)
|||||+|++|..+-+| +.+|+|-+ .++||+ +.+.+. +|.+++|.+.|| |
T Consensus 235 WSARDIQkWEYVPLGP-FlaKsfgT--TvSPWVVp~~AL~Pf~v~Np~QdP~plpYL~hd~PftfDINL~Vslkpeg~~~ 311 (420)
T KOG2843|consen 235 WSARDIQKWEYVPLGP-FLAKSFGT--TVSPWVVPTAALKPFVVDNPPQDPEPLPYLRHDIPFTFDINLEVSLKPEGQNE 311 (420)
T ss_pred cchhhcccceeecccc-hhhhhccc--ccccceeeHhhcCccccCCCCCCCCCCcccccCCCceeeeeeEEEeccCCccc
Confidence 7999999998876555 79999976 799996 544442 234555555552 2
Q ss_pred -EEEeecccCccCCCHHHHHHHHH-cCCccCCCcEEEcCCCCCccc
Q psy9188 53 -ELRQKSTTGDMLFKTGDLISYIS-QHMTLEPYDLILTGKAGFILM 96 (104)
Q Consensus 53 -~~~q~~~t~~m~~~~~~lia~~s-~~~tL~pGDvi~tGTp~g~~~ 96 (104)
..++..|...|+|++-+.+++.+ ..|.|+|||++.+||.+|..+
T Consensus 312 ~a~icKsNFKhlYWT~lQQlaHHtVnGCNLRpGDLlaSGTiSGpep 357 (420)
T KOG2843|consen 312 DALICKSNFKHLYWTPLQQLAHHTVNGCNLRPGDLLASGTISGPEP 357 (420)
T ss_pred cceeecccchhhhhhHHHHhhhcccccccCCccceeccccccCCCC
Confidence 47788999999999999999986 689999999999999998754
No 16
>TIGR03218 catechol_dmpH 4-oxalocrotonate decarboxylase. Members of this protein family are 4-oxalocrotonate decarboxylase. Note that this protein, as characterized (indirectly) in Pseudomonas sp. strain CF600, was inactive except when coexpressed with DmpE, 2-oxopent-4-enoate hydratase, a homologous protein from the same operon. Both of these enzymes are active in the degradation of catechol, a common intermediate in the degradation of aromatic compounds such as benzoate, toluene, phenol, dimethylphenol (dmp), salicylate, etc.
Probab=98.90 E-value=5.7e-09 Score=78.35 Aligned_cols=76 Identities=9% Similarity=0.119 Sum_probs=66.5
Q ss_pred cccCCcc-cCCCCCCCCCeeEEEEECCEEEeecccCccCCCHHHHHHHHH-----cCCccCCCcEEEcCCCCCcccccCC
Q psy9188 27 CPVSDFI-PEHEIKDPDDVPLWLKVNGELRQKSTTGDMLFKTGDLISYIS-----QHMTLEPYDLILTGKAGFILMFQQS 100 (104)
Q Consensus 27 ~p~Gp~i-~~~~~~~~~~l~i~~~vNG~~~q~~~t~~m~~~~~~lia~~s-----~~~tL~pGDvi~tGTp~g~~~~~~g 100 (104)
..+||.+ ..+.+ |+.++.+++++||+++++++.++..-+|.+.++|+. +...|++||+|+||+..+..++.+|
T Consensus 168 ~vlG~~~~~~~~~-dl~~~~~~l~~~g~~v~~g~g~~~lG~P~~al~wL~~~l~~~G~~L~aG~iV~tGs~t~~~~v~~G 246 (263)
T TIGR03218 168 FVTGGRAANVEDL-DLRTLGVVMEKNGEVVAMGAGAAVLGHPAAAVAMLANHLAERGEEIPAGSFIMSGGITEAVAVAPG 246 (263)
T ss_pred EEECCCCCCcccc-CHhhCcEEEEECCEEEEeecccccCCCHHHHHHHHHHHHHHcCCCCCCCCEEECCcCcCceecCCC
Confidence 3678875 34444 788899999999999999999999999999999986 4679999999999999999999999
Q ss_pred Ccc
Q psy9188 101 PHS 103 (104)
Q Consensus 101 ~~~ 103 (104)
+++
T Consensus 247 ~~~ 249 (263)
T TIGR03218 247 DSV 249 (263)
T ss_pred CEE
Confidence 875
No 17
>COG3971 2-keto-4-pentenoate hydratase [Secondary metabolites biosynthesis, transport, and catabolism]
Probab=98.67 E-value=4.5e-08 Score=73.43 Aligned_cols=76 Identities=14% Similarity=0.156 Sum_probs=68.3
Q ss_pred cccCC-cccCCCCCCCCCeeEEEEECCEEEeecccCccCCCHHHHHHHHH-----cCCccCCCcEEEcCCCCCcccccCC
Q psy9188 27 CPVSD-FIPEHEIKDPDDVPLWLKVNGELRQKSTTGDMLFKTGDLISYIS-----QHMTLEPYDLILTGKAGFILMFQQS 100 (104)
Q Consensus 27 ~p~Gp-~i~~~~~~~~~~l~i~~~vNG~~~q~~~t~~m~~~~~~lia~~s-----~~~tL~pGDvi~tGTp~g~~~~~~g 100 (104)
..||| ++.++++ |..++...+++||+.++.+..+..+.+|..-++|++ ...+|+.||||+||.-.+..+.++|
T Consensus 166 ~ViG~~~~~~~~l-d~~~~~~~l~rng~~~e~g~~aavLghP~~a~~wLAn~~a~~G~~Lk~G~IVl~Gs~t~~v~~~~g 244 (264)
T COG3971 166 FVIGGRAVKPDDL-DLRNVGATLYRNGVEEETGVGAAVLGHPAAALAWLANKLAAYGVPLKAGDIVLTGSFTGPVPARPG 244 (264)
T ss_pred eEECCCCCCchhh-hhhhccceeeecCEEEEeeechhhcCCcHHHHHHHHHHHHHcCCCcccCcEEecCccCccccCCCC
Confidence 36786 5666666 889999999999999999999999999999999987 4899999999999999999999999
Q ss_pred Ccc
Q psy9188 101 PHS 103 (104)
Q Consensus 101 ~~~ 103 (104)
|+.
T Consensus 245 d~~ 247 (264)
T COG3971 245 DTF 247 (264)
T ss_pred CEE
Confidence 974
No 18
>COG3970 Fumarylacetoacetate (FAA) hydrolase family protein [General function prediction only]
Probab=98.30 E-value=1.1e-06 Score=67.76 Aligned_cols=89 Identities=16% Similarity=0.151 Sum_probs=69.0
Q ss_pred CchhhhhhHHHhCCCCCcCcccCCCCcccCCcc-cCCCCCCC---CCeeEEEEECCEEE--e--ecccCccCCCHHHHHH
Q psy9188 1 MLFLFLQKHAREHGLPWTVGKSFDTACPVSDFI-PEHEIKDP---DDVPLWLKVNGELR--Q--KSTTGDMLFKTGDLIS 72 (104)
Q Consensus 1 ~TaRd~q~~~~~~~~~~~~~K~~d~~~p~Gp~i-~~~~~~~~---~~l~i~~~vNG~~~--q--~~~t~~m~~~~~~lia 72 (104)
+++||+..+ +.+...++|....+|.+||+| .-||.-.+ ...+|++.|.|+-. - ..+++.|-.++.+++.
T Consensus 221 VnlRD~Egr---saLlL~kaKdnnasCaiGPfIrlfDe~f~~~dv~~a~vtLkv~gedgf~l~G~snm~~isR~p~~l~~ 297 (379)
T COG3970 221 VNLRDFEGR---SALLLSKAKDNNASCAIGPFIRLFDETFTIDDVKSAEVTLKVTGEDGFFLEGSSNMAEISRSPEELVI 297 (379)
T ss_pred ccccccccc---cchhcccccccCccccccceEEeecCCCChhhhhhceEEEEEEccCceEEeccccHHhhccCHHHHHH
Confidence 468999876 667778999999999999998 66665444 45559999988633 2 3457899998887665
Q ss_pred H-HHcCCccCCCcEEEcCCCC
Q psy9188 73 Y-ISQHMTLEPYDLILTGKAG 92 (104)
Q Consensus 73 ~-~s~~~tL~pGDvi~tGTp~ 92 (104)
. +-+.....-|-++++||.-
T Consensus 298 Q~l~~~hqyPDG~~lflGTmf 318 (379)
T COG3970 298 QALNRDHQYPDGFALFLGTMF 318 (379)
T ss_pred HHhccCCCCCCceeEEeeeee
Confidence 5 5678899999999999953
No 19
>PF11010 DUF2848: Protein of unknown function (DUF2848); InterPro: IPR021269 This bacterial family of proteins has no known function.
Probab=97.23 E-value=0.00041 Score=50.32 Aligned_cols=76 Identities=17% Similarity=0.180 Sum_probs=59.3
Q ss_pred cccCCcc-cCCCCC-CCCCeeEEEEE--CC--EEEeecccCccCCCHHHHHHHHH-cCCccCCCcEEEcCCCCCcccccC
Q psy9188 27 CPVSDFI-PEHEIK-DPDDVPLWLKV--NG--ELRQKSTTGDMLFKTGDLISYIS-QHMTLEPYDLILTGKAGFILMFQQ 99 (104)
Q Consensus 27 ~p~Gp~i-~~~~~~-~~~~l~i~~~v--NG--~~~q~~~t~~m~~~~~~lia~~s-~~~tL~pGDvi~tGTp~g~~~~~~ 99 (104)
.|+++-+ ..+++. .+..+.++.++ || ++.|+|+.++| .++.++++.+. ....+.+|-++++||.+-.+.+++
T Consensus 95 Kpva~~~W~~~dV~dhWD~l~Lrsw~~~dg~~~lYQeGtla~l-l~p~~ll~~~~~~~~~~~~g~~m~~GT~~~~g~~~~ 173 (194)
T PF11010_consen 95 KPVAREAWRLDDVADHWDELELRSWITEDGERVLYQEGTLAAL-LPPADLLERLGEGRGDLPEGTAMFCGTVPAIGGIRP 173 (194)
T ss_pred ccchhhcCcHHHHHhhhhheeEEEEEeeCCCEEEEeecchhhc-CCHHHHHHhhhccCCCCCCCEEEEEeccccccCccc
Confidence 5788732 334443 56778888884 44 68899999866 58999999999 678899999999999888888888
Q ss_pred CCcc
Q psy9188 100 SPHS 103 (104)
Q Consensus 100 g~~~ 103 (104)
++++
T Consensus 174 a~~f 177 (194)
T PF11010_consen 174 ADRF 177 (194)
T ss_pred cceE
Confidence 8764
No 20
>PRK06488 sulfur carrier protein ThiS; Validated
Probab=86.11 E-value=2 Score=25.14 Aligned_cols=24 Identities=29% Similarity=0.256 Sum_probs=17.2
Q ss_pred EEEEECCEEEeecccCccCCCHHHHHHHH
Q psy9188 46 LWLKVNGELRQKSTTGDMLFKTGDLISYI 74 (104)
Q Consensus 46 i~~~vNG~~~q~~~t~~m~~~~~~lia~~ 74 (104)
|.+.+||+.++- ..+ ++.++++++
T Consensus 1 m~i~~Ng~~~~~---~~~--tl~~Ll~~l 24 (65)
T PRK06488 1 MKLFVNGETLQT---EAT--TLALLLAEL 24 (65)
T ss_pred CEEEECCeEEEc---CcC--cHHHHHHHc
Confidence 467889988886 222 788888876
No 21
>cd01617 DCX Ubiquitin-like domain of DCX. DCX The ubiquitin-like DCX domain is present in tandem within the N-terminal half of the doublecortin protein. Doublecortin is expressed in migrating neurons. Mutations in the gene encoding doublecortin cause lissencephaly in males and 'double-cortex syndrome' in females.
Probab=83.26 E-value=1.5 Score=27.10 Aligned_cols=51 Identities=12% Similarity=0.006 Sum_probs=40.7
Q ss_pred eEEEEECCEEEeecc----cCccCCCHHHHHHHHHcCCccCCCcEEEcCCCCCcc
Q psy9188 45 PLWLKVNGELRQKST----TGDMLFKTGDLISYISQHMTLEPYDLILTGKAGFIL 95 (104)
Q Consensus 45 ~i~~~vNG~~~q~~~----t~~m~~~~~~lia~~s~~~tL~pGDvi~tGTp~g~~ 95 (104)
.|.+..||.....+. ....+-+++.++.++++-+.|.+|-|=---|+.|..
T Consensus 2 ~I~~~rNGD~~~~g~~~~i~~~~~~sfd~lL~~lt~~l~l~~~~Vr~lyt~~g~~ 56 (80)
T cd01617 2 RVVVYRNGDPFFKGVRLLVNRRRFKSFDALLDDLTEKVQLDPGAVRKLYTLDGGH 56 (80)
T ss_pred EEEEEECCCCCCCCEEEEEChhhhCCHHHHHHHHHHHhCCCCCcEEEEEcCCCCe
Confidence 467788888777664 244667999999999999999999988888887743
No 22
>PRK06083 sulfur carrier protein ThiS; Provisional
Probab=78.54 E-value=8.7 Score=24.11 Aligned_cols=32 Identities=16% Similarity=0.165 Sum_probs=23.2
Q ss_pred CCCCeeEEEEECCEEEeecccCccCCCHHHHHHHHH
Q psy9188 40 DPDDVPLWLKVNGELRQKSTTGDMLFKTGDLISYIS 75 (104)
Q Consensus 40 ~~~~l~i~~~vNG~~~q~~~t~~m~~~~~~lia~~s 75 (104)
......|.+.|||+..+-..- -++.++++.+.
T Consensus 13 ~~~~~~m~I~VNG~~~~~~~~----~tl~~LL~~l~ 44 (84)
T PRK06083 13 GAAMVLITISINDQSIQVDIS----SSLAQIIAQLS 44 (84)
T ss_pred CCCCceEEEEECCeEEEcCCC----CcHHHHHHHcC
Confidence 456668999999998876432 25888888764
No 23
>KOG1379|consensus
Probab=77.68 E-value=1.3 Score=34.59 Aligned_cols=53 Identities=19% Similarity=0.235 Sum_probs=41.8
Q ss_pred CCCCeeEEEEECCEEEeecccCccCCCHHHHHHHH--------H--------cCCccCCCcEEEcCCCC
Q psy9188 40 DPDDVPLWLKVNGELRQKSTTGDMLFKTGDLISYI--------S--------QHMTLEPYDLILTGKAG 92 (104)
Q Consensus 40 ~~~~l~i~~~vNG~~~q~~~t~~m~~~~~~lia~~--------s--------~~~tL~pGDvi~tGTp~ 92 (104)
+..+-..-+-.||+++++..-+...|..+..|+-. + -.+.|++||||+.+|-+
T Consensus 187 NLGDSGF~VvR~G~vv~~S~~Q~H~FN~PyQLs~~p~~~~~~~~d~p~~ad~~~~~v~~GDvIilATDG 255 (330)
T KOG1379|consen 187 NLGDSGFLVVREGKVVFRSPEQQHYFNTPYQLSSPPEGYSSYISDVPDSADVTSFDVQKGDVIILATDG 255 (330)
T ss_pred eccCcceEEEECCEEEEcCchheeccCCceeeccCCccccccccCCccccceEEEeccCCCEEEEeccc
Confidence 45556677888999999999988888887665543 2 25889999999999964
No 24
>PRK08053 sulfur carrier protein ThiS; Provisional
Probab=75.07 E-value=8.2 Score=22.66 Aligned_cols=24 Identities=13% Similarity=0.101 Sum_probs=12.0
Q ss_pred EEEECCEEEeecccCccCCCHHHHHHHH
Q psy9188 47 WLKVNGELRQKSTTGDMLFKTGDLISYI 74 (104)
Q Consensus 47 ~~~vNG~~~q~~~t~~m~~~~~~lia~~ 74 (104)
.+.|||+.++-... -++.+++..+
T Consensus 2 ~i~vNg~~~~~~~~----~tl~~ll~~l 25 (66)
T PRK08053 2 QILFNDQPMQCAAG----QTVHELLEQL 25 (66)
T ss_pred EEEECCeEEEcCCC----CCHHHHHHHc
Confidence 45666666654221 1355555544
No 25
>PRK06944 sulfur carrier protein ThiS; Provisional
Probab=75.06 E-value=11 Score=21.71 Aligned_cols=24 Identities=8% Similarity=0.032 Sum_probs=12.6
Q ss_pred EEEECCEEEeecccCccCCCHHHHHHHH
Q psy9188 47 WLKVNGELRQKSTTGDMLFKTGDLISYI 74 (104)
Q Consensus 47 ~~~vNG~~~q~~~t~~m~~~~~~lia~~ 74 (104)
.+.|||+.++-... -++.++++.+
T Consensus 2 ~i~vNg~~~~~~~~----~tl~~ll~~l 25 (65)
T PRK06944 2 DIQLNQQTLSLPDG----ATVADALAAY 25 (65)
T ss_pred EEEECCEEEECCCC----CcHHHHHHhh
Confidence 46677776654221 1455555544
No 26
>PRK07440 hypothetical protein; Provisional
Probab=70.05 E-value=14 Score=22.10 Aligned_cols=26 Identities=38% Similarity=0.524 Sum_probs=17.1
Q ss_pred eEEEEECCEEEeecccCccCCCHHHHHHHH
Q psy9188 45 PLWLKVNGELRQKSTTGDMLFKTGDLISYI 74 (104)
Q Consensus 45 ~i~~~vNG~~~q~~~t~~m~~~~~~lia~~ 74 (104)
.|.+.+||+.++-... -++.+++..+
T Consensus 4 ~m~i~vNG~~~~~~~~----~tl~~lL~~l 29 (70)
T PRK07440 4 PITLQVNGETRTCSSG----TSLPDLLQQL 29 (70)
T ss_pred ceEEEECCEEEEcCCC----CCHHHHHHHc
Confidence 4788889887664321 2477777765
No 27
>PRK05863 sulfur carrier protein ThiS; Provisional
Probab=66.53 E-value=15 Score=21.54 Aligned_cols=25 Identities=16% Similarity=0.191 Sum_probs=14.3
Q ss_pred EEEEECCEEEeecccCccCCCHHHHHHHH
Q psy9188 46 LWLKVNGELRQKSTTGDMLFKTGDLISYI 74 (104)
Q Consensus 46 i~~~vNG~~~q~~~t~~m~~~~~~lia~~ 74 (104)
|.+.|||+.++-..- -++.+++..+
T Consensus 1 m~i~vNG~~~~~~~~----~tl~~ll~~l 25 (65)
T PRK05863 1 MIVVVNEEQVEVDEQ----TTVAALLDSL 25 (65)
T ss_pred CEEEECCEEEEcCCC----CcHHHHHHHc
Confidence 356777777765421 2466666654
No 28
>PRK07696 sulfur carrier protein ThiS; Provisional
Probab=65.62 E-value=15 Score=21.77 Aligned_cols=26 Identities=19% Similarity=0.377 Sum_probs=13.4
Q ss_pred EEEEECCEEEeecccCccCCCHHHHHHHH
Q psy9188 46 LWLKVNGELRQKSTTGDMLFKTGDLISYI 74 (104)
Q Consensus 46 i~~~vNG~~~q~~~t~~m~~~~~~lia~~ 74 (104)
|.++|||+.++-... .-++.++++.+
T Consensus 1 m~I~vNG~~~~~~~~---~~tv~~lL~~l 26 (67)
T PRK07696 1 MNLKINGNQIEVPES---VKTVAELLTHL 26 (67)
T ss_pred CEEEECCEEEEcCCC---cccHHHHHHHc
Confidence 356677776543211 12366666654
No 29
>PRK05659 sulfur carrier protein ThiS; Validated
Probab=64.90 E-value=17 Score=20.97 Aligned_cols=10 Identities=40% Similarity=0.687 Sum_probs=5.2
Q ss_pred EEEECCEEEe
Q psy9188 47 WLKVNGELRQ 56 (104)
Q Consensus 47 ~~~vNG~~~q 56 (104)
.+++||+.++
T Consensus 2 ~i~vNG~~~~ 11 (66)
T PRK05659 2 NIQLNGEPRE 11 (66)
T ss_pred EEEECCeEEE
Confidence 4555665543
No 30
>cd00565 ThiS ThiaminS ubiquitin-like sulfur carrier protein. ThiS (ThiaminS) is a sulfur carrier protein involved in thiamin biosynthesis in bacteria. The ThiS fold, like those of two closely related proteins MoaD and Urm1, is similar to that of ubiquitin although there is little or no sequence similarity.
Probab=58.14 E-value=24 Score=20.44 Aligned_cols=15 Identities=27% Similarity=-0.008 Sum_probs=8.8
Q ss_pred CccCCCcEEEcCCCC
Q psy9188 78 MTLEPYDLILTGKAG 92 (104)
Q Consensus 78 ~tL~pGDvi~tGTp~ 92 (104)
..|..||-|.-=+|.
T Consensus 48 ~~L~~gD~V~ii~~v 62 (65)
T cd00565 48 TPLQDGDRIEIVTAV 62 (65)
T ss_pred eecCCCCEEEEEEec
Confidence 456777766554443
No 31
>PF03990 DUF348: Domain of unknown function (DUF348) ; InterPro: IPR007137 This domain normally occurs as tandem repeats; however it is found as a single copy in the Saccharomyces cerevisiae (Baker's yeast) DNA-binding nuclear protein YCR593 (P25357 from SWISSPROT).
Probab=52.98 E-value=35 Score=18.30 Aligned_cols=37 Identities=24% Similarity=0.299 Sum_probs=27.6
Q ss_pred EEEEECCEEEeecccCccCCCHHHHHHHHHcCCccCCCcEEE
Q psy9188 46 LWLKVNGELRQKSTTGDMLFKTGDLISYISQHMTLEPYDLIL 87 (104)
Q Consensus 46 i~~~vNG~~~q~~~t~~m~~~~~~lia~~s~~~tL~pGDvi~ 87 (104)
+.+.+||+.+.-..++ -++++++... ...|.+.|.|.
T Consensus 2 Vtv~~dG~~~~v~T~a---~tV~~~L~~~--gI~l~~~D~v~ 38 (43)
T PF03990_consen 2 VTVTVDGKEKTVYTTA---STVGDALKEL--GITLGEEDKVS 38 (43)
T ss_pred EEEEECCEEEEEEeCC---CCHHHHHHhC--CCCCCCCCEEe
Confidence 6788999998554433 3788888764 77889999885
No 32
>PRK15450 signal transduction protein PmrD; Provisional
Probab=52.11 E-value=11 Score=23.91 Aligned_cols=21 Identities=19% Similarity=0.267 Sum_probs=17.5
Q ss_pred HHHHHHHHHcCCccCCCcEEE
Q psy9188 67 TGDLISYISQHMTLEPYDLIL 87 (104)
Q Consensus 67 ~~~lia~~s~~~tL~pGDvi~ 87 (104)
.-++||++-....|.+||++.
T Consensus 27 aLkMIAEv~s~~~l~~gDlLs 47 (85)
T PRK15450 27 ALKMIAEVKSDFALKVGDLLS 47 (85)
T ss_pred hHHHHHHHhhccccCcccccc
Confidence 457888887799999999983
No 33
>PF00498 FHA: FHA domain; InterPro: IPR000253 The forkhead-associated (FHA) domain [] is a phosphopeptide recognition domain found in many regulatory proteins. It displays specificity for phosphothreonine-containing epitopes but will also recognise phosphotyrosine with relatively high affinity. It spans approximately 80-100 amino acid residues folded into an 11-stranded beta sandwich, which sometimes contain small helical insertions between the loops connecting the strands []. To date, genes encoding FHA-containing proteins have been identified in eubacterial and eukaryotic but not archaeal genomes. The domain is present in a diverse range of proteins, such as kinases, phosphatases, kinesins, transcription factors, RNA-binding proteins and metabolic enzymes which partake in many different cellular processes - DNA repair, signal transduction, vesicular transport and protein degradation are just a few examples.; GO: 0005515 protein binding; PDB: 1LGQ_B 1LGP_A 2CSW_A 2PIE_A 3FM8_A 3MDB_B 3GQS_B 1UHT_A 1WLN_A 3POA_A ....
Probab=51.31 E-value=14 Score=21.13 Aligned_cols=15 Identities=27% Similarity=0.377 Sum_probs=9.9
Q ss_pred HcCCccCCCcEEEcC
Q psy9188 75 SQHMTLEPYDLILTG 89 (104)
Q Consensus 75 s~~~tL~pGDvi~tG 89 (104)
.+...|.+||+|--|
T Consensus 54 ~~~~~L~~gd~i~~G 68 (68)
T PF00498_consen 54 GEPVPLKDGDIIRFG 68 (68)
T ss_dssp TSEEEE-TTEEEEET
T ss_pred CCEEECCCCCEEEcC
Confidence 346778888888655
No 34
>TIGR01683 thiS thiamine biosynthesis protein ThiS. This model represents ThiS, a small thiamine-biosynthesis protein related to MoaD, a molybdenum cofactor biosynthesis protein. Both proteins are involved in sulfur transfer. ThiS has a conserved Gly-Gly C-terminus that is modified, in reactions requiring ThiI, ThiF, IscS, and a sulfur atom from Cys, into the thiocarboxylate that provides the sulfur for thiazole biosynthesis.
Probab=50.68 E-value=32 Score=19.83 Aligned_cols=15 Identities=20% Similarity=-0.135 Sum_probs=8.6
Q ss_pred CccCCCcEEEcCCCC
Q psy9188 78 MTLEPYDLILTGKAG 92 (104)
Q Consensus 78 ~tL~pGDvi~tGTp~ 92 (104)
..|..||.|.-=+|.
T Consensus 47 ~~L~~gD~veii~~V 61 (64)
T TIGR01683 47 TILKEGDRIEIVTFV 61 (64)
T ss_pred eecCCCCEEEEEEec
Confidence 346677766554443
No 35
>PRK15175 Vi polysaccharide export protein VexA; Provisional
Probab=46.59 E-value=20 Score=28.35 Aligned_cols=39 Identities=15% Similarity=0.228 Sum_probs=27.5
Q ss_pred eeEEEEECCEEEeecccCccCCCHHHHHHHHHcCCccCCCcEEEcCC
Q psy9188 44 VPLWLKVNGELRQKSTTGDMLFKTGDLISYISQHMTLEPYDLILTGK 90 (104)
Q Consensus 44 l~i~~~vNG~~~q~~~t~~m~~~~~~lia~~s~~~tL~pGDvi~tGT 90 (104)
..+++..+|+.. ..++.+|+.+-+....|+|||+|.---
T Consensus 198 ~~V~l~R~~~~~--------~i~l~dL~~~~~~ni~L~~GDvI~V~~ 236 (355)
T PRK15175 198 MEVHVTRQQHYF--------TARLSDIYQYPGLDIALQPDDRITLRQ 236 (355)
T ss_pred cEEEEEECCEEE--------EEEHHHHhhCCcCCcEeCCCCEEEEcc
Confidence 457777777653 234566776667889999999997653
No 36
>PF02538 Hydantoinase_B: Hydantoinase B/oxoprolinase; InterPro: IPR003692 An appreciable fraction of the sulphur present in mammals occurs in the form of glutathione. The synthesis of glutathione and its utilization take place by the reactions of the gamma-glutamyl cycle, which include those catalysed by gamma-glutamylcysteine and glutathione synthetases, gamma-glutamyl transpeptidase, cysteinylglycinase, gamma-glutamyl cyclotransferease, and 5-oxoprolinase []. This family includes N-methylhydantoinase B which converts hydantoin to N-carbamyl-amino acids, and 5-oxoprolinase 3.5.2.9 from EC which catalyses the formation of L-glutamate from 5-oxo-L-proline. These enzymes are part of the oxoprolinase family and are related to hydantoinase_A.; GO: 0003824 catalytic activity
Probab=43.94 E-value=19 Score=29.74 Aligned_cols=50 Identities=20% Similarity=0.286 Sum_probs=29.5
Q ss_pred ECCEEEeeccc-----CccCCCHHHHHHHHHcCCccCCCcEEEcCCCCCcccccCCCc
Q psy9188 50 VNGELRQKSTT-----GDMLFKTGDLISYISQHMTLEPYDLILTGKAGFILMFQQSPH 102 (104)
Q Consensus 50 vNG~~~q~~~t-----~~m~~~~~~lia~~s~~~tL~pGDvi~tGTp~g~~~~~~g~~ 102 (104)
-+|+.+..+.. ..|-+.+..++++.. -.|+||||+++--|-..+ ....|.
T Consensus 46 ~~G~~va~~~~ip~h~~~~~~~v~~~~~~~~--~~i~~GDv~i~NDPy~Gg-~H~~Dv 100 (527)
T PF02538_consen 46 ADGRLVAQSEGIPVHVGSMPFAVKAILEYFE--DGIRPGDVFITNDPYIGG-THLPDV 100 (527)
T ss_pred CCCCeEEcCCCCcEEEechHHHHHHHHHhcc--CCCCCCCEEEEcCcccCC-ccCCcc
Confidence 36666655431 223344455555322 399999999998885432 455554
No 37
>COG3802 GguC Uncharacterized protein conserved in bacteria [Function unknown]
Probab=43.36 E-value=36 Score=26.30 Aligned_cols=73 Identities=10% Similarity=-0.054 Sum_probs=46.5
Q ss_pred CCCCcCcccCCCCcccCCcccCCCCCCCCCeeEEEEECCEEEeecc----cCccCCCHHHHHHHHHcC-CccCCCcEE
Q psy9188 14 GLPWTVGKSFDTACPVSDFIPEHEIKDPDDVPLWLKVNGELRQKST----TGDMLFKTGDLISYISQH-MTLEPYDLI 86 (104)
Q Consensus 14 ~~~~~~~K~~d~~~p~Gp~i~~~~~~~~~~l~i~~~vNG~~~q~~~----t~~m~~~~~~lia~~s~~-~tL~pGDvi 86 (104)
..+.+.+-|-=..|.+||-+-.-++++--.-.=++..+|++..+-. -++|-++++.|=-+.=++ +-.+||||=
T Consensus 205 ~NYL~LAHSKLR~as~GPEl~vG~lP~~vrG~SRI~Rdg~viwek~FlSGE~nMsHs~aNLEhhHFkY~lfrrpGDvH 282 (333)
T COG3802 205 VNYLYLAHSKLRNASFGPELLVGALPEDVRGVSRILRDGEVIWEKPFLSGEANMSHSIANLEHHHFKYALFRRPGDVH 282 (333)
T ss_pred cceEEeehhhhhccccCcceeeccCchhhcCceeeecCCEEEEecccccCccchhhhhhhhhhhhhhhhhhcCCCceE
Confidence 3445566666678999996422233222222345567998876543 368999999887776664 566899973
No 38
>COG0146 HyuB N-methylhydantoinase B/acetone carboxylase, alpha subunit [Amino acid transport and metabolism / Secondary metabolites biosynthesis, transport, and catabolism]
Probab=40.62 E-value=20 Score=30.19 Aligned_cols=43 Identities=23% Similarity=0.349 Sum_probs=31.5
Q ss_pred CCEEEeecc-----cCccCCCHHHHHHHHHcCCccCCCcEEEcCCCCC
Q psy9188 51 NGELRQKST-----TGDMLFKTGDLISYISQHMTLEPYDLILTGKAGF 93 (104)
Q Consensus 51 NG~~~q~~~-----t~~m~~~~~~lia~~s~~~tL~pGDvi~tGTp~g 93 (104)
+|..+..+. ...|.+++..+|.+.-+...|+||||+++=-|-.
T Consensus 48 ~G~lVA~a~hipVHlGsM~~~vk~~i~~~~~~~~l~pGDV~~~NdPy~ 95 (563)
T COG0146 48 EGNLVAQAPHIPVHLGSMSVSVKAMIRNNKENPDLEPGDVFITNDPYI 95 (563)
T ss_pred CCCeeccCCCCeEEecccHHHHHHHHHhhccCCCCCCCCEEEecCccc
Confidence 555555443 3567788888888776677999999999988753
No 39
>cd03701 IF2_IF5B_II IF2_IF5B_II: This family represents the domain II of prokaryotic Initiation Factor 2 (IF2) and its archeal and eukaryotic homologue aeIF5B. IF2, the largest initiation factor is an essential GTP binding protein. In E. coli three natural forms of IF2 exist in the cell, IF2alpha, IF2beta1, and IF2beta2. Disruption of the eIF5B gene (FUN12) in yeast causes a severe slow-growth phenotype, associated with a defect in translation. eIF5B has a function analogous to prokaryotic IF2 in mediating the joining of the 60S ribosomal subunit. The eIF5B consists of three N-terminal domains (I, II, II) connected by a long helix to domain IV. Domain I is a G domain, domain II and IV are beta-barrels and domain III has a novel alpha-beta-alpha sandwich fold. The G domain and the beta-barrel domain II display a similar structure and arrangement to the homologous domains in EF1A, eEF1A and aeIF2gamma.
Probab=40.52 E-value=19 Score=22.87 Aligned_cols=19 Identities=32% Similarity=0.198 Sum_probs=15.9
Q ss_pred cCCccCCCcEEEcCCCCCc
Q psy9188 76 QHMTLEPYDLILTGKAGFI 94 (104)
Q Consensus 76 ~~~tL~pGDvi~tGTp~g~ 94 (104)
+.-+|+.||.|.+|+--|.
T Consensus 23 ~~GtL~~Gd~iv~G~~~Gk 41 (95)
T cd03701 23 QNGTLKKGDVIVAGGTYGK 41 (95)
T ss_pred EcCeEecCCEEEECCccce
Confidence 5678999999999997654
No 40
>PRK08364 sulfur carrier protein ThiS; Provisional
Probab=40.50 E-value=78 Score=18.65 Aligned_cols=8 Identities=38% Similarity=0.576 Sum_probs=4.8
Q ss_pred EEEEECCE
Q psy9188 46 LWLKVNGE 53 (104)
Q Consensus 46 i~~~vNG~ 53 (104)
|++++||+
T Consensus 5 m~v~vng~ 12 (70)
T PRK08364 5 IRVKVIGR 12 (70)
T ss_pred EEEEEecc
Confidence 55566666
No 41
>cd03703 aeIF5B_II aeIF5B_II: This family represents the domain II of archeal and eukaryotic aeIF5B. aeIF5B is a homologue of prokaryotic Initiation Factor 2 (IF2). Disruption of the eIF5B gene (FUN12) in yeast causes a severe slow-growth phenotype, associated with a defect in translation. eIF5B has a function analogous to prokaryotic IF2 in mediating the joining of joining of 60S subunits. The eIF5B consists of three N-terminal domains (I, II, II) connected by a long helix to domain IV. Domain I is a G domain, domain II and IV are beta-barrels and domain III has a novel alpha-beta-alpha sandwich fold. The G domain and the beta-barrel domain II display a similar structure and arrangement to the homologous domains of EF1A, eEF1A and aeIF2gamma.
Probab=38.92 E-value=22 Score=23.59 Aligned_cols=19 Identities=21% Similarity=-0.083 Sum_probs=16.1
Q ss_pred cCCccCCCcEEEcCCCCCc
Q psy9188 76 QHMTLEPYDLILTGKAGFI 94 (104)
Q Consensus 76 ~~~tL~pGDvi~tGTp~g~ 94 (104)
+.-+|+.||.|..|+.-|+
T Consensus 23 ~~GtL~~GD~Iv~g~~~Gp 41 (110)
T cd03703 23 YDGTLREGDTIVVCGLNGP 41 (110)
T ss_pred ECCeEecCCEEEEccCCCC
Confidence 4678999999999998763
No 42
>PF03300 Tenui_NS4: Tenuivirus non-structural protein NS4; InterPro: IPR004980 This is a non-structural protein found in members of the Tenuivirus family.
Probab=37.97 E-value=18 Score=27.39 Aligned_cols=45 Identities=13% Similarity=0.237 Sum_probs=30.5
Q ss_pred cCcccCCCCcccCCcccCCCCCCCCCeeEEEEECCEEEeecccCccCC
Q psy9188 18 TVGKSFDTACPVSDFIPEHEIKDPDDVPLWLKVNGELRQKSTTGDMLF 65 (104)
Q Consensus 18 ~~~K~~d~~~p~Gp~i~~~~~~~~~~l~i~~~vNG~~~q~~~t~~m~~ 65 (104)
..+|+| +.+|.+-..-..+|..++.+.+.|.+..+|..-.++.+|
T Consensus 152 pisK~F---AvlgSlpnfl~~eD~~nl~v~~~i~d~SV~ncVIsr~Lw 196 (283)
T PF03300_consen 152 PISKNF---AVLGSLPNFLSYEDKDNLQVEISIKDASVQNCVISRSLW 196 (283)
T ss_pred hhhhhh---HHHhcCcccccccccccceEEEEEcCCcccceEeeeeeE
Confidence 367887 566654322234477899999999998888876655543
No 43
>PF02938 GAD: GAD domain; InterPro: IPR004115 This entry represetns an 2 layer alpha/beta insertion domain found in some glutamyl-tRNA amidotransferases and aspartyl tRNA synthetases [, ]. The function of this domain is not yet known.; GO: 0004812 aminoacyl-tRNA ligase activity, 0005524 ATP binding, 0005737 cytoplasm; PDB: 1ZQ1_D 1EQR_B 1IL2_B 1C0A_A 1L0W_A 1G51_B 1EFW_B 2D6F_D.
Probab=37.60 E-value=27 Score=21.93 Aligned_cols=41 Identities=20% Similarity=0.329 Sum_probs=26.1
Q ss_pred EEEEECCEEEeecccCccCCCHHHHHHHHHcCCccCCCcEEEcC
Q psy9188 46 LWLKVNGELRQKSTTGDMLFKTGDLISYISQHMTLEPYDLILTG 89 (104)
Q Consensus 46 i~~~vNG~~~q~~~t~~m~~~~~~lia~~s~~~tL~pGDvi~tG 89 (104)
..+++++...+ ++.... =-++.+..+-+.+.+++||+|+-.
T Consensus 44 ~~ikv~~~~~~-s~i~kf--l~e~~~~~l~~~~~a~~GD~ll~~ 84 (95)
T PF02938_consen 44 AWIKVEEGELK-SPIAKF--LSEEELKALIERLGAKPGDLLLFV 84 (95)
T ss_dssp CCEEESTCEEE-CTTCCC--CHHHHHHHHHHHTT--TTEEEEEE
T ss_pred eeeeEcCCccc-Cccccc--CCHHHHHHHHHHhCCCCCCEEEEE
Confidence 35677774443 444433 236778888889999999998864
No 44
>PF11064 DUF2865: Protein of unknown function (DUF2865); InterPro: IPR021293 This bacterial family of proteins has no known function.
Probab=35.60 E-value=17 Score=24.26 Aligned_cols=18 Identities=22% Similarity=0.296 Sum_probs=13.5
Q ss_pred HHHHHcCCccCCCcEEEc
Q psy9188 71 ISYISQHMTLEPYDLILT 88 (104)
Q Consensus 71 ia~~s~~~tL~pGDvi~t 88 (104)
...+....+|++||||.|
T Consensus 96 ~~~~~~~~t~~~GDivvt 113 (116)
T PF11064_consen 96 LAPADSDPTLRKGDIVVT 113 (116)
T ss_pred ccccccccccccCCEecC
Confidence 334556789999999986
No 45
>PRK11479 hypothetical protein; Provisional
Probab=32.28 E-value=28 Score=26.63 Aligned_cols=37 Identities=22% Similarity=0.125 Sum_probs=21.7
Q ss_pred EECCEEEeecccCccCCCHHHHHHHHHcCCccCCCcEEEcCCC
Q psy9188 49 KVNGELRQKSTTGDMLFKTGDLISYISQHMTLEPYDLILTGKA 91 (104)
Q Consensus 49 ~vNG~~~q~~~t~~m~~~~~~lia~~s~~~tL~pGDvi~tGTp 91 (104)
.-||...|+.+..+ + +....++ .-.|+|||||++.|.
T Consensus 40 ~~~~~~~pr~s~~q--~---~~g~~Vs-~~~LqpGDLVFfst~ 76 (274)
T PRK11479 40 KTWAVKFQHQSSFT--E---QGIKEIT-APDLKPGDLLFSSSL 76 (274)
T ss_pred hhcceecCcccHHH--H---hCCcccC-hhhCCCCCEEEEecC
Confidence 34555556666554 1 2222233 347999999998654
No 46
>KOG3276|consensus
Probab=32.08 E-value=28 Score=23.53 Aligned_cols=20 Identities=30% Similarity=0.600 Sum_probs=17.2
Q ss_pred HHHHHHHHcCCccCCCcEEE
Q psy9188 68 GDLISYISQHMTLEPYDLIL 87 (104)
Q Consensus 68 ~~lia~~s~~~tL~pGDvi~ 87 (104)
+||++|+|+...|+--|+.+
T Consensus 73 aeLl~ylskvLgLRksdv~l 92 (125)
T KOG3276|consen 73 AELLEYLSKVLGLRKSDVTL 92 (125)
T ss_pred HHHHHHHHHHhhhhhhheee
Confidence 58999999998888888764
No 47
>cd03702 IF2_mtIF2_II This family represents the domain II of bacterial Initiation Factor 2 (IF2) and its eukaryotic mitochondrial homologue mtIF2. IF2, the largest initiation factor is an essential GTP binding protein. In E. coli three natural forms of IF2 exist in the cell, IF2alpha, IF2beta1, and IF2beta2. Bacterial IF-2 is structurally and functionally related to eukaryotic mitochondrial mtIF-2.
Probab=30.17 E-value=35 Score=21.75 Aligned_cols=18 Identities=28% Similarity=0.226 Sum_probs=14.7
Q ss_pred cCCccCCCcEEEcCCCCC
Q psy9188 76 QHMTLEPYDLILTGKAGF 93 (104)
Q Consensus 76 ~~~tL~pGDvi~tGTp~g 93 (104)
+.-+|+.||.|.+|+--|
T Consensus 23 ~~GtL~~Gd~iv~G~~~g 40 (95)
T cd03702 23 QNGTLKVGDVLVAGTTYG 40 (95)
T ss_pred EcCeEeCCCEEEEccccc
Confidence 567899999999998544
No 48
>PF12077 DUF3556: Transmembrane protein of unknown function (DUF3556); InterPro: IPR021941 This family of transmembrane proteins is functionally uncharacterised. This protein is found in bacteria. Proteins in this family are typically between 576 to 592 amino acids in length.
Probab=29.59 E-value=99 Score=26.11 Aligned_cols=44 Identities=18% Similarity=0.273 Sum_probs=34.4
Q ss_pred eEEEEECCEEEeec----ccCccCCCHHHHHHHHHcCCccCCCcEEEc
Q psy9188 45 PLWLKVNGELRQKS----TTGDMLFKTGDLISYISQHMTLEPYDLILT 88 (104)
Q Consensus 45 ~i~~~vNG~~~q~~----~t~~m~~~~~~lia~~s~~~tL~pGDvi~t 88 (104)
+=....+||.++.. |..|=..+=+++|+-+.+.|.++||+++..
T Consensus 475 ~~y~~regE~vc~~~iGwNFGDGHLHnE~lIaAvQ~rC~FePGE~rvv 522 (574)
T PF12077_consen 475 DDYTLREGEFVCNTAIGWNFGDGHLHNEQLIAAVQRRCGFEPGEVRVV 522 (574)
T ss_pred cceeeeccceeecceeeecccCcccccHHHHHHHHHHcCCCCCeEEEE
Confidence 34455789988875 445555677899999999999999999764
No 49
>PRK10030 hypothetical protein; Provisional
Probab=29.41 E-value=36 Score=24.48 Aligned_cols=16 Identities=13% Similarity=0.030 Sum_probs=12.6
Q ss_pred HcCCccCCCcEEEcCC
Q psy9188 75 SQHMTLEPYDLILTGK 90 (104)
Q Consensus 75 s~~~tL~pGDvi~tGT 90 (104)
.--..|++||||+.-.
T Consensus 16 ~~~~~l~~GDlif~~g 31 (197)
T PRK10030 16 AFAWQPQTGDIIFQIS 31 (197)
T ss_pred hhhcCCCCCCEEEEeC
Confidence 3456999999999854
No 50
>PF00877 NLPC_P60: NlpC/P60 family; InterPro: IPR000064 The Escherichia coli NLPC/Listeria P60 domain occurs at the C terminus of a number of different bacterial and viral proteins. The viral proteins are either described as tail assembly proteins or Gp19. In bacteria, the proteins are variously described as being putative tail component of prophage, invasin, invasion associated protein, putative lipoprotein, cell wall hydrolase, or putative endopeptidase. The E. coli NLPC/Listeria P60 domain is contained within the boundaries of the cysteine peptidase domain that defines the MEROPS peptidase family C40 (clan C-). A type example being dipeptidyl-peptidase VI from Bacillus sphaericus and gamma-glutamyl-diamino acid-endopeptidase precursor from Lactococcus lactis 3.4.19.11 from EC. This group also contains proteins classified as non-peptidase homologues in that they either have been found experimentally to be without peptidase activity, or lack amino acid residues that are believed to be essential for the catalytic activity of peptidases in the C40 family. ; PDB: 3PVQ_B 3GT2_A 3NPF_B 2K1G_A 3I86_A 3S0Q_A 2XIV_A 3PBC_A 3NE0_A 3M1U_B ....
Probab=29.26 E-value=30 Score=21.60 Aligned_cols=15 Identities=33% Similarity=0.350 Sum_probs=12.0
Q ss_pred cCCccCCCcEEEcCC
Q psy9188 76 QHMTLEPYDLILTGK 90 (104)
Q Consensus 76 ~~~tL~pGDvi~tGT 90 (104)
....++|||+|+...
T Consensus 48 ~~~~~~pGDlif~~~ 62 (105)
T PF00877_consen 48 PISELQPGDLIFFKG 62 (105)
T ss_dssp EGGG-TTTEEEEEEG
T ss_pred chhcCCcccEEEEeC
Confidence 367899999999887
No 51
>PF15656 Tox-HDC: Toxin with a H, D/N and C signature
Probab=29.24 E-value=90 Score=21.02 Aligned_cols=31 Identities=19% Similarity=0.378 Sum_probs=24.7
Q ss_pred CCHHHHHHHHHcCCccCCCcE-EEcCCCCCcc
Q psy9188 65 FKTGDLISYISQHMTLEPYDL-ILTGKAGFIL 95 (104)
Q Consensus 65 ~~~~~lia~~s~~~tL~pGDv-i~tGTp~g~~ 95 (104)
.++.+-|..|++.-....+|| |+|||-+...
T Consensus 12 ~~i~~pl~~Iar~~s~~~~~I~IlSGtHG~~~ 43 (119)
T PF15656_consen 12 YDINAPLETIARRPSGDNGDIHILSGTHGYCS 43 (119)
T ss_pred hhhHHHHHHHHhCcCCCCCCEEEEeCCCCCcc
Confidence 357888999999888788886 7899977543
No 52
>PF01052 SpoA: Surface presentation of antigens (SPOA); InterPro: IPR001543 Proteins in this group are involved in a secretory pathway responsible for the surface presentation of invasion plasmid antigen needed for the entry of Salmonella and other species into mammalian cells [, ].They could play a role in preserving the translocation competence of the IPA antigens and are required for secretion of the three IPA proteins []. The C-terminal region of flagellar motor switch proteins FliN and FliM is also included in this entry. ; PDB: 3UEP_A 1O9Y_B 1YAB_A.
Probab=28.80 E-value=27 Score=20.76 Aligned_cols=17 Identities=24% Similarity=0.266 Sum_probs=11.0
Q ss_pred cCCccCCCcEEEcCCCC
Q psy9188 76 QHMTLEPYDLILTGKAG 92 (104)
Q Consensus 76 ~~~tL~pGDvi~tGTp~ 92 (104)
+...|++||+|-...+.
T Consensus 25 el~~L~~Gdvi~l~~~~ 41 (77)
T PF01052_consen 25 ELLNLKVGDVIPLDKPA 41 (77)
T ss_dssp HHHC--TT-EEEECCES
T ss_pred HHhcCCCCCEEEeCCCC
Confidence 44679999999999873
No 53
>COG4301 Uncharacterized conserved protein [Function unknown]
Probab=27.75 E-value=57 Score=25.30 Aligned_cols=27 Identities=26% Similarity=0.482 Sum_probs=24.1
Q ss_pred CCHHHHHHHHHc-CCccCCCcEEEcCCC
Q psy9188 65 FKTGDLISYISQ-HMTLEPYDLILTGKA 91 (104)
Q Consensus 65 ~~~~~lia~~s~-~~tL~pGDvi~tGTp 91 (104)
+++.+...++++ ..++.|||-++.|+-
T Consensus 167 ~tp~e~~~Fl~~l~~a~~pGd~~LlGvD 194 (321)
T COG4301 167 LTPGECAVFLTQLRGALRPGDYFLLGVD 194 (321)
T ss_pred CChHHHHHHHHHHHhcCCCcceEEEecc
Confidence 589999999987 689999999999984
No 54
>TIGR02219 phage_NlpC_fam putative phage cell wall peptidase, NlpC/P60 family. Members of this family show sequence similarity to members of the NlpC/P60 family described by Pfam model pfam00877 and by Anantharaman and Aravind (PubMed:12620121). The NlpC/P60 family includes a number of characterized bacterial cell wall hydrolases. Members of this related family are all found in prophage regions of bacterial genomes.
Probab=27.27 E-value=36 Score=22.71 Aligned_cols=13 Identities=15% Similarity=0.159 Sum_probs=10.5
Q ss_pred CccCCCcEEEcCC
Q psy9188 78 MTLEPYDLILTGK 90 (104)
Q Consensus 78 ~tL~pGDvi~tGT 90 (104)
-.|+|||+|+..+
T Consensus 75 ~~~qpGDlvff~~ 87 (134)
T TIGR02219 75 DAAQPGDVLVFRW 87 (134)
T ss_pred hcCCCCCEEEEee
Confidence 3689999999753
No 55
>PF13550 Phage-tail_3: Putative phage tail protein
Probab=27.18 E-value=34 Score=22.72 Aligned_cols=15 Identities=33% Similarity=0.485 Sum_probs=12.2
Q ss_pred CCccCCCcEEEcCCC
Q psy9188 77 HMTLEPYDLILTGKA 91 (104)
Q Consensus 77 ~~tL~pGDvi~tGTp 91 (104)
++.|+|||+|.--.+
T Consensus 137 ~~~l~pGDvi~l~~~ 151 (164)
T PF13550_consen 137 GLALEPGDVIALSDD 151 (164)
T ss_pred hccCCCCCEEEEEeC
Confidence 688999999976555
No 56
>PF14894 Lsm_C: Lsm C-terminal; PDB: 1M5Q_1.
Probab=26.45 E-value=59 Score=19.60 Aligned_cols=20 Identities=15% Similarity=0.328 Sum_probs=13.4
Q ss_pred HHHHHHHHHcCCccCCCcEE
Q psy9188 67 TGDLISYISQHMTLEPYDLI 86 (104)
Q Consensus 67 ~~~lia~~s~~~tL~pGDvi 86 (104)
+.|.-.++.+.+.|.||+|=
T Consensus 1 ~~eFa~~~~r~l~l~p~~VK 20 (64)
T PF14894_consen 1 PREFAEYLERELNLFPGMVK 20 (64)
T ss_dssp HHHHHHHHHH---HSTTTEE
T ss_pred ChHHHHHHHHhcccCccceE
Confidence 35677888899999999974
No 57
>PRK06033 hypothetical protein; Validated
Probab=25.85 E-value=56 Score=20.32 Aligned_cols=17 Identities=12% Similarity=0.038 Sum_probs=13.5
Q ss_pred cCCccCCCcEEEcCCCC
Q psy9188 76 QHMTLEPYDLILTGKAG 92 (104)
Q Consensus 76 ~~~tL~pGDvi~tGTp~ 92 (104)
....|++||+|-...+.
T Consensus 24 dlL~L~~GDVI~L~~~~ 40 (83)
T PRK06033 24 QVLRMGRGAVIPLDATE 40 (83)
T ss_pred HHhCCCCCCEEEeCCCC
Confidence 45679999999988753
No 58
>cd00060 FHA Forkhead associated domain (FHA); found in eukaryotic and prokaryotic proteins. Putative nuclear signalling domain. FHA domains may bind phosphothreonine, phosphoserine and sometimes phosphotyrosine. In eukaryotes, many FHA domain-containing proteins localize to the nucleus, where they participate in establishing or maintaining cell cycle checkpoints, DNA repair, or transcriptional regulation. Members of the FHA family include: Dun1, Rad53, Cds1, Mek1, KAPP(kinase-associated protein phosphatase),and Ki-67 (a human nuclear protein related to cell proliferation).
Probab=25.21 E-value=72 Score=18.99 Aligned_cols=15 Identities=27% Similarity=0.359 Sum_probs=12.2
Q ss_pred cCCccCCCcEEEcCC
Q psy9188 76 QHMTLEPYDLILTGK 90 (104)
Q Consensus 76 ~~~tL~pGDvi~tGT 90 (104)
....|.+||+|..|.
T Consensus 79 ~~~~l~~gd~i~ig~ 93 (102)
T cd00060 79 EPVRLRDGDVIRLGN 93 (102)
T ss_pred CcEECCCCCEEEECC
Confidence 467799999998885
No 59
>COG3862 Uncharacterized protein with conserved CXXC pairs [Function unknown]
Probab=25.11 E-value=74 Score=21.22 Aligned_cols=62 Identities=24% Similarity=0.322 Sum_probs=35.5
Q ss_pred CcccCCcccCCCCCCCCC-eeEEEEE-CCEEEe-ecccCccCC--CHHHHHHHHHc---CCccCCCcEEE
Q psy9188 26 ACPVSDFIPEHEIKDPDD-VPLWLKV-NGELRQ-KSTTGDMLF--KTGDLISYISQ---HMTLEPYDLIL 87 (104)
Q Consensus 26 ~~p~Gp~i~~~~~~~~~~-l~i~~~v-NG~~~q-~~~t~~m~~--~~~~lia~~s~---~~tL~pGDvi~ 87 (104)
.||-|--.-.+|+-+|.. +.-.++| ||+..- .--|..-|- -+.++...+|+ ..++..||++.
T Consensus 29 ~CPRG~ey~~~Ei~~pkrvvts~VrV~ng~~~~vpVKT~kPIpKel~~elmk~l~~l~v~aPVK~GDvV~ 98 (117)
T COG3862 29 RCPRGVEYGKEEITSPKRVVTSTVRVKNGELPVVPVKTEKPIPKELIPELMKELSRLKVGAPVKIGDVVI 98 (117)
T ss_pred cCCchhhhhHhhhcCcceEEEEEEEEcCCccceeecccCCCCCHHHhHHHHHHHHhheecCCeeeccEEe
Confidence 477776444566766643 3333344 665332 223433322 24566666665 68999999997
No 60
>PF01191 RNA_pol_Rpb5_C: RNA polymerase Rpb5, C-terminal domain; InterPro: IPR000783 Prokaryotes contain a single DNA-dependent RNA polymerase (RNAP; 2.7.7.6 from EC) that is responsible for the transcription of all genes, while eukaryotes have three classes of RNAPs (I-III) that transcribe different sets of genes. Each class of RNA polymerase is an assemblage of ten to twelve different polypeptides. Certain subunits of RNAPs, including RPB5 (POLR2E in mammals), are common to all three eukaryotic polymerases. RPB5 plays a role in the transcription activation process. Eukaryotic RPB5 has a bipartite structure consisting of a unique N-terminal region (IPR005571 from INTERPRO), plus a C-terminal region that is structurally homologous to the prokaryotic RPB5 homologue, subunit H (gene rpoH) [, , , ]. This entry represents prokaryotic subunit H and the C-terminal domain of eukaryotic RPB5, which share a two-layer alpha/beta fold, with a core structure of beta/alpha/beta/alpha/beta(2). ; GO: 0003677 DNA binding, 0003899 DNA-directed RNA polymerase activity, 0006351 transcription, DNA-dependent; PDB: 1EIK_A 2Y0S_Z 1DZF_A 3GTG_E 2VUM_E 3GTP_E 3GTO_E 3S17_E 3S1R_E 1I3Q_E ....
Probab=25.00 E-value=37 Score=20.94 Aligned_cols=14 Identities=14% Similarity=0.389 Sum_probs=8.7
Q ss_pred HHcCCccCCCcEEE
Q psy9188 74 ISQHMTLEPYDLIL 87 (104)
Q Consensus 74 ~s~~~tL~pGDvi~ 87 (104)
++++..+++||||=
T Consensus 43 v~r~~g~k~GdVvk 56 (74)
T PF01191_consen 43 VARYLGAKPGDVVK 56 (74)
T ss_dssp HHHHTT--TTSEEE
T ss_pred hhhhcCCCCCCEEE
Confidence 34677899999983
No 61
>PRK11433 aldehyde oxidoreductase 2Fe-2S subunit; Provisional
Probab=24.62 E-value=2.8e+02 Score=20.53 Aligned_cols=17 Identities=47% Similarity=0.653 Sum_probs=12.8
Q ss_pred CCCeeEEEEECCEEEee
Q psy9188 41 PDDVPLWLKVNGELRQK 57 (104)
Q Consensus 41 ~~~l~i~~~vNG~~~q~ 57 (104)
+....|+++|||+.+.-
T Consensus 47 ~~~~~i~~~VNG~~~~~ 63 (217)
T PRK11433 47 PEISPVTLKVNGKTEQL 63 (217)
T ss_pred CcCceEEEEECCEEEEE
Confidence 34567999999988754
No 62
>cd00754 MoaD Ubiquitin domain of MoaD-like proteins. MoaD family. Members of this family are involved in biosynthesis of the molybdenum cofactor (Moco), an essential cofactor of a diverse group of redox enzymes. Moco biosynthesis is an evolutionarily conserved pathway present in eubacteria, archaea and eukaryotes. Moco contains a tricyclic pyranopterin, termed molybdopterin (MPT), that contains the cis-dithiolene group responsible for molybdenum ligation. This dithiolene group is generated by MPT synthase, the second major step in Moco biosynthesis. MPT synthase consists of a large (MoeE) and small (MoaD) subunit. The small subunit is inserted into the lare subunit to form the active site. The small subunit, which is structurally similar to ubiquitin, contains a C-terminal thiocarboxylated glycine residue that serves as a sulfur donor for the synthesis of the MPT dithiolene group.
Probab=24.61 E-value=1.5e+02 Score=17.14 Aligned_cols=30 Identities=20% Similarity=0.142 Sum_probs=21.9
Q ss_pred eeEEEEECCEEEeecccCccCCCHHHHHHHHHcCCccCCCcEEEcCCCC
Q psy9188 44 VPLWLKVNGELRQKSTTGDMLFKTGDLISYISQHMTLEPYDLILTGKAG 92 (104)
Q Consensus 44 l~i~~~vNG~~~q~~~t~~m~~~~~~lia~~s~~~tL~pGDvi~tGTp~ 92 (104)
-.+.+.|||+.+. ....|..||.|.-=.|.
T Consensus 48 ~~~~v~vNg~~v~-------------------~~~~l~~gD~v~i~ppv 77 (80)
T cd00754 48 ARVRIAVNGEYVR-------------------LDTPLKDGDEVAIIPPV 77 (80)
T ss_pred hcEEEEECCeEcC-------------------CCcccCCCCEEEEeCCC
Confidence 4678889998765 33678899988765554
No 63
>PRK09570 rpoH DNA-directed RNA polymerase subunit H; Reviewed
Probab=24.45 E-value=43 Score=20.93 Aligned_cols=13 Identities=23% Similarity=0.337 Sum_probs=10.4
Q ss_pred HcCCccCCCcEEE
Q psy9188 75 SQHMTLEPYDLIL 87 (104)
Q Consensus 75 s~~~tL~pGDvi~ 87 (104)
+++..+++||||=
T Consensus 47 ~r~~g~k~GdVvk 59 (79)
T PRK09570 47 VKAIGAKPGDVIK 59 (79)
T ss_pred hhhcCCCCCCEEE
Confidence 3577899999984
No 64
>PF13799 DUF4183: Domain of unknown function (DUF4183)
Probab=24.30 E-value=38 Score=21.40 Aligned_cols=38 Identities=26% Similarity=0.308 Sum_probs=26.0
Q ss_pred CeeEEEEECCEEEeecccCccCCCHHHHHHHHHcCCccCCCcEEEcCCC
Q psy9188 43 DVPLWLKVNGELRQKSTTGDMLFKTGDLISYISQHMTLEPYDLILTGKA 91 (104)
Q Consensus 43 ~l~i~~~vNG~~~q~~~t~~m~~~~~~lia~~s~~~tL~pGDvi~tGTp 91 (104)
+-..-++|||.+-+.+.. ..- ....+|..+|.+..|||
T Consensus 42 ~sY~NLyINGvLQ~~~~y-tvt----------~~~Lti~~~d~~~~GtP 79 (84)
T PF13799_consen 42 NSYYNLYINGVLQPGSLY-TVT----------TGSLTINTGDPILAGTP 79 (84)
T ss_pred ceEEEEEEEEEecCCcce-EEe----------cceEEEcCCCCCCCCCc
Confidence 667889999977655442 121 34567778888888887
No 65
>PF15092 UPF0728: Uncharacterised protein family UPF0728
Probab=23.74 E-value=91 Score=19.96 Aligned_cols=26 Identities=31% Similarity=0.357 Sum_probs=20.4
Q ss_pred CCCCCeeEEEEECCEEEeecccCccCCC
Q psy9188 39 KDPDDVPLWLKVNGELRQKSTTGDMLFK 66 (104)
Q Consensus 39 ~~~~~l~i~~~vNG~~~q~~~t~~m~~~ 66 (104)
.|+. .+++.+||+.+-+.+..++.|.
T Consensus 47 ~d~n--~vel~vnge~VF~CnI~~L~Fg 72 (88)
T PF15092_consen 47 EDWN--VVELVVNGEIVFQCNITDLEFG 72 (88)
T ss_pred cccc--EEEEEECCeEEEEeCCceeecC
Confidence 3455 4778899999999998877765
No 66
>PRK11507 ribosome-associated protein; Provisional
Probab=23.55 E-value=51 Score=20.13 Aligned_cols=14 Identities=0% Similarity=-0.059 Sum_probs=11.4
Q ss_pred cCCccCCCcEEEcC
Q psy9188 76 QHMTLEPYDLILTG 89 (104)
Q Consensus 76 ~~~tL~pGDvi~tG 89 (104)
|...|+|||+|-..
T Consensus 49 RgkKl~~GD~V~~~ 62 (70)
T PRK11507 49 KRCKIVAGQTVSFA 62 (70)
T ss_pred cCCCCCCCCEEEEC
Confidence 67889999998653
No 67
>smart00537 DCX Domain in the Doublecortin (DCX) gene product. Tandemly-repeated domain in doublin, the Doublecortin gene product. Proposed to bind tubulin. Doublecortin (DCX) is mutated in human X-linked neuronal migration defects.
Probab=23.53 E-value=1.4e+02 Score=18.47 Aligned_cols=52 Identities=10% Similarity=-0.024 Sum_probs=35.0
Q ss_pred CCeeEEEEECCEEEeecc----cCccCCCHHHHHHHHHcCCccCCC-cEEEcCCCCC
Q psy9188 42 DDVPLWLKVNGELRQKST----TGDMLFKTGDLISYISQHMTLEPY-DLILTGKAGF 93 (104)
Q Consensus 42 ~~l~i~~~vNG~~~q~~~----t~~m~~~~~~lia~~s~~~tL~pG-Dvi~tGTp~g 93 (104)
....|.+..||.....+. ....+-++++++..+++...|..| -|=-.-|+.|
T Consensus 4 k~k~i~~~rNGD~~~~g~~~~v~~~~~~s~d~lL~~lt~~v~l~~~~~Vr~lyt~~G 60 (89)
T smart00537 4 KPKRIRFYRNGDRFFKGVRLVVNRKRFKSFEALLQDLTEVVKLDLPHGVRKLYTLDG 60 (89)
T ss_pred cceEEEEEeCCCCCCCCEEEEEChhhcCCHHHHHHHHhhhcccCCCCCeeEEEcCCC
Confidence 445788999998866654 235788999999999994444333 2444445554
No 68
>PRK15078 polysaccharide export protein Wza; Provisional
Probab=23.20 E-value=1.1e+02 Score=24.13 Aligned_cols=39 Identities=23% Similarity=0.270 Sum_probs=27.9
Q ss_pred eeEEEEECCEEEeecccCccCCCHHHHHHH--HHcCCccCCCcEEEcCC
Q psy9188 44 VPLWLKVNGELRQKSTTGDMLFKTGDLISY--ISQHMTLEPYDLILTGK 90 (104)
Q Consensus 44 l~i~~~vNG~~~q~~~t~~m~~~~~~lia~--~s~~~tL~pGDvi~tGT 90 (104)
-.+.+..||+.. ..++.+++.. +++...|+|||+|.--.
T Consensus 211 ~~V~l~R~g~~~--------~i~l~~ll~~g~~~~ni~L~~GDvI~Vp~ 251 (379)
T PRK15078 211 RNVVLTHNGKEE--------RISLQALMQNGDLSQNRLLYPGDILYVPR 251 (379)
T ss_pred ceEEEEECCeEE--------EEEHHHHHhcCCcccCceeCCCCEEEECC
Confidence 357777788653 2466777763 46778899999998755
No 69
>KOG2915|consensus
Probab=22.73 E-value=70 Score=24.97 Aligned_cols=29 Identities=21% Similarity=0.465 Sum_probs=20.0
Q ss_pred cCCCHHHHHHHHHcCCccCCCcEEE-cCCCCC
Q psy9188 63 MLFKTGDLISYISQHMTLEPYDLIL-TGKAGF 93 (104)
Q Consensus 63 m~~~~~~lia~~s~~~tL~pGDvi~-tGTp~g 93 (104)
++|+++ |++|-+.+.+.||++|+ +||-+|
T Consensus 88 I~Yt~D--ia~I~~~L~i~PGsvV~EsGTGSG 117 (314)
T KOG2915|consen 88 ILYTPD--IAMILSMLEIRPGSVVLESGTGSG 117 (314)
T ss_pred EEeccc--HHHHHHHhcCCCCCEEEecCCCcc
Confidence 444443 66777788999999886 566544
No 70
>PF13533 Biotin_lipoyl_2: Biotin-lipoyl like
Probab=22.41 E-value=35 Score=18.85 Aligned_cols=14 Identities=14% Similarity=0.266 Sum_probs=9.9
Q ss_pred HcCCccCCCcEEEc
Q psy9188 75 SQHMTLEPYDLILT 88 (104)
Q Consensus 75 s~~~tL~pGDvi~t 88 (104)
..+..++.||+|++
T Consensus 19 ~~G~~VkkGd~L~~ 32 (50)
T PF13533_consen 19 KEGQQVKKGDVLLV 32 (50)
T ss_pred cCCCEEcCCCEEEE
Confidence 35677788888765
No 71
>TIGR03882 cyclo_dehyd_2 bacteriocin biosynthesis cyclodehydratase domain. This model describes a ThiF-like domain of a fusion protein found in clusters associated with the production of TOMMs (thiazole/oxazole-modified microcins), small bacteriocins with characteristic heterocycle modifications. This domain is presumed to act as a cyclodehydratase, as do members of the SagC family modeled by TIGR03603.
Probab=22.38 E-value=62 Score=23.03 Aligned_cols=26 Identities=19% Similarity=0.229 Sum_probs=22.0
Q ss_pred HHhCCCCCcCcccCCCCcccCCcccC
Q psy9188 10 AREHGLPWTVGKSFDTACPVSDFIPE 35 (104)
Q Consensus 10 ~~~~~~~~~~~K~~d~~~p~Gp~i~~ 35 (104)
....+.||...|..++..-+||++.|
T Consensus 159 ~l~~~~~~l~v~~~~~~~~~gp~~~p 184 (193)
T TIGR03882 159 ALAAGRPWLLVKPGGVQPWIGPLFKP 184 (193)
T ss_pred HHHcCCceEEEEeCCceEEECCeecC
Confidence 34578999999999999999998743
No 72
>COG0309 HypE Hydrogenase maturation factor [Posttranslational modification, protein turnover, chaperones]
Probab=22.35 E-value=56 Score=25.86 Aligned_cols=16 Identities=38% Similarity=0.553 Sum_probs=11.2
Q ss_pred CCccCCCc-EEEcCCCC
Q psy9188 77 HMTLEPYD-LILTGKAG 92 (104)
Q Consensus 77 ~~tL~pGD-vi~tGTp~ 92 (104)
...++||| ||+||||+
T Consensus 158 ~~~~~~GD~vI~tg~~g 174 (339)
T COG0309 158 PSGARPGDAVIVTGTIG 174 (339)
T ss_pred cCCCCCCCEEEEcCChh
Confidence 45678888 45788875
No 73
>PF02814 UreE_N: UreE urease accessory protein, N-terminal domain; InterPro: IPR004029 Urease and other nickel metalloenzymes are synthesised as precursors devoid of the metalloenzyme active site. These precursors then undergo a complex post-translational maturation process that requires a number of accessory proteins. Members of this group are nickel-binding proteins required for urease metallocentre assembly []. They are believed to function as metallochaperones to deliver nickel to urease apoprotein [, ]. It has been shown by yeast two-hybrid analysis that UreE forms a dimeric complex with UreG in Helicobacter pylori []. The UreDFG-apoenzyme complex has also been shown to exist [, ] and is believed to be, with the addition of UreE, the assembly system for active urease []. The complexes, rather than the individual proteins, presumably bind to UreB via UreE/H recognition sites. The structure of Klebsiella aerogenes UreE reveals a unique two-domain architecture.The N-terminal domain is structurally related to a heat shock protein, while the C-terminal domain shows homology to the Atx1 copper metallochaperone [, ]. Significantly, the metal-binding sites in UreE and Atx1 are distinct in location and types of residues despite the relationship between these proteins and the mechanism for UreE activation of urease is proposed to be different from the thiol ligand exchange mechanism used by the copper metallochaperones. The N-terminal domain is termed the peptide-binding domain. Deletion of this domain does not eliminate enzymatic activity, and the truncated protein can still activate urease [].; GO: 0016151 nickel ion binding, 0006461 protein complex assembly, 0019627 urea metabolic process; PDB: 3NXZ_B 3TJA_B 3LA0_B 3TJ9_B 3NY0_A 3L9Z_A 3TJ8_A 1EAR_A 1EB0_A 1GMU_B ....
Probab=22.18 E-value=46 Score=19.42 Aligned_cols=16 Identities=13% Similarity=0.221 Sum_probs=9.6
Q ss_pred HHcCCccCCCcEEEcC
Q psy9188 74 ISQHMTLEPYDLILTG 89 (104)
Q Consensus 74 ~s~~~tL~pGDvi~tG 89 (104)
+.+...|+.||++.+-
T Consensus 45 L~r~~~L~~GDvL~~d 60 (65)
T PF02814_consen 45 LPRGTVLRDGDVLYLD 60 (65)
T ss_dssp -SSTTT--TTEEEEEC
T ss_pred CCCCcccCCCCEEEeC
Confidence 3466779999998764
No 74
>TIGR02754 sod_Ni_protease nickel-type superoxide dismutase maturation protease. Members of this protein family are apparent proteases encoded adjacent to the genes for a nickel-type superoxide dismutase. This family belongs to the same larger family (see Pfam model pfam00717) as signal peptidase I, an unusual serine protease suggested to have a Ser/Lys catalytic dyad.
Probab=22.03 E-value=55 Score=19.83 Aligned_cols=10 Identities=50% Similarity=0.630 Sum_probs=5.3
Q ss_pred ccCCCcEEEc
Q psy9188 79 TLEPYDLILT 88 (104)
Q Consensus 79 tL~pGDvi~t 88 (104)
+|.+||+|+.
T Consensus 11 ~l~~GD~vlv 20 (90)
T TIGR02754 11 TLPPGDRIIV 20 (90)
T ss_pred ccCCCCEEEE
Confidence 4555555553
No 75
>cd00986 PDZ_LON_protease PDZ domain of ATP-dependent LON serine proteases. Most PDZ domains bind C-terminal polypeptides, though binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this bacterial subfamily of protease-associated PDZ domains a C-terminal beta-strand is thought to form the peptide-binding groove base, a circular permutation with respect to PDZ domains found in Eumetazoan signaling proteins.
Probab=21.74 E-value=57 Score=19.10 Aligned_cols=10 Identities=30% Similarity=0.395 Sum_probs=8.5
Q ss_pred ccCCCcEEEc
Q psy9188 79 TLEPYDLILT 88 (104)
Q Consensus 79 tL~pGDvi~t 88 (104)
.|++||+|..
T Consensus 24 gL~~GD~I~~ 33 (79)
T cd00986 24 KLKAGDHIIA 33 (79)
T ss_pred CCCCCCEEEE
Confidence 4999999975
No 76
>PF00717 Peptidase_S24: Peptidase S24-like peptidase classification. ; InterPro: IPR019759 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ].; PDB: 1KCA_H 3BDN_A 1F39_A 1JHH_A 1JHE_B 3JSP_A 1JHF_B 1JHC_A 3JSO_B 1B12_D ....
Probab=21.67 E-value=42 Score=19.03 Aligned_cols=10 Identities=30% Similarity=0.547 Sum_probs=4.9
Q ss_pred ccCCCcEEEc
Q psy9188 79 TLEPYDLILT 88 (104)
Q Consensus 79 tL~pGDvi~t 88 (104)
++.+||+|+.
T Consensus 10 ~i~~Gd~v~v 19 (70)
T PF00717_consen 10 TIKDGDIVLV 19 (70)
T ss_dssp TSSTTEEEEE
T ss_pred CeeCCCEEEE
Confidence 4445555543
No 77
>PF13180 PDZ_2: PDZ domain; PDB: 2L97_A 1Y8T_A 2Z9I_A 1LCY_A 2PZD_B 2P3W_A 1VCW_C 1TE0_B 1SOZ_C 1SOT_C ....
Probab=21.47 E-value=49 Score=19.62 Aligned_cols=12 Identities=42% Similarity=0.672 Sum_probs=8.1
Q ss_pred CCccCCCcEEEc
Q psy9188 77 HMTLEPYDLILT 88 (104)
Q Consensus 77 ~~tL~pGDvi~t 88 (104)
..-|++||+|+.
T Consensus 29 ~aGl~~GD~I~~ 40 (82)
T PF13180_consen 29 KAGLQPGDIILA 40 (82)
T ss_dssp HTTS-TTEEEEE
T ss_pred HCCCCCCcEEEE
Confidence 344999999974
No 78
>PF04312 DUF460: Protein of unknown function (DUF460); InterPro: IPR007408 This is an archaeal protein of unknown function.
Probab=21.41 E-value=1.3e+02 Score=20.79 Aligned_cols=29 Identities=28% Similarity=0.517 Sum_probs=24.8
Q ss_pred EEEECCEEEeecccCccCCCHHHHHHHHHcC
Q psy9188 47 WLKVNGELRQKSTTGDMLFKTGDLISYISQH 77 (104)
Q Consensus 47 ~~~vNG~~~q~~~t~~m~~~~~~lia~~s~~ 77 (104)
.+-+||+++.-.+...| +..++|+++++.
T Consensus 47 ildL~G~~l~l~S~R~~--~~~evi~~I~~~ 75 (138)
T PF04312_consen 47 ILDLDGELLDLKSSRNM--SRSEVIEWISEY 75 (138)
T ss_pred EEecCCcEEEEEeecCC--CHHHHHHHHHHc
Confidence 45679999999998887 899999999864
No 79
>KOG1939|consensus
Probab=21.25 E-value=1.1e+02 Score=27.51 Aligned_cols=45 Identities=18% Similarity=0.109 Sum_probs=29.7
Q ss_pred CCEEEeecc-cCccCCCHHHHHHHHH--cCCccCCCcEEEcCCCCCcc
Q psy9188 51 NGELRQKST-TGDMLFKTGDLISYIS--QHMTLEPYDLILTGKAGFIL 95 (104)
Q Consensus 51 NG~~~q~~~-t~~m~~~~~~lia~~s--~~~tL~pGDvi~tGTp~g~~ 95 (104)
+|-++..+. .--++-++.+.|.|-. ....|.||||++|-.|...+
T Consensus 771 dGgLVANAPHvPvhLGsMqt~V~~Q~k~~~~kLk~GDVlltNHP~aGG 818 (1247)
T KOG1939|consen 771 DGGLVANAPHVPVHLGSMQTTVKWQLKHWGEKLKPGDVLLTNHPSAGG 818 (1247)
T ss_pred CCCeeccCCCCceeecchHHHHHHHHHHhccCCCCCcEEEecCCCCCC
Confidence 455555443 2345556666666654 35899999999999997443
No 80
>PF04571 Lipin_N: lipin, N-terminal conserved region; InterPro: IPR007651 Mutations in the lipin gene lead to fatty liver dystrophy in mice. The protein has been shown to be phosphorylated by the TOR Ser/Thr protein kinases in response to insulin stimulation. This entry represents a conserved domain found at the N terminus of the member proteins [, ].
Probab=21.17 E-value=1.4e+02 Score=19.91 Aligned_cols=35 Identities=20% Similarity=0.251 Sum_probs=22.7
Q ss_pred ccCCCCcccCCc-c--cCCCCCCCCCeeEEEEECCEEE
Q psy9188 21 KSFDTACPVSDF-I--PEHEIKDPDDVPLWLKVNGELR 55 (104)
Q Consensus 21 K~~d~~~p~Gp~-i--~~~~~~~~~~l~i~~~vNG~~~ 55 (104)
|--|++.-..|| + -.-.+-.+.+-.+.+.|||+..
T Consensus 34 ~q~DGs~~sSPFhVRFGk~~vl~~~ek~V~I~VNG~~~ 71 (110)
T PF04571_consen 34 EQPDGSLKSSPFHVRFGKLGVLRPREKVVDIEVNGKPV 71 (110)
T ss_pred ecCCCCEecCccEEEEcceeeecccCcEEEEEECCEEc
Confidence 344777777787 3 2223334566788999999875
No 81
>TIGR03439 methyl_EasF probable methyltransferase domain, EasF family. This model represents an uncharacterized domain of about 300 amino acids with homology to S-adenosylmethionine-dependent methyltransferases. Proteins with this domain are exclusively fungal. A few, such as EasF from Neotyphodium lolii, are associated with the biosynthesis of ergot alkaloids, a class of fungal secondary metabolites. EasF may, in fact, be the AdoMet:dimethylallyltryptophan N-methyltransferase, the enzyme that follows tryptophan dimethylallyltransferase (DMATS) in ergot alkaloid biosynthesis. Several other members of this family, including mug158 (meiotically up-regulated gene 158 protein) from Schizosaccharomyces pombe, contain an additional uncharacterized domain DUF323 (pfam03781).
Probab=21.05 E-value=73 Score=24.73 Aligned_cols=27 Identities=22% Similarity=0.488 Sum_probs=22.8
Q ss_pred CCHHHHHHHHHcCC--ccCCCcEEEcCCC
Q psy9188 65 FKTGDLISYISQHM--TLEPYDLILTGKA 91 (104)
Q Consensus 65 ~~~~~lia~~s~~~--tL~pGDvi~tGTp 91 (104)
|++++.++++.+.. .|.|||.++.|.-
T Consensus 170 f~~~ea~~fL~~~~~~~l~~~d~lLiG~D 198 (319)
T TIGR03439 170 FSRPEAAAFLAGFLATALSPSDSFLIGLD 198 (319)
T ss_pred CCHHHHHHHHHHHHHhhCCCCCEEEEecC
Confidence 48899999998764 5899999999874
No 82
>PF13621 Cupin_8: Cupin-like domain; PDB: 3AL6_C 3AL5_C 2XUM_A 2Y0I_A 1MZE_A 3KCY_A 1MZF_A 1YCI_A 2ILM_A 1H2L_A ....
Probab=20.92 E-value=38 Score=23.85 Aligned_cols=13 Identities=31% Similarity=0.570 Sum_probs=9.9
Q ss_pred cCCccCCCcEEEc
Q psy9188 76 QHMTLEPYDLILT 88 (104)
Q Consensus 76 ~~~tL~pGDvi~t 88 (104)
..++|+|||+|+-
T Consensus 209 ~~~~l~pGD~Lfi 221 (251)
T PF13621_consen 209 YEVVLEPGDVLFI 221 (251)
T ss_dssp EEEEEETT-EEEE
T ss_pred eEEEECCCeEEEE
Confidence 4689999999974
No 83
>cd01674 Homoaconitase_Swivel Homoaconitase swivel domain. This family includes homoaconitase and other uncharacterized proteins of the Aconitase family. Homoaconitase is part of an unusual lysine biosynthesis pathway found only in filamentous fungi, in which lysine is synthesized via the alpha-aminoadipate pathway. In this pathway, homoaconitase catalyzes the conversion of cis-homoaconitic acid into homoisocitric acid. The reaction mechanism is believed to be similar to that of other aconitases. This is the swivel domain, which is believed to undergo swivelling conformational change in the enzyme mechanism.
Probab=20.79 E-value=1.6e+02 Score=20.12 Aligned_cols=31 Identities=6% Similarity=-0.036 Sum_probs=22.0
Q ss_pred CHHHHHHHHHcC------CccCCCcEEEcCCCCCccc
Q psy9188 66 KTGDLISYISQH------MTLEPYDLILTGKAGFILM 96 (104)
Q Consensus 66 ~~~~lia~~s~~------~tL~pGDvi~tGTp~g~~~ 96 (104)
+++++-.|+-+. ..+++||||.+|---|+|-
T Consensus 23 ~~e~la~~~~e~~dp~f~~~v~~gdilVaG~nFG~GS 59 (129)
T cd01674 23 TPEKMAEVCMENYDSEFSTKTKQGDILVSGFNFGTGS 59 (129)
T ss_pred CHHHHHHhhcccCCchhhhcCCCCCEEEeCCccCCCC
Confidence 556666665432 4588999999998877764
No 84
>cd00991 PDZ_archaeal_metalloprotease PDZ domain of archaeal zinc metalloprotases, presumably membrane-associated or integral membrane proteases, which may be involved in signalling and regulatory mechanisms. May be responsible for substrate recognition and/or binding, as most PDZ domains bind C-terminal polypeptides, and binding to internal (non-C-terminal) polypeptides and even to lipids has been demonstrated. In this subfamily of protease-associated PDZ domains a C-terminal beta-strand forms the peptide-binding groove base, a circular permutation with respect to PDZ domains found in Eumetazoan signaling proteins.
Probab=20.33 E-value=63 Score=19.11 Aligned_cols=12 Identities=25% Similarity=0.362 Sum_probs=9.5
Q ss_pred CCccCCCcEEEc
Q psy9188 77 HMTLEPYDLILT 88 (104)
Q Consensus 77 ~~tL~pGDvi~t 88 (104)
...|++||+|+.
T Consensus 25 ~aGL~~GDiI~~ 36 (79)
T cd00991 25 NAVLHTGDVIYS 36 (79)
T ss_pred hcCCCCCCEEEE
Confidence 457999999974
Done!