Query psy4903
Match_columns 73
No_of_seqs 112 out of 1019
Neff 7.4
Searched_HMMs 46136
Date Fri Aug 16 20:48:35 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy4903.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/4903hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG1214|consensus 99.6 3.4E-16 7.3E-21 118.3 6.3 66 8-73 1113-1182(1289)
2 PF00058 Ldl_recept_b: Low-den 99.6 8.9E-15 1.9E-19 74.7 5.9 39 18-56 1-42 (42)
3 KOG1214|consensus 99.6 2.1E-15 4.7E-20 114.1 4.6 68 6-73 1025-1094(1289)
4 KOG1215|consensus 99.2 1.5E-11 3.2E-16 93.4 4.8 66 7-72 524-593 (877)
5 smart00135 LY Low-density lipo 98.9 2.4E-09 5.2E-14 53.0 3.7 34 5-38 8-43 (43)
6 smart00135 LY Low-density lipo 98.8 1.2E-08 2.7E-13 50.4 4.1 34 40-73 2-35 (43)
7 KOG1215|consensus 98.7 2.5E-08 5.3E-13 76.0 4.8 62 8-73 1-63 (877)
8 PLN02919 haloacid dehalogenase 98.1 2E-05 4.3E-10 62.0 8.2 67 5-71 567-648 (1057)
9 PF08450 SGL: SMP-30/Gluconola 98.1 3.6E-05 7.8E-10 50.5 8.2 60 7-69 1-62 (246)
10 PF03088 Str_synth: Strictosid 98.0 3.8E-05 8.2E-10 44.8 6.4 62 10-72 2-82 (89)
11 PF08450 SGL: SMP-30/Gluconola 97.9 0.00014 3E-09 47.7 8.1 68 4-72 132-208 (246)
12 PLN02919 haloacid dehalogenase 97.6 0.0004 8.7E-09 54.8 8.6 67 5-71 623-707 (1057)
13 PF06977 SdiA-regulated: SdiA- 97.2 0.0018 3.8E-08 43.9 6.2 63 3-66 168-240 (248)
14 KOG1520|consensus 97.1 0.0012 2.5E-08 47.3 5.0 67 4-71 159-243 (376)
15 PF03022 MRJP: Major royal jel 97.0 0.0051 1.1E-07 42.2 7.5 60 7-67 187-255 (287)
16 COG3386 Gluconolactonase [Carb 97.0 0.0032 6.9E-08 43.8 6.3 58 5-64 162-228 (307)
17 COG3386 Gluconolactonase [Carb 96.9 0.0064 1.4E-07 42.3 7.1 60 11-72 116-188 (307)
18 TIGR02604 Piru_Ver_Nterm putat 96.6 0.0074 1.6E-07 42.3 6.0 57 8-67 126-203 (367)
19 KOG4659|consensus 96.6 0.0056 1.2E-07 49.9 5.5 63 4-67 473-553 (1899)
20 PF01436 NHL: NHL repeat; Int 96.5 0.0087 1.9E-07 27.4 4.0 25 46-71 1-25 (28)
21 PF07995 GSDH: Glucose / Sorbo 96.5 0.0078 1.7E-07 41.8 5.4 62 8-70 116-204 (331)
22 KOG4659|consensus 96.1 0.016 3.5E-07 47.4 5.8 61 7-68 408-495 (1899)
23 PF00058 Ldl_recept_b: Low-den 95.6 0.011 2.4E-07 29.6 2.1 15 59-73 1-16 (42)
24 PF10282 Lactonase: Lactonase, 95.5 0.18 3.9E-06 34.9 8.3 66 7-72 246-317 (345)
25 PF01731 Arylesterase: Arylest 95.3 0.07 1.5E-06 30.8 4.8 38 33-71 41-78 (86)
26 TIGR03606 non_repeat_PQQ dehyd 95.1 0.11 2.4E-06 38.2 6.6 42 27-69 200-251 (454)
27 KOG1520|consensus 94.8 0.097 2.1E-06 37.7 5.5 60 8-68 117-181 (376)
28 PF10282 Lactonase: Lactonase, 94.6 0.41 8.9E-06 33.1 8.1 64 8-71 194-269 (345)
29 PRK11028 6-phosphogluconolacto 93.4 1.4 3E-05 29.8 8.8 62 8-69 37-102 (330)
30 PF13449 Phytase-like: Esteras 92.9 0.42 9E-06 33.1 5.7 57 9-66 88-166 (326)
31 PF06977 SdiA-regulated: SdiA- 92.9 1.6 3.5E-05 29.6 8.4 47 8-54 24-72 (248)
32 COG3204 Uncharacterized protei 92.0 0.77 1.7E-05 32.4 6.1 62 3-65 230-301 (316)
33 COG3391 Uncharacterized conser 91.7 2.3 5E-05 30.0 8.3 63 7-71 117-184 (381)
34 PRK11028 6-phosphogluconolacto 90.5 2.4 5.1E-05 28.7 7.3 63 9-71 231-298 (330)
35 PF03088 Str_synth: Strictosid 90.2 0.46 1E-05 27.6 3.1 32 5-36 56-89 (89)
36 PF03022 MRJP: Major royal jel 88.5 1.3 2.8E-05 30.4 4.8 62 10-72 132-210 (287)
37 KOG4499|consensus 88.2 2.7 5.9E-05 29.2 6.1 66 4-70 156-234 (310)
38 PF07995 GSDH: Glucose / Sorbo 88.2 3.4 7.3E-05 28.7 6.8 60 6-66 2-71 (331)
39 PF01731 Arylesterase: Arylest 87.8 1 2.2E-05 25.9 3.4 25 5-29 53-79 (86)
40 PF05345 He_PIG: Putative Ig d 86.4 0.88 1.9E-05 23.4 2.4 19 6-24 11-29 (49)
41 COG3391 Uncharacterized conser 85.5 9.3 0.0002 27.0 7.9 60 7-67 75-136 (381)
42 COG2133 Glucose/sorbosone dehy 85.1 4 8.7E-05 29.7 5.9 59 9-69 180-261 (399)
43 TIGR02604 Piru_Ver_Nterm putat 84.9 4.1 8.9E-05 28.5 5.8 60 5-67 71-143 (367)
44 COG2706 3-carboxymuconate cycl 83.9 14 0.0003 26.6 8.5 65 8-72 193-269 (346)
45 KOG4499|consensus 81.6 4.4 9.6E-05 28.2 4.8 62 3-65 209-296 (310)
46 PF13449 Phytase-like: Esteras 75.7 16 0.00034 25.3 6.2 58 9-68 23-105 (326)
47 KOG4649|consensus 74.2 3.6 7.7E-05 29.1 2.7 24 11-34 36-61 (354)
48 PF05787 DUF839: Bacterial pro 70.4 36 0.00078 25.6 7.3 56 5-60 349-449 (524)
49 COG2706 3-carboxymuconate cycl 70.0 38 0.00083 24.4 8.2 64 8-71 42-113 (346)
50 COG3823 Glutamine cyclotransfe 67.5 11 0.00024 25.8 3.8 21 45-65 227-247 (262)
51 PF02333 Phytase: Phytase; In 62.4 37 0.0008 24.7 5.9 48 7-54 209-264 (381)
52 TIGR03606 non_repeat_PQQ dehyd 62.3 62 0.0013 24.0 7.3 60 5-65 29-103 (454)
53 PF06739 SBBP: Beta-propeller 61.2 16 0.00034 17.6 2.8 19 46-65 12-30 (38)
54 COG3204 Uncharacterized protei 60.5 22 0.00049 25.2 4.4 47 8-54 88-136 (316)
55 TIGR03075 PQQ_enz_alc_DH PQQ-d 60.2 26 0.00057 26.1 5.0 48 11-65 239-287 (527)
56 TIGR02658 TTQ_MADH_Hv methylam 59.8 62 0.0013 23.2 7.9 57 11-69 253-322 (352)
57 PF12814 Mcp5_PH: Meiotic cell 57.7 18 0.00039 21.7 3.2 17 11-27 35-52 (123)
58 TIGR02276 beta_rpt_yvtn 40-res 57.3 19 0.00042 16.6 5.5 40 15-55 1-42 (42)
59 PF00930 DPPIV_N: Dipeptidyl p 54.7 44 0.00095 23.2 5.1 48 11-59 286-340 (353)
60 PF14759 Reductase_C: Reductas 53.7 29 0.00062 19.3 3.4 26 21-46 2-29 (85)
61 PLN03216 actin depolymerizing 53.1 6.2 0.00013 24.4 0.6 15 1-16 1-15 (141)
62 PF06433 Me-amine-dh_H: Methyl 51.0 68 0.0015 23.1 5.6 49 16-64 194-255 (342)
63 KOG4649|consensus 49.7 17 0.00038 25.8 2.4 26 48-73 32-57 (354)
64 PF03664 Glyco_hydro_62: Glyco 46.9 49 0.0011 23.0 4.2 30 14-43 141-179 (271)
65 TIGR03866 PQQ_ABC_repeats PQQ- 46.5 75 0.0016 20.2 8.4 58 8-66 33-92 (300)
66 COG3823 Glutamine cyclotransfe 45.0 44 0.00096 23.0 3.7 29 7-35 230-260 (262)
67 PF10349 WWbp: WW-domain ligan 42.2 14 0.00031 22.2 1.0 15 2-16 10-24 (116)
68 PRK04043 tolB translocation pr 41.4 1.3E+02 0.0029 21.6 6.4 51 13-66 284-338 (419)
69 TIGR03866 PQQ_ABC_repeats PQQ- 40.7 95 0.002 19.7 8.8 59 8-67 209-269 (300)
70 PF05096 Glu_cyclase_2: Glutam 40.4 29 0.00063 24.0 2.4 29 6-34 231-261 (264)
71 PF14251 DUF4346: Domain of un 40.2 20 0.00042 22.1 1.4 17 8-24 42-58 (119)
72 TIGR03032 conserved hypothetic 38.0 98 0.0021 22.3 4.7 46 17-66 212-260 (335)
73 TIGR02658 TTQ_MADH_Hv methylam 34.9 1.7E+02 0.0036 21.0 5.6 52 14-65 202-266 (352)
74 cd01213 tensin Tensin Phosphot 33.3 84 0.0018 19.6 3.5 38 3-41 48-88 (138)
75 PF01011 PQQ: PQQ enzyme repea 32.3 48 0.001 15.4 1.9 12 12-23 14-25 (38)
76 PF08310 LGFP: LGFP repeat; I 30.6 58 0.0013 16.6 2.1 15 10-24 16-30 (54)
77 PF02239 Cytochrom_D1: Cytochr 30.4 2E+02 0.0044 20.4 7.9 57 12-69 43-100 (369)
78 PF11314 DUF3117: Protein of u 30.2 73 0.0016 16.6 2.4 19 25-43 17-35 (51)
79 PF02333 Phytase: Phytase; In 30.1 2.2E+02 0.0048 20.8 7.3 55 14-68 166-229 (381)
80 KOG3567|consensus 29.0 61 0.0013 24.5 2.6 56 8-67 404-485 (501)
81 PF01267 F-actin_cap_A: F-acti 28.7 1.4E+02 0.003 20.6 4.2 43 29-71 52-94 (271)
82 PF00930 DPPIV_N: Dipeptidyl p 25.6 2.3E+02 0.005 19.5 5.4 53 15-67 245-301 (353)
83 PRK04922 tolB translocation pr 25.0 2.6E+02 0.0056 19.9 7.5 55 11-66 297-355 (433)
84 PLN02553 inositol-phosphate ph 24.5 56 0.0012 21.8 1.7 15 10-24 121-135 (270)
85 cd01643 Bacterial_IMPase_like_ 24.4 59 0.0013 21.4 1.8 16 9-24 106-121 (242)
86 TIGR02067 his_9_proposed histi 24.2 62 0.0013 21.4 1.9 16 9-24 109-124 (251)
87 PF13804 HERV-K_env_2: Retro-t 23.3 2.1E+02 0.0046 18.6 4.1 18 51-68 147-164 (170)
88 PF02820 MBT: mbt repeat; Int 23.1 1.4E+02 0.0029 16.1 2.8 14 11-24 4-17 (73)
89 TIGR02647 DNA conserved hypoth 23.0 27 0.00059 19.8 -0.0 17 6-24 43-59 (77)
90 PF14884 EFF-AFF: Type I membr 22.9 1.5E+02 0.0032 23.1 3.7 31 25-55 371-401 (589)
91 smart00564 PQQ beta-propeller 22.7 85 0.0018 13.5 1.9 10 13-22 21-30 (33)
92 cd01248 PH_PLC Phospholipase C 22.6 1.7E+02 0.0037 16.9 4.2 23 45-67 19-41 (115)
93 cd00472 Ribosomal_L24e_L24 Rib 22.3 1.4E+02 0.0029 15.7 3.6 28 2-29 2-29 (54)
94 TIGR01331 bisphos_cysQ 3'(2'), 22.1 73 0.0016 21.0 1.9 16 9-24 111-126 (249)
95 PRK09439 PTS system glucose-sp 21.7 84 0.0018 20.2 2.0 14 8-21 48-61 (169)
96 cd01638 CysQ CysQ, a 3'-Phosph 21.5 72 0.0016 20.9 1.7 15 10-24 110-124 (242)
97 KOG0646|consensus 21.4 3.7E+02 0.008 20.4 5.8 21 46-66 217-237 (476)
98 PF09917 DUF2147: Uncharacteri 21.2 80 0.0017 18.4 1.8 15 8-22 70-84 (114)
99 COG5649 Uncharacterized conser 21.1 84 0.0018 19.4 1.8 15 3-17 75-90 (132)
100 PF14583 Pectate_lyase22: Olig 21.0 2.9E+02 0.0063 20.3 4.8 43 25-67 59-101 (386)
101 PRK10757 inositol monophosphat 21.0 75 0.0016 21.3 1.8 16 9-24 113-128 (267)
No 1
>KOG1214|consensus
Probab=99.64 E-value=3.4e-16 Score=118.32 Aligned_cols=66 Identities=18% Similarity=0.241 Sum_probs=63.5
Q ss_pred ccceEEeCCCCeEEEeC----CCcEEEEEcCCCCeEEEEeCCCceeeEEEEcCCCCeEEEEeCCCCeeeC
Q psy4903 8 SSGENRTREQALMFVAD----SNQILRTNMDGTMAMSIVSEAAYKASGVALDINAKRLFWCDNLLDYIET 73 (73)
Q Consensus 8 ~~g~~~dp~~g~lywtd----~~~I~ra~ldGs~~~~l~~~~~~~p~~lavD~~~~~LYW~D~~~~~ie~ 73 (73)
|+++|+||..|.||||| +++|+++.|||.++++++++++..|+||++|+.++.|+|+|.+..+.||
T Consensus 1113 PR~iv~D~~rgnLYwtDWnRenPkIets~mDG~NrRilin~DigLPNGLtfdpfs~~LCWvDAGt~rleC 1182 (1289)
T KOG1214|consen 1113 PRAIVVDPIRGNLYWTDWNRENPKIETSSMDGENRRILINTDIGLPNGLTFDPFSKLLCWVDAGTKRLEC 1182 (1289)
T ss_pred cceEEeecccCceeeccccccCCcceeeccCCccceEEeecccCCCCCceeCcccceeeEEecCCcceeE
Confidence 78889999999999999 6999999999999999999999999999999999999999999998885
No 2
>PF00058 Ldl_recept_b: Low-density lipoprotein receptor repeat class B; InterPro: IPR000033 The low-density lipoprotein receptor (LDLR) is the major cholesterol-carrying lipoprotein of plasma, acting to regulate cholesterol homeostasis in mammalian cells. The LDL receptor binds LDL and transports it into cells by acidic endocytosis. In order to be internalized, the receptor-ligand complex must first cluster into clathrin-coated pits. Once inside the cell, the LDLR separates from its ligand, which is degraded in the lysosomes, while the receptor returns to the cell surface []. The internal dissociation of the LDLR with its ligand is mediated by proton pumps within the walls of the endosome that lower the pH. The LDLR is a multi-domain protein, containing: The ligand-binding domain contains seven or eight 40-amino acid LDLR class A (cysteine-rich) repeats, each of which contains a coordinated calcium ion and six cysteine residues involved in disulphide bond formation []. Similar domains have been found in other extracellular and membrane proteins []. The second conserved region contains two EGF repeats, followed by six LDLR class B (YWTD) repeats, and another EGF repeat. The LDLR class B repeats each contain a conserved YWTD motif, and is predicted to form a beta-propeller structure []. This region is critical for ligand release and recycling of the receptor []. The third domain is rich in serine and threonine residues and contains clustered O-linked carbohydrate chains. The fourth domain is the hydrophobic transmembrane region. The fifth domain is the cytoplasmic tail that directs the receptor to clathrin-coated pits. LDLR is closely related in structure to several other receptors, including LRP1, LRP1b, megalin/LRP2, VLDL receptor, lipoprotein receptor, MEGF7/LRP4, and LRP8/apolipoprotein E receptor2); these proteins participate in a wide range of physiological processes, including the regulation of lipid metabolism, protection against atherosclerosis, neurodevelopment, and transport of nutrients and vitamins []. This entry represents the LDLR classB (YWTD) repeat, the structure of which has been solved []. The six YWTD repeats together fold into a six-bladed beta-propeller. Each blade of the propeller consists of four antiparallel beta-strands; the innermost strand of each blade is labeled 1 and the outermost strand, 4. The sequence repeats are offset with respect to the blades of the propeller, such that any given 40-residue YWTD repeat spans strands 24 of one propeller blade and strand 1 of the subsequent blade. This offset ensures circularization of the propeller because the last strand of the final sequence repeat acts as an innermost strand 1 of the blade that harbors strands 24 from the first sequence repeat. The repeat is found in a variety of proteins that include, vitellogenin receptor from Drosophila melanogaster, low-density lipoprotein (LDL) receptor [], preproepidermal growth factor, and nidogen (entactin).; PDB: 3S2K_A 3S8Z_A 3S8V_B 4A0P_A 3SOB_B 3S94_B 4DG6_A 3SOV_A 3SOQ_A 1NPE_A ....
Probab=99.58 E-value=8.9e-15 Score=74.66 Aligned_cols=39 Identities=26% Similarity=0.514 Sum_probs=36.8
Q ss_pred CeEEEeC--CC-cEEEEEcCCCCeEEEEeCCCceeeEEEEcC
Q psy4903 18 ALMFVAD--SN-QILRTNMDGTMAMSIVSEAAYKASGVALDI 56 (73)
Q Consensus 18 g~lywtd--~~-~I~ra~ldGs~~~~l~~~~~~~p~~lavD~ 56 (73)
|+|||+| .. +|++++|||+++++++.+++..|.||||||
T Consensus 1 ~~iYWtD~~~~~~I~~a~~dGs~~~~vi~~~l~~P~giaVD~ 42 (42)
T PF00058_consen 1 GKIYWTDWSQDPSIERANLDGSNRRTVISDDLQHPEGIAVDW 42 (42)
T ss_dssp TEEEEEETTTTEEEEEEETTSTSEEEEEESSTSSEEEEEEET
T ss_pred CEEEEEECCCCcEEEEEECCCCCeEEEEECCCCCcCEEEECC
Confidence 6899999 45 999999999999999999999999999996
No 3
>KOG1214|consensus
Probab=99.57 E-value=2.1e-15 Score=114.06 Aligned_cols=68 Identities=24% Similarity=0.487 Sum_probs=64.6
Q ss_pred eeccceEEeCCCCeEEEeC--CCcEEEEEcCCCCeEEEEeCCCceeeEEEEcCCCCeEEEEeCCCCeeeC
Q psy4903 6 KFSSGENRTREQALMFVAD--SNQILRTNMDGTMAMSIVSEAAYKASGVALDINAKRLFWCDNLLDYIET 73 (73)
Q Consensus 6 ~~~~g~~~dp~~g~lywtd--~~~I~ra~ldGs~~~~l~~~~~~~p~~lavD~~~~~LYW~D~~~~~ie~ 73 (73)
+++.||..|.+..++|||| .+.|.||+|+|+..++++++++.+|+||||||..+++||+|+.++.||.
T Consensus 1025 ~IiVGidfDC~e~mvyWtDv~g~SI~rasL~G~Ep~ti~n~~L~SPEGiAVDh~~Rn~ywtDS~lD~Iev 1094 (1289)
T KOG1214|consen 1025 SIIVGIDFDCRERMVYWTDVAGRSISRASLEGAEPETIVNSGLISPEGIAVDHIRRNMYWTDSVLDKIEV 1094 (1289)
T ss_pred ceeeeeecccccceEEEeecCCCccccccccCCCCceeecccCCCccceeeeeccceeeeeccccchhhe
Confidence 4578999999999999999 7999999999999999999999999999999999999999999999973
No 4
>KOG1215|consensus
Probab=99.21 E-value=1.5e-11 Score=93.36 Aligned_cols=66 Identities=30% Similarity=0.551 Sum_probs=61.1
Q ss_pred eccceEEeCCCCeEEEeC---CCcEEEEEcCCCCeEEEEeCCCceeeEEEEcCCCCeEEEEeCCCC-eee
Q psy4903 7 FSSGENRTREQALMFVAD---SNQILRTNMDGTMAMSIVSEAAYKASGVALDINAKRLFWCDNLLD-YIE 72 (73)
Q Consensus 7 ~~~g~~~dp~~g~lywtd---~~~I~ra~ldGs~~~~l~~~~~~~p~~lavD~~~~~LYW~D~~~~-~ie 72 (73)
-++++++||..|++||+| .++|+|+.|||+.+++++..++.+|+||++|...+++||+|.... .|+
T Consensus 524 ~~r~~~v~p~~g~~~wtd~~~~~~i~ra~~dg~~~~~l~~~~~~~p~glt~d~~~~~~yw~d~~~~~~i~ 593 (877)
T KOG1215|consen 524 LPRSIAVDPEKGLMFWTDWGQPPRIERASLDGSERAVLVTNGILWPNGLTIDYETDRLYWADAKLDYTIE 593 (877)
T ss_pred CccceeeccccCeeEEecCCCCchhhhhcCCCCCceEEEeCCccCCCcceEEeecceeEEEcccCCccee
Confidence 567889999999999999 368999999999999999999999999999999999999999988 554
No 5
>smart00135 LY Low-density lipoprotein-receptor YWTD domain. Type "B" repeats in low-density lipoprotein (LDL) receptor that plays a central role in mammalian cholesterol metabolism. Also present in a variety of molecules similar to gp300/megalin.
Probab=98.90 E-value=2.4e-09 Score=53.03 Aligned_cols=34 Identities=21% Similarity=0.264 Sum_probs=30.7
Q ss_pred eeeccceEEeCCCCeEEEeC--CCcEEEEEcCCCCe
Q psy4903 5 MKFSSGENRTREQALMFVAD--SNQILRTNMDGTMA 38 (73)
Q Consensus 5 ~~~~~g~~~dp~~g~lywtd--~~~I~ra~ldGs~~ 38 (73)
+..|.|+++||..++|||+| ...|+|+.+||+++
T Consensus 8 ~~~~~~la~d~~~~~lYw~D~~~~~I~~~~~~g~~~ 43 (43)
T smart00135 8 LGHPNGLAVDWIEGRLYWTDWGLDVIEVANLDGTNR 43 (43)
T ss_pred CCCcCEEEEeecCCEEEEEeCCCCEEEEEeCCCCCC
Confidence 45789999999999999999 69999999999863
No 6
>smart00135 LY Low-density lipoprotein-receptor YWTD domain. Type "B" repeats in low-density lipoprotein (LDL) receptor that plays a central role in mammalian cholesterol metabolism. Also present in a variety of molecules similar to gp300/megalin.
Probab=98.79 E-value=1.2e-08 Score=50.41 Aligned_cols=34 Identities=38% Similarity=0.589 Sum_probs=30.1
Q ss_pred EEEeCCCceeeEEEEcCCCCeEEEEeCCCCeeeC
Q psy4903 40 SIVSEAAYKASGVALDINAKRLFWCDNLLDYIET 73 (73)
Q Consensus 40 ~l~~~~~~~p~~lavD~~~~~LYW~D~~~~~ie~ 73 (73)
+++...+..|+|||+|+.+++|||+|.....|++
T Consensus 2 ~~~~~~~~~~~~la~d~~~~~lYw~D~~~~~I~~ 35 (43)
T smart00135 2 TLLSEGLGHPNGLAVDWIEGRLYWTDWGLDVIEV 35 (43)
T ss_pred EEEECCCCCcCEEEEeecCCEEEEEeCCCCEEEE
Confidence 4566789999999999999999999999988863
No 7
>KOG1215|consensus
Probab=98.68 E-value=2.5e-08 Score=76.03 Aligned_cols=62 Identities=24% Similarity=0.379 Sum_probs=55.8
Q ss_pred ccceEEeCCCCeEEEeC-CCcEEEEEcCCCCeEEEEeCCCceeeEEEEcCCCCeEEEEeCCCCeeeC
Q psy4903 8 SSGENRTREQALMFVAD-SNQILRTNMDGTMAMSIVSEAAYKASGVALDINAKRLFWCDNLLDYIET 73 (73)
Q Consensus 8 ~~g~~~dp~~g~lywtd-~~~I~ra~ldGs~~~~l~~~~~~~p~~lavD~~~~~LYW~D~~~~~ie~ 73 (73)
|+.++++|..+.|||+| ...|+ ||+.+..+....+.+|+|++||+..+++||+|.+.+.||+
T Consensus 1 pra~~~~p~~~~l~wtd~~~~i~----dg~~~~~~~~~~~~~~ng~~id~~~~~~y~~d~~~~~i~~ 63 (877)
T KOG1215|consen 1 PRAIALLPLEGLLFWTDWGANIE----DGGERKILEKEEFEWPNGLTIDLAWQRIYWADAKNDLIES 63 (877)
T ss_pred CceEEeccccceEEEecCCcccc----cCcceEEeeccceeCCCcceecchhheeeeccccCCceEE
Confidence 56889999999999999 33466 8888999999999999999999999999999999998874
No 8
>PLN02919 haloacid dehalogenase-like hydrolase family protein
Probab=98.10 E-value=2e-05 Score=61.96 Aligned_cols=67 Identities=22% Similarity=0.357 Sum_probs=54.4
Q ss_pred eeeccceEEeCCCCeEEEeC--CCcEEEEEcCCCCeEEEEeC-------------CCceeeEEEEcCCCCeEEEEeCCCC
Q psy4903 5 MKFSSGENRTREQALMFVAD--SNQILRTNMDGTMAMSIVSE-------------AAYKASGVALDINAKRLFWCDNLLD 69 (73)
Q Consensus 5 ~~~~~g~~~dp~~g~lywtd--~~~I~ra~ldGs~~~~l~~~-------------~~~~p~~lavD~~~~~LYW~D~~~~ 69 (73)
+.||.|+++|+.+|.+|.+| +.+|.+.+.+|.....+-.. .+..|.|+++|..++.||++|...+
T Consensus 567 l~~P~gvavd~~~g~lyVaDs~n~rI~v~d~~G~~i~~ig~~g~~G~~dG~~~~a~f~~P~GIavd~~gn~LYVaDt~n~ 646 (1057)
T PLN02919 567 LKFPGKLAIDLLNNRLFISDSNHNRIVVTDLDGNFIVQIGSTGEEGLRDGSFEDATFNRPQGLAYNAKKNLLYVADTENH 646 (1057)
T ss_pred CCCCceEEEECCCCeEEEEECCCCeEEEEeCCCCEEEEEccCCCcCCCCCchhccccCCCcEEEEeCCCCEEEEEeCCCc
Confidence 57899999999999999999 79999999998644333321 2567999999998889999998765
Q ss_pred ee
Q psy4903 70 YI 71 (73)
Q Consensus 70 ~i 71 (73)
+|
T Consensus 647 ~I 648 (1057)
T PLN02919 647 AL 648 (1057)
T ss_pred eE
Confidence 54
No 9
>PF08450 SGL: SMP-30/Gluconolaconase/LRE-like region; InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE. ; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=98.08 E-value=3.6e-05 Score=50.46 Aligned_cols=60 Identities=20% Similarity=0.222 Sum_probs=49.0
Q ss_pred eccceEEeCCCCeEEEeC--CCcEEEEEcCCCCeEEEEeCCCceeeEEEEcCCCCeEEEEeCCCC
Q psy4903 7 FSSGENRTREQALMFVAD--SNQILRTNMDGTMAMSIVSEAAYKASGVALDINAKRLFWCDNLLD 69 (73)
Q Consensus 7 ~~~g~~~dp~~g~lywtd--~~~I~ra~ldGs~~~~l~~~~~~~p~~lavD~~~~~LYW~D~~~~ 69 (73)
|+-|.+.||.+|.|||+| +.+|.|...++...+++.... |.|++++...++||.++....
T Consensus 1 l~Egp~~d~~~g~l~~~D~~~~~i~~~~~~~~~~~~~~~~~---~~G~~~~~~~g~l~v~~~~~~ 62 (246)
T PF08450_consen 1 LGEGPVWDPRDGRLYWVDIPGGRIYRVDPDTGEVEVIDLPG---PNGMAFDRPDGRLYVADSGGI 62 (246)
T ss_dssp CEEEEEEETTTTEEEEEETTTTEEEEEETTTTEEEEEESSS---EEEEEEECTTSEEEEEETTCE
T ss_pred CCcceEEECCCCEEEEEEcCCCEEEEEECCCCeEEEEecCC---CceEEEEccCCEEEEEEcCce
Confidence 467899999999999999 799999999886555433323 999999977799999987643
No 10
>PF03088 Str_synth: Strictosidine synthase; InterPro: IPR018119 This entry represents a conserved region found in strictosidine synthase (4.3.3.2 from EC), a key enzyme in alkaloid biosynthesis. It catalyses the Pictet-Spengler stereospecific condensation of tryptamine with secologanin to form strictosidine []. The structure of the native enzyme from the Indian medicinal plant Rauvolfia serpentina (Serpentwood) (Devilpepper) represents the first example of a six-bladed four-stranded beta-propeller fold from the plant kingdom [].; GO: 0016844 strictosidine synthase activity, 0009058 biosynthetic process; PDB: 2FPB_A 2V91_B 2FP8_A 3V1S_B 2FPC_A 2VAQ_A 2FP9_B.
Probab=98.01 E-value=3.8e-05 Score=44.83 Aligned_cols=62 Identities=15% Similarity=0.244 Sum_probs=45.2
Q ss_pred ceEEeCCCCeEEEeC-------------------CCcEEEEEcCCCCeEEEEeCCCceeeEEEEcCCCCeEEEEeCCCCe
Q psy4903 10 GENRTREQALMFVAD-------------------SNQILRTNMDGTMAMSIVSEAAYKASGVALDINAKRLFWCDNLLDY 70 (73)
Q Consensus 10 g~~~dp~~g~lywtd-------------------~~~I~ra~ldGs~~~~l~~~~~~~p~~lavD~~~~~LYW~D~~~~~ 70 (73)
++.+++..|.+|||| ++++.+-+.... +..++-.++..|+|+++...++.|..++....+
T Consensus 2 dldv~~~~g~vYfTdsS~~~~~~~~~~~~le~~~~GRll~ydp~t~-~~~vl~~~L~fpNGVals~d~~~vlv~Et~~~R 80 (89)
T PF03088_consen 2 DLDVDQDTGTVYFTDSSSRYDRRDWVYDLLEGRPTGRLLRYDPSTK-ETTVLLDGLYFPNGVALSPDESFVLVAETGRYR 80 (89)
T ss_dssp EEEE-TTT--EEEEES-SS--TTGHHHHHHHT---EEEEEEETTTT-EEEEEEEEESSEEEEEE-TTSSEEEEEEGGGTE
T ss_pred ceeEecCCCEEEEEeCccccCccceeeeeecCCCCcCEEEEECCCC-eEEEehhCCCccCeEEEcCCCCEEEEEeccCce
Confidence 688999999999998 257777666654 444555689999999999999999999887776
Q ss_pred ee
Q psy4903 71 IE 72 (73)
Q Consensus 71 ie 72 (73)
|-
T Consensus 81 i~ 82 (89)
T PF03088_consen 81 IL 82 (89)
T ss_dssp EE
T ss_pred EE
Confidence 63
No 11
>PF08450 SGL: SMP-30/Gluconolaconase/LRE-like region; InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE. ; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=97.88 E-value=0.00014 Score=47.70 Aligned_cols=68 Identities=21% Similarity=0.242 Sum_probs=50.7
Q ss_pred eeeeccceEEeCCCCeEEEeC--CCcEEEEEcCCCC-----eEEEEeC--CCceeeEEEEcCCCCeEEEEeCCCCeee
Q psy4903 4 KMKFSSGENRTREQALMFVAD--SNQILRTNMDGTM-----AMSIVSE--AAYKASGVALDINAKRLFWCDNLLDYIE 72 (73)
Q Consensus 4 ~~~~~~g~~~dp~~g~lywtd--~~~I~ra~ldGs~-----~~~l~~~--~~~~p~~lavD~~~~~LYW~D~~~~~ie 72 (73)
.+.+|.||+++|..+.||+++ ..+|.+..++... ++++..- ....|.|+++|. ..+||-++...+.|.
T Consensus 132 ~~~~pNGi~~s~dg~~lyv~ds~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~pDG~~vD~-~G~l~va~~~~~~I~ 208 (246)
T PF08450_consen 132 GLGFPNGIAFSPDGKTLYVADSFNGRIWRFDLDADGGELSNRRVFIDFPGGPGYPDGLAVDS-DGNLWVADWGGGRIV 208 (246)
T ss_dssp EESSEEEEEEETTSSEEEEEETTTTEEEEEEEETTTCCEEEEEEEEE-SSSSCEEEEEEEBT-TS-EEEEEETTTEEE
T ss_pred CcccccceEECCcchheeecccccceeEEEeccccccceeeeeeEEEcCCCCcCCCcceEcC-CCCEEEEEcCCCEEE
Confidence 466799999999999999999 6899999997432 3444432 223699999996 557898877666653
No 12
>PLN02919 haloacid dehalogenase-like hydrolase family protein
Probab=97.65 E-value=0.0004 Score=54.80 Aligned_cols=67 Identities=10% Similarity=0.173 Sum_probs=52.5
Q ss_pred eeeccceEEeCCCCeEEEeC--CCcEEEEEcCCCCeEEEEeC----------------CCceeeEEEEcCCCCeEEEEeC
Q psy4903 5 MKFSSGENRTREQALMFVAD--SNQILRTNMDGTMAMSIVSE----------------AAYKASGVALDINAKRLFWCDN 66 (73)
Q Consensus 5 ~~~~~g~~~dp~~g~lywtd--~~~I~ra~ldGs~~~~l~~~----------------~~~~p~~lavD~~~~~LYW~D~ 66 (73)
+..|.|+++||..+.||++| +.+|.+.++.+....++... .+..|.++++|..++.||.+|.
T Consensus 623 f~~P~GIavd~~gn~LYVaDt~n~~Ir~id~~~~~V~tlag~G~~g~~~~gg~~~~~~~ln~P~gVa~dp~~g~LyVad~ 702 (1057)
T PLN02919 623 FNRPQGLAYNAKKNLLYVADTENHALREIDFVNETVRTLAGNGTKGSDYQGGKKGTSQVLNSPWDVCFEPVNEKVYIAMA 702 (1057)
T ss_pred cCCCcEEEEeCCCCEEEEEeCCCceEEEEecCCCEEEEEeccCcccCCCCCChhhhHhhcCCCeEEEEecCCCeEEEEEC
Confidence 34589999999999999999 68888888766544444321 1567999999998999999998
Q ss_pred CCCee
Q psy4903 67 LLDYI 71 (73)
Q Consensus 67 ~~~~i 71 (73)
..++|
T Consensus 703 ~~~~I 707 (1057)
T PLN02919 703 GQHQI 707 (1057)
T ss_pred CCCeE
Confidence 76655
No 13
>PF06977 SdiA-regulated: SdiA-regulated; InterPro: IPR009722 This entry represents a conserved region approximately 100 residues long within a number of hypothetical bacterial proteins that may be regulated by SdiA, a member of the LuxR family of transcriptional regulators []. Some proteins contain the IPR001258 from INTERPRO repeat.; PDB: 3QQZ_A.
Probab=97.18 E-value=0.0018 Score=43.91 Aligned_cols=63 Identities=16% Similarity=0.247 Sum_probs=38.1
Q ss_pred eeeeeccceEEeCCCCeEEEeC--CCcEEEEEcCCCCeEEE--Ee------CCCceeeEEEEcCCCCeEEEEeC
Q psy4903 3 FKMKFSSGENRTREQALMFVAD--SNQILRTNMDGTMAMSI--VS------EAAYKASGVALDINAKRLFWCDN 66 (73)
Q Consensus 3 ~~~~~~~g~~~dp~~g~lywtd--~~~I~ra~ldGs~~~~l--~~------~~~~~p~~lavD~~~~~LYW~D~ 66 (73)
+.+.=+||+.+||.+|.+|--+ ++.|.....+|.-...+ .. ..+.+|+|+|+|. ..+||-+..
T Consensus 168 ~~~~d~S~l~~~p~t~~lliLS~es~~l~~~d~~G~~~~~~~L~~g~~gl~~~~~QpEGIa~d~-~G~LYIvsE 240 (248)
T PF06977_consen 168 LFVRDLSGLSYDPRTGHLLILSDESRLLLELDRQGRVVSSLSLDRGFHGLSKDIPQPEGIAFDP-DGNLYIVSE 240 (248)
T ss_dssp --SS---EEEEETTTTEEEEEETTTTEEEEE-TT--EEEEEE-STTGGG-SS---SEEEEEE-T-T--EEEEET
T ss_pred ceeccccceEEcCCCCeEEEEECCCCeEEEECCCCCEEEEEEeCCcccCcccccCCccEEEECC-CCCEEEEcC
Confidence 4455689999999999998887 68899988888744433 22 3567899999997 558887654
No 14
>KOG1520|consensus
Probab=97.10 E-value=0.0012 Score=47.29 Aligned_cols=67 Identities=13% Similarity=0.292 Sum_probs=50.7
Q ss_pred eeeeccceEEeCCCCeEEEeC------CCcEEEEEcCCC--C----------eEEEEeCCCceeeEEEEcCCCCeEEEEe
Q psy4903 4 KMKFSSGENRTREQALMFVAD------SNQILRTNMDGT--M----------AMSIVSEAAYKASGVALDINAKRLFWCD 65 (73)
Q Consensus 4 ~~~~~~g~~~dp~~g~lywtd------~~~I~ra~ldGs--~----------~~~l~~~~~~~p~~lavD~~~~~LYW~D 65 (73)
+++|..++.+|| +|-+|||| .....-|.+.|. + ...++-.+++-|+|+|+-+.+..+-.|+
T Consensus 159 ~~kf~N~ldI~~-~g~vyFTDSSsk~~~rd~~~a~l~g~~~GRl~~YD~~tK~~~VLld~L~F~NGlaLS~d~sfvl~~E 237 (376)
T KOG1520|consen 159 PFKFLNDLDIDP-EGVVYFTDSSSKYDRRDFVFAALEGDPTGRLFRYDPSTKVTKVLLDGLYFPNGLALSPDGSFVLVAE 237 (376)
T ss_pred eeeecCceeEcC-CCeEEEeccccccchhheEEeeecCCCccceEEecCcccchhhhhhcccccccccCCCCCCEEEEEe
Confidence 678999999999 99999999 145666666662 1 2224446899999999998898887777
Q ss_pred CCCCee
Q psy4903 66 NLLDYI 71 (73)
Q Consensus 66 ~~~~~i 71 (73)
....+|
T Consensus 238 t~~~ri 243 (376)
T KOG1520|consen 238 TTTARI 243 (376)
T ss_pred ecccee
Confidence 765554
No 15
>PF03022 MRJP: Major royal jelly protein; InterPro: IPR003534 The major royal jelly proteins (MRJPs) comprise 12.5% of the mass, and 82-90% of the protein content [], of honeybee (Apis mellifera) royal jelly. Royal jelly is a substance secreted by the cephalic glands of nurse bees [] and it is used to trigger development of a queen bee from a bee larva. The biological function of the MRJPs is unknown, but they are believed to play a major role in nutrition due to their high essential amino acid content []. Two royal jelly proteins, MRJP3 and MRJP5, contain a tandem repeat that results from a high genetic variablility. This polymorphism may be useful for genotyping individual bees [].; PDB: 3Q6P_B 3Q6K_A 3Q6T_A 2QE8_B.
Probab=97.03 E-value=0.0051 Score=42.20 Aligned_cols=60 Identities=18% Similarity=0.142 Sum_probs=44.6
Q ss_pred eccceEEeCCCCeEEEeC--CCcEEEEEcCC----CCeEEEEe-CC-CceeeEEEEcC-CCCeEEEEeCC
Q psy4903 7 FSSGENRTREQALMFVAD--SNQILRTNMDG----TMAMSIVS-EA-AYKASGVALDI-NAKRLFWCDNL 67 (73)
Q Consensus 7 ~~~g~~~dp~~g~lywtd--~~~I~ra~ldG----s~~~~l~~-~~-~~~p~~lavD~-~~~~LYW~D~~ 67 (73)
-+.|++.|+ +|.||+++ +..|.+.+.++ .+..+|.. .. +.+|.+++++. ..+.||..-..
T Consensus 187 ~s~g~~~D~-~G~ly~~~~~~~aI~~w~~~~~~~~~~~~~l~~d~~~l~~pd~~~i~~~~~g~L~v~snr 255 (287)
T PF03022_consen 187 QSDGMAIDP-NGNLYFTDVEQNAIGCWDPDGPYTPENFEILAQDPRTLQWPDGLKIDPEGDGYLWVLSNR 255 (287)
T ss_dssp SECEEEEET-TTEEEEEECCCTEEEEEETTTSB-GCCEEEEEE-CC-GSSEEEEEE-T--TS-EEEEE-S
T ss_pred CCceEEECC-CCcEEEecCCCCeEEEEeCCCCcCccchheeEEcCceeeccceeeeccccCceEEEEECc
Confidence 568999999 99999999 68999999998 34444543 44 89999999996 35677776544
No 16
>COG3386 Gluconolactonase [Carbohydrate transport and metabolism]
Probab=97.00 E-value=0.0032 Score=43.80 Aligned_cols=58 Identities=24% Similarity=0.440 Sum_probs=44.0
Q ss_pred eeeccceEEeCCCCeEEEeC--CCcEEEEEcC---CC--Ce--EEEEeCCCceeeEEEEcCCCCeEEEE
Q psy4903 5 MKFSSGENRTREQALMFVAD--SNQILRTNMD---GT--MA--MSIVSEAAYKASGVALDINAKRLFWC 64 (73)
Q Consensus 5 ~~~~~g~~~dp~~g~lywtd--~~~I~ra~ld---Gs--~~--~~l~~~~~~~p~~lavD~~~~~LYW~ 64 (73)
+.++.||+.+|....||++| ..+|.|..+| |. ++ .+.....-..|.|+++|-.+ .||+
T Consensus 162 ~~~~NGla~SpDg~tly~aDT~~~~i~r~~~d~~~g~~~~~~~~~~~~~~~G~PDG~~vDadG--~lw~ 228 (307)
T COG3386 162 LTIPNGLAFSPDGKTLYVADTPANRIHRYDLDPATGPIGGRRGFVDFDEEPGLPDGMAVDADG--NLWV 228 (307)
T ss_pred EEecCceEECCCCCEEEEEeCCCCeEEEEecCcccCccCCcceEEEccCCCCCCCceEEeCCC--CEEE
Confidence 67899999999999999999 6999999998 32 22 23333345789999999444 4563
No 17
>COG3386 Gluconolactonase [Carbohydrate transport and metabolism]
Probab=96.90 E-value=0.0064 Score=42.31 Aligned_cols=60 Identities=17% Similarity=0.321 Sum_probs=44.6
Q ss_pred eEEeCCCCeEEEeC-C------------CcEEEEEcCCCCeEEEEeCCCceeeEEEEcCCCCeEEEEeCCCCeee
Q psy4903 11 ENRTREQALMFVAD-S------------NQILRTNMDGTMAMSIVSEAAYKASGVALDINAKRLFWCDNLLDYIE 72 (73)
Q Consensus 11 ~~~dp~~g~lywtd-~------------~~I~ra~ldGs~~~~l~~~~~~~p~~lavD~~~~~LYW~D~~~~~ie 72 (73)
.++||. |.+|+++ + ..++|..-+| ....++...+..|+|||+++.++.+|++|...++|.
T Consensus 116 ~~v~pd-G~~wfgt~~~~~~~~~~~~~~G~lyr~~p~g-~~~~l~~~~~~~~NGla~SpDg~tly~aDT~~~~i~ 188 (307)
T COG3386 116 GVVDPD-GRIWFGDMGYFDLGKSEERPTGSLYRVDPDG-GVVRLLDDDLTIPNGLAFSPDGKTLYVADTPANRIH 188 (307)
T ss_pred eeEcCC-CCEEEeCCCccccCccccCCcceEEEEcCCC-CEEEeecCcEEecCceEECCCCCEEEEEeCCCCeEE
Confidence 345565 6667665 3 3577777644 455566667999999999999999999999887764
No 18
>TIGR02604 Piru_Ver_Nterm putative membrane-bound dehydrogenase domain. All proteins that score above the trusted cutoff score of 45 to this model are large proteins of either Pirellula sp. 1 or Verrucomicrobium spinosum. These proteins all contain, in addition to this domain, several hundred residues of highly variable sequence, and then a well-conserved C-terminal domain (TIGR02603) that features a putative cytochrome c-type heme binding motif CXXCH. The membrane-bound L-sorbosone dehydrogenase from Acetobacter liquefaciens (Gluconacetobacter liquefaciens) is homologous to this domain but lacks additional sequence regions shared by members of this family and belongs to a different clade of the larger family of homologs. It and its closely related homologs are excluded from the this model by scoring between the trusted (45) and noise (18) cutoffs.
Probab=96.65 E-value=0.0074 Score=42.27 Aligned_cols=57 Identities=19% Similarity=0.314 Sum_probs=45.3
Q ss_pred ccceEEeCCCCeEEEeCC---------------------CcEEEEEcCCCCeEEEEeCCCceeeEEEEcCCCCeEEEEeC
Q psy4903 8 SSGENRTREQALMFVADS---------------------NQILRTNMDGTMAMSIVSEAAYKASGVALDINAKRLFWCDN 66 (73)
Q Consensus 8 ~~g~~~dp~~g~lywtd~---------------------~~I~ra~ldGs~~~~l~~~~~~~p~~lavD~~~~~LYW~D~ 66 (73)
+.|++++| .|+||+++. ..|.|.+-||+..+ ++...+..|.|+++|. ...||.+|.
T Consensus 126 ~~~l~~gp-DG~LYv~~G~~~~~~~~~~~~~~~~~~~~~g~i~r~~pdg~~~e-~~a~G~rnp~Gl~~d~-~G~l~~tdn 202 (367)
T TIGR02604 126 LNSLAWGP-DGWLYFNHGNTLASKVTRPGTSDESRQGLGGGLFRYNPDGGKLR-VVAHGFQNPYGHSVDS-WGDVFFCDN 202 (367)
T ss_pred ccCceECC-CCCEEEecccCCCceeccCCCccCcccccCceEEEEecCCCeEE-EEecCcCCCccceECC-CCCEEEEcc
Confidence 56888888 689999761 35899999987766 4567899999999997 567888887
Q ss_pred C
Q psy4903 67 L 67 (73)
Q Consensus 67 ~ 67 (73)
.
T Consensus 203 ~ 203 (367)
T TIGR02604 203 D 203 (367)
T ss_pred C
Confidence 4
No 19
>KOG4659|consensus
Probab=96.59 E-value=0.0056 Score=49.87 Aligned_cols=63 Identities=17% Similarity=0.242 Sum_probs=45.3
Q ss_pred eeeeccceEEeCCCCeEEEeCCCcEEEEEcCCCCe------------------EEEEeCCCceeeEEEEcCCCCeEEEEe
Q psy4903 4 KMKFSSGENRTREQALMFVADSNQILRTNMDGTMA------------------MSIVSEAAYKASGVALDINAKRLFWCD 65 (73)
Q Consensus 4 ~~~~~~g~~~dp~~g~lywtd~~~I~ra~ldGs~~------------------~~l~~~~~~~p~~lavD~~~~~LYW~D 65 (73)
+.-+|+|+++|. .|.||+.|...|-...-+|--. +.|..-.++||..||||+..+.||-.|
T Consensus 473 ~L~~PkGIa~dk-~g~lYfaD~t~IR~iD~~giIstlig~~~~~~~p~~C~~~~kl~~~~leWPT~LaV~Pmdnsl~Vld 551 (1899)
T KOG4659|consen 473 QLIFPKGIAFDK-MGNLYFADGTRIRVIDTTGIISTLIGTTPDQHPPRTCAQITKLVDLQLEWPTSLAVDPMDNSLLVLD 551 (1899)
T ss_pred eeccCCceeEcc-CCcEEEecccEEEEeccCceEEEeccCCCCccCccccccccchhheeeecccceeecCCCCeEEEee
Confidence 456899999996 6799999955555544443211 112222788999999999999999888
Q ss_pred CC
Q psy4903 66 NL 67 (73)
Q Consensus 66 ~~ 67 (73)
..
T Consensus 552 ~n 553 (1899)
T KOG4659|consen 552 TN 553 (1899)
T ss_pred cc
Confidence 63
No 20
>PF01436 NHL: NHL repeat; InterPro: IPR001258 The NHL repeat, named after NCL-1, HT2A and Lin-41, is found largely in a large number of eukaryotic and prokaryotic proteins. For example, the repeat is found in a variety of enzymes of the copper type II, ascorbate-dependent monooxygenase family which catalyse the C terminus alpha-amidation of biological peptides []. In many it occurs in tandem arrays, for example in the ringfinger beta-box, coiled-coil (RBCC) eukaryotic growth regulators []. The 'Brain Tumor' protein (Brat) is one such growth regulator that contains a 6-bladed NHL-repeat beta-propeller [, ]. The NHL repeats are also found in serine/threonine protein kinase (STPK) in diverse range of pathogenic bacteria. These STPK are transmembrane receptors with a intracellular N-terminal kinase domain and extracellular C-terminal sensor domain. In the STPK, PknD, from Mycobacterium tuberculosis, the sensor domain forms a rigid, six-bladed b-propeller composed of NHL repeats with a flexible tether to the transmembrane domain.; GO: 0005515 protein binding; PDB: 3FVZ_A 3FW0_A 1RWL_A 1RWI_A 1Q7F_A.
Probab=96.53 E-value=0.0087 Score=27.44 Aligned_cols=25 Identities=20% Similarity=0.251 Sum_probs=21.8
Q ss_pred CceeeEEEEcCCCCeEEEEeCCCCee
Q psy4903 46 AYKASGVALDINAKRLFWCDNLLDYI 71 (73)
Q Consensus 46 ~~~p~~lavD~~~~~LYW~D~~~~~i 71 (73)
+..|.|+++| ..++||-+|....+|
T Consensus 1 f~~P~gvav~-~~g~i~VaD~~n~rV 25 (28)
T PF01436_consen 1 FNYPHGVAVD-SDGNIYVADSGNHRV 25 (28)
T ss_dssp BSSEEEEEEE-TTSEEEEEECCCTEE
T ss_pred CcCCcEEEEe-CCCCEEEEECCCCEE
Confidence 3579999999 888999999988776
No 21
>PF07995 GSDH: Glucose / Sorbosone dehydrogenase; InterPro: IPR012938 Proteins containing this domain are thought to be glucose/sorbosone dehydrogenases. The best characterised of these proteins is soluble glucose dehydrogenase (P13650 from SWISSPROT) from Acinetobacter calcoaceticus, which oxidises glucose to gluconolactone. The enzyme is a calcium-dependent homodimer which uses PQQ as a cofactor [].; GO: 0016901 oxidoreductase activity, acting on the CH-OH group of donors, quinone or similar compound as acceptor, 0048038 quinone binding, 0005975 carbohydrate metabolic process; PDB: 2ISM_A 2WG3_D 3HO5_A 3HO4_A 3HO3_A 2WFT_A 2WG4_B 2WFX_B 1CRU_A 1CQ1_B ....
Probab=96.51 E-value=0.0078 Score=41.81 Aligned_cols=62 Identities=27% Similarity=0.336 Sum_probs=47.0
Q ss_pred ccceEEeCCCCeEEEeC---------------CCcEEEEEcCCCC------------eEEEEeCCCceeeEEEEcCCCCe
Q psy4903 8 SSGENRTREQALMFVAD---------------SNQILRTNMDGTM------------AMSIVSEAAYKASGVALDINAKR 60 (73)
Q Consensus 8 ~~g~~~dp~~g~lywtd---------------~~~I~ra~ldGs~------------~~~l~~~~~~~p~~lavD~~~~~ 60 (73)
..+|+++| .|+||++- ..+|.|.+.||+- ...++..++.+|.++++|+.+..
T Consensus 116 g~~l~fgp-DG~LYvs~G~~~~~~~~~~~~~~~G~ilri~~dG~~p~dnP~~~~~~~~~~i~A~GlRN~~~~~~d~~tg~ 194 (331)
T PF07995_consen 116 GGGLAFGP-DGKLYVSVGDGGNDDNAQDPNSLRGKILRIDPDGSIPADNPFVGDDGADSEIYAYGLRNPFGLAFDPNTGR 194 (331)
T ss_dssp EEEEEE-T-TSEEEEEEB-TTTGGGGCSTTSSTTEEEEEETTSSB-TTSTTTTSTTSTTTEEEE--SEEEEEEEETTTTE
T ss_pred CccccCCC-CCcEEEEeCCCCCcccccccccccceEEEecccCcCCCCCccccCCCceEEEEEeCCCccccEEEECCCCc
Confidence 35688888 66999983 3799999999971 34567889999999999999999
Q ss_pred EEEEeCCCCe
Q psy4903 61 LFWCDNLLDY 70 (73)
Q Consensus 61 LYW~D~~~~~ 70 (73)
||-+|.+.+.
T Consensus 195 l~~~d~G~~~ 204 (331)
T PF07995_consen 195 LWAADNGPDG 204 (331)
T ss_dssp EEEEEE-SSS
T ss_pred EEEEccCCCC
Confidence 9999976553
No 22
>KOG4659|consensus
Probab=96.14 E-value=0.016 Score=47.37 Aligned_cols=61 Identities=18% Similarity=0.283 Sum_probs=44.4
Q ss_pred eccceEEeCCCCeEEEeC--CCcEEEEE-cCCC----CeEEEEeC--------------------CCceeeEEEEcCCCC
Q psy4903 7 FSSGENRTREQALMFVAD--SNQILRTN-MDGT----MAMSIVSE--------------------AAYKASGVALDINAK 59 (73)
Q Consensus 7 ~~~g~~~dp~~g~lywtd--~~~I~ra~-ldGs----~~~~l~~~--------------------~~~~p~~lavD~~~~ 59 (73)
++==+|++|+.|.||-+| +.+|.|.. +.+. +.+++..+ .+..|.|||+| ..+
T Consensus 408 h~Yy~AvsPvdgtlyvSdp~s~qv~rv~sl~~~d~~~N~evvaG~Ge~Clp~desCGDGalA~dA~L~~PkGIa~d-k~g 486 (1899)
T KOG4659|consen 408 HSYYIAVSPVDGTLYVSDPLSKQVWRVSSLEPQDSRNNYEVVAGDGEVCLPADESCGDGALAQDAQLIFPKGIAFD-KMG 486 (1899)
T ss_pred ceeEEEecCcCceEEecCCCcceEEEeccCCccccccCeeEEeccCcCccccccccCcchhcccceeccCCceeEc-cCC
Confidence 334478999999999999 68888854 3332 23444321 57789999999 566
Q ss_pred eEEEEeCCC
Q psy4903 60 RLFWCDNLL 68 (73)
Q Consensus 60 ~LYW~D~~~ 68 (73)
.||++|...
T Consensus 487 ~lYfaD~t~ 495 (1899)
T KOG4659|consen 487 NLYFADGTR 495 (1899)
T ss_pred cEEEecccE
Confidence 899999864
No 23
>PF00058 Ldl_recept_b: Low-density lipoprotein receptor repeat class B; InterPro: IPR000033 The low-density lipoprotein receptor (LDLR) is the major cholesterol-carrying lipoprotein of plasma, acting to regulate cholesterol homeostasis in mammalian cells. The LDL receptor binds LDL and transports it into cells by acidic endocytosis. In order to be internalized, the receptor-ligand complex must first cluster into clathrin-coated pits. Once inside the cell, the LDLR separates from its ligand, which is degraded in the lysosomes, while the receptor returns to the cell surface []. The internal dissociation of the LDLR with its ligand is mediated by proton pumps within the walls of the endosome that lower the pH. The LDLR is a multi-domain protein, containing: The ligand-binding domain contains seven or eight 40-amino acid LDLR class A (cysteine-rich) repeats, each of which contains a coordinated calcium ion and six cysteine residues involved in disulphide bond formation []. Similar domains have been found in other extracellular and membrane proteins []. The second conserved region contains two EGF repeats, followed by six LDLR class B (YWTD) repeats, and another EGF repeat. The LDLR class B repeats each contain a conserved YWTD motif, and is predicted to form a beta-propeller structure []. This region is critical for ligand release and recycling of the receptor []. The third domain is rich in serine and threonine residues and contains clustered O-linked carbohydrate chains. The fourth domain is the hydrophobic transmembrane region. The fifth domain is the cytoplasmic tail that directs the receptor to clathrin-coated pits. LDLR is closely related in structure to several other receptors, including LRP1, LRP1b, megalin/LRP2, VLDL receptor, lipoprotein receptor, MEGF7/LRP4, and LRP8/apolipoprotein E receptor2); these proteins participate in a wide range of physiological processes, including the regulation of lipid metabolism, protection against atherosclerosis, neurodevelopment, and transport of nutrients and vitamins []. This entry represents the LDLR classB (YWTD) repeat, the structure of which has been solved []. The six YWTD repeats together fold into a six-bladed beta-propeller. Each blade of the propeller consists of four antiparallel beta-strands; the innermost strand of each blade is labeled 1 and the outermost strand, 4. The sequence repeats are offset with respect to the blades of the propeller, such that any given 40-residue YWTD repeat spans strands 24 of one propeller blade and strand 1 of the subsequent blade. This offset ensures circularization of the propeller because the last strand of the final sequence repeat acts as an innermost strand 1 of the blade that harbors strands 24 from the first sequence repeat. The repeat is found in a variety of proteins that include, vitellogenin receptor from Drosophila melanogaster, low-density lipoprotein (LDL) receptor [], preproepidermal growth factor, and nidogen (entactin).; PDB: 3S2K_A 3S8Z_A 3S8V_B 4A0P_A 3SOB_B 3S94_B 4DG6_A 3SOV_A 3SOQ_A 1NPE_A ....
Probab=95.65 E-value=0.011 Score=29.55 Aligned_cols=15 Identities=33% Similarity=0.685 Sum_probs=12.7
Q ss_pred CeEEEEeCCCC-eeeC
Q psy4903 59 KRLFWCDNLLD-YIET 73 (73)
Q Consensus 59 ~~LYW~D~~~~-~ie~ 73 (73)
++|||+|...+ .|++
T Consensus 1 ~~iYWtD~~~~~~I~~ 16 (42)
T PF00058_consen 1 GKIYWTDWSQDPSIER 16 (42)
T ss_dssp TEEEEEETTTTEEEEE
T ss_pred CEEEEEECCCCcEEEE
Confidence 58999999999 8763
No 24
>PF10282 Lactonase: Lactonase, 7-bladed beta-propeller; InterPro: IPR019405 6-phosphogluconolactonases (6PGL) 3.1.1.31 from EC, which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate is opne of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE IPR022528 from INTERPRO [] and the Pseudomonas aeruginosa DevB IPR005900 from INTERPRO [] types. This entry contains bacterial 6-phosphogluconolactonases (6PGL) YbhE-type 3.1.1.31 from EC which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase 5.5.1.5 from EC and muconate cycloisomerase 5.5.1.1 from EC, which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold [].; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H.
Probab=95.45 E-value=0.18 Score=34.91 Aligned_cols=66 Identities=12% Similarity=0.201 Sum_probs=49.2
Q ss_pred eccceEEeCCCCeEEEeC--CCcEEEEEcCCC-CeEEEE---eCCCceeeEEEEcCCCCeEEEEeCCCCeee
Q psy4903 7 FSSGENRTREQALMFVAD--SNQILRTNMDGT-MAMSIV---SEAAYKASGVALDINAKRLFWCDNLLDYIE 72 (73)
Q Consensus 7 ~~~g~~~dp~~g~lywtd--~~~I~ra~ldGs-~~~~l~---~~~~~~p~~lavD~~~~~LYW~D~~~~~ie 72 (73)
.+.+++++|...+||-++ ...|..-.+|+. +....+ ...-..|.++++|+.++.||-+....+.|.
T Consensus 246 ~~~~i~ispdg~~lyvsnr~~~sI~vf~~d~~~g~l~~~~~~~~~G~~Pr~~~~s~~g~~l~Va~~~s~~v~ 317 (345)
T PF10282_consen 246 APAEIAISPDGRFLYVSNRGSNSISVFDLDPATGTLTLVQTVPTGGKFPRHFAFSPDGRYLYVANQDSNTVS 317 (345)
T ss_dssp SEEEEEE-TTSSEEEEEECTTTEEEEEEECTTTTTEEEEEEEEESSSSEEEEEE-TTSSEEEEEETTTTEEE
T ss_pred CceeEEEecCCCEEEEEeccCCEEEEEEEecCCCceEEEEEEeCCCCCccEEEEeCCCCEEEEEecCCCeEE
Confidence 577899999999999998 578888888653 233332 233456999999999999999998877664
No 25
>PF01731 Arylesterase: Arylesterase; InterPro: IPR002640 The serum paraoxonases/arylesterases are enzymes that catalyse the hydrolysis of the toxic metabolites of a variety of organophosphorus insecticides. The enzymes hydrolyse a broad spectrum of organophosphate substrates, including paraoxon and a number of aromatic carboxylic acid esters (e.g., phenyl acetate), and hence confer resistance to organophosphate toxicity []. Mammals have 3 distinct paraoxonase types, termed PON1-3 [, ]. In mice and humans, the PON genes are found on the same chromosome in close proximity. PON activity has been found in variety of tissues, with highest levels in liver and serum - the source of serum PON is thought to be the liver. Unlike mammals, fish and avian species lack paraoxonase activity. Human and rabbit PONs appear to have two distinct Ca2+ binding sites, one required for stability and one required for catalytic activity. The Ca2+ dependency of PONs suggests a mechanism of hydrolysis where Ca2+ acts as the electrophillic catalyst, like that proposed for phospholipase A2. The paraoxonase enzymes, PON1 and PON3, are high density lipoprotein (HDL)- associated proteins capable of preventing oxidative modification of low density lipoproteins (LPL) []. Although PON2 has oxidative properties, the enzyme does not associate with HDL. Within a given species, PON1, PON2 and PON3 share ~60% amino acid sequence identity, whereas between mammalian species particular PONs (1,2 or 3) share 79-90% identity at the amino acid level. Human PON1 and PON3 share numerous conserved phosphorylation and N-glycosylation sites; however, it is not known whether the PON proteins are modified at these sites, or whether modification at these sites is required for activity in vivo []. This family consists of arylesterases (Also known as serum paraoxonase) 3.1.1.2 from EC. These enzymes hydrolyse organophosphorus esters such as paraoxon and are found in the liver and blood. They confer resistance to organophosphate toxicity []. Human arylesterase (PON1) P27169 from SWISSPROT is associated with HDL and may protect against LDL oxidation [].; GO: 0004064 arylesterase activity
Probab=95.25 E-value=0.07 Score=30.85 Aligned_cols=38 Identities=26% Similarity=0.367 Sum_probs=29.7
Q ss_pred cCCCCeEEEEeCCCceeeEEEEcCCCCeEEEEeCCCCee
Q psy4903 33 MDGTMAMSIVSEAAYKASGVALDINAKRLFWCDNLLDYI 71 (73)
Q Consensus 33 ldGs~~~~l~~~~~~~p~~lavD~~~~~LYW~D~~~~~i 71 (73)
.||+... .+.+++..|+||++|+..+.||-++.....|
T Consensus 41 yd~~~~~-~va~g~~~aNGI~~s~~~k~lyVa~~~~~~I 78 (86)
T PF01731_consen 41 YDGKEVK-VVASGFSFANGIAISPDKKYLYVASSLAHSI 78 (86)
T ss_pred EeCCEeE-EeeccCCCCceEEEcCCCCEEEEEeccCCeE
Confidence 3454443 3456899999999999999999999877665
No 26
>TIGR03606 non_repeat_PQQ dehydrogenase, PQQ-dependent, s-GDH family. PQQ, or pyrroloquinoline-quinone, serves as a cofactor for a number of sugar and alcohol dehydrogenases in a limited number of bacterial species. Most characterized PQQ-dependent enzymes have multiple repeats of a sequence region described by pfam01011 (PQQ enzyme repeat), but this protein family in unusual in lacking that repeat. Below the noise cutoff are related proteins mostly from species that lack PQQ biosynthesis.
Probab=95.13 E-value=0.11 Score=38.18 Aligned_cols=42 Identities=21% Similarity=0.379 Sum_probs=35.6
Q ss_pred cEEEEEcCCCC----------eEEEEeCCCceeeEEEEcCCCCeEEEEeCCCC
Q psy4903 27 QILRTNMDGTM----------AMSIVSEAAYKASGVALDINAKRLFWCDNLLD 69 (73)
Q Consensus 27 ~I~ra~ldGs~----------~~~l~~~~~~~p~~lavD~~~~~LYW~D~~~~ 69 (73)
+|.|.+.||+- +..|...++.+|-|+++|+ ...||.+|.+.+
T Consensus 200 kILRin~DGsiP~dNPf~~g~~~eIyA~G~RNp~Gla~dp-~G~Lw~~e~Gp~ 251 (454)
T TIGR03606 200 KVLRLNLDGSIPKDNPSINGVVSHIFTYGHRNPQGLAFTP-DGTLYASEQGPN 251 (454)
T ss_pred EEEEEcCCCCCCCCCCccCCCcceEEEEeccccceeEECC-CCCEEEEecCCC
Confidence 69999999972 4568888999999999999 789999998753
No 27
>KOG1520|consensus
Probab=94.82 E-value=0.097 Score=37.69 Aligned_cols=60 Identities=17% Similarity=0.236 Sum_probs=47.4
Q ss_pred ccceEEeCCCCeEEEeC-CCcEEEEEcCCCCeEEEEeC----CCceeeEEEEcCCCCeEEEEeCCC
Q psy4903 8 SSGENRTREQALMFVAD-SNQILRTNMDGTMAMSIVSE----AAYKASGVALDINAKRLFWCDNLL 68 (73)
Q Consensus 8 ~~g~~~dp~~g~lywtd-~~~I~ra~ldGs~~~~l~~~----~~~~p~~lavD~~~~~LYW~D~~~ 68 (73)
|=||+.+.+.|-+|-+| ---+....-+|...+.+..+ .+...+++.||. +..+||+|...
T Consensus 117 PLGl~f~~~ggdL~VaDAYlGL~~V~p~g~~a~~l~~~~~G~~~kf~N~ldI~~-~g~vyFTDSSs 181 (376)
T KOG1520|consen 117 PLGIRFDKKGGDLYVADAYLGLLKVGPEGGLAELLADEAEGKPFKFLNDLDIDP-EGVVYFTDSSS 181 (376)
T ss_pred cceEEeccCCCeEEEEecceeeEEECCCCCcceeccccccCeeeeecCceeEcC-CCeEEEecccc
Confidence 56999999999999999 45677777777665555544 455578999998 99999999864
No 28
>PF10282 Lactonase: Lactonase, 7-bladed beta-propeller; InterPro: IPR019405 6-phosphogluconolactonases (6PGL) 3.1.1.31 from EC, which hydrolyses 6-phosphogluconolactone to 6-phosphogluconate is opne of the enzymes in the pentose phosphate pathway. Two families of structurally dissimilar 6PGLs are known to exist: the Escherichia coli (strain K12) YbhE IPR022528 from INTERPRO [] and the Pseudomonas aeruginosa DevB IPR005900 from INTERPRO [] types. This entry contains bacterial 6-phosphogluconolactonases (6PGL) YbhE-type 3.1.1.31 from EC which hydrolyse 6-phosphogluconolactone to 6-phosphogluconate. The entry also contains the fungal muconate lactonizing enzyme carboxy-cis,cis-muconate cyclase 5.5.1.5 from EC and muconate cycloisomerase 5.5.1.1 from EC, which convert cis,cis-muconates to muconolactones and vice versa as part of the microbial beta-ketoadipate pathway. Structures have been reported for the E. coli 6-phosphogluconolactonase and Neurospora crassa muconate cycloisomerase. Structures of proteins in this family have revealed a 7-bladed beta-propeller fold [].; PDB: 3SCY_A 1L0Q_A 3HFQ_B 3FGB_A 1RI6_A 3U4Y_A 3BWS_A 1JOF_H.
Probab=94.60 E-value=0.41 Score=33.13 Aligned_cols=64 Identities=11% Similarity=0.166 Sum_probs=45.9
Q ss_pred ccceEEeCCCCeEEEeC--CCcEEEEEcC-CCCeEEEEeC---------CCceeeEEEEcCCCCeEEEEeCCCCee
Q psy4903 8 SSGENRTREQALMFVAD--SNQILRTNMD-GTMAMSIVSE---------AAYKASGVALDINAKRLFWCDNLLDYI 71 (73)
Q Consensus 8 ~~g~~~dp~~g~lywtd--~~~I~ra~ld-Gs~~~~l~~~---------~~~~p~~lavD~~~~~LYW~D~~~~~i 71 (73)
|+.++++|...++|-.+ ...|....++ .......+.. .-..|.+|++++.++.||-+....+.|
T Consensus 194 PRh~~f~pdg~~~Yv~~e~s~~v~v~~~~~~~g~~~~~~~~~~~~~~~~~~~~~~~i~ispdg~~lyvsnr~~~sI 269 (345)
T PF10282_consen 194 PRHLAFSPDGKYAYVVNELSNTVSVFDYDPSDGSLTEIQTISTLPEGFTGENAPAEIAISPDGRFLYVSNRGSNSI 269 (345)
T ss_dssp EEEEEE-TTSSEEEEEETTTTEEEEEEEETTTTEEEEEEEEESCETTSCSSSSEEEEEE-TTSSEEEEEECTTTEE
T ss_pred CcEEEEcCCcCEEEEecCCCCcEEEEeecccCCceeEEEEeeeccccccccCCceeEEEecCCCEEEEEeccCCEE
Confidence 45689999888999998 6888888888 3332222221 112699999999999999999988776
No 29
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=93.38 E-value=1.4 Score=29.82 Aligned_cols=62 Identities=11% Similarity=0.133 Sum_probs=43.4
Q ss_pred ccceEEeCCCCeEEEeC--CCcEEEEEcCCCCeEEEEeC--CCceeeEEEEcCCCCeEEEEeCCCC
Q psy4903 8 SSGENRTREQALMFVAD--SNQILRTNMDGTMAMSIVSE--AAYKASGVALDINAKRLFWCDNLLD 69 (73)
Q Consensus 8 ~~g~~~dp~~g~lywtd--~~~I~ra~ldGs~~~~l~~~--~~~~p~~lavD~~~~~LYW~D~~~~ 69 (73)
+++++++|....||-+. ...|..-.+++.....++.. ....|.++++++.++.||=+....+
T Consensus 37 ~~~l~~spd~~~lyv~~~~~~~i~~~~~~~~g~l~~~~~~~~~~~p~~i~~~~~g~~l~v~~~~~~ 102 (330)
T PRK11028 37 VQPMVISPDKRHLYVGVRPEFRVLSYRIADDGALTFAAESPLPGSPTHISTDHQGRFLFSASYNAN 102 (330)
T ss_pred CccEEECCCCCEEEEEECCCCcEEEEEECCCCceEEeeeecCCCCceEEEECCCCCEEEEEEcCCC
Confidence 56788999888999886 57776666664333333322 2347999999999999998865433
No 30
>PF13449 Phytase-like: Esterase-like activity of phytase
Probab=92.90 E-value=0.42 Score=33.12 Aligned_cols=57 Identities=19% Similarity=0.302 Sum_probs=43.3
Q ss_pred cceEEeCCCCeEEEeC--C------CcEEEEEcCCCCeEEE-EeCCC-------------ceeeEEEEcCCCCeEEEEeC
Q psy4903 9 SGENRTREQALMFVAD--S------NQILRTNMDGTMAMSI-VSEAA-------------YKASGVALDINAKRLFWCDN 66 (73)
Q Consensus 9 ~g~~~dp~~g~lywtd--~------~~I~ra~ldGs~~~~l-~~~~~-------------~~p~~lavD~~~~~LYW~D~ 66 (73)
-||++ +..|.+||++ . ++|.+...+|...+.+ +...+ .-.+|||+...++.||-+-.
T Consensus 88 Egi~~-~~~g~~~is~E~~~~~~~~p~I~~~~~~G~~~~~~~vP~~~~~~~~~~~~~~~N~G~E~la~~~dG~~l~~~~E 166 (326)
T PF13449_consen 88 EGIAV-PPDGSFWISSEGGRTGGIPPRIRRFDLDGRVIRRFPVPAAFLPDANGTSGRRNNRGFEGLAVSPDGRTLFAAME 166 (326)
T ss_pred hHeEE-ecCCCEEEEeCCccCCCCCCEEEEECCCCcccceEccccccccccCccccccCCCCeEEEEECCCCCEEEEEEC
Confidence 37888 8899999998 6 9999999999875555 32221 23799999988887876544
No 31
>PF06977 SdiA-regulated: SdiA-regulated; InterPro: IPR009722 This entry represents a conserved region approximately 100 residues long within a number of hypothetical bacterial proteins that may be regulated by SdiA, a member of the LuxR family of transcriptional regulators []. Some proteins contain the IPR001258 from INTERPRO repeat.; PDB: 3QQZ_A.
Probab=92.89 E-value=1.6 Score=29.56 Aligned_cols=47 Identities=17% Similarity=0.285 Sum_probs=35.5
Q ss_pred ccceEEeCCCCeEEEeC--CCcEEEEEcCCCCeEEEEeCCCceeeEEEE
Q psy4903 8 SSGENRTREQALMFVAD--SNQILRTNMDGTMAMSIVSEAAYKASGVAL 54 (73)
Q Consensus 8 ~~g~~~dp~~g~lywtd--~~~I~ra~ldGs~~~~l~~~~~~~p~~lav 54 (73)
.|||+.+|.++.||=.. .+.|..-+++|.-.+.+--.+...|+||+.
T Consensus 24 ~SGLTy~pd~~tLfaV~d~~~~i~els~~G~vlr~i~l~g~~D~EgI~y 72 (248)
T PF06977_consen 24 LSGLTYNPDTGTLFAVQDEPGEIYELSLDGKVLRRIPLDGFGDYEGITY 72 (248)
T ss_dssp EEEEEEETTTTEEEEEETTTTEEEEEETT--EEEEEE-SS-SSEEEEEE
T ss_pred ccccEEcCCCCeEEEEECCCCEEEEEcCCCCEEEEEeCCCCCCceeEEE
Confidence 48999999999988665 689999999987666655567888999988
No 32
>COG3204 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=92.01 E-value=0.77 Score=32.39 Aligned_cols=62 Identities=19% Similarity=0.267 Sum_probs=44.7
Q ss_pred eeeeeccceEEeCCCCeEEEeC--CCcEEEEEcCCCCeEEEE--------eCCCceeeEEEEcCCCCeEEEEe
Q psy4903 3 FKMKFSSGENRTREQALMFVAD--SNQILRTNMDGTMAMSIV--------SEAAYKASGVALDINAKRLFWCD 65 (73)
Q Consensus 3 ~~~~~~~g~~~dp~~g~lywtd--~~~I~ra~ldGs~~~~l~--------~~~~~~p~~lavD~~~~~LYW~D 65 (73)
+-++=-||+..|+.++.|+--. ...+.....+|--+..+. ++++.+|+|+|.|..+ .||-+.
T Consensus 230 ~f~~DvSgl~~~~~~~~LLVLS~ESr~l~Evd~~G~~~~~lsL~~g~~gL~~dipqaEGiamDd~g-~lYIvS 301 (316)
T COG3204 230 LFVLDVSGLEFNAITNSLLVLSDESRRLLEVDLSGEVIELLSLTKGNHGLSSDIPQAEGIAMDDDG-NLYIVS 301 (316)
T ss_pred eEeeccccceecCCCCcEEEEecCCceEEEEecCCCeeeeEEeccCCCCCcccCCCcceeEECCCC-CEEEEe
Confidence 3455568999999988776554 688888888886544443 2488899999999544 566543
No 33
>COG3391 Uncharacterized conserved protein [Function unknown]
Probab=91.69 E-value=2.3 Score=30.04 Aligned_cols=63 Identities=13% Similarity=0.175 Sum_probs=43.4
Q ss_pred eccceEEeCCCCeEEEeC-C---CcEEEEEcCCCCeEEEEeCCCc-eeeEEEEcCCCCeEEEEeCCCCee
Q psy4903 7 FSSGENRTREQALMFVAD-S---NQILRTNMDGTMAMSIVSEAAY-KASGVALDINAKRLFWCDNLLDYI 71 (73)
Q Consensus 7 ~~~g~~~dp~~g~lywtd-~---~~I~ra~ldGs~~~~l~~~~~~-~p~~lavD~~~~~LYW~D~~~~~i 71 (73)
.|.|++++|..+.+|-++ + .++....-. ...++...... .|.++++|+.++++|=++...+.+
T Consensus 117 ~P~~~~~~~~~~~vYV~n~~~~~~~vsvid~~--t~~~~~~~~vG~~P~~~a~~p~g~~vyv~~~~~~~v 184 (381)
T COG3391 117 GPVGLAVDPDGKYVYVANAGNGNNTVSVIDAA--TNKVTATIPVGNTPTGVAVDPDGNKVYVTNSDDNTV 184 (381)
T ss_pred CCceEEECCCCCEEEEEecccCCceEEEEeCC--CCeEEEEEecCCCcceEEECCCCCeEEEEecCCCeE
Confidence 578999999999999998 2 444443333 22222222222 689999999999999999665544
No 34
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=90.52 E-value=2.4 Score=28.71 Aligned_cols=63 Identities=14% Similarity=0.153 Sum_probs=42.5
Q ss_pred cceEEeCCCCeEEEeC--CCcEEEEEcCCCC-eEEEEeC--CCceeeEEEEcCCCCeEEEEeCCCCee
Q psy4903 9 SGENRTREQALMFVAD--SNQILRTNMDGTM-AMSIVSE--AAYKASGVALDINAKRLFWCDNLLDYI 71 (73)
Q Consensus 9 ~g~~~dp~~g~lywtd--~~~I~ra~ldGs~-~~~l~~~--~~~~p~~lavD~~~~~LYW~D~~~~~i 71 (73)
.+++++|..+++|=++ ...|..-.++-.. ...++.. .-..|.++++++.++.||=+....+.|
T Consensus 231 ~~i~~~pdg~~lyv~~~~~~~I~v~~i~~~~~~~~~~~~~~~~~~p~~~~~~~dg~~l~va~~~~~~v 298 (330)
T PRK11028 231 ADIHITPDGRHLYACDRTASLISVFSVSEDGSVLSFEGHQPTETQPRGFNIDHSGKYLIAAGQKSHHI 298 (330)
T ss_pred eeEEECCCCCEEEEecCCCCeEEEEEEeCCCCeEEEeEEEeccccCCceEECCCCCEEEEEEccCCcE
Confidence 3578899888899887 4667666654221 2222222 224799999999999999888755544
No 35
>PF03088 Str_synth: Strictosidine synthase; InterPro: IPR018119 This entry represents a conserved region found in strictosidine synthase (4.3.3.2 from EC), a key enzyme in alkaloid biosynthesis. It catalyses the Pictet-Spengler stereospecific condensation of tryptamine with secologanin to form strictosidine []. The structure of the native enzyme from the Indian medicinal plant Rauvolfia serpentina (Serpentwood) (Devilpepper) represents the first example of a six-bladed four-stranded beta-propeller fold from the plant kingdom [].; GO: 0016844 strictosidine synthase activity, 0009058 biosynthetic process; PDB: 2FPB_A 2V91_B 2FP8_A 3V1S_B 2FPC_A 2VAQ_A 2FP9_B.
Probab=90.17 E-value=0.46 Score=27.57 Aligned_cols=32 Identities=25% Similarity=0.506 Sum_probs=27.6
Q ss_pred eeeccceEEeCCCCeEEEeC--CCcEEEEEcCCC
Q psy4903 5 MKFSSGENRTREQALMFVAD--SNQILRTNMDGT 36 (73)
Q Consensus 5 ~~~~~g~~~dp~~g~lywtd--~~~I~ra~ldGs 36 (73)
.-||.|+++.|....++.++ ..+|.|-++.|.
T Consensus 56 L~fpNGVals~d~~~vlv~Et~~~Ri~rywl~Gp 89 (89)
T PF03088_consen 56 LYFPNGVALSPDESFVLVAETGRYRILRYWLKGP 89 (89)
T ss_dssp ESSEEEEEE-TTSSEEEEEEGGGTEEEEEESSST
T ss_pred CCccCeEEEcCCCCEEEEEeccCceEEEEEEeCC
Confidence 45899999999999999999 699999999883
No 36
>PF03022 MRJP: Major royal jelly protein; InterPro: IPR003534 The major royal jelly proteins (MRJPs) comprise 12.5% of the mass, and 82-90% of the protein content [], of honeybee (Apis mellifera) royal jelly. Royal jelly is a substance secreted by the cephalic glands of nurse bees [] and it is used to trigger development of a queen bee from a bee larva. The biological function of the MRJPs is unknown, but they are believed to play a major role in nutrition due to their high essential amino acid content []. Two royal jelly proteins, MRJP3 and MRJP5, contain a tandem repeat that results from a high genetic variablility. This polymorphism may be useful for genotyping individual bees [].; PDB: 3Q6P_B 3Q6K_A 3Q6T_A 2QE8_B.
Probab=88.54 E-value=1.3 Score=30.41 Aligned_cols=62 Identities=16% Similarity=0.277 Sum_probs=38.7
Q ss_pred ceEEeCC---CCeEEEeC--CCcEEEEEcC----CCC--e------EEEEeCCCceeeEEEEcCCCCeEEEEeCCCCeee
Q psy4903 10 GENRTRE---QALMFVAD--SNQILRTNMD----GTM--A------MSIVSEAAYKASGVALDINAKRLFWCDNLLDYIE 72 (73)
Q Consensus 10 g~~~dp~---~g~lywtd--~~~I~ra~ld----Gs~--~------~~l~~~~~~~p~~lavD~~~~~LYW~D~~~~~ie 72 (73)
|+++.|. .+.|||.. +.++++...+ .+. . ...+.+......|+++|. ...||.++...+.|-
T Consensus 132 gial~~~~~d~r~LYf~~lss~~ly~v~T~~L~~~~~~~~~~~~~~v~~lG~k~~~s~g~~~D~-~G~ly~~~~~~~aI~ 210 (287)
T PF03022_consen 132 GIALSPISPDGRWLYFHPLSSRKLYRVPTSVLRDPSLSDAQALASQVQDLGDKGSQSDGMAIDP-NGNLYFTDVEQNAIG 210 (287)
T ss_dssp EEEE-TTSTTS-EEEEEETT-SEEEEEEHHHHCSTT--HHH-HHHT-EEEEE---SECEEEEET-TTEEEEEECCCTEEE
T ss_pred ccccCCCCCCccEEEEEeCCCCcEEEEEHHHhhCccccccccccccceeccccCCCCceEEECC-CCcEEEecCCCCeEE
Confidence 4667663 46899998 5678887764 221 1 122333335679999997 889999999877664
No 37
>KOG4499|consensus
Probab=88.22 E-value=2.7 Score=29.25 Aligned_cols=66 Identities=14% Similarity=0.178 Sum_probs=46.5
Q ss_pred eeeeccceEEeCCCCeEEEeC--CCcEEEEEcC---C--CCeEEEEeC------CCceeeEEEEcCCCCeEEEEeCCCCe
Q psy4903 4 KMKFSSGENRTREQALMFVAD--SNQILRTNMD---G--TMAMSIVSE------AAYKASGVALDINAKRLFWCDNLLDY 70 (73)
Q Consensus 4 ~~~~~~g~~~dp~~g~lywtd--~~~I~ra~ld---G--s~~~~l~~~------~~~~p~~lavD~~~~~LYW~D~~~~~ 70 (73)
.+.+|.||+-|-....+|++| +-.|..-..| | +++.+++.- +-..|.|++||- +.+||-+-...++
T Consensus 156 ~v~IsNgl~Wd~d~K~fY~iDsln~~V~a~dyd~~tG~~snr~~i~dlrk~~~~e~~~PDGm~ID~-eG~L~Va~~ng~~ 234 (310)
T KOG4499|consen 156 CVGISNGLAWDSDAKKFYYIDSLNYEVDAYDYDCPTGDLSNRKVIFDLRKSQPFESLEPDGMTIDT-EGNLYVATFNGGT 234 (310)
T ss_pred hccCCccccccccCcEEEEEccCceEEeeeecCCCcccccCcceeEEeccCCCcCCCCCCcceEcc-CCcEEEEEecCcE
Confidence 467899999999999999999 5556444433 2 568888752 334599999995 6677766544433
No 38
>PF07995 GSDH: Glucose / Sorbosone dehydrogenase; InterPro: IPR012938 Proteins containing this domain are thought to be glucose/sorbosone dehydrogenases. The best characterised of these proteins is soluble glucose dehydrogenase (P13650 from SWISSPROT) from Acinetobacter calcoaceticus, which oxidises glucose to gluconolactone. The enzyme is a calcium-dependent homodimer which uses PQQ as a cofactor [].; GO: 0016901 oxidoreductase activity, acting on the CH-OH group of donors, quinone or similar compound as acceptor, 0048038 quinone binding, 0005975 carbohydrate metabolic process; PDB: 2ISM_A 2WG3_D 3HO5_A 3HO4_A 3HO3_A 2WFT_A 2WG4_B 2WFX_B 1CRU_A 1CQ1_B ....
Probab=88.22 E-value=3.4 Score=28.71 Aligned_cols=60 Identities=15% Similarity=0.232 Sum_probs=42.2
Q ss_pred eeccceEEeCCCCeEEEeC-CCcEEEEEcCCCCeEEEEe------CCCceeeEEEEcCC---CCeEEEEeC
Q psy4903 6 KFSSGENRTREQALMFVAD-SNQILRTNMDGTMAMSIVS------EAAYKASGVALDIN---AKRLFWCDN 66 (73)
Q Consensus 6 ~~~~g~~~dp~~g~lywtd-~~~I~ra~ldGs~~~~l~~------~~~~~p~~lavD~~---~~~LYW~D~ 66 (73)
.-|.+|+..|. |.||.++ ..+|.+..-+|+....+.. .....+-|||+|+. ++.||.+-.
T Consensus 2 ~~P~~~a~~pd-G~l~v~e~~G~i~~~~~~g~~~~~v~~~~~v~~~~~~gllgia~~p~f~~n~~lYv~~t 71 (331)
T PF07995_consen 2 NNPRSMAFLPD-GRLLVAERSGRIWVVDKDGSLKTPVADLPEVFADGERGLLGIAFHPDFASNGYLYVYYT 71 (331)
T ss_dssp SSEEEEEEETT-SCEEEEETTTEEEEEETTTEECEEEEE-TTTBTSTTBSEEEEEE-TTCCCC-EEEEEEE
T ss_pred CCceEEEEeCC-CcEEEEeCCceEEEEeCCCcCcceecccccccccccCCcccceeccccCCCCEEEEEEE
Confidence 35789999997 8899998 7999999988876333322 24456899999983 467766544
No 39
>PF01731 Arylesterase: Arylesterase; InterPro: IPR002640 The serum paraoxonases/arylesterases are enzymes that catalyse the hydrolysis of the toxic metabolites of a variety of organophosphorus insecticides. The enzymes hydrolyse a broad spectrum of organophosphate substrates, including paraoxon and a number of aromatic carboxylic acid esters (e.g., phenyl acetate), and hence confer resistance to organophosphate toxicity []. Mammals have 3 distinct paraoxonase types, termed PON1-3 [, ]. In mice and humans, the PON genes are found on the same chromosome in close proximity. PON activity has been found in variety of tissues, with highest levels in liver and serum - the source of serum PON is thought to be the liver. Unlike mammals, fish and avian species lack paraoxonase activity. Human and rabbit PONs appear to have two distinct Ca2+ binding sites, one required for stability and one required for catalytic activity. The Ca2+ dependency of PONs suggests a mechanism of hydrolysis where Ca2+ acts as the electrophillic catalyst, like that proposed for phospholipase A2. The paraoxonase enzymes, PON1 and PON3, are high density lipoprotein (HDL)- associated proteins capable of preventing oxidative modification of low density lipoproteins (LPL) []. Although PON2 has oxidative properties, the enzyme does not associate with HDL. Within a given species, PON1, PON2 and PON3 share ~60% amino acid sequence identity, whereas between mammalian species particular PONs (1,2 or 3) share 79-90% identity at the amino acid level. Human PON1 and PON3 share numerous conserved phosphorylation and N-glycosylation sites; however, it is not known whether the PON proteins are modified at these sites, or whether modification at these sites is required for activity in vivo []. This family consists of arylesterases (Also known as serum paraoxonase) 3.1.1.2 from EC. These enzymes hydrolyse organophosphorus esters such as paraoxon and are found in the liver and blood. They confer resistance to organophosphate toxicity []. Human arylesterase (PON1) P27169 from SWISSPROT is associated with HDL and may protect against LDL oxidation [].; GO: 0004064 arylesterase activity
Probab=87.82 E-value=1 Score=25.94 Aligned_cols=25 Identities=20% Similarity=0.469 Sum_probs=20.5
Q ss_pred eeeccceEEeCCCCeEEEeC--CCcEE
Q psy4903 5 MKFSSGENRTREQALMFVAD--SNQIL 29 (73)
Q Consensus 5 ~~~~~g~~~dp~~g~lywtd--~~~I~ 29 (73)
+.+|.||+++|...+||-++ ...|.
T Consensus 53 ~~~aNGI~~s~~~k~lyVa~~~~~~I~ 79 (86)
T PF01731_consen 53 FSFANGIAISPDKKYLYVASSLAHSIH 79 (86)
T ss_pred CCCCceEEEcCCCCEEEEEeccCCeEE
Confidence 57899999999999999887 34444
No 40
>PF05345 He_PIG: Putative Ig domain; InterPro: IPR008009 This alignment represents the conserved core region of a ~90 residue repeat found in several haemagglutinins and other cell surface proteins. Sequence similarities to Hyalin (IPR003410 from INTERPRO) and the PKD domain (IPR000601 from INTERPRO) suggest an Ig-like fold so this family may be similar in function to the (IPR003791 from INTERPRO) and (IPR003790 from INTERPRO) protein families.
Probab=86.45 E-value=0.88 Score=23.41 Aligned_cols=19 Identities=11% Similarity=0.017 Sum_probs=17.6
Q ss_pred eeccceEEeCCCCeEEEeC
Q psy4903 6 KFSSGENRTREQALMFVAD 24 (73)
Q Consensus 6 ~~~~g~~~dp~~g~lywtd 24 (73)
.+|+||.+||.+|.|.|+-
T Consensus 11 ~LP~gLs~d~~tG~isGtp 29 (49)
T PF05345_consen 11 GLPSGLSLDPSTGTISGTP 29 (49)
T ss_pred CCCCcEEEeCCCCEEEeec
Confidence 5899999999999999985
No 41
>COG3391 Uncharacterized conserved protein [Function unknown]
Probab=85.46 E-value=9.3 Score=27.01 Aligned_cols=60 Identities=13% Similarity=0.236 Sum_probs=43.3
Q ss_pred eccceEEeCCCCeEEEeC--CCcEEEEEcCCCCeEEEEeCCCceeeEEEEcCCCCeEEEEeCC
Q psy4903 7 FSSGENRTREQALMFVAD--SNQILRTNMDGTMAMSIVSEAAYKASGVALDINAKRLFWCDNL 67 (73)
Q Consensus 7 ~~~g~~~dp~~g~lywtd--~~~I~ra~ldGs~~~~l~~~~~~~p~~lavD~~~~~LYW~D~~ 67 (73)
.+.|+++.|...++|=++ ...|.-..++......-+.... .|.++++|+..+.+|-++..
T Consensus 75 ~p~~i~v~~~~~~vyv~~~~~~~v~vid~~~~~~~~~~~vG~-~P~~~~~~~~~~~vYV~n~~ 136 (381)
T COG3391 75 YPAGVAVNPAGNKVYVTTGDSNTVSVIDTATNTVLGSIPVGL-GPVGLAVDPDGKYVYVANAG 136 (381)
T ss_pred cccceeeCCCCCeEEEecCCCCeEEEEcCcccceeeEeeecc-CCceEEECCCCCEEEEEecc
Confidence 467899999999999887 4666665543322222222232 89999999999999999984
No 42
>COG2133 Glucose/sorbosone dehydrogenases [Carbohydrate transport and metabolism]
Probab=85.15 E-value=4 Score=29.73 Aligned_cols=59 Identities=15% Similarity=0.097 Sum_probs=42.4
Q ss_pred cceEEeCCCCeEEEeC---------------CCcEEEEEcC--------CCCeEEEEeCCCceeeEEEEcCCCCeEEEEe
Q psy4903 9 SGENRTREQALMFVAD---------------SNQILRTNMD--------GTMAMSIVSEAAYKASGVALDINAKRLFWCD 65 (73)
Q Consensus 9 ~g~~~dp~~g~lywtd---------------~~~I~ra~ld--------Gs~~~~l~~~~~~~p~~lavD~~~~~LYW~D 65 (73)
+-|+.+|.. +||-+- ..+|.|..-+ +.+ ..++..++.+|.|++.|+.+..||-+|
T Consensus 180 ~~l~f~pDG-~Lyvs~G~~~~~~~aq~~~~~~Gk~~r~~~a~~~~~d~p~~~-~~i~s~G~RN~qGl~w~P~tg~Lw~~e 257 (399)
T COG2133 180 GRLVFGPDG-KLYVTTGSNGDPALAQDNVSLAGKVLRIDRAGIIPADNPFPN-SEIWSYGHRNPQGLAWHPVTGALWTTE 257 (399)
T ss_pred ccEEECCCC-cEEEEeCCCCCcccccCccccccceeeeccCcccccCCCCCC-cceEEeccCCccceeecCCCCcEEEEe
Confidence 446777766 888773 1334443333 333 456777999999999999999999999
Q ss_pred CCCC
Q psy4903 66 NLLD 69 (73)
Q Consensus 66 ~~~~ 69 (73)
...+
T Consensus 258 ~g~d 261 (399)
T COG2133 258 HGPD 261 (399)
T ss_pred cCCC
Confidence 8764
No 43
>TIGR02604 Piru_Ver_Nterm putative membrane-bound dehydrogenase domain. All proteins that score above the trusted cutoff score of 45 to this model are large proteins of either Pirellula sp. 1 or Verrucomicrobium spinosum. These proteins all contain, in addition to this domain, several hundred residues of highly variable sequence, and then a well-conserved C-terminal domain (TIGR02603) that features a putative cytochrome c-type heme binding motif CXXCH. The membrane-bound L-sorbosone dehydrogenase from Acetobacter liquefaciens (Gluconacetobacter liquefaciens) is homologous to this domain but lacks additional sequence regions shared by members of this family and belongs to a different clade of the larger family of homologs. It and its closely related homologs are excluded from the this model by scoring between the trusted (45) and noise (18) cutoffs.
Probab=84.86 E-value=4.1 Score=28.55 Aligned_cols=60 Identities=17% Similarity=0.215 Sum_probs=40.9
Q ss_pred eeeccceEEeCCCCeEEEeCCCcEEEE-EcCCC-----CeEEEEeC---C----CceeeEEEEcCCCCeEEEEeCC
Q psy4903 5 MKFSSGENRTREQALMFVADSNQILRT-NMDGT-----MAMSIVSE---A----AYKASGVALDINAKRLFWCDNL 67 (73)
Q Consensus 5 ~~~~~g~~~dp~~g~lywtd~~~I~ra-~ldGs-----~~~~l~~~---~----~~~p~~lavD~~~~~LYW~D~~ 67 (73)
+.+|.|+++.+.. ||-++.++|.|. ..+|. .+++++.. . ...|+++++++. .+||+++..
T Consensus 71 l~~p~Gi~~~~~G--lyV~~~~~i~~~~d~~gdg~ad~~~~~l~~~~~~~~~~~~~~~~~l~~gpD-G~LYv~~G~ 143 (367)
T TIGR02604 71 LSMVTGLAVAVGG--VYVATPPDILFLRDKDGDDKADGEREVLLSGFGGQINNHHHSLNSLAWGPD-GWLYFNHGN 143 (367)
T ss_pred CCCccceeEecCC--EEEeCCCeEEEEeCCCCCCCCCCccEEEEEccCCCCCcccccccCceECCC-CCEEEeccc
Confidence 3467888888754 888887788887 44442 34555542 1 345889999874 589998873
No 44
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=83.91 E-value=14 Score=26.64 Aligned_cols=65 Identities=6% Similarity=0.044 Sum_probs=49.5
Q ss_pred ccceEEeCCCCeEEEeC--CCcEEEEEcCCC-CeEEEEe---------CCCceeeEEEEcCCCCeEEEEeCCCCeee
Q psy4903 8 SSGENRTREQALMFVAD--SNQILRTNMDGT-MAMSIVS---------EAAYKASGVALDINAKRLFWCDNLLDYIE 72 (73)
Q Consensus 8 ~~g~~~dp~~g~lywtd--~~~I~ra~ldGs-~~~~l~~---------~~~~~p~~lavD~~~~~LYW~D~~~~~ie 72 (73)
|+=|+.+|...+.|-.. +.+|.....++. .+-..++ .+-.+..+|.|.+.++.||-++.+.+.|-
T Consensus 193 PRHi~FHpn~k~aY~v~EL~stV~v~~y~~~~g~~~~lQ~i~tlP~dF~g~~~~aaIhis~dGrFLYasNRg~dsI~ 269 (346)
T COG2706 193 PRHIVFHPNGKYAYLVNELNSTVDVLEYNPAVGKFEELQTIDTLPEDFTGTNWAAAIHISPDGRFLYASNRGHDSIA 269 (346)
T ss_pred cceEEEcCCCcEEEEEeccCCEEEEEEEcCCCceEEEeeeeccCccccCCCCceeEEEECCCCCEEEEecCCCCeEE
Confidence 45588999999999887 899999998874 2222221 14556788999999999999999988763
No 45
>KOG4499|consensus
Probab=81.63 E-value=4.4 Score=28.24 Aligned_cols=62 Identities=16% Similarity=0.129 Sum_probs=40.8
Q ss_pred eeeeeccceEEeCCCCeEEEeC-----------------------CCcEEEEEcCCCCeEEEEe---CCCceeeEEEEcC
Q psy4903 3 FKMKFSSGENRTREQALMFVAD-----------------------SNQILRTNMDGTMAMSIVS---EAAYKASGVALDI 56 (73)
Q Consensus 3 ~~~~~~~g~~~dp~~g~lywtd-----------------------~~~I~ra~ldGs~~~~l~~---~~~~~p~~lavD~ 56 (73)
|.-++|.|+++|- .|.||-+- .++|.+|-+.|.+...+.. ..+..|.++--|+
T Consensus 209 ~e~~~PDGm~ID~-eG~L~Va~~ng~~V~~~dp~tGK~L~eiklPt~qitsccFgGkn~d~~yvT~aa~~~dp~~~~~~p 287 (310)
T KOG4499|consen 209 FESLEPDGMTIDT-EGNLYVATFNGGTVQKVDPTTGKILLEIKLPTPQITSCCFGGKNLDILYVTTAAKFDDPVRTNTDP 287 (310)
T ss_pred cCCCCCCcceEcc-CCcEEEEEecCcEEEEECCCCCcEEEEEEcCCCceEEEEecCCCccEEEEEehhcccCchhcccCC
Confidence 4556788888775 55555442 3677788888876555543 2666788777777
Q ss_pred CCCeEEEEe
Q psy4903 57 NAKRLFWCD 65 (73)
Q Consensus 57 ~~~~LYW~D 65 (73)
....+|=+.
T Consensus 288 ~aG~iykit 296 (310)
T KOG4499|consen 288 NAGTIYKIT 296 (310)
T ss_pred CCccEEEec
Confidence 677776543
No 46
>PF13449 Phytase-like: Esterase-like activity of phytase
Probab=75.70 E-value=16 Score=25.33 Aligned_cols=58 Identities=16% Similarity=0.192 Sum_probs=37.2
Q ss_pred cceEEeCCCCeEEEe--CC------CcEEEEEcCCC----CeEEEE------eC---CC----ceeeEEEEcCCCCeEEE
Q psy4903 9 SGENRTREQALMFVA--DS------NQILRTNMDGT----MAMSIV------SE---AA----YKASGVALDINAKRLFW 63 (73)
Q Consensus 9 ~g~~~dp~~g~lywt--d~------~~I~ra~ldGs----~~~~l~------~~---~~----~~p~~lavD~~~~~LYW 63 (73)
|||+++|..|. ||+ |. +++++..++.. ....+. .. .+ ..++||++ .....+||
T Consensus 23 Sgl~~~~~~~~-~~avSD~g~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~L~~~~G~~~~~~~~D~Egi~~-~~~g~~~i 100 (326)
T PF13449_consen 23 SGLDYDPDDGR-FYAVSDRGPNKGPPRFYTFRIDYDQGGIGGVTILDMIPLRDPDGQPFPKNGLDPEGIAV-PPDGSFWI 100 (326)
T ss_pred eeEEEeCCCCE-EEEEECCCCCCCCCcEEEEEeeccCCCccceEeccceeccCCCCCcCCcCCCChhHeEE-ecCCCEEE
Confidence 89999986654 555 33 34888777751 111111 11 11 16789999 67889999
Q ss_pred EeCCC
Q psy4903 64 CDNLL 68 (73)
Q Consensus 64 ~D~~~ 68 (73)
++.+.
T Consensus 101 s~E~~ 105 (326)
T PF13449_consen 101 SSEGG 105 (326)
T ss_pred EeCCc
Confidence 99877
No 47
>KOG4649|consensus
Probab=74.25 E-value=3.6 Score=29.10 Aligned_cols=24 Identities=4% Similarity=-0.082 Sum_probs=20.1
Q ss_pred eEEeCCCCeEEEeC--CCcEEEEEcC
Q psy4903 11 ENRTREQALMFVAD--SNQILRTNMD 34 (73)
Q Consensus 11 ~~~dp~~g~lywtd--~~~I~ra~ld 34 (73)
+++||++|.|+|-. ..+|+...+-
T Consensus 36 ~avd~~sG~~~We~ilg~RiE~sa~v 61 (354)
T KOG4649|consen 36 IAVDPQSGNLIWEAILGVRIECSAIV 61 (354)
T ss_pred EEecCCCCcEEeehhhCceeeeeeEE
Confidence 46899999999998 7888887654
No 48
>PF05787 DUF839: Bacterial protein of unknown function (DUF839); InterPro: IPR008557 This family consists of bacterial proteins of unknown function.
Probab=70.39 E-value=36 Score=25.57 Aligned_cols=56 Identities=16% Similarity=0.191 Sum_probs=39.3
Q ss_pred eeeccceEEeCCCCeEEEeC--C-------------------CcEEEEEcCCC-------CeEEEEe-------------
Q psy4903 5 MKFSSGENRTREQALMFVAD--S-------------------NQILRTNMDGT-------MAMSIVS------------- 43 (73)
Q Consensus 5 ~~~~~g~~~dp~~g~lywtd--~-------------------~~I~ra~ldGs-------~~~~l~~------------- 43 (73)
|.=+-|+.++|.+|.+|.+. + .+|.|-..++. ..+.++.
T Consensus 349 f~RpEgi~~~p~~g~vY~a~T~~~~r~~~~~~~~n~~~~n~~G~I~r~~~~~~d~~~~~f~~~~~~~~g~~~~~~~~~~~ 428 (524)
T PF05787_consen 349 FDRPEGITVNPDDGEVYFALTNNSGRGESDVDAANPRAGNGYGQIYRYDPDGNDHAATTFTWELFLVGGDPTDASGNGSN 428 (524)
T ss_pred ccCccCeeEeCCCCEEEEEEecCCCCcccccccCCcccCCcccEEEEecccCCccccceeEEEEEEEecCcccccccccC
Confidence 44467889999999999985 2 37888777654 2333222
Q ss_pred ----CCCceeeEEEEcCCCCe
Q psy4903 44 ----EAAYKASGVALDINAKR 60 (73)
Q Consensus 44 ----~~~~~p~~lavD~~~~~ 60 (73)
..+..|..|++|..++.
T Consensus 429 ~~~~~~f~sPDNL~~d~~G~L 449 (524)
T PF05787_consen 429 KCDDNGFASPDNLAFDPDGNL 449 (524)
T ss_pred cccCCCcCCCCceEECCCCCE
Confidence 24778999999986653
No 49
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=70.04 E-value=38 Score=24.40 Aligned_cols=64 Identities=9% Similarity=0.146 Sum_probs=49.6
Q ss_pred ccceEEeCCCCeEEEeC----CCcEEEEEcCCC-CeEEEEeCCC---ceeeEEEEcCCCCeEEEEeCCCCee
Q psy4903 8 SSGENRTREQALMFVAD----SNQILRTNMDGT-MAMSIVSEAA---YKASGVALDINAKRLFWCDNLLDYI 71 (73)
Q Consensus 8 ~~g~~~dp~~g~lywtd----~~~I~ra~ldGs-~~~~l~~~~~---~~p~~lavD~~~~~LYW~D~~~~~i 71 (73)
|+=|+++|....||=.. ...|..=..|+- ++.++++... ..|..+++|..++.||=+....+.|
T Consensus 42 ptyl~~~~~~~~LY~v~~~~~~ggvaay~iD~~~G~Lt~ln~~~~~g~~p~yvsvd~~g~~vf~AnY~~g~v 113 (346)
T COG2706 42 PTYLAVNPDQRHLYVVNEPGEEGGVAAYRIDPDDGRLTFLNRQTLPGSPPCYVSVDEDGRFVFVANYHSGSV 113 (346)
T ss_pred CceEEECCCCCEEEEEEecCCcCcEEEEEEcCCCCeEEEeeccccCCCCCeEEEECCCCCEEEEEEccCceE
Confidence 44578899999999776 478888888874 7888877532 2359999999999999988877665
No 50
>COG3823 Glutamine cyclotransferase [Posttranslational modification, protein turnover, chaperones]
Probab=67.53 E-value=11 Score=25.81 Aligned_cols=21 Identities=19% Similarity=0.253 Sum_probs=17.2
Q ss_pred CCceeeEEEEcCCCCeEEEEe
Q psy4903 45 AAYKASGVALDINAKRLFWCD 65 (73)
Q Consensus 45 ~~~~p~~lavD~~~~~LYW~D 65 (73)
+..-++|||-|+..+++|-+-
T Consensus 227 ~~nvlNGIA~~~~~~r~~iTG 247 (262)
T COG3823 227 NDNVLNGIAHDPQQDRFLITG 247 (262)
T ss_pred ccccccceeecCcCCeEEEec
Confidence 455799999999999998654
No 51
>PF02333 Phytase: Phytase; InterPro: IPR003431 Phytase (3.1.3.8 from EC) (phytate 3-phosphatase) is a secreted enzyme which hydrolyses phytate to release inorganic phosphate. This family appears to represent a novel enzyme that shows phytase activity () and has been shown to consist of a single structural unit with a six-bladed propeller folding architecture ().; GO: 0016158 3-phytase activity; PDB: 3AMS_A 3AMR_A 1QLG_A 2POO_A 1H6L_A 1CVM_A 1POO_A.
Probab=62.42 E-value=37 Score=24.69 Aligned_cols=48 Identities=21% Similarity=0.298 Sum_probs=32.4
Q ss_pred eccceEEeCCCCeEEEeC-CCcEEEEEcC---CCCeEEEEe---CCC-ceeeEEEE
Q psy4903 7 FSSGENRTREQALMFVAD-SNQILRTNMD---GTMAMSIVS---EAA-YKASGVAL 54 (73)
Q Consensus 7 ~~~g~~~dp~~g~lywtd-~~~I~ra~ld---Gs~~~~l~~---~~~-~~p~~lav 54 (73)
=+.|.|+|...|.||..+ +.-|.+-..+ |..++.+.. ..+ ...+||+|
T Consensus 209 Q~EGCVVDDe~g~LYvgEE~~GIW~y~Aep~~~~~~~~v~~~~g~~l~aDvEGlal 264 (381)
T PF02333_consen 209 QPEGCVVDDETGRLYVGEEDVGIWRYDAEPEGGNDRTLVASADGDGLVADVEGLAL 264 (381)
T ss_dssp -EEEEEEETTTTEEEEEETTTEEEEEESSCCC-S--EEEEEBSSSSB-S-EEEEEE
T ss_pred cceEEEEecccCCEEEecCccEEEEEecCCCCCCcceeeecccccccccCccceEE
Confidence 357999999999999999 7888887776 334444322 123 45889988
No 52
>TIGR03606 non_repeat_PQQ dehydrogenase, PQQ-dependent, s-GDH family. PQQ, or pyrroloquinoline-quinone, serves as a cofactor for a number of sugar and alcohol dehydrogenases in a limited number of bacterial species. Most characterized PQQ-dependent enzymes have multiple repeats of a sequence region described by pfam01011 (PQQ enzyme repeat), but this protein family in unusual in lacking that repeat. Below the noise cutoff are related proteins mostly from species that lack PQQ biosynthesis.
Probab=62.32 E-value=62 Score=24.03 Aligned_cols=60 Identities=13% Similarity=0.157 Sum_probs=39.7
Q ss_pred eeeccceEEeCCCCeEEEeC--CCcEEEEEcCCCCeEEEE------e-CCCceeeEEEEcCC------CCeEEEEe
Q psy4903 5 MKFSSGENRTREQALMFVAD--SNQILRTNMDGTMAMSIV------S-EAAYKASGVALDIN------AKRLFWCD 65 (73)
Q Consensus 5 ~~~~~g~~~dp~~g~lywtd--~~~I~ra~ldGs~~~~l~------~-~~~~~p~~lavD~~------~~~LYW~D 65 (73)
+..|.+++..|. |.||-++ ..+|.+..-++.....+. . ....-+-|||+|+. ++.||.+-
T Consensus 29 L~~Pw~maflPD-G~llVtER~~G~I~~v~~~~~~~~~~~~l~~v~~~~ge~GLlglal~PdF~~~~~n~~lYvsy 103 (454)
T TIGR03606 29 LNKPWALLWGPD-NQLWVTERATGKILRVNPETGEVKVVFTLPEIVNDAQHNGLLGLALHPDFMQEKGNPYVYISY 103 (454)
T ss_pred CCCceEEEEcCC-CeEEEEEecCCEEEEEeCCCCceeeeecCCceeccCCCCceeeEEECCCccccCCCcEEEEEE
Confidence 356788888884 6888888 488998875543333221 1 13455799999854 45788873
No 53
>PF06739 SBBP: Beta-propeller repeat; InterPro: IPR010620 This family is related to IPR001680 from INTERPRO and is likely to also form a beta-propeller. SBBP stands for Seven Bladed Beta Propeller.
Probab=61.17 E-value=16 Score=17.57 Aligned_cols=19 Identities=21% Similarity=0.398 Sum_probs=13.9
Q ss_pred CceeeEEEEcCCCCeEEEEe
Q psy4903 46 AYKASGVALDINAKRLFWCD 65 (73)
Q Consensus 46 ~~~p~~lavD~~~~~LYW~D 65 (73)
...|.+||+|..+ ++|-+=
T Consensus 12 ~~~~~~IavD~~G-NiYv~G 30 (38)
T PF06739_consen 12 QDYGNGIAVDSNG-NIYVTG 30 (38)
T ss_pred ceeEEEEEECCCC-CEEEEE
Confidence 3579999999665 588653
No 54
>COG3204 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=60.55 E-value=22 Score=25.24 Aligned_cols=47 Identities=11% Similarity=0.159 Sum_probs=37.8
Q ss_pred ccceEEeCCCCeEEEeC--CCcEEEEEcCCCCeEEEEeCCCceeeEEEE
Q psy4903 8 SSGENRTREQALMFVAD--SNQILRTNMDGTMAMSIVSEAAYKASGVAL 54 (73)
Q Consensus 8 ~~g~~~dp~~g~lywtd--~~~I~ra~ldGs~~~~l~~~~~~~p~~lav 54 (73)
-|+|+.+|.++.||=+- .+.|.....+|--..++--..+..|++|+.
T Consensus 88 vS~LTynp~~rtLFav~n~p~~iVElt~~GdlirtiPL~g~~DpE~Iey 136 (316)
T COG3204 88 VSSLTYNPDTRTLFAVTNKPAAIVELTKEGDLIRTIPLTGFSDPETIEY 136 (316)
T ss_pred ccceeeCCCcceEEEecCCCceEEEEecCCceEEEecccccCChhHeEE
Confidence 47999999999999775 588888888887766666667888888753
No 55
>TIGR03075 PQQ_enz_alc_DH PQQ-dependent dehydrogenase, methanol/ethanol family. This protein family has a phylogenetic distribution very similar to that coenzyme PQQ biosynthesis enzymes, as shown by partial phylogenetic profiling. Genes in this family often are found adjacent to the PQQ biosynthesis genes themselves. An unusual, strained disulfide bond between adjacent Cys residues contributes to PQQ-binding, as does a Trp residue that is part of a PQQ enzyme repeat (see pfam01011). Characterized members include the dehydrogenase subunit of a membrane-anchored, three subunit alcohol (ethanol) dehydrogenase of Gluconobacter suboxydans, a homodimeric ethanol dehydrogenase in Pseudomonas aeruginosa, and the large subunit of an alpha2/beta2 heterotetrameric methanol dehydrogenase in Methylobacterium extorquens.
Probab=60.20 E-value=26 Score=26.09 Aligned_cols=48 Identities=17% Similarity=0.281 Sum_probs=35.3
Q ss_pred eEEeCCCCeEEEeC-CCcEEEEEcCCCCeEEEEeCCCceeeEEEEcCCCCeEEEEe
Q psy4903 11 ENRTREQALMFVAD-SNQILRTNMDGTMAMSIVSEAAYKASGVALDINAKRLFWCD 65 (73)
Q Consensus 11 ~~~dp~~g~lywtd-~~~I~ra~ldGs~~~~l~~~~~~~p~~lavD~~~~~LYW~D 65 (73)
.++||.+|.+||-- ++.- .+|+.| -..++..-+=+|||..+.++-|.=
T Consensus 239 ~s~D~~~~lvy~~tGnp~p----~~~~~r---~gdnl~~~s~vAld~~TG~~~W~~ 287 (527)
T TIGR03075 239 GSYDPETNLIYFGTGNPSP----WNSHLR---PGDNLYTSSIVARDPDTGKIKWHY 287 (527)
T ss_pred eeEcCCCCeEEEeCCCCCC----CCCCCC---CCCCccceeEEEEccccCCEEEee
Confidence 48999999999987 5433 455555 234666777889998888888863
No 56
>TIGR02658 TTQ_MADH_Hv methylamine dehydrogenase heavy chain. This family consists of the heavy chain of methylamine dehydrogenase light chain, a periplasmic enzyme. The enzyme contains a tryptophan tryptophylquinone (TTQ) prothetic group derived from two Trp residues in the light subunity. The enzyme forms a complex with the type I blue copper protein amicyanin and a cytochrome. Electron transfer procedes from TQQ to the copper and then to the heme group of the cytochrome.
Probab=59.83 E-value=62 Score=23.17 Aligned_cols=57 Identities=7% Similarity=0.070 Sum_probs=35.4
Q ss_pred eEEeCCCCeEEEeC-C----------CcEEEEEcCCCCeEEEEeCC-CceeeEEEEcCCCC-eEEEEeCCCC
Q psy4903 11 ENRTREQALMFVAD-S----------NQILRTNMDGTMAMSIVSEA-AYKASGVALDINAK-RLFWCDNLLD 69 (73)
Q Consensus 11 ~~~dp~~g~lywtd-~----------~~I~ra~ldGs~~~~l~~~~-~~~p~~lavD~~~~-~LYW~D~~~~ 69 (73)
++++|..+.+|-.. + ..|..... ..++++-.-. =..|.++++...++ .||=++...+
T Consensus 253 ia~~~dg~~lyV~~~~~~~~thk~~~~~V~ViD~--~t~kvi~~i~vG~~~~~iavS~Dgkp~lyvtn~~s~ 322 (352)
T TIGR02658 253 VAYHRARDRIYLLADQRAKWTHKTASRFLFVVDA--KTGKRLRKIELGHEIDSINVSQDAKPLLYALSTGDK 322 (352)
T ss_pred EEEcCCCCEEEEEecCCccccccCCCCEEEEEEC--CCCeEEEEEeCCCceeeEEECCCCCeEEEEeCCCCC
Confidence 89999999999843 1 23333332 2222222111 23689999998888 8887776544
No 57
>PF12814 Mcp5_PH: Meiotic cell cortex C-terminal pleckstrin homology; InterPro: IPR024774 This pleckstrin homology domain is found in eukaryotic proteins, including Mcp5, a fungal protein that anchors dynein at the cell cortex during the horsetail phase (prophase I) of meiosis. During prophase I of fission yeast all the telomeres become bundled at the spindle pole body and subsequently the nucleus undergoes a dynamic oscillation, resulting in elongated nuclear morphology known as "horsetail" nucleus. The pleckstrin homology domain is necessary for the cortical localisation of the Mcp5 protein during meiosis [].; GO: 0005515 protein binding, 0032065 cortical protein anchoring, 0005938 cell cortex
Probab=57.70 E-value=18 Score=21.70 Aligned_cols=17 Identities=0% Similarity=-0.126 Sum_probs=13.6
Q ss_pred eEEeCCCCeEEEeC-CCc
Q psy4903 11 ENRTREQALMFVAD-SNQ 27 (73)
Q Consensus 11 ~~~dp~~g~lywtd-~~~ 27 (73)
.-+||..+.|||++ ++.
T Consensus 35 fwv~~~~~~L~Ws~~~p~ 52 (123)
T PF12814_consen 35 FWVDPYTRTLYWSSSNPK 52 (123)
T ss_pred EEEeCCCCEEEecCCCCC
Confidence 44899999999998 443
No 58
>TIGR02276 beta_rpt_yvtn 40-residue YVTN family beta-propeller repeat. This repeat of about 40 amino acids is found in up to 14 copies per protein. Archaea Methanosarcina mazei and Methanosarcina acetivorans each have over 10 genes that encode tandem copies of this repeat, which is also found in other species. PSIPRED predicts with high confidence that each 40-residue repeats contains four beta strands. This model overlaps somewhat with the NHL repeat (Pfam pfam01436) and also shows sequence similarity to the WD domain, G-beta repeat (Pfam pfam00400).
Probab=57.32 E-value=19 Score=16.56 Aligned_cols=40 Identities=15% Similarity=0.238 Sum_probs=21.4
Q ss_pred CCCCeEEEeC--CCcEEEEEcCCCCeEEEEeCCCceeeEEEEc
Q psy4903 15 REQALMFVAD--SNQILRTNMDGTMAMSIVSEAAYKASGVALD 55 (73)
Q Consensus 15 p~~g~lywtd--~~~I~ra~ldGs~~~~l~~~~~~~p~~lavD 55 (73)
|..++||-++ ..+|...+........-+.. -..|.+++++
T Consensus 1 pd~~~lyv~~~~~~~v~~id~~~~~~~~~i~v-g~~P~~i~~~ 42 (42)
T TIGR02276 1 PDGTKLYVTNSGSNTVSVIDTATNKVIATIPV-GGYPFGVAVS 42 (42)
T ss_pred CCCCEEEEEeCCCCEEEEEECCCCeEEEEEEC-CCCCceEEeC
Confidence 4566788777 46676655432222122222 2468887764
No 59
>PF00930 DPPIV_N: Dipeptidyl peptidase IV (DPP IV) N-terminal region; InterPro: IPR002469 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This domain defines serine peptidases belonging to MEROPS peptidase family S9 (clan SC), subfamily S9B (dipeptidyl-peptidase IV). The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. This domain is an alignment of the region to the N-terminal side of the active site, which is found in IPR001375 from INTERPRO. CD26 (3.4.14.5 from EC) is also called adenosine deaminase-binding protein (ADA-binding protein) or dipeptidylpeptidase IV (DPP IV ectoenzyme). The exopeptidase cleaves off N-terminal X-Pro or X-Ala dipeptides from polypeptides (dipeptidyl peptidase IV activity). CD26 serves as the costimulatory molecule in T cell activation and is an associated marker of autoimmune diseases, adenosine deaminase-deficiency and HIV pathogenesis. Dipeptidyl peptidase IV (DPP IV) is responsible for the removal of N-terminal dipeptides sequentially from polypeptides having unsubstituted N termini, provided that the penultimate residue is proline. The enzyme catalyses the reaction: Dipeptidyl-Polypeptide + H(2)O = Dipeptide + Polypeptide It is a type II membrane protein that forms a homodimer. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0006508 proteolysis, 0016020 membrane; PDB: 2RIP_A 3Q8W_B 2AJL_I 1TKR_B 1TK3_B 3C45_A 2G5P_A 3G0C_D 1R9M_C 1RWQ_A ....
Probab=54.72 E-value=44 Score=23.15 Aligned_cols=48 Identities=10% Similarity=0.248 Sum_probs=31.1
Q ss_pred eEEeCCCCeEEEeC------CCcEEEEEcC-CCCeEEEEeCCCceeeEEEEcCCCC
Q psy4903 11 ENRTREQALMFVAD------SNQILRTNMD-GTMAMSIVSEAAYKASGVALDINAK 59 (73)
Q Consensus 11 ~~~dp~~g~lywtd------~~~I~ra~ld-Gs~~~~l~~~~~~~p~~lavD~~~~ 59 (73)
+.+|+.++.+|++. ...+++++++ |...+.|....-.. ...++.+.++
T Consensus 286 ~~~d~~~~~iyf~a~~~~p~~r~lY~v~~~~~~~~~~LT~~~~~~-~~~~~Spdg~ 340 (353)
T PF00930_consen 286 LGWDEDNNRIYFTANGDNPGERHLYRVSLDSGGEPKCLTCEDGDH-YSASFSPDGK 340 (353)
T ss_dssp EEEECTSSEEEEEESSGGTTSBEEEEEETTETTEEEESSTTSSTT-EEEEE-TTSS
T ss_pred ceEcCCCCEEEEEecCCCCCceEEEEEEeCCCCCeEeccCCCCCc-eEEEECCCCC
Confidence 56799999999997 4799999999 65555444322111 3555555444
No 60
>PF14759 Reductase_C: Reductase C-terminal; PDB: 3FG2_P 3LXD_A 2YVG_A 2GR1_A 2GQW_A 2GR3_A 2YVF_A 1F3P_A 2GR0_A 2GR2_A ....
Probab=53.68 E-value=29 Score=19.34 Aligned_cols=26 Identities=19% Similarity=0.258 Sum_probs=17.5
Q ss_pred EEeC--CCcEEEEEcCCCCeEEEEeCCC
Q psy4903 21 FVAD--SNQILRTNMDGTMAMSIVSEAA 46 (73)
Q Consensus 21 ywtd--~~~I~ra~ldGs~~~~l~~~~~ 46 (73)
|||| +.+|+-+..-+..-++++....
T Consensus 2 FWSdQ~~~~iq~~G~~~~~~~~v~rg~~ 29 (85)
T PF14759_consen 2 FWSDQYGVRIQIAGLPGGADEVVVRGDP 29 (85)
T ss_dssp EEEEETTEEEEEEE-STTSSEEEEEEET
T ss_pred eecccCCCeEEEEECCCCCCEEEEEccC
Confidence 8999 6888888876665555555443
No 61
>PLN03216 actin depolymerizing factor; Provisional
Probab=53.08 E-value=6.2 Score=24.40 Aligned_cols=15 Identities=47% Similarity=0.574 Sum_probs=13.1
Q ss_pred CceeeeeccceEEeCC
Q psy4903 1 MKFKMKFSSGENRTRE 16 (73)
Q Consensus 1 ~~~~~~~~~g~~~dp~ 16 (73)
|-|||. .||+.++|.
T Consensus 1 ~~~~m~-~SGi~v~~~ 15 (141)
T PLN03216 1 MAFKMA-TTGMWVTDE 15 (141)
T ss_pred Ccceec-CCCCeeCHH
Confidence 789999 999999875
No 62
>PF06433 Me-amine-dh_H: Methylamine dehydrogenase heavy chain (MADH); InterPro: IPR009451 Methylamine dehydrogenase (1.4.99.3 from EC) is a periplasmic quinoprotein found in several methyltrophic bacteria []. It is induced when grown on methylamine as a carbon source MADH and catalyses the oxidative deamination of amines to their corresponding aldehydes. The redox cofactor of this enzyme is tryptophan tryptophylquinone (TTQ). Electrons derived from the oxidation of methylamine are passed to an electron acceptor, which is usually the blue-copper protein amicyanin (IPR002386 from INTERPRO). RCH2NH2 + H2O + acceptor = RCHO + NH3 + reduced acceptor MADH is a hetero-tetramer, comprised of two heavy subunits and two light subunits. The heavy subunit forms a seven-bladed beta-propeller like structure [].; GO: 0030058 amine dehydrogenase activity, 0030416 methylamine metabolic process, 0055114 oxidation-reduction process, 0042597 periplasmic space; PDB: 3RN1_F 3SVW_F 3PXT_F 3L4O_F 3L4M_D 3SJL_F 3PXS_D 3ORV_F 3RMZ_F 3RLM_F ....
Probab=51.01 E-value=68 Score=23.07 Aligned_cols=49 Identities=12% Similarity=0.224 Sum_probs=34.2
Q ss_pred CCCeEEEeC-CCcEEEEEcCCCCeEEEEeCCC--------c-eeeE---EEEcCCCCeEEEE
Q psy4903 16 EQALMFVAD-SNQILRTNMDGTMAMSIVSEAA--------Y-KASG---VALDINAKRLFWC 64 (73)
Q Consensus 16 ~~g~lywtd-~~~I~ra~ldGs~~~~l~~~~~--------~-~p~~---lavD~~~~~LYW~ 64 (73)
..+.+||.. ++.|+.+.+.|......-.-.+ . .|-| +|++...++||-.
T Consensus 194 ~~~~~~F~Sy~G~v~~~dlsg~~~~~~~~~~~~t~~e~~~~WrPGG~Q~~A~~~~~~rlyvL 255 (342)
T PF06433_consen 194 DGGRLYFVSYEGNVYSADLSGDSAKFGKPWSLLTDAEKADGWRPGGWQLIAYHAASGRLYVL 255 (342)
T ss_dssp TTTEEEEEBTTSEEEEEEETTSSEEEEEEEESS-HHHHHTTEEE-SSS-EEEETTTTEEEEE
T ss_pred CCCeEEEEecCCEEEEEeccCCcccccCcccccCccccccCcCCcceeeeeeccccCeEEEE
Confidence 456888888 9999999999987554432111 2 2433 7889899999953
No 63
>KOG4649|consensus
Probab=49.66 E-value=17 Score=25.75 Aligned_cols=26 Identities=27% Similarity=0.271 Sum_probs=22.4
Q ss_pred eeeEEEEcCCCCeEEEEeCCCCeeeC
Q psy4903 48 KASGVALDINAKRLFWCDNLLDYIET 73 (73)
Q Consensus 48 ~p~~lavD~~~~~LYW~D~~~~~ie~ 73 (73)
.-.-+|||+.+.++||-.....+||+
T Consensus 32 s~~~~avd~~sG~~~We~ilg~RiE~ 57 (354)
T KOG4649|consen 32 SGIVIAVDPQSGNLIWEAILGVRIEC 57 (354)
T ss_pred CceEEEecCCCCcEEeehhhCceeee
Confidence 34557899999999999999999986
No 64
>PF03664 Glyco_hydro_62: Glycosyl hydrolase family 62 ; InterPro: IPR005193 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. This is a family of alpha -L-arabinofuranosidases (3.2.1.55 from EC) which are all members of glycoside hydrolase family 62 (GH62 from CAZY). This enzyme hydrolyzed aryl alpha-L-arabinofuranosides and cleaves arabinosyl side chains from arabinoxylan and arabinan.; GO: 0046556 alpha-N-arabinofuranosidase activity, 0046373 L-arabinose metabolic process
Probab=46.89 E-value=49 Score=23.04 Aligned_cols=30 Identities=23% Similarity=0.438 Sum_probs=22.1
Q ss_pred eCCCCeEEEeC-CCcEEEEEcC--------CCCeEEEEe
Q psy4903 14 TREQALMFVAD-SNQILRTNMD--------GTMAMSIVS 43 (73)
Q Consensus 14 dp~~g~lywtd-~~~I~ra~ld--------Gs~~~~l~~ 43 (73)
|-.+-+||+++ +++|+|+.|- |+.-+++.+
T Consensus 141 D~~n~yLFfa~DnGkiYRs~~~i~nFP~~fgs~~~vvms 179 (271)
T PF03664_consen 141 DDTNMYLFFAGDNGKIYRSSMPIGNFPGGFGSSYTVVMS 179 (271)
T ss_pred cCCceEEEEcCCCCcEEEeccchhhCCCCCCCceEEEEe
Confidence 45566899998 8999999985 545555554
No 65
>TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol.
Probab=46.47 E-value=75 Score=20.16 Aligned_cols=58 Identities=17% Similarity=0.273 Sum_probs=34.9
Q ss_pred ccceEEeCCCCeEEEeC--CCcEEEEEcCCCCeEEEEeCCCceeeEEEEcCCCCeEEEEeC
Q psy4903 8 SSGENRTREQALMFVAD--SNQILRTNMDGTMAMSIVSEAAYKASGVALDINAKRLFWCDN 66 (73)
Q Consensus 8 ~~g~~~dp~~g~lywtd--~~~I~ra~ldGs~~~~l~~~~~~~p~~lavD~~~~~LYW~D~ 66 (73)
+.+++++|..+.+|-+. ...|...+.++......+.. -..|..+++++.++.+|-+..
T Consensus 33 ~~~l~~~~dg~~l~~~~~~~~~v~~~d~~~~~~~~~~~~-~~~~~~~~~~~~g~~l~~~~~ 92 (300)
T TIGR03866 33 PRGITLSKDGKLLYVCASDSDTIQVIDLATGEVIGTLPS-GPDPELFALHPNGKILYIANE 92 (300)
T ss_pred CCceEECCCCCEEEEEECCCCeEEEEECCCCcEEEeccC-CCCccEEEECCCCCEEEEEcC
Confidence 45677888777777665 46677666654332222222 234677788777777765543
No 66
>COG3823 Glutamine cyclotransferase [Posttranslational modification, protein turnover, chaperones]
Probab=44.99 E-value=44 Score=22.99 Aligned_cols=29 Identities=10% Similarity=0.227 Sum_probs=20.3
Q ss_pred eccceEEeCCCCeEEEeC-C-CcEEEEEcCC
Q psy4903 7 FSSGENRTREQALMFVAD-S-NQILRTNMDG 35 (73)
Q Consensus 7 ~~~g~~~dp~~g~lywtd-~-~~I~ra~ldG 35 (73)
...|+|.||..+++|-|- . |.+..+.+++
T Consensus 230 vlNGIA~~~~~~r~~iTGK~wp~lfEVk~~~ 260 (262)
T COG3823 230 VLNGIAHDPQQDRFLITGKLWPLLFEVKLDE 260 (262)
T ss_pred cccceeecCcCCeEEEecCcCceeEEEEecC
Confidence 567888888888888775 2 6666665554
No 67
>PF10349 WWbp: WW-domain ligand protein; InterPro: IPR018826 This entry represents a sequence domain found in WW domain-binding protein that is characterised by several short PY and PT-like motifs of the PPPPY form. These appear to bind directly to the WW domains of WWP1 and WWP2 and other such diverse proteins as dystrophin and YAP (Yes-associated protein). The presence of a phosphotyrosine residue in the pWBP-1 peptide abolishes WW domain binding which suggests a potential regulatory role for tyrosine phosphorylation in modulating WW domain-ligand interactions. Given the likelihood that WWP1 and WWP2 function as E3 ubiquitin-protein ligases, it is possible that initial substrate-specific recognition occurs via WW domain-substrate protein interaction followed by ubiquitin transfer and subsequent proteolysis []. This domain lies just downstream of IPR004182 from INTERPRO in many sequences.
Probab=42.22 E-value=14 Score=22.20 Aligned_cols=15 Identities=33% Similarity=0.503 Sum_probs=12.3
Q ss_pred ceeeeeccceEEeCC
Q psy4903 2 KFKMKFSSGENRTRE 16 (73)
Q Consensus 2 ~~~~~~~~g~~~dp~ 16 (73)
+|||+|++|=++|-.
T Consensus 10 ~fKltFk~GGAieFg 24 (116)
T PF10349_consen 10 EFKLTFKSGGAIEFG 24 (116)
T ss_pred EEEEEeCCCChHHHH
Confidence 699999999887743
No 68
>PRK04043 tolB translocation protein TolB; Provisional
Probab=41.43 E-value=1.3e+02 Score=21.64 Aligned_cols=51 Identities=4% Similarity=0.122 Sum_probs=32.9
Q ss_pred EeCCCCeEEEeC----CCcEEEEEcCCCCeEEEEeCCCceeeEEEEcCCCCeEEEEeC
Q psy4903 13 RTREQALMFVAD----SNQILRTNMDGTMAMSIVSEAAYKASGVALDINAKRLFWCDN 66 (73)
Q Consensus 13 ~dp~~g~lywtd----~~~I~ra~ldGs~~~~l~~~~~~~p~~lavD~~~~~LYW~D~ 66 (73)
..|....|+++. .++|++..++|...+.+.......+ ++.+.++.|.++..
T Consensus 284 ~SPDG~~I~F~Sdr~g~~~Iy~~dl~~g~~~rlt~~g~~~~---~~SPDG~~Ia~~~~ 338 (419)
T PRK04043 284 FVEDDKRIVFVSDRLGYPNIFMKKLNSGSVEQVVFHGKNNS---SVSTYKNYIVYSSR 338 (419)
T ss_pred ECCCCCEEEEEECCCCCceEEEEECCCCCeEeCccCCCcCc---eECCCCCEEEEEEc
Confidence 456666888886 4799999999877655554332222 45556666665543
No 69
>TIGR03866 PQQ_ABC_repeats PQQ-dependent catabolism-associated beta-propeller protein. Members of this protein family consist of seven repeats each of the YVTN family beta-propeller repeat (see TIGR02276). Members occur invariably as part of a transport operon that is associated with PQQ-dependent catabolism of alcohols such as phenylethanol.
Probab=40.69 E-value=95 Score=19.68 Aligned_cols=59 Identities=15% Similarity=0.182 Sum_probs=36.2
Q ss_pred ccceEEeCCCCeEEEeC--CCcEEEEEcCCCCeEEEEeCCCceeeEEEEcCCCCeEEEEeCC
Q psy4903 8 SSGENRTREQALMFVAD--SNQILRTNMDGTMAMSIVSEAAYKASGVALDINAKRLFWCDNL 67 (73)
Q Consensus 8 ~~g~~~dp~~g~lywtd--~~~I~ra~ldGs~~~~l~~~~~~~p~~lavD~~~~~LYW~D~~ 67 (73)
+.+++++|....+|.+. ..+|..-.++.......+. .-..|.++++.+.++.||=+...
T Consensus 209 ~~~i~~s~dg~~~~~~~~~~~~i~v~d~~~~~~~~~~~-~~~~~~~~~~~~~g~~l~~~~~~ 269 (300)
T TIGR03866 209 PVGIKLTKDGKTAFVALGPANRVAVVDAKTYEVLDYLL-VGQRVWQLAFTPDEKYLLTTNGV 269 (300)
T ss_pred ccceEECCCCCEEEEEcCCCCeEEEEECCCCcEEEEEE-eCCCcceEEECCCCCEEEEEcCC
Confidence 34677788888888765 4556655554322222222 12357889998888888876543
No 70
>PF05096 Glu_cyclase_2: Glutamine cyclotransferase; InterPro: IPR007788 This family of enzymes 2.3.2.5 from EC catalyse the cyclization of free L-glutamine and N-terminal glutaminyl residues in proteins to pyroglutamate (5-oxoproline) and pyroglutamyl residues respectively []. This family includes plant and bacterial enzymes and seems unrelated to the mammalian enzymes.; PDB: 3NOK_B 2FAW_A 2IWA_A 3NOM_A 3NOL_A 3MBR_X.
Probab=40.41 E-value=29 Score=24.00 Aligned_cols=29 Identities=14% Similarity=0.205 Sum_probs=21.6
Q ss_pred eeccceEEeCCCCeEEEeC-C-CcEEEEEcC
Q psy4903 6 KFSSGENRTREQALMFVAD-S-NQILRTNMD 34 (73)
Q Consensus 6 ~~~~g~~~dp~~g~lywtd-~-~~I~ra~ld 34 (73)
-+..|||.||..+.+|-|= . +++++..+.
T Consensus 231 dVLNGIAyd~~~~~l~vTGK~Wp~lyeV~l~ 261 (264)
T PF05096_consen 231 DVLNGIAYDPETDRLFVTGKLWPKLYEVKLV 261 (264)
T ss_dssp S-EEEEEEETTTTEEEEEETT-SEEEEEEEE
T ss_pred CeeEeEeEeCCCCEEEEEeCCCCceEEEEEE
Confidence 3568999999999998875 3 777776543
No 71
>PF14251 DUF4346: Domain of unknown function (DUF4346)
Probab=40.15 E-value=20 Score=22.07 Aligned_cols=17 Identities=12% Similarity=0.071 Sum_probs=13.6
Q ss_pred ccceEEeCCCCeEEEeC
Q psy4903 8 SSGENRTREQALMFVAD 24 (73)
Q Consensus 8 ~~g~~~dp~~g~lywtd 24 (73)
-.|+++||.+|.+.=+.
T Consensus 42 ~~Gla~Dpetge~i~~~ 58 (119)
T PF14251_consen 42 DKGLAVDPETGEVIPCR 58 (119)
T ss_pred cccceeCCCCCCEEEEe
Confidence 46999999999876554
No 72
>TIGR03032 conserved hypothetical protein TIGR03032. This protein family is uncharacterized. A number of motifs are conserved perfectly among all member sequences. The function of this protein is unknown.
Probab=37.96 E-value=98 Score=22.29 Aligned_cols=46 Identities=11% Similarity=0.089 Sum_probs=31.8
Q ss_pred CCeEEEeC--CCcEEEEEcC-CCCeEEEEeCCCceeeEEEEcCCCCeEEEEeC
Q psy4903 17 QALMFVAD--SNQILRTNMD-GTMAMSIVSEAAYKASGVALDINAKRLFWCDN 66 (73)
Q Consensus 17 ~g~lywtd--~~~I~ra~ld-Gs~~~~l~~~~~~~p~~lavD~~~~~LYW~D~ 66 (73)
.|.||..| ...+.+...+ |....+ ...-..|.||+.. ++.++-.-+
T Consensus 212 dgrLwvldsgtGev~~vD~~~G~~e~V--a~vpG~~rGL~f~--G~llvVgmS 260 (335)
T TIGR03032 212 QGKLWLLNSGRGELGYVDPQAGKFQPV--AFLPGFTRGLAFA--GDFAFVGLS 260 (335)
T ss_pred CCeEEEEECCCCEEEEEcCCCCcEEEE--EECCCCCccccee--CCEEEEEec
Confidence 68889998 6889998887 544433 3344689998776 666665443
No 73
>TIGR02658 TTQ_MADH_Hv methylamine dehydrogenase heavy chain. This family consists of the heavy chain of methylamine dehydrogenase light chain, a periplasmic enzyme. The enzyme contains a tryptophan tryptophylquinone (TTQ) prothetic group derived from two Trp residues in the light subunity. The enzyme forms a complex with the type I blue copper protein amicyanin and a cytochrome. Electron transfer procedes from TQQ to the copper and then to the heme group of the cytochrome.
Probab=34.86 E-value=1.7e+02 Score=21.00 Aligned_cols=52 Identities=13% Similarity=0.316 Sum_probs=34.5
Q ss_pred eCCCCeEEEeC-CCcEEEEEcCCCCeEEE--EeC---C----CceeeE---EEEcCCCCeEEEEe
Q psy4903 14 TREQALMFVAD-SNQILRTNMDGTMAMSI--VSE---A----AYKASG---VALDINAKRLFWCD 65 (73)
Q Consensus 14 dp~~g~lywtd-~~~I~ra~ldGs~~~~l--~~~---~----~~~p~~---lavD~~~~~LYW~D 65 (73)
.|..|..+|.. ..+|.-+.+.|...... +.. . --.|.| ++++..++++|-..
T Consensus 202 ~~~dg~~~~vs~eG~V~~id~~~~~~~~~~~~~~~~~~~~~~~wrP~g~q~ia~~~dg~~lyV~~ 266 (352)
T TIGR02658 202 SNKSGRLVWPTYTGKIFQIDLSSGDAKFLPAIEAFTEAEKADGWRPGGWQQVAYHRARDRIYLLA 266 (352)
T ss_pred EcCCCcEEEEecCCeEEEEecCCCcceecceeeeccccccccccCCCcceeEEEcCCCCEEEEEe
Confidence 34467778887 78999998876543332 211 1 114555 99999999999843
No 74
>cd01213 tensin Tensin Phosphotyrosine-binding (PTB) domain. Tensin Phosphotyrosine-binding (PTB) domain. Tensin is a a focal adhesion protein, which contains a C-terminal SH2 domain followed by a PTB domain. PTB domains have a PH-like fold and are found in various eukaryotic signaling molecules. They were initially identified based upon their ability to recognize phosphorylated tyrosine residues. In contrast to SH2 domains, which recognize phosphotyrosine and adjacent carboxy-terminal residues, PTB-domain binding specificity is conferred by residues amino-terminal to the phosphotyrosine. More recent studies have found that some types of PTB domains can bind to peptides which are not tyrosine phosphorylated or lack tyrosine residues altogether.
Probab=33.29 E-value=84 Score=19.61 Aligned_cols=38 Identities=21% Similarity=0.232 Sum_probs=28.1
Q ss_pred eeeeeccceEEeCCCCeEEEeC---CCcEEEEEcCCCCeEEE
Q psy4903 3 FKMKFSSGENRTREQALMFVAD---SNQILRTNMDGTMAMSI 41 (73)
Q Consensus 3 ~~~~~~~g~~~dp~~g~lywtd---~~~I~ra~ldGs~~~~l 41 (73)
||++ ..|+.+--..+++|+-. ...|.-|.+|+.+++.+
T Consensus 48 ~kVS-~qGItLtDn~rk~ffrrhypl~~Vs~ca~dp~n~~~~ 88 (138)
T cd01213 48 FKVS-SQGITLTDNTRKKFFRRHYKVDSVIFCAIDPEERMWE 88 (138)
T ss_pred EEEE-cCCeeeeccccceeehhhCCcCeEEEEeeCCcccccc
Confidence 5666 57888755566788765 47899999999887554
No 75
>PF01011 PQQ: PQQ enzyme repeat family.; InterPro: IPR002372 Pyrrolo-quinoline quinone (PQQ) is a redox coenzyme, which serves as a cofactor for a number of enzymes (quinoproteins) and particularly for some bacterial dehydrogenases [, ]. A number of bacterial quinoproteins belong to this family. Enzymes in this group have repeats of a beta propeller.; PDB: 1H4I_C 1H4J_E 1W6S_A 2YH3_A 3PRW_A 3P1L_A 3Q7M_A 3Q7O_A 3Q7N_A 1G72_A ....
Probab=32.31 E-value=48 Score=15.41 Aligned_cols=12 Identities=0% Similarity=-0.078 Sum_probs=8.8
Q ss_pred EEeCCCCeEEEe
Q psy4903 12 NRTREQALMFVA 23 (73)
Q Consensus 12 ~~dp~~g~lywt 23 (73)
++|..+|.+-|.
T Consensus 14 AlD~~TG~~~W~ 25 (38)
T PF01011_consen 14 ALDAKTGKVLWK 25 (38)
T ss_dssp EEETTTTSEEEE
T ss_pred EEECCCCCEEEe
Confidence 567777777776
No 76
>PF08310 LGFP: LGFP repeat; InterPro: IPR013207 This 54 amino acid repeat is found in many hypothetical proteins. Several hypothetical proteins from Corynebacterium glutamicum (Brevibacterium flavum) and Corynebacterium efficiens along with PS1 protein contain this repeat region. The N-terminal region of PS1 contains an esterase domain which transfers corynomycolic acid. The C-terminal region consists of 4 tandem LGFP repeats. It is hypothesised that the PS1 proteins in Corynebacterium, when associated with the cell wall, may be anchored via the LGFP tandem repeats that may be important for maintaining cell wall integrity. Deletion of Q01377 from SWISSPROT protein results in a 10-fold increase in the cell volume of the organism and infers the corresponding involvement of the protein in the cell shape formation []. The secondary structure of each repeat is predicted to comprise two beta-strands and one alpha-helix.
Probab=30.63 E-value=58 Score=16.56 Aligned_cols=15 Identities=7% Similarity=0.244 Sum_probs=11.8
Q ss_pred ceEEeCCCCeEEEeC
Q psy4903 10 GENRTREQALMFVAD 24 (73)
Q Consensus 10 g~~~dp~~g~lywtd 24 (73)
|-.-+-.+|.|||+.
T Consensus 16 G~~q~F~~G~Iywsp 30 (54)
T PF08310_consen 16 GRYQDFQNGTIYWSP 30 (54)
T ss_pred EEEEEcCCeEEEEeC
Confidence 366677889999987
No 77
>PF02239 Cytochrom_D1: Cytochrome D1 heme domain; PDB: 1NNO_B 1HZU_A 1N15_B 1N50_A 1GJQ_A 1BL9_B 1NIR_B 1N90_B 1HZV_A 1AOQ_A ....
Probab=30.36 E-value=2e+02 Score=20.39 Aligned_cols=57 Identities=9% Similarity=0.195 Sum_probs=32.1
Q ss_pred EEeCCCCeEEEeC-CCcEEEEEcCCCCeEEEEeCCCceeeEEEEcCCCCeEEEEeCCCC
Q psy4903 12 NRTREQALMFVAD-SNQILRTNMDGTMAMSIVSEAAYKASGVALDINAKRLFWCDNLLD 69 (73)
Q Consensus 12 ~~dp~~g~lywtd-~~~I~ra~ldGs~~~~l~~~~~~~p~~lavD~~~~~LYW~D~~~~ 69 (73)
+..|-.+++|-++ +..|....+.-.....-+..+ ..|.|+++...++.+|=+....+
T Consensus 43 ~~s~Dgr~~yv~~rdg~vsviD~~~~~~v~~i~~G-~~~~~i~~s~DG~~~~v~n~~~~ 100 (369)
T PF02239_consen 43 KFSPDGRYLYVANRDGTVSVIDLATGKVVATIKVG-GNPRGIAVSPDGKYVYVANYEPG 100 (369)
T ss_dssp E-TT-SSEEEEEETTSEEEEEETTSSSEEEEEE-S-SEEEEEEE--TTTEEEEEEEETT
T ss_pred EecCCCCEEEEEcCCCeEEEEECCcccEEEEEecC-CCcceEEEcCCCCEEEEEecCCC
Confidence 3445556677666 566666665543322223333 36999999988999987765443
No 78
>PF11314 DUF3117: Protein of unknown function (DUF3117); InterPro: IPR021465 This family of proteins with unknown function appears to be restricted to Actinobacteria.
Probab=30.20 E-value=73 Score=16.61 Aligned_cols=19 Identities=11% Similarity=0.279 Sum_probs=16.0
Q ss_pred CCcEEEEEcCCCCeEEEEe
Q psy4903 25 SNQILRTNMDGTMAMSIVS 43 (73)
Q Consensus 25 ~~~I~ra~ldGs~~~~l~~ 43 (73)
...|+|..++|..|.++--
T Consensus 17 R~ivmRvPleGGGRLVvEl 35 (51)
T PF11314_consen 17 RGIVMRVPLEGGGRLVVEL 35 (51)
T ss_pred ceEEEEEecCCCcEEEEEe
Confidence 4679999999999988753
No 79
>PF02333 Phytase: Phytase; InterPro: IPR003431 Phytase (3.1.3.8 from EC) (phytate 3-phosphatase) is a secreted enzyme which hydrolyses phytate to release inorganic phosphate. This family appears to represent a novel enzyme that shows phytase activity () and has been shown to consist of a single structural unit with a six-bladed propeller folding architecture ().; GO: 0016158 3-phytase activity; PDB: 3AMS_A 3AMR_A 1QLG_A 2POO_A 1H6L_A 1CVM_A 1POO_A.
Probab=30.13 E-value=2.2e+02 Score=20.78 Aligned_cols=55 Identities=15% Similarity=0.300 Sum_probs=30.6
Q ss_pred eCCCCeEEEeC---CCcEEEEEc--CCCC--eEEEEeC--CCceeeEEEEcCCCCeEEEEeCCC
Q psy4903 14 TREQALMFVAD---SNQILRTNM--DGTM--AMSIVSE--AAYKASGVALDINAKRLFWCDNLL 68 (73)
Q Consensus 14 dp~~g~lywtd---~~~I~ra~l--dGs~--~~~l~~~--~~~~p~~lavD~~~~~LYW~D~~~ 68 (73)
+|.+|.+|-.- +..++.-.| +|.. ...++.+ .-.+|+|+++|-....||-++...
T Consensus 166 ~~~~g~~ya~v~~k~G~~~Qy~L~~~~~g~v~~~lVR~f~~~sQ~EGCVVDDe~g~LYvgEE~~ 229 (381)
T PF02333_consen 166 SPSTGALYAFVNGKDGRVEQYELTDDGDGKVSATLVREFKVGSQPEGCVVDDETGRLYVGEEDV 229 (381)
T ss_dssp -TTT--EEEEEEETTSEEEEEEEEE-TTSSEEEEEEEEEE-SS-EEEEEEETTTTEEEEEETTT
T ss_pred cCCCCcEEEEEecCCceEEEEEEEeCCCCcEeeEEEEEecCCCcceEEEEecccCCEEEecCcc
Confidence 45666555432 344444333 2332 3334433 335899999999999999998754
No 80
>KOG3567|consensus
Probab=29.04 E-value=61 Score=24.50 Aligned_cols=56 Identities=18% Similarity=0.367 Sum_probs=34.2
Q ss_pred ccceEEeCCCCeEEEeC-----------------------CCcEEEEEcCCCCeEEEE---eCCCceeeEEEEcCCCCeE
Q psy4903 8 SSGENRTREQALMFVAD-----------------------SNQILRTNMDGTMAMSIV---SEAAYKASGVALDINAKRL 61 (73)
Q Consensus 8 ~~g~~~dp~~g~lywtd-----------------------~~~I~ra~ldGs~~~~l~---~~~~~~p~~lavD~~~~~L 61 (73)
.+|++.||+...+-+-- .++|.+.... ++..+- ...+..|.||.+|. +--
T Consensus 404 v~gva~d~~~~l~vfhr~g~vwd~~sfd~~f~~~~~~pi~~~~ilvi~~~--n~~~l~~~g~~~fylphgl~~dk--dgf 479 (501)
T KOG3567|consen 404 VSGVALDPKNNLVVFHRGGHVWDGNSFDSKFVYQQIGPIEEDTILVIDPN--NAAVLQSSGKNLFYLPHGLSIDK--DGF 479 (501)
T ss_pred eeeeccCCccceEEEccCCceEeechhhhhhhhhcccccccceEEEEcCc--chhhhhhccCCceecCCcceecC--CCc
Confidence 46777788766554432 2556665555 322222 23777899999994 446
Q ss_pred EEEeCC
Q psy4903 62 FWCDNL 67 (73)
Q Consensus 62 YW~D~~ 67 (73)
||++..
T Consensus 480 ~~~tdv 485 (501)
T KOG3567|consen 480 YWVTDV 485 (501)
T ss_pred EEeecc
Confidence 787754
No 81
>PF01267 F-actin_cap_A: F-actin capping protein alpha subunit; InterPro: IPR002189 This entry represents a component of the WASH complex. The WASH complex is present at the surface of endosomes and recruits and activates the Arp2/3 complex to induce actin polymerisation. The WASH complex plays a key role in the fission of tubules that serve as transport intermediates during endosome sorting []. The WASH complex's subunit structure: F-actin-capping protein subunit alpha (CAPZA1, CAPZA2 or CAPZA3), F-actin-capping protein subunit beta (CAPZB), WASH (WASH1, WASH2P, WASH3P, WASH4P, WASH5P or WASH6P), FAM21 (FAM21A, FAM21B or FAM21C), KIAA1033, KIAA0196 (strumpellin) and CCDC53. The actin filament system, a prominent part of the cytoskeleton in eukaryotic cells, is both a static structure and a dynamic network that can undergo rearrangements: it is thought to be involved in processes such as cell movement and phagocytosis [], as well as muscle contraction. The F-actin capping protein binds in a calcium-independent manner to the fast growing ends of actin filaments (barbed end) thereby blocking the exchange of subunits at these ends. Unlike gelsolin and severin this protein does not sever actin filaments. The F-actin capping protein is a heterodimer composed of two unrelated subunits: alpha and beta (see IPR001698 from INTERPRO). Neither of the subunits shows sequence similarity to other filament-capping proteins []. The alpha subunit is a protein of about 268 to 286 amino acid residues whose sequence is well conserved in eukaryotic species [].; GO: 0003779 actin binding, 0030036 actin cytoskeleton organization, 0008290 F-actin capping protein complex; PDB: 1MQ1_D 1MWN_X 2KBM_Y 2KZ7_A 3AAE_C 3AA6_A 3LK4_G 1IZN_A 3AA7_A 3AAA_A ....
Probab=28.68 E-value=1.4e+02 Score=20.56 Aligned_cols=43 Identities=19% Similarity=0.206 Sum_probs=31.4
Q ss_pred EEEEcCCCCeEEEEeCCCceeeEEEEcCCCCeEEEEeCCCCee
Q psy4903 29 LRTNMDGTMAMSIVSEAAYKASGVALDINAKRLFWCDNLLDYI 71 (73)
Q Consensus 29 ~ra~ldGs~~~~l~~~~~~~p~~lavD~~~~~LYW~D~~~~~i 71 (73)
.-+.++|.+..+|++..-....+.-+|+.++..|-.|+....+
T Consensus 52 ~~v~~~~~~~~viis~~n~~~~~ry~Dp~~~~~F~~DHl~~~~ 94 (271)
T PF01267_consen 52 IPVKLPGSDHKVIISKYNKLGGNRYFDPRSKKSFSVDHLKQKA 94 (271)
T ss_dssp EEE--TTSSS-EEE-GGGBESTTEEEETTTTEEEEEETTTTEE
T ss_pred EEEEeCCCCcceEEecccccCCCeEEEeccCcEEEEeeeccEE
Confidence 4466888888888887555578899999999999999976654
No 82
>PF00930 DPPIV_N: Dipeptidyl peptidase IV (DPP IV) N-terminal region; InterPro: IPR002469 In the MEROPS database peptidases and peptidase homologues are grouped into clans and families. Clans are groups of families for which there is evidence of common ancestry based on a common structural fold: Each clan is identified with two letters, the first representing the catalytic type of the families included in the clan (with the letter 'P' being used for a clan containing families of more than one of the catalytic types serine, threonine and cysteine). Some families cannot yet be assigned to clans, and when a formal assignment is required, such a family is described as belonging to clan A-, C-, M-, N-, S-, T- or U-, according to the catalytic type. Some clans are divided into subclans because there is evidence of a very ancient divergence within the clan, for example MA(E), the gluzincins, and MA(M), the metzincins. Peptidase families are grouped by their catalytic type, the first character representing the catalytic type: A, aspartic; C, cysteine; G, glutamic acid; M, metallo; N, asparagine; S, serine; T, threonine; and U, unknown. The serine, threonine and cysteine peptidases utilise the amino acid as a nucleophile and form an acyl intermediate - these peptidases can also readily act as transferases. In the case of aspartic, glutamic and metallopeptidases, the nucleophile is an activated water molecule. In the case of the asparagine endopeptidases, the nucleophile is asparagine and all are self-processing endopeptidases. In many instances the structural protein fold that characterises the clan or family may have lost its catalytic activity, yet retain its function in protein recognition and binding. Proteolytic enzymes that exploit serine in their catalytic activity are ubiquitous, being found in viruses, bacteria and eukaryotes []. They include a wide range of peptidase activity, including exopeptidase, endopeptidase, oligopeptidase and omega-peptidase activity. Over 20 families (denoted S1 - S66) of serine protease have been identified, these being grouped into clans on the basis of structural similarity and other functional evidence []. Structures are known for members of the clans and the structures indicate that some appear to be totally unrelated, suggesting different evolutionary origins for the serine peptidases []. Not withstanding their different evolutionary origins, there are similarities in the reaction mechanisms of several peptidases. Chymotrypsin, subtilisin and carboxypeptidase C have a catalytic triad of serine, aspartate and histidine in common: serine acts as a nucleophile, aspartate as an electrophile, and histidine as a base []. The geometric orientations of the catalytic residues are similar between families, despite different protein folds []. The linear arrangements of the catalytic residues commonly reflect clan relationships. For example the catalytic triad in the chymotrypsin clan (PA) is ordered HDS, but is ordered DHS in the subtilisin clan (SB) and SDH in the carboxypeptidase clan (SC) [, ]. This domain defines serine peptidases belonging to MEROPS peptidase family S9 (clan SC), subfamily S9B (dipeptidyl-peptidase IV). The protein fold of the peptidase domain for members of this family resembles that of serine carboxypeptidase D, the type example of clan SC. This domain is an alignment of the region to the N-terminal side of the active site, which is found in IPR001375 from INTERPRO. CD26 (3.4.14.5 from EC) is also called adenosine deaminase-binding protein (ADA-binding protein) or dipeptidylpeptidase IV (DPP IV ectoenzyme). The exopeptidase cleaves off N-terminal X-Pro or X-Ala dipeptides from polypeptides (dipeptidyl peptidase IV activity). CD26 serves as the costimulatory molecule in T cell activation and is an associated marker of autoimmune diseases, adenosine deaminase-deficiency and HIV pathogenesis. Dipeptidyl peptidase IV (DPP IV) is responsible for the removal of N-terminal dipeptides sequentially from polypeptides having unsubstituted N termini, provided that the penultimate residue is proline. The enzyme catalyses the reaction: Dipeptidyl-Polypeptide + H(2)O = Dipeptide + Polypeptide It is a type II membrane protein that forms a homodimer. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0006508 proteolysis, 0016020 membrane; PDB: 2RIP_A 3Q8W_B 2AJL_I 1TKR_B 1TK3_B 3C45_A 2G5P_A 3G0C_D 1R9M_C 1RWQ_A ....
Probab=25.61 E-value=2.3e+02 Score=19.54 Aligned_cols=53 Identities=13% Similarity=0.157 Sum_probs=33.4
Q ss_pred CCCCeEEEeC----CCcEEEEEcCCCCeEEEEeCCCceeeEEEEcCCCCeEEEEeCC
Q psy4903 15 REQALMFVAD----SNQILRTNMDGTMAMSIVSEAAYKASGVALDINAKRLFWCDNL 67 (73)
Q Consensus 15 p~~g~lywtd----~~~I~ra~ldGs~~~~l~~~~~~~p~~lavD~~~~~LYW~D~~ 67 (73)
|..+.++|.. -..|+....+|.....|......--.-+.+|..++.||+.-..
T Consensus 245 ~~~~~~l~~s~~~G~~hly~~~~~~~~~~~lT~G~~~V~~i~~~d~~~~~iyf~a~~ 301 (353)
T PF00930_consen 245 PDGNEFLWISERDGYRHLYLYDLDGGKPRQLTSGDWEVTSILGWDEDNNRIYFTANG 301 (353)
T ss_dssp TTSSEEEEEEETTSSEEEEEEETTSSEEEESS-SSS-EEEEEEEECTSSEEEEEESS
T ss_pred CCCCEEEEEEEcCCCcEEEEEcccccceeccccCceeecccceEcCCCCEEEEEecC
Confidence 4444455554 2788888888877554443333222457889999999988764
No 83
>PRK04922 tolB translocation protein TolB; Provisional
Probab=25.03 E-value=2.6e+02 Score=19.89 Aligned_cols=55 Identities=11% Similarity=0.175 Sum_probs=34.7
Q ss_pred eEEeCCCCeEEEeC----CCcEEEEEcCCCCeEEEEeCCCceeeEEEEcCCCCeEEEEeC
Q psy4903 11 ENRTREQALMFVAD----SNQILRTNMDGTMAMSIVSEAAYKASGVALDINAKRLFWCDN 66 (73)
Q Consensus 11 ~~~dp~~g~lywtd----~~~I~ra~ldGs~~~~l~~~~~~~p~~lavD~~~~~LYW~D~ 66 (73)
....|...+|+++. .++|+..++++...+.+.... ......++.+.++.|+++..
T Consensus 297 ~~~spDG~~l~f~sd~~g~~~iy~~dl~~g~~~~lt~~g-~~~~~~~~SpDG~~Ia~~~~ 355 (433)
T PRK04922 297 PTWAPDGKSIYFTSDRGGRPQIYRVAASGGSAERLTFQG-NYNARASVSPDGKKIAMVHG 355 (433)
T ss_pred eEECCCCCEEEEEECCCCCceEEEEECCCCCeEEeecCC-CCccCEEECCCCCEEEEEEC
Confidence 34566666777764 367999988876555444322 23345667777888887654
No 84
>PLN02553 inositol-phosphate phosphatase
Probab=24.52 E-value=56 Score=21.80 Aligned_cols=15 Identities=20% Similarity=0.051 Sum_probs=13.8
Q ss_pred ceEEeCCCCeEEEeC
Q psy4903 10 GENRTREQALMFVAD 24 (73)
Q Consensus 10 g~~~dp~~g~lywtd 24 (73)
|++.+|..+.+||+.
T Consensus 121 GvV~~P~~~e~~~A~ 135 (270)
T PLN02553 121 GVVYNPILDELFTAV 135 (270)
T ss_pred EEEecCCCCCeEEEE
Confidence 788999999999987
No 85
>cd01643 Bacterial_IMPase_like_2 Bacterial family of Mg++ dependent phosphatases, related to inositol monophosphatases. These enzymes may dephosphorylate inositol monophosphate or similar substrates.
Probab=24.36 E-value=59 Score=21.44 Aligned_cols=16 Identities=25% Similarity=0.013 Sum_probs=14.0
Q ss_pred cceEEeCCCCeEEEeC
Q psy4903 9 SGENRTREQALMFVAD 24 (73)
Q Consensus 9 ~g~~~dp~~g~lywtd 24 (73)
-|++.+|..+.+||+.
T Consensus 106 ~GvV~~P~~~~~~~A~ 121 (242)
T cd01643 106 FGVIALPALNQTFVAF 121 (242)
T ss_pred EEEEecCCCCCEEEEE
Confidence 3788999999999986
No 86
>TIGR02067 his_9_proposed histidinol-phosphate phosphatase HisN, inositol monophosphatase family. This subfamily belongs to the inositol monophosphatase family (pfam00459). The members of this family consist of no more than one per species and are found only in species in which histidine is synthesized de novo but no histidinol phosphatase can be found in either of the two described families (TIGR01261, TIGR01856). In at least one species, the member of this family is found near known histidine biosynthesis genes.
Probab=24.21 E-value=62 Score=21.36 Aligned_cols=16 Identities=13% Similarity=-0.105 Sum_probs=14.0
Q ss_pred cceEEeCCCCeEEEeC
Q psy4903 9 SGENRTREQALMFVAD 24 (73)
Q Consensus 9 ~g~~~dp~~g~lywtd 24 (73)
-|++.+|..+.+||+.
T Consensus 109 ~gvv~~P~~~~~~~A~ 124 (251)
T TIGR02067 109 LGVIFQPATGERWWAA 124 (251)
T ss_pred EEEEEEcCCCCEEEEe
Confidence 3778999999999987
No 87
>PF13804 HERV-K_env_2: Retro-transcribing viruses envelope glycoprotein
Probab=23.29 E-value=2.1e+02 Score=18.58 Aligned_cols=18 Identities=28% Similarity=0.543 Sum_probs=15.3
Q ss_pred EEEEcCCCCeEEEEeCCC
Q psy4903 51 GVALDINAKRLFWCDNLL 68 (73)
Q Consensus 51 ~lavD~~~~~LYW~D~~~ 68 (73)
++.+||.=+.+||.+=..
T Consensus 147 ~~iIDWaPkg~f~~nCss 164 (170)
T PF13804_consen 147 GNIIDWAPKGMFWQNCSS 164 (170)
T ss_pred eEEEEecCCceeeeccCc
Confidence 899999999999987543
No 88
>PF02820 MBT: mbt repeat; InterPro: IPR004092 The function of the malignant brain tumor (MBT) repeat is unknown, but is found in a number of nuclear proteins involved in transcriptional repression. The repeat contains a completely conserved glutamate at its amino terminus that may be important for function. The crystal structure of the two MBT repeats of human SCM-like 2 protein has been reported. Each repeat consists of an extended "arm" and a globular core. The arm of the first repeat packs against the core of the second repeat and vice versa. The structure of the core-interacting part of each arm consists of an N-terminal alpha-helix and a turn of 310 helix connected by a short beta-strand. The core consists of an Src homology 3-like five-stranded beta-barrel followed by a C-terminal alpha-helix and another short beta-strand. Each arm interacts with its partner core in a similar way, with the orientation of the N-terminal helix relative to the barrel varying slightly. There are also extensive interactions between the two barrels [].; GO: 0006355 regulation of transcription, DNA-dependent, 0005634 nucleus; PDB: 2P0K_A 3F70_B 3CEY_A 2VYT_A 2BIV_A 1OI1_A 3OQ5_A 2RJE_B 2RJD_A 2RI3_A ....
Probab=23.10 E-value=1.4e+02 Score=16.05 Aligned_cols=14 Identities=7% Similarity=0.081 Sum_probs=11.0
Q ss_pred eEEeCCCCeEEEeC
Q psy4903 11 ENRTREQALMFVAD 24 (73)
Q Consensus 11 ~~~dp~~g~lywtd 24 (73)
+++||.+...+|..
T Consensus 4 Ea~d~~~~~~~~vA 17 (73)
T PF02820_consen 4 EAVDPRNPSLICVA 17 (73)
T ss_dssp EEEETTECCEEEEE
T ss_pred EEECCCCCCeEEEE
Confidence 57899888788864
No 89
>TIGR02647 DNA conserved hypothetical protein TIGR02647. Members of this family are found, so far, only in the Gammaproteobacteria. The function is unknown. The location on the chromosome usually is not far from housekeeping genes rather than in what is clearly, say, a prophage region. Some members have been annotated in public databases as DNA-binding protein inhibitor Id-2-related protein, putative transcriptional regulator, or hypothetical DNA binding protein.
Probab=22.96 E-value=27 Score=19.85 Aligned_cols=17 Identities=6% Similarity=0.227 Sum_probs=14.2
Q ss_pred eeccceEEeCCCCeEEEeC
Q psy4903 6 KFSSGENRTREQALMFVAD 24 (73)
Q Consensus 6 ~~~~g~~~dp~~g~lywtd 24 (73)
+|.+|+.-+|..| |.|+
T Consensus 43 LheKGLI~~pdGg--yLT~ 59 (77)
T TIGR02647 43 LHEKGLTTQPDGG--YLTS 59 (77)
T ss_pred HHHcCCccCCCCC--EecH
Confidence 6788999999998 6776
No 90
>PF14884 EFF-AFF: Type I membrane glycoproteins cell-cell fusogen
Probab=22.90 E-value=1.5e+02 Score=23.05 Aligned_cols=31 Identities=19% Similarity=0.124 Sum_probs=24.1
Q ss_pred CCcEEEEEcCCCCeEEEEeCCCceeeEEEEc
Q psy4903 25 SNQILRTNMDGTMAMSIVSEAAYKASGVALD 55 (73)
Q Consensus 25 ~~~I~ra~ldGs~~~~l~~~~~~~p~~lavD 55 (73)
.+=|.||..|++.|++.+...-..+..+.|.
T Consensus 371 ~~WIrsv~id~s~R~V~V~H~egt~v~Vsl~ 401 (589)
T PF14884_consen 371 YPWIRSVRIDSSSRQVTVTHAEGTNVQVSLH 401 (589)
T ss_pred ccceeEEEEcCCCceEEEEecCCCcEEEEEE
Confidence 3668999999999999988766666666664
No 91
>smart00564 PQQ beta-propeller repeat. Beta-propeller repeat occurring in enzymes with pyrrolo-quinoline quinone (PQQ) as cofactor, in Ire1p-like Ser/Thr kinases, and in prokaryotic dehydrogenases.
Probab=22.68 E-value=85 Score=13.50 Aligned_cols=10 Identities=0% Similarity=-0.064 Sum_probs=4.9
Q ss_pred EeCCCCeEEE
Q psy4903 13 RTREQALMFV 22 (73)
Q Consensus 13 ~dp~~g~lyw 22 (73)
+|+.+|.+-|
T Consensus 21 ~d~~~G~~~W 30 (33)
T smart00564 21 LDAKTGEILW 30 (33)
T ss_pred EEcccCcEEE
Confidence 4444555544
No 92
>cd01248 PH_PLC Phospholipase C (PLC) pleckstrin homology (PH) domain. Phospholipase C (PLC) pleckstrin homology (PH) domain. There are several isozymes of PLC (beta, gamma, delta, epsilon. zeta). While, PLC beta, gamma and delta all have N-terminal PH domains, lipid binding specificity is not conserved between them. PH domains share little sequence conservation, but all have a common fold, which is electrostatically polarized. PH domains also have diverse functions. They are often involved in targeting proteins to the plasma membrane, but few display strong specificity in lipid binding. Any specificity is usually determined by loop regions or insertions in the N-terminus of the domain, which are not conserved across all PH domains.
Probab=22.58 E-value=1.7e+02 Score=16.92 Aligned_cols=23 Identities=22% Similarity=0.252 Sum_probs=17.8
Q ss_pred CCceeeEEEEcCCCCeEEEEeCC
Q psy4903 45 AAYKASGVALDINAKRLFWCDNL 67 (73)
Q Consensus 45 ~~~~p~~lavD~~~~~LYW~D~~ 67 (73)
.-..+.-+.||..+..|||.+..
T Consensus 19 ~~~~~~~f~ld~~~~~l~W~~~~ 41 (115)
T cd01248 19 SRERRRLFRLDEKGFFLYWKDEG 41 (115)
T ss_pred CceeeEEEEEcCCCcEEEEeCCC
Confidence 34456778889999999998765
No 93
>cd00472 Ribosomal_L24e_L24 Ribosomal protein L24e/L24 is a ribosomal protein found in eukaryotes (L24) and in archaea (L24e, distinct from archaeal L24). L24e/L24 is located on the surface of the large subunit, adjacent to proteins L14 and L3, and near the translation factor binding site. L24e/L24 appears to play a role in the kinetics of peptide synthesis, and may be involved in interactions between the large and small subunits, either directly or through other factors. In mouse, a deletion mutation in L24 has been identified as the cause for the belly spot and tail (Bst) mutation that results in disrupted pigmentation, somitogenesis and retinal cell fate determination. L24 may be an important protein in eukaryotic reproduction: in shrimp, L24 expression is elevated in the ovary, suggesting a role in oogenesis, and in Arabidopsis, L24 has been proposed to have a specific function in gynoecium development. No protein with sequence or structural homology to L24e/L24 has been identifi
Probab=22.27 E-value=1.4e+02 Score=15.67 Aligned_cols=28 Identities=14% Similarity=0.153 Sum_probs=19.0
Q ss_pred ceeeeeccceEEeCCCCeEEEeCCCcEE
Q psy4903 2 KFKMKFSSGENRTREQALMFVADSNQIL 29 (73)
Q Consensus 2 ~~~~~~~~g~~~dp~~g~lywtd~~~I~ 29 (73)
+-..--.+|.-+-|-+|.+|.-.+++|.
T Consensus 2 ~~~~C~f~g~~I~PG~G~~~Vr~Dgkv~ 29 (54)
T cd00472 2 KTEKCSFCGYKIYPGHGKMYVRNDGKVF 29 (54)
T ss_pred cEEEecCcCCeecCCCccEEEecCCCEE
Confidence 3444456788899999999976544443
No 94
>TIGR01331 bisphos_cysQ 3'(2'),5'-bisphosphate nucleotidase, bacterial. Sulfate is incorporated into 3-phosphoadenylylsulfate, PAPS, for utilization in pathways such as methionine biosynthesis. Transfer of sulfate from PAPS to an acceptor leaves adenosine 3'-5'-bisphosphate, APS. This model describes a form found in bacteria of the enzyme 3'(2'),5'-bisphosphate nucleotidase, which removes the 3'-phosphate from APS to regenerate AMP and help drive the cycle.
Probab=22.15 E-value=73 Score=21.02 Aligned_cols=16 Identities=13% Similarity=0.160 Sum_probs=14.0
Q ss_pred cceEEeCCCCeEEEeC
Q psy4903 9 SGENRTREQALMFVAD 24 (73)
Q Consensus 9 ~g~~~dp~~g~lywtd 24 (73)
-|.+.+|..|.+||+.
T Consensus 111 ~gvI~~P~~~~~~~A~ 126 (249)
T TIGR01331 111 LGVVYAPATGVTYFAT 126 (249)
T ss_pred EEEEEecCCCCEEEEE
Confidence 3778999999999986
No 95
>PRK09439 PTS system glucose-specific transporter subunit; Provisional
Probab=21.73 E-value=84 Score=20.23 Aligned_cols=14 Identities=14% Similarity=0.055 Sum_probs=9.6
Q ss_pred ccceEEeCCCCeEE
Q psy4903 8 SSGENRTREQALMF 21 (73)
Q Consensus 8 ~~g~~~dp~~g~ly 21 (73)
..|+++.|..|.+|
T Consensus 48 GdGvAI~P~~~~v~ 61 (169)
T PRK09439 48 GDGIAIKPTGNKMV 61 (169)
T ss_pred cceEEEEccCCEEE
Confidence 45777777777666
No 96
>cd01638 CysQ CysQ, a 3'-Phosphoadenosine-5'-phosphosulfate (PAPS) 3'-phosphatase, is a bacterial member of the inositol monophosphatase family. It has been proposed that CysQ helps control intracellular levels of PAPS, which is an intermediate in cysteine biosynthesis (a principal route of sulfur assimilation).
Probab=21.50 E-value=72 Score=20.90 Aligned_cols=15 Identities=13% Similarity=0.040 Sum_probs=13.7
Q ss_pred ceEEeCCCCeEEEeC
Q psy4903 10 GENRTREQALMFVAD 24 (73)
Q Consensus 10 g~~~dp~~g~lywtd 24 (73)
|++.+|..+.+||+.
T Consensus 110 gvi~~P~~~~~~~A~ 124 (242)
T cd01638 110 GVVYAPALGELYYAL 124 (242)
T ss_pred EEEecCCCCCEEEEE
Confidence 788899999999987
No 97
>KOG0646|consensus
Probab=21.41 E-value=3.7e+02 Score=20.38 Aligned_cols=21 Identities=19% Similarity=0.234 Sum_probs=16.5
Q ss_pred CceeeEEEEcCCCCeEEEEeC
Q psy4903 46 AYKASGVALDINAKRLFWCDN 66 (73)
Q Consensus 46 ~~~p~~lavD~~~~~LYW~D~ 66 (73)
-..+.++++|+.+.++|-...
T Consensus 217 p~si~av~lDpae~~~yiGt~ 237 (476)
T KOG0646|consen 217 PSSIKAVALDPAERVVYIGTE 237 (476)
T ss_pred CCcceeEEEcccccEEEecCC
Confidence 345789999999999886544
No 98
>PF09917 DUF2147: Uncharacterized protein conserved in bacteria (DUF2147); InterPro: IPR019223 This domain, found in various hypothetical prokaryotic proteins, has no known function.
Probab=21.22 E-value=80 Score=18.42 Aligned_cols=15 Identities=13% Similarity=0.206 Sum_probs=11.1
Q ss_pred ccceEEeCCCCeEEE
Q psy4903 8 SSGENRTREQALMFV 22 (73)
Q Consensus 8 ~~g~~~dp~~g~lyw 22 (73)
..|-++||.+|+.|=
T Consensus 70 ~~G~iydP~~GktY~ 84 (114)
T PF09917_consen 70 KGGKIYDPEDGKTYS 84 (114)
T ss_pred cCcEEEeCCCCCEEE
Confidence 347788888888774
No 99
>COG5649 Uncharacterized conserved protein [Function unknown]
Probab=21.13 E-value=84 Score=19.42 Aligned_cols=15 Identities=20% Similarity=0.388 Sum_probs=11.7
Q ss_pred eeeeeccceEE-eCCC
Q psy4903 3 FKMKFSSGENR-TREQ 17 (73)
Q Consensus 3 ~~~~~~~g~~~-dp~~ 17 (73)
.|+.|.+|..+ ||..
T Consensus 75 vK~tFakGAsL~dPa~ 90 (132)
T COG5649 75 VKVTFAKGASLADPAP 90 (132)
T ss_pred EEEEEecccccCCCCc
Confidence 47888889888 7754
No 100
>PF14583 Pectate_lyase22: Oligogalacturonate lyase; PDB: 3C5M_C 3PE7_A.
Probab=21.02 E-value=2.9e+02 Score=20.28 Aligned_cols=43 Identities=9% Similarity=0.154 Sum_probs=20.5
Q ss_pred CCcEEEEEcCCCCeEEEEeCCCceeeEEEEcCCCCeEEEEeCC
Q psy4903 25 SNQILRTNMDGTMAMSIVSEAAYKASGVALDINAKRLFWCDNL 67 (73)
Q Consensus 25 ~~~I~ra~ldGs~~~~l~~~~~~~p~~lavD~~~~~LYW~D~~ 67 (73)
+..++..+++....+.|....-...-|..+-...+.+|++...
T Consensus 59 ~~nly~lDL~t~~i~QLTdg~g~~~~g~~~s~~~~~~~Yv~~~ 101 (386)
T PF14583_consen 59 NRNLYLLDLATGEITQLTDGPGDNTFGGFLSPDDRALYYVKNG 101 (386)
T ss_dssp S-EEEEEETTT-EEEE---SS-B-TTT-EE-TTSSEEEEEETT
T ss_pred CcceEEEEcccCEEEECccCCCCCccceEEecCCCeEEEEECC
Confidence 4667777777655555554322222245555667888777543
No 101
>PRK10757 inositol monophosphatase; Provisional
Probab=21.02 E-value=75 Score=21.25 Aligned_cols=16 Identities=13% Similarity=0.030 Sum_probs=14.1
Q ss_pred cceEEeCCCCeEEEeC
Q psy4903 9 SGENRTREQALMFVAD 24 (73)
Q Consensus 9 ~g~~~dp~~g~lywtd 24 (73)
-|.+.+|..+.+||+.
T Consensus 113 ~GvV~~P~~~~~~~A~ 128 (267)
T PRK10757 113 VAVVYDPMRNELFTAT 128 (267)
T ss_pred EEEEEcCCCCCEEEEE
Confidence 3788999999999986
Done!