Query psy13486
Match_columns 91
No_of_seqs 100 out of 254
Neff 6.1
Searched_HMMs 46136
Date Fri Aug 16 20:36:02 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy13486.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/13486hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF03022 MRJP: Major royal jel 100.0 2.2E-31 4.7E-36 199.4 8.5 91 1-91 193-286 (287)
2 PF08450 SGL: SMP-30/Gluconola 94.6 0.077 1.7E-06 37.8 4.7 56 4-59 145-202 (246)
3 PF03088 Str_synth: Strictosid 94.1 0.17 3.6E-06 32.4 5.0 49 3-58 8-74 (89)
4 PF01436 NHL: NHL repeat; Int 92.2 0.12 2.5E-06 26.0 1.7 21 41-61 1-22 (28)
5 PF00058 Ldl_recept_b: Low-den 92.1 0.66 1.4E-05 25.2 4.8 40 5-50 1-41 (42)
6 COG3386 Gluconolactonase [Carb 90.2 2 4.4E-05 32.8 7.5 80 2-82 171-253 (307)
7 PF08450 SGL: SMP-30/Gluconola 89.1 1.4 3.1E-05 31.2 5.6 49 2-58 94-152 (246)
8 smart00284 OLF Olfactomedin-li 79.1 12 0.00025 28.3 6.8 70 4-74 83-163 (255)
9 KOG4499|consensus 78.1 7.6 0.00017 29.9 5.6 60 3-62 168-233 (310)
10 PF05787 DUF839: Bacterial pro 73.2 12 0.00026 30.8 5.9 61 3-63 360-458 (524)
11 PF06739 SBBP: Beta-propeller 69.5 4.5 9.8E-05 21.6 1.9 19 40-58 11-30 (38)
12 PF07494 Reg_prop: Two compone 63.7 7.4 0.00016 18.6 1.9 16 44-59 7-23 (24)
13 PLN02919 haloacid dehalogenase 62.4 22 0.00048 31.6 5.8 52 2-57 812-875 (1057)
14 KOG2919|consensus 53.0 45 0.00097 26.7 5.5 61 3-63 59-134 (406)
15 KOG1520|consensus 52.4 22 0.00047 28.5 3.7 14 40-53 217-231 (376)
16 COG3386 Gluconolactonase [Carb 52.3 76 0.0017 24.3 6.6 54 3-63 35-89 (307)
17 TIGR02604 Piru_Ver_Nterm putat 50.0 44 0.00095 25.6 5.0 20 40-59 182-202 (367)
18 TIGR02604 Piru_Ver_Nterm putat 46.8 20 0.00044 27.4 2.7 28 31-59 4-32 (367)
19 KOG3037|consensus 46.7 77 0.0017 25.0 5.8 47 3-57 34-87 (330)
20 PLN02919 haloacid dehalogenase 46.7 48 0.001 29.6 5.3 49 5-57 752-820 (1057)
21 PF06977 SdiA-regulated: SdiA- 45.9 68 0.0015 23.8 5.3 53 3-59 181-240 (248)
22 PF01731 Arylesterase: Arylest 45.3 27 0.00058 22.0 2.7 37 13-58 34-72 (86)
23 COG4574 Eco Serine protease in 41.8 24 0.00051 24.8 2.2 29 53-82 125-153 (162)
24 PF02191 OLF: Olfactomedin-lik 40.2 1.1E+02 0.0024 22.7 5.7 71 4-74 78-158 (250)
25 PF09844 DUF2071: Uncharacteri 39.3 48 0.001 23.9 3.6 40 43-82 24-74 (217)
26 PF11763 DIPSY: Cell-wall adhe 38.8 79 0.0017 21.5 4.3 45 31-78 62-116 (123)
27 PF12690 BsuPI: Intracellular 38.3 25 0.00054 21.7 1.7 35 38-72 15-59 (82)
28 COG3211 PhoX Predicted phospha 37.4 28 0.00061 29.5 2.3 58 4-62 428-521 (616)
29 KOG4659|consensus 36.1 26 0.00056 32.8 2.0 60 3-63 417-497 (1899)
30 TIGR02276 beta_rpt_yvtn 40-res 35.7 62 0.0014 16.1 3.7 21 4-24 3-23 (42)
31 PF08268 FBA_3: F-box associat 34.7 1.3E+02 0.0027 19.3 7.4 22 4-25 5-30 (129)
32 cd00242 Ecotin Protease Inhibi 34.5 40 0.00086 23.2 2.4 28 54-82 101-128 (136)
33 PF07995 GSDH: Glucose / Sorbo 33.3 84 0.0018 23.8 4.2 53 4-59 270-324 (331)
34 PRK11028 6-phosphogluconolacto 32.8 1.8E+02 0.0038 21.3 5.8 53 6-62 241-295 (330)
35 PF12216 m04gp34like: Immune e 31.7 35 0.00077 26.0 1.9 20 42-61 135-156 (272)
36 smart00135 LY Low-density lipo 31.6 73 0.0016 15.7 2.9 21 3-23 19-39 (43)
37 COG3040 Blc Bacterial lipocali 31.1 1.1E+02 0.0025 21.9 4.3 29 25-59 108-140 (174)
38 COG4014 Uncharacterized protei 31.0 77 0.0017 20.6 3.1 29 32-60 14-43 (97)
39 PF06977 SdiA-regulated: SdiA- 30.1 59 0.0013 24.1 2.9 37 44-80 173-211 (248)
40 TIGR02167 Liste_lipo_26 bacter 29.5 16 0.00035 18.0 -0.2 14 11-24 7-20 (26)
41 COG2813 RsmC 16S RNA G1207 met 29.3 39 0.00084 26.2 1.8 31 51-81 259-297 (300)
42 PRK03719 ecotin; Provisional 29.3 53 0.0011 23.4 2.4 23 59-82 137-159 (166)
43 PF04683 Proteasom_Rpn13: Prot 27.9 1.6E+02 0.0034 18.3 4.3 48 3-58 15-70 (85)
44 KOG1214|consensus 27.0 1.3E+02 0.0028 27.4 4.6 53 3-57 1101-1174(1289)
45 KOG0289|consensus 26.3 1.7E+02 0.0038 24.3 5.1 60 3-67 399-460 (506)
46 cd03855 M14_ASTE Peptidase M14 23.0 82 0.0018 24.1 2.6 35 9-50 277-311 (322)
47 PF10761 DUF2590: Protein of u 22.9 1.7E+02 0.0037 19.3 3.7 28 31-59 60-90 (102)
48 COG2706 3-carboxymuconate cycl 22.7 3E+02 0.0065 21.9 5.6 52 6-59 257-309 (346)
49 TIGR03032 conserved hypothetic 21.7 3.6E+02 0.0078 21.4 5.9 52 4-63 212-263 (335)
50 PF14398 ATPgrasp_YheCD: YheC/ 21.4 61 0.0013 23.9 1.6 21 42-62 211-233 (262)
51 PRK05324 succinylglutamate des 21.1 98 0.0021 23.8 2.7 35 9-50 280-314 (329)
No 1
>PF03022 MRJP: Major royal jelly protein; InterPro: IPR003534 The major royal jelly proteins (MRJPs) comprise 12.5% of the mass, and 82-90% of the protein content [], of honeybee (Apis mellifera) royal jelly. Royal jelly is a substance secreted by the cephalic glands of nurse bees [] and it is used to trigger development of a queen bee from a bee larva. The biological function of the MRJPs is unknown, but they are believed to play a major role in nutrition due to their high essential amino acid content []. Two royal jelly proteins, MRJP3 and MRJP5, contain a tandem repeat that results from a high genetic variablility. This polymorphism may be useful for genotyping individual bees [].; PDB: 3Q6P_B 3Q6K_A 3Q6T_A 2QE8_B.
Probab=99.97 E-value=2.2e-31 Score=199.45 Aligned_cols=91 Identities=36% Similarity=0.739 Sum_probs=75.9
Q ss_pred CCccCeEEEEeCCCCcEEEeeCCCCCCCCceeeEeecCCeeeeccceEEeC---CEEEEEEeCCcceecCCCCCCcceEE
Q psy13486 1 MDHNGILYFNLIDRNAVGCWNSQYPYKPENIGHIDIDNEALIFPSDVKVVG---DDLWVISDRMPIHLESELNFNDVNFR 77 (91)
Q Consensus 1 ~d~~gvlff~lv~~~aI~cWn~~~~~~~~n~~iia~d~~~l~f~sdlkid~---~~lWvlsnrl~~f~~~~l~~~e~NfR 77 (91)
||++|+|||++++++||+|||+++||+++|+.+|++|+++|+||++|+|++ |+||++|||||+|++++++++|+|||
T Consensus 193 ~D~~G~ly~~~~~~~aI~~w~~~~~~~~~~~~~l~~d~~~l~~pd~~~i~~~~~g~L~v~snrl~~~~~~~~~~~e~nfr 272 (287)
T PF03022_consen 193 IDPNGNLYFTDVEQNAIGCWDPDGPYTPENFEILAQDPRTLQWPDGLKIDPEGDGYLWVLSNRLQRFAYGNLNPDEVNFR 272 (287)
T ss_dssp EETTTEEEEEECCCTEEEEEETTTSB-GCCEEEEEE-CC-GSSEEEEEE-T--TS-EEEEE-S--SSTGGG---SS-SEE
T ss_pred ECCCCcEEEecCCCCeEEEEeCCCCcCccchheeEEcCceeeccceeeeccccCceEEEEECcchHhhccCCCCCccEEe
Confidence 579999999999999999999999999999999999999999999999995 89999999999999999999999999
Q ss_pred EEecCHHHHHhcCC
Q psy13486 78 IFTVPLQEAVREQT 91 (91)
Q Consensus 78 I~~~~~~~~i~gt~ 91 (91)
|+++++++||+||+
T Consensus 273 i~~~~v~~~~~~t~ 286 (287)
T PF03022_consen 273 IWRARVDELIKGTV 286 (287)
T ss_dssp EEES-HHHHHTTST
T ss_pred EEEeCHHHHhcCCC
Confidence 99999999999996
No 2
>PF08450 SGL: SMP-30/Gluconolaconase/LRE-like region; InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE. ; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=94.60 E-value=0.077 Score=37.79 Aligned_cols=56 Identities=14% Similarity=0.043 Sum_probs=41.6
Q ss_pred cCeEEEEeCCCCcEEEeeCCCCC-CCCceeeEeecCCeeeeccceEEe-CCEEEEEEe
Q psy13486 4 NGILYFNLIDRNAVGCWNSQYPY-KPENIGHIDIDNEALIFPSDVKVV-GDDLWVISD 59 (91)
Q Consensus 4 ~gvlff~lv~~~aI~cWn~~~~~-~~~n~~iia~d~~~l~f~sdlkid-~~~lWvlsn 59 (91)
...||++......|..++...+- ...+.++++.-+...-+|+||.+| +|+|||-..
T Consensus 145 g~~lyv~ds~~~~i~~~~~~~~~~~~~~~~~~~~~~~~~g~pDG~~vD~~G~l~va~~ 202 (246)
T PF08450_consen 145 GKTLYVADSFNGRIWRFDLDADGGELSNRRVFIDFPGGPGYPDGLAVDSDGNLWVADW 202 (246)
T ss_dssp SSEEEEEETTTTEEEEEEEETTTCCEEEEEEEEE-SSSSCEEEEEEEBTTS-EEEEEE
T ss_pred chheeecccccceeEEEeccccccceeeeeeEEEcCCCCcCCCcceEcCCCCEEEEEc
Confidence 34799999999999999997543 245666776555554589999999 999999743
No 3
>PF03088 Str_synth: Strictosidine synthase; InterPro: IPR018119 This entry represents a conserved region found in strictosidine synthase (4.3.3.2 from EC), a key enzyme in alkaloid biosynthesis. It catalyses the Pictet-Spengler stereospecific condensation of tryptamine with secologanin to form strictosidine []. The structure of the native enzyme from the Indian medicinal plant Rauvolfia serpentina (Serpentwood) (Devilpepper) represents the first example of a six-bladed four-stranded beta-propeller fold from the plant kingdom [].; GO: 0016844 strictosidine synthase activity, 0009058 biosynthetic process; PDB: 2FPB_A 2V91_B 2FP8_A 3V1S_B 2FPC_A 2VAQ_A 2FP9_B.
Probab=94.11 E-value=0.17 Score=32.42 Aligned_cols=49 Identities=14% Similarity=0.287 Sum_probs=30.4
Q ss_pred ccCeEEEEeCCC-----------------CcEEEeeCCCCCCCCceeeEeecCCeeeeccceEEe-CCEEEEEE
Q psy13486 3 HNGILYFNLIDR-----------------NAVGCWNSQYPYKPENIGHIDIDNEALIFPSDVKVV-GDDLWVIS 58 (91)
Q Consensus 3 ~~gvlff~lv~~-----------------~aI~cWn~~~~~~~~n~~iia~d~~~l~f~sdlkid-~~~lWvls 58 (91)
++|.+||+..+. ..+.+||+.+. -..+++. .|.||+|+.+. ++...+++
T Consensus 8 ~~g~vYfTdsS~~~~~~~~~~~~le~~~~GRll~ydp~t~----~~~vl~~---~L~fpNGVals~d~~~vlv~ 74 (89)
T PF03088_consen 8 DTGTVYFTDSSSRYDRRDWVYDLLEGRPTGRLLRYDPSTK----ETTVLLD---GLYFPNGVALSPDESFVLVA 74 (89)
T ss_dssp TT--EEEEES-SS--TTGHHHHHHHT---EEEEEEETTTT----EEEEEEE---EESSEEEEEE-TTSSEEEEE
T ss_pred CCCEEEEEeCccccCccceeeeeecCCCCcCEEEEECCCC----eEEEehh---CCCccCeEEEcCCCCEEEEE
Confidence 349999998753 34777777765 3445554 38899999998 66544433
No 4
>PF01436 NHL: NHL repeat; InterPro: IPR001258 The NHL repeat, named after NCL-1, HT2A and Lin-41, is found largely in a large number of eukaryotic and prokaryotic proteins. For example, the repeat is found in a variety of enzymes of the copper type II, ascorbate-dependent monooxygenase family which catalyse the C terminus alpha-amidation of biological peptides []. In many it occurs in tandem arrays, for example in the ringfinger beta-box, coiled-coil (RBCC) eukaryotic growth regulators []. The 'Brain Tumor' protein (Brat) is one such growth regulator that contains a 6-bladed NHL-repeat beta-propeller [, ]. The NHL repeats are also found in serine/threonine protein kinase (STPK) in diverse range of pathogenic bacteria. These STPK are transmembrane receptors with a intracellular N-terminal kinase domain and extracellular C-terminal sensor domain. In the STPK, PknD, from Mycobacterium tuberculosis, the sensor domain forms a rigid, six-bladed b-propeller composed of NHL repeats with a flexible tether to the transmembrane domain.; GO: 0005515 protein binding; PDB: 3FVZ_A 3FW0_A 1RWL_A 1RWI_A 1Q7F_A.
Probab=92.23 E-value=0.12 Score=25.98 Aligned_cols=21 Identities=19% Similarity=0.398 Sum_probs=16.7
Q ss_pred eeeccceEEe-CCEEEEEEeCC
Q psy13486 41 LIFPSDVKVV-GDDLWVISDRM 61 (91)
Q Consensus 41 l~f~sdlkid-~~~lWvlsnrl 61 (91)
|.+|.|+.++ +|++||.-..-
T Consensus 1 f~~P~gvav~~~g~i~VaD~~n 22 (28)
T PF01436_consen 1 FNYPHGVAVDSDGNIYVADSGN 22 (28)
T ss_dssp BSSEEEEEEETTSEEEEEECCC
T ss_pred CcCCcEEEEeCCCCEEEEECCC
Confidence 4589999999 99999965433
No 5
>PF00058 Ldl_recept_b: Low-density lipoprotein receptor repeat class B; InterPro: IPR000033 The low-density lipoprotein receptor (LDLR) is the major cholesterol-carrying lipoprotein of plasma, acting to regulate cholesterol homeostasis in mammalian cells. The LDL receptor binds LDL and transports it into cells by acidic endocytosis. In order to be internalized, the receptor-ligand complex must first cluster into clathrin-coated pits. Once inside the cell, the LDLR separates from its ligand, which is degraded in the lysosomes, while the receptor returns to the cell surface []. The internal dissociation of the LDLR with its ligand is mediated by proton pumps within the walls of the endosome that lower the pH. The LDLR is a multi-domain protein, containing: The ligand-binding domain contains seven or eight 40-amino acid LDLR class A (cysteine-rich) repeats, each of which contains a coordinated calcium ion and six cysteine residues involved in disulphide bond formation []. Similar domains have been found in other extracellular and membrane proteins []. The second conserved region contains two EGF repeats, followed by six LDLR class B (YWTD) repeats, and another EGF repeat. The LDLR class B repeats each contain a conserved YWTD motif, and is predicted to form a beta-propeller structure []. This region is critical for ligand release and recycling of the receptor []. The third domain is rich in serine and threonine residues and contains clustered O-linked carbohydrate chains. The fourth domain is the hydrophobic transmembrane region. The fifth domain is the cytoplasmic tail that directs the receptor to clathrin-coated pits. LDLR is closely related in structure to several other receptors, including LRP1, LRP1b, megalin/LRP2, VLDL receptor, lipoprotein receptor, MEGF7/LRP4, and LRP8/apolipoprotein E receptor2); these proteins participate in a wide range of physiological processes, including the regulation of lipid metabolism, protection against atherosclerosis, neurodevelopment, and transport of nutrients and vitamins []. This entry represents the LDLR classB (YWTD) repeat, the structure of which has been solved []. The six YWTD repeats together fold into a six-bladed beta-propeller. Each blade of the propeller consists of four antiparallel beta-strands; the innermost strand of each blade is labeled 1 and the outermost strand, 4. The sequence repeats are offset with respect to the blades of the propeller, such that any given 40-residue YWTD repeat spans strands 24 of one propeller blade and strand 1 of the subsequent blade. This offset ensures circularization of the propeller because the last strand of the final sequence repeat acts as an innermost strand 1 of the blade that harbors strands 24 from the first sequence repeat. The repeat is found in a variety of proteins that include, vitellogenin receptor from Drosophila melanogaster, low-density lipoprotein (LDL) receptor [], preproepidermal growth factor, and nidogen (entactin).; PDB: 3S2K_A 3S8Z_A 3S8V_B 4A0P_A 3SOB_B 3S94_B 4DG6_A 3SOV_A 3SOQ_A 1NPE_A ....
Probab=92.07 E-value=0.66 Score=25.24 Aligned_cols=40 Identities=20% Similarity=0.267 Sum_probs=33.5
Q ss_pred CeEEEEeCCCC-cEEEeeCCCCCCCCceeeEeecCCeeeeccceEEe
Q psy13486 5 GILYFNLIDRN-AVGCWNSQYPYKPENIGHIDIDNEALIFPSDVKVV 50 (91)
Q Consensus 5 gvlff~lv~~~-aI~cWn~~~~~~~~n~~iia~d~~~l~f~sdlkid 50 (91)
|-||++..... .|.+=+.... +.++|.+++ ++-|.||.||
T Consensus 1 ~~iYWtD~~~~~~I~~a~~dGs----~~~~vi~~~--l~~P~giaVD 41 (42)
T PF00058_consen 1 GKIYWTDWSQDPSIERANLDGS----NRRTVISDD--LQHPEGIAVD 41 (42)
T ss_dssp TEEEEEETTTTEEEEEEETTST----SEEEEEESS--TSSEEEEEEE
T ss_pred CEEEEEECCCCcEEEEEECCCC----CeEEEEECC--CCCcCEEEEC
Confidence 56899999999 8888887765 677888776 8899999986
No 6
>COG3386 Gluconolactonase [Carbohydrate transport and metabolism]
Probab=90.19 E-value=2 Score=32.83 Aligned_cols=80 Identities=15% Similarity=0.108 Sum_probs=48.4
Q ss_pred CccC-eEEEEeCCCCcEEEeeCC-CCCCCCceeeEeecCCeeeeccceEEe-CCEEEEEEeCCcceecCCCCCCcceEEE
Q psy13486 2 DHNG-ILYFNLIDRNAVGCWNSQ-YPYKPENIGHIDIDNEALIFPSDVKVV-GDDLWVISDRMPIHLESELNFNDVNFRI 78 (91)
Q Consensus 2 d~~g-vlff~lv~~~aI~cWn~~-~~~~~~n~~iia~d~~~l~f~sdlkid-~~~lWvlsnrl~~f~~~~l~~~e~NfRI 78 (91)
+.+| .||++.-..+.|..+... ..-...+.+..+.-.+.-..|+|+.+| +|++|+ +---.-.--..++++..-.+.
T Consensus 171 SpDg~tly~aDT~~~~i~r~~~d~~~g~~~~~~~~~~~~~~~G~PDG~~vDadG~lw~-~a~~~g~~v~~~~pdG~l~~~ 249 (307)
T COG3386 171 SPDGKTLYVADTPANRIHRYDLDPATGPIGGRRGFVDFDEEPGLPDGMAVDADGNLWV-AAVWGGGRVVRFNPDGKLLGE 249 (307)
T ss_pred CCCCCEEEEEeCCCCeEEEEecCcccCccCCcceEEEccCCCCCCCceEEeCCCCEEE-ecccCCceEEEECCCCcEEEE
Confidence 4345 999999999999888776 333334444333333345689999999 999997 222221222334454444444
Q ss_pred EecC
Q psy13486 79 FTVP 82 (91)
Q Consensus 79 ~~~~ 82 (91)
++.+
T Consensus 250 i~lP 253 (307)
T COG3386 250 IKLP 253 (307)
T ss_pred EECC
Confidence 4443
No 7
>PF08450 SGL: SMP-30/Gluconolaconase/LRE-like region; InterPro: IPR013658 This family describes a region that is found in proteins expressed by a variety of eukaryotic and prokaryotic species. These proteins include various enzymes, such as senescence marker protein 30 (SMP-30, Q15493 from SWISSPROT), gluconolactonase (Q01578 from SWISSPROT) and luciferin-regenerating enzyme (LRE, Q86DU5 from SWISSPROT). SMP-30 is known to hydrolyse diisopropyl phosphorofluoridate in the liver, and has been noted as having sequence similarity, in the region described in this family, with PON1 (P52430 from SWISSPROT) and LRE. ; PDB: 2GHS_A 2DG0_L 2DG1_D 2DSO_D 3E5Z_A 2IAT_A 2IAV_A 2GVV_A 3HLI_A 2GVU_A ....
Probab=89.10 E-value=1.4 Score=31.24 Aligned_cols=49 Identities=20% Similarity=0.266 Sum_probs=34.2
Q ss_pred CccCeEEEEeCCC--------CcEEEeeCCCCCCCCceeeEeecCCeeeeccceEEe-CC-EEEEEE
Q psy13486 2 DHNGILYFNLIDR--------NAVGCWNSQYPYKPENIGHIDIDNEALIFPSDVKVV-GD-DLWVIS 58 (91)
Q Consensus 2 d~~gvlff~lv~~--------~aI~cWn~~~~~~~~n~~iia~d~~~l~f~sdlkid-~~-~lWvls 58 (91)
|++|.|||+.... .+|.+++.... ...++. .+.+|+||.++ +| .|||-.
T Consensus 94 d~~G~ly~t~~~~~~~~~~~~g~v~~~~~~~~-----~~~~~~---~~~~pNGi~~s~dg~~lyv~d 152 (246)
T PF08450_consen 94 DPDGNLYVTDSGGGGASGIDPGSVYRIDPDGK-----VTVVAD---GLGFPNGIAFSPDGKTLYVAD 152 (246)
T ss_dssp -TTS-EEEEEECCBCTTCGGSEEEEEEETTSE-----EEEEEE---EESSEEEEEEETTSSEEEEEE
T ss_pred cCCCCEEEEecCCCccccccccceEEECCCCe-----EEEEec---CcccccceEECCcchheeecc
Confidence 6789999998866 46899988722 334443 48899999999 65 577643
No 8
>smart00284 OLF Olfactomedin-like domains.
Probab=79.12 E-value=12 Score=28.31 Aligned_cols=70 Identities=24% Similarity=0.310 Sum_probs=41.2
Q ss_pred cCeEEEEeCCCCcEEEeeCCCCCCCCceeeE--eecCCeeeec----c--ceEEeCCEEEEEE---eCCcceecCCCCCC
Q psy13486 4 NGILYFNLIDRNAVGCWNSQYPYKPENIGHI--DIDNEALIFP----S--DVKVVGDDLWVIS---DRMPIHLESELNFN 72 (91)
Q Consensus 4 ~gvlff~lv~~~aI~cWn~~~~~~~~n~~ii--a~d~~~l~f~----s--dlkid~~~lWvls---nrl~~f~~~~l~~~ 72 (91)
+|.|||.--+...|...|-.+.-... ..++ |-=++...|- + |+.+||.-|||+= ..-.+..-++||+.
T Consensus 83 ngslYY~~~~s~~iiKydL~t~~v~~-~~~Lp~a~y~~~~~Y~~~~~sdiDlAvDE~GLWvIYat~~~~g~ivvSkLnp~ 161 (255)
T smart00284 83 NGSLYFNKFNSHDICRFDLTTETYQK-EPLLNGAGYNNRFPYAWGGFSDIDLAVDENGLWVIYATEQNAGKIVISKLNPA 161 (255)
T ss_pred CceEEEEecCCccEEEEECCCCcEEE-EEecCccccccccccccCCCccEEEEEcCCceEEEEeccCCCCCEEEEeeCcc
Confidence 68999999999999999998763211 1111 1001122331 2 4455566799992 22244666788865
Q ss_pred cc
Q psy13486 73 DV 74 (91)
Q Consensus 73 e~ 74 (91)
..
T Consensus 162 tL 163 (255)
T smart00284 162 TL 163 (255)
T ss_pred cc
Confidence 43
No 9
>KOG4499|consensus
Probab=78.07 E-value=7.6 Score=29.89 Aligned_cols=60 Identities=13% Similarity=0.071 Sum_probs=38.8
Q ss_pred ccCeEEEEeCCCCcEEEeeCCCCC-CCCceeeEee----cCCeeeeccceEEe-CCEEEEEEeCCc
Q psy13486 3 HNGILYFNLIDRNAVGCWNSQYPY-KPENIGHIDI----DNEALIFPSDVKVV-GDDLWVISDRMP 62 (91)
Q Consensus 3 ~~gvlff~lv~~~aI~cWn~~~~~-~~~n~~iia~----d~~~l~f~sdlkid-~~~lWvlsnrl~ 62 (91)
+...+||.--..-.|-.|+-.-|- .-.|-++|.. .+..-++|+||.|| +|+|||-+=.--
T Consensus 168 d~K~fY~iDsln~~V~a~dyd~~tG~~snr~~i~dlrk~~~~e~~~PDGm~ID~eG~L~Va~~ng~ 233 (310)
T KOG4499|consen 168 DAKKFYYIDSLNYEVDAYDYDCPTGDLSNRKVIFDLRKSQPFESLEPDGMTIDTEGNLYVATFNGG 233 (310)
T ss_pred cCcEEEEEccCceEEeeeecCCCcccccCcceeEEeccCCCcCCCCCCcceEccCCcEEEEEecCc
Confidence 455778888888888888843220 1112233332 23356799999999 999999764433
No 10
>PF05787 DUF839: Bacterial protein of unknown function (DUF839); InterPro: IPR008557 This family consists of bacterial proteins of unknown function.
Probab=73.17 E-value=12 Score=30.75 Aligned_cols=61 Identities=15% Similarity=0.287 Sum_probs=43.1
Q ss_pred ccCeEEEEeCCCC-------------------cEEEeeCCCC---CCCCceee--Eeec-------------CCeeeecc
Q psy13486 3 HNGILYFNLIDRN-------------------AVGCWNSQYP---YKPENIGH--IDID-------------NEALIFPS 45 (91)
Q Consensus 3 ~~gvlff~lv~~~-------------------aI~cWn~~~~---~~~~n~~i--ia~d-------------~~~l~f~s 45 (91)
.+|.+||++-+-. .|.+|..... .+.-...+ ++-+ ...+--|+
T Consensus 360 ~~g~vY~a~T~~~~r~~~~~~~~n~~~~n~~G~I~r~~~~~~d~~~~~f~~~~~~~~g~~~~~~~~~~~~~~~~~f~sPD 439 (524)
T PF05787_consen 360 DDGEVYFALTNNSGRGESDVDAANPRAGNGYGQIYRYDPDGNDHAATTFTWELFLVGGDPTDASGNGSNKCDDNGFASPD 439 (524)
T ss_pred CCCEEEEEEecCCCCcccccccCCcccCCcccEEEEecccCCccccceeEEEEEEEecCcccccccccCcccCCCcCCCC
Confidence 5788999987766 7899987653 22222333 3333 45677899
Q ss_pred ceEEe-CCEEEEEEeCCcc
Q psy13486 46 DVKVV-GDDLWVISDRMPI 63 (91)
Q Consensus 46 dlkid-~~~lWvlsnrl~~ 63 (91)
.|.+| .|+||+-++.=..
T Consensus 440 NL~~d~~G~LwI~eD~~~~ 458 (524)
T PF05787_consen 440 NLAFDPDGNLWIQEDGGGS 458 (524)
T ss_pred ceEECCCCCEEEEeCCCCC
Confidence 99999 8999999987664
No 11
>PF06739 SBBP: Beta-propeller repeat; InterPro: IPR010620 This family is related to IPR001680 from INTERPRO and is likely to also form a beta-propeller. SBBP stands for Seven Bladed Beta Propeller.
Probab=69.53 E-value=4.5 Score=21.59 Aligned_cols=19 Identities=16% Similarity=0.309 Sum_probs=15.6
Q ss_pred eeeeccceEEe-CCEEEEEE
Q psy13486 40 ALIFPSDVKVV-GDDLWVIS 58 (91)
Q Consensus 40 ~l~f~sdlkid-~~~lWvls 58 (91)
...+|.+|.+| +|++||.-
T Consensus 11 ~~~~~~~IavD~~GNiYv~G 30 (38)
T PF06739_consen 11 AQDYGNGIAVDSNGNIYVTG 30 (38)
T ss_pred CceeEEEEEECCCCCEEEEE
Confidence 34589999999 89999864
No 12
>PF07494 Reg_prop: Two component regulator propeller; InterPro: IPR011110 A large group of two component regulator proteins appear to have the same N-terminal structure of 14 tandem repeats. These repeats show homology to members of IPR002372 from INTERPRO and IPR001680 from INTERPRO indicating that they are likely to form a beta-propeller. This family has been built with artificially high cut-offs in order to avoid overlaps with other beta-propeller families. The fourteen repeats are likely to form two propellers; it is not clear if these structures are likely to recruit other proteins or interact with DNA.; PDB: 3V9F_D 3VA6_B 3OTT_B 4A2M_D 4A2L_B.
Probab=63.73 E-value=7.4 Score=18.60 Aligned_cols=16 Identities=13% Similarity=0.212 Sum_probs=10.7
Q ss_pred ccceEEe-CCEEEEEEe
Q psy13486 44 PSDVKVV-GDDLWVISD 59 (91)
Q Consensus 44 ~sdlkid-~~~lWvlsn 59 (91)
+.+|..| +|+||+=|+
T Consensus 7 I~~i~~D~~G~lWigT~ 23 (24)
T PF07494_consen 7 IYSIYEDSDGNLWIGTY 23 (24)
T ss_dssp EEEEEE-TTSCEEEEET
T ss_pred EEEEEEcCCcCEEEEeC
Confidence 4456667 889998765
No 13
>PLN02919 haloacid dehalogenase-like hydrolase family protein
Probab=62.42 E-value=22 Score=31.58 Aligned_cols=52 Identities=12% Similarity=0.133 Sum_probs=37.8
Q ss_pred CccCeEEEEeCCCCcEEEeeCCCCCCCCceeeEee-----------cCCeeeeccceEEe-CCEEEEE
Q psy13486 2 DHNGILYFNLIDRNAVGCWNSQYPYKPENIGHIDI-----------DNEALIFPSDVKVV-GDDLWVI 57 (91)
Q Consensus 2 d~~gvlff~lv~~~aI~cWn~~~~~~~~n~~iia~-----------d~~~l~f~sdlkid-~~~lWvl 57 (91)
|.+|.+|.+......|.-|+..+.. ...++- ....+..|.||.++ +|.|||-
T Consensus 812 d~dG~LYVADs~N~rIrviD~~tg~----v~tiaG~G~~G~~dG~~~~a~l~~P~GIavd~dG~lyVa 875 (1057)
T PLN02919 812 AKDGQIYVADSYNHKIKKLDPATKR----VTTLAGTGKAGFKDGKALKAQLSEPAGLALGENGRLFVA 875 (1057)
T ss_pred eCCCcEEEEECCCCEEEEEECCCCe----EEEEeccCCcCCCCCcccccccCCceEEEEeCCCCEEEE
Confidence 5678899999999999999987642 111221 12345689999999 8888774
No 14
>KOG2919|consensus
Probab=52.99 E-value=45 Score=26.75 Aligned_cols=61 Identities=18% Similarity=0.326 Sum_probs=43.9
Q ss_pred ccCeEEEEeCCCCcEEEeeCCCC-CCCC-ceeeEeecCCeeeeccceEE-----------e--CCEEEEEEeCCcc
Q psy13486 3 HNGILYFNLIDRNAVGCWNSQYP-YKPE-NIGHIDIDNEALIFPSDVKV-----------V--GDDLWVISDRMPI 63 (91)
Q Consensus 3 ~~gvlff~lv~~~aI~cWn~~~~-~~~~-n~~iia~d~~~l~f~sdlki-----------d--~~~lWvlsnrl~~ 63 (91)
.+|.-+.++...+.+.|||.-.. |.+. -..+.+.-...+.++.|..| + +.+||+.|.|-+.
T Consensus 59 PDGSciL~~sedn~l~~~nlP~dlys~~~~~~~~~~~~~~~r~~eg~tvydy~wYs~M~s~qP~t~l~a~ssr~~P 134 (406)
T KOG2919|consen 59 PDGSCILSLSEDNCLNCWNLPFDLYSKKADGPLNFSKHLSYRYQEGETVYDYCWYSRMKSDQPSTNLFAVSSRDQP 134 (406)
T ss_pred CCCceEEeecccCeeeEEecChhhcccCCCCccccccceeEEeccCCEEEEEEeeeccccCCCccceeeeccccCc
Confidence 57888899999999999998654 4444 34455555556666666655 3 4579999999875
No 15
>KOG1520|consensus
Probab=52.36 E-value=22 Score=28.45 Aligned_cols=14 Identities=21% Similarity=0.532 Sum_probs=11.2
Q ss_pred eeeeccceEEe-CCE
Q psy13486 40 ALIFPSDVKVV-GDD 53 (91)
Q Consensus 40 ~l~f~sdlkid-~~~ 53 (91)
.|.||+|+... ++.
T Consensus 217 ~L~F~NGlaLS~d~s 231 (376)
T KOG1520|consen 217 GLYFPNGLALSPDGS 231 (376)
T ss_pred cccccccccCCCCCC
Confidence 58899999987 554
No 16
>COG3386 Gluconolactonase [Carbohydrate transport and metabolism]
Probab=52.35 E-value=76 Score=24.26 Aligned_cols=54 Identities=13% Similarity=0.111 Sum_probs=38.9
Q ss_pred ccCeEEEEeCCCCcEEEeeCCCCCCCCceeeEeecCCeeeeccceEEe-CCEEEEEEeCCcc
Q psy13486 3 HNGILYFNLIDRNAVGCWNSQYPYKPENIGHIDIDNEALIFPSDVKVV-GDDLWVISDRMPI 63 (91)
Q Consensus 3 ~~gvlff~lv~~~aI~cWn~~~~~~~~n~~iia~d~~~l~f~sdlkid-~~~lWvlsnrl~~ 63 (91)
+.+.|||.-+....|-+|+..+. ...+...+.. ++++..++ +|.|+++.-.+..
T Consensus 35 ~~~~L~w~DI~~~~i~r~~~~~g-----~~~~~~~p~~--~~~~~~~d~~g~Lv~~~~g~~~ 89 (307)
T COG3386 35 DRGALLWVDILGGRIHRLDPETG-----KKRVFPSPGG--FSSGALIDAGGRLIACEHGVRL 89 (307)
T ss_pred CCCEEEEEeCCCCeEEEecCCcC-----ceEEEECCCC--cccceeecCCCeEEEEccccEE
Confidence 78899999999999999998743 1233344433 58888888 7777777665543
No 17
>TIGR02604 Piru_Ver_Nterm putative membrane-bound dehydrogenase domain. All proteins that score above the trusted cutoff score of 45 to this model are large proteins of either Pirellula sp. 1 or Verrucomicrobium spinosum. These proteins all contain, in addition to this domain, several hundred residues of highly variable sequence, and then a well-conserved C-terminal domain (TIGR02603) that features a putative cytochrome c-type heme binding motif CXXCH. The membrane-bound L-sorbosone dehydrogenase from Acetobacter liquefaciens (Gluconacetobacter liquefaciens) is homologous to this domain but lacks additional sequence regions shared by members of this family and belongs to a different clade of the larger family of homologs. It and its closely related homologs are excluded from the this model by scoring between the trusted (45) and noise (18) cutoffs.
Probab=50.04 E-value=44 Score=25.56 Aligned_cols=20 Identities=15% Similarity=0.165 Sum_probs=16.8
Q ss_pred eeeeccceEEe-CCEEEEEEe
Q psy13486 40 ALIFPSDVKVV-GDDLWVISD 59 (91)
Q Consensus 40 ~l~f~sdlkid-~~~lWvlsn 59 (91)
.++.|.|+.++ +|++|+--|
T Consensus 182 G~rnp~Gl~~d~~G~l~~tdn 202 (367)
T TIGR02604 182 GFQNPYGHSVDSWGDVFFCDN 202 (367)
T ss_pred CcCCCccceECCCCCEEEEcc
Confidence 47889999999 889998666
No 18
>TIGR02604 Piru_Ver_Nterm putative membrane-bound dehydrogenase domain. All proteins that score above the trusted cutoff score of 45 to this model are large proteins of either Pirellula sp. 1 or Verrucomicrobium spinosum. These proteins all contain, in addition to this domain, several hundred residues of highly variable sequence, and then a well-conserved C-terminal domain (TIGR02603) that features a putative cytochrome c-type heme binding motif CXXCH. The membrane-bound L-sorbosone dehydrogenase from Acetobacter liquefaciens (Gluconacetobacter liquefaciens) is homologous to this domain but lacks additional sequence regions shared by members of this family and belongs to a different clade of the larger family of homologs. It and its closely related homologs are excluded from the this model by scoring between the trusted (45) and noise (18) cutoffs.
Probab=46.79 E-value=20 Score=27.40 Aligned_cols=28 Identities=21% Similarity=0.267 Sum_probs=23.2
Q ss_pred eeeEeecCCeeeeccceEEe-CCEEEEEEe
Q psy13486 31 IGHIDIDNEALIFPSDVKVV-GDDLWVISD 59 (91)
Q Consensus 31 ~~iia~d~~~l~f~sdlkid-~~~lWvlsn 59 (91)
+.++|..| .+.-|-+|.+| +|+|||...
T Consensus 4 ~~l~A~~p-~~~~P~~ia~d~~G~l~V~e~ 32 (367)
T TIGR02604 4 VTLFAAEP-LLRNPIAVCFDERGRLWVAEG 32 (367)
T ss_pred EEEEECCC-ccCCCceeeECCCCCEEEEeC
Confidence 45777777 57899999999 899999874
No 19
>KOG3037|consensus
Probab=46.74 E-value=77 Score=25.00 Aligned_cols=47 Identities=21% Similarity=0.598 Sum_probs=32.4
Q ss_pred ccCeEEEEeCCCCcE-EEeeCCCCCCCCceeeEeecCCeeeeccceEE---e---CCEEEEE
Q psy13486 3 HNGILYFNLIDRNAV-GCWNSQYPYKPENIGHIDIDNEALIFPSDVKV---V---GDDLWVI 57 (91)
Q Consensus 3 ~~gvlff~lv~~~aI-~cWn~~~~~~~~n~~iia~d~~~l~f~sdlki---d---~~~lWvl 57 (91)
+.|.+|-..-+..-| .||--+..-+.| |+ .++||++..+ + .|.+|+|
T Consensus 34 rKGli~i~~sddgliHF~WkdR~~g~VE-------dD-lIifPde~eF~kV~qC~tGRVY~L 87 (330)
T KOG3037|consen 34 RKGLIYIKRSDDGLIHFCWKDRESGNVE-------DD-LIVFPDEAEFKKVDQCKTGRVYVL 87 (330)
T ss_pred ccceEEEeeCCCceEEEEecccCCCCcc-------cc-eEEccCceeEEECCCCCCCcEEEE
Confidence 788888877765554 699776542222 33 7789999855 4 5789987
No 20
>PLN02919 haloacid dehalogenase-like hydrolase family protein
Probab=46.70 E-value=48 Score=29.56 Aligned_cols=49 Identities=12% Similarity=0.102 Sum_probs=33.6
Q ss_pred CeEEEEeCCCCcEEEeeCCCCCCCCceeeEee-------------------cCCeeeeccceEEe-CCEEEEE
Q psy13486 5 GILYFNLIDRNAVGCWNSQYPYKPENIGHIDI-------------------DNEALIFPSDVKVV-GDDLWVI 57 (91)
Q Consensus 5 gvlff~lv~~~aI~cWn~~~~~~~~n~~iia~-------------------d~~~l~f~sdlkid-~~~lWvl 57 (91)
+.||.+....+.|..|++.+.= ..+++- ....++.|.|+.++ +|.+||-
T Consensus 752 ~~LYVADs~n~~Irv~D~~tg~----~~~~~gg~~~~~~~l~~fG~~dG~g~~~~l~~P~Gvavd~dG~LYVA 820 (1057)
T PLN02919 752 KELYIADSESSSIRALDLKTGG----SRLLAGGDPTFSDNLFKFGDHDGVGSEVLLQHPLGVLCAKDGQIYVA 820 (1057)
T ss_pred CEEEEEECCCCeEEEEECCCCc----EEEEEecccccCcccccccCCCCchhhhhccCCceeeEeCCCcEEEE
Confidence 3599999999999999986531 111110 11246789999998 7888774
No 21
>PF06977 SdiA-regulated: SdiA-regulated; InterPro: IPR009722 This entry represents a conserved region approximately 100 residues long within a number of hypothetical bacterial proteins that may be regulated by SdiA, a member of the LuxR family of transcriptional regulators []. Some proteins contain the IPR001258 from INTERPRO repeat.; PDB: 3QQZ_A.
Probab=45.86 E-value=68 Score=23.83 Aligned_cols=53 Identities=13% Similarity=0.240 Sum_probs=25.2
Q ss_pred ccCeEEEEeCCCCcEEEeeCCCCCCCCceeeEeecC------CeeeeccceEEe-CCEEEEEEe
Q psy13486 3 HNGILYFNLIDRNAVGCWNSQYPYKPENIGHIDIDN------EALIFPSDVKVV-GDDLWVISD 59 (91)
Q Consensus 3 ~~gvlff~lv~~~aI~cWn~~~~~~~~n~~iia~d~------~~l~f~sdlkid-~~~lWvlsn 59 (91)
++|-||-=-=....|.+.+.+... ...+.-+. +.+.=|-||.+| +|+|||.|=
T Consensus 181 ~t~~lliLS~es~~l~~~d~~G~~----~~~~~L~~g~~gl~~~~~QpEGIa~d~~G~LYIvsE 240 (248)
T PF06977_consen 181 RTGHLLILSDESRLLLELDRQGRV----VSSLSLDRGFHGLSKDIPQPEGIAFDPDGNLYIVSE 240 (248)
T ss_dssp TTTEEEEEETTTTEEEEE-TT--E----EEEEE-STTGGG-SS---SEEEEEE-TT--EEEEET
T ss_pred CCCeEEEEECCCCeEEEECCCCCE----EEEEEeCCcccCcccccCCccEEEECCCCCEEEEcC
Confidence 455554444444556677654331 22222222 344558999999 999999884
No 22
>PF01731 Arylesterase: Arylesterase; InterPro: IPR002640 The serum paraoxonases/arylesterases are enzymes that catalyse the hydrolysis of the toxic metabolites of a variety of organophosphorus insecticides. The enzymes hydrolyse a broad spectrum of organophosphate substrates, including paraoxon and a number of aromatic carboxylic acid esters (e.g., phenyl acetate), and hence confer resistance to organophosphate toxicity []. Mammals have 3 distinct paraoxonase types, termed PON1-3 [, ]. In mice and humans, the PON genes are found on the same chromosome in close proximity. PON activity has been found in variety of tissues, with highest levels in liver and serum - the source of serum PON is thought to be the liver. Unlike mammals, fish and avian species lack paraoxonase activity. Human and rabbit PONs appear to have two distinct Ca2+ binding sites, one required for stability and one required for catalytic activity. The Ca2+ dependency of PONs suggests a mechanism of hydrolysis where Ca2+ acts as the electrophillic catalyst, like that proposed for phospholipase A2. The paraoxonase enzymes, PON1 and PON3, are high density lipoprotein (HDL)- associated proteins capable of preventing oxidative modification of low density lipoproteins (LPL) []. Although PON2 has oxidative properties, the enzyme does not associate with HDL. Within a given species, PON1, PON2 and PON3 share ~60% amino acid sequence identity, whereas between mammalian species particular PONs (1,2 or 3) share 79-90% identity at the amino acid level. Human PON1 and PON3 share numerous conserved phosphorylation and N-glycosylation sites; however, it is not known whether the PON proteins are modified at these sites, or whether modification at these sites is required for activity in vivo []. This family consists of arylesterases (Also known as serum paraoxonase) 3.1.1.2 from EC. These enzymes hydrolyse organophosphorus esters such as paraoxon and are found in the liver and blood. They confer resistance to organophosphate toxicity []. Human arylesterase (PON1) P27169 from SWISSPROT is associated with HDL and may protect against LDL oxidation [].; GO: 0004064 arylesterase activity
Probab=45.31 E-value=27 Score=22.00 Aligned_cols=37 Identities=14% Similarity=0.258 Sum_probs=22.7
Q ss_pred CCCcEEEeeCCCCCCCCceeeEeecCCeeeeccceEEe-C-CEEEEEE
Q psy13486 13 DRNAVGCWNSQYPYKPENIGHIDIDNEALIFPSDVKVV-G-DDLWVIS 58 (91)
Q Consensus 13 ~~~aI~cWn~~~~~~~~n~~iia~d~~~l~f~sdlkid-~-~~lWvls 58 (91)
....|..|+..+ ..++|. .+.||+||.++ + ..|+|=+
T Consensus 34 ~~~~Vvyyd~~~------~~~va~---g~~~aNGI~~s~~~k~lyVa~ 72 (86)
T PF01731_consen 34 PWGNVVYYDGKE------VKVVAS---GFSFANGIAISPDKKYLYVAS 72 (86)
T ss_pred CCceEEEEeCCE------eEEeec---cCCCCceEEEcCCCCEEEEEe
Confidence 345566665432 334443 47899999998 4 4577643
No 23
>COG4574 Eco Serine protease inhibitor ecotin [General function prediction only]
Probab=41.77 E-value=24 Score=24.77 Aligned_cols=29 Identities=21% Similarity=0.449 Sum_probs=22.1
Q ss_pred EEEEEEeCCcceecCCCCCCcceEEEEecC
Q psy13486 53 DLWVISDRMPIHLESELNFNDVNFRIFTVP 82 (91)
Q Consensus 53 ~lWvlsnrl~~f~~~~l~~~e~NfRI~~~~ 82 (91)
.+----+|||...|.. .--|+.||||++.
T Consensus 125 ~~lrYNSkLPIVVY~P-~~VeVkYRIW~A~ 153 (162)
T COG4574 125 GMLRYNSKLPIVVYTP-DNVDVKYRIWKAE 153 (162)
T ss_pred hheeecCcCCEEEEcC-CCceEEEEEeech
Confidence 3455567999999865 4458999999985
No 24
>PF02191 OLF: Olfactomedin-like domain; InterPro: IPR003112 The olfactomedin-domain was first identified in olfactomedin, an extracellular matrix protein of the olfactory neuroepithelium []. Members of this extracellular domain-family have since been shown to be present in several metazoan proteins, such as latrophilins, myocilins, optimedins and noelins, the latter being involved in the generation of neural crest cells. Myocilin is of considerable interest, as mutations in its olfactomedin-domain can lead to glaucoma []. The olfactomedin-domains in myocilin and optimedin are essential for the interaction between these two proteins [].; GO: 0005515 protein binding
Probab=40.21 E-value=1.1e+02 Score=22.74 Aligned_cols=71 Identities=20% Similarity=0.258 Sum_probs=42.3
Q ss_pred cCeEEEEeCCCCcEEEeeCCCCCCCCceee--Eeec-CCeeeecc----ceEEeCCEEEEE---EeCCcceecCCCCCCc
Q psy13486 4 NGILYFNLIDRNAVGCWNSQYPYKPENIGH--IDID-NEALIFPS----DVKVVGDDLWVI---SDRMPIHLESELNFND 73 (91)
Q Consensus 4 ~gvlff~lv~~~aI~cWn~~~~~~~~n~~i--ia~d-~~~l~f~s----dlkid~~~lWvl---snrl~~f~~~~l~~~e 73 (91)
+|.|||.--+...|...|.++.-...-..+ .+.. .-.+.+.+ |+.+||.-|||+ ...--+..-++||++.
T Consensus 78 ngslYY~~~~s~~IvkydL~t~~v~~~~~L~~A~~~n~~~y~~~~~t~iD~AvDE~GLWvIYat~~~~g~ivvskld~~t 157 (250)
T PF02191_consen 78 NGSLYYNKYNSRNIVKYDLTTRSVVARRELPGAGYNNRFPYYWSGYTDIDFAVDENGLWVIYATEDNNGNIVVSKLDPET 157 (250)
T ss_pred CCcEEEEecCCceEEEEECcCCcEEEEEECCccccccccceecCCCceEEEEEcCCCEEEEEecCCCCCcEEEEeeCccc
Confidence 689999999999999999987632211111 1111 11222221 334455569998 3333356677888764
Q ss_pred c
Q psy13486 74 V 74 (91)
Q Consensus 74 ~ 74 (91)
.
T Consensus 158 L 158 (250)
T PF02191_consen 158 L 158 (250)
T ss_pred C
Confidence 3
No 25
>PF09844 DUF2071: Uncharacterized conserved protein (COG2071); InterPro: IPR018644 This conserved protein, similar to YgjF, has no known function.
Probab=39.29 E-value=48 Score=23.94 Aligned_cols=40 Identities=23% Similarity=0.488 Sum_probs=26.1
Q ss_pred eccceEEe--CCEEEE------EEeCCcceec---CCCCCCcceEEEEecC
Q psy13486 43 FPSDVKVV--GDDLWV------ISDRMPIHLE---SELNFNDVNFRIFTVP 82 (91)
Q Consensus 43 f~sdlkid--~~~lWv------lsnrl~~f~~---~~l~~~e~NfRI~~~~ 82 (91)
-|.++.+| +|.-|| |.+==++.+- .--++.|+|.|.+--.
T Consensus 24 lP~gl~lD~~dG~a~vsvV~F~~~~~r~~g~P~~p~~~~F~eiNvRtYV~~ 74 (217)
T PF09844_consen 24 LPAGLELDTFDGKAWVSVVPFRMKDVRPRGLPPIPLGSTFPEINVRTYVRD 74 (217)
T ss_pred CCCCCCCeEECCEEEEEEEEEEEeccccccCCCccccCCCCceEEEEEEEe
Confidence 69999999 887775 3332222222 2346789999998643
No 26
>PF11763 DIPSY: Cell-wall adhesin ligand-binding C-terminal; InterPro: IPR021746 The DIPSY domain is characterised by the distinctive D*I*PSY motif at the very C terminus of yeast cell-wall glycoproteins. It appears not to be conserved in any other species, however. In fungi, cell adhesion is required for flocculation, mating and virulence, and is mediated by covalently bound cell wall proteins termed adhesins. Map4, an adhesin required for mating in Schizosaccharomyces pombe, is N-glycosylated and O-glycosylated, and is an endogenous substrate for the mannosyl transferase Oma4p. Map4 has a modular structure with an N-terminal signal peptide, a serine and threonine (S/T)-rich domain that includes nine repeats of 36 amino acids (rich in serine and threonine residues, but lacking glutamines), and a C-terminal DIPSY domain with no glycosyl-phosphatidyl inositol (GPI)-anchor signal. The N-terminal S/T-rich regions, are required for cell wall attachment, but the C-terminal DIPSY domain is required for agglutination and mating in liquid and solid media [].
Probab=38.77 E-value=79 Score=21.52 Aligned_cols=45 Identities=24% Similarity=0.462 Sum_probs=28.0
Q ss_pred eeeEeecCCeeee---------ccceEEe-CCEEEEEEeCCcceecCCCCCCcceEEE
Q psy13486 31 IGHIDIDNEALIF---------PSDVKVV-GDDLWVISDRMPIHLESELNFNDVNFRI 78 (91)
Q Consensus 31 ~~iia~d~~~l~f---------~sdlkid-~~~lWvlsnrl~~f~~~~l~~~e~NfRI 78 (91)
..|+-.++-++|| |-.+++. +|.+|+-|+-. ..+.-..++.|||-
T Consensus 62 f~I~kn~dG~~~Ft~~e~~~~ep~~l~~l~dgri~~ts~~~---d~~~~~~~~~~~RA 116 (123)
T PF11763_consen 62 FNIIKNNDGTYQFTFVESSFSEPLDLHTLSDGRIWFTSNEY---DFSPYYFDNENFRA 116 (123)
T ss_pred EEEEecCCCcEEEEEcccCCCCcEEEEEecCCcEEEEcccc---cCCcccccccceee
Confidence 4444445556555 6677888 89999976543 34445556677773
No 27
>PF12690 BsuPI: Intracellular proteinase inhibitor; InterPro: IPR020481 BsuPI is a intracellular proteinase inhibitor that directly regulates the major intracellular proteinase (ISP-1) activity in vivo. It inhibits ISP-1 in the early stages of sporulation and then may be inactivated by a membrane-bound proteinase [].; PDB: 3ISY_A.
Probab=38.32 E-value=25 Score=21.75 Aligned_cols=35 Identities=26% Similarity=0.299 Sum_probs=18.0
Q ss_pred CCeeeeccceEEe------CC-EEEEEEeCCc---ceecCCCCCC
Q psy13486 38 NEALIFPSDVKVV------GD-DLWVISDRMP---IHLESELNFN 72 (91)
Q Consensus 38 ~~~l~f~sdlkid------~~-~lWvlsnrl~---~f~~~~l~~~ 72 (91)
+.+|+|++|.+.| +| .+|--|...- .+..-.|.+.
T Consensus 15 ~v~l~f~sgq~~D~~v~d~~g~~vwrwS~~~~FtQal~~~~l~pG 59 (82)
T PF12690_consen 15 PVTLQFPSGQRYDFVVKDKEGKEVWRWSDGKMFTQALQEETLEPG 59 (82)
T ss_dssp -EEEEESSS--EEEEEE-TT--EEEETTTT-------EEEEE-TT
T ss_pred eEEEEeCCCCEEEEEEECCCCCEEEEecCCchhhheeeEEEECCC
Confidence 4589999999986 33 6888776532 2334445543
No 28
>COG3211 PhoX Predicted phosphatase [General function prediction only]
Probab=37.45 E-value=28 Score=29.53 Aligned_cols=58 Identities=17% Similarity=0.285 Sum_probs=38.9
Q ss_pred cCeEEEEeCCCC----------------cEEEeeCCC------CCCCCceeeEeecCCeeee-------------ccceE
Q psy13486 4 NGILYFNLIDRN----------------AVGCWNSQY------PYKPENIGHIDIDNEALIF-------------PSDVK 48 (91)
Q Consensus 4 ~gvlff~lv~~~----------------aI~cWn~~~------~~~~~n~~iia~d~~~l~f-------------~sdlk 48 (91)
+|-+||++-+-. .|..|-..+ +|+=+ +=+.+.|+..+.- |++|.
T Consensus 428 ~g~Vy~~lTNn~~r~~~~aNpr~~n~~G~I~r~~p~~~d~t~~~ftWd-lF~~aG~~~~~~~~~~~~~~~~~f~~PDnl~ 506 (616)
T COG3211 428 TGEVYFTLTNNGKRSDDAANPRAKNGYGQIVRWIPATGDHTDTKFTWD-LFVEAGNPSVLEGGASANINANWFNSPDNLA 506 (616)
T ss_pred cceEEEEeCCCCccccccCCCcccccccceEEEecCCCCccCccceee-eeeecCCccccccccccCcccccccCCCceE
Confidence 345777776654 366665543 22222 3356677777777 99999
Q ss_pred Ee-CCEEEEEEeCCc
Q psy13486 49 VV-GDDLWVISDRMP 62 (91)
Q Consensus 49 id-~~~lWvlsnrl~ 62 (91)
+| .|.||+-+++--
T Consensus 507 fD~~GrLWi~TDg~~ 521 (616)
T COG3211 507 FDPWGRLWIQTDGSG 521 (616)
T ss_pred ECCCCCEEEEecCCC
Confidence 99 999999887643
No 29
>KOG4659|consensus
Probab=36.07 E-value=26 Score=32.83 Aligned_cols=60 Identities=22% Similarity=0.203 Sum_probs=40.5
Q ss_pred ccCeEEEEeCCCCcEEEeeCCCCC-CCCceeeE-------------------eecCCeeeeccceEEe-CCEEEEEEeCC
Q psy13486 3 HNGILYFNLIDRNAVGCWNSQYPY-KPENIGHI-------------------DIDNEALIFPSDVKVV-GDDLWVISDRM 61 (91)
Q Consensus 3 ~~gvlff~lv~~~aI~cWn~~~~~-~~~n~~ii-------------------a~d~~~l~f~sdlkid-~~~lWvlsnrl 61 (91)
=+|.||-+.-....|..=.+-.|- ...|..+| |+|+ .|.||-||.+| .|.||+.--++
T Consensus 417 vdgtlyvSdp~s~qv~rv~sl~~~d~~~N~evvaG~Ge~Clp~desCGDGalA~dA-~L~~PkGIa~dk~g~lYfaD~t~ 495 (1899)
T KOG4659|consen 417 VDGTLYVSDPLSKQVWRVSSLEPQDSRNNYEVVAGDGEVCLPADESCGDGALAQDA-QLIFPKGIAFDKMGNLYFADGTR 495 (1899)
T ss_pred cCceEEecCCCcceEEEeccCCccccccCeeEEeccCcCccccccccCcchhcccc-eeccCCceeEccCCcEEEecccE
Confidence 468888887777777555443331 23455555 4556 78999999999 99999865544
Q ss_pred cc
Q psy13486 62 PI 63 (91)
Q Consensus 62 ~~ 63 (91)
-+
T Consensus 496 IR 497 (1899)
T KOG4659|consen 496 IR 497 (1899)
T ss_pred EE
Confidence 43
No 30
>TIGR02276 beta_rpt_yvtn 40-residue YVTN family beta-propeller repeat. This repeat of about 40 amino acids is found in up to 14 copies per protein. Archaea Methanosarcina mazei and Methanosarcina acetivorans each have over 10 genes that encode tandem copies of this repeat, which is also found in other species. PSIPRED predicts with high confidence that each 40-residue repeats contains four beta strands. This model overlaps somewhat with the NHL repeat (Pfam pfam01436) and also shows sequence similarity to the WD domain, G-beta repeat (Pfam pfam00400).
Probab=35.70 E-value=62 Score=16.15 Aligned_cols=21 Identities=19% Similarity=0.203 Sum_probs=17.0
Q ss_pred cCeEEEEeCCCCcEEEeeCCC
Q psy13486 4 NGILYFNLIDRNAVGCWNSQY 24 (91)
Q Consensus 4 ~gvlff~lv~~~aI~cWn~~~ 24 (91)
.+-||-+....+.|...|+.+
T Consensus 3 ~~~lyv~~~~~~~v~~id~~~ 23 (42)
T TIGR02276 3 GTKLYVTNSGSNTVSVIDTAT 23 (42)
T ss_pred CCEEEEEeCCCCEEEEEECCC
Confidence 456888888889999998854
No 31
>PF08268 FBA_3: F-box associated domain; InterPro: IPR013187 This domain occurs in a diverse superfamily of genes in plants. Most examples are found C-terminal to an F-box (IPR001810 from INTERPRO), a 60 amino acid motif involved in ubiquitination of target proteins to mark them for degradation. Two-hybid experiments support the idea that most members are interchangeable F-box subunits of SCF E3 complexes []. Some members have two copies of this domain.
Probab=34.68 E-value=1.3e+02 Score=19.33 Aligned_cols=22 Identities=23% Similarity=0.477 Sum_probs=19.0
Q ss_pred cCeEEEEeCC----CCcEEEeeCCCC
Q psy13486 4 NGILYFNLID----RNAVGCWNSQYP 25 (91)
Q Consensus 4 ~gvlff~lv~----~~aI~cWn~~~~ 25 (91)
+|+|||.-.. ...|.|-|..++
T Consensus 5 nGvly~~a~~~~~~~~~IvsFDv~~E 30 (129)
T PF08268_consen 5 NGVLYWLAWSEDSDNNVIVSFDVRSE 30 (129)
T ss_pred CcEEEeEEEECCCCCcEEEEEEcCCc
Confidence 7999999765 899999999875
No 32
>cd00242 Ecotin Protease Inhibitor Ecotin; homodimeric protease inhibitor. Protease Inhibitor Ecotin; homodimeric protease inhibitor which binds two chymotrypsin-like serine proteases to form a heterotetramer. Found in bacterial periplasm. Inhibits a broad range of serine proteases including collagenase, trypsin, chymotrypsin, elastase, and factor Xa but not thrombin. Inhibition mechanism involves binding at two different protease contact sites: the primary and secondary binding sites. Primary site loops of ecotin bind to the active site of target proteases in a substrate-like manner with the P1 residue in ecotin mimicking the interactions of a canonical P1 substrate residue.
Probab=34.51 E-value=40 Score=23.24 Aligned_cols=28 Identities=21% Similarity=0.457 Sum_probs=20.1
Q ss_pred EEEEEeCCcceecCCCCCCcceEEEEecC
Q psy13486 54 LWVISDRMPIHLESELNFNDVNFRIFTVP 82 (91)
Q Consensus 54 lWvlsnrl~~f~~~~l~~~e~NfRI~~~~ 82 (91)
+.==.+|||..+|-.-+ -|+.||||++.
T Consensus 101 ~~rYnSklPiVvY~P~~-~evrYRiW~~~ 128 (136)
T cd00242 101 MLRYNSRLPIVVYTPKD-VEVRYRIWKAG 128 (136)
T ss_pred eEecCCCCCEEEECCCC-CEEEEEEEecc
Confidence 33334688888886533 48999999986
No 33
>PF07995 GSDH: Glucose / Sorbosone dehydrogenase; InterPro: IPR012938 Proteins containing this domain are thought to be glucose/sorbosone dehydrogenases. The best characterised of these proteins is soluble glucose dehydrogenase (P13650 from SWISSPROT) from Acinetobacter calcoaceticus, which oxidises glucose to gluconolactone. The enzyme is a calcium-dependent homodimer which uses PQQ as a cofactor [].; GO: 0016901 oxidoreductase activity, acting on the CH-OH group of donors, quinone or similar compound as acceptor, 0048038 quinone binding, 0005975 carbohydrate metabolic process; PDB: 2ISM_A 2WG3_D 3HO5_A 3HO4_A 3HO3_A 2WFT_A 2WG4_B 2WFX_B 1CRU_A 1CQ1_B ....
Probab=33.32 E-value=84 Score=23.80 Aligned_cols=53 Identities=15% Similarity=0.053 Sum_probs=27.0
Q ss_pred cCeEEEEeCCCCcEEEeeCCCCCCCCceeeEeecCCeee-eccceEEe-CCEEEEEEe
Q psy13486 4 NGILYFNLIDRNAVGCWNSQYPYKPENIGHIDIDNEALI-FPSDVKVV-GDDLWVISD 59 (91)
Q Consensus 4 ~gvlff~lv~~~aI~cWn~~~~~~~~n~~iia~d~~~l~-f~sdlkid-~~~lWvlsn 59 (91)
.|-+|++......|..=.....-.......++ .... -|.|+.+. +|+|||.++
T Consensus 270 ~g~~~~~~~~~~~i~~~~~~~~~~~~~~~~~~---~~~~~r~~~v~~~pDG~Lyv~~d 324 (331)
T PF07995_consen 270 RGDLFVADYGGGRIWRLDLDEDGSVTEEEEFL---GGFGGRPRDVAQGPDGALYVSDD 324 (331)
T ss_dssp TTEEEEEETTTTEEEEEEEETTEEEEEEEEEC---TTSSS-EEEEEEETTSEEEEEE-
T ss_pred cCcEEEecCCCCEEEEEeeecCCCccceEEcc---ccCCCCceEEEEcCCCeEEEEEC
Confidence 35566666665555544432221111111112 1122 46788888 999999987
No 34
>PRK11028 6-phosphogluconolactonase; Provisional
Probab=32.85 E-value=1.8e+02 Score=21.28 Aligned_cols=53 Identities=8% Similarity=0.013 Sum_probs=32.8
Q ss_pred eEEEEeCCCCcEEEeeCCCCCCCCceeeEeecCCeeeeccceEEe-CC-EEEEEEeCCc
Q psy13486 6 ILYFNLIDRNAVGCWNSQYPYKPENIGHIDIDNEALIFPSDVKVV-GD-DLWVISDRMP 62 (91)
Q Consensus 6 vlff~lv~~~aI~cWn~~~~~~~~n~~iia~d~~~l~f~sdlkid-~~-~lWvlsnrl~ 62 (91)
.||-+.-..+.|..|+..+. .....++..-+ .-..|.++.++ +| .||+ +|+..
T Consensus 241 ~lyv~~~~~~~I~v~~i~~~--~~~~~~~~~~~-~~~~p~~~~~~~dg~~l~v-a~~~~ 295 (330)
T PRK11028 241 HLYACDRTASLISVFSVSED--GSVLSFEGHQP-TETQPRGFNIDHSGKYLIA-AGQKS 295 (330)
T ss_pred EEEEecCCCCeEEEEEEeCC--CCeEEEeEEEe-ccccCCceEECCCCCEEEE-EEccC
Confidence 56666666788999988643 11233444433 23589999998 66 4555 66644
No 35
>PF12216 m04gp34like: Immune evasion protein; InterPro: IPR022022 The proteins in this family are related to the m04 encoded protein gp34 of pathogenic microorganisms such as Murid herpesvirus 1. m06 and m152 genes are expressed earlier in the intracellular replication phases of these microorganism' life cycles. They function to inhibit MHC-1 loading and export. gp34 is theorized to prevent immune reactions from NK cells which would ordinarily recognise and attack cells lacking MHC.
Probab=31.68 E-value=35 Score=25.99 Aligned_cols=20 Identities=20% Similarity=0.194 Sum_probs=17.1
Q ss_pred eeccceEEe--CCEEEEEEeCC
Q psy13486 42 IFPSDVKVV--GDDLWVISDRM 61 (91)
Q Consensus 42 ~f~sdlkid--~~~lWvlsnrl 61 (91)
.=++|++|| .|+|+|-+|--
T Consensus 135 ~~~~Gf~Vd~~sG~L~i~snat 156 (272)
T PF12216_consen 135 GNKDGFKVDPSSGNLYISSNAT 156 (272)
T ss_pred CCCCCeEEcCCCceEEEecCcc
Confidence 468999999 69999998865
No 36
>smart00135 LY Low-density lipoprotein-receptor YWTD domain. Type "B" repeats in low-density lipoprotein (LDL) receptor that plays a central role in mammalian cholesterol metabolism. Also present in a variety of molecules similar to gp300/megalin.
Probab=31.64 E-value=73 Score=15.70 Aligned_cols=21 Identities=19% Similarity=0.232 Sum_probs=16.5
Q ss_pred ccCeEEEEeCCCCcEEEeeCC
Q psy13486 3 HNGILYFNLIDRNAVGCWNSQ 23 (91)
Q Consensus 3 ~~gvlff~lv~~~aI~cWn~~ 23 (91)
..+.|||+...+..|.+-+..
T Consensus 19 ~~~~lYw~D~~~~~I~~~~~~ 39 (43)
T smart00135 19 IEGRLYWTDWGLDVIEVANLD 39 (43)
T ss_pred cCCEEEEEeCCCCEEEEEeCC
Confidence 567888888888888887664
No 37
>COG3040 Blc Bacterial lipocalin [Cell envelope biogenesis, outer membrane]
Probab=31.14 E-value=1.1e+02 Score=21.95 Aligned_cols=29 Identities=24% Similarity=0.500 Sum_probs=19.8
Q ss_pred CCCCCceeeEeecCCeeeeccceEEe----CCEEEEEEe
Q psy13486 25 PYKPENIGHIDIDNEALIFPSDVKVV----GDDLWVISD 59 (91)
Q Consensus 25 ~~~~~n~~iia~d~~~l~f~sdlkid----~~~lWvlsn 59 (91)
||. -+-.|++.|+|. ...|. .++||+||=
T Consensus 108 pF~-g~Y~Vl~~d~eY-----s~aiVgsPdr~ylWlLsR 140 (174)
T COG3040 108 PFY-GDYWVLALDPEY-----SWAIVGSPDREYLWLLSR 140 (174)
T ss_pred Ccc-ccEEEEEECCCc-----cEEEEeCCCcceEEEEec
Confidence 555 567899999974 23343 458999983
No 38
>COG4014 Uncharacterized protein conserved in archaea [Function unknown]
Probab=31.04 E-value=77 Score=20.64 Aligned_cols=29 Identities=21% Similarity=0.316 Sum_probs=22.8
Q ss_pred eeEeecCCeeeeccceEEe-CCEEEEEEeC
Q psy13486 32 GHIDIDNEALIFPSDVKVV-GDDLWVISDR 60 (91)
Q Consensus 32 ~iia~d~~~l~f~sdlkid-~~~lWvlsnr 60 (91)
.+.+.+.-+.--+++||=+ +|++||+-..
T Consensus 14 avrYvnTgTvgrV~dIkkdEdG~~WV~Lds 43 (97)
T COG4014 14 AVRYVNTGTVGRVVDIKKDEDGDIWVVLDS 43 (97)
T ss_pred eEEEeecCceeeEEEEEeecCCceEEEEec
Confidence 3666777777888999988 8999997554
No 39
>PF06977 SdiA-regulated: SdiA-regulated; InterPro: IPR009722 This entry represents a conserved region approximately 100 residues long within a number of hypothetical bacterial proteins that may be regulated by SdiA, a member of the LuxR family of transcriptional regulators []. Some proteins contain the IPR001258 from INTERPRO repeat.; PDB: 3QQZ_A.
Probab=30.06 E-value=59 Score=24.15 Aligned_cols=37 Identities=19% Similarity=0.100 Sum_probs=22.0
Q ss_pred ccceEEe--CCEEEEEEeCCcceecCCCCCCcceEEEEe
Q psy13486 44 PSDVKVV--GDDLWVISDRMPIHLESELNFNDVNFRIFT 80 (91)
Q Consensus 44 ~sdlkid--~~~lWvlsnrl~~f~~~~l~~~e~NfRI~~ 80 (91)
||++.++ .|.||+||..=++.+--..+-+-+.+.-+.
T Consensus 173 ~S~l~~~p~t~~lliLS~es~~l~~~d~~G~~~~~~~L~ 211 (248)
T PF06977_consen 173 LSGLSYDPRTGHLLILSDESRLLLELDRQGRVVSSLSLD 211 (248)
T ss_dssp --EEEEETTTTEEEEEETTTTEEEEE-TT--EEEEEE-S
T ss_pred ccceEEcCCCCeEEEEECCCCeEEEECCCCCEEEEEEeC
Confidence 7888998 679999999988876433333333444333
No 40
>TIGR02167 Liste_lipo_26 bacterial surface protein 26-residue repeat. This model describes a tandem peptide repeat sequence of 25 or 26 residues, found in predicted surface proteins (often lipoproteins) from Listeria monocytogenes, L. innocua, Enterococcus faecalis, Lactobacillus plantarum, Mycoplasma mycoides, Helicobacter hepaticus, and other species.
Probab=29.45 E-value=16 Score=18.00 Aligned_cols=14 Identities=14% Similarity=0.368 Sum_probs=9.6
Q ss_pred eCCCCcEEEeeCCC
Q psy13486 11 LIDRNAVGCWNSQY 24 (91)
Q Consensus 11 lv~~~aI~cWn~~~ 24 (91)
.++.-.|+.||+.+
T Consensus 7 ~~~~ldls~wdts~ 20 (26)
T TIGR02167 7 SLTSLDVSNWDTSS 20 (26)
T ss_pred cccccccccccccc
Confidence 44443499999975
No 41
>COG2813 RsmC 16S RNA G1207 methylase RsmC [Translation, ribosomal structure and biogenesis]
Probab=29.35 E-value=39 Score=26.20 Aligned_cols=31 Identities=16% Similarity=0.475 Sum_probs=20.6
Q ss_pred CCEEEEEEeCCcceec---CCCC-----CCcceEEEEec
Q psy13486 51 GDDLWVISDRMPIHLE---SELN-----FNDVNFRIFTV 81 (91)
Q Consensus 51 ~~~lWvlsnrl~~f~~---~~l~-----~~e~NfRI~~~ 81 (91)
+|+||++.||++.+.. ..+- .+.--|+|+++
T Consensus 259 gGeL~iVan~~l~y~~~L~~~Fg~v~~la~~~gf~Vl~a 297 (300)
T COG2813 259 GGELWIVANRHLPYEKKLKELFGNVEVLAKNGGFKVLRA 297 (300)
T ss_pred CCEEEEEEcCCCChHHHHHHhcCCEEEEEeCCCEEEEEE
Confidence 7899999999987642 1222 24446777765
No 42
>PRK03719 ecotin; Provisional
Probab=29.31 E-value=53 Score=23.41 Aligned_cols=23 Identities=22% Similarity=0.395 Sum_probs=18.1
Q ss_pred eCCcceecCCCCCCcceEEEEecC
Q psy13486 59 DRMPIHLESELNFNDVNFRIFTVP 82 (91)
Q Consensus 59 nrl~~f~~~~l~~~e~NfRI~~~~ 82 (91)
+|||..+|-.- --|+.||||++.
T Consensus 137 SkLPiVVY~P~-~~evrYRiW~a~ 159 (166)
T PRK03719 137 SRLPIVVYLPK-GVEVRYRVWKAE 159 (166)
T ss_pred CCCCeEEECCC-CCEEEEEEEecc
Confidence 58888888653 358999999986
No 43
>PF04683 Proteasom_Rpn13: Proteasome complex subunit Rpn13 ubiquitin receptor; InterPro: IPR006773 This family was thought originally to be involved in cell-adhesion [, ], but the members are now known to be proteasome subunit Rpn13, a novel ubiquitin receptor. The 26S proteasome is a huge macromolecular protein-degradation machine consisting of a proteolytically active 20S core, in the form of four disc-like proteins, and one or two 19S regulatory particles. The regulatory particle(s) sit on the top and or bottom of the core, de-ubiquitinate the substrate peptides, unfold them and guide them into the narrow channel through the centre of the core. Rpn13 and its homologues dock onto the regulatory particle through the N-terminal region which binds Rpn2. The C-terminal part of the domain binds de-ubiquitinating enzyme Uch37/UCHL5 and enhances its isopeptidase activity. Rpn13 binds ubiquitin via a conserved amino-terminal region called the pleckstrin-like receptor for ubiquitin, termed Pru, domain []. The domain forms two contiguous anti-parallel beta-sheets with a configuration similar to the pleckstrin-homology domain (PHD) fold []. Rpn13's ability to bind ubiquitin and the proteasome subunit Rpn2/S1 simultaneously supports evidence of its role as a ubiquitin receptor. Finally, when complexed to di-ubiquitin, via the Pru, and Uch37 via the C-terminal part, it frees up the distal ubiquitin for de-ubiquitination by the Uch37 []. ; GO: 0005634 nucleus, 0005737 cytoplasm; PDB: 2Z4D_A 2KR0_A 2Z59_A 2R2Y_A.
Probab=27.93 E-value=1.6e+02 Score=18.33 Aligned_cols=48 Identities=21% Similarity=0.506 Sum_probs=28.4
Q ss_pred ccCeEEEEeCC-CCcEEEeeCCCC-CCCCceeeEeecCCeeeeccceEEe------CCEEEEEE
Q psy13486 3 HNGILYFNLID-RNAVGCWNSQYP-YKPENIGHIDIDNEALIFPSDVKVV------GDDLWVIS 58 (91)
Q Consensus 3 ~~gvlff~lv~-~~aI~cWn~~~~-~~~~n~~iia~d~~~l~f~sdlkid------~~~lWvls 58 (91)
+.|.|+...-. .---.||-.... -.++ .+.+.||.+.++. .|.+|+|-
T Consensus 15 ~KG~l~l~~~~d~l~~f~W~~r~~~~~~e--------~d~il~pg~~~f~~V~~c~tGRVy~Lk 70 (85)
T PF04683_consen 15 RKGLLYLYKSEDGLLHFCWKPRDTTGEVE--------DDLILFPGDATFKKVPQCKTGRVYVLK 70 (85)
T ss_dssp S-EEEEEEETTTS-EEEEEEETST---EE--------EEEEE-TTTEEEEE-TTSSTS-EEEEE
T ss_pred CCEEEEEEECCCCeEEEEEEEcCcCCCcc--------cceecCCCCeEEEECCcCCCCeEEEEE
Confidence 67888887754 334579988754 2111 2466789998775 57888884
No 44
>KOG1214|consensus
Probab=27.03 E-value=1.3e+02 Score=27.36 Aligned_cols=53 Identities=21% Similarity=0.387 Sum_probs=36.2
Q ss_pred ccCeEEEE-eCCCCcEE-----------EeeCCCC------CCCCceeeEeecCCeeeeccceEEe-C-C-EEEEE
Q psy13486 3 HNGILYFN-LIDRNAVG-----------CWNSQYP------YKPENIGHIDIDNEALIFPSDVKVV-G-D-DLWVI 57 (91)
Q Consensus 3 ~~gvlff~-lv~~~aI~-----------cWn~~~~------~~~~n~~iia~d~~~l~f~sdlkid-~-~-~lWvl 57 (91)
+..|||++ ||++.+|. -||-..| +.-+|-+|+.+++ +-.|+||.+| . . -.||=
T Consensus 1101 ~rkvLf~tdLVNPR~iv~D~~rgnLYwtDWnRenPkIets~mDG~NrRilin~D--igLPNGLtfdpfs~~LCWvD 1174 (1289)
T KOG1214|consen 1101 ERKVLFYTDLVNPRAIVVDPIRGNLYWTDWNRENPKIETSSMDGENRRILINTD--IGLPNGLTFDPFSKLLCWVD 1174 (1289)
T ss_pred eeeEEEeecccCcceEEeecccCceeeccccccCCcceeeccCCccceEEeecc--cCCCCCceeCcccceeeEEe
Confidence 34566663 66777764 4776665 4567888888776 4579999999 3 3 46874
No 45
>KOG0289|consensus
Probab=26.32 E-value=1.7e+02 Score=24.25 Aligned_cols=60 Identities=17% Similarity=0.264 Sum_probs=37.6
Q ss_pred ccCeEEEEeCCCCcEEEeeCCCCCCCCceeeEeecCCeeeeccceEEe--CCEEEEEEeCCcceecC
Q psy13486 3 HNGILYFNLIDRNAVGCWNSQYPYKPENIGHIDIDNEALIFPSDVKVV--GDDLWVISDRMPIHLES 67 (91)
Q Consensus 3 ~~gvlff~lv~~~aI~cWn~~~~~~~~n~~iia~d~~~l~f~sdlkid--~~~lWvlsnrl~~f~~~ 67 (91)
++|-..-+.-+...|-|||.+|- +|+.-+.-+... =..++.+| +.+|=+=++.++.|...
T Consensus 399 ENGY~Lat~add~~V~lwDLRKl---~n~kt~~l~~~~--~v~s~~fD~SGt~L~~~g~~l~Vy~~~ 460 (506)
T KOG0289|consen 399 ENGYWLATAADDGSVKLWDLRKL---KNFKTIQLDEKK--EVNSLSFDQSGTYLGIAGSDLQVYICK 460 (506)
T ss_pred cCceEEEEEecCCeEEEEEehhh---cccceeeccccc--cceeEEEcCCCCeEEeecceeEEEEEe
Confidence 66766667777778999999975 255555444411 25566677 33555556666666654
No 46
>cd03855 M14_ASTE Peptidase M14 Succinylglutamate desuccinylase (ASTE, also known as N-succinyl-L-glutamate amidohydrolase, N2-succinylglutamate desuccinylase, and SGDS; EC 3.5.1.96) belongs to the Succinylglutamate desuccinylase (ASTE)/aspartoacylase (ASPA) subfamily of the M14 family of metallocarboxypeptidases. This group includes succinylglutamate desuccinylase that catalyzes the fifth and last step in arginine catabolism by the arginine succinyltransferase pathway. It hydrolyzes N-succinyl-L-glutamate to succinate and L-glutamate.
Probab=22.99 E-value=82 Score=24.08 Aligned_cols=35 Identities=23% Similarity=0.460 Sum_probs=26.6
Q ss_pred EEeCCCCcEEEeeCCCCCCCCceeeEeecCCeeeeccceEEe
Q psy13486 9 FNLIDRNAVGCWNSQYPYKPENIGHIDIDNEALIFPSDVKVV 50 (91)
Q Consensus 9 f~lv~~~aI~cWn~~~~~~~~n~~iia~d~~~l~f~sdlkid 50 (91)
|+.+.+..+-.|+-.++| ++-++.+.+.||+- ++.
T Consensus 277 F~~~~~G~~ia~~~~~~~------~~~~~~~~ivfPn~-~v~ 311 (322)
T cd03855 277 FTEFPKGTLLAEDGDKRY------VVEEDGEAIVFPNA-NVA 311 (322)
T ss_pred ceecCCCcEEEEECCEeE------EeCCCCeEEEccCC-CCC
Confidence 778888778889888775 45566789999995 443
No 47
>PF10761 DUF2590: Protein of unknown function (DUF2590); InterPro: IPR019697 This entry is represented by Bacteriophage HP1, Orf28. The characteristics of the protein distribution suggest prophage matches in addition to the phage matches.
Probab=22.94 E-value=1.7e+02 Score=19.29 Aligned_cols=28 Identities=29% Similarity=0.593 Sum_probs=21.4
Q ss_pred eeeEeecCCeeeeccceEEe---CCEEEEEEe
Q psy13486 31 IGHIDIDNEALIFPSDVKVV---GDDLWVISD 59 (91)
Q Consensus 31 ~~iia~d~~~l~f~sdlkid---~~~lWvlsn 59 (91)
+.+++.+++.+ -|.-++|. .|.+|+-+.
T Consensus 60 iellvE~D~Ri-vPGTv~i~e~~~g~l~itA~ 90 (102)
T PF10761_consen 60 IELLVEDDERI-VPGTVKIEEESPGRLWITAE 90 (102)
T ss_pred eeeeeeccccc-cCceEEEEeccCCEEEEEEE
Confidence 56778888775 79999997 578999553
No 48
>COG2706 3-carboxymuconate cyclase [Carbohydrate transport and metabolism]
Probab=22.73 E-value=3e+02 Score=21.90 Aligned_cols=52 Identities=12% Similarity=0.241 Sum_probs=38.7
Q ss_pred eEEEEeCCCCcEEEeeCCCCCCCCceeeEeecCCeeeeccceEEe-CCEEEEEEe
Q psy13486 6 ILYFNLIDRNAVGCWNSQYPYKPENIGHIDIDNEALIFPSDVKVV-GDDLWVISD 59 (91)
Q Consensus 6 vlff~lv~~~aI~cWn~~~~~~~~n~~iia~d~~~l~f~sdlkid-~~~lWvlsn 59 (91)
.||=+.=..++|+|.-....= . -+..+..-+..-|||-|+.++ .|.+-+.+|
T Consensus 257 FLYasNRg~dsI~~f~V~~~~-g-~L~~~~~~~teg~~PR~F~i~~~g~~Liaa~ 309 (346)
T COG2706 257 FLYASNRGHDSIAVFSVDPDG-G-KLELVGITPTEGQFPRDFNINPSGRFLIAAN 309 (346)
T ss_pred EEEEecCCCCeEEEEEEcCCC-C-EEEEEEEeccCCcCCccceeCCCCCEEEEEc
Confidence 677777788899999776431 1 277888888889999999999 665555443
No 49
>TIGR03032 conserved hypothetical protein TIGR03032. This protein family is uncharacterized. A number of motifs are conserved perfectly among all member sequences. The function of this protein is unknown.
Probab=21.68 E-value=3.6e+02 Score=21.38 Aligned_cols=52 Identities=13% Similarity=0.122 Sum_probs=39.3
Q ss_pred cCeEEEEeCCCCcEEEeeCCCCCCCCceeeEeecCCeeeeccceEEeCCEEEEEEeCCcc
Q psy13486 4 NGILYFNLIDRNAVGCWNSQYPYKPENIGHIDIDNEALIFPSDVKVVGDDLWVISDRMPI 63 (91)
Q Consensus 4 ~gvlff~lv~~~aI~cWn~~~~~~~~n~~iia~d~~~l~f~sdlkid~~~lWvlsnrl~~ 63 (91)
+|-|+|.......|..++.++. ...+|+.=+ .||+||.+. |.+-+++=.-++
T Consensus 212 dgrLwvldsgtGev~~vD~~~G----~~e~Va~vp---G~~rGL~f~-G~llvVgmSk~R 263 (335)
T TIGR03032 212 QGKLWLLNSGRGELGYVDPQAG----KFQPVAFLP---GFTRGLAFA-GDFAFVGLSKLR 263 (335)
T ss_pred CCeEEEEECCCCEEEEEcCCCC----cEEEEEECC---CCCccccee-CCEEEEEecccc
Confidence 6889999999999999998755 456777766 499999997 555555544444
No 50
>PF14398 ATPgrasp_YheCD: YheC/D like ATP-grasp
Probab=21.40 E-value=61 Score=23.91 Aligned_cols=21 Identities=19% Similarity=0.517 Sum_probs=15.1
Q ss_pred eeccceEEe-CCEEEEE-EeCCc
Q psy13486 42 IFPSDVKVV-GDDLWVI-SDRMP 62 (91)
Q Consensus 42 ~f~sdlkid-~~~lWvl-snrl~ 62 (91)
.+.=||.|| +|.+|++ .|.-|
T Consensus 211 ElGiDl~iD~~g~iWliEvN~kP 233 (262)
T PF14398_consen 211 ELGIDLGIDKNGKIWLIEVNSKP 233 (262)
T ss_pred EEEEEEEEcCCCCEEEEEEeCCC
Confidence 455678888 8999998 45444
No 51
>PRK05324 succinylglutamate desuccinylase; Provisional
Probab=21.11 E-value=98 Score=23.84 Aligned_cols=35 Identities=20% Similarity=0.442 Sum_probs=27.9
Q ss_pred EEeCCCCcEEEeeCCCCCCCCceeeEeecCCeeeeccceEEe
Q psy13486 9 FNLIDRNAVGCWNSQYPYKPENIGHIDIDNEALIFPSDVKVV 50 (91)
Q Consensus 9 f~lv~~~aI~cWn~~~~~~~~n~~iia~d~~~l~f~sdlkid 50 (91)
|+.+.+..+-+|+-.++| ++-++.+.++||+- ++.
T Consensus 280 ft~~~~G~~la~d~~~~~------~~~~~~~~ivfPn~-~v~ 314 (329)
T PRK05324 280 FTAFPKGTLLAEDGDERY------VVEHDGERIVFPNP-NVA 314 (329)
T ss_pred ceEcCCCCEEEEeCCeeE------EeCCCCeEEEccCC-CCC
Confidence 677888889999988886 56677799999995 654
Done!