Query psy9766
Match_columns 97
No_of_seqs 101 out of 470
Neff 5.7
Searched_HMMs 46136
Date Fri Aug 16 19:01:13 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy9766.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/9766hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 smart00121 IB Insulin growth f 99.7 1.6E-17 3.6E-22 106.3 5.1 67 21-93 2-75 (75)
2 PF00219 IGFBP: Insulin-like g 99.5 1.8E-15 3.9E-20 91.2 -1.3 51 23-78 3-53 (53)
3 PF07327 Neuroparsin: Neuropar 98.4 5.4E-07 1.2E-11 60.5 4.0 48 24-79 31-78 (103)
4 PF03032 Brevenin: Brevenin/es 71.0 2.8 6E-05 24.4 1.3 13 1-13 3-15 (46)
5 PF11119 DUF2633: Protein of u 60.6 5.7 0.00012 24.4 1.3 15 1-15 9-23 (59)
6 PF08139 LPAM_1: Prokaryotic m 54.6 9.4 0.0002 19.6 1.3 15 1-15 7-21 (25)
7 PF13956 Ibs_toxin: Toxin Ibs, 51.4 11 0.00023 18.1 1.1 15 1-15 1-15 (19)
8 COG3898 Uncharacterized membra 48.9 11 0.00023 31.7 1.6 16 1-16 1-16 (531)
9 PF12273 RCR: Chitin synthesis 45.5 7.3 0.00016 26.4 0.1 8 3-10 1-8 (130)
10 PRK10350 hypothetical protein; 43.8 15 0.00032 26.1 1.4 16 1-16 1-16 (145)
11 PF07127 Nodulin_late: Late no 42.9 36 0.00078 19.7 2.8 17 6-22 11-30 (54)
12 PRK10081 entericidin B membran 38.5 20 0.00044 21.1 1.2 14 1-14 2-15 (48)
13 PF03967 PRCH: Photosynthetic 35.8 26 0.00057 24.8 1.7 12 3-14 17-28 (136)
14 PRK10386 curli assembly protei 35.1 30 0.00066 24.2 1.9 18 1-18 1-18 (130)
15 PF00734 CBM_1: Fungal cellulo 34.9 15 0.00032 19.2 0.3 23 61-83 4-27 (29)
16 PF02740 Colipase_C: Colipase, 34.3 9.6 0.00021 22.2 -0.6 25 70-94 17-45 (45)
17 PF11777 DUF3316: Protein of u 31.1 31 0.00067 22.9 1.4 15 1-15 1-15 (114)
18 cd00226 PRCH Photosynthetic re 30.2 34 0.00074 26.4 1.6 12 3-14 17-28 (246)
19 TIGR01150 puhA photosynthetic 29.8 35 0.00076 26.5 1.6 12 3-14 17-28 (252)
20 COG5510 Predicted small secret 25.5 52 0.0011 19.0 1.4 14 1-14 2-15 (44)
21 PF11153 DUF2931: Protein of u 22.0 64 0.0014 23.5 1.8 15 1-15 1-15 (216)
22 cd02875 GH18_chitobiase Chitob 21.5 57 0.0012 25.7 1.5 13 20-32 17-30 (358)
23 PF06291 Lambda_Bor: Bor prote 20.6 83 0.0018 20.9 1.9 15 1-15 1-15 (97)
No 1
>smart00121 IB Insulin growth factor-binding protein homologues. High affinity binding partners of insulin-like growth factors.
Probab=99.70 E-value=1.6e-17 Score=106.33 Aligned_cols=67 Identities=39% Similarity=0.899 Sum_probs=58.5
Q ss_pred CC-cCCCCCCCCCCCCCCCCCCceeccCccccHHhhhhcCCCCCCCCCCcCCCCCCceeeccCCCCC------CCceeec
Q psy9766 21 SC-VCNPAECETLTDDDCPDRAGTVWDPCGCCKVCARAENQPCGGPYGFFGSCAHDLECVVTDVTNE------NSGGVCT 93 (97)
Q Consensus 21 ~C-~C~~~~Cp~~p~~~C~~~~g~v~D~CgCC~vCa~~eGe~Cg~~~~~~g~C~~GL~C~~~~~~~~------~~~GvC~ 93 (97)
.| +|++..||+.|+ +|+. ++++|.||||.+||++|||+||+. .++|++||+|.++.++.+ +++|+|+
T Consensus 2 ~C~~C~~~~Cp~~p~-~C~~--~~~~d~CgCC~vCa~~eGe~Cg~~---~~~C~~GL~C~~~~~~~~p~~al~~g~g~C~ 75 (75)
T smart00121 2 RCPPCDPARCPPCPP-GCAE--LVRLDGCGCCPVCARQEGEPCGVY---TPRCAPGLRCQPPPGEERPLRALLHGRGVCM 75 (75)
T ss_pred CCCCCCHhHCcCCCC-CCCc--cccCCCCchhHHHhcccCCcCCCC---CCCCCCCCEEcCCCCCcChHHHHhccCcccC
Confidence 57 688999998774 8998 899999999999999999999963 579999999999988744 6889985
No 2
>PF00219 IGFBP: Insulin-like growth factor binding protein; InterPro: IPR000867 The insulin family of proteins groups together several evolutionarily related active peptides []: these include insulin [, ], relaxin [, ], insect prothoracicotropic hormone (bombyxin) [], insulin-like growth factors (IGF1 and IGF2) [, ], mammalian Leydig cell-specific insulin-like peptide (gene INSL3), early placenta insulin-like peptide (ELIP) (gene INSL4), locust insulin-related peptide (LIRP), molluscan insulin-related peptides (MIP), and Caenorhabditis elegans insulin-like peptides. The 3D structures of a number of family members have been determined [, , ]. The fold comprises two polypeptide chains (A and B) linked by two disulphide bonds: all share a conserved arrangement of 4 cysteines in their A chain, the first of which is linked by a disulphide bond to the third, while the second and fourth are linked by interchain disulphide bonds to cysteines in the B chain. Insulin is found in many animals, and is involved in the regulation of normal glucose homeostasis. It also has other specific physiological effects, such as increasing the permeability of cells to monosaccharides, amino acids and fatty acids, and accelerating glycolysis and glycogen synthesis in the liver []. Insulin exerts its effects by interaction with a cell-surface receptor, which may also result in the promotion of cell growth []. Insulin is synthesised as a prepropeptide from which an endoplasmic reticulum-targeting sequence is cleaved to yield proinsulin. The sequence of prosinsulin contains 2 well-conserved regions (designated A and B), separated by an intervening connecting region (C), which is variable between species []. The connecting region is cleaved, liberating the active protein, which contains the A and B chains, held together by 2 disulphide bonds []. Insulin-like Growth Factor Binding Proteins (IGFBP) are a group of vertebrate secreted proteins, which bind to IGF-I and IGF-II with high affinity and modulate the biological actions of IGFs. The IGFBP family has six distinct subgroups, IGFBP-1 through 6, based on conservation of gene (intron-exon) organisation, structural similarity, and binding affinity for IGFs. Across species, IGFBP-5 exhibits the most sequence conservation, while IGFBP-6 exhibits the least sequence conservation. The IGFBPs contain inhibitor domain homologues, which are related to MEROPS protease inhibitor family I31 (equistatin, clan IX). All IGFBPs share a common domain architecture (IPR000867 from INTERPRO:IPR000716 from INTERPRO). While the N-terminal (IPR000867 from INTERPRO, IGF binding protein domain), and the C-terminal (IPR000716 from INTERPRO, thyroglobulin type-1 repeat) domains are conserved across vertebrate species, the mid-region is highly variable with respect to protease cleavage sites and phosphorylation and glycosylation sites. IGFBPs contain 16-18 conserved cysteines located in the N-terminal and the C-terminal regions, which form 8-9 disulphide bonds []. As demonstrated for human IGFBP-5, the N terminus is the primary binding site for IGF. This region, comprised of Val49, Tyr50, Pro62 and Lys68-Leu75, forms a hydrophobic patch on the surface of the protein []. The C terminus is also required for high affinity IGF binding, as well as for binding to the extracellular matrix [] and for nuclear translocation [, ] of IGFBP-3 and -5. IGFBPs are unusually pleiotropic molecules. Like other binding proteins, IGFBP can prolong the half-life of IGFs via high affinity binding of the ligands. In addition to functioning as simple carrier proteins, serum IGFBPs also serve to regulate the endocrine and paracrine/autocrine actions of IGF by modulating the IGF available to bind to signalling IGF-I receptors [, ]. Furthermore, IGFBPs can function as growth modulators independent of IGFs. For example, IGFBP-5 stimulates markers of bone formation in osteoblasts lacking functional IGFs []. The binding of IGFBP to its putative receptor on the cell membrane may stimulate the signalling pathway independent of an IGF receptor, to mediate the effects of IGFBPs in certain target cell types. IGFBP-1 and -2, but not other IGFBPs, contain a C-terminal Arg-Gly-Asp integrin-binding motif. Thus, IGFBP-1 can also stimulate cell migration of CHO and human trophoblast cells through an action mediated by alpha 5 beta 1 integrin []. Finally, IGFBPs transported into the nucleus (via the nuclear localisation signal) may also exert IGF-independent effects by transcriptional activation of genes. This entry represents insulin-like growth factors (IGF-I and IGF-II), which bind to specific binding proteins in extracellular fluids with high affinity [, , ]. These IGF-binding proteins (IGFBP) prolong the half-life of the IGFs and have been shown to either inhibit or stimulate the growth promoting effects of the IGFs on cells culture. They seem to alter the interaction of IGFs with their cell surface receptors. There are at least six different IGFBPs and they are structurally related. The following growth-factor inducible proteins are structurally related to IGFBPs and could function as growth-factor binding proteins [, ], mouse protein cyr61 and its probable chicken homolog, protein CEF-10; human connective tissue growth factor (CTGF) and its mouse homolog, protein FISP-12; and vertebrate protein NOV.; GO: 0005520 insulin-like growth factor binding, 0001558 regulation of cell growth, 0005576 extracellular region; PDB: 2DSQ_A 2DSP_B 2DSR_B 1WQJ_B 1H59_B 1BOE_A 3TJQ_A.
Probab=99.49 E-value=1.8e-15 Score=91.18 Aligned_cols=51 Identities=49% Similarity=1.162 Sum_probs=31.3
Q ss_pred cCCCCCCCCCCCCCCCCCCceeccCccccHHhhhhcCCCCCCCCCCcCCCCCCcee
Q psy9766 23 VCNPAECETLTDDDCPDRAGTVWDPCGCCKVCARAENQPCGGPYGFFGSCAHDLEC 78 (97)
Q Consensus 23 ~C~~~~Cp~~p~~~C~~~~g~v~D~CgCC~vCa~~eGe~Cg~~~~~~g~C~~GL~C 78 (97)
+|+++ ||+++ ++|+.++.+|+|.||||.+||++|||+||.. +++|++||+|
T Consensus 3 ~C~~~-Cp~~~-~~C~~g~~~v~d~CGCC~vCA~~~Ge~CG~~---~~~C~~GL~C 53 (53)
T PF00219_consen 3 PCDPE-CPPPP-PDCPAGVSLVLDGCGCCKVCARQEGEPCGVY---TGPCGPGLRC 53 (53)
T ss_dssp SB-HH-------SS-TTEEEE--STTS-SEEE-B-TTSEESTT---S--BSTTEEE
T ss_pred CCccc-CCCCC-CCCCCcccccCCCCcCHHHHHhhcCCcCCCc---CCCCCCcCCC
Confidence 47667 87665 5899955567799999999999999999864 5799999997
No 3
>PF07327 Neuroparsin: Neuroparsin; InterPro: IPR010850 This family consists of several locust specific neuroparsin proteins. Neuroparsins are produced by the A1 type of protocerebral median neurosecretory cells of the PI-CC system and display pleiotropic activities: inhibition of the effect of juvenile hormone, stimulation of fluid reabsorption of isolated recta, induction of an increase in hemolymph lipid and trehalose levels, and neurotrophic effects [].
Probab=98.35 E-value=5.4e-07 Score=60.49 Aligned_cols=48 Identities=42% Similarity=0.967 Sum_probs=40.1
Q ss_pred CCCCCCCCCCCCCCCCCCceeccCccccHHhhhhcCCCCCCCCCCcCCCCCCceee
Q psy9766 24 CNPAECETLTDDDCPDRAGTVWDPCGCCKVCARAENQPCGGPYGFFGSCAHDLECV 79 (97)
Q Consensus 24 C~~~~Cp~~p~~~C~~~~g~v~D~CgCC~vCa~~eGe~Cg~~~~~~g~C~~GL~C~ 79 (97)
|+.+.|-.-+ ..|++ |.+.|.||. .+||+..||.||+ +|.|++||.|.
T Consensus 31 c~~~~C~~d~-~~C~Y--G~v~d~cgr-~~CaKGPge~CGg----~G~CGeGl~C~ 78 (103)
T PF07327_consen 31 CEGANCVVDC-TKCEY--GEVTDFCGR-KVCAKGPGERCGG----YGKCGEGLRCS 78 (103)
T ss_pred cchhhccccc-ccccc--cceecccCc-chhhcCCCCcCCC----cceecCCcEec
Confidence 4445555433 58999 899999998 8999999999999 58999999994
No 4
>PF03032 Brevenin: Brevenin/esculentin/gaegurin/rugosin family; InterPro: IPR004275 In addition to the highly specific cell-mediated immune system, vertebrates possess an efficient host-defence mechanism against invading microorganisms which involves the synthesis of highly potent antimicrobial peptides with a large spectrum of activity. This entry represents a number of these defence peptides secreted from the skin of amphibians, including the opiate-like dermorphins and deltorphins, and the antimicrobial dermoseptins and temporins.; GO: 0006952 defense response, 0042742 defense response to bacterium, 0005576 extracellular region
Probab=71.00 E-value=2.8 Score=24.43 Aligned_cols=13 Identities=15% Similarity=0.358 Sum_probs=9.7
Q ss_pred CchHHHHHHHHHh
Q psy9766 1 MIRSVFWIFLSYC 13 (97)
Q Consensus 1 ~~~~~~~~~~~~~ 13 (97)
|||+|+++||+=.
T Consensus 3 lKKsllLlfflG~ 15 (46)
T PF03032_consen 3 LKKSLLLLFFLGT 15 (46)
T ss_pred chHHHHHHHHHHH
Confidence 7889888877543
No 5
>PF11119 DUF2633: Protein of unknown function (DUF2633); InterPro: IPR022576 This family is conserved largely in Proteobacteria. Several members are named as YfgG. The function is not known.
Probab=60.56 E-value=5.7 Score=24.38 Aligned_cols=15 Identities=33% Similarity=0.425 Sum_probs=11.5
Q ss_pred CchHHHHHHHHHhhc
Q psy9766 1 MIRSVFWIFLSYCLG 15 (97)
Q Consensus 1 ~~~~~~~~~~~~~~~ 15 (97)
|+|+|++|.++++++
T Consensus 9 mtriVLLISfiIlfg 23 (59)
T PF11119_consen 9 MTRIVLLISFIILFG 23 (59)
T ss_pred HHHHHHHHHHHHHHH
Confidence 788888887766654
No 6
>PF08139 LPAM_1: Prokaryotic membrane lipoprotein lipid attachment site; InterPro: IPR012640 In prokaryotes, membrane lipoproteins are synthesized with a precursor signal peptide, which is cleaved by a specific lipoprotein signal peptidase (signal peptidase II). The peptidase recognises a conserved sequence and cuts upstream of a cysteine residue to which a glyceride-fatty acid lipid is attached [,]. This lipid attachment site is found in homologues of the VirB proteins of type IV secretion systems (T4SS). Conjugal transfer across the cell envelope of Gram-negative bacteria is mediated by a supramolecular structure termed mating pair formation (Mpf) complex. Collectively, secretion pathways ancestrally related to bacterial conjugation systems are now known as T4SS. T4SS are involved in the delivery of effector molecules to eukaryotic target cells; each of these systems exports distinct DNA or protein substrates to effect a myriad of changes in host cell physiology during infection [].
Probab=54.59 E-value=9.4 Score=19.56 Aligned_cols=15 Identities=20% Similarity=0.381 Sum_probs=10.4
Q ss_pred CchHHHHHHHHHhhc
Q psy9766 1 MIRSVFWIFLSYCLG 15 (97)
Q Consensus 1 ~~~~~~~~~~~~~~~ 15 (97)
|||.+|.+.+++.++
T Consensus 7 mKkil~~l~a~~~La 21 (25)
T PF08139_consen 7 MKKILFPLLALFMLA 21 (25)
T ss_pred HHHHHHHHHHHHHHh
Confidence 378888777776554
No 7
>PF13956 Ibs_toxin: Toxin Ibs, type I toxin-antitoxin system
Probab=51.44 E-value=11 Score=18.11 Aligned_cols=15 Identities=13% Similarity=0.319 Sum_probs=7.5
Q ss_pred CchHHHHHHHHHhhc
Q psy9766 1 MIRSVFWIFLSYCLG 15 (97)
Q Consensus 1 ~~~~~~~~~~~~~~~ 15 (97)
|.|.+.++.++++.+
T Consensus 1 MMk~vIIlvvLLliS 15 (19)
T PF13956_consen 1 MMKLVIILVVLLLIS 15 (19)
T ss_pred CceehHHHHHHHhcc
Confidence 445555545555444
No 8
>COG3898 Uncharacterized membrane-bound protein [Function unknown]
Probab=48.92 E-value=11 Score=31.75 Aligned_cols=16 Identities=44% Similarity=0.769 Sum_probs=13.7
Q ss_pred CchHHHHHHHHHhhcc
Q psy9766 1 MIRSVFWIFLSYCLGR 16 (97)
Q Consensus 1 ~~~~~~~~~~~~~~~~ 16 (97)
|||.+||+++++.+++
T Consensus 1 MiRvlfflilV~alg~ 16 (531)
T COG3898 1 MIRVLFFLILVAALGF 16 (531)
T ss_pred ChhHHHHHHHHHHHHh
Confidence 9999999999887754
No 9
>PF12273 RCR: Chitin synthesis regulation, resistance to Congo red; InterPro: IPR020999 RCR proteins are ER membrane proteins that regulate chitin deposition in fungal cell walls. Although chitin, a linear polymer of beta-1,4-linked N-acetylglucosamine, constitutes only 2% of the cell wall it plays a vital role in the overall protection of the cell wall against stress, noxious chemicals and osmotic pressure changes. Congo red is a cell wall-disrupting benzidine-type dye extensively used in many cell wall mutant studies that specifically targets chitin in yeast cells and inhibits growth. RCR proteins render the yeasts resistant to Congo red by diminishing the content of chitin in the cell wall []. RCR proteins are probably regulating chitin synthase III interact directly with ubiquitin ligase Rsp5, and the VPEY motif is necessary for this, via interaction with the WW domains of Rsp5 [].
Probab=45.48 E-value=7.3 Score=26.42 Aligned_cols=8 Identities=25% Similarity=0.559 Sum_probs=3.8
Q ss_pred hHHHHHHH
Q psy9766 3 RSVFWIFL 10 (97)
Q Consensus 3 ~~~~~~~~ 10 (97)
|||+|++|
T Consensus 1 RW~l~~ii 8 (130)
T PF12273_consen 1 RWVLFAII 8 (130)
T ss_pred CeeeHHHH
Confidence 45555443
No 10
>PRK10350 hypothetical protein; Provisional
Probab=43.80 E-value=15 Score=26.10 Aligned_cols=16 Identities=25% Similarity=0.173 Sum_probs=10.1
Q ss_pred CchHHHHHHHHHhhcc
Q psy9766 1 MIRSVFWIFLSYCLGR 16 (97)
Q Consensus 1 ~~~~~~~~~~~~~~~~ 16 (97)
|||+|+|++++-+.+.
T Consensus 1 MKr~L~~~allP~~~~ 16 (145)
T PRK10350 1 MKRLLLLTALLPFVGF 16 (145)
T ss_pred ChhHHHHHHHhHHHHH
Confidence 8999866665444433
No 11
>PF07127 Nodulin_late: Late nodulin protein; InterPro: IPR009810 This family consists of several plant specific late nodulin sequences which are homologous to the Pisum sativum (Garden pea) ENOD3 protein. ENOD3 is expressed in the late stages of root nodule formation and contains two pairs of cysteine residues toward the proteins C terminus which may be involved in metal-binding [].; GO: 0046872 metal ion binding, 0009878 nodule morphogenesis
Probab=42.93 E-value=36 Score=19.74 Aligned_cols=17 Identities=35% Similarity=0.522 Sum_probs=7.4
Q ss_pred HHHHHHHhhccccc---cCC
Q psy9766 6 FWIFLSYCLGRSIA---FSC 22 (97)
Q Consensus 6 ~~~~~~~~~~~~~~---~~C 22 (97)
++||+++.+.+... ..|
T Consensus 11 mIiflslflv~~~~~~~~~C 30 (54)
T PF07127_consen 11 MIIFLSLFLVVTNVDAIIPC 30 (54)
T ss_pred HHHHHHHHHhhcccCCCccc
Confidence 34444444433333 466
No 12
>PRK10081 entericidin B membrane lipoprotein; Provisional
Probab=38.48 E-value=20 Score=21.09 Aligned_cols=14 Identities=21% Similarity=0.306 Sum_probs=9.0
Q ss_pred CchHHHHHHHHHhh
Q psy9766 1 MIRSVFWIFLSYCL 14 (97)
Q Consensus 1 ~~~~~~~~~~~~~~ 14 (97)
|||.+.+|++++++
T Consensus 2 mKk~i~~i~~~l~~ 15 (48)
T PRK10081 2 VKKTIAAIFSVLVL 15 (48)
T ss_pred hHHHHHHHHHHHHH
Confidence 67777776665543
No 13
>PF03967 PRCH: Photosynthetic reaction centre, H-chain N-terminal region; InterPro: IPR015810 The photosynthetic apparatus in non-oxygenic bacteria consists of light-harvesting (LH) protein-pigment complexes LH1 and LH2, which use carotenoid and bacteriochlorophyll as primary donors []. LH1 acts as the energy collection hub, temporarily storing it before its transfer to the photosynthetic reaction centre (RC) []. Electrons are transferred from the primary donor via an intermediate acceptor (bacteriopheophytin) to the primary acceptor (quinine Qa), and finally to the secondary acceptor (quinone Qb), resulting in the formation of ubiquinol QbH2. RC uses the excitation energy to shuffle electrons across the membrane, transferring them via ubiquinol to the cytochrome bc1 complex in order to establish a proton gradient across the membrane, which is used by ATP synthetase to form ATP [, , ]. The core complex is anchored in the cell membrane, consisting of one unit of RC surrounded by LH1; in some species there may be additional subunits []. RC consists of three subunits: L (light), M (medium), and H (heavy). Subunits L and M provide the scaffolding for the chromophore, while subunit H contains a cytoplasmic domain []. In Rhodopseudomonas viridis, there is also a non-membranous tetrahaem cytochrome (4Hcyt) subunit on the periplasmic surface. This entry represents the N-terminal domain of the photosynthetic reaction centre H subunit, which includes the transmembrane domain and part of the cytoplasmic domain [].; GO: 0045156 electron transporter, transferring electrons within the cyclic electron transport pathway of photosynthesis activity, 0019684 photosynthesis, light reaction, 0030077 plasma membrane light-harvesting complex; PDB: 1RZZ_H 1PST_H 2J8D_H 3DUQ_H 1FNP_H 1KBY_H 1E14_H 2HG3_H 1UMX_H 1YST_H ....
Probab=35.80 E-value=26 Score=24.82 Aligned_cols=12 Identities=33% Similarity=0.794 Sum_probs=8.9
Q ss_pred hHHHHHHHHHhh
Q psy9766 3 RSVFWIFLSYCL 14 (97)
Q Consensus 3 ~~~~~~~~~~~~ 14 (97)
-++|||||+.++
T Consensus 17 lyaFwiFFagLi 28 (136)
T PF03967_consen 17 LYAFWIFFAGLI 28 (136)
T ss_dssp HHHHHHHHHHHH
T ss_pred HHHHHHHHHHHH
Confidence 468999886664
No 14
>PRK10386 curli assembly protein CsgE; Provisional
Probab=35.15 E-value=30 Score=24.25 Aligned_cols=18 Identities=28% Similarity=0.231 Sum_probs=13.4
Q ss_pred CchHHHHHHHHHhhcccc
Q psy9766 1 MIRSVFWIFLSYCLGRSI 18 (97)
Q Consensus 1 ~~~~~~~~~~~~~~~~~~ 18 (97)
|||.+.|+.+++++..+.
T Consensus 1 ~~r~~~~~l~~~~l~~~~ 18 (130)
T PRK10386 1 MKRYLRWIVAAELLFAAG 18 (130)
T ss_pred ChhHHHHHHHHHHHHhCc
Confidence 899998988877665433
No 15
>PF00734 CBM_1: Fungal cellulose binding domain; InterPro: IPR000254 The microbial degradation of cellulose and xylans requires several types of enzymes such as endoglucanases (3.2.1.4 from EC), cellobiohydrolases (3.2.1.91 from EC) (exoglucanases), or xylanases (3.2.1.8 from EC) []. Structurally, cellulases and xylanases generally consist of a catalytic domain joined to a cellulose-binding domain (CBD) by a short linker sequence rich in proline and/or hydroxy-amino acids. The CBD of a number of fungal cellulases has been shown to consist of 36 amino acid residues, and it is found either at the N-terminal or at the C-terminal extremity of the enzymes. As it is shown in the following schematic representation, there are four conserved cysteines in this type of CBD domain, all involved in disulphide bonds. +----------------+ | +-----|---------+ | | | | xxxxxxxCxxxxxxxxxxCxxxxxCxxxxxxxxxCx ; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0030248 cellulose binding, 0005975 carbohydrate metabolic process, 0005576 extracellular region; PDB: 1AZK_A 2CBH_A 1AZJ_A 1AZ6_A 1CBH_A 1AZH_A.
Probab=34.93 E-value=15 Score=19.24 Aligned_cols=23 Identities=30% Similarity=0.469 Sum_probs=14.0
Q ss_pred CCCCCCCCc-CCCCCCceeeccCC
Q psy9766 61 PCGGPYGFF-GSCAHDLECVVTDV 83 (97)
Q Consensus 61 ~Cg~~~~~~-g~C~~GL~C~~~~~ 83 (97)
.|||..+.. ..|.+|+.|...++
T Consensus 4 QCGG~~~~G~t~C~~g~~C~~~n~ 27 (29)
T PF00734_consen 4 QCGGIGWTGPTCCASGYTCVYQND 27 (29)
T ss_dssp EEECTT-SS---BSSSSCEEEEET
T ss_pred ccCCCCcccCCcCCCcCEeeeECC
Confidence 477655322 35999999987653
No 16
>PF02740 Colipase_C: Colipase, C-terminal domain; InterPro: IPR017914 This entry represents the C-terminal domain of colipase proteins. Colipase [, ] is a small protein cofactor needed by pancreatic lipase for efficient dietary lipid hydrolyisis. It also binds to the bile-salt covered triacylglycerol interface, thus allowing the enzyme to anchor itself to the water-lipid interface. Efficient absorption of dietary fats is dependent on the action of pancreatic triglyceride lipase. Colipase binds to the C-terminal, non-catalytic domain of lipase, thereby stabilising as active conformation and considerably increasing the overall hydrophobic binding site. Structural studies of the complex and of colipase alone have revealed the functionality of its architecture [, ]. Colipase is a small protein with five conserved disulphide bonds. Structural analogies have been recognised between a developmental protein (Dickkopf), the pancreatic lipase C-terminal domain, the N-terminal domains of lipoxygenases and the C-terminal domain of alpha-toxin. These non-catalytic domains in the latter enzymes are important for interaction with membrane. It has not been established if these domains are also involved in eventual protein cofactor binding as is the case for pancreatic lipase [].; GO: 0008047 enzyme activator activity, 0007586 digestion, 0016042 lipid catabolic process, 0005576 extracellular region; PDB: 1PCO_A 1PCN_A 1LPA_A 1N8S_C 1LPB_A 1ETH_D.
Probab=34.32 E-value=9.6 Score=22.15 Aligned_cols=25 Identities=20% Similarity=0.477 Sum_probs=15.0
Q ss_pred CCCCCCceeeccCCCCC----CCceeecc
Q psy9766 70 GSCAHDLECVVTDVTNE----NSGGVCTQ 94 (97)
Q Consensus 70 g~C~~GL~C~~~~~~~~----~~~GvC~~ 94 (97)
=+|.+||.|.....-.. ...|||.+
T Consensus 17 CPCE~GLtCe~dktivgsItNtn~GiC~d 45 (45)
T PF02740_consen 17 CPCERGLTCEGDKTIVGSITNTNFGICHD 45 (45)
T ss_dssp --BSTTEEEEECESHHHHHHTC-EEEEEE
T ss_pred ccCCCCcEEecccceEEEEEcccceeecC
Confidence 47899999986553211 36788864
No 17
>PF11777 DUF3316: Protein of unknown function (DUF3316); InterPro: IPR016879 There is currently no experimental data for members of this group or their homologues, nor do they exhibit features indicative of any function.
Probab=31.07 E-value=31 Score=22.93 Aligned_cols=15 Identities=13% Similarity=0.060 Sum_probs=10.4
Q ss_pred CchHHHHHHHHHhhc
Q psy9766 1 MIRSVFWIFLSYCLG 15 (97)
Q Consensus 1 ~~~~~~~~~~~~~~~ 15 (97)
|||.+++++++++..
T Consensus 1 MKk~~ll~~~ll~s~ 15 (114)
T PF11777_consen 1 MKKIILLASLLLLSS 15 (114)
T ss_pred CchHHHHHHHHHHHH
Confidence 888888776655543
No 18
>cd00226 PRCH Photosynthetic reaction center (RC) complex, subunit H; RC is an integral membrane protein-pigment complex which catalyzes light-induced reduction of ubiquinone to ubiquinol, generating a transmembrane electrochemical gradient of protons used to produce ATP by ATP synthase. Subunit H is positioned mainly in the cytoplasm with one transmembrane alpha helix. Provides proton transfer pathway (water channels) connecting the terminal quinone electron acceptor of RC, to the aqueous phase. Found in photosynthetic bacteria: alpha, beta, and gamma proteobacteria.
Probab=30.16 E-value=34 Score=26.43 Aligned_cols=12 Identities=33% Similarity=0.788 Sum_probs=9.1
Q ss_pred hHHHHHHHHHhh
Q psy9766 3 RSVFWIFLSYCL 14 (97)
Q Consensus 3 ~~~~~~~~~~~~ 14 (97)
-++|||||..++
T Consensus 17 l~~Fw~ffa~Li 28 (246)
T cd00226 17 IYAFWIFFAGLI 28 (246)
T ss_pred HHHHHHHHHHHH
Confidence 478999886665
No 19
>TIGR01150 puhA photosynthetic reaction center, subunit H, bacterial. This model describes the photosynthetic reaction center H subunit in non-oxygenic photosynthetic bacteria. The reaction center is an integral membrane pigment-protein that carries out light-driven electron transfer reactions. At the core of reaction center is a collection light-harvesting cofactors and closely associated polypeptides. The core protein complex is made of L, M and H subunits. The common cofactors include bacterichlorophyll, bacteriopheophytins, ubiquinone and no-heme ferrous iron. The net result of electron tranfer reactions is the establishment of proton electrochemical gradient and production of reducing equivalents in the form of NADH. Ultimately, the process results in the reduction of C02 to carbohydrates(C6H12O6) In non-oxygenic organisms, the electron donor is an organic acid rather than water. Much of our current functional understanding of photosynthesis comes from the structural determination
Probab=29.77 E-value=35 Score=26.48 Aligned_cols=12 Identities=42% Similarity=0.844 Sum_probs=9.0
Q ss_pred hHHHHHHHHHhh
Q psy9766 3 RSVFWIFLSYCL 14 (97)
Q Consensus 3 ~~~~~~~~~~~~ 14 (97)
-++|||||..++
T Consensus 17 ly~FwiFFA~Li 28 (252)
T TIGR01150 17 IYAFWIFLAGLI 28 (252)
T ss_pred HHHHHHHHHHHH
Confidence 478999886664
No 20
>COG5510 Predicted small secreted protein [Function unknown]
Probab=25.52 E-value=52 Score=19.05 Aligned_cols=14 Identities=21% Similarity=0.306 Sum_probs=8.4
Q ss_pred CchHHHHHHHHHhh
Q psy9766 1 MIRSVFWIFLSYCL 14 (97)
Q Consensus 1 ~~~~~~~~~~~~~~ 14 (97)
|+|+.++++++++.
T Consensus 2 mk~t~l~i~~vll~ 15 (44)
T COG5510 2 MKKTILLIALVLLA 15 (44)
T ss_pred chHHHHHHHHHHHH
Confidence 67766666655543
No 21
>PF11153 DUF2931: Protein of unknown function (DUF2931); InterPro: IPR021326 Some members in this family of proteins are annotated as lipoproteins however this cannot be confirmed. Currently, there is no known function.
Probab=22.02 E-value=64 Score=23.46 Aligned_cols=15 Identities=20% Similarity=0.372 Sum_probs=10.1
Q ss_pred CchHHHHHHHHHhhc
Q psy9766 1 MIRSVFWIFLSYCLG 15 (97)
Q Consensus 1 ~~~~~~~~~~~~~~~ 15 (97)
|++.+++++++++.+
T Consensus 1 mk~i~~l~l~lll~~ 15 (216)
T PF11153_consen 1 MKKILLLLLLLLLTG 15 (216)
T ss_pred ChHHHHHHHHHHHHh
Confidence 888777776555544
No 22
>cd02875 GH18_chitobiase Chitobiase (also known as di-N-acetylchitobiase) is a lysosomal glycosidase that hydrolyzes the reducing-end N-acetylglucosamine from the chitobiose core of oligosaccharides during the ordered degradation of asparagine-linked glycoproteins in eukaryotes. Chitobiase can only do so if the asparagine that joins the oligosaccharide to protein is previously removed by a glycosylasparaginase. Chitobiase is therefore the final step in the lysosomal degradation of the protein/carbohydrate linkage component of asparagine-linked glycoproteins. The catalytic domain of chitobiase is an eight-stranded alpha/beta barrel fold similar to that of other family 18 glycosyl hydrolases such as hevamine and chitotriosidase.
Probab=21.47 E-value=57 Score=25.74 Aligned_cols=13 Identities=31% Similarity=0.984 Sum_probs=8.4
Q ss_pred cCCcCCC-CCCCCC
Q psy9766 20 FSCVCNP-AECETL 32 (97)
Q Consensus 20 ~~C~C~~-~~Cp~~ 32 (97)
..|||.. +.|+|.
T Consensus 17 ~~c~~~~~~~~~p~ 30 (358)
T cd02875 17 YECPCIEPELCEPI 30 (358)
T ss_pred CCCCCCCHhhCCCc
Confidence 5698864 556654
No 23
>PF06291 Lambda_Bor: Bor protein; InterPro: IPR010438 This family consists of several Bacteriophage lambda Bor and Escherichia coli Iss proteins. Expression of bor significantly increases the survival of the E. coli host cell in animal serum. This property is a well known bacterial virulence determinant indeed, bor and its adjacent sequences are highly homologous to the iss serum resistance locus of the plasmid ColV2-K94, which confers virulence in animals. It has been suggested that lysogeny may generally have a role in bacterial survival in animal hosts, and perhaps in pathogenesis [].
Probab=20.62 E-value=83 Score=20.86 Aligned_cols=15 Identities=20% Similarity=0.144 Sum_probs=11.3
Q ss_pred CchHHHHHHHHHhhc
Q psy9766 1 MIRSVFWIFLSYCLG 15 (97)
Q Consensus 1 ~~~~~~~~~~~~~~~ 15 (97)
|||.++..++++++.
T Consensus 1 mKk~ll~~~lallLt 15 (97)
T PF06291_consen 1 MKKLLLAAALALLLT 15 (97)
T ss_pred CcHHHHHHHHHHHHc
Confidence 898888777776654
Done!