Query psy16523
Match_columns 85
No_of_seqs 107 out of 425
Neff 5.2
Searched_HMMs 46136
Date Fri Aug 16 23:27:51 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy16523.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/16523hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF01130 CD36: CD36 family; I 99.9 1.4E-25 3E-30 176.4 9.5 76 1-82 43-118 (467)
2 KOG3776|consensus 99.9 9.7E-22 2.1E-26 158.7 8.0 75 1-82 54-129 (507)
3 PF02056 Glyco_hydro_4: Family 26.5 40 0.00087 24.3 1.4 13 13-25 138-150 (183)
4 PF09200 Monellin: Monellin; 21.0 51 0.0011 18.8 0.8 19 33-51 4-23 (43)
5 PF00407 Bet_v_1: Pathogenesis 20.7 2.7E+02 0.0059 18.9 4.6 38 23-66 42-85 (151)
6 PF05981 CreA: CreA protein; 19.8 1.6E+02 0.0035 20.4 3.3 43 31-78 55-97 (128)
7 PRK15076 alpha-galactosidase; 17.5 81 0.0018 25.3 1.6 15 12-26 140-154 (431)
8 PHA02757 hypothetical protein; 16.2 1.1E+02 0.0024 19.3 1.6 20 23-43 4-28 (75)
9 PF14237 DUF4339: Domain of un 16.1 65 0.0014 17.5 0.5 13 35-47 9-21 (45)
10 PF08341 Fb_signal: Fibronecti 15.8 67 0.0015 19.8 0.6 17 66-84 26-42 (72)
No 1
>PF01130 CD36: CD36 family; InterPro: IPR002159 CD36 is a transmembrane, highly glycosylated, 88kDa glycoprotein expressed by monocytes, macrophages, platelets, microvascular endothelial cells and adipose tissues. Platelet glycoprotein IV (GP IV)(GPIIIb) (CD36 antigen) is also called GPIV, OKM5-antigen or PASIV. CD36 recognises oxidized low density lipoprotein, long chain fatty acids, anionic phospholipids, collagen types I, IV and V, thrombospondin (TSP) and Plasmodium falciparum infected erythrocytes. The recognition of apoptotic neutrophils is in co-operation with TSP and avb3. Other ligands may still be unknown. CD36 is a scavenger receptor for oxidized LDL and shed photoreceptor outer segments and in recognition and phagocytosis of apoptotic cells and is the cell adhesion molecule in platelet adhesion and aggregation, platelet-monocyte and platelet-tumor cell interaction []. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; GO: 0007155 cell adhesion, 0016020 membrane
Probab=99.93 E-value=1.4e-25 Score=176.44 Aligned_cols=76 Identities=33% Similarity=0.628 Sum_probs=71.7
Q ss_pred CccCCCCcceEEEEEEeccChhhhhcCCCCCceeEEeCCeEEEeEEEeeeeEEecCCCcCceEEEEcceEEEEEeeceee
Q psy16523 1 MWRKPPVHPVIRIFIYNVTNADEFLSVPGTKPVLDEIGPYVYVLLTSFVANVIQVNKLERNSMTWSGLGFYCCLFNMELR 80 (85)
Q Consensus 1 ~W~~pp~~~~~~~y~FNvTN~eevl~g~g~kP~v~EvGPY~Y~e~~~k~ni~~~~n~~e~~t~~y~~~~~y~~~~~~~~~ 80 (85)
.|++||+|++++||+||||||+||++| |+||+|+|+|||+|+|.++|+|++|++|+ +||+|++++.|. |+.|.|
T Consensus 43 ~W~~~p~p~~~~~y~fNvTNp~ev~~~-g~kP~~~EvGPY~y~e~~~k~nv~~~~~~---~tvsY~~~~~~~--F~~~~S 116 (467)
T PF01130_consen 43 NWKKPPVPIYFKFYFFNVTNPEEVLNG-GAKPNVQEVGPYVYREYREKVNVTFNDNG---STVSYRQKRTFF--FDPELS 116 (467)
T ss_pred hhhcCCCccEEEEEEEeccCHHHHhCc-CCCceEEEeCCEEEEeeeeeeeeEEcCCC---cEEEEEeeeeEE--eccccC
Confidence 599999999999999999999999996 99999999999999999999999998873 799999999996 999988
Q ss_pred ec
Q psy16523 81 WS 82 (85)
Q Consensus 81 ~~ 82 (85)
-.
T Consensus 117 ~~ 118 (467)
T PF01130_consen 117 CG 118 (467)
T ss_pred CC
Confidence 65
No 2
>KOG3776|consensus
Probab=99.86 E-value=9.7e-22 Score=158.68 Aligned_cols=75 Identities=37% Similarity=0.622 Sum_probs=67.0
Q ss_pred CccCCCCcceEEEEEEeccChhhhhcCCCCCceeEEeCCeEEEeEEEeeeeEEecCCCcCceEEEEc-ceEEEEEeecee
Q psy16523 1 MWRKPPVHPVIRIFIYNVTNADEFLSVPGTKPVLDEIGPYVYVLLTSFVANVIQVNKLERNSMTWSG-LGFYCCLFNMEL 79 (85)
Q Consensus 1 ~W~~pp~~~~~~~y~FNvTN~eevl~g~g~kP~v~EvGPY~Y~e~~~k~ni~~~~n~~e~~t~~y~~-~~~y~~~~~~~~ 79 (85)
.|.+||++.+++||+|||||||||++| |+||+|+|+|||+|+|.+.|+|+.+.+ |+|++|.+ ++.|. ||.|.
T Consensus 54 ~W~~pp~~~~~~~yiFNvTNpde~~~~-~~kp~v~EvGPyvy~E~~~K~~~~f~~----n~tv~f~~~~k~~~--F~p~~ 126 (507)
T KOG3776|consen 54 SWLNPPYPIYLNFYIFNVTNPDEFLSG-GAKPNVQEVGPYVYREIEQKVNIDFNE----NNTVSFYQNRKTYV--FNPEE 126 (507)
T ss_pred hhhcCCCccEEEEEEEecCCHHHhhCc-CCCCceEeeCCeEEEEEecccceeecc----CCcEEEEeeccEEE--echhh
Confidence 599999999999999999999999996 999999999999999999999995555 47777777 99996 99987
Q ss_pred eec
Q psy16523 80 RWS 82 (85)
Q Consensus 80 ~~~ 82 (85)
|-.
T Consensus 127 S~~ 129 (507)
T KOG3776|consen 127 SVG 129 (507)
T ss_pred cCC
Confidence 743
No 3
>PF02056 Glyco_hydro_4: Family 4 glycosyl hydrolase; InterPro: IPR001088 O-Glycosyl hydrolases 3.2.1. from EC are a widespread group of enzymes that hydrolyse the glycosidic bond between two or more carbohydrates, or between a carbohydrate and a non-carbohydrate moiety. A classification system for glycosyl hydrolases, based on sequence similarity, has led to the definition of 85 different families [, ]. This classification is available on the CAZy (CArbohydrate-Active EnZymes) web site. Glycoside hydrolase family 4 GH4 from CAZY comprises enzymes with several known activities; 6-phospho-beta-glucosidase (3.2.1.86 from EC); 6-phospho-alpha-glucosidase (3.2.1.122 from EC); alpha-galactosidase (3.2.1.22 from EC). 6-phospho-alpha-glucosidase requires both NAD(H) and divalent metal (Mn2+, Fe2+, Co2+, or Ni2+) for activity [].; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0005975 carbohydrate metabolic process; PDB: 1U8X_X 1S6Y_A 3FEF_B 1VJT_A 1UP4_A 1UP6_C 1UP7_E 1OBB_A.
Probab=26.45 E-value=40 Score=24.30 Aligned_cols=13 Identities=23% Similarity=0.340 Sum_probs=10.1
Q ss_pred EEEEeccChhhhh
Q psy16523 13 IFIYNVTNADEFL 25 (85)
Q Consensus 13 ~y~FNvTN~eevl 25 (85)
-|++|.|||...+
T Consensus 138 Aw~iNytNP~~~v 150 (183)
T PF02056_consen 138 AWLINYTNPMGIV 150 (183)
T ss_dssp SEEEE-SSSHHHH
T ss_pred cEEEeccChHHHH
Confidence 4899999998775
No 4
>PF09200 Monellin: Monellin; InterPro: IPR015283 Monellin, a protein produced by the West African plant Dioscoreophyllum cumminsii (Serendipity berry), is approximately 70,000 times sweeter than sucrose on a molar basis. The protein adopts an alpha-beta structure, with a cystatin-like fold, where each helix packs against a coiled antiparallel beta-sheet []. ; PDB: 3PXM_A 1IV7_A 3PYJ_A 1KRL_D 4MON_B 2O9U_X 3Q2P_C 3MON_F 1MOL_B 1IV9_A ....
Probab=20.97 E-value=51 Score=18.82 Aligned_cols=19 Identities=21% Similarity=-0.080 Sum_probs=12.2
Q ss_pred eeEEeCCeEEEe-EEEeeee
Q psy16523 33 VLDEIGPYVYVL-LTSFVAN 51 (85)
Q Consensus 33 ~v~EvGPY~Y~e-~~~k~ni 51 (85)
..-|.|||+|.. ..-|-+|
T Consensus 4 ~~~e~~~~~~~s~~L~r~~i 23 (43)
T PF09200_consen 4 KGYEYQLYVYASDKLFRADI 23 (43)
T ss_dssp EEEECCCCEEECCCCCCCCC
T ss_pred eEEEeeeeeehhhhhhheec
Confidence 456899999963 3344444
No 5
>PF00407 Bet_v_1: Pathogenesis-related protein Bet v I family; InterPro: IPR000916 Allergies are hypersensitivity reactions of the immune system to specific substances called allergens (such as pollen, stings, drugs, or food) that, in most people, result in no symptoms. A nomenclature system has been established for antigens (allergens) that cause IgE-mediated atopic allergies in humans [WHO/IUIS Allergen Nomenclature Subcommittee King T.P., Hoffmann D., Loewenstein H., Marsh D.G., Platts-Mills T.A.E., Thomas W. Bull. World Health Organ. 72:797-806(1994)]. This nomenclature system is defined by a designation that is composed of the first three letters of the genus; a space; the first letter of the species name; a space and an arabic number. In the event that two species names have identical designations, they are discriminated from one another by adding one or more letters (as necessary) to each species designation. The allergens in this family include allergens with the following designations: Aln g 1, Api g 1, Bet v 1, Car b 1, Cor a 1, Dau c 1, Mal d 1 and Pru a 1. Trees within the order Fagales possess particularly potent allergens, e.g. Bet v1, the major White Birch (Betula verrucosa) pollen antigen. Bet v1 is the main cause of type I allergies observed in early spring. Type I, or immunoglobulin E-mediated (IgE-mediated) allergies affect 1 in 5 people in Europe and North America. Commonly-observed symptoms are hay fever, dermatitis, asthma and, in severe cases, anaphylactic shock. First contact with these allergens results in sensitisation; subsequent contact produces a cross-linking reaction of IgE on mast cells and concomitant release of histamine. The inevitable symptoms of an allergic reaction ensue. Recent NMR analysis [] has confirmed earlier predictions of the protein structure and site of the major T-cell epitope []. The Bet v1 protein comprises 6 anti-parallel beta-strands and 3 alpha-helices. Four of the strands dominate the global fold, and 2 of the helices form a C-terminal amphipathic helical motif. This motif is believed to be the T-cell epitope. Other proteins belonging to this family include the major pollen allergens: Aln g I from Alnus glutinosa (Alder); Api G I from Apium graveolens (Celery); Car b I from Carpinus betulus (European hornbeam); Cor a I from Corylus avellana (European hazel); Mal d I from Malus domestica (Apple). The motif is also found in: the wound-induced protein AoPR1 from Asparagus officinalis (Garden asparagus); pathogenesis-related proteins from Phaseolus vulgaris (Kidney bean) and Petroselinum crispum (Parsley) (PR1-1 and PR1-3); the disease resistance response proteins, STH-2 and STH-21, from Solanum tuberosum (Potato) and pI49, pI176 and DRRG49-C from Pisum sativum (Garden pea); the P. sativum abscisic acid-responsive proteins ABR17 and ABR18; and the stress-induced protein SAM22 from Glycine max (Soybean). ; GO: 0006952 defense response, 0009607 response to biotic stimulus; PDB: 1IFV_A 4A8V_A 4A8U_A 2K7H_A 2QIM_A 3E85_A 1H2O_A 1E09_A 1QMR_A 1FSK_D ....
Probab=20.70 E-value=2.7e+02 Score=18.93 Aligned_cols=38 Identities=21% Similarity=0.364 Sum_probs=21.6
Q ss_pred hhhcCCCCCc-e-----eEEeCCeEEEeEEEeeeeEEecCCCcCceEEEE
Q psy16523 23 EFLSVPGTKP-V-----LDEIGPYVYVLLTSFVANVIQVNKLERNSMTWS 66 (85)
Q Consensus 23 evl~g~g~kP-~-----v~EvGPY~Y~e~~~k~ni~~~~n~~e~~t~~y~ 66 (85)
|++.|.|... . +-+-||..| .++|++. .|+ ++.+++|+
T Consensus 42 e~~eGdgg~gGSIk~~~f~~~~~~~~--~Kekve~--~D~--~~~~~~y~ 85 (151)
T PF00407_consen 42 EVVEGDGGPGGSIKKWTFGPGGPFKY--VKEKVEA--IDE--ENKTITYT 85 (151)
T ss_dssp EEEESSSSTTT-EEEEEEETTSSEEE--EEEEEEE--EET--TTTEEEEE
T ss_pred EEEccCCCCCCeEEEEEecCCCCcce--eEEEEEe--ecC--CCcEEEEE
Confidence 4455555544 3 334488644 5777665 233 36888886
No 6
>PF05981 CreA: CreA protein; InterPro: IPR010292 This family consists of several bacterial CreA proteins, the function of which is unknown.
Probab=19.79 E-value=1.6e+02 Score=20.41 Aligned_cols=43 Identities=12% Similarity=0.145 Sum_probs=28.5
Q ss_pred CceeEEeCCeEEEeEEEeeeeEEecCCCcCceEEEEcceEEEEEeece
Q psy16523 31 KPVLDEIGPYVYVLLTSFVANVIQVNKLERNSMTWSGLGFYCCLFNME 78 (85)
Q Consensus 31 kP~v~EvGPY~Y~e~~~k~ni~~~~n~~e~~t~~y~~~~~y~~~~~~~ 78 (85)
...-+|.||.+..+...+-..+|.. ..++-|+..+... +||++
T Consensus 55 SisCrQtGpI~~~~~~~~ge~VF~e----~~Sl~FK~l~V~R-i~D~~ 97 (128)
T PF05981_consen 55 SISCRQTGPITIKDKSPKGEEVFSE----SKSLLFKSLRVVR-IYDAK 97 (128)
T ss_pred eEEEEEeccEeeCCCCCCcCeeEec----ccceeeeeEEEEE-EEcCC
Confidence 3457899999998766666666654 3667776666654 45543
No 7
>PRK15076 alpha-galactosidase; Provisional
Probab=17.54 E-value=81 Score=25.31 Aligned_cols=15 Identities=13% Similarity=0.171 Sum_probs=12.8
Q ss_pred EEEEEeccChhhhhc
Q psy16523 12 RIFIYNVTNADEFLS 26 (85)
Q Consensus 12 ~~y~FNvTN~eevl~ 26 (85)
+-++.|+|||.+++.
T Consensus 140 ~a~iin~tNP~divt 154 (431)
T PRK15076 140 DALLLNYVNPMAMNT 154 (431)
T ss_pred CeEEEEcCChHHHHH
Confidence 568999999998873
No 8
>PHA02757 hypothetical protein; Provisional
Probab=16.16 E-value=1.1e+02 Score=19.28 Aligned_cols=20 Identities=40% Similarity=0.788 Sum_probs=15.5
Q ss_pred hhhcCCCCCc-----eeEEeCCeEEE
Q psy16523 23 EFLSVPGTKP-----VLDEIGPYVYV 43 (85)
Q Consensus 23 evl~g~g~kP-----~v~EvGPY~Y~ 43 (85)
|+|.| .+|| .+.-.||+-|.
T Consensus 4 EflkG-SskPprnvCViTPsgpfDFn 28 (75)
T PHA02757 4 EFLKG-SSKPPRNVCVITPSGPFDFN 28 (75)
T ss_pred hhhcC-CCCCCCCEEEEeCCCCccEE
Confidence 68886 6676 47788999987
No 9
>PF14237 DUF4339: Domain of unknown function (DUF4339)
Probab=16.08 E-value=65 Score=17.51 Aligned_cols=13 Identities=23% Similarity=0.470 Sum_probs=10.2
Q ss_pred EEeCCeEEEeEEE
Q psy16523 35 DEIGPYVYVLLTS 47 (85)
Q Consensus 35 ~EvGPY~Y~e~~~ 47 (85)
++.|||+..+-+.
T Consensus 9 ~~~GP~s~~el~~ 21 (45)
T PF14237_consen 9 QQQGPFSLEELRQ 21 (45)
T ss_pred eEECCcCHHHHHH
Confidence 6899999877554
No 10
>PF08341 Fb_signal: Fibronectin-binding protein signal sequence; InterPro: IPR013552 This domain is found near the N terminus of fibronectin-binding proteins in Streptococcus where it functions as a signal sequence []. ; PDB: 2XI9_A 2XID_B 2XIC_B.
Probab=15.84 E-value=67 Score=19.84 Aligned_cols=17 Identities=35% Similarity=0.853 Sum_probs=0.0
Q ss_pred EcceEEEEEeeceeeeccc
Q psy16523 66 SGLGFYCCLFNMELRWSLT 84 (85)
Q Consensus 66 ~~~~~y~~~~~~~~~~~~~ 84 (85)
+...+|| ||....||.+
T Consensus 26 ~~~~VYC--FN~~~~~P~~ 42 (72)
T PF08341_consen 26 KSYQVYC--FNRDKHWPET 42 (72)
T ss_dssp -EEEEEE--S-TTSB-S-E
T ss_pred ceEEEEE--cCCcCCCCCc
Done!