Query 029395
Match_columns 194
No_of_seqs 172 out of 750
Neff 4.8
Searched_HMMs 46136
Date Fri Mar 29 12:14:26 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/029395.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/029395hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 smart00768 X8 Possibly involve 100.0 2E-33 4.4E-38 208.6 8.8 85 5-89 1-85 (85)
2 PF07983 X8: X8 domain; Inter 99.9 3.2E-27 7E-32 173.2 6.8 72 5-76 1-78 (78)
3 COG3889 Predicted solute bindi 41.2 15 0.00033 37.7 1.6 31 15-48 651-685 (872)
4 cd04366 IlGF_insulin_bombyxin_ 36.9 55 0.0012 21.4 3.2 34 16-53 4-41 (42)
5 COG3889 Predicted solute bindi 35.6 20 0.00044 36.8 1.5 13 16-28 620-632 (872)
6 PF05283 MGC-24: Multi-glycosy 35.1 2.6E+02 0.0056 23.9 7.9 12 175-186 164-175 (186)
7 cd04368 IlGF IlGF, insulin_lik 26.7 1E+02 0.0022 22.2 3.5 38 16-53 8-59 (67)
8 cd00101 IlGF_like Insulin/insu 21.5 1.5E+02 0.0032 18.9 3.2 36 16-53 4-40 (41)
9 KOG3679 Predicted coiled-coil 18.2 83 0.0018 30.6 2.1 29 49-77 529-557 (802)
10 PF13677 MotB_plug: Membrane M 14.5 1.2E+02 0.0026 20.7 1.6 15 179-193 22-36 (58)
No 1
>smart00768 X8 Possibly involved in carbohydrate binding. The X8 domain, which may be involved in carbohydrate binding, is found in an Olive pollen antigen as well as at the C terminus of family 17 glycosyl hydrolases. It contains 6 conserved cysteine residues which presumably form three disulfide bridges.
Probab=100.00 E-value=2e-33 Score=208.57 Aligned_cols=85 Identities=60% Similarity=1.187 Sum_probs=82.7
Q ss_pred cceeecCCCChHHHHHHHHHhcCCCCCCcccCCCCCccCCCCcchhhhHHHHHHHHHhCCCCCCCCCCCceEEEecCCCC
Q 029395 5 NWCVCKDGVGDPVLQKALDYACGAGADCNPIHSNGPCYNPNTVKAHCSYAVNSYFQRKGQAQGSCDFSGSATVATTDPST 84 (194)
Q Consensus 5 lwCVak~~~~~~~lq~~ldyACg~gvDCs~I~~nG~cyspct~~~~lSyA~N~YYq~~~~~~~aCdF~G~Atv~t~dPS~ 84 (194)
+|||+|+++++++||++|||||++++||++|++||+||+||++++|||||||+|||++++..++|||+|.|++++.||+.
T Consensus 1 ~wCv~~~~~~~~~l~~~~~yaCg~~~dC~~I~~~g~c~~~~~~~~~aS~a~N~YYq~~~~~~~aC~F~G~a~~~~~~ps~ 80 (85)
T smart00768 1 LWCVAKPDADEAALQAALDYACGQGADCTAIQPGGSCYSPNTVKAHASYAFNSYYQKQGQSSGACDFGGTATITTTDPST 80 (85)
T ss_pred CccccCCCCCHHHHHHHHHHHhcCCCCccccCCCCcccCCCCHHHHHHHHHHHHHHHcCCCCCcCCCCCceEEEecCCCC
Confidence 69999999999999999999999889999999999999999999999999999999999999999999999999999999
Q ss_pred Ccccc
Q 029395 85 AGCSY 89 (194)
Q Consensus 85 ~sC~f 89 (194)
++|+|
T Consensus 81 ~~C~~ 85 (85)
T smart00768 81 GSCKF 85 (85)
T ss_pred CccCC
Confidence 99985
No 2
>PF07983 X8: X8 domain; InterPro: IPR012946 The X8 domain [] contains 6 conserved cysteine residues that presumably form three disulphide bridges. The domain is found in an Olive pollen allergen [] as well as at the C terminus of family 17 glycosyl hydrolases []. This domain may be involved in carbohydrate binding.; PDB: 2JON_A 2W61_A 2W62_A 2W63_A.
Probab=99.94 E-value=3.2e-27 Score=173.21 Aligned_cols=72 Identities=50% Similarity=1.005 Sum_probs=63.0
Q ss_pred cceeecCCCChHHHHHHHHHhcCC-CCCCcccCCCCC-----ccCCCCcchhhhHHHHHHHHHhCCCCCCCCCCCceE
Q 029395 5 NWCVCKDGVGDPVLQKALDYACGA-GADCNPIHSNGP-----CYNPNTVKAHCSYAVNSYFQRKGQAQGSCDFSGSAT 76 (194)
Q Consensus 5 lwCVak~~~~~~~lq~~ldyACg~-gvDCs~I~~nG~-----cyspct~~~~lSyA~N~YYq~~~~~~~aCdF~G~At 76 (194)
||||+|+++++++|+++|||||++ ++||++|++|++ .||+|+.++|||||||+|||++++...+|||+|.|+
T Consensus 1 l~Cv~~~~~~~~~l~~~l~~aC~~~~~dC~~I~~~g~~G~YG~~S~C~~~~~lSya~N~YY~~~~~~~~~C~F~G~at 78 (78)
T PF07983_consen 1 LWCVAKPDADDKELQDLLDYACGQGGVDCSPIQPNGTTGVYGAYSMCSPRQHLSYAFNQYYQKQGRNSSACDFSGNAT 78 (78)
T ss_dssp -EEEE-TTS-HHHHHHHHHHHTTT-SSSCCCC-EETTTTEE-TTTTS-CCHHHHHHHHHHHHHHTSSCCG-SS-STEE
T ss_pred CcceeCCCCCHHHHHHHHHHHHcCCCCChhhhCCCCcccccccccCCCHHHHHHHHHHHHHHHcCCCCCcCCCCCCCC
Confidence 799999999999999999999998 599999999999 899999999999999999999999999999999986
No 3
>COG3889 Predicted solute binding protein [General function prediction only]
Probab=41.25 E-value=15 Score=37.73 Aligned_cols=31 Identities=29% Similarity=0.538 Sum_probs=19.6
Q ss_pred hHHHHHHHHHhcCCCCCCcccCCCCC----ccCCCCcc
Q 029395 15 DPVLQKALDYACGAGADCNPIHSNGP----CYNPNTVK 48 (194)
Q Consensus 15 ~~~lq~~ldyACg~gvDCs~I~~nG~----cyspct~~ 48 (194)
.+.|++++||+=+. -..+..+|. .|.|+-.+
T Consensus 651 ~a~y~a~vnf~n~~---Gh~~is~GPf~L~aydPdk~~ 685 (872)
T COG3889 651 YAAYVAAVNFINGY---GHAQISNGPFYLEAYDPDKLK 685 (872)
T ss_pred HHHHHHHHHHHhcc---CceEeccCceEEEEeCcccch
Confidence 35688899998865 345556665 45555433
No 4
>cd04366 IlGF_insulin_bombyxin_like IlGF_like family, insulin_bombyxin_like subgroup. Members include a number of peptides including insulin, insulin-like growth factors I and II, insect prothoracicotropic hormone (bombyxin), locust insulin-related peptide (LIRP), molluscan insulin-related peptides 1 to 5 (MIP), and C. elegans insulin-like peptides. With the exception of insulin-like growth factors, the active forms of these peptide hormones are composed of two chains (A and B) linked by two disulfide bonds; the arrangement of four cysteines is conserved in the "A" chain: Cys1 is linked by a disulfide bond to Cys3, Cys2 and Cys4 are linked by interchain disulfide bonds to cysteines in the "B" chain. This alignment contains both chains, plus the intervening linker region, arranged as found in the propeptide form. Propeptides are cleaved to yield two separate chains linked covalently by the two disulfide bonds.
Probab=36.90 E-value=55 Score=21.44 Aligned_cols=34 Identities=24% Similarity=0.357 Sum_probs=23.9
Q ss_pred HHHHHHHHHhcCCCCCCcccCCCCC----ccCCCCcchhhhH
Q 029395 16 PVLQKALDYACGAGADCNPIHSNGP----CYNPNTVKAHCSY 53 (194)
Q Consensus 16 ~~lq~~ldyACg~gvDCs~I~~nG~----cyspct~~~~lSy 53 (194)
++|.+.|.++|+.... . ..|- ||.+|++.+-.+|
T Consensus 4 ~~L~~~L~~vC~~~~~-~---~~gIvdeCC~~~Ct~~~L~~Y 41 (42)
T cd04366 4 RHLADTLALLCSEYNS-P---RRGIVDECCRKSCTLDELLSY 41 (42)
T ss_pred HHHHHHHHHHhCCCCC-C---CCChhhccCCCcCCHHHHHhh
Confidence 5788999999986211 1 1222 8999998887766
No 5
>COG3889 Predicted solute binding protein [General function prediction only]
Probab=35.60 E-value=20 Score=36.81 Aligned_cols=13 Identities=15% Similarity=0.090 Sum_probs=7.2
Q ss_pred HHHHHHHHHhcCC
Q 029395 16 PVLQKALDYACGA 28 (194)
Q Consensus 16 ~~lq~~ldyACg~ 28 (194)
+.+.+.|.++-.+
T Consensus 620 ~~ia~vlq~~~~q 632 (872)
T COG3889 620 FSIARVLQEATTQ 632 (872)
T ss_pred HHHHHHHHHHhcc
Confidence 4455566666554
No 6
>PF05283 MGC-24: Multi-glycosylated core protein 24 (MGC-24); InterPro: IPR007947 CD164 is a mucin-like receptor, or sialomucin, with specificity in receptor/ ligand interactions that depends on the structural characteristics of the mucin-like receptor. Its functions include mediating, or regulating, haematopoietic progenitor cell adhesion and the negative regulation of their growth and/or-differentiation. It exists in the native state as a disulphide- linked homodimer of two 80-85kDa subunits. It is usually expressed by CD34+ and CD341o/- haematopoietic stem cells and associated microenvironmental cells. It contains, in its extracellular region, two mucin domains (I and II) linked by a non-mucin domain, which has been predicted to contain intra- disulphide bridges. This receptor may play a key role in haematopoiesis by facilitating the adhesion of human CD34+ cells to bone marrow stroma and by negatively regulating CD34+ CD341o/- haematopoietic progenitor cell proliferation. These effects involve the CD164 class I and/or II epitopes recognised by the monoclonal antibodies (mAbs) 105A5 and 103B2/9E10. These epitopes are carbohydrate-dependent and are located on the N-terminal mucin domain I [, ]. It has been found that murine MGC-24v and rat endolyn share significant sequence similarities with human CD164. However, CD164 lacks the consensus glycosaminoglycan (GAG)-attachment site found in MGC-24; it is possible that GAG-association is responsible for the high molecular weight of the epithelial-derived MGC-24 glycoprotein []. Genomic structure studies have placed CD164 within the mucin-subgroup that comprises multiple exons, and demonstrate the diverse chromosomal distribution of this family of molecules. Molecules with such multiple exons may have sophisticated regulatory mechanisms that involve not only post-translational modifications of the oligosaccharide side chains, but also differential exon usage. Although differences in the intron and exon sizes are seen between the mouse and human genes, the predicted proteins are similar in size and structure, maintaining functionally important motifs that regulate cell proliferation or subcellular distribution []. CD164 is a gene whose expression depends on differential usage of poly- adenylation sites within the 3'-UTR. The conserved distribution of the 3.2- and 1.2-kb CD164 transcripts between mouse and human suggests that (i) a mechanism may exist to regulate tissue-specific polyadenylation, and (ii) differences in polyadenylation are important for the expression and function of CD164 in different tissues. Two other aspects of the structure of CD164 are of particular interest. First, it shares one of several conserved features of a cytokine-binding pocket - in this respect, it is notable that evidence exists for a class of cell-surface sialomucin modulators that directly interact with growth factor receptors to regulate their response to physiological ligands. Second, its cytoplasmic tail contains a C-terminal YHTL motif found in many endocytic membrane proteins or receptors. These Tyr-based motifs bind to adaptor proteins, which mediate the sorting of membrane proteins into transport vesicles from the plasma membrane to the endosomes, and between intracellular compartments.
Probab=35.11 E-value=2.6e+02 Score=23.88 Aligned_cols=12 Identities=8% Similarity=0.074 Sum_probs=4.8
Q ss_pred chhHHHHHHHHH
Q 029395 175 NSALSFLVVLLF 186 (194)
Q Consensus 175 ~~~~~~~~~~~~ 186 (194)
-.|++|.+-+|.
T Consensus 164 iGGIVL~LGv~a 175 (186)
T PF05283_consen 164 IGGIVLTLGVLA 175 (186)
T ss_pred hhHHHHHHHHHH
Confidence 344444433333
No 7
>cd04368 IlGF IlGF, insulin_like growth factors; specific to vertebrates. Members include a number of peptides including insulin-like growth factors I and II, which play a variety of roles in controlling processes such as growth, differentiation, and reproduction. On a cellular level they affect cell cycle, apoptosis, cell migration, proliferation, and differentiation. Typically, the active forms of these peptide hormones are single chains cross-linked by three disulfide bonds.
Probab=26.69 E-value=1e+02 Score=22.23 Aligned_cols=38 Identities=21% Similarity=0.454 Sum_probs=23.9
Q ss_pred HHHHHHHHHhcCC-CCCCcccC---------CCCC----ccCCCCcchhhhH
Q 029395 16 PVLQKALDYACGA-GADCNPIH---------SNGP----CYNPNTVKAHCSY 53 (194)
Q Consensus 16 ~~lq~~ldyACg~-gvDCs~I~---------~nG~----cyspct~~~~lSy 53 (194)
.+|.+.|.++|+. |.-=.+-. ..|- |+.+|++.+-.+|
T Consensus 8 ~~Lvd~L~~vCg~RGf~~~~~~~~~~~~rr~~rGIVdeCC~~~Ctl~~Le~Y 59 (67)
T cd04368 8 GELVDTLQFVCGDRGFYFSKPTGYGSSRRRPNRGIVEECCFRSCDLRLLEMY 59 (67)
T ss_pred HHHHHHHHHHhCCCCcccCCCcccccccccccCCchhhhcCCcCCHHHHHHh
Confidence 5789999999986 32111110 1111 8999988776665
No 8
>cd00101 IlGF_like Insulin/insulin-like growth factor/relaxin family; insulin family of proteins. Members include a number of active peptides which are evolutionary related including insulin, relaxin, prorelaxin, insulin-like growth factors I and II, mammalian Leydig cell-specific insulin-like peptide (gene INSL3), early placenta insulin-like peptide (ELIP; gene INSL4), insect prothoracicotropic hormone (bombyxin), locust insulin-related peptide (LIRP), molluscan insulin-related peptides 1 to 5 (MIP), and C. elegans insulin-like peptides. Typically, the active forms of these peptide hormones are composed of two chains (A and B) linked by two disulfide bonds; the arrangement of four cysteines is conserved in the "A" chain: Cys1 is linked by a disulfide bond to Cys3, Cys2 and Cys4 are linked by interchain disulfide bonds to cysteines in the "B" chain. This alignment contains both chains, plus the intervening linker region, arranged as found in the propeptide form. Propeptides are cleaved
Probab=21.50 E-value=1.5e+02 Score=18.94 Aligned_cols=36 Identities=31% Similarity=0.518 Sum_probs=22.8
Q ss_pred HHHHHHHHHhcCC-CCCCcccCCCCCccCCCCcchhhhH
Q 029395 16 PVLQKALDYACGA-GADCNPIHSNGPCYNPNTVKAHCSY 53 (194)
Q Consensus 16 ~~lq~~ldyACg~-gvDCs~I~~nG~cyspct~~~~lSy 53 (194)
.+|.+++.++|+. +.. ..|.. -=|+.+|+.++-+.|
T Consensus 4 ~~Lv~~l~~vC~~~~~~-~giv~-eCC~~~Ct~~~L~~Y 40 (41)
T cd00101 4 RELVRALIFVCGDRGFY-RGIVD-ECCFRGCTLRELASY 40 (41)
T ss_pred HHHHHHHHHhcCCCCCc-CCccc-ccCCCCCChHHHHhh
Confidence 4788999999986 222 11110 018999988776655
No 9
>KOG3679 consensus Predicted coiled-coil protein [General function prediction only]
Probab=18.19 E-value=83 Score=30.55 Aligned_cols=29 Identities=28% Similarity=0.600 Sum_probs=23.6
Q ss_pred hhhhHHHHHHHHHhCCCCCCCCCCCceEE
Q 029395 49 AHCSYAVNSYFQRKGQAQGSCDFSGSATV 77 (194)
Q Consensus 49 ~~lSyA~N~YYq~~~~~~~aCdF~G~Atv 77 (194)
..+-.-+..||++|+-.+-+|.|.|.--+
T Consensus 529 nelilrlqeyfekqgvkdfacsfsgsipl 557 (802)
T KOG3679|consen 529 NELILRLQEYFEKQGVKDFACSFSGSIPL 557 (802)
T ss_pred HHHHHHHHHHHHHcCcceeeeeccCCccH
Confidence 34556678999999999999999996543
No 10
>PF13677 MotB_plug: Membrane MotB of proton-channel complex MotA/MotB
Probab=14.49 E-value=1.2e+02 Score=20.73 Aligned_cols=15 Identities=33% Similarity=0.412 Sum_probs=9.0
Q ss_pred HHHHHHHHHHhhhhc
Q 029395 179 SFLVVLLFSGSMLLW 193 (194)
Q Consensus 179 ~~~~~~~~~~~~~~~ 193 (194)
+=+.++|+.+|++||
T Consensus 22 aDlmTLLl~fFVlL~ 36 (58)
T PF13677_consen 22 ADLMTLLLAFFVLLF 36 (58)
T ss_pred HHHHHHHHHHHHHHH
Confidence 334666666666655
Done!