Query 029582
Match_columns 191
No_of_seqs 171 out of 730
Neff 4.5
Searched_HMMs 46136
Date Fri Mar 29 15:13:33 2013
Command hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/029582.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/029582hhsearch_cdd -cpu 12 -v 0
 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 smart00768 X8 Possibly involve 100.0   1E-31 2.2E-36  199.4   8.8   81   20-100     1-85  (85)
  2 PF07983 X8: X8 domain; Inter    99.9 1.2E-26 2.5E-31  170.3   6.9   72   20-91      1-78  (78)
  3 COG3889 Predicted solute bindi  47.6      20 0.00043   37.0   3.5   35   29-66    650-688 (872)
  4 cd04366 IlGF_insulin_bombyxin_  25.9 1.4E+02  0.0031   19.5   3.8   34   31-68      4-41  (42)
  5 PF07803 GSG-1: GSG1-like prot   22.5      57  0.0012   26.2   1.7   26    4-29     14-39  (118)
  6 PF09628 YvfG: YvfG protein;     16.5      86  0.0019   22.7   1.3    9   68-76     27-35  (68)
  7 COG3889 Predicted solute bindi  16.0   1E+02  0.0022   32.1   2.2   12   31-42    620-631 (872)
  8 PF11395 DUF2873: Protein of u   15.2 2.1E+02  0.0045   18.9   2.8   16  174-189    14-29  (43)
  9 PF13677 MotB_plug: Membrane M   14.0 2.2E+02  0.0047   19.5   2.8   20  170-189    16-36  (58)
 10 cd00101 IlGF_like Insulin/insu  13.4 3.8E+02  0.0082   17.1   3.7   36   31-68      4-40  (41)
No 1
>smart00768 X8 Possibly involved in carbohydrate binding. The X8 domain, which may be involved in carbohydrate binding, is found in an Olive pollen antigen as well as at the C terminus of family 17 glycosyl hydrolases. It contains 6 conserved cysteine residues which presumably form three disulfide bridges.
Probab=99.97 E-value=1e-31 Score=199.37 Aligned_cols=81 Identities=49% Similarity=1.077 Sum_probs=77.3
Q ss_pred cceeeCCCCCHHHHHHHHHHhhCCCCCCcccCCCCcccCCCChhhhHhHHHHHHHHHhCCCCCCCCCCCceEEee----c
Q 029582 20 LYCLCKQGLSQSVLQKAIDYACGAGADCTPILQNGVCWNPNTVQDHCNYAVNSYFQRKGQTPGSCDFAGAAATNA----A 95 (191)
Q Consensus 20 lwCVak~~~~~~~Lq~~ldyACg~g~DCs~I~~gGsCyspct~~~haSYAfNsYYQ~~~~~~~aCdF~GtAtltt----~ 95 (191)
+|||+|+++++++||++|||||++++||++|++||+||+||++++|+|||||+|||++++..++|||+|.|++++ +
T Consensus 1 ~wCv~~~~~~~~~l~~~~~yaCg~~~dC~~I~~~g~c~~~~~~~~~aS~a~N~YYq~~~~~~~aC~F~G~a~~~~~~ps~ 80 (85)
T smart00768 1 LWCVAKPDADEAALQAALDYACGQGADCTAIQPGGSCYSPNTVKAHASYAFNSYYQKQGQSSGACDFGGTATITTTDPST 80 (85)
T ss_pred CccccCCCCCHHHHHHHHHHHhcCCCCccccCCCCcccCCCCHHHHHHHHHHHHHHHcCCCCCcCCCCCceEEEecCCCC
Confidence 599999999999999999999998799999999999999999999999999999999999999999999999977 5
Q ss_pred Cceee
Q 029582 96 GGCVY 100 (191)
Q Consensus 96 gsC~f 100 (191)
++|+|
T Consensus 81 ~~C~~ 85 (85)
T smart00768 81 GSCKF 85 (85)
T ss_pred CccCC
Confidence 67875
No 2
>PF07983 X8: X8 domain; InterPro: IPR012946 The X8 domain [] contains 6 conserved cysteine residues that presumably form three disulphide bridges. The domain is found in an Olive pollen allergen [] as well as at the C terminus of family 17 glycosyl hydrolases []. This domain may be involved in carbohydrate binding.; PDB: 2JON_A 2W61_A 2W62_A 2W63_A.
Probab=99.93 E-value=1.2e-26 Score=170.27 Aligned_cols=72 Identities=40% Similarity=0.910 Sum_probs=62.8
Q ss_pred cceeeCCCCCHHHHHHHHHHhhCC-CCCCcccCCCCc-----ccCCCChhhhHhHHHHHHHHHhCCCCCCCCCCCceE
Q 029582 20 LYCLCKQGLSQSVLQKAIDYACGA-GADCTPILQNGV-----CWNPNTVQDHCNYAVNSYFQRKGQTPGSCDFAGAAA 91 (191)
Q Consensus 20 lwCVak~~~~~~~Lq~~ldyACg~-g~DCs~I~~gGs-----Cyspct~~~haSYAfNsYYQ~~~~~~~aCdF~GtAt 91 (191)
+|||+|+++++++||++|||||++ ++||++|++||+ .|++|+.++|+|||||+|||++++.+.+|||+|+|+
T Consensus 1 l~Cv~~~~~~~~~l~~~l~~aC~~~~~dC~~I~~~g~~G~YG~~S~C~~~~~lSya~N~YY~~~~~~~~~C~F~G~at 78 (78)
T PF07983_consen 1 LWCVAKPDADDKELQDLLDYACGQGGVDCSPIQPNGTTGVYGAYSMCSPRQHLSYAFNQYYQKQGRNSSACDFSGNAT 78 (78)
T ss_dssp -EEEE-TTS-HHHHHHHHHHHTTT-SSSCCCC-EETTTTEE-TTTTS-CCHHHHHHHHHHHHHHTSSCCG-SS-STEE
T ss_pred CcceeCCCCCHHHHHHHHHHHHcCCCCChhhhCCCCcccccccccCCCHHHHHHHHHHHHHHHcCCCCCcCCCCCCCC
Confidence 699999999999999999999998 589999999999 799999999999999999999999999999999986
No 3
>COG3889 Predicted solute binding protein [General function prediction only]
Probab=47.57 E-value=20 Score=36.96 Aligned_cols=35 Identities=14% Similarity=0.350 Sum_probs=21.7
Q ss_pred CHHHHHHHHHHhhCCCCCCcccCCCCc----ccCCCChhhhH
Q 029582 29 SQSVLQKAIDYACGAGADCTPILQNGV----CWNPNTVQDHC 66 (191)
Q Consensus 29 ~~~~Lq~~ldyACg~g~DCs~I~~gGs----Cyspct~~~ha 66 (191)
..+.+++++||.=+. -..+-.+|. .|.|+-.+.-+
T Consensus 650 a~a~y~a~vnf~n~~---Gh~~is~GPf~L~aydPdk~~~~~ 688 (872)
T COG3889 650 AYAAYVAAVNFINGY---GHAQISNGPFYLEAYDPDKLKPIL 688 (872)
T ss_pred HHHHHHHHHHHHhcc---CceEeccCceEEEEeCcccchHHH
Confidence 456788999998765 344555565 56665544433
No 4
>cd04366 IlGF_insulin_bombyxin_like IlGF_like family, insulin_bombyxin_like subgroup. Members include a number of peptides including insulin, insulin-like growth factors I and II, insect prothoracicotropic hormone (bombyxin), locust insulin-related peptide (LIRP), molluscan insulin-related peptides 1 to 5 (MIP), and C. elegans insulin-like peptides. With the exception of insulin-like growth factors, the active forms of these peptide hormones are composed of two chains (A and B) linked by two disulfide bonds; the arrangement of four cysteines is conserved in the "A" chain: Cys1 is linked by a disulfide bond to Cys3, Cys2 and Cys4 are linked by interchain disulfide bonds to cysteines in the "B" chain. This alignment contains both chains, plus the intervening linker region, arranged as found in the propeptide form. Propeptides are cleaved to yield two separate chains linked covalently by the two disulfide bonds.
Probab=25.90 E-value=1.4e+02 Score=19.54 Aligned_cols=34 Identities=18% Similarity=0.454 Sum_probs=23.5
Q ss_pred HHHHHHHHHhhCCCCCCcccCCCCc----ccCCCChhhhHhH
Q 029582 31 SVLQKAIDYACGAGADCTPILQNGV----CWNPNTVQDHCNY 68 (191)
Q Consensus 31 ~~Lq~~ldyACg~g~DCs~I~~gGs----Cyspct~~~haSY 68 (191)
+.|-+.+.++|+.... . ..|- |+.+|+..+=.+|
T Consensus 4 ~~L~~~L~~vC~~~~~-~---~~gIvdeCC~~~Ct~~~L~~Y 41 (42)
T cd04366 4 RHLADTLALLCSEYNS-P---RRGIVDECCRKSCTLDELLSY 41 (42)
T ss_pred HHHHHHHHHHhCCCCC-C---CCChhhccCCCcCCHHHHHhh
Confidence 5688999999986211 1 1222 8999998876665
No 5
>PF07803 GSG-1: GSG1-like protein; InterPro: IPR012478 This family contains sequences bearing similarity to a region of GSG1 (Q9Z1H7 from SWISSPROT), a protein specifically expressed in testicular germ cells []. It is possible that over expression of the human homologue may be involved in tumourigenesis of human testicular germ cell tumours []. The region in question has four highly conserved cysteine residues.
Probab=22.47 E-value=57 Score=26.19 Aligned_cols=26 Identities=19% Similarity=0.344 Sum_probs=18.7
Q ss_pred HHHHHHHHHHhCCCCccceeeCCCCC
Q 029582 4 IAYLVLFMAMTGHSTALYCLCKQGLS 29 (191)
Q Consensus 4 ~~~~~l~l~~~~~~~slwCVak~~~~ 29 (191)
|.++.|+|..+....+.||+-...+.
T Consensus 14 ln~LAL~~S~tA~~sSyWC~GTqKVp 39 (118)
T PF07803_consen 14 LNLLALAFSTTALLSSYWCEGTQKVP 39 (118)
T ss_pred HHHHHHHHHHHHHhcccccccceecC
Confidence 44566666666778889999877664
No 6
>PF09628 YvfG: YvfG protein; InterPro: IPR018590 Yvfg is a hypothetical protein of 71 residues expressed in some bacteria. The monomer consists of two parallel alpha helices, and the protein crystallises as a homo-dimer. ; PDB: 2GSV_A 2JS1_B.
Probab=16.47 E-value=86 Score=22.72 Aligned_cols=9 Identities=33% Similarity=0.774 Sum_probs=7.5
Q ss_pred HHHHHHHHH
Q 029582 68 YAVNSYFQR 76 (191)
Q Consensus 68 YAfNsYYQ~ 76 (191)
-|||+||..
T Consensus 27 ~AmNaYYr~ 35 (68)
T PF09628_consen 27 HAMNAYYRS 35 (68)
T ss_dssp HHHHHHHHH
T ss_pred HHHHHHHHH
Confidence 489999975
No 7
>COG3889 Predicted solute binding protein [General function prediction only]
Probab=16.01 E-value=1e+02 Score=32.11 Aligned_cols=12 Identities=8% Similarity=0.097 Sum_probs=5.0
Q ss_pred HHHHHHHHHhhC
Q 029582 31 SVLQKAIDYACG 42 (191)
Q Consensus 31 ~~Lq~~ldyACg 42 (191)
..+.+.|.++-.
T Consensus 620 ~~ia~vlq~~~~ 631 (872)
T COG3889 620 FSIARVLQEATT 631 (872)
T ss_pred HHHHHHHHHHhc
Confidence 334444444443
No 8
>PF11395 DUF2873: Protein of unknown function (DUF2873); InterPro: IPR021532 This entry is represented by the human SARS coronavirus, Orf7b; it is a family of uncharacterised viral proteins.
Probab=15.24 E-value=2.1e+02 Score=18.91 Aligned_cols=16 Identities=31% Similarity=0.478 Sum_probs=6.8
Q ss_pred HHHHHHHHHHHHHHHH
Q 029582 174 FSFALTLWVSCLVLLV 189 (191)
Q Consensus 174 ~~~~~~~~~~~~~~~~ 189 (191)
+++++-+++.++++||
T Consensus 14 l~~llflv~imliif~ 29 (43)
T PF11395_consen 14 LSFLLFLVIIMLIIFW 29 (43)
T ss_pred HHHHHHHHHHHHHHHH
Confidence 3344444444444444
No 9
>PF13677 MotB_plug: Membrane MotB of proton-channel complex MotA/MotB
Probab=14.03 E-value=2.2e+02 Score=19.47 Aligned_cols=20 Identities=25% Similarity=0.315 Sum_probs=12.6
Q ss_pred hh-HHHHHHHHHHHHHHHHHH
Q 029582 170 TN-FFFSFALTLWVSCLVLLV 189 (191)
Q Consensus 170 ~~-~~~~~~~~~~~~~~~~~~ 189 (191)
.| ..++=+++|++.+++||+
T Consensus 16 ~WlvtyaDlmTLLl~fFVlL~ 36 (58)
T PF13677_consen 16 RWLVTYADLMTLLLAFFVLLF 36 (58)
T ss_pred cHHHHHHHHHHHHHHHHHHHH
Confidence 44 445566777777766654
No 10
>cd00101 IlGF_like Insulin/insulin-like growth factor/relaxin family; insulin family of proteins. Members include a number of active peptides which are evolutionary related including insulin, relaxin, prorelaxin, insulin-like growth factors I and II, mammalian Leydig cell-specific insulin-like peptide (gene INSL3), early placenta insulin-like peptide (ELIP; gene INSL4), insect prothoracicotropic hormone (bombyxin), locust insulin-related peptide (LIRP), molluscan insulin-related peptides 1 to 5 (MIP), and C. elegans insulin-like peptides. Typically, the active forms of these peptide hormones are composed of two chains (A and B) linked by two disulfide bonds; the arrangement of four cysteines is conserved in the "A" chain: Cys1 is linked by a disulfide bond to Cys3, Cys2 and Cys4 are linked by interchain disulfide bonds to cysteines in the "B" chain. This alignment contains both chains, plus the intervening linker region, arranged as found in the propeptide form. Propeptides are cleaved
Probab=13.36 E-value=3.8e+02 Score=17.08 Aligned_cols=36 Identities=25% Similarity=0.565 Sum_probs=22.1
Q ss_pred HHHHHHHHHhhCC-CCCCcccCCCCcccCCCChhhhHhH
Q 029582 31 SVLQKAIDYACGA-GADCTPILQNGVCWNPNTVQDHCNY 68 (191)
Q Consensus 31 ~~Lq~~ldyACg~-g~DCs~I~~gGsCyspct~~~haSY 68 (191)
.+|-+++.++|+. +.. ..|.. -=|+.+|+..+=.+|
T Consensus 4 ~~Lv~~l~~vC~~~~~~-~giv~-eCC~~~Ct~~~L~~Y 40 (41)
T cd00101 4 RELVRALIFVCGDRGFY-RGIVD-ECCFRGCTLRELASY 40 (41)
T ss_pred HHHHHHHHHhcCCCCCc-CCccc-ccCCCCCChHHHHhh
Confidence 4678899999986 112 11110 018899988776555
Done!