Query         029582
Match_columns 191
No_of_seqs    171 out of 730
Neff          4.5 
Searched_HMMs 46136
Date          Fri Mar 29 15:13:33 2013
Command       hhsearch -i /work/01045/syshi/csienesis_hhblits_a3m/029582.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/029582hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 smart00768 X8 Possibly involve 100.0   1E-31 2.2E-36  199.4   8.8   81   20-100     1-85  (85)
  2 PF07983 X8:  X8 domain;  Inter  99.9 1.2E-26 2.5E-31  170.3   6.9   72   20-91      1-78  (78)
  3 COG3889 Predicted solute bindi  47.6      20 0.00043   37.0   3.5   35   29-66    650-688 (872)
  4 cd04366 IlGF_insulin_bombyxin_  25.9 1.4E+02  0.0031   19.5   3.8   34   31-68      4-41  (42)
  5 PF07803 GSG-1:  GSG1-like prot  22.5      57  0.0012   26.2   1.7   26    4-29     14-39  (118)
  6 PF09628 YvfG:  YvfG protein;    16.5      86  0.0019   22.7   1.3    9   68-76     27-35  (68)
  7 COG3889 Predicted solute bindi  16.0   1E+02  0.0022   32.1   2.2   12   31-42    620-631 (872)
  8 PF11395 DUF2873:  Protein of u  15.2 2.1E+02  0.0045   18.9   2.8   16  174-189    14-29  (43)
  9 PF13677 MotB_plug:  Membrane M  14.0 2.2E+02  0.0047   19.5   2.8   20  170-189    16-36  (58)
 10 cd00101 IlGF_like Insulin/insu  13.4 3.8E+02  0.0082   17.1   3.7   36   31-68      4-40  (41)

No 1  
>smart00768 X8 Possibly involved in carbohydrate binding. The X8 domain, which may be involved in carbohydrate binding, is found in an Olive pollen antigen as well as at the C terminus of family 17 glycosyl hydrolases. It contains 6 conserved cysteine residues which presumably form three disulfide bridges.
Probab=99.97  E-value=1e-31  Score=199.37  Aligned_cols=81  Identities=49%  Similarity=1.077  Sum_probs=77.3

Q ss_pred             cceeeCCCCCHHHHHHHHHHhhCCCCCCcccCCCCcccCCCChhhhHhHHHHHHHHHhCCCCCCCCCCCceEEee----c
Q 029582           20 LYCLCKQGLSQSVLQKAIDYACGAGADCTPILQNGVCWNPNTVQDHCNYAVNSYFQRKGQTPGSCDFAGAAATNA----A   95 (191)
Q Consensus        20 lwCVak~~~~~~~Lq~~ldyACg~g~DCs~I~~gGsCyspct~~~haSYAfNsYYQ~~~~~~~aCdF~GtAtltt----~   95 (191)
                      +|||+|+++++++||++|||||++++||++|++||+||+||++++|+|||||+|||++++..++|||+|.|++++    +
T Consensus         1 ~wCv~~~~~~~~~l~~~~~yaCg~~~dC~~I~~~g~c~~~~~~~~~aS~a~N~YYq~~~~~~~aC~F~G~a~~~~~~ps~   80 (85)
T smart00768        1 LWCVAKPDADEAALQAALDYACGQGADCTAIQPGGSCYSPNTVKAHASYAFNSYYQKQGQSSGACDFGGTATITTTDPST   80 (85)
T ss_pred             CccccCCCCCHHHHHHHHHHHhcCCCCccccCCCCcccCCCCHHHHHHHHHHHHHHHcCCCCCcCCCCCceEEEecCCCC
Confidence            599999999999999999999998799999999999999999999999999999999999999999999999977    5


Q ss_pred             Cceee
Q 029582           96 GGCVY  100 (191)
Q Consensus        96 gsC~f  100 (191)
                      ++|+|
T Consensus        81 ~~C~~   85 (85)
T smart00768       81 GSCKF   85 (85)
T ss_pred             CccCC
Confidence            67875


No 2  
>PF07983 X8:  X8 domain;  InterPro: IPR012946 The X8 domain contains 6 conserved cysteine residues that presumably form three disulphide bridges. The domain is found in an Olive pollen allergen as well as at the C terminus of family 17 glycosyl hydrolases. This domain may be involved in carbohydrate binding.; PDB: 2JON_A 2W61_A 2W62_A 2W63_A.
Probab=99.93  E-value=1.2e-26  Score=170.27  Aligned_cols=72  Identities=40%  Similarity=0.910  Sum_probs=62.8

Q ss_pred             cceeeCCCCCHHHHHHHHHHhhCC-CCCCcccCCCCc-----ccCCCChhhhHhHHHHHHHHHhCCCCCCCCCCCceE
Q 029582           20 LYCLCKQGLSQSVLQKAIDYACGA-GADCTPILQNGV-----CWNPNTVQDHCNYAVNSYFQRKGQTPGSCDFAGAAA   91 (191)
Q Consensus        20 lwCVak~~~~~~~Lq~~ldyACg~-g~DCs~I~~gGs-----Cyspct~~~haSYAfNsYYQ~~~~~~~aCdF~GtAt   91 (191)
                      +|||+|+++++++||++|||||++ ++||++|++||+     .|++|+.++|+|||||+|||++++.+.+|||+|+|+
T Consensus         1 l~Cv~~~~~~~~~l~~~l~~aC~~~~~dC~~I~~~g~~G~YG~~S~C~~~~~lSya~N~YY~~~~~~~~~C~F~G~at   78 (78)
T PF07983_consen    1 LWCVAKPDADDKELQDLLDYACGQGGVDCSPIQPNGTTGVYGAYSMCSPRQHLSYAFNQYYQKQGRNSSACDFSGNAT   78 (78)
T ss_dssp             -EEEE-TTS-HHHHHHHHHHHTTT-SSSCCCC-EETTTTEE-TTTTS-CCHHHHHHHHHHHHHHTSSCCG-SS-STEE
T ss_pred             CcceeCCCCCHHHHHHHHHHHHcCCCCChhhhCCCCcccccccccCCCHHHHHHHHHHHHHHHcCCCCCcCCCCCCCC
Confidence            699999999999999999999998 589999999999     799999999999999999999999999999999986


No 3  
>COG3889 Predicted solute binding protein [General function prediction only]
Probab=47.57  E-value=20  Score=36.96  Aligned_cols=35  Identities=14%  Similarity=0.350  Sum_probs=21.7

Q ss_pred             CHHHHHHHHHHhhCCCCCCcccCCCCc----ccCCCChhhhH
Q 029582           29 SQSVLQKAIDYACGAGADCTPILQNGV----CWNPNTVQDHC   66 (191)
Q Consensus        29 ~~~~Lq~~ldyACg~g~DCs~I~~gGs----Cyspct~~~ha   66 (191)
                      ..+.+++++||.=+.   -..+-.+|.    .|.|+-.+.-+
T Consensus       650 a~a~y~a~vnf~n~~---Gh~~is~GPf~L~aydPdk~~~~~  688 (872)
T COG3889         650 AYAAYVAAVNFINGY---GHAQISNGPFYLEAYDPDKLKPIL  688 (872)
T ss_pred             HHHHHHHHHHHHhcc---CceEeccCceEEEEeCcccchHHH
Confidence            456788999998765   344555565    56665544433


No 4  
>cd04366 IlGF_insulin_bombyxin_like IlGF_like family, insulin_bombyxin_like subgroup. Members include a number of peptides including insulin, insulin-like growth factors I and II, insect prothoracicotropic hormone (bombyxin), locust insulin-related peptide (LIRP), molluscan insulin-related peptides 1 to 5 (MIP), and C. elegans insulin-like peptides. With the exception of insulin-like growth factors, the active forms of these peptide hormones are composed of two chains (A and B) linked by two disulfide bonds; the arrangement of four cysteines is conserved in the "A" chain:  Cys1 is linked by a disulfide bond to Cys3, Cys2 and Cys4 are linked by interchain disulfide bonds to cysteines in the "B" chain. This alignment contains both chains, plus the intervening linker region, arranged as found in the propeptide form. Propeptides are cleaved to yield two separate chains linked covalently by the two disulfide bonds.
Probab=25.90  E-value=1.4e+02  Score=19.54  Aligned_cols=34  Identities=18%  Similarity=0.454  Sum_probs=23.5

Q ss_pred             HHHHHHHHHhhCCCCCCcccCCCCc----ccCCCChhhhHhH
Q 029582           31 SVLQKAIDYACGAGADCTPILQNGV----CWNPNTVQDHCNY   68 (191)
Q Consensus        31 ~~Lq~~ldyACg~g~DCs~I~~gGs----Cyspct~~~haSY   68 (191)
                      +.|-+.+.++|+.... .   ..|-    |+.+|+..+=.+|
T Consensus         4 ~~L~~~L~~vC~~~~~-~---~~gIvdeCC~~~Ct~~~L~~Y   41 (42)
T cd04366           4 RHLADTLALLCSEYNS-P---RRGIVDECCRKSCTLDELLSY   41 (42)
T ss_pred             HHHHHHHHHHhCCCCC-C---CCChhhccCCCcCCHHHHHhh
Confidence            5688999999986211 1   1222    8999998876665


No 5  
>PF07803 GSG-1:  GSG1-like protein;  InterPro: IPR012478 This family contains sequences bearing similarity to a region of GSG1 (Q9Z1H7 from SWISSPROT), a protein specifically expressed in testicular germ cells. It is possible that overexpression of the human homologue may be involved in tumourigenesis of human testicular germ cell tumours. The region in question has four highly conserved cysteine residues.
Probab=22.47  E-value=57  Score=26.19  Aligned_cols=26  Identities=19%  Similarity=0.344  Sum_probs=18.7

Q ss_pred             HHHHHHHHHHhCCCCccceeeCCCCC
Q 029582            4 IAYLVLFMAMTGHSTALYCLCKQGLS   29 (191)
Q Consensus         4 ~~~~~l~l~~~~~~~slwCVak~~~~   29 (191)
                      |.++.|+|..+....+.||+-...+.
T Consensus        14 ln~LAL~~S~tA~~sSyWC~GTqKVp   39 (118)
T PF07803_consen   14 LNLLALAFSTTALLSSYWCEGTQKVP   39 (118)
T ss_pred             HHHHHHHHHHHHHhcccccccceecC
Confidence            44566666666778889999877664


No 6  
>PF09628 YvfG:  YvfG protein;  InterPro: IPR018590 YvfG is a hypothetical protein of 71 residues expressed in some bacteria. The monomer consists of two parallel alpha helices, and the protein crystallises as a homo-dimer.; PDB: 2GSV_A 2JS1_B.
Probab=16.47  E-value=86  Score=22.72  Aligned_cols=9  Identities=33%  Similarity=0.774  Sum_probs=7.5

Q ss_pred             HHHHHHHHH
Q 029582           68 YAVNSYFQR   76 (191)
Q Consensus        68 YAfNsYYQ~   76 (191)
                      -|||+||..
T Consensus        27 ~AmNaYYr~   35 (68)
T PF09628_consen   27 HAMNAYYRS   35 (68)
T ss_dssp             HHHHHHHHH
T ss_pred             HHHHHHHHH
Confidence            489999975


No 7  
>COG3889 Predicted solute binding protein [General function prediction only]
Probab=16.01  E-value=1e+02  Score=32.11  Aligned_cols=12  Identities=8%  Similarity=0.097  Sum_probs=5.0

Q ss_pred             HHHHHHHHHhhC
Q 029582           31 SVLQKAIDYACG   42 (191)
Q Consensus        31 ~~Lq~~ldyACg   42 (191)
                      ..+.+.|.++-.
T Consensus       620 ~~ia~vlq~~~~  631 (872)
T COG3889         620 FSIARVLQEATT  631 (872)
T ss_pred             HHHHHHHHHHhc
Confidence            334444444443


No 8  
>PF11395 DUF2873:  Protein of unknown function (DUF2873);  InterPro: IPR021532 This entry is represented by the human SARS coronavirus, Orf7b; it is a family of uncharacterised viral proteins.
Probab=15.24  E-value=2.1e+02  Score=18.91  Aligned_cols=16  Identities=31%  Similarity=0.478  Sum_probs=6.8

Q ss_pred             HHHHHHHHHHHHHHHH
Q 029582          174 FSFALTLWVSCLVLLV  189 (191)
Q Consensus       174 ~~~~~~~~~~~~~~~~  189 (191)
                      +++++-+++.++++||
T Consensus        14 l~~llflv~imliif~   29 (43)
T PF11395_consen   14 LSFLLFLVIIMLIIFW   29 (43)
T ss_pred             HHHHHHHHHHHHHHHH
Confidence            3344444444444444


No 9  
>PF13677 MotB_plug:  Membrane MotB of proton-channel complex MotA/MotB 
Probab=14.03  E-value=2.2e+02  Score=19.47  Aligned_cols=20  Identities=25%  Similarity=0.315  Sum_probs=12.6

Q ss_pred             hh-HHHHHHHHHHHHHHHHHH
Q 029582          170 TN-FFFSFALTLWVSCLVLLV  189 (191)
Q Consensus       170 ~~-~~~~~~~~~~~~~~~~~~  189 (191)
                      .| ..++=+++|++.+++||+
T Consensus        16 ~WlvtyaDlmTLLl~fFVlL~   36 (58)
T PF13677_consen   16 RWLVTYADLMTLLLAFFVLLF   36 (58)
T ss_pred             cHHHHHHHHHHHHHHHHHHHH
Confidence            44 445566777777766654


No 10 
>cd00101 IlGF_like Insulin/insulin-like growth factor/relaxin family; insulin family of proteins. Members include a number of active peptides which are evolutionarily related including insulin, relaxin, prorelaxin, insulin-like growth factors I and II, mammalian Leydig cell-specific insulin-like peptide (gene INSL3), early placenta insulin-like peptide (ELIP; gene INSL4), insect prothoracicotropic hormone (bombyxin), locust insulin-related peptide (LIRP), molluscan insulin-related peptides 1 to 5 (MIP), and C. elegans insulin-like peptides. Typically, the active forms of these peptide hormones are composed of two chains (A and B) linked by two disulfide bonds; the arrangement of four cysteines is conserved in the "A" chain: Cys1 is linked by a disulfide bond to Cys3, Cys2 and Cys4 are linked by interchain disulfide bonds to cysteines in the "B" chain. This alignment contains both chains, plus the intervening linker region, arranged as found in the propeptide form. Propeptides are cleaved 
Probab=13.36  E-value=3.8e+02  Score=17.08  Aligned_cols=36  Identities=25%  Similarity=0.565  Sum_probs=22.1

Q ss_pred             HHHHHHHHHhhCC-CCCCcccCCCCcccCCCChhhhHhH
Q 029582           31 SVLQKAIDYACGA-GADCTPILQNGVCWNPNTVQDHCNY   68 (191)
Q Consensus        31 ~~Lq~~ldyACg~-g~DCs~I~~gGsCyspct~~~haSY   68 (191)
                      .+|-+++.++|+. +.. ..|.. -=|+.+|+..+=.+|
T Consensus         4 ~~Lv~~l~~vC~~~~~~-~giv~-eCC~~~Ct~~~L~~Y   40 (41)
T cd00101           4 RELVRALIFVCGDRGFY-RGIVD-ECCFRGCTLRELASY   40 (41)
T ss_pred             HHHHHHHHHhcCCCCCc-CCccc-ccCCCCCChHHHHhh
Confidence            4678899999986 112 11110 018899988776555


Done!