Query         psy14833
Match_columns 63
No_of_seqs    101 out of 201
Neff          5.3 
Searched_HMMs 46136
Date          Fri Aug 16 21:58:56 2013
Command       hhsearch -i /work/01045/syshi/Psyhhblits/psy14833.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/14833hhsearch_cdd -cpu 12 -v 0 

 No Hit                             Prob E-value P-value  Score    SS Cols Query HMM  Template HMM
  1 KOG3516|consensus               99.8 3.4E-20 7.4E-25  149.3   4.5   63    1-63    676-739 (1306)
  2 PF07974 EGF_2:  EGF-like domai  67.3     5.6 0.00012   19.7   1.9   19   44-62      8-26  (32)
  3 PF07846 Metallothio_Cad:  Meta  42.7      18 0.00039   16.8   1.2   18   42-59      1-20  (21)
  4 PF00954 S_locus_glycop:  S-loc  36.4      51  0.0011   19.6   2.9   40   18-60     59-103 (110)
  5 PHA03099 epidermal growth fact  22.3      64  0.0014   21.4   1.6   26   37-62     40-74  (139)
  6 smart00032 CCP Domain abundant  18.6 1.2E+02  0.0026   14.7   1.9   13    1-13     26-38  (57)
  7 PF11142 DUF2917:  Protein of u  17.1      58  0.0013   18.2   0.5   14   20-33     27-40  (63)
  8 PF06204 CBM_X:  Putative carbo  16.5      98  0.0021   17.6   1.4   25   10-34     39-63  (66)
  9 PF13511 DUF4124:  Domain of un  14.9 1.3E+02  0.0027   15.9   1.5   15   17-31     14-28  (60)
 10 PF15031 DUF4528:  Domain of un  14.4      67  0.0015   20.9   0.3   12    1-12     60-71  (126)

No 1  
>KOG3516|consensus
Probab=99.80  E-value=3.4e-20  Score=149.25  Aligned_cols=63  Identities=49%  Similarity=1.159  Sum_probs=58.7

Q ss_pred             CceecccccCCC-CCCCceEeeecCCCCeEEecCCCCCCCCeecCCCCCCccCCCCceecCCCC
Q psy14833          1 MRDTCSESYPVE-NDFHPFAWWVSRNNHKMDYWGGSVFGTRKCECGILGTCIDPTKWCNCDAGQ   63 (63)
Q Consensus         1 iry~C~~s~~~~-~~~~~~~wW~~r~g~~~~yWgGa~~~~~~C~CG~~~~C~~~~~~CNCDa~~   63 (63)
                      |.|.|++|.++. .++.|++||++|++++++||||++|+++||+||+.++|++|+++|||||+.
T Consensus       676 i~y~C~~sRllnt~~g~P~Swwigr~ne~~~yWGGs~Pg~qkC~Cgi~~nC~d~~~~CNCDa~~  739 (1306)
T KOG3516|consen  676 IEYSCRESRLLNTPDGTPFSWWIGRSNEGHVYWGGSGPGLQKCECGLLGNCLDPQLYCNCDADE  739 (1306)
T ss_pred             eeeeeccceeeeCCCCCeeEEEecccCCccceecCCCCccceeeccccccccCcceeeeccCCC
Confidence            689999999855 456899999999999999999999999999999999999999999999974


No 2  
>PF07974 EGF_2:  EGF-like domain;  InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=67.29  E-value=5.6  Score=19.75  Aligned_cols=19  Identities=47%  Similarity=1.179  Sum_probs=16.2

Q ss_pred             CCCCCCccCCCCceecCCC
Q psy14833         44 CGILGTCIDPTKWCNCDAG   62 (63)
Q Consensus        44 CG~~~~C~~~~~~CNCDa~   62 (63)
                      |.-.|+|+.+...|.|++.
T Consensus         8 C~~~G~C~~~~g~C~C~~g   26 (32)
T PF07974_consen    8 CSGHGTCVSPCGRCVCDSG   26 (32)
T ss_pred             cCCCCEEeCCCCEEECCCC
Confidence            6679999987899999875


No 3  
>PF07846 Metallothio_Cad:  Metallothionein family;  InterPro: IPR012484 The sequence making up family 7 of the metallothionein superfamily are found repeated in metallothionein proteins expressed by two Tetrahymena species. Metallothioneins are low molecular mass, cysteine-rich metal-binding proteins that are thought to be involved in the regulation of levels of trace metals, and detoxification of these metals when present in excess []. Some of the metallothioneins found in this family (for example, Q8T6B3 from SWISSPROT) are known to be induced by cadmium and are thought to be involved in the cellular sequestration of toxic metal ions. The high proportion of cysteine residues allows the metal ions to be bound by the formation of clusters of metal-thiolate complexes []. Tetrahymena spp. metallothioneins differ from other eukaryotic metallothioneins mainly in the length of their sequences and in the cysteine-containing motifs they exhibit. ; GO: 0046870 cadmium ion binding
Probab=42.68  E-value=18  Score=16.80  Aligned_cols=18  Identities=44%  Similarity=1.230  Sum_probs=12.1

Q ss_pred             ecCCC--CCCccCCCCceec
Q psy14833         42 CECGI--LGTCIDPTKWCNC   59 (63)
Q Consensus        42 C~CG~--~~~C~~~~~~CNC   59 (63)
                      |-||.  ...|.||+..|-|
T Consensus         1 CccG~naK~cC~DPnsG~~C   20 (21)
T PF07846_consen    1 CCCGVNAKPCCTDPNSGCCC   20 (21)
T ss_pred             CccCCCCcccccCCCCcccc
Confidence            44665  3458898887766


No 4  
>PF00954 S_locus_glycop:  S-locus glycoprotein family;  InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=36.43  E-value=51  Score=19.59  Aligned_cols=40  Identities=23%  Similarity=0.551  Sum_probs=25.8

Q ss_pred             eEeeecCCCCeEEecCCCCCCCCee----cCCCCCCccC-CCCceecC
Q psy14833         18 FAWWVSRNNHKMDYWGGSVFGTRKC----ECGILGTCID-PTKWCNCD   60 (63)
Q Consensus        18 ~~wW~~r~g~~~~yWgGa~~~~~~C----~CG~~~~C~~-~~~~CNCD   60 (63)
                      .--|.....+-..+|.. +.  .+|    .||..+.|.. ....|.|=
T Consensus        59 ~~~w~~~~~~W~~~~~~-p~--d~Cd~y~~CG~~g~C~~~~~~~C~Cl  103 (110)
T PF00954_consen   59 RYIWNESTQSWSVFWSA-PK--DQCDVYGFCGPNGICNSNNSPKCSCL  103 (110)
T ss_pred             EEEEecCCCcEEEEEEe-cc--cCCCCccccCCccEeCCCCCCceECC
Confidence            33456667777777742 22  357    7999999953 33469984


No 5  
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=22.29  E-value=64  Score=21.40  Aligned_cols=26  Identities=27%  Similarity=0.602  Sum_probs=18.5

Q ss_pred             CCCCeecCCCCCCccC---------CCCceecCCC
Q psy14833         37 FGTRKCECGILGTCID---------PTKWCNCDAG   62 (63)
Q Consensus        37 ~~~~~C~CG~~~~C~~---------~~~~CNCDa~   62 (63)
                      |..+.|..-.++-|++         ....|||+.+
T Consensus        40 ~~i~~Cp~ey~~YClHG~C~yI~dl~~~~CrC~~G   74 (139)
T PHA03099         40 PAIRLCGPEGDGYCLHGDCIHARDIDGMYCRCSHG   74 (139)
T ss_pred             cccccCChhhCCEeECCEEEeeccCCCceeECCCC
Confidence            4567888777666654         5678999865


No 6  
>smart00032 CCP Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR). The complement control protein (CCP) modules (also known as short  consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino  acid residues and have been identified in several proteins of the  complement system. A missense mutation in seventh CCP domain causes  deficiency of the b subunit of factor XIII.
Probab=18.60  E-value=1.2e+02  Score=14.68  Aligned_cols=13  Identities=15%  Similarity=0.508  Sum_probs=7.8

Q ss_pred             CceecccccCCCC
Q psy14833          1 MRDTCSESYPVEN   13 (63)
Q Consensus         1 iry~C~~s~~~~~   13 (63)
                      |+|.|...+.+.+
T Consensus        26 ~~~~C~~Gy~l~g   38 (57)
T smart00032       26 VTYSCNPGYTLIG   38 (57)
T ss_pred             EEEEcCCCCEEcC
Confidence            3567777665543


No 7  
>PF11142 DUF2917:  Protein of unknown function (DUF2917);  InterPro: IPR021317  This bacterial family of proteins appears to be restricted to Proteobacteria. 
Probab=17.09  E-value=58  Score=18.19  Aligned_cols=14  Identities=36%  Similarity=0.952  Sum_probs=11.9

Q ss_pred             eeecCCCCeEEecC
Q psy14833         20 WWVSRNNHKMDYWG   33 (63)
Q Consensus        20 wW~~r~g~~~~yWg   33 (63)
                      =|++++|...+||=
T Consensus        27 vWlT~~g~~~D~~L   40 (63)
T PF11142_consen   27 VWLTREGDPDDYWL   40 (63)
T ss_pred             EEEECCCCCCCEEE
Confidence            48999999999983


No 8  
>PF06204 CBM_X:  Putative carbohydrate binding domain  ;  InterPro: IPR009342 This domain is conserved in enzymes that have carbohydrates as substrate, and may be a carbohydrate-binding domain.; PDB: 3ACT_B 2CQT_A 3QFY_B 3QFZ_A 2CQS_A 3QG0_B 3AFJ_A 3ACS_A 1V7V_A 1V7X_A ....
Probab=16.46  E-value=98  Score=17.57  Aligned_cols=25  Identities=16%  Similarity=0.468  Sum_probs=16.9

Q ss_pred             CCCCCCCceEeeecCCCCeEEecCC
Q psy14833         10 PVENDFHPFAWWVSRNNHKMDYWGG   34 (63)
Q Consensus        10 ~~~~~~~~~~wW~~r~g~~~~yWgG   34 (63)
                      |....|..|+|+......+.+=|-.
T Consensus        39 mvt~~G~GySw~~~~~~~~lTrWr~   63 (66)
T PF06204_consen   39 MVTNSGSGYSWAKNSRDNRLTRWRN   63 (66)
T ss_dssp             EEETTSBEEEEES-TTTSEEE--BT
T ss_pred             EEcCCCceeecccccCcCccccccC
Confidence            4455677999999999988888853


No 9  
>PF13511 DUF4124:  Domain of unknown function (DUF4124)
Probab=14.88  E-value=1.3e+02  Score=15.86  Aligned_cols=15  Identities=20%  Similarity=0.155  Sum_probs=11.5

Q ss_pred             ceEeeecCCCCeEEe
Q psy14833         17 PFAWWVSRNNHKMDY   31 (63)
Q Consensus        17 ~~~wW~~r~g~~~~y   31 (63)
                      .+.=|+|.+|+.++.
T Consensus        14 ~vYk~~D~~G~v~ys   28 (60)
T PF13511_consen   14 EVYKWVDENGVVHYS   28 (60)
T ss_pred             cEEEEECCCCCEEEC
Confidence            566799999987763


No 10 
>PF15031 DUF4528:  Domain of unknown function (DUF4528)
Probab=14.43  E-value=67  Score=20.93  Aligned_cols=12  Identities=17%  Similarity=0.606  Sum_probs=9.9

Q ss_pred             CceecccccCCC
Q psy14833          1 MRDTCSESYPVE   12 (63)
Q Consensus         1 iry~C~~s~~~~   12 (63)
                      |||-|.+.+.++
T Consensus        60 iKYHCtkrp~qd   71 (126)
T PF15031_consen   60 IKYHCTKRPPQD   71 (126)
T ss_pred             eeeeecCCChhh
Confidence            799999988754


Done!