Query psy14833
Match_columns 63
No_of_seqs 101 out of 201
Neff 5.3
Searched_HMMs 46136
Date Fri Aug 16 21:58:56 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy14833.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/14833hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG3516|consensus 99.8 3.4E-20 7.4E-25 149.3 4.5 63 1-63 676-739 (1306)
2 PF07974 EGF_2: EGF-like domai 67.3 5.6 0.00012 19.7 1.9 19 44-62 8-26 (32)
3 PF07846 Metallothio_Cad: Meta 42.7 18 0.00039 16.8 1.2 18 42-59 1-20 (21)
4 PF00954 S_locus_glycop: S-loc 36.4 51 0.0011 19.6 2.9 40 18-60 59-103 (110)
5 PHA03099 epidermal growth fact 22.3 64 0.0014 21.4 1.6 26 37-62 40-74 (139)
6 smart00032 CCP Domain abundant 18.6 1.2E+02 0.0026 14.7 1.9 13 1-13 26-38 (57)
7 PF11142 DUF2917: Protein of u 17.1 58 0.0013 18.2 0.5 14 20-33 27-40 (63)
8 PF06204 CBM_X: Putative carbo 16.5 98 0.0021 17.6 1.4 25 10-34 39-63 (66)
9 PF13511 DUF4124: Domain of un 14.9 1.3E+02 0.0027 15.9 1.5 15 17-31 14-28 (60)
10 PF15031 DUF4528: Domain of un 14.4 67 0.0015 20.9 0.3 12 1-12 60-71 (126)
No 1
>KOG3516|consensus
Probab=99.80 E-value=3.4e-20 Score=149.25 Aligned_cols=63 Identities=49% Similarity=1.159 Sum_probs=58.7
Q ss_pred CceecccccCCC-CCCCceEeeecCCCCeEEecCCCCCCCCeecCCCCCCccCCCCceecCCCC
Q psy14833 1 MRDTCSESYPVE-NDFHPFAWWVSRNNHKMDYWGGSVFGTRKCECGILGTCIDPTKWCNCDAGQ 63 (63)
Q Consensus 1 iry~C~~s~~~~-~~~~~~~wW~~r~g~~~~yWgGa~~~~~~C~CG~~~~C~~~~~~CNCDa~~ 63 (63)
|.|.|++|.++. .++.|++||++|++++++||||++|+++||+||+.++|++|+++|||||+.
T Consensus 676 i~y~C~~sRllnt~~g~P~Swwigr~ne~~~yWGGs~Pg~qkC~Cgi~~nC~d~~~~CNCDa~~ 739 (1306)
T KOG3516|consen 676 IEYSCRESRLLNTPDGTPFSWWIGRSNEGHVYWGGSGPGLQKCECGLLGNCLDPQLYCNCDADE 739 (1306)
T ss_pred eeeeeccceeeeCCCCCeeEEEecccCCccceecCCCCccceeeccccccccCcceeeeccCCC
Confidence 689999999855 456899999999999999999999999999999999999999999999974
No 2
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=67.29 E-value=5.6 Score=19.75 Aligned_cols=19 Identities=47% Similarity=1.179 Sum_probs=16.2
Q ss_pred CCCCCCccCCCCceecCCC
Q psy14833 44 CGILGTCIDPTKWCNCDAG 62 (63)
Q Consensus 44 CG~~~~C~~~~~~CNCDa~ 62 (63)
|.-.|+|+.+...|.|++.
T Consensus 8 C~~~G~C~~~~g~C~C~~g 26 (32)
T PF07974_consen 8 CSGHGTCVSPCGRCVCDSG 26 (32)
T ss_pred cCCCCEEeCCCCEEECCCC
Confidence 6679999987899999875
No 3
>PF07846 Metallothio_Cad: Metallothionein family; InterPro: IPR012484 The sequence making up family 7 of the metallothionein superfamily are found repeated in metallothionein proteins expressed by two Tetrahymena species. Metallothioneins are low molecular mass, cysteine-rich metal-binding proteins that are thought to be involved in the regulation of levels of trace metals, and detoxification of these metals when present in excess []. Some of the metallothioneins found in this family (for example, Q8T6B3 from SWISSPROT) are known to be induced by cadmium and are thought to be involved in the cellular sequestration of toxic metal ions. The high proportion of cysteine residues allows the metal ions to be bound by the formation of clusters of metal-thiolate complexes []. Tetrahymena spp. metallothioneins differ from other eukaryotic metallothioneins mainly in the length of their sequences and in the cysteine-containing motifs they exhibit. ; GO: 0046870 cadmium ion binding
Probab=42.68 E-value=18 Score=16.80 Aligned_cols=18 Identities=44% Similarity=1.230 Sum_probs=12.1
Q ss_pred ecCCC--CCCccCCCCceec
Q psy14833 42 CECGI--LGTCIDPTKWCNC 59 (63)
Q Consensus 42 C~CG~--~~~C~~~~~~CNC 59 (63)
|-||. ...|.||+..|-|
T Consensus 1 CccG~naK~cC~DPnsG~~C 20 (21)
T PF07846_consen 1 CCCGVNAKPCCTDPNSGCCC 20 (21)
T ss_pred CccCCCCcccccCCCCcccc
Confidence 44665 3458898887766
No 4
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=36.43 E-value=51 Score=19.59 Aligned_cols=40 Identities=23% Similarity=0.551 Sum_probs=25.8
Q ss_pred eEeeecCCCCeEEecCCCCCCCCee----cCCCCCCccC-CCCceecC
Q psy14833 18 FAWWVSRNNHKMDYWGGSVFGTRKC----ECGILGTCID-PTKWCNCD 60 (63)
Q Consensus 18 ~~wW~~r~g~~~~yWgGa~~~~~~C----~CG~~~~C~~-~~~~CNCD 60 (63)
.--|.....+-..+|.. +. .+| .||..+.|.. ....|.|=
T Consensus 59 ~~~w~~~~~~W~~~~~~-p~--d~Cd~y~~CG~~g~C~~~~~~~C~Cl 103 (110)
T PF00954_consen 59 RYIWNESTQSWSVFWSA-PK--DQCDVYGFCGPNGICNSNNSPKCSCL 103 (110)
T ss_pred EEEEecCCCcEEEEEEe-cc--cCCCCccccCCccEeCCCCCCceECC
Confidence 33456667777777742 22 357 7999999953 33469984
No 5
>PHA03099 epidermal growth factor-like protein (EGF-like protein); Provisional
Probab=22.29 E-value=64 Score=21.40 Aligned_cols=26 Identities=27% Similarity=0.602 Sum_probs=18.5
Q ss_pred CCCCeecCCCCCCccC---------CCCceecCCC
Q psy14833 37 FGTRKCECGILGTCID---------PTKWCNCDAG 62 (63)
Q Consensus 37 ~~~~~C~CG~~~~C~~---------~~~~CNCDa~ 62 (63)
|..+.|..-.++-|++ ....|||+.+
T Consensus 40 ~~i~~Cp~ey~~YClHG~C~yI~dl~~~~CrC~~G 74 (139)
T PHA03099 40 PAIRLCGPEGDGYCLHGDCIHARDIDGMYCRCSHG 74 (139)
T ss_pred cccccCChhhCCEeECCEEEeeccCCCceeECCCC
Confidence 4567888777666654 5678999865
No 6
>smart00032 CCP Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR). The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. A missense mutation in seventh CCP domain causes deficiency of the b subunit of factor XIII.
Probab=18.60 E-value=1.2e+02 Score=14.68 Aligned_cols=13 Identities=15% Similarity=0.508 Sum_probs=7.8
Q ss_pred CceecccccCCCC
Q psy14833 1 MRDTCSESYPVEN 13 (63)
Q Consensus 1 iry~C~~s~~~~~ 13 (63)
|+|.|...+.+.+
T Consensus 26 ~~~~C~~Gy~l~g 38 (57)
T smart00032 26 VTYSCNPGYTLIG 38 (57)
T ss_pred EEEEcCCCCEEcC
Confidence 3567777665543
No 7
>PF11142 DUF2917: Protein of unknown function (DUF2917); InterPro: IPR021317 This bacterial family of proteins appears to be restricted to Proteobacteria.
Probab=17.09 E-value=58 Score=18.19 Aligned_cols=14 Identities=36% Similarity=0.952 Sum_probs=11.9
Q ss_pred eeecCCCCeEEecC
Q psy14833 20 WWVSRNNHKMDYWG 33 (63)
Q Consensus 20 wW~~r~g~~~~yWg 33 (63)
=|++++|...+||=
T Consensus 27 vWlT~~g~~~D~~L 40 (63)
T PF11142_consen 27 VWLTREGDPDDYWL 40 (63)
T ss_pred EEEECCCCCCCEEE
Confidence 48999999999983
No 8
>PF06204 CBM_X: Putative carbohydrate binding domain ; InterPro: IPR009342 This domain is conserved in enzymes that have carbohydrates as substrate, and may be a carbohydrate-binding domain.; PDB: 3ACT_B 2CQT_A 3QFY_B 3QFZ_A 2CQS_A 3QG0_B 3AFJ_A 3ACS_A 1V7V_A 1V7X_A ....
Probab=16.46 E-value=98 Score=17.57 Aligned_cols=25 Identities=16% Similarity=0.468 Sum_probs=16.9
Q ss_pred CCCCCCCceEeeecCCCCeEEecCC
Q psy14833 10 PVENDFHPFAWWVSRNNHKMDYWGG 34 (63)
Q Consensus 10 ~~~~~~~~~~wW~~r~g~~~~yWgG 34 (63)
|....|..|+|+......+.+=|-.
T Consensus 39 mvt~~G~GySw~~~~~~~~lTrWr~ 63 (66)
T PF06204_consen 39 MVTNSGSGYSWAKNSRDNRLTRWRN 63 (66)
T ss_dssp EEETTSBEEEEES-TTTSEEE--BT
T ss_pred EEcCCCceeecccccCcCccccccC
Confidence 4455677999999999988888853
No 9
>PF13511 DUF4124: Domain of unknown function (DUF4124)
Probab=14.88 E-value=1.3e+02 Score=15.86 Aligned_cols=15 Identities=20% Similarity=0.155 Sum_probs=11.5
Q ss_pred ceEeeecCCCCeEEe
Q psy14833 17 PFAWWVSRNNHKMDY 31 (63)
Q Consensus 17 ~~~wW~~r~g~~~~y 31 (63)
.+.=|+|.+|+.++.
T Consensus 14 ~vYk~~D~~G~v~ys 28 (60)
T PF13511_consen 14 EVYKWVDENGVVHYS 28 (60)
T ss_pred cEEEEECCCCCEEEC
Confidence 566799999987763
No 10
>PF15031 DUF4528: Domain of unknown function (DUF4528)
Probab=14.43 E-value=67 Score=20.93 Aligned_cols=12 Identities=17% Similarity=0.606 Sum_probs=9.9
Q ss_pred CceecccccCCC
Q psy14833 1 MRDTCSESYPVE 12 (63)
Q Consensus 1 iry~C~~s~~~~ 12 (63)
|||-|.+.+.++
T Consensus 60 iKYHCtkrp~qd 71 (126)
T PF15031_consen 60 IKYHCTKRPPQD 71 (126)
T ss_pred eeeeecCCChhh
Confidence 799999988754
Done!