Query psy10482
Match_columns 84
No_of_seqs 132 out of 1024
Neff 7.2
Searched_HMMs 46136
Date Fri Aug 16 19:19:05 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy10482.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/10482hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF01607 CBM_14: Chitin bindin 99.5 8.8E-15 1.9E-19 79.2 3.1 45 28-75 8-53 (53)
2 smart00494 ChtBD2 Chitin-bindi 99.5 3.2E-14 6.9E-19 77.3 5.2 53 19-76 2-55 (56)
3 PF03427 CBM_19: Carbohydrate 91.7 0.23 5E-06 28.1 2.7 21 36-56 35-55 (61)
4 PRK10040 hypothetical protein; 77.0 3.7 8E-05 22.3 2.6 28 13-45 14-42 (52)
5 TIGR02659 TTQ_MADH_Lt methylam 73.4 1.7 3.7E-05 29.6 1.0 41 30-71 73-120 (186)
6 PF08475 Baculo_VP91_N: Viral 69.3 7.7 0.00017 26.4 3.4 33 29-61 107-139 (183)
7 PF09064 Tme5_EGF_like: Thromb 60.8 8.3 0.00018 19.3 1.7 21 41-61 12-32 (34)
8 PF12662 cEGF: Complement Clr- 59.2 8.7 0.00019 17.6 1.5 16 50-68 5-20 (24)
9 PF13511 DUF4124: Domain of un 50.0 7.5 0.00016 20.9 0.6 25 7-31 5-29 (60)
10 cd00064 FU Furin-like repeats. 44.9 20 0.00043 18.4 1.8 15 50-67 34-48 (49)
11 PF07645 EGF_CA: Calcium-bindi 43.7 17 0.00038 18.2 1.4 29 33-61 10-39 (42)
12 PF02975 Me-amine-dh_L: Methyl 42.1 2.4 5.1E-05 27.1 -2.5 41 30-71 11-58 (123)
13 PF00220 Hormone_4: Neurohypop 40.1 14 0.00031 13.1 0.5 6 49-54 4-9 (9)
14 PF08779 SARS_X4: SARS coronav 37.5 26 0.00056 20.5 1.5 19 38-56 5-25 (84)
15 PF13948 DUF4215: Domain of un 34.2 40 0.00087 17.5 1.9 8 50-57 20-27 (47)
16 PF07771 TSGP1: Tick salivary 32.3 33 0.00071 21.7 1.6 24 15-43 19-45 (120)
17 cd00109 KU BPTI/Kunitz family 30.0 48 0.001 17.2 1.8 23 53-78 17-39 (54)
18 PF06387 Calcyon: D1 dopamine 28.9 36 0.00079 23.2 1.4 17 44-60 106-122 (186)
19 PF07403 DUF1505: Protein of u 27.7 62 0.0013 20.5 2.2 10 29-38 30-39 (114)
20 PF07127 Nodulin_late: Late no 26.9 98 0.0021 16.4 2.7 20 3-22 13-32 (54)
21 smart00197 SAA Serum amyloid A 26.3 15 0.00033 22.7 -0.7 25 30-54 25-49 (103)
22 PHA02714 CD-30-like protein; P 24.8 39 0.00084 21.0 0.9 18 47-67 20-37 (110)
23 PF01826 TIL: Trypsin Inhibito 24.6 38 0.00083 17.7 0.8 16 41-60 31-46 (55)
24 PF12191 stn_TNFRSF12A: Tumour 23.9 41 0.00088 21.7 0.9 16 49-67 34-49 (129)
25 PF09717 CPW_WPC: Plasmodium f 21.5 73 0.0016 17.2 1.5 29 47-79 5-35 (60)
26 PHA02067 hypothetical protein 21.1 75 0.0016 22.3 1.8 12 12-23 15-26 (221)
27 KOG2397|consensus 20.6 1.1E+02 0.0024 23.9 2.8 22 40-61 65-88 (480)
28 TIGR01492 CPW_WPC Plasmodium f 20.4 1E+02 0.0022 16.9 2.0 11 49-59 8-18 (62)
29 smart00131 KU BPTI/Kunitz fami 20.2 88 0.0019 16.1 1.6 21 54-77 18-38 (53)
No 1
>PF01607 CBM_14: Chitin binding Peritrophin-A domain; InterPro: IPR002557 This entry represents a chitin binding domain []. It is found in (amongst others) the Peritrophin-A chitin binding proteins, particularly the peritrophic matrix proteins of insects and animal chitinases [, , ]. Copies of the domain are also found in some baculoviruses. It is an extracellular domain that contains six conserved cysteines that probably form three disulphide bridges. Chitin binding has been demonstrated for a protein containing only two of these domains [].; GO: 0008061 chitin binding, 0006030 chitin metabolic process, 0005576 extracellular region; PDB: 1DQC_A.
Probab=99.52 E-value=8.8e-15 Score=79.17 Aligned_cols=45 Identities=36% Similarity=0.897 Sum_probs=35.5
Q ss_pred EccCccCCcccEEcCCCceeEeeCCCCCeEeccCCCccCCCCCCC-ccc
Q psy10482 28 TYPHPQLCDQFYKCTNGTLTLEQCENGLLYDGNGNVHNHCNYYWG-VDC 75 (84)
Q Consensus 28 ~~~~p~~C~~y~~C~~G~~~~~~Cp~Gl~Fd~~~~~~~~C~~~~~-v~C 75 (84)
+++++.+|++||+|.+|.++++.||.|++||+++ +.|++++. +.|
T Consensus 8 ~~~~~~~C~~Y~~C~~g~~~~~~C~~g~~fd~~~---~~C~~~~~~~~C 53 (53)
T PF01607_consen 8 FYPHPDDCRKYYQCVNGQAVEQRCPEGLYFDPSS---QRCVPPSNVVQC 53 (53)
T ss_dssp EE--SS-SSEEEEEETTEEEEEE-TTS-EE-TTT---SSEE-TTT-TT-
T ss_pred eEeCCCCCCEEEEeeCCcEECCCCcCCCEECcCc---CEEcCCccCCCC
Confidence 7899999999999999999999999999999999 99999998 666
No 2
>smart00494 ChtBD2 Chitin-binding domain type 2.
Probab=99.52 E-value=3.2e-14 Score=77.30 Aligned_cols=53 Identities=38% Similarity=0.968 Sum_probs=48.1
Q ss_pred CCCCCC-CCeEccCccCCcccEEcCCCceeEeeCCCCCeEeccCCCccCCCCCCCcccC
Q psy10482 19 GCPEPY-GVQTYPHPQLCDQFYKCTNGTLTLEQCENGLLYDGNGNVHNHCNYYWGVDCG 76 (84)
Q Consensus 19 ~C~~~~-g~~~~~~p~~C~~y~~C~~G~~~~~~Cp~Gl~Fd~~~~~~~~C~~~~~v~C~ 76 (84)
.|+... | +++||.+|++||+|.+|.++.+.||.|++||..+ +.|++++.++|.
T Consensus 2 ~C~~~~~g--~~~~~~~C~~y~~C~~~~~~~~~C~~g~~f~~~~---~~C~~~~~~~C~ 55 (56)
T smart00494 2 ECPGRGDG--LYPHPTDCSKYYQCSNGRPIVGSCPAGLVFDPAT---QTCDWPDNVGCG 55 (56)
T ss_pred cCCCCCCc--cccCcccCCeeEEcCCCcEEeccCcCCCeECCCC---CCcCCcccCCCC
Confidence 466642 4 8999999999999999999999999999999999 999999999885
No 3
>PF03427 CBM_19: Carbohydrate binding domain (family 19); InterPro: IPR005089 A carbohydrate-binding module (CBM) is defined as a contiguous amino acid sequence within a carbohydrate-active enzyme with a discreet fold having carbohydrate-binding activity. A few exceptions are CBMs in cellulosomal scaffolding proteins and rare instances of independent putative CBMs. The requirement of CBMs existing as modules within larger enzymes sets this class of carbohydrate-binding protein apart from other non-catalytic sugar binding proteins such as lectins and sugar transport proteins. CBMs were previously classified as cellulose-binding domains (CBDs) based on the initial discovery of several modules that bound cellulose [, ]. However, additional modules in carbohydrate-active enzymes are continually being found that bind carbohydrates other than cellulose yet otherwise meet the CBM criteria, hence the need to reclassify these polypeptides using more inclusive terminology. Previous classification of cellulose-binding domains were based on amino acid similarity. Groupings of CBDs were called "Types" and numbered with roman numerals (e.g. Type I or Type II CBDs). In keeping with the glycoside hydrolase classification, these groupings are now called families and numbered with Arabic numerals. Families 1 to 13 are the same as Types I to XIII. For a detailed review on the structure and binding modes of CBMs see []. This entry represents CBM19 from CAZY which consists of 60-70 residues with chitin-binding function.; GO: 0004568 chitinase activity, 0006032 chitin catabolic process
Probab=91.72 E-value=0.23 Score=28.12 Aligned_cols=21 Identities=29% Similarity=0.656 Sum_probs=19.1
Q ss_pred cccEEcCCCceeEeeCCCCCe
Q psy10482 36 DQFYKCTNGTLTLEQCENGLL 56 (84)
Q Consensus 36 ~~y~~C~~G~~~~~~Cp~Gl~ 56 (84)
.+|-.|.+|.++...||.|+.
T Consensus 35 G~faiC~~G~Wv~~~C~~Gt~ 55 (61)
T PF03427_consen 35 GSFAICDHGTWVITPCPAGTT 55 (61)
T ss_pred CCEEEcCCCcEEEeeCCCCCE
Confidence 369999999999999999985
No 4
>PRK10040 hypothetical protein; Provisional
Probab=77.03 E-value=3.7 Score=22.30 Aligned_cols=28 Identities=18% Similarity=0.420 Sum_probs=19.9
Q ss_pred ccCCCCCCCCCCCCeEccCccCC-cccEEcCCCc
Q psy10482 13 LLQAPSGCPEPYGVQTYPHPQLC-DQFYKCTNGT 45 (84)
Q Consensus 13 ~~~~~~~C~~~~g~~~~~~p~~C-~~y~~C~~G~ 45 (84)
....+..|+++.|. +. .| ..++.|.+|.
T Consensus 14 a~a~n~PCSG~KGG--is---~C~g~~FvCnDGS 42 (52)
T PRK10040 14 ADAGNKPCSGKKGG--IS---HCTGGKFVCNDGS 42 (52)
T ss_pred HHhcCCCCCCCCCC--cc---cccCCEEEeCCCc
Confidence 45678899998762 44 45 4588999885
No 5
>TIGR02659 TTQ_MADH_Lt methylamine dehydrogenase light chain. This family consists of the light chain of methylamine dehydrogenase light chain, a periplasmic enzyme. This subunit contains a tryptophan tryptophylquinone (TTQ) prothetic group derived from Trp-114 and Trp-165 of the precursor, numbered according to the sequence from Paracoccus denitrificans. The enzyme forms a complex with the type I blue copper protein amicyanin and cytochrome. Electron transfer procedes from TQQ to the copper and then to the heme group of the cytochrome.
Probab=73.43 E-value=1.7 Score=29.58 Aligned_cols=41 Identities=22% Similarity=0.400 Sum_probs=32.2
Q ss_pred cCccCCcccEEcC-CCc------eeEeeCCCCCeEeccCCCccCCCCCC
Q psy10482 30 PHPQLCDQFYKCT-NGT------LTLEQCENGLLYDGNGNVHNHCNYYW 71 (84)
Q Consensus 30 ~~p~~C~~y~~C~-~G~------~~~~~Cp~Gl~Fd~~~~~~~~C~~~~ 71 (84)
.||..|+.|-+|. +|. -..-+||+|..-.+.+ |.++|.-|.
T Consensus 73 ~Dp~sCdYWRyCaiDG~LCsCCGGs~tsCPPGTe~Sp~s-WiGTC~NP~ 120 (186)
T TIGR02659 73 NDIQACDYWRHCSIDGNICDCSGGSLTSCPPGTKLATSS-WVASCYNPT 120 (186)
T ss_pred CCCcccchhhhcccCCceEcCcCCCCcCCCCCCCCCCce-eEEeccCCC
Confidence 6899999999996 664 4577899999888877 556776664
No 6
>PF08475 Baculo_VP91_N: Viral capsid protein 91 N-terminal; InterPro: IPR013682 This domain is found in Baculoviridae including the nucleopolyhedrovirus at the N terminus of the viral capsid protein 91 (VP91) [].
Probab=69.25 E-value=7.7 Score=26.45 Aligned_cols=33 Identities=24% Similarity=0.394 Sum_probs=28.8
Q ss_pred ccCccCCcccEEcCCCceeEeeCCCCCeEeccC
Q psy10482 29 YPHPQLCDQFYKCTNGTLTLEQCENGLLYDGNG 61 (84)
Q Consensus 29 ~~~p~~C~~y~~C~~G~~~~~~Cp~Gl~Fd~~~ 61 (84)
-+|+.+=++|..-.+-....+.||.+..||...
T Consensus 107 ~~~~~d~~kF~~~~DDGwv~~~CP~~~~FD~~~ 139 (183)
T PF08475_consen 107 RPHPNDPTKFIARGDDGWVDMDCPENERFDGNK 139 (183)
T ss_pred ecCCCCCceEEEEecCCcEEEeCCCcccccccc
Confidence 478888899988776669999999999999986
No 7
>PF09064 Tme5_EGF_like: Thrombomodulin like fifth domain, EGF-like; InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=60.83 E-value=8.3 Score=19.28 Aligned_cols=21 Identities=19% Similarity=0.444 Sum_probs=14.1
Q ss_pred cCCCceeEeeCCCCCeEeccC
Q psy10482 41 CTNGTLTLEQCENGLLYDGNG 61 (84)
Q Consensus 41 C~~G~~~~~~Cp~Gl~Fd~~~ 61 (84)
|.......=.||+|++.|...
T Consensus 12 CDpn~~~~C~CPeGyIlde~~ 32 (34)
T PF09064_consen 12 CDPNSPGQCFCPEGYILDEGS 32 (34)
T ss_pred cCCCCCCceeCCCceEecCCc
Confidence 444444455799999988764
No 8
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=59.17 E-value=8.7 Score=17.60 Aligned_cols=16 Identities=25% Similarity=0.740 Sum_probs=12.5
Q ss_pred eCCCCCeEeccCCCccCCC
Q psy10482 50 QCENGLLYDGNGNVHNHCN 68 (84)
Q Consensus 50 ~Cp~Gl~Fd~~~~~~~~C~ 68 (84)
.|++|+..++.. ..|.
T Consensus 5 ~C~~Gy~l~~d~---~~C~ 20 (24)
T PF12662_consen 5 SCPPGYQLSPDG---RSCE 20 (24)
T ss_pred eCCCCCcCCCCC---Cccc
Confidence 588888888877 7774
No 9
>PF13511 DUF4124: Domain of unknown function (DUF4124)
Probab=50.03 E-value=7.5 Score=20.86 Aligned_cols=25 Identities=16% Similarity=0.171 Sum_probs=15.2
Q ss_pred HHhhhcccCCCCCCCCCCCCeEccC
Q psy10482 7 VVLAQPLLQAPSGCPEPYGVQTYPH 31 (84)
Q Consensus 7 ~~~~~~~~~~~~~C~~~~g~~~~~~ 31 (84)
+|.+.+....-..|...+|.+.|.|
T Consensus 5 ~l~~~a~aa~vYk~~D~~G~v~ysd 29 (60)
T PF13511_consen 5 LLAASAAAAEVYKWVDENGVVHYSD 29 (60)
T ss_pred HHhHHHhhccEEEEECCCCCEEECc
Confidence 3334444456778887778656654
No 10
>cd00064 FU Furin-like repeats. Cysteine rich region. Exact function of the domain is not known. Furin is a serine-kinase dependent proprotein processor. Other members of this family include endoproteases and cell surface receptors.
Probab=44.90 E-value=20 Score=18.36 Aligned_cols=15 Identities=33% Similarity=0.838 Sum_probs=11.9
Q ss_pred eCCCCCeEeccCCCccCC
Q psy10482 50 QCENGLLYDGNGNVHNHC 67 (84)
Q Consensus 50 ~Cp~Gl~Fd~~~~~~~~C 67 (84)
.||+|.+++... ..|
T Consensus 34 ~C~~~~~~~~~~---~~C 48 (49)
T cd00064 34 ECPEGTYADTEG---GVC 48 (49)
T ss_pred cCCCCceecCcC---CcC
Confidence 799999998666 555
No 11
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=43.68 E-value=17 Score=18.17 Aligned_cols=29 Identities=21% Similarity=0.379 Sum_probs=20.7
Q ss_pred cCCcccEEcCCC-ceeEeeCCCCCeEeccC
Q psy10482 33 QLCDQFYKCTNG-TLTLEQCENGLLYDGNG 61 (84)
Q Consensus 33 ~~C~~y~~C~~G-~~~~~~Cp~Gl~Fd~~~ 61 (84)
..|.....|.|- ..+.=.|++|+..+...
T Consensus 10 ~~C~~~~~C~N~~Gsy~C~C~~Gy~~~~~~ 39 (42)
T PF07645_consen 10 HNCPENGTCVNTEGSYSCSCPPGYELNDDG 39 (42)
T ss_dssp SSSSTTSEEEEETTEEEEEESTTEEECTTS
T ss_pred CcCCCCCEEEcCCCCEEeeCCCCcEECCCC
Confidence 467667778764 35666799999977765
No 12
>PF02975 Me-amine-dh_L: Methylamine dehydrogenase, L chain; InterPro: IPR013504 Methylamine dehydrogenase (1.4.99.3 from EC) is a periplasmic quinoprotein found in several methylotrophic bacteria []. It is induced when grown on methylamine as a carbon source MADH and catalyses the oxidative deamination of amines to their corresponding aldehydes. The redox cofactor of this enzyme is tryptophan tryptophylquinone (TTQ). Electrons derived from the oxidation of methylamine are passed to an electron acceptor, which is usually the blue-copper protein amicyanin (IPR002386 from INTERPRO). RCH2NH2 + H2O + acceptor = RCHO + NH3 + reduced acceptor MADH is a hetero-tetramer, comprised of two heavy subunits and two light subunits. The light subunit forms two antiparallel beta sheets, and contains the active site of this enzyme which is accessible via a hydrophobic channel between the heavy and light subunits. The redox cofactor TTQ is formed from two posttranlationally modified tryptophan residues within this subunit [].; GO: 0016638 oxidoreductase activity, acting on the CH-NH2 group of donors, 0009308 amine metabolic process, 0055114 oxidation-reduction process, 0042597 periplasmic space; PDB: 1MAE_L 1MAF_L 3C75_M 2MAD_L 2OJY_H 2IUP_H 2HJ4_D 2AGY_H 2AGX_D 2HXC_D ....
Probab=42.06 E-value=2.4 Score=27.06 Aligned_cols=41 Identities=20% Similarity=0.343 Sum_probs=27.9
Q ss_pred cCccCCcccEEcC-CCc------eeEeeCCCCCeEeccCCCccCCCCCC
Q psy10482 30 PHPQLCDQFYKCT-NGT------LTLEQCENGLLYDGNGNVHNHCNYYW 71 (84)
Q Consensus 30 ~~p~~C~~y~~C~-~G~------~~~~~Cp~Gl~Fd~~~~~~~~C~~~~ 71 (84)
.||..|+.|-+|. +|. ...-+||+|..-.+.+ |.++|.-|.
T Consensus 11 ~D~~sCdYWRyCaiDG~LCsCcGGs~tsCPPGte~Sp~s-WvGtC~NP~ 58 (123)
T PF02975_consen 11 GDPTSCDYWRYCAIDGFLCSCCGGSVTSCPPGTEPSPVS-WVGTCRNPA 58 (123)
T ss_dssp SHTTSTTBGGGTT-EEEBGGGGTSBTTCB-TT-EEESCE-EEEEEEETT
T ss_pred CCcccccceeeecccCcEEcccCCCcccCCCCCccCcce-eEEEccCCC
Confidence 6788999988886 442 4567899999888876 445675554
No 13
>PF00220 Hormone_4: Neurohypophysial hormones, N-terminal Domain; InterPro: IPR022423 Oxytocin (or ocytocin) and vasopressin [] are small (nine amino acid residues), structurally and functionally related neurohypophysial peptide hormones. Oxytocin causes contraction of the smooth muscle of the uterus and of the mammary gland while vasopressin has a direct antidiuretic action on the kidney and also causes vasoconstriction of the peripheral vessels. Like the majority of active peptides, both hormones are synthesized as larger protein precursors that are enzymatically converted to their mature forms. Peptides belonging to this family are also found in birds, fish, reptiles and amphibians (mesotocin, isotocin, valitocin, glumitocin, aspargtocin, vasotocin, seritocin, asvatocin, phasvatocin), in worms (annetocin), octopi (cephalotocin), locust (locupressin or neuropeptide F1/F2) and in molluscs (conopressins G and S) []. The pattern developed to detect this category of peptides spans their entire sequence and includes four invariant amino acid residues. .; GO: 0005185 neurohypophyseal hormone activity, 0005576 extracellular region
Probab=40.09 E-value=14 Score=13.12 Aligned_cols=6 Identities=33% Similarity=0.883 Sum_probs=3.0
Q ss_pred eeCCCC
Q psy10482 49 EQCENG 54 (84)
Q Consensus 49 ~~Cp~G 54 (84)
..||-|
T Consensus 4 ~nCP~G 9 (9)
T PF00220_consen 4 RNCPIG 9 (9)
T ss_pred ccCCCC
Confidence 356544
No 14
>PF08779 SARS_X4: SARS coronavirus X4 like; InterPro: IPR014888 The structure of the coronavirus X4 protein (also known as 7a and U122) shows similarities to the immunoglobulin like fold and suggests a binding activity to integrin I domains []. ; PDB: 1XAK_A 1YO4_A.
Probab=37.45 E-value=26 Score=20.53 Aligned_cols=19 Identities=32% Similarity=0.749 Sum_probs=12.7
Q ss_pred cEEcCCCcee--EeeCCCCCe
Q psy10482 38 FYKCTNGTLT--LEQCENGLL 56 (84)
Q Consensus 38 y~~C~~G~~~--~~~Cp~Gl~ 56 (84)
|..|..|..+ ...||+|.+
T Consensus 5 yqecvrgttvllkepc~~~ty 25 (84)
T PF08779_consen 5 YQECVRGTTVLLKEPCPSGTY 25 (84)
T ss_dssp EEEEETTSEEEEE-SSTT-EE
T ss_pred HHhHhcCceEEecCCCCCCCc
Confidence 6679998744 477999865
No 15
>PF13948 DUF4215: Domain of unknown function (DUF4215)
Probab=34.20 E-value=40 Score=17.54 Aligned_cols=8 Identities=25% Similarity=0.920 Sum_probs=4.1
Q ss_pred eCCCCCeE
Q psy10482 50 QCENGLLY 57 (84)
Q Consensus 50 ~Cp~Gl~F 57 (84)
.|.+|.+-
T Consensus 20 ~CGDgii~ 27 (47)
T PF13948_consen 20 ICGDGIIV 27 (47)
T ss_pred ccCCCeEE
Confidence 45555544
No 16
>PF07771 TSGP1: Tick salivary peptide group 1; InterPro: IPR011694 This entry contains a group of peptides derived from a salivary gland cDNA library of the tick Ixodes scapularis (Black-legged tick) []. Also present are peptides from a related tick species, Ixodes ricinus (Sheep tick). They are characterised by a putative signal peptide, indicative of secretion, and conserved cysteine residues.
Probab=32.33 E-value=33 Score=21.73 Aligned_cols=24 Identities=29% Similarity=0.690 Sum_probs=13.0
Q ss_pred CCCCCCCCCCCCeEccCc---cCCcccEEcCC
Q psy10482 15 QAPSGCPEPYGVQTYPHP---QLCDQFYKCTN 43 (84)
Q Consensus 15 ~~~~~C~~~~g~~~~~~p---~~C~~y~~C~~ 43 (84)
.....||.... -+.. ..|.-| |.+
T Consensus 19 ~~~~~Cp~~~r---~~~~~~~~~C~YY--C~~ 45 (120)
T PF07771_consen 19 AAAHGCPDKTR---PASDTNREGCNYY--CWN 45 (120)
T ss_pred hhhhcCCCCCc---CCCCCCCCCcEEE--cCC
Confidence 34457888643 1222 347755 874
No 17
>cd00109 KU BPTI/Kunitz family of serine protease inhibitors; Structure is a disulfide rich alpha+beta fold. BPTI (bovine pancreatic trypsin inhibitor) is an extensively studied model structure.
Probab=30.00 E-value=48 Score=17.18 Aligned_cols=23 Identities=22% Similarity=0.508 Sum_probs=14.5
Q ss_pred CCCeEeccCCCccCCCCCCCcccCCC
Q psy10482 53 NGLLYDGNGNVHNHCNYYWGVDCGNR 78 (84)
Q Consensus 53 ~Gl~Fd~~~~~~~~C~~~~~v~C~~r 78 (84)
.+.+||+++ +.|..-....|.+.
T Consensus 17 ~~~~yd~~~---~~C~~f~~~gc~~~ 39 (54)
T cd00109 17 PRYYYDATT---KQCEPFTYGGCGGN 39 (54)
T ss_pred cEEEEeCCC---CccceeECCCccCC
Confidence 467788888 77865444456543
No 18
>PF06387 Calcyon: D1 dopamine receptor-interacting protein (calcyon); InterPro: IPR009431 This family consists of several D1 dopamine receptor-interacting (calcyon) proteins. D1/D5 dopamine receptors in the basal ganglia, hippocampus, and cerebral cortex modulate motor, reward, and cognitive behaviour. D1-like dopamine receptors likely modulate neocortical and hippocampal neuronal excitability and synaptic function via Ca2+ as well as cAMP-dependent signalling []. Defective calcyon proteins have been implicated in both attention-deficit/hyperactivity disorder (ADHD) [] and schizophrenia.; GO: 0050780 dopamine receptor binding, 0007212 dopamine receptor signaling pathway, 0016021 integral to membrane
Probab=28.93 E-value=36 Score=23.18 Aligned_cols=17 Identities=12% Similarity=0.368 Sum_probs=13.0
Q ss_pred CceeEeeCCCCCeEecc
Q psy10482 44 GTLTLEQCENGLLYDGN 60 (84)
Q Consensus 44 G~~~~~~Cp~Gl~Fd~~ 60 (84)
...+.+.||+|+++...
T Consensus 106 a~wYDqsCPdGFv~khk 122 (186)
T PF06387_consen 106 AIWYDQSCPDGFVLKHK 122 (186)
T ss_pred eeeecccCCCcceeecc
Confidence 34667799999999743
No 19
>PF07403 DUF1505: Protein of unknown function (DUF1505); InterPro: IPR009981 This family consists of several uncharacterised Caenorhabditis elegans proteins of around 115 resides in length. Members of this family contain 6 highly conserved cysteine residues. The function of this family is unknown.
Probab=27.75 E-value=62 Score=20.47 Aligned_cols=10 Identities=10% Similarity=0.501 Sum_probs=5.5
Q ss_pred ccCccCCccc
Q psy10482 29 YPHPQLCDQF 38 (84)
Q Consensus 29 ~~~p~~C~~y 38 (84)
+.+..+|..+
T Consensus 30 ~~~~~~C~~~ 39 (114)
T PF07403_consen 30 WSSSKNCNDS 39 (114)
T ss_pred eecCCccccc
Confidence 4556666443
No 20
>PF07127 Nodulin_late: Late nodulin protein; InterPro: IPR009810 This family consists of several plant specific late nodulin sequences which are homologous to the Pisum sativum (Garden pea) ENOD3 protein. ENOD3 is expressed in the late stages of root nodule formation and contains two pairs of cysteine residues toward the proteins C terminus which may be involved in metal-binding [].; GO: 0046872 metal ion binding, 0009878 nodule morphogenesis
Probab=26.92 E-value=98 Score=16.39 Aligned_cols=20 Identities=20% Similarity=0.267 Sum_probs=7.7
Q ss_pred hhhHHHhhhcccCCCCCCCC
Q psy10482 3 SLFSVVLAQPLLQAPSGCPE 22 (84)
Q Consensus 3 ~~~~~~~~~~~~~~~~~C~~ 22 (84)
+.+++++.+........|..
T Consensus 13 iflslflv~~~~~~~~~C~~ 32 (54)
T PF07127_consen 13 IFLSLFLVVTNVDAIIPCKT 32 (54)
T ss_pred HHHHHHHhhcccCCCcccCc
Confidence 33444433332323345533
No 21
>smart00197 SAA Serum amyloid A proteins. Serum amyloid A proteins are induced during the acute-phase response. Secondary amyloidosis is characterised by the extracellular accumulation in tissues of SAA proteins. SAA proteins are apolipoproteins.
Probab=26.34 E-value=15 Score=22.75 Aligned_cols=25 Identities=12% Similarity=0.267 Sum_probs=19.1
Q ss_pred cCccCCcccEEcCCCceeEeeCCCC
Q psy10482 30 PHPQLCDQFYKCTNGTLTLEQCENG 54 (84)
Q Consensus 30 ~~p~~C~~y~~C~~G~~~~~~Cp~G 54 (84)
.+-.+|++|+.|..+.--.+.+|.|
T Consensus 25 AN~~nsDKYFharGNydAaqRGpgG 49 (103)
T smart00197 25 ANHQNADKYFHARGNYDAAQRGPGG 49 (103)
T ss_pred hhhcccchhhhhhcchHHHhcCCcc
Confidence 4457899999998876666677765
No 22
>PHA02714 CD-30-like protein; Provisional
Probab=24.79 E-value=39 Score=20.99 Aligned_cols=18 Identities=11% Similarity=0.346 Sum_probs=13.7
Q ss_pred eEeeCCCCCeEeccCCCccCC
Q psy10482 47 TLEQCENGLLYDGNGNVHNHC 67 (84)
Q Consensus 47 ~~~~Cp~Gl~Fd~~~~~~~~C 67 (84)
+..+|+.++++|++. +.|
T Consensus 20 f~kqC~~dYY~Dpe~---g~C 37 (110)
T PHA02714 20 FGKTCPKDYYLEPED---GLC 37 (110)
T ss_pred hhccCCCcccccCCC---Cce
Confidence 446899999999887 555
No 23
>PF01826 TIL: Trypsin Inhibitor like cysteine rich domain; InterPro: IPR002919 This domain is found in proteinase inhibitors as well as in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9. This inhibitor domain belongs to MEROPS inhibitor family I8 (clan IA). Proteins containing this domain inhibit peptidases belonging to families S1 (IPR001254 from INTERPRO), S8 (IPR000209 from INTERPRO), and M4 (IPR001570 from INTERPRO) [] and are restricted to the chordata, nematoda, arthropoda and echinodermata. Examples of proteins containing this domain are: chymotrypsin/elastase inhibitor from Ascaris suum (pig roundworm) Acp62F protein from Drosophila melanogaster Bombina trypsin inhibitor from Bombina maxima (large-webbed bell toad) Bombyx subtilisin inhibitor from Bombyx mori (silk moth) von Willebrand factor ; PDB: 2P3F_N 1HX2_A 1CCV_A 1EAI_D 2H9E_C 1COU_A 1ATE_A 1ATB_A 1ATD_A 1ATA_A ....
Probab=24.56 E-value=38 Score=17.70 Aligned_cols=16 Identities=31% Similarity=0.754 Sum_probs=11.9
Q ss_pred cCCCceeEeeCCCCCeEecc
Q psy10482 41 CTNGTLTLEQCENGLLYDGN 60 (84)
Q Consensus 41 C~~G~~~~~~Cp~Gl~Fd~~ 60 (84)
|..| =.|++|++++..
T Consensus 31 C~~g----C~C~~G~v~~~~ 46 (55)
T PF01826_consen 31 CVEG----CFCPPGYVRNDN 46 (55)
T ss_dssp -ESE----EEETTTEEEETT
T ss_pred CCcc----CCCCCCeeEcCC
Confidence 5554 469999999887
No 24
>PF12191 stn_TNFRSF12A: Tumour necrosis factor receptor stn_TNFRSF12A_TNFR domain; InterPro: IPR022316 The tumour necrosis factor (TNF) receptor (TNFR) superfamily comprises more than 20 type-I transmembrane proteins. Family members are defined based on similarity in their extracellular domain - a region that contains many cysteine residues arranged in a specific repetitive pattern []. The cysteines allow formation of an extended rod-like structure, responsible for ligand binding []. Upon receptor activation, different intracellular signalling complexes are assembled for different members of the TNFR superfamily, depending on their intracellular domains and sequences []. Activation of TNFRs can therefore induce a range of disparate effects, including cell proliferation, differentiation, survival, or apoptotic cell death, depending upon the receptor involved []. TNFRs are widely distributed and play important roles in many crucial biological processes, such as lymphoid and neuronal development, innate and adaptive immunity, and maintenance of cellular homeostasis []. Drugs that manipulate their signalling have potential roles in the prevention and treatment of many diseases, such as viral infections, coronary heart disease, transplant rejection, and immune disease []. TNF receptor 12 (also known as TWEAK receptor, and fibroblast growth factor-inducible-14 (Fn14)) has been implicated in endothelial cell growth and migration []. The receptor may also play a role in cell-matrix interactions [].; PDB: 2KN0_A 2RPJ_A 2KMZ_A 2EQP_A.
Probab=23.92 E-value=41 Score=21.72 Aligned_cols=16 Identities=19% Similarity=0.638 Sum_probs=10.8
Q ss_pred eeCCCCCeEeccCCCccCC
Q psy10482 49 EQCENGLLYDGNGNVHNHC 67 (84)
Q Consensus 49 ~~Cp~Gl~Fd~~~~~~~~C 67 (84)
..||.|..||+.. +.|
T Consensus 34 ~~Cp~G~sWs~dL---dkC 49 (129)
T PF12191_consen 34 TPCPRGSSWSADL---DKC 49 (129)
T ss_dssp SSS-SSEEEETTT---TEE
T ss_pred CCCCCCCcCcccc---ccc
Confidence 4677788888777 655
No 25
>PF09717 CPW_WPC: Plasmodium falciparum domain of unknown function (CPW_WPC); InterPro: IPR006387 This entry represents a domain of about 61 residues in length with six well-conserved cysteine residues and six well-conserved aromatic sites. The domain can be found in tandem repeats. It is named for motifs of CPxxW and (less well conserved) WPC. Its function is unknown.
Probab=21.53 E-value=73 Score=17.20 Aligned_cols=29 Identities=14% Similarity=0.290 Sum_probs=15.9
Q ss_pred eEeeCCCCCeEeccCCCccCCCCCCCc--ccCCCC
Q psy10482 47 TLEQCENGLLYDGNGNVHNHCNYYWGV--DCGNRK 79 (84)
Q Consensus 47 ~~~~Cp~Gl~Fd~~~~~~~~C~~~~~v--~C~~r~ 79 (84)
+...||.|.. .... +.|..|... .|..+.
T Consensus 5 ys~~CP~~W~-~~~~---~~C~ap~~Y~GpC~~~~ 35 (60)
T PF09717_consen 5 YSQPCPEGWI-LSDD---GSCIAPDSYNGPCKRKI 35 (60)
T ss_pred cccccccccc-cCCC---CeEcCCcccCCCCcCcc
Confidence 3445777766 4444 567666432 365543
No 26
>PHA02067 hypothetical protein
Probab=21.12 E-value=75 Score=22.29 Aligned_cols=12 Identities=17% Similarity=0.349 Sum_probs=8.4
Q ss_pred cccCCCCCCCCC
Q psy10482 12 PLLQAPSGCPEP 23 (84)
Q Consensus 12 ~~~~~~~~C~~~ 23 (84)
....+++.|...
T Consensus 15 sA~AASFDCakA 26 (221)
T PHA02067 15 SAMAASFDCNTT 26 (221)
T ss_pred cccCCCCCcccc
Confidence 446788999743
No 27
>KOG2397|consensus
Probab=20.59 E-value=1.1e+02 Score=23.90 Aligned_cols=22 Identities=32% Similarity=0.632 Sum_probs=15.1
Q ss_pred EcCCCc--eeEeeCCCCCeEeccC
Q psy10482 40 KCTNGT--LTLEQCENGLLYDGNG 61 (84)
Q Consensus 40 ~C~~G~--~~~~~Cp~Gl~Fd~~~ 61 (84)
-|.+|. +-...||+|.+|...+
T Consensus 65 DC~DGsDEPGtsACpngkF~C~N~ 88 (480)
T KOG2397|consen 65 DCLDGSDEPGTSACPNGKFYCVNQ 88 (480)
T ss_pred cCCCCCCCCccccCCCCceeeeec
Confidence 366665 3346788888887766
No 28
>TIGR01492 CPW_WPC Plasmodium falciparum CPW-WPC domain. This model represents a domain of about 61 residues in length with six well-conserved cysteine residues and six well-conserved aromatic sites. The domain can be found in tandem repeats, and is known so far only in Plasmodium falciparum. It is named for motifs of CPxxW and (less well conserved) WPC.
Probab=20.41 E-value=1e+02 Score=16.87 Aligned_cols=11 Identities=9% Similarity=0.413 Sum_probs=4.9
Q ss_pred eeCCCCCeEec
Q psy10482 49 EQCENGLLYDG 59 (84)
Q Consensus 49 ~~Cp~Gl~Fd~ 59 (84)
..||.|.....
T Consensus 8 ~~CP~~W~~~~ 18 (62)
T TIGR01492 8 SPCPENWIQKN 18 (62)
T ss_pred ccCCccceecC
Confidence 34555444433
No 29
>smart00131 KU BPTI/Kunitz family of serine protease inhibitors. Serine protease inhibitors. One member of the family is encoded by an alternatively-spliced form of Alzheimer's amyloid beta-protein.
Probab=20.23 E-value=88 Score=16.05 Aligned_cols=21 Identities=24% Similarity=0.509 Sum_probs=13.0
Q ss_pred CCeEeccCCCccCCCCCCCcccCC
Q psy10482 54 GLLYDGNGNVHNHCNYYWGVDCGN 77 (84)
Q Consensus 54 Gl~Fd~~~~~~~~C~~~~~v~C~~ 77 (84)
..+||+.+ +.|..-....|..
T Consensus 18 ~w~yd~~~---~~C~~f~~~gc~~ 38 (53)
T smart00131 18 RYYYDPET---GTCEPFTYGGCGG 38 (53)
T ss_pred EEEEeCCC---CccceeECCCccC
Confidence 56777777 7776544445544
Done!