Query psy5836
Match_columns 66
No_of_seqs 207 out of 1024
Neff 8.0
Searched_HMMs 46136
Date Fri Aug 16 19:17:42 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy5836.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/5836hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 PF14843 GF_recep_IV: Growth f 99.5 1.1E-14 2.3E-19 85.3 3.0 58 2-64 29-97 (132)
2 smart00261 FU Furin-like repea 99.4 5.2E-13 1.1E-17 65.2 2.9 45 18-64 2-46 (46)
3 cd00064 FU Furin-like repeats. 99.3 1.9E-12 4.1E-17 64.1 2.5 42 22-65 1-42 (49)
4 KOG1025|consensus 98.7 5.2E-09 1.1E-13 75.9 2.6 59 2-64 516-577 (1177)
5 PTZ00214 high cysteine membran 98.7 8.5E-09 1.8E-13 74.3 3.5 62 3-65 560-627 (800)
6 PF03302 VSP: Giardia variant- 98.7 1.4E-08 3E-13 68.4 4.0 58 4-62 92-153 (397)
7 PTZ00214 high cysteine membran 98.6 5.3E-08 1.1E-12 70.3 3.8 57 4-61 417-478 (800)
8 PF14843 GF_recep_IV: Growth f 98.6 3E-08 6.4E-13 58.1 1.8 38 2-39 85-132 (132)
9 PF00757 Furin-like: Furin-lik 98.4 1.4E-07 3.1E-12 56.3 2.6 42 17-64 47-89 (149)
10 KOG4260|consensus 97.5 2.5E-05 5.5E-10 51.1 -0.0 48 7-56 183-235 (350)
11 PF00757 Furin-like: Furin-lik 97.4 8.2E-05 1.8E-09 44.5 1.2 53 2-64 77-129 (149)
12 KOG4258|consensus 97.1 0.00026 5.7E-09 52.0 1.5 56 3-64 215-274 (1025)
13 KOG1025|consensus 97.0 0.0003 6.5E-09 52.1 1.6 35 23-61 223-258 (1177)
14 PTZ00382 Variant-specific surf 96.5 0.004 8.6E-08 34.8 3.0 49 2-55 4-54 (96)
15 PF13948 DUF4215: Domain of un 95.7 0.0068 1.5E-07 29.6 1.4 27 38-64 1-27 (47)
16 PF07562 NCD3G: Nine Cysteines 95.1 0.0071 1.5E-07 30.3 0.3 47 2-63 3-53 (54)
17 PF03302 VSP: Giardia variant- 94.5 0.028 6.1E-07 38.2 2.1 40 7-48 274-313 (397)
18 cd00185 TNFR Tumor necrosis fa 93.6 0.099 2.1E-06 28.9 2.8 22 4-25 13-40 (98)
19 KOG4258|consensus 93.4 0.024 5.1E-07 42.2 0.1 19 2-20 262-280 (1025)
20 PF07699 GCC2_GCC3: GCC2 and G 90.7 0.48 1E-05 22.8 3.0 9 37-45 27-35 (48)
21 PHA02637 TNF-alpha-receptor-li 88.2 0.77 1.7E-05 27.0 3.0 42 7-48 31-75 (127)
22 cd00185 TNFR Tumor necrosis fa 88.0 1.4 2.9E-05 24.2 3.8 12 52-65 75-86 (98)
23 PHA02637 TNF-alpha-receptor-li 84.1 1.4 3E-05 25.9 2.7 40 3-42 42-87 (127)
24 PF00020 TNFR_c6: TNFR/NGFR cy 84.1 0.55 1.2E-05 21.5 0.8 16 8-23 1-18 (39)
25 PF09064 Tme5_EGF_like: Thromb 82.6 0.92 2E-05 20.8 1.2 14 7-20 21-34 (34)
26 smart00208 TNFR Tumor necrosis 80.3 2.8 6E-05 18.9 2.5 16 8-23 1-18 (39)
27 PF12662 cEGF: Complement Clr- 76.6 1.7 3.6E-05 18.3 1.0 15 41-55 6-20 (24)
28 KOG0196|consensus 72.8 5.5 0.00012 30.3 3.4 54 8-64 263-318 (996)
29 PF12798 Fer4_3: 4Fe-4S bindin 70.7 1 2.2E-05 16.8 -0.4 10 52-61 5-14 (15)
30 PHA02714 CD-30-like protein; P 70.1 2.6 5.5E-05 23.9 1.0 19 6-24 22-41 (110)
31 PF12661 hEGF: Human growth fa 64.3 3.3 7.1E-05 14.8 0.5 8 58-65 4-11 (13)
32 PF09305 TACI-CRD2: TACI, cyst 63.6 3.9 8.5E-05 19.4 0.8 22 7-28 2-26 (41)
33 PF01826 TIL: Trypsin Inhibito 61.0 3.7 8.1E-05 19.9 0.5 13 8-20 37-49 (55)
34 PF12191 stn_TNFRSF12A: Tumour 60.4 7.9 0.00017 22.8 1.8 34 6-42 34-68 (129)
35 PF01683 EB: EB module; Inter 58.5 20 0.00043 17.1 3.0 13 51-65 36-48 (52)
36 PF01607 CBM_14: Chitin bindin 56.0 7 0.00015 18.1 1.0 16 6-21 29-45 (53)
37 KOG2813|consensus 55.7 9 0.0002 26.2 1.7 31 17-47 234-268 (406)
38 PF14670 FXa_inhibition: Coagu 51.8 7 0.00015 17.8 0.6 10 40-49 22-31 (36)
39 PF12800 Fer4_4: 4Fe-4S bindin 50.3 11 0.00023 14.3 0.9 9 2-10 8-16 (17)
40 PF07974 EGF_2: EGF-like domai 48.2 11 0.00023 16.7 0.9 12 52-65 19-30 (32)
41 PF13922 PHD_3: PHD domain of 47.8 4.2 9E-05 21.4 -0.6 25 17-42 43-68 (69)
42 PF05571 DUF766: Protein of un 47.0 12 0.00026 25.0 1.3 20 7-26 25-44 (296)
43 KOG1836|consensus 41.6 30 0.00064 28.4 2.8 54 6-65 759-821 (1705)
44 PF09257 BCMA-Tall_bind: BCMA, 40.9 8 0.00017 17.9 -0.2 19 11-29 5-23 (39)
45 PF12837 Fer4_6: 4Fe-4S bindin 40.8 6.3 0.00014 16.3 -0.5 10 52-61 13-22 (24)
46 PF00037 Fer4: 4Fe-4S binding 40.4 7.7 0.00017 15.8 -0.3 10 52-61 12-21 (24)
47 smart00180 EGF_Lam Laminin-typ 39.7 25 0.00054 16.5 1.5 8 57-64 33-40 (46)
48 PF02822 Antistasin: Antistasi 37.4 23 0.00049 14.9 1.0 18 4-22 6-23 (26)
49 KOG0994|consensus 34.9 21 0.00046 28.6 1.1 18 7-24 949-968 (1758)
50 KOG4611|consensus 34.5 24 0.00051 24.9 1.2 26 7-32 102-127 (747)
51 COG1150 HdrC Heterodisulfide r 31.6 25 0.00054 22.2 0.9 14 2-15 45-58 (195)
52 PF00008 EGF: EGF-like domain 29.8 25 0.00054 15.2 0.5 13 51-65 19-31 (32)
53 PF13746 Fer4_18: 4Fe-4S diclu 26.3 25 0.00054 18.0 0.2 11 52-62 56-66 (69)
54 smart00494 ChtBD2 Chitin-bindi 25.9 51 0.0011 15.0 1.3 10 7-16 32-41 (56)
55 KOG3744|consensus 25.7 34 0.00075 22.6 0.8 21 7-27 32-52 (311)
56 PF13187 Fer4_9: 4Fe-4S diclus 25.7 19 0.00042 17.0 -0.3 12 2-13 6-17 (55)
57 KOG3525|consensus 25.6 57 0.0012 22.8 1.9 39 4-45 207-245 (431)
58 smart00289 WR1 Worm-specific r 25.2 59 0.0013 14.1 1.4 6 57-62 21-26 (38)
59 PF07645 EGF_CA: Calcium-bindi 25.1 26 0.00056 16.0 0.1 10 52-63 25-34 (42)
60 TIGR03478 DMSO_red_II_bet DMSO 24.5 28 0.00061 23.5 0.2 26 35-61 188-213 (321)
61 PF12838 Fer4_7: 4Fe-4S diclus 23.1 24 0.00051 16.6 -0.2 10 2-11 6-15 (52)
62 PF04942 CC: CC domain; Inter 22.4 53 0.0012 14.9 0.9 14 7-20 19-32 (36)
63 PF13484 Fer4_16: 4Fe-4S doubl 21.8 16 0.00035 18.1 -1.0 10 2-11 6-15 (67)
No 1
>PF14843 GF_recep_IV: Growth factor receptor domain IV; PDB: 1N8Z_C 1S78_A 3N85_A 2A91_A 3MZW_A 3BE1_A 3U2P_A 2AHX_B 3LTF_A 3I2T_A ....
Probab=99.51 E-value=1.1e-14 Score=85.32 Aligned_cols=58 Identities=47% Similarity=1.171 Sum_probs=45.3
Q ss_pred CcccccCCCCe-----eeCCccccccCcccc------ccCCCCCcccccccCCCccccccccccccccccCccc
Q psy5836 2 STCVSRCPPRS-----FPNQGGVCWPCHESC------ETCAGAGQDSCLTCAPAHLRVTDLAICLQQCPEGYFE 64 (66)
Q Consensus 2 ~~Cv~~C~~g~-----y~~~~~~C~~C~~~C------~~C~g~~~~~C~~C~~~~~~~~~~~~Cv~~Cp~g~y~ 64 (66)
++||.+||.++ |.+ .++|++||++| .+|+||++++|++|+. ++ +.++||..||.|+|.
T Consensus 29 g~Cv~~C~~~~~~~~~~~~-~~~C~~Ch~~C~~~~~~~~C~Gp~~~~C~~C~~--~~--~~~~Cv~~CP~~~~~ 97 (132)
T PF14843_consen 29 GTCVASCPDGYYFDGSYED-SNTCQPCHPECLPSNGSETCTGPGADQCLSCAH--FR--DGGQCVSSCPEGTYG 97 (132)
T ss_dssp TEEESSTSSSSSSSSEEEE-TTEEEE--TTB---TTSSSBSSSSTTTBSSBSS--EE--ETTEEESS--EEEEB
T ss_pred Ccchhhccccccccccccc-CCcCcccChhhCccccCcCcCCCCcchhhhccc--cC--CCCcccCcCCCCCcc
Confidence 68999999763 333 79999999999 9999999999999994 44 479999999999643
No 2
>smart00261 FU Furin-like repeats.
Probab=99.36 E-value=5.2e-13 Score=65.25 Aligned_cols=45 Identities=38% Similarity=1.037 Sum_probs=40.9
Q ss_pred cccccCccccccCCCCCcccccccCCCccccccccccccccccCccc
Q psy5836 18 GVCWPCHESCETCAGAGQDSCLTCAPAHLRVTDLAICLQQCPEGYFE 64 (66)
Q Consensus 18 ~~C~~C~~~C~~C~g~~~~~C~~C~~~~~~~~~~~~Cv~~Cp~g~y~ 64 (66)
+.|.+||++|.+|.|+++++|++|..++++. .++||..||+++|.
T Consensus 2 ~~C~~C~~~C~~C~~~~~~~C~~C~~~~~l~--~~~Cv~~C~~~~~~ 46 (46)
T smart00261 2 GECKPCHPECATCTGPGPDDCTSCKHGFFLD--GGTCVSECPPGTYA 46 (46)
T ss_pred CccccccccccccCCcCcCcCccCCcccCCC--CCCccCcCCCCccC
Confidence 5799999999999999999999999988775 48999999999884
No 3
>cd00064 FU Furin-like repeats. Cysteine rich region. Exact function of the domain is not known. Furin is a serine-kinase dependent proprotein processor. Other members of this family include endoproteases and cell surface receptors.
Probab=99.29 E-value=1.9e-12 Score=64.08 Aligned_cols=42 Identities=40% Similarity=1.043 Sum_probs=38.0
Q ss_pred cCccccccCCCCCcccccccCCCccccccccccccccccCcccC
Q psy5836 22 PCHESCETCAGAGQDSCLTCAPAHLRVTDLAICLQQCPEGYFES 65 (66)
Q Consensus 22 ~C~~~C~~C~g~~~~~C~~C~~~~~~~~~~~~Cv~~Cp~g~y~~ 65 (66)
+||++|.+|+|+++++|++|..++++. .++||..||+++|.+
T Consensus 1 ~C~~~C~~C~g~~~~~C~~C~~~~~~~--~~~Cv~~C~~~~~~~ 42 (49)
T cd00064 1 PCHPSCATCTGPGPDQCTSCRHGFYLD--GGTCVSECPEGTYAD 42 (49)
T ss_pred CCCCccCCCcCcCcCcCccCcCccCCC--CCcccccCCCCceec
Confidence 589999999999999999999998854 699999999999874
No 4
>KOG1025|consensus
Probab=98.75 E-value=5.2e-09 Score=75.91 Aligned_cols=59 Identities=42% Similarity=0.959 Sum_probs=48.4
Q ss_pred CcccccCCC--Ceee-CCccccccCccccccCCCCCcccccccCCCccccccccccccccccCccc
Q psy5836 2 STCVSRCPP--RSFP-NQGGVCWPCHESCETCAGAGQDSCLTCAPAHLRVTDLAICLQQCPEGYFE 64 (66)
Q Consensus 2 ~~Cv~~C~~--g~y~-~~~~~C~~C~~~C~~C~g~~~~~C~~C~~~~~~~~~~~~Cv~~Cp~g~y~ 64 (66)
++||+.|.. |-+. ..+..|..||+.|+||+|+++++|+.|. .|. +...||..||+|--+
T Consensus 516 g~CVe~C~~l~g~~rf~~~~~C~~ChPEC~TCnG~G~d~C~~Ca--Hf~--dgp~CV~~CP~G~~G 577 (1177)
T KOG1025|consen 516 GTCVEKCNLLGGEPRFVNSRECERCHPECETCNGPGADNCLQCA--HFR--DGPHCVSDCPDGVTG 577 (1177)
T ss_pred ceehhhccccCCcccccccceecccChhhccCCCCCccchhhhh--hcC--CCcchhccCCCcccC
Confidence 579999973 3333 3468999999999999999999999998 443 578999999998644
No 5
>PTZ00214 high cysteine membrane protein Group 4; Provisional
Probab=98.74 E-value=8.5e-09 Score=74.30 Aligned_cols=62 Identities=21% Similarity=0.588 Sum_probs=52.3
Q ss_pred cccccCCCCeeeCCccccccCccccccCCCCCcccccccCCCcccccc----cccccc--ccccCcccC
Q psy5836 3 TCVSRCPPRSFPNQGGVCWPCHESCETCAGAGQDSCLTCAPAHLRVTD----LAICLQ--QCPEGYFES 65 (66)
Q Consensus 3 ~Cv~~C~~g~y~~~~~~C~~C~~~C~~C~g~~~~~C~~C~~~~~~~~~----~~~Cv~--~Cp~g~y~~ 65 (66)
+|. +|..|||.+....|++|+++|.+|.++++.+|++|..+++|... .+.||. .|++|||.+
T Consensus 560 ~Ct-~C~~GYyl~~~~~C~~Cd~~C~~C~~~~~~~C~sC~~g~yL~~~~~~~~g~Cv~~~~C~~GYY~d 627 (800)
T PTZ00214 560 KCT-TTKKGYGISPDGKLLECDPTCLACTAPGPGRCTRCPSDKLLKRASGAATGSCVDPGACVDGYYAD 627 (800)
T ss_pred ccc-ccccCcccCCCCccccccchhhhccCCCCCcceECCCCCEeccccccccccccCcccCCCCcccC
Confidence 454 57789998778889999999999999999999999999988532 357884 899999986
No 6
>PF03302 VSP: Giardia variant-specific surface protein; InterPro: IPR005127 During infection, the intestinal protozoan parasite Giardia lamblia virus undergoes continuous antigenic variation which is determined by diversification of the parasite's major surface antigen, named VSP (variant surface protein).
Probab=98.72 E-value=1.4e-08 Score=68.36 Aligned_cols=58 Identities=38% Similarity=0.966 Sum_probs=48.2
Q ss_pred ccccCCCCeeeCCccccccCccccccCCCCCcccccccCCCcccccc----ccccccccccCc
Q psy5836 4 CVSRCPPRSFPNQGGVCWPCHESCETCAGAGQDSCLTCAPAHLRVTD----LAICLQQCPEGY 62 (66)
Q Consensus 4 Cv~~C~~g~y~~~~~~C~~C~~~C~~C~g~~~~~C~~C~~~~~~~~~----~~~Cv~~Cp~g~ 62 (66)
+...||+|||.+ ++.|.+||++|.+|.++.+++|++|.++++|... .++|+..|-++.
T Consensus 92 ~c~~C~~G~y~~-~~~C~~C~~~C~~C~~~~~~~Ct~C~~g~~L~~~~~~~~~tC~~~c~~~~ 153 (397)
T PF03302_consen 92 CCSECPDGYYKN-GNKCVPCHESCATCSGGAPNQCTSCKPGKVLKYGTDGTKGTCGAECTEDC 153 (397)
T ss_pred cccCCCCCcccc-CCCCCCCCccccccCCCCCCCCcccCCCcccccccCCCcCcccccccccc
Confidence 456899999985 6799999999999999999999999999987641 367887775543
No 7
>PTZ00214 high cysteine membrane protein Group 4; Provisional
Probab=98.58 E-value=5.3e-08 Score=70.28 Aligned_cols=57 Identities=32% Similarity=0.870 Sum_probs=48.2
Q ss_pred ccccCCCCeeeCCccccccCccccccCCCCCcccccccCCCccccc-----cccccccccccC
Q psy5836 4 CVSRCPPRSFPNQGGVCWPCHESCETCAGAGQDSCLTCAPAHLRVT-----DLAICLQQCPEG 61 (66)
Q Consensus 4 Cv~~C~~g~y~~~~~~C~~C~~~C~~C~g~~~~~C~~C~~~~~~~~-----~~~~Cv~~Cp~g 61 (66)
|+..|++|||. ..+.|.+||++|.+|.++++..|++|.++++|.. ..++||..|+++
T Consensus 417 C~c~C~~G~y~-~~g~C~~C~~sCa~C~~~~~~~CtsC~~g~~L~~~~~~~~~gtCv~~C~~~ 478 (800)
T PTZ00214 417 CRCVCKPTFYN-SSGTCTPCTDSCAVCKDGTPTGCQQCSPGKILEFSIVSSESADCVDQCSVG 478 (800)
T ss_pred ccccCCCCccc-CCCCccCCCCcccccCCCCcCcCccCCCCcEecccccCCCCCceeccCCCC
Confidence 55578999996 4789999999999999999999999999987742 357899999864
No 8
>PF14843 GF_recep_IV: Growth factor receptor domain IV; PDB: 1N8Z_C 1S78_A 3N85_A 2A91_A 3MZW_A 3BE1_A 3U2P_A 2AHX_B 3LTF_A 3I2T_A ....
Probab=98.57 E-value=3e-08 Score=58.06 Aligned_cols=38 Identities=45% Similarity=1.224 Sum_probs=28.4
Q ss_pred CcccccCCCCe---------eeCCccccccCcccc-ccCCCCCccccc
Q psy5836 2 STCVSRCPPRS---------FPNQGGVCWPCHESC-ETCAGAGQDSCL 39 (66)
Q Consensus 2 ~~Cv~~C~~g~---------y~~~~~~C~~C~~~C-~~C~g~~~~~C~ 39 (66)
++||++||.|+ |.+..++|++||++| ..|+||.+.+|.
T Consensus 85 ~~Cv~~CP~~~~~~~~~~~~~~~~~~~C~~Ch~~C~~gC~Gp~~~~C~ 132 (132)
T PF14843_consen 85 GQCVSSCPEGTYGENFPIYYYPDSNGVCQPCHPSCTQGCTGPSASDCQ 132 (132)
T ss_dssp TEEESS--EEEEBTTSEEEEEE-TTSBEEE-STTSTT-BSSSSTTCC-
T ss_pred CcccCcCCCCCccccccceecCCCCCccCCcCccccCcCCCCCcccCC
Confidence 57999999994 556689999999999 899999998873
No 9
>PF00757 Furin-like: Furin-like cysteine rich region; InterPro: IPR006211 The furin-like cysteine rich region has been found in a variety of proteins from eukaryotes that are involved in the mechanism of signal transduction by receptor tyrosine kinases, which involves receptor aggregation [].; GO: 0004714 transmembrane receptor protein tyrosine kinase activity, 0005524 ATP binding, 0006468 protein phosphorylation, 0007169 transmembrane receptor protein tyrosine kinase signaling pathway, 0016020 membrane; PDB: 3U2P_A 2AHX_B 1N8Y_C 1IGR_A 1S78_A 3MZW_A 2A91_A 1N8Z_C 3N85_A 3H3B_B ....
Probab=98.45 E-value=1.4e-07 Score=56.34 Aligned_cols=42 Identities=40% Similarity=1.120 Sum_probs=31.3
Q ss_pred ccccccCcccc-ccCCCCCcccccccCCCccccccccccccccccCccc
Q psy5836 17 GGVCWPCHESC-ETCAGAGQDSCLTCAPAHLRVTDLAICLQQCPEGYFE 64 (66)
Q Consensus 17 ~~~C~~C~~~C-~~C~g~~~~~C~~C~~~~~~~~~~~~Cv~~Cp~g~y~ 64 (66)
...| ||++| ..|+||++++|++|+ .+.+ +++||+.||+|+|.
T Consensus 47 ~~~C--Ch~eClGGC~gp~~~~C~aCr--~~~~--~g~Cv~~CP~~~Y~ 89 (149)
T PF00757_consen 47 PGDC--CHEECLGGCSGPNPSDCYACR--HFSY--NGTCVEQCPPGKYE 89 (149)
T ss_dssp CGGB----TTEESEESSSSTTSESCES--SEEE--TTEEESS-STT-EE
T ss_pred cccc--hhHHHhhcCCCCCCCcCHHHh--CcCc--CCeecccCChhhcc
Confidence 4555 99999 689999999999999 3443 69999999999874
No 10
>KOG4260|consensus
Probab=97.47 E-value=2.5e-05 Score=51.09 Aligned_cols=48 Identities=29% Similarity=0.627 Sum_probs=40.1
Q ss_pred cCCCCeeeC----CccccccCccccc-cCCCCCcccccccCCCcccccccccccc
Q psy5836 7 RCPPRSFPN----QGGVCWPCHESCE-TCAGAGQDSCLTCAPAHLRVTDLAICLQ 56 (66)
Q Consensus 7 ~C~~g~y~~----~~~~C~~C~~~C~-~C~g~~~~~C~~C~~~~~~~~~~~~Cv~ 56 (66)
.|..+||.. +..+|.+||..|. .|+|+++.+|..|+.+|.|. ...||+
T Consensus 183 ~Cg~eyfes~Rne~~lvCt~Ch~~C~~~Csg~~~k~C~kCkkGW~ld--e~gCvD 235 (350)
T KOG4260|consen 183 YCGIEYFESSRNEQHLVCTACHEGCLGVCSGESSKGCSKCKKGWKLD--EEGCVD 235 (350)
T ss_pred ccchHHHHhhcccccchhhhhhhhhhcccCCCCCCChhhhcccceec--cccccc
Confidence 567778753 2689999999997 99999999999999999986 466664
No 11
>PF00757 Furin-like: Furin-like cysteine rich region; InterPro: IPR006211 The furin-like cysteine rich region has been found in a variety of proteins from eukaryotes that are involved in the mechanism of signal transduction by receptor tyrosine kinases, which involves receptor aggregation [].; GO: 0004714 transmembrane receptor protein tyrosine kinase activity, 0005524 ATP binding, 0006468 protein phosphorylation, 0007169 transmembrane receptor protein tyrosine kinase signaling pathway, 0016020 membrane; PDB: 3U2P_A 2AHX_B 1N8Y_C 1IGR_A 1S78_A 3MZW_A 2A91_A 1N8Z_C 3N85_A 3H3B_B ....
Probab=97.36 E-value=8.2e-05 Score=44.49 Aligned_cols=53 Identities=30% Similarity=0.729 Sum_probs=30.6
Q ss_pred CcccccCCCCeeeCCccccccCccccccCCCCCcccccccCCCccccccccccccccccCccc
Q psy5836 2 STCVSRCPPRSFPNQGGVCWPCHESCETCAGAGQDSCLTCAPAHLRVTDLAICLQQCPEGYFE 64 (66)
Q Consensus 2 ~~Cv~~C~~g~y~~~~~~C~~C~~~C~~C~g~~~~~C~~C~~~~~~~~~~~~Cv~~Cp~g~y~ 64 (66)
++||++||.|+|.- +.+|.. ...|..-.. .=.. ..++..++.||..||.||..
T Consensus 77 g~Cv~~CP~~~Y~~-~~rCVt-~~~C~~~~~-------~~~~-~~~~~~~~~Cv~~CP~gy~~ 129 (149)
T PF00757_consen 77 GTCVEQCPPGKYEF-GRRCVT-KEECPSMCE-------SSKE-YPFLIFNGSCVRECPSGYME 129 (149)
T ss_dssp TEEESS-STT-EEE-TTEEE--HSSHHCCTT-------TTST-TSEEEETTEEESS-TTTEEE
T ss_pred CeecccCChhhccc-ccEeec-cccCCchhh-------cccc-cceeeeCCcccccCCCCCEE
Confidence 67999999999975 666653 234432211 0011 22223579999999999986
No 12
>KOG4258|consensus
Probab=97.07 E-value=0.00026 Score=52.03 Aligned_cols=56 Identities=32% Similarity=0.926 Sum_probs=40.3
Q ss_pred cccccCCCCee--eCCccccccCcccc-ccCCCCCc-ccccccCCCccccccccccccccccCccc
Q psy5836 3 TCVSRCPPRSF--PNQGGVCWPCHESC-ETCAGAGQ-DSCLTCAPAHLRVTDLAICLQQCPEGYFE 64 (66)
Q Consensus 3 ~Cv~~C~~g~y--~~~~~~C~~C~~~C-~~C~g~~~-~~C~~C~~~~~~~~~~~~Cv~~Cp~g~y~ 64 (66)
.|..+||..-- -++++.| ||..| ..|..+.. +.|.+|+. ++ .+|.||.+||.+||.
T Consensus 215 ~CQK~Cp~~~g~gC~ang~c--CH~eClGgC~~~~dptaC~aCr~--~y--~~G~CV~~Cp~~~Y~ 274 (1025)
T KOG4258|consen 215 HCQKSCPENCGNGCTANGEC--CHEECLGGCSQPNDPTACVACRG--LY--DDGVCVEACPPDYYL 274 (1025)
T ss_pred chhccCccccccCccccccc--ccHhhccCcCCCCChhHhHHhhc--cc--cCCceeccCCcchhh
Confidence 46667764221 2345565 99999 68998875 78999983 22 259999999999986
No 13
>KOG1025|consensus
Probab=97.05 E-value=0.0003 Score=52.06 Aligned_cols=35 Identities=34% Similarity=1.089 Sum_probs=29.1
Q ss_pred Ccccc-ccCCCCCcccccccCCCccccccccccccccccC
Q psy5836 23 CHESC-ETCAGAGQDSCLTCAPAHLRVTDLAICLQQCPEG 61 (66)
Q Consensus 23 C~~~C-~~C~g~~~~~C~~C~~~~~~~~~~~~Cv~~Cp~g 61 (66)
||..| ..|+||+..+|++|. .|. +.+.|+..||.-
T Consensus 223 Ch~~CagGCtGp~~~dC~AC~--~f~--d~G~C~~~CP~~ 258 (1177)
T KOG1025|consen 223 CHSECAGGCTGPKDKDCIACR--KFN--DSGICVETCPQL 258 (1177)
T ss_pred HHHHhccCCCCCCCCCeeeec--ccc--cCceecccCCCc
Confidence 89999 699999999999998 443 468888888865
No 14
>PTZ00382 Variant-specific surface protein (VSP); Provisional
Probab=96.47 E-value=0.004 Score=34.79 Aligned_cols=49 Identities=24% Similarity=0.667 Sum_probs=38.0
Q ss_pred CcccccCCCCeeeCC-ccccccCc-cccccCCCCCcccccccCCCccccccccccc
Q psy5836 2 STCVSRCPPRSFPNQ-GGVCWPCH-ESCETCAGAGQDSCLTCAPAHLRVTDLAICL 55 (66)
Q Consensus 2 ~~Cv~~C~~g~y~~~-~~~C~~C~-~~C~~C~g~~~~~C~~C~~~~~~~~~~~~Cv 55 (66)
.+| ..|..|||.+. ...|.+|. +.|.+|.++ ..|..|..+|++. .+.|+
T Consensus 4 ~~C-t~C~~g~~~~~~~~~C~~C~~~~C~~C~~~--~~C~~C~~GY~~~--~~~Cv 54 (96)
T PTZ00382 4 AVC-TSCDSDKKPNKDGSGCVLCSVGNCKSCVVD--GVCGECNSGFSLD--NGKCV 54 (96)
T ss_pred ccc-CcCCCCCccCCCCCcCCcCCCCCCcCCCCC--CccccCcCCcccC--CCccc
Confidence 356 47899998865 45699998 599999864 5799999998764 56776
No 15
>PF13948 DUF4215: Domain of unknown function (DUF4215)
Probab=95.72 E-value=0.0068 Score=29.61 Aligned_cols=27 Identities=22% Similarity=0.507 Sum_probs=22.1
Q ss_pred ccccCCCccccccccccccccccCccc
Q psy5836 38 CLTCAPAHLRVTDLAICLQQCPEGYFE 64 (66)
Q Consensus 38 C~~C~~~~~~~~~~~~Cv~~Cp~g~y~ 64 (66)
|+.|..+|++....++|++.|++|+..
T Consensus 1 C~~C~~G~~l~~~~~~C~~~CGDgii~ 27 (47)
T PF13948_consen 1 CLQCQDGYYLNQNNNKCEPICGDGIIV 27 (47)
T ss_pred CCcccCccEEcCCCCccccccCCCeEE
Confidence 678888998876678899999998653
No 16
>PF07562 NCD3G: Nine Cysteines Domain of family 3 GPCR; InterPro: IPR011500 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). GPCR family 3 receptors (also known as family C) are structurally similar to other GPCRs, but do not show any significant sequence similarity and thus represent a distinct group. Structurally they are composed of four elements; an N-terminal signal sequence; a large hydrophilic extracellular agonist-binding region containing several conserved cysteine residues which could be involved in disulphide bonds; a shorter region containing seven transmembrane domains; and a C-terminal cytoplasmic domain of variable length []. Family 3 members include the metabotropic glutamate receptors, the extracellular calcium-sensing receptors, the gamma-amino-butyric acid (GABA) type B receptors, and the vomeronasal type-2 receptors [, , , ]. As these receptors regulate many important physiological processes they are potentially promising targets for drug development. This entry represents a conserved sequence, found in the extracellular region, that contains several highly-conserved Cys residues that are predicted to form disulphide bridges.; GO: 0004930 G-protein coupled receptor activity, 0007186 G-protein coupled receptor protein signaling pathway; PDB: 2E4X_B 2E4Y_A 2E4U_B 2E4V_B 2E4W_A.
Probab=95.08 E-value=0.0071 Score=30.28 Aligned_cols=47 Identities=32% Similarity=0.848 Sum_probs=23.2
Q ss_pred CcccccCCCCeeeCC---ccccccCccccccCCCCCcccccccCCCccccc-cccccccccccCcc
Q psy5836 2 STCVSRCPPRSFPNQ---GGVCWPCHESCETCAGAGQDSCLTCAPAHLRVT-DLAICLQQCPEGYF 63 (66)
Q Consensus 2 ~~Cv~~C~~g~y~~~---~~~C~~C~~~C~~C~g~~~~~C~~C~~~~~~~~-~~~~Cv~~Cp~g~y 63 (66)
.+|...|+.|++... ...| |. +|+.|..+.+... +...|+ .||+|++
T Consensus 3 S~CS~~C~pG~~k~~~~~~~~C---------Cw-----~C~~C~~~~is~~~~~~~C~-~C~~~~w 53 (54)
T PF07562_consen 3 SVCSEPCPPGQRKKIQKGQPSC---------CW-----DCVPCPEGEISNQTDSTSCT-KCPEGQW 53 (54)
T ss_dssp --SS----TTTEEEE--SSS-S----------------EEEE--TTEEEE--ETTEEE-E--TTEE
T ss_pred cccCCCCCCCCEECccCCCcce---------EE-----EeecCCCCcEECCCCccccc-cCCCccC
Confidence 578999999998743 3334 22 6788887777653 456787 6888876
No 17
>PF03302 VSP: Giardia variant-specific surface protein; InterPro: IPR005127 During infection, the intestinal protozoan parasite Giardia lamblia virus undergoes continuous antigenic variation which is determined by diversification of the parasite's major surface antigen, named VSP (variant surface protein).
Probab=94.48 E-value=0.028 Score=38.22 Aligned_cols=40 Identities=28% Similarity=0.761 Sum_probs=29.9
Q ss_pred cCCCCeeeCCccccccCccccccCCCCCcccccccCCCcccc
Q psy5836 7 RCPPRSFPNQGGVCWPCHESCETCAGAGQDSCLTCAPAHLRV 48 (66)
Q Consensus 7 ~C~~g~y~~~~~~C~~C~~~C~~C~g~~~~~C~~C~~~~~~~ 48 (66)
.|..|||.+ ...|.+|+..|.+|.+ ....|++|..++|+.
T Consensus 274 ~C~~g~~~~-~~~C~~C~~~c~tC~g-~~~~Ct~C~~Gyy~~ 313 (397)
T PF03302_consen 274 TCANGYYKD-VGGCTKCTAGCATCTG-DAATCTACADGYYKS 313 (397)
T ss_pred cccCCcccc-ccccccCCCcccccCC-ccccccchhcccccc
Confidence 566677764 6788888888888885 457888888887753
No 18
>cd00185 TNFR Tumor necrosis factor receptor (TNFR) domain; superfamily of TNF-like receptor domains. When bound to TNF-like cytokines, TNFRs trigger multiple signal transduction pathways, they are involved in inflammation response, apoptosis, autoimmunity and organogenesis. TNFRs domains are elongated with generally three tandem repeats of cysteine-rich domains (CRDs). They fit in the grooves between protomers within the ligand trimer. Some TNFRs, such as NGFR and HveA, bind ligands with no structural similarity to TNF and do not bind ligand trimers.
Probab=93.58 E-value=0.099 Score=28.86 Aligned_cols=22 Identities=36% Similarity=1.105 Sum_probs=11.7
Q ss_pred ccccCCCCeeeCC------ccccccCcc
Q psy5836 4 CVSRCPPRSFPNQ------GGVCWPCHE 25 (66)
Q Consensus 4 Cv~~C~~g~y~~~------~~~C~~C~~ 25 (66)
|-..||.|++... +.+|.+|.+
T Consensus 13 cC~~C~pG~~~~~~C~~~~~t~C~~C~~ 40 (98)
T cd00185 13 CCDQCPPGTYLSKDCTPGSDTVCEPCPP 40 (98)
T ss_pred CCCCCCCCccCCCcCCCCCCCeecCCCC
Confidence 3456777766532 244555544
No 19
>KOG4258|consensus
Probab=93.41 E-value=0.024 Score=42.24 Aligned_cols=19 Identities=32% Similarity=0.819 Sum_probs=15.5
Q ss_pred CcccccCCCCeeeCCcccc
Q psy5836 2 STCVSRCPPRSFPNQGGVC 20 (66)
Q Consensus 2 ~~Cv~~C~~g~y~~~~~~C 20 (66)
+.||.+||.++|...+.+|
T Consensus 262 G~CV~~Cp~~~Y~~e~~RC 280 (1025)
T KOG4258|consen 262 GVCVEACPPDYYLFENWRC 280 (1025)
T ss_pred CceeccCCcchhhhcccee
Confidence 5799999999998766555
No 20
>PF07699 GCC2_GCC3: GCC2 and GCC3; InterPro: IPR011641 Protein phosphorylation, which plays a key role in most cellular activities, is a reversible process mediated by protein kinases and phosphoprotein phosphatases. Protein kinases catalyse the transfer of the gamma phosphate from nucleotide triphosphates (often ATP) to one or more amino acid residues in a protein substrate side chain, resulting in a conformational change affecting protein function. Phosphoprotein phosphatases catalyse the reverse process. Protein kinases fall into three broad classes, characterised with respect to substrate specificity []: Serine/threonine-protein kinases Tyrosine-protein kinases Dual specific protein kinases (e.g. MEK - phosphorylates both Thr and Tyr on target proteins) Protein kinase function has been evolutionarily conserved from Escherichia coli to human []. Protein kinases play a role in a multitude of cellular processes, including division, proliferation, apoptosis, and differentiation []. Phosphorylation usually results in a functional change of the target protein by changing enzyme activity, cellular location, or association with other proteins. The catalytic subunits of protein kinases are highly conserved, and several structures have been solved [], leading to large screens to develop kinase-specific inhibitors for the treatments of a number of diseases []. Tyrosine-protein kinases can transfer a phosphate group from ATP to a tyrosine residue in a protein. These enzymes can be divided into two main groups []: Receptor tyrosine kinases (RTK), which are transmembrane proteins involved in signal transduction; they play key roles in growth, differentiation, metabolism, adhesion, motility, death and oncogenesis []. RTKs are composed of 3 domains: an extracellular domain (binds ligand), a transmembrane (TM) domain, and an intracellular catalytic domain (phosphorylates substrate). The TM domain plays an important role in the dimerisation process necessary for signal transduction []. Cytoplasmic / non-receptor tyrosine kinases, which act as regulatory proteins, playing key roles in cell differentiation, motility, proliferation, and survival. For example, the Src-family of protein-tyrosine kinases []. This entry represents various ephrin type A and B receptors, which have tyrosine kinase activity.
Probab=90.73 E-value=0.48 Score=22.77 Aligned_cols=9 Identities=44% Similarity=1.191 Sum_probs=4.0
Q ss_pred cccccCCCc
Q psy5836 37 SCLTCAPAH 45 (66)
Q Consensus 37 ~C~~C~~~~ 45 (66)
.|+.|..+.
T Consensus 27 ~C~~Cp~g~ 35 (48)
T PF07699_consen 27 SCTPCPPGS 35 (48)
T ss_pred cCccCcCCC
Confidence 444444443
No 21
>PHA02637 TNF-alpha-receptor-like protein; Provisional
Probab=88.20 E-value=0.77 Score=27.00 Aligned_cols=42 Identities=14% Similarity=0.519 Sum_probs=28.3
Q ss_pred cCCCCeeeCCccccccCcccc---ccCCCCCcccccccCCCcccc
Q psy5836 7 RCPPRSFPNQGGVCWPCHESC---ETCAGAGQDSCLTCAPAHLRV 48 (66)
Q Consensus 7 ~C~~g~y~~~~~~C~~C~~~C---~~C~g~~~~~C~~C~~~~~~~ 48 (66)
.|.++.|...+.-|.+|.+.= .-|+..+.+.|..|..+.|+.
T Consensus 31 ~C~e~EY~~~~~CC~kCPPGt~v~~~Ct~~t~T~C~PCp~GTYTe 75 (127)
T PHA02637 31 KCKDNEYKRHNLCCLSCPPGTYASRLCDIKTNTQCTPCGSGTFTS 75 (127)
T ss_pred CCCCCcCcCCCeEcCCCCCCCEEeCcCCCCCCcccccCCCCCeec
Confidence 566666654456677776554 456666677888888887654
No 22
>cd00185 TNFR Tumor necrosis factor receptor (TNFR) domain; superfamily of TNF-like receptor domains. When bound to TNF-like cytokines, TNFRs trigger multiple signal transduction pathways, they are involved in inflammation response, apoptosis, autoimmunity and organogenesis. TNFRs domains are elongated with generally three tandem repeats of cysteine-rich domains (CRDs). They fit in the grooves between protomers within the ligand trimer. Some TNFRs, such as NGFR and HveA, bind ligands with no structural similarity to TNF and do not bind ligand trimers.
Probab=87.96 E-value=1.4 Score=24.24 Aligned_cols=12 Identities=25% Similarity=0.913 Sum_probs=7.7
Q ss_pred cccccccccCcccC
Q psy5836 52 AICLQQCPEGYFES 65 (66)
Q Consensus 52 ~~Cv~~Cp~g~y~~ 65 (66)
..|. |++|||.+
T Consensus 75 t~C~--C~~G~y~~ 86 (98)
T cd00185 75 TVCG--CKPGFYCL 86 (98)
T ss_pred CeEe--CCCCCEec
Confidence 4554 78887753
No 23
>PHA02637 TNF-alpha-receptor-like protein; Provisional
Probab=84.11 E-value=1.4 Score=25.93 Aligned_cols=40 Identities=28% Similarity=0.751 Sum_probs=22.1
Q ss_pred cccccCCCCeeeCC------ccccccCccccccCCCCCcccccccC
Q psy5836 3 TCVSRCPPRSFPNQ------GGVCWPCHESCETCAGAGQDSCLTCA 42 (66)
Q Consensus 3 ~Cv~~C~~g~y~~~------~~~C~~C~~~C~~C~g~~~~~C~~C~ 42 (66)
.|-..||.|+|... ..+|.+|.+.=.+=..-....|+.|.
T Consensus 42 ~CC~kCPPGt~v~~~Ct~~t~T~C~PCp~GTYTe~~N~~~~C~~C~ 87 (127)
T PHA02637 42 LCCLSCPPGTYASRLCDIKTNTQCTPCGSGTFTSHNNHLPACLSCN 87 (127)
T ss_pred eEcCCCCCCCEEeCcCCCCCCcccccCCCCCeeccCCCCCcccccC
Confidence 45667888877632 34676776553322223345666665
No 24
>PF00020 TNFR_c6: TNFR/NGFR cysteine-rich region; InterPro: IPR001368 A number of proteins, some of which are known to be receptors for growth factors have been found to contain a cysteine-rich domain at the N-terminal region that can be subdivided into four (or in some cases, three) repeats containing six conserved cysteines all of which are involved in intrachain disulphide bonds []. CD27 (also called S152 or T14) mediates a co-stimulatory signal for T and B cell activation and is involved in murine T cell development. Tyrosine-phosphorylation of ZAP-70 following CD27 ligation of T cells has been reported [], but not confirmed independently. CD30 was originally identified as Ki-1, an antigen expressed on Reed-Sternberg cells in Hodgkin's lymphomas and other non-Hodgkin's lymphomas, particularly diffuse large-cell lymphoma and immunoblastic lymphoma. CD30 has pleiotropic effects on CD30-positive lymphoma cell lines ranging from cell proliferation to cell death. It is thought to be involved in negative selection of T-cells in the thymus and is involved in TCR-mediated cell death. CD30 is a member of the TNFR family of molecules, activate NFkB through interaction with TRAF2 and TRAF5. CD40 (Bp50) plays a central role in the regulation of cell-mediated immunity as well as antibody mediated immunity. It is central to T cell dependent (TD)-responses and may influence survival of B cell lymphomas. CD95 (also called APO-1, fas antigen, Fas tumour necrosis factor receptor superfamily, member 6, TNFRSF6 or apoptosis antigen 1, APT1) is expressed, typically at high levels, on activated T and B cells. It is involved in the mediation of apoptosis-inducing signals. Other proteins known to belong to this family [, , , ] are, tumour Necrosis Factor type I and type II receptors (TNFR), Rabbit fibroma virus soluble TNF receptor (protein T2), lymphotoxin alpha/beta receptor, low-affinity nerve growth factor receptor (LA-NGFR) (p75), T-cell antigen OX40, Wsl-1, a receptor (for a yet undefined ligand) that mediates apoptosis and Vaccinia virus protein A53 (SalF19R). CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/).; GO: 0005488 binding; PDB: 3TJE_F 3QD6_S 3ALQ_U 3IJ2_Y 3BUK_D 1SG1_X 1NCF_B 1EXT_A 1FT4_B 1TNR_R ....
Probab=84.09 E-value=0.55 Score=21.54 Aligned_cols=16 Identities=38% Similarity=1.271 Sum_probs=10.2
Q ss_pred CCCCeeeCCc--cccccC
Q psy5836 8 CPPRSFPNQG--GVCWPC 23 (66)
Q Consensus 8 C~~g~y~~~~--~~C~~C 23 (66)
|+.|+|.+.. ..|.+|
T Consensus 1 C~~g~y~~~~~~~~C~~C 18 (39)
T PF00020_consen 1 CPPGTYSDSENHPQCLPC 18 (39)
T ss_dssp ECTTEEEESSCSSSEEEE
T ss_pred CccCcccCCCCCCcCCcc
Confidence 6888888642 455444
No 25
>PF09064 Tme5_EGF_like: Thrombomodulin like fifth domain, EGF-like; InterPro: IPR015149 This domain adopts a fold similar to other EGF domains, with a flat major and a twisted minor beta sheet. Disulphide pairing, however, is not of the usual 1-3, 2-4, 5-6 type; rather 1-2, 3-4, 5-6 pairing is found. Its extended major sheet (strands beta-2 and beta-3 and the connecting loop) projects into thrombin's active site groove. This domain is required for interaction of thrombomodulin with thrombin, and subsequent activation of protein-C []. ; GO: 0004888 transmembrane signaling receptor activity, 0016021 integral to membrane
Probab=82.62 E-value=0.92 Score=20.80 Aligned_cols=14 Identities=29% Similarity=0.838 Sum_probs=10.3
Q ss_pred cCCCCeeeCCcccc
Q psy5836 7 RCPPRSFPNQGGVC 20 (66)
Q Consensus 7 ~C~~g~y~~~~~~C 20 (66)
.||+||..+.+..|
T Consensus 21 ~CPeGyIlde~~~c 34 (34)
T PF09064_consen 21 FCPEGYILDEGSMC 34 (34)
T ss_pred eCCCceEecCCccC
Confidence 48899988776554
No 26
>smart00208 TNFR Tumor necrosis factor receptor / nerve growth factor receptor repeats. Repeats in growth factor receptors that are involved in growth factor binding. TNF/TNFR
Probab=80.29 E-value=2.8 Score=18.87 Aligned_cols=16 Identities=25% Similarity=0.877 Sum_probs=9.6
Q ss_pred CCCCeeeCCc--cccccC
Q psy5836 8 CPPRSFPNQG--GVCWPC 23 (66)
Q Consensus 8 C~~g~y~~~~--~~C~~C 23 (66)
|+.|+|.+.. ..|++|
T Consensus 1 C~~g~y~~~~~~~~C~~C 18 (39)
T smart00208 1 CKEGTYCSDGNHSSCLRC 18 (39)
T ss_pred CCCCeecCCCCCccCccC
Confidence 6778887643 345444
No 27
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=76.64 E-value=1.7 Score=18.31 Aligned_cols=15 Identities=27% Similarity=0.437 Sum_probs=7.0
Q ss_pred cCCCccccccccccc
Q psy5836 41 CAPAHLRVTDLAICL 55 (66)
Q Consensus 41 C~~~~~~~~~~~~Cv 55 (66)
|.+||.|..+...|+
T Consensus 6 C~~Gy~l~~d~~~C~ 20 (24)
T PF12662_consen 6 CPPGYQLSPDGRSCE 20 (24)
T ss_pred CCCCCcCCCCCCccc
Confidence 455554444444444
No 28
>KOG0196|consensus
Probab=72.79 E-value=5.5 Score=30.35 Aligned_cols=54 Identities=24% Similarity=0.650 Sum_probs=24.0
Q ss_pred CCCCeeeC-CccccccCccccccCCCCCcccccccCCCccccc-cccccccccccCccc
Q psy5836 8 CPPRSFPN-QGGVCWPCHESCETCAGAGQDSCLTCAPAHLRVT-DLAICLQQCPEGYFE 64 (66)
Q Consensus 8 C~~g~y~~-~~~~C~~C~~~C~~C~g~~~~~C~~C~~~~~~~~-~~~~Cv~~Cp~g~y~ 64 (66)
|..||-.. .+..|++|.+.-..=.. +...|..|+++-.... ....|. |-.|||.
T Consensus 263 C~aGye~~~~~~~C~aCp~G~yK~~~-~~~~C~~CP~~S~s~~ega~~C~--C~~gyyR 318 (996)
T KOG0196|consen 263 CKAGYEEAENGKACQACPPGTYKASQ-GDSLCLPCPPNSHSSSEGATSCT--CENGYYR 318 (996)
T ss_pred ecCCCCcccCCCcceeCCCCcccCCC-CCCCCCCCCCCCCCCCCCCCccc--ccCCccc
Confidence 44554331 24555555544322111 2345666665443321 123443 6666663
No 29
>PF12798 Fer4_3: 4Fe-4S binding domain; InterPro: IPR001450 This superfamily includes proteins containing domains which bind to iron-sulphur clusters. Members include bacterial ferredoxins, various dehydrogenases, and various reductases. Structure of the domain is an alpha-antiparallel beta sandwich. Ferredoxins are iron-sulphur proteins that mediate electron transfer in a range of metabolic reactions; they fall into several subgroups according to the nature of their iron-sulphur cluster(s) [, ]. One group, originally found in bacteria, has been termed "bacterial-type", in which the active centre is a 4Fe-4S cluster. 4Fe-4S ferredoxins may in turn be subdivided into further groups, based on their sequence properties. Most contain at least one conserved domain, including four Cys residues that bind to a 4Fe-4S centre. ; GO: 0009055 electron carrier activity, 0051536 iron-sulfur cluster binding
Probab=70.72 E-value=1 Score=16.80 Aligned_cols=10 Identities=40% Similarity=1.291 Sum_probs=7.3
Q ss_pred cccccccccC
Q psy5836 52 AICLQQCPEG 61 (66)
Q Consensus 52 ~~Cv~~Cp~g 61 (66)
+.|+..||.|
T Consensus 5 ~~C~~~Cp~~ 14 (15)
T PF12798_consen 5 GACVEVCPTG 14 (15)
T ss_pred hHHHHHhcCC
Confidence 5678888865
No 30
>PHA02714 CD-30-like protein; Provisional
Probab=70.08 E-value=2.6 Score=23.91 Aligned_cols=19 Identities=26% Similarity=0.807 Sum_probs=15.3
Q ss_pred ccCCCCeeeCC-ccccccCc
Q psy5836 6 SRCPPRSFPNQ-GGVCWPCH 24 (66)
Q Consensus 6 ~~C~~g~y~~~-~~~C~~C~ 24 (66)
.+|+.+||.|. ...|.+|-
T Consensus 22 kqC~~dYY~Dpe~g~CtACV 41 (110)
T PHA02714 22 KTCPKDYYLEPEDGLCTACV 41 (110)
T ss_pred ccCCCcccccCCCCceeeec
Confidence 47999999984 78888773
No 31
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=64.35 E-value=3.3 Score=14.82 Aligned_cols=8 Identities=38% Similarity=1.111 Sum_probs=4.7
Q ss_pred cccCcccC
Q psy5836 58 CPEGYFES 65 (66)
Q Consensus 58 Cp~g~y~~ 65 (66)
||+||..+
T Consensus 4 C~~G~~G~ 11 (13)
T PF12661_consen 4 CPPGWTGP 11 (13)
T ss_dssp E-TTEETT
T ss_pred CcCCCcCC
Confidence 77887654
No 32
>PF09305 TACI-CRD2: TACI, cysteine-rich domain; InterPro: IPR015384 Cytokines can be grouped into a family on the basis of sequence, functional and structural similarities [, , ]. Tumor necrosis factor (TNF) (also known as TNF-alpha or cachectin) is a monocyte-derived cytotoxin that has been implicated in tumour regression, septic shock and cachexia [, ]. The protein is synthesised as a prohormone with an unusually long and atypical signal sequence, which is absent from the mature secreted cytokine []. A short hydrophobic stretch of amino acids serves to anchor the prohormone in lipid bilayers []. Both the mature protein and a partially-processed form of the hormone are secreted after cleavage of the propeptide []. There are a number of different families of TNF, but all these cytokines seem to form homotrimeric (or heterotrimeric in the case of LT-alpha/beta) complexes that are recognised by their specific receptors. This entry represents a cysteine-rich domain found in the TACI family of proteins. Members of this family are predominantly found in tumour necrosis factor receptor superfamily, member 13b (TACI), and are required for binding to the ligands APRIL and BAFF []. ; PDB: 1XU1_T 1XUT_A.
Probab=63.59 E-value=3.9 Score=19.43 Aligned_cols=22 Identities=23% Similarity=0.674 Sum_probs=14.9
Q ss_pred cCCC--CeeeCC-ccccccCccccc
Q psy5836 7 RCPP--RSFPNQ-GGVCWPCHESCE 28 (66)
Q Consensus 7 ~C~~--g~y~~~-~~~C~~C~~~C~ 28 (66)
+|+. |.|+|. -+.|..|.+.|.
T Consensus 2 sC~keqg~YyD~LL~~CisC~siCg 26 (41)
T PF09305_consen 2 SCPKEQGQYYDPLLRDCISCKSICG 26 (41)
T ss_dssp ---GGGTEEEETTTTEEEEGGGTTT
T ss_pred CchhhhhHHHHHHHHHHccchhhhc
Confidence 4553 567766 789999998886
No 33
>PF01826 TIL: Trypsin Inhibitor like cysteine rich domain; InterPro: IPR002919 This domain is found in proteinase inhibitors as well as in many extracellular proteins. The domain typically contains ten cysteine residues that form five disulphide bonds. The cysteine residues that form the disulphide bonds are 1-7, 2-6, 3-5, 4-10 and 8-9. This inhibitor domain belongs to MEROPS inhibitor family I8 (clan IA). Proteins containing this domain inhibit peptidases belonging to families S1 (IPR001254 from INTERPRO), S8 (IPR000209 from INTERPRO), and M4 (IPR001570 from INTERPRO) [] and are restricted to the chordata, nematoda, arthropoda and echinodermata. Examples of proteins containing this domain are: chymotrypsin/elastase inhibitor from Ascaris suum (pig roundworm) Acp62F protein from Drosophila melanogaster Bombina trypsin inhibitor from Bombina maxima (large-webbed bell toad) Bombyx subtilisin inhibitor from Bombyx mori (silk moth) von Willebrand factor ; PDB: 2P3F_N 1HX2_A 1CCV_A 1EAI_D 2H9E_C 1COU_A 1ATE_A 1ATB_A 1ATD_A 1ATA_A ....
Probab=61.03 E-value=3.7 Score=19.91 Aligned_cols=13 Identities=46% Similarity=1.132 Sum_probs=7.1
Q ss_pred CCCCeeeCCcccc
Q psy5836 8 CPPRSFPNQGGVC 20 (66)
Q Consensus 8 C~~g~y~~~~~~C 20 (66)
|+.||+.+.++.|
T Consensus 37 C~~G~v~~~~~~C 49 (55)
T PF01826_consen 37 CPPGYVRNDNGRC 49 (55)
T ss_dssp ETTTEEEETTSEE
T ss_pred CCCCeeEcCCCCE
Confidence 4666666544444
No 34
>PF12191 stn_TNFRSF12A: Tumour necrosis factor receptor stn_TNFRSF12A_TNFR domain; InterPro: IPR022316 The tumour necrosis factor (TNF) receptor (TNFR) superfamily comprises more than 20 type-I transmembrane proteins. Family members are defined based on similarity in their extracellular domain - a region that contains many cysteine residues arranged in a specific repetitive pattern []. The cysteines allow formation of an extended rod-like structure, responsible for ligand binding []. Upon receptor activation, different intracellular signalling complexes are assembled for different members of the TNFR superfamily, depending on their intracellular domains and sequences []. Activation of TNFRs can therefore induce a range of disparate effects, including cell proliferation, differentiation, survival, or apoptotic cell death, depending upon the receptor involved []. TNFRs are widely distributed and play important roles in many crucial biological processes, such as lymphoid and neuronal development, innate and adaptive immunity, and maintenance of cellular homeostasis []. Drugs that manipulate their signalling have potential roles in the prevention and treatment of many diseases, such as viral infections, coronary heart disease, transplant rejection, and immune disease []. TNF receptor 12 (also known as TWEAK receptor, and fibroblast growth factor-inducible-14 (Fn14)) has been implicated in endothelial cell growth and migration []. The receptor may also play a role in cell-matrix interactions [].; PDB: 2KN0_A 2RPJ_A 2KMZ_A 2EQP_A.
Probab=60.42 E-value=7.9 Score=22.82 Aligned_cols=34 Identities=26% Similarity=0.544 Sum_probs=19.6
Q ss_pred ccCCCCeeeCC-ccccccCccccccCCCCCcccccccC
Q psy5836 6 SRCPPRSFPNQ-GGVCWPCHESCETCAGAGQDSCLTCA 42 (66)
Q Consensus 6 ~~C~~g~y~~~-~~~C~~C~~~C~~C~g~~~~~C~~C~ 42 (66)
..||.|+..+. ...|.+|. +|.+ -|....|..|.
T Consensus 34 ~~Cp~G~sWs~dLdkCm~Cs-sC~~--~P~S~~C~~C~ 68 (129)
T PF12191_consen 34 TPCPRGSSWSADLDKCMSCS-SCPA--YPKSPFCQGCP 68 (129)
T ss_dssp SSS-SSEEEETTTTEEEECC-CHCC---TT-CCCCCHS
T ss_pred CCCCCCCcCcccccccccCc-cCCC--CCCCccccCCC
Confidence 47999998765 66887774 3322 23345566665
No 35
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO
Probab=58.54 E-value=20 Score=17.06 Aligned_cols=13 Identities=46% Similarity=1.042 Sum_probs=9.2
Q ss_pred ccccccccccCcccC
Q psy5836 51 LAICLQQCPEGYFES 65 (66)
Q Consensus 51 ~~~Cv~~Cp~g~y~~ 65 (66)
.+.|+ |++||...
T Consensus 36 ~g~C~--C~~g~~~~ 48 (52)
T PF01683_consen 36 NGRCQ--CPPGYVEV 48 (52)
T ss_pred CCEeE--CCCCCEec
Confidence 46776 99987654
No 36
>PF01607 CBM_14: Chitin binding Peritrophin-A domain; InterPro: IPR002557 This entry represents a chitin binding domain []. It is found in (amongst others) the Peritrophin-A chitin binding proteins, particularly the peritrophic matrix proteins of insects and animal chitinases [, , ]. Copies of the domain are also found in some baculoviruses. It is an extracellular domain that contains six conserved cysteines that probably form three disulphide bridges. Chitin binding has been demonstrated for a protein containing only two of these domains [].; GO: 0008061 chitin binding, 0006030 chitin metabolic process, 0005576 extracellular region; PDB: 1DQC_A.
Probab=55.98 E-value=7 Score=18.12 Aligned_cols=16 Identities=25% Similarity=0.625 Sum_probs=7.7
Q ss_pred ccCCCCeeeCC-ccccc
Q psy5836 6 SRCPPRSFPNQ-GGVCW 21 (66)
Q Consensus 6 ~~C~~g~y~~~-~~~C~ 21 (66)
..||.|++.|. .++|.
T Consensus 29 ~~C~~g~~fd~~~~~C~ 45 (53)
T PF01607_consen 29 QRCPEGLYFDPSSQRCV 45 (53)
T ss_dssp EE-TTS-EE-TTTSSEE
T ss_pred CCCcCCCEECcCcCEEc
Confidence 46788777664 45543
No 37
>KOG2813|consensus
Probab=55.66 E-value=9 Score=26.22 Aligned_cols=31 Identities=35% Similarity=0.985 Sum_probs=24.2
Q ss_pred ccccccCc----cccccCCCCCcccccccCCCccc
Q psy5836 17 GGVCWPCH----ESCETCAGAGQDSCLTCAPAHLR 47 (66)
Q Consensus 17 ~~~C~~C~----~~C~~C~g~~~~~C~~C~~~~~~ 47 (66)
..+|..|| ..|.+|.|.+..-|++|.....|
T Consensus 234 ~~~C~~C~G~G~~~C~tC~grG~k~C~TC~gtgsl 268 (406)
T KOG2813|consen 234 HDLCYMCHGRGIKECHTCKGRGKKPCTTCSGTGSL 268 (406)
T ss_pred cchhhhccCCCcccCCcccCCCCcccccccCccce
Confidence 56788887 46889999999999999865444
No 38
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=51.83 E-value=7 Score=17.83 Aligned_cols=10 Identities=20% Similarity=0.497 Sum_probs=3.7
Q ss_pred ccCCCccccc
Q psy5836 40 TCAPAHLRVT 49 (66)
Q Consensus 40 ~C~~~~~~~~ 49 (66)
+|+.+|.|..
T Consensus 22 ~C~~Gy~L~~ 31 (36)
T PF14670_consen 22 SCPPGYKLAE 31 (36)
T ss_dssp E-STTEEE-T
T ss_pred ECCCCCEECc
Confidence 3455554443
No 39
>PF12800 Fer4_4: 4Fe-4S binding domain; InterPro: IPR001450 This superfamily includes proteins containing domains which bind to iron-sulphur clusters. Members include bacterial ferredoxins, various dehydrogenases, and various reductases. Structure of the domain is an alpha-antiparallel beta sandwich. Ferredoxins are iron-sulphur proteins that mediate electron transfer in a range of metabolic reactions; they fall into several subgroups according to the nature of their iron-sulphur cluster(s) [, ]. One group, originally found in bacteria, has been termed "bacterial-type", in which the active centre is a 4Fe-4S cluster. 4Fe-4S ferredoxins may in turn be subdivided into further groups, based on their sequence properties. Most contain at least one conserved domain, including four Cys residues that bind to a 4Fe-4S centre. ; GO: 0009055 electron carrier activity, 0051536 iron-sulfur cluster binding; PDB: 1BD6_A 1BQX_A 1BWE_A 1BC6_A.
Probab=50.32 E-value=11 Score=14.27 Aligned_cols=9 Identities=44% Similarity=1.276 Sum_probs=5.9
Q ss_pred CcccccCCC
Q psy5836 2 STCVSRCPP 10 (66)
Q Consensus 2 ~~Cv~~C~~ 10 (66)
+.|+..||.
T Consensus 8 ~~C~~~Cp~ 16 (17)
T PF12800_consen 8 GSCVDVCPT 16 (17)
T ss_dssp SSSTTTSTT
T ss_pred chHHhhccC
Confidence 357777774
No 40
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=48.16 E-value=11 Score=16.70 Aligned_cols=12 Identities=33% Similarity=0.949 Sum_probs=8.8
Q ss_pred cccccccccCcccC
Q psy5836 52 AICLQQCPEGYFES 65 (66)
Q Consensus 52 ~~Cv~~Cp~g~y~~ 65 (66)
++|+ |.+||+.+
T Consensus 19 g~C~--C~~g~~G~ 30 (32)
T PF07974_consen 19 GRCV--CDSGYTGP 30 (32)
T ss_pred CEEE--CCCCCcCC
Confidence 5565 89998875
No 41
>PF13922 PHD_3: PHD domain of transcriptional enhancer, Asx
Probab=47.82 E-value=4.2 Score=21.39 Aligned_cols=25 Identities=28% Similarity=0.850 Sum_probs=19.1
Q ss_pred ccccccCcccc-ccCCCCCcccccccC
Q psy5836 17 GGVCWPCHESC-ETCAGAGQDSCLTCA 42 (66)
Q Consensus 17 ~~~C~~C~~~C-~~C~g~~~~~C~~C~ 42 (66)
--+|+.|-..| .-|.||+ ..|++|.
T Consensus 43 Mi~Cq~CGAFCHDDCIgps-kLCvsCl 68 (69)
T PF13922_consen 43 MIMCQGCGAFCHDDCIGPS-KLCVSCL 68 (69)
T ss_pred HHHHhhccchhccccccHH-HHhHhhc
Confidence 35788898888 6899984 5777774
No 42
>PF05571 DUF766: Protein of unknown function (DUF766); InterPro: IPR008485 This family consists of several eukaryotic proteins of unknown function.
Probab=46.96 E-value=12 Score=25.02 Aligned_cols=20 Identities=35% Similarity=0.871 Sum_probs=13.5
Q ss_pred cCCCCeeeCCccccccCccc
Q psy5836 7 RCPPRSFPNQGGVCWPCHES 26 (66)
Q Consensus 7 ~C~~g~y~~~~~~C~~C~~~ 26 (66)
.||-|+-.|++..|++|..+
T Consensus 25 aCprG~R~n~~~iC~pC~~~ 44 (296)
T PF05571_consen 25 ACPRGYRVNESSICQPCDDS 44 (296)
T ss_pred CCCCccccCCCcccccCCCC
Confidence 56777766666777777644
No 43
>KOG1836|consensus
Probab=41.55 E-value=30 Score=28.43 Aligned_cols=54 Identities=28% Similarity=0.661 Sum_probs=32.7
Q ss_pred ccCCCCeeeCC----ccccccCcc----cc-ccCCCCCcccccccCCCccccccccccccccccCcccC
Q psy5836 6 SRCPPRSFPNQ----GGVCWPCHE----SC-ETCAGAGQDSCLTCAPAHLRVTDLAICLQQCPEGYFES 65 (66)
Q Consensus 6 ~~C~~g~y~~~----~~~C~~C~~----~C-~~C~g~~~~~C~~C~~~~~~~~~~~~Cv~~Cp~g~y~~ 65 (66)
..|..|||.+. ..-|++|+- .| .++. ...-.|+.|++++. +-.|. .|-+|||.+
T Consensus 759 ~~C~~GfYg~~~~~~~~dC~~C~Cp~~~~~~~~~~-~~~~iCk~Cp~gyt----G~rCe-~c~dgyfg~ 821 (1705)
T KOG1836|consen 759 AQCVDGFYGLPDLGTSGDCQPCPCPNGGACGQTPE-ILEVVCKNCPPGYT----GLRCE-ECADGYFGN 821 (1705)
T ss_pred hhhcCCCCCccccCCCCCCccCCCCCChhhcCcCc-ccceecCCCCCCCc----ccccc-cCCCccccC
Confidence 36788999753 233888852 22 2221 23457888887764 34565 678888765
No 44
>PF09257 BCMA-Tall_bind: BCMA, TALL-1 binding; InterPro: IPR015337 Cytokines can be grouped into a family on the basis of sequence, functional and structural similarities [, , ]. Tumor necrosis factor (TNF) (also known as TNF-alpha or cachectin) is a monocyte-derived cytotoxin that has been implicated in tumour regression, septic shock and cachexia [, ]. The protein is synthesised as a prohormone with an unusually long and atypical signal sequence, which is absent from the mature secreted cytokine []. A short hydrophobic stretch of amino acids serves to anchor the prohormone in lipid bilayers []. Both the mature protein and a partially-processed form of the hormone are secreted after cleavage of the propeptide []. There are a number of different families of TNF, but all these cytokines seem to form homotrimeric (or heterotrimeric in the case of LT-alpha/beta) complexes that are recognised by their specific receptors. Members of this entry, which are predominantly found in the tumour necrosis factor receptor superfamily member 17, BCMA, are required for binding to tumour necrosis factor ligand TALL-1 []. ; PDB: 2KN1_A 1OQD_R 1XU2_T.
Probab=40.93 E-value=8 Score=17.93 Aligned_cols=19 Identities=32% Similarity=0.704 Sum_probs=13.1
Q ss_pred CeeeCCccccccCcccccc
Q psy5836 11 RSFPNQGGVCWPCHESCET 29 (66)
Q Consensus 11 g~y~~~~~~C~~C~~~C~~ 29 (66)
.||...-..|.+||-.|..
T Consensus 5 EYfD~LL~aCkPChLRCsn 23 (39)
T PF09257_consen 5 EYFDRLLHACKPCHLRCSN 23 (39)
T ss_dssp EEEETTTTEEEEHHHHHTS
T ss_pred HHHHHHHHhcccceeecCC
Confidence 3444347899999988753
No 45
>PF12837 Fer4_6: 4Fe-4S binding domain; InterPro: IPR001450 This superfamily includes proteins containing domains which bind to iron-sulphur clusters. Members include bacterial ferredoxins, various dehydrogenases, and various reductases. Structure of the domain is an alpha-antiparallel beta sandwich. Ferredoxins are iron-sulphur proteins that mediate electron transfer in a range of metabolic reactions; they fall into several subgroups according to the nature of their iron-sulphur cluster(s) [, ]. One group, originally found in bacteria, has been termed "bacterial-type", in which the active centre is a 4Fe-4S cluster. 4Fe-4S ferredoxins may in turn be subdivided into further groups, based on their sequence properties. Most contain at least one conserved domain, including four Cys residues that bind to a 4Fe-4S centre. ; GO: 0009055 electron carrier activity, 0051536 iron-sulfur cluster binding
Probab=40.76 E-value=6.3 Score=16.31 Aligned_cols=10 Identities=50% Similarity=1.308 Sum_probs=6.6
Q ss_pred cccccccccC
Q psy5836 52 AICLQQCPEG 61 (66)
Q Consensus 52 ~~Cv~~Cp~g 61 (66)
+.|+..||.+
T Consensus 13 g~C~~~Cp~~ 22 (24)
T PF12837_consen 13 GDCVRVCPEG 22 (24)
T ss_pred hHHHHhcchh
Confidence 4577777754
No 46
>PF00037 Fer4: 4Fe-4S binding domain; InterPro: IPR001450 This superfamily includes proteins containing domains which bind to iron-sulphur clusters. Members include bacterial ferredoxins, various dehydrogenases, and various reductases. Structure of the domain is an alpha-antiparallel beta sandwich. Ferredoxins are iron-sulphur proteins that mediate electron transfer in a range of metabolic reactions; they fall into several subgroups according to the nature of their iron-sulphur cluster(s) [, ]. One group, originally found in bacteria, has been termed "bacterial-type", in which the active centre is a 4Fe-4S cluster. 4Fe-4S ferredoxins may in turn be subdivided into further groups, based on their sequence properties. Most contain at least one conserved domain, including four Cys residues that bind to a 4Fe-4S centre. ; GO: 0009055 electron carrier activity, 0051536 iron-sulfur cluster binding; PDB: 1DUR_A 1H98_A 1BD6_A 1BQX_A 1BWE_A 1BC6_A 3BK7_A 1FCA_A 1FDN_A 2FDN_A ....
Probab=40.44 E-value=7.7 Score=15.84 Aligned_cols=10 Identities=30% Similarity=0.939 Sum_probs=6.7
Q ss_pred cccccccccC
Q psy5836 52 AICLQQCPEG 61 (66)
Q Consensus 52 ~~Cv~~Cp~g 61 (66)
+.|+..||.+
T Consensus 12 g~C~~~CP~~ 21 (24)
T PF00037_consen 12 GRCVEACPFD 21 (24)
T ss_dssp THHHHHSTTS
T ss_pred chhhhhcccc
Confidence 4577777765
No 47
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=39.69 E-value=25 Score=16.52 Aligned_cols=8 Identities=38% Similarity=1.310 Sum_probs=4.7
Q ss_pred ccccCccc
Q psy5836 57 QCPEGYFE 64 (66)
Q Consensus 57 ~Cp~g~y~ 64 (66)
.|++|||.
T Consensus 33 ~C~~g~~g 40 (46)
T smart00180 33 RCAPGYYG 40 (46)
T ss_pred cCCCCcCC
Confidence 56666654
No 48
>PF02822 Antistasin: Antistasin family; InterPro: IPR004094 Peptide proteinase inhibitors can be found as single domain proteins or as single or multiple domains within proteins; these are referred to as either simple or compound inhibitors, respectively. In many cases they are synthesised as part of a larger precursor protein, either as a prepropeptide or as an N-terminal domain associated with an inactive peptidase or zymogen. This domain prevents access of the substrate to the active site. Removal of the N-terminal inhibitor domain either by interaction with a second peptidase or by autocatalytic cleavage activates the zymogen. Other inhibitors interact direct with proteinases using a simple noncovalent lock and key mechanism; while yet others use a conformational change-based trapping mechanism that depends on their structural and thermodynamic properties. This group of serine protease inhibitors belong to MEROPS inhibitor family I15, clan IO. They inhibit serine peptidases of the S1 family (IPR001254 from INTERPRO) [] and are characterised by a well conserved pattern of cysteine residues. Many of the proteins that belong to this family are anti-coagulants.; GO: 0004867 serine-type endopeptidase inhibitor activity; PDB: 1HIA_I 1BX8_A 1BX7_A 3BG4_D 1C9P_B 1EJA_B 1SKZ_A.
Probab=37.37 E-value=23 Score=14.94 Aligned_cols=18 Identities=22% Similarity=0.425 Sum_probs=8.3
Q ss_pred ccccCCCCeeeCCcccccc
Q psy5836 4 CVSRCPPRSFPNQGGVCWP 22 (66)
Q Consensus 4 Cv~~C~~g~y~~~~~~C~~ 22 (66)
|...||.||-.|. +.|..
T Consensus 6 C~~~C~~Gf~~D~-~GC~~ 23 (26)
T PF02822_consen 6 CRMYCPYGFKTDE-NGCPT 23 (26)
T ss_dssp ES---TT-EEE-T-TSSEE
T ss_pred cCCcCCCcccCCC-CCCCc
Confidence 7778999988763 44443
No 49
>KOG0994|consensus
Probab=34.95 E-value=21 Score=28.58 Aligned_cols=18 Identities=39% Similarity=0.946 Sum_probs=12.9
Q ss_pred cCCCCeeeCC--ccccccCc
Q psy5836 7 RCPPRSFPNQ--GGVCWPCH 24 (66)
Q Consensus 7 ~C~~g~y~~~--~~~C~~C~ 24 (66)
.|-+|||.+. ++.|++|.
T Consensus 949 ~CA~~~fGnP~~GGtCq~Ce 968 (1758)
T KOG0994|consen 949 ICADNHFGNPSEGGTCQKCE 968 (1758)
T ss_pred hhcccccCCcccCCcccccc
Confidence 5667787754 78888874
No 50
>KOG4611|consensus
Probab=34.50 E-value=24 Score=24.93 Aligned_cols=26 Identities=27% Similarity=0.714 Sum_probs=20.6
Q ss_pred cCCCCeeeCCccccccCccccccCCC
Q psy5836 7 RCPPRSFPNQGGVCWPCHESCETCAG 32 (66)
Q Consensus 7 ~C~~g~y~~~~~~C~~C~~~C~~C~g 32 (66)
.|..|+|.+.++-|.+|..+|..=..
T Consensus 102 ncasgfyrndngyctkcetscsemns 127 (747)
T KOG4611|consen 102 NCASGFYRNDNGYCTKCETSCSEMNS 127 (747)
T ss_pred cccccceECCCcccccccccHhhhcC
Confidence 56789999889999999998864433
No 51
>COG1150 HdrC Heterodisulfide reductase, subunit C [Energy production and conversion]
Probab=31.65 E-value=25 Score=22.17 Aligned_cols=14 Identities=36% Similarity=1.004 Sum_probs=11.3
Q ss_pred CcccccCCCCeeeC
Q psy5836 2 STCVSRCPPRSFPN 15 (66)
Q Consensus 2 ~~Cv~~C~~g~y~~ 15 (66)
++|..+||.|-|.+
T Consensus 45 G~CT~sCPs~r~t~ 58 (195)
T COG1150 45 GTCTGSCPSGRFTD 58 (195)
T ss_pred CcccCCCCCcccCC
Confidence 67999999987654
No 52
>PF00008 EGF: EGF-like domain This is a sub-family of the Pfam entry This is a sub-family of the Pfam entry; InterPro: IPR006209 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length.; GO: 0005515 protein binding; PDB: 1WHE_A 1CCF_A 1APO_A 1WHF_A 2VJ3_A 1TOZ_A 4D90_B 3CFW_A 1EDM_B 1IXA_A ....
Probab=29.78 E-value=25 Score=15.18 Aligned_cols=13 Identities=38% Similarity=0.810 Sum_probs=9.3
Q ss_pred ccccccccccCcccC
Q psy5836 51 LAICLQQCPEGYFES 65 (66)
Q Consensus 51 ~~~Cv~~Cp~g~y~~ 65 (66)
...|. |+.||..+
T Consensus 19 ~y~C~--C~~G~~G~ 31 (32)
T PF00008_consen 19 GYTCE--CPPGYTGK 31 (32)
T ss_dssp EEEEE--EBTTEEST
T ss_pred CEEeE--CCCCCccC
Confidence 45675 89998765
No 53
>PF13746 Fer4_18: 4Fe-4S dicluster domain
Probab=26.27 E-value=25 Score=18.03 Aligned_cols=11 Identities=36% Similarity=1.102 Sum_probs=8.0
Q ss_pred cccccccccCc
Q psy5836 52 AICLQQCPEGY 62 (66)
Q Consensus 52 ~~Cv~~Cp~g~ 62 (66)
+.|+..||.+-
T Consensus 56 grCv~~CP~~I 66 (69)
T PF13746_consen 56 GRCVRVCPAGI 66 (69)
T ss_pred ChHhhhcCCCC
Confidence 56777898763
No 54
>smart00494 ChtBD2 Chitin-binding domain type 2.
Probab=25.90 E-value=51 Score=15.00 Aligned_cols=10 Identities=20% Similarity=0.481 Sum_probs=6.1
Q ss_pred cCCCCeeeCC
Q psy5836 7 RCPPRSFPNQ 16 (66)
Q Consensus 7 ~C~~g~y~~~ 16 (66)
.||.|++.+.
T Consensus 32 ~C~~g~~f~~ 41 (56)
T smart00494 32 SCPAGLVFDP 41 (56)
T ss_pred cCcCCCeECC
Confidence 5677765554
No 55
>KOG3744|consensus
Probab=25.75 E-value=34 Score=22.63 Aligned_cols=21 Identities=38% Similarity=0.683 Sum_probs=14.0
Q ss_pred cCCCCeeeCCccccccCcccc
Q psy5836 7 RCPPRSFPNQGGVCWPCHESC 27 (66)
Q Consensus 7 ~C~~g~y~~~~~~C~~C~~~C 27 (66)
.||-|+-.|+...|++|+.+-
T Consensus 32 vCprG~R~Na~~~C~pC~~~l 52 (311)
T KOG3744|consen 32 VCPRGQRTNAQKYCQPCTESP 52 (311)
T ss_pred CCCCccccchhhhcCCCCCCC
Confidence 467777666677777776543
No 56
>PF13187 Fer4_9: 4Fe-4S dicluster domain; PDB: 2WSF_C 2WSE_C 2O01_C 2WSC_C 3LW5_C 2VKR_C 1KQG_B 1KQF_B 3GYX_J.
Probab=25.70 E-value=19 Score=16.96 Aligned_cols=12 Identities=33% Similarity=0.861 Sum_probs=8.1
Q ss_pred CcccccCCCCee
Q psy5836 2 STCVSRCPPRSF 13 (66)
Q Consensus 2 ~~Cv~~C~~g~y 13 (66)
+.|+..||.+.+
T Consensus 6 g~C~~~CP~~~~ 17 (55)
T PF13187_consen 6 GRCVEACPVGVI 17 (55)
T ss_dssp THHHHHSTTT-E
T ss_pred chHHHHCCccCe
Confidence 468888987654
No 57
>KOG3525|consensus
Probab=25.62 E-value=57 Score=22.79 Aligned_cols=39 Identities=23% Similarity=0.725 Sum_probs=29.2
Q ss_pred ccccCCCCeeeCCccccccCccccccCCCCCcccccccCCCc
Q psy5836 4 CVSRCPPRSFPNQGGVCWPCHESCETCAGAGQDSCLTCAPAH 45 (66)
Q Consensus 4 Cv~~C~~g~y~~~~~~C~~C~~~C~~C~g~~~~~C~~C~~~~ 45 (66)
++..|+.. ......|..++.++.++.++ ...|+.|....
T Consensus 207 ~~~~~~~~--p~y~~~C~~~~~s~~s~~~~-~~~~~~~~~~~ 245 (431)
T KOG3525|consen 207 CATQCGKK--PQYRERCASCLASTYSSGGP-TEECIVCTDPR 245 (431)
T ss_pred cccccCCC--ccccccccccccccccCCCC-cceeeeecCCC
Confidence 44555552 22368999999999999999 88999998654
No 58
>smart00289 WR1 Worm-specific repeat type 1. Worm-specific repeat type 1. Cysteine-rich domain apparently unique (so far) to C. elegans. Often appears with KU domains. About 3 dozen worm proteins contain this domain.
Probab=25.19 E-value=59 Score=14.11 Aligned_cols=6 Identities=67% Similarity=1.879 Sum_probs=2.8
Q ss_pred ccccCc
Q psy5836 57 QCPEGY 62 (66)
Q Consensus 57 ~Cp~g~ 62 (66)
.||.||
T Consensus 21 ~CP~g~ 26 (38)
T smart00289 21 SCPSGY 26 (38)
T ss_pred CCCCCC
Confidence 455444
No 59
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=25.08 E-value=26 Score=15.98 Aligned_cols=10 Identities=50% Similarity=1.148 Sum_probs=6.0
Q ss_pred cccccccccCcc
Q psy5836 52 AICLQQCPEGYF 63 (66)
Q Consensus 52 ~~Cv~~Cp~g~y 63 (66)
..|+ |++||.
T Consensus 25 y~C~--C~~Gy~ 34 (42)
T PF07645_consen 25 YSCS--CPPGYE 34 (42)
T ss_dssp EEEE--ESTTEE
T ss_pred EEee--CCCCcE
Confidence 4565 666665
No 60
>TIGR03478 DMSO_red_II_bet DMSO reductase family type II enzyme, iron-sulfur subunit. This model represents the iron-sulfur subunit, typically called the beta subunit, of various proteins that also contain a molybdopterin subunit and a heme b subunit. The group includes two distinct but very closely related periplasmic proteins of anaerobic respiration, selenate reductase and chlorate reductase. Other members of this family include dimethyl sulphide dehydrogenase and ethylbenzene dehydrogenase.
Probab=24.45 E-value=28 Score=23.54 Aligned_cols=26 Identities=27% Similarity=0.797 Sum_probs=16.6
Q ss_pred cccccccCCCccccccccccccccccC
Q psy5836 35 QDSCLTCAPAHLRVTDLAICLQQCPEG 61 (66)
Q Consensus 35 ~~~C~~C~~~~~~~~~~~~Cv~~Cp~g 61 (66)
...|+.|..- .-......||..||.+
T Consensus 188 ~eKC~~C~~R-ie~G~~PaCv~aCP~~ 213 (321)
T TIGR03478 188 SEKCIGCYPR-IEKGIAPACVKQCPGR 213 (321)
T ss_pred hhhCCCchhh-hccCCCCHHHhhcCcc
Confidence 4689999631 1111246899999976
No 61
>PF12838 Fer4_7: 4Fe-4S dicluster domain; InterPro: IPR001450 This superfamily includes proteins containing domains which bind to iron-sulphur clusters. Members include bacterial ferredoxins, various dehydrogenases, and various reductases. Structure of the domain is an alpha-antiparallel beta sandwich. Ferredoxins are iron-sulphur proteins that mediate electron transfer in a range of metabolic reactions; they fall into several subgroups according to the nature of their iron-sulphur cluster(s) [, ]. One group, originally found in bacteria, has been termed "bacterial-type", in which the active centre is a 4Fe-4S cluster. 4Fe-4S ferredoxins may in turn be subdivided into further groups, based on their sequence properties. Most contain at least one conserved domain, including four Cys residues that bind to a 4Fe-4S centre. ; GO: 0009055 electron carrier activity, 0051536 iron-sulfur cluster binding; PDB: 3CF4_A 1K0T_A 2VKR_C 1JB0_C 3PCQ_C.
Probab=23.11 E-value=24 Score=16.61 Aligned_cols=10 Identities=40% Similarity=1.122 Sum_probs=6.0
Q ss_pred CcccccCCCC
Q psy5836 2 STCVSRCPPR 11 (66)
Q Consensus 2 ~~Cv~~C~~g 11 (66)
+.|+..||.+
T Consensus 6 ~~C~~~CP~~ 15 (52)
T PF12838_consen 6 GACVEACPTG 15 (52)
T ss_dssp -HHHHH-TTH
T ss_pred CchHHhcCcc
Confidence 4578888865
No 62
>PF04942 CC: CC domain; InterPro: IPR007026 This short domain contains four conserved cysteines that are probably required for the formation of two disulphide bonds. The domain is only found in proteins from Caenorhabditis species. The domain is named after the characteristic CC motif.
Probab=22.40 E-value=53 Score=14.90 Aligned_cols=14 Identities=29% Similarity=0.548 Sum_probs=8.6
Q ss_pred cCCCCeeeCCcccc
Q psy5836 7 RCPPRSFPNQGGVC 20 (66)
Q Consensus 7 ~C~~g~y~~~~~~C 20 (66)
.||.||....++.|
T Consensus 19 ~CP~G~~~i~g~~C 32 (36)
T PF04942_consen 19 VCPSGYTVITGNGC 32 (36)
T ss_pred cCCCCCEEECCCcc
Confidence 57777766544554
No 63
>PF13484 Fer4_16: 4Fe-4S double cluster binding domain
Probab=21.85 E-value=16 Score=18.07 Aligned_cols=10 Identities=30% Similarity=0.996 Sum_probs=7.4
Q ss_pred CcccccCCCC
Q psy5836 2 STCVSRCPPR 11 (66)
Q Consensus 2 ~~Cv~~C~~g 11 (66)
+.|++.||.+
T Consensus 6 ~~C~~~CP~~ 15 (67)
T PF13484_consen 6 GKCAEACPTG 15 (67)
T ss_pred hHHHHhCcHh
Confidence 4688889865
Done!