Query psy17098
Match_columns 204
No_of_seqs 199 out of 317
Neff 5.5
Searched_HMMs 46136
Date Fri Aug 16 21:49:20 2013
Command hhsearch -i /work/01045/syshi/Psyhhblits/psy17098.a3m -d /work/01045/syshi/HHdatabase/Cdd.hhm -o /work/01045/syshi/hhsearch_cdd/17098hhsearch_cdd -cpu 12 -v 0
No Hit Prob E-value P-value Score SS Cols Query HMM Template HMM
1 KOG0196|consensus 100.0 3.8E-53 8.3E-58 408.9 14.2 165 1-195 146-317 (996)
2 KOG0196|consensus 99.8 2.9E-21 6.3E-26 188.3 4.8 75 130-204 201-277 (996)
3 PF12662 cEGF: Complement Clr- 89.3 0.21 4.6E-06 28.9 1.2 19 186-204 2-20 (24)
4 PF02793 HRM: Hormone receptor 83.1 0.89 1.9E-05 31.5 2.0 49 130-184 2-55 (66)
5 PF02793 HRM: Hormone receptor 80.3 0.8 1.7E-05 31.7 0.9 49 52-106 2-55 (66)
6 KOG1226|consensus 79.4 4.7 0.0001 40.9 6.2 138 38-195 403-575 (783)
7 PHA02817 EEV Host range protei 78.8 6.3 0.00014 34.4 6.2 91 79-189 48-146 (225)
8 smart00180 EGF_Lam Laminin-typ 77.7 1.9 4.1E-05 28.1 2.0 24 107-138 17-40 (46)
9 KOG1225|consensus 75.3 4 8.6E-05 39.9 4.3 69 105-195 262-336 (525)
10 PF07974 EGF_2: EGF-like domai 74.4 3 6.6E-05 25.4 2.2 23 173-195 6-28 (32)
11 smart00008 HormR Domain presen 72.1 5.3 0.00011 28.2 3.3 48 130-183 3-53 (70)
12 PF12661 hEGF: Human growth fa 70.0 2.5 5.4E-05 20.9 0.8 9 187-195 1-9 (13)
13 cd00055 EGF_Lam Laminin-type e 69.6 3.4 7.4E-05 27.1 1.7 27 106-140 17-43 (50)
14 PHA02639 EEV host range protei 67.5 21 0.00046 32.2 6.9 90 79-189 109-206 (295)
15 PF00053 Laminin_EGF: Laminin 67.1 2.4 5.1E-05 27.6 0.6 28 106-141 16-43 (49)
16 PF07645 EGF_CA: Calcium-bindi 64.7 3.4 7.3E-05 26.2 0.9 20 184-203 23-42 (42)
17 PF00084 Sushi: Sushi domain ( 64.3 9.1 0.0002 24.3 3.0 32 79-110 25-56 (56)
18 PF14670 FXa_inhibition: Coagu 61.0 4.8 0.0001 25.2 1.1 19 185-203 18-36 (36)
19 smart00032 CCP Domain abundant 60.9 12 0.00025 23.5 3.0 32 79-110 26-57 (57)
20 smart00008 HormR Domain presen 58.5 7 0.00015 27.5 1.7 48 52-105 3-53 (70)
21 KOG4260|consensus 57.8 8.8 0.00019 35.0 2.6 22 173-195 156-177 (350)
22 cd00033 CCP Complement control 57.0 13 0.00028 23.4 2.8 32 79-110 25-56 (57)
23 PF07699 GCC2_GCC3: GCC2 and G 56.5 6.4 0.00014 25.6 1.2 13 128-140 10-22 (48)
24 PF01683 EB: EB module; Inter 54.1 18 0.00039 23.6 3.1 17 185-203 36-52 (52)
25 smart00032 CCP Domain abundant 53.1 14 0.0003 23.1 2.4 30 159-188 28-57 (57)
26 KOG0994|consensus 52.8 12 0.00025 40.2 2.9 59 106-195 1035-1093(1758)
27 PF00084 Sushi: Sushi domain ( 52.4 10 0.00023 24.0 1.7 31 158-188 26-56 (56)
28 KOG1214|consensus 42.1 62 0.0013 34.0 6.0 98 32-203 764-863 (1289)
29 KOG4260|consensus 41.6 13 0.00027 34.0 1.0 39 94-141 155-193 (350)
30 PF06725 3D: 3D domain; Inter 39.6 12 0.00026 27.0 0.4 55 1-56 2-57 (75)
31 KOG1836|consensus 31.8 33 0.00072 38.1 2.4 38 95-140 398-438 (1705)
32 PF12946 EGF_MSP1_1: MSP1 EGF 30.8 16 0.00035 23.3 -0.0 18 185-204 20-37 (37)
33 PF01393 Chromo_shadow: Chromo 29.3 13 0.00027 25.6 -0.8 31 30-61 19-49 (58)
34 PHA03001 putative virion core 29.2 51 0.0011 26.7 2.5 28 25-61 48-75 (132)
35 cd00185 TNFR Tumor necrosis fa 29.0 31 0.00066 25.6 1.2 29 112-140 17-46 (98)
36 PHA02831 EEV host range protei 28.3 1.9E+02 0.0041 25.9 6.3 33 157-189 164-196 (268)
37 PF12947 EGF_3: EGF domain; I 27.7 30 0.00066 21.4 0.8 18 184-203 19-36 (36)
38 PF06138 Chordopox_E11: Chordo 27.7 55 0.0012 26.4 2.5 28 25-61 49-76 (130)
39 PF11767 SET_assoc: Histone ly 27.0 40 0.00086 23.9 1.4 14 26-39 31-44 (66)
40 PHA02927 secreted complement-b 26.1 1.8E+02 0.0038 25.4 5.6 32 157-188 230-261 (263)
41 PF13121 DUF3976: Domain of un 25.1 15 0.00033 23.5 -0.9 8 25-32 14-21 (41)
42 PHA02639 EEV host range protei 24.9 3.5E+02 0.0077 24.4 7.5 90 79-189 46-145 (295)
43 KOG4564|consensus 24.1 81 0.0018 30.6 3.4 49 130-184 62-115 (473)
44 smart00179 EGF_CA Calcium-bind 23.9 93 0.002 17.9 2.5 10 186-195 24-33 (39)
45 PF04260 DUF436: Protein of un 23.6 34 0.00075 28.8 0.7 22 26-63 56-77 (172)
46 cd00034 ChSh Chromo Shadow Dom 23.5 27 0.00058 23.6 -0.0 29 31-61 18-46 (54)
47 KOG0994|consensus 23.4 80 0.0017 34.3 3.3 39 95-141 765-807 (1758)
48 COG4475 Uncharacterized protei 21.7 45 0.00097 28.0 1.0 28 25-68 60-87 (180)
49 cd00054 EGF_CA Calcium-binding 21.4 1.1E+02 0.0024 17.2 2.5 10 186-195 24-33 (38)
50 PF07354 Sp38: Zona-pellucida- 21.4 50 0.0011 29.7 1.3 31 111-141 221-252 (271)
51 PF00954 S_locus_glycop: S-loc 20.8 45 0.00098 24.9 0.8 10 188-197 100-109 (110)
52 TIGR01440 conserved hypothetic 20.1 50 0.0011 27.8 1.0 26 26-67 56-81 (172)
No 1
>KOG0196|consensus
Probab=100.00 E-value=3.8e-53 Score=408.89 Aligned_cols=165 Identities=45% Similarity=0.907 Sum_probs=144.8
Q ss_pred CccCCCCCcCC-CC---ceeeeeEEEEe-ecCCCceEEEeccCCceEeEEEEEEEEecChhhhhccccCCCCCCCCCccc
Q psy17098 1 RIAAGEGRFNT-NT---EVNINTEVKSI-PVTKKGVYFAFRDQGACISLLAIKVYYITCPETTINFAHFPTTPTGREVTF 75 (204)
Q Consensus 1 tiAade~~f~~-~~---~~~~N~e~r~~-pls~~GfYlAfqD~GaCval~sVrVyy~~CP~~~~~la~Fp~T~ag~~~~s 75 (204)
||||||+ |++ +. .||||||+|+| ||||+||||||||+||||||+|||||||+||++++|||.||||++|++.++
T Consensus 146 tIAades-f~~~d~~~r~mklNte~rs~Gpltk~GfYlAFqd~GAC~aLlsVrvyYkkCp~~~~nlA~FpeTvtGa~~ts 224 (996)
T KOG0196|consen 146 TIAADES-FTQSDLGDRVMKLNTEVRSVGPLTKKGFYLAFQDQGACLALLSVRVYYKKCPETTRNLAHFPETVTGAESTS 224 (996)
T ss_pred eeeccCc-eecccCccceeEeeeeeEeccccccceEEEEEecCCceEEEEEEEeeeecChHHHhhhccCCCcCCCCCccc
Confidence 8999999 555 32 38999999999 999999999999999999999999999999999999999999999999999
Q ss_pred eEeEEeEEccCceec--CCCceEEeccceEEeeccceeecCCcccccccccccCCcccCCccccccccCCCCCCCCceee
Q psy17098 76 IEQATGRCVENAEEV--GTPTYLCKGDGKWYLPSGGCKCKPGFEADVEKQTCNGTCHACPETTINFAHFPTTPTGREVTF 153 (204)
Q Consensus 76 lv~v~G~CV~~A~~~--~~P~~~C~~~G~W~~p~g~C~C~~Gye~~~~~~~C~~~C~~Cp~g~~k~A~Fp~T~~g~~~~s 153 (204)
||+|.|+||+||++. ..|+|||++||+|++|+|+|.|++|||+...+.+| ++||.|+||+.. | +
T Consensus 225 lv~v~GtCV~nA~~~~~~~p~m~C~~dGeWlvpiG~C~C~aGye~~~~~~~C----~aCp~G~yK~~~------~-~--- 290 (996)
T KOG0196|consen 225 LVEVRGSCVPNAEEEPVPVPRMYCSGDGEWLVPIGGCVCKAGYEEAENGKAC----QACPPGTYKASQ------G-D--- 290 (996)
T ss_pred eEEecccccCCCccCCCCCceeEEcCCCcEEEEcCceeecCCCCcccCCCcc----eeCCCCcccCCC------C-C---
Confidence 999999999999975 56999999999999999999999999986556665 999999999984 2 1
Q ss_pred EEeeecccccCccccCCCeEEecccCeEEccCCCeecCCCce
Q psy17098 154 IEQATGRCVENAEEVGTPTYLCKGDGKWYLPSGGCKCKPGFE 195 (204)
Q Consensus 154 lv~v~G~CV~na~~~~~P~~~C~~~G~W~vp~g~C~C~~Gye 195 (204)
.....|.+|+..... ....|.|..||.
T Consensus 291 --~~C~~CP~~S~s~~e-------------ga~~C~C~~gyy 317 (996)
T KOG0196|consen 291 --SLCLPCPPNSHSSSE-------------GATSCTCENGYY 317 (996)
T ss_pred --CCCCCCCCCCCCCCC-------------CCCcccccCCcc
Confidence 246789999874322 245678888876
No 2
>KOG0196|consensus
Probab=99.83 E-value=2.9e-21 Score=188.29 Aligned_cols=75 Identities=49% Similarity=0.994 Sum_probs=71.0
Q ss_pred ccCCccccccccCCCCCCCCceeeEEeeecccccCcccc--CCCeEEecccCeEEccCCCeecCCCceeccccCccC
Q psy17098 130 HACPETTINFAHFPTTPTGREVTFIEQATGRCVENAEEV--GTPTYLCKGDGKWYLPSGGCKCKPGFEADVEKQTCN 204 (204)
Q Consensus 130 ~~Cp~g~~k~A~Fp~T~~g~~~~slv~v~G~CV~na~~~--~~P~~~C~~~G~W~vp~g~C~C~~Gye~~~~~~~C~ 204 (204)
++||..+.+||+||+|++|++.++||+|+|+||+||++. +.|+|||++||+|+||+|+|.|++|||+..++.+|+
T Consensus 201 kkCp~~~~nlA~FpeTvtGa~~tslv~v~GtCV~nA~~~~~~~p~m~C~~dGeWlvpiG~C~C~aGye~~~~~~~C~ 277 (996)
T KOG0196|consen 201 KKCPETTRNLAHFPETVTGAESTSLVEVRGSCVPNAEEEPVPVPRMYCSGDGEWLVPIGGCVCKAGYEEAENGKACQ 277 (996)
T ss_pred ecChHHHhhhccCCCcCCCCCccceEEecccccCCCccCCCCCceeEEcCCCcEEEEcCceeecCCCCcccCCCcce
Confidence 799999999999999999999999999999999999854 569999999999999999999999999887888885
No 3
>PF12662 cEGF: Complement Clr-like EGF-like
Probab=89.27 E-value=0.21 Score=28.88 Aligned_cols=19 Identities=26% Similarity=0.927 Sum_probs=15.4
Q ss_pred CCeecCCCceeccccCccC
Q psy17098 186 GGCKCKPGFEADVEKQTCN 204 (204)
Q Consensus 186 g~C~C~~Gye~~~~~~~C~ 204 (204)
-.|.|.+||+.+..+.+|.
T Consensus 2 y~C~C~~Gy~l~~d~~~C~ 20 (24)
T PF12662_consen 2 YTCSCPPGYQLSPDGRSCE 20 (24)
T ss_pred EEeeCCCCCcCCCCCCccc
Confidence 3699999999877777774
No 4
>PF02793 HRM: Hormone receptor domain; InterPro: IPR001879 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The secretin-like GPCRs include secretin [], calcitonin [], parathyroid hormone/parathyroid hormone-related peptides [] and vasoactive intestinal peptide [], all of which activate adenylyl cyclase and the phosphatidyl-inositol-calcium pathway. These receptors contain seven transmembrane regions, in a manner reminiscent of the rhodopsins and other receptors believed to interact with G-proteins (however there is no significant sequence identity between these families, the secretin-like receptors thus bear their own unique '7TM' signature). Their N terminus is probably located on the extracellular side of the membrane and potentially glycosylated. This N-terminal region contains a long conserved region which allow the binding of large peptidic ligand such as glucagon, secretin, VIP and PACAP; this region contains five conserved cysteines residues which could be involved in disulphide bond. The C-terminal region of these receptor is probably cytoplasmic. Every receptor gene in this family is encoded on multiple exons, and several of these genes are alternatively spliced to yield functionally distinct products. This domain is found in the extracellular part of some of the secretin-like (family 2) GPCRs including the calcitonin receptor; corticotropin releasing factor receptor 1; diuretic hormone receptor; glucagon-like peptide 1 receptor; and parathyroid hormone peptide receptor.; GO: 0004930 G-protein coupled receptor activity, 0016020 membrane; PDB: 3EHT_A 3EHU_A 2L27_A 3EHS_A 4DLO_B 2QKH_A 3C5T_A 3C59_A 3IOL_A 3N7R_B ....
Probab=83.13 E-value=0.89 Score=31.46 Aligned_cols=49 Identities=29% Similarity=0.552 Sum_probs=36.5
Q ss_pred ccCCccccccccCCCCCCCCceeeEEeeecccccCcc-----ccCCCeEEecccCeEEcc
Q psy17098 130 HACPETTINFAHFPTTPTGREVTFIEQATGRCVENAE-----EVGTPTYLCKGDGKWYLP 184 (204)
Q Consensus 130 ~~Cp~g~~k~A~Fp~T~~g~~~~slv~v~G~CV~na~-----~~~~P~~~C~~~G~W~vp 184 (204)
..||..+-...-.|+|.+|. .+.-.|..... .....+..|..+|+|...
T Consensus 2 ~~C~~~~d~~~cWp~T~~G~------~a~~~CP~~~~~~~~~~~g~a~R~C~~nG~W~~~ 55 (66)
T PF02793_consen 2 SYCPATWDGWLCWPPTPAGE------TASQPCPSGFYGFGSDTSGNATRNCTENGTWEEH 55 (66)
T ss_dssp SCB--EEESSCEE-SBETTE------EEEEECSTTSTCCTSSTTSEEEEEEETTSSBECS
T ss_pred CCCChhhCCcCCCCCCCCCC------EEEeeCcCCccccccccceeEEEEcCCCCcCCCc
Confidence 46899999999999999984 35678998884 123467899999999974
No 5
>PF02793 HRM: Hormone receptor domain; InterPro: IPR001879 G-protein-coupled receptors, GPCRs, constitute a vast protein family that encompasses a wide range of functions (including various autocrine, paracrine and endocrine processes). They show considerable diversity at the sequence level, on the basis of which they can be separated into distinct groups. We use the term clan to describe the GPCRs, as they embrace a group of families for which there are indications of evolutionary relationship, but between which there is no statistically significant similarity in sequence []. The currently known clan members include the rhodopsin-like GPCRs, the secretin-like GPCRs, the cAMP receptors, the fungal mating pheromone receptors, and the metabotropic glutamate receptor family. There is a specialised database for GPCRs (http://www.gpcr.org/7tm/). The secretin-like GPCRs include secretin [], calcitonin [], parathyroid hormone/parathyroid hormone-related peptides [] and vasoactive intestinal peptide [], all of which activate adenylyl cyclase and the phosphatidyl-inositol-calcium pathway. These receptors contain seven transmembrane regions, in a manner reminiscent of the rhodopsins and other receptors believed to interact with G-proteins (however there is no significant sequence identity between these families, the secretin-like receptors thus bear their own unique '7TM' signature). Their N terminus is probably located on the extracellular side of the membrane and potentially glycosylated. This N-terminal region contains a long conserved region which allow the binding of large peptidic ligand such as glucagon, secretin, VIP and PACAP; this region contains five conserved cysteines residues which could be involved in disulphide bond. The C-terminal region of these receptor is probably cytoplasmic. Every receptor gene in this family is encoded on multiple exons, and several of these genes are alternatively spliced to yield functionally distinct products. This domain is found in the extracellular part of some of the secretin-like (family 2) GPCRs including the calcitonin receptor; corticotropin releasing factor receptor 1; diuretic hormone receptor; glucagon-like peptide 1 receptor; and parathyroid hormone peptide receptor.; GO: 0004930 G-protein coupled receptor activity, 0016020 membrane; PDB: 3EHT_A 3EHU_A 2L27_A 3EHS_A 4DLO_B 2QKH_A 3C5T_A 3C59_A 3IOL_A 3N7R_B ....
Probab=80.35 E-value=0.8 Score=31.70 Aligned_cols=49 Identities=29% Similarity=0.538 Sum_probs=36.2
Q ss_pred ecChhhhhccccCCCCCCCCCccceEeEEeEEccCcee-----cCCCceEEeccceEEee
Q psy17098 52 ITCPETTINFAHFPTTPTGREVTFIEQATGRCVENAEE-----VGTPTYLCKGDGKWYLP 106 (204)
Q Consensus 52 ~~CP~~~~~la~Fp~T~ag~~~~slv~v~G~CV~~A~~-----~~~P~~~C~~~G~W~~p 106 (204)
..||.+..++.-.|+|.+|. .|.-.|-..... ...-+..|+.+|+|...
T Consensus 2 ~~C~~~~d~~~cWp~T~~G~------~a~~~CP~~~~~~~~~~~g~a~R~C~~nG~W~~~ 55 (66)
T PF02793_consen 2 SYCPATWDGWLCWPPTPAGE------TASQPCPSGFYGFGSDTSGNATRNCTENGTWEEH 55 (66)
T ss_dssp SCB--EEESSCEE-SBETTE------EEEEECSTTSTCCTSSTTSEEEEEEETTSSBECS
T ss_pred CCCChhhCCcCCCCCCCCCC------EEEeeCcCCccccccccceeEEEEcCCCCcCCCc
Confidence 36999999999999999974 266799998842 13457889999999873
No 6
>KOG1226|consensus
Probab=79.37 E-value=4.7 Score=40.88 Aligned_cols=138 Identities=23% Similarity=0.536 Sum_probs=82.8
Q ss_pred CCceEeE-------EEEEEEEecChhhh--hccccCCCCCCCCCccceEeEEeEEccCce-ecCCCceEEeccceEEeec
Q psy17098 38 QGACISL-------LAIKVYYITCPETT--INFAHFPTTPTGREVTFIEQATGRCVENAE-EVGTPTYLCKGDGKWYLPS 107 (204)
Q Consensus 38 ~GaCval-------~sVrVyy~~CP~~~--~~la~Fp~T~ag~~~~slv~v~G~CV~~A~-~~~~P~~~C~~~G~W~~p~ 107 (204)
.+.|..| ..|.|=-++||... ..+..-|--.+ .+-.+.|.-.|.=.=+ ..+....+|+++|. ..=
T Consensus 403 ~~~c~~i~~gd~v~f~v~~~~~~C~~~~~~~~~~i~pvgf~---e~l~v~v~~~C~C~C~~~~e~~s~~C~g~G~--~~C 477 (783)
T KOG1226|consen 403 RRKCSGITIGDEVTFEVSVTAKKCPPEDQKGSFIIRPVGFT---ETLEVIVQYNCECDCQDQGEPNSALCHGNGT--FVC 477 (783)
T ss_pred cCCCCCcccCceeEEEEecccccCCCccccceEEEccCCCC---cceEEEeeccccccccccCCCCccccCCCCc--EEe
Confidence 4566654 56777788999332 44555444333 2334556666632222 23667889997774 446
Q ss_pred cceeecCCcccccccccccCCcccCCccccccc------cCCCC---CCCCceeeEEeeecccccCcccc----------
Q psy17098 108 GGCKCKPGFEADVEKQTCNGTCHACPETTINFA------HFPTT---PTGREVTFIEQATGRCVENAEEV---------- 168 (204)
Q Consensus 108 g~C~C~~Gye~~~~~~~C~~~C~~Cp~g~~k~A------~Fp~T---~~g~~~~slv~v~G~CV~na~~~---------- 168 (204)
|.|.|++||-. ..| -||..-+..+ .++.. .+| .=.=+.|+|+-|....
T Consensus 478 G~C~C~~G~~G----~~C-----EC~~~~~ss~~~~~~Cr~~~~~~vCSg----rG~C~CGqC~C~~~~~~~i~G~fCEC 544 (783)
T KOG1226|consen 478 GQCRCDEGWLG----KKC-----ECSTDELSSSEEEDKCRENSDSPVCSG----RGDCVCGQCVCHKPDNGKIYGKFCEC 544 (783)
T ss_pred cceecCCCCCC----Ccc-----cCCccccCcHhHHhhccCCCCCCCcCC----CCcEeCCceEecCCCCCceeeeeeec
Confidence 99999999973 345 5666655553 11111 122 1123567777776422
Q ss_pred -C--CCe---EEecccCeEEccCCCeecCCCce
Q psy17098 169 -G--TPT---YLCKGDGKWYLPSGGCKCKPGFE 195 (204)
Q Consensus 169 -~--~P~---~~C~~~G~W~vp~g~C~C~~Gye 195 (204)
+ -++ ..|.+-|+ =--|+|.|.+||.
T Consensus 545 DnfsC~r~~g~lC~g~G~--C~CG~CvC~~Gwt 575 (783)
T KOG1226|consen 545 DNFSCERHKGVLCGGHGR--CECGRCVCNPGWT 575 (783)
T ss_pred cCcccccccCcccCCCCe--EeCCcEEcCCCCc
Confidence 1 122 45888887 6789999999998
No 7
>PHA02817 EEV Host range protein; Provisional
Probab=78.82 E-value=6.3 Score=34.40 Aligned_cols=91 Identities=23% Similarity=0.374 Sum_probs=48.5
Q ss_pred EEeEEccCc-----eecCCCceEEeccceEEeeccceeecCCcccccccccccCCcccCCcc-cccccc--CCCCCCCCc
Q psy17098 79 ATGRCVENA-----EEVGTPTYLCKGDGKWYLPSGGCKCKPGFEADVEKQTCNGTCHACPET-TINFAH--FPTTPTGRE 150 (204)
Q Consensus 79 v~G~CV~~A-----~~~~~P~~~C~~~G~W~~p~g~C~C~~Gye~~~~~~~C~~~C~~Cp~g-~~k~A~--Fp~T~~g~~ 150 (204)
|.=+|.++. .......++|+.||+|..+.-.|. . ..||.- ..+-.. .|. ...
T Consensus 48 Vty~C~~G~~~~gY~L~G~~~i~C~~dG~Ws~~~P~C~------~-----------v~C~~P~i~NG~v~~~~~---~~~ 107 (225)
T PHA02817 48 VTFFCGNNTRGVRYTLVGEKNIICEKDGKWNKEFPVCK------I-----------IRCRFPALQNGFVNGIPD---SKK 107 (225)
T ss_pred EEEEeCCCCCCCCEEECCCCeEEECCCCcCCCCCCeee------e-----------eECCCCCCcCceeEcccc---CCc
Confidence 566898862 223567899999999975443342 1 133311 111000 010 000
Q ss_pred eeeEEeeecccccCccccCCCeEEecccCeEEccCCCee
Q psy17098 151 VTFIEQATGRCVENAEEVGTPTYLCKGDGKWYLPSGGCK 189 (204)
Q Consensus 151 ~~slv~v~G~CV~na~~~~~P~~~C~~~G~W~vp~g~C~ 189 (204)
..--..+.=+|-++=.-.+.....|..+|.|.-+.-.|.
T Consensus 108 y~yg~~Vty~C~~Gy~L~G~~~~tC~~~G~WSp~~P~C~ 146 (225)
T PHA02817 108 FYYESEVSFSCKPGFVLIGTKYSVCGINSSWIPKVPICS 146 (225)
T ss_pred eEcCCEEEEEcCCCCEEcCCCceEECCCCeECCCCCEee
Confidence 001112445665443333456788999999997766665
No 8
>smart00180 EGF_Lam Laminin-type epidermal growth factor-like domai.
Probab=77.73 E-value=1.9 Score=28.05 Aligned_cols=24 Identities=29% Similarity=0.883 Sum_probs=20.4
Q ss_pred ccceeecCCcccccccccccCCcccCCccccc
Q psy17098 107 SGGCKCKPGFEADVEKQTCNGTCHACPETTIN 138 (204)
Q Consensus 107 ~g~C~C~~Gye~~~~~~~C~~~C~~Cp~g~~k 138 (204)
+|.|.|++|+.. ..| ..|++++|.
T Consensus 17 ~G~C~C~~~~~G----~~C----~~C~~g~~g 40 (46)
T smart00180 17 TGQCECKPNVTG----RRC----DRCAPGYYG 40 (46)
T ss_pred CCEEECCCCCCC----CCC----CcCCCCcCC
Confidence 689999999984 344 999999998
No 9
>KOG1225|consensus
Probab=75.30 E-value=4 Score=39.86 Aligned_cols=69 Identities=32% Similarity=0.715 Sum_probs=43.1
Q ss_pred eeccceeecCCcccccccccccCCcccCCccccccccCCCCCCCCceeeEEeeecccccCcc-cc---CCCe--EEeccc
Q psy17098 105 LPSGGCKCKPGFEADVEKQTCNGTCHACPETTINFAHFPTTPTGREVTFIEQATGRCVENAE-EV---GTPT--YLCKGD 178 (204)
Q Consensus 105 ~p~g~C~C~~Gye~~~~~~~C~~~C~~Cp~g~~k~A~Fp~T~~g~~~~slv~v~G~CV~na~-~~---~~P~--~~C~~~ 178 (204)
.--|.|.|.+||.-. .|. . ..||.. . +| -.+-+.|.|+-|-. .. +.++ -.|+..
T Consensus 262 c~~G~CIC~~Gf~G~----dC~-e-~~Cp~~-c---------s~----~g~~~~g~CiC~~g~~G~dCs~~~cpadC~g~ 321 (525)
T KOG1225|consen 262 CVEGRCICPPGFTGD----DCD-E-LVCPVD-C---------SG----GGVCVDGECICNPGYSGKDCSIRRCPADCSGH 321 (525)
T ss_pred EeCCeEeCCCCCcCC----CCC-c-ccCCcc-c---------CC----CceecCCEeecCCCccccccccccCCccCCCC
Confidence 334788888888731 230 0 345554 1 11 23456788888874 11 1122 568888
Q ss_pred CeEEccCCCeecCCCce
Q psy17098 179 GKWYLPSGGCKCKPGFE 195 (204)
Q Consensus 179 G~W~vp~g~C~C~~Gye 195 (204)
|.+. .|+|.|.+||.
T Consensus 322 G~Ci--~G~C~C~~Gy~ 336 (525)
T KOG1225|consen 322 GKCI--DGECLCDEGYT 336 (525)
T ss_pred Cccc--CCceEeCCCCc
Confidence 9998 99999999997
No 10
>PF07974 EGF_2: EGF-like domain; InterPro: IPR013111 A sequence of about thirty to forty amino-acid residues long found in the sequence of epidermal growth factor (EGF) has been shown [, , , , ] to be present, in a more or less conserved form, in a large number of other, mostly animal proteins. The list of proteins currently known to contain one or more copies of an EGF-like pattern is large and varied. The functional significance of EGF domains in what appear to be unrelated proteins is not yet clear. However, a common feature is that these repeats are found in the extracellular domain of membrane-bound proteins or in proteins known to be secreted (exception: prostaglandin G/H synthase). The EGF domain includes six cysteine residues which have been shown (in EGF) to be involved in disulphide bonds. The main structure is a two-stranded beta-sheet followed by a loop to a C-terminal short two-stranded sheet. Subdomains between the conserved cysteines vary in length. This entry contains EGF domains found in a variety of extracellular and membrane proteins
Probab=74.38 E-value=3 Score=25.41 Aligned_cols=23 Identities=35% Similarity=1.008 Sum_probs=19.6
Q ss_pred EEecccCeEEccCCCeecCCCce
Q psy17098 173 YLCKGDGKWYLPSGGCKCKPGFE 195 (204)
Q Consensus 173 ~~C~~~G~W~vp~g~C~C~~Gye 195 (204)
..|+..|.=..+.+.|.|.+||.
T Consensus 6 ~~C~~~G~C~~~~g~C~C~~g~~ 28 (32)
T PF07974_consen 6 NICSGHGTCVSPCGRCVCDSGYT 28 (32)
T ss_pred CccCCCCEEeCCCCEEECCCCCc
Confidence 45888887777789999999997
No 11
>smart00008 HormR Domain present in hormone receptors.
Probab=72.09 E-value=5.3 Score=28.17 Aligned_cols=48 Identities=23% Similarity=0.473 Sum_probs=38.2
Q ss_pred ccCCccccccccCCCCCCCCceeeEEeeecccccCcccc---CCCeEEecccCeEEc
Q psy17098 130 HACPETTINFAHFPTTPTGREVTFIEQATGRCVENAEEV---GTPTYLCKGDGKWYL 183 (204)
Q Consensus 130 ~~Cp~g~~k~A~Fp~T~~g~~~~slv~v~G~CV~na~~~---~~P~~~C~~~G~W~v 183 (204)
..||..+....-.|.|.+|. .+.-.|....... .....+|..+|.|..
T Consensus 3 ~~C~~~wD~~~CWp~t~~G~------~~~~~CP~~~~g~~~~~~a~R~C~~~G~W~~ 53 (70)
T smart00008 3 LGCPATWDGIICWPQTPAGQ------LVEVPCPDYFSGFSYKTGASRNCTENGGWSP 53 (70)
T ss_pred CCCccccccccCCCCCCCCC------EEEecCCCcccCcCCCCCEEEEcCCCCCCcc
Confidence 46999999999999999985 3467899887532 126789999999985
No 12
>PF12661 hEGF: Human growth factor-like EGF; PDB: 2YGQ_A 2E26_A 3A7Q_A 2YGP_A 2YGO_A 1HRE_A 1HAE_A 1HAF_A 1HRF_A.
Probab=69.99 E-value=2.5 Score=20.94 Aligned_cols=9 Identities=44% Similarity=1.497 Sum_probs=7.0
Q ss_pred CeecCCCce
Q psy17098 187 GCKCKPGFE 195 (204)
Q Consensus 187 ~C~C~~Gye 195 (204)
.|.|.+||.
T Consensus 1 ~C~C~~G~~ 9 (13)
T PF12661_consen 1 TCQCPPGWT 9 (13)
T ss_dssp EEEE-TTEE
T ss_pred CccCcCCCc
Confidence 499999998
No 13
>cd00055 EGF_Lam Laminin-type epidermal growth factor-like domain; laminins are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation; the laminin-type epidermal growth factor-like module occurs in tandem arrays; the domain contains 4 disulfide bonds (loops a-d) the first three resemble epidermal growth factor (EGF); the number of copies of this domain in the different forms of laminins is highly variable ranging from 3 up to 22 copies
Probab=69.59 E-value=3.4 Score=27.14 Aligned_cols=27 Identities=26% Similarity=0.765 Sum_probs=21.8
Q ss_pred eccceeecCCcccccccccccCCcccCCccccccc
Q psy17098 106 PSGGCKCKPGFEADVEKQTCNGTCHACPETTINFA 140 (204)
Q Consensus 106 p~g~C~C~~Gye~~~~~~~C~~~C~~Cp~g~~k~A 140 (204)
..|.|.|++|+.. ..| ..|++++|.+.
T Consensus 17 ~~G~C~C~~~~~G----~~C----~~C~~g~~~~~ 43 (50)
T cd00055 17 GTGQCECKPNTTG----RRC----DRCAPGYYGLP 43 (50)
T ss_pred CCCEEeCCCcCCC----CCC----CCCCCCCccCC
Confidence 4799999999983 334 99999999765
No 14
>PHA02639 EEV host range protein; Provisional
Probab=67.50 E-value=21 Score=32.21 Aligned_cols=90 Identities=21% Similarity=0.320 Sum_probs=48.3
Q ss_pred EEeEEccC---c-eecCCCceEEeccceEEeeccceeecCCcccccccccccCCcccCCcc---ccccccCCCC-CCCCc
Q psy17098 79 ATGRCVEN---A-EEVGTPTYLCKGDGKWYLPSGGCKCKPGFEADVEKQTCNGTCHACPET---TINFAHFPTT-PTGRE 150 (204)
Q Consensus 79 v~G~CV~~---A-~~~~~P~~~C~~~G~W~~p~g~C~C~~Gye~~~~~~~C~~~C~~Cp~g---~~k~A~Fp~T-~~g~~ 150 (204)
|.=+|-++ - .......+.|..||+|..+.-.|. . ..| . -|.. ..+....... .-|
T Consensus 109 V~y~C~~g~~~gY~L~G~~~~~C~~dG~WS~~~P~C~------~----i~C----~-~P~i~nG~v~~~~~~~~~~yg-- 171 (295)
T PHA02639 109 IYYVCNEHKGVQYSLVGNEKITCIQDKSWKPDPPICK------M----INC----R-FPALQNGYINGIPSNKKFYYK-- 171 (295)
T ss_pred EEEEeCCCCCCceEECCCCeEEECCCCeECCCCCeee------e----EEe----C-CCCCCCCceEcccCCCceecC--
Confidence 55678774 1 223567899999999986544442 0 012 1 1211 1110000000 111
Q ss_pred eeeEEeeecccccCccccCCCeEEecccCeEEccCCCee
Q psy17098 151 VTFIEQATGRCVENAEEVGTPTYLCKGDGKWYLPSGGCK 189 (204)
Q Consensus 151 ~~slv~v~G~CV~na~~~~~P~~~C~~~G~W~vp~g~C~ 189 (204)
..|.=+|.+.=.-.+.+.+.|..+|+|.-+.=.|.
T Consensus 172 ----~~V~fsC~~GY~L~Gs~~~tC~~nG~Ws~~~P~C~ 206 (295)
T PHA02639 172 ----TRVGFSCKSGFDLVGEKYSTCNINATWFPSIPTCV 206 (295)
T ss_pred ----CEEEEEcCCCCeEcCCCcEEECCCCeECCCCCeEe
Confidence 12345666544434567889999999997776674
No 15
>PF00053 Laminin_EGF: Laminin EGF-like (Domains III and V); InterPro: IPR002049 Laminins [] are the major noncollagenous components of basement membranes that mediate cell adhesion, growth migration, and differentiation. They are composed of distinct but related alpha, beta and gamma chains. The three chains form a cross-shaped molecule that consist of a long arm and three short globular arms. The long arm consist of a coiled coil structure contributed by all three chains and cross-linked by interchain disulphide bonds. Beside different types of globular domains each subunit contains, in its first half, consecutive repeats of about 60 amino acids in length that include eight conserved cysteines []. The tertiary structure [, ] of this domain is remotely similar in its N-terminal to that of the EGF-like module (see PDOC00021 from PROSITEDOC). It is known as a 'LE' or 'laminin-type EGF-like' domain. The number of copies of the LE domain in the different forms of laminins is highly variable; from 3 up to 22 copies have been found. A schematic representation of the topology of the four disulphide bonds in the LE domain is shown below. +-------------------+ +-|-----------+ | +--------+ +-----------------+ | | | | | | | | xxCxCxxxxxxxxxxxCxxxxxxxCxxCxxxxxGxxCxxCxxgaagxxxxxxxxxxxCxx sssssssssssssssssssssssssssssssssss 'C': conserved cysteine involved in a disulphide bond 'a': conserved aromatic residue 'G': conserved glycine (lower case = less conserved) 's': region similar to the EGF-like domain In mouse laminin gamma-1 chain, the seventh LE domain has been shown to be the only one that binds with a high affinity to nidogen []. The binding-sites are located on the surface within the loops C1-C3 and C5-C6 [, ]. Long consecutive arrays of LE domains in laminins form rod-like elements of limited flexibility [], which determine the spacing in the formation of laminin networks of basement membranes [].; PDB: 3TBD_A 3ZYG_B 3ZYI_B 2Y38_A 1KLO_A 1NPE_B 3ZYJ_B 1TLE_A.
Probab=67.06 E-value=2.4 Score=27.57 Aligned_cols=28 Identities=29% Similarity=0.793 Sum_probs=21.1
Q ss_pred eccceeecCCcccccccccccCCcccCCcccccccc
Q psy17098 106 PSGGCKCKPGFEADVEKQTCNGTCHACPETTINFAH 141 (204)
Q Consensus 106 p~g~C~C~~Gye~~~~~~~C~~~C~~Cp~g~~k~A~ 141 (204)
..|.|.|+++|+. .. |..|..++|.+..
T Consensus 16 ~~G~C~C~~~~~G----~~----C~~C~~g~~~~~~ 43 (49)
T PF00053_consen 16 STGQCVCKPGTTG----PR----CDQCKPGYFGLPS 43 (49)
T ss_dssp TCEEESBSTTEES----TT----S-EE-TTEECSTT
T ss_pred CCCEEeccccccC----Cc----CcCCCCccccccC
Confidence 5789999999983 34 4999999998864
No 16
>PF07645 EGF_CA: Calcium-binding EGF domain; InterPro: IPR001881 A sequence of about forty amino-acid residues found in epidermal growth factor (EGF) has been shown [, , , , , ] to be present in a large number of membrane-bound and extracellular, mostly animal, proteins. Many of these proteins require calcium for their biological function and a calcium-binding site has been found at the N terminus of some EGF-like domains []. Calcium-binding may be crucial for numerous protein-protein interactions. For human coagulation factor IX it has been shown [] that the calcium-ligands form a pentagonal bipyramid. The first, third and fourth conserved negatively charged or polar residues are side chain ligands. The latter is possibly hydroxylated (see aspartic acid and asparagine hydroxylation site) []. A conserved aromatic residue, as well as the second conserved negative residue, are thought to be involved in stabilising the calcium-binding site. As in non-calcium binding EGF-like domains, there are six conserved cysteines and the structure of both types is very similar as calcium-binding induces only strictly local structural changes []. +------------------+ +---------+ | | | | nxnnC-x(3,14)-C-x(3,7)-CxxbxxxxaxC-x(1,6)-C-x(8,13)-Cx | | +------------------+ 'n': negatively charged or polar residue [DEQN] 'b': possibly beta-hydroxylated residue [DN] 'a': aromatic amino acid 'C': cysteine, involved in disulphide bond 'x': any amino acid. ; GO: 0005509 calcium ion binding; PDB: 2VJ3_A 1TOZ_A 1LMJ_A 1UZQ_A 1UZK_A 1UZJ_B 1UZP_A 1EMO_A 1EMN_A 2RR0_A ....
Probab=64.69 E-value=3.4 Score=26.15 Aligned_cols=20 Identities=40% Similarity=0.934 Sum_probs=14.9
Q ss_pred cCCCeecCCCceeccccCcc
Q psy17098 184 PSGGCKCKPGFEADVEKQTC 203 (204)
Q Consensus 184 p~g~C~C~~Gye~~~~~~~C 203 (204)
..-.|.|++||+.+....+|
T Consensus 23 Gsy~C~C~~Gy~~~~~~~~C 42 (42)
T PF07645_consen 23 GSYSCSCPPGYELNDDGTTC 42 (42)
T ss_dssp TEEEEEESTTEEECTTSSEE
T ss_pred CCEEeeCCCCcEECCCCCcC
Confidence 45679999999966666555
No 17
>PF00084 Sushi: Sushi domain (SCR repeat); InterPro: IPR000436 Sushi domains are also known as Complement control protein (CCP) modules, or short consensus repeats (SCR), exist in a wide variety of complement and adhesion proteins. The structure is known for this domain, it is based on a beta-sandwich arrangement; one face made up of three beta-strands hydrogen-bonded to form a triple-stranded region at its centre and the other face formed from two separate beta-strands []. CD21 (also called C3d receptor, CR2, Epstein Barr virus receptor or EBV-R) is the receptor for EBV and for C3d, C3dg and iC3b. Complement components may activate B cells through CD21. CD21 is part of a large signal-transduction complex that also involves CD19, CD81, and Leu13. Some of the proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane. Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Complement decay-accelerating factor (Antigen CD55) belongs to the Cromer blood group system and is associated with Cr(a), Dr(a), Es(a), Tc(a/b/c), Wd(a), WES(a/b), IFC and UMC antigens. Complement receptor type 1 (C3b/C4b receptor) (Antigen CD35) belongs to the Knops blood group system and is associated with Kn(a/b), McC(a), Sl(a) and Yk(a) antigens. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; PDB: 1HFH_A 1HFI_A 1E5G_A 2UWN_A 2V8E_A 2XWB_H 3HS0_I 2OK5_A 2XWJ_J 3HRZ_D ....
Probab=64.30 E-value=9.1 Score=24.32 Aligned_cols=32 Identities=31% Similarity=0.662 Sum_probs=24.2
Q ss_pred EEeEEccCceecCCCceEEeccceEEeeccce
Q psy17098 79 ATGRCVENAEEVGTPTYLCKGDGKWYLPSGGC 110 (204)
Q Consensus 79 v~G~CV~~A~~~~~P~~~C~~~G~W~~p~g~C 110 (204)
+.=+|-.+-.......+.|+.+|+|..+.-.|
T Consensus 25 ~~~~C~~Gy~~~g~~~~~C~~~g~W~~~~p~C 56 (56)
T PF00084_consen 25 VTFSCNPGYELSGSSTITCQSNGQWSPPPPTC 56 (56)
T ss_dssp EEEEESTTEEEESSSEEEEETTSEESSTTSEE
T ss_pred EEEcCCCCCEecCCCEEEECCCCEECCCCCCC
Confidence 55688777776666899999999998664433
No 18
>PF14670 FXa_inhibition: Coagulation Factor Xa inhibitory site; PDB: 3Q3K_B 1NFY_B 1LQD_A 1G2L_B 1IQF_L 2UWP_B 2VH6_B 3KQC_L 2P93_L 2BQW_A ....
Probab=60.98 E-value=4.8 Score=25.23 Aligned_cols=19 Identities=37% Similarity=0.945 Sum_probs=13.7
Q ss_pred CCCeecCCCceeccccCcc
Q psy17098 185 SGGCKCKPGFEADVEKQTC 203 (204)
Q Consensus 185 ~g~C~C~~Gye~~~~~~~C 203 (204)
.-.|.|.+||..+.++.+|
T Consensus 18 ~~~C~C~~Gy~L~~D~~tC 36 (36)
T PF14670_consen 18 SYRCSCPPGYKLAEDGRTC 36 (36)
T ss_dssp SEEEE-STTEEE-TTSSSE
T ss_pred ceEeECCCCCEECcCCCCC
Confidence 4579999999987777665
No 19
>smart00032 CCP Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR). The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. A missense mutation in seventh CCP domain causes deficiency of the b subunit of factor XIII.
Probab=60.93 E-value=12 Score=23.46 Aligned_cols=32 Identities=31% Similarity=0.647 Sum_probs=22.6
Q ss_pred EEeEEccCceecCCCceEEeccceEEeeccce
Q psy17098 79 ATGRCVENAEEVGTPTYLCKGDGKWYLPSGGC 110 (204)
Q Consensus 79 v~G~CV~~A~~~~~P~~~C~~~G~W~~p~g~C 110 (204)
+.=.|-++=.......+.|..+|+|..+.-.|
T Consensus 26 ~~~~C~~Gy~l~g~~~~~C~~~g~Ws~~~p~C 57 (57)
T smart00032 26 VTYSCNPGYTLIGSSTITCLEDGTWSPPAPTC 57 (57)
T ss_pred EEEEcCCCCEEcCCCeeEECCCCEECCCCCcC
Confidence 45577777554556899999999998654333
No 20
>smart00008 HormR Domain present in hormone receptors.
Probab=58.49 E-value=7 Score=27.53 Aligned_cols=48 Identities=23% Similarity=0.486 Sum_probs=37.2
Q ss_pred ecChhhhhccccCCCCCCCCCccceEeEEeEEccCceec---CCCceEEeccceEEe
Q psy17098 52 ITCPETTINFAHFPTTPTGREVTFIEQATGRCVENAEEV---GTPTYLCKGDGKWYL 105 (204)
Q Consensus 52 ~~CP~~~~~la~Fp~T~ag~~~~slv~v~G~CV~~A~~~---~~P~~~C~~~G~W~~ 105 (204)
..||.+-.++.-.|.|.+|.. |.-.|=...... ..-...|+.+|.|..
T Consensus 3 ~~C~~~wD~~~CWp~t~~G~~------~~~~CP~~~~g~~~~~~a~R~C~~~G~W~~ 53 (70)
T smart00008 3 LGCPATWDGIICWPQTPAGQL------VEVPCPDYFSGFSYKTGASRNCTENGGWSP 53 (70)
T ss_pred CCCccccccccCCCCCCCCCE------EEecCCCcccCcCCCCCEEEEcCCCCCCcc
Confidence 469999999999999999753 455787766532 126788999999975
No 21
>KOG4260|consensus
Probab=57.79 E-value=8.8 Score=34.98 Aligned_cols=22 Identities=45% Similarity=0.978 Sum_probs=17.2
Q ss_pred EEecccCeEEccCCCeecCCCce
Q psy17098 173 YLCKGDGKWYLPSGGCKCKPGFE 195 (204)
Q Consensus 173 ~~C~~~G~W~vp~g~C~C~~Gye 195 (204)
-+|.+||. .-..|.|.|.+||.
T Consensus 156 G~C~GdGs-R~GsGkCkC~~GY~ 177 (350)
T KOG4260|consen 156 GSCHGDGS-REGSGKCKCETGYT 177 (350)
T ss_pred CcccCCCC-CCCCCcccccCCCC
Confidence 45777773 45789999999997
No 22
>cd00033 CCP Complement control protein (CCP) modules (aka short consensus repeats SCRs or SUSHI repeats) have been identified in several proteins of the complement system. SUSHI repeats (short complement-like repeat, SCR) are abundant in complement control proteins. The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. Typically, 2 to 4 modules contribute to a binding site, implying that the orientation of the modules to each other is critical for function.
Probab=56.99 E-value=13 Score=23.45 Aligned_cols=32 Identities=34% Similarity=0.670 Sum_probs=23.4
Q ss_pred EEeEEccCceecCCCceEEeccceEEeeccce
Q psy17098 79 ATGRCVENAEEVGTPTYLCKGDGKWYLPSGGC 110 (204)
Q Consensus 79 v~G~CV~~A~~~~~P~~~C~~~G~W~~p~g~C 110 (204)
+.=.|-++=.......+.|..+|+|..+.-.|
T Consensus 25 ~~~~C~~Gy~~~g~~~~~C~~~g~W~~~~p~C 56 (57)
T cd00033 25 VTYSCNEGYTLVGSSTITCTENGGWSPPPPTC 56 (57)
T ss_pred EEEECCCCCeEeCCCeeEECCCCeECCCCccc
Confidence 45578776555567799999999998665444
No 23
>PF07699 GCC2_GCC3: GCC2 and GCC3; InterPro: IPR011641 Protein phosphorylation, which plays a key role in most cellular activities, is a reversible process mediated by protein kinases and phosphoprotein phosphatases. Protein kinases catalyse the transfer of the gamma phosphate from nucleotide triphosphates (often ATP) to one or more amino acid residues in a protein substrate side chain, resulting in a conformational change affecting protein function. Phosphoprotein phosphatases catalyse the reverse process. Protein kinases fall into three broad classes, characterised with respect to substrate specificity []: Serine/threonine-protein kinases Tyrosine-protein kinases Dual specific protein kinases (e.g. MEK - phosphorylates both Thr and Tyr on target proteins) Protein kinase function has been evolutionarily conserved from Escherichia coli to human []. Protein kinases play a role in a multitude of cellular processes, including division, proliferation, apoptosis, and differentiation []. Phosphorylation usually results in a functional change of the target protein by changing enzyme activity, cellular location, or association with other proteins. The catalytic subunits of protein kinases are highly conserved, and several structures have been solved [], leading to large screens to develop kinase-specific inhibitors for the treatments of a number of diseases []. Tyrosine-protein kinases can transfer a phosphate group from ATP to a tyrosine residue in a protein. These enzymes can be divided into two main groups []: Receptor tyrosine kinases (RTK), which are transmembrane proteins involved in signal transduction; they play key roles in growth, differentiation, metabolism, adhesion, motility, death and oncogenesis []. RTKs are composed of 3 domains: an extracellular domain (binds ligand), a transmembrane (TM) domain, and an intracellular catalytic domain (phosphorylates substrate). The TM domain plays an important role in the dimerisation process necessary for signal transduction []. Cytoplasmic / non-receptor tyrosine kinases, which act as regulatory proteins, playing key roles in cell differentiation, motility, proliferation, and survival. For example, the Src-family of protein-tyrosine kinases []. This entry represents various ephrin type A and B receptors, which have tyrosine kinase activity.
Probab=56.48 E-value=6.4 Score=25.57 Aligned_cols=13 Identities=31% Similarity=0.777 Sum_probs=9.9
Q ss_pred CcccCCccccccc
Q psy17098 128 TCHACPETTINFA 140 (204)
Q Consensus 128 ~C~~Cp~g~~k~A 140 (204)
.|+.||.|+|+..
T Consensus 10 ~C~~Cp~GtYq~~ 22 (48)
T PF07699_consen 10 KCQPCPKGTYQDE 22 (48)
T ss_pred ccCCCCCCccCCc
Confidence 4588888888755
No 24
>PF01683 EB: EB module; InterPro: IPR006149 The EB domain has no known function. It is found in several Caenorhabditis sp. and Drosophila sp. proteins. The domain contains 8 conserved cysteines that probably form four disulphide bridges and is found associated with kunitz domains IPR002223 from INTERPRO
Probab=54.12 E-value=18 Score=23.55 Aligned_cols=17 Identities=35% Similarity=1.079 Sum_probs=13.0
Q ss_pred CCCeecCCCceeccccCcc
Q psy17098 185 SGGCKCKPGFEADVEKQTC 203 (204)
Q Consensus 185 ~g~C~C~~Gye~~~~~~~C 203 (204)
.|.|.|.+||.+ .+.+|
T Consensus 36 ~g~C~C~~g~~~--~~~~C 52 (52)
T PF01683_consen 36 NGRCQCPPGYVE--VGGRC 52 (52)
T ss_pred CCEeECCCCCEe--cCCCC
Confidence 489999999985 44454
No 25
>smart00032 CCP Domain abundant in complement control proteins; SUSHI repeat; short complement-like repeat (SCR). The complement control protein (CCP) modules (also known as short consensus repeats SCRs or SUSHI repeats) contain approximately 60 amino acid residues and have been identified in several proteins of the complement system. A missense mutation in seventh CCP domain causes deficiency of the b subunit of factor XIII.
Probab=53.14 E-value=14 Score=23.10 Aligned_cols=30 Identities=30% Similarity=0.661 Sum_probs=19.5
Q ss_pred cccccCccccCCCeEEecccCeEEccCCCe
Q psy17098 159 GRCVENAEEVGTPTYLCKGDGKWYLPSGGC 188 (204)
Q Consensus 159 G~CV~na~~~~~P~~~C~~~G~W~vp~g~C 188 (204)
=.|.+.=.-.....+.|..+|+|..+.-.|
T Consensus 28 ~~C~~Gy~l~g~~~~~C~~~g~Ws~~~p~C 57 (57)
T smart00032 28 YSCNPGYTLIGSSTITCLEDGTWSPPAPTC 57 (57)
T ss_pred EEcCCCCEEcCCCeeEECCCCEECCCCCcC
Confidence 345444332344789999999999765544
No 26
>KOG0994|consensus
Probab=52.77 E-value=12 Score=40.20 Aligned_cols=59 Identities=36% Similarity=0.835 Sum_probs=40.5
Q ss_pred eccceeecCCcccccccccccCCcccCCccccccccCCCCCCCCceeeEEeeecccccCccccCCCeEEecccCeEEccC
Q psy17098 106 PSGGCKCKPGFEADVEKQTCNGTCHACPETTINFAHFPTTPTGREVTFIEQATGRCVENAEEVGTPTYLCKGDGKWYLPS 185 (204)
Q Consensus 106 p~g~C~C~~Gye~~~~~~~C~~~C~~Cp~g~~k~A~Fp~T~~g~~~~slv~v~G~CV~na~~~~~P~~~C~~~G~W~vp~ 185 (204)
.+|.|.|.+.-+ +..| -.|.++++|+|+ |. -.-.|--|++ ..| .|| .-+
T Consensus 1035 ~tGQCpClpNv~----G~~C----DqCA~N~w~laS------G~-------GCe~C~Cd~~--~~p--qCN------~ft 1083 (1758)
T KOG0994|consen 1035 FTGQCPCLPNVQ----GVRC----DQCAENHWNLAS------GE-------GCEPCNCDPI--GGP--QCN------EFT 1083 (1758)
T ss_pred ccCcCCCCcccc----cccc----cccccchhcccc------CC-------CCCccCCCcc--CCc--ccc------ccc
Confidence 457777775444 3444 899999999995 42 2456666663 344 566 368
Q ss_pred CCeecCCCce
Q psy17098 186 GGCKCKPGFE 195 (204)
Q Consensus 186 g~C~C~~Gye 195 (204)
|+|.|+|||-
T Consensus 1084 GQCqCkpGfG 1093 (1758)
T KOG0994|consen 1084 GQCQCKPGFG 1093 (1758)
T ss_pred cceeccCCCC
Confidence 9999999995
No 27
>PF00084 Sushi: Sushi domain (SCR repeat); InterPro: IPR000436 Sushi domains are also known as Complement control protein (CCP) modules, or short consensus repeats (SCR), exist in a wide variety of complement and adhesion proteins. The structure is known for this domain, it is based on a beta-sandwich arrangement; one face made up of three beta-strands hydrogen-bonded to form a triple-stranded region at its centre and the other face formed from two separate beta-strands []. CD21 (also called C3d receptor, CR2, Epstein Barr virus receptor or EBV-R) is the receptor for EBV and for C3d, C3dg and iC3b. Complement components may activate B cells through CD21. CD21 is part of a large signal-transduction complex that also involves CD19, CD81, and Leu13. Some of the proteins in this group are responsible for the molecular basis of the blood group antigens, surface markers on the outside of the red blood cell membrane. Most of these markers are proteins, but some are carbohydrates attached to lipids or proteins [Reid M.E., Lomas-Francis C. The Blood Group Antigen FactsBook Academic Press, London / San Diego, (1997)]. Complement decay-accelerating factor (Antigen CD55) belongs to the Cromer blood group system and is associated with Cr(a), Dr(a), Es(a), Tc(a/b/c), Wd(a), WES(a/b), IFC and UMC antigens. Complement receptor type 1 (C3b/C4b receptor) (Antigen CD35) belongs to the Knops blood group system and is associated with Kn(a/b), McC(a), Sl(a) and Yk(a) antigens. CD molecules are leucocyte antigens on cell surfaces. CD antigens nomenclature is updated at Protein Reviews On The Web (http://prow.nci.nih.gov/). ; PDB: 1HFH_A 1HFI_A 1E5G_A 2UWN_A 2V8E_A 2XWB_H 3HS0_I 2OK5_A 2XWJ_J 3HRZ_D ....
Probab=52.40 E-value=10 Score=24.04 Aligned_cols=31 Identities=32% Similarity=0.682 Sum_probs=21.7
Q ss_pred ecccccCccccCCCeEEecccCeEEccCCCe
Q psy17098 158 TGRCVENAEEVGTPTYLCKGDGKWYLPSGGC 188 (204)
Q Consensus 158 ~G~CV~na~~~~~P~~~C~~~G~W~vp~g~C 188 (204)
.=+|-.+-...+...+.|..+|.|.-+.-.|
T Consensus 26 ~~~C~~Gy~~~g~~~~~C~~~g~W~~~~p~C 56 (56)
T PF00084_consen 26 TFSCNPGYELSGSSTITCQSNGQWSPPPPTC 56 (56)
T ss_dssp EEEESTTEEEESSSEEEEETTSEESSTTSEE
T ss_pred EEcCCCCCEecCCCEEEECCCCEECCCCCCC
Confidence 4456555554455889999999999776544
No 28
>KOG1214|consensus
Probab=42.12 E-value=62 Score=33.99 Aligned_cols=98 Identities=21% Similarity=0.566 Sum_probs=0.0
Q ss_pred EEEeccC-CceEeEEEEEEEEecChhhhhccccCCCCCCCCCccceEeEEeEEccCceecCCC-ceEEeccceEEeeccc
Q psy17098 32 YFAFRDQ-GACISLLAIKVYYITCPETTINFAHFPTTPTGREVTFIEQATGRCVENAEEVGTP-TYLCKGDGKWYLPSGG 109 (204)
Q Consensus 32 YlAfqD~-GaCval~sVrVyy~~CP~~~~~la~Fp~T~ag~~~~slv~v~G~CV~~A~~~~~P-~~~C~~~G~W~~p~g~ 109 (204)
|-+|-|- +.||.|.-= +.+. .|+..+..=..+ +..|.-.| -..-.
T Consensus 764 gy~F~dd~~tCV~i~~p---------------------ap~n---------~Ce~g~h~C~i~g~a~c~~hG---gs~y~ 810 (1289)
T KOG1214|consen 764 GYEFADDRHTCVLITPP---------------------APAN---------PCEDGSHTCAIAGQARCVHHG---GSTYS 810 (1289)
T ss_pred cceeccCCcceEEecCC---------------------CCCC---------ccccCccccCcCCceEEEecC---CceEE
Q ss_pred eeecCCcccccccccccCCcccCCccccccccCCCCCCCCceeeEEeeecccccCccccCCCeEEecccCeEEccCCCee
Q psy17098 110 CKCKPGFEADVEKQTCNGTCHACPETTINFAHFPTTPTGREVTFIEQATGRCVENAEEVGTPTYLCKGDGKWYLPSGGCK 189 (204)
Q Consensus 110 C~C~~Gye~~~~~~~C~~~C~~Cp~g~~k~A~Fp~T~~g~~~~slv~v~G~CV~na~~~~~P~~~C~~~G~W~vp~g~C~ 189 (204)
|.|.+||. .++..|... -.|.+.---.+ -.|+..+ ....|.
T Consensus 811 C~CLPGfs--GDG~~c~dv-DeC~psrChp~------------------A~Cyntp------------------gsfsC~ 851 (1289)
T KOG1214|consen 811 CACLPGFS--GDGHQCTDV-DECSPSRCHPA------------------ATCYNTP------------------GSFSCR 851 (1289)
T ss_pred EeecCCcc--CCccccccc-cccCccccCCC------------------ceEecCC------------------Ccceee
Q ss_pred cCCCceeccccCcc
Q psy17098 190 CKPGFEADVEKQTC 203 (204)
Q Consensus 190 C~~Gye~~~~~~~C 203 (204)
|.+||+ ..+.+|
T Consensus 852 C~pGy~--GDGf~C 863 (1289)
T KOG1214|consen 852 CQPGYY--GDGFQC 863 (1289)
T ss_pred cccCcc--CCCcee
No 29
>KOG4260|consensus
Probab=41.59 E-value=13 Score=33.98 Aligned_cols=39 Identities=31% Similarity=0.684 Sum_probs=28.3
Q ss_pred ceEEeccceEEeeccceeecCCcccccccccccCCcccCCcccccccc
Q psy17098 94 TYLCKGDGKWYLPSGGCKCKPGFEADVEKQTCNGTCHACPETTINFAH 141 (204)
Q Consensus 94 ~~~C~~~G~W~~p~g~C~C~~Gye~~~~~~~C~~~C~~Cp~g~~k~A~ 141 (204)
..+|.+||+ .-.+|.|.|.+||.. .. |..|-.++|...+
T Consensus 155 nG~C~GdGs-R~GsGkCkC~~GY~G----p~----C~~Cg~eyfes~R 193 (350)
T KOG4260|consen 155 NGSCHGDGS-REGSGKCKCETGYTG----PL----CRYCGIEYFESSR 193 (350)
T ss_pred CCcccCCCC-CCCCCcccccCCCCC----cc----ccccchHHHHhhc
Confidence 567777774 356899999999983 23 4888887776543
No 30
>PF06725 3D: 3D domain; InterPro: IPR010611 This short presumed domain contains three conserved aspartate residues, hence the name 3D. This conservation is suggestive of a cation binding function. The central aspartate is found in a DTG motif that is suggestive of a peptidase like active site.; GO: 0004553 hydrolase activity, hydrolyzing O-glycosyl compounds, 0009254 peptidoglycan turnover, 0019867 outer membrane; PDB: 2G5D_A 2G6G_A 3CZB_A 2PI8_D 2AE0_X 2PIC_A 2GAE_A 2PJJ_A 2PNW_A.
Probab=39.63 E-value=12 Score=26.96 Aligned_cols=55 Identities=22% Similarity=0.224 Sum_probs=29.3
Q ss_pred CccCCCCCcCCCCceeeeeEEEEe-ecCCCceEEEeccCCceEeEEEEEEEEecChh
Q psy17098 1 RIAAGEGRFNTNTEVNINTEVKSI-PVTKKGVYFAFRDQGACISLLAIKVYYITCPE 56 (204)
Q Consensus 1 tiAade~~f~~~~~~~~N~e~r~~-pls~~GfYlAfqD~GaCval~sVrVyy~~CP~ 56 (204)
|||.|-+.++.++.+-|+.+...- ...-.|+|+ -||+|+=+-=-++-||+-.=.+
T Consensus 2 SiAvD~~viPlGt~v~i~~~~~~~~~~~~~~~~~-aqDtGgAIkG~riDif~g~g~~ 57 (75)
T PF06725_consen 2 SIAVDPSVIPLGTPVYIPGPPLDDGGKPFNGLYI-AQDTGGAIKGNRIDIFFGSGDE 57 (75)
T ss_dssp EEEE-TTTS-TT-EEEEEEETTETTTSEEEEEEE-EEEE-TT--SSEEEEEEEESHH
T ss_pred eEEECCccCCCCCEEEEEeccccCCCceeeEEEE-EeccCccccCCEEEEEeCCHHH
Confidence 689999878777776666641000 011246677 6999985554577777654333
No 31
>KOG1836|consensus
Probab=31.75 E-value=33 Score=38.14 Aligned_cols=38 Identities=29% Similarity=0.769 Sum_probs=26.6
Q ss_pred eEEeccceEE---eeccceeecCCcccccccccccCCcccCCccccccc
Q psy17098 95 YLCKGDGKWY---LPSGGCKCKPGFEADVEKQTCNGTCHACPETTINFA 140 (204)
Q Consensus 95 ~~C~~~G~W~---~p~g~C~C~~Gye~~~~~~~C~~~C~~Cp~g~~k~A 140 (204)
-+|++.|+-. .-.|+|.|++|+.. +.| -.|-++++.+-
T Consensus 398 C~c~~~gsl~~~~~~~g~c~c~P~v~g----~~c----D~ca~g~~~~~ 438 (1705)
T KOG1836|consen 398 CICNSAGSLSAQCDDTGRCQCKPGVTG----QKC----DRCAPGFYGLP 438 (1705)
T ss_pred cCCCCccchhhhhccCCcceecccccc----ccc----CccCcccccCc
Confidence 4566666543 33599999999984 234 78888888775
No 32
>PF12946 EGF_MSP1_1: MSP1 EGF domain 1; InterPro: IPR024730 This EGF-like domain is found at the C terminus of the malaria parasite MSP1 protein. MSP1 is the merozoite surface protein 1. This domain is part of the C-terminal fragment that is proteolytically processed from the the rest of the protein and is left attached to the surface of the invading parasite [].; PDB: 1N1I_C 2FLG_A 1CEJ_A 2NPR_A 1B9W_A 1OB1_F.
Probab=30.84 E-value=16 Score=23.27 Aligned_cols=18 Identities=28% Similarity=0.737 Sum_probs=11.9
Q ss_pred CCCeecCCCceeccccCccC
Q psy17098 185 SGGCKCKPGFEADVEKQTCN 204 (204)
Q Consensus 185 ~g~C~C~~Gye~~~~~~~C~ 204 (204)
.-+|.|.+||.. .++.|+
T Consensus 20 ~eecrCllgyk~--~~~~C~ 37 (37)
T PF12946_consen 20 SEECRCLLGYKK--VGGKCV 37 (37)
T ss_dssp EEEEEE-TTEEE--ETTEEE
T ss_pred CEEEEeeCCccc--cCCCcC
Confidence 346899999995 555563
No 33
>PF01393 Chromo_shadow: Chromo shadow domain Web page maintained by Rein Aasland; InterPro: IPR008251 Chromo shadow domain is distantly related to chromo domain. It is always found in association with a chromo domain. The CHROMO (CHRromatin Organization MOdifier) domain [, , , ] is a conserved region of around 60 amino acids, originally identified in Drosophila modifiers of variegation. These are proteins that alter the structure of chromatin to the condensed morphology of heterochromatin, a cytologically visible condition where gene expression is repressed. In one of these proteins, Polycomb, the chromo domain has been shown to be important for chromatin targeting. Proteins that contain a chromo domain appear to fall into 3 classes. The first class includes proteins having an N-terminal chromo domain followed by a region termed the chromo shadow domain [], eg. Drosophila and human heterochromatin protein Su(var)205 (HP1); and mammalian modifier 1 and modifier 2. The second class includes proteins with a single chromo domain, eg. Drosophila protein Polycomb (Pc); mammalian modifier 3; human Mi-2 autoantigenand and several yeast and Caenorhabditis elegans hypothetical proteins. In the third class paired tandem chromo domains are found, eg. in mammalian DNA-binding/helicase proteins CHD-1 to CHD-4 and yeast protein CHD1.; GO: 0005634 nucleus; PDB: 3Q6S_C 2FMM_B 3P7J_B 1E0B_B 3I3C_A 1DZ1_B 1S4Z_A 3KUP_D.
Probab=29.29 E-value=13 Score=25.65 Aligned_cols=31 Identities=23% Similarity=0.463 Sum_probs=26.4
Q ss_pred ceEEEeccCCceEeEEEEEEEEecChhhhhcc
Q psy17098 30 GVYFAFRDQGACISLLAIKVYYITCPETTINF 61 (204)
Q Consensus 30 GfYlAfqD~GaCval~sVrVyy~~CP~~~~~l 61 (204)
-||+=|.|.|- +.+++-++-|.+||..+..|
T Consensus 19 ~~likwk~~~~-~~~v~~~~~~~k~Pq~vI~F 49 (58)
T PF01393_consen 19 MFLIKWKNSGE-KDLVPSKEANEKCPQKVIKF 49 (58)
T ss_dssp EEEEEETTSSS-EEEEEHHHHHHHSHHHHHHH
T ss_pred EEEEEECCCCc-eEEeeHHHHHHHCcHHHHHH
Confidence 38999999886 88888999999999988654
No 34
>PHA03001 putative virion core protein; Provisional
Probab=29.20 E-value=51 Score=26.66 Aligned_cols=28 Identities=25% Similarity=0.660 Sum_probs=22.0
Q ss_pred ecCCCceEEEeccCCceEeEEEEEEEEecChhhhhcc
Q psy17098 25 PVTKKGVYFAFRDQGACISLLAIKVYYITCPETTINF 61 (204)
Q Consensus 25 pls~~GfYlAfqD~GaCval~sVrVyy~~CP~~~~~l 61 (204)
.|.+.=||||++|.- |||++|---...+
T Consensus 48 ~v~eStFylvvrd~d---------iFyfkcdkG~is~ 75 (132)
T PHA03001 48 HVDKSTFYLVVKDKD---------IFYFKCDKGSISQ 75 (132)
T ss_pred EecccEEEEEEecCc---------EEEEEecCCeEee
Confidence 567888999999974 9999997654433
No 35
>cd00185 TNFR Tumor necrosis factor receptor (TNFR) domain; superfamily of TNF-like receptor domains. When bound to TNF-like cytokines, TNFRs trigger multiple signal transduction pathways, they are involved in inflammation response, apoptosis, autoimmunity and organogenesis. TNFRs domains are elongated with generally three tandem repeats of cysteine-rich domains (CRDs). They fit in the grooves between protomers within the ligand trimer. Some TNFRs, such as NGFR and HveA, bind ligands with no structural similarity to TNF and do not bind ligand trimers.
Probab=28.99 E-value=31 Score=25.62 Aligned_cols=29 Identities=24% Similarity=0.532 Sum_probs=17.0
Q ss_pred ecCCccccccc-ccccCCcccCCccccccc
Q psy17098 112 CKPGFEADVEK-QTCNGTCHACPETTINFA 140 (204)
Q Consensus 112 C~~Gye~~~~~-~~C~~~C~~Cp~g~~k~A 140 (204)
|.+|+.....- ..-+..|+.||+|+|...
T Consensus 17 C~pG~~~~~~C~~~~~t~C~~C~~g~ys~~ 46 (98)
T cd00185 17 CPPGTYLSKDCTPGSDTVCEPCPPGTYTDS 46 (98)
T ss_pred CCCCccCCCcCCCCCCCeecCCCCCCcccC
Confidence 78888853210 000234688888888665
No 36
>PHA02831 EEV host range protein; Provisional
Probab=28.28 E-value=1.9e+02 Score=25.89 Aligned_cols=33 Identities=18% Similarity=0.407 Sum_probs=21.6
Q ss_pred eecccccCccccCCCeEEecccCeEEccCCCee
Q psy17098 157 ATGRCVENAEEVGTPTYLCKGDGKWYLPSGGCK 189 (204)
Q Consensus 157 v~G~CV~na~~~~~P~~~C~~~G~W~vp~g~C~ 189 (204)
|.=+|-+.=.-.+.....|..+|.|.-+.=.|.
T Consensus 164 Vty~C~~GY~L~Gss~~tC~~nG~Wsp~~P~C~ 196 (268)
T PHA02831 164 VNFKCKKGFILLGSSVSTCDINSIWYPGIPKCV 196 (268)
T ss_pred EEEEcCCCCEECCCccEEECCCCeECCCCCCcc
Confidence 345565443334556889999999986555553
No 37
>PF12947 EGF_3: EGF domain; InterPro: IPR024731 This entry represents an EGF domain found in the the C terminus of malarial parasite merozoite surface protein 1 [], as well as other proteins.; PDB: 2NPR_A 1N1I_C 1B9W_A 1YO8_A 2RHP_A.
Probab=27.72 E-value=30 Score=21.45 Aligned_cols=18 Identities=44% Similarity=0.990 Sum_probs=11.8
Q ss_pred cCCCeecCCCceeccccCcc
Q psy17098 184 PSGGCKCKPGFEADVEKQTC 203 (204)
Q Consensus 184 p~g~C~C~~Gye~~~~~~~C 203 (204)
+.-.|.|++||+- ++..|
T Consensus 19 ~~~~C~C~~Gy~G--dG~~C 36 (36)
T PF12947_consen 19 GSYTCTCKPGYEG--DGFFC 36 (36)
T ss_dssp TSEEEEE-CEEEC--CSTCE
T ss_pred CCEEeECCCCCcc--CCcCC
Confidence 3578999999994 44443
No 38
>PF06138 Chordopox_E11: Chordopoxvirus E11 protein; InterPro: IPR009201 This group represents a virion core protein, vaccinia E11L type.
Probab=27.67 E-value=55 Score=26.40 Aligned_cols=28 Identities=21% Similarity=0.618 Sum_probs=21.9
Q ss_pred ecCCCceEEEeccCCceEeEEEEEEEEecChhhhhcc
Q psy17098 25 PVTKKGVYFAFRDQGACISLLAIKVYYITCPETTINF 61 (204)
Q Consensus 25 pls~~GfYlAfqD~GaCval~sVrVyy~~CP~~~~~l 61 (204)
.|.+.=|||+++|.- |||++|---...+
T Consensus 49 ~veeStFylvvrd~d---------iFyfk~dkG~vs~ 76 (130)
T PF06138_consen 49 DVEESTFYLVVRDKD---------IFYFKCDKGRVSP 76 (130)
T ss_pred EecccEEEEEEecCc---------EEEEEecCCeEee
Confidence 667888999999953 9999997655443
No 39
>PF11767 SET_assoc: Histone lysine methyltransferase SET associated; InterPro: IPR024636 The SET domain is a protein-protein interaction domain found in protein lysine methyltransferase enzymes. This entry represents a domain of unknown function which is associated with the SET domain and found in histone lysine methyltransferases [].
Probab=27.04 E-value=40 Score=23.90 Aligned_cols=14 Identities=29% Similarity=0.634 Sum_probs=11.8
Q ss_pred cCCCceEEEeccCC
Q psy17098 26 VTKKGVYFAFRDQG 39 (204)
Q Consensus 26 ls~~GfYlAfqD~G 39 (204)
.-+.|||+.|.|..
T Consensus 31 ~d~tGfYIvF~~~~ 44 (66)
T PF11767_consen 31 DDRTGFYIVFNDSK 44 (66)
T ss_pred ecCCEEEEEECChH
Confidence 46899999999864
No 40
>PHA02927 secreted complement-binding protein; Provisional
Probab=26.11 E-value=1.8e+02 Score=25.44 Aligned_cols=32 Identities=16% Similarity=0.357 Sum_probs=21.9
Q ss_pred eecccccCccccCCCeEEecccCeEEccCCCe
Q psy17098 157 ATGRCVENAEEVGTPTYLCKGDGKWYLPSGGC 188 (204)
Q Consensus 157 v~G~CV~na~~~~~P~~~C~~~G~W~vp~g~C 188 (204)
|.=+|-+.=.-.+.+.+.|..+|.|.-+.=.|
T Consensus 230 v~y~C~~Gy~l~G~~~~~C~~~g~Ws~~~P~C 261 (263)
T PHA02927 230 VDFKCKYGYKLSGSSSSTCSPGNTWQPELPKC 261 (263)
T ss_pred EEEECCCCCeEcCCCCeEECCCCEECCCCCee
Confidence 44566554444466789999999998655555
No 41
>PF13121 DUF3976: Domain of unknown function (DUF3976)
Probab=25.10 E-value=15 Score=23.50 Aligned_cols=8 Identities=50% Similarity=1.032 Sum_probs=6.9
Q ss_pred ecCCCceE
Q psy17098 25 PVTKKGVY 32 (204)
Q Consensus 25 pls~~GfY 32 (204)
-|+|||||
T Consensus 14 tltkrgfy 21 (41)
T PF13121_consen 14 TLTKRGFY 21 (41)
T ss_pred eeehhhHH
Confidence 57999998
No 42
>PHA02639 EEV host range protein; Provisional
Probab=24.88 E-value=3.5e+02 Score=24.38 Aligned_cols=90 Identities=20% Similarity=0.304 Sum_probs=48.0
Q ss_pred EEeEEccCceecCCCceEEecc---ceEEeeccceeecCCcccccccccccCCcccCCc--cccccccCCCCCCCCceee
Q psy17098 79 ATGRCVENAEEVGTPTYLCKGD---GKWYLPSGGCKCKPGFEADVEKQTCNGTCHACPE--TTINFAHFPTTPTGREVTF 153 (204)
Q Consensus 79 v~G~CV~~A~~~~~P~~~C~~~---G~W~~p~g~C~C~~Gye~~~~~~~C~~~C~~Cp~--g~~k~A~Fp~T~~g~~~~s 153 (204)
|.=+|.++=.......++|..| |.|..+.-.|. . ..||. ...+-...+.. . ...
T Consensus 46 V~Y~Cn~GY~L~G~~~~~C~~d~~nG~WS~~~P~C~------~-----------~~C~~Pp~~~nG~i~~~~---~-~y~ 104 (295)
T PHA02639 46 IEYTCNTDYALIGDRFRTCIKDKNNAIWSNKAPFCM------L-----------KECNDPPSIINGKIYNKR---E-MYK 104 (295)
T ss_pred EEEEeCCCCEECCCCeEEEeCCCCCCEECCCCCEEe------e-----------ccCcCCCCCCCcEEecCC---C-ceE
Confidence 5568887766666678999975 68985544443 0 22321 11111111110 0 000
Q ss_pred -EEeeecccccC---cc-ccCCCeEEecccCeEEccCCCee
Q psy17098 154 -IEQATGRCVEN---AE-EVGTPTYLCKGDGKWYLPSGGCK 189 (204)
Q Consensus 154 -lv~v~G~CV~n---a~-~~~~P~~~C~~~G~W~vp~g~C~ 189 (204)
-..+.=+|.++ -. -.+..++.|..+|+|.-+.-.|.
T Consensus 105 ~G~~V~y~C~~g~~~gY~L~G~~~~~C~~dG~WS~~~P~C~ 145 (295)
T PHA02639 105 VGDEIYYVCNEHKGVQYSLVGNEKITCIQDKSWKPDPPICK 145 (295)
T ss_pred CCCEEEEEeCCCCCCceEECCCCeEEECCCCeECCCCCeee
Confidence 01234567664 11 22456789999999996555563
No 43
>KOG4564|consensus
Probab=24.13 E-value=81 Score=30.58 Aligned_cols=49 Identities=27% Similarity=0.457 Sum_probs=37.5
Q ss_pred ccCCccccccccCCCCCCCCceeeEEeeecccccCcc-----ccCCCeEEecccCeEEcc
Q psy17098 130 HACPETTINFAHFPTTPTGREVTFIEQATGRCVENAE-----EVGTPTYLCKGDGKWYLP 184 (204)
Q Consensus 130 ~~Cp~g~~k~A~Fp~T~~g~~~~slv~v~G~CV~na~-----~~~~P~~~C~~~G~W~vp 184 (204)
..|+..+.+..=.|.|.+|. .|.-.|..-=. ..+....+|.+||.|.-+
T Consensus 62 ~~Cn~twD~~lCWp~t~~G~------~v~~~CP~yf~~~~~~~~~~v~R~C~~~G~W~~~ 115 (473)
T KOG4564|consen 62 LGCNGTWDGILCWPRTPAGT------LVTVPCPDYFPGFSYSTSGNVTRNCTSNGTWSER 115 (473)
T ss_pred CCCCCcCCccccCCCCCCCc------eEEecCccccCCCcccCCCceEeecCCCCccCCC
Confidence 57999999999999999985 24567774332 345678999999999954
No 44
>smart00179 EGF_CA Calcium-binding EGF-like domain.
Probab=23.90 E-value=93 Score=17.94 Aligned_cols=10 Identities=40% Similarity=1.258 Sum_probs=8.3
Q ss_pred CCeecCCCce
Q psy17098 186 GGCKCKPGFE 195 (204)
Q Consensus 186 g~C~C~~Gye 195 (204)
-.|.|.+||.
T Consensus 24 ~~C~C~~g~~ 33 (39)
T smart00179 24 YRCECPPGYT 33 (39)
T ss_pred eEeECCCCCc
Confidence 4589999997
No 45
>PF04260 DUF436: Protein of unknown function (DUF436) ; InterPro: IPR006340 Members of this family are uncharacterised proteins of about 180 amino acids from the Bacillus/Clostridium group of Gram-positive bacteria, found in no more than one copy per genome. ; PDB: 1V8D_C.
Probab=23.56 E-value=34 Score=28.81 Aligned_cols=22 Identities=23% Similarity=0.662 Sum_probs=13.8
Q ss_pred cCCCceEEEeccCCceEeEEEEEEEEecChhhhhcccc
Q psy17098 26 VTKKGVYFAFRDQGACISLLAIKVYYITCPETTINFAH 63 (204)
Q Consensus 26 ls~~GfYlAfqD~GaCval~sVrVyy~~CP~~~~~la~ 63 (204)
+.++|+||||| -|-..+|.|..
T Consensus 56 ~~~~gi~LA~Q----------------cCEHlNRALvv 77 (172)
T PF04260_consen 56 LKERGIYLAFQ----------------CCEHLNRALVV 77 (172)
T ss_dssp HHTTT-EEEEE------------------GGGTT-EEE
T ss_pred HHHcCcEEEEE----------------chhhhhHHHHh
Confidence 47899999998 36666666654
No 46
>cd00034 ChSh Chromo Shadow Domain, found in association with N-terminal chromo (CHRromatin Organization MOdifier) domain; Chromo domains mediate the interaction of the heterochromatin with other heterochromatin proteins, thereby affecting chromatin structure (e.g. Drosophila and human heterochromatin protein (HP1) and mammalian modifier 1 and modifier 2)
Probab=23.48 E-value=27 Score=23.59 Aligned_cols=29 Identities=28% Similarity=0.474 Sum_probs=25.0
Q ss_pred eEEEeccCCceEeEEEEEEEEecChhhhhcc
Q psy17098 31 VYFAFRDQGACISLLAIKVYYITCPETTINF 61 (204)
Q Consensus 31 fYlAfqD~GaCval~sVrVyy~~CP~~~~~l 61 (204)
||+=|+| .=..+++.++-|.+||..+..|
T Consensus 18 fl~kwk~--~~~~lVp~~~~~~k~P~~vI~F 46 (54)
T cd00034 18 FLAKWKD--GQASLVPNKELNVKCPLLVISF 46 (54)
T ss_pred EEEEEeC--CeEEEEEHHHHHhhCcHHHHHH
Confidence 8888888 6678999999999999988754
No 47
>KOG0994|consensus
Probab=23.36 E-value=80 Score=34.33 Aligned_cols=39 Identities=28% Similarity=0.673 Sum_probs=26.9
Q ss_pred eEEeccceEE----eeccceeecCCcccccccccccCCcccCCcccccccc
Q psy17098 95 YLCKGDGKWY----LPSGGCKCKPGFEADVEKQTCNGTCHACPETTINFAH 141 (204)
Q Consensus 95 ~~C~~~G~W~----~p~g~C~C~~Gye~~~~~~~C~~~C~~Cp~g~~k~A~ 141 (204)
..|++.|+-. +.-|.|.|+|+-- +.+| --|.+|+|.|-+
T Consensus 765 CnCnptGSlS~vCn~~GGqCqCkPnVV----GR~C----dqCApGtyGFGP 807 (1758)
T KOG0994|consen 765 CNCNPTGSLSSVCNPNGGQCQCKPNVV----GRRC----DQCAPGTYGFGP 807 (1758)
T ss_pred cccCCCccccccccCCCceecccCccc----cccc----cccCCcccCcCC
Confidence 4466666543 2348999997764 3445 999999999863
No 48
>COG4475 Uncharacterized protein conserved in bacteria [Function unknown]
Probab=21.69 E-value=45 Score=28.01 Aligned_cols=28 Identities=21% Similarity=0.330 Sum_probs=20.3
Q ss_pred ecCCCceEEEeccCCceEeEEEEEEEEecChhhhhccccCCCCC
Q psy17098 25 PVTKKGVYFAFRDQGACISLLAIKVYYITCPETTINFAHFPTTP 68 (204)
Q Consensus 25 pls~~GfYlAfqD~GaCval~sVrVyy~~CP~~~~~la~Fp~T~ 68 (204)
-+.++|+|||||. |-..++.|..=-++.
T Consensus 60 ~~~~tGv~lA~Q~----------------CEHlNRAlvvEr~~a 87 (180)
T COG4475 60 VLKKTGVHLAFQG----------------CEHLNRALVVEREVA 87 (180)
T ss_pred HHhccceEEEeeh----------------hhhhhhHhhhhHhhh
Confidence 3578999999984 777777776544443
No 49
>cd00054 EGF_CA Calcium-binding EGF-like domain, present in a large number of membrane-bound and extracellular (mostly animal) proteins. Many of these proteins require calcium for their biological function and calcium-binding sites have been found to be located at the N-terminus of particular EGF-like domains; calcium-binding may be crucial for numerous protein-protein interactions. Six conserved core cysteines form three disulfide bridges as in non calcium-binding EGF domains, whose structures are very similar. EGF_CA can be found in tandem repeat arrangements.
Probab=21.41 E-value=1.1e+02 Score=17.19 Aligned_cols=10 Identities=40% Similarity=1.222 Sum_probs=8.4
Q ss_pred CCeecCCCce
Q psy17098 186 GGCKCKPGFE 195 (204)
Q Consensus 186 g~C~C~~Gye 195 (204)
..|.|.+||.
T Consensus 24 ~~C~C~~g~~ 33 (38)
T cd00054 24 YRCSCPPGYT 33 (38)
T ss_pred eEeECCCCCc
Confidence 4699999997
No 50
>PF07354 Sp38: Zona-pellucida-binding protein (Sp38); InterPro: IPR010857 This family contains a number of zona-pellucida-binding proteins that seem to be restricted to mammals. These are sperm proteins that bind to the 90 kDa family of zona pellucida glycoproteins in a calcium-dependent manner []. These represent some of the specific molecules that mediate the first steps of gamete interaction, allowing fertilisation to occur [].; GO: 0007339 binding of sperm to zona pellucida, 0005576 extracellular region
Probab=21.39 E-value=50 Score=29.74 Aligned_cols=31 Identities=26% Similarity=0.720 Sum_probs=23.0
Q ss_pred eecCCcccc-cccccccCCcccCCcccccccc
Q psy17098 111 KCKPGFEAD-VEKQTCNGTCHACPETTINFAH 141 (204)
Q Consensus 111 ~C~~Gye~~-~~~~~C~~~C~~Cp~g~~k~A~ 141 (204)
.|.|||--| .....|.+-|-.|++|+|+...
T Consensus 221 sC~PGfG~N~~~h~~C~~CCVvCsPgTysp~~ 252 (271)
T PF07354_consen 221 SCRPGFGKNDILHSDCPSCCVVCSPGTYSPDD 252 (271)
T ss_pred ccCCCCCcCcccCCCCCCeeEECCCcccCCCC
Confidence 578999854 2234576779999999998763
No 51
>PF00954 S_locus_glycop: S-locus glycoprotein family; InterPro: IPR000858 In Brassicaceae, self-incompatible plants have a self/non-self recognition system, which involves the inability of flowering plants to achieve self-fertilisation. This is sporophytically controlled by multiple alleles at a single locus (S). There are a total of 50 different S alleles in Brassica oleracea. S-locus glycoproteins, as well as S-receptor kinases, are in linkage with the S-alleles []. Most of the proteins within this family contain apple-like domain (IPR003609 from INTERPRO), which is predicted to possess protein- and/or carbohydrate-binding functions.; GO: 0048544 recognition of pollen
Probab=20.80 E-value=45 Score=24.93 Aligned_cols=10 Identities=60% Similarity=1.551 Sum_probs=9.0
Q ss_pred eecCCCceec
Q psy17098 188 CKCKPGFEAD 197 (204)
Q Consensus 188 C~C~~Gye~~ 197 (204)
|.|.+||+|+
T Consensus 100 C~Cl~GF~P~ 109 (110)
T PF00954_consen 100 CSCLPGFEPK 109 (110)
T ss_pred eECCCCcCCC
Confidence 9999999983
No 52
>TIGR01440 conserved hypothetical protein TIGR01440. Members of this family are uncharacterized proteins of about 180 amino acids from the Bacillus/Clostridium group of Gram-positive bacteria, found in no more than one copy per genome.
Probab=20.12 E-value=50 Score=27.83 Aligned_cols=26 Identities=23% Similarity=0.401 Sum_probs=18.8
Q ss_pred cCCCceEEEeccCCceEeEEEEEEEEecChhhhhccccCCCC
Q psy17098 26 VTKKGVYFAFRDQGACISLLAIKVYYITCPETTINFAHFPTT 67 (204)
Q Consensus 26 ls~~GfYlAfqD~GaCval~sVrVyy~~CP~~~~~la~Fp~T 67 (204)
+.++|+||||| -|-..+|.|..=-++
T Consensus 56 ~~~~gi~lA~Q----------------~CEHlNRALvvEr~~ 81 (172)
T TIGR01440 56 LKKTGVTLAFQ----------------GCEHINRALVMERSV 81 (172)
T ss_pred hhhcCeEEEEe----------------chhhhhHHHHHhHHH
Confidence 35899999998 477777777654433
Done!